Query lcl|NC_019725.1_cdsid_YP_007112702.1 [gene=B508_00185] [protein=hypothetical protein] [protein_id=YP_007112702.1] [location=16118..16831] Match_columns 237 No_of_seqs 107 out of 182 Neff 6.9 Searched_HMMs 1612 Date Thu Nov 7 16:09:42 2013 Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_35 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_35_vs_rec_db.hhr No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 protein:vir:107662 Length: 427 100.0 1E-76 6.4E-80 437.2 26.3 237 1-237 191-427 (427) 2 protein:vir:106716 Length: 698 100.0 4.1E-75 2.5E-78 428.4 22.1 232 1-237 296-548 (698) 3 protein:vir:78589 Length: 695 100.0 1.6E-74 9.6E-78 425.2 22.0 234 1-237 296-545 (695) 4 protein:vir:3648 Length: 695 # 100.0 1.7E-74 1.1E-77 425.0 21.9 234 1-237 296-545 (695) 5 protein:vir:101541 Length: 694 100.0 1.9E-74 1.2E-77 424.7 21.9 234 1-237 295-544 (694) 6 protein:vir:104338 Length: 422 100.0 2.8E-71 1.7E-74 407.4 25.6 233 1-235 190-422 (422) 7 protein:vir:79647 Length: 435 100.0 5.8E-71 3.6E-74 405.6 26.4 233 1-237 202-434 (435) 8 protein:vir:96068 Length: 765 100.0 1.1E-68 6.6E-72 393.2 25.3 228 1-237 279-521 (765) 9 protein:vir:99563 Length: 862 100.0 3.6E-67 2.3E-70 384.8 25.3 226 1-237 307-536 (862) 10 protein:vir:103219 Length: 201 100.0 1.5E-66 9.4E-70 381.4 21.2 201 30-235 1-201 (201) 11 protein:vir:5249 Length: 437 # 100.0 8.9E-66 5.5E-69 377.2 25.2 228 1-237 192-425 (437) 12 protein:vir:94049 Length: 532 100.0 7.9E-66 4.9E-69 377.5 24.2 233 1-237 256-512 (532) 13 protein:vir:107742 Length: 537 100.0 3.3E-65 2.1E-68 374.1 25.2 230 1-237 273-524 (537) 14 protein:vir:80040 Length: 461 100.0 6.9E-61 4.3E-64 350.4 24.4 227 1-235 221-461 (461) 15 protein:vir:105782 Length: 449 100.0 2.2E-55 1.4E-58 320.2 22.0 219 1-235 213-449 (449) 16 protein:vir:100882 Length: 383 99.3 6E-13 3.7E-16 87.6 18.1 207 1-233 169-383 (383) 17 protein:vir:102118 Length: 409 99.2 3.8E-12 2.3E-15 83.2 18.0 211 1-232 183-409 (409) 18 protein:vir:100187 Length: 385 99.2 4.3E-12 2.6E-15 82.9 17.8 209 1-235 169-385 (385) 19 protein:vir:79772 Length: 648 99.1 4.8E-11 3E-14 77.2 18.3 219 1-237 246-498 (648) 20 protein:vir:95378 Length: 406 99.1 2.9E-11 1.8E-14 78.3 17.1 210 1-237 178-405 (406) 21 protein:vir:102727 Length: 945 99.0 1.1E-10 6.7E-14 75.2 19.1 219 1-237 275-533 (945) 22 protein:vir:8100 Length: 466 # 99.0 8.1E-11 5E-14 75.9 18.0 222 1-237 223-466 (466) 23 protein:vir:1431 Length: 419 # 99.0 8.5E-11 5.3E-14 75.8 18.0 220 1-237 177-414 (419) 24 protein:vir:8418 Length: 409 # 99.0 1.8E-10 1.1E-13 74.0 18.6 215 1-237 181-408 (409) 25 protein:vir:1380 Length: 422 # 99.0 1E-10 6.3E-14 75.4 17.1 214 1-236 193-422 (422) 26 protein:vir:6210 Length: 394 # 99.0 2.1E-10 1.3E-13 73.6 18.2 212 1-237 163-394 (394) 27 protein:vir:483 Length: 413 # 99.0 1.9E-10 1.2E-13 73.9 17.8 216 1-237 180-411 (413) 28 protein:vir:5737 Length: 419 # 99.0 2.2E-10 1.3E-13 73.6 17.7 220 1-237 178-415 (419) 29 protein:vir:105002 Length: 432 99.0 1.5E-10 9.1E-14 74.5 16.6 213 1-237 194-431 (432) 30 protein:vir:107605 Length: 432 99.0 1.5E-10 9.1E-14 74.5 16.6 213 1-237 194-431 (432) 31 protein:vir:102855 Length: 432 99.0 1.5E-10 9.1E-14 74.5 16.6 213 1-237 194-431 (432) 32 protein:vir:3843 Length: 397 # 98.9 3.9E-10 2.4E-13 72.2 17.9 210 1-236 171-397 (397) 33 protein:vir:97060 Length: 432 98.9 7.6E-10 4.7E-13 70.6 19.4 212 1-237 195-428 (432) 34 protein:vir:81072 Length: 432 98.9 8.9E-10 5.5E-13 70.2 19.7 213 1-237 195-428 (432) 35 protein:vir:9359 Length: 348 # 98.9 3E-10 1.9E-13 72.8 17.1 213 1-237 120-348 (348) 36 protein:vir:6240 Length: 457 # 98.9 6.6E-10 4.1E-13 70.9 18.9 216 1-237 195-451 (457) 37 protein:vir:4598 Length: 416 # 98.9 5.1E-10 3.2E-13 71.5 17.7 211 1-234 181-416 (416) 38 protein:vir:81095 Length: 416 98.9 5.1E-10 3.2E-13 71.5 17.7 211 1-234 181-416 (416) 39 protein:vir:102080 Length: 429 98.9 4.2E-10 2.6E-13 72.0 16.7 216 1-237 191-428 (429) 40 protein:vir:80134 Length: 403 98.9 5.8E-10 3.6E-13 71.2 17.4 212 1-237 175-402 (403) 41 protein:vir:4337 Length: 434 # 98.9 7.1E-10 4.4E-13 70.8 17.1 215 1-236 194-434 (434) 42 protein:vir:81152 Length: 411 98.8 3.2E-10 2E-13 72.6 15.0 211 1-232 185-411 (411) 43 protein:vir:7853 Length: 518 # 98.8 1.5E-09 9.1E-13 69.0 18.6 216 1-237 189-434 (518) 44 protein:vir:4454 Length: 414 # 98.8 1.1E-09 6.7E-13 69.7 17.8 214 1-235 181-414 (414) 45 protein:vir:1266 Length: 416 # 98.8 1.8E-09 1.1E-12 68.5 18.9 212 1-236 183-416 (416) 46 protein:vir:80333 Length: 419 98.8 9.6E-10 6E-13 70.0 17.0 217 1-237 177-409 (419) 47 protein:vir:93610 Length: 454 98.8 1.8E-09 1.1E-12 68.5 18.3 216 1-237 193-441 (454) 48 protein:vir:100691 Length: 535 98.8 1.5E-09 9.4E-13 68.9 17.7 228 1-237 245-524 (535) 49 protein:vir:1884 Length: 424 # 98.8 1.3E-09 7.8E-13 69.4 17.1 207 1-237 196-421 (424) 50 protein:vir:105064 Length: 421 98.8 1.1E-09 6.7E-13 69.7 16.7 217 1-237 180-415 (421) 51 protein:vir:10362 Length: 432 98.8 4.5E-09 2.8E-12 66.4 19.4 213 1-237 195-428 (432) 52 protein:vir:101648 Length: 518 98.8 3.3E-09 2E-12 67.1 18.4 216 1-237 189-434 (518) 53 protein:vir:96980 Length: 409 98.8 2.4E-09 1.5E-12 67.8 17.7 213 1-237 181-409 (409) 54 protein:vir:2683 Length: 412 # 98.7 3E-09 1.9E-12 67.3 17.3 213 1-237 184-412 (412) 55 protein:vir:189 Length: 424 # 98.7 3.1E-09 1.9E-12 67.3 17.0 210 1-231 196-424 (424) 56 protein:vir:94869 Length: 378 98.7 5.5E-10 3.4E-13 71.3 12.9 199 1-236 152-378 (378) 57 protein:vir:1326 Length: 457 # 98.7 7.9E-09 4.9E-12 65.0 19.2 216 1-237 195-451 (457) 58 protein:vir:93943 Length: 409 98.7 3.5E-09 2.2E-12 67.0 17.0 210 1-237 181-409 (409) 59 protein:vir:94666 Length: 723 98.7 3.5E-09 2.2E-12 67.0 16.9 214 1-237 174-446 (723) 60 protein:vir:79984 Length: 441 98.7 3.9E-09 2.4E-12 66.7 17.0 211 1-234 206-441 (441) 61 protein:vir:9408 Length: 441 # 98.7 3.9E-09 2.4E-12 66.7 17.0 211 1-234 206-441 (441) 62 protein:vir:98396 Length: 441 98.7 6.5E-09 4E-12 65.5 18.3 211 1-234 206-441 (441) 63 protein:vir:4509 Length: 424 # 98.7 6.5E-09 4E-12 65.5 17.7 212 1-236 194-424 (424) 64 protein:vir:4952 Length: 386 # 98.7 1.1E-08 6.6E-12 64.3 18.7 205 1-237 176-386 (386) 65 protein:vir:960 Length: 413 # 98.7 5.8E-09 3.6E-12 65.7 17.0 207 1-232 194-413 (413) 66 protein:vir:7987 Length: 456 # 98.7 1.6E-09 1E-12 68.8 13.3 212 1-237 226-455 (456) 67 protein:vir:94002 Length: 378 98.6 1.8E-09 1.1E-12 68.5 13.0 200 1-236 152-378 (378) 68 protein:vir:80796 Length: 574 98.6 1.5E-08 9.1E-12 63.5 17.9 224 1-237 258-526 (574) 69 protein:vir:858 Length: 378 # 98.6 2.5E-09 1.5E-12 67.8 13.7 200 1-237 152-377 (378) 70 protein:vir:100650 Length: 395 98.6 1.1E-08 6.6E-12 64.3 16.9 207 1-237 161-395 (395) 71 protein:vir:9507 Length: 395 # 98.6 1.1E-08 6.6E-12 64.3 16.9 207 1-237 161-395 (395) 72 protein:vir:101289 Length: 395 98.6 1.1E-08 6.6E-12 64.3 16.9 207 1-237 161-395 (395) 73 protein:vir:99452 Length: 651 98.6 3.1E-08 1.9E-11 61.7 19.4 216 1-237 287-543 (651) 74 protein:vir:104259 Length: 403 98.6 2.8E-08 1.7E-11 62.0 18.8 213 1-237 176-402 (403) 75 protein:vir:1661 Length: 378 # 98.6 2.6E-09 1.6E-12 67.7 12.7 200 1-236 152-378 (378) 76 protein:vir:9641 Length: 395 # 98.6 2.9E-08 1.8E-11 61.9 18.0 212 1-237 163-395 (395) 77 protein:vir:3868 Length: 417 # 98.6 2.4E-08 1.5E-11 62.3 17.5 212 1-237 176-416 (417) 78 protein:vir:94426 Length: 409 98.6 2.5E-08 1.5E-11 62.3 17.5 213 1-237 181-409 (409) 79 protein:vir:100150 Length: 437 98.5 2.9E-08 1.8E-11 61.9 17.4 216 1-237 190-436 (437) 80 protein:vir:4089 Length: 395 # 98.5 2.9E-08 1.8E-11 61.9 17.3 210 1-237 162-393 (395) 81 protein:vir:93867 Length: 378 98.5 5.9E-09 3.6E-12 65.7 13.0 200 1-236 152-378 (378) 82 protein:vir:4854 Length: 386 # 98.5 6.2E-08 3.8E-11 60.1 18.5 204 1-236 176-386 (386) 83 protein:vir:105819 Length: 456 98.5 1.1E-08 6.7E-12 64.3 14.2 212 1-237 226-455 (456) 84 protein:vir:102602 Length: 456 98.5 1.1E-08 6.7E-12 64.3 14.2 212 1-237 226-455 (456) 85 protein:vir:3989 Length: 392 # 98.5 7.2E-08 4.5E-11 59.7 18.6 205 1-235 182-392 (392) 86 protein:vir:1023 Length: 392 # 98.5 7.2E-08 4.5E-11 59.7 18.6 205 1-235 182-392 (392) 87 protein:vir:9702 Length: 406 # 98.5 3.1E-08 1.9E-11 61.7 16.4 213 1-237 173-406 (406) 88 protein:vir:99072 Length: 479 98.5 9.4E-09 5.8E-12 64.6 13.4 217 1-237 224-473 (479) 89 protein:vir:4156 Length: 542 # 98.5 1.3E-07 8.2E-11 58.3 19.6 215 1-237 195-457 (542) 90 protein:vir:7407 Length: 392 # 98.5 1.1E-07 6.7E-11 58.8 18.7 205 1-235 182-392 (392) 91 protein:vir:3153 Length: 467 # 98.4 9.6E-08 6E-11 59.1 17.4 218 1-237 186-444 (467) 92 protein:vir:100249 Length: 431 98.4 9.5E-08 5.9E-11 59.1 17.1 209 1-231 203-431 (431) 93 protein:vir:95965 Length: 385 98.4 7.2E-08 4.5E-11 59.7 16.3 203 1-235 162-385 (385) 94 protein:vir:78083 Length: 537 98.4 2.5E-07 1.5E-10 56.8 19.1 221 1-237 267-528 (537) 95 protein:vir:98643 Length: 395 98.3 3.1E-07 1.9E-10 56.3 18.0 211 1-237 163-395 (395) 96 protein:vir:78310 Length: 376 98.3 2E-07 1.2E-10 57.4 16.3 202 1-232 157-376 (376) 97 protein:vir:95113 Length: 474 98.3 5E-07 3.1E-10 55.1 18.4 205 1-237 244-473 (474) 98 protein:vir:80644 Length: 551 98.3 2.2E-07 1.4E-10 57.1 16.3 226 1-237 254-518 (551) 99 protein:vir:94101 Length: 474 98.3 1.1E-07 6.5E-11 58.8 14.2 202 1-237 243-473 (474) 100 protein:vir:105889 Length: 474 98.3 1.1E-07 6.5E-11 58.8 14.2 202 1-237 243-473 (474) 101 protein:vir:102950 Length: 471 98.3 1.4E-07 8.6E-11 58.2 14.8 203 1-237 247-470 (471) 102 protein:vir:5961 Length: 503 # 98.3 7.3E-07 4.5E-10 54.2 18.8 215 1-237 256-499 (503) 103 protein:vir:105292 Length: 478 98.3 7.3E-07 4.6E-10 54.2 18.6 207 1-236 247-478 (478) 104 protein:vir:99312 Length: 563 98.3 4.4E-07 2.7E-10 55.4 17.2 224 1-237 260-527 (563) 105 protein:vir:95599 Length: 563 98.3 4.4E-07 2.7E-10 55.4 17.2 224 1-237 260-527 (563) 106 protein:vir:78537 Length: 480 98.2 1.6E-07 9.9E-11 57.9 14.7 220 1-237 224-468 (480) 107 protein:vir:8317 Length: 409 # 98.2 2.9E-07 1.8E-10 56.5 16.0 194 1-227 203-409 (409) 108 protein:vir:4898 Length: 502 # 98.2 4E-07 2.5E-10 55.6 16.8 227 1-237 251-499 (502) 109 protein:vir:4828 Length: 382 # 98.2 7.4E-07 4.6E-10 54.2 18.1 204 1-237 173-382 (382) 110 protein:vir:1236 Length: 483 # 98.2 9.4E-07 5.8E-10 53.6 18.6 217 1-236 252-483 (483) 111 protein:vir:63755 Length: 547 98.2 7.1E-07 4.4E-10 54.3 17.8 226 1-237 250-514 (547) 112 protein:vir:96266 Length: 474 98.2 4.3E-07 2.7E-10 55.5 16.6 208 1-236 244-474 (474) 113 protein:vir:95899 Length: 474 98.2 4.3E-07 2.7E-10 55.5 16.6 208 1-236 244-474 (474) 114 protein:vir:81218 Length: 423 98.2 7.2E-07 4.4E-10 54.3 17.8 211 1-235 190-423 (423) 115 protein:vir:99916 Length: 504 98.2 1.1E-06 6.6E-10 53.3 18.6 234 1-237 231-498 (504) 116 protein:vir:97447 Length: 474 98.2 5.6E-07 3.5E-10 54.9 17.1 207 1-237 244-473 (474) 117 protein:vir:94498 Length: 474 98.2 5.6E-07 3.5E-10 54.9 17.1 207 1-237 244-473 (474) 118 protein:vir:107112 Length: 478 98.2 1.2E-06 7.6E-10 53.0 18.5 215 1-236 247-478 (478) 119 protein:vir:1082 Length: 359 # 98.2 1.3E-07 8.3E-11 58.3 13.1 184 1-206 170-359 (359) 120 protein:vir:94805 Length: 492 98.2 1.7E-06 1E-09 52.3 18.6 217 1-237 261-491 (492) 121 protein:vir:78227 Length: 480 98.1 4.3E-07 2.7E-10 55.5 15.1 222 1-237 224-470 (480) 122 protein:vir:101647 Length: 460 98.1 1.1E-06 6.6E-10 53.3 17.2 213 1-235 229-460 (460) 123 protein:vir:96839 Length: 474 98.1 1.4E-06 8.4E-10 52.7 17.6 205 1-237 247-473 (474) 124 protein:vir:106639 Length: 481 98.1 7E-07 4.3E-10 54.3 15.7 209 1-237 241-480 (481) 125 protein:vir:96579 Length: 576 98.1 1.8E-06 1.1E-09 52.0 17.8 225 1-237 259-521 (576) 126 protein:vir:105461 Length: 470 98.1 1.6E-06 9.7E-10 52.4 17.3 202 1-237 242-470 (470) 127 protein:vir:96179 Length: 468 98.1 4.3E-07 2.7E-10 55.5 14.0 202 1-237 247-467 (468) 128 protein:vir:95806 Length: 440 98.1 1.5E-06 9.1E-10 52.6 16.7 216 1-236 203-440 (440) 129 protein:vir:96494 Length: 501 98.1 2.2E-06 1.3E-09 51.6 17.5 220 1-235 250-501 (501) 130 protein:vir:9751 Length: 422 # 98.1 5.3E-07 3.3E-10 55.0 14.1 199 1-227 203-422 (422) 131 protein:vir:97336 Length: 492 98.0 2.3E-06 1.4E-09 51.5 17.3 218 1-236 261-492 (492) 132 protein:vir:2732 Length: 501 # 98.0 2.1E-06 1.3E-09 51.7 17.1 220 1-237 250-501 (501) 133 protein:vir:106571 Length: 499 98.0 6E-06 3.7E-09 49.2 19.4 222 1-237 245-493 (499) 134 protein:vir:9306 Length: 511 # 98.0 7.9E-07 4.9E-10 54.0 14.1 233 1-237 257-511 (511) 135 protein:vir:94742 Length: 409 98.0 1.4E-06 8.4E-10 52.8 15.3 185 1-212 203-409 (409) 136 protein:vir:4194 Length: 540 # 98.0 7.6E-06 4.7E-09 48.7 19.1 220 1-237 193-455 (540) 137 protein:vir:93747 Length: 472 98.0 6E-06 3.7E-09 49.2 18.5 215 1-237 241-470 (472) 138 protein:vir:99522 Length: 470 97.9 3.2E-06 2E-09 50.7 16.5 215 1-237 233-469 (470) 139 protein:vir:103951 Length: 511 97.9 2E-06 1.3E-09 51.8 15.0 233 1-237 257-511 (511) 140 protein:vir:4995 Length: 384 # 97.9 3.6E-06 2.3E-09 50.4 16.2 205 1-235 176-384 (384) 141 protein:vir:96240 Length: 511 97.9 1.9E-06 1.2E-09 51.9 14.5 224 1-237 257-511 (511) 142 protein:vir:99781 Length: 511 97.9 2E-06 1.2E-09 51.9 14.6 226 1-236 257-511 (511) 143 protein:vir:3609 Length: 452 # 97.9 9E-06 5.6E-09 48.2 18.1 200 1-236 219-452 (452) 144 protein:vir:97171 Length: 512 97.9 4.7E-06 2.9E-09 49.8 16.3 233 1-237 257-512 (512) 145 protein:vir:104082 Length: 485 97.9 5.1E-06 3.2E-09 49.6 16.2 219 1-231 230-485 (485) 146 protein:vir:2500 Length: 501 # 97.8 4.3E-06 2.7E-09 50.0 15.6 218 1-236 258-501 (501) 147 protein:vir:9871 Length: 429 # 97.8 7.9E-06 4.9E-09 48.5 16.5 200 1-237 203-426 (429) 148 protein:vir:7768 Length: 484 # 97.8 6E-06 3.7E-09 49.2 15.6 225 1-237 229-481 (484) 149 protein:vir:2427 Length: 485 # 97.8 7.7E-06 4.8E-09 48.6 15.6 219 1-231 230-485 (485) 150 protein:vir:9568 Length: 410 # 97.7 7.6E-06 4.7E-09 48.6 15.5 199 1-231 190-410 (410) 151 protein:vir:9922 Length: 489 # 97.7 8.8E-06 5.4E-09 48.3 15.7 214 1-237 230-484 (489) 152 protein:vir:98444 Length: 434 97.7 1.9E-05 1.2E-08 46.4 16.8 213 1-237 195-430 (434) 153 protein:vir:4223 Length: 486 # 97.7 1.5E-05 9.4E-09 47.0 16.1 221 1-232 230-486 (486) 154 protein:vir:3964 Length: 453 # 97.7 2.3E-05 1.5E-08 46.0 17.1 203 1-236 219-453 (453) 155 protein:vir:102330 Length: 451 97.7 8.7E-06 5.4E-09 48.3 14.7 197 1-231 231-451 (451) 156 protein:vir:38 Length: 496 # N 97.7 4.8E-06 3E-09 49.8 13.2 213 1-236 256-496 (496) 157 protein:vir:1634 Length: 409 # 97.6 8.8E-06 5.5E-09 48.3 14.2 185 1-212 203-409 (409) 158 protein:vir:80959 Length: 499 97.6 6.3E-06 3.9E-09 49.1 13.3 212 1-233 259-499 (499) 159 protein:vir:79043 Length: 479 97.6 4.9E-06 3E-09 49.7 12.2 201 1-237 254-479 (479) 160 protein:vir:96366 Length: 511 97.5 1.6E-05 9.9E-09 46.9 14.7 227 1-233 257-511 (511) 161 protein:vir:78805 Length: 511 97.5 1.6E-05 9.9E-09 46.9 14.7 227 1-233 257-511 (511) 162 protein:vir:2341 Length: 488 # 97.5 1.6E-05 1E-08 46.9 14.5 219 1-237 235-485 (488) 163 protein:vir:107880 Length: 491 97.5 5.1E-05 3.2E-08 44.1 19.0 208 1-237 194-417 (491) 164 protein:vir:8184 Length: 474 # 97.5 2E-05 1.2E-08 46.4 14.2 212 1-231 226-474 (474) 165 protein:vir:79703 Length: 505 97.4 5.5E-06 3.4E-09 49.4 11.0 214 1-231 261-505 (505) 166 protein:vir:94546 Length: 506 97.4 7.6E-05 4.7E-08 43.2 18.0 214 1-237 236-504 (506) 167 protein:vir:78907 Length: 518 97.2 0.00015 9.1E-08 41.6 17.3 211 1-237 270-517 (518) 168 protein:vir:5839 Length: 533 # 97.0 2.3E-05 1.4E-08 46.0 10.1 214 1-237 229-511 (533) 169 protein:vir:80680 Length: 441 97.0 9E-05 5.6E-08 42.8 13.1 207 1-222 209-441 (441) 170 protein:vir:733 Length: 453 # 96.9 0.00023 1.4E-07 40.5 15.2 209 1-237 219-452 (453) 171 protein:vir:79233 Length: 526 96.8 0.00031 1.9E-07 39.8 19.4 222 1-237 205-443 (526) 172 protein:vir:1587 Length: 508 # 96.8 0.00013 8.3E-08 41.8 13.1 208 1-236 263-508 (508) 173 protein:vir:78641 Length: 278 96.8 0.00018 1.1E-07 41.1 13.8 149 1-165 120-278 (278) 174 protein:vir:99232 Length: 526 96.8 0.00034 2.1E-07 39.6 19.3 222 1-237 205-443 (526) 175 protein:vir:108215 Length: 469 96.8 0.00035 2.2E-07 39.5 19.5 210 1-237 220-452 (469) 176 protein:vir:4782 Length: 522 # 96.7 9.8E-05 6.1E-08 42.6 11.5 214 1-237 272-516 (522) 177 protein:vir:96738 Length: 505 96.7 0.00041 2.5E-07 39.2 14.9 214 1-232 254-505 (505) 178 protein:vir:79063 Length: 491 96.5 0.00052 3.3E-07 38.6 19.5 208 1-237 194-417 (491) 179 protein:vir:98883 Length: 517 96.5 0.00055 3.4E-07 38.5 14.4 210 1-233 274-517 (517) 180 protein:vir:79538 Length: 502 96.3 0.00078 4.8E-07 37.6 14.6 219 1-234 241-502 (502) 181 protein:vir:9815 Length: 500 # 96.3 0.00026 1.6E-07 40.2 11.4 213 1-237 259-496 (500) 182 protein:vir:3028 Length: 500 # 96.3 0.00026 1.6E-07 40.2 11.4 213 1-237 259-496 (500) 183 protein:vir:78161 Length: 355 96.0 0.0012 7.4E-07 36.6 18.9 224 1-237 75-336 (355) 184 protein:vir:103860 Length: 528 95.5 0.0019 1.2E-06 35.5 20.3 222 1-237 205-448 (528) 185 protein:vir:10321 Length: 495 95.2 0.0024 1.5E-06 34.9 13.1 221 1-236 233-495 (495) 186 protein:vir:95149 Length: 501 95.2 0.0023 1.4E-06 35.1 12.6 208 1-236 270-501 (501) 187 protein:vir:97265 Length: 513 95.1 0.0027 1.7E-06 34.7 14.1 207 1-237 259-496 (513) 188 protein:vir:79150 Length: 368 95.0 0.001 6.4E-07 37.0 10.0 166 1-178 190-368 (368) 189 protein:vir:3420 Length: 533 # 93.4 0.0077 4.8E-06 32.2 14.7 225 1-237 250-531 (533) 190 protein:vir:95542 Length: 548 93.1 0.0052 3.2E-06 33.1 10.1 221 1-237 246-536 (548) 191 protein:vir:389 Length: 530 # 91.8 0.014 8.8E-06 30.7 14.5 226 1-237 247-528 (530) 192 protein:vir:94956 Length: 452 91.6 0.015 9.4E-06 30.6 13.7 201 1-237 235-450 (452) 193 protein:vir:95254 Length: 488 91.4 0.016 9.8E-06 30.5 17.7 223 1-237 221-487 (488) 194 protein:vir:99853 Length: 488 90.9 0.018 1.1E-05 30.1 18.4 208 1-237 185-408 (488) 195 protein:vir:2013 Length: 344 # 90.8 0.019 1.2E-05 30.0 13.3 154 1-167 168-344 (344) 196 protein:vir:78191 Length: 351 89.3 0.027 1.7E-05 29.2 12.3 158 1-171 182-351 (351) 197 protein:vir:100328 Length: 346 89.2 0.028 1.7E-05 29.1 11.1 156 1-169 179-346 (346) 198 protein:vir:6058 Length: 344 # 88.1 0.034 2.1E-05 28.6 13.5 154 1-167 175-344 (344) 199 protein:vir:79207 Length: 351 86.4 0.046 2.8E-05 27.9 12.4 157 1-171 182-351 (351) 200 protein:vir:103971 Length: 376 85.7 0.051 3.2E-05 27.7 11.7 158 1-171 198-376 (376) 201 protein:vir:1986 Length: 512 # 85.6 0.052 3.2E-05 27.6 20.0 210 1-237 205-433 (512) 202 protein:vir:7208 Length: 524 # 82.5 0.076 4.7E-05 26.7 14.5 211 1-237 263-522 (524) 203 protein:vir:103177 Length: 533 82.5 0.077 4.7E-05 26.7 16.0 216 1-237 243-522 (533) 204 protein:vir:6382 Length: 553 # 82.2 0.079 4.9E-05 26.6 16.4 227 1-236 259-553 (553) 205 protein:vir:79511 Length: 448 80.5 0.094 5.8E-05 26.2 18.4 210 1-237 214-438 (448) 206 protein:vir:103458 Length: 524 79.7 0.1 6.3E-05 26.0 14.6 211 1-237 263-522 (524) 207 protein:vir:267 Length: 348 # 78.7 0.11 6.9E-05 25.8 12.1 158 1-173 164-348 (348) 208 protein:vir:98816 Length: 446 77.6 0.12 7.6E-05 25.6 17.7 206 1-219 217-446 (446) 209 protein:vir:96783 Length: 488 77.1 0.13 7.9E-05 25.5 13.4 201 1-229 273-488 (488) 210 protein:vir:4073 Length: 279 # 75.9 0.091 5.7E-05 26.3 7.2 158 1-209 120-279 (279) 211 protein:vir:3743 Length: 345 # 74.9 0.15 9.5E-05 25.1 13.5 151 1-164 168-345 (345) 212 protein:vir:98567 Length: 340 73.2 0.17 0.00011 24.8 11.9 154 1-168 165-340 (340) 213 protein:vir:80453 Length: 535 71.8 0.19 0.00012 24.5 14.2 212 1-237 290-531 (535) 214 protein:vir:108049 Length: 524 71.8 0.19 0.00012 24.5 14.8 211 1-237 263-522 (524) 215 protein:vir:104500 Length: 537 68.2 0.24 0.00015 24.0 17.3 215 1-237 244-524 (537) 216 protein:vir:101494 Length: 527 67.5 0.25 0.00016 23.9 17.0 212 1-237 270-523 (527) 217 protein:vir:102239 Length: 527 67.2 0.26 0.00016 23.8 17.0 212 1-237 270-523 (527) 218 protein:vir:100598 Length: 516 64.5 0.3 0.00019 23.5 15.7 212 1-237 256-515 (516) 219 protein:vir:101189 Length: 516 64.3 0.3 0.00019 23.4 15.8 211 1-237 256-515 (516) 220 protein:vir:101806 Length: 516 64.3 0.3 0.00019 23.4 15.8 211 1-237 256-515 (516) 221 protein:vir:5691 Length: 344 # 62.4 0.34 0.00021 23.2 12.1 156 1-169 175-344 (344) 222 protein:vir:78393 Length: 489 59.3 0.39 0.00024 22.8 14.6 208 1-237 261-483 (489) 223 protein:vir:6896 Length: 523 # 54.4 0.5 0.00031 22.2 15.2 210 1-237 263-521 (523) 224 protein:vir:3780 Length: 345 # 51.8 0.57 0.00035 21.9 13.2 154 1-167 177-345 (345) 225 protein:vir:78749 Length: 337 51.7 0.58 0.00036 21.9 11.9 153 1-165 168-337 (337) 226 protein:vir:81017 Length: 521 51.4 0.58 0.00036 21.9 15.5 211 1-237 260-519 (521) 227 protein:vir:1150 Length: 350 # 49.7 0.63 0.00039 21.7 12.1 153 1-168 176-350 (350) 228 protein:vir:98853 Length: 219 49.3 0.64 0.0004 21.6 12.0 155 1-169 47-219 (219) 229 protein:vir:6596 Length: 521 # 44.1 0.82 0.00051 21.1 15.7 211 1-237 260-519 (521) 230 protein:vir:95014 Length: 491 43.3 0.85 0.00053 21.0 13.0 205 1-237 261-487 (491) 231 protein:vir:104892 Length: 558 42.2 0.9 0.00056 20.8 16.2 214 1-237 255-538 (558) 232 protein:vir:77981 Length: 448 42.0 0.9 0.00056 20.8 16.9 205 1-237 214-432 (448) 233 protein:vir:106999 Length: 564 40.1 0.99 0.00061 20.6 16.0 216 1-237 257-564 (564) 234 protein:vir:5665 Length: 511 # 36.6 1.2 0.00072 20.2 13.9 203 1-229 252-511 (511) 235 protein:vir:98265 Length: 524 31.6 1.5 0.00092 19.6 14.9 211 1-237 264-522 (524) 236 protein:vir:106282 Length: 521 25.9 2 0.0012 18.9 13.9 209 1-237 259-519 (521) 237 protein:vir:80165 Length: 651 24.7 2.1 0.0013 18.8 13.2 218 1-237 341-605 (651) No 1 >protein:vir:107662 Length: 427 # NCBI annotation: putative portal protein # Family: family:all:297 # MgeID: mge:1518 # MgeName: T1 # Cross-refs: genbank:acc:YP_003893;genbank:gi:45686310;genbank:GeneID:2773002 Probab=100.00 E-value=1e-76 Score=437.18 Aligned_cols=237 Identities=99% Similarity=1.403 Sum_probs=233.0 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHhccceeechhHHHhhcCCchHHHHHHHHHHHHHhcCchheeeeecCCcceeeeec Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRRKQQAVWKVKGLAEMCDDDDAQYAARLRLAQVDDNSGVGRAIGIDAETEEYDVLNS 80 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~~~~~v~k~~~l~~~~~~~~~e~~~~~r~~~~~~~r~~~~~~~iD~~~e~~~~~~~ 80 (237) |+|++|++|++|+++++++++|++++++.|+|+++++++++.++.+..+++|+..+++++++++.+++|+++|+|+++++ T Consensus 191 l~~~~~~~i~~~~~~~~~~~~l~~k~~~~v~k~~~l~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~l~~~~e~~e~~~~ 270 (427) T protein:vir:10 191 LNKSLIDAICDYDYCESLATQILRRKQQAVWKVKGLAEMCDDDDAQYAARLRLAQVDDNSGVGRAIGIDAETEEYDVLNS 270 (427) T ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHhccccccchhHHHHhcCccchHHHHHHHHHHHHhcCcccceeeecCCCceeEEec Confidence 67899999999999999999999999999999999999999998888999999999999999999999999999999999 Q ss_pred CcCCHHHHHHHHHHHHhhhhcCceeeeeccCcccccccchhHHHHHHHHHHHHHHHhhhHHHHHHHHHhhcCCCceeEeC Q lcl|NC_019725. 81 DISGVPEFLSSKMDRIVSLSGIHEIIIKNKNVGGVSASQNTALETFYKLVDRKREEDYRPLLEFLLPFIVEEEEWSIEFE 160 (237) Q Consensus 81 ~lsGl~dl~~~~~~~iaa~s~iP~t~L~G~sp~GlnatGe~D~~nyyd~I~~~Qe~~l~p~l~~l~~~i~~s~~~~~~f~ 160 (237) +||||++++++++++|||+++||+|||||+||+||||||++|++|||++|+++||+.++|+|++|+++|+++++|+|+|| T Consensus 271 ~lsgl~~~~~~~~~~iaaa~~IP~t~L~G~sp~Glnstgd~D~~nyyd~i~~~Qe~~l~p~l~~l~~~i~~s~~~~~~f~ 350 (427) T protein:vir:10 271 DISGVPEFLSSKMDRIVSLSGIHEIIIKNKNVGGVSASQNTALETFYKLVDRKREEDYRPLLEFLLPFIVDEEEWSIEFE 350 (427) T ss_pred ccCChHHHHHHHHHHHHhhhCCCeeeeccCCccccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCCcEEEeC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CCCCCCHHHHHHHHHHHHHHHHHHHhCCCCCHHHHHHHHHhhccccccCCCCCCChhccccCCCCCCCCCCCcCcCC Q lcl|NC_019725. 161 PLSVPSKKEESEITKNNVESVTKAITEQIIDLEEARDTLRSIAPEFKLKDGNNINIREPEETTEPEPGLGEKLEDEN 237 (237) Q Consensus 161 pL~~~seke~Aei~~~~A~a~~~~~~~g~i~~~e~r~~l~~~~~~~g~~~~~~~~~~~~e~~~e~~~~~~~~~~~e~ 237 (237) |||+||+||+|++++++|+++++|+++|+++++++|++|++.++++|+.++.++++++++++++++|+.+|++++|| T Consensus 351 pL~~~s~kEkaei~~~~a~a~~~~~~~gvi~~~e~r~~L~~~~~~~~~~~~~~~~~e~~~~~~e~~p~~~e~~~d~~ 427 (427) T protein:vir:10 351 PLSVPSKKEESEITKNNVESVTKAITEQIIDLEEARDTLRSIAPEFKLKDGNNINIREPEETTEPEPGLGEKLEDEN 427 (427) T ss_pred CCCCCCHHHHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHhhhccccCCCCccccccccchhcCCCCCCCCCCCCCC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999 No 2 >protein:vir:106716 Length: 698 # NCBI annotation: gp18 # Family: family:all:297 # MgeID: mge:1599 # MgeName: Bcep1 # Cross-refs: genbank:acc:NP_944326;genbank:gi:38638625;genbank:GeneID:2657345 Probab=100.00 E-value=4.1e-75 Score=428.40 Aligned_cols=232 Identities=14% Similarity=0.163 Sum_probs=198.6 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHhccceeechhHHHhhcCCchHHHHHHHHHHHHHhcCchheeeeecCCcceeeeec Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRRKQQAVWKVKGLAEMCDDDDAQYAARLRLAQVDDNSGVGRAIGIDAETEEYDVLNS 80 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~~~~~v~k~~~l~~~~~~~~~e~~~~~r~~~~~~~r~~~~~~~iD~~~e~~~~~~~ 80 (237) ++|++|++|++|++++.++++|++++++.++|++ |++.++.+. ...+.+|+++++++|+|+|+++||+++|+|+++++ T Consensus 296 v~q~~~e~V~~~~rT~~~v~~Li~~~~~~~l~~d-la~aL~~g~-~~~l~~R~eli~~~Rsn~G~~llDk~~Eefeq~st 373 (698) T protein:vir:10 296 MTQLAMPYIDNWLRTRQSVSDIVKQFSVSGILMD-LAQALTPGA-NVDLSMRAELINRYRDNRNILFLDKATEEFFQFNT 373 (698) T ss_pred HHHHHHHHHHHHHHHhhhHHHHHHHhhHHHHHHH-HHHhcCChh-hHHHHHHHHHHHHhcCccceEEEecCCcceEEEec Confidence 9999999999999999999999999999999995 899887764 55699999999999999999999987799999999 Q ss_pred CcCCHHHHHHHHHHHHhhhhcCceeeeeccCcccccccchhHHHHHHHHHHHHHHHhhhHHHHHHHHHhhcC------CC Q lcl|NC_019725. 81 DISGVPEFLSSKMDRIVSLSGIHEIIIKNKNVGGVSASQNTALETFYKLVDRKREEDYRPLLEFLLPFIVEE------EE 154 (237) Q Consensus 81 ~lsGl~dl~~~~~~~iaa~s~iP~t~L~G~sp~GlnatGe~D~~nyyd~I~~~Qe~~l~p~l~~l~~~i~~s------~~ 154 (237) +||||++|+++|+++|||+++||+||||||||+|||||||+|++||||+|+++||+.|+|+|++|+++|++| ++ T Consensus 374 ~lSGLddVi~qf~q~VAgaa~IPltkLfGqSPkGlNATGE~D~rnYYD~I~s~Qe~~L~p~L~rl~~ii~rS~~G~idp~ 453 (698) T protein:vir:10 374 PLSGLDALQAQAQEQMSAVSHIPLIKLLGITPTGLNASSEGEIRVWYDYVRAYQRNALQQLMNDVIVMIQLSLFGAVDPS 453 (698) T ss_pred CcCCHHHHHHHHHHHHHhhhcCchhhhhccCCcccCccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCc Confidence 999999999999999999999999999999999999999999999999999999999999999999999987 68 Q ss_pred ceeEeCCCCCCCHHHHHHHHHHHHHHHHHHHhCCCCCHHHHHHHHHhhccccccCCCCCCChhccccCCCCC-------- Q lcl|NC_019725. 155 WSIEFEPLSVPSKKEESEITKNNVESVTKAITEQIIDLEEARDTLRSIAPEFKLKDGNNINIREPEETTEPE-------- 226 (237) Q Consensus 155 ~~~~f~pL~~~seke~Aei~~~~A~a~~~~~~~g~i~~~e~r~~l~~~~~~~g~~~~~~~~~~~~e~~~e~~-------- 226 (237) |+|+|||||+||++|+|||++++|+++++|++.|+|+++|+|++|+... .+++.+..+..++ +..++++ T Consensus 454 i~~~fnPL~qmtd~EkAeI~~k~A~~d~~~~~~gvI~~~evr~rL~~d~-~s~Y~~~~d~~d~--p~~~~~~~~~~~~~~ 530 (698) T protein:vir:10 454 IKWQWNALRELDDLEVAEARYKQAQSDVLYVQEQVIRPDQVAARLNTEP-DGPYAGKLDANDD--PGAPADDDIDGVLTY 530 (698) T ss_pred ceEEeCCCCCcCHHHHHHHHhhhhHHHHHHHHhcCCCHHHHHHHHhccC-CCccccccCCccc--CCCCCCCcchHHHhh Confidence 9999999999999999999999999999999999999999999998642 2333222222221 1111111 Q ss_pred -----CCCCCCc--CcCC Q lcl|NC_019725. 227 -----PGLGEKL--EDEN 237 (237) Q Consensus 227 -----~~~~~~~--~~e~ 237 (237) .+++.+. .+-. T Consensus 531 ~~~~~~~~~~~~~~~~~~ 548 (698) T protein:vir:10 531 VQRMAEGGDTGAPTAPGG 548 (698) T ss_pred hcCCcCCCCccccccccc Confidence 0111111 1000 No 3 >protein:vir:78589 Length: 695 # NCBI annotation: NUDIX hydrolase # Family: family:all:297 # MgeID: mge:1854 # MgeName: BcepNY3 # Cross-refs: genbank:acc:YP_001294854;genbank:gi:149882917;genbank:GeneID:5291060 Probab=100.00 E-value=1.6e-74 Score=425.23 Aligned_cols=234 Identities=14% Similarity=0.155 Sum_probs=196.9 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHhccceeechhHHHhhcCCchHHHHHHHHHHHHHhcCchheeeeecCCcceeeeec Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRRKQQAVWKVKGLAEMCDDDDAQYAARLRLAQVDDNSGVGRAIGIDAETEEYDVLNS 80 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~~~~~v~k~~~l~~~~~~~~~e~~~~~r~~~~~~~r~~~~~~~iD~~~e~~~~~~~ 80 (237) ++|++|++|++|++++.++++|+++++++++|++ +++.+..+. ...+.+|+++++++|||+|+++||+++|+|+++++ T Consensus 296 v~q~~~e~V~~~~rT~~~v~~Li~~~~v~~lk~d-la~~L~~g~-~~~l~~R~eli~~~Rsn~G~~llDk~~Eefeq~st 373 (695) T protein:vir:78 296 MTQLAMPYIDNWLRTRQSVSDIVKQFSVSGILMD-LAQALMPGA-NVDLSMRAELINRYRDNRNILFLDKATEEFFQFNT 373 (695) T ss_pred HHHHHHHHHHHHHHHHhHHHHHHHhhhhHHHHHH-HHHhhcChh-HHHHHHHHHHHHHhcCccceEEEecCCcceEEEec Confidence 9999999999999999999999999999999995 888887664 45689999999999999999999987799999999 Q ss_pred CcCCHHHHHHHHHHHHhhhhcCceeeeeccCcccccccchhHHHHHHHHHHHHHHHhhhHHHHHHHHHhhcC------CC Q lcl|NC_019725. 81 DISGVPEFLSSKMDRIVSLSGIHEIIIKNKNVGGVSASQNTALETFYKLVDRKREEDYRPLLEFLLPFIVEE------EE 154 (237) Q Consensus 81 ~lsGl~dl~~~~~~~iaa~s~iP~t~L~G~sp~GlnatGe~D~~nyyd~I~~~Qe~~l~p~l~~l~~~i~~s------~~ 154 (237) +||||+||+++|+++|||+++||+||||||||+|||||||+|++||||+|+++||+.|+|+|++|+++|++| ++ T Consensus 374 slSGLddVi~qf~q~VAgaa~IPltkLfGqSPkGlNATGE~D~rnYYD~I~s~Qe~~L~p~L~rl~~ii~rS~~G~idpd 453 (695) T protein:vir:78 374 PLSGLDALQAQAQEQMSAVSHIPLIKLLGITPTGLNASSEGEIRVWYDYVRAYQRNALQQLMNDVIVMIQLSLFGAVDPS 453 (695) T ss_pred ccCCHHHHHHHHHHHHHhhhcCchhhhhccCCccccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCc Confidence 999999999999999999999999999999999999999999999999999999999999999999999987 68 Q ss_pred ceeEeCCCCCCCHHHHHHHHHHHHHHHHHHHhCCCCCHHHHHHHHHhhccccccCCCCCCChh-------ccccC---CC Q lcl|NC_019725. 155 WSIEFEPLSVPSKKEESEITKNNVESVTKAITEQIIDLEEARDTLRSIAPEFKLKDGNNINIR-------EPEET---TE 224 (237) Q Consensus 155 ~~~~f~pL~~~seke~Aei~~~~A~a~~~~~~~g~i~~~e~r~~l~~~~~~~g~~~~~~~~~~-------~~e~~---~e 224 (237) |+|+|||||+||++|+|+|++++|+++++|++.|+|+++|+|++|+... ++++.+..+..++ +++-. -+ T Consensus 454 i~~~fnPL~qmtd~EkAeI~~k~A~~d~~~~~~gvI~~~evr~rL~~d~-~s~Y~~~~D~~d~p~~~~~~~~~~~~~~~~ 532 (695) T protein:vir:78 454 IKWQWNALRELDDLEVAESRYKQAQSDVLYVQEQVIRPDQVAARLNTEP-DGPYAGKLDANDDPGVPADDDIDGVLTYVQ 532 (695) T ss_pred ceEEeCCCCCcCHHHHHHHHhhhhHHHHHHHHhcCCCHHHHHHHHhcCC-CcccccccccccCCCcCccchhhhhHhhhc Confidence 9999999999999999999999999999999999999999999998632 2333211222111 11000 00 Q ss_pred CCCCCCCCcCcCC Q lcl|NC_019725. 225 PEPGLGEKLEDEN 237 (237) Q Consensus 225 ~~~~~~~~~~~e~ 237 (237) ..+..++.-+..+ T Consensus 533 ~~~~~~~~~~~~~ 545 (695) T protein:vir:78 533 RLAEGGDTGAPGG 545 (695) T ss_pred CcccccccCCCCC Confidence 0000000000000 No 4 >protein:vir:3648 Length: 695 # NCBI annotation: gp17 # Family: family:all:297 # MgeID: mge:75 # MgeName: Bcep781 # Cross-refs: genbank:acc:NP_705643;genbank:gi:23752328;genbank:GeneID:955749 Probab=100.00 E-value=1.7e-74 Score=425.02 Aligned_cols=234 Identities=14% Similarity=0.155 Sum_probs=197.0 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHhccceeechhHHHhhcCCchHHHHHHHHHHHHHhcCchheeeeecCCcceeeeec Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRRKQQAVWKVKGLAEMCDDDDAQYAARLRLAQVDDNSGVGRAIGIDAETEEYDVLNS 80 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~~~~~v~k~~~l~~~~~~~~~e~~~~~r~~~~~~~r~~~~~~~iD~~~e~~~~~~~ 80 (237) ++|++|++|++|++++.++++|++++++.++|++ +++.+..+. +..+.+|+++++++|||+|+++||+++|+|+++++ T Consensus 296 v~q~~~e~V~~~~rT~~~v~~Li~~~~v~~lk~d-la~aL~~g~-~~~l~~R~eli~~~Rsn~G~~llDk~~Eefeq~st 373 (695) T protein:vir:36 296 MTQLAMPYIDNWLRTRQSVSDIVKQFSVSGILMD-LAQALMPGA-NVDLSMRAELINRYRDNRNILFLDKATEEFFQFNT 373 (695) T ss_pred HHHHHHHHHHHHHHHHhHHHHHHHhhhHHHHHHH-HHHhhcChh-HHHHHHHHHHHHHhcCccceEEEecCCcceEEEec Confidence 9999999999999999999999999999999995 888877664 45689999999999999999999987799999999 Q ss_pred CcCCHHHHHHHHHHHHhhhhcCceeeeeccCcccccccchhHHHHHHHHHHHHHHHhhhHHHHHHHHHhhcC------CC Q lcl|NC_019725. 81 DISGVPEFLSSKMDRIVSLSGIHEIIIKNKNVGGVSASQNTALETFYKLVDRKREEDYRPLLEFLLPFIVEE------EE 154 (237) Q Consensus 81 ~lsGl~dl~~~~~~~iaa~s~iP~t~L~G~sp~GlnatGe~D~~nyyd~I~~~Qe~~l~p~l~~l~~~i~~s------~~ 154 (237) +||||+||+++|+++|||+++||+||||||||+|||||||+|++||||+|+++||+.|+|+|++|+++|++| ++ T Consensus 374 slSGLddVi~qf~q~VAgaa~IPltkLfGqSPkGlNATGE~D~rnYYD~I~s~Qe~~L~p~L~rl~~ii~rS~~G~idpd 453 (695) T protein:vir:36 374 PLSGLDALQAQAQEQMSAVSHIPLIKLLGITPTGLNASSEGEIRVWYDYVRAYQRNALQQLMNDVIVMIQLSLFGAVDPS 453 (695) T ss_pred ccCCHHHHHHHHHHHHHhhhcCchhhhhccCcccccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCc Confidence 999999999999999999999999999999999999999999999999999999999999999999999987 68 Q ss_pred ceeEeCCCCCCCHHHHHHHHHHHHHHHHHHHhCCCCCHHHHHHHHHhhccccccCCCCCCChh-------ccccC---CC Q lcl|NC_019725. 155 WSIEFEPLSVPSKKEESEITKNNVESVTKAITEQIIDLEEARDTLRSIAPEFKLKDGNNINIR-------EPEET---TE 224 (237) Q Consensus 155 ~~~~f~pL~~~seke~Aei~~~~A~a~~~~~~~g~i~~~e~r~~l~~~~~~~g~~~~~~~~~~-------~~e~~---~e 224 (237) |+|+|||||+||++|+|+|++++|+++++|++.|+|+++|+|++|+... ++++.+..+..++ +++-. -+ T Consensus 454 i~~~fnPL~qmtd~EkAeI~~k~A~~d~~~~~~gvI~~~evr~rL~~d~-~s~Y~~~~D~~d~p~~~~~~~~~~~~~~~~ 532 (695) T protein:vir:36 454 IKWQWNALRELDDLEVAESRYKQAQSDVLYVQEQVIRPDQVAARLNTEP-DGPYAGKLDANDDPGVPADDDIDGVLTYVQ 532 (695) T ss_pred ceEEeCCCCCcCHHHHHHHHhhhhHHHHHHHHhcCCCHHHHHHHHhcCC-CcccccccccccCCCcCccchhhhhHhhhc Confidence 9999999999999999999999999999999999999999999998632 2333211222111 11000 00 Q ss_pred CCCCCCCCcCcCC Q lcl|NC_019725. 225 PEPGLGEKLEDEN 237 (237) Q Consensus 225 ~~~~~~~~~~~e~ 237 (237) ..+..++.-+..+ T Consensus 533 ~~~~~~~~~~~~~ 545 (695) T protein:vir:36 533 RLAEGGDTGAPGG 545 (695) T ss_pred CcccccccCCCCc Confidence 0000000000001 No 5 >protein:vir:101541 Length: 694 # NCBI annotation: gp17 # Family: family:all:297 # MgeID: mge:1477 # MgeName: Bcep43 # Cross-refs: genbank:acc:NP_958122;genbank:gi:41057668;genbank:GeneID:2716798 Probab=100.00 E-value=1.9e-74 Score=424.72 Aligned_cols=234 Identities=14% Similarity=0.155 Sum_probs=197.0 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHhccceeechhHHHhhcCCchHHHHHHHHHHHHHhcCchheeeeecCCcceeeeec Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRRKQQAVWKVKGLAEMCDDDDAQYAARLRLAQVDDNSGVGRAIGIDAETEEYDVLNS 80 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~~~~~v~k~~~l~~~~~~~~~e~~~~~r~~~~~~~r~~~~~~~iD~~~e~~~~~~~ 80 (237) ++|++|++|++|++++.++++|+++++++++|++ +++.+..+. ...+.+|+++++++|+|+|+++||+++|+|+++++ T Consensus 295 v~q~~~e~V~~~~rT~~~v~~Li~~~~v~~lk~d-la~~L~~g~-~~~l~~R~eli~~~Rsn~G~~llDk~~Eefeq~st 372 (694) T protein:vir:10 295 MTQLAMPYIDNWLRTRQSVSDIVKQFSVSGILMD-LAQALMPGA-NVDLSMRAELINRYRDNRNILFLDKATEEFFQFNT 372 (694) T ss_pred HHHHHHHHHHHHHHHHhHHHHHHHhhhhHHHHHH-HHHhhcChh-HHHHHHHHHHHHHhcCccceEEEecCCcceEEEec Confidence 9999999999999999999999999999999995 888887664 45689999999999999999999987799999999 Q ss_pred CcCCHHHHHHHHHHHHhhhhcCceeeeeccCcccccccchhHHHHHHHHHHHHHHHhhhHHHHHHHHHhhcC------CC Q lcl|NC_019725. 81 DISGVPEFLSSKMDRIVSLSGIHEIIIKNKNVGGVSASQNTALETFYKLVDRKREEDYRPLLEFLLPFIVEE------EE 154 (237) Q Consensus 81 ~lsGl~dl~~~~~~~iaa~s~iP~t~L~G~sp~GlnatGe~D~~nyyd~I~~~Qe~~l~p~l~~l~~~i~~s------~~ 154 (237) +||||++|+++|+++|||+++||+||||||||+|||||||+|++||||+|+++||+.|+|+|++|+++|++| ++ T Consensus 373 slSGLddVi~qf~q~VAgaa~IPltkLfGqSPkGlNATGE~D~rnYYD~I~s~Qe~~L~p~L~rl~~ii~rS~~G~idp~ 452 (694) T protein:vir:10 373 PLSGLDALQAQAQEQMSAVSHIPLIKLLGITPTGLNASSEGEIRVWYDYVRAYQRNALQQLMNDVIVMIQLSLFGAVDPS 452 (694) T ss_pred ccCCHHHHHHHHHHHHHhhhcCchhhhhccCcccccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCc Confidence 999999999999999999999999999999999999999999999999999999999999999999999987 68 Q ss_pred ceeEeCCCCCCCHHHHHHHHHHHHHHHHHHHhCCCCCHHHHHHHHHhhccccccCCCCCCChh-------ccccC---CC Q lcl|NC_019725. 155 WSIEFEPLSVPSKKEESEITKNNVESVTKAITEQIIDLEEARDTLRSIAPEFKLKDGNNINIR-------EPEET---TE 224 (237) Q Consensus 155 ~~~~f~pL~~~seke~Aei~~~~A~a~~~~~~~g~i~~~e~r~~l~~~~~~~g~~~~~~~~~~-------~~e~~---~e 224 (237) |+|+|||||+||++|+|+|++++|+++++|++.|+|+++|+|++|+... ++++.+..+..++ +++-. -+ T Consensus 453 i~~~fnPL~qmtd~EkAeI~~k~A~~d~~~~~~gvI~~~evr~rL~~d~-~s~Y~~~~D~~d~p~~~~~~~~~~~~~~~~ 531 (694) T protein:vir:10 453 IKWQWNALRELDDLEVAESRYKQAQSDVLYVQEQVIRPDQVAARLNTEP-DGPYAGKLDANDDPGVPADDDIDGVLTYVQ 531 (694) T ss_pred ceEEeCCCCCcCHHHHHHHHhhhhHHHHHHHHhcCCCHHHHHHHHhcCC-CcccccccccccCCCcCccchhhhhHhhhc Confidence 9999999999999999999999999999999999999999999998632 2333211222111 11000 00 Q ss_pred CCCCCCCCcCcCC Q lcl|NC_019725. 225 PEPGLGEKLEDEN 237 (237) Q Consensus 225 ~~~~~~~~~~~e~ 237 (237) ..+..++.-+..+ T Consensus 532 ~~~~~~~~~~~~~ 544 (694) T protein:vir:10 532 RLAEGGDTGAPGG 544 (694) T ss_pred CcccccccCCCCc Confidence 0000000000000 No 6 >protein:vir:104338 Length: 422 # NCBI annotation: putative portal protein # Family: family:all:297 # MgeID: mge:1593 # MgeName: RTP # Cross-refs: genbank:acc:YP_398967;genbank:gi:81343951;genbank:GeneID:3778870 Probab=100.00 E-value=2.8e-71 Score=407.41 Aligned_cols=233 Identities=61% Similarity=0.933 Sum_probs=213.8 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHhccceeechhHHHhhcCCchHHHHHHHHHHHHHhcCchheeeeecCCcceeeeec Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRRKQQAVWKVKGLAEMCDDDDAQYAARLRLAQVDDNSGVGRAIGIDAETEEYDVLNS 80 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~~~~~v~k~~~l~~~~~~~~~e~~~~~r~~~~~~~r~~~~~~~iD~~~e~~~~~~~ 80 (237) |.++||++|++|+++.+++++|++++++.|+|++++++++++++.+..+++|+..+++++++++++++|+++|+|+++++ T Consensus 190 l~~~~~~~i~~~~~~~~~~~~l~~~~~~~v~~~~~l~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~l~~~~e~~e~~~~ 269 (422) T protein:vir:10 190 LSSDILDSIKDYTNCERLATQLLKRKQQAVWKAKGLAELCDDSEGFGAARLRLAQVDNNSGVGQAIGIDAESEEYSVLNS 269 (422) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhccccccchhHHHhcCCccchHHHHHHHHHHHHhcCCccceeEecCCcceEEEec Confidence 55679999999999999999999999999999999999999988888999999999999999999999998899999999 Q ss_pred CcCCHHHHHHHHHHHHhhhhcCceeeeeccCcccccccchhHHHHHHHHHHHHHHHhhhHHHHHHHHHhhcCCCceeEeC Q lcl|NC_019725. 81 DISGVPEFLSSKMDRIVSLSGIHEIIIKNKNVGGVSASQNTALETFYKLVDRKREEDYRPLLEFLLPFIVEEEEWSIEFE 160 (237) Q Consensus 81 ~lsGl~dl~~~~~~~iaa~s~iP~t~L~G~sp~GlnatGe~D~~nyyd~I~~~Qe~~l~p~l~~l~~~i~~s~~~~~~f~ 160 (237) +|||++++++++++.|||+++||+|||||+||+||||||++|++|||++|+++||+.++|+|++|+++|+++++|+|+|| T Consensus 270 ~lsgl~~~~~~~~~~iaaa~~IP~t~L~G~s~~Glnatgd~d~~~yyd~i~~~Qe~~l~p~l~~l~~~i~~s~~~~~~f~ 349 (422) T protein:vir:10 270 DIGGIDAFLDKKFDRIVALSGIHEIILKNKNVGGVSSSQNTALETFHKLVDRKRNAELLPILEFLIPFIVNAEEWSVEFN 349 (422) T ss_pred ccCChHHHHHHHHHHHHhhhCCCeeeeccCCcccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccCCcEEEeC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CCCCCCHHHHHHHHHHHHHHHHHHHhCCCCCHHHHHHHHHhhccccccCCCCCCChhccccCCCCCCCCCCCcCc Q lcl|NC_019725. 161 PLSVPSKKEESEITKNNVESVTKAITEQIIDLEEARDTLRSIAPEFKLKDGNNINIREPEETTEPEPGLGEKLED 235 (237) Q Consensus 161 pL~~~seke~Aei~~~~A~a~~~~~~~g~i~~~e~r~~l~~~~~~~g~~~~~~~~~~~~e~~~e~~~~~~~~~~~ 235 (237) |||+||+||+|++++++|+++++|+++|+++++|+|++|+..++..|+.+ ++.++++++....+++..++.++ T Consensus 350 pL~~~sekekaei~~~~a~a~~~~~~~g~i~~~e~r~~L~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~d 422 (422) T protein:vir:10 350 PLAQESSKDKAEILEKNVNSIAALIAAGAMDIDEARDTLRTIAPEVKIND--GSVETEVTISETSNDPLEVPTDD 422 (422) T ss_pred CCCCCCHHHHHHHHHHHHHHHHHHHhcCCCCHHHHHHHhhhhcccccCCC--CCCccccchhhcCCCCCCCCCCC Confidence 99999999999999999999999999999999999999998888888765 34455555444433333333333 No 7 >protein:vir:79647 Length: 435 # NCBI annotation: PorT # Family: family:all:297 # MgeID: mge:1872 # MgeName: TLS # Cross-refs: genbank:acc:YP_001285520;genbank:gi:148734503;genbank:GeneID:5220005 Probab=100.00 E-value=5.8e-71 Score=405.65 Aligned_cols=233 Identities=69% Similarity=1.079 Sum_probs=212.4 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHhccceeechhHHHhhcCCchHHHHHHHHHHHHHhcCchheeeeecCCcceeeeec Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRRKQQAVWKVKGLAEMCDDDDAQYAARLRLAQVDDNSGVGRAIGIDAETEEYDVLNS 80 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~~~~~v~k~~~l~~~~~~~~~e~~~~~r~~~~~~~r~~~~~~~iD~~~e~~~~~~~ 80 (237) |+|++|++|++|+++.+++++|++++++.|+|+++++++++.+.++..+++|++.++++|++++.+++|+++|+|+++++ T Consensus 202 l~e~~~~~l~~~~~~~~~~~~l~~~~~~~v~~~~~l~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~i~~~~e~~e~~~~ 281 (435) T protein:vir:79 202 LNKRLIEAIVDYNYCQELATQLLRRKQQAVWKARDLALMCDDEEGRYAARLRLAQVDDESGVGKAIGIDATDEEYEVLNS 281 (435) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhcCccccchhHHHhhcCccchHHHHHHHHHHHHhcCCCCceeEecCCcceEEEec Confidence 67999999999999999999999999999999999999999988888999999999999999999999998899999999 Q ss_pred CcCCHHHHHHHHHHHHhhhhcCceeeeeccCcccccccchhHHHHHHHHHHHHHHHhhhHHHHHHHHHhhcCCCceeEeC Q lcl|NC_019725. 81 DISGVPEFLSSKMDRIVSLSGIHEIIIKNKNVGGVSASQNTALETFYKLVDRKREEDYRPLLEFLLPFIVEEEEWSIEFE 160 (237) Q Consensus 81 ~lsGl~dl~~~~~~~iaa~s~iP~t~L~G~sp~GlnatGe~D~~nyyd~I~~~Qe~~l~p~l~~l~~~i~~s~~~~~~f~ 160 (237) +|||++++++++++.||++++||+|||||+||+||||||++|++|||++|+++||..++|+|++|+++++++++|+|+|+ T Consensus 282 ~lsgl~~~~~~~~~~iaaa~~IP~t~L~G~s~~glnstgd~d~~~yyd~i~~~Qe~~l~p~l~~l~~li~~s~d~~~~f~ 361 (435) T protein:vir:79 282 DVSGVPEFLQEKIDRIVALTGIHEIIIKNKNTGGVSASQNTALETFYKLIDRKRVEDYKPILEFLLPFMISETEWSIEFE 361 (435) T ss_pred ccCCHHHHHHHHHHHHHhhhCCCeeeeccCCccccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCCCeEEeC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CCCCCCHHHHHHHHHHHHHHHHHHHhCCCCCHHHHHHHHHhhccccccCCCCCCChhccccCCCCCCCCCCCcCcCC Q lcl|NC_019725. 161 PLSVPSKKEESEITKNNVESVTKAITEQIIDLEEARDTLRSIAPEFKLKDGNNINIREPEETTEPEPGLGEKLEDEN 237 (237) Q Consensus 161 pL~~~seke~Aei~~~~A~a~~~~~~~g~i~~~e~r~~l~~~~~~~g~~~~~~~~~~~~e~~~e~~~~~~~~~~~e~ 237 (237) |||+||+||+||+++++|+++++|+++|+|+++|+|+.|+...+++|+.+......++.++. +.....+.+|| T Consensus 362 pL~~~sekEkAei~~~~a~a~~~~~~~g~i~~~e~r~~L~~~~~~~~~~~~~~~~~~~~~d~----~~~~~~e~g~~ 434 (435) T protein:vir:79 362 PLSVPSDKDKAEIMAKNVESVVKLKAEQAINLKETRDTLRSICPDLKIMDNDNIELPEPEDL----DPEPGQEGGLN 434 (435) T ss_pred CCCCCCHHHHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHhccccCCCCcccccCCccccC----CCCCCCCCCCC Confidence 99999999999999999999999999999999999999998888888876444433332221 22222223333 No 8 >protein:vir:96068 Length: 765 # NCBI annotation: conserved hypothetical protein ORF017 # Family: family:all:297 # MgeID: mge:1597 # MgeName: F8 # Cross-refs: genbank:acc:YP_001294434;genbank:gi:149408331;genbank:GeneID:5237187 Probab=100.00 E-value=1.1e-68 Score=393.25 Aligned_cols=228 Identities=14% Similarity=0.195 Sum_probs=196.8 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHhccceeechhHHHhhcCCchHHHHHHHHHHHHHhcCchheeeeecCCcceeeeec Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRRKQQAVWKVKGLAEMCDDDDAQYAARLRLAQVDDNSGVGRAIGIDAETEEYDVLNS 80 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~~~~~v~k~~~l~~~~~~~~~e~~~~~r~~~~~~~r~~~~~~~iD~~~e~~~~~~~ 80 (237) +||++|++|++|+++++++++|+++++++|||++++..+. .+.++++|+++++++|+|+|++++|++ |+|+++++ T Consensus 279 vlq~~yd~I~~~~~t~~~~a~Ll~k~~~~v~k~~~~~~l~----~~~~l~~r~~~~~~~r~n~g~~~id~e-e~~e~~s~ 353 (765) T protein:vir:96 279 LTQRIYERVYAAERTANEAPLLAMSKRTSTIHVDVEKAIA----NEDAFNARLAFWIANRDNHGVKVIGID-ETMEQFDT 353 (765) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhccceeeechHhhhc----cHHHHHHHHHHHHHhcCCceeEEecCC-cceeEEec Confidence 8999999999999999999999999999999998766543 245799999999999999999999986 89999999 Q ss_pred CcCCHHHHHHHHHHHHhhhhcCceeeeeccCcccccccchhHHHHHHHHHHHHHHHhhhHHHHHHHHHhhcC----CCce Q lcl|NC_019725. 81 DISGVPEFLSSKMDRIVSLSGIHEIIIKNKNVGGVSASQNTALETFYKLVDRKREEDYRPLLEFLLPFIVEE----EEWS 156 (237) Q Consensus 81 ~lsGl~dl~~~~~~~iaa~s~iP~t~L~G~sp~GlnatGe~D~~nyyd~I~~~Qe~~l~p~l~~l~~~i~~s----~~~~ 156 (237) +||||+++++++++.||++++||+|||||+||+|||||||+|++|||++|+++||+.++|+|++|+++|+++ .+|+ T Consensus 354 ~lsgl~d~l~~~~~~iAaas~IP~t~LfGqsp~GlnATGe~D~~nYyD~I~s~Qe~~l~p~le~L~~li~~s~~i~~d~~ 433 (765) T protein:vir:96 354 NLSDFDSVIMNQYQLVAAIAKTPATKLLGTSPKGFNATGEHETISYHEELESIQEHIFDPLLERHYLLLAKSESIDVQLE 433 (765) T ss_pred ccCCHHHHHHHHHHHHHhhhCCCeeeeccCCcccccCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCcce Confidence 999999999999999999999999999999999999999999999999999999999999999999999876 4999 Q ss_pred eEeCCCCCCCHHHHHHHHHHHHHHHHHHHhCCCCCHHHHHHHHHhhccccccCCCCCCChhccccCCCCCCCCCCCcCc- Q lcl|NC_019725. 157 IEFEPLSVPSKKEESEITKNNVESVTKAITEQIIDLEEARDTLRSIAPEFKLKDGNNINIREPEETTEPEPGLGEKLED- 235 (237) Q Consensus 157 ~~f~pL~~~seke~Aei~~~~A~a~~~~~~~g~i~~~e~r~~l~~~~~~~g~~~~~~~~~~~~e~~~e~~~~~~~~~~~- 235 (237) |+|||||+||+||||++++++|+++++|+++|+|+++|+|++|+... .+| ..++++++++..+..+|..++..+. T Consensus 434 i~FnpL~~~sekEkAei~~k~Aea~~~~~~~Gvis~dEvR~~L~~~~-~~g---~~~l~d~~~e~~~~~~pe~~~~~~~~ 509 (765) T protein:vir:96 434 IVWNPVDSTTSQQQAELNNKKAATDEIYINSGVVSPDEVRERLRDDP-RSG---YNRLTDDQAETEPGMSPENLAELEKA 509 (765) T ss_pred EEeCCCCCCCHHHHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHhccc-cCC---CCCCCccccccccCCCccccccccCC Confidence 99999999999999999999999999999999999999999998632 333 3445555544322222111111110 Q ss_pred ----------CC Q lcl|NC_019725. 236 ----------EN 237 (237) Q Consensus 236 ----------e~ 237 (237) ++ T Consensus 510 ~~~~~~~~~e~~ 521 (765) T protein:vir:96 510 GAQSAKAKGEAE 521 (765) T ss_pred CcccccccCccc Confidence 00 No 9 >protein:vir:99563 Length: 862 # NCBI annotation: minor head protein-like protein # Family: family:all:297 # MgeID: mge:1544 # MgeName: BcepF1 # Cross-refs: genbank:acc:YP_001039808;genbank:gi:126011058;genbank:GeneID:4818258 Probab=100.00 E-value=3.6e-67 Score=384.83 Aligned_cols=226 Identities=12% Similarity=0.161 Sum_probs=197.8 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHhccceeechhHHHhhcCCchHHHHHHHHHHHHHhcCchheeeeecCCcceeeeec Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRRKQQAVWKVKGLAEMCDDDDAQYAARLRLAQVDDNSGVGRAIGIDAETEEYDVLNS 80 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~~~~~v~k~~~l~~~~~~~~~e~~~~~r~~~~~~~r~~~~~~~iD~~~e~~~~~~~ 80 (237) +||++|++|++|+++++++++|++++++.|+|+++++.+.. +..+.+|+.+++++|+|+|+++||++ |+|+++++ T Consensus 307 vLe~iyd~L~~~d~t~~saa~Ll~ka~l~v~ktd~l~~l~~----ed~l~~r~~~~~~~rdN~Gi~liD~e-Ee~e~ls~ 381 (862) T protein:vir:99 307 LVQRIYERVYAAERTANEAPLLAMNKRTTAIHTDTAKAIAN----EDKFIQRLMFWVRYRDNHAVKVLGTD-ETMEQFDT 381 (862) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhccceeechhHhhhcc----HHHHHHHHHHHHhccCcceeEEecCC-CceeEEec Confidence 99999999999999999999999999999999998876542 45789999999999999999999986 89999999 Q ss_pred CcCCHHHHHHHHHHHHhhhhcCceeeeeccCcccccccchhHHHHHHHHHHHHHHHhhhHHHHHHHHHhhcC----CCce Q lcl|NC_019725. 81 DISGVPEFLSSKMDRIVSLSGIHEIIIKNKNVGGVSASQNTALETFYKLVDRKREEDYRPLLEFLLPFIVEE----EEWS 156 (237) Q Consensus 81 ~lsGl~dl~~~~~~~iaa~s~iP~t~L~G~sp~GlnatGe~D~~nyyd~I~~~Qe~~l~p~l~~l~~~i~~s----~~~~ 156 (237) +||||++++++++++||++++||+|||||+||+||||||++|++|||++|+++||+.|+|+|++|+.+++.+ ++|+ T Consensus 382 slSGL~dll~~~~q~IAaas~IP~tiLfGqspaGlnATGE~D~~nYyD~I~s~QE~~L~P~LerL~~li~~~lg~~~d~~ 461 (862) T protein:vir:99 382 SLADFDAVIMGQYQLVASIAKTPATKLLGTAPKGFNSTGEFETISYHEELESIQEHVYMPFLQRHYLISRLSLGIQHEID 461 (862) T ss_pred ccCChHHHHHHHHHHHHhhhCCCceeecccCcccccCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCcce Confidence 999999999999999999999999999999999999999999999999999999999999999999988653 6999 Q ss_pred eEeCCCCCCCHHHHHHHHHHHHHHHHHHHhCCCCCHHHHHHHHHhhccccccCCCCCCChhccccCCCCCCCCCCCcCcC Q lcl|NC_019725. 157 IEFEPLSVPSKKEESEITKNNVESVTKAITEQIIDLEEARDTLRSIAPEFKLKDGNNINIREPEETTEPEPGLGEKLEDE 236 (237) Q Consensus 157 ~~f~pL~~~seke~Aei~~~~A~a~~~~~~~g~i~~~e~r~~l~~~~~~~g~~~~~~~~~~~~e~~~e~~~~~~~~~~~e 236 (237) |+|+|||+||++|+|++++++|+++++|+++|+|+++|+|++|+... . .+..+++++++|+.+...++... +.+ T Consensus 462 ieFnpL~~~sekEkAEi~kk~Aea~~~lv~sGvispdEvR~~L~~~~-~---~g~~~l~ded~E~d~~~~~e~~~--~~e 535 (862) T protein:vir:99 462 VVMEPVASMTAQQQADLNKTKAEGGKVLIDGGVISPDEERNRIRDDK-R---SGYNRLTKEDAEETPGASPENLA--AYQ 535 (862) T ss_pred EEeCCCCCCCHHHHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHhcC-C---cCCCCCCcccccccCCCCccccc--ccc Confidence 99999999999999999999999999999999999999999998532 2 33345666666543322111100 001 Q ss_pred C Q lcl|NC_019725. 237 N 237 (237) Q Consensus 237 ~ 237 (237) + T Consensus 536 ~ 536 (862) T protein:vir:99 536 K 536 (862) T ss_pred c Confidence 1 No 10 >protein:vir:103219 Length: 201 # NCBI annotation: hypothetical protein # Family: family:all:297 # MgeID: mge:1605 # MgeName: JK06 # Cross-refs: genbank:acc:YP_277473;genbank:gi:71834115;genbank:GeneID:3562330 Probab=100.00 E-value=1.5e-66 Score=381.42 Aligned_cols=201 Identities=62% Similarity=0.921 Sum_probs=177.7 Q ss_pred eeechhHHHhhcCCchHHHHHHHHHHHHHhcCchheeeeecCCcceeeeecCcCCHHHHHHHHHHHHhhhhcCceeeeec Q lcl|NC_019725. 30 VWKVKGLAEMCDDDDAQYAARLRLAQVDDNSGVGRAIGIDAETEEYDVLNSDISGVPEFLSSKMDRIVSLSGIHEIIIKN 109 (237) Q Consensus 30 v~k~~~l~~~~~~~~~e~~~~~r~~~~~~~r~~~~~~~iD~~~e~~~~~~~~lsGl~dl~~~~~~~iaa~s~iP~t~L~G 109 (237) |||+++|+++++++ +.++++|+++++++|+++++++||+++|+|++++++||||++++++|+++|||+++||+||||| T Consensus 1 V~k~~~l~~~~~~~--~~~~~~r~~~~~~~~~~~~~~~ld~~~e~~e~~~~~lsGl~d~l~~~~~~iaa~s~iP~t~LfG 78 (201) T protein:vir:10 1 MWKAKGLADLCDDS--DGAARLRLAQVDNNSGVGQAIGIDADSEEYNVLNSDIGGIDTFLSQKFDRIVALSGIHEIILKG 78 (201) T ss_pred CccchHHHHHhcCC--hHHHHHHHHHHHHhhhhhhhheeecCCcceeeeecCcCChHHHHHHHHHHHHhHhcCchhhhcC Confidence 99999999999876 4579999999999999999999999889999999999999999999999999999999999999 Q ss_pred cCcccccccchhHHHHHHHHHHHHHHHhhhHHHHHHHHHhhcCCCceeEeCCCCCCCHHHHHHHHHHHHHHHHHHHhCCC Q lcl|NC_019725. 110 KNVGGVSASQNTALETFYKLVDRKREEDYRPLLEFLLPFIVEEEEWSIEFEPLSVPSKKEESEITKNNVESVTKAITEQI 189 (237) Q Consensus 110 ~sp~GlnatGe~D~~nyyd~I~~~Qe~~l~p~l~~l~~~i~~s~~~~~~f~pL~~~seke~Aei~~~~A~a~~~~~~~g~ 189 (237) +||+||||||++|++|||++|+++||+.+||+|++|+++++++++|+|+|||||+||+||+|+|++++|+++++|+++|+ T Consensus 79 ~sp~Glnatge~d~~nyyd~i~~~Qe~~l~p~le~l~~~~~~~~~~~~~f~pL~~~s~kekAei~~~~a~a~~~~~~~g~ 158 (201) T protein:vir:10 79 KNVGGVSASQNTALETFYGYVDRKRKAELLPLLEFLLPFIVTEQEWSVEFNPLSQVSDKDKSEILEKNVNSVAALIAAGI 158 (201) T ss_pred CCCccccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCCCceEeeCCCCCCCHHHHHHHHHHHHHHHHHHHHcCC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CCHHHHHHHHHhhccccccCCCCCCChhccccCCCCCCCCCCCcCc Q lcl|NC_019725. 190 IDLEEARDTLRSIAPEFKLKDGNNINIREPEETTEPEPGLGEKLED 235 (237) Q Consensus 190 i~~~e~r~~l~~~~~~~g~~~~~~~~~~~~e~~~e~~~~~~~~~~~ 235 (237) |+++|+|++|+..+.. |..+...+ +++++.+.+. .+++.+++. T Consensus 159 i~~~e~r~~L~~~~~~-~~~~~~~~-~~~~~~~e~~-dp~~~~~~~ 201 (201) T protein:vir:10 159 IDADEARDTLRAISTE-VKIGEGSI-QTEVVINESE-DPLDVSANN 201 (201) T ss_pred CCHHHHHHHHHhcCCc-CCCCCCCC-CccccccccC-CCCCCCCCC Confidence 9999999999987654 43333333 3333333322 222222222 No 11 >protein:vir:5249 Length: 437 # NCBI annotation: hypothetical protein # Family: family:all:297 # MgeID: mge:117 # MgeName: Aaphi23 # Cross-refs: genbank:acc:NP_852754;genbank:gi:31544029;interpro:IPR006445;uniprot:Q7Y5U6;genbank:GeneID:2753529 Probab=100.00 E-value=8.9e-66 Score=377.22 Aligned_cols=228 Identities=19% Similarity=0.349 Sum_probs=202.5 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHhccceeechhHHHhhcCCchHHHHHHHHHHHHHhcCchheeeeecCCcceeeeec Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRRKQQAVWKVKGLAEMCDDDDAQYAARLRLAQVDDNSGVGRAIGIDAETEEYDVLNS 80 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~~~~~v~k~~~l~~~~~~~~~e~~~~~r~~~~~~~r~~~~~~~iD~~~e~~~~~~~ 80 (237) +||++|++|++|+++.+++++|++++++.|+|++++++.++.+ ++..+.+|+++++.++++++++++|++ |+|+++++ T Consensus 192 ~le~~~~~i~~~~~~~~~~~~l~~~~~~~v~k~~~l~~~l~~~-~~~~~~~~~~~~~~~~~~~~~~~~d~~-~~~e~~~~ 269 (437) T protein:vir:52 192 DLEKIIDVLKRFDSASVNVGDLIFESKIDIFKIAGLSDKIAAG-MENEVASVISAVQEIKSATNSLLLDAE-NEYDRKEL 269 (437) T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHcCCCceecchHHHHhcCC-cHHHHHHHHHHHHHhcCCCceEEEcCC-cceEEEec Confidence 9999999999999999999999999999999999999988775 567899999999999999999999986 89999999 Q ss_pred CcCCHHHHHHHHHHHHhhhhcCceeeeeccCcccccccchhHHHHHHHHHHHHHHHhhhHHHHHHHHHhhcC------CC Q lcl|NC_019725. 81 DISGVPEFLSSKMDRIVSLSGIHEIIIKNKNVGGVSASQNTALETFYKLVDRKREEDYRPLLEFLLPFIVEE------EE 154 (237) Q Consensus 81 ~lsGl~dl~~~~~~~iaa~s~iP~t~L~G~sp~GlnatGe~D~~nyyd~I~~~Qe~~l~p~l~~l~~~i~~s------~~ 154 (237) +|||+++++++++++||++++||+|+|||+||+|| |||++|++|||++|+++||+.++|+|++|+++|+++ ++ T Consensus 270 ~~sgl~~~l~~~~~~iaaa~~iP~t~L~G~s~~Gl-asge~D~~~yyd~i~~~Qe~~l~p~le~l~~~i~~~~~g~~~~~ 348 (437) T protein:vir:52 270 TFTGLKDLLTEFRNAVAGAADMPVTILFGQSVSGL-ASGDEDIQNYHEAIRRLQETRLRPIFEIIDPLICNELFGGLPAD 348 (437) T ss_pred CcCCHHHHHHHHHHHHHHHhcCchhhhcCcCcccc-cccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCc Confidence 99999999999999999999999999999999999 799999999999999999999999999999999865 58 Q ss_pred ceeEeCCCCCCCHHHHHHHHHHHHHHHHHHHhCCCCCHHHHHHHHHhhccccccCCCCCCChhccccCCCCCCCCCCCcC Q lcl|NC_019725. 155 WSIEFEPLSVPSKKEESEITKNNVESVTKAITEQIIDLEEARDTLRSIAPEFKLKDGNNINIREPEETTEPEPGLGEKLE 234 (237) Q Consensus 155 ~~~~f~pL~~~seke~Aei~~~~A~a~~~~~~~g~i~~~e~r~~l~~~~~~~g~~~~~~~~~~~~e~~~e~~~~~~~~~~ 234 (237) |+|+|||||+||+||+||+++++|+++++|+++|+++++|+|++|+.. |.++ .++++++++.+..++..++.++ T Consensus 349 ~~~~f~pL~~~s~kekae~~~~~a~a~~~~~~~g~i~~~e~r~~L~~~----g~~~--~i~~~~~~~~~~~~~~~~~~~~ 422 (437) T protein:vir:52 349 WWFEFVPLTTVKQEQQINMLNTFATAANTLIQNGVLNEYQIANELRES----GLFA--NISAEHIEELKNADEFAGNFEE 422 (437) T ss_pred ceEEeCCcCCcCHHHHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHhc----CCCC--CCCccccccccCCCCCCCccCC Confidence 999999999999999999999999999999999999999999999863 4443 3444444433333332222222 Q ss_pred cCC Q lcl|NC_019725. 235 DEN 237 (237) Q Consensus 235 ~e~ 237 (237) .++ T Consensus 423 ~~~ 425 (437) T protein:vir:52 423 PEK 425 (437) T ss_pred CCC Confidence 222 No 12 >protein:vir:94049 Length: 532 # NCBI annotation: hypothetical protein # Family: family:all:297 # MgeID: mge:1493 # MgeName: OP2 # Cross-refs: genbank:acc:YP_453629;genbank:gi:84662665;genbank:GeneID:5142559 Probab=100.00 E-value=7.9e-66 Score=377.50 Aligned_cols=233 Identities=14% Similarity=0.215 Sum_probs=199.6 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHhccceeechhHHHhhcCCchHHHHHHHHHHHHHhcCchheeeeecCCcceeeeec Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRRKQQAVWKVKGLAEMCDDDDAQYAARLRLAQVDDNSGVGRAIGIDAETEEYDVLNS 80 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~~~~~v~k~~~l~~~~~~~~~e~~~~~r~~~~~~~r~~~~~~~iD~~~e~~~~~~~ 80 (237) ++|++|++|++|+++.+++++|++++++.|+|+ +++++++.+ ++..+.+|+.+++++|+|+|++++|+++|+|+++++ T Consensus 256 vlq~~~~~l~~~~~t~~~~~~l~~~~~~~v~k~-~~a~~ls~~-~~~~~~~r~~~~~~~~~n~g~~~id~~~e~~e~~~~ 333 (532) T protein:vir:94 256 ISQLAMPYVDNWLRTRQSVSDTVKQFSMTNLAT-DMAQLLAPG-GAQSLDARLQLFNLYRDNRNIGALDKGTEEIQQTNT 333 (532) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhcCCceeee-chHHhhcch-hHHHHHHHHHHHHhhcCCccceEEcCCCceeEEEec Confidence 999999999999999999999999999999999 688888765 467899999999999999999999998899999999 Q ss_pred CcCCHHHHHHHHHHHHhhhhcCceeeeeccCcccccccchhHHHHHHHHHHHHHHHhhhHHHHHHHHHhhcC------CC Q lcl|NC_019725. 81 DISGVPEFLSSKMDRIVSLSGIHEIIIKNKNVGGVSASQNTALETFYKLVDRKREEDYRPLLEFLLPFIVEE------EE 154 (237) Q Consensus 81 ~lsGl~dl~~~~~~~iaa~s~iP~t~L~G~sp~GlnatGe~D~~nyyd~I~~~Qe~~l~p~l~~l~~~i~~s------~~ 154 (237) +||||+++++++++.|||+++||+|||||+||+|||||||+|++|||++|+++||+.++|+|++|+++|+++ ++ T Consensus 334 ~lsgl~~~l~~~~~~iAaa~~IP~t~LfG~sp~GlnstGe~D~~~yyd~I~s~Qe~~l~p~le~l~~~l~~s~~g~~~~d 413 (532) T protein:vir:94 334 PLSGLDSLQAQSQEQMAAVSHIPLVKLLGITPNGLNASSDGEIRVWYDFIAGYQATNLTPLMEWIIDLIQLSEYGQIDPG 413 (532) T ss_pred ccCCHHHHHHHHHHHHHhHhCCCeeeeecCCcccccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCC Confidence 999999999999999999999999999999999999999999999999999999999999999999999865 49 Q ss_pred ceeEeCCCCCCCHHHHHHHHHHHHHHHHHHHhCCCCCHHHHHHHHHhhccccccCCCCCCChhcccc------------- Q lcl|NC_019725. 155 WSIEFEPLSVPSKKEESEITKNNVESVTKAITEQIIDLEEARDTLRSIAPEFKLKDGNNINIREPEE------------- 221 (237) Q Consensus 155 ~~~~f~pL~~~seke~Aei~~~~A~a~~~~~~~g~i~~~e~r~~l~~~~~~~g~~~~~~~~~~~~e~------------- 221 (237) |+|+|+|||++|+||+||+++++|+++++|+++|+|+++|+|+.|+.. +.+++.+ ..+..++.++ T Consensus 414 ~~~~f~pL~~~s~kEkAei~~~~a~a~~~~~~~Gvi~~~Evr~~l~~~-~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~ 491 (532) T protein:vir:94 414 LAWEWSPLMELDDKELAEVRQLNASTDSTLMELGVIDAKMVQQRLAAD-PTSGYAG-ALGERDELDDVEEIAKQLMAAAL 491 (532) T ss_pred ceEEeCCCCCCCHHHHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHhcC-Ccccccc-ccccccccccccchhhhhccccc Confidence 999999999999999999999999999999999999999999999753 3444332 2222111110 Q ss_pred -----CCCCCCCCCCCcCcCC Q lcl|NC_019725. 222 -----TTEPEPGLGEKLEDEN 237 (237) Q Consensus 222 -----~~e~~~~~~~~~~~e~ 237 (237) .++.+.+.++...++. T Consensus 492 ~~~~~~~~~~~~~~~~~~d~~ 512 (532) T protein:vir:94 492 NPPATAPQTPNPQPDSEDDQT 512 (532) T ss_pred CCCCCCCCCCCCCCCCCCCCC Confidence 0011111122222222 No 13 >protein:vir:107742 Length: 537 # NCBI annotation: gp28 # Family: family:all:297 # MgeID: mge:1520 # MgeName: BcepB1A # Cross-refs: genbank:acc:YP_024875;genbank:gi:48697517;genbank:GeneID:2948359 Probab=100.00 E-value=3.3e-65 Score=374.08 Aligned_cols=230 Identities=10% Similarity=0.197 Sum_probs=196.7 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHhccceeechhHHHhhcCCchHHHHHHHHHHHHHhcCchheeeeecCCcceeeeec Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRRKQQAVWKVKGLAEMCDDDDAQYAARLRLAQVDDNSGVGRAIGIDAETEEYDVLNS 80 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~~~~~v~k~~~l~~~~~~~~~e~~~~~r~~~~~~~r~~~~~~~iD~~~e~~~~~~~ 80 (237) +||++|++|++|+++++++++|++++++.|+|+++++.+. + +..+.+|+++++++|+|++++++|+++|+|+++++ T Consensus 273 vlq~~~~~l~~~~~t~~~~~~l~~~~~~~v~k~~~~~~l~-~---~~~~~~r~~~~~~~r~n~g~~~id~e~e~~e~~~~ 348 (537) T protein:vir:10 273 LPQQIMERVYAAERTANEGPMLAMTKRQTVLKVDAAQVLA-N---KQQFDETMSWWTATRDNYQVRVVDKDNEDVVQIDT 348 (537) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhcCCceeeechHHhhc-C---HHHHHHHHHHHHhhcCCcceeEecCCCceeEEEec Confidence 8999999999999999999999999999999999876553 2 35689999999999999999999998899999999 Q ss_pred CcCCHHHHHHHHHHHHhhhhcCceeeeeccCcccccccchhHHHHHHHHHHHHHHHhhhHHHHHHHHHhhcC-----CCc Q lcl|NC_019725. 81 DISGVPEFLSSKMDRIVSLSGIHEIIIKNKNVGGVSASQNTALETFYKLVDRKREEDYRPLLEFLLPFIVEE-----EEW 155 (237) Q Consensus 81 ~lsGl~dl~~~~~~~iaa~s~iP~t~L~G~sp~GlnatGe~D~~nyyd~I~~~Qe~~l~p~l~~l~~~i~~s-----~~~ 155 (237) +|||+++++++++++||++++||+|||||+||+||||||++|++|||++|+++|| .++|+|++|+++|+++ .+| T Consensus 349 ~lsgl~~~l~~~~~~iAa~~~IP~t~L~G~sp~GlnatGe~D~~~yyd~I~~~Qe-~l~p~l~~l~~ll~~~~~~~~~~~ 427 (537) T protein:vir:10 349 TLNDLDKVIMNQYQLVCAIARTPAPKMLGTVPTGFNSTGDYEEASYHEECESTQD-DMRPLIDRHHQLVCRSHLRKRIRV 427 (537) T ss_pred cCCCHHHHHHHHHHHHHhhhCCCceeeccCCccccccchhHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHhcCCCCcce Confidence 9999999999999999999999999999999999999999999999999999999 5999999999999876 489 Q ss_pred eeEeCCCCCCCHHHHHHHHHHHHHHHHHHHhCCCCCHHHHHHHHHhhc--cccccCCCCCCChhccccCC--CCC----- Q lcl|NC_019725. 156 SIEFEPLSVPSKKEESEITKNNVESVTKAITEQIIDLEEARDTLRSIA--PEFKLKDGNNINIREPEETT--EPE----- 226 (237) Q Consensus 156 ~~~f~pL~~~seke~Aei~~~~A~a~~~~~~~g~i~~~e~r~~l~~~~--~~~g~~~~~~~~~~~~e~~~--e~~----- 226 (237) +|+|+|||++|+||+||+++++|+++++|+++|+|+++|+|+.|+... .++|+.+. +++++.++.. +.. T Consensus 428 ~i~f~pL~~~s~kEkAei~~~~a~a~~~~~~~G~i~~~Evr~~L~~~~~~g~~~l~~~--~~~ed~e~~~~~~~~~~~~~ 505 (537) T protein:vir:10 428 KVEFPPMDAPKESERADTFLKKMQAAKLAFEMGAVDGVDVNEYLRMDPTLGFTSITPA--MRPTDAEDIDVDDEGKPVRI 505 (537) T ss_pred EEEeCCCCCCCHHHHHHHHHHHHHHHHHHHHcCCCCHHHHHHHHhccCccccccccCC--CChhhhhcccCCccCCcCCC Confidence 999999999999999999999999999999999999999999998643 34566443 3333333211 100 Q ss_pred ---CC-----CCCCcCcCC Q lcl|NC_019725. 227 ---PG-----LGEKLEDEN 237 (237) Q Consensus 227 ---~~-----~~~~~~~e~ 237 (237) ++ .+....+++ T Consensus 506 ~~~~~~~~~~~~~~~~~~~ 524 (537) T protein:vir:10 506 IEDQPAPSEMFGATSSGES 524 (537) T ss_pred CCCCCCccccCCCCccccc Confidence 00 011111111 No 14 >protein:vir:80040 Length: 461 # NCBI annotation: gp3 # Family: family:all:297 # MgeID: mge:1876 # MgeName: B054 # Cross-refs: genbank:acc:YP_001468707;genbank:gi:157325287;genbank:GeneID:5601731 Probab=100.00 E-value=6.9e-61 Score=350.39 Aligned_cols=227 Identities=21% Similarity=0.264 Sum_probs=190.4 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHhccceeechhHHHhhcCCchHHHHHHHHHHHHHhcCchheeeeecCCcceeeeec Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRRKQQAVWKVKGLAEMCDDDDAQYAARLRLAQVDDNSGVGRAIGIDAETEEYDVLNS 80 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~~~~~v~k~~~l~~~~~~~~~e~~~~~r~~~~~~~r~~~~~~~iD~~~e~~~~~~~ 80 (237) ++|++|++|++|+++..++++|++++++.|+|+++++.+.... +.++.+|+ .++++|++++++|++ |+|+++++ T Consensus 221 ~le~~~~~l~~~~~~~~~~~~l~~~~~~~v~k~~~l~~~~~~~--~~~~~~~~---~~~~~~~g~~~~d~~-e~~e~~~~ 294 (461) T protein:vir:80 221 IFESLYDIITVMDTSLWSVGQILYDFAFKVYKTDDIDALNKDD--KANLTAML---DFMFRTEALAIIKGD-EQLTKEST 294 (461) T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHhCCCceecchHHhhhchH--HHHHHHHH---HHhcCCceEEEEcCC-cceEEEec Confidence 9999999999999999999999999999999999988776543 34455555 467889999999986 88999999 Q ss_pred CcCCHHHHHHHHHHHHhhhhcCceeeeeccCcccccccchhHHHHHHHHHHHHHHHhhhHHHHHHHHHhhcC-------- Q lcl|NC_019725. 81 DISGVPEFLSSKMDRIVSLSGIHEIIIKNKNVGGVSASQNTALETFYKLVDRKREEDYRPLLEFLLPFIVEE-------- 152 (237) Q Consensus 81 ~lsGl~dl~~~~~~~iaa~s~iP~t~L~G~sp~GlnatGe~D~~nyyd~I~~~Qe~~l~p~l~~l~~~i~~s-------- 152 (237) +|||++++++++++.||++++||+|+|||+|| |.||||++|++|||++|+++||+.++|+|++|+++|+++ T Consensus 295 ~lsgl~~~l~~~~~~iaa~s~iP~t~L~G~s~-g~~asge~D~~~yyd~i~~~qe~~l~p~le~l~~~i~~s~~~~~~~~ 373 (461) T protein:vir:80 295 NVSGMKDLLDYGWDYLAGAVRMPKTVLKGQEA-GTLTGAQYDVMNYYARVSSIQENRLRPQLEYLTRLLMWASDDCGPSI 373 (461) T ss_pred CcCCHHHHHHHHHHHHhhhhcCCeeeeecccC-CccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccc Confidence 99999999999999999999999999999999 556899999999999999999999999999999999864 Q ss_pred ----CCceeEeCCCCCCCHHHHHHHHHHHHHHHHHHHhCCCCCHHHHHHHHHhhc--cccccCCCCCCChhccccCCCCC Q lcl|NC_019725. 153 ----EEWSIEFEPLSVPSKKEESEITKNNVESVTKAITEQIIDLEEARDTLRSIA--PEFKLKDGNNINIREPEETTEPE 226 (237) Q Consensus 153 ----~~~~~~f~pL~~~seke~Aei~~~~A~a~~~~~~~g~i~~~e~r~~l~~~~--~~~g~~~~~~~~~~~~e~~~e~~ 226 (237) .+|+|+|+|||+||+||+||+++++|+++++|+++|+|+++|+|+.|+.+. ++.+.+++.+.++++..+.. .+ T Consensus 374 ~p~~~~~~i~f~~L~~~s~kekAe~~~~~a~a~~~~~~~g~is~~e~r~~l~~~~~~~~~~~~~~~~~~~~~~~~~~-~~ 452 (461) T protein:vir:80 374 DPDSFEWAIEFNPLWNLDSKTDAEVRKLTAEADQIYIVNGVLDPDEVKETRFGRFGLENSSKFSGDSAEIDKLAKLV-YD 452 (461) T ss_pred CccccceEEEeCCCCCCCHHHHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHhcCCCCCccCCCCCchhhhhhhhc-cc Confidence 379999999999999999999999999999999999999999999997643 23444555544444332211 11 Q ss_pred CCCCCCcCc Q lcl|NC_019725. 227 PGLGEKLED 235 (237) Q Consensus 227 ~~~~~~~~~ 235 (237) ++..|.-++ T Consensus 453 ~~~~e~~~g 461 (461) T protein:vir:80 453 AYAKKNADG 461 (461) T ss_pred cccccCCCC Confidence 111111112 No 15 >protein:vir:105782 Length: 449 # NCBI annotation: gp5 # Family: family:all:6783 # MgeID: mge:1501 # MgeName: ES18 # Cross-refs: genbank:acc:YP_224143;genbank:gi:62362218;genbank:GeneID:3342535 Probab=100.00 E-value=2.2e-55 Score=320.22 Aligned_cols=219 Identities=10% Similarity=0.033 Sum_probs=165.6 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHhccc--------eeechhHHHhhcCCchHHHHHHHH-HHHHHhcCchheeeeecC Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRRKQQA--------VWKVKGLAEMCDDDDAQYAARLRL-AQVDDNSGVGRAIGIDAE 71 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~~~~~--------v~k~~~l~~~~~~~~~e~~~~~r~-~~~~~~r~~~~~~~iD~~ 71 (237) +||++|+++..++++..+.++.+.+.... .+++.+++.+++.+ .+ .+.+++ ..+..+..+.+++++|.+ T Consensus 213 ~L~~~yn~l~~~~~~~~~~a~~~l~~~~rq~~~~~~~~~~~~~l~~~~~~~-~e-~~~~~~~~~~~~~~~~~~~~~i~~~ 290 (449) T protein:vir:10 213 FLEPAYNAFVSLEKVEGGSGESFLKNAARQLNVNFEKEIDFTNLASLYGVS-ID-ELQDKFNEVAGEINRGNDVLMTTQG 290 (449) T ss_pred HHHHHHHHhhhHHHhhhhHHHHHHHHHHHHHhhhhhhhhhhhhhhHHhhCC-ch-HHHHHHHHHHHHHhccchheeecCC Confidence 89999999999999999888766553221 23445566555443 22 222222 233334345566788875 Q ss_pred CcceeeeecCcCCHHHHHHHHHHHHhhhhcCceeeeeccCcccccccchhHHHHHHHHHHHHHHHhhhHHHHHHHHHhhc Q lcl|NC_019725. 72 TEEYDVLNSDISGVPEFLSSKMDRIVSLSGIHEIIIKNKNVGGVSASQNTALETFYKLVDRKREEDYRPLLEFLLPFIVE 151 (237) Q Consensus 72 ~e~~~~~~~~lsGl~dl~~~~~~~iaa~s~iP~t~L~G~sp~GlnatGe~D~~nyyd~I~~~Qe~~l~p~l~~l~~~i~~ 151 (237) ++|++++++|||++++++++++.+||+++||+||||||||||||||| |++|||++|+++|+ .|+|+|++|+++|++ T Consensus 291 -~d~~~~~~~~sgl~d~l~~~~q~iaaa~~IP~t~L~Gqsp~glnst~--D~~nyyd~i~~~Q~-~l~p~le~l~~~l~~ 366 (449) T protein:vir:10 291 -ATVTPLVTSVADPTATYNVNLQTAAAGVDIPTRILIGNQQAERSSTE--DQKYFNARCQSRRV-DLSFEIEDFCDKLIE 366 (449) T ss_pred -cceEEEecccCChhHHHHHHHHHHHHHhCCCeeeeeccCccccccch--hHHHHHHHHHHHHH-hhhHHHHHHHHHHHH Confidence 78999999999999999999999999999999999999999999764 89999999999997 599999999999987 Q ss_pred C------CCceeEeCCCCCCCHHHHHHHHHHHHHHHHHHHhCC---CCCHHHHHHHHHhhccccccCCCCCCChhccccC Q lcl|NC_019725. 152 E------EEWSIEFEPLSVPSKKEESEITKNNVESVTKAITEQ---IIDLEEARDTLRSIAPEFKLKDGNNINIREPEET 222 (237) Q Consensus 152 s------~~~~~~f~pL~~~seke~Aei~~~~A~a~~~~~~~g---~i~~~e~r~~l~~~~~~~g~~~~~~~~~~~~e~~ 222 (237) + ++|+|+|+|||+||+||+|+|++++|+++++++++| +++++|+|+.+.. .+..+.....+++ T Consensus 367 s~~g~~~~d~~i~f~pL~~~t~kEkAei~k~~A~a~~~~~~ag~~~~~~~~EiR~~~~~----~~~~~~~~~~e~~---- 438 (449) T protein:vir:10 367 LKIIDAVAKKAVIWDDLNEQTGTEKLTNAKTMGEINQTMLGSGDNPAFSREEIRTAAGY----DNDDEEPLGEEDG---- 438 (449) T ss_pred hhcCCCCCceeEEeCCCCCCCHHHHHHHHHHHHHHHHHHHHccccCCcCHHHHHHHhcc----cCCCCCCCCCCCC---- Confidence 5 589999999999999999999999999999999888 9999999998743 2211111111111 Q ss_pred CCCCCCCCCCcCc Q lcl|NC_019725. 223 TEPEPGLGEKLED 235 (237) Q Consensus 223 ~e~~~~~~~~~~~ 235 (237) ++...+..... T Consensus 439 --de~~~~~d~~a 449 (449) T protein:vir:10 439 --DEEDKATDSAA 449 (449) T ss_pred --ccccccCCcCC Confidence 11111111111 No 16 >protein:vir:100882 Length: 383 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1473 # MgeName: Lc-Nu # Cross-refs: genbank:acc:YP_358762;genbank:gi:78000027;genbank:GeneID:3726153 Probab=99.29 E-value=6e-13 Score=87.61 Aligned_cols=207 Identities=12% Similarity=0.118 Sum_probs=128.9 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHhcc--ceeechhHHHhhcCCchHHHHHHHHHHHHHhcCchheeeeecCCcceeee Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRRKQQ--AVWKVKGLAEMCDDDDAQYAARLRLAQVDDNSGVGRAIGIDAETEEYDVL 78 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~~~~--~v~k~~~l~~~~~~~~~e~~~~~r~~~~~~~r~~~~~~~iD~~~e~~~~~ 78 (237) -++.+.+.|.....+......+...... .++++++ -+.+++....++++++-.....+..+.++++. +.+|+.+ T Consensus 169 ~l~~~~~~i~~~~~~~~~~~~~f~ng~~~~~il~~~~---~~~~~e~~~~~~~~~~~~~~~~n~~~~~vl~~-g~~~~~l 244 (383) T protein:vir:10 169 PLESLQNALNLDDKASKSNMSAMENQINPAGKLTISN---YLSDGKDLESAREEFEKANTGDNSGRLMVLPD-GFDYTQL 244 (383) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCC---CCCCHHHHHHHHHHHHHHhCccccCCccccCC-CceEEec Confidence 5677888888888888888887776543 3555543 22233334445555554333322334566665 5889998 Q ss_pred ecCcCCHHH---HHHHHHHHHhhhhcCceeeeeccCcccccccchhHHHHHHHHHHHHHHHhhhHHHHHHHHHhh---cC Q lcl|NC_019725. 79 NSDISGVPE---FLSSKMDRIVSLSGIHEIIIKNKNVGGVSASQNTALETFYKLVDRKREEDYRPLLEFLLPFIV---EE 152 (237) Q Consensus 79 ~~~lsGl~d---l~~~~~~~iaa~s~iP~t~L~G~sp~GlnatGe~D~~nyyd~I~~~Qe~~l~p~l~~l~~~i~---~s 152 (237) +.+...... ........||.+-|||-.+|-+...++.+.+.-...+.+|.. .|+|.++.+-..+- .. T Consensus 245 ~~~~~d~~~l~e~~~~~~~~Ia~afgVPp~~lg~~~~~~~~~sn~eq~~~~~~~-------~l~P~~~~ie~~l~~~l~~ 317 (383) T protein:vir:10 245 EMKTDVFKALADNSAYSADQISKAFGVPSDILGGGTSTESQHSNIDQIKATYLA-------NLNSYVNPIVDELRLKMNA 317 (383) T ss_pred CCChhHHHHHHHHHHHHHHHHHHHhCCCHHHcCCccCCCCccccHHHHHHHHHH-------HHHHHHHHHHHHHHHhhCC Confidence 887766553 455567999999999998887766665554432333334322 37787777655442 34 Q ss_pred CCceeEeCCCCCCCHHHHHHHHHHHHHHHHHHHhCCCCCHHHHHHHHHhhccccccCCCCCCChhccccCCCCCCCCCCC Q lcl|NC_019725. 153 EEWSIEFEPLSVPSKKEESEITKNNVESVTKAITEQIIDLEEARDTLRSIAPEFKLKDGNNINIREPEETTEPEPGLGEK 232 (237) Q Consensus 153 ~~~~~~f~pL~~~seke~Aei~~~~A~a~~~~~~~g~i~~~e~r~~l~~~~~~~g~~~~~~~~~~~~e~~~e~~~~~~~~ 232 (237) ..+.|.+.+|...|.+++ ++++..++++|+++++|+|+.+-. .++.+++ . .+. ..+..+..+|++ T Consensus 318 ~~~~f~~~~l~~~d~~~~-------~~~~~~~~~~G~~t~nE~R~~lg~----~p~~~~d-~--~~~-~~~~~~~~gGd~ 382 (383) T protein:vir:10 318 PDLELDIKDMLDVDDSIL-------INQVSNLAKSGVLGAEQAQFILTR----SGFLPDN-L--PEF-KPLTNETKGGDD 382 (383) T ss_pred ceEEeechhhhccCHHHH-------HHHHHHHHhCCCcCHHHHHHHhCC----CcccCCc-c--ccc-CCCcccCCCCCC Confidence 678999999999998886 556888999999999999997631 2222221 1 111 111122233433 Q ss_pred c Q lcl|NC_019725. 233 L 233 (237) Q Consensus 233 ~ 233 (237) + T Consensus 383 e 383 (383) T protein:vir:10 383 K 383 (383) T ss_pred C Confidence 3 No 17 >protein:vir:102118 Length: 409 # NCBI annotation: phage portal protein, HK97 family # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1641 # MgeName: phiSM101 # Cross-refs: genbank:acc:YP_699943;genbank:gi:110804051;genbank:GeneID:4206661 Probab=99.20 E-value=3.8e-12 Score=83.21 Aligned_cols=211 Identities=14% Similarity=0.088 Sum_probs=127.9 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHhcc--ceeechhHHHhhcCCchHHHHHHHHHHHHHhcCch-heeeeecCCcceee Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRRKQQ--AVWKVKGLAEMCDDDDAQYAARLRLAQVDDNSGVG-RAIGIDAETEEYDV 77 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~~~~--~v~k~~~l~~~~~~~~~e~~~~~r~~~~~~~r~~~-~~~~iD~~~e~~~~ 77 (237) .++.+.+.+.....+......++..... .++++++ .++ ++....++++++-....-.|. ++++++. +.+|+. T Consensus 183 ~i~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~---~l~-~e~~~~~~~~~~~~~~g~~n~~~~~vl~~-g~~~~~ 257 (409) T protein:vir:10 183 VIELLNHLIENGKSSETYLNNFFKNGLQVKGLVQYAG---DLN-PEAEEVFKENFERMSSGLKNAHRIAMLPI-GYKFEP 257 (409) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhccCCCcEEEEcCC---CCC-HHHHHHHHHHHHHHhccccccCCceecCC-CceEEE Confidence 7788888888888888888887776333 3666653 222 223445566665433332333 4566654 578888 Q ss_pred eecCcCCHH--HHHHHHHHHHhhhhcCceeeeeccCcccccccchhHHHHHHHHHHHHHHHhhhHHHHHHHHHhhc---- Q lcl|NC_019725. 78 LNSDISGVP--EFLSSKMDRIVSLSGIHEIIIKNKNVGGVSASQNTALETFYKLVDRKREEDYRPLLEFLLPFIVE---- 151 (237) Q Consensus 78 ~~~~lsGl~--dl~~~~~~~iaa~s~iP~t~L~G~sp~GlnatGe~D~~nyyd~I~~~Qe~~l~p~l~~l~~~i~~---- 151 (237) ++.+..... +......+.||.+-|||...| |...++=.++-+.....||.. .|.|.++++-..+-+ T Consensus 258 l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~l-g~~~~~~~~~~e~~~~~f~~~-------~l~P~~~~ie~~ln~kL~~ 329 (409) T protein:vir:10 258 ISQKLVDAQFLENSQLTIRQIASVFGVKMHQL-NDLDRATHSNITEQNREFYID-------TLQSILNMYELEINYKLFL 329 (409) T ss_pred ccCChhhHHHHHHHHHHHHHHHHHhCCCHHHc-CCCCCCccccHHHHHHHHHHH-------HHHHHHHHHHHHHHHhhcC Confidence 877664333 456678899999999999977 444444344556667777753 477877776444421 Q ss_pred ----CCCceeE--eCCCCCCCHHHHHHHHHHHHHHHHHHHhCCCCCHHHHHHHHHhhccccccCC-CCCCChhccccCCC Q lcl|NC_019725. 152 ----EEEWSIE--FEPLSVPSKKEESEITKNNVESVTKAITEQIIDLEEARDTLRSIAPEFKLKD-GNNINIREPEETTE 224 (237) Q Consensus 152 ----s~~~~~~--f~pL~~~seke~Aei~~~~A~a~~~~~~~g~i~~~e~r~~l~~~~~~~g~~~-~~~~~~~~~e~~~e 224 (237) ..++.|+ +..|...|.+++ ++++..++++|+++++|+|+.+ ...+..|-.- .........+...+ T Consensus 330 ~~~~~~~~~~~fd~~~ll~~d~~~~-------~~~~~~~~~~G~~T~NE~R~~l-gl~p~~ggD~~~~~~n~~~~~~~~~ 401 (409) T protein:vir:10 330 ISEIKNGFYSKFNVDTILRADIKTR-------YESYKEAIQNGFKTPNEIRELE-EDEPLEGGDVLLINGNMIPVKMAGE 401 (409) T ss_pred chhccCCcEEEEechhhhccCHHHH-------HHHHHHHHhCCCcCHHHHHHHh-CCCCCCCcCeeeeccCccchhhccc Confidence 2344445 557777777765 5667789999999999999976 3333222100 00111112233344 Q ss_pred CCCCCCCC Q lcl|NC_019725. 225 PEPGLGEK 232 (237) Q Consensus 225 ~~~~~~~~ 232 (237) +...+||. T Consensus 402 ~~~kgGe~ 409 (409) T protein:vir:10 402 QYSKGGEK 409 (409) T ss_pred cccccCCC Confidence 44555666 No 18 >protein:vir:100187 Length: 385 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1524 # MgeName: phi AT3 # Cross-refs: genbank:acc:YP_025029;genbank:gi:48697262;genbank:GeneID:2948285 Probab=99.19 E-value=4.3e-12 Score=82.93 Aligned_cols=209 Identities=11% Similarity=0.119 Sum_probs=125.4 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHhc--cceeechhHHHhhcCCchHHHHHHHHHHHHHhcCchheeeeecCCcceeee Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRRKQ--QAVWKVKGLAEMCDDDDAQYAARLRLAQVDDNSGVGRAIGIDAETEEYDVL 78 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~~~--~~v~k~~~l~~~~~~~~~e~~~~~r~~~~~~~r~~~~~~~iD~~~e~~~~~ 78 (237) -++.+.+.+.....+......+..... -.++++++ -+.+++....++++++-.....+..+.+++++ +.+|+.+ T Consensus 169 ~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~gil~~~~---~~~~~e~~~~~~~~~~~~~~~~n~~~~~vl~~-g~~~~~l 244 (385) T protein:vir:10 169 PLESLQNALNLDDKASKSNMSAMENQINPAGKLTISN---YLSDGKDLESAREEFEKANTGDNSGRLMVLPD-GFDYTQL 244 (385) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCC---CCCCHHHHHHHHHHHHHHhCccccCCccccCC-CceEEec Confidence 467777888877777777777766633 24555543 22233334456666665443333344566665 5788887 Q ss_pred ecCcCCHH---HHHHHHHHHHhhhhcCceeeeeccCcccccccchhHHHHHHHHHHHHHHHhhhHHHHHHHHHhh---cC Q lcl|NC_019725. 79 NSDISGVP---EFLSSKMDRIVSLSGIHEIIIKNKNVGGVSASQNTALETFYKLVDRKREEDYRPLLEFLLPFIV---EE 152 (237) Q Consensus 79 ~~~lsGl~---dl~~~~~~~iaa~s~iP~t~L~G~sp~GlnatGe~D~~nyyd~I~~~Qe~~l~p~l~~l~~~i~---~s 152 (237) +.+..-+. +........||.+-|||-..|-+...++.+.+.-...+.||.. .|.|.+.++-..+- .. T Consensus 245 ~~~~~d~~~l~e~~~~~~~~Ia~~fgVp~~~lg~~~~~~~~~sn~eq~~~~~~~-------~l~P~~~~ie~~l~~~l~~ 317 (385) T protein:vir:10 245 EMKTDVFKALADNSAYSADQISKAFGVPSDILGGGTSTESQHSNIDQIKATYLA-------NLNSYVNPIVDELRLKMNA 317 (385) T ss_pred CCChhHHHHHHHHHHHHHHHHHHHhCCCHHHcCCccCCCcccccHHHHHHHHHH-------HHHHHHHHHHHHHHHhhCC Confidence 77655544 3345557889999999988887755555543432233444421 36777777655553 34 Q ss_pred CCceeEeCCCCCCCHHHHHHHHHHHHHHHHHHHhCCCCCHHHHHHHHHhhccccccCCCCCCChhccccCCCCCCCCCCC Q lcl|NC_019725. 153 EEWSIEFEPLSVPSKKEESEITKNNVESVTKAITEQIIDLEEARDTLRSIAPEFKLKDGNNINIREPEETTEPEPGLGEK 232 (237) Q Consensus 153 ~~~~~~f~pL~~~seke~Aei~~~~A~a~~~~~~~g~i~~~e~r~~l~~~~~~~g~~~~~~~~~~~~e~~~e~~~~~~~~ 232 (237) ..+.|.+.+|...|.+++ ++++++++++|+++++|+|+.+.. .++.+ .+.+.-. .+-.....|++ T Consensus 318 ~~~~f~~~~ll~~d~~~~-------~~~~~~~~~~G~~T~NE~R~~~g~----~p~p~-~~~~~~~---~~~~~~~~g~~ 382 (385) T protein:vir:10 318 PDLELDIKDMLDVDDSAL-------INQVSNLAKSGVLGAEQAQFILTR----SGFLP-DNLPEFK---PLTTQVKGGDE 382 (385) T ss_pred ceEEeechhhhccCHHHH-------HHHHHHHHhCCCcCHHHHHHHhCC----CccCC-CCCcccc---CcccccCCCCC Confidence 678888899999988775 677888999999999999987632 22211 1111111 11112223333 Q ss_pred cCc Q lcl|NC_019725. 233 LED 235 (237) Q Consensus 233 ~~~ 235 (237) .++ T Consensus 383 ~dn 385 (385) T protein:vir:10 383 GDN 385 (385) T ss_pred CCC Confidence 222 No 19 >protein:vir:79772 Length: 648 # NCBI annotation: portal protein # Family: family:all:3222 # MgeID: mge:1874 # MgeName: 0305phi8-36 # Cross-refs: genbank:acc:YP_001429612;genbank:gi:156564103;genbank:GeneID:5525537 Probab=99.07 E-value=4.8e-11 Score=77.17 Aligned_cols=219 Identities=14% Similarity=0.134 Sum_probs=121.1 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHhcc--ceeechhHHHhhcCCchHHHHHHHHHHHHHhcCchheeeeecCCcceeee Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRRKQQ--AVWKVKGLAEMCDDDDAQYAARLRLAQVDDNSGVGRAIGIDAETEEYDVL 78 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~~~~--~v~k~~~l~~~~~~~~~e~~~~~r~~~~~~~r~~~~~~~iD~~~e~~~~~ 78 (237) .++.+.+.|.....+....+.+...... .+++++. ......+..+.++.+ ++..+.+.+-+..-.++.+ T Consensus 246 pi~~a~~aI~l~~aa~~~~~~fF~NGa~P~gil~~~~------~~~~~e~~k~~~e~~---~~~~~~~~i~gg~v~~~~~ 316 (648) T protein:vir:79 246 WLLPALDDIRALRQVEENVLRLVYRNLHPLWHVKVGL------EQEGFGAEEGEVDLV---RGEVENMDVEGGMVTTERV 316 (648) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEeCC------CccchHHHHHHHHHH---HHhccccccccccccccee Confidence 7788889998888888888887776553 3344321 111112222222222 2222222222223345555 Q ss_pred ecCcCCH------HHHHHHHHHHHhhhhcCceeeeeccCcccccccchhHHHHHHHHHHHHHHHhhhHHHHHHHHHhhcC Q lcl|NC_019725. 79 NSDISGV------PEFLSSKMDRIVSLSGIHEIIIKNKNVGGVSASQNTALETFYKLVDRKREEDYRPLLEFLLPFIVEE 152 (237) Q Consensus 79 ~~~lsGl------~dl~~~~~~~iaa~s~iP~t~L~G~sp~GlnatGe~D~~nyyd~I~~~Qe~~l~p~l~~l~~~i~~s 152 (237) ..+..+- .+......+.||.+-|||-.+| |...++-.++++....+|++.|...|....+..-.++...+... T Consensus 317 ~i~~~~s~~dlqfle~rk~~~~eIa~aFgVPP~lL-G~~~~ss~stae~~~~~~~~~i~~l~~~i~~~le~~~~~~ll~e 395 (648) T protein:vir:79 317 NISSIASNQIIDAKEYLKHFEQRAFTVLGVSELMM-GRGGTASRSTGDNLSSDFKDRIKALQKVMATFINEFMVKEILME 395 (648) T ss_pred eccccCCHHHHHHHHHHHHHHHHHHHHhCCCHhHc-ccCCCccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhh Confidence 5544322 2234566789999999998755 87666666778888899999999999776666544444433221 Q ss_pred ----------CCceeEeCCCCCCCHHHHHHHHHHHHHHHHHHHhCCCCCHHHHHHHHHhhccc-cccCC---CCCCC--- Q lcl|NC_019725. 153 ----------EEWSIEFEPLSVPSKKEESEITKNNVESVTKAITEQIIDLEEARDTLRSIAPE-FKLKD---GNNIN--- 215 (237) Q Consensus 153 ----------~~~~~~f~pL~~~seke~Aei~~~~A~a~~~~~~~g~i~~~e~r~~l~~~~~~-~g~~~---~~~~~--- 215 (237) ..+.|+|++|...+++.+++. +..++++|+++++|+|+.+ ...+. .|..+ ..+.. T Consensus 396 ~~l~~~l~~d~~ieF~~~~Llr~D~~~~a~~-------~~~l~~~GilT~NEaR~~l-GlpPi~~g~~~~~l~~~~~~~~ 467 (648) T protein:vir:79 396 GGFDPVLNPDDKVEFRFNEIDMDSKIKLENQ-------AVFLYEHNAISEDEMRELI-GRDPVDDGEGRAKMHLQMVTIA 467 (648) T ss_pred hhccccccccceEEEeecccchhhHHHHHHH-------HHHHHhCCCcCHHHHHHHh-CCCCCCCCCCccccccccccch Confidence 236788999988877666543 5568999999999999976 33332 22110 01100 Q ss_pred hhccccCCCCCCCC---------CCCcCcCC Q lcl|NC_019725. 216 IREPEETTEPEPGL---------GEKLEDEN 237 (237) Q Consensus 216 ~~~~e~~~e~~~~~---------~~~~~~e~ 237 (237) ...++..+.++|.. +...+.+| T Consensus 468 ~~~~~~~~~~~~~~~~~~~a~~eg~~~e~~~ 498 (648) T protein:vir:79 468 QATALAALAPTPAGGSSASASGDKKKKATDN 498 (648) T ss_pred hccccccCCCCCCCCCCCCccccccccccCC Confidence 01111111111111 11111111 No 20 >protein:vir:95378 Length: 406 # NCBI annotation: phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1567 # MgeName: GBSV1 # Cross-refs: genbank:acc:YP_764474;genbank:gi:115334628;genbank:GeneID:5179265 Probab=99.07 E-value=2.9e-11 Score=78.33 Aligned_cols=210 Identities=14% Similarity=0.128 Sum_probs=117.8 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHhcc--ceeechhHHHhhcCCchHHHHHHHHHHHHHhcC--c-hheeeeecCCcce Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRRKQQ--AVWKVKGLAEMCDDDDAQYAARLRLAQVDDNSG--V-GRAIGIDAETEEY 75 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~~~~--~v~k~~~l~~~~~~~~~e~~~~~r~~~~~~~r~--~-~~~~~iD~~~e~~ 75 (237) -++.+.+.+.....+......+...... .++++++ .++.. ....+++++. ..+++ | .+.+++..+++++ T Consensus 178 ~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~il~~~~---~l~~e-~~~~~~~~~~--~~~~g~~n~~~~~v~~~~~~~~ 251 (406) T protein:vir:95 178 YRVVLKDIADNLKQATATKKSFMSGKYMPSLIVKVDA---ATAEL-SSEEGRNAVF--KKYLQATEAGQPWIIPAELLEV 251 (406) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCC---CCCHH-HHHHHHHHHH--HHhccccccCCceeecCCCccc Confidence 5677778888888888888887766444 3566653 22222 2223333332 33332 3 3355665555666 Q ss_pred eeee-cCc--CCHHHHHHHHHHHHhhhhcCceeeeeccCcccccccchhHHHHHHHHHHHHHHHhhhHHHHHHHHHhh-- Q lcl|NC_019725. 76 DVLN-SDI--SGVPEFLSSKMDRIVSLSGIHEIIIKNKNVGGVSASQNTALETFYKLVDRKREEDYRPLLEFLLPFIV-- 150 (237) Q Consensus 76 ~~~~-~~l--sGl~dl~~~~~~~iaa~s~iP~t~L~G~sp~GlnatGe~D~~nyyd~I~~~Qe~~l~p~l~~l~~~i~-- 150 (237) .++. .+. +-+.+........||.+-|||..+| |. +...+....+||. ..|.|.++++-..|- T Consensus 252 ~~~~~~~~~d~q~~e~~~~~~~~Ia~~fgVp~~~l-g~-----~~~~~~~~~~~~~-------~~l~P~~~~ie~~l~~~ 318 (406) T protein:vir:95 252 EQVKPLSLKDIAINEAVELDKRTVAGMFGVPAFLL-GI-----GEFNRDEYNNFIN-------STILPIAKGIEQELTRK 318 (406) T ss_pred cccccCChhHHHHHHHHHHHHHHHHHHhCCCHHHc-CC-----CCchHHHHHHHHH-------HHHHHHHHHHHHHHHHh Confidence 5432 232 2334566778899999999998777 42 1112334455554 458898887765553 Q ss_pred --cCCCc--eeEeCCCCCCCHHHHHHHHHHHHHHHHHHHhCCCCCHHHHHHHHHhhcccccc----CCCCCCChhccccC Q lcl|NC_019725. 151 --EEEEW--SIEFEPLSVPSKKEESEITKNNVESVTKAITEQIIDLEEARDTLRSIAPEFKL----KDGNNINIREPEET 222 (237) Q Consensus 151 --~s~~~--~~~f~pL~~~seke~Aei~~~~A~a~~~~~~~g~i~~~e~r~~l~~~~~~~g~----~~~~~~~~~~~e~~ 222 (237) ...++ .|.+..|...|.+++ ++.+..++++|+++++|+|+.+- ..+..|. .+......+...+. T Consensus 319 l~~~~~~~~~fd~~~l~~~d~~~~-------~~~~~~l~~~G~~t~NE~R~~~g-l~p~~~gd~~~~~~n~~~~~~~~~~ 390 (406) T protein:vir:95 319 LLISPDLYFKFNPRSLYAYDLKEL-------AEVGSNMYVRGIMEGNEVRDWLG-LSPKEGLSELVILENYIPLDKIGDQ 390 (406) T ss_pred cCCCCCcEEEeechhhhcCCHHHH-------HHHHHHHHhCCCcCHHHHHHHhC-CCCCCCcceeeeccCccchhhcccc Confidence 23444 555666766666664 56678889999999999999763 2222221 11111112222222 Q ss_pred CCCCCCCCCCcCcCC Q lcl|NC_019725. 223 TEPEPGLGEKLEDEN 237 (237) Q Consensus 223 ~e~~~~~~~~~~~e~ 237 (237) ...+++.+++.++++ T Consensus 391 ~~~k~g~~~~~~~~~ 405 (406) T protein:vir:95 391 SKLKGGDNSGADGQT 405 (406) T ss_pred cccCCCCCCCCCCCC Confidence 223333344444444 No 21 >protein:vir:102727 Length: 945 # NCBI annotation: portal protein # Family: family:all:2446 # MgeID: mge:1610 # MgeName: YS40 # Cross-refs: genbank:acc:YP_874016;genbank:gi:118197623;genbank:GeneID:4495919 Probab=99.04 E-value=1.1e-10 Score=75.23 Aligned_cols=219 Identities=10% Similarity=0.026 Sum_probs=113.9 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHhcc---ceeechhHH-------HhhcCCchHHHHHHHHHHHHHhcCchheeeeec Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRRKQQ---AVWKVKGLA-------EMCDDDDAQYAARLRLAQVDDNSGVGRAIGIDA 70 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~~~~---~v~k~~~l~-------~~~~~~~~e~~~~~r~~~~~~~r~~~~~~~iD~ 70 (237) -++.+.+.+.....+....+....+-+. .++++++-. ..+ +.+....++++++......++.+.++++ T Consensus 275 PIeaa~~aI~~alAaek~aar~FskNGa~PsGILsvkg~~~~d~k~~~~L-seEq~erlKe~wee~~sG~NnG~piVLd- 352 (945) T protein:vir:10 275 PIEILYKVILSDIFIDKGNLDYYRKGGSIPEGILAIEPPSYKEGDIYPQL-SREQLESIQRQLQAIMMGDYTQVPILSG- 352 (945) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhCCCccceEEEecCcccccccccccc-CHHHHHHHHHHHHHHhCCcccccceecC- Confidence 2567777777777777777776543222 356654311 111 1222234555554443332333334555 Q ss_pred CCcceeeeecCcCCH--HHHHHHHHHHHhhhhcCceeeeeccCcccccccchhHHHHHHHHHHHHHHHhhhHHHHHHHHH Q lcl|NC_019725. 71 ETEEYDVLNSDISGV--PEFLSSKMDRIVSLSGIHEIIIKNKNVGGVSASQNTALETFYKLVDRKREEDYRPLLEFLLPF 148 (237) Q Consensus 71 ~~e~~~~~~~~lsGl--~dl~~~~~~~iaa~s~iP~t~L~G~sp~GlnatGe~D~~nyyd~I~~~Qe~~l~p~l~~l~~~ 148 (237) ++-+|..++.+.... -+........||++-|||...| |...++=.++-+.....||.. -|.|.+.++-.. T Consensus 353 eGmef~pLs~s~~DaQfLEsrkfs~eeIArAFGVPP~lL-G~~e~st~SNiEqq~~~Fv~~-------tL~Pil~~IEqe 424 (945) T protein:vir:10 353 GKFTWIDFKGKRRDMQFKELAEFVARKICAVYQVSPQDV-GILEGSNKATAEVMASLTKAK-------GLEPLMATISKG 424 (945) T ss_pred CCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHc-ccCCCCCcchHHHHHHHHHHH-------HHHHHHHHHHHH Confidence 467888877665433 3456667789999999998888 433222222334555666643 244444443322 Q ss_pred h----hc---CCCceeEeCCCCCCCHHHHHHHHHHHHHHHHHHHhCCCCCHHHHHHHHHhhcccccc----CCCCCC--C Q lcl|NC_019725. 149 I----VE---EEEWSIEFEPLSVPSKKEESEITKNNVESVTKAITEQIIDLEEARDTLRSIAPEFKL----KDGNNI--N 215 (237) Q Consensus 149 i----~~---s~~~~~~f~pL~~~seke~Aei~~~~A~a~~~~~~~g~i~~~e~r~~l~~~~~~~g~----~~~~~~--~ 215 (237) | .. ..++.|+|+.+.-++.++ +++++..++++|+++++|+|+.+- ..+..|- .+..+. . T Consensus 425 LNrkLl~~~eg~~i~fdFd~ldl~D~ks-------raEal~kli~sGiLTiNEvRe~lG-LpPIeGGD~lli~~nn~~P~ 496 (945) T protein:vir:10 425 FDEVVSEFRNEKDIKLWFKEDDLEKERD-------WWNIIQGQLNTGFRSINEARMEKG-LEPVPWGDVPFSGLRNWKPE 496 (945) T ss_pred HHHhccccccCceeEEEecchhccCHHH-------HHHHHHHHHhCCCcCHHHHHHHhC-CCCCCCcceeeecccccccc Confidence 2 11 246899999998887654 567788899999999999999762 2222210 000000 0 Q ss_pred h-----------hccccC--CCCCCCCCCC--cCcCC Q lcl|NC_019725. 216 I-----------REPEET--TEPEPGLGEK--LEDEN 237 (237) Q Consensus 216 ~-----------~~~e~~--~e~~~~~~~~--~~~e~ 237 (237) + ..+.+. +++.+.+|+. .++++ T Consensus 497 d~~~ka~~ga~p~q~aq~~~dqp~~kGGe~dEns~~p 533 (945) T protein:vir:10 497 DEQAKAQQGAMPPQLAQAMADQPSQQGGGVDENSSVP 533 (945) T ss_pred ccccccccCCCCcccccCCCCCCCCCCCCCCCCCCCC Confidence 0 011111 1111111111 11111 No 22 >protein:vir:8100 Length: 466 # NCBI annotation: gp4 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:152 # MgeName: Che9c # Cross-refs: genbank:acc:NP_817681;genbank:gi:29566112;genbank:GeneID:1259306 Probab=99.03 E-value=8.1e-11 Score=75.90 Aligned_cols=222 Identities=15% Similarity=0.139 Sum_probs=124.4 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHhccc--eeechhHHHhhcCCchHHHHHHHHHHHHHhcCchh-eeeeecCCcceee Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRRKQQA--VWKVKGLAEMCDDDDAQYAARLRLAQVDDNSGVGR-AIGIDAETEEYDV 77 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~~~~~--v~k~~~l~~~~~~~~~e~~~~~r~~~~~~~r~~~~-~~~iD~~~e~~~~ 77 (237) .++.+.+.|.....+......+....... ++++++ .++ ++....+++++.-.-..-.|.+ .++++. +-+|+. T Consensus 223 ~i~~~~~~i~~~~a~~~~~~~~f~ng~~p~gil~~~~---~l~-~e~~~~~~~~~~~~~~g~~n~g~~~vl~~-g~~~~~ 297 (466) T protein:vir:81 223 WLTPILREIRADQAMSKHQAKFFDNGATVNLVIKHNP---MAD-PAAVKKWADEVNSKHAGVDNAWKNLNLYP-GADADV 297 (466) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhcCCCcceEEecCC---CCC-HHHHHHHHHHHHHHhcCccccccceEcCC-CceEEE Confidence 46777788877777777777777664432 456653 122 2223344444433222223333 456654 577888 Q ss_pred eecCcCCH--HHHHHHHHHHHhhhhcCceeeeeccCcccccccc---hhHHHHHHHHHHHHHHHhhhHHHHHHHHHhh-- Q lcl|NC_019725. 78 LNSDISGV--PEFLSSKMDRIVSLSGIHEIIIKNKNVGGVSASQ---NTALETFYKLVDRKREEDYRPLLEFLLPFIV-- 150 (237) Q Consensus 78 ~~~~lsGl--~dl~~~~~~~iaa~s~iP~t~L~G~sp~GlnatG---e~D~~nyyd~I~~~Qe~~l~p~l~~l~~~i~-- 150 (237) ++.+.... -+......+.||.+-|||- .++|.+.++-.+|+ |.-.+.||. ..|+|.+.++-..+- T Consensus 298 l~~~~~d~q~le~~~~~~~~Ia~~fgVPp-~~lG~~~~~~~st~sn~eq~~~~f~~-------~tl~P~~~~ie~~l~~~ 369 (466) T protein:vir:81 298 VGSNLQEIDFKNVRGGGETRIAAAAGVPP-VIVGLSEGLAAATYSNYGQARRRLAD-------GTAHPLWQNLSGCIGHV 369 (466) T ss_pred ccCChhHHHHHHHHHHHHHHHHHHhCCCH-HHcccccCCCccccccHHHHHHHHHH-------HHHHHHHHHHHHHHHhh Confidence 77655332 2455678899999999995 45565544333443 334445553 446777666644432 Q ss_pred --c-C--CCceeEe--CCCCCCCHHHHHHHHHHHHHHHHHHHhCCCCCHHHHHHHHHhhccc-----cccCCCCCCChhc Q lcl|NC_019725. 151 --E-E--EEWSIEF--EPLSVPSKKEESEITKNNVESVTKAITEQIIDLEEARDTLRSIAPE-----FKLKDGNNINIRE 218 (237) Q Consensus 151 --~-s--~~~~~~f--~pL~~~seke~Aei~~~~A~a~~~~~~~g~i~~~e~r~~l~~~~~~-----~g~~~~~~~~~~~ 218 (237) . . ..+.|+| .+|...+.++++++.+++++.+..++++| ++++|+|..+.. ++. .++.+-.++.... T Consensus 370 L~~~~~~~~~~~~f~~~~llr~d~~~r~~~~~~~~~~~~~~~~~g-~t~nE~r~~~~~-gd~~~~~~~~~~~~~~~~~~~ 447 (466) T protein:vir:81 370 MPDMGPDVRLWYDADDVPFLREDEKDAADIQKVRAETINTLITAG-YEPESVVAAVNS-GDLRLLKHTGLTSVQLLPPGV 447 (466) T ss_pred cCCcccCcceEEEecchhhhccCHHHHHHHHHHHHHHHHHHHHcC-CChhhccccccC-CccccccCCCcchhhhccccc Confidence 1 1 2345555 58888999999999999999999999999 599999975431 111 1111111111111 Q ss_pred cccCCCCCCCCCCCcCcCC Q lcl|NC_019725. 219 PEETTEPEPGLGEKLEDEN 237 (237) Q Consensus 219 ~e~~~e~~~~~~~~~~~e~ 237 (237) ......++|..+.+.+..| T Consensus 448 ~~~~~~~~~~~~Gg~~ngn 466 (466) T protein:vir:81 448 SASASSDTPTSGGADDNGN 466 (466) T ss_pred ccccCCCCcccCCCCcCCC Confidence 1111112222222233333 No 23 >protein:vir:1431 Length: 419 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:30 # MgeName: phiE125 # Cross-refs: genbank:acc:NP_536360;genbank:gi:17975165;genbank:GeneID:929165 Probab=99.03 E-value=8.5e-11 Score=75.79 Aligned_cols=220 Identities=11% Similarity=0.026 Sum_probs=118.6 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHhcc--ceeechhHHHhhcCCchHHHHHHHHHHHHHhcCc-hheeeeecCCcceee Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRRKQQ--AVWKVKGLAEMCDDDDAQYAARLRLAQVDDNSGV-GRAIGIDAETEEYDV 77 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~~~~--~v~k~~~l~~~~~~~~~e~~~~~r~~~~~~~r~~-~~~~~iD~~~e~~~~ 77 (237) -++.+.+.|.....+......+...... .++++++......+.+..+.++++++-.-..-+| .+.+++++ +-+|.. T Consensus 177 ~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~nag~~~vl~~-g~~~~~ 255 (419) T protein:vir:14 177 PVLLHANAIGHAQAIQQYAGKSFMNGTALSGVIERPKDAPALKDQASVDRITDGWNAKFGGSGNAKKVALLQE-GMTFRP 255 (419) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEecCCCCcccCHHHHHHHHHHHHHHhcCccccCCceecCC-CceEEE Confidence 5677777788777777777777666333 2566653211111222222344444432222223 33566665 467877 Q ss_pred eecCcCC--HHHHHHHHHHHHhhhhcCceeeeeccCcccccccchhHHHHHHHHHHHHHHHhhhHHHHHHHHHhh----- Q lcl|NC_019725. 78 LNSDISG--VPEFLSSKMDRIVSLSGIHEIIIKNKNVGGVSASQNTALETFYKLVDRKREEDYRPLLEFLLPFIV----- 150 (237) Q Consensus 78 ~~~~lsG--l~dl~~~~~~~iaa~s~iP~t~L~G~sp~GlnatGe~D~~nyyd~I~~~Qe~~l~p~l~~l~~~i~----- 150 (237) ++.+... +-+........||.+-|||-.+|.+..-+-.+ +-|.-.+.||.. .|.|.+.++-..+- T Consensus 256 l~~~~~d~q~~e~~~~~~~~Ia~~fgVpp~~lg~~~~~t~s-~~E~~~~~f~~~-------~L~P~~~~ie~~l~~kll~ 327 (419) T protein:vir:14 256 LSMTNVDAALIDALRLSALDIARIYKIPAHMVNELERATFS-NIEHQSLQFVIY-------TLLPWVKRHEQAKTRDLLL 327 (419) T ss_pred ccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCCCcc-cHHHHHHHHHHH-------HHHHHHHHHHHHHhhhccC Confidence 6654432 23445577789999999998888543333232 234445555554 37888777744432 Q ss_pred cC--CCceeEe--CCCCCCCHHHHHHHHHHHHHHHHHHHhCCCCCHHHHHHHHHhhccccc----cCCCCCCChhccccC Q lcl|NC_019725. 151 EE--EEWSIEF--EPLSVPSKKEESEITKNNVESVTKAITEQIIDLEEARDTLRSIAPEFK----LKDGNNINIREPEET 222 (237) Q Consensus 151 ~s--~~~~~~f--~pL~~~seke~Aei~~~~A~a~~~~~~~g~i~~~e~r~~l~~~~~~~g----~~~~~~~~~~~~e~~ 222 (237) .+ .++.|+| ..|...|.+++ ++++++++++|+++++|+|+.+- ..+..| +.+..-.....+++. T Consensus 328 ~~~~~~~~i~fd~~~l~r~d~~~~-------~~~~~~~~~~G~~T~NE~R~~~g-l~p~~gGD~~~~~~n~~~~~~~~~~ 399 (419) T protein:vir:14 328 PSERKQYFIEYNLAGLLRGDQSSR-------YAAYAVGRQWGWLSINDIRRLEN-MPPVKGGDIYLSPMNMVDASKPQQL 399 (419) T ss_pred ccccCCeEEEEechhhhccCHHHH-------HHHHHHHHhCCCcCHHHHHHHhC-CCCCCCcCeeeeccccccccccccc Confidence 11 2455555 46666666655 66777899999999999998762 223222 111111112222222 Q ss_pred CCCCCCCCCCcCcCC Q lcl|NC_019725. 223 TEPEPGLGEKLEDEN 237 (237) Q Consensus 223 ~e~~~~~~~~~~~e~ 237 (237) +..++.+.....+|+ T Consensus 400 ~~~~~~~~~~~~~e~ 414 (419) T protein:vir:14 400 PVGKSEPTKAAIDEI 414 (419) T ss_pred cCCCCCCccccccch Confidence 222333333334444 No 24 >protein:vir:8418 Length: 409 # NCBI annotation: gp13 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:155 # MgeName: Omega # Cross-refs: genbank:acc:NP_818314;genbank:gi:29566750;genbank:GeneID:1260067 Probab=98.99 E-value=1.8e-10 Score=73.96 Aligned_cols=215 Identities=11% Similarity=0.013 Sum_probs=113.1 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHhcc--ceeechhHHHhhcCCchHHHHHHHHHHHHHhcCchheeeeecCCcceeee Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRRKQQ--AVWKVKGLAEMCDDDDAQYAARLRLAQVDDNSGVGRAIGIDAETEEYDVL 78 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~~~~--~v~k~~~l~~~~~~~~~e~~~~~r~~~~~~~r~~~~~~~iD~~~e~~~~~ 78 (237) -++.+.+.|.....+......+...... .++++++ .+.. +....+++++ ...+.+..+.+++++ +-+|+.+ T Consensus 181 ~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~---~l~~-e~~~~~~~~~--~~~~~n~g~~~vl~~-g~~~~~~ 253 (409) T protein:vir:84 181 PIEKAASAIGLGLAAERYGLRWFRDSANPSGILSSDA---DLTP-DQVKQTQKQW--IQSHHNRRLPAVMSA-GIKWQSV 253 (409) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhcCCCccEEEecCC---CCCH-HHHHHHHHHH--HHHhccCCCeeecCC-CceEEEc Confidence 5677778888777777777776655332 3455543 1212 1122333333 233333344555554 5778887 Q ss_pred ecCcC--CHHHHHHHHHHHHhhhhcCceeeeeccCcccccccc-hhHHHHHHHHHHHHHHHhhhHHHHHHHHHhhc--CC Q lcl|NC_019725. 79 NSDIS--GVPEFLSSKMDRIVSLSGIHEIIIKNKNVGGVSASQ-NTALETFYKLVDRKREEDYRPLLEFLLPFIVE--EE 153 (237) Q Consensus 79 ~~~ls--Gl~dl~~~~~~~iaa~s~iP~t~L~G~sp~GlnatG-e~D~~nyyd~I~~~Qe~~l~p~l~~l~~~i~~--s~ 153 (237) +.+.. .+-+......+.||.+-|||..+|-....+..+++. +....+||.. -|.|.++++-..+-+ .. T Consensus 254 ~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~sn~e~~~~~f~~~-------~l~P~~~~ie~~l~~~L~~ 326 (409) T protein:vir:84 254 SITPNESQFLETRSFQRSEIAMWFRIPPHMIGDVEKSTSWGTGIEEQGINFVRH-------TLLPWLRCIEQALDTFLPR 326 (409) T ss_pred cCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCcccccchHHHHHHHHHHH-------HHHHHHHHHHHHHHHhccC Confidence 76553 233445577889999999998866433333343232 3344455533 366766665444422 23 Q ss_pred C--ceeEeCCCCCCCHHHHHHHHHHHHHHHHHHHhCCCCCHHHHHHHHHhhccccccCC-CCCCChhccccCCCCCCCCC Q lcl|NC_019725. 154 E--WSIEFEPLSVPSKKEESEITKNNVESVTKAITEQIIDLEEARDTLRSIAPEFKLKD-GNNINIREPEETTEPEPGLG 230 (237) Q Consensus 154 ~--~~~~f~pL~~~seke~Aei~~~~A~a~~~~~~~g~i~~~e~r~~l~~~~~~~g~~~-~~~~~~~~~e~~~e~~~~~~ 230 (237) + +.|+++.|...|.+++ ++++.+++++|+++++|+|+.+ ...+..|-.- .........+..+..++..+ T Consensus 327 g~~i~fd~~~l~~~d~~~~-------~~~~~~~~~~G~~t~NE~R~~~-g~~p~~ggD~~~~~~n~~~~~~~~~~~~~~~ 398 (409) T protein:vir:84 327 GQFVKFNVDGLMRGDVTAR-------FTAYQMGLQNGIWSVNEVRAWE-DAPPIPEGDIHLQPMNFVPLGYVPPEEPAQE 398 (409) T ss_pred CCeEEEechhhhccCHHHH-------HHHHHHHHhCCCcCHHHHHHHh-CCCCCCCcceeeecccccccccCCccccCcC Confidence 4 4566677777776664 5678889999999999999976 3333222100 00001111111111111111 Q ss_pred CCcCcC---C Q lcl|NC_019725. 231 EKLEDE---N 237 (237) Q Consensus 231 ~~~~~e---~ 237 (237) ...+++ | T Consensus 399 ~~~~~~~~gn 408 (409) T protein:vir:84 399 PQPNSATEGN 408 (409) T ss_pred CCCCCccCCC Confidence 111111 1 No 25 >protein:vir:1380 Length: 422 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:314 # MgeName: phi3626 # Cross-refs: genbank:acc:NP_612832;genbank:gi:20065966;genbank:GeneID:935782 Probab=98.99 E-value=1e-10 Score=75.35 Aligned_cols=214 Identities=12% Similarity=0.079 Sum_probs=122.7 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHh--ccceeechhHHHhhcCCchHHHHHHHHHHHHHhcCc-hheeeeecCCcceee Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRRK--QQAVWKVKGLAEMCDDDDAQYAARLRLAQVDDNSGV-GRAIGIDAETEEYDV 77 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~~--~~~v~k~~~l~~~~~~~~~e~~~~~r~~~~~~~r~~-~~~~~iD~~~e~~~~ 77 (237) .++.+.+.+.....+......+.... --.++++++ .+. ++....+++++.-......| .+.++++. +-+|+. T Consensus 193 ~~~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~---~l~-~e~~~~~~~~~~~~~~g~~n~~~~~vl~~-g~~~~~ 267 (422) T protein:vir:13 193 PLDYLRCTIENGRATQEFINKFFKNGLSIKGIVQYVG---DLD-EKAKKIFKKEFESMSNGLENAHSISLLPF-GYQFQP 267 (422) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEeCC---CCC-HHHHHHHHHHHHHHhcCccccCCceecCC-Cceeee Confidence 67888888888888888888877763 223455543 222 22334566666544333333 34566654 577888 Q ss_pred eecCcCCHH--HHHHHHHHHHhhhhcCceeeeeccCcccccccchhHHHHHHHHHHHHHHHhhhHHHHHHHHHhh----- Q lcl|NC_019725. 78 LNSDISGVP--EFLSSKMDRIVSLSGIHEIIIKNKNVGGVSASQNTALETFYKLVDRKREEDYRPLLEFLLPFIV----- 150 (237) Q Consensus 78 ~~~~lsGl~--dl~~~~~~~iaa~s~iP~t~L~G~sp~GlnatGe~D~~nyyd~I~~~Qe~~l~p~l~~l~~~i~----- 150 (237) ++.+..... +........||.+-|||...|.+...+..+ +-+.....||.. .|.|.+.++-..+- T Consensus 268 l~~~~~d~q~le~~~~~~~~Ia~~fgVpp~~lg~~~~~~~s-n~e~~~~~f~~~-------~l~P~~~~ie~~l~~~Ll~ 339 (422) T protein:vir:13 268 ISLSMADAQFLENSKLTKRELAATFGMKSYHLNDLERATFN-NLTEQQKDFYVT-------TLQSSLTVYEQEIQDKLFS 339 (422) T ss_pred ccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcc-cHHHHHHHHHHH-------HHHHHHHHHHHHHHHhhCC Confidence 776654332 444567788999999999887766555444 345566666643 47777766644432 Q ss_pred ---cCCCceeEe--CCCCCCCHHHHHHHHHHHHHHHHHHHhCCCCCHHHHHHHHHhhccccccCC-CCCCChhccccCCC Q lcl|NC_019725. 151 ---EEEEWSIEF--EPLSVPSKKEESEITKNNVESVTKAITEQIIDLEEARDTLRSIAPEFKLKD-GNNINIREPEETTE 224 (237) Q Consensus 151 ---~s~~~~~~f--~pL~~~seke~Aei~~~~A~a~~~~~~~g~i~~~e~r~~l~~~~~~~g~~~-~~~~~~~~~e~~~e 224 (237) +..++.|+| ..|...|.+++ +++++.++++|+++++|+|+.+- ..+..|-.- .-....-..+..++ T Consensus 340 ~~~~~~g~~i~fd~~~l~r~d~~~~-------~~~~~~~~~~G~~T~NE~R~~~g-l~p~~ggD~~~~~~n~~~l~~~~~ 411 (422) T protein:vir:13 340 QYETLQDVKAEFNVDTILRSDIKTR-------YEAYRIGIQGGFIEANEARRREN-LPPVEGGDRLLVNGNMIPIEMAGE 411 (422) T ss_pred hhhhcCCceEEeechhhhcCCHHHH-------HHHHHHHHhCCCcCHHHHHHHhC-CCCCCCcCeeeeccCccchhhccc Confidence 123555555 57777776665 55677899999999999999763 333222100 00011111122222 Q ss_pred CCCCCCCCcCcC Q lcl|NC_019725. 225 PEPGLGEKLEDE 236 (237) Q Consensus 225 ~~~~~~~~~~~e 236 (237) ..++.| +..++ T Consensus 412 ~~~~~g-~~~g~ 422 (422) T protein:vir:13 412 QYKKGG-EKGGK 422 (422) T ss_pred ccccCC-CcCCC Confidence 222222 22222 No 26 >protein:vir:6210 Length: 394 # NCBI annotation: Portal protein # Family: family:all:10882 # MgeID: mge:128 # MgeName: phBC6A52 # Cross-refs: genbank:acc:NP_852590;genbank:gi:31415850;genbank:GeneID:1489208 Probab=98.97 E-value=2.1e-10 Score=73.60 Aligned_cols=212 Identities=12% Similarity=0.087 Sum_probs=118.3 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHhccc--eeechhHHHhhcCCc-hHHHHHHHHHHHHHhcCchh-eeeeecCCccee Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRRKQQA--VWKVKGLAEMCDDDD-AQYAARLRLAQVDDNSGVGR-AIGIDAETEEYD 76 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~~~~~--v~k~~~l~~~~~~~~-~e~~~~~r~~~~~~~r~~~~-~~~iD~~~e~~~ 76 (237) .++.+.+.|.....+......+....... ++++++. +...+ ....+++++.-.-...++.+ .+++.. +.+|. T Consensus 163 ~~~~~~~~i~~~~~~~~~~~~~~~ng~~~~~il~~~~~---~~~~~~~~~~~~~~~~~~~~g~~n~g~~~vl~~-g~~~~ 238 (394) T protein:vir:62 163 ILDLGRDTLEGVMSAEKTLTDKYKKGGLLTFLLNLDAH---INPQNGAQSKLINAILDQLESIDEARSVKMIPL-GKGYS 238 (394) T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHccCCcceEEEeCCC---CCcCHHHHHHHHHHHHHHhccccccCceeEeeC-CCcee Confidence 78888888888888888888877764433 5566431 22111 12233443332222223434 345554 46677 Q ss_pred eeecCcCC----HHHHHHHHHHHHhhhhcCceeeeeccCcccccccchhHHHHHHHHHHHHHHHhhhHHHHHHHHHhh-- Q lcl|NC_019725. 77 VLNSDISG----VPEFLSSKMDRIVSLSGIHEIIIKNKNVGGVSASQNTALETFYKLVDRKREEDYRPLLEFLLPFIV-- 150 (237) Q Consensus 77 ~~~~~lsG----l~dl~~~~~~~iaa~s~iP~t~L~G~sp~GlnatGe~D~~nyyd~I~~~Qe~~l~p~l~~l~~~i~-- 150 (237) ....+.+. +-+........||.+-|||-..|-+.+ ++.-+.-.+.||.. .|.|.+.++-..+- T Consensus 239 ~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~----~sn~e~~~~~~~~~-------~l~P~~~~ie~~l~~k 307 (394) T protein:vir:62 239 IDTLKSPLDDEKTLAYLNVYKKDLGKFLGINVDTYTELI----KEDIEKAMMYIHNK-------AVRPIMKNFEDHLSLL 307 (394) T ss_pred EEecCCCcchHHHHHHHHHHHHHHHHHhCCCHHHcCCCC----CcCHHHHHHHHHHH-------HHHHHHHHHHHHHhhh Confidence 65554433 233445667889999999999884322 12223344555543 47887777744432 Q ss_pred ---cC--CCceeEeCCCCCCCHHHHHHHHHHHHHHHHHHHhCCCCCHHHHHHHHHhhccccccCC-----CCCCChhccc Q lcl|NC_019725. 151 ---EE--EEWSIEFEPLSVPSKKEESEITKNNVESVTKAITEQIIDLEEARDTLRSIAPEFKLKD-----GNNINIREPE 220 (237) Q Consensus 151 ---~s--~~~~~~f~pL~~~seke~Aei~~~~A~a~~~~~~~g~i~~~e~r~~l~~~~~~~g~~~-----~~~~~~~~~e 220 (237) .. ..+.|+|+.+.-++..++ ++++.+++++|+++++|+|+.+ ...+..+-.+ ..+...-... T Consensus 308 ll~~~~~~~~~~~fd~~~~~~~~~~-------~~~~~~~~~~g~~T~NE~R~~~-gl~p~~~~~gd~~~~~~n~~~~~~~ 379 (394) T protein:vir:62 308 FYAQNSGKRIKFKINILDFVTYSNK-------TNIGYNLVRTAITSPDNVADML-GFPKQNTKESQAIYISNDVTEIGKK 379 (394) T ss_pred hcCccccCceEEEechhhhcCHHHH-------HHHHHHHHhCCCcCHHHHHHHh-CCCCCCCCCCCeeeccccccccccc Confidence 11 367899999988877654 4567899999999999999976 2232211110 0111111111 Q ss_pred cCCCCCCCCCCCcCcCC Q lcl|NC_019725. 221 ETTEPEPGLGEKLEDEN 237 (237) Q Consensus 221 ~~~e~~~~~~~~~~~e~ 237 (237) ++.+.... .++++|| T Consensus 380 ~~~~~~~k--gge~~en 394 (394) T protein:vir:62 380 EATDGSLG--GGEENEN 394 (394) T ss_pred ccccccCC--CCCCCCC Confidence 12222222 3333555 No 27 >protein:vir:483 Length: 413 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:11 # MgeName: P27 # Cross-refs: genbank:acc:NP_543090;swissprot:trembl:q8w629;genbank:gi:18249902;uniprot:Q8W629;genbank:GeneID:929685 Probab=98.97 E-value=1.9e-10 Score=73.92 Aligned_cols=216 Identities=13% Similarity=0.077 Sum_probs=122.7 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHhcc--ceeechhHHHhhcCCchHHHHHHHHHHHHHhcCch-heeeeecCCcceee Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRRKQQ--AVWKVKGLAEMCDDDDAQYAARLRLAQVDDNSGVG-RAIGIDAETEEYDV 77 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~~~~--~v~k~~~l~~~~~~~~~e~~~~~r~~~~~~~r~~~-~~~~iD~~~e~~~~ 77 (237) .++.+.+.|.....+......+...... .++++++ .+. .+....+++++.-.....+|. +.+++++ +-+|.. T Consensus 180 ~i~~~~~~i~~~~~~~~~~~~~~~ng~~p~gil~~~~---~~~-~e~~~~~~~~~~~~~~g~~n~g~~~vl~~-g~~~~~ 254 (413) T protein:vir:48 180 PIAYAREAISLAAATEEHGARLFGNGAVTSGVLRTEQ---KLT-PDAYERLKKDFEERHTGLGNAHRPMILEM-GLDWKS 254 (413) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCC---CCC-HHHHHHHHHHHHHHhcCccccCcceecCC-CceEEe Confidence 6788888888888888888887776443 5566653 121 222334555554333332343 3455554 578888 Q ss_pred eecCcCCH--HHHHHHHHHHHhhhhcCceeeeeccCcccccccchhHHHHHHHHHHHHHHHhhhHHHHHHHHHhh----c Q lcl|NC_019725. 78 LNSDISGV--PEFLSSKMDRIVSLSGIHEIIIKNKNVGGVSASQNTALETFYKLVDRKREEDYRPLLEFLLPFIV----E 151 (237) Q Consensus 78 ~~~~lsGl--~dl~~~~~~~iaa~s~iP~t~L~G~sp~GlnatGe~D~~nyyd~I~~~Qe~~l~p~l~~l~~~i~----~ 151 (237) ++.+.... -+........||.+-|||-..|-+..-+..+ +-+....+||.. .|.|.++++-..+- . T Consensus 255 l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~-n~e~~~~~f~~~-------~i~P~~~~ie~~l~~~L~~ 326 (413) T protein:vir:48 255 MALNAEDSQFLETRKFQLEEICRLFRVPLHMVQNTDRATFN-NIEELGLGFINY-------SLVPYLTRIEQRINTGLVR 326 (413) T ss_pred ccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCcCCCcc-cHHHHHHHHHHH-------HHHHHHHHHHHHHHhhccC Confidence 77665544 3566677889999999999998654333333 345555666643 57787777744442 1 Q ss_pred CC---C--ceeEeCCCCCCCHHHHHHHHHHHHHHHHHHHhCCCCCHHHHHHHHHhhccccccCC-CCCCChhccccCCC- Q lcl|NC_019725. 152 EE---E--WSIEFEPLSVPSKKEESEITKNNVESVTKAITEQIIDLEEARDTLRSIAPEFKLKD-GNNINIREPEETTE- 224 (237) Q Consensus 152 s~---~--~~~~f~pL~~~seke~Aei~~~~A~a~~~~~~~g~i~~~e~r~~l~~~~~~~g~~~-~~~~~~~~~e~~~e- 224 (237) .. + |.|.+..|...|.+++ ++++++++++|+++++|+|+.+- ..+..|-.- .........+...+ T Consensus 327 ~~~~~~~~~~fd~~~l~~~d~~~~-------~~~~~~~~~~g~~T~NE~R~~~g-~~p~~ggD~~~~~~n~~~~~~~~~~ 398 (413) T protein:vir:48 327 ESKQGKFYAKFNAGALLRGDMKSR-------FEAYATGINWGIYSPNDCRDLED-MNPRPGGDVYLTPMNMTTSPSAGDD 398 (413) T ss_pred ccccCCeEEEEechhhhccCHHHH-------HHHHHHHHhCCCcCHHHHHHHhC-CCCCCCcceeecccccccccccccc Confidence 21 3 4455557766666665 55778899999999999998763 333222100 00001111111111 Q ss_pred CCCCCCCCcCcCC Q lcl|NC_019725. 225 PEPGLGEKLEDEN 237 (237) Q Consensus 225 ~~~~~~~~~~~e~ 237 (237) ..+..+++.++|+ T Consensus 399 ~~~~~~~~~~~~~ 411 (413) T protein:vir:48 399 NGKKKESGDADKT 411 (413) T ss_pred CCCCCCCCCcccc Confidence 1122222333333 No 28 >protein:vir:5737 Length: 419 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:122 # MgeName: PY54 # Cross-refs: genbank:acc:NP_892048;genbank:gi:33770511;goa:Q7Y412;interpro:IPR006427;interpro:IPR006944;uniprot:Q7Y412;genbank:GeneID:1732929;interpro:IPR010994 Probab=98.96 E-value=2.2e-10 Score=73.56 Aligned_cols=220 Identities=12% Similarity=0.031 Sum_probs=115.7 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHhcc--ceeechhHHHhhcCCchHHHHHHHHHHHHHh-cCchheeeeecCCcceee Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRRKQQ--AVWKVKGLAEMCDDDDAQYAARLRLAQVDDN-SGVGRAIGIDAETEEYDV 77 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~~~~--~v~k~~~l~~~~~~~~~e~~~~~r~~~~~~~-r~~~~~~~iD~~~e~~~~ 77 (237) -++.+...+.....+.....++...... .++++++...-..+.+..+.+++++.-.-.. .+..+.++++. +-+|.. T Consensus 178 ~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~e~~~~~~~~~~~~~~g~~nag~~~vl~~-g~~~~~ 256 (419) T protein:vir:57 178 PIQTNPDVLGLGIAVEQHAAQVFARGTTMSGVIERPFEAKAIASQAAVDAILAKWTERYGGVRNAFSVGMLQE-GMTYKQ 256 (419) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHccCCccEEEEecCcCCcccCHHHHHHHHHHHHHHhccccccccceecCC-CceEEE Confidence 4667777777777777777776665332 2455543211111112222344433322111 22234556654 577877 Q ss_pred eecCcCC--HHHHHHHHHHHHhhhhcCceeeeeccCcccccccchhHHHHHHHHHHHHHHHhhhHHHHHHHHHhhc---- Q lcl|NC_019725. 78 LNSDISG--VPEFLSSKMDRIVSLSGIHEIIIKNKNVGGVSASQNTALETFYKLVDRKREEDYRPLLEFLLPFIVE---- 151 (237) Q Consensus 78 ~~~~lsG--l~dl~~~~~~~iaa~s~iP~t~L~G~sp~GlnatGe~D~~nyyd~I~~~Qe~~l~p~l~~l~~~i~~---- 151 (237) ++.+... +-+......+.||.+-|||...|-+..-+.. ++-|.....||.. .|.|.++++-..+-+ T Consensus 257 l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~-sn~e~~~~~f~~~-------~l~P~~~~ie~~l~~~ll~ 328 (419) T protein:vir:57 257 LSQDNEKAQLLQSRQYTVNEVCRLYKVPPHMIQDLQKSTN-NNIEHQGLQYVIY-------TMLAILKRHESAMMRDLLL 328 (419) T ss_pred cCCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCcc-ccHHHHHHHHHHH-------HHHHHHHHHHHHHHhhccC Confidence 7765542 2345566778999999999888854433333 2335555666643 478877777544421 Q ss_pred -C--CCceeEe--CCCCCCCHHHHHHHHHHHHHHHHHHHhCCCCCHHHHHHHHHhhcccccc----CCCCCCChhccccC Q lcl|NC_019725. 152 -E--EEWSIEF--EPLSVPSKKEESEITKNNVESVTKAITEQIIDLEEARDTLRSIAPEFKL----KDGNNINIREPEET 222 (237) Q Consensus 152 -s--~~~~~~f--~pL~~~seke~Aei~~~~A~a~~~~~~~g~i~~~e~r~~l~~~~~~~g~----~~~~~~~~~~~e~~ 222 (237) . .++.|+| ..|...|.+++++ ++++++++|+++++|+|+.+- ..+..|- .+......+..++. T Consensus 329 ~~~~~~~~i~fd~~~ll~~d~~~~~~-------~~~~~~~~G~~T~NE~R~~~g-l~p~~ggD~~~~~~n~~~~~~~~~~ 400 (419) T protein:vir:57 329 PSERRDFYIEFNVSSLLRGDQKSRYE-------SYALGRQWGWLSVNDIRRMEN-LTPIPGGDKYLTPLNMVDSKALTGI 400 (419) T ss_pred ccccCCeEEEEechhhhccCHHHHHH-------HHHHHHhCCCcCHHHHHHHhC-CCCCCCcCeeeeccccccccccccc Confidence 1 3455554 4777778777654 667799999999999999763 2332221 01011111111111 Q ss_pred CCCCCCCCCCcCcCC Q lcl|NC_019725. 223 TEPEPGLGEKLEDEN 237 (237) Q Consensus 223 ~e~~~~~~~~~~~e~ 237 (237) ..+.|..-.+.+.-+ T Consensus 401 ~~~~~~~~~~~~~~~ 415 (419) T protein:vir:57 401 GKATPQQLKDIEAIL 415 (419) T ss_pred cCCCcccCcchhhhh Confidence 112222222222222 No 29 >protein:vir:105002 Length: 432 # NCBI annotation: putative phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1490 # MgeName: W Beta # Cross-refs: genbank:acc:YP_459967;genbank:gi:85701382;genbank:GeneID:3882143 Probab=98.95 E-value=1.5e-10 Score=74.50 Aligned_cols=213 Identities=14% Similarity=0.181 Sum_probs=122.5 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHhcc--ceeechhHHHhhcCCchHHHHHHHHHHHHHh-cCchheeeeecCCcceee Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRRKQQ--AVWKVKGLAEMCDDDDAQYAARLRLAQVDDN-SGVGRAIGIDAETEEYDV 77 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~~~~--~v~k~~~l~~~~~~~~~e~~~~~r~~~~~~~-r~~~~~~~iD~~~e~~~~ 77 (237) .++.+.+.|.....+......+...... .++++++ .+.. +...+++++++..... ++..++++++. +-+|+. T Consensus 194 ~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~gil~~~~---~l~~-e~~~~~~~~~~~~~~g~~n~~~~~vl~~-g~~~~~ 268 (432) T protein:vir:10 194 TMEYLKSTLENSASADKFINNFYKQGLQVKGLVQYVG---DLNE-DAKKVFRENFESMSSGLQNSHRIALMPV-GYQFQP 268 (432) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEcCC---CCCH-HHHHHHHHHHHHHhcccccCCcceecCC-CceEEE Confidence 6788888888888888888887766432 3566653 2222 2234556665543332 23345566665 577888 Q ss_pred eecCcCCHH--HHHHHHHHHHhhhhcCceeeeeccCcccccccchhHHHHHHHHHHHHHHHhhhHHHHHHHHHhh----- Q lcl|NC_019725. 78 LNSDISGVP--EFLSSKMDRIVSLSGIHEIIIKNKNVGGVSASQNTALETFYKLVDRKREEDYRPLLEFLLPFIV----- 150 (237) Q Consensus 78 ~~~~lsGl~--dl~~~~~~~iaa~s~iP~t~L~G~sp~GlnatGe~D~~nyyd~I~~~Qe~~l~p~l~~l~~~i~----- 150 (237) ++.+..... +......+.||.+-|||...|-+...+..+ +-+.....|| +..|+|.+.++-..+- T Consensus 269 l~~~~~d~q~~e~~~~~~~~Ia~~fgVP~~~lg~~~~~~~s-~~e~~~~~~~-------~~~l~P~~~~ie~~ln~kLl~ 340 (432) T protein:vir:10 269 ISLNMSDAQFLENTELTIRQIATAFGIKMHQLNDLSKATLN-NIEQQQQQFY-------TDTLQATLTMYEQEMTYKLFL 340 (432) T ss_pred ccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcc-cHHHHHHHHH-------HHHHHHHHHHHHHHHHHhhcC Confidence 776654433 456677899999999999988544434433 3344555555 4568888777755442 Q ss_pred ---cCCCceeE--eCCCCCCCHHHHHHHHHHHHHHHHHHHhCCCCCHHHHHHHHHhhccccccC----CCCCCChhcccc Q lcl|NC_019725. 151 ---EEEEWSIE--FEPLSVPSKKEESEITKNNVESVTKAITEQIIDLEEARDTLRSIAPEFKLK----DGNNINIREPEE 221 (237) Q Consensus 151 ---~s~~~~~~--f~pL~~~seke~Aei~~~~A~a~~~~~~~g~i~~~e~r~~l~~~~~~~g~~----~~~~~~~~~~e~ 221 (237) +..++.|+ +..|...|.++++ ++++.++++|+++++|+|+.+- ..+..|.. +......+ . T Consensus 341 ~~~~~~g~~~~fd~~~l~~~d~~~~~-------~~~~~~~~~G~~t~NE~R~~~g-~~pi~ggD~~~~~~n~~~~~---~ 409 (432) T protein:vir:10 341 DSELDKGFYSKFNVDAILRADIKTRY-------EAYRTGIQGGFLKPNEARSKED-LPPEAGGDRLLVNGNMLPID---M 409 (432) T ss_pred hhhcCCCcEEEeechhhhcCCHHHHH-------HHHHHHHhCCCcCHHHHHHHhC-CCCCCCCCeEeecccccchh---h Confidence 12345555 5578888888765 5678899999999999999762 23322210 11111111 1 Q ss_pred CCCCCCCCCC------CcCcCC Q lcl|NC_019725. 222 TTEPEPGLGE------KLEDEN 237 (237) Q Consensus 222 ~~e~~~~~~~------~~~~e~ 237 (237) ..+.....|+ ..-+|| T Consensus 410 ~~~~~~k~~~~~~~~~~~~~~~ 431 (432) T protein:vir:10 410 AGQAYLKGGDTNGEVSKEGNEG 431 (432) T ss_pred ccccccCCCCCCCCCCCCCCCC Confidence 1111111111 111122 No 30 >protein:vir:107605 Length: 432 # NCBI annotation: phage portal protein, HK97 family # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1491 # MgeName: Gamma # Cross-refs: genbank:acc:YP_338186;genbank:gi:77020175;genbank:GeneID:3703736 Probab=98.95 E-value=1.5e-10 Score=74.50 Aligned_cols=213 Identities=14% Similarity=0.181 Sum_probs=122.5 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHhcc--ceeechhHHHhhcCCchHHHHHHHHHHHHHh-cCchheeeeecCCcceee Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRRKQQ--AVWKVKGLAEMCDDDDAQYAARLRLAQVDDN-SGVGRAIGIDAETEEYDV 77 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~~~~--~v~k~~~l~~~~~~~~~e~~~~~r~~~~~~~-r~~~~~~~iD~~~e~~~~ 77 (237) .++.+.+.|.....+......+...... .++++++ .+.. +...+++++++..... ++..++++++. +-+|+. T Consensus 194 ~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~gil~~~~---~l~~-e~~~~~~~~~~~~~~g~~n~~~~~vl~~-g~~~~~ 268 (432) T protein:vir:10 194 TMEYLKSTLENSASADKFINNFYKQGLQVKGLVQYVG---DLNE-DAKKVFRENFESMSSGLQNSHRIALMPV-GYQFQP 268 (432) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEcCC---CCCH-HHHHHHHHHHHHHhcccccCCcceecCC-CceEEE Confidence 6788888888888888888887766432 3566653 2222 2234556665543332 23345566665 577888 Q ss_pred eecCcCCHH--HHHHHHHHHHhhhhcCceeeeeccCcccccccchhHHHHHHHHHHHHHHHhhhHHHHHHHHHhh----- Q lcl|NC_019725. 78 LNSDISGVP--EFLSSKMDRIVSLSGIHEIIIKNKNVGGVSASQNTALETFYKLVDRKREEDYRPLLEFLLPFIV----- 150 (237) Q Consensus 78 ~~~~lsGl~--dl~~~~~~~iaa~s~iP~t~L~G~sp~GlnatGe~D~~nyyd~I~~~Qe~~l~p~l~~l~~~i~----- 150 (237) ++.+..... +......+.||.+-|||...|-+...+..+ +-+.....|| +..|+|.+.++-..+- T Consensus 269 l~~~~~d~q~~e~~~~~~~~Ia~~fgVP~~~lg~~~~~~~s-~~e~~~~~~~-------~~~l~P~~~~ie~~ln~kLl~ 340 (432) T protein:vir:10 269 ISLNMSDAQFLENTELTIRQIATAFGIKMHQLNDLSKATLN-NIEQQQQQFY-------TDTLQATLTMYEQEMTYKLFL 340 (432) T ss_pred ccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcc-cHHHHHHHHH-------HHHHHHHHHHHHHHHHHhhcC Confidence 776654433 456677899999999999988544434433 3344555555 4568888777755442 Q ss_pred ---cCCCceeE--eCCCCCCCHHHHHHHHHHHHHHHHHHHhCCCCCHHHHHHHHHhhccccccC----CCCCCChhcccc Q lcl|NC_019725. 151 ---EEEEWSIE--FEPLSVPSKKEESEITKNNVESVTKAITEQIIDLEEARDTLRSIAPEFKLK----DGNNINIREPEE 221 (237) Q Consensus 151 ---~s~~~~~~--f~pL~~~seke~Aei~~~~A~a~~~~~~~g~i~~~e~r~~l~~~~~~~g~~----~~~~~~~~~~e~ 221 (237) +..++.|+ +..|...|.++++ ++++.++++|+++++|+|+.+- ..+..|.. +......+ . T Consensus 341 ~~~~~~g~~~~fd~~~l~~~d~~~~~-------~~~~~~~~~G~~t~NE~R~~~g-~~pi~ggD~~~~~~n~~~~~---~ 409 (432) T protein:vir:10 341 DSELDKGFYSKFNVDAILRADIKTRY-------EAYRTGIQGGFLKPNEARSKED-LPPEAGGDRLLVNGNMLPID---M 409 (432) T ss_pred hhhcCCCcEEEeechhhhcCCHHHHH-------HHHHHHHhCCCcCHHHHHHHhC-CCCCCCCCeEeecccccchh---h Confidence 12345555 5578888888765 5678899999999999999762 23322210 11111111 1 Q ss_pred CCCCCCCCCC------CcCcCC Q lcl|NC_019725. 222 TTEPEPGLGE------KLEDEN 237 (237) Q Consensus 222 ~~e~~~~~~~------~~~~e~ 237 (237) ..+.....|+ ..-+|| T Consensus 410 ~~~~~~k~~~~~~~~~~~~~~~ 431 (432) T protein:vir:10 410 AGQAYLKGGDTNGEVSKEGNEG 431 (432) T ss_pred ccccccCCCCCCCCCCCCCCCC Confidence 1111111111 111122 No 31 >protein:vir:102855 Length: 432 # NCBI annotation: phage portal protein, HK97 family # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1492 # MgeName: Cherry # Cross-refs: genbank:acc:YP_338135;genbank:gi:77020228;genbank:GeneID:3703764 Probab=98.95 E-value=1.5e-10 Score=74.50 Aligned_cols=213 Identities=14% Similarity=0.181 Sum_probs=122.5 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHhcc--ceeechhHHHhhcCCchHHHHHHHHHHHHHh-cCchheeeeecCCcceee Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRRKQQ--AVWKVKGLAEMCDDDDAQYAARLRLAQVDDN-SGVGRAIGIDAETEEYDV 77 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~~~~--~v~k~~~l~~~~~~~~~e~~~~~r~~~~~~~-r~~~~~~~iD~~~e~~~~ 77 (237) .++.+.+.|.....+......+...... .++++++ .+.. +...+++++++..... ++..++++++. +-+|+. T Consensus 194 ~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~gil~~~~---~l~~-e~~~~~~~~~~~~~~g~~n~~~~~vl~~-g~~~~~ 268 (432) T protein:vir:10 194 TMEYLKSTLENSASADKFINNFYKQGLQVKGLVQYVG---DLNE-DAKKVFRENFESMSSGLQNSHRIALMPV-GYQFQP 268 (432) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEcCC---CCCH-HHHHHHHHHHHHHhcccccCCcceecCC-CceEEE Confidence 6788888888888888888887766432 3566653 2222 2234556665543332 23345566665 577888 Q ss_pred eecCcCCHH--HHHHHHHHHHhhhhcCceeeeeccCcccccccchhHHHHHHHHHHHHHHHhhhHHHHHHHHHhh----- Q lcl|NC_019725. 78 LNSDISGVP--EFLSSKMDRIVSLSGIHEIIIKNKNVGGVSASQNTALETFYKLVDRKREEDYRPLLEFLLPFIV----- 150 (237) Q Consensus 78 ~~~~lsGl~--dl~~~~~~~iaa~s~iP~t~L~G~sp~GlnatGe~D~~nyyd~I~~~Qe~~l~p~l~~l~~~i~----- 150 (237) ++.+..... +......+.||.+-|||...|-+...+..+ +-+.....|| +..|+|.+.++-..+- T Consensus 269 l~~~~~d~q~~e~~~~~~~~Ia~~fgVP~~~lg~~~~~~~s-~~e~~~~~~~-------~~~l~P~~~~ie~~ln~kLl~ 340 (432) T protein:vir:10 269 ISLNMSDAQFLENTELTIRQIATAFGIKMHQLNDLSKATLN-NIEQQQQQFY-------TDTLQATLTMYEQEMTYKLFL 340 (432) T ss_pred ccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcc-cHHHHHHHHH-------HHHHHHHHHHHHHHHHHhhcC Confidence 776654433 456677899999999999988544434433 3344555555 4568888777755442 Q ss_pred ---cCCCceeE--eCCCCCCCHHHHHHHHHHHHHHHHHHHhCCCCCHHHHHHHHHhhccccccC----CCCCCChhcccc Q lcl|NC_019725. 151 ---EEEEWSIE--FEPLSVPSKKEESEITKNNVESVTKAITEQIIDLEEARDTLRSIAPEFKLK----DGNNINIREPEE 221 (237) Q Consensus 151 ---~s~~~~~~--f~pL~~~seke~Aei~~~~A~a~~~~~~~g~i~~~e~r~~l~~~~~~~g~~----~~~~~~~~~~e~ 221 (237) +..++.|+ +..|...|.++++ ++++.++++|+++++|+|+.+- ..+..|.. +......+ . T Consensus 341 ~~~~~~g~~~~fd~~~l~~~d~~~~~-------~~~~~~~~~G~~t~NE~R~~~g-~~pi~ggD~~~~~~n~~~~~---~ 409 (432) T protein:vir:10 341 DSELDKGFYSKFNVDAILRADIKTRY-------EAYRTGIQGGFLKPNEARSKED-LPPEAGGDRLLVNGNMLPID---M 409 (432) T ss_pred hhhcCCCcEEEeechhhhcCCHHHHH-------HHHHHHHhCCCcCHHHHHHHhC-CCCCCCCCeEeecccccchh---h Confidence 12345555 5578888888765 5678899999999999999762 23322210 11111111 1 Q ss_pred CCCCCCCCCC------CcCcCC Q lcl|NC_019725. 222 TTEPEPGLGE------KLEDEN 237 (237) Q Consensus 222 ~~e~~~~~~~------~~~~e~ 237 (237) ..+.....|+ ..-+|| T Consensus 410 ~~~~~~k~~~~~~~~~~~~~~~ 431 (432) T protein:vir:10 410 AGQAYLKGGDTNGEVSKEGNEG 431 (432) T ss_pred ccccccCCCCCCCCCCCCCCCC Confidence 1111111111 111122 No 32 >protein:vir:3843 Length: 397 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:322 # MgeName: phi adh # Cross-refs: genbank:acc:NP_050149;swissprot:trembl:q9t1f8;genbank:gi:9633041;uniprot:Q9T1F8;genbank:GeneID:1262206 Probab=98.92 E-value=3.9e-10 Score=72.20 Aligned_cols=210 Identities=14% Similarity=0.098 Sum_probs=117.0 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHhcc--ceeechhHHHhhcCCchHHHHHHHHHHHHHhcCchheeeeecCCcceeee Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRRKQQ--AVWKVKGLAEMCDDDDAQYAARLRLAQVDDNSGVGRAIGIDAETEEYDVL 78 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~~~~--~v~k~~~l~~~~~~~~~e~~~~~r~~~~~~~r~~~~~~~iD~~~e~~~~~ 78 (237) .++.+...|.....+......+...... .++++++ .+ ..+....++++++......+..+.++++. +-+|..+ T Consensus 171 ~i~~~~~~i~~~~~~~~~~~~~f~ng~~~~~il~~~~---~~-~~e~~~~~~~~~~~~~~~~n~~~~~vl~~-g~~~~~l 245 (397) T protein:vir:38 171 PLSALINEQQIKDASNELTLKALKQSVTASAVLTIQK---GG-LLDAETRIARSKEISKQIHNSDGPVVIDA-LEDYKPL 245 (397) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEeCC---CC-CHHHHHHHHHHHHHHhcccccCCceecCC-CceEEec Confidence 5677778888777777777776665432 4566553 12 22334556666766655555566677765 5788887 Q ss_pred ecCcC--CHHHHHHHHHHHHhhhhcCceeeeeccCcccccccchhHHHHHHHHHHHHHHHhhhHHHHHHHHHhhcC--CC Q lcl|NC_019725. 79 NSDIS--GVPEFLSSKMDRIVSLSGIHEIIIKNKNVGGVSASQNTALETFYKLVDRKREEDYRPLLEFLLPFIVEE--EE 154 (237) Q Consensus 79 ~~~ls--Gl~dl~~~~~~~iaa~s~iP~t~L~G~sp~GlnatGe~D~~nyyd~I~~~Qe~~l~p~l~~l~~~i~~s--~~ 154 (237) +.+.. .+.+........||++-|||...|-|...+. ++- ...+.||. ..|.|.+.++-..+-+. .+ T Consensus 246 ~~~~~d~~~~e~~~~~~~~Ia~afgVp~~~lg~~~~~~--~~~-e~~~~~~~-------~~l~P~~~~ie~~ln~~l~~~ 315 (397) T protein:vir:38 246 EVKGNIASLLNQVDWTRDQIAKVYGVPDSYLNGQGDQQ--SSI-TQISGQYA-------KSLNRYVQAIVGELNDKLHAN 315 (397) T ss_pred CCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCcc--cHH-HHHHHHHH-------HHHHHHHHHHHHHHHHhccCh Confidence 76544 3345678889999999999999986644221 121 23455552 35778777765543211 22 Q ss_pred ceeEeCCCCCCCHHHHHHHHHHHHHHHHHHHhCCCCCHHHHHHHHHhhccccccCCCCCCChhc----c----c-cCCC- Q lcl|NC_019725. 155 WSIEFEPLSVPSKKEESEITKNNVESVTKAITEQIIDLEEARDTLRSIAPEFKLKDGNNINIRE----P----E-ETTE- 224 (237) Q Consensus 155 ~~~~f~pL~~~seke~Aei~~~~A~a~~~~~~~g~i~~~e~r~~l~~~~~~~g~~~~~~~~~~~----~----e-~~~e- 224 (237) +.+.+.-+...+.+ .++++++++++.|+++++|+|+.+- ..+..+ +.....+. . . ...+ T Consensus 316 ~~~~~~~~~~~d~~-------~~~~~~~~~~~~G~~t~nE~R~~lg-~~p~~~---~d~~~~~~~~~~~~~~~~~~~g~~ 384 (397) T protein:vir:38 316 ISANIRFAIDAMGD-------QYASTISSSVKGGTIAGNQARFILQ-NSGYLA---KDLPDPEKEPQQAIQLIQQEGGEN 384 (397) T ss_pred hcccccccccCCHH-------HHHHHHHHHHhCCCcCHHHHHHHhC-CCCCCC---CccccccccccccccccccccCCC Confidence 23333334555544 4566788899999999999999873 222111 11110000 0 0 0000 Q ss_pred -CCCCCCCCcCcC Q lcl|NC_019725. 225 -PEPGLGEKLEDE 236 (237) Q Consensus 225 -~~~~~~~~~~~e 236 (237) ..+....+.++| T Consensus 385 ~~~~~~e~~~~~~ 397 (397) T protein:vir:38 385 DGNNSDERGSDPE 397 (397) T ss_pred CCCCCCCCCCCCC Confidence 001111111122 No 33 >protein:vir:97060 Length: 432 # NCBI annotation: putative head portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1653 # MgeName: OP1 # Cross-refs: genbank:acc:YP_453563;genbank:gi:84662598;genbank:GeneID:5142475 Probab=98.92 E-value=7.6e-10 Score=70.59 Aligned_cols=212 Identities=12% Similarity=0.128 Sum_probs=124.5 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHhcc--ceeechhHHHhhcCCchHHHHHHHHHHHHHhcCchheeeeecCCcceeee Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRRKQQ--AVWKVKGLAEMCDDDDAQYAARLRLAQVDDNSGVGRAIGIDAETEEYDVL 78 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~~~~--~v~k~~~l~~~~~~~~~e~~~~~r~~~~~~~r~~~~~~~iD~~~e~~~~~ 78 (237) .++.+.+.|.....+......+...... .++++++. +. ++....+.+++. ...+..+++++++ +-+|+.+ T Consensus 195 pi~~~~~~i~~~~a~~~~~~~~f~ng~~~~gil~~~~~---l~-~e~~~~~~~~~~---~~~nag~~~vl~~-g~~~~~l 266 (432) T protein:vir:97 195 AIRYGAQIFGTAIAAEAQAARAFRNGQLQSVYYQIDRF---LT-DDQYDSFSKKVS---GSVEAGRAPLLEG-GMDVKSL 266 (432) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhccCCcceeEecCCC---CC-HHHHHHHHHHHh---hhhcCCCceecCC-CceEEEc Confidence 6677778888888888888887665433 37777642 22 222233444332 2334456677765 5788888 Q ss_pred ecCcCCH--HHHHHHHHHHHhhhhcCceeeeeccCcccccccchhHHHHHHHHHHHHHH----HhhhHHHHHHHHHhh-- Q lcl|NC_019725. 79 NSDISGV--PEFLSSKMDRIVSLSGIHEIIIKNKNVGGVSASQNTALETFYKLVDRKRE----EDYRPLLEFLLPFIV-- 150 (237) Q Consensus 79 ~~~lsGl--~dl~~~~~~~iaa~s~iP~t~L~G~sp~GlnatGe~D~~nyyd~I~~~Qe----~~l~p~l~~l~~~i~-- 150 (237) +.+.... -+........||.+-|||-..| |....|=.++| ..++..+. .-|.|.++++-..+- T Consensus 267 ~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~l-g~~~~~t~~~~--------s~~e~~~~~f~~~tl~P~~~~ie~~ln~k 337 (432) T protein:vir:97 267 GLNPVDAQLLQSRQYSVESICRFFGVPPSMI-GHSSAGTTSWG--------SGIESQQLGFLTMTLSPWLRRIEQSIALN 337 (432) T ss_pred cCChhHHHHHHHHHHHHHHHHHHhCCCHHHc-CCcCCcccccc--------hhHHHHHHHHHHHHHHHHHHHHHHHHhhh Confidence 7765433 3456777889999999998776 54433322222 22332222 246676666544332 Q ss_pred ---cC--CC--ceeEeCCCCCCCHHHHHHHHHHHHHHHHHHHhCCCCCHHHHHHHHHhhccccccC-----CCCCCChhc Q lcl|NC_019725. 151 ---EE--EE--WSIEFEPLSVPSKKEESEITKNNVESVTKAITEQIIDLEEARDTLRSIAPEFKLK-----DGNNINIRE 218 (237) Q Consensus 151 ---~s--~~--~~~~f~pL~~~seke~Aei~~~~A~a~~~~~~~g~i~~~e~r~~l~~~~~~~g~~-----~~~~~~~~~ 218 (237) .. .. +.|.+..|...|.+++ ++++.+++++|+++++|+|+.+ ...+..|-. ...-...+. T Consensus 338 Ll~~~e~~~~~~~fd~~~llr~d~~~r-------~~~~~~~~~~G~~T~NE~R~~~-glpp~~g~~~~~~~~~~~~pl~~ 409 (432) T protein:vir:97 338 LLTPAERRRYFADFDTSALLRADSAAR-------SSYYSQLVNNGLMTRDEAREIE-GLPKLGGNAAVLTVQSAMVPLDS 409 (432) T ss_pred ccCccccCceEEEeechhhhccCHHHH-------HHHHHHHHhCCCCCHHHHHHHh-CCCCCCCCcceEeecccccchhh Confidence 11 13 4455567888887776 5577889999999999999876 333322210 111112233 Q ss_pred cccCCCCCCCCCCCcCcCC Q lcl|NC_019725. 219 PEETTEPEPGLGEKLEDEN 237 (237) Q Consensus 219 ~e~~~e~~~~~~~~~~~e~ 237 (237) ..+...++|+.+++.+++| T Consensus 410 ~~~~~~~~~~~~~~~~~~~ 428 (432) T protein:vir:97 410 IGLQASPEPASGLGNQQQD 428 (432) T ss_pred hcccCCCCCCCCCCCcccc Confidence 3344556667777777776 No 34 >protein:vir:81072 Length: 432 # NCBI annotation: p07 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1889 # MgeName: Xop411 # Cross-refs: genbank:acc:YP_001285677;genbank:gi:148727185;genbank:GeneID:5247117 Probab=98.92 E-value=8.9e-10 Score=70.21 Aligned_cols=213 Identities=12% Similarity=0.131 Sum_probs=125.3 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHhcc--ceeechhHHHhhcCCchHHHHHHHHHHHHHhcCchheeeeecCCcceeee Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRRKQQ--AVWKVKGLAEMCDDDDAQYAARLRLAQVDDNSGVGRAIGIDAETEEYDVL 78 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~~~~--~v~k~~~l~~~~~~~~~e~~~~~r~~~~~~~r~~~~~~~iD~~~e~~~~~ 78 (237) .++.+.+.|.....+......+...... .++++++ .++. +....+++++ ....+..+.+++++ +.+|+.+ T Consensus 195 pi~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~---~l~~-e~~~~~~~~~---~~~~nag~~~vl~~-g~~~~~l 266 (432) T protein:vir:81 195 AIRYGAQIFGTAIAAEAQAARAFRNGQLQSVYYQIDR---FLTD-DQYDSFAKKV---SGSVEAGRAPLLEG-GMDVKSL 266 (432) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhcCCCcceEEecCC---CCCH-HHHHHHHHHH---hhhhcCCCceecCC-CceEEEc Confidence 6677888888888888777776654333 4666653 2221 2222334333 33334456777775 5788888 Q ss_pred ecCcCC--HHHHHHHHHHHHhhhhcCceeeeeccCcccccccch---hHHHHHHHHHHHHHHHhhhHHHHHHHHHhh--- Q lcl|NC_019725. 79 NSDISG--VPEFLSSKMDRIVSLSGIHEIIIKNKNVGGVSASQN---TALETFYKLVDRKREEDYRPLLEFLLPFIV--- 150 (237) Q Consensus 79 ~~~lsG--l~dl~~~~~~~iaa~s~iP~t~L~G~sp~GlnatGe---~D~~nyyd~I~~~Qe~~l~p~l~~l~~~i~--- 150 (237) +.+... +-+........||.+-|||-..| |...+|=+++|. .-.+.||. .-|.|.+.++-.-+- T Consensus 267 ~~~~~d~q~le~~~~~~~~Ia~~fgVPp~~l-g~~~~~~~~~~sn~eq~~~~f~~-------~tl~P~~~~ie~~l~~kL 338 (432) T protein:vir:81 267 GLNPVDAQLLQSRQYSVESICRFFGVPPSMI-GHSSAGTTSWGSGIESQQLGFLT-------MTLSPWLRRIEQSIALNL 338 (432) T ss_pred cCCHHHHHHHHHHHHHHHHHHHHhCCCHHHc-CCcCCccccccchHHHHHHHHHH-------HHHHHHHHHHHHHHHhhc Confidence 766543 33556678899999999998766 655544333332 23344553 346777666533332 Q ss_pred -cC---CCcee--EeCCCCCCCHHHHHHHHHHHHHHHHHHHhCCCCCHHHHHHHHHhhcccccc-----CCCCCCChhcc Q lcl|NC_019725. 151 -EE---EEWSI--EFEPLSVPSKKEESEITKNNVESVTKAITEQIIDLEEARDTLRSIAPEFKL-----KDGNNINIREP 219 (237) Q Consensus 151 -~s---~~~~~--~f~pL~~~seke~Aei~~~~A~a~~~~~~~g~i~~~e~r~~l~~~~~~~g~-----~~~~~~~~~~~ 219 (237) .. ..+.| .+..|...|.+++ ++++.+++++|+++++|+|+.+- ..+..|- -...-+..+.. T Consensus 339 l~~~~~~~~~~~fd~~~llr~d~~~r-------~~~~~~~~~~G~~t~NE~R~~~g-lpp~~g~~~~~~~~~~~~pl~~~ 410 (432) T protein:vir:81 339 LSPAERRRYFADFDTSALLRADSAAR-------SSYYSQLVNNGLMTRDEAREIEG-LPKLGGNAAVLTVQSAMVPLDSI 410 (432) T ss_pred cCccccCceEEEeechhhhccCHHHH-------HHHHHHHHhCCCCCHHHHHHHhC-CCCCCCCcceEeecCcccchhhh Confidence 11 23344 4557777777665 56778889999999999999863 2222221 01111122333 Q ss_pred ccCCCCCCCCCCCcCcCC Q lcl|NC_019725. 220 EETTEPEPGLGEKLEDEN 237 (237) Q Consensus 220 e~~~e~~~~~~~~~~~e~ 237 (237) .+.+.++|..+.+.+++| T Consensus 411 ~~~~~~~~~~~~~n~~~~ 428 (432) T protein:vir:81 411 GLQASPEPASGLGNQQQD 428 (432) T ss_pred ccCCCCCCCCCCCCcccc Confidence 444556666666666666 No 35 >protein:vir:9359 Length: 348 # NCBI annotation: head portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:166 # MgeName: phi 12 # Cross-refs: genbank:acc:NP_803337;genbank:gi:29028648;genbank:GeneID:1258089 Probab=98.91 E-value=3e-10 Score=72.79 Aligned_cols=213 Identities=9% Similarity=0.075 Sum_probs=111.0 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHhccceeechhHHHhhcCCchHHHHHHHHHHHHHhcCchheeeeecCCcceeeeec Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRRKQQAVWKVKGLAEMCDDDDAQYAARLRLAQVDDNSGVGRAIGIDAETEEYDVLNS 80 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~~~~~v~k~~~l~~~~~~~~~e~~~~~r~~~~~~~r~~~~~~~iD~~~e~~~~~~~ 80 (237) .++.+.+.+.....+......-......-++++++ .+ +++....+++++. ..+.++.+.++++. +-+|+.++. T Consensus 120 ~~~~~~~~i~~~~~~~~~~~~~~~~~~~~i~~~~~---~l-~~e~~~~~~~~~~--~~~~n~~~~~vl~~-g~~~~~l~~ 192 (348) T protein:vir:93 120 PIDVLKNTTDFDNAVRTFNLTEMQKPDSFMLKYGS---NV-STEKRQQVLEDFK--QYYEENGGILFQEP-GVEIEPLPK 192 (348) T ss_pred HHHHHHHHHHHHHHHHHHHHHhcCCCceeEEecCC---CC-CHHHHHHHHHHHH--HHhhcCCCeeecCC-CceEEEcCC Confidence 34444444443333332221111111112222221 11 1222334555544 23444455556654 577888776 Q ss_pred CcC--CHHHHHHHHHHHHhhhhcCceeeeeccCcccccccchhHHHHHHHHHHHHHHHhhhHHHHHHHHHhhc----C-- Q lcl|NC_019725. 81 DIS--GVPEFLSSKMDRIVSLSGIHEIIIKNKNVGGVSASQNTALETFYKLVDRKREEDYRPLLEFLLPFIVE----E-- 152 (237) Q Consensus 81 ~ls--Gl~dl~~~~~~~iaa~s~iP~t~L~G~sp~GlnatGe~D~~nyyd~I~~~Qe~~l~p~l~~l~~~i~~----s-- 152 (237) +.. .+.+........||.+-|||-.+|-+...+.. ++.+.-.++||..+ |.|.++++-..+-+ . T Consensus 193 ~~~d~q~~e~~~~~~~~Ia~~fgVP~~~lg~~~~~~~-~~~e~~~~~~~~~~-------l~P~~~~ie~~l~~~l~~~~~ 264 (348) T protein:vir:93 193 KYVSEDIVASENLTRERVANVFQLPSIFLNARSNTNF-AKNEELNRFYLQHT-------LLPIVKQYEEEFNRKLLTKTD 264 (348) T ss_pred ChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCc-ccHHHHHHHHHHHH-------HHHHHHHHHHHHHHhhCCccc Confidence 554 23345556788999999999888855443333 34455566676654 88888877555432 1 Q ss_pred --CC--ceeEeCCCCCCCHHHHHHHHHHHHHHHHHHHhCCCCCHHHHHHHHHhhcccccc----CCCCCCChhccccCCC Q lcl|NC_019725. 153 --EE--WSIEFEPLSVPSKKEESEITKNNVESVTKAITEQIIDLEEARDTLRSIAPEFKL----KDGNNINIREPEETTE 224 (237) Q Consensus 153 --~~--~~~~f~pL~~~seke~Aei~~~~A~a~~~~~~~g~i~~~e~r~~l~~~~~~~g~----~~~~~~~~~~~e~~~e 224 (237) .+ |.|.+..|...|.+++| +++.+++++|+++++|+|+.+ ...+..|- -+..-...+.++ ... T Consensus 265 ~~~g~~i~fd~~~l~~~d~~~~a-------~~~~~~~~~G~~T~NE~R~~~-g~~p~~ggD~~~~~~n~~~~~~~~-~~~ 335 (348) T protein:vir:93 265 REKNRYFKFNVKSYLRADSATQA-------EVYFKAVRSGYYTINDIREWE-DLPPVEGGDKPLISGDLYPIDTPL-ELR 335 (348) T ss_pred ccCcceEEeechhhhccCHHHHH-------HHHHHHHhCCCCCHHHHHHHh-CCCCCCCcCeEeecccccccccch-hhc Confidence 23 44556688777777764 567888999999999999987 22332220 000101111111 111 Q ss_pred CCCCCCCCcCcCC Q lcl|NC_019725. 225 PEPGLGEKLEDEN 237 (237) Q Consensus 225 ~~~~~~~~~~~e~ 237 (237) ...++|++..+|+ T Consensus 336 ~~~~gg~~n~~~~ 348 (348) T protein:vir:93 336 KSLKGGDKNVNES 348 (348) T ss_pred ccccCCCCCcCCC Confidence 2234555555666 No 36 >protein:vir:6240 Length: 457 # NCBI annotation: gp34 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:131 # MgeName: phi-BT1 # Cross-refs: genbank:acc:NP_813694;swissprot:trembl:q859c3;genbank:gi:29366754;interpro:IPR006427;interpro:IPR006944;uniprot:Q859C3;genbank:GeneID:1258894 Probab=98.91 E-value=6.6e-10 Score=70.91 Aligned_cols=216 Identities=16% Similarity=0.151 Sum_probs=116.7 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHhcc--ceeechhHHHhhcCCchHHHHHHHHHHHHHhcCc-hheeeeecCCcceee Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRRKQQ--AVWKVKGLAEMCDDDDAQYAARLRLAQVDDNSGV-GRAIGIDAETEEYDV 77 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~~~~--~v~k~~~l~~~~~~~~~e~~~~~r~~~~~~~r~~-~~~~~iD~~~e~~~~ 77 (237) .++.+.+.|.....+......+...... .++++++ .+.. +....++++++-......| .+.+++++ +-+|.. T Consensus 195 p~~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~---~ls~-e~~~~~~~~~~~~~~G~~nag~~~vl~~-g~~~~~ 269 (457) T protein:vir:62 195 PISYARESIGLALAAQKYGAHFFRNGAMPGAVVEVPG---TMSE-EGLARAREAWRAANSGVDNAHRVALLTE-GAKFSK 269 (457) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEcCC---CCCH-HHHHHHHHHHHHHhcCccccCcceecCC-CceEEE Confidence 5677777787777777777777665333 4577754 2222 2233455545433332233 33566665 577888 Q ss_pred eecCcCC--HHHHHHHHHHHHhhhhcCceeeeeccCccc-cccc-chhHHHHHHHHHHHHHHHhhhHHHHHHHHHhh--- Q lcl|NC_019725. 78 LNSDISG--VPEFLSSKMDRIVSLSGIHEIIIKNKNVGG-VSAS-QNTALETFYKLVDRKREEDYRPLLEFLLPFIV--- 150 (237) Q Consensus 78 ~~~~lsG--l~dl~~~~~~~iaa~s~iP~t~L~G~sp~G-lnat-Ge~D~~nyyd~I~~~Qe~~l~p~l~~l~~~i~--- 150 (237) ++.+... +-+........||.+-|||-..| |...++ ..++ -+.-...||.. .|.|.++++-..+- T Consensus 270 l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~l-g~~~~~~~~~sn~eq~~~~f~~~-------~l~P~~~~ie~~ln~~L 341 (457) T protein:vir:62 270 VAMSPDEAQFLQTRQFQVPEIARIFGVPPHLI-SDATNSTSWGSGLAEQNIAFTMF-------SLRPWLERIEAGFNRLL 341 (457) T ss_pred ccCChhHHHHHHHHHHHHHHHHHHhCCCHHHc-CCCCCcccccchHHHHHHHHHHH-------HHHHHHHHHHHHHHhhh Confidence 7765543 33455677889999999998765 655443 3211 13333455544 37777776644432 Q ss_pred -cC---CC--ceeEeCCCCCCCHHHHHHHHHHHHHHHHHHHhCCCCCHHHHHHHHHhhccccccCC-C----CCCC---- Q lcl|NC_019725. 151 -EE---EE--WSIEFEPLSVPSKKEESEITKNNVESVTKAITEQIIDLEEARDTLRSIAPEFKLKD-G----NNIN---- 215 (237) Q Consensus 151 -~s---~~--~~~~f~pL~~~seke~Aei~~~~A~a~~~~~~~g~i~~~e~r~~l~~~~~~~g~~~-~----~~~~---- 215 (237) .. .. +.|.+..|...|.++++ +++.+++++|+++++|+|+.+- ..+..|-.+ . .+.. T Consensus 342 ~~~~~~~~~~i~fd~~~l~~~d~~~r~-------~~~~~~~~~G~~T~NE~R~~~g-l~pi~~g~~D~~~~~~n~~~~~~ 413 (457) T protein:vir:62 342 FAETADRFRFVKFNLDEIKRGAPKERM-------ELWSLGLQNGIYSIDEVRAAED-MTPLPDGLGEKYRVPLNLGEIGE 413 (457) T ss_pred cCccccCceEEEeechhhhccCHHHHH-------HHHHHHHhCCCcCHHHHHHHhC-CCCCCCCCcceeeeccccccccc Confidence 11 23 44555688888877765 4566788999999999999763 223222100 0 0000 Q ss_pred ----------------hhccccCCCCCCCCCCCcCcCC Q lcl|NC_019725. 216 ----------------IREPEETTEPEPGLGEKLEDEN 237 (237) Q Consensus 216 ----------------~~~~e~~~e~~~~~~~~~~~e~ 237 (237) .+++.+.++++...|++.+.|+ T Consensus 414 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~ 451 (457) T protein:vir:62 414 EPEPEPAPAPPAIDPPAEEPADDEEPDNAEGDPDEGET 451 (457) T ss_pred cccccccCCCccCCCCccCCCCCCCCCCCCCCCccccc Confidence 0000011112222233333333 No 37 >protein:vir:4598 Length: 416 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:101 # MgeName: PVL # Cross-refs: genbank:acc:NP_058443;genbank:gi:9635169;genbank:GeneID:1262702 Probab=98.89 E-value=5.1e-10 Score=71.51 Aligned_cols=211 Identities=18% Similarity=0.220 Sum_probs=117.9 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHhcc--ceeechhHHHhhcCCchHHHHHHHHHHHHHh-cCchheeeeecCCcceee Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRRKQQ--AVWKVKGLAEMCDDDDAQYAARLRLAQVDDN-SGVGRAIGIDAETEEYDV 77 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~~~~--~v~k~~~l~~~~~~~~~e~~~~~r~~~~~~~-r~~~~~~~iD~~~e~~~~ 77 (237) .++.+.+.|.....+......+...... .++|+++ .+.+++...++++++.-.-.. ++..+.+++++ +.+|+. T Consensus 181 ~i~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~---~~~~~~~~~~~~~~~~~~~~g~~nag~~~vl~~-g~~~~~ 256 (416) T protein:vir:45 181 LLDTLSRTIESDNNGKDFLNNFLRNGTHAGGILKMKG---VLDNKKARDRAREEFHKSFSGTKQAGKVVVLDE-SMTFDQ 256 (416) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhccCCCcEEEEeCC---CCCCHHHHHHHHHHHHHHhcCccccCceeecCC-CceeEe Confidence 6688888888877777777777666432 4566654 222333333444544433222 22344566765 578888 Q ss_pred eecCcCC--HHHHHHHHHHHHhhhhcCceeeeeccCcccccccchhHHHHHHHHHHHHHHHhhhHHHHHHHHHhhc---C Q lcl|NC_019725. 78 LNSDISG--VPEFLSSKMDRIVSLSGIHEIIIKNKNVGGVSASQNTALETFYKLVDRKREEDYRPLLEFLLPFIVE---E 152 (237) Q Consensus 78 ~~~~lsG--l~dl~~~~~~~iaa~s~iP~t~L~G~sp~GlnatGe~D~~nyyd~I~~~Qe~~l~p~l~~l~~~i~~---s 152 (237) ++.+... +-+........||++-|||...| |...++.+ ..+...+|. ..|.|.+.++-..+-+ + T Consensus 257 l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~l-g~~~~~~~---~~~~~~~~~-------~~l~P~~~~ie~~ln~~l~~ 325 (416) T protein:vir:45 257 LEVDTEVLKLIRENKSSTREIAGVFGIPLHKF-GIETANMS---ITDANLDYL-------STLKPYITCVCAELNFKFND 325 (416) T ss_pred ccCCHHHHHHHHHHHHHHHHHHHHhCCCHHHc-CCCCCCcc---HHHHHHHHH-------HHHHHHHHHHHHHHhhhccc Confidence 7765432 33455666789999999998764 76555543 122233332 2477877766554432 1 Q ss_pred ----CCceeEeCCCCCCCHHHHHHHHHHHHHHHHHHHhCCCCCHHHHHHHHHhhccccccCC------CCCCChhccc-- Q lcl|NC_019725. 153 ----EEWSIEFEPLSVPSKKEESEITKNNVESVTKAITEQIIDLEEARDTLRSIAPEFKLKD------GNNINIREPE-- 220 (237) Q Consensus 153 ----~~~~~~f~pL~~~seke~Aei~~~~A~a~~~~~~~g~i~~~e~r~~l~~~~~~~g~~~------~~~~~~~~~e-- 220 (237) -.|.|.+..|...|.+++ ++++++++++|+++++|+|+.+- ..+..|-.. ..-+..+..+ T Consensus 326 ~~~~~~~~f~~~~l~~~D~~~~-------~~~~~~~~~~G~~T~NE~R~~~g-l~p~~~gd~~~~~~~~n~~~~~~~~~~ 397 (416) T protein:vir:45 326 EYVNREFKFDTTEIRVVDEKTQ-------AEIDKINIDSGKMNIDEIRQRDG-LAPIPGGNGSIHRVDLNHVNIELVDEY 397 (416) T ss_pred cccCceEEEechhhhccCHHHH-------HHHHHHHHhCCCcCHHHHHHHhC-CCCCCCCCcceEeeccccccccccccc Confidence 245666677777777775 56678899999999999999873 333222110 0001111111 Q ss_pred ---cCC--CCCCCCCCCcC Q lcl|NC_019725. 221 ---ETT--EPEPGLGEKLE 234 (237) Q Consensus 221 ---~~~--e~~~~~~~~~~ 234 (237) +.. +.+-.+||.-+ T Consensus 398 ~~~~~~~~~~~~kgGe~n~ 416 (416) T protein:vir:45 398 QMNKSRATDKKLKGGEENE 416 (416) T ss_pred CcccccccccccCCCCCCC Confidence 111 11122333222 No 38 >protein:vir:81095 Length: 416 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1891 # MgeName: tp310-1 # Cross-refs: genbank:acc:YP_001429872;genbank:gi:156603925;genbank:GeneID:5525315 Probab=98.89 E-value=5.1e-10 Score=71.51 Aligned_cols=211 Identities=18% Similarity=0.220 Sum_probs=117.9 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHhcc--ceeechhHHHhhcCCchHHHHHHHHHHHHHh-cCchheeeeecCCcceee Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRRKQQ--AVWKVKGLAEMCDDDDAQYAARLRLAQVDDN-SGVGRAIGIDAETEEYDV 77 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~~~~--~v~k~~~l~~~~~~~~~e~~~~~r~~~~~~~-r~~~~~~~iD~~~e~~~~ 77 (237) .++.+.+.|.....+......+...... .++|+++ .+.+++...++++++.-.-.. ++..+.+++++ +.+|+. T Consensus 181 ~i~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~---~~~~~~~~~~~~~~~~~~~~g~~nag~~~vl~~-g~~~~~ 256 (416) T protein:vir:81 181 LLDTLSRTIESDNNGKDFLNNFLRNGTHAGGILKMKG---VLDNKKARDRAREEFHKSFSGTKQAGKVVVLDE-SMTFDQ 256 (416) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhccCCCcEEEEeCC---CCCCHHHHHHHHHHHHHHhcCccccCceeecCC-CceeEe Confidence 6688888888877777777777666432 4566654 222333333444544433222 22344566765 578888 Q ss_pred eecCcCC--HHHHHHHHHHHHhhhhcCceeeeeccCcccccccchhHHHHHHHHHHHHHHHhhhHHHHHHHHHhhc---C Q lcl|NC_019725. 78 LNSDISG--VPEFLSSKMDRIVSLSGIHEIIIKNKNVGGVSASQNTALETFYKLVDRKREEDYRPLLEFLLPFIVE---E 152 (237) Q Consensus 78 ~~~~lsG--l~dl~~~~~~~iaa~s~iP~t~L~G~sp~GlnatGe~D~~nyyd~I~~~Qe~~l~p~l~~l~~~i~~---s 152 (237) ++.+... +-+........||++-|||...| |...++.+ ..+...+|. ..|.|.+.++-..+-+ + T Consensus 257 l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~l-g~~~~~~~---~~~~~~~~~-------~~l~P~~~~ie~~ln~~l~~ 325 (416) T protein:vir:81 257 LEVDTEVLKLIRENKSSTREIAGVFGIPLHKF-GIETANMS---ITDANLDYL-------STLKPYITCVCAELNFKFND 325 (416) T ss_pred ccCCHHHHHHHHHHHHHHHHHHHHhCCCHHHc-CCCCCCcc---HHHHHHHHH-------HHHHHHHHHHHHHHhhhccc Confidence 7765432 33455666789999999998764 76555543 122233332 2477877766554432 1 Q ss_pred ----CCceeEeCCCCCCCHHHHHHHHHHHHHHHHHHHhCCCCCHHHHHHHHHhhccccccCC------CCCCChhccc-- Q lcl|NC_019725. 153 ----EEWSIEFEPLSVPSKKEESEITKNNVESVTKAITEQIIDLEEARDTLRSIAPEFKLKD------GNNINIREPE-- 220 (237) Q Consensus 153 ----~~~~~~f~pL~~~seke~Aei~~~~A~a~~~~~~~g~i~~~e~r~~l~~~~~~~g~~~------~~~~~~~~~e-- 220 (237) -.|.|.+..|...|.+++ ++++++++++|+++++|+|+.+- ..+..|-.. ..-+..+..+ T Consensus 326 ~~~~~~~~f~~~~l~~~D~~~~-------~~~~~~~~~~G~~T~NE~R~~~g-l~p~~~gd~~~~~~~~n~~~~~~~~~~ 397 (416) T protein:vir:81 326 EYVNREFKFDTTEIRVVDEKTQ-------AEIDKINIDSGKMNIDEIRQRDG-LAPIPGGNGSIHRVDLNHVNIELVDEY 397 (416) T ss_pred cccCceEEEechhhhccCHHHH-------HHHHHHHHhCCCcCHHHHHHHhC-CCCCCCCCcceEeeccccccccccccc Confidence 245666677777777775 56678899999999999999873 333222110 0001111111 Q ss_pred ---cCC--CCCCCCCCCcC Q lcl|NC_019725. 221 ---ETT--EPEPGLGEKLE 234 (237) Q Consensus 221 ---~~~--e~~~~~~~~~~ 234 (237) +.. +.+-.+||.-+ T Consensus 398 ~~~~~~~~~~~~kgGe~n~ 416 (416) T protein:vir:81 398 QMNKSRATDKKLKGGEENE 416 (416) T ss_pred CcccccccccccCCCCCCC Confidence 111 11122333222 No 39 >protein:vir:102080 Length: 429 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1503 # MgeName: Fah # Cross-refs: genbank:acc:YP_512313;genbank:gi:89152482;genbank:GeneID:3953073 Probab=98.88 E-value=4.2e-10 Score=72.00 Aligned_cols=216 Identities=14% Similarity=0.184 Sum_probs=120.8 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHhcc--ceeechhHHHhhcCCchHHHHHHHHHHHHHh-cCchheeeeecCCcceee Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRRKQQ--AVWKVKGLAEMCDDDDAQYAARLRLAQVDDN-SGVGRAIGIDAETEEYDV 77 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~~~~--~v~k~~~l~~~~~~~~~e~~~~~r~~~~~~~-r~~~~~~~iD~~~e~~~~ 77 (237) .++.+.+.|.....+......+...... .++++++ .+.. +...+++++++..... ++..+.++++. +-+|+. T Consensus 191 ~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~il~~~~---~l~~-e~~~~~~~~~~~~~~g~~n~~~~~vl~~-g~~~~~ 265 (429) T protein:vir:10 191 TMEYLKSTLENSASADKFINNFYKQGLQVKGLVQYVG---DLNE-DAKKVFRENFESMSSGLQNSHRIALMPV-GYQFQP 265 (429) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEcCC---CCCH-HHHHHHHHHHHHHhccccccCceeecCC-CceEEE Confidence 5778888888888888888887766432 3566653 2222 2234556666543333 33345566664 567888 Q ss_pred eecCcCCHH--HHHHHHHHHHhhhhcCceeeeeccCcccccccchhHHHHHHHHHHHHHHHhhhHHHHHHHHHhh----- Q lcl|NC_019725. 78 LNSDISGVP--EFLSSKMDRIVSLSGIHEIIIKNKNVGGVSASQNTALETFYKLVDRKREEDYRPLLEFLLPFIV----- 150 (237) Q Consensus 78 ~~~~lsGl~--dl~~~~~~~iaa~s~iP~t~L~G~sp~GlnatGe~D~~nyyd~I~~~Qe~~l~p~l~~l~~~i~----- 150 (237) ++.+..... +......+.||.+-|||...|-+...+..+ +-+.....|| +..|.|.+..+-..+- T Consensus 266 l~~~~~d~q~~e~~~~~~~~Ia~~fgVP~~~lg~~~~~~~s-n~e~~~~~f~-------~~~l~P~~~~ie~~ln~kl~~ 337 (429) T protein:vir:10 266 ISLNMSDAQFLENTELTIRQIATAFGIKMHQLNDLSKATLN-NIEQQQQQFY-------TDTLQATLTMYEQEMTYKLFL 337 (429) T ss_pred ccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcc-cHHHHHHHHH-------HHHHHHHHHHHHHHHHHhhcC Confidence 765543332 345577889999999999888554433332 3344555555 3557787776655442 Q ss_pred ---cCCCceeEeC--CCCCCCHHHHHHHHHHHHHHHHHHHhCCCCCHHHHHHHHHhhccccccC----CCCCCChhcccc Q lcl|NC_019725. 151 ---EEEEWSIEFE--PLSVPSKKEESEITKNNVESVTKAITEQIIDLEEARDTLRSIAPEFKLK----DGNNINIREPEE 221 (237) Q Consensus 151 ---~s~~~~~~f~--pL~~~seke~Aei~~~~A~a~~~~~~~g~i~~~e~r~~l~~~~~~~g~~----~~~~~~~~~~e~ 221 (237) +..++.|+|+ .|...|.++++ ++++.++++|+++++|+|+.+ ...+..|.. +..-...+...+ T Consensus 338 ~~~~~~g~~~~fd~~~ll~~d~~~~~-------~~~~~~~~~G~~T~NE~R~~~-gl~p~~ggD~~~~~~n~~~~d~~~~ 409 (429) T protein:vir:10 338 DSELDKGFYSKFNVDAILRADIKTRY-------EAYRTGIQGGFLKPNEARSKE-DLPPEAGGDRLLVNGNMLPIDMAGQ 409 (429) T ss_pred hhhcCCCcEEEeechhhhcCCHHHHH-------HHHHHHHhCCCcCHHHHHHHh-CCCCCCCcCeeeecccccchhhccc Confidence 2235556654 78888888764 467889999999999999876 223322210 000000111000 Q ss_pred --CC-CCCCCCCCCcCcCC Q lcl|NC_019725. 222 --TT-EPEPGLGEKLEDEN 237 (237) Q Consensus 222 --~~-e~~~~~~~~~~~e~ 237 (237) .+ .++.+......+|| T Consensus 410 ~~~k~g~~~~~~~~~~~e~ 428 (429) T protein:vir:10 410 AYLKGGDTNGEVSKEGNEG 428 (429) T ss_pred cccCCCCCCCCCCCCCCCC Confidence 00 01111111122222 No 40 >protein:vir:80134 Length: 403 # NCBI annotation: Phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1877 # MgeName: bacteriophage bv1 # Cross-refs: genbank:acc:YP_001425602;genbank:gi:155042935;genbank:GeneID:5469563 Probab=98.88 E-value=5.8e-10 Score=71.22 Aligned_cols=212 Identities=14% Similarity=0.180 Sum_probs=109.6 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHhc--cceeechhHHHhhcCCchHHHHHHHH-HHHHHhcCchheeeeecCCcceee Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRRKQ--QAVWKVKGLAEMCDDDDAQYAARLRL-AQVDDNSGVGRAIGIDAETEEYDV 77 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~~~--~~v~k~~~l~~~~~~~~~e~~~~~r~-~~~~~~r~~~~~~~iD~~~e~~~~ 77 (237) .++.+.+.+.....+......+..... -.|+++++ .+.... ..+.++++ +.+....+..+.+++.....++.+ T Consensus 175 ~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~---~~~~~~-~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~ 250 (403) T protein:vir:80 175 YRVVLKDIVNNLKQATTTKKSFMSGKYMPSLIVKVDA---ATAELS-SEEGRNAVFKKYLEASEAGQPWIIPAELLDVEQ 250 (403) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCC---CCChHH-HHHHHHHHHHHHhhhhhcCCeeeecccccccce Confidence 566666777766666666666665433 23555543 122222 22333333 222222333455566544444544 Q ss_pred ee-cCcC--CHHHHHHHHHHHHhhhhcCceeeeeccCcccccccchhHHHHHHHHHHHHHHHhhhHHHHHHHHHh----h Q lcl|NC_019725. 78 LN-SDIS--GVPEFLSSKMDRIVSLSGIHEIIIKNKNVGGVSASQNTALETFYKLVDRKREEDYRPLLEFLLPFI----V 150 (237) Q Consensus 78 ~~-~~ls--Gl~dl~~~~~~~iaa~s~iP~t~L~G~sp~GlnatGe~D~~nyyd~I~~~Qe~~l~p~l~~l~~~i----~ 150 (237) .. .+.. .+-+........||.+-|||..+| | .+..++....+||. ..|.|.++++-..+ . T Consensus 251 ~~~l~~~d~q~~e~~~~~~~~Ia~~fgVPp~~l-g-----~~~~~~~~~~~f~~-------~~l~P~~~~ie~~l~~kll 317 (403) T protein:vir:80 251 VKPLSLKDLAIHETVELDKRTVAGIFGVPAFLL-G-----VGKYDKDEYNNFIN-------STILPIAKGIEQELTRKLL 317 (403) T ss_pred eccCCHHHHHHHHHHHHhHHHHHHHhCCCHHHc-C-----CCCccHHHHHHHHH-------HHHHHHHHHHHHHHHHhcc Confidence 32 2222 333455667788999999997666 4 23233344566664 34788887665443 3 Q ss_pred cCCCceeEeC--CCCCCCHHHHHHHHHHHHHHHHHHHhCCCCCHHHHHHHHHhhccccccC----CCCCCChhccccCCC Q lcl|NC_019725. 151 EEEEWSIEFE--PLSVPSKKEESEITKNNVESVTKAITEQIIDLEEARDTLRSIAPEFKLK----DGNNINIREPEETTE 224 (237) Q Consensus 151 ~s~~~~~~f~--pL~~~seke~Aei~~~~A~a~~~~~~~g~i~~~e~r~~l~~~~~~~g~~----~~~~~~~~~~e~~~e 224 (237) ...++.|+|+ .|...|.++++ +++.+++++|+++++|+|+.+ ...+..|-. +..-+..+...+... T Consensus 318 ~~~~~~~~f~~~~ll~~d~~~~~-------~~~~~~~~~Gi~t~NE~R~~~-gl~p~~ggd~~~~~~n~~pl~~~~~~~~ 389 (403) T protein:vir:80 318 ISPDLYFKFNPRSLYAYDLKELA-------EVGSNMYVRGLMEGNEVRDWL-GLSPKEGLSELVILENYIPLDKIGDQNK 389 (403) T ss_pred CCCCcEEEeechhhhccCHHHHH-------HHHHHHHhCCCcCHHHHHHHh-CCCCCCCCCeEeecccccchhhccchhh Confidence 4567777775 56666666554 577788999999999999976 233322210 000000111111111 Q ss_pred CCCCCCCCcCcCC Q lcl|NC_019725. 225 PEPGLGEKLEDEN 237 (237) Q Consensus 225 ~~~~~~~~~~~e~ 237 (237) ...+.+.+.++++ T Consensus 390 ~k~ge~~~~~~~~ 402 (403) T protein:vir:80 390 LKGGEKGGADGQT 402 (403) T ss_pred ccCCCCCCCCCCC Confidence 1222222222333 No 41 >protein:vir:4337 Length: 434 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:93 # MgeName: D3 # Cross-refs: genbank:acc:NP_061500;genbank:gi:9635589;genbank:GeneID:1262858 Probab=98.85 E-value=7.1e-10 Score=70.75 Aligned_cols=215 Identities=10% Similarity=0.072 Sum_probs=119.4 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHhcc--ceeechhHHHhhcCCchHHHHHHHHHHHHHhcCchheeeeecCCcceeee Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRRKQQ--AVWKVKGLAEMCDDDDAQYAARLRLAQVDDNSGVGRAIGIDAETEEYDVL 78 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~~~~--~v~k~~~l~~~~~~~~~e~~~~~r~~~~~~~r~~~~~~~iD~~~e~~~~~ 78 (237) -++.+.+.|.....+......+...... .++++++ .++. +....+++.++.+....+..+.+++++ +-+|..+ T Consensus 194 pi~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~---~l~~-e~~~~~r~~~~~~~g~~nag~~~vl~~-g~~~~~l 268 (434) T protein:vir:43 194 AIRYGVDVFGSVMSAEDAANGTFKNGLLPTVAFKVDR---ILQP-AQREEFREYVKSVSGAMNSGRSPVLEQ-GITPETI 268 (434) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEecCC---CCCH-HHHHHHHHHHHHhcCccccCCccccCC-CceEEEc Confidence 5778888888888888888887765433 3566654 2222 223345544443322222344566665 5788888 Q ss_pred ecCcC--CHHHHHHHHHHHHhhhhcCceeeeeccCccc-ccccc-hhHHHHHHHHHHHHHHHhhhHHHHHHHHHhhc--- Q lcl|NC_019725. 79 NSDIS--GVPEFLSSKMDRIVSLSGIHEIIIKNKNVGG-VSASQ-NTALETFYKLVDRKREEDYRPLLEFLLPFIVE--- 151 (237) Q Consensus 79 ~~~ls--Gl~dl~~~~~~~iaa~s~iP~t~L~G~sp~G-lnatG-e~D~~nyyd~I~~~Qe~~l~p~l~~l~~~i~~--- 151 (237) +.+.. .+-+........||.+-|||-..| |...++ ...++ +.-...|| ...|.|.+.++-..+-. T Consensus 269 ~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~l-g~~~~~~~~~s~~e~~~~~f~-------~~~L~P~~~~ie~~ln~kL~ 340 (434) T protein:vir:43 269 GINPVDAQLLETREHGVIEICRWFGVPPWMI-GQTDKGSNWGTGLEQQMLAFL-------TFSISSITNQIQQCVNKRLL 340 (434) T ss_pred cCChhHHHHHHHHHHHHHHHHHHhCCCHHHh-CCCcCCccccchHHHHHHHHH-------HHHHHHHHHHHHHHHHhhcC Confidence 76654 344667788899999999997766 654433 22222 22333444 34578877776444321 Q ss_pred --C--CCceeEeC--CCCCCCHHHHHHHHHHHHHHHHHHHhCCCCCHHHHHHHHHhhcccccc----CCCCCCChhcccc Q lcl|NC_019725. 152 --E--EEWSIEFE--PLSVPSKKEESEITKNNVESVTKAITEQIIDLEEARDTLRSIAPEFKL----KDGNNINIREPEE 221 (237) Q Consensus 152 --s--~~~~~~f~--pL~~~seke~Aei~~~~A~a~~~~~~~g~i~~~e~r~~l~~~~~~~g~----~~~~~~~~~~~e~ 221 (237) . .++.|+|+ .|...|.+++ ++++.+++++|+++++|+|+.+- ..+..|- .+...+..+..++ T Consensus 341 ~~~~~~~~~~~fd~~~llr~d~~~r-------~~~~~~~~~~G~~T~NE~R~~~g-l~p~~ggD~~~~~~n~~~~~~~~~ 412 (434) T protein:vir:43 341 TAPERIRYYAEFSLEGFLKADSAGR-------AAWYSTMAQNGFMTRNEGRRKEN-LPELPGGDILTVQSNLVPIDQLGQ 412 (434) T ss_pred ChhhhcCceEEEechhhhccCHHHH-------HHHHHHHHhCCCcCHHHHHHHhC-CCCCCCCCeEeeccCccchhhhhc Confidence 1 24455554 7767777664 67788899999999999999762 3332221 0111111222221 Q ss_pred CCCCC-------CCCCCCcCcC Q lcl|NC_019725. 222 TTEPE-------PGLGEKLEDE 236 (237) Q Consensus 222 ~~e~~-------~~~~~~~~~e 236 (237) ...++ ...++++-.| T Consensus 413 ~~~~~~~~~~~~~~~~~~~~~~ 434 (434) T protein:vir:43 413 SNKSQAVRAALMNWFSQPEPQE 434 (434) T ss_pred cCCCcchhhhhhccCCCCCCCC Confidence 11111 1122222222 No 42 >protein:vir:81152 Length: 411 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1892 # MgeName: Geobacillus virus E2 # Cross-refs: genbank:acc:YP_001285809;genbank:gi:148747730;genbank:GeneID:5247195 Probab=98.85 E-value=3.2e-10 Score=72.64 Aligned_cols=211 Identities=16% Similarity=0.093 Sum_probs=120.9 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHhcc--ceeechhHHHhhcCCchHHHHHHHHHHHHHhcCch-heeeeecCCcceee Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRRKQQ--AVWKVKGLAEMCDDDDAQYAARLRLAQVDDNSGVG-RAIGIDAETEEYDV 77 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~~~~--~v~k~~~l~~~~~~~~~e~~~~~r~~~~~~~r~~~-~~~~iD~~~e~~~~ 77 (237) .++.+.+.+.....+......+...... .++++++ .++ ++...++++++.-.-..-+|. +.+++++ +-+|+. T Consensus 185 ~~~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~---~l~-~e~~~~~~~~~~~~~~g~~n~g~~~vl~~-g~~~~~ 259 (411) T protein:vir:81 185 VRDVLKHTVDGALESQKFMNNLYKTGLTGKAVLEYTG---DLN-QEARDRLVKGFEQFANGSKNAGKIIPVPL-GMKLVP 259 (411) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCC---CCC-HHHHHHHHHHHHHHhcCccccCCceecCC-CceEEE Confidence 6778888888888888888887766432 3455543 222 223345666665443333343 3455554 577888 Q ss_pred eecCcC--CHHHHHHHHHHHHhhhhcCceeeeeccCcccccccchhHHHHHHHHHHHHHHHhhhHHHHHHHHHhhc---- Q lcl|NC_019725. 78 LNSDIS--GVPEFLSSKMDRIVSLSGIHEIIIKNKNVGGVSASQNTALETFYKLVDRKREEDYRPLLEFLLPFIVE---- 151 (237) Q Consensus 78 ~~~~ls--Gl~dl~~~~~~~iaa~s~iP~t~L~G~sp~GlnatGe~D~~nyyd~I~~~Qe~~l~p~l~~l~~~i~~---- 151 (237) ++.+.. .+-+........||++-|||...| |...++=.++.+.-..+||. ..|.|.++++-..+-+ T Consensus 260 l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~l-g~~~~~t~~n~e~~~~~f~~-------~~l~P~~~~ie~~l~~~ll~ 331 (411) T protein:vir:81 260 LDIKLTDSQFFELKKYTALQIAAAFGIKPNQI-NDYEKSSYASAEAQNLAFYV-------DTLLYVLKQYEEEITYKILS 331 (411) T ss_pred ccCCHHHHHHHHHHHHHHHHHHHHhCCCHHHh-CCCCCCCchhHHHHHHHHHH-------HHHHHHHHHHHHHHHhhcCC Confidence 766543 223456677899999999998877 54443322344444555653 4578887777554421 Q ss_pred ----CCCc--eeEeCCCCCCCHHHHHHHHHHHHHHHHHHHhCCCCCHHHHHHHHHhhccccccC-CCCCCChhccccCCC Q lcl|NC_019725. 152 ----EEEW--SIEFEPLSVPSKKEESEITKNNVESVTKAITEQIIDLEEARDTLRSIAPEFKLK-DGNNINIREPEETTE 224 (237) Q Consensus 152 ----s~~~--~~~f~pL~~~seke~Aei~~~~A~a~~~~~~~g~i~~~e~r~~l~~~~~~~g~~-~~~~~~~~~~e~~~e 224 (237) ..+. .|.+..|...|.+++ +++++.++++|+++++|+|+.+- ..+..|-. ..-....-..+...+ T Consensus 332 ~~~~~~~~~~~fd~~~ll~~d~~~~-------~~~~~~~~~~g~~t~NE~R~~~g-l~p~~ggD~~~~~~n~~pl~~~~~ 403 (411) T protein:vir:81 332 NDLISQGHYFKFNVNVILRADIKTQ-------MDSLSTAVQNGIMTPNEARDYLD-MPADDYGNNLMANGNYIPLSMLGA 403 (411) T ss_pred hhhcCCCcEEEeechhhhccCHHHH-------HHHHHHHHhCCCcCHHHHHHHhC-CCCCCCCCeeeeccCccchhhhhh Confidence 2344 455666777777665 56788899999999999998762 22222100 000000001122222 Q ss_pred CCCCCCCC Q lcl|NC_019725. 225 PEPGLGEK 232 (237) Q Consensus 225 ~~~~~~~~ 232 (237) ....+||. T Consensus 404 ~~~kgGd~ 411 (411) T protein:vir:81 404 NYGKGGDS 411 (411) T ss_pred hhccCCCC Confidence 22334444 No 43 >protein:vir:7853 Length: 518 # NCBI annotation: gp10 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:150 # MgeName: CJW1 # Cross-refs: genbank:acc:NP_817460;genbank:gi:29565889;genbank:GeneID:1259085 Probab=98.85 E-value=1.5e-09 Score=69.02 Aligned_cols=216 Identities=13% Similarity=0.131 Sum_probs=115.4 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHhcc--ceeechhHHHhhcCCchHHHHHHHHHHHHHhcCch-heeeeecCCcceee Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRRKQQ--AVWKVKGLAEMCDDDDAQYAARLRLAQVDDNSGVG-RAIGIDAETEEYDV 77 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~~~~--~v~k~~~l~~~~~~~~~e~~~~~r~~~~~~~r~~~-~~~~iD~~~e~~~~ 77 (237) .++.+.+.|.....+......+...... .|+++++ .++. +....++++++-......|. +.++++. +-+|.. T Consensus 189 pi~~~~~~i~~~~aa~~~~~~~f~Ng~~p~gvl~~~~---~ls~-e~~~~~k~~~~~~~~G~~nag~~~vL~~-G~~~~~ 263 (518) T protein:vir:78 189 LMESLKSTIFSEDSSRNATAAMWKNAGRPNLVLRHEK---RLSP-EAQQRLREQFDRAHAGSSNTGKTMVVEE-GMEPIP 263 (518) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhcCCCccEEEecCC---CCCH-HHHHHHHHHHHHHhcCcccCCceeEcCC-CceEEe Confidence 5677778888888888888887766444 3666653 2222 22334555555433332343 4556654 578888 Q ss_pred eecCcCC--HHHHHHHHHHHHhhhhcCceeeeeccCcccccccchhHHHHHHHHHHHHHHHhhhHHHHHHHHHhh---c- Q lcl|NC_019725. 78 LNSDISG--VPEFLSSKMDRIVSLSGIHEIIIKNKNVGGVSASQNTALETFYKLVDRKREEDYRPLLEFLLPFIV---E- 151 (237) Q Consensus 78 ~~~~lsG--l~dl~~~~~~~iaa~s~iP~t~L~G~sp~GlnatGe~D~~nyyd~I~~~Qe~~l~p~l~~l~~~i~---~- 151 (237) ++.+... +-+........||.+-|||-.+| |..-++=.++-+.-...||.. .|.|.+.++-..+- . T Consensus 264 l~~~~~d~q~le~r~~~~~eIa~afgVPp~~l-g~~~~st~sn~e~~~~~f~~~-------tL~P~~~~ie~eln~~L~~ 335 (518) T protein:vir:78 264 LQLTAVEMQFIEARQLNREEVCGVYDIAPPIV-HILDRATFSNISAQMRAFYRD-------TMAIPIARIQSAMDKYVGQ 335 (518) T ss_pred ccCChhHHHHHHHHHHHHHHHHHHhCCCHHHh-ccCCCCCchhHHHHHHHHHHH-------HHHHHHHHHHHHHHHhhcc Confidence 7765432 33455567789999999998877 544332112224444555543 37777776654432 1 Q ss_pred --CCCceeEe--CCCCCCCHHHHHHHHHHHHHHHHHHHhCCCCCHHHHHHHHHhhccccccCCCC-----CC-Chhc--- Q lcl|NC_019725. 152 --EEEWSIEF--EPLSVPSKKEESEITKNNVESVTKAITEQIIDLEEARDTLRSIAPEFKLKDGN-----NI-NIRE--- 218 (237) Q Consensus 152 --s~~~~~~f--~pL~~~seke~Aei~~~~A~a~~~~~~~g~i~~~e~r~~l~~~~~~~g~~~~~-----~~-~~~~--- 218 (237) .....|+| ..|...|.+++ ++++..++++|+++++|+|+.+- ..+..+..++. +. .... T Consensus 336 ~~~~~~~~~fd~~~Llr~D~~~r-------~~~~~~~~~~G~lT~NE~R~~~g-l~pie~~~gD~~~v~~n~~pl~~~~~ 407 (518) T protein:vir:78 336 YWVRKNRMKFDIDDVIQPDWEAK-------SESTQKMVNSGVATPNEGREIMG-LPRSDDPKADELYANSALQPLGATPD 407 (518) T ss_pred cccCcceEEeechhhhccCHHHH-------HHHHHHHHhCCCcCHHHHHHHhC-CCCCCCCCCceeeecccceecccccc Confidence 13445555 47777776654 66788899999999999999773 22221100000 00 0000 Q ss_pred ----cccCCC-CCCCC---CCCcCcCC Q lcl|NC_019725. 219 ----PEETTE-PEPGL---GEKLEDEN 237 (237) Q Consensus 219 ----~e~~~e-~~~~~---~~~~~~e~ 237 (237) .++++. ++|.. ++..++.+ T Consensus 408 ~~~~g~~~~~~~~~~~~~~~~~~~~~~ 434 (518) T protein:vir:78 408 GAVEGEEAPAPKRPASTPVASLDQSPP 434 (518) T ss_pred cccCCCCCCCCCCCCcccccccccCcc Confidence 000000 00000 01111111 No 44 >protein:vir:4454 Length: 414 # NCBI annotation: Portal Protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:96 # MgeName: ST64B # Cross-refs: genbank:acc:NP_700377;genbank:gi:23505449;genbank:GeneID:955656 Probab=98.84 E-value=1.1e-09 Score=69.73 Aligned_cols=214 Identities=12% Similarity=0.074 Sum_probs=117.3 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHhc--cceeechhHHHhhcCCchHHHHHHHHHHHHHhcCc-hheeeeecCCcceee Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRRKQ--QAVWKVKGLAEMCDDDDAQYAARLRLAQVDDNSGV-GRAIGIDAETEEYDV 77 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~~~--~~v~k~~~l~~~~~~~~~e~~~~~r~~~~~~~r~~-~~~~~iD~~~e~~~~ 77 (237) .++.+...|.....+......+..... -.++++++ .+. ++....+++++......-.| .+.++++. +-+|.. T Consensus 181 ~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~---~l~-~e~~~~~~~~~~~~~~g~~n~~~~~vl~~-g~~~~~ 255 (414) T protein:vir:44 181 PIAYAREAISLAAATEEHGARLFSNGAVTSGVLRTEQ---TLS-DQAYERLKKDFEERHTGLGNAHRPMILEM-GLDWKS 255 (414) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCC---CCC-HHHHHHHHHHHHHHhcCccccCcceecCC-CceEEE Confidence 677777888888888777777776643 34566653 222 22334455555433222233 33566654 567888 Q ss_pred eecCcCC--HHHHHHHHHHHHhhhhcCceeeeeccCcccccccchhHHHHHHHHHHHHHHHhhhHHHHHHHHHhh----c Q lcl|NC_019725. 78 LNSDISG--VPEFLSSKMDRIVSLSGIHEIIIKNKNVGGVSASQNTALETFYKLVDRKREEDYRPLLEFLLPFIV----E 151 (237) Q Consensus 78 ~~~~lsG--l~dl~~~~~~~iaa~s~iP~t~L~G~sp~GlnatGe~D~~nyyd~I~~~Qe~~l~p~l~~l~~~i~----~ 151 (237) ++.+... +-+........||.+-|||..+|-+..-+..+ +-+...+.||. ..|.|.++++-..+- . T Consensus 256 l~~~~~d~~~~e~~~~~~~~Ia~~fgVpp~~l~~~~~~t~~-n~e~~~~~~~~-------~~l~P~~~~ie~~ln~~L~~ 327 (414) T protein:vir:44 256 MALNAEDSQFLETRKFQLEEICRLFRVPLHMVQNTDRATFN-NIEELGLGFIN-------YSLVPYLTRIEQRINTGLVR 327 (414) T ss_pred ccCChHHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcc-cHHHHHHHHHH-------HHHHHHHHHHHHHHHhhcCC Confidence 7765443 23455566788999999999888554333332 33445566664 357787777644442 1 Q ss_pred C---CC--ceeEeCCCCCCCHHHHHHHHHHHHHHHHHHHhCCCCCHHHHHHHHHhhccccccCC-CCCCChhccc----- Q lcl|NC_019725. 152 E---EE--WSIEFEPLSVPSKKEESEITKNNVESVTKAITEQIIDLEEARDTLRSIAPEFKLKD-GNNINIREPE----- 220 (237) Q Consensus 152 s---~~--~~~~f~pL~~~seke~Aei~~~~A~a~~~~~~~g~i~~~e~r~~l~~~~~~~g~~~-~~~~~~~~~e----- 220 (237) . .. +.|.+..|...|.++++ +++++++++|+++++|+|+.+ ...+..|-.- .......... T Consensus 328 ~~~~~~~~i~fd~~~ll~~d~~~~~-------~~~~~~~~~G~~t~NE~R~~~-gl~p~~ggD~~~~~~n~~~~~~~~~~ 399 (414) T protein:vir:44 328 KSKQGVFYAKFNAGALLRGDMKSRF-------EAYATGINWGIYSPNDCRDLE-DMNPRPGGDVYLTPMNMTTKPSDGSK 399 (414) T ss_pred ccccCceEEEEechhhhccCHHHHH-------HHHHHHHhCCCcCHHHHHHHh-CCCCCCCcceecccccccccCCcccc Confidence 2 13 34455577777777754 577889999999999999876 2222222100 0000000000 Q ss_pred cCCCCCCCCCCCcCc Q lcl|NC_019725. 221 ETTEPEPGLGEKLED 235 (237) Q Consensus 221 ~~~e~~~~~~~~~~~ 235 (237) ...+.+++..+...+ T Consensus 400 ~~~~~~~~~~d~~~~ 414 (414) T protein:vir:44 400 AGKQKDNANADETTS 414 (414) T ss_pred CCCCCCCCCCCCCCC Confidence 111111111111111 No 45 >protein:vir:1266 Length: 416 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:329 # MgeName: phi-105 # Cross-refs: genbank:acc:NP_690758;genbank:gi:22854998;genbank:GeneID:955213 Probab=98.84 E-value=1.8e-09 Score=68.55 Aligned_cols=212 Identities=12% Similarity=0.102 Sum_probs=125.5 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHhcc--ceeechhHHHhhcCCchHHHHHHHHHHHHHhcCchheeeeecCCcceeee Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRRKQQ--AVWKVKGLAEMCDDDDAQYAARLRLAQVDDNSGVGRAIGIDAETEEYDVL 78 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~~~~--~v~k~~~l~~~~~~~~~e~~~~~r~~~~~~~r~~~~~~~iD~~~e~~~~~ 78 (237) .++.+.+.+.....+......+...... .++++++ .+ +.+....++++++ ...+..+.+++++ +-+|+.+ T Consensus 183 ~i~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~---~~-~~e~~~~~~~~~~---~~~~~~~~~vl~~-g~~~~~l 254 (416) T protein:vir:12 183 PIGVVREHIGAQAAATKYNAKLYKNEATPRGILKVPA---FL-DEKPKENVRKEWK---RVNKVENIAIIDY-GLEYQSI 254 (416) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhcCCCCceEEecCC---CC-CHHHHHHHHHHHH---HHhcCCCeeecCC-CceEEEc Confidence 6888888888888888888887776443 3666653 22 2223344555554 3445566677765 5788888 Q ss_pred ecCcCCHH--HHHHHHHHHHhhhhcCceeeeeccCcccccccchhHHHHHHHHHHHHHHHhhhHHHHHHHHHhhc----C Q lcl|NC_019725. 79 NSDISGVP--EFLSSKMDRIVSLSGIHEIIIKNKNVGGVSASQNTALETFYKLVDRKREEDYRPLLEFLLPFIVE----E 152 (237) Q Consensus 79 ~~~lsGl~--dl~~~~~~~iaa~s~iP~t~L~G~sp~GlnatGe~D~~nyyd~I~~~Qe~~l~p~l~~l~~~i~~----s 152 (237) +.+..... +........||.+-|||...|.+...+..+ +-+...+.||. ..|.|.+.++-..+-+ . T Consensus 255 ~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~s-n~e~~~~~f~~-------~~l~P~~~~ie~~l~~~l~~~ 326 (416) T protein:vir:12 255 SMPLQEAQFVESMKFNKAQISMIYKVPLHKLNELDKATFS-NIEHQSIEYVR-------NTLQPWIVNFEQELNVKLFLD 326 (416) T ss_pred cCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCccCCCcc-cHHHHHHHHHH-------HHHHHHHHHHHHHHHHhhcCc Confidence 77665433 667788899999999999999765544443 33445566664 3577877777555421 1 Q ss_pred ----CCceeEe--CCCCCCCHHHHHHHHHHHHHHHHHHHhCCCCCHHHHHHHHHhhcccccc----CCCCCCChhccccC Q lcl|NC_019725. 153 ----EEWSIEF--EPLSVPSKKEESEITKNNVESVTKAITEQIIDLEEARDTLRSIAPEFKL----KDGNNINIREPEET 222 (237) Q Consensus 153 ----~~~~~~f--~pL~~~seke~Aei~~~~A~a~~~~~~~g~i~~~e~r~~l~~~~~~~g~----~~~~~~~~~~~e~~ 222 (237) .++.|+| ..|...|.+++ ++++..++++|+++++|+|+.+- ..+..|- .+..-...+..++. T Consensus 327 ~~~~~g~~i~fd~~~l~~~d~~~~-------~~~~~~~~~~G~~T~NE~R~~~g-l~Pi~ggd~~~~~~n~~~~~~~~~~ 398 (416) T protein:vir:12 327 HDQKSGHYVKFNIDSELRGDSKTQ-------AEYLKTLHETGVLNKDEIRELLE-RNPIENGDKYISSLNYVFLDFLEEY 398 (416) T ss_pred hhhcCCceEEeechhhhccCHHHH-------HHHHHHHHhCCCcCHHHHHHHhC-CCCCCCcceeeeccccccccccchh Confidence 3455665 46666666665 56788899999999999999863 3332221 00000101111111 Q ss_pred CC----CCCCCCCCcCcC Q lcl|NC_019725. 223 TE----PEPGLGEKLEDE 236 (237) Q Consensus 223 ~e----~~~~~~~~~~~e 236 (237) .. .+.++||+.+.= T Consensus 399 ~~~~~~~~~~gge~~~~g 416 (416) T protein:vir:12 399 QRLKAGGAMKGGDNKNEG 416 (416) T ss_pred hccccccccCCCCCcCCC Confidence 00 111222211111 No 46 >protein:vir:80333 Length: 419 # NCBI annotation: gp4, phage portal protein, HK97 family # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1881 # MgeName: phi644-2 # Cross-refs: genbank:acc:YP_001111083;genbank:gi:134288632;genbank:GeneID:4960580 Probab=98.83 E-value=9.6e-10 Score=70.02 Aligned_cols=217 Identities=11% Similarity=0.045 Sum_probs=114.1 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHhccc--eeechhHHHhhcCCchHHHHHHHHHHHHHhcCc-hheeeeecCCcceee Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRRKQQA--VWKVKGLAEMCDDDDAQYAARLRLAQVDDNSGV-GRAIGIDAETEEYDV 77 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~~~~~--v~k~~~l~~~~~~~~~e~~~~~r~~~~~~~r~~-~~~~~iD~~~e~~~~ 77 (237) .++.+.+.|.....+......+....... ++++++......+.+....++++++-.-..-.| .+++++++ +-+|.. T Consensus 177 ~i~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~n~g~~~vl~~-g~~~~~ 255 (419) T protein:vir:80 177 PVLLHANAIGHAQAIQQYAGKSFMNGTALSGVIERPTDAPALKDQASVDRITDGWNAKFGGSGNAKKVALLQE-GMKFKP 255 (419) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhcCCCccEEEEecCCCCcccCHHHHHHHHHHHHHHhcCccccCCceecCC-CceEEe Confidence 67777788877777777777766664332 566653211111111122234444322222223 33456654 577877 Q ss_pred eecCcCC--HHHHHHHHHHHHhhhhcCceeeeeccCcccccccchhHHHHHHHHHHHHHHHhhhHHHHHHHHHhhc---- Q lcl|NC_019725. 78 LNSDISG--VPEFLSSKMDRIVSLSGIHEIIIKNKNVGGVSASQNTALETFYKLVDRKREEDYRPLLEFLLPFIVE---- 151 (237) Q Consensus 78 ~~~~lsG--l~dl~~~~~~~iaa~s~iP~t~L~G~sp~GlnatGe~D~~nyyd~I~~~Qe~~l~p~l~~l~~~i~~---- 151 (237) ++.+... +-+......+.||.+-|||...|.....+.. ++-+.-...||..+ |.|.+.++-..+-+ T Consensus 256 l~~s~~d~q~~e~~~~~~~~Ia~~fgVPp~llg~~~~~t~-~n~e~~~~~f~~~~-------l~P~~~~ie~~l~~kll~ 327 (419) T protein:vir:80 256 LSMTNVDAALIDALRLSALDIARIYKIPAHMVNELERATF-SNIEHQSLQFVIYT-------LLPWVKRHEQAKTRDLLL 327 (419) T ss_pred ccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCCCc-ccHHHHHHHHHHHH-------HHHHHHHHHHHHhhhccC Confidence 7655432 3355667789999999999887754333333 23344555666543 77777766444321 Q ss_pred -C--CCceeEe--CCCCCCCHHHHHHHHHHHHHHHHHHHhCCCCCHHHHHHHHHhhccccccCCCCCCCh-hccccCCCC Q lcl|NC_019725. 152 -E--EEWSIEF--EPLSVPSKKEESEITKNNVESVTKAITEQIIDLEEARDTLRSIAPEFKLKDGNNINI-REPEETTEP 225 (237) Q Consensus 152 -s--~~~~~~f--~pL~~~seke~Aei~~~~A~a~~~~~~~g~i~~~e~r~~l~~~~~~~g~~~~~~~~~-~~~e~~~e~ 225 (237) . .++.|+| .-|...|.++++ +++.+++++|+++++|+|+.+ ...+..| ++..-. -..-....+ T Consensus 328 ~~~~~~~~i~fd~~~l~~~d~~~~~-------~~~~~~~~~G~~T~NE~R~~~-g~~p~~g---GD~~~~~~n~~~~~~~ 396 (419) T protein:vir:80 328 PSERKQYFIEYNLAGLLRGDQSSRY-------AAYAVGRQWGWLSINDIRRLE-NMPPVKG---GDIYLSPMNMVDASKP 396 (419) T ss_pred ccccCCeEEEEechhhhccCHHHHH-------HHHHHHHhCCCcCHHHHHHHh-CCCCCCC---cceeeecccccccccc Confidence 1 2444555 466666666654 467778999999999999875 2222222 111000 000011111 Q ss_pred CC-CCCCCcCcCC Q lcl|NC_019725. 226 EP-GLGEKLEDEN 237 (237) Q Consensus 226 ~~-~~~~~~~~e~ 237 (237) ++ ..|+..+.++ T Consensus 397 ~~~~~~~~~~~~~ 409 (419) T protein:vir:80 397 QPIPMGKTEPTKA 409 (419) T ss_pred ccccCCCCCchhh Confidence 11 1222222223 No 47 >protein:vir:93610 Length: 454 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:157 # MgeName: phi 4795 # Cross-refs: genbank:acc:YP_001449295;genbank:gi:157166043;interpro:IPR006427;interpro:IPR006944;uniprot:Q6H9U6;genbank:GeneID:5580432 Probab=98.82 E-value=1.8e-09 Score=68.47 Aligned_cols=216 Identities=14% Similarity=0.129 Sum_probs=114.0 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHhcc--ceeechhHHHhhcCCchHHHHHHHHHHHHHhcCchheeeeecCCcceeee Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRRKQQ--AVWKVKGLAEMCDDDDAQYAARLRLAQVDDNSGVGRAIGIDAETEEYDVL 78 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~~~~--~v~k~~~l~~~~~~~~~e~~~~~r~~~~~~~r~~~~~~~iD~~~e~~~~~ 78 (237) .++.+.+.|.....+......+...... .++++++ .+. ++....++++++......+..+.+++++ +-+|+.+ T Consensus 193 p~~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~---~l~-~e~~~~~~~~~~~~~~g~n~g~~~vl~~-g~~~~~l 267 (454) T protein:vir:93 193 PVYAAGLAATQGHHIQENSTSFFRNGGRPSGVIEIPG---SIT-EENAKKLKSNWDSGYTGENAGKTAILSN-GAKYNPT 267 (454) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEecCC---CCC-HHHHHHHHHHHHHHhcccccCCceeccC-CceEEEc Confidence 6677777777777777777776655333 4677764 222 2233456666665444433344566765 4678877 Q ss_pred ecCcCCH--HHHHHHHHHHHhhhhcCceeeeeccCcccccccchhHHHHHHHHHHHHHHHhhhHHHHHHHHHhh----cC Q lcl|NC_019725. 79 NSDISGV--PEFLSSKMDRIVSLSGIHEIIIKNKNVGGVSASQNTALETFYKLVDRKREEDYRPLLEFLLPFIV----EE 152 (237) Q Consensus 79 ~~~lsGl--~dl~~~~~~~iaa~s~iP~t~L~G~sp~GlnatGe~D~~nyyd~I~~~Qe~~l~p~l~~l~~~i~----~s 152 (237) +.+.... -+........||.+-|||...| |..-++-.++-+.-.+.|| +..|.|.+.++-..+. .. T Consensus 268 ~~~~~d~q~le~~~~~~~~Ia~~fgVPp~~l-g~~~~~t~sn~e~~~~~f~-------~~~l~P~~~~ie~~ln~~L~~~ 339 (454) T protein:vir:93 268 TFSPVDSQTVEQLKMTAEIVCSVFRVPAYKI-GVGQPPSSDNVEALEQQYY-------SQCLQTLIESIELLLDEALETG 339 (454) T ss_pred ccChhHHHHHHHHHHHHHHHHHHhCCCHHHc-CCCCCCcchhHHHHHHHHH-------HHHHHHHHHHHHHHHHHhhcCC Confidence 7654322 2344566789999999998766 4333322222222223333 3457787777644432 23 Q ss_pred CCceeEe--CCCCCCCHHHHHHHHHHHHHHHHHHHhCCCCCHHHHHHHHHhhcccccc----CCCCCCChhccc-----c Q lcl|NC_019725. 153 EEWSIEF--EPLSVPSKKEESEITKNNVESVTKAITEQIIDLEEARDTLRSIAPEFKL----KDGNNINIREPE-----E 221 (237) Q Consensus 153 ~~~~~~f--~pL~~~seke~Aei~~~~A~a~~~~~~~g~i~~~e~r~~l~~~~~~~g~----~~~~~~~~~~~e-----~ 221 (237) .+..|+| ..|...|.+++ ++++.+++++|+++++|+|+.+- ..+..|- .+......+... + T Consensus 340 ~~~~~~f~~~~ll~~D~~~r-------~~~~~~~~~~G~~T~NE~R~~~g-l~pi~ggD~~~~~~~~~~~~~~~~~~~~~ 411 (454) T protein:vir:93 340 ENESTEFDVTTLLRMDSERR-------MKTLGDAVKNTLLTPNEARKREN-LPPLAGGDALYLQQQNYSLEALSRRDARE 411 (454) T ss_pred CCcEEEeechhhhccCHHHH-------HHHHHHHHhCCCcCHHHHHHHhC-CCCCCCCCeeeeccCccchHhhhccCccc Confidence 4545554 45666666554 55788899999999999999763 2332220 011111111110 0 Q ss_pred CCC-------CCCCC-----C--CCcCcCC Q lcl|NC_019725. 222 TTE-------PEPGL-----G--EKLEDEN 237 (237) Q Consensus 222 ~~e-------~~~~~-----~--~~~~~e~ 237 (237) .+. .+|.. + ++.+.++ T Consensus 412 ~~~~~~~~~~~~~~~~~~~d~~~~~~e~~~ 441 (454) T protein:vir:93 412 DPFASSGKTASVPQAVAASDGNKAITETEH 441 (454) T ss_pred CCCCCCccCCCCCCCCCCCCCCCCccCCcc Confidence 000 00000 0 0011111 No 48 >protein:vir:100691 Length: 535 # NCBI annotation: hypothetical protein # Family: family:all:2446 # MgeID: mge:1633 # MgeName: LP65 # Cross-refs: genbank:acc:YP_164747;genbank:gi:56693160;genbank:GeneID:3197324 Probab=98.82 E-value=1.5e-09 Score=68.95 Aligned_cols=228 Identities=14% Similarity=0.128 Sum_probs=121.1 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHhcc--ceeechhHHHhhcCCchHHHHHHHHHHHHHhcCchhe-eeeecCCcceee Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRRKQQ--AVWKVKGLAEMCDDDDAQYAARLRLAQVDDNSGVGRA-IGIDAETEEYDV 77 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~~~~--~v~k~~~l~~~~~~~~~e~~~~~r~~~~~~~r~~~~~-~~iD~~~e~~~~ 77 (237) -++.+.+.|.....+......+...... .++++++....--..+....+++.+.-.-..-+|.+- .++.+++-+|.. T Consensus 245 pi~~~~~~i~~~~aa~~~~~~~f~ng~~p~giL~~~~~~~~~ls~e~~e~lk~~~~~~~~G~~nag~~~vl~~~g~~~~~ 324 (535) T protein:vir:10 245 PVEASIPLIRAIYDTEQFNARFFSQGGTTRGILVIDQDGDAQANQMMLAGIRRQWTSQGSGLGGAWKIPILAAKDAKFVN 324 (535) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEecCCCCcccCHHHHHHHHHHHHHHhcCcccccccccccCCCceEEe Confidence 3677888888888888888887776443 4677764211101112222344433322222234343 455554556666 Q ss_pred eecCcCC--HHHHHHHHHHHHhhhhcCceeeeeccCcccccccchhHHHHHHHHHHHHHHHh----hhHHHHHHHHHhh- Q lcl|NC_019725. 78 LNSDISG--VPEFLSSKMDRIVSLSGIHEIIIKNKNVGGVSASQNTALETFYKLVDRKREED----YRPLLEFLLPFIV- 150 (237) Q Consensus 78 ~~~~lsG--l~dl~~~~~~~iaa~s~iP~t~L~G~sp~GlnatGe~D~~nyyd~I~~~Qe~~----l~p~l~~l~~~i~- 150 (237) ++.+... +-+........||.+-|||-..|--..-+..++...+-...|.+.++..+... |.|.+.++-..|- T Consensus 325 l~~~~~D~qfle~~~~~~~eIa~afgVPp~~lG~~~~at~sn~~~~~~~~~~s~~E~~~~~~~~~~L~P~l~~ie~~ln~ 404 (535) T protein:vir:10 325 MTQNSRDMEFDKFLNFMIYDTAAIFQMQPEEINFPNNGGSTGKSGTKSVNEGSTAKAKLESSKDKGLTPLLSFIEQVIND 404 (535) T ss_pred cCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhccccCcccccchhhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhh Confidence 5554432 22344567889999999999988544555565444455566777776666544 7777766655442 Q ss_pred ---c--CCCceeEeCCCCCCCHHHHHHHHHHHHHHHHHHHhCCCCCHHHHHHHHHhhccccccC-CC--------CCC-- Q lcl|NC_019725. 151 ---E--EEEWSIEFEPLSVPSKKEESEITKNNVESVTKAITEQIIDLEEARDTLRSIAPEFKLK-DG--------NNI-- 214 (237) Q Consensus 151 ---~--s~~~~~~f~pL~~~seke~Aei~~~~A~a~~~~~~~g~i~~~e~r~~l~~~~~~~g~~-~~--------~~~-- 214 (237) . ..++.|+|+-|...+.++++++. +.+. .|+++++|+|+.+- ..+..|-. +. .++ T Consensus 405 ~Ll~~~~~~~~f~f~~l~~~d~~~r~~~~-------~~~~-~g~lT~NE~R~~~g-l~piegGD~~~~~~~~~~~~~~~~ 475 (535) T protein:vir:10 405 KIMRYVDTDYRFSFTLGDAQDKLQEEQVW-------KLKL-ANGYFINEYRKDHG-LKTVDGLDVPGFIGSAENFINATG 475 (535) T ss_pred hcccccCCeEEEEeccccccCHHHHHHHH-------HHHH-cCCCCHHHHHHHhC-CCCCCCccccccccchhhcccccc Confidence 1 24788999999999888777653 2222 47799999999762 22221100 00 000 Q ss_pred ----Chhcccc----------------------CCCCCCCCCCCcCcCC Q lcl|NC_019725. 215 ----NIREPEE----------------------TTEPEPGLGEKLEDEN 237 (237) Q Consensus 215 ----~~~~~e~----------------------~~e~~~~~~~~~~~e~ 237 (237) ...+..+ ...++|....+...+| T Consensus 476 ~~~~~~p~~~~~~~~~~~~~~~q~~~~~~~~~~~g~~~~~~~~~~~~~~ 524 (535) T protein:vir:10 476 FGQPNVPDSSDDSGSTLGERERQERIQHSKDYEKGKDDPKSPLPKPSES 524 (535) T ss_pred cccccCCCCCCCccccCCccccCcccccccccccCCCCCCCCCCcCCCC Confidence 0000000 0001111111111111 No 49 >protein:vir:1884 Length: 424 # NCBI annotation: head portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:41 # MgeName: HK022 # Cross-refs: genbank:acc:NP_037664;genbank:gi:9634122;genbank:GeneID:1262519 Probab=98.81 E-value=1.3e-09 Score=69.38 Aligned_cols=207 Identities=8% Similarity=0.022 Sum_probs=119.8 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHhcc--ceeechhHHHhhcCCchHHHHHHHHHHHHHhcCchheeeeecCCcceeee Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRRKQQ--AVWKVKGLAEMCDDDDAQYAARLRLAQVDDNSGVGRAIGIDAETEEYDVL 78 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~~~~--~v~k~~~l~~~~~~~~~e~~~~~r~~~~~~~r~~~~~~~iD~~~e~~~~~ 78 (237) .++.+.+.|.....+......+...... .++++++ ..+ +++....++++++-.....+..+.++++. +-+|+.+ T Consensus 196 pi~~~~~~i~~~~a~~~~~~~~f~ng~~p~gil~~~~--~~l-~~e~~~~~~~~~~~~~~g~nag~~~vl~~-g~~~~~l 271 (424) T protein:vir:18 196 PIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGE--KVL-TEQQRSQVEENFKEIAGGPVKKRLWILEA-GFSTSAI 271 (424) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHccCCcceEEEeCC--cCC-CHHHHHHHHHHHHHHhCCcccCCceeccC-CceEEec Confidence 5678888888888888888887776543 3666642 122 23334556777765544444455677765 5678777 Q ss_pred ecCcCC--HHHHHHHHHHHHhhhhcCceeeeeccCccc-cccc-chhHHHHHHHHHHHHHHHhhhHHHHHHHHHhhc--- Q lcl|NC_019725. 79 NSDISG--VPEFLSSKMDRIVSLSGIHEIIIKNKNVGG-VSAS-QNTALETFYKLVDRKREEDYRPLLEFLLPFIVE--- 151 (237) Q Consensus 79 ~~~lsG--l~dl~~~~~~~iaa~s~iP~t~L~G~sp~G-lnat-Ge~D~~nyyd~I~~~Qe~~l~p~l~~l~~~i~~--- 151 (237) +.+... +-+........||.+-|||-..| |...++ ..++ -+.....||. .-|+|.+.++-..|-+ T Consensus 272 ~~~~~d~q~le~~~~~~~~Ia~~fgVPp~~l-g~~~~~t~~~sn~eq~~~~f~~-------~tl~P~~~~ie~~l~~~L~ 343 (424) T protein:vir:18 272 GVTPQDAEMMASRKFQVSELARFFGVPPHLV-GDVEKSTSWGSGIEQQNLGFLQ-------YTLQPYISRWENSIQRWLI 343 (424) T ss_pred CCChhHHHHHHHHHHHHHHHHHHhCCCHHHh-CCCCCcccccccHHHHHHHHHH-------HHHHHHHHHHHHHHHhhcC Confidence 655432 23355677788999999997776 544333 3222 2334455553 4678888877544432 Q ss_pred -C---CC--ceeEeCCCCCCCHHHHHHHHHHHHHHHHHHHhCCCCCHHHHHHHHHhhcccccc----CCCCCCChhcccc Q lcl|NC_019725. 152 -E---EE--WSIEFEPLSVPSKKEESEITKNNVESVTKAITEQIIDLEEARDTLRSIAPEFKL----KDGNNINIREPEE 221 (237) Q Consensus 152 -s---~~--~~~~f~pL~~~seke~Aei~~~~A~a~~~~~~~g~i~~~e~r~~l~~~~~~~g~----~~~~~~~~~~~e~ 221 (237) . .. +.|.+..|...|.+++ ++++.+++++|+++++|+|+.+ ...+..|- .+......++. T Consensus 344 ~~~~~~~~~~~fd~~~llr~d~~~r-------~~~~~~~~~~G~~T~NE~R~~~-gl~pi~gGD~~~~~~n~~~l~~~-- 413 (424) T protein:vir:18 344 PAKDVGRIHAEHNLDGLLRGDSASR-------AAFMKAMGEAGLRTINEMRRTD-NLPPLPGGDVAMRQSQYVPITDL-- 413 (424) T ss_pred CccccCCeEEEEechhhhccCHHHH-------HHHHHHHHhCCCcCHHHHHHHh-CCCCCCCcCeeeeccCccchHhh-- Confidence 2 23 4455567777777766 5667788999999999999976 33332220 00000111111 Q ss_pred CCCCCCCCCCCcCcCC Q lcl|NC_019725. 222 TTEPEPGLGEKLEDEN 237 (237) Q Consensus 222 ~~e~~~~~~~~~~~e~ 237 (237) ++..+++| T Consensus 414 --------~~~~~p~~ 421 (424) T protein:vir:18 414 --------GTNKEPRN 421 (424) T ss_pred --------hccCCCcc Confidence 11111111 No 50 >protein:vir:105064 Length: 421 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1465 # MgeName: phiKO2 # Cross-refs: genbank:acc:YP_006584;genbank:gi:46402090;genbank:GeneID:2777930 Probab=98.81 E-value=1.1e-09 Score=69.73 Aligned_cols=217 Identities=10% Similarity=0.018 Sum_probs=114.8 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHhcc--ceeechhHHHhhcCCchHHHHHHHHHHHHHhcCc-hheeeeecCCcceee Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRRKQQ--AVWKVKGLAEMCDDDDAQYAARLRLAQVDDNSGV-GRAIGIDAETEEYDV 77 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~~~~--~v~k~~~l~~~~~~~~~e~~~~~r~~~~~~~r~~-~~~~~iD~~~e~~~~ 77 (237) .++.+.+.+.....+.....++...... .++++++-..-..+.+...++++++.-.-..-.| .+.++++. +-+|.. T Consensus 180 pi~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~~~~~~~~e~~~~~~~~~~~~~~g~~n~~~~~vl~~-g~~~~~ 258 (421) T protein:vir:10 180 PIQTNADVLGLNLAVEEHASAVFRRGATMSGVIERPKEAPAIKSQEKIDQLLAKWTDRYSGINNMFSVALLQE-GMSYKQ 258 (421) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhcCCCccEEEEecCccCccCCHHHHHHHHHHHHHHhcCccccCcceecCC-CceEEe Confidence 5677778888878888888887766332 3666653111111222222344444332222223 34566665 567888 Q ss_pred eecCcCC--HHHHHHHHHHHHhhhhcCceeeeeccCcccccccchhHHHHHHHHHHHHHHHhhhHHHHHHHHHhhc---- Q lcl|NC_019725. 78 LNSDISG--VPEFLSSKMDRIVSLSGIHEIIIKNKNVGGVSASQNTALETFYKLVDRKREEDYRPLLEFLLPFIVE---- 151 (237) Q Consensus 78 ~~~~lsG--l~dl~~~~~~~iaa~s~iP~t~L~G~sp~GlnatGe~D~~nyyd~I~~~Qe~~l~p~l~~l~~~i~~---- 151 (237) ++.+... +-+........||.+-|||-..| |...++=.++-+.-...||. .-|.|.+.++-..+-+ T Consensus 259 l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~l-g~~~~~t~sn~e~~~~~f~~-------~tl~P~~~~ie~~ln~kL~~ 330 (421) T protein:vir:10 259 MSQDNEKAQLLQSRQWGVEEVCRLYKIPPHMV-QMLAKATNNNIEHQGLQFVM-------YTLLAWLKRHEGALQRDLLL 330 (421) T ss_pred cCCChhHHHHHHHHHHhHHHHHHHhCCCHHHc-CCCcCCccccHHHHHHHHHH-------HHHHHHHHHHHHHHhhhccC Confidence 7765532 23344567888999999997666 43333322333444455554 3477777766444321 Q ss_pred C---CC--ceeEeCCCCCCCHHHHHHHHHHHHHHHHHHHhCCCCCHHHHHHHHHhhccccccCCCCC----CChhccccC Q lcl|NC_019725. 152 E---EE--WSIEFEPLSVPSKKEESEITKNNVESVTKAITEQIIDLEEARDTLRSIAPEFKLKDGNN----INIREPEET 222 (237) Q Consensus 152 s---~~--~~~~f~pL~~~seke~Aei~~~~A~a~~~~~~~g~i~~~e~r~~l~~~~~~~g~~~~~~----~~~~~~e~~ 222 (237) . .+ +.|.+..|...|.++++ +++.+++++|+++++|+|+.+- ..+..| ++. ......+.. T Consensus 331 ~~~~~~~~v~fd~~~l~~~d~~~~~-------~~~~~~~~~G~~T~NE~R~~~g-l~p~~g---gD~~~~~~n~~~~~~~ 399 (421) T protein:vir:10 331 PSERRDLYIEFNVSGLLRGDQKSRY-------ESYALGRQWGWLSVNDIRRMEN-LPPIAG---GDKYLTPLNMVDSAQI 399 (421) T ss_pred ccccCCeEEEEechhhhccCHHHHH-------HHHHHHHhCCCcCHHHHHHHhC-CCCCCC---cceeeecccccccccc Confidence 1 23 45556688888888765 4677789999999999999763 222222 111 010111111 Q ss_pred CCCCCCCCCCcCc-CC Q lcl|NC_019725. 223 TEPEPGLGEKLED-EN 237 (237) Q Consensus 223 ~e~~~~~~~~~~~-e~ 237 (237) ...+..+.+.... ++ T Consensus 400 ~~~~~~~~~~~~~e~d 415 (421) T protein:vir:10 400 IPGDKKPTAQQMAEID 415 (421) T ss_pred ccCCCCcccccCcccc Confidence 1111111111111 11 No 51 >protein:vir:10362 Length: 432 # NCBI annotation: head portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:183 # MgeName: Xp10 # Cross-refs: genbank:acc:NP_858954;genbank:gi:32128419;genbank:GeneID:2648396 Probab=98.79 E-value=4.5e-09 Score=66.36 Aligned_cols=213 Identities=13% Similarity=0.154 Sum_probs=123.4 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHh--ccceeechhHHHhhcCCchHHHHHHHHHHHHHhcCchheeeeecCCcceeee Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRRK--QQAVWKVKGLAEMCDDDDAQYAARLRLAQVDDNSGVGRAIGIDAETEEYDVL 78 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~~--~~~v~k~~~l~~~~~~~~~e~~~~~r~~~~~~~r~~~~~~~iD~~~e~~~~~ 78 (237) .++.+.+.|.....+......+.... --.|+++++ .++ ++....+.+++. ...+..+.+++++ +-+|+.+ T Consensus 195 pi~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~---~l~-~e~~~~~~~~~~---~~~nag~~~vl~~-g~~~~~l 266 (432) T protein:vir:10 195 AIRYGAQIFGTAIAAEAQAARAFRNGQLQSVYYQIDR---FLT-DDQYDSFAKKVS---GSVEAGRAPLLEG-GMDVKSL 266 (432) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhcCCCcceEEecCC---CCC-HHHHHHHHHHHh---hhhhCCCceecCC-CceEEEc Confidence 66777777877777777777765432 233566653 222 222233444433 3344456677775 5788887 Q ss_pred ecCcCC--HHHHHHHHHHHHhhhhcCceeeeeccCcccccccch---hHHHHHHHHHHHHHHHhhhHHHHHHHHHhh--- Q lcl|NC_019725. 79 NSDISG--VPEFLSSKMDRIVSLSGIHEIIIKNKNVGGVSASQN---TALETFYKLVDRKREEDYRPLLEFLLPFIV--- 150 (237) Q Consensus 79 ~~~lsG--l~dl~~~~~~~iaa~s~iP~t~L~G~sp~GlnatGe---~D~~nyyd~I~~~Qe~~l~p~l~~l~~~i~--- 150 (237) +.+... +-+........||.+-|||-..| |....|=+++|. .-...||. ..|+|.+.++-..|- T Consensus 267 ~~~~~d~q~le~~~~~~~~Ia~afgVPp~~l-g~~~~~t~~~~sn~e~~~~~f~~-------~tl~P~~~~ie~~ln~kL 338 (432) T protein:vir:10 267 GLNPVDAQLLQSRQYSVESICRFFGVPPSMI-GHSSAGTTSWGSGIESQQLGFLS-------MTLSPWLRRIEQSIALNL 338 (432) T ss_pred cCChHHHHHHHHHHHHHHHHHHHhCCCHHHc-CCccCCcccccchHHHHHHHHHH-------HHHHHHHHHHHHHHHhhh Confidence 766543 23455778889999999999776 555444333332 22334442 346777776644432 Q ss_pred -cC---CCceeEe--CCCCCCCHHHHHHHHHHHHHHHHHHHhCCCCCHHHHHHHHHhhccccccC-----CCCCCChhcc Q lcl|NC_019725. 151 -EE---EEWSIEF--EPLSVPSKKEESEITKNNVESVTKAITEQIIDLEEARDTLRSIAPEFKLK-----DGNNINIREP 219 (237) Q Consensus 151 -~s---~~~~~~f--~pL~~~seke~Aei~~~~A~a~~~~~~~g~i~~~e~r~~l~~~~~~~g~~-----~~~~~~~~~~ 219 (237) .. ..+.|+| ..|...|.+++ ++++++++++|+++++|+|+.+ ...+..|-. ...-...+.. T Consensus 339 ~~~~~~~~~~~~fd~~~ll~~d~~~r-------~~~~~~~~~~G~~T~NE~R~~~-glppi~g~~~~~~~~~~~~pl~~~ 410 (432) T protein:vir:10 339 LSPAERRRYFADFDTSALLRADSAAR-------SSYYSQLVNNGLMTRDEAREIE-GLPKLGGNAAVLTVQSAMVPLDSI 410 (432) T ss_pred cCccccCceEEEeechhhhccCHHHH-------HHHHHHHHhCCCCCHHHHHHHh-CCCCCCCCcceEeecCcccchhhh Confidence 11 2344555 57777777765 4577788999999999999976 333322210 0111122333 Q ss_pred ccCCCCCCCCCCCcCcCC Q lcl|NC_019725. 220 EETTEPEPGLGEKLEDEN 237 (237) Q Consensus 220 e~~~e~~~~~~~~~~~e~ 237 (237) .+.+.++|..+++.+++| T Consensus 411 ~~~~~~~~~~~~~~~~~~ 428 (432) T protein:vir:10 411 GLQASPEPASGLGNQQQD 428 (432) T ss_pred cccCCCCCCCCCCCcccc Confidence 344556666666666666 No 52 >protein:vir:101648 Length: 518 # NCBI annotation: gp11 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1515 # MgeName: 244 # Cross-refs: genbank:acc:YP_654766;genbank:gi:109302764;genbank:GeneID:4156082 Probab=98.78 E-value=3.3e-09 Score=67.11 Aligned_cols=216 Identities=13% Similarity=0.143 Sum_probs=112.3 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHhccc--eeechhHHHhhcCCchHHHHHHHHHHHHHhcCch-heeeeecCCcceee Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRRKQQA--VWKVKGLAEMCDDDDAQYAARLRLAQVDDNSGVG-RAIGIDAETEEYDV 77 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~~~~~--v~k~~~l~~~~~~~~~e~~~~~r~~~~~~~r~~~-~~~~iD~~~e~~~~ 77 (237) .++.+...|.....+......+....... ++++++ .++. +....++++++-....-.|. ++++++. +-+|.. T Consensus 189 pi~~a~~~i~~~~a~~~~~~~~f~ng~~p~gil~~~~---~ls~-e~~~~~k~~~~~~~~G~~nag~v~vL~~-G~~~~~ 263 (518) T protein:vir:10 189 LMESLKSTIFSEDSSRNATAAMWKNAGRPNLVLRHEK---RLSE-AAQQRLREQFDRAHSGSSNTGKTMVVEE-GMEPIP 263 (518) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhcCCCccEEEecCC---CCCH-HHHHHHHHHHHHHhcCccccCcceEcCC-CceEEE Confidence 56677777887777777777776664432 566653 2222 22334555444332222333 3556654 577877 Q ss_pred eecCcCC--HHHHHHHHHHHHhhhhcCceeeeeccCcccccccchhHHHHHHHHHHHHHHHhhhHHHHHHHHHhhc---- Q lcl|NC_019725. 78 LNSDISG--VPEFLSSKMDRIVSLSGIHEIIIKNKNVGGVSASQNTALETFYKLVDRKREEDYRPLLEFLLPFIVE---- 151 (237) Q Consensus 78 ~~~~lsG--l~dl~~~~~~~iaa~s~iP~t~L~G~sp~GlnatGe~D~~nyyd~I~~~Qe~~l~p~l~~l~~~i~~---- 151 (237) ++.+... +-+........||.+-|||-.+| |..-++=.++-+.-...||.. .|.|.+.++-..+-+ T Consensus 264 l~~s~~D~q~le~r~~~~~eIa~afgVPp~~l-g~~~~~t~sn~eq~~~~f~~~-------tL~P~l~~ie~~ln~~L~~ 335 (518) T protein:vir:10 264 LQLTAVEMQFIEARQLNREEVCGVYDIAPPIV-HILDRATFSNISAQMRAFYRD-------TMAIPIARIQSAMDKYVGQ 335 (518) T ss_pred ccCChhHHHHHHHHHHHHHHHHHHhCCCHHHh-ccCCCCCchhHHHHHHHHHHH-------HHHHHHHHHHHHHHHhhcc Confidence 7654432 23344566789999999998777 544332122334445555543 377777666544321 Q ss_pred --CCCceeEe--CCCCCCCHHHHHHHHHHHHHHHHHHHhCCCCCHHHHHHHHHhhccccccCCC-----CCCC-hhcc-- Q lcl|NC_019725. 152 --EEEWSIEF--EPLSVPSKKEESEITKNNVESVTKAITEQIIDLEEARDTLRSIAPEFKLKDG-----NNIN-IREP-- 219 (237) Q Consensus 152 --s~~~~~~f--~pL~~~seke~Aei~~~~A~a~~~~~~~g~i~~~e~r~~l~~~~~~~g~~~~-----~~~~-~~~~-- 219 (237) ...+.|+| ..|...|.+++ ++++..++++|+++++|+|+.+- ..+..+..++ .+.. .... T Consensus 336 ~~~~~~~~~fd~~~llr~D~~~r-------~~~~~~~~~~G~lT~NE~R~~~G-l~pie~~~gD~~~~~~n~~pl~~~~~ 407 (518) T protein:vir:10 336 YWVRKNRMKFDIDDVIQPDWEAK-------SESTQKMVNSGVATPNEGREIMG-LPRSDDPKADELYANSALQPLGATPD 407 (518) T ss_pred cccCCceEEEechhhhccCHHHH-------HHHHHHHHhCCCcCHHHHHHHhC-CCCCCCCCCCeeeecccceecccccc Confidence 12445555 47767776654 66788899999999999998763 2222110000 0000 0000 Q ss_pred -----ccCC-CCCCCC---CCCcCcCC Q lcl|NC_019725. 220 -----EETT-EPEPGL---GEKLEDEN 237 (237) Q Consensus 220 -----e~~~-e~~~~~---~~~~~~e~ 237 (237) ++.+ .+++.. ++..++.+ T Consensus 408 ~~~~g~~~~~~~~~~~~~~~~~~~~~~ 434 (518) T protein:vir:10 408 GAVEGEEAPAPKRPASTPVASLDQSPP 434 (518) T ss_pred cccCCCCCCCCCCCCcccccccccccc Confidence 0000 000000 00001111 No 53 >protein:vir:96980 Length: 409 # NCBI annotation: ORF008 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1643 # MgeName: 42e # Cross-refs: genbank:acc:YP_239857;genbank:gi:66395516;genbank:GeneID:5133013 Probab=98.78 E-value=2.4e-09 Score=67.82 Aligned_cols=213 Identities=10% Similarity=0.079 Sum_probs=109.5 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHhccceeechhHHHhhcCCchHHHHHHHHHHHHHhcCchheeeeecCCcceeeeec Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRRKQQAVWKVKGLAEMCDDDDAQYAARLRLAQVDDNSGVGRAIGIDAETEEYDVLNS 80 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~~~~~v~k~~~l~~~~~~~~~e~~~~~r~~~~~~~r~~~~~~~iD~~~e~~~~~~~ 80 (237) .+..+.+.+.....+......-.....--+++.+. .++. +....+++++. +.+.++.+.+++++ +-+|+.++. T Consensus 181 ~l~~~~~~i~~~~~~~~~~~~~~~~~~~~i~~~~~---~l~~-e~~~~~~~~~~--~~~~n~g~~~vl~~-g~~~~~l~~ 253 (409) T protein:vir:96 181 PIDVLKNTTDFDNAVRTFNLTEMQKPDSFMLKYGS---NVST-EKRQQVLEDFK--QYYEENGGILFQEP-GVEIEPLPK 253 (409) T ss_pred HHHHHHHHHHHHHHHHHHHHHhcCCCceeEEecCC---CCCH-HHHHHHHHHHH--HHhhcCCCeeecCC-CceEEEcCC Confidence 34444444443333322221111111112333321 1211 12233444443 33445555666654 578888776 Q ss_pred CcC--CHHHHHHHHHHHHhhhhcCceeeeeccCcccccccchhHHHHHHHHHHHHHHHhhhHHHHHHHHHhhc------- Q lcl|NC_019725. 81 DIS--GVPEFLSSKMDRIVSLSGIHEIIIKNKNVGGVSASQNTALETFYKLVDRKREEDYRPLLEFLLPFIVE------- 151 (237) Q Consensus 81 ~ls--Gl~dl~~~~~~~iaa~s~iP~t~L~G~sp~GlnatGe~D~~nyyd~I~~~Qe~~l~p~l~~l~~~i~~------- 151 (237) +.. .+-+........||.+-|||-..|-+...+..+ +-|...+.||..+ |.|.++++-..+-+ T Consensus 254 ~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~s-~~e~~~~~f~~~~-------l~P~~~~ie~~l~~~Ll~~~~ 325 (409) T protein:vir:96 254 KYVSEDIVASENLTRERVANVFQLPSIFLNARSNTNFA-KNEELNRFYLQHT-------LLPIVKQYEEEFNRKLLTKTD 325 (409) T ss_pred ChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcc-cHHHHHHHHHHHH-------HHHHHHHHHHHHHhhcCCccc Confidence 543 233345556788999999998888554433333 3344555666544 88888877555432 Q ss_pred -CCCceeEeC--CCCCCCHHHHHHHHHHHHHHHHHHHhCCCCCHHHHHHHHHhhcccccc----CCCCCCChhccccCCC Q lcl|NC_019725. 152 -EEEWSIEFE--PLSVPSKKEESEITKNNVESVTKAITEQIIDLEEARDTLRSIAPEFKL----KDGNNINIREPEETTE 224 (237) Q Consensus 152 -s~~~~~~f~--pL~~~seke~Aei~~~~A~a~~~~~~~g~i~~~e~r~~l~~~~~~~g~----~~~~~~~~~~~e~~~e 224 (237) ..+..|+|+ +|...|.++ +++++.+++++|+++++|+|+.+ ...+..|- -+..-...+..++. . T Consensus 326 ~~~g~~i~fd~~~ll~~d~~~-------~~e~~~~~~~~G~~T~NE~R~~~-g~~pi~ggD~~~~~~n~~~~~~~~~~-~ 396 (409) T protein:vir:96 326 REKNRYFKFNVKSYLRADSAT-------QAEVYFKAVRSGYYTINDIREWE-DLPPVEGGDKPLISGDLYPIDTPLEL-R 396 (409) T ss_pred ccCcceEEeechhhhccCHHH-------HHHHHHHHHhCCCCCHHHHHHHh-CCCCCCCcceeeecccccccccchhh-c Confidence 234566664 666666655 46677889999999999999987 33332221 01111111222111 2 Q ss_pred CCCCCCCCcCcCC Q lcl|NC_019725. 225 PEPGLGEKLEDEN 237 (237) Q Consensus 225 ~~~~~~~~~~~e~ 237 (237) ...++|+...+|. T Consensus 397 ~~~~gG~~n~~e~ 409 (409) T protein:vir:96 397 KSLKGGDKNVNES 409 (409) T ss_pred ccccCCCCCcCCC Confidence 2344455555555 No 54 >protein:vir:2683 Length: 412 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:57 # MgeName: phiSLT # Cross-refs: genbank:acc:NP_075502;genbank:gi:12719431;genbank:GeneID:920150 Probab=98.75 E-value=3e-09 Score=67.31 Aligned_cols=213 Identities=9% Similarity=0.068 Sum_probs=106.2 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHhccceeechhHHHhhcCCchHHHHHHHHHHHHHhcCchheeeeecCCcceeeeec Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRRKQQAVWKVKGLAEMCDDDDAQYAARLRLAQVDDNSGVGRAIGIDAETEEYDVLNS 80 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~~~~~v~k~~~l~~~~~~~~~e~~~~~r~~~~~~~r~~~~~~~iD~~~e~~~~~~~ 80 (237) -++.+.+.+.-...+......-.....-.+++.+. .+ +++....+++++. +.+.+..+.+++++ +.+|+.++. T Consensus 184 ~i~~~~~~i~~~~a~~~~~~~~~~~~~~~i~~~~~---~l-~~e~~~~~~~~~~--~~~~~~g~~~vl~~-g~~~~~l~~ 256 (412) T protein:vir:26 184 PIDVLKNTTDFDNAVRTFNLTEMQKPDSFMLKYGS---NV-GKEKRQQVLEDFK--QYYEENGGILFQEP-GVEIEPLPK 256 (412) T ss_pred HHHHHHHHHHHHHHHHHHHHHhcCCCCceEEecCC---CC-CHHHHHHHHHHHH--HHhhcCCCeeecCC-CceEEEcCC Confidence 23444443333333322211101111111222221 11 1111223444443 23344445555654 577887765 Q ss_pred CcC--CHHHHHHHHHHHHhhhhcCceeeeeccCcccccccchhHHHHHHHHHHHHHHHhhhHHHHHHHHHhhc------- Q lcl|NC_019725. 81 DIS--GVPEFLSSKMDRIVSLSGIHEIIIKNKNVGGVSASQNTALETFYKLVDRKREEDYRPLLEFLLPFIVE------- 151 (237) Q Consensus 81 ~ls--Gl~dl~~~~~~~iaa~s~iP~t~L~G~sp~GlnatGe~D~~nyyd~I~~~Qe~~l~p~l~~l~~~i~~------- 151 (237) +.. .+-+........||.+-|||-..|.+...+..+ +.+.-.+.||.. .|.|.+.+|-+.+-+ T Consensus 257 ~~~d~q~~e~~~~~~~~Ia~afgVPp~~lg~~~~~~~s-n~e~~~~~f~~~-------~l~P~~~~ie~~ln~kLl~~~~ 328 (412) T protein:vir:26 257 KYVSEDIVASENLTRERVANVFQLPSVFLNARSNTNFA-KNEELNRFYLQH-------TLLPIVKQYEEEFNRKLLTKTD 328 (412) T ss_pred ChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcc-cHHHHHHHHHHH-------HHHHHHHHHHHHHHhhcCCccc Confidence 433 223344456788999999999888665444333 344455566665 388888777544421 Q ss_pred -CCCceeE--eCCCCCCCHHHHHHHHHHHHHHHHHHHhCCCCCHHHHHHHHHhhcccccc----CCCCCCChhccccCCC Q lcl|NC_019725. 152 -EEEWSIE--FEPLSVPSKKEESEITKNNVESVTKAITEQIIDLEEARDTLRSIAPEFKL----KDGNNINIREPEETTE 224 (237) Q Consensus 152 -s~~~~~~--f~pL~~~seke~Aei~~~~A~a~~~~~~~g~i~~~e~r~~l~~~~~~~g~----~~~~~~~~~~~e~~~e 224 (237) ..+..|+ +.+|...|.+++ ++++.+++++|+++++|+|+.+- ..+..|. .+......+.+++. . T Consensus 329 ~~~~~~~~fd~~~l~~~d~~~~-------~~~~~~~~~~G~~t~NE~R~~~g-l~p~~ggD~~~~~~n~~~~~~~~~~-~ 399 (412) T protein:vir:26 329 REKNRYFKFNVKSYLRADSATQ-------AEVYFKAVRSGYYTINDIREWED-LPPVEGGDKPLISGDLYPIDTPLEL-R 399 (412) T ss_pred ccCcceEEeechhhhccCHHHH-------HHHHHHHHhCCCcCHHHHHHHhC-CCCCCCcCeeeecccccccccchhh-c Confidence 1234444 557777777765 56678889999999999999873 2332221 01111111222111 2 Q ss_pred CCCCCCCCcCcCC Q lcl|NC_019725. 225 PEPGLGEKLEDEN 237 (237) Q Consensus 225 ~~~~~~~~~~~e~ 237 (237) ...++|++...|+ T Consensus 400 ~~~~gG~~n~~e~ 412 (412) T protein:vir:26 400 KSLKGGDKNVNES 412 (412) T ss_pred ccccCCCCCcCCC Confidence 2344555555555 No 55 >protein:vir:189 Length: 424 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:6 # MgeName: HK97 # Cross-refs: genbank:acc:NP_037699;genbank:gi:9634156;genbank:GeneID:1262529 Probab=98.74 E-value=3.1e-09 Score=67.27 Aligned_cols=210 Identities=8% Similarity=0.017 Sum_probs=118.3 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHhcc--ceeechhHHHhhcCCchHHHHHHHHHHHHHhcCchheeeeecCCcceeee Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRRKQQ--AVWKVKGLAEMCDDDDAQYAARLRLAQVDDNSGVGRAIGIDAETEEYDVL 78 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~~~~--~v~k~~~l~~~~~~~~~e~~~~~r~~~~~~~r~~~~~~~iD~~~e~~~~~ 78 (237) .++.+.+.|.....+......+...... .++++++ ..+ +++....++++++-.....+..+.++++. +-+|+.+ T Consensus 196 pi~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~--~~l-~~e~~~~~~~~~~~~~~~~nag~~~vl~~-g~~~~~l 271 (424) T protein:vir:18 196 PIAFACKSAGVAVAMEDQQRDFFANGAKSPQILSTGE--KVL-TEQQRSQVEENFKEIAGGPVKKRLWILEA-GFSTSAI 271 (424) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCC--cCC-CHHHHHHHHHHHHHHhCCcccCCceeccC-CceEEec Confidence 6677888888888888888887776433 3666643 112 23334456666665444444445677765 5678777 Q ss_pred ecCcCC--HHHHHHHHHHHHhhhhcCceeeeeccCccc-ccccc-hhHHHHHHHHHHHHHHHhhhHHHHHHHHHhhc--- Q lcl|NC_019725. 79 NSDISG--VPEFLSSKMDRIVSLSGIHEIIIKNKNVGG-VSASQ-NTALETFYKLVDRKREEDYRPLLEFLLPFIVE--- 151 (237) Q Consensus 79 ~~~lsG--l~dl~~~~~~~iaa~s~iP~t~L~G~sp~G-lnatG-e~D~~nyyd~I~~~Qe~~l~p~l~~l~~~i~~--- 151 (237) +.+... +-+........||.+-|||-..| |...++ ..++. +.-...|| +..|.|.+.++-..|-+ T Consensus 272 ~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~l-g~~~~~t~~~sn~eq~~~~f~-------~~tl~P~~~~ie~~ln~~L~ 343 (424) T protein:vir:18 272 GVTPQDAEMMASRKFQVSELARFFGVPPHLV-GDVEKSTSWGSGIEQQNLGFL-------QYTLQPYISRWENSIQRWLI 343 (424) T ss_pred CCChhHHHHHHHHHHhHHHHHHHhCCCHHHh-CCCCCcccccccHHHHHHHHH-------HHHHHHHHHHHHHHHHhhcC Confidence 655432 23455677789999999996666 655444 32222 33334454 35688888877555432 Q ss_pred -C---CCce--eEeCCCCCCCHHHHHHHHHHHHHHHHHHHhCCCCCHHHHHHHHHhhcccccc----CCCCCCChhcccc Q lcl|NC_019725. 152 -E---EEWS--IEFEPLSVPSKKEESEITKNNVESVTKAITEQIIDLEEARDTLRSIAPEFKL----KDGNNINIREPEE 221 (237) Q Consensus 152 -s---~~~~--~~f~pL~~~seke~Aei~~~~A~a~~~~~~~g~i~~~e~r~~l~~~~~~~g~----~~~~~~~~~~~e~ 221 (237) . .++. |.+..|...|.+++ ++++.+++++|+++++|+|+.+ ...+..|- -+..........+ T Consensus 344 ~~~~~~~~~~~fd~~~llr~d~~~r-------~~~~~~~~~~G~~T~NE~R~~~-gl~pi~ggD~~~~~~n~~~l~~~~~ 415 (424) T protein:vir:18 344 PSKDVGRLHAEHNLDGLLRGDSASR-------AAFMKAMGESGLRTINEMRRTD-NMPPLPGGDVAMRQAQYVPITDLGT 415 (424) T ss_pred CccccCCeEEEEechhhhccCHHHH-------HHHHHHHHhCCCcCHHHHHHHh-CCCCCCCcCeeeeccCccchhhhhc Confidence 2 2334 45567777777765 5566778999999999999975 23332220 0000001111111 Q ss_pred CCCCCCCCCC Q lcl|NC_019725. 222 TTEPEPGLGE 231 (237) Q Consensus 222 ~~e~~~~~~~ 231 (237) ..+ +...|+ T Consensus 416 ~~~-~~~n~a 424 (424) T protein:vir:18 416 NKE-PRNNGA 424 (424) T ss_pred cCC-ccccCC Confidence 110 000111 No 56 >protein:vir:94869 Length: 378 # NCBI annotation: putative portal protein # Family: family:all:2379 # MgeID: mge:1532 # MgeName: P008 # Cross-refs: genbank:acc:YP_762515;genbank:gi:115304214;genbank:GeneID:5141182 Probab=98.73 E-value=5.5e-10 Score=71.35 Aligned_cols=199 Identities=13% Similarity=0.121 Sum_probs=99.0 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHhc-cceeechhHHHhhcCCch---HHHHHHHHHHHHHhcCchheeeeecCCccee Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRRKQ-QAVWKVKGLAEMCDDDDA---QYAARLRLAQVDDNSGVGRAIGIDAETEEYD 76 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~~~-~~v~k~~~l~~~~~~~~~---e~~~~~r~~~~~~~r~~~~~~~iD~~~e~~~ 76 (237) +++.+...+..+- ...+ -.++|+++ .++.... ...++++++-.....++.+.+++++ +.+|+ T Consensus 152 ~~~~~~~~~~~~~----------~~~~~~g~l~~~~---~l~~~~~~~~~e~~~~~~~~~~~~~n~~~~~vl~~-g~~~~ 217 (378) T protein:vir:94 152 ILDNALASIQTKL----------EQGKLRGLLKINA---FLDIDNTQEYREKALATIKNMQEGSSYNGLTPVDN-KTEIV 217 (378) T ss_pred HHHHHHHHHHHHH----------hhCCcccceeeCC---cCCHHHHHHHHHHHHHHHHHhhcccccccceeccC-CceEE Confidence 3333332222111 1111 12345543 1222111 1223333322222233345677775 68899 Q ss_pred eeecCcCCHH-HHHHHHHHHHhhhhcCceeeeeccCcccccccc-hhHHHHHHHHHHHHHHHhhhHHHHHHHHHhhc--- Q lcl|NC_019725. 77 VLNSDISGVP-EFLSSKMDRIVSLSGIHEIIIKNKNVGGVSASQ-NTALETFYKLVDRKREEDYRPLLEFLLPFIVE--- 151 (237) Q Consensus 77 ~~~~~lsGl~-dl~~~~~~~iaa~s~iP~t~L~G~sp~GlnatG-e~D~~nyyd~I~~~Qe~~l~p~l~~l~~~i~~--- 151 (237) .++.+...++ +-+......||.+-|||..+|.| ++ +....+||.. .|.|.+.++-.-+-+ T Consensus 218 ~l~~~~~~~~~~~~~~~~~~Ia~~fgvPp~~l~g--------~~~e~~~~~f~~~-------tl~P~~~~ie~~l~~~Ll 282 (378) T protein:vir:94 218 ELKKDYSVLNKDEIDLIKSELLTGYFMNENILLG--------TATQEQQIYFYNS-------TIIPLLIQLEKELTYKLI 282 (378) T ss_pred EccCChHHhhHHHHHHHHHHHHHHhCCCHHHhcC--------CchHHHHHHHHHH-------HHHHHHHHHHHHHHhhcC Confidence 9887775544 23345567889988888777743 22 2233445543 578877766544321 Q ss_pred -C------------CCceeEeCCCCCCCHHHHHHHHHHHHHHHHHHHhCCCCCHHHHHHHHHhhcccccc----CCCCCC Q lcl|NC_019725. 152 -E------------EEWSIEFEPLSVPSKKEESEITKNNVESVTKAITEQIIDLEEARDTLRSIAPEFKL----KDGNNI 214 (237) Q Consensus 152 -s------------~~~~~~f~pL~~~seke~Aei~~~~A~a~~~~~~~g~i~~~e~r~~l~~~~~~~g~----~~~~~~ 214 (237) . .++.|++.+|...|.++++ +++.+++++|+++++|+|+.+- ..+..|. .+..-. T Consensus 283 ~~~e~~~g~~~~~~~~~~f~~~~l~~~d~~~~~-------e~~~~~~~~G~~t~NE~R~~~g-~~p~~ggd~~~~~~n~~ 354 (378) T protein:vir:94 283 STNRRRVVKGNLYYERIIVDNQLFKFATLKELI-------DLYHENINGPIFTQNQLLVKMG-EQPIEGGDVYIANLNAV 354 (378) T ss_pred ChhHhhhhhhhcccceeEeecchhhhcCHHHHH-------HHHHHHHhCCCcCHHHHHHHhC-CCCCCCCCeeeeccccc Confidence 1 2366778888888887665 5677799999999999999763 3333331 111111 Q ss_pred Chhcccc--CCCCCCCCCCCcCcC Q lcl|NC_019725. 215 NIREPEE--TTEPEPGLGEKLEDE 236 (237) Q Consensus 215 ~~~~~e~--~~e~~~~~~~~~~~e 236 (237) ..+...+ ....+...+++-+.| T Consensus 355 ~~~~~~~~~~~~~~~~~~~e~~n~ 378 (378) T protein:vir:94 355 AVKNLSDLQGNRKDVTSTDETNNQ 378 (378) T ss_pred chhcchhcccccCCCCCCCCCCCC Confidence 1111211 111111222222222 No 57 >protein:vir:1326 Length: 457 # NCBI annotation: gp34 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:28 # MgeName: phi-C31 # Cross-refs: genbank:acc:NP_047925;swissprot:trembl:q9zxb2;genbank:gi:9631143;uniprot:Q9ZXB2;genbank:GeneID:2715872 Probab=98.73 E-value=7.9e-09 Score=65.01 Aligned_cols=216 Identities=16% Similarity=0.138 Sum_probs=113.8 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHhcc--ceeechhHHHhhcCCchHHHHHHHHHHHHHhcCc-hheeeeecCCcceee Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRRKQQ--AVWKVKGLAEMCDDDDAQYAARLRLAQVDDNSGV-GRAIGIDAETEEYDV 77 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~~~~--~v~k~~~l~~~~~~~~~e~~~~~r~~~~~~~r~~-~~~~~iD~~~e~~~~ 77 (237) .++.+.+.|.....+......+...... .++++++ .++ ++....++++++-......| .+.++++. +-+|+. T Consensus 195 ~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~---~ls-~e~~~~~~~~~~~~~~g~~nag~~~vl~~-g~~~~~ 269 (457) T protein:vir:13 195 PISYARESIGLALAAQKYGSKFFANGAMPGAVVEVPG---TMS-EEGLARAREAWRAANSGVDNAHRVALLTE-GAKFSK 269 (457) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhcCCCcceEEEcCC---CCC-HHHHHHHHHHHHHHhcCccccCcceecCC-CceEEE Confidence 5677777787777777777776665433 4566653 222 22334555555544333334 34556654 577888 Q ss_pred eecCcCC--HHHHHHHHHHHHhhhhcCceeeeeccCccc-ccccc-hhHHHHHHHHHHHHHHHhhhHHHHHHHHHhh--- Q lcl|NC_019725. 78 LNSDISG--VPEFLSSKMDRIVSLSGIHEIIIKNKNVGG-VSASQ-NTALETFYKLVDRKREEDYRPLLEFLLPFIV--- 150 (237) Q Consensus 78 ~~~~lsG--l~dl~~~~~~~iaa~s~iP~t~L~G~sp~G-lnatG-e~D~~nyyd~I~~~Qe~~l~p~l~~l~~~i~--- 150 (237) ++.+... +-+........||.+-|||-..| |...++ ..++. +.....||. ..|.|.++++-.-+- T Consensus 270 l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~l-g~~~~~~~~~sn~eq~~~~f~~-------~tl~P~~~~ie~~ln~~L 341 (457) T protein:vir:13 270 VAMSPDEAQFLQTRQFQVPEIARIFGVPPHLI-SDATNSTSWGSGLAEQNIAFTM-------FSLRPWLERIEAGFNRLL 341 (457) T ss_pred ccCChhHHHHHHHHHHHHHHHHHHhCCCHHHc-CCCCCcccccchHHHHHHHHHH-------HHHHHHHHHHHHHHHHhh Confidence 7665433 23455577889999999998866 655443 32221 222333433 457777776644432 Q ss_pred -cC---CC--ceeEeCCCCCCCHHHHHHHHHHHHHHHHHHHhCCCCCHHHHHHHHHhhccc-cccC-----CCC--CC-C Q lcl|NC_019725. 151 -EE---EE--WSIEFEPLSVPSKKEESEITKNNVESVTKAITEQIIDLEEARDTLRSIAPE-FKLK-----DGN--NI-N 215 (237) Q Consensus 151 -~s---~~--~~~~f~pL~~~seke~Aei~~~~A~a~~~~~~~g~i~~~e~r~~l~~~~~~-~g~~-----~~~--~~-~ 215 (237) .. .. |.|.+..|...+-++++ +++.+++++|+++++|+|+.+ ...+. .|.- +.. .+ + T Consensus 342 ~~~~~~~~~~i~fd~~~l~~~D~~~r~-------~~~~~~~~~G~~T~NE~R~~~-gl~Pi~~g~~d~~~~~~n~~~~~~ 413 (457) T protein:vir:13 342 FAETADRFRFVKFNLDEIKRGAPKERM-------ELWSLGLQNGIYSIDEVRAAE-DMTPLPDGLGEKYRVPLNLGEVGE 413 (457) T ss_pred cCccccCceeEEeechhhhccCHHHHH-------HHHHHHHhCCCcCHHHHHHHh-CCCCCCCCcccceeeccccccccc Confidence 11 12 44555688777777754 556778999999999999876 33222 1110 000 00 0 Q ss_pred hhccc----------------cCCCCCCCCCCCcCcCC Q lcl|NC_019725. 216 IREPE----------------ETTEPEPGLGEKLEDEN 237 (237) Q Consensus 216 ~~~~e----------------~~~e~~~~~~~~~~~e~ 237 (237) ..+.+ +.++.+....+....++ T Consensus 414 ~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~d~~~~~~~ 451 (457) T protein:vir:13 414 EPEPEPAPAPPAIEPPAEEPDEEPEPEGKPDDEGATEE 451 (457) T ss_pred cccccccCCCCCCCCCccccCCCCCCCCCCccccCCCC Confidence 00000 00000000000000111 No 58 >protein:vir:93943 Length: 409 # NCBI annotation: ORF010 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1485 # MgeName: 3A # Cross-refs: genbank:acc:YP_239936;genbank:gi:66395598;genbank:GeneID:5131009 Probab=98.72 E-value=3.5e-09 Score=66.96 Aligned_cols=210 Identities=11% Similarity=0.085 Sum_probs=106.8 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHhcc---ceeechhHHHhhcCCchHHHHHHHHHHHHHhcCchheeeeecCCcceee Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRRKQQ---AVWKVKGLAEMCDDDDAQYAARLRLAQVDDNSGVGRAIGIDAETEEYDV 77 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~~~~---~v~k~~~l~~~~~~~~~e~~~~~r~~~~~~~r~~~~~~~iD~~~e~~~~ 77 (237) -++.+.+.+.....+.... +..... .+++.+. .+ +++....+++++. +.+.++.+.+++++ +-+|+. T Consensus 181 ~i~~~~~~i~~~~~~~~~~---~~~~~~~~~~i~~~~~---~l-~~e~~~~~~~~~~--~~~~~~g~~~vl~~-g~~~~~ 250 (409) T protein:vir:93 181 PIDVLKNTTDFDNAVRTFN---LTEMQKPDSFMLKYGS---NV-GKEKRQQVLEDFK--QYYEENGGILFQEP-GVEIEP 250 (409) T ss_pred HHHHHHHHHHHHHHHHHHH---HHhcCCCCceEEecCC---CC-CHHHHHHHHHHHH--HHhhcCCCeeecCC-CceEEE Confidence 2344444444333332221 111111 1222221 11 1112223344333 23445555666654 577888 Q ss_pred eecCcC--CHHHHHHHHHHHHhhhhcCceeeeeccCcccccccchhHHHHHHHHHHHHHHHhhhHHHHHHHHHhhc---- Q lcl|NC_019725. 78 LNSDIS--GVPEFLSSKMDRIVSLSGIHEIIIKNKNVGGVSASQNTALETFYKLVDRKREEDYRPLLEFLLPFIVE---- 151 (237) Q Consensus 78 ~~~~ls--Gl~dl~~~~~~~iaa~s~iP~t~L~G~sp~GlnatGe~D~~nyyd~I~~~Qe~~l~p~l~~l~~~i~~---- 151 (237) ++.+.. .+-+......+.||.+-|||-.+|.+...+..+ +.+.-...||..+ |.|.++++-.-+-+ T Consensus 251 l~~~~~d~q~~e~r~~~~~~Ia~~fgVPp~~lg~~~~~~~s-n~e~~~~~f~~~~-------l~P~~~~ie~~l~~~Ll~ 322 (409) T protein:vir:93 251 LPKKYVSEDIVASENLTRERVANVFQLPSVFLNARSNTNFA-KNEELNRFYLQHT-------LLPIVKQYEEEFNRKLLT 322 (409) T ss_pred cCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcc-cHHHHHHHHHHHH-------HHHHHHHHHHHHHhhcCC Confidence 765433 223344457788999999999888665444333 3455556677654 88887777544321 Q ss_pred ----CCCceeEeC--CCCCCCHHHHHHHHHHHHHHHHHHHhCCCCCHHHHHHHHHhhcccccc----CCCCCCChhcccc Q lcl|NC_019725. 152 ----EEEWSIEFE--PLSVPSKKEESEITKNNVESVTKAITEQIIDLEEARDTLRSIAPEFKL----KDGNNINIREPEE 221 (237) Q Consensus 152 ----s~~~~~~f~--pL~~~seke~Aei~~~~A~a~~~~~~~g~i~~~e~r~~l~~~~~~~g~----~~~~~~~~~~~e~ 221 (237) ..++.|+|+ .|...|.+++ ++++++++++|+++++|+|+.+- ..+..|- .+......+..++ T Consensus 323 ~~~~~~~~~~~fd~~~ll~~d~~~~-------~~~~~~~~~~G~~T~NE~R~~~g-~~p~~ggD~~~~~~n~~~~~~~~~ 394 (409) T protein:vir:93 323 KTDREKNRYFKFNVKSYLRADSATQ-------AEVYFKAVRSGYYTINDIREWED-LPPVEGGDKPLISGDLYPIDTPLE 394 (409) T ss_pred cccccCcceEEeechhhhccCHHHH-------HHHHHHHHhCCCcCHHHHHHHhC-CCCCCCcCeeeecccccccccchh Confidence 234556654 5666666554 56778899999999999999872 2332221 0111111111111 Q ss_pred CCCCCCCCCCCcCcCC Q lcl|NC_019725. 222 TTEPEPGLGEKLEDEN 237 (237) Q Consensus 222 ~~e~~~~~~~~~~~e~ 237 (237) . ....++|++...|. T Consensus 395 ~-~~~~~gG~~n~~e~ 409 (409) T protein:vir:93 395 L-RKSLKGGDKNVNES 409 (409) T ss_pred h-cccccCCCCCcCCC Confidence 1 12234444444444 No 59 >protein:vir:94666 Length: 723 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1527 # MgeName: mu1/6 # Cross-refs: genbank:acc:YP_579205;genbank:gi:93007441;genbank:GeneID:5076785 Probab=98.72 E-value=3.5e-09 Score=66.95 Aligned_cols=214 Identities=11% Similarity=0.084 Sum_probs=105.3 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHhcc--ceeechhHHHhhcCCchHHHHHHHHHHHHHhcCch-heeeeec------- Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRRKQQ--AVWKVKGLAEMCDDDDAQYAARLRLAQVDDNSGVG-RAIGIDA------- 70 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~~~~--~v~k~~~l~~~~~~~~~e~~~~~r~~~~~~~r~~~-~~~~iD~------- 70 (237) -++.+.+.|.....+......+...... .|++++.+ +++....+++++.-.-..-.|. ..+++.+ T Consensus 174 pi~~a~~~i~~~~aa~~~~~~~f~NG~~p~giL~~~~l-----~~e~~~~~~~~~~~~~~G~~Nagk~~vL~g~~~~~~v 248 (723) T protein:vir:94 174 PWKAARAAVDADFYAATWQRQSFKNGARPGGVVNLGDM-----DEQTFTKTVAAFRSQVEGVQNAGRHLLIAGQGSDGGA 248 (723) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhcCCCcceEEEcCCC-----CHHHHHHHHHHHHHHhhchhhcCcceeeccccccccc Confidence 4566666666666666666665544322 45555432 1222233444443221111232 2344432 Q ss_pred --CCcceeeeecCcCC--HHHHHHHHHHHHhhhhcCceeeeeccCcccccccchhHHHHHHHHHHHHHHHhhhHHHHHHH Q lcl|NC_019725. 71 --ETEEYDVLNSDISG--VPEFLSSKMDRIVSLSGIHEIIIKNKNVGGVSASQNTALETFYKLVDRKREEDYRPLLEFLL 146 (237) Q Consensus 71 --~~e~~~~~~~~lsG--l~dl~~~~~~~iaa~s~iP~t~L~G~sp~GlnatGe~D~~nyyd~I~~~Qe~~l~p~l~~l~ 146 (237) ++-+|+.++.+... +-+......+.||.+-|||-..|.|.+. +++.+.-...||. ..|.|.++++- T Consensus 249 l~~G~~~~~l~~s~~D~q~le~r~~~~~eIa~afgVPp~~i~~~st---~sN~e~~~~~f~~-------~tL~P~~~~ie 318 (723) T protein:vir:94 249 AGKGATFTSLSMSPAEMDYINSRMHSAEEVMLAFGIRKDALLGGST---YENQAEAKAAVWT-------ETLIPQMEVMA 318 (723) T ss_pred ccCCceEEEccCCHHHHHHHHHHHHhHHHHHHHhCCChhHcCCCCC---cccHHHHHHHHHH-------HHHHHHHHHHH Confidence 23355554433221 2234455677899999999888866431 1222334445553 44788877776 Q ss_pred HHhhc---C---CCceeEeCC--CCCCCHHHHHHHHHHHHHHHHHHHhCCCCCHHHHHHHHHhhccccccCCCCCCChhc Q lcl|NC_019725. 147 PFIVE---E---EEWSIEFEP--LSVPSKKEESEITKNNVESVTKAITEQIIDLEEARDTLRSIAPEFKLKDGNNINIRE 218 (237) Q Consensus 147 ~~i~~---s---~~~~~~f~p--L~~~seke~Aei~~~~A~a~~~~~~~g~i~~~e~r~~l~~~~~~~g~~~~~~~~~~~ 218 (237) ..+-+ + .++.|+|+. |...|.+++ ++++..++++|+++++|+|+.+ ...+..|-.+...+..-. T Consensus 319 ~~ln~~Ll~~~g~~~~~~f~~~~lLr~D~~~r-------~~~~~~~v~~G~~T~NE~R~~l-glpPi~gGd~~~~~~p~~ 390 (723) T protein:vir:94 319 SITDLQLLPDIGWTVEWDFNSVPALQEDLEAQ-------AGRNQGYLVNDVLMVDEVRATI-GLDPLPGGIGQMTLTPYR 390 (723) T ss_pred HHHhHhhcccccCceEEeecchhhhhcCHHHH-------HHHHHHHHhCCCcCHHHHHHHh-CCCCCCCCcccceecccc Confidence 55432 1 356788886 455665544 5688899999999999999976 333322211000011000 Q ss_pred cccCCCCCCCCCCCcC-------------------------------------cCC Q lcl|NC_019725. 219 PEETTEPEPGLGEKLE-------------------------------------DEN 237 (237) Q Consensus 219 ~e~~~e~~~~~~~~~~-------------------------------------~e~ 237 (237) ..-++-+.|.+...+. .+. T Consensus 391 ~~~a~~~~~~p~~~e~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~ 446 (723) T protein:vir:94 391 AQFAPAPAPAPAVEEGAARMLALLERVAADRPLPELPVRATTVLHHDPGPDPQQTL 446 (723) T ss_pred ccccCCCCCCccchhhhHhhhhhccccccccCcCCCCCCCCCCCCCCcccCCchhH Confidence 0111111111110000 000 No 60 >protein:vir:79984 Length: 441 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1875 # MgeName: tp310-3 # Cross-refs: genbank:acc:YP_001430000;genbank:gi:156604055;genbank:GeneID:5525444 Probab=98.72 E-value=3.9e-09 Score=66.69 Aligned_cols=211 Identities=18% Similarity=0.212 Sum_probs=115.1 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHhcc--ceeechhHHHhhcCCchHHHHHHHHHHHHHhcCc-hheeeeecCCcceee Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRRKQQ--AVWKVKGLAEMCDDDDAQYAARLRLAQVDDNSGV-GRAIGIDAETEEYDV 77 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~~~~--~v~k~~~l~~~~~~~~~e~~~~~r~~~~~~~r~~-~~~~~iD~~~e~~~~ 77 (237) .++.+.+.|.....+......+...... .++++++ .+.+++....++++++-.-..-.| .+.+++++ +-+|+. T Consensus 206 pl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~---~~~~~e~~e~~r~~~~~~~~G~~nag~~~vl~~-G~~~~~ 281 (441) T protein:vir:79 206 LLDTLSRTIESDNNGKDFLNNFLRNGTHAGGILKMKG---VLDNKKARDRAREEFHKSFSGTKQAGKVVVLDE-SMTFDQ 281 (441) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhccCCCcEEEEcCC---CCCCHHHHHHHHHHHHHHhcCccccCcceecCC-CceEEE Confidence 6788888888877777777777666432 4566653 222333233444444432222123 34566664 578888 Q ss_pred eecCcCC--HHHHHHHHHHHHhhhhcCceeeeeccCcccccccchhHHHHHHHHHHHHHHHhhhHHHHHHHHHhhc---- Q lcl|NC_019725. 78 LNSDISG--VPEFLSSKMDRIVSLSGIHEIIIKNKNVGGVSASQNTALETFYKLVDRKREEDYRPLLEFLLPFIVE---- 151 (237) Q Consensus 78 ~~~~lsG--l~dl~~~~~~~iaa~s~iP~t~L~G~sp~GlnatGe~D~~nyyd~I~~~Qe~~l~p~l~~l~~~i~~---- 151 (237) ++.+... +-+........||.+-|||-..| |...++.| ..+...+|- ..|.|.+.++-..|-+ T Consensus 282 l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~l-g~~~~~~s---~~q~~~~~~-------~tl~P~~~~ie~eln~kl~~ 350 (441) T protein:vir:79 282 LEVDTEVLKLIRENKSSTREIAGVFGIPLHKF-GIETANMS---ITDANLDYL-------STLKPYITCVCAELNFKFND 350 (441) T ss_pred ccCChhHHHHHHHHHHhHHHHHHHhCCCHHHc-CCCCCCcc---HHHHHHHHH-------HHHHHHHHHHHHHHhhhccc Confidence 7655432 33455667789999999999865 76555443 122233332 2477777766444321 Q ss_pred -CCC--ceeEeCCCCCCCHHHHHHHHHHHHHHHHHHHhCCCCCHHHHHHHHHhhccccccC------CCCCCChhccc-- Q lcl|NC_019725. 152 -EEE--WSIEFEPLSVPSKKEESEITKNNVESVTKAITEQIIDLEEARDTLRSIAPEFKLK------DGNNINIREPE-- 220 (237) Q Consensus 152 -s~~--~~~~f~pL~~~seke~Aei~~~~A~a~~~~~~~g~i~~~e~r~~l~~~~~~~g~~------~~~~~~~~~~e-- 220 (237) ..+ |.|.+..|...|.+++ +++++.++++|+++++|+|+.+ ...+..|-. +...+..+..+ T Consensus 351 ~~~~~~~~fd~~~llr~D~~~~-------~~~~~~~i~~G~~T~NE~R~~~-gl~Pi~ggd~~~~~~~~n~~~~~~~~~~ 422 (441) T protein:vir:79 351 EYVNREFKFDTTEIRVVDEKTQ-------AEIDKINIDSGKMNIDEIRQRD-GLAPIPGGNGSIHRVDLNHVNIELVDEY 422 (441) T ss_pred cccCceEEeechhhhccCHHHH-------HHHHHHHHhCCCcCHHHHHHHh-CCCCCCCCCcceEeeccccccccccccc Confidence 123 4455556666666654 6678889999999999999876 333322210 01111111111 Q ss_pred ---cCC--CCCCCCCCCcC Q lcl|NC_019725. 221 ---ETT--EPEPGLGEKLE 234 (237) Q Consensus 221 ---~~~--e~~~~~~~~~~ 234 (237) +.. +..-.+||+.+ T Consensus 423 ~~~~~~~~~~~~kgGe~~e 441 (441) T protein:vir:79 423 QMNKSRATDKKLKGGEENE 441 (441) T ss_pred ccccccccccccCCCCCCC Confidence 111 11112222222 No 61 >protein:vir:9408 Length: 441 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:167 # MgeName: phi 13 # Cross-refs: genbank:acc:NP_803386;genbank:gi:29028698;genbank:GeneID:1258164 Probab=98.72 E-value=3.9e-09 Score=66.69 Aligned_cols=211 Identities=18% Similarity=0.212 Sum_probs=115.1 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHhcc--ceeechhHHHhhcCCchHHHHHHHHHHHHHhcCc-hheeeeecCCcceee Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRRKQQ--AVWKVKGLAEMCDDDDAQYAARLRLAQVDDNSGV-GRAIGIDAETEEYDV 77 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~~~~--~v~k~~~l~~~~~~~~~e~~~~~r~~~~~~~r~~-~~~~~iD~~~e~~~~ 77 (237) .++.+.+.|.....+......+...... .++++++ .+.+++....++++++-.-..-.| .+.+++++ +-+|+. T Consensus 206 pl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~---~~~~~e~~e~~r~~~~~~~~G~~nag~~~vl~~-G~~~~~ 281 (441) T protein:vir:94 206 LLDTLSRTIESDNNGKDFLNNFLRNGTHAGGILKMKG---VLDNKKARDRAREEFHKSFSGTKQAGKVVVLDE-SMTFDQ 281 (441) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhccCCCcEEEEcCC---CCCCHHHHHHHHHHHHHHhcCccccCcceecCC-CceEEE Confidence 6788888888877777777777666432 4566653 222333233444444432222123 34566664 578888 Q ss_pred eecCcCC--HHHHHHHHHHHHhhhhcCceeeeeccCcccccccchhHHHHHHHHHHHHHHHhhhHHHHHHHHHhhc---- Q lcl|NC_019725. 78 LNSDISG--VPEFLSSKMDRIVSLSGIHEIIIKNKNVGGVSASQNTALETFYKLVDRKREEDYRPLLEFLLPFIVE---- 151 (237) Q Consensus 78 ~~~~lsG--l~dl~~~~~~~iaa~s~iP~t~L~G~sp~GlnatGe~D~~nyyd~I~~~Qe~~l~p~l~~l~~~i~~---- 151 (237) ++.+... +-+........||.+-|||-..| |...++.| ..+...+|- ..|.|.+.++-..|-+ T Consensus 282 l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~l-g~~~~~~s---~~q~~~~~~-------~tl~P~~~~ie~eln~kl~~ 350 (441) T protein:vir:94 282 LEVDTEVLKLIRENKSSTREIAGVFGIPLHKF-GIETANMS---ITDANLDYL-------STLKPYITCVCAELNFKFND 350 (441) T ss_pred ccCChhHHHHHHHHHHhHHHHHHHhCCCHHHc-CCCCCCcc---HHHHHHHHH-------HHHHHHHHHHHHHHhhhccc Confidence 7655432 33455667789999999999865 76555443 122233332 2477777766444321 Q ss_pred -CCC--ceeEeCCCCCCCHHHHHHHHHHHHHHHHHHHhCCCCCHHHHHHHHHhhccccccC------CCCCCChhccc-- Q lcl|NC_019725. 152 -EEE--WSIEFEPLSVPSKKEESEITKNNVESVTKAITEQIIDLEEARDTLRSIAPEFKLK------DGNNINIREPE-- 220 (237) Q Consensus 152 -s~~--~~~~f~pL~~~seke~Aei~~~~A~a~~~~~~~g~i~~~e~r~~l~~~~~~~g~~------~~~~~~~~~~e-- 220 (237) ..+ |.|.+..|...|.+++ +++++.++++|+++++|+|+.+ ...+..|-. +...+..+..+ T Consensus 351 ~~~~~~~~fd~~~llr~D~~~~-------~~~~~~~i~~G~~T~NE~R~~~-gl~Pi~ggd~~~~~~~~n~~~~~~~~~~ 422 (441) T protein:vir:94 351 EYVNREFKFDTTEIRVVDEKTQ-------AEIDKINIDSGKMNIDEIRQRD-GLAPIPGGNGSIHRVDLNHVNIELVDEY 422 (441) T ss_pred cccCceEEeechhhhccCHHHH-------HHHHHHHHhCCCcCHHHHHHHh-CCCCCCCCCcceEeeccccccccccccc Confidence 123 4455556666666654 6678889999999999999876 333322210 01111111111 Q ss_pred ---cCC--CCCCCCCCCcC Q lcl|NC_019725. 221 ---ETT--EPEPGLGEKLE 234 (237) Q Consensus 221 ---~~~--e~~~~~~~~~~ 234 (237) +.. +..-.+||+.+ T Consensus 423 ~~~~~~~~~~~~kgGe~~e 441 (441) T protein:vir:94 423 QMNKSRATDKKLKGGEENE 441 (441) T ss_pred ccccccccccccCCCCCCC Confidence 111 11112222222 No 62 >protein:vir:98396 Length: 441 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1581 # MgeName: phiPVL(108) # Cross-refs: genbank:acc:YP_918929;genbank:gi:119443691;genbank:GeneID:4594558 Probab=98.72 E-value=6.5e-09 Score=65.48 Aligned_cols=211 Identities=18% Similarity=0.207 Sum_probs=113.4 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHhcc--ceeechhHHHhhcCCchHHHHHHHHHHHHHhcCc-hheeeeecCCcceee Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRRKQQ--AVWKVKGLAEMCDDDDAQYAARLRLAQVDDNSGV-GRAIGIDAETEEYDV 77 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~~~~--~v~k~~~l~~~~~~~~~e~~~~~r~~~~~~~r~~-~~~~~iD~~~e~~~~ 77 (237) .++.+.+.|.....+......+...... .++++++ .+.+++....++++++-......| .+.++++. +-+|+. T Consensus 206 pi~~~~~~i~~~~a~~~~~~~~f~ng~~~~gil~~~~---~~~~~e~~~~~~~~~~~~~~G~~nag~~~vl~~-g~~~~~ 281 (441) T protein:vir:98 206 LLDTLSRTIESDNNGKDFLNNFLRNGTHAGGILKMKG---VLDNKKARDRAREEFHKSFSGTKQAGKVVVLDE-SMTFDQ 281 (441) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhccCCCcEEEEeCC---CCCCHHHHHHHHHHHHHHhcCccccCcceecCC-CceEEE Confidence 5677888888888787777777666432 4566653 222333233344444433222223 34566664 577887 Q ss_pred eecCcCC--HHHHHHHHHHHHhhhhcCceeeeeccCcccccccchhHHHHHHHHHHHHHHHhhhHHHHHHHHHhhc---- Q lcl|NC_019725. 78 LNSDISG--VPEFLSSKMDRIVSLSGIHEIIIKNKNVGGVSASQNTALETFYKLVDRKREEDYRPLLEFLLPFIVE---- 151 (237) Q Consensus 78 ~~~~lsG--l~dl~~~~~~~iaa~s~iP~t~L~G~sp~GlnatGe~D~~nyyd~I~~~Qe~~l~p~l~~l~~~i~~---- 151 (237) ++.+... +-+........||.+-|||-..| |.+.++.|. .....+| + ..|.|.+.++-..|-+ T Consensus 282 l~~~~~d~q~~e~r~~~~~~Ia~~fgVPp~~l-g~~~~~~s~---~q~~~~y--~-----~tl~P~~~~ie~~ln~~L~~ 350 (441) T protein:vir:98 282 LEVDTEVLKLIRENKSSTREIAGVFGIPLHKF-GIETANMSI---TDANLDY--L-----STLKPYITCVCAELNFKFND 350 (441) T ss_pred ccCChhHHHHHHHHHHhHHHHHHHhCCCHHHc-CCCCCCccH---HHHHHHH--H-----HHHHHHHHHHHHHHHhhccc Confidence 7654322 33445666778999999999987 655554432 1222222 1 2477877766544321 Q ss_pred -CCCceeE--eCCCCCCCHHHHHHHHHHHHHHHHHHHhCCCCCHHHHHHHHHhhccccccC------CCCCCChhccc-- Q lcl|NC_019725. 152 -EEEWSIE--FEPLSVPSKKEESEITKNNVESVTKAITEQIIDLEEARDTLRSIAPEFKLK------DGNNINIREPE-- 220 (237) Q Consensus 152 -s~~~~~~--f~pL~~~seke~Aei~~~~A~a~~~~~~~g~i~~~e~r~~l~~~~~~~g~~------~~~~~~~~~~e-- 220 (237) ..+..|+ ...|...|.+++ +++++.++++|+++++|+|+.+ ...+..|-. +...+..+..+ T Consensus 351 ~~~~~~~~fd~~~llr~d~~~~-------~~~~~~~~~~G~~T~NE~R~~~-gl~pi~gGd~~~~~~~~n~~~~~~~~~~ 422 (441) T protein:vir:98 351 EYVNREFKFDTTEIRVVDEKTQ-------AEIDKINIDSGKMNIDEIRQRD-GLAPIPGGNGSIHRVDLNHVNIELVDEY 422 (441) T ss_pred cccCceEEEechhhhccCHHHH-------HHHHHHHHhCCCcCHHHHHHHh-CCCCCCCCCcceEeeccccccccccccc Confidence 1344444 456666666665 6678889999999999999876 333322210 00111111111 Q ss_pred ---cCCC--CCCCCCCCcC Q lcl|NC_019725. 221 ---ETTE--PEPGLGEKLE 234 (237) Q Consensus 221 ---~~~e--~~~~~~~~~~ 234 (237) +..+ ..-.+||+.+ T Consensus 423 q~~~~~~~~~~~kgGe~ne 441 (441) T protein:vir:98 423 QMNKSRATDKKLKGGEENE 441 (441) T ss_pred ccccccccccccCCCCCCC Confidence 1111 1112233222 No 63 >protein:vir:4509 Length: 424 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:97 # MgeName: V # Cross-refs: genbank:acc:NP_599035;genbank:gi:19548993;genbank:GeneID:935206 Probab=98.70 E-value=6.5e-09 Score=65.47 Aligned_cols=212 Identities=11% Similarity=0.068 Sum_probs=114.3 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHhcc--ceeechhHHHhhcCCchHHHHHHHHHHHHHh-cCch-heeeeecCCccee Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRRKQQ--AVWKVKGLAEMCDDDDAQYAARLRLAQVDDN-SGVG-RAIGIDAETEEYD 76 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~~~~--~v~k~~~l~~~~~~~~~e~~~~~r~~~~~~~-r~~~-~~~~iD~~~e~~~ 76 (237) .++.+.+.|.....+......+...... .|+++++ .++ ++....+++++.-.... ..|. +.++++. +-+|. T Consensus 194 pi~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~---~l~-~e~~~~~~~~~~~~~~g~~~n~g~~~vl~~-g~~~~ 268 (424) T protein:vir:45 194 PIMQHAETIGMGMSGQKYTESFFSGNARPAGIVSVKS---GLN-KESWGWLKDQWQKASQALRRQENKTMLLPA-DLDYK 268 (424) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEeCC---CCC-HHHHHHHHHHHHHHhccccccCCceeEcCC-CceEE Confidence 6678888888888888777776665333 4566653 222 22233445555432222 2344 4556664 46777 Q ss_pred eeecCcCC--HHHHHHHHHHHHhhhhcCceeeeeccCcccccccchhHHHHHHHHHHHHHHHhhhHHHHHHHHHhh---- Q lcl|NC_019725. 77 VLNSDISG--VPEFLSSKMDRIVSLSGIHEIIIKNKNVGGVSASQNTALETFYKLVDRKREEDYRPLLEFLLPFIV---- 150 (237) Q Consensus 77 ~~~~~lsG--l~dl~~~~~~~iaa~s~iP~t~L~G~sp~GlnatGe~D~~nyyd~I~~~Qe~~l~p~l~~l~~~i~---- 150 (237) .++.+... +-+......+.||.+-|||-..|-+..-+.. ++-+.-.+.|| +..|.|.+.++-.-+- T Consensus 269 ~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~-sn~eq~~~~f~-------~~tL~P~~~~ie~~ln~kLl 340 (424) T protein:vir:45 269 ALTVSPVDAQIIDMMKLNRSMIAGIFNIPAHMINDLEKATF-SNISAQAIQFV-------RYTMMPWVTNWEQELNRRLF 340 (424) T ss_pred EccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCc-ccHHHHHHHHH-------HHHHHHHHHHHHHHHHHhcC Confidence 76654432 2355667788999999999888754433322 22233333343 3457777776644332 Q ss_pred ----cCCCceeEe--CCCCCCCHHHHHHHHHHHHHHHHHHHhCCCCCHHHHHHHHHhhccccccCCCCCCCh---hcccc Q lcl|NC_019725. 151 ----EEEEWSIEF--EPLSVPSKKEESEITKNNVESVTKAITEQIIDLEEARDTLRSIAPEFKLKDGNNINI---REPEE 221 (237) Q Consensus 151 ----~s~~~~~~f--~pL~~~seke~Aei~~~~A~a~~~~~~~g~i~~~e~r~~l~~~~~~~g~~~~~~~~~---~~~e~ 221 (237) +..++.|+| ..|...|.+++ ++++++++++|+++++|+|+.+ ...+..| ++..-. -.... T Consensus 341 ~~~e~~~g~~i~fd~~~llr~d~~~r-------~~~~~~~~~~g~~T~NE~R~~~-gl~pi~g---gD~~~~~~n~~~~~ 409 (424) T protein:vir:45 341 TRAELAAGYYVRFNLTGLLRGTPQER-------AQFYHFAITDGWMSRNEARAFE-DMNPVEG---LDEMLVSVNAANPA 409 (424) T ss_pred ChhhhcCCcEEEeechhhhccCHHHH-------HHHHHHHHhCCCcCHHHHHHHh-CCCCCCC---cceeeecccccccc Confidence 123455554 46666666654 5667779999999999999865 2222222 111000 00001 Q ss_pred CCCCCCCCCCCcCcC Q lcl|NC_019725. 222 TTEPEPGLGEKLEDE 236 (237) Q Consensus 222 ~~e~~~~~~~~~~~e 236 (237) ....++..+++.++| T Consensus 410 ~~~~~~~~~~~~~~~ 424 (424) T protein:vir:45 410 GDFKPPKNDEGKTNE 424 (424) T ss_pred cccCCCCCCCCCCCC Confidence 111222222222222 No 64 >protein:vir:4952 Length: 386 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:108 # MgeName: Sfi19 # Cross-refs: genbank:acc:NP_049928;genbank:gi:9632899;genbank:GeneID:1262075 Probab=98.69 E-value=1.1e-08 Score=64.29 Aligned_cols=205 Identities=12% Similarity=0.105 Sum_probs=114.9 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHhcc--ceeechhHHHhhcCCchHHHHHHHHHHHHHhcCchheeeeecCCcceeee Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRRKQQ--AVWKVKGLAEMCDDDDAQYAARLRLAQVDDNSGVGRAIGIDAETEEYDVL 78 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~~~~--~v~k~~~l~~~~~~~~~e~~~~~r~~~~~~~r~~~~~~~iD~~~e~~~~~ 78 (237) .++.+.+.|.....+.....++...... .++++++ .+.... ...+.+ .......+..+.++++. +.+|+.+ T Consensus 176 ~l~~~~~~i~~~~~~~~~~~~~~~ng~~~~~il~~~~---~~~~~~-~~~~~~--~~~~~~~n~g~~~vl~~-g~~~~~l 248 (386) T protein:vir:49 176 PLMALGREFNIQKASDKLTISALKNALNANGILKIKG---GGLLDF-KTKVSR--SRQAMKQMQGGPLVLDD-LEDFTPL 248 (386) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHccCCccEEEEeCC---CCChHH-HHHHHH--HHHHhccCCCCceecCC-CceEEEc Confidence 6788888898888888888887776433 3455643 111111 111222 22233344445566665 5788888 Q ss_pred ecCcCC--HHHHHHHHHHHHhhhhcCceeeeeccCcccccccchhHHHHHHHHHHHHHHHhhhHHHHHHHHHhh--cCCC Q lcl|NC_019725. 79 NSDISG--VPEFLSSKMDRIVSLSGIHEIIIKNKNVGGVSASQNTALETFYKLVDRKREEDYRPLLEFLLPFIV--EEEE 154 (237) Q Consensus 79 ~~~lsG--l~dl~~~~~~~iaa~s~iP~t~L~G~sp~GlnatGe~D~~nyyd~I~~~Qe~~l~p~l~~l~~~i~--~s~~ 154 (237) +.+... +-+......+.||++-|||-..|-|.. ++- ++++ ..+.|| ...++|.++.+...+- .... T Consensus 249 ~~~~~d~~~~e~~~~~~~~Ia~~fgVPp~~lg~~~-~~~-~~~~-~~~~~~-------~~~i~~~l~~i~~~~~~~l~~~ 318 (386) T protein:vir:49 249 EIKSNVAQLLSQADWTTGQFAKVYGIPESIVGGDG-DQQ-SSLE-MIYNIY-------FKSVSRYLRPFVSEMSKKLSCE 318 (386) T ss_pred cCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCC-Ccc-chHH-HHHHHH-------HHHHHHHHHHHHHHHHHHhcch Confidence 765543 345668888999999999988875432 222 2332 233343 2344555554444332 1245 Q ss_pred ceeEeCCCCCCCHHHHHHHHHHHHHHHHHHHhCCCCCHHHHHHHHHhhccccccCCCCCCChhccccCCCCCCCCCCCcC Q lcl|NC_019725. 155 WSIEFEPLSVPSKKEESEITKNNVESVTKAITEQIIDLEEARDTLRSIAPEFKLKDGNNINIREPEETTEPEPGLGEKLE 234 (237) Q Consensus 155 ~~~~f~pL~~~seke~Aei~~~~A~a~~~~~~~g~i~~~e~r~~l~~~~~~~g~~~~~~~~~~~~e~~~e~~~~~~~~~~ 234 (237) +.|+..++...+.++++ .....++.+|+++++|+|+.|.. .|+.+.. +.. .+....++..+|+ .+ T Consensus 319 ~~~~~~~~~~~d~~~~~-------~~~~~l~~~g~~t~nE~r~~l~~----~~~~~~~-~~~--~~~~~~~~~~gGd-~~ 383 (386) T protein:vir:49 319 VDVDISPAVDPTGSNYI-------SLINSMVKSGTLAQNQGLYILQQ----AEILPKE-LPD--GKNPNRTSLKGGE-IN 383 (386) T ss_pred hcccchhhhccCHHHHH-------HHHHHHHhCCCcCHHHHHHHHhh----CCCCCCc-Ccc--hhccCCCCCCCCC-CC Confidence 56666666666666554 44567899999999999998753 2333211 111 1111112222333 34 Q ss_pred cCC Q lcl|NC_019725. 235 DEN 237 (237) Q Consensus 235 ~e~ 237 (237) ++| T Consensus 384 ~~~ 386 (386) T protein:vir:49 384 EQD 386 (386) T ss_pred CCC Confidence 555 No 65 >protein:vir:960 Length: 413 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:19 # MgeName: bIL285 # Cross-refs: genbank:acc:NP_076614;genbank:gi:13095722;genbank:GeneID:920279 Probab=98.68 E-value=5.8e-09 Score=65.73 Aligned_cols=207 Identities=10% Similarity=0.099 Sum_probs=112.3 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHhcc--ceeechhHHHhhcCCchHHHHHHHHHHHHHhcC-chheeeeecCCcceee Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRRKQQ--AVWKVKGLAEMCDDDDAQYAARLRLAQVDDNSG-VGRAIGIDAETEEYDV 77 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~~~~--~v~k~~~l~~~~~~~~~e~~~~~r~~~~~~~r~-~~~~~~iD~~~e~~~~ 77 (237) .++.+.+.|.....+......+...... .++++++ .++ ++....++++++-.-.... ..+.+++...+.++.. T Consensus 194 ~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~gil~~~~---~l~-~e~~~~~~~~~~~~~~g~~n~g~~~vl~~~~~~~~~ 269 (413) T protein:vir:96 194 YKVALKDIVGNLKQASVTKKGFMASEYMPNLIVSVDS---DSD-ELSDEEGRENFEEMYLKRKEAGKPWIIPEGMVNVQQ 269 (413) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEeCC---CCC-HHHHHHHHHHHHHHhcCccccCceeeecCCcccccc Confidence 6888888888888888888887777544 5666654 122 2223445555543322222 3344566544444443 Q ss_pred ee-cCcC--CHHHHHHHHHHHHhhhhcCceeeeeccCcccccccchhHHHHHHHHHHHHHHHhhhHHHHHHHHHhh---c Q lcl|NC_019725. 78 LN-SDIS--GVPEFLSSKMDRIVSLSGIHEIIIKNKNVGGVSASQNTALETFYKLVDRKREEDYRPLLEFLLPFIV---E 151 (237) Q Consensus 78 ~~-~~ls--Gl~dl~~~~~~~iaa~s~iP~t~L~G~sp~GlnatGe~D~~nyyd~I~~~Qe~~l~p~l~~l~~~i~---~ 151 (237) +. .+.. -+-+........||.+-|||..+| |. +...+....+||.. .|.|.++.+-+.|- . T Consensus 270 ~~~~~~~d~q~~e~~~~~~~~Ia~~fgVP~~~l-g~-----~~~~~~~~~~~~~~-------~l~P~~~~ie~~ln~~ll 336 (413) T protein:vir:96 270 IKPLTLNDLAINDAVTLDKKTVAGIFGVPAFLL-GV-----GTYNKDEFNNFINT-------KIMSIAQVIQQTYNKLIV 336 (413) T ss_pred cccCChhHHHHHHHHHHHHHHHHHHhCCCHHHc-CC-----CcchHHHHHHHHHH-------HHHHHHHHHHHHHHHhhC Confidence 32 2222 222455566788999999999877 32 11123334455543 37787777655543 3 Q ss_pred CCCceeE--eCCCCCCCHHHHHHHHHHHHHHHHHHHhCCCCCHHHHHHHHHhhccccccCC-CCCCChhccccCCC-CCC Q lcl|NC_019725. 152 EEEWSIE--FEPLSVPSKKEESEITKNNVESVTKAITEQIIDLEEARDTLRSIAPEFKLKD-GNNINIREPEETTE-PEP 227 (237) Q Consensus 152 s~~~~~~--f~pL~~~seke~Aei~~~~A~a~~~~~~~g~i~~~e~r~~l~~~~~~~g~~~-~~~~~~~~~e~~~e-~~~ 227 (237) .+++.|+ +..|...|.+++| +++..++++|+++++|+|+.+- ..+..|-.- .-.......+...+ ... T Consensus 337 ~~~~~~~fd~~~ll~~d~~~~~-------~~~~~~~~~G~~t~NE~R~~~g-~~p~~~gd~~~~~~n~~~~~~~~~~~~~ 408 (413) T protein:vir:96 337 EEDMYFSLNPRSLYNYSLTEMV-------SAGAQMTQLNALRRNEFRNWVG-MPPDAEMDDLLVLENYLQQKDLVNQKKL 408 (413) T ss_pred CCCcEEEEechhhhccCHHHHH-------HHHHHHHhCCCcCHHHHHHHhC-CCCCCCcceeeecccccchhhcccccCC Confidence 4555555 4567677766655 5777899999999999998763 233222000 00000010111111 112 Q ss_pred CCCCC Q lcl|NC_019725. 228 GLGEK 232 (237) Q Consensus 228 ~~~~~ 232 (237) +.||. T Consensus 409 ~~~dt 413 (413) T protein:vir:96 409 IQDET 413 (413) T ss_pred CCCCC Confidence 22222 No 66 >protein:vir:7987 Length: 456 # NCBI annotation: gp3 # Family: family:all:5096 # MgeID: mge:151 # MgeName: Che8 # Cross-refs: genbank:acc:NP_817341;genbank:gi:29565769;genbank:GeneID:1258964 Probab=98.66 E-value=1.6e-09 Score=68.79 Aligned_cols=212 Identities=11% Similarity=0.036 Sum_probs=113.4 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHhccceeechhHH--HhhcCCchHHHHHHHHHHHHHhcCchh-eeeeecCCcce-e Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRRKQQAVWKVKGLA--EMCDDDDAQYAARLRLAQVDDNSGVGR-AIGIDAETEEY-D 76 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~~~~~v~k~~~l~--~~~~~~~~e~~~~~r~~~~~~~r~~~~-~~~iD~~~e~~-~ 76 (237) .++.+.+.+.+++++......-+..+......+.|.. ...-+..+. ++...+..+...+ ++.++.+ .++ + T Consensus 226 d~e~v~~liD~~~~~~s~~~~~~~~~a~~~~~~~G~~~~~~~~d~~g~-----~i~~~~~~~~~~~~~~~~~~~-~~~~q 299 (456) T protein:vir:79 226 EVEPHIDIINRINRAELQLLSTMAIQAFRQRALKSSEHRLPKVDENGN-----AIDYASIFEAAPGALWELPPG-VDIWE 299 (456) T ss_pred hhhhhHHHHHHHHHHHHHHHHHHHHHhhHHHHHhcCCccccccccccc-----ccchhhhhhhhccccccCCCC-cceee Confidence 5777777787888776554433333333222222211 111111111 1111222222222 3445544 444 4 Q ss_pred eeecCcCCHHHHHHHHHHHHhhhhcCceeeeeccCcccccccchhHHHHHHHHH---HHHHHHhhhHHHHHHHHHhhcC- Q lcl|NC_019725. 77 VLNSDISGVPEFLSSKMDRIVSLSGIHEIIIKNKNVGGVSASQNTALETFYKLV---DRKREEDYRPLLEFLLPFIVEE- 152 (237) Q Consensus 77 ~~~~~lsGl~dl~~~~~~~iaa~s~iP~t~L~G~sp~GlnatGe~D~~nyyd~I---~~~Qe~~l~p~l~~l~~~i~~s- 152 (237) .-.+++.+..+.+.....++++.+++|...|-|.+. |.||++=..-|...+ +.+| ..+++.|++++.+++.- T Consensus 300 ~~~~~~~~~~~~l~~~i~~i~~~t~~p~~~~~~~~~---N~Sg~Al~~~~~~l~~k~~~~~-~~f~~~l~~~~~l~~~~~ 375 (456) T protein:vir:79 300 SQTNDFTPMLSAIKEHIRQLSSATKTPLPMLMPDSA---NQSAEGAHNIEKGFLFKCEDRL-SIAKIGLEAILVKALQIE 375 (456) T ss_pred ecccChHHHHHHHHHHHHHHHhhcCCChhHhccccc---CcHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHhc Confidence 457888999999999999999999999999988652 446654444444443 3333 57899999999987632 Q ss_pred -----CCceeEeCCCCCCCHHHHHHHHHHHHHHHHHHHhCCCCCHHHHHHHHHhhccccccCCCCCCChhccccCCC-CC Q lcl|NC_019725. 153 -----EEWSIEFEPLSVPSKKEESEITKNNVESVTKAITEQIIDLEEARDTLRSIAPEFKLKDGNNINIREPEETTE-PE 226 (237) Q Consensus 153 -----~~~~~~f~pL~~~seke~Aei~~~~A~a~~~~~~~g~i~~~e~r~~l~~~~~~~g~~~~~~~~~~~~e~~~e-~~ 226 (237) .++.+.|.|...+|..+.| +++.+++++|+++.+.++..| |+.+ ..+...+.+...+ .+ T Consensus 376 g~~~~~~i~v~w~~~~~~s~~~~a-------da~~kl~~~G~~~~~~~~~~l-------g~~~-~~i~~~e~~r~~~e~~ 440 (456) T protein:vir:79 376 GESVEDTVDVSFESPDRVTLGEKY-------SAASLAKAAGESWASIRRNIL-------NYNA-DQIKQDDLDRAREQIT 440 (456) T ss_pred CCCccccceEEeCCCCCcCHHHHH-------HHHHHHHhcCCChHHHHHhcC-------CCCH-HHHHHHHHHHHHHHHH Confidence 3688899999999887764 455556677777765444322 2211 1111111110000 00 Q ss_pred CC----CCCCcCcCC Q lcl|NC_019725. 227 PG----LGEKLEDEN 237 (237) Q Consensus 227 ~~----~~~~~~~e~ 237 (237) .. ...+..+.+ T Consensus 441 ~~~~~~~~~~~~~~~ 455 (456) T protein:vir:79 441 LFAGNPVQRPQEDGS 455 (456) T ss_pred HHhhhHhhcCCCCCC Confidence 00 000111111 No 67 >protein:vir:94002 Length: 378 # NCBI annotation: putative portal protein # Family: family:all:2379 # MgeID: mge:1487 # MgeName: jj50 # Cross-refs: genbank:acc:YP_764318;genbank:gi:115315632;genbank:GeneID:5176589 Probab=98.64 E-value=1.8e-09 Score=68.53 Aligned_cols=200 Identities=14% Similarity=0.123 Sum_probs=100.5 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHhccceeechhHHHhhcCCchHHHHHHHHHH----HHHhcCchheeeeecCCccee Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRRKQQAVWKVKGLAEMCDDDDAQYAARLRLAQ----VDDNSGVGRAIGIDAETEEYD 76 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~~~~~v~k~~~l~~~~~~~~~e~~~~~r~~~----~~~~r~~~~~~~iD~~~e~~~ 76 (237) .++.+...+..+-.. +. --.++++++. +... ....+++++.- .....++.+++++++ +.+|. T Consensus 152 ~l~~~~~~i~~~~~~----~~-----~~gil~~~~~---l~~~-~~~~~~~~~~~~~~~~~~~~~~g~~~vl~~-g~~~~ 217 (378) T protein:vir:94 152 ILDNALASIQTKLEQ----GK-----LRGLLKINAF---LDID-NTQEYREKALTTIKNMQEGSSYNGLTPVDN-KTEIV 217 (378) T ss_pred HHHHHHHHHHHHHhc----cc-----ccceeeeCCc---CCHH-HHHHHHHHHHHHHHHhhcccccccceecCC-CceEE Confidence 444444444332211 11 1124555431 1111 11223333322 222223345677775 57888 Q ss_pred eeecCcCCHH-HHHHHHHHHHhhhhcCceeeeeccCcccccccchhHHHHHHHHHHHHHHHhhhHHHHHHHHHhh----c Q lcl|NC_019725. 77 VLNSDISGVP-EFLSSKMDRIVSLSGIHEIIIKNKNVGGVSASQNTALETFYKLVDRKREEDYRPLLEFLLPFIV----E 151 (237) Q Consensus 77 ~~~~~lsGl~-dl~~~~~~~iaa~s~iP~t~L~G~sp~GlnatGe~D~~nyyd~I~~~Qe~~l~p~l~~l~~~i~----~ 151 (237) .++.+..... .-.......||.+-|||..+|-|. + + +....+|| ..-|.|.+.++-.-+- . T Consensus 218 ~l~~~~~~~~~~~~~~~~~~Ia~~fgVP~~~l~~~-----~-s-e~~~~~f~-------~~tL~P~~~~ie~~l~~~Ll~ 283 (378) T protein:vir:94 218 ELKKDYSVLNKDEIDLIKSELLTGYFMNENILLGT-----A-S-QEQQIYFY-------NSTIIPLLIQLEKELTYKLIS 283 (378) T ss_pred EccCChhhhhHHHHHHHHHHHHHHhCCCHHHhcCC-----h-H-HHHHHHHH-------HHHHHHHHHHHHHHHHhhcCC Confidence 8776655443 233456678999999999888431 1 1 33444555 3458888876654432 1 Q ss_pred C------------CCceeEeCCCCCCCHHHHHHHHHHHHHHHHHHHhCCCCCHHHHHHHHHhhcccccc----CCCCCCC Q lcl|NC_019725. 152 E------------EEWSIEFEPLSVPSKKEESEITKNNVESVTKAITEQIIDLEEARDTLRSIAPEFKL----KDGNNIN 215 (237) Q Consensus 152 s------------~~~~~~f~pL~~~seke~Aei~~~~A~a~~~~~~~g~i~~~e~r~~l~~~~~~~g~----~~~~~~~ 215 (237) . .++.|++..|...|.+++ ++++.+++++|+++++|+|+.+ ...+..|- -+..-.. T Consensus 284 ~~er~~g~~~~~~~~~~f~~~~l~~~d~~~~-------~~~~~~~~~~G~~T~NE~R~~~-gl~p~~gGD~~~~~~n~~~ 355 (378) T protein:vir:94 284 TNRRRVVKGNLYYERIIVDNQLFKFATLKEL-------IDLYHENINGPIFTQNQLLVKM-GEQPIEGGDVYIANLNAVA 355 (378) T ss_pred hhHhhhhhhcccccceeecchhhhhcCHHHH-------HHHHHHHHhCCCcCHHHHHHHh-CCCCCCCCCeeeecccccc Confidence 1 136677778888887765 5667889999999999999976 33333221 1111111 Q ss_pred hhccccC--CCCCCCCCCCcCcC Q lcl|NC_019725. 216 IREPEET--TEPEPGLGEKLEDE 236 (237) Q Consensus 216 ~~~~e~~--~e~~~~~~~~~~~e 236 (237) .+...+. ......++++.+.| T Consensus 356 ~~~~~~~~~~~~~~~~~~e~~n~ 378 (378) T protein:vir:94 356 VKNLSDLQGSRKDVTSTDETNNQ 378 (378) T ss_pred cccchhhcCCcCCCCCCCCCCCC Confidence 1111111 11111222222222 No 68 >protein:vir:80796 Length: 574 # NCBI annotation: putative portal protein # Family: family:all:2446 # MgeID: mge:1885 # MgeName: phiEF24C # Cross-refs: genbank:acc:YP_001504121;genbank:gi:158079308;genbank:GeneID:5666445 Probab=98.63 E-value=1.5e-08 Score=63.53 Aligned_cols=224 Identities=14% Similarity=0.171 Sum_probs=114.2 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHhcc--ceeechhHHHhhcCCchHHHHHHHHHHHHHhc--Cchh-eeeeecCCcce Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRRKQQ--AVWKVKGLAEMCDDDDAQYAARLRLAQVDDNS--GVGR-AIGIDAETEEY 75 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~~~~--~v~k~~~l~~~~~~~~~e~~~~~r~~~~~~~r--~~~~-~~~iD~~~e~~ 75 (237) -++.+.+.|.....+......+...... .++++++-. .+ +.+....+++++. ..+. .|.+ +.++-.++-+| T Consensus 258 pi~~a~~~i~~~~~a~~~~~~~f~ng~~p~gil~~~~~~-~l-s~e~~~~lk~~~~--~~~~G~~n~g~~~vl~~~G~~~ 333 (574) T protein:vir:80 258 ELEIALKQFIAHENTEVFNDRFFSHGGTTRGILHVKTGQ-QQ-SQQALDIFRREWR--SSLAGINGSWQIPVVSAEDVKF 333 (574) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCC-CC-CHHHHHHHHHHHH--HHhccccccccceeecCCCceE Confidence 4577778888888888888887766433 235554211 11 1112223444443 2333 2333 33553444566 Q ss_pred eeeecCcCC--HHHHHHHHHHHHhhhhcCceeeeeccCcccccccchhH--HHHHHHHHHHHHHHhhhHHHHHHHHHhhc Q lcl|NC_019725. 76 DVLNSDISG--VPEFLSSKMDRIVSLSGIHEIIIKNKNVGGVSASQNTA--LETFYKLVDRKREEDYRPLLEFLLPFIVE 151 (237) Q Consensus 76 ~~~~~~lsG--l~dl~~~~~~~iaa~s~iP~t~L~G~sp~GlnatGe~D--~~nyyd~I~~~Qe~~l~p~l~~l~~~i~~ 151 (237) ..++.+... +-+........||.+-|||-..|--.+.+.+.++|... ..|.-..-..+.+..|.|.+.++-..|-+ T Consensus 334 ~~l~~s~~D~qfle~~~~~~~~Ia~afgVPp~~lG~~~~~t~~gs~~~~~n~sn~E~~~~~f~~~tL~P~~~~ie~~ln~ 413 (574) T protein:vir:80 334 VNMTPSANDMQFEKWLNYLINVISALYGIDPAEINFPNNGGATGSKGGSLNEGNSKEKMQASQNKGLQPLLRFIEDTVNT 413 (574) T ss_pred EEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhcccccccccccccccccchhHHHHHHHHHHHHHHHHHHHHHHHHHh Confidence 666544432 23455667889999999999988666665555444221 12223333445555678877776555421 Q ss_pred ------CCCceeEeCCCCCCCHHHHHHHHHHHHHHHHHHHhCCCCCHHHHHHHHHhhccccccC---CCCC---CCh--- Q lcl|NC_019725. 152 ------EEEWSIEFEPLSVPSKKEESEITKNNVESVTKAITEQIIDLEEARDTLRSIAPEFKLK---DGNN---INI--- 216 (237) Q Consensus 152 ------s~~~~~~f~pL~~~seke~Aei~~~~A~a~~~~~~~g~i~~~e~r~~l~~~~~~~g~~---~~~~---~~~--- 216 (237) ...+.|+|+...-.+..+++. ...++.+|+++++|+|+.+- ..+..|-. -..+ +.. T Consensus 414 ~Ll~~~~~~~~~~f~~~d~~~~~~~~~--------~~~~~~~G~lT~NE~R~~lg-l~Pi~gGD~~~~~~n~~~~~~~~~ 484 (574) T protein:vir:80 414 YIVAEFGEKYQFQFRGGDLSAQLDKLK--------IIEQEGKVFRTVNEIRHDKG-LEPIKGGDVILNGVHIQAIGQALQ 484 (574) T ss_pred hhhhhcCCceEEEecccchhhHHHHHH--------HHHHHhCCccCHHHHHHHhC-CCCCCCCCEeeeccceeecccccc Confidence 246788898776554443332 23467899999999999863 23322210 0000 000 Q ss_pred ----h-----------------ccccCCCCCCCCCCCcCcCC Q lcl|NC_019725. 217 ----R-----------------EPEETTEPEPGLGEKLEDEN 237 (237) Q Consensus 217 ----~-----------------~~e~~~e~~~~~~~~~~~e~ 237 (237) + +++..+.++|..++..+.|+ T Consensus 485 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~d~~~~ 526 (574) T protein:vir:80 485 EEQLEYQRSQDRLNRLLELSGGDVEQPEPEEPKDSQNDTDVS 526 (574) T ss_pred cccCCccchhccccccccccCCCCCCCCCCCCCCccccccch Confidence 0 00000001111111222222 No 69 >protein:vir:858 Length: 378 # NCBI annotation: putative portal protein # Family: family:all:2379 # MgeID: mge:18 # MgeName: bIL170 # Cross-refs: genbank:acc:NP_047117;genbank:gi:9630570;genbank:GeneID:1261758 Probab=98.63 E-value=2.5e-09 Score=67.75 Aligned_cols=200 Identities=15% Similarity=0.139 Sum_probs=98.4 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHhcc-ceeechhHHHhhcCCchHHHHHHHHHHH----HHhcCchheeeeecCCcce Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRRKQQ-AVWKVKGLAEMCDDDDAQYAARLRLAQV----DDNSGVGRAIGIDAETEEY 75 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~~~~-~v~k~~~l~~~~~~~~~e~~~~~r~~~~----~~~r~~~~~~~iD~~~e~~ 75 (237) .++...+.+..+ +...+. .++|+++ .+... ....+++++.-. ....++.+++++++ +.+| T Consensus 152 ~~~~a~~~~~~~----------~~~~~~~g~l~~~~---~l~~~-~~~~~~~~~~~~~~~~~~~~~~g~~~vl~~-g~~~ 216 (378) T protein:vir:85 152 ILDNALASIQTK----------LEQGKLRGLLKINA---FLDID-NTQEYREKALATIKNMQEGSSYNGLTPVDN-KTEI 216 (378) T ss_pred HHHHHHHHHHHH----------HhcCCcceEEEeCC---cCCHH-HHHHHHHHHHHHHHHhhcccccccceecCC-CceE Confidence 233222222211 111111 2344442 12211 122344444322 12223445667765 5889 Q ss_pred eeeecCcCCHH-HHHHHHHHHHhhhhcCceeeeeccCcccccccchhHHHHHHHHHHHHHHHhhhHHHHHHHHHhh---- Q lcl|NC_019725. 76 DVLNSDISGVP-EFLSSKMDRIVSLSGIHEIIIKNKNVGGVSASQNTALETFYKLVDRKREEDYRPLLEFLLPFIV---- 150 (237) Q Consensus 76 ~~~~~~lsGl~-dl~~~~~~~iaa~s~iP~t~L~G~sp~GlnatGe~D~~nyyd~I~~~Qe~~l~p~l~~l~~~i~---- 150 (237) ..++.+...++ +.+......||.+-|||..+|-|. ..+....+||. .-|.|.+.++-.-+- T Consensus 217 ~~l~~~~~~~~~~~~~~~~~~Ia~~fgVPp~~l~~s-------~~e~~~~~f~~-------~tL~P~~~~ie~~l~~kLl 282 (378) T protein:vir:85 217 VELKKDYSVLNKDEIELIKSELLTGYFMNENILLGT-------ATQEQQIYFYN-------STIIPLLIQLEKELTYKLI 282 (378) T ss_pred EeccCChhhhhHHHHHHHHHHHHHHhCCCHHHhcCC-------chHHHHHHHHH-------HHHHHHHHHHHHHHHhhcC Confidence 98877665444 223445568999999998887431 11223344443 458888877755442 Q ss_pred cC------------CCceeEeCCCCCCCHHHHHHHHHHHHHHHHHHHhCCCCCHHHHHHHHHhhcccccc----CCCCCC Q lcl|NC_019725. 151 EE------------EEWSIEFEPLSVPSKKEESEITKNNVESVTKAITEQIIDLEEARDTLRSIAPEFKL----KDGNNI 214 (237) Q Consensus 151 ~s------------~~~~~~f~pL~~~seke~Aei~~~~A~a~~~~~~~g~i~~~e~r~~l~~~~~~~g~----~~~~~~ 214 (237) .+ .++.|++..|...|.+++ ++++..++++|+++++|+|+.+ ...+..|. -+..-. T Consensus 283 ~~~er~~~~~~~~~~~~~f~~~~l~~~d~~~~-------~~~~~~~~~~G~~T~NE~R~~l-gl~p~~gGD~~~~~~N~~ 354 (378) T protein:vir:85 283 STNRRRVVKGNLYYERIIVDNQLFKFATLKEL-------IDLYHENINGPIFTQNQLLVKM-GEQPIEGGDIYIANLNAV 354 (378) T ss_pred ChhhhhhhhhccccceeeecchhhhhcCHHHH-------HHHHHHHHhCCCcCHHHHHHHh-CCCCCCCCCeEeeccccc Confidence 11 135566778888887765 5678889999999999999986 33333331 111111 Q ss_pred ChhccccCCCCCCCCCCCcCcCC Q lcl|NC_019725. 215 NIREPEETTEPEPGLGEKLEDEN 237 (237) Q Consensus 215 ~~~~~e~~~e~~~~~~~~~~~e~ 237 (237) ..++..+......+.....++.| T Consensus 355 ~~~~~~~~~~~~~~~~~~~e~~n 377 (378) T protein:vir:85 355 AVKNLSDLQGSRKDVASTDETNN 377 (378) T ss_pred ccccchhhcCccCCCCCCCCCCC Confidence 11122111111111111112222 No 70 >protein:vir:100650 Length: 395 # NCBI annotation: 77ORF008 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1476 # MgeName: 77 # Cross-refs: genbank:acc:NP_958604;genbank:gi:41189523;genbank:GeneID:2743796 Probab=98.62 E-value=1.1e-08 Score=64.29 Aligned_cols=207 Identities=15% Similarity=0.108 Sum_probs=97.2 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHhc-cceeechhHHHhhcCCchHHHHHHHHHHHHHhcCchhe-ee-eecCCcceee Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRRKQ-QAVWKVKGLAEMCDDDDAQYAARLRLAQVDDNSGVGRA-IG-IDAETEEYDV 77 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~~~-~~v~k~~~l~~~~~~~~~e~~~~~r~~~~~~~r~~~~~-~~-iD~~~e~~~~ 77 (237) .++.+...+.....+ ...... -.++++++ ...+++....++++++-.-..-...+. ++ +++ +.+|+. T Consensus 161 pi~~~~~~~~~~~~~------~~~~~~~~gii~~~~---~~~~~e~~~~~~~~~~~~~~~~~~~~~~v~~l~~-g~~~~~ 230 (395) T protein:vir:10 161 LFEDYGKIFGRMIGA------QLKNYQIRGILKSAS---SAYDEKNIEKLQAFTNKLFNTFNKNQLAIAPLIE-GFDYEE 230 (395) T ss_pred HHHHHHHHHHHHHHH------HHhcCCCceEEEeCC---CCCCHHHHHHHHHHHHHHhccccccCcceEEcCC-Cceeee Confidence 233332222211111 011111 12233321 111222222333333322112122222 33 444 577888 Q ss_pred eecCcCCH-------HHHHHHHHHHHhhhhcCceeeeeccCcccccccchhHHHHHHHHHHHHHHHhhhHHHHHHHHHhh Q lcl|NC_019725. 78 LNSDISGV-------PEFLSSKMDRIVSLSGIHEIIIKNKNVGGVSASQNTALETFYKLVDRKREEDYRPLLEFLLPFIV 150 (237) Q Consensus 78 ~~~~lsGl-------~dl~~~~~~~iaa~s~iP~t~L~G~sp~GlnatGe~D~~nyyd~I~~~Qe~~l~p~l~~l~~~i~ 150 (237) ++.+.... -+........||.+-|||-.+|-| =.++-+...++||. ..|.|.+.++-..+- T Consensus 231 l~~~~~~~~~~~~q~~e~~~~~~~~Ia~~f~VPp~~l~~-----~~sn~e~~~~~~~~-------~~l~P~~~~ie~~l~ 298 (395) T protein:vir:10 231 LSNGGKNSNMPFSELSELMRDAIKNVALMIGIPPGLIYG-----ETADLEKNTLVFEK-------FCLTPLLKKIQNELN 298 (395) T ss_pred ccccccccchhHHHHHHHHHHHHHHHHHHhCCCHHHhcC-----cccCHHHHHHHHHH-------HHHHHHHHHHHHHHH Confidence 77665443 334445667899999999887732 12222445666665 347777776654442 Q ss_pred c--------CCCceeEeCCCCCCCHHHHHHHHHHHHHHHHHHHhCCCCCHHHHHHHHHhhccccccC------CCCCCCh Q lcl|NC_019725. 151 E--------EEEWSIEFEPLSVPSKKEESEITKNNVESVTKAITEQIIDLEEARDTLRSIAPEFKLK------DGNNINI 216 (237) Q Consensus 151 ~--------s~~~~~~f~pL~~~seke~Aei~~~~A~a~~~~~~~g~i~~~e~r~~l~~~~~~~g~~------~~~~~~~ 216 (237) + ...+.|.|++|...+.+++ +++++.++++|+++++|+|+.+ ...+..|-. +..-... T Consensus 299 ~kL~~~~~~~~~~~f~~~~l~~~D~~~~-------~~~~~~~~~~G~lt~NE~R~~~-g~~p~~~g~~d~~~~~~n~~~~ 370 (395) T protein:vir:10 299 AKLITQSMYLKDTRIEIVGVNKKDPLQY-------AEAIDKLVSSGSFTRNEVRIML-GEEPSDNPELDEYLITKNYEKA 370 (395) T ss_pred HhhcChhhhcccceecchhhhccCHHHH-------HHHHHHHHhCCCcCHHHHHHHh-CCCCCCCCCCceeeeccccccc Confidence 1 1356788888888877764 5667778999999999999976 333332211 1100111 Q ss_pred hccc--cCC--CCCCCCCCCcCcCC Q lcl|NC_019725. 217 REPE--ETT--EPEPGLGEKLEDEN 237 (237) Q Consensus 217 ~~~e--~~~--e~~~~~~~~~~~e~ 237 (237) +..+ +.+ +..+.+|++.+.-+ T Consensus 371 ~~~~~~~~~~~~~~~kgg~~~~~g~ 395 (395) T protein:vir:10 371 NSGENDEKEKDENTLKGGDEDESGD 395 (395) T ss_pred cccccccCcccccccCCCCCCCCCC Confidence 1111 111 11122222222222 No 71 >protein:vir:9507 Length: 395 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:170 # MgeName: phiN315 # Cross-refs: genbank:acc:NP_835554;genbank:gi:30043953;genbank:GeneID:1260535 Probab=98.62 E-value=1.1e-08 Score=64.29 Aligned_cols=207 Identities=15% Similarity=0.108 Sum_probs=97.2 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHhc-cceeechhHHHhhcCCchHHHHHHHHHHHHHhcCchhe-ee-eecCCcceee Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRRKQ-QAVWKVKGLAEMCDDDDAQYAARLRLAQVDDNSGVGRA-IG-IDAETEEYDV 77 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~~~-~~v~k~~~l~~~~~~~~~e~~~~~r~~~~~~~r~~~~~-~~-iD~~~e~~~~ 77 (237) .++.+...+.....+ ...... -.++++++ ...+++....++++++-.-..-...+. ++ +++ +.+|+. T Consensus 161 pi~~~~~~~~~~~~~------~~~~~~~~gii~~~~---~~~~~e~~~~~~~~~~~~~~~~~~~~~~v~~l~~-g~~~~~ 230 (395) T protein:vir:95 161 LFEDYGKIFGRMIGA------QLKNYQIRGILKSAS---SAYDEKNIEKLQAFTNKLFNTFNKNQLAIAPLIE-GFDYEE 230 (395) T ss_pred HHHHHHHHHHHHHHH------HHhcCCCceEEEeCC---CCCCHHHHHHHHHHHHHHhccccccCcceEEcCC-Cceeee Confidence 233332222211111 011111 12233321 111222222333333322112122222 33 444 577888 Q ss_pred eecCcCCH-------HHHHHHHHHHHhhhhcCceeeeeccCcccccccchhHHHHHHHHHHHHHHHhhhHHHHHHHHHhh Q lcl|NC_019725. 78 LNSDISGV-------PEFLSSKMDRIVSLSGIHEIIIKNKNVGGVSASQNTALETFYKLVDRKREEDYRPLLEFLLPFIV 150 (237) Q Consensus 78 ~~~~lsGl-------~dl~~~~~~~iaa~s~iP~t~L~G~sp~GlnatGe~D~~nyyd~I~~~Qe~~l~p~l~~l~~~i~ 150 (237) ++.+.... -+........||.+-|||-.+|-| =.++-+...++||. ..|.|.+.++-..+- T Consensus 231 l~~~~~~~~~~~~q~~e~~~~~~~~Ia~~f~VPp~~l~~-----~~sn~e~~~~~~~~-------~~l~P~~~~ie~~l~ 298 (395) T protein:vir:95 231 LSNGGKNSNMPFSELSELMRDAIKNVALMIGIPPGLIYG-----ETADLEKNTLVFEK-------FCLTPLLKKIQNELN 298 (395) T ss_pred ccccccccchhHHHHHHHHHHHHHHHHHHhCCCHHHhcC-----cccCHHHHHHHHHH-------HHHHHHHHHHHHHHH Confidence 77665443 334445667899999999887732 12222445666665 347777776654442 Q ss_pred c--------CCCceeEeCCCCCCCHHHHHHHHHHHHHHHHHHHhCCCCCHHHHHHHHHhhccccccC------CCCCCCh Q lcl|NC_019725. 151 E--------EEEWSIEFEPLSVPSKKEESEITKNNVESVTKAITEQIIDLEEARDTLRSIAPEFKLK------DGNNINI 216 (237) Q Consensus 151 ~--------s~~~~~~f~pL~~~seke~Aei~~~~A~a~~~~~~~g~i~~~e~r~~l~~~~~~~g~~------~~~~~~~ 216 (237) + ...+.|.|++|...+.+++ +++++.++++|+++++|+|+.+ ...+..|-. +..-... T Consensus 299 ~kL~~~~~~~~~~~f~~~~l~~~D~~~~-------~~~~~~~~~~G~lt~NE~R~~~-g~~p~~~g~~d~~~~~~n~~~~ 370 (395) T protein:vir:95 299 AKLITQSMYLKDTRIEIVGVNKKDPLQY-------AEAIDKLVSSGSFTRNEVRIML-GEEPSDNPELDEYLITKNYEKA 370 (395) T ss_pred HhhcChhhhcccceecchhhhccCHHHH-------HHHHHHHHhCCCcCHHHHHHHh-CCCCCCCCCCceeeeccccccc Confidence 1 1356788888888877764 5667778999999999999976 333332211 1100111 Q ss_pred hccc--cCC--CCCCCCCCCcCcCC Q lcl|NC_019725. 217 REPE--ETT--EPEPGLGEKLEDEN 237 (237) Q Consensus 217 ~~~e--~~~--e~~~~~~~~~~~e~ 237 (237) +..+ +.+ +..+.+|++.+.-+ T Consensus 371 ~~~~~~~~~~~~~~~kgg~~~~~g~ 395 (395) T protein:vir:95 371 NSGENDEKEKDENTLKGGDEDESGD 395 (395) T ss_pred cccccccCcccccccCCCCCCCCCC Confidence 1111 111 11122222222222 No 72 >protein:vir:101289 Length: 395 # NCBI annotation: phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1591 # MgeName: phiNM3 # Cross-refs: genbank:acc:YP_908829;genbank:gi:118725093;genbank:GeneID:4555860 Probab=98.62 E-value=1.1e-08 Score=64.29 Aligned_cols=207 Identities=15% Similarity=0.108 Sum_probs=97.2 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHhc-cceeechhHHHhhcCCchHHHHHHHHHHHHHhcCchhe-ee-eecCCcceee Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRRKQ-QAVWKVKGLAEMCDDDDAQYAARLRLAQVDDNSGVGRA-IG-IDAETEEYDV 77 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~~~-~~v~k~~~l~~~~~~~~~e~~~~~r~~~~~~~r~~~~~-~~-iD~~~e~~~~ 77 (237) .++.+...+.....+ ...... -.++++++ ...+++....++++++-.-..-...+. ++ +++ +.+|+. T Consensus 161 pi~~~~~~~~~~~~~------~~~~~~~~gii~~~~---~~~~~e~~~~~~~~~~~~~~~~~~~~~~v~~l~~-g~~~~~ 230 (395) T protein:vir:10 161 LFEDYGKIFGRMIGA------QLKNYQIRGILKSAS---SAYDEKNIEKLQAFTNKLFNTFNKNQLAIAPLIE-GFDYEE 230 (395) T ss_pred HHHHHHHHHHHHHHH------HHhcCCCceEEEeCC---CCCCHHHHHHHHHHHHHHhccccccCcceEEcCC-Cceeee Confidence 233332222211111 011111 12233321 111222222333333322112122222 33 444 577888 Q ss_pred eecCcCCH-------HHHHHHHHHHHhhhhcCceeeeeccCcccccccchhHHHHHHHHHHHHHHHhhhHHHHHHHHHhh Q lcl|NC_019725. 78 LNSDISGV-------PEFLSSKMDRIVSLSGIHEIIIKNKNVGGVSASQNTALETFYKLVDRKREEDYRPLLEFLLPFIV 150 (237) Q Consensus 78 ~~~~lsGl-------~dl~~~~~~~iaa~s~iP~t~L~G~sp~GlnatGe~D~~nyyd~I~~~Qe~~l~p~l~~l~~~i~ 150 (237) ++.+.... -+........||.+-|||-.+|-| =.++-+...++||. ..|.|.+.++-..+- T Consensus 231 l~~~~~~~~~~~~q~~e~~~~~~~~Ia~~f~VPp~~l~~-----~~sn~e~~~~~~~~-------~~l~P~~~~ie~~l~ 298 (395) T protein:vir:10 231 LSNGGKNSNMPFSELSELMRDAIKNVALMIGIPPGLIYG-----ETADLEKNTLVFEK-------FCLTPLLKKIQNELN 298 (395) T ss_pred ccccccccchhHHHHHHHHHHHHHHHHHHhCCCHHHhcC-----cccCHHHHHHHHHH-------HHHHHHHHHHHHHHH Confidence 77665443 334445667899999999887732 12222445666665 347777776654442 Q ss_pred c--------CCCceeEeCCCCCCCHHHHHHHHHHHHHHHHHHHhCCCCCHHHHHHHHHhhccccccC------CCCCCCh Q lcl|NC_019725. 151 E--------EEEWSIEFEPLSVPSKKEESEITKNNVESVTKAITEQIIDLEEARDTLRSIAPEFKLK------DGNNINI 216 (237) Q Consensus 151 ~--------s~~~~~~f~pL~~~seke~Aei~~~~A~a~~~~~~~g~i~~~e~r~~l~~~~~~~g~~------~~~~~~~ 216 (237) + ...+.|.|++|...+.+++ +++++.++++|+++++|+|+.+ ...+..|-. +..-... T Consensus 299 ~kL~~~~~~~~~~~f~~~~l~~~D~~~~-------~~~~~~~~~~G~lt~NE~R~~~-g~~p~~~g~~d~~~~~~n~~~~ 370 (395) T protein:vir:10 299 AKLITQSMYLKDTRIEIVGVNKKDPLQY-------AEAIDKLVSSGSFTRNEVRIML-GEEPSDNPELDEYLITKNYEKA 370 (395) T ss_pred HhhcChhhhcccceecchhhhccCHHHH-------HHHHHHHHhCCCcCHHHHHHHh-CCCCCCCCCCceeeeccccccc Confidence 1 1356788888888877764 5667778999999999999976 333332211 1100111 Q ss_pred hccc--cCC--CCCCCCCCCcCcCC Q lcl|NC_019725. 217 REPE--ETT--EPEPGLGEKLEDEN 237 (237) Q Consensus 217 ~~~e--~~~--e~~~~~~~~~~~e~ 237 (237) +..+ +.+ +..+.+|++.+.-+ T Consensus 371 ~~~~~~~~~~~~~~~kgg~~~~~g~ 395 (395) T protein:vir:10 371 NSGENDEKEKDENTLKGGDEDESGD 395 (395) T ss_pred cccccccCcccccccCCCCCCCCCC Confidence 1111 111 11122222222222 No 73 >protein:vir:99452 Length: 651 # NCBI annotation: hypothetical protein # Family: family:all:1379 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:1595 # MgeName: BJ1 # Cross-refs: genbank:acc:YP_919077;genbank:gi:119757035;genbank:GeneID:4606105 Probab=98.62 E-value=3.1e-08 Score=61.72 Aligned_cols=216 Identities=13% Similarity=0.096 Sum_probs=107.8 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHhcc--ceeechhHHHhhcCCchHHHHHHHHHHHHHhcCchheeeeecCC------ Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRRKQQ--AVWKVKGLAEMCDDDDAQYAARLRLAQVDDNSGVGRAIGIDAET------ 72 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~~~~--~v~k~~~l~~~~~~~~~e~~~~~r~~~~~~~r~~~~~~~iD~~~------ 72 (237) -++.+.+.|.....+......+...... .++++.+ ..+ +.+....+++.++- ...+....+++..++ T Consensus 287 pl~~a~~~i~~a~~a~~~~~~~f~NG~~p~gil~~~~--~~l-s~e~~~~lr~~~~~--~~~nagk~~vL~~~~~~~~~~ 361 (651) T protein:vir:99 287 DWVSAIRTISADEAAKDYNRDFFDNDTIPRMVIKVTG--GEL-SEESKRDLRQMLNG--LREESHRAVVLEVEKFQSQLD 361 (651) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEecC--CCC-CHHHHHHHHHHHHH--HhccCCceEEeeccccccccc Confidence 4666667777777777777776665433 4565543 112 22223445555543 334444555554321 Q ss_pred --cceeeeecCcCCHH-----HHHHHHHHHHhhhhcCceeeeeccCcccccccchhHHHHHHHHHHHHHHHhhhHHHHHH Q lcl|NC_019725. 73 --EEYDVLNSDISGVP-----EFLSSKMDRIVSLSGIHEIIIKNKNVGGVSASQNTALETFYKLVDRKREEDYRPLLEFL 145 (237) Q Consensus 73 --e~~~~~~~~lsGl~-----dl~~~~~~~iaa~s~iP~t~L~G~sp~GlnatGe~D~~nyyd~I~~~Qe~~l~p~l~~l 145 (237) ..++....+++..+ +........||++-|||-..| |...++=.|+-|.....||.. .|.|.+.++ T Consensus 362 ~~~g~~~~pls~~~~~D~qfle~r~~~~~eIa~afgVPp~~l-G~~~~~~~sn~E~~~~~f~~~-------tL~P~~~~i 433 (651) T protein:vir:99 362 EDVEIELEPMGQGISEEMDFRQFREKNEHEIAKVLEVPPVKI-GVTDSANRSNSDQQDKDFALE-------VIQPEQHTF 433 (651) T ss_pred ccCCceEEEcCcCchhhHHHHHHHHHHHHHHHHHhCCCHHHh-ccCCCCCcccHHHHHHHHHHH-------HHHHHHHHH Confidence 13344444444433 334556778999999997665 655443233445556666554 366776666 Q ss_pred HHHhhc--------CCC--ceeEeCC--CCCCCHHHHHHHHHHHHHHHHHHHhCCCCCHHHHHHHHHhhccccccCCCCC Q lcl|NC_019725. 146 LPFIVE--------EEE--WSIEFEP--LSVPSKKEESEITKNNVESVTKAITEQIIDLEEARDTLRSIAPEFKLKDGNN 213 (237) Q Consensus 146 ~~~i~~--------s~~--~~~~f~p--L~~~seke~Aei~~~~A~a~~~~~~~g~i~~~e~r~~l~~~~~~~g~~~~~~ 213 (237) -..|-+ ..+ +.|+|+. |...+. +++++++..++++|+++++|+|+.+- ..+..+-.++.. T Consensus 434 e~eln~kLl~~~e~~~~~~i~~ef~~~~llr~D~-------~~~~e~~~~~i~~G~~T~NE~R~~lg-lppi~~~~gd~~ 505 (651) T protein:vir:99 434 AEWLYQIIHQQALGVTDWTIEYELRGADQPKQEA-------QLAEQRVRAMRLAGVGLVDEAREELG-LDPLGEPYGEMT 505 (651) T ss_pred HHHHHHhhcCccccccCceEEEEeccchhhhccH-------HHHHHHHHHHHhCCCcCHHHHHHHhC-CCCCCCcccccc Confidence 544421 123 4556654 444444 45567788899999999999999862 222111111111 Q ss_pred CCh---hccc-----------cCCCCCCCCCCCcCcCC Q lcl|NC_019725. 214 INI---REPE-----------ETTEPEPGLGEKLEDEN 237 (237) Q Consensus 214 ~~~---~~~e-----------~~~e~~~~~~~~~~~e~ 237 (237) +.. .... +.+.++...++.+.+.. T Consensus 506 l~~~~~~~~g~~~~gge~~~~~~~~~~~~~~~~e~~~~ 543 (651) T protein:vir:99 506 LSEFEAEVAGDVAGGGETEAVHEPPEENKIGEREWDTV 543 (651) T ss_pred ccccccccccccccCCCCcccccCccccccccchhhhh Confidence 100 0000 00000111111111000 No 74 >protein:vir:104259 Length: 403 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1504 # MgeName: T5 # Cross-refs: genbank:acc:YP_006980;genbank:gi:46401881;genbank:GeneID:2777676 Probab=98.61 E-value=2.8e-08 Score=61.98 Aligned_cols=213 Identities=15% Similarity=0.122 Sum_probs=110.7 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHhc--cceeechhHHHhhcCCchHHHHHHHHHHHHHhcCchh-eeeeecCCcceee Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRRKQ--QAVWKVKGLAEMCDDDDAQYAARLRLAQVDDNSGVGR-AIGIDAETEEYDV 77 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~~~--~~v~k~~~l~~~~~~~~~e~~~~~r~~~~~~~r~~~~-~~~iD~~~e~~~~ 77 (237) .++.+.+.+.....+......+..... -.|+++++ .++. +....+++++...-....|.+ .+++++ +-+|.. T Consensus 176 ~i~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~---~l~~-e~~~~~~~~~~~~~~g~~n~g~~~vl~~-g~~~~~ 250 (403) T protein:vir:10 176 RVATVIDSLEKRSKMLNFKEKFLDNGTVIGLILETDE---ILNK-KLRERKQEELQLDYNPSTGQSSVLILDG-GMKAKP 250 (403) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCC---CCCH-HHHHHHHHHHHHHhCCcccCcceeecCC-CceeEE Confidence 556677777777777777766654422 23566653 2222 223345555554333334434 455654 567887 Q ss_pred eecCcCC----HHHHHHHHHHHHhhhhcCceeeeeccCcccccccchhHHHHHHHHHHHHHHHhhhHHHHHHHHHhhcC- Q lcl|NC_019725. 78 LNSDISG----VPEFLSSKMDRIVSLSGIHEIIIKNKNVGGVSASQNTALETFYKLVDRKREEDYRPLLEFLLPFIVEE- 152 (237) Q Consensus 78 ~~~~lsG----l~dl~~~~~~~iaa~s~iP~t~L~G~sp~GlnatGe~D~~nyyd~I~~~Qe~~l~p~l~~l~~~i~~s- 152 (237) ++.+.+. +-+........||.+-|||...| |.+ -+++-+.....||. ..|.|.+.++-+.+-+. T Consensus 251 ~~~~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~l-g~~---~~sn~e~~~~~f~~-------~tl~P~~~~ie~~l~~~L 319 (403) T protein:vir:10 251 YSQISSFKDLDFKEDIEGFNKSICLAFGVPQVLL-DGG---NNANIRPNIELFYY-------MTIIPMLNKLTSSLTFFF 319 (403) T ss_pred ecccCCHHHHHHHHHHHHHHHHHHHHhCCCHHHc-CCC---CCcCHHHHHHHHHH-------HHHHHHHHHHHHHHHHhc Confidence 7654442 24455666788999999998765 532 22233344455554 44778777776655432 Q ss_pred -CCceeEeCCC--CCCCHHHHHHHHHHHHHHHHHHHhCCCCCHHHHHHHHHhhccccc-cCCCCCCC--hhccccCCCCC Q lcl|NC_019725. 153 -EEWSIEFEPL--SVPSKKEESEITKNNVESVTKAITEQIIDLEEARDTLRSIAPEFK-LKDGNNIN--IREPEETTEPE 226 (237) Q Consensus 153 -~~~~~~f~pL--~~~seke~Aei~~~~A~a~~~~~~~g~i~~~e~r~~l~~~~~~~g-~~~~~~~~--~~~~e~~~e~~ 226 (237) ..+.|+|+.+ ...+.+ ++++++..+++.|+++++|+|+.+- ..+... ..+..-+. ........... T Consensus 320 ~~~~~~d~~~~~~l~~D~~-------~~~~~~~~~~~~G~lT~NE~R~~~g-l~pi~~~~~d~~~~p~n~~~~~~~~~~~ 391 (403) T protein:vir:10 320 GYKITPNTKEVAALTPDKE-------AEAKHLTSLVNNGIITGNEARSELN-LEPLDDEQMNKIRIPANVAGSATGVSGQ 391 (403) T ss_pred CceeeeccchhhhcccCHH-------HHHHHHHHHHhCCCcCHHHHHHHhC-CCCCCcccccccccccccccccccCCCC Confidence 3455566644 333433 4578888999999999999999862 222111 00000000 00000000000 Q ss_pred CCCCCCcCcCC Q lcl|NC_019725. 227 PGLGEKLEDEN 237 (237) Q Consensus 227 ~~~~~~~~~e~ 237 (237) ++..+....|+ T Consensus 392 e~~~~~~~~~g 402 (403) T protein:vir:10 392 EGGRPKGSTEG 402 (403) T ss_pred cCCCCCCCcCC Confidence 11111111111 No 75 >protein:vir:1661 Length: 378 # NCBI annotation: unknown # Family: family:all:2379 # MgeID: mge:34 # MgeName: sk1 # Cross-refs: genbank:acc:NP_044950;genbank:gi:9629657;genbank:GeneID:1261302 Probab=98.59 E-value=2.6e-09 Score=67.69 Aligned_cols=200 Identities=14% Similarity=0.107 Sum_probs=101.1 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHhccceeechhHHHhhcCCchHHHHHHHHHH----HHHhcCchheeeeecCCccee Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRRKQQAVWKVKGLAEMCDDDDAQYAARLRLAQ----VDDNSGVGRAIGIDAETEEYD 76 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~~~~~v~k~~~l~~~~~~~~~e~~~~~r~~~----~~~~r~~~~~~~iD~~~e~~~ 76 (237) .++.+...+..+-. + .+--.++++++. +... ....+++++.. .....+..+++++++ +.+|+ T Consensus 152 ~l~~~~~~i~~~~~---~------~~~~g~l~~~~~---l~~~-~~~~~~~~~~~~~~~~~~~~~~g~~~vl~~-g~~~~ 217 (378) T protein:vir:16 152 ILDNALASIQTKLE---Q------GKLRGLLKINAF---LDID-NTQEYREKALTTIKNMQEGSSYNGLTPVDN-KTEIV 217 (378) T ss_pred HHHHHHHHHHHHHh---c------CccceeeEeCCc---CCHH-HHHHHHHHHHHHHHHhhcccccccceEcCC-CceEE Confidence 33333333322111 0 011124454421 1111 11223333332 222223345677775 57888 Q ss_pred eeecCcCCHH-HHHHHHHHHHhhhhcCceeeeeccCcccccccchhHHHHHHHHHHHHHHHhhhHHHHHHHHHhhc---C Q lcl|NC_019725. 77 VLNSDISGVP-EFLSSKMDRIVSLSGIHEIIIKNKNVGGVSASQNTALETFYKLVDRKREEDYRPLLEFLLPFIVE---E 152 (237) Q Consensus 77 ~~~~~lsGl~-dl~~~~~~~iaa~s~iP~t~L~G~sp~GlnatGe~D~~nyyd~I~~~Qe~~l~p~l~~l~~~i~~---s 152 (237) .++.+..... .-.......||.+-|||..+|-|.. .+....+||. .-|.|.+.++-.-+-+ + T Consensus 218 ~l~~~~~~~~~~~~~~~~~~Ia~~fgVPp~~l~g~~-------~e~~~~~f~~-------~tl~P~~~~ie~~l~~kLl~ 283 (378) T protein:vir:16 218 ELKKDYSVLNKDEIDLIKSELLTGYFMNENILLGTA-------SQEQQIYFYN-------STIIPLLIQLEKELTYKLIS 283 (378) T ss_pred EccCChhhhhHHHHHHHHHHHHHHhCCCHHHhcCCc-------hHHHHHHHHH-------HHHHHHHHHHHHHHHhhcCC Confidence 8877654322 2335666899999999998884321 1334445543 4578877776554421 1 Q ss_pred -------------CCceeEeCCCCCCCHHHHHHHHHHHHHHHHHHHhCCCCCHHHHHHHHHhhcccccc----CCCCCCC Q lcl|NC_019725. 153 -------------EEWSIEFEPLSVPSKKEESEITKNNVESVTKAITEQIIDLEEARDTLRSIAPEFKL----KDGNNIN 215 (237) Q Consensus 153 -------------~~~~~~f~pL~~~seke~Aei~~~~A~a~~~~~~~g~i~~~e~r~~l~~~~~~~g~----~~~~~~~ 215 (237) .++.|++..|...|.++++ +++..++++|+++++|+|+.+ ...+..|- .+..... T Consensus 284 ~~e~~~~~~~~~~~~~~f~~~~l~~~d~~~~~-------~~~~~~~~~G~~T~NE~R~~~-g~~p~~ggD~~~~~~n~~~ 355 (378) T protein:vir:16 284 TNRRRVVKGNLYYERIIVDNQLFKFATLKELI-------DLYHENINGPIFTQNQLLVKM-GEQPIEGGDVYIANLNAVA 355 (378) T ss_pred hhhhhhhhhcccccceeeccchhhhcCHHHHH-------HHHHHHHhCCCcCHHHHHHHh-CCCCCCCCCeEeecccccc Confidence 2467788889888888764 557889999999999999976 23332221 0111111 Q ss_pred hhccc--cCCCCCCCCCCCcCcC Q lcl|NC_019725. 216 IREPE--ETTEPEPGLGEKLEDE 236 (237) Q Consensus 216 ~~~~e--~~~e~~~~~~~~~~~e 236 (237) .++.. +....+..++++.+.| T Consensus 356 ~~~~~~~~~~~~~~~~~~e~~ne 378 (378) T protein:vir:16 356 VKNLSDLQGSRKDVTSTDETNNQ 378 (378) T ss_pred ccchhhhcCccCCCCCCCCCCCC Confidence 11111 1112222233333333 No 76 >protein:vir:9641 Length: 395 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:173 # MgeName: 315.1 # Cross-refs: genbank:acc:NP_795403;genbank:gi:28876176;genbank:GeneID:1257709 Probab=98.57 E-value=2.9e-08 Score=61.90 Aligned_cols=212 Identities=14% Similarity=0.058 Sum_probs=96.2 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHhccceeechhHHH---hhcCCchHHHHHHHHHHHH-HhcCchhe-eeeecCCcce Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRRKQQAVWKVKGLAE---MCDDDDAQYAARLRLAQVD-DNSGVGRA-IGIDAETEEY 75 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~~~~~v~k~~~l~~---~~~~~~~e~~~~~r~~~~~-~~r~~~~~-~~iD~~~e~~ 75 (237) +.+..-+.+............ .++....++..+... ...++.......+.++.+. ..+++.+. +.+++ +-+| T Consensus 163 ~~~~~~~~~~~~i~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~l~~-g~~~ 239 (395) T protein:vir:96 163 LWEEYGELLGHVINNQKIANQ--IRFTMTPPKDKVRERAQENSDGGRQPKSDKDFFKRTIEKIRTESVVGIPVTA-NTNY 239 (395) T ss_pred ccchHHHHHHHHHHHHHHHHH--HHHHhhhcccccccceeeccCchhhHHHHHHHHHHHHHHhhcCCcceEEccC-Ccee Confidence 222221211111111000000 011222222221100 1111212223333333332 22333332 33443 4567 Q ss_pred eeeecCcCCH--------HHHHHHHHHHHhhhhcCceeeeeccCcccccccchhHHHHHHHHHHHHHHHhhhHHHHHHHH Q lcl|NC_019725. 76 DVLNSDISGV--------PEFLSSKMDRIVSLSGIHEIIIKNKNVGGVSASQNTALETFYKLVDRKREEDYRPLLEFLLP 147 (237) Q Consensus 76 ~~~~~~lsGl--------~dl~~~~~~~iaa~s~iP~t~L~G~sp~GlnatGe~D~~nyyd~I~~~Qe~~l~p~l~~l~~ 147 (237) ..++.+.... .++.....+.||.+-|||..+|-| .. ++-+.....||. ..|.|.+.++-. T Consensus 240 ~~l~~~~~d~q~~e~~~~~~~~~~~~~eIa~~fgVPp~~l~~----~~-sn~e~~~~~f~~-------~~L~P~~~~ie~ 307 (395) T protein:vir:96 240 EEYGSKNTGSVKSYVDDIKKLKDQYMAEFAEMLGIPISLLHG----DI-ADNQKNYELLLE-------GPIESLITNIVD 307 (395) T ss_pred EecccChhhhhhhhHHHHHHHHHHHHHHHHHHhCCCHHHhcC----CC-ccHHHHHHHHHH-------HHHHHHHHHHHH Confidence 7666554332 222334457899999999998732 11 223334555665 347777766654 Q ss_pred Hhh--------cCCCceeEeCCCCCCCHHHHHHHHHHHHHHHHHHHhCCCCCHHHHHHHHHhhccccccCCCCCCChhcc Q lcl|NC_019725. 148 FIV--------EEEEWSIEFEPLSVPSKKEESEITKNNVESVTKAITEQIIDLEEARDTLRSIAPEFKLKDGNNINIREP 219 (237) Q Consensus 148 ~i~--------~s~~~~~~f~pL~~~seke~Aei~~~~A~a~~~~~~~g~i~~~e~r~~l~~~~~~~g~~~~~~~~~~~~ 219 (237) .+- +..++.|.|++|...|.++++ ++++.++++|+++++|+|+.+- ..+..|-.++.-.-.... T Consensus 308 ~l~~~Ll~~~e~~~~~~f~~~~l~~~d~~~~~-------~~~~~~~~~G~~T~NE~R~~~g-l~pi~~~~gD~~~~~~N~ 379 (395) T protein:vir:96 308 GLEYAIFDKSETLEGSFIKVTGLKNYDLFSIS-------SQADKLISSGFVFIDEVREEIG-LPELPDGLGKVLYMTKNY 379 (395) T ss_pred HHHhhcCChhhhcCceeEeecchhccCHHHHH-------HHHHHHHhCCCcCHHHHHHHhC-CCCCCCCCCceeeecccc Confidence 432 124677889888888776655 5567789999999999999763 233222111000000000 Q ss_pred ccCCCCCCCCCCCcCcCC Q lcl|NC_019725. 220 EETTEPEPGLGEKLEDEN 237 (237) Q Consensus 220 e~~~e~~~~~~~~~~~e~ 237 (237) .+-++.|+.+..+.|| T Consensus 380 --~~~~~~gge~~~~~~~ 395 (395) T protein:vir:96 380 --ESVLERGGEVDEEVET 395 (395) T ss_pred --eechhccCCCCCCCCC Confidence 1112233344444455 No 77 >protein:vir:3868 Length: 417 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:82 # MgeName: A2 # Cross-refs: genbank:acc:NP_680485;swissprot:trembl:q8ltc2;genbank:gi:22296525;interpro:IPR006427;interpro:IPR006944;uniprot:Q8LTC2;genbank:GeneID:951699 Probab=98.57 E-value=2.4e-08 Score=62.32 Aligned_cols=212 Identities=16% Similarity=0.152 Sum_probs=106.8 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHhccc--eeechhHHHhhcCCchHHHHHHHHHHHHHhcCchheeeeecCCcceeee Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRRKQQA--VWKVKGLAEMCDDDDAQYAARLRLAQVDDNSGVGRAIGIDAETEEYDVL 78 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~~~~~--v~k~~~l~~~~~~~~~e~~~~~r~~~~~~~r~~~~~~~iD~~~e~~~~~ 78 (237) .++.+.+.|.....+......+....... ++++++ .++. +...+++++++......+..+.+++++ +.+|+.+ T Consensus 176 ~l~~~~~~i~~~~~~~~~~~~~f~ng~~p~~il~~~~---~l~~-e~~~~~~~~~~~~~~g~n~g~~~vl~~-g~~~~~l 250 (417) T protein:vir:38 176 PLLSLGDEIGLQESGVSTLQKFFKSGLKGSIIKAKES---RLSA-EARQKIREDFERAQAGADAGSPIIVDA-TMDYQPL 250 (417) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhccCCCcEEEEeCC---CCCH-HHHHHHHHHHHHHhcccccCCceeccC-CceEEEc Confidence 56777777777777777777766543222 444432 2222 234567777765544443344566664 5788887 Q ss_pred ecCcCCH--HHHHHHHHHHHhhhhcCceeeeeccCcccccccchhHHHHHHHHHHHHHHHhhhHHHHHHHHHhh-----c Q lcl|NC_019725. 79 NSDISGV--PEFLSSKMDRIVSLSGIHEIIIKNKNVGGVSASQNTALETFYKLVDRKREEDYRPLLEFLLPFIV-----E 151 (237) Q Consensus 79 ~~~lsGl--~dl~~~~~~~iaa~s~iP~t~L~G~sp~GlnatGe~D~~nyyd~I~~~Qe~~l~p~l~~l~~~i~-----~ 151 (237) +.+...+ -+........||.+-|||..+| |.+.. +++-+.-...|| +..|.|.+.++-..+- . T Consensus 251 ~~~~~d~q~le~~~~~~~~Ia~~fgVPp~~l-g~~~~--~s~~e~~~~~~~-------~~tl~P~~~~ie~~l~~~Ll~~ 320 (417) T protein:vir:38 251 EVDTNVLNLINSNNYSTAQIAKALRVPAYRL-AQNSP--NQSVKQLADDYI-------RNDLPFYFEPITSEFELKLLDD 320 (417) T ss_pred cCCHHHHHHHHHHHhhHHHHHHHhCCCHHHh-CCCCc--chhHHHHHHHHH-------HHHHHHHHHHHHHHHHhhhcCh Confidence 6554322 1234444677999999998887 43221 122233334444 3457777776644432 1 Q ss_pred C--CCceeEeCCCCCCCHHHHHHHHHHHHHHHHHHHhCCCCCHHHHHHHHHhhccccc-cCCC-----C--CCChhcccc Q lcl|NC_019725. 152 E--EEWSIEFEPLSVPSKKEESEITKNNVESVTKAITEQIIDLEEARDTLRSIAPEFK-LKDG-----N--NINIREPEE 221 (237) Q Consensus 152 s--~~~~~~f~pL~~~seke~Aei~~~~A~a~~~~~~~g~i~~~e~r~~l~~~~~~~g-~~~~-----~--~~~~~~~e~ 221 (237) . .++.|+|+. ..+... ....++.++++|+++++|+|+.+- ..+..| ..+. . .++..+..+ T Consensus 321 ~~~~~~~~~fd~-~~l~~~--------~~~~~~~~~~~G~~T~NE~R~~~g-l~pi~~g~~d~~~~~~n~~~~d~~~~~~ 390 (417) T protein:vir:38 321 AQRHQYCIGFDT-KSVNGL--------PIADVNTAVNGGLWTGNEGRAELG-KKPLKDPNMDRIQSTLNTVFLDQKEAYQ 390 (417) T ss_pred hhcccceEEech-hhhhHH--------HHHHHHHHHhCCCcCHHHHHHHhC-CCCCCCCCCCeeeecccccccccccccc Confidence 1 356777752 112222 223356678999999999999873 333222 1100 0 011111111 Q ss_pred CC-CCCCCCCCC---------cCcCC Q lcl|NC_019725. 222 TT-EPEPGLGEK---------LEDEN 237 (237) Q Consensus 222 ~~-e~~~~~~~~---------~~~e~ 237 (237) .+ ..+..+|+. -+++| T Consensus 391 ~~~~~~~kgg~~~~~~~~~~~~~~~~ 416 (417) T protein:vir:38 391 AEHAAELKGGDTNAKGNQNGSGTNAN 416 (417) T ss_pred cccccccCCCCCCCCCCCcCCCCcCC Confidence 11 111111111 11111 No 78 >protein:vir:94426 Length: 409 # NCBI annotation: ORF009 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1506 # MgeName: 47 # Cross-refs: genbank:acc:YP_240003;genbank:gi:66395665;genbank:GeneID:5133086 Probab=98.57 E-value=2.5e-08 Score=62.28 Aligned_cols=213 Identities=10% Similarity=0.079 Sum_probs=105.6 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHhccceeechhHHHhhcCCchHHHHHHHHHHHHHhcCchheeeeecCCcceeeeec Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRRKQQAVWKVKGLAEMCDDDDAQYAARLRLAQVDDNSGVGRAIGIDAETEEYDVLNS 80 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~~~~~v~k~~~l~~~~~~~~~e~~~~~r~~~~~~~r~~~~~~~iD~~~e~~~~~~~ 80 (237) .+..+.+.+.....+......-.....--+++.+. .+ +++....+++++. +.+.++.+.+++++ +-+|..++. T Consensus 181 ~l~~~~~~i~~~~~~~~~~~~~~~~~~~~i~~~~~---~l-~~e~~~~~~~~~~--~~~~~~g~~~vl~~-g~~~~~l~~ 253 (409) T protein:vir:94 181 PIDVLKNTTDFDNAVRTFNLTEMQKPDSFMLKYGS---NV-GKEKRQQVLEDFK--QYYEENGGILFQEP-GVEIEPLPK 253 (409) T ss_pred HHHHHHHHHHHHHHHHHHHHHhcCCCCeeEEecCC---CC-CHHHHHHHHHHHH--HHhhcCCCeeecCC-CceEEEcCC Confidence 23444343333332222211111111111222221 11 1122223444443 33455555666654 577877765 Q ss_pred CcC--CHHHHHHHHHHHHhhhhcCceeeeeccCcccccccchhHHHHHHHHHHHHHHHhhhHHHHHHHHHhh-------- Q lcl|NC_019725. 81 DIS--GVPEFLSSKMDRIVSLSGIHEIIIKNKNVGGVSASQNTALETFYKLVDRKREEDYRPLLEFLLPFIV-------- 150 (237) Q Consensus 81 ~ls--Gl~dl~~~~~~~iaa~s~iP~t~L~G~sp~GlnatGe~D~~nyyd~I~~~Qe~~l~p~l~~l~~~i~-------- 150 (237) +.. .+-+........||.+-|||-.+|-+...+..+ +-+.-...||..+ |.|.++++-..+- T Consensus 254 ~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~s-n~e~~~~~f~~~~-------l~P~~~~ie~~ln~~Ll~~~~ 325 (409) T protein:vir:94 254 KYVSEDIVASENLTRERVANVFQLPSVFLNARSNTNFA-KNEELNRFYLQHT-------LLPIVKQYEEEFNRKLLTKTD 325 (409) T ss_pred ChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcc-cHHHHHHHHHHHH-------HHHHHHHHHHHHHHhhCCccc Confidence 432 233344556788999999999988654433332 3344455666543 7887777754442 Q ss_pred cCCCceeEeC--CCCCCCHHHHHHHHHHHHHHHHHHHhCCCCCHHHHHHHHHhhcccccc----CCCCCCChhccccCCC Q lcl|NC_019725. 151 EEEEWSIEFE--PLSVPSKKEESEITKNNVESVTKAITEQIIDLEEARDTLRSIAPEFKL----KDGNNINIREPEETTE 224 (237) Q Consensus 151 ~s~~~~~~f~--pL~~~seke~Aei~~~~A~a~~~~~~~g~i~~~e~r~~l~~~~~~~g~----~~~~~~~~~~~e~~~e 224 (237) +..+..|+|+ +|...|.+++ ++++.+++++|+++++|+|+.+ ...+..|- .+......+.+++. . T Consensus 326 ~~~~~~i~fd~~~ll~~d~~~~-------~~~~~~~~~~G~~T~NE~R~~~-g~~p~~ggD~~~~~~n~~~~~~~~~~-~ 396 (409) T protein:vir:94 326 REKNRYFKFNVKSYLRADSATQ-------AEVYFKAVRSGYYTINDIREWE-DLPPVEGGDKPLISGDLYPIDTPLEL-R 396 (409) T ss_pred ccCcceEEeechhhhccCHHHH-------HHHHHHHHhCCCcCHHHHHHHh-CCCCCCCcCeEeecccccccccchhh-c Confidence 1234556654 6666666554 5677889999999999999976 23332221 01111111222111 1 Q ss_pred CCCCCCCCcCcCC Q lcl|NC_019725. 225 PEPGLGEKLEDEN 237 (237) Q Consensus 225 ~~~~~~~~~~~e~ 237 (237) ...++|++...|. T Consensus 397 ~~~kGG~~n~~e~ 409 (409) T protein:vir:94 397 KSLKGGDKNVNES 409 (409) T ss_pred ccccCCCCCcCCC Confidence 2234444444444 No 79 >protein:vir:100150 Length: 437 # NCBI annotation: gp3 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1639 # MgeName: phi1026b # Cross-refs: genbank:acc:NP_945033;genbank:gi:38707893;genbank:GeneID:2744197 Probab=98.55 E-value=2.9e-08 Score=61.89 Aligned_cols=216 Identities=11% Similarity=0.115 Sum_probs=115.7 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHhc--cceeechhHHHhhcCCchHHHHHHHHHHHHHhcCc-hheeeeecCCcceee Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRRKQ--QAVWKVKGLAEMCDDDDAQYAARLRLAQVDDNSGV-GRAIGIDAETEEYDV 77 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~~~--~~v~k~~~l~~~~~~~~~e~~~~~r~~~~~~~r~~-~~~~~iD~~~e~~~~ 77 (237) -++.+.+.|.....+......+..... -.++++++ .+.. +....+++++.-....-.| .+.+++++ +-+|.. T Consensus 190 pi~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~---~l~~-e~~~~~~~~~~~~~~g~~nag~~~vl~~-g~~~~~ 264 (437) T protein:vir:10 190 PIQYAREVLGNSTAANKTSASVFRNGLRPSGVLSTDQ---ILQK-EKRAEIRTDLAEQFGGAMQAGKTMVLEA-GMKYQA 264 (437) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEcCC---CCCH-HHHHHHHHHHHHHhcCccccCcceeccC-CceEEe Confidence 578888888888888888888776643 24566653 2222 2233455555432222233 34566665 577888 Q ss_pred eecCcC--CHHHHHHHHHHHHhhhhcCceeeeeccCccc-cccc-chhHHHHHHHHHHHHHHHhhhHHHHHHHHHhhc-- Q lcl|NC_019725. 78 LNSDIS--GVPEFLSSKMDRIVSLSGIHEIIIKNKNVGG-VSAS-QNTALETFYKLVDRKREEDYRPLLEFLLPFIVE-- 151 (237) Q Consensus 78 ~~~~ls--Gl~dl~~~~~~~iaa~s~iP~t~L~G~sp~G-lnat-Ge~D~~nyyd~I~~~Qe~~l~p~l~~l~~~i~~-- 151 (237) ++.+.. .+-+........||.+-|||-..| |...++ .+.+ -+.-.+.||. ..|.|.+.++-..+-+ T Consensus 265 l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~l-g~~~~~t~~~sn~e~~~~~f~~-------~tl~P~~~~ie~~l~~kl 336 (437) T protein:vir:10 265 ITMNPGDVQLLETRAFNIEEICRWYRVPPFMV-GHSEKSTSWGTGIEQQTLGFLT-------FTLRPWLTRIEQAARRSL 336 (437) T ss_pred ccCChhhHHHHHHHHHHHHHHHHHhCCCHHHh-CCCCCcccccchHHHHHHHHHH-------HHHHHHHHHHHHHHHhhc Confidence 766543 334555567789999999997777 544332 3222 2334445553 4477877776555432 Q ss_pred --C---CC--ceeEeCCCCCCCHHHHHHHHHHHHHHHHHHHhCCCCCHHHHHHHHHhhccccccC-----CCCCCChhcc Q lcl|NC_019725. 152 --E---EE--WSIEFEPLSVPSKKEESEITKNNVESVTKAITEQIIDLEEARDTLRSIAPEFKLK-----DGNNINIREP 219 (237) Q Consensus 152 --s---~~--~~~~f~pL~~~seke~Aei~~~~A~a~~~~~~~g~i~~~e~r~~l~~~~~~~g~~-----~~~~~~~~~~ 219 (237) . .. |.|.+..|...|.+++++ ++..++++|+++++|+|+.+ ...+..|-. ...-...+.. T Consensus 337 l~~~e~~~~~~~fd~~~ll~~d~~~r~~-------~~~~~~~~G~~T~NE~R~~~-gl~pi~gg~~~~~~~~~~~~~~~~ 408 (437) T protein:vir:10 337 LRPGERDQFYAEFSVEGLLRADSAGRAA-------FYSTMTQNGLMTRDECRAKE-NLPPMGGNAAVLTVQSALLPIDKL 408 (437) T ss_pred cCccccCceEEEEechhhhccCHHHHHH-------HHHHHHhCCCcCHHHHHHHh-CCCCCCCCcceEeecCcccchhhc Confidence 1 22 445556777777766654 67888999999999999987 333322210 0000111111 Q ss_pred cc-CCC---CC---CCCCCCcCc---CC Q lcl|NC_019725. 220 EE-TTE---PE---PGLGEKLED---EN 237 (237) Q Consensus 220 e~-~~e---~~---~~~~~~~~~---e~ 237 (237) .+ .+. .+ .+.+.++++ |. T Consensus 409 ~~~~~~~~~~~~~~~~~~~~~~~~~~~e 436 (437) T protein:vir:10 409 GEHTTATAAQDALKAWLYQEEKTRATQE 436 (437) T ss_pred cCcCCCcchhccccccCCCCCCCCcccc Confidence 11 000 00 000111111 11 No 80 >protein:vir:4089 Length: 395 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:86 # MgeName: 2389 # Cross-refs: genbank:acc:NP_510984;swissprot:trembl:q8w606;genbank:gi:17488506;uniprot:Q8W606;genbank:GeneID:1260314 Probab=98.54 E-value=2.9e-08 Score=61.90 Aligned_cols=210 Identities=13% Similarity=0.088 Sum_probs=94.5 Q ss_pred CchhHHHHHHHHHHHHH-HHHHHHHHhcc-ceeechhHHHhhcCCchHHHHHHHHHHH-HHhcCchh-eeeeecCCccee Q lcl|NC_019725. 1 MNKSLIDAICDYDYCES-LATQILRRKQQ-AVWKVKGLAEMCDDDDAQYAARLRLAQV-DDNSGVGR-AIGIDAETEEYD 76 (237) Q Consensus 1 llq~~~d~v~~~~~~~~-~~~~Ll~~~~~-~v~k~~~l~~~~~~~~~e~~~~~r~~~~-~~~r~~~~-~~~iD~~~e~~~ 76 (237) .+..++. .+..... ......+.... .+++++. .... +++....+++++.-. ....++.+ ++++++ +-+|+ T Consensus 162 ~~~~l~~---~~~~~~~~~~~~~~~~~~~~~~l~~~~-~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~vl~~-g~~~~ 235 (395) T protein:vir:40 162 IIDGFYL---LYGDLLTAAVNKYKKLNSRKIIVKLKA-MFGQ-TPEAEEKLRLMLSERMKKFLAEGDSALPVED-GMEID 235 (395) T ss_pred cchhHHH---HHHHHHHHHHHHHHhcCCCCceEEEec-ccCC-CHHHHHHHHHHHHHHHHHhhccCCceeecCC-CceEE Confidence 1111111 1111111 11111222211 1222211 0111 222233444444322 22233444 455664 57899 Q ss_pred eeecCcCCHHHHH-----HHHHHHHhhhhcCceeeeeccCcccccccchhHHHHHHHHHHHHHHHhhhHHHHHHHHHhh- Q lcl|NC_019725. 77 VLNSDISGVPEFL-----SSKMDRIVSLSGIHEIIIKNKNVGGVSASQNTALETFYKLVDRKREEDYRPLLEFLLPFIV- 150 (237) Q Consensus 77 ~~~~~lsGl~dl~-----~~~~~~iaa~s~iP~t~L~G~sp~GlnatGe~D~~nyyd~I~~~Qe~~l~p~l~~l~~~i~- 150 (237) .++.+.....-+- ..+...||.+-|||..+|-| . .++-+.-...|| +..|.|.++++-.-+- T Consensus 236 ~l~~~~~d~q~~e~~~~~~~~~~~Ia~~fgVPp~~l~~----~-~sn~e~~~~~f~-------~~~L~P~~~~ie~~l~~ 303 (395) T protein:vir:40 236 ELAGDSKIAESRDIKKMIDDVFEMVANSFNIPLGLAKG----D-TVGLSEQVNSFL-------MFSINPIAEMFTDEGNR 303 (395) T ss_pred eccCChhhhhHHHHHHHHHHHHHHHHHHhCCCHHHhcC----C-CcCHHHHHHHHH-------HHHHHHHHHHHHHHHHH Confidence 8887766543221 22346899999999988732 1 111222333444 3457777776644332 Q ss_pred -------cCCCc--eeEeCCCCCCCHHHHHHHHHHHHHHHHHHHhCCCCCHHHHHHHHHhhccccccCCC---CCCChhc Q lcl|NC_019725. 151 -------EEEEW--SIEFEPLSVPSKKEESEITKNNVESVTKAITEQIIDLEEARDTLRSIAPEFKLKDG---NNINIRE 218 (237) Q Consensus 151 -------~s~~~--~~~f~pL~~~seke~Aei~~~~A~a~~~~~~~g~i~~~e~r~~l~~~~~~~g~~~~---~~~~~~~ 218 (237) +..++ .|++.+|...|.++++ +++.+++++|+++++|+|+.+ ...+..|..++ ...+... T Consensus 304 kLl~~~~~~~g~~i~fd~~~ll~~d~~~~~-------~~~~~~~~~G~~t~NE~R~~~-g~~pi~~~~gD~~~~~~n~~~ 375 (395) T protein:vir:40 304 KFYGRDSVLERTYMKLDTTRIKVQDIQEIA-------SSMDVLFHIGVNTIDDNLRMI-GREPVMSPETQERFVTKNYAP 375 (395) T ss_pred hcCChhhhcCCceEEEechhhhccCHHHHH-------HHHHHHHhCCCCCHHHHHHHh-CCCCCCCCCCceeeecccccc Confidence 11234 4445678777777766 467779999999999999876 23332221111 0111111 Q ss_pred cccCCCCCCCCCCCcCcCC Q lcl|NC_019725. 219 PEETTEPEPGLGEKLEDEN 237 (237) Q Consensus 219 ~e~~~e~~~~~~~~~~~e~ 237 (237) .+.. +.....|+..+.++ T Consensus 376 ~~~~-~~~~kgge~~~~~~ 393 (395) T protein:vir:40 376 LGEN-EEDLKGGDINENKG 393 (395) T ss_pred cccc-ccccCCCCCCCCcC Confidence 1111 12222222222222 No 81 >protein:vir:93867 Length: 378 # NCBI annotation: putative portal protein # Family: family:all:2379 # MgeID: mge:1479 # MgeName: 712 # Cross-refs: genbank:acc:YP_764264;genbank:gi:115315577;genbank:GeneID:5141561 Probab=98.52 E-value=5.9e-09 Score=65.72 Aligned_cols=200 Identities=14% Similarity=0.116 Sum_probs=101.0 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHhccceeechhHHHhhcCCchHHHHHHHHHHH----HHhcCchheeeeecCCccee Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRRKQQAVWKVKGLAEMCDDDDAQYAARLRLAQV----DDNSGVGRAIGIDAETEEYD 76 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~~~~~v~k~~~l~~~~~~~~~e~~~~~r~~~~----~~~r~~~~~~~iD~~~e~~~ 76 (237) .++.+...+..+-.. ..--.++|+++. +... ....+++++... ....+..+++++++ +.+|. T Consensus 152 ~l~~~~~~i~~~~~~---------~~~~g~l~~~~~---l~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~-g~~~~ 217 (378) T protein:vir:93 152 ILDNALASIQTKLEQ---------GKLRGLLKINAF---LDID-NTQEYREKALTTIKNMQEGSSYNGLTPVDN-KTEIV 217 (378) T ss_pred HHHHHHHHHHHHHhc---------CcccceeeeCCc---CCHH-HHHHHHHHHHHHHHHhhcccccccceEcCC-CceEE Confidence 344443333322111 111134555431 2111 122344444322 11223345677776 57898 Q ss_pred eeecCcCCHH-HHHHHHHHHHhhhhcCceeeeeccCcccccccchhHHHHHHHHHHHHHHHhhhHHHHHHHHHhh----c Q lcl|NC_019725. 77 VLNSDISGVP-EFLSSKMDRIVSLSGIHEIIIKNKNVGGVSASQNTALETFYKLVDRKREEDYRPLLEFLLPFIV----E 151 (237) Q Consensus 77 ~~~~~lsGl~-dl~~~~~~~iaa~s~iP~t~L~G~sp~GlnatGe~D~~nyyd~I~~~Qe~~l~p~l~~l~~~i~----~ 151 (237) .++.+..... +-.......||.+-|||..+|-|. ..+....+|| ...|.|.+.++-.-+- . T Consensus 218 ~l~~~~~~~~~~~~~~~~~~Ia~~fgVPp~~l~g~-------~~e~~~~~f~-------~~tl~P~~~~ie~~l~~kLl~ 283 (378) T protein:vir:93 218 ELKKDYSVLNKDEIDLIKSELLTGYFMNENILLGT-------ATQEQQIYFY-------NSTIIPLLIQLEKELTYKLIS 283 (378) T ss_pred EccCChhhhhHHHHHHHHHHHHHHhCCCHHHhcCC-------cHHHHHHHHH-------HHHHHHHHHHHHHHHHhhcCC Confidence 8877655433 334567789999999998877431 1133334444 3457777766654432 1 Q ss_pred C------------CCceeEeCCCCCCCHHHHHHHHHHHHHHHHHHHhCCCCCHHHHHHHHHhhcccccc----CCCCCCC Q lcl|NC_019725. 152 E------------EEWSIEFEPLSVPSKKEESEITKNNVESVTKAITEQIIDLEEARDTLRSIAPEFKL----KDGNNIN 215 (237) Q Consensus 152 s------------~~~~~~f~pL~~~seke~Aei~~~~A~a~~~~~~~g~i~~~e~r~~l~~~~~~~g~----~~~~~~~ 215 (237) . .++.|+++.|...|.+++ ++++.+++++|+++++|+|+.+- ..+..|. -+..... T Consensus 284 ~~er~~~~~~~~~~~~~fd~~~l~~~d~~~~-------~~~~~~~~~~G~~t~NE~R~~~g-l~p~~ggD~~~~~~n~~~ 355 (378) T protein:vir:93 284 TNRRRVVKGNLYYERIIVDNQLFKFATLKEL-------IDLYHENINGPIFTQNQLLVKMG-EQPIEGGDVYIANLNAVA 355 (378) T ss_pred hhHhhhhhhcccccceeeccchhhhcCHHHH-------HHHHHHHHhCCCcCHHHHHHHhC-CCCCCCCCeeeecccccc Confidence 1 136677788888887766 55688899999999999999762 3333331 0111111 Q ss_pred hhccc--cCCCCCCCCCCCcCcC Q lcl|NC_019725. 216 IREPE--ETTEPEPGLGEKLEDE 236 (237) Q Consensus 216 ~~~~e--~~~e~~~~~~~~~~~e 236 (237) .++.. +.......++++.+.| T Consensus 356 ~~~~~~~~~~~~~~~~~~e~~n~ 378 (378) T protein:vir:93 356 VKNLSDLQGSRKDVTSTDETNNQ 378 (378) T ss_pred ccchhhhcCccCCCCCCCCCCCC Confidence 11111 1111222222222223 No 82 >protein:vir:4854 Length: 386 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:106 # MgeName: DT1 # Cross-refs: genbank:acc:NP_049394;genbank:gi:9632422;genbank:GeneID:1258515 Probab=98.52 E-value=6.2e-08 Score=60.12 Aligned_cols=204 Identities=13% Similarity=0.121 Sum_probs=113.8 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHhcc--ceeechhHHHhhcCCchHHHHHHHHHHHHHhcCc-hheeeeecCCcceee Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRRKQQ--AVWKVKGLAEMCDDDDAQYAARLRLAQVDDNSGV-GRAIGIDAETEEYDV 77 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~~~~--~v~k~~~l~~~~~~~~~e~~~~~r~~~~~~~r~~-~~~~~iD~~~e~~~~ 77 (237) -++.+.+.+.....+.....++...... .++++++ .+.. +....++ +.+.....+ .+.+++++ +-+|.. T Consensus 176 ~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~ii~~~~---~~~~-e~~~~~~---~~~~~~~~n~g~~~vl~~-g~~~~~ 247 (386) T protein:vir:48 176 PLMALSRELNIQKASDKLTLNSLKNALNANGILKIKG---GGLL-DFKTKLS---RSRQAMKQMQGGPLVLDD-LEEFTP 247 (386) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCC---CCCH-HHHHHHH---HHHHHhhcCCCCceecCC-CceEEE Confidence 5677777788877787787777766332 3445543 1111 1122222 223333344 45566665 578888 Q ss_pred eecCcCC--HHHHHHHHHHHHhhhhcCceeeeeccCcccccccchhHHHHHHHHHHHHHHHhhhHHHHHHHHHhhc--CC Q lcl|NC_019725. 78 LNSDISG--VPEFLSSKMDRIVSLSGIHEIIIKNKNVGGVSASQNTALETFYKLVDRKREEDYRPLLEFLLPFIVE--EE 153 (237) Q Consensus 78 ~~~~lsG--l~dl~~~~~~~iaa~s~iP~t~L~G~sp~GlnatGe~D~~nyyd~I~~~Qe~~l~p~l~~l~~~i~~--s~ 153 (237) ++.+... +-+......+.||.+-|||-..| |.+.. +++.+....+||.. .|.|.++.+-..+-+ -. T Consensus 248 l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~l-g~~~~--~~~~e~~~~~~~~~-------~l~P~~~~ie~~l~~~l~~ 317 (386) T protein:vir:48 248 LEIKSNVSQLLKQADWTTGQFAKVYGIPENVV-GGQGD--QQSSLEMSLDLYNK-------AVSRYLRPFLSELSQKLSC 317 (386) T ss_pred cCCChhHHHHHHHHHHHHHHHHHHhCCCHHHh-CCCCC--cccHHHHHHHHHHH-------HHHHHHHHHHHHHHHhhcc Confidence 8766543 34566778899999999998876 43211 22334455556543 377777766554432 13 Q ss_pred CceeEeCCCCCCCHHHHHHHHHHHHHHHHHHHhCCCCCHHHHHHHHHhhccccccCCCCCCChhccccCCCCCCCCCCCc Q lcl|NC_019725. 154 EWSIEFEPLSVPSKKEESEITKNNVESVTKAITEQIIDLEEARDTLRSIAPEFKLKDGNNINIREPEETTEPEPGLGEKL 233 (237) Q Consensus 154 ~~~~~f~pL~~~seke~Aei~~~~A~a~~~~~~~g~i~~~e~r~~l~~~~~~~g~~~~~~~~~~~~e~~~e~~~~~~~~~ 233 (237) .+.+.+.++..++...+ +..+..++.+|+++++|+|+.+-. .++.++. + .+.+....++..+|+.. T Consensus 318 ~~~~~~~~~~~~d~~~~-------~~~~~~l~~~g~~t~nE~r~~lg~----~~~~~~~-~--~~~~~~~~~~~~gGd~~ 383 (386) T protein:vir:48 318 DVDADILPAVDPTGSNS-------VSRINSMVKSGTLAQNQGLYILQQ----AEILPKE-L--PEGENPNKTTLKGGEIN 383 (386) T ss_pred hhhcchhhhhccChHHH-------HHHHHHHHhCCCcCHHHHHHHhhc----CCCCCcc-c--hhhcCCCCCccCCCCCC Confidence 44555555555554433 445677899999999999998742 2333211 1 11121112222334443 Q ss_pred CcC Q lcl|NC_019725. 234 EDE 236 (237) Q Consensus 234 ~~e 236 (237) .+| T Consensus 384 ~~~ 386 (386) T protein:vir:48 384 GED 386 (386) T ss_pred CCC Confidence 333 No 83 >protein:vir:105819 Length: 456 # NCBI annotation: gp3 # Family: family:all:5096 # MgeID: mge:1636 # MgeName: PMC # Cross-refs: genbank:acc:YP_655764;genbank:gi:109522087;genbank:GeneID:4157627 Probab=98.51 E-value=1.1e-08 Score=64.27 Aligned_cols=212 Identities=13% Similarity=0.081 Sum_probs=112.2 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHhccceeechhH--HHhhcCCchHHHHHHHHHHHHHhcCchheeeeecCCcce-ee Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRRKQQAVWKVKGL--AEMCDDDDAQYAARLRLAQVDDNSGVGRAIGIDAETEEY-DV 77 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~~~~~v~k~~~l--~~~~~~~~~e~~~~~r~~~~~~~r~~~~~~~iD~~~e~~-~~ 77 (237) -++.+.+.+.+++++......-..-+......+.|. +.-.-+..+. .+ .+...+....+ .++.++.+ .++ +. T Consensus 226 d~e~vi~liDa~~~~~s~~~~~~~~~a~~~~~i~G~~~~~~~~d~~g~-~~-~~~~~~~~~~~--~~~~~~~~-~~~~q~ 300 (456) T protein:vir:10 226 EVEPHIDIINRINRAELQLLSTMAIQAFRQRALKSTEHGLPNVDENGN-AI-DYASIFEAAPG--ALWELPPG-VDIWES 300 (456) T ss_pred hhhhhHHHHHHHHHHHHHHHHHHHHhhhHhHhhhccCccccccccccc-cc-chhhhhhhhcc--ccccCCCC-cceEEe Confidence 667788888888877765443333333222222221 1111111111 11 12222222221 23445543 454 34 Q ss_pred eecCcCCHHHHHHHHHHHHhhhhcCceeeeeccCcccccccchhHHHH---HHHHHHHHHHHhhhHHHHHHHHHhhcC-- Q lcl|NC_019725. 78 LNSDISGVPEFLSSKMDRIVSLSGIHEIIIKNKNVGGVSASQNTALET---FYKLVDRKREEDYRPLLEFLLPFIVEE-- 152 (237) Q Consensus 78 ~~~~lsGl~dl~~~~~~~iaa~s~iP~t~L~G~sp~GlnatGe~D~~n---yyd~I~~~Qe~~l~p~l~~l~~~i~~s-- 152 (237) -.+++.+..+.+......+++.+++|...|-|.+ + |.||++=..- ....++.+| ..+.+.|++++.+++.- T Consensus 301 ~~~~~~~~~~~l~~~i~~~~~~s~~p~~~~~~~~-~--N~Sg~Ai~~~~~~l~~k~~~~~-~~f~~~l~~~~rl~~~~~g 376 (456) T protein:vir:10 301 QANDFTPMLSAIKEHIRQLSSATKTPLPMLMPDS-A--NQSAEGAHNIEKGFLFKCEDRL-SIAKIGLEAILVKALQIEG 376 (456) T ss_pred cccChhHHHHHHHHHHHHHHhccCCChHHhcccc-c--ChHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHhcC Confidence 4667889999999999999999999998887765 2 4466543333 333344444 56889999999887632 Q ss_pred ----CCceeEeCCCCCCCHHHHHHHHHHHHHHHHHHHhCCCCCHHHHHHHHHhhccccccCCCCCCChhccccCCC---- Q lcl|NC_019725. 153 ----EEWSIEFEPLSVPSKKEESEITKNNVESVTKAITEQIIDLEEARDTLRSIAPEFKLKDGNNINIREPEETTE---- 224 (237) Q Consensus 153 ----~~~~~~f~pL~~~seke~Aei~~~~A~a~~~~~~~g~i~~~e~r~~l~~~~~~~g~~~~~~~~~~~~e~~~e---- 224 (237) .++++.|.|...+|..+.| +++.+++++|+++...+++.| |+.+ ..+...+.+...+ T Consensus 377 ~~~~~~~~v~w~~~~~~~~~~~a-------da~~kl~~~gi~~~~~~~~~l-------g~~~-~~i~~~e~er~~~e~~~ 441 (456) T protein:vir:10 377 ESVEDTVDVSFESPDRVTLGEKY-------SAASLAKAAGESWASIRRNIL-------NYNA-DQIKQDDLDRAREQITL 441 (456) T ss_pred CCcccceeEEecCCCCcCHHHHH-------HHHHHHHHcCCChHHHHHhhC-------CCCH-HHHHHHHHHHHHHHHHH Confidence 3688999999999987765 555666677877665444322 2211 1111111110000 Q ss_pred --CCCCCCCCcCcCC Q lcl|NC_019725. 225 --PEPGLGEKLEDEN 237 (237) Q Consensus 225 --~~~~~~~~~~~e~ 237 (237) ..+. ..+.+..+ T Consensus 442 ~~~~~~-~~~~~~~~ 455 (456) T protein:vir:10 442 FAGNPV-QRPQEDGS 455 (456) T ss_pred Hhhhhh-hcCCCCCC Confidence 0000 00000001 No 84 >protein:vir:102602 Length: 456 # NCBI annotation: gp3 # Family: family:all:5096 # MgeID: mge:1661 # MgeName: Llij # Cross-refs: genbank:acc:YP_654999;genbank:gi:109392189;genbank:GeneID:4157224 Probab=98.51 E-value=1.1e-08 Score=64.27 Aligned_cols=212 Identities=13% Similarity=0.081 Sum_probs=112.2 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHhccceeechhH--HHhhcCCchHHHHHHHHHHHHHhcCchheeeeecCCcce-ee Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRRKQQAVWKVKGL--AEMCDDDDAQYAARLRLAQVDDNSGVGRAIGIDAETEEY-DV 77 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~~~~~v~k~~~l--~~~~~~~~~e~~~~~r~~~~~~~r~~~~~~~iD~~~e~~-~~ 77 (237) -++.+.+.+.+++++......-..-+......+.|. +.-.-+..+. .+ .+...+....+ .++.++.+ .++ +. T Consensus 226 d~e~vi~liDa~~~~~s~~~~~~~~~a~~~~~i~G~~~~~~~~d~~g~-~~-~~~~~~~~~~~--~~~~~~~~-~~~~q~ 300 (456) T protein:vir:10 226 EVEPHIDIINRINRAELQLLSTMAIQAFRQRALKSTEHGLPNVDENGN-AI-DYASIFEAAPG--ALWELPPG-VDIWES 300 (456) T ss_pred hhhhhHHHHHHHHHHHHHHHHHHHHhhhHhHhhhccCccccccccccc-cc-chhhhhhhhcc--ccccCCCC-cceEEe Confidence 667788888888877765443333333222222221 1111111111 11 12222222221 23445543 454 34 Q ss_pred eecCcCCHHHHHHHHHHHHhhhhcCceeeeeccCcccccccchhHHHH---HHHHHHHHHHHhhhHHHHHHHHHhhcC-- Q lcl|NC_019725. 78 LNSDISGVPEFLSSKMDRIVSLSGIHEIIIKNKNVGGVSASQNTALET---FYKLVDRKREEDYRPLLEFLLPFIVEE-- 152 (237) Q Consensus 78 ~~~~lsGl~dl~~~~~~~iaa~s~iP~t~L~G~sp~GlnatGe~D~~n---yyd~I~~~Qe~~l~p~l~~l~~~i~~s-- 152 (237) -.+++.+..+.+......+++.+++|...|-|.+ + |.||++=..- ....++.+| ..+.+.|++++.+++.- T Consensus 301 ~~~~~~~~~~~l~~~i~~~~~~s~~p~~~~~~~~-~--N~Sg~Ai~~~~~~l~~k~~~~~-~~f~~~l~~~~rl~~~~~g 376 (456) T protein:vir:10 301 QANDFTPMLSAIKEHIRQLSSATKTPLPMLMPDS-A--NQSAEGAHNIEKGFLFKCEDRL-SIAKIGLEAILVKALQIEG 376 (456) T ss_pred cccChhHHHHHHHHHHHHHHhccCCChHHhcccc-c--ChHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHhcC Confidence 4667889999999999999999999998887765 2 4466543333 333344444 56889999999887632 Q ss_pred ----CCceeEeCCCCCCCHHHHHHHHHHHHHHHHHHHhCCCCCHHHHHHHHHhhccccccCCCCCCChhccccCCC---- Q lcl|NC_019725. 153 ----EEWSIEFEPLSVPSKKEESEITKNNVESVTKAITEQIIDLEEARDTLRSIAPEFKLKDGNNINIREPEETTE---- 224 (237) Q Consensus 153 ----~~~~~~f~pL~~~seke~Aei~~~~A~a~~~~~~~g~i~~~e~r~~l~~~~~~~g~~~~~~~~~~~~e~~~e---- 224 (237) .++++.|.|...+|..+.| +++.+++++|+++...+++.| |+.+ ..+...+.+...+ T Consensus 377 ~~~~~~~~v~w~~~~~~~~~~~a-------da~~kl~~~gi~~~~~~~~~l-------g~~~-~~i~~~e~er~~~e~~~ 441 (456) T protein:vir:10 377 ESVEDTVDVSFESPDRVTLGEKY-------SAASLAKAAGESWASIRRNIL-------NYNA-DQIKQDDLDRAREQITL 441 (456) T ss_pred CCcccceeEEecCCCCcCHHHHH-------HHHHHHHHcCCChHHHHHhhC-------CCCH-HHHHHHHHHHHHHHHHH Confidence 3688999999999987765 555666677877665444322 2211 1111111110000 Q ss_pred --CCCCCCCCcCcCC Q lcl|NC_019725. 225 --PEPGLGEKLEDEN 237 (237) Q Consensus 225 --~~~~~~~~~~~e~ 237 (237) ..+. ..+.+..+ T Consensus 442 ~~~~~~-~~~~~~~~ 455 (456) T protein:vir:10 442 FAGNPV-QRPQEDGS 455 (456) T ss_pred Hhhhhh-hcCCCCCC Confidence 0000 00000001 No 85 >protein:vir:3989 Length: 392 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:319 # MgeName: BK5-T # Cross-refs: genbank:acc:NP_116497;genbank:gi:14251130;genbank:GeneID:921299 Probab=98.51 E-value=7.2e-08 Score=59.74 Aligned_cols=205 Identities=15% Similarity=0.106 Sum_probs=112.5 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHhcc--ceeechhHHHhhcCCchHHHHHHHHHHHHHhcCchheeeeecCCcceeee Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRRKQQ--AVWKVKGLAEMCDDDDAQYAARLRLAQVDDNSGVGRAIGIDAETEEYDVL 78 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~~~~--~v~k~~~l~~~~~~~~~e~~~~~r~~~~~~~r~~~~~~~iD~~~e~~~~~ 78 (237) -++.+.+.|.....+............. .++++++- .... +.+..+..+.+....+..+.++++. +-+|+.+ T Consensus 182 ~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~--~~~~---~~~~~~~~~~~~~~~~~g~~~vl~~-g~~~~~l 255 (392) T protein:vir:39 182 PLYSLRRESKIQRASDRLTISSLNSSLNVPGVLTVKGG--GLLS---DKDKASRSRSFMKRSRSGGPVVLDD-LEEFTAL 255 (392) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCC--CCch---HHHHHHHHHHHhccccCCCeeecCC-CceEEEc Confidence 5677777787777777777777666443 35666531 1111 1112222222223333345566665 5788888 Q ss_pred ecCcC--CHHHHHHHHHHHHhhhhcCceeeeeccCcccccccchhHHHHHHHHHHHHHHHhhhHHHHHHHHHhhc--CCC Q lcl|NC_019725. 79 NSDIS--GVPEFLSSKMDRIVSLSGIHEIIIKNKNVGGVSASQNTALETFYKLVDRKREEDYRPLLEFLLPFIVE--EEE 154 (237) Q Consensus 79 ~~~ls--Gl~dl~~~~~~~iaa~s~iP~t~L~G~sp~GlnatGe~D~~nyyd~I~~~Qe~~l~p~l~~l~~~i~~--s~~ 154 (237) +.+.. .+-+........||.+-|||-..|-+.+.+. +.....+.||. ..|.|.++++-.-+-+ -.+ T Consensus 256 ~~~~~d~~~~e~~~~~~~~Ia~~fgVpp~~lg~~~~~~---~~~~~~~~f~~-------~~l~P~~~~ie~~l~~~L~~~ 325 (392) T protein:vir:39 256 EIKSNVAQLLSQTDWTSKQYAKVYGLPDSYIGGQGDQQ---SSIQQISGMYA-------SALNRYLRPAISELEYKLSDH 325 (392) T ss_pred cCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCcc---cHHHHHHHHHH-------HHHHHHHHHHHHHHHHhcccc Confidence 76543 3445566777899999999977773322111 11223344443 4477777766554432 245 Q ss_pred ceeEeCCCCCCCHHHHHHHHHHHHHHHHHHHhCCCCCHHHHHHHHHhhccccccCCCCCCChhccccCCCCCCCCCCCcC Q lcl|NC_019725. 155 WSIEFEPLSVPSKKEESEITKNNVESVTKAITEQIIDLEEARDTLRSIAPEFKLKDGNNINIREPEETTEPEPGLGEKLE 234 (237) Q Consensus 155 ~~~~f~pL~~~seke~Aei~~~~A~a~~~~~~~g~i~~~e~r~~l~~~~~~~g~~~~~~~~~~~~e~~~e~~~~~~~~~~ 234 (237) +.+...++...+.+++ ++.+..++.+|+++++|+|+-+.. .|+.+.. + .+.+..+..++|.+.+.- T Consensus 326 ~~~d~~~~~~~d~~~~-------~~~~~~l~~~g~~t~nE~r~~l~~----~g~~p~e-~--r~~e~l~~~~~Gd~~~p~ 391 (392) T protein:vir:39 326 ISVNMRPAIDPLGDNY-------LSTISTATRWGALAENQATFVLQE----AGYIPKD-L--PAPENTNKKTTGQSNEPV 391 (392) T ss_pred ccccchhhhccCHHHH-------HHHHHHHHhCCCcCHHHHHHHHHh----cCCCccc-c--chhcCCCCCCCCCCCCCC Confidence 6666777777666554 456778899999999999998753 3444321 1 111111111111111111 Q ss_pred c Q lcl|NC_019725. 235 D 235 (237) Q Consensus 235 ~ 235 (237) + T Consensus 392 p 392 (392) T protein:vir:39 392 P 392 (392) T ss_pred C Confidence 1 No 86 >protein:vir:1023 Length: 392 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:20 # MgeName: bIL286 # Cross-refs: genbank:acc:NP_076677;genbank:gi:13095786;genbank:GeneID:920364 Probab=98.51 E-value=7.2e-08 Score=59.74 Aligned_cols=205 Identities=15% Similarity=0.106 Sum_probs=112.5 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHhcc--ceeechhHHHhhcCCchHHHHHHHHHHHHHhcCchheeeeecCCcceeee Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRRKQQ--AVWKVKGLAEMCDDDDAQYAARLRLAQVDDNSGVGRAIGIDAETEEYDVL 78 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~~~~--~v~k~~~l~~~~~~~~~e~~~~~r~~~~~~~r~~~~~~~iD~~~e~~~~~ 78 (237) -++.+.+.|.....+............. .++++++- .... +.+..+..+.+....+..+.++++. +-+|+.+ T Consensus 182 ~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~--~~~~---~~~~~~~~~~~~~~~~~g~~~vl~~-g~~~~~l 255 (392) T protein:vir:10 182 PLYSLRRESKIQRASDRLTISSLNSSLNVPGVLTVKGG--GLLS---DKDKASRSRSFMKRSRSGGPVVLDD-LEEFTAL 255 (392) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCC--CCch---HHHHHHHHHHHhccccCCCeeecCC-CceEEEc Confidence 5677777787777777777777666443 35666531 1111 1112222222223333345566665 5788888 Q ss_pred ecCcC--CHHHHHHHHHHHHhhhhcCceeeeeccCcccccccchhHHHHHHHHHHHHHHHhhhHHHHHHHHHhhc--CCC Q lcl|NC_019725. 79 NSDIS--GVPEFLSSKMDRIVSLSGIHEIIIKNKNVGGVSASQNTALETFYKLVDRKREEDYRPLLEFLLPFIVE--EEE 154 (237) Q Consensus 79 ~~~ls--Gl~dl~~~~~~~iaa~s~iP~t~L~G~sp~GlnatGe~D~~nyyd~I~~~Qe~~l~p~l~~l~~~i~~--s~~ 154 (237) +.+.. .+-+........||.+-|||-..|-+.+.+. +.....+.||. ..|.|.++++-.-+-+ -.+ T Consensus 256 ~~~~~d~~~~e~~~~~~~~Ia~~fgVpp~~lg~~~~~~---~~~~~~~~f~~-------~~l~P~~~~ie~~l~~~L~~~ 325 (392) T protein:vir:10 256 EIKSNVAQLLSQTDWTSKQYAKVYGLPDSYIGGQGDQQ---SSIQQISGMYA-------SALNRYLRPAISELEYKLSDH 325 (392) T ss_pred cCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCcc---cHHHHHHHHHH-------HHHHHHHHHHHHHHHHhcccc Confidence 76543 3445566777899999999977773322111 11223344443 4477777766554432 245 Q ss_pred ceeEeCCCCCCCHHHHHHHHHHHHHHHHHHHhCCCCCHHHHHHHHHhhccccccCCCCCCChhccccCCCCCCCCCCCcC Q lcl|NC_019725. 155 WSIEFEPLSVPSKKEESEITKNNVESVTKAITEQIIDLEEARDTLRSIAPEFKLKDGNNINIREPEETTEPEPGLGEKLE 234 (237) Q Consensus 155 ~~~~f~pL~~~seke~Aei~~~~A~a~~~~~~~g~i~~~e~r~~l~~~~~~~g~~~~~~~~~~~~e~~~e~~~~~~~~~~ 234 (237) +.+...++...+.+++ ++.+..++.+|+++++|+|+-+.. .|+.+.. + .+.+..+..++|.+.+.- T Consensus 326 ~~~d~~~~~~~d~~~~-------~~~~~~l~~~g~~t~nE~r~~l~~----~g~~p~e-~--r~~e~l~~~~~Gd~~~p~ 391 (392) T protein:vir:10 326 ISVNMRPAIDPLGDNY-------LSTISTATRWGALAENQATFVLQE----AGYIPKD-L--PAPENTNKKTTGQSNEPV 391 (392) T ss_pred ccccchhhhccCHHHH-------HHHHHHHHhCCCcCHHHHHHHHHh----cCCCccc-c--chhcCCCCCCCCCCCCCC Confidence 6666777777666554 456778899999999999998753 3444321 1 111111111111111111 Q ss_pred c Q lcl|NC_019725. 235 D 235 (237) Q Consensus 235 ~ 235 (237) + T Consensus 392 p 392 (392) T protein:vir:10 392 P 392 (392) T ss_pred C Confidence 1 No 87 >protein:vir:9702 Length: 406 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:174 # MgeName: 315.2 # Cross-refs: genbank:acc:NP_795464;genbank:gi:28876227;genbank:GeneID:1257772 Probab=98.50 E-value=3.1e-08 Score=61.72 Aligned_cols=213 Identities=18% Similarity=0.177 Sum_probs=108.4 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHh-ccceeechhHHHhhcCCchHHHHHHHHHHHHHhcCchheeeeecCCcceeeee Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRRK-QQAVWKVKGLAEMCDDDDAQYAARLRLAQVDDNSGVGRAIGIDAETEEYDVLN 79 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~~-~~~v~k~~~l~~~~~~~~~e~~~~~r~~~~~~~r~~~~~~~iD~~~e~~~~~~ 79 (237) -++.+.+.|.....+........... .-.++.+.+ ..+ +++....+++++.-.....+..+.+++++ +.+|+.++ T Consensus 173 pi~~~~~~i~~~~a~~~~~~~~f~ng~~~~~i~~~~--~~l-~~e~~~~~~~~~~~~~~g~n~g~~~vl~~-g~~~~~l~ 248 (406) T protein:vir:97 173 PLLSLGDEIDLQTGGINTLIKFFKDGFSSGILTMKG--AQL-SGDARQRARQEFEKMREGSVGGSPLVFDS-TMEYTPLE 248 (406) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEecC--CCC-CHHHHHHHHHHHHHHhcccccCceeecCC-CceEEEcc Confidence 45666677776666666666654332 122222211 112 22233456666665444333344566665 57888876 Q ss_pred cCcCCHH--HHHHHHHHHHhhhhcCceeeeeccCcccccccchhHHHHHHHHHHHHHHHhhhHHHHHHHHHhh----cC- Q lcl|NC_019725. 80 SDISGVP--EFLSSKMDRIVSLSGIHEIIIKNKNVGGVSASQNTALETFYKLVDRKREEDYRPLLEFLLPFIV----EE- 152 (237) Q Consensus 80 ~~lsGl~--dl~~~~~~~iaa~s~iP~t~L~G~sp~GlnatGe~D~~nyyd~I~~~Qe~~l~p~l~~l~~~i~----~s- 152 (237) .+...+. +........||.+-|||-..|.+++ .+ + +-+.-.++||. ..|.|.+.+|-..+- .. T Consensus 249 ~~~~d~q~le~~~~~~~~Ia~afgVPp~~lg~~~-~~-~-~~e~~~~~f~~-------~~l~P~~~~ie~~l~~kll~~~ 318 (406) T protein:vir:97 249 IDTNVLQLITSNNFSTAQIAKALRVPSYKLGVNS-PN-Q-SVAQLMEDYVT-------NDLPFYFDAITSELGLKTLNDK 318 (406) T ss_pred CCHHHHHHHHHHHhhHHHHHHHhCCCHHHcCCCC-Cc-c-hHHHHHHHHHH-------HHHHHHHHHHHHHHhhhhcChh Confidence 5432211 2334457889999999999995432 11 1 12223344443 447777776654432 11 Q ss_pred --CCceeEeCCCCCCCHHHHHHHHHHHHHHHHHHHhCCCCCHHHHHHHHHhhcccccc------CCCCCCChhcccc--- Q lcl|NC_019725. 153 --EEWSIEFEPLSVPSKKEESEITKNNVESVTKAITEQIIDLEEARDTLRSIAPEFKL------KDGNNINIREPEE--- 221 (237) Q Consensus 153 --~~~~~~f~pL~~~seke~Aei~~~~A~a~~~~~~~g~i~~~e~r~~l~~~~~~~g~------~~~~~~~~~~~e~--- 221 (237) ..+.|+|+- .. ..+.+++++..++++|+++++|+|+.+- ..+..+- .+..-+..+..++ T Consensus 319 ~~~~~~i~fd~-~~--------~~~~~~~~~~~~~~~g~~T~NE~R~~~g-~~p~~~~~gD~~~~~~n~~~~~~~~~~~~ 388 (406) T protein:vir:97 319 DRRLYHIEFDT-RS--------VTGRNVDEIVKLVNNQILTPNQGLVELG-KQKSTDPNMDRYQSSLNYVFLDKKEEYQD 388 (406) T ss_pred hccceeEEEec-Cc--------cchhhHHHHHHHHhCCCcCHHHHHHHhC-CCCCCCCCCCeEeeccCccchhccccccc Confidence 345677752 11 2334567778899999999999999873 2221111 1111111111111 Q ss_pred --CCCCCCCCCCCcCcCC Q lcl|NC_019725. 222 --TTEPEPGLGEKLEDEN 237 (237) Q Consensus 222 --~~e~~~~~~~~~~~e~ 237 (237) .....+|.+.+.+++| T Consensus 389 ~~~~~~~gg~~~~~~~~~ 406 (406) T protein:vir:97 389 KVGIKGKGGEVNAEEDKS 406 (406) T ss_pred ccccccCCCCCCCCCCCC Confidence 1112233333444444 No 88 >protein:vir:99072 Length: 479 # NCBI annotation: gp27 # Family: family:all:524 # MgeID: mge:1671 # MgeName: Wildcat # Cross-refs: genbank:acc:YP_655892;genbank:gi:109521464;genbank:GeneID:4158037 Probab=98.49 E-value=9.4e-09 Score=64.59 Aligned_cols=217 Identities=10% Similarity=0.139 Sum_probs=112.1 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHhccceeechhHHHhhcCCchHHHHHHHHHHHHHhcCchheeeeecCCcceeee-e Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRRKQQAVWKVKGLAEMCDDDDAQYAARLRLAQVDDNSGVGRAIGIDAETEEYDVL-N 79 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~~~~~v~k~~~l~~~~~~~~~e~~~~~r~~~~~~~r~~~~~~~iD~~~e~~~~~-~ 79 (237) -++.+.+.+.+++++......-+..+....+.+.|.. +-....... ... .....++++..+++-++-++ . T Consensus 224 d~e~v~~liDa~~~~~s~~~~~~~~~a~p~~~i~G~~-~~~~~~~~~-~~~-------~~~~~~i~~~~~~~~~~~q~~~ 294 (479) T protein:vir:99 224 DVEPLVTVAKAIDKTGLDILLVQHHQSFQIRWATGLM-LPEGANADQ-EKM-------RFAQESMLISQNEKASFGAIPA 294 (479) T ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHhhchhhhhcCCC-cccccccch-hcc-------ccccccceeecCCCceEEEecc Confidence 4567888888888887766665555555444444421 111111110 000 11223445554444445444 4 Q ss_pred cCcCCHHHHHHHHHHHHhhhhcCceeeeeccCcccccccchhHHHHHHHHHHHH--HHHhhhHHHHHHHHHhhcC----- Q lcl|NC_019725. 80 SDISGVPEFLSSKMDRIVSLSGIHEIIIKNKNVGGVSASQNTALETFYKLVDRK--REEDYRPLLEFLLPFIVEE----- 152 (237) Q Consensus 80 ~~lsGl~dl~~~~~~~iaa~s~iP~t~L~G~sp~GlnatGe~D~~nyyd~I~~~--Qe~~l~p~l~~l~~~i~~s----- 152 (237) .++.+..+.+.....++|+.++||.. .||.+ + |+||+.=...|...+... .+..+++.|++++.+++.- T Consensus 295 ~~~~~~~~~l~~~i~~i~~~t~~p~~-~~g~~-~--n~Sg~Al~~~~~~l~~ka~~~~~~f~~al~~~~~l~~~~~~~~~ 370 (479) T protein:vir:99 295 APLDGLLNAYKESLLEFLALAQLPPH-IAGQI-V--NVAADALAAGTRQTMQKLFEKQATWKASHNQTMRLVNKIEGRTE 370 (479) T ss_pred cchHHHHHHHHHHHHHHhccCCCCHH-Hcccc-c--chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCc Confidence 56777888899999999999999975 67854 2 567876666665555333 3356889999999887531 Q ss_pred ----CCceeEeCCCCCCCHHHHHHHHHHHHHHHHHHHhCCCCCHHHHHHHHH--------------hhccccc-----cC Q lcl|NC_019725. 153 ----EEWSIEFEPLSVPSKKEESEITKNNVESVTKAITEQIIDLEEARDTLR--------------SIAPEFK-----LK 209 (237) Q Consensus 153 ----~~~~~~f~pL~~~seke~Aei~~~~A~a~~~~~~~g~i~~~e~r~~l~--------------~~~~~~g-----~~ 209 (237) -++.+.|.+...+|..+.|+.. .+++++|+++.+.+.+.|- ......+ +. T Consensus 371 ~~~~~~i~~~w~~~~~~s~~~~ad~~-------~kl~~ag~is~et~l~~l~gv~~~~~e~~~~~~~~~~~~~~~~~~~~ 443 (479) T protein:vir:99 371 EATDLDFTITWQDVTIQSLAQFADAW-------AKMVESLKIPAEGVWDMIPNLDQSTVNGWKEIYDREGDFGKYMRKLQ 443 (479) T ss_pred cccceeeeEEecCCCCCCHHHHHHHH-------HHHHhcCCCCHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHHHHHHh Confidence 1577889998889988766544 4455555555554443321 0000000 00 Q ss_pred CCCCCCh--hccccCCCCCCCCCCCcCcCC Q lcl|NC_019725. 210 DGNNINI--REPEETTEPEPGLGEKLEDEN 237 (237) Q Consensus 210 ~~~~~~~--~~~e~~~e~~~~~~~~~~~e~ 237 (237) .+.+... ...+.+.+.++..+.+-++.. T Consensus 444 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 473 (479) T protein:vir:99 444 NGPDPAEQRGGPNGATNMQQANNKTGEPAS 473 (479) T ss_pred cccCcccccCCCCCCCCCCCCCCCCcchhc Confidence 0000000 000000000000000001111 No 89 >protein:vir:4156 Length: 542 # NCBI annotation: portal protein # Family: family:all:1379 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:87 # MgeName: psiM2 # Cross-refs: genbank:acc:NP_046965;genbank:gi:9630535;genbank:GeneID:1261709 Probab=98.49 E-value=1.3e-07 Score=58.30 Aligned_cols=215 Identities=13% Similarity=0.146 Sum_probs=99.1 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHhcc--ceeechhHH-H-hhc----CCchHHHHHHHHHH-HHHhcCchheeee-e- Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRRKQQ--AVWKVKGLA-E-MCD----DDDAQYAARLRLAQ-VDDNSGVGRAIGI-D- 69 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~~~~--~v~k~~~l~-~-~~~----~~~~e~~~~~r~~~-~~~~r~~~~~~~i-D- 69 (237) -++.+...|.....+......+...... .++++++.- + ... ..+....+++.++- +...++|.+..++ . T Consensus 195 pi~~~~~~i~~~~~~~~~~~~~f~Ng~~p~gIL~~~~~l~de~~~~~~~~~e~~~~lk~~~~~~~~g~~~n~gk~~vL~~ 274 (542) T protein:vir:41 195 RYVSAAPAILAMQKIDEYNYAFFDNYTIPSYVITVTGEFEDELEEDPDGNPTGRTVIQALIEDNFKHLKEAPHTPLVFSI 274 (542) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEeCCccccccccccccCHHHHHHHHHHHHHHHhhhhcccCceeEeec Confidence 3444445555444444444444333222 255655321 0 111 11112233333322 2233445554444 2 Q ss_pred --cCCcceeeeecCcCCHHH----HHHHHHHHHhhhhcCceeeeeccCcc-ccccc-chhHHHHHHHHHHHHHHHhhhHH Q lcl|NC_019725. 70 --AETEEYDVLNSDISGVPE----FLSSKMDRIVSLSGIHEIIIKNKNVG-GVSAS-QNTALETFYKLVDRKREEDYRPL 141 (237) Q Consensus 70 --~~~e~~~~~~~~lsGl~d----l~~~~~~~iaa~s~iP~t~L~G~sp~-Glnat-Ge~D~~nyyd~I~~~Qe~~l~p~ 141 (237) +..+.++....+.+..+. ......+.||++-|||..+| |...+ .+|.+ -|.....||. ..|.|. T Consensus 275 ~~~~~~g~~~~pl~~~~~d~qfle~~~~~~~~Ia~afgVPp~~l-G~~~~~t~n~sn~Eq~~~~f~~-------~tL~P~ 346 (542) T protein:vir:41 275 PGGDTVKVTFTPLNTSQKELSFREYAAEKKYDIAAAHMIDPYRL-GIADTGPLGGNFAEVTRRTYYE-------SVVRPQ 346 (542) T ss_pred cCCcccceeEEEcCCChhHHHHHHHHHHHHHHHHHHhCCCHHHh-CcCCCcccccccHHHHHHHHHH-------HHHHHH Confidence 112445555555544333 33556788999999998866 65544 35533 3445555554 445666 Q ss_pred HHHHHHHhh----c--CCCceeEeCCCCCCCHHHHHHHHHHHHHHHHHHHhCCCCCHHHHHHHHHhhccccccCCCCCC- Q lcl|NC_019725. 142 LEFLLPFIV----E--EEEWSIEFEPLSVPSKKEESEITKNNVESVTKAITEQIIDLEEARDTLRSIAPEFKLKDGNNI- 214 (237) Q Consensus 142 l~~l~~~i~----~--s~~~~~~f~pL~~~seke~Aei~~~~A~a~~~~~~~g~i~~~e~r~~l~~~~~~~g~~~~~~~- 214 (237) +.++-..|- . ..++.|+|++..-+... +++.+..++++|+++++|+|+.|... .|+.++ T Consensus 347 ~~~ie~~ln~~L~~~~~~~~~~~f~~~~ll~~d--------~~~~~~~~v~~GilT~NE~Re~L~g~------~pgdd~~ 412 (542) T protein:vir:41 347 QNIISSILTDFFQVKFNPKTRFKFNDETLLESD--------SVRNCALLVQSGVLTPAEARERLFGL------DGGPDIF 412 (542) T ss_pred HHHHHHHHHhhcccccCCceEEEecchhhcchH--------HHHHHHHHHhCCCCCHHHHHHhhCCC------CCCCccc Confidence 665544432 1 13678888765443321 22345668999999999999766321 111111 Q ss_pred ------Ch-------hccccC---------CCCCCCCCCCcCcCC Q lcl|NC_019725. 215 ------NI-------REPEET---------TEPEPGLGEKLEDEN 237 (237) Q Consensus 215 ------~~-------~~~e~~---------~e~~~~~~~~~~~e~ 237 (237) .. ...+.. ....|...+..++.+ T Consensus 413 l~p~~~~~~~~~~~~~n~~~~~~~~~~k~~~k~~~~~~~~~~~~~ 457 (542) T protein:vir:41 413 MVPSKGAAKSVKRQERNYEKNQIREIRKIYAKYRPRFNEIISSKL 457 (542) T ss_pred cccccccccccccCCcCCCCCchhhhhhcccccCccccccccccc Confidence 00 000000 001111222222221 No 90 >protein:vir:7407 Length: 392 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:146 # MgeName: P335 # Cross-refs: genbank:acc:NP_839924;genbank:gi:30089894;genbank:GeneID:1260681 Probab=98.47 E-value=1.1e-07 Score=58.76 Aligned_cols=205 Identities=14% Similarity=0.081 Sum_probs=112.8 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHhcc--ceeechhHHHhhcCCchHHHHHHHHHHHHHhcCchheeeeecCCcceeee Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRRKQQ--AVWKVKGLAEMCDDDDAQYAARLRLAQVDDNSGVGRAIGIDAETEEYDVL 78 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~~~~--~v~k~~~l~~~~~~~~~e~~~~~r~~~~~~~r~~~~~~~iD~~~e~~~~~ 78 (237) .++.+.+.|.....+......+...... .++++++- .... +....++.+.+....+..+.+++++ +-+|+.+ T Consensus 182 ~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~~il~~~~~--~~~~---~~~~~~~~~~~~~~~n~g~~~vl~~-g~~~~~l 255 (392) T protein:vir:74 182 PLYSLRRESKIQRASDRLTISSLNSSLNVPGVLTVKGG--GLLS---DKDKASRSRSFMKRSRSGGPVVLDD-LEEFTAL 255 (392) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCC--CCch---HHHHHHHHHHHhccccCCCeeecCC-CceEEEc Confidence 5678888888888787777776666443 35666531 1111 1112222222322333344566664 5788888 Q ss_pred ecCcC--CHHHHHHHHHHHHhhhhcCceeeeeccCcccccccchhHHHHHHHHHHHHHHHhhhHHHHHHHHHhhc--CCC Q lcl|NC_019725. 79 NSDIS--GVPEFLSSKMDRIVSLSGIHEIIIKNKNVGGVSASQNTALETFYKLVDRKREEDYRPLLEFLLPFIVE--EEE 154 (237) Q Consensus 79 ~~~ls--Gl~dl~~~~~~~iaa~s~iP~t~L~G~sp~GlnatGe~D~~nyyd~I~~~Qe~~l~p~l~~l~~~i~~--s~~ 154 (237) +.+.. .+-+........||.+-|||-..|-+.+.+ + +.....+.| -+..|.|.+.++-.-+-+ -.+ T Consensus 256 ~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~--~-~~~e~~~~~-------~~~~l~p~~~~ie~~l~~~l~~~ 325 (392) T protein:vir:74 256 EIKSNVAQLLSQTDWTSKQYAKVYGLPDSYIGGQGDQ--Q-SSIQQISGM-------YASALNRYLRPAISELEYKLSDH 325 (392) T ss_pred cCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCc--c-cHHHHHHHH-------HHHHHHHHHHHHHHHHHHhccch Confidence 76532 344566777889999999998776332211 1 111223333 345577777776555432 245 Q ss_pred ceeEeCCCCCCCHHHHHHHHHHHHHHHHHHHhCCCCCHHHHHHHHHhhccccccCCCCCCChhccccCCCCCCCCCCCcC Q lcl|NC_019725. 155 WSIEFEPLSVPSKKEESEITKNNVESVTKAITEQIIDLEEARDTLRSIAPEFKLKDGNNINIREPEETTEPEPGLGEKLE 234 (237) Q Consensus 155 ~~~~f~pL~~~seke~Aei~~~~A~a~~~~~~~g~i~~~e~r~~l~~~~~~~g~~~~~~~~~~~~e~~~e~~~~~~~~~~ 234 (237) +.+.+.++...+.+++ ++.+..++.+|+++++|+|+-+.. .|+.+.... ..+..+..++|-+.+.- T Consensus 326 ~~~~~~~~~~~d~~~~-------~~~~~~l~~~g~~t~near~~~~~----~g~~pne~r---~~enl~~~~~Gd~~~p~ 391 (392) T protein:vir:74 326 ISVNMRPAIDPLGDNY-------LSTISTATRWGALAENQATFVLQE----AGYIPKDLP---APENTNKKTTGQSNEPV 391 (392) T ss_pred hcccchhhhcCCHHHH-------HHHHHHHHhCCCcCHHHHHHHHHh----CCCCccccc---hhcCCCCCCCCCCCCCC Confidence 6677777777776554 556788899999999999998753 344432111 11111111111000000 Q ss_pred c Q lcl|NC_019725. 235 D 235 (237) Q Consensus 235 ~ 235 (237) + T Consensus 392 p 392 (392) T protein:vir:74 392 P 392 (392) T ss_pred C Confidence 0 No 91 >protein:vir:3153 Length: 467 # NCBI annotation: capsid protein # Family: family:all:1379 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:316 # MgeName: PhiCh1 # Cross-refs: genbank:acc:NP_665924;genbank:gi:22091110;genbank:GeneID:951257 Probab=98.43 E-value=9.6e-08 Score=59.05 Aligned_cols=218 Identities=11% Similarity=0.088 Sum_probs=101.6 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHhcc--ceeechhHHHhhcCCchHHHHHHHHHHHHH-----------hcCchheee Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRRKQQ--AVWKVKGLAEMCDDDDAQYAARLRLAQVDD-----------NSGVGRAIG 67 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~~~~--~v~k~~~l~~~~~~~~~e~~~~~r~~~~~~-----------~r~~~~~~~ 67 (237) -++.+.+.|.....+......+...... .++++.+ ..+ +++....+++++.-... .-.|.+..+ T Consensus 186 ~~~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~--~~l-~~e~~~~~~~~~~~~~~~~~~~~~~~~~g~~n~~~~~ 262 (467) T protein:vir:31 186 DIIPAVKTIRGDSAAQDYNIDFFENDGVPRIAIIVKG--AEL-TEKGREEMRNLIEDNNEDNHRTAFIETEKIVQNEDYL 262 (467) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEecC--cCC-CHHHHHHHHHHHHhhhcchhhhhhhhhcccccccccc Confidence 3455566666666666555555443322 2444432 112 22222334443321100 011222222 Q ss_pred eecCCcceeeeecCcCC----------HHHHHHHHHHHHhhhhcCceeeeeccCccc-ccccchhHHHHHHHHHHHHHHH Q lcl|NC_019725. 68 IDAETEEYDVLNSDISG----------VPEFLSSKMDRIVSLSGIHEIIIKNKNVGG-VSASQNTALETFYKLVDRKREE 136 (237) Q Consensus 68 iD~~~e~~~~~~~~lsG----------l~dl~~~~~~~iaa~s~iP~t~L~G~sp~G-lnatGe~D~~nyyd~I~~~Qe~ 136 (237) +-..+.++......|.. +.+........||++-|||-..| |..-+| .+++-+.-...||.. T Consensus 263 ~l~~g~~~~~~~~~~~~ls~~~~~d~qf~e~~~~~~~~Ia~~fgVpp~~l-G~~~~~~~~s~~e~~~~~f~~~------- 334 (467) T protein:vir:31 263 NLADGADRSDVEIRLEPLTVGIDEEASFLEFRGRNEHDILKVHDVPPVIA-GVVESGAFSTDAEEQRKEFAEE------- 334 (467) T ss_pred cccCCCcccccceeEEeccccChhhHHHHHHHHHHHHHHHHHhCCCHHHc-ccCCCCCcccCHHHHHHHHHHH------- Confidence 22223445454433322 23444556778999999998655 755433 433334455566543 Q ss_pred hhhHHHHHHHHHhh---c-----CCC--ceeEeCCCCCCCHHHHHHHHHHHHHHHHHHHhCCCCCHHHHHHHHHhhcc-- Q lcl|NC_019725. 137 DYRPLLEFLLPFIV---E-----EEE--WSIEFEPLSVPSKKEESEITKNNVESVTKAITEQIIDLEEARDTLRSIAP-- 204 (237) Q Consensus 137 ~l~p~l~~l~~~i~---~-----s~~--~~~~f~pL~~~seke~Aei~~~~A~a~~~~~~~g~i~~~e~r~~l~~~~~-- 204 (237) .|+|.+.++-..+- . ..+ +.|.+..|...+.++++++ ...++++|+++++|+|+.+- ..+ T Consensus 335 ~l~P~~~~ie~~ln~~l~~~~~~~~~~~i~f~~~~l~~~d~~~~~~~-------~~~~~~~G~~T~NE~R~~~G-l~pi~ 406 (467) T protein:vir:31 335 TIQPKQHDFGELLYELVHKQGLDAPDWTIEFELAKPDTKLQDVEIAS-------QRVQAMQGLLTVNELRDEFG-FEPFP 406 (467) T ss_pred HHHHHHHHHHHHHHHhhcchhhccCCceEEEecchhhccCHHHHHHH-------HHHHHhCCCcCHHHHHHHhC-CCCCC Confidence 37777766655442 1 134 4556678888888777654 56688999999999999862 111 Q ss_pred ccccCCCCCCCh-----hccccCCCCCCCCCCCcCcCC Q lcl|NC_019725. 205 EFKLKDGNNINI-----REPEETTEPEPGLGEKLEDEN 237 (237) Q Consensus 205 ~~g~~~~~~~~~-----~~~e~~~e~~~~~~~~~~~e~ 237 (237) +..+.+...+.. ..+....++.+..+.+.+.++ T Consensus 407 d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 444 (467) T protein:vir:31 407 EEHVYGGETLVAEVTGGSGPGGGIGDQIEQLVEDRADE 444 (467) T ss_pred cccccCCcccccccccccCCCCcccCcCCCCCCCcccc Confidence 111111000000 000000011111111111111 No 92 >protein:vir:100249 Length: 431 # NCBI annotation: gp78 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1619 # MgeName: Bcep176 # Cross-refs: genbank:acc:YP_355414;genbank:gi:77864704;genbank:GeneID:3725971 Probab=98.42 E-value=9.5e-08 Score=59.08 Aligned_cols=209 Identities=11% Similarity=0.071 Sum_probs=108.3 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHhcc--ceeechhHHHhhcCCchHHHHHHHHHHHHHhcCc-hheeeeecCCcceee Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRRKQQ--AVWKVKGLAEMCDDDDAQYAARLRLAQVDDNSGV-GRAIGIDAETEEYDV 77 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~~~~--~v~k~~~l~~~~~~~~~e~~~~~r~~~~~~~r~~-~~~~~iD~~~e~~~~ 77 (237) .++.+.+.|.....+......+...... .++++++ .++.. ....+++++...-..-+| .+.+++++ +-+|+. T Consensus 203 pi~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~---~ls~e-~~~~~~~~~~~~~~g~~n~g~~~vl~~-g~~~~~ 277 (431) T protein:vir:10 203 RVKLSGNALELAEQAERAASRTFRTGVMAGGAIEVPK---ELSDN-AYGRMKASVQENHTGSENAGSWMLLEE-GATAKQ 277 (431) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEecCC---CCCHH-HHHHHHHHHHHHhcCccccCCceecCC-CceEEE Confidence 6778888888888888888887776443 3566654 22222 233455555432222234 34566664 567777 Q ss_pred eecCcCC--HHHHHHHHHHHHhhhhcCceeeeeccCcccccccchhHHHHHHHHHHHHHHHhhhHHHHHHHHHhh---cC Q lcl|NC_019725. 78 LNSDISG--VPEFLSSKMDRIVSLSGIHEIIIKNKNVGGVSASQNTALETFYKLVDRKREEDYRPLLEFLLPFIV---EE 152 (237) Q Consensus 78 ~~~~lsG--l~dl~~~~~~~iaa~s~iP~t~L~G~sp~GlnatGe~D~~nyyd~I~~~Qe~~l~p~l~~l~~~i~---~s 152 (237) ++.+... +-+........||.+-|||..+|-+..-+.. ++-+.-...|+. .-|.|.+.++-.-+- .+ T Consensus 278 l~~~~~d~q~le~r~~~~~~Ia~~fgVPp~~lg~~~~~t~-sn~eq~~~~f~~-------~tL~P~~~~ie~~ln~~Ll~ 349 (431) T protein:vir:10 278 FSNTAASAQQIENRNHQIEEVARMYGVPRPLLMMDDTSWG-SGIEQLAIFFIQ-------YGLSHWFVSWEQAAARAFLP 349 (431) T ss_pred ccCChhHHHHHHHHHHhHHHHHHHhCCCHHHhCCCCCCcc-ccHHHHHHHHHH-------HHHHHHHHHHHHHHHhhccC Confidence 6554321 1234445578899999999998855332222 222333334443 347787776644332 11 Q ss_pred ----CCceeEe--CCCCCCCHHHHHHHHHHHHHHHHHHHhCC----CCCHHHHHHHHHhhccccccCCCCCCCh--hccc Q lcl|NC_019725. 153 ----EEWSIEF--EPLSVPSKKEESEITKNNVESVTKAITEQ----IIDLEEARDTLRSIAPEFKLKDGNNINI--REPE 220 (237) Q Consensus 153 ----~~~~~~f--~pL~~~seke~Aei~~~~A~a~~~~~~~g----~i~~~e~r~~l~~~~~~~g~~~~~~~~~--~~~e 220 (237) .++.|+| .-|...|.++++ +++++++.+| +++++|+|+.+- ..+..|-.+ +..-. -... T Consensus 350 ~~~~~~~~~~fd~~~llr~d~~~r~-------~~~~~~~~~G~~~g~lT~NE~R~~~g-l~p~~~~~g-D~~~~p~n~~~ 420 (431) T protein:vir:10 350 EKMLGQRQFKFNEGALLRGTLNDQA-------AFFSKALGAGGQSPWMKQNEVREMLD-LPRADDPVA-DQLRNPMTQKQ 420 (431) T ss_pred hhhcCCceEEEechhhhccCHHHHH-------HHHHHHHhcccccCccCHHHHHHHhC-CCCCCCccc-cceeccccccc Confidence 2444554 456566655554 4556666555 599999999762 222222111 00000 0001 Q ss_pred cCCCCCCCCCC Q lcl|NC_019725. 221 ETTEPEPGLGE 231 (237) Q Consensus 221 ~~~e~~~~~~~ 231 (237) .....+|+... T Consensus 421 ~~~~~~~p~~~ 431 (431) T protein:vir:10 421 KGSGDEPPATT 431 (431) T ss_pred CCCCCCCCCCC Confidence 11111222222 No 93 >protein:vir:95965 Length: 385 # NCBI annotation: ORF011 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1594 # MgeName: 2638A # Cross-refs: genbank:acc:YP_239800;genbank:gi:66395461;genbank:GeneID:5132882 Probab=98.41 E-value=7.2e-08 Score=59.74 Aligned_cols=203 Identities=10% Similarity=0.106 Sum_probs=97.5 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHhcc-ceeechhHHHhhcCCchHHHHHHHHHHH-HHhcCchh-eeeeecCCcceee Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRRKQQ-AVWKVKGLAEMCDDDDAQYAARLRLAQV-DDNSGVGR-AIGIDAETEEYDV 77 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~~~~-~v~k~~~l~~~~~~~~~e~~~~~r~~~~-~~~r~~~~-~~~iD~~~e~~~~ 77 (237) .++.+...+.....+.. +..+. .+++++. ....+++....++++++-. ...+++.+ ++++++ +-+|.. T Consensus 162 ~~~~~~~~i~~~~~~~~------~~~~~~g~l~~~~--~~~~~~e~~~~~~~~~~~~~~g~~~~~~~i~~l~~-g~~~~~ 232 (385) T protein:vir:95 162 LFEDYGEIFGRMIDLQM------LNNQIRGILKVDA--TKFYNKEKQKELQAYIDTLFDAFQNNTIAVVPLTE-GLAYEE 232 (385) T ss_pred HHHHHHHHHHHHHHHHH------hcCCCceEEEeCC--ccCCCHHHHHHHHHHHHHHhhhhhhcCCceEEcCC-CceeEe Confidence 44444443332222111 11111 2233321 1111222223344444332 12233444 345654 578888 Q ss_pred eecCcC--------CHHHHHHHHHHHHhhhhcCceeeeeccCcccccccchhHHHHHHHHHHHHHHHhhhHHHHHHHHHh Q lcl|NC_019725. 78 LNSDIS--------GVPEFLSSKMDRIVSLSGIHEIIIKNKNVGGVSASQNTALETFYKLVDRKREEDYRPLLEFLLPFI 149 (237) Q Consensus 78 ~~~~ls--------Gl~dl~~~~~~~iaa~s~iP~t~L~G~sp~GlnatGe~D~~nyyd~I~~~Qe~~l~p~l~~l~~~i 149 (237) ++.... -+-+........||.+-|||..+|-| .. ++-+.....||. ..|.|.+.++-..+ T Consensus 233 l~~~~~~~~s~~d~~~~e~~~~~~~~Ia~~fgVpp~~l~~----~~-sn~e~~~~~~~~-------~~l~P~~~~ie~~l 300 (385) T protein:vir:95 233 HSNRGAAQSAQQFSELNELKKTVLTDVARMIGVPPSLVLG----EM-ADLEKTIESYLQ-------FCINPLLRKIEAEL 300 (385) T ss_pred ecccccccCCHHHHHHHHHHHHHHHHHHHHhCCCHHHhcC----CC-cCHHHHHHHHHH-------HHHHHHHHHHHHHH Confidence 765432 23445666777799999999988832 22 223334455554 34788777776555 Q ss_pred hc----CC-----CceeEeCCCCCCCHHHHHHHHHHHHHHHHHHHhCCCCCHHHHHHHHHhhccccccCCCCCCC-hhcc Q lcl|NC_019725. 150 VE----EE-----EWSIEFEPLSVPSKKEESEITKNNVESVTKAITEQIIDLEEARDTLRSIAPEFKLKDGNNIN-IREP 219 (237) Q Consensus 150 ~~----s~-----~~~~~f~pL~~~seke~Aei~~~~A~a~~~~~~~g~i~~~e~r~~l~~~~~~~g~~~~~~~~-~~~~ 219 (237) -. .. .+.|.+.+|...|.+++ ++++++++++|+++++|+|+.+- ..+..+-.+ +..- .... T Consensus 301 ~~~L~~~~~~~~~~~~fd~~~l~~~D~~~~-------~~~~~~~~~~g~lt~NE~R~~~g-~~p~~~~~g-d~~~~~~n~ 371 (385) T protein:vir:95 301 NSKFFYQDEYLNDDMHIKVVGIDKRDPLKL-------SEAIDKLVASGTFTRNQVRIMTG-EEPADDPEL-DKFIITKNL 371 (385) T ss_pred HhhcCChhhcccceEEEechhhhccCHHHH-------HHHHHHHHhCCCcCHHHHHHHhC-CCCCCCCCC-ceeeecccc Confidence 32 11 34556668877777764 56677899999999999999772 222111110 0000 0000 Q ss_pred ccCCCCCCCCCCCcCc Q lcl|NC_019725. 220 EETTEPEPGLGEKLED 235 (237) Q Consensus 220 e~~~e~~~~~~~~~~~ 235 (237) ... ....+||..++ T Consensus 372 ~~~--~~~kgge~~~e 385 (385) T protein:vir:95 372 QSA--DAFKGGESNEE 385 (385) T ss_pred eec--ccccCCCCCCC Confidence 011 11112222211 No 94 >protein:vir:78083 Length: 537 # NCBI annotation: gp3 # Family: family:all:125 # MgeID: mge:1844 # MgeName: P35 # Cross-refs: genbank:acc:YP_001468787;genbank:gi:157325368;genbank:GeneID:5601845 Probab=98.41 E-value=2.5e-07 Score=56.78 Aligned_cols=221 Identities=15% Similarity=0.167 Sum_probs=121.4 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHhccceeechhHHHhhcCCchHHHHHHHHHHHHHhcCchheeeeecCC--cceeee Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRRKQQAVWKVKGLAEMCDDDDAQYAARLRLAQVDDNSGVGRAIGIDAET--EEYDVL 78 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~~~~~v~k~~~l~~~~~~~~~e~~~~~r~~~~~~~r~~~~~~~iD~~~--e~~~~~ 78 (237) .++.+.+.+.+|+......+.-+..+.-.++.+.|.. +......++.+ + .++++.+++++ -+|-.. T Consensus 267 d~e~v~~LiDayd~~~S~~an~~~~~~~~ilvi~g~~-----~~~~~~~~~~l------~-~~~~i~v~~d~~~v~~l~~ 334 (537) T protein:vir:78 267 DVKRVKSIIDDYDVMNCFLSNNLQDFSEAIYVVKGFS-----GDSTDKLRQNI------K-AKKMIGVNGDNAGMEIQTV 334 (537) T ss_pred chhhhHHHHHHHHHHHHhhhhHHHHhcCceeeeecCC-----CccchhHHHHH------h-hcCceeecCCCCceeEEEe Confidence 7888999999999999999999999998888887631 11111122211 1 24556666544 346667 Q ss_pred ecCcCCHHHHHHHHHHHHhhhhcCceeeeeccCcccccccchhHHHHHHHHH--HHHHHHhhhHHHHHHHHHhhc----- Q lcl|NC_019725. 79 NSDISGVPEFLSSKMDRIVSLSGIHEIIIKNKNVGGVSASQNTALETFYKLV--DRKREEDYRPLLEFLLPFIVE----- 151 (237) Q Consensus 79 ~~~lsGl~dl~~~~~~~iaa~s~iP~t~L~G~sp~GlnatGe~D~~nyyd~I--~~~Qe~~l~p~l~~l~~~i~~----- 151 (237) +.+..+....++.+.+.|-..+..|-+- ..-+| |+||..=..-|.... ....+..+++.|++++++|+. T Consensus 335 ~~~~~~~e~~ld~L~~~I~~~s~~~~~~---~~~~g-n~SGvAlk~~~~~l~~ka~~ke~~f~~~l~~~~~~i~~~~~~~ 410 (537) T protein:vir:78 335 SIPYEARKAKMDIDVENIYRSGMGFNST---AVGDG-NVTNVVIKSRYTLLAMKARKMETSLRKVLRWCADMVVSDIALR 410 (537) T ss_pred cCCHHHHHHHHHHHHHHHHHhcCCCCCc---ccccc-CCcHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhc Confidence 7777899999999999998888777643 33233 567764444444442 455566789999998888752 Q ss_pred ------CCCceeEeCCCCCCCHHHHHHHHHHHHHH--H---HHHHhCCCCCHHHHHHHHHhhcc--ccccCCCCC----- Q lcl|NC_019725. 152 ------EEEWSIEFEPLSVPSKKEESEITKNNVES--V---TKAITEQIIDLEEARDTLRSIAP--EFKLKDGNN----- 213 (237) Q Consensus 152 ------s~~~~~~f~pL~~~seke~Aei~~~~A~a--~---~~~~~~g~i~~~e~r~~l~~~~~--~~g~~~~~~----- 213 (237) ..++.|.|+|-..-+++|.|++..+..++ . ..+-..+.++..|.-+....... .....+... T Consensus 411 ~~~~~d~~~i~i~f~~~~P~n~~e~a~~~~~l~~~giiS~eT~l~~~p~vdd~e~ek~~~ee~~~~~~~~~~~~~~~~~~ 490 (537) T protein:vir:78 411 GLGEYDSNDICFEIEPHVLANELDIATTRKTEAETEALKIGNIMTVAPRIGDDETLKLIAEELDLDYNELKDALAEQDAQ 490 (537) T ss_pred CCcccccceeeEEeccCCCCCHHHHHHHHHHHHhcCcchHHHHHHhCCCCCCHHHHHHHHHHHHhhhhhhhhhhhhhccc Confidence 13688999999999999998876553321 1 11222344432221111111000 000000000 Q ss_pred ----CChhcccc---CCCCCCCCCCCcCcC-------C Q lcl|NC_019725. 214 ----INIREPEE---TTEPEPGLGEKLEDE-------N 237 (237) Q Consensus 214 ----~~~~~~e~---~~e~~~~~~~~~~~e-------~ 237 (237) ..+.++.. ...++|.++++.+++ + T Consensus 491 ~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~ 528 (537) T protein:vir:78 491 SLDVSPDVQAMLDGLPVNANQPPVDPNQPVADPNVVPP 528 (537) T ss_pred ccCcCcchhhhcCCCCCCCCCCCCCccCCCCCCCCCCC Confidence 00000000 000011111111111 0 No 95 >protein:vir:98643 Length: 395 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1601 # MgeName: phi3396 # Cross-refs: genbank:acc:YP_001039921;genbank:gi:126011096;genbank:GeneID:4818479 Probab=98.33 E-value=3.1e-07 Score=56.27 Aligned_cols=211 Identities=12% Similarity=0.076 Sum_probs=93.7 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHhccceeechhHHHhhc---CCchHHHHHHHHHHHH-HhcCchh-eeeeecCCcce Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRRKQQAVWKVKGLAEMCD---DDDAQYAARLRLAQVD-DNSGVGR-AIGIDAETEEY 75 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~~~~~v~k~~~l~~~~~---~~~~e~~~~~r~~~~~-~~r~~~~-~~~iD~~~e~~ 75 (237) +.+...+.+...........+ .++.....+..+...... +........+.++... ....+.. ++.+++ +-+| T Consensus 163 ~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~l~~-g~~~ 239 (395) T protein:vir:98 163 LWEEYGELLGHVINNQKIANQ--IRFTMIPPKDKVRERAQENSDGGRQSKSDKDFFKRTVEKIRTESVVGIPVTA-NTNY 239 (395) T ss_pred hhhhHHHHHHHHHHHHHHHHH--HHHhhccccccccccccccCCcHHHHHHHHHHHHHHHhhhhcCCcceeecCC-Ccee Confidence 333222222222111111111 011111111111111111 1111112222222221 1122222 333443 4567 Q ss_pred eeeecCcCC--------HHHHHHHHHHHHhhhhcCceeeeeccCcccccccchhHHHHHHHHHHHHHHHhhhHHHHHHHH Q lcl|NC_019725. 76 DVLNSDISG--------VPEFLSSKMDRIVSLSGIHEIIIKNKNVGGVSASQNTALETFYKLVDRKREEDYRPLLEFLLP 147 (237) Q Consensus 76 ~~~~~~lsG--------l~dl~~~~~~~iaa~s~iP~t~L~G~sp~GlnatGe~D~~nyyd~I~~~Qe~~l~p~l~~l~~ 147 (237) ..++.+... +-++.......||.+-|||..+| | +..+ +-+.....||. ..|.|.+.++-. T Consensus 240 ~~l~~~~~~~~~~~~~q~~e~~~~~~~~Ia~~fgVP~~~l-~---~~~s-n~e~~~~~f~~-------~tl~P~~~~ie~ 307 (395) T protein:vir:98 240 EEYGSKNTGAVKSYVDDIKKLKDQYMAEFAEMLGIPISLL-H---GDIA-DNQKNYELLLE-------GPIESLITNIVD 307 (395) T ss_pred EecccccccccChhHHHHHHHHHHHHHHHHHHhCCCHHHh-c---CCcc-cHHHHHHHHHH-------HHHHHHHHHHHH Confidence 776654332 22344455678999999999887 2 1221 22223344443 557887776654 Q ss_pred Hhh--------cCCCceeEeCCCCCCCHHHHHHHHHHHHHHHHHHHhCCCCCHHHHHHHHHhhccccccCCC-CCCChhc Q lcl|NC_019725. 148 FIV--------EEEEWSIEFEPLSVPSKKEESEITKNNVESVTKAITEQIIDLEEARDTLRSIAPEFKLKDG-NNINIRE 218 (237) Q Consensus 148 ~i~--------~s~~~~~~f~pL~~~seke~Aei~~~~A~a~~~~~~~g~i~~~e~r~~l~~~~~~~g~~~~-~~~~~~~ 218 (237) -+- +..++.|+|+.|...|.++ ++++++.+++.|+++++|+|+.+ ...+..|-.++ .-+. .. T Consensus 308 ~l~~kll~~~~~~~g~~f~~~~l~~~d~~~-------~~~~~~~~~~~G~~T~NE~R~~~-g~~Pi~~~~gD~~~~~-~n 378 (395) T protein:vir:98 308 GLEYAIFDKSETLQGSFIKVTGLKNYDLFS-------ISNQADKLISSGFVFIDEVREEI-GLPELPDGLGKVLYMT-KN 378 (395) T ss_pred HHHHhcCChhhhcCcceeeehhhhccCHHH-------HHHHHHHHHhCCCcCHHHHHHHh-CCCCCCCCCCceeeec-cc Confidence 432 2346789999998888765 56678889999999999999976 23332221110 0000 00 Q ss_pred cccCCCCCCCCCCCcCcCC Q lcl|NC_019725. 219 PEETTEPEPGLGEKLEDEN 237 (237) Q Consensus 219 ~e~~~e~~~~~~~~~~~e~ 237 (237) . .+-++.|+....+.+| T Consensus 379 ~--~~~~~~gge~~~~~~~ 395 (395) T protein:vir:98 379 Y--ESVLERGGEVDEEVET 395 (395) T ss_pred c--eecccccCCCCCCCCC Confidence 0 0111122223333333 No 96 >protein:vir:78310 Length: 376 # NCBI annotation: gp3 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1850 # MgeName: B025 # Cross-refs: genbank:acc:YP_001468642;genbank:gi:157325220;genbank:GeneID:5601655 Probab=98.30 E-value=2e-07 Score=57.36 Aligned_cols=202 Identities=15% Similarity=0.146 Sum_probs=92.6 Q ss_pred CchhHHHHHHHHHHHHHHHHH-HHHHhccce-eechhHHHhhcCCchHHHHHHHHHHHHHh-cCchh-eeeeecCCccee Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQ-ILRRKQQAV-WKVKGLAEMCDDDDAQYAARLRLAQVDDN-SGVGR-AIGIDAETEEYD 76 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~-Ll~~~~~~v-~k~~~l~~~~~~~~~e~~~~~r~~~~~~~-r~~~~-~~~iD~~~e~~~ 76 (237) .+..+... +......... ..+..+... +++. ....+ +++....++++++..-.. .++.+ ++.+++ +-+|. T Consensus 157 ~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~-~~e~~~~~~~~~~~~~~g~~~~~~~v~~l~~-g~~~~ 230 (376) T protein:vir:78 157 FTDGMFED---YGELFGKMIRAQMRNFQIRGAVNFK-MAGVA-DKDKQTKLQEYIDKVYASFNNNEIAIVPQLE-GFNYE 230 (376) T ss_pred hhhHHHHH---HHHHHHHHHHHHHhcCCCceeEEEc-cCCCC-CHHHHHHHHHHHHHHhccccccCcceEEcCC-CceEE Confidence 22222221 1111111111 111111111 1111 01111 222233445554432222 23333 334554 57888 Q ss_pred eeecCcCCH-------HHHHHHHHHHHhhhhcCceeeeeccCcccccccchhHHHHHHHHHHHHHHHhhhHHHHHHHHHh Q lcl|NC_019725. 77 VLNSDISGV-------PEFLSSKMDRIVSLSGIHEIIIKNKNVGGVSASQNTALETFYKLVDRKREEDYRPLLEFLLPFI 149 (237) Q Consensus 77 ~~~~~lsGl-------~dl~~~~~~~iaa~s~iP~t~L~G~sp~GlnatGe~D~~nyyd~I~~~Qe~~l~p~l~~l~~~i 149 (237) .++.+...+ -+........||.+-|||-..|-| .. ++-+.-...||.. .|.|.+.++-..+ T Consensus 231 ~l~~~~~~~~~~~~q~~e~~~~~~~~Ia~~fgVPp~~l~~----~~-s~~e~~~~~f~~~-------~l~P~~~~ie~~l 298 (376) T protein:vir:78 231 EFGTTSVNNSQSFDEVKKLRKEMIDYVASILGIPSSLLHG----DM-ADLSNNMKAYMEY-------CIDPLTKKLEDEL 298 (376) T ss_pred eeccCccccchhHHHHHHHHHHHHHHHHHHhCCCHHHhCC----CC-CCHHHHHHHHHHH-------HHHHHHHHHHHHH Confidence 887766533 334455577899999999998832 11 2223333444443 4777777665444 Q ss_pred h----cCCCce--eEeCCCCCCCHHHHHHHHHHHHHHHHHHHhCCCCCHHHHHHHHHhhccc-cccCCCCCCChhccccC Q lcl|NC_019725. 150 V----EEEEWS--IEFEPLSVPSKKEESEITKNNVESVTKAITEQIIDLEEARDTLRSIAPE-FKLKDGNNINIREPEET 222 (237) Q Consensus 150 ~----~s~~~~--~~f~pL~~~seke~Aei~~~~A~a~~~~~~~g~i~~~e~r~~l~~~~~~-~g~~~~~~~~~~~~e~~ 222 (237) - ...+|. |.|..|...|.++ +++++++++++|+++++|+|+.+- ..+. .|..+..-+ +... T Consensus 299 ~~kll~~~~~~~~~~~~~ll~~d~~~-------~~~~~~~~~~~G~~t~NE~R~~lg-~~p~~~g~~d~~~~----~~n~ 366 (376) T protein:vir:78 299 NAKLFTFSEFLAGEHIKIIHKKDIIE-------NAEAVDKLVASGSFNRNEVRELLG-AERVDNPELDKYLI----TKNY 366 (376) T ss_pred HhhhCCcccceecccchhhcccCHHH-------HHHHHHHHHhCCCcCHHHHHHHhC-CCCCCCCCCceeee----ccCc Confidence 2 234444 4455666666554 577888999999999999999873 2221 111100000 0010 Q ss_pred CCCCCCCCCC Q lcl|NC_019725. 223 TEPEPGLGEK 232 (237) Q Consensus 223 ~e~~~~~~~~ 232 (237) ..-+.+..++ T Consensus 367 ~~~~~~~e~g 376 (376) T protein:vir:78 367 QSADEGGEDG 376 (376) T ss_pred eehhccccCC Confidence 0011111111 No 97 >protein:vir:95113 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1549 # MgeName: X2 # Cross-refs: genbank:acc:YP_240817;genbank:gi:66394677;genbank:GeneID:5133907 Probab=98.30 E-value=5e-07 Score=55.13 Aligned_cols=205 Identities=15% Similarity=0.114 Sum_probs=116.4 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHhccceeechhHHHhhcCCchHHHHHHHHHHHHHhcCchheeeeecCCc-ceeeee Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRRKQQAVWKVKGLAEMCDDDDAQYAARLRLAQVDDNSGVGRAIGIDAETE-EYDVLN 79 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~~~~~v~k~~~l~~~~~~~~~e~~~~~r~~~~~~~r~~~~~~~iD~~~e-~~~~~~ 79 (237) .++.+.+.+.+++.+....+.-+..+...++.+.|.. . .+.+ ..... ....+++.++++++ +|-..+ T Consensus 244 d~e~v~~liDa~d~~~S~~~~~~~~~~~p~lv~~g~~--~--~~~~-~~~~~-------~~~~~~i~~~~~~~~~~l~~~ 311 (474) T protein:vir:95 244 DIWMYKSLIDAIDKRLSDAQNMFDESVELIYILKGYE--G--QDLE-EFMRG-------LKYYKAINVDGDGGVETIQVE 311 (474) T ss_pred cHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCC--c--ccch-hhhhh-------hhccceeeccCCCceeEEeec Confidence 7788889999999999999888887777777766532 1 1111 11111 11244555655432 245567 Q ss_pred cCcCCHHHHHHHHHHHHhhhhcCceeeeeccCcccccccchhHHHHHHHHHH--HHHHHhhhHHHHHHHHHhhcC----- Q lcl|NC_019725. 80 SDISGVPEFLSSKMDRIVSLSGIHEIIIKNKNVGGVSASQNTALETFYKLVD--RKREEDYRPLLEFLLPFIVEE----- 152 (237) Q Consensus 80 ~~lsGl~dl~~~~~~~iaa~s~iP~t~L~G~sp~GlnatGe~D~~nyyd~I~--~~Qe~~l~p~l~~l~~~i~~s----- 152 (237) .+.+++...++.+...|...+++|-. .++ +.+| |.||..=..-|..... ...+..++..|++++.++..- T Consensus 312 ~~~~~~~~~~~~l~~~i~~~s~~p~~-~~~-~~~~-n~Sg~Alk~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~g~~~ 388 (474) T protein:vir:95 312 VPVSSTKEYIDLMRAYIMEFGQGVDF-QTD-KFGS-APSGIALKFLYGNLDLKANKLKNKATVAIQELIGFIIDFNNLKM 388 (474) T ss_pred CCHHHHHHHHHHHHHHHHHHhCCccc-ccc-cccc-cchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCc Confidence 88899999999999999999999952 222 2222 4466654444544433 444466888999988887532 Q ss_pred --CCceeEeCCCCCCCHHHHHHHHHHHHHHHHHHHhCCCCCHHHHHHHHHhhccccccCCCCCCChhcc--------ccC Q lcl|NC_019725. 153 --EEWSIEFEPLSVPSKKEESEITKNNVESVTKAITEQIIDLEEARDTLRSIAPEFKLKDGNNINIREP--------EET 222 (237) Q Consensus 153 --~~~~~~f~pL~~~seke~Aei~~~~A~a~~~~~~~g~i~~~e~r~~l~~~~~~~g~~~~~~~~~~~~--------e~~ 222 (237) .++.+.|+|-...+++|.|++. .++|+||.+.+.+.+-. .+......+.+ ... T Consensus 389 d~~~i~v~f~~~~p~d~~e~a~~~----------~~~g~iS~et~i~~l~~-------v~d~~~E~~ri~~E~~~~~~~~ 451 (474) T protein:vir:95 389 DVKDIEISFNFNRMMNDAEQSQII----------AQSQYLSRETLVKSSPL-------VDDYKAELERIEQEQMEYNKQL 451 (474) T ss_pred ccceeeEEeccCCCcCHHHHHHHH----------HhcCCCchHHHHHhCCC-------CCCHHHHHHHHHHHHHHHHhcc Confidence 4788999999999988888743 23466666555543210 00000000000 000 Q ss_pred CCCC-------CCCCCCcCcCC Q lcl|NC_019725. 223 TEPE-------PGLGEKLEDEN 237 (237) Q Consensus 223 ~e~~-------~~~~~~~~~e~ 237 (237) +... +...++.+.|+ T Consensus 452 ~~~~~~~~d~~~~~~~~~~~~~ 473 (474) T protein:vir:95 452 PNLDDGGADGAQQQERSNDKES 473 (474) T ss_pred cccccccCCCCcCCCCCccCCC Confidence 0000 00000000111 No 98 >protein:vir:80644 Length: 551 # NCBI annotation: gp23 # Family: family:all:2446 # MgeID: mge:1883 # MgeName: A511 # Cross-refs: genbank:acc:YP_001468463;genbank:gi:157325038;genbank:GeneID:5601615 Probab=98.29 E-value=2.2e-07 Score=57.06 Aligned_cols=226 Identities=15% Similarity=0.165 Sum_probs=108.1 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHhc-cc-eeechhHHHhhcCCchHHHHHHHHHHHHHhcCchhe-eeeecCCcceee Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRRKQ-QA-VWKVKGLAEMCDDDDAQYAARLRLAQVDDNSGVGRA-IGIDAETEEYDV 77 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~~~-~~-v~k~~~l~~~~~~~~~e~~~~~r~~~~~~~r~~~~~-~~iD~~~e~~~~ 77 (237) -++.+.+.|.....+......+..... .. ++++++-.. + +.+....+++++.-.-..-.|.+- .++..++=+|.. T Consensus 254 pi~~a~~~i~~~~a~~~~~~~~f~Ng~~p~giL~~~~~~~-l-t~e~~~~lk~~~~~~~~G~~nag~~~vl~~~g~~~~~ 331 (551) T protein:vir:80 254 ELEIALKQFIAHENTEAFNDRFFSHGGTTRGILQIKAAQQ-Q-SQHALEIFKREWKNSLSGINGSWQIPVVSAEDVKFVN 331 (551) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHcCCCcceEEEEcCCCC-C-CHHHHHHHHHHHHHHhcCccccCccccccCCCceEEE Confidence 357778888888888877777776643 22 344442111 1 122223445444432222234443 455544445555 Q ss_pred eecCcCC--HHHHHHHHHHHHhhhhcCceeeeeccCcccccccchh--HHHHHHHHHHHHHHHhhhHHHHHHHHHh---- Q lcl|NC_019725. 78 LNSDISG--VPEFLSSKMDRIVSLSGIHEIIIKNKNVGGVSASQNT--ALETFYKLVDRKREEDYRPLLEFLLPFI---- 149 (237) Q Consensus 78 ~~~~lsG--l~dl~~~~~~~iaa~s~iP~t~L~G~sp~GlnatGe~--D~~nyyd~I~~~Qe~~l~p~l~~l~~~i---- 149 (237) ++.+... +-+........||.+-|||-..|--..-++..+++.+ -..|+-.....+-+..|.|.+.++-..| T Consensus 332 l~~~~~D~qfle~~~~~~~~Ia~aFgVPp~~lG~~~~~~~~~~~~~s~t~sn~e~~~~~f~~~tL~P~~~~ie~~ln~~L 411 (551) T protein:vir:80 332 MTPSARDMEFEKWLNYLINVISALYGIDPAEINIPNNGGATGSKGGSLNEGNSAEKNQASKNKGLQPLLGFIEDFINKHI 411 (551) T ss_pred ccCChhHHHHHHHHHHHHHHHHHHhcCCHHHcCcccccccccccccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHhhh Confidence 5443322 2334566778899999999877743332222111111 1122222233333455777766664443 Q ss_pred hc--CCCceeEeCCCCCCCHHHHHHHHHHHHHHHHHHHhCCCCCHHHHHHHHHhhcc-ccccC-C--CCCCC-------h Q lcl|NC_019725. 150 VE--EEEWSIEFEPLSVPSKKEESEITKNNVESVTKAITEQIIDLEEARDTLRSIAP-EFKLK-D--GNNIN-------I 216 (237) Q Consensus 150 ~~--s~~~~~~f~pL~~~seke~Aei~~~~A~a~~~~~~~g~i~~~e~r~~l~~~~~-~~g~~-~--~~~~~-------~ 216 (237) .. ...+.|+|.-+...+.++++++ ..++..|+++++|+|+.+- ..+ ..|-. . ..++. . T Consensus 412 ~~~~~~~~~f~f~~~~~~~~~~~~~~--------~~~~~~g~lT~NE~R~~~g-l~P~~egGD~~~~~~~~~~~~~~~~~ 482 (551) T protein:vir:80 412 VAEFGDKYTFQFVGGDIKSELESVKI--------LAEKAKVAMTVNEVRKELN-LPGDVIGGDIPLNGVIVQRIGQLMQQ 482 (551) T ss_pred ccccCCceEEEeeccChhhHHHHHHH--------HHHHhcCCcCHHHHHHHhC-CCCCCCCCceeecccccccccccccc Confidence 22 2468899998887776665532 2355679999999999763 222 11100 0 00000 0 Q ss_pred h--c----------cccCC---CCCCCCCCCcCcCC Q lcl|NC_019725. 217 R--E----------PEETT---EPEPGLGEKLEDEN 237 (237) Q Consensus 217 ~--~----------~e~~~---e~~~~~~~~~~~e~ 237 (237) . + +.+.. ...+...++.+.++ T Consensus 483 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~ 518 (551) T protein:vir:80 483 EQFEHEKQQSNLQMLQEQTGNRVSTDVEDIPDGKDT 518 (551) T ss_pred cCcchhhhhhccccccCcCCCCCCCCCCCCCCcccc Confidence 0 0 00000 00011111111111 No 99 >protein:vir:94101 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1494 # MgeName: 96 # Cross-refs: genbank:acc:YP_240229;genbank:gi:66395892;genbank:GeneID:5133270 Probab=98.27 E-value=1.1e-07 Score=58.84 Aligned_cols=202 Identities=12% Similarity=0.117 Sum_probs=122.1 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHhccceeechhHHHhhcCCchHHHHHHHHHHHHHhcCchheeeeecCCcceeee-- Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRRKQQAVWKVKGLAEMCDDDDAQYAARLRLAQVDDNSGVGRAIGIDAETEEYDVL-- 78 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~~~~~v~k~~~l~~~~~~~~~e~~~~~r~~~~~~~r~~~~~~~iD~~~e~~~~~-- 78 (237) .++.+.+.+.+++.+....+.-+..++..++.+.|.. ...+ ....+ + ..+++.+..++.+...+ T Consensus 243 d~e~v~~liDa~d~~~S~~~~~~~~~~~~~l~i~g~~-----~~~~--~~~~~------~-~~~~i~~~~~~~~~~~l~~ 308 (474) T protein:vir:94 243 DAEKVIHLIDAYDLTMSDASSEISQTRLAYLVLRGMG-----MSEE--MIQET------Q-KSGAFELFDKDMDVKYLTK 308 (474) T ss_pred chHHHHHHHHHHHHHHHHHHHHHHHhhcchhhhccCC-----CCch--hhhhh------h-hcceeEecCCCCceeEEec Confidence 7788999999999999999998888888877776631 1111 11111 1 23455553334445544 Q ss_pred ecCcCCHHHHHHHHHHHHhhhhcCceeeeeccCcccccccchhHHHHHHHHH--HHHHHHhhhHHHHHHHHHhhc----- Q lcl|NC_019725. 79 NSDISGVPEFLSSKMDRIVSLSGIHEIIIKNKNVGGVSASQNTALETFYKLV--DRKREEDYRPLLEFLLPFIVE----- 151 (237) Q Consensus 79 ~~~lsGl~dl~~~~~~~iaa~s~iP~t~L~G~sp~GlnatGe~D~~nyyd~I--~~~Qe~~l~p~l~~l~~~i~~----- 151 (237) +.+.+++...++.+.+.|...+++|-.-.-+- +| |.||..=...|.... ...++..++..|++++++++. T Consensus 309 ~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~--~~-n~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~l~~~ 385 (474) T protein:vir:94 309 DVNDTMIENHLDRIEKNIMRFAKSVNFNSDEF--NG-NVPIIGMKLKLMALENKCMTFERKMTAMLRYQFKVILSALKRK 385 (474) T ss_pred cCCHHHHHHHHHHHHHHHHHHhCCcccccccc--cc-cchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhc Confidence 45668899999999999999999996543221 23 557775555555443 455667789999988888652 Q ss_pred -----C---CCceeEeCCCCCCCHHHHHHHHHHHHHHHHHHHhCCCCCHHHHHHHHHhhccccccCCCCCCCh------- Q lcl|NC_019725. 152 -----E---EEWSIEFEPLSVPSKKEESEITKNNVESVTKAITEQIIDLEEARDTLRSIAPEFKLKDGNNINI------- 216 (237) Q Consensus 152 -----s---~~~~~~f~pL~~~seke~Aei~~~~A~a~~~~~~~g~i~~~e~r~~l~~~~~~~g~~~~~~~~~------- 216 (237) . .++++.|.|-...++++.|++..+.+ |+||.+.+.+.+.. .+ ++.. T Consensus 386 ~~~~~~~~~~~i~~~f~~~~p~d~~e~a~~~~kl~---------g~iS~et~~~~l~~-------v~--d~~~E~eri~~ 447 (474) T protein:vir:94 386 GYNLDDDSYLNLIFKFTRNIPVNKLEESQVLINLK---------GQVSERTRLGQSQL-------VD--DVDYELDEMEK 447 (474) T ss_pred cCCCCccccccceEEeCCCCCCCHHHHHHHHHHHh---------ccCchHHHHHhCCC-------CC--CHHHHHHHHHH Confidence 1 26789999999999999998776542 67777666654421 00 1110 Q ss_pred hc---cccCCCCCCC--CCCCcCcCC Q lcl|NC_019725. 217 RE---PEETTEPEPG--LGEKLEDEN 237 (237) Q Consensus 217 ~~---~e~~~e~~~~--~~~~~~~e~ 237 (237) |. ....++..++ .+++.+.+| T Consensus 448 E~~e~~~~~~~~~~~~~~~~~~~~~s 473 (474) T protein:vir:94 448 ESLEFNDKLPDIDEGDANDKSQNNQS 473 (474) T ss_pred HHHHHHhhcccccCCCcCCCCccccC Confidence 00 0011111111 111222222 No 100 >protein:vir:105889 Length: 474 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1514 # MgeName: phiETA3 # Cross-refs: genbank:acc:YP_001004371;genbank:gi:122891826;genbank:GeneID:4712360 Probab=98.27 E-value=1.1e-07 Score=58.84 Aligned_cols=202 Identities=12% Similarity=0.117 Sum_probs=122.1 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHhccceeechhHHHhhcCCchHHHHHHHHHHHHHhcCchheeeeecCCcceeee-- Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRRKQQAVWKVKGLAEMCDDDDAQYAARLRLAQVDDNSGVGRAIGIDAETEEYDVL-- 78 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~~~~~v~k~~~l~~~~~~~~~e~~~~~r~~~~~~~r~~~~~~~iD~~~e~~~~~-- 78 (237) .++.+.+.+.+++.+....+.-+..++..++.+.|.. ...+ ....+ + ..+++.+..++.+...+ T Consensus 243 d~e~v~~liDa~d~~~S~~~~~~~~~~~~~l~i~g~~-----~~~~--~~~~~------~-~~~~i~~~~~~~~~~~l~~ 308 (474) T protein:vir:10 243 DAEKVIHLIDAYDLTMSDASSEISQTRLAYLVLRGMG-----MSEE--MIQET------Q-KSGAFELFDKDMDVKYLTK 308 (474) T ss_pred chHHHHHHHHHHHHHHHHHHHHHHHhhcchhhhccCC-----CCch--hhhhh------h-hcceeEecCCCCceeEEec Confidence 7788999999999999999998888888877776631 1111 11111 1 23455553334445544 Q ss_pred ecCcCCHHHHHHHHHHHHhhhhcCceeeeeccCcccccccchhHHHHHHHHH--HHHHHHhhhHHHHHHHHHhhc----- Q lcl|NC_019725. 79 NSDISGVPEFLSSKMDRIVSLSGIHEIIIKNKNVGGVSASQNTALETFYKLV--DRKREEDYRPLLEFLLPFIVE----- 151 (237) Q Consensus 79 ~~~lsGl~dl~~~~~~~iaa~s~iP~t~L~G~sp~GlnatGe~D~~nyyd~I--~~~Qe~~l~p~l~~l~~~i~~----- 151 (237) +.+.+++...++.+.+.|...+++|-.-.-+- +| |.||..=...|.... ...++..++..|++++++++. T Consensus 309 ~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~--~~-n~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~l~~~ 385 (474) T protein:vir:10 309 DVNDTMIENHLDRIEKNIMRFAKSVNFNSDEF--NG-NVPIIGMKLKLMALENKCMTFERKMTAMLRYQFKVILSALKRK 385 (474) T ss_pred cCCHHHHHHHHHHHHHHHHHHhCCcccccccc--cc-cchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhc Confidence 45668899999999999999999996543221 23 557775555555443 455667789999988888652 Q ss_pred -----C---CCceeEeCCCCCCCHHHHHHHHHHHHHHHHHHHhCCCCCHHHHHHHHHhhccccccCCCCCCCh------- Q lcl|NC_019725. 152 -----E---EEWSIEFEPLSVPSKKEESEITKNNVESVTKAITEQIIDLEEARDTLRSIAPEFKLKDGNNINI------- 216 (237) Q Consensus 152 -----s---~~~~~~f~pL~~~seke~Aei~~~~A~a~~~~~~~g~i~~~e~r~~l~~~~~~~g~~~~~~~~~------- 216 (237) . .++++.|.|-...++++.|++..+.+ |+||.+.+.+.+.. .+ ++.. T Consensus 386 ~~~~~~~~~~~i~~~f~~~~p~d~~e~a~~~~kl~---------g~iS~et~~~~l~~-------v~--d~~~E~eri~~ 447 (474) T protein:vir:10 386 GYNLDDDSYLNLIFKFTRNIPVNKLEESQVLINLK---------GQVSERTRLGQSQL-------VD--DVDYELDEMEK 447 (474) T ss_pred cCCCCccccccceEEeCCCCCCCHHHHHHHHHHHh---------ccCchHHHHHhCCC-------CC--CHHHHHHHHHH Confidence 1 26789999999999999998776542 67777666654421 00 1110 Q ss_pred hc---cccCCCCCCC--CCCCcCcCC Q lcl|NC_019725. 217 RE---PEETTEPEPG--LGEKLEDEN 237 (237) Q Consensus 217 ~~---~e~~~e~~~~--~~~~~~~e~ 237 (237) |. ....++..++ .+++.+.+| T Consensus 448 E~~e~~~~~~~~~~~~~~~~~~~~~s 473 (474) T protein:vir:10 448 ESLEFNDKLPDIDEGDANDKSQNNQS 473 (474) T ss_pred HHHHHHhhcccccCCCcCCCCccccC Confidence 00 0011111111 111222222 No 101 >protein:vir:102950 Length: 471 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1461 # MgeName: EJ-1 # Cross-refs: genbank:acc:NP_945279;genbank:gi:39653714;interpro:IPR006428;uniprot:Q708N3;genbank:GeneID:2672864 Probab=98.27 E-value=1.4e-07 Score=58.17 Aligned_cols=203 Identities=12% Similarity=0.042 Sum_probs=126.9 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHhccceeechhHHHhhcCCchHHHHHHHHHHHHHhcCchheeeeecC------Ccc Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRRKQQAVWKVKGLAEMCDDDDAQYAARLRLAQVDDNSGVGRAIGIDAE------TEE 74 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~~~~~v~k~~~l~~~~~~~~~e~~~~~r~~~~~~~r~~~~~~~iD~~------~e~ 74 (237) .++.+.+-+.+++......+.-+..+....+.+.|... .... .....+ + ..+++.++.. +-+ T Consensus 247 d~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~g~~~---~~~~--~~~~~~------~-~~~~i~~~~~~~~~~~~~~ 314 (471) T protein:vir:10 247 DLKPIKDLVDVYDKVFSGFVNDTDDVQEVIFVLTNYGG---QDKQ--EFLEDL------K-RYKMIKMDNDGMGDQSGVT 314 (471) T ss_pred chHHHHHHHHHHHHHHHHHHHHHHHhhCceeeeecCCc---cccc--hhHHHh------h-cCCeEEecCCCCccCccce Confidence 68888899999999988888888888877777776310 1111 111111 1 1334444322 124 Q ss_pred eeeeecCcCCHHHHHHHHHHHHhhhhcCceeeeeccCcccccccchhHHHHHHHHH--HHHHHHhhhHHHHHHHHHhhc- Q lcl|NC_019725. 75 YDVLNSDISGVPEFLSSKMDRIVSLSGIHEIIIKNKNVGGVSASQNTALETFYKLV--DRKREEDYRPLLEFLLPFIVE- 151 (237) Q Consensus 75 ~~~~~~~lsGl~dl~~~~~~~iaa~s~iP~t~L~G~sp~GlnatGe~D~~nyyd~I--~~~Qe~~l~p~l~~l~~~i~~- 151 (237) |-..+.+..++...++.+.+.|...+++|-.-.-+. | |+||..=..-|.... ....+..+++.+++++++++. T Consensus 315 ~l~~~~~~~~~~~~~~~l~~~I~~~s~tp~~~~~~~---g-n~Sg~Alk~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~ 390 (471) T protein:vir:10 315 TIAIDIPTEARNLILERTKKQIFISGQGVNPETDKL---G-NSSGVALKFLYSLLELKAGNMETQFRSGYATLVKMILKH 390 (471) T ss_pred EEeecCChHHHHHHHHHHHHHHHHHhCCcCCCcccc---c-CccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 666677889999999999999999999996543331 2 567765334444432 445567789999999888763 Q ss_pred -----CCCceeEeCCCCCCCHHHHHHHHHHHHHHHHHHHhCCCCCHHHHHHHHHhhccccccCCCCCCCh-------hcc Q lcl|NC_019725. 152 -----EEEWSIEFEPLSVPSKKEESEITKNNVESVTKAITEQIIDLEEARDTLRSIAPEFKLKDGNNINI-------REP 219 (237) Q Consensus 152 -----s~~~~~~f~pL~~~seke~Aei~~~~A~a~~~~~~~g~i~~~e~r~~l~~~~~~~g~~~~~~~~~-------~~~ 219 (237) ..++.+.|+|....+++|.|++..+. .|+||.+.+.+.+- +. + +... |.. T Consensus 391 ~~~~d~~~i~i~f~~~~p~n~~e~~~~~~kl---------~g~iS~et~~~~~p------~v-~--D~~~E~eri~~E~~ 452 (471) T protein:vir:10 391 LGLSDKLKIKQTWTRNSINNDTEMAQVVSTL---------ATITSRENVAKSNP------IV-E--DWQDELRLQKAEQE 452 (471) T ss_pred hccCCCceeEEEeCCCCCCCHHHHHHHHHHH---------hccCchHHHHHhCC------CC-C--CHHHHHHHHHHHHH Confidence 24788999999999999999875442 47888877665431 11 1 1111 111 Q ss_pred ccCCCCCCCCCCCcCcCC Q lcl|NC_019725. 220 EETTEPEPGLGEKLEDEN 237 (237) Q Consensus 220 e~~~e~~~~~~~~~~~e~ 237 (237) +.+...++..+-..++|. T Consensus 453 ~~~~~~~~~~~~~~~~e~ 470 (471) T protein:vir:10 453 GRSEKLYDMEEVEHESEV 470 (471) T ss_pred HHHhcccccCCCCCcccc Confidence 122223334444444444 No 102 >protein:vir:5961 Length: 503 # NCBI annotation: hypothetical protein # Family: family:all:125 # MgeID: mge:125 # MgeName: SPP1 # Cross-refs: genbank:acc:NP_690661;genbank:geneid:6329220;genbank:gi:22855055;interpro:IPR006428;uniprot:P54309;genbank:GeneID:955279 Probab=98.27 E-value=7.3e-07 Score=54.23 Aligned_cols=215 Identities=15% Similarity=0.141 Sum_probs=116.0 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHhccceeechhHHHhhcCCchHHHHHHHHHHHHHhcCchheeeeecCC-cceeeee Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRRKQQAVWKVKGLAEMCDDDDAQYAARLRLAQVDDNSGVGRAIGIDAET-EEYDVLN 79 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~~~~~v~k~~~l~~~~~~~~~e~~~~~r~~~~~~~r~~~~~~~iD~~~-e~~~~~~ 79 (237) .++.+.+.+.+++.+....+.-+..+...++.+.|.. +.........+. ..+++.+++++ -+|-..+ T Consensus 256 d~~~~~~liDa~d~~~s~~~~~~~~~~~~~~v~~g~~-----~~~~~~~~~~~~-------~~~~~~~~~~~~~~~l~~~ 323 (503) T protein:vir:59 256 DLKFYKDLIDNYDSITSSTMDSFSDFQQIVYVLKNYD-----GENPKEFTANLR-------YHSVIKVSGDGGVDTLRAE 323 (503) T ss_pred chhhhHHHHHHHHHHHHHHHHHHHHhcCCeeEeecCC-----ccccchhhhhhh-------cccceeccCCCcceeEecc Confidence 6778889999999998888888888888888776531 110001111111 23344554433 2355567 Q ss_pred cCcCCHHHHHHHHHHHHhhhhcCceeeeeccCcccccccchhHHHHHHHHH--HHHHHHhhhHHHHHHHHHhhc------ Q lcl|NC_019725. 80 SDISGVPEFLSSKMDRIVSLSGIHEIIIKNKNVGGVSASQNTALETFYKLV--DRKREEDYRPLLEFLLPFIVE------ 151 (237) Q Consensus 80 ~~lsGl~dl~~~~~~~iaa~s~iP~t~L~G~sp~GlnatGe~D~~nyyd~I--~~~Qe~~l~p~l~~l~~~i~~------ 151 (237) .+.+++...++.+...|...+.+|-.-. +.- +| |.||..=...|...+ ....+..++..|++++.+++. T Consensus 324 ~~~~~~~~~~~~l~~~i~~~s~~p~~~~-~~~-~~-~~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~~i~~~~~~~~ 400 (503) T protein:vir:59 324 IPVDSAAKELERIQDELYKSAQAVDNSP-ETI-GG-GATGPALENLYALLDLKANMAERKIRAGLRLFFWFFAEYLRNTG 400 (503) T ss_pred CCHHHHHHHHHHHHHHHHHHhcccCCCc-ccc-cc-cccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcc Confidence 7788999999999999988888885432 111 11 345654333333333 233446688889988887642 Q ss_pred ------CCCceeEeCCCCCCCHHHHHHHHHHHHHHHHHHHhCCCCCHHHHHHHH-------------Hhhcccccc-CCC Q lcl|NC_019725. 152 ------EEEWSIEFEPLSVPSKKEESEITKNNVESVTKAITEQIIDLEEARDTL-------------RSIAPEFKL-KDG 211 (237) Q Consensus 152 ------s~~~~~~f~pL~~~seke~Aei~~~~A~a~~~~~~~g~i~~~e~r~~l-------------~~~~~~~g~-~~~ 211 (237) ..++++.|+|-...+.++.|+. +..++++|++|.+.+.+.+ .+......- ... T Consensus 401 ~~~~~~~~~i~i~f~~~~p~d~~~~~~~-------~~kl~~~GiiS~et~l~~l~~v~d~~~E~~ri~~E~~~~~~~~~~ 473 (503) T protein:vir:59 401 KGDFNPDKELTMTFTRTRIQNDSEIVQS-------LVQGVTGGIMSKETAVARNPFVQDPEEELARIEEEMNQYAEMQGN 473 (503) T ss_pred CcccccccceeEEeCCCCCCCHHHHHHH-------HHHHHhCCCCchHHHHHhCCCCCCHHHHHHHHHHHHHHHHhhhcc Confidence 1368999999999999886654 4455666666655554432 110000000 000 Q ss_pred CCCChhccccCCCCCCCCCCCcCcCC Q lcl|NC_019725. 212 NNINIREPEETTEPEPGLGEKLEDEN 237 (237) Q Consensus 212 ~~~~~~~~e~~~e~~~~~~~~~~~e~ 237 (237) ..-.....+...+++|..++.....| T Consensus 474 ~~~~~~~~~~~~~~~~~~~~~~~~~~ 499 (503) T protein:vir:59 474 LLDDEGGDDDLEEDDPNAGAAESGGA 499 (503) T ss_pred ccCccCCCCCCCcCCCCCCcccCCCC Confidence 00000000011111111111112222 No 103 >protein:vir:105292 Length: 478 # NCBI annotation: putative phage portal protein # Family: family:all:125 # MgeID: mge:1679 # MgeName: PH15 # Cross-refs: genbank:acc:YP_950666;genbank:gi:119967836;genbank:GeneID:4643171 Probab=98.26 E-value=7.3e-07 Score=54.22 Aligned_cols=207 Identities=17% Similarity=0.130 Sum_probs=118.7 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHhccceeechhHHHhhcCCchHHHHHHHHHHHHHhcCchheeeeecC---Ccceee Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRRKQQAVWKVKGLAEMCDDDDAQYAARLRLAQVDDNSGVGRAIGIDAE---TEEYDV 77 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~~~~~v~k~~~l~~~~~~~~~e~~~~~r~~~~~~~r~~~~~~~iD~~---~e~~~~ 77 (237) .++.+.+.+.+++.+....+.-+..+....+.+.|... .. .......+. ..+++.+.++ +-+|-. T Consensus 247 d~~~v~~liDa~~~~~S~~~~~~~~~~~p~~~~~g~~~---~~--~~~~~~~~~-------~~~~~~~~~~~~~~~~~l~ 314 (478) T protein:vir:10 247 DLFMYKTIIDALDKRLSDTQNTFDESVELIYILKGYEG---ED--MKDFMHNLK-------YYKAISVAGESGSGVDTIK 314 (478) T ss_pred cHHHHHHHHHHHHHHHHHHHHHHHHhhCceeeeecCCc---cc--cchhhhhhh-------hcceEEecCCCCCcceEEe Confidence 67788899999999999999888888888777766411 11 011111111 1234444322 233556 Q ss_pred eecCcCCHHHHHHHHHHHHhhhhcCceeeeeccCcccccccchhHHHHHHHHH--HHHHHHhhhHHHHHHHHHhhcC--- Q lcl|NC_019725. 78 LNSDISGVPEFLSSKMDRIVSLSGIHEIIIKNKNVGGVSASQNTALETFYKLV--DRKREEDYRPLLEFLLPFIVEE--- 152 (237) Q Consensus 78 ~~~~lsGl~dl~~~~~~~iaa~s~iP~t~L~G~sp~GlnatGe~D~~nyyd~I--~~~Qe~~l~p~l~~l~~~i~~s--- 152 (237) .+.+..++...++.+.+.|...+++|-.-.-+- +| |.||..=...|.... ....+..+...|++++.+++.- T Consensus 315 ~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~--~~-n~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~g~ 391 (478) T protein:vir:10 315 VEVPIDSVKEYTKMLRDYIIEFGQGVDFQQDKF--GN-SPSGIALKFMYSNLDLKANKLKNKTLTALQELLQYIIDFYRL 391 (478) T ss_pred ecCChHHHHHHHHHHHHHHHHHhCccccCcccc--cc-ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCC Confidence 677889999999999999999999995432221 22 557765444554443 3455667889999988887632 Q ss_pred ----CCceeEeCCCCCCCHHHHHHHHHHHHHHHHHHHhCCCCCHHHHHHHH-------------HhhccccccCCCCCCC Q lcl|NC_019725. 153 ----EEWSIEFEPLSVPSKKEESEITKNNVESVTKAITEQIIDLEEARDTL-------------RSIAPEFKLKDGNNIN 215 (237) Q Consensus 153 ----~~~~~~f~pL~~~seke~Aei~~~~A~a~~~~~~~g~i~~~e~r~~l-------------~~~~~~~g~~~~~~~~ 215 (237) .++++.|+|-...++++.|++..+. +|+||.+.+.+.| +....+..-. -.+.. T Consensus 392 ~~~~~~i~i~f~~~~p~d~~e~a~~~~kl---------~g~iS~et~~~~l~~v~D~~~E~~ri~~E~~~~~~~-~~~~~ 461 (478) T protein:vir:10 392 DVKVQDIEITFNFNVMVNELENSQIAMNS---------TGLLSKETILSNHAWVEDPVAEMERIEQENIELNQQ-LPDIE 461 (478) T ss_pred CcccccceEEecCCCCCCHHHHHHHHHHH---------hCCCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHhh-ccccc Confidence 4788999999999999988875543 3555555444433 2111100000 00000 Q ss_pred hhccccCCCCCCCCCCCcCcC Q lcl|NC_019725. 216 IREPEETTEPEPGLGEKLEDE 236 (237) Q Consensus 216 ~~~~e~~~e~~~~~~~~~~~e 236 (237) .......+..++..++| T Consensus 462 ----~~~~~~~~~~~~~~~~~ 478 (478) T protein:vir:10 462 ----EGLNGEQQRQSENNQPE 478 (478) T ss_pred ----cccCCCCCCCCCCCCCC Confidence 00000111111111111 No 104 >protein:vir:99312 Length: 563 # NCBI annotation: putative portal protein # Family: family:all:2446 # MgeID: mge:1655 # MgeName: K # Cross-refs: genbank:acc:YP_024471;genbank:gi:48696430;genbank:GeneID:2948040 Probab=98.25 E-value=4.4e-07 Score=55.42 Aligned_cols=224 Identities=15% Similarity=0.192 Sum_probs=103.3 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHhcc--ceeechhHHHhhcCCchHHHHHHHHHHHHHhcC--chh-eeeeecCCcce Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRRKQQ--AVWKVKGLAEMCDDDDAQYAARLRLAQVDDNSG--VGR-AIGIDAETEEY 75 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~~~~--~v~k~~~l~~~~~~~~~e~~~~~r~~~~~~~r~--~~~-~~~iD~~~e~~ 75 (237) -++.+.+.|.....+......+...... .++++++-. .+ +.+....+++.++ ..+++ |.+ +.++-.++-+| T Consensus 260 pi~~a~~~i~~~~~~~~~~~~~f~ng~~p~giL~~~~~~-~l-s~e~~~~~~~~~~--~~~~G~~nagk~~~vl~~G~~~ 335 (563) T protein:vir:99 260 EVEIAMKEFIAYNNTESFNDRFFSHGGTTRGILQIRSDQ-QQ-SQHALENFKREWK--SSLSGINGSWQIPVVMADDIKF 335 (563) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHccCCCceEEEeCCCC-CC-CHHHHHHHHHHHH--HHhccccccccceEEcCCCceE Confidence 4566777777777777777776665332 235544211 11 1222233444443 23333 333 22433345678 Q ss_pred eeeecCcCC--HHHHHHHHHHHHhhhhcCceeeeeccCcccccccchhHH---HHHHHHHHHHHHHhhhHHHHHHHHHhh Q lcl|NC_019725. 76 DVLNSDISG--VPEFLSSKMDRIVSLSGIHEIIIKNKNVGGVSASQNTAL---ETFYKLVDRKREEDYRPLLEFLLPFIV 150 (237) Q Consensus 76 ~~~~~~lsG--l~dl~~~~~~~iaa~s~iP~t~L~G~sp~GlnatGe~D~---~nyyd~I~~~Qe~~l~p~l~~l~~~i~ 150 (237) ..++.+... +-+........||.+-|||-.+|--...++.+++..+-- .|.-..-..+-+..|.|.+.++-..|- T Consensus 336 ~~l~~~~~d~qfle~~~~~~~~Ia~afgVPp~~lG~~~~~~~~~~~~~ss~~~sn~e~~~~~f~~~tL~P~l~~ie~~ln 415 (563) T protein:vir:99 336 VNMTPTANDMQFEKWLNYLINIISALYGIDPAEIGFPNRGGATGSKGGSTLNEADPGKKQQQSQNKGLQPLLRFIEDLVN 415 (563) T ss_pred EeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHccccccccccccccccchhhccHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 777765443 345666788999999999988873333333322221111 111111122333456777666654432 Q ss_pred ----cC--CCceeEeCCCCCCCHHHHHHHHHHHHHHHHHHHhCCCCCHHHHHHHHHhhccccccC---CCCCCC------ Q lcl|NC_019725. 151 ----EE--EEWSIEFEPLSVPSKKEESEITKNNVESVTKAITEQIIDLEEARDTLRSIAPEFKLK---DGNNIN------ 215 (237) Q Consensus 151 ----~s--~~~~~~f~pL~~~seke~Aei~~~~A~a~~~~~~~g~i~~~e~r~~l~~~~~~~g~~---~~~~~~------ 215 (237) .. ..+.|+| ...+.+.+++... ...++.+|+++++|+|+.+- ..+..|-. ...++. T Consensus 416 ~~L~~~~~~~~~~~f---~r~D~~~~~e~~~-----~~~~~~~G~lT~NE~R~~~g-l~Pi~gGD~~~~~~~~~~~~~~~ 486 (563) T protein:vir:99 416 RHIISEYGDKYTFQF---VGGDTKSATDKLN-----ILKLETQIFKTVNEAREEQG-KKPIEGGDIILDASFLQGTAQLQ 486 (563) T ss_pred hhhchhcccccEEEe---ccCCHHHHHHHHH-----HHHHhcCCccCHHHHHHHhC-CCCCCCcceeecccccccccccc Confidence 22 2445554 3445555544322 23468899999999999762 23322210 000000 Q ss_pred ----hh-cc-----------ccCCCCCCCC---CCCcCcCC Q lcl|NC_019725. 216 ----IR-EP-----------EETTEPEPGL---GEKLEDEN 237 (237) Q Consensus 216 ----~~-~~-----------e~~~e~~~~~---~~~~~~e~ 237 (237) .+ +. ...+.++|.. .++.+++. T Consensus 487 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 527 (563) T protein:vir:99 487 QDKQYNDGKQKERLQMMMSLLEGDNDDSEEGQSTDSSNDDK 527 (563) T ss_pred cccCCCccccchhhhhcccccCCCCCCCCCCCCCCCCCCcc Confidence 00 00 0001111111 11111111 No 105 >protein:vir:95599 Length: 563 # NCBI annotation: ORF014 # Family: family:all:2446 # MgeID: mge:1577 # MgeName: G1 # Cross-refs: genbank:acc:YP_240900;genbank:gi:66394963;genbank:GeneID:5132540 Probab=98.25 E-value=4.4e-07 Score=55.42 Aligned_cols=224 Identities=15% Similarity=0.192 Sum_probs=103.3 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHhcc--ceeechhHHHhhcCCchHHHHHHHHHHHHHhcC--chh-eeeeecCCcce Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRRKQQ--AVWKVKGLAEMCDDDDAQYAARLRLAQVDDNSG--VGR-AIGIDAETEEY 75 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~~~~--~v~k~~~l~~~~~~~~~e~~~~~r~~~~~~~r~--~~~-~~~iD~~~e~~ 75 (237) -++.+.+.|.....+......+...... .++++++-. .+ +.+....+++.++ ..+++ |.+ +.++-.++-+| T Consensus 260 pi~~a~~~i~~~~~~~~~~~~~f~ng~~p~giL~~~~~~-~l-s~e~~~~~~~~~~--~~~~G~~nagk~~~vl~~G~~~ 335 (563) T protein:vir:95 260 EVEIAMKEFIAYNNTESFNDRFFSHGGTTRGILQIRSDQ-QQ-SQHALENFKREWK--SSLSGINGSWQIPVVMADDIKF 335 (563) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHccCCCceEEEeCCCC-CC-CHHHHHHHHHHHH--HHhccccccccceEEcCCCceE Confidence 4566777777777777777776665332 235544211 11 1222233444443 23333 333 22433345678 Q ss_pred eeeecCcCC--HHHHHHHHHHHHhhhhcCceeeeeccCcccccccchhHH---HHHHHHHHHHHHHhhhHHHHHHHHHhh Q lcl|NC_019725. 76 DVLNSDISG--VPEFLSSKMDRIVSLSGIHEIIIKNKNVGGVSASQNTAL---ETFYKLVDRKREEDYRPLLEFLLPFIV 150 (237) Q Consensus 76 ~~~~~~lsG--l~dl~~~~~~~iaa~s~iP~t~L~G~sp~GlnatGe~D~---~nyyd~I~~~Qe~~l~p~l~~l~~~i~ 150 (237) ..++.+... +-+........||.+-|||-.+|--...++.+++..+-- .|.-..-..+-+..|.|.+.++-..|- T Consensus 336 ~~l~~~~~d~qfle~~~~~~~~Ia~afgVPp~~lG~~~~~~~~~~~~~ss~~~sn~e~~~~~f~~~tL~P~l~~ie~~ln 415 (563) T protein:vir:95 336 VNMTPTANDMQFEKWLNYLINIISALYGIDPAEIGFPNRGGATGSKGGSTLNEADPGKKQQQSQNKGLQPLLRFIEDLVN 415 (563) T ss_pred EeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHccccccccccccccccchhhccHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 777765443 345666788999999999988873333333322221111 111111122333456777666654432 Q ss_pred ----cC--CCceeEeCCCCCCCHHHHHHHHHHHHHHHHHHHhCCCCCHHHHHHHHHhhccccccC---CCCCCC------ Q lcl|NC_019725. 151 ----EE--EEWSIEFEPLSVPSKKEESEITKNNVESVTKAITEQIIDLEEARDTLRSIAPEFKLK---DGNNIN------ 215 (237) Q Consensus 151 ----~s--~~~~~~f~pL~~~seke~Aei~~~~A~a~~~~~~~g~i~~~e~r~~l~~~~~~~g~~---~~~~~~------ 215 (237) .. ..+.|+| ...+.+.+++... ...++.+|+++++|+|+.+- ..+..|-. ...++. T Consensus 416 ~~L~~~~~~~~~~~f---~r~D~~~~~e~~~-----~~~~~~~G~lT~NE~R~~~g-l~Pi~gGD~~~~~~~~~~~~~~~ 486 (563) T protein:vir:95 416 RHIISEYGDKYTFQF---VGGDTKSATDKLN-----ILKLETQIFKTVNEAREEQG-KKPIEGGDIILDASFLQGTAQLQ 486 (563) T ss_pred hhhchhcccccEEEe---ccCCHHHHHHHHH-----HHHHhcCCccCHHHHHHHhC-CCCCCCcceeecccccccccccc Confidence 22 2445554 3445555544322 23468899999999999762 23322210 000000 Q ss_pred ----hh-cc-----------ccCCCCCCCC---CCCcCcCC Q lcl|NC_019725. 216 ----IR-EP-----------EETTEPEPGL---GEKLEDEN 237 (237) Q Consensus 216 ----~~-~~-----------e~~~e~~~~~---~~~~~~e~ 237 (237) .+ +. ...+.++|.. .++.+++. T Consensus 487 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 527 (563) T protein:vir:95 487 QDKQYNDGKQKERLQMMMSLLEGDNDDSEEGQSTDSSNDDK 527 (563) T ss_pred cccCCCccccchhhhhcccccCCCCCCCCCCCCCCCCCCcc Confidence 00 00 0001111111 11111111 No 106 >protein:vir:78537 Length: 480 # NCBI annotation: gp11 # Family: family:all:524 # MgeID: mge:1853 # MgeName: U2 # Cross-refs: genbank:acc:YP_001491582;genbank:gi:157786405;genbank:GeneID:5625689 Probab=98.25 E-value=1.6e-07 Score=57.85 Aligned_cols=220 Identities=11% Similarity=0.126 Sum_probs=112.5 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHhccceeechhHHH-hhcCCchHHHHHHHHHHHHHhcCchheeeeecCCcceeee- Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRRKQQAVWKVKGLAE-MCDDDDAQYAARLRLAQVDDNSGVGRAIGIDAETEEYDVL- 78 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~~~~~v~k~~~l~~-~~~~~~~e~~~~~r~~~~~~~r~~~~~~~iD~~~e~~~~~- 78 (237) +-+.|.+.+.+++++......-+..+....+.+.|... ......... .+..+ ...++.+++++-+|-++ T Consensus 224 i~~~i~~l~Da~~~~~s~~~~~~~~~a~p~~~i~G~~~~~~~~~~~~~-------~~~~~--~~~~~~~~~~~~~~~~~~ 294 (480) T protein:vir:78 224 ISPELRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDELTNDGENT-------TLDIY--YGRILTLASEAAKISEFK 294 (480) T ss_pred hhHHHHHHHHHHHHHHHHHHHHHHhhcchhhhhhCCCccccccccccc-------hhhhh--hhhhccCCCCCceEEecC Confidence 33457777888888887776666655544444433211 011111111 11111 12223333332334333 Q ss_pred ecCcCCHHHHHHHHHHHHhhhhcCceeeeeccCcccccccchhHHHHHHHHH--HHHHHHhhhHHHHHHHHHhhcC---- Q lcl|NC_019725. 79 NSDISGVPEFLSSKMDRIVSLSGIHEIIIKNKNVGGVSASQNTALETFYKLV--DRKREEDYRPLLEFLLPFIVEE---- 152 (237) Q Consensus 79 ~~~lsGl~dl~~~~~~~iaa~s~iP~t~L~G~sp~GlnatGe~D~~nyyd~I--~~~Qe~~l~p~l~~l~~~i~~s---- 152 (237) ..++.+..+.+.....++++.+++|...|-| ++.+ ++||+.=...|...+ ...++..+++.|.+++.+++.- T Consensus 295 ~~~~~~~~~~l~~~i~~~~~~~~~p~~~fg~-~~~n-~~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~rl~~~~~~~~ 372 (480) T protein:vir:78 295 AAELRNFAEEMEVFRKEAASITGLPPQYLSS-SSEN-PASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGRE 372 (480) T ss_pred ccCHHHHHHHHHHHHHHHhcccCCCHHHhcc-ccCc-hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCC Confidence 2355677788888899999999999887744 3222 246654443444333 3445566899999998887632 Q ss_pred -----CCceeEeCCCCCCCHHHHHHHHHHHHHHH-------HHHHhCCCCCHHHHHH--HHHhhccc---cccCCCCCCC Q lcl|NC_019725. 153 -----EEWSIEFEPLSVPSKKEESEITKNNVESV-------TKAITEQIIDLEEARD--TLRSIAPE---FKLKDGNNIN 215 (237) Q Consensus 153 -----~~~~~~f~pL~~~seke~Aei~~~~A~a~-------~~~~~~g~i~~~e~r~--~l~~~~~~---~g~~~~~~~~ 215 (237) .++.+.|.|-..+|..+.|+...+-+++. ..+-..|+++. ++.+ .+++...+ ..+. .+. T Consensus 373 ~~~~~~~i~v~w~~~~~~s~~~~ad~~~kl~~~g~~~~s~et~~~~lg~~~d-~~~e~~~~~~~~~~~~~~~~~---~~~ 448 (480) T protein:vir:78 373 VTEEYTRLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTAT-QREQMRDWDKQETEDMIDTLY---STT 448 (480) T ss_pred ccccceeeeEEecCCCCCCHHHHHHHHHHHHHhcccCCCHHHHHhcCCCCHh-HHHHHHHHHHHHHHHHHHHhh---ccc Confidence 25789999999999999888766655432 22334565432 2222 22211111 1111 111 Q ss_pred hhccccCCCCCCCCCCCcCcCC Q lcl|NC_019725. 216 IREPEETTEPEPGLGEKLEDEN 237 (237) Q Consensus 216 ~~~~e~~~e~~~~~~~~~~~e~ 237 (237) ....+.. ++|+.|+..+..+ T Consensus 449 ~~~~~~~--~~~~~~~~~~~~~ 468 (480) T protein:vir:78 449 KAQADAT--PKPTVTETKTETQ 468 (480) T ss_pred cCCCccc--cCCCCCCCCCccC Confidence 1122222 2223332222222 No 107 >protein:vir:8317 Length: 409 # NCBI annotation: gp34 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:154 # MgeName: Corndog # Cross-refs: genbank:acc:NP_817885;genbank:gi:29566318;genbank:GeneID:1259513 Probab=98.24 E-value=2.9e-07 Score=56.47 Aligned_cols=194 Identities=11% Similarity=0.036 Sum_probs=103.6 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHh--ccceeechhHHHhhcCCchHHHHHHHHHHHHHhcCchh-eeeeecCCcceee Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRRK--QQAVWKVKGLAEMCDDDDAQYAARLRLAQVDDNSGVGR-AIGIDAETEEYDV 77 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~~--~~~v~k~~~l~~~~~~~~~e~~~~~r~~~~~~~r~~~~-~~~iD~~~e~~~~ 77 (237) -++.+...+.....+......+.... --.++++++ .++.. ....+++++. ..+..|.+ .+++.+..+.++. T Consensus 203 pi~~~~~~i~~~~a~~~~~~~~f~nga~p~gil~~~~---~ls~e-~~~~~~~~~~--~~~~~nag~~~il~~g~~~~~~ 276 (409) T protein:vir:83 203 PLESAAPRQVVIGLLQKYVQNLAETGGVPLYWLGVER---RLSET-EAVDLMDRWI--ESRSKYAGHPALVTGGATLNQA 276 (409) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhcCCCcceEeecCC---CCCHH-HHHHHHHHHH--HhhCCccCccceecCCcccccc Confidence 47777788877777777777766542 223455543 22222 2234555543 23333433 4566554333333 Q ss_pred eecCcCC--HHHHHHHHHHHHhhhhcCceeeeeccCccccccc---chhHHHHHHHHHHHHHHHhhhHHHHHHHHHhh-- Q lcl|NC_019725. 78 LNSDISG--VPEFLSSKMDRIVSLSGIHEIIIKNKNVGGVSAS---QNTALETFYKLVDRKREEDYRPLLEFLLPFIV-- 150 (237) Q Consensus 78 ~~~~lsG--l~dl~~~~~~~iaa~s~iP~t~L~G~sp~Glnat---Ge~D~~nyyd~I~~~Qe~~l~p~l~~l~~~i~-- 150 (237) .+.+... +-+........||.+-|||- .|+|....+=++| -|.-...|| +..|.|.+.++-..+- T Consensus 277 ~~~s~~d~q~le~r~~~~~eIa~~fgVPp-~llg~~~~~~~~tysn~eq~~~~f~-------~~tL~P~~~~ie~~l~~~ 348 (409) T protein:vir:83 277 KSMSAQDLSLMELTQFNEARIAILLGVPP-FLVGLPGATGSLTYSNIEQLFSFHD-------RSSLRPKATAVMAALDRW 348 (409) T ss_pred cCCCHHHHHHHHHHHhhHHHHHHHhCCCH-HHccCCCCccccccccHHHHHHHHH-------HHHHHHHHHHHHHHHHHh Confidence 4443322 12233455778999999995 6667543221112 133444454 3457777776655443 Q ss_pred -cCCCceeEe--CCCCCCCHHHHHHHHHHHHHHHHHHHhCCCCCHHHHHHHHHhhccccccCCCCCCChhccccCCCCCC Q lcl|NC_019725. 151 -EEEEWSIEF--EPLSVPSKKEESEITKNNVESVTKAITEQIIDLEEARDTLRSIAPEFKLKDGNNINIREPEETTEPEP 227 (237) Q Consensus 151 -~s~~~~~~f--~pL~~~seke~Aei~~~~A~a~~~~~~~g~i~~~e~r~~l~~~~~~~g~~~~~~~~~~~~e~~~e~~~ 227 (237) ...+..|+| ..|...+.++ ++++++.++++|+++++|+|+.+ ...+..|- ++. ++.+. T Consensus 349 Ll~~~~~~~f~~~~llr~d~~~-------r~~~~~~~~~~G~lT~NE~R~~~-glpp~~gg---d~l--------~~~gv 409 (409) T protein:vir:83 349 ALPSPQHLELNRDDYTRPSLVE-------RATAYKIMIEAGVMEPNEARAME-RLHSEAAA---VRL--------SGGGV 409 (409) T ss_pred hCCCCcEEEeehhhhhccCHHH-------HHHHHHHHHhCCCcCHHHHHHHh-CCCCCCCC---ccc--------CCCCC Confidence 223444555 4555555544 56788999999999999999865 22222221 111 11111 No 108 >protein:vir:4898 Length: 502 # NCBI annotation: gp502 # Family: family:all:125 # MgeID: mge:107 # MgeName: Sfi11 # Cross-refs: genbank:acc:NP_056676;genbank:gi:9635011;genbank:GeneID:1262662 Probab=98.24 E-value=4e-07 Score=55.63 Aligned_cols=227 Identities=15% Similarity=0.098 Sum_probs=121.4 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHhccceeechhHHHhhcCCchHHHHHHHHHHHHHhcCchheeeeec--CC--ccee Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRRKQQAVWKVKGLAEMCDDDDAQYAARLRLAQVDDNSGVGRAIGIDA--ET--EEYD 76 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~~~~~v~k~~~l~~~~~~~~~e~~~~~r~~~~~~~r~~~~~~~iD~--~~--e~~~ 76 (237) .++.+.+.+.+++.+....+.-+..+...++.+.|......+..+. -.++..++... . ....++ ++ -+|- T Consensus 251 d~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~~~~--~~~~~~~~~~~-~---~~~~~~~~~~~d~~~l 324 (502) T protein:vir:48 251 DYETELYLIDLYDSAESDTANHMSDMADAILAIYGDLALPQGMQAS--DMKRTRLMQLK-P---PKSADGKEGTVKAEYL 324 (502) T ss_pred chhhhHHHHHHHHHHHHHHHHHHHHhcCceeeeecCcccccccchh--hhhhcceeecc-c---cccccccccCcceeEe Confidence 6888999999999999999999998888888877643211111111 11111111100 0 000011 11 2345 Q ss_pred eeecCcCCHHHHHHHHHHHHhhhhcCceeeeeccCcccccccchhHHHHHHHHH--HHHHHHhhhHHHHHHHHHhhcC-- Q lcl|NC_019725. 77 VLNSDISGVPEFLSSKMDRIVSLSGIHEIIIKNKNVGGVSASQNTALETFYKLV--DRKREEDYRPLLEFLLPFIVEE-- 152 (237) Q Consensus 77 ~~~~~lsGl~dl~~~~~~~iaa~s~iP~t~L~G~sp~GlnatGe~D~~nyyd~I--~~~Qe~~l~p~l~~l~~~i~~s-- 152 (237) ..+.+..++...++.+.+.|...+++|-.-+-+ .-| |.||+.=..-|.... ...++..++..|++++.+++.- T Consensus 325 ~~~~~~~~~~~~~~~L~~~I~~~s~~p~~~~~~-~~~--n~Sg~Alk~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~ 401 (502) T protein:vir:48 325 TKSYDVSGAEAYKTRLNKDIHVFTNTPDMSDNH-FSG--NASGEALKYKLFGLDQDRVDTQSQFTQGLKRRYRLAARIGS 401 (502) T ss_pred eecCCHHHHHHHHHHHHHHHHHHhCCCCcCccc-ccc--CchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh Confidence 566777899999999999999999999654322 212 446654333333322 3445577899999998887521 Q ss_pred ----------CCceeEeCCCCCCCHHHHHHHHHHHHHHH---HHHHhCCCCC-HHHHHHHHHhhccccccCCCCCCChhc Q lcl|NC_019725. 153 ----------EEWSIEFEPLSVPSKKEESEITKNNVESV---TKAITEQIID-LEEARDTLRSIAPEFKLKDGNNINIRE 218 (237) Q Consensus 153 ----------~~~~~~f~pL~~~seke~Aei~~~~A~a~---~~~~~~g~i~-~~e~r~~l~~~~~~~g~~~~~~~~~~~ 218 (237) .++.+.|+|-...+.++.|++..+.+..+ ..+-..+.++ +++-.+++++...+....+......++ T Consensus 402 ~~~~~~~~d~~~i~i~f~~~~p~d~~e~a~~~~kl~g~iS~et~l~~l~~v~D~~~E~~ri~~E~~~~~~~~~~~~~~~~ 481 (502) T protein:vir:48 402 LVNEFKDFDESRLKITFTPNLPKSLYEQVSILNDLGGQVSQETALSLSGLVENPTEELDKINEESSKIDFKGYPSYFYDN 481 (502) T ss_pred hcccccccccccceEEeCCCCCcCHHHHHHHHHHHhccCcHHHHHHhCCCCCCHHHHHHHHHHHHHhhhhhccccccccc Confidence 25789999999999999998876654322 1122233332 333344443211110011000000101 Q ss_pred cccCCCCCCCCCCCcCcCC Q lcl|NC_019725. 219 PEETTEPEPGLGEKLEDEN 237 (237) Q Consensus 219 ~e~~~e~~~~~~~~~~~e~ 237 (237) ... ...+...+.+.+.|| T Consensus 482 ~~~-~~d~~~e~~~~~~~~ 499 (502) T protein:vir:48 482 VGK-YTDEVKETHTDDFER 499 (502) T ss_pred ccc-cCCCccCCCCcCcCC Confidence 000 111112233333333 No 109 >protein:vir:4828 Length: 382 # NCBI annotation: ORF24 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:105 # MgeName: 7201 # Cross-refs: genbank:acc:NP_038325;genbank:gi:9634651;genbank:GeneID:1262630 Probab=98.23 E-value=7.4e-07 Score=54.18 Aligned_cols=204 Identities=12% Similarity=0.113 Sum_probs=107.0 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHhcc--ceeechhHHHhhcCCchHHHHHHHHHHHHHhcCchheeeeecCCcceeee Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRRKQQ--AVWKVKGLAEMCDDDDAQYAARLRLAQVDDNSGVGRAIGIDAETEEYDVL 78 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~~~~--~v~k~~~l~~~~~~~~~e~~~~~r~~~~~~~r~~~~~~~iD~~~e~~~~~ 78 (237) .++.+.+.|.....+......+...... .++|+++ .+.... ...+.+ +.....++..+.+++++ +-+|+.+ T Consensus 173 ~l~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~~---~~~~e~-~~~~~~--~~~~~~~n~g~~~vl~~-g~~~~~l 245 (382) T protein:vir:48 173 PLMALSRELDIQKASGNLTINSLKNALNANGILKIKG---GGLLDF-KTKLSR--SRQAMKQMQGGPLVLDD-LEDFTPL 245 (382) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCC---CCChHH-HHHHHH--HHHhhccCCCCeeEcCC-CceEEEc Confidence 6778888888888888888887776544 4666653 111111 112222 22234444455566765 5788888 Q ss_pred ecCcCC--HHHHHHHHHHHHhhhhcCceeeeeccCcccccccchhHHHHHHHHHHHHHHHhhhHHHHHHHHHhhcC--CC Q lcl|NC_019725. 79 NSDISG--VPEFLSSKMDRIVSLSGIHEIIIKNKNVGGVSASQNTALETFYKLVDRKREEDYRPLLEFLLPFIVEE--EE 154 (237) Q Consensus 79 ~~~lsG--l~dl~~~~~~~iaa~s~iP~t~L~G~sp~GlnatGe~D~~nyyd~I~~~Qe~~l~p~l~~l~~~i~~s--~~ 154 (237) +.+... +-+........||.+-|||-..| |.+-.+=| + +...+.|| +..|.|.+..+-..+-.. .+ T Consensus 246 ~~~~~d~q~~e~~~~~~~~Ia~afgVp~~~l-g~~~~~~~-~-~~~~~~~~-------~~~l~p~~~~i~~~l~~~l~~~ 315 (382) T protein:vir:48 246 EIKSNVSQLLKQADWTTGQFAKVYGIPDNVV-GGQGDQQS-S-LEMSSDLY-------SKAVSRYLRPFLSELSQKLSCD 315 (382) T ss_pred cCChhHHHHHHHHHHHHHHHHHHhCCCHHHh-CCCCCccc-H-HHHHHHHH-------HHHHHHHHHHHHHHHHHHhcCh Confidence 766543 33566777899999999997766 43211111 1 11223333 455777766665544321 22 Q ss_pred ceeEeCCCCCCCHHHHHHHHHHHHHHHHHHHhCCCCCHHHHHHHHHhhccccccCCCCCCChhccccCCCCCCCCCCCcC Q lcl|NC_019725. 155 WSIEFEPLSVPSKKEESEITKNNVESVTKAITEQIIDLEEARDTLRSIAPEFKLKDGNNINIREPEETTEPEPGLGEKLE 234 (237) Q Consensus 155 ~~~~f~pL~~~seke~Aei~~~~A~a~~~~~~~g~i~~~e~r~~l~~~~~~~g~~~~~~~~~~~~e~~~e~~~~~~~~~~ 234 (237) +.+...+...++... .......++.+|+++++|+|+.|.. .|+.+.. + ...+ ...+.-.+|++ + T Consensus 316 ~~~~~~~~~~~~~~~-------~~~~~~~l~~~g~~t~~e~r~~l~~----~g~~~~~-~--~~~~-~~~~~~~GGd~-~ 379 (382) T protein:vir:48 316 VDADIFPAVDPTGSN-------YISRINSLVKTGTLAQNQGLYILQQ----AEILPKE-L--PNGE-NPNSTLKGGEE-D 379 (382) T ss_pred hhhhhhhhhccchhH-------HHHHHHHHhhcCccCHHHHHHHHhh----CCCCCcc-h--hhhh-cCCCCCCCCCC-C Confidence 222222222222222 1223456899999999999998864 2332211 1 1111 11122233333 3 Q ss_pred cCC Q lcl|NC_019725. 235 DEN 237 (237) Q Consensus 235 ~e~ 237 (237) ++| T Consensus 380 ~~~ 382 (382) T protein:vir:48 380 GQD 382 (382) T ss_pred CCC Confidence 333 No 110 >protein:vir:1236 Length: 483 # NCBI annotation: similar to phage Spp1 gp6 (portal protein) # Family: family:all:125 # MgeID: mge:25 # MgeName: phi ETA # Cross-refs: genbank:acc:NP_510935;genbank:gi:17426269;genbank:GeneID:927380 Probab=98.23 E-value=9.4e-07 Score=53.63 Aligned_cols=217 Identities=13% Similarity=0.128 Sum_probs=118.0 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHhccceeechhHHHhhcCCchHHHHHHHHHHHHHhcCchheeeeecCC-cceeeee Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRRKQQAVWKVKGLAEMCDDDDAQYAARLRLAQVDDNSGVGRAIGIDAET-EEYDVLN 79 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~~~~~v~k~~~l~~~~~~~~~e~~~~~r~~~~~~~r~~~~~~~iD~~~-e~~~~~~ 79 (237) .++.+.+.+.+++.+....+.-+..+....+.+.|... .+. ......+. ..+++.++.++ -+|-..+ T Consensus 252 d~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~g~~~----~~~-~~~~~~~~-------~~~~~~~~~~~~~~~l~~~ 319 (483) T protein:vir:12 252 DIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLTNYDD----QEL-PEFKRLLR-------YYGAIKVSDNGGVDTIQVE 319 (483) T ss_pred chhhHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCc----ccc-hhHHHhhh-------hccccccCCCCcceEEeec Confidence 67888889999999988888888887777776665321 110 11111111 22334343322 3355567 Q ss_pred cCcCCHHHHHHHHHHHHhhhhcCceeeeeccCcccccccchhHHHHHHHHH--HHHHHHhhhHHHHHHHHHhhcC----- Q lcl|NC_019725. 80 SDISGVPEFLSSKMDRIVSLSGIHEIIIKNKNVGGVSASQNTALETFYKLV--DRKREEDYRPLLEFLLPFIVEE----- 152 (237) Q Consensus 80 ~~lsGl~dl~~~~~~~iaa~s~iP~t~L~G~sp~GlnatGe~D~~nyyd~I--~~~Qe~~l~p~l~~l~~~i~~s----- 152 (237) .+.+++...++.+.+.|...+++|-.-. +.. +| |.||..=..-|...+ ...++..+++.+++++++++.- T Consensus 320 ~~~~~~~~~~~~l~~~I~~~s~~p~~~~-~~~-~~-n~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~li~~~~~~~~ 396 (483) T protein:vir:12 320 VPVENSKKYLDELYQKIMLFGQAVDFSS-DKF-GS-APSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDIKG 396 (483) T ss_pred CCHHHHHHHHHHHHHHHHHHhCCCCCCc-ccc-cc-CcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCC Confidence 7888999999999999999999996433 211 22 456765333444443 3555567899999988887632 Q ss_pred --CCceeEeCCCCCCCHHHHHHHHHHHHHHHH---HHHhCCCC-CHHHHHHHHHhhccccccCCCCCCChhccccCC-CC Q lcl|NC_019725. 153 --EEWSIEFEPLSVPSKKEESEITKNNVESVT---KAITEQII-DLEEARDTLRSIAPEFKLKDGNNINIREPEETT-EP 225 (237) Q Consensus 153 --~~~~~~f~pL~~~seke~Aei~~~~A~a~~---~~~~~g~i-~~~e~r~~l~~~~~~~g~~~~~~~~~~~~e~~~-e~ 225 (237) .++++.|+|-...+.++.|++..+.+...+ .+-..+.+ ++++..+++++.-.+. ...-.+......+..+ +. T Consensus 397 ~~~~i~v~f~~~~p~~~~~~a~~~~kl~GiiS~et~~~~~~~v~d~~~E~~ri~~E~~~~-~~~~~~~~~~~~d~~~~~~ 475 (483) T protein:vir:12 397 EHKDVDISFNYNKVANTELQVQTAQQSMGIVSHETVLENHPFVEDLQAELERIEQEQMEY-NKQLPNLDDGGADGAQQQE 475 (483) T ss_pred ccceeeEEeCCCCCCCHHHHHHHHHHHhccCchHHHHHhCCCCCCHHHHHHHHHHHHHHH-HhhcccccccccCCcccCC Confidence 478899999999999999887666432111 11122222 2333333332211000 0000001011110000 01 Q ss_pred CCCCCCCcCcC Q lcl|NC_019725. 226 EPGLGEKLEDE 236 (237) Q Consensus 226 ~~~~~~~~~~e 236 (237) +++..+ .| T Consensus 476 ~~~~~e---~e 483 (483) T protein:vir:12 476 RSNNKE---SE 483 (483) T ss_pred CCCccc---CC Confidence 111111 11 No 111 >protein:vir:63755 Length: 547 # NCBI annotation: gp14 # Family: family:all:2446 # MgeID: mge:1517 # MgeName: P100 # Cross-refs: genbank:gi:82547619;genbank:GeneID:3783506 Probab=98.23 E-value=7.1e-07 Score=54.30 Aligned_cols=226 Identities=13% Similarity=0.156 Sum_probs=105.5 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHhcc--ceeechhHHHhhcCCchHHHHHHHHHHHHHhcCchhe-eeeecCCcceee Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRRKQQ--AVWKVKGLAEMCDDDDAQYAARLRLAQVDDNSGVGRA-IGIDAETEEYDV 77 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~~~~--~v~k~~~l~~~~~~~~~e~~~~~r~~~~~~~r~~~~~-~~iD~~~e~~~~ 77 (237) -++.+.+.|.....+......+...... .++++++-.. + +.+....+++++.-.-..-.|.+- .++..++=+|.. T Consensus 250 pi~~~~~~i~~~~~a~~~~~~~f~Ng~~p~giL~~~~~~~-l-s~e~~~~lk~~~~~~~~G~~nagk~~vl~~~g~~~~~ 327 (547) T protein:vir:63 250 ELEIALKQFIAHENTEAFNDRFFSHGGTTRGILQIKAAQQ-Q-SQHALEIFKREWKNSLSGINGSWQIPVVSAEDVKFVN 327 (547) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHcCCCcceEEEecCCCC-C-CHHHHHHHHHHHHHHhcCcccccccccccCCCceEEE Confidence 3677778888887777777776665432 2345443111 1 122222344444322222234443 345443345665 Q ss_pred eecCcCCH--HHHHHHHHHHHhhhhcCceeeeeccCcccccccchhH--HHHHHHHHHHHHHHhhhHHHHHHHHHhh--- Q lcl|NC_019725. 78 LNSDISGV--PEFLSSKMDRIVSLSGIHEIIIKNKNVGGVSASQNTA--LETFYKLVDRKREEDYRPLLEFLLPFIV--- 150 (237) Q Consensus 78 ~~~~lsGl--~dl~~~~~~~iaa~s~iP~t~L~G~sp~GlnatGe~D--~~nyyd~I~~~Qe~~l~p~l~~l~~~i~--- 150 (237) ++.+.... -+........||.+-|||-..|--..-+...+++.+. ..|.-.....+-+..|.|.+.++-..|- T Consensus 328 l~~~~~d~qfle~~~~~~~~Ia~afgVPP~~lG~~~~~~~~~~~~~s~t~sn~e~~~~~~~~~tL~P~~~~ie~~ln~~L 407 (547) T protein:vir:63 328 MTPSARDMEFEKWLNYLINVISALYGIDPAEINIPNNGGATGSKGGSLNEGNSAEKNQASKNKGLQPLLGFIEDFINKHI 407 (547) T ss_pred cCCChhHHHHHHHHHHHHHHHHHHhCCCHHHcCcccccccccccccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHhhc Confidence 55433222 2334556788999999999888533322221111000 1111111112223457777666644432 Q ss_pred -c--CCCceeEeCCCCCCCHHHHHHHHHHHHHHHHHHHhCCCCCHHHHHHHHHhhcc-cccc---CCCCCCC-------- Q lcl|NC_019725. 151 -E--EEEWSIEFEPLSVPSKKEESEITKNNVESVTKAITEQIIDLEEARDTLRSIAP-EFKL---KDGNNIN-------- 215 (237) Q Consensus 151 -~--s~~~~~~f~pL~~~seke~Aei~~~~A~a~~~~~~~g~i~~~e~r~~l~~~~~-~~g~---~~~~~~~-------- 215 (237) . ...+.|+|.-+...+..++++ +..++.+|+++++|+|+.+- ..+ ..|- +....+. T Consensus 408 ~~~~~~~~~~~f~~~~~~~~~~~~~--------~~~~~~~g~lT~NE~R~~~g-l~P~~egGD~~~~~~~~~~~~~~~~~ 478 (547) T protein:vir:63 408 VAEFGDKYTFQFVGGDIKSELESVK--------ILAEKAKVAMTVNEVRKELN-LPGDVIGGDIPLNGVIVQRIGQLMQQ 478 (547) T ss_pred ccccCCceEEEeeccccccHHHHHH--------HHHHHhCCCcCHHHHHHHhC-CCCCCCCCceeecccccccccccccc Confidence 1 247899999888888776654 22456789999999999763 222 1110 0000000 Q ss_pred ----hhccc--------cCC--CCCCCCCCCcCcCC Q lcl|NC_019725. 216 ----IREPE--------ETT--EPEPGLGEKLEDEN 237 (237) Q Consensus 216 ----~~~~e--------~~~--e~~~~~~~~~~~e~ 237 (237) .+..+ ... +..+...++...++ T Consensus 479 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 514 (547) T protein:vir:63 479 EQFEHEKQQSNLQMLQEQTGNRVSTDVEDIPDGKDT 514 (547) T ss_pred cCCccccchhhccccccccCCCCCCCCCCCCCCccc Confidence 00000 000 00011111111111 No 112 >protein:vir:96266 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1612 # MgeName: ROSA # Cross-refs: genbank:acc:YP_240308;genbank:gi:66395972;genbank:GeneID:5133343 Probab=98.22 E-value=4.3e-07 Score=55.47 Aligned_cols=208 Identities=15% Similarity=0.114 Sum_probs=119.5 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHhccceeechhHHHhhcCCchHHHHHHHHHHHHHhcCchheeeeecCC-cceeeee Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRRKQQAVWKVKGLAEMCDDDDAQYAARLRLAQVDDNSGVGRAIGIDAET-EEYDVLN 79 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~~~~~v~k~~~l~~~~~~~~~e~~~~~r~~~~~~~r~~~~~~~iD~~~-e~~~~~~ 79 (237) .++.+.+.+.+++.+....+.-+..+....+.+.|... . +.. .....+ + ..+++.++.++ -+|-+.+ T Consensus 244 d~e~v~~liDa~d~~~S~~~~~~~~~~~p~lv~~g~~~---~-~~~-~~~~~~------~-~~~~i~~~~~~~~~~l~~~ 311 (474) T protein:vir:96 244 DIWMYKSFVDAIDKRLSDVQNMFDESVELIYILRGYEG---E-DLS-EFMEGL------K-YYKAINVSSDGGVETIQVE 311 (474) T ss_pred chHHHHHHHHHHHHHHHHHHHHHHHhhcchhhhcCCCc---c-ccc-chhhhh------h-ccceeeccCCCceeEEecc Confidence 78889999999999999999888888877777665311 1 000 111111 1 12344454432 2355567 Q ss_pred cCcCCHHHHHHHHHHHHhhhhcCceeeeeccCcccccccchhHHHHHHHHH--HHHHHHhhhHHHHHHHHHhhcC----- Q lcl|NC_019725. 80 SDISGVPEFLSSKMDRIVSLSGIHEIIIKNKNVGGVSASQNTALETFYKLV--DRKREEDYRPLLEFLLPFIVEE----- 152 (237) Q Consensus 80 ~~lsGl~dl~~~~~~~iaa~s~iP~t~L~G~sp~GlnatGe~D~~nyyd~I--~~~Qe~~l~p~l~~l~~~i~~s----- 152 (237) .+.+++...++.+...|...+++|-.-.-+. +| |.||..=..-|.... ...++..++..|.+++.+++.- T Consensus 312 ~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~--~~-n~Sg~Alk~~~~~l~~k~~~~~~~~~~~l~~~~~~i~~~~g~~~ 388 (474) T protein:vir:96 312 VPVASTKEYLDMMRAYIVEFGQGVDFQTDKF--GS-ATSGIALKFLYTNLNLKANKLKNKANVALQELMQFILDFNKIKL 388 (474) T ss_pred CCHHHHHHHHHHHHHHHHHHhCCcCcccccc--cc-ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCc Confidence 7888999999999999999999996543221 22 456664333343333 3455677899999998887642 Q ss_pred --CCceeEeCCCCCCCHHHHHHHHHHHHHHHHHHHhCCCCCHHHHHHHH-------------HhhccccccCCCCCCChh Q lcl|NC_019725. 153 --EEWSIEFEPLSVPSKKEESEITKNNVESVTKAITEQIIDLEEARDTL-------------RSIAPEFKLKDGNNINIR 217 (237) Q Consensus 153 --~~~~~~f~pL~~~seke~Aei~~~~A~a~~~~~~~g~i~~~e~r~~l-------------~~~~~~~g~~~~~~~~~~ 217 (237) .++.+.|+|-...++.|.|++.. ++|+||.+.+...+ +....+.. .....+... T Consensus 389 d~~~i~i~f~~~~p~~~~e~a~~~~----------~~giiS~et~~~~lp~v~D~~~E~eri~~E~~~~~-~~~~~~~~~ 457 (474) T protein:vir:96 389 DAKEIEITFNFNVMVNDLEQSQIGA----------QSQYLSKETLVRHHPWVDDPKAELERLDEEQLELN-KQLPNLDDG 457 (474) T ss_pred ccceeeEEecCCCccCHHHHHHHHH----------HcCCCChHHHHHhCCCCCCHHHHHHHHHHHHHHHH-hhccccccc Confidence 46889999999999999887642 24777766655433 11000000 000000000 Q ss_pred ccccCCCCCCCCCCCcCcC Q lcl|NC_019725. 218 EPEETTEPEPGLGEKLEDE 236 (237) Q Consensus 218 ~~e~~~e~~~~~~~~~~~e 236 (237) . ...+.++..++..+.| T Consensus 458 ~--~~~~~~~~~~~~~e~~ 474 (474) T protein:vir:96 458 G--ADGAQQQQQSENNQSK 474 (474) T ss_pred c--CCCCCCcCCCCccccC Confidence 0 0011111112222222 No 113 >protein:vir:95899 Length: 474 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1588 # MgeName: 71 # Cross-refs: genbank:acc:YP_240382;genbank:gi:66396046;genbank:GeneID:5133410 Probab=98.22 E-value=4.3e-07 Score=55.47 Aligned_cols=208 Identities=15% Similarity=0.114 Sum_probs=119.5 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHhccceeechhHHHhhcCCchHHHHHHHHHHHHHhcCchheeeeecCC-cceeeee Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRRKQQAVWKVKGLAEMCDDDDAQYAARLRLAQVDDNSGVGRAIGIDAET-EEYDVLN 79 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~~~~~v~k~~~l~~~~~~~~~e~~~~~r~~~~~~~r~~~~~~~iD~~~-e~~~~~~ 79 (237) .++.+.+.+.+++.+....+.-+..+....+.+.|... . +.. .....+ + ..+++.++.++ -+|-+.+ T Consensus 244 d~e~v~~liDa~d~~~S~~~~~~~~~~~p~lv~~g~~~---~-~~~-~~~~~~------~-~~~~i~~~~~~~~~~l~~~ 311 (474) T protein:vir:95 244 DIWMYKSFVDAIDKRLSDVQNMFDESVELIYILRGYEG---E-DLS-EFMEGL------K-YYKAINVSSDGGVETIQVE 311 (474) T ss_pred chHHHHHHHHHHHHHHHHHHHHHHHhhcchhhhcCCCc---c-ccc-chhhhh------h-ccceeeccCCCceeEEecc Confidence 78889999999999999999888888877777665311 1 000 111111 1 12344454432 2355567 Q ss_pred cCcCCHHHHHHHHHHHHhhhhcCceeeeeccCcccccccchhHHHHHHHHH--HHHHHHhhhHHHHHHHHHhhcC----- Q lcl|NC_019725. 80 SDISGVPEFLSSKMDRIVSLSGIHEIIIKNKNVGGVSASQNTALETFYKLV--DRKREEDYRPLLEFLLPFIVEE----- 152 (237) Q Consensus 80 ~~lsGl~dl~~~~~~~iaa~s~iP~t~L~G~sp~GlnatGe~D~~nyyd~I--~~~Qe~~l~p~l~~l~~~i~~s----- 152 (237) .+.+++...++.+...|...+++|-.-.-+. +| |.||..=..-|.... ...++..++..|.+++.+++.- T Consensus 312 ~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~--~~-n~Sg~Alk~~~~~l~~k~~~~~~~~~~~l~~~~~~i~~~~g~~~ 388 (474) T protein:vir:95 312 VPVASTKEYLDMMRAYIVEFGQGVDFQTDKF--GS-ATSGIALKFLYTNLNLKANKLKNKANVALQELMQFILDFNKIKL 388 (474) T ss_pred CCHHHHHHHHHHHHHHHHHHhCCcCcccccc--cc-ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCc Confidence 7888999999999999999999996543221 22 456664333343333 3455677899999998887642 Q ss_pred --CCceeEeCCCCCCCHHHHHHHHHHHHHHHHHHHhCCCCCHHHHHHHH-------------HhhccccccCCCCCCChh Q lcl|NC_019725. 153 --EEWSIEFEPLSVPSKKEESEITKNNVESVTKAITEQIIDLEEARDTL-------------RSIAPEFKLKDGNNINIR 217 (237) Q Consensus 153 --~~~~~~f~pL~~~seke~Aei~~~~A~a~~~~~~~g~i~~~e~r~~l-------------~~~~~~~g~~~~~~~~~~ 217 (237) .++.+.|+|-...++.|.|++.. ++|+||.+.+...+ +....+.. .....+... T Consensus 389 d~~~i~i~f~~~~p~~~~e~a~~~~----------~~giiS~et~~~~lp~v~D~~~E~eri~~E~~~~~-~~~~~~~~~ 457 (474) T protein:vir:95 389 DAKEIEITFNFNVMVNDLEQSQIGA----------QSQYLSKETLVRHHPWVDDPKAELERLDEEQLELN-KQLPNLDDG 457 (474) T ss_pred ccceeeEEecCCCccCHHHHHHHHH----------HcCCCChHHHHHhCCCCCCHHHHHHHHHHHHHHHH-hhccccccc Confidence 46889999999999999887642 24777766655433 11000000 000000000 Q ss_pred ccccCCCCCCCCCCCcCcC Q lcl|NC_019725. 218 EPEETTEPEPGLGEKLEDE 236 (237) Q Consensus 218 ~~e~~~e~~~~~~~~~~~e 236 (237) . ...+.++..++..+.| T Consensus 458 ~--~~~~~~~~~~~~~e~~ 474 (474) T protein:vir:95 458 G--ADGAQQQQQSENNQSK 474 (474) T ss_pred c--CCCCCCcCCCCccccC Confidence 0 0011111112222222 No 114 >protein:vir:81218 Length: 423 # NCBI annotation: gp3, phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1893 # MgeName: BFK20 # Cross-refs: genbank:acc:YP_001456733;genbank:gi:157168376;interpro:IPR006427;interpro:IPR006944;uniprot:Q9MBK2;genbank:GeneID:5580341 Probab=98.22 E-value=7.2e-07 Score=54.27 Aligned_cols=211 Identities=13% Similarity=0.120 Sum_probs=106.7 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHhcc--ceeechhHHHhhcCCchHH----HHHHHHHHHH-HhcCch-heeeeecCC Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRRKQQ--AVWKVKGLAEMCDDDDAQY----AARLRLAQVD-DNSGVG-RAIGIDAET 72 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~~~~--~v~k~~~l~~~~~~~~~e~----~~~~r~~~~~-~~r~~~-~~~~iD~~~ 72 (237) -++.+.+.|.....+.....++...... .|++++... ..++... .+++++.-.- ..-++. +.+++++ + T Consensus 190 pi~~~~~~i~~~~~~~~~~~~~f~ng~~p~gvi~~~~~~---~~~~l~~e~~~~~~~~~~~~~~~~~~n~g~~~vl~~-g 265 (423) T protein:vir:81 190 PVQSLRDILGEQIEAAIFRAQMWRNGPRPGMVIMRDPES---KAGKWDAESRTRFMANLRASFSPKSSDVGGTLLLED-G 265 (423) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEecCcc---cCccCCHHHHHHHHHHHHHHhccccccCCcceecCC-C Confidence 4677778887777777777777655322 356654311 1111122 2333333211 112333 3556654 5 Q ss_pred cceeeeecCcCC--HHHHHHHHHHHHhhhhcCceeeeeccCcccccccchhHHHHHHHHHHHHHHHhhhHHHHHHHHHh- Q lcl|NC_019725. 73 EEYDVLNSDISG--VPEFLSSKMDRIVSLSGIHEIIIKNKNVGGVSASQNTALETFYKLVDRKREEDYRPLLEFLLPFI- 149 (237) Q Consensus 73 e~~~~~~~~lsG--l~dl~~~~~~~iaa~s~iP~t~L~G~sp~GlnatGe~D~~nyyd~I~~~Qe~~l~p~l~~l~~~i- 149 (237) -+|..++.+... +-+........||.+-|||-..| |..-++-.++-|.-.+.||.. .|.|.+.++-+-+ T Consensus 266 ~~~~~l~~s~~d~q~~e~~~~~~~eIa~~fgVPp~~l-g~~~~~t~sn~e~~~~~f~~~-------~L~P~~~~ie~~l~ 337 (423) T protein:vir:81 266 MKAENFHTTSKDEQTVETTKLSLQTVAQVYGINPTMV-GQLDNANYSNVREFRKALYGD-------NLGSWIRIIQDVMN 337 (423) T ss_pred ceEEeccCChhhHHHHHHHHhhHHHHHHHhCCCHHHh-cCCCCCCcccHHHHHHHHHHH-------HHHHHHHHHHHHHh Confidence 678777654422 11234456777999999996654 654333222223344455553 4667666554333 Q ss_pred ---hc-----CCCceeEe--CCCCCCCHHHHHHHHHHHHHHHHHHH-hCCCCCHHHHHHHHHhhccccccCCCCCCCh-h Q lcl|NC_019725. 150 ---VE-----EEEWSIEF--EPLSVPSKKEESEITKNNVESVTKAI-TEQIIDLEEARDTLRSIAPEFKLKDGNNINI-R 217 (237) Q Consensus 150 ---~~-----s~~~~~~f--~pL~~~seke~Aei~~~~A~a~~~~~-~~g~i~~~e~r~~l~~~~~~~g~~~~~~~~~-~ 217 (237) .. ..++.|+| ..|...|-+++++ ++++++ +.|+++++|+|+.+ ...+..| ++..-. . T Consensus 338 ~~L~~~~~~~~~~~~~~fd~~~llr~d~~~r~~-------~~~~~l~~~G~~T~NE~R~~~-gl~p~~g---GD~~~~p~ 406 (423) T protein:vir:81 338 LFLLPRVGIDNEKFYFEFNLEEKLRASFEEAAE-------IKRAAVGNVAWMTINEVRAMD-NLPSIDG---GDDLARPL 406 (423) T ss_pred hhhcCccccccCccEEEecchhhhccCHHHHHH-------HHHHHHhCCCCcCHHHHHHHh-CCCCCCC---cceeeccc Confidence 21 13445555 4666666665544 455555 46999999999875 2233222 111111 1 Q ss_pred ccccCCCCCCCCCCCcCc Q lcl|NC_019725. 218 EPEETTEPEPGLGEKLED 235 (237) Q Consensus 218 ~~e~~~e~~~~~~~~~~~ 235 (237) .... .+.++.++++.+. T Consensus 407 n~~~-~~~~~~~~~~~~t 423 (423) T protein:vir:81 407 NTEF-GDSEDAPGEEVET 423 (423) T ss_pred cccc-CccCCCCCCCCCC Confidence 1111 1222333344333 No 115 >protein:vir:99916 Length: 504 # NCBI annotation: gp3 # Family: family:all:524 # MgeID: mge:1611 # MgeName: Halo # Cross-refs: genbank:acc:YP_655520;genbank:gi:109392290;genbank:GeneID:4157085 Probab=98.22 E-value=1.1e-06 Score=53.33 Aligned_cols=234 Identities=10% Similarity=-0.023 Sum_probs=111.4 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHhccceeechhHHH-h--hcCCchHHHHHHHHHHHHHhcCchheeeeecCCccee- Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRRKQQAVWKVKGLAE-M--CDDDDAQYAARLRLAQVDDNSGVGRAIGIDAETEEYD- 76 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~~~~~v~k~~~l~~-~--~~~~~~e~~~~~r~~~~~~~r~~~~~~~iD~~~e~~~- 76 (237) +.+.+.+.+.+++++......-.+-+......+-|... . ..++........++..+...-......+..+..-++. T Consensus 231 i~~~v~~l~Da~~~~~~~~~~~~e~~a~p~r~i~G~~~~~~~~~d~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~q 310 (504) T protein:vir:99 231 ITRPVMSLQQRALKGCIRMDGHADVYSFPQLILLGADAKNFRNKDGSMKPAWQIALARVFALPDDEDEPDAARARADVKQ 310 (504) T ss_pred chhhHHHHHHHHHHHHHHHHHHHHHhcchhhhhccCCccccccccccccchhhhhhhhhhcCCCccccccccCccceeee Confidence 56677788888877776655444444443333333211 1 1122222222222221111111111111111111222 Q ss_pred eeecCcCCHHHHHHHHHHHHhhhhcCceeeeeccCcccccccchhHHHHHH---HHHHHHHHHhhhHHHHHHHHHhhc-- Q lcl|NC_019725. 77 VLNSDISGVPEFLSSKMDRIVSLSGIHEIIIKNKNVGGVSASQNTALETFY---KLVDRKREEDYRPLLEFLLPFIVE-- 151 (237) Q Consensus 77 ~~~~~lsGl~dl~~~~~~~iaa~s~iP~t~L~G~sp~GlnatGe~D~~nyy---d~I~~~Qe~~l~p~l~~l~~~i~~-- 151 (237) .-.+++.++.+.+.....++|+.++||.. -||...-.-|+||+.=..... ..+..+| ..+.+.|++++.+.+. T Consensus 311 ~~~~~l~~~~~~l~~~i~~~a~~t~~P~~-~lG~~~~~n~sSa~Ai~~~~~~L~~ka~~k~-~~f~~~l~~~~rla~~~~ 388 (504) T protein:vir:99 311 FPASSPQPHIEMLEQIAMMFSGETSIPVE-SLGFSNRANPTSADAYIASREDLIAEAEGAT-DDWSPAFRRSMIRALAIK 388 (504) T ss_pred cCCCChHHHHHHHHHHHHHHHhhhCCCHH-HhcccccccccHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHh Confidence 23355778888999999999999999954 455442223456665443333 3444444 4578889888877531 Q ss_pred ------C---CCceeEeCCCCCCCHHHHHHHHHHHHHHHH--------HHHhCCCCCHHHHHHHHHhhcccc------cc Q lcl|NC_019725. 152 ------E---EEWSIEFEPLSVPSKKEESEITKNNVESVT--------KAITEQIIDLEEARDTLRSIAPEF------KL 208 (237) Q Consensus 152 ------s---~~~~~~f~pL~~~seke~Aei~~~~A~a~~--------~~~~~g~i~~~e~r~~l~~~~~~~------g~ 208 (237) . .++.+.|.|...+|..+.|+...|.+++.. .+-..| ++++++.+..+.+-... .+ T Consensus 389 ~~~~~~~~~~~~~~v~w~d~~~~s~a~~aDa~~Kl~~ag~~l~~~~~~l~~~lg-~~~~ei~r~~~e~~~~~~~~~~~~l 467 (504) T protein:vir:99 389 NGLDRIPPEWKTIDSKFRSPLYLSKAAQADAGAKMLGAGPEWLKETEVGLELLG-LTPQQAKRALAERRRASSVSIIEAL 467 (504) T ss_pred cCCCccccccccceeEecCCCccCHHHHHHHHHHHHhhccccccchHHHHhhcC-CCHHHHHHHHHHHHHHhhHHHHHHH Confidence 1 246788999999999999998777666432 122246 46666654332111111 11 Q ss_pred CCCCCC--ChhccccCCCCCCCCCCCcCcCC Q lcl|NC_019725. 209 KDGNNI--NIREPEETTEPEPGLGEKLEDEN 237 (237) Q Consensus 209 ~~~~~~--~~~~~e~~~e~~~~~~~~~~~e~ 237 (237) .+.... .....++.+..+|..+++..... T Consensus 468 ~~~~~~~~~~~~~~~~~~~e~a~~~~~~~~~ 498 (504) T protein:vir:99 468 NRRQQEAATAGEDQDQGAGEPPANEPPAALG 498 (504) T ss_pred hcccCCCCCCCCCCCcCCCCCCCCCCCccCC Confidence 000000 00001111111111111111111 No 116 >protein:vir:97447 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1676 # MgeName: 92 # Cross-refs: genbank:acc:YP_240744;genbank:gi:66396413;genbank:GeneID:5133803 Probab=98.21 E-value=5.6e-07 Score=54.85 Aligned_cols=207 Identities=15% Similarity=0.108 Sum_probs=116.7 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHhccceeechhHHHhhcCCchHHHHHHHHHHHHHhcCchheeeeecCC-cceeeee Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRRKQQAVWKVKGLAEMCDDDDAQYAARLRLAQVDDNSGVGRAIGIDAET-EEYDVLN 79 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~~~~~v~k~~~l~~~~~~~~~e~~~~~r~~~~~~~r~~~~~~~iD~~~-e~~~~~~ 79 (237) .++.+.+.+.+++.+....+.-+..+....+.+.|.. +.........+ ...+++.++.++ -+|-..+ T Consensus 244 d~e~v~~liDa~n~~~s~~~~~~~~~~~~~lv~~g~~-----~~~~~~~~~~~-------~~~~~i~~~~~~~~~~l~~~ 311 (474) T protein:vir:97 244 DIWMYKSIIDAIDKRLSDAQNMFDESVELIYILKGYE-----GEDLEEFMRGL-------KYYKAINVDGDGGVETIQVE 311 (474) T ss_pred cHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCC-----cccchhhhhhh-------hccceeeccCCCceeEEeec Confidence 6778889999999999999888877777777776631 11001111111 123455555432 2344466 Q ss_pred cCcCCHHHHHHHHHHHHhhhhcCceeeeeccCcccccccchhHHHHHHHHH--HHHHHHhhhHHHHHHHHHhhcC----- Q lcl|NC_019725. 80 SDISGVPEFLSSKMDRIVSLSGIHEIIIKNKNVGGVSASQNTALETFYKLV--DRKREEDYRPLLEFLLPFIVEE----- 152 (237) Q Consensus 80 ~~lsGl~dl~~~~~~~iaa~s~iP~t~L~G~sp~GlnatGe~D~~nyyd~I--~~~Qe~~l~p~l~~l~~~i~~s----- 152 (237) .+.+++...++.+...|...+++|-.-. .+.+| |.||..=..-|...+ ...++..+++.|++++.+++.- T Consensus 312 ~~~~~~~~~~~~l~~~I~~~s~~p~~~~--~~~~~-n~Sg~Al~~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~~~~~ 388 (474) T protein:vir:97 312 VPVSSTKEYIDLMRVYIMEFGQGVDFQT--DKFGS-APSGIALKFLYGNLDLKANKLKNKATVAIQELISFIIDFNNLKT 388 (474) T ss_pred CCHHHHHHHHHHHHHHHHHHhCccccCc--ccccc-ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCc Confidence 7888999999999999999999996432 11122 446654333444333 2445567899999998887632 Q ss_pred --CCceeEeCCCCCCCHHHHHHHHHHHHHHHHHHHhCCCCCHHHHHHHH-------------HhhccccccCCCCCCChh Q lcl|NC_019725. 153 --EEWSIEFEPLSVPSKKEESEITKNNVESVTKAITEQIIDLEEARDTL-------------RSIAPEFKLKDGNNINIR 217 (237) Q Consensus 153 --~~~~~~f~pL~~~seke~Aei~~~~A~a~~~~~~~g~i~~~e~r~~l-------------~~~~~~~g~~~~~~~~~~ 217 (237) .++.+.|+|-...+++|.|++. .++|+||.+.+.+.+ +....+..- .-..... T Consensus 389 d~~~i~v~f~~~~p~~~~e~a~~~----------~~~g~iS~et~l~~l~~v~D~~~E~eri~~E~~~~~~-~~~~~~~- 456 (474) T protein:vir:97 389 DVKDIEISFNFNRMMNDAEQSQII----------AQSQYLSRETLVKSSPLVDDYKAELERIEQEQMEYNK-QLPNLDD- 456 (474) T ss_pred ccceeeEEeccCcccCHHHHHHHH----------HHcCCCCHHHHHHhCCCCCCHHHHHHHHHHHHHHHHh-hccccCC- Confidence 4688999999988888877653 334666666555433 110000000 0000000 Q ss_pred ccccCCCCCCCCCCCcCcCC Q lcl|NC_019725. 218 EPEETTEPEPGLGEKLEDEN 237 (237) Q Consensus 218 ~~e~~~e~~~~~~~~~~~e~ 237 (237) ...+.++...++.+.+| T Consensus 457 ---~~~~~~~~~~~~~~~~~ 473 (474) T protein:vir:97 457 ---GGADGAQQQEGSNNKES 473 (474) T ss_pred ---CCCCCcccCCCCccccc Confidence 00011111111112222 No 117 >protein:vir:94498 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1508 # MgeName: 88 # Cross-refs: genbank:acc:YP_240672;genbank:gi:66396340;genbank:GeneID:5133762 Probab=98.21 E-value=5.6e-07 Score=54.85 Aligned_cols=207 Identities=15% Similarity=0.108 Sum_probs=116.7 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHhccceeechhHHHhhcCCchHHHHHHHHHHHHHhcCchheeeeecCC-cceeeee Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRRKQQAVWKVKGLAEMCDDDDAQYAARLRLAQVDDNSGVGRAIGIDAET-EEYDVLN 79 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~~~~~v~k~~~l~~~~~~~~~e~~~~~r~~~~~~~r~~~~~~~iD~~~-e~~~~~~ 79 (237) .++.+.+.+.+++.+....+.-+..+....+.+.|.. +.........+ ...+++.++.++ -+|-..+ T Consensus 244 d~e~v~~liDa~n~~~s~~~~~~~~~~~~~lv~~g~~-----~~~~~~~~~~~-------~~~~~i~~~~~~~~~~l~~~ 311 (474) T protein:vir:94 244 DIWMYKSIIDAIDKRLSDAQNMFDESVELIYILKGYE-----GEDLEEFMRGL-------KYYKAINVDGDGGVETIQVE 311 (474) T ss_pred cHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCC-----cccchhhhhhh-------hccceeeccCCCceeEEeec Confidence 6778889999999999999888877777777776631 11001111111 123455555432 2344466 Q ss_pred cCcCCHHHHHHHHHHHHhhhhcCceeeeeccCcccccccchhHHHHHHHHH--HHHHHHhhhHHHHHHHHHhhcC----- Q lcl|NC_019725. 80 SDISGVPEFLSSKMDRIVSLSGIHEIIIKNKNVGGVSASQNTALETFYKLV--DRKREEDYRPLLEFLLPFIVEE----- 152 (237) Q Consensus 80 ~~lsGl~dl~~~~~~~iaa~s~iP~t~L~G~sp~GlnatGe~D~~nyyd~I--~~~Qe~~l~p~l~~l~~~i~~s----- 152 (237) .+.+++...++.+...|...+++|-.-. .+.+| |.||..=..-|...+ ...++..+++.|++++.+++.- T Consensus 312 ~~~~~~~~~~~~l~~~I~~~s~~p~~~~--~~~~~-n~Sg~Al~~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~~~~~ 388 (474) T protein:vir:94 312 VPVSSTKEYIDLMRVYIMEFGQGVDFQT--DKFGS-APSGIALKFLYGNLDLKANKLKNKATVAIQELISFIIDFNNLKT 388 (474) T ss_pred CCHHHHHHHHHHHHHHHHHHhCccccCc--ccccc-ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCc Confidence 7888999999999999999999996432 11122 446654333444333 2445567899999998887632 Q ss_pred --CCceeEeCCCCCCCHHHHHHHHHHHHHHHHHHHhCCCCCHHHHHHHH-------------HhhccccccCCCCCCChh Q lcl|NC_019725. 153 --EEWSIEFEPLSVPSKKEESEITKNNVESVTKAITEQIIDLEEARDTL-------------RSIAPEFKLKDGNNINIR 217 (237) Q Consensus 153 --~~~~~~f~pL~~~seke~Aei~~~~A~a~~~~~~~g~i~~~e~r~~l-------------~~~~~~~g~~~~~~~~~~ 217 (237) .++.+.|+|-...+++|.|++. .++|+||.+.+.+.+ +....+..- .-..... T Consensus 389 d~~~i~v~f~~~~p~~~~e~a~~~----------~~~g~iS~et~l~~l~~v~D~~~E~eri~~E~~~~~~-~~~~~~~- 456 (474) T protein:vir:94 389 DVKDIEISFNFNRMMNDAEQSQII----------AQSQYLSRETLVKSSPLVDDYKAELERIEQEQMEYNK-QLPNLDD- 456 (474) T ss_pred ccceeeEEeccCcccCHHHHHHHH----------HHcCCCCHHHHHHhCCCCCCHHHHHHHHHHHHHHHHh-hccccCC- Confidence 4688999999988888877653 334666666555433 110000000 0000000 Q ss_pred ccccCCCCCCCCCCCcCcCC Q lcl|NC_019725. 218 EPEETTEPEPGLGEKLEDEN 237 (237) Q Consensus 218 ~~e~~~e~~~~~~~~~~~e~ 237 (237) ...+.++...++.+.+| T Consensus 457 ---~~~~~~~~~~~~~~~~~ 473 (474) T protein:vir:94 457 ---GGADGAQQQEGSNNKES 473 (474) T ss_pred ---CCCCCcccCCCCccccc Confidence 00011111111112222 No 118 >protein:vir:107112 Length: 478 # NCBI annotation: putative phage portal protein # Family: family:all:125 # MgeID: mge:1571 # MgeName: CNPH82 # Cross-refs: genbank:acc:YP_950601;genbank:gi:119953681;genbank:GeneID:4643121 Probab=98.19 E-value=1.2e-06 Score=52.99 Aligned_cols=215 Identities=17% Similarity=0.117 Sum_probs=115.9 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHhccceeechhHHHhhcCCchHHHHHHHHHHHHHhcCchheeeeecC-C--cceee Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRRKQQAVWKVKGLAEMCDDDDAQYAARLRLAQVDDNSGVGRAIGIDAE-T--EEYDV 77 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~~~~~v~k~~~l~~~~~~~~~e~~~~~r~~~~~~~r~~~~~~~iD~~-~--e~~~~ 77 (237) .++.+.+.+.+++.+....+.-+..+....+.+.|... ....+ ....+. ..+++.++.+ + -+|-. T Consensus 247 d~e~v~~liDa~~~~~S~~~~~~~~~~~~~~~~~g~~~---~~~~~--~~~~~~-------~~~~~~~~~~~~~~~~~l~ 314 (478) T protein:vir:10 247 DLFMYKTIIDALDKRLSDTQNTFDESVELIYILKGYEG---EDMKD--FMHNLK-------YYKAISVAGESGSGVDTIK 314 (478) T ss_pred cHHHHHHHHHHHHHHHHHHHHHHHHhhCcceeeecCCc---ccccc--hhhhhh-------hCceeEecCCCCCcceEEe Confidence 77888899999999999999888887777777665311 11011 111111 1344555332 2 33555 Q ss_pred eecCcCCHHHHHHHHHHHHhhhhcCceeeeeccCcccccccchhHHHHHHHHH--HHHHHHhhhHHHHHHHHHhhcC--- Q lcl|NC_019725. 78 LNSDISGVPEFLSSKMDRIVSLSGIHEIIIKNKNVGGVSASQNTALETFYKLV--DRKREEDYRPLLEFLLPFIVEE--- 152 (237) Q Consensus 78 ~~~~lsGl~dl~~~~~~~iaa~s~iP~t~L~G~sp~GlnatGe~D~~nyyd~I--~~~Qe~~l~p~l~~l~~~i~~s--- 152 (237) .+.+.+++...++.+.+.|...+++|-.-. +.. +| |.||..=..-|.... ....+..+.+.|++++.+++.- T Consensus 315 ~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~-~~~-~~-n~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~ 391 (478) T protein:vir:10 315 VEVPIDSVKEYTKMLRDYIIEFGQGVDFQQ-DKF-GN-SPSGIALKFMYSNLDLKANKLKNKTLTALQELLQYIIDFYRL 391 (478) T ss_pred ecCCHHHHHHHHHHHHHHHHHHhCCcCcCc-ccc-cc-chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCC Confidence 667888999999999999999999996432 211 12 456654333444433 2344566889999998887631 Q ss_pred ----CCceeEeCCCCCCCHHHHHHHHHHHHHHHH---HHHhCCCC-CHHHHHHHHHhhccc-cccCCCCCCChhccccCC Q lcl|NC_019725. 153 ----EEWSIEFEPLSVPSKKEESEITKNNVESVT---KAITEQII-DLEEARDTLRSIAPE-FKLKDGNNINIREPEETT 223 (237) Q Consensus 153 ----~~~~~~f~pL~~~seke~Aei~~~~A~a~~---~~~~~g~i-~~~e~r~~l~~~~~~-~g~~~~~~~~~~~~e~~~ 223 (237) .++.+.|+|-..-+++|.|++..+.+...+ .+-..+.+ ++++..+++++...+ ....+ ++.... . T Consensus 392 ~~d~~~i~i~f~~~~p~~~~e~~~~~~~~~g~iS~et~i~~~~~v~d~~~E~~ri~~E~~~~~~~~~--~~~~~~----~ 465 (478) T protein:vir:10 392 DVRVQDIEITFNFNVMVNELENSQIAMNSTGLLSKETILGNHSWVQDPVAEMERIEQENIELNQQLP--DIEEGL----N 465 (478) T ss_pred CcccccceEEeCCCCCCCHHHHHHHHHHHhCCCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHhcc--ccCCCC----c Confidence 378899999999999998887654332111 11112222 233333333221110 00000 000000 0 Q ss_pred CCCCCCCCCcCcC Q lcl|NC_019725. 224 EPEPGLGEKLEDE 236 (237) Q Consensus 224 e~~~~~~~~~~~e 236 (237) +++...++..+.| T Consensus 466 d~~~~~~~d~~~e 478 (478) T protein:vir:10 466 DEQQRQSEDNQSE 478 (478) T ss_pred ccccccCcCCCCC Confidence 0000111111111 No 119 >protein:vir:1082 Length: 359 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:21 # MgeName: bIL309 # Cross-refs: genbank:acc:NP_076736;genbank:gi:13095846;genbank:GeneID:920394 Probab=98.19 E-value=1.3e-07 Score=58.27 Aligned_cols=184 Identities=15% Similarity=0.187 Sum_probs=98.3 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHhcc--ceeechhHHHhhcCCchHHHHHHHHHHHHHhcCchheeeeecCCcceeee Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRRKQQ--AVWKVKGLAEMCDDDDAQYAARLRLAQVDDNSGVGRAIGIDAETEEYDVL 78 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~~~~--~v~k~~~l~~~~~~~~~e~~~~~r~~~~~~~r~~~~~~~iD~~~e~~~~~ 78 (237) -++.+.+.+.....+............. .+++++.- .+ +++....++++++......+..+.+++++ +.+|..+ T Consensus 170 pi~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~--~l-~~e~~~~~~~~~~~~~~~~n~g~~~vl~~-g~~~~~l 245 (359) T protein:vir:10 170 PLESLTSEIGQQKEANRLSLSTLKGALNPTSVVKVPQG--TL-SSEAKDSIRKEFEKANGGNNSGRVMVLDQ-SADFSTV 245 (359) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCCC--CC-CHHHHHHHHHHHHHHhCccccCCceecCC-Ccceeee Confidence 3467777777777777777776655332 46666421 11 22233456666665433332234566664 5777776 Q ss_pred ecCcCCH--HHHHHHHHHHHhhhhcCceeeeeccCcccccccchhHHHHHHHHHHHHHHHhhhHHHHHHHHHhhc--CCC Q lcl|NC_019725. 79 NSDISGV--PEFLSSKMDRIVSLSGIHEIIIKNKNVGGVSASQNTALETFYKLVDRKREEDYRPLLEFLLPFIVE--EEE 154 (237) Q Consensus 79 ~~~lsGl--~dl~~~~~~~iaa~s~iP~t~L~G~sp~GlnatGe~D~~nyyd~I~~~Qe~~l~p~l~~l~~~i~~--s~~ 154 (237) +.+.... -+........||.+-|||-..|-|. +.-++ .|+.++..-...++|.|..+.+-|-. ... T Consensus 246 ~~~~~d~q~le~~~~~~~~Ia~~fgVPp~~lg~~--~~~~~--------~~~~~e~~~~~~l~~~l~p~~~~l~~~l~~~ 315 (359) T protein:vir:10 246 SINADVANYLNSMNWGRTQIAKAFGVSDSYLNGT--GDQQS--------SLDQIKDLYVNALNRFIEPLISELRIKCDSS 315 (359) T ss_pred cCCHHHHHHHHHHHHHHHHHHHHhCCCHHHhCCC--Ccccc--------cHHHHHHHHHHHHHHHHHHHHHHHHHHhhhh Confidence 6543322 2344556778999999998877332 22112 23334333334445544444433321 122 Q ss_pred ceeEeCCCCCCCHHHHHHHHHHHHHHHHHHHhCCCCCHHHHHHHHHhhcccc Q lcl|NC_019725. 155 WSIEFEPLSVPSKKEESEITKNNVESVTKAITEQIIDLEEARDTLRSIAPEF 206 (237) Q Consensus 155 ~~~~f~pL~~~seke~Aei~~~~A~a~~~~~~~g~i~~~e~r~~l~~~~~~~ 206 (237) +.+....+...+.... ...+..++++|+++++|+|+.|.. .+.. T Consensus 316 ~~~~~~~~~~~d~~~~-------~~~~~~~~~~G~~t~NE~R~~l~~-~pv~ 359 (359) T protein:vir:10 316 IGVDMSPITDYSNSVF-------KADILNWVKEGIIEPTEAKTLLES-KGII 359 (359) T ss_pred hcccchhhhhcCHHHH-------HHHHHHHHhCCCcCHHHHHHHhCC-CCCC Confidence 3333344444443222 233556899999999999998732 2211 No 120 >protein:vir:94805 Length: 492 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1531 # MgeName: 29 # Cross-refs: genbank:acc:YP_240531;genbank:gi:66396197;genbank:GeneID:5133585 Probab=98.16 E-value=1.7e-06 Score=52.26 Aligned_cols=217 Identities=13% Similarity=0.127 Sum_probs=117.9 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHhccceeechhHHHhhcCCchHHHHHHHHHHHHHhcCchheeeeecCC-cceeeee Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRRKQQAVWKVKGLAEMCDDDDAQYAARLRLAQVDDNSGVGRAIGIDAET-EEYDVLN 79 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~~~~~v~k~~~l~~~~~~~~~e~~~~~r~~~~~~~r~~~~~~~iD~~~-e~~~~~~ 79 (237) .++.+.+-+.+++.+....+..+..+....+.+.|... .+ .......+ ...+++.++.++ -+|-..+ T Consensus 261 d~e~v~~liDa~d~~~S~~~~~~~~~~~p~lv~~g~~~----~~-~~~~~~~~-------~~~~~~~~~~~~~~~~l~~~ 328 (492) T protein:vir:94 261 DIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLKNYDD----QE-LPEFKRLL-------RYYGAIKVSDNGGVDTIQVE 328 (492) T ss_pred chHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCCc----cc-chhhHHHH-------hhccceecCCCCcceeEecc Confidence 67888899999999999999888888888887776421 11 11111111 123344444332 2355577 Q ss_pred cCcCCHHHHHHHHHHHHhhhhcCceeeeeccCcccccccchhHHHHHHHH--HHHHHHHhhhHHHHHHHHHhhcC----- Q lcl|NC_019725. 80 SDISGVPEFLSSKMDRIVSLSGIHEIIIKNKNVGGVSASQNTALETFYKL--VDRKREEDYRPLLEFLLPFIVEE----- 152 (237) Q Consensus 80 ~~lsGl~dl~~~~~~~iaa~s~iP~t~L~G~sp~GlnatGe~D~~nyyd~--I~~~Qe~~l~p~l~~l~~~i~~s----- 152 (237) .+.+++...++.+...|...+++|-.-. + .-+| |.||+.=..-|... -...++..++..|++++++++.- T Consensus 329 ~~~~~~~~~~~~l~~~I~~~s~~p~~~~-~-~~~~-n~Sg~Al~~~~~~l~~k~~~k~~~f~~~l~~~~~li~~~~~~~~ 405 (492) T protein:vir:94 329 VPVENSKKYLDELYQKIMLFGQAVDFSS-D-KFGS-APSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDIKG 405 (492) T ss_pred CCHHHHHHHHHHHHHHHHHHhCCcCCCc-c-cccc-CchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCc Confidence 8888999999999999999999996432 1 1122 44665422223322 23455567888999988887632 Q ss_pred --CCceeEeCCCCCCCHHHHHHHHHHHHHHH---HHHHhCCCC-CHHHHHHHHHhhccccccCCCCCCChhccccCCCCC Q lcl|NC_019725. 153 --EEWSIEFEPLSVPSKKEESEITKNNVESV---TKAITEQII-DLEEARDTLRSIAPEFKLKDGNNINIREPEETTEPE 226 (237) Q Consensus 153 --~~~~~~f~pL~~~seke~Aei~~~~A~a~---~~~~~~g~i-~~~e~r~~l~~~~~~~g~~~~~~~~~~~~e~~~e~~ 226 (237) .++.+.|+|-...++++.|++..+.+... ..+-..+.+ ++++..+++.+...+. +..-.+... ..++.+ T Consensus 406 ~~~~i~v~f~~~~p~~~~e~~~~~~kl~giiS~et~~~~l~~v~d~~~E~eri~~E~~~~-~~~~~~~~~----~~~~~~ 480 (492) T protein:vir:94 406 EHKDVDISFNYNKVANTELQVQTAQQSMGIVSHETVLENHPFVEDLQAELERIEQEQMEY-NKQLPNLDD----GGADSA 480 (492) T ss_pred ccceeeEEecCCCCCCHHHHHHHHHHHhccCchHHHHHhCCCCCCHHHHHHHHHHHHHHH-Hhhcccccc----ccCCCC Confidence 46889999999999999988766643211 111122222 2333333222110000 000000000 000111 Q ss_pred CCCCCCcCcCC Q lcl|NC_019725. 227 PGLGEKLEDEN 237 (237) Q Consensus 227 ~~~~~~~~~e~ 237 (237) +...++.+.|+ T Consensus 481 ~~~~~~~~~e~ 491 (492) T protein:vir:94 481 QQQERSNNKES 491 (492) T ss_pred ccccCCccccC Confidence 11111111122 No 121 >protein:vir:78227 Length: 480 # NCBI annotation: gp11 # Family: family:all:524 # MgeID: mge:1849 # MgeName: Bethlehem # Cross-refs: genbank:acc:YP_001491663;genbank:gi:157786487;genbank:GeneID:5625705 Probab=98.15 E-value=4.3e-07 Score=55.50 Aligned_cols=222 Identities=10% Similarity=0.100 Sum_probs=110.1 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHhccceeechhHHH-hhcCCchHHHHHHHHHHHHHhcCchheeeeecCCcceeeee Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRRKQQAVWKVKGLAE-MCDDDDAQYAARLRLAQVDDNSGVGRAIGIDAETEEYDVLN 79 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~~~~~v~k~~~l~~-~~~~~~~e~~~~~r~~~~~~~r~~~~~~~iD~~~e~~~~~~ 79 (237) +.+.|.+.+.+++++.......+..+....+.+.|... ......... .+..+ ...++.+.+++-+|-+++ T Consensus 224 i~~~v~~l~Da~~~~~s~~~~~~~~~a~p~~~i~G~~~~~~~~~~~~~-------~~~~~--~~~~~~~~~~~~~~~~~~ 294 (480) T protein:vir:78 224 ISPELRKVTDAASRTLMNLQSASQILGTPLRVISGVTTDELTNDGENT-------TLDIY--YGRILTLASEAAKISEFK 294 (480) T ss_pred chhhHHHHHHHHHHHHHHHHHHHHhhcchhhhhhcCCccccccccccc-------hhhhh--hhhhccCCCCCceEEecC Confidence 33457777788888877776666544444333333211 011111110 11111 112233333323343333 Q ss_pred -cCcCCHHHHHHHHHHHHhhhhcCceeeeeccCcccccccchhHHHHHHHHHHH--HHHHhhhHHHHHHHHHhhcC---- Q lcl|NC_019725. 80 -SDISGVPEFLSSKMDRIVSLSGIHEIIIKNKNVGGVSASQNTALETFYKLVDR--KREEDYRPLLEFLLPFIVEE---- 152 (237) Q Consensus 80 -~~lsGl~dl~~~~~~~iaa~s~iP~t~L~G~sp~GlnatGe~D~~nyyd~I~~--~Qe~~l~p~l~~l~~~i~~s---- 152 (237) +++.+..+.+.....++++.++||...|-|.+ .+ ++||+.-...|...+.. .++..+.+.|.+++.+++.- T Consensus 295 ~~~~~~~~~~l~~~i~~~~~~~~~p~~~~g~~~-~n-~~Sg~Alk~~~~~l~~ka~~~~~~f~~~l~~~~~l~~~~~g~~ 372 (480) T protein:vir:78 295 AAELRNFAEEMEVFRKEAASITGLPPQYLSSSS-EN-PASAEAIIATDSRIVKMAERKGRIFGGAWERAMRIAMQIMGRE 372 (480) T ss_pred ccCHHHHHHHHHHHHHHHhcccCCChHHhcccc-Cc-chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCC Confidence 35667777888888999999999998885533 22 35776655555555533 33456888999998887631 Q ss_pred -----CCceeEeCCCCCCCHHHHHHHHHHHHHHH-------HHHHhCCCCCHHHHHH--HHHhhccc---cccCCCCCCC Q lcl|NC_019725. 153 -----EEWSIEFEPLSVPSKKEESEITKNNVESV-------TKAITEQIIDLEEARD--TLRSIAPE---FKLKDGNNIN 215 (237) Q Consensus 153 -----~~~~~~f~pL~~~seke~Aei~~~~A~a~-------~~~~~~g~i~~~e~r~--~l~~~~~~---~g~~~~~~~~ 215 (237) .++.+.|.+-..++..+.|+...+-+++. ..+-..|+++ +++.+ ..++...+ ..+. ... T Consensus 373 ~~~~~~~i~v~f~~~~~~s~~~~ad~~~kl~~~g~~~~s~et~~~~lg~~~-d~~~~~~~~~~e~~~~~~~~~~---~~~ 448 (480) T protein:vir:78 373 VTEEYTRLETVWRDPSTPTVAAKADAVSKLYANGQGPIPKEQARIDLGYTA-TQREQMRDWDKQETEDMIDTLY---STT 448 (480) T ss_pred ccccceeeeEEecCCCCCCHHHHHHHHHHHHHhccccCCHHHHHhcCCCCH-hHHHHHHHHHHHHHHHHHHHhh---ccc Confidence 25788999999999999887766654432 1223455443 23221 11111000 1111 011 Q ss_pred hhccccCCCCCCCCCCCcCcCC Q lcl|NC_019725. 216 IREPEETTEPEPGLGEKLEDEN 237 (237) Q Consensus 216 ~~~~e~~~e~~~~~~~~~~~e~ 237 (237) ..+....+++..+...+...+. T Consensus 449 ~~~~~~~~~~~~~~~~~~~~~~ 470 (480) T protein:vir:78 449 KAQADATPKPTVTETKTETQTS 470 (480) T ss_pred cccCCCCCCCCCCCCCCccccc Confidence 1111111111111111111111 No 122 >protein:vir:101647 Length: 460 # NCBI annotation: phage portal protein # Family: family:all:26542 # MgeID: mge:1646 # MgeName: 11b # Cross-refs: genbank:acc:YP_112492;genbank:gi:53793592;uniprot:Q5ZGG1;genbank:GeneID:3101755 Probab=98.14 E-value=1.1e-06 Score=53.33 Aligned_cols=213 Identities=11% Similarity=0.110 Sum_probs=99.7 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHhc-cce-eechhHHHhhcCCchHHHHHHHHHHHHHhcCchh-eeeeecCCcceee Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRRKQ-QAV-WKVKGLAEMCDDDDAQYAARLRLAQVDDNSGVGR-AIGIDAETEEYDV 77 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~~~-~~v-~k~~~l~~~~~~~~~e~~~~~r~~~~~~~r~~~~-~~~iD~~~e~~~~ 77 (237) -++.+.+.|.....+............ ... +++++ .+ +++....+++++.-......|.+ .+++++ +-+|.. T Consensus 229 p~~~~~~~i~~~~~~~~~~~~~f~ng~~~~~i~~~~~---~l-~~e~~~~~~~~~~~~~~g~~n~g~~~vl~~-g~~~~~ 303 (460) T protein:vir:10 229 PIRAILRNINSQNSTIDNNVKTMQNGGVFGFIHGGST---GL-TQPQADSLKQRLTEMDKSPDRLSQIAGASG-EIAFTK 303 (460) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhcCCCcceeeecCC---CC-CHHHHHHHHHHHHHHhcCccccCCceecCC-CceEEE Confidence 355666777776666666666555422 222 22221 12 12223344554443333333433 456654 567777 Q ss_pred eecCcCC--HHHHHHHHHHHHhhhhcCceeeeeccCccccccc-chhHHHHHHHHHHHHHHHhhhHHHHHHHHHhh---- Q lcl|NC_019725. 78 LNSDISG--VPEFLSSKMDRIVSLSGIHEIIIKNKNVGGVSAS-QNTALETFYKLVDRKREEDYRPLLEFLLPFIV---- 150 (237) Q Consensus 78 ~~~~lsG--l~dl~~~~~~~iaa~s~iP~t~L~G~sp~Glnat-Ge~D~~nyyd~I~~~Qe~~l~p~l~~l~~~i~---- 150 (237) ++.+... +-+........||.+-|||-..|-...-+..|.+ -+.....||.. .|.|.+.++-..+- T Consensus 304 l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~sn~e~~~~~f~~~-------~l~P~~~~ie~~ln~kl~ 376 (460) T protein:vir:10 304 ISLNTDELKPFDYLKYDQKAICNALGWSDKLLNNNEGGGLNTGNLEEERKRVVTD-------NIQPDLVILKQAFDKKFI 376 (460) T ss_pred ccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCCccccHHHHHHHHHHH-------HHHHHHHHHHHHHHHhhc Confidence 7665432 2345567779999999999885543322333322 23344555543 46666665543332 Q ss_pred ----cCCCceeEeCCCCCCCHHHHHHHHHHHHHHHHHHHhCCCCCHHHHHHHHHhhccccccCCCCC-----CChhcccc Q lcl|NC_019725. 151 ----EEEEWSIEFEPLSVPSKKEESEITKNNVESVTKAITEQIIDLEEARDTLRSIAPEFKLKDGNN-----INIREPEE 221 (237) Q Consensus 151 ----~s~~~~~~f~pL~~~seke~Aei~~~~A~a~~~~~~~g~i~~~e~r~~l~~~~~~~g~~~~~~-----~~~~~~e~ 221 (237) ...++.|+|+ ...+.. -+ ....+...++++|+++++|+|+.+- ..+... ++.+ ...-..+. T Consensus 377 ~~~~~~~~~~i~~d-~~~l~~-l~-----~d~~~~~~~~~~g~~T~NE~R~~~g-~~pi~~--~~gD~~~~~~n~~~~~~ 446 (460) T protein:vir:10 377 KRFKGYENAVIEWD-ISELPE-MQ-----TDMVAMASWLNTIPVTPNEIRIAMK-YETLNQ--DGMDIVFMPSNKVRIDD 446 (460) T ss_pred CcccccCCceEEee-cchhhh-HH-----HHHHHHHHHHhCCCCCHHHHHHHhC-CCCCCC--CCCCeeeecccccchhh Confidence 1235666664 122211 11 1122333477899999999999762 222100 0011 00001111 Q ss_pred CCCCCCCCCCCcCc Q lcl|NC_019725. 222 TTEPEPGLGEKLED 235 (237) Q Consensus 222 ~~e~~~~~~~~~~~ 235 (237) ..+.....++.-+. T Consensus 447 ~~~~~~~~~~nq~~ 460 (460) T protein:vir:10 447 VSNNLIDSAFNQNQ 460 (460) T ss_pred cccccCCCcccCCC Confidence 11111111111111 No 123 >protein:vir:96839 Length: 474 # NCBI annotation: ORF008 # Family: family:all:125 # MgeID: mge:1642 # MgeName: EW # Cross-refs: genbank:acc:YP_240152;genbank:gi:66395815;genbank:GeneID:5133180 Probab=98.13 E-value=1.4e-06 Score=52.75 Aligned_cols=205 Identities=17% Similarity=0.136 Sum_probs=124.5 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHhccceeechhHHHhhcCCchHHHHHHHHHHHHHhcCchheeeeecCCcc--eeee Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRRKQQAVWKVKGLAEMCDDDDAQYAARLRLAQVDDNSGVGRAIGIDAETEE--YDVL 78 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~~~~~v~k~~~l~~~~~~~~~e~~~~~r~~~~~~~r~~~~~~~iD~~~e~--~~~~ 78 (237) .++.+.+.+.+++.+....+.-+..+....+.+.|..- .+.+ ..... .+ -.+++.+++++.+ |-+. T Consensus 247 d~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~g~~~--~~~~---~~~~~------~~-~~~~i~~~~~~~~~~~l~~ 314 (474) T protein:vir:96 247 DLFMYKTIIDAMDKRLSDTQNTFDESTELIYILKGYEG--QDLD---EFMRN------LK-YYKAINVDGDGSGVDTIQI 314 (474) T ss_pred cHHHHHHHHHHHHHHHHHHHHHHHHhccceeeeecCCc--cccc---chhhh------hh-cCceEEecCCCCceeEEee Confidence 67788888999999999998888888888887766321 1111 11111 11 2456667665545 4445 Q ss_pred ecCcCCHHHHHHHHHHHHhhhhcCceeeeeccCcccccccchhHHHHHHHHH--HHHHHHhhhHHHHHHHHHhhc----- Q lcl|NC_019725. 79 NSDISGVPEFLSSKMDRIVSLSGIHEIIIKNKNVGGVSASQNTALETFYKLV--DRKREEDYRPLLEFLLPFIVE----- 151 (237) Q Consensus 79 ~~~lsGl~dl~~~~~~~iaa~s~iP~t~L~G~sp~GlnatGe~D~~nyyd~I--~~~Qe~~l~p~l~~l~~~i~~----- 151 (237) +.+.+++...++.....|...+++|-.-. +.. | =|.||..=..-|...+ ....+..++..|.+++.+++. T Consensus 315 ~~~~~~~~~~~~~l~~~i~~~s~~p~~~~-~~~-~-~n~Sg~Al~~~~~~l~~k~~~k~~~~~~~l~~~~~~i~~~~~~~ 391 (474) T protein:vir:96 315 EVPVQSSKEYLDMLRDYVIEFGQGVDFQQ-DKF-G-NSPSGIALKFMYSNLDLKANKLKNKTLTALQELLQYIIDFYKLN 391 (474) T ss_pred cCChHHHHHHHHHHHHHHHHHhCCccccc-ccc-c-cccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCC Confidence 67788999999999999999999996532 221 2 2456765444444443 355556789999998888753 Q ss_pred --CCCceeEeCCCCCCCHHHHHHHHHHHHHHHHHHHhCCCCCHHHHHHHHHhhccccccCCCCCCChhcc-----c---- Q lcl|NC_019725. 152 --EEEWSIEFEPLSVPSKKEESEITKNNVESVTKAITEQIIDLEEARDTLRSIAPEFKLKDGNNINIREP-----E---- 220 (237) Q Consensus 152 --s~~~~~~f~pL~~~seke~Aei~~~~A~a~~~~~~~g~i~~~e~r~~l~~~~~~~g~~~~~~~~~~~~-----e---- 220 (237) ..++.+.|+|-...++++.|++ +.++|+||.+.+.+.+-. .++.....+.+ + T Consensus 392 ~~~~~i~i~f~~~~p~~~~e~~~~----------~~~ag~iS~et~~~~~~~-------v~d~~~E~~ri~~E~~e~~~~ 454 (474) T protein:vir:96 392 IKVQDVEITFNFNVMVNELEQSQI----------GVQSQYLSKETVVTNHPW-------VDDPVAELERIEQDNIDFNKQ 454 (474) T ss_pred cccceeeEEeccCCCcCHHHHHHH----------HHhcCCCchHHHHHhCCC-------CCCHHHHHHHHHHHHHHHHhc Confidence 2468899999999998888764 234688888777664311 01000000101 0 Q ss_pred --cCCCCCCCCCCCcCcCC Q lcl|NC_019725. 221 --ETTEPEPGLGEKLEDEN 237 (237) Q Consensus 221 --~~~e~~~~~~~~~~~e~ 237 (237) ....++.+..+..+.|| T Consensus 455 ~~~~~~~~~~~~~d~~~e~ 473 (474) T protein:vir:96 455 LPPLEGDANGRAQDNESET 473 (474) T ss_pred ccccccccccccCCCcccC Confidence 01111222222233333 No 124 >protein:vir:106639 Length: 481 # NCBI annotation: ORF003 # Family: family:all:125 # MgeID: mge:1557 # MgeName: 187 # Cross-refs: genbank:acc:YP_239490;genbank:gi:66395218;genbank:GeneID:4555793 Probab=98.11 E-value=7e-07 Score=54.33 Aligned_cols=209 Identities=14% Similarity=0.101 Sum_probs=112.9 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHhccceeechhHHHhhcCCchHHHHHH-HHHHHHHhcCchheeee-ecCCc--cee Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRRKQQAVWKVKGLAEMCDDDDAQYAARL-RLAQVDDNSGVGRAIGI-DAETE--EYD 76 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~~~~~v~k~~~l~~~~~~~~~e~~~~~-r~~~~~~~r~~~~~~~i-D~~~e--~~~ 76 (237) .++.+.+-+.+++++....+.-+..+....+.+.|... .+++....+.. +... .. .+.... ..++- +|- T Consensus 241 ~~~~v~~lida~~~~~s~~~~~~~~~~~~~~~~~g~~~--~~~~~~~~~~~~~~~~--~~---~~~~~~~~~~~~~~~~l 313 (481) T protein:vir:10 241 DFENVIALIDLYDSAQSDTANYMTDLNDAMLAIIGNVD--LDSEDAKAFRDANMIH--LE---PGTNANGSEGKAEVKYV 313 (481) T ss_pred chhhHHHHHHHHHHHHHHHHHHHHHhcCceeEeecCcC--CCccchhhhhhcccee--cc---ccccccCCCCCcceeEE Confidence 66778889999999998888888888888887766321 12211111111 1100 00 000011 11112 244 Q ss_pred eeecCcCCHHHHHHHHHHHHhhhhcCceeeeeccCcccccccchhHHHHHHHHH--HHHHHHhhhHHHHHHHHHhhcC-- Q lcl|NC_019725. 77 VLNSDISGVPEFLSSKMDRIVSLSGIHEIIIKNKNVGGVSASQNTALETFYKLV--DRKREEDYRPLLEFLLPFIVEE-- 152 (237) Q Consensus 77 ~~~~~lsGl~dl~~~~~~~iaa~s~iP~t~L~G~sp~GlnatGe~D~~nyyd~I--~~~Qe~~l~p~l~~l~~~i~~s-- 152 (237) ..+.+..++...++.+...|...+++|-. -+|.. + -|.||+.=...|...+ ...++..+++.+++++.+++.- T Consensus 314 ~~~~~~~~~~~~~~~l~~~i~~~s~~p~~-~~~~~-~-~n~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~ 390 (481) T protein:vir:10 314 YKQYDVAGVEAYKKRLQNDIHKYTNTPDL-NDEQF-S-GVQSGESMKYKLFGLEQVRAIKERLFKKGLMKRYKLLLNNVN 390 (481) T ss_pred eecCCHHHHHHHHHHHHHHHHHHhCCccc-ccccc-c-cccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh Confidence 55666788999999999999999999954 33322 1 2446654333333332 3334567899999988887521 Q ss_pred ---------CCceeEeCCCCCCCHHHHHHHHHHHHHHHHHHHhCCCCCHHHHHHHH-------------HhhccccccCC Q lcl|NC_019725. 153 ---------EEWSIEFEPLSVPSKKEESEITKNNVESVTKAITEQIIDLEEARDTL-------------RSIAPEFKLKD 210 (237) Q Consensus 153 ---------~~~~~~f~pL~~~seke~Aei~~~~A~a~~~~~~~g~i~~~e~r~~l-------------~~~~~~~g~~~ 210 (237) .++++.|+|-...++++.|++..+.+ |+||.+.+.+.| ++...+..- T Consensus 391 ~~~~~~~~~~~i~v~f~~~~~~~~~~~a~~~~kl~---------g~is~et~~~~l~~i~d~~~E~~ri~~E~~~~~~-- 459 (481) T protein:vir:10 391 LTGLKQHNYAELTITFTPNLPKSMMESINAFNALS---------GGVSESTRLSLLDFIDNPKEELEKMQEEEAQREK-- 459 (481) T ss_pred ccCCCccccceeeEEeCCCCCcCHHHHHHHHHHHh---------ccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHh-- Confidence 25789999999999999998766542 555554444332 111100000 Q ss_pred CCCCChhccccCCCCCCCC-CCCcCcCC Q lcl|NC_019725. 211 GNNINIREPEETTEPEPGL-GEKLEDEN 237 (237) Q Consensus 211 ~~~~~~~~~e~~~e~~~~~-~~~~~~e~ 237 (237) ..+....++++. +++.++.| T Consensus 460 -------~~~~~~~~~~~~~~~~~dd~~ 480 (481) T protein:vir:10 460 -------QADKRGYGEAFENHLNVDDSN 480 (481) T ss_pred -------hhhhccCCccCCCCCCCCCCC Confidence 000000011111 11111111 No 125 >protein:vir:96579 Length: 576 # NCBI annotation: ORF012 # Family: family:all:2446 # MgeID: mge:1623 # MgeName: Twort # Cross-refs: genbank:acc:YP_238542;genbank:gi:66391267;genbank:GeneID:5130361 Probab=98.10 E-value=1.8e-06 Score=52.02 Aligned_cols=225 Identities=13% Similarity=0.132 Sum_probs=103.4 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHhcc--ceeechhHHHhhcCCchHHHHHHHHHHHHHhcCchhe-eeeecCCcceee Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRRKQQ--AVWKVKGLAEMCDDDDAQYAARLRLAQVDDNSGVGRA-IGIDAETEEYDV 77 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~~~~--~v~k~~~l~~~~~~~~~e~~~~~r~~~~~~~r~~~~~-~~iD~~~e~~~~ 77 (237) -++.+.+.|.....+......+...... .++++++-.. + +.+....++++++-.-..-+|.+- .++-.++-+|.. T Consensus 259 pi~~a~~~i~~~~~~~~~~~~~f~Ng~~p~giL~~~~~~~-l-s~e~~~~lr~~~~~~~~G~~nag~~p~vl~~G~~~~~ 336 (576) T protein:vir:96 259 EVEIAMKQFIAYNNTETFNDRFFSHGGTTRGILQIKSEQQ-Q-SQRALENFKREWKSSFSGINGSWQVPVVMADDIKFVN 336 (576) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCC-C-CHHHHHHHHHHHHHHhccccccccceeecCCCceEEe Confidence 3577778888888887777777665433 3455543111 1 122223444444422222223332 344334567777 Q ss_pred eecCcC--CHHHHHHHHHHHHhhhhcCceeeeeccCccc-c---cccchhHHHHHHHHHHHHHHHhhhHHHHHHHHHhh- Q lcl|NC_019725. 78 LNSDIS--GVPEFLSSKMDRIVSLSGIHEIIIKNKNVGG-V---SASQNTALETFYKLVDRKREEDYRPLLEFLLPFIV- 150 (237) Q Consensus 78 ~~~~ls--Gl~dl~~~~~~~iaa~s~iP~t~L~G~sp~G-l---natGe~D~~nyyd~I~~~Qe~~l~p~l~~l~~~i~- 150 (237) ++.+.. -+-+........||.+-|||...| |..-++ . ++.|..-..|.-..-..+-+..|.|.+.++-..|- T Consensus 337 ls~~~~d~qfle~~~~~~~~Ia~afgVPp~~l-G~~~~~~~~g~~~~~s~t~sn~e~~~~~f~~~tL~P~~~~ie~~ln~ 415 (576) T protein:vir:96 337 MTPTANDMQFEKWLTYLINIISALYGIDPAEI-GFPNRGGATGGKGGNTLNEADPGKKQQQSQNKGLQPLLRFIEDLINT 415 (576) T ss_pred ccCChhhHHHHHHHHHhHHHHHHHhCCCHHHc-cccccccccccccccccccccHHHHHHHHHHHHHHHHHHHHHHHHHh Confidence 765443 344566777899999999998877 543222 1 11111111111222222223347777766644432 Q ss_pred ---cC--CCceeEeCCCCCCCHHHHHHHHHHHHHHHHHHHhCCCCCHHHHHHHHHhhccccccC---------CCCCCCh Q lcl|NC_019725. 151 ---EE--EEWSIEFEPLSVPSKKEESEITKNNVESVTKAITEQIIDLEEARDTLRSIAPEFKLK---------DGNNINI 216 (237) Q Consensus 151 ---~s--~~~~~~f~pL~~~seke~Aei~~~~A~a~~~~~~~g~i~~~e~r~~l~~~~~~~g~~---------~~~~~~~ 216 (237) .. ..+.|+|. ..+.+.+++.. +....+..|+++++|+|+.+- ..+..|-. ....... T Consensus 416 ~Ll~~~~~~~~~~f~---r~d~~~~~e~~-----~~~~~~~~G~lT~NE~R~~~g-l~piegGD~~~~~~~~~~~~~~~~ 486 (576) T protein:vir:96 416 HIISEYSDKYVFQFV---GGDTKSELDKI-----KILQEEVKTYKTVNEARKEKG-LKPIEGGDVLLDGSFIQSMSLNTQ 486 (576) T ss_pred hhchhccCceEEEec---cCCHHHHHHHH-----HHHHHHhcCccCHHHHHHHhC-CCCCCCcceecccccccccccccc Confidence 22 34555553 34444444322 222345579999999999762 22322200 0000000 Q ss_pred ---hccc----------c-CCCCCCCCCCCcCcCC Q lcl|NC_019725. 217 ---REPE----------E-TTEPEPGLGEKLEDEN 237 (237) Q Consensus 217 ---~~~e----------~-~~e~~~~~~~~~~~e~ 237 (237) .+.+ + ...+++..+.+...++ T Consensus 487 ~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~ 521 (576) T protein:vir:96 487 KEQYEDTKQKERFDMIQQFLNSPDDEEPQQESTED 521 (576) T ss_pred CCCCCCccccccccccccccCCCCCCCCCCCCCCC Confidence 0000 0 0001111111111111 No 126 >protein:vir:105461 Length: 470 # NCBI annotation: putative phage portal protein # Family: family:all:125 # MgeID: mge:1502 # MgeName: KC5a # Cross-refs: genbank:acc:YP_529871;genbank:gi:90592611;genbank:GeneID:3974525 Probab=98.10 E-value=1.6e-06 Score=52.42 Aligned_cols=202 Identities=12% Similarity=0.046 Sum_probs=124.5 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHhccceeechhHHHhhcCCchHHHHHHHHHHHHHhcCchheeeeecCC------cc Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRRKQQAVWKVKGLAEMCDDDDAQYAARLRLAQVDDNSGVGRAIGIDAET------EE 74 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~~~~~v~k~~~l~~~~~~~~~e~~~~~r~~~~~~~r~~~~~~~iD~~~------e~ 74 (237) .++.+.+.+.+++.+....+.-+..+....+.+.|.. . .+. . ....- .+ ..+++.+...+ -+ T Consensus 242 d~e~v~~liDa~d~~~S~~~~~~~~~~~~~lvl~g~~-~-~~~-~--~~~~~------~~-~~~~i~~~~~~~~~~~~~~ 309 (470) T protein:vir:10 242 ELNKYKGLIDAYDDIYNGFINDLDDVQTVILVLTNYG-G-ADL-H--QFMND------LR-KYKSIKINNTGNGDNSGVD 309 (470) T ss_pred chhHHHHHHHHHHHHHHHHHHHHHHhcCcceeeecCC-c-ccc-c--hhhhh------hh-hcCeEeccCCCCCcCceeE Confidence 6889999999999999999999998888888887631 1 111 1 11111 11 12344443211 24 Q ss_pred eeeeecCcCCHHHHHHHHHHHHhhhhcCceeeeeccCcccccccchhHHHHHHHHH--HHHHHHhhhHHHHHHHHHhhc- Q lcl|NC_019725. 75 YDVLNSDISGVPEFLSSKMDRIVSLSGIHEIIIKNKNVGGVSASQNTALETFYKLV--DRKREEDYRPLLEFLLPFIVE- 151 (237) Q Consensus 75 ~~~~~~~lsGl~dl~~~~~~~iaa~s~iP~t~L~G~sp~GlnatGe~D~~nyyd~I--~~~Qe~~l~p~l~~l~~~i~~- 151 (237) |-..+.+..+....++.+...|...+++|-.-..+ .| |.||..=...|.... ....+..+++.|++++.+++. T Consensus 310 ~lt~~~~~~~~~~~~~~L~~~I~~~s~~p~~~~~~---~g-n~Sg~Alk~~~~~l~~k~~~~~~~~~~~l~~~~~~i~~~ 385 (470) T protein:vir:10 310 KLQIDIPVEARDDALKITRKNIFLFGQGIDPANFE---SS-NASGVAIKMLYSHLELKAAKTQTYFEHAINELVRAIMRY 385 (470) T ss_pred EEeecCChHHHHHHHHHHHHHHHHHhCCCCCCccc---cc-cchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 66777888999999999999999999999754332 23 677876545555544 556667789999999888753 Q ss_pred -------CCCceeEeCCCCCCCHHHHHHHHHHHHHHHHHHHhCCCCCHHHHHHHHHhhccccccCCCCCCChhccc---- Q lcl|NC_019725. 152 -------EEEWSIEFEPLSVPSKKEESEITKNNVESVTKAITEQIIDLEEARDTLRSIAPEFKLKDGNNINIREPE---- 220 (237) Q Consensus 152 -------s~~~~~~f~pL~~~seke~Aei~~~~A~a~~~~~~~g~i~~~e~r~~l~~~~~~~g~~~~~~~~~~~~e---- 220 (237) ..++++.|+|-...+++|.|++..+. +|+||.+.+.+.+- +.+ +++ ++.+ T Consensus 386 l~~~~~d~~~i~i~f~~~~p~d~~e~~~~~~~~---------~g~iS~et~l~~~p-------~v~--D~~-~E~eri~~ 446 (470) T protein:vir:10 386 LNFSDADKRHISQHWTRTKVEDSLTKAQIVSTV---------ANYSSKEAVAKANP-------IVD--DWQ-QELKDLAK 446 (470) T ss_pred hcccCcccceeeEEeccCCCCCHHHHHHHHHHH---------hccCcHHHHHHhCC-------CCC--CHH-HHHHHHHH Confidence 14788999999999999999876553 35666655544321 000 111 1110 Q ss_pred -------cCCCCCCCCCCCcCcCC Q lcl|NC_019725. 221 -------ETTEPEPGLGEKLEDEN 237 (237) Q Consensus 221 -------~~~e~~~~~~~~~~~e~ 237 (237) ...+.+...+.+.++|- T Consensus 447 E~~e~~~~~~~~~~~~~~~~dde~ 470 (470) T protein:vir:10 447 DKEENDPYSNQADELNGKGVNDEQ 470 (470) T ss_pred HHHHHHHhhccccccCCCCCCCCC Confidence 01111111111111111 No 127 >protein:vir:96179 Length: 468 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1602 # MgeName: 37 # Cross-refs: genbank:acc:YP_240075;genbank:gi:66395736;genbank:GeneID:5133166 Probab=98.08 E-value=4.3e-07 Score=55.47 Aligned_cols=202 Identities=18% Similarity=0.190 Sum_probs=122.4 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHhccceeechhHHHhhcCCchHHHHHHHHHHHHHhcCchheeeeecCC-c--ceee Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRRKQQAVWKVKGLAEMCDDDDAQYAARLRLAQVDDNSGVGRAIGIDAET-E--EYDV 77 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~~~~~v~k~~~l~~~~~~~~~e~~~~~r~~~~~~~r~~~~~~~iD~~~-e--~~~~ 77 (237) .++.+.+.+.+++.+....+.-+..+....+.+.|.. ..+. + .....+ ...+.+.+++++ - +|-. T Consensus 247 d~e~v~~liDa~d~~~S~~~~~~~~~~~p~lv~~g~~--~~~~--~-~~~~~~-------~~~~~i~~~~d~~~~~~~l~ 314 (468) T protein:vir:96 247 DLFMYKTIIDAMDKRLSDTQNTFDEATELIYVLKGYE--GEDL--E-EFMYNL-------KYYKAINVDGDGSGGVDTIQ 314 (468) T ss_pred chHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCC--cccc--c-hhhhhh-------hcCceEEecCCCCCcceEEe Confidence 6777888889999999888888887777777776531 1111 1 111111 124556665442 1 3445 Q ss_pred eecCcCCHHHHHHHHHHHHhhhhcCceeeeeccCcccccccchhHHHHHHHHH--HHHHHHhhhHHHHHHHHHhhcC--- Q lcl|NC_019725. 78 LNSDISGVPEFLSSKMDRIVSLSGIHEIIIKNKNVGGVSASQNTALETFYKLV--DRKREEDYRPLLEFLLPFIVEE--- 152 (237) Q Consensus 78 ~~~~lsGl~dl~~~~~~~iaa~s~iP~t~L~G~sp~GlnatGe~D~~nyyd~I--~~~Qe~~l~p~l~~l~~~i~~s--- 152 (237) .+.+.+++...++.+.++|...+++|-.- ++ .-+| |.||..=..-|.... ....+..++..|++++++++.- T Consensus 315 ~~~~~~~~~~~~~~l~~~I~~~s~~p~~~-~~-~~~~-n~Sg~Alk~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~g~ 391 (468) T protein:vir:96 315 IDVPVQSAKEYLDMLRDYVIEFGQGVDFQ-QD-KFGN-SPSGIALKFMYSNLDLKANKLKNKTLTALQELLQYIIDFYKL 391 (468) T ss_pred ecCChHHHHHHHHHHHHHHHHHhCccccc-cc-cccc-chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCC Confidence 56667799999999999999999999642 22 2222 667765444444443 2444567899999998887632 Q ss_pred ----CCceeEeCCCCCCCHHHHHHHHHHHHHHHHHHHhCCCCCHHHHHHHHHhhccccccCCCCCCChhccc-------c Q lcl|NC_019725. 153 ----EEWSIEFEPLSVPSKKEESEITKNNVESVTKAITEQIIDLEEARDTLRSIAPEFKLKDGNNINIREPE-------E 221 (237) Q Consensus 153 ----~~~~~~f~pL~~~seke~Aei~~~~A~a~~~~~~~g~i~~~e~r~~l~~~~~~~g~~~~~~~~~~~~e-------~ 221 (237) .++.+.|+|-...+++|.|++. .++|+||.+.+.+.+- +. + ++ .++.+ + T Consensus 392 ~~d~~~i~i~f~~~~p~d~~e~a~~~----------~~~g~iS~et~i~~l~------~v-~--D~-~~E~~ri~~E~~~ 451 (468) T protein:vir:96 392 SIKVQDVEITFNFNVMVNELEQSQIG----------VNSQYLSKETVVTNHP------WV-D--DP-VAEMERIDQEELA 451 (468) T ss_pred CcccceeeEEecCCCCcCHHHHHHHH----------HhcCCCchHHHHHhCC------CC-C--CH-HHHHHHHHHHHHH Confidence 4688999999999988877642 3458999877765431 11 1 11 12221 1 Q ss_pred CCCCCCCCCCCcCcCC Q lcl|NC_019725. 222 TTEPEPGLGEKLEDEN 237 (237) Q Consensus 222 ~~e~~~~~~~~~~~e~ 237 (237) ....+.+.+...++|. T Consensus 452 ~~~~~~~~~~~~~~~~ 467 (468) T protein:vir:96 452 LPSIEEGLNGKENNEP 467 (468) T ss_pred HHHHhhccCCCCCCCC Confidence 1112223333333333 No 128 >protein:vir:95806 Length: 440 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1578 # MgeName: SMP # Cross-refs: genbank:acc:YP_950583;genbank:gi:119953778;genbank:GeneID:5076876 Probab=98.07 E-value=1.5e-06 Score=52.55 Aligned_cols=216 Identities=15% Similarity=0.058 Sum_probs=117.8 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHhccceeechhHHHhhcCCchH-HHHHHHHHHHHHhcCchheeeeecC-Ccceeee Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRRKQQAVWKVKGLAEMCDDDDAQ-YAARLRLAQVDDNSGVGRAIGIDAE-TEEYDVL 78 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~~~~~v~k~~~l~~~~~~~~~e-~~~~~r~~~~~~~r~~~~~~~iD~~-~e~~~~~ 78 (237) .++.+.+.+.+++.+....+.-+..+....+.+.|........+.. ..+...-.++ ............+ +-+|-.. T Consensus 203 d~e~v~~lida~~~~~s~~~~~~~~~~~~~~v~~g~~~~~~~~~e~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~lt~ 280 (440) T protein:vir:95 203 DYESEISLIDAYDAGQSDTANYMSDLNDAMLLVKGDLDGIKLSPEDAAKMKDANMLF--LKTGISTTGQQTTADASYIYK 280 (440) T ss_pred chhhhHHHHHHHHHHHHHHHHHHHHhhcceeeeecccccCCCCccchhhhhhcccee--cccccccccCCCCcceeEEee Confidence 7788889999999999999988888888777776643222211111 1111100000 0000011111111 1245556 Q ss_pred ecCcCCHHHHHHHHHHHHhhhhcCceeeeeccCcccccccchhHHHHHHHHH--HHHHHHhhhHHHHHHHHHhhc----- Q lcl|NC_019725. 79 NSDISGVPEFLSSKMDRIVSLSGIHEIIIKNKNVGGVSASQNTALETFYKLV--DRKREEDYRPLLEFLLPFIVE----- 151 (237) Q Consensus 79 ~~~lsGl~dl~~~~~~~iaa~s~iP~t~L~G~sp~GlnatGe~D~~nyyd~I--~~~Qe~~l~p~l~~l~~~i~~----- 151 (237) +.+..++...++.+...|...+++|-.-+-+.+ | |.||+.=..-|...+ ...++..++..+.+++.+++. T Consensus 281 ~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~-~--n~Sg~Al~~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~~~~ 357 (440) T protein:vir:95 281 QYDVNGTEAYKNRLANDIHRFSRIPNLDDDRFN-S--TSSGIALLYKMIGLEQVRKDKETYFTKALRRRYELISNIHKAI 357 (440) T ss_pred cCCHHHHHHHHHHHHHHHHHHhCCccccccccc-c--cchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhc Confidence 677889999999999999999999975442221 2 345654333333332 333446688889988888652 Q ss_pred ------CCCceeEeCCCCCCCHHHHHHHHHHHHHHHHHHHhCCCCCHHHHHHHHHhhccccccCCCC---CCChhccccC Q lcl|NC_019725. 152 ------EEEWSIEFEPLSVPSKKEESEITKNNVESVTKAITEQIIDLEEARDTLRSIAPEFKLKDGN---NINIREPEET 222 (237) Q Consensus 152 ------s~~~~~~f~pL~~~seke~Aei~~~~A~a~~~~~~~g~i~~~e~r~~l~~~~~~~g~~~~~---~~~~~~~e~~ 222 (237) ..++++.|+|-...++++.|++..+. .|+||.+.+.+.|- +..+.. .+..|..+.. T Consensus 358 ~~~~~~~~~v~i~f~~~~p~~~~~~ad~~~kl---------~g~iS~et~~~~l~------~~d~~~E~~ri~~E~~~~~ 422 (440) T protein:vir:95 358 NGPVIEANKLTFTFHPNIPQDVWTEIKAYIEA---------GGEISQETLMENAS------FTDYKTEHSRILKQGGSSD 422 (440) T ss_pred CCcccccccceEEeCCCCCCCHHHHHHHHHHH---------hccCcHHHHHHhCC------CCCcHHHHHHHHHHHHHhh Confidence 13678999999999999999876553 35677655554431 110000 0000000111 Q ss_pred CC----CCCCCCCCcCcC Q lcl|NC_019725. 223 TE----PEPGLGEKLEDE 236 (237) Q Consensus 223 ~e----~~~~~~~~~~~e 236 (237) .+ ..+..+.+.++| T Consensus 423 ~~~~~~~~~~~~~~~~~e 440 (440) T protein:vir:95 423 LEIGQIVGDADVGQADTE 440 (440) T ss_pred hhHHhhccCCCCCCcCCC Confidence 10 011112222223 No 129 >protein:vir:96494 Length: 501 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1620 # MgeName: 2972 # Cross-refs: genbank:acc:YP_238488;genbank:gi:66391764;genbank:GeneID:5176916 Probab=98.07 E-value=2.2e-06 Score=51.65 Aligned_cols=220 Identities=15% Similarity=0.117 Sum_probs=121.7 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHhccceeechhHHHhhcCCchHHHHHHHHHHHHHhcCchheeeee--------cCC Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRRKQQAVWKVKGLAEMCDDDDAQYAARLRLAQVDDNSGVGRAIGID--------AET 72 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~~~~~v~k~~~l~~~~~~~~~e~~~~~r~~~~~~~r~~~~~~~iD--------~~~ 72 (237) .++.+.+.+.+++.+....+.-+..+....+.+.|.... ..++ . ...+. ..+++.++ ..+ T Consensus 250 d~e~v~~liDa~d~~~s~~~~~~~~~~~~~l~i~G~~~~-~~~~--~--~~~~~-------~~~~~~~~~~~~~~~~~~~ 317 (501) T protein:vir:96 250 DYETELYLIDLYDSAESDTANHMSDMADAILAIYGDLAL-PKGM--Q--ASDMK-------RTRLMQLKPPKSADGKEGT 317 (501) T ss_pred chhhhHHHHHHHHHHHHHHHHHHHHhcCceeeeeccccc-Cccc--c--hhhhh-------hcCeeeecccccccccccC Confidence 777899999999999999999888888888877764211 1111 0 01111 01112221 111 Q ss_pred --cceeeeecCcCCHHHHHHHHHHHHhhhhcCceeeeeccCcccccccchhHHHHHHHHH--HHHHHHhhhHHHHHHHHH Q lcl|NC_019725. 73 --EEYDVLNSDISGVPEFLSSKMDRIVSLSGIHEIIIKNKNVGGVSASQNTALETFYKLV--DRKREEDYRPLLEFLLPF 148 (237) Q Consensus 73 --e~~~~~~~~lsGl~dl~~~~~~~iaa~s~iP~t~L~G~sp~GlnatGe~D~~nyyd~I--~~~Qe~~l~p~l~~l~~~ 148 (237) -.|-..+.+.+++...++.+...|...+++|-.-+-+.+ + |.||..=..-|.... ...++..++..|++++.+ T Consensus 318 ~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~-~--n~Sg~Al~~~~~~l~~ka~~~~~~~~~~l~~~~~l 394 (501) T protein:vir:96 318 VKAEYLTKSYDVSGAEAYKTRLNRDIHIFTNTPDMSDTNFS-G--NTSGEALKYKLFGLDQDRVDTQSQFTKGLKRRYRL 394 (501) T ss_pred cceeeEeccCCHHHHHHHHHHHHHHHHHHhCCcccCccccc-c--cchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 234456677789999999999999999999965543221 2 446654332233222 445557788899988887 Q ss_pred hhc---------C---CCceeEeCCCCCCCHHHHHHHHHHHHHHHH---HHHhCCCC-CHHHHHHHHHhhcccc---ccC Q lcl|NC_019725. 149 IVE---------E---EEWSIEFEPLSVPSKKEESEITKNNVESVT---KAITEQII-DLEEARDTLRSIAPEF---KLK 209 (237) Q Consensus 149 i~~---------s---~~~~~~f~pL~~~seke~Aei~~~~A~a~~---~~~~~g~i-~~~e~r~~l~~~~~~~---g~~ 209 (237) ++. . .++.+.|+|-...+.++.|++..+.+..++ .+-..+.+ ++++..+++++...+. +.. T Consensus 395 i~~~~~~~~~~~~~d~~~i~i~f~~~~p~n~~e~ad~~~kl~g~iS~et~~~~l~~v~D~~~E~~ri~~E~~~~~~~~~~ 474 (501) T protein:vir:96 395 AARIGSLVNEFKDFDESLLKITFTPNLPKSLNEQVSILTGLGGQVSQETALSLSGLVESPNEELDKINKEMSEIDFKGYS 474 (501) T ss_pred HHHHHHhcccccccccccceEEeCCCCCcCHHHHHHHHHHHhccCchHHHHHhCCCCCCHHHHHHHHHHHHHHhhccccc Confidence 642 1 257899999999999999988776653321 12233334 4555555554322111 111 Q ss_pred CCC-CCChhccccCCCCCCCCCCCcCc Q lcl|NC_019725. 210 DGN-NINIREPEETTEPEPGLGEKLED 235 (237) Q Consensus 210 ~~~-~~~~~~~e~~~e~~~~~~~~~~~ 235 (237) +.. +...+..++..+.++..+|.... T Consensus 475 ~~~~~~~~~~~~~~~e~~~d~~e~~~~ 501 (501) T protein:vir:96 475 NDFNEHVGKYTDEVKETHTDDFEREYE 501 (501) T ss_pred cchhhcccccCCcCCCCCCCccccccC Confidence 110 00011111111111111111111 No 130 >protein:vir:9751 Length: 422 # NCBI annotation: putative structural protein # Family: family:all:524 # MgeID: mge:175 # MgeName: 315.3 # Cross-refs: genbank:acc:NP_795513;genbank:gi:28876291;genbank:GeneID:1257832 Probab=98.06 E-value=5.3e-07 Score=54.97 Aligned_cols=199 Identities=13% Similarity=0.042 Sum_probs=116.1 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHhccceeechhHHHhhcCCchHHHHHHHHHHHHHhcCchheeeeecCC--c--cee Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRRKQQAVWKVKGLAEMCDDDDAQYAARLRLAQVDDNSGVGRAIGIDAET--E--EYD 76 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~~~~~v~k~~~l~~~~~~~~~e~~~~~r~~~~~~~r~~~~~~~iD~~~--e--~~~ 76 (237) +.+.+++.+.+++++......-.+-+......+-|+.. ++........++ ..++.+.++. + ++. T Consensus 203 I~e~v~~l~da~~r~~~~~~~~~e~~a~pqr~i~G~d~---d~~~~~~~~~~~---------~~i~~~~~de~~~~~~v~ 270 (422) T protein:vir:97 203 ITKAGMYHQKAAKRTLERAEVTAEFYSFPQKYVLGMDP---DAKPMEKWRATV---------STLLEISKDEDGDKPTVG 270 (422) T ss_pred cchhHHHHHHHHHHHHHHHHHHHHHhcchhhhhcccCc---ccccCchhhhhh---------hhhhccCCCCCCCcceee Confidence 56778888888888877655555555555555545321 121111122111 2344554321 1 232 Q ss_pred -eeecCcCCHHHHHHHHHHHHhhhhcCceeeeeccCcccccccchhHH---HHHHHHHHHHHHHhhhHHHHHHHHHhhcC Q lcl|NC_019725. 77 -VLNSDISGVPEFLSSKMDRIVSLSGIHEIIIKNKNVGGVSASQNTAL---ETFYKLVDRKREEDYRPLLEFLLPFIVEE 152 (237) Q Consensus 77 -~~~~~lsGl~dl~~~~~~~iaa~s~iP~t~L~G~sp~GlnatGe~D~---~nyyd~I~~~Qe~~l~p~l~~l~~~i~~s 152 (237) .-++++.+..+.+.....++|++++||..-|-|.+- . ++||++=. ......++.+| ..+.+.++++..+++.- T Consensus 271 q~~~~~l~~~~~~l~~~~~~~a~~s~lP~~~lg~~~~-N-psSa~Ai~a~~~~L~~ka~~k~-~~fg~~l~~~~rla~~~ 347 (422) T protein:vir:97 271 QFTTASMAPFMEHLKMYASLFAGGSGLTLDDLGFPSD-N-PSSVESIKAAHENLRAAGRKAQ-RSFSSGFLNVAYIAVCL 347 (422) T ss_pred ecCCCChhHHHHHHHHHHHHHhcccCCCHHHhccccC-c-hhHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHH Confidence 233677888899999999999999999777666552 1 14555433 33444455554 55888899888875421 Q ss_pred -----------CCceeEeCCCCCCCHHHHHHHHHHHHHHHHHHHhC--CCCCHHHHHHHHHhhccccccCCCCCCChhcc Q lcl|NC_019725. 153 -----------EEWSIEFEPLSVPSKKEESEITKNNVESVTKAITE--QIIDLEEARDTLRSIAPEFKLKDGNNINIREP 219 (237) Q Consensus 153 -----------~~~~~~f~pL~~~seke~Aei~~~~A~a~~~~~~~--g~i~~~e~r~~l~~~~~~~g~~~~~~~~~~~~ 219 (237) .++.+.|.|....+..+.|. .|+++.+++++ |+.+.+.+++.| |+. +.++..... T Consensus 348 ~~~~~~~~~~~~~~~~~w~p~~~~~~~s~a~----~aDa~~Kl~~a~~~~~~~~~~~~~l-------g~~-~~~~~~~~~ 415 (422) T protein:vir:97 348 RDEFPYLRNQFMDTVIKWEPLFEADANMLTL----VGDGAIKLNQAIPGFMDADVIRDLT-------GVK-GADKPIPAI 415 (422) T ss_pred hcCCcccchhhccceEEEccCCCCChHHHHH----HHHHHHHHHhhccccccHHHHHHHc-------CCC-chhHHHHHH Confidence 14679999988887665554 45788888887 677777777655 331 122222222 Q ss_pred ccCCCCCC Q lcl|NC_019725. 220 EETTEPEP 227 (237) Q Consensus 220 e~~~e~~~ 227 (237) ++. ..+. T Consensus 416 ~~~-~~d~ 422 (422) T protein:vir:97 416 TEV-TTDG 422 (422) T ss_pred Hhh-hccC Confidence 222 1111 No 131 >protein:vir:97336 Length: 492 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1666 # MgeName: 52A # Cross-refs: genbank:acc:YP_240606;genbank:gi:66396273;genbank:GeneID:5133692 Probab=98.05 E-value=2.3e-06 Score=51.54 Aligned_cols=218 Identities=13% Similarity=0.149 Sum_probs=118.2 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHhccceeechhHHHhhcCCchHHHHHHHHHHHHHhcCchheeeeecCC-cceeeee Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRRKQQAVWKVKGLAEMCDDDDAQYAARLRLAQVDDNSGVGRAIGIDAET-EEYDVLN 79 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~~~~~v~k~~~l~~~~~~~~~e~~~~~r~~~~~~~r~~~~~~~iD~~~-e~~~~~~ 79 (237) .++.+.+-+.+++.+....+.-+..+....+.+.|... .+.. .....+ ...+++.++.++ -+|-..+ T Consensus 261 d~e~v~~liDa~d~~~S~~~~~~~~~~~~~l~~~g~~~----~~~~-~~~~~~-------~~~~~~~~~~~~~~~~l~~~ 328 (492) T protein:vir:97 261 DIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLKNYDD----QELP-EFKRLL-------RYYGAIKVSDNGGVDTIQVE 328 (492) T ss_pred chHhHHHHHHHHHHHHHHHHHHHHHhccceeeeecCCc----ccch-hHHHHH-------hhccceecCCCCcceeEecc Confidence 78888899999999999999888888888888776421 1101 111111 122344444332 2344467 Q ss_pred cCcCCHHHHHHHHHHHHhhhhcCceeeeeccCcccccccchhHHHHHHHHH--HHHHHHhhhHHHHHHHHHhhcC----- Q lcl|NC_019725. 80 SDISGVPEFLSSKMDRIVSLSGIHEIIIKNKNVGGVSASQNTALETFYKLV--DRKREEDYRPLLEFLLPFIVEE----- 152 (237) Q Consensus 80 ~~lsGl~dl~~~~~~~iaa~s~iP~t~L~G~sp~GlnatGe~D~~nyyd~I--~~~Qe~~l~p~l~~l~~~i~~s----- 152 (237) .+.+++...++.+.+.|...+++|-.-+ + .-+| |.||+.=..-|.... ....+..++..+++++++++.- T Consensus 329 ~~~~~~~~~~~~L~~~I~~~s~~p~~~~-~-~~~~-n~Sg~Al~~~~~~l~~ka~~~~~~f~~~l~~~~~li~~~~~~~~ 405 (492) T protein:vir:97 329 VPVENSKKYLDELYQKIMLFGQAVDFSS-D-KFGS-APSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDIKG 405 (492) T ss_pred CCHHHHHHHHHHHHHHHHHHhCCCCCCc-c-cccc-CcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCc Confidence 7888999999999999999999996433 1 1112 446665333444333 3455567888999988887632 Q ss_pred --CCceeEeCCCCCCCHHHHHHHHHHHHHHHH---HHHhCCCC-CHHHHHHHHHhhccccccCCCCCCChhccccCCCCC Q lcl|NC_019725. 153 --EEWSIEFEPLSVPSKKEESEITKNNVESVT---KAITEQII-DLEEARDTLRSIAPEFKLKDGNNINIREPEETTEPE 226 (237) Q Consensus 153 --~~~~~~f~pL~~~seke~Aei~~~~A~a~~---~~~~~g~i-~~~e~r~~l~~~~~~~g~~~~~~~~~~~~e~~~e~~ 226 (237) .++.+.|+|-...++++.|++..+.+..++ .+-..+.+ ++++..+++.+...+.. ....+......+ ...++ T Consensus 406 ~~~~i~v~f~~~~p~~~~e~a~~~~kl~G~iS~et~l~~l~~v~d~~~Eleri~~E~~~~~-~~~~~~~~~~~~-~~~~~ 483 (492) T protein:vir:97 406 EHKDVDISFNYNKVANTELQVQTAQQSMGIVSHETVLENHPFVEDLQAELERIEQEQTEYN-KQLPNLDDGGAD-SAQQQ 483 (492) T ss_pred ccceeeEEecCCCCCCHHHHHHHHHHHhccCchHHHHHhCCCCCCHHHHHHHHHHHHHHHH-HhhhccccCCCC-CCccc Confidence 468899999999999999887766542211 11222222 23333333322110000 000000000000 00000 Q ss_pred CCCCCCcCcC Q lcl|NC_019725. 227 PGLGEKLEDE 236 (237) Q Consensus 227 ~~~~~~~~~e 236 (237) ...++..+ | T Consensus 484 ~~~~~~~~-e 492 (492) T protein:vir:97 484 ERSNNKES-E 492 (492) T ss_pred cccccccc-C Confidence 00000000 0 No 132 >protein:vir:2732 Length: 501 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:58 # MgeName: O1205 # Cross-refs: genbank:acc:NP_695105;genbank:gi:23455874;genbank:GeneID:955614 Probab=98.04 E-value=2.1e-06 Score=51.69 Aligned_cols=220 Identities=15% Similarity=0.092 Sum_probs=121.7 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHhccceeechhHHHhhcCCchHHHHHHHHHHHHHhcCchheeeeec--------CC Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRRKQQAVWKVKGLAEMCDDDDAQYAARLRLAQVDDNSGVGRAIGIDA--------ET 72 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~~~~~v~k~~~l~~~~~~~~~e~~~~~r~~~~~~~r~~~~~~~iD~--------~~ 72 (237) .++.+.+-+.+++.+....+.-+..+....+.+.|.... ..++. ...+. . .+++.++. .+ T Consensus 250 d~e~v~~liDa~d~~~S~~~~~~~~~~~~~~v~~g~~~~---~~~~~--~~~~~------~-~~~~~~~~~~~~~~~~~~ 317 (501) T protein:vir:27 250 DYETELYLIDLYDSAESDTANHMSDMADAILAIYGDLAL---PKGMQ--ASDMK------R-TRLMQLKPPKSADGKEGT 317 (501) T ss_pred chhhhHHHHHHHHHHHHHHHHHHHHhcCceeeeecCccC---Ccccc--hhhhh------h-cCceeecccccccCCCCC Confidence 778899999999999999999888888888887763211 11111 11111 0 12222211 11 Q ss_pred --cceeeeecCcCCHHHHHHHHHHHHhhhhcCceeeeeccCcccccccchhHHHHHHH--HHHHHHHHhhhHHHHHHHHH Q lcl|NC_019725. 73 --EEYDVLNSDISGVPEFLSSKMDRIVSLSGIHEIIIKNKNVGGVSASQNTALETFYK--LVDRKREEDYRPLLEFLLPF 148 (237) Q Consensus 73 --e~~~~~~~~lsGl~dl~~~~~~~iaa~s~iP~t~L~G~sp~GlnatGe~D~~nyyd--~I~~~Qe~~l~p~l~~l~~~ 148 (237) -+|-..+.+.+++...++.+.+.|...+++|-.-.-+. + -|.||..=...|.. .-...++..++..|++++.+ T Consensus 318 ~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~--~-~n~Sg~Al~~~~~~l~~ka~~~~~~~~~~l~~~~~l 394 (501) T protein:vir:27 318 VKAEYLTKSYDVSGAEAYKTRLNRDIHIFTNIPDMSDTNF--S-GNTSGEALKYKLFGLDQDRVDTQSQFTQGLKRRYRL 394 (501) T ss_pred cceeeeeccCCHHHHHHHHHHHHHHHHHHhCCcccCcccc--c-cCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 23555566778999999999999999999996443222 1 24566543333322 22455667789999999888 Q ss_pred hhcC------------CCceeEeCCCCCCCHHHHHHHHHHHHHHHH---HHHhCCCC-CHHHHHHHHHhhccc---cccC Q lcl|NC_019725. 149 IVEE------------EEWSIEFEPLSVPSKKEESEITKNNVESVT---KAITEQII-DLEEARDTLRSIAPE---FKLK 209 (237) Q Consensus 149 i~~s------------~~~~~~f~pL~~~seke~Aei~~~~A~a~~---~~~~~g~i-~~~e~r~~l~~~~~~---~g~~ 209 (237) ++.- .++.+.|+|-...+.++.|++..+.+..++ .+-..+.+ ++++-.+++++...+ .+.. T Consensus 395 i~~~~~~~~~~~~~d~~~i~v~f~~~~p~n~~e~ad~~~kl~g~iS~et~l~~l~~v~D~~~E~eri~~E~~e~~~~~~~ 474 (501) T protein:vir:27 395 AARIGSLVNEFKDFDESLLKITFTPNLPKSLNEQVSILTGLGGQVSQETALSLSGLVESPNEELDKINKEVSEIDFKGYS 474 (501) T ss_pred HHHHHhhcccccccccccceEEeCCCCCcCHHHHHHHHHHHhccCcHHHHHHhCCCCCCHHHHHHHHHHHHHhhhHhhhc Confidence 6521 257899999999999999998777654322 12233444 355555555432111 1111 Q ss_pred CCCCCChhccccCCCCCCCCC-CCcCcCC Q lcl|NC_019725. 210 DGNNINIREPEETTEPEPGLG-EKLEDEN 237 (237) Q Consensus 210 ~~~~~~~~~~e~~~e~~~~~~-~~~~~e~ 237 (237) ++ ..+.......++++..+ +.++... T Consensus 475 ~~--~~~~~~~~~d~~~~~~~d~~e~~~~ 501 (501) T protein:vir:27 475 ND--FNEHVGKYTDEVKETHTDDFERAYE 501 (501) T ss_pred Cc--cccccccccCCCCCCccccccccCC Confidence 11 11110000001111111 1111111 No 133 >protein:vir:106571 Length: 499 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:1598 # MgeName: Lj965 # Cross-refs: genbank:acc:NP_958580;genbank:gi:41179240;genbank:GeneID:2717107 Probab=98.03 E-value=6e-06 Score=49.21 Aligned_cols=222 Identities=15% Similarity=0.050 Sum_probs=120.8 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHhccceeechhHHHhhcCCchHHHHHHHHHHHHHhcCchheeeeecC-Ccc--eee Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRRKQQAVWKVKGLAEMCDDDDAQYAARLRLAQVDDNSGVGRAIGIDAE-TEE--YDV 77 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~~~~~v~k~~~l~~~~~~~~~e~~~~~r~~~~~~~r~~~~~~~iD~~-~e~--~~~ 77 (237) .++.+.+.+.+++.+....+.-+..+....+.+.|.. +-..... ..++. ...+.+++.+ +.+ |-. T Consensus 245 d~e~v~~liD~~~~~~S~~~~~~~~~~~~~lv~~G~~-~~~~~~~----~~~~~-------~~~~~~~~~~~~~d~~~l~ 312 (499) T protein:vir:10 245 DFEQLISLIDAYNLLQTDRISDKEAFVDALLVTFGFG-LGDDKDD----IQRLK-------RGAIEAPPREEGADIEWLT 312 (499) T ss_pred chHhHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCc-cccccch----hhhhh-------hcceeccCCCCCCcceEEe Confidence 6777888888899888888888888877777776531 1111110 11111 1223333322 222 444 Q ss_pred eecCcCCHHHHHHHHHHHHhhhhcCceeeeeccCcccccccchhHHHHHHHHH--HHHHHHhhhHHHHHHHHHhhcC--- Q lcl|NC_019725. 78 LNSDISGVPEFLSSKMDRIVSLSGIHEIIIKNKNVGGVSASQNTALETFYKLV--DRKREEDYRPLLEFLLPFIVEE--- 152 (237) Q Consensus 78 ~~~~lsGl~dl~~~~~~~iaa~s~iP~t~L~G~sp~GlnatGe~D~~nyyd~I--~~~Qe~~l~p~l~~l~~~i~~s--- 152 (237) .+.+.+++...++.+.+.|...+++|-.-. +.- + -|.||..=..-|.... ....+..+++.+++++.+++.- T Consensus 313 ~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~-~~~-~-gn~Sg~Al~~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~~~ 389 (499) T protein:vir:10 313 KSFDETQVNLLSQSIENDIHKISYVPNMND-EKF-M-GNVSGEAMKFKLFGLENLLSIKQRYFFDGLRRRLKLIQTIVNI 389 (499) T ss_pred ccCCHHHHHHHHHHHHHHHHHHhCcccCCc-hhh-c-ccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc Confidence 556778999999999999999999996321 111 1 1345654333344433 3444577899999998887631 Q ss_pred -------CCceeEeCCCCCCCHHHHHHHHHHHHHHHH---HHHhCCCC-CHHHHHHHHHhhccc-----cccCCCCCCCh Q lcl|NC_019725. 153 -------EEWSIEFEPLSVPSKKEESEITKNNVESVT---KAITEQII-DLEEARDTLRSIAPE-----FKLKDGNNINI 216 (237) Q Consensus 153 -------~~~~~~f~pL~~~seke~Aei~~~~A~a~~---~~~~~g~i-~~~e~r~~l~~~~~~-----~g~~~~~~~~~ 216 (237) .++++.|+|=...++.+.|++..+.+..++ .+-..+.+ ++++..+++.+...+ .....+.+.+. T Consensus 390 ~~~~~d~~~i~i~f~~~~p~n~~e~~~~~~kl~g~iS~et~~~~l~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~ 469 (499) T protein:vir:10 390 KGANDDASGCKISLVANIPSNLSDVVNNVKNADGIIPRKYTYSWLPDVDNPQDVIDEMNQQDAETIKKNQEALRGQDPDR 469 (499) T ss_pred cCCccccccceEEeCCCCCCCHHHHHHHHHHHhccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHhhhccCCCCC Confidence 367899999999999999998877543221 11223333 344445555332110 11111111111 Q ss_pred hccc-cCCCCCCCCCCCcCc--CC Q lcl|NC_019725. 217 REPE-ETTEPEPGLGEKLED--EN 237 (237) Q Consensus 217 ~~~e-~~~e~~~~~~~~~~~--e~ 237 (237) -..+ ..++++++.+++..+ .+ T Consensus 470 ~~~~~~~~~~~~~~~~~~~~~~~~ 493 (499) T protein:vir:10 470 LELEDKQDDSSENDKEAGSNHNQS 493 (499) T ss_pred CCCCCCCcccCCCCCCCccccccC Confidence 1111 112222233333222 22 No 134 >protein:vir:9306 Length: 511 # NCBI annotation: phi Mu50B-like protein # Family: family:all:125 # MgeID: mge:165 # MgeName: phi 11 # Cross-refs: genbank:acc:NP_803284;genbank:gi:29028594;genbank:GeneID:1258040 Probab=98.01 E-value=7.9e-07 Score=54.04 Aligned_cols=233 Identities=12% Similarity=0.080 Sum_probs=116.6 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHhccceeechhHHHhhcCCchHHHHHHHHHHHHHhc-Cchheeeeec-CCcceeee Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRRKQQAVWKVKGLAEMCDDDDAQYAARLRLAQVDDNS-GVGRAIGIDA-ETEEYDVL 78 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~~~~~v~k~~~l~~~~~~~~~e~~~~~r~~~~~~~r-~~~~~~~iD~-~~e~~~~~ 78 (237) .++.+.+.+.+++.+....+..+..+...++.+.|.... ........-..++....... .....+-.+. .+-.|-.. T Consensus 257 d~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~G~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~ 335 (511) T protein:vir:93 257 DYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLNL-DPVEVRKQKEANVLFLEPTVYADSEGRETEGSVDGGYIYK 335 (511) T ss_pred chhhHHHHHHHHHHHHHHHHHHHHHhhCcceeeecCccc-CchhhcccccccceecccccccccccccCCCCcceeEEee Confidence 788889999999999999999888888777766653211 01100000001111000000 0000011111 11235556 Q ss_pred ecCcCCHHHHHHHHHHHHhhhhcCceeeeeccCcccccccchhHHHHHHHH--HHHHHHHhhhHHHHHHHHHhhc----- Q lcl|NC_019725. 79 NSDISGVPEFLSSKMDRIVSLSGIHEIIIKNKNVGGVSASQNTALETFYKL--VDRKREEDYRPLLEFLLPFIVE----- 151 (237) Q Consensus 79 ~~~lsGl~dl~~~~~~~iaa~s~iP~t~L~G~sp~GlnatGe~D~~nyyd~--I~~~Qe~~l~p~l~~l~~~i~~----- 151 (237) +.+.+++...++.+.+.|...+++|-.-.-+.+ | |.||..=..-|... -...++..++..|++++.+++. T Consensus 336 ~~~~~~~~~~~~~L~~~I~~~s~~P~~~~~~~~-~--n~Sg~Al~~~~~~l~~k~~~k~~~f~~~l~~~~~li~~~l~~~ 412 (511) T protein:vir:93 336 QYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFS-G--TQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNT 412 (511) T ss_pred cCCHHHHHHHHHHHHHHHHHHhCCccccccccc-c--cchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc Confidence 778889999999999999999999976442221 2 34665433333332 2344567789999998888752 Q ss_pred -----C---CCceeEeCCCCCCCHHHHHHHHHHHHHHHH---HHHhCCCC-CHHHHHHHHHhhcccc-ccCCCCCCChhc Q lcl|NC_019725. 152 -----E---EEWSIEFEPLSVPSKKEESEITKNNVESVT---KAITEQII-DLEEARDTLRSIAPEF-KLKDGNNINIRE 218 (237) Q Consensus 152 -----s---~~~~~~f~pL~~~seke~Aei~~~~A~a~~---~~~~~g~i-~~~e~r~~l~~~~~~~-g~~~~~~~~~~~ 218 (237) . .++++.|+|-...+.++.|++..+.+..++ .+-..+.+ ++++-.++++...... ............ T Consensus 413 ~~~~~~~d~~~i~~~f~~~~p~n~~e~~~~~~kl~g~iS~et~~~~l~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~ 492 (511) T protein:vir:93 413 WSIDANKDFNTVRYVYNRNLPKSLIEELKAYIDSGGKISQTTLMSLFSFFQDPELEVKKIEEDEKESIKKAQKGIYKDPR 492 (511) T ss_pred cCcccccccccceEEeCCCCCCCHHHHHHHHHHHhccCchHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHhhhcccCCC Confidence 1 257899999999999999987665432111 11112222 2333333332211000 000000000011 Q ss_pred cccCCCCCCCCCCCcCcCC Q lcl|NC_019725. 219 PEETTEPEPGLGEKLEDEN 237 (237) Q Consensus 219 ~e~~~e~~~~~~~~~~~e~ 237 (237) ..+..+++++..+.++.|- T Consensus 493 ~~~~~~~~~~~~~~~~~~~ 511 (511) T protein:vir:93 493 DINDDEQDDDTKDTVDKKE 511 (511) T ss_pred CCCCCCCCCcccccccccC Confidence 1111111111111111111 No 135 >protein:vir:94742 Length: 409 # NCBI annotation: putative portal protein # Family: family:all:524 # MgeID: mge:1529 # MgeName: phi LC3 # Cross-refs: genbank:acc:NP_996701;genbank:gi:45597416;genbank:GeneID:2767966 Probab=98.00 E-value=1.4e-06 Score=52.75 Aligned_cols=185 Identities=12% Similarity=0.043 Sum_probs=111.6 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHhccceeechhHHHhhcCCchHHHHHHHHHHHHHhcCchheeeeecC--Cc--cee Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRRKQQAVWKVKGLAEMCDDDDAQYAARLRLAQVDDNSGVGRAIGIDAE--TE--EYD 76 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~~~~~v~k~~~l~~~~~~~~~e~~~~~r~~~~~~~r~~~~~~~iD~~--~e--~~~ 76 (237) +.+.+.....++.++......-.+-+......+-|+.. ++........++ ..++.+.++ ++ ++. T Consensus 203 I~e~v~~l~da~~r~~~~~~~~~e~~a~pqr~i~G~d~---d~~~~~~~~~~~---------~~i~~~~~d~dg~~~~v~ 270 (409) T protein:vir:94 203 ITRSGMYWQSNAKRTLERADVTAEFYSFPQKYVTGLSD---DAEPMETWKATV---------SSMLQFTKDEDGDKPTLG 270 (409) T ss_pred cchhHHHHHHHHHHHHHHHHHHHHHhcChhheeEecCC---CCcccchhhhhH---------HHhhcCCCCCCCCCceEE Confidence 66778888888888876655555555555555545321 222222222111 223444222 11 232 Q ss_pred -eeecCcCCHHHHHHHHHHHHhhhhcCceeeeeccCccccc-ccchhHHHHHHH---HHHHHHHHhhhHHHHHHHHHhhc Q lcl|NC_019725. 77 -VLNSDISGVPEFLSSKMDRIVSLSGIHEIIIKNKNVGGVS-ASQNTALETFYK---LVDRKREEDYRPLLEFLLPFIVE 151 (237) Q Consensus 77 -~~~~~lsGl~dl~~~~~~~iaa~s~iP~t~L~G~sp~Gln-atGe~D~~nyyd---~I~~~Qe~~l~p~l~~l~~~i~~ 151 (237) .-++++.++-+.+.....++|+.+++|..-|-|.+- | +||+.=...... .++.+| ..+.+.++++..+.+. T Consensus 271 q~~~~~l~~~~~~l~~~~~~~a~~t~lP~~~lg~~~~---NpsSa~Al~a~~~~L~~~a~~k~-~~fg~~~~~~~rla~~ 346 (409) T protein:vir:94 271 QFTQPSMSPFTEQLRTAAAGFAGETGLTLDDLGFVSD---NPSSVEAIKASHENLRLAGRKAQ-RSLGAGLLNVAYLAAC 346 (409) T ss_pred ecCCCChhHHHHHHHHHHHHHhhhcCCCHHHhccccC---chhHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHH Confidence 234577888899999999999999999888777652 3 556544433333 334443 4477888888877541 Q ss_pred --------CC---CceeEeCCCCCCCHHHHHHHHHHHHHHHHHHHhCC--CCCHHHHHHHHHhhccccccCCCC Q lcl|NC_019725. 152 --------EE---EWSIEFEPLSVPSKKEESEITKNNVESVTKAITEQ--IIDLEEARDTLRSIAPEFKLKDGN 212 (237) Q Consensus 152 --------s~---~~~~~f~pL~~~seke~Aei~~~~A~a~~~~~~~g--~i~~~e~r~~l~~~~~~~g~~~~~ 212 (237) ++ ++.+.|.|+..++....| ..|+++.+++++| +.+.+.+++.| |+.+.. T Consensus 347 i~~~~~~~~~~~~~~~v~W~p~~~~~~~~~a----~~aDa~~Kl~~ag~~~~~~~~~~~~l-------G~~~~d 409 (409) T protein:vir:94 347 LRDDAPYLREQFRKTKPKWEPLFEADASMLS----LIGDGAIKLNQAIPEFINKDTIRDLT-------GIEGGE 409 (409) T ss_pred HhCCCCccccccccceEEeccCCCcchHHHH----HHHHHHHHHHHhcccccchhHHHHHc-------CCCCCC Confidence 12 467889998888765554 4568999999999 44556666544 332211 No 136 >protein:vir:4194 Length: 540 # NCBI annotation: putative portal protein # Family: family:all:1379 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:88 # MgeName: psiM100 # Cross-refs: genbank:acc:NP_071819;genbank:gi:11863102;genbank:GeneID:1257604 Probab=97.98 E-value=7.6e-06 Score=48.65 Aligned_cols=220 Identities=15% Similarity=0.130 Sum_probs=102.0 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHhcc--ceeechhH--HHhhcCCchHHHHHHHHHHHHH-----hcCchheeee-ec Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRRKQQ--AVWKVKGL--AEMCDDDDAQYAARLRLAQVDD-----NSGVGRAIGI-DA 70 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~~~~--~v~k~~~l--~~~~~~~~~e~~~~~r~~~~~~-----~r~~~~~~~i-D~ 70 (237) -+..+...+.....+......+...... .++++++- .+.....+....++++++..-. ...|.+.+++ .. T Consensus 193 pi~~~~~~i~~~~~~~~~~~~~f~Ng~~p~giL~~~g~l~~e~~~~~~~~~~~~~~~~~~~~~~~~g~~~nag~~~vLe~ 272 (540) T protein:vir:41 193 RYLSAAPSILAMQKIDEYNYAFFDNYTIPSYVITVTGEFEDEMELGSDGEPTGRTVLQGLIEDNFKYLKEAPHTPLVFSI 272 (540) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCcccCchhccchHHHHHHHHHHHHHHHHHhccccccccceEEEec Confidence 3444555555555555555555443222 24555431 1111122223334444433211 1234444443 21 Q ss_pred C---CcceeeeecCcCC----HHHHHHHHHHHHhhhhcCceeeeeccCccc-cc-ccchhHHHHHHHHHHHHHHHhhhHH Q lcl|NC_019725. 71 E---TEEYDVLNSDISG----VPEFLSSKMDRIVSLSGIHEIIIKNKNVGG-VS-ASQNTALETFYKLVDRKREEDYRPL 141 (237) Q Consensus 71 ~---~e~~~~~~~~lsG----l~dl~~~~~~~iaa~s~iP~t~L~G~sp~G-ln-atGe~D~~nyyd~I~~~Qe~~l~p~ 141 (237) . .+.++....+++. +-+......+.||++-|||-.+| |...+| .| ++-+.-...||.. .|.|. T Consensus 273 ~~~~~~g~~~~pl~~~~~d~qfle~~~~~~~eIa~afgVPp~~l-G~~~~~~~n~sn~eq~~~~f~~~-------tL~P~ 344 (540) T protein:vir:41 273 PGGDTVEVTFTPLNTSQKELSFREYAAEKKHDIAAAHMIDPYRL-GITDVGPLGGNFAEVARRTYYES-------VVRPQ 344 (540) T ss_pred CCCcccceeEEecccchhHHHHHHHHHHHHHHHHHHhCCCHHHc-CcccCCCCCcccHHHHHHHHHHH-------HHHHH Confidence 1 1334443444432 23466677888999999998766 866543 44 2335556677754 35666 Q ss_pred HHHHHHHhh------cCCCceeEeCCCCCCCHHHHHHHHHHHHHHHHHHHhCCCCCHHHHHHHHHhhcc--ccccCCCCC Q lcl|NC_019725. 142 LEFLLPFIV------EEEEWSIEFEPLSVPSKKEESEITKNNVESVTKAITEQIIDLEEARDTLRSIAP--EFKLKDGNN 213 (237) Q Consensus 142 l~~l~~~i~------~s~~~~~~f~pL~~~seke~Aei~~~~A~a~~~~~~~g~i~~~e~r~~l~~~~~--~~g~~~~~~ 213 (237) ++++-..|- ...++.|+|+.-.-+.. +. +..+..++++|+++++|+|+.|-...+ +.-+.+ .+ T Consensus 345 ~~~ie~~ln~~L~~~~~~~~~i~f~~~~ll~~----D~----~~~~~~lv~~G~lT~NE~Re~L~g~e~gdd~~l~p-~n 415 (540) T protein:vir:41 345 QEIVSSVLTDFIQLKLDPGARFVFNEEILMES----EF----VHNYALLVQCGVLTPSEVREKLFGLDGGPDMFMVP-SS 415 (540) T ss_pred HHHHHHHHHHhhhhccCCceEEEecchhhcch----HH----HHHHHHHHhCCCCCHHHHHHHhCcCcCCCcccccc-cc Confidence 666544332 13477888886443322 21 223557899999999999986622111 100111 01 Q ss_pred CChhcc---------------cc-CCCCCCCCCCCcCcCC Q lcl|NC_019725. 214 INIREP---------------EE-TTEPEPGLGEKLEDEN 237 (237) Q Consensus 214 ~~~~~~---------------e~-~~e~~~~~~~~~~~e~ 237 (237) ....+. +. ..+.+|...+..+.+. T Consensus 416 ~~~~~~~~~~~~~~~~~~~~~~k~~~~~~~~~~~~~~~~~ 455 (540) T protein:vir:41 416 IGKSAMKRQKRNYEKNQINEIKRTYAKYKPRIQEIISSES 455 (540) T ss_pred cccccccccccccCCCCccccccccchhcccccCcccccc Confidence 111000 00 0111122222111111 No 137 >protein:vir:93747 Length: 472 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1475 # MgeName: 55 # Cross-refs: genbank:acc:YP_240454;genbank:gi:66396119;genbank:GeneID:5133516 Probab=97.98 E-value=6e-06 Score=49.22 Aligned_cols=215 Identities=13% Similarity=0.138 Sum_probs=117.5 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHhccceeechhHHHhhcCCchHHHHHHHHHHHHHhcCchheeeeecCC-cceeeee Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRRKQQAVWKVKGLAEMCDDDDAQYAARLRLAQVDDNSGVGRAIGIDAET-EEYDVLN 79 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~~~~~v~k~~~l~~~~~~~~~e~~~~~r~~~~~~~r~~~~~~~iD~~~-e~~~~~~ 79 (237) .++.+.+.+.+++.+....+.-+..+....+.+.|... .+. ......+. ..+++.++.++ -+|-..+ T Consensus 241 ~~e~v~~liDa~~~~~s~~~~~~~~~~~~~~~~~g~~~----~~~-~~~~~~~~-------~~~~~~~~~~~~~~~l~~~ 308 (472) T protein:vir:93 241 DIFMYKTLIDAYNRRLSDLSNTFKDSNELTYVLTNYDD----QEL-PEFKRLLR-------YYGAIKVSDNGGVDTIQVE 308 (472) T ss_pred chhhhHHHHHHHHHHHHHHHHHHHHhcCceeEeecCCc----ccc-hhhHHHHh-------hccccccCCCCcceeEeec Confidence 67788899999999988888888888877777766421 110 11111111 23344444432 2244457 Q ss_pred cCcCCHHHHHHHHHHHHhhhhcCceeeeeccCcccccccchhHHHHHHHHH--HHHHHHhhhHHHHHHHHHhhcC----- Q lcl|NC_019725. 80 SDISGVPEFLSSKMDRIVSLSGIHEIIIKNKNVGGVSASQNTALETFYKLV--DRKREEDYRPLLEFLLPFIVEE----- 152 (237) Q Consensus 80 ~~lsGl~dl~~~~~~~iaa~s~iP~t~L~G~sp~GlnatGe~D~~nyyd~I--~~~Qe~~l~p~l~~l~~~i~~s----- 152 (237) .+.+++...++.+...|+..+++|-.-+ +. -+| |.||..=...|...+ ....+..+...+++++.+++.- T Consensus 309 ~~~~~~~~~~~~l~~~i~~~s~~p~~~~-~~-~~~-n~Sg~Al~~~~~~l~~ka~~~~~~~~~~l~~~~~li~~~~~~~~ 385 (472) T protein:vir:93 309 VPVENSKKYLDELYQKIMLFGQAVDFSS-DK-FGS-APSGVALEFLYTNLNLKADKLARKAKVAIQELLWFVFEHFDIKG 385 (472) T ss_pred CCHHHHHHHHHHHHHHHHHHhCCCCCCc-cc-ccc-CchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCc Confidence 7888999999999999999999996533 11 112 446654333344333 2445567888999988887532 Q ss_pred --CCceeEeCCCCCCCHHHHHHHHHHHHHHHH---HHHhCCCC-CHHHHHHHHHhhcccc-ccCCCCCCChhccccCCCC Q lcl|NC_019725. 153 --EEWSIEFEPLSVPSKKEESEITKNNVESVT---KAITEQII-DLEEARDTLRSIAPEF-KLKDGNNINIREPEETTEP 225 (237) Q Consensus 153 --~~~~~~f~pL~~~seke~Aei~~~~A~a~~---~~~~~g~i-~~~e~r~~l~~~~~~~-g~~~~~~~~~~~~e~~~e~ 225 (237) .++++.|+|-...+.++.|++..+.+..++ .+-..+.+ ++++..+++++.-.+. ...+ +......+.. T Consensus 386 ~~~~i~v~f~~~~p~~~~~~~~~~~k~~giis~et~l~~l~~~~d~~~E~~ri~~E~~~~~~~~~--~~~~~~~d~~--- 460 (472) T protein:vir:93 386 EHKDVDISFNYNKVANTELQVQTAQQSMGIVSHETVLENHPFVEDLQAELERIEQEQMEYNKQLP--NLDDGGADGA--- 460 (472) T ss_pred ccceeeEEeCCCCCCCHHHHHHHHHHHhccCchHHHHHhCCCCCCHHHHHHHHHHHHHHHHHhcc--CcCcccCCCC--- Confidence 378899999999999999887776543221 11222322 2333333332211000 0000 0000000000 Q ss_pred CCCCCCCcCcCC Q lcl|NC_019725. 226 EPGLGEKLEDEN 237 (237) Q Consensus 226 ~~~~~~~~~~e~ 237 (237) +.++..++++ T Consensus 461 --~~~~~~~~~~ 470 (472) T protein:vir:93 461 --QQQERSNNKE 470 (472) T ss_pred --CCCCCCCccc Confidence 0011111111 No 138 >protein:vir:99522 Length: 470 # NCBI annotation: putative protein # Family: family:all:125 # MgeID: mge:1559 # MgeName: Lj928 # Cross-refs: genbank:acc:NP_958533;genbank:gi:41179315;genbank:GeneID:2717160 Probab=97.95 E-value=3.2e-06 Score=50.71 Aligned_cols=215 Identities=13% Similarity=0.112 Sum_probs=116.7 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHhccceeechhHHHhhcCCchHHHHHHHHHHHHHhcCchheeeeec----CCcc-- Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRRKQQAVWKVKGLAEMCDDDDAQYAARLRLAQVDDNSGVGRAIGIDA----ETEE-- 74 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~~~~~v~k~~~l~~~~~~~~~e~~~~~r~~~~~~~r~~~~~~~iD~----~~e~-- 74 (237) .++.+.+.+.+++.+....+.-+..+....+.+.|.. .-.+..++. +. ..+. .+++.+.. ++-+ T Consensus 233 d~e~v~~liDa~~~~~s~~~~~~~~~~~~~~~i~g~~-~~~~~~g~~-~~-------~~~~-~~~~~~~~~~~~~~~~~~ 302 (470) T protein:vir:99 233 IFDSIKTLINALDKVISQKANQVEYFDNAYMYMIGFK-LPEDDEGNP-KF-------DFKN-NRVLYVSQLDPDTNPQIG 302 (470) T ss_pred chHhHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCC-cccccccch-hh-------hhhh-cceeeecCCCCCCCCcce Confidence 6788889999999999888888887777777776632 111122221 11 1111 23333321 1222 Q ss_pred eeeeecCcCCHHHHHHHHHHHHhhhhcCceeeeeccCcccccccchhHHHHHHHHH--HHHHHHhhhHHHHHHHHHhhcC Q lcl|NC_019725. 75 YDVLNSDISGVPEFLSSKMDRIVSLSGIHEIIIKNKNVGGVSASQNTALETFYKLV--DRKREEDYRPLLEFLLPFIVEE 152 (237) Q Consensus 75 ~~~~~~~lsGl~dl~~~~~~~iaa~s~iP~t~L~G~sp~GlnatGe~D~~nyyd~I--~~~Qe~~l~p~l~~l~~~i~~s 152 (237) |-..+.+..++...++.+.+.|+..+++|-.- ++..-| |.||..=..-|.... ...++..+++.|++++.+++.- T Consensus 303 ~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~-~~~~~~--n~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~ 379 (470) T protein:vir:99 303 FIAKPDADQMQENLIQHLTDFIFMMAMVPNIQ-DKNFAG--NSSGVALQYKLFAMKNKADSKERKFDKSLMQLYRIVLAT 379 (470) T ss_pred EEeecCChHHHHHHHHHHHHHHHHHhCCcccc-cccccc--CchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 44445667889999999999999999999532 332212 346654333333332 3444567899999988886521 Q ss_pred -----------CCceeEeCCCCCCCHHHHHHHHHHHHHHH---HHHHhCCCCCHHHHHHHHHhhccccccCCCCCCChhc Q lcl|NC_019725. 153 -----------EEWSIEFEPLSVPSKKEESEITKNNVESV---TKAITEQIIDLEEARDTLRSIAPEFKLKDGNNINIRE 218 (237) Q Consensus 153 -----------~~~~~~f~pL~~~seke~Aei~~~~A~a~---~~~~~~g~i~~~e~r~~l~~~~~~~g~~~~~~~~~~~ 218 (237) .++++.|+|-...++.+.|++..+-+..+ ..+-..+.+++++-.+++.+...+.. +. T Consensus 380 ~~~~~~~~~~~~~i~v~f~~~~p~~~~e~a~~~~kl~giis~et~l~~l~~vd~~~E~eri~~E~~~~~---------~~ 450 (470) T protein:vir:99 380 LFNNKQDQELWSELDFKFTRNLPEDMASAIDNAKNAEGIVSKKTQLGMIPDIEPDAEMKQIAKEKADAI---------KQ 450 (470) T ss_pred HhccCCcccccccceEEeCCCCCcCHHHHHHHHHHHhccCCHHHHHHhCCCCCHHHHHHHHHHHHHHHH---------HH Confidence 26789999999999999998876643222 11222334444443333332110000 00 Q ss_pred cccCCCCCCCCCCCcCcCC Q lcl|NC_019725. 219 PEETTEPEPGLGEKLEDEN 237 (237) Q Consensus 219 ~e~~~e~~~~~~~~~~~e~ 237 (237) ......+.+..+...++|. T Consensus 451 ~~~~~~~~d~~~~d~~~ee 469 (470) T protein:vir:99 451 TQQLSMPIDILKRDNNAEE 469 (470) T ss_pred HHhhcCCCCcCCCCCCccC Confidence 0000000011100011111 No 139 >protein:vir:103951 Length: 511 # NCBI annotation: phage portal protein # Family: family:all:125 # MgeID: mge:1662 # MgeName: phiNM # Cross-refs: genbank:acc:YP_873988;genbank:gi:118430763;genbank:GeneID:4525445 Probab=97.92 E-value=2e-06 Score=51.78 Aligned_cols=233 Identities=12% Similarity=0.076 Sum_probs=115.3 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHhccceeechhHHHhhcCCchHHHHHHHHHHHHHhc-CchheeeeecC-Ccceeee Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRRKQQAVWKVKGLAEMCDDDDAQYAARLRLAQVDDNS-GVGRAIGIDAE-TEEYDVL 78 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~~~~~v~k~~~l~~~~~~~~~e~~~~~r~~~~~~~r-~~~~~~~iD~~-~e~~~~~ 78 (237) .++.+.+-+.+++.+....+..+..+....+.+.|.... ........-..++....... ......-.+.. +-.|-.. T Consensus 257 d~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~g~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~l~~ 335 (511) T protein:vir:10 257 DYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLNL-DPVEVRKQKEANVLFLEPTVYADSEGRETEGSVDGGYIYK 335 (511) T ss_pred chhhhHHHHHHHHHHHHHHHHHHHHhhCceeeeeccccC-CchhhccchhccceecccccccccccccCCCCcceeEEee Confidence 788888999999999999999888888877776653211 11110000001111111000 00111111111 1234456 Q ss_pred ecCcCCHHHHHHHHHHHHhhhhcCceeeeeccCcccccccchhHHHHHHHHH--HHHHHHhhhHHHHHHHHHhhc----- Q lcl|NC_019725. 79 NSDISGVPEFLSSKMDRIVSLSGIHEIIIKNKNVGGVSASQNTALETFYKLV--DRKREEDYRPLLEFLLPFIVE----- 151 (237) Q Consensus 79 ~~~lsGl~dl~~~~~~~iaa~s~iP~t~L~G~sp~GlnatGe~D~~nyyd~I--~~~Qe~~l~p~l~~l~~~i~~----- 151 (237) +.+.+++...++.+...|...+++|-.-.-+.+ | |.||..=..-|.... ...++..++..|++++++++. T Consensus 336 ~~~~~~~e~~~~~L~~~I~~~s~~P~~~~~~~~-~--n~Sg~Al~~~~~~l~~k~~~k~~~f~~~l~~~~~li~~~~~~~ 412 (511) T protein:vir:10 336 QYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFS-G--TQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNT 412 (511) T ss_pred cCCHHHHHHHHHHHHHHHHHHhCCccccccccc-c--cchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh Confidence 777889999999999999999999986442222 2 345654333332222 344556789999998888742 Q ss_pred -----C---CCceeEeCCCCCCCHHHHHHHHHHHHHHHH---HHHhCCCC-CHHHHHHHHHhhccccc-cCCCCCCChhc Q lcl|NC_019725. 152 -----E---EEWSIEFEPLSVPSKKEESEITKNNVESVT---KAITEQII-DLEEARDTLRSIAPEFK-LKDGNNINIRE 218 (237) Q Consensus 152 -----s---~~~~~~f~pL~~~seke~Aei~~~~A~a~~---~~~~~g~i-~~~e~r~~l~~~~~~~g-~~~~~~~~~~~ 218 (237) . .+++|.|+|-...+.++.+++..+.+..++ .+-..+.+ ++++-.+++.+...+.- ........+.. T Consensus 413 ~~~~~~~d~~~i~i~f~~~~p~d~~~~~~~~~kl~G~iS~et~~~~l~~v~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~ 492 (511) T protein:vir:10 413 RSIDANKDFNTVRYVYNRNLPKSLIEELKAYIDSGGKISQTTLMSLFSFFQDPELEVKKIEEDEKESIKKAQKGIYKDPR 492 (511) T ss_pred CCcccccccceeeEEeCCCCCcCHHHHHHHHHHHhccCcHHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHhhhcccCCC Confidence 1 257899999999999999887665432111 11122222 23333333322110000 00000000000 Q ss_pred cccCCCCCCCCCCCcCcCC Q lcl|NC_019725. 219 PEETTEPEPGLGEKLEDEN 237 (237) Q Consensus 219 ~e~~~e~~~~~~~~~~~e~ 237 (237) ..+..++++...+..+.|- T Consensus 493 ~~~~~~~~~~~~~~~~~~~ 511 (511) T protein:vir:10 493 DINDDEQDDDTKDTVDKKE 511 (511) T ss_pred CCCCCCCCCcccCcccccC Confidence 0001111111111111111 No 140 >protein:vir:4995 Length: 384 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:109 # MgeName: Sfi21 # Cross-refs: genbank:acc:NP_049969;genbank:gi:9632941;genbank:GeneID:1262104 Probab=97.91 E-value=3.6e-06 Score=50.41 Aligned_cols=205 Identities=9% Similarity=0.052 Sum_probs=94.4 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHhcc--ceeechhHHHhhcCCchHHHHHHHHHHHHHhcCchheeeeecCCcceeee Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRRKQQ--AVWKVKGLAEMCDDDDAQYAARLRLAQVDDNSGVGRAIGIDAETEEYDVL 78 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~~~~--~v~k~~~l~~~~~~~~~e~~~~~r~~~~~~~r~~~~~~~iD~~~e~~~~~ 78 (237) .++.+.+.+.....+......+...... .++++++- ... +...++..+......+..+.++++. +.+|..+ T Consensus 176 ~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~il~~~~~---~~~---~~~~~~~~~~~~~~~n~~~~~vl~~-g~~~~~l 248 (384) T protein:vir:49 176 PLMALGRELNIQKASDKLTLNALKNALNANGILKIKGG---GLL---DFKTKQSRSRQAMKQMQGGPLVLDD-LEDFTPL 248 (384) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCC---CCh---HHHHHHHHHHHhcccCCccceecCC-CceEEEc Confidence 6778888888888888888888776444 45666531 111 2222222232333344455566664 5788888 Q ss_pred ecCcCCH--HHHHHHHHHHHhhhhcCceeeeeccCcccccccchhHHHHHHHHHHHHHHHhhhHHHHHHHHHhhcCCCce Q lcl|NC_019725. 79 NSDISGV--PEFLSSKMDRIVSLSGIHEIIIKNKNVGGVSASQNTALETFYKLVDRKREEDYRPLLEFLLPFIVEEEEWS 156 (237) Q Consensus 79 ~~~lsGl--~dl~~~~~~~iaa~s~iP~t~L~G~sp~GlnatGe~D~~nyyd~I~~~Qe~~l~p~l~~l~~~i~~s~~~~ 156 (237) +.+..-. .+......+.||.+-|||..+|-+.+.+.. ++ +.++......++|.|..+...+...=+.. T Consensus 249 ~~~~~d~q~~e~~~~~~~~Ia~~fgVp~~~lg~~~~~~~--~~--------~~~~~~~~~~i~~~l~pi~~~i~~~l~~~ 318 (384) T protein:vir:49 249 EIKSNVAQLLSQADWTTGQFAKVYGIPESVVGGEGDKQS--SL--------EMIYNIYFKAVSRFLRPFVSELSKKLSCE 318 (384) T ss_pred cCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCccc--cH--------HHHHHHHHHHHHHHHHHHHHHHHHHhchh Confidence 7665433 456677889999999999887755332222 22 22333333444444444433332111101 Q ss_pred eEeCCCCCCCHHHHHHHHHHHHHHHHHHHhCCCCCHHHHHHHHHhhccccccCCCCCCChhccccCCCCCCCCCCCcCc Q lcl|NC_019725. 157 IEFEPLSVPSKKEESEITKNNVESVTKAITEQIIDLEEARDTLRSIAPEFKLKDGNNINIREPEETTEPEPGLGEKLED 235 (237) Q Consensus 157 ~~f~pL~~~seke~Aei~~~~A~a~~~~~~~g~i~~~e~r~~l~~~~~~~g~~~~~~~~~~~~e~~~e~~~~~~~~~~~ 235 (237) +.+ .++..++.+-.... .-...++.+|+.+..|+|+.|... |+.+ .+ .-+.+.. .+-+|+.+++.= T Consensus 319 l~~----~~~~~~~~~~~~~~-~~~~~l~~~~~~t~~e~~~~l~~~----g~~~-ne--~r~~~~~-~p~~gGd~~~~~ 384 (384) T protein:vir:49 319 VDA----DILPAVDPTGSNYI-GLINSMVKTGTLAQNQGLYVLQQA----EILP-KD--LPEGETD-STLKGGETNEQY 384 (384) T ss_pred hhh----hhhhhhhccchHHH-HHHHHHhhcCcccHHHHHHHHhhC----CCCC-hh--HHHHcCC-CCCCCCCCCCCC Confidence 110 00111111111111 112235556666666666655421 2221 10 1111111 011111111000 No 141 >protein:vir:96240 Length: 511 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1607 # MgeName: 69 # Cross-refs: genbank:acc:YP_239567;genbank:gi:66395299;genbank:GeneID:5132789 Probab=97.90 E-value=1.9e-06 Score=51.92 Aligned_cols=224 Identities=12% Similarity=0.084 Sum_probs=114.6 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHhccceeechhHHHhhcCCchHHHHHHHHHHHHHhc-CchheeeeecC-Ccceeee Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRRKQQAVWKVKGLAEMCDDDDAQYAARLRLAQVDDNS-GVGRAIGIDAE-TEEYDVL 78 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~~~~~v~k~~~l~~~~~~~~~e~~~~~r~~~~~~~r-~~~~~~~iD~~-~e~~~~~ 78 (237) .++.+.+.+.+++.+....+.-++.++..++.+.|.... ...+....-..++..+.... ...+..-.+.. +-.|-.. T Consensus 257 d~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~g~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~ 335 (511) T protein:vir:96 257 DYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLNL-DPVEVRKQKEANVLFLEPTVYADSEGRETEGSVDGGYIYK 335 (511) T ss_pred chhhhHHHHHHHHHHHHHHHHHHHHhhCceeeeecCccC-CchhhcccccccceecccccccccccccCCCCcceeEEee Confidence 788889999999999999999898888877777663211 01100000001111110000 01111111111 1234456 Q ss_pred ecCcCCHHHHHHHHHHHHhhhhcCceeeeeccCcccccccchhHHHHHHHHH--HHHHHHhhhHHHHHHHHHhhc----- Q lcl|NC_019725. 79 NSDISGVPEFLSSKMDRIVSLSGIHEIIIKNKNVGGVSASQNTALETFYKLV--DRKREEDYRPLLEFLLPFIVE----- 151 (237) Q Consensus 79 ~~~lsGl~dl~~~~~~~iaa~s~iP~t~L~G~sp~GlnatGe~D~~nyyd~I--~~~Qe~~l~p~l~~l~~~i~~----- 151 (237) +.+.+++...++.+.+.|...+++|-.-.-+.+ | |.||..=..-|.... ...++..++..|++++++++. T Consensus 336 ~~~~~~~e~~~~~L~~~I~~~s~~p~~~~~~~~-~--n~Sg~Al~~~~~~l~~k~~~k~~~~~~~l~~~~~li~~~~~~~ 412 (511) T protein:vir:96 336 QYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFS-G--TQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNT 412 (511) T ss_pred cCCHHHHHHHHHHHHHHHHHHhCCccccccccc-c--cchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh Confidence 677889999999999999999999986543322 2 345654333333222 344556788999988888652 Q ss_pred -----C---CCceeEeCCCCCCCHHHHHHHHHHHHHHHHHHHhCCCCCHHHHHHH-------------HHhhcccc-ccC Q lcl|NC_019725. 152 -----E---EEWSIEFEPLSVPSKKEESEITKNNVESVTKAITEQIIDLEEARDT-------------LRSIAPEF-KLK 209 (237) Q Consensus 152 -----s---~~~~~~f~pL~~~seke~Aei~~~~A~a~~~~~~~g~i~~~e~r~~-------------l~~~~~~~-g~~ 209 (237) . .++++.|+|-...+.++.+++..+.+ |+||.+.+.+. +.....+. ... T Consensus 413 ~~~~~~~d~~~i~~~f~~~~p~n~~e~~~~~~kl~---------G~iS~et~l~~l~~v~D~~~E~~ri~~E~~~~~~~~ 483 (511) T protein:vir:96 413 WSIDANKDFNTVRYVYNRNLPKSLIEELKAYIDSG---------GKISQTTLMSLFSFFQDPELEVKKIEEDEKESIKKA 483 (511) T ss_pred cCcccccccccceEEeCCCCCCCHHHHHHHHHHHh---------ccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHH Confidence 1 25789999999999999988765532 44554444432 22110000 000 Q ss_pred CCCCCChhccccCCCCCCCCCCCcCcCC Q lcl|NC_019725. 210 DGNNINIREPEETTEPEPGLGEKLEDEN 237 (237) Q Consensus 210 ~~~~~~~~~~e~~~e~~~~~~~~~~~e~ 237 (237) ........+.....++++...+..+.|- T Consensus 484 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 511 (511) T protein:vir:96 484 QKGIYKDPRDINDDEQDDDTKDTVDKKE 511 (511) T ss_pred hhccccCCCCCCCCCCCCcccccccccC Confidence 0000000000000111111111111111 No 142 >protein:vir:99781 Length: 511 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1497 # MgeName: phiETA2 # Cross-refs: genbank:acc:YP_001004303;genbank:gi:122891757;genbank:GeneID:4712336 Probab=97.90 E-value=2e-06 Score=51.86 Aligned_cols=226 Identities=15% Similarity=0.111 Sum_probs=115.8 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHhccceeechhHHHhhcCCchHHHHH-HHHHHHHHhc-CchheeeeecC-Ccceee Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRRKQQAVWKVKGLAEMCDDDDAQYAAR-LRLAQVDDNS-GVGRAIGIDAE-TEEYDV 77 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~~~~~v~k~~~l~~~~~~~~~e~~~~-~r~~~~~~~r-~~~~~~~iD~~-~e~~~~ 77 (237) .++.+.+.+.+++.+....+.-+..+....+.+.|.... ..+.. .... .++....... ...+.+-.+.. +-.|-. T Consensus 257 d~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~G~~~~-~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~l~ 334 (511) T protein:vir:99 257 DYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLNL-DPVEV-RKQKEANVLFLEPTVYADSEGRETEGSVDGGYIY 334 (511) T ss_pred chhhhHHHHHHHHHHHHHHHHHHHHhhchhhhhccCccc-Cchhh-cccccccceecccccccccccccCCCCcceeEEe Confidence 788888999999999999998888877777666553211 11110 0011 1111110000 01111111111 122445 Q ss_pred eecCcCCHHHHHHHHHHHHhhhhcCceeeeeccCcccccccchhHHHHHHHH--HHHHHHHhhhHHHHHHHHHhhcC--- Q lcl|NC_019725. 78 LNSDISGVPEFLSSKMDRIVSLSGIHEIIIKNKNVGGVSASQNTALETFYKL--VDRKREEDYRPLLEFLLPFIVEE--- 152 (237) Q Consensus 78 ~~~~lsGl~dl~~~~~~~iaa~s~iP~t~L~G~sp~GlnatGe~D~~nyyd~--I~~~Qe~~l~p~l~~l~~~i~~s--- 152 (237) .+.+.+++...++.+.+.|...+++|-.-.-+.+ | |.||..=..-|... -...++..++..|++++++++.- T Consensus 335 ~~~~~~~~e~~~~~L~~~I~~~s~~P~~~~~~~~-g--n~Sg~Alk~~~~~l~~ka~~k~~~~~~~l~~~~~li~~~~~~ 411 (511) T protein:vir:99 335 KQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFS-G--TQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKN 411 (511) T ss_pred ecCCHHHHHHHHHHHHHHHHHHhCCccccccccc-c--cchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh Confidence 5667789999999999999999999986542222 2 44665433333333 23455677899999988886521 Q ss_pred ----------CCceeEeCCCCCCCHHHHHHHHHHHHHHHH---HHHhCCCC-CHHHHHHHHHhhccc-------cccCCC Q lcl|NC_019725. 153 ----------EEWSIEFEPLSVPSKKEESEITKNNVESVT---KAITEQII-DLEEARDTLRSIAPE-------FKLKDG 211 (237) Q Consensus 153 ----------~~~~~~f~pL~~~seke~Aei~~~~A~a~~---~~~~~g~i-~~~e~r~~l~~~~~~-------~g~~~~ 211 (237) .++.|.|+|-...+.++.|++..+.+..++ .+-..+.+ ++++-.+++++...+ ...... T Consensus 412 ~~~~~~~~~~~~i~i~f~~~~p~n~~e~~~~~~kl~GiiS~et~l~~l~~v~D~~~E~~ri~~E~~~~~~~~~~~~~~~~ 491 (511) T protein:vir:99 412 TRSIDVSKDFNTVRYVYNRNLPKSLIEELKAYIDSGGKISQTTLMSLFSFFQDPELEVKKIEEDEKESIKKAQKNMYQDP 491 (511) T ss_pred cCCcccccccccceEEeCCCCCcCHHHHHHHHHHHhccCCHHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHhhcccccC Confidence 257899999999999999987665542221 11222333 233333333321100 000000 Q ss_pred CCCChhccccCCCCCCCCCCCcCcC Q lcl|NC_019725. 212 NNINIREPEETTEPEPGLGEKLEDE 236 (237) Q Consensus 212 ~~~~~~~~e~~~e~~~~~~~~~~~e 236 (237) ....+++.++.++ .+..+.| T Consensus 492 ~~~~~~~~~~~~~-----~~~d~~e 511 (511) T protein:vir:99 492 RNINDDEQDDSTK-----DSIDKKE 511 (511) T ss_pred CCCCCCCCCCCCc-----CcccccC Confidence 1111111111111 1111111 No 143 >protein:vir:3609 Length: 452 # NCBI annotation: ORF32 # Family: family:all:125 # MgeID: mge:74 # MgeName: TP901-1 # Cross-refs: genbank:acc:NP_112695;genbank:gi:13786563;genbank:GeneID:921063 Probab=97.89 E-value=9e-06 Score=48.24 Aligned_cols=200 Identities=14% Similarity=0.136 Sum_probs=114.8 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHhccceeechhHHHhhcCCchHHHHHHHHHHHHHhcCchheeeeecCCc------c Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRRKQQAVWKVKGLAEMCDDDDAQYAARLRLAQVDDNSGVGRAIGIDAETE------E 74 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~~~~~v~k~~~l~~~~~~~~~e~~~~~r~~~~~~~r~~~~~~~iD~~~e------~ 74 (237) .++.+.+.+.+++.+....+.-+..+....+.+.|.. + .. + -..++ + ..+++.+..+++ + T Consensus 219 d~e~v~~liDa~d~~~s~~~~~~~~~~~p~~~~~g~~-~-~~---~--~~~~~------~-~~~~~~~~~~~~~~~~~~~ 284 (452) T protein:vir:36 219 IFESVISLVNAFNKAISEKANDVDYFSDQYLTFLGAA-V-EE---E--DLKNI------R-SNRVINYYADGEGKNVDVK 284 (452) T ss_pred chHHHHHHHHHHHHHHHHHHHHHHHhcCceeEeecCC-c-Cc---h--hhhhh------h-hcceEEecCCCCccCCcce Confidence 7788889999999999999998888888877776521 1 11 1 11111 1 134455543322 2 Q ss_pred eeeeecCcCCHHHHHHHHHHHHhhhhcCceeeeeccCcccccccchhHHHHHHHHH--HHHHHHhhhHHHHHHHHHhhc- Q lcl|NC_019725. 75 YDVLNSDISGVPEFLSSKMDRIVSLSGIHEIIIKNKNVGGVSASQNTALETFYKLV--DRKREEDYRPLLEFLLPFIVE- 151 (237) Q Consensus 75 ~~~~~~~lsGl~dl~~~~~~~iaa~s~iP~t~L~G~sp~GlnatGe~D~~nyyd~I--~~~Qe~~l~p~l~~l~~~i~~- 151 (237) |-..+.+.+++...++.+.+.|...+++|-. -++.. | |+||+.=...|.... ....+..++..|++++.+++. T Consensus 285 ~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~-~~~~~--g-n~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~ 360 (452) T protein:vir:36 285 FLEKPDSDSQTENLLDRLTKLIFQTTMVANI-SDESF--G-SSSGVSLAYKLQAMSNLALSFQRKFQSSLNSRYKLFCEL 360 (452) T ss_pred eEeecCCHHHHHHHHHHHHHHHHHHhCcccc-Ccccc--c-CCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 4455667789999999999999999999963 23322 2 557765444444433 223345688888888887752 Q ss_pred ------C---CCceeEeCCCCCCCHHHHHHHHHHHHHHHHHHHhCCCCCHHHHHHH-------------HHhhccccccC Q lcl|NC_019725. 152 ------E---EEWSIEFEPLSVPSKKEESEITKNNVESVTKAITEQIIDLEEARDT-------------LRSIAPEFKLK 209 (237) Q Consensus 152 ------s---~~~~~~f~pL~~~seke~Aei~~~~A~a~~~~~~~g~i~~~e~r~~-------------l~~~~~~~g~~ 209 (237) . .++.+.|+|-...+.++.|++..+.+ |+||.+.+.+. +++.-.+...+ T Consensus 361 ~~~~~~~~~~~~i~i~f~~~~p~d~~~~a~~~~k~~---------g~iS~et~~~~~~~~~d~~~E~~ri~~E~~~~~~~ 431 (452) T protein:vir:36 361 STNVSNKDSWKDIEYTFTRNEPKDIKEQAETANILM---------GITSQETALSVISVIPDVQAEMEKIKKEEASTAIF 431 (452) T ss_pred HhccCCccccccceEEeCCCCCcCHHHHHHHHHHHh---------ccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHH Confidence 1 36789999999999999998766543 45555444432 22110000000 Q ss_pred CCCCCChhccccCCCCCCCC---CCCcCcC Q lcl|NC_019725. 210 DGNNINIREPEETTEPEPGL---GEKLEDE 236 (237) Q Consensus 210 ~~~~~~~~~~e~~~e~~~~~---~~~~~~e 236 (237) ..+...++++. ....+.| T Consensus 432 ---------~~~~~~~~~~~~~~~~~~~~e 452 (452) T protein:vir:36 432 ---------DKDKQPSEKGTDTVVSETNEE 452 (452) T ss_pred ---------HhhccCCCCcccccCccccCC Confidence 00000011111 1111111 No 144 >protein:vir:97171 Length: 512 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1654 # MgeName: 85 # Cross-refs: genbank:acc:YP_239722;genbank:gi:66394876;genbank:GeneID:5130904 Probab=97.88 E-value=4.7e-06 Score=49.81 Aligned_cols=233 Identities=13% Similarity=0.109 Sum_probs=117.4 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHhccceeechhHHHhhcCCchHHHHHHHHHHHHHhcCchheeeeec-CC--cceee Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRRKQQAVWKVKGLAEMCDDDDAQYAARLRLAQVDDNSGVGRAIGIDA-ET--EEYDV 77 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~~~~~v~k~~~l~~~~~~~~~e~~~~~r~~~~~~~r~~~~~~~iD~-~~--e~~~~ 77 (237) .++.+.+.+.+++.+....+.-+..++..++.+.|.... ...........++-.............++. ++ -.|-. T Consensus 257 d~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~G~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~l~ 335 (512) T protein:vir:97 257 DYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLNL-DPVEVRKQKEANVLFLEPTVYENRDTGIETEGSVDGGYIY 335 (512) T ss_pred chhhhHHHHHHHHHHHHHHHHHHHHhcCceeeeecCccC-CchhhhhhhhcccccccccchhhcccccCCCCCcceEEEe Confidence 778899999999999999998888888877776653211 011111000111111111111111111111 11 23566 Q ss_pred eecCcCCHHHHHHHHHHHHhhhhcCceeeeeccCcccccccchhHHHHHHHHH--HHHHHHhhhHHHHHHHHHhhc---- Q lcl|NC_019725. 78 LNSDISGVPEFLSSKMDRIVSLSGIHEIIIKNKNVGGVSASQNTALETFYKLV--DRKREEDYRPLLEFLLPFIVE---- 151 (237) Q Consensus 78 ~~~~lsGl~dl~~~~~~~iaa~s~iP~t~L~G~sp~GlnatGe~D~~nyyd~I--~~~Qe~~l~p~l~~l~~~i~~---- 151 (237) .+.+.+++...++.+...|...+++|-.-.-+.+ | |.||..=..-|.... ...++..++..|++++.+++. T Consensus 336 ~~~~~~~~e~~~~~L~~~I~~~s~~p~~~~~~~~-g--n~Sg~Al~~~~~~l~~ka~~k~~~f~~~l~~~~~li~~~~~~ 412 (512) T protein:vir:97 336 KQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFS-G--TQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKN 412 (512) T ss_pred ecCCHHHHHHHHHHHHHHHHHHhCCcccCccccc-c--cchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh Confidence 6778899999999999999999999986542211 2 346654333333322 455567789999998888742 Q ss_pred ------C---CCceeEeCCCCCCCHHHHHHHHHHHHHHHHH---HHhCCCC-CHHHHHHHHHhhcccc-ccCCCCCCChh Q lcl|NC_019725. 152 ------E---EEWSIEFEPLSVPSKKEESEITKNNVESVTK---AITEQII-DLEEARDTLRSIAPEF-KLKDGNNINIR 217 (237) Q Consensus 152 ------s---~~~~~~f~pL~~~seke~Aei~~~~A~a~~~---~~~~g~i-~~~e~r~~l~~~~~~~-g~~~~~~~~~~ 217 (237) . .++++.|+|-...+..+.|++..+.+..++. +-..+.+ ++++..+++.+...+. .........+. T Consensus 413 ~~~~~~~~d~~~i~~~f~~~~p~~~~e~~~~~~kl~giiS~et~~~~l~~v~d~~~E~eri~~E~~~~~~~~~~~~~~~~ 492 (512) T protein:vir:97 413 TRSIDANKDFNTVRYVYNRNLPKSLIEELKAYIDSGGKISQTTLMSLFSFFQDPELEVKKIEEDEKESIKKAQKGIYKDP 492 (512) T ss_pred cCCcccccccccceEEeCCCCCcCHHHHHHHHHHHhccCchHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHhhcccCCC Confidence 1 2578999999999999998876654321111 1112222 2333333332211000 00000000000 Q ss_pred ccccCCCCCCCCCCCcCcCC Q lcl|NC_019725. 218 EPEETTEPEPGLGEKLEDEN 237 (237) Q Consensus 218 ~~e~~~e~~~~~~~~~~~e~ 237 (237) ...+..++++...+..+.|- T Consensus 493 ~~~~~~~~~~~~~~~~~~~~ 512 (512) T protein:vir:97 493 RDINDDEQDDDTKDTVDKKE 512 (512) T ss_pred CCCCCCCCCCCccccccccC Confidence 11111111111111111111 No 145 >protein:vir:104082 Length: 485 # NCBI annotation: gp14 # Family: family:all:524 # MgeID: mge:1656 # MgeName: Che12 # Cross-refs: genbank:acc:YP_655593;genbank:gi:109392464;genbank:GeneID:4156950 Probab=97.86 E-value=5.1e-06 Score=49.60 Aligned_cols=219 Identities=9% Similarity=0.079 Sum_probs=101.0 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHhccceeechhHH--Hh-hcCCchHHHHHHHHHHHHHhcCchheeeeecCCcceee Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRRKQQAVWKVKGLA--EM-CDDDDAQYAARLRLAQVDDNSGVGRAIGIDAETEEYDV 77 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~~~~~v~k~~~l~--~~-~~~~~~e~~~~~r~~~~~~~r~~~~~~~iD~~~e~~~~ 77 (237) +...|.+.+.+++++.......+..+......+.|.. +. ..++.+. ..+... ...++.+.+++-+|-+ T Consensus 230 i~~~v~~liDa~~~~~s~~~~~~~~~a~p~~~i~G~~~~~~~~~~~~~~-------~~~~~~--~~~i~~~~~~d~k~~q 300 (485) T protein:vir:10 230 ITPELRSMTDAAARILMLMQATAELMGVPQRLIFGIKPEEIGVDPETGQ-------TLFDAY--LARILAFEDAEGKIQQ 300 (485) T ss_pred hhHHHHHHHHHHHHHHHHHHHHHHhhcchHHHHhcCCcccccccccccc-------hhhhhc--ccceeccCCCCceEEe Confidence 3344666667777766655554444443333332321 11 0111111 111111 1122333222233433 Q ss_pred -eecCcCCHHHHHHHHHHHHhhhhcCceeeeeccCcccccccchhHHHHHHHHHH--HHHHHhhhHHHHHHHHHhhcC-- Q lcl|NC_019725. 78 -LNSDISGVPEFLSSKMDRIVSLSGIHEIIIKNKNVGGVSASQNTALETFYKLVD--RKREEDYRPLLEFLLPFIVEE-- 152 (237) Q Consensus 78 -~~~~lsGl~dl~~~~~~~iaa~s~iP~t~L~G~sp~GlnatGe~D~~nyyd~I~--~~Qe~~l~p~l~~l~~~i~~s-- 152 (237) -.+++.+..+.+.....++|+.+++|...|-| +..+ ++||+.=...+...+. ..++..+.+.|.+++.+++.- T Consensus 301 ~~~~~~~~~~~~l~~~i~~~~~~~~~p~~~fg~-~~~n-~~Sg~Al~~~~~~l~~k~~~k~~~f~~~l~~~~~l~~~~~~ 378 (485) T protein:vir:10 301 FSAAELANFTNALDQIAKQVAAYTGLPPQYLST-AADN-PASAEAIRAAESRLIKKVERKNSIFGGAWEEAMRLAYRMMK 378 (485) T ss_pred ecccchHHHHHHHHHHHHHHhcccCCCHHHhcc-ccCc-hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhC Confidence 33445667788888899999999999888744 3222 2466544443433332 233456888899888876521 Q ss_pred --------CCceeEeCCCCCCCHHHHHHHHHHHHHHH-------HHHHhCCCCCHHHHH--HHHHhhcccc------ccC Q lcl|NC_019725. 153 --------EEWSIEFEPLSVPSKKEESEITKNNVESV-------TKAITEQIIDLEEAR--DTLRSIAPEF------KLK 209 (237) Q Consensus 153 --------~~~~~~f~pL~~~seke~Aei~~~~A~a~-------~~~~~~g~i~~~e~r--~~l~~~~~~~------g~~ 209 (237) .++.+.|.|-..+|.++.|+...+-.++- ..+-..|+.+. ++. ++++..-... .+. T Consensus 379 ~~~~~~~~~~i~v~w~~~~~~~~~~~ada~~kl~~ag~~~~s~et~~~~lg~~~~-~~~~~~~~~ee~~~~~~~~~~~~~ 457 (485) T protein:vir:10 379 GGDVPPDMLRMETVWRDPSTPTYAAKADAASKLYNGGTGVIPRERARKDMGYSIA-EREEMRRWDEEEAAMGLGLIGTMV 457 (485) T ss_pred CCCCcccceeeeEEecCCCCCCHHHHHHHHHHHHhccccCCCHHHHHHhCCCCHh-HHHHHHHHHHHHHHHHHHHHHHhh Confidence 26788999999999999887765544321 11223454332 221 1111100000 000 Q ss_pred -CCCCCCh-----hccccCCCCCCCCCC Q lcl|NC_019725. 210 -DGNNINI-----REPEETTEPEPGLGE 231 (237) Q Consensus 210 -~~~~~~~-----~~~e~~~e~~~~~~~ 231 (237) +....++ .+.+...+.++|-|. T Consensus 458 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 485 (485) T protein:vir:10 458 DPNPTVPGSPSPAPAPKPAALESGGDAA 485 (485) T ss_pred ccCCCCCCCCCccccccCcCCCCCCCCC Confidence 0000000 000011111122222 No 146 >protein:vir:2500 Length: 501 # NCBI annotation: putative portal gp5 # Family: family:all:524 # MgeID: mge:53 # MgeName: TM4 # Cross-refs: genbank:acc:NP_569741;genbank:gi:18496891;genbank:GeneID:932330 Probab=97.84 E-value=4.3e-06 Score=50.00 Aligned_cols=218 Identities=9% Similarity=-0.006 Sum_probs=107.9 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHhccceeechhHHHhhcCCchHHHHHHHHHHHHHhcCchheeeeecCCcceee-ee Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRRKQQAVWKVKGLAEMCDDDDAQYAARLRLAQVDDNSGVGRAIGIDAETEEYDV-LN 79 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~~~~~v~k~~~l~~~~~~~~~e~~~~~r~~~~~~~r~~~~~~~iD~~~e~~~~-~~ 79 (237) =++.+.+.+.+++++......-..-+.....-+.|+. ++... .+.+. ...++++.+++-+|-+ -. T Consensus 258 die~v~~l~Da~~~~~s~~~~~~e~~a~p~~~i~G~~-----~~~~~-------~~~~~--~~~i~~~~~~~~~~~q~~~ 323 (501) T protein:vir:25 258 EVAPLILLQQAINSVNFDRLIVSRFGANPQRVISGWT-----GSKAE-------VLKAS--ALRVWTFEDPEVKAQAFPP 323 (501) T ss_pred hhhhhHHHHHHHHHHHHHHHHHHHhhccHHHHHhCCC-----CCccc-------hhhhc--ccceeccCCCCceEEEecc Confidence 2356777777777776665544443333322222211 11111 11111 1234444433333433 34 Q ss_pred cCcCCHHHHHHHHHHHHhhhhcCceeeeeccCcccccccchhHHHHHHHHHH--HHHHHhhhHHHHHHHHHhhcC----- Q lcl|NC_019725. 80 SDISGVPEFLSSKMDRIVSLSGIHEIIIKNKNVGGVSASQNTALETFYKLVD--RKREEDYRPLLEFLLPFIVEE----- 152 (237) Q Consensus 80 ~~lsGl~dl~~~~~~~iaa~s~iP~t~L~G~sp~GlnatGe~D~~nyyd~I~--~~Qe~~l~p~l~~l~~~i~~s----- 152 (237) +++.+..+.+.....++|+.+++|...+.|.+- |.||+.=...+...+. ..++..+++.|++++.+++.- T Consensus 324 ~~~~~~~~~l~~~i~~i~~~s~~P~~~~~~~~~---N~Sg~Al~~~~~~l~~ka~~k~~~f~~~l~~~~rl~~~~~~~~~ 400 (501) T protein:vir:25 324 ASVEPYNLILEEMLQHVAMVAQISPAQVTGKMI---NVSAEALAAAEANQQRKLAAKRESFGESWEQLLRLAAEMDDDPD 400 (501) T ss_pred cChHHHHHHHHHHHHHHHhhcCCChhhhccccC---ChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCc Confidence 567788889999999999999999888876542 4466644444443332 233355788899988886521 Q ss_pred ----CCceeEeCCCCCCCHHHHHHHHHHHHHH-H--HHHH-h-CCCCCHHHHHHHH--HhhccccccCCCC-----CCC- Q lcl|NC_019725. 153 ----EEWSIEFEPLSVPSKKEESEITKNNVES-V--TKAI-T-EQIIDLEEARDTL--RSIAPEFKLKDGN-----NIN- 215 (237) Q Consensus 153 ----~~~~~~f~pL~~~seke~Aei~~~~A~a-~--~~~~-~-~g~i~~~e~r~~l--~~~~~~~g~~~~~-----~~~- 215 (237) .++++.|.|...+|.++.|+...+.+++ + ..+. . -| +++.++.+.. +......++.... ... T Consensus 401 ~~~~~~i~v~w~~~~~~s~~~~ada~~kl~~~gis~et~~~~~~g-~~~~~ie~~~~~~~e~~~~~~~~~~~~~~~~~~~ 479 (501) T protein:vir:25 401 TAADSGAEVLWRDTEARSFGAVVDGITKLASAGIPIEHLLSMVPG-MTQQTIQAIKDSLRGGEVKSLVDKLLSNEPAPVP 479 (501) T ss_pred cccceeeeEEecCCCCCCHHHHHHHHHHHHhcCCCHHHHHHHcCC-CCHHHHHHHHHHHHHHhHHHHHHHhhccCcCCCC Confidence 2678999999999999988877665543 1 1222 2 24 4555543211 1111111111000 000 Q ss_pred hhccccCCCCC-CCCCCCcCcC Q lcl|NC_019725. 216 IREPEETTEPE-PGLGEKLEDE 236 (237) Q Consensus 216 ~~~~e~~~e~~-~~~~~~~~~e 236 (237) ....++.++++ .+...+..+. T Consensus 480 ~~~~~~~~~~~~~~~~~~~~g~ 501 (501) T protein:vir:25 480 PPPPQAAAQALNEGGVNGNGGA 501 (501) T ss_pred CCCCCCCccccccccCCCCCCC Confidence 00000100000 0000111111 No 147 >protein:vir:9871 Length: 429 # NCBI annotation: hypothetical protein # Family: family:all:125 # MgeID: mge:177 # MgeName: 315.5 # Cross-refs: genbank:acc:NP_795633;genbank:gi:28876408;genbank:GeneID:1257942 Probab=97.81 E-value=7.9e-06 Score=48.55 Aligned_cols=200 Identities=17% Similarity=0.101 Sum_probs=115.3 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHhccceeechhHHHhhcCCchHHHHHHHHHHHHHhcCchheeeeecCC-----cce Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRRKQQAVWKVKGLAEMCDDDDAQYAARLRLAQVDDNSGVGRAIGIDAET-----EEY 75 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~~~~~v~k~~~l~~~~~~~~~e~~~~~r~~~~~~~r~~~~~~~iD~~~-----e~~ 75 (237) .++.+.+.+.+++.+....+.-+..+....+.+.|.. .. +. ....+ + ..+++.++..+ -+| T Consensus 203 d~e~v~~liD~~d~~~s~~~~~~~~~~~p~~~i~g~~-----~~-~~-~~~~~------~-~~~~~~~~~~~~~~~~~~~ 268 (429) T protein:vir:98 203 LLASVVTLINAFNKAISEKANDVEYFADAYLKILGAE-----LD-DE-TLKSL------R-DTRIINLKDTDAQQLTVEF 268 (429) T ss_pred cHHHHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCC-----CC-cc-hhhhH------h-hCceeeccCCCCCCcceeE Confidence 6778889999999999998888888877777665521 11 11 11111 1 12344443221 145 Q ss_pred eeeecCcCCHHHHHHHHHHHHhhhhcCceeeeeccCcccccccchhHHHHHHHHH--HHHHHHhhhHHHHHHHHHhhcC- Q lcl|NC_019725. 76 DVLNSDISGVPEFLSSKMDRIVSLSGIHEIIIKNKNVGGVSASQNTALETFYKLV--DRKREEDYRPLLEFLLPFIVEE- 152 (237) Q Consensus 76 ~~~~~~lsGl~dl~~~~~~~iaa~s~iP~t~L~G~sp~GlnatGe~D~~nyyd~I--~~~Qe~~l~p~l~~l~~~i~~s- 152 (237) -..+.+..++...++.+.+.|...+++|-.-. +.. | |+||+.=...|...+ ...++..++..+++++.+++.- T Consensus 269 l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~-~~~-g--n~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~ 344 (429) T protein:vir:98 269 LQKPDADATQEHLLDRLENLIFRTAMVANISD-ESF-G--TASGIALRYRLQAMDNLAKTKERKFMSGMNRRYKLIASYP 344 (429) T ss_pred EeecCCHHHHHHHHHHHHHHHHHHhCccccCc-ccc-c--cchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh Confidence 56677888999999999999999999996422 221 2 456654444444432 2334466888899888877531 Q ss_pred ---------CCceeEeCCCCCCCHHHHHHHHHHHHHHHHHHHhCCCCCHHHHHHHHHhhccccccCCCCCCChhccc--- Q lcl|NC_019725. 153 ---------EEWSIEFEPLSVPSKKEESEITKNNVESVTKAITEQIIDLEEARDTLRSIAPEFKLKDGNNINIREPE--- 220 (237) Q Consensus 153 ---------~~~~~~f~pL~~~seke~Aei~~~~A~a~~~~~~~g~i~~~e~r~~l~~~~~~~g~~~~~~~~~~~~e--- 220 (237) .+++++|+|-...+.++.|++..+. +|++|.+.+.+.|- ..+ +. .++.+ T Consensus 345 ~~~~~~~d~~~i~v~f~~~~p~~~~~~a~~~~kl---------~g~is~et~~~~l~-------~v~--d~-~~E~~ri~ 405 (429) T protein:vir:98 345 TSKIGPKDWIGIKYKFTRNLPANLLEESQIAGNL---------AGIVSEETQVGVLS-------IVE--NP-QKEIERKN 405 (429) T ss_pred ccCCCccccccceEEeCCCCCcCHHHHHHHHHHH---------hccCchHHHHHhCC-------CCC--CH-HHHHHHHH Confidence 3588999999999999998876654 36666655544331 000 11 11111 Q ss_pred --cCC--CCCCCCCCCcCcCC Q lcl|NC_019725. 221 --ETT--EPEPGLGEKLEDEN 237 (237) Q Consensus 221 --~~~--e~~~~~~~~~~~e~ 237 (237) +.. +...+.....+.++ T Consensus 406 ~E~~~~~~~~~~~~~~~~~~~ 426 (429) T protein:vir:98 406 SDKSTLISRQAGGLNGQNTTT 426 (429) T ss_pred HHHHHHHHHHHhhhcCCCCCC Confidence 000 00000111111111 No 148 >protein:vir:7768 Length: 484 # NCBI annotation: gp14 # Family: family:all:524 # MgeID: mge:149 # MgeName: Bxz2 # Cross-refs: genbank:acc:NP_817602;genbank:gi:29566032;genbank:GeneID:1259226 Probab=97.79 E-value=6e-06 Score=49.21 Aligned_cols=225 Identities=10% Similarity=0.061 Sum_probs=105.7 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHhccceeechhH--HHhh-cCCchHHHHHHHHHHHHHhcCchheeeeecCCcceee Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRRKQQAVWKVKGL--AEMC-DDDDAQYAARLRLAQVDDNSGVGRAIGIDAETEEYDV 77 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~~~~~v~k~~~l--~~~~-~~~~~e~~~~~r~~~~~~~r~~~~~~~iD~~~e~~~~ 77 (237) +.+.|.+.+.+++++.........-+......+.|. .+.. ..+.+ ...+... ...++++.+++-+|-+ T Consensus 229 i~~~v~~L~Da~~~~~s~~~~~~~~~a~p~~~i~G~~~~~~~~~~~~~-------~~~~~~~--~~~~~~~~~~~~~~~q 299 (484) T protein:vir:77 229 ITPELRSVTDAAARTLMLMQATAELMGVPQRLLFGVKGEELGVDPETG-------QTLFDAY--LARILAFEDHESKAQQ 299 (484) T ss_pred chHHHHHHHHHHHHHHHHHHHHHHhhhhhHHHHhCCCcchhccccccc-------chhhhhh--hhhhcccCCCCceeEe Confidence 334566667777777666555444333322222221 1111 01111 1111111 1123344443333433 Q ss_pred e-ecCcCCHHHHHHHHHHHHhhhhcCceeeeeccCcccccccchhHHHHHHHHHH--HHHHHhhhHHHHHHHHHhhcC-- Q lcl|NC_019725. 78 L-NSDISGVPEFLSSKMDRIVSLSGIHEIIIKNKNVGGVSASQNTALETFYKLVD--RKREEDYRPLLEFLLPFIVEE-- 152 (237) Q Consensus 78 ~-~~~lsGl~dl~~~~~~~iaa~s~iP~t~L~G~sp~GlnatGe~D~~nyyd~I~--~~Qe~~l~p~l~~l~~~i~~s-- 152 (237) + .+++.+..+.+.....++|+.+++|..-|-|.+ .+ ++||+.=...+...+. ..++..+++.|++++.++..- T Consensus 300 ~~~~~~e~~~~~l~~~i~~~s~~~~~p~~~fg~~~-~n-~~Sg~Al~~~~~~l~~ka~~k~~~f~~~l~~~~~l~~~~~~ 377 (484) T protein:vir:77 300 FSAAELRNFVDALDALDRKAAAYTGLPPYYLSFSS-EN-PASAEAIRSSESRLVKTVERKNKIFGGAWEQAMRVAYKVMN 377 (484) T ss_pred ecCCChHHHHHHHHHHHHHHhcccCCCHHHhcccc-Cc-chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhC Confidence 3 244566778888889999999999998885533 32 1466654444444332 223355888899888886521 Q ss_pred --------CCceeEeCCCCCCCHHHHHHHHHHHHHHH-------HHHHhCCCCCHHHHH--HHHHhhcccccc--CCC-C Q lcl|NC_019725. 153 --------EEWSIEFEPLSVPSKKEESEITKNNVESV-------TKAITEQIIDLEEAR--DTLRSIAPEFKL--KDG-N 212 (237) Q Consensus 153 --------~~~~~~f~pL~~~seke~Aei~~~~A~a~-------~~~~~~g~i~~~e~r--~~l~~~~~~~g~--~~~-~ 212 (237) .++.+.|.|...+|.++.|+...|-+++. ..+-..|+++. ++. ++++......+. .+. . T Consensus 378 ~~~~~~~~~~i~v~w~~~~~~s~~~~ad~~~kl~~~g~gi~s~et~~~~l~~~~~-~~~e~~~~~~ee~~~~~~~~~~~~ 456 (484) T protein:vir:77 378 GGDIPPEYYRMESIWRDPSTPTYAAKADAATKLYNNGQGVIPKERARIDMGYSIT-EREEMRKWDEEEQAQGLGLMGTMF 456 (484) T ss_pred CCCcccccccceEEecCCCCCCHHHHHHHHHHHHhccCCCCCHHHHHhcCCCChh-HHHHHHHHHHHHHHHHHHHHhhhc Confidence 25788999999999999988776655431 12233454433 222 222211110110 000 0 Q ss_pred CCChhccccCCCCCCCCCCCcCcCC Q lcl|NC_019725. 213 NINIREPEETTEPEPGLGEKLEDEN 237 (237) Q Consensus 213 ~~~~~~~e~~~e~~~~~~~~~~~e~ 237 (237) ............++++.+++...++ T Consensus 457 ~~~~~~~~~~~~~~~~~~~~~~~~~ 481 (484) T protein:vir:77 457 GTDPSGGGNPDNPETPEPQPNPAEE 481 (484) T ss_pred cccccCCCCCCCCCcccccCCCccc Confidence 0000000011111111111111111 No 149 >protein:vir:2427 Length: 485 # NCBI annotation: gp14 # Family: family:all:524 # MgeID: mge:52 # MgeName: D29 # Cross-refs: genbank:acc:NP_046829;genbank:gi:9630397;genbank:GeneID:1261620 Probab=97.75 E-value=7.7e-06 Score=48.62 Aligned_cols=219 Identities=9% Similarity=0.072 Sum_probs=106.1 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHhccceeechhHHH-hh--cCCchHHHHHHHHHHHHHhcCchheeeeecCCcceee Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRRKQQAVWKVKGLAE-MC--DDDDAQYAARLRLAQVDDNSGVGRAIGIDAETEEYDV 77 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~~~~~v~k~~~l~~-~~--~~~~~e~~~~~r~~~~~~~r~~~~~~~iD~~~e~~~~ 77 (237) +.+.+.+.+.+++++.......+..+....+.+.|... .. .++.+. ..+... ...++++.+++-+|-+ T Consensus 230 i~~~v~~liDa~~~~~s~~~~~~~~~a~p~~~i~G~~~~~~~~~~~~~~-------~~~~~~--~~~i~~~~~~~~~~~q 300 (485) T protein:vir:24 230 ITPELRSMTDAAARILMLMQATAELMGVPQRLIFGIKPEEIGVDPETGQ-------TLFDAY--LARILAFEDAEGKIQQ 300 (485) T ss_pred chhhHHHHHHHHHHHHHHHHHHHHhhcchhhhhccCCcccccccccccc-------chhhhc--ccceeccCCCCceEEe Confidence 44567777888888877766655555444443333211 00 111111 111111 1222333222223322 Q ss_pred -eecCcCCHHHHHHHHHHHHhhhhcCceeeeeccCcccccccchhHHHHHHHHH--HHHHHHhhhHHHHHHHHHhhcC-- Q lcl|NC_019725. 78 -LNSDISGVPEFLSSKMDRIVSLSGIHEIIIKNKNVGGVSASQNTALETFYKLV--DRKREEDYRPLLEFLLPFIVEE-- 152 (237) Q Consensus 78 -~~~~lsGl~dl~~~~~~~iaa~s~iP~t~L~G~sp~GlnatGe~D~~nyyd~I--~~~Qe~~l~p~l~~l~~~i~~s-- 152 (237) -.+++.+..+.+.....++|+.+++|...|-|.+ .+ ++||+.=..-|...+ ...++..+++.|++++.+++.- T Consensus 301 ~~~~~~e~~~~~l~~~i~~~s~~~~~p~~~fg~~~-~n-~~Sg~Al~~~~~~l~~ka~~~~~~f~~~l~~~~~l~~~~~~ 378 (485) T protein:vir:24 301 FSAAELANFTNALDQIAKQVAAYTGLPPQYLSTAA-DN-PASAEAIRAAESRLIKKVERKNAIFGGAWEEAMRLAYRLMK 378 (485) T ss_pred ecccchHHHHHHHHHHHHHHhcccCCCHHHhcccc-Cc-chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc Confidence 2334556667777788899999999987774433 22 246654333333333 2344466888999998886421 Q ss_pred --------CCceeEeCCCCCCCHHHHHHHHHHHHHHH-------HHHHhCCCCCHHHHH--HHHHhhccccc------cC Q lcl|NC_019725. 153 --------EEWSIEFEPLSVPSKKEESEITKNNVESV-------TKAITEQIIDLEEAR--DTLRSIAPEFK------LK 209 (237) Q Consensus 153 --------~~~~~~f~pL~~~seke~Aei~~~~A~a~-------~~~~~~g~i~~~e~r--~~l~~~~~~~g------~~ 209 (237) .++++.|.|-..+|..+.|+...+.+++. ..+-..|++ ++++. ++++......+ +. T Consensus 379 ~~~~~~d~~~i~v~f~~~~~~s~~~~ad~~~kl~~~g~~~~s~et~~~~l~~~-~d~~~e~~~~~ee~~~~~~~~~~~~~ 457 (485) T protein:vir:24 379 GGDVPPDMLRMETVWRDPSTPTYAAKADAATKLYGNGQGVIPRERARKDMGYS-IAEREEMRRWDEEEAAMGLGLLGTMV 457 (485) T ss_pred CCCCccccceeeEEecCCCCCCHHHHHHHHHHHHhcccccCCHHHHHhhCCCC-HhHHHHHHHHHHHHhhhhhhHHHhhc Confidence 36889999999999999998776665432 123345544 33322 12211111111 00 Q ss_pred C--CCCCChhccccCCC----CCCCCCC Q lcl|NC_019725. 210 D--GNNINIREPEETTE----PEPGLGE 231 (237) Q Consensus 210 ~--~~~~~~~~~e~~~e----~~~~~~~ 231 (237) + ......++..+..+ ++|+.++ T Consensus 458 ~~~~~~~~~~~~~e~~~~~~~~~~~~~a 485 (485) T protein:vir:24 458 DADPTVPGSPNPTPAPKPQPAIEGGDSA 485 (485) T ss_pred ccCCCCCCCCCCCCCCCCccCCCCCCCC Confidence 0 00000111111111 1111111 No 150 >protein:vir:9568 Length: 410 # NCBI annotation: gp34 # Family: family:all:524 # MgeID: mge:171 # MgeName: SM1 # Cross-refs: genbank:acc:NP_862873;genbank:gi:32469465;genbank:GeneID:1461310 Probab=97.74 E-value=7.6e-06 Score=48.65 Aligned_cols=199 Identities=13% Similarity=0.027 Sum_probs=109.9 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHhccceeechhHHHhhcCCchHHHHHHHHHHHHHhcCchheeeeecCCc----cee Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRRKQQAVWKVKGLAEMCDDDDAQYAARLRLAQVDDNSGVGRAIGIDAETE----EYD 76 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~~~~~v~k~~~l~~~~~~~~~e~~~~~r~~~~~~~r~~~~~~~iD~~~e----~~~ 76 (237) +.+.++..+.+++++......-.+-+......+-|+.. ++...+....+ ...++.+..+.+ +|- T Consensus 190 I~~~v~~l~da~~r~~~~~~~~~e~~a~pqr~i~G~d~---d~~~~~~~~~~---------~~~i~~~~~~~~~~~~~v~ 257 (410) T protein:vir:95 190 ITRAGMYYQKYAKRTLERADITAEFYSWPQKYILGLDP---DAEPMEKWKAT---------VSSLLTISSSDKGVKPSVG 257 (410) T ss_pred cchhHHHHHHHHHHHHHHHHHHHHHhcchhheeeccCC---CCCcCchhhhh---------hhhheeccCCCCCCcceEE Confidence 66788888888888877655555555555555545321 12211111111 123455543221 232 Q ss_pred e-eecCcCCHHHHHHHHHHHHhhhhcCceeeeeccCccccc-ccchh---HHHHHHHHHHHHHHHhhhHHHHHHHHHhhc Q lcl|NC_019725. 77 V-LNSDISGVPEFLSSKMDRIVSLSGIHEIIIKNKNVGGVS-ASQNT---ALETFYKLVDRKREEDYRPLLEFLLPFIVE 151 (237) Q Consensus 77 ~-~~~~lsGl~dl~~~~~~~iaa~s~iP~t~L~G~sp~Gln-atGe~---D~~nyyd~I~~~Qe~~l~p~l~~l~~~i~~ 151 (237) + -++++.+.-+.+.....++|+.+++|..-|-|.+- | +||+. -.......++.+| ..+.+.++++..+... T Consensus 258 q~~~~~l~~~~~~l~~l~~~~a~~s~lP~~~lg~~~~---NpsSa~Al~a~~~~L~~ka~~k~-~~fg~~l~~~~rla~~ 333 (410) T protein:vir:95 258 QFTTASMSPFTEQLRTAAAGFAGEMGLTLDDLGFVSD---NPSSVEAIKASHENLRLAGRKAQ-RSLGAGLLNVAYVAAC 333 (410) T ss_pred ecCCCChHHHHHHHHHHHHHHhhhcCCCHHHhccccC---chhHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHH Confidence 2 34577888899999999999999999888877652 3 45543 3344555555655 4578888888877531 Q ss_pred --------C---CCceeEeCCCCCCCHHHHHHHHHHHHHHHHHHHhC--CCCCHHHHHHHHHhhccccccCCCCCCChhc Q lcl|NC_019725. 152 --------E---EEWSIEFEPLSVPSKKEESEITKNNVESVTKAITE--QIIDLEEARDTLRSIAPEFKLKDGNNINIRE 218 (237) Q Consensus 152 --------s---~~~~~~f~pL~~~seke~Aei~~~~A~a~~~~~~~--g~i~~~e~r~~l~~~~~~~g~~~~~~~~~~~ 218 (237) + .++.+.|.|+..++-... ...|+++.+++++ |+++.+.+++.| |+.+. ++ .. T Consensus 334 i~~~~~~~~~~~~~~~v~W~p~~d~~~~s~----a~~aDa~~Kl~~a~~g~~~~~~~~~~l-------g~~~~-~~--~~ 399 (410) T protein:vir:95 334 LRDEFRYTRSQFVRTAVKWEPLFEADANTM----TMIGDGVVKLNQALPGYINAETIRDLT-------GIAGD-MS--AK 399 (410) T ss_pred HhcCCCCcccccceeeEEeeecCCcchhhH----HHHHHHHHHHHHhccCCccHHHHHHhc-------CCChH-HH--HH Confidence 1 135677998766643332 3356666777776 566666666554 21111 00 00 Q ss_pred cccCCCCCCCCCC Q lcl|NC_019725. 219 PEETTEPEPGLGE 231 (237) Q Consensus 219 ~e~~~e~~~~~~~ 231 (237) + ..+.....|+ T Consensus 400 ~--~~~e~~~~g~ 410 (410) T protein:vir:95 400 P--VVSEGGSNGE 410 (410) T ss_pred H--HHHHHHhCCC Confidence 0 0000001111 No 151 >protein:vir:9922 Length: 489 # NCBI annotation: hypothetical protein # Family: family:all:125 # MgeID: mge:178 # MgeName: 315.6 # Cross-refs: genbank:acc:NP_795684;genbank:gi:28876464;genbank:GeneID:1257980 Probab=97.74 E-value=8.8e-06 Score=48.31 Aligned_cols=214 Identities=15% Similarity=0.106 Sum_probs=111.4 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHhccceeechhHHHhhcCCchHHHHHHHHHH--------HHHhcCchheeeeecC- Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRRKQQAVWKVKGLAEMCDDDDAQYAARLRLAQ--------VDDNSGVGRAIGIDAE- 71 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~~~~~v~k~~~l~~~~~~~~~e~~~~~r~~~--------~~~~r~~~~~~~iD~~- 71 (237) .++.+.+.+.+++.+....+.-+..+...++.+.|... ...+ .......... .... ....++.++.. T Consensus 230 ~~~~v~~liDa~d~~~s~~~~~~~~~~~~~l~i~g~~~--~~~~-~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~ 305 (489) T protein:vir:99 230 AYESVLDNIDAYDLSQSELANFQQDSVNALLVIAGNAY--TGAD-ENDYLDDGRLNPNGRLAISIGF-KKAQVLILDDNP 305 (489) T ss_pred chhhhHHHHHHHHHHHHHHHHHHHHhhhhhhhhccCCc--cccc-chhhhhhccccccccccccccc-ccceeeeecccc Confidence 67788888888888888888777776666666554211 1110 0000000000 0000 01112222221 Q ss_pred -------CcceeeeecCcCCHHHHHHHHHHHHhhhhcCceeeeeccCcccccccchhHHHHHHHHHH--HHHHHhhhHHH Q lcl|NC_019725. 72 -------TEEYDVLNSDISGVPEFLSSKMDRIVSLSGIHEIIIKNKNVGGVSASQNTALETFYKLVD--RKREEDYRPLL 142 (237) Q Consensus 72 -------~e~~~~~~~~lsGl~dl~~~~~~~iaa~s~iP~t~L~G~sp~GlnatGe~D~~nyyd~I~--~~Qe~~l~p~l 142 (237) .-.|-.+..+.+++...++.+...|...+++|-.-.- +.+| |.||..=...|...+. ...+..++..| T Consensus 306 ~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~--~~~~-n~Sg~Al~~~~~~l~~k~~~k~~~~~~~l 382 (489) T protein:vir:99 306 NPNGVKPQAYFLKKEYDTAGSEAYKNRLVADILRFTFTPDTQDM--KFSG-VQSGESMKYKLMASDNYREKQERLFKKGL 382 (489) T ss_pred CccccccceeeeeecCChHHHHHHHHHHHHHHHHHhCCcccccc--cccc-cchHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 1245566778889999999999999999999964332 2222 5577653333443332 33345688889 Q ss_pred HHHHHHhhcC--------------CCceeEeCCCCCCCHHHHHHHHHHHHHHHHHHHhCCCCCHHHHHHHHHhhcccccc Q lcl|NC_019725. 143 EFLLPFIVEE--------------EEWSIEFEPLSVPSKKEESEITKNNVESVTKAITEQIIDLEEARDTLRSIAPEFKL 208 (237) Q Consensus 143 ~~l~~~i~~s--------------~~~~~~f~pL~~~seke~Aei~~~~A~a~~~~~~~g~i~~~e~r~~l~~~~~~~g~ 208 (237) ++++.+++.- .++++.|+|=...+..+.|++..+.+ |+||.+.+.+.+.. + T Consensus 383 ~~~~~li~~~~~~~~~~~~~~~~~~~i~v~f~~~~p~d~~~~~~~~~kl~---------giis~et~~~~l~~------v 447 (489) T protein:vir:99 383 MRRLRLAANIWAIKGNEATTYSLVNDTSIVFTPNLPQNDNEIVTAAQNLY---------GIVSDQTIFEILNT------V 447 (489) T ss_pred HHHHHHHHHHHhhcCCccccccccccceEEeCCCCCcCHHHHHHHHHHHh---------ccCCHHHHHHhcCC------C Confidence 9888876520 25889999999999999888765532 55665555443211 0 Q ss_pred CCCCCCC-------hhccc--cCCCCCCCCCCCcCcCC Q lcl|NC_019725. 209 KDGNNIN-------IREPE--ETTEPEPGLGEKLEDEN 237 (237) Q Consensus 209 ~~~~~~~-------~~~~e--~~~e~~~~~~~~~~~e~ 237 (237) . ..+.. .|..+ ...+...+.+...+.|+ T Consensus 448 ~-~~d~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~ 484 (489) T protein:vir:99 448 T-GVDAEAELKRLKEEADKKQSLPEPRLVGDASGQEEP 484 (489) T ss_pred C-chhHHHHHHHHHHHHHHHhccccccccCCCCCCcCC Confidence 0 00110 00000 00111111111111111 No 152 >protein:vir:98444 Length: 434 # NCBI annotation: hypothetical protein # Family: family:all:5096 # MgeID: mge:1589 # MgeName: VWB # Cross-refs: genbank:acc:NP_958276;genbank:gi:41057250;genbank:GeneID:2732828 Probab=97.68 E-value=1.9e-05 Score=46.43 Aligned_cols=213 Identities=11% Similarity=0.023 Sum_probs=105.3 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHhccceeechhHHH-hhcCCchHHHHHHHHHHHHHhcC-chheeeeecCCcceeee Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRRKQQAVWKVKGLAE-MCDDDDAQYAARLRLAQVDDNSG-VGRAIGIDAETEEYDVL 78 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~~~~~v~k~~~l~~-~~~~~~~e~~~~~r~~~~~~~r~-~~~~~~iD~~~e~~~~~ 78 (237) -++.+.+.+.+++++......-..-+....+.+.|... ...++.+. .....+..+. ...++++++++-+|-++ T Consensus 195 d~e~vi~liDa~~~~~s~~~~~~~~~a~p~~~i~G~~~~~~~~~~~~-----~~~~~~~~~~~~~~i~~~~~~~~~~~q~ 269 (434) T protein:vir:98 195 EFAGVLDIQDRVNLGILNRMAASRFSGFRQKWIKGHKFAKRTDPATG-----MTVVDQPFVPSPSAVWASEGENTQFGQL 269 (434) T ss_pred hhhhHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCCcccccccccc-----cchhhhhhhccccccccCCCCCceEEEe Confidence 45788888888888877665544444333322222110 01111111 1111111111 12334444333333333 Q ss_pred -ecCcCCHHHHHHHHHHHHhhhhcCceeeeeccCcccccccchhHH---HHHHHHHHHHHHHhhhHHHHHHHHHhhcC-- Q lcl|NC_019725. 79 -NSDISGVPEFLSSKMDRIVSLSGIHEIIIKNKNVGGVSASQNTAL---ETFYKLVDRKREEDYRPLLEFLLPFIVEE-- 152 (237) Q Consensus 79 -~~~lsGl~dl~~~~~~~iaa~s~iP~t~L~G~sp~GlnatGe~D~---~nyyd~I~~~Qe~~l~p~l~~l~~~i~~s-- 152 (237) .+++.+..+.+.....++|+.+++|...|-|.+ + |+||+.=. ...-..+..+| ..+++.|++++.++..- T Consensus 270 ~~~~~~~~~~~l~~~i~~~~~~~~~p~~~~~~~~-~--n~Sg~Al~~~~~~l~~k~~~k~-~~f~~~l~~~~rl~~~~~g 345 (434) T protein:vir:98 270 DATDLSGFLKEHASDVRDMLTISQTPTYLYATDL-V--NISADTIGALDILHVAKVREHI-ASFSEGLESVLALAAAQAG 345 (434) T ss_pred cCcchHHHHHHHHHHHHHHhcccCCCHHHhcccc-C--ChHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHhcC Confidence 345667778888889999999999987776532 1 34555433 33344444444 56888899998887632 Q ss_pred -----CCceeEeCCCCCCCHHHHHHHHHHHHHHHHHHHhCCC----------CCHHHHHHHHHhhccccccCCCCCCChh Q lcl|NC_019725. 153 -----EEWSIEFEPLSVPSKKEESEITKNNVESVTKAITEQI----------IDLEEARDTLRSIAPEFKLKDGNNINIR 217 (237) Q Consensus 153 -----~~~~~~f~pL~~~seke~Aei~~~~A~a~~~~~~~g~----------i~~~e~r~~l~~~~~~~g~~~~~~~~~~ 217 (237) .++.+.|.|-..+|..+.|+...+ ++++|+ .+++|+.+..+.. ....... . T Consensus 346 ~~~~~~~~~v~w~~~~~~s~~~~ada~~k-------l~~~g~~~e~~~~~lg~~~~e~~r~~~e~-~~~~~~~----~-- 411 (434) T protein:vir:98 346 VPEDYTEAEVRWANPAHVTMAVKADAATK-------LKSIGYPLDVIAEELDESPARVRRIVAGA-ASQALLA----A-- 411 (434) T ss_pred CChhheeeeEEecCCCCCCHHHHHHHHHH-------HHhcCCcHHHHHHhCCCCHHHHHHHHHHH-HHHHHHH----H-- Confidence 267799999999998887765544 444442 1222322211110 0000000 0 Q ss_pred ccccCCCCCCCCCCCcCcCC Q lcl|NC_019725. 218 EPEETTEPEPGLGEKLEDEN 237 (237) Q Consensus 218 ~~e~~~e~~~~~~~~~~~e~ 237 (237) .......+|..|+.-++++ T Consensus 412 -~~~~~~~~~~~g~~~~~~~ 430 (434) T protein:vir:98 412 -SLLPAPGAPSAGNVPDSGG 430 (434) T ss_pred -hhhccCCCCCCCCCCcccC Confidence 0000011122222222222 No 153 >protein:vir:4223 Length: 486 # NCBI annotation: predicted 53.7Kd protein # Family: family:all:524 # MgeID: mge:89 # MgeName: L5 # Cross-refs: genbank:acc:NP_039678;swissprot:sw:q05220;genbank:gi:9625444;uniprot:Q05220;genbank:GeneID:2942930;interpro:IPR010859 Probab=97.67 E-value=1.5e-05 Score=47.00 Aligned_cols=221 Identities=10% Similarity=0.063 Sum_probs=105.9 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHhccceeechhHH-Hhhc--CCchHHHHHHHHHHHHHhcCchheeeeecCCcceee Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRRKQQAVWKVKGLA-EMCD--DDDAQYAARLRLAQVDDNSGVGRAIGIDAETEEYDV 77 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~~~~~v~k~~~l~-~~~~--~~~~e~~~~~r~~~~~~~r~~~~~~~iD~~~e~~~~ 77 (237) +.+.|.+.+.+++++.........-+....+.+.|.. .... ++.+. ..+... ...++++.+.+-+|-+ T Consensus 230 i~~~v~~liDa~~~~~s~~~~~~e~~a~p~~~i~G~~~~~~~~~~~~~~-------~~~~~~--~~~~~~~~~~~~~~~q 300 (486) T protein:vir:42 230 ITPELRSMTDAAARILMLMQATAELMGVPQRLIFGIKPEEIGVDSETGQ-------TLFDAY--LARILAFEDAEGKIQQ 300 (486) T ss_pred chhhHHHHHHHHHHHHHHHHHHHHhhcchHHHhhcCCcccccccccccc-------chhhhh--hchhcccCCCCceEEe Confidence 3345667777888776665554444333333332211 0111 11111 111111 1112233222233433 Q ss_pred -eecCcCCHHHHHHHHHHHHhhhhcCceeeeeccCcccccccchhHHHHHHHHH--HHHHHHhhhHHHHHHHHHhhcC-- Q lcl|NC_019725. 78 -LNSDISGVPEFLSSKMDRIVSLSGIHEIIIKNKNVGGVSASQNTALETFYKLV--DRKREEDYRPLLEFLLPFIVEE-- 152 (237) Q Consensus 78 -~~~~lsGl~dl~~~~~~~iaa~s~iP~t~L~G~sp~GlnatGe~D~~nyyd~I--~~~Qe~~l~p~l~~l~~~i~~s-- 152 (237) -.+++....+.+.....++|+.+++|...|-|.+ .+ ++||+.=...|...+ ...++..+++.|++++.++..- T Consensus 301 ~~~~~~e~~~~~l~~~i~~~s~~~~~p~~~fg~~~-~n-~~Sg~Al~~~~~~l~~ka~~~~~~f~~~l~~~~~l~~~~~~ 378 (486) T protein:vir:42 301 FSAAELANFTNALDQIAKQVAAYTGLPPQYLSTAA-DN-PASAEAIRAAESRLIKKVERKNLMFGGAWEEAMRIAYRIMK 378 (486) T ss_pred ecccCHHHHHHHHHHHHHHHhcccCCCHHHhcccc-Cc-hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc Confidence 2335566777888888999999999988774433 33 246765444444443 2344466899999998876421 Q ss_pred --------CCceeEeCCCCCCCHHHHHHHHHHHHHHH-------HHHHhCCCCCHHHH-HHHHHhhcc------ccccCC Q lcl|NC_019725. 153 --------EEWSIEFEPLSVPSKKEESEITKNNVESV-------TKAITEQIIDLEEA-RDTLRSIAP------EFKLKD 210 (237) Q Consensus 153 --------~~~~~~f~pL~~~seke~Aei~~~~A~a~-------~~~~~~g~i~~~e~-r~~l~~~~~------~~g~~~ 210 (237) .++.+.|.|-..+|..+.|+...+.+++. ..+-..|+++.... .++++..-. ...+.+ T Consensus 379 ~~~~~~d~~~i~v~w~~~~~~s~~~~ad~~~kl~~~~~g~~s~et~~~~lg~~~d~~~e~~~~~~e~~~~~~~~~~~~~~ 458 (486) T protein:vir:42 379 GGDVPPDMLRMETVWRDPSTPTYAAKADAATKLYGNGQGVIPRERARIDMGYSVKEREEMRRWDEEEAAMGLGLLGTMVD 458 (486) T ss_pred CCCccccceeeeEEecCCCCCCHHHHHHHHHHHHhcccCCCCHHHHHhcCCCChhHHHHHHHHHHHHHHHHHHHHHHhhc Confidence 25788999999999999998777765532 22334554433211 112211000 000100 Q ss_pred CC------CCChhccccCCCCCCCCCCC Q lcl|NC_019725. 211 GN------NINIREPEETTEPEPGLGEK 232 (237) Q Consensus 211 ~~------~~~~~~~e~~~e~~~~~~~~ 232 (237) .. ....+.....+.....+|++ T Consensus 459 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 486 (486) T protein:vir:42 459 ADPTVPGSPSPTAPPKPQPAIESSGGDA 486 (486) T ss_pred CCCCCCCCCCCCCCCCCCcccCCCCCCC Confidence 00 00001111111111222222 No 154 >protein:vir:3964 Length: 453 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:83 # MgeName: ul36 # Cross-refs: genbank:acc:NP_663672;genbank:gi:21716109;genbank:GeneID:951201 Probab=97.66 E-value=2.3e-05 Score=45.97 Aligned_cols=203 Identities=12% Similarity=0.097 Sum_probs=110.6 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHhccceeechhHHHhhcCCchHHHHHHHHHHHHHhcCchheeeeec-----CCc-- Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRRKQQAVWKVKGLAEMCDDDDAQYAARLRLAQVDDNSGVGRAIGIDA-----ETE-- 73 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~~~~~v~k~~~l~~~~~~~~~e~~~~~r~~~~~~~r~~~~~~~iD~-----~~e-- 73 (237) .++.+.+-+.+++.+....+.-+..+....+.+.|.. +.... ...+. . .+++.+.+ ++. T Consensus 219 d~e~v~~liDa~~~~~s~~~~~~~~~~~p~~~~~g~~--~~~~~-----~~~~~------~-~~~~~~~~~~~~~~~~~~ 284 (453) T protein:vir:39 219 IFESVISLVNAFNKAISEKANDVDYFSDQYLTFLGAA--VEEED-----LKNIR------S-NRVINYYGESSEAKNVDV 284 (453) T ss_pred chhhhHHHHHHHHHHHHHHHHHHHHhhCceeeeecCC--CCchh-----hhhhh------h-cceeeecCCCCCCCCCce Confidence 6778888888999998888888877777766665521 11111 11111 1 12222211 112 Q ss_pred ceeeeecCcCCHHHHHHHHHHHHhhhhcCceeeeeccCcccccccchhHHHHHHHHH--HHHHHHhhhHHHHHHHHHhhc Q lcl|NC_019725. 74 EYDVLNSDISGVPEFLSSKMDRIVSLSGIHEIIIKNKNVGGVSASQNTALETFYKLV--DRKREEDYRPLLEFLLPFIVE 151 (237) Q Consensus 74 ~~~~~~~~lsGl~dl~~~~~~~iaa~s~iP~t~L~G~sp~GlnatGe~D~~nyyd~I--~~~Qe~~l~p~l~~l~~~i~~ 151 (237) .|-+.+.+.+++...++.+...|...+++|-.-. +.. | |+||+.=...|...+ ....+..+...|++++.+++. T Consensus 285 ~~lt~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~-~~~--g-n~Sg~Al~~~~~~l~~ka~~~~~~~~~~l~~~~~li~~ 360 (453) T protein:vir:39 285 KFLEKPDSDSQTENLLDRLTKLIFQTTMVANISD-ESF--G-SSSGVSLAYKLQAMSNLALSFQRKFQSSLNSRYKLYCE 360 (453) T ss_pred eEEeecCCHHHHHHHHHHHHHHHHHHhCCccccc-ccc--c-CChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 3555667788999999999999999999996432 211 2 457765444444322 233335678888888887652 Q ss_pred ----------CCCceeEeCCCCCCCHHHHHHHHHHHHHHHHHHHhCCCCCHHHHHHHH-------------Hhhcccccc Q lcl|NC_019725. 152 ----------EEEWSIEFEPLSVPSKKEESEITKNNVESVTKAITEQIIDLEEARDTL-------------RSIAPEFKL 208 (237) Q Consensus 152 ----------s~~~~~~f~pL~~~seke~Aei~~~~A~a~~~~~~~g~i~~~e~r~~l-------------~~~~~~~g~ 208 (237) ..+++|.|+|-...+.++.|++..+.+ |+||.+.+.+.| ++...+... T Consensus 361 ~~~~~~~~~~~~~i~v~f~~~~p~~~~~~a~~~~kl~---------g~is~et~l~~l~~v~D~~~E~~ri~~E~~~~~~ 431 (453) T protein:vir:39 361 LSTNVSNKEAWKDIEYTFTRNEPKDIKEQAETANILM---------GITSQETALSVISVIPDVQAEMEKIKKEEASTAI 431 (453) T ss_pred HHhccCCccccccceEEeCCCCCcCHHHHHHHHHHHh---------ccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHH Confidence 136789999999999999988766554 455554444332 211100000 Q ss_pred CCCCCCChhccccCCCCCCCCCCCcCcC Q lcl|NC_019725. 209 KDGNNINIREPEETTEPEPGLGEKLEDE 236 (237) Q Consensus 209 ~~~~~~~~~~~e~~~e~~~~~~~~~~~e 236 (237) . ....+...+...+..++.+.| T Consensus 432 ~------~~~~~~~~~~~~~~~~~~~~e 453 (453) T protein:vir:39 432 F------DKDKQPSEKGTDTVVPETNEE 453 (453) T ss_pred H------HHhccCCCCCCCCCCCCcCCC Confidence 0 000000000111111111111 No 155 >protein:vir:102330 Length: 451 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:1566 # MgeName: phi CD119 # Cross-refs: genbank:acc:YP_529555;genbank:gi:90592641;genbank:GeneID:3974462 Probab=97.66 E-value=8.7e-06 Score=48.34 Aligned_cols=197 Identities=14% Similarity=0.069 Sum_probs=122.3 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHhccceeechhHHHhhcCCchHHHHHHHHHHHHHhcCchheeeeecC------Ccc Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRRKQQAVWKVKGLAEMCDDDDAQYAARLRLAQVDDNSGVGRAIGIDAE------TEE 74 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~~~~~v~k~~~l~~~~~~~~~e~~~~~r~~~~~~~r~~~~~~~iD~~------~e~ 74 (237) .++.+.+-+.+++.+....+..+..+...++.+.|+.. .... .....+. ..+++.+..+ +-+ T Consensus 231 d~e~v~~liDa~~~~~S~~~~~~~~~~~~~l~~~g~~~----~~~~-~~~~~~~-------~~~~i~~~~~~~~~~~~~~ 298 (451) T protein:vir:10 231 DLSKYKKILDLYDRVMSGFANDLEDIQQIIYILENFGG----EDTS-EFLKELK-------RYKTIKTETDSEGDSGGLK 298 (451) T ss_pred chhhHHHHHHHHHHHHHHHHHHHHHhccceeeeecCCc----ccch-hhHHHHh-------hCCeEEecCcCCccCCcce Confidence 68888899999999999999888888888888776321 1111 1122221 1334444321 134 Q ss_pred eeeeecCcCCHHHHHHHHHHHHhhhhcCceeeeeccCcccccccchhHHHHHHHHH--HHHHHHhhhHHHHHHHHHhhc- Q lcl|NC_019725. 75 YDVLNSDISGVPEFLSSKMDRIVSLSGIHEIIIKNKNVGGVSASQNTALETFYKLV--DRKREEDYRPLLEFLLPFIVE- 151 (237) Q Consensus 75 ~~~~~~~lsGl~dl~~~~~~~iaa~s~iP~t~L~G~sp~GlnatGe~D~~nyyd~I--~~~Qe~~l~p~l~~l~~~i~~- 151 (237) |-..+.+..++...++.+...|...+++|-.-.- ..| |+||..=..-|.... ...++..+++.|++++++++. T Consensus 299 ~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~---~~g-n~Sg~Alk~~~~~l~~k~~~k~~~f~~~l~~~~~li~~~ 374 (451) T protein:vir:10 299 TMQIEIPTEARKIILEILKKQIYESGQGLQQDTE---NFG-NASGVALKFFYRKLELKSGLLETEFRTSFDKLIKAILYF 374 (451) T ss_pred EEeecCCHHHHHHHHHHHHHHHHHHhCccccccc---ccc-cccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 5567778899999999999999999999964221 112 567764444444432 445556789999999988863 Q ss_pred -----CCCceeEeCCCCCCCHHHHHHHHHHHHHHHHHHHhCCCCCHHHHHHHHHhhccccccCCCCCCChhcc----c-- Q lcl|NC_019725. 152 -----EEEWSIEFEPLSVPSKKEESEITKNNVESVTKAITEQIIDLEEARDTLRSIAPEFKLKDGNNINIREP----E-- 220 (237) Q Consensus 152 -----s~~~~~~f~pL~~~seke~Aei~~~~A~a~~~~~~~g~i~~~e~r~~l~~~~~~~g~~~~~~~~~~~~----e-- 220 (237) ..++.+.|+|-..-+++|.|++..+. .|+||.+.+...+- +. .++..+.- + T Consensus 375 ~~~~d~~~i~i~f~~~~p~n~~e~~~~~~kl---------~g~iS~et~~~~~p-------~v--~d~~~e~~~~~ee~~ 436 (451) T protein:vir:10 375 LGVTDYKKIQQTYTRNMMSNDLEDADIATKS---------VGIIPTKIILRHHP-------WV--DDVEEAEKLYLEEKK 436 (451) T ss_pred hCCCCccceeEEecCCCCCCHHHHHHHHHHH---------hccCchHHHHHhCC-------CC--CCHHHHHHHHHHHHH Confidence 24788999999999999988766653 26777766655431 11 11111110 0 Q ss_pred ----cCCCCCCCCCC Q lcl|NC_019725. 221 ----ETTEPEPGLGE 231 (237) Q Consensus 221 ----~~~e~~~~~~~ 231 (237) +..+.-++.++ T Consensus 437 ~~~~~~~~~~~~~~~ 451 (451) T protein:vir:10 437 IQASKVSDDYNNFTE 451 (451) T ss_pred HHHHHHHhhcCCCCC Confidence 01111122222 No 156 >protein:vir:38 Length: 496 # NCBI annotation: putative portal protein # Family: family:all:898 # MgeID: mge:2 # MgeName: A118 # Cross-refs: genbank:acc:NP_463464;swissprot:trembl:q9t1c0;genbank:gi:16798786;uniprot:Q9T1C0;genbank:GeneID:922383 Probab=97.66 E-value=4.8e-06 Score=49.75 Aligned_cols=213 Identities=11% Similarity=0.094 Sum_probs=107.0 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHhccceeechhHHHhhcCCchHHHHHHHHHHHHHhcCchheeeeecC--Ccceeee Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRRKQQAVWKVKGLAEMCDDDDAQYAARLRLAQVDDNSGVGRAIGIDAE--TEEYDVL 78 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~~~~~v~k~~~l~~~~~~~~~e~~~~~r~~~~~~~r~~~~~~~iD~~--~e~~~~~ 78 (237) .+..+.+.+..++.+......=++...-.++--..+-....+++++.. ..+..-...+-++..+.. .+.++.. T Consensus 256 d~~~~~~lid~ld~~~s~~~~~~~~~~~~i~v~~~~l~~~~~~~g~~~-----~~~~~~~~~~~~~~~~~~~~~~~i~~~ 330 (496) T protein:vir:38 256 VYANALDTLKTLDLMFDSYYQEFKLGKKKVLVPSSFVKTAVNLDGSTT-----QYFDSTDEAFFLYQGDQDDNGKAIKDI 330 (496) T ss_pred hHhhHHHHHHHHHHHHHHHHHHHhhcccceecchHHhhccCCCCCccc-----cCCCCccceEEEeecCCCcccccceee Confidence 567778888888888777776555444444422221111222222210 001100011111111111 1346665 Q ss_pred ecCc--CCHHHHHHHHHHHHhhhhcCceeeeeccCcccccccchhHHHH---HHHHHHHHHHHhhhHHHHHHHHHhhc-- Q lcl|NC_019725. 79 NSDI--SGVPEFLSSKMDRIVSLSGIHEIIIKNKNVGGVSASQNTALET---FYKLVDRKREEDYRPLLEFLLPFIVE-- 151 (237) Q Consensus 79 ~~~l--sGl~dl~~~~~~~iaa~s~iP~t~L~G~sp~GlnatGe~D~~n---yyd~I~~~Qe~~l~p~l~~l~~~i~~-- 151 (237) +..+ .-....++.+...++..+|+|-.. ||...+|.. |+..=... -+..+..+| ..++..|++++..++. T Consensus 331 ~~~i~~e~~~~~l~~~l~~i~~~~g~~~~~-f~~~~~g~~-tAtei~~~~~~l~~~~~~~~-~~~~~~l~~l~~~il~~~ 407 (496) T protein:vir:38 331 SVEIRSTEFIESINAMLRIYAMQVGLSAGT-FTFDENGLK-TATEVVSEKSETYQTKNSHS-QLIEQGIKEMIVSILEVG 407 (496) T ss_pred ccccCHHHHHHHHHHHHHHHHHhhCCChhh-cCCCccccc-hHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHH Confidence 5554 345667778888889999998776 666667753 44322223 344444444 5677888877666531 Q ss_pred ------------CCCceeEeCCCCCCCHHHHHHHHHHHHHHHHHHHhCCCCCHHHHHHHHHhhccccccCCCCCC----- Q lcl|NC_019725. 152 ------------EEEWSIEFEPLSVPSKKEESEITKNNVESVTKAITEQIIDLEEARDTLRSIAPEFKLKDGNNI----- 214 (237) Q Consensus 152 ------------s~~~~~~f~pL~~~seke~Aei~~~~A~a~~~~~~~g~i~~~e~r~~l~~~~~~~g~~~~~~~----- 214 (237) ..+++|.|+.-...++.+.++ ++..++.+|++|.+.++..+ .+..+ ... T Consensus 408 ~~~~~~~g~~~~~~~i~v~f~d~i~~d~~~~~~-------~~~~~~~~GiiS~et~l~~~------~~~~d-~ea~~el~ 473 (496) T protein:vir:38 408 KFIEAYSGEVVELDTITVDFDDSIAQDEDTTIN-------RYTNAKNQGMIPLKIALQRA------WNITE-AEADEWAE 473 (496) T ss_pred HHHHhhcCCCCCccceEEEeCCCCCCCHHHHHH-------HHHHHHhcCCCCHHHHHHhc------CCCCh-HHHHHHHH Confidence 136889999988888877544 44456678998877665422 12111 111 Q ss_pred --ChhccccCCCCCCCCCCCcCcC Q lcl|NC_019725. 215 --NIREPEETTEPEPGLGEKLEDE 236 (237) Q Consensus 215 --~~~~~e~~~e~~~~~~~~~~~e 236 (237) ..|...+.++++.+. -.-+.| T Consensus 474 ri~~E~~~~~~~~d~~~-~~~~~e 496 (496) T protein:vir:38 474 MLAKEKQAEMPNNDMNG-IFGEEE 496 (496) T ss_pred HHHHhhhccCccccccC-CCCCCC Confidence 011111122222221 111111 No 157 >protein:vir:1634 Length: 409 # NCBI annotation: Structural protein # Family: family:all:524 # MgeID: mge:33 # MgeName: r1t # Cross-refs: genbank:acc:NP_695055;genbank:gi:23455746;genbank:GeneID:955506 Probab=97.62 E-value=8.8e-06 Score=48.29 Aligned_cols=185 Identities=11% Similarity=0.014 Sum_probs=108.7 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHhccceeechhHHHhhcCCchHHHHHHHHHHHHHhcCchheeeeecC--Cc--ce- Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRRKQQAVWKVKGLAEMCDDDDAQYAARLRLAQVDDNSGVGRAIGIDAE--TE--EY- 75 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~~~~~v~k~~~l~~~~~~~~~e~~~~~r~~~~~~~r~~~~~~~iD~~--~e--~~- 75 (237) +.+.+.+...++.++.....--.+-+......+-|+.+ ++........++ ..++.+.++ ++ ++ T Consensus 203 I~~~v~~l~da~~r~~~~~~~~~e~~a~pqr~i~G~d~---d~~~~~~~~~~~---------~~i~~~~~d~~g~~~~v~ 270 (409) T protein:vir:16 203 ITRSGMYWQSNAKRTLERADVTAEFYSFPQKYVTGLSD---DAEPMETWKATV---------SSMLQFTKDEDGDKPTLG 270 (409) T ss_pred cchhHHHHHHHHHHHHHHHHHHHHHhcChhheeEecCC---CCCccchhhhhh---------hHhhccCCCCCCCCceEE Confidence 66778888888888876655544444444444445421 222222121111 223444221 11 23 Q ss_pred eeeecCcCCHHHHHHHHHHHHhhhhcCceeeeeccCccccc-ccchhHH---HHHHHHHHHHHHHhhhHHHHHHHHHhhc Q lcl|NC_019725. 76 DVLNSDISGVPEFLSSKMDRIVSLSGIHEIIIKNKNVGGVS-ASQNTAL---ETFYKLVDRKREEDYRPLLEFLLPFIVE 151 (237) Q Consensus 76 ~~~~~~lsGl~dl~~~~~~~iaa~s~iP~t~L~G~sp~Gln-atGe~D~---~nyyd~I~~~Qe~~l~p~l~~l~~~i~~ 151 (237) +.-++++.+.-+.+.....++|+.++||..-|-|.+- | +||+.=. ..-...++.+| ..+.+.++++..+... T Consensus 271 q~~~~~l~~~~~~l~~~~~~~a~~s~lP~~~lg~~~~---NpsSa~Ai~a~~~~L~~ka~~k~-~~fg~~l~~~~rla~~ 346 (409) T protein:vir:16 271 QFTQPSMSPFTEQLRTAAAGFAGETGLTLDDLGFVSD---NPSSVEAIKASHENLRLAGRKAQ-RSLGAGLLNVAYLAAC 346 (409) T ss_pred ecCCCChhHHHHHHHHHHHHHhhhcCCCHHHcccccC---chhHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHH Confidence 2345677889999999999999999999888877652 3 4554332 33333445544 4578888888777542 Q ss_pred C-----------CCceeEeCCCCCCCHHHHHHHHHHHHHHHHHHHhCCC-C-CHHHHHHHHHhhccccccCCCC Q lcl|NC_019725. 152 E-----------EEWSIEFEPLSVPSKKEESEITKNNVESVTKAITEQI-I-DLEEARDTLRSIAPEFKLKDGN 212 (237) Q Consensus 152 s-----------~~~~~~f~pL~~~seke~Aei~~~~A~a~~~~~~~g~-i-~~~e~r~~l~~~~~~~g~~~~~ 212 (237) - .++.+.|.|+..++.... ...|+++.+++++|. + +.+.+++.| |+.+.. T Consensus 347 ~~~~~~~~~~~~~~~~v~W~~~~~~~~~s~----a~~aDa~~Kl~~a~~~~~~~~v~~~~~-------g~~~~d 409 (409) T protein:vir:16 347 LRDDVPYLREQFSKTKPKWEPLFEADASML----SLIGDGAIKLNQAIPEFINKDTIRDLT-------GIKGAE 409 (409) T ss_pred HhcCCCccchhhccceEEecCCCCcchhhH----HHHHHHHHHHHhhcccccchhHHHHhc-------cCCCCC Confidence 1 245788999887774444 347888999999873 3 334444443 332211 No 158 >protein:vir:80959 Length: 499 # NCBI annotation: gp3 # Family: family:all:898 # MgeID: mge:1886 # MgeName: A500 # Cross-refs: genbank:acc:YP_001468389;genbank:gi:157324963;genbank:GeneID:5601394 Probab=97.61 E-value=6.3e-06 Score=49.08 Aligned_cols=212 Identities=10% Similarity=0.102 Sum_probs=111.0 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHhccceeechhHHHhhcCCchHHHHHHHHHHHHHhcCchheeee-ecC-Ccceeee Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRRKQQAVWKVKGLAEMCDDDDAQYAARLRLAQVDDNSGVGRAIGI-DAE-TEEYDVL 78 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~~~~~v~k~~~l~~~~~~~~~e~~~~~r~~~~~~~r~~~~~~~i-D~~-~e~~~~~ 78 (237) .+..+.+.+..++.+......-+...+..++=-..+-....+++++.. . .+..-...+..+.. +.+ +..++.. T Consensus 259 ~~~~~~~lid~lD~~~s~~~~e~~~~~~~i~v~~~~l~~~~~~~g~~~--~---~~~~~~~~~~~~~~~~~~~~~~i~~~ 333 (499) T protein:vir:80 259 VYANALDTLKTLDLMFDSYYQEFKLGKKKVLVPSSFVKTAVNLDGSTT--Q---YFDSTDEAFFLYQGEQDDNGKAIKDI 333 (499) T ss_pred hHhhHHHHHHHHHHHHHHHHHHHHhcccceecchhhhhccCCCCCCcc--c---CCCcccceeeEeeccCCCCcCceeEe Confidence 567778888888888888877666555455422221111112222210 0 01111111112211 111 1236655 Q ss_pred ecCc--CCHHHHHHHHHHHHhhhhcCceeeeeccCcccccccchhH---HHHHHHHHHHHHHHhhhHHHHHHHHHhhc-- Q lcl|NC_019725. 79 NSDI--SGVPEFLSSKMDRIVSLSGIHEIIIKNKNVGGVSASQNTA---LETFYKLVDRKREEDYRPLLEFLLPFIVE-- 151 (237) Q Consensus 79 ~~~l--sGl~dl~~~~~~~iaa~s~iP~t~L~G~sp~GlnatGe~D---~~nyyd~I~~~Qe~~l~p~l~~l~~~i~~-- 151 (237) +..+ ......+..+...+...+|+|-.. ||...+|.. |+..= ...-|.++..+| ..++..|++|+..|.. T Consensus 334 ~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~-fg~~~~g~~-TAtei~s~~~~l~~~~~~~~-~~~~~~l~~l~~~il~~~ 410 (499) T protein:vir:80 334 SVEIRSTEFIESINAMLRIYAMQVGLSAGT-FTFDENGLK-TATEVVSEKSETYQTKNSHS-QLIEQGIKEMIVSILEVG 410 (499) T ss_pred cCcCChHHHHHHHHHHHHHHHHhcCCChhh-cCCCcccch-hHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHH Confidence 5544 345566777778888888998655 666666653 44332 233344555555 5678888888776641 Q ss_pred ------------CCCceeEeCCCCCCCHHHHHHHHHHHHHHHHHHHhCCCCCHHHHHHHHHhhccccccCCCC------C Q lcl|NC_019725. 152 ------------EEEWSIEFEPLSVPSKKEESEITKNNVESVTKAITEQIIDLEEARDTLRSIAPEFKLKDGN------N 213 (237) Q Consensus 152 ------------s~~~~~~f~pL~~~seke~Aei~~~~A~a~~~~~~~g~i~~~e~r~~l~~~~~~~g~~~~~------~ 213 (237) ..+++|.|+.-...++.+. ++....++.+|++|.+.++..+ +|..+.. . T Consensus 411 ~~~~~~~~~~~~~~~v~v~f~d~i~~d~~~~-------~~~~~~~~~~Gi~S~et~l~~~------~~~~d~ea~~el~~ 477 (499) T protein:vir:80 411 KLIKAYDGDTVELDTITVDFDDSIAQDEDTT-------INRYTTAKNQGMIPLKIALQRA------WNITEAEADEWAEM 477 (499) T ss_pred HHhccccCCCCCccceEEEeCCCCCCCHHHH-------HHHHHHHHHcCCCCHHHHHhhc------CCCChHHHHHHHHH Confidence 1368899999888887764 4456667788988877664321 2221100 0 Q ss_pred CChhccccCCCCCCCC--CCCc Q lcl|NC_019725. 214 INIREPEETTEPEPGL--GEKL 233 (237) Q Consensus 214 ~~~~~~e~~~e~~~~~--~~~~ 233 (237) +..|.....++++++. |+.+ T Consensus 478 i~~E~~~~~~~~d~~g~~ge~e 499 (499) T protein:vir:80 478 LAKEKQAEIPNNDMTGIFGEEE 499 (499) T ss_pred HHHHhhcCCCCCCccccCCCCC Confidence 0011111223333222 2222 No 159 >protein:vir:79043 Length: 479 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:1861 # MgeName: phiC2 # Cross-refs: genbank:acc:YP_001110721;genbank:gi:134287338;genbank:GeneID:4955217 Probab=97.57 E-value=4.9e-06 Score=49.70 Aligned_cols=201 Identities=17% Similarity=0.135 Sum_probs=118.4 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHhccceeechhHHHhhcCCchHHHHHHHHHHHHHhcCchheeeeecCCcc--eeee Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRRKQQAVWKVKGLAEMCDDDDAQYAARLRLAQVDDNSGVGRAIGIDAETEE--YDVL 78 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~~~~~v~k~~~l~~~~~~~~~e~~~~~r~~~~~~~r~~~~~~~iD~~~e~--~~~~ 78 (237) .++.+.+.+.+++.+....+.-+..+....+.+.|..... .+ .....+. ..+++.++.+ .+ |-+. T Consensus 254 d~~~v~~liDa~d~~~S~~~~~~~~~~~~~~v~~g~~~~~-~~----~~~~~~~-------~~~~i~~~~~-~~~~~l~~ 320 (479) T protein:vir:79 254 DLTFYKSLIDIYDNNISTLADNLDEIQEVIYVLKEYPGTS-LQ----EFIDNIR-------YYKSIKVDGG-GGVDKLEI 320 (479) T ss_pred chhhhHHHHHHHHHHHHHHHHHHHHhhCceeeeecCCccc-cc----cchhhhh-------hccceecCCC-CcceEEec Confidence 6778888888999988888888887777766666532110 01 1111111 2334445443 44 4456 Q ss_pred ecCcCCHHHHHHHHHHHHhhhhcCceeeeeccCcccccccchhHHHHHHHHH--HHHHHHhhhHHHHHHHHHhhc----C Q lcl|NC_019725. 79 NSDISGVPEFLSSKMDRIVSLSGIHEIIIKNKNVGGVSASQNTALETFYKLV--DRKREEDYRPLLEFLLPFIVE----E 152 (237) Q Consensus 79 ~~~lsGl~dl~~~~~~~iaa~s~iP~t~L~G~sp~GlnatGe~D~~nyyd~I--~~~Qe~~l~p~l~~l~~~i~~----s 152 (237) +.+.+++...++.+.+.|...+++|-.-.-+ .| |+||..=..-|.... ....+..++..|++++++++. . T Consensus 321 ~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~---~g-n~Sg~Ai~~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~ 396 (479) T protein:vir:79 321 NIPVEAKKELLDRLEKNIIIFGQGVNPESQN---TG-DKSGVALKFLYSLLDLKCSKTEKKFKKAIRELLWFVCEYLKIS 396 (479) T ss_pred cCCHHHHHHHHHHHHHHHHHHhCcccccccc---cc-chhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcc Confidence 6777889999999999999999999754322 12 456754333333332 344556688888888888752 1 Q ss_pred -------CCceeEeCCCCCCCHHHHHHHHHHHHHHHHHHHhCCCCCHHHHHHHHHhhccccccCCCCCCChhccc----- Q lcl|NC_019725. 153 -------EEWSIEFEPLSVPSKKEESEITKNNVESVTKAITEQIIDLEEARDTLRSIAPEFKLKDGNNINIREPE----- 220 (237) Q Consensus 153 -------~~~~~~f~pL~~~seke~Aei~~~~A~a~~~~~~~g~i~~~e~r~~l~~~~~~~g~~~~~~~~~~~~e----- 220 (237) .+++|.|+|-...++++.|++..+. .|+||.+.+.+.|- ..+ ++. ++++ T Consensus 397 ~~~~~~~~~i~i~f~~~~p~~~~~~a~~~~kl---------~g~iS~et~l~~l~-------~v~--d~~-~E~~ri~~E 457 (479) T protein:vir:79 397 GNKSYDYKTVQITFNHSMIINEAEKIDMAAKS---------TGIVSDETIVSNHP-------WVE--DVN-DELERLKKQ 457 (479) T ss_pred CCCccccccceEEeCCCCCcCHHHHHHHHHHH---------hccCcHHHHHHhCC-------CCC--CHH-HHHHHHHHH Confidence 3688999999999999988765442 37788776665431 111 111 1111 Q ss_pred ---cC--CCCCCCCCCCcCcCC Q lcl|NC_019725. 221 ---ET--TEPEPGLGEKLEDEN 237 (237) Q Consensus 221 ---~~--~e~~~~~~~~~~~e~ 237 (237) .. ...-++.+++..+|. T Consensus 458 ~~~~~~~~~~~~~~~~~~~~e~ 479 (479) T protein:vir:79 458 EDTQKEYDDLIPNNQDGVIDET 479 (479) T ss_pred HHHHHHHHhccCcccCCCcCcC Confidence 00 011122333333333 No 160 >protein:vir:96366 Length: 511 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1613 # MgeName: 53 # Cross-refs: genbank:acc:YP_239644;genbank:gi:66395376;genbank:GeneID:5132842 Probab=97.55 E-value=1.6e-05 Score=46.88 Aligned_cols=227 Identities=14% Similarity=0.088 Sum_probs=114.1 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHhccceeechhHHHhhcCCchHHHHHHHHHHHHHhcC--chheeeeecCCcceeee Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRRKQQAVWKVKGLAEMCDDDDAQYAARLRLAQVDDNSG--VGRAIGIDAETEEYDVL 78 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~~~~~v~k~~~l~~~~~~~~~e~~~~~r~~~~~~~r~--~~~~~~iD~~~e~~~~~ 78 (237) .++.+.+.+.+++.+....+.-+..++...+.+.|.... ........-..++......+- ..+.-.-++-+-.|-.. T Consensus 257 d~e~v~~liDa~~~~~S~~~~~~~~~~~~~lv~~G~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~ 335 (511) T protein:vir:96 257 DYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLNL-DPVEVRKQKEANVLFLEPTVYVDAEGRETEGSVDGGYIYK 335 (511) T ss_pred chhhhHHHHHHHHHHHHHHHHHHHHhhcchhheecCccC-CchhhcccccccceeccccceeccccccCCCCcceeEEee Confidence 778888899999999888888888777776666552111 111100000011111000000 00000011111235556 Q ss_pred ecCcCCHHHHHHHHHHHHhhhhcCceeeeeccCcccccccchhHHHHHHHHH--HHHHHHhhhHHHHHHHHHhhc----- Q lcl|NC_019725. 79 NSDISGVPEFLSSKMDRIVSLSGIHEIIIKNKNVGGVSASQNTALETFYKLV--DRKREEDYRPLLEFLLPFIVE----- 151 (237) Q Consensus 79 ~~~lsGl~dl~~~~~~~iaa~s~iP~t~L~G~sp~GlnatGe~D~~nyyd~I--~~~Qe~~l~p~l~~l~~~i~~----- 151 (237) +.+.+++...++.+...|...+++|-.-.-+.+ | |.||..=..-|.... ...++..++..|++++.+++. T Consensus 336 ~~~~~~~e~~~~~L~~~I~~~s~~P~~~~~~~~-~--n~Sg~Al~~~~~~l~~ka~~~~~~f~~~l~~~~~li~~~~~~~ 412 (511) T protein:vir:96 336 QYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFS-G--TQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNT 412 (511) T ss_pred cCCHHHHHHHHHHHHHHHHHHhCCccccccccc-c--ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc Confidence 777899999999999999999999976332222 2 446665434444333 344556788899988887642 Q ss_pred -----C---CCceeEeCCCCCCCHHHHHHHHHHHHHHHH---HHHhCCCC-CHHHHHHHHHhhcccc------ccC-CCC Q lcl|NC_019725. 152 -----E---EEWSIEFEPLSVPSKKEESEITKNNVESVT---KAITEQII-DLEEARDTLRSIAPEF------KLK-DGN 212 (237) Q Consensus 152 -----s---~~~~~~f~pL~~~seke~Aei~~~~A~a~~---~~~~~g~i-~~~e~r~~l~~~~~~~------g~~-~~~ 212 (237) . .++++.|+|-...+.++.|++..+.+..++ .+-..+.+ ++++-.+++.+..... ... ... T Consensus 413 ~~~~~~~~~~~i~~~f~~~~p~n~~e~~d~~~kl~G~iS~et~l~~l~~v~d~~~El~ri~~E~~~~~~~~~~~~~~~~~ 492 (511) T protein:vir:96 413 RSIDANKDFNTVRYVYNRNLPKSLIEELKAYIDSGGKISQTTLMSLFSFFQDPELEVKKIEEDEKESIKKAQKGIYKDPR 492 (511) T ss_pred CCCccccccccceEEeCCCCCcCHHHHHHHHHHHhccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHhhccccCCC Confidence 1 257899999999999999887665542221 11122222 2333333332211000 000 000 Q ss_pred CCChhccccCCCCCCCCCCCc Q lcl|NC_019725. 213 NINIREPEETTEPEPGLGEKL 233 (237) Q Consensus 213 ~~~~~~~e~~~e~~~~~~~~~ 233 (237) ...+.+. .++.++...+++ T Consensus 493 ~~~~~~~--~~~~~~~~~e~~ 511 (511) T protein:vir:96 493 DINDDEQ--DDDTKDTVDKKE 511 (511) T ss_pred CCCCCCC--CCCccCcccccC Confidence 0111111 111111111111 No 161 >protein:vir:78805 Length: 511 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1858 # MgeName: 80alpha # Cross-refs: genbank:acc:YP_001285356;genbank:gi:148717884;genbank:GeneID:5246936 Probab=97.55 E-value=1.6e-05 Score=46.88 Aligned_cols=227 Identities=14% Similarity=0.088 Sum_probs=114.1 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHhccceeechhHHHhhcCCchHHHHHHHHHHHHHhcC--chheeeeecCCcceeee Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRRKQQAVWKVKGLAEMCDDDDAQYAARLRLAQVDDNSG--VGRAIGIDAETEEYDVL 78 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~~~~~v~k~~~l~~~~~~~~~e~~~~~r~~~~~~~r~--~~~~~~iD~~~e~~~~~ 78 (237) .++.+.+.+.+++.+....+.-+..++...+.+.|.... ........-..++......+- ..+.-.-++-+-.|-.. T Consensus 257 d~e~v~~liDa~~~~~S~~~~~~~~~~~~~lv~~G~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~ 335 (511) T protein:vir:78 257 DYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLNL-DPVEVRKQKEANVLFLEPTVYVDAEGRETEGSVDGGYIYK 335 (511) T ss_pred chhhhHHHHHHHHHHHHHHHHHHHHhhcchhheecCccC-CchhhcccccccceeccccceeccccccCCCCcceeEEee Confidence 778888899999999888888888777776666552111 111100000011111000000 00000011111235556 Q ss_pred ecCcCCHHHHHHHHHHHHhhhhcCceeeeeccCcccccccchhHHHHHHHHH--HHHHHHhhhHHHHHHHHHhhc----- Q lcl|NC_019725. 79 NSDISGVPEFLSSKMDRIVSLSGIHEIIIKNKNVGGVSASQNTALETFYKLV--DRKREEDYRPLLEFLLPFIVE----- 151 (237) Q Consensus 79 ~~~lsGl~dl~~~~~~~iaa~s~iP~t~L~G~sp~GlnatGe~D~~nyyd~I--~~~Qe~~l~p~l~~l~~~i~~----- 151 (237) +.+.+++...++.+...|...+++|-.-.-+.+ | |.||..=..-|.... ...++..++..|++++.+++. T Consensus 336 ~~~~~~~e~~~~~L~~~I~~~s~~P~~~~~~~~-~--n~Sg~Al~~~~~~l~~ka~~~~~~f~~~l~~~~~li~~~~~~~ 412 (511) T protein:vir:78 336 QYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFS-G--TQSGEAMKYKLFGLEQRTKTKEGLFTKGLRRRAKLLETILKNT 412 (511) T ss_pred cCCHHHHHHHHHHHHHHHHHHhCCccccccccc-c--ccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc Confidence 777899999999999999999999976332222 2 446665434444333 344556788899988887642 Q ss_pred -----C---CCceeEeCCCCCCCHHHHHHHHHHHHHHHH---HHHhCCCC-CHHHHHHHHHhhcccc------ccC-CCC Q lcl|NC_019725. 152 -----E---EEWSIEFEPLSVPSKKEESEITKNNVESVT---KAITEQII-DLEEARDTLRSIAPEF------KLK-DGN 212 (237) Q Consensus 152 -----s---~~~~~~f~pL~~~seke~Aei~~~~A~a~~---~~~~~g~i-~~~e~r~~l~~~~~~~------g~~-~~~ 212 (237) . .++++.|+|-...+.++.|++..+.+..++ .+-..+.+ ++++-.+++.+..... ... ... T Consensus 413 ~~~~~~~~~~~i~~~f~~~~p~n~~e~~d~~~kl~G~iS~et~l~~l~~v~d~~~El~ri~~E~~~~~~~~~~~~~~~~~ 492 (511) T protein:vir:78 413 RSIDANKDFNTVRYVYNRNLPKSLIEELKAYIDSGGKISQTTLMSLFSFFQDPELEVKKIEEDEKESIKKAQKGIYKDPR 492 (511) T ss_pred CCCccccccccceEEeCCCCCcCHHHHHHHHHHHhccCChHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHhhccccCCC Confidence 1 257899999999999999887665542221 11122222 2333333332211000 000 000 Q ss_pred CCChhccccCCCCCCCCCCCc Q lcl|NC_019725. 213 NINIREPEETTEPEPGLGEKL 233 (237) Q Consensus 213 ~~~~~~~e~~~e~~~~~~~~~ 233 (237) ...+.+. .++.++...+++ T Consensus 493 ~~~~~~~--~~~~~~~~~e~~ 511 (511) T protein:vir:78 493 DINDDEQ--DDDTKDTVDKKE 511 (511) T ss_pred CCCCCCC--CCCccCcccccC Confidence 0111111 111111111111 No 162 >protein:vir:2341 Length: 488 # NCBI annotation: gp11 # Family: family:all:524 # MgeID: mge:51 # MgeName: Bxb1 # Cross-refs: genbank:acc:NP_075278;genbank:gi:12657865;genbank:GeneID:920078 Probab=97.53 E-value=1.6e-05 Score=46.85 Aligned_cols=219 Identities=11% Similarity=0.073 Sum_probs=105.8 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHhccceeechhHH--Hhh-cCCchHHHHHHHHHHHHHhcCchheeeeecCCcc--e Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRRKQQAVWKVKGLA--EMC-DDDDAQYAARLRLAQVDDNSGVGRAIGIDAETEE--Y 75 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~~~~~v~k~~~l~--~~~-~~~~~e~~~~~r~~~~~~~r~~~~~~~iD~~~e~--~ 75 (237) |.+.+.+.+.+++++......-+.-+....+.+.|+. ... ....+. ..+.... ..++++. ++++ | T Consensus 235 i~~~v~~l~Da~~~~~s~~~~~~~~~a~p~~~i~G~~~~~~~~~~~~~~-------~~~~~~~--~~v~~~~-~g~~~~~ 304 (488) T protein:vir:23 235 ISPELRSVTDAAAQILMNMQGTANLMAIPQRLIFGAKPEELGINAETGQ-------RMFDAYM--ARILAFE-GGEGAHA 304 (488) T ss_pred hhhhHHHHHHHHHHHHHHHHHHHHHhhhHHHHHhCCCcccccccccccc-------hhhhhhh--hhhccCC-CCCCcee Confidence 4455667777788777776665555444333333321 111 111111 1111111 1122332 2333 3 Q ss_pred eee-ecCcCCHHHHHHHHHHHHhhhhcCceeeeeccCcccccccchhHHHHHHHHH--HHHHHHhhhHHHHHHHHHhhcC Q lcl|NC_019725. 76 DVL-NSDISGVPEFLSSKMDRIVSLSGIHEIIIKNKNVGGVSASQNTALETFYKLV--DRKREEDYRPLLEFLLPFIVEE 152 (237) Q Consensus 76 ~~~-~~~lsGl~dl~~~~~~~iaa~s~iP~t~L~G~sp~GlnatGe~D~~nyyd~I--~~~Qe~~l~p~l~~l~~~i~~s 152 (237) -++ .+++.+..+.+.....++++.+++|...| |.+..+ ++||+.=...|...+ ...++..+.+.|.+++.+++.- T Consensus 305 ~q~~~~~~~~~~~~l~~~i~~~~~~~~~p~~~~-g~~~~n-~~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~l~~~~ 382 (488) T protein:vir:23 305 EQFSAAELRNFVDALDALDRKAASYSGLPPQYL-SSSSDN-PASAEAIKAAESRLVKKVERKNKIFGGAWEQAMRLAYKM 382 (488) T ss_pred EecCCCChHHHHHHHHHHHHHHhcccCCCHHHh-ccccCc-chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 332 34556677888888999999999998766 433332 246765444444444 2344456888999998887521 Q ss_pred ----------CCceeEeCCCCCCCHHHHHHHHHHHHHHH-------HHHHhCCCCCHHHH-HHHHHhhccc------ccc Q lcl|NC_019725. 153 ----------EEWSIEFEPLSVPSKKEESEITKNNVESV-------TKAITEQIIDLEEA-RDTLRSIAPE------FKL 208 (237) Q Consensus 153 ----------~~~~~~f~pL~~~seke~Aei~~~~A~a~-------~~~~~~g~i~~~e~-r~~l~~~~~~------~g~ 208 (237) .++.+.|.+-..+|..+.|+...+-+++. ..+-..|.++.... .+++...-.+ .++ T Consensus 383 ~~~~~~~~~~~~i~v~f~~~~~~s~~~~ada~~kl~~~g~~~~s~et~~~~l~~~~d~~~~~~~~~~~~~~~~~~~~~~~ 462 (488) T protein:vir:23 383 VKGGDIPTEYYRMETVWRDPSTPTYAAKADAAAKLFANGAGLIPRERGWVDMGYTIVEREQMRQWLEQDQKQGLGLIGSL 462 (488) T ss_pred hcCCCcchhhccceEEecCCCCCCHHHHHHHHHHHHhcccccCCHHHHHHhCCCCchHHHHHHHHHHHHHHHHHHHHHHH Confidence 36889999999999999988766654432 11222354432211 1111111000 011 Q ss_pred CCCCCCChhccccCCCCCCCCCCCcCcCC Q lcl|NC_019725. 209 KDGNNINIREPEETTEPEPGLGEKLEDEN 237 (237) Q Consensus 209 ~~~~~~~~~~~e~~~e~~~~~~~~~~~e~ 237 (237) ... ....+..... +.|+..+.|- T Consensus 463 ~~~-~~~~~~~~~~-----~~~~~~~~e~ 485 (488) T protein:vir:23 463 YGA-STPEGKPGEA-----PVGEPPAPEP 485 (488) T ss_pred hcc-CCCcccCCCC-----CCCCCCCCCC Confidence 000 0001110011 1111111111 No 163 >protein:vir:107880 Length: 491 # NCBI annotation: gp29 # Family: family:all:313 # MgeID: mge:1565 # MgeName: BcepMu # Cross-refs: genbank:acc:YP_024702;genbank:gi:48696939;genbank:GeneID:2845968 Probab=97.52 E-value=5.1e-05 Score=44.11 Aligned_cols=208 Identities=12% Similarity=0.052 Sum_probs=104.7 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHhcc--ceeechhHHHhhcCCchHHHHHHHHHHHHHhcCchheeeeecCCcceeee Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRRKQQ--AVWKVKGLAEMCDDDDAQYAARLRLAQVDDNSGVGRAIGIDAETEEYDVL 78 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~~~~--~v~k~~~l~~~~~~~~~e~~~~~r~~~~~~~r~~~~~~~iD~~~e~~~~~ 78 (237) |+..||..+.--..+....+..+.++.+ .+.|++. +..+....+=++.+..+.+ .+.++|.. +.+++.+ T Consensus 194 Ll~~~~w~~~fK~~~~~~w~~f~E~yG~P~~igky~~-------~a~~~ek~~l~~al~~~~~-~a~~viP~-~~~ie~~ 264 (491) T protein:vir:10 194 DLSMCFWPTTFKKGGLKFWVQFTEKYGSPMLVGKHPR-------SASDGEKNLLLDCLEDMVQ-DAVAVVPD-DSSIEIK 264 (491) T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHcCCCeEEEecCC-------CCCHHHHHHHHHHHHHHhc-CcEEEecC-CceeEEE Confidence 8888887776666677777788888884 4566542 1111112222233333333 34555654 6889998 Q ss_pred ecCcCC-----HHHHHHHHHHHHh-hhhcCceeeeeccCcccccccchhHHHHHHHHHHHHHHHhhhHHHHHHHHHhhcC Q lcl|NC_019725. 79 NSDISG-----VPEFLSSKMDRIV-SLSGIHEIIIKNKNVGGVSASQNTALETFYKLVDRKREEDYRPLLEFLLPFIVEE 152 (237) Q Consensus 79 ~~~lsG-----l~dl~~~~~~~ia-a~s~iP~t~L~G~sp~GlnatGe~D~~nyyd~I~~~Qe~~l~p~l~~l~~~i~~s 152 (237) +..-++ -..+++..-..|+ +..|--+| ..+ +|=.|.|+--.....+.+.+.. ..+...+.+|+.-++.- T Consensus 265 ea~~~~g~~~~y~~li~~~d~~Isk~iLGqtlT---t~~-~gs~a~~~vh~~v~~di~~~D~-~~i~~tln~li~~l~~~ 339 (491) T protein:vir:10 265 EAAGKTGSADVYERLLHFCRGEVSIALLGQNQT---TEA-TSTRASAQAGLEVTDDIRDGDK-AVVSEAMNMLIRWICDL 339 (491) T ss_pred ecCCCCCChhHHHHHHHHHHHHHHHHHhhhhcc---cCc-ccchhHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHh Confidence 876432 3446666666776 33333333 122 3333455555556666666655 34555566666544421 Q ss_pred -----CCceeEeCCCCCCCHHHHHHHHHHHHHHHHHHHhCCC-CCHHHHHHHHHhhccccccCCCCCCChhcccc--CCC Q lcl|NC_019725. 153 -----EEWSIEFEPLSVPSKKEESEITKNNVESVTKAITEQI-IDLEEARDTLRSIAPEFKLKDGNNINIREPEE--TTE 224 (237) Q Consensus 153 -----~~~~~~f~pL~~~seke~Aei~~~~A~a~~~~~~~g~-i~~~e~r~~l~~~~~~~g~~~~~~~~~~~~e~--~~e 224 (237) +...|.|. +.+ +..++.|++++++++.|+ |+.+.+++++ |+..........+.. ... T Consensus 340 N~~~~~~p~f~~~------~~~--e~~~~~a~~~~~L~~~G~~i~~~~i~e~~-------Gip~~~~~~~~~~~~~~~~~ 404 (491) T protein:vir:10 340 NFDGADRPVFDMW------EQE--QVDEIQAGRDQKLTQAGARFTPAYFKRAY-------NLQDGDLDERPLPVSAVDTV 404 (491) T ss_pred cCCCCCcceEEec------CcC--chhHHHHHHHHHHHhCCCcCCHHHHHHHh-------CCCCCCcCccccccCCCCCc Confidence 23344443 222 444678999999999998 7888887765 221100000000000 000 Q ss_pred CCCCCCCCcCcCC Q lcl|NC_019725. 225 PEPGLGEKLEDEN 237 (237) Q Consensus 225 ~~~~~~~~~~~e~ 237 (237) +.....+...... T Consensus 405 ~~~~~~~~~~~~~ 417 (491) T protein:vir:10 405 GAASFAEFEAPDQ 417 (491) T ss_pred ccccccccCCCCC Confidence 0000111111111 No 164 >protein:vir:8184 Length: 474 # NCBI annotation: gp4 # Family: family:all:524 # MgeID: mge:153 # MgeName: Che9d # Cross-refs: genbank:acc:NP_817977;genbank:gi:29566411;genbank:GeneID:2700965 Probab=97.45 E-value=2e-05 Score=46.35 Aligned_cols=212 Identities=8% Similarity=0.028 Sum_probs=105.1 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHhccceeechhHHH-hhc--CCchHHHHHHHHHHHHHhcCchheeeeecCC----- Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRRKQQAVWKVKGLAE-MCD--DDDAQYAARLRLAQVDDNSGVGRAIGIDAET----- 72 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~~~~~v~k~~~l~~-~~~--~~~~e~~~~~r~~~~~~~r~~~~~~~iD~~~----- 72 (237) +.+.+.+...++.++.....--.+-+......+-|+.. ... ++........++. .++.+.++. T Consensus 226 i~e~v~~l~da~~r~~~~~~~~~e~~a~pqr~i~G~~~~~~~d~d~~~~~~~~~~~~---------~i~~~~~d~d~~~~ 296 (474) T protein:vir:81 226 ITKPMMGLQDAGVRELARREGHMDVFSYPEFWLLGADESALKNADGTIKSVWEARLG---------RIKGLPDDADADIP 296 (474) T ss_pred cchhHHHHHHHHHHHHHHHHHHHHHhcchhheeecCChhhcccccccccchhhhhHH---------HHhcCCCccccccc Confidence 66778787777777766544433333333333333321 111 1222222222222 123333321 Q ss_pred ----ccee-eeecCcCCHHHHHHHHHHHHhhhhcCceeee-eccCcccccccchhHHHH---HHHHHHHHHHHhhhHHHH Q lcl|NC_019725. 73 ----EEYD-VLNSDISGVPEFLSSKMDRIVSLSGIHEIII-KNKNVGGVSASQNTALET---FYKLVDRKREEDYRPLLE 143 (237) Q Consensus 73 ----e~~~-~~~~~lsGl~dl~~~~~~~iaa~s~iP~t~L-~G~sp~GlnatGe~D~~n---yyd~I~~~Qe~~l~p~l~ 143 (237) -++. .-++++.+.-+.+.....++|+.++||..-| ++ +..+- +|+++=... ....++.+| ..+.+.++ T Consensus 297 ~~~~~~~~q~~~a~l~~~~~~l~~~~~~~a~~t~iP~~~lG~~-~~~np-~SaeAi~a~~~~l~~kae~k~-~~fg~~l~ 373 (474) T protein:vir:81 297 QLARADVKQFPAASPDAHWSDINGLAKLFAREASLPDTAVAIS-GLSNP-TSAESYDASQYELIAEAEGAV-DDFTPALR 373 (474) T ss_pred ccccccccccCCCChhHHHHHHHHHHHHHHhhhCCCHHHhccc-ccccc-cHHHHHHHHHHHHHHHHHHHH-HHHHHHHH Confidence 1233 3357888999999999999999999998776 44 22221 345444333 344444444 44788899 Q ss_pred HHHHHhhcC-------------CCceeEeCCCCCCCHHHHHHHHHHHHHHHH------HHHh-CCCCCHHHHHHHHHhhc Q lcl|NC_019725. 144 FLLPFIVEE-------------EEWSIEFEPLSVPSKKEESEITKNNVESVT------KAIT-EQIIDLEEARDTLRSIA 203 (237) Q Consensus 144 ~l~~~i~~s-------------~~~~~~f~pL~~~seke~Aei~~~~A~a~~------~~~~-~g~i~~~e~r~~l~~~~ 203 (237) +++.+.+.- ..+.+.|.+...+|..++|+...|.+++.. .+.. .| ++++++.+..+.+- T Consensus 374 ~~~rla~~i~~~~~~~~~~~~~~~~~v~W~d~~~~s~a~~aDa~~Kl~~a~~~~~~~~~~~~~lg-~t~~~i~~~~~~~~ 452 (474) T protein:vir:81 374 KAFIRALAMKNKVAIDEIPDEWKSIDAKWRDPRYLSKSAQADAGMKQLAAVPWLAETEVGLELIG-LTPQQARRAMADKR 452 (474) T ss_pred HHHHHHHHHhCCCCccccchhhccceeEecCCCccCHHHHHHHHHHHHhcccCCCcHHHHHhhcC-CCHHHHHHHHHHHH Confidence 888876421 145678999999999988766655555431 1111 23 34444432211100 Q ss_pred cccccCCCCCCChhccccCCCCCCCCCC Q lcl|NC_019725. 204 PEFKLKDGNNINIREPEETTEPEPGLGE 231 (237) Q Consensus 204 ~~~g~~~~~~~~~~~~e~~~e~~~~~~~ 231 (237) ...+.. .-+ .. -+...+.+..+ T Consensus 453 ~~~~~~----~~~-~l-~~~~~~~~~aq 474 (474) T protein:vir:81 453 RVQGRG----TLQ-AL-IDRSNNGATAQ 474 (474) T ss_pred HHhHHH----HHH-HH-HhcCCCCCCCC Confidence 001100 000 00 00011111111 No 165 >protein:vir:79703 Length: 505 # NCBI annotation: minor structural protein gp61 # Family: family:all:898 # MgeID: mge:1873 # MgeName: LL-H # Cross-refs: genbank:acc:YP_001285880;genbank:gi:148750838;genbank:GeneID:5220405 Probab=97.44 E-value=5.5e-06 Score=49.40 Aligned_cols=214 Identities=11% Similarity=0.083 Sum_probs=113.5 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHhccceeechhHHHhhcCCchHHHHHHHHHHHHHhcCchheeeeecCCcceeeeec Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRRKQQAVWKVKGLAEMCDDDDAQYAARLRLAQVDDNSGVGRAIGIDAETEEYDVLNS 80 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~~~~~v~k~~~l~~~~~~~~~e~~~~~r~~~~~~~r~~~~~~~iD~~~e~~~~~~~ 80 (237) .+..+.+.+...+.+......-+...+..++=-..+-.....+.++.. .....++..-...+..+..|..+..|+.++. T Consensus 261 ~~~~~~~~id~lD~~~s~~~~e~~~g~~~i~v~~~~l~~~~~~~~~~~-~~~~~~fd~~~~~y~~~~~~~~~~~i~~~~~ 339 (505) T protein:vir:79 261 LIDNSYTVIDAINRTHDQFVDEVKKGQRRLIVPAEWLKTGSSYGGQAS-ETHPPMFDPDETVYQAMYGDASEVGFHDATS 339 (505) T ss_pred hhhhhHHHHHHHHHHHHHHHHHHHhcccceeechHHhcccCCCCcccc-cccccCCCccceeeeeccCCCCCCceEEecc Confidence 555666777888877777776665544444432221111111111100 0000011111111222233443456777777 Q ss_pred Cc--CCHHHHHHHHHHHHhhhhcCceeeeeccCcccccccchh---HHHHHHHHHHHHHHHhhhHHHHHHHHHhhc---- Q lcl|NC_019725. 81 DI--SGVPEFLSSKMDRIVSLSGIHEIIIKNKNVGGVSASQNT---ALETFYKLVDRKREEDYRPLLEFLLPFIVE---- 151 (237) Q Consensus 81 ~l--sGl~dl~~~~~~~iaa~s~iP~t~L~G~sp~GlnatGe~---D~~nyyd~I~~~Qe~~l~p~l~~l~~~i~~---- 151 (237) .+ ....+.++.+...++..+|++-.. ||...+|.. |+.. ....-|.++..+| ..++.+|+.|+..++. T Consensus 340 ~ir~e~~~~~l~~~l~~i~~~~g~s~~~-~~~~~~~~~-TAtei~s~~~~l~~t~~~~~-~~~~~al~~li~~i~~~~~~ 416 (505) T protein:vir:79 340 PIRVADYQATMDFFLREFENQTGLSQGT-FTTSPSGIQ-TATEVVTNNSQTYQTRSSYI-TQVEKTIKALTYAILELASV 416 (505) T ss_pred cCCHHHHHHHHHHHHHHHHHHhCCChhh-cCCCccccc-hHHHHHHHHhHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHH Confidence 65 445667777778888888887654 444555543 4432 3445677777777 4578888888777641 Q ss_pred ----------------CCCceeEeCCCCCCCHHHHHHHHHHHHHHHHHHHhCCCCCHHHHHHHHHhhccccccCCCC--- Q lcl|NC_019725. 152 ----------------EEEWSIEFEPLSVPSKKEESEITKNNVESVTKAITEQIIDLEEARDTLRSIAPEFKLKDGN--- 212 (237) Q Consensus 152 ----------------s~~~~~~f~pL~~~seke~Aei~~~~A~a~~~~~~~g~i~~~e~r~~l~~~~~~~g~~~~~--- 212 (237) ..+++|.|+.-...++.+.++ ....++.+|++|.+.++... +|+.+.. T Consensus 417 ~~~~~~g~~~~~~~~~~~~i~v~f~d~i~~d~~~~~~-------~~~~~v~~Gi~s~e~~l~~~------~~~~eeea~~ 483 (505) T protein:vir:79 417 PSFYADGQARWTGDVDSLDITINFNDGVFVDQESKRA-------ADLQAVQAQVMPKKQFLMRN------YGLDEEEADE 483 (505) T ss_pred hcccccccccccCCCCceeEEEEeCCCCCCCHHHHHH-------HHHHHHHcCCCCHHHHHHhc------CCCChHHHHH Confidence 126889999988888766543 45566778888887666432 2222100 Q ss_pred ---CCChhccccCCCCCCCCCC Q lcl|NC_019725. 213 ---NINIREPEETTEPEPGLGE 231 (237) Q Consensus 213 ---~~~~~~~e~~~e~~~~~~~ 231 (237) .+..|.....|+....+|| T Consensus 484 el~ri~~E~~~~~p~~~~~gg~ 505 (505) T protein:vir:79 484 WLAQIDAENSTAEPEFNQFGGD 505 (505) T ss_pred HHHHHHHhccccCCCchhccCC Confidence 0111111123333344455 No 166 >protein:vir:94546 Length: 506 # NCBI annotation: minor head protein # Family: family:all:125 # MgeID: mge:1510 # MgeName: phiJL-1 # Cross-refs: genbank:acc:YP_223886;genbank:gi:62327098;genbank:GeneID:5075562 Probab=97.40 E-value=7.6e-05 Score=43.17 Aligned_cols=214 Identities=12% Similarity=0.088 Sum_probs=109.9 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHhccceeechhHHHhhc----------CC--chH-HHHHHHHHHHHHhcCchheee Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRRKQQAVWKVKGLAEMCD----------DD--DAQ-YAARLRLAQVDDNSGVGRAIG 67 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~~~~~v~k~~~l~~~~~----------~~--~~e-~~~~~r~~~~~~~r~~~~~~~ 67 (237) .++.+.+-+.+++.+....+.-+..+...++.+.|...... .. .+. .....+...+..++. .+++. T Consensus 236 d~e~~~~liDa~d~~~S~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~ 314 (506) T protein:vir:94 236 DFENVLPLIDLYDAAQSDTANYMTDLNEAMLIIQGDIDTLFEGSDMMNTIDPNDEDAMAKLAKDKLELIKEMKD-ANMLL 314 (506) T ss_pred chhhhHHHHHHHHHHHHHHHHHHHHhhhHHHHHhcCccccccchhccccccccccccccccccchhHHHhhhhh-cCeee Confidence 66777777777777777777666655555544443221100 00 000 000112222222222 22333 Q ss_pred eecC----------CcceeeeecCcCCHHHHHHHHHHHHhhhhcCceeeeeccCcccccccchhHHHHHHHHHH--HHHH Q lcl|NC_019725. 68 IDAE----------TEEYDVLNSDISGVPEFLSSKMDRIVSLSGIHEIIIKNKNVGGVSASQNTALETFYKLVD--RKRE 135 (237) Q Consensus 68 iD~~----------~e~~~~~~~~lsGl~dl~~~~~~~iaa~s~iP~t~L~G~sp~GlnatGe~D~~nyyd~I~--~~Qe 135 (237) +... +-.|-..+.+..++...++.+...|...+++|-.-. + +-+ -|.||..=..-|..... ...+ T Consensus 315 ~~~~~~~~~~~~~~d~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~-~-~~~-~n~Sg~Aik~~~~~l~~k~~~k~ 391 (506) T protein:vir:94 315 LKSGMTVNGTQTSVDAKYINKTYDVVGSEAYKKRVAGDIHKFSHTPDLTD-E-NFA-SNSSGVAMQYKVLGTVELASTKR 391 (506) T ss_pred ecccccccCccccccceeeeecCCHHHHHHHHHHHHHHHHHHhCcccccc-c-ccc-ccchHHHHHHHHHHHHHHHHHHH Confidence 3221 123455667789999999999999999999996432 1 111 24566544444444332 4445 Q ss_pred HhhhHHHHHHHHHhhc---------C---CCceeEeCCCCCCCHHHHHHHHHHHHHHHHHHHhCCCCCHHHHHHHHHhhc Q lcl|NC_019725. 136 EDYRPLLEFLLPFIVE---------E---EEWSIEFEPLSVPSKKEESEITKNNVESVTKAITEQIIDLEEARDTLRSIA 203 (237) Q Consensus 136 ~~l~p~l~~l~~~i~~---------s---~~~~~~f~pL~~~seke~Aei~~~~A~a~~~~~~~g~i~~~e~r~~l~~~~ 203 (237) ..++..|++++.+++. + .+++|.|+|-...++.+.|++..+. .|+||.+.+...+- T Consensus 392 ~~~~~~l~~~~~li~~~~~~~~~~~~~d~~~i~i~f~~~~p~d~~e~a~~~~kl---------~g~iS~et~~~~lp--- 459 (506) T protein:vir:94 392 RMFERGLYARYQIISDIENSIHGDWTFDPQELTFTFRDNLPADNISQIKALVQA---------GATLPQKYLYQQLP--- 459 (506) T ss_pred HHHHHHHHHHHHHHHHHHHhcCCccccccccceEEeCCCCCcCHHHHHHHHHHH---------hccCChHHHHHhCC--- Confidence 6688888888877642 1 2567999999999999999876653 25666655554331 Q ss_pred cccccCCCCCCChhccc--------cCCCC--CCCC------C--CCcCcCC Q lcl|NC_019725. 204 PEFKLKDGNNINIREPE--------ETTEP--EPGL------G--EKLEDEN 237 (237) Q Consensus 204 ~~~g~~~~~~~~~~~~e--------~~~e~--~~~~------~--~~~~~e~ 237 (237) + .+ +.. ++.+ ..+.. .... + +.+.+|. T Consensus 460 ---~-v~--d~~-~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e 504 (506) T protein:vir:94 460 ---G-VT--NPQ-DIVDMMKEQSANGDYSFDQNGVISNDGQTNTTATQTDEE 504 (506) T ss_pred ---C-CC--CHH-HHHHHHHHHHHHHhhcchhhcCCCcccCccccccccccC Confidence 0 00 000 1100 00000 0000 0 1111111 No 167 >protein:vir:78907 Length: 518 # NCBI annotation: gp3 # Family: family:all:4147 # MgeID: mge:1859 # MgeName: A006 # Cross-refs: genbank:acc:YP_001468843;genbank:gi:157325445;genbank:GeneID:5601904 Probab=97.16 E-value=0.00015 Score=41.59 Aligned_cols=211 Identities=15% Similarity=0.110 Sum_probs=107.8 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHhccceeechhHHHhhcCCchHHHHHHHHHHHHHhcCchheee--eecCC---cce Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRRKQQAVWKVKGLAEMCDDDDAQYAARLRLAQVDDNSGVGRAIG--IDAET---EEY 75 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~~~~~v~k~~~l~~~~~~~~~e~~~~~r~~~~~~~r~~~~~~~--iD~~~---e~~ 75 (237) .+..+.+.+..++.+......-+...+..++=-..+-.-..++.+.... ..+...+..+..+- .|... +.+ T Consensus 270 ~~~~~~~~id~lD~~~s~~~~e~~~g~~~i~v~~~~l~~~~~~~~~~~~----~~fd~~~~~y~~i~~~~~~~~~~~~~i 345 (518) T protein:vir:78 270 DLSQCTNYLFAVDYFFTVYMREGEKTKTKIAASERMFRKKVNKSTDKEE----WSMNVDEDYFMQFKGTLDAGAKLNDMI 345 (518) T ss_pred hHhhhhHHHHHHHHHHHHHHHHHHhCCceeeechhHhccCCCCCCCccc----cccCCCCceEEEecCcCCCCCccccce Confidence 6667778888888888888887766555555333221111111111100 00111111111111 11111 236 Q ss_pred eeeecCcC--CHHHHHHHHHHHHhhhhcCceeeeeccCcccccccc-hhHHHHHHHHHHHHHHHhhhHHHHHHHHHhhc- Q lcl|NC_019725. 76 DVLNSDIS--GVPEFLSSKMDRIVSLSGIHEIIIKNKNVGGVSASQ-NTALETFYKLVDRKREEDYRPLLEFLLPFIVE- 151 (237) Q Consensus 76 ~~~~~~ls--Gl~dl~~~~~~~iaa~s~iP~t~L~G~sp~GlnatG-e~D~~nyyd~I~~~Qe~~l~p~l~~l~~~i~~- 151 (237) +.++..+- -....++.+...+...+|++-.. ||.+.+...||- .+...--|.+|..+| ..++++|++|+..++. T Consensus 346 ~~~~~~Ir~e~~~~~~~~~l~~~~~~~G~s~~t-fg~~~~~~TATei~s~~~~~~~t~~~~~-~~~e~al~~l~~~i~~l 423 (518) T protein:vir:78 346 QFMQGDFRDGSYRETMEYFAQKAVSKSGYNPAT-FNLGNREVKATEIWSLQDATVRKIEKKK-RLIQNVYEQMLWDFLYL 423 (518) T ss_pred eeeecccChHHHHHHHHHHHHHHHHhhCCChhh-cCcccccccHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHH Confidence 66665543 34556667777777788887665 476544333321 123344677787777 5688888888776641 Q ss_pred ---------------CCCceeEeCCCCCCCHHHHHHHHHHHHHHHHHHHhCCCCCHHHHHHHH-------------Hhhc Q lcl|NC_019725. 152 ---------------EEEWSIEFEPLSVPSKKEESEITKNNVESVTKAITEQIIDLEEARDTL-------------RSIA 203 (237) Q Consensus 152 ---------------s~~~~~~f~pL~~~seke~Aei~~~~A~a~~~~~~~g~i~~~e~r~~l-------------~~~~ 203 (237) .-+++|.|+.-...++++++++.. .++.+|++|.+++.+.+ +... T Consensus 424 ~~~~~~~~~~~~~~~~~~v~i~f~D~i~~D~~~~~~~~~-------~~v~aGimS~e~~i~~~~~~~~deea~~e~~ri~ 496 (518) T protein:vir:78 424 LTGGTNNKEKAIMRDEIRVIIEFPDPMSVNLNELSSTLN-------NMNSALAMSVEEKVKLIHPKWEDEEIQAEVKRIY 496 (518) T ss_pred HHhhcCccccccCCCceeEEEEeCCCCCCCHHHHHHHHH-------HHHhcCCCCHHHHHHHhCCCCCHHHHHHHHHHHH Confidence 025889999999999988877544 45566777766544432 2111 Q ss_pred cccccCCCCCCChhccccCCCCCCCCCCCcCcCC Q lcl|NC_019725. 204 PEFKLKDGNNINIREPEETTEPEPGLGEKLEDEN 237 (237) Q Consensus 204 ~~~g~~~~~~~~~~~~e~~~e~~~~~~~~~~~e~ 237 (237) .+.+.. ..++|++..| ++.+- T Consensus 497 ~E~~~~-----------~~~~p~~~~g--~~~~~ 517 (518) T protein:vir:78 497 LENAIG-----------EVPDPEAIGG--METKG 517 (518) T ss_pred HHhccc-----------CCCCCccccC--CCCCC Confidence 111110 1111211111 22222 No 168 >protein:vir:5839 Length: 533 # NCBI annotation: similar to portal vertex protein of head # Family: family:all:1036 # MgeID: mge:123 # MgeName: RM 378 # Cross-refs: genbank:acc:NP_835625;genbank:gi:30044028 Probab=96.98 E-value=2.3e-05 Score=46.04 Aligned_cols=214 Identities=14% Similarity=0.151 Sum_probs=103.6 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHhccceeechhHHHhhcCCchHHHHHHHHHHHHHhcCchheeeeecCCcce----- Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRRKQQAVWKVKGLAEMCDDDDAQYAARLRLAQVDDNSGVGRAIGIDAETEEY----- 75 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~~~~~v~k~~~l~~~~~~~~~e~~~~~r~~~~~~~r~~~~~~~iD~~~e~~----- 75 (237) -|..+-|+++.|--++.- +-+ ||-++ ..+|-. .+.+.=++ .+++.+|+. ++.|+..-++ T Consensus 229 QLkmiEDAlVIYRisRAP------eRR--vFYID-VGNlpk-~KAeqYl~---~im~k~kNk---lvYDa~TGev~ddrk 292 (533) T protein:vir:58 229 QLRLMEDALMLYRVVRSV------DRR--VFYVD-VGNVPP-DKINEYLT---NIAMQYKRD---YWVRNNQNQFLGIDN 292 (533) T ss_pred HHHHHHHHHHHHhhcCCh------hhe--EEEEe-ecCCCc-cCHHHHHH---HHHHhcccc---eEEeccCCeEeeccc Confidence 455555555544433321 112 33332 222211 11121111 111222221 2333332222 Q ss_pred ------------------------eeeec-CcCCHHHHHHHHHHHHhhhhcCceeeeeccCcccccccchh--HHHHHHH Q lcl|NC_019725. 76 ------------------------DVLNS-DISGVPEFLSSKMDRIVSLSGIHEIIIKNKNVGGVSASQNT--ALETFYK 128 (237) Q Consensus 76 ------------------------~~~~~-~lsGl~dl~~~~~~~iaa~s~iP~t~L~G~sp~GlnatGe~--D~~nyyd 128 (237) +.+.- +| |.-+=+..|+..+=-+.++|++||-.. +|++.+++= |.-.|.. T Consensus 293 ~m~~~sMlEDyWLpRReGgrgTEI~TLpGg~l-gemeDV~YF~kkLy~ALnVP~sRl~~e--~~fgr~~eItRDEiKF~K 369 (533) T protein:vir:58 293 YFSIESILKDYFIPRRGDRRAVEIDILQGSKV-DLAEDVEYMLNRLISALKVPKAFIGYE--GDVNAKNTLATQDIKFNN 369 (533) T ss_pred hhhhhhhHhhhcccccCCCccceeeecCCCCC-CcHHHHHHHHHHHHHHhCCCeeecCCC--CCCccchhhhHHHHHHHH Confidence 22221 23 344556789999999999999999544 467766654 6667999 Q ss_pred HHHHHHHHhhhHHHHHH--HHHhhcCCCceeEeCCCCCCCHHHHHHHHHHHHHHHHHHHhCCCCCHHHHHHHHHhhcc-- Q lcl|NC_019725. 129 LVDRKREEDYRPLLEFL--LPFIVEEEEWSIEFEPLSVPSKKEESEITKNNVESVTKAITEQIIDLEEARDTLRSIAP-- 204 (237) Q Consensus 129 ~I~~~Qe~~l~p~l~~l--~~~i~~s~~~~~~f~pL~~~seke~Aei~~~~A~a~~~~~~~g~i~~~e~r~~l~~~~~-- 204 (237) .|.++|.. +.+.+++- ++-++..++|.|+|+-=...+|-..++|...+..+++.+- +.|.-+=+++.+-.+.+ T Consensus 370 FI~rLR~r-F~~ll~~qLilk~iit~eew~~~f~~Dn~f~ElKe~Eil~~Ri~~l~~~d--pyvgk~yi~k~ILr~tdei 446 (533) T protein:vir:58 370 TIKRIQGF-FVEELERMVRMNKEFADQDFRLVMNRSNSIVEGERFAVIEQRIGIAERLK--GWVREDWIYSNILQIPYDL 446 (533) T ss_pred HHHHHHHH-HHHHHhcccccccCcchhheeeeeeccchHHHHHHHHHHHHHHHHHHHhc--chhhHHHHHHHHhcCChhh Confidence 99999965 44555442 2334566899999999999999999999998888776643 23333444433211111 Q ss_pred -----------ccccCCCCCCChhcccc----------------------CCCCCCCCCCCcCcCC Q lcl|NC_019725. 205 -----------EFKLKDGNNINIREPEE----------------------TTEPEPGLGEKLEDEN 237 (237) Q Consensus 205 -----------~~g~~~~~~~~~~~~e~----------------------~~e~~~~~~~~~~~e~ 237 (237) ..|.++..+. .+++.. ..++.+..|-.++... T Consensus 447 ~~q~e~ie~E~~~~~~~~~~~-~~e~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 511 (533) T protein:vir:58 447 KPQEEVAEAAGGGGLFDTGGF-GEETTPADFLGERGSPIESPRGRTEFDFGTEGGEELGGELNLGG 511 (533) T ss_pred hHHHHHHHHhhcCCCCCCCCc-ccccCCcccCccccCcccCCCChhhHhcccCCcccccccccccc Confidence 1122211111 011000 0000000011110000 No 169 >protein:vir:80680 Length: 441 # NCBI annotation: gp3 # Family: family:all:524 # MgeID: mge:1884 # MgeName: PA6 # Cross-refs: genbank:acc:YP_001285579;genbank:gi:148727085;genbank:GeneID:5247051 Probab=96.96 E-value=9e-05 Score=42.78 Aligned_cols=207 Identities=10% Similarity=0.009 Sum_probs=102.1 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHhccceeechhHHHhhcCCchHHHHHHHHHHHHHhcCchheeeeecC--Ccceeee Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRRKQQAVWKVKGLAEMCDDDDAQYAARLRLAQVDDNSGVGRAIGIDAE--TEEYDVL 78 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~~~~~v~k~~~l~~~~~~~~~e~~~~~r~~~~~~~r~~~~~~~iD~~--~e~~~~~ 78 (237) |.+.+.+.+.+++.+.......+..+....+.+.|.. + ...... ..+. ...+++.++++ ++..... T Consensus 209 l~~~v~~liDa~~~~~s~~~~~~~~~~~~~~~i~G~~-~--~~~~~~--~~~~-------~~~~i~~~~~~~~~~~~~~~ 276 (441) T protein:vir:80 209 ITRSIRAYTDEAVRTLLGQSVNRDFYAYPQRWVTGVS-A--DEFSQP--GWVL-------SMASVWAVDKDDDGDTPNVG 276 (441) T ss_pred chhhHHHHHHHHHHHHHHHHHHHHhhcCceeeeecCC-c--cccccc--hhhh-------cccccccCCCCCCCCcceeE Confidence 4456777777888888777666666655555554421 0 000000 0111 11233444432 2223333 Q ss_pred e---cCcCCHHHHHHHHHHHHhhhhcCceeeeeccCcccccccchhHHHHHHHHH--HHHHHHhhhHHHHHHHHHhhcC- Q lcl|NC_019725. 79 N---SDISGVPEFLSSKMDRIVSLSGIHEIIIKNKNVGGVSASQNTALETFYKLV--DRKREEDYRPLLEFLLPFIVEE- 152 (237) Q Consensus 79 ~---~~lsGl~dl~~~~~~~iaa~s~iP~t~L~G~sp~GlnatGe~D~~nyyd~I--~~~Qe~~l~p~l~~l~~~i~~s- 152 (237) + .++....+.+......+++.++||...| |.++.+ ++||+.=...|...+ ...++..+++.|.+++.+++.- T Consensus 277 ~~~~~~~~~~~~~l~~~i~~~~~~~~~p~~~~-g~~~~~-~~Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~l~~~~~ 354 (441) T protein:vir:80 277 SFPVNSPTPYSDQMRLLAQLTAGEAAVPERYF-GFITSN-PPSGEALAAEESRLVKRAERRQTSFGQGWLSVGFLAAKAL 354 (441) T ss_pred ecCccchHHHHHHHHHHHHHHhcccCCCHHHh-ccCCCc-chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh Confidence 3 3344455566677899999999997555 655443 346764443333333 3445567888999988887531 Q ss_pred ----------CCceeEeCCCCCCCHHHHHHHHHHHHHH-------HHHHHhCCCCCHHHHHHHHHhhccccccCCC-CCC Q lcl|NC_019725. 153 ----------EEWSIEFEPLSVPSKKEESEITKNNVES-------VTKAITEQIIDLEEARDTLRSIAPEFKLKDG-NNI 214 (237) Q Consensus 153 ----------~~~~~~f~pL~~~seke~Aei~~~~A~a-------~~~~~~~g~i~~~e~r~~l~~~~~~~g~~~~-~~~ 214 (237) .++++.|+|-...+.+|.|+...+.+++ -..+-..|.+ ++++.+..+........... ... T Consensus 355 ~~~~~~~~~~~~i~~~f~~~~~~~~~e~ad~~~kl~~~g~~~~s~~~~~~~l~~~-~~e~~~~~~e~~e~~~~~~~~~~~ 433 (441) T protein:vir:80 355 DSRVDEADFFGDVGLRWRDASTPTRAATADAVTKLVGAGILPADSRTVLEMLGLD-DVQVEAVMRHRAESSDPLAVLAGA 433 (441) T ss_pred cCCCcccccceeeeEEeCCCCCcCHHHHHHHHHHHHhcCcccccHHHHHHhCCCC-HHHHHHHHHHHHHHHHHHHHHhhh Confidence 2568899999999998887765444332 1112222322 23333211110000000000 000 Q ss_pred ChhccccC Q lcl|NC_019725. 215 NIREPEET 222 (237) Q Consensus 215 ~~~~~e~~ 222 (237) ....+++- T Consensus 434 ~~~~~~~~ 441 (441) T protein:vir:80 434 ISRQTNEV 441 (441) T ss_pred hhcccccC Confidence 00000000 No 170 >protein:vir:733 Length: 453 # NCBI annotation: minor structural protein 1 # Family: family:all:125 # MgeID: mge:14 # MgeName: Tuc2009 # Cross-refs: genbank:acc:NP_108710;genbank:gi:13487832;genbank:GeneID:920851 Probab=96.94 E-value=0.00023 Score=40.55 Aligned_cols=209 Identities=12% Similarity=0.056 Sum_probs=111.3 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHhccceeechhHHHhhcCCchHHHHHHHHHHHHHh---cCchheeeeecCCcceee Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRRKQQAVWKVKGLAEMCDDDDAQYAARLRLAQVDDN---SGVGRAIGIDAETEEYDV 77 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~~~~~v~k~~~l~~~~~~~~~e~~~~~r~~~~~~~---r~~~~~~~iD~~~e~~~~ 77 (237) ..+.+.+.+.+++.+....+.-+..+....+.+.|.. + ....... .++...+... .+..+....+++ -.|-. T Consensus 219 ~~~~v~~liDa~~~~~S~~~~~~~~~~~~~l~~~g~~-~-~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~d-~~~l~ 293 (453) T protein:vir:73 219 IFEPVHSLINSYNKVTSEKANDVEYFSDQYLVFLGAE-V-DEEDAKN--IKDNRLINFFDKNSNGQGTNAAKVD-VKFLD 293 (453) T ss_pred chhhHHHHHHHHHHHHHHHHHHHHHhccceeeeecCC-C-Cchhhhc--ccccccccccccccccccccccCce-eEEee Confidence 6677888888999998888888877777666665421 1 0111000 0111111111 011111111111 23555 Q ss_pred eecCcCCHHHHHHHHHHHHhhhhcCceeeeeccCcccccccchhHHHHHHHHH--HHHHHHhhhHHHHHHHHHhhc---- Q lcl|NC_019725. 78 LNSDISGVPEFLSSKMDRIVSLSGIHEIIIKNKNVGGVSASQNTALETFYKLV--DRKREEDYRPLLEFLLPFIVE---- 151 (237) Q Consensus 78 ~~~~lsGl~dl~~~~~~~iaa~s~iP~t~L~G~sp~GlnatGe~D~~nyyd~I--~~~Qe~~l~p~l~~l~~~i~~---- 151 (237) .+.+.+++...++.+...|...|++|-.-.-+. | |+||+.=...|...+ ...++..++..|++++.+++. T Consensus 294 ~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~--g--n~Sg~Al~~~~~~l~~ka~~~~~~~~~~l~~~~~li~~~~~~ 369 (453) T protein:vir:73 294 KPDSDVQTENLLNRLERSIFQFTMAANISDENF--G--NSSGVALAYKLQAMSNLALSFQRKFQSALNRRYSLWSSLSTN 369 (453) T ss_pred ecCCHHHHHHHHHHHHHHHHHHhCCcccCcccc--c--CccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc Confidence 677788999999999999999999996433221 2 456654444444333 333445678888888877642 Q ss_pred C------CCceeEeCCCCCCCHHHHHHHHHHHHHHHHHHHhCCCCCHHHHHHHHHhhccccccCCCCCCChhccc---c- Q lcl|NC_019725. 152 E------EEWSIEFEPLSVPSKKEESEITKNNVESVTKAITEQIIDLEEARDTLRSIAPEFKLKDGNNINIREPE---E- 221 (237) Q Consensus 152 s------~~~~~~f~pL~~~seke~Aei~~~~A~a~~~~~~~g~i~~~e~r~~l~~~~~~~g~~~~~~~~~~~~e---~- 221 (237) . .++++.|+|-...++++.|++..+.+ |+||.+.+.+.+- ..+ ++ .++++ + T Consensus 370 ~~~~~~~~~i~v~f~~~~p~~~~~~a~~~~k~~---------giis~et~~~~~~-------~~~--d~-~~E~~ri~~E 430 (453) T protein:vir:73 370 ASNKDAWKDIEYTFTRNEPKDIKEQAETANILK---------GITSEETALSVIS-------VIP--DV-QAEMEKIKKK 430 (453) T ss_pred cCCccccccceEEeCCCCCCCHHHHHHHHHHHh---------ccCcHHHHHHhCC-------CCC--CH-HHHHHHHHHH Confidence 1 36789999999999999988754432 5677655544321 011 11 11111 0 Q ss_pred ----CCC-CCCCCCCCcC-cCC Q lcl|NC_019725. 222 ----TTE-PEPGLGEKLE-DEN 237 (237) Q Consensus 222 ----~~e-~~~~~~~~~~-~e~ 237 (237) ..- ....+-.+.+ ..| T Consensus 431 ~~~~~~~~~~~~~~~~~~~~~~ 452 (453) T protein:vir:73 431 KLLQLSLTRTSNLVRMKQMRGN 452 (453) T ss_pred HHHHHHHHHhccCCcchhhhcC Confidence 000 0000000000 000 No 171 >protein:vir:79233 Length: 526 # NCBI annotation: portal protein # Family: family:all:313 # MgeID: mge:1867 # MgeName: Phage MP22 # Cross-refs: genbank:acc:YP_001469155;genbank:gi:157834998;genbank:GeneID:5648814 Probab=96.83 E-value=0.00031 Score=39.81 Aligned_cols=222 Identities=15% Similarity=0.046 Sum_probs=97.7 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHhc--cceeechhHHHhhcCCchHHHHHHHHHHHHHhcCchheeeeecCCcceeee Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRRKQ--QAVWKVKGLAEMCDDDDAQYAARLRLAQVDDNSGVGRAIGIDAETEEYDVL 78 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~~~--~~v~k~~~l~~~~~~~~~e~~~~~r~~~~~~~r~~~~~~~iD~~~e~~~~~ 78 (237) |+..||-.+.--..+...-+..+.++. +.+.|++. +..+.....=++.+..+.+ .+.++|- ++.+++.+ T Consensus 205 Llr~~~w~~~fK~~~~~~w~~F~E~yG~P~~igky~~-------~a~~~ek~~L~~av~~i~~-da~~iiP-~~~~ie~~ 275 (526) T protein:vir:79 205 LFRVLAWPYLFRHYATSDLAEMLEIYGLPIRLGKYPP-------GTADEEKATLLRAVTGLGH-AAAGIIP-ETMAIDFQ 275 (526) T ss_pred hHHHHHHHHHHHHhhHHHHHHHHHHcCCceEEEecCC-------CCCHHHHHHHHHHHHHHhc-CcEEEec-CCceeEEe Confidence 777666554433445555556777777 45666641 1112222222333444433 3444554 56889999 Q ss_pred ecCcCCH---HHHHHHHHHHHhhh-hcCceeeeeccCcccccccchhHHHHHHHHHHHHHHHhhhHHH-HHHHHHhhcCC Q lcl|NC_019725. 79 NSDISGV---PEFLSSKMDRIVSL-SGIHEIIIKNKNVGGVSASQNTALETFYKLVDRKREEDYRPLL-EFLLPFIVEEE 153 (237) Q Consensus 79 ~~~lsGl---~dl~~~~~~~iaa~-s~iP~t~L~G~sp~GlnatGe~D~~nyyd~I~~~Qe~~l~p~l-~~l~~~i~~s~ 153 (237) +..=+|. ..+++..-..||-+ .|--+|--.|..-+|=+|-|+--.....+.+.+-... +...| +.|+.-++.-. T Consensus 276 ea~~~~~~~f~~li~~~d~~Isk~iLGqtlTs~~~~g~~gS~a~g~vh~~v~~di~~aDa~~-i~~tln~~Li~~l~~~N 354 (526) T protein:vir:79 276 QAAQGSSEPFLAMMRQSEDAISKAVLGGTLTSTTSQSGGGAFALGQVHNEVRHDILASDARQ-LAATLSRDLLWPLLVLN 354 (526) T ss_pred ecCCCCHHHHHHHHHHHHHHHHHHHhhhhhccccccCcchhhhhHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHhC Confidence 8764443 22444445555532 2222222111111222333444455556655555543 33334 34655554321 Q ss_pred ---CceeEeCCCCCCCHHHHHHHHHHHHHHHHHHHhCCC-CCHHHHHHHHHhhccccccCCCCCCChhc-cccCCCCCCC Q lcl|NC_019725. 154 ---EWSIEFEPLSVPSKKEESEITKNNVESVTKAITEQI-IDLEEARDTLRSIAPEFKLKDGNNINIRE-PEETTEPEPG 228 (237) Q Consensus 154 ---~~~~~f~pL~~~seke~Aei~~~~A~a~~~~~~~g~-i~~~e~r~~l~~~~~~~g~~~~~~~~~~~-~e~~~e~~~~ 228 (237) .-.+.+-|-.....+|..++ ++.|++++.+++.|+ |+.+.+++.+.- +.. .++..+-... ........++ T Consensus 355 ~~~~~~~~~~p~~~~~~~e~eDl-~~~a~~~~~L~~~G~~i~~~~i~e~~gi--p~~--~~~e~~l~~~~~~~~~~~~~~ 429 (526) T protein:vir:79 355 RPGSPDVRRAPRLVFDLREQADI-TSMAQSIPALVNVGLEIPSAWVYDKLGI--PQP--AKNEPVLRPAAQPAILSRQHG 429 (526) T ss_pred CCCcCCccccceEEeCCCCcccH-HHHHHHHHHHHhCCCcCCHHHHHHHhCC--CCC--CCchhhccccCCccccccccc Confidence 11122233333333444444 467999999999998 888888887631 000 0000000000 0000000000 Q ss_pred CCC-----CcCcCC Q lcl|NC_019725. 229 LGE-----KLEDEN 237 (237) Q Consensus 229 ~~~-----~~~~e~ 237 (237) ... .....+ T Consensus 430 ~~~~~~~~~~~~~~ 443 (526) T protein:vir:79 430 QRVAALATIVGPRY 443 (526) T ss_pred cccccccccccccC Confidence 000 000000 No 172 >protein:vir:1587 Length: 508 # NCBI annotation: minor capsid protein # Family: family:all:898 # MgeID: mge:32 # MgeName: phig1e # Cross-refs: genbank:acc:NP_695169;swissprot:trembl:o03928;genbank:gi:23455800;interpro:IPR006432;uniprot:O03928;genbank:GeneID:955566 Probab=96.82 E-value=0.00013 Score=41.83 Aligned_cols=208 Identities=13% Similarity=0.082 Sum_probs=112.5 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHhccceeechhHHHhhc-CCchHHHHHHHHHHHHHhcCchheeeeecC-Ccceeee Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRRKQQAVWKVKGLAEMCD-DDDAQYAARLRLAQVDDNSGVGRAIGIDAE-TEEYDVL 78 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~~~~~v~k~~~l~~~~~-~~~~e~~~~~r~~~~~~~r~~~~~~~iD~~-~e~~~~~ 78 (237) .+..+.+.+...+.+......-++..+..++=-.. ++. ++++... +..-+..+..+-.|.+ +..++.+ T Consensus 263 ~~~~~~~lid~lD~~~s~~~~e~~~~~~~i~v~~~---~l~~d~~~~~~-------~~~~~~~~~~~~~~~~~~~~i~~~ 332 (508) T protein:vir:15 263 VVDNAKHVLDDINDTHDQFIWEIRLGQKHIAVQPG---MLRFDDEHKPT-------FDTEQNVYVGVLSDDNNGLGVKDM 332 (508) T ss_pred hHhhhHHHHHHHHHHHHHHHHHHHhcccceeechH---HhcCCCCCccc-------cCCCCeeEEeccCCCCCCCceeEe Confidence 55666788888888888877777555555553332 333 2322211 1111111222222221 2346666 Q ss_pred ecCc--CCHHHHHHHHHHHHhhhhcCceeeeeccCcccccccchh---HHHHHHHHHHHHHHHhhhHHHHHHHHHhhc-- Q lcl|NC_019725. 79 NSDI--SGVPEFLSSKMDRIVSLSGIHEIIIKNKNVGGVSASQNT---ALETFYKLVDRKREEDYRPLLEFLLPFIVE-- 151 (237) Q Consensus 79 ~~~l--sGl~dl~~~~~~~iaa~s~iP~t~L~G~sp~GlnatGe~---D~~nyyd~I~~~Qe~~l~p~l~~l~~~i~~-- 151 (237) +.++ .-..+.++.+...+...+|++- --||-..+|.- |+.. ....-|.++..+| ..++.+|++|+..|+. T Consensus 333 ~~~ir~e~~~~~~~~~l~~~~~~~gls~-~~f~~~~~~~~-TAtei~s~~~~~~~t~~~~~-~~~~~al~~lv~~il~l~ 409 (508) T protein:vir:15 333 TTPIRTVQYKDAIDHFIKEFEVQIGLST-GTFSYSNDGVK-TATEVVSNNSMTYQTRSSYL-TMVEKAIDELCQSIFELA 409 (508) T ss_pred ecccChHHHHHHHHHHHHHHHHHhCCCc-hhcccccCccc-cHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHH Confidence 5554 3356667777777777788874 45666666653 5533 3455667777776 5688888888777541 Q ss_pred --------------------CCCceeEeCCCCCCCHHHHHHHHHHHHHHHHHHHhCCCCCHHHHHHHHHhhccccccCCC Q lcl|NC_019725. 152 --------------------EEEWSIEFEPLSVPSKKEESEITKNNVESVTKAITEQIIDLEEARDTLRSIAPEFKLKDG 211 (237) Q Consensus 152 --------------------s~~~~~~f~pL~~~seke~Aei~~~~A~a~~~~~~~g~i~~~e~r~~l~~~~~~~g~~~~ 211 (237) ..+++|.|+.-..+++.++ ++.+..++.+|++|.++++..+ +|+.+ T Consensus 410 ~~~~~~~~g~~~~~~~~~~~~~~v~v~f~D~i~~d~~~~-------~~~~~~~v~aGi~s~e~~i~~~------~g~~d- 475 (508) T protein:vir:15 410 NAGALFDDGKPLFTLDSASQPLDIECHFDDGVFVNKDKQ-------LEEDAKVLAIGALSKQTFLQRN------YGMTD- 475 (508) T ss_pred HHhccccccccccccccccCCcceEEEeCCCCCCCHHHH-------HHHHHHHHhcCCCCHHHHHHhc------CCCCh- Confidence 1257789999888887664 3455667788888887766432 23221 Q ss_pred CCCChhcc----ccCCCC---C--CCCCCCcCcC Q lcl|NC_019725. 212 NNINIREP----EETTEP---E--PGLGEKLEDE 236 (237) Q Consensus 212 ~~~~~~~~----e~~~e~---~--~~~~~~~~~e 236 (237) ... .++. ++.+++ + -++.++.++| T Consensus 476 eea-~~el~ri~~E~~~~~~~~~~~~~~~g~~ge 508 (508) T protein:vir:15 476 EQA-AEELAKIQSEAPTDTFEGGRSAILNGGDGE 508 (508) T ss_pred HHH-HHHHHHHHHhccccCccccccccCCCCCCC Confidence 000 0111 111111 1 1223333444 No 173 >protein:vir:78641 Length: 278 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1855 # MgeName: tp310-2 # Cross-refs: genbank:acc:YP_001429941;genbank:gi:156603995;genbank:GeneID:5525387 Probab=96.82 E-value=0.00018 Score=41.10 Aligned_cols=149 Identities=7% Similarity=-0.005 Sum_probs=77.3 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHhccceeechhHHHhhcCCchHHHHHHHHHHHHHhcCchheeeeecCCcceeeeec Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRRKQQAVWKVKGLAEMCDDDDAQYAARLRLAQVDDNSGVGRAIGIDAETEEYDVLNS 80 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~~~~~v~k~~~l~~~~~~~~~e~~~~~r~~~~~~~r~~~~~~~iD~~~e~~~~~~~ 80 (237) -+..+...+.....+......-.....-.+++.++ .+ +++....++++++ +...+..+++++++ +.+|+.++. T Consensus 120 ~~~~~~~~i~~~~~~~~~~~~~~~~~~~~i~~~~~---~l-~~e~~~~~~~~~~--~~~~~~g~~~vl~~-g~~~~~l~~ 192 (278) T protein:vir:78 120 PIDVLKNTTDFDNAVRTFNLTEMQKPDSFMLKYGS---NV-GKEKRQQVLEDFK--QYYEENGGILFQEP-GVEIEPLPK 192 (278) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhcCCCcEEEEeCC---CC-CHHHHHHHHHHHH--HHhccCCCceecCC-CceEEEccC Confidence 34445454444444333322111111122333332 11 1222334555554 33444555666665 578888877 Q ss_pred CcCCHH--HHHHHHHHHHhhhhcCceeeeeccCcccccccchhHHHHHHHHHHHHHHHhhhHHHHHHHHHhhc----C-- Q lcl|NC_019725. 81 DISGVP--EFLSSKMDRIVSLSGIHEIIIKNKNVGGVSASQNTALETFYKLVDRKREEDYRPLLEFLLPFIVE----E-- 152 (237) Q Consensus 81 ~lsGl~--dl~~~~~~~iaa~s~iP~t~L~G~sp~GlnatGe~D~~nyyd~I~~~Qe~~l~p~l~~l~~~i~~----s-- 152 (237) +..... +......+.||.+-||| ..++|...++=.++-+.-.+.||..+ ++|.++++-..+-+ . T Consensus 193 ~~~d~~~~e~~~~~~~~Ia~~fgVp-p~~lg~~~~~~~sn~~~~~~~~~~~~-------l~P~~~~i~~~ln~~L~~~~e 264 (278) T protein:vir:78 193 KYVSEDIVASENLTRERVANVFQLP-SVFLNARSNTNFAKNEELNRFYLQHT-------LLPIVKQYEEEFNRKLLTKTD 264 (278) T ss_pred ChhHHHHHHHHHHHHHHHHHHhCCC-HHHhCCCCCCCcccHHHHHHHHHHHH-------HHHHHHHHHHHHHhhcCChhH Confidence 665443 55667889999999999 45556554432234344455666544 78877777655432 1 Q ss_pred --CCceeEeCCCCCC Q lcl|NC_019725. 153 --EEWSIEFEPLSVP 165 (237) Q Consensus 153 --~~~~~~f~pL~~~ 165 (237) .++.|+|+ +..+ T Consensus 265 ~~~g~~~~f~-~~~l 278 (278) T protein:vir:78 265 REKIGILNLT-LNLI 278 (278) T ss_pred hcCCceEEEe-cccC Confidence 24556655 3333 No 174 >protein:vir:99232 Length: 526 # NCBI annotation: putative portal protein # Family: family:all:313 # MgeID: mge:1649 # MgeName: DMS3 # Cross-refs: genbank:acc:YP_950451;genbank:gi:119953652;genbank:GeneID:4643092 Probab=96.79 E-value=0.00034 Score=39.60 Aligned_cols=222 Identities=15% Similarity=0.054 Sum_probs=95.6 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHhc--cceeechhHHHhhcCCchHHHHHHHHHHHHHhcCchheeeeecCCcceeee Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRRKQ--QAVWKVKGLAEMCDDDDAQYAARLRLAQVDDNSGVGRAIGIDAETEEYDVL 78 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~~~--~~v~k~~~l~~~~~~~~~e~~~~~r~~~~~~~r~~~~~~~iD~~~e~~~~~ 78 (237) |+..||-.+.--..+..--+..+.++. +.+.|++. +..+.....=++.+..+.+ .+.++|- ++.+++.+ T Consensus 205 Llr~~~w~~~fK~~~~~~w~~f~E~yG~P~~igky~~-------~a~~~ek~~L~~av~~i~~-d~~~iiP-~~~~ie~~ 275 (526) T protein:vir:99 205 LFRVLAWPYLFRHYATSDLAEMLEIYGLPIRLGKYPP-------GTADEEKATLLRAVTGLGH-AAAGIIP-ETMAIDFQ 275 (526) T ss_pred hHHHHHHHHHHHHhhHHHHHHHHHHcCCceEEEecCC-------CCCHHHHHHHHHHHHHHhh-CcEEEec-CCceeEEe Confidence 777766655444445555556777777 45666641 1112222222333444433 3445554 46889998 Q ss_pred ecCcCCH---HHHHHHHHHHHhhh-hcCceeeeeccCcccccccchhHHHHHHHHHHHHHHHhhhHHH-HHHHHHhhcCC Q lcl|NC_019725. 79 NSDISGV---PEFLSSKMDRIVSL-SGIHEIIIKNKNVGGVSASQNTALETFYKLVDRKREEDYRPLL-EFLLPFIVEEE 153 (237) Q Consensus 79 ~~~lsGl---~dl~~~~~~~iaa~-s~iP~t~L~G~sp~GlnatGe~D~~nyyd~I~~~Qe~~l~p~l-~~l~~~i~~s~ 153 (237) +..=+|. ..+++..-..||-+ .|--+|--.|..-+|=+|-|+--.....+.+.+-... +...| +.|+.-++.-. T Consensus 276 ea~~~~~~~f~~li~~~d~~Isk~iLGqtlTs~~~~g~~gS~a~g~vh~~v~~di~~aDa~~-i~~tln~~Li~~l~~~N 354 (526) T protein:vir:99 276 QAAQGSSEPFLAMMRQSEDAISKAVLGGTLTSTTSQSGGGAFALGQVHNEVRHDLLASDARQ-LAATLSRDLLWPLLVLN 354 (526) T ss_pred ecCCCCHHHHHHHHHHHHHHHHHHHhhhhhccccccCcchhhhHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHhC Confidence 8764443 22444444555532 2222221111111122233443344555555555433 33334 33555544221 Q ss_pred ---CceeEeCCCCCCCHHHHHHHHHHHHHHHHHHHhCCC-CCHHHHHHHHHhhccccccCCCCCCChhccc-cCCCCCCC Q lcl|NC_019725. 154 ---EWSIEFEPLSVPSKKEESEITKNNVESVTKAITEQI-IDLEEARDTLRSIAPEFKLKDGNNINIREPE-ETTEPEPG 228 (237) Q Consensus 154 ---~~~~~f~pL~~~seke~Aei~~~~A~a~~~~~~~g~-i~~~e~r~~l~~~~~~~g~~~~~~~~~~~~e-~~~e~~~~ 228 (237) .-.+..-|-.....+|..++ +..|++++.+++.|+ |+.+.+++.+.--.+ .++..+-..... ..+...++ T Consensus 355 ~~~~~~~~~~p~~~~~~~e~eDl-~~~a~~~~~L~~~G~~i~~~~i~e~~Gip~~----~~~e~~l~~~~~~~~~~~~~~ 429 (526) T protein:vir:99 355 RPGSPDVRRAPRLVFDLREQADI-TSMAQSIPALVNVGLEIPSAWVYDKLGIPQP----AKNEPVLRSAAQPAILSRQHG 429 (526) T ss_pred CCCcCCccccceEEeCCCCcccH-HHHHHHHHHHHhCCCccCHHHHHHHhCCCCC----CCcccccCCCCCCcccccccc Confidence 00111222233333344444 457999999999998 899999987631000 000000000000 00000000 Q ss_pred CC-----CCcCcCC Q lcl|NC_019725. 229 LG-----EKLEDEN 237 (237) Q Consensus 229 ~~-----~~~~~e~ 237 (237) .. +.....+ T Consensus 430 ~~~~~~~~~~~~~~ 443 (526) T protein:vir:99 430 QRVAALATIVGPRY 443 (526) T ss_pred cccccccccccccC Confidence 00 0000000 No 175 >protein:vir:108215 Length: 469 # NCBI annotation: gp6 # Family: family:all:2372 # MgeID: mge:2004 # MgeName: Giles # Cross-refs: genbank:acc:YP_001552335;genbank:gi:160700655;genbank:GeneID:5758935 Probab=96.77 E-value=0.00035 Score=39.53 Aligned_cols=210 Identities=11% Similarity=0.073 Sum_probs=96.8 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHhc--cceeechhHHHhhcCCchHHHHHHHHHHHHHhcCc-hheeeeecCCcceee Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRRKQ--QAVWKVKGLAEMCDDDDAQYAARLRLAQVDDNSGV-GRAIGIDAETEEYDV 77 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~~~--~~v~k~~~l~~~~~~~~~e~~~~~r~~~~~~~r~~-~~~~~iD~~~e~~~~ 77 (237) |+..||-.+.--..+...-+..+.++. +.|.|++. +..+.....=++++..++.. ...++| .++.+++. T Consensus 220 Llr~~~~~~~fK~~~~~~w~~f~EryG~P~~vgky~~-------~a~~~ek~~l~~a~~~~~~g~~a~~ii-p~~~~ie~ 291 (469) T protein:vir:10 220 ILRSAYKHWLLKDKLLRIEAATAERNGMGIPVGTASS-------ATDEDEVRKMAALARSVRGGINAGVGL-AQGQILEL 291 (469) T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHcCCcceEEecCC-------CCCHHHHHHHHHHHHHHhcCCceEEEc-cCCceEEE Confidence 888888775555556666667777766 55666542 11111122222333333322 333445 45678888 Q ss_pred eecCcCCH--HHHHHHHHHHHhhh-hcCceeeeeccCcccccccchhHHHHHHHHHHHHHHHhhhHHHH-HHHHHhhcC- Q lcl|NC_019725. 78 LNSDISGV--PEFLSSKMDRIVSL-SGIHEIIIKNKNVGGVSASQNTALETFYKLVDRKREEDYRPLLE-FLLPFIVEE- 152 (237) Q Consensus 78 ~~~~lsGl--~dl~~~~~~~iaa~-s~iP~t~L~G~sp~GlnatGe~D~~nyyd~I~~~Qe~~l~p~l~-~l~~~i~~s- 152 (237) ++.+-++- ..+++..-..|+-+ .|--+| .++-||=.|.|+.-...+.+.+.+.... +...|. .|+.-++.- T Consensus 292 ~ea~g~~~~~~~li~~~d~~Isk~iLG~tlT---s~~~gGS~a~~~vh~ev~~d~~~sDa~~-i~~tln~~li~~l~~lN 367 (469) T protein:vir:10 292 LGVSGNLPDIRRAIEGHDRSIALSGLAHFLN---LDGKGGSYALASVLEDPFTQAVHAYATS-ICRIANQHIIEDLVDIN 367 (469) T ss_pred eecCCCchHHHHHHHHHHHHHHHHHhccccc---ccCccchhhHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHhc Confidence 88764432 22555555555532 222222 2333343345665666677766666543 444443 455544321 Q ss_pred ---C--CceeEeCCCCCCCHHHHHHHHHHHHHHHHHHHhCCCCC-----HHHHHHHHHhhccccccCC---CCCCChhcc Q lcl|NC_019725. 153 ---E--EWSIEFEPLSVPSKKEESEITKNNVESVTKAITEQIID-----LEEARDTLRSIAPEFKLKD---GNNINIREP 219 (237) Q Consensus 153 ---~--~~~~~f~pL~~~seke~Aei~~~~A~a~~~~~~~g~i~-----~~e~r~~l~~~~~~~g~~~---~~~~~~~~~ 219 (237) . -..|+|...- +..+..|++++.++++|++. .+.+++.+ |+.. +..+..... T Consensus 368 ~g~~~~~P~~~~~~~e--------~~~~~~a~~i~~l~~~G~~~~~~~~~~~~~e~~-------gip~~~~~~~~~~~~~ 432 (469) T protein:vir:10 368 FGVDTPAPVLTFDPIG--------SRQDLTAAAVKLLYDAGVFDDDPAVKRAIRQRF-------NLPSELNDTPSAEPEE 432 (469) T ss_pred CCCCCCccEEEecCCC--------CcHHHHHHHHHHHHhcCCccCccccHHHHHHHh-------CCCCCCCCcccccchh Confidence 1 1356664322 12245689999999999954 44455543 2211 111100000 Q ss_pred ccCCCCCC--CCCCCcCcCC Q lcl|NC_019725. 220 EETTEPEP--GLGEKLEDEN 237 (237) Q Consensus 220 e~~~e~~~--~~~~~~~~e~ 237 (237) +.++...+ +.+....+.. T Consensus 433 ~~~~~~~~~~~~~~~~~~~~ 452 (469) T protein:vir:10 433 PAAVPNQSAAPARTRSSGNA 452 (469) T ss_pred cccCCCCCccccccCCCCCc Confidence 01100000 0011101111 No 176 >protein:vir:4782 Length: 522 # NCBI annotation: putative minor capsid protein 1 # Family: family:all:898 # MgeID: mge:104 # MgeName: MM1 # Cross-refs: genbank:acc:NP_150162;swissprot:trembl:q94m49;genbank:gi:26553451;uniprot:Q94M49;genbank:GeneID:955983 Probab=96.69 E-value=9.8e-05 Score=42.57 Aligned_cols=214 Identities=13% Similarity=0.052 Sum_probs=95.8 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHhccceeechhHHHhhc---CCchHH-HHHHHHHHHHHhcCchheeee-ecCCcce Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRRKQQAVWKVKGLAEMCD---DDDAQY-AARLRLAQVDDNSGVGRAIGI-DAETEEY 75 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~~~~~v~k~~~l~~~~~---~~~~e~-~~~~r~~~~~~~r~~~~~~~i-D~~~e~~ 75 (237) ....+.+.++..+.+......=+...+..++ ++ ..++. ++.+.. .....+.. -+..+..+-. +.++..+ T Consensus 272 ~~~~~~~~id~lD~~~s~~~~e~~~g~~~i~-v~--~~~l~~~~~~~~g~~~~~~~fd~---~~~~f~~~~~~~~~~~~i 345 (522) T protein:vir:47 272 IFDNAKTTIDFINRSYDEFMWEVRMGQRRVI-VP--EHLTQRQYQRPDGTIDFRPRFDV---EQNVYMQIGGSSMDAGGI 345 (522) T ss_pred hhhhhHHHHHHHHHHHHHHHHHHHhccceee-cc--hHHhccCCCCCCcccccccccCc---ccceEeecCCCCCCCCcc Confidence 4555666677777666555554443333333 21 12322 111111 01111110 0111111111 1222345 Q ss_pred eeeecCc--CCHHHHHHHHHHHHhhhhcCceeeeeccCcccccccch---hHHHHHHHHHHHHHHHhhhHHHHHHHHHhh Q lcl|NC_019725. 76 DVLNSDI--SGVPEFLSSKMDRIVSLSGIHEIIIKNKNVGGVSASQN---TALETFYKLVDRKREEDYRPLLEFLLPFIV 150 (237) Q Consensus 76 ~~~~~~l--sGl~dl~~~~~~~iaa~s~iP~t~L~G~sp~GlnatGe---~D~~nyyd~I~~~Qe~~l~p~l~~l~~~i~ 150 (237) +.++..+ .-+...+..+...++-..|++-. -||-..+|.- |.. +..+.-|.+++.+| ..++.+|++|+..|+ T Consensus 346 ~~~~~~ir~e~~~~~~~~~l~~i~~~~gls~~-tf~~~~~~~k-TAtEi~s~~~~~~~t~~~~~-~~~~~al~~lv~~i~ 422 (522) T protein:vir:47 346 TDLTSPIRANDYILAISEGLKLFEMQIGVSSG-MFTFDGQGMK-TATEIVSENSDTYQMRSSIV-ALVEQSIKELCVSMC 422 (522) T ss_pred eeeccccChHHHHHHHHHHHHHHHHHhCCCcc-ccCccccccc-cHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHH Confidence 5555544 23445566666666766666543 3444444432 332 23455677777777 558888888877764 Q ss_pred --------------cCCCceeEeCCCCCCCHHHHHHHHHHHHHHHHHHHhCCCCCHHHHHHHHHhhccccccCCCC---- Q lcl|NC_019725. 151 --------------EEEEWSIEFEPLSVPSKKEESEITKNNVESVTKAITEQIIDLEEARDTLRSIAPEFKLKDGN---- 212 (237) Q Consensus 151 --------------~s~~~~~~f~pL~~~seke~Aei~~~~A~a~~~~~~~g~i~~~e~r~~l~~~~~~~g~~~~~---- 212 (237) ...+++|.|+.--.+++.++++ .+..++.+|+++.++++..+ +|+.+.. T Consensus 423 ~l~~~~~~~~~~~~~~~~i~v~f~D~i~~D~~~~~~-------~~~~~v~aG~~s~e~~i~~~------~g~~eeea~~e 489 (522) T protein:vir:47 423 ELGKAVGVYSGEIPELDDISVNLDDGVFTDRHAELD-------YWAKMVAAGFSTKKRAIGKT------LNISGVEAEKE 489 (522) T ss_pred HHHhhhhhccCCCCCcceeEEEcCCCCCCCHHHHHH-------HHHHHHhcCCCCHHHHHHhc------CCCChHHHHHH Confidence 1136889999988888765543 34455667777776655432 1221100 Q ss_pred --CCChhccccCCCCCCCCCCCc-CcCC Q lcl|NC_019725. 213 --NINIREPEETTEPEPGLGEKL-EDEN 237 (237) Q Consensus 213 --~~~~~~~e~~~e~~~~~~~~~-~~e~ 237 (237) .+..|..+..+ ++++.+..+ +.+. T Consensus 490 l~ri~~E~~~~~~-~~~~~~~~~~~~~~ 516 (522) T protein:vir:47 490 LNAINSELLPMND-AELAIYGMHDQNEE 516 (522) T ss_pred HHHHHHhhccCCC-CCCCCCCCCCcccc Confidence 00000000011 111111111 1111 No 177 >protein:vir:96738 Length: 505 # NCBI annotation: putative phage-related protein # Family: family:all:47 # MgeID: mge:1628 # MgeName: VP882 # Cross-refs: genbank:acc:YP_001039817;genbank:gi:126010916;genbank:GeneID:5076248 Probab=96.69 E-value=0.00041 Score=39.16 Aligned_cols=214 Identities=9% Similarity=0.031 Sum_probs=107.5 Q ss_pred CchhHHHHHHH---HHHHHHHHHHHHHHhccceeechhHHHhhcC-CchHHHHHHHHHHHHHhcCchheeeeecCCccee Q lcl|NC_019725. 1 MNKSLIDAICD---YDYCESLATQILRRKQQAVWKVKGLAEMCDD-DDAQYAARLRLAQVDDNSGVGRAIGIDAETEEYD 76 (237) Q Consensus 1 llq~~~d~v~~---~~~~~~~~~~Ll~~~~~~v~k~~~l~~~~~~-~~~e~~~~~r~~~~~~~r~~~~~~~iD~~~e~~~ 76 (237) .+-.++..+++ |..+.--.+ .+...-.-++|.+. ...... .+.......+ -.-|++..-..+|++. T Consensus 254 ~lapvl~~l~~l~~y~dael~~a-~i~A~~a~fi~~~~-~~~~~~~~~~~~~~~~~--------l~pG~i~~L~pGe~i~ 323 (505) T protein:vir:96 254 WTHASMVELHHIGEYRKSEMIAA-ELGAKKVGFYEQDP-EAYDQPPEDDQGEIVEE--------VEAGTYQLLPYGIRFK 323 (505) T ss_pred hHHHHHHHHHHHhHHHHHHHHHH-HHhhhheeeeecCC-ccCCCccccccCccccc--------cCCceeeecCCCCeee Confidence 44444444444 333333333 34444445566642 211111 1101111111 1244555556678888 Q ss_pred eeecC--cCCHHHHHHHHHHHHhhhhcCceeeeeccCc-ccccccchhHHHHHHHHHHHHHHHhhhHHHHHH----HHHh Q lcl|NC_019725. 77 VLNSD--ISGVPEFLSSKMDRIVSLSGIHEIIIKNKNV-GGVSASQNTALETFYKLVDRKREEDYRPLLEFL----LPFI 149 (237) Q Consensus 77 ~~~~~--lsGl~dl~~~~~~~iaa~s~iP~t~L~G~sp-~GlnatGe~D~~nyyd~I~~~Qe~~l~p~l~~l----~~~i 149 (237) .++.+ -++..+.+..+...||+..|||--.|.|--. ...+ +.-..+.-+...++..|...+.+++..+ ++.. T Consensus 324 ~~~~~~p~~~~~~f~~~~lr~iaaglgi~ye~lt~D~s~~nYS-S~R~~~~e~~r~~~~~q~~~~~~~~~pi~~~~l~~a 402 (505) T protein:vir:96 324 EHKIDHPHTNFGAFVKSSLRGVAAGMGPAYNRLAHDLEGVNFS-SLRSGELDERDLYKLLQFFVVTELLERVAGNLISMS 402 (505) T ss_pred eeCCCCCCCCHHHHHHHHHHHHHhhcCCCHHHHhcccccccHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 88776 4689999999999999999999999988532 2343 4456777888899999987655444444 3333 Q ss_pred hcCC-------------CceeEeCCCCCCCHHHHHHHHHHHHHHHHHHHhCCCCCHHHHHHH-----------HH---hh Q lcl|NC_019725. 150 VEEE-------------EWSIEFEPLSVPSKKEESEITKNNVESVTKAITEQIIDLEEARDT-----------LR---SI 202 (237) Q Consensus 150 ~~s~-------------~~~~~f~pL~~~seke~Aei~~~~A~a~~~~~~~g~i~~~e~r~~-----------l~---~~ 202 (237) +... ...|..+..-..++ .|.+++....+++|+.|..++..+ +. .. T Consensus 403 ~l~G~i~~p~~~~~~~~~~~w~~p~~~~iDP-------~Ke~~a~~~~i~~G~~t~~~~~a~~G~D~~~v~~q~a~e~~~ 475 (505) T protein:vir:96 403 LLTQALPLNMVDIDRLSQYAFQPRGWDWVDP-------AKDSKAHSESIKNRTRSRSSIIRAAGDDPEDVFDEIAWEEQL 475 (505) T ss_pred HHcCCcCCCCccchhhceeeeccCCccccCh-------HHHHHHHHHHHHcCCCCHHHHHHHcCCCHHHHHHHHHHHHHH Confidence 2221 12222223223333 356666777777777766655433 21 12 Q ss_pred ccccccCCCCCCChhccccCCCCCCCCCCC Q lcl|NC_019725. 203 APEFKLKDGNNINIREPEETTEPEPGLGEK 232 (237) Q Consensus 203 ~~~~g~~~~~~~~~~~~e~~~e~~~~~~~~ 232 (237) ..+.|+.+............++++.+.+++ T Consensus 476 ~~~~Gl~~~~~~~~~~~~~~~~~~~~~~d~ 505 (505) T protein:vir:96 476 MRDKGVNPTPPEQESKDATTDEEDDSASDD 505 (505) T ss_pred HHHcCCCCCCCCCCCCCCCCCCCCCCCCCC Confidence 234554322111111111111111111111 No 178 >protein:vir:79063 Length: 491 # NCBI annotation: gp3 # Family: family:all:313 # MgeID: mge:1862 # MgeName: phiE255 # Cross-refs: genbank:acc:YP_001111203;genbank:gi:134288841;genbank:GeneID:4960737 Probab=96.55 E-value=0.00052 Score=38.57 Aligned_cols=208 Identities=13% Similarity=0.077 Sum_probs=101.8 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHhcc--ceeechhHHHhhcCCchHHHHHHHHHHHHHhcCchheeeeecCCcceeee Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRRKQQ--AVWKVKGLAEMCDDDDAQYAARLRLAQVDDNSGVGRAIGIDAETEEYDVL 78 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~~~~--~v~k~~~l~~~~~~~~~e~~~~~r~~~~~~~r~~~~~~~iD~~~e~~~~~ 78 (237) |+..||..+.--..+...-+..+.++.+ .+.|++.- .. +....+=++.+..+.+ .+.++|. ++.+++.+ T Consensus 194 Ll~~~~w~~~fK~~~~~~w~~f~E~~G~P~~igky~~~-----a~--~~ek~~l~~al~~~~~-~a~~viP-~~~~ie~~ 264 (491) T protein:vir:79 194 DLSMCFWPTTFKKGGLKFWVQFTEKYGSPMLVGKHPRS-----AS--DAETNLLLDRLEDMVQ-DAVAVIP-DDSSIEIK 264 (491) T ss_pred hHHHHHHHHHHHHhhHHHHHHHHHHcCCCeEEEecCCC-----CC--HHHHHHHHHHHHHHhc-CeEEEec-CCceeEEE Confidence 8888777666556666667777888775 56666410 11 1112222333333333 3445554 46889998 Q ss_pred ecCc-CCH----HHHHHHHHHHHhh-hhcCceeeeeccCcccccccchhHHHHHHHHHHHHHHHhhhHHHHHHHHHhhcC Q lcl|NC_019725. 79 NSDI-SGV----PEFLSSKMDRIVS-LSGIHEIIIKNKNVGGVSASQNTALETFYKLVDRKREEDYRPLLEFLLPFIVEE 152 (237) Q Consensus 79 ~~~l-sGl----~dl~~~~~~~iaa-~s~iP~t~L~G~sp~GlnatGe~D~~nyyd~I~~~Qe~~l~p~l~~l~~~i~~s 152 (237) .... +|- ..+++..-..|+- ..|--+| .. .+|=.|.|+--.....+.+.+-... +...+.+|+.-++.- T Consensus 265 ea~~~~g~~~~y~~li~~~d~~Isk~iLGqtlT---t~-~~gs~a~~~vh~~v~~~i~~~D~~~-i~~tln~li~~l~~~ 339 (491) T protein:vir:79 265 EAAGKSGSADVYERLLHFCRGEVSIALLGQNQT---TE-ATSTRASAQAGLEVTDDIRDGDKAI-VVEAMNMLIRWICDL 339 (491) T ss_pred eccCCCCChhHHHHHHHHHHHHHHHHHhhhhhc---cC-cccchhhHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHh Confidence 7763 443 3455555666663 2232222 11 2333455655556666666655433 444455565555422 Q ss_pred -----CCceeEeCCCCCCCHHHHHHHHHHHHHHHHHHHhCCC-CCHHHHHHHHHhhccccccCCCCCCChhccccCCCC- Q lcl|NC_019725. 153 -----EEWSIEFEPLSVPSKKEESEITKNNVESVTKAITEQI-IDLEEARDTLRSIAPEFKLKDGNNINIREPEETTEP- 225 (237) Q Consensus 153 -----~~~~~~f~pL~~~seke~Aei~~~~A~a~~~~~~~g~-i~~~e~r~~l~~~~~~~g~~~~~~~~~~~~e~~~e~- 225 (237) +...|.| .+.| ++.+..|++++++++.|+ |+.+.+++.+ |+..........+...+.. T Consensus 340 N~~~~~~p~f~~------~e~e--e~~~~~a~~~~~L~~~G~~i~~~~~~e~~-------Gip~~~~~e~~~~~~~~~~~ 404 (491) T protein:vir:79 340 NFDGAARPVFDM------WEQE--QVDEIQAGRDEKLTRAGARFTPAYFKRAY-------NLQDGDLDERPLPVSAVDAV 404 (491) T ss_pred cCCCCCcceEee------cCcC--chhHHHHHHHHHHHhCCCccCHHHHHHHh-------CCCCCCCCccccCcCccccc Confidence 2223333 2222 445678899999999997 7888888765 2211110000000000000 Q ss_pred -CCCCCCCcCcCC Q lcl|NC_019725. 226 -EPGLGEKLEDEN 237 (237) Q Consensus 226 -~~~~~~~~~~e~ 237 (237) .....+...+.. T Consensus 405 ~~~~~~~~~~~~~ 417 (491) T protein:vir:79 405 GAASFAEFEAPDQ 417 (491) T ss_pred ccccccccCCCCC Confidence 000011111111 No 179 >protein:vir:98883 Length: 517 # NCBI annotation: portal # Family: family:all:898 # MgeID: mge:1568 # MgeName: BCJA1c # Cross-refs: genbank:acc:YP_164413;genbank:gi:56694903;genbank:GeneID:3197273 Probab=96.50 E-value=0.00055 Score=38.45 Aligned_cols=210 Identities=12% Similarity=0.092 Sum_probs=104.6 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHhccceeechhHHHhhc---CCchHHHHHHHHHHHHHhcCchheeeeecCCcceee Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRRKQQAVWKVKGLAEMCD---DDDAQYAARLRLAQVDDNSGVGRAIGIDAETEEYDV 77 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~~~~~v~k~~~l~~~~~---~~~~e~~~~~r~~~~~~~r~~~~~~~iD~~~e~~~~ 77 (237) ....+.+.++..+.+......-+...+..++ ++ ..++. ++.+... -..+..-...+..+..+..+.-|+. T Consensus 274 ~~~~a~~~~d~lD~~~s~~~~e~~~g~~~i~-vp--~~~l~~~~~~~g~~~----~~~~d~~~~~y~~~~~~~~~~~i~~ 346 (517) T protein:vir:98 274 ITDNSVSTLKKINDTYDQFWWEIKMGQRTVF-VS--DVMLRTVPDESGMPP----PQVFDPDVNVYKSIRMGTDEEFVKD 346 (517) T ss_pred hhhhhHHHHHHHHHHHHHHHHHHHhCCccee-cC--hhhhccccCCCCccc----CCCCCcccceeeeccCCCCCCceee Confidence 5556667777777666665554444333333 21 22332 1111100 0000101111122222222344555 Q ss_pred eecCc--CCHHHHHHHHHHHHhhhhcCceeeeeccCcccccccch---hHHHHHHHHHHHHHHHhhhHHHHHHHHHhhc- Q lcl|NC_019725. 78 LNSDI--SGVPEFLSSKMDRIVSLSGIHEIIIKNKNVGGVSASQN---TALETFYKLVDRKREEDYRPLLEFLLPFIVE- 151 (237) Q Consensus 78 ~~~~l--sGl~dl~~~~~~~iaa~s~iP~t~L~G~sp~GlnatGe---~D~~nyyd~I~~~Qe~~l~p~l~~l~~~i~~- 151 (237) ++..+ ......++.+.+.++..+|+|-.- ||...+|+- |.. +..+.-|.+++++| ..++.+|++|+..++. T Consensus 347 ~~~~iR~e~~~~~~~~~L~~i~~~~Gls~~t-~~~~~~~~k-TATEi~s~~~~~~~t~~~~~-~~~~~aL~~lv~~i~~l 423 (517) T protein:vir:98 347 VTHDIRTEQYKEAINQALRTLEMELKLSVGT-FSFDGRSMK-TATEIVSENDLTYRTRNDHV-YEVEQFIKGLVISVLEL 423 (517) T ss_pred eccccchHHHHHHHHHHHHHHHHHhCCCccc-ccccccccc-cHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHH Confidence 55444 345666777888888888888654 455556653 432 23455677888777 5588889988877631 Q ss_pred -------------CCCceeEeCCCCCCCHHHHHHHHHHHHHHHHHHHhCCCCCHHHHHHH------------HHhhcccc Q lcl|NC_019725. 152 -------------EEEWSIEFEPLSVPSKKEESEITKNNVESVTKAITEQIIDLEEARDT------------LRSIAPEF 206 (237) Q Consensus 152 -------------s~~~~~~f~pL~~~seke~Aei~~~~A~a~~~~~~~g~i~~~e~r~~------------l~~~~~~~ 206 (237) ..+++|.|.+-...+.++.++.. ..++.+|+++..+++.. ++....+. T Consensus 424 ~~~~~~~~~~~~~~~~v~v~f~D~i~~D~~~~~~~~-------~~~v~aG~ms~~~~i~~~~g~~eeeA~~e~~~i~~E~ 496 (517) T protein:vir:98 424 AKTYKLFGGEIPSAEHIGVDFDDGVFQDRSALLRFY-------GQAKTFGFIPTVEAIQRIFKVPKKTAEQWLEEIRKDQ 496 (517) T ss_pred HHHHhhcCCCCCCCcceEEEcCCCCCCCHHHHHHHH-------HHHHhcCCCCHHHHHHHhCCCChHHHHHHHHHHHHhc Confidence 13588999999888877766544 34566666666655443 32222111 Q ss_pred ccCCCCCCChhccccCCCCCCCCCCCc Q lcl|NC_019725. 207 KLKDGNNINIREPEETTEPEPGLGEKL 233 (237) Q Consensus 207 g~~~~~~~~~~~~e~~~e~~~~~~~~~ 233 (237) ...+.. ....+..++-.|+++ T Consensus 497 ~~~~~~------~~~~~~~~~~~gd~e 517 (517) T protein:vir:98 497 IELDPV------TISQRAQKRMFGDEE 517 (517) T ss_pred cccCCC------CccccccCCCCCCCC Confidence 111000 000011111222222 No 180 >protein:vir:79538 Length: 502 # NCBI annotation: putative portal protein # Family: family:all:47 # MgeID: mge:1871 # MgeName: cdtI # Cross-refs: genbank:acc:YP_001272517;genbank:gi:148609386;genbank:GeneID:5204374 Probab=96.29 E-value=0.00078 Score=37.64 Aligned_cols=219 Identities=11% Similarity=0.018 Sum_probs=108.8 Q ss_pred CchhHHHHHHH---HHHHHHHHHHHHHHhccceeechhHHHhhcCCchHHHHHHHHHHHHHhcCchheee-eecCCccee Q lcl|NC_019725. 1 MNKSLIDAICD---YDYCESLATQILRRKQQAVWKVKGLAEMCDDDDAQYAARLRLAQVDDNSGVGRAIG-IDAETEEYD 76 (237) Q Consensus 1 llq~~~d~v~~---~~~~~~~~~~Ll~~~~~~v~k~~~l~~~~~~~~~e~~~~~r~~~~~~~r~~~~~~~-iD~~~e~~~ 76 (237) .+-.+...+.+ |..+. .++..+...-..++|++.-........+...-.....+ ..|+++ .-..++++. T Consensus 241 ~lapvl~~l~~l~~~~dae-l~~a~i~A~~~~fi~~~~~~~~~~~~~~~~~~~~~~~l------~pG~i~~~L~pGe~i~ 313 (502) T protein:vir:79 241 LLSGVLIRLSALKEYEDSE-LTAARIAAALGMYIRKGDGQSYEPDGNGSKENERELTI------QPGIIYDDLKPGEEIG 313 (502) T ss_pred hHHHHHHHHHHHhHHHHHH-HHHHHHhhhheeeeecCCCcccccccCCCCCccccccc------cCCccccccCCCceee Confidence 44444444444 44333 33334444445566664311111111000000011111 135433 335578888 Q ss_pred eeecC--cCCHHHHHHHHHHHHhhhhcCceeeeeccCcccccccchhHHHHHHHHHHHHHHHhhhHHHHHH----HHHhh Q lcl|NC_019725. 77 VLNSD--ISGVPEFLSSKMDRIVSLSGIHEIIIKNKNVGGVSASQNTALETFYKLVDRKREEDYRPLLEFL----LPFIV 150 (237) Q Consensus 77 ~~~~~--lsGl~dl~~~~~~~iaa~s~iP~t~L~G~sp~GlnatGe~D~~nyyd~I~~~Qe~~l~p~l~~l----~~~i~ 150 (237) .++.+ -++..+.+..+...||+..|||--.|.|-.-+..+ +.-..+.-+...++..|+..+..++..+ ++..+ T Consensus 314 ~~~p~~p~~~~~~f~~~~lr~iaaglGi~ye~lt~D~s~nyS-s~R~~~~e~~r~~~~~q~~~~~~~~~pi~~~~l~~a~ 392 (502) T protein:vir:79 314 MVKSDRPNPNLETFRNGQLRAVAAGSRLSFSSTARNYNGTYS-AQRQELVESTDGYLILQDWFIGAVTRPMYRAWLKQAV 392 (502) T ss_pred eeCCCCCCCCHHHHHHHHHHHHHhhcCCCHHHHhccccchHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 87754 57899999999999999999999999998644443 4456778888999999986554444444 33333 Q ss_pred cCCC------------ceeEe--CCCCCCCHHHHHHHHHHHHHHHHHHHhCCCCCHHHHH-----------HHHH---hh Q lcl|NC_019725. 151 EEEE------------WSIEF--EPLSVPSKKEESEITKNNVESVTKAITEQIIDLEEAR-----------DTLR---SI 202 (237) Q Consensus 151 ~s~~------------~~~~f--~pL~~~seke~Aei~~~~A~a~~~~~~~g~i~~~e~r-----------~~l~---~~ 202 (237) .... +...| +..-..+. .|.+++....+++|+.|..++. +.+. .. T Consensus 393 l~G~i~~p~~~~~~~~~~~~W~~p~~~~iDP-------~Ke~~a~~~~i~~Gl~t~~~~~a~~G~D~~~v~~q~a~e~~~ 465 (502) T protein:vir:79 393 ASGVIRLPRDLDRSSLYTAVYSGPVMPWIDP-------VKEAEAWKIQIRGGAATESDWVRAGGRNPDDVKRRRKAEIDE 465 (502) T ss_pred HcCCCCCCCCCCchhhcceeeecCCccccCh-------HHHHHHHHHHHHcCCCCHHHHHHHcCCCHHHHHHHHHHHHHH Confidence 2211 12223 33333343 3455666666777766655544 3321 12 Q ss_pred ccccccCCCCCC-----ChhccccCCCCCCCCCCCcC Q lcl|NC_019725. 203 APEFKLKDGNNI-----NIREPEETTEPEPGLGEKLE 234 (237) Q Consensus 203 ~~~~g~~~~~~~-----~~~~~e~~~e~~~~~~~~~~ 234 (237) ..+.|+....+. .-....+..++.++.+++++ T Consensus 466 ~~~~Gl~~~~~~~~~~~~~~~~~~~~e~~~~~~~~e~ 502 (502) T protein:vir:79 466 NRKLDLVFDTDPASDKGGSSAATKRQEPQHTDDQSEE 502 (502) T ss_pred HHHcCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCCC Confidence 234454322111 01111122223333333333 No 181 >protein:vir:9815 Length: 500 # NCBI annotation: putative minor capsid protein # Family: family:all:898 # MgeID: mge:176 # MgeName: 315.4 # Cross-refs: genbank:acc:NP_795577;genbank:gi:28876344;genbank:GeneID:1257866 Probab=96.27 E-value=0.00026 Score=40.21 Aligned_cols=213 Identities=12% Similarity=0.081 Sum_probs=109.3 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHhccceeechhHHHhhc-C---CchHHHHHHHHHHHHHhcCchheee-eecCCcce Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRRKQQAVWKVKGLAEMCD-D---DDAQYAARLRLAQVDDNSGVGRAIG-IDAETEEY 75 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~~~~~v~k~~~l~~~~~-~---~~~e~~~~~r~~~~~~~r~~~~~~~-iD~~~e~~ 75 (237) .+..+.+.+...+.+......-+...+..++=-. .++. + .+++.....++. .-+..+-.+- -++++..+ T Consensus 259 ~~~~~~~lid~lD~~~s~~~~e~~~g~~~i~v~~---~~l~~~~~~~~g~~~~~~~~d---~~~~~~~~~~~~~~~~~~i 332 (500) T protein:vir:98 259 IFDNAKTTIDFINTTYDEFMWEVKMGQRRVAVPE---SLTALTVRTTDGDVVPRPRFE---SDQNVYIRMGGRDLDSSAI 332 (500) T ss_pred hhhhhHHHHHHHHHHHHHHHHHHHhCcceeeech---HHhcccCCCCCccccCCcccC---CCcceEEEcCCCCCcCcce Confidence 6667778888888888888877766555544322 2322 1 111110011111 0111111111 11223446 Q ss_pred eeeecCc--CCHHHHHHHHHHHHhhhhcCceeeeeccCcccccccchh---HHHHHHHHHHHHHHHhhhHHHHHHHHHhh Q lcl|NC_019725. 76 DVLNSDI--SGVPEFLSSKMDRIVSLSGIHEIIIKNKNVGGVSASQNT---ALETFYKLVDRKREEDYRPLLEFLLPFIV 150 (237) Q Consensus 76 ~~~~~~l--sGl~dl~~~~~~~iaa~s~iP~t~L~G~sp~GlnatGe~---D~~nyyd~I~~~Qe~~l~p~l~~l~~~i~ 150 (237) +.++..+ ......++.+...++..+|++-..|- -..+|. .|... ....-|.++..+| ..++.+|++|+..++ T Consensus 333 ~~~~~~ir~e~~~~~l~~~l~~i~~~~gls~~~~~-~~~~g~-~TAtei~s~~~~~~~t~~~~~-~~~~~al~~lv~~il 409 (500) T protein:vir:98 333 QDLTTPIRADDYIKAINEGLSLFEMQIGVSAGLFS-FDGKSM-KTATEIVSENSDTYQMRNSIV-ALVEQSLKELVISIF 409 (500) T ss_pred eEeccccChHHHHHHHHHHHHHHHHHhCCCccccc-cCcCcc-ccHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHH Confidence 6665554 34666777778888888887765543 233443 23332 3455677777777 558889998888774 Q ss_pred c--------------CCCceeEeCCCCCCCHHHHHHHHHHHHHHHHHHHhCCCCCHHHHHHHHHhhccccccCCCCCCCh Q lcl|NC_019725. 151 E--------------EEEWSIEFEPLSVPSKKEESEITKNNVESVTKAITEQIIDLEEARDTLRSIAPEFKLKDGNNINI 216 (237) Q Consensus 151 ~--------------s~~~~~~f~pL~~~seke~Aei~~~~A~a~~~~~~~g~i~~~e~r~~l~~~~~~~g~~~~~~~~~ 216 (237) . ..+++|.|+.-...++.+.+ +.+..++.+|++|.++++..+ +|+.+. .. . T Consensus 410 ~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~~~~-------~~~~~~v~aGi~s~~~~i~~~------~g~~ee-ea-~ 474 (500) T protein:vir:98 410 EIAKAYDLYQSEVPSMDNISISLDDGVFTDRDAEL-------DYWIKVVNAGFGTREMAIQKV------LNVTEE-KA-Q 474 (500) T ss_pred HHHHHHhhcCCCCCCCcceEEEeCCCCCCCHHHHH-------HHHHHHHHcCCCCHHHHHHhc------CCCCHH-HH-H Confidence 1 13678999987777765543 445567888998888776432 222110 00 0 Q ss_pred hccccC-CCCCCCCCCCcCcCC Q lcl|NC_019725. 217 REPEET-TEPEPGLGEKLEDEN 237 (237) Q Consensus 217 ~~~e~~-~e~~~~~~~~~~~e~ 237 (237) ++.++. .|..|..+...+..- T Consensus 475 ~~l~~i~~E~~~~~~~~~~~~~ 496 (500) T protein:vir:98 475 EIAAEINTGIVDEINQQRTDTH 496 (500) T ss_pred HHHHHHHHhccccCCCCCcccc Confidence 111111 111111111111111 No 182 >protein:vir:3028 Length: 500 # NCBI annotation: minor capsid protein # Family: family:all:898 # MgeID: mge:61 # MgeName: PhiNIH1.1 # Cross-refs: genbank:acc:NP_438141;genbank:gi:16271804;genbank:GeneID:929241 Probab=96.27 E-value=0.00026 Score=40.21 Aligned_cols=213 Identities=12% Similarity=0.081 Sum_probs=109.3 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHhccceeechhHHHhhc-C---CchHHHHHHHHHHHHHhcCchheee-eecCCcce Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRRKQQAVWKVKGLAEMCD-D---DDAQYAARLRLAQVDDNSGVGRAIG-IDAETEEY 75 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~~~~~v~k~~~l~~~~~-~---~~~e~~~~~r~~~~~~~r~~~~~~~-iD~~~e~~ 75 (237) .+..+.+.+...+.+......-+...+..++=-. .++. + .+++.....++. .-+..+-.+- -++++..+ T Consensus 259 ~~~~~~~lid~lD~~~s~~~~e~~~g~~~i~v~~---~~l~~~~~~~~g~~~~~~~~d---~~~~~~~~~~~~~~~~~~i 332 (500) T protein:vir:30 259 IFDNAKTTIDFINTTYDEFMWEVKMGQRRVAVPE---SLTALTVRTTDGDVVPRPRFE---SDQNVYIRMGGRDLDSSAI 332 (500) T ss_pred hhhhhHHHHHHHHHHHHHHHHHHHhCcceeeech---HHhcccCCCCCccccCCcccC---CCcceEEEcCCCCCcCcce Confidence 6667778888888888888877766555544322 2322 1 111110011111 0111111111 11223446 Q ss_pred eeeecCc--CCHHHHHHHHHHHHhhhhcCceeeeeccCcccccccchh---HHHHHHHHHHHHHHHhhhHHHHHHHHHhh Q lcl|NC_019725. 76 DVLNSDI--SGVPEFLSSKMDRIVSLSGIHEIIIKNKNVGGVSASQNT---ALETFYKLVDRKREEDYRPLLEFLLPFIV 150 (237) Q Consensus 76 ~~~~~~l--sGl~dl~~~~~~~iaa~s~iP~t~L~G~sp~GlnatGe~---D~~nyyd~I~~~Qe~~l~p~l~~l~~~i~ 150 (237) +.++..+ ......++.+...++..+|++-..|- -..+|. .|... ....-|.++..+| ..++.+|++|+..++ T Consensus 333 ~~~~~~ir~e~~~~~l~~~l~~i~~~~gls~~~~~-~~~~g~-~TAtei~s~~~~~~~t~~~~~-~~~~~al~~lv~~il 409 (500) T protein:vir:30 333 QDLTTPIRADDYIKAINEGLSLFEMQIGVSAGLFS-FDGKSM-KTATEIVSENSDTYQMRNSIV-ALVEQSLKELVISIF 409 (500) T ss_pred eEeccccChHHHHHHHHHHHHHHHHHhCCCccccc-cCcCcc-ccHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHH Confidence 6665554 34666777778888888887765543 233443 23332 3455677777777 558889998888774 Q ss_pred c--------------CCCceeEeCCCCCCCHHHHHHHHHHHHHHHHHHHhCCCCCHHHHHHHHHhhccccccCCCCCCCh Q lcl|NC_019725. 151 E--------------EEEWSIEFEPLSVPSKKEESEITKNNVESVTKAITEQIIDLEEARDTLRSIAPEFKLKDGNNINI 216 (237) Q Consensus 151 ~--------------s~~~~~~f~pL~~~seke~Aei~~~~A~a~~~~~~~g~i~~~e~r~~l~~~~~~~g~~~~~~~~~ 216 (237) . ..+++|.|+.-...++.+.+ +.+..++.+|++|.++++..+ +|+.+. .. . T Consensus 410 ~~~~~~~~~~~~~~~~~~v~v~f~d~i~~d~~~~~-------~~~~~~v~aGi~s~~~~i~~~------~g~~ee-ea-~ 474 (500) T protein:vir:30 410 EIAKAYDLYQSEVPSMDNISISLDDGVFTDRDAEL-------DYWIKVVNAGFGTREMAIQKV------LNVTEE-KA-Q 474 (500) T ss_pred HHHHHHhhcCCCCCCCcceEEEeCCCCCCCHHHHH-------HHHHHHHHcCCCCHHHHHHhc------CCCCHH-HH-H Confidence 1 13678999987777765543 445567888998888776432 222110 00 0 Q ss_pred hccccC-CCCCCCCCCCcCcCC Q lcl|NC_019725. 217 REPEET-TEPEPGLGEKLEDEN 237 (237) Q Consensus 217 ~~~e~~-~e~~~~~~~~~~~e~ 237 (237) ++.++. .|..|..+...+..- T Consensus 475 ~~l~~i~~E~~~~~~~~~~~~~ 496 (500) T protein:vir:30 475 EIAAEINTGIVDEINQQRTDTH 496 (500) T ss_pred HHHHHHHHhccccCCCCCcccc Confidence 111111 111111111111111 No 183 >protein:vir:78161 Length: 355 # NCBI annotation: hypothetical protein # Family: family:all:2372 # MgeID: mge:1847 # MgeName: Min1 # Cross-refs: genbank:acc:YP_001294798;genbank:gi:149882819;genbank:GeneID:5309189 Probab=95.96 E-value=0.0012 Score=36.60 Aligned_cols=224 Identities=10% Similarity=0.023 Sum_probs=100.4 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHh--ccceeechhHHHhhcCCc------hHHHHHHHHHHHHHhcCch-heeeeecC Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRRK--QQAVWKVKGLAEMCDDDD------AQYAARLRLAQVDDNSGVG-RAIGIDAE 71 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~~--~~~v~k~~~l~~~~~~~~------~e~~~~~r~~~~~~~r~~~-~~~~iD~~ 71 (237) |+..||-.+.--..+..--+..+.++ .+.+.+.+--......++ .+........++...++-. ..++|.. T Consensus 75 Llr~~~w~~~fK~~~~~~w~~f~Er~g~g~p~~~~~~~~~~~~~d~~~~~~~~~~~~~~l~~~~~~i~~g~~a~~iip~- 153 (355) T protein:vir:78 75 LLRQAYKNWLLKDRFLRIQALVGERNGLGVPIYQGAPLPEAIARDTARAEQWLNDQKEEGLQLAKEFRAGEAAGGYIPH- 153 (355) T ss_pred hHHHHHHHHHHHHhhHHHHHHHHHHcCCCceEEEecCCCCcccchhhhHHHHHHHHHHHHHHHHHHhhCCcceeEeecC- Confidence 88888776666666666677778887 667776652111000000 0111122233333333332 3445554 Q ss_pred CcceeeeecCcC--CHHHHHHHHHHHHhhhhcCceeeeeccCcc-cccccchhHHHHHHHHHHHHHHHhhhHHHHHHHHH Q lcl|NC_019725. 72 TEEYDVLNSDIS--GVPEFLSSKMDRIVSLSGIHEIIIKNKNVG-GVSASQNTALETFYKLVDRKREEDYRPLLEFLLPF 148 (237) Q Consensus 72 ~e~~~~~~~~ls--Gl~dl~~~~~~~iaa~s~iP~t~L~G~sp~-GlnatGe~D~~nyyd~I~~~Qe~~l~p~l~~l~~~ 148 (237) +.+++.+.+.-+ ....+++..-..||-+.--. |--.+.+.+ |=.|-|+.-...+.+.+.+-......-+-+.|+.- T Consensus 154 g~~ie~~ea~g~~~~~~~~i~~~d~~Isk~iLGq-tlTs~~~~~gGS~Alg~vh~~v~~~~~~aD~~~i~~~ln~~li~~ 232 (355) T protein:vir:78 154 GANFTLTGVQGKLPEMDGPIRYHDEQIARAVLAH-FLTLGGDKSTGSYALGDTFASFFTGSLNAVMKHIADVTQQHVVED 232 (355) T ss_pred CceEEEeecCCCcccHHHHHHHHHHHHHHHHhhh-hhccccCCccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 578888866532 34456666666666432111 111111222 22234565566677777776644333322345555 Q ss_pred hhcC----CC--ceeEeCCCCCCCHHHHHHHHHHHHHHHHHHHhCCCCCHHH-HHHHHHhhccccccCCCCCCChh---- Q lcl|NC_019725. 149 IVEE----EE--WSIEFEPLSVPSKKEESEITKNNVESVTKAITEQIIDLEE-ARDTLRSIAPEFKLKDGNNINIR---- 217 (237) Q Consensus 149 i~~s----~~--~~~~f~pL~~~seke~Aei~~~~A~a~~~~~~~g~i~~~e-~r~~l~~~~~~~g~~~~~~~~~~---- 217 (237) ++.- .. -.|+|... +. ..++.|++++.+++.|++.+++ ....++. .+|+......++. T Consensus 233 l~~lN~~~~~~~P~~~~~~~------~~--~~~~~a~~~~~l~~~G~~~~~~~~~~~~~e---~~gip~p~~~~~~~~~~ 301 (355) T protein:vir:78 233 LVDQNWGPEEPAPRLVPAQL------GK--EQPVTAEAIRALVECGAFTADPELEKDLRA---RYGLPAPAERDDGADAA 301 (355) T ss_pred HHHhcCCCCCCCCEEEecCc------Ch--hHHHHHHHHHHHHhCCCccccHHHHHHHHH---HhCCCCCCCCCcccCCc Confidence 4421 11 13444332 21 2234689999999999976644 3445553 2343111100000 Q ss_pred ccccCCCCC--CCCC-----CCcCcC--------C Q lcl|NC_019725. 218 EPEETTEPE--PGLG-----EKLEDE--------N 237 (237) Q Consensus 218 ~~e~~~e~~--~~~~-----~~~~~e--------~ 237 (237) .....+... .+.| +..... . T Consensus 302 ~~~~~~~~~~~~~~~~~~~~~~~a~~~~a~~~~~~ 336 (355) T protein:vir:78 302 AAKAAGRRRAKRLPGQRQGAALPSRSPRADPPRRR 336 (355) T ss_pred cccccccccccccCCccccccccccCCCCCChhhh Confidence 000000000 0000 000000 0 No 184 >protein:vir:103860 Length: 528 # NCBI annotation: portal protein # Family: family:all:313 # MgeID: mge:1522 # MgeName: D3112 # Cross-refs: genbank:acc:NP_938234;genbank:gi:38229139;genbank:GeneID:2648175 Probab=95.52 E-value=0.0019 Score=35.50 Aligned_cols=222 Identities=13% Similarity=0.056 Sum_probs=95.5 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHhcc--ceeechhHHHhhcCCchHHHHHHHHHHHHHhcCchheeeeecCCcceeee Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRRKQQ--AVWKVKGLAEMCDDDDAQYAARLRLAQVDDNSGVGRAIGIDAETEEYDVL 78 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~~~~--~v~k~~~l~~~~~~~~~e~~~~~r~~~~~~~r~~~~~~~iD~~~e~~~~~ 78 (237) |+..||-.+.--..+...-+..+.++.+ .+.|++. +..+.....=++.+..+.+ .+.++|- .+.+++.+ T Consensus 205 Llr~~~w~~~fK~~~~~~w~~f~E~yG~P~~igky~~-------~a~~~ek~~L~~al~~i~~-~~~~iiP-~~~~ie~~ 275 (528) T protein:vir:10 205 LFRVLAWPYLFKHYSTADLAEMLEIYGLPIRLGKYPP-------GTPDEEKVTLLRAVTGLGH-AAAGIIP-ESMSIDFQ 275 (528) T ss_pred hHHHHHHHHHHHHhhHHHHHHHHHHcCCCeEEEecCC-------CCCHHHHHHHHHHHHHHhh-CcEEEec-CCceeEEe Confidence 7777766665555555556677888775 5666641 1112222222333333333 3445554 46889998 Q ss_pred ecCcCCHH---HHHHHHHHHHhhh-hcCceeeeeccCcccccccchhHHHHHHHHHHHHHHHhhhHHH-HHHHHHhhcCC Q lcl|NC_019725. 79 NSDISGVP---EFLSSKMDRIVSL-SGIHEIIIKNKNVGGVSASQNTALETFYKLVDRKREEDYRPLL-EFLLPFIVEEE 153 (237) Q Consensus 79 ~~~lsGl~---dl~~~~~~~iaa~-s~iP~t~L~G~sp~GlnatGe~D~~nyyd~I~~~Qe~~l~p~l-~~l~~~i~~s~ 153 (237) +.+=++.+ .+++..-..||-+ .|=-+|--.|..-+|=+|-|+--.....+.+.+-... +...| +.|+.-++.-. T Consensus 276 ea~~~~~~~f~~li~~~d~~Isk~iLGqtlTs~~~~g~~gS~Alg~vh~~v~~di~~aDa~~-i~~tln~~li~~l~~~N 354 (528) T protein:vir:10 276 EASKGSAEPFMAMMRWCDDSMSKAILGGTLTSQTSESGGGAYALGQVHNEVRHDLLAADARQ-LAATLSRDLLWPLLVLN 354 (528) T ss_pred ecCCCChhHHHHHHHHHHHHHHHHHhhhhhhccccccccchhhhHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHhC Confidence 87644432 2455555555532 2212221112111222222333344555555555433 33333 33554443211 Q ss_pred ---CceeEeCCCCCCCHHHHHHHHHHHHHHHHHHHhCCC-CCHHHHHHHHHhhccccccCCCCCCChhccccC-CC--CC Q lcl|NC_019725. 154 ---EWSIEFEPLSVPSKKEESEITKNNVESVTKAITEQI-IDLEEARDTLRSIAPEFKLKDGNNINIREPEET-TE--PE 226 (237) Q Consensus 154 ---~~~~~f~pL~~~seke~Aei~~~~A~a~~~~~~~g~-i~~~e~r~~l~~~~~~~g~~~~~~~~~~~~e~~-~e--~~ 226 (237) ...-.--|-+.....|..++ ++.|++++.+++.|+ |+.+.+++.+.--.+ .++..+........ .. .. T Consensus 355 ~~~~~~~~~~p~~~~~~~e~eDl-~~~a~~~~~L~~~G~~i~~~~i~e~~gip~p----~~~e~~~~~~~~~~~~~~~~~ 429 (528) T protein:vir:10 355 RSGNLDARRAPRLVFDLKDRADL-AAMATSLPPLVKLGVQVPVNWVQEQLGIPLP----ANGEAVLGDQAGAGIAQLSRR 429 (528) T ss_pred CCCCCCccccceEEecCCCcccH-HHHHHHHHHHHhCCCCCCHHHHHHHhCCCCC----CCCcccccCCCcccccccCcc Confidence 00000001112222333333 357999999999998 899999987631000 01111110000000 00 00 Q ss_pred CCCCC-----Cc---CcCC Q lcl|NC_019725. 227 PGLGE-----KL---EDEN 237 (237) Q Consensus 227 ~~~~~-----~~---~~e~ 237 (237) ++... .. .++. T Consensus 430 ~~~~~~~~~~~~~~~~~~~ 448 (528) T protein:vir:10 430 PGPRIAALAQVIGPRYRDQ 448 (528) T ss_pred ccccccccccccccccccc Confidence 00000 00 0000 No 185 >protein:vir:10321 Length: 495 # NCBI annotation: ORF23 # Family: family:all:47 # MgeID: mge:182 # MgeName: VHML # Cross-refs: genbank:acc:NP_758916;genbank:gi:27311190;genbank:GeneID:956137 Probab=95.25 E-value=0.0024 Score=34.90 Aligned_cols=221 Identities=12% Similarity=0.118 Sum_probs=111.4 Q ss_pred CchhH--HHHHHHHHHHHHHHHHHHHHhccceeechhHHHhhcCC---chHHHHHHHHHHHHHhcCchheeeeecCCcce Q lcl|NC_019725. 1 MNKSL--IDAICDYDYCESLATQILRRKQQAVWKVKGLAEMCDDD---DAQYAARLRLAQVDDNSGVGRAIGIDAETEEY 75 (237) Q Consensus 1 llq~~--~d~v~~~~~~~~~~~~Ll~~~~~~v~k~~~l~~~~~~~---~~e~~~~~r~~~~~~~r~~~~~~~iD~~~e~~ 75 (237) .+-.+ +..+..|..+.--.+. +...-.-++|.++-....... ..+..-..+.. .-.-|++..-..++++ T Consensus 233 ~la~i~~l~~l~~y~dael~~a~-i~A~~~~fi~~~~~~~~~~~~~~~~~~~~~~~~~~-----~l~pG~i~~L~pGe~i 306 (495) T protein:vir:10 233 WFQLLLRLNELDQYEDAELVRKK-TAALFAAFIQEATADSTGGPTIGQPKRSKGGKRIT-----GLNPGTLQYLQPGQEV 306 (495) T ss_pred hhHHHHHHHHhhHHHHHHHHHHH-HhhhheeeeecCCCccccccccCccccccCcccce-----ecCCceeeecCCCCee Confidence 22122 2456666665544444 344445566654322211110 00000000110 1124455555667888 Q ss_pred eeeecC--cCCHHHHHHHHHHHHhhhhcCceeeeeccCcc-cccccchhHHHHHHHHHHHHHHHh-----hhHHHHHHHH Q lcl|NC_019725. 76 DVLNSD--ISGVPEFLSSKMDRIVSLSGIHEIIIKNKNVG-GVSASQNTALETFYKLVDRKREED-----YRPLLEFLLP 147 (237) Q Consensus 76 ~~~~~~--lsGl~dl~~~~~~~iaa~s~iP~t~L~G~sp~-GlnatGe~D~~nyyd~I~~~Qe~~-----l~p~l~~l~~ 147 (237) +.++.+ -++..+.+..+...||+..|||--.|.|--.+ ..+ |.-..+.-+...+++.|.+. ++|+-+..++ T Consensus 307 ~~~~p~~p~~~~~~f~~~~lr~iaaglGi~Ye~ltgD~s~~nYS-S~R~~~~e~~r~~~~~q~~~~~~~~~~pi~~~~l~ 385 (495) T protein:vir:10 307 KFSNPADVGTTYEPWLRYQLLSIAKGYGITYEMLTGDLRGVNYS-SIRAGLLEFRRLCQQVQHHMIIHQFCRPVGRWFMD 385 (495) T ss_pred eeeCCCCCCCCHHHHHHHHHHHHHhhcCCCHHHHhcccccccHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 888865 57999999999999999999999999995433 233 34556677888888888754 4566666666 Q ss_pred HhhcCCCc-------------eeEe--CCCCCCCHHHHHHHHHHHHHHHHHHHhCCCCCHHHHHHH-----------HH- Q lcl|NC_019725. 148 FIVEEEEW-------------SIEF--EPLSVPSKKEESEITKNNVESVTKAITEQIIDLEEARDT-----------LR- 200 (237) Q Consensus 148 ~i~~s~~~-------------~~~f--~pL~~~seke~Aei~~~~A~a~~~~~~~g~i~~~e~r~~-----------l~- 200 (237) ..+..-.+ ..+| +..-..+. .|.+++....+++|+.|..++..+ +. T Consensus 386 ~a~l~G~i~~p~~~~~~~~~~~~~w~~p~~~~vDP-------~Ke~~A~~~~i~~G~~s~~~~~a~~G~D~~~v~~q~a~ 458 (495) T protein:vir:10 386 FAVASGAVVIPDYLQRRRYYNRVSWRTPRWEEVDP-------LKKHLADLGDVRAGFAPISDKQAERGYDMEELFDMISD 458 (495) T ss_pred HHHHcCCCCCCCchhhhHhhhccccccCCccccCh-------HHHHHHHHHHHHcCCCCHHHHHHHcCCCHHHHHHHHHH Confidence 54433211 1222 33233333 356667777777777766655432 21 Q ss_pred --hhccccccCCCCCCChhccccCCCCCCCCCCCcCcC Q lcl|NC_019725. 201 --SIAPEFKLKDGNNINIREPEETTEPEPGLGEKLEDE 236 (237) Q Consensus 201 --~~~~~~g~~~~~~~~~~~~e~~~e~~~~~~~~~~~e 236 (237) ....+.|+.-..++.. .......+++...+..++| T Consensus 459 e~~~~~~~Gl~~~~~p~~-~~~~~~~~~~~~~~~~~~e 495 (495) T protein:vir:10 459 ANQLIDEYDLRLDSDPRY-VNGSGAEQKSVMEAALNNE 495 (495) T ss_pred HHHHHHHcCCCCCCCCCc-CCCccCCCCCCCCCCCCCC Confidence 1223445421111111 0111111222222233333 No 186 >protein:vir:95149 Length: 501 # NCBI annotation: hypothetical protein ORF007 # Family: family:all:584 # MgeID: mge:1552 # MgeName: PA73 # Cross-refs: genbank:acc:YP_001293414;genbank:gi:148912835;genbank:GeneID:5228224 Probab=95.25 E-value=0.0023 Score=35.08 Aligned_cols=208 Identities=13% Similarity=0.088 Sum_probs=110.7 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHhccceeechhHHHhhcCCchHHHHHHHHHHHHHhcCchheeeeecCCcceeeeec Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRRKQQAVWKVKGLAEMCDDDDAQYAARLRLAQVDDNSGVGRAIGIDAETEEYDVLNS 80 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~~~~~v~k~~~l~~~~~~~~~e~~~~~r~~~~~~~r~~~~~~~iD~~~e~~~~~~~ 80 (237) |+..++=.|.+|..... --++++-..+-++-+.|+...-........ ..-+....+.+.. +-++..++. T Consensus 270 Ll~lA~lni~hy~~ssd-~~~~l~~~~~P~l~i~G~~~~~~~~~~~~~---------i~~G~~~~~~lP~-~~~~~~ie~ 338 (501) T protein:vir:95 270 FYDLASLNMAHYRNSAD-YEESCYIVGQPTPVLIGLTEEWVTNVLKGS---------VNFGSRGGIPLPV-GADAKLLQA 338 (501) T ss_pred hHHHHHHHHHHHhhhhH-HHHHHHHcccceeeeeCCcccccccCCCCc---------eeecccccccCCC-CCceeEEec Confidence 88888888888887766 566788889988887776542211111000 0112233344443 456777777 Q ss_pred CcCCH-HHHHHHHHHHHhhhhcCceeeeeccCcccccccchhHHHHH---HHHHHHHHHHhhhHHHHHHHHHhhc----- Q lcl|NC_019725. 81 DISGV-PEFLSSKMDRIVSLSGIHEIIIKNKNVGGVSASQNTALETF---YKLVDRKREEDYRPLLEFLLPFIVE----- 151 (237) Q Consensus 81 ~lsGl-~dl~~~~~~~iaa~s~iP~t~L~G~sp~GlnatGe~D~~ny---yd~I~~~Qe~~l~p~l~~l~~~i~~----- 151 (237) +-+++ ...++...+++..+.- +|+-+.++.. |++.-...+ +..++.+- ..+.-.+.+++++++. T Consensus 339 ~~~~i~~~~l~~l~~~m~~~Ga----~ll~~~~~~~--Ta~~~~~~~~~~~S~L~~~a-~~le~al~~~l~~~a~w~g~~ 411 (501) T protein:vir:95 339 SENTMLKEAMDTKERQMVALGA----KLVEQKEVQR--TATEAELEAASEGSTLSSAT-KNVSAAFEWALKWAARWVGQA 411 (501) T ss_pred ChhhHHHHHHHHHHHHHHHHHH----hhccCCccch--hHHHHHHHHHHHhHHHHHHH-HHHHHHHHHHHHHHHHHcCCC Confidence 66666 4455555555554422 2333343434 333222222 22333333 3466677888877653 Q ss_pred CCCceeEeCCCCCCCHHHHHHHHHHHHHHHHHHHhCCCCCHHHHHHHHHhhccccccCCCCCCCh--hcccc-------- Q lcl|NC_019725. 152 EEEWSIEFEPLSVPSKKEESEITKNNVESVTKAITEQIIDLEEARDTLRSIAPEFKLKDGNNINI--REPEE-------- 221 (237) Q Consensus 152 s~~~~~~f~pL~~~seke~Aei~~~~A~a~~~~~~~g~i~~~e~r~~l~~~~~~~g~~~~~~~~~--~~~e~-------- 221 (237) .++..|+.|+-+....-+ ...++++..+++.|.|+.++.++.|+..+ +.+ .+-++ +.+++ T Consensus 412 ~~~~~v~i~~df~~~~~~-----~~~~~al~~~~~~G~is~~t~~~~L~~~~----v~~-~~~~~e~e~i~~~~~~~~~~ 481 (501) T protein:vir:95 412 DSGVKFELNTDFDIARMT-----PDERRSLVEEWQKGAITFEEMRTGLRKAG----VAT-EDDSKAKEKIAKDTAEAMAL 481 (501) T ss_pred CCceEEEEecccccccCC-----HHHHHHHHHHHhCCCCcHHHHHHHHHhCC----CCC-hhHHHHHHHHHhhhcCcccc Confidence 345778877776544332 33367788889999999999999998643 221 11111 11111 Q ss_pred -----CCCCCCCCCCCcCcC Q lcl|NC_019725. 222 -----TTEPEPGLGEKLEDE 236 (237) Q Consensus 222 -----~~e~~~~~~~~~~~e 236 (237) .+.+..|+.+--++| T Consensus 482 ~~~~~~~~~~~gg~~~~~~~ 501 (501) T protein:vir:95 482 ATPANVPGDGSGGDNVGNSE 501 (501) T ss_pred cccCCCCCCCcccccccCCC Confidence 001111111112333 No 187 >protein:vir:97265 Length: 513 # NCBI annotation: hypothetical protein ORF013 # Family: family:all:584 # MgeID: mge:1657 # MgeName: M6 # Cross-refs: genbank:acc:YP_001294521;genbank:gi:149408242;genbank:GeneID:5237130 Probab=95.13 E-value=0.0027 Score=34.66 Aligned_cols=207 Identities=11% Similarity=0.084 Sum_probs=107.8 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHhccceeechhHHHhhcCCchHHHHHHHHHHHHHhcCchheeeeecCCcceeeeec Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRRKQQAVWKVKGLAEMCDDDDAQYAARLRLAQVDDNSGVGRAIGIDAETEEYDVLNS 80 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~~~~~v~k~~~l~~~~~~~~~e~~~~~r~~~~~~~r~~~~~~~iD~~~e~~~~~~~ 80 (237) |+..++=.+.+|.... .--++++-..+.++-+.|+..- ..+ . ..-+.+..+.+...+.++..++. T Consensus 259 Ll~LA~ln~~hy~~~S-d~~~il~~~~~P~l~~~G~~~~----~~~-~---------i~iG~~~~~~lpe~~~~~~yie~ 323 (513) T protein:vir:97 259 LLDLAHLNVAHWQSAS-DQRHILTVSRFPILACSGASGE----DSD-P---------VVVGPNKVLYNPDPAGRFYYVEH 323 (513) T ss_pred hHHHHHHHHHHHhhhh-hHHHHHHhcccceeeeecCCcC----CCC-c---------eEeeccccccCCCCCCcceeecc Confidence 9988888888885554 4456788888888888765321 111 0 01123344455433566888888 Q ss_pred CcCCHHH---HHHHHHHHHhhhhcCceeeeeccCcccccccchhHH---HHHHHHHHHHHHHhhhHHHHHHHHHhhc--- Q lcl|NC_019725. 81 DISGVPE---FLSSKMDRIVSLSGIHEIIIKNKNVGGVSASQNTAL---ETFYKLVDRKREEDYRPLLEFLLPFIVE--- 151 (237) Q Consensus 81 ~lsGl~d---l~~~~~~~iaa~s~iP~t~L~G~sp~GlnatGe~D~---~nyyd~I~~~Qe~~l~p~l~~l~~~i~~--- 151 (237) +-+++.. -+....++|..+.-.++ ..+++.- |++.-. ..=+..++++. ..+.-.|+++++++.. T Consensus 324 ~g~~i~~~~~~l~~le~qm~~~Ga~ll----~~~~~~~--Ta~a~~~~~~~~~S~L~~~a-~~le~al~~~l~~~a~wlg 396 (513) T protein:vir:97 324 TGQAIAAGRTDLKDLEEQMAGYGAEFL----KRKTGGQ--TATARALDSAEATSDLSAMT-GLFEDALAQALDITADWLR 396 (513) T ss_pred CchhHHHHHHHHHHHHHHHHHHHHHhh----ccCCccc--cHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHhC Confidence 8888754 45555555554443333 3344443 443333 33334444444 3366778888887752 Q ss_pred --CCCceeEeCCCCCCCHHHHHHHHHHHHHHHHHHHhCCCCCHHHHHHHHHhhccccccCCCCC--CChhcc----ccCC Q lcl|NC_019725. 152 --EEEWSIEFEPLSVPSKKEESEITKNNVESVTKAITEQIIDLEEARDTLRSIAPEFKLKDGNN--INIREP----EETT 223 (237) Q Consensus 152 --s~~~~~~f~pL~~~seke~Aei~~~~A~a~~~~~~~g~i~~~e~r~~l~~~~~~~g~~~~~~--~~~~~~----e~~~ 223 (237) .++..|+.|+=+.......+ ..+++..+++.|.|+....++.|+.++.-. +..+ ..+++. +++. T Consensus 397 ~~~~~~~v~in~dF~~~~~~~~-----~~~al~~a~~~G~is~~t~~~~L~r~gvl~---~d~d~~~~~e~~~~~~~~~~ 468 (513) T protein:vir:97 397 LGPNGGTVELVKDYDLEEMDAP-----GLQALQVAREKRDISRKTYLNGLRLRGVLP---EDFDEDEDWEELMEEISEAM 468 (513) T ss_pred CCCCccEEEeccccCcccCCHH-----HHHHHHHHHhCCCCCHHHHHHHHHhccCCC---ccCCHHHHHHHHHHhhhhcc Confidence 24577888876654433322 335666778888888888888887643211 1111 001111 1110 Q ss_pred -C----------CCC--CCCCCc-CcCC Q lcl|NC_019725. 224 -E----------PEP--GLGEKL-EDEN 237 (237) Q Consensus 224 -e----------~~~--~~~~~~-~~e~ 237 (237) + .++ +.+.++ +.|. T Consensus 469 ~~~~~d~~~~~~~~~~~~~~~~~~~~~~ 496 (513) T protein:vir:97 469 GRAGLDLDPAQKNPPEGGEGEGEGEGEG 496 (513) T ss_pred CCCCccccccCCCCCCCCCCCCCCCCCC Confidence 0 000 000000 1111 No 188 >protein:vir:79150 Length: 368 # NCBI annotation: bacteriophage gpQ # Family: family:all:196 # MgeID: mge:1863 # MgeName: RSA1 # Cross-refs: genbank:acc:YP_001165254;genbank:gi:145708079;genbank:GeneID:5247161 Probab=94.97 E-value=0.001 Score=36.97 Aligned_cols=166 Identities=10% Similarity=0.079 Sum_probs=83.9 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHh-ccc-eeechhHHHhhcCCchHHHHHHHHHHHHHhcCchh-eeeeec--CCcce Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRRK-QQA-VWKVKGLAEMCDDDDAQYAARLRLAQVDDNSGVGR-AIGIDA--ETEEY 75 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~~-~~~-v~k~~~l~~~~~~~~~e~~~~~r~~~~~~~r~~~~-~~~iD~--~~e~~ 75 (237) -++.....+.....+......+.... +.. ++++.+ ..+ +++..+.+++.++-. ..-.|.+ ++++.. ..+.+ T Consensus 190 p~~~a~~si~l~~aa~~~~~~~~~NGa~~~gil~~~~--~~l-~~e~~~~lk~~~~~~-~G~~N~g~~~vl~~~g~~~g~ 265 (368) T protein:vir:79 190 EYLSALNATWLNESATLFRRRYYKNGSHAGFILYMTD--AAQ-KQEDVDTLREAMKSA-KGPGNFRNLFMYAPNGKKDGI 265 (368) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCC--CCC-CHHHHHHHHHHHHHh-cCCcccCceeEecCCCCccce Confidence 23333344443333333333333222 122 223322 112 222334566666542 2223444 444422 12456 Q ss_pred eeeecCcCCHHH----HHHHHHHHHhhhhcCceeeeeccCccccc--ccchhHHHHHHHHHHHHHHHhhhHHHHHHHHHh Q lcl|NC_019725. 76 DVLNSDISGVPE----FLSSKMDRIVSLSGIHEIIIKNKNVGGVS--ASQNTALETFYKLVDRKREEDYRPLLEFLLPFI 149 (237) Q Consensus 76 ~~~~~~lsGl~d----l~~~~~~~iaa~s~iP~t~L~G~sp~Gln--atGe~D~~nyyd~I~~~Qe~~l~p~l~~l~~~i 149 (237) +.+..+.+..++ +.....+.||++-|||- .|+|..+++-. ++-+.-.+.||. ..|.|.++++-.+. T Consensus 266 ~~~pls~~~~d~qf~e~k~~~~~eIa~af~VPp-~llGi~~~~t~~~sn~e~~~~~f~~-------~~l~Pl~~~ie~ln 337 (368) T protein:vir:79 266 QLLPVSEVAAKDEFWNIKNVTRDDQLAAHRVPP-QLMGIIPNNTGGFGDVEKAAMVFAR-------NEVKPLQDRLLAIN 337 (368) T ss_pred eEEEcCCCHHHHHHHHHHHHhHHHHHHHhCCCH-HHccccCCCCCccccHHHHHHHHHH-------HHHHHHHHHHHHHH Confidence 666666666554 44666788999999996 56687665321 223334445543 44777766665443 Q ss_pred hcCCCceeEeCC--CCCCCHHHHHHHHHHHH Q lcl|NC_019725. 150 VEEEEWSIEFEP--LSVPSKKEESEITKNNV 178 (237) Q Consensus 150 ~~s~~~~~~f~p--L~~~seke~Aei~~~~A 178 (237) -+-....++|++ |...+.+.+|+...+-| T Consensus 338 ~~l~~e~~rF~~~~l~~~D~~a~a~~~~rsa 368 (368) T protein:vir:79 338 DWIGDEVVRFAPYALGGHDQPAAAPGGQRSA 368 (368) T ss_pred hccCcceeeechhHhhcccccccCCcccccC Confidence 222233567776 66777777777555555 No 189 >protein:vir:3420 Length: 533 # NCBI annotation: capsid component # Family: family:all:47 # MgeID: mge:70 # MgeName: lambda # Cross-refs: genbank:acc:NP_040583;genbank:gi:9626247;genbank:GeneID:2703526 Probab=93.39 E-value=0.0077 Score=32.16 Aligned_cols=225 Identities=8% Similarity=-0.044 Sum_probs=108.9 Q ss_pred CchhHHHHHHH---HHHHHHHHHHHHHHhccceeechhHHH----hhcCCch---HHHHHHHHHHHHH----h--cCchh Q lcl|NC_019725. 1 MNKSLIDAICD---YDYCESLATQILRRKQQAVWKVKGLAE----MCDDDDA---QYAARLRLAQVDD----N--SGVGR 64 (237) Q Consensus 1 llq~~~d~v~~---~~~~~~~~~~Ll~~~~~~v~k~~~l~~----~~~~~~~---e~~~~~r~~~~~~----~--r~~~~ 64 (237) .+-.++..+.+ |..+.--.+ .+...-..++|.+.-.. .+..... ...+......... . .-.-| T Consensus 250 ~lapvl~~l~~l~~y~dael~~a-~i~A~~a~fi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG 328 (533) T protein:vir:34 250 VFYSVMEQMKMLDTLQNTQLQSA-IVKAMYAATIESELDTQSAMDFILGANSQEQRERLTGWIGEIAAYYAAAPVRLGGA 328 (533) T ss_pred hHHHHHHHHHHHHHHHHHHHHHH-HHhhhheeeeecCCCcccccccccCCCcccccccccccchhhhhccCcceeeccCc Confidence 44444444444 333333333 34444455566542111 1111100 0111111110000 0 01134 Q ss_pred eeeeecCCcceeeeecC--cCCHHHHHHHHHHHHhhhhcCceeeeeccC-cccccccchhHHHHHHHHHHHHHHHhhhHH Q lcl|NC_019725. 65 AIGIDAETEEYDVLNSD--ISGVPEFLSSKMDRIVSLSGIHEIIIKNKN-VGGVSASQNTALETFYKLVDRKREEDYRPL 141 (237) Q Consensus 65 ~~~iD~~~e~~~~~~~~--lsGl~dl~~~~~~~iaa~s~iP~t~L~G~s-p~GlnatGe~D~~nyyd~I~~~Qe~~l~p~ 141 (237) ++..-..++++..++.+ -++..+.+..+...||+..|||...|.|-- -...+ |.-..+.-+...++..|...+.++ T Consensus 329 ~i~~L~pGe~i~~~~~~~p~~~~~~f~~~~lr~iAaglGi~ye~lt~D~s~~nYS-S~R~~~~e~~r~~~~~q~~~~~~~ 407 (533) T protein:vir:34 329 KVPHLMPGDSLNLQTAQDTDNGYSVFEQSLLRYIAAGLGVSYEQLSRNYAQMSYS-TARASANESWAYFMGRRKFVASRQ 407 (533) T ss_pred eeeecCCCCeeeecCCCCCCCCHHHHHHHHHHHHHhhcCCCHHHHhhhcccccHH-HHHHHHHHHHHHHHHHHHHHHHHH Confidence 45555667888877754 579999999999999999999999999963 23343 456677888899999998766554 Q ss_pred H----HHHHHHhhcCCCc--------------------eeEeCCCCCCCHHHHHHHHHHHHHHHHHHHhCCCCCHHHHHH Q lcl|NC_019725. 142 L----EFLLPFIVEEEEW--------------------SIEFEPLSVPSKKEESEITKNNVESVTKAITEQIIDLEEARD 197 (237) Q Consensus 142 l----~~l~~~i~~s~~~--------------------~~~f~pL~~~seke~Aei~~~~A~a~~~~~~~g~i~~~e~r~ 197 (237) + +..++..+.+..+ .|..++.-..+. .|.+++....+++|+.|..++.. T Consensus 408 ~~pi~~~wl~~ail~G~i~~p~~~~~~~~~~~~~~~~~~w~~p~~~~iDP-------~Ke~~a~~~~i~~G~~s~~~~~a 480 (533) T protein:vir:34 408 ASQMFLCWLEEAIVRRVVTLPSKARFSFQEARSAWGNCDWIGSGRMAIDG-------LKEVQEAVMLIEAGLSTYEKECA 480 (533) T ss_pred HHHHHHHHHHHHHHcCcccCCCccCCCchhhHHhhhceeeccCCccccCh-------HHHHHHHHHHHHcCCCCHHHHHH Confidence 4 4444433322111 122233333333 46677777888888877766653 Q ss_pred HH-----------H---hhccccccCCCCCCChhccccCCCCCCCCCCCcCcCC Q lcl|NC_019725. 198 TL-----------R---SIAPEFKLKDGNNINIREPEETTEPEPGLGEKLEDEN 237 (237) Q Consensus 198 ~l-----------~---~~~~~~g~~~~~~~~~~~~e~~~e~~~~~~~~~~~e~ 237 (237) +. . ....+.|+....+......... .+...++.++.. T Consensus 481 ~~G~D~~ev~~q~a~e~~~~~~~gl~~~~~~~~~~~s~~---~~~~~~~~~~~~ 531 (533) T protein:vir:34 481 KRGDDYQEIFAQQVRETMERRAAGLKPPAWAAAAFESGL---RQSTEEEKSDSR 531 (533) T ss_pred HcCCCHHHHHHHHHHHHHHHHhcCCCCCCCCCcCccCCC---CCCCCCCcccCC Confidence 32 1 1223444432221111110000 111111111111 No 190 >protein:vir:95542 Length: 548 # NCBI annotation: Putative portal protein # Family: family:all:47 # MgeID: mge:1574 # MgeName: F10 # Cross-refs: genbank:acc:YP_001293348;genbank:gi:148912769;genbank:GeneID:5228194 Probab=93.07 E-value=0.0052 Score=33.10 Aligned_cols=221 Identities=10% Similarity=0.015 Sum_probs=106.7 Q ss_pred CchhHHHHHH---HHHHHHHHHHHHHHHhccceeechhHHHhhcCCchHHHHHHHHHHHHHhcCchheee-eecCCccee Q lcl|NC_019725. 1 MNKSLIDAIC---DYDYCESLATQILRRKQQAVWKVKGLAEMCDDDDAQYAARLRLAQVDDNSGVGRAIG-IDAETEEYD 76 (237) Q Consensus 1 llq~~~d~v~---~~~~~~~~~~~Ll~~~~~~v~k~~~l~~~~~~~~~e~~~~~r~~~~~~~r~~~~~~~-iD~~~e~~~ 76 (237) .+-.++..+. .|..+. .++..+...-..++|.++-......+ +...-...+. -.-|+++ .-..+|++. T Consensus 246 ~lapvl~~l~~l~~y~dae-l~~aki~A~~a~fi~~~~~~~~~~~~-~~~~~~~~~~------~~pG~iv~~L~pGe~i~ 317 (548) T protein:vir:95 246 MLHAVLIRLADLKDYEESE-RVAARISAALAMYIKKGNPDSYTVEP-GKDRKNRTIP------IAPGMVFDDLEPGEDVG 317 (548) T ss_pred hHHHHHHHHHHHhHHHHHH-HHHHHHhhhheeeeecCCCccccCCC-Cccccccccc------ccCCccccccCCCceee Confidence 4444444444 444444 33444455455666665322111111 1110011111 1134432 234578888 Q ss_pred eeecC--cCCHHHHHHHHHHHHhhhhcCceeeeeccCcccccccchhHHHHHHHHHHHHHHHhhhHHHHHH----HHHhh Q lcl|NC_019725. 77 VLNSD--ISGVPEFLSSKMDRIVSLSGIHEIIIKNKNVGGVSASQNTALETFYKLVDRKREEDYRPLLEFL----LPFIV 150 (237) Q Consensus 77 ~~~~~--lsGl~dl~~~~~~~iaa~s~iP~t~L~G~sp~GlnatGe~D~~nyyd~I~~~Qe~~l~p~l~~l----~~~i~ 150 (237) .++.+ -++..+.+..+...||+..|||.-.|.|-.-+..+ |.-..+.-+...+...|...+..++..+ ++..+ T Consensus 318 ~~~p~~p~~~~~~f~~~~lr~IAaglGipYe~ltgD~s~nYS-S~R~~l~e~~r~~~~~q~~~i~~~~~Pi~~~wle~a~ 396 (548) T protein:vir:95 318 MIESNRPNPFLEGFRNGQLRMIGAGTRSTYSSVSRAYDGTYS-AQRQELVEGWLGYDLLQHEFIDYWCRPVYRSWLQMYL 396 (548) T ss_pred ecCCCCCCCCHHHHHHHHHHHHHhhcCCCHHHHhcccchhHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 87754 57999999999999999999999999998644443 4556777788888889887554444433 33332 Q ss_pred cCCC------------ceeEe--CCCCCCCHHHHHHHHHHHHHHHHHHHhCCCCCHHHHHHH-----------HH---hh Q lcl|NC_019725. 151 EEEE------------WSIEF--EPLSVPSKKEESEITKNNVESVTKAITEQIIDLEEARDT-----------LR---SI 202 (237) Q Consensus 151 ~s~~------------~~~~f--~pL~~~seke~Aei~~~~A~a~~~~~~~g~i~~~e~r~~-----------l~---~~ 202 (237) ..-. +..+| +..-..+. .|.+++....+++|+.|..++..+ +. .. T Consensus 397 l~G~i~lP~~~~~~~~~~~~W~~P~~~~iDP-------~Kea~A~~~~i~~Gl~T~~~~~a~~G~D~~ev~~q~a~E~~~ 469 (548) T protein:vir:95 397 LARKERLPADVDHRTLYAAVYQGPVMPWINP-------MHEANAWELLVKAGFADEAEVARARGRDPRELKKSRETEIKA 469 (548) T ss_pred HcCCcCCCCCCCchhheeeeeecCCccccCh-------HHHHHHHHHHHHcCCCCHHHHHHHhCCCHHHHHHHHHHHHHH Confidence 2211 22334 22222333 356667777777777766655433 21 11 Q ss_pred ccccccCCCCCCCh------hccccCCC------CCCCCCCCcCcC--------------------C Q lcl|NC_019725. 203 APEFKLKDGNNINI------REPEETTE------PEPGLGEKLEDE--------------------N 237 (237) Q Consensus 203 ~~~~g~~~~~~~~~------~~~e~~~e------~~~~~~~~~~~e--------------------~ 237 (237) ..+.|+.-..+..- .++.+.+. .-|-+|++.+.| | T Consensus 470 ~~~~GL~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 536 (548) T protein:vir:95 470 NRAAGLVFSSDAYHQLVKSGMDPVEAVQKVYLGVGKMLTADEARELVNRYGAGLPVPGPDFPNESNN 536 (548) T ss_pred HHHcCCCCCCcccccccccccCCCCchhhhccccccccccchhHHhhccCCCCCcCCCCCCCccccc Confidence 22344321111100 00000000 000111111111 1 No 191 >protein:vir:389 Length: 530 # NCBI annotation: gp4 # Family: family:all:47 # MgeID: mge:325 # MgeName: N15 # Cross-refs: genbank:acc:NP_046899;genbank:gi:9630468;genbank:GeneID:1261643 Probab=91.81 E-value=0.014 Score=30.73 Aligned_cols=226 Identities=8% Similarity=-0.058 Sum_probs=109.8 Q ss_pred CchhHHHHHHHHHHHHHH--HHHHHHHhccceeechhHHH----hhcCCchHHH-------HHHHHHHHHH--hcCchhe Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESL--ATQILRRKQQAVWKVKGLAE----MCDDDDAQYA-------ARLRLAQVDD--NSGVGRA 65 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~--~~~Ll~~~~~~v~k~~~l~~----~~~~~~~e~~-------~~~r~~~~~~--~r~~~~~ 65 (237) .+-.++..+.+++.-..+ ++..+...-..++|.+.-.. .....++... ...+....+. ..-.-|+ T Consensus 247 ~lapvl~~l~~l~~y~dael~~a~i~A~~a~fi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~pG~ 326 (530) T protein:vir:38 247 AFYSVMEQMKMLDTLQNTQLQSAIVKAMYAATIESELDTQSAMDFILGADNKEQQSKLTGWLGEMAAYYSAAPVRLGGAR 326 (530) T ss_pred hHHHHHHHHHHHhHHHHHHHHHHHHhhhheeeeeccCCccccccccccCCcccccccccccchhhhhcccccceeccCce Confidence 444555544444433222 22334444455666542111 1110110000 0001110000 0112444 Q ss_pred eeeecCCcceeeeecC--cCCHHHHHHHHHHHHhhhhcCceeeeeccC-cccccccchhHHHHHHHHHHHHHHHhhhHHH Q lcl|NC_019725. 66 IGIDAETEEYDVLNSD--ISGVPEFLSSKMDRIVSLSGIHEIIIKNKN-VGGVSASQNTALETFYKLVDRKREEDYRPLL 142 (237) Q Consensus 66 ~~iD~~~e~~~~~~~~--lsGl~dl~~~~~~~iaa~s~iP~t~L~G~s-p~GlnatGe~D~~nyyd~I~~~Qe~~l~p~l 142 (237) +..-..++++...+.+ -++..+.+..+...||+..|||...|.|-- -...+ |.-..+.-|...++..|...+.|++ T Consensus 327 i~~L~pGe~i~~~~p~~p~~~~~~f~~~~lr~iaaglGi~ye~lt~D~s~~nYS-S~R~~~~e~~r~~~~~q~~~~~~~~ 405 (530) T protein:vir:38 327 VPHLLPGDSLNLQSAQDTDNGYSTFEQSLLRYIAAGLGVSYEQLSRNYSQMSYS-TARASANESWAYFMGRRKFVASRQA 405 (530) T ss_pred eeecCCCCeeeeeCCCCCCCCHHHHHHHHHHHHHhhcCCCHHHHhcccccccHH-HHHHHHHHHHHHHHHHHHHHHHHHh Confidence 5445567888877765 478999999999999999999999999943 22343 4566788899999999998766655 Q ss_pred HHHHHHh----hcCCC--------------------ceeEeCCCCCCCHHHHHHHHHHHHHHHHHHHhCCCCCHHHHHHH Q lcl|NC_019725. 143 EFLLPFI----VEEEE--------------------WSIEFEPLSVPSKKEESEITKNNVESVTKAITEQIIDLEEARDT 198 (237) Q Consensus 143 ~~l~~~i----~~s~~--------------------~~~~f~pL~~~seke~Aei~~~~A~a~~~~~~~g~i~~~e~r~~ 198 (237) ..+.... +.... ..|..++.-..+. .|.+++....+++|+.|..++..+ T Consensus 406 ~pi~~~wl~~av~~G~i~~p~~~~~~~~~~~~a~~~~~w~~p~~~~iDP-------~Ke~~a~~~~i~~G~~s~~~~~a~ 478 (530) T protein:vir:38 406 CQMFLCWLEEAIVRRVVTLPSKARFSFQEARTAWGNANWIGSGRMAIDG-------LKEVQEAVMLIEAGLSTYEKECAK 478 (530) T ss_pred hHHHHHHHHHHHHcCCccCCCCCCCCchhhHHhhhceeeecCCccccCh-------HHHHHHHHHHHHcCCCCHHHHHHH Confidence 5444432 22211 1222233333443 356667777777777766655432 Q ss_pred -----------HH---hhccccccCCCCCCChhccccCCCCCCCCCCCcCcCC Q lcl|NC_019725. 199 -----------LR---SIAPEFKLKDGNNINIREPEETTEPEPGLGEKLEDEN 237 (237) Q Consensus 199 -----------l~---~~~~~~g~~~~~~~~~~~~e~~~e~~~~~~~~~~~e~ 237 (237) +. ....+.|+........ .......+...++.++.+ T Consensus 479 ~G~D~~~v~~q~a~e~~~~~~~Gl~~~~~~~~---~~~~~~~~~~~~~~d~~~ 528 (530) T protein:vir:38 479 RGDDYQEIFAQQVRESMERRAAGLNPPAWAAA---AFEAGVKKSNEEEQDGAR 528 (530) T ss_pred cCCCHHHHHHHHHHHHHHHHHcCCCCCCCccc---ccCCCCCCCCCCCCCCCC Confidence 21 1112344422111110 011111222222222222 No 192 >protein:vir:94956 Length: 452 # NCBI annotation: putative phage structural protein # Family: family:all:584 # MgeID: mge:1538 # MgeName: Xp15 # Cross-refs: genbank:acc:YP_239276;genbank:gi:66392058;genbank:GeneID:5076601 Probab=91.58 E-value=0.015 Score=30.55 Aligned_cols=201 Identities=14% Similarity=0.068 Sum_probs=109.5 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHhccceeechhHHHhhcCCchHHHHHHHHHHHHHhcCchheeeeecCCcceeeeec Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRRKQQAVWKVKGLAEMCDDDDAQYAARLRLAQVDDNSGVGRAIGIDAETEEYDVLNS 80 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~~~~~v~k~~~l~~~~~~~~~e~~~~~r~~~~~~~r~~~~~~~iD~~~e~~~~~~~ 80 (237) |+..+.=.+.+|.+... --++++-..+-+.-+.|+.+ .+. ..-+....+.+...+.++..++. T Consensus 235 Ll~LA~ln~~hy~~~sd-~~~~l~~~~~P~l~~~g~~~----~~~------------i~iG~~~~~~lpe~~~~~~yie~ 297 (452) T protein:vir:94 235 MIDIVDINYSHYRTSAD-LEHGRHFTGLPTPWITGAES----QST------------MHIGSTKAWVIPEVAAKVGFLEF 297 (452) T ss_pred hHHHHHHHHHHhcchhH-HHHHHHHcccceeEeecCcC----CCc------------eEecccccccCCCCCCcceEEcc Confidence 88888888898888776 56778888888887766432 110 11133444556543556888888 Q ss_pred CcCCHH---HHHHHHHHHHhhhhcCceeeeeccCcccccccchhHHHHH---HHHHHHHHHHhhhHHHHHHHHHhhc--- Q lcl|NC_019725. 81 DISGVP---EFLSSKMDRIVSLSGIHEIIIKNKNVGGVSASQNTALETF---YKLVDRKREEDYRPLLEFLLPFIVE--- 151 (237) Q Consensus 81 ~lsGl~---dl~~~~~~~iaa~s~iP~t~L~G~sp~GlnatGe~D~~ny---yd~I~~~Qe~~l~p~l~~l~~~i~~--- 151 (237) +-+++. +-++...+++..+.. +|+=++++|- -|++.-...+ +..++++- ..+.-.+.+++++++. T Consensus 298 ~g~~i~~~~~~l~~le~~m~~~Ga----~ll~~~~~~~-~s~ea~~~~~~~~~s~L~~~a-~~~e~al~~~l~~~a~w~g 371 (452) T protein:vir:94 298 TGQGLQSLEKALSEKQAQLASLSA----RLIDNSTRGS-EATETVKLRYMSETASLKSVT-RAVEALLNKAYSCIMDMES 371 (452) T ss_pred CchhHHHHHHHHHHHHHHHHHHHH----HhhccCCCcc-hHHHHHHHHHHHhhHHHHHHH-HHHHHHHHHHHHHHHHHcC Confidence 888874 445555555543322 3333343442 2444322222 33444443 3356677888887753 Q ss_pred -CCCceeEeCC---CCCCCHHHHHHHHHHHHHHHHHHHhCCCCCHHHHHHHHHhhccccccCCCCCCChhc-cccCCCCC Q lcl|NC_019725. 152 -EEEWSIEFEP---LSVPSKKEESEITKNNVESVTKAITEQIIDLEEARDTLRSIAPEFKLKDGNNINIRE-PEETTEPE 226 (237) Q Consensus 152 -s~~~~~~f~p---L~~~seke~Aei~~~~A~a~~~~~~~g~i~~~e~r~~l~~~~~~~g~~~~~~~~~~~-~e~~~e~~ 226 (237) ..+..|+.|. .-.+++ ...+++..+++.|.|+.+..++.|+..+. .+. +-+.+. .++.+.+. T Consensus 372 ~~~~~~v~~n~dF~~~~~~~--------~~~~al~~~~~~G~is~~t~~~~L~~~gv----l~~-~~e~~~i~~E~~~~~ 438 (452) T protein:vir:94 372 MGGTLNIKLNSAFLDSKLTA--------AELKAWVEAYLSGGISKEIYIHALKVGKV----LPP-PGESMGVIPDPPAPE 438 (452) T ss_pred CCCceEEEeccccccccCCH--------HHHHHHHHHHhcCCCcHHHHHHHHHhCCC----CCC-ccCHHHHHHHhhccC Confidence 2344444432 112222 34455667899999999999999987543 211 111111 12222222 Q ss_pred C-CCCCCcCcCC Q lcl|NC_019725. 227 P-GLGEKLEDEN 237 (237) Q Consensus 227 ~-~~~~~~~~e~ 237 (237) | ..|++.++-| T Consensus 439 ~~~~~~~~~~~~ 450 (452) T protein:vir:94 439 PSPSNTPPNPSS 450 (452) T ss_pred cccCCCCCCCcc Confidence 2 2234444444 No 193 >protein:vir:95254 Length: 488 # NCBI annotation: Phage conserved protein # Family: family:all:2372 # MgeID: mge:1561 # MgeName: Felix 01 # Cross-refs: genbank:acc:NP_944885;genbank:gi:158267601;genbank:GeneID:2744039 Probab=91.45 E-value=0.016 Score=30.46 Aligned_cols=223 Identities=10% Similarity=0.056 Sum_probs=87.5 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHh--ccceeechhHHHhhc-CCchH-HHHHHHH-HHHHHhcCc--hheee-----e Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRRK--QQAVWKVKGLAEMCD-DDDAQ-YAARLRL-AQVDDNSGV--GRAIG-----I 68 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~~--~~~v~k~~~l~~~~~-~~~~e-~~~~~r~-~~~~~~r~~--~~~~~-----i 68 (237) |+..||-...--..+..--+.-+.++ .+.+.+.+ ..... +.+.+ ..+.+.+ ++......+ .++++ + T Consensus 221 Llr~~~w~~~fK~~~~~~w~~f~Er~g~g~p~~~~p--~~~~~~~~~~e~~~l~~a~~~i~~~~~~~~~ag~iiP~g~~~ 298 (488) T protein:vir:95 221 PLLNAYVPWKYKVQIEEYEAVGVSRDLVGMPKIGLP--PDYLDENAEPEKKAFVQYCKTVVNDMIANDRAGLIWPRYIDP 298 (488) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhcccceeEeec--cCCCCCcccHHHHHHHHHHHHHHHHhhccchhheeecccccc Confidence 66666655443334444444455554 44454442 11111 11111 1122222 222222211 22222 1 Q ss_pred ecCCcc--eeeeecCcCC---HHHHHHHHHHHHhhhh-cCceeeeeccCcccccccchhHHHHHHHHHHHHHHHhhhHHH Q lcl|NC_019725. 69 DAETEE--YDVLNSDISG---VPEFLSSKMDRIVSLS-GIHEIIIKNKNVGGVSASQNTALETFYKLVDRKREEDYRPLL 142 (237) Q Consensus 69 D~~~e~--~~~~~~~lsG---l~dl~~~~~~~iaa~s-~iP~t~L~G~sp~GlnatGe~D~~nyyd~I~~~Qe~~l~p~l 142 (237) +...+. ++...+.=++ ...+++..-..||-+. |--+|- +..-+|-+|.|+--...+.+.+.+-......-+- T Consensus 299 ~~k~~~~e~~l~~~~~~~~~~~~~li~~~d~~Isk~iLGqtLT~--~~~~~Gs~Al~~vh~ev~~~i~~aDa~~i~~tln 376 (488) T protein:vir:95 299 DTKEDIFEFSLVSRQGAKAYDTGSIIDRYSKQIMMAFMSDVLAM--GQSKYGSFSLADSKTSLLAMSVDILLKQIKNVIN 376 (488) T ss_pred ccchhhhhhhccccccCCchhHHHHHHHHHHHHHHHHhcccccc--ccCcchhhhHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 221111 2222222222 3456776667777443 222221 2222455556666667777777766644333333 Q ss_pred HHHHHHhhc-C---CC--ceeEeCCCCCCCHHHHHHHHHHHHHHHHHHHhCCCCCHH-HHHHHHHhhccccccCCCCCCC Q lcl|NC_019725. 143 EFLLPFIVE-E---EE--WSIEFEPLSVPSKKEESEITKNNVESVTKAITEQIIDLE-EARDTLRSIAPEFKLKDGNNIN 215 (237) Q Consensus 143 ~~l~~~i~~-s---~~--~~~~f~pL~~~seke~Aei~~~~A~a~~~~~~~g~i~~~-e~r~~l~~~~~~~g~~~~~~~~ 215 (237) +.|+.-++. . .. -.|+| ...+..++ ++.|++++.++++|+..++ +..+.++. .+|+.....-. T Consensus 377 ~~li~~l~~~Nfg~~~~~P~~~~------~~~e~~Dl-~~~ae~~~~L~~~G~~i~~~~~~~~i~e---~~gip~~~~~e 446 (488) T protein:vir:95 377 RDLVAQTYALNMWDDEEHVQITY------DDIETPDL-EAIGSYIQKTVAVGALEVDKELSNKLRE---HIGLPPADESQ 446 (488) T ss_pred HHHHHHHHHhcCCCCCCccEEEe------cCcChhhH-HHHHHHHHHHHhCCCccccHHHHHHHHH---HhCCCCCCCCc Confidence 345544432 1 11 23444 33333333 3678999999999997653 23344443 23332110000 Q ss_pred hhccccCCCCCCCCCCCcCc-----------C--------C Q lcl|NC_019725. 216 IREPEETTEPEPGLGEKLED-----------E--------N 237 (237) Q Consensus 216 ~~~~e~~~e~~~~~~~~~~~-----------e--------~ 237 (237) .......+.+++..+..... + | T Consensus 447 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~ 487 (488) T protein:vir:95 447 PVSEKLSPNSQSRSGDGYKTAGEGTAKTPSAKDPSTANKAN 487 (488) T ss_pred cccccCCCCCCCCCCcccCCCcccCCcccccccchhhhhcc Confidence 00000111111111111000 0 0 No 194 >protein:vir:99853 Length: 488 # NCBI annotation: portal protein # Family: family:all:313 # MgeID: mge:1480 # MgeName: B3 # Cross-refs: genbank:acc:YP_164068;genbank:gi:56692600;genbank:GeneID:3192581 Probab=90.92 E-value=0.018 Score=30.10 Aligned_cols=208 Identities=11% Similarity=-0.001 Sum_probs=96.1 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHhccce--eechhHHHhhcCCchHHHHHHHHHHHHHhcCchheeeeecCCcceeee Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRRKQQAV--WKVKGLAEMCDDDDAQYAARLRLAQVDDNSGVGRAIGIDAETEEYDVL 78 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~~~~~v--~k~~~l~~~~~~~~~e~~~~~r~~~~~~~r~~~~~~~iD~~~e~~~~~ 78 (237) |+..||..+.--..+...-+..+.++.+-+ .|++. .+..+....+=++.+..+.+ .+.++|.. +.+++.+ T Consensus 185 Ll~~~~w~~~fK~~~~~~w~~f~E~yG~P~~igky~~------~~a~~~ek~~l~~av~~~~~-~~~~viP~-~~~ie~~ 256 (488) T protein:vir:99 185 LAHWLYWPVFFKRNGIKFWLIFLDKFGMPTAVGRYDD------KTATPEDKAKLLAALHAIQT-DSAIIMPA-GMQAELL 256 (488) T ss_pred HHHHHHHHHHHHHhhHHHHHHHHHHcCCceeeeecCC------CCCCHHHHHHHHHHHHHHhc-CcEEEecC-CceeEEe Confidence 777777765444445555556677766554 44321 11111111111233333333 44555554 5789988 Q ss_pred ecCcCCHH---HHHHHHHHHHhhh-hcCceeeeeccCcccccccchhHHHHHHHHHHHHHHHhhhHHH-HHHHHHhhcC- Q lcl|NC_019725. 79 NSDISGVP---EFLSSKMDRIVSL-SGIHEIIIKNKNVGGVSASQNTALETFYKLVDRKREEDYRPLL-EFLLPFIVEE- 152 (237) Q Consensus 79 ~~~lsGl~---dl~~~~~~~iaa~-s~iP~t~L~G~sp~GlnatGe~D~~nyyd~I~~~Qe~~l~p~l-~~l~~~i~~s- 152 (237) +..=+|.. .+++..-..||-+ .|=-+| ++.-+|=.|.|+.-...+.+.+.+.... +...+ +.|+..++.- T Consensus 257 ea~~~~~~~~~~li~~~d~~Isk~iLGqtlt---s~~~~Gs~a~~~vh~~v~~d~~~aDa~~-i~~tln~~li~~l~~~N 332 (488) T protein:vir:99 257 EAGRSGTADYKTLHDTMDATIAKVGLGQVAS---TQGTPGRLGNDDLQADVRLDLVKADADL-ICESFNLGPARWLTEWN 332 (488) T ss_pred ecCCCChHHHHHHHHHHHHHHHHHHhhhhhc---ccccccchhhHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHhC Confidence 87544433 3666666666633 232222 2333333456666666777777776644 33334 3465554432 Q ss_pred ---CCc-eeEeCCCCCCCHHHHHHHHHHHHHHHHHHHhC-CC-CCHHHHHHHHHhhccccccCCCCCCChhccccCCCCC Q lcl|NC_019725. 153 ---EEW-SIEFEPLSVPSKKEESEITKNNVESVTKAITE-QI-IDLEEARDTLRSIAPEFKLKDGNNINIREPEETTEPE 226 (237) Q Consensus 153 ---~~~-~~~f~pL~~~seke~Aei~~~~A~a~~~~~~~-g~-i~~~e~r~~l~~~~~~~g~~~~~~~~~~~~e~~~e~~ 226 (237) ... .|.|.-.... -.+..|++++++++. |+ |+.+.+++.+ |+... ...+....+.+. T Consensus 333 ~~~~~~p~~~~~~~e~e-------dl~~~a~~~~~l~~~~G~~i~~~~i~e~~-------Gip~~---~~~~~~~~~~~~ 395 (488) T protein:vir:99 333 FPGAQPPRVYRVIEEPE-------DITAKAERDEKVFRMSGFRPTRGYVQETY-------GVEVE---STQAEATAPTPS 395 (488) T ss_pred cCCcCCceeEecCCCcc-------cHHHHHHHHHHHHhhcCCCCCHHHHHHHc-------CCCCc---ccccccccCCCc Confidence 111 1223222222 224567888899986 75 7888788765 22111 111111111111 Q ss_pred CCCCCCc--CcCC Q lcl|NC_019725. 227 PGLGEKL--EDEN 237 (237) Q Consensus 227 ~~~~~~~--~~e~ 237 (237) ....+.. .+.. T Consensus 396 ~~~~~~~~~~~~~ 408 (488) T protein:vir:99 396 TEFAEGDQPSDPA 408 (488) T ss_pred ccCCCCCCCCCch Confidence 1111111 1111 No 195 >protein:vir:2013 Length: 344 # NCBI annotation: gpQ # Family: family:all:196 # MgeID: mge:315 # MgeName: P2 # Cross-refs: genbank:acc:NP_046757;genbank:gi:9630328;genbank:GeneID:1261529 Probab=90.79 E-value=0.019 Score=30.02 Aligned_cols=154 Identities=8% Similarity=0.064 Sum_probs=75.8 Q ss_pred CchhHH---------HHHHHHHHHHHHHHHHHHHh-cc-ceeechhHHHhhcCCchHHHHHHHHHHHHHhcCchheeeee Q lcl|NC_019725. 1 MNKSLI---------DAICDYDYCESLATQILRRK-QQ-AVWKVKGLAEMCDDDDAQYAARLRLAQVDDNSGVGRAIGID 69 (237) Q Consensus 1 llq~~~---------d~v~~~~~~~~~~~~Ll~~~-~~-~v~k~~~l~~~~~~~~~e~~~~~r~~~~~~~r~~~~~~~iD 69 (237) ..+.+| ..+..-..+........... +. .++++.+ ++ + +.+....++++++-. ...++...++|. T Consensus 168 ~~~~~yGls~~~~a~~si~l~~~a~~~~~~~f~NGa~p~~Il~~~d-~~-l-~~e~~~~ik~~~~~~-~g~~n~r~l~l~ 243 (344) T protein:vir:20 168 INQELYGLPEYLSALNSAWLNESATLFRRKYYENGAHAGYIMYVTD-AV-Q-DRNDIEMLRENMVKS-KGRNNFKNLFLY 243 (344) T ss_pred CCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEecC-cC-C-CHHHHHHHHHHHHHh-cCCCCccceEEe Confidence 122222 22221111111111211111 11 2333322 11 1 222344566666542 234555556664 Q ss_pred cC---CcceeeeecCcCCHHH----HHHHHHHHHhhhhcCceeeeeccCcc---cccccchhHHHHHHHHHHHHHHHhhh Q lcl|NC_019725. 70 AE---TEEYDVLNSDISGVPE----FLSSKMDRIVSLSGIHEIIIKNKNVG---GVSASQNTALETFYKLVDRKREEDYR 139 (237) Q Consensus 70 ~~---~e~~~~~~~~lsGl~d----l~~~~~~~iaa~s~iP~t~L~G~sp~---GlnatGe~D~~nyyd~I~~~Qe~~l~ 139 (237) .. .+.++....+.+..++ +-....+.||++-|||-.. +|..|. |++ +-+.-.+.|+ ++.|. T Consensus 244 ~p~g~~~gi~~~pis~~~~d~qf~e~k~~s~~eIa~af~VPp~l-lGi~~~~t~~~~-n~e~~~~~f~-------~~~l~ 314 (344) T protein:vir:20 244 APQGKADGIKIIPLSEVATKDDFFNIKKASAADLLDAHRIPFQL-MGGKPENVGSLG-DIEKVAKVFV-------RNELI 314 (344) T ss_pred cCCCCccceeEEEcCCChhHHHHHHHHHhhHHHHHHHhCCCHHH-hccCCCCCCccc-cHHHHHHHHH-------HHHHH Confidence 32 2456666666666555 4556678899999999975 476654 333 2233444444 34466 Q ss_pred HHHHHHHHHhh--cCCCceeEeCCCCCCCH Q lcl|NC_019725. 140 PLLEFLLPFIV--EEEEWSIEFEPLSVPSK 167 (237) Q Consensus 140 p~l~~l~~~i~--~s~~~~~~f~pL~~~se 167 (237) |.++++-++.- ..+.|.|+++.|..-+| T Consensus 315 P~~~~~e~in~~lg~~~i~F~~~~l~~~d~ 344 (344) T protein:vir:20 315 PLQDRIREINGWLGQEVIRFKNYSLDTDND 344 (344) T ss_pred HHHHHHHHHHHhcCCcccccCccccccCCC Confidence 76665554332 23668888888877777 No 196 >protein:vir:78191 Length: 351 # NCBI annotation: gp5, phage portal protein, pbsx family # Family: family:all:196 # MgeID: mge:1848 # MgeName: phiE12-2 # Cross-refs: genbank:acc:YP_001111155;genbank:gi:134288732;genbank:GeneID:4960651 Probab=89.26 E-value=0.027 Score=29.15 Aligned_cols=158 Identities=10% Similarity=0.084 Sum_probs=73.3 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHh-cc-ceeechhHHHhhcCCchHHHHHHHHHHHHHhcCchhee-eeecC--Ccce Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRRK-QQ-AVWKVKGLAEMCDDDDAQYAARLRLAQVDDNSGVGRAI-GIDAE--TEEY 75 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~~-~~-~v~k~~~l~~~~~~~~~e~~~~~r~~~~~~~r~~~~~~-~iD~~--~e~~ 75 (237) -+......+.--..+........... +. .++++.+ . .+ +.+..+.+++.++-. ..-+|.+-+ ++... .+.+ T Consensus 182 ~~~~a~~si~l~~~a~~~~~~~f~NGa~pggIl~~~~-~-~l-s~e~~~~lr~~~~~~-~G~~N~~~~~v~~~~g~~~g~ 257 (351) T protein:vir:78 182 EYLSSLHSAWLNESSTLFRRKYYENGSHAGFILYMTD-A-AQ-KQDDVDNMRDALKNA-KGPGNFRNVFMYAPGGKKDGI 257 (351) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEecC-C-CC-CHHHHHHHHHHHHHh-cCcccccceeeecCCCCccce Confidence 11222222222222222222222111 11 2233322 1 11 233455677766543 233444434 44321 2456 Q ss_pred eeeecCcCCHHH----HHHHHHHHHhhhhcCceeeeeccCccc---ccccchhHHHHHHHHHHHHHHHhhhHHHHHHHHH Q lcl|NC_019725. 76 DVLNSDISGVPE----FLSSKMDRIVSLSGIHEIIIKNKNVGG---VSASQNTALETFYKLVDRKREEDYRPLLEFLLPF 148 (237) Q Consensus 76 ~~~~~~lsGl~d----l~~~~~~~iaa~s~iP~t~L~G~sp~G---lnatGe~D~~nyyd~I~~~Qe~~l~p~l~~l~~~ 148 (237) +....+.+..++ +-....+.||++-|||-.. +|..|.+ ++ +-+.-.+.|| +..|.|.++++-++ T Consensus 258 k~~pls~~~~d~qf~e~k~~~~~eIa~a~~VPp~l-lGi~~~~t~~~s-n~e~~~~~f~-------~~~l~P~~~~iee~ 328 (351) T protein:vir:78 258 QLIPVSEVAAKDEFFNIKNVTRDDLLAAHRVPPQL-LGIVPSNSGGFG-TPDTAARVFG-------RNEIRPLQARFAEL 328 (351) T ss_pred eEEEcCCChhHHHHHHHHHHhHHHHHHHhCCCHHH-hcccCCCCCCcc-cHHHHHHHHH-------HHHHHHHHHHHHHH Confidence 666666666554 3455566799999999754 4876643 32 2233333343 34566666666554 Q ss_pred hhcCCCceeEeCCCCCCCHHHHH Q lcl|NC_019725. 149 IVEEEEWSIEFEPLSVPSKKEES 171 (237) Q Consensus 149 i~~s~~~~~~f~pL~~~seke~A 171 (237) .-+-..-.|+|++-.-+...++| T Consensus 329 n~~l~~~~~~F~~~~Llr~d~ka 351 (351) T protein:vir:78 329 NDWLGDEVVRFDDYEIPPAPVAA 351 (351) T ss_pred HhhcCccceecChhhhccccccC Confidence 32212223777775555555555 No 197 >protein:vir:100328 Length: 346 # NCBI annotation: capsid portal protein Q # Family: family:all:196 # MgeID: mge:1484 # MgeName: phi-MhaA1-PHL101 # Cross-refs: genbank:acc:YP_655469;genbank:gi:109289937;genbank:GeneID:4157371 Probab=89.19 E-value=0.028 Score=29.11 Aligned_cols=156 Identities=12% Similarity=0.114 Sum_probs=74.9 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHH-hccce-eechhHHHhhcCCchHHHHHHHHHHHHHhcCchhee-eeecC--Ccce Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRR-KQQAV-WKVKGLAEMCDDDDAQYAARLRLAQVDDNSGVGRAI-GIDAE--TEEY 75 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~-~~~~v-~k~~~l~~~~~~~~~e~~~~~r~~~~~~~r~~~~~~-~iD~~--~e~~ 75 (237) -+......+.-...+.......... ++... +++.+ + .+ +++..+.+++.++-. ...+|.+.+ ++... .+.+ T Consensus 179 ~~~~a~~si~l~~~a~~~~~~~~~NG~~~~~il~~~d-~-~l-~~e~~~~i~~~~~~~-~g~~n~~~~~vl~~~~~~~gi 254 (346) T protein:vir:10 179 QYLSALQSAWLNESATLFRRKYFLNGAHAGFVFYMSD-A-SQ-KQEDVENIRQQLKQS-KGVGNFKNLFVHAPNGKKDGI 254 (346) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCC-C-CC-CHHHHHHHHHHHHHh-cCccccCceeEecCCCCccce Confidence 1222223333333333333333322 22222 33322 1 11 233344566666543 233444444 44332 2345 Q ss_pred eeeecCcCCHHH----HHHHHHHHHhhhhcCceeeeeccCccc---ccccchhHHHHHHHHHHHHHHHhhhHHHHHHHHH Q lcl|NC_019725. 76 DVLNSDISGVPE----FLSSKMDRIVSLSGIHEIIIKNKNVGG---VSASQNTALETFYKLVDRKREEDYRPLLEFLLPF 148 (237) Q Consensus 76 ~~~~~~lsGl~d----l~~~~~~~iaa~s~iP~t~L~G~sp~G---lnatGe~D~~nyyd~I~~~Qe~~l~p~l~~l~~~ 148 (237) +....+.+..++ +-....+.||++-|||-. |+|..|++ ++ +-+.-.+.|| +..|.|.++++-++ T Consensus 255 ~~~pis~~~~d~qf~e~k~~~~~~I~~af~VPp~-llG~~~~~~~~~s-~~e~~~~~f~-------~~~l~P~~~~iee~ 325 (346) T protein:vir:10 255 QIIPIADVSAKDEFFNIKNVSRDDVLAAHRVPPQ-LMGIIPNNTGGFG-NVADAAEVFF-------ITEIEPLQERLKEF 325 (346) T ss_pred eEEecCCChhHHHHHHHHHHhHHHHHHHhCCCHH-HhcccCCCCCCcc-cHHHHHHHHH-------HHHHHHHHHHHHHH Confidence 555555555554 344557889999999997 45876653 43 2233444554 35577776666543 Q ss_pred hhcCCCceeEeCCCCCCCHHH Q lcl|NC_019725. 149 IVEEEEWSIEFEPLSVPSKKE 169 (237) Q Consensus 149 i~~s~~~~~~f~pL~~~seke 169 (237) .-+-..=.|+|+|=.-+..+| T Consensus 326 n~~L~~e~i~F~~~~ll~~~~ 346 (346) T protein:vir:10 326 NQWLGQEVIKFKPSKLLQRTQ 346 (346) T ss_pred HhhcccceeeechhhhcccCC Confidence 321111136677765555555 No 198 >protein:vir:6058 Length: 344 # NCBI annotation: gpQ # Family: family:all:196 # MgeID: mge:126 # MgeName: WPhi # Cross-refs: genbank:acc:NP_878199;genbank:gi:33438898;genbank:GeneID:1457733 Probab=88.12 E-value=0.034 Score=28.61 Aligned_cols=154 Identities=8% Similarity=0.053 Sum_probs=76.4 Q ss_pred Cc--hhHHHHHHHHHHHHHHHHHHHHHhcc--ceeechhHHHhhcCCchHHHHHHHHHHHHHhcCchheeeeec---CCc Q lcl|NC_019725. 1 MN--KSLIDAICDYDYCESLATQILRRKQQ--AVWKVKGLAEMCDDDDAQYAARLRLAQVDDNSGVGRAIGIDA---ETE 73 (237) Q Consensus 1 ll--q~~~d~v~~~~~~~~~~~~Ll~~~~~--~v~k~~~l~~~~~~~~~e~~~~~r~~~~~~~r~~~~~~~iD~---~~e 73 (237) |. +.....+.--..+............. .++++.+ ++ + +.+..+.++++++-. ...++.+.+++.. +.+ T Consensus 175 lsp~~~a~~si~l~~~a~~~~~~~f~NG~~pg~il~~~~-~~-l-s~e~~~~ik~~~~~~-~g~~~~r~~~l~~p~g~~~ 250 (344) T protein:vir:60 175 LPEYLSALNSAWLNESATLFRRKYYENGAHAGYIMYVTD-AV-Q-DRNDIEMLRENMVKS-KGRNNFKNLFLYAPQGKAD 250 (344) T ss_pred ccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEecC-cC-C-CHHHHHHHHHHHHHh-cCCCCCcceEEecCCCCcc Confidence 21 22222222222222222222222111 3344332 11 2 222344566666543 2345555566642 224 Q ss_pred ceeeeecCcCCHHH----HHHHHHHHHhhhhcCceeeeeccCccc---ccccchhHHHHHHHHHHHHHHHhhhHHHHHHH Q lcl|NC_019725. 74 EYDVLNSDISGVPE----FLSSKMDRIVSLSGIHEIIIKNKNVGG---VSASQNTALETFYKLVDRKREEDYRPLLEFLL 146 (237) Q Consensus 74 ~~~~~~~~lsGl~d----l~~~~~~~iaa~s~iP~t~L~G~sp~G---lnatGe~D~~nyyd~I~~~Qe~~l~p~l~~l~ 146 (237) .++....+.+..++ +-....+.||++-+||-. |+|..|.+ ++ +-+.-.+.|+. ..|.|.++++- T Consensus 251 g~~~~pis~~~~d~qf~e~k~~~~~eIa~af~VPp~-llGi~~~~t~~~~-n~e~~~~~f~~-------~~L~Pl~~~~e 321 (344) T protein:vir:60 251 GIKIIPLSEVATKDDFFNIKKASAADLLDAHRIPFQ-LMGGKPENVGSLG-DIEKVAKVFVR-------NELIPLQDRIR 321 (344) T ss_pred ceeEEEcCCChhHHHHHHHHHhhHHHHHHHhCCCHH-HhcccCCCCCccc-cHHHHHHHHHH-------HHHHHHHHHHH Confidence 56666666666554 445677889999999986 66766543 33 22334444443 45777666665 Q ss_pred HHhh--cCCCceeEeCCCCCCCH Q lcl|NC_019725. 147 PFIV--EEEEWSIEFEPLSVPSK 167 (237) Q Consensus 147 ~~i~--~s~~~~~~f~pL~~~se 167 (237) ++.- -.+.|.|.+..|..-+. T Consensus 322 ~ln~~lg~~~i~F~~~~l~~~d~ 344 (344) T protein:vir:60 322 EINGWLGQEVIRFKNYSLDTDNG 344 (344) T ss_pred HHHHhcCCcccccCccccCCCCC Confidence 4432 23557777777766666 No 199 >protein:vir:79207 Length: 351 # NCBI annotation: gp5, phage portal protein, pbsx family # Family: family:all:196 # MgeID: mge:1866 # MgeName: phiE202 # Cross-refs: genbank:acc:YP_001111036;genbank:gi:134288763;genbank:GeneID:4960726 Probab=86.45 E-value=0.046 Score=27.94 Aligned_cols=157 Identities=11% Similarity=0.119 Sum_probs=73.0 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHh-cc-ceeechhHHHhhcCCchHHHHHHHHHHHHHhcCchheeee-ec--CCcce Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRRK-QQ-AVWKVKGLAEMCDDDDAQYAARLRLAQVDDNSGVGRAIGI-DA--ETEEY 75 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~~-~~-~v~k~~~l~~~~~~~~~e~~~~~r~~~~~~~r~~~~~~~i-D~--~~e~~ 75 (237) -++.....+..-..+......+.... +. -++++.+ + .+ +.+..+.+++.++-. ...+|.+-+++ .. ..+.+ T Consensus 182 ~~~~a~~si~l~~~a~~~~~~~f~NGa~pg~il~~~~-~-~l-s~e~~~~lk~~~~~~-~G~~N~~~~~v~~~~g~~~gi 257 (351) T protein:vir:79 182 EYLSSLHSAWLNESSTLFRRKYYENGSHAGFILYMTD-A-AQ-KQDDVDNMRDALKNA-KGPGNFRNVFMYAPGGKKDGI 257 (351) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEecC-C-CC-CHHHHHHHHHHHHHh-cCccccCceeEecCCCCccce Confidence 11122222222222222222222211 12 2233332 1 11 233455677777643 33445444444 22 12456 Q ss_pred eeeecCcCCHHH----HHHHHHHHHhhhhcCceeeeeccCccc---ccccchhHHHHHHHHHHHHHHHhhhHHHHHHHHH Q lcl|NC_019725. 76 DVLNSDISGVPE----FLSSKMDRIVSLSGIHEIIIKNKNVGG---VSASQNTALETFYKLVDRKREEDYRPLLEFLLPF 148 (237) Q Consensus 76 ~~~~~~lsGl~d----l~~~~~~~iaa~s~iP~t~L~G~sp~G---lnatGe~D~~nyyd~I~~~Qe~~l~p~l~~l~~~ 148 (237) +....+.+..++ +-....+.||++-|+|-.. +|..|.+ ++ +-+.-.+.||. ..|.|.++++-++ T Consensus 258 ~~~pl~~~~~d~ef~e~k~~s~~eI~~a~~VPp~l-lGi~~~~t~~~~-n~e~~~~~f~~-------~~l~Pl~~~ie~l 328 (351) T protein:vir:79 258 QLIPVSEVAAKDEFFNIKNVTRDDLLAAHRVPPQL-LGIVPSNSGGFG-TPDTAARVFGR-------NEIRPLQARFAEL 328 (351) T ss_pred EEEEcCCChhHHHHHHHHHHhHHHHHHHhCCCHHH-hcccCCCCCCcc-cHHHHHHHHHH-------HHHHHHHHHHHHH Confidence 676666666554 4445677799999999555 4876643 32 23444455553 3356655555443 Q ss_pred hhc-CCCceeEeCCCCCCCHHHHH Q lcl|NC_019725. 149 IVE-EEEWSIEFEPLSVPSKKEES 171 (237) Q Consensus 149 i~~-s~~~~~~f~pL~~~seke~A 171 (237) --. ..+ -++|++---+..-.+| T Consensus 329 n~~lg~~-~~~F~~~~llr~d~~a 351 (351) T protein:vir:79 329 NDWLGDE-VVTFDDYEIPPAPVAA 351 (351) T ss_pred HhhcCcc-eeeeChhhhccccccC Confidence 211 122 3677775555444444 No 200 >protein:vir:103971 Length: 376 # NCBI annotation: pbsx family phage portal protein # Family: family:all:196 # MgeID: mge:1665 # MgeName: phi52237 # Cross-refs: genbank:acc:YP_293752;genbank:gi:72537722;genbank:GeneID:3608098 Probab=85.68 E-value=0.051 Score=27.67 Aligned_cols=158 Identities=11% Similarity=0.111 Sum_probs=70.6 Q ss_pred CchhHH---------HHHHHHHHHHHHHHHHHHHh-cc-ceeechhHHHhhcCCchHHHHHHHHHHHHHhcCchhee-ee Q lcl|NC_019725. 1 MNKSLI---------DAICDYDYCESLATQILRRK-QQ-AVWKVKGLAEMCDDDDAQYAARLRLAQVDDNSGVGRAI-GI 68 (237) Q Consensus 1 llq~~~---------d~v~~~~~~~~~~~~Ll~~~-~~-~v~k~~~l~~~~~~~~~e~~~~~r~~~~~~~r~~~~~~-~i 68 (237) ..+.+| ..+.--..+........... +. -++++.| . .+ +.+..+.+++.++-. ...+|.+-+ ++ T Consensus 198 ~~~~~yGls~~~~a~~si~l~~aa~~f~~~~f~NGa~pggIl~~~d-~-~l-~~e~~~~lr~~~~~~-~G~~N~~~~~vl 273 (376) T protein:vir:10 198 INQEVYGLPEYLSSLHSAWLNESSTLFRRKYYENGSHAGFILYMTD-A-AQ-KQDDVDNMRDALKNA-KGPGNFRNVFMY 273 (376) T ss_pred CCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEecC-C-CC-CHHHHHHHHHHHHHh-cCccccCceeEe Confidence 122222 12211111111111211111 11 2233322 1 11 223344566655542 223443444 44 Q ss_pred ecC--CcceeeeecCcCCHHH----HHHHHHHHHhhhhcCceeeeeccCccc---ccccchhHHHHHHHHHHHHHHHhhh Q lcl|NC_019725. 69 DAE--TEEYDVLNSDISGVPE----FLSSKMDRIVSLSGIHEIIIKNKNVGG---VSASQNTALETFYKLVDRKREEDYR 139 (237) Q Consensus 69 D~~--~e~~~~~~~~lsGl~d----l~~~~~~~iaa~s~iP~t~L~G~sp~G---lnatGe~D~~nyyd~I~~~Qe~~l~ 139 (237) ... .+.++....+.+.-++ +-....+.||++-+||- .|+|..+.+ ++ +-+.-.+.|| +..|. T Consensus 274 ~~~g~~~Gi~~~pls~~~~d~qf~e~k~~~~~eIa~af~VPp-~llGi~~~~t~~~s-n~eq~~~~f~-------~~~L~ 344 (376) T protein:vir:10 274 APGGKKDGIQLIPVSEVAAKDEFFNIKNVTRDDLLAAHRVPP-QLLGIVPSNSGGFG-TPDTAARVFG-------RNEIR 344 (376) T ss_pred cCCCCccceEEEEccCCHHHHHHHHHHHHhHHHHHHHhCCCH-HHhcccCCCCCCcc-cHHHHHHHHH-------HHHHH Confidence 332 2446666666665554 44455778999999996 577887753 32 2233344444 34466 Q ss_pred HHHHHHHHHhhcCCCceeEeCCCCCCCHHHHH Q lcl|NC_019725. 140 PLLEFLLPFIVEEEEWSIEFEPLSVPSKKEES 171 (237) Q Consensus 140 p~l~~l~~~i~~s~~~~~~f~pL~~~seke~A 171 (237) |.++++-++.-.-..-.|+|++-.-+.-..+| T Consensus 345 Pl~~~ieeln~~L~~~~~~F~~~~Llr~d~ka 376 (376) T protein:vir:10 345 PLQARFAELNDWLGEEVVRFDDYEIPPAPVAA 376 (376) T ss_pred HHHHHHHHHHhhccccccccChhHhhcccccC Confidence 76666654332212223667765444444444 No 201 >protein:vir:1986 Length: 512 # NCBI annotation: Hypothetical protein # Family: family:all:313 # MgeID: mge:320 # MgeName: Mu # Cross-refs: genbank:acc:NP_050633;genbank:gi:9633520;genbank:GeneID:2636304 Probab=85.60 E-value=0.052 Score=27.64 Aligned_cols=210 Identities=12% Similarity=0.001 Sum_probs=94.7 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHhcc--ceeechhHHHhhcCCchHHHHHHHHHHHHHhcCchheeeeecCCcceeee Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRRKQQ--AVWKVKGLAEMCDDDDAQYAARLRLAQVDDNSGVGRAIGIDAETEEYDVL 78 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~~~~--~v~k~~~l~~~~~~~~~e~~~~~r~~~~~~~r~~~~~~~iD~~~e~~~~~ 78 (237) |+..||-.+.--..+....+..+.++.+ .+.|++. +..+.....-++.+..+.+ .+.++| .++.+++.+ T Consensus 205 Llr~~~w~~~fK~~~~~~w~~f~E~yG~P~~igky~~-------~a~~~ek~~L~~al~~~~~-~a~~ii-P~~~~ie~~ 275 (512) T protein:vir:19 205 LVRTLIWPFIFKNYSVRDFAEFLEIYGLPMRVGKYPT-------GSTNREKATLMQAVMDIGR-RAGGII-PMGMTLDFQ 275 (512) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHcCCCeeEEecCC-------CCCHHHHHHHHHHHHHHhh-CcEEEe-cCCceEEEe Confidence 8887777666666666667777888774 4555541 1112222222344444433 334445 446788888 Q ss_pred ecCcCCH---HHHHHHHHHHHhhh-hcCceeeeeccCcccccccchhHHHHHHHHHHHHHHHhhhHHHHHHHHHhhcC-- Q lcl|NC_019725. 79 NSDISGV---PEFLSSKMDRIVSL-SGIHEIIIKNKNVGGVSASQNTALETFYKLVDRKREEDYRPLLEFLLPFIVEE-- 152 (237) Q Consensus 79 ~~~lsGl---~dl~~~~~~~iaa~-s~iP~t~L~G~sp~GlnatGe~D~~nyyd~I~~~Qe~~l~p~l~~l~~~i~~s-- 152 (237) .+.=+|- ..+++..-..||-+ .|--+|-=-| -+|-+|.|+--.....+.+.+-......-.-+.|+.-++.- T Consensus 276 ea~~~~~~~y~~li~~~d~~Isk~iLGqtlTs~~g--~~Gs~a~~~vh~ev~~di~~aDa~~i~~tln~~li~~l~~~N~ 353 (512) T protein:vir:19 276 SAADGQSDPFMAMIGWAEKAISKAILGGTLTTEAG--DKGARSLGEVHDEVRREIRNADVGQLARSINRDLIYPLLALNS 353 (512) T ss_pred ecCCCCHHHHHHHHHHHHHHHHHHHhhhhhccccc--ccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCC Confidence 8753333 22444455555532 3332222111 22334455555566666666666443333223455555421 Q ss_pred -CC------ceeEeCCCCCCCHHHHHHHHHHHHHHHHHHHhCCC-CCHHHHHHHHHhhccccccCCCCCCChhcc---cc Q lcl|NC_019725. 153 -EE------WSIEFEPLSVPSKKEESEITKNNVESVTKAITEQI-IDLEEARDTLRSIAPEFKLKDGNNINIREP---EE 221 (237) Q Consensus 153 -~~------~~~~f~pL~~~seke~Aei~~~~A~a~~~~~~~g~-i~~~e~r~~l~~~~~~~g~~~~~~~~~~~~---e~ 221 (237) .. =.|.|..- |..++ ++.++++..+. .|+ |+.+.+++.+. + |......... .. T Consensus 354 ~~~~~~~~~p~~~f~~~------e~eDl-~~~a~~~~~l~-~G~~i~~~~i~e~~G-------i-p~~~~~e~~~~~~~~ 417 (512) T protein:vir:19 354 DSTIDINRLPGIVFDTS------EAGDI-TALSDAIPKLA-AGMRIPVSWIQEKLH-------I-PQPVGDEAVFTIQPV 417 (512) T ss_pred CCCCCccccceEEecCC------ChhhH-HHHHHHHHHHh-cCCCCCHHHHHHHhC-------C-CCCCCccccccCCCc Confidence 11 12334332 22222 44566666665 554 78888888763 1 1100000000 01 Q ss_pred CCCCCCCCCCCcCcCC Q lcl|NC_019725. 222 TTEPEPGLGEKLEDEN 237 (237) Q Consensus 222 ~~e~~~~~~~~~~~e~ 237 (237) .+...+........+. T Consensus 418 ~~~~~~~~~~~~~~~~ 433 (512) T protein:vir:19 418 VPDNGSQKEAALSAED 433 (512) T ss_pred cccccccccccccccC Confidence 1111111110101101 No 202 >protein:vir:7208 Length: 524 # NCBI annotation: gp20 portal vertex protein of head # Family: family:all:1036 # MgeID: mge:142 # MgeName: T4 # Cross-refs: genbank:acc:NP_049782;genbank:gi:9632594;genbank:GeneID:1258582 Probab=82.51 E-value=0.076 Score=26.71 Aligned_cols=211 Identities=19% Similarity=0.212 Sum_probs=110.2 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHhccceeechhHHHhhcCCchHHHHHHHHHHHHHhcCchheeeeecCC-------- Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRRKQQAVWKVKGLAEMCDDDDAQYAARLRLAQVDDNSGVGRAIGIDAET-------- 72 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~~~~~v~k~~~l~~~~~~~~~e~~~~~r~~~~~~~r~~~~~~~iD~~~-------- 72 (237) =|..+-|++..|--++.- + -+||=++ ..+|-. .+.+.=++ .+++.+|+ =++.|+.. T Consensus 263 QLkmlEDAlVIYRitRAP------e--RRvFYID-vGnlPk-~KAeqYl~---~im~k~KN---klvYDa~TGev~ddrk 326 (524) T protein:vir:72 263 QLKLLEDAVVIYRITRAP------D--RRVWYVD-TGNMPA-RKAAEHMQ---HVMNTMKN---RVVYDASTGKIKNQQH 326 (524) T ss_pred hhhHHHhhHHHHhhhccc------c--ceEEEEe-cCCCCc-hhHHHHHH---HHHHhcCc---eeEEeCCCCeeccchh Confidence 455666777666544321 1 1223221 222211 11111111 11111211 12333221 Q ss_pred ------------------cceeeee--cCcCCHHHHHHHHHHHHhhhhcCceeeeeccCccccc--ccc--hhHHHHHHH Q lcl|NC_019725. 73 ------------------EEYDVLN--SDISGVPEFLSSKMDRIVSLSGIHEIIIKNKNVGGVS--ASQ--NTALETFYK 128 (237) Q Consensus 73 ------------------e~~~~~~--~~lsGl~dl~~~~~~~iaa~s~iP~t~L~G~sp~Gln--atG--e~D~~nyyd 128 (237) =+++.+. -+|+-+.|| ..|+..+=.+.++|++||-+.+++|+| .++ .-|.-.|.. T Consensus 327 ~msMlEDyWLpRReGgrgTEItTLpGgqnlgem~DV-~YF~kkLy~aLnVP~sRl~~d~~~~f~~gr~~EItRDEikF~K 405 (524) T protein:vir:72 327 NMSMTEDYWLQRRDGKAVTEVDTLPGADNTGNMEDI-RWFRQALYMALRVPLSRIPQDQQGGVMFDSGTSITRDELTFAK 405 (524) T ss_pred hhhhHhhhcccccCCCcccceeeccccCCcChHHHH-HHHHHHHHHHhCCchhhcCCCCCccccccccchhhHHHHHHHH Confidence 1222222 245556664 688899999999999999888888886 222 235567888 Q ss_pred HHHHHHHHhhhHHHHHHHHH------hhcC-------CCceeEeCCCCCCCHHHHHHHHHHHHHHHHHHHh--CCCCCHH Q lcl|NC_019725. 129 LVDRKREEDYRPLLEFLLPF------IVEE-------EEWSIEFEPLSVPSKKEESEITKNNVESVTKAIT--EQIIDLE 193 (237) Q Consensus 129 ~I~~~Qe~~l~p~l~~l~~~------i~~s-------~~~~~~f~pL~~~seke~Aei~~~~A~a~~~~~~--~g~i~~~ 193 (237) .|.++|.. +.+.+..+++. ++.. ++|.|+|.-=...+|-..+||...+..+++.+-. .-.++.+ T Consensus 406 FI~rLR~r-Fs~~f~~~Lk~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~~~dpyvGky~s~~ 484 (524) T protein:vir:72 406 FIRELQHK-FEEVFLDPLKTNLLLKGIITEDEWNDEINNIKIEFHRDSYFAELKEAEILERRINMLTMAEPFIGKYISHR 484 (524) T ss_pred HHHHHHHH-HHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhcccchhH Confidence 88888854 34443333332 2222 4788999999999999999999999998887743 2246777 Q ss_pred HHHHHHHhhccccccCCCCCCChh--ccccCCCCCCCCCCCcCcCC Q lcl|NC_019725. 194 EARDTLRSIAPEFKLKDGNNINIR--EPEETTEPEPGLGEKLEDEN 237 (237) Q Consensus 194 e~r~~l~~~~~~~g~~~~~~~~~~--~~e~~~e~~~~~~~~~~~e~ 237 (237) -+++..-.+.+ ..+..+ .++ .+..++-.-++.+.++ T Consensus 485 yi~k~ILr~tD-------eei~~~~k~I~-~E~k~~~~~~~~~~~~ 522 (524) T protein:vir:72 485 TAMKDILQMTD-------EEIEQEAKQIE-EESKEARFQDPDQEQE 522 (524) T ss_pred HHHHHHhccCH-------HHHHHHHHHHH-HHhhcCCCCCCchhhh Confidence 77775422211 111111 111 1112233333333333 No 203 >protein:vir:103177 Length: 533 # NCBI annotation: gp131 # Family: family:all:1036 # MgeID: mge:1583 # MgeName: Syn9 # Cross-refs: genbank:acc:YP_717798;genbank:gi:113200635;genbank:GeneID:4239186 Probab=82.48 E-value=0.077 Score=26.70 Aligned_cols=216 Identities=13% Similarity=0.176 Sum_probs=104.5 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHhccceeechhHHHhhcCCchHHHHHHHHHHHHHhcCchheeeeecCC-------- Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRRKQQAVWKVKGLAEMCDDDDAQYAARLRLAQVDDNSGVGRAIGIDAET-------- 72 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~~~~~v~k~~~l~~~~~~~~~e~~~~~r~~~~~~~r~~~~~~~iD~~~-------- 72 (237) =|..+-|++..|--++.- + -+||=++ ..+|-. .+.+.=++ .+++.+| +=++.|+.+ T Consensus 243 QLkm~EDAlVIYRitRAP------e--RRvFYID-VGnLPk-~KAeqYlr---~iM~k~K---NklVYDa~TGev~ddrk 306 (533) T protein:vir:10 243 QLRMIEDSLVIYRLSRAP------E--RRIFYID-VGNLPK-NKAEQYLR---EVMGRYR---NKLVYDANTGEIKDDKK 306 (533) T ss_pred hhHHHHhhHHHHhhhccc------c--ceEEEEe-cCCCCc-hhHHHHHH---HHHHhcc---ceEEEeccCceecccch Confidence 455566666665544321 1 1233332 222211 11111111 1122222 113333322 Q ss_pred ------------------cceeeee--cCcCCHHHHHHHHHHHHhhhhcCceeeeeccCcccccccchh-----HHHHHH Q lcl|NC_019725. 73 ------------------EEYDVLN--SDISGVPEFLSSKMDRIVSLSGIHEIIIKNKNVGGVSASQNT-----ALETFY 127 (237) Q Consensus 73 ------------------e~~~~~~--~~lsGl~dl~~~~~~~iaa~s~iP~t~L~G~sp~GlnatGe~-----D~~nyy 127 (237) =+++.+. -+|+-++|| ..|+..+=.+.++|++||= +.+|+|- |.+ |.-.|. T Consensus 307 ~msMlEDyWLPRReGgrgTEItTLpGgqnLgem~DV-~YF~kKLY~aLnVP~SRl~--~e~~f~~-Gr~~EItRDEiKF~ 382 (533) T protein:vir:10 307 FMSMLEDFWLPRREGGRGTEITTLPGGQNLGELEDV-KYFQKKLYKSLNVPGSRLE--TETTFNV-GRAAEITRDEVKFQ 382 (533) T ss_pred hhhhHhhhcccccCCCCccceeeccccCCcChHHHH-HHHHHHHHHHhCCCccccC--CCCcccc-cccchhhHHHHHHH Confidence 1122221 235555554 6888999999999999994 4467663 433 556788 Q ss_pred HHHHHHHHHhhhHHHHHHHHH------hhcC-------CCceeEeCCCCCCCHHHHHHHHHHHHHHHHHHHh--CCCCCH Q lcl|NC_019725. 128 KLVDRKREEDYRPLLEFLLPF------IVEE-------EEWSIEFEPLSVPSKKEESEITKNNVESVTKAIT--EQIIDL 192 (237) Q Consensus 128 d~I~~~Qe~~l~p~l~~l~~~------i~~s-------~~~~~~f~pL~~~seke~Aei~~~~A~a~~~~~~--~g~i~~ 192 (237) ..|.++|.. +.+.+..+++. ++.. ++|.|+|+-=...+|-..++|...+..+++.+-. .-.+|. T Consensus 383 KFI~RLR~r-Fs~lF~~~Lk~qLiLKgiit~eeW~~i~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~l~~~dpyvGky~S~ 461 (533) T protein:vir:10 383 KFVARLRKR-FSELFTDLLKTQLVLKGVISIEEWDQMKEHIQYDYIADNYFAELKEIEIRNERMNQVATMDPFVGKYFSV 461 (533) T ss_pred HHHHHHHHH-HHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEeeeecchHHHHHHHHHHHHHHHHHHHhhhhhccccch Confidence 888888854 44444433332 2222 4788999999999999999999999888876521 223455 Q ss_pred HHHHHHHHhhc--------------cccccCCCCCCChhccccCCCCC--CCCCCCcCcCC Q lcl|NC_019725. 193 EEARDTLRSIA--------------PEFKLKDGNNINIREPEETTEPE--PGLGEKLEDEN 237 (237) Q Consensus 193 ~e~r~~l~~~~--------------~~~g~~~~~~~~~~~~e~~~e~~--~~~~~~~~~e~ 237 (237) +-+++.+-.+. -..|.++....+.+..-.+.+|+ ..++++..++. T Consensus 462 dyi~k~ILr~tDeei~~~~kqI~~E~k~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~ 522 (533) T protein:vir:10 462 EYMRRQVLKQTDVEMKEIDKQIESEMESGIIADPAAEMDPAMAAGDPDAGGAPAEEVAPEG 522 (533) T ss_pred HHHHHHHhccCHHHHHHHHHHHHHHHhCCCCCCCcchhhHHhcCCCCCcCCcccccCCCCC Confidence 55554321111 12344432211111100111111 11112112222 No 204 >protein:vir:6382 Length: 553 # NCBI annotation: portal protein Lambda B # Family: family:all:47 # MgeID: mge:133 # MgeName: BcepNazgul # Cross-refs: genbank:acc:NP_918995;genbank:gi:34610170;genbank:GeneID:2559575 Probab=82.25 E-value=0.079 Score=26.64 Aligned_cols=227 Identities=9% Similarity=-0.003 Sum_probs=108.2 Q ss_pred CchhHHHHHHHHH---HHHHHHHHHHHHhccceeechhHHH----hhcCCchHHH----HHHHHHHH-------HHhcCc Q lcl|NC_019725. 1 MNKSLIDAICDYD---YCESLATQILRRKQQAVWKVKGLAE----MCDDDDAQYA----ARLRLAQV-------DDNSGV 62 (237) Q Consensus 1 llq~~~d~v~~~~---~~~~~~~~Ll~~~~~~v~k~~~l~~----~~~~~~~e~~----~~~r~~~~-------~~~r~~ 62 (237) .|-.++..+.+++ .+.- ++..+...-..++|.+.-.+ .+..+..... ........ ....-. T Consensus 259 ~lapvl~~l~~l~~y~daeL-~~a~i~A~~a~fi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~ 337 (553) T protein:vir:63 259 DIVSGLKDMRMAKRFKEMSL-QNAVINASYAAAIESELPPEFIHSQMSGGSPNADMVGIFGKYMDALKAYVGGANNIQID 337 (553) T ss_pred hHHHHHHHHHHHhHHHHHHH-HHHHHhhhheeeeecCCChhhhhhhcccccccccccccccccccccccccccccceeec Confidence 4445555444443 3333 33344444556666542111 1111110000 00000000 000012 Q ss_pred hheeeeecCCcceeeeecC--cCCHHHHHHHHHHHHhhhhcCceeeeeccCc-ccccccchhHHHHHHHHHHHHHHHhhh Q lcl|NC_019725. 63 GRAIGIDAETEEYDVLNSD--ISGVPEFLSSKMDRIVSLSGIHEIIIKNKNV-GGVSASQNTALETFYKLVDRKREEDYR 139 (237) Q Consensus 63 ~~~~~iD~~~e~~~~~~~~--lsGl~dl~~~~~~~iaa~s~iP~t~L~G~sp-~GlnatGe~D~~nyyd~I~~~Qe~~l~ 139 (237) -|++..-..+|++...+.+ -++..+.+..+...||+..|||.-.|.|--- ...+ +.-..+.-|...++..|...+. T Consensus 338 pG~i~~L~pGe~i~~~~p~~p~~~~~~F~~~~lr~iaaglGi~Ye~lt~D~s~~nYS-S~R~~~~e~~r~~~~~q~~~~~ 416 (553) T protein:vir:63 338 GAKIPHLFPGTKLNLKPMGTPGGVGSEFEASLNRHLASAFGMSYEEFTRDFSKANYS-SIQAGIAMTRRFLEGRKKMCAD 416 (553) T ss_pred CceeeecCCCCeeeecCCCCCCCCHHHHHHHHHHHHHhhcCCCHHHHhhhcccccHH-HHHHHHHHHHHHHHHHHHHHHH Confidence 4455555667888877775 5789999999999999999999999999632 2343 4456777888889999986544 Q ss_pred HHH----HHHHHHhhcCCCc-----------------------eeEeCCCCCCCHHHHHHHHHHHHHHHHHHHhCCCCCH Q lcl|NC_019725. 140 PLL----EFLLPFIVEEEEW-----------------------SIEFEPLSVPSKKEESEITKNNVESVTKAITEQIIDL 192 (237) Q Consensus 140 p~l----~~l~~~i~~s~~~-----------------------~~~f~pL~~~seke~Aei~~~~A~a~~~~~~~g~i~~ 192 (237) .++ +..++.-+....+ .|..+..-..+. .|.+++....+++|+.|. T Consensus 417 ~~~~pi~~~wl~~a~l~G~i~~p~~~~~~~~~~p~~~~a~~~~~w~~p~~~~iDP-------~Ke~~A~~~~i~~G~~t~ 489 (553) T protein:vir:63 417 RLATEFFTLWLEEAIAAGEVPMPPGQTRDLFYQPLMKEALSKCEWIGASQGQIDQ-------LKETQAAVMRIDAGLSTY 489 (553) T ss_pred HHHHHHHHHHHHHHHHcCCccCCCcccchhhcchhhhhhhhceeeecCCccccCh-------HHHHHHHHHHHHcCCCCH Confidence 344 3333332222111 122222222333 356667777777777666 Q ss_pred HHHHHH-----------HH---hhccccccCCCCCCCh-----hccccCCCCCCCC-CCCcCcC Q lcl|NC_019725. 193 EEARDT-----------LR---SIAPEFKLKDGNNINI-----REPEETTEPEPGL-GEKLEDE 236 (237) Q Consensus 193 ~e~r~~-----------l~---~~~~~~g~~~~~~~~~-----~~~e~~~e~~~~~-~~~~~~e 236 (237) .++..+ +. ....+.|+....+... .+.+..+.+++.. ..+.++| T Consensus 490 ~~~~a~~G~D~~~v~~q~a~e~~~~~~~Gl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e 553 (553) T protein:vir:63 490 EREIARLGGDFRKSFAQRAREDALLKKYGLTFNLSAKRSLGDGRDAATGIAEDPAAAQTSQQGE 553 (553) T ss_pred HHHHHHhCCCHHHHHHHHHHHHHHHHHcCCCCCCCCccccCCCcccCCCCCCCCCCCCcccccC Confidence 655433 21 1222345422111110 0111111122221 2222233 No 205 >protein:vir:79511 Length: 448 # NCBI annotation: portal protein # Family: family:all:2372 # MgeID: mge:1870 # MgeName: P74-26 # Cross-refs: genbank:acc:YP_001468055;genbank:gi:157265497;genbank:GeneID:5600628 Probab=80.52 E-value=0.094 Score=26.21 Aligned_cols=210 Identities=10% Similarity=0.040 Sum_probs=86.4 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHhc--cceeechhHHHhhcCCchHHHHHHHHHHHHHhc-CchheeeeecCCcceee Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRRKQ--QAVWKVKGLAEMCDDDDAQYAARLRLAQVDDNS-GVGRAIGIDAETEEYDV 77 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~~~--~~v~k~~~l~~~~~~~~~e~~~~~r~~~~~~~r-~~~~~~~iD~~~e~~~~ 77 (237) |+..||-...--..+...-+..+.++. +.|.|.+.-+ ...+.....=.+++...+ +....++| .++.+++. T Consensus 214 Llr~~~w~~~fK~~~~~~w~~f~E~yG~P~~vgky~~ga-----~~~~~~~~~l~~av~~i~~g~~a~~ii-P~~~~ie~ 287 (448) T protein:vir:79 214 ALRAAVPHWLAKRALILLINHGLERFMIGVPTLTIPKSV-----RQGTKQWEAAKEIVKNFVQKPRHGIIL-PDDWKFDT 287 (448) T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHcCCceEEEecCCCC-----CcCHHHHHHHHHHHHHHhcCCceEEEe-cCCceEEE Confidence 777777765555555556667788887 5566664211 111111111122333333 22333444 45678888 Q ss_pred eecCcCC--HHHHHHHHHHHHhh-hhcCceeeeeccCcccccccchhHH-HHHHHHHHHHHHHhhhHHHH-HHHHHhhc- Q lcl|NC_019725. 78 LNSDISG--VPEFLSSKMDRIVS-LSGIHEIIIKNKNVGGVSASQNTAL-ETFYKLVDRKREEDYRPLLE-FLLPFIVE- 151 (237) Q Consensus 78 ~~~~lsG--l~dl~~~~~~~iaa-~s~iP~t~L~G~sp~GlnatGe~D~-~nyyd~I~~~Qe~~l~p~l~-~l~~~i~~- 151 (237) +++.-++ ...+++..-..||- ..|--+|- .+-+|-++.+.++. ....+.+.+-.+. +...+. .|+.-++. T Consensus 288 ~ea~~~~~~~~~~i~~~d~~Isk~iLGqtlTs---~~~~g~~~~~~~~~~~v~~~~~~aDa~~-i~~tln~~li~~l~~l 363 (448) T protein:vir:79 288 VDLKSAMPDAIPYLTYHDAGIARALGIDFNTV---QLNMGVQAINIGEFVSLTQQTIISLQRE-FASAVNLYLIPKLVLP 363 (448) T ss_pred EecCCCcccHHHHHHHHHHHHHHHHhhhhhcc---ccccchhhhhhhhHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHh Confidence 8776332 22344444455552 22222221 12122222222221 2233333333322 223332 24433321 Q ss_pred ---C-CCc-eeEeCCCCCCCHHHHHHHHHHHHHHHHHHHhCCCCCHHHHHHHHHhhccccccCCCCCCChhccccCCCCC Q lcl|NC_019725. 152 ---E-EEW-SIEFEPLSVPSKKEESEITKNNVESVTKAITEQIIDLEEARDTLRSIAPEFKLKDGNNINIREPEETTEPE 226 (237) Q Consensus 152 ---s-~~~-~~~f~pL~~~seke~Aei~~~~A~a~~~~~~~g~i~~~e~r~~l~~~~~~~g~~~~~~~~~~~~e~~~e~~ 226 (237) . ... .|.|. ..|..|+ ++.|+++.++++.+-+..+-+++.+ |+ | .....+..+++... T Consensus 364 Nfg~~~~~P~~~f~------~~e~~Dl-~~~a~~~~~l~~~~~~~~~~~~~~~-------~~-p--~~~~~~~~~a~~~~ 426 (448) T protein:vir:79 364 NWPSATRFPRLTFE------MEERNDF-SAAANLMGMLINAVKDSEDIPTELK-------AL-I--DALPSKMRRALGVV 426 (448) T ss_pred cCCCcCCCcEEEec------CCChHHH-HHHHHHhhhhhccchhhHHHHHHhh-------cC-C--CCCCCccccccCCC Confidence 1 111 34442 2233333 3468888888887644443344332 11 1 11222222232222 Q ss_pred CCCC-CCcCcCC Q lcl|NC_019725. 227 PGLG-EKLEDEN 237 (237) Q Consensus 227 ~~~~-~~~~~e~ 237 (237) +..+ +...+.+ T Consensus 427 ~~~~~~~~~~~~ 438 (448) T protein:vir:79 427 DEVREAVRQPAD 438 (448) T ss_pred CcccccccCCcc Confidence 2222 2223333 No 206 >protein:vir:103458 Length: 524 # NCBI annotation: portal vertex of the head # Family: family:all:1036 # MgeID: mge:1542 # MgeName: RB32 # Cross-refs: genbank:acc:YP_803110;genbank:gi:116326390;genbank:GeneID:4405487 Probab=79.73 E-value=0.1 Score=26.03 Aligned_cols=211 Identities=19% Similarity=0.221 Sum_probs=109.2 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHhccceeechhHHHhhcCCchHHHHHHHHHHHHHhcCchheeeeecCC-------- Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRRKQQAVWKVKGLAEMCDDDDAQYAARLRLAQVDDNSGVGRAIGIDAET-------- 72 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~~~~~v~k~~~l~~~~~~~~~e~~~~~r~~~~~~~r~~~~~~~iD~~~-------- 72 (237) =|..+-|++..|--++.- + -+||=++ ..+|-. .+.+.=++ .+++.+|+ =++.|+.. T Consensus 263 QLkmlEDAlVIYRitRAP------e--RRvFYID-vGnlPk-~KAeqYl~---~im~k~KN---klvYDa~TGev~ddrk 326 (524) T protein:vir:10 263 QLKLLEDAVVIYRITRAP------D--RRVWYVD-TGNMPA-RKAAEHMQ---HVMNTMKN---RVVYDASTGKIKNQQH 326 (524) T ss_pred hhhHHHhhHHHHhhhccc------c--ceEEEEe-cCCCCc-hhHHHHHH---HHHHhcCc---eeEEeCCCCeeccchh Confidence 455666777666544321 1 1223221 222211 11111111 11111211 12333221 Q ss_pred ------------------cceeeee--cCcCCHHHHHHHHHHHHhhhhcCceeeeeccCccccc--ccc--hhHHHHHHH Q lcl|NC_019725. 73 ------------------EEYDVLN--SDISGVPEFLSSKMDRIVSLSGIHEIIIKNKNVGGVS--ASQ--NTALETFYK 128 (237) Q Consensus 73 ------------------e~~~~~~--~~lsGl~dl~~~~~~~iaa~s~iP~t~L~G~sp~Gln--atG--e~D~~nyyd 128 (237) =+++.+. -+|+-+.|| ..|+..+=.+.++|++||-+.+++|+| .++ .-|.-.|.. T Consensus 327 ~msMlEDyWLpRReGgrgTEItTLpGgqnlgem~DV-~YF~kkLy~aLnVP~sRl~~d~~~~f~~gr~~EItRDEikF~K 405 (524) T protein:vir:10 327 NMSMTEDYWLQRRDGKAVTEVDTLPGADNTGNMEDV-RWFRQALYMALRVPLSRIPQDQQGGVMFDSGTSITRDELTFAK 405 (524) T ss_pred hhhhHhhhcccccCCCcccceeeccccCCcChHHHH-HHHHHHHHHHhCCchhhcCCCCCccccccccchhhHHHHHHHH Confidence 1222222 245556664 688899999999999999888888886 222 235567888 Q ss_pred HHHHHHHHhhhHHHHHHHHH------hhcC-------CCceeEeCCCCCCCHHHHHHHHHHHHHHHHHHHh--CCCCCHH Q lcl|NC_019725. 129 LVDRKREEDYRPLLEFLLPF------IVEE-------EEWSIEFEPLSVPSKKEESEITKNNVESVTKAIT--EQIIDLE 193 (237) Q Consensus 129 ~I~~~Qe~~l~p~l~~l~~~------i~~s-------~~~~~~f~pL~~~seke~Aei~~~~A~a~~~~~~--~g~i~~~ 193 (237) .|.++|.. +.+.+..+++. ++.. ++|.|+|.-=...+|-..+||...+..+++.+-. .-.++.+ T Consensus 406 FI~rLR~r-Fs~~f~~~Lk~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~~~dpyvGky~s~~ 484 (524) T protein:vir:10 406 FIRELQHK-FEEVFLDPLKTNLLLKGIITEDEWNDEINNIKIEFHRDSYFTELKEAEILERRINMLTMAEPFIGKYISHR 484 (524) T ss_pred HHHHHHHH-HHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhcccchhH Confidence 88888854 34443333332 2222 4788999999999999999999999998887743 2246777 Q ss_pred HHHHHHHhhccccccCCCCCCCh--hccccCCCCCCCCCCCcCcCC Q lcl|NC_019725. 194 EARDTLRSIAPEFKLKDGNNINI--REPEETTEPEPGLGEKLEDEN 237 (237) Q Consensus 194 e~r~~l~~~~~~~g~~~~~~~~~--~~~e~~~e~~~~~~~~~~~e~ 237 (237) -+++..-.+.+ ..+.. +.+++ +..++-.-++.+.+. T Consensus 485 yi~k~ILr~tD-------eei~~~~k~I~~-E~k~~~~~~~~~~~~ 522 (524) T protein:vir:10 485 TAMKDILQMTD-------EEIEQEAKQIEE-ESKEARFQDPDQEQE 522 (524) T ss_pred HHHHHHhccCH-------HHHHHHHHHHHH-HhhcCCCCCCchhhh Confidence 77775422211 11111 11111 112222222333322 No 207 >protein:vir:267 Length: 348 # NCBI annotation: putative capsid portal protein # Family: family:all:196 # MgeID: mge:7 # MgeName: K139 # Cross-refs: genbank:acc:NP_536647;genbank:gi:17975125;genbank:GeneID:929081 Probab=78.73 E-value=0.11 Score=25.81 Aligned_cols=158 Identities=6% Similarity=0.008 Sum_probs=68.0 Q ss_pred CchhHH---------HHHHHHHHHHHHHHHHHHH-hcc-ceeechhHHHhhcCCchHHHHHHHHHHHHHhcCchh-eeee Q lcl|NC_019725. 1 MNKSLI---------DAICDYDYCESLATQILRR-KQQ-AVWKVKGLAEMCDDDDAQYAARLRLAQVDDNSGVGR-AIGI 68 (237) Q Consensus 1 llq~~~---------d~v~~~~~~~~~~~~Ll~~-~~~-~v~k~~~l~~~~~~~~~e~~~~~r~~~~~~~r~~~~-~~~i 68 (237) ..+.+| ..+.--..+.......... ++. .++++.+ . .+ +.+..+.+++.++-. ...+|.+ ++++ T Consensus 164 ~~~~~~Gls~~~~a~~si~l~~~a~~~~~~~f~NGa~pg~Il~~~~-~-~l-s~e~~~~lk~~~~~~-~G~~n~~~~~vl 239 (348) T protein:vir:26 164 PQQQIYGLPDYLGSIQSSLLNRDATLFRRRYYLNGAHMGFIFYATD-P-NL-SEADEKALKEKIASS-KGIGNFRSMFVN 239 (348) T ss_pred CCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEecC-C-CC-CHHHHHHHHHHHHHh-cCcccccceeEE Confidence 122222 1111111111111111111 111 1222221 0 11 223344566555542 2233433 3444 Q ss_pred --ecCCcceeeeecCcCCHHHHH----HHHHHHHhhhhcCceeeeeccCcc---cccccchhHHHHHHHHHHHHHHHhhh Q lcl|NC_019725. 69 --DAETEEYDVLNSDISGVPEFL----SSKMDRIVSLSGIHEIIIKNKNVG---GVSASQNTALETFYKLVDRKREEDYR 139 (237) Q Consensus 69 --D~~~e~~~~~~~~lsGl~dl~----~~~~~~iaa~s~iP~t~L~G~sp~---GlnatGe~D~~nyyd~I~~~Qe~~l~ 139 (237) .++.+.++....+.+..++-+ ....+.||++-|||-. |.|..|. |++ +-+.-.+.|| +..|. T Consensus 240 ~~~g~~~Gi~~~pis~~~~d~qf~e~k~~t~~dIa~af~VPp~-llGi~~~~~~~~s-n~e~~~~~f~-------~~~l~ 310 (348) T protein:vir:26 240 IPNGKEKGIQLIPVGDIATKDEFERIKNITAQDIFVGHRFPAG-MGGMLPQQGANVP-DPLKVSQVYD-------FYEVI 310 (348) T ss_pred cCCCCccceeEEEccCChhHHHHHHHHHhhHHHHHHHhCCCHH-HccccCCCCCccc-cHHHHHHHHH-------HHHHH Confidence 222245666666666666533 4445569999999975 5676544 333 2234455555 34477 Q ss_pred HHHHHHHHHhh----cCCCc--eeEeCCCCCCCHHHHHHH Q lcl|NC_019725. 140 PLLEFLLPFIV----EEEEW--SIEFEPLSVPSKKEESEI 173 (237) Q Consensus 140 p~l~~l~~~i~----~s~~~--~~~f~pL~~~seke~Aei 173 (237) |.++++-..|- ..+++ .|+|+|...-+ +++-+ T Consensus 311 P~~~~ie~~ln~~l~~~~~~~~~fdl~~~~e~~--~~~a~ 348 (348) T protein:vir:26 311 PVCKRFMDAVNNDPEIPDNLKLKFNLNPGVESA--NGSAV 348 (348) T ss_pred HHHHHHHHHHhhhhCCCCccEEEEecCcccccc--hhhcC Confidence 77776655432 23444 45555543222 22222 No 208 >protein:vir:98816 Length: 446 # NCBI annotation: hypothetical protein # Family: family:all:32558 # MgeID: mge:1530 # MgeName: Ma-LMM01 # Cross-refs: genbank:acc:YP_851097;genbank:gi:117530254;genbank:GeneID:4484480 Probab=77.60 E-value=0.12 Score=25.57 Aligned_cols=206 Identities=11% Similarity=0.054 Sum_probs=98.3 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHhc--cceeechhHHH--hhcCCchHHHHHHHH-HHHHHhcCc--hheeee----e Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRRKQ--QAVWKVKGLAE--MCDDDDAQYAARLRL-AQVDDNSGV--GRAIGI----D 69 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~~~--~~v~k~~~l~~--~~~~~~~e~~~~~r~-~~~~~~r~~--~~~~~i----D 69 (237) |+..||-...=-..+..--+..+.++. +.+.|.+--+. -..+.+....-.... .+++..++. .+.+++ + T Consensus 217 Llr~~~w~~~fK~~~~~~w~~f~E~yG~P~~vGkyp~ga~~~~~~~~~~~~~~~~~~~~L~~av~~~~~da~~ii~~~~~ 296 (446) T protein:vir:98 217 CLTSVLDYSIFKRAFRDMMLIALDRYGTPLIYVIVPPGNTGVVEEAPDGTEITTTIAEQAEDALRRLSTDSGLVLTQLSK 296 (446) T ss_pred hHHHHHHHHHHHHhhHHHHHHHHhHcCCceeEEeecCCCCcccccchhHHHHHHHHHHHHHHHHHhccccceeeeecccC Confidence 666666655444555555555666655 55666642110 011111111111111 123333221 233332 3 Q ss_pred cCCcceeeeecCcCC---HHHHHHHHHHHHhhhhcCceeeeeccCcc--cccccchhHHHHHHHHHHHHHHHhhhHHHHH Q lcl|NC_019725. 70 AETEEYDVLNSDISG---VPEFLSSKMDRIVSLSGIHEIIIKNKNVG--GVSASQNTALETFYKLVDRKREEDYRPLLEF 144 (237) Q Consensus 70 ~~~e~~~~~~~~lsG---l~dl~~~~~~~iaa~s~iP~t~L~G~sp~--GlnatGe~D~~nyyd~I~~~Qe~~l~p~l~~ 144 (237) .++-+++.+...-+| ...+++..-..||-+.--+...| |++.+ |=+|-|+.-...+.+.+.+-....-.-+-.. T Consensus 297 P~g~eie~~ea~~~~~~~~~~~i~~~d~~IskaiLg~~Ltl-~~~~~~~GS~ala~vh~~V~~d~~~aDa~~i~~tln~~ 375 (446) T protein:vir:98 297 EQPVQVGALTTGNNFSDSFERAISLCDNNMLMGMGIPNLLV-QNRETTFGTGRASEIQLELFDGKINSIFDTVIHAFTEQ 375 (446) T ss_pred CCCceEEeeccccCChhhHHHHHHHHHHHHHHHHhcccccc-cccccccchhhhHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 445677777766554 45677777788886655443322 44433 3334466566677777777775544333344 Q ss_pred HHHHhhcC----CCce-eEeCCCCCCCHHHHHHHHHHHHHHHHHHHhCCCCCH---HHHHHHHHhhccccccCCCCCCCh Q lcl|NC_019725. 145 LLPFIVEE----EEWS-IEFEPLSVPSKKEESEITKNNVESVTKAITEQIIDL---EEARDTLRSIAPEFKLKDGNNINI 216 (237) Q Consensus 145 l~~~i~~s----~~~~-~~f~pL~~~seke~Aei~~~~A~a~~~~~~~g~i~~---~e~r~~l~~~~~~~g~~~~~~~~~ 216 (237) |+.-++.- ..-. ..+.|.....-.|..++. +.|++++.+++.|++.+ +.+|+.+ |+.+ . + T Consensus 376 Li~~l~~lNf~~~~~~~~~~~~~~~~~~~e~eDl~-~~a~~~~~L~~~G~~~p~~~~~ire~~-------giP~-~---~ 443 (446) T protein:vir:98 376 VIGNLIRLNFDPALYPLASNTGYITRLPGRATDLA-ALVEAIKQMHDMGFLVDGDKDHIRSIT-------GLPD-A---I 443 (446) T ss_pred HHHHHHHhCCCccccccccccccceeccCChhhHH-HHHHHHHHHHhCCccccccHHHHHHHh-------CcCC-C---C Confidence 55555421 1100 001111112222444444 57999999999998643 2355543 3311 1 1 Q ss_pred hcc Q lcl|NC_019725. 217 REP 219 (237) Q Consensus 217 ~~~ 219 (237) ++. T Consensus 444 ~~~ 446 (446) T protein:vir:98 444 SST 446 (446) T ss_pred CCC Confidence 111 No 209 >protein:vir:96783 Length: 488 # NCBI annotation: putative structural protein # Family: family:all:584 # MgeID: mge:1629 # MgeName: phiHSIC # Cross-refs: genbank:acc:YP_224240;genbank:gi:62362375;genbank:GeneID:3345722 Probab=77.13 E-value=0.13 Score=25.48 Aligned_cols=201 Identities=11% Similarity=0.101 Sum_probs=100.1 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHhccceeechhHHHhhcCCchHHHHHHHHHHHHHhcCchheeeeecCCcceeeeec Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRRKQQAVWKVKGLAEMCDDDDAQYAARLRLAQVDDNSGVGRAIGIDAETEEYDVLNS 80 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~~~~~v~k~~~l~~~~~~~~~e~~~~~r~~~~~~~r~~~~~~~iD~~~e~~~~~~~ 80 (237) |+..++=.+.+|...... -++++.+++.++-+.+ ..+......+ +.. .. ...... +......-++...+. T Consensus 273 LldLA~lnl~Hy~~ssd~-~~il~~~~~p~lv~~~-~~~~~~~~~~--~~~-~g----~~~~~~-~~~~~~~g~~~~~e~ 342 (488) T protein:vir:96 273 LTSLAEISLSIYVMNAYS-NKAMILANEAKWMVDM-GDMNKTMASE--MNP-LG----FTLAGR-MPYYVKNGDVKVIQA 342 (488) T ss_pred hHHHHHHHHHHHhhhhHH-HHHHHhcCCceeeecc-CCCCcccccc--ccc-ce----eeeccc-ccccccCCceeecCC Confidence 899999999999999887 7778888888664421 1111000000 000 00 000011 111112234555555 Q ss_pred CcCCH-HHHHHHHHHHHhhhhcCceeeeeccCcccccccchhHHHHH---HHHHHHHHHHhhhHHHHHHHHHhhc----- Q lcl|NC_019725. 81 DISGV-PEFLSSKMDRIVSLSGIHEIIIKNKNVGGVSASQNTALETF---YKLVDRKREEDYRPLLEFLLPFIVE----- 151 (237) Q Consensus 81 ~lsGl-~dl~~~~~~~iaa~s~iP~t~L~G~sp~GlnatGe~D~~ny---yd~I~~~Qe~~l~p~l~~l~~~i~~----- 151 (237) ..+.+ ...++...+++..+.- +|+-++ ++- |++.-...+ +..++++- ..+.-.+++++++++. T Consensus 343 ~~~~l~~~~l~~l~~qm~~~Ga----~l~~~~-~~~--Ta~~~~~~~~~~~S~L~~~a-~~le~al~~~l~~~A~w~g~~ 414 (488) T protein:vir:96 343 QFSPETENKVEKLFEQAVKVGA----SLFTQQ-SNE--TATGAAIRSGSSTASMATLG-NNVEDTVRNMLRFIMRYFEGT 414 (488) T ss_pred chhHHHHHHHHHHHHHHHHHhH----hhccCC-Ccc--hHHHHHHHHHHhhHHHHHHH-HHHHHHHHHHHHHHHHHcCCC Confidence 55544 4445555555532211 233222 233 443222222 22333332 4467778888888753 Q ss_pred -----CCCceeEeCCCCCCCHHHHHHHHHHHHHHHHHHHhCCCCCHHHHHHHHHhhccccccCCCCCCChhccc-cCCCC Q lcl|NC_019725. 152 -----EEEWSIEFEPLSVPSKKEESEITKNNVESVTKAITEQIIDLEEARDTLRSIAPEFKLKDGNNINIREPE-ETTEP 225 (237) Q Consensus 152 -----s~~~~~~f~pL~~~seke~Aei~~~~A~a~~~~~~~g~i~~~e~r~~l~~~~~~~g~~~~~~~~~~~~e-~~~e~ 225 (237) +.+..|+-|+=+..-.- .....+++..+.++|.|+.++.++.|+.++ +. ..+.+.++++ +-.+. T Consensus 415 ~~~~~~~~~~~~in~dF~~~~l-----d~~~~~al~~~~~~G~Is~~t~~~~L~~~g----vl-~~d~~~e~~~~~ie~~ 484 (488) T protein:vir:96 415 NLYVNPDELVFKLNRDYFDVEV-----NPQMLQVAYAAMMEGNLPQVSWFELLKRAR----VV-RGDMSKEEFDEHIAEL 484 (488) T ss_pred CCCcCccceEEEeccCCCCccC-----CHHHHHHHHHHHhcCCCCHHHHHHHHHhCC----cC-CccCCHHHHHHHHhhc Confidence 12455665543332222 233456777888999999999999998753 22 1244444433 22222 Q ss_pred CCCC Q lcl|NC_019725. 226 EPGL 229 (237) Q Consensus 226 ~~~~ 229 (237) ..+. T Consensus 485 g~~~ 488 (488) T protein:vir:96 485 GFGM 488 (488) T ss_pred CCCC Confidence 2333 No 210 >protein:vir:4073 Length: 279 # NCBI annotation: minor structural protein # Family: family:all:11744 # MgeID: mge:85 # MgeName: c2 # Cross-refs: genbank:acc:NP_043552;genbank:gi:9628686;genbank:GeneID:1261159 Probab=75.93 E-value=0.091 Score=26.28 Aligned_cols=158 Identities=15% Similarity=0.131 Sum_probs=80.0 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHhccceeechhHHHhhcCCchHHHHHHHHHHHHHhcC-chheeeeecCCcceeeee Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRRKQQAVWKVKGLAEMCDDDDAQYAARLRLAQVDDNSG-VGRAIGIDAETEEYDVLN 79 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~~~~~v~k~~~l~~~~~~~~~e~~~~~r~~~~~~~r~-~~~~~~iD~~~e~~~~~~ 79 (237) |...++.++..-. .+.++ --.+||++--+ +.....++.+.|+..+-.+-. .+|...++. +|++.+++ T Consensus 120 M~~la~nai~~KL---D~~~q-----Ik~fIKTd~d~---glee~kekaR~rIk~mlalAk~~nGityid~-~ddItQL~ 187 (279) T protein:vir:40 120 MFGMASNGIGRRL---DSQAQ-----IKIYWKTKVSS---GLKEVWDRIRERLTQQQQLAREFNGVSVIGS-DDDIKQIQ 187 (279) T ss_pred HHHHHHhhhhhhh---cccce-----eeeEEecCcch---hHHHHHHHHHHHHHHHHHHHHhcCCeeeecC-CceeEeec Confidence 3333332222111 11111 11356665111 011223455666665544433 467778887 59999999 Q ss_pred cCcCC-HHHHHHHHHHHHhhhhcCceeeeeccCcccccccchhHHHHHHHHHHHHHHHhhhHHHHHHHHHhhcCCCceeE Q lcl|NC_019725. 80 SDISG-VPEFLSSKMDRIVSLSGIHEIIIKNKNVGGVSASQNTALETFYKLVDRKREEDYRPLLEFLLPFIVEEEEWSIE 158 (237) Q Consensus 80 ~~lsG-l~dl~~~~~~~iaa~s~iP~t~L~G~sp~GlnatGe~D~~nyyd~I~~~Qe~~l~p~l~~l~~~i~~s~~~~~~ 158 (237) -+.|+ +.+=++.+..++...-+||-.+|.|++ .|..+.+||.+ .+-|.|+++.+=|+-++++-+. T Consensus 188 kDYStslk~die~lkS~l~Sq~GinekIL~GsA-------tE~q~iAyy~r-------tVePILkQyek~liY~~E~fv~ 253 (279) T protein:vir:40 188 PDYSGSLQNDANLAIEIALSEYGMPRELLYGQS-------NEVTIIAFAIQ-------KVLPLLKQHDKNIIFNQENFVA 253 (279) T ss_pred cccccccHHHHHHHHHHHHhhcCCchhhccccC-------chhhhhhHHHh-------hHHHHHHHhcccccchhhhhhh Confidence 99885 456778888899999999999999975 36788889874 4567777765533323222111 Q ss_pred eCCCCCCCHHHHHHHHHHHHHHHHHHHhCCCCCHHHHHHHHHhhccccccC Q lcl|NC_019725. 159 FEPLSVPSKKEESEITKNNVESVTKAITEQIIDLEEARDTLRSIAPEFKLK 209 (237) Q Consensus 159 f~pL~~~seke~Aei~~~~A~a~~~~~~~g~i~~~e~r~~l~~~~~~~g~~ 209 (237) | .+-|. ..|+|......+. ++.-|-. T Consensus 254 y---~ttta------------------~gg~~~s~~~~~~----~~~~~~~ 279 (279) T protein:vir:40 254 Y---ISTTA------------------KGGAIESKSSKRD----SEPVGND 279 (279) T ss_pred h---heecc------------------cCccccccccccc----CCCCCCC Confidence 1 11100 0111111100000 0000000 No 211 >protein:vir:3743 Length: 345 # NCBI annotation: orf15 # Family: family:all:196 # MgeID: mge:79 # MgeName: HP1 # Cross-refs: genbank:acc:NP_043484;genbank:gi:9628619;genbank:GeneID:1261113 Probab=74.93 E-value=0.15 Score=25.06 Aligned_cols=151 Identities=8% Similarity=0.067 Sum_probs=69.7 Q ss_pred CchhHH---------HHHHHHHHHHHHHHHHHHH-hccc-eeechhHHHhhcCCchHHHHHHHHHHHHHhcCchhee-ee Q lcl|NC_019725. 1 MNKSLI---------DAICDYDYCESLATQILRR-KQQA-VWKVKGLAEMCDDDDAQYAARLRLAQVDDNSGVGRAI-GI 68 (237) Q Consensus 1 llq~~~---------d~v~~~~~~~~~~~~Ll~~-~~~~-v~k~~~l~~~~~~~~~e~~~~~r~~~~~~~r~~~~~~-~i 68 (237) ..+.+| ..+.--..+.......... ++.. |+++.+ . .+ +.+..+.++++++-.. ..+|.+.+ ++ T Consensus 168 ~~~~~~Gl~~~~~a~~si~l~~~a~~~~~~~f~NGa~~~~Il~~t~-~-~l-~~e~~~~lk~~~~~~~-g~~n~~~~~i~ 243 (345) T protein:vir:37 168 PMQQVYGSPDYVGGIQSALLNSDATVFRRRYFSNGAHMGFILYSTD-P-DL-TEEMEEEIARKISESK-GVGNFRSMFVN 243 (345) T ss_pred CCCCcccchHHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCC-C-CC-CHHHHHHHHHHHHHhc-CccccCceeEe Confidence 112111 1111111111111121111 1111 333322 1 12 2233445666665432 23444434 44 Q ss_pred ec--CCcceeeeecCcCCHHH----HHHHHHHHHhhhhcCceeeeeccCccc---ccccchhHHHHHHHHHHHHHHHhhh Q lcl|NC_019725. 69 DA--ETEEYDVLNSDISGVPE----FLSSKMDRIVSLSGIHEIIIKNKNVGG---VSASQNTALETFYKLVDRKREEDYR 139 (237) Q Consensus 69 D~--~~e~~~~~~~~lsGl~d----l~~~~~~~iaa~s~iP~t~L~G~sp~G---lnatGe~D~~nyyd~I~~~Qe~~l~ 139 (237) .. +.+.++....+.+..++ +-....+.||++-|||-. |.|..|.+ ++ +-+.-.+.|+ +..|. T Consensus 244 ~~~g~~~G~~~~pl~~~~~d~qf~e~k~~~~~dI~~a~~VPp~-liGi~~~~t~~~s-~~e~~~~~f~-------~~~l~ 314 (345) T protein:vir:37 244 IAGGHPDGLKVIPIGDTGTKDEFANIKNISAQDVLTAHRFPAG-LSGIIPTNTGGLG-DPLKYREVYH-------YDEVM 314 (345) T ss_pred cCCCCccceeEEEccCChhHHHHHHHHHHhHHHHHHHhCCCHH-HhccccCCCCCcc-cHHHHHHHHH-------HHHHH Confidence 22 22446666666665554 445567789999999964 45876643 32 2233444454 45577 Q ss_pred HHHHHHHHHhh----cCCCceeEeCC--CCC Q lcl|NC_019725. 140 PLLEFLLPFIV----EEEEWSIEFEP--LSV 164 (237) Q Consensus 140 p~l~~l~~~i~----~s~~~~~~f~p--L~~ 164 (237) |.++++-..+- ...+..|.|+| |.. T Consensus 315 P~~~~ie~~ln~~~e~~~~~~i~F~~~~l~k 345 (345) T protein:vir:37 315 PLQEIIAETINQDPEIKNLLKIKFREQNFAK 345 (345) T ss_pred HHHHHHHHHhhhhhccCCcceEEECchhhcC Confidence 77777666653 23456677775 332 No 212 >protein:vir:98567 Length: 340 # NCBI annotation: gp1 # Family: family:all:196 # MgeID: mge:1533 # MgeName: PSP3 # Cross-refs: genbank:acc:NP_958056;genbank:gi:41057353;genbank:GeneID:2744238 Probab=73.19 E-value=0.17 Score=24.76 Aligned_cols=154 Identities=6% Similarity=0.068 Sum_probs=69.0 Q ss_pred CchhHH---------HHHHHHHHHHHHHHHHHHH-hccc-eeechhHHHhhcCCchHHHHHHHHHHHHHhcCchhe-eee Q lcl|NC_019725. 1 MNKSLI---------DAICDYDYCESLATQILRR-KQQA-VWKVKGLAEMCDDDDAQYAARLRLAQVDDNSGVGRA-IGI 68 (237) Q Consensus 1 llq~~~---------d~v~~~~~~~~~~~~Ll~~-~~~~-v~k~~~l~~~~~~~~~e~~~~~r~~~~~~~r~~~~~-~~i 68 (237) ..+.+| ..+.--..+.......... ++.. ++.+.+ ..+ +.+....+++.++-. ....|.+. +++ T Consensus 165 ~~~~~~Gls~~~~a~~si~l~~aa~~~~~~~f~NGa~pg~il~~~~--~~l-s~e~~~~lk~~~~~~-~G~~n~~~~~vl 240 (340) T protein:vir:98 165 INQEIYGLPEYLSALNSAWLNESATLFRRKYYQNGAHAGYIMYVTD--PAQ-SATDVESLRDAMRNS-KGLGNFKNLFFY 240 (340) T ss_pred CCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEecC--CCC-CHHHHHHHHHHHHHh-cCccccCceeEe Confidence 112211 1111111111111111111 1122 233322 011 223344566666542 33344334 444 Q ss_pred ec--CCcceeeeecCcCCHH----HHHHHHHHHHhhhhcCceeeeeccCcc---cccccchhHHHHHHHHHHHHHHHhhh Q lcl|NC_019725. 69 DA--ETEEYDVLNSDISGVP----EFLSSKMDRIVSLSGIHEIIIKNKNVG---GVSASQNTALETFYKLVDRKREEDYR 139 (237) Q Consensus 69 D~--~~e~~~~~~~~lsGl~----dl~~~~~~~iaa~s~iP~t~L~G~sp~---GlnatGe~D~~nyyd~I~~~Qe~~l~ 139 (237) .. ..+.++....+.+..+ ++-....+.||++-|||-. |.|..|. |++ +-+.-.+.|+ +..|. T Consensus 241 ~~~g~~~g~~~~pls~~~~d~qf~e~k~~~~~eIa~a~~VPp~-llGi~~~~t~~~s-n~e~~~~~f~-------~~~l~ 311 (340) T protein:vir:98 241 SPNGKPDGIKIVPLSEVATKDDFFNIKKASAADLMDAHRVPFQ-LMGGKPENIGSLG-DVEKVAKVFV-------RNELS 311 (340) T ss_pred cCCCCccceEEEEcCCChhHHHHHHHHHhhHHHHHHHhCCCHH-HhcccCCCCCccc-cHHHHHHHHH-------HHHHH Confidence 32 2245666666666544 3566677889999999975 6677654 333 2233444444 34577 Q ss_pred HHHHHHHHHhhc-CCCceeEeCCCCCCCHH Q lcl|NC_019725. 140 PLLEFLLPFIVE-EEEWSIEFEPLSVPSKK 168 (237) Q Consensus 140 p~l~~l~~~i~~-s~~~~~~f~pL~~~sek 168 (237) |.++++-++.-. ..+ -|+|++-.-++.. T Consensus 312 Pl~~~iee~n~~L~~e-~~rF~~~~l~~~d 340 (340) T protein:vir:98 312 PLQDRFREVNDWLGME-VIRFKEYTLDNPE 340 (340) T ss_pred HHHHHHHHHHhccccc-ccccCccccccCC Confidence 776666543211 122 2567665544444 No 213 >protein:vir:80453 Length: 535 # NCBI annotation: BcepGomrgp05 # Family: family:all:584 # MgeID: mge:1882 # MgeName: BcepGomr # Cross-refs: genbank:acc:YP_001210225;genbank:gi:146329917;genbank:GeneID:5123562 Probab=71.79 E-value=0.19 Score=24.52 Aligned_cols=212 Identities=13% Similarity=0.095 Sum_probs=103.3 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHhccceeechhHHHhhcCCchHHHHHHHHHHHHHhcCchheeeeecCCcceeeeec Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRRKQQAVWKVKGLAEMCDDDDAQYAARLRLAQVDDNSGVGRAIGIDAETEEYDVLNS 80 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~~~~~v~k~~~l~~~~~~~~~e~~~~~r~~~~~~~r~~~~~~~iD~~~e~~~~~~~ 80 (237) |+..+.=.|.+|..... --+++|-..+-++-+.|+.+....+..+.. -+ .-+....+.+- ++-++..... T Consensus 290 Ll~LA~lni~Hy~~ssd-~~~il~~~~~P~l~i~G~~~~~~~~~~~~~---~i-----~iG~~~~~~lP-~~~~~~~~e~ 359 (535) T protein:vir:80 290 LLDLCEVNIGHYRNSAD-YEEMAFVAGQPTAFFTGLTKDWVEDVFKDF---KV-----HLGSRAIIPLP-QGATAGILQI 359 (535) T ss_pred hHHHHHHHHHHhhchhH-HHHHHHHhcCceeeeecCchhhhhcCCCCc---ce-----EecCcccccCC-CCCCcceeee Confidence 88888888998888776 566788888888888776543322111100 00 11223333443 3344555555 Q ss_pred CcCCHHH-HHHHHHHHHhhhhcCceeeeeccCcccccccch-hHHHHHHHHHHHHHHHhhhHHHHHHHHHhhcC------ Q lcl|NC_019725. 81 DISGVPE-FLSSKMDRIVSLSGIHEIIIKNKNVGGVSASQN-TALETFYKLVDRKREEDYRPLLEFLLPFIVEE------ 152 (237) Q Consensus 81 ~lsGl~d-l~~~~~~~iaa~s~iP~t~L~G~sp~GlnatGe-~D~~nyyd~I~~~Qe~~l~p~l~~l~~~i~~s------ 152 (237) +-+|+.- .++...+++......+++ ++.++..++.- .|...=+..++++- ..+.-.+.+++++++.= T Consensus 360 ~~~~~a~~~l~~~e~qM~~lGa~ll~----~~~~~~Ta~~a~~~~~~~~S~L~~~a-~~le~al~~aL~~~A~w~G~~~~ 434 (535) T protein:vir:80 360 TPNSVPFEAMTHKESQMIAMGANLLV----KSGGNRTFGEAQQEEASEQSILSACT-KNVSMAFRKALRWANQFQTGIVN 434 (535) T ss_pred ccchhHHHHHHHHHHHHHHHHHHhhc----cCcccccHHHHHHHHHHHhHHHHHHH-HHHHHHHHHHHHHHHHHcCCccC Confidence 4555543 455555555554443333 33444443321 11122223333332 34666777888776521 Q ss_pred -CCceeEeCCCCCCCHHHHHHHHHHHHHHHHHHHhCCCCCHHHHHHHHHhhccccccCCCCCCChhccc-----cCCCCC Q lcl|NC_019725. 153 -EEWSIEFEPLSVPSKKEESEITKNNVESVTKAITEQIIDLEEARDTLRSIAPEFKLKDGNNINIREPE-----ETTEPE 226 (237) Q Consensus 153 -~~~~~~f~pL~~~seke~Aei~~~~A~a~~~~~~~g~i~~~e~r~~l~~~~~~~g~~~~~~~~~~~~e-----~~~e~~ 226 (237) +...|..|+=+.. +.+.....+++..+++.|.|+.+..++.|+..+. .+ .+.+.++.. +..+.. T Consensus 435 ~~~~~i~~n~dF~~-----~~ld~~~~~all~~~~~G~Is~et~~~~L~r~gv----l~-~~~~~eee~~ri~~E~~~~~ 504 (535) T protein:vir:80 435 DETVEYNLNTDFPA-----ARLTPNERAELILEWQQGAITFKEMRAGLRRAGV----AS-EDDAKAETEGKATVEFIAKT 504 (535) T ss_pred CCceEEEecccccc-----ccCCHHHHHHHHHHHhcCCCCHHHHHHHHHhCCC----CC-cccchHHHHHHHHhhhhhcc Confidence 2345554432211 1112223566778888999999999999986432 21 112212110 101111 Q ss_pred CCCCCCcC----------------cCC Q lcl|NC_019725. 227 PGLGEKLE----------------DEN 237 (237) Q Consensus 227 ~~~~~~~~----------------~e~ 237 (237) ...|...+ ..| T Consensus 505 ~~~g~~~d~~~~g~~~~~~~~~~~~~~ 531 (535) T protein:vir:80 505 AAAGKVGDAASGGTNKAKLNNGNGGGN 531 (535) T ss_pred ccCCCCCCCCCCCCCcCcccCCccccc Confidence 11111111 111 No 214 >protein:vir:108049 Length: 524 # NCBI annotation: gp20 portal vertex protein of head # Family: family:all:1036 # MgeID: mge:2002 # MgeName: JS98 # Cross-refs: genbank:acc:YP_001595296;genbank:gi:161622602;genbank:GeneID:5783768 Probab=71.76 E-value=0.19 Score=24.52 Aligned_cols=211 Identities=18% Similarity=0.194 Sum_probs=108.8 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHhccceeechhHHHhhcCCchHHHHHHHHHHHHHhcCchheeeeecCC-------- Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRRKQQAVWKVKGLAEMCDDDDAQYAARLRLAQVDDNSGVGRAIGIDAET-------- 72 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~~~~~v~k~~~l~~~~~~~~~e~~~~~r~~~~~~~r~~~~~~~iD~~~-------- 72 (237) =|..+-|++..|--++.- +- +||=++ ..+|-. .+.+.=++. +++.+|+ =++.|+.+ T Consensus 263 QLkm~EDAlVIYRitRAP------eR--RvFYID-VGnlPk-~KAeqYl~~---im~k~kN---KlvYDa~TGev~ddrk 326 (524) T protein:vir:10 263 QLKLMEDAMVIYRITRAP------DR--RVFYID-TGNMPS-RKAAAQMQH---IMNTMKN---RVVYDASTGKIKNQQH 326 (524) T ss_pred hhHHHHhhHHHHhhhccc------cc--eEEEEe-cCCCCc-hhHHHHHHH---HHHhcCc---eeEEeccCCeeccchh Confidence 455666777666544321 11 222221 222211 111111111 1111111 12222221 Q ss_pred ------------------cceeeee--cCcCCHHHHHHHHHHHHhhhhcCceeeeeccCcccccc--cc--hhHHHHHHH Q lcl|NC_019725. 73 ------------------EEYDVLN--SDISGVPEFLSSKMDRIVSLSGIHEIIIKNKNVGGVSA--SQ--NTALETFYK 128 (237) Q Consensus 73 ------------------e~~~~~~--~~lsGl~dl~~~~~~~iaa~s~iP~t~L~G~sp~Glna--tG--e~D~~nyyd 128 (237) =+++.+. -+|+-++|| ..|+..+=.+.++|++||-..+++|+|- ++ .-|.-.|.. T Consensus 327 ~msMlEDyWLpRReGgrgTEItTLpGgqnlgem~DV-~YF~kkLy~aLnVP~sRl~~e~~~~f~~gr~~EItRDEiKF~K 405 (524) T protein:vir:10 327 NMSMTEDYWLQRRDGKAVTEVDTMPGATGMSDMDDV-LYFRTALYRALRIPESRIPSESNSGVMFDAGTAITRDELKFAK 405 (524) T ss_pred hhhhHhhhcccccCCCCccceeeccccCCcChHHHH-HHHHHHHHHHhCCCchhccCCCCccccccccchhhHHHHHHHH Confidence 1222221 235555564 6888999999999999997677777653 22 124567888 Q ss_pred HHHHHHHHhhhHHHHHHHHH------hhcC-------CCceeEeCCCCCCCHHHHHHHHHHHHHHHHHHHh--CCCCCHH Q lcl|NC_019725. 129 LVDRKREEDYRPLLEFLLPF------IVEE-------EEWSIEFEPLSVPSKKEESEITKNNVESVTKAIT--EQIIDLE 193 (237) Q Consensus 129 ~I~~~Qe~~l~p~l~~l~~~------i~~s-------~~~~~~f~pL~~~seke~Aei~~~~A~a~~~~~~--~g~i~~~ 193 (237) .|.++|.. +.+.+..+++. ++.. ++|.|+|.-=...+|-..+||...+..+++.+-. .-.++.+ T Consensus 406 FI~rLR~r-Fs~lf~~~L~~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~~~dpyvGky~s~~ 484 (524) T protein:vir:10 406 WIRQLQNK-FEEIFLDPLKTNLILKKIITEDEWEREINNIKVTFNRDSYFSEMKDAEIMERRINMLTMAEPFIGKYISHQ 484 (524) T ss_pred HHHHHHHH-HHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhcccchhH Confidence 88888854 44444433332 2222 4788999999999999999999999998887743 2246777 Q ss_pred HHHHHHHhhccccccCCCCCCChh--ccccCCCCCCCCCCCcCcCC Q lcl|NC_019725. 194 EARDTLRSIAPEFKLKDGNNINIR--EPEETTEPEPGLGEKLEDEN 237 (237) Q Consensus 194 e~r~~l~~~~~~~g~~~~~~~~~~--~~e~~~e~~~~~~~~~~~e~ 237 (237) -+++..-.+.+ .++..+ .++ .+..++-.-++.++++ T Consensus 485 yi~k~ILr~tD-------eei~~~~k~I~-~E~k~~~~~~~~~~~~ 522 (524) T protein:vir:10 485 TAMKDFLQMTD-------EEINQEAKQIE-EESKEARFQNPDEEEE 522 (524) T ss_pred HHHHHHhccCH-------HHHHHHHHHHH-HHhhcCCCCCCChhhh Confidence 77775422211 111111 111 1112233333333444 No 215 >protein:vir:104500 Length: 537 # NCBI annotation: gp20 # Family: family:all:1036 # MgeID: mge:1548 # MgeName: P-SSM4 # Cross-refs: genbank:acc:YP_214665;genbank:gi:61806306;genbank:GeneID:3294555 Probab=68.15 E-value=0.24 Score=23.97 Aligned_cols=215 Identities=13% Similarity=0.151 Sum_probs=102.6 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHhccceeechhHHHhhcCCchHHHHHHHHHHHHHhcCchheeeeecCC-------- Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRRKQQAVWKVKGLAEMCDDDDAQYAARLRLAQVDDNSGVGRAIGIDAET-------- 72 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~~~~~v~k~~~l~~~~~~~~~e~~~~~r~~~~~~~r~~~~~~~iD~~~-------- 72 (237) =|..+-|++..|--++.- + -+||=++ ..+|-. .+.+.=++ .+++.+| +=++.|+.+ T Consensus 244 QLkm~EDAlVIYRitRAP------e--RRvFYID-VGnLPk-~KAeqYlr---~iM~k~K---NklVYDa~TGev~ddrk 307 (537) T protein:vir:10 244 QLRMIEDSLVIYRLSRAP------E--RRIFYID-VGNLPK-NKAEQYLR---EVMGRYR---NKLVYDANTGEIKDDKK 307 (537) T ss_pred hhHHHHhhHHHHhhhccc------c--ceEEEEe-cCCCCc-hhHHHHHH---HHHHhcc---ceEEEeccCceecccch Confidence 455666777666544321 1 1233332 222211 11111111 1122222 113333322 Q ss_pred ------------------cceeeee--cCcCCHHHHHHHHHHHHhhhhcCceeeeeccCcccccccchh-----HHHHHH Q lcl|NC_019725. 73 ------------------EEYDVLN--SDISGVPEFLSSKMDRIVSLSGIHEIIIKNKNVGGVSASQNT-----ALETFY 127 (237) Q Consensus 73 ------------------e~~~~~~--~~lsGl~dl~~~~~~~iaa~s~iP~t~L~G~sp~GlnatGe~-----D~~nyy 127 (237) =+++.+. -+|+-++|| ..|+..+=.+.++|++||= +.+|+|- |.+ |.-.|. T Consensus 308 ~msMlEDyWLPRReGgrgTEItTLpGgqnlgem~DV-~YF~kKLy~aLnVP~SRl~--~e~~f~~-Gr~~EItRDEiKF~ 383 (537) T protein:vir:10 308 FMSMLEDFWLPRREGGRGTEISTLPGGQNLGELEDV-KYFQKKLYKALNVPSSRLE--TETTFNI-GRAAEITRDEVKFQ 383 (537) T ss_pred hhhhhhhhcccccCCCcccceeeccccCCcChHHHH-HHHHHHHHHHhCCCccccC--CCCcccc-cccchhhHHHHHHH Confidence 1122221 235555554 6888999999999999994 3467663 433 556788 Q ss_pred HHHHHHHHHhhhHHHHHHHHH------hhcC-------CCceeEeCCCCCCCHHHHHHHHHHHHHHHHHHHh--CCCCCH Q lcl|NC_019725. 128 KLVDRKREEDYRPLLEFLLPF------IVEE-------EEWSIEFEPLSVPSKKEESEITKNNVESVTKAIT--EQIIDL 192 (237) Q Consensus 128 d~I~~~Qe~~l~p~l~~l~~~------i~~s-------~~~~~~f~pL~~~seke~Aei~~~~A~a~~~~~~--~g~i~~ 192 (237) ..|.++|.. +.+.+..+++. ++.. ++|.|+|+-=...+|-..++|...+..+++.+-. .-.++. T Consensus 384 KFI~RLR~r-Fs~lF~~~Lk~qLilKgiit~eeW~~i~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~l~~~dpyvGky~s~ 462 (537) T protein:vir:10 384 KFIARLRKR-FSELFVDLLKTQLILKGICSIEEWEEMKEHIQFDFIADNYFTELKEIEIRNERMNEVAQMDPYVGKYFSA 462 (537) T ss_pred HHHHHHHHH-HHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhhcccch Confidence 888888854 44444433332 2222 4788999999999999999999999888877521 223455 Q ss_pred HHHHHHHHhhc------------c--ccccCCCCCCChhccccC-CCCCCCCCCCcCcC---C Q lcl|NC_019725. 193 EEARDTLRSIA------------P--EFKLKDGNNINIREPEET-TEPEPGLGEKLEDE---N 237 (237) Q Consensus 193 ~e~r~~l~~~~------------~--~~g~~~~~~~~~~~~e~~-~e~~~~~~~~~~~e---~ 237 (237) +-+++..-.+. . ..|.+.... .+++.+.. .+.+|.+.++++++ | T Consensus 463 dyi~k~ILr~tDeeI~~~~k~I~~E~k~~~~~~p~-~~~~~~~~~~~~~~~~~~~~~~~~~~~ 524 (537) T protein:vir:10 463 NYIRTKVLKQTESEIKEIDKEIKQEIADGVIMDPQ-AMQAMEMGIGDEEPVPEGGEEPQTDPN 524 (537) T ss_pred HHHHHHHhccCHHHHHHHHHHHHHHhhCCCCCCcc-cccccccCCCCcccCCCCCCCcccCCc Confidence 55554321111 1 123332110 01111000 01111111111111 1 No 216 >protein:vir:101494 Length: 527 # NCBI annotation: gp9 # Family: family:all:6920 # MgeID: mge:1627 # MgeName: PLot # Cross-refs: genbank:acc:YP_655388;genbank:gi:109522576;genbank:GeneID:4157566 Probab=67.47 E-value=0.25 Score=23.87 Aligned_cols=212 Identities=8% Similarity=0.107 Sum_probs=101.8 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHH-hccceeechhHHHhhcCCchHHHHHHHHHHHHHhcCc-hheeeeecCCcceeee Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRR-KQQAVWKVKGLAEMCDDDDAQYAARLRLAQVDDNSGV-GRAIGIDAETEEYDVL 78 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~-~~~~v~k~~~l~~~~~~~~~e~~~~~r~~~~~~~r~~-~~~~~iD~~~e~~~~~ 78 (237) -|..+..-+...+.+. +-.+++.+ ..+-++.++++.-.-..+.... + .+ -|.+.=-+++-.|..+ T Consensus 270 ~La~ll~l~deLn~~~-Td~s~is~~sG~Pi~~~tg~~~vd~~G~~~~-----~-------~VgPG~iweL~e~ak~~~v 336 (527) T protein:vir:10 270 GLAGLESLIASVNQTM-TDEDLIMVFGGLGFYATDSAPPRDSRGNMVP-----W-------TISPLGMVEHGQNNKIYRV 336 (527) T ss_pred hHhHHHHHHHHHhhhh-hHHHHHHHHhCCceeeecccccccccCCcCc-----c-------ccCCceeEecCCCcceeec Confidence 2222222222222222 22222222 3455566665542211111100 0 01 2233323445667777 Q ss_pred ec--CcCCHHHHHHHHHHHHhhhhcCceeeeeccCcccccccchhHHHHHHHHHHHHHHHh--hhHHHHHHHH-Hh---- Q lcl|NC_019725. 79 NS--DISGVPEFLSSKMDRIVSLSGIHEIIIKNKNVGGVSASQNTALETFYKLVDRKREED--YRPLLEFLLP-FI---- 149 (237) Q Consensus 79 ~~--~lsGl~dl~~~~~~~iaa~s~iP~t~L~G~sp~GlnatGe~D~~nyyd~I~~~Qe~~--l~p~l~~l~~-~i---- 149 (237) ++ .+.++.+.+..++..|+..+++|.+-+=..-+++ +-||-.=.-.+--.+++.|+.. ++-++.+... .+ T Consensus 337 ~~~~~la~~~~h~~~L~~~l~~vA~~PavA~G~vD~s~-~~SG~ALeL~L~PLlar~~rk~L~~~~vqrq~~~~~~~~~L 415 (527) T protein:vir:10 337 NGVASLEPSQTHMTKAEEAMQQTKGIPDIAVGVVDAAV-AESGIALDLKLSAILSSCAEQELELKSVLKQFFYNLVTQWL 415 (527) T ss_pred cchhhhHHHHHHHHHHHHHHHHhhcCCeeeeccccCCc-CcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhHHHHH Confidence 76 5677888999999999999999998765344333 3345333333334455555553 2333332211 11 Q ss_pred ------hcCC-----CceeEeCCCCCCCHHHHHHHHHHHHHHHHHHHhCCCCCHHHHHHHHHhhccccccCCCCCCChhc Q lcl|NC_019725. 150 ------VEEE-----EWSIEFEPLSVPSKKEESEITKNNVESVTKAITEQIIDLEEARDTLRSIAPEFKLKDGNNINIRE 218 (237) Q Consensus 150 ------~~s~-----~~~~~f~pL~~~seke~Aei~~~~A~a~~~~~~~g~i~~~e~r~~l~~~~~~~g~~~~~~~~~~~ 218 (237) ...+ .+.+.|-|....++++. .+....++++|++|.+-+.+.|...+ |+. ....+..+ T Consensus 416 ~aye~v~~~d~~~~~~v~ivf~p~lP~D~~av-------ie~v~tL~~aGi~S~~tAv~~L~~~~---g~e-D~E~E~~~ 484 (527) T protein:vir:10 416 PAYEGVGIDDADKKLTVTITFRDPKPVNSEKR-------FNQLLQLWEAGLIPAKKLTEELSKIM---GFE-LTEEDFKQ 484 (527) T ss_pred HHhhhcccCCCccccceEEEecccCCCCHHHH-------HHHHHHHHHcCchhHHHHHHHHHhcc---CCC-ChHHHHHH Confidence 1111 45799999977666554 45567789999999999999886432 111 11111111 Q ss_pred cc-----------cC-------CCCCCCCCCCc--CcCC Q lcl|NC_019725. 219 PE-----------ET-------TEPEPGLGEKL--EDEN 237 (237) Q Consensus 219 ~e-----------~~-------~e~~~~~~~~~--~~e~ 237 (237) +. ++ .-.+.|.++.. +.-| T Consensus 485 I~~era~~a~a~a~A~~~~~a~~~~~~g~~~~~~d~~~~ 523 (527) T protein:vir:10 485 ATEDKKTQGIAQAEAADPFGAQMAAEQGIPDEEDDQALN 523 (527) T ss_pred HHHHHHHHhHHhhhhcCchhhhhccccCCCCCCcccccC Confidence 10 00 01222222222 2222 No 217 >protein:vir:102239 Length: 527 # NCBI annotation: gp9 # Family: family:all:6920 # MgeID: mge:1648 # MgeName: PBI1 # Cross-refs: genbank:acc:YP_655205;genbank:gi:109522785;genbank:GeneID:4157478 Probab=67.20 E-value=0.26 Score=23.83 Aligned_cols=212 Identities=9% Similarity=0.108 Sum_probs=101.7 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHH-hccceeechhHHHhhcCCchHHHHHHHHHHHHHhcCc-hheeeeecCCcceeee Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRR-KQQAVWKVKGLAEMCDDDDAQYAARLRLAQVDDNSGV-GRAIGIDAETEEYDVL 78 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~-~~~~v~k~~~l~~~~~~~~~e~~~~~r~~~~~~~r~~-~~~~~iD~~~e~~~~~ 78 (237) -|..+..-+...+.+. +-.+++.+ ..+-++.++++.-.-..+.... + .+ -|.+.=-+++-.|..+ T Consensus 270 ~La~ll~l~deLn~~~-Td~s~is~~sG~Pi~~~tg~~~vd~~G~~~~-----~-------~VgPG~iweL~e~ak~~~v 336 (527) T protein:vir:10 270 GLAGLESLIASVNQTM-TDEDLIMVFGGLGFYATDSAPPRDSRGNMVP-----W-------TISPLGMVEHGQNNKIYRV 336 (527) T ss_pred hHhHHHHHHHHHhhhh-hHHHHHHHHhCCceeeecccccccccCCcCc-----c-------ccCCceeEecCCCcceeec Confidence 2222222222222222 22222222 3455566665542211111100 0 01 2233323445667777 Q ss_pred ec--CcCCHHHHHHHHHHHHhhhhcCceeeeeccCcccccccchhHHHHHHHHHHHHHHHh--hhHHHHHHHH-Hh---- Q lcl|NC_019725. 79 NS--DISGVPEFLSSKMDRIVSLSGIHEIIIKNKNVGGVSASQNTALETFYKLVDRKREED--YRPLLEFLLP-FI---- 149 (237) Q Consensus 79 ~~--~lsGl~dl~~~~~~~iaa~s~iP~t~L~G~sp~GlnatGe~D~~nyyd~I~~~Qe~~--l~p~l~~l~~-~i---- 149 (237) ++ .+.++.+.++.++..|+..+++|.+-+=..-+++ +-||-.=.-.+--.+++.|+.. ++-++.+... .+ T Consensus 337 ~~~~~la~~~~h~~~L~~~l~~vA~~PavA~G~vD~s~-~~SG~ALeL~L~PLlar~~rk~L~~~~Vqrq~~~~~~~~~L 415 (527) T protein:vir:10 337 NGVASLEPSQTHMNKAEEAMQQTKGIPDIAVGVVDAAV-AESGIALDLKLSAILSSCAEQELELKSVLKQFFYNLVTQWL 415 (527) T ss_pred cchhhhHHHHHHHHHHHHHHHHhhcCCeeeeccccCCc-CcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhHHHHH Confidence 76 5677888999999999999999998765344333 3345333333334455555553 2333332211 11 Q ss_pred ------hcCC-----CceeEeCCCCCCCHHHHHHHHHHHHHHHHHHHhCCCCCHHHHHHHHHhhccccccCCCCCCChhc Q lcl|NC_019725. 150 ------VEEE-----EWSIEFEPLSVPSKKEESEITKNNVESVTKAITEQIIDLEEARDTLRSIAPEFKLKDGNNINIRE 218 (237) Q Consensus 150 ------~~s~-----~~~~~f~pL~~~seke~Aei~~~~A~a~~~~~~~g~i~~~e~r~~l~~~~~~~g~~~~~~~~~~~ 218 (237) ...+ .+.+.|-|....+.++. .+....++++|++|.+-+.+.|...+ |+. ....+..+ T Consensus 416 ~aye~v~~~d~~~~~~v~ivf~p~lP~D~~av-------ie~v~tL~~aGiiS~etAv~~L~~~~---g~e-D~E~E~~~ 484 (527) T protein:vir:10 416 PAYEGVGIDDADKKLTVTITFRDPKPVNNEKR-------FAQLLELWEAGLIPAKKLTEELSKIM---GFE-LTEEDFRQ 484 (527) T ss_pred HHhhhcccCCCccccceEEEecccCCCCHHHH-------HHHHHHHHHcCchhHHHHHHHHHhcc---CCC-chHHHHHH Confidence 1111 45799999976666554 45567789999999999999886532 111 11111111 Q ss_pred cc-----------cC-------CCCCCCCCCCc--CcCC Q lcl|NC_019725. 219 PE-----------ET-------TEPEPGLGEKL--EDEN 237 (237) Q Consensus 219 ~e-----------~~-------~e~~~~~~~~~--~~e~ 237 (237) +. ++ .-.+.|.++.. +.-| T Consensus 485 I~~era~~a~a~a~a~~~~~a~~~~~~g~~~~~~d~~~~ 523 (527) T protein:vir:10 485 ATEDKKTQGIAQAEAADPFGAQMAAEQGIPDEEDDQALN 523 (527) T ss_pred HHHHHHHHhHHhhhhcCchhhhhccccCCCCCCcccccC Confidence 11 00 01222222222 2222 No 218 >protein:vir:100598 Length: 516 # NCBI annotation: gp20 head portal vertex protein # Family: family:all:1036 # MgeID: mge:1488 # MgeName: 25 # Cross-refs: genbank:acc:YP_656382;genbank:gi:109290133;genbank:GeneID:4156576 Probab=64.48 E-value=0.3 Score=23.46 Aligned_cols=212 Identities=13% Similarity=0.132 Sum_probs=109.8 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHhccceeechhHHHhhcCCchHHHHHHHHHHHHHhcCchheeeeecCC-------- Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRRKQQAVWKVKGLAEMCDDDDAQYAARLRLAQVDDNSGVGRAIGIDAET-------- 72 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~~~~~v~k~~~l~~~~~~~~~e~~~~~r~~~~~~~r~~~~~~~iD~~~-------- 72 (237) =|..+-|++..|--++.- +- +||=++ ..+|-. .+.+.=++ .+++.+|+ =++.|+.. T Consensus 256 QLkm~EDAlVIYRitRAP------eR--RvFYID-VGnLPk-~KAeqYl~---~iM~k~KN---klvYDa~TGev~ddrk 319 (516) T protein:vir:10 256 QLKLLEDALVIYRITRAP------ER--RVFYID-VGNMPN-RKATEYVN---GIMQSLKN---RVVYDSNTGTVKNQKR 319 (516) T ss_pred hhHHHHhhHHHHhhhccc------cc--eEEEEe-cCCCCc-hhHHHHHH---HHHHhcCc---eeEEeCCCCeeccchh Confidence 455666777666544321 11 222221 222211 11111111 11111211 12333222 Q ss_pred ------------------cceeeee--cCcCCHHHHHHHHHHHHhhhhcCceeeeeccCcccc--cccch--hHHHHHHH Q lcl|NC_019725. 73 ------------------EEYDVLN--SDISGVPEFLSSKMDRIVSLSGIHEIIIKNKNVGGV--SASQN--TALETFYK 128 (237) Q Consensus 73 ------------------e~~~~~~--~~lsGl~dl~~~~~~~iaa~s~iP~t~L~G~sp~Gl--natGe--~D~~nyyd 128 (237) =+++.+. -+|+-++|| ..|+..+=.+.++|++||-..+++.+ +.++| -|.-.|.. T Consensus 320 ~msMlEDyWLpRReGgrgTEItTLpGgqnlgem~DV-~YF~kkLy~aLnVP~SRl~~e~~~~~~~Gr~~EItRDEiKF~K 398 (516) T protein:vir:10 320 NLSMTEDYWLMRRDGKSVTEVTSLPGAQTMGEMDDV-RWFNKKLYEALRIPLSRMPRDDGGMVIGGQDMAITRDELDFRK 398 (516) T ss_pred hhhhHhhhcccccCCCcccceeeccccCCcChHHHH-HHHHHHHHHHhCCCcccccCCCCceeeccccchhhHHHHHHHH Confidence 1122221 235555554 68889999999999999988887766 33332 24467888 Q ss_pred HHHHHHHHh---hhHHHHHH--HHHhhcC-------CCceeEeCCCCCCCHHHHHHHHHHHHHHHHHHH--hCCCCCHHH Q lcl|NC_019725. 129 LVDRKREED---YRPLLEFL--LPFIVEE-------EEWSIEFEPLSVPSKKEESEITKNNVESVTKAI--TEQIIDLEE 194 (237) Q Consensus 129 ~I~~~Qe~~---l~p~l~~l--~~~i~~s-------~~~~~~f~pL~~~seke~Aei~~~~A~a~~~~~--~~g~i~~~e 194 (237) .|.++|... +.-+|++- ++-++.. ++|.|+|.-=...+|-..+||...+..+++.+- -...++.+- T Consensus 399 FI~rLR~rFs~lF~~~L~~qLilKgIit~eeW~~i~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~l~~~dpyvGky~s~~y 478 (516) T protein:vir:10 399 FIVQLQHNFEEIFLDPLKTNLIYKKIILESEWEEQINNIKVNFHQDSYYTELKDIETLRQRVDALSQIEPYVGKYVSHDY 478 (516) T ss_pred HHHHHHHHHHHHHHHHHHHHhhhcCCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhccccchHH Confidence 888888543 23333331 2223332 467899999999999999999999999988874 356788888 Q ss_pred HHHHHHhhccccccCCCCCCChhc--cccCCCCCCCCCCCcCcCC Q lcl|NC_019725. 195 ARDTLRSIAPEFKLKDGNNINIRE--PEETTEPEPGLGEKLEDEN 237 (237) Q Consensus 195 ~r~~l~~~~~~~g~~~~~~~~~~~--~e~~~e~~~~~~~~~~~e~ 237 (237) +++..-.+.+ ..+..++ +++. ..+|-.-.+.+.++ T Consensus 479 i~k~ILr~tD-------eei~~~~k~I~~E-~~~~~~~~p~~e~~ 515 (516) T protein:vir:10 479 VMKNILQMTD-------EQIAQEEKQIEKE-ANVKRFQNPENEDD 515 (516) T ss_pred HHHHHhcCCH-------hHHHHHHHHHHHh-hhCCCCCCCCcccc Confidence 8876432221 1222111 1111 11111122222233 No 219 >protein:vir:101189 Length: 516 # NCBI annotation: portal vertex protein of head # Family: family:all:1036 # MgeID: mge:1582 # MgeName: 44RR2.8t # Cross-refs: genbank:acc:NP_932511;genbank:gi:37651637;genbank:GeneID:2610682 Probab=64.25 E-value=0.3 Score=23.43 Aligned_cols=211 Identities=14% Similarity=0.155 Sum_probs=111.6 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHhccceeechhHHHhhcCCchHHHHHHHHHHHHHhcCchheeeeecCC-------- Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRRKQQAVWKVKGLAEMCDDDDAQYAARLRLAQVDDNSGVGRAIGIDAET-------- 72 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~~~~~v~k~~~l~~~~~~~~~e~~~~~r~~~~~~~r~~~~~~~iD~~~-------- 72 (237) =|..+-|++..|--++.- + -+||=++ ..+|-. .+.+.=++ .+++.+|+ =++.|+.. T Consensus 256 QLkm~EDAlVIYRitRAP------e--RRvFYID-vGnlPk-~KAeqYl~---~im~k~kN---klvYDa~TGev~ddrk 319 (516) T protein:vir:10 256 QLKLLEDAMVIYRITRAP------E--RRVFYID-VGNMNN-RKATEYVN---GIMQSLKN---RVVYDSNTGTVKNQKR 319 (516) T ss_pred hhHHHHhhHHHHhhhccc------c--ceEEEEe-cCCCCc-hhHHHHHH---HHHHhcCc---eeEEeCCCCeeccchh Confidence 456666777766544321 1 1222221 222211 11111111 11111211 12333222 Q ss_pred ------------------cceeeee--cCcCCHHHHHHHHHHHHhhhhcCceeeeeccCcccc--cccc--hhHHHHHHH Q lcl|NC_019725. 73 ------------------EEYDVLN--SDISGVPEFLSSKMDRIVSLSGIHEIIIKNKNVGGV--SASQ--NTALETFYK 128 (237) Q Consensus 73 ------------------e~~~~~~--~~lsGl~dl~~~~~~~iaa~s~iP~t~L~G~sp~Gl--natG--e~D~~nyyd 128 (237) =+++.+. -+|+-++|| ..|+..+=.+.++|++||-..+++.| +.++ .-|.-.|.. T Consensus 320 ~msMlEDyWLpRReGgrgTEItTLpGgqnlgem~DV-~YF~kkLy~aLnVP~sRl~~e~~~~~~~Gr~~EItRDEiKF~K 398 (516) T protein:vir:10 320 NLSMTEDYWLMRRDGKSVTEVSSLPGAQTMGDMDDV-RWFNKKLYEALRIPLSRIPRDDGGMVIGGQDTAITRDELDFRK 398 (516) T ss_pred hhhhHhhhcccccCCCCccceeeccccCCcChHHHH-HHHHHHHHHHhCCCcccccCCCCceeeccccchhhHHHHHHHH Confidence 1222221 235555664 68889999999999999988887766 2222 235567888 Q ss_pred HHHHHHHHhhhHHHHHHHHH------hhcC-------CCceeEeCCCCCCCHHHHHHHHHHHHHHHHHHH--hCCCCCHH Q lcl|NC_019725. 129 LVDRKREEDYRPLLEFLLPF------IVEE-------EEWSIEFEPLSVPSKKEESEITKNNVESVTKAI--TEQIIDLE 193 (237) Q Consensus 129 ~I~~~Qe~~l~p~l~~l~~~------i~~s-------~~~~~~f~pL~~~seke~Aei~~~~A~a~~~~~--~~g~i~~~ 193 (237) .|.++|.. +.+.+..+++. ++.. ++|.|+|.-=...+|-..+||...+..+++.+- -...++.+ T Consensus 399 FI~rLR~r-Fs~lf~~~L~~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~~~dpyvGky~s~~ 477 (516) T protein:vir:10 399 FVVQLQHD-FEEIFLDPLKTNLIYKRIITEDEWDEQINNIKVNFHQDSYYTELKDIETLRLRVDALSQIEPYVGKYVSHD 477 (516) T ss_pred HHHHHHHH-HHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhccccchH Confidence 88888854 44444433332 2222 478899999999999999999999999988874 35678888 Q ss_pred HHHHHHHhhccccccCCCCCCChhc--cccCCCCCCCCCCCcCcCC Q lcl|NC_019725. 194 EARDTLRSIAPEFKLKDGNNINIRE--PEETTEPEPGLGEKLEDEN 237 (237) Q Consensus 194 e~r~~l~~~~~~~g~~~~~~~~~~~--~e~~~e~~~~~~~~~~~e~ 237 (237) -+++..-.+. +..+..++ +++. ..++-.-.+++.++ T Consensus 478 yi~k~ILr~t-------Deei~~e~k~I~~E-~~~~~~~~p~~~~~ 515 (516) T protein:vir:10 478 YVMKNILQMT-------EEQIAQEEKQIEQE-AGIKRFQNPENEDD 515 (516) T ss_pred HHHHHHhcCC-------HhhHHHHHHHHHHh-hhCCCCCCCCcccc Confidence 8887643222 11222111 1111 11221223333344 No 220 >protein:vir:101806 Length: 516 # NCBI annotation: gp20 # Family: family:all:1036 # MgeID: mge:1580 # MgeName: 31 # Cross-refs: genbank:acc:YP_238883;genbank:gi:66391958;genbank:GeneID:3416633 Probab=64.25 E-value=0.3 Score=23.43 Aligned_cols=211 Identities=14% Similarity=0.155 Sum_probs=111.6 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHhccceeechhHHHhhcCCchHHHHHHHHHHHHHhcCchheeeeecCC-------- Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRRKQQAVWKVKGLAEMCDDDDAQYAARLRLAQVDDNSGVGRAIGIDAET-------- 72 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~~~~~v~k~~~l~~~~~~~~~e~~~~~r~~~~~~~r~~~~~~~iD~~~-------- 72 (237) =|..+-|++..|--++.- + -+||=++ ..+|-. .+.+.=++ .+++.+|+ =++.|+.. T Consensus 256 QLkm~EDAlVIYRitRAP------e--RRvFYID-vGnlPk-~KAeqYl~---~im~k~kN---klvYDa~TGev~ddrk 319 (516) T protein:vir:10 256 QLKLLEDAMVIYRITRAP------E--RRVFYID-VGNMNN-RKATEYVN---GIMQSLKN---RVVYDSNTGTVKNQKR 319 (516) T ss_pred hhHHHHhhHHHHhhhccc------c--ceEEEEe-cCCCCc-hhHHHHHH---HHHHhcCc---eeEEeCCCCeeccchh Confidence 456666777766544321 1 1222221 222211 11111111 11111211 12333222 Q ss_pred ------------------cceeeee--cCcCCHHHHHHHHHHHHhhhhcCceeeeeccCcccc--cccc--hhHHHHHHH Q lcl|NC_019725. 73 ------------------EEYDVLN--SDISGVPEFLSSKMDRIVSLSGIHEIIIKNKNVGGV--SASQ--NTALETFYK 128 (237) Q Consensus 73 ------------------e~~~~~~--~~lsGl~dl~~~~~~~iaa~s~iP~t~L~G~sp~Gl--natG--e~D~~nyyd 128 (237) =+++.+. -+|+-++|| ..|+..+=.+.++|++||-..+++.| +.++ .-|.-.|.. T Consensus 320 ~msMlEDyWLpRReGgrgTEItTLpGgqnlgem~DV-~YF~kkLy~aLnVP~sRl~~e~~~~~~~Gr~~EItRDEiKF~K 398 (516) T protein:vir:10 320 NLSMTEDYWLMRRDGKSVTEVSSLPGAQTMGDMDDV-RWFNKKLYEALRIPLSRIPRDDGGMVIGGQDTAITRDELDFRK 398 (516) T ss_pred hhhhHhhhcccccCCCCccceeeccccCCcChHHHH-HHHHHHHHHHhCCCcccccCCCCceeeccccchhhHHHHHHHH Confidence 1222221 235555664 68889999999999999988887766 2222 235567888 Q ss_pred HHHHHHHHhhhHHHHHHHHH------hhcC-------CCceeEeCCCCCCCHHHHHHHHHHHHHHHHHHH--hCCCCCHH Q lcl|NC_019725. 129 LVDRKREEDYRPLLEFLLPF------IVEE-------EEWSIEFEPLSVPSKKEESEITKNNVESVTKAI--TEQIIDLE 193 (237) Q Consensus 129 ~I~~~Qe~~l~p~l~~l~~~------i~~s-------~~~~~~f~pL~~~seke~Aei~~~~A~a~~~~~--~~g~i~~~ 193 (237) .|.++|.. +.+.+..+++. ++.. ++|.|+|.-=...+|-..+||...+..+++.+- -...++.+ T Consensus 399 FI~rLR~r-Fs~lf~~~L~~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~~~dpyvGky~s~~ 477 (516) T protein:vir:10 399 FVVQLQHD-FEEIFLDPLKTNLIYKRIITEDEWDEQINNIKVNFHQDSYYTELKDIETLRLRVDALSQIEPYVGKYVSHD 477 (516) T ss_pred HHHHHHHH-HHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhccccchH Confidence 88888854 44444433332 2222 478899999999999999999999999988874 35678888 Q ss_pred HHHHHHHhhccccccCCCCCCChhc--cccCCCCCCCCCCCcCcCC Q lcl|NC_019725. 194 EARDTLRSIAPEFKLKDGNNINIRE--PEETTEPEPGLGEKLEDEN 237 (237) Q Consensus 194 e~r~~l~~~~~~~g~~~~~~~~~~~--~e~~~e~~~~~~~~~~~e~ 237 (237) -+++..-.+. +..+..++ +++. ..++-.-.+++.++ T Consensus 478 yi~k~ILr~t-------Deei~~e~k~I~~E-~~~~~~~~p~~~~~ 515 (516) T protein:vir:10 478 YVMKNILQMT-------EEQIAQEEKQIEQE-AGIKRFQNPENEDD 515 (516) T ss_pred HHHHHHhcCC-------HhhHHHHHHHHHHh-hhCCCCCCCCcccc Confidence 8887643222 11222111 1111 11221223333344 No 221 >protein:vir:5691 Length: 344 # NCBI annotation: gpQ # Family: family:all:196 # MgeID: mge:120 # MgeName: L-413C # Cross-refs: genbank:acc:NP_839850;genbank:gi:30065705;genbank:GeneID:1260599 Probab=62.39 E-value=0.34 Score=23.18 Aligned_cols=156 Identities=8% Similarity=0.068 Sum_probs=70.2 Q ss_pred Cc--hhHHHHHHHHHHHHHHHHHHHHH-hccc-eeechhHHHhhcCCchHHHHHHHHHHHHHhcCchheeeeecC---Cc Q lcl|NC_019725. 1 MN--KSLIDAICDYDYCESLATQILRR-KQQA-VWKVKGLAEMCDDDDAQYAARLRLAQVDDNSGVGRAIGIDAE---TE 73 (237) Q Consensus 1 ll--q~~~d~v~~~~~~~~~~~~Ll~~-~~~~-v~k~~~l~~~~~~~~~e~~~~~r~~~~~~~r~~~~~~~iD~~---~e 73 (237) +. ......+.--..+.......... ++.. |+++.+ + .+ +.+..+.++++++-. ...++...++|... .+ T Consensus 175 ls~~~~a~~si~l~~~a~~~~~~~f~NGa~pg~Il~~~d-~-~l-s~e~~~~lk~~~~~~-~g~~~~r~l~l~~p~g~~~ 250 (344) T protein:vir:56 175 LPEYLSALNSAWLNESATLFRRKYYENGAHAGYIMYVTD-A-VQ-DRNDIEMLRENMVKS-KGRNNFKNLFLYAPQGKAD 250 (344) T ss_pred ccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEecC-C-CC-CHHHHHHHHHHHHHh-cCCCCccceEEecCCCCcc Confidence 11 12222222222222222222221 1122 233322 1 11 222344566666542 23556666677532 24 Q ss_pred ceeeeecCcCCHHH----HHHHHHHHHhhhhcCceeeeeccCccc---ccccchhHHHHHHHHHHHHHHHhhhHHHHHHH Q lcl|NC_019725. 74 EYDVLNSDISGVPE----FLSSKMDRIVSLSGIHEIIIKNKNVGG---VSASQNTALETFYKLVDRKREEDYRPLLEFLL 146 (237) Q Consensus 74 ~~~~~~~~lsGl~d----l~~~~~~~iaa~s~iP~t~L~G~sp~G---lnatGe~D~~nyyd~I~~~Qe~~l~p~l~~l~ 146 (237) .++....+.+..++ +-....+.||++-|||-. |+|..|.+ ++ +-+.-.+.|+ +..|.|.++++- T Consensus 251 G~~~~pis~~~~d~qf~e~k~~s~~eIa~afrVPp~-llGi~~~~t~~~~-n~eq~~~~f~-------~~tL~Pl~~~ie 321 (344) T protein:vir:56 251 GIKIIPLSEVATKDDFFNIKKASAADLLDAHRIPFQ-LMGGKPENVGSLG-DIEKVAKVFV-------RNELIPLQDRIR 321 (344) T ss_pred ceeEEEcCCChHHHHHHHHHHhhHHHHHHHhCCCHH-HhccCCCCCCccc-cHHHHHHHHH-------HHHHHHHHHHHH Confidence 56666666666654 556667789999999997 55766543 32 2233334443 345666665554 Q ss_pred HHhhcCCCceeEeCCCCCCCHHH Q lcl|NC_019725. 147 PFIVEEEEWSIEFEPLSVPSKKE 169 (237) Q Consensus 147 ~~i~~s~~~~~~f~pL~~~seke 169 (237) ++.-.--.=.+.|+|-.-..+.- T Consensus 322 ~~n~~l~~~~~~F~~y~l~~~~~ 344 (344) T protein:vir:56 322 EINGWIGQEVIRFKNYSLDTDNG 344 (344) T ss_pred HHHhhhccccccCCCccccccCC Confidence 43221111123454433222111 No 222 >protein:vir:78393 Length: 489 # NCBI annotation: putative structural protein # Family: family:all:584 # MgeID: mge:1851 # MgeName: SETP3 # Cross-refs: genbank:acc:YP_001110831;genbank:gi:134288592;genbank:GeneID:5179656 Probab=59.33 E-value=0.39 Score=22.80 Aligned_cols=208 Identities=7% Similarity=-0.006 Sum_probs=98.3 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHhccceeechhHHHhhcCCchHHHHHHHHHHHHHhcCchheeeeecCCcceeeeec Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRRKQQAVWKVKGLAEMCDDDDAQYAARLRLAQVDDNSGVGRAIGIDAETEEYDVLNS 80 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~~~~~v~k~~~l~~~~~~~~~e~~~~~r~~~~~~~r~~~~~~~iD~~~e~~~~~~~ 80 (237) |+..++=.|.+|..... --++++-..+-++-+.|..+.- ..... ...++- -.-+-...+.+ ..+-++..++. T Consensus 261 Ll~LA~lni~Hy~~ssd-~~~~l~~~~~P~l~i~G~d~~~-~~~~~-~~~~~~----i~~g~~~~~~l-p~~~~~~~ie~ 332 (489) T protein:vir:78 261 LLPLAELNIGHYRNSAD-NEESSFVVGQPTLFIYPGENLT-PQAFK-EANPNG----IKFGSRRGHNL-GYGGSAQLIQA 332 (489) T ss_pred hHHHHHHHHHHhhhhhH-HHHHHHHcccceeeeecCccCC-ccccc-ccCccc----eeeCCcccccC-CCCCCcceecc Confidence 89998889999988877 5677899999988877643221 11000 000000 00011111222 22344555555 Q ss_pred CcCCH-HHHHHHHHHHHhhh-hcCceeeeeccCcccccccchhHHHHH---HHHHHHHHHHhhhHHHHHHHHHhhc---- Q lcl|NC_019725. 81 DISGV-PEFLSSKMDRIVSL-SGIHEIIIKNKNVGGVSASQNTALETF---YKLVDRKREEDYRPLLEFLLPFIVE---- 151 (237) Q Consensus 81 ~lsGl-~dl~~~~~~~iaa~-s~iP~t~L~G~sp~GlnatGe~D~~ny---yd~I~~~Qe~~l~p~l~~l~~~i~~---- 151 (237) +-+++ ...++...+++..+ ++ |+.++ +.. |++.-...+ +..++++- ..+.-.+.+++++.+. T Consensus 333 ~~~~~~r~~l~~le~qm~~lGa~-----l~~~~-~~~--Ta~~~~~~~~~~~S~L~~~a-~~~e~al~~~l~~~a~w~G~ 403 (489) T protein:vir:78 333 GENNLARQNMLDKEQQAIQIGAQ-----LITPT-QQI--TAQSARIQRGADTSVMATIA-RNVSQAYTDALRWVAVMLGK 403 (489) T ss_pred CcchHHHHHHHHHHHHHHHHhhh-----hccCC-cch--hHHHHHHHHHHhhHHHHHHH-HHHHHHHHHHHHHHHHHcCC Confidence 55554 33444444444432 33 23222 223 433222222 22333333 3466677888877753 Q ss_pred C--CCceeEeCCCCCCCHHHHHHHHHHHHHHHHHHHhCCCCCHHHHHHHHHhhccccccCCCCCCChhccc----cCCCC Q lcl|NC_019725. 152 E--EEWSIEFEPLSVPSKKEESEITKNNVESVTKAITEQIIDLEEARDTLRSIAPEFKLKDGNNINIREPE----ETTEP 225 (237) Q Consensus 152 s--~~~~~~f~pL~~~seke~Aei~~~~A~a~~~~~~~g~i~~~e~r~~l~~~~~~~g~~~~~~~~~~~~e----~~~e~ 225 (237) . .+..|.-|+=+.. +.+.....+++..+++.|.|+.++.++.|+.++. . +.++++++ +.+.+ T Consensus 404 ~~~~~~~i~~n~dF~~-----~~~d~~~~~al~~~~~~G~is~~t~~~~L~~~gv----~---d~~~e~~~~ei~~~~~~ 471 (489) T protein:vir:78 404 PEDTEVEFRLNMDFFL-----EPMTAQDRAAWMADINAGLLPATAYYAALRKAGV----T---DWTDADIKDAVADQPLP 471 (489) T ss_pred CCCCceEEEeecccCc-----ccCCHHHHHHHHHHHhcCCCCHHHHHHHHHhCCC----C---CccHHHHHHHHhhcCCC Confidence 2 2334433332211 1111223566677899999999999999987532 2 12333332 22111 Q ss_pred CCCCCCCcCcCC Q lcl|NC_019725. 226 EPGLGEKLEDEN 237 (237) Q Consensus 226 ~~~~~~~~~~e~ 237 (237) -+....+.-+++ T Consensus 472 ~~~~~~g~~~~~ 483 (489) T protein:vir:78 472 VATEVQGEIPQS 483 (489) T ss_pred cccCCcccCCCC Confidence 111111111111 No 223 >protein:vir:6896 Length: 523 # NCBI annotation: gp20 portal vertex protein of head # Family: family:all:1036 # MgeID: mge:140 # MgeName: RB69 # Cross-refs: genbank:acc:NP_861872;genbank:gi:32453663;genbank:GeneID:1494298 Probab=54.43 E-value=0.5 Score=22.22 Aligned_cols=210 Identities=19% Similarity=0.230 Sum_probs=106.6 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHhccceeechhHHHhhcCCchHHHHHHHHHHHHHhcCchheeeeecCC-------- Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRRKQQAVWKVKGLAEMCDDDDAQYAARLRLAQVDDNSGVGRAIGIDAET-------- 72 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~~~~~v~k~~~l~~~~~~~~~e~~~~~r~~~~~~~r~~~~~~~iD~~~-------- 72 (237) =|..+-|++..|--++.- + -+||=++ ..+|-. .+.+.=++. +++.+|+ =++.|+.+ T Consensus 263 QLkmlEDAlVIYRitRAP------e--RRvFYID-vGnlPk-~KAeqYl~~---im~k~kN---KlvYDa~TGev~ddrk 326 (523) T protein:vir:68 263 QLKLLEDAVVIYRITRAP------D--RRVWYVD-TGNMPS-RKAAEHMQH---VMNTMKN---RIAYDATTGKIKNQQH 326 (523) T ss_pred hhHHHHhhHHHHhhhccc------c--ceEEEEe-cCCCCc-hhHHHHHHH---HHHhhcc---eeEEeccCCeeccchh Confidence 455666777666544321 1 1223221 222211 111111111 1111111 12222221 Q ss_pred ------------------cceeeee--cCcCCHHHHHHHHHHHHhhhhcCceeeeeccCccccc--ccc--hhHHHHHHH Q lcl|NC_019725. 73 ------------------EEYDVLN--SDISGVPEFLSSKMDRIVSLSGIHEIIIKNKNVGGVS--ASQ--NTALETFYK 128 (237) Q Consensus 73 ------------------e~~~~~~--~~lsGl~dl~~~~~~~iaa~s~iP~t~L~G~sp~Gln--atG--e~D~~nyyd 128 (237) =+++.+. -+|+-+.|| ..|+..+=.+.++|++||-+.. ||+| .++ .-|.-.|.. T Consensus 327 ~msMlEDyWLpRReGgrgTEItTLpGgqnlgem~DV-~YF~kkLy~aLnVP~sRl~~~~-~~f~~Gr~~EItRDEikF~K 404 (523) T protein:vir:68 327 IMSMTEDYWLQRRDGKAVTEVDTLPGADNTGNMEDV-RWFRNALYMALRIPITRIPSDQ-GGIQFDAGTSITRDELSFGK 404 (523) T ss_pred hhhhHhhhcccccCCCcccceeeccccCCcChHHHH-HHHHHHHHHHhCCcceeecCCC-cceecccccchhHHHHHHHH Confidence 1222222 245556664 6888999999999999997653 6666 322 235567888 Q ss_pred HHHHHHHHhhhHHHHHHHHH------hhcC-------CCceeEeCCCCCCCHHHHHHHHHHHHHHHHHHHh--CCCCCHH Q lcl|NC_019725. 129 LVDRKREEDYRPLLEFLLPF------IVEE-------EEWSIEFEPLSVPSKKEESEITKNNVESVTKAIT--EQIIDLE 193 (237) Q Consensus 129 ~I~~~Qe~~l~p~l~~l~~~------i~~s-------~~~~~~f~pL~~~seke~Aei~~~~A~a~~~~~~--~g~i~~~ 193 (237) .|.++|.. +.+.+..+++. ++.. ++|.|+|.-=...+|-..+||...+..+++.+-. .-.++.+ T Consensus 405 FI~rLR~r-Fs~lf~~~Lk~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~~~dpyvGky~s~~ 483 (523) T protein:vir:68 405 FIRELQHK-FEEIFLDPLKTNLILKGIITEDEWNDEINNIKIKFHRDSYFSELKDAEILERRINMLQMAEPFIGKYISHR 483 (523) T ss_pred HHHHHHHH-HHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEeeeecchHHHHHHHHHHHHHHHHHHHhhhhhcccchhH Confidence 88888854 33433333332 2222 4788999999999999999999999998887743 2246777 Q ss_pred HHHHHHHhhccccccCCCCCCCh--hccccCCCCCCCCCCCcCcCC Q lcl|NC_019725. 194 EARDTLRSIAPEFKLKDGNNINI--REPEETTEPEPGLGEKLEDEN 237 (237) Q Consensus 194 e~r~~l~~~~~~~g~~~~~~~~~--~~~e~~~e~~~~~~~~~~~e~ 237 (237) -+++..-.+.+ ..+.. +.+++ +..++-.-++.+.+. T Consensus 484 yi~k~ILr~tD-------eei~~~~kqI~~-E~k~~~~~~p~~e~~ 521 (523) T protein:vir:68 484 TAMKDILQMSD-------EEIEQEAKQIEE-ESKEARFQDPDQEQE 521 (523) T ss_pred HHHHHHhccCH-------HHHHHHHHHHHH-HhhcCCCCCCchhhh Confidence 77775422211 11111 11111 112222223333322 No 224 >protein:vir:3780 Length: 345 # NCBI annotation: orf15 # Family: family:all:196 # MgeID: mge:328 # MgeName: HP2 # Cross-refs: genbank:acc:NP_536820;genbank:gi:17981829;genbank:GeneID:929208 Probab=51.78 E-value=0.57 Score=21.91 Aligned_cols=154 Identities=9% Similarity=0.044 Sum_probs=70.5 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHH-hcc-ceeechhHHHhhcCCchHHHHHHHHHHHHHhcCchheeee-ec--CCcce Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRR-KQQ-AVWKVKGLAEMCDDDDAQYAARLRLAQVDDNSGVGRAIGI-DA--ETEEY 75 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~-~~~-~v~k~~~l~~~~~~~~~e~~~~~r~~~~~~~r~~~~~~~i-D~--~~e~~ 75 (237) -+......+.--..+.......... ++. -|+++.+ ..+ +.+....++++++-. ...+|.+.++| .. ..+.+ T Consensus 177 ~~~~a~~si~l~~~a~~~~~~~f~NG~~p~~Il~~~d--~~l-~~e~~~~lk~~~~~~-~g~~n~~~~~i~~p~g~~~G~ 252 (345) T protein:vir:37 177 DYVGGIQSALLNSDATVFRRRYFSNGAHMGFILYSTD--PDL-TEEMEEEIARKISES-KGVGNFRSMFVNIANGHPDGL 252 (345) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEecC--CCC-CHHHHHHHHHHHHHh-cCcccccceEEEcCCCcccce Confidence 1111112222222222222222211 111 1233321 111 223344566666542 33344444444 22 12445 Q ss_pred eeeecCcCCHHH----HHHHHHHHHhhhhcCceeeeeccCcccccc--cchhHHHHHHHHHHHHHHHhhhHHHHHHHHHh Q lcl|NC_019725. 76 DVLNSDISGVPE----FLSSKMDRIVSLSGIHEIIIKNKNVGGVSA--SQNTALETFYKLVDRKREEDYRPLLEFLLPFI 149 (237) Q Consensus 76 ~~~~~~lsGl~d----l~~~~~~~iaa~s~iP~t~L~G~sp~Glna--tGe~D~~nyyd~I~~~Qe~~l~p~l~~l~~~i 149 (237) +....+.+..++ +-....+.||++-|||-. |+|..|.+-.+ +-+.-.+.|| +..|.|.+.++-..+ T Consensus 253 ~~~pls~~~~d~qf~e~k~~~~~dIa~a~~VPp~-llGi~~~~~~~~~~~e~~~~~f~-------~~~l~P~~~~ie~~l 324 (345) T protein:vir:37 253 KVIPIGDTGTKDEFANIKNISAQDVLTAHRFPAG-LSGIIPTNTGGLGDPLKYREVYH-------YDEVMPLQEIIAETI 324 (345) T ss_pred EEEEccCChhHHHHHHHHHHhHHHHHHHhCCCHH-HhCccCCCCCCcccHHHHHHHHH-------HHHHHHHHHHHHHHh Confidence 555555555554 333567789999999976 56876653211 2233334454 345778777776665 Q ss_pred h----cCCCceeEeCCCCCCCH Q lcl|NC_019725. 150 V----EEEEWSIEFEPLSVPSK 167 (237) Q Consensus 150 ~----~s~~~~~~f~pL~~~se 167 (237) - ...+..+.|+|= .+++ T Consensus 325 n~~~~~~~~~~i~F~~~-~L~~ 345 (345) T protein:vir:37 325 NQDPEIKNLLKIKFREQ-NFAK 345 (345) T ss_pred hhhccCCCcceEEecch-hhcC Confidence 3 235667778641 1222 No 225 >protein:vir:78749 Length: 337 # NCBI annotation: putative portal protein # Family: family:all:196 # MgeID: mge:1857 # MgeName: phiO18P # Cross-refs: genbank:acc:YP_001285643;genbank:gi:148727149;genbank:GeneID:5220095 Probab=51.65 E-value=0.58 Score=21.90 Aligned_cols=153 Identities=7% Similarity=0.006 Sum_probs=69.9 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHH-hccc-eeechhHHHhhcCCchHHHHHHHHHHHHHhcCchheeee-ecC--Ccce Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRR-KQQA-VWKVKGLAEMCDDDDAQYAARLRLAQVDDNSGVGRAIGI-DAE--TEEY 75 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~-~~~~-v~k~~~l~~~~~~~~~e~~~~~r~~~~~~~r~~~~~~~i-D~~--~e~~ 75 (237) -++.....+.--..+.......... ++.. ++++.+ + .+ +.+..+.+++.++-. ...+|.+.+++ ... .+.+ T Consensus 168 ~~~~a~~si~l~~aa~~~~~~~f~NGa~p~~il~~~~-~-~l-~~e~~~~lk~~~~~~-~G~~n~~~~~v~~~~g~~~Gi 243 (337) T protein:vir:78 168 DYLGGLQSALLNQDATLFRRRYFLNGAHMGFIFYATD-P-NM-DDDTEEEMKEMIANS-KGVGNFRSMFVNIPDGKPDGI 243 (337) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhccCCCceeEEcCC-C-CC-CHHHHHHHHHHHHHh-cCcccccceEEEcCCCCccce Confidence 1112222222222222222222221 1222 233222 0 11 223344566666542 22345444444 321 2446 Q ss_pred eeeecCcCCHHH----HHHHHHHHHhhhhcCceeeeeccCcccccc---cchhHHHHHHHHHHHHHHHhhhHHHHHHHHH Q lcl|NC_019725. 76 DVLNSDISGVPE----FLSSKMDRIVSLSGIHEIIIKNKNVGGVSA---SQNTALETFYKLVDRKREEDYRPLLEFLLPF 148 (237) Q Consensus 76 ~~~~~~lsGl~d----l~~~~~~~iaa~s~iP~t~L~G~sp~Glna---tGe~D~~nyyd~I~~~Qe~~l~p~l~~l~~~ 148 (237) +....+.+..++ +-....+.||++-|||-.. .|..|.+=.+ +-+...+.|+ +..|.|.++++-.. T Consensus 244 ~~~pis~~~~d~qfle~k~~s~~eIa~a~~VPp~l-lGi~~~~~~~~~~n~e~~~~~f~-------~~~L~P~~~~ie~~ 315 (337) T protein:vir:78 244 KLIPVGDIATKDEFAAIKGITAQDVLTAHRYPPAL-AGIIPTNGGGGLGDPEKYDATYA-------RNEVLPLCELVQDA 315 (337) T ss_pred eEEEcCCChhHHHHHHHHHHhHHHHHHHhCCCHHH-cccccCCCcCccccHHHHHHHHH-------HHHHHHHHHHHHHH Confidence 666666666654 3445677899999999854 4665544222 1233334444 35577777766655 Q ss_pred hhc---CC--CceeEeCCCCCC Q lcl|NC_019725. 149 IVE---EE--EWSIEFEPLSVP 165 (237) Q Consensus 149 i~~---s~--~~~~~f~pL~~~ 165 (237) +-+ +. -+.|+|++=-.+ T Consensus 316 ~n~~ll~~~~~~~f~~~~~~~~ 337 (337) T protein:vir:78 316 INSAGLPRALWVTFRETIGAAV 337 (337) T ss_pred HhhhcCChhhceeccccccccC Confidence 532 21 234556654444 No 226 >protein:vir:81017 Length: 521 # NCBI annotation: gp20 portal vertex protein of head # Family: family:all:1036 # MgeID: mge:1888 # MgeName: Phi1 # Cross-refs: genbank:acc:YP_001469501;genbank:gi:157311458;genbank:GeneID:5602316 Probab=51.38 E-value=0.58 Score=21.87 Aligned_cols=211 Identities=12% Similarity=0.179 Sum_probs=108.7 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHhccceeechhHHHhhcCCchHHHHHHHHHHHHHhcCchheeeeecC--------- Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRRKQQAVWKVKGLAEMCDDDDAQYAARLRLAQVDDNSGVGRAIGIDAE--------- 71 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~~~~~v~k~~~l~~~~~~~~~e~~~~~r~~~~~~~r~~~~~~~iD~~--------- 71 (237) =|..+-|++..|--++.- + -+||=++ ..+|-. .+.+.=++ .+++.+|+ =++.|+. T Consensus 260 QLkm~EDAlVIYRitRAP------e--RRvFYID-vGnlpk-~KAeqYl~---~im~k~kN---klvYDa~TGev~ddrk 323 (521) T protein:vir:81 260 QLKLLEDAMVVYRITRAP------E--RRVFFID-TGNMNN-RKAAQHMN---SVAQSFKN---RVVYDASTGKLKNQQA 323 (521) T ss_pred hhHHHHhhHHHHhhhccc------c--ceEEEEe-cCCCCc-hhHHHHHH---HHHHhcCc---eeEeeccccccccccc Confidence 455666777666544321 1 1233332 222211 11111111 11222222 1222221 Q ss_pred -----------------Ccceeeee--cCcCCHHHHHHHHHHHHhhhhcCceeeeeccCccccc--ccc--hhHHHHHHH Q lcl|NC_019725. 72 -----------------TEEYDVLN--SDISGVPEFLSSKMDRIVSLSGIHEIIIKNKNVGGVS--ASQ--NTALETFYK 128 (237) Q Consensus 72 -----------------~e~~~~~~--~~lsGl~dl~~~~~~~iaa~s~iP~t~L~G~sp~Gln--atG--e~D~~nyyd 128 (237) +=+++.+. -+|+-++|| ..|+..+=.+.++|++||-..+.+|+| .++ .-|.-.|.. T Consensus 324 ~msMlEDyWLpRReGgrgTEItTLpGgqnlgem~DV-~YF~kkLy~aLnVP~sRl~~e~~~~~~~Gr~~EItRDEiKF~K 402 (521) T protein:vir:81 324 NLSMTEDYWLQRRDGKAITDVTTLPGASGMSDIDDI-RYFNRKLYEALRVPLSRSNLSDANMVIGGDGSEITRDELEFSK 402 (521) T ss_pred ccchhhhhcccccCCCcccceeecccCCCCChHHHH-HHHHHHHHHHhCCccccccCCCCcceeccccchhhHHHHHHHH Confidence 12233332 245556664 688899999999999999777777775 222 235567888 Q ss_pred HHHHHHHHhhhHHHHHHHHH------hhcC-------CCceeEeCCCCCCCHHHHHHHHHHHHHHHHHHHh--CCCCCHH Q lcl|NC_019725. 129 LVDRKREEDYRPLLEFLLPF------IVEE-------EEWSIEFEPLSVPSKKEESEITKNNVESVTKAIT--EQIIDLE 193 (237) Q Consensus 129 ~I~~~Qe~~l~p~l~~l~~~------i~~s-------~~~~~~f~pL~~~seke~Aei~~~~A~a~~~~~~--~g~i~~~ 193 (237) .|.++|.. +.+.+..+++. ++.. +.|.|+|.-=...+|-..+||...+..+++.+-. .-.++.+ T Consensus 403 FI~rLR~r-Fs~lf~~~L~~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~~~dpyvGky~s~d 481 (521) T protein:vir:81 403 FIRTRQSQ-FSEVLRDPLKYNLILKNVITEDDWDREINNIKVVFHRDSYYTEVKDAEILERRIGLIERITPYIGKYFSNQ 481 (521) T ss_pred HHHHHHHH-HHHHHHHHHHHhhhhhcCCCHHHHHHHhhcceEEEeecchHHHHHHHHHHHHHHHHHHHhhhhhccccchH Confidence 88888854 33433333332 2322 3678999999999999999999999998888743 2246777 Q ss_pred HHHHHHHhhccccccCCCCCCCh--hccccCCCCCCCCCCCcCcCC Q lcl|NC_019725. 194 EARDTLRSIAPEFKLKDGNNINI--REPEETTEPEPGLGEKLEDEN 237 (237) Q Consensus 194 e~r~~l~~~~~~~g~~~~~~~~~--~~~e~~~e~~~~~~~~~~~e~ 237 (237) -+++..-.+.+ .++.. +.+++. ..++-.-++.++++ T Consensus 482 yi~k~ILr~tD-------eei~~~~k~I~~E-~~~~~~~~p~~~~~ 519 (521) T protein:vir:81 482 TVMRDILKYTD-------DQMDTEKKQIEEE-ANDPRFKQTPDEIE 519 (521) T ss_pred HHHHHHhccCH-------HHHHHHHHHHHHH-hhCCCCCCCccccc Confidence 78775422221 11111 111111 11111122222222 No 227 >protein:vir:1150 Length: 350 # NCBI annotation: predicted capsid packaging protein # Family: family:all:196 # MgeID: mge:24 # MgeName: phi CTX # Cross-refs: genbank:acc:NP_490599;genbank:gi:17313219;genbank:GeneID:927315 Probab=49.69 E-value=0.63 Score=21.68 Aligned_cols=153 Identities=10% Similarity=0.085 Sum_probs=61.9 Q ss_pred CchhH---------HHHHHHHHHHHHHHHHHHHHhcc--ceeechhHHHhhcCCchHHHHHHHHHHHHHhcCchhe-eee Q lcl|NC_019725. 1 MNKSL---------IDAICDYDYCESLATQILRRKQQ--AVWKVKGLAEMCDDDDAQYAARLRLAQVDDNSGVGRA-IGI 68 (237) Q Consensus 1 llq~~---------~d~v~~~~~~~~~~~~Ll~~~~~--~v~k~~~l~~~~~~~~~e~~~~~r~~~~~~~r~~~~~-~~i 68 (237) ..+.+ ...+.--..+............. -++++.+ . .+ +.+..+.+++.++-. ..-+|.+- +++ T Consensus 176 ~~~~~yGls~~~~a~~si~l~~~a~~~~~~~f~NGa~~~gil~~~~-~-~l-s~e~~~~l~~~~~~~-~G~~N~~~~~v~ 251 (350) T protein:vir:11 176 INQEIYGVPEWFCALQSALLNESATLFRRKYYNNGSHAGFILYMTD-A-AQ-NEEDIDALRTALKTA-KGPGNFRNLFVY 251 (350) T ss_pred CCCCcccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEecC-C-CC-CHHHHHHHHHHHHHh-cCccccCceeee Confidence 12222 22222222222222222211111 2334332 1 12 223344566655542 22344444 444 Q ss_pred ecC--CcceeeeecCcCCHHH----HHHHHHHHHhhhhcCceeeeeccCcc---cccccchhHHHHHHHHHHHHHHHhhh Q lcl|NC_019725. 69 DAE--TEEYDVLNSDISGVPE----FLSSKMDRIVSLSGIHEIIIKNKNVG---GVSASQNTALETFYKLVDRKREEDYR 139 (237) Q Consensus 69 D~~--~e~~~~~~~~lsGl~d----l~~~~~~~iaa~s~iP~t~L~G~sp~---GlnatGe~D~~nyyd~I~~~Qe~~l~ 139 (237) ..+ .+.++....+.+..++ +-....+.||++-+||-. |+|..|. |++.. +.-.+.||. ..|. T Consensus 252 ~~~g~~~g~~~~pl~~~~~d~qf~e~k~~~~~eIa~a~~VPp~-llGi~~~~t~~~sn~-e~~~~~f~~-------~~L~ 322 (350) T protein:vir:11 252 APNGKKEGIQLIPVSEVAAKDEFGSIKNISRDDQLAGLRVYPQ-LMGVVPQNAGGFGSI-SDAAAVWAS-------LELA 322 (350) T ss_pred cCCCCccceEEEEcCCChhHHHHHHHHHHhHHHHHHHhCCCHH-HhcccCCCCCCcCCH-HHHHHHHHH-------HHHH Confidence 322 2445665555555443 455667789999999966 6665544 34322 333344443 2345 Q ss_pred HHHHHHHHHhhc-CCCceeEeCCCCCCCHH Q lcl|NC_019725. 140 PLLEFLLPFIVE-EEEWSIEFEPLSVPSKK 168 (237) Q Consensus 140 p~l~~l~~~i~~-s~~~~~~f~pL~~~sek 168 (237) |.++++-++.-. ..+ .+.|+|- .++.- T Consensus 323 P~~~~ie~ln~~l~~~-~~~F~~~-~~~~l 350 (350) T protein:vir:11 323 PMQTRLQQVNEMIGEE-VVRFAQF-DAPGL 350 (350) T ss_pred HHHHHHHHHHhhcCcc-ccccCcc-cccCC Confidence 555444332211 111 1223321 11111 No 228 >protein:vir:98853 Length: 219 # NCBI annotation: hypothetical protein # Family: family:all:196 # MgeID: mge:1495 # MgeName: F108 # Cross-refs: genbank:acc:YP_654729;genbank:gi:109302914;genbank:GeneID:4156058 Probab=49.31 E-value=0.64 Score=21.64 Aligned_cols=155 Identities=8% Similarity=0.049 Sum_probs=70.2 Q ss_pred Cc--hhHHHHHHHHHHHHHHHHHHHHH-hccc-eeechhHHHhhcCCchHHHHHHHHHHHHHhcCchheeeeec-----C Q lcl|NC_019725. 1 MN--KSLIDAICDYDYCESLATQILRR-KQQA-VWKVKGLAEMCDDDDAQYAARLRLAQVDDNSGVGRAIGIDA-----E 71 (237) Q Consensus 1 ll--q~~~d~v~~~~~~~~~~~~Ll~~-~~~~-v~k~~~l~~~~~~~~~e~~~~~r~~~~~~~r~~~~~~~iD~-----~ 71 (237) |. +.+...+.....+......+... +... |+++.+ . .+ +++....+++.++-. ...+|.+.++|.. + T Consensus 47 lspi~~a~~~i~~~~aa~~~~~~~f~Ng~~p~gil~~~~-~-~l-~~e~~~~~~~~~~~~-~g~~n~~~~~l~~~gg~~~ 122 (219) T protein:vir:98 47 SPDYVGGITSALLNSDATIFRRRYYSNGAHMGFILYSTD-P-DM-TEEMEDEIAERIRDS-KGVGNFRSMFVNIAGGHPD 122 (219) T ss_pred ecHHHHHHHHHHHHHHHHHHHHHHHhcCCCCceEEEeCC-C-CC-CHHHHHHHHHHHHHh-cCcccccceeEecCCCCcc Confidence 22 22333333222222222222211 1111 234332 1 11 122334455555442 2333434444432 1 Q ss_pred CcceeeeecCcC--CHHHHHHHHHHHHhhhhcCceeeeeccC---cccccccchhHHHHHHHHHHHHHHHhhhHHHHHHH Q lcl|NC_019725. 72 TEEYDVLNSDIS--GVPEFLSSKMDRIVSLSGIHEIIIKNKN---VGGVSASQNTALETFYKLVDRKREEDYRPLLEFLL 146 (237) Q Consensus 72 ~e~~~~~~~~ls--Gl~dl~~~~~~~iaa~s~iP~t~L~G~s---p~GlnatGe~D~~nyyd~I~~~Qe~~l~p~l~~l~ 146 (237) +=+|..++.+.. -+-+.-......||.+-|+|-.+| |.. .++.++. +.-...|| +..|.|.++++- T Consensus 123 G~~~~~~~~~~~d~qfle~rk~~~~eIa~~fgVPp~~l-G~~~~~~~~~sn~-eq~~~~f~-------~~tL~P~~~~ie 193 (219) T protein:vir:98 123 GLKVIPIGDTGQKDEFANIKNISAQDVLTSHRFPPGLS-GIIPVNTAGLGDP-LKIREAYQ-------ADEVLPLQEIIA 193 (219) T ss_pred ceeEEEccCCHHHHHHHHHHHhhHHHHHHHhCCCHHHc-ccccCCCCCccCH-HHHHHHHH-------HHHHHHHHHHHH Confidence 223444433332 222344455788999999999965 543 3333222 22333344 566889888877 Q ss_pred HHhh----cCCCceeEeCCCCCCCHHH Q lcl|NC_019725. 147 PFIV----EEEEWSIEFEPLSVPSKKE 169 (237) Q Consensus 147 ~~i~----~s~~~~~~f~pL~~~seke 169 (237) ..|- ...+..+.|+ =..++++- T Consensus 194 ~~ln~~~~~~~~~~~~F~-~~~~~d~~ 219 (219) T protein:vir:98 194 ESINSDYEIKSALKVNFK-QPEKRDKN 219 (219) T ss_pred HHhhhhhcCCCccEEeec-CcccccCC Confidence 6663 3457777786 33333333 No 229 >protein:vir:6596 Length: 521 # NCBI annotation: portal vertex protein of head # Family: family:all:1036 # MgeID: mge:139 # MgeName: RB49 # Cross-refs: genbank:acc:NP_891727;genbank:gi:33620636;genbank:GeneID:1725288 Probab=44.12 E-value=0.82 Score=21.06 Aligned_cols=211 Identities=12% Similarity=0.172 Sum_probs=108.3 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHhccceeechhHHHhhcCCchHHHHHHHHHHHHHhcCchheeeeecC--------- Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRRKQQAVWKVKGLAEMCDDDDAQYAARLRLAQVDDNSGVGRAIGIDAE--------- 71 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~~~~~v~k~~~l~~~~~~~~~e~~~~~r~~~~~~~r~~~~~~~iD~~--------- 71 (237) =|..+-|++..|--++.- + -+||=++ ..+|-. .+.+.=++ .+++.+|+ =++.|+. T Consensus 260 QLkm~EDAlVIYRitRAP------e--RRvFYID-vGnlPk-~KAeqYl~---~im~k~kN---klvYDa~TGev~ddrk 323 (521) T protein:vir:65 260 QLKLLEDAMVVYRITRAP------E--RRVFFID-TGNMNN-RKAAQHMN---SVAQSFKN---RVVYDASTGKLKNQQA 323 (521) T ss_pred hhHHHHhhHHHHhhhccc------c--ceEEEEe-cCCCCc-hhHHHHHH---HHHHhcCc---eeEeeccccccccccc Confidence 455666777666544321 1 1233332 222211 11111111 11222222 1222221 Q ss_pred -----------------Ccceeeee--cCcCCHHHHHHHHHHHHhhhhcCceeeeeccCcccccc--cc--hhHHHHHHH Q lcl|NC_019725. 72 -----------------TEEYDVLN--SDISGVPEFLSSKMDRIVSLSGIHEIIIKNKNVGGVSA--SQ--NTALETFYK 128 (237) Q Consensus 72 -----------------~e~~~~~~--~~lsGl~dl~~~~~~~iaa~s~iP~t~L~G~sp~Glna--tG--e~D~~nyyd 128 (237) +=+++.+. -+|+-++|| ..|+..+=.+.++|++||-..+.+|+|- ++ .-|.-.|.. T Consensus 324 ~msMlEDyWLpRReGgrgTEItTLpGgqnlgem~DV-~YF~kkLy~aLnVP~sRl~~e~~~~~~~gr~~EItRDEiKF~K 402 (521) T protein:vir:65 324 NLSMTEDYWLQRRDGKAITDVTTLPGASGMSDIDDI-RYFNRKLYEALRVPLSRSNLSDANMVIGGDGSEITRDELEFSK 402 (521) T ss_pred ccchhhhhcccccCCCCccceeecccCCCcChHHHH-HHHHHHHHHHhCCCceeccCCCCcceeccccchhhHHHHHHHH Confidence 12233332 245556664 6888999999999999996666677762 22 235567888 Q ss_pred HHHHHHHHhhhHHHHHHHHH------hhcC-------CCceeEeCCCCCCCHHHHHHHHHHHHHHHHHHHh--CCCCCHH Q lcl|NC_019725. 129 LVDRKREEDYRPLLEFLLPF------IVEE-------EEWSIEFEPLSVPSKKEESEITKNNVESVTKAIT--EQIIDLE 193 (237) Q Consensus 129 ~I~~~Qe~~l~p~l~~l~~~------i~~s-------~~~~~~f~pL~~~seke~Aei~~~~A~a~~~~~~--~g~i~~~ 193 (237) .|.++|.. +.+.+..+++. ++.. +.|.|+|.-=...+|-..+||...+..+++.+-. .-.+|.+ T Consensus 403 FI~rLR~r-Fs~lf~~~L~~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~~~dpyvGky~S~d 481 (521) T protein:vir:65 403 FIRTLQSQ-FSEVLRDPLKYNLILKNVITEDDWDREINNIKVVFHRDSYYTEVKDAEILERRIGLIERITPYIGKYFSNQ 481 (521) T ss_pred HHHHHHHH-HHHHHHHHHHHhhhhhcCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhccccchH Confidence 88888854 33433333332 2322 3678999999999999999999999998888743 2256888 Q ss_pred HHHHHHHhhccccccCCCCCCCh--hccccCCCCCCCCCCCcCcCC Q lcl|NC_019725. 194 EARDTLRSIAPEFKLKDGNNINI--REPEETTEPEPGLGEKLEDEN 237 (237) Q Consensus 194 e~r~~l~~~~~~~g~~~~~~~~~--~~~e~~~e~~~~~~~~~~~e~ 237 (237) -+++..-.+.+ ..+.. +.+++. ..++-.-++.++++ T Consensus 482 yi~k~ILr~tD-------eei~~~~k~I~~E-~~~~~~~~p~~~~~ 519 (521) T protein:vir:65 482 TVMRDILKYTD-------DQMDTEKKQIEEE-ANDPRFKQTPDEIE 519 (521) T ss_pred HHHHHHhccCH-------HHHHHHHHHHHHh-hhCCCCCCCccccc Confidence 88876422221 11111 111111 11111112222222 No 230 >protein:vir:95014 Length: 491 # NCBI annotation: structural protein # Family: family:all:584 # MgeID: mge:1547 # MgeName: KS7 # Cross-refs: genbank:acc:YP_224035;genbank:gi:62327322;genbank:GeneID:5176842 Probab=43.27 E-value=0.85 Score=20.97 Aligned_cols=205 Identities=7% Similarity=-0.012 Sum_probs=98.9 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHhccceeechhHHHhhcCCchHHHHHHHHHHHHHhcCchheeeeecCCcceeeeec Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRRKQQAVWKVKGLAEMCDDDDAQYAARLRLAQVDDNSGVGRAIGIDAETEEYDVLNS 80 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~~~~~v~k~~~l~~~~~~~~~e~~~~~r~~~~~~~r~~~~~~~iD~~~e~~~~~~~ 80 (237) |+..++=.|.+|..... --++++-..+-++-+.|..+.-... .. .... .. ..-+ .+..+.-..+-++..++. T Consensus 261 Ll~LA~lni~Hy~~ssd-~~~~l~~~~~P~l~~~G~d~~~~~~-~~-~~~~-~~---i~~g-~~~~~~lP~~~~~~~ie~ 332 (491) T protein:vir:95 261 LLPLAELNIGHYRNSAD-NEESSFVVGQPTLFIYPGDNLTPQS-FK-EANP-NG---IKFG-SRCGHNLGYGGSAQLIQA 332 (491) T ss_pred hHHHHHHHHHHhhhhhH-HHHHHHHcccceeeeecCcccCcch-hh-ccCc-ce---eEec-CcCCcCCCCCCccceeec Confidence 89998889999988877 5667888888888776633221110 00 0000 00 0000 011122233456666666 Q ss_pred CcCCH-HHHHHHHHHHHhhh-hcCceeeeeccCcccccccchhHHHHH---HHHHHHHHHHhhhHHHHHHHHHhhc---- Q lcl|NC_019725. 81 DISGV-PEFLSSKMDRIVSL-SGIHEIIIKNKNVGGVSASQNTALETF---YKLVDRKREEDYRPLLEFLLPFIVE---- 151 (237) Q Consensus 81 ~lsGl-~dl~~~~~~~iaa~-s~iP~t~L~G~sp~GlnatGe~D~~ny---yd~I~~~Qe~~l~p~l~~l~~~i~~---- 151 (237) .-+++ ...++...+++..+ ++ |+.++ + +-|++.-...+ +..++++- ..+.-.+.+++++.+. T Consensus 333 ~~~~~~~~~l~~~e~qm~~~Ga~-----l~~~~-~--~~Ta~~~~~~~~~~~S~L~~~a-~~~e~al~~~l~~~a~w~G~ 403 (491) T protein:vir:95 333 GENNLARQNMLDKEQQAIQIGAQ-----LITPS-Q--QITAESARIQRGADTSVMATIA-RNVSQAYTDALRWVAMMLGK 403 (491) T ss_pred CcchHHHHHHHHHHHHHHHHHHH-----hccCC-c--chhHHHHHHHHHHhhHHHHHHH-HHHHHHHHHHHHHHHHHcCC Confidence 65555 33444444444322 22 33322 2 33443333222 22233332 3456677777777652 Q ss_pred C--CCceeEeCCC---CCCCHHHHHHHHHHHHHHHHHHHhCCCCCHHHHHHHHHhhccccccCCCCCCChhccccC-CCC Q lcl|NC_019725. 152 E--EEWSIEFEPL---SVPSKKEESEITKNNVESVTKAITEQIIDLEEARDTLRSIAPEFKLKDGNNINIREPEET-TEP 225 (237) Q Consensus 152 s--~~~~~~f~pL---~~~seke~Aei~~~~A~a~~~~~~~g~i~~~e~r~~l~~~~~~~g~~~~~~~~~~~~e~~-~e~ 225 (237) . .+..|..|+= ..++.. ..++.-.++++|.|+.+..++.|+.++. . +.++++..+. .++ T Consensus 404 ~~~~~v~i~~n~dF~~~~~~~~--------~~~all~~~~~G~is~~t~~~~L~~~~v----l---~~~~e~~~~~ie~~ 468 (491) T protein:vir:95 404 PEDSEVEFQLNMDFFLQPMTAQ--------DRAAWMADINAGLLPATAYYAALRKAGV----T---DWTDEDILNAIEDA 468 (491) T ss_pred CCCCceEEEeecccccccCCHH--------HHHHHHHHHhcCCCCHHHHHHHHHhCCC----C---CccHHHHHHHHHhc Confidence 1 2334433332 222222 3667778889999999999999986542 2 2333332211 111 Q ss_pred CCCC-------CCCcCcCC Q lcl|NC_019725. 226 EPGL-------GEKLEDEN 237 (237) Q Consensus 226 ~~~~-------~~~~~~e~ 237 (237) .+.. |+-.+... T Consensus 469 ~~~~~~~~~~~~~~~~~~~ 487 (491) T protein:vir:95 469 PLPSGAVTQVAGEIPQAAQ 487 (491) T ss_pred CCCCCccccccccchhhhh Confidence 1111 11111111 No 231 >protein:vir:104892 Length: 558 # NCBI annotation: T4-like capsid assembly protein # Family: family:all:1036 # MgeID: mge:1630 # MgeName: P-SSM2 # Cross-refs: genbank:acc:YP_214363;genbank:gi:61806003;genbank:GeneID:3294412 Probab=42.17 E-value=0.9 Score=20.85 Aligned_cols=214 Identities=14% Similarity=0.170 Sum_probs=103.8 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHhccceeechhHHHhhcCCchHHHHHHHHHHHHHhcCchheeeeecCC-------- Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRRKQQAVWKVKGLAEMCDDDDAQYAARLRLAQVDDNSGVGRAIGIDAET-------- 72 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~~~~~v~k~~~l~~~~~~~~~e~~~~~r~~~~~~~r~~~~~~~iD~~~-------- 72 (237) =|..+-|++..|--++.- + -+||=++ ..+|-. .+.+.=++ .+++.+| +=++.|+.+ T Consensus 255 QLkmlEDAlVIYRitRAP------E--RRvFYID-VGnLPk-~KAeqYlr---~iM~k~K---NklVYDa~TGev~ddrk 318 (558) T protein:vir:10 255 QLRMIEDSLVIYRLSRAP------E--RRIFYID-VGNLPK-VKAEQYLK---EVMSRYR---NKLVYDANTGEVRDDRK 318 (558) T ss_pred hhHHHHhhHHHHhhhccc------c--ceEEEEe-cCCCCc-hhHHHHHH---HHHHhcc---ceEEEeccCceecccch Confidence 455666777666544321 1 1223221 222211 11111111 1122222 112333322 Q ss_pred ------------------cceeee--ecCcCCHHHHHHHHHHHHhhhhcCceeeeeccCcccccccchh-----HHHHHH Q lcl|NC_019725. 73 ------------------EEYDVL--NSDISGVPEFLSSKMDRIVSLSGIHEIIIKNKNVGGVSASQNT-----ALETFY 127 (237) Q Consensus 73 ------------------e~~~~~--~~~lsGl~dl~~~~~~~iaa~s~iP~t~L~G~sp~GlnatGe~-----D~~nyy 127 (237) =+++.+ .-+|+-++|| ..|...+=.+.++|++||=.+ +|+|- |.+ |.-.|. T Consensus 319 ~msMlEDyWLpRReGgrgTEItTLpGgqnLgem~DV-~YF~kKLy~aLnVP~SRl~~e--~~f~~-Gr~~EItRDEiKF~ 394 (558) T protein:vir:10 319 FMSMMEDFWLPRREGGRGTEITTLPGGQNLGELSDV-DYFQKKLYRALGVPESRIAAE--GGFNL-GRSSEILRDELKFA 394 (558) T ss_pred hhhhHhhhcccccCCCCccceeeccccCCcchHHHH-HHHHHHHHHHhCCCccccCCC--Ccccc-cccchhhHHHHHHH Confidence 112222 1255666665 578899999999999999544 56663 322 456788 Q ss_pred HHHHHHHHHhhhHHHHHHHHH------hhcC-------CCceeEeCCCCCCCHHHHHHHHHHHHHHHHHHHh--CCCCCH Q lcl|NC_019725. 128 KLVDRKREEDYRPLLEFLLPF------IVEE-------EEWSIEFEPLSVPSKKEESEITKNNVESVTKAIT--EQIIDL 192 (237) Q Consensus 128 d~I~~~Qe~~l~p~l~~l~~~------i~~s-------~~~~~~f~pL~~~seke~Aei~~~~A~a~~~~~~--~g~i~~ 192 (237) ..|.++|.. +.+.+..+++. ++.. ++|.|+|+-=...+|-..++|...+..+++.+-. .-.+|. T Consensus 395 KFI~RLR~r-Fs~lF~~~Lk~qLilKgiit~eeW~~i~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~l~~~dpyvGky~S~ 473 (558) T protein:vir:10 395 KFVGRLRKR-FAAMFNDMLKTQLVLKNIVTPEDWKTMEDHIQYDFLYDNQFAELKESELMEGRLGMLATIEPYIGKYYST 473 (558) T ss_pred HHHHHHHHH-HHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhccccch Confidence 888888854 34444433332 2222 4788999999999999999999999988877633 223466 Q ss_pred HHHHHHHHhhcc--------------ccccCCCCCCChhcccc--------CCCCCCCCCCCcCcCC Q lcl|NC_019725. 193 EEARDTLRSIAP--------------EFKLKDGNNINIREPEE--------TTEPEPGLGEKLEDEN 237 (237) Q Consensus 193 ~e~r~~l~~~~~--------------~~g~~~~~~~~~~~~e~--------~~e~~~~~~~~~~~e~ 237 (237) +-+++..-.+.+ ..|.++. ++.+++.. ..+.+.....+.+++- T Consensus 474 dyi~k~ILr~tDeeI~~~~kqI~~E~k~~~~~~--p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 538 (558) T protein:vir:10 474 EYVRKRVLRQTDMEIEEIDTQIEDEIQKGIIPD--PSQIDPITGEPLPQEGDPAMEGMGEQPVDPDL 538 (558) T ss_pred HHHHHHHhccCHHHHHHHHHHHHHHHhCCCCCC--ccccChhhccccCccCCchhccCCCCCccccc Confidence 666654311111 2233321 22122111 0000011111111111 No 232 >protein:vir:77981 Length: 448 # NCBI annotation: portal protein # Family: family:all:2372 # MgeID: mge:1843 # MgeName: P23-45 # Cross-refs: genbank:acc:YP_001467939;genbank:gi:157265380;genbank:GeneID:5600471 Probab=42.03 E-value=0.9 Score=20.83 Aligned_cols=205 Identities=11% Similarity=0.063 Sum_probs=75.0 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHhc--cceeechhHHHhhcCCchHHHHHHHHHHHHHhc-CchheeeeecCCcceee Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRRKQ--QAVWKVKGLAEMCDDDDAQYAARLRLAQVDDNS-GVGRAIGIDAETEEYDV 77 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~~~--~~v~k~~~l~~~~~~~~~e~~~~~r~~~~~~~r-~~~~~~~iD~~~e~~~~ 77 (237) |+..||-.+.--..+..--+..+.++. +.|.|.+.-+ + +.+.....=.+++...+ +....++|. ++.+++. T Consensus 214 Llr~~~w~~~fK~~~~~~w~~f~E~yG~P~~vgky~~ga----~-~~~~~~~~l~~av~~i~~g~~a~~iiP-~g~~ie~ 287 (448) T protein:vir:77 214 ALRAAVPHWLAKRALILLINHGLERFMIGVPTLTIPKSV----R-QGTKQWEAAKEIVKNFVQKPRHGIILP-DDWKFDT 287 (448) T ss_pred HHHHHHHHHHHHHhhHHHHHHHHHHcCCceeEEecCCCC----C-CCHHHHHHHHHHHHHHhcCCceEEEec-CCceEEE Confidence 666666644444444444555666766 5566654211 1 11111111123333333 223334454 4678888 Q ss_pred eecCcCC--HHHHHHHHHHHHhhh-hcCceeeeeccCcccccccchhHH-HHHHHHHHHHHHHhhhHHH-HHHHHHhhc- Q lcl|NC_019725. 78 LNSDISG--VPEFLSSKMDRIVSL-SGIHEIIIKNKNVGGVSASQNTAL-ETFYKLVDRKREEDYRPLL-EFLLPFIVE- 151 (237) Q Consensus 78 ~~~~lsG--l~dl~~~~~~~iaa~-s~iP~t~L~G~sp~GlnatGe~D~-~nyyd~I~~~Qe~~l~p~l-~~l~~~i~~- 151 (237) +++.-++ ...+++..-..||-+ .|--+|- + +-+|-++...++. ....+.+.+.-+. +...+ +.|+.-++. T Consensus 288 ~ea~~~~~~~~~~i~~~d~~Isk~iLGqtlTs--~-~~~g~~~~~~~~~~~v~~~~~~aDa~~-i~~tln~~Li~~l~~l 363 (448) T protein:vir:77 288 VDLKSAMPDAIPYLTYHDAGIARALGIDFNTV--Q-LNMGVQAVNIGEFVSLTQQTIISLQRE-FASAVNLYLIPKLVLP 363 (448) T ss_pred EecCCCccCHHHHHHHHHHHHHHHHhcccccc--c-cccchhhhhhhhHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHh Confidence 8776332 233555545555532 2222221 1 2223222222222 2233334333322 22223 234443321 Q ss_pred C----CCc-eeEeCCCCCCCHHHHHHHHHHHHHHHHHHHhCCCCCHHHHHHHHHhhccccccCCCCCCChhccccCCCCC Q lcl|NC_019725. 152 E----EEW-SIEFEPLSVPSKKEESEITKNNVESVTKAITEQIIDLEEARDTLRSIAPEFKLKDGNNINIREPEETTEPE 226 (237) Q Consensus 152 s----~~~-~~~f~pL~~~seke~Aei~~~~A~a~~~~~~~g~i~~~e~r~~l~~~~~~~g~~~~~~~~~~~~e~~~e~~ 226 (237) . ... .|.|.-. |..|+ ++.|+++..+++ .+++.+ |+....+...+..+..+.+. T Consensus 364 Nfg~~~~~P~~~f~~~------e~eDl-~~~a~~~~~l~~-------~~~~~~-------~ip~~~~~~~~~~~~~~~~~ 422 (448) T protein:vir:77 364 NWPGATRFPRLTFEME------ERNDF-SAAANLMGMLIN-------AVKDSE-------DIPTELKALIDALPSKMRRA 422 (448) T ss_pred cCCCCCCCCEEEecCC------ChhhH-HHHHHHhHHHHH-------HHHHHh-------cCCccCCcCCCCCchhcccc Confidence 1 111 3444322 22333 235666666653 233332 11100000011111111111 Q ss_pred CCCCCCcCcCC Q lcl|NC_019725. 227 PGLGEKLEDEN 237 (237) Q Consensus 227 ~~~~~~~~~e~ 237 (237) ++.-+ .+.+. T Consensus 423 ~~~~~-~~~~~ 432 (448) T protein:vir:77 423 LGVVD-EVREA 432 (448) T ss_pred cCCCC-CCCch Confidence 11111 11111 No 233 >protein:vir:106999 Length: 564 # NCBI annotation: portal vertex protein gp20 # Family: family:all:1036 # MgeID: mge:1459 # MgeName: S-PM2 # Cross-refs: genbank:acc:YP_195138;genbank:gi:58532915;interpro:IPR010823;uniprot:Q5GQN4;genbank:GeneID:3260496 Probab=40.12 E-value=0.99 Score=20.62 Aligned_cols=216 Identities=13% Similarity=0.151 Sum_probs=100.6 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHhccceeechhHHHhhcCCchHHHHHHHHHHHHHhcCchheeeeecCC-------- Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRRKQQAVWKVKGLAEMCDDDDAQYAARLRLAQVDDNSGVGRAIGIDAET-------- 72 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~~~~~v~k~~~l~~~~~~~~~e~~~~~r~~~~~~~r~~~~~~~iD~~~-------- 72 (237) =|..+-|+++.|--++.- + -+||=++ ..+|-. .+.+.=++ .+++.+| +=++.|+.+ T Consensus 257 QLkmlEDAlVIYRitRAP------e--RRvFYID-VGnLPk-~KAeqYlr---~iM~k~K---NklVYDa~TGevrddrk 320 (564) T protein:vir:10 257 QLRMIEDSLVIYRLSRAP------E--RRIFYID-VGNLPK-VKAEQYLR---DVMSRYR---NKLVYDGQTGEIRDDKK 320 (564) T ss_pred hhHHHHhhHHHHhhhccc------c--ceEEEEe-cCCCCc-hhHHHHHH---HHHHhcC---ceEEEeccCceecccch Confidence 455666776666544321 1 1223221 222211 11111111 1122222 113333322 Q ss_pred ------------------cceeee--ecCcCCHHHHHHHHHHHHhhhhcCceeeeeccCccccc--ccc--hhHHHHHHH Q lcl|NC_019725. 73 ------------------EEYDVL--NSDISGVPEFLSSKMDRIVSLSGIHEIIIKNKNVGGVS--ASQ--NTALETFYK 128 (237) Q Consensus 73 ------------------e~~~~~--~~~lsGl~dl~~~~~~~iaa~s~iP~t~L~G~sp~Gln--atG--e~D~~nyyd 128 (237) =+++.+ .-+|+-++|| ..|...+=.+.++|++||-.+ .+|+| .++ .-|.-.|.. T Consensus 321 ~msMlEDyWLPRReGgrgTEItTLpGgqnLgem~DV-~YF~kKLY~aLnVP~SRl~~e-~~~f~~Gr~~EItRDEiKF~K 398 (564) T protein:vir:10 321 HMSMLEDFWLPRREGGRGTEITTLPGGQNLGELKDV-EYFKKKLYNSLNLPPSRLTDD-NKAFNLGKSTEILRDELKFTK 398 (564) T ss_pred hhhhHhhhcccccCCCcccceeeccccCCcchHHHH-HHHHHHHHHHhCCCcccccCC-CceeecccccchhHHHHHHHH Confidence 112222 1255666665 578899999999999999665 44554 333 234567888 Q ss_pred HHHHHHHHhhhHHHHHHHHH------hhcC-------CCceeEeCCCCCCCHHHHHHHHHHHHHHHHHHHh--CCCCCHH Q lcl|NC_019725. 129 LVDRKREEDYRPLLEFLLPF------IVEE-------EEWSIEFEPLSVPSKKEESEITKNNVESVTKAIT--EQIIDLE 193 (237) Q Consensus 129 ~I~~~Qe~~l~p~l~~l~~~------i~~s-------~~~~~~f~pL~~~seke~Aei~~~~A~a~~~~~~--~g~i~~~ 193 (237) .|.++|.. +.+.+..+++. ++.. ++|.|+|.-=...+|-..++|...+..+++.+-. .-.+|.+ T Consensus 399 FI~RLR~r-Fs~lF~~~Lk~qLiLKgiit~eeW~~i~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~l~~~dpyvGky~S~d 477 (564) T protein:vir:10 399 FIGRLRKR-FAQLFHDILKTQLILKGIITPEDWDDMEEHIQYDFLFDNHFNELKEQEMQLQRVNLATQMDPFVGKYFSTE 477 (564) T ss_pred HHHHHHHH-HHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhccccchH Confidence 88888854 34444433332 2222 4688999998899999999999998888776521 2234555 Q ss_pred HHHHHHHhhc------------c--ccccCCCCCCChhcc-------------------------------ccCCCCCCC Q lcl|NC_019725. 194 EARDTLRSIA------------P--EFKLKDGNNINIREP-------------------------------EETTEPEPG 228 (237) Q Consensus 194 e~r~~l~~~~------------~--~~g~~~~~~~~~~~~-------------------------------e~~~e~~~~ 228 (237) -+++.+-.+. . ..|++. ++..++. +.+++++|+ T Consensus 478 yi~k~ILr~tDeei~~~~kqI~~E~k~~~~~--~P~e~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~a~~~~~~ 555 (564) T protein:vir:10 478 YIRRKILMQTENEFKEIDKQMKSDIESGLAI--DPIQVNMLDDMEKQNQAFAPELQAAQDDLAAEREIKKLNSAPKPPPS 555 (564) T ss_pred HHHHHHhccCHHHHHHHHHHHHHHhhcCCCC--CchhhhcCCCccCCCCcCCcchhhhccccccccChhhhccCCCCCCC Confidence 5554321111 0 112211 1100000 001111111 Q ss_pred CCCCcCcCC Q lcl|NC_019725. 229 LGEKLEDEN 237 (237) Q Consensus 229 ~~~~~~~e~ 237 (237) ....+++-- T Consensus 556 ~~~~~~~~~ 564 (564) T protein:vir:10 556 QQSKSQSNK 564 (564) T ss_pred CCCcCcCCC Confidence 110000000 No 234 >protein:vir:5665 Length: 511 # NCBI annotation: portal vertex protein of head # Family: family:all:1036 # MgeID: mge:119 # MgeName: KVP40 # Cross-refs: genbank:acc:NP_899604;genbank:gi:34419591;genbank:GeneID:2546036 Probab=36.56 E-value=1.2 Score=20.22 Aligned_cols=203 Identities=17% Similarity=0.238 Sum_probs=104.6 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHhccceeechhHHHhhcCCchHHHHHHHHHH-HHHhcCchheeeeecCC------- Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRRKQQAVWKVKGLAEMCDDDDAQYAARLRLAQ-VDDNSGVGRAIGIDAET------- 72 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~~~~~v~k~~~l~~~~~~~~~e~~~~~r~~~-~~~~r~~~~~~~iD~~~------- 72 (237) =|..+-|++..|--++.- + -+||=++ ..+|-. .+.+ +.+.- ++.+|+ =++.|+.+ T Consensus 252 QLkm~EDAlVIYRitRAP------e--RRvFYID-VGnLPk-~KAe----qYl~~iM~k~kN---klVYDa~TGev~ddr 314 (511) T protein:vir:56 252 QLKMLEDALVIYRLARAP------E--RRVFYVD-VGNLPT-QKAQ----QYVNGIMQNVKN---RVVYDTQTGQVKNTT 314 (511) T ss_pred hhHHHHhhHHHHhhhccc------c--ceEEEEe-cCCCCc-hhHH----HHHHHHHHhcCc---eEEEeccCceeccch Confidence 455566666655443321 1 1233232 222211 1111 12211 111211 13333322 Q ss_pred -------------------cceeeee--cCcCCHHHHHHHHHHHHhhhhcCceeeeecc-Cccccc-c-cc--hhHHHHH Q lcl|NC_019725. 73 -------------------EEYDVLN--SDISGVPEFLSSKMDRIVSLSGIHEIIIKNK-NVGGVS-A-SQ--NTALETF 126 (237) Q Consensus 73 -------------------e~~~~~~--~~lsGl~dl~~~~~~~iaa~s~iP~t~L~G~-sp~Gln-a-tG--e~D~~ny 126 (237) =+++.+. -+|+-++|| ..|...+=.+.++|++||-.. +++|+| + ++ .-|.-.| T Consensus 315 k~msMlEDyWLpRReGgrgTEItTLpGgqnlgem~DV-~YF~kKLy~aLnVP~SRl~~e~q~~~f~~Gr~~EItRDEiKF 393 (511) T protein:vir:56 315 NAMSMLEDYYLPRREGSKGTEVSTLPGGQSLGDIEDV-LYFNRKLYKAMRIPTSRAASEDQTGGINFGQGAEITRDELKF 393 (511) T ss_pred hhhhhHhhhcccccCCCCccceeeccccCCcChHHHH-HHHHHHHHHHhCCCcccccCCCCccccccccchhhhHHHHHH Confidence 1122221 235555554 688899999999999999855 446776 1 11 2355678 Q ss_pred HHHHHHHHHHhhhHHHHHHHHH------hhcC-------CCceeEeCCCCCCCHHHHHHHHHHHHHHHHHHHh--CCCCC Q lcl|NC_019725. 127 YKLVDRKREEDYRPLLEFLLPF------IVEE-------EEWSIEFEPLSVPSKKEESEITKNNVESVTKAIT--EQIID 191 (237) Q Consensus 127 yd~I~~~Qe~~l~p~l~~l~~~------i~~s-------~~~~~~f~pL~~~seke~Aei~~~~A~a~~~~~~--~g~i~ 191 (237) ...|.++|.. +.+.+..+++. ++.. ++|.|+|+-=...+|-..+||...+..+++.+-. .-.+| T Consensus 394 ~KFI~RLR~r-Fs~lF~~~Lk~qLilKgiit~eeW~~i~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~l~~~dpyvGky~S 472 (511) T protein:vir:56 394 TKFVKRLQTK-FETVITDPLKHQLIVNNIITEEEWDANHEKLYVVFNQDSYFEEAKELEILNSRMNAMRDIQDYAGKYYS 472 (511) T ss_pred HHHHHHHHHH-HHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhcchhccccc Confidence 8888888854 44444433332 2222 4788999999999999999999999998887633 22468 Q ss_pred HHHHHHHHHhhccccccCCCCCCCh--hccc-cCCC-----CCCCC Q lcl|NC_019725. 192 LEEARDTLRSIAPEFKLKDGNNINI--REPE-ETTE-----PEPGL 229 (237) Q Consensus 192 ~~e~r~~l~~~~~~~g~~~~~~~~~--~~~e-~~~e-----~~~~~ 229 (237) .+-+++..-.+.+ ..+.. +.++ +.++ ++.+. T Consensus 473 ~~yi~k~ILr~tD-------eei~~~~k~I~~E~k~~~~~~~e~~f 511 (511) T protein:vir:56 473 HKYIQKNILRLSD-------DQITAMQSEIDEEETNPRFQQDDQGF 511 (511) T ss_pred hHHHHHHHhccCH-------HHHHHHHHHHHHhhcCCCCCCcccCC Confidence 8888876422221 11111 1111 1111 22222 No 235 >protein:vir:98265 Length: 524 # NCBI annotation: gp20 portal vertex of the head # Family: family:all:1036 # MgeID: mge:1667 # MgeName: RB43 # Cross-refs: genbank:acc:YP_239198;genbank:gi:66391673;genbank:GeneID:3416367 Probab=31.62 E-value=1.5 Score=19.64 Aligned_cols=211 Identities=14% Similarity=0.162 Sum_probs=104.6 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHhccceeechhHHHhhcCCchHHHHHHHHHHHHHhcCchheeeeecCC-------- Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRRKQQAVWKVKGLAEMCDDDDAQYAARLRLAQVDDNSGVGRAIGIDAET-------- 72 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~~~~~v~k~~~l~~~~~~~~~e~~~~~r~~~~~~~r~~~~~~~iD~~~-------- 72 (237) =|..+-|++..|--++.- + -+||=++ ..+|-. .+.+.=++ .+++.+|+ =++.|+.. T Consensus 264 QLkm~EDAlVIYRitRAP------e--RRvFYID-vGnlPk-~KAeqYl~---~im~k~kN---klvYDa~TGevrddrk 327 (524) T protein:vir:98 264 QLRLLEDAMVIYRITRAP------E--RRVFYID-VGQMGG-NKATQYVN---NIAQGLKN---RVVYDARTGTVKNQQN 327 (524) T ss_pred hhHHHHhhHHHHhhhccc------c--ceEEEEe-cCCCCc-hhHHHHHH---HHHHhcCc---eeEeeccCceeecccc Confidence 455566666665544321 1 1233332 222211 11111111 11222221 12233221 Q ss_pred ------------------cceeeee--cCcCCHHHHHHHHHHHHhhhhcCceeeee-ccCcccccccc--hhHHHHHHHH Q lcl|NC_019725. 73 ------------------EEYDVLN--SDISGVPEFLSSKMDRIVSLSGIHEIIIK-NKNVGGVSASQ--NTALETFYKL 129 (237) Q Consensus 73 ------------------e~~~~~~--~~lsGl~dl~~~~~~~iaa~s~iP~t~L~-G~sp~GlnatG--e~D~~nyyd~ 129 (237) =+++.+. -+|+-++|| ..|+..+=.+.++|++||- ..+.-.++.++ .-|.-.|... T Consensus 328 ~msMlEDyWLpRReGgrgTEItTLpggqnlgem~DV-~YF~kkLy~aLnVP~sRl~~~~~~f~~Gr~~EItRDEiKF~KF 406 (524) T protein:vir:98 328 NLSMTEDYWLMRRDGKAITEVSTLPGGQNFSDMDDI-KWFNRKLYEALRVPLSRMPRDDGGMQIGGGGEITRDELKFSKF 406 (524) T ss_pred ccchhhhhcccccCCCCccceeeccccCCcChHHHH-HHHHHHHHHHhCCCceeccCCCCccccccccchhHHHHHHHHH Confidence 1222221 235555554 6888999999999999994 33222233332 2355678888 Q ss_pred HHHHHHHhhhHHHHHHHHH------hhcC-------CCceeEeCCCCCCCHHHHHHHHHHHHHHHHHHHh--CCCCCHHH Q lcl|NC_019725. 130 VDRKREEDYRPLLEFLLPF------IVEE-------EEWSIEFEPLSVPSKKEESEITKNNVESVTKAIT--EQIIDLEE 194 (237) Q Consensus 130 I~~~Qe~~l~p~l~~l~~~------i~~s-------~~~~~~f~pL~~~seke~Aei~~~~A~a~~~~~~--~g~i~~~e 194 (237) |.++|.. +.+.+..+++. ++.. +.|.|+|.-=...+|-..+||...+..+++.+-. .-.++.+- T Consensus 407 I~rLR~r-Fs~lf~~~L~~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~~~dpyvGky~s~dy 485 (524) T protein:vir:98 407 IRTLQIQ-FSPVLSDPLKTNLIAKKIITEDEWEENVSKISFVFQQDSYYAEVKDIEILERRLNLMSQVEGVVGKYVSHKY 485 (524) T ss_pred HHHHHHH-HHHHHHHHHHHhhhhhcCCCHHHHHHHhhcceEEEeecchHHHHHHHHHHHHHHHHHHHhccccccccchHH Confidence 8888854 33433333332 2322 3678999999999999999999999998887754 23678888 Q ss_pred HHHHHHhhccccccCCCCCCCh--hccccCCCCCCCCCCCcCcCC Q lcl|NC_019725. 195 ARDTLRSIAPEFKLKDGNNINI--REPEETTEPEPGLGEKLEDEN 237 (237) Q Consensus 195 ~r~~l~~~~~~~g~~~~~~~~~--~~~e~~~e~~~~~~~~~~~e~ 237 (237) +++..-.+.+ ..+.. ..+++.. .++-.-++++++. T Consensus 486 i~k~ILr~tD-------eei~~~~k~I~~E~-k~~~~~~p~~e~~ 522 (524) T protein:vir:98 486 IMKEILRMSD-------EDIDEQAKLIEEES-KEERFKNPEAEEE 522 (524) T ss_pred HHHHHhccCH-------HHHHHHHHHHHHHH-hCCCCcCCccccc Confidence 8775422211 11111 1111111 1111122222222 No 236 >protein:vir:106282 Length: 521 # NCBI annotation: gp20 portal vertex protein of head # Family: family:all:1036 # MgeID: mge:1474 # MgeName: Aeh1 # Cross-refs: genbank:acc:NP_944108;genbank:gi:38640152;genbank:GeneID:2658030 Probab=25.93 E-value=2 Score=18.93 Aligned_cols=209 Identities=14% Similarity=0.160 Sum_probs=105.9 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHhccceeechhHHHhhcCCchHHHHHHHHHH-HHHhcCchheeeeecCC------- Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRRKQQAVWKVKGLAEMCDDDDAQYAARLRLAQ-VDDNSGVGRAIGIDAET------- 72 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~~~~~v~k~~~l~~~~~~~~~e~~~~~r~~~-~~~~r~~~~~~~iD~~~------- 72 (237) =|..+-|++..|--++.- + -+||=++ ..+|-. .+.+ +.+.- ++.+|+ =++.|+.+ T Consensus 259 QLkm~EDAlVIYRitRAP------e--RRvFYID-vGnlpk-~KAe----qYl~~iM~k~kN---klVYDa~TGev~ddr 321 (521) T protein:vir:10 259 QLKMLEDAMVIYRITRAP------E--RRVFYID-VGTMPN-KKAT----QHLNNVMQGLKN---RVVYDSSTGKVKNSS 321 (521) T ss_pred hhHHHHhhHHHHhhhccc------c--ceEEEEe-cCCCCc-hhHH----HHHHHHHHhcCc---eEEEeccCceeccch Confidence 455666777666544321 1 1223221 222211 1111 11111 111111 13333322 Q ss_pred -------------------cceeeee--cCcCCHHHHHHHHHHHHhhhhcCceeeeeccCccccc--ccc--hhHHHHHH Q lcl|NC_019725. 73 -------------------EEYDVLN--SDISGVPEFLSSKMDRIVSLSGIHEIIIKNKNVGGVS--ASQ--NTALETFY 127 (237) Q Consensus 73 -------------------e~~~~~~--~~lsGl~dl~~~~~~~iaa~s~iP~t~L~G~sp~Gln--atG--e~D~~nyy 127 (237) =+++.+. -+|+-++|| ..|+..+=.+.++|++||=.+ .+|+| .++ .-|.-.|. T Consensus 322 k~msMlEDyWLpRReGgrgTEI~TLpggqnlgem~DV-~YF~kkLy~aLnVP~sRl~~e-~~~f~~Gr~~EItRDEikF~ 399 (521) T protein:vir:10 322 NNLAMTEDYWLMRRDGKATTEVSTLPGAQSMGEMDDV-RWFNRKLYESMKIPLSRLPQE-GAGVTFGAGNDITRDELQFT 399 (521) T ss_pred hhhhhHhhhcccccCCCCccceeeccccCCcChHHHH-HHHHHHHHHHhCCCccccCCC-CCceecccccchhHHHHHHH Confidence 1122221 235555554 688899999999999998544 45544 332 23456788 Q ss_pred HHHHHHHHHhhhHHHHHHHHH------hhcC-------CCceeEeCCCCCCCHHHHHHHHHHHHHHHHHHHh----CCCC Q lcl|NC_019725. 128 KLVDRKREEDYRPLLEFLLPF------IVEE-------EEWSIEFEPLSVPSKKEESEITKNNVESVTKAIT----EQII 190 (237) Q Consensus 128 d~I~~~Qe~~l~p~l~~l~~~------i~~s-------~~~~~~f~pL~~~seke~Aei~~~~A~a~~~~~~----~g~i 190 (237) ..|.++|.. +.+.+..+++. ++.. ++|.|+|+-=...+|-..++|...+..+++.+-- .-.+ T Consensus 400 KFI~rLR~r-Fs~~f~~~L~~qLilKgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~eil~~R~~~l~~~dp~~yvGky~ 478 (521) T protein:vir:10 400 KYIRGLQQQ-FEPIFLNPLRTNLMLKGKMSVSEWEEQAENIKVVFSKDSYYEEIKDVEILERRVNLVQTLASAEVTGKYL 478 (521) T ss_pred HHHHHHHHH-HHHHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHhhcCcccccccc Confidence 888888854 44444433332 2222 4788999999999999999999999999887733 3367 Q ss_pred CHHHHHHHHHhhccccccCCCCCCCh--hccccCCCCCCCCCCCcCcCC Q lcl|NC_019725. 191 DLEEARDTLRSIAPEFKLKDGNNINI--REPEETTEPEPGLGEKLEDEN 237 (237) Q Consensus 191 ~~~e~r~~l~~~~~~~g~~~~~~~~~--~~~e~~~e~~~~~~~~~~~e~ 237 (237) +.+-+++..-.+.+ .++.. +.+++.. .+|-.-++.+.+. T Consensus 479 s~dyi~k~ILr~tD-------eeik~~~k~I~~E~-~~~~~~~p~~e~~ 519 (521) T protein:vir:10 479 SHEYVMKNILRMSD-------EDIKTEREKIDGEL-KDSVYKNPEDPME 519 (521) T ss_pred chHHHHHHHhcCCH-------hHHHHHHHHHHHhh-hCCCCCCCcchhh Confidence 88888876432221 11221 1111111 1111112222212 No 237 >protein:vir:80165 Length: 651 # NCBI annotation: portal protein # Family: family:all:1548 # MgeID: mge:1878 # MgeName: Pf-WMP3 # Cross-refs: genbank:acc:YP_001285799;genbank:gi:148747833;genbank:GeneID:5220441 Probab=24.67 E-value=2.1 Score=18.76 Aligned_cols=218 Identities=10% Similarity=0.073 Sum_probs=99.7 Q ss_pred CchhHHHHHHHHHHHHHHHHHHHHHhccceeechhHHHhhcCCchHHHHHHHHHHHHHhcCchheeeeecCCcceeeee- Q lcl|NC_019725. 1 MNKSLIDAICDYDYCESLATQILRRKQQAVWKVKGLAEMCDDDDAQYAARLRLAQVDDNSGVGRAIGIDAETEEYDVLN- 79 (237) Q Consensus 1 llq~~~d~v~~~~~~~~~~~~Ll~~~~~~v~k~~~l~~~~~~~~~e~~~~~r~~~~~~~r~~~~~~~iD~~~e~~~~~~- 79 (237) ..+.+.+.............+.+..+.-..+.+..-+ .. ..-.+ . . .-.+.+.+. ..+++..++ T Consensus 341 ~~~~~~~~q~~ln~l~~~~ld~~~~~~~~~~~v~~d~-~~----------~~~~l-~-~-~pg~vi~~~-~~~~~~~l~~ 405 (651) T protein:vir:80 341 ALQPNLGMLHELNIITNQRLDNLELAIDQMYTLRSDG-LL----------QPEDV-Y-T-EPGKVFLVS-DHGDLQPLAN 405 (651) T ss_pred hHHHHhHHHHHHHHHHHHHHHHHHHHhCCcEEecCCc-cc----------cHHHh-h-c-CCCceEEec-CCCCceeecc Confidence 7888888888888888888888888877777664211 10 00011 0 0 123444443 345555443 Q ss_pred --cCcCCHHHHHHHHHHHHhhhhcCceeeeeccCc---ccccccchhHH-----HHHHHHHHHHHHHhhhHHHHHHHHHh Q lcl|NC_019725. 80 --SDISGVPEFLSSKMDRIVSLSGIHEIIIKNKNV---GGVSASQNTAL-----ETFYKLVDRKREEDYRPLLEFLLPFI 149 (237) Q Consensus 80 --~~lsGl~dl~~~~~~~iaa~s~iP~t~L~G~sp---~GlnatGe~D~-----~nyyd~I~~~Qe~~l~p~l~~l~~~i 149 (237) .++.+...++......+.-.+++|- ...|.+| +-.+||+=.-+ ...-..++.+++..++|.+++++.++ T Consensus 406 ~~~~~~~~~~~l~~l~~~~~~~~gv~~-~~~g~~~~~~~~~TAteI~~~~~~~~~~l~~v~~~l~~e~l~pl~~r~l~l~ 484 (651) T protein:vir:80 406 QSSNFSITYQESSFLESTIDKNFGTGN-YVGANAARSGERVTAAEVAAVREAGGNRLSGIHKHIEETSLLVLLEKVMHLV 484 (651) T ss_pred CcccchhHHHHHHHHHHHHHHHhcCCh-HHhCCCccchhhccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 3445666778888888888888873 3344333 33455541111 22333445556667899999999998 Q ss_pred hcCC------------------------CceeEeCCCCCCCHHHHHHHHHHHHHHHHHHHhCCCCCHH-----HHHHHHH Q lcl|NC_019725. 150 VEEE------------------------EWSIEFEPLSVPSKKEESEITKNNVESVTKAITEQIIDLE-----EARDTLR 200 (237) Q Consensus 150 ~~s~------------------------~~~~~f~pL~~~seke~Aei~~~~A~a~~~~~~~g~i~~~-----e~r~~l~ 200 (237) +..- ++..+|.- -..+...+.+.....++..+ ..+.+.-.|. ...+.++ T Consensus 485 ~~~~~~~~~~ri~~~~~~~~~~~~i~~~dl~~~~~i-v~~g~~~~~~r~~~~~~l~~-~~q~~~~~p~~~~~~~~~~~~~ 562 (651) T protein:vir:80 485 QQFTDQPGMVRVAGDEAGAYEYYELDVEDLQKEVRL-VPIGSDHVIERKQYIEDRLT-FIQAVAQVPEMGQLVDYKRILV 562 (651) T ss_pred HHhcCcccceeecccccccccccccCccceeeeeee-eeccHHHHHHHHHHHHHHHH-HHHhhccCCccchhhhHHHHHH Confidence 7531 22222211 11233333333333333333 3333322221 1222222 Q ss_pred hhccccccCCCCCCChhccccCCCCCCCCCCCcCc-------CC Q lcl|NC_019725. 201 SIAPEFKLKDGNNINIREPEETTEPEPGLGEKLED-------EN 237 (237) Q Consensus 201 ~~~~~~g~~~~~~~~~~~~e~~~e~~~~~~~~~~~-------e~ 237 (237) ......|+...... ...+++...+.+.+..-.+. +. T Consensus 563 ~l~~~~g~~~~~~~-l~~~~q~~~~~~~~~~~~q~~~~~~~a~~ 605 (651) T protein:vir:80 563 DLLQHWGFEEPEAY-LKQQDQQAPANPQEALLSQAKDVGGQAMS 605 (651) T ss_pred HHHHHcCCCCcHHh-cCCCccchhhhhhHHHHhhHHHHHHHHHH Confidence 22223444211110 00111111111100000011 11 Done!