Query lcl|NC_015285.1_cdsid_YP_004323727.1 [gene=gp20] [protein=portal vertex protein of head] [protein_id=YP_004323727.1] [location=108684..109763] Match_columns 359 No_of_seqs 74 out of 86 Neff 3.9 Searched_HMMs 1612 Date Thu Nov 7 14:46:34 2013 Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_116 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_116_vs_rec_db.hhr No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 protein:vir:103177 Length: 533 100.0 3E-203 2E-206 1131.3 32.3 358 1-359 169-533 (533) 2 protein:vir:104500 Length: 537 100.0 3E-202 2E-205 1125.4 30.9 359 1-359 177-537 (537) 3 protein:vir:104892 Length: 558 100.0 6E-198 4E-201 1101.9 30.2 359 1-359 181-558 (558) 4 protein:vir:6896 Length: 523 # 100.0 9E-198 5E-201 1101.0 30.0 327 1-327 189-523 (523) 5 protein:vir:106999 Length: 564 100.0 3E-197 2E-200 1098.2 30.4 359 1-359 169-560 (564) 6 protein:vir:103458 Length: 524 100.0 3E-197 2E-200 1098.4 29.8 327 1-327 189-524 (524) 7 protein:vir:7208 Length: 524 # 100.0 3E-197 2E-200 1098.3 29.8 327 1-327 189-524 (524) 8 protein:vir:108049 Length: 524 100.0 2E-196 1E-199 1093.9 30.1 327 1-327 189-524 (524) 9 protein:vir:81017 Length: 521 100.0 1E-196 8E-200 1094.6 28.5 327 1-327 186-521 (521) 10 protein:vir:106282 Length: 521 100.0 3E-196 2E-199 1092.4 29.5 327 1-327 187-521 (521) 11 protein:vir:101806 Length: 516 100.0 9E-196 5E-199 1090.0 29.1 326 1-326 182-516 (516) 12 protein:vir:101189 Length: 516 100.0 9E-196 5E-199 1090.0 29.1 326 1-326 182-516 (516) 13 protein:vir:6596 Length: 521 # 100.0 1E-195 7E-199 1089.3 28.6 327 1-327 186-521 (521) 14 protein:vir:98265 Length: 524 100.0 3E-195 2E-198 1087.1 29.5 326 1-327 190-524 (524) 15 protein:vir:100598 Length: 516 100.0 1E-194 7E-198 1083.9 29.1 326 1-326 182-516 (516) 16 protein:vir:5665 Length: 511 # 100.0 4E-193 2E-196 1075.5 29.0 324 1-324 175-511 (511) 17 protein:vir:5839 Length: 533 # 100.0 7E-168 5E-171 936.8 25.5 340 1-359 147-513 (533) 18 protein:vir:107742 Length: 537 98.3 2.5E-07 1.6E-10 56.8 17.5 294 1-357 184-537 (537) 19 protein:vir:2500 Length: 501 # 98.2 1.8E-06 1.1E-09 52.1 19.5 298 1-355 160-501 (501) 20 protein:vir:99916 Length: 504 98.1 2.5E-06 1.5E-09 51.3 18.8 309 1-357 155-504 (504) 21 protein:vir:96068 Length: 765 98.0 2.8E-06 1.7E-09 51.0 17.8 296 1-359 188-553 (765) 22 protein:vir:94049 Length: 532 97.9 9.9E-06 6.1E-09 48.0 18.3 311 1-359 163-532 (532) 23 protein:vir:99563 Length: 862 97.9 2.1E-06 1.3E-09 51.7 14.2 308 1-359 215-598 (862) 24 protein:vir:105782 Length: 449 97.7 2.9E-05 1.8E-08 45.5 18.5 264 1-348 132-449 (449) 25 protein:vir:7768 Length: 484 # 97.4 6.7E-05 4.2E-08 43.5 17.5 307 1-357 152-484 (484) 26 protein:vir:104338 Length: 422 97.3 9.8E-05 6.1E-08 42.6 17.7 275 1-353 101-422 (422) 27 protein:vir:1587 Length: 508 # 97.1 0.00017 1.1E-07 41.2 18.1 288 1-339 174-508 (508) 28 protein:vir:2427 Length: 485 # 97.0 0.00019 1.2E-07 40.9 18.4 307 1-354 153-485 (485) 29 protein:vir:79538 Length: 502 96.9 0.00026 1.6E-07 40.2 21.8 310 1-356 176-502 (502) 30 protein:vir:104082 Length: 485 96.9 0.00028 1.7E-07 40.1 18.7 307 1-355 153-485 (485) 31 protein:vir:4223 Length: 486 # 96.8 0.0003 1.9E-07 39.9 17.4 304 1-354 153-486 (486) 32 protein:vir:2341 Length: 488 # 96.8 0.00031 1.9E-07 39.8 19.4 306 1-354 158-488 (488) 33 protein:vir:78227 Length: 480 96.7 0.00041 2.5E-07 39.2 18.5 303 1-359 141-479 (480) 34 protein:vir:79772 Length: 648 96.5 0.00052 3.2E-07 38.6 22.1 311 1-359 165-520 (648) 35 protein:vir:99072 Length: 479 96.5 0.00059 3.6E-07 38.3 19.3 300 1-358 154-479 (479) 36 protein:vir:101418 Length: 569 96.4 0.00065 4E-07 38.1 16.0 318 1-356 199-569 (569) 37 protein:vir:80040 Length: 461 96.4 0.00071 4.4E-07 37.9 17.8 275 1-329 122-461 (461) 38 protein:vir:107662 Length: 427 96.4 0.00071 4.4E-07 37.8 17.6 278 1-353 102-427 (427) 39 protein:vir:8184 Length: 474 # 96.1 0.00097 6E-07 37.1 17.0 282 1-341 149-474 (474) 40 protein:vir:78537 Length: 480 95.8 0.0015 9.4E-07 36.1 18.1 305 1-358 141-480 (480) 41 protein:vir:79703 Length: 505 95.7 0.0015 9.5E-07 36.0 16.9 293 1-333 172-505 (505) 42 protein:vir:98444 Length: 434 95.7 0.0016 9.9E-07 35.9 20.5 292 1-357 107-434 (434) 43 protein:vir:10321 Length: 495 95.5 0.0019 1.2E-06 35.5 19.9 306 1-358 166-495 (495) 44 protein:vir:78907 Length: 518 95.2 0.0025 1.6E-06 34.8 18.7 294 1-335 186-518 (518) 45 protein:vir:5249 Length: 437 # 94.9 0.0032 2E-06 34.2 23.3 289 1-352 108-437 (437) 46 protein:vir:80959 Length: 499 94.8 0.0034 2.1E-06 34.1 20.2 285 1-341 174-499 (499) 47 protein:vir:98883 Length: 517 94.5 0.0043 2.7E-06 33.6 20.0 282 1-341 186-517 (517) 48 protein:vir:9815 Length: 500 # 94.4 0.0047 2.9E-06 33.4 19.8 284 1-339 171-500 (500) 49 protein:vir:3028 Length: 500 # 94.4 0.0047 2.9E-06 33.4 19.8 284 1-339 171-500 (500) 50 protein:vir:102727 Length: 945 94.1 0.0053 3.3E-06 33.1 14.8 298 1-359 206-541 (945) 51 protein:vir:79647 Length: 435 94.0 0.0056 3.4E-06 32.9 16.7 272 1-336 113-435 (435) 52 protein:vir:103219 Length: 201 93.8 0.0052 3.2E-06 33.1 11.3 189 89-354 1-201 (201) 53 protein:vir:80680 Length: 441 93.7 0.0066 4.1E-06 32.5 15.0 286 1-345 132-441 (441) 54 protein:vir:4782 Length: 522 # 93.6 0.0072 4.4E-06 32.3 20.3 294 1-357 190-522 (522) 55 protein:vir:96738 Length: 505 93.4 0.0075 4.7E-06 32.2 21.4 290 1-354 187-505 (505) 56 protein:vir:9568 Length: 410 # 93.3 0.0079 4.9E-06 32.1 17.4 265 1-316 117-410 (410) 57 protein:vir:99522 Length: 470 92.9 0.0096 6E-06 31.6 19.3 292 1-341 158-470 (470) 58 protein:vir:94742 Length: 409 92.8 0.0098 6.1E-06 31.6 16.1 254 1-300 129-409 (409) 59 protein:vir:78641 Length: 278 92.6 0.011 6.7E-06 31.4 15.3 206 1-256 54-278 (278) 60 protein:vir:9922 Length: 489 # 91.1 0.018 1.1E-05 30.2 17.4 306 1-351 153-489 (489) 61 protein:vir:6382 Length: 553 # 90.8 0.019 1.2E-05 30.1 22.7 323 1-358 184-553 (553) 62 protein:vir:7853 Length: 518 # 90.8 0.019 1.2E-05 30.0 18.6 293 1-359 121-468 (518) 63 protein:vir:106571 Length: 499 88.8 0.03 1.9E-05 28.9 18.1 296 1-357 165-499 (499) 64 protein:vir:95542 Length: 548 88.0 0.035 2.2E-05 28.6 22.3 310 1-359 176-543 (548) 65 protein:vir:5961 Length: 503 # 84.5 0.06 3.7E-05 27.3 18.9 303 1-357 168-503 (503) 66 protein:vir:3609 Length: 452 # 83.6 0.067 4.2E-05 27.0 17.8 283 1-346 149-452 (452) 67 protein:vir:96980 Length: 409 83.5 0.068 4.2E-05 27.0 15.8 278 1-357 107-409 (409) 68 protein:vir:38 Length: 496 # N 83.1 0.071 4.4E-05 26.9 20.3 287 1-341 182-496 (496) 69 protein:vir:81218 Length: 423 82.6 0.075 4.7E-05 26.7 17.9 281 1-357 119-423 (423) 70 protein:vir:105889 Length: 474 82.4 0.077 4.8E-05 26.7 17.9 273 1-348 185-474 (474) 71 protein:vir:94101 Length: 474 82.4 0.077 4.8E-05 26.7 17.9 273 1-348 185-474 (474) 72 protein:vir:1634 Length: 409 # 81.6 0.084 5.2E-05 26.5 15.6 257 1-300 129-409 (409) 73 protein:vir:7987 Length: 456 # 80.6 0.093 5.8E-05 26.2 16.1 277 1-346 141-456 (456) 74 protein:vir:102080 Length: 429 78.6 0.11 7E-05 25.8 14.0 286 1-358 113-429 (429) 75 protein:vir:96179 Length: 468 78.5 0.11 7.1E-05 25.8 18.4 271 1-339 181-468 (468) 76 protein:vir:1236 Length: 483 # 77.5 0.12 7.7E-05 25.5 17.7 282 1-358 190-483 (483) 77 protein:vir:94426 Length: 409 77.4 0.12 7.8E-05 25.5 16.4 278 1-357 107-409 (409) 78 protein:vir:102602 Length: 456 77.1 0.13 8E-05 25.5 16.9 277 1-346 141-456 (456) 79 protein:vir:105819 Length: 456 77.1 0.13 8E-05 25.5 16.9 277 1-346 141-456 (456) 80 protein:vir:3153 Length: 467 # 76.8 0.13 8.2E-05 25.4 19.5 302 1-359 93-453 (467) 81 protein:vir:99781 Length: 511 75.9 0.14 8.8E-05 25.2 17.6 299 1-358 175-511 (511) 82 protein:vir:2683 Length: 412 # 75.8 0.14 8.9E-05 25.2 16.3 274 1-357 110-412 (412) 83 protein:vir:389 Length: 530 # 74.0 0.16 0.0001 24.9 21.9 311 1-358 177-530 (530) 84 protein:vir:101647 Length: 460 73.8 0.17 0.0001 24.9 17.6 284 1-356 140-460 (460) 85 protein:vir:94805 Length: 492 71.0 0.2 0.00013 24.4 18.5 283 1-358 199-492 (492) 86 protein:vir:9306 Length: 511 # 70.9 0.2 0.00013 24.4 18.8 304 1-356 175-511 (511) 87 protein:vir:1266 Length: 416 # 69.9 0.22 0.00013 24.2 14.7 279 1-338 111-416 (416) 88 protein:vir:105292 Length: 478 69.8 0.22 0.00014 24.2 18.5 286 1-352 165-478 (478) 89 protein:vir:106639 Length: 481 69.2 0.23 0.00014 24.1 18.2 291 1-354 167-481 (481) 90 protein:vir:3420 Length: 533 # 68.6 0.24 0.00015 24.0 22.4 315 1-358 177-533 (533) 91 protein:vir:93747 Length: 472 68.6 0.24 0.00015 24.0 20.1 283 1-358 179-472 (472) 92 protein:vir:9751 Length: 422 # 68.3 0.24 0.00015 24.0 17.8 267 1-315 130-422 (422) 93 protein:vir:93943 Length: 409 67.7 0.25 0.00015 23.9 15.7 275 1-357 107-409 (409) 94 protein:vir:9359 Length: 348 # 66.1 0.27 0.00017 23.7 16.7 276 1-357 46-348 (348) 95 protein:vir:3964 Length: 453 # 65.7 0.28 0.00017 23.6 17.5 283 1-356 149-453 (453) 96 protein:vir:95806 Length: 440 64.9 0.29 0.00018 23.5 17.3 286 1-343 131-440 (440) 97 protein:vir:4509 Length: 424 # 64.6 0.3 0.00018 23.5 19.2 279 1-358 124-424 (424) 98 protein:vir:78805 Length: 511 63.1 0.32 0.0002 23.3 17.0 302 1-358 175-511 (511) 99 protein:vir:96366 Length: 511 63.1 0.32 0.0002 23.3 17.0 302 1-358 175-511 (511) 100 protein:vir:9408 Length: 441 # 61.8 0.35 0.00022 23.1 19.4 284 1-358 130-441 (441) 101 protein:vir:79984 Length: 441 61.8 0.35 0.00022 23.1 19.4 284 1-358 130-441 (441) 102 protein:vir:96579 Length: 576 60.7 0.37 0.00023 23.0 19.2 304 1-359 176-527 (576) 103 protein:vir:80644 Length: 551 59.7 0.39 0.00024 22.8 18.6 306 1-359 182-545 (551) 104 protein:vir:107112 Length: 478 59.2 0.4 0.00025 22.8 19.2 285 1-352 164-478 (478) 105 protein:vir:8418 Length: 409 # 59.0 0.4 0.00025 22.8 20.0 283 1-356 107-409 (409) 106 protein:vir:97336 Length: 492 58.7 0.41 0.00025 22.7 20.0 283 1-358 199-492 (492) 107 protein:vir:100882 Length: 383 58.5 0.41 0.00026 22.7 19.2 267 1-355 98-383 (383) 108 protein:vir:96240 Length: 511 58.3 0.42 0.00026 22.7 16.3 306 1-358 175-511 (511) 109 protein:vir:102118 Length: 409 54.9 0.49 0.00031 22.3 18.1 274 1-359 105-409 (409) 110 protein:vir:105461 Length: 470 53.7 0.52 0.00032 22.1 19.0 278 1-334 165-470 (470) 111 protein:vir:93610 Length: 454 52.2 0.56 0.00035 22.0 19.1 297 1-359 116-448 (454) 112 protein:vir:99312 Length: 563 51.5 0.58 0.00036 21.9 19.1 305 1-359 177-549 (563) 113 protein:vir:95599 Length: 563 51.5 0.58 0.00036 21.9 19.1 305 1-359 177-549 (563) 114 protein:vir:4598 Length: 416 # 51.2 0.59 0.00036 21.9 19.4 281 1-359 105-414 (416) 115 protein:vir:81095 Length: 416 51.2 0.59 0.00036 21.9 19.4 281 1-359 105-414 (416) 116 protein:vir:100249 Length: 431 49.7 0.63 0.00039 21.7 18.5 274 1-349 141-431 (431) 117 protein:vir:4454 Length: 414 # 47.3 0.71 0.00044 21.4 18.3 288 1-357 110-414 (414) 118 protein:vir:4194 Length: 540 # 46.2 0.74 0.00046 21.3 16.5 305 1-359 112-464 (540) 119 protein:vir:95899 Length: 474 45.1 0.78 0.00049 21.2 17.4 284 1-348 165-474 (474) 120 protein:vir:96266 Length: 474 45.1 0.78 0.00049 21.2 17.4 284 1-348 165-474 (474) 121 protein:vir:100187 Length: 385 44.7 0.8 0.0005 21.1 19.6 271 1-357 98-385 (385) 122 protein:vir:97171 Length: 512 43.8 0.83 0.00052 21.0 18.0 300 1-358 175-512 (512) 123 protein:vir:63755 Length: 547 43.6 0.84 0.00052 21.0 18.5 299 1-359 177-541 (547) 124 protein:vir:95113 Length: 474 43.4 0.85 0.00053 21.0 19.5 281 1-352 182-474 (474) 125 protein:vir:80333 Length: 419 40.0 0.99 0.00062 20.6 15.6 290 1-359 109-414 (419) 126 protein:vir:102330 Length: 451 39.5 1 0.00063 20.6 19.5 272 1-335 166-451 (451) 127 protein:vir:96494 Length: 501 39.3 1 0.00063 20.5 18.1 297 1-359 178-500 (501) 128 protein:vir:9702 Length: 406 # 37.3 1.1 0.0007 20.3 17.9 281 1-359 108-406 (406) 129 protein:vir:1023 Length: 392 # 37.0 1.1 0.00071 20.3 17.9 263 1-342 105-392 (392) 130 protein:vir:3989 Length: 392 # 37.0 1.1 0.00071 20.3 17.9 263 1-342 105-392 (392) 131 protein:vir:6240 Length: 457 # 36.5 1.2 0.00072 20.2 19.0 296 1-359 123-451 (457) 132 protein:vir:3648 Length: 695 # 35.0 1.3 0.00078 20.0 16.4 302 1-359 206-571 (695) 133 protein:vir:483 Length: 413 # 34.5 1.3 0.0008 20.0 18.8 287 1-349 109-413 (413) 134 protein:vir:101541 Length: 694 34.1 1.3 0.00081 19.9 16.5 302 1-359 205-570 (694) 135 protein:vir:98396 Length: 441 33.6 1.3 0.00084 19.9 18.3 283 1-358 130-441 (441) 136 protein:vir:4337 Length: 434 # 31.5 1.5 0.00092 19.6 14.5 284 1-353 123-434 (434) 137 protein:vir:100150 Length: 437 27.9 1.8 0.0011 19.2 17.1 293 1-358 119-437 (437) 138 protein:vir:105002 Length: 432 27.0 1.9 0.0012 19.1 19.4 287 1-358 116-432 (432) 139 protein:vir:102855 Length: 432 27.0 1.9 0.0012 19.1 19.4 287 1-358 116-432 (432) 140 protein:vir:107605 Length: 432 27.0 1.9 0.0012 19.1 19.4 287 1-358 116-432 (432) 141 protein:vir:1082 Length: 359 # 26.4 1.9 0.0012 19.0 14.1 233 1-280 93-359 (359) 142 protein:vir:1661 Length: 378 # 26.2 2 0.0012 19.0 18.1 277 1-358 82-378 (378) 143 protein:vir:79043 Length: 479 25.4 2.1 0.0013 18.9 17.0 279 1-347 161-479 (479) 144 protein:vir:5737 Length: 419 # 25.0 2.1 0.0013 18.8 16.5 285 1-359 109-416 (419) 145 protein:vir:4156 Length: 542 # 24.6 2.2 0.0013 18.8 16.8 299 1-359 114-473 (542) 146 protein:vir:81072 Length: 432 22.3 2.5 0.0015 18.4 19.4 288 1-358 124-432 (432) 147 protein:vir:3843 Length: 397 # 22.0 2.5 0.0016 18.4 20.7 274 1-352 94-397 (397) 148 protein:vir:1326 Length: 457 # 20.8 2.7 0.0017 18.2 15.8 299 1-359 123-452 (457) 149 protein:vir:103951 Length: 511 20.6 2.7 0.0017 18.2 18.7 298 1-358 175-511 (511) No 1 >protein:vir:103177 Length: 533 # NCBI annotation: gp131 # Family: family:all:1036 # MgeID: mge:1583 # MgeName: Syn9 # Cross-refs: genbank:acc:YP_717798;genbank:gi:113200635;genbank:GeneID:4239186 Probab=100.00 E-value=2.5e-203 Score=1131.29 Aligned_cols=358 Identities=83% Similarity=1.248 Sum_probs=338.3 Q ss_pred CCCc-------hhhHHHhhhhhhheeeccccccccCCCceeecHhHhhhhhcccccCCCCcchhhHHHHHHHHHHHHHHH Q lcl|NC_015285. 1 MRGV-------DLNQQLTQKAAEYFLYNPKGLKNSTNQGMKITTDSVTYCHSGIQDLNKNMTLSHLHKAIKAVNQLRMIE 73 (359) Q Consensus 1 ~~~~-------~~~~~~~~~~~e~f~yn~~~~~~~~~~~v~i~~~ai~y~hSGl~d~~~~~i~syL~~Aik~~NqL~m~E 73 (359) .+.+ ..+.+|++++.|||+|||+|++++++++||||+||||||||||+|||+|+|+||||+|||||||||||| T Consensus 169 ~~~~~~~~~~~~~~~~v~~~~~eyf~Ynp~g~~~~~~~~vkI~~dAI~y~hSGl~d~~~~~i~syLhkAiKp~NQLkm~E 248 (533) T protein:vir:10 169 EQKRPEQLRGLPLNQQLSPKSAEYFLYDPKGLKNSTTQGLKIAPDSICYVHSGIMDLNKNMTLSHLHKAIKAVNQLRMIE 248 (533) T ss_pred eccCCCccceeecchhhhccceeeeeeccccccccCCCceecchhheeeeeccceeCCCCceeccchHhHHHHHhhHHHH Confidence 2233 345679999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHhcCccceeEeccCCCCchHHHHHHHHHHHHhhcceEEeeCCCCcccccccchhhHhhhcccccCCCCcccee Q lcl|NC_015285. 74 DSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKMVYDANTGEIKDDKKFMSMMEDFWLPRREGGRGTEIS 153 (359) Q Consensus 74 DalVIyR~~RAPeRRvFyIDvGnlpk~KAeqYl~~iM~kyrnklvYD~~TGevkdd~~~mSMlEDywLpRReGgrgTEIs 153 (359) |||||||+||||||||||||||||||.||||||++||++||||+|||++||+|+||+||||||||||||||||||||||| T Consensus 249 DAlVIYRitRAPeRRvFYIDVGnLPk~KAeqYlr~iM~k~KNklVYDa~TGev~ddrk~msMlEDyWLPRReGgrgTEIt 328 (533) T protein:vir:10 249 DSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKLVYDANTGEIKDDKKFMSMLEDFWLPRREGGRGTEIT 328 (533) T ss_pred hhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhccceEEEeccCceecccchhhhhHhhhcccccCCCCcccee Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ecCCCCCcchHHHHHHHHHHHHHhcCCCccccCCCCcccccchhhhhHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_015285. 154 TLPGGQNLGELEDVKYFQKKLYKALNVPSSRLETETTFNIGRAAEITRDEVKFQKFIARLRKRFSELFTDLLKTQLILKG 233 (359) Q Consensus 154 TLpGgqnLgei~DV~YF~kkLy~aL~VP~SRl~~~~~~~~g~~~eItRDElKF~KFI~rLr~rFs~if~d~Lk~QLiLkg 233 (359) |||||||||||+||+||++|||+|||||+|||+++++|++||++||||||+||+|||.|||+||+.+|+++||+|||||| T Consensus 329 TLpGgqnLgem~DV~YF~kKLY~aLnVP~SRl~~e~~f~~Gr~~EItRDEiKF~KFI~RLR~rFs~lF~~~Lk~qLiLKg 408 (533) T protein:vir:10 329 TLPGGQNLGELEDVKYFQKKLYKSLNVPGSRLETETTFNVGRAAEITRDEVKFQKFVARLRKRFSELFTDLLKTQLVLKG 408 (533) T ss_pred eccccCCcChHHHHHHHHHHHHHHhCCCccccCCCCcccccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CCChhHHHHHhhceeeeeeccchHHHHHHHHHHHHHHHHHHHhhhhcchhhhHHHHHHHHhCCCHHHHHHHHHHHHHHHh Q lcl|NC_015285. 234 VMSLEEWEDMKNHIQFDFIADNYFTELKEIEIRNERMNQVAAMDPYVGKYFSVDYMRRQVLKQTEIEIKEIDEQIASEME 313 (359) Q Consensus 234 I~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~~~~~dp~vGKy~S~~~i~k~IL~~tDeeI~e~~kqi~~E~~ 313 (359) |||++||++++++|+|+|++||||+|+|++|||++|+++++++|||||||||++|||++||+|||+||++++|||++|++ T Consensus 409 iit~eeW~~i~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~l~~~dpyvGky~S~dyi~k~ILr~tDeei~~~~kqI~~E~k 488 (533) T protein:vir:10 409 VISIEEWDQMKEHIQYDYIADNYFAELKEIEIRNERMNQVATMDPFVGKYFSVEYMRRQVLKQTDVEMKEIDKQIESEME 488 (533) T ss_pred CCCHHHHHHHhhcceEeeeecchHHHHHHHHHHHHHHHHHHHhhhhhccccchHHHHHHHhccCHHHHHHHHHHHHHHHh Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cCCCCCCcchhhhcCCCCCcccccccCCCCCcCCCCCCCCCccCCC Q lcl|NC_015285. 314 AGIIADPMAEMDPAMAAGGEGAPAAEVDPNAQESSVDPGDVRRGEF 359 (359) Q Consensus 314 ~~~~~~P~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~ 359 (359) +|+|++|+++|++.+|.+.+..... ...++.|....|....+++| T Consensus 489 ~~~~~~p~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~ 533 (533) T protein:vir:10 489 SGIIADPAAEMDPAMAAGDPDAGGA-PAEEVAPEGPDPSDERKAEF 533 (533) T ss_pred CCCCCCCcchhhHHhcCCCCCcCCc-ccccCCCCCCCcchhhccCC Confidence 9999999999998777643321111 11244456667888899999 No 2 >protein:vir:104500 Length: 537 # NCBI annotation: gp20 # Family: family:all:1036 # MgeID: mge:1548 # MgeName: P-SSM4 # Cross-refs: genbank:acc:YP_214665;genbank:gi:61806306;genbank:GeneID:3294555 Probab=100.00 E-value=3e-202 Score=1125.39 Aligned_cols=359 Identities=83% Similarity=1.250 Sum_probs=336.9 Q ss_pred CCCchhhHHHhhhhhhheeeccccccccCCCceeecHhHhhhhhcccccCCCCcchhhHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_015285. 1 MRGVDLNQQLTQKAAEYFLYNPKGLKNSTNQGMKITTDSVTYCHSGIQDLNKNMTLSHLHKAIKAVNQLRMIEDSLVIYR 80 (359) Q Consensus 1 ~~~~~~~~~~~~~~~e~f~yn~~~~~~~~~~~v~i~~~ai~y~hSGl~d~~~~~i~syL~~Aik~~NqL~m~EDalVIyR 80 (359) .+-.+.+++|+++..|||+|||+|++++++++||||+||||||||||+|||+|+|+||||+||||||||||||||||||| T Consensus 177 ~~~~~~~~~v~~~~~eyf~ynp~g~~~~~~~~vkI~~dAI~y~hSGl~d~n~~~i~syLhkAiKp~NQLkm~EDAlVIYR 256 (537) T protein:vir:10 177 LRTQDLNQQLTQQSASYFLYNPKGLKNSTNQGMKIAPDSIAYCHSGIQDLNKNMVLSHLHKAIKAVNQLRMIEDSLVIYR 256 (537) T ss_pred ceEEecceeeeecccceeeeccccccccCCCceeccHhheeeecccceeCCCCeeeeeehhhhHHHHhhHHHHhhHHHHh Confidence 33344556689999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HhcCccceeEeccCCCCchHHHHHHHHHHHHhhcceEEeeCCCCcccccccchhhHhhhcccccCCCCccceeecCCCCC Q lcl|NC_015285. 81 LSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKMVYDANTGEIKDDKKFMSMMEDFWLPRREGGRGTEISTLPGGQN 160 (359) Q Consensus 81 ~~RAPeRRvFyIDvGnlpk~KAeqYl~~iM~kyrnklvYD~~TGevkdd~~~mSMlEDywLpRReGgrgTEIsTLpGgqn 160 (359) +||||||||||||||||||.||||||++||++||||+|||++||+|+||+|||||||||||||||||||||||||||||| T Consensus 257 itRAPeRRvFYIDVGnLPk~KAeqYlr~iM~k~KNklVYDa~TGev~ddrk~msMlEDyWLPRReGgrgTEItTLpGgqn 336 (537) T protein:vir:10 257 LSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKLVYDANTGEIKDDKKFMSMLEDFWLPRREGGRGTEISTLPGGQN 336 (537) T ss_pred hhccccceEEEEecCCCCchhHHHHHHHHHHhccceEEEeccCceecccchhhhhhhhhcccccCCCcccceeeccccCC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cchHHHHHHHHHHHHHhcCCCccccCCCCcccccchhhhhHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCChhHH Q lcl|NC_015285. 161 LGELEDVKYFQKKLYKALNVPSSRLETETTFNIGRAAEITRDEVKFQKFIARLRKRFSELFTDLLKTQLILKGVMSLEEW 240 (359) Q Consensus 161 Lgei~DV~YF~kkLy~aL~VP~SRl~~~~~~~~g~~~eItRDElKF~KFI~rLr~rFs~if~d~Lk~QLiLkgI~t~eew 240 (359) ||||+||+||++|||+|||||+|||+++++|++||++||||||+||+|||.|||+||+.+|+++||+|||||||||++|| T Consensus 337 lgem~DV~YF~kKLy~aLnVP~SRl~~e~~f~~Gr~~EItRDEiKF~KFI~RLR~rFs~lF~~~Lk~qLilKgiit~eeW 416 (537) T protein:vir:10 337 LGELEDVKYFQKKLYKALNVPSSRLETETTFNIGRAAEITRDEVKFQKFIARLRKRFSELFVDLLKTQLILKGICSIEEW 416 (537) T ss_pred cChHHHHHHHHHHHHHHhCCCccccCCCCcccccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccCCCHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHhhceeeeeeccchHHHHHHHHHHHHHHHHHHHhhhhcchhhhHHHHHHHHhCCCHHHHHHHHHHHHHHHhcCCCCCC Q lcl|NC_015285. 241 EDMKNHIQFDFIADNYFTELKEIEIRNERMNQVAAMDPYVGKYFSVDYMRRQVLKQTEIEIKEIDEQIASEMEAGIIADP 320 (359) Q Consensus 241 ~~~~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~~~~~dp~vGKy~S~~~i~k~IL~~tDeeI~e~~kqi~~E~~~~~~~~P 320 (359) ++++++|+|+|++||||+|+|++|||++|++++++++||||||||++|||++||+|||+||++++|||++|+++|+|++| T Consensus 417 ~~i~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~l~~~dpyvGky~s~dyi~k~ILr~tDeeI~~~~k~I~~E~k~~~~~~p 496 (537) T protein:vir:10 417 EEMKEHIQFDFIADNYFTELKEIEIRNERMNEVAQMDPYVGKYFSANYIRTKVLKQTESEIKEIDKEIKQEIADGVIMDP 496 (537) T ss_pred HHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhhcccchHHHHHHHhccCHHHHHHHHHHHHHHhhCCCCCCc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cchhhhcCCCCCccc-ccccCCCCCcC-CCCCCCCCccCCC Q lcl|NC_015285. 321 MAEMDPAMAAGGEGA-PAAEVDPNAQE-SSVDPGDVRRGEF 359 (359) Q Consensus 321 ~~~~~~~~~~~~~~~-~~~~~~~~~~~-~~~~p~~~~~~~~ 359 (359) ++.++...|.+...+ +++-..|++.+ +...|...+.||. T Consensus 497 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 537 (537) T protein:vir:10 497 QAMQAMEMGIGDEEPVPEGGEEPQTDPNSAVSPADQKRGEL 537 (537) T ss_pred ccccccccCCCCcccCCCCCCCcccCCccCCCCCCccCCCC Confidence 987777766544332 22223555433 5556666777777 No 3 >protein:vir:104892 Length: 558 # NCBI annotation: T4-like capsid assembly protein # Family: family:all:1036 # MgeID: mge:1630 # MgeName: P-SSM2 # Cross-refs: genbank:acc:YP_214363;genbank:gi:61806003;genbank:GeneID:3294412 Probab=100.00 E-value=5.8e-198 Score=1101.89 Aligned_cols=359 Identities=59% Similarity=0.982 Sum_probs=322.2 Q ss_pred CCCchhhHHHhhhhhhheeeccccccc-------cCCCceeecHhHhhhhhcccccCCCCcchhhHHHHHHHHHHHHHHH Q lcl|NC_015285. 1 MRGVDLNQQLTQKAAEYFLYNPKGLKN-------STNQGMKITTDSVTYCHSGIQDLNKNMTLSHLHKAIKAVNQLRMIE 73 (359) Q Consensus 1 ~~~~~~~~~~~~~~~e~f~yn~~~~~~-------~~~~~v~i~~~ai~y~hSGl~d~~~~~i~syL~~Aik~~NqL~m~E 73 (359) ...++.++.++.++.|||+|||++... +++++||||+||||||||||+|||+|+|+||||+|||||||||||| T Consensus 181 ~~~~~~~~~~~~~~~eyy~Y~~~~~~~~~~~~~~~~~~~vkI~~dAI~y~hSGL~d~~~~~i~syLhkAIKp~NQLkmlE 260 (558) T protein:vir:10 181 RVRSEQDVVPNPEFEEFYIYTPKVQHPTGMVGQMGGKNSIKIAKDSITMCTSGLVDRNKNRVLSYLHKAIKALNQLRMIE 260 (558) T ss_pred eeecccceeeccceeEeeeecCCcccccccceeecCCCceeechhheeeecccceecCCCeeeecchHhhHhHHhhHHHH Confidence 122345677788999999999987643 4566799999999999999999999999999999999999999999 Q ss_pred HHHHHHHHhcCccceeEeccCCCCchHHHHHHHHHHHHhhcceEEeeCCCCcccccccchhhHhhhcccccCCCCcccee Q lcl|NC_015285. 74 DSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKMVYDANTGEIKDDKKFMSMMEDFWLPRREGGRGTEIS 153 (359) Q Consensus 74 DalVIyR~~RAPeRRvFyIDvGnlpk~KAeqYl~~iM~kyrnklvYD~~TGevkdd~~~mSMlEDywLpRReGgrgTEIs 153 (359) |||||||+||||||||||||||||||.||||||++||++||||+|||++||+|+||+||||||||||||||||||||||| T Consensus 261 DAlVIYRitRAPERRvFYIDVGnLPk~KAeqYlr~iM~k~KNklVYDa~TGev~ddrk~msMlEDyWLpRReGgrgTEIt 340 (558) T protein:vir:10 261 DSLVIYRLSRAPERRIFYIDVGNLPKVKAEQYLKEVMSRYRNKLVYDANTGEVRDDRKFMSMMEDFWLPRREGGRGTEIT 340 (558) T ss_pred hhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhccceEEEeccCceecccchhhhhHhhhcccccCCCCcccee Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ecCCCCCcchHHHHHHHHHHHHHhcCCCccccCCCCcccccchhhhhHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHhcC Q lcl|NC_015285. 154 TLPGGQNLGELEDVKYFQKKLYKALNVPSSRLETETTFNIGRAAEITRDEVKFQKFIARLRKRFSELFTDLLKTQLILKG 233 (359) Q Consensus 154 TLpGgqnLgei~DV~YF~kkLy~aL~VP~SRl~~~~~~~~g~~~eItRDElKF~KFI~rLr~rFs~if~d~Lk~QLiLkg 233 (359) |||||||||||+||+||++|||+|||||+|||+++++|++||++||||||+||+|||.|||+||+.+|+++||+|||||| T Consensus 341 TLpGgqnLgem~DV~YF~kKLy~aLnVP~SRl~~e~~f~~Gr~~EItRDEiKF~KFI~RLR~rFs~lF~~~Lk~qLilKg 420 (558) T protein:vir:10 341 TLPGGQNLGELSDVDYFQKKLYRALGVPESRIAAEGGFNLGRSSEILRDELKFAKFVGRLRKRFAAMFNDMLKTQLVLKN 420 (558) T ss_pred eccccCCcchHHHHHHHHHHHHHHhCCCccccCCCCcccccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CCChhHHHHHhhceeeeeeccchHHHHHHHHHHHHHHHHHHHhhhhcchhhhHHHHHHHHhCCCHHHHHHHHHHHHHHHh Q lcl|NC_015285. 234 VMSLEEWEDMKNHIQFDFIADNYFTELKEIEIRNERMNQVAAMDPYVGKYFSVDYMRRQVLKQTEIEIKEIDEQIASEME 313 (359) Q Consensus 234 I~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~~~~~dp~vGKy~S~~~i~k~IL~~tDeeI~e~~kqi~~E~~ 313 (359) |||++||++++++|+|+|++||||+|+|++|||++|++++++++||||||||++|||++||+|||+||++++|||++|++ T Consensus 421 iit~eeW~~i~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~l~~~dpyvGky~S~dyi~k~ILr~tDeeI~~~~kqI~~E~k 500 (558) T protein:vir:10 421 IVTPEDWKTMEDHIQYDFLYDNQFAELKESELMEGRLGMLATIEPYIGKYYSTEYVRKRVLRQTDMEIEEIDTQIEDEIQ 500 (558) T ss_pred CCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhccccchHHHHHHHhccCHHHHHHHHHHHHHHHh Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cCCCCCCcchhhhcCCCCCc-cccc---ccCCC-----CCcCCCCCCC---CCccCCC Q lcl|NC_015285. 314 AGIIADPMAEMDPAMAAGGE-GAPA---AEVDP-----NAQESSVDPG---DVRRGEF 359 (359) Q Consensus 314 ~~~~~~P~~~~~~~~~~~~~-~~~~---~~~~~-----~~~~~~~~p~---~~~~~~~ 359 (359) +|+|++|+++....+|+-+. +.++ +..+| ++.+...... +.++.+. T Consensus 501 ~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 558 (558) T protein:vir:10 501 KGIIPDPSQIDPITGEPLPQEGDPAMEGMGEQPVDPDLEAQAQAVDAQYSKDTKKAEL 558 (558) T ss_pred CCCCCCccccChhhccccCccCCchhccCCCCCcccccccchhhhhhhhhhhhhhhcC Confidence 99999999765554444222 2221 11122 1111111111 1223333 No 4 >protein:vir:6896 Length: 523 # NCBI annotation: gp20 portal vertex protein of head # Family: family:all:1036 # MgeID: mge:140 # MgeName: RB69 # Cross-refs: genbank:acc:NP_861872;genbank:gi:32453663;genbank:GeneID:1494298 Probab=100.00 E-value=8.5e-198 Score=1100.98 Aligned_cols=327 Identities=39% Similarity=0.737 Sum_probs=319.7 Q ss_pred CCCchhhHHHhhhhhhheeeccccccc-------cCCCceeecHhHhhhhhcccccCCCCcchhhHHHHHHHHHHHHHHH Q lcl|NC_015285. 1 MRGVDLNQQLTQKAAEYFLYNPKGLKN-------STNQGMKITTDSVTYCHSGIQDLNKNMTLSHLHKAIKAVNQLRMIE 73 (359) Q Consensus 1 ~~~~~~~~~~~~~~~e~f~yn~~~~~~-------~~~~~v~i~~~ai~y~hSGl~d~~~~~i~syL~~Aik~~NqL~m~E 73 (359) .+++++|++|+++++|||+|||++..+ +++++||||+||||||||||+|||+|+|+||||+|||||||||||| T Consensus 189 ~~~~~~g~~vi~~~~e~f~Y~~~~~~~~~~g~~~~~~~~ikI~~dAI~y~hSGL~d~~~~~i~gyLhkAiKp~NQLkmlE 268 (523) T protein:vir:68 189 ITTTEAGVKIVKGYKEYFIYDTSHESYACDGRIYEAGTKIKIPKAAIVYAHSGLVDCCGKNIIGYLHRAIKPANQLKLLE 268 (523) T ss_pred cCCCCcchhhhhhhhhheeeccccccccccccccCCCcceecchhheeeeeccceeCCCCceeccchhhhHHHHhhHHHH Confidence 789999999999999999999987543 5688999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHhcCccceeEeccCCCCchHHHHHHHHHHHHhhcceEEeeCCCCcccccccchhhHhhhcccccCCCCcccee Q lcl|NC_015285. 74 DSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKMVYDANTGEIKDDKKFMSMMEDFWLPRREGGRGTEIS 153 (359) Q Consensus 74 DalVIyR~~RAPeRRvFyIDvGnlpk~KAeqYl~~iM~kyrnklvYD~~TGevkdd~~~mSMlEDywLpRReGgrgTEIs 153 (359) |||||||+||||||||||||||||||.||||||++||++||||+|||++||+|+||+||||||||||||||||||||||| T Consensus 269 DAlVIYRitRAPeRRvFYIDvGnlPk~KAeqYl~~im~k~kNKlvYDa~TGev~ddrk~msMlEDyWLpRReGgrgTEIt 348 (523) T protein:vir:68 269 DAVVIYRITRAPDRRVWYVDTGNMPSRKAAEHMQHVMNTMKNRIAYDATTGKIKNQQHIMSMTEDYWLQRRDGKAVTEVD 348 (523) T ss_pred hhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhhcceeEEeccCCeeccchhhhhhHhhhcccccCCCccccee Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ecCCCCCcchHHHHHHHHHHHHHhcCCCccccCCC-CcccccchhhhhHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHhc Q lcl|NC_015285. 154 TLPGGQNLGELEDVKYFQKKLYKALNVPSSRLETE-TTFNIGRAAEITRDEVKFQKFIARLRKRFSELFTDLLKTQLILK 232 (359) Q Consensus 154 TLpGgqnLgei~DV~YF~kkLy~aL~VP~SRl~~~-~~~~~g~~~eItRDElKF~KFI~rLr~rFs~if~d~Lk~QLiLk 232 (359) |||||||||||+||+||++|||+|||||+|||+++ ++|++|+++||||||+||+|||.|||+||+.+|+++||+||||| T Consensus 349 TLpGgqnlgem~DV~YF~kkLy~aLnVP~sRl~~~~~~f~~Gr~~EItRDEikF~KFI~rLR~rFs~lf~~~Lk~qLilK 428 (523) T protein:vir:68 349 TLPGADNTGNMEDVRWFRNALYMALRIPITRIPSDQGGIQFDAGTSITRDELSFGKFIRELQHKFEEIFLDPLKTNLILK 428 (523) T ss_pred eccccCCcChHHHHHHHHHHHHHHhCCcceeecCCCcceecccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhc Confidence 99999999999999999999999999999999877 57999999999999999999999999999999999999999999 Q ss_pred CCCChhHHHHHhhceeeeeeccchHHHHHHHHHHHHHHHHHHHhhhhcchhhhHHHHHHHHhCCCHHHHHHHHHHHHHHH Q lcl|NC_015285. 233 GVMSLEEWEDMKNHIQFDFIADNYFTELKEIEIRNERMNQVAAMDPYVGKYFSVDYMRRQVLKQTEIEIKEIDEQIASEM 312 (359) Q Consensus 233 gI~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~~~~~dp~vGKy~S~~~i~k~IL~~tDeeI~e~~kqi~~E~ 312 (359) ||||++||++++++|+|+|++||||+|+|++|||++|++++++++||||||||++|||++||+|||+||++++|||++|+ T Consensus 429 giit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~~~dpyvGky~s~~yi~k~ILr~tDeei~~~~kqI~~E~ 508 (523) T protein:vir:68 429 GIITEDEWNDEINNIKIKFHRDSYFSELKDAEILERRINMLQMAEPFIGKYISHRTAMKDILQMSDEEIEQEAKQIEEES 508 (523) T ss_pred cCCCHHHHHHHhhcceEeeeecchHHHHHHHHHHHHHHHHHHHhhhhhcccchhHHHHHHHhccCHHHHHHHHHHHHHHh Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hcCCCCCCcchhhhc Q lcl|NC_015285. 313 EAGIIADPMAEMDPA 327 (359) Q Consensus 313 ~~~~~~~P~~~~~~~ 327 (359) ++|+|++|++++|.= T Consensus 509 k~~~~~~p~~e~~~f 523 (523) T protein:vir:68 509 KEARFQDPDQEQEDF 523 (523) T ss_pred hcCCCCCCchhhhcC Confidence 999999999887655 No 5 >protein:vir:106999 Length: 564 # NCBI annotation: portal vertex protein gp20 # Family: family:all:1036 # MgeID: mge:1459 # MgeName: S-PM2 # Cross-refs: genbank:acc:YP_195138;genbank:gi:58532915;interpro:IPR010823;uniprot:Q5GQN4;genbank:GeneID:3260496 Probab=100.00 E-value=2.8e-197 Score=1098.17 Aligned_cols=359 Identities=59% Similarity=0.992 Sum_probs=321.1 Q ss_pred CCC-chhhHHHhhhh---------hhheeecccccc-----------ccCCCceeecHhHhhhhhcccccCCCCcchhhH Q lcl|NC_015285. 1 MRG-VDLNQQLTQKA---------AEYFLYNPKGLK-----------NSTNQGMKITTDSVTYCHSGIQDLNKNMTLSHL 59 (359) Q Consensus 1 ~~~-~~~~~~~~~~~---------~e~f~yn~~~~~-----------~~~~~~v~i~~~ai~y~hSGl~d~~~~~i~syL 59 (359) +++ +..+.+|++++ .|||+|||+++. .+++++||||+||||||||||+|||+|+|+||| T Consensus 169 ~~~~~~~~~~v~k~~~~~~~y~~~~Eyy~Ynp~~~~g~~~~~~~~~~~~~~~~ikI~~daI~y~hSGL~d~~~~~i~gyL 248 (564) T protein:vir:10 169 LKDVDPNRKEIEKGTALQYDYGDFIEYYIYNPKGFAGNIPMVTGSMDWSNQEGIKIASDAIAQSTSGLMDLNKKMTLSFL 248 (564) T ss_pred ccccccccceeeeeeeeeccccccccceeeccccccCcccccccccccccccceeechhhcceecccceeCCCCceeccc Confidence 444 34466666665 599999998863 246789999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhcCccceeEeccCCCCchHHHHHHHHHHHHhhcceEEeeCCCCcccccccchhhHhhh Q lcl|NC_015285. 60 HKAIKAVNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKMVYDANTGEIKDDKKFMSMMEDF 139 (359) Q Consensus 60 ~~Aik~~NqL~m~EDalVIyR~~RAPeRRvFyIDvGnlpk~KAeqYl~~iM~kyrnklvYD~~TGevkdd~~~mSMlEDy 139 (359) |+|||||||||||||||||||+||||||||||||||||||.||||||++||++||||+|||++||+|+||+||||||||| T Consensus 249 hkAIKp~NQLkmlEDAlVIYRitRAPeRRvFYIDVGnLPk~KAeqYlr~iM~k~KNklVYDa~TGevrddrk~msMlEDy 328 (564) T protein:vir:10 249 HKAIKSLNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKVKAEQYLRDVMSRYRNKLVYDGQTGEIRDDKKHMSMLEDF 328 (564) T ss_pred hhhhHhHHhhHHHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCceEEEeccCceecccchhhhhHhhh Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccccCCCCccceeecCCCCCcchHHHHHHHHHHHHHhcCCCccccCCCC-cccccchhhhhHHhhhHHHHHHHHHHHHH Q lcl|NC_015285. 140 WLPRREGGRGTEISTLPGGQNLGELEDVKYFQKKLYKALNVPSSRLETET-TFNIGRAAEITRDEVKFQKFIARLRKRFS 218 (359) Q Consensus 140 wLpRReGgrgTEIsTLpGgqnLgei~DV~YF~kkLy~aL~VP~SRl~~~~-~~~~g~~~eItRDElKF~KFI~rLr~rFs 218 (359) |||||||||||||||||||||||||+||+||++|||+|||||+|||++++ +|++|+++|||||||||+|||.|||+||+ T Consensus 329 WLPRReGgrgTEItTLpGgqnLgem~DV~YF~kKLY~aLnVP~SRl~~e~~~f~~Gr~~EItRDEiKF~KFI~RLR~rFs 408 (564) T protein:vir:10 329 WLPRREGGRGTEITTLPGGQNLGELKDVEYFKKKLYNSLNLPPSRLTDDNKAFNLGKSTEILRDELKFTKFIGRLRKRFA 408 (564) T ss_pred cccccCCCcccceeeccccCCcchHHHHHHHHHHHHHHhCCCcccccCCCceeecccccchhHHHHHHHHHHHHHHHHHH Confidence 99999999999999999999999999999999999999999999999995 89999999999999999999999999999 Q ss_pred HHHHHHHHHHHHhcCCCChhHHHHHhhceeeeeeccchHHHHHHHHHHHHHHHHHHHhhhhcchhhhHHHHHHHHhCCCH Q lcl|NC_015285. 219 ELFTDLLKTQLILKGVMSLEEWEDMKNHIQFDFIADNYFTELKEIEIRNERMNQVAAMDPYVGKYFSVDYMRRQVLKQTE 298 (359) Q Consensus 219 ~if~d~Lk~QLiLkgI~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~~~~~dp~vGKy~S~~~i~k~IL~~tD 298 (359) .+|+++||+|||||||||++||++++++|+|+|++||||+|+|++|||++|+++++++|||||||||++|||++||+||| T Consensus 409 ~lF~~~Lk~qLiLKgiit~eeW~~i~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~l~~~dpyvGky~S~dyi~k~ILr~tD 488 (564) T protein:vir:10 409 QLFHDILKTQLILKGIITPEDWDDMEEHIQYDFLFDNHFNELKEQEMQLQRVNLATQMDPFVGKYFSTEYIRRKILMQTE 488 (564) T ss_pred HHHHHHHHHhhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhccccchHHHHHHHhccCH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHhcCCCCCCc-------chhhhcCCCCCccccccc--CCC--CCcCCCCCCCCCccCCC Q lcl|NC_015285. 299 IEIKEIDEQIASEMEAGIIADPM-------AEMDPAMAAGGEGAPAAE--VDP--NAQESSVDPGDVRRGEF 359 (359) Q Consensus 299 eeI~e~~kqi~~E~~~~~~~~P~-------~~~~~~~~~~~~~~~~~~--~~~--~~~~~~~~p~~~~~~~~ 359 (359) +||++++|||++|+++++|++|. +++++....|..++..+. .+| ++..++..|..++.|+= T Consensus 489 eei~~~~kqI~~E~k~~~~~~P~e~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~ 560 (564) T protein:vir:10 489 NEFKEIDKQMKSDIESGLAIDPIQVNMLDDMEKQNQAFAPELQAAQDDLAAEREIKKLNSAPKPPPSQQSKS 560 (564) T ss_pred HHHHHHHHHHHHHhhcCCCCCchhhhcCCCccCCCCcCCcchhhhccccccccChhhhccCCCCCCCCCCcC Confidence 99999999999999999999994 333333222222211111 122 22233344444444444 No 6 >protein:vir:103458 Length: 524 # NCBI annotation: portal vertex of the head # Family: family:all:1036 # MgeID: mge:1542 # MgeName: RB32 # Cross-refs: genbank:acc:YP_803110;genbank:gi:116326390;genbank:GeneID:4405487 Probab=100.00 E-value=2.5e-197 Score=1098.42 Aligned_cols=327 Identities=40% Similarity=0.737 Sum_probs=319.1 Q ss_pred CCCchhhHHHhhhhhhheeecccccc-------ccCCCceeecHhHhhhhhcccccCCCCcchhhHHHHHHHHHHHHHHH Q lcl|NC_015285. 1 MRGVDLNQQLTQKAAEYFLYNPKGLK-------NSTNQGMKITTDSVTYCHSGIQDLNKNMTLSHLHKAIKAVNQLRMIE 73 (359) Q Consensus 1 ~~~~~~~~~~~~~~~e~f~yn~~~~~-------~~~~~~v~i~~~ai~y~hSGl~d~~~~~i~syL~~Aik~~NqL~m~E 73 (359) .+++++|++|+++++|||+|||++.. .+++++||||+||||||||||+|||+|+|+||||+|||||||||||| T Consensus 189 ~~~~~~~~~vi~~~~e~f~Y~~~~~~y~~~g~~~~~~~~ikI~~dAI~y~hSGL~d~~~~~i~gyLhkAiKp~NQLkmlE 268 (524) T protein:vir:10 189 ITETEAGTKIVKGYKEYFIYDTAHESYACDGRMYEAGTKIKIPKAAIVYAHSGLVDCCGKNIIGYLHRAVKPANQLKLLE 268 (524) T ss_pred ccCCCccchhhcchhhheeeccCccccccCccccCCCcceecchhheeeeeccceeCCCCceeccchhhhHHHHhhhHHH Confidence 69999999999999999999997643 35688999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHhcCccceeEeccCCCCchHHHHHHHHHHHHhhcceEEeeCCCCcccccccchhhHhhhcccccCCCCcccee Q lcl|NC_015285. 74 DSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKMVYDANTGEIKDDKKFMSMMEDFWLPRREGGRGTEIS 153 (359) Q Consensus 74 DalVIyR~~RAPeRRvFyIDvGnlpk~KAeqYl~~iM~kyrnklvYD~~TGevkdd~~~mSMlEDywLpRReGgrgTEIs 153 (359) |||||||+||||||||||||||||||.||||||++||++||||+|||++||+|+||+||||||||||||||||||||||| T Consensus 269 DAlVIYRitRAPeRRvFYIDvGnlPk~KAeqYl~~im~k~KNklvYDa~TGev~ddrk~msMlEDyWLpRReGgrgTEIt 348 (524) T protein:vir:10 269 DAVVIYRITRAPDRRVWYVDTGNMPARKAAEHMQHVMNTMKNRVVYDASTGKIKNQQHNMSMTEDYWLQRRDGKAVTEVD 348 (524) T ss_pred hhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCceeEEeCCCCeeccchhhhhhHhhhcccccCCCccccee Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ecCCCCCcchHHHHHHHHHHHHHhcCCCccccCCC--CcccccchhhhhHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_015285. 154 TLPGGQNLGELEDVKYFQKKLYKALNVPSSRLETE--TTFNIGRAAEITRDEVKFQKFIARLRKRFSELFTDLLKTQLIL 231 (359) Q Consensus 154 TLpGgqnLgei~DV~YF~kkLy~aL~VP~SRl~~~--~~~~~g~~~eItRDElKF~KFI~rLr~rFs~if~d~Lk~QLiL 231 (359) |||||||||||+||+||++|||+|||||+|||+++ ++|++|+++||||||+||+|||.|||+||+.+|+++||+|||| T Consensus 349 TLpGgqnlgem~DV~YF~kkLy~aLnVP~sRl~~d~~~~f~~gr~~EItRDEikF~KFI~rLR~rFs~~f~~~Lk~qLil 428 (524) T protein:vir:10 349 TLPGADNTGNMEDVRWFRQALYMALRVPLSRIPQDQQGGVMFDSGTSITRDELTFAKFIRELQHKFEEVFLDPLKTNLLL 428 (524) T ss_pred eccccCCcChHHHHHHHHHHHHHHhCCchhhcCCCCCccccccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhh Confidence 99999999999999999999999999999999776 6799999999999999999999999999999999999999999 Q ss_pred cCCCChhHHHHHhhceeeeeeccchHHHHHHHHHHHHHHHHHHHhhhhcchhhhHHHHHHHHhCCCHHHHHHHHHHHHHH Q lcl|NC_015285. 232 KGVMSLEEWEDMKNHIQFDFIADNYFTELKEIEIRNERMNQVAAMDPYVGKYFSVDYMRRQVLKQTEIEIKEIDEQIASE 311 (359) Q Consensus 232 kgI~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~~~~~dp~vGKy~S~~~i~k~IL~~tDeeI~e~~kqi~~E 311 (359) |||||++||++++++|+|+|++||||+|+|++|||++|++++++++||||||||++|||++||+|||+||++++|||++| T Consensus 429 Kgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~~~dpyvGky~s~~yi~k~ILr~tDeei~~~~k~I~~E 508 (524) T protein:vir:10 429 KGIITEDEWNDEINNIKIEFHRDSYFTELKEAEILERRINMLTMAEPFIGKYISHRTAMKDILQMTDEEIEQEAKQIEEE 508 (524) T ss_pred ccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhcccchhHHHHHHHhccCHHHHHHHHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HhcCCCCCCcchhhhc Q lcl|NC_015285. 312 MEAGIIADPMAEMDPA 327 (359) Q Consensus 312 ~~~~~~~~P~~~~~~~ 327 (359) +++|+|++|++++|.= T Consensus 509 ~k~~~~~~~~~~~~~f 524 (524) T protein:vir:10 509 SKEARFQDPDQEQEDF 524 (524) T ss_pred hhcCCCCCCchhhhcC Confidence 9999999999877655 No 7 >protein:vir:7208 Length: 524 # NCBI annotation: gp20 portal vertex protein of head # Family: family:all:1036 # MgeID: mge:142 # MgeName: T4 # Cross-refs: genbank:acc:NP_049782;genbank:gi:9632594;genbank:GeneID:1258582 Probab=100.00 E-value=2.6e-197 Score=1098.29 Aligned_cols=327 Identities=40% Similarity=0.735 Sum_probs=319.1 Q ss_pred CCCchhhHHHhhhhhhheeecccccc-------ccCCCceeecHhHhhhhhcccccCCCCcchhhHHHHHHHHHHHHHHH Q lcl|NC_015285. 1 MRGVDLNQQLTQKAAEYFLYNPKGLK-------NSTNQGMKITTDSVTYCHSGIQDLNKNMTLSHLHKAIKAVNQLRMIE 73 (359) Q Consensus 1 ~~~~~~~~~~~~~~~e~f~yn~~~~~-------~~~~~~v~i~~~ai~y~hSGl~d~~~~~i~syL~~Aik~~NqL~m~E 73 (359) .+++++|++|+++++|||+|||++.. .+++++||||+||||||||||+|||+|+|+||||+|||||||||||| T Consensus 189 ~~~~~~~~~vi~~~~e~f~Y~~~~~~y~~~g~~~~~~~~ikI~~dAI~y~hSGL~d~~~~~i~gyLhkAiKp~NQLkmlE 268 (524) T protein:vir:72 189 ITETEAGTKIVKGYKEYFIYDTAHESYACDGRMYEAGTKIKIPKAAVVYAHSGLVDCCGKNIIGYLHRAVKPANQLKLLE 268 (524) T ss_pred ccCCCccchhhcchhhheeeccCccccccCccccCCCcceecchhheeeeeccceeCCCCceeccchhhhHhHHhhhHHH Confidence 69999999999999999999997643 35688999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHhcCccceeEeccCCCCchHHHHHHHHHHHHhhcceEEeeCCCCcccccccchhhHhhhcccccCCCCcccee Q lcl|NC_015285. 74 DSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKMVYDANTGEIKDDKKFMSMMEDFWLPRREGGRGTEIS 153 (359) Q Consensus 74 DalVIyR~~RAPeRRvFyIDvGnlpk~KAeqYl~~iM~kyrnklvYD~~TGevkdd~~~mSMlEDywLpRReGgrgTEIs 153 (359) |||||||+||||||||||||||||||.||||||++||++||||+|||++||+|+||+||||||||||||||||||||||| T Consensus 269 DAlVIYRitRAPeRRvFYIDvGnlPk~KAeqYl~~im~k~KNklvYDa~TGev~ddrk~msMlEDyWLpRReGgrgTEIt 348 (524) T protein:vir:72 269 DAVVIYRITRAPDRRVWYVDTGNMPARKAAEHMQHVMNTMKNRVVYDASTGKIKNQQHNMSMTEDYWLQRRDGKAVTEVD 348 (524) T ss_pred hhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCceeEEeCCCCeeccchhhhhhHhhhcccccCCCccccee Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ecCCCCCcchHHHHHHHHHHHHHhcCCCccccCCC--CcccccchhhhhHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_015285. 154 TLPGGQNLGELEDVKYFQKKLYKALNVPSSRLETE--TTFNIGRAAEITRDEVKFQKFIARLRKRFSELFTDLLKTQLIL 231 (359) Q Consensus 154 TLpGgqnLgei~DV~YF~kkLy~aL~VP~SRl~~~--~~~~~g~~~eItRDElKF~KFI~rLr~rFs~if~d~Lk~QLiL 231 (359) |||||||||||+||+||++|||+|||||+|||+++ ++|++|+++||||||+||+|||.|||+||+.+|+++||+|||| T Consensus 349 TLpGgqnlgem~DV~YF~kkLy~aLnVP~sRl~~d~~~~f~~gr~~EItRDEikF~KFI~rLR~rFs~~f~~~Lk~qLil 428 (524) T protein:vir:72 349 TLPGADNTGNMEDIRWFRQALYMALRVPLSRIPQDQQGGVMFDSGTSITRDELTFAKFIRELQHKFEEVFLDPLKTNLLL 428 (524) T ss_pred eccccCCcChHHHHHHHHHHHHHHhCCchhhcCCCCCccccccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhh Confidence 99999999999999999999999999999999776 6799999999999999999999999999999999999999999 Q ss_pred cCCCChhHHHHHhhceeeeeeccchHHHHHHHHHHHHHHHHHHHhhhhcchhhhHHHHHHHHhCCCHHHHHHHHHHHHHH Q lcl|NC_015285. 232 KGVMSLEEWEDMKNHIQFDFIADNYFTELKEIEIRNERMNQVAAMDPYVGKYFSVDYMRRQVLKQTEIEIKEIDEQIASE 311 (359) Q Consensus 232 kgI~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~~~~~dp~vGKy~S~~~i~k~IL~~tDeeI~e~~kqi~~E 311 (359) |||||++||++++++|+|+|++||||+|+|++|||++|++++++++||||||||++|||++||+|||+||++++|||++| T Consensus 429 Kgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~~~dpyvGky~s~~yi~k~ILr~tDeei~~~~k~I~~E 508 (524) T protein:vir:72 429 KGIITEDEWNDEINNIKIEFHRDSYFAELKEAEILERRINMLTMAEPFIGKYISHRTAMKDILQMTDEEIEQEAKQIEEE 508 (524) T ss_pred ccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhcccchhHHHHHHHhccCHHHHHHHHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HhcCCCCCCcchhhhc Q lcl|NC_015285. 312 MEAGIIADPMAEMDPA 327 (359) Q Consensus 312 ~~~~~~~~P~~~~~~~ 327 (359) +++|+|++|+++++.= T Consensus 509 ~k~~~~~~~~~~~~~f 524 (524) T protein:vir:72 509 SKEARFQDPDQEQEDF 524 (524) T ss_pred hhcCCCCCCchhhhcC Confidence 9999999999877654 No 8 >protein:vir:108049 Length: 524 # NCBI annotation: gp20 portal vertex protein of head # Family: family:all:1036 # MgeID: mge:2002 # MgeName: JS98 # Cross-refs: genbank:acc:YP_001595296;genbank:gi:161622602;genbank:GeneID:5783768 Probab=100.00 E-value=1.7e-196 Score=1093.92 Aligned_cols=327 Identities=39% Similarity=0.709 Sum_probs=318.8 Q ss_pred CCCchhhHHHhhhhhhheeecccc-------ccccCCCceeecHhHhhhhhcccccCCCCcchhhHHHHHHHHHHHHHHH Q lcl|NC_015285. 1 MRGVDLNQQLTQKAAEYFLYNPKG-------LKNSTNQGMKITTDSVTYCHSGIQDLNKNMTLSHLHKAIKAVNQLRMIE 73 (359) Q Consensus 1 ~~~~~~~~~~~~~~~e~f~yn~~~-------~~~~~~~~v~i~~~ai~y~hSGl~d~~~~~i~syL~~Aik~~NqL~m~E 73 (359) .+++++|++||++++|||+|||+. ...+++++||||+||||||||||+|||+|+|+||||+|||||||||||| T Consensus 189 ~~~~~~~~~vi~~~~e~f~Y~~~~~~~~~~~~~~~~~~~ikI~~dAIvy~~SGL~d~~~~~i~syLhkAiKp~NQLkm~E 268 (524) T protein:vir:10 189 VTRMEDGVKIVDGYREFFVYDTGHESYCADGRIYSAGTKVKIPRAAVVYAHSGLLDCCGKNIIGYLQRAIKPANQLKLME 268 (524) T ss_pred cccCcccchhhcchhhheeecCCCcccccCcceecCCcceecchhheeeeccCcccCCCCceeccchHhhHHHHhhHHHH Confidence 789999999999999999999853 2347889999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHhcCccceeEeccCCCCchHHHHHHHHHHHHhhcceEEeeCCCCcccccccchhhHhhhcccccCCCCcccee Q lcl|NC_015285. 74 DSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKMVYDANTGEIKDDKKFMSMMEDFWLPRREGGRGTEIS 153 (359) Q Consensus 74 DalVIyR~~RAPeRRvFyIDvGnlpk~KAeqYl~~iM~kyrnklvYD~~TGevkdd~~~mSMlEDywLpRReGgrgTEIs 153 (359) |||||||+||||||||||||||||||.||||||++||++||||+|||++||+|+||+||||||||||||||||||||||| T Consensus 269 DAlVIYRitRAPeRRvFYIDVGnlPk~KAeqYl~~im~k~kNKlvYDa~TGev~ddrk~msMlEDyWLpRReGgrgTEIt 348 (524) T protein:vir:10 269 DAMVIYRITRAPDRRVFYIDTGNMPSRKAAAQMQHIMNTMKNRVVYDASTGKIKNQQHNMSMTEDYWLQRRDGKAVTEVD 348 (524) T ss_pred hhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCceeEEeccCCeeccchhhhhhHhhhcccccCCCCcccee Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ecCCCCCcchHHHHHHHHHHHHHhcCCCccccCCC--CcccccchhhhhHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_015285. 154 TLPGGQNLGELEDVKYFQKKLYKALNVPSSRLETE--TTFNIGRAAEITRDEVKFQKFIARLRKRFSELFTDLLKTQLIL 231 (359) Q Consensus 154 TLpGgqnLgei~DV~YF~kkLy~aL~VP~SRl~~~--~~~~~g~~~eItRDElKF~KFI~rLr~rFs~if~d~Lk~QLiL 231 (359) |||||||||||+||+||++|||+|||||+|||+++ ++|++|+++||||||+||+|||.|||+||+.+|+++||+|||| T Consensus 349 TLpGgqnlgem~DV~YF~kkLy~aLnVP~sRl~~e~~~~f~~gr~~EItRDEiKF~KFI~rLR~rFs~lf~~~L~~qLil 428 (524) T protein:vir:10 349 TMPGATGMSDMDDVLYFRTALYRALRIPESRIPSESNSGVMFDAGTAITRDELKFAKWIRQLQNKFEEIFLDPLKTNLIL 428 (524) T ss_pred eccccCCcChHHHHHHHHHHHHHHhCCCchhccCCCCccccccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhh Confidence 99999999999999999999999999999999766 5899999999999999999999999999999999999999999 Q ss_pred cCCCChhHHHHHhhceeeeeeccchHHHHHHHHHHHHHHHHHHHhhhhcchhhhHHHHHHHHhCCCHHHHHHHHHHHHHH Q lcl|NC_015285. 232 KGVMSLEEWEDMKNHIQFDFIADNYFTELKEIEIRNERMNQVAAMDPYVGKYFSVDYMRRQVLKQTEIEIKEIDEQIASE 311 (359) Q Consensus 232 kgI~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~~~~~dp~vGKy~S~~~i~k~IL~~tDeeI~e~~kqi~~E 311 (359) |||||++||++++++|+|+|++||||+|+|++|||++|++++++++||||||||++|||++||+|||+||++++|||++| T Consensus 429 Kgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~~~dpyvGky~s~~yi~k~ILr~tDeei~~~~k~I~~E 508 (524) T protein:vir:10 429 KKIITEDEWEREINNIKVTFNRDSYFSEMKDAEIMERRINMLTMAEPFIGKYISHQTAMKDFLQMTDEEINQEAKQIEEE 508 (524) T ss_pred ccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhcccchhHHHHHHHhccCHHHHHHHHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HhcCCCCCCcchhhhc Q lcl|NC_015285. 312 MEAGIIADPMAEMDPA 327 (359) Q Consensus 312 ~~~~~~~~P~~~~~~~ 327 (359) +++|+|++|+++++.= T Consensus 509 ~k~~~~~~~~~~~~~f 524 (524) T protein:vir:10 509 SKEARFQNPDEEEEDF 524 (524) T ss_pred hhcCCCCCCChhhhcC Confidence 9999999999877655 No 9 >protein:vir:81017 Length: 521 # NCBI annotation: gp20 portal vertex protein of head # Family: family:all:1036 # MgeID: mge:1888 # MgeName: Phi1 # Cross-refs: genbank:acc:YP_001469501;genbank:gi:157311458;genbank:GeneID:5602316 Probab=100.00 E-value=1.3e-196 Score=1094.56 Aligned_cols=327 Identities=38% Similarity=0.698 Sum_probs=318.0 Q ss_pred CCCchhhHHHhhhhhhheeecccc-------ccccCCCceeecHhHhhhhhcccccCCCCcchhhHHHHHHHHHHHHHHH Q lcl|NC_015285. 1 MRGVDLNQQLTQKAAEYFLYNPKG-------LKNSTNQGMKITTDSVTYCHSGIQDLNKNMTLSHLHKAIKAVNQLRMIE 73 (359) Q Consensus 1 ~~~~~~~~~~~~~~~e~f~yn~~~-------~~~~~~~~v~i~~~ai~y~hSGl~d~~~~~i~syL~~Aik~~NqL~m~E 73 (359) .+++..|++|+++++|||+|||++ .+.+++++||||+||||||||||+|||+|+|+||||+|||||||||||| T Consensus 186 ~k~~~~~~~v~~~~~e~f~Y~~~~~~~~~~g~~~~~~~~vkI~~dAI~y~hSGl~d~~~~~i~syLhkAiKp~NQLkm~E 265 (521) T protein:vir:81 186 ITEDTPEGKIYKATKEYFIYTVGNSSYCAGGQVFSPNSRVKIPRSAITYAHSGLMDCDDKYIIGYLHRAVKPANQLKLLE 265 (521) T ss_pred cccccCccceecceeeeeeeecCCccccccceeecCCcceeechhheeeeeccceeCCCCeeeecchhhhHhHHhhHHHH Confidence 889999999999999999999853 4557899999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHhcCccceeEeccCCCCchHHHHHHHHHHHHhhcceEEeeCCCCcccccccchhhHhhhcccccCCCCcccee Q lcl|NC_015285. 74 DSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKMVYDANTGEIKDDKKFMSMMEDFWLPRREGGRGTEIS 153 (359) Q Consensus 74 DalVIyR~~RAPeRRvFyIDvGnlpk~KAeqYl~~iM~kyrnklvYD~~TGevkdd~~~mSMlEDywLpRReGgrgTEIs 153 (359) |||||||+||||||||||||||||||.||||||++||++||||+|||++||+|+||+|+||||||||||||||||||||| T Consensus 266 DAlVIYRitRAPeRRvFYIDvGnlpk~KAeqYl~~im~k~kNklvYDa~TGev~ddrk~msMlEDyWLpRReGgrgTEIt 345 (521) T protein:vir:81 266 DAMVVYRITRAPERRVFFIDTGNMNNRKAAQHMNSVAQSFKNRVVYDASTGKLKNQQANLSMTEDYWLQRRDGKAITDVT 345 (521) T ss_pred hhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCceeEeecccccccccccccchhhhhcccccCCCccccee Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ecCCCCCcchHHHHHHHHHHHHHhcCCCccccC--CCCcccccchhhhhHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_015285. 154 TLPGGQNLGELEDVKYFQKKLYKALNVPSSRLE--TETTFNIGRAAEITRDEVKFQKFIARLRKRFSELFTDLLKTQLIL 231 (359) Q Consensus 154 TLpGgqnLgei~DV~YF~kkLy~aL~VP~SRl~--~~~~~~~g~~~eItRDElKF~KFI~rLr~rFs~if~d~Lk~QLiL 231 (359) |||||||||||+||+||++|||+|||||+|||+ ++++|++|+++||||||+||+|||+|||+||+.+|+++||+|||| T Consensus 346 TLpGgqnlgem~DV~YF~kkLy~aLnVP~sRl~~e~~~~~~~Gr~~EItRDEiKF~KFI~rLR~rFs~lf~~~L~~qLil 425 (521) T protein:vir:81 346 TLPGASGMSDIDDIRYFNRKLYEALRVPLSRSNLSDANMVIGGDGSEITRDELEFSKFIRTRQSQFSEVLRDPLKYNLIL 425 (521) T ss_pred ecccCCCCChHHHHHHHHHHHHHHhCCccccccCCCCcceeccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhh Confidence 999999999999999999999999999999995 446899999999999999999999999999999999999999999 Q ss_pred cCCCChhHHHHHhhceeeeeeccchHHHHHHHHHHHHHHHHHHHhhhhcchhhhHHHHHHHHhCCCHHHHHHHHHHHHHH Q lcl|NC_015285. 232 KGVMSLEEWEDMKNHIQFDFIADNYFTELKEIEIRNERMNQVAAMDPYVGKYFSVDYMRRQVLKQTEIEIKEIDEQIASE 311 (359) Q Consensus 232 kgI~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~~~~~dp~vGKy~S~~~i~k~IL~~tDeeI~e~~kqi~~E 311 (359) |||||++||++++++|+|+|++||||+|+|++|||++|+++++++|||||||||++||||+||+|||+||++++|||++| T Consensus 426 Kgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~~~dpyvGky~s~dyi~k~ILr~tDeei~~~~k~I~~E 505 (521) T protein:vir:81 426 KNVITEDDWDREINNIKVVFHRDSYYTEVKDAEILERRIGLIERITPYIGKYFSNQTVMRDILKYTDDQMDTEKKQIEEE 505 (521) T ss_pred hcCCCHHHHHHHhhcceEEEeecchHHHHHHHHHHHHHHHHHHHhhhhhccccchHHHHHHHhccCHHHHHHHHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HhcCCCCCCcchhhhc Q lcl|NC_015285. 312 MEAGIIADPMAEMDPA 327 (359) Q Consensus 312 ~~~~~~~~P~~~~~~~ 327 (359) +++|+|++|+++++.= T Consensus 506 ~~~~~~~~p~~~~~~f 521 (521) T protein:vir:81 506 ANDPRFKQTPDEIEDF 521 (521) T ss_pred hhCCCCCCCcccccCC Confidence 9999999999866433 No 10 >protein:vir:106282 Length: 521 # NCBI annotation: gp20 portal vertex protein of head # Family: family:all:1036 # MgeID: mge:1474 # MgeName: Aeh1 # Cross-refs: genbank:acc:NP_944108;genbank:gi:38640152;genbank:GeneID:2658030 Probab=100.00 E-value=3.1e-196 Score=1092.45 Aligned_cols=327 Identities=39% Similarity=0.759 Sum_probs=319.6 Q ss_pred CCCchhhHHHhhhhhhheeecccc-----ccccCCCceeecHhHhhhhhcccccCCCCcchhhHHHHHHHHHHHHHHHHH Q lcl|NC_015285. 1 MRGVDLNQQLTQKAAEYFLYNPKG-----LKNSTNQGMKITTDSVTYCHSGIQDLNKNMTLSHLHKAIKAVNQLRMIEDS 75 (359) Q Consensus 1 ~~~~~~~~~~~~~~~e~f~yn~~~-----~~~~~~~~v~i~~~ai~y~hSGl~d~~~~~i~syL~~Aik~~NqL~m~EDa 75 (359) .+++.+|++|+++++|||+|||.+ ..++++++||||+||||||||||+|||+|+|+||||+|||||||||||||| T Consensus 187 ~k~~~~~~~v~~~~~e~f~Y~~~~~~~~~~~g~~~~~vkI~~daI~y~hSGL~d~~~~~i~syLhkAiKp~NQLkm~EDA 266 (521) T protein:vir:10 187 LKSNENGNDVYKGVKEFFTYGATEDNRYNISGNSNNLVQIPIDAIVYSHSGKVDIDGKTIVGYLHNVIKPANQLKMLEDA 266 (521) T ss_pred cCCCCCcchhhccceeeeeeccCCCceecCCCCCCcceeechhheeeecccceeCCCCceeccchhhhHhHHhhHHHHhh Confidence 889999999999999999999743 345678899999999999999999999999999999999999999999999 Q ss_pred HHHHHHhcCccceeEeccCCCCchHHHHHHHHHHHHhhcceEEeeCCCCcccccccchhhHhhhcccccCCCCccceeec Q lcl|NC_015285. 76 LVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKMVYDANTGEIKDDKKFMSMMEDFWLPRREGGRGTEISTL 155 (359) Q Consensus 76 lVIyR~~RAPeRRvFyIDvGnlpk~KAeqYl~~iM~kyrnklvYD~~TGevkdd~~~mSMlEDywLpRReGgrgTEIsTL 155 (359) |||||+||||||||||||||||||.||||||++||++||||+|||++||+|+||+||||||||||||||||||||||||| T Consensus 267 lVIYRitRAPeRRvFYIDvGnlpk~KAeqYl~~iM~k~kNklVYDa~TGev~ddrk~msMlEDyWLpRReGgrgTEI~TL 346 (521) T protein:vir:10 267 MVIYRITRAPERRVFYIDVGTMPNKKATQHLNNVMQGLKNRVVYDSSTGKVKNSSNNLAMTEDYWLMRRDGKATTEVSTL 346 (521) T ss_pred HHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCceEEEeccCceeccchhhhhhHhhhcccccCCCCccceeec Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CCCCCcchHHHHHHHHHHHHHhcCCCccccCCCC-cccccchhhhhHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHhcCC Q lcl|NC_015285. 156 PGGQNLGELEDVKYFQKKLYKALNVPSSRLETET-TFNIGRAAEITRDEVKFQKFIARLRKRFSELFTDLLKTQLILKGV 234 (359) Q Consensus 156 pGgqnLgei~DV~YF~kkLy~aL~VP~SRl~~~~-~~~~g~~~eItRDElKF~KFI~rLr~rFs~if~d~Lk~QLiLkgI 234 (359) |||||||||+||+||++|||+|||||+|||++++ +|++|+++||||||+||+|||+|||+||+.+|+++||+||||||| T Consensus 347 pggqnlgem~DV~YF~kkLy~aLnVP~sRl~~e~~~f~~Gr~~EItRDEikF~KFI~rLR~rFs~~f~~~L~~qLilKgi 426 (521) T protein:vir:10 347 PGAQSMGEMDDVRWFNRKLYESMKIPLSRLPQEGAGVTFGAGNDITRDELQFTKYIRGLQQQFEPIFLNPLRTNLMLKGK 426 (521) T ss_pred cccCCcChHHHHHHHHHHHHHHhCCCccccCCCCCceecccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccC Confidence 9999999999999999999999999999999995 799999999999999999999999999999999999999999999 Q ss_pred CChhHHHHHhhceeeeeeccchHHHHHHHHHHHHHHHHHHHhhh--hcchhhhHHHHHHHHhCCCHHHHHHHHHHHHHHH Q lcl|NC_015285. 235 MSLEEWEDMKNHIQFDFIADNYFTELKEIEIRNERMNQVAAMDP--YVGKYFSVDYMRRQVLKQTEIEIKEIDEQIASEM 312 (359) Q Consensus 235 ~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~~~~~dp--~vGKy~S~~~i~k~IL~~tDeeI~e~~kqi~~E~ 312 (359) ||++||++++++|+|+|++||||+|+|++|||++|+++++++|| |||||||++|||++||+|||+||+++++||++|+ T Consensus 427 it~eew~~i~~~I~~~f~~Dn~f~ElKe~eil~~R~~~l~~~dp~~yvGky~s~dyi~k~ILr~tDeeik~~~k~I~~E~ 506 (521) T protein:vir:10 427 MSVSEWEEQAENIKVVFSKDSYYEEIKDVEILERRVNLVQTLASAEVTGKYLSHEYVMKNILRMSDEDIKTEREKIDGEL 506 (521) T ss_pred CCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHhhcCccccccccchHHHHHHHhcCCHhHHHHHHHHHHHhh Confidence 99999999999999999999999999999999999999999999 9999999999999999999999999999999999 Q ss_pred hcCCCCCCcchhhhc Q lcl|NC_015285. 313 EAGIIADPMAEMDPA 327 (359) Q Consensus 313 ~~~~~~~P~~~~~~~ 327 (359) ++|+|++|+++++.= T Consensus 507 ~~~~~~~p~~e~~df 521 (521) T protein:vir:10 507 KDSVYKNPEDPMEEF 521 (521) T ss_pred hCCCCCCCcchhhcC Confidence 999999999988655 No 11 >protein:vir:101806 Length: 516 # NCBI annotation: gp20 # Family: family:all:1036 # MgeID: mge:1580 # MgeName: 31 # Cross-refs: genbank:acc:YP_238883;genbank:gi:66391958;genbank:GeneID:3416633 Probab=100.00 E-value=8.6e-196 Score=1090.00 Aligned_cols=326 Identities=40% Similarity=0.740 Sum_probs=318.1 Q ss_pred CCCchhhHHHhhhhhhheeecccc-------ccccCCCceeecHhHhhhhhcccccCCCCcchhhHHHHHHHHHHHHHHH Q lcl|NC_015285. 1 MRGVDLNQQLTQKAAEYFLYNPKG-------LKNSTNQGMKITTDSVTYCHSGIQDLNKNMTLSHLHKAIKAVNQLRMIE 73 (359) Q Consensus 1 ~~~~~~~~~~~~~~~e~f~yn~~~-------~~~~~~~~v~i~~~ai~y~hSGl~d~~~~~i~syL~~Aik~~NqL~m~E 73 (359) .+++++|++|+++++|||+|+|++ ...+++++||||+||||||||||+|||+|+|+||||+|||||||||||| T Consensus 182 ~~~~~~~~~v~~~~~e~~~Y~~~~~~~~~~g~~~~~~~~ikI~~dAI~y~hSGL~d~~~~~i~syLhkAiKp~NQLkm~E 261 (516) T protein:vir:10 182 VTSDIGGTTIVKGYREFFIYTTGNEGYSYNGRIFEPNTRIKIPRSAVVYASSGLMDCSDRGIIGYLHNAVKPANQLKLLE 261 (516) T ss_pred cccccccchhhhhhhheeeeccCccccccccceeCCCcceeechhheeeecccceeCCCCceeeeehhhhHhHHhhHHHH Confidence 788999999999999999999864 3346788999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHhcCccceeEeccCCCCchHHHHHHHHHHHHhhcceEEeeCCCCcccccccchhhHhhhcccccCCCCcccee Q lcl|NC_015285. 74 DSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKMVYDANTGEIKDDKKFMSMMEDFWLPRREGGRGTEIS 153 (359) Q Consensus 74 DalVIyR~~RAPeRRvFyIDvGnlpk~KAeqYl~~iM~kyrnklvYD~~TGevkdd~~~mSMlEDywLpRReGgrgTEIs 153 (359) |||||||+||||||||||||||||||.||||||++||++||||+|||++||+|+||+||||||||||||||||||||||| T Consensus 262 DAlVIYRitRAPeRRvFYIDvGnlPk~KAeqYl~~im~k~kNklvYDa~TGev~ddrk~msMlEDyWLpRReGgrgTEIt 341 (516) T protein:vir:10 262 DAMVIYRITRAPERRVFYIDVGNMNNRKATEYVNGIMQSLKNRVVYDSNTGTVKNQKRNLSMTEDYWLMRRDGKSVTEVS 341 (516) T ss_pred hhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCceeEEeCCCCeeccchhhhhhHhhhcccccCCCCcccee Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ecCCCCCcchHHHHHHHHHHHHHhcCCCccccCCCCcccc--cchhhhhHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_015285. 154 TLPGGQNLGELEDVKYFQKKLYKALNVPSSRLETETTFNI--GRAAEITRDEVKFQKFIARLRKRFSELFTDLLKTQLIL 231 (359) Q Consensus 154 TLpGgqnLgei~DV~YF~kkLy~aL~VP~SRl~~~~~~~~--g~~~eItRDElKF~KFI~rLr~rFs~if~d~Lk~QLiL 231 (359) |||||||||||+||+||++|||+|||||+|||+++++|++ |+++||||||+||+|||.|||+||+.+|+++||+|||| T Consensus 342 TLpGgqnlgem~DV~YF~kkLy~aLnVP~sRl~~e~~~~~~~Gr~~EItRDEiKF~KFI~rLR~rFs~lf~~~L~~qLil 421 (516) T protein:vir:10 342 SLPGAQTMGDMDDVRWFNKKLYEALRIPLSRIPRDDGGMVIGGQDTAITRDELDFRKFVVQLQHDFEEIFLDPLKTNLIY 421 (516) T ss_pred eccccCCcChHHHHHHHHHHHHHHhCCCcccccCCCCceeeccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhh Confidence 9999999999999999999999999999999999999887 99999999999999999999999999999999999999 Q ss_pred cCCCChhHHHHHhhceeeeeeccchHHHHHHHHHHHHHHHHHHHhhhhcchhhhHHHHHHHHhCCCHHHHHHHHHHHHHH Q lcl|NC_015285. 232 KGVMSLEEWEDMKNHIQFDFIADNYFTELKEIEIRNERMNQVAAMDPYVGKYFSVDYMRRQVLKQTEIEIKEIDEQIASE 311 (359) Q Consensus 232 kgI~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~~~~~dp~vGKy~S~~~i~k~IL~~tDeeI~e~~kqi~~E 311 (359) |||||++||++++++|+|+|++||||+|+|++|||++|+++++++|||||||||++|||++||+|||+||++++|||++| T Consensus 422 Kgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~~~dpyvGky~s~~yi~k~ILr~tDeei~~e~k~I~~E 501 (516) T protein:vir:10 422 KRIITEDEWDEQINNIKVNFHQDSYYTELKDIETLRLRVDALSQIEPYVGKYVSHDYVMKNILQMTEEQIAQEEKQIEQE 501 (516) T ss_pred ccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhccccchHHHHHHHhcCCHhhHHHHHHHHHHh Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HhcCCCCCCcchhhh Q lcl|NC_015285. 312 MEAGIIADPMAEMDP 326 (359) Q Consensus 312 ~~~~~~~~P~~~~~~ 326 (359) +++|+|++|+++++= T Consensus 502 ~~~~~~~~p~~~~~f 516 (516) T protein:vir:10 502 AGIKRFQNPENEDDF 516 (516) T ss_pred hhCCCCCCCCccccC Confidence 999999999876544 No 12 >protein:vir:101189 Length: 516 # NCBI annotation: portal vertex protein of head # Family: family:all:1036 # MgeID: mge:1582 # MgeName: 44RR2.8t # Cross-refs: genbank:acc:NP_932511;genbank:gi:37651637;genbank:GeneID:2610682 Probab=100.00 E-value=8.6e-196 Score=1090.00 Aligned_cols=326 Identities=40% Similarity=0.740 Sum_probs=318.1 Q ss_pred CCCchhhHHHhhhhhhheeecccc-------ccccCCCceeecHhHhhhhhcccccCCCCcchhhHHHHHHHHHHHHHHH Q lcl|NC_015285. 1 MRGVDLNQQLTQKAAEYFLYNPKG-------LKNSTNQGMKITTDSVTYCHSGIQDLNKNMTLSHLHKAIKAVNQLRMIE 73 (359) Q Consensus 1 ~~~~~~~~~~~~~~~e~f~yn~~~-------~~~~~~~~v~i~~~ai~y~hSGl~d~~~~~i~syL~~Aik~~NqL~m~E 73 (359) .+++++|++|+++++|||+|+|++ ...+++++||||+||||||||||+|||+|+|+||||+|||||||||||| T Consensus 182 ~~~~~~~~~v~~~~~e~~~Y~~~~~~~~~~g~~~~~~~~ikI~~dAI~y~hSGL~d~~~~~i~syLhkAiKp~NQLkm~E 261 (516) T protein:vir:10 182 VTSDIGGTTIVKGYREFFIYTTGNEGYSYNGRIFEPNTRIKIPRSAVVYASSGLMDCSDRGIIGYLHNAVKPANQLKLLE 261 (516) T ss_pred cccccccchhhhhhhheeeeccCccccccccceeCCCcceeechhheeeecccceeCCCCceeeeehhhhHhHHhhHHHH Confidence 788999999999999999999864 3346788999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHhcCccceeEeccCCCCchHHHHHHHHHHHHhhcceEEeeCCCCcccccccchhhHhhhcccccCCCCcccee Q lcl|NC_015285. 74 DSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKMVYDANTGEIKDDKKFMSMMEDFWLPRREGGRGTEIS 153 (359) Q Consensus 74 DalVIyR~~RAPeRRvFyIDvGnlpk~KAeqYl~~iM~kyrnklvYD~~TGevkdd~~~mSMlEDywLpRReGgrgTEIs 153 (359) |||||||+||||||||||||||||||.||||||++||++||||+|||++||+|+||+||||||||||||||||||||||| T Consensus 262 DAlVIYRitRAPeRRvFYIDvGnlPk~KAeqYl~~im~k~kNklvYDa~TGev~ddrk~msMlEDyWLpRReGgrgTEIt 341 (516) T protein:vir:10 262 DAMVIYRITRAPERRVFYIDVGNMNNRKATEYVNGIMQSLKNRVVYDSNTGTVKNQKRNLSMTEDYWLMRRDGKSVTEVS 341 (516) T ss_pred hhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCceeEEeCCCCeeccchhhhhhHhhhcccccCCCCcccee Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ecCCCCCcchHHHHHHHHHHHHHhcCCCccccCCCCcccc--cchhhhhHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_015285. 154 TLPGGQNLGELEDVKYFQKKLYKALNVPSSRLETETTFNI--GRAAEITRDEVKFQKFIARLRKRFSELFTDLLKTQLIL 231 (359) Q Consensus 154 TLpGgqnLgei~DV~YF~kkLy~aL~VP~SRl~~~~~~~~--g~~~eItRDElKF~KFI~rLr~rFs~if~d~Lk~QLiL 231 (359) |||||||||||+||+||++|||+|||||+|||+++++|++ |+++||||||+||+|||.|||+||+.+|+++||+|||| T Consensus 342 TLpGgqnlgem~DV~YF~kkLy~aLnVP~sRl~~e~~~~~~~Gr~~EItRDEiKF~KFI~rLR~rFs~lf~~~L~~qLil 421 (516) T protein:vir:10 342 SLPGAQTMGDMDDVRWFNKKLYEALRIPLSRIPRDDGGMVIGGQDTAITRDELDFRKFVVQLQHDFEEIFLDPLKTNLIY 421 (516) T ss_pred eccccCCcChHHHHHHHHHHHHHHhCCCcccccCCCCceeeccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhh Confidence 9999999999999999999999999999999999999887 99999999999999999999999999999999999999 Q ss_pred cCCCChhHHHHHhhceeeeeeccchHHHHHHHHHHHHHHHHHHHhhhhcchhhhHHHHHHHHhCCCHHHHHHHHHHHHHH Q lcl|NC_015285. 232 KGVMSLEEWEDMKNHIQFDFIADNYFTELKEIEIRNERMNQVAAMDPYVGKYFSVDYMRRQVLKQTEIEIKEIDEQIASE 311 (359) Q Consensus 232 kgI~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~~~~~dp~vGKy~S~~~i~k~IL~~tDeeI~e~~kqi~~E 311 (359) |||||++||++++++|+|+|++||||+|+|++|||++|+++++++|||||||||++|||++||+|||+||++++|||++| T Consensus 422 Kgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~~~dpyvGky~s~~yi~k~ILr~tDeei~~e~k~I~~E 501 (516) T protein:vir:10 422 KRIITEDEWDEQINNIKVNFHQDSYYTELKDIETLRLRVDALSQIEPYVGKYVSHDYVMKNILQMTEEQIAQEEKQIEQE 501 (516) T ss_pred ccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhccccchHHHHHHHhcCCHhhHHHHHHHHHHh Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HhcCCCCCCcchhhh Q lcl|NC_015285. 312 MEAGIIADPMAEMDP 326 (359) Q Consensus 312 ~~~~~~~~P~~~~~~ 326 (359) +++|+|++|+++++= T Consensus 502 ~~~~~~~~p~~~~~f 516 (516) T protein:vir:10 502 AGIKRFQNPENEDDF 516 (516) T ss_pred hhCCCCCCCCccccC Confidence 999999999876544 No 13 >protein:vir:6596 Length: 521 # NCBI annotation: portal vertex protein of head # Family: family:all:1036 # MgeID: mge:139 # MgeName: RB49 # Cross-refs: genbank:acc:NP_891727;genbank:gi:33620636;genbank:GeneID:1725288 Probab=100.00 E-value=1.2e-195 Score=1089.26 Aligned_cols=327 Identities=39% Similarity=0.710 Sum_probs=317.4 Q ss_pred CCCchhhHHHhhhhhhheeeccc-------cccccCCCceeecHhHhhhhhcccccCCCCcchhhHHHHHHHHHHHHHHH Q lcl|NC_015285. 1 MRGVDLNQQLTQKAAEYFLYNPK-------GLKNSTNQGMKITTDSVTYCHSGIQDLNKNMTLSHLHKAIKAVNQLRMIE 73 (359) Q Consensus 1 ~~~~~~~~~~~~~~~e~f~yn~~-------~~~~~~~~~v~i~~~ai~y~hSGl~d~~~~~i~syL~~Aik~~NqL~m~E 73 (359) .+++..|++|+++++|||+|+|+ |.+.+++++||||+||||||||||+|||+|+|+||||+|||||||||||| T Consensus 186 ~k~~~~~~~v~~~~~e~f~Y~~~~~~~~~~g~~~~~~~~vkI~~dAI~y~hSGl~d~~~~~i~syLhkAiKp~NQLkm~E 265 (521) T protein:vir:65 186 ITEDTPEGKIYKATKEYFIYTVGNSSYCAGGQVFSPNSRVKIPRSAITYAHSGLMDCDDKYIIGYLHRAVKPANQLKLLE 265 (521) T ss_pred cccccCCcceecceeeeeeeecCCcceeccceeecCCcceeechhheeeeeccceeCCCCeeeecchhhhHhHHhhHHHH Confidence 88999999999999999999764 34557899999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHhcCccceeEeccCCCCchHHHHHHHHHHHHhhcceEEeeCCCCcccccccchhhHhhhcccccCCCCcccee Q lcl|NC_015285. 74 DSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKMVYDANTGEIKDDKKFMSMMEDFWLPRREGGRGTEIS 153 (359) Q Consensus 74 DalVIyR~~RAPeRRvFyIDvGnlpk~KAeqYl~~iM~kyrnklvYD~~TGevkdd~~~mSMlEDywLpRReGgrgTEIs 153 (359) |||||||+||||||||||||||||||.||||||++||++||||+|||++||+|+||+|+||||||||||||||||||||| T Consensus 266 DAlVIYRitRAPeRRvFYIDvGnlPk~KAeqYl~~im~k~kNklvYDa~TGev~ddrk~msMlEDyWLpRReGgrgTEIt 345 (521) T protein:vir:65 266 DAMVVYRITRAPERRVFFIDTGNMNNRKAAQHMNSVAQSFKNRVVYDASTGKLKNQQANLSMTEDYWLQRRDGKAITDVT 345 (521) T ss_pred hhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCceeEeecccccccccccccchhhhhcccccCCCCcccee Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ecCCCCCcchHHHHHHHHHHHHHhcCCCccccC--CCCcccccchhhhhHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_015285. 154 TLPGGQNLGELEDVKYFQKKLYKALNVPSSRLE--TETTFNIGRAAEITRDEVKFQKFIARLRKRFSELFTDLLKTQLIL 231 (359) Q Consensus 154 TLpGgqnLgei~DV~YF~kkLy~aL~VP~SRl~--~~~~~~~g~~~eItRDElKF~KFI~rLr~rFs~if~d~Lk~QLiL 231 (359) |||||||||||+||+||++|||+|||||+|||+ ++++|++|+++||||||+||+|||+|||+||+.+|+++||+|||| T Consensus 346 TLpGgqnlgem~DV~YF~kkLy~aLnVP~sRl~~e~~~~~~~gr~~EItRDEiKF~KFI~rLR~rFs~lf~~~L~~qLil 425 (521) T protein:vir:65 346 TLPGASGMSDIDDIRYFNRKLYEALRVPLSRSNLSDANMVIGGDGSEITRDELEFSKFIRTLQSQFSEVLRDPLKYNLIL 425 (521) T ss_pred ecccCCCcChHHHHHHHHHHHHHHhCCCceeccCCCCcceeccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhh Confidence 999999999999999999999999999999985 446899999999999999999999999999999999999999999 Q ss_pred cCCCChhHHHHHhhceeeeeeccchHHHHHHHHHHHHHHHHHHHhhhhcchhhhHHHHHHHHhCCCHHHHHHHHHHHHHH Q lcl|NC_015285. 232 KGVMSLEEWEDMKNHIQFDFIADNYFTELKEIEIRNERMNQVAAMDPYVGKYFSVDYMRRQVLKQTEIEIKEIDEQIASE 311 (359) Q Consensus 232 kgI~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~~~~~dp~vGKy~S~~~i~k~IL~~tDeeI~e~~kqi~~E 311 (359) |||||++||++++++|+|+|++||||+|+|++|||++|++++++++||||||||++|||++||+|||+||++++|||++| T Consensus 426 Kgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~~~dpyvGky~S~dyi~k~ILr~tDeei~~~~k~I~~E 505 (521) T protein:vir:65 426 KNVITEDDWDREINNIKVVFHRDSYYTEVKDAEILERRIGLIERITPYIGKYFSNQTVMRDILKYTDDQMDTEKKQIEEE 505 (521) T ss_pred hcCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhccccchHHHHHHHhccCHHHHHHHHHHHHHh Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HhcCCCCCCcchhhhc Q lcl|NC_015285. 312 MEAGIIADPMAEMDPA 327 (359) Q Consensus 312 ~~~~~~~~P~~~~~~~ 327 (359) +++|+|++|+++++.= T Consensus 506 ~~~~~~~~p~~~~~~f 521 (521) T protein:vir:65 506 ANDPRFKQTPDEIEDF 521 (521) T ss_pred hhCCCCCCCcccccCC Confidence 9999999999877433 No 14 >protein:vir:98265 Length: 524 # NCBI annotation: gp20 portal vertex of the head # Family: family:all:1036 # MgeID: mge:1667 # MgeName: RB43 # Cross-refs: genbank:acc:YP_239198;genbank:gi:66391673;genbank:GeneID:3416367 Probab=100.00 E-value=2.9e-195 Score=1087.15 Aligned_cols=326 Identities=42% Similarity=0.749 Sum_probs=314.4 Q ss_pred CCCc-hhhHHHhhhhhhheeeccccc-------cccCCCceeecHhHhhhhhcccccCCCCcchhhHHHHHHHHHHHHHH Q lcl|NC_015285. 1 MRGV-DLNQQLTQKAAEYFLYNPKGL-------KNSTNQGMKITTDSVTYCHSGIQDLNKNMTLSHLHKAIKAVNQLRMI 72 (359) Q Consensus 1 ~~~~-~~~~~~~~~~~e~f~yn~~~~-------~~~~~~~v~i~~~ai~y~hSGl~d~~~~~i~syL~~Aik~~NqL~m~ 72 (359) ++.+ +.|++|+++++|||+|||++. +++++++||||+||||||||||+|||++ |+||||+||||||||||| T Consensus 190 ~~~~~~~~~~v~~~~~e~f~Y~~~~~~~~~~g~~~~~~~~ikI~~dAIvy~hSGL~d~~~~-iisyLhkAiKp~NQLkm~ 268 (524) T protein:vir:98 190 ITETLDGGVKVFRGYREFFVYSAPKAGYTYNGQIYQANQKIKIPRSAIVYAHSGLEDCSNN-IIGYLHRAVKPANQLRLL 268 (524) T ss_pred cccccccchhhccceeeeeeeccCCCccccccceecCCCceeechhheeeeccCcccCCCC-eeeehhHhhHhHHhhHHH Confidence 4444 568999999999999998654 3458889999999999999999998876 679999999999999999 Q ss_pred HHHHHHHHHhcCccceeEeccCCCCchHHHHHHHHHHHHhhcceEEeeCCCCcccccccchhhHhhhcccccCCCCccce Q lcl|NC_015285. 73 EDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKMVYDANTGEIKDDKKFMSMMEDFWLPRREGGRGTEI 152 (359) Q Consensus 73 EDalVIyR~~RAPeRRvFyIDvGnlpk~KAeqYl~~iM~kyrnklvYD~~TGevkdd~~~mSMlEDywLpRReGgrgTEI 152 (359) ||||||||+||||||||||||||||||.||||||++||++||||+|||++||+||||+|||||||||||||||||||||| T Consensus 269 EDAlVIYRitRAPeRRvFYIDvGnlPk~KAeqYl~~im~k~kNklvYDa~TGevrddrk~msMlEDyWLpRReGgrgTEI 348 (524) T protein:vir:98 269 EDAMVIYRITRAPERRVFYIDVGQMGGNKATQYVNNIAQGLKNRVVYDARTGTVKNQQNNLSMTEDYWLMRRDGKAITEV 348 (524) T ss_pred HhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCceeEeeccCceeeccccccchhhhhcccccCCCCccce Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eecCCCCCcchHHHHHHHHHHHHHhcCCCccccCC-CCcccccchhhhhHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_015285. 153 STLPGGQNLGELEDVKYFQKKLYKALNVPSSRLET-ETTFNIGRAAEITRDEVKFQKFIARLRKRFSELFTDLLKTQLIL 231 (359) Q Consensus 153 sTLpGgqnLgei~DV~YF~kkLy~aL~VP~SRl~~-~~~~~~g~~~eItRDElKF~KFI~rLr~rFs~if~d~Lk~QLiL 231 (359) ||||||||||||+||+||++|||+|||||+|||++ +++|++|+++||||||+||+|||.|||+||+.+|+++||+|||| T Consensus 349 tTLpggqnlgem~DV~YF~kkLy~aLnVP~sRl~~~~~~f~~Gr~~EItRDEiKF~KFI~rLR~rFs~lf~~~L~~qLil 428 (524) T protein:vir:98 349 STLPGGQNFSDMDDIKWFNRKLYEALRVPLSRMPRDDGGMQIGGGGEITRDELKFSKFIRTLQIQFSPVLSDPLKTNLIA 428 (524) T ss_pred eeccccCCcChHHHHHHHHHHHHHHhCCCceeccCCCCccccccccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhh Confidence 99999999999999999999999999999999975 57999999999999999999999999999999999999999999 Q ss_pred cCCCChhHHHHHhhceeeeeeccchHHHHHHHHHHHHHHHHHHHhhhhcchhhhHHHHHHHHhCCCHHHHHHHHHHHHHH Q lcl|NC_015285. 232 KGVMSLEEWEDMKNHIQFDFIADNYFTELKEIEIRNERMNQVAAMDPYVGKYFSVDYMRRQVLKQTEIEIKEIDEQIASE 311 (359) Q Consensus 232 kgI~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~~~~~dp~vGKy~S~~~i~k~IL~~tDeeI~e~~kqi~~E 311 (359) |||||++||++++++|+|+|++||||+|+|++|||++|+++++++|||||||||++|||++||+|||+||++++|||++| T Consensus 429 Kgiit~eew~~i~~~I~~~f~~Dn~f~ElKe~Eil~~R~~~l~~~dpyvGky~s~dyi~k~ILr~tDeei~~~~k~I~~E 508 (524) T protein:vir:98 429 KKIITEDEWEENVSKISFVFQQDSYYAEVKDIEILERRLNLMSQVEGVVGKYVSHKYIMKEILRMSDEDIDEQAKLIEEE 508 (524) T ss_pred hcCCCHHHHHHHhhcceEEEeecchHHHHHHHHHHHHHHHHHHHhccccccccchHHHHHHHhccCHHHHHHHHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HhcCCCCCCcchhhhc Q lcl|NC_015285. 312 MEAGIIADPMAEMDPA 327 (359) Q Consensus 312 ~~~~~~~~P~~~~~~~ 327 (359) +++|+|++|+++++.= T Consensus 509 ~k~~~~~~p~~e~~~f 524 (524) T protein:vir:98 509 SKEERFKNPEAEEENF 524 (524) T ss_pred HhCCCCcCCccccccC Confidence 9999999999988655 No 15 >protein:vir:100598 Length: 516 # NCBI annotation: gp20 head portal vertex protein # Family: family:all:1036 # MgeID: mge:1488 # MgeName: 25 # Cross-refs: genbank:acc:YP_656382;genbank:gi:109290133;genbank:GeneID:4156576 Probab=100.00 E-value=1.1e-194 Score=1083.95 Aligned_cols=326 Identities=42% Similarity=0.765 Sum_probs=317.7 Q ss_pred CCCchhhHHHhhhhhhheeeccc-------cccccCCCceeecHhHhhhhhcccccCCCCcchhhHHHHHHHHHHHHHHH Q lcl|NC_015285. 1 MRGVDLNQQLTQKAAEYFLYNPK-------GLKNSTNQGMKITTDSVTYCHSGIQDLNKNMTLSHLHKAIKAVNQLRMIE 73 (359) Q Consensus 1 ~~~~~~~~~~~~~~~e~f~yn~~-------~~~~~~~~~v~i~~~ai~y~hSGl~d~~~~~i~syL~~Aik~~NqL~m~E 73 (359) .+++.+|++|+++++|||+|+++ |...+++++||||+||||||||||+|||+++|+||||+|||||||||||| T Consensus 182 ~~~~~~~~~v~~~~~e~~~Y~~~~~~~~~~g~~~~~~~~ikI~~daI~y~hSGl~d~~~~~i~syLhkAiKp~NQLkm~E 261 (516) T protein:vir:10 182 VTSDVGGTSVVKGYREFFVYTTGNEGYAYNGRLFEPNTRIKIPRSAIVYAHSGLQDCSDRGIVGYLHNAVKPANQLKLLE 261 (516) T ss_pred ecccCcchhhhhceeeeeeeecCccceeccccccCCCCceecchhheeeeecCcccCCCCceeceehhhhHhHHhhHHHH Confidence 78889999999999999999654 33356888999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHhcCccceeEeccCCCCchHHHHHHHHHHHHhhcceEEeeCCCCcccccccchhhHhhhcccccCCCCcccee Q lcl|NC_015285. 74 DSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKMVYDANTGEIKDDKKFMSMMEDFWLPRREGGRGTEIS 153 (359) Q Consensus 74 DalVIyR~~RAPeRRvFyIDvGnlpk~KAeqYl~~iM~kyrnklvYD~~TGevkdd~~~mSMlEDywLpRReGgrgTEIs 153 (359) |||||||+||||||||||||||||||.||||||++||++||||+|||++||+|+||+||||||||||||||||||||||| T Consensus 262 DAlVIYRitRAPeRRvFYIDVGnLPk~KAeqYl~~iM~k~KNklvYDa~TGev~ddrk~msMlEDyWLpRReGgrgTEIt 341 (516) T protein:vir:10 262 DALVIYRITRAPERRVFYIDVGNMPNRKATEYVNGIMQSLKNRVVYDSNTGTVKNQKRNLSMTEDYWLMRRDGKSVTEVT 341 (516) T ss_pred hhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCceeEEeCCCCeeccchhhhhhHhhhcccccCCCccccee Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ecCCCCCcchHHHHHHHHHHHHHhcCCCccccCCCCcccc--cchhhhhHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_015285. 154 TLPGGQNLGELEDVKYFQKKLYKALNVPSSRLETETTFNI--GRAAEITRDEVKFQKFIARLRKRFSELFTDLLKTQLIL 231 (359) Q Consensus 154 TLpGgqnLgei~DV~YF~kkLy~aL~VP~SRl~~~~~~~~--g~~~eItRDElKF~KFI~rLr~rFs~if~d~Lk~QLiL 231 (359) |||||||||||+||+||++|||+|||||+|||+++++|++ |+++||||||+||+|||.|||+|||.+|.++||+|||| T Consensus 342 TLpGgqnlgem~DV~YF~kkLy~aLnVP~SRl~~e~~~~~~~Gr~~EItRDEiKF~KFI~rLR~rFs~lF~~~L~~qLil 421 (516) T protein:vir:10 342 SLPGAQTMGEMDDVRWFNKKLYEALRIPLSRMPRDDGGMVIGGQDMAITRDELDFRKFIVQLQHNFEEIFLDPLKTNLIY 421 (516) T ss_pred eccccCCcChHHHHHHHHHHHHHHhCCCcccccCCCCceeeccccchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhh Confidence 9999999999999999999999999999999999999887 99999999999999999999999999999999999999 Q ss_pred cCCCChhHHHHHhhceeeeeeccchHHHHHHHHHHHHHHHHHHHhhhhcchhhhHHHHHHHHhCCCHHHHHHHHHHHHHH Q lcl|NC_015285. 232 KGVMSLEEWEDMKNHIQFDFIADNYFTELKEIEIRNERMNQVAAMDPYVGKYFSVDYMRRQVLKQTEIEIKEIDEQIASE 311 (359) Q Consensus 232 kgI~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~~~~~dp~vGKy~S~~~i~k~IL~~tDeeI~e~~kqi~~E 311 (359) |||||++||++++++|+|+|++||||+|+|++|||++|++++++++||||||||++|||++||+|||+||++++|||++| T Consensus 422 KgIit~eeW~~i~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~l~~~dpyvGky~s~~yi~k~ILr~tDeei~~~~k~I~~E 501 (516) T protein:vir:10 422 KKIILESEWEEQINNIKVNFHQDSYYTELKDIETLRQRVDALSQIEPYVGKYVSHDYVMKNILQMTDEQIAQEEKQIEKE 501 (516) T ss_pred cCCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhhhhhccccchHHHHHHHhcCCHhHHHHHHHHHHHh Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HhcCCCCCCcchhhh Q lcl|NC_015285. 312 MEAGIIADPMAEMDP 326 (359) Q Consensus 312 ~~~~~~~~P~~~~~~ 326 (359) +++|+|++|+++++= T Consensus 502 ~~~~~~~~p~~e~~f 516 (516) T protein:vir:10 502 ANVKRFQNPENEDDF 516 (516) T ss_pred hhCCCCCCCCccccC Confidence 999999999987654 No 16 >protein:vir:5665 Length: 511 # NCBI annotation: portal vertex protein of head # Family: family:all:1036 # MgeID: mge:119 # MgeName: KVP40 # Cross-refs: genbank:acc:NP_899604;genbank:gi:34419591;genbank:GeneID:2546036 Probab=100.00 E-value=3.9e-193 Score=1075.46 Aligned_cols=324 Identities=42% Similarity=0.775 Sum_probs=313.2 Q ss_pred CCCchhhHHHhhhhhhheeeccccccc--------cCCCceeecHhHhhhhhcccccCC--CCcchhhHHHHHHHHHHHH Q lcl|NC_015285. 1 MRGVDLNQQLTQKAAEYFLYNPKGLKN--------STNQGMKITTDSVTYCHSGIQDLN--KNMTLSHLHKAIKAVNQLR 70 (359) Q Consensus 1 ~~~~~~~~~~~~~~~e~f~yn~~~~~~--------~~~~~v~i~~~ai~y~hSGl~d~~--~~~i~syL~~Aik~~NqL~ 70 (359) .+++++|++|++++.|||+|||++... .++++|+||+||||||||||+||| +|+|+||||+||||||||| T Consensus 175 ~~~~~~~~~v~~~~~ey~~Y~~~~~~~~~~~~~~~~~~~~vkI~~daI~y~hSGL~d~~~~~g~i~syLhkAiKp~NQLk 254 (511) T protein:vir:56 175 QKETIDGVEVVKGTLEYYVYKQSDYKMPSWMSATNRAQTSFRIPKDAIVFAHSGLMRGCADDPYIIGYLDRAIKPANQLK 254 (511) T ss_pred hcccccccccccceeeeeEecCCCcccCcccccccccccceeechhheeeecccceeccCCCCeeeccchhhhHHHHhhH Confidence 789999999999999999999976431 246789999999999999999965 5679999999999999999 Q ss_pred HHHHHHHHHHHhcCccceeEeccCCCCchHHHHHHHHHHHHhhcceEEeeCCCCcccccccchhhHhhhcccccCCCCcc Q lcl|NC_015285. 71 MIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKMVYDANTGEIKDDKKFMSMMEDFWLPRREGGRGT 150 (359) Q Consensus 71 m~EDalVIyR~~RAPeRRvFyIDvGnlpk~KAeqYl~~iM~kyrnklvYD~~TGevkdd~~~mSMlEDywLpRReGgrgT 150 (359) ||||||||||+||||||||||||||||||.||||||++||++||||+|||++||+|+||+|||||||||||||||||||| T Consensus 255 m~EDAlVIYRitRAPeRRvFYIDVGnLPk~KAeqYl~~iM~k~kNklVYDa~TGev~ddrk~msMlEDyWLpRReGgrgT 334 (511) T protein:vir:56 255 MLEDALVIYRLARAPERRVFYVDVGNLPTQKAQQYVNGIMQNVKNRVVYDTQTGQVKNTTNAMSMLEDYYLPRREGSKGT 334 (511) T ss_pred HHHhhHHHHhhhccccceEEEEecCCCCchhHHHHHHHHHHhcCceEEEeccCceeccchhhhhhHhhhcccccCCCCcc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ceeecCCCCCcchHHHHHHHHHHHHHhcCCCccccCCC---CcccccchhhhhHHhhhHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_015285. 151 EISTLPGGQNLGELEDVKYFQKKLYKALNVPSSRLETE---TTFNIGRAAEITRDEVKFQKFIARLRKRFSELFTDLLKT 227 (359) Q Consensus 151 EIsTLpGgqnLgei~DV~YF~kkLy~aL~VP~SRl~~~---~~~~~g~~~eItRDElKF~KFI~rLr~rFs~if~d~Lk~ 227 (359) ||||||||||||||+||+||++|||+|||||+|||+++ ++|++||++||||||+||+|||.|||+||+.+|+++||+ T Consensus 335 EItTLpGgqnlgem~DV~YF~kKLy~aLnVP~SRl~~e~q~~~f~~Gr~~EItRDEiKF~KFI~RLR~rFs~lF~~~Lk~ 414 (511) T protein:vir:56 335 EVSTLPGGQSLGDIEDVLYFNRKLYKAMRIPTSRAASEDQTGGINFGQGAEITRDELKFTKFVKRLQTKFETVITDPLKH 414 (511) T ss_pred ceeeccccCCcChHHHHHHHHHHHHHHhCCCcccccCCCCccccccccchhhhHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 99999999999999999999999999999999999877 489999999999999999999999999999999999999 Q ss_pred HHHhcCCCChhHHHHHhhceeeeeeccchHHHHHHHHHHHHHHHHHHHhhhhcchhhhHHHHHHHHhCCCHHHHHHHHHH Q lcl|NC_015285. 228 QLILKGVMSLEEWEDMKNHIQFDFIADNYFTELKEIEIRNERMNQVAAMDPYVGKYFSVDYMRRQVLKQTEIEIKEIDEQ 307 (359) Q Consensus 228 QLiLkgI~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~~~~~dp~vGKy~S~~~i~k~IL~~tDeeI~e~~kq 307 (359) |||||||||++||++++++|+|+|++||||+|+|++|||++|++++++++||||||||++|||++||+|||+||+++++| T Consensus 415 qLilKgiit~eeW~~i~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~l~~~dpyvGky~S~~yi~k~ILr~tDeei~~~~k~ 494 (511) T protein:vir:56 415 QLIVNNIITEEEWDANHEKLYVVFNQDSYFEEAKELEILNSRMNAMRDIQDYAGKYYSHKYIQKNILRLSDDQITAMQSE 494 (511) T ss_pred hhhhccCCCHHHHHHHhhcceEEeeecchHHHHHHHHHHHHHHHHHHHhcchhccccchHHHHHHHhccCHHHHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHhcCCCCCCcchh Q lcl|NC_015285. 308 IASEMEAGIIADPMAEM 324 (359) Q Consensus 308 i~~E~~~~~~~~P~~~~ 324 (359) |++|+++|+|++|++.- T Consensus 495 I~~E~k~~~~~~~e~~f 511 (511) T protein:vir:56 495 IDEEETNPRFQQDDQGF 511 (511) T ss_pred HHHhhcCCCCCCcccCC Confidence 99999999999998622 No 17 >protein:vir:5839 Length: 533 # NCBI annotation: similar to portal vertex protein of head # Family: family:all:1036 # MgeID: mge:123 # MgeName: RM 378 # Cross-refs: genbank:acc:NP_835625;genbank:gi:30044028 Probab=100.00 E-value=7.5e-168 Score=936.82 Aligned_cols=340 Identities=22% Similarity=0.366 Sum_probs=290.7 Q ss_pred CCCchhhHH--------------Hhhhhhhheeeccccccc-cCCCceeecHhHhhhhhcccccCCCCcchhhHHHHHHH Q lcl|NC_015285. 1 MRGVDLNQQ--------------LTQKAAEYFLYNPKGLKN-STNQGMKITTDSVTYCHSGIQDLNKNMTLSHLHKAIKA 65 (359) Q Consensus 1 ~~~~~~~~~--------------~~~~~~e~f~yn~~~~~~-~~~~~v~i~~~ai~y~hSGl~d~~~~~i~syL~~Aik~ 65 (359) ++..+.|+. .+.+-.|||+|||++.++ +++++++||+|||+||||||+|||+++|+||||+|||| T Consensus 147 ik~~k~GI~elr~lDPr~i~~vr~~~t~~eyyvy~~~~~~~~s~~~~~kI~~daI~y~~SGl~d~~~~~iisyLhkAiKp 226 (533) T protein:vir:58 147 EKGSDGTIEKFQVVSPYIFSKRYNPETDTWYYVITDVYRNVVSGYFNEDIPEEDVIHFSHKIDTNFFPYGRSYLESARAI 226 (533) T ss_pred cCCcccchhhheecCCeeeEEEEeeccceEEEeecccccccccCccccccchhheeeeeeccccCCCCceehhhhHHHHH Confidence 333444431 012236999999999865 67888999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHhcCccceeEeccCCCCchHHHHHHHHHHHHhhcceEEeeCCCCcccccccch---hhHhhhccc Q lcl|NC_015285. 66 VNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKMVYDANTGEIKDDKKFM---SMMEDFWLP 142 (359) Q Consensus 66 ~NqL~m~EDalVIyR~~RAPeRRvFyIDvGnlpk~KAeqYl~~iM~kyrnklvYD~~TGevkdd~~~m---SMlEDywLp 142 (359) |||||||||||||||+||||+|||||||||||||.||+|||++||++||||+|||++||+|+||+++| ||||||||| T Consensus 227 ~NQLkmiEDAlVIYRisRAPeRRvFYIDVGNlpk~KAeqYl~~im~k~kNklvYDa~TGev~ddrk~m~~~sMlEDyWLp 306 (533) T protein:vir:58 227 WNQLRLMEDALMLYRVVRSVDRRVFYVDVGNVPPDKINEYLTNIAMQYKRDYWVRNNQNQFLGIDNYFSIESILKDYFIP 306 (533) T ss_pred HHHHHHHHHHHHHHhhcCChhheEEEEeecCCCccCHHHHHHHHHHhcccceEEeccCCeEeeccchhhhhhhHhhhccc Confidence 99999999999999999999999999999999999999999999999999999999999999999998 999999999 Q ss_pred ccCCCCccceeecCCCCCcchHHHHHHHHHHHHHhcCCCccccCCCCcccccchhhhhHHhhhHHHHHHHHHHHHHHHHH Q lcl|NC_015285. 143 RREGGRGTEISTLPGGQNLGELEDVKYFQKKLYKALNVPSSRLETETTFNIGRAAEITRDEVKFQKFIARLRKRFSELFT 222 (359) Q Consensus 143 RReGgrgTEIsTLpGgqnLgei~DV~YF~kkLy~aL~VP~SRl~~~~~~~~g~~~eItRDElKF~KFI~rLr~rFs~if~ 222 (359) |||||||||||||||| |||+|+||+||++|||+|||||+|||+++++| ||++|||||||||+|||+|||++|+++| T Consensus 307 RReGgrgTEI~TLpGg-~lgemeDV~YF~kkLy~ALnVP~sRl~~e~~f--gr~~eItRDEiKF~KFI~rLR~rF~~ll- 382 (533) T protein:vir:58 307 RRGDRRAVEIDILQGS-KVDLAEDVEYMLNRLISALKVPKAFIGYEGDV--NAKNTLATQDIKFNNTIKRIQGFFVEEL- 382 (533) T ss_pred ccCCCccceeeecCCC-CCCcHHHHHHHHHHHHHHhCCCeeecCCCCCC--ccchhhhHHHHHHHHHHHHHHHHHHHHH- Confidence 9999999999999997 59999999999999999999999999999987 9999999999999999999999998877 Q ss_pred HHHHHHHHhcCCCChhHHHHHhhceeeeeeccchHHHHHHHHHHHHHHHHHHHhhhhcchhhhHHHHHHHHhCCCHHHHH Q lcl|NC_015285. 223 DLLKTQLILKGVMSLEEWEDMKNHIQFDFIADNYFTELKEIEIRNERMNQVAAMDPYVGKYFSVDYMRRQVLKQTEIEIK 302 (359) Q Consensus 223 d~Lk~QLiLkgI~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~~~~~dp~vGKy~S~~~i~k~IL~~tDeeI~ 302 (359) ++||+||||||++|| +|+|++||||+|+|++|||++|++++++++||||| +|||++||+||| ||+ T Consensus 383 ---~~qLilk~iit~eew-------~~~f~~Dn~f~ElKe~Eil~~Ri~~l~~~dpyvgk----~yi~k~ILr~td-ei~ 447 (533) T protein:vir:58 383 ---ERMVRMNKEFADQDF-------RLVMNRSNSIVEGERFAVIEQRIGIAERLKGWVRE----DWIYSNILQIPY-DLK 447 (533) T ss_pred ---hcccccccCcchhhe-------eeeeeccchHHHHHHHHHHHHHHHHHHHhcchhhH----HHHHHHHhcCCh-hhh Confidence 559999999999999 59999999999999999999999999999999998 589999999998 677 Q ss_pred HHHHHHHHHHhcCCCCCCcchhhhcCCC-CCcccccccCC--CCCcCCCCCCCC------CccCCC Q lcl|NC_015285. 303 EIDEQIASEMEAGIIADPMAEMDPAMAA-GGEGAPAAEVD--PNAQESSVDPGD------VRRGEF 359 (359) Q Consensus 303 e~~kqi~~E~~~~~~~~P~~~~~~~~~~-~~~~~~~~~~~--~~~~~~~~~p~~------~~~~~~ 359 (359) +++++|++|+++|+|++|+.+.+.+... .|....|.+.. +.+-.....|+. ...|.| T Consensus 448 ~q~e~ie~E~~~~~~~~~~~~~e~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~ 513 (533) T protein:vir:58 448 PQEEVAEAAGGGGLFDTGGFGEETTPADFLGERGSPIESPRGRTEFDFGTEGGEELGGELNLGGAF 513 (533) T ss_pred HHHHHHHHhhcCCCCCCCCcccccCCcccCccccCcccCCCChhhHhcccCCcccccccccccccc Confidence 7778999999999999987544433221 22222122111 111111111111 112222 No 18 >protein:vir:107742 Length: 537 # NCBI annotation: gp28 # Family: family:all:297 # MgeID: mge:1520 # MgeName: BcepB1A # Cross-refs: genbank:acc:YP_024875;genbank:gi:48697517;genbank:GeneID:2948359 Probab=98.33 E-value=2.5e-07 Score=56.75 Aligned_cols=294 Identities=10% Similarity=0.048 Sum_probs=123.5 Q ss_pred CCCch-----hhHHHhh--hhhhheeeccccccc-----------cCCC---------ceeecHhHhhhhhcccccCCCC Q lcl|NC_015285. 1 MRGVD-----LNQQLTQ--KAAEYFLYNPKGLKN-----------STNQ---------GMKITTDSVTYCHSGIQDLNKN 53 (359) Q Consensus 1 ~~~~~-----~~~~~~~--~~~e~f~yn~~~~~~-----------~~~~---------~v~i~~~ai~y~hSGl~d~~~~ 53 (359) ..|+. .+++-++ ++....+++|.+... +++. +.+|+. |=|+..++. T Consensus 184 ~~D~~~~~~Pl~~~~i~kg~~k~l~vidp~~~~~~~~~~~~~dp~sp~fg~P~~y~v~g~~iH~-------SRli~f~g~ 256 (537) T protein:vir:10 184 SPDPYYYEKPFNIDGVMPGAYKGIVQIDPYWCAPLLDAQASSNPVSMHFYEPTYWLINGKKYHR-------SHLAIYIND 256 (537) T ss_pred CcCCcccccccccccccccceeEEEEechhhcccccchhhhccCCccccCCceeeeecCeEecc-------eeEEEecCC Confidence 01111 0111111 223334445433211 1111 234444 333333333 Q ss_pred cc------------hhhHHHHHHHHHHHHHHHH--HHHHHHHhcCccceeEeccCCC-CchHHHHHHHHHHHHhhcceEE Q lcl|NC_015285. 54 MT------------LSHLHKAIKAVNQLRMIED--SLVIYRLSRAPERRIFYIDVGN-LPKNKAEQYLREVMGRYRNKMV 118 (359) Q Consensus 54 ~i------------~syL~~Aik~~NqL~m~ED--alVIyR~~RAPeRRvFyIDvGn-lpk~KAeqYl~~iM~kyrnklv 118 (359) -+ .|-|+++...+.+...... +.++|+- - =+++.+|... |....+..-.-+.++++|+ T Consensus 257 ~~p~~~~~~~~~~G~Svlq~~~~~l~~~~~t~~~~~~l~~~~---~-~~v~k~~~~~~l~~~~~~~~r~~~~~~~r~--- 329 (537) T protein:vir:10 257 EVVDFLKPSYIYGGVPLPQQIMERVYAAERTANEGPMLAMTK---R-QTVLKVDAAQVLANKQQFDETMSWWTATRD--- 329 (537) T ss_pred CCchhhhcccCcccccHHHHHHHHHHHHHHHHHHHHHHHHhc---C-CceeeechHHhhcCHHHHHHHHHHHHhhcC--- Confidence 22 3445554443333322111 2233322 1 2355555311 1111111111112333332 Q ss_pred eeCCCCcccccccchhhHhhhcccccCCCCccceeecCCCCCcchHHHH-HHHHHHHHHhcCCCccccCCC--Ccccccc Q lcl|NC_015285. 119 YDANTGEIKDDKKFMSMMEDFWLPRREGGRGTEISTLPGGQNLGELEDV-KYFQKKLYKALNVPSSRLETE--TTFNIGR 195 (359) Q Consensus 119 YD~~TGevkdd~~~mSMlEDywLpRReGgrgTEIsTLpGgqnLgei~DV-~YF~kkLy~aL~VP~SRl~~~--~~~~~g~ 195 (359) ..|-+- + -.+ +-+++++- .+|+-++|+ ..|...+=.+++||+.||-.+ +|++-.. T Consensus 330 ---n~g~~~-------i-------d~e---~e~~e~~~--~~lsgl~~~l~~~~~~iAa~~~IP~t~L~G~sp~GlnatG 387 (537) T protein:vir:10 330 ---NYQVRV-------V-------DKD---NEDVVQID--TTLNDLDKVIMNQYQLVCAIARTPAPKMLGTVPTGFNSTG 387 (537) T ss_pred ---CcceeE-------e-------cCC---CceeEEEe--ccCCCHHHHHHHHHHHHHhhhCCCceeeccCCccccccch Confidence 111100 0 000 01122221 245556664 567777888889999998443 4775421 Q ss_pred hhhhhHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCChhHHHHHhhceeeeeeccchHHHHHHHHHHHHHHHHHHH Q lcl|NC_015285. 196 AAEITRDEVKFQKFIARLRKRFSELFTDLLKTQLILKGVMSLEEWEDMKNHIQFDFIADNYFTELKEIEIRNERMNQVAA 275 (359) Q Consensus 196 ~~eItRDElKF~KFI~rLr~rFs~if~d~Lk~QLiLkgI~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~~~~ 275 (359) -. |.-.|+.+|+++|.++..++..+++. |+...+..+ ..++|.|..=...++...+|+...+.++++. T Consensus 388 e~----D~~~yyd~I~~~Qe~l~p~l~~l~~l-l~~~~~~~~-------~~~~i~f~pL~~~s~kEkAei~~~~a~a~~~ 455 (537) T protein:vir:10 388 DY----EEASYHEECESTQDDMRPLIDRHHQL-VCRSHLRKR-------IRVKVEFPPMDAPKESERADTFLKKMQAAKL 455 (537) T ss_pred hH----HHHHHHHHHHHHHHHHHHHHHHHHHH-HHHhcCCCC-------cceEEEeCCCCCCCHHHHHHHHHHHHHHHHH Confidence 11 33349999999999888877776643 222222222 2577888877788898999999999998887 Q ss_pred hhhhcchhhhHHHHHHHHhCCCHHHHHHHHHHHHHHHhcCCCCCCcchhhhc---------------CCCCCcccccccC Q lcl|NC_015285. 276 MDPYVGKYFSVDYMRRQVLKQTEIEIKEIDEQIASEMEAGIIADPMAEMDPA---------------MAAGGEGAPAAEV 340 (359) Q Consensus 276 ~dp~vGKy~S~~~i~k~IL~~tDeeI~e~~kqi~~E~~~~~~~~P~~~~~~~---------------~~~~~~~~~~~~~ 340 (359) +-.- -.+|.+-++.. |+...+ .--.++.+.++.++.+. .|-+++.+++. - T Consensus 456 ~~~~--G~i~~~Evr~~-L~~~~~-----------~g~~~l~~~~~~ed~e~~~~~~~~~~~~~~~~~~~~~~~~~~~-~ 520 (537) T protein:vir:10 456 AFEM--GAVDGVDVNEY-LRMDPT-----------LGFTSITPAMRPTDAEDIDVDDEGKPVRIIEDQPAPSEMFGAT-S 520 (537) T ss_pred HHHc--CCCCHHHHHHH-HhccCc-----------cccccccCCCChhhhhcccCCccCCcCCCCCCCCCccccCCCC-c Confidence 6443 24566655544 322110 00011222221111111 11111111110 0 Q ss_pred CCCCcCCCCCCCCCccC Q lcl|NC_015285. 341 DPNAQESSVDPGDVRRG 357 (359) Q Consensus 341 ~~~~~~~~~~p~~~~~~ 357 (359) ..+-++.+.++|.+-+- T Consensus 521 ~~~~~~~~~~~~a~~~~ 537 (537) T protein:vir:10 521 SGESANDPRDSGAAFED 537 (537) T ss_pred cccccCCCccCccccCC Confidence 11111222222222222 No 19 >protein:vir:2500 Length: 501 # NCBI annotation: putative portal gp5 # Family: family:all:524 # MgeID: mge:53 # MgeName: TM4 # Cross-refs: genbank:acc:NP_569741;genbank:gi:18496891;genbank:GeneID:932330 Probab=98.20 E-value=1.8e-06 Score=52.14 Aligned_cols=298 Identities=13% Similarity=0.121 Sum_probs=145.2 Q ss_pred CCCchhhHHHhhhh------------hhheeeccccccc--------cC-----------CCceeecHhHhhh-hhc-cc Q lcl|NC_015285. 1 MRGVDLNQQLTQKA------------AEYFLYNPKGLKN--------ST-----------NQGMKITTDSVTY-CHS-GI 47 (359) Q Consensus 1 ~~~~~~~~~~~~~~------------~e~f~yn~~~~~~--------~~-----------~~~v~i~~~ai~y-~hS-Gl 47 (359) ..|+.++..+.-.+ ....+|.+...+. .. +.....+...... -|- |. T Consensus 160 y~D~~~~~~~~~ai~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 239 (501) T protein:vir:25 160 YADPSVDAWPQYALETWVAQKDAKPHRRGVLYDDTYMYELDLGEVVLGDAGGGQATQQPVNVREVTDVIEHGATFEGKPV 239 (501) T ss_pred EecCCCCcceeEEEEEEeeccccCcceeEEEecCeeEEEEecCceeeeeccccccccccccccccccccccccccCCccc Confidence 12222222111111 1122333321110 00 0000111111000 011 11 Q ss_pred cc----CC----CCcchhhHHHHH---HHHHHHHHHHHHHHHHHHhcCccceeEeccCCCCchHHHHHHHHHHHHhhcce Q lcl|NC_015285. 48 QD----LN----KNMTLSHLHKAI---KAVNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNK 116 (359) Q Consensus 48 ~d----~~----~~~i~syL~~Ai---k~~NqL~m~EDalVIyR~~RAPeRRvFyIDvGnlpk~KAeqYl~~iM~kyrnk 116 (359) ++ +| ++...|-++..+ ..+| +++-+.+++-..+-.|.|-+.=.+....+..++ +.+ T Consensus 240 vPiv~f~N~~~~~~~g~sdie~v~~l~Da~~--~~~s~~~~~~e~~a~p~~~i~G~~~~~~~~~~~----------~~~- 306 (501) T protein:vir:25 240 CPVVRFVNGRDADDMIVGEVAPLILLQQAIN--SVNFDRLIVSRFGANPQRVISGWTGSKAEVLKA----------SAL- 306 (501) T ss_pred eeeEeccCccccCccccchhhhhHHHHHHHH--HHHHHHHHHHHhhccHHHHHhCCCCCccchhhh----------ccc- Confidence 11 11 122334333322 3333 245567777777777877776544333321111 111 Q ss_pred EEeeCCCCcccccccchhhHhhhcccccCCCCccceeecCCCCCcchHHHHHHHHHHHHHhcCCCccccCCCCcccccch Q lcl|NC_015285. 117 MVYDANTGEIKDDKKFMSMMEDFWLPRREGGRGTEISTLPGGQNLGELEDVKYFQKKLYKALNVPSSRLETETTFNIGRA 196 (359) Q Consensus 117 lvYD~~TGevkdd~~~mSMlEDywLpRReGgrgTEIsTLpGgqnLgei~DV~YF~kkLy~aL~VP~SRl~~~~~~~~g~~ 196 (359) ..|+. +| -+++|.++|+..-=+-++-++-.-..+...-++|..-|+..++ |. .+ T Consensus 307 ---------------------~i~~~--~~-~~~~~~q~~~~~~~~~~~~l~~~i~~i~~~s~~P~~~~~~~~~-N~-Sg 360 (501) T protein:vir:25 307 ---------------------RVWTF--ED-PEVKAQAFPPASVEPYNLILEEMLQHVAMVAQISPAQVTGKMI-NV-SA 360 (501) T ss_pred ---------------------ceecc--CC-CCceEEEecccChHHHHHHHHHHHHHHHhhcCCChhhhccccC-Ch-HH Confidence 12332 12 2356777876542223344555555666677888776653221 22 34 Q ss_pred hhhhHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCChhHHHHHhhceeeeeeccchHHHHHHHHHHHHHHHHHHHh Q lcl|NC_015285. 197 AEITRDEVKFQKFIARLRKRFSELFTDLLKTQLILKGVMSLEEWEDMKNHIQFDFIADNYFTELKEIEIRNERMNQVAAM 276 (359) Q Consensus 197 ~eItRDElKF~KFI~rLr~rFs~if~d~Lk~QLiLkgI~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~~~~~ 276 (359) ..|.--+....+-+.+.|+.|..-+.++++.-+.++|.....+| ..|.+.|..-..=+. .+..+++..+ T Consensus 361 ~Al~~~~~~l~~ka~~k~~~f~~~l~~~~rl~~~~~~~~~~~~~----~~i~v~w~~~~~~s~-------~~~ada~~kl 429 (501) T protein:vir:25 361 EALAAAEANQQRKLAAKRESFGESWEQLLRLAAEMDDDPDTAAD----SGAEVLWRDTEARSF-------GAVVDGITKL 429 (501) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCccccc----eeeeEEecCCCCCCH-------HHHHHHHHHH Confidence 45566666678889999999999999999999989986543333 246777754332222 4556666666 Q ss_pred hhhcchhhhHHHHHHHHhCCCHHHHHHHHHHHHHHHhcCCCCCCcchhhhcCCCCCcccccccCCCCCcCCCCCCCCCc Q lcl|NC_015285. 277 DPYVGKYFSVDYMRRQVLKQTEIEIKEIDEQIASEMEAGIIADPMAEMDPAMAAGGEGAPAAEVDPNAQESSVDPGDVR 355 (359) Q Consensus 277 dp~vGKy~S~~~i~k~IL~~tDeeI~e~~kqi~~E~~~~~~~~P~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~ 355 (359) ..- | +|.+++...++++|++||+++.++.+++...+.+..+.. .++.++.++++.. .....++..++|..+- T Consensus 430 ~~~-g--is~et~~~~~~g~~~~~ie~~~~~~~e~~~~~~~~~~~~-~~~~~~~~~~~~~---~~~~~~~~~~~~~~g~ 501 (501) T protein:vir:25 430 ASA-G--IPIEHLLSMVPGMTQQTIQAIKDSLRGGEVKSLVDKLLS-NEPAPVPPPPPQA---AAQALNEGGVNGNGGA 501 (501) T ss_pred Hhc-C--CCHHHHHHHcCCCCHHHHHHHHHHHHHHhHHHHHHHhhc-cCcCCCCCCCCCC---CccccccccCCCCCCC Confidence 543 3 699999999999999999988888777766554432221 1111111111111 1112223333443333 No 20 >protein:vir:99916 Length: 504 # NCBI annotation: gp3 # Family: family:all:524 # MgeID: mge:1611 # MgeName: Halo # Cross-refs: genbank:acc:YP_655520;genbank:gi:109392290;genbank:GeneID:4157085 Probab=98.12 E-value=2.5e-06 Score=51.33 Aligned_cols=309 Identities=15% Similarity=0.115 Sum_probs=152.4 Q ss_pred CCCchhhH-----HHh-----hhhhhheeeccccccc--------------cCCCceeecHhHhhhhhcccccCCCC--- Q lcl|NC_015285. 1 MRGVDLNQ-----QLT-----QKAAEYFLYNPKGLKN--------------STNQGMKITTDSVTYCHSGIQDLNKN--- 53 (359) Q Consensus 1 ~~~~~~~~-----~~~-----~~~~e~f~yn~~~~~~--------------~~~~~v~i~~~ai~y~hSGl~d~~~~--- 53 (359) +-|+.++. .++ .......+|.|...+. .+..| +| .|.|+|..-.+...| T Consensus 155 iyD~~~~~~~~a~~~~~~d~~g~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~g--vP--vV~~~n~~~~~~~~G~se 230 (504) T protein:vir:99 155 EWNSRRNAMDSLLSITSRDAEGHPTGIALYEDGVTVTADMDDDGDWHADVRTHKLG--VP--VEVLPYKPREDRPLGSSR 230 (504) T ss_pred EEeCCCCceeEEEEEEEecCCCeEEEEEEEcCCcEEEEEEcCCceeeeccccCCCC--cc--eEEecccccCccccCccc Confidence 11111000 000 0011122333322110 01113 34 577777643332211 Q ss_pred ---cchhhHHHHHHHHHHHHHHHHHHHHHHHhcCccceeEeccCCCCchHHHHHHHHHHHHhhcceEEeeCCCCccc-cc Q lcl|NC_015285. 54 ---MTLSHLHKAIKAVNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKMVYDANTGEIK-DD 129 (359) Q Consensus 54 ---~i~syL~~Aik~~NqL~m~EDalVIyR~~RAPeRRvFyIDvGnlpk~KAeqYl~~iM~kyrnklvYD~~TGevk-dd 129 (359) -+++..+++-+ .+-+.++.=..+=.|.|-|+=.+-..++... |+.. .- T Consensus 231 i~~~v~~l~Da~~~------~~~~~~~~~e~~a~p~r~i~G~~~~~~~~~d----------------------~~~~~~~ 282 (504) T protein:vir:99 231 ITRPVMSLQQRALK------GCIRMDGHADVYSFPQLILLGADAKNFRNKD----------------------GSMKPAW 282 (504) T ss_pred chhhHHHHHHHHHH------HHHHHHHHHHHhcchhhhhccCCcccccccc----------------------ccccchh Confidence 23444444443 4555666666666777666533322211111 1100 00 Q ss_pred ccchhhHhhhcccccC-----CCCccceeecCCCCCcchH-HHHHHHHHHHHHhcCCCccccCCCCcccccchhhhhHHh Q lcl|NC_015285. 130 KKFMSMMEDFWLPRRE-----GGRGTEISTLPGGQNLGEL-EDVKYFQKKLYKALNVPSSRLETETTFNIGRAAEITRDE 203 (359) Q Consensus 130 ~~~mSMlEDywLpRRe-----GgrgTEIsTLpGgqnLgei-~DV~YF~kkLy~aL~VP~SRl~~~~~~~~g~~~eItRDE 203 (359) +..++ .=+.||.-+ ++-.+++..++++. |.-. +-++-.-..+...-++|..-|+..+.-|-..+..|.-.+ T Consensus 283 ~~~~~--~i~~~~~~~~~~~~~~~~~~~~q~~~~~-l~~~~~~l~~~i~~~a~~t~~P~~~lG~~~~~n~sSa~Ai~~~~ 359 (504) T protein:vir:99 283 QIALA--RVFALPDDEDEPDAARARADVKQFPASS-PQPHIEMLEQIAMMFSGETSIPVESLGFSNRANPTSADAYIASR 359 (504) T ss_pred hhhhh--hhhcCCCccccccccCccceeeecCCCC-hHHHHHHHHHHHHHHHhhhCCCHHHhcccccccccHHHHHHHHH Confidence 00000 112233321 22346788888875 4432 335555556666689998888543322222445677778 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCC--hhHHHHHhhceeeeeeccchHHHHHHHHHHHHHHHHHHHhhhhcc Q lcl|NC_015285. 204 VKFQKFIARLRKRFSELFTDLLKTQLILKGVMS--LEEWEDMKNHIQFDFIADNYFTELKEIEIRNERMNQVAAMDPYVG 281 (359) Q Consensus 204 lKF~KFI~rLr~rFs~if~d~Lk~QLiLkgI~t--~eew~~~~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~~~~~dp~vG 281 (359) ....+-+.+.|++|..-+.+.+|.-+.+.+... ..+| ..+.+.|..-..=+ +.++.+++..+..-+. T Consensus 360 ~~L~~ka~~k~~~f~~~l~~~~rla~~~~~~~~~~~~~~----~~~~v~w~d~~~~s-------~a~~aDa~~Kl~~ag~ 428 (504) T protein:vir:99 360 EDLIAEAEGATDDWSPAFRRSMIRALAIKNGLDRIPPEW----KTIDSKFRSPLYLS-------KAAQADAGAKMLGAGP 428 (504) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccc----ccceeEecCCCccC-------HHHHHHHHHHHHhhcc Confidence 888899999999999999999999887766543 2333 35677775333222 2556777777666555 Q ss_pred hhhhHHHHHHHHhCCCHHHHHHHHHHHHHHHhcCCCCCCcchhhh--cCCCCCcccccccCCCCCcCCCCCCCCCccC Q lcl|NC_015285. 282 KYFSVDYMRRQVLKQTEIEIKEIDEQIASEMEAGIIADPMAEMDP--AMAAGGEGAPAAEVDPNAQESSVDPGDVRRG 357 (359) Q Consensus 282 Ky~S~~~i~k~IL~~tDeeI~e~~kqi~~E~~~~~~~~P~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~ 357 (359) .+++...++.+.|++|++||+.+..+.+++...+.+-...+ .++ .++.+++.+++++ .+.-.+++..+...+.| T Consensus 429 ~l~~~~~~l~~~lg~~~~ei~r~~~e~~~~~~~~~~~~l~~-~~~~~~~~~~~~~~~~~e-~a~~~~~~~~~~p~~~~ 504 (504) T protein:vir:99 429 EWLKETEVGLELLGLTPQQAKRALAERRRASSVSIIEALNR-RQQEAATAGEDQDQGAGE-PPANEPPAALGRPTLVG 504 (504) T ss_pred ccccchHHHHhhcCCCHHHHHHHHHHHHHHhhHHHHHHHhc-ccCCCCCCCCCCCcCCCC-CCCCCCCccCCCcccCC Confidence 56776666667789999999977766666544333211111 111 1111122222221 11222333334444556 No 21 >protein:vir:96068 Length: 765 # NCBI annotation: conserved hypothetical protein ORF017 # Family: family:all:297 # MgeID: mge:1597 # MgeName: F8 # Cross-refs: genbank:acc:YP_001294434;genbank:gi:149408331;genbank:GeneID:5237187 Probab=98.05 E-value=2.8e-06 Score=51.02 Aligned_cols=296 Identities=13% Similarity=0.142 Sum_probs=130.7 Q ss_pred CC--Cch-----hhHH-Hhhh-hhhheeeccccccc-----------cCC---------CceeecHhHhhhhhcccccCC Q lcl|NC_015285. 1 MR--GVD-----LNQQ-LTQK-AAEYFLYNPKGLKN-----------STN---------QGMKITTDSVTYCHSGIQDLN 51 (359) Q Consensus 1 ~~--~~~-----~~~~-~~~~-~~e~f~yn~~~~~~-----------~~~---------~~v~i~~~ai~y~hSGl~d~~ 51 (359) ++ |+. .+.+ |-++ +.-+-+++|.+... +++ .+-+|+.| =|+..+ T Consensus 188 i~~~D~~~l~~PL~~~~I~kg~~kgl~vldp~~~~~~~v~e~~~Dp~sp~fg~P~~y~i~g~~IH~S-------Rli~~~ 260 (765) T protein:vir:96 188 VESDDPDYYEKPFNPDGIAPGSYKGISQIDPYWAMPQLTAESTADPSAEHFYEPDFWIISGKKYHRS-------HLVVVR 260 (765) T ss_pred ecccCcchhhccccccccccceeeEEEEechhhcccccchhccccccccccCcceeeeecCceeccc-------eEEEec Confidence 11 110 0111 1111 11122222221110 110 11244444 333333 Q ss_pred CCcc------------hhhHHHHHHHHHHHHHH--HHHHHHHHHhcCccceeEeccCCCC-chHHHHHHHHHHHHhhcce Q lcl|NC_015285. 52 KNMT------------LSHLHKAIKAVNQLRMI--EDSLVIYRLSRAPERRIFYIDVGNL-PKNKAEQYLREVMGRYRNK 116 (359) Q Consensus 52 ~~~i------------~syL~~Aik~~NqL~m~--EDalVIyR~~RAPeRRvFyIDvGnl-pk~KAeqYl~~iM~kyrnk 116 (359) +.-+ .|-|+++...+..+... +.+.++++- -. +++.+|.... ...++...--+.++++|+- T Consensus 261 g~~lpd~lk~~~~~~G~Svlq~~yd~I~~~~~t~~~~a~Ll~k~---~~-~v~k~~~~~~l~~~~~l~~r~~~~~~~r~n 336 (765) T protein:vir:96 261 GPQPPDILKPTYIFGGIPLTQRIYERVYAAERTANEAPLLAMSK---RT-STIHVDVEKAIANEDAFNARLAFWIANRDN 336 (765) T ss_pred CCCchhhhccccCccCccHHHHHHHHHHHHHHHHHHHHHHHHHh---cc-ceeeechHhhhccHHHHHHHHHHHHHhcCC Confidence 3222 34455554443333222 334455542 22 3566665532 1111111111223333321 Q ss_pred EEeeCCCCcccccccchhhHhhhcccccCCCCccceeecCCCCCcchHHHH-HHHHHHHHHhcCCCccccCCC--Ccccc Q lcl|NC_015285. 117 MVYDANTGEIKDDKKFMSMMEDFWLPRREGGRGTEISTLPGGQNLGELEDV-KYFQKKLYKALNVPSSRLETE--TTFNI 193 (359) Q Consensus 117 lvYD~~TGevkdd~~~mSMlEDywLpRReGgrgTEIsTLpGgqnLgei~DV-~YF~kkLy~aL~VP~SRl~~~--~~~~~ 193 (359) .|- +-+ +++-+++++. .+|+-++|+ ..|...+=.+.+||+.||-.+ .|+|- T Consensus 337 ------~g~-------~~i-----------d~ee~~e~~s--~~lsgl~d~l~~~~~~iAaas~IP~t~LfGqsp~GlnA 390 (765) T protein:vir:96 337 ------HGV-------KVI-----------GIDETMEQFD--TNLSDFDSVIMNQYQLVAAIAKTPATKLLGTSPKGFNA 390 (765) T ss_pred ------cee-------EEe-----------cCCcceeEEe--cccCCHHHHHHHHHHHHHhhhCCCeeeeccCCcccccC Confidence 111 000 0112344443 356667774 678999999999999999554 57764 Q ss_pred cchhhhhHHhhhHHHHHHHHHHH-HHHHHHHHHHHHHHhcCCCChhHHHHHhhceeeeeeccchHHHHHHHHHHHHHHHH Q lcl|NC_015285. 194 GRAAEITRDEVKFQKFIARLRKR-FSELFTDLLKTQLILKGVMSLEEWEDMKNHIQFDFIADNYFTELKEIEIRNERMNQ 272 (359) Q Consensus 194 g~~~eItRDElKF~KFI~rLr~r-Fs~if~d~Lk~QLiLkgI~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~ 272 (359) +.-. |.-.|+.+|+.+|.. +..+...+++. |++-+.+. ..+.|.|..=..=+|...+||...+.++ T Consensus 391 TGe~----D~~nYyD~I~s~Qe~~l~p~le~L~~l-i~~s~~i~--------~d~~i~FnpL~~~sekEkAei~~k~Aea 457 (765) T protein:vir:96 391 TGEH----ETISYHEELESIQEHIFDPLLERHYLL-LAKSESID--------VQLEIVWNPVDSTTSQQQAELNNKKAAT 457 (765) T ss_pred cchH----HHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHhcCCC--------CcceEEeCCCCCCCHHHHHHHHHHHHHH Confidence 3222 333499999999965 45555555444 44545432 3577899887778888899999999999 Q ss_pred HHHhhhhcchhhhHHHHHHHHh--------CCCHHHHHHHHHHHHHHHhcCCCCCCcchhhhcCCCCCc----------c Q lcl|NC_015285. 273 VAAMDPYVGKYFSVDYMRRQVL--------KQTEIEIKEIDEQIASEMEAGIIADPMAEMDPAMAAGGE----------G 334 (359) Q Consensus 273 ~~~~dp~vGKy~S~~~i~k~IL--------~~tDeeI~e~~kqi~~E~~~~~~~~P~~~~~~~~~~~~~----------~ 334 (359) ++.+-.- -.+|.+-++..+- .+++++++.. ...+|++..+...+.... + T Consensus 458 ~~~~~~~--Gvis~dEvR~~L~~~~~~g~~~l~d~~~e~~-----------~~~~pe~~~~~~~~~~~~~~~~~e~~~~~ 524 (765) T protein:vir:96 458 DEIYINS--GVVSPDEVRERLRDDPRSGYNRLTDDQAETE-----------PGMSPENLAELEKAGAQSAKAKGEAERAE 524 (765) T ss_pred HHHHHhc--CCCCHHHHHHHHhccccCCCCCCCccccccc-----------cCCCccccccccCCCcccccccCcccccc Confidence 8876333 2567777776532 2455555321 111222111111000000 0 Q ss_pred cccccCCCCCcCCCCCC-CCC---ccCCC Q lcl|NC_015285. 335 APAAEVDPNAQESSVDP-GDV---RRGEF 359 (359) Q Consensus 335 ~~~~~~~~~~~~~~~~p-~~~---~~~~~ 359 (359) ++++...+..++++.-| +.. ..|.+ T Consensus 525 a~p~~~eg~~~~~~~~p~~~~p~~~~~~~ 553 (765) T protein:vir:96 525 AQAGAVEGAGDPVPAAPRGTKPLAKAAEE 553 (765) T ss_pred CCCCccCCCCcccccCCcccCCccccccc Confidence 11111111111121111 111 11111 No 22 >protein:vir:94049 Length: 532 # NCBI annotation: hypothetical protein # Family: family:all:297 # MgeID: mge:1493 # MgeName: OP2 # Cross-refs: genbank:acc:YP_453629;genbank:gi:84662665;genbank:GeneID:5142559 Probab=97.90 E-value=9.9e-06 Score=48.02 Aligned_cols=311 Identities=14% Similarity=0.156 Sum_probs=130.8 Q ss_pred CCCchhh--------HH---Hhhh-hhhheeecccccc-----c----cC----------CCceeecHhHhhhhh-cccc Q lcl|NC_015285. 1 MRGVDLN--------QQ---LTQK-AAEYFLYNPKGLK-----N----ST----------NQGMKITTDSVTYCH-SGIQ 48 (359) Q Consensus 1 ~~~~~~~--------~~---~~~~-~~e~f~yn~~~~~-----~----~~----------~~~v~i~~~ai~y~h-SGl~ 48 (359) +++.... .+ |-++ +..+-+++|.+.. . ++ ..+.+|++|=|+-.. .-+- T Consensus 163 v~~~~~~~~~~~p~~l~~~~I~~g~~~~l~vld~~~v~p~~~~~~dp~sp~fg~P~~y~v~~g~~iH~SRli~f~g~~~p 242 (532) T protein:vir:94 163 LKMDGDSVPADAPLLLSPSFVQRGCLIGFATIEPMWLSPNAYNATDPTLPSFYKPDSWIATSGKKIHSSRIHTVVGRPVG 242 (532) T ss_pred eccCCccccccccccccccccccceeeEEEeechheecccccccccccccccCCceeEEEccCeeeccceEEEecCCCch Confidence 2211110 10 1111 2222233333211 0 00 123456655433211 1110 Q ss_pred ----cCCCCcchhhHHHHHHHHHHHHHHHHH--HHHHHHhcCccceeEeccCCCCc-hHHHHHHHHH--HHHhhcceEEe Q lcl|NC_015285. 49 ----DLNKNMTLSHLHKAIKAVNQLRMIEDS--LVIYRLSRAPERRIFYIDVGNLP-KNKAEQYLRE--VMGRYRNKMVY 119 (359) Q Consensus 49 ----d~~~~~i~syL~~Aik~~NqL~m~EDa--lVIyR~~RAPeRRvFyIDvGnlp-k~KAeqYl~~--iM~kyrnklvY 119 (359) +..+..=.|.|+++...+.+......+ .++++.. =.|+.++..++- ....++..+. .++++|+ T Consensus 243 ~~~~~~~~~~G~Svlq~~~~~l~~~~~t~~~~~~l~~~~~----~~v~k~~~a~~ls~~~~~~~~~r~~~~~~~~~---- 314 (532) T protein:vir:94 243 DMLKAAYSFRGVSISQLAMPYVDNWLRTRQSVSDTVKQFS----MTNLATDMAQLLAPGGAQSLDARLQLFNLYRD---- 314 (532) T ss_pred hhhccccccccccHHHHHHHHHHHHHHHHHHHHHHHHhcC----CceeeechHHhhcchhHHHHHHHHHHHHhhcC---- Confidence 001111156677776666665444333 3344422 234444432221 1111221111 1222221 Q ss_pred eCCCCcccccccchhhHhhhcccccCCCCcc-ceeecCCCCCcchHHHH-HHHHHHHHHhcCCCccccCCC--Ccccccc Q lcl|NC_015285. 120 DANTGEIKDDKKFMSMMEDFWLPRREGGRGT-EISTLPGGQNLGELEDV-KYFQKKLYKALNVPSSRLETE--TTFNIGR 195 (359) Q Consensus 120 D~~TGevkdd~~~mSMlEDywLpRReGgrgT-EIsTLpGgqnLgei~DV-~YF~kkLy~aL~VP~SRl~~~--~~~~~g~ 195 (359) .+|-+- + .+++ +++++. .+|+-++|+ ..|...+-.+++||+.||-.+ +|||-.. T Consensus 315 --n~g~~~-----i-------------d~~~e~~e~~~--~~lsgl~~~l~~~~~~iAaa~~IP~t~LfG~sp~GlnstG 372 (532) T protein:vir:94 315 --NRNIGA-----L-------------DKGTEEIQQTN--TPLSGLDSLQAQSQEQMAAVSHIPLVKLLGITPNGLNASS 372 (532) T ss_pred --CccceE-----E-------------cCCCceeEEEe--cccCCHHHHHHHHHHHHHhHhCCCeeeeecCCcccccccc Confidence 111100 0 0111 233332 345555554 788999999999999998443 5675422 Q ss_pred hhhhhHHhhhHHHHHHHHHHH-HHHHHHHHHHHHHHhcCCCChhHHHHHhhceeeeeeccchHHHHHHHHHHHHHHHHHH Q lcl|NC_015285. 196 AAEITRDEVKFQKFIARLRKR-FSELFTDLLKTQLILKGVMSLEEWEDMKNHIQFDFIADNYFTELKEIEIRNERMNQVA 274 (359) Q Consensus 196 ~~eItRDElKF~KFI~rLr~r-Fs~if~d~Lk~QLiLkgI~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~~~ 274 (359) -. |.-.|+.||+++|.. +..++..+++.-+.......+ +.++|.|..=...++...+|+...+.++++ T Consensus 373 e~----D~~~yyd~I~s~Qe~~l~p~le~l~~~l~~s~~g~~~-------~d~~~~f~pL~~~s~kEkAei~~~~a~a~~ 441 (532) T protein:vir:94 373 DG----EIRVWYDFIAGYQATNLTPLMEWIIDLIQLSEYGQID-------PGLAWEWSPLMELDDKELAEVRQLNASTDS 441 (532) T ss_pred hH----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCC-------CCceEEeCCCCCCCHHHHHHHHHHHHHHHH Confidence 22 233399999999955 566666666433222222222 357788987667788888999999999988 Q ss_pred HhhhhcchhhhHHHHHHHHhCCCH-----------HHHHHHHHHHHHHHhcCCCCCCcchhhhcCCCCCcccccccCCCC Q lcl|NC_015285. 275 AMDPYVGKYFSVDYMRRQVLKQTE-----------IEIKEIDEQIASEMEAGIIADPMAEMDPAMAAGGEGAPAAEVDPN 343 (359) Q Consensus 275 ~~dp~vGKy~S~~~i~k~IL~~tD-----------eeI~e~~kqi~~E~~~~~~~~P~~~~~~~~~~~~~~~~~~~~~~~ 343 (359) .+-.- -.+|.+-++.. |++.. +++++...++.+.+... ...|.. .+..+-|+ +.+..+-... T Consensus 442 ~~~~~--Gvi~~~Evr~~-l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~--~~~~~~~~-~~~~~d~~~~ 514 (532) T protein:vir:94 442 TLMEL--GVIDAKMVQQR-LAADPTSGYAGALGERDELDDVEEIAKQLMAAA-LNPPAT--APQTPNPQ-PDSEDDQTDN 514 (532) T ss_pred HHHhc--CCCCHHHHHHH-HhcCCccccccccccccccccccchhhhhcccc-cCCCCC--CCCCCCCC-CCCCCCCCCC Confidence 77432 25677777764 44322 23333333222222211 111110 00000000 0111111111 Q ss_pred CcCCCCCCCCCc--cCCC Q lcl|NC_015285. 344 AQESSVDPGDVR--RGEF 359 (359) Q Consensus 344 ~~~~~~~p~~~~--~~~~ 359 (359) .+.++-.|+.++ -|.- T Consensus 515 ~~~~~~~~~~~~~~~~~~ 532 (532) T protein:vir:94 515 QPDAQADPAQNDQPVGNR 532 (532) T ss_pred ccCCCccccccCCCcCCC Confidence 111111121111 1111 No 23 >protein:vir:99563 Length: 862 # NCBI annotation: minor head protein-like protein # Family: family:all:297 # MgeID: mge:1544 # MgeName: BcepF1 # Cross-refs: genbank:acc:YP_001039808;genbank:gi:126011058;genbank:GeneID:4818258 Probab=97.87 E-value=2.1e-06 Score=51.71 Aligned_cols=308 Identities=12% Similarity=0.115 Sum_probs=125.8 Q ss_pred CCCchhhH--------H-Hhhh-hhhheeecccccc-----------ccCCCc----eeecHhHhhhhhcccccCCCCcc Q lcl|NC_015285. 1 MRGVDLNQ--------Q-LTQK-AAEYFLYNPKGLK-----------NSTNQG----MKITTDSVTYCHSGIQDLNKNMT 55 (359) Q Consensus 1 ~~~~~~~~--------~-~~~~-~~e~f~yn~~~~~-----------~~~~~~----v~i~~~ai~y~hSGl~d~~~~~i 55 (359) +.+..++. + |.++ +..+-+++|.+.. .+++.+ .+|.. -.+-||=|+...+.-+ T Consensus 215 lv~~~D~~~LsqPLn~e~I~kG~lkgl~vlDp~w~~p~~v~~~~~Dp~sp~yGkP~~y~I~g--~~IH~SRliif~g~~v 292 (862) T protein:vir:99 215 VVDSEDPDYYEKPFNPDGITPGSYRGISQIDPYWMMPMLTAESTADPSSQFFYEPEFWIISG--QKYHRSHLIIARGPQP 292 (862) T ss_pred EecCcCchhhhcCcCcccccccceeEEEEechhhhcccccccccccccccccCCceeeeecC--eeeccceeEEecCCCc Confidence 11111111 1 1111 2223334443321 011111 11111 1122343333333333 Q ss_pred ------------hhhHHHHHHHHHHHHHHHH-----HHHHHHHhcCccceeEeccCCC-CchHHHHHHHHHHHHhhcceE Q lcl|NC_015285. 56 ------------LSHLHKAIKAVNQLRMIED-----SLVIYRLSRAPERRIFYIDVGN-LPKNKAEQYLREVMGRYRNKM 117 (359) Q Consensus 56 ------------~syL~~Aik~~NqL~m~ED-----alVIyR~~RAPeRRvFyIDvGn-lpk~KAeqYl~~iM~kyrnkl 117 (359) +|-|+++. +.|+-.+. +.++++.. -+++.+|... |....+-..=.++++++|+- T Consensus 293 pd~lk~ay~f~G~SvLe~iy---d~L~~~d~t~~saa~Ll~ka~----l~v~ktd~l~~l~~ed~l~~r~~~~~~~rdN- 364 (862) T protein:vir:99 293 ADILKPTYIFGGIPLVQRIY---ERVYAAERTANEAPLLAMNKR----TTAIHTDTAKAIANEDKFIQRLMFWVRYRDN- 364 (862) T ss_pred hhhhhccCCccCccHHHHHH---HHHHHHHHHHHHHHHHHHHhc----cceeechhHhhhccHHHHHHHHHHHHhccCc- Confidence 45555443 33333322 33344322 2244444432 22111111111234444431 Q ss_pred EeeCCCCcccccccchhhHhhhcccccCCCCccceeecCCCCCcchHHH-HHHHHHHHHHhcCCCccccCCC--Cccccc Q lcl|NC_015285. 118 VYDANTGEIKDDKKFMSMMEDFWLPRREGGRGTEISTLPGGQNLGELED-VKYFQKKLYKALNVPSSRLETE--TTFNIG 194 (359) Q Consensus 118 vYD~~TGevkdd~~~mSMlEDywLpRReGgrgTEIsTLpGgqnLgei~D-V~YF~kkLy~aL~VP~SRl~~~--~~~~~g 194 (359) .|-+ .+ +++-+++++- .+|+-++| +..|...+=.+.+||+.||-.+ .|++-+ T Consensus 365 -----~Gi~-----li-------------D~eEe~e~ls--~slSGL~dll~~~~q~IAaas~IP~tiLfGqspaGlnAT 419 (862) T protein:vir:99 365 -----HAVK-----VL-------------GTDETMEQFD--TSLADFDAVIMGQYQLVASIAKTPATKLLGTAPKGFNST 419 (862) T ss_pred -----ceeE-----Ee-------------cCCCceeEEe--cccCChHHHHHHHHHHHHhhhCCCceeecccCcccccCc Confidence 1100 00 0111233332 34444555 5778889999999999998443 577653 Q ss_pred chhhhhHHhhhHHHHHHHHHHH-HHHHHHHHHHHHHHhcCCCChhHHHHHhhceeeeeeccchHHHHHHHHHHHHHHHHH Q lcl|NC_015285. 195 RAAEITRDEVKFQKFIARLRKR-FSELFTDLLKTQLILKGVMSLEEWEDMKNHIQFDFIADNYFTELKEIEIRNERMNQV 273 (359) Q Consensus 195 ~~~eItRDElKF~KFI~rLr~r-Fs~if~d~Lk~QLiLkgI~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~~ 273 (359) .-.+ .-.|+.+|+++|.. +..++..++ .|+......+ ..+.|.|..=..=+|...+|+.....+++ T Consensus 420 GE~D----~~nYyD~I~s~QE~~L~P~LerL~--~li~~~lg~~-------~d~~ieFnpL~~~sekEkAEi~kk~Aea~ 486 (862) T protein:vir:99 420 GEFE----TISYHEELESIQEHVYMPFLQRHY--LISRLSLGIQ-------HEIDVVMEPVASMTAQQQADLNKTKAEGG 486 (862) T ss_pred hHHH----HHHHHHHHHHHHHHHHHHHHHHHH--HHHHHhcCCC-------CcceEEeCCCCCCCHHHHHHHHHHHHHHH Confidence 3222 33399999999964 444444433 2333222222 34678888777778888899999998888 Q ss_pred HHhhhhcchhhhHHHHHHHHh--------CCCHHHHHHHHHHHHHHHhcC------CCCCCcchhhhcCC--CCCcc--- Q lcl|NC_015285. 274 AAMDPYVGKYFSVDYMRRQVL--------KQTEIEIKEIDEQIASEMEAG------IIADPMAEMDPAMA--AGGEG--- 334 (359) Q Consensus 274 ~~~dp~vGKy~S~~~i~k~IL--------~~tDeeI~e~~kqi~~E~~~~------~~~~P~~~~~~~~~--~~~~~--- 334 (359) +.+-.- -.+|.+-++.... .++|+++++......++..+. .-..|..+. .++. ..+++ T Consensus 487 ~~lv~s--GvispdEvR~~L~~~~~~g~~~l~ded~E~d~~~~~e~~~~~e~~g~a~~~ap~de~-~aga~~~~~e~d~~ 563 (862) T protein:vir:99 487 KVLIDG--GVISPDEERNRIRDDKRSGYNRLTKEDAEETPGASPENLAAYQKAGAAQETASAKET-QAGAAVTTAEGDQP 563 (862) T ss_pred HHHHhc--CCCCHHHHHHHHHhcCCcCCCCCCcccccccCCCCcccccccccCCccccccccccc-ccccCCccccCCcc Confidence 876432 2567777776521 245566653221111111110 000111000 0000 00000 Q ss_pred ----cc---cc-cCCCCCcCCCCCCCC-CccC-CC Q lcl|NC_015285. 335 ----AP---AA-EVDPNAQESSVDPGD-VRRG-EF 359 (359) Q Consensus 335 ----~~---~~-~~~~~~~~~~~~p~~-~~~~-~~ 359 (359) ++ +| ...+.....+++|++ +..+ .+ T Consensus 564 ~~p~~~~~~~g~~~~~t~~~~a~~p~~~~~~~~~~ 598 (862) T protein:vir:99 564 NVQMVPSMKPGQMVGPEVGITAPMPEDDAPVAGVV 598 (862) T ss_pred cccccCCCCCCCccccccccccCCCccccccCccc Confidence 00 00 001111222223331 1111 11 No 24 >protein:vir:105782 Length: 449 # NCBI annotation: gp5 # Family: family:all:6783 # MgeID: mge:1501 # MgeName: ES18 # Cross-refs: genbank:acc:YP_224143;genbank:gi:62362218;genbank:GeneID:3342535 Probab=97.68 E-value=2.9e-05 Score=45.45 Aligned_cols=264 Identities=15% Similarity=0.096 Sum_probs=126.1 Q ss_pred CCCc---hhhHHHhhhhhhheee------------cccc--------c------cccCCCceeecHhHhhhhhcccccCC Q lcl|NC_015285. 1 MRGV---DLNQQLTQKAAEYFLY------------NPKG--------L------KNSTNQGMKITTDSVTYCHSGIQDLN 51 (359) Q Consensus 1 ~~~~---~~~~~~~~~~~e~f~y------------n~~~--------~------~~~~~~~v~i~~~ai~y~hSGl~d~~ 51 (359) ++|. ..-+..-+++...-+| ||.. . .+++..+++|+.|= |+-.+ T Consensus 132 v~d~~~l~~Pl~~~~~i~~i~v~~~~~i~~~~~~~dp~sp~yg~P~~y~v~~~~~g~~~~~~~iH~SR-------l~~~~ 204 (449) T protein:vir:10 132 IRDEKDWNLPATKGRGLQKVSVSWAGSLKVAEWDTGINSKTYGQPKLWKYTERLPNGSSRRVDIHPDR-------VFILG 204 (449) T ss_pred ecCCCCCCcccccCcceeeEEeeccccCChhhhhcCCCCCCCCCceEEEEeeeccCCCccceeeccce-------eEeec Confidence 1110 0000001122222222 2211 1 01122335565444 33222 Q ss_pred CCc--chhhHHHHHHHHHHHHHHHHHHH------HHHHhcC----ccceeEeccCCCCchHHHH------HHHHHHH--- Q lcl|NC_015285. 52 KNM--TLSHLHKAIKAVNQLRMIEDSLV------IYRLSRA----PERRIFYIDVGNLPKNKAE------QYLREVM--- 110 (359) Q Consensus 52 ~~~--i~syL~~Aik~~NqL~m~EDalV------IyR~~RA----PeRRvFyIDvGnlpk~KAe------qYl~~iM--- 110 (359) ++- -.|+|+++ ||.|.-+|-+.. .-...|. -++ .+|+.+|...++. +-+++.+ T Consensus 205 ~~~~~g~~~L~~~---yn~l~~~~~~~~~~a~~~l~~~~rq~~~~~~~---~~~~~~l~~~~~~~~e~~~~~~~~~~~~~ 278 (449) T protein:vir:10 205 DYSEDAIGFLEPA---YNAFVSLEKVEGGSGESFLKNAARQLNVNFEK---EIDFTNLASLYGVSIDELQDKFNEVAGEI 278 (449) T ss_pred CCCCCChhHHHHH---HHHhhhHHHhhhhHHHHHHHHHHHHHhhhhhh---hhhhhhhhHHhhCCchHHHHHHHHHHHHH Confidence 211 24778876 455554444321 1111111 112 2455555443321 1122222 Q ss_pred HhhcceEEeeCCCCcccccccchhhHhhhcccccCCCCccceeecCCCCCcchHHH-HHHHHHHHHHhcCCCccccCC-- Q lcl|NC_015285. 111 GRYRNKMVYDANTGEIKDDKKFMSMMEDFWLPRREGGRGTEISTLPGGQNLGELED-VKYFQKKLYKALNVPSSRLET-- 187 (359) Q Consensus 111 ~kyrnklvYD~~TGevkdd~~~mSMlEDywLpRReGgrgTEIsTLpGgqnLgei~D-V~YF~kkLy~aL~VP~SRl~~-- 187 (359) ++-.+-+..|. -+|| +..+| +++-++| +..|...+=-+.+||+.||=. T Consensus 279 ~~~~~~~~i~~--------------~~d~----------~~~~~-----~~sgl~d~l~~~~q~iaaa~~IP~t~L~Gqs 329 (449) T protein:vir:10 279 NRGNDVLMTTQ--------------GATV----------TPLVT-----SVADPTATYNVNLQTAAAGVDIPTRILIGNQ 329 (449) T ss_pred hccchheeecC--------------Ccce----------EEEec-----ccCChhHHHHHHHHHHHHHhCCCeeeeeccC Confidence 22122111110 0122 22333 3444555 567888899999999999933 Q ss_pred CCcccccchhhhhHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCChhHHHHHhhceeeeeeccchHHHHHHHHHHH Q lcl|NC_015285. 188 ETTFNIGRAAEITRDEVKFQKFIARLRKRFSELFTDLLKTQLILKGVMSLEEWEDMKNHIQFDFIADNYFTELKEIEIRN 267 (359) Q Consensus 188 ~~~~~~g~~~eItRDElKF~KFI~rLr~rFs~if~d~Lk~QLiLkgI~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Eil~ 267 (359) -+|+|- ++ |.-.|+..|...|.++......++.. |+.-++..+. ..+.|.|..=..=+|...+||.. T Consensus 330 p~glns---t~---D~~nyyd~i~~~Q~~l~p~le~l~~~-l~~s~~g~~~------~d~~i~f~pL~~~t~kEkAei~k 396 (449) T protein:vir:10 330 QAERSS---TE---DQKYFNARCQSRRVDLSFEIEDFCDK-LIELKIIDAV------AKKAVIWDDLNEQTGTEKLTNAK 396 (449) T ss_pred cccccc---ch---hHHHHHHHHHHHHHhhhHHHHHHHHH-HHHhhcCCCC------CceeEEeCCCCCCCHHHHHHHHH Confidence 367873 22 44459999999999998888887764 6677776553 35779999999999999999999 Q ss_pred HHHHHHHHhhhhcc-hhhhHHHHHHHHhCCCHHHHHHHHHHHHHHHhcCCCCCCcchhhhcCCCCCcccccccCCCCCcC Q lcl|NC_015285. 268 ERMNQVAAMDPYVG-KYFSVDYMRRQVLKQTEIEIKEIDEQIASEMEAGIIADPMAEMDPAMAAGGEGAPAAEVDPNAQE 346 (359) Q Consensus 268 ~Rl~~~~~~dp~vG-Ky~S~~~i~k~IL~~tDeeI~e~~kqi~~E~~~~~~~~P~~~~~~~~~~~~~~~~~~~~~~~~~~ 346 (359) ...++++.+-..++ ..|+ .+|+.+.. ..+-. +-.+-|++..+++ +++.-+ T Consensus 397 ~~A~a~~~~~~ag~~~~~~------------~~EiR~~~---~~~~~-~~~~~~~e~~de~-------------~~~~d~ 447 (449) T protein:vir:10 397 TMGEINQTMLGSGDNPAFS------------REEIRTAA---GYDND-DEEPLGEEDGDEE-------------DKATDS 447 (449) T ss_pred HHHHHHHHHHHccccCCcC------------HHHHHHHh---cccCC-CCCCCCCCCCccc-------------cccCCc Confidence 99999888766542 2334 44443222 11100 0011111111111 111111 Q ss_pred CC Q lcl|NC_015285. 347 SS 348 (359) Q Consensus 347 ~~ 348 (359) ++ T Consensus 448 ~a 449 (449) T protein:vir:10 448 AA 449 (449) T ss_pred CC Confidence 11 No 25 >protein:vir:7768 Length: 484 # NCBI annotation: gp14 # Family: family:all:524 # MgeID: mge:149 # MgeName: Bxz2 # Cross-refs: genbank:acc:NP_817602;genbank:gi:29566032;genbank:GeneID:1259226 Probab=97.44 E-value=6.7e-05 Score=43.47 Aligned_cols=307 Identities=13% Similarity=0.136 Sum_probs=136.1 Q ss_pred CCCchhhHHHh-----------hhhhhheeecccccc--ccCCCceeecHh---------HhhhhhcccccCCCCcchhh Q lcl|NC_015285. 1 MRGVDLNQQLT-----------QKAAEYFLYNPKGLK--NSTNQGMKITTD---------SVTYCHSGIQDLNKNMTLSH 58 (359) Q Consensus 1 ~~~~~~~~~~~-----------~~~~e~f~yn~~~~~--~~~~~~v~i~~~---------ai~y~hSGl~d~~~~~i~sy 58 (359) +-|..++ +++ +.+..+.+|.+...+ ....++.+...+ .|.|++.--..+- .-.|- T Consensus 152 ~~D~~~~-~~~~a~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~f~N~~~~~~~--~G~s~ 228 (484) T protein:vir:77 152 QIDPRTR-QVMRAIRAIEDEEGNEVIGATLYLPNNTVIWNREDGQWVQVANVAHNLEMVPVIPIPNRTRLSDL--YGTTE 228 (484) T ss_pred EecCCCC-ceEEEEEEEEeecCCcEEEEEEEecCeEEEEEecCCceEeeccccCCCCCcceEEeccccccCcc--CCccc Confidence 1111111 111 112233344443221 111111111110 1334432111100 11222 Q ss_pred HHHHHHHH-HHH-HHHHHHHHHHHHhcCccceeEeccCCCCchHHHHHHHHHHHHhhcceEEeeCCCCcccccccchhhH Q lcl|NC_015285. 59 LHKAIKAV-NQL-RMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKMVYDANTGEIKDDKKFMSMM 136 (359) Q Consensus 59 L~~Aik~~-NqL-~m~EDalVIyR~~RAPeRRvFyIDvGnlpk~KAeqYl~~iM~kyrnklvYD~~TGevkdd~~~mSMl 136 (359) +.+.++++ ..+ +.+-+.+++-+.+-.|.|-|.-.+....+.. ..+|...-+ ... T Consensus 229 i~~~v~~L~Da~~~~~s~~~~~~~~~a~p~~~i~G~~~~~~~~~--------------------~~~~~~~~~----~~~ 284 (484) T protein:vir:77 229 ITPELRSVTDAAARTLMLMQATAELMGVPQRLLFGVKGEELGVD--------------------PETGQTLFD----AYL 284 (484) T ss_pred chHHHHHHHHHHHHHHHHHHHHHHhhhhhHHHHhCCCcchhccc--------------------ccccchhhh----hhh Confidence 33322222 221 3344555665666666665543333222211 112211100 011 Q ss_pred hhhcccccCCCCccceeecCCCCCcchHHHHHHHHHHHHHhcCCCccccCCCCcccccchhhhhHHhhhHHHHHHHHHHH Q lcl|NC_015285. 137 EDFWLPRREGGRGTEISTLPGGQNLGELEDVKYFQKKLYKALNVPSSRLETETTFNIGRAAEITRDEVKFQKFIARLRKR 216 (359) Q Consensus 137 EDywLpRReGgrgTEIsTLpGgqnLgei~DV~YF~kkLy~aL~VP~SRl~~~~~~~~g~~~eItRDElKF~KFI~rLr~r 216 (359) -.+|..-. -++.+.+++..+-=+-++-++-.-.++....++|.+-|...+. |...+..|.--+..+-.-+.+.|.. T Consensus 285 ~~~~~~~~---~~~~~~q~~~~~~e~~~~~l~~~i~~~s~~~~~p~~~fg~~~~-n~~Sg~Al~~~~~~l~~ka~~k~~~ 360 (484) T protein:vir:77 285 ARILAFED---HESKAQQFSAAELRNFVDALDALDRKAAAYTGLPPYYLSFSSE-NPASAEAIRSSESRLVKTVERKNKI 360 (484) T ss_pred hhhcccCC---CCceeEeecCCChHHHHHHHHHHHHHHhcccCCCHHHhccccC-cchHHHHHHHHHHHHHHHHHHHHHH Confidence 13454322 2355667776541123344555556666677888888854322 3233445666666677788999999 Q ss_pred HHHHHHHHHHHHHHhcCCC-ChhHHHHHhhceeeeeeccchHHHHHHHHHHHHHHHHHHHhhhhcchhhhHHHHHHHHhC Q lcl|NC_015285. 217 FSELFTDLLKTQLILKGVM-SLEEWEDMKNHIQFDFIADNYFTELKEIEIRNERMNQVAAMDPYVGKYFSVDYMRRQVLK 295 (359) Q Consensus 217 Fs~if~d~Lk~QLiLkgI~-t~eew~~~~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~~~~~dp~vGKy~S~~~i~k~IL~ 295 (359) |..-+.+.++.-+-+.|.. ...+| ..|.+.|..-..-+. .+.++.+.++..-.-..+|.++++.. |+ T Consensus 361 f~~~l~~~~~l~~~~~~~~~~~~~~----~~i~v~w~~~~~~s~-------~~~ad~~~kl~~~g~gi~s~et~~~~-l~ 428 (484) T protein:vir:77 361 FGGAWEQAMRVAYKVMNGGDIPPEY----YRMESIWRDPSTPTY-------AAKADAATKLYNNGQGVIPKERARID-MG 428 (484) T ss_pred HHHHHHHHHHHHHHHhCCCCccccc----ccceEEecCCCCCCH-------HHHHHHHHHHHhccCCCCCHHHHHhc-CC Confidence 9999999998777666542 12222 357788854433332 34455555553332235688888755 89 Q ss_pred CCHHHHHHHHHHHHHHHhcCC-CCCCcchhhhcCCCCCcccccccCCCCCcCCCCCCCCCccC Q lcl|NC_015285. 296 QTEIEIKEIDEQIASEMEAGI-IADPMAEMDPAMAAGGEGAPAAEVDPNAQESSVDPGDVRRG 357 (359) Q Consensus 296 ~tDeeI~e~~kqi~~E~~~~~-~~~P~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~ 357 (359) +++++++++++..++|..... ..++....+..++..++ .++..+...+|+..+.| T Consensus 429 ~~~~~~~e~~~~~~ee~~~~~~~~~~~~~~~~~~~~~~~-------~~~~~~~~~~~~~~~~~ 484 (484) T protein:vir:77 429 YSITEREEMRKWDEEEQAQGLGLMGTMFGTDPSGGGNPD-------NPETPEPQPNPAEEAAA 484 (484) T ss_pred CChhHHHHHHHHHHHHHHHHHHHHhhhccccccCCCCCC-------CCCcccccCCCccccCC Confidence 999999987655444432210 01111111111111111 11222233444444445 No 26 >protein:vir:104338 Length: 422 # NCBI annotation: putative portal protein # Family: family:all:297 # MgeID: mge:1593 # MgeName: RTP # Cross-refs: genbank:acc:YP_398967;genbank:gi:81343951;genbank:GeneID:3778870 Probab=97.31 E-value=9.8e-05 Score=42.55 Aligned_cols=275 Identities=12% Similarity=0.100 Sum_probs=111.5 Q ss_pred CCCch---hhHHHhhhhhhheeeccccc--------------------cc---cCCCceeecHhH-hhhhhccccc---- Q lcl|NC_015285. 1 MRGVD---LNQQLTQKAAEYFLYNPKGL--------------------KN---STNQGMKITTDS-VTYCHSGIQD---- 49 (359) Q Consensus 1 ~~~~~---~~~~~~~~~~e~f~yn~~~~--------------------~~---~~~~~v~i~~~a-i~y~hSGl~d---- 49 (359) .+|+. +-+..-+.+..+-++++.+. .. .++.+++|++|= |++..+-+-+ T Consensus 101 v~d~~~~~~Pl~~~g~~~~l~v~d~~~i~~~~~~~dp~s~~fg~P~~y~v~~~~~~~~~~iH~SRli~~~g~~~p~~~~~ 180 (422) T protein:vir:10 101 VKDNRALTSPVREGAELETVRVYDRTQVKVQTREENPRNARFGEPLTYRITTNESDMFYDVHYSRIHIIDGERIPNVMRR 180 (422) T ss_pred ecCCCCccccccccCceeeEEeeccccccchhcccCccccccCcceEEEEecCCCCcceeeccceeEEeCCCCchhhhcc Confidence 22211 00011112222222222111 01 122336676553 3332221111 Q ss_pred CCCCcchhhHHHHHHHHHHHHHHHH-----HHHHHHHhcCccceeEeccC-CCCc--hHHHHHHHH--HHHHhhcceEEe Q lcl|NC_015285. 50 LNKNMTLSHLHKAIKAVNQLRMIED-----SLVIYRLSRAPERRIFYIDV-GNLP--KNKAEQYLR--EVMGRYRNKMVY 119 (359) Q Consensus 50 ~~~~~i~syL~~Aik~~NqL~m~ED-----alVIyR~~RAPeRRvFyIDv-Gnlp--k~KAeqYl~--~iM~kyrnklvY 119 (359) .....=.|-|.++ .++.|+-+|- +.++++-. =+|+.++. .++- ..+..+-+. ..+.+.|+. T Consensus 181 ~~~~~G~S~l~~~--~~~~i~~~~~~~~~~~~l~~~~~----~~v~~~~~l~~~~~~~~~~~~~~~r~~~~~~~~~~--- 251 (422) T protein:vir:10 181 QNDGWGRSVLSSD--ILDSIKDYTNCERLATQLLKRKQ----QAVWKAKGLAELCDDSEGFGAARLRLAQVDNNSGV--- 251 (422) T ss_pred cCCcccchhHHHH--HHHHHHHHHHHHHHHHHHHHHhc----cccccchhHHHhcCCccchHHHHHHHHHHHHhcCC--- Confidence 1111113445442 1233332222 33455532 23455552 2210 011111111 112222221 Q ss_pred eCCCCcccccccchhhHhhhcccccCCCCccceeecCCCCCcchHHH-HHHHHHHHHHhcCCCccccCCC--Ccccccch Q lcl|NC_015285. 120 DANTGEIKDDKKFMSMMEDFWLPRREGGRGTEISTLPGGQNLGELED-VKYFQKKLYKALNVPSSRLETE--TTFNIGRA 196 (359) Q Consensus 120 D~~TGevkdd~~~mSMlEDywLpRReGgrgTEIsTLpGgqnLgei~D-V~YF~kkLy~aL~VP~SRl~~~--~~~~~g~~ 196 (359) +|-+. +. +.+-+++++. .+|+-++| +..|...+-.+.+||+.||-.+ +|||- -+ T Consensus 252 ---~~~~~-------l~----------~~~e~~e~~~--~~lsgl~~~~~~~~~~iaaa~~IP~t~L~G~s~~Glna-tg 308 (422) T protein:vir:10 252 ---GQAIG-------ID----------AESEEYSVLN--SDIGGIDAFLDKKFDRIVALSGIHEIILKNKNVGGVSS-SQ 308 (422) T ss_pred ---cccee-------Ee----------cCCcceEEEe--cccCChHHHHHHHHHHHHhhhCCCeeeeccCCcccccc-cc Confidence 11111 00 0111233331 23444455 4789999999999999999444 46652 12 Q ss_pred hhhhHHhhhHHHHHHHHHHH-HHHHHHHHHHHHHHhcCCCChhHHHHHhhceeeeeeccchHHHHHHHHHHHHHHHHHHH Q lcl|NC_015285. 197 AEITRDEVKFQKFIARLRKR-FSELFTDLLKTQLILKGVMSLEEWEDMKNHIQFDFIADNYFTELKEIEIRNERMNQVAA 275 (359) Q Consensus 197 ~eItRDElKF~KFI~rLr~r-Fs~if~d~Lk~QLiLkgI~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~~~~ 275 (359) .+ |--.|+.+|+.+|.. +..+...+++. |. -. +.++|.|..-..=+|...+|+...+.++++. T Consensus 309 d~---d~~~yyd~i~~~Qe~~l~p~l~~l~~~--i~----~s-------~~~~~~f~pL~~~sekekaei~~~~a~a~~~ 372 (422) T protein:vir:10 309 NT---ALETFHKLVDRKRNAELLPILEFLIPF--IV----NA-------EEWSVEFNPLAQESSKDKAEILEKNVNSIAA 372 (422) T ss_pred hH---HHHHHHHHHHHHHHHHHHHHHHHHHHH--hc----cc-------CCcEEEeCCCCCCCHHHHHHHHHHHHHHHHH Confidence 12 222499999999964 45554444432 22 12 3566889888888898899999988888877 Q ss_pred hhhhcchhhhHHHHHHHHhCCCHHHHHHHHHHHHHHHhcCCCC--CCcchhhhcCCCCCcccccccCCCCCcCCCCCCCC Q lcl|NC_015285. 276 MDPYVGKYFSVDYMRRQVLKQTEIEIKEIDEQIASEMEAGIIA--DPMAEMDPAMAAGGEGAPAAEVDPNAQESSVDPGD 353 (359) Q Consensus 276 ~dp~vGKy~S~~~i~k~IL~~tDeeI~e~~kqi~~E~~~~~~~--~P~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~ 353 (359) +-.- -.++.+-+++. -.+.-. ..+++. .|++..+...+.+++ ..+|+| T Consensus 373 ~~~~--g~i~~~e~r~~------------L~~~~~--~~~~~~~~~~~~~~~~~~~~~~~--------------~~~~~d 422 (422) T protein:vir:10 373 LIAA--GAMDIDEARDT------------LRTIAP--EVKINDGSVETEVTISETSNDPL--------------EVPTDD 422 (422) T ss_pred HHhc--CCCCHHHHHHH------------hhhhcc--cccCCCCCCccccchhhcCCCCC--------------CCCCCC Confidence 6332 13344444432 111100 011110 011111111111111 112222 No 27 >protein:vir:1587 Length: 508 # NCBI annotation: minor capsid protein # Family: family:all:898 # MgeID: mge:32 # MgeName: phig1e # Cross-refs: genbank:acc:NP_695169;swissprot:trembl:o03928;genbank:gi:23455800;interpro:IPR006432;uniprot:O03928;genbank:GeneID:955566 Probab=97.10 E-value=0.00017 Score=41.23 Aligned_cols=288 Identities=11% Similarity=0.102 Sum_probs=137.3 Q ss_pred CC-Cchhh-----HHHh------hhhhhheeeccccccccCCCceeecHhH----------hhhhhcc-----cc----- Q lcl|NC_015285. 1 MR-GVDLN-----QQLT------QKAAEYFLYNPKGLKNSTNQGMKITTDS----------VTYCHSG-----IQ----- 48 (359) Q Consensus 1 ~~-~~~~~-----~~~~------~~~~e~f~yn~~~~~~~~~~~v~i~~~a----------i~y~hSG-----l~----- 48 (359) .+ ++..+ .+.. ++.-+|.+|...+. ...|..++-+. +++.+.. .+ T Consensus 174 ~~~~~~~~~~yt~lE~h~~~~~~~~~I~n~ly~~~~~---~~lG~~v~l~~~~e~~~l~~~~~~~g~~~p~f~y~~~~~~ 250 (508) T protein:vir:15 174 QRTESNQTKYYTLLEFHQWQDNGSYQITNELYKSDSP---DIVGNQVPLSTLPVYKELAPQVTISGLQRPLFAYFKTPGA 250 (508) T ss_pred EeecCCCceEEEEEEEEEEecCcceEEEEEEEecCCc---hhcCcccchhhcccccCCCcceEecCCCcceeEEecCCcc Confidence 11 11111 1111 23335556643211 11122222111 1111110 00 Q ss_pred ---cCCCCcchhhHHHHHHHHHHHHHHHHHHHHHHHhcCccceeEeccCCCCchHHHHHHHHHHHHhhcceEEeeCCCCc Q lcl|NC_015285. 49 ---DLNKNMTLSHLHKAIKAVNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKMVYDANTGE 125 (359) Q Consensus 49 ---d~~~~~i~syL~~Aik~~NqL~m~EDalVIyR~~RAPeRRvFyIDvGnlpk~KAeqYl~~iM~kyrnklvYD~~TGe 125 (359) +..+..=+|-++.|+-.+..|-..-+. +.|.+|.=.+|||.-+.- + -+|..+|. T Consensus 251 N~~~~~splG~S~~~~~~~lid~lD~~~s~--~~~e~~~~~~~i~v~~~~--------------l-------~~d~~~~~ 307 (508) T protein:vir:15 251 NNINIESPLGLGVVDNAKHVLDDINDTHDQ--FIWEIRLGQKHIAVQPGM--------------L-------RFDDEHKP 307 (508) T ss_pred ccccCCCCcCCchHhhhHHHHHHHHHHHHH--HHHHHHhcccceeechHH--------------h-------cCCCCCcc Confidence 001122356777777776666655554 558888888888863210 0 02333333 Q ss_pred cc--ccccchhhHhhhcccccCCCCccceeecCCCCCcc-hHHHHHHHHHHHHHhcCCCccccCCCCcccccchhhhhHH Q lcl|NC_015285. 126 IK--DDKKFMSMMEDFWLPRREGGRGTEISTLPGGQNLG-ELEDVKYFQKKLYKALNVPSSRLETETTFNIGRAAEITRD 202 (359) Q Consensus 126 vk--dd~~~mSMlEDywLpRReGgrgTEIsTLpGgqnLg-ei~DV~YF~kkLy~aL~VP~SRl~~~~~~~~g~~~eItRD 202 (359) +- +++.|..|-.| ...|.-|+.+...--.+ -.+-+..+.+.+....+++-+-|+.++. ....++||... T Consensus 308 ~~~~~~~~~~~~~~~-------~~~~~~i~~~~~~ir~e~~~~~~~~~l~~~~~~~gls~~~f~~~~~-~~~TAtei~s~ 379 (508) T protein:vir:15 308 TFDTEQNVYVGVLSD-------DNNGLGVKDMTTPIRTVQYKDAIDHFIKEFEVQIGLSTGTFSYSND-GVKTATEVVSN 379 (508) T ss_pred ccCCCCeeEEeccCC-------CCCCCceeEeecccChHHHHHHHHHHHHHHHHHhCCCchhcccccC-ccccHHHHHHH Confidence 21 23333332111 12223365555432222 2455777888899999999888876543 33457777666 Q ss_pred hhhHHHHHHHHHHHHHHHHHHHHHHHHHh-------cCCCChh--HHHHHhhceeeeeeccchHHHHHHHHHHHHHHHHH Q lcl|NC_015285. 203 EVKFQKFIARLRKRFSELFTDLLKTQLIL-------KGVMSLE--EWEDMKNHIQFDFIADNYFTELKEIEIRNERMNQV 273 (359) Q Consensus 203 ElKF~KFI~rLr~rFs~if~d~Lk~QLiL-------kgI~t~e--ew~~~~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~~ 273 (359) +-.-..-+.+.|+.|...+.++++.=|-+ ++..+.. .+...-..+.++|...- .+-+++++- .+ T Consensus 380 ~~~~~~t~~~~~~~~~~al~~lv~~il~l~~~~~~~~~g~~~~~~~~~~~~~~v~v~f~D~i--~~d~~~~~~-----~~ 452 (508) T protein:vir:15 380 NSMTYQTRSSYLTMVEKAIDELCQSIFELANAGALFDDGKPLFTLDSASQPLDIECHFDDGV--FVNKDKQLE-----ED 452 (508) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccccccCCcceEEEeCCCC--CCCHHHHHH-----HH Confidence 66666666677777777777666654322 2221111 11111234666665332 222333222 22 Q ss_pred HHhhhhcchhhhHHHHHHHHhCCCHHHHHHHHHHHHHHHhcCCCCCCcchhhhcCCCCCccccccc Q lcl|NC_015285. 274 AAMDPYVGKYFSVDYMRRQVLKQTEIEIKEIDEQIASEMEAGIIADPMAEMDPAMAAGGEGAPAAE 339 (359) Q Consensus 274 ~~~dp~vGKy~S~~~i~k~IL~~tDeeI~e~~kqi~~E~~~~~~~~P~~~~~~~~~~~~~~~~~~~ 339 (359) .++.. .| .+|.+..+++..+.||+|.+++-++|++|..... |.. .. .++.+.+-|| T Consensus 453 ~~~v~-aG-i~s~e~~i~~~~g~~deea~~el~ri~~E~~~~~---~~~---~~--~~~~~g~~ge 508 (508) T protein:vir:15 453 AKVLA-IG-ALSKQTFLQRNYGMTDEQAAEELAKIQSEAPTDT---FEG---GR--SAILNGGDGE 508 (508) T ss_pred HHHHh-cC-CCCHHHHHHhcCCCChHHHHHHHHHHHHhccccC---ccc---cc--cccCCCCCCC Confidence 22211 13 4688887777789999999999999999954421 111 11 1111122222 No 28 >protein:vir:2427 Length: 485 # NCBI annotation: gp14 # Family: family:all:524 # MgeID: mge:52 # MgeName: D29 # Cross-refs: genbank:acc:NP_046829;genbank:gi:9630397;genbank:GeneID:1261620 Probab=97.05 E-value=0.00019 Score=40.94 Aligned_cols=307 Identities=12% Similarity=0.103 Sum_probs=132.5 Q ss_pred CCCchhhHHHh-----------hhhhhheeeccccccc--cCCCceeecHh-H--------hhhhhcccccCCCCcchhh Q lcl|NC_015285. 1 MRGVDLNQQLT-----------QKAAEYFLYNPKGLKN--STNQGMKITTD-S--------VTYCHSGIQDLNKNMTLSH 58 (359) Q Consensus 1 ~~~~~~~~~~~-----------~~~~e~f~yn~~~~~~--~~~~~v~i~~~-a--------i~y~hSGl~d~~~~~i~sy 58 (359) +-|+..+. ++ +.+..+-+|.+...+. ..+.+...... . |.|++..-...- .-.|- T Consensus 153 i~D~~~~~-~~~~~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~h~~g~vPvv~f~n~~~~~~~--~G~s~ 229 (485) T protein:vir:24 153 EIDPRIGR-PAKAIRVAYDAEGNEIQAATLYTPNETFGWFRAEGEWVEWFSDPHGLGAVPVVPLPNRTRLSDL--YGTSE 229 (485) T ss_pred EeeCCcCc-eeEEEEEEEeecCCeEEEEEEEcCCcEEEEEecCCceEeecccccCCCcccEEEeccCcccCCc--CCccc Confidence 11111110 00 0111122333332110 11111111100 0 223332111100 11122 Q ss_pred HHHHHHHH-HH-HHHHHHHHHHHHHhcCccceeEeccCCCCchHHHHHHHHHHHHhhcceEEeeCCCCcccccccchhhH Q lcl|NC_015285. 59 LHKAIKAV-NQ-LRMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKMVYDANTGEIKDDKKFMSMM 136 (359) Q Consensus 59 L~~Aik~~-Nq-L~m~EDalVIyR~~RAPeRRvFyIDvGnlpk~KAeqYl~~iM~kyrnklvYD~~TGevkdd~~~mSMl 136 (359) |.+.++++ .. -+++-|..++-..+-.|.|-+.=.+....+...- +...+.+...| T Consensus 230 i~~~v~~liDa~~~~~s~~~~~~~~~a~p~~~i~G~~~~~~~~~~~-----------~~~~~~~~~~~------------ 286 (485) T protein:vir:24 230 ITPELRSMTDAAARILMLMQATAELMGVPQRLIFGIKPEEIGVDPE-----------TGQTLFDAYLA------------ 286 (485) T ss_pred chhhHHHHHHHHHHHHHHHHHHHHhhcchhhhhccCCccccccccc-----------cccchhhhccc------------ Confidence 22222222 11 2345566666677767766554222111110000 00111111122 Q ss_pred hhhcccccCCCCccceeecCCCCCcc-hHHHHHHHHHHHHHhcCCCccccCCCCcccccchhhhhHHhhhHHHHHHHHHH Q lcl|NC_015285. 137 EDFWLPRREGGRGTEISTLPGGQNLG-ELEDVKYFQKKLYKALNVPSSRLETETTFNIGRAAEITRDEVKFQKFIARLRK 215 (359) Q Consensus 137 EDywLpRReGgrgTEIsTLpGgqnLg-ei~DV~YF~kkLy~aL~VP~SRl~~~~~~~~g~~~eItRDElKF~KFI~rLr~ 215 (359) ..|+.--+ +.++..++... +. -++-++-.-..+...-++|..-|...+. |--.+..|.--+...-.-+.+.|. T Consensus 287 -~i~~~~~~---~~~~~q~~~~~-~e~~~~~l~~~i~~~s~~~~~p~~~fg~~~~-n~~Sg~Al~~~~~~l~~ka~~~~~ 360 (485) T protein:vir:24 287 -RILAFEDA---EGKIQQFSAAE-LANFTNALDQIAKQVAAYTGLPPQYLSTAAD-NPASAEAIRAAESRLIKKVERKNA 360 (485) T ss_pred -ceeccCCC---CceEEeecccc-hHHHHHHHHHHHHHHhcccCCCHHHhccccC-cchHHHHHHHHHHHHHHHHHHHHH Confidence 23443222 23355555432 22 1222333333444445777777753321 111334456566667888899999 Q ss_pred HHHHHHHHHHHHHHHhcCCC-ChhHHHHHhhceeeeeeccchHHHHHHHHHHHHHHHHHHHhhhhcchhhhHHHHHHHHh Q lcl|NC_015285. 216 RFSELFTDLLKTQLILKGVM-SLEEWEDMKNHIQFDFIADNYFTELKEIEIRNERMNQVAAMDPYVGKYFSVDYMRRQVL 294 (359) Q Consensus 216 rFs~if~d~Lk~QLiLkgI~-t~eew~~~~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~~~~~dp~vGKy~S~~~i~k~IL 294 (359) .|..-+...++.-+.+.+.. ...+| ..|++.|.....=+. .+.++.+.++..-.-..+|.+++++. | T Consensus 361 ~f~~~l~~~~~l~~~~~~~~~~~~d~----~~i~v~f~~~~~~s~-------~~~ad~~~kl~~~g~~~~s~et~~~~-l 428 (485) T protein:vir:24 361 IFGGAWEEAMRLAYRLMKGGDVPPDM----LRMETVWRDPSTPTY-------AAKADAATKLYGNGQGVIPRERARKD-M 428 (485) T ss_pred HHHHHHHHHHHHHHHHhcCCCCcccc----ceeeEEecCCCCCCH-------HHHHHHHHHHHhcccccCCHHHHHhh-C Confidence 99999999988765554422 22232 467888865443332 34455555554332246799999854 9 Q ss_pred CCCHHHHHHHHHHHHHHHhcCCCCCCcchhhhcCCCCCcccccccCCCCCcCCCCCCCCC Q lcl|NC_015285. 295 KQTEIEIKEIDEQIASEMEAGIIADPMAEMDPAMAAGGEGAPAAEVDPNAQESSVDPGDV 354 (359) Q Consensus 295 ~~tDeeI~e~~kqi~~E~~~~~~~~P~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~ 354 (359) ++++++++++++..++|...+. ...+ +-.+.+..+++++.+.-++...++++++.++ T Consensus 429 ~~~~d~~~e~~~~~ee~~~~~~--~~~~-~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~a 485 (485) T protein:vir:24 429 GYSIAEREEMRRWDEEEAAMGL--GLLG-TMVDADPTVPGSPNPTPAPKPQPAIEGGDSA 485 (485) T ss_pred CCCHhHHHHHHHHHHHHhhhhh--hHHH-hhcccCCCCCCCCCCCCCCCCccCCCCCCCC Confidence 9999999987765555533221 0111 1112222222333333345555555544444 No 29 >protein:vir:79538 Length: 502 # NCBI annotation: putative portal protein # Family: family:all:47 # MgeID: mge:1871 # MgeName: cdtI # Cross-refs: genbank:acc:YP_001272517;genbank:gi:148609386;genbank:GeneID:5204374 Probab=96.91 E-value=0.00026 Score=40.20 Aligned_cols=310 Identities=13% Similarity=0.108 Sum_probs=153.3 Q ss_pred CC-CchhhHHHhhhh--------hhheeeccccccccCCCceeecHhHhhhhhcccccCCCCcchhhHHHHHHHHHHHHH Q lcl|NC_015285. 1 MR-GVDLNQQLTQKA--------AEYFLYNPKGLKNSTNQGMKITTDSVTYCHSGIQDLNKNMTLSHLHKAIKAVNQLRM 71 (359) Q Consensus 1 ~~-~~~~~~~~~~~~--------~e~f~yn~~~~~~~~~~~v~i~~~ai~y~hSGl~d~~~~~i~syL~~Aik~~NqL~m 71 (359) +. ...++..|+.|| .-|+++....-......-++||.+-|..++...-. .--.=+|.|..+++.+.+|.- T Consensus 176 l~~~~~~~~~i~~GVe~d~~Gr~~aY~i~~~hPgd~~~~~~~rvpA~~vlH~f~~~r~-gQ~RGis~lapvl~~l~~l~~ 254 (502) T protein:vir:79 176 IPMTSDESNRLNQGVFVDDWGRPEKYLVYKSRPVSGRQMETKEVDAERMLHLKFVRRL-HQMRGTSLLSGVLIRLSALKE 254 (502) T ss_pred cCCCCCCCCeeEeeeEECCCCceEEEEEeecCCCCCcccceeEechhheEEeecccCC-ccccCCchHHHHHHHHHHHhH Confidence 10 011233333333 45666643222233345589999988877766533 222247999999999999999 Q ss_pred HHHHHHHHHHhcCccceeEeccCCCCchHHHHHHHHHHHHhhcceEEeeCCCCcccccccchhhHhhhcccccCCCCccc Q lcl|NC_015285. 72 IEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKMVYDANTGEIKDDKKFMSMMEDFWLPRREGGRGTE 151 (359) Q Consensus 72 ~EDalVIyR~~RAPeRRvFyIDvGnlpk~KAeqYl~~iM~kyrnklvYD~~TGevkdd~~~mSMlEDywLpRReGgrgTE 151 (359) .+||..+-...-|-.=-+..-+.|.-....+ .+.. ++....+|-.-==+ ..-.-|.+ T Consensus 255 ~~dael~~a~i~A~~~~fi~~~~~~~~~~~~----------------~~~~-----~~~~~~~l~pG~i~--~~L~pGe~ 311 (502) T protein:vir:79 255 YEDSELTAARIAAALGMYIRKGDGQSYEPDG----------------NGSK-----ENERELTIQPGIIY--DDLKPGEE 311 (502) T ss_pred HHHHHHHHHHHhhhheeeeecCCCccccccc----------------CCCC-----CccccccccCCccc--cccCCCce Confidence 9999999988888765444444333111000 0000 11111111000000 00122445 Q ss_pred eeecCCCCCcchHHH-HHHHHHHHHHhcCCCccccCCCCcccccchhhhhHHhhhHHHHHHHHHHHHHHHHHHHHH---- Q lcl|NC_015285. 152 ISTLPGGQNLGELED-VKYFQKKLYKALNVPSSRLETETTFNIGRAAEITRDEVKFQKFIARLRKRFSELFTDLLK---- 226 (359) Q Consensus 152 IsTLpGgqnLgei~D-V~YF~kkLy~aL~VP~SRl~~~~~~~~g~~~eItRDElKF~KFI~rLr~rFs~if~d~Lk---- 226 (359) |..+.....-+..++ ++...+.+=.+|+||-.-|..+ |+ ++-|.+.-.-+.|-+.+.++|+.|..-|..++- T Consensus 312 i~~~~p~~p~~~~~~f~~~~lr~iaaglGi~ye~lt~D--~s-~nySs~R~~~~e~~r~~~~~q~~~~~~~~~pi~~~~l 388 (502) T protein:vir:79 312 IGMVKSDRPNPNLETFRNGQLRAVAAGSRLSFSSTARN--YN-GTYSAQRQELVESTDGYLILQDWFIGAVTRPMYRAWL 388 (502) T ss_pred eeeeCCCCCCCCHHHHHHHHHHHHHhhcCCCHHHHhcc--cc-chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 555555444444333 3334444667899999999776 33 244555556667999999999988877666533 Q ss_pred HHHHhcCCCChhHHHHHhhceeeee--eccchHHHHHHHHHHHHHHHHHHHhhhhcchhhhHHHHHHHHhCCCHHHHH-H Q lcl|NC_015285. 227 TQLILKGVMSLEEWEDMKNHIQFDF--IADNYFTELKEIEIRNERMNQVAAMDPYVGKYFSVDYMRRQVLKQTEIEIK-E 303 (359) Q Consensus 227 ~QLiLkgI~t~eew~~~~~~I~~~f--~~Dn~f~ElKe~Eil~~Rl~~~~~~dp~vGKy~S~~~i~k~IL~~tDeeI~-e 303 (359) ...+|.|.++.-.|.+-.......| ..--+.-.+||+.-...+++. -+-|.+-+..+ .+..-+++- | T Consensus 389 ~~a~l~G~i~~p~~~~~~~~~~~~W~~p~~~~iDP~Ke~~a~~~~i~~---------Gl~t~~~~~a~-~G~D~~~v~~q 458 (502) T protein:vir:79 389 KQAVASGVIRLPRDLDRSSLYTAVYSGPVMPWIDPVKEAEAWKIQIRG---------GAATESDWVRA-GGRNPDDVKRR 458 (502) T ss_pred HHHHHcCCCCCCCCCCchhhcceeeecCCccccChHHHHHHHHHHHHc---------CCCCHHHHHHH-cCCCHHHHHHH Confidence 3567888876433333222333333 333344556666555444431 23355555555 455444433 3 Q ss_pred HHHHHHHHHhcCCCCCCcchhhhcCCCCCcccccccCCCCCcCCCCCCCCCcc Q lcl|NC_015285. 304 IDEQIASEMEAGIIADPMAEMDPAMAAGGEGAPAAEVDPNAQESSVDPGDVRR 356 (359) Q Consensus 304 ~~kqi~~E~~~~~~~~P~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~ 356 (359) .+...+..++.|+-.+.+. ....+ ++.. +....+.+.++++.+. T Consensus 459 ~a~e~~~~~~~Gl~~~~~~----~~~~~-~~~~----~~~~~e~~~~~~~~e~ 502 (502) T protein:vir:79 459 RKAEIDENRKLDLVFDTDP----ASDKG-GSSA----ATKRQEPQHTDDQSEE 502 (502) T ss_pred HHHHHHHHHHcCCCCCCCC----CCCCC-CCCC----CCCCCCCCCCCCCCCC Confidence 3333344444554332211 11111 1111 1111111111222222 No 30 >protein:vir:104082 Length: 485 # NCBI annotation: gp14 # Family: family:all:524 # MgeID: mge:1656 # MgeName: Che12 # Cross-refs: genbank:acc:YP_655593;genbank:gi:109392464;genbank:GeneID:4156950 Probab=96.88 E-value=0.00028 Score=40.07 Aligned_cols=307 Identities=13% Similarity=0.091 Sum_probs=135.1 Q ss_pred CCCchhhHHHhhh-----------hhhheeeccccccc--cCCCceeecHh---------HhhhhhcccccCCCCc--ch Q lcl|NC_015285. 1 MRGVDLNQQLTQK-----------AAEYFLYNPKGLKN--STNQGMKITTD---------SVTYCHSGIQDLNKNM--TL 56 (359) Q Consensus 1 ~~~~~~~~~~~~~-----------~~e~f~yn~~~~~~--~~~~~v~i~~~---------ai~y~hSGl~d~~~~~--i~ 56 (359) +.|+..+. +... +.-+-+|.+...+. ...++...... .|.|++..-.....|. |- T Consensus 153 ~~D~~~~~-~~~~~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~~~~~~G~s~i~ 231 (485) T protein:vir:10 153 EIDPRIGR-VSKAIRVAYDAEGNEIQAATLYTPNDIFGWYRVENEWQEWFNNPHGLGVVPVVPIPNRTRLSDLYGTSEIT 231 (485) T ss_pred EEcCCCCc-eeEEEEEEEeeCCCeEEEEEEEeCCeEEEEEEcCCceEEeccccCCCCcccEEEeccccccCCCCCccchh Confidence 12221111 1111 11122333322211 11111111110 1334443211111111 00 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHhcCccceeEeccCCCCchHHHHHHHHHHHHhhcceEEeeCCCCcccccccchhhH Q lcl|NC_015285. 57 SHLHKAIKAVNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKMVYDANTGEIKDDKKFMSMM 136 (359) Q Consensus 57 syL~~Aik~~NqL~m~EDalVIyR~~RAPeRRvFyIDvGnlpk~KAeqYl~~iM~kyrnklvYD~~TGevkdd~~~mSMl 136 (359) -++...+..+| +++-+..++-..+-.|.|-|.=.+..+.+.. ..+|...-+. .. T Consensus 232 ~~v~~liDa~~--~~~s~~~~~~~~~a~p~~~i~G~~~~~~~~~--------------------~~~~~~~~~~----~~ 285 (485) T protein:vir:10 232 PELRSMTDAAA--RILMLMQATAELMGVPQRLIFGIKPEEIGVD--------------------PETGQTLFDA----YL 285 (485) T ss_pred HHHHHHHHHHH--HHHHHHHHHHHhhcchHHHHhcCCccccccc--------------------ccccchhhhh----cc Confidence 01222222222 2455666666777777775553332222211 1111110000 00 Q ss_pred hhhcccccCCCCccceeecCCCCCcc-hHHHHHHHHHHHHHhcCCCccccCCCCcccccchhhhhHHhhhHHHHHHHHHH Q lcl|NC_015285. 137 EDFWLPRREGGRGTEISTLPGGQNLG-ELEDVKYFQKKLYKALNVPSSRLETETTFNIGRAAEITRDEVKFQKFIARLRK 215 (359) Q Consensus 137 EDywLpRReGgrgTEIsTLpGgqnLg-ei~DV~YF~kkLy~aL~VP~SRl~~~~~~~~g~~~eItRDElKF~KFI~rLr~ 215 (359) -..|+.-. -+.++-.++... ++ -++-++=.-..++..-++|.+-|...+ -|-..+..|..-+..+..-+.+.|. T Consensus 286 ~~i~~~~~---~d~k~~q~~~~~-~~~~~~~l~~~i~~~~~~~~~p~~~fg~~~-~n~~Sg~Al~~~~~~l~~k~~~k~~ 360 (485) T protein:vir:10 286 ARILAFED---AEGKIQQFSAAE-LANFTNALDQIAKQVAAYTGLPPQYLSTAA-DNPASAEAIRAAESRLIKKVERKNS 360 (485) T ss_pred cceeccCC---CCceEEeecccc-hHHHHHHHHHHHHHHhcccCCCHHHhcccc-CchhHHHHHHHHHHHHHHHHHHHHH Confidence 12243311 123344555432 22 122233333445555777877775432 2222344577777778888899999 Q ss_pred HHHHHHHHHHHHHHHhcCCCC-hhHHHHHhhceeeeeeccchHHHHHHHHHHHHHHHHHHHhhhhcchhhhHHHHHHHHh Q lcl|NC_015285. 216 RFSELFTDLLKTQLILKGVMS-LEEWEDMKNHIQFDFIADNYFTELKEIEIRNERMNQVAAMDPYVGKYFSVDYMRRQVL 294 (359) Q Consensus 216 rFs~if~d~Lk~QLiLkgI~t-~eew~~~~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~~~~~dp~vGKy~S~~~i~k~IL 294 (359) .|..-+...++.-+.+.+... ..+| ..|.+.|..-..-+. .+..+++..+..-+-..+|.++++. .| T Consensus 361 ~f~~~l~~~~~l~~~~~~~~~~~~~~----~~i~v~w~~~~~~~~-------~~~ada~~kl~~ag~~~~s~et~~~-~l 428 (485) T protein:vir:10 361 IFGGAWEEAMRLAYRMMKGGDVPPDM----LRMETVWRDPSTPTY-------AAKADAASKLYNGGTGVIPRERARK-DM 428 (485) T ss_pred HHHHHHHHHHHHHHHHhCCCCCcccc----eeeeEEecCCCCCCH-------HHHHHHHHHHHhccccCCCHHHHHH-hC Confidence 999999999987666655321 2222 467888865443333 3344555555433224679999985 59 Q ss_pred CCCHHHHHHHHHHHHHHHhcCCCCCCcchhhhcCCCCCcccccccCCCCCcCCCCCCCCCc Q lcl|NC_015285. 295 KQTEIEIKEIDEQIASEMEAGIIADPMAEMDPAMAAGGEGAPAAEVDPNAQESSVDPGDVR 355 (359) Q Consensus 295 ~~tDeeI~e~~kqi~~E~~~~~~~~P~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~ 355 (359) ++++++++++++..+++...+.- ..+ .-.+.+.+.++++.. .+-..++...+||++- T Consensus 429 g~~~~~~~~~~~~~ee~~~~~~~--~~~-~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~ 485 (485) T protein:vir:10 429 GYSIAEREEMRRWDEEEAAMGLG--LIG-TMVDPNPTVPGSPSP-APAPKPAALESGGDAA 485 (485) T ss_pred CCCHhHHHHHHHHHHHHHHHHHH--HHH-HhhccCCCCCCCCCc-cccccCcCCCCCCCCC Confidence 99999999887765555433210 000 001111111111111 1223334556666666 No 31 >protein:vir:4223 Length: 486 # NCBI annotation: predicted 53.7Kd protein # Family: family:all:524 # MgeID: mge:89 # MgeName: L5 # Cross-refs: genbank:acc:NP_039678;swissprot:sw:q05220;genbank:gi:9625444;uniprot:Q05220;genbank:GeneID:2942930;interpro:IPR010859 Probab=96.85 E-value=0.0003 Score=39.89 Aligned_cols=304 Identities=13% Similarity=0.113 Sum_probs=133.5 Q ss_pred CCCchhhHHHhhh-----------hhhheeeccccccc--cCCCceeecHh---------Hhhhhhc-ccccCCC----- Q lcl|NC_015285. 1 MRGVDLNQQLTQK-----------AAEYFLYNPKGLKN--STNQGMKITTD---------SVTYCHS-GIQDLNK----- 52 (359) Q Consensus 1 ~~~~~~~~~~~~~-----------~~e~f~yn~~~~~~--~~~~~v~i~~~---------ai~y~hS-Gl~d~~~----- 52 (359) +-|+.++ +++-+ +..+-+|.+...+. ..+++.+.... .|.|++- -+-.+.+ T Consensus 153 i~d~~~~-~~~~~~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~h~~g~vPvv~~~n~~~~~~~~G~s~i~ 231 (486) T protein:vir:42 153 EIDPRIN-RVSKAIRVAYDKEGNEIQAATLYTPMETIGWFRADGEWAEWFNVPHGLGVVPVVPLPNRTRLSDLYGTSEIT 231 (486) T ss_pred EEeCCCC-CeEEEEEEEEecCCCeEEEEEEEcCCcEEEEEecCCcEEeecceecCCCCceEEEeccccccCCCCCcccch Confidence 1111111 11111 11122344433221 11111111110 0223332 0100011 Q ss_pred CcchhhHHHHHHHHHHHHHHHHHHHHHHHhcCccceeEeccCCCCchHHHHHHHHHHHHhhcceEEeeCCCCcccccccc Q lcl|NC_015285. 53 NMTLSHLHKAIKAVNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKMVYDANTGEIKDDKKF 132 (359) Q Consensus 53 ~~i~syL~~Aik~~NqL~m~EDalVIyR~~RAPeRRvFyIDvGnlpk~KAeqYl~~iM~kyrnklvYD~~TGevkdd~~~ 132 (359) ..|.+..+++-+. +-+..++-..+-.|.|-|.-.+....+.... +.+.+.++..| T Consensus 232 ~~v~~liDa~~~~------~s~~~~~~e~~a~p~~~i~G~~~~~~~~~~~-----------~~~~~~~~~~~-------- 286 (486) T protein:vir:42 232 PELRSMTDAAARI------LMLMQATAELMGVPQRLIFGIKPEEIGVDSE-----------TGQTLFDAYLA-------- 286 (486) T ss_pred hhHHHHHHHHHHH------HHHHHHHHHhhcchHHHhhcCCccccccccc-----------cccchhhhhhc-------- Confidence 1233444444433 3344555555555655544332222110000 01111111122 Q ss_pred hhhHhhhcccccCCCCccceeecCCCCCcchHHHHHHHHHHHHHhcCCCccccCCCCcccccchhhhhHHhhhHHHHHHH Q lcl|NC_015285. 133 MSMMEDFWLPRREGGRGTEISTLPGGQNLGELEDVKYFQKKLYKALNVPSSRLETETTFNIGRAAEITRDEVKFQKFIAR 212 (359) Q Consensus 133 mSMlEDywLpRReGgrgTEIsTLpGgqnLgei~DV~YF~kkLy~aL~VP~SRl~~~~~~~~g~~~eItRDElKF~KFI~r 212 (359) ..|..-.+ ..++..+|+..-=.-++-++=.-.++...-++|..-|...+. |-..+..|.--+.....-+.+ T Consensus 287 -----~~~~~~~~---~~~~~q~~~~~~e~~~~~l~~~i~~~s~~~~~p~~~fg~~~~-n~~Sg~Al~~~~~~l~~ka~~ 357 (486) T protein:vir:42 287 -----RILAFEDA---EGKIQQFSAAELANFTNALDQIAKQVAAYTGLPPQYLSTAAD-NPASAEAIRAAESRLIKKVER 357 (486) T ss_pred -----hhcccCCC---CceEEeecccCHHHHHHHHHHHHHHHhcccCCCHHHhccccC-chhHHHHHHHHHHHHHHHHHH Confidence 22322111 134556665431123333444445556667888777743321 212344566677778888899 Q ss_pred HHHHHHHHHHHHHHHHHHhcCCCC-hhHHHHHhhceeeeeeccchHHHHHHHHHHHHHHHHHHHhhhhcchhhhHHHHHH Q lcl|NC_015285. 213 LRKRFSELFTDLLKTQLILKGVMS-LEEWEDMKNHIQFDFIADNYFTELKEIEIRNERMNQVAAMDPYVGKYFSVDYMRR 291 (359) Q Consensus 213 Lr~rFs~if~d~Lk~QLiLkgI~t-~eew~~~~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~~~~~dp~vGKy~S~~~i~k 291 (359) .|..|..-+...++.-+.+.|... +.+| ..|++.|.....=+. .+.++.+..+..-...++|.++++ T Consensus 358 ~~~~f~~~l~~~~~l~~~~~~~~~~~~d~----~~i~v~w~~~~~~s~-------~~~ad~~~kl~~~~~g~~s~et~~- 425 (486) T protein:vir:42 358 KNLMFGGAWEEAMRIAYRIMKGGDVPPDM----LRMETVWRDPSTPTY-------AAKADAATKLYGNGQGVIPRERAR- 425 (486) T ss_pred HHHHHHHHHHHHHHHHHHHhcCCCccccc----eeeeEEecCCCCCCH-------HHHHHHHHHHHhcccCCCCHHHHH- Confidence 999999999999998777766532 2233 368888865544333 334444444443322467999988 Q ss_pred HHhCCCHHHHHHHHHHHHHHHhcCCCCCCcchhhhcCCCCCcccccccCCC-CCcCCCCCCCCC Q lcl|NC_015285. 292 QVLKQTEIEIKEIDEQIASEMEAGIIADPMAEMDPAMAAGGEGAPAAEVDP-NAQESSVDPGDV 354 (359) Q Consensus 292 ~IL~~tDeeI~e~~kqi~~E~~~~~~~~P~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~p~~~ 354 (359) ..|++++++++++++.-+++...+. .....+ ...+..+++++++..++ ..+++...+|++ T Consensus 426 ~~lg~~~d~~~e~~~~~~e~~~~~~--~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 486 (486) T protein:vir:42 426 IDMGYSVKEREEMRRWDEEEAAMGL--GLLGTM-VDADPTVPGSPSPTAPPKPQPAIESSGGDA 486 (486) T ss_pred hcCCCChhHHHHHHHHHHHHHHHHH--HHHHHh-hcCCCCCCCCCCCCCCCCCCcccCCCCCCC Confidence 5699999999887764343322211 111111 11111122222211111 223334555555 No 32 >protein:vir:2341 Length: 488 # NCBI annotation: gp11 # Family: family:all:524 # MgeID: mge:51 # MgeName: Bxb1 # Cross-refs: genbank:acc:NP_075278;genbank:gi:12657865;genbank:GeneID:920078 Probab=96.83 E-value=0.00031 Score=39.80 Aligned_cols=306 Identities=13% Similarity=0.148 Sum_probs=130.8 Q ss_pred CCCchhhHHHh----------hhhhhheeeccccccc--cCCCceeecHh---------HhhhhhcccccCCCCcchhhH Q lcl|NC_015285. 1 MRGVDLNQQLT----------QKAAEYFLYNPKGLKN--STNQGMKITTD---------SVTYCHSGIQDLNKNMTLSHL 59 (359) Q Consensus 1 ~~~~~~~~~~~----------~~~~e~f~yn~~~~~~--~~~~~v~i~~~---------ai~y~hSGl~d~~~~~i~syL 59 (359) +.++.++...+ +.+..+.+|.+...+. ..+++...... .|.|+|..-..+ ....|-| T Consensus 158 ~~d~~~~~~~~~~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~h~~g~vPvv~f~n~~~~~~--~~G~s~i 235 (488) T protein:vir:23 158 EVDPRTRKVLYAIRAIYGADGNEIVSATLYLPDTTMTWLRAEGEWEAPTSTPHGLEMVPVIPISNRTRLSD--LYGTSEI 235 (488) T ss_pred EEecCCCceEEEEEEEEecCCCcEEEEEEEecCcEEEEEecCCceEeccccccCCCCcceEEeccccccCC--cCCccch Confidence 11111111000 0111222333322211 11111111110 033444322111 1122444 Q ss_pred HHHHHHH-H-HHHHHHHHHHHHHHhcCccceeEeccCCCCchHHHHHHHHHHHHhhcceEEeeCCCCcccccccchhhHh Q lcl|NC_015285. 60 HKAIKAV-N-QLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKMVYDANTGEIKDDKKFMSMME 137 (359) Q Consensus 60 ~~Aik~~-N-qL~m~EDalVIyR~~RAPeRRvFyIDvGnlpk~KAeqYl~~iM~kyrnklvYD~~TGevkdd~~~mSMlE 137 (359) .+.++++ . -=+++-+..+.-..+.-|.|-|.=.+....+.... +.+.+.++..| T Consensus 236 ~~~v~~l~Da~~~~~s~~~~~~~~~a~p~~~i~G~~~~~~~~~~~-----------~~~~~~~~~~~------------- 291 (488) T protein:vir:23 236 SPELRSVTDAAAQILMNMQGTANLMAIPQRLIFGAKPEELGINAE-----------TGQRMFDAYMA------------- 291 (488) T ss_pred hhhHHHHHHHHHHHHHHHHHHHHHhhhHHHHHhCCCccccccccc-----------ccchhhhhhhh------------- Confidence 4433332 1 12344555566566666666554222111111000 00011111111 Q ss_pred hhcccccCCCCccceeecCCCCCcc-hHHHHHHHHHHHHHhcCCCccccCCCCcccccchhhhhHHhhhHHHHHHHHHHH Q lcl|NC_015285. 138 DFWLPRREGGRGTEISTLPGGQNLG-ELEDVKYFQKKLYKALNVPSSRLETETTFNIGRAAEITRDEVKFQKFIARLRKR 216 (359) Q Consensus 138 DywLpRReGgrgTEIsTLpGgqnLg-ei~DV~YF~kkLy~aL~VP~SRl~~~~~~~~g~~~eItRDElKF~KFI~rLr~r 216 (359) ..|.- ++|-..++-++++.. ++ -++-++=....++...++|..-|...+. |-..+..|.--+..+-.-+.+.+.. T Consensus 292 ~v~~~--~~g~~~~~~q~~~~~-~~~~~~~l~~~i~~~~~~~~~p~~~~g~~~~-n~~Sg~Al~~~~~~l~~k~~~~~~~ 367 (488) T protein:vir:23 292 RILAF--EGGEGAHAEQFSAAE-LRNFVDALDALDRKAASYSGLPPQYLSSSSD-NPASAEAIKAAESRLVKKVERKNKI 367 (488) T ss_pred hhccC--CCCCCceeEecCCCC-hHHHHHHHHHHHHHHhcccCCCHHHhccccC-cchHHHHHHHHHHHHHHHHHHHHHH Confidence 22322 233445677777654 33 2333444445566677888777743321 2223444666666677788889999 Q ss_pred HHHHHHHHHHHHHHhcCCC-ChhHHHHHhhceeeeeeccchHHHHHHHHHHHHHHHHHHHhhhhcchhhhHHHHHHHHhC Q lcl|NC_015285. 217 FSELFTDLLKTQLILKGVM-SLEEWEDMKNHIQFDFIADNYFTELKEIEIRNERMNQVAAMDPYVGKYFSVDYMRRQVLK 295 (359) Q Consensus 217 Fs~if~d~Lk~QLiLkgI~-t~eew~~~~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~~~~~dp~vGKy~S~~~i~k~IL~ 295 (359) |..-+.+.++.-+.+.|.. ...+| ..|.+.|..-..-+. .+.++++..+..-....+|.++++.. |+ T Consensus 368 f~~~l~~~~~l~~~~~~~~~~~~~~----~~i~v~f~~~~~~s~-------~~~ada~~kl~~~g~~~~s~et~~~~-l~ 435 (488) T protein:vir:23 368 FGGAWEQAMRLAYKMVKGGDIPTEY----YRMETVWRDPSTPTY-------AAKADAAAKLFANGAGLIPRERGWVD-MG 435 (488) T ss_pred HHHHHHHHHHHHHHHhcCCCcchhh----ccceEEecCCCCCCH-------HHHHHHHHHHHhcccccCCHHHHHHh-CC Confidence 9999999988877655543 22333 457888864433333 34444555543332246799998866 79 Q ss_pred CCHHHHHHHHHHHHHHHhcCCCCCCcchhhhcCCCCCcccccccCCCCCcCCCCCCCCC Q lcl|NC_015285. 296 QTEIEIKEIDEQIASEMEAGIIADPMAEMDPAMAAGGEGAPAAEVDPNAQESSVDPGDV 354 (359) Q Consensus 296 ~tDeeI~e~~kqi~~E~~~~~~~~P~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~ 354 (359) +++++++++++..++|..... .+++.-.+.+.+...++...+..++.+. |-.+ T Consensus 436 ~~~d~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~e-~~~a 488 (488) T protein:vir:23 436 YTIVEREQMRQWLEQDQKQGL-----GLIGSLYGASTPEGKPGEAPVGEPPAPE-PDAA 488 (488) T ss_pred CCchHHHHHHHHHHHHHHHHH-----HHHHHHhccCCCcccCCCCCCCCCCCCC-CCCC Confidence 999999887654444422210 1111111111011111111111112221 1122 No 33 >protein:vir:78227 Length: 480 # NCBI annotation: gp11 # Family: family:all:524 # MgeID: mge:1849 # MgeName: Bethlehem # Cross-refs: genbank:acc:YP_001491663;genbank:gi:157786487;genbank:GeneID:5625705 Probab=96.69 E-value=0.00041 Score=39.17 Aligned_cols=303 Identities=15% Similarity=0.142 Sum_probs=132.7 Q ss_pred CCCchhh------HHH------hhhhhhheeeccccccc---cCC---CceeecHhH----------hhhhhcccccCCC Q lcl|NC_015285. 1 MRGVDLN------QQL------TQKAAEYFLYNPKGLKN---STN---QGMKITTDS----------VTYCHSGIQDLNK 52 (359) Q Consensus 1 ~~~~~~~------~~~------~~~~~e~f~yn~~~~~~---~~~---~~v~i~~~a----------i~y~hSGl~d~~~ 52 (359) +-|+... +.+ .+.+..+.+|.+..... .++ ..+. +.+. |.|++---.+... T Consensus 141 ~~D~~~~~~~~~~i~~~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~g~vPvv~f~n~~~~~~~~ 219 (480) T protein:vir:78 141 ELDPRNTRRVTRAVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVV-DGDVIKHGLGVVPVVPLTNDPRLGNRY 219 (480) T ss_pred EEcCCCccceEEEEEEEEeecCCCceEEEEEEeCCeEEEEEecCCCcccccc-ccccccCCCCCcceEEeecccccCCcc Confidence 1111100 100 11222233444432211 000 0010 0010 1122211111011 Q ss_pred CcchhhHHHH----HHHHHHHHHHHHHHHHHHHhcCccceeEeccCCCCchHHHHHHHHHHHHhhcceEEeeCCCCcccc Q lcl|NC_015285. 53 NMTLSHLHKA----IKAVNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKMVYDANTGEIKD 128 (359) Q Consensus 53 ~~i~syL~~A----ik~~NqL~m~EDalVIyR~~RAPeRRvFyIDvGnlpk~KAeqYl~~iM~kyrnklvYD~~TGevkd 128 (359) ..|=|.+. +-.+| +++-+..++-...-.|.|-|.=.+....+..+. + T Consensus 220 --G~s~i~~~v~~l~Da~~--~~~s~~~~~~~~~a~p~~~i~G~~~~~~~~~~~---------------------~---- 270 (480) T protein:vir:78 220 --GRSEISPELRKVTDAAS--RTLMNLQSASQILGTPLRVISGVTTDELTNDGE---------------------N---- 270 (480) T ss_pred --CcccchhhHHHHHHHHH--HHHHHHHHHHHhhcchhhhhhcCCccccccccc---------------------c---- Confidence 11222222 22222 244566666666667776554222222111100 0 Q ss_pred cccchhhHh-hhcccccCCCCccceeecCCCCCcch-HHHHHHHHHHHHHhcCCCccccCCCCcccccchhhhhHHhhhH Q lcl|NC_015285. 129 DKKFMSMME-DFWLPRREGGRGTEISTLPGGQNLGE-LEDVKYFQKKLYKALNVPSSRLETETTFNIGRAAEITRDEVKF 206 (359) Q Consensus 129 d~~~mSMlE-DywLpRReGgrgTEIsTLpGgqnLge-i~DV~YF~kkLy~aL~VP~SRl~~~~~~~~g~~~eItRDElKF 206 (359) .++-.+.. ..|++ |..+++.++++.+ ++- ++-++-....++..-++|..=|+..+. |-..+..|.--+... T Consensus 271 -~~~~~~~~~~~~~~----~~~~~~~~~~~~~-~~~~~~~l~~~i~~~~~~~~~p~~~~g~~~~-n~~Sg~Alk~~~~~l 343 (480) T protein:vir:78 271 -TTLDIYYGRILTLA----SEAAKISEFKAAE-LRNFAEEMEVFRKEAASITGLPPQYLSSSSE-NPASAEAIIATDSRI 343 (480) T ss_pred -chhhhhhhhhccCC----CCCceEEecCccC-HHHHHHHHHHHHHHHhcccCCChHHhccccC-cchHHHHHHHHHHHH Confidence 00111111 12332 3346677777653 333 344666667777778888877754322 222233455556667 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhcCCCChhHHHHHhhceeeeeeccchHHHHHHHHHHHHHHHHHHHhhhhcchhhhH Q lcl|NC_015285. 207 QKFIARLRKRFSELFTDLLKTQLILKGVMSLEEWEDMKNHIQFDFIADNYFTELKEIEIRNERMNQVAAMDPYVGKYFSV 286 (359) Q Consensus 207 ~KFI~rLr~rFs~if~d~Lk~QLiLkgI~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~~~~~dp~vGKy~S~ 286 (359) ..-+.+.|+.|..-+.+.++.-+.+.|.--..+|. .|.+.|..-..=+. .+.++.+.++-.-.+-.+|. T Consensus 344 ~~ka~~~~~~f~~~l~~~~~l~~~~~g~~~~~~~~----~i~v~f~~~~~~s~-------~~~ad~~~kl~~~g~~~~s~ 412 (480) T protein:vir:78 344 VKMAERKGRIFGGAWERAMRIAMQIMGREVTEEYT----RLETVWRDPSTPTV-------AAKADAVSKLYANGQGPIPK 412 (480) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHcCCCccccce----eeeEEecCCCCCCH-------HHHHHHHHHHHHhccccCCH Confidence 77889999999999999999888788854445554 46677753322222 23444445544433455799 Q ss_pred HHHHHHHhCCCHHHHHHHHHHHHHHHhcC--CCCCCcchhhhcCCCCCcccccccCCCCCcCCCCCCCCCccCCC Q lcl|NC_015285. 287 DYMRRQVLKQTEIEIKEIDEQIASEMEAG--IIADPMAEMDPAMAAGGEGAPAAEVDPNAQESSVDPGDVRRGEF 359 (359) Q Consensus 287 ~~i~k~IL~~tDeeI~e~~kqi~~E~~~~--~~~~P~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~ 359 (359) +++... |++++++++++++..+++.... ....+.. .++.+.+...+++..++. .+.|+.--+..- T Consensus 413 et~~~~-lg~~~d~~~~~~~~~~e~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~ 479 (480) T protein:vir:78 413 EQARID-LGYTATQREQMRDWDKQETEDMIDTLYSTTK----AQADATPKPTVTETKTET---QTSPSGFNRTKT 479 (480) T ss_pred HHHHhc-CCCCHhHHHHHHHHHHHHHHHHHHHhhcccc----ccCCCCCCCCCCCCCCcc---ccccCCCCcccC Confidence 998866 8999999998776444433221 1111111 111111111112111111 111111111111 No 34 >protein:vir:79772 Length: 648 # NCBI annotation: portal protein # Family: family:all:3222 # MgeID: mge:1874 # MgeName: 0305phi8-36 # Cross-refs: genbank:acc:YP_001429612;genbank:gi:156564103;genbank:GeneID:5525537 Probab=96.55 E-value=0.00052 Score=38.58 Aligned_cols=311 Identities=17% Similarity=0.245 Sum_probs=125.9 Q ss_pred CCCchhh---------HHHhhhhhhheeeccc---------cc-----c--ccCCCceeecHhHhhhhhcccccCCCCcc Q lcl|NC_015285. 1 MRGVDLN---------QQLTQKAAEYFLYNPK---------GL-----K--NSTNQGMKITTDSVTYCHSGIQDLNKNMT 55 (359) Q Consensus 1 ~~~~~~~---------~~~~~~~~e~f~yn~~---------~~-----~--~~~~~~v~i~~~ai~y~hSGl~d~~~~~i 55 (359) +++...+ .....-+.+++--+|. |. + ...+..+.++.+-|.+..-+. +.++-.= T Consensus 165 iRd~~G~~~~~l~~~~~~~~~~v~~l~pl~p~~v~v~~d~~g~~~~Y~y~~~g~~~~~~~~~~dIIHik~~~-~~d~~~G 243 (648) T protein:vir:79 165 SRAKDALPFQGMNVMGVGDSMPVAGYFPLNLASMKVKRDKFGMIKGWQQEQEGQDKPQKFKPEDIVHIYYKR-EKGRAFG 243 (648) T ss_pred EecCCCccchhhhhhhhccccceeeeEeecCceeEEEEcCCCceeeeEEEecCCceeEEecCccEEEEccCC-CCCCcee Confidence 2222111 1111122223222221 10 0 112233556666665443111 1222223 Q ss_pred hhhHHHHHHHHHHHHHHHHHHH-HHHHhcCccceeEeccCCCCchHHHHHHHHHHHHhhcceEEeeCCCCcccccccchh Q lcl|NC_015285. 56 LSHLHKAIKAVNQLRMIEDSLV-IYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKMVYDANTGEIKDDKKFMS 134 (359) Q Consensus 56 ~syL~~Aik~~NqL~m~EDalV-IyR~~RAPeRRvFyIDvGnlpk~KAeqYl~~iM~kyrnklvYD~~TGevkdd~~~mS 134 (359) +|-|..|+..+.....+++..- .++=.--| .-|+.+..+......+++.++.+-..|++..+ .. T Consensus 244 lSpi~~a~~aI~l~~aa~~~~~~fF~NGa~P-~gil~~~~~~~~~e~~k~~~e~~~~~~~~~~i---~g----------- 308 (648) T protein:vir:79 244 TPWLLPALDDIRALRQVEENVLRLVYRNLHP-LWHVKVGLEQEGFGAEEGEVDLVRGEVENMDV---EG----------- 308 (648) T ss_pred ccHHHHHHHHHHHHHHHHHHHHHHHhccCCc-cEEEEeCCCccchHHHHHHHHHHHHhcccccc---cc----------- Confidence 5677777777766555555433 22222223 45555555555555566666666666655332 11 Q ss_pred hHhhhcccccCCCCccceeecCC-C--CCcchHHHHHHHHHHHHHhcCCCccccCCCCcccccchhhhhHHhhhHHHHHH Q lcl|NC_015285. 135 MMEDFWLPRREGGRGTEISTLPG-G--QNLGELEDVKYFQKKLYKALNVPSSRLETETTFNIGRAAEITRDEVKFQKFIA 211 (359) Q Consensus 135 MlEDywLpRReGgrgTEIsTLpG-g--qnLgei~DV~YF~kkLy~aL~VP~SRl~~~~~~~~g~~~eItRDElKF~KFI~ 211 (359) |+...+...++- + +.+.-++-.++..+..-++.+||...|+..++-+...++.... -|...|. T Consensus 309 -----------g~v~~~~~~i~~~~s~~dlqfle~rk~~~~eIa~aFgVPP~lLG~~~~ss~stae~~~~---~~~~~i~ 374 (648) T protein:vir:79 309 -----------GMVTTERVNISSIASNQIIDAKEYLKHFEQRAFTVLGVSELMMGRGGTASRSTGDNLSS---DFKDRIK 374 (648) T ss_pred -----------cccccceeeccccCCHHHHHHHHHHHHHHHHHHHHhCCCHhHcccCCCccchHHHHHHH---HHHHHHH Confidence 111112222211 1 1222233347788999999999999997544433333333222 2778888 Q ss_pred HHHHHHHHHHHHHHHHHHHhcCCCChhHHHHHhhceeeeeeccchHHHHHHHHHHHHHHHHHHHhhhhcchhhhHHHHHH Q lcl|NC_015285. 212 RLRKRFSELFTDLLKTQLILKGVMSLEEWEDMKNHIQFDFIADNYFTELKEIEIRNERMNQVAAMDPYVGKYFSVDYMRR 291 (359) Q Consensus 212 rLr~rFs~if~d~Lk~QLiLkgI~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~~~~~dp~vGKy~S~~~i~k 291 (359) .|+..+..++...+-+.+++..-+. +|-.....|+|+|..-..-.+.. |.+.+.++ +-+-++|.+-++. T Consensus 375 ~l~~~i~~~le~~~~~~ll~e~~l~--~~l~~d~~ieF~~~~Llr~D~~~-------~a~~~~~l--~~~GilT~NEaR~ 443 (648) T protein:vir:79 375 ALQKVMATFINEFMVKEILMEGGFD--PVLNPDDKVEFRFNEIDMDSKIK-------LENQAVFL--YEHNAISEDEMRE 443 (648) T ss_pred HHHHHHHHHHHHHHHHHHhhhhhcc--ccccccceEEEeecccchhhHHH-------HHHHHHHH--HhCCCcCHHHHHH Confidence 8888777777766555555544332 23222345667765221112222 22333222 2234667777764 Q ss_pred HHhCCCHHHHHHHHHHHHHHHhcCCCCCCcchhh-hc--CCCCCccc--c-cccC---------CCCCcCCCCC-CCCCc Q lcl|NC_015285. 292 QVLKQTEIEIKEIDEQIASEMEAGIIADPMAEMD-PA--MAAGGEGA--P-AAEV---------DPNAQESSVD-PGDVR 355 (359) Q Consensus 292 ~IL~~tDeeI~e~~kqi~~E~~~~~~~~P~~~~~-~~--~~~~~~~~--~-~~~~---------~~~~~~~~~~-p~~~~ 355 (359) .+++..-+=......+ ....+ |.++.. +. ++.|+++. + .++. .|+.+..+.+ |..-. T Consensus 444 -~lGlpPi~~g~~~~~l----~~~~~--~~~~~~~~~~~~~~~~~~~~~~a~~eg~~~e~~~~~~~~~~~g~~~~~~~~~ 516 (648) T protein:vir:79 444 -LIGRDPVDDGEGRAKM----HLQMV--TIAQATALAALAPTPAGGSSASASGDKKKKATDNKTKPTNQHGTKTSPKKQT 516 (648) T ss_pred -HhCCCCCCCCCCcccc----ccccc--cchhccccccCCCCCCCCCCCCccccccccccCCCCCCCCCCCcCCCCcccc Confidence 3677542100000000 00000 000000 00 00010000 0 0000 1111000000 00000 Q ss_pred cCCC Q lcl|NC_015285. 356 RGEF 359 (359) Q Consensus 356 ~~~~ 359 (359) .++- T Consensus 517 ~~~~ 520 (648) T protein:vir:79 517 NGRH 520 (648) T ss_pred chhh Confidence 0100 No 35 >protein:vir:99072 Length: 479 # NCBI annotation: gp27 # Family: family:all:524 # MgeID: mge:1671 # MgeName: Wildcat # Cross-refs: genbank:acc:YP_655892;genbank:gi:109521464;genbank:GeneID:4158037 Probab=96.48 E-value=0.00059 Score=38.30 Aligned_cols=300 Identities=10% Similarity=0.006 Sum_probs=121.4 Q ss_pred CCCchhhHHH-----hhhh--hhheeeccccccccCCCceee-cHh--------HhhhhhcccccCCC----CcchhhHH Q lcl|NC_015285. 1 MRGVDLNQQL-----TQKA--AEYFLYNPKGLKNSTNQGMKI-TTD--------SVTYCHSGIQDLNK----NMTLSHLH 60 (359) Q Consensus 1 ~~~~~~~~~~-----~~~~--~e~f~yn~~~~~~~~~~~v~i-~~~--------ai~y~hSGl~d~~~----~~i~syL~ 60 (359) ..++..+... +... ..||..+..........+... +.. .|.|+|.--.+..+ --+++.++ T Consensus 154 ydd~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~h~~g~vPvv~f~n~~~~~~~g~sd~e~v~~liD 233 (479) T protein:vir:99 154 WEDPYWDEWPKYLLERQPNGQYWWWTEEDYSIFEFKQGKFIYRETVSHDYGHIPFVRYVNVMDLRGVCYGDVEPLVTVAK 233 (479) T ss_pred ecCCcccceeeEEEeecCceeEEEEecceEEEEEecCCceeeccccccCCCCcceEEeecCCCcCcCCcchhHHHHHHHH Confidence 0111100000 0000 011111100000001111111 000 13344442111000 01444444 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhcCccceeEeccCCCCchHHHHHHHHHHHHhhcceEEeeCCCCcccccccchhhHhhhc Q lcl|NC_015285. 61 KAIKAVNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKMVYDANTGEIKDDKKFMSMMEDFW 140 (359) Q Consensus 61 ~Aik~~NqL~m~EDalVIyR~~RAPeRRvFyIDvGnlpk~KAeqYl~~iM~kyrnklvYD~~TGevkdd~~~mSMlEDyw 140 (359) ++-+ .+-+..++-.....|.|-++-. .++.... .-...+.-+.++ .| T Consensus 234 a~~~------~~s~~~~~~~~~a~p~~~i~G~---~~~~~~~--~~~~~~~~~~~~----------------------i~ 280 (479) T protein:vir:99 234 AIDK------TGLDILLVQHHQSFQIRWATGL---MLPEGAN--ADQEKMRFAQES----------------------ML 280 (479) T ss_pred HHHH------HHHHHHHHHHHhhchhhhhcCC---Ccccccc--cchhcccccccc----------------------ce Confidence 4443 3455556666666777655421 1111000 000000000011 12 Q ss_pred ccccCCCCccceeecCCCCCcchHHHHHHHHHHHHHhcCCCccccCCCCcccccchh--hhhHHhhhHHHHHHHHHHHHH Q lcl|NC_015285. 141 LPRREGGRGTEISTLPGGQNLGELEDVKYFQKKLYKALNVPSSRLETETTFNIGRAA--EITRDEVKFQKFIARLRKRFS 218 (359) Q Consensus 141 LpRReGgrgTEIsTLpGgqnLgei~DV~YF~kkLy~aL~VP~SRl~~~~~~~~g~~~--eItRDElKF~KFI~rLr~rFs 218 (359) .. .+-+.++-++|+..--.-++-++=....+...-++|..-|+. .|++| .|.--+...-.-+.+.|+.|. T Consensus 281 ~~---~~~~~~~~q~~~~~~~~~~~~l~~~i~~i~~~t~~p~~~~g~-----~~n~Sg~Al~~~~~~l~~ka~~~~~~f~ 352 (479) T protein:vir:99 281 IS---QNEKASFGAIPAAPLDGLLNAYKESLLEFLALAQLPPHIAGQ-----IVNVAADALAAGTRQTMQKLFEKQATWK 352 (479) T ss_pred ee---cCCCceEEEecccchHHHHHHHHHHHHHHhccCCCCHHHccc-----ccchHHHHHHHHHHHHHHHHHHHHHHHH Confidence 11 122345667775432222222333333444455677655532 12222 344344445677889999999 Q ss_pred HHHHHHHHHHHHhcCCCChhHHHHHhhceeeeeeccchHHHHHHHHHHHHHHHHHHHhhhhcchhhhHHHHHHHHhCCCH Q lcl|NC_015285. 219 ELFTDLLKTQLILKGVMSLEEWEDMKNHIQFDFIADNYFTELKEIEIRNERMNQVAAMDPYVGKYFSVDYMRRQVLKQTE 298 (359) Q Consensus 219 ~if~d~Lk~QLiLkgI~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~~~~~dp~vGKy~S~~~i~k~IL~~tD 298 (359) .-+.+.|+.-+.+.|..-..++ -.|.+.|..-..=+. .+..+.+.++..- | .+|.+++++.+.++|+ T Consensus 353 ~al~~~~~l~~~~~~~~~~~~~----~~i~~~w~~~~~~s~-------~~~ad~~~kl~~a-g-~is~et~l~~l~gv~~ 419 (479) T protein:vir:99 353 ASHNQTMRLVNKIEGRTEEATD----LDFTITWQDVTIQSL-------AQFADAWAKMVES-L-KIPAEGVWDMIPNLDQ 419 (479) T ss_pred HHHHHHHHHHHHHcCCCccccc----eeeeEEecCCCCCCH-------HHHHHHHHHHHhc-C-CCCHHHHHHhcCCCCH Confidence 9999999988888886433322 236677743211111 2344555554322 3 3899999999889999 Q ss_pred HHHHHHHHHHHHHHhcCCCCCCcchhhhcCCCCCcccccccCCCCCcCCCCCCCCC----ccCC Q lcl|NC_015285. 299 IEIKEIDEQIASEMEAGIIADPMAEMDPAMAAGGEGAPAAEVDPNAQESSVDPGDV----RRGE 358 (359) Q Consensus 299 eeI~e~~kqi~~E~~~~~~~~P~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~----~~~~ 358 (359) .+++.+.+..+++...+.+.+ .+. +++.+++..+++.-.+++.++.+.||++ +-|- T Consensus 420 ~~~e~~~~~~~~~~~~~~~~~---~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 479 (479) T protein:vir:99 420 STVNGWKEIYDREGDFGKYMR---KLQ-NGPDPAEQRGGPNGATNMQQANNKTGEPASLNKSGA 479 (479) T ss_pred HHHHHHHHHHHHHHHHHHHHH---HHh-cccCcccccCCCCCCCCCCCCCCCCcchhccCCCCC Confidence 999877665555422221110 010 0111111111111122222333323221 1111 No 36 >protein:vir:101418 Length: 569 # NCBI annotation: Prt # Family: family:all:9458 # MgeID: mge:1512 # MgeName: P1 # Cross-refs: genbank:acc:YP_006480;genbank:gi:46401636;genbank:GeneID:2777482 Probab=96.42 E-value=0.00065 Score=38.07 Aligned_cols=318 Identities=17% Similarity=0.199 Sum_probs=174.3 Q ss_pred CCCchhhHHHhhhhhhhee-----------------eccccccccCCCceeecHhHhhhhhcccc------cCCCCcc-- Q lcl|NC_015285. 1 MRGVDLNQQLTQKAAEYFL-----------------YNPKGLKNSTNQGMKITTDSVTYCHSGIQ------DLNKNMT-- 55 (359) Q Consensus 1 ~~~~~~~~~~~~~~~e~f~-----------------yn~~~~~~~~~~~v~i~~~ai~y~hSGl~------d~~~~~i-- 55 (359) +..-+.|-+.+.=...|-. =.|++.+.+. +.=+|+|.. |..+... T Consensus 199 IqpFE~g~~tvGF~~~~~~~~~~ti~~l~p~qm~rmKmPrm~~i~q----------~~~v~~g~~~~~L~~d~~~~~Pi~ 268 (569) T protein:vir:10 199 IKEFEVSGNLAGFSGDYLKDASGKMVFADPWAIIPMKIPYWRPKSN----------LMPVHTGHKAYSLLDNPEERTPIE 268 (569) T ss_pred cchhhhcCceEEeecccCCccccceeeechhhhhhhcccceeeccc----------cchhhhhhhheeeccccccccccc Confidence 2221111111111111111 0222222211 111222222 2222222 Q ss_pred -----hhhHHHHHHHHHHHHHHHHHHHHHHHhcCccceeEeccCCCCchHHHHHHHHHHHHhhc---ceEEeeCCCCccc Q lcl|NC_015285. 56 -----LSHLHKAIKAVNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYR---NKMVYDANTGEIK 127 (359) Q Consensus 56 -----~syL~~Aik~~NqL~m~EDalVIyR~~RAPeRRvFyIDvGnlpk~KAeqYl~~iM~kyr---nklvYD~~TGevk 127 (359) =|||+.|-+||-+|..-=-+|--=|+-=|.--++.=+-.-.|||+++.+|++.+-.-.| ..+.-=+..|+.- T Consensus 269 psn~GgSFL~~ae~pf~~l~~Al~sL~~qri~dSv~~~~Itlnm~gM~p~qr~~y~r~lt~~LKr~~d~ie~a~~gg~~~ 348 (569) T protein:vir:10 269 TQNYGTSLLEYAYEPYMNLRSAIRSLKATRFNASKIDRIIGLAMNSLDPVKAADYSRTITQTLKRAADLMERRARGANNM 348 (569) T ss_pred chhhhhHHHHHHHhHHHHHHHHHHhccchhhHHHHHhHHhhccccCCCHHHHhHHHHHHHHHHHHHHHHHHHHhccCccc Confidence 39999999999999998888888899999999999999999999999999998765444 3333223334322 Q ss_pred ccccchhhHhhhcccccCCCC-ccceeecCCCCCcchHHHHHHHHHHHHHhcCCCccccCCC----CcccccchhhhhHH Q lcl|NC_015285. 128 DDKKFMSMMEDFWLPRREGGR-GTEISTLPGGQNLGELEDVKYFQKKLYKALNVPSSRLETE----TTFNIGRAAEITRD 202 (359) Q Consensus 128 dd~~~mSMlEDywLpRReGgr-gTEIsTLpGgqnLgei~DV~YF~kkLy~aL~VP~SRl~~~----~~~~~g~~~eItRD 202 (359) -.+ ----||-=+.++ +--|+|=.+-.++--|+||...-+.|--||++-.|-|+-- ||. |.+.-+ |- T Consensus 349 ~~~------~~H~LPv~gekq~~~tvDt~~~~A~~~gIEdvM~~~R~LagaLGlD~SMlGwAD~LsGGL--GeGG~f-rt 419 (569) T protein:vir:10 349 PTV------TNTLLPIMGDGKGQMTIDTQTIQADINGIEDILTYMRQLAAALGLDYTLLGWADQMSGGL--GEGGFL-RT 419 (569) T ss_pred ccc------ceeeeeeecCccccccccccccccCcccHHHHHHHHHHHHhhhccchhHhhHHHHhcccc--cccHHH-HH Confidence 111 112256666666 3467766666667789999999999999999999998522 221 221110 00 Q ss_pred hhhHHHHHHHHHHHHHHHHHHHHHHHHHhc--CCCChhHHHHHhhceeeeeeccchHHHHHHHHHHHHHHHHHHHhhhhc Q lcl|NC_015285. 203 EVKFQKFIARLRKRFSELFTDLLKTQLILK--GVMSLEEWEDMKNHIQFDFIADNYFTELKEIEIRNERMNQVAAMDPYV 280 (359) Q Consensus 203 ElKF~KFI~rLr~rFs~if~d~Lk~QLiLk--gI~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~~~~~dp~v 280 (359) -+.=..=-+-||.-.++.|..++-.++-.| ++-+++| ..-+++|..++.=.|-.+.+-..+|+++++-+.-.. T Consensus 420 SaQaa~RS~~iRqa~~e~in~iidiH~~fKYgevf~~~d-----rP~~V~F~s~~tAl~~E~~~n~~~raN~a~i~~Q~l 494 (569) T protein:vir:10 420 AIQAAMRASWIQQGVEEFIQRAIDIHLAFKYGKVYPEGD-----RPYKIEFHSVNTALQQEHNDNRDSQANYATIVTQIL 494 (569) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcCcccCCCC-----cceEEEeccchHHHHHHHHhHHHHHHHHHHHHHHHH Confidence 001111124578888888888888877665 5666665 457899999999999899999999988765441111 Q ss_pred ---------c-hhhhHHHHHHHHhCCCHHHHHHHHHHHHHHHhcCCCCCCcchhhhcC---CCCCcccccccCCCCCcCC Q lcl|NC_015285. 281 ---------G-KYFSVDYMRRQVLKQTEIEIKEIDEQIASEMEAGIIADPMAEMDPAM---AAGGEGAPAAEVDPNAQES 347 (359) Q Consensus 281 ---------G-Ky~S~~~i~k~IL~~tDeeI~e~~kqi~~E~~~~~~~~P~~~~~~~~---~~~~~~~~~~~~~~~~~~~ 347 (359) | .==-..|+.+++|+| |+.+.+ .+-.|. .+.|++|+-.-. -.|+.. .+--+.+.+-+ T Consensus 495 a~l~e~n~Lg~de~~m~y~l~d~~~~-De~~~e---~l~ae~----~akp~DEe~~~~~~~~~~~~~-~~~~~~~~~~~- 564 (569) T protein:vir:10 495 DAVSNNSVLANSDAFKRYLFSDVLEI-DEKISE---ALVNEL----KAKSEDDDHLMDSIIKTPPQE-LAQILESVFKE- 564 (569) T ss_pred HHhhhcccccccHHHHHHHHHHHhhc-chhHHH---HHHhhc----CCCcchhHHHHHHHhcCChHH-HHHHHHHHhhc- Confidence 0 001235677777777 333321 223343 233554431110 001000 00000111100 Q ss_pred CCCCCCCcc Q lcl|NC_015285. 348 SVDPGDVRR 356 (359) Q Consensus 348 ~~~p~~~~~ 356 (359) ||..+ T Consensus 565 ----~~~~~ 569 (569) T protein:vir:10 565 ----GNDND 569 (569) T ss_pred ----cCCCC Confidence 00000 No 37 >protein:vir:80040 Length: 461 # NCBI annotation: gp3 # Family: family:all:297 # MgeID: mge:1876 # MgeName: B054 # Cross-refs: genbank:acc:YP_001468707;genbank:gi:157325287;genbank:GeneID:5601731 Probab=96.36 E-value=0.00071 Score=37.86 Aligned_cols=275 Identities=9% Similarity=0.065 Sum_probs=116.9 Q ss_pred CCCch-------hhH--HHhhhhhhheee------------cccc--------cc--------------ccCCCceeecH Q lcl|NC_015285. 1 MRGVD-------LNQ--QLTQKAAEYFLY------------NPKG--------LK--------------NSTNQGMKITT 37 (359) Q Consensus 1 ~~~~~-------~~~--~~~~~~~e~f~y------------n~~~--------~~--------------~~~~~~v~i~~ 37 (359) ++++. .++ .-++++...=.| ||.+ .. ..+...++|++ T Consensus 122 v~d~~~~~~~~~~pl~~~~~~~~~~l~~~~~~~i~~~~~~~dp~sp~fg~P~~y~i~~~~~~~~~~~~~~~~~~~~~iH~ 201 (461) T protein:vir:80 122 VVSSNREQADLSTAIDPKTIKSIPYINTFNTQKVTQLYLNQDMFSEHFGEVEFFEVNRVSQLGEEILSGTTASTSEQIHR 201 (461) T ss_pred eecCCccccCccCCcccccccceeEEEeccccccchhhhcccCcCcccccceEEEEeccccccccccccccCccceEEcc Confidence 11110 111 111121111011 1111 00 11223467777 Q ss_pred hHhhhhhccccc-CCCCcchhhHHHHHHHHHHHHHHHH--HHHHHHHhcCccceeEeccCC-CCchHHHHHHHHHHHHhh Q lcl|NC_015285. 38 DSVTYCHSGIQD-LNKNMTLSHLHKAIKAVNQLRMIED--SLVIYRLSRAPERRIFYIDVG-NLPKNKAEQYLREVMGRY 113 (359) Q Consensus 38 ~ai~y~hSGl~d-~~~~~i~syL~~Aik~~NqL~m~ED--alVIyR~~RAPeRRvFyIDvG-nlpk~KAeqYl~~iM~ky 113 (359) +=|+.+..+-++ ...| .|.|+++...+..+..... +.++++ +-.+ +|.+|.- .+......+. ...+..+ T Consensus 202 SRii~~~~~~~~~~~~G--~S~le~~~~~l~~~~~~~~~~~~l~~~---~~~~-v~k~~~l~~~~~~~~~~~-~~~~~~~ 274 (461) T protein:vir:80 202 SRIIHEQGLRFEGETKG--RSIFESLYDIITVMDTSLWSVGQILYD---FAFK-VYKTDDIDALNKDDKANL-TAMLDFM 274 (461) T ss_pred ccEEEecCCCCCccccC--cchHHHHHHHHHHHHHHHHHHHHHHHH---hCCC-ceecchHHhhhchHHHHH-HHHHHHh Confidence 765554333322 1122 5677776655554433322 223333 4333 4555421 1111222222 2223322 Q ss_pred cceEEeeCCCCcccccccchhhHhhhcccccCCCCccceeecCCCCCcchHHH-HHHHHHHHHHhcCCCccccCCCC-cc Q lcl|NC_015285. 114 RNKMVYDANTGEIKDDKKFMSMMEDFWLPRREGGRGTEISTLPGGQNLGELED-VKYFQKKLYKALNVPSSRLETET-TF 191 (359) Q Consensus 114 rnklvYD~~TGevkdd~~~mSMlEDywLpRReGgrgTEIsTLpGgqnLgei~D-V~YF~kkLy~aL~VP~SRl~~~~-~~ 191 (359) ++ .+|-+- + + ..-+++++- .+|+-++| +..|...+-.+.+||+.+|-.++ |. T Consensus 275 ~~------~~g~~~-------------~----d-~~e~~e~~~--~~lsgl~~~l~~~~~~iaa~s~iP~t~L~G~s~g~ 328 (461) T protein:vir:80 275 FR------TEALAI-------------I----K-GDEQLTKES--TNVSGMKDLLDYGWDYLAGAVRMPKTVLKGQEAGT 328 (461) T ss_pred cC------CceEEE-------------E----c-CCcceEEEe--cCcCCHHHHHHHHHHHHhhhhcCCeeeeecccCCc Confidence 21 122110 0 0 011122222 23555555 46888899999999999984432 33 Q ss_pred cccchhhhhHHhhhHHHHHHHHHHH-HHHHHHHHHHHHHH----hcCCCChhHHHHHhhceeeeeeccchHHHHHHHHHH Q lcl|NC_015285. 192 NIGRAAEITRDEVKFQKFIARLRKR-FSELFTDLLKTQLI----LKGVMSLEEWEDMKNHIQFDFIADNYFTELKEIEIR 266 (359) Q Consensus 192 ~~g~~~eItRDElKF~KFI~rLr~r-Fs~if~d~Lk~QLi----LkgI~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Eil 266 (359) + .-+.+ |.-.|+.+|.++|.. +..+...+++.=+. +..++.++ ...++|.|..=..-+|...+|++ T Consensus 329 ~-asge~---D~~~yyd~i~~~qe~~l~p~le~l~~~i~~s~~~~~~~~~p~-----~~~~~i~f~~L~~~s~kekAe~~ 399 (461) T protein:vir:80 329 L-TGAQY---DVMNYYARVSSIQENRLRPQLEYLTRLLMWASDDCGPSIDPD-----SFEWAIEFNPLWNLDSKTDAEVR 399 (461) T ss_pred c-ccchH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccCcc-----ccceEEEeCCCCCCCHHHHHHHH Confidence 2 11221 333499999999964 55555555543211 11122221 13466888877777888889999 Q ss_pred HHHHHHHHHhhhhcchhhhHHHHHHHHhC---CC--------HHHHHHHHHHHHHHHhcCCCCCCcchhhhcCC Q lcl|NC_015285. 267 NERMNQVAAMDPYVGKYFSVDYMRRQVLK---QT--------EIEIKEIDEQIASEMEAGIIADPMAEMDPAMA 329 (359) Q Consensus 267 ~~Rl~~~~~~dp~vGKy~S~~~i~k~IL~---~t--------DeeI~e~~kqi~~E~~~~~~~~P~~~~~~~~~ 329 (359) ..+.++++.+-.- -.+|.+-++....+ ++ +.|+++.+++..+ .| +.|..+| T Consensus 400 ~~~a~a~~~~~~~--g~is~~e~r~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~--------~~--~~e~~~g 461 (461) T protein:vir:80 400 KLTAEADQIYIVN--GVLDPDEVKETRFGRFGLENSSKFSGDSAEIDKLAKLVYD--------AY--AKKNADG 461 (461) T ss_pred HHHHHHHHHHHhc--CCCCHHHHHHHHHHhcCCCCCccCCCCCchhhhhhhhccc--------cc--cccCCCC Confidence 9999888765332 13444444432211 10 0011111111000 00 0111111 No 38 >protein:vir:107662 Length: 427 # NCBI annotation: putative portal protein # Family: family:all:297 # MgeID: mge:1518 # MgeName: T1 # Cross-refs: genbank:acc:YP_003893;genbank:gi:45686310;genbank:GeneID:2773002 Probab=96.35 E-value=0.00071 Score=37.84 Aligned_cols=278 Identities=13% Similarity=0.166 Sum_probs=111.9 Q ss_pred CCCchh-hHHH--hhhhhhheee------------cccc--------ccc---cCCCceeecHhHh-hhhhcccccC--- Q lcl|NC_015285. 1 MRGVDL-NQQL--TQKAAEYFLY------------NPKG--------LKN---STNQGMKITTDSV-TYCHSGIQDL--- 50 (359) Q Consensus 1 ~~~~~~-~~~~--~~~~~e~f~y------------n~~~--------~~~---~~~~~v~i~~~ai-~y~hSGl~d~--- 50 (359) ++++.. ...+ -..+..+-++ ||.. ... .+...++|++|=+ +|...-+-+. T Consensus 102 v~d~~~l~~p~~~~g~l~~l~v~d~~~~~~~~~~~dp~s~~fg~P~~y~v~~~~~~~~~~iH~SRli~~~g~~~p~~~~~ 181 (427) T protein:vir:10 102 IKDNRMLTSQAKPGAKLEGVRVYDRFAITVEKRVTNARSPRYGEPEIYKVSPGDNMQPYLIHHSRVFIADGERVAQQARK 181 (427) T ss_pred ecCCCccccccCCCcceeEEEEechhcccccccccCccccccCcceEEEEecCCCCcceEEccccEEEecCCCchhhhcc Confidence 111110 0000 0111111111 1111 111 1223367776643 3332222111 Q ss_pred -CCCcchhhHHHHHHHHHHHHHHHH-----HHHHHHHhcCccceeEec-cCCCC---chHH--HHHHHHHHHHhhcceEE Q lcl|NC_015285. 51 -NKNMTLSHLHKAIKAVNQLRMIED-----SLVIYRLSRAPERRIFYI-DVGNL---PKNK--AEQYLREVMGRYRNKMV 118 (359) Q Consensus 51 -~~~~i~syL~~Aik~~NqL~m~ED-----alVIyR~~RAPeRRvFyI-DvGnl---pk~K--AeqYl~~iM~kyrnklv 118 (359) .+..=.|.|.++ .++.|+-+|. +.++++- -.+ |+.+ +++++ +... +..-+. .+.+.|+ T Consensus 182 ~~~~~G~S~l~~~--~~~~i~~~~~~~~~~~~l~~k~---~~~-v~k~~~l~~~~~~~~~~~~~~~r~~-~~~~~~~--- 251 (427) T protein:vir:10 182 QNQGWGASVLNKS--LIDAICDYDYCESLATQILRRK---QQA-VWKVKGLAEMCDDDDAQYAARLRLA-QVDDNSG--- 251 (427) T ss_pred cCCcccchhhhHH--HHHHHHHHHHHHHHHHHHHHHh---ccc-cccchhHHHHhcCccchHHHHHHHH-HHHHhcC--- Confidence 111112344443 2444444443 3344442 222 3334 33221 1111 111111 1111111 Q ss_pred eeCCCCcccccccchhhHhhhcccccCCCCccceeecCCCCCcchHHH-HHHHHHHHHHhcCCCccccCCC--Ccccccc Q lcl|NC_015285. 119 YDANTGEIKDDKKFMSMMEDFWLPRREGGRGTEISTLPGGQNLGELED-VKYFQKKLYKALNVPSSRLETE--TTFNIGR 195 (359) Q Consensus 119 YD~~TGevkdd~~~mSMlEDywLpRReGgrgTEIsTLpGgqnLgei~D-V~YF~kkLy~aL~VP~SRl~~~--~~~~~g~ 195 (359) .+|-+. + | +.+-+++++. .+|+-++| +..|...+=.+.+||+.||-.+ +|+| +- T Consensus 252 ---~~~~~~-------l--~--------~~~e~~e~~~--~~lsgl~~~~~~~~~~iaaa~~IP~t~L~G~sp~Gln-st 308 (427) T protein:vir:10 252 ---VGRAIG-------I--D--------AETEEYDVLN--SDISGVPEFLSSKMDRIVSLSGIHEIIIKNKNVGGVS-AS 308 (427) T ss_pred ---ccccee-------e--e--------cCCCceeEEe--cccCChHHHHHHHHHHHHhhhCCCeeeeccCCccccc-cc Confidence 111110 0 0 1111233331 23444555 4789999999999999999443 5665 22 Q ss_pred hhhhhHHhhhHHHHHHHHHHH-HHHHHHHHHHHHHHhcCCCChhHHHHHhhceeeeeeccchHHHHHHHHHHHHHHHHHH Q lcl|NC_015285. 196 AAEITRDEVKFQKFIARLRKR-FSELFTDLLKTQLILKGVMSLEEWEDMKNHIQFDFIADNYFTELKEIEIRNERMNQVA 274 (359) Q Consensus 196 ~~eItRDElKF~KFI~rLr~r-Fs~if~d~Lk~QLiLkgI~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~~~ 274 (359) +.+ |.-.|+.+|+.+|.. +..+...+++ +|. -. +.++|.|..-..=+|...+||...+.++++ T Consensus 309 gd~---D~~nyyd~i~~~Qe~~l~p~l~~l~~--~i~----~s-------~~~~~~f~pL~~~s~kEkaei~~~~a~a~~ 372 (427) T protein:vir:10 309 QNT---ALETFYKLVDRKREEDYRPLLEFLLP--FIV----DE-------EEWSIEFEPLSVPSKKEESEITKNNVESVT 372 (427) T ss_pred hhH---HHHHHHHHHHHHHHHHHHHHHHHHHH--Hhh----cC-------CCcEEEeCCCCCCCHHHHHHHHHHHHHHHH Confidence 222 334499999999954 4444433332 222 12 256799998888899999999999998887 Q ss_pred HhhhhcchhhhHHHHHHHHhCCCHHHHHHHHHHHHHHHhcCC--CCCCcchhhhcCCCCCcccccccCCCCCcCCCCCCC Q lcl|NC_015285. 275 AMDPYVGKYFSVDYMRRQVLKQTEIEIKEIDEQIASEMEAGI--IADPMAEMDPAMAAGGEGAPAAEVDPNAQESSVDPG 352 (359) Q Consensus 275 ~~dp~vGKy~S~~~i~k~IL~~tDeeI~e~~kqi~~E~~~~~--~~~P~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~ 352 (359) .+-.-. .++ .+|+-+....+-.+ .++ +.+...+..+...- + +| .+..-+++. T Consensus 373 ~~~~~g--vi~------------~~e~r~~L~~~~~~--~~~~~~~~~~~e~~~~~~e----~-----~p-~~~e~~~d~ 426 (427) T protein:vir:10 373 KAITEQ--IID------------LEEARDTLRSIAPE--FKLKDGNNINIREPEETTE----P-----EP-GLGEKLEDE 426 (427) T ss_pred HHHhcC--CCC------------HHHHHHHHHhhhcc--ccCCCCccccccccchhcC----C-----CC-CCCCCCCCC Confidence 764431 233 34333222222111 111 11111111000000 0 00 011111222 Q ss_pred C Q lcl|NC_015285. 353 D 353 (359) Q Consensus 353 ~ 353 (359) | T Consensus 427 ~ 427 (427) T protein:vir:10 427 N 427 (427) T ss_pred C Confidence 2 No 39 >protein:vir:8184 Length: 474 # NCBI annotation: gp4 # Family: family:all:524 # MgeID: mge:153 # MgeName: Che9d # Cross-refs: genbank:acc:NP_817977;genbank:gi:29566411;genbank:GeneID:2700965 Probab=96.13 E-value=0.00097 Score=37.11 Aligned_cols=282 Identities=11% Similarity=0.090 Sum_probs=147.0 Q ss_pred CCCchhhH-----HH-----hhhhhhheeecccccc-------------c--cCCCceeecHhHhhhhhcccccC-CCC- Q lcl|NC_015285. 1 MRGVDLNQ-----QL-----TQKAAEYFLYNPKGLK-------------N--STNQGMKITTDSVTYCHSGIQDL-NKN- 53 (359) Q Consensus 1 ~~~~~~~~-----~~-----~~~~~e~f~yn~~~~~-------------~--~~~~~v~i~~~ai~y~hSGl~d~-~~~- 53 (359) +-|+.++. .+ -+....+.+|.|.... . .+..| +| .|.|+|.-=.+. .+. T Consensus 149 ~~D~~~~~~~~al~~~~~~~~g~~~~~~ly~~~~~~~~~~~~~~~~w~~~~~~~~~g--vP--vV~~~n~~~~~~~~G~s 224 (474) T protein:vir:81 149 EWNRRRRGLNNLLSIIDKDKEGKVLSLALYLDNETVTAQRDKATLKWQVDRDEHVYG--VP--AQVLPYKPAPKRPFGQS 224 (474) T ss_pred EEeCCCCcceeeeEEEEEcCCCcEEEEEEEeCCcEEEEEEcCccceeeeccCCCCCC--cc--eEEecccccccCcCCcc Confidence 11111111 00 0011233344332211 0 01112 33 577777632221 221 Q ss_pred ----cchhhHHHHHHHHHHHHHHHHHHHHHHHhcCccceeEeccCCCC------chHHHHHHHHHHHHhhcceEEeeCCC Q lcl|NC_015285. 54 ----MTLSHLHKAIKAVNQLRMIEDSLVIYRLSRAPERRIFYIDVGNL------PKNKAEQYLREVMGRYRNKMVYDANT 123 (359) Q Consensus 54 ----~i~syL~~Aik~~NqL~m~EDalVIyR~~RAPeRRvFyIDvGnl------pk~KAeqYl~~iM~kyrnklvYD~~T 123 (359) -+++..+++- +.+.+.++.=...=.|+|-|+=++-... |..+-.+|+- T Consensus 225 ~i~e~v~~l~da~~------r~~~~~~~~~e~~a~pqr~i~G~~~~~~~d~d~~~~~~~~~~~~---------------- 282 (474) T protein:vir:81 225 RITKPMMGLQDAGV------RELARREGHMDVFSYPEFWLLGADESALKNADGTIKSVWEARLG---------------- 282 (474) T ss_pred ccchhHHHHHHHHH------HHHHHHHHHHHHhcchhheeecCChhhcccccccccchhhhhHH---------------- Confidence 2344555544 4567788888888888888864332111 1111111221 Q ss_pred CcccccccchhhHhhhc-ccccCC-C----CccceeecCCCCCcchH-HHHHHHHHHHHHhcCCCccccCCCCcccccch Q lcl|NC_015285. 124 GEIKDDKKFMSMMEDFW-LPRREG-G----RGTEISTLPGGQNLGEL-EDVKYFQKKLYKALNVPSSRLETETTFNIGRA 196 (359) Q Consensus 124 Gevkdd~~~mSMlEDyw-LpRReG-g----rgTEIsTLpGgqnLgei-~DV~YF~kkLy~aL~VP~SRl~~~~~~~~g~~ 196 (359) ..| +|.-+. . .+++|..+|+.. |.-. +-++=.-..+-...++|.+-|+-.+.-|-..+ T Consensus 283 --------------~i~~~~~d~d~~~~~~~~~~~~q~~~a~-l~~~~~~l~~~~~~~a~~t~iP~~~lG~~~~~np~Sa 347 (474) T protein:vir:81 283 --------------RIKGLPDDADADIPQLARADVKQFPAAS-PDAHWSDINGLAKLFAREASLPDTAVAISGLSNPTSA 347 (474) T ss_pred --------------HHhcCCCcccccccccccccccccCCCC-hhHHHHHHHHHHHHHHhhhCCCHHHhcccccccccHH Confidence 222 333222 1 235667777654 4432 22333344555567999888753222233335 Q ss_pred hhhhHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCChhHHHHHhhceeeeeeccchHHHHHHHHHHHHHHHHHHHh Q lcl|NC_015285. 197 AEITRDEVKFQKFIARLRKRFSELFTDLLKTQLILKGVMSLEEWEDMKNHIQFDFIADNYFTELKEIEIRNERMNQVAAM 276 (359) Q Consensus 197 ~eItRDElKF~KFI~rLr~rFs~if~d~Lk~QLiLkgI~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~~~~~ 276 (359) ..|.-.|....+-+.+.|+.|..=+.+++|.-+.+.|-...++|..--..+.+.|..-.+=+ +.++.+.+..+ T Consensus 348 eAi~a~~~~l~~kae~k~~~fg~~l~~~~rla~~i~~~~~~~~~~~~~~~~~v~W~d~~~~s-------~a~~aDa~~Kl 420 (474) T protein:vir:81 348 ESYDASQYELIAEAEGAVDDFTPALRKAFIRALAMKNKVAIDEIPDEWKSIDAKWRDPRYLS-------KSAQADAGMKQ 420 (474) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCccccchhhccceeEecCCCccC-------HHHHHHHHHHH Confidence 56777788888999999999999999999999999887766666554556777774332221 24455555555 Q ss_pred hhhcchhhhHHHHHHHHhCCCHHHHHHHHHHHHHHHhcCCCCCCcchhhhcCCCCCcccccccCC Q lcl|NC_015285. 277 DPYVGKYFSVDYMRRQVLKQTEIEIKEIDEQIASEMEAGIIADPMAEMDPAMAAGGEGAPAAEVD 341 (359) Q Consensus 277 dp~vGKy~S~~~i~k~IL~~tDeeI~e~~kqi~~E~~~~~~~~P~~~~~~~~~~~~~~~~~~~~~ 341 (359) ..- |.-+....+..++|++|++||+.....++++...+.+ +.+- +.+.+++. .| T Consensus 421 ~~a-~~~~~~~~~~~~~lg~t~~~i~~~~~~~~~~~~~~~~----~~l~-~~~~~~~~-----aq 474 (474) T protein:vir:81 421 LAA-VPWLAETEVGLELIGLTPQQARRAMADKRRVQGRGTL----QALI-DRSNNGAT-----AQ 474 (474) T ss_pred Hhc-ccCCCcHHHHHhhcCCCHHHHHHHHHHHHHHhHHHHH----HHHH-hcCCCCCC-----CC Confidence 432 3344555667788999999998777666665433322 2111 11111111 11 No 40 >protein:vir:78537 Length: 480 # NCBI annotation: gp11 # Family: family:all:524 # MgeID: mge:1853 # MgeName: U2 # Cross-refs: genbank:acc:YP_001491582;genbank:gi:157786405;genbank:GeneID:5625689 Probab=95.76 E-value=0.0015 Score=36.05 Aligned_cols=305 Identities=14% Similarity=0.150 Sum_probs=132.4 Q ss_pred CCCchh------hHHHh------hhhhhheeeccccccc---cCC--CceeecHhHhhhhhc-ccccC---CCC------ Q lcl|NC_015285. 1 MRGVDL------NQQLT------QKAAEYFLYNPKGLKN---STN--QGMKITTDSVTYCHS-GIQDL---NKN------ 53 (359) Q Consensus 1 ~~~~~~------~~~~~------~~~~e~f~yn~~~~~~---~~~--~~v~i~~~ai~y~hS-Gl~d~---~~~------ 53 (359) +-|+.. .+.++ +.+..+.+|.+..... .++ .+.....+ ++-|. |-+++ .++ T Consensus 141 i~D~~~~~~~~~~i~~~~~~d~~~~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~g~vPvv~f~n~~~~~~~ 218 (480) T protein:vir:78 141 ELDPRNTRRVTRAVRLYTTRDDVAVPDRATLYLPDETVPLRRNGGLNDQWVVDGD--VIKHGLGVVPVVPLTNDPRLGNR 218 (480) T ss_pred EEcCCCccceEEEEEEEEeecCCcceEEEEEEeCCeEEEEEecCCCccccccccc--ccccCCCCcceEEeecccccCCc Confidence 111110 01100 1112233444432211 000 01011111 11121 22211 011 Q ss_pred cchhhHHHH----HHHHHHHHHHHHHHHHHHHhcCccceeEeccCCCCchHHHHHHHHHHHHhhcceEEeeCCCCccccc Q lcl|NC_015285. 54 MTLSHLHKA----IKAVNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKMVYDANTGEIKDD 129 (359) Q Consensus 54 ~i~syL~~A----ik~~NqL~m~EDalVIyR~~RAPeRRvFyIDvGnlpk~KAeqYl~~iM~kyrnklvYD~~TGevkdd 129 (359) ...|=|.+. +..+| +++-+.+++-....-|.|-+.=.+....+..+ .+ T Consensus 219 ~G~sdi~~~i~~l~Da~~--~~~s~~~~~~~~~a~p~~~i~G~~~~~~~~~~---------------------~~----- 270 (480) T protein:vir:78 219 YGRSEISPELRKVTDAAS--RTLMNLQSASQILGTPLRVISGVTTDELTNDG---------------------EN----- 270 (480) T ss_pred cCccchhHHHHHHHHHHH--HHHHHHHHHHHhhcchhhhhhCCCcccccccc---------------------cc----- Confidence 112223332 23333 34556667767777777755422222111000 00 Q ss_pred ccc-hhhHhhhcccccCCCCccceeecCCCCCcch-HHHHHHHHHHHHHhcCCCccccCCCCcccccchhhhhHHhhhHH Q lcl|NC_015285. 130 KKF-MSMMEDFWLPRREGGRGTEISTLPGGQNLGE-LEDVKYFQKKLYKALNVPSSRLETETTFNIGRAAEITRDEVKFQ 207 (359) Q Consensus 130 ~~~-mSMlEDywLpRReGgrgTEIsTLpGgqnLge-i~DV~YF~kkLy~aL~VP~SRl~~~~~~~~g~~~eItRDElKF~ 207 (359) .++ ..+-...|++ +-+.++-++++.+ ++- ++-++-....++..-++|..-|+..+. |-..+..|.--+...- T Consensus 271 ~~~~~~~~~~~~~~----~~~~~~~~~~~~~-~~~~~~~l~~~i~~~~~~~~~p~~~fg~~~~-n~~Sg~Al~~~~~~l~ 344 (480) T protein:vir:78 271 TTLDIYYGRILTLA----SEAAKISEFKAAE-LRNFAEEMEVFRKEAASITGLPPQYLSSSSE-NPASAEAIIATDSRIV 344 (480) T ss_pred chhhhhhhhhccCC----CCCceEEecCccC-HHHHHHHHHHHHHHHhcccCCCHHHhccccC-chhHHHHHHHHHHHHH Confidence 001 1111223443 2346777877754 333 333566666677777888777754221 2112333554555677 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhcCCCChhHHHHHhhceeeeeeccchHHHHHHHHHHHHHHHHHHHhhhhcchhhhHH Q lcl|NC_015285. 208 KFIARLRKRFSELFTDLLKTQLILKGVMSLEEWEDMKNHIQFDFIADNYFTELKEIEIRNERMNQVAAMDPYVGKYFSVD 287 (359) Q Consensus 208 KFI~rLr~rFs~if~d~Lk~QLiLkgI~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~~~~~dp~vGKy~S~~ 287 (359) .-+.+.|+.|..-+.+.++.-+.+.|.--..+| ..|.+.|..-..=+. .+.++.+.++-.-.+..+|.+ T Consensus 345 ~k~~~~~~~f~~~l~~~~rl~~~~~~~~~~~~~----~~i~v~w~~~~~~s~-------~~~ad~~~kl~~~g~~~~s~e 413 (480) T protein:vir:78 345 KMAERKGRIFGGAWERAMRIAMQIMGREVTEEY----TRLETVWRDPSTPTV-------AAKADAVSKLYANGQGPIPKE 413 (480) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHcCCCccccc----eeeeEEecCCCCCCH-------HHHHHHHHHHHHhcccCCCHH Confidence 788999999999999999987777775433444 356788854332222 234555555544434556888 Q ss_pred HHHHHHhCCCHHHHHHHHHHHHHHHhc--CCCCCCcchhhhcCCCCCcccccccCCCCCcCCCCCCCCCccCC Q lcl|NC_015285. 288 YMRRQVLKQTEIEIKEIDEQIASEMEA--GIIADPMAEMDPAMAAGGEGAPAAEVDPNAQESSVDPGDVRRGE 358 (359) Q Consensus 288 ~i~k~IL~~tDeeI~e~~kqi~~E~~~--~~~~~P~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~ 358 (359) ++.. +|++++++++++++..+++... +.+..|..+....++.|+ +++.+++..+++.-=| -..+. T Consensus 414 t~~~-~lg~~~d~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~-~~~~~ 480 (480) T protein:vir:78 414 QARI-DLGYTATQREQMRDWDKQETEDMIDTLYSTTKAQADATPKPT----VTETKTETQTSPSGFN-RTKTR 480 (480) T ss_pred HHHh-cCCCCHhHHHHHHHHHHHHHHHHHHHhhccccCCCccccCCC----CCCCCCccCCCcccCC-CcCCC Confidence 8775 5899999999876544444221 111122111000111111 1111111111110000 01111 No 41 >protein:vir:79703 Length: 505 # NCBI annotation: minor structural protein gp61 # Family: family:all:898 # MgeID: mge:1873 # MgeName: LL-H # Cross-refs: genbank:acc:YP_001285880;genbank:gi:148750838;genbank:GeneID:5220405 Probab=95.74 E-value=0.0015 Score=36.02 Aligned_cols=293 Identities=12% Similarity=0.143 Sum_probs=135.2 Q ss_pred CCCchhhHH-Hhhhhhhheeecc-ccc-----c---ccCCCceeecHhHh----------hhhhcc--cc---c------ Q lcl|NC_015285. 1 MRGVDLNQQ-LTQKAAEYFLYNP-KGL-----K---NSTNQGMKITTDSV----------TYCHSG--IQ---D------ 49 (359) Q Consensus 1 ~~~~~~~~~-~~~~~~e~f~yn~-~~~-----~---~~~~~~v~i~~~ai----------~y~hSG--l~---d------ 49 (359) .+....+.. +|+ ..||+..+- .+. + ++..-|..+|-+.+ +|.+.. ++ . T Consensus 172 ~~~~~~~~~~~yt-~lE~h~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~~~~~l~~~~~~~g~~~p~f~~~~~~~~N~ 250 (505) T protein:vir:79 172 TTEVENHRTIYYT-LLEFHQWDHGDYVITNELYRSEAAETVGINVPLNSLEQYEGLEPQVKITGLKHPLFAFYRNKGANN 250 (505) T ss_pred EEEecCCcceEEE-EEEEEEecCceEEEEEEEEecCCCCccCcccchhhcccccccCcceeecCCCcceEEEecCCcccc Confidence 111111111 121 223322111 010 1 11122333333332 221110 11 0 Q ss_pred --CCCCcchhhHHHHHHHHHHHHHHHHHHHHHHHhcCccceeEeccCCCCchHHHHHHHHHHHHhhcceEEeeCCCCccc Q lcl|NC_015285. 50 --LNKNMTLSHLHKAIKAVNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKMVYDANTGEIK 127 (359) Q Consensus 50 --~~~~~i~syL~~Aik~~NqL~m~EDalVIyR~~RAPeRRvFyIDvGnlpk~KAeqYl~~iM~kyrnklvYD~~TGevk 127 (359) ..+..=+|-++.|.-.+..|-..=+ -+.|.+|.=.+|||. |-.=|++...-....... ..-++| . T Consensus 251 ~~~~splG~S~~~~~~~~id~lD~~~s--~~~~e~~~g~~~i~v-~~~~l~~~~~~~~~~~~~----~~~~fd------~ 317 (505) T protein:vir:79 251 KNFTSPMGMSLIDNSYTVIDAINRTHD--QFVDEVKKGQRRLIV-PAEWLKTGSSYGGQASET----HPPMFD------P 317 (505) T ss_pred cccCCccCCchhhhhHHHHHHHHHHHH--HHHHHHHhcccceee-chHHhcccCCCCcccccc----cccCCC------c Confidence 0111125777777766666654333 455777777777765 211110000000000000 000011 1 Q ss_pred ccccchhhHhhhcccccCCCCccceeecCCCCCcch-HHHHHHHHHHHHHhcCCCccccCCCCcccccchhhhhHHhhhH Q lcl|NC_015285. 128 DDKKFMSMMEDFWLPRREGGRGTEISTLPGGQNLGE-LEDVKYFQKKLYKALNVPSSRLETETTFNIGRAAEITRDEVKF 206 (359) Q Consensus 128 dd~~~mSMlEDywLpRReGgrgTEIsTLpGgqnLge-i~DV~YF~kkLy~aL~VP~SRl~~~~~~~~g~~~eItRDElKF 206 (359) +++.+.++.-| +| +.-|+++.+.---.+ .+-+..+.+.+....+++-+-|+.++. ....++||....-.- T Consensus 318 ~~~~y~~~~~~------~~--~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~~~~~~~-~~~TAtei~s~~~~l 388 (505) T protein:vir:79 318 DETVYQAMYGD------AS--EVGFHDATSPIRVADYQATMDFFLREFENQTGLSQGTFTTSPS-GIQTATEVVTNNSQT 388 (505) T ss_pred cceeeeeccCC------CC--CCceEEecccCCHHHHHHHHHHHHHHHHHHhCCChhhcCCCcc-ccchHHHHHHHHhHH Confidence 22333333211 12 223777776432222 345777888888999999999876643 234577777666566 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhcCCC---C----hhHHHHHhhceeeeeeccchHHHHHHHHHHHHHHHHHHHhhhh Q lcl|NC_015285. 207 QKFIARLRKRFSELFTDLLKTQLILKGVM---S----LEEWEDMKNHIQFDFIADNYFTELKEIEIRNERMNQVAAMDPY 279 (359) Q Consensus 207 ~KFI~rLr~rFs~if~d~Lk~QLiLkgI~---t----~eew~~~~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~~~~~dp~ 279 (359) ..-+.+.|+.|...+.++++.=|-+..+. + ...++.-...+.++|...-.-. +++++- ..+.+++ . T Consensus 389 ~~t~~~~~~~~~~al~~li~~i~~~~~~~~~~~~g~~~~~~~~~~~~i~v~f~d~i~~d--~~~~~~-~~~~~v~--~-- 461 (505) T protein:vir:79 389 YQTRSSYITQVEKTIKALTYAILELASVPSFYADGQARWTGDVDSLDITINFNDGVFVD--QESKRA-ADLQAVQ--A-- 461 (505) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccCCCCceeEEEEeCCCCCCC--HHHHHH-HHHHHHH--c-- Confidence 66777778778777777777655433221 1 0011111134666666332222 222221 1122221 1 Q ss_pred cchhhhHHHHHHHHhCCCHHHHHHHHHHHHHHHhcCCCCCCcchhhhcCCCCCc Q lcl|NC_015285. 280 VGKYFSVDYMRRQVLKQTEIEIKEIDEQIASEMEAGIIADPMAEMDPAMAAGGE 333 (359) Q Consensus 280 vGKy~S~~~i~k~IL~~tDeeI~e~~kqi~~E~~~~~~~~P~~~~~~~~~~~~~ 333 (359) | .+|.++++.+..+.||+|.+++-++|++|... ..|+|+ .++|+ T Consensus 462 -G-i~s~e~~l~~~~~~~eeea~~el~ri~~E~~~-~~p~~~-------~~gg~ 505 (505) T protein:vir:79 462 -Q-VMPKKQFLMRNYGLDEEEADEWLAQIDAENST-AEPEFN-------QFGGD 505 (505) T ss_pred -C-CCCHHHHHHhcCCCChHHHHHHHHHHHHhccc-cCCCch-------hccCC Confidence 2 47888887888999999999999999998543 122222 22333 No 42 >protein:vir:98444 Length: 434 # NCBI annotation: hypothetical protein # Family: family:all:5096 # MgeID: mge:1589 # MgeName: VWB # Cross-refs: genbank:acc:NP_958276;genbank:gi:41057250;genbank:GeneID:2732828 Probab=95.70 E-value=0.0016 Score=35.91 Aligned_cols=292 Identities=14% Similarity=0.138 Sum_probs=132.7 Q ss_pred CCC-----chhhHHHh------hhhhhheeecccccc-----------ccCCCce-ee--cH---------hHhhhhhcc Q lcl|NC_015285. 1 MRG-----VDLNQQLT------QKAAEYFLYNPKGLK-----------NSTNQGM-KI--TT---------DSVTYCHSG 46 (359) Q Consensus 1 ~~~-----~~~~~~~~------~~~~e~f~yn~~~~~-----------~~~~~~v-~i--~~---------~ai~y~hSG 46 (359) +-| ...++.++ +.....|+|+....+ ......+ .. +. -.|.|++.- T Consensus 107 i~D~~~~~~~~ai~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~h~~g~vPvv~f~N~~ 186 (434) T protein:vir:98 107 EYDPETGEPLVGLKVWHNDIDGFGYARVFFDDTSFPYRTRERTGARLPWGPDSWVYTGTADSGDVHDLGGMQLVEFARMP 186 (434) T ss_pred EEeCCCCceEEEEEEEEeccCCceEEEEEEeCcEEEEEEeeccccccccccccceecccccccccCCCCccceEEeccCC Confidence 111 11111111 011122222211110 0000000 00 00 012244432 Q ss_pred cccCCCCcchhhHHHHHHHHHHH-HHHHHHHHHHHHhcCccceeEeccCCCCchHHHHHHHHHHHHhhcceEEeeCCCCc Q lcl|NC_015285. 47 IQDLNKNMTLSHLHKAIKAVNQL-RMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKMVYDANTGE 125 (359) Q Consensus 47 l~d~~~~~i~syL~~Aik~~NqL-~m~EDalVIyR~~RAPeRRvFyIDvGnlpk~KAeqYl~~iM~kyrnklvYD~~TGe 125 (359) -.+. .-.|=++..+....-+ +++-+.+++-+.+-.|.|-+.=.+ ++. .-|...+. T Consensus 187 ~~~~---~g~sd~e~vi~liDa~~~~~s~~~~~~~~~a~p~~~i~G~~---~~~------------------~~~~~~~~ 242 (434) T protein:vir:98 187 DLGE---DPEPEFAGVLDIQDRVNLGILNRMAASRFSGFRQKWIKGHK---FAK------------------RTDPATGM 242 (434) T ss_pred CcCc---CCcchhhhHHHHHHHHHHHHHHHHHHHHHhcchhhhhcCCC---ccc------------------cccccccc Confidence 1111 1223333333222222 235566666676666766553111 110 00222233 Q ss_pred ccccccchhhHhhhcccccCCCCccceeecCCCCCcchH-HHHHHHHHHHHHhcCCCccccCCCCcccccchhhhhHHhh Q lcl|NC_015285. 126 IKDDKKFMSMMEDFWLPRREGGRGTEISTLPGGQNLGEL-EDVKYFQKKLYKALNVPSSRLETETTFNIGRAAEITRDEV 204 (359) Q Consensus 126 vkdd~~~mSMlEDywLpRReGgrgTEIsTLpGgqnLgei-~DV~YF~kkLy~aL~VP~SRl~~~~~~~~g~~~eItRDEl 204 (359) +..++.+..-....|+.- +-++++..+++.. ++.. +-++-.-..+....++|.+-|..+.+ | -.+..|.-.+. T Consensus 243 ~~~~~~~~~~~~~i~~~~---~~~~~~~q~~~~~-~~~~~~~l~~~i~~~~~~~~~p~~~~~~~~~-n-~Sg~Al~~~~~ 316 (434) T protein:vir:98 243 TVVDQPFVPSPSAVWASE---GENTQFGQLDATD-LSGFLKEHASDVRDMLTISQTPTYLYATDLV-N-ISADTIGALDI 316 (434) T ss_pred chhhhhhhccccccccCC---CCCceEEEecCcc-hHHHHHHHHHHHHHHhcccCCCHHHhccccC-C-hHHHHHHHHHH Confidence 322222322223346543 2346677787654 3333 33566677888889999888863211 1 12334555555 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCChhHHHHHhhceeeeeeccchHHHHHHHHHHHHHHHHHHHhhhhcchhh Q lcl|NC_015285. 205 KFQKFIARLRKRFSELFTDLLKTQLILKGVMSLEEWEDMKNHIQFDFIADNYFTELKEIEIRNERMNQVAAMDPYVGKYF 284 (359) Q Consensus 205 KF~KFI~rLr~rFs~if~d~Lk~QLiLkgI~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~~~~~dp~vGKy~ 284 (359) ....-+.+.|+.|..-+.++++.-+.+.|+ +. +| ..+++.|..-..=+. .+..+++..+..- | + T Consensus 317 ~l~~k~~~k~~~f~~~l~~~~rl~~~~~g~-~~-~~----~~~~v~w~~~~~~s~-------~~~ada~~kl~~~-g--~ 380 (434) T protein:vir:98 317 LHVAKVREHIASFSEGLESVLALAAAQAGV-PE-DY----TEAEVRWANPAHVTM-------AVKADAATKLKSI-G--Y 380 (434) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhcCC-Ch-hh----eeeeEEecCCCCCCH-------HHHHHHHHHHHhc-C--C Confidence 677778888999999999999887777775 32 22 247788865443322 3455555555432 2 5 Q ss_pred hHHHHHHHHhCCCHHHHHHHHHHHHHHHhcCCCCCCcchhhhcCCCCCcccccccCCCCCcCCCCCCCCCccC Q lcl|NC_015285. 285 SVDYMRRQVLKQTEIEIKEIDEQIASEMEAGIIADPMAEMDPAMAAGGEGAPAAEVDPNAQESSVDPGDVRRG 357 (359) Q Consensus 285 S~~~i~k~IL~~tDeeI~e~~kqi~~E~~~~~~~~P~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~ 357 (359) |.++++ +.|+++++||+++.++.+++........|. .|.++ +| .+|. -|++-+| T Consensus 381 ~~e~~~-~~lg~~~~e~~r~~~e~~~~~~~~~~~~~~------~~~~~----~g-~~~~-------~~~~~dg 434 (434) T protein:vir:98 381 PLDVIA-EELDESPARVRRIVAGAASQALLAASLLPA------PGAPS----AG-NVPD-------SGGAVDG 434 (434) T ss_pred cHHHHH-HhCCCCHHHHHHHHHHHHHHHHHHHhhhcc------CCCCC----CC-CCCc-------ccCCCCC Confidence 777776 558999999987776655543322111111 11110 01 0110 1222223 No 43 >protein:vir:10321 Length: 495 # NCBI annotation: ORF23 # Family: family:all:47 # MgeID: mge:182 # MgeName: VHML # Cross-refs: genbank:acc:NP_758916;genbank:gi:27311190;genbank:GeneID:956137 Probab=95.51 E-value=0.0019 Score=35.46 Aligned_cols=306 Identities=12% Similarity=0.093 Sum_probs=143.3 Q ss_pred CCCchhhHHHhhh--------hhhheeec--ccc--ccccCCCceeecHhHhhhhhcccccCCCCcchhhHHHHHHHHHH Q lcl|NC_015285. 1 MRGVDLNQQLTQK--------AAEYFLYN--PKG--LKNSTNQGMKITTDSVTYCHSGIQDLNKNMTLSHLHKAIKAVNQ 68 (359) Q Consensus 1 ~~~~~~~~~~~~~--------~~e~f~yn--~~~--~~~~~~~~v~i~~~ai~y~hSGl~d~~~~~i~syL~~Aik~~Nq 68 (359) .....+|..|+.| -.-|+++. |.. ....+..-++||.+-|..++- .-..-..- +|.|+..+ .+++ T Consensus 166 ~~~~~~g~~i~~GIe~d~~Gr~vaY~i~~~hpgd~~~~~~~~~~~rvpA~~vlH~f~-~r~gQ~RG-is~la~i~-~l~~ 242 (495) T protein:vir:10 166 DETLPSGGYVKGGIRFSNGGKRKAYCFYRNHPAESSLIGDPVDTVWIKAEHVLHVTV-LTVRSDAG-APWFQLLL-RLNE 242 (495) T ss_pred CCCCCCCCEEEeceEECCCCceEEEEEeecCCCcccccccccceeeechhheEeccc-cCCCcccC-cchhHHHH-HHHH Confidence 1112223333333 34466653 321 112334458999988876653 32222222 47998655 5899 Q ss_pred HHHHHHHHHHHHHhcCccceeEeccCCCCchHHHHHHHHHHHHhhcceEEeeCCCCcccccccchhhHhhhcccccCCCC Q lcl|NC_015285. 69 LRMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKMVYDANTGEIKDDKKFMSMMEDFWLPRREGGR 148 (359) Q Consensus 69 L~m~EDalVIyR~~RAPeRRvFyIDvGnlpk~KAeqYl~~iM~kyrnklvYD~~TGevkdd~~~mSMlEDywLpRReGgr 148 (359) |.-.+||..+-...-|-. +.+|-..+ |..-+.+-.. ..++-.+.....++ +.==++. =.- T Consensus 243 l~~y~dael~~a~i~A~~--~~fi~~~~-~~~~~~~~~~--------------~~~~~~~~~~~~~l-~pG~i~~--L~p 302 (495) T protein:vir:10 243 LDQYEDAELVRKKTAALF--AAFIQEAT-ADSTGGPTIG--------------QPKRSKGGKRITGL-NPGTLQY--LQP 302 (495) T ss_pred hhHHHHHHHHHHHHhhhh--eeeeecCC-CccccccccC--------------ccccccCcccceec-CCceeee--cCC Confidence 999999999999888855 32332211 1111100000 00111111111110 0000000 013 Q ss_pred ccceeecCCCCCcchHHH-HHHHHHHHHHhcCCCccccCCC-CcccccchhhhhHHhhhHHHHHHHHHHH-HHHHHHH-- Q lcl|NC_015285. 149 GTEISTLPGGQNLGELED-VKYFQKKLYKALNVPSSRLETE-TTFNIGRAAEITRDEVKFQKFIARLRKR-FSELFTD-- 223 (359) Q Consensus 149 gTEIsTLpGgqnLgei~D-V~YF~kkLy~aL~VP~SRl~~~-~~~~~g~~~eItRDElKF~KFI~rLr~r-Fs~if~d-- 223 (359) |.+|+.+.....-+..++ ++...+.+=.+|+||-+-|..+ ++.|+ |.+.-.-+.|-+.+.++|.+ |..-|.. T Consensus 303 Ge~i~~~~p~~p~~~~~~f~~~~lr~iaaglGi~Ye~ltgD~s~~nY---SS~R~~~~e~~r~~~~~q~~~~~~~~~~pi 379 (495) T protein:vir:10 303 GQEVKFSNPADVGTTYEPWLRYQLLSIAKGYGITYEMLTGDLRGVNY---SSIRAGLLEFRRLCQQVQHHMIIHQFCRPV 379 (495) T ss_pred CCeeeeeCCCCCCCCHHHHHHHHHHHHHhhcCCCHHHHhcccccccH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 445555555444444444 5566666778899999988665 34554 23344555699999999875 5544444 Q ss_pred ---HHHHHHHhcCCCC-hhHHHHHhhceeeee--eccchHHHHHHHHHHHHHHHHHHHhhhhcchhhhHHHHHHHHhCCC Q lcl|NC_015285. 224 ---LLKTQLILKGVMS-LEEWEDMKNHIQFDF--IADNYFTELKEIEIRNERMNQVAAMDPYVGKYFSVDYMRRQVLKQT 297 (359) Q Consensus 224 ---~Lk~QLiLkgI~t-~eew~~~~~~I~~~f--~~Dn~f~ElKe~Eil~~Rl~~~~~~dp~vGKy~S~~~i~k~IL~~t 297 (359) .|+ ..+|.|.++ +.-|+.........| -.--+.-.+||+.-...+++ .-.-|.+-+..+ .+.. T Consensus 380 ~~~~l~-~a~l~G~i~~p~~~~~~~~~~~~~w~~p~~~~vDP~Ke~~A~~~~i~---------~G~~s~~~~~a~-~G~D 448 (495) T protein:vir:10 380 GRWFMD-FAVASGAVVIPDYLQRRRYYNRVSWRTPRWEEVDPLKKHLADLGDVR---------AGFAPISDKQAE-RGYD 448 (495) T ss_pred HHHHHH-HHHHcCCCCCCCchhhhHhhhccccccCCccccChHHHHHHHHHHHH---------cCCCCHHHHHHH-cCCC Confidence 344 456677654 444443333333344 44445566777766655544 234466666665 4655 Q ss_pred HHHHH-HHHHHHHHHHhcCCCCCCcchhhhcCCCCCcccccccCCCCCcCCCCCCCCCccCC Q lcl|NC_015285. 298 EIEIK-EIDEQIASEMEAGIIADPMAEMDPAMAAGGEGAPAAEVDPNAQESSVDPGDVRRGE 358 (359) Q Consensus 298 DeeI~-e~~kqi~~E~~~~~~~~P~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~ 358 (359) -+|+- |.+...+...+.|+..+.+. . ..++. +..+.+..+.+ -..| T Consensus 449 ~~~v~~q~a~e~~~~~~~Gl~~~~~p----~-----~~~~~-----~~~~~~~~~~~-~~~e 495 (495) T protein:vir:10 449 MEELFDMISDANQLIDEYDLRLDSDP----R-----YVNGS-----GAEQKSVMEAA-LNNE 495 (495) T ss_pred HHHHHHHHHHHHHHHHHcCCCCCCCC----C-----cCCCc-----cCCCCCCCCCC-CCCC Confidence 55443 23333333334454322210 0 00000 11111111111 1111 No 44 >protein:vir:78907 Length: 518 # NCBI annotation: gp3 # Family: family:all:4147 # MgeID: mge:1859 # MgeName: A006 # Cross-refs: genbank:acc:YP_001468843;genbank:gi:157325445;genbank:GeneID:5601904 Probab=95.22 E-value=0.0025 Score=34.85 Aligned_cols=294 Identities=16% Similarity=0.158 Sum_probs=134.3 Q ss_pred CCCchhhHHHh-hhhhhheeeccccccccCC-------------------------CceeecHhHhhhhhcc---cccCC Q lcl|NC_015285. 1 MRGVDLNQQLT-QKAAEYFLYNPKGLKNSTN-------------------------QGMKITTDSVTYCHSG---IQDLN 51 (359) Q Consensus 1 ~~~~~~~~~~~-~~~~e~f~yn~~~~~~~~~-------------------------~~v~i~~~ai~y~hSG---l~d~~ 51 (359) ..+........ .+.-.|-+|--.+....+. ++++.+- +.|..-. -.+.. T Consensus 186 ~~~~~~~~~~~~~~~I~n~ly~~~~~~~v~~~~~~~~~~l~~~~~~~~~~e~~~~~tg~~~~~--~~~~~n~~~N~~~~~ 263 (518) T protein:vir:78 186 IKQWDKEGKKLSGGFVTYSVIKIDGDKTTPISAERLPEQITSYLHTNDIQLNHSVSIGLKSMG--AYLINNSPSNTRYPH 263 (518) T ss_pred cccccceeecccceeEEEEEeeecCcccccccccccccccccccccccCccceeeccCCccce--EEeeccccccccccC Confidence 11111111111 1222334442111110011 1111111 1111100 00001 Q ss_pred CCcchhhHHHHHHHHHHHHHHHHHHHHHHHhcCccceeEeccCCCCchHHHHHHHHHHHHhhcceEEeeCCCCccccccc Q lcl|NC_015285. 52 KNMTLSHLHKAIKAVNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKMVYDANTGEIKDDKK 131 (359) Q Consensus 52 ~~~i~syL~~Aik~~NqL~m~EDalVIyR~~RAPeRRvFyIDvGnlpk~KAeqYl~~iM~kyrnklvYD~~TGevkdd~~ 131 (359) +..=+|-|+.|.-++..|-..=+ -+.|..|.=++|||.-+ .=|+ ...|..++.-. .. T Consensus 264 splG~S~~~~~~~~id~lD~~~s--~~~~e~~~g~~~i~v~~-~~l~------------------~~~~~~~~~~~--~~ 320 (518) T protein:vir:78 264 LNLGESDLSQCTNYLFAVDYFFT--VYMREGEKTKTKIAASE-RMFR------------------KKVNKSTDKEE--WS 320 (518) T ss_pred CCcCcchHhhhhHHHHHHHHHHH--HHHHHHHhCCceeeech-hHhc------------------cCCCCCCCccc--cc Confidence 11124667777766666655554 45678888777777621 1110 00111111000 00 Q ss_pred chhhHhhhccccc----CCCC-ccceeecCCCCCcch-HHHHHHHHHHHHHhcCCCccccCCCCcccccchhhhhHHhhh Q lcl|NC_015285. 132 FMSMMEDFWLPRR----EGGR-GTEISTLPGGQNLGE-LEDVKYFQKKLYKALNVPSSRLETETTFNIGRAAEITRDEVK 205 (359) Q Consensus 132 ~mSMlEDywLpRR----eGgr-gTEIsTLpGgqnLge-i~DV~YF~kkLy~aL~VP~SRl~~~~~~~~g~~~eItRDElK 205 (359) ..--.++|.+.. +|+. .+.|+++...=.-.+ ..-+..+.+.+....+++-+-|+.+++- --++||.+..-+ T Consensus 321 -fd~~~~~y~~i~~~~~~~~~~~~~i~~~~~~Ir~e~~~~~~~~~l~~~~~~~G~s~~tfg~~~~~--~TATei~s~~~~ 397 (518) T protein:vir:78 321 -MNVDEDYFMQFKGTLDAGAKLNDMIQFMQGDFRDGSYRETMEYFAQKAVSKSGYNPATFNLGNRE--VKATEIWSLQDA 397 (518) T ss_pred -cCCCCceEEEecCcCCCCCccccceeeeecccChHHHHHHHHHHHHHHHHhhCCChhhcCccccc--ccHHHHHHHHHH Confidence 000112232221 1222 234666554322222 3336777888888899988878654321 136677776666 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHh-cCCCChhHHHHHh--hceeeeeeccchHHHHHHHHHHHHHHHHHHHhhhhcch Q lcl|NC_015285. 206 FQKFIARLRKRFSELFTDLLKTQLIL-KGVMSLEEWEDMK--NHIQFDFIADNYFTELKEIEIRNERMNQVAAMDPYVGK 282 (359) Q Consensus 206 F~KFI~rLr~rFs~if~d~Lk~QLiL-kgI~t~eew~~~~--~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~~~~~dp~vGK 282 (359) -...+.+.|+.+...+.++++.-|-| +..+-...|.... ..+.++|...-.-.+..+++ .++++.- +- T Consensus 398 ~~~t~~~~~~~~e~al~~l~~~i~~l~~~~~~~~~~~~~~~~~~v~i~f~D~i~~D~~~~~~-------~~~~~v~--aG 468 (518) T protein:vir:78 398 TVRKIEKKKRLIQNVYEQMLWDFLYLLTGGTNNKEKAIMRDEIRVIIEFPDPMSVNLNELSS-------TLNNMNS--AL 468 (518) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCccccccCCCceeEEEEeCCCCCCCHHHHHH-------HHHHHHh--cC Confidence 66688888888888888877764332 3221111111111 34677776332222222222 2222211 12 Q ss_pred hhhHHHHHHHH-hCCCHHHHHHHHHHHHHHHhcCCCCCCcchhhhcCCCCCccc Q lcl|NC_015285. 283 YFSVDYMRRQV-LKQTEIEIKEIDEQIASEMEAGIIADPMAEMDPAMAAGGEGA 335 (359) Q Consensus 283 y~S~~~i~k~I-L~~tDeeI~e~~kqi~~E~~~~~~~~P~~~~~~~~~~~~~~~ 335 (359) .+|++..++++ -..||+|.+++-++|++|......++|++ -.|+.+.|+ T Consensus 469 imS~e~~i~~~~~~~~deea~~e~~ri~~E~~~~~~~~p~~----~~g~~~~~g 518 (518) T protein:vir:78 469 AMSVEEKVKLIHPKWEDEEIQAEVKRIYLENAIGEVPDPEA----IGGMETKGG 518 (518) T ss_pred CCCHHHHHHHhCCCCCHHHHHHHHHHHHHHhcccCCCCCcc----ccCCCCCCC Confidence 57998877664 37899999999999999977665555543 222322222 No 45 >protein:vir:5249 Length: 437 # NCBI annotation: hypothetical protein # Family: family:all:297 # MgeID: mge:117 # MgeName: Aaphi23 # Cross-refs: genbank:acc:NP_852754;genbank:gi:31544029;interpro:IPR006445;uniprot:Q7Y5U6;genbank:GeneID:2753529 Probab=94.89 E-value=0.0032 Score=34.22 Aligned_cols=289 Identities=13% Similarity=0.168 Sum_probs=124.1 Q ss_pred CCCch---hhHHHhhhhhhheeec---------------------cccc-cccCCCceeecHhHhh-hhhc-ccccCCCC Q lcl|NC_015285. 1 MRGVD---LNQQLTQKAAEYFLYN---------------------PKGL-KNSTNQGMKITTDSVT-YCHS-GIQDLNKN 53 (359) Q Consensus 1 ~~~~~---~~~~~~~~~~e~f~yn---------------------~~~~-~~~~~~~v~i~~~ai~-y~hS-Gl~d~~~~ 53 (359) ..+.. +-..+-..+..+-+++ |... ..+.+..++|++|=|. |++. +-.+.+.. T Consensus 108 ~~d~~~~~~pl~~~~~~~~~~v~~~~~v~~~~~~~~dp~s~~fg~p~~y~v~~~~~~~~iH~SRii~~~~~~~~~~~~~~ 187 (437) T protein:vir:52 108 VTDSQNTSAPLKPTERLKRLIILPKWKISPTGTKDDDVLSPNFGRYSEYSILGGSQSITVHHSRLIILNANDAPLSDNDI 187 (437) T ss_pred EecCCCcccccccCCceeEEEEechhhccccccccccccccccCcceEEEEecCCcceeEccceeEEecCccCCCccccc Confidence 11000 0000000111111111 1111 1122334677776643 2221 11121222 Q ss_pred cchhhHHHHHHHHHHHHHHHHH--HHHHHHhcCccceeEeccCC--CCch--HHHHHHHHHHHHhhcceEEeeCCCCccc Q lcl|NC_015285. 54 MTLSHLHKAIKAVNQLRMIEDS--LVIYRLSRAPERRIFYIDVG--NLPK--NKAEQYLREVMGRYRNKMVYDANTGEIK 127 (359) Q Consensus 54 ~i~syL~~Aik~~NqL~m~EDa--lVIyR~~RAPeRRvFyIDvG--nlpk--~KAeqYl~~iM~kyrnklvYD~~TGevk 127 (359) .=+|.|+++...+..+...+.+ .++++ +... ++.++.- .|.. ..+..-..+.++++|+ ..|- T Consensus 188 ~G~s~le~~~~~i~~~~~~~~~~~~l~~~---~~~~-v~k~~~l~~~l~~~~~~~~~~~~~~~~~~~~------~~~~-- 255 (437) T protein:vir:52 188 WGVSDLEKIIDVLKRFDSASVNVGDLIFE---SKID-IFKIAGLSDKIAAGMENEVASVISAVQEIKS------ATNS-- 255 (437) T ss_pred cCCchHHHHHHHHHHHHHHHHHHHHHHHH---cCCC-ceecchHHHHhcCCcHHHHHHHHHHHHHhcC------CCce-- Confidence 2378888877666555444432 33443 4333 4555420 1111 1111111222333321 1111 Q ss_pred ccccchhhHhhhcccccCCCCccceeecCCCCCcchHHHH-HHHHHHHHHhcCCCccccCCC--CcccccchhhhhHHhh Q lcl|NC_015285. 128 DDKKFMSMMEDFWLPRREGGRGTEISTLPGGQNLGELEDV-KYFQKKLYKALNVPSSRLETE--TTFNIGRAAEITRDEV 204 (359) Q Consensus 128 dd~~~mSMlEDywLpRReGgrgTEIsTLpGgqnLgei~DV-~YF~kkLy~aL~VP~SRl~~~--~~~~~g~~~eItRDEl 204 (359) +-| ..+-+++++. .+++-++|+ ..|...+=.+.+||+.+|-.+ +|++ .+.+ |.- T Consensus 256 -----~~~-----------d~~~~~e~~~--~~~sgl~~~l~~~~~~iaaa~~iP~t~L~G~s~~Gla--sge~---D~~ 312 (437) T protein:vir:52 256 -----LLL-----------DAENEYDRKE--LTFTGLKDLLTEFRNAVAGAADMPVTILFGQSVSGLA--SGDE---DIQ 312 (437) T ss_pred -----EEE-----------cCCcceEEEe--cCcCCHHHHHHHHHHHHHHHhcCchhhhcCcCccccc--ccHH---HHH Confidence 111 0112233332 245556664 688999999999999999544 4552 2322 334 Q ss_pred hHHHHHHHHHHH-HHHHHHHHHHHHHHhcCCCChhHHHHHhhceeeeeeccchHHHHHHHHHHHHHHHHHHHhhhhcchh Q lcl|NC_015285. 205 KFQKFIARLRKR-FSELFTDLLKTQLILKGVMSLEEWEDMKNHIQFDFIADNYFTELKEIEIRNERMNQVAAMDPYVGKY 283 (359) Q Consensus 205 KF~KFI~rLr~r-Fs~if~d~Lk~QLiLkgI~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~~~~~dp~vGKy 283 (359) .|+.+|+++|.. +..+...+++. |++. +|-.+-+.++|.|..=..-++...+|+...+.++++.+-.- | . T Consensus 313 ~yyd~i~~~Qe~~l~p~le~l~~~-i~~~------~~g~~~~~~~~~f~pL~~~s~kekae~~~~~a~a~~~~~~~-g-~ 383 (437) T protein:vir:52 313 NYHEAIRRLQETRLRPIFEIIDPL-ICNE------LFGGLPADWWFEFVPLTTVKQEQQINMLNTFATAANTLIQN-G-V 383 (437) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHH-HHHH------hcCCCCCcceEEeCCcCCcCHHHHHHHHHHHHHHHHHHHhc-C-C Confidence 499999999964 66666665552 2222 12222245778998777778888899988888887775332 1 2 Q ss_pred hhHHHHHHHHhCCCHHHHHHHHHHHHHHHhcCCCCC-Ccchh---hhcCCCCCcccccccCCCCCcCCCCCCC Q lcl|NC_015285. 284 FSVDYMRRQVLKQTEIEIKEIDEQIASEMEAGIIAD-PMAEM---DPAMAAGGEGAPAAEVDPNAQESSVDPG 352 (359) Q Consensus 284 ~S~~~i~k~IL~~tDeeI~e~~kqi~~E~~~~~~~~-P~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~p~ 352 (359) +|.+-+++. | . ..+.|+. |++.. +......+...++.+ .++.++.+..|+ T Consensus 384 i~~~e~r~~-L--------------~---~~g~~~~i~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~ 437 (437) T protein:vir:52 384 LNEYQIANE-L--------------R---ESGLFANISAEHIEELKNADEFAGNFEEPEK-MEGAQVQNSEDQ 437 (437) T ss_pred CCHHHHHHH-H--------------H---hcCCCCCCCccccccccCCCCCCCccCCCCC-CCCCCCCCCCCC Confidence 343333322 1 1 1244431 11111 111111111111111 111111111111 No 46 >protein:vir:80959 Length: 499 # NCBI annotation: gp3 # Family: family:all:898 # MgeID: mge:1886 # MgeName: A500 # Cross-refs: genbank:acc:YP_001468389;genbank:gi:157324963;genbank:GeneID:5601394 Probab=94.83 E-value=0.0034 Score=34.12 Aligned_cols=285 Identities=15% Similarity=0.192 Sum_probs=126.3 Q ss_pred CCCch----------hhHHHhhhhhhheeeccccccccCCCceeecHhHh------------------hhhhccc---cc Q lcl|NC_015285. 1 MRGVD----------LNQQLTQKAAEYFLYNPKGLKNSTNQGMKITTDSV------------------TYCHSGI---QD 49 (359) Q Consensus 1 ~~~~~----------~~~~~~~~~~e~f~yn~~~~~~~~~~~v~i~~~ai------------------~y~hSGl---~d 49 (359) -++.. .+.+.....-++.+|..... ...|..++-..+ +|..--. .+ T Consensus 174 ~~~~~~y~~lE~h~~~~~~~~~y~I~n~~~~~~~~---~~lG~~v~l~~~~~~~~~~~~~~~~~~p~f~~~~~~~~N~~~ 250 (499) T protein:vir:80 174 HKNNKYYKLLEWNEWKGEKEEVYTVTTELYQSDDP---NELGGKVSLKLLFNDIEPVVPLPSLTRPTFIYIKPNIANNKN 250 (499) T ss_pred eecCeEEEEEEEEEecccceeeEEEEEEEEeccCc---cccCcccchhhhccCcCCceeecCCCccceEeecCCcccccc Confidence 11110 11111122223334432211 111111221111 1110000 01 Q ss_pred CCCCcchhhHHHHHHHHHHHHHHHHHHHHHHHhcCccceeEeccCCCCchHHHHHHHHHHHHhhcceEEeeCCC-Cccc- Q lcl|NC_015285. 50 LNKNMTLSHLHKAIKAVNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKMVYDANT-GEIK- 127 (359) Q Consensus 50 ~~~~~i~syL~~Aik~~NqL~m~EDalVIyR~~RAPeRRvFyIDvGnlpk~KAeqYl~~iM~kyrnklvYD~~T-Gevk- 127 (359) .....-+|-++.|...+..|-..-+.+ .|..+.=.+|+|. +..=++. ..=++... +... T Consensus 251 ~~splG~S~~~~~~~lid~lD~~~s~~--~~e~~~~~~~i~v-~~~~l~~----------------~~~~~g~~~~~~~~ 311 (499) T protein:vir:80 251 LTSPLGISVYANALDTLKTLDLMFDSY--YQEFKLGKKKVLV-PSSFVKT----------------AVNLDGSTTQYFDS 311 (499) T ss_pred CCCccCCchHhhHHHHHHHHHHHHHHH--HHHHHhcccceec-chhhhhc----------------cCCCCCCcccCCCc Confidence 112223567777777777777666654 3777886677764 2111100 00011000 0001 Q ss_pred ccccchhhHhhhcccccCCCCccceeecCCCCCcch-HHHHHHHHHHHHHhcCCCccccCCCCcccccchhhhhHHhhhH Q lcl|NC_015285. 128 DDKKFMSMMEDFWLPRREGGRGTEISTLPGGQNLGE-LEDVKYFQKKLYKALNVPSSRLETETTFNIGRAAEITRDEVKF 206 (359) Q Consensus 128 dd~~~mSMlEDywLpRReGgrgTEIsTLpGgqnLge-i~DV~YF~kkLy~aL~VP~SRl~~~~~~~~g~~~eItRDElKF 206 (359) +++.+..+ +=..++.|--|+++.+.-.-.+ .+-+..+.+.+....++|-+-|+.+++- ...++||.-....- T Consensus 312 ~~~~~~~~------~~~~~~~~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~g~s~~~fg~~~~g-~~TAtei~s~~~~l 384 (499) T protein:vir:80 312 TDEAFFLY------QGEQDDNGKAIKDISVEIRSTEFIESINAMLRIYAMQVGLSAGTFTFDENG-LKTATEVVSEKSET 384 (499) T ss_pred ccceeeEe------eccCCCCcCceeEecCcCChHHHHHHHHHHHHHHHHhcCCChhhcCCCccc-chhHHHHHHHHHHH Confidence 22222111 0001122223676665443332 3567788889999999998888765421 22356665443333 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHH-------HhcCCCChhHHHHHhhceeeeeeccchHHHHHHHHHHHHHHHHHHHhhhh Q lcl|NC_015285. 207 QKFIARLRKRFSELFTDLLKTQL-------ILKGVMSLEEWEDMKNHIQFDFIADNYFTELKEIEIRNERMNQVAAMDPY 279 (359) Q Consensus 207 ~KFI~rLr~rFs~if~d~Lk~QL-------iLkgI~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~~~~~dp~ 279 (359) ..-+...++.|..-+.++++.=| .+.|. .|+ ...+.++|...-.-.+..+ ++.+.++-- T Consensus 385 ~~~~~~~~~~~~~~l~~l~~~il~~~~~~~~~~~~----~~~--~~~v~v~f~d~i~~d~~~~-------~~~~~~~~~- 450 (499) T protein:vir:80 385 YQTKNSHSQLIEQGIKEMIVSILEVGKLIKAYDGD----TVE--LDTITVDFDDSIAQDEDTT-------INRYTTAKN- 450 (499) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccCC----CCC--ccceEEEeCCCCCCCHHHH-------HHHHHHHHH- Confidence 33344445555444544444422 23332 222 2578888854432222111 122222110 Q ss_pred cchhhhHHHHHHHHhCCCHHHHHHHHHHHHHHHhcCCCCCCcchhhhcCCCCCcccccccCC Q lcl|NC_015285. 280 VGKYFSVDYMRRQVLKQTEIEIKEIDEQIASEMEAGIIADPMAEMDPAMAAGGEGAPAAEVD 341 (359) Q Consensus 280 vGKy~S~~~i~k~IL~~tDeeI~e~~kqi~~E~~~~~~~~P~~~~~~~~~~~~~~~~~~~~~ 341 (359) .| .+|.++++.+..+.||+|.+++.++|++|.... .++|+ .+ |.. |+.. T Consensus 451 ~G-i~S~et~l~~~~~~~d~ea~~el~~i~~E~~~~-~~~~d----~~-g~~------ge~e 499 (499) T protein:vir:80 451 QG-MIPLKIALQRAWNITEAEADEWAEMLAKEKQAE-IPNND----MT-GIF------GEEE 499 (499) T ss_pred cC-CCCHHHHHhhcCCCChHHHHHHHHHHHHHhhcC-CCCCC----cc-ccC------CCCC Confidence 12 468888888878999999999999999996543 22221 11 111 1111 No 47 >protein:vir:98883 Length: 517 # NCBI annotation: portal # Family: family:all:898 # MgeID: mge:1568 # MgeName: BCJA1c # Cross-refs: genbank:acc:YP_164413;genbank:gi:56694903;genbank:GeneID:3197273 Probab=94.49 E-value=0.0043 Score=33.57 Aligned_cols=282 Identities=14% Similarity=0.165 Sum_probs=129.8 Q ss_pred CC--------------CchhhHHHhhhhhhheeeccccccccCCCceeec------------------HhHhhh------ Q lcl|NC_015285. 1 MR--------------GVDLNQQLTQKAAEYFLYNPKGLKNSTNQGMKIT------------------TDSVTY------ 42 (359) Q Consensus 1 ~~--------------~~~~~~~~~~~~~e~f~yn~~~~~~~~~~~v~i~------------------~~ai~y------ 42 (359) .+ +..++.+ -+..-++-+|.... ....|..|+ +--.+| T Consensus 186 ~~~~~~~Yt~lE~H~~~~~~~~~-~~y~I~n~ly~s~~---~~~lG~~v~L~~~~e~l~~~~~~~g~~~Plf~y~~~p~~ 261 (517) T protein:vir:98 186 IGNKTVYYTLLEFHEWEKTEEGE-SLYVITNELYKSDN---EGEIGKRIPLEELYEGMQEKTYIQGLSRPLFNYLKPSGF 261 (517) T ss_pred ecCCceEEEEEEEEecCceeccC-CcEEEEEEEEecCC---CccccccccccccccCCCcceeECCCCcceEEEecCCcc Confidence 00 0000000 00011122221100 000000100 000111 Q ss_pred ----hhcccccCCCCcchhhHHHHHHHHHHHHHHHHHHHHHHHhcCccceeEeccCCCCchHHHHHHHHHHHHhhcceEE Q lcl|NC_015285. 43 ----CHSGIQDLNKNMTLSHLHKAIKAVNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKMV 118 (359) Q Consensus 43 ----~hSGl~d~~~~~i~syL~~Aik~~NqL~m~EDalVIyR~~RAPeRRvFyIDvGnlpk~KAeqYl~~iM~kyrnklv 118 (359) .||.| =+|-++.|+-++.-|-..-+ .+.|..|.=.+|||. +..=+++.... +.-....+ T Consensus 262 N~~~~~spl-------G~S~~~~a~~~~d~lD~~~s--~~~~e~~~g~~~i~v-p~~~l~~~~~~-------~g~~~~~~ 324 (517) T protein:vir:98 262 NNINPHSPL-------GLGITDNSVSTLKKINDTYD--QFWWEIKMGQRTVFV-SDVMLRTVPDE-------SGMPPPQV 324 (517) T ss_pred cccccCCCC-------CCchhhhhHHHHHHHHHHHH--HHHHHHHhCCcceec-ChhhhccccCC-------CCcccCCC Confidence 13322 23555556555555553333 445788887777775 22211100000 00000011 Q ss_pred eeCCCCcccccccchhhHhhhcccccCCCCccceeecCCCCCc-chHHHHHHHHHHHHHhcCCCccccCCCCcccccchh Q lcl|NC_015285. 119 YDANTGEIKDDKKFMSMMEDFWLPRREGGRGTEISTLPGGQNL-GELEDVKYFQKKLYKALNVPSSRLETETTFNIGRAA 197 (359) Q Consensus 119 YD~~TGevkdd~~~mSMlEDywLpRReGgrgTEIsTLpGgqnL-gei~DV~YF~kkLy~aL~VP~SRl~~~~~~~~g~~~ 197 (359) +| .+++.|..+..+ .++. -|+++.+.==. .-..-+.++.+.+-...++|-+-|+.++. ....++ T Consensus 325 ~d------~~~~~y~~~~~~------~~~~--~i~~~~~~iR~e~~~~~~~~~L~~i~~~~Gls~~t~~~~~~-~~kTAT 389 (517) T protein:vir:98 325 FD------PDVNVYKSIRMG------TDEE--FVKDVTHDIRTEQYKEAINQALRTLEMELKLSVGTFSFDGR-SMKTAT 389 (517) T ss_pred CC------cccceeeeccCC------CCCC--ceeeeccccchHHHHHHHHHHHHHHHHHhCCCccccccccc-ccccHH Confidence 11 123334332211 1111 14444432111 22455677888888899999999987654 235688 Q ss_pred hhhHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHhc-------CCCChhHHHHHhhceeeeeeccchHHHHHHHHHHHHHH Q lcl|NC_015285. 198 EITRDEVKFQKFIARLRKRFSELFTDLLKTQLILK-------GVMSLEEWEDMKNHIQFDFIADNYFTELKEIEIRNERM 270 (359) Q Consensus 198 eItRDElKF~KFI~rLr~rFs~if~d~Lk~QLiLk-------gI~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Eil~~Rl 270 (359) ||...+-.-..-+.+.|+.+...+.++++.-|.|. +.+.+. ..+.++|. |+.+.. +++++ T Consensus 390 Ei~s~~~~~~~t~~~~~~~~~~aL~~lv~~i~~l~~~~~~~~~~~~~~------~~v~v~f~-D~i~~D-~~~~~----- 456 (517) T protein:vir:98 390 EIVSENDLTYRTRNDHVYEVEQFIKGLVISVLELAKTYKLFGGEIPSA------EHIGVDFD-DGVFQD-RSALL----- 456 (517) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCCCCCC------cceEEEcC-CCCCCC-HHHHH----- Confidence 88877777777888899888888888888765432 222221 34677775 333322 22221 Q ss_pred HHHHHhhhhcchhhhHHHHHHHHhCCCHHHHHHHHHHHHHHHhcCCCCCCcchhhhcCCCCCcccccccCC Q lcl|NC_015285. 271 NQVAAMDPYVGKYFSVDYMRRQVLKQTEIEIKEIDEQIASEMEAGIIADPMAEMDPAMAAGGEGAPAAEVD 341 (359) Q Consensus 271 ~~~~~~dp~vGKy~S~~~i~k~IL~~tDeeI~e~~kqi~~E~~~~~~~~P~~~~~~~~~~~~~~~~~~~~~ 341 (359) +.+.++.-- | .+|....+.+..++||+|.+++-.+|++|.... +|..-. .+.++..+|+.. T Consensus 457 ~~~~~~v~a-G-~ms~~~~i~~~~g~~eeeA~~e~~~i~~E~~~~---~~~~~~-----~~~~~~~~gd~e 517 (517) T protein:vir:98 457 RFYGQAKTF-G-FIPTVEAIQRIFKVPKKTAEQWLEEIRKDQIEL---DPVTIS-----QRAQKRMFGDEE 517 (517) T ss_pred HHHHHHHhc-C-CCCHHHHHHHhCCCChHHHHHHHHHHHHhcccc---CCCCcc-----ccccCCCCCCCC Confidence 111121111 3 368877766778999999999999999986532 222111 111112222111 No 48 >protein:vir:9815 Length: 500 # NCBI annotation: putative minor capsid protein # Family: family:all:898 # MgeID: mge:176 # MgeName: 315.4 # Cross-refs: genbank:acc:NP_795577;genbank:gi:28876344;genbank:GeneID:1257866 Probab=94.35 E-value=0.0047 Score=33.37 Aligned_cols=284 Identities=14% Similarity=0.197 Sum_probs=132.7 Q ss_pred CCCchhhHHHhhhhhhheeecccc-------cccc---CCCceeecHhHh--------hhhhc--ccc-----------c Q lcl|NC_015285. 1 MRGVDLNQQLTQKAAEYFLYNPKG-------LKNS---TNQGMKITTDSV--------TYCHS--GIQ-----------D 49 (359) Q Consensus 1 ~~~~~~~~~~~~~~~e~f~yn~~~-------~~~~---~~~~v~i~~~ai--------~y~hS--Gl~-----------d 49 (359) ...+..+-.++=...||+.++.+. .+.+ ..-|-.|+-..+ ++.+. .|+ + T Consensus 171 ~~~~~~~~~~~yt~lE~h~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~~~l~~~~~~~~~~~p~f~~~~~~~~N~~~ 250 (500) T protein:vir:98 171 SVKTINGKEVYYTLIEFHEWQSSDDYVISNELYRSDDKAKVGSRVPLSEVYKDLKDEAKVTDVTRPIFTYLKTPGMNNKD 250 (500) T ss_pred EeeeecCCceEEEEEEEEEEeCCceeEEEEEEEecccccccCcccccccccCCcCcceEeccCCCccEEEecCCcccccc Confidence 322222222222222333322111 0111 111222221111 11110 111 1 Q ss_pred CCCCcchhhHHHHHHHHHHHHHHHHHHHHHHHhcCccceeEeccCCCCchHHHHHHHHHHHHhhcceEEeeCCCCccc-- Q lcl|NC_015285. 50 LNKNMTLSHLHKAIKAVNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKMVYDANTGEIK-- 127 (359) Q Consensus 50 ~~~~~i~syL~~Aik~~NqL~m~EDalVIyR~~RAPeRRvFyIDvGnlpk~KAeqYl~~iM~kyrnklvYD~~TGevk-- 127 (359) ..+..=+|-++.|.-.+..|-..-+.+. |..|.=++|||. +..=++ ...+..+|+.. T Consensus 251 ~~sp~G~S~~~~~~~lid~lD~~~s~~~--~e~~~g~~~i~v-~~~~l~------------------~~~~~~~g~~~~~ 309 (500) T protein:vir:98 251 INSPLGLSIFDNAKTTIDFINTTYDEFM--WEVKMGQRRVAV-PESLTA------------------LTVRTTDGDVVPR 309 (500) T ss_pred CCCccCCchhhhhHHHHHHHHHHHHHHH--HHHHhCcceeee-chHHhc------------------ccCCCCCccccCC Confidence 1112235888888888877777766655 788887777765 111100 01111122110 Q ss_pred -----ccccchhhHhhhcccccCCCCccceeecCCCCCcch-HHHHHHHHHHHHHhcCCCccccCCCCcccccchhhhhH Q lcl|NC_015285. 128 -----DDKKFMSMMEDFWLPRREGGRGTEISTLPGGQNLGE-LEDVKYFQKKLYKALNVPSSRLETETTFNIGRAAEITR 201 (359) Q Consensus 128 -----dd~~~mSMlEDywLpRReGgrgTEIsTLpGgqnLge-i~DV~YF~kkLy~aL~VP~SRl~~~~~~~~g~~~eItR 201 (359) +++.|..|-.+ -++ +.-|+.+...--..+ ..-+.++.+.+=.+.+++-+.|+.+++ ....++||.- T Consensus 310 ~~~d~~~~~~~~~~~~-----~~~--~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~gls~~~~~~~~~-g~~TAtei~s 381 (500) T protein:vir:98 310 PRFESDQNVYIRMGGR-----DLD--SSAIQDLTTPIRADDYIKAINEGLSLFEMQIGVSAGLFSFDGK-SMKTATEIVS 381 (500) T ss_pred cccCCCcceEEEcCCC-----CCc--CcceeEeccccChHHHHHHHHHHHHHHHHHhCCCccccccCcC-ccccHHHHHH Confidence 12222221100 011 122555543221222 234566777777788888888876654 2345677765 Q ss_pred HhhhHHHHHHHHHHHHHHHHHHHHHHHHHh-------cCCCChhHHHHHhhceeeeeeccchHHHHHHHHHHHHHHHHHH Q lcl|NC_015285. 202 DEVKFQKFIARLRKRFSELFTDLLKTQLIL-------KGVMSLEEWEDMKNHIQFDFIADNYFTELKEIEIRNERMNQVA 274 (359) Q Consensus 202 DElKF~KFI~rLr~rFs~if~d~Lk~QLiL-------kgI~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~~~ 274 (359) .+-.-..-+.+.|+.|...+.++++.=|-+ .|.... ...+.++|. |+.+.. +++++- ..+.+++ T Consensus 382 ~~~~~~~t~~~~~~~~~~al~~lv~~il~~~~~~~~~~~~~~~------~~~v~v~f~-d~i~~d-~~~~~~-~~~~~v~ 452 (500) T protein:vir:98 382 ENSDTYQMRNSIVALVEQSLKELVISIFEIAKAYDLYQSEVPS------MDNISISLD-DGVFTD-RDAELD-YWIKVVN 452 (500) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCCCCC------CcceEEEeC-CCCCCC-HHHHHH-HHHHHHH Confidence 555555667777777777777777665432 232222 134778885 443333 222221 1111111 Q ss_pred HhhhhcchhhhHHHHHHHHhCCCHHHHHHHHHHHHHHHhcCCCCCCcchhhhcCCCCCccccccc Q lcl|NC_015285. 275 AMDPYVGKYFSVDYMRRQVLKQTEIEIKEIDEQIASEMEAGIIADPMAEMDPAMAAGGEGAPAAE 339 (359) Q Consensus 275 ~~dp~vGKy~S~~~i~k~IL~~tDeeI~e~~kqi~~E~~~~~~~~P~~~~~~~~~~~~~~~~~~~ 339 (359) . | .+|..+.+.+..++||+|.+++.++|++|.... ...|+ ..+++.|+ T Consensus 453 --a---G-i~s~~~~i~~~~g~~eeea~~~l~~i~~E~~~~-~~~~~----------~~~~~~g~ 500 (500) T protein:vir:98 453 --A---G-FGTREMAIQKVLNVTEEKAQEIAAEINTGIVDE-INQQR----------TDTHLYGE 500 (500) T ss_pred --c---C-CCCHHHHHHhcCCCCHHHHHHHHHHHHHhcccc-CCCCC----------ccccccCC Confidence 1 2 368877766778999999999999999884221 11121 22233333 No 49 >protein:vir:3028 Length: 500 # NCBI annotation: minor capsid protein # Family: family:all:898 # MgeID: mge:61 # MgeName: PhiNIH1.1 # Cross-refs: genbank:acc:NP_438141;genbank:gi:16271804;genbank:GeneID:929241 Probab=94.35 E-value=0.0047 Score=33.37 Aligned_cols=284 Identities=14% Similarity=0.197 Sum_probs=132.7 Q ss_pred CCCchhhHHHhhhhhhheeecccc-------cccc---CCCceeecHhHh--------hhhhc--ccc-----------c Q lcl|NC_015285. 1 MRGVDLNQQLTQKAAEYFLYNPKG-------LKNS---TNQGMKITTDSV--------TYCHS--GIQ-----------D 49 (359) Q Consensus 1 ~~~~~~~~~~~~~~~e~f~yn~~~-------~~~~---~~~~v~i~~~ai--------~y~hS--Gl~-----------d 49 (359) ...+..+-.++=...||+.++.+. .+.+ ..-|-.|+-..+ ++.+. .|+ + T Consensus 171 ~~~~~~~~~~~yt~lE~h~~~~~~~~~I~n~ly~~~~~~~lG~~v~l~~~~~~l~~~~~~~~~~~p~f~~~~~~~~N~~~ 250 (500) T protein:vir:30 171 SVKTINGKEVYYTLIEFHEWQSSDDYVISNELYRSDDKAKVGSRVPLSEVYKDLKDEAKVTDVTRPIFTYLKTPGMNNKD 250 (500) T ss_pred EeeeecCCceEEEEEEEEEEeCCceeEEEEEEEecccccccCcccccccccCCcCcceEeccCCCccEEEecCCcccccc Confidence 322222222222222333322111 0111 111222221111 11110 111 1 Q ss_pred CCCCcchhhHHHHHHHHHHHHHHHHHHHHHHHhcCccceeEeccCCCCchHHHHHHHHHHHHhhcceEEeeCCCCccc-- Q lcl|NC_015285. 50 LNKNMTLSHLHKAIKAVNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKMVYDANTGEIK-- 127 (359) Q Consensus 50 ~~~~~i~syL~~Aik~~NqL~m~EDalVIyR~~RAPeRRvFyIDvGnlpk~KAeqYl~~iM~kyrnklvYD~~TGevk-- 127 (359) ..+..=+|-++.|.-.+..|-..-+.+. |..|.=++|||. +..=++ ...+..+|+.. T Consensus 251 ~~sp~G~S~~~~~~~lid~lD~~~s~~~--~e~~~g~~~i~v-~~~~l~------------------~~~~~~~g~~~~~ 309 (500) T protein:vir:30 251 INSPLGLSIFDNAKTTIDFINTTYDEFM--WEVKMGQRRVAV-PESLTA------------------LTVRTTDGDVVPR 309 (500) T ss_pred CCCccCCchhhhhHHHHHHHHHHHHHHH--HHHHhCcceeee-chHHhc------------------ccCCCCCccccCC Confidence 1112235888888888877777766655 788887777765 111100 01111122110 Q ss_pred -----ccccchhhHhhhcccccCCCCccceeecCCCCCcch-HHHHHHHHHHHHHhcCCCccccCCCCcccccchhhhhH Q lcl|NC_015285. 128 -----DDKKFMSMMEDFWLPRREGGRGTEISTLPGGQNLGE-LEDVKYFQKKLYKALNVPSSRLETETTFNIGRAAEITR 201 (359) Q Consensus 128 -----dd~~~mSMlEDywLpRReGgrgTEIsTLpGgqnLge-i~DV~YF~kkLy~aL~VP~SRl~~~~~~~~g~~~eItR 201 (359) +++.|..|-.+ -++ +.-|+.+...--..+ ..-+.++.+.+=.+.+++-+.|+.+++ ....++||.- T Consensus 310 ~~~d~~~~~~~~~~~~-----~~~--~~~i~~~~~~ir~e~~~~~l~~~l~~i~~~~gls~~~~~~~~~-g~~TAtei~s 381 (500) T protein:vir:30 310 PRFESDQNVYIRMGGR-----DLD--SSAIQDLTTPIRADDYIKAINEGLSLFEMQIGVSAGLFSFDGK-SMKTATEIVS 381 (500) T ss_pred cccCCCcceEEEcCCC-----CCc--CcceeEeccccChHHHHHHHHHHHHHHHHHhCCCccccccCcC-ccccHHHHHH Confidence 12222221100 011 122555543221222 234566777777788888888876654 2345677765 Q ss_pred HhhhHHHHHHHHHHHHHHHHHHHHHHHHHh-------cCCCChhHHHHHhhceeeeeeccchHHHHHHHHHHHHHHHHHH Q lcl|NC_015285. 202 DEVKFQKFIARLRKRFSELFTDLLKTQLIL-------KGVMSLEEWEDMKNHIQFDFIADNYFTELKEIEIRNERMNQVA 274 (359) Q Consensus 202 DElKF~KFI~rLr~rFs~if~d~Lk~QLiL-------kgI~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~~~ 274 (359) .+-.-..-+.+.|+.|...+.++++.=|-+ .|.... ...+.++|. |+.+.. +++++- ..+.+++ T Consensus 382 ~~~~~~~t~~~~~~~~~~al~~lv~~il~~~~~~~~~~~~~~~------~~~v~v~f~-d~i~~d-~~~~~~-~~~~~v~ 452 (500) T protein:vir:30 382 ENSDTYQMRNSIVALVEQSLKELVISIFEIAKAYDLYQSEVPS------MDNISISLD-DGVFTD-RDAELD-YWIKVVN 452 (500) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCCCCCC------CcceEEEeC-CCCCCC-HHHHHH-HHHHHHH Confidence 555555667777777777777777665432 232222 134778885 443333 222221 1111111 Q ss_pred HhhhhcchhhhHHHHHHHHhCCCHHHHHHHHHHHHHHHhcCCCCCCcchhhhcCCCCCccccccc Q lcl|NC_015285. 275 AMDPYVGKYFSVDYMRRQVLKQTEIEIKEIDEQIASEMEAGIIADPMAEMDPAMAAGGEGAPAAE 339 (359) Q Consensus 275 ~~dp~vGKy~S~~~i~k~IL~~tDeeI~e~~kqi~~E~~~~~~~~P~~~~~~~~~~~~~~~~~~~ 339 (359) . | .+|..+.+.+..++||+|.+++.++|++|.... ...|+ ..+++.|+ T Consensus 453 --a---G-i~s~~~~i~~~~g~~eeea~~~l~~i~~E~~~~-~~~~~----------~~~~~~g~ 500 (500) T protein:vir:30 453 --A---G-FGTREMAIQKVLNVTEEKAQEIAAEINTGIVDE-INQQR----------TDTHLYGE 500 (500) T ss_pred --c---C-CCCHHHHHHhcCCCCHHHHHHHHHHHHHhcccc-CCCCC----------ccccccCC Confidence 1 2 368877766778999999999999999884221 11121 22233333 No 50 >protein:vir:102727 Length: 945 # NCBI annotation: portal protein # Family: family:all:2446 # MgeID: mge:1610 # MgeName: YS40 # Cross-refs: genbank:acc:YP_874016;genbank:gi:118197623;genbank:GeneID:4495919 Probab=94.13 E-value=0.0053 Score=33.05 Aligned_cols=298 Identities=12% Similarity=0.109 Sum_probs=125.9 Q ss_pred CCCchhhHHHhhhhhhheeeccccc-------------c---ccCCCceeecHhHhhhhhcccccCCC---CcchhhHHH Q lcl|NC_015285. 1 MRGVDLNQQLTQKAAEYFLYNPKGL-------------K---NSTNQGMKITTDSVTYCHSGIQDLNK---NMTLSHLHK 61 (359) Q Consensus 1 ~~~~~~~~~~~~~~~e~f~yn~~~~-------------~---~~~~~~v~i~~~ai~y~hSGl~d~~~---~~i~syL~~ 61 (359) +++ ..|. +.+.+..+|... + ..+.....++.+-+++ |--..+.++ +.=+|-|.. T Consensus 206 iRd-~~G~-----ii~L~pLdPs~Vti~~ddDG~~~y~Yv~~idG~~~~~v~a~DvIl-hirn~s~DG~~~GyGlSPIea 278 (945) T protein:vir:10 206 IRD-EQGN-----LVAITPVDGTTIKPILSEDTGIVVGYVQEVDGAIVAHFDKRDVVL-FRQNLTPDVYMYGYSLPPIEI 278 (945) T ss_pred EEC-CCCc-----EEEEEEECCcceEEEEcCCCcEEEEEEEecCCceEEEecCCceEE-EeccCCCCcccccCCchHHHH Confidence 111 1111 122222222110 0 0122234555555443 111112222 223577888 Q ss_pred HHHHHHHHHHHHHH-HHHHHHhcCccceeEeccCCCC---------chHHHHHHHHHHHHhhcceEEeeCCCCccccccc Q lcl|NC_015285. 62 AIKAVNQLRMIEDS-LVIYRLSRAPERRIFYIDVGNL---------PKNKAEQYLREVMGRYRNKMVYDANTGEIKDDKK 131 (359) Q Consensus 62 Aik~~NqL~m~EDa-lVIyR~~RAPeRRvFyIDvGnl---------pk~KAeqYl~~iM~kyrnklvYD~~TGevkdd~~ 131 (359) |.+.+.....+++. .-.|+--.|.-+-+..++.++. .+..+++ +++.+.+... |. .+... T Consensus 279 a~~aI~~alAaek~aar~FskNGa~PsGILsvkg~~~~d~k~~~~LseEq~er-lKe~wee~~s--------G~-NnG~p 348 (945) T protein:vir:10 279 LYKVILSDIFIDKGNLDYYRKGGSIPEGILAIEPPSYKEGDIYPQLSREQLES-IQRQLQAIMM--------GD-YTQVP 348 (945) T ss_pred HHHHHHHHHHHHHHHHHHHHhCCCccceEEEecCccccccccccccCHHHHHH-HHHHHHHHhC--------Cc-ccccc Confidence 88888776656553 4444445677788998886643 2222222 3333332211 21 11111 Q ss_pred chhhHhhhcccccCCCCccceeecCCCCCcch-HHHHHHHHHHHHHhcCCCccccCCCCcccccchhhhhHHhhhHHHH- Q lcl|NC_015285. 132 FMSMMEDFWLPRREGGRGTEISTLPGGQNLGE-LEDVKYFQKKLYKALNVPSSRLETETTFNIGRAAEITRDEVKFQKF- 209 (359) Q Consensus 132 ~mSMlEDywLpRReGgrgTEIsTLpGgqnLge-i~DV~YF~kkLy~aL~VP~SRl~~~~~~~~g~~~eItRDElKF~KF- 209 (359) + ++ ..|.+++.|.....-.| ++-.+|..+...++.+||...|+..++.+.....+. ..-|..+ T Consensus 349 -i-VL----------deGmef~pLs~s~~DaQfLEsrkfs~eeIArAFGVPP~lLG~~e~st~SNiEqq---~~~Fv~~t 413 (945) T protein:vir:10 349 -I-LS----------GGKFTWIDFKGKRRDMQFKELAEFVARKICAVYQVSPQDVGILEGSNKATAEVM---ASLTKAKG 413 (945) T ss_pred -e-ec----------CCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHcccCCCCCcchHHHH---HHHHHHHH Confidence 1 12 23567777743222122 344566778899999999999965444333333333 3336554 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhcCCCChhHHHHHhhceeeeeeccchHHHHHHHHHHHHHHHHHHHhhhhcchhhhHHHH Q lcl|NC_015285. 210 IARLRKRFSELFTDLLKTQLILKGVMSLEEWEDMKNHIQFDFIADNYFTELKEIEIRNERMNQVAAMDPYVGKYFSVDYM 289 (359) Q Consensus 210 I~rLr~rFs~if~d~Lk~QLiLkgI~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~~~~~dp~vGKy~S~~~i 289 (359) +..+..++...+...|.. .. ....+.++|..+..-.. ..|.++++.+-.- -+++.+-+ T Consensus 414 L~Pil~~IEqeLNrkLl~---------~~----eg~~i~fdFd~ldl~D~-------ksraEal~kli~s--GiLTiNEv 471 (945) T protein:vir:10 414 LEPLMATISKGFDEVVSE---------FR----NEKDIKLWFKEDDLEKE-------RDWWNIIQGQLNT--GFRSINEA 471 (945) T ss_pred HHHHHHHHHHHHHHhccc---------cc----cCceeEEEecchhccCH-------HHHHHHHHHHHhC--CCcCHHHH Confidence 777777776666654421 11 13567888877764332 3455555544222 35666666 Q ss_pred HHHHhCCCHHHHHHHHHHHHHHHhcCCCCCCcchh---hhcCCCCCcccccc-c---CCCCCcCCCCCCCCCccCCC Q lcl|NC_015285. 290 RRQVLKQTEIEIKEIDEQIASEMEAGIIADPMAEM---DPAMAAGGEGAPAA-E---VDPNAQESSVDPGDVRRGEF 359 (359) Q Consensus 290 ~k~IL~~tDeeI~e~~kqi~~E~~~~~~~~P~~~~---~~~~~~~~~~~~~~-~---~~~~~~~~~~~p~~~~~~~~ 359 (359) +.. +++.+-+ --++..-. .+.. .|.++. ..+..++..+.+.+ . ..++..+.+..|.+.+...- T Consensus 472 Re~-lGLpPIe--GGD~lli~--~nn~--~P~d~~~ka~~ga~p~q~aq~~~dqp~~kGGe~dEns~~psE~kda~~ 541 (945) T protein:vir:10 472 RME-KGLEPVP--WGDVPFSG--LRNW--KPEDEQAKAQQGAMPPQLAQAMADQPSQQGGGVDENSSVPSEQKNAGL 541 (945) T ss_pred HHH-hCCCCCC--Ccceeeec--cccc--cccccccccccCCCCcccccCCCCCCCCCCCCCCCCCCCCCcccchHH Confidence 643 6665431 00000000 0000 111100 00000000000000 0 00111111222222222211 No 51 >protein:vir:79647 Length: 435 # NCBI annotation: PorT # Family: family:all:297 # MgeID: mge:1872 # MgeName: TLS # Cross-refs: genbank:acc:YP_001285520;genbank:gi:148734503;genbank:GeneID:5220005 Probab=94.05 E-value=0.0056 Score=32.95 Aligned_cols=272 Identities=12% Similarity=0.130 Sum_probs=107.2 Q ss_pred CCCc---hhhHHHhhhhhhheeecc--------------------ccccc---cCCCceeecHhHh-hhhhcccccC--C Q lcl|NC_015285. 1 MRGV---DLNQQLTQKAAEYFLYNP--------------------KGLKN---STNQGMKITTDSV-TYCHSGIQDL--N 51 (359) Q Consensus 1 ~~~~---~~~~~~~~~~~e~f~yn~--------------------~~~~~---~~~~~v~i~~~ai-~y~hSGl~d~--~ 51 (359) ++|+ .+-+.+-..+..+-+++| ..... ++..+++|++|=+ +++..-+-+. . T Consensus 113 ~~d~~~~~~Pl~~~g~i~~i~v~d~~~i~~~~~~~dp~sp~fg~P~~y~v~~~~~~~~~~iH~SRli~~~g~~~p~~~~~ 192 (435) T protein:vir:79 113 VADNKMLKSPVKPGAQLEDIRVYDRYQITIHERETNARSVRYGEPKLYKISPGGDIPEFFVHYSRICIIDGERVSNEKRR 192 (435) T ss_pred ecCCCCcccccccCCceeeEEeechhhccchhhccCCcccccCcceEEEEecCCCCCceEEcceeEEEecCCcchhhhcc Confidence 1111 000000011111112221 11111 1223567777653 3332222111 1 Q ss_pred CCcc--hhhH-HHHHHHHHHHHHHH-H-HHHHHHHhcCccceeEec-cCCCC-----chHHHHHHHHHHHHhhcc---eE Q lcl|NC_015285. 52 KNMT--LSHL-HKAIKAVNQLRMIE-D-SLVIYRLSRAPERRIFYI-DVGNL-----PKNKAEQYLREVMGRYRN---KM 117 (359) Q Consensus 52 ~~~i--~syL-~~Aik~~NqL~m~E-D-alVIyR~~RAPeRRvFyI-DvGnl-----pk~KAeqYl~~iM~kyrn---kl 117 (359) .+.. .|-| +++...+.+..... - +.++++-. . +|+.+ ++.++ ....+..-+. .++++|+ -+ T Consensus 193 ~~~~~G~S~l~e~~~~~l~~~~~~~~~~~~l~~~~~---~-~v~~~~~l~~~~~~~~~~~~~~~r~~-~~~~~~~~~~~~ 267 (435) T protein:vir:79 193 QNDGWGASILNKRLIEAIVDYNYCQELATQLLRRKQ---Q-AVWKARDLALMCDDEEGRYAARLRLA-QVDDESGVGKAI 267 (435) T ss_pred ccCcccchHHHHHHHHHHHHHHHHHHHHHHHHHHhc---C-ccccchhHHHhhcCccchHHHHHHHH-HHHHhcCCCCce Confidence 1111 1222 33333222222221 1 23344331 2 23444 22221 1111111111 1223322 12 Q ss_pred EeeCCCCcccccccchhhHhhhcccccCCCCccceeecCCCCCcchHHHH-HHHHHHHHHhcCCCccccCCC--Cccccc Q lcl|NC_015285. 118 VYDANTGEIKDDKKFMSMMEDFWLPRREGGRGTEISTLPGGQNLGELEDV-KYFQKKLYKALNVPSSRLETE--TTFNIG 194 (359) Q Consensus 118 vYD~~TGevkdd~~~mSMlEDywLpRReGgrgTEIsTLpGgqnLgei~DV-~YF~kkLy~aL~VP~SRl~~~--~~~~~g 194 (359) +-|+.+ -+++++. .+|+-++|+ .+|...+-.+.+||+.+|-.+ +|+|- T Consensus 268 ~i~~~~--------------------------e~~e~~~--~~lsgl~~~~~~~~~~iaaa~~IP~t~L~G~s~~glns- 318 (435) T protein:vir:79 268 GIDATD--------------------------EEYEVLN--SDVSGVPEFLQEKIDRIVALTGIHEIIIKNKNTGGVSA- 318 (435) T ss_pred eEecCC--------------------------cceEEEe--cccCCHHHHHHHHHHHHHhhhCCCeeeeccCCcccccc- Confidence 211111 1233332 245555554 789999999999999998443 46642 Q ss_pred chhhhhHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCChhHHHHHhhceeeeeeccchHHHHHHHHHHHHHHHHHH Q lcl|NC_015285. 195 RAAEITRDEVKFQKFIARLRKRFSELFTDLLKTQLILKGVMSLEEWEDMKNHIQFDFIADNYFTELKEIEIRNERMNQVA 274 (359) Q Consensus 195 ~~~eItRDElKF~KFI~rLr~rFs~if~d~Lk~QLiLkgI~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~~~ 274 (359) -+.+ |--.|+.+|+++|.. .+..+|++- +.-.+.+ +.+.|.|..=..-+|...+|+...+.++++ T Consensus 319 tgd~---d~~~yyd~i~~~Qe~---~l~p~l~~l-~~li~~s--------~d~~~~f~pL~~~sekEkAei~~~~a~a~~ 383 (435) T protein:vir:79 319 SQNT---ALETFYKLIDRKRVE---DYKPILEFL-LPFMISE--------TEWSIEFEPLSVPSDKDKAEIMAKNVESVV 383 (435) T ss_pred chhH---HHHHHHHHHHHHHHH---HHHHHHHHH-HHHhhcC--------CCCeEEeCCCCCCCHHHHHHHHHHHHHHHH Confidence 2222 333499999999953 333333321 1111122 346688887777788888999998888887 Q ss_pred HhhhhcchhhhHHHHHHHHhCCCHHHHHHHHHHHHHHH--hcCC---CCCCcchhhhcCCCCCcccc Q lcl|NC_015285. 275 AMDPYVGKYFSVDYMRRQVLKQTEIEIKEIDEQIASEM--EAGI---IADPMAEMDPAMAAGGEGAP 336 (359) Q Consensus 275 ~~dp~vGKy~S~~~i~k~IL~~tDeeI~e~~kqi~~E~--~~~~---~~~P~~~~~~~~~~~~~~~~ 336 (359) .+-.- -.++.+-+++. ...+-.+. .... +++|+. .++..+.+++++- T Consensus 384 ~~~~~--g~i~~~e~r~~------------L~~~~~~~~~~~~~~~~~~~~~d-~~~~~~~e~g~~~ 435 (435) T protein:vir:79 384 KLKAE--QAINLKETRDT------------LRSICPDLKIMDNDNIELPEPED-LDPEPGQEGGLNK 435 (435) T ss_pred HHHhc--CCCCHHHHHHH------------HHHhccccCCCCcccccCCcccc-CCCCCCCCCCCCC Confidence 76332 12343333322 21111110 1100 111111 1111111111111 No 52 >protein:vir:103219 Length: 201 # NCBI annotation: hypothetical protein # Family: family:all:297 # MgeID: mge:1605 # MgeName: JK06 # Cross-refs: genbank:acc:YP_277473;genbank:gi:71834115;genbank:GeneID:3562330 Probab=93.82 E-value=0.0052 Score=33.11 Aligned_cols=189 Identities=14% Similarity=0.119 Sum_probs=86.4 Q ss_pred eEeccCCCCchHHHHHHHHHHHHhhcceEEeeCCCCcccccccchhhHhhhcccccC--CC----Ccc-ceeecCCCCCc Q lcl|NC_015285. 89 IFYIDVGNLPKNKAEQYLREVMGRYRNKMVYDANTGEIKDDKKFMSMMEDFWLPRRE--GG----RGT-EISTLPGGQNL 161 (359) Q Consensus 89 vFyIDvGnlpk~KAeqYl~~iM~kyrnklvYD~~TGevkdd~~~mSMlEDywLpRRe--Gg----rgT-EIsTLpGgqnL 161 (359) ||.++. +.+-+ +...+++ ++.|.+..++ |+ ++ +.+ +..++ ..+| T Consensus 1 V~k~~~------------------l~~~~--~~~~~~~---~~r~~~~~~~----~~~~~~~~ld~~~e~~e~~--~~~l 51 (201) T protein:vir:10 1 MWKAKG------------------LADLC--DDSDGAA---RLRLAQVDNN----SGVGQAIGIDADSEEYNVL--NSDI 51 (201) T ss_pred CccchH------------------HHHHh--cCChHHH---HHHHHHHHHh----hhhhhhheeecCCcceeee--ecCc Confidence 333221 00000 0000111 1122211111 10 00 000 11111 2356 Q ss_pred chHHH-HHHHHHHHHHhcCCCccccCCC--Ccccc-cchhhhhHHhhhHHHHHHHHHHH-HHHHHHHHHHHHHHhcCCCC Q lcl|NC_015285. 162 GELED-VKYFQKKLYKALNVPSSRLETE--TTFNI-GRAAEITRDEVKFQKFIARLRKR-FSELFTDLLKTQLILKGVMS 236 (359) Q Consensus 162 gei~D-V~YF~kkLy~aL~VP~SRl~~~--~~~~~-g~~~eItRDElKF~KFI~rLr~r-Fs~if~d~Lk~QLiLkgI~t 236 (359) +-++| +..|...+=.+.++|+.||-.+ +|+|- |.+ |.-.|+.+|..+|.+ +..+...+++ -+.. T Consensus 52 sGl~d~l~~~~~~iaa~s~iP~t~LfG~sp~Glnatge~-----d~~nyyd~i~~~Qe~~l~p~le~l~~-----~~~~- 120 (201) T protein:vir:10 52 GGIDTFLSQKFDRIVALSGIHEIILKGKNVGGVSASQNT-----ALETFYGYVDRKRKAELLPLLEFLLP-----FIVT- 120 (201) T ss_pred CChHHHHHHHHHHHHhHhcCchhhhcCCCCccccccchh-----HHHHHHHHHHHHHHHHHHHHHHHHHH-----hhcC- Confidence 66777 4578888999999999999444 57763 333 333599999999953 3444443333 2332 Q ss_pred hhHHHHHhhceeeeeeccchHHHHHHHHHHHHHHHHHHHhhhhcchhhhHHHHHHHHhCCCHHHHHHHHHHHHHHHhcCC Q lcl|NC_015285. 237 LEEWEDMKNHIQFDFIADNYFTELKEIEIRNERMNQVAAMDPYVGKYFSVDYMRRQVLKQTEIEIKEIDEQIASEMEAGI 316 (359) Q Consensus 237 ~eew~~~~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~~~~~dp~vGKy~S~~~i~k~IL~~tDeeI~e~~kqi~~E~~~~~ 316 (359) + +.++|.|..=..=+|...+||.....++++.+-.- -. ++.+|+-+.- ......+- T Consensus 121 ~-------~~~~~~f~pL~~~s~kekAei~~~~a~a~~~~~~~--g~------------i~~~e~r~~L---~~~~~~~~ 176 (201) T protein:vir:10 121 E-------QEWSVEFNPLSQVSDKDKSEILEKNVNSVAALIAA--GI------------IDADEARDTL---RAISTEVK 176 (201) T ss_pred C-------CCceEeeCCCCCCCHHHHHHHHHHHHHHHHHHHHc--CC------------CCHHHHHHHH---HhcCCcCC Confidence 2 35679999888889999999999999888776332 12 3444432211 11111221 Q ss_pred CCCCcchhhhcCCCCCcccccccCCCCCcCCCCCCCCC Q lcl|NC_015285. 317 IADPMAEMDPAMAAGGEGAPAAEVDPNAQESSVDPGDV 354 (359) Q Consensus 317 ~~~P~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~ 354 (359) ++ +... +.+...+-+.+|. ..|.|- T Consensus 177 ~~--~~~~------~~~~~~~e~~dp~-----~~~~~~ 201 (201) T protein:vir:10 177 IG--EGSI------QTEVVINESEDPL-----DVSANN 201 (201) T ss_pred CC--CCCC------CccccccccCCCC-----CCCCCC Confidence 11 1000 0000000001111 111111 No 53 >protein:vir:80680 Length: 441 # NCBI annotation: gp3 # Family: family:all:524 # MgeID: mge:1884 # MgeName: PA6 # Cross-refs: genbank:acc:YP_001285579;genbank:gi:148727085;genbank:GeneID:5247051 Probab=93.71 E-value=0.0066 Score=32.52 Aligned_cols=286 Identities=12% Similarity=0.117 Sum_probs=124.2 Q ss_pred CCCchhhHHH---------hhhhhhheeeccccccc--cCCC-c-eeecHhH--------hhhhhcccccCCCCcchhhH Q lcl|NC_015285. 1 MRGVDLNQQL---------TQKAAEYFLYNPKGLKN--STNQ-G-MKITTDS--------VTYCHSGIQDLNKNMTLSHL 59 (359) Q Consensus 1 ~~~~~~~~~~---------~~~~~e~f~yn~~~~~~--~~~~-~-v~i~~~a--------i~y~hSGl~d~~~~~i~syL 59 (359) +-++..+..+ ........+|.+..... ..++ + +.++... |.|++.--.++.-| .|-| T Consensus 132 i~d~~~~~~~~~~~~~~~~~~~~~~~~vy~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~~~~~~G--~s~l 209 (441) T protein:vir:80 132 KFSADGSRLDAGLVVQQTCDPEVVEAELLLPDVIVQVERRGSREWVEVDRIPNVLGAVPLVPIVNRRRTSRIDG--RSEI 209 (441) T ss_pred EEeCCCCceeEEEEEEEEecCceEEEEEEecCeEEEEEEcCCcceeeccccccCCCceeEEEeeccccCCccCC--cccc Confidence 1222211110 01111223344433221 1111 1 1122111 11221111000001 1112 Q ss_pred HHHHHHH-HH-HHHHHHHHHHHHHhcCccceeEeccCCCCchHHHHHHHHHHHHhhcceEEeeCCCCcccccccchhhHh Q lcl|NC_015285. 60 HKAIKAV-NQ-LRMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKMVYDANTGEIKDDKKFMSMME 137 (359) Q Consensus 60 ~~Aik~~-Nq-L~m~EDalVIyR~~RAPeRRvFyIDvGnlpk~KAeqYl~~iM~kyrnklvYD~~TGevkdd~~~mSMlE 137 (359) ...++++ .. -+++-+..++-+.+..|.|-+.=.+.+..+. +. .... + . T Consensus 210 ~~~v~~liDa~~~~~s~~~~~~~~~~~~~~~i~G~~~~~~~~--------~~--------------~~~~-------~-~ 259 (441) T protein:vir:80 210 TRSIRAYTDEAVRTLLGQSVNRDFYAYPQRWVTGVSADEFSQ--------PG--------------WVLS-------M-A 259 (441) T ss_pred hhhHHHHHHHHHHHHHHHHHHHHhhcCceeeeecCCcccccc--------ch--------------hhhc-------c-c Confidence 2222221 11 2345567777788888877554222111110 00 0000 0 1 Q ss_pred hhc-ccccCCCCccceeecCCCCCcchHHHHHHHHHHHHHhcCCCccccCCCCcccccchhhhhHHhhhHHHHHHHHHHH Q lcl|NC_015285. 138 DFW-LPRREGGRGTEISTLPGGQNLGELEDVKYFQKKLYKALNVPSSRLETETTFNIGRAAEITRDEVKFQKFIARLRKR 216 (359) Q Consensus 138 Dyw-LpRReGgrgTEIsTLpGgqnLgei~DV~YF~kkLy~aL~VP~SRl~~~~~~~~g~~~eItRDElKF~KFI~rLr~r 216 (359) -+| +|--+++.+.++..+|+..-=.-++-++=....++...++|.+-|...+. +...|..|.--+...-.-+.+.++. T Consensus 260 ~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~~~~~~~~p~~~~g~~~~-~~~Sg~Al~~~~~~l~~k~~~~~~~ 338 (441) T protein:vir:80 260 SVWAVDKDDDGDTPNVGSFPVNSPTPYSDQMRLLAQLTAGEAAVPERYFGFITS-NPPSGEALAAEESRLVKRAERRQTS 338 (441) T ss_pred ccccCCCCCCCCcceeEecCccchHHHHHHHHHHHHHHhcccCCCHHHhccCCC-cchHHHHHHHHHHHHHHHHHHHHHH Confidence 122 34444455567777776431122333444556777778888777754322 1112333444444466677888888 Q ss_pred HHHHHHHHHHHHHHhcCCCChhHHHHHhhceeeeeeccchHHHHHHHHHHHHHHHHHHHhhhhcchhhhHHHHHHHHhCC Q lcl|NC_015285. 217 FSELFTDLLKTQLILKGVMSLEEWEDMKNHIQFDFIADNYFTELKEIEIRNERMNQVAAMDPYVGKYFSVDYMRRQVLKQ 296 (359) Q Consensus 217 Fs~if~d~Lk~QLiLkgI~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~~~~~dp~vGKy~S~~~i~k~IL~~ 296 (359) |..-+...++.=+-+.|.. .+|......+.+.|.....=+. .+.++.+.++..-+-...|.++++ ..|++ T Consensus 339 f~~~l~~~~~l~~~~~~~~--~~~~~~~~~i~~~f~~~~~~~~-------~e~ad~~~kl~~~g~~~~s~~~~~-~~l~~ 408 (441) T protein:vir:80 339 FGQGWLSVGFLAAKALDSR--VDEADFFGDVGLRWRDASTPTR-------AATADAVTKLVGAGILPADSRTVL-EMLGL 408 (441) T ss_pred HHHHHHHHHHHHHHHhcCC--CcccccceeeeEEeCCCCCcCH-------HHHHHHHHHHHhcCcccccHHHHH-HhCCC Confidence 8888888887655554543 3344445678888875433222 345555555544322345777776 56899 Q ss_pred CHHHHHHHHHHHHHHHhcCCCCCCcchhhhcCCCCCcccccccCCCCCc Q lcl|NC_015285. 297 TEIEIKEIDEQIASEMEAGIIADPMAEMDPAMAAGGEGAPAAEVDPNAQ 345 (359) Q Consensus 297 tDeeI~e~~kqi~~E~~~~~~~~P~~~~~~~~~~~~~~~~~~~~~~~~~ 345 (359) +++|++++.++.+++.. +..+. ++. -. -+|.-. T Consensus 409 ~~~e~~~~~~e~~e~~~------~~~~~--~~~---~~-----~~~~~~ 441 (441) T protein:vir:80 409 DDVQVEAVMRHRAESSD------PLAVL--AGA---IS-----RQTNEV 441 (441) T ss_pred CHHHHHHHHHHHHHHHH------HHHHH--hhh---hh-----cccccC Confidence 99999876654333211 11111 000 00 011100 No 54 >protein:vir:4782 Length: 522 # NCBI annotation: putative minor capsid protein 1 # Family: family:all:898 # MgeID: mge:104 # MgeName: MM1 # Cross-refs: genbank:acc:NP_150162;swissprot:trembl:q94m49;genbank:gi:26553451;uniprot:Q94M49;genbank:GeneID:955983 Probab=93.55 E-value=0.0072 Score=32.34 Aligned_cols=294 Identities=14% Similarity=0.147 Sum_probs=123.4 Q ss_pred CCCchhhHH----Hh-hhhhhheeeccccccccCCCceeecHhH----------hhhhhcccc---------------cC Q lcl|NC_015285. 1 MRGVDLNQQ----LT-QKAAEYFLYNPKGLKNSTNQGMKITTDS----------VTYCHSGIQ---------------DL 50 (359) Q Consensus 1 ~~~~~~~~~----~~-~~~~e~f~yn~~~~~~~~~~~v~i~~~a----------i~y~hSGl~---------------d~ 50 (359) ..+.+.... .. .+.-++.+|.-.. +..-|..++-+. +++ +|+- +. T Consensus 190 ~~~~~~~~~~~~~~~~~~~I~n~ly~~~~---~~~lG~~v~l~~~~e~~~l~~~~~~--~~~~~Plf~y~~~~~~N~~~~ 264 (522) T protein:vir:47 190 WVTADGQETGSTNDKKYYRITNELYRSDV---NDVLGQRVNLSELDKYKNLEPVTVF--ENLSRPLFTYLKTPGMNNKDI 264 (522) T ss_pred ecccccccccccccCCceEEEEEEeecCC---CcccCccccccccccccCCCCceEe--CCCCcceEEEecCCccccccc Confidence 111111110 01 1122344452210 000011110000 000 0100 00 Q ss_pred CCCcchhhHHHHHHHHHHHHHHHHHHHHHHHhcCccceeEeccCCCCchHHHHHHHHHHHHhhcceEEeeCCCCcccccc Q lcl|NC_015285. 51 NKNMTLSHLHKAIKAVNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKMVYDANTGEIKDDK 130 (359) Q Consensus 51 ~~~~i~syL~~Aik~~NqL~m~EDalVIyR~~RAPeRRvFyIDvGnlpk~KAeqYl~~iM~kyrnklvYD~~TGevkdd~ 130 (359) .+..=+|-++.|+-.+.-|-..= -.+.|.+|.=.+|||. |-.=++ ..-+..+|+...-. T Consensus 265 ~splG~S~~~~~~~~id~lD~~~--s~~~~e~~~g~~~i~v-~~~~l~------------------~~~~~~~g~~~~~~ 323 (522) T protein:vir:47 265 NSPLGLSIFDNAKTTIDFINRSY--DEFMWEVRMGQRRVIV-PEHLTQ------------------RQYQRPDGTIDFRP 323 (522) T ss_pred CCCcCCchhhhhHHHHHHHHHHH--HHHHHHHHhccceeec-chHHhc------------------cCCCCCCccccccc Confidence 11112455666665555444333 3456777877788775 111100 11122233211000 Q ss_pred cchhhHhhhcccccCC-CCccceeecCCCCCcchHH-HHHHHHHHHHHhcCCCccccCCCCcccccchhhhhHHhhhHHH Q lcl|NC_015285. 131 KFMSMMEDFWLPRREG-GRGTEISTLPGGQNLGELE-DVKYFQKKLYKALNVPSSRLETETTFNIGRAAEITRDEVKFQK 208 (359) Q Consensus 131 ~~mSMlEDywLpRReG-grgTEIsTLpGgqnLgei~-DV~YF~kkLy~aL~VP~SRl~~~~~~~~g~~~eItRDElKF~K 208 (359) .+- --+.+|.+-..+ +-|--|+++...---++.. =+..+.+.+=...+++-+-|..+++ ....++||...+-.-.. T Consensus 324 ~fd-~~~~~f~~~~~~~~~~~~i~~~~~~ir~e~~~~~~~~~l~~i~~~~gls~~tf~~~~~-~~kTAtEi~s~~~~~~~ 401 (522) T protein:vir:47 324 RFD-VEQNVYMQIGGSSMDAGGITDLTSPIRANDYILAISEGLKLFEMQIGVSSGMFTFDGQ-GMKTATEIVSENSDTYQ 401 (522) T ss_pred ccC-cccceEeecCCCCCCCCcceeeccccChHHHHHHHHHHHHHHHHHhCCCccccCcccc-ccccHHHHHHHHHHHHH Confidence 000 001111111110 0111255554433222221 2455556666667777777766544 23457777666666666 Q ss_pred HHHHHHHHHHHHHHHHHHHHHH-------hcCCCChhHHHHHhhceeeeeeccchHHHHHHHHHHHHHHHHHHHhhhhcc Q lcl|NC_015285. 209 FIARLRKRFSELFTDLLKTQLI-------LKGVMSLEEWEDMKNHIQFDFIADNYFTELKEIEIRNERMNQVAAMDPYVG 281 (359) Q Consensus 209 FI~rLr~rFs~if~d~Lk~QLi-------LkgI~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~~~~~dp~vG 281 (359) -+.+.|+.+...+.++++.=|- +.|.... + ..|.++|. |+.+.. +++++-..+ .+++ . | T Consensus 402 t~~~~~~~~~~al~~lv~~i~~l~~~~~~~~~~~~~-~-----~~i~v~f~-D~i~~D-~~~~~~~~~-~~v~--a---G 467 (522) T protein:vir:47 402 MRSSIVALVEQSIKELCVSMCELGKAVGVYSGEIPE-L-----DDISVNLD-DGVFTD-RHAELDYWA-KMVA--A---G 467 (522) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhhhhccCCCCC-c-----ceeEEEcC-CCCCCC-HHHHHHHHH-HHHh--c---C Confidence 6777777777777777766542 2332222 1 34777887 554444 222222211 1111 1 3 Q ss_pred hhhhHHHHHHHHhCCCHHHHHHHHHHHHHHHhcCCCCCCcchhhhcCCCCCcccccccCCCCCcCCCCCCCCCccC Q lcl|NC_015285. 282 KYFSVDYMRRQVLKQTEIEIKEIDEQIASEMEAGIIADPMAEMDPAMAAGGEGAPAAEVDPNAQESSVDPGDVRRG 357 (359) Q Consensus 282 Ky~S~~~i~k~IL~~tDeeI~e~~kqi~~E~~~~~~~~P~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~ 357 (359) .+|....+.+..++||+|.+++-++|++|.... .|.+ .+.. |+.... ..||+. +| T Consensus 468 -~~s~e~~i~~~~g~~eeea~~el~ri~~E~~~~---~~~~-~~~~---~~~~~~------------~~~~d~-~~ 522 (522) T protein:vir:47 468 -FSTKKRAIGKTLNISGVEAEKELNAINSELLPM---NDAE-LAIY---GMHDQN------------EEKADD-KG 522 (522) T ss_pred -CCCHHHHHHhcCCCChHHHHHHHHHHHHhhccC---CCCC-CCCC---CCCCcc------------cccCCC-CC Confidence 568877666778999999999999999985432 1110 1000 000000 000000 00 No 55 >protein:vir:96738 Length: 505 # NCBI annotation: putative phage-related protein # Family: family:all:47 # MgeID: mge:1628 # MgeName: VP882 # Cross-refs: genbank:acc:YP_001039817;genbank:gi:126010916;genbank:GeneID:5076248 Probab=93.44 E-value=0.0075 Score=32.22 Aligned_cols=290 Identities=13% Similarity=0.146 Sum_probs=147.0 Q ss_pred CCC---chhhHHH--hhhhhhheeec--ccccc----ccCCCceeecHhHhhhhhcccccCCCCcchhhHHHHHHHHHHH Q lcl|NC_015285. 1 MRG---VDLNQQL--TQKAAEYFLYN--PKGLK----NSTNQGMKITTDSVTYCHSGIQDLNKNMTLSHLHKAIKAVNQL 69 (359) Q Consensus 1 ~~~---~~~~~~~--~~~~~e~f~yn--~~~~~----~~~~~~v~i~~~ai~y~hSGl~d~~~~~i~syL~~Aik~~NqL 69 (359) ..+ -..||++ .....-|++++ |.... ......++||.+-|..++.-.-..-.. =+|.|+.+++.+.+| T Consensus 187 ~~~~~~i~~GIe~d~~Gr~~aY~i~~~hPgd~~~~~~~~~~~~~rvpa~~vlH~f~~~r~gQ~R-Gis~lapvl~~l~~l 265 (505) T protein:vir:96 187 LQNGNRIRMSIELDAWERPVAYHLLVNHPGDNSYCYHYAGQTYERVPADEIIHTFVPWRPHQNR-GIPWTHASMVELHHI 265 (505) T ss_pred cCCcCeEEeceEECCCCceEEEEEeecCCCccccccccccccccccCHhHhhhhhcccCCcccc-CcchHHHHHHHHHHH Confidence 111 1234443 33344577764 43221 223456889998888777654332222 289999999999999 Q ss_pred HHHHHHHHHHHHhcCccceeEeccCCCCchHHHHHHHHHHHHhhcceEEeeCCCCcccccccchhhHhhhcccccCCCCc Q lcl|NC_015285. 70 RMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKMVYDANTGEIKDDKKFMSMMEDFWLPRREGGRG 149 (359) Q Consensus 70 ~m~EDalVIyR~~RAPeRRvFyIDvGnlpk~KAeqYl~~iM~kyrnklvYD~~TGevkdd~~~mSMlEDywLpRReGgrg 149 (359) .-.+||..+-...-|-.=-+..=|.+.+.... .| ..|+ ... . -+-| T Consensus 266 ~~y~dael~~a~i~A~~a~fi~~~~~~~~~~~-----------------~~-~~~~-----~~~----~-------l~pG 311 (505) T protein:vir:96 266 GEYRKSEMIAAELGAKKVGFYEQDPEAYDQPP-----------------ED-DQGE-----IVE----E-------VEAG 311 (505) T ss_pred hHHHHHHHHHHHHhhhheeeeecCCccCCCcc-----------------cc-ccCc-----ccc----c-------cCCc Confidence 99999999988887765444333443332110 00 0011 000 0 1123 Q ss_pred cceeecCCCCC---------cchHHHH-HHHHHHHHHhcCCCccccCCC-CcccccchhhhhHHhhhHHHHHHHHHHHHH Q lcl|NC_015285. 150 TEISTLPGGQN---------LGELEDV-KYFQKKLYKALNVPSSRLETE-TTFNIGRAAEITRDEVKFQKFIARLRKRFS 218 (359) Q Consensus 150 TEIsTLpGgqn---------Lgei~DV-~YF~kkLy~aL~VP~SRl~~~-~~~~~g~~~eItRDElKF~KFI~rLr~rFs 218 (359) | |.+|+.|+. -+..++. +-..+.+=.+|+||-+-|..+ ++.|. |.+.-.-+.|-+.+.++|..|. T Consensus 312 ~-i~~L~pGe~i~~~~~~~p~~~~~~f~~~~lr~iaaglgi~ye~lt~D~s~~nY---SS~R~~~~e~~r~~~~~q~~~~ 387 (505) T protein:vir:96 312 T-YQLLPYGIRFKEHKIDHPHTNFGAFVKSSLRGVAAGMGPAYNRLAHDLEGVNF---SSLRSGELDERDLYKLLQFFVV 387 (505) T ss_pred e-eeecCCCCeeeeeCCCCCCCCHHHHHHHHHHHHHhhcCCCHHHHhcccccccH---HHHHHHHHHHHHHHHHHHHHHH Confidence 3 555555543 3343333 333344556899998888665 44554 2223344559999999999988 Q ss_pred HHHHHH-----HHHHHHhcCCCChhHHHHH-hhceeeeeeccchHHHHHHHHHHHHHHHHHHHhhhhcchhhhHHHHHHH Q lcl|NC_015285. 219 ELFTDL-----LKTQLILKGVMSLEEWEDM-KNHIQFDFIADNYFTELKEIEIRNERMNQVAAMDPYVGKYFSVDYMRRQ 292 (359) Q Consensus 219 ~if~d~-----Lk~QLiLkgI~t~eew~~~-~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~~~~~dp~vGKy~S~~~i~k~ 292 (359) .-|..+ |+ ..+|.|.++.-.++.- .-...|..-.--+.-.+||+.-...+++. -.-|.+-+..+ T Consensus 388 ~~~~~pi~~~~l~-~a~l~G~i~~p~~~~~~~~~~~w~~p~~~~iDP~Ke~~a~~~~i~~---------G~~t~~~~~a~ 457 (505) T protein:vir:96 388 TELLERVAGNLIS-MSLLTQALPLNMVDIDRLSQYAFQPRGWDWVDPAKDSKAHSESIKN---------RTRSRSSIIRA 457 (505) T ss_pred HHHHHHHHHHHHH-HHHHcCCcCCCCccchhhceeeeccCCccccChHHHHHHHHHHHHc---------CCCCHHHHHHH Confidence 755554 43 5678887764333211 11223333333445667777666555442 23355556655 Q ss_pred HhCCCHHHHH-HHHHHHHHHHhcCCCCCCcchhhhcCCCCCcccccccCCCCCcCCCCCCCCC Q lcl|NC_015285. 293 VLKQTEIEIK-EIDEQIASEMEAGIIADPMAEMDPAMAAGGEGAPAAEVDPNAQESSVDPGDV 354 (359) Q Consensus 293 IL~~tDeeI~-e~~kqi~~E~~~~~~~~P~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~ 354 (359) .+..-+|+- |.+...+..++.|+.+++.. .+.. .+..+..+.+|+|- T Consensus 458 -~G~D~~~v~~q~a~e~~~~~~~Gl~~~~~~-------~~~~-------~~~~~~~~~~~~d~ 505 (505) T protein:vir:96 458 -AGDDPEDVFDEIAWEEQLMRDKGVNPTPPE-------QESK-------DATTDEEDDSASDD 505 (505) T ss_pred -cCCCHHHHHHHHHHHHHHHHHcCCCCCCCC-------CCCC-------CCCCCCCCCCCCCC Confidence 465554443 22233333334444222211 0000 11111222222222 No 56 >protein:vir:9568 Length: 410 # NCBI annotation: gp34 # Family: family:all:524 # MgeID: mge:171 # MgeName: SM1 # Cross-refs: genbank:acc:NP_862873;genbank:gi:32469465;genbank:GeneID:1461310 Probab=93.34 E-value=0.0079 Score=32.11 Aligned_cols=265 Identities=9% Similarity=0.059 Sum_probs=134.8 Q ss_pred CCCchhhHHHh-----------hhhhhheeeccccccc--cCCCceeecH-----hHhhhhhcccccCCCC------cch Q lcl|NC_015285. 1 MRGVDLNQQLT-----------QKAAEYFLYNPKGLKN--STNQGMKITT-----DSVTYCHSGIQDLNKN------MTL 56 (359) Q Consensus 1 ~~~~~~~~~~~-----------~~~~e~f~yn~~~~~~--~~~~~v~i~~-----~ai~y~hSGl~d~~~~------~i~ 56 (359) +-|+.++ ++. .......+|.|.-... ..+..-.++. -.|.|+|..=.+..-| -|+ T Consensus 117 i~Dp~~~-~~~~al~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvV~f~n~~~l~~~~G~s~I~~~v~ 195 (410) T protein:vir:95 117 VIDPITG-LLVEGYAVLARDDYNRPTLEAYFEPNATHFIPKDGEPYSVTNETGIPLLVPVIHRPDAVRPFGRSRITRAGM 195 (410) T ss_pred EEeCCCC-ceEEEEEEEEecCCCeEEEEEEEeCCcEEEEeeCCccccccCCCCCcceEEecccccCCccCCccccchhHH Confidence 1121111 010 0112223443322110 0000001111 1234443311111111 144 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHhcCccceeEeccCCCCchHHHHHHHHHHHHhhcceEEeeCCCCcccccccchhhH Q lcl|NC_015285. 57 SHLHKAIKAVNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKMVYDANTGEIKDDKKFMSMM 136 (359) Q Consensus 57 syL~~Aik~~NqL~m~EDalVIyR~~RAPeRRvFyIDvGnlpk~KAeqYl~~iM~kyrnklvYD~~TGevkdd~~~mSMl 136 (359) +..+++- +.|.++++.=..+=.|.|-++=+|.-.-|..+-..+ +- T Consensus 196 ~l~da~~------r~~~~~~~~~e~~a~pqr~i~G~d~d~~~~~~~~~~-----------------------------~~ 240 (410) T protein:vir:95 196 YYQKYAK------RTLERADITAEFYSWPQKYILGLDPDAEPMEKWKAT-----------------------------VS 240 (410) T ss_pred HHHHHHH------HHHHHHHHHHHHhcchhheeeccCCCCCcCchhhhh-----------------------------hh Confidence 4444443 456777888888888988887655422121111111 11 Q ss_pred hhhcccccCCCCccceeecCCCCCcch-HHHHHHHHHHHHHhcCCCccccCCCCcccccchhhhhHHhhhHHHHHHHHHH Q lcl|NC_015285. 137 EDFWLPRREGGRGTEISTLPGGQNLGE-LEDVKYFQKKLYKALNVPSSRLETETTFNIGRAAEITRDEVKFQKFIARLRK 215 (359) Q Consensus 137 EDywLpRReGgrgTEIsTLpGgqnLge-i~DV~YF~kkLy~aL~VP~SRl~~~~~~~~g~~~eItRDElKF~KFI~rLr~ 215 (359) -=..+|.-++|-+.+|..++++. |+. ++-++=.-..+....++|..-|+..+. |-..+..|.-.|....+-+.+-|+ T Consensus 241 ~i~~~~~~~~~~~~~v~q~~~~~-l~~~~~~l~~l~~~~a~~s~lP~~~lg~~~~-NpsSa~Al~a~~~~L~~ka~~k~~ 318 (410) T protein:vir:95 241 SLLTISSSDKGVKPSVGQFTTAS-MSPFTEQLRTAAAGFAGEMGLTLDDLGFVSD-NPSSVEAIKASHENLRLAGRKAQR 318 (410) T ss_pred hheeccCCCCCCcceEEecCCCC-hHHHHHHHHHHHHHHhhhcCCCHHHhccccC-chhHHHHHHHHHHHHHHHHHHHHH Confidence 12445666777778888898866 443 344555556666777999888865432 222334577778889999999999 Q ss_pred HHHHHHHHHHHHHHHhcCCCC--hhHHHHHhhceeeeeec--cchHHHHHHHHHHHHHHHHHHHhhhhcchhhhHHHHHH Q lcl|NC_015285. 216 RFSELFTDLLKTQLILKGVMS--LEEWEDMKNHIQFDFIA--DNYFTELKEIEIRNERMNQVAAMDPYVGKYFSVDYMRR 291 (359) Q Consensus 216 rFs~if~d~Lk~QLiLkgI~t--~eew~~~~~~I~~~f~~--Dn~f~ElKe~Eil~~Rl~~~~~~dp~vGKy~S~~~i~k 291 (359) .|..-+.++++.-+.+.+-.. +.+|. .+.+.|.. |- +.--+.++.+.+..+..-+=.+.+.++++ T Consensus 319 ~fg~~l~~~~rla~~i~~~~~~~~~~~~----~~~v~W~p~~d~------~~~s~a~~aDa~~Kl~~a~~g~~~~~~~~- 387 (410) T protein:vir:95 319 SLGAGLLNVAYVAACLRDEFRYTRSQFV----RTAVKWEPLFEA------DANTMTMIGDGVVKLNQALPGYINAETIR- 387 (410) T ss_pred HHHHHHHHHHHHHHHHhcCCCCcccccc----eeeEEeeecCCc------chhhHHHHHHHHHHHHHhccCCccHHHHH- Confidence 999999999999888766543 23333 34555541 21 11123445554443333211234555555 Q ss_pred HHhCCCHHHHHHHHHHHHHHHhcCC Q lcl|NC_015285. 292 QVLKQTEIEIKEIDEQIASEMEAGI 316 (359) Q Consensus 292 ~IL~~tDeeI~e~~kqi~~E~~~~~ 316 (359) +.|++||++|..... ++....|- T Consensus 388 ~~lg~~~~~~~~~~~--~e~~~~g~ 410 (410) T protein:vir:95 388 DLTGIAGDMSAKPVV--SEGGSNGE 410 (410) T ss_pred HhcCCChHHHHHHHH--HHHHhCCC Confidence 569999998764332 22222322 No 57 >protein:vir:99522 Length: 470 # NCBI annotation: putative protein # Family: family:all:125 # MgeID: mge:1559 # MgeName: Lj928 # Cross-refs: genbank:acc:NP_958533;genbank:gi:41179315;genbank:GeneID:2717160 Probab=92.89 E-value=0.0096 Score=31.65 Aligned_cols=292 Identities=11% Similarity=0.069 Sum_probs=117.1 Q ss_pred CCCchhhHHHhhhh------------hhheeeccccccccCCCceeecHhHh-hhhhc-cccc----CCCCcchhhHHHH Q lcl|NC_015285. 1 MRGVDLNQQLTQKA------------AEYFLYNPKGLKNSTNQGMKITTDSV-TYCHS-GIQD----LNKNMTLSHLHKA 62 (359) Q Consensus 1 ~~~~~~~~~~~~~~------------~e~f~yn~~~~~~~~~~~v~i~~~ai-~y~hS-Gl~d----~~~~~i~syL~~A 62 (359) +-++..+.+++-.+ .-+.+|.+...+.-...+...+...+ ...|- |.++ .++..-.|=++.. T Consensus 158 i~d~~~~~~~~~~vr~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~g~sd~e~v 237 (470) T protein:vir:99 158 IYDDTVQRQPLAFVHYQIDNSNNWTDAYGVIQYADKFYKFKGYDIEEDTNAAGYAINPYGLVPAVEFFENEERQGIFDSI 237 (470) T ss_pred EEcCCCCcceEEEEEEEEEecCCeeEEEEEEEecCeEEEEEecccccccccccccccCCCccceEeecCCCCCCcchHhH Confidence 11111111111110 11223333221100000000000000 00111 1111 0111112333333 Q ss_pred HHHHHHHH-HHHHHHHHHHHhcCccceeEeccCCCCchHHHHHHHHHHHHhhcceEEeeCCCCcccccccchhhHhhhcc Q lcl|NC_015285. 63 IKAVNQLR-MIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKMVYDANTGEIKDDKKFMSMMEDFWL 141 (359) Q Consensus 63 ik~~NqL~-m~EDalVIyR~~RAPeRRvFyIDvGnlpk~KAeqYl~~iM~kyrnklvYD~~TGevkdd~~~mSMlEDywL 141 (359) +....-+. ++=+....-+..+.|.+-+.- ..++..+.-+-+..+ -.+++++ + T Consensus 238 ~~liDa~~~~~s~~~~~~~~~~~~~~~i~g---~~~~~~~~g~~~~~~---~~~~~~~---------------------~ 290 (470) T protein:vir:99 238 KTLINALDKVISQKANQVEYFDNAYMYMIG---FKLPEDDEGNPKFDF---KNNRVLY---------------------V 290 (470) T ss_pred HHHHHHHHHHHHHHHHHHHHhcCceeeeec---CCcccccccchhhhh---hhcceee---------------------e Confidence 33322222 444555555666776665533 222211110111110 0122211 2 Q ss_pred cccCCCCccceeecCCCCCcchHHH-HHHHHHHHHHhcCCCccccCCCCcccccchhhhhHHhhhHHHHHHHHHHHHHHH Q lcl|NC_015285. 142 PRREGGRGTEISTLPGGQNLGELED-VKYFQKKLYKALNVPSSRLETETTFNIGRAAEITRDEVKFQKFIARLRKRFSEL 220 (359) Q Consensus 142 pRReGgrgTEIsTLpGgqnLgei~D-V~YF~kkLy~aL~VP~SRl~~~~~~~~g~~~eItRDElKF~KFI~rLr~rFs~i 220 (359) |=-+++.+..+.+|....+...... +.-+.+.+|...++|-.-.+..++ |. .+..|...+.....-+.+.+..|... T Consensus 291 ~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~-n~-Sg~Ai~~~~~~l~~k~~~~~~~~~~~ 368 (470) T protein:vir:99 291 SQLDPDTNPQIGFIAKPDADQMQENLIQHLTDFIFMMAMVPNIQDKNFAG-NS-SGVALQYKLFAMKNKADSKERKFDKS 368 (470) T ss_pred cCCCCCCCCcceEEeecCChHHHHHHHHHHHHHHHHHhCCcccccccccc-Cc-hHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 2223344556777776666665544 788889999999999422222111 11 23344444444555677777888887 Q ss_pred HHHHHHHHHHhcCCCChhHHHHHhhceeeeeeccchHHHHHHHHHHHHHHHHHHHhhhhcchhhhHHHHHHHHhCCCHHH Q lcl|NC_015285. 221 FTDLLKTQLILKGVMSLEEWEDMKNHIQFDFIADNYFTELKEIEIRNERMNQVAAMDPYVGKYFSVDYMRRQVLKQTEIE 300 (359) Q Consensus 221 f~d~Lk~QLiLkgI~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~~~~~dp~vGKy~S~~~i~k~IL~~tDee 300 (359) +.++++.-+-+-+.....+++ ...|.+.|...-.-.+. +.++++..+. | .+|.++++.. |...|. T Consensus 369 l~~~~~li~~~~~~~~~~~~~--~~~i~v~f~~~~p~~~~-------e~a~~~~kl~---g-iis~et~l~~-l~~vd~- 433 (470) T protein:vir:99 369 LMQLYRIVLATLFNNKQDQEL--WSELDFKFTRNLPEDMA-------SAIDNAKNAE---G-IVSKKTQLGM-IPDIEP- 433 (470) T ss_pred HHHHHHHHHHHHhccCCcccc--cccceEEeCCCCCcCHH-------HHHHHHHHHh---c-cCCHHHHHHh-CCCCCH- Confidence 777776544333333222222 24688888654443443 3344455543 4 3799999987 455542 Q ss_pred HHHHHHHHHHHHhcCCCCCCcchhhhcCCCCC-cccccccCC Q lcl|NC_015285. 301 IKEIDEQIASEMEAGIIADPMAEMDPAMAAGG-EGAPAAEVD 341 (359) Q Consensus 301 I~e~~kqi~~E~~~~~~~~P~~~~~~~~~~~~-~~~~~~~~~ 341 (359) +++.++|++|..+. -+..+....+.+. ++.+.++-+ T Consensus 434 -~~E~eri~~E~~~~----~~~~~~~~~~~d~~~~d~~~ee~ 470 (470) T protein:vir:99 434 -DAEMKQIAKEKADA----IKQTQQLSMPIDILKRDNNAEEE 470 (470) T ss_pred -HHHHHHHHHHHHHH----HHHHHhhcCCCCcCCCCCCccCC Confidence 23334455553321 0011111111111 111111111 No 58 >protein:vir:94742 Length: 409 # NCBI annotation: putative portal protein # Family: family:all:524 # MgeID: mge:1529 # MgeName: phi LC3 # Cross-refs: genbank:acc:NP_996701;genbank:gi:45597416;genbank:GeneID:2767966 Probab=92.83 E-value=0.0098 Score=31.59 Aligned_cols=254 Identities=8% Similarity=0.046 Sum_probs=135.9 Q ss_pred CCCchhh-----HHHh-----hhhhhheeecccccc---ccCCCceeecH-----hHhhhhhcccccC-CC-----Ccch Q lcl|NC_015285. 1 MRGVDLN-----QQLT-----QKAAEYFLYNPKGLK---NSTNQGMKITT-----DSVTYCHSGIQDL-NK-----NMTL 56 (359) Q Consensus 1 ~~~~~~~-----~~~~-----~~~~e~f~yn~~~~~---~~~~~~v~i~~-----~ai~y~hSGl~d~-~~-----~~i~ 56 (359) +-|+.++ +.+. .+.....+|.|.... ........++. =.|.|++..-.+. .+ .-|+ T Consensus 129 i~D~~~~~~~~a~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~n~~g~vPvV~f~n~~~~~~~~G~s~I~e~v~ 208 (409) T protein:vir:94 129 IIDPITGLLTEGYAVLERDENNNVVLEAHFLPDRTDYYYRDSRNNISIANPTGHPLLVPIIHRPDAVRPFGRSRITRSGM 208 (409) T ss_pred EEecCCCceeeeEEEEEecCCCceEEEEEEecCcEEEEEecCceeEeeeCCCCCcceEEeccccccccccCccccchhHH Confidence 1111111 0000 011122334332211 11111111110 1344554321111 11 1233 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHhcCccceeEeccCCCCchHHHHHHHHHHHHhhcceEEeeCCCCcccccccchhhH Q lcl|NC_015285. 57 SHLHKAIKAVNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKMVYDANTGEIKDDKKFMSMM 136 (359) Q Consensus 57 syL~~Aik~~NqL~m~EDalVIyR~~RAPeRRvFyIDvGnlpk~KAeqYl~~iM~kyrnklvYD~~TGevkdd~~~mSMl 136 (359) ...+++- +.+.+.++.=...=.|.|-++=+|...-|..+-..++ - T Consensus 209 ~l~da~~------r~~~~~~~~~e~~a~pqr~i~G~d~d~~~~~~~~~~~-----------------------------~ 253 (409) T protein:vir:94 209 YWQSNAK------RTLERADVTAEFYSFPQKYVTGLSDDAEPMETWKATV-----------------------------S 253 (409) T ss_pred HHHHHHH------HHHHHHHHHHHHhcChhheeEecCCCCcccchhhhhH-----------------------------H Confidence 4444433 4567788888899999998886654222222111111 1 Q ss_pred hhhcccccCCCCccceeecCCCCCcc-hHHHHHHHHHHHHHhcCCCccccCCCCcccccchhhhhHHhhhHHHHHHHHHH Q lcl|NC_015285. 137 EDFWLPRREGGRGTEISTLPGGQNLG-ELEDVKYFQKKLYKALNVPSSRLETETTFNIGRAAEITRDEVKFQKFIARLRK 215 (359) Q Consensus 137 EDywLpRReGgrgTEIsTLpGgqnLg-ei~DV~YF~kkLy~aL~VP~SRl~~~~~~~~g~~~eItRDElKF~KFI~rLr~ 215 (359) .=..+|.-+.|-+.+|..++++. |+ =++-++=.-..+....++|.+-|+..+. |-..+..|.-.|....+-+.+-|+ T Consensus 254 ~i~~~~~d~dg~~~~v~q~~~~~-l~~~~~~l~~~~~~~a~~t~lP~~~lg~~~~-NpsSa~Al~a~~~~L~~~a~~k~~ 331 (409) T protein:vir:94 254 SMLQFTKDEDGDKPTLGQFTQPS-MSPFTEQLRTAAAGFAGETGLTLDDLGFVSD-NPSSVEAIKASHENLRLAGRKAQR 331 (409) T ss_pred HhhcCCCCCCCCCceEEecCCCC-hhHHHHHHHHHHHHHhhhcCCCHHHhccccC-chhHHHHHHHHHHHHHHHHHHHHH Confidence 11335666666778898898866 43 3455566666777788999888865443 222344566677778889999999 Q ss_pred HHHHHHHHHHHHHHHhcCCCC--hhHHHHHhhceeeeeeccchHHHHHHHHHHHHHHHHHHHhhhhcchhhhHHHHHHHH Q lcl|NC_015285. 216 RFSELFTDLLKTQLILKGVMS--LEEWEDMKNHIQFDFIADNYFTELKEIEIRNERMNQVAAMDPYVGKYFSVDYMRRQV 293 (359) Q Consensus 216 rFs~if~d~Lk~QLiLkgI~t--~eew~~~~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~~~~~dp~vGKy~S~~~i~k~I 293 (359) .|..-+.+++|.-+.+.|-.+ +++| ..+.+.|. ++.-++ +--+.+..+.+..+..-+=.+.+.+.+. +- T Consensus 332 ~fg~~~~~~~rla~~i~~~~~~~~~~~----~~~~v~W~-p~~~~~---~~~~a~~aDa~~Kl~~ag~~~~~~~~~~-~~ 402 (409) T protein:vir:94 332 SLGAGLLNVAYLAACLRDDAPYLREQF----RKTKPKWE-PLFEAD---ASMLSLIGDGAIKLNQAIPEFINKDTIR-DL 402 (409) T ss_pred HHHHHHHHHHHHHHHHhCCCCcccccc----ccceEEec-cCCCcc---hHHHHHHHHHHHHHHHhcccccchhHHH-HH Confidence 999999999998777665433 2344 34667776 333332 2233556666666655322344556555 55 Q ss_pred hCCCHHH Q lcl|NC_015285. 294 LKQTEIE 300 (359) Q Consensus 294 L~~tDee 300 (359) |++|+.| T Consensus 403 lG~~~~d 409 (409) T protein:vir:94 403 TGIEGGE 409 (409) T ss_pred cCCCCCC Confidence 9999999 No 59 >protein:vir:78641 Length: 278 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1855 # MgeName: tp310-2 # Cross-refs: genbank:acc:YP_001429941;genbank:gi:156603995;genbank:GeneID:5525387 Probab=92.59 E-value=0.011 Score=31.37 Aligned_cols=206 Identities=15% Similarity=0.237 Sum_probs=103.5 Q ss_pred CCCchhhHHHhhhhhhheeeccccc---------------cccCCCceeecHhHhhhhhcccccCCCCcchhhHHHHHHH Q lcl|NC_015285. 1 MRGVDLNQQLTQKAAEYFLYNPKGL---------------KNSTNQGMKITTDSVTYCHSGIQDLNKNMTLSHLHKAIKA 65 (359) Q Consensus 1 ~~~~~~~~~~~~~~~e~f~yn~~~~---------------~~~~~~~v~i~~~ai~y~hSGl~d~~~~~i~syL~~Aik~ 65 (359) +..+..| ...+.+..+|... ....+..+.++.+.|.+.. .....++=.-+|.+..|.++ T Consensus 54 i~r~~~G-----~~~~l~~l~~~~v~v~~~~~~~~~~y~~~~~~g~~~~~~~~evih~~-~~~~~~~~~G~s~~~~~~~~ 127 (278) T protein:vir:78 54 IERDIYH-----QPSKLFLLNPDVVEMLIENQSRELYYSIHAATGNKLIVHNMDMLHFK-HIVASNMVQGISPIDVLKNT 127 (278) T ss_pred EEECCCC-----cEEEEEEECCceeEEEEcCCCceEEEEEEcCCceEEEEccccEEEEC-CCCCCCCeeeccHHHHHHHH Confidence 1111111 1123333322111 1122344778888877653 11112222346889999999 Q ss_pred HHHHHHHHHHHHHHHHhcCccceeEeccCCCCchHHHHHHHHHHHHhhcceEEeeCCCCcccccccchhhHhhhcccccC Q lcl|NC_015285. 66 VNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKMVYDANTGEIKDDKKFMSMMEDFWLPRRE 145 (359) Q Consensus 66 ~NqL~m~EDalVIyR~~RAPeRRvFyIDvGnlpk~KAeqYl~~iM~kyrnklvYD~~TGevkdd~~~mSMlEDywLpRRe 145 (359) +.....++..- .+...+.| .-+++ .-++|.+..+++..+.+-..+ + ..|. .+ .++ T Consensus 128 i~~~~~~~~~~-~~~~~~~~-~~i~~-~~~~l~~e~~~~~~~~~~~~~------~-~~g~------~~-vl~-------- 182 (278) T protein:vir:78 128 TDFDNAVRTFN-LTEMQKPD-SFMLK-YGSNVGKEKRQQVLEDFKQYY------E-ENGG------IL-FQE-------- 182 (278) T ss_pred HHHHHHHHHHH-HHHhcCCC-cEEEE-eCCCCCHHHHHHHHHHHHHHh------c-cCCC------ce-ecC-------- Confidence 99888877663 45555554 44444 446777766655444332222 1 2232 22 111 Q ss_pred CCCccceeecCCCCCcchHHH---HHHHHHHHHHhcCCCccccCCCCcccccchhhhhHHhhhHHHH-HHHHHHHHHHHH Q lcl|NC_015285. 146 GGRGTEISTLPGGQNLGELED---VKYFQKKLYKALNVPSSRLETETTFNIGRAAEITRDEVKFQKF-IARLRKRFSELF 221 (359) Q Consensus 146 GgrgTEIsTLpGgqnLgei~D---V~YF~kkLy~aL~VP~SRl~~~~~~~~g~~~eItRDElKF~KF-I~rLr~rFs~if 221 (359) .|+++..|. .+.-+++- .++..+.+.++++||.+.++...+-+....++..+. |..+ |.-+..++...| T Consensus 183 --~g~~~~~l~--~~~~d~~~~e~~~~~~~~Ia~~fgVpp~~lg~~~~~~~sn~~~~~~~---~~~~~l~P~~~~i~~~l 255 (278) T protein:vir:78 183 --PGVEIEPLP--KKYVSEDIVASENLTRERVANVFQLPSVFLNARSNTNFAKNEELNRF---YLQHTLLPIVKQYEEEF 255 (278) T ss_pred --CCceEEEcc--CChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHHHH---HHHHHHHHHHHHHHHHH Confidence 256777774 33334433 357888999999999999975544444444443322 5544 444444444333 Q ss_pred HHHHHHHHHhcCCCChhHHHHHhhceeeeeeccch Q lcl|NC_015285. 222 TDLLKTQLILKGVMSLEEWEDMKNHIQFDFIADNY 256 (359) Q Consensus 222 ~d~Lk~QLiLkgI~t~eew~~~~~~I~~~f~~Dn~ 256 (359) -.. +++++|+. -..+|+|+.. .- T Consensus 256 ----n~~-----L~~~~e~~-~g~~~~f~~~--~l 278 (278) T protein:vir:78 256 ----NRK-----LLTKTDRE-KIGILNLTLN--LI 278 (278) T ss_pred ----Hhh-----cCChhHhc-CCceEEEecc--cC Confidence 333 35555554 2244555543 22 No 60 >protein:vir:9922 Length: 489 # NCBI annotation: hypothetical protein # Family: family:all:125 # MgeID: mge:178 # MgeName: 315.6 # Cross-refs: genbank:acc:NP_795684;genbank:gi:28876464;genbank:GeneID:1257980 Probab=91.07 E-value=0.018 Score=30.20 Aligned_cols=306 Identities=10% Similarity=0.053 Sum_probs=110.7 Q ss_pred CCCchhhHHHh-------------hhhhhheeeccccccc-----cCCCceeecHhHhhhhhc-cccc----CCCCcchh Q lcl|NC_015285. 1 MRGVDLNQQLT-------------QKAAEYFLYNPKGLKN-----STNQGMKITTDSVTYCHS-GIQD----LNKNMTLS 57 (359) Q Consensus 1 ~~~~~~~~~~~-------------~~~~e~f~yn~~~~~~-----~~~~~v~i~~~ai~y~hS-Gl~d----~~~~~i~s 57 (359) +-++....+++ +....+.+|.+...+. ....++..... +-|- |-++ +|+..-.| T Consensus 153 v~dd~~~~~~~~~i~~~~~~~~~~~~~~~~~~y~~~~i~~~~~~~~~~~~~~~~~~---~~~~~g~vPvv~~~n~~~~~s 229 (489) T protein:vir:99 153 IYDDTYQRNSLMAVHFYDIDYGSGKRKQIIKAYTSDTIYTYEDYNLETKGMRLKDY---EGHFFKGVPVNEYANNEERTG 229 (489) T ss_pred EEcCCCCCceEEEEEEEEEecCCCceEEEEEEEeCCcEEEEEecCCCcccceeccc---ccccCCceeEEEeecCCCCCC Confidence 11110000000 1122344555543321 01111111110 1110 2111 12222234 Q ss_pred hHHHHHHHHHHHHH-HHHHHHHHHHhcCccceeEeccCCCCchHHHHHHHHHHHHhhcceEEeeCCCCccc-ccccchhh Q lcl|NC_015285. 58 HLHKAIKAVNQLRM-IEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKMVYDANTGEIK-DDKKFMSM 135 (359) Q Consensus 58 yL~~Aik~~NqL~m-~EDalVIyR~~RAPeRRvFyIDvGnlpk~KAeqYl~~iM~kyrnklvYD~~TGevk-dd~~~mSM 135 (359) -++..+.....+.. +-+....-+..+.|-+-+.-..... ...-+.....+.. -+...+... .+.+.+ T Consensus 230 ~~~~v~~liDa~d~~~s~~~~~~~~~~~~~l~i~g~~~~~---~~~~~~~~~~~~~------~~~~~~~~~~~~~~~~-- 298 (489) T protein:vir:99 230 AYESVLDNIDAYDLSQSELANFQQDSVNALLVIAGNAYTG---ADENDYLDDGRLN------PNGRLAISIGFKKAQV-- 298 (489) T ss_pred chhhhHHHHHHHHHHHHHHHHHHHHhhhhhhhhccCCccc---ccchhhhhhcccc------ccccccccccccccee-- Confidence 44444333333322 2222222233444444443222111 1111111111100 010111100 111111 Q ss_pred HhhhcccccCC--CCccceeecCCCCCcchH-HHHHHHHHHHHHhcCCCccccCCCCcccccchh--hhhHHhhhHHHHH Q lcl|NC_015285. 136 MEDFWLPRREG--GRGTEISTLPGGQNLGEL-EDVKYFQKKLYKALNVPSSRLETETTFNIGRAA--EITRDEVKFQKFI 210 (359) Q Consensus 136 lEDywLpRReG--grgTEIsTLpGgqnLgei-~DV~YF~kkLy~aL~VP~SRl~~~~~~~~g~~~--eItRDElKF~KFI 210 (359) +++..... +.+..+.-|.-..+.+.. .-+.-+.+.+|+-.++|- +..++ +. |..| .|..-+..-..-+ T Consensus 299 ---~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~--~~~~~-~~-~n~Sg~Al~~~~~~l~~k~ 371 (489) T protein:vir:99 299 ---LILDDNPNPNGVKPQAYFLKKEYDTAGSEAYKNRLVADILRFTFTPD--TQDMK-FS-GVQSGESMKYKLMASDNYR 371 (489) T ss_pred ---eeeccccCccccccceeeeeecCChHHHHHHHHHHHHHHHHHhCCcc--ccccc-cc-ccchHHHHHHHHHHHHHHH Confidence 11111111 112234434332222222 234566778888889983 22211 11 2222 2222222223335 Q ss_pred HHHHHHHHHHHHHHHHHHHHhcCCCChhHHHH-HhhceeeeeeccchHHHHHHHHHHHHHHHHHHHhhhhcchhhhHHHH Q lcl|NC_015285. 211 ARLRKRFSELFTDLLKTQLILKGVMSLEEWED-MKNHIQFDFIADNYFTELKEIEIRNERMNQVAAMDPYVGKYFSVDYM 289 (359) Q Consensus 211 ~rLr~rFs~if~d~Lk~QLiLkgI~t~eew~~-~~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~~~~~dp~vGKy~S~~~i 289 (359) .+-|+.|...+.++++.=+-+-++.....|.. ....|.+.|....--.+. +-++++.++. | .+|.+++ T Consensus 372 ~~k~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~~~~i~v~f~~~~p~d~~-------~~~~~~~kl~---g-iis~et~ 440 (489) T protein:vir:99 372 EKQERLFKKGLMRRLRLAANIWAIKGNEATTYSLVNDTSIVFTPNLPQNDN-------EIVTAAQNLY---G-IVSDQTI 440 (489) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhhcCCccccccccccceEEeCCCCCcCHH-------HHHHHHHHHh---c-cCCHHHH Confidence 55666666666666654322223222121211 234577888644443332 2334444443 4 3799999 Q ss_pred HHHHhCCCHHHHHHHHHHHHHHHhcCCCCCCcchhhhcCCCCCcccccccCCCCCcCCCCCC Q lcl|NC_015285. 290 RRQVLKQTEIEIKEIDEQIASEMEAGIIADPMAEMDPAMAAGGEGAPAAEVDPNAQESSVDP 351 (359) Q Consensus 290 ~k~IL~~tDeeI~e~~kqi~~E~~~~~~~~P~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p 351 (359) ++.+=..++++.+++-++|++|.....- .+ ++... ++.-++. .+.+..| T Consensus 441 ~~~l~~v~~~d~~~E~~ri~~E~~~~~~-~~--~~~~~------~~~~~~~----~~~~~~p 489 (489) T protein:vir:99 441 FEILNTVTGVDAEAELKRLKEEADKKQS-LP--EPRLV------GDASGQE----EPTAEKP 489 (489) T ss_pred HHhcCCCCchhHHHHHHHHHHHHHHHhc-cc--ccccc------CCCCCCc----CCCCCCC Confidence 9886677778888888888887543211 01 11111 1111111 1122222 No 61 >protein:vir:6382 Length: 553 # NCBI annotation: portal protein Lambda B # Family: family:all:47 # MgeID: mge:133 # MgeName: BcepNazgul # Cross-refs: genbank:acc:NP_918995;genbank:gi:34610170;genbank:GeneID:2559575 Probab=90.85 E-value=0.019 Score=30.05 Aligned_cols=323 Identities=13% Similarity=0.113 Sum_probs=156.4 Q ss_pred CCCchhhHHHhhhh--------hhheeec--cccccccCC---------CceeecHhHhhhhhcccccCCCCcchhhHHH Q lcl|NC_015285. 1 MRGVDLNQQLTQKA--------AEYFLYN--PKGLKNSTN---------QGMKITTDSVTYCHSGIQDLNKNMTLSHLHK 61 (359) Q Consensus 1 ~~~~~~~~~~~~~~--------~e~f~yn--~~~~~~~~~---------~~v~i~~~ai~y~hSGl~d~~~~~i~syL~~ 61 (359) =.+..+|..|+.|| .-|++++ |......+. ...+++.+-|..++...-..-.. =+|.|+. T Consensus 184 ~~~~~~~~~i~~GVE~d~~Gr~vaY~i~~~hPgd~~~~~~~~~~~~r~~~~~~v~a~~vlH~f~~~r~gQ~R-Gis~lap 262 (553) T protein:vir:63 184 PYQQLDTPTLRRGVQYDKRGRPQGYWIQVAHPGDLYQMAPDMYKWKFVQQSKPWGRRQVIHILEPREPDQSR-GIADIVS 262 (553) T ss_pred CCCCCCCCeeEeeeEECCCCceEEEEeeccCCCccccccccccceeeeccccccChhHheecccccCCCccc-CCchHHH Confidence 11112333444444 3467764 544332211 12578888888877765432222 2899999 Q ss_pred HHHHHHHHHHHHHHHHHHHHhcCccceeEeccCCCCchHHHHHHHHHHHHhhcceEEeeCCCCcccccccchhhHhhhcc Q lcl|NC_015285. 62 AIKAVNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKMVYDANTGEIKDDKKFMSMMEDFWL 141 (359) Q Consensus 62 Aik~~NqL~m~EDalVIyR~~RAPeRRvFyIDvGnlpk~KAeqYl~~iM~kyrnklvYD~~TGevkdd~~~mSMlEDywL 141 (359) +++.+.+|.-.+||...-...-|-. +.+|-.+ .|...+.+.+..--.. +..+|- ....+.-+..+.- T Consensus 263 vl~~l~~l~~y~daeL~~a~i~A~~--a~fi~~~-~~~~~~~~~~~~~~~~-------~~~~~~---~~~~~~~~~~~~~ 329 (553) T protein:vir:63 263 GLKDMRMAKRFKEMSLQNAVINASY--AAAIESE-LPPEFIHSQMSGGSPN-------ADMVGI---FGKYMDALKAYVG 329 (553) T ss_pred HHHHHHHHhHHHHHHHHHHHHhhhh--eeeeecC-CChhhhhhhccccccc-------cccccc---ccccccccccccc Confidence 9999999999999999998888866 3333322 2433333222210000 000000 0000100011110 Q ss_pred ccc----CCC------CccceeecCCCCCcch-HHHHHHHHHHHHHhcCCCccccCCC-CcccccchhhhhHHhhhHHHH Q lcl|NC_015285. 142 PRR----EGG------RGTEISTLPGGQNLGE-LEDVKYFQKKLYKALNVPSSRLETE-TTFNIGRAAEITRDEVKFQKF 209 (359) Q Consensus 142 pRR----eGg------rgTEIsTLpGgqnLge-i~DV~YF~kkLy~aL~VP~SRl~~~-~~~~~g~~~eItRDElKF~KF 209 (359) ..+ ++| -|.+|+.+.....-+. -+=++...+.+=.+|+||-+-|..+ ++.|. |.+.-.-+.|-+. T Consensus 330 ~~~~~~l~pG~i~~L~pGe~i~~~~p~~p~~~~~~F~~~~lr~iaaglGi~Ye~lt~D~s~~nY---SS~R~~~~e~~r~ 406 (553) T protein:vir:63 330 GANNIQIDGAKIPHLFPGTKLNLKPMGTPGGVGSEFEASLNRHLASAFGMSYEEFTRDFSKANY---SSIQAGIAMTRRF 406 (553) T ss_pred cccceeecCceeeecCCCCeeeecCCCCCCCCHHHHHHHHHHHHHhhcCCCHHHHhhhcccccH---HHHHHHHHHHHHH Confidence 000 111 1333444443333333 2334555566667899999988766 45555 2234455669999 Q ss_pred HHHHHHHHHHHHHHHHH----HHHHhcCCCChhHHH-H--------H--hhceeeeeeccchHHHHHHHHHHHHHHHHHH Q lcl|NC_015285. 210 IARLRKRFSELFTDLLK----TQLILKGVMSLEEWE-D--------M--KNHIQFDFIADNYFTELKEIEIRNERMNQVA 274 (359) Q Consensus 210 I~rLr~rFs~if~d~Lk----~QLiLkgI~t~eew~-~--------~--~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~~~ 274 (359) +.++|..|..-|..++- ...+|.|-++--.+. . - .-...+..-.-.+.--+||+.-...+++. T Consensus 407 ~~~~q~~~~~~~~~pi~~~wl~~a~l~G~i~~p~~~~~~~~~~p~~~~a~~~~~w~~p~~~~iDP~Ke~~A~~~~i~~-- 484 (553) T protein:vir:63 407 LEGRKKMCADRLATEFFTLWLEEAIAAGEVPMPPGQTRDLFYQPLMKEALSKCEWIGASQGQIDQLKETQAAVMRIDA-- 484 (553) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHcCCccCCCcccchhhcchhhhhhhhceeeecCCccccChHHHHHHHHHHHHc-- Confidence 99999988877776542 355788876422111 0 0 01123333444455566776655555442 Q ss_pred HhhhhcchhhhHHHHHHHHhCCCHHHHH-HHHHHHHHHHhcCCCCCCcchhhhcCCCCCcccccccCCCCCcCCCCCCCC Q lcl|NC_015285. 275 AMDPYVGKYFSVDYMRRQVLKQTEIEIK-EIDEQIASEMEAGIIADPMAEMDPAMAAGGEGAPAAEVDPNAQESSVDPGD 353 (359) Q Consensus 275 ~~dp~vGKy~S~~~i~k~IL~~tDeeI~-e~~kqi~~E~~~~~~~~P~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~ 353 (359) -+-|.+-+..+ ++..-+++. |.+...+.-.+.|+..+....... +.+..+ .++..+.+..+.+ T Consensus 485 -------G~~t~~~~~a~-~G~D~~~v~~q~a~e~~~~~~~Gl~~~~~~~~~~-----~~~~~~---~~~~~~~~~~~~~ 548 (553) T protein:vir:63 485 -------GLSTYEREIAR-LGGDFRKSFAQRAREDALLKKYGLTFNLSAKRSL-----GDGRDA---ATGIAEDPAAAQT 548 (553) T ss_pred -------CCCCHHHHHHH-hCCCHHHHHHHHHHHHHHHHHcCCCCCCCCcccc-----CCCccc---CCCCCCCCCCCCc Confidence 22355555555 365555443 333333333344543322211100 111111 1112233334455 Q ss_pred CccCC Q lcl|NC_015285. 354 VRRGE 358 (359) Q Consensus 354 ~~~~~ 358 (359) .+.|| T Consensus 549 ~~~~e 553 (553) T protein:vir:63 549 SQQGE 553 (553) T ss_pred ccccC Confidence 56666 No 62 >protein:vir:7853 Length: 518 # NCBI annotation: gp10 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:150 # MgeName: CJW1 # Cross-refs: genbank:acc:NP_817460;genbank:gi:29565889;genbank:GeneID:1259085 Probab=90.81 E-value=0.019 Score=30.03 Aligned_cols=293 Identities=12% Similarity=0.119 Sum_probs=119.8 Q ss_pred CCCchhhHHHhhhhhhheeeccccc------------------cccCCCceeecHhHhhhhhcccccCCCC-cchhhHHH Q lcl|NC_015285. 1 MRGVDLNQQLTQKAAEYFLYNPKGL------------------KNSTNQGMKITTDSVTYCHSGIQDLNKN-MTLSHLHK 61 (359) Q Consensus 1 ~~~~~~~~~~~~~~~e~f~yn~~~~------------------~~~~~~~v~i~~~ai~y~hSGl~d~~~~-~i~syL~~ 61 (359) +++ ..| .+.+.+..+|... ....+..++++.+-|.+.. ....++. .=+|-|.. T Consensus 121 ~r~-~~G-----~~~~L~~l~p~~Vtv~~~~~~~~~~y~~~~~~~~~~~~~~~~~~eIiHir--~~~~dg~~~G~Spi~~ 192 (518) T protein:vir:78 121 QKN-KSG-----TPEKLMPMHPSRVAIKRNSRTGRYEYYFQAGAGVGTQLVSFADDEVVPIR--FFNPDGLERGLSLMES 192 (518) T ss_pred EEc-CCC-----cEEEEEEECCCceEEEEcCCCCEEEEEEEecCCccceeEEecCCcEEEec--CCCCCcccccccHHHH Confidence 111 111 1112222222110 0111223667777776653 1222222 23678888 Q ss_pred HHHHHHHHHHHHHHHHHHHHhcCccceeEeccCCCCchHHHHHHHHHHHHhhcceEEeeCCCCcccccccchhhHhhhcc Q lcl|NC_015285. 62 AIKAVNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKMVYDANTGEIKDDKKFMSMMEDFWL 141 (359) Q Consensus 62 Aik~~NqL~m~EDalVIyR~~RAPeRRvFyIDvGnlpk~KAeqYl~~iM~kyrnklvYD~~TGevkdd~~~mSMlEDywL 141 (359) |.+++.....+++...=+----+.-+-|...+ +.|.+..+++.-+.+...|+-- ...|.+- .++ T Consensus 193 ~~~~i~~~~aa~~~~~~~f~Ng~~p~gvl~~~-~~ls~e~~~~~k~~~~~~~~G~----~nag~~~-------vL~---- 256 (518) T protein:vir:78 193 LKSTIFSEDSSRNATAAMWKNAGRPNLVLRHE-KRLSPEAQQRLREQFDRAHAGS----SNTGKTM-------VVE---- 256 (518) T ss_pred HHHHHHHHHHHHHHHHHHHhcCCCccEEEecC-CCCCHHHHHHHHHHHHHHhcCc----ccCCcee-------EcC---- Confidence 88888888888777544433345556677776 5676666655444444444310 0112211 121 Q ss_pred cccCCCCccceeecCCCCCcch---HHHHHHHHHHHHHhcCCCccccCCCCcccccchhhhhHHhhhHHHH-HHHHHHHH Q lcl|NC_015285. 142 PRREGGRGTEISTLPGGQNLGE---LEDVKYFQKKLYKALNVPSSRLETETTFNIGRAAEITRDEVKFQKF-IARLRKRF 217 (359) Q Consensus 142 pRReGgrgTEIsTLpGgqnLge---i~DV~YF~kkLy~aL~VP~SRl~~~~~~~~g~~~eItRDElKF~KF-I~rLr~rF 217 (359) + |.+++.|. .+.-+ ++-.+|..+.+.++++||...|...+.-+..+..+..+. |.++ |.-+-.++ T Consensus 257 ---~---G~~~~~l~--~~~~d~q~le~r~~~~~eIa~afgVPp~~lg~~~~st~sn~e~~~~~---f~~~tL~P~~~~i 325 (518) T protein:vir:78 257 ---E---GMEPIPLQ--LTAVEMQFIEARQLNREEVCGVYDIAPPIVHILDRATFSNISAQMRA---FYRDTMAIPIARI 325 (518) T ss_pred ---C---CceEEecc--CChhHHHHHHHHHHHHHHHHHHhCCCHHHhccCCCCCchhHHHHHHH---HHHHHHHHHHHHH Confidence 1 34455443 23333 344457779999999999999964433333333333322 6554 44445554 Q ss_pred HHHHHHHHHHHHHhcCCCChhHHHHHhhceeeeeeccchHHHHHHHHHHHHHHHHHHHhhhhcchhhhHHHHHHHHhCCC Q lcl|NC_015285. 218 SELFTDLLKTQLILKGVMSLEEWEDMKNHIQFDFIADNYFTELKEIEIRNERMNQVAAMDPYVGKYFSVDYMRRQVLKQT 297 (359) Q Consensus 218 s~if~d~Lk~QLiLkgI~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~~~~~dp~vGKy~S~~~i~k~IL~~t 297 (359) ...|...|- +.. +. ...+ .|..+ ++...+ ++.|.+.+..+-.. -+++.+-++. .+++. T Consensus 326 e~eln~~L~---------~~~--~~-~~~~--~fd~~----~Llr~D-~~~r~~~~~~~~~~--G~lT~NE~R~-~~gl~ 383 (518) T protein:vir:78 326 QSAMDKYVG---------QYW--VR-KNRM--KFDID----DVIQPD-WEAKSESTQKMVNS--GVATPNEGRE-IMGLP 383 (518) T ss_pred HHHHHHhhc---------ccc--cC-cceE--Eeech----hhhccC-HHHHHHHHHHHHhC--CCcCHHHHHH-HhCCC Confidence 444444332 221 11 2233 34322 222222 24566666655433 3566666663 35664 Q ss_pred HHHHHHHHHHHHHHHhcCCCCC----Ccc---------hhhhcCCCCCc------ccccccCCCCCcCCCC-CCCCCccC Q lcl|NC_015285. 298 EIEIKEIDEQIASEMEAGIIAD----PMA---------EMDPAMAAGGE------GAPAAEVDPNAQESSV-DPGDVRRG 357 (359) Q Consensus 298 DeeI~e~~kqi~~E~~~~~~~~----P~~---------~~~~~~~~~~~------~~~~~~~~~~~~~~~~-~p~~~~~~ 357 (359) ..+=..- +-.|.+ |-. +..+..+.+++ +..+.+..|..-+++. .+++..+- T Consensus 384 pie~~~g---------D~~~v~~n~~pl~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 454 (518) T protein:vir:78 384 RSDDPKA---------DELYANSALQPLGATPDGAVEGEEAPAPKRPASTPVASLDQSPPASVPGLSPTNSDRSTDSGKT 454 (518) T ss_pred CCCCCCC---------ceeeecccceecccccccccCCCCCCCCCCCCcccccccccCccccCCCCCccccccccccccc Confidence 4320000 000000 000 00000000000 0000000111111111 11111100 Q ss_pred ------------CC Q lcl|NC_015285. 358 ------------EF 359 (359) Q Consensus 358 ------------~~ 359 (359) +| T Consensus 455 ~~~~~~~~~~~~~~ 468 (518) T protein:vir:78 455 EPRRLMQKPPPKES 468 (518) T ss_pred chhcccCCCCcccc Confidence 11 No 63 >protein:vir:106571 Length: 499 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:1598 # MgeName: Lj965 # Cross-refs: genbank:acc:NP_958580;genbank:gi:41179240;genbank:GeneID:2717107 Probab=88.79 E-value=0.03 Score=28.92 Aligned_cols=296 Identities=11% Similarity=0.115 Sum_probs=108.7 Q ss_pred CCCchhhHHHhhhh--------------hhheeeccccccc---cCC-----CceeecHhHhhhhhc-ccccC----CCC Q lcl|NC_015285. 1 MRGVDLNQQLTQKA--------------AEYFLYNPKGLKN---STN-----QGMKITTDSVTYCHS-GIQDL----NKN 53 (359) Q Consensus 1 ~~~~~~~~~~~~~~--------------~e~f~yn~~~~~~---~~~-----~~v~i~~~ai~y~hS-Gl~d~----~~~ 53 (359) +-++..+.+..-.+ ..+-+|+|..+.. ..+ ....+.. +.|- |.++. ++. T Consensus 165 v~~d~~~~~~~~~i~~~~~~~~~~~~~~~~~~iyt~~~i~~~~~~~~~~~~~~~~~~~~----~~~~~g~vPvv~~~n~~ 240 (499) T protein:vir:10 165 VCDDTVEHDPLFAVFTQEKKDLEGNTNGYSITVYMPQRIVEYRTKTTMEVSANDPIVYD----GENLFGAVPIIEFRNNE 240 (499) T ss_pred EecCCCCcceEEEEEEEEEeecCCCceEEEEEEEeCCeEEEEEecCCccccCcceeccc----ccCCCCccceEEecCCC Confidence 11111111111111 1112333332110 000 0000100 1111 22111 112 Q ss_pred cchhhHHHHHHHHHHHH-HHHHHHHHHHHhcCccceeEeccCCCCchHHHHHHHHHHHHhhcceEEeeCCCCcccccccc Q lcl|NC_015285. 54 MTLSHLHKAIKAVNQLR-MIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKMVYDANTGEIKDDKKF 132 (359) Q Consensus 54 ~i~syL~~Aik~~NqL~-m~EDalVIyR~~RAPeRRvFyIDvGnlpk~KAeqYl~~iM~kyrnklvYD~~TGevkdd~~~ 132 (359) .-.|=++..+.....+. ++-+....-+.+..|-+-+.-.+.+.. ... T Consensus 241 ~~~~d~e~v~~liD~~~~~~S~~~~~~~~~~~~~lv~~G~~~~~~--------------------------------~~~ 288 (499) T protein:vir:10 241 ERQGDFEQLISLIDAYNLLQTDRISDKEAFVDALLVTFGFGLGDD--------------------------------KDD 288 (499) T ss_pred CCCCchHhHHHHHHHHHHHHHHHHHHHHHhcCceeeeecCccccc--------------------------------cch Confidence 22344444444444333 234444455566666655543332221 111 Q ss_pred hhhHhhhcccccCCCCccceeecCCCCCcchH-HHHHHHHHHHHHhcCCCccccCCCCcccccchh--hhhHHhhhHHHH Q lcl|NC_015285. 133 MSMMEDFWLPRREGGRGTEISTLPGGQNLGEL-EDVKYFQKKLYKALNVPSSRLETETTFNIGRAA--EITRDEVKFQKF 209 (359) Q Consensus 133 mSMlEDywLpRReGgrgTEIsTLpGgqnLgei-~DV~YF~kkLy~aL~VP~SRl~~~~~~~~g~~~--eItRDElKF~KF 209 (359) ...+..+.+.--.+..|..+++|-...+.... .-+.-+.+.+|+-..+|-- ..+. | .|..| .|..-......- T Consensus 289 ~~~~~~~~~~~~~~~~~~d~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~--~~~~-~-~gn~Sg~Al~~~~~~l~~k 364 (499) T protein:vir:10 289 IQRLKRGAIEAPPREEGADIEWLTKSFDETQVNLLSQSIENDIHKISYVPNM--NDEK-F-MGNVSGEAMKFKLFGLENL 364 (499) T ss_pred hhhhhhcceeccCCCCCCcceEEeccCCHHHHHHHHHHHHHHHHHHhCcccC--Cchh-h-cccchHHHHHHHHHHHHHH Confidence 11112211111122233446666554444332 3456667778888888831 1111 1 12222 222222222344 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhcCC-CChhHHHHHhhceeeeeeccchHHHHHHHHHHHHHHHHHHHhhhhcchhhhHHH Q lcl|NC_015285. 210 IARLRKRFSELFTDLLKTQLILKGV-MSLEEWEDMKNHIQFDFIADNYFTELKEIEIRNERMNQVAAMDPYVGKYFSVDY 288 (359) Q Consensus 210 I~rLr~rFs~if~d~Lk~QLiLkgI-~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~~~~~dp~vGKy~S~~~ 288 (359) +.+.++.|...+.++++.=+-+-++ -...+| ..+.+.|....--.+ .+.+++++.+. | .+|.++ T Consensus 365 ~~~k~~~~~~~l~~~~~li~~~~~~~~~~~d~----~~i~i~f~~~~p~n~-------~e~~~~~~kl~---g-~iS~et 429 (499) T protein:vir:10 365 LSIKQRYFFDGLRRRLKLIQTIVNIKGANDDA----SGCKISLVANIPSNL-------SDVVNNVKNAD---G-IIPRKY 429 (499) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhccCCcccc----ccceEEeCCCCCCCH-------HHHHHHHHHHh---c-cCChHH Confidence 5566666666666666654433222 122333 356777765544444 34445555553 4 379999 Q ss_pred HHHHHhCCCHHHHHHHHHHHHHHHhcCCCCCCcchhhhcCCCCCcccc-----cc--cCCCCCcCCCCCCCCCccC Q lcl|NC_015285. 289 MRRQVLKQTEIEIKEIDEQIASEMEAGIIADPMAEMDPAMAAGGEGAP-----AA--EVDPNAQESSVDPGDVRRG 357 (359) Q Consensus 289 i~k~IL~~tDeeI~e~~kqi~~E~~~~~~~~P~~~~~~~~~~~~~~~~-----~~--~~~~~~~~~~~~p~~~~~~ 357 (359) ++.. |...++ .+++.++|++|..+. .+.+++ ...+..++... .. +.+++.....+.||-.+-- T Consensus 430 ~~~~-l~~v~d-~~~E~~ri~~E~~~~-~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 499 (499) T protein:vir:10 430 TYSW-LPDVDN-PQDVIDEMNQQDAET-IKKNQE---ALRGQDPDRLELEDKQDDSSENDKEAGSNHNQSHRTRAV 499 (499) T ss_pred HHHh-CCCCCC-HHHHHHHHHHHHHHH-HHHHHh---hhccCCCCCCCCCCCCcccCCCCCCCccccccCCCCCCC Confidence 9987 555321 223334444443221 111111 11111111000 00 0112222222222222222 No 64 >protein:vir:95542 Length: 548 # NCBI annotation: Putative portal protein # Family: family:all:47 # MgeID: mge:1574 # MgeName: F10 # Cross-refs: genbank:acc:YP_001293348;genbank:gi:148912769;genbank:GeneID:5228194 Probab=88.03 E-value=0.035 Score=28.58 Aligned_cols=310 Identities=13% Similarity=0.141 Sum_probs=151.3 Q ss_pred CC--CchhhHHHhhhhh--------hheeec--ccccc--ccCCCceeecHhHhhhhhcccccCCCCcchhhHHHHHHHH Q lcl|NC_015285. 1 MR--GVDLNQQLTQKAA--------EYFLYN--PKGLK--NSTNQGMKITTDSVTYCHSGIQDLNKNMTLSHLHKAIKAV 66 (359) Q Consensus 1 ~~--~~~~~~~~~~~~~--------e~f~yn--~~~~~--~~~~~~v~i~~~ai~y~hSGl~d~~~~~i~syL~~Aik~~ 66 (359) +. .+..+..|+.||+ -|+++. |.... .....-++||.+-|..++.-.-. .--.=+|.|..+++.+ T Consensus 176 l~~~~~~~~~~i~~GIE~D~~Grp~aY~i~~~hPgd~~~~~~~~~~~rvpA~~VlHif~~~r~-gQ~RGvs~lapvl~~l 254 (548) T protein:vir:95 176 LPFSYNNLSKGIVQGIERDTWRRKRAYHLLKDHPGNLQTLGGSLAVKRVEAERIIHIAYRKRI-GQNRGVPMLHAVLIRL 254 (548) T ss_pred cCCCCCCCCCceeeeeEECCCCceEEEEEeecCCCcccccccccceeeechhHheecccccCC-ccccCcchHHHHHHHH Confidence 11 1122333444443 466663 44332 22344689999998887766533 2222478999999999 Q ss_pred HHHHHHHHHHHHHHHhcCccceeEeccCCCCchHHHHHHHHHHHHhhcceEEeeCCCCcccccccchhhHhhhcccccCC Q lcl|NC_015285. 67 NQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKMVYDANTGEIKDDKKFMSMMEDFWLPRREG 146 (359) Q Consensus 67 NqL~m~EDalVIyR~~RAPeRRvFyIDvGnlpk~KAeqYl~~iM~kyrnklvYD~~TGevkdd~~~mSMlEDywLpRReG 146 (359) .+|.-.+||..+-...-|-.=-+..=+-+. ... ...+ -.+.....+| T Consensus 255 ~~l~~y~dael~~aki~A~~a~fi~~~~~~---~~~------------------~~~~-~~~~~~~~~~----------- 301 (548) T protein:vir:95 255 ADLKDYEESERVAARISAALAMYIKKGNPD---SYT------------------VEPG-KDRKNRTIPI----------- 301 (548) T ss_pred HHHhHHHHHHHHHHHHhhhheeeeecCCCc---ccc------------------CCCC-cccccccccc----------- Confidence 999999999999988888763333322221 100 0000 0011111111 Q ss_pred CCccceeecCCCCCc---------chHHHHHH-HHHHHHHhcCCCccccCCCCcccccchhhhhHHhhhHHHHHHHHHHH Q lcl|NC_015285. 147 GRGTEISTLPGGQNL---------GELEDVKY-FQKKLYKALNVPSSRLETETTFNIGRAAEITRDEVKFQKFIARLRKR 216 (359) Q Consensus 147 grgTEIsTLpGgqnL---------gei~DV~Y-F~kkLy~aL~VP~SRl~~~~~~~~g~~~eItRDElKF~KFI~rLr~r 216 (359) .-|+=|.+|+.|+.+ +..++... ..+.+=.+|+||-+-|..+ ++ ++-|.+.-.-+.|-+.+.++|.. T Consensus 302 ~pG~iv~~L~pGe~i~~~~p~~p~~~~~~f~~~~lr~IAaglGipYe~ltgD--~s-~nYSS~R~~l~e~~r~~~~~q~~ 378 (548) T protein:vir:95 302 APGMVFDDLEPGEDVGMIESNRPNPFLEGFRNGQLRMIGAGTRSTYSSVSRA--YD-GTYSAQRQELVEGWLGYDLLQHE 378 (548) T ss_pred cCCccccccCCCceeeecCCCCCCCCHHHHHHHHHHHHHhhcCCCHHHHhcc--cc-hhHHHHHHHHHHHHHHHHHHHHH Confidence 123334555444433 33333322 3333456799999999776 33 24455555667799999999999 Q ss_pred HHHHHHHHHHH----HHHhcCCCChhHHHHHhhceeeeeec--cchHHHHHHHHHHHHHHHHHHHhhhhcchhhhHHHHH Q lcl|NC_015285. 217 FSELFTDLLKT----QLILKGVMSLEEWEDMKNHIQFDFIA--DNYFTELKEIEIRNERMNQVAAMDPYVGKYFSVDYMR 290 (359) Q Consensus 217 Fs~if~d~Lk~----QLiLkgI~t~eew~~~~~~I~~~f~~--Dn~f~ElKe~Eil~~Rl~~~~~~dp~vGKy~S~~~i~ 290 (359) |..-|..++-. ..+|+|.++.-.|.+-.......|.. --+.-.+||+.-...+++. -+-|.+-+. T Consensus 379 ~i~~~~~Pi~~~wle~a~l~G~i~lP~~~~~~~~~~~~W~~P~~~~iDP~Kea~A~~~~i~~---------Gl~T~~~~~ 449 (548) T protein:vir:95 379 FIDYWCRPVYRSWLQMYLLARKERLPADVDHRTLYAAVYQGPVMPWINPMHEANAWELLVKA---------GFADEAEVA 449 (548) T ss_pred HHHHHHHHHHHHHHHHHHHcCCcCCCCCCCchhheeeeeecCCccccChHHHHHHHHHHHHc---------CCCCHHHHH Confidence 88777665333 46788877532232223344455533 3345567777665555442 334665555 Q ss_pred HHHhCCCHHHHH-HHHHHHHHHHhcCCCC--CCcchhhhcC--CC-------C-----Ccccc---------cc-cC-CC Q lcl|NC_015285. 291 RQVLKQTEIEIK-EIDEQIASEMEAGIIA--DPMAEMDPAM--AA-------G-----GEGAP---------AA-EV-DP 342 (359) Q Consensus 291 k~IL~~tDeeI~-e~~kqi~~E~~~~~~~--~P~~~~~~~~--~~-------~-----~~~~~---------~~-~~-~~ 342 (359) .+ .+..-+|+. |.+...+.-+..|+-. +|-.+...++ +. . .+|+- || .+ -| T Consensus 450 a~-~G~D~~ev~~q~a~E~~~~~~~GL~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 528 (548) T protein:vir:95 450 RA-RGRDPRELKKSRETEIKANRAAGLVFSSDAYHQLVKSGMDPVEAVQKVYLGVGKMLTADEARELVNRYGAGLPVPGP 528 (548) T ss_pred HH-hCCCHHHHHHHHHHHHHHHHHcCCCCCCcccccccccccCCCCchhhhccccccccccchhHHhhccCCCCCcCCCC Confidence 55 566555533 3333333333334322 1111000000 00 0 00000 00 00 11 Q ss_pred CCcCCCCCCCCCccCCC Q lcl|NC_015285. 343 NAQESSVDPGDVRRGEF 359 (359) Q Consensus 343 ~~~~~~~~p~~~~~~~~ 359 (359) .++..+++.| -+|+- T Consensus 529 ~~~~~~~~~~--~~~~~ 543 (548) T protein:vir:95 529 DFPNESNNGG--ADGQP 543 (548) T ss_pred CCCcccccCC--CCCCC Confidence 2222222111 11111 No 65 >protein:vir:5961 Length: 503 # NCBI annotation: hypothetical protein # Family: family:all:125 # MgeID: mge:125 # MgeName: SPP1 # Cross-refs: genbank:acc:NP_690661;genbank:geneid:6329220;genbank:gi:22855055;interpro:IPR006428;uniprot:P54309;genbank:GeneID:955279 Probab=84.48 E-value=0.06 Score=27.28 Aligned_cols=303 Identities=10% Similarity=0.053 Sum_probs=112.1 Q ss_pred CCCchhhHHHhhhh-------------hhheeeccccccc--cCCC---------------ceeecHhHhhhhhccccc- Q lcl|NC_015285. 1 MRGVDLNQQLTQKA-------------AEYFLYNPKGLKN--STNQ---------------GMKITTDSVTYCHSGIQD- 49 (359) Q Consensus 1 ~~~~~~~~~~~~~~-------------~e~f~yn~~~~~~--~~~~---------------~v~i~~~ai~y~hSGl~d- 49 (359) +.++....++.-.+ .-+-+|++..... ..+. .+.....+..|.+-=++. T Consensus 168 i~d~~~~~~~~~~ir~~~~~~~~~~~~~~~evy~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPiv~~ 247 (503) T protein:vir:59 168 VYKDNTRRDILFALRYYSYKGIMGEETQKAELYTDTHVYYYEKIDGVYQMDYSYGENNPRPHMTKGGQAIGWGRVPIIPF 247 (503) T ss_pred EEeCCCCCceEEEEEEEEEecCCCceEEEEEEEeCCcEEEEEEcCCcccccccccccccccceeecceeccCCccceEEe Confidence 11111111111111 1111444433211 0111 111111111110000111 Q ss_pred CCCCcchhhHHHHHHHHHHHH-HHHHHHHHHHHhcCccceeEeccCCCCchHHHHHHHHHHHHhhcceEEeeCCCCcccc Q lcl|NC_015285. 50 LNKNMTLSHLHKAIKAVNQLR-MIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKMVYDANTGEIKD 128 (359) Q Consensus 50 ~~~~~i~syL~~Aik~~NqL~-m~EDalVIyR~~RAPeRRvFyIDvGnlpk~KAeqYl~~iM~kyrnklvYD~~TGevkd 128 (359) .++..-.|=++.++.....+. ++-+.+..-+.++.|-+.+--.+.-+.+ +.... |..+ +++ T Consensus 248 ~nn~~~~sd~~~~~~liDa~d~~~s~~~~~~~~~~~~~~v~~g~~~~~~~-----~~~~~-~~~~--~~~---------- 309 (503) T protein:vir:59 248 KNNEEMVSDLKFYKDLIDNYDSITSSTMDSFSDFQQIVYVLKNYDGENPK-----EFTAN-LRYH--SVI---------- 309 (503) T ss_pred cCCCCCCcchhhhHHHHHHHHHHHHHHHHHHHHhcCCeeEeecCCccccc-----hhhhh-hhcc--cce---------- Confidence 122233444555544444443 3345555567777775544333222211 11111 1111 111 Q ss_pred cccchhhHhhhcccccCCCCccceeecCCCCCcchH-HHHHHHHHHHHHhcCCCccccCCCCcccccchhhhhHHhhhHH Q lcl|NC_015285. 129 DKKFMSMMEDFWLPRREGGRGTEISTLPGGQNLGEL-EDVKYFQKKLYKALNVPSSRLETETTFNIGRAAEITRDEVKFQ 207 (359) Q Consensus 129 d~~~mSMlEDywLpRReGgrgTEIsTLpGgqnLgei-~DV~YF~kkLy~aL~VP~SRl~~~~~~~~g~~~eItRDElKF~ 207 (359) .+| +++ .+..|-...+.+.. .-+.-+++.+|+...+|---.+.-++ +. .+..|......-. T Consensus 310 -----------~~~--~~~---~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~-~~-Sg~Ai~~~~~~l~ 371 (503) T protein:vir:59 310 -----------KVS--GDG---GVDTLRAEIPVDSAAKELERIQDELYKSAQAVDNSPETIGG-GA-TGPALENLYALLD 371 (503) T ss_pred -----------ecc--CCC---cceeEeccCCHHHHHHHHHHHHHHHHHHhcccCCCcccccc-cc-cHHHHHHHHHHHH Confidence 011 111 24444443333332 34466677788888888322111111 11 1222332222334 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhcCCCChhHHHHHhhceeeeeeccchHHHHHHHHHHHHHHHHHHHhhhhcchhhhHH Q lcl|NC_015285. 208 KFIARLRKRFSELFTDLLKTQLILKGVMSLEEWEDMKNHIQFDFIADNYFTELKEIEIRNERMNQVAAMDPYVGKYFSVD 287 (359) Q Consensus 208 KFI~rLr~rFs~if~d~Lk~QLiLkgI~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~~~~~dp~vGKy~S~~ 287 (359) .-+.+.+..|...+.++++.=+-+-++....++... ..|.+.|...-.-.+. +.++++..+-.- | .+|.+ T Consensus 372 ~k~~~~~~~~~~~l~~~~~~i~~~~~~~~~~~~~~~-~~i~i~f~~~~p~d~~-------~~~~~~~kl~~~-G-iiS~e 441 (503) T protein:vir:59 372 LKANMAERKIRAGLRLFFWFFAEYLRNTGKGDFNPD-KELTMTFTRTRIQNDS-------EIVQSLVQGVTG-G-IMSKE 441 (503) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhccCcccccc-cceeEEeCCCCCCCHH-------HHHHHHHHHHhC-C-CCchH Confidence 446667777777777766653333334433333322 3488888655444442 334444444211 2 47999 Q ss_pred HHHHHHhCCCHHHHHHHHHHHHHHHhcCCCCCCcchhhhcCCCCCcccccccCCCCCcCCCCCCCCCccC Q lcl|NC_015285. 288 YMRRQVLKQTEIEIKEIDEQIASEMEAGIIADPMAEMDPAMAAGGEGAPAAEVDPNAQESSVDPGDVRRG 357 (359) Q Consensus 288 ~i~k~IL~~tDeeI~e~~kqi~~E~~~~~~~~P~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~ 357 (359) ++++. |...+ +.+++-++|++|+....-..++. ++ +.++....+.+-+....+...+.++-. T Consensus 442 t~l~~-l~~v~-d~~~E~~ri~~E~~~~~~~~~~~-~~-----~~~~~~~~~~~~~~~~~~~~~~~g~~~ 503 (503) T protein:vir:59 442 TAVAR-NPFVQ-DPEEELARIEEEMNQYAEMQGNL-LD-----DEGGDDDLEEDDPNAGAAESGGAGQVS 503 (503) T ss_pred HHHHh-CCCCC-CHHHHHHHHHHHHHHHHhhhccc-cC-----ccCCCCCCCcCCCCCCcccCCCCCCcC Confidence 99987 55443 12233344444433211001110 00 011111111111111222233333333 No 66 >protein:vir:3609 Length: 452 # NCBI annotation: ORF32 # Family: family:all:125 # MgeID: mge:74 # MgeName: TP901-1 # Cross-refs: genbank:acc:NP_112695;genbank:gi:13786563;genbank:GeneID:921063 Probab=83.62 E-value=0.067 Score=27.02 Aligned_cols=283 Identities=13% Similarity=0.158 Sum_probs=114.2 Q ss_pred CCCchhhHHHh---------hhhhhheeecccccc--ccCCCceeecHhHhhhhh-ccccc----CCCCcchhhHHHHHH Q lcl|NC_015285. 1 MRGVDLNQQLT---------QKAAEYFLYNPKGLK--NSTNQGMKITTDSVTYCH-SGIQD----LNKNMTLSHLHKAIK 64 (359) Q Consensus 1 ~~~~~~~~~~~---------~~~~e~f~yn~~~~~--~~~~~~v~i~~~ai~y~h-SGl~d----~~~~~i~syL~~Aik 64 (359) +-++....+.+ .+....-+|.+...+ ...+++..+.... -| .|-++ .++....|-++..+. T Consensus 149 v~d~~~~~~~~~~i~~~~~~~~~~~~~vyt~~~i~~~~~~~~~~~~~~~~---~~~~g~iPvv~~~n~~~g~sd~e~v~~ 225 (452) T protein:vir:36 149 VYDDTVKQEPLFAVRYGVDEDKKLQGEVYTLLETIKISGENDEISFGEGT---YNPYPDLPVVEFYFNEERMSIFESVIS 225 (452) T ss_pred EEcCCCCCceEEEEEEEEecCceEEEEEEecCeEEEEEEcCCceEEecce---eccCCcccEEEecCCCCCCcchHHHHH Confidence 11110000000 011112244443321 1112222221110 01 12111 122233344444443 Q ss_pred HHHHHH-HHHHHHHHHHHhcCccceeEeccCCCCchHHHHHHHHHHHHhhcceEEeeCCCCcccccccchhhHhhhcccc Q lcl|NC_015285. 65 AVNQLR-MIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKMVYDANTGEIKDDKKFMSMMEDFWLPR 143 (359) Q Consensus 65 ~~NqL~-m~EDalVIyR~~RAPeRRvFyIDvGnlpk~KAeqYl~~iM~kyrnklvYD~~TGevkdd~~~mSMlEDywLpR 143 (359) ....+. ++-+....-+..+.|-+-+.- +.++... +.++ . .++++.=..+| T Consensus 226 liDa~d~~~s~~~~~~~~~~~p~~~~~g---~~~~~~~----~~~~-~--~~~~~~~~~~~------------------- 276 (452) T protein:vir:36 226 LVNAFNKAISEKANDVDYFSDQYLTFLG---AAVEEED----LKNI-R--SNRVINYYADG------------------- 276 (452) T ss_pred HHHHHHHHHHHHHHHHHHhcCceeEeec---CCcCchh----hhhh-h--hcceEEecCCC------------------- Confidence 333332 234444455666777665542 2222111 1111 0 11222111111 Q ss_pred cCCCCccceeecCCCCCcchHH-HHHHHHHHHHHhcCCCccccCCCCcccccch--hhhhHHhhhHHHHHHHHHHHHHHH Q lcl|NC_015285. 144 REGGRGTEISTLPGGQNLGELE-DVKYFQKKLYKALNVPSSRLETETTFNIGRA--AEITRDEVKFQKFIARLRKRFSEL 220 (359) Q Consensus 144 ReGgrgTEIsTLpGgqnLgei~-DV~YF~kkLy~aL~VP~SRl~~~~~~~~g~~--~eItRDElKF~KFI~rLr~rFs~i 220 (359) .+.+..+.+|....+.+.+. -+.-+.+.+|.-..+|- +..++ +|++ ..|..-+.....-+.+.+..|..- T Consensus 277 --~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~--~~~~~---~gn~Sg~Al~~~~~~l~~k~~~~~~~~~~~ 349 (452) T protein:vir:36 277 --EGKNVDVKFLEKPDSDSQTENLLDRLTKLIFQTTMVAN--ISDES---FGSSSGVSLAYKLQAMSNLALSFQRKFQSS 349 (452) T ss_pred --CccCCcceeEeecCCHHHHHHHHHHHHHHHHHHhCccc--cCccc---ccCCcHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 22334566665554444433 35667788888889984 33222 2333 233333333445567777777777 Q ss_pred HHHHHHHHHHhcC-CCChhHHHHHhhceeeeeeccchHHHHHHHHHHHHHHHHHHHhhhhcchhhhHHHHHHHHhCCCHH Q lcl|NC_015285. 221 FTDLLKTQLILKG-VMSLEEWEDMKNHIQFDFIADNYFTELKEIEIRNERMNQVAAMDPYVGKYFSVDYMRRQVLKQTEI 299 (359) Q Consensus 221 f~d~Lk~QLiLkg-I~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~~~~~dp~vGKy~S~~~i~k~IL~~tDe 299 (359) +...++.=+-+.+ .....+|. .|.+.|...---.+ .+.+++++.+. | .+|.+++++. |..+++ T Consensus 350 l~~~~~li~~~~~~~~~~~~~~----~i~i~f~~~~p~d~-------~~~a~~~~k~~---g-~iS~et~~~~-~~~~~d 413 (452) T protein:vir:36 350 LNSRYKLFCELSTNVSNKDSWK----DIEYTFTRNEPKDI-------KEQAETANILM---G-ITSQETALSV-ISVIPD 413 (452) T ss_pred HHHHHHHHHHHHhccCCccccc----cceEEeCCCCCcCH-------HHHHHHHHHHh---c-cCChHHHHHh-CCCCCC Confidence 7777775433222 22333454 46777765433333 23344555553 3 3799999976 566532 Q ss_pred HHHHHHHHHHHHHhcCCCCCCcchhhhcCCCCCcccccccCCCCCcC Q lcl|NC_015285. 300 EIKEIDEQIASEMEAGIIADPMAEMDPAMAAGGEGAPAAEVDPNAQE 346 (359) Q Consensus 300 eI~e~~kqi~~E~~~~~~~~P~~~~~~~~~~~~~~~~~~~~~~~~~~ 346 (359) .+++.++|++|.....-.+++ . .+++............+ T Consensus 414 -~~~E~~ri~~E~~~~~~~~~~------~-~~~~~~~~~~~~~~~~e 452 (452) T protein:vir:36 414 -VQAEMEKIKKEEASTAIFDKD------K-QPSEKGTDTVVSETNEE 452 (452) T ss_pred -HHHHHHHHHHHHHHHHHHHhh------c-cCCCCcccccCccccCC Confidence 445555666664432111111 1 00010010100110001 No 67 >protein:vir:96980 Length: 409 # NCBI annotation: ORF008 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1643 # MgeName: 42e # Cross-refs: genbank:acc:YP_239857;genbank:gi:66395516;genbank:GeneID:5133013 Probab=83.48 E-value=0.068 Score=26.98 Aligned_cols=278 Identities=14% Similarity=0.222 Sum_probs=116.9 Q ss_pred CCC--------chhhHHHhhhhhhheeecccc---------------ccccCCCceeecHhHhhhhhcccccCCCCcchh Q lcl|NC_015285. 1 MRG--------VDLNQQLTQKAAEYFLYNPKG---------------LKNSTNQGMKITTDSVTYCHSGIQDLNKNMTLS 57 (359) Q Consensus 1 ~~~--------~~~~~~~~~~~~e~f~yn~~~---------------~~~~~~~~v~i~~~ai~y~hSGl~d~~~~~i~s 57 (359) +.+ +..|. ..+.+...|.. +....+..+.++.+-|.+.. +.-..++=.-+| T Consensus 107 l~Gnay~~i~r~~~G~-----~~~L~~l~~~~v~v~~~~~~~~~~y~~~~~~g~~~~~~~~evih~r-~~~~~~~~~G~s 180 (409) T protein:vir:96 107 EKGNAYVLIERDIYHQ-----PSKLFLLNPDVVEMLIENQSRELYYSIHAATGNKLIVHNMDMLHFK-HIVASNMVQGIS 180 (409) T ss_pred hcCceEEEEEECCCCc-----EEEEEEEcCceeEEEEeCCCcEEEEEEEcCCceEEEEccccEEEeC-CCCCCCcccccc Confidence 111 11111 11222222211 01112334567777665531 111112212245 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHhcCccceeEeccCCCCchHHHHHHHHHHHHhhcceEEeeCCCCcccccccchhhHh Q lcl|NC_015285. 58 HLHKAIKAVNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKMVYDANTGEIKDDKKFMSMME 137 (359) Q Consensus 58 yL~~Aik~~NqL~m~EDalVIyR~~RAPeRRvFyIDvGnlpk~KAeqYl~~iM~kyrnklvYD~~TGevkdd~~~mSMlE 137 (359) .|..|.........++.. ......+.| . +...-.+.|.+.+++...+.+.+.|.| .|.+ + .++ T Consensus 181 ~l~~~~~~i~~~~~~~~~-~~~~~~~~~-~-~i~~~~~~l~~e~~~~~~~~~~~~~~n-------~g~~------~-vl~ 243 (409) T protein:vir:96 181 PIDVLKNTTDFDNAVRTF-NLTEMQKPD-S-FMLKYGSNVSTEKRQQVLEDFKQYYEE-------NGGI------L-FQE 243 (409) T ss_pred HHHHHHHHHHHHHHHHHH-HHHhcCCCc-e-eEEecCCCCCHHHHHHHHHHHHHHhhc-------CCCe------e-ecC Confidence 566666655555555544 345554444 2 344445777777776666665554432 2322 1 111 Q ss_pred hhcccccCCCCccceeecCCC-CCcchHHHHHHHHHHHHHhcCCCccccCCCCcccccchhhhhHHhhhHHHH-HHHHHH Q lcl|NC_015285. 138 DFWLPRREGGRGTEISTLPGG-QNLGELEDVKYFQKKLYKALNVPSSRLETETTFNIGRAAEITRDEVKFQKF-IARLRK 215 (359) Q Consensus 138 DywLpRReGgrgTEIsTLpGg-qnLgei~DV~YF~kkLy~aL~VP~SRl~~~~~~~~g~~~eItRDElKF~KF-I~rLr~ 215 (359) -|.+++.|.-. +.+.-++-..|..+.+.++++||.+.|+..+.-+.....+..+. |.++ |.-+-. T Consensus 244 ----------~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~s~~e~~~~~---f~~~~l~P~~~ 310 (409) T protein:vir:96 244 ----------PGVEIEPLPKKYVSEDIVASENLTRERVANVFQLPSIFLNARSNTNFAKNEELNRF---YLQHTLLPIVK 310 (409) T ss_pred ----------CCceEEEcCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHHHH---HHHHHHHHHHH Confidence 25667766421 22222333445678899999999999976554455555554444 6555 333323 Q ss_pred HHHHHHHHHHHHHHHhcCCCChhHHHHHhhceeeeeeccchHHHHHHHHHHHHHHHHHHHhhhhcchhhhHHHHHHHHhC Q lcl|NC_015285. 216 RFSELFTDLLKTQLILKGVMSLEEWEDMKNHIQFDFIADNYFTELKEIEIRNERMNQVAAMDPYVGKYFSVDYMRRQVLK 295 (359) Q Consensus 216 rFs~if~d~Lk~QLiLkgI~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~~~~~dp~vGKy~S~~~i~k~IL~ 295 (359) ++. +.|- +.++++.++. ....|.|..+ ++...+ +..|++.+..+-.- -+++..-++. .++ T Consensus 311 ~ie----~~l~-----~~Ll~~~~~~---~g~~i~fd~~----~ll~~d-~~~~~e~~~~~~~~--G~~T~NE~R~-~~g 370 (409) T protein:vir:96 311 QYE----EEFN-----RKLLTKTDRE---KNRYFKFNVK----SYLRAD-SATQAEVYFKAVRS--GYYTINDIRE-WED 370 (409) T ss_pred HHH----HHHH-----hhcCCccccc---CcceEEeech----hhhccC-HHHHHHHHHHHHhC--CCCCHHHHHH-HhC Confidence 222 2233 3344555543 2344555433 333333 34566666555332 2556666654 355 Q ss_pred CCHHHHHHHHHHHHHHHhcCCCCCCcchhhhcCCCCCcccccccCCCCCcCCCCCCCCCccC Q lcl|NC_015285. 296 QTEIEIKEIDEQIASEMEAGIIADPMAEMDPAMAAGGEGAPAAEVDPNAQESSVDPGDVRRG 357 (359) Q Consensus 296 ~tDeeI~e~~kqi~~E~~~~~~~~P~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~ 357 (359) +.+-+ --++-+- ... +...+...+......|++ ++.+.| T Consensus 371 ~~pi~--ggD~~~~---~~n-~~~~~~~~~~~~~~~gG~-----------------~n~~e~ 409 (409) T protein:vir:96 371 LPPVE--GGDKPLI---SGD-LYPIDTPLELRKSLKGGD-----------------KNVNES 409 (409) T ss_pred CCCCC--Ccceeee---ccc-ccccccchhhcccccCCC-----------------CCcCCC Confidence 54321 0000000 000 000000000111111111 122222 No 68 >protein:vir:38 Length: 496 # NCBI annotation: putative portal protein # Family: family:all:898 # MgeID: mge:2 # MgeName: A118 # Cross-refs: genbank:acc:NP_463464;swissprot:trembl:q9t1c0;genbank:gi:16798786;uniprot:Q9T1C0;genbank:GeneID:922383 Probab=83.12 E-value=0.071 Score=26.88 Aligned_cols=287 Identities=14% Similarity=0.182 Sum_probs=120.3 Q ss_pred CCCchhhHHHhhhhhhheeeccccccccCCCcee----------------ecHhHhhhhhcc---cccCCCCcchhhHHH Q lcl|NC_015285. 1 MRGVDLNQQLTQKAAEYFLYNPKGLKNSTNQGMK----------------ITTDSVTYCHSG---IQDLNKNMTLSHLHK 61 (359) Q Consensus 1 ~~~~~~~~~~~~~~~e~f~yn~~~~~~~~~~~v~----------------i~~~ai~y~hSG---l~d~~~~~i~syL~~ 61 (359) ..+--+... .+..-+|.+|.-.... .-+..|. ++.--++|.--- -.+.....=+|=|+. T Consensus 182 ~le~h~~~~-~~~~I~~~~y~~~~~~-~~g~~v~~~~~~~~~~~~~~~~~~~~~~f~~~~~~~~N~~~~~~p~G~Sd~~~ 259 (496) T protein:vir:38 182 LLEWNEWQG-DVYTVTTELYQSDDPN-ELGTKVSLTLLFDDIEPVVPLPDFTRPTFIYIKPNIANNKNLTSPLGISVYAN 259 (496) T ss_pred EEEEEEEeC-ceEEEEEEEEecCCcc-ccCccccccccccccccceeecCCCcceEEEecCCcccccccCCcCCCchHhh Confidence 000000000 1122234444211100 0000000 000011111000 001111222466667 Q ss_pred HHHHHHHHHHHHHHHHHHHHhcCccceeEeccCCCCchHHHHHHHHHHHHhhcceEEeeCCCCcccccccchhhHhhhcc Q lcl|NC_015285. 62 AIKAVNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKMVYDANTGEIKDDKKFMSMMEDFWL 141 (359) Q Consensus 62 Aik~~NqL~m~EDalVIyR~~RAPeRRvFyIDvGnlpk~KAeqYl~~iM~kyrnklvYD~~TGevkdd~~~mSMlEDywL 141 (359) ++.....|-..=.. +.+-++.-.+|+|. +.. +++ . ..|..++.+. -++.-.+.|.. T Consensus 260 ~~~lid~ld~~~s~--~~~~~~~~~~~i~v-~~~---------~l~----~-----~~~~~g~~~~---~~~~~~~~~~~ 315 (496) T protein:vir:38 260 ALDTLKTLDLMFDS--YYQEFKLGKKKVLV-PSS---------FVK----T-----AVNLDGSTTQ---YFDSTDEAFFL 315 (496) T ss_pred HHHHHHHHHHHHHH--HHHHHhhcccceec-chH---------Hhh----c-----cCCCCCcccc---CCCCccceEEE Confidence 76666666444333 33566776666654 211 110 0 0112222111 01111111111 Q ss_pred ccc-CCCCccceeecCCCCCcch-HHHHHHHHHHHHHhcCCCccccCCCCcccccchhhhhHHhhhHHHHHHHHHHHHHH Q lcl|NC_015285. 142 PRR-EGGRGTEISTLPGGQNLGE-LEDVKYFQKKLYKALNVPSSRLETETTFNIGRAAEITRDEVKFQKFIARLRKRFSE 219 (359) Q Consensus 142 pRR-eGgrgTEIsTLpGgqnLge-i~DV~YF~kkLy~aL~VP~SRl~~~~~~~~g~~~eItRDElKF~KFI~rLr~rFs~ 219 (359) ..- +++.+.-|+++.+.-...+ ..-++.+.+.+....++|-+-|+.+++- .-.+++|.-....-..-+.+.++.|.. T Consensus 316 ~~~~~~~~~~~i~~~~~~i~~e~~~~~l~~~l~~i~~~~g~~~~~f~~~~~g-~~tAtei~~~~~~l~~~~~~~~~~~~~ 394 (496) T protein:vir:38 316 YQGDQDDNGKAIKDISVEIRSTEFIESINAMLRIYAMQVGLSAGTFTFDENG-LKTATEVVSEKSETYQTKNSHSQLIEQ 394 (496) T ss_pred eecCCCcccccceeeccccCHHHHHHHHHHHHHHHHHhhCCChhhcCCCccc-cchHHHHHHHHHHHHHHHHHHHHHHHH Confidence 110 1222233665555322221 3346777789999999999988765321 113556644443333334444444444 Q ss_pred HHHHHHHH-------HHHhcCCCChhHHHHHhhceeeeeeccchHHHHHHHHHHHHHHHHHHHhhhhcchhhhHHHHHHH Q lcl|NC_015285. 220 LFTDLLKT-------QLILKGVMSLEEWEDMKNHIQFDFIADNYFTELKEIEIRNERMNQVAAMDPYVGKYFSVDYMRRQ 292 (359) Q Consensus 220 if~d~Lk~-------QLiLkgI~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~~~~~dp~vGKy~S~~~i~k~ 292 (359) .+.++++. .+.++|..- + ...+.++|...-.-.+. +.++.+.++-. .| .+|.++++++ T Consensus 395 ~l~~l~~~il~~~~~~~~~~g~~~----~--~~~i~v~f~d~i~~d~~-------~~~~~~~~~~~-~G-iiS~et~l~~ 459 (496) T protein:vir:38 395 GIKEMIVSILEVGKFIEAYSGEVV----E--LDTITVDFDDSIAQDED-------TTINRYTNAKN-QG-MIPLKIALQR 459 (496) T ss_pred HHHHHHHHHHHHHHHHHhhcCCCC----C--ccceEEEeCCCCCCCHH-------HHHHHHHHHHh-cC-CCCHHHHHHh Confidence 44444433 344555332 1 24578888743222221 12222222211 13 4799999888 Q ss_pred HhCCCHHHHHHHHHHHHHHHhcCCCCCCcchhhhcCCCCCcccccccCC Q lcl|NC_015285. 293 VLKQTEIEIKEIDEQIASEMEAGIIADPMAEMDPAMAAGGEGAPAAEVD 341 (359) Q Consensus 293 IL~~tDeeI~e~~kqi~~E~~~~~~~~P~~~~~~~~~~~~~~~~~~~~~ 341 (359) ....||+|.+++.++|++|.... .+.|+ .++..|+.+ T Consensus 460 ~~~~~d~ea~~el~ri~~E~~~~-~~~~d-----------~~~~~~~~e 496 (496) T protein:vir:38 460 AWNITEAEADEWAEMLAKEKQAE-MPNND-----------MNGIFGEEE 496 (496) T ss_pred cCCCChHHHHHHHHHHHHhhhcc-Ccccc-----------ccCCCCCCC Confidence 88999999999999999986543 22111 111112111 No 69 >protein:vir:81218 Length: 423 # NCBI annotation: gp3, phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1893 # MgeName: BFK20 # Cross-refs: genbank:acc:YP_001456733;genbank:gi:157168376;interpro:IPR006427;interpro:IPR006944;uniprot:Q9MBK2;genbank:GeneID:5580341 Probab=82.63 E-value=0.075 Score=26.74 Aligned_cols=281 Identities=11% Similarity=0.089 Sum_probs=111.3 Q ss_pred CCCchhhHHHhhhhhhheeeccc-----------c--------ccccCCCceeecHhHhhhhhcccccCCCC-cchhhHH Q lcl|NC_015285. 1 MRGVDLNQQLTQKAAEYFLYNPK-----------G--------LKNSTNQGMKITTDSVTYCHSGIQDLNKN-MTLSHLH 60 (359) Q Consensus 1 ~~~~~~~~~~~~~~~e~f~yn~~-----------~--------~~~~~~~~v~i~~~ai~y~hSGl~d~~~~-~i~syL~ 60 (359) .++......++ +...+.+. + .....+..+.++.+.|.+.+. .+.++. .=+|-++ T Consensus 119 ~rd~~~~~~~~----~l~p~~~~~v~~~~~~~~~~~~~Y~~~~~~~~~g~~~~~~~~evih~r~--~~~~~~~~G~spi~ 192 (423) T protein:vir:81 119 PGDLGVDTPTL----DIRPIPVSWVQRRAYKDGWGSLDYIIIESGDNDGRSVKVPGERVIHRHG--YNPKTMKRGKSPVQ 192 (423) T ss_pred EecCCcCcceE----EEeecccceeeeeeccCCCcceEEEEEEecCCCceEEEEcccceEEecC--CCCCCccccccHHH Confidence 11111111100 00001111 0 000122336788888876552 222222 2357788 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhcCccceeEeccCCCCchHHHHHHHHHHHHhhcceEE-eeCCCCcccccccchhhHhhh Q lcl|NC_015285. 61 KAIKAVNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKMV-YDANTGEIKDDKKFMSMMEDF 139 (359) Q Consensus 61 ~Aik~~NqL~m~EDalVIyR~~RAPeRRvFyIDvGnlpk~KAeqYl~~iM~kyrnklv-YD~~TGevkdd~~~mSMlEDy 139 (359) .|..++.....+++...=+=---+.-+-|+..|-...|.+-.++-.+.+..+++.... --..+|.+- .++ T Consensus 193 ~~~~~i~~~~~~~~~~~~~f~ng~~p~gvi~~~~~~~~~~l~~e~~~~~~~~~~~~~~~~~~n~g~~~-------vl~-- 263 (423) T protein:vir:81 193 SLRDILGEQIEAAIFRAQMWRNGPRPGMVIMRDPESKAGKWDAESRTRFMANLRASFSPKSSDVGGTL-------LLE-- 263 (423) T ss_pred HHHHHHHHHHHHHHHHHHHHhccCCCceEEEecCcccCccCCHHHHHHHHHHHHHHhccccccCCcce-------ecC-- Confidence 8888777777777664433222345666777775443322122222333333332221 112234321 221 Q ss_pred cccccCCCCccceeecCCCCCcchHHHH---HHHHHHHHHhcCCCccccCCCCcccccchhhhhHHhhhHHHHHHHHHHH Q lcl|NC_015285. 140 WLPRREGGRGTEISTLPGGQNLGELEDV---KYFQKKLYKALNVPSSRLETETTFNIGRAAEITRDEVKFQKFIARLRKR 216 (359) Q Consensus 140 wLpRReGgrgTEIsTLpGgqnLgei~DV---~YF~kkLy~aL~VP~SRl~~~~~~~~g~~~eItRDElKF~KFI~rLr~r 216 (359) .|.+++.|. .+.-+++-+ ++-...+.++.+||...++.-++-+..+..+..+. |..++ |+-. T Consensus 264 --------~g~~~~~l~--~s~~d~q~~e~~~~~~~eIa~~fgVPp~~lg~~~~~t~sn~e~~~~~---f~~~~--L~P~ 328 (423) T protein:vir:81 264 --------DGMKAENFH--TTSKDEQTVETTKLSLQTVAQVYGINPTMVGQLDNANYSNVREFRKA---LYGDN--LGSW 328 (423) T ss_pred --------CCceEEecc--CChhhHHHHHHHHhhHHHHHHHhCCCHHHhcCCCCCCcccHHHHHHH---HHHHH--HHHH Confidence 245566553 333444333 35567799999999998864333233233332322 66552 3321 Q ss_pred HHHHHHHHHHHHHHhcCCCChhHHHHHhhceeeeeeccchHHHHHHHHHHHHHHHHHHHhhhhcchhhhHHHHHHHHhCC Q lcl|NC_015285. 217 FSELFTDLLKTQLILKGVMSLEEWEDMKNHIQFDFIADNYFTELKEIEIRNERMNQVAAMDPYVGKYFSVDYMRRQVLKQ 296 (359) Q Consensus 217 Fs~if~d~Lk~QLiLkgI~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~~~~~dp~vGKy~S~~~i~k~IL~~ 296 (359) + ..+.+.|-..| .++.+|+.-...|+|++. .+...++ +.|.+.++.+-.-.| + | T Consensus 329 ~-~~ie~~l~~~L-----~~~~~~~~~~~~~~fd~~------~llr~d~-~~r~~~~~~~l~~~G-~------------~ 382 (423) T protein:vir:81 329 I-RIIQDVMNLFL-----LPRVGIDNEKFYFEFNLE------EKLRASF-EEAAEIKRAAVGNVA-W------------M 382 (423) T ss_pred H-HHHHHHHhhhh-----cCccccccCccEEEecch------hhhccCH-HHHHHHHHHHHhCCC-C------------c Confidence 1 22344444443 344444433334444332 2222222 345555443211111 3 3 Q ss_pred CHHHHHHHHHHHHHHHhcCCCCCCcchhhhcCCCCCcccccccCCCCCcCCCCCCCCCccC Q lcl|NC_015285. 297 TEIEIKEIDEQIASEMEAGIIADPMAEMDPAMAAGGEGAPAAEVDPNAQESSVDPGDVRRG 357 (359) Q Consensus 297 tDeeI~e~~kqi~~E~~~~~~~~P~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~ 357 (359) |..|+-+. .+. +|-+.. ....--.+......+..|++..++ T Consensus 383 T~NE~R~~---------~gl--~p~~gG---------D~~~~p~n~~~~~~~~~~~~~~~t 423 (423) T protein:vir:81 383 TINEVRAM---------DNL--PSIDGG---------DDLARPLNTEFGDSEDAPGEEVET 423 (423) T ss_pred CHHHHHHH---------hCC--CCCCCc---------ceeecccccccCccCCCCCCCCCC Confidence 44444221 011 121100 000000011111222333444444 No 70 >protein:vir:105889 Length: 474 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1514 # MgeName: phiETA3 # Cross-refs: genbank:acc:YP_001004371;genbank:gi:122891826;genbank:GeneID:4712360 Probab=82.39 E-value=0.077 Score=26.68 Aligned_cols=273 Identities=12% Similarity=0.144 Sum_probs=105.1 Q ss_pred CCCchhhHHHhhhhhhheeecccccc-c-cC--CCceeecHhHhhhhh-ccccc----CCCCcchhhHHHHHHHHHHHHH Q lcl|NC_015285. 1 MRGVDLNQQLTQKAAEYFLYNPKGLK-N-ST--NQGMKITTDSVTYCH-SGIQD----LNKNMTLSHLHKAIKAVNQLRM 71 (359) Q Consensus 1 ~~~~~~~~~~~~~~~e~f~yn~~~~~-~-~~--~~~v~i~~~ai~y~h-SGl~d----~~~~~i~syL~~Aik~~NqL~m 71 (359) ..+...+. .+....+|++...+ + +. .....+. .+.| .|.++ +++..-.|-++..+.....+.. T Consensus 185 ~~~~~~~~----~~~~~~~y~~~~~~~~~~~~~~~~~~~~----~~~~~~g~vPvv~~~n~~~g~sd~e~v~~liDa~d~ 256 (474) T protein:vir:10 185 EKDDDNGT----DYVYAEFYDNAYYYVFRGEGIDALQEVG----RYEHLFDYNPLFGVPNNKEMIGDAEKVIHLIDAYDL 256 (474) T ss_pred EeeCCCce----EEEEEEEEcCceEEEEeecCCCcccccc----cccCCCCccceEEecCCCCCCCchHHHHHHHHHHHH Confidence 11111110 01112233333221 0 00 1111111 1223 13332 1233334555655555544433 Q ss_pred -HHHHHHHHHHhcCccceeEeccCCCCchHHHHHHHHHHHHhhcceEEeeCCCCcccccccchhhHhhhcccccCCCCcc Q lcl|NC_015285. 72 -IEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKMVYDANTGEIKDDKKFMSMMEDFWLPRREGGRGT 150 (359) Q Consensus 72 -~EDalVIyR~~RAPeRRvFyIDvGnlpk~KAeqYl~~iM~kyrnklvYD~~TGevkdd~~~mSMlEDywLpRReGgrgT 150 (359) +-+....-+.+..|-+-+.-. .++. +-+..+ .. ++. .|++ + .+. T Consensus 257 ~~S~~~~~~~~~~~~~l~i~g~---~~~~----~~~~~~-~~--~~~---------------------i~~~--~--~~~ 301 (474) T protein:vir:10 257 TMSDASSEISQTRLAYLVLRGM---GMSE----EMIQET-QK--SGA---------------------FELF--D--KDM 301 (474) T ss_pred HHHHHHHHHHHhhcchhhhccC---CCCc----hhhhhh-hh--cce---------------------eEec--C--CCC Confidence 333333444455554433322 1121 111111 00 111 1111 0 112 Q ss_pred ceeecCCCCCcc-hHHHHHHHHHHHHHhcCCCccccCCCCcccccchh--hhhHHhhhHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_015285. 151 EISTLPGGQNLG-ELEDVKYFQKKLYKALNVPSSRLETETTFNIGRAA--EITRDEVKFQKFIARLRKRFSELFTDLLKT 227 (359) Q Consensus 151 EIsTLpGgqnLg-ei~DV~YF~kkLy~aL~VP~SRl~~~~~~~~g~~~--eItRDElKF~KFI~rLr~rFs~if~d~Lk~ 227 (359) .++.|--..+.. ...-++-+.+.+|....+|--- .+ .|. |..| .|..-......-+.+.+..|..-+...++. T Consensus 302 ~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~--~~-~~~-~n~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~l 377 (474) T protein:vir:10 302 DVKYLTKDVNDTMIENHLDRIEKNIMRFAKSVNFN--SD-EFN-GNVPIIGMKLKLMALENKCMTFERKMTAMLRYQFKV 377 (474) T ss_pred ceeEEeccCCHHHHHHHHHHHHHHHHHHhCCcccc--cc-ccc-ccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 244443333322 2344566778888888888421 11 111 2222 222222223334556666666666666655 Q ss_pred HHHh---cCCC-ChhHHHHHhhceeeeeeccchHHHHHHHHHHHHHHHHHHHhhhhcchhhhHHHHHHHHhCCCHHHHHH Q lcl|NC_015285. 228 QLIL---KGVM-SLEEWEDMKNHIQFDFIADNYFTELKEIEIRNERMNQVAAMDPYVGKYFSVDYMRRQVLKQTEIEIKE 303 (359) Q Consensus 228 QLiL---kgI~-t~eew~~~~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~~~~~dp~vGKy~S~~~i~k~IL~~tDeeI~e 303 (359) =+-+ +|.- .+.+| ..|.+.|....--.+...+ ++++.+. | .+|.+++++. |...+ +.++ T Consensus 378 i~~~l~~~~~~~~~~~~----~~i~~~f~~~~p~d~~e~a-------~~~~kl~---g-~iS~et~~~~-l~~v~-d~~~ 440 (474) T protein:vir:10 378 ILSALKRKGYNLDDDSY----LNLIFKFTRNIPVNKLEES-------QVLINLK---G-QVSERTRLGQ-SQLVD-DVDY 440 (474) T ss_pred HHHHHhhccCCCCcccc----ccceEEeCCCCCCCHHHHH-------HHHHHHh---c-cCchHHHHHh-CCCCC-CHHH Confidence 3322 2221 23333 4577888765554454444 3444443 3 3799999987 55543 3455 Q ss_pred HHHHHHHHHhcCCCCCCcchhhhcCCCCCcccccccCCCCCcCCC Q lcl|NC_015285. 304 IDEQIASEMEAGIIADPMAEMDPAMAAGGEGAPAAEVDPNAQESS 348 (359) Q Consensus 304 ~~kqi~~E~~~~~~~~P~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 348 (359) +-++|++|..+..-..|+.. +++... .++..++. T Consensus 441 E~eri~~E~~e~~~~~~~~~-------~~~~~~----~~~~~~s~ 474 (474) T protein:vir:10 441 ELDEMEKESLEFNDKLPDID-------EGDAND----KSQNNQSE 474 (474) T ss_pred HHHHHHHHHHHHHhhccccc-------CCCcCC----CCccccCC Confidence 55556555443211112210 000000 11111111 No 71 >protein:vir:94101 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1494 # MgeName: 96 # Cross-refs: genbank:acc:YP_240229;genbank:gi:66395892;genbank:GeneID:5133270 Probab=82.39 E-value=0.077 Score=26.68 Aligned_cols=273 Identities=12% Similarity=0.144 Sum_probs=105.1 Q ss_pred CCCchhhHHHhhhhhhheeecccccc-c-cC--CCceeecHhHhhhhh-ccccc----CCCCcchhhHHHHHHHHHHHHH Q lcl|NC_015285. 1 MRGVDLNQQLTQKAAEYFLYNPKGLK-N-ST--NQGMKITTDSVTYCH-SGIQD----LNKNMTLSHLHKAIKAVNQLRM 71 (359) Q Consensus 1 ~~~~~~~~~~~~~~~e~f~yn~~~~~-~-~~--~~~v~i~~~ai~y~h-SGl~d----~~~~~i~syL~~Aik~~NqL~m 71 (359) ..+...+. .+....+|++...+ + +. .....+. .+.| .|.++ +++..-.|-++..+.....+.. T Consensus 185 ~~~~~~~~----~~~~~~~y~~~~~~~~~~~~~~~~~~~~----~~~~~~g~vPvv~~~n~~~g~sd~e~v~~liDa~d~ 256 (474) T protein:vir:94 185 EKDDDNGT----DYVYAEFYDNAYYYVFRGEGIDALQEVG----RYEHLFDYNPLFGVPNNKEMIGDAEKVIHLIDAYDL 256 (474) T ss_pred EeeCCCce----EEEEEEEEcCceEEEEeecCCCcccccc----cccCCCCccceEEecCCCCCCCchHHHHHHHHHHHH Confidence 11111110 01112233333221 0 00 1111111 1223 13332 1233334555655555544433 Q ss_pred -HHHHHHHHHHhcCccceeEeccCCCCchHHHHHHHHHHHHhhcceEEeeCCCCcccccccchhhHhhhcccccCCCCcc Q lcl|NC_015285. 72 -IEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKMVYDANTGEIKDDKKFMSMMEDFWLPRREGGRGT 150 (359) Q Consensus 72 -~EDalVIyR~~RAPeRRvFyIDvGnlpk~KAeqYl~~iM~kyrnklvYD~~TGevkdd~~~mSMlEDywLpRReGgrgT 150 (359) +-+....-+.+..|-+-+.-. .++. +-+..+ .. ++. .|++ + .+. T Consensus 257 ~~S~~~~~~~~~~~~~l~i~g~---~~~~----~~~~~~-~~--~~~---------------------i~~~--~--~~~ 301 (474) T protein:vir:94 257 TMSDASSEISQTRLAYLVLRGM---GMSE----EMIQET-QK--SGA---------------------FELF--D--KDM 301 (474) T ss_pred HHHHHHHHHHHhhcchhhhccC---CCCc----hhhhhh-hh--cce---------------------eEec--C--CCC Confidence 333333444455554433322 1121 111111 00 111 1111 0 112 Q ss_pred ceeecCCCCCcc-hHHHHHHHHHHHHHhcCCCccccCCCCcccccchh--hhhHHhhhHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_015285. 151 EISTLPGGQNLG-ELEDVKYFQKKLYKALNVPSSRLETETTFNIGRAA--EITRDEVKFQKFIARLRKRFSELFTDLLKT 227 (359) Q Consensus 151 EIsTLpGgqnLg-ei~DV~YF~kkLy~aL~VP~SRl~~~~~~~~g~~~--eItRDElKF~KFI~rLr~rFs~if~d~Lk~ 227 (359) .++.|--..+.. ...-++-+.+.+|....+|--- .+ .|. |..| .|..-......-+.+.+..|..-+...++. T Consensus 302 ~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~--~~-~~~-~n~Sg~Al~~~~~~l~~k~~~~~~~~~~~l~~~~~l 377 (474) T protein:vir:94 302 DVKYLTKDVNDTMIENHLDRIEKNIMRFAKSVNFN--SD-EFN-GNVPIIGMKLKLMALENKCMTFERKMTAMLRYQFKV 377 (474) T ss_pred ceeEEeccCCHHHHHHHHHHHHHHHHHHhCCcccc--cc-ccc-ccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 244443333322 2344566778888888888421 11 111 2222 222222223334556666666666666655 Q ss_pred HHHh---cCCC-ChhHHHHHhhceeeeeeccchHHHHHHHHHHHHHHHHHHHhhhhcchhhhHHHHHHHHhCCCHHHHHH Q lcl|NC_015285. 228 QLIL---KGVM-SLEEWEDMKNHIQFDFIADNYFTELKEIEIRNERMNQVAAMDPYVGKYFSVDYMRRQVLKQTEIEIKE 303 (359) Q Consensus 228 QLiL---kgI~-t~eew~~~~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~~~~~dp~vGKy~S~~~i~k~IL~~tDeeI~e 303 (359) =+-+ +|.- .+.+| ..|.+.|....--.+...+ ++++.+. | .+|.+++++. |...+ +.++ T Consensus 378 i~~~l~~~~~~~~~~~~----~~i~~~f~~~~p~d~~e~a-------~~~~kl~---g-~iS~et~~~~-l~~v~-d~~~ 440 (474) T protein:vir:94 378 ILSALKRKGYNLDDDSY----LNLIFKFTRNIPVNKLEES-------QVLINLK---G-QVSERTRLGQ-SQLVD-DVDY 440 (474) T ss_pred HHHHHhhccCCCCcccc----ccceEEeCCCCCCCHHHHH-------HHHHHHh---c-cCchHHHHHh-CCCCC-CHHH Confidence 3322 2221 23333 4577888765554454444 3444443 3 3799999987 55543 3455 Q ss_pred HHHHHHHHHhcCCCCCCcchhhhcCCCCCcccccccCCCCCcCCC Q lcl|NC_015285. 304 IDEQIASEMEAGIIADPMAEMDPAMAAGGEGAPAAEVDPNAQESS 348 (359) Q Consensus 304 ~~kqi~~E~~~~~~~~P~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 348 (359) +-++|++|..+..-..|+.. +++... .++..++. T Consensus 441 E~eri~~E~~e~~~~~~~~~-------~~~~~~----~~~~~~s~ 474 (474) T protein:vir:94 441 ELDEMEKESLEFNDKLPDID-------EGDAND----KSQNNQSE 474 (474) T ss_pred HHHHHHHHHHHHHhhccccc-------CCCcCC----CCccccCC Confidence 55556555443211112210 000000 11111111 No 72 >protein:vir:1634 Length: 409 # NCBI annotation: Structural protein # Family: family:all:524 # MgeID: mge:33 # MgeName: r1t # Cross-refs: genbank:acc:NP_695055;genbank:gi:23455746;genbank:GeneID:955506 Probab=81.62 E-value=0.084 Score=26.48 Aligned_cols=257 Identities=8% Similarity=0.005 Sum_probs=139.5 Q ss_pred CCCchhh-----HHHh-----hhhhhheeeccccc---cccCCCceeecH-----hHhhhhhcc-cccCCC-----Ccch Q lcl|NC_015285. 1 MRGVDLN-----QQLT-----QKAAEYFLYNPKGL---KNSTNQGMKITT-----DSVTYCHSG-IQDLNK-----NMTL 56 (359) Q Consensus 1 ~~~~~~~-----~~~~-----~~~~e~f~yn~~~~---~~~~~~~v~i~~-----~ai~y~hSG-l~d~~~-----~~i~ 56 (359) +-|+.++ +.+. ++...+.+|-|... .........++. -.|.|++.- |.+..+ .-++ T Consensus 129 i~D~~~~~~~~a~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvV~f~n~~~~~~~~G~seI~~~v~ 208 (409) T protein:vir:16 129 IIDPITGLLTEGYAVLERDENNNVVLEAHFLPDRTDYYYRDSRNNISIANPTGNPLLVPIIHRPDAVRPFGRSRITRSGM 208 (409) T ss_pred EeecccccceeeeEEEEecCCCceEEEEEEecCcEEEEEecCccccceecCCCCcceEEecccccccccCCccccchhHH Confidence 1111111 0000 01122334433221 111111111110 134455431 111111 1244 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHhcCccceeEeccCCCCchHHHHHHHHHHHHhhcceEEeeCCCCcccccccchhhH Q lcl|NC_015285. 57 SHLHKAIKAVNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKMVYDANTGEIKDDKKFMSMM 136 (359) Q Consensus 57 syL~~Aik~~NqL~m~EDalVIyR~~RAPeRRvFyIDvGnlpk~KAeqYl~~iM~kyrnklvYD~~TGevkdd~~~mSMl 136 (359) +..+++- +.+.+.++.=..+=.|.|-++=+|...-|..+=. ..+- T Consensus 209 ~l~da~~------r~~~~~~~~~e~~a~pqr~i~G~d~d~~~~~~~~-----------------------------~~~~ 253 (409) T protein:vir:16 209 YWQSNAK------RTLERADVTAEFYSFPQKYVTGLSDDAEPMETWK-----------------------------ATVS 253 (409) T ss_pred HHHHHHH------HHHHHHHHHHHHhcChhheeEecCCCCCccchhh-----------------------------hhhh Confidence 5555544 4567778888888899998886654222221110 0111 Q ss_pred hhhcccccCCCCccceeecCCCCCcchHHHHHHHHHHHHHhcCCCccccCCCCcccccchhhhhHHhhhHHHHHHHHHHH Q lcl|NC_015285. 137 EDFWLPRREGGRGTEISTLPGGQNLGELEDVKYFQKKLYKALNVPSSRLETETTFNIGRAAEITRDEVKFQKFIARLRKR 216 (359) Q Consensus 137 EDywLpRReGgrgTEIsTLpGgqnLgei~DV~YF~kkLy~aL~VP~SRl~~~~~~~~g~~~eItRDElKF~KFI~rLr~r 216 (359) .=..+|.-+.|-+.+|..++++.==+=++-++-.-..+....++|.+-|+..+. |-..+..|.-.|....+-+.+-|+. T Consensus 254 ~i~~~~~d~~g~~~~v~q~~~~~l~~~~~~l~~~~~~~a~~s~lP~~~lg~~~~-NpsSa~Ai~a~~~~L~~ka~~k~~~ 332 (409) T protein:vir:16 254 SMLQFTKDEDGDKPTLGQFTQPSMSPFTEQLRTAAAGFAGETGLTLDDLGFVSD-NPSSVEAIKASHENLRLAGRKAQRS 332 (409) T ss_pred HhhccCCCCCCCCceEEecCCCChhHHHHHHHHHHHHHhhhcCCCHHHcccccC-chhHHHHHHHHHHHHHHHHHHHHHH Confidence 122356666677788989988652223566666777888888999988865442 3233456777888899999999999 Q ss_pred HHHHHHHHHHHHHHhcCCCChhHHHHHhhceeeeeeccchHHHHHHHHHHHHHHHHHHHhhhhcchhhhHHHHHHHHhCC Q lcl|NC_015285. 217 FSELFTDLLKTQLILKGVMSLEEWEDMKNHIQFDFIADNYFTELKEIEIRNERMNQVAAMDPYVGKYFSVDYMRRQVLKQ 296 (359) Q Consensus 217 Fs~if~d~Lk~QLiLkgI~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~~~~~dp~vGKy~S~~~i~k~IL~~ 296 (359) |..-+..++|.-+.+.|-... +.+....+.+.|. |+.. .++.-+.+..+.+..+..- |+.+...-+..+-|++ T Consensus 333 fg~~l~~~~rla~~~~~~~~~--~~~~~~~~~v~W~-~~~~---~~~~s~a~~aDa~~Kl~~a-~~~~~~~~v~~~~~g~ 405 (409) T protein:vir:16 333 LGAGLLNVAYLAACLRDDVPY--LREQFSKTKPKWE-PLFE---ADASMLSLIGDGAIKLNQA-IPEFINKDTIRDLTGI 405 (409) T ss_pred HHHHHHHHHHHHHHHhcCCCc--cchhhccceEEec-CCCC---cchhhHHHHHHHHHHHHhh-cccccchhHHHHhccC Confidence 999999999998887664321 2222345667775 2211 1233345666666666553 5555444444566899 Q ss_pred CHHH Q lcl|NC_015285. 297 TEIE 300 (359) Q Consensus 297 tDee 300 (359) |+.| T Consensus 406 ~~~d 409 (409) T protein:vir:16 406 KGAE 409 (409) T ss_pred CCCC Confidence 9999 No 73 >protein:vir:7987 Length: 456 # NCBI annotation: gp3 # Family: family:all:5096 # MgeID: mge:151 # MgeName: Che8 # Cross-refs: genbank:acc:NP_817341;genbank:gi:29565769;genbank:GeneID:1258964 Probab=80.65 E-value=0.093 Score=26.24 Aligned_cols=277 Identities=15% Similarity=0.157 Sum_probs=115.9 Q ss_pred CCCchhhHHHhhh----------hhhheeeccccccc--------c----------CCCceeecHh-------Hhh-hhh Q lcl|NC_015285. 1 MRGVDLNQQLTQK----------AAEYFLYNPKGLKN--------S----------TNQGMKITTD-------SVT-YCH 44 (359) Q Consensus 1 ~~~~~~~~~~~~~----------~~e~f~yn~~~~~~--------~----------~~~~v~i~~~-------ai~-y~h 44 (359) +-++..+..+.-. ...+.+|++.+... . .++.+.+..- -|+ |.+ T Consensus 141 i~d~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pvv~~~N 220 (456) T protein:vir:79 141 SVDPLQPWRIRSAMRWWRDLDAESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPPVVVYQN 220 (456) T ss_pred EEcCCCCCceEEEEEEEEecCCceeEEEEEcCCceEEEEEEEEeeccccceeeeccCCceeecccccCCCCceeEEEecC Confidence 2222222111111 11122333322110 0 0000110000 001 111 Q ss_pred c-ccccCCCCcchhhHHHHHHHH-HHHHHHHHHHHHHHHhcCccceeEeccCCCCchHHHHHHHHHHHHhhcceEEeeCC Q lcl|NC_015285. 45 S-GIQDLNKNMTLSHLHKAIKAV-NQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKMVYDAN 122 (359) Q Consensus 45 S-Gl~d~~~~~i~syL~~Aik~~-NqL~m~EDalVIyR~~RAPeRRvFyIDvGnlpk~KAeqYl~~iM~kyrnklvYD~~ 122 (359) . |+=+.. -+++-++++-+++ +.+..+|.....+|+.-.-..-.+-+|..+-+- .+ .+.++. . T Consensus 221 ~~~~gd~e--~v~~liD~~~~~~s~~~~~~~~~a~~~~~~~G~~~~~~~~d~~g~~i----~~----~~~~~~------~ 284 (456) T protein:vir:79 221 PDGMGEVE--PHIDIINRINRAELQLLSTMAIQAFRQRALKSSEHRLPKVDENGNAI----DY----ASIFEA------A 284 (456) T ss_pred CCCCchhh--hhHHHHHHHHHHHHHHHHHHHHHhhHHHHHhcCCccccccccccccc----ch----hhhhhh------h Confidence 1 221111 1333334433332 223333444444444433221111112111000 00 000000 0 Q ss_pred CCcccccccchhhHhhhcccccCCCCccceeecCCCCCcc-hHHHHHHHHHHHHHhcCCCccccCCCCcccccchhhhhH Q lcl|NC_015285. 123 TGEIKDDKKFMSMMEDFWLPRREGGRGTEISTLPGGQNLG-ELEDVKYFQKKLYKALNVPSSRLETETTFNIGRAAEITR 201 (359) Q Consensus 123 TGevkdd~~~mSMlEDywLpRReGgrgTEIsTLpGgqnLg-ei~DV~YF~kkLy~aL~VP~SRl~~~~~~~~g~~~eItR 201 (359) .| -.|. +..|.+|..++... ++ -.+-++-+-..++...++|..-|+..++ |. .+..|.- T Consensus 285 ~~-------------~~~~----~~~~~~~~q~~~~~-~~~~~~~l~~~i~~i~~~t~~p~~~~~~~~~-N~-Sg~Al~~ 344 (456) T protein:vir:79 285 PG-------------ALWE----LPPGVDIWESQTND-FTPMLSAIKEHIRQLSSATKTPLPMLMPDSA-NQ-SAEGAHN 344 (456) T ss_pred cc-------------cccc----CCCCcceeeecccC-hHHHHHHHHHHHHHHHhhcCCChhHhccccc-Cc-HHHHHHH Confidence 01 1122 11234444444332 22 2344677777888888999988865432 22 3344555 Q ss_pred HhhhHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCChhHHHHHhhceeeeeeccchHHHHHHHHHHHHHHHHHHHhhhhcc Q lcl|NC_015285. 202 DEVKFQKFIARLRKRFSELFTDLLKTQLILKGVMSLEEWEDMKNHIQFDFIADNYFTELKEIEIRNERMNQVAAMDPYVG 281 (359) Q Consensus 202 DElKF~KFI~rLr~rFs~if~d~Lk~QLiLkgI~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~~~~~dp~vG 281 (359) -+..+-.-+.+.|+.|..-+.+.++.-+.+.|.. ++ ..|++.|..-..=+. .+..+++..+..- T Consensus 345 ~~~~l~~k~~~~~~~f~~~l~~~~~l~~~~~g~~--~~-----~~i~v~w~~~~~~s~-------~~~ada~~kl~~~-- 408 (456) T protein:vir:79 345 IEKGFLFKCEDRLSIAKIGLEAILVKALQIEGES--VE-----DTVDVSFESPDRVTL-------GEKYSAASLAKAA-- 408 (456) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCC--cc-----ccceEEeCCCCCcCH-------HHHHHHHHHHHhc-- Confidence 5666777888999999999999999888888842 21 247788865433332 3345554444321 Q ss_pred hhhhHHHHHHHHhCCCHHHHHHHHHHHHHHHhcCCCCCCcchhhhcCCCCCcccccccCCCCCcC Q lcl|NC_015285. 282 KYFSVDYMRRQVLKQTEIEIKEIDEQIASEMEAGIIADPMAEMDPAMAAGGEGAPAAEVDPNAQE 346 (359) Q Consensus 282 Ky~S~~~i~k~IL~~tDeeI~e~~kqi~~E~~~~~~~~P~~~~~~~~~~~~~~~~~~~~~~~~~~ 346 (359) .+.|.+.+ ..+|++|+++|++.+.+-.++..+....+|- -..+|.+-. T Consensus 409 G~~~~~~~-~~~lg~~~~~i~~~e~~r~~~e~~~~~~~~~----------------~~~~~~~~~ 456 (456) T protein:vir:79 409 GESWASIR-RNILNYNADQIKQDDLDRAREQITLFAGNPV----------------QRPQEDGSR 456 (456) T ss_pred CCChHHHH-HhcCCCCHHHHHHHHHHHHHHHHHHHhhhHh----------------hcCCCCCCC Confidence 23565555 5789999999975443323332222221221 111221111 No 74 >protein:vir:102080 Length: 429 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1503 # MgeName: Fah # Cross-refs: genbank:acc:YP_512313;genbank:gi:89152482;genbank:GeneID:3953073 Probab=78.55 E-value=0.11 Score=25.77 Aligned_cols=286 Identities=10% Similarity=0.129 Sum_probs=122.1 Q ss_pred CCC--------chhhHHHhhhhhhheeeccc-------------------cccccCCCceeecHhHhhhhhcccccCCCC Q lcl|NC_015285. 1 MRG--------VDLNQQLTQKAAEYFLYNPK-------------------GLKNSTNQGMKITTDSVTYCHSGIQDLNKN 53 (359) Q Consensus 1 ~~~--------~~~~~~~~~~~~e~f~yn~~-------------------~~~~~~~~~v~i~~~ai~y~hSGl~d~~~~ 53 (359) +.+ +..|. ..+.+..+|. ......+..+.++.+-|.+..-+. ..++= T Consensus 113 l~Gnay~~i~r~~~G~-----~~~L~~i~~~~v~v~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~evih~~~~~-~~~~~ 186 (429) T protein:vir:10 113 LYGNSYANIEFDRKGK-----VQALWPIDASKVTVYIDDVGLLNSKTKMWYVVNTGGQQRVLKPEEILHFKNGI-TLDGL 186 (429) T ss_pred hcCCeEEEEEECCCCc-----EEEEEEEcCceeEEEEcCcccccccceEEEEEccCCeEEEEccccEEEecCCC-CCCCc Confidence 111 11110 1111111111 112233344668888777764332 22333 Q ss_pred cchhhHHHHHHHHHHHHHHHHHHHHHHHhcCccceeEeccCCCCchHHHHHHHHHHHHhhcceEEeeCCCCcccccccch Q lcl|NC_015285. 54 MTLSHLHKAIKAVNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKMVYDANTGEIKDDKKFM 133 (359) Q Consensus 54 ~i~syL~~Aik~~NqL~m~EDalVIyR~~RAPeRRvFyIDvGnlpk~KAeqYl~~iM~kyrnklvYD~~TGevkdd~~~m 133 (359) .-+|.|..|.+++.....++....=+----+.-+-+..++ +.|.+.++++..+.+...|..- ...|. .+ T Consensus 187 ~G~s~i~~~~~~i~~~~~~~~~~~~~~~ng~~~~~il~~~-~~l~~e~~~~~~~~~~~~~~g~----~n~~~------~~ 255 (429) T protein:vir:10 187 VGVPTMEYLKSTLENSASADKFINNFYKQGLQVKGLVQYV-GDLNEDAKKVFRENFESMSSGL----QNSHR------IA 255 (429) T ss_pred ccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEcC-CCCCHHHHHHHHHHHHHHhccc----cccCc------ee Confidence 3468899999999998888887766655555556777776 5677777766655554444220 01121 11 Q ss_pred hhHhhhcccccCCCCccceeecCCCCCcch---HHHHHHHHHHHHHhcCCCccccCCCCcccccchhhhhHHhhhHHHH- Q lcl|NC_015285. 134 SMMEDFWLPRREGGRGTEISTLPGGQNLGE---LEDVKYFQKKLYKALNVPSSRLETETTFNIGRAAEITRDEVKFQKF- 209 (359) Q Consensus 134 SMlEDywLpRReGgrgTEIsTLpGgqnLge---i~DV~YF~kkLy~aL~VP~SRl~~~~~~~~g~~~eItRDElKF~KF- 209 (359) .++ .|.+++.|. .+..+ ++-.++..+.+.++++||.+-|....+-+.....+..+. |.++ T Consensus 256 -vl~----------~g~~~~~l~--~~~~d~q~~e~~~~~~~~Ia~~fgVP~~~lg~~~~~~~sn~e~~~~~---f~~~~ 319 (429) T protein:vir:10 256 -LMP----------VGYQFQPIS--LNMSDAQFLENTELTIRQIATAFGIKMHQLNDLSKATLNNIEQQQQQ---FYTDT 319 (429) T ss_pred -ecC----------CCceEEEcc--CChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHHHH---HHHHH Confidence 121 245555553 22223 333457788999999999999964333233233332222 4332 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhcCCCChhHHHHHhhceeeeeeccchHHHHHHHHHHHHHHHHHHHhhhhcchhhhHHHH Q lcl|NC_015285. 210 IARLRKRFSELFTDLLKTQLILKGVMSLEEWEDMKNHIQFDFIADNYFTELKEIEIRNERMNQVAAMDPYVGKYFSVDYM 289 (359) Q Consensus 210 I~rLr~rFs~if~d~Lk~QLiLkgI~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~~~~~dp~vGKy~S~~~i 289 (359) |.-+-..+. + -+-+.++++.+|. ..+.+.|..+. +...+ +..|++.++.+-.- -+++.+-+ T Consensus 320 l~P~~~~ie----~-----~ln~kl~~~~~~~---~g~~~~fd~~~----ll~~d-~~~~~~~~~~~~~~--G~~T~NE~ 380 (429) T protein:vir:10 320 LQATLTMYE----Q-----EMTYKLFLDSELD---KGFYSKFNVDA----ILRAD-IKTRYEAYRTGIQG--GFLKPNEA 380 (429) T ss_pred HHHHHHHHH----H-----HHHHhhcChhhcC---CCcEEEeechh----hhcCC-HHHHHHHHHHHHhC--CCcCHHHH Confidence 222222222 2 2223344555554 33345554332 22111 23455555554332 25566655 Q ss_pred HHHHhCCCHHHHHHHHHHHHHHHhcCCCCCCcchhhhcCCCCCcccccccCCCCCcCCCCCCCCCccCC Q lcl|NC_015285. 290 RRQVLKQTEIEIKEIDEQIASEMEAGIIADPMAEMDPAMAAGGEGAPAAEVDPNAQESSVDPGDVRRGE 358 (359) Q Consensus 290 ~k~IL~~tDeeI~e~~kqi~~E~~~~~~~~P~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~ 358 (359) +. ++++.+. +.-++.+-. ... -|-+..+.....+ |...+ ....++..|- T Consensus 381 R~-~~gl~p~--~ggD~~~~~---~n~--~~~d~~~~~~~k~--g~~~~----------~~~~~~~e~~ 429 (429) T protein:vir:10 381 RS-KEDLPPE--AGGDRLLVN---GNM--LPIDMAGQAYLKG--GDTNG----------EVSKEGNEGN 429 (429) T ss_pred HH-HhCCCCC--CCcCeeeec---ccc--cchhhccccccCC--CCCCC----------CCCCCCCCCC Confidence 53 3555321 000000000 000 0000000000000 00000 0011111111 No 75 >protein:vir:96179 Length: 468 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1602 # MgeName: 37 # Cross-refs: genbank:acc:YP_240075;genbank:gi:66395736;genbank:GeneID:5133166 Probab=78.46 E-value=0.11 Score=25.75 Aligned_cols=271 Identities=10% Similarity=0.075 Sum_probs=108.3 Q ss_pred CCCchhhHHHhhhh-hhheeecccccc-------ccCCCceeecHhHhhhhhc-ccccC----CCCcchhhHHHHHHHHH Q lcl|NC_015285. 1 MRGVDLNQQLTQKA-AEYFLYNPKGLK-------NSTNQGMKITTDSVTYCHS-GIQDL----NKNMTLSHLHKAIKAVN 67 (359) Q Consensus 1 ~~~~~~~~~~~~~~-~e~f~yn~~~~~-------~~~~~~v~i~~~ai~y~hS-Gl~d~----~~~~i~syL~~Aik~~N 67 (359) -++..+.+++++.. ..+|.+..++.. .....+...... .|- |.++. ++..-.|-++..+.... T Consensus 181 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~iPvv~~~n~~~g~sd~e~v~~liD 256 (468) T protein:vir:96 181 ELDGGERVEYWTANDVTFYELKDGQLIPDYYQGEEHVQAHYYVGNK----SMSWNRVPFIPFKNNPQEVSDLFMYKTIID 256 (468) T ss_pred EecCceEEEEEeCCeEEEEEEcCCceeecccccccccccceeeccc----cccCCcccEEEecCCCCCCCchHHHHHHHH Confidence 11111222221111 011111111100 000111111111 111 11111 12222344555444444 Q ss_pred HHHH-HHHHHHHHHHhcCccceeEeccCCCCchHHHHHHHHHHHHhhcceEEeeCCCCcccccccchhhHhhhcccccCC Q lcl|NC_015285. 68 QLRM-IEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKMVYDANTGEIKDDKKFMSMMEDFWLPRREG 146 (359) Q Consensus 68 qL~m-~EDalVIyR~~RAPeRRvFyIDvGnlpk~KAeqYl~~iM~kyrnklvYD~~TGevkdd~~~mSMlEDywLpRReG 146 (359) .|.+ +-+..-.-+.++.|-+-+.-.+... .++.+.. |..++ . ++++= ++ T Consensus 257 a~d~~~S~~~~~~~~~~~p~lv~~g~~~~~-----~~~~~~~-~~~~~--~---------------------i~~~~-d~ 306 (468) T protein:vir:96 257 AMDKRLSDTQNTFDEATELIYVLKGYEGED-----LEEFMYN-LKYYK--A---------------------INVDG-DG 306 (468) T ss_pred HHHHHHHHHHHHHHHhcCceeeeecCCccc-----cchhhhh-hhcCc--e---------------------EEecC-CC Confidence 4432 3333334455666654333211111 1111110 11111 1 12322 22 Q ss_pred CCccceeecCCCCCcchH-HHHHHHHHHHHHhcCCCccccCCCCcccccchh--hhhHHhhhHHHHHHHHHHHHHHHHHH Q lcl|NC_015285. 147 GRGTEISTLPGGQNLGEL-EDVKYFQKKLYKALNVPSSRLETETTFNIGRAA--EITRDEVKFQKFIARLRKRFSELFTD 223 (359) Q Consensus 147 grgTEIsTLpGgqnLgei-~DV~YF~kkLy~aL~VP~SRl~~~~~~~~g~~~--eItRDElKF~KFI~rLr~rFs~if~d 223 (359) +.+ +..|....+.... .-++-+.+.+|...++|- +..++ |. |..| .|.........-+.+.+..|...+.+ T Consensus 307 ~~~--~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~--~~~~~-~~-~n~Sg~Alk~~~~~l~~k~~~k~~~~~~~l~~ 380 (468) T protein:vir:96 307 SGG--VDTIQIDVPVQSAKEYLDMLRDYVIEFGQGVD--FQQDK-FG-NSPSGIALKFMYSNLDLKANKLKNKTLTALQE 380 (468) T ss_pred CCc--ceEEeecCChHHHHHHHHHHHHHHHHHhCccc--ccccc-cc-cchHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 222 4444333333333 347778888999999983 32222 21 2222 23222222344567777777777777 Q ss_pred HHHHHHHhcCCCChhHHHHHhhceeeeeeccchHHHHHHHHHHHHHHHHHHHhhhhcchhhhHHHHHHHHhCCCHHHHHH Q lcl|NC_015285. 224 LLKTQLILKGVMSLEEWEDMKNHIQFDFIADNYFTELKEIEIRNERMNQVAAMDPYVGKYFSVDYMRRQVLKQTEIEIKE 303 (359) Q Consensus 224 ~Lk~QLiLkgI~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~~~~~dp~vGKy~S~~~i~k~IL~~tDeeI~e 303 (359) +++.=+-+.|+. -+| ..|.+.|...---.+...++ ++.++ | .+|.+++++. |...++ .++ T Consensus 381 ~~~li~~~~g~~--~d~----~~i~i~f~~~~p~d~~e~a~-------~~~~~----g-~iS~et~i~~-l~~v~D-~~~ 440 (468) T protein:vir:96 381 LLQYIIDFYKLS--IKV----QDVEITFNFNVMVNELEQSQ-------IGVNS----Q-YLSKETVVTN-HPWVDD-PVA 440 (468) T ss_pred HHHHHHHHhCCC--ccc----ceeeEEecCCCCcCHHHHHH-------HHHhc----C-CCchHHHHHh-CCCCCC-HHH Confidence 776654455542 233 35677776554444433333 23332 2 5799999977 555432 455 Q ss_pred HHHHHHHHHhcCCCCCCcchhhhcCCCCCccccccc Q lcl|NC_015285. 304 IDEQIASEMEAGIIADPMAEMDPAMAAGGEGAPAAE 339 (359) Q Consensus 304 ~~kqi~~E~~~~~~~~P~~~~~~~~~~~~~~~~~~~ 339 (359) +.++|++|...- ++...+..++++..+- T Consensus 441 E~~ri~~E~~~~--------~~~~~~~~~~~~~~~~ 468 (468) T protein:vir:96 441 EMERIDQEELAL--------PSIEEGLNGKENNEPT 468 (468) T ss_pred HHHHHHHHHHHH--------HHHhhccCCCCCCCCC Confidence 566666664431 1111223333322221 No 76 >protein:vir:1236 Length: 483 # NCBI annotation: similar to phage Spp1 gp6 (portal protein) # Family: family:all:125 # MgeID: mge:25 # MgeName: phi ETA # Cross-refs: genbank:acc:NP_510935;genbank:gi:17426269;genbank:GeneID:927380 Probab=77.45 E-value=0.12 Score=25.54 Aligned_cols=282 Identities=11% Similarity=0.087 Sum_probs=107.4 Q ss_pred CCCchhhHHHhhh-hhhheeeccccccccCC---CceeecHhHhhhhhc-ccccC----CCCcchhhHHHHHHHHHHHH- Q lcl|NC_015285. 1 MRGVDLNQQLTQK-AAEYFLYNPKGLKNSTN---QGMKITTDSVTYCHS-GIQDL----NKNMTLSHLHKAIKAVNQLR- 70 (359) Q Consensus 1 ~~~~~~~~~~~~~-~~e~f~yn~~~~~~~~~---~~v~i~~~ai~y~hS-Gl~d~----~~~~i~syL~~Aik~~NqL~- 70 (359) -.++.+.++++.. ...||.+.......... ....+.. ..|. |-++. ++..-.|=++..+.....+. T Consensus 190 ~~~~~~~~~~y~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~g~vPvv~~~nn~~g~sd~e~v~~liDa~d~ 265 (483) T protein:vir:12 190 KLENETKVEYWDKVTVNYYVYENGSLIPDYSNNLENSKTHF----STGSWGKIPFIPFKNNDLEISDIFMYKTLIDAYNR 265 (483) T ss_pred EeecceEEEEEecCeEEEEEEeCCeeeeccccccccccccc----ccCCCCccceEEecCCCCCCCchhhHHHHHHHHHH Confidence 1222223333221 11222222221111100 0011110 1111 21111 11122333443333333332 Q ss_pred HHHHHHHHHHHhcCccceeEeccCCCCchHHHHHHHHHHHHhhcceEEeeCCCCcccccccchhhHhhhcccccCCCCcc Q lcl|NC_015285. 71 MIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKMVYDANTGEIKDDKKFMSMMEDFWLPRREGGRGT 150 (359) Q Consensus 71 m~EDalVIyR~~RAPeRRvFyIDvGnlpk~KAeqYl~~iM~kyrnklvYD~~TGevkdd~~~mSMlEDywLpRReGgrgT 150 (359) ++=+....-+.++.|-+-+.-.+.-++. +..+ .+..++...-++ |. T Consensus 266 ~~S~~~~~~~~~~~~~lv~~g~~~~~~~-----~~~~---------------------------~~~~~~~~~~~~--~~ 311 (483) T protein:vir:12 266 RLSDLSNTFKDSNELTYVLTNYDDQELP-----EFKR---------------------------LLRYYGAIKVSD--NG 311 (483) T ss_pred HHHHHHHHHHHhcCceeeeecCCcccch-----hHHH---------------------------hhhhccccccCC--CC Confidence 3445555556677775543322222211 1111 111111111111 12 Q ss_pred ceeecCCCCCcchH-HHHHHHHHHHHHhcCCCccccCCCCcccccchhhhhHHhhhHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_015285. 151 EISTLPGGQNLGEL-EDVKYFQKKLYKALNVPSSRLETETTFNIGRAAEITRDEVKFQKFIARLRKRFSELFTDLLKTQL 229 (359) Q Consensus 151 EIsTLpGgqnLgei-~DV~YF~kkLy~aL~VP~SRl~~~~~~~~g~~~eItRDElKF~KFI~rLr~rFs~if~d~Lk~QL 229 (359) ++.+|-...+.+.. .-+.-+.+.+|...++|---.+.-++ |. .|..|.--+.....-+.+.++.|...+.++++.=+ T Consensus 312 ~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~-n~-Sg~Al~~~~~~l~~k~~~~~~~f~~~l~~~~~li~ 389 (483) T protein:vir:12 312 GVDTIQVEVPVENSKKYLDELYQKIMLFGQAVDFSSDKFGS-AP-SGVALEFLYTNLNLKADKLARKAKVAIQELLWFVF 389 (483) T ss_pred cceEEeecCCHHHHHHHHHHHHHHHHHHhCCCCCCcccccc-Cc-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 35555443343332 33456667788888888422221111 11 22233333444555567778888888888777533 Q ss_pred HhcCCCChhHHHHHhhceeeeeeccchHHHHHHHHHHHHHHHHHHHhhhhcchhhhHHHHHHHHhCCCHHHHHHHHHHHH Q lcl|NC_015285. 230 ILKGVMSLEEWEDMKNHIQFDFIADNYFTELKEIEIRNERMNQVAAMDPYVGKYFSVDYMRRQVLKQTEIEIKEIDEQIA 309 (359) Q Consensus 230 iLkgI~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~~~~~dp~vGKy~S~~~i~k~IL~~tDeeI~e~~kqi~ 309 (359) -+-|+ ..+|. .|.+.|....--.+.. -+++++.+. | .+|.+++++.+ ...+ +.+++.++|+ T Consensus 390 ~~~~~--~~~~~----~i~v~f~~~~p~~~~~-------~a~~~~kl~---G-iiS~et~~~~~-~~v~-d~~~E~~ri~ 450 (483) T protein:vir:12 390 EHFDI--KGEHK----DVDISFNYNKVANTEL-------QVQTAQQSM---G-IVSHETVLENH-PFVE-DLQAELERIE 450 (483) T ss_pred HHhcC--CCccc----eeeEEeCCCCCCCHHH-------HHHHHHHHh---c-cCchHHHHHhC-CCCC-CHHHHHHHHH Confidence 33333 34554 4566665433333322 244555553 3 37999999874 4432 1333444455 Q ss_pred HHHhcCCCCCCcchhhhcCCCCCcc-cccccCCCCCcCCCCCCCCCccCC Q lcl|NC_015285. 310 SEMEAGIIADPMAEMDPAMAAGGEG-APAAEVDPNAQESSVDPGDVRRGE 358 (359) Q Consensus 310 ~E~~~~~~~~P~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~p~~~~~~~ 358 (359) +|..+..-..++ .++++ +++. +. ..+ +-..+| T Consensus 451 ~E~~~~~~~~~~--------~~~~~~d~~~-------~~-~~~-~~~e~e 483 (483) T protein:vir:12 451 QEQMEYNKQLPN--------LDDGGADGAQ-------QQ-ERS-NNKESE 483 (483) T ss_pred HHHHHHHhhccc--------ccccccCCcc-------cC-CCC-CcccCC Confidence 553321111111 00000 0000 00 000 011111 No 77 >protein:vir:94426 Length: 409 # NCBI annotation: ORF009 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1506 # MgeName: 47 # Cross-refs: genbank:acc:YP_240003;genbank:gi:66395665;genbank:GeneID:5133086 Probab=77.42 E-value=0.12 Score=25.54 Aligned_cols=278 Identities=14% Similarity=0.212 Sum_probs=115.3 Q ss_pred CCCc--------hhhHHHhhhhhhheeecccc---------------ccccCCCceeecHhHhhhhhcccccCCCCcchh Q lcl|NC_015285. 1 MRGV--------DLNQQLTQKAAEYFLYNPKG---------------LKNSTNQGMKITTDSVTYCHSGIQDLNKNMTLS 57 (359) Q Consensus 1 ~~~~--------~~~~~~~~~~~e~f~yn~~~---------------~~~~~~~~v~i~~~ai~y~hSGl~d~~~~~i~s 57 (359) +.++ ..| ...+.+..+|.. +....+..+.++.+-|.+.. +....++-.=+| T Consensus 107 l~Gnay~~i~r~~~G-----~~~~L~~l~~~~v~v~~~~~~~~~~y~~~~~~g~~~~~~~~dvih~r-~~~~~~~~~G~s 180 (409) T protein:vir:94 107 EKGNAYVLIERDIYH-----QPSKLFLLNPDVVEMLIENQSRELYYSIHAATGNKLIVHNMDMLHFK-HIVASNMVQGIS 180 (409) T ss_pred hcCCeEEEEEECCCC-----cEEEEEEEcCceeEEEEeCCCcEEEEEEEcCCceEEEEccccEEEec-CCCCCCcccccc Confidence 1111 111 112333322211 01112334667777776652 111112111234 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHhcCccceeEeccCCCCchHHHHHHHHHHHHhhcceEEeeCCCCcccccccchhhHh Q lcl|NC_015285. 58 HLHKAIKAVNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKMVYDANTGEIKDDKKFMSMME 137 (359) Q Consensus 58 yL~~Aik~~NqL~m~EDalVIyR~~RAPeRRvFyIDvGnlpk~KAeqYl~~iM~kyrnklvYD~~TGevkdd~~~mSMlE 137 (359) -|..|.+.......++.. -+....+.| .++..-.+.+.+.+++...+.+.+.|. ++|.+ + .++ T Consensus 181 ~l~~~~~~i~~~~~~~~~-~~~~~~~~~--~~i~~~~~~l~~e~~~~~~~~~~~~~~-------~~g~~------~-vl~ 243 (409) T protein:vir:94 181 PIDVLKNTTDFDNAVRTF-NLTEMQKPD--SFMLKYGSNVGKEKRQQVLEDFKQYYE-------ENGGI------L-FQE 243 (409) T ss_pred HHHHHHHHHHHHHHHHHH-HHHhcCCCC--eeEEecCCCCCHHHHHHHHHHHHHHhh-------cCCCe------e-ecC Confidence 555555555544444444 344444544 233334556666666555555444332 23322 1 121 Q ss_pred hhcccccCCCCccceeecCCC-CCcchHHHHHHHHHHHHHhcCCCccccCCCCcccccchhhhhHHhhhHHHHH-HHHHH Q lcl|NC_015285. 138 DFWLPRREGGRGTEISTLPGG-QNLGELEDVKYFQKKLYKALNVPSSRLETETTFNIGRAAEITRDEVKFQKFI-ARLRK 215 (359) Q Consensus 138 DywLpRReGgrgTEIsTLpGg-qnLgei~DV~YF~kkLy~aL~VP~SRl~~~~~~~~g~~~eItRDElKF~KFI-~rLr~ 215 (359) -|.+++.|.-. +.+.-++-..|-.+.+.++++||...|...+.-+.....+..+. |.+++ .-+-. T Consensus 244 ----------~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~~~---f~~~~l~P~~~ 310 (409) T protein:vir:94 244 ----------PGVEIEPLPKKYVSEDIVASENLTRERVANVFQLPSVFLNARSNTNFAKNEELNRF---YLQHTLLPIVK 310 (409) T ss_pred ----------CCceEEEcCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHHHH---HHHHHHHHHHH Confidence 25677777532 22223444456678899999999999976554454444444444 65553 33322 Q ss_pred HHHHHHHHHHHHHHHhcCCCChhHHHHHhhceeeeeeccchHHHHHHHHHHHHHHHHHHHhhhhcchhhhHHHHHHHHhC Q lcl|NC_015285. 216 RFSELFTDLLKTQLILKGVMSLEEWEDMKNHIQFDFIADNYFTELKEIEIRNERMNQVAAMDPYVGKYFSVDYMRRQVLK 295 (359) Q Consensus 216 rFs~if~d~Lk~QLiLkgI~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~~~~~dp~vGKy~S~~~i~k~IL~ 295 (359) ++ .+.|-. .++++.+|.. ...|.|..+ ++...+ +..|++.+..+-.- -+++..-++. +++ T Consensus 311 ~i----e~~ln~-----~Ll~~~~~~~---~~~i~fd~~----~ll~~d-~~~~~~~~~~~~~~--G~~T~NE~R~-~~g 370 (409) T protein:vir:94 311 QY----EEEFNR-----KLLTKTDREK---NRYFKFNVK----SYLRAD-SATQAEVYFKAVRS--GYYTINDIRE-WED 370 (409) T ss_pred HH----HHHHHH-----hhCCcccccC---cceEEeech----hhhccC-HHHHHHHHHHHHhC--CCcCHHHHHH-HhC Confidence 22 222322 3345555442 334555433 333333 34566666555322 3556666653 355 Q ss_pred CCHHHHHHHHHHHHHHHhcCCCCCCcchhhhcCCCCCcccccccCCCCCcCCCCCCCCCccC Q lcl|NC_015285. 296 QTEIEIKEIDEQIASEMEAGIIADPMAEMDPAMAAGGEGAPAAEVDPNAQESSVDPGDVRRG 357 (359) Q Consensus 296 ~tDeeI~e~~kqi~~E~~~~~~~~P~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~ 357 (359) +.+-+= -++-+- ..... ..+...+......|+ -+|+..| T Consensus 371 ~~p~~g--gD~~~~---~~n~~-~~~~~~~~~~~~kGG-----------------~~n~~e~ 409 (409) T protein:vir:94 371 LPPVEG--GDKPLI---SGDLY-PIDTPLELRKSLKGG-----------------DKNVNES 409 (409) T ss_pred CCCCCC--cCeEee---ccccc-ccccchhhcccccCC-----------------CCCcCCC Confidence 543210 000000 00000 000000000001111 1122222 No 78 >protein:vir:102602 Length: 456 # NCBI annotation: gp3 # Family: family:all:5096 # MgeID: mge:1661 # MgeName: Llij # Cross-refs: genbank:acc:YP_654999;genbank:gi:109392189;genbank:GeneID:4157224 Probab=77.09 E-value=0.13 Score=25.47 Aligned_cols=277 Identities=14% Similarity=0.144 Sum_probs=115.5 Q ss_pred CCCchhhHHHhhhhhhh----------eeeccccc------------------cccCCCceeecHh-------Hhhhhhc Q lcl|NC_015285. 1 MRGVDLNQQLTQKAAEY----------FLYNPKGL------------------KNSTNQGMKITTD-------SVTYCHS 45 (359) Q Consensus 1 ~~~~~~~~~~~~~~~e~----------f~yn~~~~------------------~~~~~~~v~i~~~-------ai~y~hS 45 (359) +-|+..+..+.-.+.-| .+|++.+. ...+...+.+... -|+|.+. T Consensus 141 i~d~~~~~~~~~~i~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pvv~~~N 220 (456) T protein:vir:10 141 SVDPLQPWRIRAAMRWWRDLDAESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPPVVVYQN 220 (456) T ss_pred EEcCCCCcceEEEEEEEEecCCceeEEEEEeccceeEEEEEEEEeecccceeeeecCCceeeccccCCCCCceeEEEecC Confidence 22222222211111100 01111110 0001111111100 0222221 Q ss_pred --ccccCCCCcchhhHHHHHHHHHHHHHHHHHHHHHHHhcCccceeEeccCCCCchHHHHHHHHHHHHhhcceEEeeCCC Q lcl|NC_015285. 46 --GIQDLNKNMTLSHLHKAIKAVNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKMVYDANT 123 (359) Q Consensus 46 --Gl~d~~~~~i~syL~~Aik~~NqL~m~EDalVIyR~~RAPeRRvFyIDvGnlpk~KAeqYl~~iM~kyrnklvYD~~T 123 (359) |+=+. --+++..+++-+.+- |.++.-...--|.|-+.-.+.+. +.. | .+ T Consensus 221 ~~g~gd~--e~vi~liDa~~~~~s------~~~~~~~~~a~~~~~i~G~~~~~-~~~-------------------d-~~ 271 (456) T protein:vir:10 221 PDGMGEV--EPHIDIINRINRAEL------QLLSTMAIQAFRQRALKSTEHGL-PNV-------------------D-EN 271 (456) T ss_pred CCCCchh--hhhHHHHHHHHHHHH------HHHHHHHHhhhHhHhhhccCccc-ccc-------------------c-cc Confidence 22111 113333343333332 33333333333444333222111 100 0 01 Q ss_pred CcccccccchhhHhh-hcccccCCCCccceeecCCCCCcch-HHHHHHHHHHHHHhcCCCccccCCCCcccccchhhhhH Q lcl|NC_015285. 124 GEIKDDKKFMSMMED-FWLPRREGGRGTEISTLPGGQNLGE-LEDVKYFQKKLYKALNVPSSRLETETTFNIGRAAEITR 201 (359) Q Consensus 124 Gevkdd~~~mSMlED-ywLpRReGgrgTEIsTLpGgqnLge-i~DV~YF~kkLy~aL~VP~SRl~~~~~~~~g~~~eItR 201 (359) |..-+..+....-.+ .|. ...|..|..++... ++. ++-++-.-..+....++|.+-|+..++ |. .+..|.- T Consensus 272 g~~~~~~~~~~~~~~~~~~----~~~~~~~~q~~~~~-~~~~~~~l~~~i~~~~~~s~~p~~~~~~~~~-N~-Sg~Ai~~ 344 (456) T protein:vir:10 272 GNAIDYASIFEAAPGALWE----LPPGVDIWESQAND-FTPMLSAIKEHIRQLSSATKTPLPMLMPDSA-NQ-SAEGAHN 344 (456) T ss_pred ccccchhhhhhhhcccccc----CCCCcceEEecccC-hhHHHHHHHHHHHHHHhccCCChHHhccccc-Ch-HHHHHHH Confidence 100000000000000 132 11345566676543 443 344677777788888999888865332 22 3445566 Q ss_pred HhhhHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCChhHHHHHhhceeeeeeccchHHHHHHHHHHHHHHHHHHHhhhhcc Q lcl|NC_015285. 202 DEVKFQKFIARLRKRFSELFTDLLKTQLILKGVMSLEEWEDMKNHIQFDFIADNYFTELKEIEIRNERMNQVAAMDPYVG 281 (359) Q Consensus 202 DElKF~KFI~rLr~rFs~if~d~Lk~QLiLkgI~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~~~~~dp~vG 281 (359) -+..+-.-+.+.|+.|..-+.+.++.-+.+.|... + ..+++.|..-..=+. .+.++++..+..- T Consensus 345 ~~~~l~~k~~~~~~~f~~~l~~~~rl~~~~~g~~~---~----~~~~v~w~~~~~~~~-------~~~ada~~kl~~~-- 408 (456) T protein:vir:10 345 IEKGFLFKCEDRLSIAKIGLEAILVKALQIEGESV---E----DTVDVSFESPDRVTL-------GEKYSAASLAKAA-- 408 (456) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCc---c----cceeEEecCCCCcCH-------HHHHHHHHHHHHc-- Confidence 66667778889999999999999998888888431 1 357788854433222 2334555544321 Q ss_pred hhhhHHHHHHHHhCCCHHHHHHHHHHHHHHHhcCCCCCCcchhhhcCCCCCcccccccCCCCCcC Q lcl|NC_015285. 282 KYFSVDYMRRQVLKQTEIEIKEIDEQIASEMEAGIIADPMAEMDPAMAAGGEGAPAAEVDPNAQE 346 (359) Q Consensus 282 Ky~S~~~i~k~IL~~tDeeI~e~~kqi~~E~~~~~~~~P~~~~~~~~~~~~~~~~~~~~~~~~~~ 346 (359) ...|... ...+|++++++|++.+.+-.+|..+....+|-. . .+|++.. T Consensus 409 gi~~~~~-~~~~lg~~~~~i~~~e~er~~~e~~~~~~~~~~--------------~--~~~~~~~ 456 (456) T protein:vir:10 409 GESWASI-RRNILNYNADQIKQDDLDRAREQITLFAGNPVQ--------------R--PQEDGSR 456 (456) T ss_pred CCChHHH-HHhhCCCCHHHHHHHHHHHHHHHHHHHhhhhhh--------------c--CCCCCCC Confidence 2345554 457899999998754332222222211111100 0 0222222 No 79 >protein:vir:105819 Length: 456 # NCBI annotation: gp3 # Family: family:all:5096 # MgeID: mge:1636 # MgeName: PMC # Cross-refs: genbank:acc:YP_655764;genbank:gi:109522087;genbank:GeneID:4157627 Probab=77.09 E-value=0.13 Score=25.47 Aligned_cols=277 Identities=14% Similarity=0.144 Sum_probs=115.5 Q ss_pred CCCchhhHHHhhhhhhh----------eeeccccc------------------cccCCCceeecHh-------Hhhhhhc Q lcl|NC_015285. 1 MRGVDLNQQLTQKAAEY----------FLYNPKGL------------------KNSTNQGMKITTD-------SVTYCHS 45 (359) Q Consensus 1 ~~~~~~~~~~~~~~~e~----------f~yn~~~~------------------~~~~~~~v~i~~~-------ai~y~hS 45 (359) +-|+..+..+.-.+.-| .+|++.+. ...+...+.+... -|+|.+. T Consensus 141 i~d~~~~~~~~~~i~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~pvv~~~N 220 (456) T protein:vir:10 141 SVDPLQPWRIRAAMRWWRDLDAESDFAIVWSGDGWQKFARPCFVQSSSRRRLVTRISDSWVPVGDAVVTGSPPPVVVYQN 220 (456) T ss_pred EEcCCCCcceEEEEEEEEecCCceeEEEEEeccceeEEEEEEEEeecccceeeeecCCceeeccccCCCCCceeEEEecC Confidence 22222222211111100 01111110 0001111111100 0222221 Q ss_pred --ccccCCCCcchhhHHHHHHHHHHHHHHHHHHHHHHHhcCccceeEeccCCCCchHHHHHHHHHHHHhhcceEEeeCCC Q lcl|NC_015285. 46 --GIQDLNKNMTLSHLHKAIKAVNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKMVYDANT 123 (359) Q Consensus 46 --Gl~d~~~~~i~syL~~Aik~~NqL~m~EDalVIyR~~RAPeRRvFyIDvGnlpk~KAeqYl~~iM~kyrnklvYD~~T 123 (359) |+=+. --+++..+++-+.+- |.++.-...--|.|-+.-.+.+. +.. | .+ T Consensus 221 ~~g~gd~--e~vi~liDa~~~~~s------~~~~~~~~~a~~~~~i~G~~~~~-~~~-------------------d-~~ 271 (456) T protein:vir:10 221 PDGMGEV--EPHIDIINRINRAEL------QLLSTMAIQAFRQRALKSTEHGL-PNV-------------------D-EN 271 (456) T ss_pred CCCCchh--hhhHHHHHHHHHHHH------HHHHHHHHhhhHhHhhhccCccc-ccc-------------------c-cc Confidence 22111 113333343333332 33333333333444333222111 100 0 01 Q ss_pred CcccccccchhhHhh-hcccccCCCCccceeecCCCCCcch-HHHHHHHHHHHHHhcCCCccccCCCCcccccchhhhhH Q lcl|NC_015285. 124 GEIKDDKKFMSMMED-FWLPRREGGRGTEISTLPGGQNLGE-LEDVKYFQKKLYKALNVPSSRLETETTFNIGRAAEITR 201 (359) Q Consensus 124 Gevkdd~~~mSMlED-ywLpRReGgrgTEIsTLpGgqnLge-i~DV~YF~kkLy~aL~VP~SRl~~~~~~~~g~~~eItR 201 (359) |..-+..+....-.+ .|. ...|..|..++... ++. ++-++-.-..+....++|.+-|+..++ |. .+..|.- T Consensus 272 g~~~~~~~~~~~~~~~~~~----~~~~~~~~q~~~~~-~~~~~~~l~~~i~~~~~~s~~p~~~~~~~~~-N~-Sg~Ai~~ 344 (456) T protein:vir:10 272 GNAIDYASIFEAAPGALWE----LPPGVDIWESQAND-FTPMLSAIKEHIRQLSSATKTPLPMLMPDSA-NQ-SAEGAHN 344 (456) T ss_pred ccccchhhhhhhhcccccc----CCCCcceEEecccC-hhHHHHHHHHHHHHHHhccCCChHHhccccc-Ch-HHHHHHH Confidence 100000000000000 132 11345566676543 443 344677777788888999888865332 22 3445566 Q ss_pred HhhhHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCChhHHHHHhhceeeeeeccchHHHHHHHHHHHHHHHHHHHhhhhcc Q lcl|NC_015285. 202 DEVKFQKFIARLRKRFSELFTDLLKTQLILKGVMSLEEWEDMKNHIQFDFIADNYFTELKEIEIRNERMNQVAAMDPYVG 281 (359) Q Consensus 202 DElKF~KFI~rLr~rFs~if~d~Lk~QLiLkgI~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~~~~~dp~vG 281 (359) -+..+-.-+.+.|+.|..-+.+.++.-+.+.|... + ..+++.|..-..=+. .+.++++..+..- T Consensus 345 ~~~~l~~k~~~~~~~f~~~l~~~~rl~~~~~g~~~---~----~~~~v~w~~~~~~~~-------~~~ada~~kl~~~-- 408 (456) T protein:vir:10 345 IEKGFLFKCEDRLSIAKIGLEAILVKALQIEGESV---E----DTVDVSFESPDRVTL-------GEKYSAASLAKAA-- 408 (456) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCc---c----cceeEEecCCCCcCH-------HHHHHHHHHHHHc-- Confidence 66667778889999999999999998888888431 1 357788854433222 2334555544321 Q ss_pred hhhhHHHHHHHHhCCCHHHHHHHHHHHHHHHhcCCCCCCcchhhhcCCCCCcccccccCCCCCcC Q lcl|NC_015285. 282 KYFSVDYMRRQVLKQTEIEIKEIDEQIASEMEAGIIADPMAEMDPAMAAGGEGAPAAEVDPNAQE 346 (359) Q Consensus 282 Ky~S~~~i~k~IL~~tDeeI~e~~kqi~~E~~~~~~~~P~~~~~~~~~~~~~~~~~~~~~~~~~~ 346 (359) ...|... ...+|++++++|++.+.+-.+|..+....+|-. . .+|++.. T Consensus 409 gi~~~~~-~~~~lg~~~~~i~~~e~er~~~e~~~~~~~~~~--------------~--~~~~~~~ 456 (456) T protein:vir:10 409 GESWASI-RRNILNYNADQIKQDDLDRAREQITLFAGNPVQ--------------R--PQEDGSR 456 (456) T ss_pred CCChHHH-HHhhCCCCHHHHHHHHHHHHHHHHHHHhhhhhh--------------c--CCCCCCC Confidence 2345554 457899999998754332222222211111100 0 0222222 No 80 >protein:vir:3153 Length: 467 # NCBI annotation: capsid protein # Family: family:all:1379 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:316 # MgeName: PhiCh1 # Cross-refs: genbank:acc:NP_665924;genbank:gi:22091110;genbank:GeneID:951257 Probab=76.78 E-value=0.13 Score=25.41 Aligned_cols=302 Identities=11% Similarity=0.167 Sum_probs=109.4 Q ss_pred CCC---------------------chhhHHHhhhhhhheeeccc-------c---------ccccCCCceeecHhHhhhh Q lcl|NC_015285. 1 MRG---------------------VDLNQQLTQKAAEYFLYNPK-------G---------LKNSTNQGMKITTDSVTYC 43 (359) Q Consensus 1 ~~~---------------------~~~~~~~~~~~~e~f~yn~~-------~---------~~~~~~~~v~i~~~ai~y~ 43 (359) .++ ...-+..+.+-.-||.+... + .....+..+.++.+-|.+. T Consensus 93 ~r~~~G~~~~l~~l~~~~v~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~diih~ 172 (467) T protein:vir:31 93 LTQTDGTPTGLAYVPGHTIRKRMDERGFVQLLEEKEKYFGVAGDRYQTNGNGDLDPVFVDADDGSTGTSVSNPANELIFK 172 (467) T ss_pred EECCCCcEEEEEEeCCceeEeeeecceeEeecCCceeeEEeccccceeecccceeeeeeeeccccccceeEeccccEEEe Confidence 111 10001111112222222111 1 0123455688888888764 Q ss_pred hcccccCCCCcchhhHHHHHHHHHHHHHHHHHHHHHHHhc-CccceeEeccCCCCchHHHHHHHHHHHH-hhcceEEeeC Q lcl|NC_015285. 44 HSGIQDLNKNMTLSHLHKAIKAVNQLRMIEDSLVIYRLSR-APERRIFYIDVGNLPKNKAEQYLREVMG-RYRNKMVYDA 121 (359) Q Consensus 44 hSGl~d~~~~~i~syL~~Aik~~NqL~m~EDalVIyR~~R-APeRRvFyIDvGnlpk~KAeqYl~~iM~-kyrnklvYD~ 121 (359) .- .-..++-.=+|-+..|.+.+..-..++....=+ ..+ +--+-|..+.-+.+.+ ++.+-+++.+. .|++..- T Consensus 173 r~-~~~~~~~~G~s~~~~~~~~i~~~~~~~~~~~~~-f~ng~~p~gil~~~~~~l~~-e~~~~~~~~~~~~~~~~~~--- 246 (467) T protein:vir:31 173 RN-HSPLYPHYGAPDIIPAVKTIRGDSAAQDYNIDF-FENDGVPRIAIIVKGAELTE-KGREEMRNLIEDNNEDNHR--- 246 (467) T ss_pred cC-CCCCCCcccccHHHHHHHHHHHHHHHHHHHHHH-HhccCCCceEEEecCcCCCH-HHHHHHHHHHHhhhcchhh--- Confidence 21 111222233677888887776555555433211 112 2223445555455554 44444444443 3332110 Q ss_pred CCCcccccccchhhHhhhcccccCCCCccceeecCCCCC-----------------cchHHHH-HHHHHHHHHhcCCCcc Q lcl|NC_015285. 122 NTGEIKDDKKFMSMMEDFWLPRREGGRGTEISTLPGGQN-----------------LGELEDV-KYFQKKLYKALNVPSS 183 (359) Q Consensus 122 ~TGevkdd~~~mSMlEDywLpRReGgrgTEIsTLpGgqn-----------------Lgei~DV-~YF~kkLy~aL~VP~S 183 (359) -.++.--+...|-.+..|++|.. =.|+-+. ++..+..-++.+||.+ T Consensus 247 ----------------~~~~~~~g~~n~~~~~~l~~g~~~~~~~~~~~~ls~~~~~d~qf~e~~~~~~~~Ia~~fgVpp~ 310 (467) T protein:vir:31 247 ----------------TAFIETEKIVQNEDYLNLADGADRSDVEIRLEPLTVGIDEEASFLEFRGRNEHDILKVHDVPPV 310 (467) T ss_pred ----------------hhhhhhcccccccccccccCCCcccccceeEEeccccChhhHHHHHHHHHHHHHHHHHhCCCHH Confidence 00111111111222333333321 1122222 3455669999999999 Q ss_pred ccCCCCcccccchhhhhHHhhhHHHH-HHHHHHHHHHHHHHHHHHHHHhcCCCChhHHHHHhhceeeeeeccchHHHHHH Q lcl|NC_015285. 184 RLETETTFNIGRAAEITRDEVKFQKF-IARLRKRFSELFTDLLKTQLILKGVMSLEEWEDMKNHIQFDFIADNYFTELKE 262 (359) Q Consensus 184 Rl~~~~~~~~g~~~eItRDElKF~KF-I~rLr~rFs~if~d~Lk~QLiLkgI~t~eew~~~~~~I~~~f~~Dn~f~ElKe 262 (359) .|+...+-+.+. .+...-..|.++ +.-+..++...|...| ++..+ ......|+|++..--.-.. T Consensus 311 ~lG~~~~~~~~s--~~e~~~~~f~~~~l~P~~~~ie~~ln~~l---------~~~~~-~~~~~~i~f~~~~l~~~d~--- 375 (467) T protein:vir:31 311 IAGVVESGAFST--DAEEQRKEFAEETIQPKQHDFGELLYELV---------HKQGL-DAPDWTIEFELAKPDTKLQ--- 375 (467) T ss_pred HcccCCCCCccc--CHHHHHHHHHHHHHHHHHHHHHHHHHHhh---------cchhh-ccCCceEEEecchhhccCH--- Confidence 986433334432 222222335444 4555555544444333 22111 1112345665553322222 Q ss_pred HHHHHHHHHHHHHhhhhcchhhhHHHHHHHHhCCCHHHHHHHHHHHHHHHhcCCCC-CCcchhhhcCCCCCcccccccCC Q lcl|NC_015285. 263 IEIRNERMNQVAAMDPYVGKYFSVDYMRRQVLKQTEIEIKEIDEQIASEMEAGIIA-DPMAEMDPAMAAGGEGAPAAEVD 341 (359) Q Consensus 263 ~Eil~~Rl~~~~~~dp~vGKy~S~~~i~k~IL~~tDeeI~e~~kqi~~E~~~~~~~-~P~~~~~~~~~~~~~~~~~~~~~ 341 (359) ..|++.+..+-. .-+++.+-+++. +++.+- . ++ ..++ .|.. ...+++.-++++...... T Consensus 376 ----~~~~~~~~~~~~--~G~~T~NE~R~~-~Gl~pi--~-------d~---~~~~~~~~~-~~~~~~~~~~~~~~~~~~ 435 (467) T protein:vir:31 376 ----DVEIASQRVQAM--QGLLTVNELRDE-FGFEPF--P-------EE---HVYGGETLV-AEVTGGSGPGGGIGDQIE 435 (467) T ss_pred ----HHHHHHHHHHHh--CCCcCHHHHHHH-hCCCCC--C-------cc---cccCCcccc-cccccccCCCCcccCcCC Confidence 344444444321 125566655533 555321 0 00 0000 0000 001111100000000000 Q ss_pred CCCcCCCCCCCCCccCCC Q lcl|NC_015285. 342 PNAQESSVDPGDVRRGEF 359 (359) Q Consensus 342 ~~~~~~~~~p~~~~~~~~ 359 (359) +..-..+.+.-+.....+ T Consensus 436 ~~~~~~~~~~~~~~~~~~ 453 (467) T protein:vir:31 436 QLVEDRADEIIDSYQADL 453 (467) T ss_pred CCCCCcccchHhhhhhcc Confidence 000000011111112222 No 81 >protein:vir:99781 Length: 511 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1497 # MgeName: phiETA2 # Cross-refs: genbank:acc:YP_001004303;genbank:gi:122891757;genbank:GeneID:4712336 Probab=75.90 E-value=0.14 Score=25.24 Aligned_cols=299 Identities=10% Similarity=0.045 Sum_probs=104.4 Q ss_pred CCCchhhHHHhhhh----------------hhheeeccccccc---cCCCceeecHhH-hhhhhc-cccc----CCCCcc Q lcl|NC_015285. 1 MRGVDLNQQLTQKA----------------AEYFLYNPKGLKN---STNQGMKITTDS-VTYCHS-GIQD----LNKNMT 55 (359) Q Consensus 1 ~~~~~~~~~~~~~~----------------~e~f~yn~~~~~~---~~~~~v~i~~~a-i~y~hS-Gl~d----~~~~~i 55 (359) +-++....+.+-.+ ..+-+|++...+. ..+.+......- -...|- |.++ +++..- T Consensus 175 vyd~~~~~~~~~~vr~~~~~~~~~~~~~~~~~~~vyt~~~i~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~nn~~g 254 (511) T protein:vir:99 175 IYDNTIERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSFERMPITEFSNNERR 254 (511) T ss_pred EEcCCCCCceEEEEEEEEeeecccCccceEEEEEEEeCCcEEEEEecCCccccccccccccccCCCCccceEEecCCCCC Confidence 11111111111111 1112555544321 111111111100 000111 1110 011112 Q ss_pred hhhHHHHHHHHHHHHHH-HHHHHHHHHhcCccceeEe---ccCCCCchHHHHHHHHHHHHhhcceEEeeCCCCccccccc Q lcl|NC_015285. 56 LSHLHKAIKAVNQLRMI-EDSLVIYRLSRAPERRIFY---IDVGNLPKNKAEQYLREVMGRYRNKMVYDANTGEIKDDKK 131 (359) Q Consensus 56 ~syL~~Aik~~NqL~m~-EDalVIyR~~RAPeRRvFy---IDvGnlpk~KAeqYl~~iM~kyrnklvYD~~TGevkdd~~ 131 (359) .|=++..+.....+..+ =+....-+-++.|-+.+.- .|.+.+++.+. ++ T Consensus 255 ~sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~G~~~~~~~~~~~~~~------------~~--------------- 307 (511) T protein:vir:99 255 KGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLNLDPVEVRKQKE------------AN--------------- 307 (511) T ss_pred CCchhhhHHHHHHHHHHHHHHHHHHHHhhchhhhhccCcccCchhhccccc------------cc--------------- Confidence 23333333333322221 1112222333444433321 11111111000 00 Q ss_pred chhhHhhhccccc--------CCCCccceeecCCCCCcchHH-HHHHHHHHHHHhcCCCccccCCCCcccccchhhhhHH Q lcl|NC_015285. 132 FMSMMEDFWLPRR--------EGGRGTEISTLPGGQNLGELE-DVKYFQKKLYKALNVPSSRLETETTFNIGRAAEITRD 202 (359) Q Consensus 132 ~mSMlEDywLpRR--------eGgrgTEIsTLpGgqnLgei~-DV~YF~kkLy~aL~VP~SRl~~~~~~~~g~~~eItRD 202 (359) ..|++.. ..+.|..+..|-...+...+. -+.-+.+.+|+...+|---.+.-+| |. .+..|..- T Consensus 308 ------~~~~~~~~~~~~~~~~~~~~~d~~~l~~~~~~~~~e~~~~~L~~~I~~~s~~P~~~~~~~~g-n~-Sg~Alk~~ 379 (511) T protein:vir:99 308 ------VLFLEPTVYADSEGRETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSG-TQ-SGEAMKYK 379 (511) T ss_pred ------ceecccccccccccccCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhCCcccccccccc-cc-hHHHHHHH Confidence 1222211 112234455554444433333 3566677788888888532221111 11 12223333 Q ss_pred hhhHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCChhHHHHHhhceeeeeeccchHHHHHHHHHHHHHHHHHHHhhhhcch Q lcl|NC_015285. 203 EVKFQKFIARLRKRFSELFTDLLKTQLILKGVMSLEEWEDMKNHIQFDFIADNYFTELKEIEIRNERMNQVAAMDPYVGK 282 (359) Q Consensus 203 ElKF~KFI~rLr~rFs~if~d~Lk~QLiLkgI~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~~~~~dp~vGK 282 (359) ...-..-+.+.++.|..-+.+.++.=+-+-++...-++..-...+.+.|....--.+ .+.++++..+. | T Consensus 380 ~~~l~~ka~~k~~~~~~~l~~~~~li~~~~~~~~~~~~~~~~~~i~i~f~~~~p~n~-------~e~~~~~~kl~---G- 448 (511) T protein:vir:99 380 LFGLEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDVSKDFNTVRYVYNRNLPKSL-------IEELKAYIDSG---G- 448 (511) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCcccccccccceEEeCCCCCcCH-------HHHHHHHHHHh---c- Confidence 333444556666667766666665422221222111122222346677764333223 23344455553 3 Q ss_pred hhhHHHHHHHHhCCCHHHHHHHHHHHHHHHhcCCCCCCcchhhhcCCCCCcccccccCCCCCcCCCCCCCCCccCC Q lcl|NC_015285. 283 YFSVDYMRRQVLKQTEIEIKEIDEQIASEMEAGIIADPMAEMDPAMAAGGEGAPAAEVDPNAQESSVDPGDVRRGE 358 (359) Q Consensus 283 y~S~~~i~k~IL~~tDeeI~e~~kqi~~E~~~~~~~~P~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~ 358 (359) .+|.+++++. |...+ +.+++.++|++|..+. .++.. ...+..+++.... -++...+ .....+| T Consensus 449 iiS~et~l~~-l~~v~-D~~~E~~ri~~E~~~~-~~~~~----~~~~~~~~~~~~~-~~~~~~~-----~~~d~~e 511 (511) T protein:vir:99 449 KISQTTLMSL-FSFFQ-DPELEVKKIEEDEKES-IKKAQ----KNMYQDPRNINDD-EQDDSTK-----DSIDKKE 511 (511) T ss_pred cCCHHHHHHh-CCCCC-CHHHHHHHHHHHHHHH-HHHHh----hcccccCCCCCCC-CCCCCCc-----CcccccC Confidence 3799999987 45543 2445555566665432 11111 1111111111111 1111111 1222233 No 82 >protein:vir:2683 Length: 412 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:57 # MgeName: phiSLT # Cross-refs: genbank:acc:NP_075502;genbank:gi:12719431;genbank:GeneID:920150 Probab=75.78 E-value=0.14 Score=25.22 Aligned_cols=274 Identities=15% Similarity=0.220 Sum_probs=113.4 Q ss_pred CCCc--------hhhHHHhhhhhhheeecccc---------------ccccCCCceeecHhHhhhhhcccccCCCCcchh Q lcl|NC_015285. 1 MRGV--------DLNQQLTQKAAEYFLYNPKG---------------LKNSTNQGMKITTDSVTYCHSGIQDLNKNMTLS 57 (359) Q Consensus 1 ~~~~--------~~~~~~~~~~~e~f~yn~~~---------------~~~~~~~~v~i~~~ai~y~hSGl~d~~~~~i~s 57 (359) +.++ ..|. ..+.+...|.. +....+..+.++.+-|.+.. +.-..++-.=+| T Consensus 110 l~Gnay~~i~r~~~G~-----~~~L~~l~~~~v~v~~~~~~~~~~y~~~~~~g~~~~~~~~evih~~-~~~~~~~~~G~s 183 (412) T protein:vir:26 110 EKGNAYVLIERDIYHQ-----PSKLFLLNPDVVEMLIENQSRELYYSIHAATGNKLIVHNMDMLHFK-HIVASNMVQGIS 183 (412) T ss_pred hcCceEEEEEECCCCc-----EEEEEEEcCceeEEEEeCCCcEEEEEEEcCCceEEEEccccEEEeC-CCCCCCCccccc Confidence 1111 1111 11222222211 11112334667777776652 111112111234 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHhcCccceeEeccCCCCchHHHHHHHHHHHHhhcceEEeeCCCCcccccccchhhHh Q lcl|NC_015285. 58 HLHKAIKAVNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKMVYDANTGEIKDDKKFMSMME 137 (359) Q Consensus 58 yL~~Aik~~NqL~m~EDalVIyR~~RAPeRRvFyIDvGnlpk~KAeqYl~~iM~kyrnklvYD~~TGevkdd~~~mSMlE 137 (359) -|..|.+.......+++. .++...+.| . +....-+.+-+.++++..+.+.+.+. ..|.+ + .+ T Consensus 184 ~i~~~~~~i~~~~a~~~~-~~~~~~~~~-~-~i~~~~~~l~~e~~~~~~~~~~~~~~-------~~g~~------~-vl- 245 (412) T protein:vir:26 184 PIDVLKNTTDFDNAVRTF-NLTEMQKPD-S-FMLKYGSNVGKEKRQQVLEDFKQYYE-------ENGGI------L-FQ- 245 (412) T ss_pred HHHHHHHHHHHHHHHHHH-HHHhcCCCC-c-eEEecCCCCCHHHHHHHHHHHHHHhh-------cCCCe------e-ec- Confidence 555555555544445554 344444443 3 33334567777777666665544332 22321 1 11 Q ss_pred hhcccccCCCCccceeecCCCCCcchHH---HHHHHHHHHHHhcCCCccccCCCCcccccchhhhhHHhhhHHHH-HHHH Q lcl|NC_015285. 138 DFWLPRREGGRGTEISTLPGGQNLGELE---DVKYFQKKLYKALNVPSSRLETETTFNIGRAAEITRDEVKFQKF-IARL 213 (359) Q Consensus 138 DywLpRReGgrgTEIsTLpGgqnLgei~---DV~YF~kkLy~aL~VP~SRl~~~~~~~~g~~~eItRDElKF~KF-I~rL 213 (359) ..|.+++.|. .+..+++ -..|-.+.+.++++||...|...++-+.+...+..+. |.++ |.-+ T Consensus 246 ---------~~g~~~~~l~--~~~~d~q~~e~~~~~~~~Ia~afgVPp~~lg~~~~~~~sn~e~~~~~---f~~~~l~P~ 311 (412) T protein:vir:26 246 ---------EPGVEIEPLP--KKYVSEDIVASENLTRERVANVFQLPSVFLNARSNTNFAKNEELNRF---YLQHTLLPI 311 (412) T ss_pred ---------CCCceEEEcC--CChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHHHH---HHHHHHHHH Confidence 1356677764 2333333 3335668899999999999987655566666665554 6555 3333 Q ss_pred HHHHHHHHHHHHHHHHHhcCCCChhHHHHHhhceeeeeeccchHHHHHHHHHHHHHHHHHHHhhhhcchhhhHHHHHHHH Q lcl|NC_015285. 214 RKRFSELFTDLLKTQLILKGVMSLEEWEDMKNHIQFDFIADNYFTELKEIEIRNERMNQVAAMDPYVGKYFSVDYMRRQV 293 (359) Q Consensus 214 r~rFs~if~d~Lk~QLiLkgI~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~~~~~dp~vGKy~S~~~i~k~I 293 (359) ..+ +.+.|-.. +.++.+|. ....|.|..+ ++...+ +.+|++.+..+-.- -+++.+-++.. T Consensus 312 ~~~----ie~~ln~k-----Ll~~~~~~---~~~~~~fd~~----~l~~~d-~~~~~~~~~~~~~~--G~~t~NE~R~~- 371 (412) T protein:vir:26 312 VKQ----YEEEFNRK-----LLTKTDRE---KNRYFKFNVK----SYLRAD-SATQAEVYFKAVRS--GYYTINDIREW- 371 (412) T ss_pred HHH----HHHHHHhh-----cCCccccc---CcceEEeech----hhhccC-HHHHHHHHHHHHhC--CCcCHHHHHHH- Confidence 222 23333333 34445543 2233445432 222222 34555555544322 24455555432 Q ss_pred hCCCHHHHHHHHHHHHHHHhcCCCCCCcchhhhcCCCCCcccccccCCCCC--cCCCCCCCCCccC Q lcl|NC_015285. 294 LKQTEIEIKEIDEQIASEMEAGIIADPMAEMDPAMAAGGEGAPAAEVDPNA--QESSVDPGDVRRG 357 (359) Q Consensus 294 L~~tDeeI~e~~kqi~~E~~~~~~~~P~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~p~~~~~~ 357 (359) +++.+-+ --++ .+-+.+ -.+.. .+.. ..+..--+|.++| T Consensus 372 ~gl~p~~--ggD~---------~~~~~n------------~~~~~--~~~~~~~~~~gG~~n~~e~ 412 (412) T protein:vir:26 372 EDLPPVE--GGDK---------PLISGD------------LYPID--TPLELRKSLKGGDKNVNES 412 (412) T ss_pred hCCCCCC--CcCe---------eeeccc------------ccccc--cchhhcccccCCCCCcCCC Confidence 4443211 0000 000000 00000 0000 0011111233333 No 83 >protein:vir:389 Length: 530 # NCBI annotation: gp4 # Family: family:all:47 # MgeID: mge:325 # MgeName: N15 # Cross-refs: genbank:acc:NP_046899;genbank:gi:9630468;genbank:GeneID:1261643 Probab=73.97 E-value=0.16 Score=24.89 Aligned_cols=311 Identities=11% Similarity=0.088 Sum_probs=146.1 Q ss_pred CCCchhhHHHhhhh--------hhheeec--cccccccCC----CceeecHhHhhhhhcccccCCCCcchhhHHHHHHHH Q lcl|NC_015285. 1 MRGVDLNQQLTQKA--------AEYFLYN--PKGLKNSTN----QGMKITTDSVTYCHSGIQDLNKNMTLSHLHKAIKAV 66 (359) Q Consensus 1 ~~~~~~~~~~~~~~--------~e~f~yn--~~~~~~~~~----~~v~i~~~ai~y~hSGl~d~~~~~i~syL~~Aik~~ 66 (359) -.+...|..|+.|| .-|+++. |.+...... ..++++.+-|..+..-.-. .---=+|.|..+++.+ T Consensus 177 ~~~~~~~~~i~~GIe~d~~Gr~~aY~i~~~~~~~~~~~~~~~~~~~~~v~a~~vlH~f~~~r~-gQ~RGis~lapvl~~l 255 (530) T protein:vir:38 177 PNNIGDTRNCRAGVKINDSGAALGYYVSDDGYPGWMAQNWTYIPRELPGGRPSFIHVFEPMED-GQTRGANAFYSVMEQM 255 (530) T ss_pred CCCCCCCCeeEeeeEECCCCceEEEEEeeccCCCccccccceeeeeeccChhHeEeeccccCC-CcccCCchHHHHHHHH Confidence 11112233344444 3566663 333222111 1255666677776665422 2222379999999999 Q ss_pred HHHHHHHHHHHHHHHhcCccceeEeccCCCCchHHH-------------HHHHHHHHHhhcceEEeeCCCCcccccccch Q lcl|NC_015285. 67 NQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKA-------------EQYLREVMGRYRNKMVYDANTGEIKDDKKFM 133 (359) Q Consensus 67 NqL~m~EDalVIyR~~RAPeRRvFyIDvGnlpk~KA-------------eqYl~~iM~kyrnklvYD~~TGevkdd~~~m 133 (359) ++|.-.+||...-...-|-.=-++.=+.+......+ ..+... ...+.+.-+.+-..|.|. T Consensus 256 ~~l~~y~dael~~a~i~A~~a~fi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~l~pG~i~------ 328 (530) T protein:vir:38 256 KMLDTLQNTQLQSAIVKAMYAATIESELDTQSAMDFILGADNKEQQSKLTGWLGE-MAAYYSAAPVRLGGARVP------ 328 (530) T ss_pred HHHhHHHHHHHHHHHHhhhheeeeeccCCccccccccccCCcccccccccccchh-hhhcccccceeccCceee------ Confidence 999999999998888777653333322222110000 000000 111111111112222111 Q ss_pred hhHhhhcccccCCCCccceeecCCCCCcch-HHHHHHHHHHHHHhcCCCccccCCC-CcccccchhhhhHHhhhHHHHHH Q lcl|NC_015285. 134 SMMEDFWLPRREGGRGTEISTLPGGQNLGE-LEDVKYFQKKLYKALNVPSSRLETE-TTFNIGRAAEITRDEVKFQKFIA 211 (359) Q Consensus 134 SMlEDywLpRReGgrgTEIsTLpGgqnLge-i~DV~YF~kkLy~aL~VP~SRl~~~-~~~~~g~~~eItRDElKF~KFI~ 211 (359) .-.-|-+|+.+..+.--+. -+=++...+.+=.+|+||-+-|..+ ++.|+ |.+.-.-+.|-+.+. T Consensus 329 -----------~L~pGe~i~~~~p~~p~~~~~~f~~~~lr~iaaglGi~ye~lt~D~s~~nY---SS~R~~~~e~~r~~~ 394 (530) T protein:vir:38 329 -----------HLLPGDSLNLQSAQDTDNGYSTFEQSLLRYIAAGLGVSYEQLSRNYSQMSY---STARASANESWAYFM 394 (530) T ss_pred -----------ecCCCCeeeeeCCCCCCCCHHHHHHHHHHHHHhhcCCCHHHHhcccccccH---HHHHHHHHHHHHHHH Confidence 0012444555544432233 2344555566667899999988665 45555 222334556999999 Q ss_pred HHHHHHHHH-----HHHHHHHHHHhcCCCCh------hHHHHHhhceeeee--eccchHHHHHHHHHHHHHHHHHHHhhh Q lcl|NC_015285. 212 RLRKRFSEL-----FTDLLKTQLILKGVMSL------EEWEDMKNHIQFDF--IADNYFTELKEIEIRNERMNQVAAMDP 278 (359) Q Consensus 212 rLr~rFs~i-----f~d~Lk~QLiLkgI~t~------eew~~~~~~I~~~f--~~Dn~f~ElKe~Eil~~Rl~~~~~~dp 278 (359) ++|..|..= |..-|+ ..++.|.++. +.|..........| -.--+.-.+||+.-...+++. T Consensus 395 ~~q~~~~~~~~~pi~~~wl~-~av~~G~i~~p~~~~~~~~~~~~a~~~~~w~~p~~~~iDP~Ke~~a~~~~i~~------ 467 (530) T protein:vir:38 395 GRRKFVASRQACQMFLCWLE-EAIVRRVVTLPSKARFSFQEARTAWGNANWIGSGRMAIDGLKEVQEAVMLIEA------ 467 (530) T ss_pred HHHHHHHHHHhhHHHHHHHH-HHHHcCCccCCCCCCCCchhhHHhhhceeeecCCccccChHHHHHHHHHHHHc------ Confidence 999988653 344444 4578887763 33333222322333 333445567777666555542 Q ss_pred hcchhhhHHHHHHHHhCCCHHHHHHH-HHHHHHHHhcCCCCCCcchhhhcCCCCCcccccccCCCCCcCCCCCCCCCccC Q lcl|NC_015285. 279 YVGKYFSVDYMRRQVLKQTEIEIKEI-DEQIASEMEAGIIADPMAEMDPAMAAGGEGAPAAEVDPNAQESSVDPGDVRRG 357 (359) Q Consensus 279 ~vGKy~S~~~i~k~IL~~tDeeI~e~-~kqi~~E~~~~~~~~P~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~ 357 (359) -+-|.+-+..+ .+..-+|+-++ +...+...+.|+ +.|-. ++....++..+.+.+|.++-.| T Consensus 468 ---G~~s~~~~~a~-~G~D~~~v~~q~a~e~~~~~~~Gl-~~~~~-------------~~~~~~~~~~~~~~~~~d~~~~ 529 (530) T protein:vir:38 468 ---GLSTYEKECAK-RGDDYQEIFAQQVRESMERRAAGL-NPPAW-------------AAAAFEAGVKKSNEEEQDGARA 529 (530) T ss_pred ---CCCCHHHHHHH-cCCCHHHHHHHHHHHHHHHHHcCC-CCCCC-------------cccccCCCCCCCCCCCCCCCCC Confidence 33466666655 45554444322 222222222332 21110 0000011111222222222222 Q ss_pred C Q lcl|NC_015285. 358 E 358 (359) Q Consensus 358 ~ 358 (359) - T Consensus 530 a 530 (530) T protein:vir:38 530 A 530 (530) T ss_pred C Confidence 2 No 84 >protein:vir:101647 Length: 460 # NCBI annotation: phage portal protein # Family: family:all:26542 # MgeID: mge:1646 # MgeName: 11b # Cross-refs: genbank:acc:YP_112492;genbank:gi:53793592;uniprot:Q5ZGG1;genbank:GeneID:3101755 Probab=73.84 E-value=0.17 Score=24.87 Aligned_cols=284 Identities=13% Similarity=0.083 Sum_probs=116.7 Q ss_pred CCCc--------hhhHHHhhhhhhheeeccc--------------------cccc-cCCCceeecHhHhhhh--hccccc Q lcl|NC_015285. 1 MRGV--------DLNQQLTQKAAEYFLYNPK--------------------GLKN-STNQGMKITTDSVTYC--HSGIQD 49 (359) Q Consensus 1 ~~~~--------~~~~~~~~~~~e~f~yn~~--------------------~~~~-~~~~~v~i~~~ai~y~--hSGl~d 49 (359) +.++ ..|.. -..+.+.+..+|. ++.. .++..+.++.+-|.++ ++-..+ T Consensus 140 l~Gnay~~i~r~~~~~~-~G~~~~L~~l~~~~v~v~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~evih~r~~~~~~~ 218 (460) T protein:vir:10 140 LNGNCYFYLMSPDDGIN-AGVPSQMYVLPAHLIKIVLKDDINLLSTDSPIKSYMLIQGDQFIEFNEDEVIHTKYANPNFD 218 (460) T ss_pred hcCCeEEEEEecCCCcc-CceeEEEEEEcCceEEEEEcCCCceeeeeeeeeEEEEecCceeEEecccceEEEecCCCCcc Confidence 1111 00000 0011222222221 1111 1233467777777664 333333 Q ss_pred CCCCcc--hhhHHHHHHHHHHHHHHHHHHHHHHHhcCccceeEeccCCCCchHHHHHHHHHHHHhhcceEEeeCCCCccc Q lcl|NC_015285. 50 LNKNMT--LSHLHKAIKAVNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKMVYDANTGEIK 127 (359) Q Consensus 50 ~~~~~i--~syL~~Aik~~NqL~m~EDalVIyR~~RAPeRRvFyIDvGnlpk~KAeqYl~~iM~kyrnklvYD~~TGevk 127 (359) .+++.+ +|.+..|.+.+.....+++...-+--.-++-. ..+..-+.|.+..+++..+.+...|+..- ..|.+ T Consensus 219 ~~~~~~~G~sp~~~~~~~i~~~~~~~~~~~~~f~ng~~~~-~i~~~~~~l~~e~~~~~~~~~~~~~~g~~----n~g~~- 292 (460) T protein:vir:10 219 LQGSHLYGMSPIRAILRNINSQNSTIDNNVKTMQNGGVFG-FIHGGSTGLTQPQADSLKQRLTEMDKSPD----RLSQI- 292 (460) T ss_pred cccCccccccHHHHHHHHHHHHHHHHHHHHHHHhcCCCcc-eeeecCCCCCHHHHHHHHHHHHHHhcCcc----ccCCc- Confidence 333323 46788888888877777877666555556554 45666677777777666666555554210 11222 Q ss_pred ccccchhhHhhhcccccCCCCccceeecCCC-CCcchHHHHHHHHHHHHHhcCCCccccCCC-C-cccccchhhhhHHhh Q lcl|NC_015285. 128 DDKKFMSMMEDFWLPRREGGRGTEISTLPGG-QNLGELEDVKYFQKKLYKALNVPSSRLETE-T-TFNIGRAAEITRDEV 204 (359) Q Consensus 128 dd~~~mSMlEDywLpRReGgrgTEIsTLpGg-qnLgei~DV~YF~kkLy~aL~VP~SRl~~~-~-~~~~g~~~eItRDEl 204 (359) + .++ .|.+++.|.-. ..+.-++-.+|..+.+.++++||.+.|+.. + +.+.....+..+. T Consensus 293 -----~-vl~----------~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~sn~e~~~~~-- 354 (460) T protein:vir:10 293 -----A-GAS----------GEIAFTKISLNTDELKPFDYLKYDQKAICNALGWSDKLLNNNEGGGLNTGNLEEERKR-- 354 (460) T ss_pred -----e-ecC----------CCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCCccccHHHHHHH-- Confidence 1 121 24566666332 222234556688899999999999998643 2 2233333333333 Q ss_pred hHHHH-HHHHHHHHHHHHHHHHHHHHHhcCCCChhHHHHHhhceeeeeeccchHHHHHHHHHHHHHHHHHHHhhhhcchh Q lcl|NC_015285. 205 KFQKF-IARLRKRFSELFTDLLKTQLILKGVMSLEEWEDMKNHIQFDFIADNYFTELKEIEIRNERMNQVAAMDPYVGKY 283 (359) Q Consensus 205 KF~KF-I~rLr~rFs~if~d~Lk~QLiLkgI~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~~~~~dp~vGKy 283 (359) |..+ |.-+-.++...|. ..| +++.+.. ....|.++|. ++ ..++.+.....++ T Consensus 355 -f~~~~l~P~~~~ie~~ln----~kl-----~~~~~~~-~~~~i~~d~~------~l---~~l~~d~~~~~~~------- 407 (460) T protein:vir:10 355 -VVTDNIQPDLVILKQAFD----KKF-----IKRFKGY-ENAVIEWDIS------EL---PEMQTDMVAMASW------- 407 (460) T ss_pred -HHHHHHHHHHHHHHHHHH----Hhh-----cCccccc-CCceEEeecc------hh---hhHHHHHHHHHHH------- Confidence 5554 3333333333333 232 2322211 1233444433 22 1122222222211 Q ss_pred hhHHHHHHHHhCCCHHHHHHHHHHHHHHHhcCCCCCCcchhhhcCCCCCcccccccCCCCCcCCCCCCCCCcc Q lcl|NC_015285. 284 FSVDYMRRQVLKQTEIEIKEIDEQIASEMEAGIIADPMAEMDPAMAAGGEGAPAAEVDPNAQESSVDPGDVRR 356 (359) Q Consensus 284 ~S~~~i~k~IL~~tDeeI~e~~kqi~~E~~~~~~~~P~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~ 356 (359) + ..-+ ||..|+-+.. .-+.+++|.... - -.+..-++... ..-+....+.|..+ T Consensus 408 ~-----~~g~--~T~NE~R~~~-------g~~pi~~~~gD~--~-~~~~n~~~~~~---~~~~~~~~~~nq~~ 460 (460) T protein:vir:10 408 L-----NTIP--VTPNEIRIAM-------KYETLNQDGMDI--V-FMPSNKVRIDD---VSNNLIDSAFNQNQ 460 (460) T ss_pred H-----hCCC--CCHHHHHHHh-------CCCCCCCCCCCe--e-eecccccchhh---cccccCCCcccCCC Confidence 0 1112 5655553211 122222221100 0 00000000000 00011112222222 No 85 >protein:vir:94805 Length: 492 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1531 # MgeName: 29 # Cross-refs: genbank:acc:YP_240531;genbank:gi:66396197;genbank:GeneID:5133585 Probab=70.99 E-value=0.2 Score=24.40 Aligned_cols=283 Identities=11% Similarity=0.089 Sum_probs=107.7 Q ss_pred CCCchhhHHHhhhh-hhheeeccccccccC---CCceeecHhHhhhhhc-cccc---C-CCCcchhhHHHHHHHHHHHH- Q lcl|NC_015285. 1 MRGVDLNQQLTQKA-AEYFLYNPKGLKNST---NQGMKITTDSVTYCHS-GIQD---L-NKNMTLSHLHKAIKAVNQLR- 70 (359) Q Consensus 1 ~~~~~~~~~~~~~~-~e~f~yn~~~~~~~~---~~~v~i~~~ai~y~hS-Gl~d---~-~~~~i~syL~~Aik~~NqL~- 70 (359) -.+..+.+++++.. ..+|.+...+..... .+...+.. ..|. |-++ . ++..-.|=++..+....-+. T Consensus 199 ~~~~~~~~~~y~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~g~vPvv~~~nn~~~~sd~e~v~~liDa~d~ 274 (492) T protein:vir:94 199 KLENETKVEYWDKVTVNYYVYENGSLIPDYSNNLENSKTHF----STGSWGKIPFIPFKNNDLEISDIFMYKTLIDAYNR 274 (492) T ss_pred eeccceeEEEEecCeEEEEEEecCeeeeccccccccccccc----cccCCCccceEEecCCCCCCCchHHHHHHHHHHHH Confidence 11111122222111 112222221111100 01111111 1122 2111 1 12222344444433333332 Q ss_pred HHHHHHHHHHHhcCccceeEeccCCCCchHHHHHHHHHHHHhhcceEEeeCCCCcccccccchhhHhhhcccccCCCCcc Q lcl|NC_015285. 71 MIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKMVYDANTGEIKDDKKFMSMMEDFWLPRREGGRGT 150 (359) Q Consensus 71 m~EDalVIyR~~RAPeRRvFyIDvGnlpk~KAeqYl~~iM~kyrnklvYD~~TGevkdd~~~mSMlEDywLpRReGgrgT 150 (359) ++=+....-+.+..|-+-+.-.+..+.+. +...+..++..--++ +. T Consensus 275 ~~S~~~~~~~~~~~p~lv~~g~~~~~~~~--------------------------------~~~~~~~~~~~~~~~--~~ 320 (492) T protein:vir:94 275 RLSDLSNTFKDSNELTYVLKNYDDQELPE--------------------------------FKRLLRYYGAIKVSD--NG 320 (492) T ss_pred HHHHHHHHHHHhcCceeeeecCCcccchh--------------------------------hHHHHhhccceecCC--CC Confidence 23344444556666654443332222111 111111222211122 12 Q ss_pred ceeecCCCCCcchH-HHHHHHHHHHHHhcCCCccccCCCCcccccchhhhhHHhhhHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_015285. 151 EISTLPGGQNLGEL-EDVKYFQKKLYKALNVPSSRLETETTFNIGRAAEITRDEVKFQKFIARLRKRFSELFTDLLKTQL 229 (359) Q Consensus 151 EIsTLpGgqnLgei-~DV~YF~kkLy~aL~VP~SRl~~~~~~~~g~~~eItRDElKF~KFI~rLr~rFs~if~d~Lk~QL 229 (359) .+++|-...+.+.+ .-+.-+.+.+|+-.++|---.+.-++ +. .|..|..-+.....-+.+.++.|..-+.++++.=+ T Consensus 321 ~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~-n~-Sg~Al~~~~~~l~~k~~~k~~~f~~~l~~~~~li~ 398 (492) T protein:vir:94 321 GVDTIQVEVPVENSKKYLDELYQKIMLFGQAVDFSSDKFGS-AP-SGVALEFLYTNLNLKADKLARKAKVAIQELLWFVF 398 (492) T ss_pred cceeEeccCCHHHHHHHHHHHHHHHHHHhCCcCCCcccccc-Cc-hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 35555443333332 33466777888888988422211111 11 12224333444555567777777777777766533 Q ss_pred HhcCCCChhHHHHHhhceeeeeeccchHHHHHHHHHHHHHHHHHHHhhhhcchhhhHHHHHHHHhCCCHHHHHHHHHHHH Q lcl|NC_015285. 230 ILKGVMSLEEWEDMKNHIQFDFIADNYFTELKEIEIRNERMNQVAAMDPYVGKYFSVDYMRRQVLKQTEIEIKEIDEQIA 309 (359) Q Consensus 230 iLkgI~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~~~~~dp~vGKy~S~~~i~k~IL~~tDeeI~e~~kqi~ 309 (359) -+-|+ ..+|. .|.+.|....--.+ .+.++++..+. | .+|.+++++. |...+ +.+++.++|+ T Consensus 399 ~~~~~--~~~~~----~i~v~f~~~~p~~~-------~e~~~~~~kl~---g-iiS~et~~~~-l~~v~-d~~~E~eri~ 459 (492) T protein:vir:94 399 EHFDI--KGEHK----DVDISFNYNKVANT-------ELQVQTAQQSM---G-IVSHETVLEN-HPFVE-DLQAELERIE 459 (492) T ss_pred HHhcC--Ccccc----eeeEEecCCCCCCH-------HHHHHHHHHHh---c-cCchHHHHHh-CCCCC-CHHHHHHHHH Confidence 33343 33444 46677754433333 23345555554 4 3799999986 55543 2334444455 Q ss_pred HHHhcCCCCCCcchhhhcCCCCCcccccccCCCCCcCCCCCCCCCccCC Q lcl|NC_015285. 310 SEMEAGIIADPMAEMDPAMAAGGEGAPAAEVDPNAQESSVDPGDVRRGE 358 (359) Q Consensus 310 ~E~~~~~~~~P~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~ 358 (359) +|..+..=..|+. +..++.....+-++ +.+.+| T Consensus 460 ~E~~~~~~~~~~~------~~~~~~~~~~~~~~----------~~~e~e 492 (492) T protein:vir:94 460 QEQMEYNKQLPNL------DDGGADSAQQQERS----------NNKESE 492 (492) T ss_pred HHHHHHHhhcccc------ccccCCCCccccCC----------ccccCC Confidence 5533211111110 00000000000011 111222 No 86 >protein:vir:9306 Length: 511 # NCBI annotation: phi Mu50B-like protein # Family: family:all:125 # MgeID: mge:165 # MgeName: phi 11 # Cross-refs: genbank:acc:NP_803284;genbank:gi:29028594;genbank:GeneID:1258040 Probab=70.91 E-value=0.2 Score=24.38 Aligned_cols=304 Identities=11% Similarity=0.044 Sum_probs=100.6 Q ss_pred CCCchhhHHHhhhh---------------hhhe-eeccccccc---cCCCceeecHhHh-hhhhc-ccccC----CCCcc Q lcl|NC_015285. 1 MRGVDLNQQLTQKA---------------AEYF-LYNPKGLKN---STNQGMKITTDSV-TYCHS-GIQDL----NKNMT 55 (359) Q Consensus 1 ~~~~~~~~~~~~~~---------------~e~f-~yn~~~~~~---~~~~~v~i~~~ai-~y~hS-Gl~d~----~~~~i 55 (359) +-++....+++-.+ ..++ +|++..... ..+.+......-+ ...|. |-++. ++..- T Consensus 175 vydd~~~~~~~~~vr~~~~~~~~~~~~~~~~~~~iyt~~~i~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~nn~~g 254 (511) T protein:vir:93 175 IYDNTIERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSFERMPITEFSNNERR 254 (511) T ss_pred EEcCCCCCceEEEEEEEEeeeccccccceEEEEEEEeCCcEEEEEecCCCccccccccccccccCCCccceEEecCCCCC Confidence 11111000111100 0111 344433211 1111111100000 00111 11111 11122 Q ss_pred hhhHHHHHHHHHHHHH-HHHHHHHHHHhcCccceeEeccCCCCchHHHHHHHHHHHHhhcceEEeeCCCCccc-ccccch Q lcl|NC_015285. 56 LSHLHKAIKAVNQLRM-IEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKMVYDANTGEIK-DDKKFM 133 (359) Q Consensus 56 ~syL~~Aik~~NqL~m-~EDalVIyR~~RAPeRRvFyIDvGnlpk~KAeqYl~~iM~kyrnklvYD~~TGevk-dd~~~m 133 (359) .|-++..+.....+.. +=+....-+-++.|-+-+.=..... .++++ +....+ T Consensus 255 ~gd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~G~~~~~--------------------------~~~~~~~~~~~~ 308 (511) T protein:vir:93 255 KGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLNLD--------------------------PVEVRKQKEANV 308 (511) T ss_pred CCchhhHHHHHHHHHHHHHHHHHHHHHhhCcceeeecCcccC--------------------------chhhcccccccc Confidence 3445554444443332 2222223344444444333111000 01111 111111 Q ss_pred hhHhh-hc--ccccCCCCccceeecCCCCCcch-HHHHHHHHHHHHHhcCCCccccCCCCcccccchh--hhhHHhhhHH Q lcl|NC_015285. 134 SMMED-FW--LPRREGGRGTEISTLPGGQNLGE-LEDVKYFQKKLYKALNVPSSRLETETTFNIGRAA--EITRDEVKFQ 207 (359) Q Consensus 134 SMlED-yw--LpRReGgrgTEIsTLpGgqnLge-i~DV~YF~kkLy~aL~VP~SRl~~~~~~~~g~~~--eItRDElKF~ 207 (359) -.+.. .| ...-....|..+..|-...+... -.-+.-..+.+|+-.++|---.+. |. |..| .|..-...-. T Consensus 309 ~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~L~~~I~~~s~~P~~~~~~---~~-~n~Sg~Al~~~~~~l~ 384 (511) T protein:vir:93 309 LFLEPTVYADSEGRETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDN---FS-GTQSGEAMKYKLFGLE 384 (511) T ss_pred eecccccccccccccCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhCCccccccc---cc-ccchHHHHHHHHHHHH Confidence 11111 11 11112223344555544333322 223344567777888888532221 21 2222 2322222233 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhcCCCChhHHHHHhhceeeeeeccchHHHHHHHHHHHHHHHHHHHhhhhcchhhhHH Q lcl|NC_015285. 208 KFIARLRKRFSELFTDLLKTQLILKGVMSLEEWEDMKNHIQFDFIADNYFTELKEIEIRNERMNQVAAMDPYVGKYFSVD 287 (359) Q Consensus 208 KFI~rLr~rFs~if~d~Lk~QLiLkgI~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~~~~~dp~vGKy~S~~ 287 (359) .-+.+.+..|..-+.+.++.=+-+-++....+++.-...+.+.|...---.+ .+.++++..+. | .+|.+ T Consensus 385 ~k~~~k~~~f~~~l~~~~~li~~~l~~~~~~~~~~d~~~i~~~f~~~~p~n~-------~e~~~~~~kl~---g-~iS~e 453 (511) T protein:vir:93 385 QRTKTKEGLFTKGLRRRAKLLETILKNTWSIDANKDFNTVRYVYNRNLPKSL-------IEELKAYIDSG---G-KISQT 453 (511) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhccCcccccccccceEEeCCCCCCCH-------HHHHHHHHHHh---c-cCchH Confidence 4455556666665555554422221222222222223457777864333223 23445555553 3 37999 Q ss_pred HHHHHHhCCCHHHHHHHHHHHHHHHhcCCCCCCcchhhhcCCCCCcccccccCCCCCcCCCCCCCCCcc Q lcl|NC_015285. 288 YMRRQVLKQTEIEIKEIDEQIASEMEAGIIADPMAEMDPAMAAGGEGAPAAEVDPNAQESSVDPGDVRR 356 (359) Q Consensus 288 ~i~k~IL~~tDeeI~e~~kqi~~E~~~~~~~~P~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~ 356 (359) +++.. |...++ .+++.++|++|..+. .+ ..++..++.+. +....+-+ ..+....+-.+ T Consensus 454 t~~~~-l~~v~d-~~~E~~ri~~E~~~~-~~---~~~~~~~~~~~-~~~~~~~~----~~~~~~~~~~~ 511 (511) T protein:vir:93 454 TLMSL-FSFFQD-PELEVKKIEEDEKES-IK---KAQKGIYKDPR-DINDDEQD----DDTKDTVDKKE 511 (511) T ss_pred HHHHh-CCCCCC-HHHHHHHHHHHHHHH-HH---HHhhhcccCCC-CCCCCCCC----CcccccccccC Confidence 99977 555431 233344455554321 10 00111111111 11111001 11111111111 No 87 >protein:vir:1266 Length: 416 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:329 # MgeName: phi-105 # Cross-refs: genbank:acc:NP_690758;genbank:gi:22854998;genbank:GeneID:955213 Probab=69.93 E-value=0.22 Score=24.23 Aligned_cols=279 Identities=12% Similarity=0.101 Sum_probs=121.6 Q ss_pred CCCc--------hhhHHHhhhhhhheeeccccc--------------cccCCCceeecHhHhhhhhcccccCCCCcchhh Q lcl|NC_015285. 1 MRGV--------DLNQQLTQKAAEYFLYNPKGL--------------KNSTNQGMKITTDSVTYCHSGIQDLNKNMTLSH 58 (359) Q Consensus 1 ~~~~--------~~~~~~~~~~~e~f~yn~~~~--------------~~~~~~~v~i~~~ai~y~hSGl~d~~~~~i~sy 58 (359) +.++ ..| ...+.+..+|... ....+..+.++.+-|.+.. ..+.++-.=+|. T Consensus 111 l~Gna~~~i~r~~~G-----~~~~L~~l~~~~v~v~~~~~~~~~~~~~~~~g~~~~~~~~eiih~~--~~~~~~~~G~s~ 183 (416) T protein:vir:12 111 TWGNAYSYIQFGSHG-----YPEALFPLRPDYTNAYVHPTTGMLWYQTVLNGKAIELYDYEVLHFK--GLSTDGIHGKSP 183 (416) T ss_pred hcCCeEEEEEECCCC-----cEEEEEEECCcceEEEEeCCCcEEEEEEecCCeEEEecCccEEEec--CcCCCCcccccH Confidence 1111 111 1112222222110 1112344677777776654 123343334678 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhcCccceeEeccCCCCchHHHHHHHHHHHHhhcceEEeeCCCCcccccccchhhHhh Q lcl|NC_015285. 59 LHKAIKAVNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKMVYDANTGEIKDDKKFMSMMED 138 (359) Q Consensus 59 L~~Aik~~NqL~m~EDalVIyR~~RAPeRRvFyIDvGnlpk~KAeqYl~~iM~kyrnklvYD~~TGevkdd~~~mSMlED 138 (359) ++.|.+++......++...=+=-..+.-+-|..++ +.+.+..+++. ++-.++.. +.|.+. .++ T Consensus 184 i~~~~~~i~~~~~~~~~~~~~~~ng~~p~~il~~~-~~~~~e~~~~~-~~~~~~~~-------~~~~~~-------vl~- 246 (416) T protein:vir:12 184 IGVVREHIGAQAAATKYNAKLYKNEATPRGILKVP-AFLDEKPKENV-RKEWKRVN-------KVENIA-------IID- 246 (416) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhcCCCCceEEecC-CCCCHHHHHHH-HHHHHHHh-------cCCCee-------ecC- Confidence 99999999888888887665545556667777776 35555444443 33333321 112211 121 Q ss_pred hcccccCCCCccceeecCC-CCCcchHHHHHHHHHHHHHhcCCCccccCCCCcccccchhhhhHHhhhHHHH-HHHHHHH Q lcl|NC_015285. 139 FWLPRREGGRGTEISTLPG-GQNLGELEDVKYFQKKLYKALNVPSSRLETETTFNIGRAAEITRDEVKFQKF-IARLRKR 216 (359) Q Consensus 139 ywLpRReGgrgTEIsTLpG-gqnLgei~DV~YF~kkLy~aL~VP~SRl~~~~~~~~g~~~eItRDElKF~KF-I~rLr~r 216 (359) .|++++.|.= ...+.-++-.+|....+.++++||.+-|.....-+.....+..+. |..+ |.-+-.. T Consensus 247 ---------~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~sn~e~~~~~---f~~~~l~P~~~~ 314 (416) T protein:vir:12 247 ---------YGLEYQSISMPLQEAQFVESMKFNKAQISMIYKVPLHKLNELDKATFSNIEHQSIE---YVRNTLQPWIVN 314 (416) T ss_pred ---------CCceEEEccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCccCCCcccHHHHHHH---HHHHHHHHHHHH Confidence 1445555532 122333455667788999999999999975443344344443333 5433 3333333 Q ss_pred HHHHHHHHHHHHHHhcCCCChhHHHHHhhceeeeeeccchHHHHHHHHHHHHHHHHHHHhhhhcchhhhHHHHHHHHhCC Q lcl|NC_015285. 217 FSELFTDLLKTQLILKGVMSLEEWEDMKNHIQFDFIADNYFTELKEIEIRNERMNQVAAMDPYVGKYFSVDYMRRQVLKQ 296 (359) Q Consensus 217 Fs~if~d~Lk~QLiLkgI~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~~~~~dp~vGKy~S~~~i~k~IL~~ 296 (359) +...|. .. ++++.++. ....+.|.-+ .+...+ ...|.+.+..+-.- -+++.+-++.. +++ T Consensus 315 ie~~l~----~~-----l~~~~~~~---~g~~i~fd~~----~l~~~d-~~~~~~~~~~~~~~--G~~T~NE~R~~-~gl 374 (416) T protein:vir:12 315 FEQELN----VK-----LFLDHDQK---SGHYVKFNID----SELRGD-SKTQAEYLKTLHET--GVLNKDEIREL-LER 374 (416) T ss_pred HHHHHH----Hh-----hcCchhhc---CCceEEeech----hhhccC-HHHHHHHHHHHHhC--CCcCHHHHHHH-hCC Confidence 333332 22 23333322 2233444433 222222 24556666555333 35677766653 666 Q ss_pred CHHHHHHHHHHHHHHHhcCCCCCCcchhhhcCC---CCCcccccc Q lcl|NC_015285. 297 TEIEIKEIDEQIASEMEAGIIADPMAEMDPAMA---AGGEGAPAA 338 (359) Q Consensus 297 tDeeI~e~~kqi~~E~~~~~~~~P~~~~~~~~~---~~~~~~~~~ 338 (359) .+-+ --++-+..- ..-.+...++......| .+|+..+.| T Consensus 375 ~Pi~--ggd~~~~~~-n~~~~~~~~~~~~~~~~~~~~gge~~~~g 416 (416) T protein:vir:12 375 NPIE--NGDKYISSL-NYVFLDFLEEYQRLKAGGAMKGGDNKNEG 416 (416) T ss_pred CCCC--Ccceeeecc-ccccccccchhhccccccccCCCCCcCCC Confidence 5421 000000000 00000000000000000 011111111 No 88 >protein:vir:105292 Length: 478 # NCBI annotation: putative phage portal protein # Family: family:all:125 # MgeID: mge:1679 # MgeName: PH15 # Cross-refs: genbank:acc:YP_950666;genbank:gi:119967836;genbank:GeneID:4643171 Probab=69.76 E-value=0.22 Score=24.21 Aligned_cols=286 Identities=11% Similarity=0.039 Sum_probs=105.4 Q ss_pred CCCchhh-----HHH--hhhhhhheeeccccccc-c------------CCCceeecHhHhhhhhc-ccccC----CCCcc Q lcl|NC_015285. 1 MRGVDLN-----QQL--TQKAAEYFLYNPKGLKN-S------------TNQGMKITTDSVTYCHS-GIQDL----NKNMT 55 (359) Q Consensus 1 ~~~~~~~-----~~~--~~~~~e~f~yn~~~~~~-~------------~~~~v~i~~~ai~y~hS-Gl~d~----~~~~i 55 (359) -.++..+ +.. .++...+-+|++..+.. . ...++.--.......|- |-++. ++..- T Consensus 165 ~d~~~~~~~~~~v~~~~~~~~~~~~~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~n~~~g 244 (478) T protein:vir:10 165 WTNKERDELQAFIRVYELDGAERVEYWTKDDVTYYELKEGQLIPDFYRSDDHIQPHYYQGNKLMSWGRVPFIPFKNNPQE 244 (478) T ss_pred EcCCCCCceEEEEEEEEecCceEEEEEeCCeEEEEEEcCCeeeccccccccccccceecccccccCCccceEEeccCCCC Confidence 0000000 000 01111122333321100 0 00000000001111222 21111 12222 Q ss_pred hhhHHHHHHHHHHHH-HHHHHHHHHHHhcCccceeEeccCCCCchHHHHHHHHHHHHhhcceEEeeCCCCcccccccchh Q lcl|NC_015285. 56 LSHLHKAIKAVNQLR-MIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKMVYDANTGEIKDDKKFMS 134 (359) Q Consensus 56 ~syL~~Aik~~NqL~-m~EDalVIyR~~RAPeRRvFyIDvGnlpk~KAeqYl~~iM~kyrnklvYD~~TGevkdd~~~mS 134 (359) .|=++..+....-+. ++-+....-+.++.|-+-+.-.+..+.. +....+. T Consensus 245 ~sd~~~v~~liDa~~~~~S~~~~~~~~~~~p~~~~~g~~~~~~~-----------------------------~~~~~~~ 295 (478) T protein:vir:10 245 VSDLFMYKTIIDALDKRLSDTQNTFDESVELIYILKGYEGEDMK-----------------------------DFMHNLK 295 (478) T ss_pred CCcHHHHHHHHHHHHHHHHHHHHHHHHhhCceeeeecCCccccc-----------------------------hhhhhhh Confidence 233333333332222 3444444555666665443322221110 0000111 Q ss_pred hHhhhcccccCCCCccceeecCCCCCcchH-HHHHHHHHHHHHhcCCCccccCCCCcccccchhhhhHHhhhHHHHHHHH Q lcl|NC_015285. 135 MMEDFWLPRREGGRGTEISTLPGGQNLGEL-EDVKYFQKKLYKALNVPSSRLETETTFNIGRAAEITRDEVKFQKFIARL 213 (359) Q Consensus 135 MlEDywLpRReGgrgTEIsTLpGgqnLgei-~DV~YF~kkLy~aL~VP~SRl~~~~~~~~g~~~eItRDElKF~KFI~rL 213 (359) ...=++++.-+|+ ++.+|-...+...+ .-+.-+.+.+|+-.++|---.+.-++ |. .|..|..-...-..-+.+. T Consensus 296 ~~~~~~~~~~~~~---~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~-n~-Sg~Al~~~~~~l~~k~~~~ 370 (478) T protein:vir:10 296 YYKAISVAGESGS---GVDTIKVEVPIDSVKEYTKMLRDYIIEFGQGVDFQQDKFGN-SP-SGIALKFMYSNLDLKANKL 370 (478) T ss_pred hcceEEecCCCCC---cceEEeecCChHHHHHHHHHHHHHHHHHhCccccCcccccc-cc-HHHHHHHHHHHHHHHHHHH Confidence 1111234433333 35555444444433 45667788899999988422211111 11 1122333222233345666 Q ss_pred HHHHHHHHHHHHHHHHHhcCCCChhHHHHHhhceeeeeeccchHHHHHHHHHHHHHHHHHHHhhhhcchhhhHHHHHHHH Q lcl|NC_015285. 214 RKRFSELFTDLLKTQLILKGVMSLEEWEDMKNHIQFDFIADNYFTELKEIEIRNERMNQVAAMDPYVGKYFSVDYMRRQV 293 (359) Q Consensus 214 r~rFs~if~d~Lk~QLiLkgI~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~~~~~dp~vGKy~S~~~i~k~I 293 (359) +..|...+..+++.=+-+.|+ ..+| ..|.+.|..-.--.+. +.+++++.+.+ .+|.+++++. T Consensus 371 ~~~~~~~l~~~~~li~~~~g~--~~~~----~~i~i~f~~~~p~d~~-------e~a~~~~kl~g----~iS~et~~~~- 432 (478) T protein:vir:10 371 KNKTLTALQELLQYIIDFYRL--DVKV----QDIEITFNFNVMVNEL-------ENSQIAMNSTG----LLSKETILSN- 432 (478) T ss_pred HHHHHHHHHHHHHHHHHHhCC--Cccc----ccceEEecCCCCCCHH-------HHHHHHHHHhC----CCChHHHHHh- Confidence 666766666666554444453 2233 3467777544333342 23445555543 3799999976 Q ss_pred hCCCHHHHHHHHHHHHHHHhcCCCCC-CcchhhhcCCCCCcccccccCCCCCcCCCCCCC Q lcl|NC_015285. 294 LKQTEIEIKEIDEQIASEMEAGIIAD-PMAEMDPAMAAGGEGAPAAEVDPNAQESSVDPG 352 (359) Q Consensus 294 L~~tDeeI~e~~kqi~~E~~~~~~~~-P~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~ 352 (359) |...++ .+++.++|++|..+- .+. ++. ..+.. ++.++ ....+.|+ T Consensus 433 l~~v~D-~~~E~~ri~~E~~~~-~~~~~~~----~~~~~------~~~~~--~~~~~~~~ 478 (478) T protein:vir:10 433 HAWVED-PVAEMERIEQENIEL-NQQLPDI----EEGLN------GEQQR--QSENNQPE 478 (478) T ss_pred CCCCCC-HHHHHHHHHHHHHHH-Hhhcccc----ccccC------CCCCC--CCCCCCCC Confidence 566432 334455555553220 100 000 00110 10111 11111111 No 89 >protein:vir:106639 Length: 481 # NCBI annotation: ORF003 # Family: family:all:125 # MgeID: mge:1557 # MgeName: 187 # Cross-refs: genbank:acc:YP_239490;genbank:gi:66395218;genbank:GeneID:4555793 Probab=69.24 E-value=0.23 Score=24.13 Aligned_cols=291 Identities=13% Similarity=0.090 Sum_probs=106.7 Q ss_pred CCCchhhHHHhhh-------------hhhheeeccccccc--cCCCcee-ecHhHhhhhhc-ccccC----CCCcchhhH Q lcl|NC_015285. 1 MRGVDLNQQLTQK-------------AAEYFLYNPKGLKN--STNQGMK-ITTDSVTYCHS-GIQDL----NKNMTLSHL 59 (359) Q Consensus 1 ~~~~~~~~~~~~~-------------~~e~f~yn~~~~~~--~~~~~v~-i~~~ai~y~hS-Gl~d~----~~~~i~syL 59 (359) +-++....+++-. +..+-+|.+...+. ..+++.. +.. +-|- |-++. ++..-.|-+ T Consensus 167 v~d~~~~~~~~~~i~~~~~~~~~~~~~~~~~~y~~~~i~~~~~~~~~~~~~~~----~~~~~g~vPvv~~~n~~~g~~~~ 242 (481) T protein:vir:10 167 VYDQTLDKKVVAGVRYFEKQDKDKVPVQHVEVYTTDKIYYIEIKGGTYHRVEE----VEHYYNDVPIIEYLNDQFKQGDF 242 (481) T ss_pred EEcCCCCCceEEEEEEEEEeeCCCceEEEEEEEecCeEEEEEecCCceeeccc----ccccCCceeEEEeecCCCCCCch Confidence 1111111111111 11122344333211 1111111 111 1111 11110 111112333 Q ss_pred HHHHHHHHHHH-HHHHHHHHHHHhcCccceeEeccCCCCchHHHHHHHHHHHHhhcceEEeeCCCCc-ccccccchhhHh Q lcl|NC_015285. 60 HKAIKAVNQLR-MIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKMVYDANTGE-IKDDKKFMSMME 137 (359) Q Consensus 60 ~~Aik~~NqL~-m~EDalVIyR~~RAPeRRvFyIDvGnlpk~KAeqYl~~iM~kyrnklvYD~~TGe-vkdd~~~mSMlE 137 (359) +..+....-+. ++-+....-+.++.|-+-+. |..+ .|.++|. ++.++....... T Consensus 243 ~~v~~lida~~~~~s~~~~~~~~~~~~~~~~~----g~~~--------------------~~~~~~~~~~~~~~~~~~~~ 298 (481) T protein:vir:10 243 ENVIALIDLYDSAQSDTANYMTDLNDAMLAII----GNVD--------------------LDSEDAKAFRDANMIHLEPG 298 (481) T ss_pred hhHHHHHHHHHHHHHHHHHHHHHhcCceeEee----cCcC--------------------CCccchhhhhhccceecccc Confidence 33222222221 22233333344455544332 1110 0111111 111111111111 Q ss_pred hhcccccCCCCccceeecCCCCCcchHH-HHHHHHHHHHHhcCCCccccCCCCcccccchhhhhHHhhhHHHHHHHHHHH Q lcl|NC_015285. 138 DFWLPRREGGRGTEISTLPGGQNLGELE-DVKYFQKKLYKALNVPSSRLETETTFNIGRAAEITRDEVKFQKFIARLRKR 216 (359) Q Consensus 138 DywLpRReGgrgTEIsTLpGgqnLgei~-DV~YF~kkLy~aL~VP~SRl~~~~~~~~g~~~eItRDElKF~KFI~rLr~r 216 (359) .-+.+ .+.+.++..|....+...+. -+.-.++.+|....+|---++.-++ |. .|..|.........-+.+.|.. T Consensus 299 ~~~~~---~~~~~~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~-n~-Sg~Al~~~~~~l~~k~~~~~~~ 373 (481) T protein:vir:10 299 TNANG---SEGKAEVKYVYKQYDVAGVEAYKKRLQNDIHKYTNTPDLNDEQFSG-VQ-SGESMKYKLFGLEQVRAIKERL 373 (481) T ss_pred ccccC---CCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhCCcccccccccc-cc-HHHHHHHHHHHHHHHHHHHHHH Confidence 11111 11223455554444443333 3566677788888988432322111 11 2223433334456668888888 Q ss_pred HHHHHHHHHHHHHHhcCCCChhHHHHHhhceeeeeeccchHHHHHHHHHHHHHHHHHHHhhhhcchhhhHHHHHHHHhCC Q lcl|NC_015285. 217 FSELFTDLLKTQLILKGVMSLEEWEDMKNHIQFDFIADNYFTELKEIEIRNERMNQVAAMDPYVGKYFSVDYMRRQVLKQ 296 (359) Q Consensus 217 Fs~if~d~Lk~QLiLkgI~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~~~~~dp~vGKy~S~~~i~k~IL~~ 296 (359) |...+.++++.=+-+-++--..+++ ...+.+.|.....-.+. +.++++.++. | .+|.+++++. |.. T Consensus 374 ~~~~l~~~~~li~~~~~~~~~~~~~--~~~i~v~f~~~~~~~~~-------~~a~~~~kl~---g-~is~et~~~~-l~~ 439 (481) T protein:vir:10 374 FKKGLMKRYKLLLNNVNLTGLKQHN--YAELTITFTPNLPKSMM-------ESINAFNALS---G-GVSESTRLSL-LDF 439 (481) T ss_pred HHHHHHHHHHHHHHHHhccCCCccc--cceeeEEeCCCCCcCHH-------HHHHHHHHHh---c-cCChHHHHHh-CCC Confidence 8888888776433332332222232 24678888766554553 3344455553 3 3799999977 555 Q ss_pred CHHHHHHHHHHHHHHHhcCCCCCCcchhhhcCCCCCcccccccCCCCCcCCCCCCCCC Q lcl|NC_015285. 297 TEIEIKEIDEQIASEMEAGIIADPMAEMDPAMAAGGEGAPAAEVDPNAQESSVDPGDV 354 (359) Q Consensus 297 tDeeI~e~~kqi~~E~~~~~~~~P~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~ 354 (359) .++ .+++.++|++|..+. .+.+++ .+.+.++.. ....+.+++ T Consensus 440 i~d-~~~E~~ri~~E~~~~---~~~~~~---~~~~~~~~~---------~~~~dd~~g 481 (481) T protein:vir:10 440 IDN-PKEELEKMQEEEAQR---EKQADK---RGYGEAFEN---------HLNVDDSNG 481 (481) T ss_pred CCC-HHHHHHHHHHHHHHH---Hhhhhh---ccCCccCCC---------CCCCCCCCC Confidence 432 222333344443221 011100 011000000 000011111 No 90 >protein:vir:3420 Length: 533 # NCBI annotation: capsid component # Family: family:all:47 # MgeID: mge:70 # MgeName: lambda # Cross-refs: genbank:acc:NP_040583;genbank:gi:9626247;genbank:GeneID:2703526 Probab=68.58 E-value=0.24 Score=24.03 Aligned_cols=315 Identities=10% Similarity=0.086 Sum_probs=146.3 Q ss_pred CC---CchhhHHHhhhh--------hhheeec--cccccccC----CCceeecHhHhhhhhcccccCCCCcchhhHHHHH Q lcl|NC_015285. 1 MR---GVDLNQQLTQKA--------AEYFLYN--PKGLKNST----NQGMKITTDSVTYCHSGIQDLNKNMTLSHLHKAI 63 (359) Q Consensus 1 ~~---~~~~~~~~~~~~--------~e~f~yn--~~~~~~~~----~~~v~i~~~ai~y~hSGl~d~~~~~i~syL~~Ai 63 (359) +. +...|..|+.|| .-|++|. |.+..... ...+.++.+-|..++...-..-.. =+|.|..++ T Consensus 177 l~~~~~~~~~~~i~~GIe~d~~Gr~~aY~i~~~~~~~~~~~~~~~~~~~~~v~a~~VlH~f~~~r~gQ~R-Gis~lapvl 255 (533) T protein:vir:34 177 ISNPNNTGDSRNCRAGVQINDSGAALGYYVSEDGYPGWMPQKWTWIPRELPGGRASFIHVFEPVEDGQTR-GANVFYSVM 255 (533) T ss_pred cCCCCCCCCCCceEeeeEECCCCCeEEEEEeecCCCCccccccceeeeeeccChhHeeeeccccCCCccc-CCchHHHHH Confidence 11 112233344444 3466663 33221111 113557777788777765332222 289999999 Q ss_pred HHHHHHHHHHHHHHHHHHhcCccceeEeccCCCCchHHHHHHHHHHHHhhcceEEeeCCCCccc-ccccchhhHhhhccc Q lcl|NC_015285. 64 KAVNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKMVYDANTGEIK-DDKKFMSMMEDFWLP 142 (359) Q Consensus 64 k~~NqL~m~EDalVIyR~~RAPeRRvFyIDvGnlpk~KAeqYl~~iM~kyrnklvYD~~TGevk-dd~~~mSMlEDywLp 142 (359) ..+.+|.-.+||...-...-|-.=-++.=|.+......+ + .....+.-. .-....+...++.-. T Consensus 256 ~~l~~l~~y~dael~~a~i~A~~a~fi~~~~~~~~~~~~---~------------~~~~~~~~~~~~~~~~~~~~~~~~~ 320 (533) T protein:vir:34 256 EQMKMLDTLQNTQLQSAIVKAMYAATIESELDTQSAMDF---I------------LGANSQEQRERLTGWIGEIAAYYAA 320 (533) T ss_pred HHHHHHHHHHHHHHHHHHHhhhheeeeecCCCccccccc---c------------cCCCcccccccccccchhhhhccCc Confidence 999999999999999888888764444333322111000 0 000000000 000011111111100 Q ss_pred c---cCCC------CccceeecCCCCCcch-HHHHHHHHHHHHHhcCCCccccCCC-CcccccchhhhhHHhhhHHHHHH Q lcl|NC_015285. 143 R---REGG------RGTEISTLPGGQNLGE-LEDVKYFQKKLYKALNVPSSRLETE-TTFNIGRAAEITRDEVKFQKFIA 211 (359) Q Consensus 143 R---ReGg------rgTEIsTLpGgqnLge-i~DV~YF~kkLy~aL~VP~SRl~~~-~~~~~g~~~eItRDElKF~KFI~ 211 (359) + =++| -|.+|+.+.....-+. -+-++-..+.+=.+|+||-+-|..+ ++.|. |.+.-.-+.|-+++. T Consensus 321 ~~~~l~pG~i~~L~pGe~i~~~~~~~p~~~~~~f~~~~lr~iAaglGi~ye~lt~D~s~~nY---SS~R~~~~e~~r~~~ 397 (533) T protein:vir:34 321 APVRLGGAKVPHLMPGDSLNLQTAQDTDNGYSVFEQSLLRYIAAGLGVSYEQLSRNYAQMSY---STARASANESWAYFM 397 (533) T ss_pred ceeeccCceeeecCCCCeeeecCCCCCCCCHHHHHHHHHHHHHhhcCCCHHHHhhhcccccH---HHHHHHHHHHHHHHH Confidence 0 0111 1233333333322223 2334445555667899999988765 45554 333445566999999 Q ss_pred HHHHHHHHHHHHHHHH----HHHhcCCCCh------hHHHHHhh--ceeeeeeccchHHHHHHHHHHHHHHHHHHHhhhh Q lcl|NC_015285. 212 RLRKRFSELFTDLLKT----QLILKGVMSL------EEWEDMKN--HIQFDFIADNYFTELKEIEIRNERMNQVAAMDPY 279 (359) Q Consensus 212 rLr~rFs~if~d~Lk~----QLiLkgI~t~------eew~~~~~--~I~~~f~~Dn~f~ElKe~Eil~~Rl~~~~~~dp~ 279 (359) ++|..|..=|..++-. ..+|.|.++. +-|..-.. ...|..-.--+.--+||+.-...+++. T Consensus 398 ~~q~~~~~~~~~pi~~~wl~~ail~G~i~~p~~~~~~~~~~~~~~~~~~w~~p~~~~iDP~Ke~~a~~~~i~~------- 470 (533) T protein:vir:34 398 GRRKFVASRQASQMFLCWLEEAIVRRVVTLPSKARFSFQEARSAWGNCDWIGSGRMAIDGLKEVQEAVMLIEA------- 470 (533) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHcCcccCCCccCCCchhhHHhhhceeeccCCccccChHHHHHHHHHHHHc------- Confidence 9999888655443332 4567887752 22332222 233433444455667777666555442 Q ss_pred cchhhhHHHHHHHHhCCCHHHHHHHHH-HHHHHHhcCCCCCCcchhhhcCCCCCcccccccCCCCCcCCCCCCCCCccCC Q lcl|NC_015285. 280 VGKYFSVDYMRRQVLKQTEIEIKEIDE-QIASEMEAGIIADPMAEMDPAMAAGGEGAPAAEVDPNAQESSVDPGDVRRGE 358 (359) Q Consensus 280 vGKy~S~~~i~k~IL~~tDeeI~e~~k-qi~~E~~~~~~~~P~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~ 358 (359) -.-|.+-+..+ .+..-+|+-++.+ ..+...+.+ ++.|.. +. ..-..+.++.+.+|.....+- T Consensus 471 --G~~s~~~~~a~-~G~D~~ev~~q~a~e~~~~~~~g-l~~~~~------~~-------~~~~s~~~~~~~~~~~~~~~~ 533 (533) T protein:vir:34 471 --GLSTYEKECAK-RGDDYQEIFAQQVRETMERRAAG-LKPPAW------AA-------AAFESGLRQSTEEEKSDSRAA 533 (533) T ss_pred --CCCCHHHHHHH-cCCCHHHHHHHHHHHHHHHHhcC-CCCCCC------CC-------cCccCCCCCCCCCCcccCCCC Confidence 33466666655 4655544433222 222222233 222211 00 000111111122222222222 No 91 >protein:vir:93747 Length: 472 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1475 # MgeName: 55 # Cross-refs: genbank:acc:YP_240454;genbank:gi:66396119;genbank:GeneID:5133516 Probab=68.56 E-value=0.24 Score=24.03 Aligned_cols=283 Identities=11% Similarity=0.075 Sum_probs=107.7 Q ss_pred CCCchhhHHHhhhhh-hheeeccccccccCC---CceeecHhHhhhhhc-cccc---C-CCCcchhhHHHHHHHHHHHH- Q lcl|NC_015285. 1 MRGVDLNQQLTQKAA-EYFLYNPKGLKNSTN---QGMKITTDSVTYCHS-GIQD---L-NKNMTLSHLHKAIKAVNQLR- 70 (359) Q Consensus 1 ~~~~~~~~~~~~~~~-e~f~yn~~~~~~~~~---~~v~i~~~ai~y~hS-Gl~d---~-~~~~i~syL~~Aik~~NqL~- 70 (359) -.++.+.++++.... .+|.+...+...... ....+.. ..|. |-++ . ++..-.|=++..+....-+. T Consensus 179 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~vPvv~~~nn~~g~s~~e~v~~liDa~~~ 254 (472) T protein:vir:93 179 KLENETKVEYWDKVTVNYYVYENGSLIPDYSNNLENSKTHF----STGSWGKIPFIPFKNNDLEISDIFMYKTLIDAYNR 254 (472) T ss_pred EeecceeEEEEecCeEEEEEEecCeeeeccccccccccccc----ccCCCCCcceEEecCCCCCCCchhhhHHHHHHHHH Confidence 122222333322111 123332222111000 0011110 1111 1111 0 11111233333333222222 Q ss_pred HHHHHHHHHHHhcCccceeEeccCCCCchHHHHHHHHHHHHhhcceEEeeCCCCcccccccchhhHhhhcccccCCCCcc Q lcl|NC_015285. 71 MIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKMVYDANTGEIKDDKKFMSMMEDFWLPRREGGRGT 150 (359) Q Consensus 71 m~EDalVIyR~~RAPeRRvFyIDvGnlpk~KAeqYl~~iM~kyrnklvYD~~TGevkdd~~~mSMlEDywLpRReGgrgT 150 (359) ++-+....-+.+..|-+-+.-.+.-.. .+..+. +..++ -..+ ++ +. T Consensus 255 ~~s~~~~~~~~~~~~~~~~~g~~~~~~-----~~~~~~-~~~~~-----------------------~~~~---~~--~~ 300 (472) T protein:vir:93 255 RLSDLSNTFKDSNELTYVLTNYDDQEL-----PEFKRL-LRYYG-----------------------AIKV---SD--NG 300 (472) T ss_pred HHHHHHHHHHHhcCceeEeecCCcccc-----hhhHHH-Hhhcc-----------------------cccc---CC--CC Confidence 444555555666666544432221111 111111 11110 0111 11 12 Q ss_pred ceeecCCCCCcch-HHHHHHHHHHHHHhcCCCccccCCCCcccccchhhhhHHhhhHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_015285. 151 EISTLPGGQNLGE-LEDVKYFQKKLYKALNVPSSRLETETTFNIGRAAEITRDEVKFQKFIARLRKRFSELFTDLLKTQL 229 (359) Q Consensus 151 EIsTLpGgqnLge-i~DV~YF~kkLy~aL~VP~SRl~~~~~~~~g~~~eItRDElKF~KFI~rLr~rFs~if~d~Lk~QL 229 (359) .+.+|-...+.+. ..-+.-+.+.+|+..++|---++.-++ |. .+..|.--+.....-+.+.++.|...+.++++.=+ T Consensus 301 ~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~-n~-Sg~Al~~~~~~l~~ka~~~~~~~~~~l~~~~~li~ 378 (472) T protein:vir:93 301 GVDTIQVEVPVENSKKYLDELYQKIMLFGQAVDFSSDKFGS-AP-SGVALEFLYTNLNLKADKLARKAKVAIQELLWFVF 378 (472) T ss_pred cceeEeecCCHHHHHHHHHHHHHHHHHHhCCCCCCcccccc-Cc-hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 3455433333332 234666777888888888432222111 11 22234444444566678888888888888887644 Q ss_pred HhcCCCChhHHHHHhhceeeeeeccchHHHHHHHHHHHHHHHHHHHhhhhcchhhhHHHHHHHHhCCCHHHHHHHHHHHH Q lcl|NC_015285. 230 ILKGVMSLEEWEDMKNHIQFDFIADNYFTELKEIEIRNERMNQVAAMDPYVGKYFSVDYMRRQVLKQTEIEIKEIDEQIA 309 (359) Q Consensus 230 iLkgI~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~~~~~dp~vGKy~S~~~i~k~IL~~tDeeI~e~~kqi~ 309 (359) -+-|+ ..+|. .|.+.|....--.+. +.++++..+. | .+|.+++++.+-..+| .+++.++|+ T Consensus 379 ~~~~~--~~~~~----~i~v~f~~~~p~~~~-------~~~~~~~k~~---g-iis~et~l~~l~~~~d--~~~E~~ri~ 439 (472) T protein:vir:93 379 EHFDI--KGEHK----DVDISFNYNKVANTE-------LQVQTAQQSM---G-IVSHETVLENHPFVED--LQAELERIE 439 (472) T ss_pred HHhCC--Ccccc----eeeEEeCCCCCCCHH-------HHHHHHHHHh---c-cCchHHHHHhCCCCCC--HHHHHHHHH Confidence 44443 34454 466777433222232 2344555553 4 3799999987433443 233334455 Q ss_pred HHHhcCCCCCCcchhhhcCCCCCcccccccCCCCCcCCCCCCCCCccCC Q lcl|NC_015285. 310 SEMEAGIIADPMAEMDPAMAAGGEGAPAAEVDPNAQESSVDPGDVRRGE 358 (359) Q Consensus 310 ~E~~~~~~~~P~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~ 358 (359) +|..+.. ...+..++.+..++ .+.-.+ +-..+| T Consensus 440 ~E~~~~~--------~~~~~~~~~~~d~~--~~~~~~------~~~~~e 472 (472) T protein:vir:93 440 QEQMEYN--------KQLPNLDDGGADGA--QQQERS------NNKESE 472 (472) T ss_pred HHHHHHH--------HhccCcCcccCCCC--CCCCCC------CcccCC Confidence 5432211 00011111110000 000000 111111 No 92 >protein:vir:9751 Length: 422 # NCBI annotation: putative structural protein # Family: family:all:524 # MgeID: mge:175 # MgeName: 315.3 # Cross-refs: genbank:acc:NP_795513;genbank:gi:28876291;genbank:GeneID:1257832 Probab=68.28 E-value=0.24 Score=23.99 Aligned_cols=267 Identities=10% Similarity=0.101 Sum_probs=135.8 Q ss_pred CCCchhhHH-----Hh----hhhhhheeeccccc--c-ccCCCceeecH-----hHhhhhhccc-ccCCCC-----cchh Q lcl|NC_015285. 1 MRGVDLNQQ-----LT----QKAAEYFLYNPKGL--K-NSTNQGMKITT-----DSVTYCHSGI-QDLNKN-----MTLS 57 (359) Q Consensus 1 ~~~~~~~~~-----~~----~~~~e~f~yn~~~~--~-~~~~~~v~i~~-----~ai~y~hSGl-~d~~~~-----~i~s 57 (359) +-|+.++.. ++ .+....++|-+... . ...+....++. -.|.|+|..- .+..|. -++. T Consensus 130 i~D~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~~~~~~G~s~I~e~v~~ 209 (422) T protein:vir:97 130 ILDPTTFLLTEGYAILESDSNGNPTLEAYFTDKDIWYYPKKGKPYNIKNPTGHPLLVPIIHRPDAVRPFGRSRITKAGMY 209 (422) T ss_pred EEeCCCCcceeeEEEEEecCCCcEEEEEEEcCceEEEEcCCCccccccCCCCCcceEEecccCCCccccCccccchhHHH Confidence 111111110 00 00001111111100 0 00111111110 0133443311 111111 1334 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHhcCccceeEeccCCCCchHHHHHHHHHHHHhhcceEEeeCCCCcccccccchhhHh Q lcl|NC_015285. 58 HLHKAIKAVNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKMVYDANTGEIKDDKKFMSMME 137 (359) Q Consensus 58 yL~~Aik~~NqL~m~EDalVIyR~~RAPeRRvFyIDvGnlpk~KAeqYl~~iM~kyrnklvYD~~TGevkdd~~~mSMlE 137 (359) ..+++- +.+.++++.=...=.|.|-++=+|--.-|..+-. ..|-. T Consensus 210 l~da~~------r~~~~~~~~~e~~a~pqr~i~G~d~d~~~~~~~~-----------------------------~~~~~ 254 (422) T protein:vir:97 210 HQKAAK------RTLERAEVTAEFYSFPQKYVLGMDPDAKPMEKWR-----------------------------ATVST 254 (422) T ss_pred HHHHHH------HHHHHHHHHHHHhcchhhhhcccCcccccCchhh-----------------------------hhhhh Confidence 444443 4577888888888889988865543211211111 11223 Q ss_pred hhcccccCCCCccceeecCCCCCcch-HHHHHHHHHHHHHhcCCCccccCCCCcccccchhhhhHHhhhHHHHHHHHHHH Q lcl|NC_015285. 138 DFWLPRREGGRGTEISTLPGGQNLGE-LEDVKYFQKKLYKALNVPSSRLETETTFNIGRAAEITRDEVKFQKFIARLRKR 216 (359) Q Consensus 138 DywLpRReGgrgTEIsTLpGgqnLge-i~DV~YF~kkLy~aL~VP~SRl~~~~~~~~g~~~eItRDElKF~KFI~rLr~r 216 (359) =..+|.-+.|-+.+|..+++.. |+- ++-++-.-..+....++|.+-|+..+. |-=.+..|.-.|....+-+.+-|+. T Consensus 255 i~~~~~de~~~~~~v~q~~~~~-l~~~~~~l~~~~~~~a~~s~lP~~~lg~~~~-NpsSa~Ai~a~~~~L~~ka~~k~~~ 332 (422) T protein:vir:97 255 LLEISKDEDGDKPTVGQFTTAS-MAPFMEHLKMYASLFAGGSGLTLDDLGFPSD-NPSSVESIKAAHENLRAAGRKAQRS 332 (422) T ss_pred hhccCCCCCCCcceeeecCCCC-hhHHHHHHHHHHHHHhcccCCCHHHhccccC-chhHHHHHHHHHHHHHHHHHHHHHH Confidence 3456777777778898898866 442 333444444555556999888865443 2112345777788899999999999 Q ss_pred HHHHHHHHHHHHHHhcCCCC--hhHHHHHhhceeeeeeccchHHHHHHHHHHHHHHHHHHHhhhhcchhhhHHHHHHHHh Q lcl|NC_015285. 217 FSELFTDLLKTQLILKGVMS--LEEWEDMKNHIQFDFIADNYFTELKEIEIRNERMNQVAAMDPYVGKYFSVDYMRRQVL 294 (359) Q Consensus 217 Fs~if~d~Lk~QLiLkgI~t--~eew~~~~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~~~~~dp~vGKy~S~~~i~k~IL 294 (359) |..-+..+++.-+.+.|-.. +++| ..+.+.|. .++-. ++..+.+..+.+..+..-+=.+.+.+++++. | T Consensus 333 fg~~l~~~~rla~~~~~~~~~~~~~~----~~~~~~w~-p~~~~---~~~s~a~~aDa~~Kl~~a~~~~~~~~~~~~~-l 403 (422) T protein:vir:97 333 FSSGFLNVAYIAVCLRDEFPYLRNQF----MDTVIKWE-PLFEA---DANMLTLVGDGAIKLNQAIPGFMDADVIRDL-T 403 (422) T ss_pred HHHHHHHHHHHHHHHhcCCcccchhh----ccceEEEc-cCCCC---ChHHHHHHHHHHHHHHhhccccccHHHHHHH-c Confidence 99999999999887766443 3344 34566665 22211 2333445555554443322234566766655 8 Q ss_pred CCCHHHHHHHHHHHHHHHhcC Q lcl|NC_015285. 295 KQTEIEIKEIDEQIASEMEAG 315 (359) Q Consensus 295 ~~tDeeI~e~~kqi~~E~~~~ 315 (359) ++++.+++ ...+++++.+| T Consensus 404 g~~~~~~~--~~~~~~~~~d~ 422 (422) T protein:vir:97 404 GVKGADKP--IPAITEVTTDG 422 (422) T ss_pred CCCchhHH--HHHHHhhhccC Confidence 99877664 33445555555 No 93 >protein:vir:93943 Length: 409 # NCBI annotation: ORF010 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1485 # MgeName: 3A # Cross-refs: genbank:acc:YP_239936;genbank:gi:66395598;genbank:GeneID:5131009 Probab=67.74 E-value=0.25 Score=23.91 Aligned_cols=275 Identities=15% Similarity=0.204 Sum_probs=113.1 Q ss_pred CCCc--------hhhHHHhhhhhhheeecccc---------------ccccCCCceeecHhHhhhhhcccccCCCCcchh Q lcl|NC_015285. 1 MRGV--------DLNQQLTQKAAEYFLYNPKG---------------LKNSTNQGMKITTDSVTYCHSGIQDLNKNMTLS 57 (359) Q Consensus 1 ~~~~--------~~~~~~~~~~~e~f~yn~~~---------------~~~~~~~~v~i~~~ai~y~hSGl~d~~~~~i~s 57 (359) +.++ ..| ...+.+...|.. +....+..+.++.+-|.+.. +....++=.=+| T Consensus 107 l~Gnay~~i~r~~~G-----~~~~L~~l~~~~v~~~~~~~~~~~~y~~~~~~g~~~~~~~~eVih~r-~~~~~~~~~G~s 180 (409) T protein:vir:93 107 EKGNAYVLIERDIYH-----QPSKLFLLNPDVVEMLIENQSRELYYSIHAATGNKLIVHNMDMLHFK-HIVASNMVQGIS 180 (409) T ss_pred hcCceEEEEEECCCC-----cEEEEEEEcCceeEEEEeCCCcEEEEEEEcCCceEEEEccccEEEeC-CCCCCCcccccc Confidence 1111 111 112222222211 11122334668888777653 111111111134 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHhcCccceeEeccCCCCchHHHHHHHHHHHHhhcceEEeeCCCCcccccccchhhHh Q lcl|NC_015285. 58 HLHKAIKAVNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKMVYDANTGEIKDDKKFMSMME 137 (359) Q Consensus 58 yL~~Aik~~NqL~m~EDalVIyR~~RAPeRRvFyIDvGnlpk~KAeqYl~~iM~kyrnklvYD~~TGevkdd~~~mSMlE 137 (359) -|..|.++......++..- +. -..+|..-+ ..--+.+.+.+++...+.+.+.|. ..|.+ + .+ T Consensus 181 ~i~~~~~~i~~~~~~~~~~-~~-~~~~~~~~i-~~~~~~l~~e~~~~~~~~~~~~~~-------~~g~~------~-vl- 242 (409) T protein:vir:93 181 PIDVLKNTTDFDNAVRTFN-LT-EMQKPDSFM-LKYGSNVGKEKRQQVLEDFKQYYE-------ENGGI------L-FQ- 242 (409) T ss_pred HHHHHHHHHHHHHHHHHHH-HH-hcCCCCceE-EecCCCCCHHHHHHHHHHHHHHhh-------cCCCe------e-ec- Confidence 4555555544444444442 33 333343333 233456666666555555444332 12321 1 11 Q ss_pred hhcccccCCCCccceeecCCCCCcch---HHHHHHHHHHHHHhcCCCccccCCCCcccccchhhhhHHhhhHHHHHHHHH Q lcl|NC_015285. 138 DFWLPRREGGRGTEISTLPGGQNLGE---LEDVKYFQKKLYKALNVPSSRLETETTFNIGRAAEITRDEVKFQKFIARLR 214 (359) Q Consensus 138 DywLpRReGgrgTEIsTLpGgqnLge---i~DV~YF~kkLy~aL~VP~SRl~~~~~~~~g~~~eItRDElKF~KFI~rLr 214 (359) + .|.+++.|. .+.-+ ++-..|-...+.++++||...|...++-+.+...+..+. |..++ |+ T Consensus 243 -------~--~g~~~~~l~--~~~~d~q~~e~r~~~~~~Ia~~fgVPp~~lg~~~~~~~sn~e~~~~~---f~~~~--l~ 306 (409) T protein:vir:93 243 -------E--PGVEIEPLP--KKYVSEDIVASENLTRERVANVFQLPSVFLNARSNTNFAKNEELNRF---YLQHT--LL 306 (409) T ss_pred -------C--CCceEEEcC--CChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHHHH---HHHHH--HH Confidence 1 256677664 22233 333346778899999999999976555555555554443 65553 33 Q ss_pred HHHHHHHHHHHHHHHHhcCCCChhHHHHHhhceeeeeeccchHHHHHHHHHHHHHHHHHHHhhhhcchhhhHHHHHHHHh Q lcl|NC_015285. 215 KRFSELFTDLLKTQLILKGVMSLEEWEDMKNHIQFDFIADNYFTELKEIEIRNERMNQVAAMDPYVGKYFSVDYMRRQVL 294 (359) Q Consensus 215 ~rFs~if~d~Lk~QLiLkgI~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~~~~~dp~vGKy~S~~~i~k~IL 294 (359) -.+. .+.+.|-.. ++++.+|. ....|.|..+ ++...+ +..|.+.+..+-.- -+++..-++.. + T Consensus 307 P~~~-~ie~~l~~~-----Ll~~~~~~---~~~~~~fd~~----~ll~~d-~~~~~~~~~~~~~~--G~~T~NE~R~~-~ 369 (409) T protein:vir:93 307 PIVK-QYEEEFNRK-----LLTKTDRE---KNRYFKFNVK----SYLRAD-SATQAEVYFKAVRS--GYYTINDIREW-E 369 (409) T ss_pred HHHH-HHHHHHHhh-----cCCccccc---CcceEEeech----hhhccC-HHHHHHHHHHHHhC--CCcCHHHHHHH-h Confidence 2211 133333333 44555554 2334555432 333333 34566666555332 24555555533 4 Q ss_pred CCCHHHHHHHHHHHHHHHhcCCCCCCcchhhhcCCCCCcccccccCCCCCc--CCCCCCCCCccC Q lcl|NC_015285. 295 KQTEIEIKEIDEQIASEMEAGIIADPMAEMDPAMAAGGEGAPAAEVDPNAQ--ESSVDPGDVRRG 357 (359) Q Consensus 295 ~~tDeeI~e~~kqi~~E~~~~~~~~P~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~p~~~~~~ 357 (359) ++.+-+ --+ -.+-+.+ ..+.. .+... ....--+|...| T Consensus 370 g~~p~~--ggD---------~~~~~~n------------~~~~~--~~~~~~~~~~gG~~n~~e~ 409 (409) T protein:vir:93 370 DLPPVE--GGD---------KPLISGD------------LYPID--TPLELRKSLKGGDKNVNES 409 (409) T ss_pred CCCCCC--CcC---------eeeeccc------------ccccc--cchhhcccccCCCCCcCCC Confidence 443221 000 0000000 00000 00000 001111233333 No 94 >protein:vir:9359 Length: 348 # NCBI annotation: head portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:166 # MgeName: phi 12 # Cross-refs: genbank:acc:NP_803337;genbank:gi:29028648;genbank:GeneID:1258089 Probab=66.07 E-value=0.27 Score=23.67 Aligned_cols=276 Identities=14% Similarity=0.213 Sum_probs=113.5 Q ss_pred CCC--------chhhHHHhhhhhhheeecccc---------------ccccCCCceeecHhHhhhhhcccccCCCCcchh Q lcl|NC_015285. 1 MRG--------VDLNQQLTQKAAEYFLYNPKG---------------LKNSTNQGMKITTDSVTYCHSGIQDLNKNMTLS 57 (359) Q Consensus 1 ~~~--------~~~~~~~~~~~~e~f~yn~~~---------------~~~~~~~~v~i~~~ai~y~hSGl~d~~~~~i~s 57 (359) +.+ +..|. +.+.+...|.. +....+..+.++.+-|.+.- +....+.=.=+| T Consensus 46 l~Gna~~~i~r~~~G~-----~~~L~~l~~~~v~~~~~~~~~~~~y~~~~~~g~~~~~~~~eiih~r-~~~~~~~~~G~s 119 (348) T protein:vir:93 46 EKGNAYVLIERDIYHQ-----PSKLFLLNPDVVEMLIENQSRELYYSIHAATGNKLIVHNMDMLHFK-HIVASNMVQGIS 119 (348) T ss_pred hcCCeEEEEEECCCCc-----EEEEEEEcCCceEEEEeCCCcEEEEEEEcCCCeEEEEccccEEEec-CCCCCCceeecc Confidence 111 11111 11222222211 11122345678888876541 111111111246 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHhcCccceeEeccCCCCchHHHHHHHHHHHHhhcceEEeeCCCCcccccccchhhHh Q lcl|NC_015285. 58 HLHKAIKAVNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKMVYDANTGEIKDDKKFMSMME 137 (359) Q Consensus 58 yL~~Aik~~NqL~m~EDalVIyR~~RAPeRRvFyIDvGnlpk~KAeqYl~~iM~kyrnklvYD~~TGevkdd~~~mSMlE 137 (359) .+..|..++.....++... +....+.| . +...-.+++-+.++++..+.+...|. . .|.+ + . T Consensus 120 ~~~~~~~~i~~~~~~~~~~-~~~~~~~~-~-~i~~~~~~l~~e~~~~~~~~~~~~~~------n-~~~~------~-v-- 180 (348) T protein:vir:93 120 PIDVLKNTTDFDNAVRTFN-LTEMQKPD-S-FMLKYGSNVSTEKRQQVLEDFKQYYE------E-NGGI------L-F-- 180 (348) T ss_pred HHHHHHHHHHHHHHHHHHH-HHhcCCCc-e-eEEecCCCCCHHHHHHHHHHHHHHhh------c-CCCe------e-e-- Confidence 6777777766655555553 33333333 2 33333456666666555555444332 2 2321 1 1 Q ss_pred hhcccccCCCCccceeecCCCCCcchHHH---HHHHHHHHHHhcCCCccccCCCCcccccchhhhhHHhhhHHHHH-HHH Q lcl|NC_015285. 138 DFWLPRREGGRGTEISTLPGGQNLGELED---VKYFQKKLYKALNVPSSRLETETTFNIGRAAEITRDEVKFQKFI-ARL 213 (359) Q Consensus 138 DywLpRReGgrgTEIsTLpGgqnLgei~D---V~YF~kkLy~aL~VP~SRl~~~~~~~~g~~~eItRDElKF~KFI-~rL 213 (359) | ..|.+++.|. .+..+++= -+|..+.+.++++||...+...++-+.....+..+. |.+++ .-+ T Consensus 181 ---l-----~~g~~~~~l~--~~~~d~q~~e~~~~~~~~Ia~~fgVP~~~lg~~~~~~~~~~e~~~~~---~~~~~l~P~ 247 (348) T protein:vir:93 181 ---Q-----EPGVEIEPLP--KKYVSEDIVASENLTRERVANVFQLPSIFLNARSNTNFAKNEELNRF---YLQHTLLPI 247 (348) T ss_pred ---c-----CCCceEEEcC--CChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHHHH---HHHHHHHHH Confidence 1 1356677664 33444333 346788899999999999986655566555554443 54433 222 Q ss_pred HHHHHHHHHHHHHHHHHhcCCCChhHHHHHhhceeeeeeccchHHHHHHHHHHHHHHHHHHHhhhhcchhhhHHHHHHHH Q lcl|NC_015285. 214 RKRFSELFTDLLKTQLILKGVMSLEEWEDMKNHIQFDFIADNYFTELKEIEIRNERMNQVAAMDPYVGKYFSVDYMRRQV 293 (359) Q Consensus 214 r~rFs~if~d~Lk~QLiLkgI~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~~~~~dp~vGKy~S~~~i~k~I 293 (359) -.++... +-..++++.+|.. ..+|+|++. ++...+ +..|.+++..+-.- -+++.+-++. . T Consensus 248 ~~~ie~~---------l~~~l~~~~~~~~-g~~i~fd~~------~l~~~d-~~~~a~~~~~~~~~--G~~T~NE~R~-~ 307 (348) T protein:vir:93 248 VKQYEEE---------FNRKLLTKTDREK-NRYFKFNVK------SYLRAD-SATQAEVYFKAVRS--GYYTINDIRE-W 307 (348) T ss_pred HHHHHHH---------HHHhhCCcccccC-cceEEeech------hhhccC-HHHHHHHHHHHHhC--CCCCHHHHHH-H Confidence 2222222 2334455666541 233444332 222222 34566665555322 2445555553 2 Q ss_pred hCCCHHHHHHHHHHHHHHHhcCCCCCCcchhhhcCCCCCcccccccCCCCCcCCCCCCCCCccC Q lcl|NC_015285. 294 LKQTEIEIKEIDEQIASEMEAGIIADPMAEMDPAMAAGGEGAPAAEVDPNAQESSVDPGDVRRG 357 (359) Q Consensus 294 L~~tDeeI~e~~kqi~~E~~~~~~~~P~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~ 357 (359) +++.+-+ --++-+ .....+ ..++..+......|++ ++...| T Consensus 308 ~g~~p~~--ggD~~~---~~~n~~-~~~~~~~~~~~~~gg~-----------------~n~~~~ 348 (348) T protein:vir:93 308 EDLPPVE--GGDKPL---ISGDLY-PIDTPLELRKSLKGGD-----------------KNVNES 348 (348) T ss_pred hCCCCCC--CcCeEe---eccccc-ccccchhhcccccCCC-----------------CCcCCC Confidence 4443211 000000 000000 0000000000000111 111112 No 95 >protein:vir:3964 Length: 453 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:83 # MgeName: ul36 # Cross-refs: genbank:acc:NP_663672;genbank:gi:21716109;genbank:GeneID:951201 Probab=65.74 E-value=0.28 Score=23.63 Aligned_cols=283 Identities=12% Similarity=0.113 Sum_probs=115.5 Q ss_pred CCCchhhHHH------h---hhhhhheeeccccccc--cCCCceeecHhHhhhhhc-cccc----CCCCcchhhHHHHHH Q lcl|NC_015285. 1 MRGVDLNQQL------T---QKAAEYFLYNPKGLKN--STNQGMKITTDSVTYCHS-GIQD----LNKNMTLSHLHKAIK 64 (359) Q Consensus 1 ~~~~~~~~~~------~---~~~~e~f~yn~~~~~~--~~~~~v~i~~~ai~y~hS-Gl~d----~~~~~i~syL~~Aik 64 (359) +-++..+.++ + ....-+-+|++...+. ....++.+... .-|- |.++ +++..-.|=++..+. T Consensus 149 v~d~~~~~~~~~~ir~~~~~~~~~~~~~yt~~~i~~~~~~~~~~~~~~~---~~~~~g~vPvv~~~n~~~g~sd~e~v~~ 225 (453) T protein:vir:39 149 VYDDTIKQEPLFAVRYGYDDDYKLYGEVYTKETTYALNGTMGFYNMTEQ---APNPFDDLPVVEFYFNEERMSIFESVIS 225 (453) T ss_pred EecCCCCCeEEEEEEEEEeCCeEEEEEEEeCCeEEEEEecCCceeeecc---cccCCCceeEEEecCCCCCCcchhhhHH Confidence 1111111100 0 0001112334433221 11112221110 0110 1111 112223344444333 Q ss_pred HHHHH-HHHHHHHHHHHHhcCccceeEeccCCCCchHHHHHHHHHHHHhhc-ceEEeeCCCCcccccccchhhHhhhccc Q lcl|NC_015285. 65 AVNQL-RMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYR-NKMVYDANTGEIKDDKKFMSMMEDFWLP 142 (359) Q Consensus 65 ~~NqL-~m~EDalVIyR~~RAPeRRvFyIDvGnlpk~KAeqYl~~iM~kyr-nklvYD~~TGevkdd~~~mSMlEDywLp 142 (359) ...-+ +++-+....-+.++.|-+-+.-. +++... +.+.+ +++. ....+ . T Consensus 226 liDa~~~~~s~~~~~~~~~~~p~~~~~g~---~~~~~~--------~~~~~~~~~~-~~~~~--------------~--- 276 (453) T protein:vir:39 226 LVNAFNKAISEKANDVDYFSDQYLTFLGA---AVEEED--------LKNIRSNRVI-NYYGE--------------S--- 276 (453) T ss_pred HHHHHHHHHHHHHHHHHHhhCceeeeecC---CCCchh--------hhhhhhccee-eecCC--------------C--- Confidence 33222 24455555667777776655432 222211 11111 1111 10000 0 Q ss_pred ccCCCCccceeecCCCCCcchHH-HHHHHHHHHHHhcCCCccccCCCCcccccchh--hhhHHhhhHHHHHHHHHHHHHH Q lcl|NC_015285. 143 RREGGRGTEISTLPGGQNLGELE-DVKYFQKKLYKALNVPSSRLETETTFNIGRAA--EITRDEVKFQKFIARLRKRFSE 219 (359) Q Consensus 143 RReGgrgTEIsTLpGgqnLgei~-DV~YF~kkLy~aL~VP~SRl~~~~~~~~g~~~--eItRDElKF~KFI~rLr~rFs~ 219 (359) ..+.+.++.+|....+.+.+. -++-+.+.+|....+|- +..+ . +|++| .|.--+.....-+.+.|..|.. T Consensus 277 --~~~~~~~~~~lt~~~~~~~~~~~~~~l~~~I~~~s~~p~--~~~~-~--~gn~Sg~Al~~~~~~l~~ka~~~~~~~~~ 349 (453) T protein:vir:39 277 --SEAKNVDVKFLEKPDSDSQTENLLDRLTKLIFQTTMVAN--ISDE-S--FGSSSGVSLAYKLQAMSNLALSFQRKFQS 349 (453) T ss_pred --CCCCCCceeEEeecCCHHHHHHHHHHHHHHHHHHhCCcc--cccc-c--ccCChHHHHHHHHHHHHHHHHHHHHHHHH Confidence 112344567666555555444 35667777788888884 2222 2 23333 2332333344567777788888 Q ss_pred HHHHHHHHHHHhcCCC-ChhHHHHHhhceeeeeeccchHHHHHHHHHHHHHHHHHHHhhhhcchhhhHHHHHHHHhCCCH Q lcl|NC_015285. 220 LFTDLLKTQLILKGVM-SLEEWEDMKNHIQFDFIADNYFTELKEIEIRNERMNQVAAMDPYVGKYFSVDYMRRQVLKQTE 298 (359) Q Consensus 220 if~d~Lk~QLiLkgI~-t~eew~~~~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~~~~~dp~vGKy~S~~~i~k~IL~~tD 298 (359) -+...++.=+-+-+.. ...+| ..|.+.|...-.=.+ .+.++++..+. | .+|.+++++. |...+ T Consensus 350 ~l~~~~~li~~~~~~~~~~~~~----~~i~v~f~~~~p~~~-------~~~a~~~~kl~---g-~is~et~l~~-l~~v~ 413 (453) T protein:vir:39 350 SLNSRYKLYCELSTNVSNKEAW----KDIEYTFTRNEPKDI-------KEQAETANILM---G-ITSQETALSV-ISVIP 413 (453) T ss_pred HHHHHHHHHHHHHhccCCcccc----ccceEEeCCCCCcCH-------HHHHHHHHHHh---c-cCChHHHHHh-CCCCC Confidence 7777777533332322 23333 356788864433223 33445555553 4 3799999976 56654 Q ss_pred HHHHHHHHHHHHHHhcCCCCCCcchhhhcCCCCCcccccccCCCCCcCCCCCCCCCcc Q lcl|NC_015285. 299 IEIKEIDEQIASEMEAGIIADPMAEMDPAMAAGGEGAPAAEVDPNAQESSVDPGDVRR 356 (359) Q Consensus 299 eeI~e~~kqi~~E~~~~~~~~P~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~ 356 (359) + .+++-++|++|.....-.+++. +....|..++ .|.+.++ T Consensus 414 D-~~~E~~ri~~E~~~~~~~~~~~-~~~~~~~~~~----------------~~~~~~e 453 (453) T protein:vir:39 414 D-VQAEMEKIKKEEASTAIFDKDK-QPSEKGTDTV----------------VPETNEE 453 (453) T ss_pred C-HHHHHHHHHHHHHHHHHHHHhc-cCCCCCCCCC----------------CCCcCCC Confidence 2 3344456666654432111111 1111111111 1112222 No 96 >protein:vir:95806 Length: 440 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1578 # MgeName: SMP # Cross-refs: genbank:acc:YP_950583;genbank:gi:119953778;genbank:GeneID:5076876 Probab=64.91 E-value=0.29 Score=23.51 Aligned_cols=286 Identities=9% Similarity=0.083 Sum_probs=117.1 Q ss_pred CCCchhhHHHhh--------hhhhheeeccccccc-----cCCCceeecHhHhhhhhc-cccc----CCCCcchhhHHHH Q lcl|NC_015285. 1 MRGVDLNQQLTQ--------KAAEYFLYNPKGLKN-----STNQGMKITTDSVTYCHS-GIQD----LNKNMTLSHLHKA 62 (359) Q Consensus 1 ~~~~~~~~~~~~--------~~~e~f~yn~~~~~~-----~~~~~v~i~~~ai~y~hS-Gl~d----~~~~~i~syL~~A 62 (359) +.++....+++- ....+-+|++..... ....+++... .+.|. |-++ +|+..-.|=++.. T Consensus 131 ~~d~~~~~~~~~~i~~~~~~~~~~~~vyt~~~~~~~~~~~~~~~~~~~~~---~~~~~~g~vPvv~~~n~~~g~sd~e~v 207 (440) T protein:vir:95 131 IRDLTVEQNIIAAVHLPIYADKVNMTVYTKDKVITYKPYSNNSVRLVVDD---VKKHSYNDVPVVEWWNNRFRMGDYESE 207 (440) T ss_pred EEcCCCCCceEEEEEEEEecCceEEEEEeCCeEEEEEEecCCccceeecc---eeeccCceeeEEEeeCCCCCCCchhhh Confidence 111110000000 000111344433221 0011111110 01121 2111 1111122333443 Q ss_pred HHHHHHHH-HHHHHHHHHHHhcCccceeEeccCCCCchHHHHHHHHHHHHhhcceEEeeCCCCcccccccchhhHhhhcc Q lcl|NC_015285. 63 IKAVNQLR-MIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKMVYDANTGEIKDDKKFMSMMEDFWL 141 (359) Q Consensus 63 ik~~NqL~-m~EDalVIyR~~RAPeRRvFyIDvGnlpk~KAeqYl~~iM~kyrnklvYD~~TGevkdd~~~mSMlEDywL 141 (359) +.....+. ++-+....-+..+.|-+-+.-.+.+.-.... ..+.+++.+ -.|+ T Consensus 208 ~~lida~~~~~s~~~~~~~~~~~~~~v~~g~~~~~~~~~e--------------------~~~~~~~~~-------~~~~ 260 (440) T protein:vir:95 208 ISLIDAYDAGQSDTANYMSDLNDAMLLVKGDLDGIKLSPE--------------------DAAKMKDAN-------MLFL 260 (440) T ss_pred HHHHHHHHHHHHHHHHHHHHhhcceeeeecccccCCCCcc--------------------chhhhhhcc-------ceec Confidence 33333332 3344444455666665554432111100000 001111111 1233 Q ss_pred ccc----CCCCccceeecCCCCCcchH-HHHHHHHHHHHHhcCCCccccCCCCcccccchhhhhHHhhhHHHHHHHHHHH Q lcl|NC_015285. 142 PRR----EGGRGTEISTLPGGQNLGEL-EDVKYFQKKLYKALNVPSSRLETETTFNIGRAAEITRDEVKFQKFIARLRKR 216 (359) Q Consensus 142 pRR----eGgrgTEIsTLpGgqnLgei-~DV~YF~kkLy~aL~VP~SRl~~~~~~~~g~~~eItRDElKF~KFI~rLr~r 216 (359) +.. .++.+..++.|-...++... .-++-+.+.+|....+|-=-++.-++ |. .+..|.--+.....-+.+.|.. T Consensus 261 ~~~~~~~~~~~~~~~~~lt~~~~~~~~~~~~~~l~~~i~~~s~~p~~~~~~~~~-n~-Sg~Al~~~~~~l~~k~~~k~~~ 338 (440) T protein:vir:95 261 KTGISTTGQQTTADASYIYKQYDVNGTEAYKNRLANDIHRFSRIPNLDDDRFNS-TS-SGIALLYKMIGLEQVRKDKETY 338 (440) T ss_pred ccccccccCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhCCcccccccccc-cc-hHHHHHHHHHHHHHHHHHHHHH Confidence 222 22334457777665665544 44777888899999998411111111 11 1122333333344557777888 Q ss_pred HHHHHHHHHHHHHHhcCCCChhHHHHHhhceeeeeeccchHHHHHHHHHHHHHHHHHHHhhhhcchhhhHHHHHHHHhCC Q lcl|NC_015285. 217 FSELFTDLLKTQLILKGVMSLEEWEDMKNHIQFDFIADNYFTELKEIEIRNERMNQVAAMDPYVGKYFSVDYMRRQVLKQ 296 (359) Q Consensus 217 Fs~if~d~Lk~QLiLkgI~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~~~~~dp~vGKy~S~~~i~k~IL~~ 296 (359) |..-+.++++.=+-+-+++...+|+. ..+.+.|..--.-.+. +.++++..+. | .+|.++++.. |.. T Consensus 339 ~~~~l~~~~~li~~~~~~~~~~~~~~--~~v~i~f~~~~p~~~~-------~~ad~~~kl~---g-~iS~et~~~~-l~~ 404 (440) T protein:vir:95 339 FTKALRRRYELISNIHKAINGPVIEA--NKLTFTFHPNIPQDVW-------TEIKAYIEAG---G-EISQETLMEN-ASF 404 (440) T ss_pred HHHHHHHHHHHHHHHHhhcCCccccc--ccceEEeCCCCCCCHH-------HHHHHHHHHh---c-cCcHHHHHHh-CCC Confidence 88888888876544444555555553 4577777655444443 3444445543 3 4799999987 566 Q ss_pred CHHHHHHHHHHHHHHHhcCCCCCCcchhhhcCCCCCcccccccCCCC Q lcl|NC_015285. 297 TEIEIKEIDEQIASEMEAGIIADPMAEMDPAMAAGGEGAPAAEVDPN 343 (359) Q Consensus 297 tDeeI~e~~kqi~~E~~~~~~~~P~~~~~~~~~~~~~~~~~~~~~~~ 343 (359) .|.+ ++.++|++|.... .++ .+...|..-++. .+++ T Consensus 405 ~d~~--~E~~ri~~E~~~~---~~~--~~~~~~~~~~~~----~~~e 440 (440) T protein:vir:95 405 TDYK--TEHSRILKQGGSS---DLE--IGQIVGDADVGQ----ADTE 440 (440) T ss_pred CCcH--HHHHHHHHHHHHh---hhh--HHhhccCCCCCC----cCCC Confidence 5543 3345555554431 111 111111100111 1222 No 97 >protein:vir:4509 Length: 424 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:97 # MgeName: V # Cross-refs: genbank:acc:NP_599035;genbank:gi:19548993;genbank:GeneID:935206 Probab=64.59 E-value=0.3 Score=23.47 Aligned_cols=279 Identities=10% Similarity=0.060 Sum_probs=119.5 Q ss_pred CCCc--------hhhHHHhhhhhhheeeccccc------------cccCCCceeecHhHhhhhhcccccCCCCcchhhHH Q lcl|NC_015285. 1 MRGV--------DLNQQLTQKAAEYFLYNPKGL------------KNSTNQGMKITTDSVTYCHSGIQDLNKNMTLSHLH 60 (359) Q Consensus 1 ~~~~--------~~~~~~~~~~~e~f~yn~~~~------------~~~~~~~v~i~~~ai~y~hSGl~d~~~~~i~syL~ 60 (359) +.++ ..|. ..+.+...|... ........+++.+-|.+.. + .+.++-.-+|-++ T Consensus 124 l~Gna~~~i~r~~~G~-----~~~L~~l~~~~v~i~~~~~~~~y~~~~~~~~~~~~~~eVih~r-~-~~~d~~~G~spi~ 196 (424) T protein:vir:45 124 GWGNGYTWVKRNRRGE-----VISLDCCMPWETTLMNTGGRYTYGLYNEYGAFAISPDDMIHIR-A-LGNNQKMGLSPIM 196 (424) T ss_pred hcCCeEEEEEEcCCCc-----EEEEEEecCceEEEEEcCCeEEEEEEecCceEEECcccEEEec-C-cCCCCcccccHHH Confidence 1110 1110 011111111110 0112233567777776553 2 3445556689999 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhcCccceeEeccCCCCchHHHHHHHHHHHHhhcceEEeeCCCCcccccccchhhHhhhc Q lcl|NC_015285. 61 KAIKAVNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKMVYDANTGEIKDDKKFMSMMEDFW 140 (359) Q Consensus 61 ~Aik~~NqL~m~EDalVIyR~~RAPeRRvFyIDvGnlpk~KAeqYl~~iM~kyrnklvYD~~TGevkdd~~~mSMlEDyw 140 (359) .|...+.....+++...=+----|--+-|+.++- .|.+.++++--+.+-..|+- ..+ +.|.+ + .+ T Consensus 197 ~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~-~l~~e~~~~~~~~~~~~~~g--~~~-n~g~~------~-vl---- 261 (424) T protein:vir:45 197 QHAETIGMGMSGQKYTESFFSGNARPAGIVSVKS-GLNKESWGWLKDQWQKASQA--LRR-QENKT------M-LL---- 261 (424) T ss_pred HHHHHHHHHHHHHHHHHHHHhccCCccEEEEeCC-CCCHHHHHHHHHHHHHHhcc--ccc-cCCce------e-Ec---- Confidence 9999999888888876644444455567777774 46665554444433333321 111 11211 1 11 Q ss_pred ccccCCCCccceeecCCC-CCcchHHHHHHHHHHHHHhcCCCccccCCCCcccccchhhhhHHhhhHHHH-HHHHHHHHH Q lcl|NC_015285. 141 LPRREGGRGTEISTLPGG-QNLGELEDVKYFQKKLYKALNVPSSRLETETTFNIGRAAEITRDEVKFQKF-IARLRKRFS 218 (359) Q Consensus 141 LpRReGgrgTEIsTLpGg-qnLgei~DV~YF~kkLy~aL~VP~SRl~~~~~~~~g~~~eItRDElKF~KF-I~rLr~rFs 218 (359) ..|.++..|.=. ..+.-++-.++-.+.+.++++||...|...+.-+.....+..+. |.++ +.-+-.++. T Consensus 262 ------~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~sn~eq~~~~---f~~~tL~P~~~~ie 332 (424) T protein:vir:45 262 ------PADLDYKALTVSPVDAQIIDMMKLNRSMIAGIFNIPAHMINDLEKATFSNISAQAIQ---FVRYTMMPWVTNWE 332 (424) T ss_pred ------CCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHHHH---HHHHHHHHHHHHHH Confidence 124455555321 11222455568888999999999999975443333334444433 5544 233322222 Q ss_pred HHHHHHHHHHHHhcCCCChhHHHHHhhceeeeeeccchHHHHHHHHHHHHHHHHHHHhhhhcchhhhHHHHHHHHhCCCH Q lcl|NC_015285. 219 ELFTDLLKTQLILKGVMSLEEWEDMKNHIQFDFIADNYFTELKEIEIRNERMNQVAAMDPYVGKYFSVDYMRRQVLKQTE 298 (359) Q Consensus 219 ~if~d~Lk~QLiLkgI~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~~~~~dp~vGKy~S~~~i~k~IL~~tD 298 (359) +.|-.. +.|+.|+. ....+.|..+. +... -+..|.+.++.+-.- -+++.+-++. ++++.. T Consensus 333 ----~~ln~k-----Ll~~~e~~---~g~~i~fd~~~----llr~-d~~~r~~~~~~~~~~--g~~T~NE~R~-~~gl~p 392 (424) T protein:vir:45 333 ----QELNRR-----LFTRAELA---AGYYVRFNLTG----LLRG-TPQERAQFYHFAITD--GWMSRNEARA-FEDMNP 392 (424) T ss_pred ----HHHHHh-----cCChhhhc---CCcEEEeechh----hhcc-CHHHHHHHHHHHHhC--CCcCHHHHHH-HhCCCC Confidence 222222 34444432 23345555332 2111 134566666554332 2444444442 233321 Q ss_pred HHHHHHHHHHHHHHhcCCCCCCcchhhhcCCCCCcccccccCCCCCcCCCCCCCCCccCC Q lcl|NC_015285. 299 IEIKEIDEQIASEMEAGIIADPMAEMDPAMAAGGEGAPAAEVDPNAQESSVDPGDVRRGE 358 (359) Q Consensus 299 eeI~e~~kqi~~E~~~~~~~~P~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~ 358 (359) +++-+.-.-+ ...+ .|.....+.+...++..+ T Consensus 393 ------------------i~ggD~~~~~-----~n~~-----~~~~~~~~~~~~~~~~~~ 424 (424) T protein:vir:45 393 ------------------VEGLDEMLVS-----VNAA-----NPAGDFKPPKNDEGKTNE 424 (424) T ss_pred ------------------CCCcceeeec-----cccc-----ccccccCCCCCCCCCCCC Confidence 1110000000 0000 011111111111222222 No 98 >protein:vir:78805 Length: 511 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1858 # MgeName: 80alpha # Cross-refs: genbank:acc:YP_001285356;genbank:gi:148717884;genbank:GeneID:5246936 Probab=63.05 E-value=0.32 Score=23.27 Aligned_cols=302 Identities=10% Similarity=0.045 Sum_probs=103.5 Q ss_pred CCCchhhHHHhhh----------------hhhheeeccccccc---cCCCceeec-HhHhhhhhc-cccc---C-CCCcc Q lcl|NC_015285. 1 MRGVDLNQQLTQK----------------AAEYFLYNPKGLKN---STNQGMKIT-TDSVTYCHS-GIQD---L-NKNMT 55 (359) Q Consensus 1 ~~~~~~~~~~~~~----------------~~e~f~yn~~~~~~---~~~~~v~i~-~~ai~y~hS-Gl~d---~-~~~~i 55 (359) +-++....+.+-+ +.-+-+|++...+. ..+.+.... ...-...|. |-++ . ++..- T Consensus 175 v~dd~~~~~~~~~vr~~~~~~~~~~~~~~~~~~~vyt~~~i~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~g 254 (511) T protein:vir:78 175 IYDNTVERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTNRTNGLKLTPRENSFESHSFERMPITEFSNNERR 254 (511) T ss_pred EEcCCCCCceEEEEEEEEeeeccccccceEEEEEEEeCCcEEEEEecCCCcccccccccccccCcCcccceEEecCCCCC Confidence 1111111111111 11122555554321 111111110 000001111 1111 0 11111 Q ss_pred hhhHHHHHHHHHHHHH-HHHHHHHHHHhcCccceeEeccCCCCchHHHHHHHHHHHHhhcceEEeeCCCCccc-ccccch Q lcl|NC_015285. 56 LSHLHKAIKAVNQLRM-IEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKMVYDANTGEIK-DDKKFM 133 (359) Q Consensus 56 ~syL~~Aik~~NqL~m-~EDalVIyR~~RAPeRRvFyIDvGnlpk~KAeqYl~~iM~kyrnklvYD~~TGevk-dd~~~m 133 (359) .|=++..+.....+.. +-+....-+.++.|-+-+.-....+ .++++ +....+ T Consensus 255 ~gd~e~v~~liDa~~~~~S~~~~~~~~~~~~~lv~~G~~~~~--------------------------~~~~~~~~~~~~ 308 (511) T protein:vir:78 255 KGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLNLD--------------------------PVEVRKQKEANV 308 (511) T ss_pred CCchhhhHHHHHHHHHHHHHHHHHHHHhhcchhheecCccCC--------------------------chhhcccccccc Confidence 2334433333332221 1111112233344433332211110 00111 000011 Q ss_pred hhHhh---hcccccCCCCccceeecCCCCCcchH-HHHHHHHHHHHHhcCCCccccCCCCcccccchhhhhHHhhhHHHH Q lcl|NC_015285. 134 SMMED---FWLPRREGGRGTEISTLPGGQNLGEL-EDVKYFQKKLYKALNVPSSRLETETTFNIGRAAEITRDEVKFQKF 209 (359) Q Consensus 134 SMlED---ywLpRReGgrgTEIsTLpGgqnLgei-~DV~YF~kkLy~aL~VP~SRl~~~~~~~~g~~~eItRDElKF~KF 209 (359) -.+.. +.......+.|..+..|-...+...+ .-+.-+.+.+|.-.++|---.+.-++ |. .+..|..-...-..- T Consensus 309 ~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~e~~~~~L~~~I~~~s~~P~~~~~~~~~-n~-Sg~Al~~~~~~l~~k 386 (511) T protein:vir:78 309 LFLEPTVYVDAEGRETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSG-TQ-SGEAMKYKLFGLEQR 386 (511) T ss_pred eeccccceeccccccCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhCCcccccccccc-cc-HHHHHHHHHHHHHHH Confidence 00000 00111122233445555544443332 34455677788888888532222111 11 122233222223444 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhcC----CCChhHHHHHhhceeeeeeccchHHHHHHHHHHHHHHHHHHHhhhhcchhhh Q lcl|NC_015285. 210 IARLRKRFSELFTDLLKTQLILKG----VMSLEEWEDMKNHIQFDFIADNYFTELKEIEIRNERMNQVAAMDPYVGKYFS 285 (359) Q Consensus 210 I~rLr~rFs~if~d~Lk~QLiLkg----I~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~~~~~dp~vGKy~S 285 (359) +.+.+..|..-+.+.++.=+-+-+ +-...+| ..+++.|...---.+ .+.++++..+. | .+| T Consensus 387 a~~~~~~f~~~l~~~~~li~~~~~~~~~~~~~~~~----~~i~~~f~~~~p~n~-------~e~~d~~~kl~---G-~iS 451 (511) T protein:vir:78 387 TKTKEGLFTKGLRRRAKLLETILKNTRSIDANKDF----NTVRYVYNRNLPKSL-------IEELKAYIDSG---G-KIS 451 (511) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccc----ccceEEeCCCCCcCH-------HHHHHHHHHHh---c-cCC Confidence 555666666666665554222222 2223333 356777865333333 23445555553 4 379 Q ss_pred HHHHHHHHhCCCHHHHHHHHHHHHHHHhcCCCCCCcchhhhcCCCCCcccccccCCCCCcCCCCCCCCCccCC Q lcl|NC_015285. 286 VDYMRRQVLKQTEIEIKEIDEQIASEMEAGIIADPMAEMDPAMAAGGEGAPAAEVDPNAQESSVDPGDVRRGE 358 (359) Q Consensus 286 ~~~i~k~IL~~tDeeI~e~~kqi~~E~~~~~~~~P~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~ 358 (359) .++++.. |...++ .+++.++|++|..+. .+ ..+ .+.+..+++....+-++ ++. ..+-++| T Consensus 452 ~et~l~~-l~~v~d-~~~El~ri~~E~~~~-~~---~~~-~~~~~~~~~~~~~~~~~---~~~---~~~~e~~ 511 (511) T protein:vir:78 452 QTTLMSL-FSFFQD-PELEVKKIEEDEKES-IK---KAQ-KGIYKDPRDINDDEQDD---DTK---DTVDKKE 511 (511) T ss_pred hHHHHHh-CCCCCC-HHHHHHHHHHHHHHH-HH---HHh-hccccCCCCCCCCCCCC---Ccc---CcccccC Confidence 9999976 666542 444555666664431 11 001 11111111111111111 111 1111122 No 99 >protein:vir:96366 Length: 511 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1613 # MgeName: 53 # Cross-refs: genbank:acc:YP_239644;genbank:gi:66395376;genbank:GeneID:5132842 Probab=63.05 E-value=0.32 Score=23.27 Aligned_cols=302 Identities=10% Similarity=0.045 Sum_probs=103.5 Q ss_pred CCCchhhHHHhhh----------------hhhheeeccccccc---cCCCceeec-HhHhhhhhc-cccc---C-CCCcc Q lcl|NC_015285. 1 MRGVDLNQQLTQK----------------AAEYFLYNPKGLKN---STNQGMKIT-TDSVTYCHS-GIQD---L-NKNMT 55 (359) Q Consensus 1 ~~~~~~~~~~~~~----------------~~e~f~yn~~~~~~---~~~~~v~i~-~~ai~y~hS-Gl~d---~-~~~~i 55 (359) +-++....+.+-+ +.-+-+|++...+. ..+.+.... ...-...|. |-++ . ++..- T Consensus 175 v~dd~~~~~~~~~vr~~~~~~~~~~~~~~~~~~~vyt~~~i~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~g 254 (511) T protein:vir:96 175 IYDNTVERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTNRTNGLKLTPRENSFESHSFERMPITEFSNNERR 254 (511) T ss_pred EEcCCCCCceEEEEEEEEeeeccccccceEEEEEEEeCCcEEEEEecCCCcccccccccccccCcCcccceEEecCCCCC Confidence 1111111111111 11122555554321 111111110 000001111 1111 0 11111 Q ss_pred hhhHHHHHHHHHHHHH-HHHHHHHHHHhcCccceeEeccCCCCchHHHHHHHHHHHHhhcceEEeeCCCCccc-ccccch Q lcl|NC_015285. 56 LSHLHKAIKAVNQLRM-IEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKMVYDANTGEIK-DDKKFM 133 (359) Q Consensus 56 ~syL~~Aik~~NqL~m-~EDalVIyR~~RAPeRRvFyIDvGnlpk~KAeqYl~~iM~kyrnklvYD~~TGevk-dd~~~m 133 (359) .|=++..+.....+.. +-+....-+.++.|-+-+.-....+ .++++ +....+ T Consensus 255 ~gd~e~v~~liDa~~~~~S~~~~~~~~~~~~~lv~~G~~~~~--------------------------~~~~~~~~~~~~ 308 (511) T protein:vir:96 255 KGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLNLD--------------------------PVEVRKQKEANV 308 (511) T ss_pred CCchhhhHHHHHHHHHHHHHHHHHHHHhhcchhheecCccCC--------------------------chhhcccccccc Confidence 2334433333332221 1111112233344433332211110 00111 000011 Q ss_pred hhHhh---hcccccCCCCccceeecCCCCCcchH-HHHHHHHHHHHHhcCCCccccCCCCcccccchhhhhHHhhhHHHH Q lcl|NC_015285. 134 SMMED---FWLPRREGGRGTEISTLPGGQNLGEL-EDVKYFQKKLYKALNVPSSRLETETTFNIGRAAEITRDEVKFQKF 209 (359) Q Consensus 134 SMlED---ywLpRReGgrgTEIsTLpGgqnLgei-~DV~YF~kkLy~aL~VP~SRl~~~~~~~~g~~~eItRDElKF~KF 209 (359) -.+.. +.......+.|..+..|-...+...+ .-+.-+.+.+|.-.++|---.+.-++ |. .+..|..-...-..- T Consensus 309 ~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~e~~~~~L~~~I~~~s~~P~~~~~~~~~-n~-Sg~Al~~~~~~l~~k 386 (511) T protein:vir:96 309 LFLEPTVYVDAEGRETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSG-TQ-SGEAMKYKLFGLEQR 386 (511) T ss_pred eeccccceeccccccCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhCCcccccccccc-cc-HHHHHHHHHHHHHHH Confidence 00000 00111122233445555544443332 34455677788888888532222111 11 122233222223444 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhcC----CCChhHHHHHhhceeeeeeccchHHHHHHHHHHHHHHHHHHHhhhhcchhhh Q lcl|NC_015285. 210 IARLRKRFSELFTDLLKTQLILKG----VMSLEEWEDMKNHIQFDFIADNYFTELKEIEIRNERMNQVAAMDPYVGKYFS 285 (359) Q Consensus 210 I~rLr~rFs~if~d~Lk~QLiLkg----I~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~~~~~dp~vGKy~S 285 (359) +.+.+..|..-+.+.++.=+-+-+ +-...+| ..+++.|...---.+ .+.++++..+. | .+| T Consensus 387 a~~~~~~f~~~l~~~~~li~~~~~~~~~~~~~~~~----~~i~~~f~~~~p~n~-------~e~~d~~~kl~---G-~iS 451 (511) T protein:vir:96 387 TKTKEGLFTKGLRRRAKLLETILKNTRSIDANKDF----NTVRYVYNRNLPKSL-------IEELKAYIDSG---G-KIS 451 (511) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccccc----ccceEEeCCCCCcCH-------HHHHHHHHHHh---c-cCC Confidence 555666666666665554222222 2223333 356777865333333 23445555553 4 379 Q ss_pred HHHHHHHHhCCCHHHHHHHHHHHHHHHhcCCCCCCcchhhhcCCCCCcccccccCCCCCcCCCCCCCCCccCC Q lcl|NC_015285. 286 VDYMRRQVLKQTEIEIKEIDEQIASEMEAGIIADPMAEMDPAMAAGGEGAPAAEVDPNAQESSVDPGDVRRGE 358 (359) Q Consensus 286 ~~~i~k~IL~~tDeeI~e~~kqi~~E~~~~~~~~P~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~ 358 (359) .++++.. |...++ .+++.++|++|..+. .+ ..+ .+.+..+++....+-++ ++. ..+-++| T Consensus 452 ~et~l~~-l~~v~d-~~~El~ri~~E~~~~-~~---~~~-~~~~~~~~~~~~~~~~~---~~~---~~~~e~~ 511 (511) T protein:vir:96 452 QTTLMSL-FSFFQD-PELEVKKIEEDEKES-IK---KAQ-KGIYKDPRDINDDEQDD---DTK---DTVDKKE 511 (511) T ss_pred hHHHHHh-CCCCCC-HHHHHHHHHHHHHHH-HH---HHh-hccccCCCCCCCCCCCC---Ccc---CcccccC Confidence 9999976 666542 444555666664431 11 001 11111111111111111 111 1111122 No 100 >protein:vir:9408 Length: 441 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:167 # MgeName: phi 13 # Cross-refs: genbank:acc:NP_803386;genbank:gi:29028698;genbank:GeneID:1258164 Probab=61.75 E-value=0.35 Score=23.10 Aligned_cols=284 Identities=13% Similarity=0.125 Sum_probs=111.6 Q ss_pred CCCc--------hhhHHHhhhhhhheeeccccc----------------cccC--CCceeecHhHhhhhhcccccCCCCc Q lcl|NC_015285. 1 MRGV--------DLNQQLTQKAAEYFLYNPKGL----------------KNST--NQGMKITTDSVTYCHSGIQDLNKNM 54 (359) Q Consensus 1 ~~~~--------~~~~~~~~~~~e~f~yn~~~~----------------~~~~--~~~v~i~~~ai~y~hSGl~d~~~~~ 54 (359) +.++ ..| ...+.+..+|... .... .....++.+-|.++. ..+.++=. T Consensus 130 l~Gnay~~i~r~~~G-----~~~~L~~i~~~~v~v~~d~~g~~~~~~~~~~~~~~~~~~~~~~~dvih~k--~~~~dg~~ 202 (441) T protein:vir:94 130 LTSHGYIEITRDKTG-----EPMNLTFRKTSEIELKSDARGRLYYFHQRIDSNGNNIERNVKFEDMLDIK--FYSLDGIN 202 (441) T ss_pred hcCCeEEEEEECCCC-----cEEEEEEEcCceeEEEECCCccEEEEEEEeccCCceeEEEEccccEEEec--cCCCCCcc Confidence 1111 111 0112222221110 0011 112345666665442 12333323 Q ss_pred chhhHHHHHHHHHHHHHHHHHHHHHHHhcCccceeEeccCCCCchHHHHHHHHHHHHhhcceEEeeCCCCcccccccchh Q lcl|NC_015285. 55 TLSHLHKAIKAVNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKMVYDANTGEIKDDKKFMS 134 (359) Q Consensus 55 i~syL~~Aik~~NqL~m~EDalVIyR~~RAPeRRvFyIDvGnlpk~KAeqYl~~iM~kyrnklvYD~~TGevkdd~~~mS 134 (359) =+|-|+.|.+++.....+++...=+=---+--+-|..++ |.+...+|.+=+++-+++.- +| ..+..+.+ T Consensus 203 G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~-~~~~~~e~~e~~r~~~~~~~--------~G-~~nag~~~- 271 (441) T protein:vir:94 203 GLSLLDTLSRTIESDNNGKDFLNNFLRNGTHAGGILKMK-GVLDNKKARDRAREEFHKSF--------SG-TKQAGKVV- 271 (441) T ss_pred ccCHHHHHHHHHHHHHHHHHHHHHHHhccCCCcEEEEcC-CCCCCHHHHHHHHHHHHHHh--------cC-ccccCcce- Confidence 367788888888877777776654433445566777777 45544444333443222211 12 11111121 Q ss_pred hHhhhcccccCCCCccceeecCCC-CCcchHHHHHHHHHHHHHhcCCCccccCCCCcccccchhhhhHHhhhHHHHHHHH Q lcl|NC_015285. 135 MMEDFWLPRREGGRGTEISTLPGG-QNLGELEDVKYFQKKLYKALNVPSSRLETETTFNIGRAAEITRDEVKFQKFIARL 213 (359) Q Consensus 135 MlEDywLpRReGgrgTEIsTLpGg-qnLgei~DV~YF~kkLy~aL~VP~SRl~~~~~~~~g~~~eItRDElKF~KFI~rL 213 (359) .++ .|.+++.|.=. ..+.-++-.++..+.+.++++||.+.|+..+. + +.+.-..+-|..-+.-+ T Consensus 272 vl~----------~G~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~-~----~s~~q~~~~~~~tl~P~ 336 (441) T protein:vir:94 272 VLD----------ESMTFDQLEVDTEVLKLIRENKSSTREIAGVFGIPLHKFGIETA-N----MSITDANLDYLSTLKPY 336 (441) T ss_pred ecC----------CCceEEEccCChhHHHHHHHHHHhHHHHHHHhCCCHHHcCCCCC-C----ccHHHHHHHHHHHHHHH Confidence 222 14455555321 11222344567788899999999999964321 1 11121223344434444 Q ss_pred HHHHHHHHHHHHHHHHHhcCCCChhHHHHHhhceeeeeeccchHHHHHHHHHHHHHHHHHHHhhhhcchhhhHHHHHHHH Q lcl|NC_015285. 214 RKRFSELFTDLLKTQLILKGVMSLEEWEDMKNHIQFDFIADNYFTELKEIEIRNERMNQVAAMDPYVGKYFSVDYMRRQV 293 (359) Q Consensus 214 r~rFs~if~d~Lk~QLiLkgI~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~~~~~dp~vGKy~S~~~i~k~I 293 (359) -.++...+. ..| .++ +. ...+.|.-+ ++...+ ...|.+.++.+-.- -+++.+-++.. T Consensus 337 ~~~ie~eln----~kl-----~~~--~~----~~~~~fd~~----~llr~D-~~~~~~~~~~~i~~--G~~T~NE~R~~- 393 (441) T protein:vir:94 337 ITCVCAELN----FKF-----NDE--YV----NREFKFDTT----EIRVVD-EKTQAEIDKINIDS--GKMNIDEIRQR- 393 (441) T ss_pred HHHHHHHHh----hhc-----ccc--cc----CceEEeech----hhhccC-HHHHHHHHHHHHhC--CCcCHHHHHHH- Confidence 343333333 222 121 21 233444322 222222 23455555444221 23343333321 Q ss_pred hCCCHHHHHHHHHHHHHHHhcCCCCCCcchhhhcCCCCCcccccccCCCCCcCCC-CCCCCCccCC Q lcl|NC_015285. 294 LKQTEIEIKEIDEQIASEMEAGIIADPMAEMDPAMAAGGEGAPAAEVDPNAQESS-VDPGDVRRGE 358 (359) Q Consensus 294 L~~tDeeI~e~~kqi~~E~~~~~~~~P~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~p~~~~~~~ 358 (359) +++ +.+++++...-....-..+...+++.++..+... .+-+.+-++| T Consensus 394 ~gl------------------~Pi~ggd~~~~~~~~n~~~~~~~~~~~~~~~~~~~~~~kgGe~~e 441 (441) T protein:vir:94 394 DGL------------------APIPGGNGSIHRVDLNHVNIELVDEYQMNKSRATDKKLKGGEENE 441 (441) T ss_pred hCC------------------CCCCCCCcceEeecccccccccccccccccccccccccCCCCCCC Confidence 222 3333333211111111111111222222221111 1122233333 No 101 >protein:vir:79984 Length: 441 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1875 # MgeName: tp310-3 # Cross-refs: genbank:acc:YP_001430000;genbank:gi:156604055;genbank:GeneID:5525444 Probab=61.75 E-value=0.35 Score=23.10 Aligned_cols=284 Identities=13% Similarity=0.125 Sum_probs=111.6 Q ss_pred CCCc--------hhhHHHhhhhhhheeeccccc----------------cccC--CCceeecHhHhhhhhcccccCCCCc Q lcl|NC_015285. 1 MRGV--------DLNQQLTQKAAEYFLYNPKGL----------------KNST--NQGMKITTDSVTYCHSGIQDLNKNM 54 (359) Q Consensus 1 ~~~~--------~~~~~~~~~~~e~f~yn~~~~----------------~~~~--~~~v~i~~~ai~y~hSGl~d~~~~~ 54 (359) +.++ ..| ...+.+..+|... .... .....++.+-|.++. ..+.++=. T Consensus 130 l~Gnay~~i~r~~~G-----~~~~L~~i~~~~v~v~~d~~g~~~~~~~~~~~~~~~~~~~~~~~dvih~k--~~~~dg~~ 202 (441) T protein:vir:79 130 LTSHGYIEITRDKTG-----EPMNLTFRKTSEIELKSDARGRLYYFHQRIDSNGNNIERNVKFEDMLDIK--FYSLDGIN 202 (441) T ss_pred hcCCeEEEEEECCCC-----cEEEEEEEcCceeEEEECCCccEEEEEEEeccCCceeEEEEccccEEEec--cCCCCCcc Confidence 1111 111 0112222221110 0011 112345666665442 12333323 Q ss_pred chhhHHHHHHHHHHHHHHHHHHHHHHHhcCccceeEeccCCCCchHHHHHHHHHHHHhhcceEEeeCCCCcccccccchh Q lcl|NC_015285. 55 TLSHLHKAIKAVNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKMVYDANTGEIKDDKKFMS 134 (359) Q Consensus 55 i~syL~~Aik~~NqL~m~EDalVIyR~~RAPeRRvFyIDvGnlpk~KAeqYl~~iM~kyrnklvYD~~TGevkdd~~~mS 134 (359) =+|-|+.|.+++.....+++...=+=---+--+-|..++ |.+...+|.+=+++-+++.- +| ..+..+.+ T Consensus 203 G~spl~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~-~~~~~~e~~e~~r~~~~~~~--------~G-~~nag~~~- 271 (441) T protein:vir:79 203 GLSLLDTLSRTIESDNNGKDFLNNFLRNGTHAGGILKMK-GVLDNKKARDRAREEFHKSF--------SG-TKQAGKVV- 271 (441) T ss_pred ccCHHHHHHHHHHHHHHHHHHHHHHHhccCCCcEEEEcC-CCCCCHHHHHHHHHHHHHHh--------cC-ccccCcce- Confidence 367788888888877777776654433445566777777 45544444333443222211 12 11111121 Q ss_pred hHhhhcccccCCCCccceeecCCC-CCcchHHHHHHHHHHHHHhcCCCccccCCCCcccccchhhhhHHhhhHHHHHHHH Q lcl|NC_015285. 135 MMEDFWLPRREGGRGTEISTLPGG-QNLGELEDVKYFQKKLYKALNVPSSRLETETTFNIGRAAEITRDEVKFQKFIARL 213 (359) Q Consensus 135 MlEDywLpRReGgrgTEIsTLpGg-qnLgei~DV~YF~kkLy~aL~VP~SRl~~~~~~~~g~~~eItRDElKF~KFI~rL 213 (359) .++ .|.+++.|.=. ..+.-++-.++..+.+.++++||.+.|+..+. + +.+.-..+-|..-+.-+ T Consensus 272 vl~----------~G~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~-~----~s~~q~~~~~~~tl~P~ 336 (441) T protein:vir:79 272 VLD----------ESMTFDQLEVDTEVLKLIRENKSSTREIAGVFGIPLHKFGIETA-N----MSITDANLDYLSTLKPY 336 (441) T ss_pred ecC----------CCceEEEccCChhHHHHHHHHHHhHHHHHHHhCCCHHHcCCCCC-C----ccHHHHHHHHHHHHHHH Confidence 222 14455555321 11222344567788899999999999964321 1 11121223344434444 Q ss_pred HHHHHHHHHHHHHHHHHhcCCCChhHHHHHhhceeeeeeccchHHHHHHHHHHHHHHHHHHHhhhhcchhhhHHHHHHHH Q lcl|NC_015285. 214 RKRFSELFTDLLKTQLILKGVMSLEEWEDMKNHIQFDFIADNYFTELKEIEIRNERMNQVAAMDPYVGKYFSVDYMRRQV 293 (359) Q Consensus 214 r~rFs~if~d~Lk~QLiLkgI~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~~~~~dp~vGKy~S~~~i~k~I 293 (359) -.++...+. ..| .++ +. ...+.|.-+ ++...+ ...|.+.++.+-.- -+++.+-++.. T Consensus 337 ~~~ie~eln----~kl-----~~~--~~----~~~~~fd~~----~llr~D-~~~~~~~~~~~i~~--G~~T~NE~R~~- 393 (441) T protein:vir:79 337 ITCVCAELN----FKF-----NDE--YV----NREFKFDTT----EIRVVD-EKTQAEIDKINIDS--GKMNIDEIRQR- 393 (441) T ss_pred HHHHHHHHh----hhc-----ccc--cc----CceEEeech----hhhccC-HHHHHHHHHHHHhC--CCcCHHHHHHH- Confidence 343333333 222 121 21 233444322 222222 23455555444221 23343333321 Q ss_pred hCCCHHHHHHHHHHHHHHHhcCCCCCCcchhhhcCCCCCcccccccCCCCCcCCC-CCCCCCccCC Q lcl|NC_015285. 294 LKQTEIEIKEIDEQIASEMEAGIIADPMAEMDPAMAAGGEGAPAAEVDPNAQESS-VDPGDVRRGE 358 (359) Q Consensus 294 L~~tDeeI~e~~kqi~~E~~~~~~~~P~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~p~~~~~~~ 358 (359) +++ +.+++++...-....-..+...+++.++..+... .+-+.+-++| T Consensus 394 ~gl------------------~Pi~ggd~~~~~~~~n~~~~~~~~~~~~~~~~~~~~~~kgGe~~e 441 (441) T protein:vir:79 394 DGL------------------APIPGGNGSIHRVDLNHVNIELVDEYQMNKSRATDKKLKGGEENE 441 (441) T ss_pred hCC------------------CCCCCCCcceEeecccccccccccccccccccccccccCCCCCCC Confidence 222 3333333211111111111111222222221111 1122233333 No 102 >protein:vir:96579 Length: 576 # NCBI annotation: ORF012 # Family: family:all:2446 # MgeID: mge:1623 # MgeName: Twort # Cross-refs: genbank:acc:YP_238542;genbank:gi:66391267;genbank:GeneID:5130361 Probab=60.73 E-value=0.37 Score=22.97 Aligned_cols=304 Identities=11% Similarity=0.117 Sum_probs=117.4 Q ss_pred CCCchhhHHH-----hhhhhhheeecccc-----------------cc--ccCCCceeecHhHhhhhhccc-ccCC-CCc Q lcl|NC_015285. 1 MRGVDLNQQL-----TQKAAEYFLYNPKG-----------------LK--NSTNQGMKITTDSVTYCHSGI-QDLN-KNM 54 (359) Q Consensus 1 ~~~~~~~~~~-----~~~~~e~f~yn~~~-----------------~~--~~~~~~v~i~~~ai~y~hSGl-~d~~-~~~ 54 (359) +.++.--..+ ...+.+.+.-+|.. +. ......+.++.+.|++.--+. .|.. ++. T Consensus 176 l~Gna~~~i~~~rd~~g~~~~L~pl~p~~V~v~~~~dg~~~~~~~~~~~~~~~~~~~~~~~~dii~~~~~~~~d~~~~~~ 255 (576) T protein:vir:96 176 TYDQVNFEKVFNKKNATTMDKFIAVDPSTIFYATDKNGKIIKGGKRFVQVINKKVVASFTSREMAMGIRNPRTELSSSGY 255 (576) T ss_pred hcCCeEEEEEEecCCCCceEEEEEeCCceeEEEECCCCceeeeeeEEEEecCCceEEEecccceEEEeecCCCCcccCcc Confidence 1111000000 00112222222211 00 111223556666665421111 1111 223 Q ss_pred chhhHHHHHHHHHHHHHHHHHHHHHHHhcCccceeEeccCC-CCchHHHHHHHHHHHHhhcceEEeeCCCCcccccccch Q lcl|NC_015285. 55 TLSHLHKAIKAVNQLRMIEDSLVIYRLSRAPERRIFYIDVG-NLPKNKAEQYLREVMGRYRNKMVYDANTGEIKDDKKFM 133 (359) Q Consensus 55 i~syL~~Aik~~NqL~m~EDalVIyR~~RAPeRRvFyIDvG-nlpk~KAeqYl~~iM~kyrnklvYD~~TGevkdd~~~m 133 (359) =+|-|+.|.+++.....++....=+=---|.-+-|..++.+ .+.+..+++-.+.+-..|+.- ...|. .. T Consensus 256 G~Spi~~a~~~i~~~~~~~~~~~~~f~Ng~~p~giL~~~~~~~ls~e~~~~lr~~~~~~~~G~----~nag~------~p 325 (576) T protein:vir:96 256 GLSEVEIAMKQFIAYNNTETFNDRFFSHGGTTRGILQIKSEQQQSQRALENFKREWKSSFSGI----NGSWQ------VP 325 (576) T ss_pred cccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCCCHHHHHHHHHHHHHHhccc----ccccc------ce Confidence 36678888888887777777654443344566777777764 455555544444443334311 00111 11 Q ss_pred hhHhhhcccccCCCCccceeecCCCCC-cchHHHHHHHHHHHHHhcCCCccccCCCC-c----------ccccchhhhhH Q lcl|NC_015285. 134 SMMEDFWLPRREGGRGTEISTLPGGQN-LGELEDVKYFQKKLYKALNVPSSRLETET-T----------FNIGRAAEITR 201 (359) Q Consensus 134 SMlEDywLpRReGgrgTEIsTLpGgqn-Lgei~DV~YF~kkLy~aL~VP~SRl~~~~-~----------~~~g~~~eItR 201 (359) -+++ -|.+++.|.-... +.-++-.+|..+.+.++.+||...|+... + ++.....+..+ T Consensus 326 ~vl~----------~G~~~~~ls~~~~d~qfle~~~~~~~~Ia~afgVPp~~lG~~~~~~~~g~~~~~s~t~sn~e~~~~ 395 (576) T protein:vir:96 326 VVMA----------DDIKFVNMTPTANDMQFEKWLTYLINIISALYGIDPAEIGFPNRGGATGGKGGNTLNEADPGKKQQ 395 (576) T ss_pred eecC----------CCceEEeccCChhhHHHHHHHHHhHHHHHHHhCCCHHHccccccccccccccccccccccHHHHHH Confidence 1111 1456666643222 22244556788999999999999996431 1 12222222222 Q ss_pred HhhhHHHH-HHHHHHHHHHHHHHHHHHHHHhcCCCChhHHHHHhhceeeeeeccchHHHHHHHHHHHHHHHHHHHhhhhc Q lcl|NC_015285. 202 DEVKFQKF-IARLRKRFSELFTDLLKTQLILKGVMSLEEWEDMKNHIQFDFIADNYFTELKEIEIRNERMNQVAAMDPYV 280 (359) Q Consensus 202 DElKF~KF-I~rLr~rFs~if~d~Lk~QLiLkgI~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~~~~~dp~v 280 (359) . |.++ +.-+..++...|.. .|+ +. + ...+.++|.+...=+ |.+.++...... T Consensus 396 ~---f~~~tL~P~~~~ie~~ln~----~Ll-----~~--~---~~~~~~~f~r~d~~~----------~~e~~~~~~~~~ 448 (576) T protein:vir:96 396 Q---SQNKGLQPLLRFIEDLINT----HII-----SE--Y---SDKYVFQFVGGDTKS----------ELDKIKILQEEV 448 (576) T ss_pred H---HHHHHHHHHHHHHHHHHHh----hhc-----hh--c---cCceEEEeccCCHHH----------HHHHHHHHHHHh Confidence 2 4443 44444444444443 222 21 1 245677787664422 222222222111 Q ss_pred chhhhHHHHHHHHhCCCHHHHHHHHHHH--------HHHHhcCCCCCCcchhhhcCCCCCcccccccCCCCCcCCCCCCC Q lcl|NC_015285. 281 GKYFSVDYMRRQVLKQTEIEIKEIDEQI--------ASEMEAGIIADPMAEMDPAMAAGGEGAPAAEVDPNAQESSVDPG 352 (359) Q Consensus 281 GKy~S~~~i~k~IL~~tDeeI~e~~kqi--------~~E~~~~~~~~P~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~ 352 (359) .-+++..-++. .+++..-+ --++.+ ......+-- +++.+.++.... ......++.++ +.+..++ T Consensus 449 ~G~lT~NE~R~-~~gl~pie--gGD~~~~~~~~~~~~~~~~~~~~-e~~~~~~~~~~~-~~~~~~~~~~~---~~~~s~~ 520 (576) T protein:vir:96 449 KTYKTVNEARK-EKGLKPIE--GGDVLLDGSFIQSMSLNTQKEQY-EDTKQKERFDMI-QQFLNSPDDEE---PQQESTE 520 (576) T ss_pred cCccCHHHHHH-HhCCCCCC--CcceeccccccccccccccCCCC-CCcccccccccc-ccccCCCCCCC---CCCCCCC Confidence 12556666654 46665421 000000 000000000 000000000000 00000000011 1111112 Q ss_pred CCccCCC Q lcl|NC_015285. 353 DVRRGEF 359 (359) Q Consensus 353 ~~~~~~~ 359 (359) +...|+- T Consensus 521 ~~~~g~~ 527 (576) T protein:vir:96 521 DKVDGRE 527 (576) T ss_pred Ccccccc Confidence 2222222 No 103 >protein:vir:80644 Length: 551 # NCBI annotation: gp23 # Family: family:all:2446 # MgeID: mge:1883 # MgeName: A511 # Cross-refs: genbank:acc:YP_001468463;genbank:gi:157325038;genbank:GeneID:5601615 Probab=59.69 E-value=0.39 Score=22.84 Aligned_cols=306 Identities=10% Similarity=0.121 Sum_probs=121.9 Q ss_pred CCCchhhHHHhhhhhhheeecccc-----------------c--cccCCCceeecHhHhhhhhcc-cccC-CCCcchhhH Q lcl|NC_015285. 1 MRGVDLNQQLTQKAAEYFLYNPKG-----------------L--KNSTNQGMKITTDSVTYCHSG-IQDL-NKNMTLSHL 59 (359) Q Consensus 1 ~~~~~~~~~~~~~~~e~f~yn~~~-----------------~--~~~~~~~v~i~~~ai~y~hSG-l~d~-~~~~i~syL 59 (359) .++ ..| .+.+.+..+|.. + ...+...+.++.+-|.|.+-. +.+. .+..=+|-| T Consensus 182 ~rd-~~G-----~~~~L~~l~p~~V~v~~~~~g~~~~~~~~y~~~~~g~~~~~~~~~eiiH~~~n~~~~~~~~~~G~spi 255 (551) T protein:vir:80 182 VFN-RNQ-----SMVRFVAKDPTTIFFATTADGKIPDNGNRFVQVIDQKIVATFNAREMAFAVRNPRSDIYATGYGYPEL 255 (551) T ss_pred EEC-CCC-----cEEEEEEeCCceeEEEECCccccccCceEEEEEeCCcEEEEEcccceEEecccCCCCcccccccccHH Confidence 111 111 112222222211 0 111223456777777765421 1111 122234678 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhcCccceeEeccC-CCCchHHHHHHHHHHHHhhcceEEeeCCCCcccccccchhhHhh Q lcl|NC_015285. 60 HKAIKAVNQLRMIEDSLVIYRLSRAPERRIFYIDV-GNLPKNKAEQYLREVMGRYRNKMVYDANTGEIKDDKKFMSMMED 138 (359) Q Consensus 60 ~~Aik~~NqL~m~EDalVIyR~~RAPeRRvFyIDv-Gnlpk~KAeqYl~~iM~kyrnklvYD~~TGevkdd~~~mSMlED 138 (359) +.|+..+.....++....=+----+--+-|..+.. .+|....+++.-+.+...|.-- ...|.+ .++. T Consensus 256 ~~a~~~i~~~~a~~~~~~~~f~Ng~~p~giL~~~~~~~lt~e~~~~lk~~~~~~~~G~----~nag~~-------~vl~- 323 (551) T protein:vir:80 256 EIALKQFIAHENTEAFNDRFFSHGGTTRGILQIKAAQQQSQHALEIFKREWKNSLSGI----NGSWQI-------PVVS- 323 (551) T ss_pred HHHHHHHHHHHHHHHHHHHHHHcCCCcceEEEEcCCCCCCHHHHHHHHHHHHHHhcCc----cccCcc-------cccc- Confidence 99999988888888776544333345566666654 3455554444444433333210 011211 1221 Q ss_pred hcccccCCCCccceeecCCCCCcchHHH---HHHHHHHHHHhcCCCccccCCCC--cccccchhhhhH-----HhhhHHH Q lcl|NC_015285. 139 FWLPRREGGRGTEISTLPGGQNLGELED---VKYFQKKLYKALNVPSSRLETET--TFNIGRAAEITR-----DEVKFQK 208 (359) Q Consensus 139 ywLpRReGgrgTEIsTLpGgqnLgei~D---V~YF~kkLy~aL~VP~SRl~~~~--~~~~g~~~eItR-----DElKF~K 208 (359) +.|.++..|. .+..++.- .+|..+..-++.+||...|+-.+ +.....++.+++ ....|.. T Consensus 324 --------~~g~~~~~l~--~~~~D~qfle~~~~~~~~Ia~aFgVPp~~lG~~~~~~~~~~~~~s~t~sn~e~~~~~f~~ 393 (551) T protein:vir:80 324 --------AEDVKFVNMT--PSARDMEFEKWLNYLINVISALYGIDPAEINIPNNGGATGSKGGSLNEGNSAEKNQASKN 393 (551) T ss_pred --------CCCceEEEcc--CChhHHHHHHHHHHHHHHHHHHhcCCHHHcCcccccccccccccccchhhHHHHHHHHHH Confidence 1134455553 33333333 45577889999999999996432 111111111221 1222433 Q ss_pred -HHHHHHHHHHHHHHHHHHHHHHhcCCCChhHHHHHhhceeeeeeccchHHHHHHHHHHHHHHHHHHHhhhhcchhhhHH Q lcl|NC_015285. 209 -FIARLRKRFSELFTDLLKTQLILKGVMSLEEWEDMKNHIQFDFIADNYFTELKEIEIRNERMNQVAAMDPYVGKYFSVD 287 (359) Q Consensus 209 -FI~rLr~rFs~if~d~Lk~QLiLkgI~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~~~~~dp~vGKy~S~~ 287 (359) -+.-+..++...|...|- ++. ...+.|+|.....-.+ .+|.++...+. .-+++.. T Consensus 394 ~tL~P~~~~ie~~ln~~L~---------~~~-----~~~~~f~f~~~~~~~~-------~~~~~~~~~~~---~g~lT~N 449 (551) T protein:vir:80 394 KGLQPLLGFIEDFINKHIV---------AEF-----GDKYTFQFVGGDIKSE-------LESVKILAEKA---KVAMTVN 449 (551) T ss_pred HHHHHHHHHHHHHHHhhhc---------ccc-----CCceEEEeeccChhhH-------HHHHHHHHHHh---cCCcCHH Confidence 355555555555554332 221 2457777776544333 23334333222 1246777 Q ss_pred HHHHHHhCCCH-HHH-H--------------HHHHHHHHHHhcCCCCCCcchhhhcCCCCCcccccc-----c----CCC Q lcl|NC_015285. 288 YMRRQVLKQTE-IEI-K--------------EIDEQIASEMEAGIIADPMAEMDPAMAAGGEGAPAA-----E----VDP 342 (359) Q Consensus 288 ~i~k~IL~~tD-eeI-~--------------e~~kqi~~E~~~~~~~~P~~~~~~~~~~~~~~~~~~-----~----~~~ 342 (359) -++.. +++.. .+- . ...++.+.+........+........+++++..+.+ . .+- T Consensus 450 E~R~~-~gl~P~~egGD~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~ 528 (551) T protein:vir:80 450 EVRKE-LNLPGDVIGGDIPLNGVIVQRIGQLMQQEQFEHEKQQSNLQMLQEQTGNRVSTDVEDIPDGKDTTGDIGKDGQR 528 (551) T ss_pred HHHHH-hCCCCCCCCCceeecccccccccccccccCcchhhhhhccccccCcCCCCCCCCCCCCCCccccCCCccccccc Confidence 77744 67754 110 0 000001111111111101000000000000000000 0 000 Q ss_pred CCcCCCCCCCCCccCCC Q lcl|NC_015285. 343 NAQESSVDPGDVRRGEF 359 (359) Q Consensus 343 ~~~~~~~~p~~~~~~~~ 359 (359) ......+.-+.+.+|.| T Consensus 529 ~~~~~~~~~~~~~~~~~ 545 (551) T protein:vir:80 529 KDKDNANAGKQGMKGDK 545 (551) T ss_pred cCccccchhhhhcCCCC Confidence 00111111222333333 No 104 >protein:vir:107112 Length: 478 # NCBI annotation: putative phage portal protein # Family: family:all:125 # MgeID: mge:1571 # MgeName: CNPH82 # Cross-refs: genbank:acc:YP_950601;genbank:gi:119953681;genbank:GeneID:4643121 Probab=59.19 E-value=0.4 Score=22.78 Aligned_cols=285 Identities=9% Similarity=0.056 Sum_probs=103.9 Q ss_pred CCC-chhhHH-----H--hhhhhhheeeccccccc-c------------CCCceeecHhHhhhhhc-cccc----CCCCc Q lcl|NC_015285. 1 MRG-VDLNQQ-----L--TQKAAEYFLYNPKGLKN-S------------TNQGMKITTDSVTYCHS-GIQD----LNKNM 54 (359) Q Consensus 1 ~~~-~~~~~~-----~--~~~~~e~f~yn~~~~~~-~------------~~~~v~i~~~ai~y~hS-Gl~d----~~~~~ 54 (359) +-+ +..+.- . .++...+-+|++..+.. . ...+...........|. |.++ +++.. T Consensus 164 v~d~~~~~~~~~~ir~~~~~~~~~~~~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~n~~~ 243 (478) T protein:vir:10 164 IWTNKERDELQAFIRVYELDGAERVEYWTKDDVTFYELKEGQLIPDFYRSEDHIQPHYYQGNKLMSWGRVPFIPFKNNPQ 243 (478) T ss_pred EEcCCCCCceEEEEEEEeeeCceEEEEEeCCcEEEEEecCCeeeccccccccccccceecccccccCCcceEEEeccCCC Confidence 000 000000 0 00111111222222110 0 00000000000111222 2221 12223 Q ss_pred chhhHHHHHHHHHHHH-HHHHHHHHHHHhcCccceeEeccCCCCchHHHHHHHHHHHHhhcceEEeeCCCCcccccccch Q lcl|NC_015285. 55 TLSHLHKAIKAVNQLR-MIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKMVYDANTGEIKDDKKFM 133 (359) Q Consensus 55 i~syL~~Aik~~NqL~-m~EDalVIyR~~RAPeRRvFyIDvGnlpk~KAeqYl~~iM~kyrnklvYD~~TGevkdd~~~m 133 (359) -.|-|+..+....-+. ++-+....-+-++.|-+-+.-.+..+.. + .... +..++ T Consensus 244 g~sd~e~v~~liDa~~~~~S~~~~~~~~~~~~~~~~~g~~~~~~~--~---~~~~-~~~~~------------------- 298 (478) T protein:vir:10 244 EVSDLFMYKTIIDALDKRLSDTQNTFDESVELIYILKGYEGEDMK--D---FMHN-LKYYK------------------- 298 (478) T ss_pred CCCcHHHHHHHHHHHHHHHHHHHHHHHHhhCcceeeecCCccccc--c---hhhh-hhhCc------------------- Confidence 3455555444444443 3334444445666665444322221111 0 0000 11111 Q ss_pred hhHhhhcccccCCCCccceeecCCCCCcchHH-HHHHHHHHHHHhcCCCccccCCCCcccccchh--hhhHHhhhHHHHH Q lcl|NC_015285. 134 SMMEDFWLPRREGGRGTEISTLPGGQNLGELE-DVKYFQKKLYKALNVPSSRLETETTFNIGRAA--EITRDEVKFQKFI 210 (359) Q Consensus 134 SMlEDywLpRReGgrgTEIsTLpGgqnLgei~-DV~YF~kkLy~aL~VP~SRl~~~~~~~~g~~~--eItRDElKF~KFI 210 (359) =+|++--+| .+++.|-...+...+. -+.-+.+.+|....+|- +..+ .|. |..| .|..-..-...-+ T Consensus 299 ----~~~~~~~~~---~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~--~~~~-~~~-~n~Sg~Ai~~~~~~l~~k~ 367 (478) T protein:vir:10 299 ----AISVAGESG---SGVDTIKVEVPIDSVKEYTKMLRDYIIEFGQGVD--FQQD-KFG-NSPSGIALKFMYSNLDLKA 367 (478) T ss_pred ----eeEecCCCC---CcceEEeecCCHHHHHHHHHHHHHHHHHHhCCcC--cCcc-ccc-cchHHHHHHHHHHHHHHHH Confidence 112322222 3356555444544433 36777888999999984 2111 121 2222 2332222233345 Q ss_pred HHHHHHHHHHHHHHHHHHHHhcCCCChhHHHHHhhceeeeeeccchHHHHHHHHHHHHHHHHHHHhhhhcchhhhHHHHH Q lcl|NC_015285. 211 ARLRKRFSELFTDLLKTQLILKGVMSLEEWEDMKNHIQFDFIADNYFTELKEIEIRNERMNQVAAMDPYVGKYFSVDYMR 290 (359) Q Consensus 211 ~rLr~rFs~if~d~Lk~QLiLkgI~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~~~~~dp~vGKy~S~~~i~ 290 (359) .+.+..|...+.++|+.=+-+.|+ ..+|. .|.+.|....--.+... +++++.+.+ .+|.++++ T Consensus 368 ~~~~~~~~~~l~~~~~li~~~~~~--~~d~~----~i~i~f~~~~p~~~~e~-------~~~~~~~~g----~iS~et~i 430 (478) T protein:vir:10 368 NKLKNKTLTALQELLQYIIDFYRL--DVRVQ----DIEITFNFNVMVNELEN-------SQIAMNSTG----LLSKETIL 430 (478) T ss_pred HHHHHHHHHHHHHHHHHHHHHhCC--Ccccc----cceEEeCCCCCCCHHHH-------HHHHHHHhC----CCChHHHH Confidence 566666666666665543333443 33443 46677754433333222 334444433 36999999 Q ss_pred HHHhCCCHHHHHHHHHHHHHHHhcCCCCCCcchhhhcCCCCCcccccccCCCCCcCCCCCCC Q lcl|NC_015285. 291 RQVLKQTEIEIKEIDEQIASEMEAGIIADPMAEMDPAMAAGGEGAPAAEVDPNAQESSVDPG 352 (359) Q Consensus 291 k~IL~~tDeeI~e~~kqi~~E~~~~~~~~P~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~ 352 (359) .. |...++ .+++.++|++|..+..-..|+ .+. +...+. ...+.+..++ T Consensus 431 ~~-~~~v~d-~~~E~~ri~~E~~~~~~~~~~------~~~--~~~d~~----~~~~~d~~~e 478 (478) T protein:vir:10 431 GN-HSWVQD-PVAEMERIEQENIELNQQLPD------IEE--GLNDEQ----QRQSEDNQSE 478 (478) T ss_pred Hh-CCCCCC-HHHHHHHHHHHHHHHHHhccc------cCC--CCcccc----cccCcCCCCC Confidence 76 454432 334444555554331100111 000 000000 0000111111 No 105 >protein:vir:8418 Length: 409 # NCBI annotation: gp13 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:155 # MgeName: Omega # Cross-refs: genbank:acc:NP_818314;genbank:gi:29566750;genbank:GeneID:1260067 Probab=59.01 E-value=0.4 Score=22.76 Aligned_cols=283 Identities=13% Similarity=0.135 Sum_probs=117.8 Q ss_pred CCCchhhHHH----hhhhhhheeecccccc--------------ccCCCceeecHhHhhhhhcccccCCCC-cchhhHHH Q lcl|NC_015285. 1 MRGVDLNQQL----TQKAAEYFLYNPKGLK--------------NSTNQGMKITTDSVTYCHSGIQDLNKN-MTLSHLHK 61 (359) Q Consensus 1 ~~~~~~~~~~----~~~~~e~f~yn~~~~~--------------~~~~~~v~i~~~ai~y~hSGl~d~~~~-~i~syL~~ 61 (359) +.++.--+-. -....+.+..+|.... .-...+..++.+-|.+.. ....++. .=+|-++. T Consensus 107 l~Gn~~~~i~~~~~~g~~~~L~~l~p~~v~v~~~~~~~~~~~~~~~~~~g~~~~~~dvih~~--~~~~~~~~~G~s~i~~ 184 (409) T protein:vir:84 107 VTGNAFGYISARDEANRPTAIMPIHPDCIHVTDAKDEDGDWIEPVYRIDGKVVPNHRIMHIK--RYPVAGCALGMSPIEK 184 (409) T ss_pred hcCCeEEEEEEECCCCceEEEEEEcCceeEEEEcCCCcceEEEEEecCCceEEchhhEEEec--CCCCCcccccccHHHH Confidence 1111100000 0111233333332210 011234556777666543 2222332 23677888 Q ss_pred HHHHHHHHHHHHHHHHHHHHhcCccceeEeccCCCCchHHHHHHHHHHHHhhcceEEeeCCCCcccccccchhhHhhhcc Q lcl|NC_015285. 62 AIKAVNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKMVYDANTGEIKDDKKFMSMMEDFWL 141 (359) Q Consensus 62 Aik~~NqL~m~EDalVIyR~~RAPeRRvFyIDvGnlpk~KAeqYl~~iM~kyrnklvYD~~TGevkdd~~~mSMlEDywL 141 (359) |.+.+.....+++...-+----+--+-+.-++ |+|.+..+++..+.....+.| .|.+ + .++ T Consensus 185 ~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~-~~l~~e~~~~~~~~~~~~~~n-------~g~~------~-vl~---- 245 (409) T protein:vir:84 185 AASAIGLGLAAERYGLRWFRDSANPSGILSSD-ADLTPDQVKQTQKQWIQSHHN-------RRLP------A-VMS---- 245 (409) T ss_pred HHHHHHHHHHHHHHHHHHHhcCCCccEEEecC-CCCCHHHHHHHHHHHHHHhcc-------CCCe------e-ecC---- Confidence 88888877777776654333333345555554 578777777766666555432 2321 1 111 Q ss_pred cccCCCCccceeecCCC-CCcchHHHHHHHHHHHHHhcCCCccccCCCCcccccchhhhhHHhhhHHHHHHHHHHHHHHH Q lcl|NC_015285. 142 PRREGGRGTEISTLPGG-QNLGELEDVKYFQKKLYKALNVPSSRLETETTFNIGRAAEITRDEVKFQKFIARLRKRFSEL 220 (359) Q Consensus 142 pRReGgrgTEIsTLpGg-qnLgei~DV~YF~kkLy~aL~VP~SRl~~~~~~~~g~~~eItRDElKF~KFI~rLr~rFs~i 220 (359) .|.+++.+--. ..+.-++-.++..+.+.++++||.+.|+...+-+.. ++.+.-.-+.|..++ |+--+. . T Consensus 246 ------~g~~~~~~~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~-~sn~e~~~~~f~~~~--l~P~~~-~ 315 (409) T protein:vir:84 246 ------AGIKWQSVSITPNESQFLETRSFQRSEIAMWFRIPPHMIGDVEKSTSW-GTGIEEQGINFVRHT--LLPWLR-C 315 (409) T ss_pred ------CCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCcccc-cchHHHHHHHHHHHH--HHHHHH-H Confidence 24445544321 122334555677899999999999999653322221 122222223354442 332222 2 Q ss_pred HHHHHHHHHHhcCCCChhHHHHHhhceeeeeeccchHHHHHHHHHHHHHHHHHHHhhhhcchhhhHHHHHHHHhCCCHHH Q lcl|NC_015285. 221 FTDLLKTQLILKGVMSLEEWEDMKNHIQFDFIADNYFTELKEIEIRNERMNQVAAMDPYVGKYFSVDYMRRQVLKQTEIE 300 (359) Q Consensus 221 f~d~Lk~QLiLkgI~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~~~~~dp~vGKy~S~~~i~k~IL~~tDee 300 (359) +.+.|-..| .+| ..|+|++ ..+...++ ..|++.+..+-.- -+++.+-++. ++++.+ T Consensus 316 ie~~l~~~L-~~g-----------~~i~fd~------~~l~~~d~-~~~~~~~~~~~~~--G~~t~NE~R~-~~g~~p-- 371 (409) T protein:vir:84 316 IEQALDTFL-PRG-----------QFVKFNV------DGLMRGDV-TARFTAYQMGLQN--GIWSVNEVRA-WEDAPP-- 371 (409) T ss_pred HHHHHHHhc-cCC-----------CeEEEec------hhhhccCH-HHHHHHHHHHHhC--CCcCHHHHHH-HhCCCC-- Confidence 233333332 222 2344443 23333322 4455555554322 2445554443 234422 Q ss_pred HHHHHHHHHHHHhcCCCCCCcchhhhcCCCCCcccccccCCCCCcCCCCCCCCCcc Q lcl|NC_015285. 301 IKEIDEQIASEMEAGIIADPMAEMDPAMAAGGEGAPAAEVDPNAQESSVDPGDVRR 356 (359) Q Consensus 301 I~e~~kqi~~E~~~~~~~~P~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~ 356 (359) ++.-+.---+..-.+.+..++ .+|...+.++...++.+ T Consensus 372 ----------------~~ggD~~~~~~n~~~~~~~~~--~~~~~~~~~~~~~~gn~ 409 (409) T protein:vir:84 372 ----------------IPEGDIHLQPMNFVPLGYVPP--EEPAQEPQPNSATEGNK 409 (409) T ss_pred ----------------CCCcceeeecccccccccCCc--cccCcCCCCCCccCCCC Confidence 111000000000011111111 23322222223223333 No 106 >protein:vir:97336 Length: 492 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1666 # MgeName: 52A # Cross-refs: genbank:acc:YP_240606;genbank:gi:66396273;genbank:GeneID:5133692 Probab=58.70 E-value=0.41 Score=22.72 Aligned_cols=283 Identities=11% Similarity=0.074 Sum_probs=103.9 Q ss_pred CCCchhhHHHhhh-hhhheeeccccccccC---CCceeecHhHhhhhhc-ccccC----CCCcchhhHHHHHHHHHHHH- Q lcl|NC_015285. 1 MRGVDLNQQLTQK-AAEYFLYNPKGLKNST---NQGMKITTDSVTYCHS-GIQDL----NKNMTLSHLHKAIKAVNQLR- 70 (359) Q Consensus 1 ~~~~~~~~~~~~~-~~e~f~yn~~~~~~~~---~~~v~i~~~ai~y~hS-Gl~d~----~~~~i~syL~~Aik~~NqL~- 70 (359) -.+..+.++++.. ...+|.+......... .+...+.. ..|. |.++. ++..-.|=++..+....-+. T Consensus 199 ~~~~~~~~~~y~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~g~vPvv~~~nn~~g~sd~e~v~~liDa~d~ 274 (492) T protein:vir:97 199 KLENETKVEYWDKVTVNYYVYENGSLIPDYSNNLENSKTHF----STGSWGKIPFIPFKNNDLEISDIFMYKTLIDAYNR 274 (492) T ss_pred eeccceeEEEEecCeEEEEEEecCeeeeccccccccccccc----ccCCCCCcceEEecCCCCCCCchHhHHHHHHHHHH Confidence 1112222222211 1112222221111100 00011110 1121 22111 11112333443333332222 Q ss_pred HHHHHHHHHHHhcCccceeEeccCCCCchHHHHHHHHHHHHhhcceEEeeCCCCcccccccchhhHhhhcccccCCCCcc Q lcl|NC_015285. 71 MIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKMVYDANTGEIKDDKKFMSMMEDFWLPRREGGRGT 150 (359) Q Consensus 71 m~EDalVIyR~~RAPeRRvFyIDvGnlpk~KAeqYl~~iM~kyrnklvYD~~TGevkdd~~~mSMlEDywLpRReGgrgT 150 (359) ++=+....-+-+.-|-+-+.-.+.-+.+ + +...+..++...-++ +. T Consensus 275 ~~S~~~~~~~~~~~~~l~~~g~~~~~~~-----~---------------------------~~~~~~~~~~~~~~~--~~ 320 (492) T protein:vir:97 275 RLSDLSNTFKDSNELTYVLKNYDDQELP-----E---------------------------FKRLLRYYGAIKVSD--NG 320 (492) T ss_pred HHHHHHHHHHHhccceeeeecCCcccch-----h---------------------------HHHHHhhccceecCC--CC Confidence 2333334445555554444322211111 1 111111111111111 22 Q ss_pred ceeecCCCCCcch-HHHHHHHHHHHHHhcCCCccccCCCCcccccchhhhhHHhhhHHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_015285. 151 EISTLPGGQNLGE-LEDVKYFQKKLYKALNVPSSRLETETTFNIGRAAEITRDEVKFQKFIARLRKRFSELFTDLLKTQL 229 (359) Q Consensus 151 EIsTLpGgqnLge-i~DV~YF~kkLy~aL~VP~SRl~~~~~~~~g~~~eItRDElKF~KFI~rLr~rFs~if~d~Lk~QL 229 (359) .+.+|-...+... ..-+.-+.+.+|+-..+|---.+.-++ +. .|..|.--+.....-+.+.++.|..-+.++++.=+ T Consensus 321 ~~~~l~~~~~~~~~~~~~~~L~~~I~~~s~~p~~~~~~~~~-n~-Sg~Al~~~~~~l~~ka~~~~~~f~~~l~~~~~li~ 398 (492) T protein:vir:97 321 GVDTIQVEVPVENSKKYLDELYQKIMLFGQAVDFSSDKFGS-AP-SGVALEFLYTNLNLKADKLARKAKVAIQELLWFVF 398 (492) T ss_pred cceeEeccCCHHHHHHHHHHHHHHHHHHhCCCCCCcccccc-Cc-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 3555544333332 233455666778888888421111111 11 12223333344555567777788887777777533 Q ss_pred HhcCCCChhHHHHHhhceeeeeeccchHHHHHHHHHHHHHHHHHHHhhhhcchhhhHHHHHHHHhCCCHHHHHHHHHHHH Q lcl|NC_015285. 230 ILKGVMSLEEWEDMKNHIQFDFIADNYFTELKEIEIRNERMNQVAAMDPYVGKYFSVDYMRRQVLKQTEIEIKEIDEQIA 309 (359) Q Consensus 230 iLkgI~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~~~~~dp~vGKy~S~~~i~k~IL~~tDeeI~e~~kqi~ 309 (359) -+-|+ ..+|. .|.+.|....--.+. +.++++..+. | .+|.+++++. |...++ .+++.++|+ T Consensus 399 ~~~~~--~~~~~----~i~v~f~~~~p~~~~-------e~a~~~~kl~---G-~iS~et~l~~-l~~v~d-~~~Eleri~ 459 (492) T protein:vir:97 399 EHFDI--KGEHK----DVDISFNYNKVANTE-------LQVQTAQQSM---G-IVSHETVLEN-HPFVED-LQAELERIE 459 (492) T ss_pred HHhcC--Ccccc----eeeEEecCCCCCCHH-------HHHHHHHHHh---c-cCchHHHHHh-CCCCCC-HHHHHHHHH Confidence 33343 33454 456667543332332 2345555554 3 3799999987 444332 334445555 Q ss_pred HHHhcCCCCCCcchhhhcCCCCCcccccccCCCCCcCCCCCCCCCccCC Q lcl|NC_015285. 310 SEMEAGIIADPMAEMDPAMAAGGEGAPAAEVDPNAQESSVDPGDVRRGE 358 (359) Q Consensus 310 ~E~~~~~~~~P~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~ 358 (359) +|..+. ++..+...+.+...+.-. ..| +....| T Consensus 460 ~E~~~~--------~~~~~~~~~~~~~~~~~~-------~~~-~~~~~e 492 (492) T protein:vir:97 460 QEQTEY--------NKQLPNLDDGGADSAQQQ-------ERS-NNKESE 492 (492) T ss_pred HHHHHH--------HHhhhccccCCCCCCccc-------ccc-cccccC Confidence 554321 111111111111111000 000 111111 No 107 >protein:vir:100882 Length: 383 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1473 # MgeName: Lc-Nu # Cross-refs: genbank:acc:YP_358762;genbank:gi:78000027;genbank:GeneID:3726153 Probab=58.46 E-value=0.41 Score=22.69 Aligned_cols=267 Identities=15% Similarity=0.141 Sum_probs=105.7 Q ss_pred CCCchhhHHHhhhhhhheeecc---------cc----c-cccCCCceeecHhHhhhhhcccccC-CCCcchhhHHHHHHH Q lcl|NC_015285. 1 MRGVDLNQQLTQKAAEYFLYNP---------KG----L-KNSTNQGMKITTDSVTYCHSGIQDL-NKNMTLSHLHKAIKA 65 (359) Q Consensus 1 ~~~~~~~~~~~~~~~e~f~yn~---------~~----~-~~~~~~~v~i~~~ai~y~hSGl~d~-~~~~i~syL~~Aik~ 65 (359) +.++. -+.++....+++--++ .+ . ....+..+.++.+-|.+...--.+. +...=+|.|..|+++ T Consensus 98 l~Gn~-~~~i~~~~~~~~p~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~evih~r~~~~~~~~~~~G~s~l~~~~~~ 176 (383) T protein:vir:10 98 LSGND-YIPLVGQNLEHIPNSDVQINYLPGNMGIVYTVLESNDRPKMVLRQDQMLHFRLMPDPQYRYLIGRSPLESLQNA 176 (383) T ss_pred hcCCe-EEEEEcCceeEeecCcceEEEEEcCCceEEEEEEcCCceEEEEcccceEEeccCCCCcccccccccHHHHHHHH Confidence 11110 0001111111110010 00 0 1122334677877776542100111 111226899999999 Q ss_pred HHHHHHHHHHHHHHHHhcCccceeEeccCCCCchHHHHHHHHHHHHhhcceEEeeCCCCcccccccchhhHhhhcccccC Q lcl|NC_015285. 66 VNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKMVYDANTGEIKDDKKFMSMMEDFWLPRRE 145 (359) Q Consensus 66 ~NqL~m~EDalVIyR~~RAPeRRvFyIDvGnlpk~KAeqYl~~iM~kyrnklvYD~~TGevkdd~~~mSMlEDywLpRRe 145 (359) +.....++....=+----+--+-+..++.+ +-..++.+=+++.++++..- ...|.+ + .+ T Consensus 177 i~~~~~~~~~~~~~f~ng~~~~~il~~~~~-~~~~e~~~~~~~~~~~~~~~----~n~~~~------~-vl--------- 235 (383) T protein:vir:10 177 LNLDDKASKSNMSAMENQINPAGKLTISNY-LSDGKDLESAREEFEKANTG----DNSGRL------M-VL--------- 235 (383) T ss_pred HHHHHHHHHHHHHHHhccCCcceEEEeCCC-CCCHHHHHHHHHHHHHHhCc----cccCCc------c-cc--------- Confidence 999888888765443333555566666644 33344444444445544321 112221 1 22 Q ss_pred CCCccceeecCCCCCcchHH---HH-HHHHHHHHHhcCCCccccCCCCcccccchhhhhHHhhhHHHHHHHHHHHHHHHH Q lcl|NC_015285. 146 GGRGTEISTLPGGQNLGELE---DV-KYFQKKLYKALNVPSSRLETETTFNIGRAAEITRDEVKFQKFIARLRKRFSELF 221 (359) Q Consensus 146 GgrgTEIsTLpGgqnLgei~---DV-~YF~kkLy~aL~VP~SRl~~~~~~~~g~~~eItRDElKF~KFI~rLr~rFs~if 221 (359) ..|.+++.|.- +..+++ +. ++-.+.+.++++||.+.|+...+-+.. .+.+.-...-|.+-+.-+-+.+ T Consensus 236 -~~g~~~~~l~~--~~~d~~~l~e~~~~~~~~Ia~afgVPp~~lg~~~~~~~~-~sn~eq~~~~~~~~l~P~~~~i---- 307 (383) T protein:vir:10 236 -PDGFDYTQLEM--KTDVFKALADNSAYSADQISKAFGVPSDILGGGTSTESQ-HSNIDQIKATYLANLNSYVNPI---- 307 (383) T ss_pred -CCCceEEecCC--ChhHHHHHHHHHHHHHHHHHHHhCCCHHHcCCccCCCCc-cccHHHHHHHHHHHHHHHHHHH---- Confidence 12667777754 333333 33 344688999999999999643211111 1111111122433333333333 Q ss_pred HHHHHHHHHhcCCCChhHHHHHhhceeeeeeccchHHHHHHHHHHHHHHHHHHHhhhhcchhhhHHHHHHHHhCCCHHHH Q lcl|NC_015285. 222 TDLLKTQLILKGVMSLEEWEDMKNHIQFDFIADNYFTELKEIEIRNERMNQVAAMDPYVGKYFSVDYMRRQVLKQTEIEI 301 (359) Q Consensus 222 ~d~Lk~QLiLkgI~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~~~~~dp~vGKy~S~~~i~k~IL~~tDeeI 301 (359) .+.|...|. + ..|+ |..+. +... -+..|.+.+..+-.- -+ ||..|+ T Consensus 308 e~~l~~~l~-----~--------~~~~--f~~~~----l~~~-d~~~~~~~~~~~~~~--G~------------~t~nE~ 353 (383) T protein:vir:10 308 VDELRLKMN-----A--------PDLE--LDIKD----MLDV-DDSILINQVSNLAKS--GV------------LGAEQA 353 (383) T ss_pred HHHHHHhhC-----C--------ceEE--eechh----hhcc-CHHHHHHHHHHHHhC--CC------------cCHHHH Confidence 333433332 1 1233 43222 1111 112344443333221 12 344444 Q ss_pred HHHHHHHHHHHhcCCCCCCcchhhhcCCCCCcccccccCCCCCcCCCCCCCCCc Q lcl|NC_015285. 302 KEIDEQIASEMEAGIIADPMAEMDPAMAAGGEGAPAAEVDPNAQESSVDPGDVR 355 (359) Q Consensus 302 ~e~~kqi~~E~~~~~~~~P~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~ 355 (359) -++ ...+.++.++ .+ .+..+..+..-|+-. T Consensus 354 R~~-------lg~~p~~~~d-------------~~----~~~~~~~~~~gGd~e 383 (383) T protein:vir:10 354 QFI-------LTRSGFLPDN-------------LP----EFKPLTNETKGGDDK 383 (383) T ss_pred HHH-------hCCCcccCCc-------------cc----ccCCCcccCCCCCCC Confidence 221 0111121111 00 111111122222222 No 108 >protein:vir:96240 Length: 511 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1607 # MgeName: 69 # Cross-refs: genbank:acc:YP_239567;genbank:gi:66395299;genbank:GeneID:5132789 Probab=58.32 E-value=0.42 Score=22.67 Aligned_cols=306 Identities=10% Similarity=0.022 Sum_probs=104.0 Q ss_pred CCCchhhHHHhhhh---------------hhhe-eeccccccc---cCCCceeecHhHh-hhhhc-ccccC----CCCcc Q lcl|NC_015285. 1 MRGVDLNQQLTQKA---------------AEYF-LYNPKGLKN---STNQGMKITTDSV-TYCHS-GIQDL----NKNMT 55 (359) Q Consensus 1 ~~~~~~~~~~~~~~---------------~e~f-~yn~~~~~~---~~~~~v~i~~~ai-~y~hS-Gl~d~----~~~~i 55 (359) +-++....+.+-.+ ..++ +|++...+. ..+........-+ ...|. |-++. ++..- T Consensus 175 vydd~~~~~~~~~vr~~~~~~~d~~~~~~~~~~~iyt~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~nn~~g 254 (511) T protein:vir:96 175 IYDNTIERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSFERMPITEFSNNERR 254 (511) T ss_pred EEcCCCCCceEEEEEEEEeeeccccccceEEEEEEEeCCcEEEEEecCCCcccccccccccccccCCceeeEEecCCCCC Confidence 11111111111111 1112 455544321 1111111110000 00111 11111 11222 Q ss_pred hhhHHHHHHHHHHHHHHH-HHHHHHHHhcCccceeEeccCCCCchHHHHHHHHHHHHhhcceEEeeCCCCccc-ccccch Q lcl|NC_015285. 56 LSHLHKAIKAVNQLRMIE-DSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKMVYDANTGEIK-DDKKFM 133 (359) Q Consensus 56 ~syL~~Aik~~NqL~m~E-DalVIyR~~RAPeRRvFyIDvGnlpk~KAeqYl~~iM~kyrnklvYD~~TGevk-dd~~~m 133 (359) .|-++..+.....+..+- +....-+-++.|-+-+.-.... .++++. +....+ T Consensus 255 ~gd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~g~~~~--------------------------~~~~~~~~~~~~~ 308 (511) T protein:vir:96 255 KGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLNL--------------------------DPVEVRKQKEANV 308 (511) T ss_pred CCchhhhHHHHHHHHHHHHHHHHHHHHhhCceeeeecCccC--------------------------Cchhhcccccccc Confidence 344554444444333221 1122223344443332210000 011111 111111 Q ss_pred hhHhhhccc---ccCCCCccceeecCCCCCcchH-HHHHHHHHHHHHhcCCCccccCCCCcccccchhhhhHHhhhHHHH Q lcl|NC_015285. 134 SMMEDFWLP---RREGGRGTEISTLPGGQNLGEL-EDVKYFQKKLYKALNVPSSRLETETTFNIGRAAEITRDEVKFQKF 209 (359) Q Consensus 134 SMlEDywLp---RReGgrgTEIsTLpGgqnLgei-~DV~YF~kkLy~aL~VP~SRl~~~~~~~~g~~~eItRDElKF~KF 209 (359) -.+...... ....+.|..+..|-...+...+ .-+.-+.+.+|.-.++|---.+.-++ |. .+..|..-...-..- T Consensus 309 ~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~e~~~~~L~~~I~~~s~~p~~~~~~~~~-n~-Sg~Al~~~~~~l~~k 386 (511) T protein:vir:96 309 LFLEPTVYADSEGRETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFSG-TQ-SGEAMKYKLFGLEQR 386 (511) T ss_pred eecccccccccccccCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhCCcccccccccc-cc-hHHHHHHHHHHHHHH Confidence 111111100 1112223456666554444433 33456677888888888532211111 11 122233222223344 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhcCCCChhHHHHHhhceeeeeeccchHHHHHHHHHHHHHHHHHHHhhhhcchhhhHHHH Q lcl|NC_015285. 210 IARLRKRFSELFTDLLKTQLILKGVMSLEEWEDMKNHIQFDFIADNYFTELKEIEIRNERMNQVAAMDPYVGKYFSVDYM 289 (359) Q Consensus 210 I~rLr~rFs~if~d~Lk~QLiLkgI~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~~~~~dp~vGKy~S~~~i 289 (359) +.+.+..|..-+...++.=+-+-++....+++.-...+++.|...---.+ .+.++++..+. | .+|.+++ T Consensus 387 ~~~k~~~~~~~l~~~~~li~~~~~~~~~~~~~~d~~~i~~~f~~~~p~n~-------~e~~~~~~kl~---G-~iS~et~ 455 (511) T protein:vir:96 387 TKTKEGLFTKGLRRRAKLLETILKNTWSIDANKDFNTVRYVYNRNLPKSL-------IEELKAYIDSG---G-KISQTTL 455 (511) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhcCcccccccccceEEeCCCCCCCH-------HHHHHHHHHHh---c-cCChHHH Confidence 55555555555555554422222222222222223356777864333223 23345555553 3 3799999 Q ss_pred HHHHhCCCHHHHHHHHHHHHHHHhcCCCCCCcchhhhcCCCCCcccccccCCCCCcCCCCCCCCCccCC Q lcl|NC_015285. 290 RRQVLKQTEIEIKEIDEQIASEMEAGIIADPMAEMDPAMAAGGEGAPAAEVDPNAQESSVDPGDVRRGE 358 (359) Q Consensus 290 ~k~IL~~tDeeI~e~~kqi~~E~~~~~~~~P~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~ 358 (359) ++. |...++ .+++.++|++|..+. .+ ..++..++. +++...++-+++ +.+ .+.++| T Consensus 456 l~~-l~~v~D-~~~E~~ri~~E~~~~-~~---~~~~~~~~~-~~~~~~~~~~~~---~~~---~~~~~~ 511 (511) T protein:vir:96 456 MSL-FSFFQD-PELEVKKIEEDEKES-IK---KAQKGIYKD-PRDINDDEQDDD---TKD---TVDKKE 511 (511) T ss_pred HHh-CCCCCC-HHHHHHHHHHHHHHH-HH---HHhhccccC-CCCCCCCCCCCc---ccc---cccccC Confidence 976 565442 344455566664431 11 111111111 111111111111 111 111222 No 109 >protein:vir:102118 Length: 409 # NCBI annotation: phage portal protein, HK97 family # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1641 # MgeName: phiSM101 # Cross-refs: genbank:acc:YP_699943;genbank:gi:110804051;genbank:GeneID:4206661 Probab=54.88 E-value=0.49 Score=22.27 Aligned_cols=274 Identities=14% Similarity=0.180 Sum_probs=115.1 Q ss_pred CCC--------chhhHHHhhhhhhheeeccccc--------------------cccCCCceeecHhHhhhhhcccccCCC Q lcl|NC_015285. 1 MRG--------VDLNQQLTQKAAEYFLYNPKGL--------------------KNSTNQGMKITTDSVTYCHSGIQDLNK 52 (359) Q Consensus 1 ~~~--------~~~~~~~~~~~~e~f~yn~~~~--------------------~~~~~~~v~i~~~ai~y~hSGl~d~~~ 52 (359) +.+ +..|. ..+.+..+|... ....+....++.+-|.+... . +.++ T Consensus 105 l~Gna~~~i~r~~~G~-----~~~L~~i~~~~V~v~~~~~~~~~~~~~~~y~~~~~~g~~~~~~~~evih~r~-~-~~d~ 177 (409) T protein:vir:10 105 IYGNAYVALDFKKNGE-----IKGLYPLKSDGMKIFVDDTGLLNSENNVWYLYTDDLGQRHKFMSDEILHFKG-L-TADG 177 (409) T ss_pred hcCCeEEEEEEcCCCc-----EEEEEEEcCCceEEEEcCCccccccceEEEEEEeCCceeEEeccccEEEecC-c-CCCC Confidence 111 11111 112222222110 01123446788888877652 2 3344 Q ss_pred CcchhhHHHHHHHHHHHHHHHHHHHHHHHhcCccceeEeccCCCCchHHHHHHHHHHHHhhcceEEeeCCCCcccccccc Q lcl|NC_015285. 53 NMTLSHLHKAIKAVNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKMVYDANTGEIKDDKKF 132 (359) Q Consensus 53 ~~i~syL~~Aik~~NqL~m~EDalVIyR~~RAPeRRvFyIDvGnlpk~KAeqYl~~iM~kyrnklvYD~~TGevkdd~~~ 132 (359) -.-+|-|+.|..++.....+++...=+=--.+.-+-|..++ +.+.+..+++ +++.+++...-. ...|. . T Consensus 178 ~~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~-~~l~~e~~~~-~~~~~~~~~~g~---~n~~~------~ 246 (409) T protein:vir:10 178 LAGLSVIELLNHLIENGKSSETYLNNFFKNGLQVKGLVQYA-GDLNPEAEEV-FKENFERMSSGL---KNAHR------I 246 (409) T ss_pred cccccHHHHHHHHHHHHHHHHHHHHHHHhccCCCcEEEEcC-CCCCHHHHHH-HHHHHHHHhccc---cccCC------c Confidence 34568888888888887777776554433344556777776 4566555544 333333321110 01121 1 Q ss_pred hhhHhhhcccccCCCCccceeecCC-CCCcchHHHHHHHHHHHHHhcCCCccccCCCCcccccchhhhhHHhhhHHHH-H Q lcl|NC_015285. 133 MSMMEDFWLPRREGGRGTEISTLPG-GQNLGELEDVKYFQKKLYKALNVPSSRLETETTFNIGRAAEITRDEVKFQKF-I 210 (359) Q Consensus 133 mSMlEDywLpRReGgrgTEIsTLpG-gqnLgei~DV~YF~kkLy~aL~VP~SRl~~~~~~~~g~~~eItRDElKF~KF-I 210 (359) + +++ .|++++.|.- ...+.-++-.++..+.+.++++||.+-|...+.-+.....+..+. |..+ | T Consensus 247 ~-vl~----------~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~e~~~~~---f~~~~l 312 (409) T protein:vir:10 247 A-MLP----------IGYKFEPISQKLVDAQFLENSQLTIRQIASVFGVKMHQLNDLDRATHSNITEQNRE---FYIDTL 312 (409) T ss_pred e-ecC----------CCceEEEccCChhhHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCccccHHHHHHH---HHHHHH Confidence 1 221 2456666532 122334556678999999999999999965433333333333322 5543 2 Q ss_pred HHHHHHHHHHHHHHHHHHHHhcCCCChhHHHHHhhceeeeeeccchHHHHHHHHHHHHHHHHHHHhhhhcchhhhHHHHH Q lcl|NC_015285. 211 ARLRKRFSELFTDLLKTQLILKGVMSLEEWEDMKNHIQFDFIADNYFTELKEIEIRNERMNQVAAMDPYVGKYFSVDYMR 290 (359) Q Consensus 211 ~rLr~rFs~if~d~Lk~QLiLkgI~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~~~~~dp~vGKy~S~~~i~ 290 (359) .-+..+ +.+.|-..| +++.+. .....+.|.-+. +...+ +..|.+.++.+-.- -+++.+-++ T Consensus 313 ~P~~~~----ie~~ln~kL-----~~~~~~---~~~~~~~fd~~~----ll~~d-~~~~~~~~~~~~~~--G~~T~NE~R 373 (409) T protein:vir:10 313 QSILNM----YELEINYKL-----FLISEI---KNGFYSKFNVDT----ILRAD-IKTRYESYKEAIQN--GFKTPNEIR 373 (409) T ss_pred HHHHHH----HHHHHHHhh-----cCchhc---cCCcEEEEechh----hhccC-HHHHHHHHHHHHhC--CCcCHHHHH Confidence 222222 233333333 344332 233344454332 22211 23344444443221 234444443 Q ss_pred HHHhCCCHHHHHHHHHHHHHHHhcCCCCCCcchhhhcCCCCCcccccccCCCCCcCCCCCCCC-CccCCC Q lcl|NC_015285. 291 RQVLKQTEIEIKEIDEQIASEMEAGIIADPMAEMDPAMAAGGEGAPAAEVDPNAQESSVDPGD-VRRGEF 359 (359) Q Consensus 291 k~IL~~tDeeI~e~~kqi~~E~~~~~~~~P~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~-~~~~~~ 359 (359) . +|++. |-+ + |+ ..-++.+..+....+.+ .+-|+= T Consensus 374 ~-~lgl~----------------------p~~---------g-gD-~~~~~~n~~~~~~~~~~~~kgGe~ 409 (409) T protein:vir:10 374 E-LEEDE----------------------PLE---------G-GD-VLLINGNMIPVKMAGEQYSKGGEK 409 (409) T ss_pred H-HhCCC----------------------CCC---------C-cC-eeeeccCccchhhccccccccCCC Confidence 2 23331 110 0 00 00011111111111111 111111 No 110 >protein:vir:105461 Length: 470 # NCBI annotation: putative phage portal protein # Family: family:all:125 # MgeID: mge:1502 # MgeName: KC5a # Cross-refs: genbank:acc:YP_529871;genbank:gi:90592611;genbank:GeneID:3974525 Probab=53.73 E-value=0.52 Score=22.14 Aligned_cols=278 Identities=11% Similarity=0.064 Sum_probs=109.3 Q ss_pred CCCchhhHHHhhhhhhheeeccccccc----cCCCceeecHh---------------Hhhhhhc-ccccC----CCCcch Q lcl|NC_015285. 1 MRGVDLNQQLTQKAAEYFLYNPKGLKN----STNQGMKITTD---------------SVTYCHS-GIQDL----NKNMTL 56 (359) Q Consensus 1 ~~~~~~~~~~~~~~~e~f~yn~~~~~~----~~~~~v~i~~~---------------ai~y~hS-Gl~d~----~~~~i~ 56 (359) ..+.. +.+.+. -+-+|++..... .....+..+.. .-++.|. |-++. ++..-. T Consensus 165 ~~~~~-~~~~~~---~~e~yt~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~nn~~g~ 240 (470) T protein:vir:10 165 QLDPD-SGKYFT---VHEYWTDKEAQFFRTNATDSTVIEPYNIITSYDLSAGYETGQSNTLKHNFGRVPFIEFSKNKYRL 240 (470) T ss_pred eeecC-CceEEE---EEEEEcCCcEEEEEeecCcceeccccccccccccccccccccccccccCCCeeeEEEeecCCCCC Confidence 11111 111010 112233222110 00000000000 0111222 22221 122223 Q ss_pred hhHHHHHHHHHHHHH-HHHHHHHHHHhcCccceeEeccCCCCchHHHHHHHHHHHHhhcceEEeeCCCCcccccccchhh Q lcl|NC_015285. 57 SHLHKAIKAVNQLRM-IEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKMVYDANTGEIKDDKKFMSM 135 (359) Q Consensus 57 syL~~Aik~~NqL~m-~EDalVIyR~~RAPeRRvFyIDvGnlpk~KAeqYl~~iM~kyrnklvYD~~TGevkdd~~~mSM 135 (359) |=|+..+.....+.. +=+....-+.+..|-.-+.-.+.-+++ +.+. -+.+++-- T Consensus 241 sd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lvl~g~~~~~~~-----~~~~-~~~~~~~i------------------- 295 (470) T protein:vir:10 241 PELNKYKGLIDAYDDIYNGFINDLDDVQTVILVLTNYGGADLH-----QFMN-DLRKYKSI------------------- 295 (470) T ss_pred CchhHHHHHHHHHHHHHHHHHHHHHHhcCcceeeecCCccccc-----hhhh-hhhhcCeE------------------- Confidence 445554444444432 233333445555555444432221211 1111 12222222 Q ss_pred HhhhcccccCCCCccceeecCCCCCcchH-HHHHHHHHHHHHhcCCCccccCCCCcccccchhhhhH--HhhhHHHHHHH Q lcl|NC_015285. 136 MEDFWLPRREGGRGTEISTLPGGQNLGEL-EDVKYFQKKLYKALNVPSSRLETETTFNIGRAAEITR--DEVKFQKFIAR 212 (359) Q Consensus 136 lEDywLpRReGgrgTEIsTLpGgqnLgei-~DV~YF~kkLy~aL~VP~SRl~~~~~~~~g~~~eItR--DElKF~KFI~r 212 (359) +++-.+.+.|..+..|--..+.... .-++-+.+.+|+-..+|- +..++ +|.+|.... -...--.-+.+ T Consensus 296 ----~~~~~~~~~~~~~~~lt~~~~~~~~~~~~~~L~~~I~~~s~~p~--~~~~~---~gn~Sg~Alk~~~~~l~~k~~~ 366 (470) T protein:vir:10 296 ----KINNTGNGDNSGVDKLQIDIPVEARDDALKITRKNIFLFGQGID--PANFE---SSNASGVAIKMLYSHLELKAAK 366 (470) T ss_pred ----eccCCCCCcCceeEEEeecCChHHHHHHHHHHHHHHHHHhCCCC--CCccc---cccchHHHHHHHHHHHHHHHHH Confidence 2222223334445555554454433 334567778888889884 22222 244443221 11112233666 Q ss_pred HHHHHHHHHHHHHHHHHHhcCCCChhHHHHHhhceeeeeeccchHHHHHHHHHHHHHHHHHHHhhhhcchhhhHHHHHHH Q lcl|NC_015285. 213 LRKRFSELFTDLLKTQLILKGVMSLEEWEDMKNHIQFDFIADNYFTELKEIEIRNERMNQVAAMDPYVGKYFSVDYMRRQ 292 (359) Q Consensus 213 Lr~rFs~if~d~Lk~QLiLkgI~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~~~~~dp~vGKy~S~~~i~k~ 292 (359) .+..|...|.++++.=+-+-|+. .-+| ..|.+.|...--=.+...+ ++++.+ +| .+|.+++++. T Consensus 367 ~~~~~~~~l~~~~~~i~~~l~~~-~~d~----~~i~i~f~~~~p~d~~e~~-------~~~~~~---~g-~iS~et~l~~ 430 (470) T protein:vir:10 367 TQTYFEHAINELVRAIMRYLNFS-DADK----RHISQHWTRTKVEDSLTKA-------QIVSTV---AN-YSSKEAVAKA 430 (470) T ss_pred HHHHHHHHHHHHHHHHHHHhccc-Cccc----ceeeEEeccCCCCCHHHHH-------HHHHHH---hc-cCcHHHHHHh Confidence 77777777777666433222432 2233 4567777655444443333 334444 34 3799999977 Q ss_pred HhCCCHHHHHHHHHHHHHHHhcCCCCCCcchhhhcCCCCCcc Q lcl|NC_015285. 293 VLKQTEIEIKEIDEQIASEMEAGIIADPMAEMDPAMAAGGEG 334 (359) Q Consensus 293 IL~~tDeeI~e~~kqi~~E~~~~~~~~P~~~~~~~~~~~~~~ 334 (359) |...++ .+++-++|++|..+..=..++.......|.+.+. T Consensus 431 -~p~v~D-~~~E~eri~~E~~e~~~~~~~~~~~~~~~~dde~ 470 (470) T protein:vir:10 431 -NPIVDD-WQQELKDLAKDKEENDPYSNQADELNGKGVNDEQ 470 (470) T ss_pred -CCCCCC-HHHHHHHHHHHHHHHHHhhccccccCCCCCCCCC Confidence 665432 4455566666644321111111111111111111 No 111 >protein:vir:93610 Length: 454 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:157 # MgeName: phi 4795 # Cross-refs: genbank:acc:YP_001449295;genbank:gi:157166043;interpro:IPR006427;interpro:IPR006944;uniprot:Q6H9U6;genbank:GeneID:5580432 Probab=52.22 E-value=0.56 Score=21.96 Aligned_cols=297 Identities=11% Similarity=0.127 Sum_probs=122.3 Q ss_pred CCCchhhHHH---hhhhhhheeeccccc------------------cccCCCceeecHhHhhhhhcccccCCCCcchhhH Q lcl|NC_015285. 1 MRGVDLNQQL---TQKAAEYFLYNPKGL------------------KNSTNQGMKITTDSVTYCHSGIQDLNKNMTLSHL 59 (359) Q Consensus 1 ~~~~~~~~~~---~~~~~e~f~yn~~~~------------------~~~~~~~v~i~~~ai~y~hSGl~d~~~~~i~syL 59 (359) +.++.--..+ ...+.+.+..+|... .......+.++.+-|.++.-+. ..++-.=+|-+ T Consensus 116 l~Gna~~~i~r~~~G~~~~L~~i~~~~v~v~~~~~g~~~y~~~~~~~~~~~~~~~~~~~eViH~k~~~-~~~~~~G~sp~ 194 (454) T protein:vir:93 116 RHGNTVVLKIRNARGQIKELRILDWNRVEPLVADDGEVFYRITPDRNCGITEAVTVPAREVIHDRFNC-FFHPLIGLPPV 194 (454) T ss_pred hcCceEEEEEECCCCcEEEEEEEcCcceEEEEcCCCcEEEEEEeccccccceeEEecCcceEEeccCC-CCCCceeccHH Confidence 1111100000 011223343333211 0111224678888777664332 12333457889 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhcCccceeEeccCCCCchHHHHHHHHHHHHhhcceEEeeCCCCcccccccchhhHhhh Q lcl|NC_015285. 60 HKAIKAVNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKMVYDANTGEIKDDKKFMSMMEDF 139 (359) Q Consensus 60 ~~Aik~~NqL~m~EDalVIyR~~RAPeRRvFyIDvGnlpk~KAeqYl~~iM~kyrnklvYD~~TGevkdd~~~mSMlEDy 139 (359) ..|++.+.....+++...=+=---+--+-|..++ +.|-+..+++--+. .+....- ..+|.+- +++ T Consensus 195 ~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~-~~l~~e~~~~~~~~-~~~~~~g----~n~g~~~-------vl~-- 259 (454) T protein:vir:93 195 YAAGLAATQGHHIQENSTSFFRNGGRPSGVIEIP-GSITEENAKKLKSN-WDSGYTG----ENAGKTA-------ILS-- 259 (454) T ss_pred HHHHHHHHHHHHHHHHHHHHHhccCCccEEEecC-CCCCHHHHHHHHHH-HHHHhcc----cccCCce-------ecc-- Confidence 9999999988888886653322223345666666 56665554433222 2221100 0122211 221 Q ss_pred cccccCCCCccceeecCCC-CCcchHHHHHHHHHHHHHhcCCCccccCCCCcccccchhhhhHHhhhHHHH-HHHHHHHH Q lcl|NC_015285. 140 WLPRREGGRGTEISTLPGG-QNLGELEDVKYFQKKLYKALNVPSSRLETETTFNIGRAAEITRDEVKFQKF-IARLRKRF 217 (359) Q Consensus 140 wLpRReGgrgTEIsTLpGg-qnLgei~DV~YF~kkLy~aL~VP~SRl~~~~~~~~g~~~eItRDElKF~KF-I~rLr~rF 217 (359) .|.+++.|.=. ..+.-++-.++..+.+.++++||...|...++-+....++..+. |.++ |.-+..++ T Consensus 260 --------~g~~~~~l~~~~~d~q~le~~~~~~~~Ia~~fgVPp~~lg~~~~~t~sn~e~~~~~---f~~~~l~P~~~~i 328 (454) T protein:vir:93 260 --------NGAKYNPTTFSPVDSQTVEQLKMTAEIVCSVFRVPAYKIGVGQPPSSDNVEALEQQ---YYSQCLQTLIESI 328 (454) T ss_pred --------CCceEEEcccChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCcchhHHHHHHH---HHHHHHHHHHHHH Confidence 13444444321 11222344457778999999999999965443333333333333 5443 44455555 Q ss_pred HHHHHHHHHHHHHhcCCCChhHHHHHhhceeeeeeccchHHHHHHHHHHHHHHHHHHHhhhhcchhhhHHHHHHHHhCCC Q lcl|NC_015285. 218 SELFTDLLKTQLILKGVMSLEEWEDMKNHIQFDFIADNYFTELKEIEIRNERMNQVAAMDPYVGKYFSVDYMRRQVLKQT 297 (359) Q Consensus 218 s~if~d~Lk~QLiLkgI~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~~~~~dp~vGKy~S~~~i~k~IL~~t 297 (359) ...+...| + +..+ .+ +.|..+ ++...+ +..|++.+..+-.- -+++.+-++. .+++. T Consensus 329 e~~ln~~L----~-----~~~~-----~~--~~f~~~----~ll~~D-~~~r~~~~~~~~~~--G~~T~NE~R~-~~gl~ 384 (454) T protein:vir:93 329 ELLLDEAL----E-----TGEN-----ES--TEFDVT----TLLRMD-SERRMKTLGDAVKN--TLLTPNEARK-RENLP 384 (454) T ss_pred HHHHHHhh----c-----CCCC-----cE--EEeech----hhhccC-HHHHHHHHHHHHhC--CCcCHHHHHH-HhCCC Confidence 44444333 2 2222 13 444422 222221 24566666555322 3566666664 36665 Q ss_pred HHHHHHHHHHHHHHHhcCCCCCCc----ch----hhhcCCCCCcccccccCCC-----CCcCCCCCCCCCccCCC Q lcl|NC_015285. 298 EIEIKEIDEQIASEMEAGIIADPM----AE----MDPAMAAGGEGAPAAEVDP-----NAQESSVDPGDVRRGEF 359 (359) Q Consensus 298 DeeI~e~~kqi~~E~~~~~~~~P~----~~----~~~~~~~~~~~~~~~~~~~-----~~~~~~~~p~~~~~~~~ 359 (359) +-+ --++ .|-..+ .. .+..++..+.+.++.+-++ .....+....++.+..| T Consensus 385 pi~--ggD~---------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~e~~~d~~~~~~ 448 (454) T protein:vir:93 385 PLA--GGDA---------LYLQQQNYSLEALSRRDAREDPFASSGKTASVPQAVAASDGNKAITETEHDAVKAMF 448 (454) T ss_pred CCC--CCCe---------eeeccCccchHhhhccCcccCCCCCCccCCCCCCCCCCCCCCCCccCCccchhhhhh Confidence 421 0000 000000 00 0000111111111111000 00111122222222222 No 112 >protein:vir:99312 Length: 563 # NCBI annotation: putative portal protein # Family: family:all:2446 # MgeID: mge:1655 # MgeName: K # Cross-refs: genbank:acc:YP_024471;genbank:gi:48696430;genbank:GeneID:2948040 Probab=51.53 E-value=0.58 Score=21.88 Aligned_cols=305 Identities=11% Similarity=0.095 Sum_probs=120.7 Q ss_pred CCCchhhHHHh-----hhhhhheeeccccc-----------------cc--cCCCceeecHhHhhhhhccc-ccCC-CCc Q lcl|NC_015285. 1 MRGVDLNQQLT-----QKAAEYFLYNPKGL-----------------KN--STNQGMKITTDSVTYCHSGI-QDLN-KNM 54 (359) Q Consensus 1 ~~~~~~~~~~~-----~~~~e~f~yn~~~~-----------------~~--~~~~~v~i~~~ai~y~hSGl-~d~~-~~~ 54 (359) +.++.--+.++ ..+.+.+..+|... .. .+...+.++.+-|++..-+. .|.. ++. T Consensus 177 l~Gn~~~~~~~~rd~~G~~~~L~pl~p~~V~v~~~~~g~~~~~~~~y~~~~~g~~~~~~~~~evI~~~~~~~~d~~~~~~ 256 (563) T protein:vir:99 177 IYDQVNFEKVFNKNNKTKLEKFIAVDPSTIFYATDKKGKIIKGGKRFVQVVDKRVVASFTSRELAMGIRNPRTELSSSGY 256 (563) T ss_pred hcCCeEEEEEEEecCCCceEEEEEeCCceeEEEECCCCceeccceeEEEEeCCceeEEecCcceEEEeccCCCCcccCcc Confidence 11111000000 11223333332211 11 12222455666655421121 1212 222 Q ss_pred chhhHHHHHHHHHHHHHHHHHHHHHHHhcCccceeEeccCCC-CchHHHHHHHHHHHHhhcceEEeeCCCCcccccccch Q lcl|NC_015285. 55 TLSHLHKAIKAVNQLRMIEDSLVIYRLSRAPERRIFYIDVGN-LPKNKAEQYLREVMGRYRNKMVYDANTGEIKDDKKFM 133 (359) Q Consensus 55 i~syL~~Aik~~NqL~m~EDalVIyR~~RAPeRRvFyIDvGn-lpk~KAeqYl~~iM~kyrnklvYD~~TGevkdd~~~m 133 (359) =+|-|+.|++++.....+|+...=+=---+.-+-|.-++.+. |.+..+++.-+.+-..|+.- ...|+ .. T Consensus 257 G~Spi~~a~~~i~~~~~~~~~~~~~f~ng~~p~giL~~~~~~~ls~e~~~~~~~~~~~~~~G~----~nagk------~~ 326 (563) T protein:vir:99 257 GLSEVEIAMKEFIAYNNTESFNDRFFSHGGTTRGILQIRSDQQQSQHALENFKREWKSSLSGI----NGSWQ------IP 326 (563) T ss_pred cchHHHHHHHHHHHHHHHHHHHHHHHHccCCCceEEEeCCCCCCCHHHHHHHHHHHHHHhccc----ccccc------ce Confidence 367899999999988888887665544556677777787664 55544444444433334320 00111 10 Q ss_pred hhHhhhcccccCCCCccceeecCCCCCcch-HHHHHHHHHHHHHhcCCCccccCCCC--cccc-cchhhhhHH---hh-- Q lcl|NC_015285. 134 SMMEDFWLPRREGGRGTEISTLPGGQNLGE-LEDVKYFQKKLYKALNVPSSRLETET--TFNI-GRAAEITRD---EV-- 204 (359) Q Consensus 134 SMlEDywLpRReGgrgTEIsTLpGgqnLge-i~DV~YF~kkLy~aL~VP~SRl~~~~--~~~~-g~~~eItRD---El-- 204 (359) -.+ ..|.+++.|--...-.+ ++-.+|..+.+.++.+||...|+-.. ++.- ..++.+++. +. T Consensus 327 ~vl----------~~G~~~~~l~~~~~d~qfle~~~~~~~~Ia~afgVPp~~lG~~~~~~~~~~~~~ss~~~sn~e~~~~ 396 (563) T protein:vir:99 327 VVM----------ADDIKFVNMTPTANDMQFEKWLNYLINIISALYGIDPAEIGFPNRGGATGSKGGSTLNEADPGKKQQ 396 (563) T ss_pred EEc----------CCCceEEeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHccccccccccccccccchhhccHHHHHH Confidence 011 12445555543222222 45556788999999999999996432 2211 112223322 21 Q ss_pred hHHHH-HHHHHHHHHHHHHHHHHHHHHhcCCCChhHHHHHhhceeeeeeccchHHHHHHHHHHHHHHHHHHHhhhhcchh Q lcl|NC_015285. 205 KFQKF-IARLRKRFSELFTDLLKTQLILKGVMSLEEWEDMKNHIQFDFIADNYFTELKEIEIRNERMNQVAAMDPYVGKY 283 (359) Q Consensus 205 KF~KF-I~rLr~rFs~if~d~Lk~QLiLkgI~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~~~~~dp~vGKy 283 (359) -|... +.-+..++...|.. .|+ ++. ..++.++|.+...=+ |.+.++...-...-+ T Consensus 397 ~f~~~tL~P~l~~ie~~ln~----~L~-----~~~-----~~~~~~~f~r~D~~~----------~~e~~~~~~~~~~G~ 452 (563) T protein:vir:99 397 QSQNKGLQPLLRFIEDLVNR----HII-----SEY-----GDKYTFQFVGGDTKS----------ATDKLNILKLETQIF 452 (563) T ss_pred HHHHHHHHHHHHHHHHHHHh----hhc-----hhc-----ccccEEEeccCCHHH----------HHHHHHHHHHhcCCc Confidence 24333 34444444433333 222 222 245777787664422 233222211111235 Q ss_pred hhHHHHHHHHhCCCHHH---HH------------HHHHHHHHHHhcCCCCCCcchhhhcCCCCCcccccccCCC------ Q lcl|NC_015285. 284 FSVDYMRRQVLKQTEIE---IK------------EIDEQIASEMEAGIIADPMAEMDPAMAAGGEGAPAAEVDP------ 342 (359) Q Consensus 284 ~S~~~i~k~IL~~tDee---I~------------e~~kqi~~E~~~~~~~~P~~~~~~~~~~~~~~~~~~~~~~------ 342 (359) +|..-++. .++|.+-+ +- +..++.+.+...... ....++...+++.-.+ T Consensus 453 lT~NE~R~-~~gl~Pi~gGD~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---------~~~~~~~~~~~~~~~~~~~~~~ 522 (563) T protein:vir:99 453 KTVNEARE-EQGKKPIEGGDIILDASFLQGTAQLQQDKQYNDGKQKERL---------QMMMSLLEGDNDDSEEGQSTDS 522 (563) T ss_pred cCHHHHHH-HhCCCCCCCcceeecccccccccccccccCCCccccchhh---------hhcccccCCCCCCCCCCCCCCC Confidence 66666664 36665432 00 000000000000000 0000000000110000 Q ss_pred ----------CCcCCCCCCCCCccCCC Q lcl|NC_015285. 343 ----------NAQESSVDPGDVRRGEF 359 (359) Q Consensus 343 ----------~~~~~~~~p~~~~~~~~ 359 (359) ..+++...+.-.+.|.| T Consensus 523 ~~~~~~~~~~~~~~~~~~~~~~~~~~~ 549 (563) T protein:vir:99 523 SNDDKEIGTDAQIKGDDNVYRTQTSNK 549 (563) T ss_pred CCCccccccccccccccccccccCccc Confidence 01111111111122333 No 113 >protein:vir:95599 Length: 563 # NCBI annotation: ORF014 # Family: family:all:2446 # MgeID: mge:1577 # MgeName: G1 # Cross-refs: genbank:acc:YP_240900;genbank:gi:66394963;genbank:GeneID:5132540 Probab=51.53 E-value=0.58 Score=21.88 Aligned_cols=305 Identities=11% Similarity=0.095 Sum_probs=120.7 Q ss_pred CCCchhhHHHh-----hhhhhheeeccccc-----------------cc--cCCCceeecHhHhhhhhccc-ccCC-CCc Q lcl|NC_015285. 1 MRGVDLNQQLT-----QKAAEYFLYNPKGL-----------------KN--STNQGMKITTDSVTYCHSGI-QDLN-KNM 54 (359) Q Consensus 1 ~~~~~~~~~~~-----~~~~e~f~yn~~~~-----------------~~--~~~~~v~i~~~ai~y~hSGl-~d~~-~~~ 54 (359) +.++.--+.++ ..+.+.+..+|... .. .+...+.++.+-|++..-+. .|.. ++. T Consensus 177 l~Gn~~~~~~~~rd~~G~~~~L~pl~p~~V~v~~~~~g~~~~~~~~y~~~~~g~~~~~~~~~evI~~~~~~~~d~~~~~~ 256 (563) T protein:vir:95 177 IYDQVNFEKVFNKNNKTKLEKFIAVDPSTIFYATDKKGKIIKGGKRFVQVVDKRVVASFTSRELAMGIRNPRTELSSSGY 256 (563) T ss_pred hcCCeEEEEEEEecCCCceEEEEEeCCceeEEEECCCCceeccceeEEEEeCCceeEEecCcceEEEeccCCCCcccCcc Confidence 11111000000 11223333332211 11 12222455666655421121 1212 222 Q ss_pred chhhHHHHHHHHHHHHHHHHHHHHHHHhcCccceeEeccCCC-CchHHHHHHHHHHHHhhcceEEeeCCCCcccccccch Q lcl|NC_015285. 55 TLSHLHKAIKAVNQLRMIEDSLVIYRLSRAPERRIFYIDVGN-LPKNKAEQYLREVMGRYRNKMVYDANTGEIKDDKKFM 133 (359) Q Consensus 55 i~syL~~Aik~~NqL~m~EDalVIyR~~RAPeRRvFyIDvGn-lpk~KAeqYl~~iM~kyrnklvYD~~TGevkdd~~~m 133 (359) =+|-|+.|++++.....+|+...=+=---+.-+-|.-++.+. |.+..+++.-+.+-..|+.- ...|+ .. T Consensus 257 G~Spi~~a~~~i~~~~~~~~~~~~~f~ng~~p~giL~~~~~~~ls~e~~~~~~~~~~~~~~G~----~nagk------~~ 326 (563) T protein:vir:95 257 GLSEVEIAMKEFIAYNNTESFNDRFFSHGGTTRGILQIRSDQQQSQHALENFKREWKSSLSGI----NGSWQ------IP 326 (563) T ss_pred cchHHHHHHHHHHHHHHHHHHHHHHHHccCCCceEEEeCCCCCCCHHHHHHHHHHHHHHhccc----ccccc------ce Confidence 367899999999988888887665544556677777787664 55544444444433334320 00111 10 Q ss_pred hhHhhhcccccCCCCccceeecCCCCCcch-HHHHHHHHHHHHHhcCCCccccCCCC--cccc-cchhhhhHH---hh-- Q lcl|NC_015285. 134 SMMEDFWLPRREGGRGTEISTLPGGQNLGE-LEDVKYFQKKLYKALNVPSSRLETET--TFNI-GRAAEITRD---EV-- 204 (359) Q Consensus 134 SMlEDywLpRReGgrgTEIsTLpGgqnLge-i~DV~YF~kkLy~aL~VP~SRl~~~~--~~~~-g~~~eItRD---El-- 204 (359) -.+ ..|.+++.|--...-.+ ++-.+|..+.+.++.+||...|+-.. ++.- ..++.+++. +. T Consensus 327 ~vl----------~~G~~~~~l~~~~~d~qfle~~~~~~~~Ia~afgVPp~~lG~~~~~~~~~~~~~ss~~~sn~e~~~~ 396 (563) T protein:vir:95 327 VVM----------ADDIKFVNMTPTANDMQFEKWLNYLINIISALYGIDPAEIGFPNRGGATGSKGGSTLNEADPGKKQQ 396 (563) T ss_pred EEc----------CCCceEEeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHccccccccccccccccchhhccHHHHHH Confidence 011 12445555543222222 45556788999999999999996432 2211 112223322 21 Q ss_pred hHHHH-HHHHHHHHHHHHHHHHHHHHHhcCCCChhHHHHHhhceeeeeeccchHHHHHHHHHHHHHHHHHHHhhhhcchh Q lcl|NC_015285. 205 KFQKF-IARLRKRFSELFTDLLKTQLILKGVMSLEEWEDMKNHIQFDFIADNYFTELKEIEIRNERMNQVAAMDPYVGKY 283 (359) Q Consensus 205 KF~KF-I~rLr~rFs~if~d~Lk~QLiLkgI~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~~~~~dp~vGKy 283 (359) -|... +.-+..++...|.. .|+ ++. ..++.++|.+...=+ |.+.++...-...-+ T Consensus 397 ~f~~~tL~P~l~~ie~~ln~----~L~-----~~~-----~~~~~~~f~r~D~~~----------~~e~~~~~~~~~~G~ 452 (563) T protein:vir:95 397 QSQNKGLQPLLRFIEDLVNR----HII-----SEY-----GDKYTFQFVGGDTKS----------ATDKLNILKLETQIF 452 (563) T ss_pred HHHHHHHHHHHHHHHHHHHh----hhc-----hhc-----ccccEEEeccCCHHH----------HHHHHHHHHHhcCCc Confidence 24333 34444444433333 222 222 245777787664422 233222211111235 Q ss_pred hhHHHHHHHHhCCCHHH---HH------------HHHHHHHHHHhcCCCCCCcchhhhcCCCCCcccccccCCC------ Q lcl|NC_015285. 284 FSVDYMRRQVLKQTEIE---IK------------EIDEQIASEMEAGIIADPMAEMDPAMAAGGEGAPAAEVDP------ 342 (359) Q Consensus 284 ~S~~~i~k~IL~~tDee---I~------------e~~kqi~~E~~~~~~~~P~~~~~~~~~~~~~~~~~~~~~~------ 342 (359) +|..-++. .++|.+-+ +- +..++.+.+...... ....++...+++.-.+ T Consensus 453 lT~NE~R~-~~gl~Pi~gGD~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---------~~~~~~~~~~~~~~~~~~~~~~ 522 (563) T protein:vir:95 453 KTVNEARE-EQGKKPIEGGDIILDASFLQGTAQLQQDKQYNDGKQKERL---------QMMMSLLEGDNDDSEEGQSTDS 522 (563) T ss_pred cCHHHHHH-HhCCCCCCCcceeecccccccccccccccCCCccccchhh---------hhcccccCCCCCCCCCCCCCCC Confidence 66666664 36665432 00 000000000000000 0000000000110000 Q ss_pred ----------CCcCCCCCCCCCccCCC Q lcl|NC_015285. 343 ----------NAQESSVDPGDVRRGEF 359 (359) Q Consensus 343 ----------~~~~~~~~p~~~~~~~~ 359 (359) ..+++...+.-.+.|.| T Consensus 523 ~~~~~~~~~~~~~~~~~~~~~~~~~~~ 549 (563) T protein:vir:95 523 SNDDKEIGTDAQIKGDDNVYRTQTSNK 549 (563) T ss_pred CCCccccccccccccccccccccCccc Confidence 01111111111122333 No 114 >protein:vir:4598 Length: 416 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:101 # MgeName: PVL # Cross-refs: genbank:acc:NP_058443;genbank:gi:9635169;genbank:GeneID:1262702 Probab=51.23 E-value=0.59 Score=21.85 Aligned_cols=281 Identities=13% Similarity=0.119 Sum_probs=111.9 Q ss_pred CCCc--------hhhHHHhhhhhhheeeccccc----------------cccC--CCceeecHhHhhhhhcccccCCCCc Q lcl|NC_015285. 1 MRGV--------DLNQQLTQKAAEYFLYNPKGL----------------KNST--NQGMKITTDSVTYCHSGIQDLNKNM 54 (359) Q Consensus 1 ~~~~--------~~~~~~~~~~~e~f~yn~~~~----------------~~~~--~~~v~i~~~ai~y~hSGl~d~~~~~ 54 (359) +.++ ..|. ..+.+...|... .... .....++.+-|.+..- .+.++=. T Consensus 105 l~Gna~~~i~r~~~G~-----~~~L~~i~~~~v~v~~~~~g~~~~~~~~~~~~~~~~~~~~~~~evihir~--~~~d~~~ 177 (416) T protein:vir:45 105 LTSHGYIEITRDKTGE-----PMNLTFRKTSEIELKSDARGRLYYFHQRIDSNGNNIERNVKFEDMLDIKF--YSLDGIN 177 (416) T ss_pred hcCCeEEEEEECCCCc-----EEEEEEEcCceeEEEECCCccEEEEEEEecCCCceeEEEEccccEEEecc--CCCCCcc Confidence 1111 1111 112222222110 0011 1124566666655431 2223323 Q ss_pred chhhHHHHHHHHHHHHHHHHHHHHHHHhcCccceeEeccCCCCchHHHHHHHHH-HHHhhcceEEeeCCCCcccccccch Q lcl|NC_015285. 55 TLSHLHKAIKAVNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLRE-VMGRYRNKMVYDANTGEIKDDKKFM 133 (359) Q Consensus 55 i~syL~~Aik~~NqL~m~EDalVIyR~~RAPeRRvFyIDvGnlpk~KAeqYl~~-iM~kyrnklvYD~~TGevkdd~~~m 133 (359) =+|-|+.|++++......++..--+--.-+--+-|..++ |.+...+|.+=+++ +...|. | .++..+.+ T Consensus 178 G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~-~~~~~~~~~~~~~~~~~~~~~---------g-~~nag~~~ 246 (416) T protein:vir:45 178 GLSLLDTLSRTIESDNNGKDFLNNFLRNGTHAGGILKMK-GVLDNKKARDRAREEFHKSFS---------G-TKQAGKVV 246 (416) T ss_pred ccCHHHHHHHHHHHHHHHHHHHHHHHhccCCCcEEEEeC-CCCCCHHHHHHHHHHHHHHhc---------C-ccccCcee Confidence 357788888888887777777664444445566677777 45543444333333 222221 1 11111111 Q ss_pred hhHhhhcccccCCCCccceeecCCCCC-cchHHHHHHHHHHHHHhcCCCccccCCCC-cccccchhhhhHHhhhHHHHHH Q lcl|NC_015285. 134 SMMEDFWLPRREGGRGTEISTLPGGQN-LGELEDVKYFQKKLYKALNVPSSRLETET-TFNIGRAAEITRDEVKFQKFIA 211 (359) Q Consensus 134 SMlEDywLpRReGgrgTEIsTLpGgqn-Lgei~DV~YF~kkLy~aL~VP~SRl~~~~-~~~~g~~~eItRDElKF~KFI~ 211 (359) .++ .|.+++.|.-... +.-++-.++..+.+.++++||.+.|+.+. +++ +.-..+-|..-|. T Consensus 247 -vl~----------~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~------~~~~~~~~~~~l~ 309 (416) T protein:vir:45 247 -VLD----------ESMTFDQLEVDTEVLKLIRENKSSTREIAGVFGIPLHKFGIETANMS------ITDANLDYLSTLK 309 (416) T ss_pred -ecC----------CCceeEeccCCHHHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCcc------HHHHHHHHHHHHH Confidence 121 1445555532221 12244456678899999999999996432 221 2222333444444 Q ss_pred HHHHHHHHHHHHHHHHHHHhcCCCChhHHHHHhhceeeeeeccchHHHHHHHHHHHHHHHHHHHhhhhcchhhhHHHHHH Q lcl|NC_015285. 212 RLRKRFSELFTDLLKTQLILKGVMSLEEWEDMKNHIQFDFIADNYFTELKEIEIRNERMNQVAAMDPYVGKYFSVDYMRR 291 (359) Q Consensus 212 rLr~rFs~if~d~Lk~QLiLkgI~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~~~~~dp~vGKy~S~~~i~k 291 (359) -+-.++...|...| .++ +. ...| .|.- .++...+ ...|.+.++.+-.- -+ T Consensus 310 P~~~~ie~~ln~~l---------~~~--~~--~~~~--~f~~----~~l~~~D-~~~~~~~~~~~~~~--G~-------- 359 (416) T protein:vir:45 310 PYITCVCAELNFKF---------NDE--YV--NREF--KFDT----TEIRVVD-EKTQAEIDKINIDS--GK-------- 359 (416) T ss_pred HHHHHHHHHHhhhc---------ccc--cc--CceE--EEec----hhhhccC-HHHHHHHHHHHHhC--CC-------- Confidence 44444444443332 221 21 2233 4432 2222221 23455555444222 13 Q ss_pred HHhCCCHHHHHHHHHHHHHHHhcCCCCCCcchhhhcCCCCCcccccccCCCCCcCCCCCCCCCccCCC Q lcl|NC_015285. 292 QVLKQTEIEIKEIDEQIASEMEAGIIADPMAEMDPAMAAGGEGAPAAEVDPNAQESSVDPGDVRRGEF 359 (359) Q Consensus 292 ~IL~~tDeeI~e~~kqi~~E~~~~~~~~P~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~ 359 (359) ||..|+-+ +..-+.+++++...-....-..+...+.+.++.-... .....+-|+= T Consensus 360 ----~T~NE~R~-------~~gl~p~~~gd~~~~~~~~n~~~~~~~~~~~~~~~~~--~~~~~kgGe~ 414 (416) T protein:vir:45 360 ----MNIDEIRQ-------RDGLAPIPGGNGSIHRVDLNHVNIELVDEYQMNKSRA--TDKKLKGGEE 414 (416) T ss_pred ----cCHHHHHH-------HhCCCCCCCCCcceEeecccccccccccccCcccccc--cccccCCCCC Confidence 34444421 1233344444432111111111111112222211111 1111222222 No 115 >protein:vir:81095 Length: 416 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1891 # MgeName: tp310-1 # Cross-refs: genbank:acc:YP_001429872;genbank:gi:156603925;genbank:GeneID:5525315 Probab=51.23 E-value=0.59 Score=21.85 Aligned_cols=281 Identities=13% Similarity=0.119 Sum_probs=111.9 Q ss_pred CCCc--------hhhHHHhhhhhhheeeccccc----------------cccC--CCceeecHhHhhhhhcccccCCCCc Q lcl|NC_015285. 1 MRGV--------DLNQQLTQKAAEYFLYNPKGL----------------KNST--NQGMKITTDSVTYCHSGIQDLNKNM 54 (359) Q Consensus 1 ~~~~--------~~~~~~~~~~~e~f~yn~~~~----------------~~~~--~~~v~i~~~ai~y~hSGl~d~~~~~ 54 (359) +.++ ..|. ..+.+...|... .... .....++.+-|.+..- .+.++=. T Consensus 105 l~Gna~~~i~r~~~G~-----~~~L~~i~~~~v~v~~~~~g~~~~~~~~~~~~~~~~~~~~~~~evihir~--~~~d~~~ 177 (416) T protein:vir:81 105 LTSHGYIEITRDKTGE-----PMNLTFRKTSEIELKSDARGRLYYFHQRIDSNGNNIERNVKFEDMLDIKF--YSLDGIN 177 (416) T ss_pred hcCCeEEEEEECCCCc-----EEEEEEEcCceeEEEECCCccEEEEEEEecCCCceeEEEEccccEEEecc--CCCCCcc Confidence 1111 1111 112222222110 0011 1124566666655431 2223323 Q ss_pred chhhHHHHHHHHHHHHHHHHHHHHHHHhcCccceeEeccCCCCchHHHHHHHHH-HHHhhcceEEeeCCCCcccccccch Q lcl|NC_015285. 55 TLSHLHKAIKAVNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLRE-VMGRYRNKMVYDANTGEIKDDKKFM 133 (359) Q Consensus 55 i~syL~~Aik~~NqL~m~EDalVIyR~~RAPeRRvFyIDvGnlpk~KAeqYl~~-iM~kyrnklvYD~~TGevkdd~~~m 133 (359) =+|-|+.|++++......++..--+--.-+--+-|..++ |.+...+|.+=+++ +...|. | .++..+.+ T Consensus 178 G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~-~~~~~~~~~~~~~~~~~~~~~---------g-~~nag~~~ 246 (416) T protein:vir:81 178 GLSLLDTLSRTIESDNNGKDFLNNFLRNGTHAGGILKMK-GVLDNKKARDRAREEFHKSFS---------G-TKQAGKVV 246 (416) T ss_pred ccCHHHHHHHHHHHHHHHHHHHHHHHhccCCCcEEEEeC-CCCCCHHHHHHHHHHHHHHhc---------C-ccccCcee Confidence 357788888888887777777664444445566677777 45543444333333 222221 1 11111111 Q ss_pred hhHhhhcccccCCCCccceeecCCCCC-cchHHHHHHHHHHHHHhcCCCccccCCCC-cccccchhhhhHHhhhHHHHHH Q lcl|NC_015285. 134 SMMEDFWLPRREGGRGTEISTLPGGQN-LGELEDVKYFQKKLYKALNVPSSRLETET-TFNIGRAAEITRDEVKFQKFIA 211 (359) Q Consensus 134 SMlEDywLpRReGgrgTEIsTLpGgqn-Lgei~DV~YF~kkLy~aL~VP~SRl~~~~-~~~~g~~~eItRDElKF~KFI~ 211 (359) .++ .|.+++.|.-... +.-++-.++..+.+.++++||.+.|+.+. +++ +.-..+-|..-|. T Consensus 247 -vl~----------~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~------~~~~~~~~~~~l~ 309 (416) T protein:vir:81 247 -VLD----------ESMTFDQLEVDTEVLKLIRENKSSTREIAGVFGIPLHKFGIETANMS------ITDANLDYLSTLK 309 (416) T ss_pred -ecC----------CCceeEeccCCHHHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCCcc------HHHHHHHHHHHHH Confidence 121 1445555532221 12244456678899999999999996432 221 2222333444444 Q ss_pred HHHHHHHHHHHHHHHHHHHhcCCCChhHHHHHhhceeeeeeccchHHHHHHHHHHHHHHHHHHHhhhhcchhhhHHHHHH Q lcl|NC_015285. 212 RLRKRFSELFTDLLKTQLILKGVMSLEEWEDMKNHIQFDFIADNYFTELKEIEIRNERMNQVAAMDPYVGKYFSVDYMRR 291 (359) Q Consensus 212 rLr~rFs~if~d~Lk~QLiLkgI~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~~~~~dp~vGKy~S~~~i~k 291 (359) -+-.++...|...| .++ +. ...| .|.- .++...+ ...|.+.++.+-.- -+ T Consensus 310 P~~~~ie~~ln~~l---------~~~--~~--~~~~--~f~~----~~l~~~D-~~~~~~~~~~~~~~--G~-------- 359 (416) T protein:vir:81 310 PYITCVCAELNFKF---------NDE--YV--NREF--KFDT----TEIRVVD-EKTQAEIDKINIDS--GK-------- 359 (416) T ss_pred HHHHHHHHHHhhhc---------ccc--cc--CceE--EEec----hhhhccC-HHHHHHHHHHHHhC--CC-------- Confidence 44444444443332 221 21 2233 4432 2222221 23455555444222 13 Q ss_pred HHhCCCHHHHHHHHHHHHHHHhcCCCCCCcchhhhcCCCCCcccccccCCCCCcCCCCCCCCCccCCC Q lcl|NC_015285. 292 QVLKQTEIEIKEIDEQIASEMEAGIIADPMAEMDPAMAAGGEGAPAAEVDPNAQESSVDPGDVRRGEF 359 (359) Q Consensus 292 ~IL~~tDeeI~e~~kqi~~E~~~~~~~~P~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~ 359 (359) ||..|+-+ +..-+.+++++...-....-..+...+.+.++.-... .....+-|+= T Consensus 360 ----~T~NE~R~-------~~gl~p~~~gd~~~~~~~~n~~~~~~~~~~~~~~~~~--~~~~~kgGe~ 414 (416) T protein:vir:81 360 ----MNIDEIRQ-------RDGLAPIPGGNGSIHRVDLNHVNIELVDEYQMNKSRA--TDKKLKGGEE 414 (416) T ss_pred ----cCHHHHHH-------HhCCCCCCCCCcceEeecccccccccccccCcccccc--cccccCCCCC Confidence 34444421 1233344444432111111111111112222211111 1111222222 No 116 >protein:vir:100249 Length: 431 # NCBI annotation: gp78 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1619 # MgeName: Bcep176 # Cross-refs: genbank:acc:YP_355414;genbank:gi:77864704;genbank:GeneID:3725971 Probab=49.74 E-value=0.63 Score=21.68 Aligned_cols=274 Identities=12% Similarity=0.107 Sum_probs=116.2 Q ss_pred CCCchhhHHHhhhhhhheeecccc--------------ccccCCCceeecHhHhhhhhcccccCCCCcchhhHHHHHHHH Q lcl|NC_015285. 1 MRGVDLNQQLTQKAAEYFLYNPKG--------------LKNSTNQGMKITTDSVTYCHSGIQDLNKNMTLSHLHKAIKAV 66 (359) Q Consensus 1 ~~~~~~~~~~~~~~~e~f~yn~~~--------------~~~~~~~~v~i~~~ai~y~hSGl~d~~~~~i~syL~~Aik~~ 66 (359) .++ . | ...+.+..+|.. +....+..+.++.+-|.+.. +. +.|+-.=+|.++.|.+++ T Consensus 141 ~r~-~-g-----~~~~L~pl~~~~v~~~~~~~~~~~y~~~~~~g~~~~~~~~dViHir-~~-~~dg~~G~spi~~~~~~i 211 (431) T protein:vir:10 141 VWS-G-N-----RPIRLIPMDRGSAKGRLTSTWQIVYDYTTPTGDKIELPAREVFHLR-DL-SIDGVSGVSRVKLSGNAL 211 (431) T ss_pred EEc-C-C-----ceEEEEEEcCceeEEEEcCCCeEEEEEEeCCceEEEEchhhEEEec-Cc-CCCCcccccHHHHHHHHH Confidence 111 0 1 011111111110 00112234678888776553 22 233333368888888888 Q ss_pred HHHHHHHHHHHHHHHhcCccceeEeccCCCCchHHHHHHHHHHHHhhcceEEeeCCCCcccccccchhhHhhhcccccCC Q lcl|NC_015285. 67 NQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKMVYDANTGEIKDDKKFMSMMEDFWLPRREG 146 (359) Q Consensus 67 NqL~m~EDalVIyR~~RAPeRRvFyIDvGnlpk~KAeqYl~~iM~kyrnklvYD~~TGevkdd~~~mSMlEDywLpRReG 146 (359) .....+++...=+----+--+-|.-.+ ++|.+.++++.-+.+...|..- ...|.+ + .+ + T Consensus 212 ~~~~~~~~~~~~~f~ng~~p~gil~~~-~~ls~e~~~~~~~~~~~~~~g~----~n~g~~------~-vl--------~- 270 (431) T protein:vir:10 212 ELAEQAERAASRTFRTGVMAGGAIEVP-KELSDNAYGRMKASVQENHTGS----ENAGSW------M-LL--------E- 270 (431) T ss_pred HHHHHHHHHHHHHHhccCCccEEEecC-CCCCHHHHHHHHHHHHHHhcCc----cccCCc------e-ec--------C- Confidence 888888877665555555556666666 4677666655555544444320 111211 1 11 1 Q ss_pred CCccceeecCCC-CCcchHHHHHHHHHHHHHhcCCCccccCCCCcccccchhhhhHHhhhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_015285. 147 GRGTEISTLPGG-QNLGELEDVKYFQKKLYKALNVPSSRLETETTFNIGRAAEITRDEVKFQKFIARLRKRFSELFTDLL 225 (359) Q Consensus 147 grgTEIsTLpGg-qnLgei~DV~YF~kkLy~aL~VP~SRl~~~~~~~~g~~~eItRDElKF~KFI~rLr~rFs~if~d~L 225 (359) .|.+++.|.-. ..+.-++--+|-...+.++++||..-|....+-+.-+..+..+. |.+++ |+--+. .+.+.| T Consensus 271 -~g~~~~~l~~~~~d~q~le~r~~~~~~Ia~~fgVPp~~lg~~~~~t~sn~eq~~~~---f~~~t--L~P~~~-~ie~~l 343 (431) T protein:vir:10 271 -EGATAKQFSNTAASAQQIENRNHQIEEVARMYGVPRPLLMMDDTSWGSGIEQLAIF---FIQYG--LSHWFV-SWEQAA 343 (431) T ss_pred -CCceEEEccCChhHHHHHHHHHHhHHHHHHHhCCCHHHhCCCCCCccccHHHHHHH---HHHHH--HHHHHH-HHHHHH Confidence 13455544321 11222333345567899999999999975432222222333333 55442 333222 223333 Q ss_pred HHHHHhcCCCChhHHHHHhhceeeeeeccchHHHHHHHHHHHHHHHHHHHhhhh--cchhhhHHHHHHHHhCCCHHHHHH Q lcl|NC_015285. 226 KTQLILKGVMSLEEWEDMKNHIQFDFIADNYFTELKEIEIRNERMNQVAAMDPY--VGKYFSVDYMRRQVLKQTEIEIKE 303 (359) Q Consensus 226 k~QLiLkgI~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~~~~~dp~--vGKy~S~~~i~k~IL~~tDeeI~e 303 (359) -. .+++++++. ...+.|.. .+|...+ +..|.+.++.+-.- -..+++.+-+++. ++| T Consensus 344 n~-----~Ll~~~~~~----~~~~~fd~----~~llr~d-~~~r~~~~~~~~~~G~~~g~lT~NE~R~~-~gl------- 401 (431) T protein:vir:10 344 AR-----AFLPEKMLG----QRQFKFNE----GALLRGT-LNDQAAFFSKALGAGGQSPWMKQNEVREM-LDL------- 401 (431) T ss_pred Hh-----hccChhhcC----CceEEEec----hhhhccC-HHHHHHHHHHHHhcccccCccCHHHHHHH-hCC------- Confidence 33 334555543 23445542 2333222 35566665554321 1223454444432 333 Q ss_pred HHHHHHHHHhcCCCCCCcchhhhcCCCCCcccccccCCCCCcCCCC Q lcl|NC_015285. 304 IDEQIASEMEAGIIADPMAEMDPAMAAGGEGAPAAEVDPNAQESSV 349 (359) Q Consensus 304 ~~kqi~~E~~~~~~~~P~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 349 (359) +.+++|..+. .- .|.-..+.+..++ +|++. T Consensus 402 -----------~p~~~~~gD~-~~--~p~n~~~~~~~~~--~p~~~ 431 (431) T protein:vir:10 402 -----------PRADDPVADQ-LR--NPMTQKQKGSGDE--PPATT 431 (431) T ss_pred -----------CCCCCccccc-ee--cccccccCCCCCC--CCCCC Confidence 2233332211 00 0000000000000 01111 No 117 >protein:vir:4454 Length: 414 # NCBI annotation: Portal Protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:96 # MgeName: ST64B # Cross-refs: genbank:acc:NP_700377;genbank:gi:23505449;genbank:GeneID:955656 Probab=47.30 E-value=0.71 Score=21.41 Aligned_cols=288 Identities=14% Similarity=0.158 Sum_probs=122.8 Q ss_pred CCCchhhHH--Hhhhhhhheeeccccc--------------cccCCCceeecHhHhhhhhcccccCCCCcchhhHHHHHH Q lcl|NC_015285. 1 MRGVDLNQQ--LTQKAAEYFLYNPKGL--------------KNSTNQGMKITTDSVTYCHSGIQDLNKNMTLSHLHKAIK 64 (359) Q Consensus 1 ~~~~~~~~~--~~~~~~e~f~yn~~~~--------------~~~~~~~v~i~~~ai~y~hSGl~d~~~~~i~syL~~Aik 64 (359) +.++.--+. --+...+.+..+|... ....+....++.+-|.+... . +.++-.=+|-+..|.. T Consensus 110 l~Gna~~~i~~~~g~~~~L~~l~~~~v~~~~~~~~~~~y~~~~~~g~~~~~~~~evih~~~-~-~~d~~~G~s~i~~~~~ 187 (414) T protein:vir:44 110 LRGNFYAYKVKAFGEVAELLPVDPGCVVPKLNSSWEPVYQVTFPDGSTDVLSQEDIWHVRT-L-TLDGLVGLNPIAYARE 187 (414) T ss_pred hcCCeEEEEEeCCCcEEEEEEEcCceEEEEECCCCcEEEEEEecCceEEEEccccEEEecC-C-CCCCcccccHHHHHHH Confidence 111000000 0011222222222110 11122345677777776542 1 3343334566888888 Q ss_pred HHHHHHHHHHHHHHHHHhcCccceeEeccCCCCchHHHHHHHHHHHHhhcceEEeeCCCCcccccccchhhHhhhccccc Q lcl|NC_015285. 65 AVNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKMVYDANTGEIKDDKKFMSMMEDFWLPRR 144 (359) Q Consensus 65 ~~NqL~m~EDalVIyR~~RAPeRRvFyIDvGnlpk~KAeqYl~~iM~kyrnklvYD~~TGevkdd~~~mSMlEDywLpRR 144 (359) ++.....+++...-+----+--+-++.+| ++|.+..+++..+.+...|+.- ...|. .+ .++ T Consensus 188 ~i~~~~~~~~~~~~~f~ng~~p~gil~~~-~~l~~e~~~~~~~~~~~~~~g~----~n~~~------~~-vl~------- 248 (414) T protein:vir:44 188 AISLAAATEEHGARLFSNGAVTSGVLRTE-QTLSDQAYERLKKDFEERHTGL----GNAHR------PM-ILE------- 248 (414) T ss_pred HHHHHHHHHHHHHHHHhccCCCceEEEeC-CCCCHHHHHHHHHHHHHHhcCc----cccCc------ce-ecC------- Confidence 88777777776665544445557788887 4677776666666555555420 01121 11 111 Q ss_pred CCCCccceeecCCC-CCcchHHHHHHHHHHHHHhcCCCccccCCCCcccccchhhhhHHhhhHHHHHHHHHHHHHHHHHH Q lcl|NC_015285. 145 EGGRGTEISTLPGG-QNLGELEDVKYFQKKLYKALNVPSSRLETETTFNIGRAAEITRDEVKFQKFIARLRKRFSELFTD 223 (359) Q Consensus 145 eGgrgTEIsTLpGg-qnLgei~DV~YF~kkLy~aL~VP~SRl~~~~~~~~g~~~eItRDElKF~KFI~rLr~rFs~if~d 223 (359) .|++++.|.-. ..+.-++-.++..+.+.++++||.+.|...+.-+.....+..+. |.+++ |+- +...+.+ T Consensus 249 ---~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVpp~~l~~~~~~t~~n~e~~~~~---~~~~~--l~P-~~~~ie~ 319 (414) T protein:vir:44 249 ---MGLDWKSMALNAEDSQFLETRKFQLEEICRLFRVPLHMVQNTDRATFNNIEELGLG---FINYS--LVP-YLTRIEQ 319 (414) T ss_pred ---CCceEEEccCChHHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHHHH---HHHHH--HHH-HHHHHHH Confidence 24556655321 22223445567788899999999999975443344333333322 54432 222 2222333 Q ss_pred HHHHHHHhcCCCChhHHHHHhhceeeeeeccchHHHHHHHHHHHHHHHHHHHhhhhcchhhhHHHHHHHHhCCCHHHHHH Q lcl|NC_015285. 224 LLKTQLILKGVMSLEEWEDMKNHIQFDFIADNYFTELKEIEIRNERMNQVAAMDPYVGKYFSVDYMRRQVLKQTEIEIKE 303 (359) Q Consensus 224 ~Lk~QLiLkgI~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~~~~~dp~vGKy~S~~~i~k~IL~~tDeeI~e 303 (359) .|-.. ++++.++. .+.+.|..+ ++...+ +..|.+.++.+-.- -+++.+-++. ++++.+-+ - T Consensus 320 ~ln~~-----L~~~~~~~----~~~i~fd~~----~ll~~d-~~~~~~~~~~~~~~--G~~t~NE~R~-~~gl~p~~--g 380 (414) T protein:vir:44 320 RINTG-----LVRKSKQG----VFYAKFNAG----ALLRGD-MKSRFEAYATGINW--GIYSPNDCRD-LEDMNPRP--G 380 (414) T ss_pred HHHhh-----cCCccccC----ceEEEEech----hhhccC-HHHHHHHHHHHHhC--CCcCHHHHHH-HhCCCCCC--C Confidence 34333 45555553 334555533 222222 23455555443221 3456555553 34553311 0 Q ss_pred HHHHHHHHHhcCCCCCCcchhhhcCCCCCcccccccCCCCCcCCCCCCCCCccC Q lcl|NC_015285. 304 IDEQIASEMEAGIIADPMAEMDPAMAAGGEGAPAAEVDPNAQESSVDPGDVRRG 357 (359) Q Consensus 304 ~~kqi~~E~~~~~~~~P~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~ 357 (359) -+.+-.|.. + ...+..+.. ...+..+..++...+ T Consensus 381 ----------gD~~~~~~n-~---~~~~~~~~~------~~~~~~~~~~d~~~~ 414 (414) T protein:vir:44 381 ----------GDVYLTPMN-M---TTKPSDGSK------AGKQKDNANADETTS 414 (414) T ss_pred ----------cceeccccc-c---cccCCcccc------CCCCCCCCCCCCCCC Confidence 000001100 0 000000000 000011111111111 No 118 >protein:vir:4194 Length: 540 # NCBI annotation: putative portal protein # Family: family:all:1379 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:88 # MgeName: psiM100 # Cross-refs: genbank:acc:NP_071819;genbank:gi:11863102;genbank:GeneID:1257604 Probab=46.19 E-value=0.74 Score=21.29 Aligned_cols=305 Identities=11% Similarity=0.080 Sum_probs=119.2 Q ss_pred CCCch-----------hhHHHhhhhhhheee------------ccccc--cccCCCceeecHhHhhhhhcccccCCCCcc Q lcl|NC_015285. 1 MRGVD-----------LNQQLTQKAAEYFLY------------NPKGL--KNSTNQGMKITTDSVTYCHSGIQDLNKNMT 55 (359) Q Consensus 1 ~~~~~-----------~~~~~~~~~~e~f~y------------n~~~~--~~~~~~~v~i~~~ai~y~hSGl~d~~~~~i 55 (359) .++.. ..+.|...-..|+.. .-... .......+.++.+-|.+.+-.- ..++-.= T Consensus 112 ~r~~~G~~~~L~~i~~~~V~v~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~eViHir~~~-~~~~~~G 190 (540) T protein:vir:41 112 VRDDQGEPVRLDYIPAHTVRVHRDGSRYMQTWDGIHVTYFKDYRYEGEVNPDNGEDQDGVGANEIIFIHLPS-PICSYYG 190 (540) T ss_pred EECCCCcEEEEEEeCCcceEEeEcCceeEeeecCceeeeeecccccceeeccccccceeecccceEEecCCC-CCCCccc Confidence 11100 111111111111111 10011 1122234567777776543211 1122233 Q ss_pred hhhHHHHHHHHHHHHHHHHHHHHHHHhcCccceeEeccCCCCch----HHHHHHHHHHHHhh-cceEEeeCCCCcccccc Q lcl|NC_015285. 56 LSHLHKAIKAVNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPK----NKAEQYLREVMGRY-RNKMVYDANTGEIKDDK 130 (359) Q Consensus 56 ~syL~~Aik~~NqL~m~EDalVIyR~~RAPeRRvFyIDvGnlpk----~KAeqYl~~iM~ky-rnklvYD~~TGevkdd~ 130 (359) +|-|..|.+.+.....+++...=+----+--.-|..++.+-.+. .++.+-+++.+.++ .+.. .|-..+.. T Consensus 191 ~Spi~~~~~~i~~~~~~~~~~~~~f~Ng~~p~giL~~~g~l~~e~~~~~~~~~~~~~~~~~~~~~~~-----~g~~~nag 265 (540) T protein:vir:41 191 VPRYLSAAPSILAMQKIDEYNYAFFDNYTIPSYVITVTGEFEDEMELGSDGEPTGRTVLQGLIEDNF-----KYLKEAPH 265 (540) T ss_pred ccHHHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCcccCchhccchHHHHHHHHHHHHHHHHHh-----cccccccc Confidence 67888888888877777766543322234445566666433222 22233333322221 1110 01111111 Q ss_pred cchhhHhhhcccccCCCCccceeecCCCCCcchH---HHHHHHHHHHHHhcCCCccccCC--CCcccccchhhhhHHhhh Q lcl|NC_015285. 131 KFMSMMEDFWLPRREGGRGTEISTLPGGQNLGEL---EDVKYFQKKLYKALNVPSSRLET--ETTFNIGRAAEITRDEVK 205 (359) Q Consensus 131 ~~mSMlEDywLpRReGgrgTEIsTLpGgqnLgei---~DV~YF~kkLy~aL~VP~SRl~~--~~~~~~g~~~eItRDElK 205 (359) +.+ .++.=. .+..|.+++.|. .+..++ +-.++..+.+.++++||...++. .++++.....+..+. T Consensus 266 ~~~-vLe~~~----~~~~g~~~~pl~--~~~~d~qfle~~~~~~~eIa~afgVPp~~lG~~~~~~~n~sn~eq~~~~--- 335 (540) T protein:vir:41 266 TPL-VFSIPG----GDTVEVTFTPLN--TSQKELSFREYAAEKKHDIAAAHMIDPYRLGITDVGPLGGNFAEVARRT--- 335 (540) T ss_pred ceE-EEecCC----CcccceeEEecc--cchhHHHHHHHHHHHHHHHHHHhCCCHHHcCcccCCCCCcccHHHHHHH--- Confidence 111 222100 012344555543 233333 34457788899999999999963 245555555555544 Q ss_pred HHHH-HHHHHHHHHHHHHHHHHHHHHhcCCCChhHHHHHhhceeeeeeccchHHHHHHHHHHHHHHHHHHHhhhhcchhh Q lcl|NC_015285. 206 FQKF-IARLRKRFSELFTDLLKTQLILKGVMSLEEWEDMKNHIQFDFIADNYFTELKEIEIRNERMNQVAAMDPYVGKYF 284 (359) Q Consensus 206 F~KF-I~rLr~rFs~if~d~Lk~QLiLkgI~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~~~~~dp~vGKy~ 284 (359) |.+. +.-+..++...+...|.++ .+ ..+.+.|..+. +.+.++ ..+++.+- -.-++ T Consensus 336 f~~~tL~P~~~~ie~~ln~~L~~~---------~~-----~~~~i~f~~~~----ll~~D~-~~~~~~lv-----~~G~l 391 (540) T protein:vir:41 336 YYESVVRPQQEIVSSVLTDFIQLK---------LD-----PGARFVFNEEI----LMESEF-VHNYALLV-----QCGVL 391 (540) T ss_pred HHHHHHHHHHHHHHHHHHHhhhhc---------cC-----CceEEEecchh----hcchHH-HHHHHHHH-----hCCCC Confidence 6554 6667777776666554332 11 23456666543 223332 23333221 11345 Q ss_pred hHHHHHHHHhCCCHHHHHHHHHHHHHHHhcCCCCCCc----ch---hhhcCC--CCCccccc-ccCCCCC--cCCCCCCC Q lcl|NC_015285. 285 SVDYMRRQVLKQTEIEIKEIDEQIASEMEAGIIADPM----AE---MDPAMA--AGGEGAPA-AEVDPNA--QESSVDPG 352 (359) Q Consensus 285 S~~~i~k~IL~~tDeeI~e~~kqi~~E~~~~~~~~P~----~~---~~~~~~--~~~~~~~~-~~~~~~~--~~~~~~p~ 352 (359) +.+-++.+++++..-+- .+..|. ++ .+.+.. .+.+.... .+.+|.. +.....|. T Consensus 392 T~NE~Re~L~g~e~gdd--------------~~l~p~n~~~~~~~~~~~~~~~~~~~~~~k~~~~~~~~~~~~~~~~~~~ 457 (540) T protein:vir:41 392 TPSEVREKLFGLDGGPD--------------MFMVPSSIGKSAMKRQKRNYEKNQINEIKRTYAKYKPRIQEIISSESPL 457 (540) T ss_pred CHHHHHHHhCcCcCCCc--------------ccccccccccccccccccccCCCCccccccccchhcccccCcccccccc Confidence 66666544333322110 010110 00 000000 00000000 1112211 11112222 Q ss_pred CCccCCC Q lcl|NC_015285. 353 DVRRGEF 359 (359) Q Consensus 353 ~~~~~~~ 359 (359) +...++- T Consensus 458 ~~~~~~~ 464 (540) T protein:vir:41 458 EDKKKKI 464 (540) T ss_pred ccccccc Confidence 2222222 No 119 >protein:vir:95899 Length: 474 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1588 # MgeName: 71 # Cross-refs: genbank:acc:YP_240382;genbank:gi:66396046;genbank:GeneID:5133410 Probab=45.09 E-value=0.78 Score=21.17 Aligned_cols=284 Identities=11% Similarity=0.074 Sum_probs=100.9 Q ss_pred CCC-chhhH-----HH--hhhhhhheeeccccccc--cCCCceeecHh-------Hhhhhhc-cccc---C-CCCcchhh Q lcl|NC_015285. 1 MRG-VDLNQ-----QL--TQKAAEYFLYNPKGLKN--STNQGMKITTD-------SVTYCHS-GIQD---L-NKNMTLSH 58 (359) Q Consensus 1 ~~~-~~~~~-----~~--~~~~~e~f~yn~~~~~~--~~~~~v~i~~~-------ai~y~hS-Gl~d---~-~~~~i~sy 58 (359) +-+ +..+. .. .++...+-+|++..... ..+.+...... .-+..|. |-++ + ++..-.|= T Consensus 165 v~d~~~~~~~~a~ir~~~~~~~~~~~vy~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~nn~~~~~d 244 (474) T protein:vir:95 165 IWTDKEREQLNAFIRIFTFNGETKVEYWTAETVTYYVYENGGLIPDFYYGDEHIQTHFSTGSWERVPFIAFKNNPEEVSD 244 (474) T ss_pred EEcCCCCCceEEEEEEEeecCeeEEEEEeCCeEEEEEEcCCceeeccccccccccCcccccCCCccceEEecCCCCCCCc Confidence 101 00000 00 00111122333322210 00000000000 0001111 1110 0 11122344 Q ss_pred HHHHHHHHHHHH-HHHHHHHHHHHhcCccceeEeccCCCCchHHHHHHHHHHHHhhcceEEeeCCCCcccccccchhhHh Q lcl|NC_015285. 59 LHKAIKAVNQLR-MIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKMVYDANTGEIKDDKKFMSMME 137 (359) Q Consensus 59 L~~Aik~~NqL~-m~EDalVIyR~~RAPeRRvFyIDvGnlpk~KAeqYl~~iM~kyrnklvYD~~TGevkdd~~~mSMlE 137 (359) |+..+.....+. ++-+....-+-++.|-+-+.-.+.-++. +.+.. |..+ +++ T Consensus 245 ~e~v~~liDa~d~~~S~~~~~~~~~~~p~lv~~g~~~~~~~-----~~~~~-~~~~--~~i------------------- 297 (474) T protein:vir:95 245 IWMYKSFVDAIDKRLSDVQNMFDESVELIYILRGYEGEDLS-----EFMEG-LKYY--KAI------------------- 297 (474) T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHhhcchhhhcCCCccccc-----chhhh-hhcc--cee------------------- Confidence 444444444333 2333333334555554433322111111 11110 1111 111 Q ss_pred hhcccccCCCCccceeecCCCCCcch-HHHHHHHHHHHHHhcCCCccccCCCCcccccchhh--hhHHhhhHHHHHHHHH Q lcl|NC_015285. 138 DFWLPRREGGRGTEISTLPGGQNLGE-LEDVKYFQKKLYKALNVPSSRLETETTFNIGRAAE--ITRDEVKFQKFIARLR 214 (359) Q Consensus 138 DywLpRReGgrgTEIsTLpGgqnLge-i~DV~YF~kkLy~aL~VP~SRl~~~~~~~~g~~~e--ItRDElKF~KFI~rLr 214 (359) |++ ++ ..+..|-...+.+. -.-+.-+.+.+|...++|---.+. |. |..|. |..-......-+.+.+ T Consensus 298 --~~~---~~--~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~---~~-~n~Sg~Alk~~~~~l~~k~~~~~ 366 (474) T protein:vir:95 298 --NVS---SD--GGVETIQVEVPVASTKEYLDMMRAYIVEFGQGVDFQTDK---FG-SATSGIALKFLYTNLNLKANKLK 366 (474) T ss_pred --ecc---CC--CceeEEeccCCHHHHHHHHHHHHHHHHHHhCCcCccccc---cc-cccHHHHHHHHHHHHHHHHHHHH Confidence 121 11 12443333222221 234555667789999998422221 11 22222 2222222335566677 Q ss_pred HHHHHHHHHHHHHHHHhcCCCChhHHHHHhhceeeeeeccchHHHHHHHHHHHHHHHHHHHhhhhcchhhhHHHHHHHHh Q lcl|NC_015285. 215 KRFSELFTDLLKTQLILKGVMSLEEWEDMKNHIQFDFIADNYFTELKEIEIRNERMNQVAAMDPYVGKYFSVDYMRRQVL 294 (359) Q Consensus 215 ~rFs~if~d~Lk~QLiLkgI~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~~~~~dp~vGKy~S~~~i~k~IL 294 (359) ..|...+.+.|+.=+-+-|+. .+| ..|.+.|....--.+...+ +++.++ | .+|.++++.. | T Consensus 367 ~~~~~~l~~~~~~i~~~~g~~--~d~----~~i~i~f~~~~p~~~~e~a-------~~~~~~----g-iiS~et~~~~-l 427 (474) T protein:vir:95 367 NKANVALQELMQFILDFNKIK--LDA----KEIEITFNFNVMVNDLEQS-------QIGAQS----Q-YLSKETLVRH-H 427 (474) T ss_pred HHHHHHHHHHHHHHHHHhCCC--ccc----ceeeEEecCCCccCHHHHH-------HHHHHc----C-CCChHHHHHh-C Confidence 777777777766444344532 233 4677888655443343222 333332 3 5799999966 5 Q ss_pred CCCHHHHHHHHHHHHHHHhcCCCCCCcchhhhcCCCCCcccccccCCCCCcCCC Q lcl|NC_015285. 295 KQTEIEIKEIDEQIASEMEAGIIADPMAEMDPAMAAGGEGAPAAEVDPNAQESS 348 (359) Q Consensus 295 ~~tDeeI~e~~kqi~~E~~~~~~~~P~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 348 (359) ..+++ .+++.++|++|..+. .++.. ...+. +.+.......+++.++. T Consensus 428 p~v~D-~~~E~eri~~E~~~~-~~~~~----~~~~~-~~~~~~~~~~~~~~e~~ 474 (474) T protein:vir:95 428 PWVDD-PKAELERLDEEQLEL-NKQLP----NLDDG-GADGAQQQQQSENNQSK 474 (474) T ss_pred CCCCC-HHHHHHHHHHHHHHH-Hhhcc----ccccc-cCCCCCCcCCCCccccC Confidence 65543 334445566664321 00000 00000 01111111222222222 No 120 >protein:vir:96266 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1612 # MgeName: ROSA # Cross-refs: genbank:acc:YP_240308;genbank:gi:66395972;genbank:GeneID:5133343 Probab=45.09 E-value=0.78 Score=21.17 Aligned_cols=284 Identities=11% Similarity=0.074 Sum_probs=100.9 Q ss_pred CCC-chhhH-----HH--hhhhhhheeeccccccc--cCCCceeecHh-------Hhhhhhc-cccc---C-CCCcchhh Q lcl|NC_015285. 1 MRG-VDLNQ-----QL--TQKAAEYFLYNPKGLKN--STNQGMKITTD-------SVTYCHS-GIQD---L-NKNMTLSH 58 (359) Q Consensus 1 ~~~-~~~~~-----~~--~~~~~e~f~yn~~~~~~--~~~~~v~i~~~-------ai~y~hS-Gl~d---~-~~~~i~sy 58 (359) +-+ +..+. .. .++...+-+|++..... ..+.+...... .-+..|. |-++ + ++..-.|= T Consensus 165 v~d~~~~~~~~a~ir~~~~~~~~~~~vy~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~~~nn~~~~~d 244 (474) T protein:vir:96 165 IWTDKEREQLNAFIRIFTFNGETKVEYWTAETVTYYVYENGGLIPDFYYGDEHIQTHFSTGSWERVPFIAFKNNPEEVSD 244 (474) T ss_pred EEcCCCCCceEEEEEEEeecCeeEEEEEeCCeEEEEEEcCCceeeccccccccccCcccccCCCccceEEecCCCCCCCc Confidence 101 00000 00 00111122333322210 00000000000 0001111 1110 0 11122344 Q ss_pred HHHHHHHHHHHH-HHHHHHHHHHHhcCccceeEeccCCCCchHHHHHHHHHHHHhhcceEEeeCCCCcccccccchhhHh Q lcl|NC_015285. 59 LHKAIKAVNQLR-MIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKMVYDANTGEIKDDKKFMSMME 137 (359) Q Consensus 59 L~~Aik~~NqL~-m~EDalVIyR~~RAPeRRvFyIDvGnlpk~KAeqYl~~iM~kyrnklvYD~~TGevkdd~~~mSMlE 137 (359) |+..+.....+. ++-+....-+-++.|-+-+.-.+.-++. +.+.. |..+ +++ T Consensus 245 ~e~v~~liDa~d~~~S~~~~~~~~~~~p~lv~~g~~~~~~~-----~~~~~-~~~~--~~i------------------- 297 (474) T protein:vir:96 245 IWMYKSFVDAIDKRLSDVQNMFDESVELIYILRGYEGEDLS-----EFMEG-LKYY--KAI------------------- 297 (474) T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHhhcchhhhcCCCccccc-----chhhh-hhcc--cee------------------- Confidence 444444444333 2333333334555554433322111111 11110 1111 111 Q ss_pred hhcccccCCCCccceeecCCCCCcch-HHHHHHHHHHHHHhcCCCccccCCCCcccccchhh--hhHHhhhHHHHHHHHH Q lcl|NC_015285. 138 DFWLPRREGGRGTEISTLPGGQNLGE-LEDVKYFQKKLYKALNVPSSRLETETTFNIGRAAE--ITRDEVKFQKFIARLR 214 (359) Q Consensus 138 DywLpRReGgrgTEIsTLpGgqnLge-i~DV~YF~kkLy~aL~VP~SRl~~~~~~~~g~~~e--ItRDElKF~KFI~rLr 214 (359) |++ ++ ..+..|-...+.+. -.-+.-+.+.+|...++|---.+. |. |..|. |..-......-+.+.+ T Consensus 298 --~~~---~~--~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~---~~-~n~Sg~Alk~~~~~l~~k~~~~~ 366 (474) T protein:vir:96 298 --NVS---SD--GGVETIQVEVPVASTKEYLDMMRAYIVEFGQGVDFQTDK---FG-SATSGIALKFLYTNLNLKANKLK 366 (474) T ss_pred --ecc---CC--CceeEEeccCCHHHHHHHHHHHHHHHHHHhCCcCccccc---cc-cccHHHHHHHHHHHHHHHHHHHH Confidence 121 11 12443333222221 234555667789999998422221 11 22222 2222222335566677 Q ss_pred HHHHHHHHHHHHHHHHhcCCCChhHHHHHhhceeeeeeccchHHHHHHHHHHHHHHHHHHHhhhhcchhhhHHHHHHHHh Q lcl|NC_015285. 215 KRFSELFTDLLKTQLILKGVMSLEEWEDMKNHIQFDFIADNYFTELKEIEIRNERMNQVAAMDPYVGKYFSVDYMRRQVL 294 (359) Q Consensus 215 ~rFs~if~d~Lk~QLiLkgI~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~~~~~dp~vGKy~S~~~i~k~IL 294 (359) ..|...+.+.|+.=+-+-|+. .+| ..|.+.|....--.+...+ +++.++ | .+|.++++.. | T Consensus 367 ~~~~~~l~~~~~~i~~~~g~~--~d~----~~i~i~f~~~~p~~~~e~a-------~~~~~~----g-iiS~et~~~~-l 427 (474) T protein:vir:96 367 NKANVALQELMQFILDFNKIK--LDA----KEIEITFNFNVMVNDLEQS-------QIGAQS----Q-YLSKETLVRH-H 427 (474) T ss_pred HHHHHHHHHHHHHHHHHhCCC--ccc----ceeeEEecCCCccCHHHHH-------HHHHHc----C-CCChHHHHHh-C Confidence 777777777766444344532 233 4677888655443343222 333332 3 5799999966 5 Q ss_pred CCCHHHHHHHHHHHHHHHhcCCCCCCcchhhhcCCCCCcccccccCCCCCcCCC Q lcl|NC_015285. 295 KQTEIEIKEIDEQIASEMEAGIIADPMAEMDPAMAAGGEGAPAAEVDPNAQESS 348 (359) Q Consensus 295 ~~tDeeI~e~~kqi~~E~~~~~~~~P~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 348 (359) ..+++ .+++.++|++|..+. .++.. ...+. +.+.......+++.++. T Consensus 428 p~v~D-~~~E~eri~~E~~~~-~~~~~----~~~~~-~~~~~~~~~~~~~~e~~ 474 (474) T protein:vir:96 428 PWVDD-PKAELERLDEEQLEL-NKQLP----NLDDG-GADGAQQQQQSENNQSK 474 (474) T ss_pred CCCCC-HHHHHHHHHHHHHHH-Hhhcc----ccccc-cCCCCCCcCCCCccccC Confidence 65543 334445566664321 00000 00000 01111111222222222 No 121 >protein:vir:100187 Length: 385 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1524 # MgeName: phi AT3 # Cross-refs: genbank:acc:YP_025029;genbank:gi:48697262;genbank:GeneID:2948285 Probab=44.66 E-value=0.8 Score=21.12 Aligned_cols=271 Identities=16% Similarity=0.152 Sum_probs=107.2 Q ss_pred CCCchhhHHHhhhhhhheeecc---------cc----c-cccCCCceeecHhHhhhhhcccccCCCC-cchhhHHHHHHH Q lcl|NC_015285. 1 MRGVDLNQQLTQKAAEYFLYNP---------KG----L-KNSTNQGMKITTDSVTYCHSGIQDLNKN-MTLSHLHKAIKA 65 (359) Q Consensus 1 ~~~~~~~~~~~~~~~e~f~yn~---------~~----~-~~~~~~~v~i~~~ai~y~hSGl~d~~~~-~i~syL~~Aik~ 65 (359) +.++.- +.++....+++--++ .+ . ...++..+.++.+-|.+..---.+..++ .=+|.|..|+++ T Consensus 98 l~Gn~~-~~i~r~~~~~~p~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~eiihik~~~~~~~~~~~G~s~i~~~~~~ 176 (385) T protein:vir:10 98 LSGNDY-IPLVGQNLEHIPNSDVQINYLPGNMGIVYTVLESNDRPQMVLRQDQMLHFRLMPDPQYRYLIGRSPLESLQNA 176 (385) T ss_pred hcCCeE-EEEEcCceeEeecCCceEEEEEcCCceEEEEEEcCCceEEEEccccEEEeccCCCCcccccccccHHHHHHHH Confidence 000000 000000011110000 00 0 1122344678888877643100111112 125889999999 Q ss_pred HHHHHHHHHHHHHHHHhcCccceeEeccCCCCchHHHHHHHHHHHHhhcceEEeeCCCCcccccccchhhHhhhcccccC Q lcl|NC_015285. 66 VNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKMVYDANTGEIKDDKKFMSMMEDFWLPRRE 145 (359) Q Consensus 66 ~NqL~m~EDalVIyR~~RAPeRRvFyIDvGnlpk~KAeqYl~~iM~kyrnklvYD~~TGevkdd~~~mSMlEDywLpRRe 145 (359) ++....+++...-+----+--+-+..++.+-..+..++ =+++-+.+.... ...|.+ + .++ T Consensus 177 i~~~~~~~~~~~~~~~ng~~~~gil~~~~~~~~~e~~~-~~~~~~~~~~~~----~n~~~~------~-vl~-------- 236 (385) T protein:vir:10 177 LNLDDKASKSNMSAMENQINPAGKLTISNYLSDGKDLE-SAREEFEKANTG----DNSGRL------M-VLP-------- 236 (385) T ss_pred HHHHHHHHHHHHHHHhccCCcceEEEeCCCCCCHHHHH-HHHHHHHHHhCc----cccCCc------c-ccC-------- Confidence 99998888876655555556677777775444444433 334434443211 112221 1 221 Q ss_pred CCCccceeecCCCCCcch-HHHH-HHHHHHHHHhcCCCccccCCCCcccccchhhhhHHhhhHHHHHHHHHHHHHHHHHH Q lcl|NC_015285. 146 GGRGTEISTLPGGQNLGE-LEDV-KYFQKKLYKALNVPSSRLETETTFNIGRAAEITRDEVKFQKFIARLRKRFSELFTD 223 (359) Q Consensus 146 GgrgTEIsTLpGgqnLge-i~DV-~YF~kkLy~aL~VP~SRl~~~~~~~~g~~~eItRDElKF~KFI~rLr~rFs~if~d 223 (359) .|.+++.|.-...-.+ +.+. +|-.+.+.++++||...|....+-+. ..+.+.....-|. ..|+--+. .+.+ T Consensus 237 --~g~~~~~l~~~~~d~~~l~e~~~~~~~~Ia~~fgVp~~~lg~~~~~~~-~~sn~eq~~~~~~---~~l~P~~~-~ie~ 309 (385) T protein:vir:10 237 --DGFDYTQLEMKTDVFKALADNSAYSADQISKAFGVPSDILGGGTSTES-QHSNIDQIKATYL---ANLNSYVN-PIVD 309 (385) T ss_pred --CCceEEecCCChhHHHHHHHHHHHHHHHHHHHhCCCHHHcCCccCCCc-ccccHHHHHHHHH---HHHHHHHH-HHHH Confidence 2566776643222122 2233 45578899999999999964321111 1122222222233 34543222 3333 Q ss_pred HHHHHHHhcCCCChhHHHHHhhceeeeeeccchHHHHHHHHHHHHHHHHHHHhhhhcchhhhHHHHHHHHhCCCHHHHHH Q lcl|NC_015285. 224 LLKTQLILKGVMSLEEWEDMKNHIQFDFIADNYFTELKEIEIRNERMNQVAAMDPYVGKYFSVDYMRRQVLKQTEIEIKE 303 (359) Q Consensus 224 ~Lk~QLiLkgI~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~~~~~dp~vGKy~S~~~i~k~IL~~tDeeI~e 303 (359) .|-..|+ ++ .|+ |. +.++...+ +..|.+.++.+-.- -+ ||..|+-+ T Consensus 310 ~l~~~l~-----~~--------~~~--f~----~~~ll~~d-~~~~~~~~~~~~~~--G~------------~T~NE~R~ 355 (385) T protein:vir:10 310 ELRLKMN-----AP--------DLE--LD----IKDMLDVD-DSALINQVSNLAKS--GV------------LGAEQAQF 355 (385) T ss_pred HHHHhhC-----Cc--------eEE--ee----chhhhccC-HHHHHHHHHHHHhC--CC------------cCHHHHHH Confidence 3444432 21 233 33 22333222 23455555443222 12 34444422 Q ss_pred HHHHHHHHHhcCCCCCCcchhhhcCCCCCcccccccCCCCCcCCCCCCCCCccC Q lcl|NC_015285. 304 IDEQIASEMEAGIIADPMAEMDPAMAAGGEGAPAAEVDPNAQESSVDPGDVRRG 357 (359) Q Consensus 304 ~~kqi~~E~~~~~~~~P~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~ 357 (359) +.. .+.+|+.+ + ........ ....|+.-+- T Consensus 356 ~~g-------~~p~p~~~--~-----------~~~~~~~~----~~~~g~~~dn 385 (385) T protein:vir:10 356 ILT-------RSGFLPDN--L-----------PEFKPLTT----QVKGGDEGDN 385 (385) T ss_pred HhC-------CCccCCCC--C-----------ccccCccc----ccCCCCCCCC Confidence 111 11121111 0 00000000 0000000000 No 122 >protein:vir:97171 Length: 512 # NCBI annotation: ORF006 # Family: family:all:125 # MgeID: mge:1654 # MgeName: 85 # Cross-refs: genbank:acc:YP_239722;genbank:gi:66394876;genbank:GeneID:5130904 Probab=43.75 E-value=0.83 Score=21.02 Aligned_cols=300 Identities=11% Similarity=0.050 Sum_probs=100.6 Q ss_pred CCCchhhHHHhhhh---------------hhh-eeeccccccc---cCCCceeecHh-Hhhhhhc-cccc---C-CCCcc Q lcl|NC_015285. 1 MRGVDLNQQLTQKA---------------AEY-FLYNPKGLKN---STNQGMKITTD-SVTYCHS-GIQD---L-NKNMT 55 (359) Q Consensus 1 ~~~~~~~~~~~~~~---------------~e~-f~yn~~~~~~---~~~~~v~i~~~-ai~y~hS-Gl~d---~-~~~~i 55 (359) +-++....+++-.+ ..+ -+|++...+. ..+.+...... .-+..|. |-++ . ++..- T Consensus 175 iyd~~~~~~~~~~vr~~~~~~~~~~~~~~~~~~~vyt~~~i~~~~~~~~~~~~~~~~~~~~~~~~~g~vPvv~~~nn~~~ 254 (512) T protein:vir:97 175 IYDNTIERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSFERMPITEFSNNERR 254 (512) T ss_pred EEcCCCCCceEEEEEEEEeeeccccccceEEEEEEEeCCcEEEEEecCCCcccccccccccccccCcccceEeecCCCCC Confidence 11111111111110 111 2555544321 11111111000 0000111 1111 1 11122 Q ss_pred hhhHHHHHHHHHHHHH-HHHHHHHHHHhcCccceeEeccCCCCchHHHHHHHHHHHHhhcceEEeeCCCCccc-ccccch Q lcl|NC_015285. 56 LSHLHKAIKAVNQLRM-IEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKMVYDANTGEIK-DDKKFM 133 (359) Q Consensus 56 ~syL~~Aik~~NqL~m-~EDalVIyR~~RAPeRRvFyIDvGnlpk~KAeqYl~~iM~kyrnklvYD~~TGevk-dd~~~m 133 (359) .|-++.++.....+.. +=+....-+-++.|-+-+.-.+..+ ..++. +....+ T Consensus 255 ~gd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~G~~~~~--------------------------~~~~~~~~~~~~ 308 (512) T protein:vir:97 255 KGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLNLD--------------------------PVEVRKQKEANV 308 (512) T ss_pred CCchhhhHHHHHHHHHHHHHHHHHHHHhcCceeeeecCccCC--------------------------chhhhhhhhccc Confidence 3445544444443332 2222222334444444332111111 00000 000111 Q ss_pred hhHhhhc----ccccCCCCccceeecCCCCCcch-HHHHHHHHHHHHHhcCCCccccCCCCcccccchh--hhhHHhhhH Q lcl|NC_015285. 134 SMMEDFW----LPRREGGRGTEISTLPGGQNLGE-LEDVKYFQKKLYKALNVPSSRLETETTFNIGRAA--EITRDEVKF 206 (359) Q Consensus 134 SMlEDyw----LpRReGgrgTEIsTLpGgqnLge-i~DV~YF~kkLy~aL~VP~SRl~~~~~~~~g~~~--eItRDElKF 206 (359) ..+++.+ .+.-+++.|..+..|-...+... -.-+.-+.+.+|.-.++|---.+.-+ |..| .|..-...- T Consensus 309 ~~~~~~~~~~~~~~~~~~~~~d~~~l~~~~~~~~~e~~~~~L~~~I~~~s~~p~~~~~~~~----gn~Sg~Al~~~~~~l 384 (512) T protein:vir:97 309 LFLEPTVYENRDTGIETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDDNFS----GTQSGEAMKYKLFGL 384 (512) T ss_pred ccccccchhhcccccCCCCCcceEEEeecCCHHHHHHHHHHHHHHHHHHhCCcccCccccc----ccchHHHHHHHHHHH Confidence 1111111 11111222333444433333322 23355566777888888863332211 2222 232222223 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHh---cCC-CChhHHHHHhhceeeeeeccchHHHHHHHHHHHHHHHHHHHhhhhcch Q lcl|NC_015285. 207 QKFIARLRKRFSELFTDLLKTQLIL---KGV-MSLEEWEDMKNHIQFDFIADNYFTELKEIEIRNERMNQVAAMDPYVGK 282 (359) Q Consensus 207 ~KFI~rLr~rFs~if~d~Lk~QLiL---kgI-~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~~~~~dp~vGK 282 (359) ..-+.+.++.|..-+.+.++.=+-+ ++. -...+| ..|++.|...---.+ .+.++++..+. | T Consensus 385 ~~ka~~k~~~f~~~l~~~~~li~~~~~~~~~~~~~~d~----~~i~~~f~~~~p~~~-------~e~~~~~~kl~---g- 449 (512) T protein:vir:97 385 EQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDANKDF----NTVRYVYNRNLPKSL-------IEELKAYIDSG---G- 449 (512) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCccccccc----ccceEEeCCCCCcCH-------HHHHHHHHHHh---c- Confidence 3345555666666665555442222 222 123333 357778865333223 23345555553 4 Q ss_pred hhhHHHHHHHHhCCCHHHHHHHHHHHHHHHhcCCCCCCcchhhhcCCCCCcccccccCCCCCcCCCCCCCCCccCC Q lcl|NC_015285. 283 YFSVDYMRRQVLKQTEIEIKEIDEQIASEMEAGIIADPMAEMDPAMAAGGEGAPAAEVDPNAQESSVDPGDVRRGE 358 (359) Q Consensus 283 y~S~~~i~k~IL~~tDeeI~e~~kqi~~E~~~~~~~~P~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~ 358 (359) .+|.+++++. |...+ +.+++.++|++|..+. .+.. +.+.+..+++....+-+ ..+....+ +.| T Consensus 450 iiS~et~~~~-l~~v~-d~~~E~eri~~E~~~~-~~~~----~~~~~~~~~~~~~~~~~----~~~~~~~~--~~~ 512 (512) T protein:vir:97 450 KISQTTLMSL-FSFFQ-DPELEVKKIEEDEKES-IKKA----QKGIYKDPRDINDDEQD----DDTKDTVD--KKE 512 (512) T ss_pred cCchHHHHHh-CCCCC-CHHHHHHHHHHHHHHH-HHHH----hhcccCCCCCCCCCCCC----CCcccccc--ccC Confidence 3799999977 56543 2333445555554431 1101 01111111111111001 11111111 111 No 123 >protein:vir:63755 Length: 547 # NCBI annotation: gp14 # Family: family:all:2446 # MgeID: mge:1517 # MgeName: P100 # Cross-refs: genbank:gi:82547619;genbank:GeneID:3783506 Probab=43.63 E-value=0.84 Score=21.01 Aligned_cols=299 Identities=11% Similarity=0.122 Sum_probs=117.3 Q ss_pred CCCchhhHHHhhhhhhheeecccc-----------------c--cccCCCceeecHhHhhhhhcc-cccCC-CCcchhhH Q lcl|NC_015285. 1 MRGVDLNQQLTQKAAEYFLYNPKG-----------------L--KNSTNQGMKITTDSVTYCHSG-IQDLN-KNMTLSHL 59 (359) Q Consensus 1 ~~~~~~~~~~~~~~~e~f~yn~~~-----------------~--~~~~~~~v~i~~~ai~y~hSG-l~d~~-~~~i~syL 59 (359) +.-+..| .+.+.+..+|.. + ...+...+.++.+-|.|.+-. +.+.. +..=+|-| T Consensus 177 i~rd~~G-----~~~~L~~l~p~~V~~~~~~~g~~~~~~~~y~~~~~~~~~~~~~~~eiih~r~n~~~~~~~~~~G~Spi 251 (547) T protein:vir:63 177 KVFNRNQ-----SMVRFVAKDPTTIFFATTADGKIPDNGNRFVQVIDQKIVATFNAREMAFAVRNPRSDIYATGYGYPEL 251 (547) T ss_pred EEECCCC-----cEEEEEEecCceeEEEECCccccccCceEEEEEcCCcEEEEeccccEEEecccCCCCcccccccccHH Confidence 1111111 011222222211 0 011223356777777776532 11111 22234668 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhcCccceeEeccCC-CCchHHHHHHHHHHHHhhcceEEeeCCCCcccccccchhhHhh Q lcl|NC_015285. 60 HKAIKAVNQLRMIEDSLVIYRLSRAPERRIFYIDVG-NLPKNKAEQYLREVMGRYRNKMVYDANTGEIKDDKKFMSMMED 138 (359) Q Consensus 60 ~~Aik~~NqL~m~EDalVIyR~~RAPeRRvFyIDvG-nlpk~KAeqYl~~iM~kyrnklvYD~~TGevkdd~~~mSMlED 138 (359) ..|++++.....++....=+----+--+-|..+... +|.+..+++.-+.+...|. | +.+..+. .++.+ T Consensus 252 ~~~~~~i~~~~~a~~~~~~~f~Ng~~p~giL~~~~~~~ls~e~~~~lk~~~~~~~~---------G-~~nagk~-~vl~~ 320 (547) T protein:vir:63 252 EIALKQFIAHENTEAFNDRFFSHGGTTRGILQIKAAQQQSQHALEIFKREWKNSLS---------G-INGSWQI-PVVSA 320 (547) T ss_pred HHHHHHHHHHHHHHHHHHHHHHcCCCcceEEEecCCCCCCHHHHHHHHHHHHHHhc---------C-ccccccc-ccccC Confidence 999998888888877665444444555666666644 3444433333333323332 1 1111111 12211 Q ss_pred hcccccCCCCccceeecCCCCCcchHHHH---HHHHHHHHHhcCCCccccCCCC--cccccchhhhhH-----HhhhHHH Q lcl|NC_015285. 139 FWLPRREGGRGTEISTLPGGQNLGELEDV---KYFQKKLYKALNVPSSRLETET--TFNIGRAAEITR-----DEVKFQK 208 (359) Q Consensus 139 ywLpRReGgrgTEIsTLpGgqnLgei~DV---~YF~kkLy~aL~VP~SRl~~~~--~~~~g~~~eItR-----DElKF~K 208 (359) .|.++..|- .+..++.-+ +|..+.+-++.+||...|+..+ ......++.+++ ....|.. T Consensus 321 ---------~g~~~~~l~--~~~~d~qfle~~~~~~~~Ia~afgVPP~~lG~~~~~~~~~~~~~s~t~sn~e~~~~~~~~ 389 (547) T protein:vir:63 321 ---------EDVKFVNMT--PSARDMEFEKWLNYLINVISALYGIDPAEINIPNNGGATGSKGGSLNEGNSAEKNQASKN 389 (547) T ss_pred ---------CCceEEEcC--CChhHHHHHHHHHHHHHHHHHHhCCCHHHcCcccccccccccccccchhhHHHHHHHHHH Confidence 134455553 334444443 4466889999999999996432 111111111221 1112433 Q ss_pred -HHHHHHHHHHHHHHHHHHHHHHhcCCCChhHHHHHhhceeeeeeccchHHHHHHHHHHHHHHHHHHHhhhhcchhhhHH Q lcl|NC_015285. 209 -FIARLRKRFSELFTDLLKTQLILKGVMSLEEWEDMKNHIQFDFIADNYFTELKEIEIRNERMNQVAAMDPYVGKYFSVD 287 (359) Q Consensus 209 -FI~rLr~rFs~if~d~Lk~QLiLkgI~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~~~~~dp~vGKy~S~~ 287 (359) -+.-+..++...|...|- +. + ...+.++|.....-.+ .+|..+...+. .-+++.. T Consensus 390 ~tL~P~~~~ie~~ln~~L~---------~~--~---~~~~~~~f~~~~~~~~-------~~~~~~~~~~~---~g~lT~N 445 (547) T protein:vir:63 390 KGLQPLLGFIEDFINKHIV---------AE--F---GDKYTFQFVGGDIKSE-------LESVKILAEKA---KVAMTVN 445 (547) T ss_pred HHHHHHHHHHHHHHHhhcc---------cc--c---CCceEEEeeccccccH-------HHHHHHHHHHh---CCCcCHH Confidence 345555555554444332 21 1 2356677765543332 22333322221 1246777 Q ss_pred HHHHHHhCCCHH-HH----------HHHH-----HHHHHHHhcCCCCCCcchhhhcCCCCCcccccccCCCCCcCCCCCC Q lcl|NC_015285. 288 YMRRQVLKQTEI-EI----------KEID-----EQIASEMEAGIIADPMAEMDPAMAAGGEGAPAAEVDPNAQESSVDP 351 (359) Q Consensus 288 ~i~k~IL~~tDe-eI----------~e~~-----kqi~~E~~~~~~~~P~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p 351 (359) -++.. ++|... +- .... ++.+.+......+.|.++. +++.++.....| ..+.+.+ T Consensus 446 E~R~~-~gl~P~~egGD~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------~~~~~~~~~~~~--~~~~~~~ 516 (547) T protein:vir:63 446 EVRKE-LNLPGDVIGGDIPLNGVIVQRIGQLMQQEQFEHEKQQSNLQMLQEQT------GNRVSTDVEDIP--DGKDTTG 516 (547) T ss_pred HHHHH-hCCCCCCCCCceeecccccccccccccccCCccccchhhcccccccc------CCCCCCCCCCCC--CCcccCC Confidence 77754 677541 10 0000 0000010000010010000 000001110111 1112222 Q ss_pred CCC-----------------ccCCC Q lcl|NC_015285. 352 GDV-----------------RRGEF 359 (359) Q Consensus 352 ~~~-----------------~~~~~ 359 (359) +.. .+|.| T Consensus 517 ~~~~d~~~~~~~~~~~~~~~~~~~~ 541 (547) T protein:vir:63 517 DIGKDGQRKDKDNANAGKQGMKGDK 541 (547) T ss_pred CcCccccccCccccchhhhhcCCCC Confidence 222 23333 No 124 >protein:vir:95113 Length: 474 # NCBI annotation: ORF007 # Family: family:all:125 # MgeID: mge:1549 # MgeName: X2 # Cross-refs: genbank:acc:YP_240817;genbank:gi:66394677;genbank:GeneID:5133907 Probab=43.38 E-value=0.85 Score=20.98 Aligned_cols=281 Identities=10% Similarity=0.069 Sum_probs=106.7 Q ss_pred CCCchhhHHHhhhhh-hheeecccccccc---CCCceeecHhHhhhhhc-ccc---c-CCCCcchhhHHHHHHHHHHHH- Q lcl|NC_015285. 1 MRGVDLNQQLTQKAA-EYFLYNPKGLKNS---TNQGMKITTDSVTYCHS-GIQ---D-LNKNMTLSHLHKAIKAVNQLR- 70 (359) Q Consensus 1 ~~~~~~~~~~~~~~~-e~f~yn~~~~~~~---~~~~v~i~~~ai~y~hS-Gl~---d-~~~~~i~syL~~Aik~~NqL~- 70 (359) -.+....+++++.-. -+|.+..+++... ....+... +..|. |-+ . .++..-.|=++..+.....+. T Consensus 182 ~~~~~~~~~~y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~g~iPvv~~~nn~~g~sd~e~v~~liDa~d~ 257 (474) T protein:vir:95 182 KFNNEEKVEFWTDTTVTYYVLENGGLIPDYYYGANHIQSH----FSNGNWGRVPFIAFKNNPEEVSDIWMYKSLIDAIDK 257 (474) T ss_pred EEcCeeEEEEEeCCeEEEEEEcCCccccccccCccccccc----ccccCCCccceEeecCCCCCCCcHHHHHHHHHHHHH Confidence 011111111111100 0111111111000 00000000 00111 110 0 011222344555555554443 Q ss_pred HHHHHHHHHHHhcCccceeEeccCCCCchHHHHHHHHHHHHhhcceEEeeCCCCcccccccchhhHhhhcccccCCCCcc Q lcl|NC_015285. 71 MIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKMVYDANTGEIKDDKKFMSMMEDFWLPRREGGRGT 150 (359) Q Consensus 71 m~EDalVIyR~~RAPeRRvFyIDvGnlpk~KAeqYl~~iM~kyrnklvYD~~TGevkdd~~~mSMlEDywLpRReGgrgT 150 (359) ++-+....-+.++.|-+-+.-.+..... . ...-+.. .+. +++ +++. T Consensus 258 ~~S~~~~~~~~~~~p~lv~~g~~~~~~~-----~-~~~~~~~--~~~---------------------i~~---~~~~-- 303 (474) T protein:vir:95 258 RLSDAQNMFDESVELIYILKGYEGQDLE-----E-FMRGLKY--YKA---------------------INV---DGDG-- 303 (474) T ss_pred HHHHHHHHHHHhcCceeeeecCCcccch-----h-hhhhhhc--cce---------------------eec---cCCC-- Confidence 4455555556677776554433322111 0 0000100 001 111 1211 Q ss_pred ceeecCCCCCcchH-HHHHHHHHHHHHhcCCCccccCCCCc-ccccchhhhhHHhhhHHHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_015285. 151 EISTLPGGQNLGEL-EDVKYFQKKLYKALNVPSSRLETETT-FNIGRAAEITRDEVKFQKFIARLRKRFSELFTDLLKTQ 228 (359) Q Consensus 151 EIsTLpGgqnLgei-~DV~YF~kkLy~aL~VP~SRl~~~~~-~~~g~~~eItRDElKF~KFI~rLr~rFs~if~d~Lk~Q 228 (359) +++.|-...+++.. .-+.-+.+.+|....+|- +..++. -+. .+..|..-+..-..-+.+.+..|..-+.++++.= T Consensus 304 ~~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~--~~~~~~~~n~-Sg~Alk~~~~~l~~k~~~k~~~~~~~l~~~~~li 380 (474) T protein:vir:95 304 GVETIQVEVPVSSTKEYIDLMRAYIMEFGQGVD--FQTDKFGSAP-SGIALKFLYGNLDLKANKLKNKATVAIQELIGFI 380 (474) T ss_pred ceeEEeecCCHHHHHHHHHHHHHHHHHHhCCcc--cccccccccc-hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 24444433444433 334667788898999983 222221 111 2223433333344456677777777777777654 Q ss_pred HHhcCCCChhHHHHHhhceeeeeeccchHHHHHHHHHHHHHHHHHHHhhhhcchhhhHHHHHHHHhCCCHHHHHHHHHHH Q lcl|NC_015285. 229 LILKGVMSLEEWEDMKNHIQFDFIADNYFTELKEIEIRNERMNQVAAMDPYVGKYFSVDYMRRQVLKQTEIEIKEIDEQI 308 (359) Q Consensus 229 LiLkgI~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~~~~~dp~vGKy~S~~~i~k~IL~~tDeeI~e~~kqi 308 (359) +-+-|+ ..+| ..|.+.|....--.+. +.++++.++ | .+|.+++++. |..+++ -+++.++| T Consensus 381 ~~~~g~--~~d~----~~i~v~f~~~~p~d~~-------e~a~~~~~~----g-~iS~et~i~~-l~~v~d-~~~E~~ri 440 (474) T protein:vir:95 381 IDFNNL--KMDV----KDIEISFNFNRMMNDA-------EQSQIIAQS----Q-YLSRETLVKS-SPLVDD-YKAELERI 440 (474) T ss_pred HHHhCC--Cccc----ceeeEEeccCCCcCHH-------HHHHHHHhc----C-CCchHHHHHh-CCCCCC-HHHHHHHH Confidence 434443 2333 4567777544333332 223344443 3 4799999986 555432 23444555 Q ss_pred HHHHhcCCCCCCcchhhhcCCCCCcccccccCCCCCcCCCCCCC Q lcl|NC_015285. 309 ASEMEAGIIADPMAEMDPAMAAGGEGAPAAEVDPNAQESSVDPG 352 (359) Q Consensus 309 ~~E~~~~~~~~P~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~ 352 (359) ++|..+..-..+. ..+. +++.. ++...+...+|. T Consensus 441 ~~E~~~~~~~~~~-----~~~~--~~d~~---~~~~~~~~~~~~ 474 (474) T protein:vir:95 441 EQEQMEYNKQLPN-----LDDG--GADGA---QQQERSNDKESE 474 (474) T ss_pred HHHHHHHHhcccc-----cccc--cCCCC---cCCCCCccCCCC Confidence 5554331111111 0011 01110 111112222222 No 125 >protein:vir:80333 Length: 419 # NCBI annotation: gp4, phage portal protein, HK97 family # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1881 # MgeName: phi644-2 # Cross-refs: genbank:acc:YP_001111083;genbank:gi:134288632;genbank:GeneID:4960580 Probab=39.96 E-value=0.99 Score=20.60 Aligned_cols=290 Identities=10% Similarity=0.125 Sum_probs=111.3 Q ss_pred CCCchhhHHHh----hhhhhheeecccccc----------ccCCCceeecHhHhhhhhcccccCCCCcchhhHHHHHHHH Q lcl|NC_015285. 1 MRGVDLNQQLT----QKAAEYFLYNPKGLK----------NSTNQGMKITTDSVTYCHSGIQDLNKNMTLSHLHKAIKAV 66 (359) Q Consensus 1 ~~~~~~~~~~~----~~~~e~f~yn~~~~~----------~~~~~~v~i~~~ai~y~hSGl~d~~~~~i~syL~~Aik~~ 66 (359) +.++.-- .+. ..+.+.+..+|.... +.-.....++.+-|.+.. ..+.|+-.=+|-++.|..++ T Consensus 109 l~Gna~~-~i~r~~~G~~~~L~~i~~~~v~i~~~~~~~~~y~~~~~~~~~~~~i~h~~--~~~~d~~~G~s~i~~~~~~i 185 (419) T protein:vir:80 109 LRGNSYS-FIDRDQDGVIQGLYPLDNEAVTVMKGPDLKPMYRVAGADPLPQRLVHHVR--WMSINGYTGLSPVLLHANAI 185 (419) T ss_pred hcCCeEE-EEEECCCCcEEEEEEecCceEEEEECCCceEEEEEcCccccchhheEEec--CCCCCCcccccHHHHHHHHH Confidence 1111000 000 011223333322110 001111234444443322 22334434567888888888 Q ss_pred HHHHHHHHHHHHHHHhcCccceeEeccCCCCchHHHHHHHHHHHHhhcceEEeeCCCCcccccccchhhHhhhcccccCC Q lcl|NC_015285. 67 NQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKMVYDANTGEIKDDKKFMSMMEDFWLPRREG 146 (359) Q Consensus 67 NqL~m~EDalVIyR~~RAPeRRvFyIDvGnlpk~KAeqYl~~iM~kyrnklvYD~~TGevkdd~~~mSMlEDywLpRReG 146 (359) .....+++...=+---.+--+-++.++... +..+.++-+..+...+....-=....|.+ + +++ T Consensus 186 ~~~~~~~~~~~~~f~ng~~~~gil~~~~~~-~~~~~~~~~~~~~~~~~~~~~g~~n~g~~------~-vl~--------- 248 (419) T protein:vir:80 186 GHAQAIQQYAGKSFMNGTALSGVIERPTDA-PALKDQASVDRITDGWNAKFGGSGNAKKV------A-LLQ--------- 248 (419) T ss_pred HHHHHHHHHHHHHHhcCCCccEEEEecCCC-CcccCHHHHHHHHHHHHHHhcCccccCCc------e-ecC--------- Confidence 888888777665544556566677776422 21111222222333332221100111221 1 221 Q ss_pred CCccceeecCC-CCCcchHHHHHHHHHHHHHhcCCCccccCCCCcccccchhhhhHHhhhHHHHH-HHHHHHHHHHHHHH Q lcl|NC_015285. 147 GRGTEISTLPG-GQNLGELEDVKYFQKKLYKALNVPSSRLETETTFNIGRAAEITRDEVKFQKFI-ARLRKRFSELFTDL 224 (359) Q Consensus 147 grgTEIsTLpG-gqnLgei~DV~YF~kkLy~aL~VP~SRl~~~~~~~~g~~~eItRDElKF~KFI-~rLr~rFs~if~d~ 224 (359) .|.+++-|.- .+.+.-++-.++..+.+.++++||...|...+.-+.....+..+ .|..++ .-+..+ +.+. T Consensus 249 -~g~~~~~l~~s~~d~q~~e~~~~~~~~Ia~~fgVPp~llg~~~~~t~~n~e~~~~---~f~~~~l~P~~~~----ie~~ 320 (419) T protein:vir:80 249 -EGMKFKPLSMTNVDAALIDALRLSALDIARIYKIPAHMVNELERATFSNIEHQSL---QFVIYTLLPWVKR----HEQA 320 (419) T ss_pred -CCceEEeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhcCCCCCCcccHHHHHH---HHHHHHHHHHHHH----HHHH Confidence 2455655532 12222334455778999999999999997543334433444333 365552 222222 2233 Q ss_pred HHHHHHhcCCCChhHHHHHhhceeeeeeccchHHHHHHHHHHHHHHHHHHHhhhhcchhhhHHHHHHHHhCCCHHHHHHH Q lcl|NC_015285. 225 LKTQLILKGVMSLEEWEDMKNHIQFDFIADNYFTELKEIEIRNERMNQVAAMDPYVGKYFSVDYMRRQVLKQTEIEIKEI 304 (359) Q Consensus 225 Lk~QLiLkgI~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~~~~~dp~vGKy~S~~~i~k~IL~~tDeeI~e~ 304 (359) |-. .++++.++. ...+.|..+ ++... -+..|++.++.+-.- -+++.+-++. ++++.+ T Consensus 321 l~~-----kll~~~~~~----~~~i~fd~~----~l~~~-d~~~~~~~~~~~~~~--G~~T~NE~R~-~~g~~p------ 377 (419) T protein:vir:80 321 KTR-----DLLLPSERK----QYFIEYNLA----GLLRG-DQSSRYAAYAVGRQW--GWLSINDIRR-LENMPP------ 377 (419) T ss_pred Hhh-----hccCccccC----CeEEEEech----hhhcc-CHHHHHHHHHHHHhC--CCcCHHHHHH-HhCCCC------ Confidence 333 334555543 233444432 22222 134555555554221 2333333331 122211 Q ss_pred HHHHHHHHhcCCCCCCcchhhhcCCCCCcccccccCCCCCcCCCCCCCCCccCCC Q lcl|NC_015285. 305 DEQIASEMEAGIIADPMAEMDPAMAAGGEGAPAAEVDPNAQESSVDPGDVRRGEF 359 (359) Q Consensus 305 ~kqi~~E~~~~~~~~P~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~ 359 (359) + |. .|. .-.+....++....|.- .....|.+....++ T Consensus 378 ------------~--~g--GD~-~~~~~n~~~~~~~~~~~-~~~~~~~~~~~~~~ 414 (419) T protein:vir:80 378 ------------V--KG--GDI-YLSPMNMVDASKPQPIP-MGKTEPTKAALDEI 414 (419) T ss_pred ------------C--CC--cce-eeecccccccccccccc-CCCCCchhhhHHHH Confidence 1 11 000 00000001111111100 01111222222333 No 126 >protein:vir:102330 Length: 451 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:1566 # MgeName: phi CD119 # Cross-refs: genbank:acc:YP_529555;genbank:gi:90592641;genbank:GeneID:3974462 Probab=39.52 E-value=1 Score=20.55 Aligned_cols=272 Identities=15% Similarity=0.111 Sum_probs=110.8 Q ss_pred CCCchhhHHHhhhhhhheeeccccccc--c---CCCceeecHhHhhhhhc-ccccC----CCCcchhhHHHHHHHHHHHH Q lcl|NC_015285. 1 MRGVDLNQQLTQKAAEYFLYNPKGLKN--S---TNQGMKITTDSVTYCHS-GIQDL----NKNMTLSHLHKAIKAVNQLR 70 (359) Q Consensus 1 ~~~~~~~~~~~~~~~e~f~yn~~~~~~--~---~~~~v~i~~~ai~y~hS-Gl~d~----~~~~i~syL~~Aik~~NqL~ 70 (359) ..+...+....+.+.-+-+|++..... . +..+-.+..+. .-|- |.++. ++....|-++..+.....+. T Consensus 166 ~~~~~~~~~~~~~~~~~e~yt~~~~~~~~~~~~~~~~~~~~~~~--~~~~~g~vPvv~~~nn~~~~~d~e~v~~liDa~~ 243 (451) T protein:vir:10 166 QLEDVKGQIQKQAYTYVEFWTDKILDKYKFFGVSCCGSQIEHIT--VQHRFNSVPFVEFSNNIKKQSDLSKYKKILDLYD 243 (451) T ss_pred eeecccccccceEEEEEEEEeCCeEEEEEecccCcccccccccc--ccCCCCeeeEEEeccCCCCCCchhhHHHHHHHHH Confidence 111111111111111111333322111 0 00000000000 0111 21111 12234466666555555444 Q ss_pred H-HHHHHHHHHHhcCccceeEeccCCCCchHHHHHHHHHHHHhhcceEEeeCCCCcccccccchhhHhhhcccccCCCCc Q lcl|NC_015285. 71 M-IEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKMVYDANTGEIKDDKKFMSMMEDFWLPRREGGRG 149 (359) Q Consensus 71 m-~EDalVIyR~~RAPeRRvFyIDvGnlpk~KAeqYl~~iM~kyrnklvYD~~TGevkdd~~~mSMlEDywLpRReGgrg 149 (359) + +=+..-.-+-+.-|-+-+.-.+... . .+.+.. +..++--.+.+. +.+.| T Consensus 244 ~~~S~~~~~~~~~~~~~l~~~g~~~~~-~----~~~~~~-~~~~~~i~~~~~-----------------------~~~~~ 294 (451) T protein:vir:10 244 RVMSGFANDLEDIQQIIYILENFGGED-T----SEFLKE-LKRYKTIKTETD-----------------------SEGDS 294 (451) T ss_pred HHHHHHHHHHHHhccceeeeecCCccc-c----hhhHHH-HhhCCeEEecCc-----------------------CCccC Confidence 2 3333334455566655443332222 1 111111 222221111111 11222 Q ss_pred cceeecCCCCCcchHHH-HHHHHHHHHHhcCCCccccCCCCcccccchhhhh--HHhhhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_015285. 150 TEISTLPGGQNLGELED-VKYFQKKLYKALNVPSSRLETETTFNIGRAAEIT--RDEVKFQKFIARLRKRFSELFTDLLK 226 (359) Q Consensus 150 TEIsTLpGgqnLgei~D-V~YF~kkLy~aL~VP~SRl~~~~~~~~g~~~eIt--RDElKF~KFI~rLr~rFs~if~d~Lk 226 (359) -.+..|....+...... +.-+.+.+|+...+|- +..+ ++|++|.+. --....-.-+.+.+..|...+.++++ T Consensus 295 ~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~--~~~~---~~gn~Sg~Alk~~~~~l~~k~~~k~~~f~~~l~~~~~ 369 (451) T protein:vir:10 295 GGLKTMQIEIPTEARKIILEILKKQIYESGQGLQ--QDTE---NFGNASGVALKFFYRKLELKSGLLETEFRTSFDKLIK 369 (451) T ss_pred CcceEEeecCCHHHHHHHHHHHHHHHHHHhCccc--cccc---ccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 33555555445555544 7788889999999994 2222 234444322 22222333456666666666666665 Q ss_pred HHHHhcCCCChhHHHHHhhceeeeeeccchHHHHHHHHHHHHHHHHHHHhhhhcchhhhHHHHHHHHhCCCHHHHHHHHH Q lcl|NC_015285. 227 TQLILKGVMSLEEWEDMKNHIQFDFIADNYFTELKEIEIRNERMNQVAAMDPYVGKYFSVDYMRRQVLKQTEIEIKEIDE 306 (359) Q Consensus 227 ~QLiLkgI~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~~~~~dp~vGKy~S~~~i~k~IL~~tDeeI~e~~k 306 (359) .=+-.-|+ .+|. .|.+.|...---.+ .+.++++..+. | .+|.+++++. |...++ .+++.+ T Consensus 370 li~~~~~~---~d~~----~i~i~f~~~~p~n~-------~e~~~~~~kl~---g-~iS~et~~~~-~p~v~d-~~~e~~ 429 (451) T protein:vir:10 370 AILYFLGV---TDYK----KIQQTYTRNMMSND-------LEDADIATKSV---G-IIPTKIILRH-HPWVDD-VEEAEK 429 (451) T ss_pred HHHHHhCC---CCcc----ceeEEecCCCCCCH-------HHHHHHHHHHh---c-cCchHHHHHh-CCCCCC-HHHHHH Confidence 44433333 3454 46667754433333 23444555553 4 3799999987 555442 344555 Q ss_pred HHHHHHhcCCCCCCcchhhhcCCCCCccc Q lcl|NC_015285. 307 QIASEMEAGIIADPMAEMDPAMAAGGEGA 335 (359) Q Consensus 307 qi~~E~~~~~~~~P~~~~~~~~~~~~~~~ 335 (359) +|++|+... ++..++ +.+. -+. T Consensus 430 ~~~ee~~~~----~~~~~~-~~~~--~~~ 451 (451) T protein:vir:10 430 LYLEEKKIQ----ASKVSD-DYNN--FTE 451 (451) T ss_pred HHHHHHHHH----HHHHHh-hcCC--CCC Confidence 555554431 111111 1111 111 No 127 >protein:vir:96494 Length: 501 # NCBI annotation: portal protein # Family: family:all:125 # MgeID: mge:1620 # MgeName: 2972 # Cross-refs: genbank:acc:YP_238488;genbank:gi:66391764;genbank:GeneID:5176916 Probab=39.32 E-value=1 Score=20.53 Aligned_cols=297 Identities=10% Similarity=0.058 Sum_probs=116.1 Q ss_pred CCCchhhHHHhh------------hhhhheeeccccccc--cCCCceeecHhHhhhhhc-cccc----CCCCcchhhHHH Q lcl|NC_015285. 1 MRGVDLNQQLTQ------------KAAEYFLYNPKGLKN--STNQGMKITTDSVTYCHS-GIQD----LNKNMTLSHLHK 61 (359) Q Consensus 1 ~~~~~~~~~~~~------------~~~e~f~yn~~~~~~--~~~~~v~i~~~ai~y~hS-Gl~d----~~~~~i~syL~~ 61 (359) +-++....++.- +...+-+|++...+. .......+.. ..|. |.++ .++..-.|=++. T Consensus 178 v~d~~~~~~~~~~v~~~~~~~~~~~~~~~~vyt~~~i~~~~~~~~~~~~~~----~~~~~g~vPvv~~~nn~~g~sd~e~ 253 (501) T protein:vir:96 178 IYDNSLEDNSIAAVRYYNRGTLQSAKDVVEIYTDEHIYTLDASDDFNEISV----TTHAFGTVPITEYLNNIDGIGDYET 253 (501) T ss_pred EEcCCCCCceEEEEEEEEeecCCCcEEEEEEEcCCcEEEEeeCCCceeccc----cccCCCccceEEecCCccCCCchhh Confidence 111111111111 111222455544321 1111111111 1221 2111 111222344444 Q ss_pred HHHHHHHHH-HHHHHHHHHHHhcCccceeEeccCCCCchHHHHHHHHHHHHhhcceEEeeCCCCcccccccchhhHhhhc Q lcl|NC_015285. 62 AIKAVNQLR-MIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKMVYDANTGEIKDDKKFMSMMEDFW 140 (359) Q Consensus 62 Aik~~NqL~-m~EDalVIyR~~RAPeRRvFyIDvGnlpk~KAeqYl~~iM~kyrnklvYD~~TGevkdd~~~mSMlEDyw 140 (359) .+.....+. ++=+....-+.++.|-+-+.-.+..+.+... .+ +.. ++ .++ T Consensus 254 v~~liDa~d~~~s~~~~~~~~~~~~~l~i~G~~~~~~~~~~-----~~-~~~--~~---------------------~~~ 304 (501) T protein:vir:96 254 ELYLIDLYDSAESDTANHMSDMADAILAIYGDLALPKGMQA-----SD-MKR--TR---------------------LMQ 304 (501) T ss_pred hHHHHHHHHHHHHHHHHHHHHhcCceeeeecccccCcccch-----hh-hhh--cC---------------------eee Confidence 444433332 3344444455666666655443322211100 00 000 11 122 Q ss_pred ccccCC----CCccceeecCCCCCcchH-HHHHHHHHHHHHhcCCCccccCCCCcccccchhhhhHHhhhHHHHHHHHHH Q lcl|NC_015285. 141 LPRREG----GRGTEISTLPGGQNLGEL-EDVKYFQKKLYKALNVPSSRLETETTFNIGRAAEITRDEVKFQKFIARLRK 215 (359) Q Consensus 141 LpRReG----grgTEIsTLpGgqnLgei-~DV~YF~kkLy~aL~VP~SRl~~~~~~~~g~~~eItRDElKF~KFI~rLr~ 215 (359) ++-..+ +.+..+..|-...+...+ .-++-+.+.+|...++|---++.-++ |. .+..|.--.......+.+.++ T Consensus 305 ~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~l~~~I~~~s~~p~~~~~~~~~-n~-Sg~Al~~~~~~l~~ka~~~~~ 382 (501) T protein:vir:96 305 LKPPKSADGKEGTVKAEYLTKSYDVSGAEAYKTRLNRDIHIFTNTPDMSDTNFSG-NT-SGEALKYKLFGLDQDRVDTQS 382 (501) T ss_pred ecccccccccccCcceeeEeccCCHHHHHHHHHHHHHHHHHHhCCcccCcccccc-cc-hHHHHHHHHHHHHHHHHHHHH Confidence 222221 122244444433333222 22345567778888888433322211 11 222333333345566777788 Q ss_pred HHHHHHHHHHHHHHHhcCCCCh-hHHHHHhhceeeeeeccchHHHHHHHHHHHHHHHHHHHhhhhcchhhhHHHHHHHHh Q lcl|NC_015285. 216 RFSELFTDLLKTQLILKGVMSL-EEWEDMKNHIQFDFIADNYFTELKEIEIRNERMNQVAAMDPYVGKYFSVDYMRRQVL 294 (359) Q Consensus 216 rFs~if~d~Lk~QLiLkgI~t~-eew~~~~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~~~~~dp~vGKy~S~~~i~k~IL 294 (359) .|..-+.+.++.=+-+-++... .+++ ...|.+.|...-.-.+ .+.++++..+. | .+|.+++++. | T Consensus 383 ~~~~~l~~~~~li~~~~~~~~~~~~~d--~~~i~i~f~~~~p~n~-------~e~ad~~~kl~---g-~iS~et~~~~-l 448 (501) T protein:vir:96 383 QFTKGLKRRYRLAARIGSLVNEFKDFD--ESLLKITFTPNLPKSL-------NEQVSILTGLG---G-QVSQETALSL-S 448 (501) T ss_pred HHHHHHHHHHHHHHHHHHhcccccccc--cccceEEeCCCCCcCH-------HHHHHHHHHHh---c-cCchHHHHHh-C Confidence 8888887777654333232211 1222 2357788865444334 34445566664 3 3799999987 5 Q ss_pred CCCHHHHHHHHHHHHHHHhcCCCCCCcchhhhcCCCCCcccccccCCCCCcCCCCCCCCCccCCC Q lcl|NC_015285. 295 KQTEIEIKEIDEQIASEMEAGIIADPMAEMDPAMAAGGEGAPAAEVDPNAQESSVDPGDVRRGEF 359 (359) Q Consensus 295 ~~tDeeI~e~~kqi~~E~~~~~~~~P~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~ 359 (359) ...++ .+++.++|++|...--......++++..|...+. .+.+..+.+.-+. T Consensus 449 ~~v~D-~~~E~~ri~~E~~~~~~~~~~~~~~~~~~~~~~~------------~~e~~~d~~e~~~ 500 (501) T protein:vir:96 449 GLVES-PNEELDKINKEMSEIDFKGYSNDFNEHVGKYTDE------------VKETHTDDFEREY 500 (501) T ss_pred CCCCC-HHHHHHHHHHHHHHhhccccccchhhcccccCCc------------CCCCCCCcccccc Confidence 55442 3344555666644311111111122222221111 1111111111122 No 128 >protein:vir:9702 Length: 406 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:174 # MgeName: 315.2 # Cross-refs: genbank:acc:NP_795464;genbank:gi:28876227;genbank:GeneID:1257772 Probab=37.28 E-value=1.1 Score=20.30 Aligned_cols=281 Identities=15% Similarity=0.145 Sum_probs=110.6 Q ss_pred CCCchhhHHHhhhhhhheeeccccc-------------c--ccCCCceeecHhHhhhhhcccccCCCCcchhhHHHHHHH Q lcl|NC_015285. 1 MRGVDLNQQLTQKAAEYFLYNPKGL-------------K--NSTNQGMKITTDSVTYCHSGIQDLNKNMTLSHLHKAIKA 65 (359) Q Consensus 1 ~~~~~~~~~~~~~~~e~f~yn~~~~-------------~--~~~~~~v~i~~~ai~y~hSGl~d~~~~~i~syL~~Aik~ 65 (359) .++...| ...+.+..+|... . ...+..+.++.+-|.+.. ..+.++=.=+|-|..|.++ T Consensus 108 ~r~~~~g-----~~~~L~~i~p~~v~v~~~~~~~~~y~~~~~~~~~~~~~~~~evih~r--~~~~dg~~G~spi~~~~~~ 180 (406) T protein:vir:97 108 LRDPKTN-----QALQFQFYRPSETTVEETDNHEIVYTFTDMLTAKQVKCFAHDVIHWK--FFSHDTILGRSPLLSLGDE 180 (406) T ss_pred EecCCCC-----eEEEEEEECCCeeEEEEcCCceEEEEEEecCCceEEEEccccEEEec--CCCCCCcccccHHHHHHHH Confidence 1111111 1223333333211 0 012233566666665442 3343432236778888888 Q ss_pred HHHHHHHHHHHHHHHHhcCccceeEeccCCCCchHHHHHHHHHHHHhhcceEEeeCCCCcccccccchhhHhhhcccccC Q lcl|NC_015285. 66 VNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKMVYDANTGEIKDDKKFMSMMEDFWLPRRE 145 (359) Q Consensus 66 ~NqL~m~EDalVIyR~~RAPeRRvFyIDvGnlpk~KAeqYl~~iM~kyrnklvYD~~TGevkdd~~~mSMlEDywLpRRe 145 (359) +.....+++...=+-.--++- .++...-+.|-+..+++. ++-+.++.. ...+|.+ + .++ T Consensus 181 i~~~~a~~~~~~~~f~ng~~~-~~i~~~~~~l~~e~~~~~-~~~~~~~~~----g~n~g~~------~-vl~-------- 239 (406) T protein:vir:97 181 IDLQTGGINTLIKFFKDGFSS-GILTMKGAQLSGDARQRA-RQEFEKMRE----GSVGGSP------L-VFD-------- 239 (406) T ss_pred HHHHHHHHHHHHHHHhccCCC-ceEEecCCCCCHHHHHHH-HHHHHHHhc----ccccCce------e-ecC-------- Confidence 887777777655443344553 455555566655544443 332333211 0122322 1 111 Q ss_pred CCCccceeecCCCCCcch-HHHHHHHHHHHHHhcCCCccccCCCCcccccchhhhhHHhhhHHHHHHHHHHHHHHHHHHH Q lcl|NC_015285. 146 GGRGTEISTLPGGQNLGE-LEDVKYFQKKLYKALNVPSSRLETETTFNIGRAAEITRDEVKFQKFIARLRKRFSELFTDL 224 (359) Q Consensus 146 GgrgTEIsTLpGgqnLge-i~DV~YF~kkLy~aL~VP~SRl~~~~~~~~g~~~eItRDElKF~KFI~rLr~rFs~if~d~ 224 (359) .|.+++.|.-..+-.| ++--+|-.+.+-++.+||...|...+..+ ..++..+. |.+++ |+-.+. .+.+. T Consensus 240 --~g~~~~~l~~~~~d~q~le~~~~~~~~Ia~afgVPp~~lg~~~~~~--~~e~~~~~---f~~~~--l~P~~~-~ie~~ 309 (406) T protein:vir:97 240 --STMEYTPLEIDTNVLQLITSNNFSTAQIAKALRVPSYKLGVNSPNQ--SVAQLMED---YVTND--LPFYFD-AITSE 309 (406) T ss_pred --CCceEEEccCCHHHHHHHHHHHhhHHHHHHHhCCCHHHcCCCCCcc--hHHHHHHH---HHHHH--HHHHHH-HHHHH Confidence 2456666642221111 22223336778889999999996433321 22333332 65542 333222 22333 Q ss_pred HHHHHHhcCCCChhHHHHHhhceeeeeeccchHHHHHHHHHHHHHHHHHHHhhhhcchhhhHHHHHHHHhCCCHHHHHHH Q lcl|NC_015285. 225 LKTQLILKGVMSLEEWEDMKNHIQFDFIADNYFTELKEIEIRNERMNQVAAMDPYVGKYFSVDYMRRQVLKQTEIEIKEI 304 (359) Q Consensus 225 Lk~QLiLkgI~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~~~~~dp~vGKy~S~~~i~k~IL~~tDeeI~e~ 304 (359) |-.. +.++.+|.. -.|++++.. +++.|++.+..+-. +-. ||..|+-+ T Consensus 310 l~~k-----ll~~~~~~~--~~i~fd~~~-----------~~~~~~~~~~~~~~--~g~------------~T~NE~R~- 356 (406) T protein:vir:97 310 LGLK-----TLNDKDRRL--YHIEFDTRS-----------VTGRNVDEIVKLVN--NQI------------LTPNQGLV- 356 (406) T ss_pred Hhhh-----hcChhhccc--eeEEEecCc-----------cchhhHHHHHHHHh--CCC------------cCHHHHHH- Confidence 3333 345666542 234444322 23445544433211 012 55555522 Q ss_pred HHHHHHHHhcCCCCCCcchh--hhcCCCCCcccccccCCCCCcCCCCCCCCCccCCC Q lcl|NC_015285. 305 DEQIASEMEAGIIADPMAEM--DPAMAAGGEGAPAAEVDPNAQESSVDPGDVRRGEF 359 (359) Q Consensus 305 ~kqi~~E~~~~~~~~P~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~ 359 (359) +...+.+++|+... -+..-.+.+....+ -++........+.++.+++= T Consensus 357 ------~~g~~p~~~~~gD~~~~~~n~~~~~~~~~~-~~~~~~~~~gg~~~~~~~~~ 406 (406) T protein:vir:97 357 ------ELGKQKSTDPNMDRYQSSLNYVFLDKKEEY-QDKVGIKGKGGEVNAEEDKS 406 (406) T ss_pred ------HhCCCCCCCCCCCeEeeccCccchhccccc-ccccccccCCCCCCCCCCCC Confidence 11222333332100 00000011110000 00001111122222222222 No 129 >protein:vir:1023 Length: 392 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:20 # MgeName: bIL286 # Cross-refs: genbank:acc:NP_076677;genbank:gi:13095786;genbank:GeneID:920364 Probab=37.00 E-value=1.1 Score=20.27 Aligned_cols=263 Identities=12% Similarity=0.130 Sum_probs=103.2 Q ss_pred CCCchhhHHH---hhhhhhheeecccccc------------------ccCCCceeecHhHhhhhhcccccCCCCc-chhh Q lcl|NC_015285. 1 MRGVDLNQQL---TQKAAEYFLYNPKGLK------------------NSTNQGMKITTDSVTYCHSGIQDLNKNM-TLSH 58 (359) Q Consensus 1 ~~~~~~~~~~---~~~~~e~f~yn~~~~~------------------~~~~~~v~i~~~ai~y~hSGl~d~~~~~-i~sy 58 (359) +.++.--..+ -..+.+.+..+|.... ...+.-+.++.+-|.+.. ..+.++.. =+|. T Consensus 105 l~Gna~~~i~r~~~g~~~~L~~l~~~~v~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~eiih~~--~~~~~~~~~G~s~ 182 (392) T protein:vir:10 105 LGGEAFAYRWRNANGADMKWEYLRPSQVNTYYFEYENGMYYNITFDDPKIEPILQAPQSDLIHMK--LLSIDGGKTGISP 182 (392) T ss_pred hcCcEEEEEEECCCCcEEEEEEEcCceeEEEEcCCCceEEEEEEecCcccceeEEEccccEEEec--CCCCCCccccccH Confidence 1111100000 0112233333332110 011112456667776654 12333332 2788 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhcCccceeEeccCCCCchHHHHHHHHHHHHhhcceEEeeCCCCcccccccchhhHhh Q lcl|NC_015285. 59 LHKAIKAVNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKMVYDANTGEIKDDKKFMSMMED 138 (359) Q Consensus 59 L~~Aik~~NqL~m~EDalVIyR~~RAPeRRvFyIDvGnlpk~KAeqYl~~iM~kyrnklvYD~~TGevkdd~~~mSMlED 138 (359) |..|...++....+++...-+=---+--+-+..++.+..+..++.+.++ ..|+.. ...|.+ + .+ T Consensus 183 i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~~~~---~~~~~~----~~~g~~------~-vl-- 246 (392) T protein:vir:10 183 LYSLRRESKIQRASDRLTISSLNSSLNVPGVLTVKGGGLLSDKDKASRS---RSFMKR----SRSGGP------V-VL-- 246 (392) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCchHHHHHHHH---HHHhcc----ccCCCe------e-ec-- Confidence 9999999999888887665443334555666777665545444433322 233321 111211 1 11 Q ss_pred hcccccCCCCccceeecCCCCCcch-HHHHHHHHHHHHHhcCCCccccCCCCcccccchhhhhHHhhhHHHHH-HHHHHH Q lcl|NC_015285. 139 FWLPRREGGRGTEISTLPGGQNLGE-LEDVKYFQKKLYKALNVPSSRLETETTFNIGRAAEITRDEVKFQKFI-ARLRKR 216 (359) Q Consensus 139 ywLpRReGgrgTEIsTLpGgqnLge-i~DV~YF~kkLy~aL~VP~SRl~~~~~~~~g~~~eItRDElKF~KFI-~rLr~r 216 (359) ..|++++.|.-...-.| ++=.+|..+.+.++++||...|+..+..+ ...+-. ..|..++ .-+-.+ T Consensus 247 --------~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVpp~~lg~~~~~~--~~~~~~---~~f~~~~l~P~~~~ 313 (392) T protein:vir:10 247 --------DDLEEFTALEIKSNVAQLLSQTDWTSKQYAKVYGLPDSYIGGQGDQQ--SSIQQI---SGMYASALNRYLRP 313 (392) T ss_pred --------CCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCcc--cHHHHH---HHHHHHHHHHHHHH Confidence 12567777754333333 55567888999999999999996433221 111111 1244322 222222 Q ss_pred HHHHHHHHHHHHHHhcCCCChhHHHHHhhceeeeeeccchHHHHHHHHHHHHHHHHHHHhhhhcchhhhHHHHHHHHhCC Q lcl|NC_015285. 217 FSELFTDLLKTQLILKGVMSLEEWEDMKNHIQFDFIADNYFTELKEIEIRNERMNQVAAMDPYVGKYFSVDYMRRQVLKQ 296 (359) Q Consensus 217 Fs~if~d~Lk~QLiLkgI~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~~~~~dp~vGKy~S~~~i~k~IL~~ 296 (359) +.. .|...|. +. +.+++. ..+ +... ..+.+.+..+ + ...+ + T Consensus 314 ie~----~l~~~L~-----~~---------~~~d~~--~~~----~~d~-~~~~~~~~~l---~---------~~g~--~ 354 (392) T protein:vir:10 314 AIS----ELEYKLS-----DH---------ISVNMR--PAI----DPLG-DNYLSTISTA---T---------RWGA--L 354 (392) T ss_pred HHH----HHHHhcc-----cc---------ccccch--hhh----ccCH-HHHHHHHHHH---H---------hCCC--c Confidence 222 2222221 11 111110 000 0000 0111111111 0 0111 3 Q ss_pred CHHHHHHHHHHHHHHHhcCCCCCCcc-hhhhcCCCCCcccccccCCC Q lcl|NC_015285. 297 TEIEIKEIDEQIASEMEAGIIADPMA-EMDPAMAAGGEGAPAAEVDP 342 (359) Q Consensus 297 tDeeI~e~~kqi~~E~~~~~~~~P~~-~~~~~~~~~~~~~~~~~~~~ 342 (359) |..|.-++. ...|.. |++ ....+.++-++|+. .+-.| T Consensus 355 t~nE~r~~l------~~~g~~--p~e~r~~e~l~~~~~Gd~-~~p~p 392 (392) T protein:vir:10 355 AENQATFVL------QEAGYI--PKDLPAPENTNKKTTGQS-NEPVP 392 (392) T ss_pred CHHHHHHHH------HhcCCC--ccccchhcCCCCCCCCCC-CCCCC Confidence 444433221 123433 322 11112222121211 10011 No 130 >protein:vir:3989 Length: 392 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:319 # MgeName: BK5-T # Cross-refs: genbank:acc:NP_116497;genbank:gi:14251130;genbank:GeneID:921299 Probab=37.00 E-value=1.1 Score=20.27 Aligned_cols=263 Identities=12% Similarity=0.130 Sum_probs=103.2 Q ss_pred CCCchhhHHH---hhhhhhheeecccccc------------------ccCCCceeecHhHhhhhhcccccCCCCc-chhh Q lcl|NC_015285. 1 MRGVDLNQQL---TQKAAEYFLYNPKGLK------------------NSTNQGMKITTDSVTYCHSGIQDLNKNM-TLSH 58 (359) Q Consensus 1 ~~~~~~~~~~---~~~~~e~f~yn~~~~~------------------~~~~~~v~i~~~ai~y~hSGl~d~~~~~-i~sy 58 (359) +.++.--..+ -..+.+.+..+|.... ...+.-+.++.+-|.+.. ..+.++.. =+|. T Consensus 105 l~Gna~~~i~r~~~g~~~~L~~l~~~~v~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~~eiih~~--~~~~~~~~~G~s~ 182 (392) T protein:vir:39 105 LGGEAFAYRWRNANGADMKWEYLRPSQVNTYYFEYENGMYYNITFDDPKIEPILQAPQSDLIHMK--LLSIDGGKTGISP 182 (392) T ss_pred hcCcEEEEEEECCCCcEEEEEEEcCceeEEEEcCCCceEEEEEEecCcccceeEEEccccEEEec--CCCCCCccccccH Confidence 1111100000 0112233333332110 011112456667776654 12333332 2788 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHhcCccceeEeccCCCCchHHHHHHHHHHHHhhcceEEeeCCCCcccccccchhhHhh Q lcl|NC_015285. 59 LHKAIKAVNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKMVYDANTGEIKDDKKFMSMMED 138 (359) Q Consensus 59 L~~Aik~~NqL~m~EDalVIyR~~RAPeRRvFyIDvGnlpk~KAeqYl~~iM~kyrnklvYD~~TGevkdd~~~mSMlED 138 (359) |..|...++....+++...-+=---+--+-+..++.+..+..++.+.++ ..|+.. ...|.+ + .+ T Consensus 183 i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~~~~~~~~~~~~~---~~~~~~----~~~g~~------~-vl-- 246 (392) T protein:vir:39 183 LYSLRRESKIQRASDRLTISSLNSSLNVPGVLTVKGGGLLSDKDKASRS---RSFMKR----SRSGGP------V-VL-- 246 (392) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhccCCCceEEEeCCCCCchHHHHHHHH---HHHhcc----ccCCCe------e-ec-- Confidence 9999999999888887665443334555666777665545444433322 233321 111211 1 11 Q ss_pred hcccccCCCCccceeecCCCCCcch-HHHHHHHHHHHHHhcCCCccccCCCCcccccchhhhhHHhhhHHHHH-HHHHHH Q lcl|NC_015285. 139 FWLPRREGGRGTEISTLPGGQNLGE-LEDVKYFQKKLYKALNVPSSRLETETTFNIGRAAEITRDEVKFQKFI-ARLRKR 216 (359) Q Consensus 139 ywLpRReGgrgTEIsTLpGgqnLge-i~DV~YF~kkLy~aL~VP~SRl~~~~~~~~g~~~eItRDElKF~KFI-~rLr~r 216 (359) ..|++++.|.-...-.| ++=.+|..+.+.++++||...|+..+..+ ...+-. ..|..++ .-+-.+ T Consensus 247 --------~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~~fgVpp~~lg~~~~~~--~~~~~~---~~f~~~~l~P~~~~ 313 (392) T protein:vir:39 247 --------DDLEEFTALEIKSNVAQLLSQTDWTSKQYAKVYGLPDSYIGGQGDQQ--SSIQQI---SGMYASALNRYLRP 313 (392) T ss_pred --------CCCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCcc--cHHHHH---HHHHHHHHHHHHHH Confidence 12567777754333333 55567888999999999999996433221 111111 1244322 222222 Q ss_pred HHHHHHHHHHHHHHhcCCCChhHHHHHhhceeeeeeccchHHHHHHHHHHHHHHHHHHHhhhhcchhhhHHHHHHHHhCC Q lcl|NC_015285. 217 FSELFTDLLKTQLILKGVMSLEEWEDMKNHIQFDFIADNYFTELKEIEIRNERMNQVAAMDPYVGKYFSVDYMRRQVLKQ 296 (359) Q Consensus 217 Fs~if~d~Lk~QLiLkgI~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~~~~~dp~vGKy~S~~~i~k~IL~~ 296 (359) +.. .|...|. +. +.+++. ..+ +... ..+.+.+..+ + ...+ + T Consensus 314 ie~----~l~~~L~-----~~---------~~~d~~--~~~----~~d~-~~~~~~~~~l---~---------~~g~--~ 354 (392) T protein:vir:39 314 AIS----ELEYKLS-----DH---------ISVNMR--PAI----DPLG-DNYLSTISTA---T---------RWGA--L 354 (392) T ss_pred HHH----HHHHhcc-----cc---------ccccch--hhh----ccCH-HHHHHHHHHH---H---------hCCC--c Confidence 222 2222221 11 111110 000 0000 0111111111 0 0111 3 Q ss_pred CHHHHHHHHHHHHHHHhcCCCCCCcc-hhhhcCCCCCcccccccCCC Q lcl|NC_015285. 297 TEIEIKEIDEQIASEMEAGIIADPMA-EMDPAMAAGGEGAPAAEVDP 342 (359) Q Consensus 297 tDeeI~e~~kqi~~E~~~~~~~~P~~-~~~~~~~~~~~~~~~~~~~~ 342 (359) |..|.-++. ...|.. |++ ....+.++-++|+. .+-.| T Consensus 355 t~nE~r~~l------~~~g~~--p~e~r~~e~l~~~~~Gd~-~~p~p 392 (392) T protein:vir:39 355 AENQATFVL------QEAGYI--PKDLPAPENTNKKTTGQS-NEPVP 392 (392) T ss_pred CHHHHHHHH------HhcCCC--ccccchhcCCCCCCCCCC-CCCCC Confidence 444433221 123433 322 11112222121211 10011 No 131 >protein:vir:6240 Length: 457 # NCBI annotation: gp34 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:131 # MgeName: phi-BT1 # Cross-refs: genbank:acc:NP_813694;swissprot:trembl:q859c3;genbank:gi:29366754;interpro:IPR006427;interpro:IPR006944;uniprot:Q859C3;genbank:GeneID:1258894 Probab=36.52 E-value=1.2 Score=20.21 Aligned_cols=296 Identities=15% Similarity=0.124 Sum_probs=123.4 Q ss_pred CCCchhhHHHhhhhhhheeeccccc--------------------cccCCC--ceeecHhHhhhhhcccccCCCC-cchh Q lcl|NC_015285. 1 MRGVDLNQQLTQKAAEYFLYNPKGL--------------------KNSTNQ--GMKITTDSVTYCHSGIQDLNKN-MTLS 57 (359) Q Consensus 1 ~~~~~~~~~~~~~~~e~f~yn~~~~--------------------~~~~~~--~v~i~~~ai~y~hSGl~d~~~~-~i~s 57 (359) ++... | .+.+.+..+|... ...++. ...++.+-|.++. ....++. .=+| T Consensus 123 i~~~~-g-----~~~~l~~l~p~~v~v~~~~~~~~~~~~~~~y~~~~~g~~~~~~~~~~~eiih~r--~~~~~~~~~G~s 194 (457) T protein:vir:62 123 VRWAG-P-----NIAGLDVLDPTKIHVHMVMVDGLRRKVFEAYDIDADGNEVLLGWFTPRDVLHIP--GMMLPGDFVGCS 194 (457) T ss_pred EEeCC-C-----cEEEEEEEcCcceEEEEeccCCccceeEEEEEEccCCceeEEEeeCccceEEec--CCCCCCceeccc Confidence 11110 0 1112222222111 011111 1345666665543 2223332 3467 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHhcCccceeEeccCCCCchHHHHHHHHHHHHhhcceEEeeCCCCcccccccchhhHh Q lcl|NC_015285. 58 HLHKAIKAVNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKMVYDANTGEIKDDKKFMSMME 137 (359) Q Consensus 58 yL~~Aik~~NqL~m~EDalVIyR~~RAPeRRvFyIDvGnlpk~KAeqYl~~iM~kyrnklvYD~~TGevkdd~~~mSMlE 137 (359) -++.|++.+.....+++...-+=---+--+-|..++ |.|-+..+++..+.+-..|+.. ...|. .+ .++ T Consensus 195 p~~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~-~~ls~e~~~~~~~~~~~~~~G~----~nag~------~~-vl~ 262 (457) T protein:vir:62 195 PISYARESIGLALAAQKYGAHFFRNGAMPGAVVEVP-GTMSEEGLARAREAWRAANSGV----DNAHR------VA-LLT 262 (457) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhccCCcceEEEcC-CCCCHHHHHHHHHHHHHHhcCc----cccCc------ce-ecC Confidence 788888888888888776654433334555677776 5676655555444443334311 01121 11 121 Q ss_pred hhcccccCCCCccceeecCCCCCcchH---HHHHHHHHHHHHhcCCCccccCCCCcccccchhhhhHHhhhHHHHHHHHH Q lcl|NC_015285. 138 DFWLPRREGGRGTEISTLPGGQNLGEL---EDVKYFQKKLYKALNVPSSRLETETTFNIGRAAEITRDEVKFQKFIARLR 214 (359) Q Consensus 138 DywLpRReGgrgTEIsTLpGgqnLgei---~DV~YF~kkLy~aL~VP~SRl~~~~~~~~g~~~eItRDElKF~KFI~rLr 214 (359) .|.+++.|. .+..++ +=-+|-...+.++++||...|+.-++-+... +.+...-+.|.+++ |+ T Consensus 263 ----------~g~~~~~l~--~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~-sn~eq~~~~f~~~~--l~ 327 (457) T protein:vir:62 263 ----------EGAKFSKVA--MSPDEAQFLQTRQFQVPEIARIFGVPPHLISDATNSTSWG-SGLAEQNIAFTMFS--LR 327 (457) T ss_pred ----------CCceEEEcc--CChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCccccc-chHHHHHHHHHHHH--HH Confidence 244555552 333333 3334677889999999999996543332211 22232333366653 32 Q ss_pred HHHHHHHHHHHHHHHHhcCCCChhHHHHHhhceeeeeeccchHHHHHHHHHHHHHHHHHHHhhhhcchhhhHHHHHHHHh Q lcl|NC_015285. 215 KRFSELFTDLLKTQLILKGVMSLEEWEDMKNHIQFDFIADNYFTELKEIEIRNERMNQVAAMDPYVGKYFSVDYMRRQVL 294 (359) Q Consensus 215 ~rFs~if~d~Lk~QLiLkgI~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~~~~~dp~vGKy~S~~~i~k~IL 294 (359) --+ ..+.+.|-..| +++.+. ....+.|..+. +... -+..|++.+..+-.- -+++.+-+++. + T Consensus 328 P~~-~~ie~~ln~~L-----~~~~~~----~~~~i~fd~~~----l~~~-d~~~r~~~~~~~~~~--G~~T~NE~R~~-~ 389 (457) T protein:vir:62 328 PWL-ERIEAGFNRLL-----FAETAD----RFRFVKFNLDE----IKRG-APKERMELWSLGLQN--GIYSIDEVRAA-E 389 (457) T ss_pred HHH-HHHHHHHHhhh-----cCcccc----CceEEEeechh----hhcc-CHHHHHHHHHHHHhC--CCcCHHHHHHH-h Confidence 211 23333344443 333332 23344554332 2111 224566666554322 35566666643 5 Q ss_pred CCCHHHHHHHHHHHHHHHhcCCCCCCc-----chhhhcCCCCC-cccccccCCC-CCcCCCCCCCCCccCCC Q lcl|NC_015285. 295 KQTEIEIKEIDEQIASEMEAGIIADPM-----AEMDPAMAAGG-EGAPAAEVDP-NAQESSVDPGDVRRGEF 359 (359) Q Consensus 295 ~~tDeeI~e~~kqi~~E~~~~~~~~P~-----~~~~~~~~~~~-~~~~~~~~~~-~~~~~~~~p~~~~~~~~ 359 (359) +|..-+= . ..+.+..|- ..+....+.++ ....++..+| ...+..+..+++-+++. T Consensus 390 gl~pi~~---------g-~~D~~~~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~ 451 (457) T protein:vir:62 390 DMTPLPD---------G-LGEKYRVPLNLGEIGEEPEPEPAPAPPAIDPPAEEPADDEEPDNAEGDPDEGET 451 (457) T ss_pred CCCCCCC---------C-CcceeeeccccccccccccccccCCCccCCCCccCCCCCCCCCCCCCCCccccc Confidence 6643210 0 000000000 00000000000 0000111112 12233444555555555 No 132 >protein:vir:3648 Length: 695 # NCBI annotation: gp17 # Family: family:all:297 # MgeID: mge:75 # MgeName: Bcep781 # Cross-refs: genbank:acc:NP_705643;genbank:gi:23752328;genbank:GeneID:955749 Probab=34.98 E-value=1.3 Score=20.04 Aligned_cols=302 Identities=15% Similarity=0.205 Sum_probs=112.1 Q ss_pred CCCch---------hhHHHhhh-hhhheeeccccccc--------------cCC----CceeecHhHhhhhhc-ccc--- Q lcl|NC_015285. 1 MRGVD---------LNQQLTQK-AAEYFLYNPKGLKN--------------STN----QGMKITTDSVTYCHS-GIQ--- 48 (359) Q Consensus 1 ~~~~~---------~~~~~~~~-~~e~f~yn~~~~~~--------------~~~----~~v~i~~~ai~y~hS-Gl~--- 48 (359) |++.. ....|-++ ++-+-+-+|.+..- -|. .|-+|+.|=+...++ -+- T Consensus 206 i~gdd~~l~~PL~~~~~~I~kGslKGl~ViDp~~vtP~~~n~~dP~spdfgkP~~y~V~G~kIH~SRL~~f~g~plPd~L 285 (695) T protein:vir:36 206 IKGDDQIMDTPLVPRPYTVPKGSFQGLRVVEPYWVTPNNYNSINPVADDFYKPSTWWMIGTEVHATRLHTIVSRPVGDML 285 (695) T ss_pred eccCccccccccccccccccCcceeeeEeecccccccchhhhccchhhccCCCceEEEeceEEeeeeEEEecCCCchhhh Confidence 22211 01112121 11122222222110 000 022343332211111 000 Q ss_pred -cCCCCcchhhHHHHHHHHHH-HHHHHHHHHHHHHhcCccceeEeccCC-CCchHHHHHHH--HHHHHhhcceEEeeCCC Q lcl|NC_015285. 49 -DLNKNMTLSHLHKAIKAVNQ-LRMIEDSLVIYRLSRAPERRIFYIDVG-NLPKNKAEQYL--REVMGRYRNKMVYDANT 123 (359) Q Consensus 49 -d~~~~~i~syL~~Aik~~Nq-L~m~EDalVIyR~~RAPeRRvFyIDvG-nlpk~KAeqYl--~~iM~kyrnklvYD~~T 123 (359) ..-+..=+|.+..+..-..+ +++...+.=+ +.++.-+ ++-.|.. -|.....++.. -+++++||+-. T Consensus 286 Kp~y~~~GiSv~q~~~e~V~~~~rT~~~v~~L--i~~~~v~-~lk~dla~aL~~g~~~~l~~R~eli~~~Rsn~------ 356 (695) T protein:vir:36 286 KPTYSFAGISMTQLAMPYIDNWLRTRQSVSDI--VKQFSVS-GILMDLAQALMPGANVDLSMRAELINRYRDNR------ 356 (695) T ss_pred hcccccCcccHHHHHHHHHHHHHHHHhHHHHH--HHhhhHH-HHHHHHHHhhcChhHHHHHHHHHHHHHhcCcc------ Confidence 00001112333333322211 2222222111 1111110 0011211 00111112222 25556665211 Q ss_pred CcccccccchhhHhhhcccccCCCCccceeecCCCCCcchHHHHH-HHHHHHHHhcCCCccccCC--CCcccc-cchhhh Q lcl|NC_015285. 124 GEIKDDKKFMSMMEDFWLPRREGGRGTEISTLPGGQNLGELEDVK-YFQKKLYKALNVPSSRLET--ETTFNI-GRAAEI 199 (359) Q Consensus 124 Gevkdd~~~mSMlEDywLpRReGgrgTEIsTLpGgqnLgei~DV~-YF~kkLy~aL~VP~SRl~~--~~~~~~-g~~~eI 199 (359) |-+--|+ =.|||- +. ..+|+-++||. =|..-+=-+.+||+.||=. -.|||- |-+ T Consensus 357 G~~llDk----~~Eefe-------------q~--stslSGLddVi~qf~q~VAgaa~IPltkLfGqSPkGlNATGE~--- 414 (695) T protein:vir:36 357 NILFLDK----ATEEFF-------------QF--NTPLSGLDALQAQAQEQMSAVSHIPLIKLLGITPTGLNASSEG--- 414 (695) T ss_pred ceEEEec----CCcceE-------------EE--ecccCCHHHHHHHHHHHHHhhhcCchhhhhccCcccccccchh--- Confidence 1111000 013442 11 24788889975 4888888899999999943 368875 333 Q ss_pred hHHhhhHHHHHHHHHHHHHHHHHHHHHH-----HHHhcCCCChhHHHHHhhceeeeeeccchHHHHHHHHHHHHHHHHHH Q lcl|NC_015285. 200 TRDEVKFQKFIARLRKRFSELFTDLLKT-----QLILKGVMSLEEWEDMKNHIQFDFIADNYFTELKEIEIRNERMNQVA 274 (359) Q Consensus 200 tRDElKF~KFI~rLr~rFs~if~d~Lk~-----QLiLkgI~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~~~ 274 (359) |.-.|...|..+|. ..+...|++ |+-.-|.+. ..|.|.|+.=..-+|..-+||...+.+... T Consensus 415 --D~rnYYD~I~s~Qe---~~L~p~L~rl~~ii~rS~~G~id--------pdi~~~fnPL~qmtd~EkAeI~~k~A~~d~ 481 (695) T protein:vir:36 415 --EIRVWYDYVRAYQR---NALQQLMNDVIVMIQLSLFGAVD--------PSIKWQWNALRELDDLEVAESRYKQAQSDV 481 (695) T ss_pred --hHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHhcCCCC--------CcceEEeCCCCCcCHHHHHHHHhhhhHHHH Confidence 44459999998885 334444433 444445443 458899998777888888888888877644 Q ss_pred HhhhhcchhhhHHHHHHHH-----------h-------CCCHHHHHHHHHHHHHHHhcCCCCCCcchhhhcCCCCCcccc Q lcl|NC_015285. 275 AMDPYVGKYFSVDYMRRQV-----------L-------KQTEIEIKEIDEQIASEMEAGIIADPMAEMDPAMAAGGEGAP 336 (359) Q Consensus 275 ~~dp~vGKy~S~~~i~k~I-----------L-------~~tDeeI~e~~kqi~~E~~~~~~~~P~~~~~~~~~~~~~~~~ 336 (359) ..-.- -.++.+-|...+ + .-+|+||.-+...- ++.. ...+.|.+|+... T Consensus 482 ~~~~~--gvI~~~evr~rL~~d~~s~Y~~~~D~~d~p~~~~~~~~~~~~~~~----------~~~~-~~~~~~~~~~~~~ 548 (695) T protein:vir:36 482 LYVQE--QVIRPDQVAARLNTEPDGPYAGKLDANDDPGVPADDDIDGVLTYV----------QRLA-EGGDTGAPGGARA 548 (695) T ss_pred HHHHh--cCCCHHHHHHHHhcCCCcccccccccccCCCcCccchhhhhHhhh----------cCcc-cccccCCCCcccc Confidence 43211 011222222111 0 01111121111100 0100 0111111111111 Q ss_pred cccCCCCCcCCCCCCCCCccCCC Q lcl|NC_015285. 337 AAEVDPNAQESSVDPGDVRRGEF 359 (359) Q Consensus 337 ~~~~~~~~~~~~~~p~~~~~~~~ 359 (359) ....+|...+.....-..+-|-+ T Consensus 549 g~~~~~~v~~~~~~~~~~~ag~~ 571 (695) T protein:vir:36 549 GATAPPTVANVNANVNPREAGAQ 571 (695) T ss_pred cccCCCcccccccccCccccCCC Confidence 11123333222222222222333 No 133 >protein:vir:483 Length: 413 # NCBI annotation: putative portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:11 # MgeName: P27 # Cross-refs: genbank:acc:NP_543090;swissprot:trembl:q8w629;genbank:gi:18249902;uniprot:Q8W629;genbank:GeneID:929685 Probab=34.50 E-value=1.3 Score=19.98 Aligned_cols=287 Identities=15% Similarity=0.156 Sum_probs=122.1 Q ss_pred CCCchhhHHH---hhhhhhheeeccccc--------------cccCCCceeecHhHhhhhhcccccCCCCcchhhHHHHH Q lcl|NC_015285. 1 MRGVDLNQQL---TQKAAEYFLYNPKGL--------------KNSTNQGMKITTDSVTYCHSGIQDLNKNMTLSHLHKAI 63 (359) Q Consensus 1 ~~~~~~~~~~---~~~~~e~f~yn~~~~--------------~~~~~~~v~i~~~ai~y~hSGl~d~~~~~i~syL~~Ai 63 (359) +.++.- +.+ -+.+.+.+..+|... ....+....++.+-|.+... ...++-.-+|-|+.|. T Consensus 109 l~Gn~~-~~i~~~~g~~~~L~~l~~~~v~~~~~~~~~~~y~~~~~~g~~~~~~~~evih~~~--~~~d~~~G~s~i~~~~ 185 (413) T protein:vir:48 109 LRGNFY-AYKVKALGEVVELLPIDPGCVEPKLNSQWQPVYQVTFPDGSVDVLTQDEIWHVRT--LTLDGLVGLNPIAYAR 185 (413) T ss_pred hcCceE-EEEEeCCCcEEEEEEEcCceEEEEEcCCceEEEEEEecCceEEEEccccEEEecC--cCCCCcccccHHHHHH Confidence 111100 000 011223333222210 11122335677777776542 1234444567888888 Q ss_pred HHHHHHHHHHHHHHHHHHhcCccceeEeccCCCCchHHHHHHHHHHHHhhcceEEeeCCCCcccccccchhhHhhhcccc Q lcl|NC_015285. 64 KAVNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKMVYDANTGEIKDDKKFMSMMEDFWLPR 143 (359) Q Consensus 64 k~~NqL~m~EDalVIyR~~RAPeRRvFyIDvGnlpk~KAeqYl~~iM~kyrnklvYD~~TGevkdd~~~mSMlEDywLpR 143 (359) +++.....+++...-+---.+.-+-|+.++ +.+.+..+++-.+.+...|+.- + ..|.+ + .+ T Consensus 186 ~~i~~~~~~~~~~~~~~~ng~~p~gil~~~-~~~~~e~~~~~~~~~~~~~~g~---~-n~g~~------~-vl------- 246 (413) T protein:vir:48 186 EAISLAAATEEHGARLFGNGAVTSGVLRTE-QKLTPDAYERLKKDFEERHTGL---G-NAHRP------M-IL------- 246 (413) T ss_pred HHHHHHHHHHHHHHHHHhccCCcceEEEeC-CCCCHHHHHHHHHHHHHHhcCc---c-ccCcc------e-ec------- Confidence 888888888777665555556667888887 5677776666666655555431 1 11211 1 11 Q ss_pred cCCCCccceeecCCC-CCcchHHHHHHHHHHHHHhcCCCccccCCCCcccccchhhhhHHhhhHHHHHHHHHHHHHHHHH Q lcl|NC_015285. 144 REGGRGTEISTLPGG-QNLGELEDVKYFQKKLYKALNVPSSRLETETTFNIGRAAEITRDEVKFQKFIARLRKRFSELFT 222 (359) Q Consensus 144 ReGgrgTEIsTLpGg-qnLgei~DV~YF~kkLy~aL~VP~SRl~~~~~~~~g~~~eItRDElKF~KFI~rLr~rFs~if~ 222 (359) ..|.++..|.-. +.+.-++-.++....+.++++||..-|...+.-+.....+.. +.|.++ .|+- +...+. T Consensus 247 ---~~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~n~e~~~---~~f~~~--~i~P-~~~~ie 317 (413) T protein:vir:48 247 ---EMGLDWKSMALNAEDSQFLETRKFQLEEICRLFRVPLHMVQNTDRATFNNIEELG---LGFINY--SLVP-YLTRIE 317 (413) T ss_pred ---CCCceEEeccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCcCCCcccHHHHH---HHHHHH--HHHH-HHHHHH Confidence 124566666321 222224455678899999999999999754333333333322 235544 2221 111122 Q ss_pred HHHHHHHHhcCCCChhHHHHHhhceeeeeeccchHHHHHHHHHHHHHHHHHHHhhhhcchhhhHHHHHHHHhCCCHHHHH Q lcl|NC_015285. 223 DLLKTQLILKGVMSLEEWEDMKNHIQFDFIADNYFTELKEIEIRNERMNQVAAMDPYVGKYFSVDYMRRQVLKQTEIEIK 302 (359) Q Consensus 223 d~Lk~QLiLkgI~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~~~~~dp~vGKy~S~~~i~k~IL~~tDeeI~ 302 (359) +. +-+.++++.++. ..+| .|..+ ++...+ +..|.++++.+-. +-+++.+-++. ++++.+-+ T Consensus 318 ~~-----l~~~L~~~~~~~--~~~~--~fd~~----~l~~~d-~~~~~~~~~~~~~--~g~~T~NE~R~-~~g~~p~~-- 378 (413) T protein:vir:48 318 QR-----INTGLVRESKQG--KFYA--KFNAG----ALLRGD-MKSRFEAYATGIN--WGIYSPNDCRD-LEDMNPRP-- 378 (413) T ss_pred HH-----HHhhccCccccC--CeEE--EEech----hhhccC-HHHHHHHHHHHHh--CCCcCHHHHHH-HhCCCCCC-- Confidence 22 334455666553 2234 44322 332221 2345555544321 13445555542 24443211 Q ss_pred HHHHHHHHHHhcCCCCCCcchhhhcCCCCCcccccccCCCCCcCCCC Q lcl|NC_015285. 303 EIDEQIASEMEAGIIADPMAEMDPAMAAGGEGAPAAEVDPNAQESSV 349 (359) Q Consensus 303 e~~kqi~~E~~~~~~~~P~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 349 (359) . -+.+-.|-. +.+....+....+..+ .+...+.+. T Consensus 379 g----------gD~~~~~~n-~~~~~~~~~~~~~~~~-~~~~~~~~~ 413 (413) T protein:vir:48 379 G----------GDVYLTPMN-MTTSPSAGDDNGKKKE-SGDADKTAS 413 (413) T ss_pred C----------cceeecccc-ccccccccccCCCCCC-CCCccccCC Confidence 0 000001110 0000000000000000 000000000 No 134 >protein:vir:101541 Length: 694 # NCBI annotation: gp17 # Family: family:all:297 # MgeID: mge:1477 # MgeName: Bcep43 # Cross-refs: genbank:acc:NP_958122;genbank:gi:41057668;genbank:GeneID:2716798 Probab=34.15 E-value=1.3 Score=19.94 Aligned_cols=302 Identities=15% Similarity=0.202 Sum_probs=112.8 Q ss_pred CCCch---------hhHHHhhh-hhhheeeccccccc--------------cCC----CceeecHhHhhhhhc-ccc--- Q lcl|NC_015285. 1 MRGVD---------LNQQLTQK-AAEYFLYNPKGLKN--------------STN----QGMKITTDSVTYCHS-GIQ--- 48 (359) Q Consensus 1 ~~~~~---------~~~~~~~~-~~e~f~yn~~~~~~--------------~~~----~~v~i~~~ai~y~hS-Gl~--- 48 (359) |++.. ....|-++ ++-+-+-+|.+..- .|. .|-+|+.|=+...++ -+- T Consensus 205 I~gdd~~l~~PL~~~~~~I~kGslKGl~ViDp~~vtP~~~n~~dP~spdfgkP~~y~V~G~~IH~SRL~~f~g~plPd~L 284 (694) T protein:vir:10 205 IKGDDQIMDTPLVPRPYTVPKGSFQGLRVVEPYWVTPNNYNSINPVADDFYKPSTWWMIGTEVHATRLHTIVSRPVGDML 284 (694) T ss_pred eecCccccccccccccccccCcceeeeEeecccccccchhhhccchhhccCCCceEEEeceEEeeeeEEEecCCCchhhh Confidence 22211 01112121 11122222222110 000 022343332211111 000 Q ss_pred -cCCCCcchhhHHHHHHHHHH-HHHHHHHHHHHHHhcCccceeEeccCC-CCchHHHHHHH--HHHHHhhcceEEeeCCC Q lcl|NC_015285. 49 -DLNKNMTLSHLHKAIKAVNQ-LRMIEDSLVIYRLSRAPERRIFYIDVG-NLPKNKAEQYL--REVMGRYRNKMVYDANT 123 (359) Q Consensus 49 -d~~~~~i~syL~~Aik~~Nq-L~m~EDalVIyR~~RAPeRRvFyIDvG-nlpk~KAeqYl--~~iM~kyrnklvYD~~T 123 (359) ..-+..=+|.+..+..-..+ +++...+.=+ ++.+.-+ ++-.|.. -|.....++.. -+++++||+-. T Consensus 285 Kp~y~~~G~Sv~q~~~e~V~~~~rT~~~v~~L--i~~~~v~-~lk~dla~~L~~g~~~~l~~R~eli~~~Rsn~------ 355 (694) T protein:vir:10 285 KPTYSFAGISMTQLAMPYIDNWLRTRQSVSDI--VKQFSVS-GILMDLAQALMPGANVDLSMRAELINRYRDNR------ 355 (694) T ss_pred hcccccCcccHHHHHHHHHHHHHHHHhHHHHH--HHhhhhH-HHHHHHHHhhcChhHHHHHHHHHHHHHhcCcc------ Confidence 00001112333333322211 2222222111 1111111 0111211 01111112222 25556665211 Q ss_pred CcccccccchhhHhhhcccccCCCCccceeecCCCCCcchHHHHH-HHHHHHHHhcCCCccccCC--CCcccc-cchhhh Q lcl|NC_015285. 124 GEIKDDKKFMSMMEDFWLPRREGGRGTEISTLPGGQNLGELEDVK-YFQKKLYKALNVPSSRLET--ETTFNI-GRAAEI 199 (359) Q Consensus 124 Gevkdd~~~mSMlEDywLpRReGgrgTEIsTLpGgqnLgei~DV~-YF~kkLy~aL~VP~SRl~~--~~~~~~-g~~~eI 199 (359) |-+--|+ =.|||- +. ..+|+-++||. =|..-+=-+.+||+.||=. -.|||- |-+ T Consensus 356 G~~llDk----~~Eefe-------------q~--stslSGLddVi~qf~q~VAgaa~IPltkLfGqSPkGlNATGE~--- 413 (694) T protein:vir:10 356 NILFLDK----ATEEFF-------------QF--NTPLSGLDALQAQAQEQMSAVSHIPLIKLLGITPTGLNASSEG--- 413 (694) T ss_pred ceEEEec----CCcceE-------------EE--ecccCCHHHHHHHHHHHHHhhhcCchhhhhccCcccccccchh--- Confidence 1111000 013442 11 24788889975 4888888899999999943 368875 333 Q ss_pred hHHhhhHHHHHHHHHHHHHHHHHHHHHH-----HHHhcCCCChhHHHHHhhceeeeeeccchHHHHHHHHHHHHHHHHHH Q lcl|NC_015285. 200 TRDEVKFQKFIARLRKRFSELFTDLLKT-----QLILKGVMSLEEWEDMKNHIQFDFIADNYFTELKEIEIRNERMNQVA 274 (359) Q Consensus 200 tRDElKF~KFI~rLr~rFs~if~d~Lk~-----QLiLkgI~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~~~ 274 (359) |.-.|...|..+|. ..+...|++ |+-.-|.+. ..|.|.|+.=..-+|..-+||...+.+... T Consensus 414 --D~rnYYD~I~s~Qe---~~L~p~L~rl~~ii~rS~~G~id--------p~i~~~fnPL~qmtd~EkAeI~~k~A~~d~ 480 (694) T protein:vir:10 414 --EIRVWYDYVRAYQR---NALQQLMNDVIVMIQLSLFGAVD--------PSIKWQWNALRELDDLEVAESRYKQAQSDV 480 (694) T ss_pred --hHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHhcCCCC--------CcceEEeCCCCCcCHHHHHHHHhhhhHHHH Confidence 44459999998885 334444433 444445443 458889998777888888888888877644 Q ss_pred HhhhhcchhhhHHHHHHHH-----------h-------CCCHHHHHHHHHHHHHHHhcCCCCCCcchhhhcCCCCCcccc Q lcl|NC_015285. 275 AMDPYVGKYFSVDYMRRQV-----------L-------KQTEIEIKEIDEQIASEMEAGIIADPMAEMDPAMAAGGEGAP 336 (359) Q Consensus 275 ~~dp~vGKy~S~~~i~k~I-----------L-------~~tDeeI~e~~kqi~~E~~~~~~~~P~~~~~~~~~~~~~~~~ 336 (359) ..-.- | .++.+-|...+ + .-+|+||.-+...- ++.. ...+.|.+|+... T Consensus 481 ~~~~~-g-vI~~~evr~rL~~d~~s~Y~~~~D~~d~p~~~~~~~~~~~~~~~----------~~~~-~~~~~~~~~~~~~ 547 (694) T protein:vir:10 481 LYVQE-Q-VIRPDQVAARLNTEPDGPYAGKLDANDDPGVPADDDIDGVLTYV----------QRLA-EGGDTGAPGGARA 547 (694) T ss_pred HHHHh-c-CCCHHHHHHHHhcCCCcccccccccccCCCcCccchhhhhHhhh----------cCcc-cccccCCCCcccc Confidence 43211 0 11222222111 0 01111221111100 0100 0111111111111 Q ss_pred cccCCCCCcCCCCCCCCCccCCC Q lcl|NC_015285. 337 AAEVDPNAQESSVDPGDVRRGEF 359 (359) Q Consensus 337 ~~~~~~~~~~~~~~p~~~~~~~~ 359 (359) ....+|...+.....-..+-|-+ T Consensus 548 g~~~~~~v~~~~~~~~~~~ag~~ 570 (694) T protein:vir:10 548 GATAPPTVANVNANVNPREAGAQ 570 (694) T ss_pred cccCCCcccccccccCccccCCC Confidence 11123333332222222222333 No 135 >protein:vir:98396 Length: 441 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1581 # MgeName: phiPVL(108) # Cross-refs: genbank:acc:YP_918929;genbank:gi:119443691;genbank:GeneID:4594558 Probab=33.58 E-value=1.3 Score=19.87 Aligned_cols=283 Identities=13% Similarity=0.119 Sum_probs=106.2 Q ss_pred CCCc--------hhhHHHhhhhhhheeecccc----------------cccc--CCCceeecHhHhhhhhcccccCCCCc Q lcl|NC_015285. 1 MRGV--------DLNQQLTQKAAEYFLYNPKG----------------LKNS--TNQGMKITTDSVTYCHSGIQDLNKNM 54 (359) Q Consensus 1 ~~~~--------~~~~~~~~~~~e~f~yn~~~----------------~~~~--~~~~v~i~~~ai~y~hSGl~d~~~~~ 54 (359) +.++ ..| ...+.+..+|.. .... .+....++.+-|.++.. .+.++=. T Consensus 130 l~Gnay~~i~r~~~G-----~~~~L~~i~~~~v~v~~~~~g~~~~~~~~~~~~~~~~~~~~~~~dviHir~--~~~dg~~ 202 (441) T protein:vir:98 130 LTSHGYIEITRDKTG-----EPMNLTFRKTSEIELKLDARGRLYYFHQRIDSNGNNIERNVKFEDMLDIKF--YSLDGIN 202 (441) T ss_pred hcCCeEEEEEEcCCC-----cEEEEEEEcCceeEEEECCCCcEEEEEEEeccCcceeeEEEccccEEEecc--CCCCCcc Confidence 1111 111 011111111110 0011 11224566666654421 2223222 Q ss_pred chhhHHHHHHHHHHHHHHHHHHHHHHHhcCccceeEeccCCCCchHHHHHHHH-HHHHhhcceEEeeCCCCcccccccch Q lcl|NC_015285. 55 TLSHLHKAIKAVNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLR-EVMGRYRNKMVYDANTGEIKDDKKFM 133 (359) Q Consensus 55 i~syL~~Aik~~NqL~m~EDalVIyR~~RAPeRRvFyIDvGnlpk~KAeqYl~-~iM~kyrnklvYD~~TGevkdd~~~m 133 (359) =+|-|+.|.+++.....+++...=+=.--+--+-|..++ |.+...+|.+=++ .....|. | ..+.-+.+ T Consensus 203 G~spi~~~~~~i~~~~a~~~~~~~~f~ng~~~~gil~~~-~~~~~~e~~~~~~~~~~~~~~---------G-~~nag~~~ 271 (441) T protein:vir:98 203 GLSLLDTLSRTIESDNNGKDFLNNFLRNGTHAGGILKMK-GVLDNKKARDRAREEFHKSFS---------G-TKQAGKVV 271 (441) T ss_pred ccCHHHHHHHHHHHHHHHHHHHHHHHhccCCCcEEEEeC-CCCCCHHHHHHHHHHHHHHhc---------C-ccccCcce Confidence 357777777777766666665542222223345566666 4443334433233 3333332 1 11111122 Q ss_pred hhHhhhcccccCCCCccceeecCCC-CCcchHHHHHHHHHHHHHhcCCCccccCCCCcccccchhhhhHHhhhHHHHHHH Q lcl|NC_015285. 134 SMMEDFWLPRREGGRGTEISTLPGG-QNLGELEDVKYFQKKLYKALNVPSSRLETETTFNIGRAAEITRDEVKFQKFIAR 212 (359) Q Consensus 134 SMlEDywLpRReGgrgTEIsTLpGg-qnLgei~DV~YF~kkLy~aL~VP~SRl~~~~~~~~g~~~eItRDElKF~KFI~r 212 (359) .++ .|.+++.|.=. +.+.-++.-+|..+.+.++++||...|+.+.. + +.+.-..+-| +.. T Consensus 272 -vl~----------~g~~~~~l~~~~~d~q~~e~r~~~~~~Ia~~fgVPp~~lg~~~~-~----~s~~q~~~~y---~~t 332 (441) T protein:vir:98 272 -VLD----------ESMTFDQLEVDTEVLKLIRENKSSTREIAGVFGIPLHKFGIETA-N----MSITDANLDY---LST 332 (441) T ss_pred -ecC----------CCceEEEccCChhHHHHHHHHHHhHHHHHHHhCCCHHHcCCCCC-C----ccHHHHHHHH---HHH Confidence 222 24556655321 11222444566778899999999999964321 1 1122122224 344 Q ss_pred HHHHHHHHHHHHHHHHHHhcCCCChhHHHHHhhceeeeeeccchHHHHHHHHHHHHHHHHHHHhhhhcchhhhHHHHHHH Q lcl|NC_015285. 213 LRKRFSELFTDLLKTQLILKGVMSLEEWEDMKNHIQFDFIADNYFTELKEIEIRNERMNQVAAMDPYVGKYFSVDYMRRQ 292 (359) Q Consensus 213 Lr~rFs~if~d~Lk~QLiLkgI~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~~~~~dp~vGKy~S~~~i~k~ 292 (359) |+--+. .+.+.|-..| .++. ....+.|..+ ++...+ +..|.+.++.+-.- -+ T Consensus 333 l~P~~~-~ie~~ln~~L-----~~~~------~~~~~~fd~~----~llr~d-~~~~~~~~~~~~~~--G~--------- 384 (441) T protein:vir:98 333 LKPYIT-CVCAELNFKF-----NDEY------VNREFKFDTT----EIRVVD-EKTQAEIDKINIDS--GK--------- 384 (441) T ss_pred HHHHHH-HHHHHHHhhc-----cccc------cCceEEEech----hhhccC-HHHHHHHHHHHHhC--CC--------- Confidence 543222 2333333333 2221 1223444322 222221 23455555554322 13 Q ss_pred HhCCCHHHHHHHHHHHHHHHhcCCCCCCcchhhhcCCCCCcccccccCCCCCcCC-CCCCCCCccCC Q lcl|NC_015285. 293 VLKQTEIEIKEIDEQIASEMEAGIIADPMAEMDPAMAAGGEGAPAAEVDPNAQES-SVDPGDVRRGE 358 (359) Q Consensus 293 IL~~tDeeI~e~~kqi~~E~~~~~~~~P~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~p~~~~~~~ 358 (359) ||..|+-+ ...-+.+++++...=....-..+-..+++.+++.+.. ..+-+.+-++| T Consensus 385 ---~T~NE~R~-------~~gl~pi~gGd~~~~~~~~n~~~~~~~~~~q~~~~~~~~~~~kgGe~ne 441 (441) T protein:vir:98 385 ---MNIDEIRQ-------RDGLAPIPGGNGSIHRVDLNHVNIELVDEYQMNKSRATDKKLKGGEENE 441 (441) T ss_pred ---cCHHHHHH-------HhCCCCCCCCCcceEeecccccccccccccccccccccccccCCCCCCC Confidence 44444421 1122333333321100111111111222233322221 11112222333 No 136 >protein:vir:4337 Length: 434 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:93 # MgeName: D3 # Cross-refs: genbank:acc:NP_061500;genbank:gi:9635589;genbank:GeneID:1262858 Probab=31.53 E-value=1.5 Score=19.63 Aligned_cols=284 Identities=15% Similarity=0.161 Sum_probs=118.5 Q ss_pred CCCc-------hhhHHHhhhhhhheeecc---------cc-----ccccCCCceeecHhHhhhhhcccccCCCCcchhhH Q lcl|NC_015285. 1 MRGV-------DLNQQLTQKAAEYFLYNP---------KG-----LKNSTNQGMKITTDSVTYCHSGIQDLNKNMTLSHL 59 (359) Q Consensus 1 ~~~~-------~~~~~~~~~~~e~f~yn~---------~~-----~~~~~~~~v~i~~~ai~y~hSGl~d~~~~~i~syL 59 (359) +.++ ..|. ..+.+...| .| +....+..+.++.+-|.+.+ ++ +.++-.=+|-+ T Consensus 123 l~Gnay~~i~~~~G~-----~~~L~~l~p~~v~~~~~~~g~~~y~~~~~~g~~~~~~~~eVih~~-~~-~~dg~~G~spi 195 (434) T protein:vir:43 123 LWGNAYAEIRRAAGR-----PAALDFLLPSRVDLECDENGRLKYFYTTKKGARREIERTNMLHIP-AF-TLDGRIGLSAI 195 (434) T ss_pred hcCCeEEEEEeCCCc-----EEEEEEEcCcceEEEEcCCCeEEEEEEecCceEEEEccccEEEec-Cc-CCCCccccCHH Confidence 1111 1111 111121111 11 01112334788888887775 22 44444446778 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhcCccceeEeccCCCCchHHHHHHHHHHHHhhcceEEeeCCCCcccccccchhhHhhh Q lcl|NC_015285. 60 HKAIKAVNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKMVYDANTGEIKDDKKFMSMMEDF 139 (359) Q Consensus 60 ~~Aik~~NqL~m~EDalVIyR~~RAPeRRvFyIDvGnlpk~KAeqYl~~iM~kyrnklvYD~~TGevkdd~~~mSMlEDy 139 (359) ..|+..+.....+++...-+----+--.-|..++ +.|.+.+++ =+++.++++..- ...|.+- +++ T Consensus 196 ~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~-~~l~~e~~~-~~r~~~~~~~g~----~nag~~~-------vl~-- 260 (434) T protein:vir:43 196 RYGVDVFGSVMSAEDAANGTFKNGLLPTVAFKVD-RILQPAQRE-EFREYVKSVSGA----MNSGRSP-------VLE-- 260 (434) T ss_pred HHHHHHHHHHHHHHHHHHHHHhccCCcceEEecC-CCCCHHHHH-HHHHHHHHhcCc----cccCCcc-------ccC-- Confidence 8888888877777766543332223334555555 456554444 356666554211 1122211 221 Q ss_pred cccccCCCCccceeecCC-CCCcchHHHHHHHHHHHHHhcCCCccccCCCC-cccccch-hhhhHHhhhHHHHHHHHHHH Q lcl|NC_015285. 140 WLPRREGGRGTEISTLPG-GQNLGELEDVKYFQKKLYKALNVPSSRLETET-TFNIGRA-AEITRDEVKFQKFIARLRKR 216 (359) Q Consensus 140 wLpRReGgrgTEIsTLpG-gqnLgei~DV~YF~kkLy~aL~VP~SRl~~~~-~~~~g~~-~eItRDElKF~KFI~rLr~r 216 (359) .|.+++.|.- .+.+.-++-.++..+.+.++++||..-|+... +-+.+.. ++..+. |.+++ |+-- T Consensus 261 --------~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~s~~e~~~~~---f~~~~--L~P~ 327 (434) T protein:vir:43 261 --------QGITPETIGINPVDAQLLETREHGVIEICRWFGVPPWMIGQTDKGSNWGTGLEQQMLA---FLTFS--ISSI 327 (434) T ss_pred --------CCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCcCCccccchHHHHHHH---HHHHH--HHHH Confidence 2556666632 11222234456778889999999999986432 2222221 222222 54432 3332 Q ss_pred HHHHHHHHHHHHHHhcCCCChhHHHHHhhceeeeeeccchHHHHHHHHHHHHHHHHHHHhhhhcchhhhHHHHHHHHhCC Q lcl|NC_015285. 217 FSELFTDLLKTQLILKGVMSLEEWEDMKNHIQFDFIADNYFTELKEIEIRNERMNQVAAMDPYVGKYFSVDYMRRQVLKQ 296 (359) Q Consensus 217 Fs~if~d~Lk~QLiLkgI~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~~~~~dp~vGKy~S~~~i~k~IL~~ 296 (359) +.. +.+.| -+.+.+.++|. ...+.|.-+. +...+ ...|.+.+..+-.- -+++.+-++. .+++ T Consensus 328 ~~~-ie~~l-----n~kL~~~~~~~----~~~~~fd~~~----llr~d-~~~r~~~~~~~~~~--G~~T~NE~R~-~~gl 389 (434) T protein:vir:43 328 TNQ-IQQCV-----NKRLLTAPERI----RYYAEFSLEG----FLKAD-SAGRAAWYSTMAQN--GFMTRNEGRR-KENL 389 (434) T ss_pred HHH-HHHHH-----HhhcCChhhhc----CceEEEechh----hhccC-HHHHHHHHHHHHhC--CCcCHHHHHH-HhCC Confidence 221 12222 22345666654 2344454332 22221 24456666555322 3556666664 3555 Q ss_pred CHHHHHHHHHHHHHHHhcCCCCC----CcchhhhcCCCCCcccccccCCCCCcCCCCCCCC Q lcl|NC_015285. 297 TEIEIKEIDEQIASEMEAGIIAD----PMAEMDPAMAAGGEGAPAAEVDPNAQESSVDPGD 353 (359) Q Consensus 297 tDeeI~e~~kqi~~E~~~~~~~~----P~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~ 353 (359) .+- +.-++ .+.+ |-++.+...... ...++.-....++ +|+. T Consensus 390 ~p~--~ggD~---------~~~~~n~~~~~~~~~~~~~~--~~~~~~~~~~~~~---~~~~ 434 (434) T protein:vir:43 390 PEL--PGGDI---------LTVQSNLVPIDQLGQSNKSQ--AVRAALMNWFSQP---EPQE 434 (434) T ss_pred CCC--CCCCe---------EeeccCccchhhhhccCCCc--chhhhhhccCCCC---CCCC Confidence 431 10000 0000 101111110000 0000000001111 1221 No 137 >protein:vir:100150 Length: 437 # NCBI annotation: gp3 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1639 # MgeName: phi1026b # Cross-refs: genbank:acc:NP_945033;genbank:gi:38707893;genbank:GeneID:2744197 Probab=27.89 E-value=1.8 Score=19.18 Aligned_cols=293 Identities=13% Similarity=0.146 Sum_probs=125.3 Q ss_pred CCCch-------hhHHHhhhhhhheeeccc---------c-----ccccCCCceeecHhHhhhhhcccccCCCCcchhhH Q lcl|NC_015285. 1 MRGVD-------LNQQLTQKAAEYFLYNPK---------G-----LKNSTNQGMKITTDSVTYCHSGIQDLNKNMTLSHL 59 (359) Q Consensus 1 ~~~~~-------~~~~~~~~~~e~f~yn~~---------~-----~~~~~~~~v~i~~~ai~y~hSGl~d~~~~~i~syL 59 (359) +.++. .| ...+.+...|. | +....+....++.+-|.+... .+.|+-.=+|-+ T Consensus 119 l~Gnay~~i~r~~g-----~~~~L~~l~p~~v~i~~~~~g~~~y~~~~~~g~~~~~~~~dIih~r~--~~~d~~~G~spi 191 (437) T protein:vir:10 119 LWGNGYARKLRSAG-----VLIGLELMLPQRTTVKRLTSGALQYTYRNVDGTVSTLAEDDVFHVRG--FSLDGLMGLTPI 191 (437) T ss_pred hcCCeEEEEEecCC-----cEEEEEEEcCcceEEEECCCCeEEEEEEecCceEEEEccccEEEecC--cCCCCcccccHH Confidence 11100 01 11111111111 1 011123345677777766531 223333345778 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhcCccceeEeccCCCCchHHHHHHHHHHHHhhcceEEeeCCCCcccccccchhhHhhh Q lcl|NC_015285. 60 HKAIKAVNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKMVYDANTGEIKDDKKFMSMMEDF 139 (359) Q Consensus 60 ~~Aik~~NqL~m~EDalVIyR~~RAPeRRvFyIDvGnlpk~KAeqYl~~iM~kyrnklvYD~~TGevkdd~~~mSMlEDy 139 (359) ..|.+++.....+++...=+----+--+-|..++ +.|.+.++++..+.+-.+|..- ...|.+ + .++ T Consensus 192 ~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~-~~l~~e~~~~~~~~~~~~~~g~----~nag~~------~-vl~-- 257 (437) T protein:vir:10 192 QYAREVLGNSTAANKTSASVFRNGLRPSGVLSTD-QILQKEKRAEIRTDLAEQFGGA----MQAGKT------M-VLE-- 257 (437) T ss_pred HHHHHHHHHHHHHHHHHHHHHhccCCccEEEEcC-CCCCHHHHHHHHHHHHHHhcCc----cccCcc------e-ecc-- Confidence 9999888887778776665555556667777776 6688888777666655554320 011221 1 121 Q ss_pred cccccCCCCccceeecCCCCCcch-HHHHHHHHHHHHHhcCCCccccCCCCccccc--chhhhhHHhhhHHHHHHHHHHH Q lcl|NC_015285. 140 WLPRREGGRGTEISTLPGGQNLGE-LEDVKYFQKKLYKALNVPSSRLETETTFNIG--RAAEITRDEVKFQKFIARLRKR 216 (359) Q Consensus 140 wLpRReGgrgTEIsTLpGgqnLge-i~DV~YF~kkLy~aL~VP~SRl~~~~~~~~g--~~~eItRDElKF~KFI~rLr~r 216 (359) .|++++.|.-...-.+ ++=-++-.+.+.++++||...|+...+-+.. ..++..+. |..++ |+-. T Consensus 258 --------~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~~~sn~e~~~~~---f~~~t--l~P~ 324 (437) T protein:vir:10 258 --------AGMKYQAITMNPGDVQLLETRAFNIEEICRWYRVPPFMVGHSEKSTSWGTGIEQQTLG---FLTFT--LRPW 324 (437) T ss_pred --------CCceEEeccCChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCcccccchHHHHHHH---HHHHH--HHHH Confidence 2456666632221222 3333466788999999999999654332221 12232222 54442 3322 Q ss_pred HHHHHHHHHHHHHHhcCCCChhHHHHHhhceeeeeeccchHHHHHHHHHHHHHHHHHHHhhhhcchhhhHHHHHHHHhCC Q lcl|NC_015285. 217 FSELFTDLLKTQLILKGVMSLEEWEDMKNHIQFDFIADNYFTELKEIEIRNERMNQVAAMDPYVGKYFSVDYMRRQVLKQ 296 (359) Q Consensus 217 Fs~if~d~Lk~QLiLkgI~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~~~~~dp~vGKy~S~~~i~k~IL~~ 296 (359) +. .+.+.|-. .++++.+|.. .. +.|..+ ++...+ +..|.+.++.+-.- -+++.+-++. ++++ T Consensus 325 ~~-~ie~~l~~-----kll~~~e~~~--~~--~~fd~~----~ll~~d-~~~r~~~~~~~~~~--G~~T~NE~R~-~~gl 386 (437) T protein:vir:10 325 LT-RIEQAARR-----SLLRPGERDQ--FY--AEFSVE----GLLRAD-SAGRAAFYSTMTQN--GLMTRDECRA-KENL 386 (437) T ss_pred HH-HHHHHHHh-----hccCccccCc--eE--EEEech----hhhccC-HHHHHHHHHHHHhC--CCcCHHHHHH-HhCC Confidence 22 23333333 3355556542 23 444422 232222 35667766655322 3566666664 3566 Q ss_pred CHHHHHHHHHHHHHHHhcCCCCCCcchhhhcCCCCCc--ccccccCCCCCcCCCCCCCCCccCC Q lcl|NC_015285. 297 TEIEIKEIDEQIASEMEAGIIADPMAEMDPAMAAGGE--GAPAAEVDPNAQESSVDPGDVRRGE 358 (359) Q Consensus 297 tDeeI~e~~kqi~~E~~~~~~~~P~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~p~~~~~~~ 358 (359) .+-+ .-+..+- ..... -|-+...+.....++ +...++.+++-..++.+ + T Consensus 387 ~pi~--gg~~~~~--~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e-------~ 437 (437) T protein:vir:10 387 PPMG--GNAAVLT--VQSAL--LPIDKLGEHTTATAAQDALKAWLYQEEKTRATQE-------R 437 (437) T ss_pred CCCC--CCcceEe--ecCcc--cchhhccCcCCCcchhccccccCCCCCCCCcccc-------C Confidence 4322 0000000 00000 011111000000000 00111111111111111 1 No 138 >protein:vir:105002 Length: 432 # NCBI annotation: putative phage portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1490 # MgeName: W Beta # Cross-refs: genbank:acc:YP_459967;genbank:gi:85701382;genbank:GeneID:3882143 Probab=27.02 E-value=1.9 Score=19.07 Aligned_cols=287 Identities=9% Similarity=0.085 Sum_probs=123.4 Q ss_pred CCC--------chhhHHHhhhhhhheeeccc-------------------cccccCCCceeecHhHhhhhhcccccCCCC Q lcl|NC_015285. 1 MRG--------VDLNQQLTQKAAEYFLYNPK-------------------GLKNSTNQGMKITTDSVTYCHSGIQDLNKN 53 (359) Q Consensus 1 ~~~--------~~~~~~~~~~~~e~f~yn~~-------------------~~~~~~~~~v~i~~~ai~y~hSGl~d~~~~ 53 (359) +.+ +..|. ..+.+..+|. .....++..+.++.+-|.+..-+. ..++- T Consensus 116 l~Gnay~~i~r~~~G~-----~~~L~~i~~~~v~v~~d~~~~~~~~~~~~y~~~~~g~~~~~~~~eiih~r~~~-~~~~~ 189 (432) T protein:vir:10 116 LYGNSYANIEFDRKGK-----VQALWPIDASKVTVYIDDVGLLNSKTKMWYVVNTGGQQRVLKPEEILHFKNGI-TLDGL 189 (432) T ss_pred hcCCeEEEEEECCCCc-----EEEEEEEcCceeEEEEcCcccccccceEEEEEecCCeEEEEccccEEEecCCC-CCCCc Confidence 111 11110 1111111111 111233444678888887764332 22333 Q ss_pred cchhhHHHHHHHHHHHHHHHHHHHHHHHhcCccceeEeccCCCCchHHHHHHHHHHHHhhcceEEeeCCCCcccccccch Q lcl|NC_015285. 54 MTLSHLHKAIKAVNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKMVYDANTGEIKDDKKFM 133 (359) Q Consensus 54 ~i~syL~~Aik~~NqL~m~EDalVIyR~~RAPeRRvFyIDvGnlpk~KAeqYl~~iM~kyrnklvYD~~TGevkdd~~~m 133 (359) .=+|.|..|++++.....+++...=+----+.-+-|..++ +.|.+..+++..+.+...|..- ...|. .+ T Consensus 190 ~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~gil~~~-~~l~~e~~~~~~~~~~~~~~g~----~n~~~------~~ 258 (432) T protein:vir:10 190 VGVPTMEYLKSTLENSASADKFINNFYKQGLQVKGLVQYV-GDLNEDAKKVFRENFESMSSGL----QNSHR------IA 258 (432) T ss_pred ccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEcC-CCCCHHHHHHHHHHHHHHhccc----ccCCc------ce Confidence 4578999999999998888887666654445556777776 4677666666555544444320 01121 11 Q ss_pred hhHhhhcccccCCCCccceeecCC-CCCcchHHHHHHHHHHHHHhcCCCccccCCCCcccccchhhhhHHhhhHHHH-HH Q lcl|NC_015285. 134 SMMEDFWLPRREGGRGTEISTLPG-GQNLGELEDVKYFQKKLYKALNVPSSRLETETTFNIGRAAEITRDEVKFQKF-IA 211 (359) Q Consensus 134 SMlEDywLpRReGgrgTEIsTLpG-gqnLgei~DV~YF~kkLy~aL~VP~SRl~~~~~~~~g~~~eItRDElKF~KF-I~ 211 (359) .++ .|.++..|.- ...+.-++-.++..+.+.++++||...|...+.-+.....+..+. |.++ |. T Consensus 259 -vl~----------~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVP~~~lg~~~~~~~s~~e~~~~~---~~~~~l~ 324 (432) T protein:vir:10 259 -LMP----------VGYQFQPISLNMSDAQFLENTELTIRQIATAFGIKMHQLNDLSKATLNNIEQQQQQ---FYTDTLQ 324 (432) T ss_pred -ecC----------CCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHHHH---HHHHHHH Confidence 221 2455555532 112222344567789999999999999964333233333332222 4432 22 Q ss_pred HHHHHHHHHHHHHHHHHHHhcCCCChhHHHHHhhceeeeeeccchHHHHHHHHHHHHHHHHHHHhhhhcchhhhHHHHHH Q lcl|NC_015285. 212 RLRKRFSELFTDLLKTQLILKGVMSLEEWEDMKNHIQFDFIADNYFTELKEIEIRNERMNQVAAMDPYVGKYFSVDYMRR 291 (359) Q Consensus 212 rLr~rFs~if~d~Lk~QLiLkgI~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~~~~~dp~vGKy~S~~~i~k 291 (359) -+-.++ .+.|- +.++++.+|. ..+.+.|..+ ++...+ +..|++++..+-.- -+++.+-++. T Consensus 325 P~~~~i----e~~ln-----~kLl~~~~~~---~g~~~~fd~~----~l~~~d-~~~~~~~~~~~~~~--G~~t~NE~R~ 385 (432) T protein:vir:10 325 ATLTMY----EQEMT-----YKLFLDSELD---KGFYSKFNVD----AILRAD-IKTRYEAYRTGIQG--GFLKPNEARS 385 (432) T ss_pred HHHHHH----HHHHH-----HhhcChhhcC---CCcEEEeech----hhhcCC-HHHHHHHHHHHHhC--CCcCHHHHHH Confidence 222222 22222 2334555554 3334555432 222211 23455555544322 3556555553 Q ss_pred HHhCCCHHHHHHHHHHHHHHHhcCCCCCCcchhhhcCCCCCcccccccCCC-CCcCCCCCCCCCccCC Q lcl|NC_015285. 292 QVLKQTEIEIKEIDEQIASEMEAGIIADPMAEMDPAMAAGGEGAPAAEVDP-NAQESSVDPGDVRRGE 358 (359) Q Consensus 292 ~IL~~tDeeI~e~~kqi~~E~~~~~~~~P~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~p~~~~~~~ 358 (359) ++++.+. +--++ .+.+-+ + .+-+. +++.+. +........+++..|- T Consensus 386 -~~g~~pi--~ggD~---------~~~~~n--~-----~~~~~--~~~~~~k~~~~~~~~~~~~~~~~ 432 (432) T protein:vir:10 386 -KEDLPPE--AGGDR---------LLVNGN--M-----LPIDM--AGQAYLKGGDTNGEVSKEGNEGN 432 (432) T ss_pred -HhCCCCC--CCCCe---------Eeeccc--c-----cchhh--ccccccCCCCCCCCCCCCCCCCC Confidence 3555331 00000 000000 0 00000 000000 0000001111111122 No 139 >protein:vir:102855 Length: 432 # NCBI annotation: phage portal protein, HK97 family # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1492 # MgeName: Cherry # Cross-refs: genbank:acc:YP_338135;genbank:gi:77020228;genbank:GeneID:3703764 Probab=27.02 E-value=1.9 Score=19.07 Aligned_cols=287 Identities=9% Similarity=0.085 Sum_probs=123.4 Q ss_pred CCC--------chhhHHHhhhhhhheeeccc-------------------cccccCCCceeecHhHhhhhhcccccCCCC Q lcl|NC_015285. 1 MRG--------VDLNQQLTQKAAEYFLYNPK-------------------GLKNSTNQGMKITTDSVTYCHSGIQDLNKN 53 (359) Q Consensus 1 ~~~--------~~~~~~~~~~~~e~f~yn~~-------------------~~~~~~~~~v~i~~~ai~y~hSGl~d~~~~ 53 (359) +.+ +..|. ..+.+..+|. .....++..+.++.+-|.+..-+. ..++- T Consensus 116 l~Gnay~~i~r~~~G~-----~~~L~~i~~~~v~v~~d~~~~~~~~~~~~y~~~~~g~~~~~~~~eiih~r~~~-~~~~~ 189 (432) T protein:vir:10 116 LYGNSYANIEFDRKGK-----VQALWPIDASKVTVYIDDVGLLNSKTKMWYVVNTGGQQRVLKPEEILHFKNGI-TLDGL 189 (432) T ss_pred hcCCeEEEEEECCCCc-----EEEEEEEcCceeEEEEcCcccccccceEEEEEecCCeEEEEccccEEEecCCC-CCCCc Confidence 111 11110 1111111111 111233444678888887764332 22333 Q ss_pred cchhhHHHHHHHHHHHHHHHHHHHHHHHhcCccceeEeccCCCCchHHHHHHHHHHHHhhcceEEeeCCCCcccccccch Q lcl|NC_015285. 54 MTLSHLHKAIKAVNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKMVYDANTGEIKDDKKFM 133 (359) Q Consensus 54 ~i~syL~~Aik~~NqL~m~EDalVIyR~~RAPeRRvFyIDvGnlpk~KAeqYl~~iM~kyrnklvYD~~TGevkdd~~~m 133 (359) .=+|.|..|++++.....+++...=+----+.-+-|..++ +.|.+..+++..+.+...|..- ...|. .+ T Consensus 190 ~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~gil~~~-~~l~~e~~~~~~~~~~~~~~g~----~n~~~------~~ 258 (432) T protein:vir:10 190 VGVPTMEYLKSTLENSASADKFINNFYKQGLQVKGLVQYV-GDLNEDAKKVFRENFESMSSGL----QNSHR------IA 258 (432) T ss_pred ccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEcC-CCCCHHHHHHHHHHHHHHhccc----ccCCc------ce Confidence 4578999999999998888887666654445556777776 4677666666555544444320 01121 11 Q ss_pred hhHhhhcccccCCCCccceeecCC-CCCcchHHHHHHHHHHHHHhcCCCccccCCCCcccccchhhhhHHhhhHHHH-HH Q lcl|NC_015285. 134 SMMEDFWLPRREGGRGTEISTLPG-GQNLGELEDVKYFQKKLYKALNVPSSRLETETTFNIGRAAEITRDEVKFQKF-IA 211 (359) Q Consensus 134 SMlEDywLpRReGgrgTEIsTLpG-gqnLgei~DV~YF~kkLy~aL~VP~SRl~~~~~~~~g~~~eItRDElKF~KF-I~ 211 (359) .++ .|.++..|.- ...+.-++-.++..+.+.++++||...|...+.-+.....+..+. |.++ |. T Consensus 259 -vl~----------~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVP~~~lg~~~~~~~s~~e~~~~~---~~~~~l~ 324 (432) T protein:vir:10 259 -LMP----------VGYQFQPISLNMSDAQFLENTELTIRQIATAFGIKMHQLNDLSKATLNNIEQQQQQ---FYTDTLQ 324 (432) T ss_pred -ecC----------CCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHHHH---HHHHHHH Confidence 221 2455555532 112222344567789999999999999964333233333332222 4432 22 Q ss_pred HHHHHHHHHHHHHHHHHHHhcCCCChhHHHHHhhceeeeeeccchHHHHHHHHHHHHHHHHHHHhhhhcchhhhHHHHHH Q lcl|NC_015285. 212 RLRKRFSELFTDLLKTQLILKGVMSLEEWEDMKNHIQFDFIADNYFTELKEIEIRNERMNQVAAMDPYVGKYFSVDYMRR 291 (359) Q Consensus 212 rLr~rFs~if~d~Lk~QLiLkgI~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~~~~~dp~vGKy~S~~~i~k 291 (359) -+-.++ .+.|- +.++++.+|. ..+.+.|..+ ++...+ +..|++++..+-.- -+++.+-++. T Consensus 325 P~~~~i----e~~ln-----~kLl~~~~~~---~g~~~~fd~~----~l~~~d-~~~~~~~~~~~~~~--G~~t~NE~R~ 385 (432) T protein:vir:10 325 ATLTMY----EQEMT-----YKLFLDSELD---KGFYSKFNVD----AILRAD-IKTRYEAYRTGIQG--GFLKPNEARS 385 (432) T ss_pred HHHHHH----HHHHH-----HhhcChhhcC---CCcEEEeech----hhhcCC-HHHHHHHHHHHHhC--CCcCHHHHHH Confidence 222222 22222 2334555554 3334555432 222211 23455555544322 3556555553 Q ss_pred HHhCCCHHHHHHHHHHHHHHHhcCCCCCCcchhhhcCCCCCcccccccCCC-CCcCCCCCCCCCccCC Q lcl|NC_015285. 292 QVLKQTEIEIKEIDEQIASEMEAGIIADPMAEMDPAMAAGGEGAPAAEVDP-NAQESSVDPGDVRRGE 358 (359) Q Consensus 292 ~IL~~tDeeI~e~~kqi~~E~~~~~~~~P~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~p~~~~~~~ 358 (359) ++++.+. +--++ .+.+-+ + .+-+. +++.+. +........+++..|- T Consensus 386 -~~g~~pi--~ggD~---------~~~~~n--~-----~~~~~--~~~~~~k~~~~~~~~~~~~~~~~ 432 (432) T protein:vir:10 386 -KEDLPPE--AGGDR---------LLVNGN--M-----LPIDM--AGQAYLKGGDTNGEVSKEGNEGN 432 (432) T ss_pred -HhCCCCC--CCCCe---------Eeeccc--c-----cchhh--ccccccCCCCCCCCCCCCCCCCC Confidence 3555331 00000 000000 0 00000 000000 0000001111111122 No 140 >protein:vir:107605 Length: 432 # NCBI annotation: phage portal protein, HK97 family # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1491 # MgeName: Gamma # Cross-refs: genbank:acc:YP_338186;genbank:gi:77020175;genbank:GeneID:3703736 Probab=27.02 E-value=1.9 Score=19.07 Aligned_cols=287 Identities=9% Similarity=0.085 Sum_probs=123.4 Q ss_pred CCC--------chhhHHHhhhhhhheeeccc-------------------cccccCCCceeecHhHhhhhhcccccCCCC Q lcl|NC_015285. 1 MRG--------VDLNQQLTQKAAEYFLYNPK-------------------GLKNSTNQGMKITTDSVTYCHSGIQDLNKN 53 (359) Q Consensus 1 ~~~--------~~~~~~~~~~~~e~f~yn~~-------------------~~~~~~~~~v~i~~~ai~y~hSGl~d~~~~ 53 (359) +.+ +..|. ..+.+..+|. .....++..+.++.+-|.+..-+. ..++- T Consensus 116 l~Gnay~~i~r~~~G~-----~~~L~~i~~~~v~v~~d~~~~~~~~~~~~y~~~~~g~~~~~~~~eiih~r~~~-~~~~~ 189 (432) T protein:vir:10 116 LYGNSYANIEFDRKGK-----VQALWPIDASKVTVYIDDVGLLNSKTKMWYVVNTGGQQRVLKPEEILHFKNGI-TLDGL 189 (432) T ss_pred hcCCeEEEEEECCCCc-----EEEEEEEcCceeEEEEcCcccccccceEEEEEecCCeEEEEccccEEEecCCC-CCCCc Confidence 111 11110 1111111111 111233444678888887764332 22333 Q ss_pred cchhhHHHHHHHHHHHHHHHHHHHHHHHhcCccceeEeccCCCCchHHHHHHHHHHHHhhcceEEeeCCCCcccccccch Q lcl|NC_015285. 54 MTLSHLHKAIKAVNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKMVYDANTGEIKDDKKFM 133 (359) Q Consensus 54 ~i~syL~~Aik~~NqL~m~EDalVIyR~~RAPeRRvFyIDvGnlpk~KAeqYl~~iM~kyrnklvYD~~TGevkdd~~~m 133 (359) .=+|.|..|++++.....+++...=+----+.-+-|..++ +.|.+..+++..+.+...|..- ...|. .+ T Consensus 190 ~G~s~~~~~~~~i~~~~~~~~~~~~~~~ng~~p~gil~~~-~~l~~e~~~~~~~~~~~~~~g~----~n~~~------~~ 258 (432) T protein:vir:10 190 VGVPTMEYLKSTLENSASADKFINNFYKQGLQVKGLVQYV-GDLNEDAKKVFRENFESMSSGL----QNSHR------IA 258 (432) T ss_pred ccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEcC-CCCCHHHHHHHHHHHHHHhccc----ccCCc------ce Confidence 4578999999999998888887666654445556777776 4677666666555544444320 01121 11 Q ss_pred hhHhhhcccccCCCCccceeecCC-CCCcchHHHHHHHHHHHHHhcCCCccccCCCCcccccchhhhhHHhhhHHHH-HH Q lcl|NC_015285. 134 SMMEDFWLPRREGGRGTEISTLPG-GQNLGELEDVKYFQKKLYKALNVPSSRLETETTFNIGRAAEITRDEVKFQKF-IA 211 (359) Q Consensus 134 SMlEDywLpRReGgrgTEIsTLpG-gqnLgei~DV~YF~kkLy~aL~VP~SRl~~~~~~~~g~~~eItRDElKF~KF-I~ 211 (359) .++ .|.++..|.- ...+.-++-.++..+.+.++++||...|...+.-+.....+..+. |.++ |. T Consensus 259 -vl~----------~g~~~~~l~~~~~d~q~~e~~~~~~~~Ia~~fgVP~~~lg~~~~~~~s~~e~~~~~---~~~~~l~ 324 (432) T protein:vir:10 259 -LMP----------VGYQFQPISLNMSDAQFLENTELTIRQIATAFGIKMHQLNDLSKATLNNIEQQQQQ---FYTDTLQ 324 (432) T ss_pred -ecC----------CCceEEEccCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCCcccHHHHHHH---HHHHHHH Confidence 221 2455555532 112222344567789999999999999964333233333332222 4432 22 Q ss_pred HHHHHHHHHHHHHHHHHHHhcCCCChhHHHHHhhceeeeeeccchHHHHHHHHHHHHHHHHHHHhhhhcchhhhHHHHHH Q lcl|NC_015285. 212 RLRKRFSELFTDLLKTQLILKGVMSLEEWEDMKNHIQFDFIADNYFTELKEIEIRNERMNQVAAMDPYVGKYFSVDYMRR 291 (359) Q Consensus 212 rLr~rFs~if~d~Lk~QLiLkgI~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~~~~~dp~vGKy~S~~~i~k 291 (359) -+-.++ .+.|- +.++++.+|. ..+.+.|..+ ++...+ +..|++++..+-.- -+++.+-++. T Consensus 325 P~~~~i----e~~ln-----~kLl~~~~~~---~g~~~~fd~~----~l~~~d-~~~~~~~~~~~~~~--G~~t~NE~R~ 385 (432) T protein:vir:10 325 ATLTMY----EQEMT-----YKLFLDSELD---KGFYSKFNVD----AILRAD-IKTRYEAYRTGIQG--GFLKPNEARS 385 (432) T ss_pred HHHHHH----HHHHH-----HhhcChhhcC---CCcEEEeech----hhhcCC-HHHHHHHHHHHHhC--CCcCHHHHHH Confidence 222222 22222 2334555554 3334555432 222211 23455555544322 3556555553 Q ss_pred HHhCCCHHHHHHHHHHHHHHHhcCCCCCCcchhhhcCCCCCcccccccCCC-CCcCCCCCCCCCccCC Q lcl|NC_015285. 292 QVLKQTEIEIKEIDEQIASEMEAGIIADPMAEMDPAMAAGGEGAPAAEVDP-NAQESSVDPGDVRRGE 358 (359) Q Consensus 292 ~IL~~tDeeI~e~~kqi~~E~~~~~~~~P~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~p~~~~~~~ 358 (359) ++++.+. +--++ .+.+-+ + .+-+. +++.+. +........+++..|- T Consensus 386 -~~g~~pi--~ggD~---------~~~~~n--~-----~~~~~--~~~~~~k~~~~~~~~~~~~~~~~ 432 (432) T protein:vir:10 386 -KEDLPPE--AGGDR---------LLVNGN--M-----LPIDM--AGQAYLKGGDTNGEVSKEGNEGN 432 (432) T ss_pred -HhCCCCC--CCCCe---------Eeeccc--c-----cchhh--ccccccCCCCCCCCCCCCCCCCC Confidence 3555331 00000 000000 0 00000 000000 0000001111111122 No 141 >protein:vir:1082 Length: 359 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:21 # MgeName: bIL309 # Cross-refs: genbank:acc:NP_076736;genbank:gi:13095846;genbank:GeneID:920394 Probab=26.43 E-value=1.9 Score=19.00 Aligned_cols=233 Identities=11% Similarity=0.150 Sum_probs=91.6 Q ss_pred CCCchhhHHHhh----hhhhheeeccccc--------------cccCCCceeecHhHhhhhh--c-ccccCCCCcchhhH Q lcl|NC_015285. 1 MRGVDLNQQLTQ----KAAEYFLYNPKGL--------------KNSTNQGMKITTDSVTYCH--S-GIQDLNKNMTLSHL 59 (359) Q Consensus 1 ~~~~~~~~~~~~----~~~e~f~yn~~~~--------------~~~~~~~v~i~~~ai~y~h--S-Gl~d~~~~~i~syL 59 (359) +.++.- +.|++ ...+.+...|... ....+....++.+-|.+.. | +.-..++-.=+|.| T Consensus 93 l~Gnay-~~i~r~~~g~~~~l~~l~~~~v~i~~~~~~~~y~~~~~~~~~~~~~~~~evih~~~~~~~~~~~dg~~G~spi 171 (359) T protein:vir:10 93 LNGNVF-LAILKGDNSLMKELRLIPSNAITIDLTDDTLTYEVNQFDDYPSAKYNASEMIHVKIMAYGVDTLHNLVGHSPL 171 (359) T ss_pred ccCceE-EEEEECCCCeEEEEEEeCCceEEEEEcCCeEEEEEEecCCceEEEEcccceEEeccCCCCCCccCccccccHH Confidence 111100 00000 0112222221110 0122345667777765432 1 11112232336888 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhcCccceeEeccCCCCchHHHHHHHHHHHHhhcceEEeeCCCCcccccccchhhHhhh Q lcl|NC_015285. 60 HKAIKAVNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKMVYDANTGEIKDDKKFMSMMEDF 139 (359) Q Consensus 60 ~~Aik~~NqL~m~EDalVIyR~~RAPeRRvFyIDvGnlpk~KAeqYl~~iM~kyrnklvYD~~TGevkdd~~~mSMlEDy 139 (359) +.|..++.....+++...-+=---+--+-|..++-|++.+..+++. ++-..++.. - ...|++- .++ T Consensus 172 ~~~~~~i~~~~~~~~~~~~~f~ng~~~~gil~~~~~~l~~e~~~~~-~~~~~~~~~--~--~n~g~~~-------vl~-- 237 (359) T protein:vir:10 172 ESLTSEIGQQKEANRLSLSTLKGALNPTSVVKVPQGTLSSEAKDSI-RKEFEKANG--G--NNSGRVM-------VLD-- 237 (359) T ss_pred HHHHHHHHHHHHHHHHHHHHHhccCCcceEEEeCCCCCCHHHHHHH-HHHHHHHhC--c--cccCCce-------ecC-- Confidence 9999988888888876543322223345677777777766554443 332333321 0 1122221 221 Q ss_pred cccccCCCCccceeecCCCCCcch---HHHHHHHHHHHHHhcCCCccccCCCCcccccchhhhhHHhhhHHHHHHHHHHH Q lcl|NC_015285. 140 WLPRREGGRGTEISTLPGGQNLGE---LEDVKYFQKKLYKALNVPSSRLETETTFNIGRAAEITRDEVKFQKFIARLRKR 216 (359) Q Consensus 140 wLpRReGgrgTEIsTLpGgqnLge---i~DV~YF~kkLy~aL~VP~SRl~~~~~~~~g~~~eItRDElKF~KFI~rLr~r 216 (359) .|.+++.|. .+.-+ ++-.+|-...+.++++||.+.|...+.-+ ...+.+ |-.+..|+...-.. T Consensus 238 --------~g~~~~~l~--~~~~d~q~le~~~~~~~~Ia~~fgVPp~~lg~~~~~~-~~~~~~---e~~~~~~l~~~l~p 303 (359) T protein:vir:10 238 --------QSADFSTVS--INADVANYLNSMNWGRTQIAKAFGVSDSYLNGTGDQQ-SSLDQI---KDLYVNALNRFIEP 303 (359) T ss_pred --------CCcceeeec--CCHHHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCccc-ccHHHH---HHHHHHHHHHHHHH Confidence 245566552 33333 23455667889999999999995432211 111222 11233332211111 Q ss_pred HHHHHHHHHHHHHHhcCCCChhHHHHHhhceeeeeeccchHHHHHHHHHHHHHHH----H------HHHhhhhc Q lcl|NC_015285. 217 FSELFTDLLKTQLILKGVMSLEEWEDMKNHIQFDFIADNYFTELKEIEIRNERMN----Q------VAAMDPYV 280 (359) Q Consensus 217 Fs~if~d~Lk~QLiLkgI~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~----~------~~~~dp~v 280 (359) +..-+...|-+++-+ +. ...+. |. .++...++. ..++ + +-.+.|+. T Consensus 304 ~~~~l~~~l~~~~~~-----~~-----~~~~~--~d-----~~~~~~~~~-~~~~~G~~t~NE~R~~l~~~pv~ 359 (359) T protein:vir:10 304 LISELRIKCDSSIGV-----DM-----SPITD--YS-----NSVFKADIL-NWVKEGIIEPTEAKTLLESKGII 359 (359) T ss_pred HHHHHHHHhhhhhcc-----cc-----hhhhh--cC-----HHHHHHHHH-HHHhCCCcCHHHHHHHhCCCCCC Confidence 111111111111110 00 00111 11 111111111 1111 1 11234443 No 142 >protein:vir:1661 Length: 378 # NCBI annotation: unknown # Family: family:all:2379 # MgeID: mge:34 # MgeName: sk1 # Cross-refs: genbank:acc:NP_044950;genbank:gi:9629657;genbank:GeneID:1261302 Probab=26.18 E-value=2 Score=18.96 Aligned_cols=277 Identities=13% Similarity=0.105 Sum_probs=115.0 Q ss_pred CC--CchhhHHHhhhhh-------hheee---ccc-c-c--cccCCCceeecHhHhhhhhcccccCCCCcchhhHHHHHH Q lcl|NC_015285. 1 MR--GVDLNQQLTQKAA-------EYFLY---NPK-G-L--KNSTNQGMKITTDSVTYCHSGIQDLNKNMTLSHLHKAIK 64 (359) Q Consensus 1 ~~--~~~~~~~~~~~~~-------e~f~y---n~~-~-~--~~~~~~~v~i~~~ai~y~hSGl~d~~~~~i~syL~~Aik 64 (359) .+ ..-+..+....+. +.|+| +.+ + . ..-.+.++.++.+-|.+..+- .++....|.|+.|.+ T Consensus 82 ~~PN~~~t~~~f~~~~~~~lll~Gna~i~~~~d~~~g~~~~l~~~~~~~~~~~~diih~r~~---~~~~~~~s~l~~~~~ 158 (378) T protein:vir:16 82 WSPKGERNSMDFWRKVIKKLLRAPYVDLYAVFDDNTGELLDLLFADDKKEYKPEELVRLTSP---FYINEDTSILDNALA 158 (378) T ss_pred hcCCCCCCHHHHHHHHHHHHhhcCceEEEEEeecCCceEEEEEecCCeeEecccceEEecCc---cCccchhHHHHHHHH Confidence 00 0111111111111 11222 100 0 0 011234577788888777642 345557788888877 Q ss_pred HHHHHHHHHHHHHHHHHhcCccceeEeccCCCCchHHHHHHHHHHHHhhcceEEeeCCCCcccccccchhhHhhhccccc Q lcl|NC_015285. 65 AVNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKMVYDANTGEIKDDKKFMSMMEDFWLPRR 144 (359) Q Consensus 65 ~~NqL~m~EDalVIyR~~RAPeRRvFyIDvGnlpk~KAeqYl~~iM~kyrnklvYD~~TGevkdd~~~mSMlEDywLpRR 144 (359) .++. + ..-+--|-+..++ +.|....+.+....+...|++..--+.+ | +.+ .+ T Consensus 159 ~i~~------~-----~~~~~~~g~l~~~-~~l~~~~~~~~~~~~~~~~~~~~~~~~~-g------~~~-vl-------- 210 (378) T protein:vir:16 159 SIQT------K-----LEQGKLRGLLKIN-AFLDIDNTQEYREKALTTIKNMQEGSSY-N------GLT-PV-------- 210 (378) T ss_pred HHHH------H-----HhcCccceeeEeC-CcCCHHHHHHHHHHHHHHHHHhhccccc-c------cce-Ec-------- Confidence 6532 1 1122223344443 4455555555444444444432221111 1 111 11 Q ss_pred CCCCccceeecCCCCCcchHHHHHHHHHHHHHhcCCCccccCCCCcccccchhhhhHHhhhHHHHHHHHHHHHHHHHHHH Q lcl|NC_015285. 145 EGGRGTEISTLPGGQNLGELEDVKYFQKKLYKALNVPSSRLETETTFNIGRAAEITRDEVKFQKFIARLRKRFSELFTDL 224 (359) Q Consensus 145 eGgrgTEIsTLpGgqnLgei~DV~YF~kkLy~aL~VP~SRl~~~~~~~~g~~~eItRDElKF~KFI~rLr~rFs~if~d~ 224 (359) ..|.+++.|.-.....++...+|-++.+.++++||.+.|.. ..+|-.. +-|..+ .|+-.+ ..+.+. T Consensus 211 --~~g~~~~~l~~~~~~~~~~~~~~~~~~Ia~~fgVPp~~l~g-------~~~e~~~--~~f~~~--tl~P~~-~~ie~~ 276 (378) T protein:vir:16 211 --DNKTEIVELKKDYSVLNKDEIDLIKSELLTGYFMNENILLG-------TASQEQQ--IYFYNS--TIIPLL-IQLEKE 276 (378) T ss_pred --CCCceEEEccCChhhhhHHHHHHHHHHHHHHhCCCHHHhcC-------CchHHHH--HHHHHH--HHHHHH-HHHHHH Confidence 12445555554444556788899999999999999999832 1222111 114333 132211 122333 Q ss_pred HHHHHHhcCCCChhHHHHHh---hceeeeeeccchHHHHHHHHHHHHHHHHHHHhhhhcchhhhHHHHHHHHhCCCHHHH Q lcl|NC_015285. 225 LKTQLILKGVMSLEEWEDMK---NHIQFDFIADNYFTELKEIEIRNERMNQVAAMDPYVGKYFSVDYMRRQVLKQTEIEI 301 (359) Q Consensus 225 Lk~QLiLkgI~t~eew~~~~---~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~~~~~dp~vGKy~S~~~i~k~IL~~tDeeI 301 (359) |.. .+++++||.... ....++|.-+ .+.... +..|++.+..+-.- -+++.+-++. ++++.+ T Consensus 277 l~~-----kLl~~~e~~~~~~~~~~~~~~f~~~----~l~~~d-~~~~~~~~~~~~~~--G~~T~NE~R~-~~g~~p--- 340 (378) T protein:vir:16 277 LTY-----KLISTNRRRVVKGNLYYERIIVDNQ----LFKFAT-LKELIDLYHENING--PIFTQNQLLV-KMGEQP--- 340 (378) T ss_pred HHh-----hcCChhhhhhhhhcccccceeeccc----hhhhcC-HHHHHHHHHHHHhC--CCcCHHHHHH-HhCCCC--- Confidence 333 446667766432 2233444422 222222 23555555544332 2445444442 234422 Q ss_pred HHHHHHHHHHHhcCCCCCCcchhhhcCCCCCcccccccC-CCCCcCCCCCCCCCccCC Q lcl|NC_015285. 302 KEIDEQIASEMEAGIIADPMAEMDPAMAAGGEGAPAAEV-DPNAQESSVDPGDVRRGE 358 (359) Q Consensus 302 ~e~~kqi~~E~~~~~~~~P~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~p~~~~~~~ 358 (359) ++..+.-. .+....+.... +.....+..+|++--.+| T Consensus 341 ---------------~~ggD~~~-----~~~n~~~~~~~~~~~~~~~~~~~~~e~~ne 378 (378) T protein:vir:16 341 ---------------IEGGDVYI-----ANLNAVAVKNLSDLQGSRKDVTSTDETNNQ 378 (378) T ss_pred ---------------CCCCCeEe-----eccccccccchhhhcCccCCCCCCCCCCCC Confidence 11111000 00000010000 111224444555555666 No 143 >protein:vir:79043 Length: 479 # NCBI annotation: putative portal protein # Family: family:all:125 # MgeID: mge:1861 # MgeName: phiC2 # Cross-refs: genbank:acc:YP_001110721;genbank:gi:134287338;genbank:GeneID:4955217 Probab=25.36 E-value=2.1 Score=18.86 Aligned_cols=279 Identities=11% Similarity=0.078 Sum_probs=102.0 Q ss_pred CCCchhhHHHhhhhhh-------------heeeccccccc---cCCCc---eeecHhH------------hhhhhc-ccc Q lcl|NC_015285. 1 MRGVDLNQQLTQKAAE-------------YFLYNPKGLKN---STNQG---MKITTDS------------VTYCHS-GIQ 48 (359) Q Consensus 1 ~~~~~~~~~~~~~~~e-------------~f~yn~~~~~~---~~~~~---v~i~~~a------------i~y~hS-Gl~ 48 (359) +-++....+++-.++- +=+|.+..... ..+.. +..+... ....|. |-+ T Consensus 161 v~d~~~~~~~~~~ir~y~~~~~~~~~~~~~e~y~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v 240 (479) T protein:vir:79 161 IWDSKRQRELVAFIRFYYIEDIDGNKIKRVEYYTENDITYFIERGNSFIQEFLYDEYGKMTDIQEGHFRINNKEQGWGKV 240 (479) T ss_pred EEeCCCCCceEEEEEEEEEeecCCceEEEEEEEeCCcEEEEEecCCcccccccccccccccccccccccccccccCCCcc Confidence 0010000011111111 11222211110 00000 0000000 000111 111 Q ss_pred cC----CCCcchhhHHHHHHHHHHHHH-HHHHHHHHHHhcCccceeEeccCCCCchHHHHHHHHHHHHhhcceEEeeCCC Q lcl|NC_015285. 49 DL----NKNMTLSHLHKAIKAVNQLRM-IEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKMVYDANT 123 (359) Q Consensus 49 d~----~~~~i~syL~~Aik~~NqL~m-~EDalVIyR~~RAPeRRvFyIDvGnlpk~KAeqYl~~iM~kyrnklvYD~~T 123 (359) +. ++..-.|-++..+....-+.. +-+....-+.++-|-.-+--.+ |. ...+.. T Consensus 241 Pvv~~~nn~~g~sd~~~v~~liDa~d~~~S~~~~~~~~~~~~~~v~~g~~-~~----~~~~~~----------------- 298 (479) T protein:vir:79 241 PFIPFKNNEKCVSDLTFYKSLIDIYDNNISTLADNLDEIQEVIYVLKEYP-GT----SLQEFI----------------- 298 (479) T ss_pred cEEEecCCCCCCcchhhhHHHHHHHHHHHHHHHHHHHHhhCceeeeecCC-cc----ccccch----------------- Confidence 11 222233555555544444432 2344444455555543321111 11 000000 Q ss_pred CcccccccchhhHhhhcccccCCCCccceeecCCCCCcc-hHHHHHHHHHHHHHhcCCCccccCCCCcccccchhh--hh Q lcl|NC_015285. 124 GEIKDDKKFMSMMEDFWLPRREGGRGTEISTLPGGQNLG-ELEDVKYFQKKLYKALNVPSSRLETETTFNIGRAAE--IT 200 (359) Q Consensus 124 Gevkdd~~~mSMlEDywLpRReGgrgTEIsTLpGgqnLg-ei~DV~YF~kkLy~aL~VP~SRl~~~~~~~~g~~~e--It 200 (359) .+.+.. .|+ .+ +++-+ ++.|....+.. .-.-++-+++.+|....+|-. ..++ +|.+|. |. T Consensus 299 ---~~~~~~-~~i---~~---~~~~~--~~~l~~~~~~~~~~~~~~~l~~~i~~~s~~p~~--~~~~---~gn~Sg~Ai~ 361 (479) T protein:vir:79 299 ---DNIRYY-KSI---KV---DGGGG--VDKLEINIPVEAKKELLDRLEKNIIIFGQGVNP--ESQN---TGDKSGVALK 361 (479) T ss_pred ---hhhhhc-cce---ec---CCCCc--ceEEeccCCHHHHHHHHHHHHHHHHHHhCcccc--cccc---ccchhHHHHH Confidence 000000 011 01 12222 33333323332 223456667778888888843 2221 244333 22 Q ss_pred HHhhhHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCChhHHHHHhhceeeeeeccchHHHHHHHHHHHHHHHHHHHhhhhc Q lcl|NC_015285. 201 RDEVKFQKFIARLRKRFSELFTDLLKTQLILKGVMSLEEWEDMKNHIQFDFIADNYFTELKEIEIRNERMNQVAAMDPYV 280 (359) Q Consensus 201 RDElKF~KFI~rLr~rFs~if~d~Lk~QLiLkgI~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~~~~~dp~v 280 (359) .....-..-+.+.+..|...+.++++.=+-+-++....+++ ...+.+.|...---.+... +++++.+. T Consensus 362 ~~~~~l~~k~~~~~~~~~~~l~~~~~li~~~~~~~~~~~~~--~~~i~i~f~~~~p~~~~~~-------a~~~~kl~--- 429 (479) T protein:vir:79 362 FLYSLLDLKCSKTEKKFKKAIRELLWFVCEYLKISGNKSYD--YKTVQITFNHSMIINEAEK-------IDMAAKST--- 429 (479) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCccc--cccceEEeCCCCCcCHHHH-------HHHHHHHh--- Confidence 22222334466777777777777776544333443333333 2457777765544444333 34444443 Q ss_pred chhhhHHHHHHHHhCCCHHHHHHHHHHHHHHHhcCCCCCCcchhhhcCCCCCcccccccCCCCCcCC Q lcl|NC_015285. 281 GKYFSVDYMRRQVLKQTEIEIKEIDEQIASEMEAGIIADPMAEMDPAMAAGGEGAPAAEVDPNAQES 347 (359) Q Consensus 281 GKy~S~~~i~k~IL~~tDeeI~e~~kqi~~E~~~~~~~~P~~~~~~~~~~~~~~~~~~~~~~~~~~~ 347 (359) | .+|++++++. |...++ .+++.++|++|..... +.....++. .+++.-++ T Consensus 430 g-~iS~et~l~~-l~~v~d-~~~E~~ri~~E~~~~~--------~~~~~~~~~------~~~~~~e~ 479 (479) T protein:vir:79 430 G-IVSDETIVSN-HPWVED-VNDELERLKKQEDTQK--------EYDDLIPNN------QDGVIDET 479 (479) T ss_pred c-cCcHHHHHHh-CCCCCC-HHHHHHHHHHHHHHHH--------HHHhccCcc------cCCCcCcC Confidence 4 3799999977 555432 3344455555543211 000000100 01111111 No 144 >protein:vir:5737 Length: 419 # NCBI annotation: portal protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:122 # MgeName: PY54 # Cross-refs: genbank:acc:NP_892048;genbank:gi:33770511;goa:Q7Y412;interpro:IPR006427;interpro:IPR006944;uniprot:Q7Y412;genbank:GeneID:1732929;interpro:IPR010994 Probab=25.03 E-value=2.1 Score=18.81 Aligned_cols=285 Identities=11% Similarity=0.132 Sum_probs=110.4 Q ss_pred CCC--------chhhHHHhhhhhhheeeccccc-----------cccCCCceeecHhHhhhhhcccccCCCCcchhhHHH Q lcl|NC_015285. 1 MRG--------VDLNQQLTQKAAEYFLYNPKGL-----------KNSTNQGMKITTDSVTYCHSGIQDLNKNMTLSHLHK 61 (359) Q Consensus 1 ~~~--------~~~~~~~~~~~~e~f~yn~~~~-----------~~~~~~~v~i~~~ai~y~hSGl~d~~~~~i~syL~~ 61 (359) +.+ +..| .+.+.+..+|... +.-+..+-.++.+-|.+... ...++-.-+|.+.. T Consensus 109 l~Gna~~~i~r~~~G-----~~~~L~pl~~~~v~v~~~~~g~~~y~~~~~~~~~~~~~vih~r~--~~~d~~~G~s~i~~ 181 (419) T protein:vir:57 109 LEGNSYSLIDRNGRG-----DITELIPINPHKVIVLKGPDGMPYYDIPSIGEILPMRMVHHIKS--FSLDGYIGTSPIQT 181 (419) T ss_pred hcCCeEEEEEECCCC-----cEEEEEEEcCcceEEEECCCceEEEEEcCCceEEchhhEEEecC--cCCCCcccccHHHH Confidence 111 1111 1112222222110 01122334577777665542 23344334678888 Q ss_pred HHHHHHHHHHHHHHHHHHHHhcCccceeEeccCCCCchHHHHHHHHHHHHhhcceEEeeCCCCcccccccchhhHhhhcc Q lcl|NC_015285. 62 AIKAVNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKMVYDANTGEIKDDKKFMSMMEDFWL 141 (359) Q Consensus 62 Aik~~NqL~m~EDalVIyR~~RAPeRRvFyIDvGnlpk~KAeqYl~~iM~kyrnklvYD~~TGevkdd~~~mSMlEDywL 141 (359) |.+.+....-+++...=+----+--+-+...+.. +.....++-...+..++.++.-=..+.|.+ + .++ T Consensus 182 ~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~~~-~~~~~~~e~~~~~~~~~~~~~~g~~nag~~------~-vl~---- 249 (419) T protein:vir:57 182 NPDVLGLGIAVEQHAAQVFARGTTMSGVIERPFE-AKAIASQAAVDAILAKWTERYGGVRNAFSV------G-MLQ---- 249 (419) T ss_pred HHHHHHHHHHHHHHHHHHHHccCCccEEEEecCc-CCcccCHHHHHHHHHHHHHHhccccccccc------e-ecC---- Confidence 8888887777776655443333444455555421 111112222333333333322100011111 1 111 Q ss_pred cccCCCCccceeecCCCCCcchHH---HHHHHHHHHHHhcCCCccccCCCCcccccchhhhhHHhhhHHHH-HHHHHHHH Q lcl|NC_015285. 142 PRREGGRGTEISTLPGGQNLGELE---DVKYFQKKLYKALNVPSSRLETETTFNIGRAAEITRDEVKFQKF-IARLRKRF 217 (359) Q Consensus 142 pRReGgrgTEIsTLpGgqnLgei~---DV~YF~kkLy~aL~VP~SRl~~~~~~~~g~~~eItRDElKF~KF-I~rLr~rF 217 (359) .|+++..|- .+..++. =.++..+.++++.+||.+.|...+.-+.....+.. +-|.++ |.-+...+ T Consensus 250 ------~g~~~~~l~--~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~t~sn~e~~~---~~f~~~~l~P~~~~i 318 (419) T protein:vir:57 250 ------EGMTYKQLS--QDNEKAQLLQSRQYTVNEVCRLYKVPPHMIQDLQKSTNNNIEHQG---LQYVIYTMLAILKRH 318 (419) T ss_pred ------CCceEEEcC--CChhhHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCCccccHHHHH---HHHHHHHHHHHHHHH Confidence 245565553 2333333 33466688999999999999654332322222222 225444 23332222 Q ss_pred HHHHHHHHHHHHHhcCCCChhHHHHHhhceeeeeeccchHHHHHHHHHHHHHHHHHHHhhhhcchhhhHHHHHHHHhCCC Q lcl|NC_015285. 218 SELFTDLLKTQLILKGVMSLEEWEDMKNHIQFDFIADNYFTELKEIEIRNERMNQVAAMDPYVGKYFSVDYMRRQVLKQT 297 (359) Q Consensus 218 s~if~d~Lk~QLiLkgI~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~~~~~dp~vGKy~S~~~i~k~IL~~t 297 (359) .. -+-+.++++.++. ...+.|..+.. ... -+..|++.++.+-.- -+++.+-++. ++++. T Consensus 319 e~---------~l~~~ll~~~~~~----~~~i~fd~~~l----l~~-d~~~~~~~~~~~~~~--G~~T~NE~R~-~~gl~ 377 (419) T protein:vir:57 319 ES---------AMMRDLLLPSERR----DFYIEFNVSSL----LRG-DQKSRYESYALGRQW--GWLSVNDIRR-MENLT 377 (419) T ss_pred HH---------HHHhhccCccccC----CeEEEEechhh----hcc-CHHHHHHHHHHHHhC--CCcCHHHHHH-HhCCC Confidence 22 2333444555443 34455554332 111 123455555544222 2444444442 23332 Q ss_pred HHHHHHHHHHHHHHHhcCCCCCCcchhhhcCCCCCcccccccCCCCCcCCCCCCCCCccCCC Q lcl|NC_015285. 298 EIEIKEIDEQIASEMEAGIIADPMAEMDPAMAAGGEGAPAAEVDPNAQESSVDPGDVRRGEF 359 (359) Q Consensus 298 DeeI~e~~kqi~~E~~~~~~~~P~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~ 359 (359) +-+ .-+.+.-|-. ..+...+.+...+.|....+++..-+ T Consensus 378 p~~------------ggD~~~~~~n-----------~~~~~~~~~~~~~~~~~~~~~~~~~~ 416 (419) T protein:vir:57 378 PIP------------GGDKYLTPLN-----------MVDSKALTGIGKATPQQLKDIEAILC 416 (419) T ss_pred CCC------------CcCeeeeccc-----------cccccccccccCCCcccCcchhhhhh Confidence 210 0000111110 00111111111122222222222222 No 145 >protein:vir:4156 Length: 542 # NCBI annotation: portal protein # Family: family:all:1379 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:87 # MgeName: psiM2 # Cross-refs: genbank:acc:NP_046965;genbank:gi:9630535;genbank:GeneID:1261709 Probab=24.64 E-value=2.2 Score=18.76 Aligned_cols=299 Identities=11% Similarity=0.094 Sum_probs=110.2 Q ss_pred CCCch-----------hhHHHhhhhhhheee-ccc-----------ccc--ccCCCceeecHhHhhhhhccccc-CCCCc Q lcl|NC_015285. 1 MRGVD-----------LNQQLTQKAAEYFLY-NPK-----------GLK--NSTNQGMKITTDSVTYCHSGIQD-LNKNM 54 (359) Q Consensus 1 ~~~~~-----------~~~~~~~~~~e~f~y-n~~-----------~~~--~~~~~~v~i~~~ai~y~hSGl~d-~~~~~ 54 (359) .++.. ..+.|...-..|+.+ +.. ... ..+.....++.+-|.+.. ..+ .++-. T Consensus 114 ~rd~~G~~~~L~~l~~~~v~v~~d~~~~~~~~~~~~~~~~~~y~~~~~~~~~~g~~~~~~~~~eIiHir--~~~~~~~~~ 191 (542) T protein:vir:41 114 VRDDRGDPIRFEYIPSHTIRVHKDGSRYRQTWDGVNITHFKDYRYEGEINPETGEDQDSVGANELVFIH--IPSPVCSYY 191 (542) T ss_pred EEcCCCcEEEEEEEcCcceEEEEcCCeeEeeecCCcceeEEeecccccccccccccccccCcccEEEec--CCCCCCCcc Confidence 11110 011111111111111 100 000 011123456666665432 111 12223 Q ss_pred chhhHHHHHHHHHHHHHHHHHHHHHHHhcCccceeEeccCC---------CCchHHHHHHHHHHHHhhcceEEeeCCCCc Q lcl|NC_015285. 55 TLSHLHKAIKAVNQLRMIEDSLVIYRLSRAPERRIFYIDVG---------NLPKNKAEQYLREVMGRYRNKMVYDANTGE 125 (359) Q Consensus 55 i~syL~~Aik~~NqL~m~EDalVIyR~~RAPeRRvFyIDvG---------nlpk~KAeqYl~~iM~kyrnklvYD~~TGe 125 (359) =+|-+..|+..+.....+++...-+=--.+--+-|.++..+ .+-+...+..-+.+...|+.-. .+.|. T Consensus 192 Glspi~~~~~~i~~~~~~~~~~~~~f~Ng~~p~gIL~~~~~l~de~~~~~~~~~e~~~~lk~~~~~~~~g~~---~n~gk 268 (542) T protein:vir:41 192 GVPRYVSAAPAILAMQKIDEYNYAFFDNYTIPSYVITVTGEFEDELEEDPDGNPTGRTVIQALIEDNFKHLK---EAPHT 268 (542) T ss_pred cccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEeCCccccccccccccCHHHHHHHHHHHHHHHhhhh---cccCc Confidence 36788889888877777776654433333555667777543 2222333333333333332100 01111 Q ss_pred ccccccchhhHhhhcccccCC-CCccceeecCCCCCcchHH---HHHHHHHHHHHhcCCCccccCCCC--cccccchhhh Q lcl|NC_015285. 126 IKDDKKFMSMMEDFWLPRREG-GRGTEISTLPGGQNLGELE---DVKYFQKKLYKALNVPSSRLETET--TFNIGRAAEI 199 (359) Q Consensus 126 vkdd~~~mSMlEDywLpRReG-grgTEIsTLpGgqnLgei~---DV~YF~kkLy~aL~VP~SRl~~~~--~~~~g~~~eI 199 (359) .+ .++ .-.| ..|.+++.| +.+..+++ -..+..+.+.++++||...|+... +++.....+. T Consensus 269 ------~~-vL~-----~~~~~~~g~~~~pl--~~~~~d~qfle~~~~~~~~Ia~afgVPp~~lG~~~~~t~n~sn~Eq~ 334 (542) T protein:vir:41 269 ------PL-VFS-----IPGGDTVKVTFTPL--NTSQKELSFREYAAEKKYDIAAAHMIDPYRLGIADTGPLGGNFAEVT 334 (542) T ss_pred ------ee-Eee-----ccCCcccceeEEEc--CCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCcCCCcccccccHHHH Confidence 11 111 1111 124445544 33333332 335567889999999999996543 3333333333 Q ss_pred hHHhhhHHH-HHHHHHHHHHHHHHHHHHHHHHhcCCCChhHHHHHhhceeeeeeccchHHHHHHHHHHHHHHHHHHHhhh Q lcl|NC_015285. 200 TRDEVKFQK-FIARLRKRFSELFTDLLKTQLILKGVMSLEEWEDMKNHIQFDFIADNYFTELKEIEIRNERMNQVAAMDP 278 (359) Q Consensus 200 tRDElKF~K-FI~rLr~rFs~if~d~Lk~QLiLkgI~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~~~~~dp 278 (359) .+. |.+ -|.-+++++...+...|- ++.+ ..+.+.|..+.....- +...++.+ T Consensus 335 ~~~---f~~~tL~P~~~~ie~~ln~~L~---------~~~~-----~~~~~~f~~~~ll~~d--------~~~~~~~~-- 387 (542) T protein:vir:41 335 RRT---YYESVVRPQQNIISSILTDFFQ---------VKFN-----PKTRFKFNDETLLESD--------SVRNCALL-- 387 (542) T ss_pred HHH---HHHHHHHHHHHHHHHHHHhhcc---------cccC-----CceEEEecchhhcchH--------HHHHHHHH-- Confidence 322 543 446667766666664332 2222 2345666654433221 11111111 Q ss_pred hcchhhhHHHHHHHHhCCCHHHHHHHHHHHHHHHhcCCCCCCcch-------hhhcCCCCCc---ccccccCCCCCc--- Q lcl|NC_015285. 279 YVGKYFSVDYMRRQVLKQTEIEIKEIDEQIASEMEAGIIADPMAE-------MDPAMAAGGE---GAPAAEVDPNAQ--- 345 (359) Q Consensus 279 ~vGKy~S~~~i~k~IL~~tDeeI~e~~kqi~~E~~~~~~~~P~~~-------~~~~~~~~~~---~~~~~~~~~~~~--- 345 (359) .-.-+++.+-++.++.++..-+ ..+..|... .+.+...... ...-.+..|... T Consensus 388 v~~GilT~NE~Re~L~g~~pgd--------------d~~l~p~~~~~~~~~~~~~n~~~~~~~~~~k~~~k~~~~~~~~~ 453 (542) T protein:vir:41 388 VQSGVLTPAEARERLFGLDGGP--------------DIFMVPSKGAAKSVKRQERNYEKNQIREIRKIYAKYRPRFNEII 453 (542) T ss_pred HhCCCCCHHHHHHhhCCCCCCC--------------ccccccccccccccccCCcCCCCCchhhhhhcccccCccccccc Confidence 1123445444443322222100 001011000 0000000000 000000111111 Q ss_pred ------CCCCCCCCCccCCC Q lcl|NC_015285. 346 ------ESSVDPGDVRRGEF 359 (359) Q Consensus 346 ------~~~~~p~~~~~~~~ 359 (359) +++..-.+.+.|+| T Consensus 454 ~~~~~~~~~~~~~~~~~~~~ 473 (542) T protein:vir:41 454 SSKLSAEEKKKKIDESLAEF 473 (542) T ss_pred cccccchhhcccccchhhhh Confidence 11111122233444 No 146 >protein:vir:81072 Length: 432 # NCBI annotation: p07 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:1889 # MgeName: Xop411 # Cross-refs: genbank:acc:YP_001285677;genbank:gi:148727185;genbank:GeneID:5247117 Probab=22.27 E-value=2.5 Score=18.43 Aligned_cols=288 Identities=13% Similarity=0.161 Sum_probs=113.4 Q ss_pred CCCchhhHHHh---hhhhhheeeccccc--------------cccCCCceeecHhHhhhhhcccccCCCCcchhhHHHHH Q lcl|NC_015285. 1 MRGVDLNQQLT---QKAAEYFLYNPKGL--------------KNSTNQGMKITTDSVTYCHSGIQDLNKNMTLSHLHKAI 63 (359) Q Consensus 1 ~~~~~~~~~~~---~~~~e~f~yn~~~~--------------~~~~~~~v~i~~~ai~y~hSGl~d~~~~~i~syL~~Ai 63 (359) +.++.- +.++ +.+.+.+.+.|... ....+..+.++.+-|.+.. + .+.++-.=+|-|+.|. T Consensus 124 l~Gnay-v~i~~~~g~~~~L~~l~~~~v~v~~~~~g~~~y~~~~~~g~~~~~~~~~iih~r-~-~~~dg~~G~spi~~~~ 200 (432) T protein:vir:81 124 LDGTAY-VRKVVTDGRIESLQYLANDRLTITTDPKGNTAYRYRRTDGQMIDIPKQQIWKIM-G-YSLDGENGLSAIRYGA 200 (432) T ss_pred hcCCeE-EEEEecCCcEEEEEEEcCCceEEEECCCCcEEEEEEecCceEEEEccccEEEec-C-CCCCCcccccHHHHHH Confidence 111100 0000 11223333333221 0112344677777775542 1 1223323357788888 Q ss_pred HHHHHHHHHHHHHHHHHHhc--CccceeEeccCCCCchHHHHHHHHHHHHhhcceEEeeCCCCcccccccchhhHhhhcc Q lcl|NC_015285. 64 KAVNQLRMIEDSLVIYRLSR--APERRIFYIDVGNLPKNKAEQYLREVMGRYRNKMVYDANTGEIKDDKKFMSMMEDFWL 141 (359) Q Consensus 64 k~~NqL~m~EDalVIyR~~R--APeRRvFyIDvGnlpk~KAeqYl~~iM~kyrnklvYD~~TGevkdd~~~mSMlEDywL 141 (359) +.+..-..+++... +.++ +--.-|..+| +.|-+..++...+ +|..- .+.|.+ + .++ T Consensus 201 ~~i~~~~~~~~~~~--~~f~ng~~~~gil~~~-~~l~~e~~~~~~~----~~~~~----~nag~~------~-vl~---- 258 (432) T protein:vir:81 201 QIFGTAIAAEAQAA--RAFRNGQLQSVYYQID-RFLTDDQYDSFAK----KVSGS----VEAGRA------P-LLE---- 258 (432) T ss_pred HHHHHHHHHHHHHH--HHHhcCCCcceEEecC-CCCCHHHHHHHHH----HHhhh----hcCCCc------e-ecC---- Confidence 87777666665543 2222 2223455555 5555444444332 22110 011211 1 221 Q ss_pred cccCCCCccceeecCC-CCCcchHHHHHHHHHHHHHhcCCCccccCCCCcccccchhhhhHHhhhHHHHHHHHHHHHHHH Q lcl|NC_015285. 142 PRREGGRGTEISTLPG-GQNLGELEDVKYFQKKLYKALNVPSSRLETETTFNIGRAAEITRDEVKFQKFIARLRKRFSEL 220 (359) Q Consensus 142 pRReGgrgTEIsTLpG-gqnLgei~DV~YF~kkLy~aL~VP~SRl~~~~~~~~g~~~eItRDElKF~KFI~rLr~rFs~i 220 (359) .|++++.|.= .+.+.-++-.+|....+.++.+||...|+....-+.+.++.+.-.-+-|.++ .|+--+. . T Consensus 259 ------~g~~~~~l~~~~~d~q~le~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~~~sn~eq~~~~f~~~--tl~P~~~-~ 329 (432) T protein:vir:81 259 ------GGMDVKSLGLNPVDAQLLQSRQYSVESICRFFGVPPSMIGHSSAGTTSWGSGIESQQLGFLTM--TLSPWLR-R 329 (432) T ss_pred ------CCceEEEccCCHHHHHHHHHHHHHHHHHHHHhCCCHHHcCCcCCccccccchHHHHHHHHHHH--HHHHHHH-H Confidence 2455555532 1222233445678889999999999999754332222222222222225543 3333222 2 Q ss_pred HHHHHHHHHHhcCCCChhHHHHHhhceeeeeeccchHHHHHHHHHHHHHHHHHHHhhhhcchhhhHHHHHHHHhCCCHHH Q lcl|NC_015285. 221 FTDLLKTQLILKGVMSLEEWEDMKNHIQFDFIADNYFTELKEIEIRNERMNQVAAMDPYVGKYFSVDYMRRQVLKQTEIE 300 (359) Q Consensus 221 f~d~Lk~QLiLkgI~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~~~~~dp~vGKy~S~~~i~k~IL~~tDee 300 (359) +.+.|-.. +.++.++. ...+.|..+ ++...+ +++|.+.++.+-.- -+++.+-++.. +++.. T Consensus 330 ie~~l~~k-----Ll~~~~~~----~~~~~fd~~----~llr~d-~~~r~~~~~~~~~~--G~~t~NE~R~~-~glpp-- 390 (432) T protein:vir:81 330 IEQSIALN-----LLSPAERR----RYFADFDTS----ALLRAD-SAARSSYYSQLVNN--GLMTRDEAREI-EGLPK-- 390 (432) T ss_pred HHHHHHhh-----ccCccccC----ceEEEeech----hhhccC-HHHHHHHHHHHHhC--CCCCHHHHHHH-hCCCC-- Confidence 33334433 34555543 345555533 222222 34566666655322 24455555432 44422 Q ss_pred HHHHHHHHHHHHhcCCCCCCcchhhhcCC-CCCcccccccCCCCCcCCCCCCCCCccCC Q lcl|NC_015285. 301 IKEIDEQIASEMEAGIIADPMAEMDPAMA-AGGEGAPAAEVDPNAQESSVDPGDVRRGE 358 (359) Q Consensus 301 I~e~~kqi~~E~~~~~~~~P~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~p~~~~~~~ 358 (359) ++.-+.-.-.+.. .|.+. .+....|.......+..+-..+| T Consensus 391 ----------------~~g~~~~~~~~~~~~pl~~-~~~~~~~~~~~~~~n~~~~~~~~ 432 (432) T protein:vir:81 391 ----------------LGGNAAVLTVQSAMVPLDS-IGLQASPEPASGLGNQQQDKVSK 432 (432) T ss_pred ----------------CCCCcceEeecCcccchhh-hccCCCCCCCCCCCCcccccccC Confidence 1100000000000 00000 00000111111122222333333 No 147 >protein:vir:3843 Length: 397 # NCBI annotation: hypothetical protein # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:322 # MgeName: phi adh # Cross-refs: genbank:acc:NP_050149;swissprot:trembl:q9t1f8;genbank:gi:9633041;uniprot:Q9T1F8;genbank:GeneID:1262206 Probab=21.99 E-value=2.5 Score=18.39 Aligned_cols=274 Identities=15% Similarity=0.160 Sum_probs=116.5 Q ss_pred CCCc--------hhhHHHhhhhhhheeeccccc------------------cccCCCceeecHhHhhhhhcccccCCCCc Q lcl|NC_015285. 1 MRGV--------DLNQQLTQKAAEYFLYNPKGL------------------KNSTNQGMKITTDSVTYCHSGIQDLNKNM 54 (359) Q Consensus 1 ~~~~--------~~~~~~~~~~~e~f~yn~~~~------------------~~~~~~~v~i~~~ai~y~hSGl~d~~~~~ 54 (359) +.++ ..| .+.+.+...|..+ ....+..+.++.+-|.|..- .+.++.. T Consensus 94 l~Gna~~~i~r~~~g-----~~~~l~~l~~~~v~i~~~~~~~~~~y~~~~~~~~~~~~~~~~~~eiih~~~--~~~~~~~ 166 (397) T protein:vir:38 94 LDGNCYAYRHKNTNG-----VDLSWEYLRPSQVQPMLLQDGSGLIYNINFDEPAIGYMENVPAADVIHIRL--LSKNGGK 166 (397) T ss_pred hcCCEEEEEEECCCC-----cEEEEEEEcCceeEEEEcCCCceEEEEEEeccccccceeEecCccEEEecC--CCCCCcc Confidence 1111 111 1222222222210 01122235678777776542 2233332 Q ss_pred -chhhHHHHHHHHHHHHHHHHHHHHHHHhcCccceeEeccCCCCchHHHHHHHHHHHHhhcceEEeeCCCCcccccccch Q lcl|NC_015285. 55 -TLSHLHKAIKAVNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKMVYDANTGEIKDDKKFM 133 (359) Q Consensus 55 -i~syL~~Aik~~NqL~m~EDalVIyR~~RAPeRRvFyIDvGnlpk~KAeqYl~~iM~kyrnklvYD~~TGevkdd~~~m 133 (359) =+|-|..|.+.+.....+++...-+----+.-+-++.++.+ +.+. +.+-+++.+..++.- + +.|. .+ T Consensus 167 ~G~s~i~~~~~~i~~~~~~~~~~~~~f~ng~~~~~il~~~~~-~~~e-~~~~~~~~~~~~~~~---~-n~~~------~~ 234 (397) T protein:vir:38 167 TGISPLSALINEQQIKDASNELTLKALKQSVTASAVLTIQKG-GLLD-AETRIARSKEISKQI---H-NSDG------PV 234 (397) T ss_pred ccccHHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEeCCC-CCHH-HHHHHHHHHHHHhcc---c-ccCC------ce Confidence 36889999999999888888877655556666777777755 3433 334444444332210 1 1111 11 Q ss_pred hhHhhhcccccCCCCccceeecCCCCCcch-HHHHHHHHHHHHHhcCCCccccCCCCcccccchhhhhHHhhhHHHHHHH Q lcl|NC_015285. 134 SMMEDFWLPRREGGRGTEISTLPGGQNLGE-LEDVKYFQKKLYKALNVPSSRLETETTFNIGRAAEITRDEVKFQKFIAR 212 (359) Q Consensus 134 SMlEDywLpRReGgrgTEIsTLpGgqnLge-i~DV~YF~kkLy~aL~VP~SRl~~~~~~~~g~~~eItRDElKF~KFI~r 212 (359) .+ ..|.+++.|.-..+-.+ ++=.++..+.+.++++||...|....+-+ +.+.....-|.+-+.- T Consensus 235 -vl----------~~g~~~~~l~~~~~d~~~~e~~~~~~~~Ia~afgVp~~~lg~~~~~~----~~~e~~~~~~~~~l~P 299 (397) T protein:vir:38 235 -VI----------DALEDYKPLEVKGNIASLLNQVDWTRDQIAKVYGVPDSYLNGQGDQQ----SSITQISGQYAKSLNR 299 (397) T ss_pred -ec----------CCCceEEecCCChhHHHHHHHHHHHHHHHHHHhCCCHHHhCCCCCcc----cHHHHHHHHHHHHHHH Confidence 11 12466666654333233 45567889999999999999996543211 2222222223222333 Q ss_pred HHHHHHHHHHHHHHHHHHhcCCCChhHHHHHhhceeeeeeccchHHHHHHHHHHHHHHHHHHHhhhhcchhhhHHHHHHH Q lcl|NC_015285. 213 LRKRFSELFTDLLKTQLILKGVMSLEEWEDMKNHIQFDFIADNYFTELKEIEIRNERMNQVAAMDPYVGKYFSVDYMRRQ 292 (359) Q Consensus 213 Lr~rFs~if~d~Lk~QLiLkgI~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~~~~~dp~vGKy~S~~~i~k~ 292 (359) +...+ .+.|-..| .++.+|+ +.|.-+- -...|.+.++.+-. +.+++.+-++. T Consensus 300 ~~~~i----e~~ln~~l-----~~~~~~~-------~~~~~~~---------d~~~~~~~~~~~~~--~G~~t~nE~R~- 351 (397) T protein:vir:38 300 YVQAI----VGELNDKL-----HANISAN-------IRFAIDA---------MGDQYASTISSSVK--GGTIAGNQARF- 351 (397) T ss_pred HHHHH----HHHHHHhc-----cChhccc-------ccccccC---------CHHHHHHHHHHHHh--CCCcCHHHHHH- Confidence 33322 33333332 3333432 2222111 12355555554422 23555555554 Q ss_pred HhCCCHHHHHHHHHHHHHHHhcCCCCCCcchhhhcCCC--CCcccccccCCCCCcCCCCCCC Q lcl|NC_015285. 293 VLKQTEIEIKEIDEQIASEMEAGIIADPMAEMDPAMAA--GGEGAPAAEVDPNAQESSVDPG 352 (359) Q Consensus 293 IL~~tDeeI~e~~kqi~~E~~~~~~~~P~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~p~ 352 (359) +|++.+-+ .+-.+.|+.......+. ..+|...+ ....+...+|+ T Consensus 352 ~lg~~p~~-------------~~d~~~~~~~~~~~~~~~~~~~g~~~~---~~~~e~~~~~~ 397 (397) T protein:vir:38 352 ILQNSGYL-------------AKDLPDPEKEPQQAIQLIQQEGGENDG---NNSDERGSDPE 397 (397) T ss_pred HhCCCCCC-------------CCccccccccccccccccccccCCCCC---CCCCCCCCCCC Confidence 24443210 00011111111111000 00111110 01112222233 No 148 >protein:vir:1326 Length: 457 # NCBI annotation: gp34 # Family: family:all:31 # ACLAME annotation(s): phi:0000068 - phage portal protein # MgeID: mge:28 # MgeName: phi-C31 # Cross-refs: genbank:acc:NP_047925;swissprot:trembl:q9zxb2;genbank:gi:9631143;uniprot:Q9ZXB2;genbank:GeneID:2715872 Probab=20.77 E-value=2.7 Score=18.21 Aligned_cols=299 Identities=13% Similarity=0.112 Sum_probs=113.1 Q ss_pred CCCchhhHHHhhhhhhheeecccc--------------------ccccCCCc--eeecHhHhhhhhcccccCCCC-cchh Q lcl|NC_015285. 1 MRGVDLNQQLTQKAAEYFLYNPKG--------------------LKNSTNQG--MKITTDSVTYCHSGIQDLNKN-MTLS 57 (359) Q Consensus 1 ~~~~~~~~~~~~~~~e~f~yn~~~--------------------~~~~~~~~--v~i~~~ai~y~hSGl~d~~~~-~i~s 57 (359) ++.+. | .+.+.+..+|.. +...++.. ..++.+-|.+.. ..+.++. .=+| T Consensus 123 i~~~~-g-----~~~~l~~l~p~~v~v~~~~~~~~~~~~~~~y~~~~~~~~~~~~~~~~~diih~~--~~~~~~~~~G~s 194 (457) T protein:vir:13 123 VRWQG-P-----NIVGLDVLDPTKIHVHMVMVDGLRRKVFEAYDIDADGNEVLLGWFTPRDVLHIP--GMMLPGDFVGCS 194 (457) T ss_pred EEecC-C-----cEEEEEEEccCceEEEEecCCCccceeEEEEEEecCCceeeEEeeCccceEEec--CCCCCCcccccc Confidence 11110 1 111112222211 00011111 234555554432 2222332 3456 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHHhcCccceeEeccCCCCchHHHHHHHHHHHHhhcceEEeeCCCCcccccccchhhHh Q lcl|NC_015285. 58 HLHKAIKAVNQLRMIEDSLVIYRLSRAPERRIFYIDVGNLPKNKAEQYLREVMGRYRNKMVYDANTGEIKDDKKFMSMME 137 (359) Q Consensus 58 yL~~Aik~~NqL~m~EDalVIyR~~RAPeRRvFyIDvGnlpk~KAeqYl~~iM~kyrnklvYD~~TGevkdd~~~mSMlE 137 (359) -+..|.+.+.....+++...-+=---+--+-|..++ +.|-+...++..+.+...|+.. +. .|.+ + .++ T Consensus 195 ~i~~~~~~i~~~~~~~~~~~~~f~ng~~p~gil~~~-~~ls~e~~~~~~~~~~~~~~g~---~n-ag~~------~-vl~ 262 (457) T protein:vir:13 195 PISYARESIGLALAAQKYGSKFFANGAMPGAVVEVP-GTMSEEGLARAREAWRAANSGV---DN-AHRV------A-LLT 262 (457) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhcCCCcceEEEcC-CCCCHHHHHHHHHHHHHHhcCc---cc-cCcc------e-ecC Confidence 788888888877777776554434444555666665 5666655555444444444320 11 1211 1 111 Q ss_pred hhcccccCCCCccceeecCCCCCcchH---HHHHHHHHHHHHhcCCCccccCCCCcccccchhhhhHHhhhHHHHH-HHH Q lcl|NC_015285. 138 DFWLPRREGGRGTEISTLPGGQNLGEL---EDVKYFQKKLYKALNVPSSRLETETTFNIGRAAEITRDEVKFQKFI-ARL 213 (359) Q Consensus 138 DywLpRReGgrgTEIsTLpGgqnLgei---~DV~YF~kkLy~aL~VP~SRl~~~~~~~~g~~~eItRDElKF~KFI-~rL 213 (359) .|.+++.|. .+..++ +=-+|....+.++++||...|+.-.+-+.. ++.+...-+.|.+++ .-+ T Consensus 263 ----------~g~~~~~l~--~~~~d~q~~e~~~~~~~~Ia~~fgVPp~~lg~~~~~~~~-~sn~eq~~~~f~~~tl~P~ 329 (457) T protein:vir:13 263 ----------EGAKFSKVA--MSPDEAQFLQTRQFQVPEIARIFGVPPHLISDATNSTSW-GSGLAEQNIAFTMFSLRPW 329 (457) T ss_pred ----------CCceEEEcc--CChhHHHHHHHHHHHHHHHHHHhCCCHHHcCCCCCcccc-cchHHHHHHHHHHHHHHHH Confidence 234555552 223332 333477888999999999999643332221 122333434476653 333 Q ss_pred HHHHHHHHHHHHHHHHHhcCCCChhHHHHHhhceeeeeeccchHHHHHHHHHHHHHHHHHHHhhhhcchhhhHHHHHHHH Q lcl|NC_015285. 214 RKRFSELFTDLLKTQLILKGVMSLEEWEDMKNHIQFDFIADNYFTELKEIEIRNERMNQVAAMDPYVGKYFSVDYMRRQV 293 (359) Q Consensus 214 r~rFs~if~d~Lk~QLiLkgI~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~~~~~dp~vGKy~S~~~i~k~I 293 (359) ..+ +.+.|-..|+ ++.++ ....+.|..+ ++...+ +..|.+.+..+-.- -+++.+-++. . T Consensus 330 ~~~----ie~~ln~~L~-----~~~~~----~~~~i~fd~~----~l~~~D-~~~r~~~~~~~~~~--G~~T~NE~R~-~ 388 (457) T protein:vir:13 330 LER----IEAGFNRLLF-----AETAD----RFRFVKFNLD----EIKRGA-PKERMELWSLGLQN--GIYSIDEVRA-A 388 (457) T ss_pred HHH----HHHHHHHhhc-----Ccccc----CceeEEeech----hhhccC-HHHHHHHHHHHHhC--CCcCHHHHHH-H Confidence 333 3333333333 33332 2223445433 222222 24556665554322 2556666653 3 Q ss_pred hCCCHHHHHHHHHHHHHHHhc-CCCCCCc--chhhhcCCCCCcccccccCCCCCcCCCCCCCCCccCC-C Q lcl|NC_015285. 294 LKQTEIEIKEIDEQIASEMEA-GIIADPM--AEMDPAMAAGGEGAPAAEVDPNAQESSVDPGDVRRGE-F 359 (359) Q Consensus 294 L~~tDeeI~e~~kqi~~E~~~-~~~~~P~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~-~ 359 (359) ++|.+-+=-..++-.-. ..- +.-..|+ ....++.+.++.+.++...++ ...+.+.-.++ . T Consensus 389 ~gl~Pi~~g~~d~~~~~-~n~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~g~~d~~~~~~~~ 452 (457) T protein:vir:13 389 EDMTPLPDGLGEKYRVP-LNLGEVGEEPEPEPAPAPPAIEPPAEEPDEEPEP-----EGKPDDEGATEED 452 (457) T ss_pred hCCCCCCCCcccceeec-cccccccccccccccCCCCCCCCCccccCCCCCC-----CCCCccccCCCCc Confidence 56643110000000000 000 0000000 000000000000000000000 00111111011 1 No 149 >protein:vir:103951 Length: 511 # NCBI annotation: phage portal protein # Family: family:all:125 # MgeID: mge:1662 # MgeName: phiNM # Cross-refs: genbank:acc:YP_873988;genbank:gi:118430763;genbank:GeneID:4525445 Probab=20.55 E-value=2.7 Score=18.17 Aligned_cols=298 Identities=11% Similarity=0.085 Sum_probs=101.1 Q ss_pred CCCchhhHHHhhhh---------------hhhe-eeccccccc---cCCCceeecHhHhh-hhhc-cccc----CCCCcc Q lcl|NC_015285. 1 MRGVDLNQQLTQKA---------------AEYF-LYNPKGLKN---STNQGMKITTDSVT-YCHS-GIQD----LNKNMT 55 (359) Q Consensus 1 ~~~~~~~~~~~~~~---------------~e~f-~yn~~~~~~---~~~~~v~i~~~ai~-y~hS-Gl~d----~~~~~i 55 (359) +-++....+.+-.+ ..|+ +|++...+. ..+........-+. ..|- |-++ +++..- T Consensus 175 vydd~~~~~~~~~vr~~~~~~~d~~~~~~~~~~~iyt~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~vPvv~f~nn~~g 254 (511) T protein:vir:10 175 IYDNTIERNSIAGVRYLRTKPIDKTDEDEVFTVDLFTSHGVYRYLTSRTNGLKLTPRENGFESHSFERMPITEFSNNERR 254 (511) T ss_pred EEcCCCCCceEEEEEEEEeeecccCccceEEEEEEEeCCcEEEEEecCCCcccccccccccccccCcceeEEEecCCCCC Confidence 11111111111111 1111 455543221 11111111000000 0010 1111 111122 Q ss_pred hhhHHHHHHHHHHHHH-HHHHHHHHHHhcCccceeEe---ccCCCCchHHHHHHHHHHHHhhcceEEeeCCCCccccccc Q lcl|NC_015285. 56 LSHLHKAIKAVNQLRM-IEDSLVIYRLSRAPERRIFY---IDVGNLPKNKAEQYLREVMGRYRNKMVYDANTGEIKDDKK 131 (359) Q Consensus 56 ~syL~~Aik~~NqL~m-~EDalVIyR~~RAPeRRvFy---IDvGnlpk~KAeqYl~~iM~kyrnklvYD~~TGevkdd~~ 131 (359) .|-++..+....-+.. +=+....-+-++.|-+-+.- .|...+++.+ .. T Consensus 255 ~gd~e~v~~liDa~d~~~S~~~~~~~~~~~~~lv~~g~~~~~~~~~~~~~----------------------------~~ 306 (511) T protein:vir:10 255 KGDYEKVITLIDLYDNAESDTANYMSDLNDAMLLIKGNLNLDPVEVRKQK----------------------------EA 306 (511) T ss_pred CCchhhhHHHHHHHHHHHHHHHHHHHHhhCceeeeeccccCCchhhccch----------------------------hc Confidence 3444444444333322 11222222333444333221 1111111100 01 Q ss_pred chhhHhhh-cccc--cCCCCccceeecCCCCCcchH-HHHHHHHHHHHHhcCCCccccCCCCcccccchh--hhhHHhhh Q lcl|NC_015285. 132 FMSMMEDF-WLPR--REGGRGTEISTLPGGQNLGEL-EDVKYFQKKLYKALNVPSSRLETETTFNIGRAA--EITRDEVK 205 (359) Q Consensus 132 ~mSMlEDy-wLpR--ReGgrgTEIsTLpGgqnLgei-~DV~YF~kkLy~aL~VP~SRl~~~~~~~~g~~~--eItRDElK 205 (359) .+-.+..- +.+. ...+.|..+..|-...+...+ .-+.-+.+.+|.-..+|---.+ .|. |..| .|..-... T Consensus 307 ~~~~~~~~~~~~~~~~~~~~~~d~~~l~~~~~~~~~e~~~~~L~~~I~~~s~~P~~~~~---~~~-~n~Sg~Al~~~~~~ 382 (511) T protein:vir:10 307 NVLFLEPTVYADSEGRETEGSVDGGYIYKQYDVQGTEAYKDRLNSDIHMFTNTPNMKDD---NFS-GTQSGEAMKYKLFG 382 (511) T ss_pred cceecccccccccccccCCCCcceeEEeecCCHHHHHHHHHHHHHHHHHHhCCcccccc---ccc-ccchHHHHHHHHHH Confidence 11011100 0000 011112334444333333322 3455667778888888852221 121 2222 23222223 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHh----cCCCChhHHHHHhhceeeeeeccchHHHHHHHHHHHHHHHHHHHhhhhcc Q lcl|NC_015285. 206 FQKFIARLRKRFSELFTDLLKTQLIL----KGVMSLEEWEDMKNHIQFDFIADNYFTELKEIEIRNERMNQVAAMDPYVG 281 (359) Q Consensus 206 F~KFI~rLr~rFs~if~d~Lk~QLiL----kgI~t~eew~~~~~~I~~~f~~Dn~f~ElKe~Eil~~Rl~~~~~~dp~vG 281 (359) ...-+.+.+..|..-+.+.++.=+-+ .++-...+| ..|++.|...--=.+ .+.++++..+. | T Consensus 383 l~~k~~~k~~~f~~~l~~~~~li~~~~~~~~~~~~~~d~----~~i~i~f~~~~p~d~-------~~~~~~~~kl~---G 448 (511) T protein:vir:10 383 LEQRTKTKEGLFTKGLRRRAKLLETILKNTRSIDANKDF----NTVRYVYNRNLPKSL-------IEELKAYIDSG---G 448 (511) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhhCCccccccc----ceeeEEeCCCCCcCH-------HHHHHHHHHHh---c Confidence 34445666666666666665542222 122233344 357778865333222 23455555553 4 Q ss_pred hhhhHHHHHHHHhCCCHHHHHHHHHHHHHHHhcCCCCCCcchhhhcCCCCCcccccccCCCCCcCCCCCCCCCccCC Q lcl|NC_015285. 282 KYFSVDYMRRQVLKQTEIEIKEIDEQIASEMEAGIIADPMAEMDPAMAAGGEGAPAAEVDPNAQESSVDPGDVRRGE 358 (359) Q Consensus 282 Ky~S~~~i~k~IL~~tDeeI~e~~kqi~~E~~~~~~~~P~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~ 358 (359) .+|.+++++. |...++ .+++.++|++|+.+. .+.. ++..++. +++...++-++ ++.+.+ .+.| T Consensus 449 -~iS~et~~~~-l~~v~d-~~~E~~ri~~E~~~~-~~~~---~~~~~~~-~~~~~~~~~~~---~~~~~~---~~~~ 511 (511) T protein:vir:10 449 -KISQTTLMSL-FSFFQD-PELEVKKIEEDEKES-IKKA---QKGIYKD-PRDINDDEQDD---DTKDTV---DKKE 511 (511) T ss_pred -cCcHHHHHHh-CCCCCC-HHHHHHHHHHHHHHH-HHHH---hhhcccC-CCCCCCCCCCC---cccCcc---cccC Confidence 3799999977 555432 334455566664431 1111 1111111 11111111111 111111 1111 Done!