Query lcl|NC_018856.1_cdsid_YP_006907308.1 [gene=D307_gp252] [protein=major capsid protein] [protein_id=YP_006907308.1] [location=12740..14179] Match_columns 479 No_of_seqs 47 out of 50 Neff 4.8 Searched_HMMs 1612 Date Thu Nov 7 15:28:41 2013 Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_16 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_16_vs_rec_db.hhr No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 protein:vir:95603 Length: 463 100.0 4E-199 3E-202 1108.1 36.1 463 1-470 1-463 (463) 2 protein:vir:99311 Length: 463 100.0 4E-199 3E-202 1108.1 36.1 463 1-470 1-463 (463) 3 protein:vir:96666 Length: 462 100.0 2E-198 1E-201 1104.8 34.7 461 1-469 1-462 (462) 4 protein:vir:80835 Length: 464 100.0 7E-194 4E-197 1079.6 35.1 460 1-479 1-461 (464) 5 protein:vir:63741 Length: 468 100.0 5E-193 3E-196 1075.0 33.4 468 1-478 1-468 (468) 6 protein:vir:80491 Length: 467 100.0 9E-193 6E-196 1073.4 31.9 467 1-478 1-467 (467) 7 protein:vir:100851 Length: 514 100.0 2E-189 1E-192 1054.7 34.5 472 1-476 1-514 (514) 8 protein:vir:102823 Length: 470 100.0 5E-157 3E-160 877.7 31.0 437 15-471 1-470 (470) 9 protein:vir:8843 Length: 317 # 99.4 8.2E-15 5.1E-18 97.8 14.8 294 36-343 1-317 (317) 10 protein:vir:93631 Length: 580 99.1 8.8E-13 5.5E-16 86.7 9.8 247 189-479 1-284 (580) 11 protein:vir:2792 Length: 567 # 99.0 2E-11 1.2E-14 79.3 14.5 255 201-479 1-329 (567) 12 protein:vir:3306 Length: 567 # 99.0 2E-11 1.2E-14 79.3 14.5 255 201-479 1-329 (567) 13 protein:vir:10145 Length: 567 99.0 2E-11 1.2E-14 79.3 14.5 255 201-479 1-329 (567) 14 protein:vir:9979 Length: 567 # 99.0 2E-11 1.2E-14 79.3 14.5 255 201-479 1-329 (567) 15 protein:vir:94933 Length: 330 98.9 5.9E-11 3.7E-14 76.7 12.4 318 1-468 1-330 (330) 16 protein:vir:104388 Length: 566 98.8 1.3E-10 8.3E-14 74.7 12.4 288 141-479 1-328 (566) 17 protein:vir:827 Length: 567 # 98.8 6.3E-11 3.9E-14 76.5 9.3 291 140-479 1-329 (567) 18 protein:vir:97255 Length: 310 98.8 4.4E-10 2.7E-13 71.9 13.9 300 1-467 1-310 (310) 19 protein:vir:5120 Length: 615 # 98.8 2.8E-10 1.7E-13 72.9 12.0 252 187-479 1-310 (615) 20 protein:vir:78830 Length: 324 97.9 1.4E-06 8.5E-10 52.7 13.3 299 1-338 1-324 (324) 21 protein:vir:96392 Length: 324 97.9 1.4E-06 8.5E-10 52.7 13.3 299 1-338 1-324 (324) 22 protein:vir:7771 Length: 330 # 97.8 1.8E-05 1.1E-08 46.6 18.8 323 23-419 1-330 (330) 23 protein:vir:105563 Length: 396 97.6 3.1E-07 1.9E-10 56.3 6.2 266 155-479 1-296 (396) 24 protein:vir:9309 Length: 324 # 97.6 1.2E-05 7.5E-09 47.6 14.8 299 1-338 1-324 (324) 25 protein:vir:103955 Length: 324 97.6 1.5E-05 9.1E-09 47.1 14.7 297 1-338 1-324 (324) 26 protein:vir:8102 Length: 543 # 97.6 2.2E-05 1.4E-08 46.1 15.6 326 1-389 198-543 (543) 27 protein:vir:96223 Length: 324 97.5 2.2E-05 1.4E-08 46.1 15.1 300 1-338 1-324 (324) 28 protein:vir:99749 Length: 324 97.5 9.9E-06 6.2E-09 48.0 13.0 297 1-338 1-324 (324) 29 protein:vir:191 Length: 385 # 97.5 2.4E-05 1.5E-08 45.9 14.6 313 1-358 50-385 (385) 30 protein:vir:1886 Length: 385 # 97.5 2.4E-05 1.5E-08 45.9 14.6 313 1-358 50-385 (385) 31 protein:vir:100135 Length: 418 97.4 1.3E-05 8E-09 47.4 13.1 313 1-347 87-418 (418) 32 protein:vir:4339 Length: 395 # 97.4 5.9E-06 3.6E-09 49.3 11.0 310 1-346 54-395 (395) 33 protein:vir:97148 Length: 324 97.4 4.4E-05 2.7E-08 44.5 15.6 299 1-338 1-324 (324) 34 protein:vir:8187 Length: 311 # 97.4 1.1E-05 6.9E-09 47.7 12.1 287 39-349 1-311 (311) 35 protein:vir:2430 Length: 318 # 97.3 2.5E-05 1.5E-08 45.8 13.2 293 15-349 1-318 (318) 36 protein:vir:94142 Length: 304 97.3 1.4E-05 9E-09 47.1 11.7 297 27-345 1-304 (304) 37 protein:vir:105905 Length: 304 97.3 1.4E-05 9E-09 47.1 11.7 297 27-345 1-304 (304) 38 protein:vir:4953 Length: 397 # 97.3 0.0001 6.5E-08 42.4 18.0 316 1-413 60-397 (397) 39 protein:vir:97053 Length: 390 97.2 6.8E-05 4.2E-08 43.4 14.8 296 1-344 54-390 (390) 40 protein:vir:98487 Length: 681 97.2 2.7E-06 1.7E-09 51.1 7.1 291 129-479 1-319 (681) 41 protein:vir:107423 Length: 681 97.2 2.7E-06 1.7E-09 51.1 7.1 291 129-479 1-319 (681) 42 protein:vir:107802 Length: 681 97.2 2.7E-06 1.7E-09 51.1 7.1 291 129-479 1-319 (681) 43 protein:vir:9574 Length: 300 # 97.1 2.5E-05 1.5E-08 45.8 11.5 280 38-346 1-300 (300) 44 protein:vir:10364 Length: 390 97.1 0.00013 8.1E-08 41.9 15.4 309 1-344 54-390 (390) 45 protein:vir:4830 Length: 397 # 97.0 5.1E-05 3.2E-08 44.1 12.3 306 1-352 72-397 (397) 46 protein:vir:78523 Length: 338 97.0 9.2E-05 5.7E-08 42.7 13.7 296 17-337 1-338 (338) 47 protein:vir:1638 Length: 298 # 96.9 0.00026 1.6E-07 40.3 16.4 293 41-402 1-298 (298) 48 protein:vir:80684 Length: 315 96.9 3.4E-05 2.1E-08 45.1 10.4 289 31-347 1-315 (315) 49 protein:vir:4226 Length: 326 # 96.9 5.5E-05 3.4E-08 43.9 11.2 288 1-333 1-326 (326) 50 protein:vir:81070 Length: 390 96.9 0.0001 6.3E-08 42.5 12.7 309 1-344 54-390 (390) 51 protein:vir:41 Length: 299 # N 96.8 0.00013 8E-08 41.9 12.8 283 33-347 1-299 (299) 52 protein:vir:3991 Length: 404 # 96.8 0.00019 1.2E-07 41.0 13.5 306 1-316 56-404 (404) 53 protein:vir:104085 Length: 320 96.7 0.00015 9.3E-08 41.6 12.6 278 31-337 1-320 (320) 54 protein:vir:95318 Length: 328 96.7 1.6E-05 1E-08 46.9 7.2 275 1-325 1-328 (328) 55 protein:vir:7324 Length: 335 # 96.4 4.2E-05 2.6E-08 44.6 7.6 292 1-343 1-335 (335) 56 protein:vir:78223 Length: 333 96.4 0.00042 2.6E-07 39.1 13.1 303 20-344 1-333 (333) 57 protein:vir:95376 Length: 425 96.3 0.00037 2.3E-07 39.4 12.4 304 1-348 73-425 (425) 58 protein:vir:95763 Length: 297 96.3 0.00076 4.7E-07 37.7 16.5 287 15-347 1-297 (297) 59 protein:vir:94771 Length: 298 96.3 0.00081 5E-07 37.5 16.8 294 41-402 1-298 (298) 60 protein:vir:4856 Length: 293 # 96.2 0.00035 2.2E-07 39.5 12.0 274 27-346 1-293 (293) 61 protein:vir:4997 Length: 397 # 96.2 0.00076 4.7E-07 37.7 13.7 303 1-345 71-397 (397) 62 protein:vir:1025 Length: 408 # 96.1 0.00038 2.4E-07 39.3 11.6 288 1-350 56-408 (408) 63 protein:vir:105038 Length: 428 96.1 0.00043 2.7E-07 39.0 11.9 316 1-346 82-428 (428) 64 protein:vir:3845 Length: 395 # 96.1 0.00015 9.2E-08 41.6 9.3 296 1-311 74-395 (395) 65 protein:vir:99920 Length: 311 96.1 0.00039 2.4E-07 39.3 11.6 293 31-346 1-311 (311) 66 protein:vir:9410 Length: 415 # 96.1 0.00073 4.5E-07 37.8 12.9 300 1-354 68-415 (415) 67 protein:vir:103370 Length: 418 96.0 0.00024 1.5E-07 40.4 10.1 326 1-363 10-418 (418) 68 protein:vir:81100 Length: 415 96.0 0.0011 6.9E-07 36.8 14.5 301 1-354 68-415 (415) 69 protein:vir:98339 Length: 415 96.0 0.0011 6.9E-07 36.8 14.5 301 1-354 68-415 (415) 70 protein:vir:79987 Length: 415 96.0 0.0011 6.9E-07 36.8 14.5 301 1-354 68-415 (415) 71 protein:vir:98525 Length: 331 96.0 3.4E-05 2.1E-08 45.1 5.4 304 1-368 1-331 (331) 72 protein:vir:107388 Length: 331 96.0 3.4E-05 2.1E-08 45.1 5.4 304 1-368 1-331 (331) 73 protein:vir:107826 Length: 331 96.0 3.4E-05 2.1E-08 45.1 5.4 304 1-368 1-331 (331) 74 protein:vir:94673 Length: 419 96.0 0.0011 6.9E-07 36.8 13.4 312 1-344 71-419 (419) 75 protein:vir:7409 Length: 408 # 95.9 0.00094 5.8E-07 37.2 12.9 304 1-319 56-408 (408) 76 protein:vir:9759 Length: 303 # 95.9 0.0013 7.8E-07 36.5 17.0 301 39-405 1-303 (303) 77 protein:vir:1433 Length: 435 # 95.9 0.0013 8.3E-07 36.3 14.4 320 1-348 45-435 (435) 78 protein:vir:103759 Length: 330 95.8 0.00014 8.4E-08 41.8 7.8 283 1-341 1-330 (330) 79 protein:vir:81160 Length: 371 95.7 0.00085 5.3E-07 37.4 11.9 299 1-346 62-371 (371) 80 protein:vir:1328 Length: 392 # 95.7 0.00051 3.2E-07 38.6 10.7 308 1-328 44-392 (392) 81 protein:vir:80376 Length: 435 95.7 0.0015 9.3E-07 36.1 13.2 317 1-348 55-435 (435) 82 protein:vir:100247 Length: 425 95.7 0.00052 3.2E-07 38.6 10.6 304 1-347 88-425 (425) 83 protein:vir:102119 Length: 404 95.6 0.0017 1.1E-06 35.7 13.9 327 1-394 44-404 (404) 84 protein:vir:105004 Length: 392 95.4 0.0007 4.4E-07 37.9 10.3 298 1-310 35-392 (392) 85 protein:vir:102082 Length: 392 95.4 0.0007 4.4E-07 37.9 10.3 298 1-310 35-392 (392) 86 protein:vir:102873 Length: 392 95.4 0.0007 4.4E-07 37.9 10.3 298 1-310 35-392 (392) 87 protein:vir:107593 Length: 392 95.4 0.0007 4.4E-07 37.9 10.3 298 1-310 35-392 (392) 88 protein:vir:485 Length: 407 # 95.4 0.0017 1E-06 35.8 12.3 313 1-334 53-407 (407) 89 protein:vir:2504 Length: 305 # 95.4 0.0022 1.3E-06 35.2 14.2 291 37-365 1-305 (305) 90 protein:vir:4456 Length: 401 # 95.1 0.0017 1.1E-06 35.8 11.6 299 1-327 74-401 (401) 91 protein:vir:3158 Length: 321 # 94.9 0.0032 2E-06 34.2 14.7 301 19-364 1-321 (321) 92 protein:vir:5739 Length: 366 # 94.5 0.0044 2.7E-06 33.5 13.0 314 1-346 21-366 (366) 93 protein:vir:4600 Length: 415 # 94.1 0.0053 3.3E-06 33.1 14.4 300 1-354 58-415 (415) 94 protein:vir:4700 Length: 415 # 94.1 0.0053 3.3E-06 33.1 14.4 300 1-354 58-415 (415) 95 protein:vir:81227 Length: 413 94.1 0.0053 3.3E-06 33.1 12.5 310 1-346 58-413 (413) 96 protein:vir:104256 Length: 458 93.9 0.0049 3.1E-06 33.2 11.3 309 1-346 95-458 (458) 97 protein:vir:1268 Length: 397 # 93.5 0.0075 4.7E-06 32.2 16.6 293 1-357 70-397 (397) 98 protein:vir:4092 Length: 390 # 92.7 0.01 6.4E-06 31.5 12.3 319 1-350 47-390 (390) 99 protein:vir:1084 Length: 437 # 92.6 0.0033 2.1E-06 34.2 8.3 306 1-345 100-437 (437) 100 protein:vir:2344 Length: 397 # 91.8 0.014 8.7E-06 30.8 16.3 329 15-414 1-397 (397) 101 protein:vir:6242 Length: 390 # 91.8 0.014 8.8E-06 30.7 11.9 305 1-328 71-390 (390) 102 protein:vir:4511 Length: 409 # 91.5 0.016 9.8E-06 30.5 13.6 316 1-349 57-409 (409) 103 protein:vir:9643 Length: 377 # 91.0 0.012 7.1E-06 31.2 9.5 292 1-330 42-377 (377) 104 protein:vir:3870 Length: 400 # 91.0 0.018 1.1E-05 30.1 13.6 293 1-388 82-400 (400) 105 protein:vir:9704 Length: 394 # 90.9 0.014 8.8E-06 30.7 9.9 290 1-336 78-394 (394) 106 protein:vir:98635 Length: 377 89.8 0.018 1.1E-05 30.2 9.5 306 1-359 47-377 (377) 107 protein:vir:101650 Length: 497 88.3 0.033 2.1E-05 28.7 12.6 286 1-348 85-497 (497) 108 protein:vir:7855 Length: 497 # 88.3 0.033 2.1E-05 28.7 12.6 286 1-348 85-497 (497) 109 protein:vir:4197 Length: 314 # 86.9 0.042 2.6E-05 28.1 16.9 294 1-377 1-314 (314) 110 protein:vir:100884 Length: 389 86.1 0.048 3E-05 27.8 16.5 308 1-395 71-389 (389) 111 protein:vir:96762 Length: 632 85.7 0.051 3.1E-05 27.7 16.3 298 1-374 288-632 (632) 112 protein:vir:78350 Length: 383 85.7 0.03 1.9E-05 28.9 8.1 297 1-337 43-383 (383) 113 protein:vir:96442 Length: 418 84.5 0.05 3.1E-05 27.7 8.7 327 1-406 40-418 (418) 114 protein:vir:93616 Length: 645 83.6 0.067 4.2E-05 27.0 15.0 327 1-349 280-645 (645) 115 protein:vir:80128 Length: 466 83.1 0.071 4.4E-05 26.9 11.0 324 1-366 91-466 (466) 116 protein:vir:6212 Length: 434 # 81.2 0.088 5.5E-05 26.4 13.8 316 1-360 82-434 (434) 117 protein:vir:9820 Length: 272 # 80.0 0.099 6.2E-05 26.1 11.8 256 31-349 1-272 (272) 118 protein:vir:3033 Length: 272 # 80.0 0.099 6.2E-05 26.1 11.8 256 31-349 1-272 (272) 119 protein:vir:9509 Length: 381 # 79.5 0.074 4.6E-05 26.8 7.7 308 1-354 20-381 (381) 120 protein:vir:101291 Length: 381 79.5 0.074 4.6E-05 26.8 7.7 308 1-354 20-381 (381) 121 protein:vir:101607 Length: 379 78.2 0.12 7.2E-05 25.7 10.4 293 1-344 39-379 (379) 122 protein:vir:8420 Length: 477 # 71.9 0.19 0.00012 24.5 12.2 312 1-338 90-477 (477) 123 protein:vir:4159 Length: 315 # 66.3 0.27 0.00017 23.7 14.8 291 12-343 1-315 (315) 124 protein:vir:95963 Length: 395 61.3 0.36 0.00022 23.0 9.6 315 1-353 38-395 (395) 125 protein:vir:5255 Length: 304 # 60.3 0.18 0.00011 24.6 5.4 267 42-329 1-304 (304) 126 protein:vir:103285 Length: 296 52.0 0.57 0.00035 21.9 8.1 274 37-357 1-296 (296) 127 protein:vir:93881 Length: 387 44.0 0.82 0.00051 21.0 9.9 302 1-336 60-387 (387) 128 protein:vir:9361 Length: 402 # 43.5 0.84 0.00052 21.0 10.3 298 1-336 68-402 (402) 129 protein:vir:100632 Length: 381 42.5 0.88 0.00055 20.9 6.7 307 1-366 20-381 (381) 130 protein:vir:97397 Length: 517 41.7 0.92 0.00057 20.8 13.8 287 1-310 168-517 (517) 131 protein:vir:107687 Length: 319 40.9 0.95 0.00059 20.7 9.1 272 1-304 1-319 (319) 132 protein:vir:962 Length: 397 # 38.3 1.1 0.00067 20.4 15.2 287 1-357 91-397 (397) 133 protein:vir:100172 Length: 394 34.2 1.3 0.00081 19.9 16.1 315 1-401 67-394 (394) 134 protein:vir:104342 Length: 314 31.2 1 0.00062 20.6 4.5 296 13-357 1-314 (314) 135 protein:vir:93742 Length: 274 20.6 2.7 0.0017 18.2 12.7 261 31-361 1-274 (274) No 1 >protein:vir:95603 Length: 463 # NCBI annotation: ORF016 # Family: family:all:2450 # MgeID: mge:1577 # MgeName: G1 # Cross-refs: genbank:acc:YP_240903;genbank:gi:66394965;genbank:GeneID:5132544 Probab=100.00 E-value=4.4e-199 Score=1108.07 Aligned_cols=463 Identities=58% Similarity=0.927 Sum_probs=447.9 Q ss_pred CCccchhhhhhhhcCCccchHHHHHHHHHhhhcCCCcChhhccCccccchhhhhhhhhhheeccccccchhhccccchhH Q lcl|NC_018856. 1 MTELKKEAEAKNKKLPVEAEAELAELVSKSFTTGYGITPDTQLDGAAVRRELLEDQVKMLAFSSNDFTIYPLINKQQVNS 80 (479) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~e~~~Ks~tag~~~~p~~~~~gaalr~esld~~~~~l~~~~~~f~f~~~i~k~~~~s 80 (479) ||.++|+++++.+ .++++.|++.|||+|||+|+|++|+||+||||||||++|++|+|+++||+|||+|+|||++| T Consensus 1 ~~~~~~~~~~~~~-----~~~~~~e~~~KS~~tg~g~~p~~q~~~~AlR~EsL~~~i~~Lt~~~~~f~~~~~i~k~~a~S 75 (463) T protein:vir:95 1 MTIEKNLSDVQQK-----YADQFQEDVVKSFQTGYGITPDTQIDAGALRREILDDQITMLTWTNEDLIFYRDISRRPAQS 75 (463) T ss_pred CCcccccchHHHH-----HHhhhhHHHHHHhhcCCccCCccccCcchhhhhhhhhhhheeeecccchhhhhhcCCchhhh Confidence 9999999998876 46788999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHhhhhhhccCcccccccccccccccccCcceEEEEEEEEeeeehhhhhhhHhhhcchhhHHHHHHHHHHHHHHHHHHH Q lcl|NC_018856. 81 TVAKYAVFNQHGRTGHSRFVREVGVASINDPNIRQKTVQMKFLSDTKQQSLAAGLVNNIADPMTILTEDAIAVIAKSIEW 160 (479) Q Consensus 81 tv~eY~~~~~~G~~g~~~fv~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lvn~~~Dp~~~~~~~ai~~~~~~iE~ 160 (479) ||+||+++++||++||++|++|+|+++++||+|+||+++||||+++++||+++++||+++||+++|+++||+++||+||| T Consensus 76 TV~~y~~~~~~G~~g~~~f~~E~g~~~~~d~~~~Rr~~~~K~l~~~~~VS~~~~l~n~~~d~~~~~~~dai~~ia~tiE~ 155 (463) T protein:vir:95 76 TVVKYDQYLRHGNVGHSRFVKEIGVAPVSDPNIRQKTVSMKYVSDTKNMSIASGLVNNIADPSQILTEDAIAVVAKTIEW 155 (463) T ss_pred hhhhheeeeccCccccccccccccccccCCCceEEEEEEeeeeehhhhhhhHHHhhcccccHHHHHHHHHHHHHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHhhcccccCCCCCCcccchhhhHHHhhccCCCEEEccCCCCCHHHHhhhhhhhhhccCceEEEecChHHhhhHHHHhhC Q lcl|NC_018856. 161 AIFYGDAALSSEADGQAGIEFDGLHKLIDQDTNVIDLKGARLDEATLNKAAVIVGKGYGRATDAFMPIGVQADFTNNLLD 240 (479) Q Consensus 161 a~f~Gd~~l~~~~~~~~gleFDGl~~~I~~~~NviDarG~~l~~~~l~~aa~~i~~~fG~~td~~mp~~vka~f~~~~~~ 240 (479) +|||||++|+|..++ +|||||||.++|+++ |||||||++||+++||+|+++|+++||+|||+|||+++|++|+|++++ T Consensus 156 a~FyGds~l~~~~~~-~gleFDGl~~lId~e-nviDarG~~Ls~~~ln~Aa~~i~~~fGt~TD~~lp~~vka~f~~~~l~ 233 (463) T protein:vir:95 156 ASFYGDASLTSEVEG-EGLEFDGLAKLIDKN-NVINAKGNQLTEKHLNEAAVRIGKGFGTATDAYMPIGVHADFVNSILG 233 (463) T ss_pred HHhhhhhccCCCcCc-cccchhhhhhhcCCC-CeeecCCCcccHHHHhhhhhhhhcccCChhheecchHHHHHHHHHhcC Confidence 999999999997555 599999999999985 999999999999999999999999999999999999999999999999 Q ss_pred cceeeeccCCCcceeeeehhhhcCCCcceecccceecCCCceecccCCcCCCCCCCceeEeeeeccCCCCCCCcccccce Q lcl|NC_018856. 241 RQRVIQPSTAGGFSTGFSINQFLSTRGAINLHGSTIMENDNILLEGRNPEPNAPQAPASVVASIVDDKKGGFRDEDIKTH 320 (479) Q Consensus 241 ~qrv~~~~n~g~~~~G~~I~~~~s~~G~I~l~~s~~m~~~~~L~e~~~~~~~AP~~pa~v~at~~t~~~G~f~~~d~gty 320 (479) +|||++++|.|++.+|++|++|+|++|+|+||||+||++++++++.+...|+||.+|.++.+-.++++++.+.+++.+.| T Consensus 234 ~qrv~~~~N~~~~~~G~~v~~f~s~~G~I~L~~s~~m~~~~il~~~~~~~p~ap~~~~~tatv~~~~~~~~~~~~~~a~~ 313 (463) T protein:vir:95 234 RQMQLMQDNSGNVNTGYSVNGFYSSRGFIKLHGSTVMENELILDESLQPLPNAPQPAKVTATVETKQKGAFENEEDRAGL 313 (463) T ss_pred ceEEEEcCCCCceeeeeeccceeeeeeeeeeCCceecCCcccccchhhcCCCCccCceeEEEEeeccCCCCCCcccccce Confidence 99999999999999999999999999999999999999999999999999999996665554445566666679999999 Q ss_pred EEEEEEEcCcCccccccceeeeeecCCceEEEEEEecCCCCcccceEEEEEecCCCcceEEEEeeeeeeecCCceEEEee Q lcl|NC_018856. 321 SYKVVVHSDDAESLPSEAVTAAVAKKDNTVKLEVKLASLYQAQPQFISVYREGTETGHYFLIARVPVSKVNDQGVIEVLD 400 (479) Q Consensus 321 ~YkVtavn~~GES~pS~~vt~Tv~~~g~sv~ltIT~~~~~~a~~~~y~IYR~~~~~G~y~li~rv~vs~~n~~g~T~ftD 400 (479) +|+|++||++|||+||+++++|+++.+++++|+||++++++.+|+|++|||+++++|+|++|+|||++++|+||+|+|+| T Consensus 314 ~Y~vv~~s~~geS~pS~ivtaT~a~~~~gv~l~It~~a~~~~~~~~v~IYR~~~~~g~~~~i~rv~v~~an~~gttt~~D 393 (463) T protein:vir:95 314 SYKVVVNSDDAQSAPSEEVTATVSNVDDGVKLSINVNAMYQQQPQFVSIYRQGKETGMYFLIKRVPVKDAQEDGTIVFVD 393 (463) T ss_pred EEEEEEECCCCCcccchheeeeeeeccceEEEEEEecCCcccceeEEEEEeecCCCCcceeEEEEEecccCCCceEEEee Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccccCCCcccceecCCchhhhhhhhhcchhhccccccCcchhhhhhhhhhhheecccccEEEEeccccCc Q lcl|NC_018856. 401 RNQVIPETTDVFVGELTPNVVSLLELLPMMKLPLAQMNATTTFTVLWYGALALYAPKKWVRIKNVQYIPA 470 (479) Q Consensus 401 ~N~~iPgT~~~fvGe~~p~vi~l~ellPm~k~Pla~~~~~~~~~V~~ygaL~l~aPkk~~~ikNV~~~~~ 470 (479) +|+|||||+++|||||||+||+|+|||||||||||+.|++++|||+|||+|+|+|||||+|||||+|+|- T Consensus 394 ~n~~IPgt~~vfVgems~~ti~~~ellPm~klpLA~~~~~~~waVl~YGaLal~~Pk~~~~ikNv~~~~v 463 (463) T protein:vir:95 394 KNETLPETADVFVGEMSPQVVHLFELLPMMKLPLAQINASITFAVLWYGALALRAPKKWARIKNVRYIAV 463 (463) T ss_pred cccccCCceeEeeeccCchhhhhHhhhHhhhCCchhccchhhhHHHHhhHHHhhccccceEEEEeeEecC Confidence 9999999999999999999999999999999999999999999999999999999999999999999998 No 2 >protein:vir:99311 Length: 463 # NCBI annotation: putative capsid protein # Family: family:all:2450 # MgeID: mge:1655 # MgeName: K # Cross-refs: genbank:acc:YP_024474;genbank:gi:48696433;genbank:GeneID:2948039 Probab=100.00 E-value=4.4e-199 Score=1108.07 Aligned_cols=463 Identities=58% Similarity=0.927 Sum_probs=447.9 Q ss_pred CCccchhhhhhhhcCCccchHHHHHHHHHhhhcCCCcChhhccCccccchhhhhhhhhhheeccccccchhhccccchhH Q lcl|NC_018856. 1 MTELKKEAEAKNKKLPVEAEAELAELVSKSFTTGYGITPDTQLDGAAVRRELLEDQVKMLAFSSNDFTIYPLINKQQVNS 80 (479) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~e~~~Ks~tag~~~~p~~~~~gaalr~esld~~~~~l~~~~~~f~f~~~i~k~~~~s 80 (479) ||.++|+++++.+ .++++.|++.|||+|||+|+|++|+||+||||||||++|++|+|+++||+|||+|+|||++| T Consensus 1 ~~~~~~~~~~~~~-----~~~~~~e~~~KS~~tg~g~~p~~q~~~~AlR~EsL~~~i~~Lt~~~~~f~~~~~i~k~~a~S 75 (463) T protein:vir:99 1 MTIEKNLSDVQQK-----YADQFQEDVVKSFQTGYGITPDTQIDAGALRREILDDQITMLTWTNEDLIFYRDISRRPAQS 75 (463) T ss_pred CCcccccchHHHH-----HHhhhhHHHHHHhhcCCccCCccccCcchhhhhhhhhhhheeeecccchhhhhhcCCchhhh Confidence 9999999998876 46788999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHhhhhhhccCcccccccccccccccccCcceEEEEEEEEeeeehhhhhhhHhhhcchhhHHHHHHHHHHHHHHHHHHH Q lcl|NC_018856. 81 TVAKYAVFNQHGRTGHSRFVREVGVASINDPNIRQKTVQMKFLSDTKQQSLAAGLVNNIADPMTILTEDAIAVIAKSIEW 160 (479) Q Consensus 81 tv~eY~~~~~~G~~g~~~fv~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lvn~~~Dp~~~~~~~ai~~~~~~iE~ 160 (479) ||+||+++++||++||++|++|+|+++++||+|+||+++||||+++++||+++++||+++||+++|+++||+++||+||| T Consensus 76 TV~~y~~~~~~G~~g~~~f~~E~g~~~~~d~~~~Rr~~~~K~l~~~~~VS~~~~l~n~~~d~~~~~~~dai~~ia~tiE~ 155 (463) T protein:vir:99 76 TVVKYDQYLRHGNVGHSRFVKEIGVAPVSDPNIRQKTVSMKYVSDTKNMSIASGLVNNIADPSQILTEDAIAVVAKTIEW 155 (463) T ss_pred hhhhheeeeccCccccccccccccccccCCCceEEEEEEeeeeehhhhhhhHHHhhcccccHHHHHHHHHHHHHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHhhcccccCCCCCCcccchhhhHHHhhccCCCEEEccCCCCCHHHHhhhhhhhhhccCceEEEecChHHhhhHHHHhhC Q lcl|NC_018856. 161 AIFYGDAALSSEADGQAGIEFDGLHKLIDQDTNVIDLKGARLDEATLNKAAVIVGKGYGRATDAFMPIGVQADFTNNLLD 240 (479) Q Consensus 161 a~f~Gd~~l~~~~~~~~gleFDGl~~~I~~~~NviDarG~~l~~~~l~~aa~~i~~~fG~~td~~mp~~vka~f~~~~~~ 240 (479) +|||||++|+|..++ +|||||||.++|+++ |||||||++||+++||+|+++|+++||+|||+|||+++|++|+|++++ T Consensus 156 a~FyGds~l~~~~~~-~gleFDGl~~lId~e-nviDarG~~Ls~~~ln~Aa~~i~~~fGt~TD~~lp~~vka~f~~~~l~ 233 (463) T protein:vir:99 156 ASFYGDASLTSEVEG-EGLEFDGLAKLIDKN-NVINAKGNQLTEKHLNEAAVRIGKGFGTATDAYMPIGVHADFVNSILG 233 (463) T ss_pred HHhhhhhccCCCcCc-cccchhhhhhhcCCC-CeeecCCCcccHHHHhhhhhhhhcccCChhheecchHHHHHHHHHhcC Confidence 999999999997555 599999999999985 999999999999999999999999999999999999999999999999 Q ss_pred cceeeeccCCCcceeeeehhhhcCCCcceecccceecCCCceecccCCcCCCCCCCceeEeeeeccCCCCCCCcccccce Q lcl|NC_018856. 241 RQRVIQPSTAGGFSTGFSINQFLSTRGAINLHGSTIMENDNILLEGRNPEPNAPQAPASVVASIVDDKKGGFRDEDIKTH 320 (479) Q Consensus 241 ~qrv~~~~n~g~~~~G~~I~~~~s~~G~I~l~~s~~m~~~~~L~e~~~~~~~AP~~pa~v~at~~t~~~G~f~~~d~gty 320 (479) +|||++++|.|++.+|++|++|+|++|+|+||||+||++++++++.+...|+||.+|.++.+-.++++++.+.+++.+.| T Consensus 234 ~qrv~~~~N~~~~~~G~~v~~f~s~~G~I~L~~s~~m~~~~il~~~~~~~p~ap~~~~~tatv~~~~~~~~~~~~~~a~~ 313 (463) T protein:vir:99 234 RQMQLMQDNSGNVNTGYSVNGFYSSRGFIKLHGSTVMENELILDESLQPLPNAPQPAKVTATVETKQKGAFENEEDRAGL 313 (463) T ss_pred ceEEEEcCCCCceeeeeeccceeeeeeeeeeCCceecCCcccccchhhcCCCCccCceeEEEEeeccCCCCCCcccccce Confidence 99999999999999999999999999999999999999999999999999999996665554445566666679999999 Q ss_pred EEEEEEEcCcCccccccceeeeeecCCceEEEEEEecCCCCcccceEEEEEecCCCcceEEEEeeeeeeecCCceEEEee Q lcl|NC_018856. 321 SYKVVVHSDDAESLPSEAVTAAVAKKDNTVKLEVKLASLYQAQPQFISVYREGTETGHYFLIARVPVSKVNDQGVIEVLD 400 (479) Q Consensus 321 ~YkVtavn~~GES~pS~~vt~Tv~~~g~sv~ltIT~~~~~~a~~~~y~IYR~~~~~G~y~li~rv~vs~~n~~g~T~ftD 400 (479) +|+|++||++|||+||+++++|+++.+++++|+||++++++.+|+|++|||+++++|+|++|+|||++++|+||+|+|+| T Consensus 314 ~Y~vv~~s~~geS~pS~ivtaT~a~~~~gv~l~It~~a~~~~~~~~v~IYR~~~~~g~~~~i~rv~v~~an~~gttt~~D 393 (463) T protein:vir:99 314 SYKVVVNSDDAQSAPSEEVTATVSNVDDGVKLSINVNAMYQQQPQFVSIYRQGKETGMYFLIKRVPVKDAQEDGTIVFVD 393 (463) T ss_pred EEEEEEECCCCCcccchheeeeeeeccceEEEEEEecCCcccceeEEEEEeecCCCCcceeEEEEEecccCCCceEEEee Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccccCCCcccceecCCchhhhhhhhhcchhhccccccCcchhhhhhhhhhhheecccccEEEEeccccCc Q lcl|NC_018856. 401 RNQVIPETTDVFVGELTPNVVSLLELLPMMKLPLAQMNATTTFTVLWYGALALYAPKKWVRIKNVQYIPA 470 (479) Q Consensus 401 ~N~~iPgT~~~fvGe~~p~vi~l~ellPm~k~Pla~~~~~~~~~V~~ygaL~l~aPkk~~~ikNV~~~~~ 470 (479) +|+|||||+++|||||||+||+|+|||||||||||+.|++++|||+|||+|+|+|||||+|||||+|+|- T Consensus 394 ~n~~IPgt~~vfVgems~~ti~~~ellPm~klpLA~~~~~~~waVl~YGaLal~~Pk~~~~ikNv~~~~v 463 (463) T protein:vir:99 394 KNETLPETADVFVGEMSPQVVHLFELLPMMKLPLAQINASITFAVLWYGALALRAPKKWARIKNVRYIAV 463 (463) T ss_pred cccccCCceeEeeeccCchhhhhHhhhHhhhCCchhccchhhhHHHHhhHHHhhccccceEEEEeeEecC Confidence 9999999999999999999999999999999999999999999999999999999999999999999998 No 3 >protein:vir:96666 Length: 462 # NCBI annotation: ORF016 # Family: family:all:2450 # MgeID: mge:1623 # MgeName: Twort # Cross-refs: genbank:acc:YP_238545;genbank:gi:66391271;genbank:GeneID:5130448 Probab=100.00 E-value=1.7e-198 Score=1104.80 Aligned_cols=461 Identities=58% Similarity=0.932 Sum_probs=445.3 Q ss_pred CCccchhhhhhhhcCCccchHHHHHHHHHhhhcCCCcChhhccCccccchhhhhhhhhhheeccccccchhhccccchhH Q lcl|NC_018856. 1 MTELKKEAEAKNKKLPVEAEAELAELVSKSFTTGYGITPDTQLDGAAVRRELLEDQVKMLAFSSNDFTIYPLINKQQVNS 80 (479) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~e~~~Ks~tag~~~~p~~~~~gaalr~esld~~~~~l~~~~~~f~f~~~i~k~~~~s 80 (479) |+..++.+..+ ++++++. .|+++|||+|||+|+||+|+||+||||||||++|++|+|+++||+|||+|+|||++| T Consensus 1 ~~~~~~~~~~~----~~~~~~~-~e~~~KS~~tg~g~~p~~q~~~gAlR~esL~~~i~~Lt~~~~~~~~~~~i~k~~a~s 75 (462) T protein:vir:96 1 MHKDTNLTAEQ----NKYADKF-QEEVMKSYQTGYGITPDTQVDAGALRREILDDQITMLTWTQDDLIFYREISRRPAQS 75 (462) T ss_pred Cccccccchhh----hhhhchh-hHHHHHHHhcCCCcCCccccccchhhhhhhhhhhheeeecccchhhhhhcCCchhhh Confidence 88777665443 3444555 488999999999999999999999999999999999999999999999999999999 Q ss_pred HHHhhhhhhccCcccccccccccccccccCcceEEEEEEEEeeeehhhhhhhHhhhcchhhHHHHHHHHHHHHHHHHHHH Q lcl|NC_018856. 81 TVAKYAVFNQHGRTGHSRFVREVGVASINDPNIRQKTVQMKFLSDTKQQSLAAGLVNNIADPMTILTEDAIAVIAKSIEW 160 (479) Q Consensus 81 tv~eY~~~~~~G~~g~~~fv~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lvn~~~Dp~~~~~~~ai~~~~~~iE~ 160 (479) ||+||+++++||++||++|++|+|+++++||+|+||+++||||++++++|++++|||+++||+++|++|||+++||+||| T Consensus 76 Tv~~y~~~~~~G~~g~~~f~~E~g~~~~~d~~~~R~~~~~k~l~~t~~vsi~~tl~n~~~d~~~~~~~dai~~~a~tiE~ 155 (462) T protein:vir:96 76 TVQKYDVYLRHGNVGHSRFVREVGVAPVSDPNIRQKTVEMKYVSDTKNLSIASTLVNNIQDPMQILTEDAIAVVAKTIEW 155 (462) T ss_pred hhhhheeeeccCccccccccccccccccCCCceEEEEEEEEEEeeeeeechhhhhccchhhHHHHHHHHHHHHHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHhhcccccCCCCCCcccchhhhHHHhhccCCCEEEccCCCCCHHHHhhhhhhhhhccCceEEEecChHHhhhHHHHhhC Q lcl|NC_018856. 161 AIFYGDAALSSEADGQAGIEFDGLHKLIDQDTNVIDLKGARLDEATLNKAAVIVGKGYGRATDAFMPIGVQADFTNNLLD 240 (479) Q Consensus 161 a~f~Gd~~l~~~~~~~~gleFDGl~~~I~~~~NviDarG~~l~~~~l~~aa~~i~~~fG~~td~~mp~~vka~f~~~~~~ 240 (479) +|||||++|+|+..++ |||||||.++|+++ |||||||++||+++||+||++|+++||+|||+|||+++|++|+|++++ T Consensus 156 a~Fygds~l~~~~~~~-gleFDGl~~lI~~~-NViDarG~~Ls~~~ln~aa~~i~~~fGt~TD~~~p~~v~a~f~~~~l~ 233 (462) T protein:vir:96 156 ASFYGDASLTADPTGQ-GLEFDGLAKLIDKD-NVIDAKGESLTETLLNRSAVLIGKSFGTATDAYMPIGVHADFVNSVLG 233 (462) T ss_pred HHhhhhcccCCCcccc-ccchhhhhhhcCCC-ceeecCCCCccHHHHhhhhhhcccccCChhheecchHHHHHHHHhhcC Confidence 9999999999998887 99999999999984 999999999999999999999999999999999999999999999999 Q ss_pred cceeeeccCCCcceeeeehhhhcCCCcceecccceecCCCceecccCCcCCCCCCCceeEeeeeccCCCCCCCcc-cccc Q lcl|NC_018856. 241 RQRVIQPSTAGGFSTGFSINQFLSTRGAINLHGSTIMENDNILLEGRNPEPNAPQAPASVVASIVDDKKGGFRDE-DIKT 319 (479) Q Consensus 241 ~qrv~~~~n~g~~~~G~~I~~~~s~~G~I~l~~s~~m~~~~~L~e~~~~~~~AP~~pa~v~at~~t~~~G~f~~~-d~gt 319 (479) +|||++++|.|++.+|++|++|+|++|+|+||||+||++++++++.+...|++|++| .+++|+.++.+|+|.++ |.++ T Consensus 234 ~qrv~~~~n~g~~~~G~~v~~f~s~~G~I~L~~s~~m~~~~i~~~~~~~~p~ap~~~-~vsaTv~t~~~g~f~~~~d~~~ 312 (462) T protein:vir:96 234 RQMQLMQDNSGNVNAGYNVQGFYSSRGFIKLHGSTVMENELILDESLQPLPNAPQPA-TVKATVETGKKGLFTDEHDRAE 312 (462) T ss_pred ceEEEEcCCCCceeeeeeccceeeeeeeeeeCCceecCcccccccccccCCCCCCCC-ceeEEEEeCCCCCCCCccCcee Confidence 999999999999999999999999999999999999999999999999999988855 77788889999999887 7999 Q ss_pred eEEEEEEEcCcCccccccceeeeeecCCceEEEEEEecCCCCcccceEEEEEecCCCcceEEEEeeeeeeecCCceEEEe Q lcl|NC_018856. 320 HSYKVVVHSDDAESLPSEAVTAAVAKKDNTVKLEVKLASLYQAQPQFISVYREGTETGHYFLIARVPVSKVNDQGVIEVL 399 (479) Q Consensus 320 y~YkVtavn~~GES~pS~~vt~Tv~~~g~sv~ltIT~~~~~~a~~~~y~IYR~~~~~G~y~li~rv~vs~~n~~g~T~ft 399 (479) |+|+|++||++|||+||+++++|+++.++.++|+|+|+++++++|+||+|||+++++|+|++|+|||++.+|++|+++|+ T Consensus 313 y~Y~V~avs~dgeS~PS~~VtaTva~~~~gv~ltIt~~a~~~~~~~~~~IYRk~~~sg~y~li~rv~~~~~n~~gt~tf~ 392 (462) T protein:vir:96 313 LTYKVVVNSDDAQSAPSEAVTATVNNATDGVKLEISVNAMYQQQPQFVSIYRQGRKTGDFYLIKRLGMKEVNDEGKLVFY 392 (462) T ss_pred EEEEEEEECCCCccccceeeEeeeecccccceEEEEEcCCccccceEEEEEeecCCccccceeeeeeceeecCCcceeEe Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eccccCCCcccceecCCchhhhhhhhhcchhhccccccCcchhhhhhhhhhhheecccccEEEEeccccC Q lcl|NC_018856. 400 DRNQVIPETTDVFVGELTPNVVSLLELLPMMKLPLAQMNATTTFTVLWYGALALYAPKKWVRIKNVQYIP 469 (479) Q Consensus 400 D~N~~iPgT~~~fvGe~~p~vi~l~ellPm~k~Pla~~~~~~~~~V~~ygaL~l~aPkk~~~ikNV~~~~ 469 (479) |+|++||||+++|||||+||||+|+|||||||||||+.|++++|||+|||+|+|+|||||+|||||+|+- T Consensus 393 D~n~~iPgt~~~fVge~~p~vi~~~qllpm~~~plA~~n~~~~waVl~yG~Lal~~Pk~~~~ikNv~~~~ 462 (462) T protein:vir:96 393 DLNETIPETTDVFVGEMSPQVLHLFELLPMMKLPLAQINASVTFAVLWYGALALRAPKKWVRIKNVKYIV 462 (462) T ss_pred eccCCCCCcccceeecCCchhhhhhhhhhhhhcCcccccchhhhhhhhhhHHHhhcccccEEEEEEEEeC Confidence 9999999999999999999999999999999999999999999999999999999999999999999998 No 4 >protein:vir:80835 Length: 464 # NCBI annotation: putative major capsid protein # Family: family:all:2450 # MgeID: mge:1885 # MgeName: phiEF24C # Cross-refs: genbank:acc:YP_001504125;genbank:gi:158079312;genbank:GeneID:5666484 Probab=100.00 E-value=6.8e-194 Score=1079.59 Aligned_cols=460 Identities=58% Similarity=0.931 Sum_probs=440.3 Q ss_pred CCccchhhhhhhhcCCccchHHHHHHHHHhhhcCCCcChhhccCccccchhhhhhhhhhheeccccccchhhccccchhH Q lcl|NC_018856. 1 MTELKKEAEAKNKKLPVEAEAELAELVSKSFTTGYGITPDTQLDGAAVRRELLEDQVKMLAFSSNDFTIYPLINKQQVNS 80 (479) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~e~~~Ks~tag~~~~p~~~~~gaalr~esld~~~~~l~~~~~~f~f~~~i~k~~~~s 80 (479) ||+.| ++..++++++|+ +.|||+|||+|+||+|+||+||||||||++|++|+|+++||+|||+|+|||++| T Consensus 1 ~~~~~----n~~~~~~~~~e~-----~~Ks~ttgy~~~p~~q~~~~AlRrEsL~~~i~~Lt~~~~~f~f~~di~k~~a~S 71 (464) T protein:vir:80 1 MTEKK----NTERQLTSVQEE-----VIKGFTTGYGITPESQTDAAALRREFLDDQITMLTWADGDLSFYRDITKRPATS 71 (464) T ss_pred CCcch----hhHhhcCcccHH-----HHHHHHhCCccCcccccCcchhhhhhhhhhhheeeecccchhhhhhcCCchhhh Confidence 87654 466788888764 459999999999999999999999999999999999999999999999999999 Q ss_pred HHHhhhhhhccCcccccccccccccccccCcceEEEEEEEEeeeehhhhhhhHhhhcchhhHHHHHHHHHHHHHHHHHHH Q lcl|NC_018856. 81 TVAKYAVFNQHGRTGHSRFVREVGVASINDPNIRQKTVQMKFLSDTKQQSLAAGLVNNIADPMTILTEDAIAVIAKSIEW 160 (479) Q Consensus 81 tv~eY~~~~~~G~~g~~~fv~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lvn~~~Dp~~~~~~~ai~~~~~~iE~ 160 (479) ||+||+++++||++||++|++|+|+++++||+|+||+++||||+++|++|++++|||+++||+.+|++|||+++||+||| T Consensus 72 TV~~y~~~~~~G~~g~~~f~~E~g~~~~~d~~~~Rr~~~~Kfl~~~r~vsia~~lvn~~~d~~~~~~~dai~~va~tiE~ 151 (464) T protein:vir:80 72 TVAKYDVYLAHGRVGHTRFTREIGVAPISDPNLRQKTVNMKYVSDTKNMSIATGLVNNIEDPMRILTDDAISVVAKTIEW 151 (464) T ss_pred hhhhhheeeccCccccccccccccccccCCCceEEEEEEeeeeecceeeeeehhhhcchhhHHHHHHHHHHHHHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHhhcccccCCCCCCcccchhhhHHHhhccCCCEEEccCCCCCHHHHhhhhhhhhhccCceEEEecChHHhhhHHHHhhC Q lcl|NC_018856. 161 AIFYGDAALSSEADGQAGIEFDGLHKLIDQDTNVIDLKGARLDEATLNKAAVIVGKGYGRATDAFMPIGVQADFTNNLLD 240 (479) Q Consensus 161 a~f~Gd~~l~~~~~~~~gleFDGl~~~I~~~~NviDarG~~l~~~~l~~aa~~i~~~fG~~td~~mp~~vka~f~~~~~~ 240 (479) +|||||++|+|++++++|||||||+++|+++ |||||||++||+++||+|+++|+++||+|||+|||+++|++|.+++++ T Consensus 152 a~FyGds~l~~~~~~~~gleFDGl~~lI~~~-NViDarG~~Ls~~~ln~Aa~~i~~~fGt~TD~~lp~~v~a~f~n~~l~ 230 (464) T protein:vir:80 152 ASFYGDSDLSENPDAGSGLEFDGLAKLIDKH-NVLDAKGASLTEALLNQASVLVGKGYGTPTDAYMPIGVQADFVNQQLD 230 (464) T ss_pred HHhhhccccCCCCCCccccchhhhHhhcCCC-ceeecCCCCcCHHHHhhhhhhhhcccCChhhcccchhHHHHHHhhhcC Confidence 9999999999999999999999999999985 999999999999999999999999999999999999999999999999 Q ss_pred cceeeeccCCCcceeeeehhhhcCCCcceecccceecCCCceecccCCcCCCCCCCceeEeeeeccCCCCCCCcccc-cc Q lcl|NC_018856. 241 RQRVIQPSTAGGFSTGFSINQFLSTRGAINLHGSTIMENDNILLEGRNPEPNAPQAPASVVASIVDDKKGGFRDEDI-KT 319 (479) Q Consensus 241 ~qrv~~~~n~g~~~~G~~I~~~~s~~G~I~l~~s~~m~~~~~L~e~~~~~~~AP~~pa~v~at~~t~~~G~f~~~d~-gt 319 (479) +||+++.+|.+++++|++|++|+|++|+|+||+|++|+++++|++.+...++||++|.. ++|+.++++|+|.+++. +. T Consensus 231 ~q~~~~~~n~~~~~~G~~v~~f~sa~G~i~L~~s~~m~~~~~ld~~~~~~~~apaapsv-t~tv~~~~~g~f~~~~~~~~ 309 (464) T protein:vir:80 231 RQVQVISDNGQNATMGFNVKGFNSARGFIRLHGSTVMELEQILDENRMQLPNAPQKATV-KATLEAGTKGKFRDEDLTID 309 (464) T ss_pred ceeEEEcCCCCcceeeeecccccccccceeccCccccCcccccccccccCCCCcCCcee-EEEecCCcccCCccccccce Confidence 99999999999999999999999999999999999999999999999999999997654 45777778999999995 66 Q ss_pred eEEEEEEEcCcCccccccceeeeeecCCceEEEEEEecCCCCcccceEEEEEecCCCcceEEEEeeeeeeecCCceEEEe Q lcl|NC_018856. 320 HSYKVVVHSDDAESLPSEAVTAAVAKKDNTVKLEVKLASLYQAQPQFISVYREGTETGHYFLIARVPVSKVNDQGVIEVL 399 (479) Q Consensus 320 y~YkVtavn~~GES~pS~~vt~Tv~~~g~sv~ltIT~~~~~~a~~~~y~IYR~~~~~G~y~li~rv~vs~~n~~g~T~ft 399 (479) |+|||++||++|||+||.++++|+++.++.|+|+||+.++++++|+|++|||++.++|+|++|+|||+++.+ +|+++|+ T Consensus 310 ~~Ykv~~vn~~GeS~ps~~~~~ti~~~~~~V~l~it~~~~~~~~p~yv~IYR~~~~~g~f~~i~rv~~~~~~-~gt~t~v 388 (464) T protein:vir:80 310 TEYKVVVVSDDAESAPSDVASVVIDDKKKQVKLEITINNMYQARPQYVAIYRKGLETGLFYQIARVPASKAV-EGVITFI 388 (464) T ss_pred eEEEEEEECCCCccccceeeeeeecCcccEEEEEEEeCCccccccceEEEEeecCCCCceeEEEEEeecccc-CCceEEE Confidence 999999999999999999999999999999999999999999999999999999999999999999999964 7888899 Q ss_pred eccccCCCcccceecCCchhhhhhhhhcchhhccccccCcchhhhhhhhhhhheecccccEEEEeccccCcccceeeecC Q lcl|NC_018856. 400 DRNQVIPETTDVFVGELTPNVVSLLELLPMMKLPLAQMNATTTFTVLWYGALALYAPKKWVRIKNVQYIPALAADVTVKY 479 (479) Q Consensus 400 D~N~~iPgT~~~fvGe~~p~vi~l~ellPm~k~Pla~~~~~~~~~V~~ygaL~l~aPkk~~~ikNV~~~~~~~~~~~~~~ 479 (479) |+|+|||||+++|||||||+||+|+|||||||||||+.|++++|||+|||+|+|+|||||+|||||+|++- +| T Consensus 389 D~n~~IPgt~~vfVgems~~ti~l~ellPm~rlplA~~n~~~~waVl~YGaLal~aPk~~~~ikNv~~~~~-------~~ 461 (464) T protein:vir:80 389 DVNDEIPETADVFVGELTPSVVHLFELLPMMRLPLAQVNASVTFAVLWYGALALRAPKKWARIKNVKYIAT-------GN 461 (464) T ss_pred ecccccCCceeEeeecCCchHHHHHHHHHhhhCCchhcccchhhhhhhhhHHhhhccccceEEEEEEEeec-------cc Confidence 99999999999999999999999999999999999999999999999999999999999999999999862 34 No 5 >protein:vir:63741 Length: 468 # NCBI annotation: Cps # Family: family:all:2450 # MgeID: mge:1517 # MgeName: P100 # Cross-refs: genbank:gi:82547622;genbank:GeneID:3783474 Probab=100.00 E-value=4.6e-193 Score=1075.04 Aligned_cols=468 Identities=59% Similarity=0.918 Sum_probs=451.8 Q ss_pred CCccchhhhhhhhcCCccchHHHHHHHHHhhhcCCCcChhhccCccccchhhhhhhhhhheeccccccchhhccccchhH Q lcl|NC_018856. 1 MTELKKEAEAKNKKLPVEAEAELAELVSKSFTTGYGITPDTQLDGAAVRRELLEDQVKMLAFSSNDFTIYPLINKQQVNS 80 (479) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~e~~~Ks~tag~~~~p~~~~~gaalr~esld~~~~~l~~~~~~f~f~~~i~k~~~~s 80 (479) ||+++||++ +++++++++||.++|||+|||+|+||+|+||+||||||||++|++|+|+++||+|||+|+|++++| T Consensus 1 ~~~~~~~~~-----~~~~~~~~~~e~~~Ks~~agy~~~p~~q~~~~AlR~EsL~~~i~~L~~~~~~f~~~~di~k~~a~s 75 (468) T protein:vir:63 1 MPKNNKEEE-----VKEVNLNSVQEDALKSFTTGYGITPDTQTDAGALRREFLDDQISMLTWTENDLTFYKDIAKKPATS 75 (468) T ss_pred CCCCcchhh-----ccccChhHHHHHHHHHHHcCcccCCccccCcchhhhhhhhhhhheeeecccchhhhhhcccchhhh Confidence 999999987 589999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHhhhhhhccCcccccccccccccccccCcceEEEEEEEEeeeehhhhhhhHhhhcchhhHHHHHHHHHHHHHHHHHHH Q lcl|NC_018856. 81 TVAKYAVFNQHGRTGHSRFVREVGVASINDPNIRQKTVQMKFLSDTKQQSLAAGLVNNIADPMTILTEDAIAVIAKSIEW 160 (479) Q Consensus 81 tv~eY~~~~~~G~~g~~~fv~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lvn~~~Dp~~~~~~~ai~~~~~~iE~ 160 (479) ||+||+++++||++||++|++|+|+++++||+|+||+++||||++++++|+++++||+|+||+++|+++||+++||+||| T Consensus 76 tv~~y~~~~~~G~~g~~~f~~E~g~~~~~~~~~~r~~~~~k~l~~~~~vs~~~~l~n~i~d~~~~~~~~ai~~~a~tiE~ 155 (468) T protein:vir:63 76 TVAKYDVYMQHGKVGHTRFTREIGVAPVSDPNIRQKTVNMKFASDTKNISIAAGLVNNIQDPMQILTDDAIVNIAKTIEW 155 (468) T ss_pred hhhhheeeeccCccccccccccccccccCCCceEEEEEEeeeeeeeeeehhhhhhhcchhhHHHHHHHHHHHHHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHhhcccccCCCCCCcccchhhhHHHhhccCCCEEEccCCCCCHHHHhhhhhhhhhccCceEEEecChHHhhhHHHHhhC Q lcl|NC_018856. 161 AIFYGDAALSSEADGQAGIEFDGLHKLIDQDTNVIDLKGARLDEATLNKAAVIVGKGYGRATDAFMPIGVQADFTNNLLD 240 (479) Q Consensus 161 a~f~Gd~~l~~~~~~~~gleFDGl~~~I~~~~NviDarG~~l~~~~l~~aa~~i~~~fG~~td~~mp~~vka~f~~~~~~ 240 (479) +|||||++|++.+++++|||||||+++|+++ ||||+||++|++++||+|+++|+++||++||+|||+++|++|++.++. T Consensus 156 a~FyGds~l~~s~~~~~glqfDGi~~li~~e-nviDa~G~~ls~~~lneaa~~i~~gfG~~td~~~~~~v~a~~~~~~L~ 234 (468) T protein:vir:63 156 ASFFGDSDLSDSPEPQAGLEFDGLAKLINQD-NVHDARGASLTESLLNQAAVMISKGYGTPTDAYMPVGVQADFVNQQLS 234 (468) T ss_pred HhhhcccccccCCCccccccccceeEEecCC-ceeccCCCccCHHHHHHHhhhccccccChhhhhcchhHHhhhhhhhcC Confidence 9999999999888999999999999999985 999999999999999999999999999999999999999999999999 Q ss_pred cceeeeccCCCcceeeeehhhhcCCCcceecccceecCCCceecccCCcCCCCCCCceeEeeeeccCCCCCCCcccccce Q lcl|NC_018856. 241 RQRVIQPSTAGGFSTGFSINQFLSTRGAINLHGSTIMENDNILLEGRNPEPNAPQAPASVVASIVDDKKGGFRDEDIKTH 320 (479) Q Consensus 241 ~qrv~~~~n~g~~~~G~~I~~~~s~~G~I~l~~s~~m~~~~~L~e~~~~~~~AP~~pa~v~at~~t~~~G~f~~~d~gty 320 (479) .||+++.+|.+.+.+|++|++|++++|+|+||||++|+++++|.+.....++||.++ .++++...+++|++.+++.++| T Consensus 235 ~q~~v~~~n~~~~~~G~~v~g~~sa~G~I~l~gs~il~~~~~l~~~~~~~~~Apsp~-~vsaT~~~~~~g~~~~~~~a~y 313 (468) T protein:vir:63 235 KQTQLVRDNGNNVSVGFNIQGFHSARGFIKLHGSTVMENEQILDERILALPTAPQPA-KVTATQEAGKKGQFRAEDLAAH 313 (468) T ss_pred ceEEEEcCCCCceeeeecccceecceeeeeecCceeeccccCCCcccccccccccCC-ccceeeecccCCcccCCCcceE Confidence 988888889999999999999999999999999999999999999999999998855 6778888899999999999999 Q ss_pred EEEEEEEcCcCccccccceeeeeecCCceEEEEEEecCCCCcccceEEEEEecCCCcceEEEEeeeeeeecCCceEEEee Q lcl|NC_018856. 321 SYKVVVHSDDAESLPSEAVTAAVAKKDNTVKLEVKLASLYQAQPQFISVYREGTETGHYFLIARVPVSKVNDQGVIEVLD 400 (479) Q Consensus 321 ~YkVtavn~~GES~pS~~vt~Tv~~~g~sv~ltIT~~~~~~a~~~~y~IYR~~~~~G~y~li~rv~vs~~n~~g~T~ftD 400 (479) +|||++||++|||.||+++++|++++++.++|+|||+++++++|+||+|||++.++|+|+||+|||++.+ .+|+++|+| T Consensus 314 ~Y~v~~vs~~GES~pS~~vtvTVaa~~dg~~ltIt~~~~~~~~p~yv~IYR~~~gg~~f~li~~va~~~a-~~gt~tf~D 392 (468) T protein:vir:63 314 EYKVVVSSDDAESIASEVATATVTAKDDGVKLEIELAPMYSSRPQFVSIYRKGAETGLFYLIARVPASKA-ENNVITFYD 392 (468) T ss_pred EEEEEEECCCCccccccceEEEecCcccceeEEEEecCCCCCcceEEEEEEeCCCCcceeEeeeEeeeec-CCCeEEEEc Confidence 9999999999999999999999999999999999999999999999999999999999999999999986 589999999 Q ss_pred ccccCCCcccceecCCchhhhhhhhhcchhhccccccCcchhhhhhhhhhhheecccccEEEEeccccCcccceeeec Q lcl|NC_018856. 401 RNQVIPETTDVFVGELTPNVVSLLELLPMMKLPLAQMNATTTFTVLWYGALALYAPKKWVRIKNVQYIPALAADVTVK 478 (479) Q Consensus 401 ~N~~iPgT~~~fvGe~~p~vi~l~ellPm~k~Pla~~~~~~~~~V~~ygaL~l~aPkk~~~ikNV~~~~~~~~~~~~~ 478 (479) +|++||||+++|||||+|+||+|+|||||||||||++|+.++|||+|||+|+|+|||||+|||||+|+|-- |+.-. T Consensus 393 ~n~~iPgT~~~fVgem~~~~i~~~~llpm~~lplA~~n~~~~~~Vl~Ygalal~~Pk~~~~ikNv~~~~~~--~~~~~ 468 (468) T protein:vir:63 393 LNDSIPETVDVFVGEMSANVVHLFELLPMMRLPLAQINASVTFAVLWYGALALRAPKKWVRIRNVKYIPVK--NVHSN 468 (468) T ss_pred CCcccCCCcceeeeecChhHHHHHHHhccccCChhHhccchhhhhhhhhHHhhhccccceEEEEeeeeeec--cccCC Confidence 99999999999999999999999999999999999999999999999999999999999999999999842 22212 No 6 >protein:vir:80491 Length: 467 # NCBI annotation: Cps # Family: family:all:2450 # MgeID: mge:1883 # MgeName: A511 # Cross-refs: genbank:acc:YP_001468466;genbank:gi:157325041;genbank:GeneID:5601449 Probab=100.00 E-value=9.3e-193 Score=1073.37 Aligned_cols=467 Identities=59% Similarity=0.918 Sum_probs=450.7 Q ss_pred CCccchhhhhhhhcCCccchHHHHHHHHHhhhcCCCcChhhccCccccchhhhhhhhhhheeccccccchhhccccchhH Q lcl|NC_018856. 1 MTELKKEAEAKNKKLPVEAEAELAELVSKSFTTGYGITPDTQLDGAAVRRELLEDQVKMLAFSSNDFTIYPLINKQQVNS 80 (479) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~e~~~Ks~tag~~~~p~~~~~gaalr~esld~~~~~l~~~~~~f~f~~~i~k~~~~s 80 (479) ||+++||. +++++|+++||+++|||+|||+|+|++|+||+||||||||++|++|+|+++||+|||+|+|++++| T Consensus 1 ~~~~~~~~------~~~~n~~~~~e~~~Ks~~agy~~~p~tq~~~~AlR~EsL~~~i~~Lt~~~~~f~~~~di~k~~a~s 74 (467) T protein:vir:80 1 MPKNNKEE------VKEVNLNSVQEDALKSFTTGYGITPDTQTDAGALRREFLDDQISMLTWTENDLTFYKDIAKKPATS 74 (467) T ss_pred CCCcchhh------hhhcccccCHHHHHHHHHcccccCCccccCcchhhhhhhhhhhheeeccccchhhhhhcccchhhh Confidence 99999875 789999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHhhhhhhccCcccccccccccccccccCcceEEEEEEEEeeeehhhhhhhHhhhcchhhHHHHHHHHHHHHHHHHHHH Q lcl|NC_018856. 81 TVAKYAVFNQHGRTGHSRFVREVGVASINDPNIRQKTVQMKFLSDTKQQSLAAGLVNNIADPMTILTEDAIAVIAKSIEW 160 (479) Q Consensus 81 tv~eY~~~~~~G~~g~~~fv~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lvn~~~Dp~~~~~~~ai~~~~~~iE~ 160 (479) ||+||+++++||++||++|++|+|+++++||+|+||+++||||++++++|+++++||+|+||+++|+++||+++||+||| T Consensus 75 tv~~y~~~~~~G~~g~~~f~~E~g~~~~~~~~~~r~~~~~k~l~~~~~vs~~~~l~n~i~d~~~~~~~~ai~~~a~tiE~ 154 (467) T protein:vir:80 75 TVAKYDVYMQHGKVGHTRFTREIGVAPVSDPNIRQKTVNMKFASDTKNISIAAGLVNNIQDPMQILTDDAIVNIAKTIEW 154 (467) T ss_pred hhhhheeeeccCccccccccccccccccCCCceEEEEEEeeeeeeeeeehhhhhhhcchhhHHHHHHHHHHHHHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHhhcccccCCCCCCcccchhhhHHHhhccCCCEEEccCCCCCHHHHhhhhhhhhhccCceEEEecChHHhhhHHHHhhC Q lcl|NC_018856. 161 AIFYGDAALSSEADGQAGIEFDGLHKLIDQDTNVIDLKGARLDEATLNKAAVIVGKGYGRATDAFMPIGVQADFTNNLLD 240 (479) Q Consensus 161 a~f~Gd~~l~~~~~~~~gleFDGl~~~I~~~~NviDarG~~l~~~~l~~aa~~i~~~fG~~td~~mp~~vka~f~~~~~~ 240 (479) +|||||++|++.+++++|||||||+++|+++ ||||+||++|++++||+|+++|+++||++||+|||+++|++|++.++. T Consensus 155 a~FyGds~l~~s~~~~~glqfDGi~~li~~e-nviDa~G~~ls~~~lneaa~~i~~gfG~~td~~~p~~v~a~~~~~~L~ 233 (467) T protein:vir:80 155 ASFFGDSDLSDSPEPQAGLEFDGLAKLINQD-NVHDARGASLTESLLNQAAVMISKGYGTPTDAYMPVGVQADFVNQQLS 233 (467) T ss_pred HhhhcccccccCCCccccccccceeEEecCC-ceeccCCCccCHHHHHHHhhhccccccChhhhhcchhHHhhhhhhhcC Confidence 9999999999888999999999999999985 999999999999999999999999999999999999999999999999 Q ss_pred cceeeeccCCCcceeeeehhhhcCCCcceecccceecCCCceecccCCcCCCCCCCceeEeeeeccCCCCCCCcccccce Q lcl|NC_018856. 241 RQRVIQPSTAGGFSTGFSINQFLSTRGAINLHGSTIMENDNILLEGRNPEPNAPQAPASVVASIVDDKKGGFRDEDIKTH 320 (479) Q Consensus 241 ~qrv~~~~n~g~~~~G~~I~~~~s~~G~I~l~~s~~m~~~~~L~e~~~~~~~AP~~pa~v~at~~t~~~G~f~~~d~gty 320 (479) .||+++.+|.+.+.+|++|++|++++|+|+||||++|+++++|.+.....++||.++ .++++...+++|++.+++.++| T Consensus 234 ~q~~v~~~n~~~~~~G~~v~g~~sa~G~I~l~gs~il~~~~~l~~~~~~~~~Apsp~-~vsaT~~~~~~g~~~~~~~a~y 312 (467) T protein:vir:80 234 KQTQLVRDNGNNVSVGFNIQGFHSARGFIKLHGSTVMENEQILDERILALPTAPQPA-KVTATQEAGKKGQFRAEDLAAH 312 (467) T ss_pred ceEEEEcCCCCceeeeecccceecceeeeeecCceeeccccCCCcccccccccccCC-ccceeeecccCCcccCCCcceE Confidence 988888889999999999999999999999999999999999999999999998855 6778888899999999999999 Q ss_pred EEEEEEEcCcCccccccceeeeeecCCceEEEEEEecCCCCcccceEEEEEecCCCcceEEEEeeeeeeecCCceEEEee Q lcl|NC_018856. 321 SYKVVVHSDDAESLPSEAVTAAVAKKDNTVKLEVKLASLYQAQPQFISVYREGTETGHYFLIARVPVSKVNDQGVIEVLD 400 (479) Q Consensus 321 ~YkVtavn~~GES~pS~~vt~Tv~~~g~sv~ltIT~~~~~~a~~~~y~IYR~~~~~G~y~li~rv~vs~~n~~g~T~ftD 400 (479) +|||++||++|||.||+++++|++++++.++|+|||+++++++|+||+|||++.++|+|+||+|||++.+ .+|+++|+| T Consensus 313 ~Y~v~~vs~~GES~pS~~vtvTVaa~~dg~~ltIt~~~~~~~~p~yv~IYR~~~gg~~f~li~~va~~~a-~~gt~tf~D 391 (467) T protein:vir:80 313 EYKVVVSSDDAESIASEVATATVTAKDDGVKLEIELAPMYSSRPQFVSIYRKGAETGLFYLIARVPASKA-ENNVITFYD 391 (467) T ss_pred EEEEEEECCCCccccccceEEEecCcccceeEEEEecCCCCCcceEEEEEEeCCCCcceeEeeeEeeeec-CCCeEEEEc Confidence 9999999999999999999999999999999999999999999999999999999999999999999986 589999999 Q ss_pred ccccCCCcccceecCCchhhhhhhhhcchhhccccccCcchhhhhhhhhhhheecccccEEEEeccccCcccceeeec Q lcl|NC_018856. 401 RNQVIPETTDVFVGELTPNVVSLLELLPMMKLPLAQMNATTTFTVLWYGALALYAPKKWVRIKNVQYIPALAADVTVK 478 (479) Q Consensus 401 ~N~~iPgT~~~fvGe~~p~vi~l~ellPm~k~Pla~~~~~~~~~V~~ygaL~l~aPkk~~~ikNV~~~~~~~~~~~~~ 478 (479) +|++||||+++|||||+|+||+|+|||||||||||++|+.++|||+|||+|+|+|||||+|||||+|+|-- |+.-. T Consensus 392 ~n~~iPgT~~~fVgem~~~~i~~~~llpm~~lplA~~n~~~~~~Vl~Ygalal~~Pk~~~~ikNv~~~~~~--~~~~~ 467 (467) T protein:vir:80 392 LNDSIPETVDVFVGEMSANVVHLFELLPMMRLPLAQINASVTFAVLWYGALALRAPKKWVRIRNVKYIPVK--NVHSN 467 (467) T ss_pred CCcccCCCcceeeeecChhHHHHHHHhccccCChhHhccchhhhhhhhhHHhhhccccceEEEEeeeeeec--cccCC Confidence 99999999999999999999999999999999999999999999999999999999999999999999842 22212 No 7 >protein:vir:100851 Length: 514 # NCBI annotation: hypothetical protein # Family: family:all:2450 # MgeID: mge:1633 # MgeName: LP65 # Cross-refs: genbank:acc:YP_164744;genbank:gi:56693157;genbank:GeneID:3197484 Probab=100.00 E-value=2.3e-189 Score=1054.74 Aligned_cols=472 Identities=42% Similarity=0.666 Sum_probs=433.4 Q ss_pred CCccchh------------hhhhhhcCC-ccchHHHHHHHHHh-hhcCCCcChhhccCccccchhhhhhhhhhheecccc Q lcl|NC_018856. 1 MTELKKE------------AEAKNKKLP-VEAEAELAELVSKS-FTTGYGITPDTQLDGAAVRRELLEDQVKMLAFSSND 66 (479) Q Consensus 1 ~~~~~~~------------~~~~~~~~~-~~~~~~~~e~~~Ks-~tag~~~~p~~~~~gaalr~esld~~~~~l~~~~~~ 66 (479) |-...|. ..+++-+.. .+..+.+.|+++|| |++||+|+|++|+||+||||||||++|++|+|++++ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~k~a~t~gy~~~~~~~t~gaAlR~EsLd~~l~~Lt~~~~~ 80 (514) T protein:vir:10 1 MYTQDKTKDIMKKSFFGGDRAVAFDTNKEDILNENLPENVKKSAFTAGHSITPDTQTDGAANRIESLNRDLKVTTWGERD 80 (514) T ss_pred CCccchhhHHHhhhhcccceeeeecCcHHHHHHHhcchhhhhhhhccccccCCccccCccchhhhhhccceeEeeecCcc Confidence 1110000 011111111 12345567889999 999999999999999999999999999999999999 Q ss_pred ccchhhccccchhHHHHhhhhhhccCcccccccccccccccccCcceEEEEEEEEeeeehhhhhhhHhhhcchhhHHHHH Q lcl|NC_018856. 67 FTIYPLINKQQVNSTVAKYAVFNQHGRTGHSRFVREVGVASINDPNIRQKTVQMKFLSDTKQQSLAAGLVNNIADPMTIL 146 (479) Q Consensus 67 f~f~~~i~k~~~~stv~eY~~~~~~G~~g~~~fv~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lvn~~~Dp~~~~ 146 (479) |+|||+|+|||++|||+||+++++||++||++|++|+|+++++||+|+||+++||||++++++|+++++||++.||++++ T Consensus 81 ftf~~~i~k~~a~STV~ey~~~~~~G~~G~~~f~~E~gi~~~~d~~~~rk~~~~k~l~~~~~vS~~~~l~n~i~d~~~~~ 160 (514) T protein:vir:10 81 FTLYNDIAKQPVDNTVLKYTQYYSHGRTGHSLFQPEIGIGDVNNPNERQRTINIKYIVDTHVTSIALQRANTIVDSLKVQ 160 (514) T ss_pred hhhhhhcCCchhhHHHhhhhhhcccCcccccccccccccCcCCCcceEEEEEeeeeeeeeeeeeehhhhccchhhHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHhhcccccCCCCCCcccchhhhHHHhhccCCCEEEccCCCCCHHHHhhhhhhhhhccCceEEEec Q lcl|NC_018856. 147 TEDAIAVIAKSIEWAIFYGDAALSSEADGQAGIEFDGLHKLIDQDTNVIDLKGARLDEATLNKAAVIVGKGYGRATDAFM 226 (479) Q Consensus 147 ~~~ai~~~~~~iE~a~f~Gd~~l~~~~~~~~gleFDGl~~~I~~~~NviDarG~~l~~~~l~~aa~~i~~~fG~~td~~m 226 (479) +++||+++||+|||+|||||++|+|+..+ +|||||||+++|+++ |||||||++||+++||+||++|+++||+|||+|| T Consensus 161 ~~dai~~ia~tiE~a~FyGDs~L~s~~~~-~gleFDGl~~lI~~~-NvIDarG~~Ls~~~ln~aA~~i~~gfGt~TD~yl 238 (514) T protein:vir:10 161 EYAAISTVIKTDEWAMFYGDADLTSGQKG-EGLQFDGLFKLIAPE-NHIDLRGGRLSPAALNMAARKIGEGFGTPTDAYM 238 (514) T ss_pred HHHHHHHHHHHHHHHHhhhcccCCCcccc-CcchhhhHHHhhcCC-CeEecCCCCccHHHHhhhhhhhhcccCChhheeC Confidence 99999999999999999999999976544 599999999999985 9999999999999999999999999999999999 Q ss_pred ChHHhhhHHHHhhCcceeeeccCCCcceeeeehhhhcCCCcceecccceecCCCceecccCCcCCCCCCCceeEeeeecc Q lcl|NC_018856. 227 PIGVQADFTNNLLDRQRVIQPSTAGGFSTGFSINQFLSTRGAINLHGSTIMENDNILLEGRNPEPNAPQAPASVVASIVD 306 (479) Q Consensus 227 p~~vka~f~~~~~~~qrv~~~~n~g~~~~G~~I~~~~s~~G~I~l~~s~~m~~~~~L~e~~~~~~~AP~~pa~v~at~~t 306 (479) |+++|++|+|+++|+|||++|+|++++++|+++++|.+++|+|+||||++|+++++|.+.....|.||.+|. +++++++ T Consensus 239 p~~vka~f~~~~~~~qRV~~~~n~~~~~~G~~v~~f~s~~G~I~L~gs~im~~~n~L~~~~~~~~~Ap~~~~-va~svT~ 317 (514) T protein:vir:10 239 PIGIKADFVNQHLNGQRVMLPGQTGGMTTGLDIDKFLSAHGSIRIQGSTIMDSDNKLDFDRPVSPTAPTAPQ-LSATVTP 317 (514) T ss_pred chHHHHHHhhcccCcceEEeecCccceeeeeeccceeEeccceeecCCeeecccccCccCCccCCcCCCCCc-ceEEEec Confidence 999999999999999999999999999999999999999999999999999999999999999999999776 4455566 Q ss_pred CCCCCCCccccc--------------ceEEEEEEEcCcCccccccceeeeeecCCceEEEEEEecCCCCcccceEEEEEe Q lcl|NC_018856. 307 DKKGGFRDEDIK--------------THSYKVVVHSDDAESLPSEAVTAAVAKKDNTVKLEVKLASLYQAQPQFISVYRE 372 (479) Q Consensus 307 ~~~G~f~~~d~g--------------ty~YkVtavn~~GES~pS~~vt~Tv~~~g~sv~ltIT~~~~~~a~~~~y~IYR~ 372 (479) +.+|.|.+++.+ .|+|+|++||++|||+||+++++|++++|++++|+||++++++..|+|++|||+ T Consensus 318 ~~~g~~~~ad~t~~~g~~~~~~~~g~~~sYaVv~~n~~GeS~ps~~vtaT~a~~~~~i~ltItp~~~~~~~p~yv~IYR~ 397 (514) T protein:vir:10 318 DGGGLWHEADKTDSKGEVILNKEVGVEQSYVAVMVSRHGDSRPSLVQTATPTKKDDAITLTITPNAMQNVIPDYVAIYRK 397 (514) T ss_pred CcccccCcccccccccccccccccceeEEEEEEEECCCCcccccceeeeeeeccCceEEEEEEeccCcccccceEEEEec Confidence 667777766653 588999999999999999999999999999999999999999999999999999 Q ss_pred cC--------------CCcceEEEEeeeeeeecCCceEEEeeccccCCCcccceecCCchhhhhhhhhcchhhccccccC Q lcl|NC_018856. 373 GT--------------ETGHYFLIARVPVSKVNDQGVIEVLDRNQVIPETTDVFVGELTPNVVSLLELLPMMKLPLAQMN 438 (479) Q Consensus 373 ~~--------------~~G~y~li~rv~vs~~n~~g~T~ftD~N~~iPgT~~~fvGe~~p~vi~l~ellPm~k~Pla~~~ 438 (479) +. ++|+|++|+|||++. +.+|+|+|+|+|+|||||+++|||||+||||+|+|||||||||||+.| T Consensus 398 ~~~~s~~~~~~~~~~~~tGdf~li~rv~~~~-~~~gttt~~D~n~~IPgT~~vfVgemspevi~l~ellPm~klpLA~~n 476 (514) T protein:vir:10 398 SNFDSDALEANTDASGNRGSYYLIGKVAVRE-QEGATITFVDTNARIAGCGDVFVIENRPETVALQEFIPLSKLNLAVTT 476 (514) T ss_pred cCCCcchhhhhccccccccceeEEEEEeeec-CCCCeEEEeccccccCCcceeEEeeCchHHHHHHHHhhhhhcChhhhc Confidence 74 669999999999966 789999999999999999999999999999999999999999999999 Q ss_pred cchhhhhhhhhhhheecccccEEEEeccccCcccceee Q lcl|NC_018856. 439 ATTTFTVLWYGALALYAPKKWVRIKNVQYIPALAADVT 476 (479) Q Consensus 439 ~~~~~~V~~ygaL~l~aPkk~~~ikNV~~~~~~~~~~~ 476 (479) ++++|||+|||+|+|+|||||+|||||+|+|----+.+ T Consensus 477 a~~~waVlwYGaLal~aPkr~~~IkNv~~~~v~~~~~~ 514 (514) T protein:vir:10 477 TATSFVVLNYVALALYYPKRGAVLENVVYSRVEDLELS 514 (514) T ss_pred chHHHHHHHHhHHHhhccccceEEEeeeeeeccccccC Confidence 99999999999999999999999999999997655555 No 8 >protein:vir:102823 Length: 470 # NCBI annotation: major structural protein # Family: family:all:2450 # MgeID: mge:1610 # MgeName: YS40 # Cross-refs: genbank:acc:YP_874086;genbank:gi:118197693;genbank:GeneID:4496015 Probab=100.00 E-value=4.5e-157 Score=877.71 Aligned_cols=437 Identities=20% Similarity=0.286 Sum_probs=388.2 Q ss_pred CCccchHHHHHHHHHhhhcCCCcChhhccCccccchhhhhhhhhhheeccccccchhhccccchhHHHHhhhhhhc-cCc Q lcl|NC_018856. 15 LPVEAEAELAELVSKSFTTGYGITPDTQLDGAAVRRELLEDQVKMLAFSSNDFTIYPLINKQQVNSTVAKYAVFNQ-HGR 93 (479) Q Consensus 15 ~~~~~~~~~~e~~~Ks~tag~~~~p~~~~~gaalr~esld~~~~~l~~~~~~f~f~~~i~k~~~~stv~eY~~~~~-~G~ 93 (479) ||++++++|.|+..|++++.-.+ |+||||||||++|++|+|++++|+|||+|+|||++|||+||+++++ ||+ T Consensus 1 ~~~~~~~~~~~a~~~al~~a~~~-------g~AlR~EsLd~~l~~lt~~~~~ftf~~~i~k~~a~STV~ey~~~~~rhG~ 73 (470) T protein:vir:10 1 MPYEHLKHLDEATLKALNAAGQV-------AESLEREDLEPEVTQLNVLDTPLTDLLSKNAVKAKAYEHEYNVVTARHDK 73 (470) T ss_pred CChhHhhhhhHHHHHHHHHhhhc-------chhhhhhhhccceeEeeecCccchhhhhcCCchhhhHhhhhhhhcccccc Confidence 99999999999999999993333 7889999999999999999999999999999999999999999886 888 Q ss_pred ccccccccccccccccCcceEEEEEEEEeeeehhhhhhhHh--hhcchhhHHHHHHHHHHHHHHHHHHHHHhhcccccCC Q lcl|NC_018856. 94 TGHSRFVREVGVASINDPNIRQKTVQMKFLSDTKQQSLAAG--LVNNIADPMTILTEDAIAVIAKSIEWAIFYGDAALSS 171 (479) Q Consensus 94 ~g~~~fv~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~--lvn~~~Dp~~~~~~~ai~~~~~~iE~a~f~Gd~~l~~ 171 (479) .||++| +|+|+++++||+|+||+++||||+++++||+++. ++|+++||+++++++||+++||+|||+|||||++|++ T Consensus 74 ~g~s~~-~E~~l~~~~d~~~~Rr~v~~K~l~~~~~VT~~a~~~~~n~v~d~~~~~~~dai~~ia~tiE~a~FyGDs~l~s 152 (470) T protein:vir:10 74 IGYAAF-REGGLPRTVEVNVVRRRIRPMLVGHRITVTELATRTTQNGVMQIDELVKREKMIAVANEFEYLAFYGDNLLGD 152 (470) T ss_pred ccceee-cccccCccCCCceEEEEEEEEEEeecchhhhhhhhhhhccccchHHHHHHHHHHHHHHHHHhhhhhhcccccc Confidence 888866 9999999999999999999999999999999974 5688999999999999999999999999999999987 Q ss_pred C-CCCcccchhhhHHHhhccC--CCEEEccCCCCCHHHHhhhhh--hhhhccCceEEEecChHHhhhHHHHhhCcceeee Q lcl|NC_018856. 172 E-ADGQAGIEFDGLHKLIDQD--TNVIDLKGARLDEATLNKAAV--IVGKGYGRATDAFMPIGVQADFTNNLLDRQRVIQ 246 (479) Q Consensus 172 ~-~~~~~gleFDGl~~~I~~~--~NviDarG~~l~~~~l~~aa~--~i~~~fG~~td~~mp~~vka~f~~~~~~~qrv~~ 246 (479) . +++++|||||||.++||++ +|||||||++||+++||+|+. +++++||+|||+|||+++|++|+|+++++|||++ T Consensus 153 ~~~g~~~gleFDGl~~lId~~~~~NViDarG~~Ls~~~L~~aa~~I~~~~~fGt~TD~~lp~~vka~f~~~~~~~qRv~~ 232 (470) T protein:vir:10 153 DVPGSPNNLQQDGIINIIKRGAPQNVLDAGGRPLSIDLLWEAESRVVSTQAFANPTAVFISYVDKLNLQASFYQISRVMT 232 (470) T ss_pred ccCcccCceeccchhhhccCCCCccccccCCCCccHHHHHHHHhhhcccccccChhhhccchhHHHHHHHhhcCceEEEE Confidence 6 6778999999999999853 799999999999999999995 4589999999999999999999999999999999 Q ss_pred ccCCCcceeeeehhhhcCCCcceecccceecC-----CCceecccCCcCCCCCCCceeEeee------eccCCCCCCCcc Q lcl|NC_018856. 247 PSTAGGFSTGFSINQFLSTRGAINLHGSTIME-----NDNILLEGRNPEPNAPQAPASVVAS------IVDDKKGGFRDE 315 (479) Q Consensus 247 ~~n~g~~~~G~~I~~~~s~~G~I~l~~s~~m~-----~~~~L~e~~~~~~~AP~~pa~v~at------~~t~~~G~f~~~ 315 (479) ++|.|++++|++|++|.|++|+|+||||++|+ ++++|.+ .++.-+||+..+.|..+ ...+.+|.|.++ T Consensus 233 ~~N~~~~~~G~~v~~f~sa~G~I~L~~s~~m~~~~k~~p~~l~~-~v~~~aAP~~~~tv~~t~~~~a~~~~sk~g~~~~~ 311 (470) T protein:vir:10 233 TADRRAGLLGADAQSYIGVRGEHSLYPSQFLGDFHKFNPARFGA-EVGDFAAPSNSWTVSTTDNFVTLPYNSGLGDPANT 311 (470) T ss_pred ecCCCceeeeeeccceeeeeeeeeecccccccchhhcCcccCCc-ccCCcccCceeEEeecCCCceeecccCCCCcccCc Confidence 99999999999999999999999999999999 4555655 34433477655554433 223456678888 Q ss_pred cccceEEEEEEEcCcCccccccc--eeeeeecCCceEEEEEEecCCCCcccceEEEEEecCCCcceEEEEeeeeeeecCC Q lcl|NC_018856. 316 DIKTHSYKVVVHSDDAESLPSEA--VTAAVAKKDNTVKLEVKLASLYQAQPQFISVYREGTETGHYFLIARVPVSKVNDQ 393 (479) Q Consensus 316 d~gty~YkVtavn~~GES~pS~~--vt~Tv~~~g~sv~ltIT~~~~~~a~~~~y~IYR~~~~~G~y~li~rv~vs~~n~~ 393 (479) +++.|.|+||.++ ||| +|++ +++|++..+++|+|+|+.+. .++|++|||+++++|.|++|+||+++++| + T Consensus 312 ~v~sy~y~v~~~~--gds-~s~~v~vt~t~~~v~kgv~ltI~~~~----~v~yv~IYRk~~~s~~~~li~rv~v~~~n-g 383 (470) T protein:vir:10 312 TVYSYAFKAANFY--GES-AAKYIDVYIDSTEAGKGVRFQFHGLV----NVKWLDVYRKDPGSQEYKFYKRVKVSTVN-G 383 (470) T ss_pred ceeEEEEEEEEec--CCC-CcceEEEEEeeehhcceeEEEEecCC----CCcEEEEEeecCCCCceeEEEEEeeeecc-C Confidence 8876666666555 666 3444 46677788999999998553 37999999999999999999999999988 8 Q ss_pred ceEEEeeccccCCCcccc----------eecCCchhhhhhhhhcch--hhccccccCcchhhhhhhhhhhheecccccEE Q lcl|NC_018856. 394 GVIEVLDRNQVIPETTDV----------FVGELTPNVVSLLELLPM--MKLPLAQMNATTTFTVLWYGALALYAPKKWVR 461 (479) Q Consensus 394 g~T~ftD~N~~iPgT~~~----------fvGe~~p~vi~l~ellPm--~k~Pla~~~~~~~~~V~~ygaL~l~aPkk~~~ 461 (479) +.+.|+|.|++||+|+++ |||||+|++++|++|||| +|||+++.+....|.| |+|+|+|||||++ T Consensus 384 ~~~~~~D~~e~i~tt~~v~~~~~~Pgt~~Vgemsp~v~sl~~~l~m~l~klp~a~~~~~v~~~v---galal~aPKr~~~ 460 (470) T protein:vir:10 384 DFTWIDDGHETVTTPSGVYRWKKIPGTGVVVGIDPNVTTMAVWIGMELYRLPPALTHDYVIWKV---ASVFSRAPEFNFL 460 (470) T ss_pred CEEEEecccccCCCcceeeeecccCcceeccccCcchhhhhhhhhhhhhhcCHHHHHHHHHHHH---HHHHHhccccceE Confidence 889999999999999987 999999999999999999 7899888887777777 9999999999999 Q ss_pred EEeccccCcc Q lcl|NC_018856. 462 IKNVQYIPAL 471 (479) Q Consensus 462 ikNV~~~~~~ 471 (479) ||||+|+|-. T Consensus 461 IkNV~~~~~~ 470 (470) T protein:vir:10 461 IVNVGQEPIV 470 (470) T ss_pred EEEeeeeecC Confidence 9999999988 No 9 >protein:vir:8843 Length: 317 # NCBI annotation: major head protein # Family: family:all:3919 # MgeID: mge:158 # MgeName: PaP3 # Cross-refs: genbank:acc:NP_775251;genbank:gi:27476049;genbank:GeneID:2700597 Probab=99.42 E-value=8.2e-15 Score=97.81 Aligned_cols=294 Identities=13% Similarity=0.096 Sum_probs=182.1 Q ss_pred CcCh--hhccCccccchhhhhhhhhhheeccccccchhhccccchhHHHHhhhhhhccCcccccccccccccccccC-cc Q lcl|NC_018856. 36 GITP--DTQLDGAAVRRELLEDQVKMLAFSSNDFTIYPLINKQQVNSTVAKYAVFNQHGRTGHSRFVREVGVASIND-PN 112 (479) Q Consensus 36 ~~~p--~~~~~gaalr~esld~~~~~l~~~~~~f~f~~~i~k~~~~stv~eY~~~~~~G~~g~~~fv~E~g~~~~~d-~~ 112 (479) =..| ..++--+..-||+|.++|.++.-.+. .|+-.|.|.++++|.+||....=..-. . .-..||+++.... .. T Consensus 1 ma~~~~~~~t~~~~g~~~dl~~~I~~isp~dT--Pf~S~i~~~~a~~~~~~W~~d~l~~~~-~-~~~~EG~da~~~~~~~ 76 (317) T protein:vir:88 1 MATPTNAVSTVEINGKREDLIDIIYNIAPYDT--PFMSAIGKGVATAITHEWQTDELRQPG-K-NTRVEGEDATIKAGSF 76 (317) T ss_pred CCccccceEeeeeeeeeechhhhheecCCccC--cceeeecCceecccEEEEEeeecCCcc-c-cccccCcccccccccC Confidence 1111 12234567789999999988876665 556678889999999999864422211 1 2234886543332 22 Q ss_pred eEEEEEEEEeeeehhhhhhhHhhhcc--hhhHHHHHHHHHHHHHHHHHHHHHhhcccccCCCCCCcccchhhhHHHhhcc Q lcl|NC_018856. 113 IRQKTVQMKFLSDTKQQSLAAGLVNN--IADPMTILTEDAIAVIAKSIEWAIFYGDAALSSEADGQAGIEFDGLHKLIDQ 190 (479) Q Consensus 113 ~~r~~~~~k~l~~~~~vs~~~~lvn~--~~Dp~~~~~~~ai~~~~~~iE~a~f~Gd~~l~~~~~~~~gleFDGl~~~I~~ 190 (479) -.|+...+--+.+..+||.-++.++. ++|-++.|...++.-|..++|+++++|.+....+..+.. =+.+||...|.. T Consensus 77 r~~~~N~tQIf~k~v~VSgTa~av~~~G~~~ela~q~~kk~~EikrdmE~~li~g~~a~~~~~~t~~-r~~~Gl~~~i~t 155 (317) T protein:vir:88 77 TTMLNNYCQISDETLQVTGTADRVKKAGRKNELAYQLAKKSKELKLDMEYALVGAPQAKVQRNTTTP-GQMANIFAYYKT 155 (317) T ss_pred CEEeccEEEEEEeEEEEeehhhhhhhcCccchhHHHHHHHHHHHHHHHHHHHhcCeeeccCCCCccc-hhhhhHHHHhcc Confidence 33444444456666777777777644 569999999999999999999999999987654322211 268999999976 Q ss_pred CCCEEEccC----------------CCCCHHHHhhhhhhhhhccCceEEEecChHHhhhHHHHhhCcceeeeccCCCcce Q lcl|NC_018856. 191 DTNVIDLKG----------------ARLDEATLNKAAVIVGKGYGRATDAFMPIGVQADFTNNLLDRQRVIQPSTAGGFS 254 (479) Q Consensus 191 ~~NviDarG----------------~~l~~~~l~~aa~~i~~~fG~~td~~mp~~vka~f~~~~~~~qrv~~~~n~g~~~ 254 (479) + |+..+.| ..|+|+.|+++...+-.+.|.++.+|++...|..|...+-.+...+ ........ T Consensus 156 ~-~~~~~~g~~~~~~~~~~~t~~t~~~lte~~l~~~l~~i~~~Gg~~~~i~v~a~~k~~i~~~~~~~~~~i-~~~~~~~~ 233 (317) T protein:vir:88 156 N-GSLGANGVAPVGDGSNTGTAGDLRLLTEDMLLNASESIWRNGGQANSIQTSSSIKKAISKNMKGRATEI-TLDASDNR 233 (317) T ss_pred C-ceeccCccccccCCCccccccccccccHHHHHHHHHHHHhcCCCCCEEEeChHHHHHHHHHhcCCceeE-EEcccCeE Confidence 4 4444443 4699999999999999999999999999999999977765544333 22344568 Q ss_pred eeeehhhhcCCCcceecccceecCCCcee-cc-cCCcCCCCCCCceeEeeeeccCCCCCCCcccccceEEEEEEEcCcCc Q lcl|NC_018856. 255 TGFSINQFLSTRGAINLHGSTIMENDNIL-LE-GRNPEPNAPQAPASVVASIVDDKKGGFRDEDIKTHSYKVVVHSDDAE 332 (479) Q Consensus 255 ~G~~I~~~~s~~G~I~l~~s~~m~~~~~L-~e-~~~~~~~AP~~pa~v~at~~t~~~G~f~~~d~gty~YkVtavn~~GE 332 (479) .|+.|+.|.|+.|.|++..+.+|..+..+ ++ ..+.. ++--|. -....+-+|.....-- .-.|.+-+.|..+- T Consensus 234 ~g~~v~~~~tdfG~v~ii~~r~lp~~~~~~~D~~~~~l--~~Lr~~--~~e~laKtGd~~k~~i--~~E~tLe~~N~~a~ 307 (317) T protein:vir:88 234 IAQTVDVYESDFGKYTIRANRWFHENTLFVFDPKMHSL--CYLRPF--FQHELAKTGDSEKRQL--LVEYTFRVNNEKSG 307 (317) T ss_pred EEEEEEEEEeCCeEEEEEeCCCCCCCeEEEEcccccce--eecccc--eeeccCCCcccceeEE--EEEEEEEEcCccce Confidence 99999999999999999999999866553 22 11100 111011 1111111111000000 01233333333211 Q ss_pred cccccceeeee Q lcl|NC_018856. 333 SLPSEAVTAAV 343 (479) Q Consensus 333 S~pS~~vt~Tv 343 (479) +--.-.++++ T Consensus 308 -a~i~~l~~~~ 317 (317) T protein:vir:88 308 -ALIRDVVAQL 317 (317) T ss_pred -eEEEEecccC Confidence 0011111112 No 10 >protein:vir:93631 Length: 580 # NCBI annotation: Bcep22gp67 # Family: family:all:1544 # MgeID: mge:1470 # MgeName: Bcep22 # Cross-refs: genbank:acc:NP_944296;genbank:gi:38640373;genbank:GeneID:2658280 Probab=99.09 E-value=8.8e-13 Score=86.66 Aligned_cols=247 Identities=21% Similarity=0.195 Sum_probs=104.9 Q ss_pred ccCCCEEEccCCC--CCHHHHhh-hhhhhhhccCceEEEecChHHhhhHHHH-hhCcceeeeccCCCcceeeeehhhhcC Q lcl|NC_018856. 189 DQDTNVIDLKGAR--LDEATLNK-AAVIVGKGYGRATDAFMPIGVQADFTNN-LLDRQRVIQPSTAGGFSTGFSINQFLS 264 (479) Q Consensus 189 ~~~~NviDarG~~--l~~~~l~~-aa~~i~~~fG~~td~~mp~~vka~f~~~-~~~~qrv~~~~n~g~~~~G~~I~~~~s 264 (479) =..=.+...+|+- |-+.+|-+ +|+ .+.+..+-.|+-..+-+. ++-+ .... T Consensus 1 M~~i~i~~f~Ge~Prl~p~lLP~~~a~-------~a~n~~~~~G~i~P~~~~~~~~~-------------------~~~i 54 (580) T protein:vir:93 1 MTIIKITGFSGEIPRLVPRLLPDTAAQ-------NATNARLESGGLTPYRKPKFITR-------------------ISTI 54 (580) T ss_pred CeeEeecccccccccchhhhccccccc-------eEEeeeccCCeeeeeeCchhhcc-------------------cccc Confidence 0001112222221 11222221 111 122222222222222110 0000 0000 Q ss_pred CCcc---eecccceecCCCce--------------ecccCCcCCCCCC------CceeEeeeeccCCCCCCCcccccceE Q lcl|NC_018856. 265 TRGA---INLHGSTIMENDNI--------------LLEGRNPEPNAPQ------APASVVASIVDDKKGGFRDEDIKTHS 321 (479) Q Consensus 265 ~~G~---I~l~~s~~m~~~~~--------------L~e~~~~~~~AP~------~pa~v~at~~t~~~G~f~~~d~gty~ 321 (479) +-++ |=+++..|+.-+.+ +..+..|..++++ .|.+..+.++...++ ...+.++|+ T Consensus 55 ~~~~~~t~~~~~~~W~~w~~~V~~i~~PvA~DRvy~Td~g~Pkvt~~g~sy~lgVpaPs~Apt~~~~g~--g~l~~~~y~ 132 (580) T protein:vir:93 55 PAGQIETIYRNGETWMAWDKPVYAAPGPVAADRLYVMGDGAPKMIVGGTTYPLAVPMPSAALTAATSGT--GTGDVFSRV 132 (580) T ss_pred CcCcceEEEecCceeEEeCCceeeecCccccceeEEcCCcccceecCCccccccCCCcccCceeeecCC--CCcCccceE Confidence 1111 11222233222111 1122222222222 122222322222222 245778999 Q ss_pred EEEEEEcCcC-ccccccceeeeeecCCceEEEEEEecCCCCcccceEEEEEecCCC--cceEEEEeeeeeeecCCceEEE Q lcl|NC_018856. 322 YKVVVHSDDA-ESLPSEAVTAAVAKKDNTVKLEVKLASLYQAQPQFISVYREGTET--GHYFLIARVPVSKVNDQGVIEV 398 (479) Q Consensus 322 YkVtavn~~G-ES~pS~~vt~Tv~~~g~sv~ltIT~~~~~~a~~~~y~IYR~~~~~--G~y~li~rv~vs~~n~~g~T~f 398 (479) |++|.|+++| ||.||.+........|++|+|+..+.+..+...+.++|||+..++ ++|+|++.++. ++++| T Consensus 133 Yv~TfVt~~GeES~PS~~S~~vtv~~g~tVtLs~~p~p~~~~~i~~~RIYRS~tG~~gtdy~lVAel~A------g~~sF 206 (580) T protein:vir:93 133 YVYTFVTGFGEESEPSAISNEVNWQAGQTVTLSGFQAAPAGRNITKQRIYRSQTSLSGTDLYFIAERDA------SAANF 206 (580) T ss_pred EEEEEEcCCCCcCCCcccccceeeCCCCeEEEEecCCCCCCCccceEEEEEeccCCCceeEEEEeeecc------ceeee Confidence 9999999999 899887644444456888888887777776666779999987654 59999999765 56799 Q ss_pred eeccccCCCcccceecCCchhhhhh----hhhcchhhccccccCcchhhhhhhhhhhheecccccEE-E-Eec-cccCcc Q lcl|NC_018856. 399 LDRNQVIPETTDVFVGELTPNVVSL----LELLPMMKLPLAQMNATTTFTVLWYGALALYAPKKWVR-I-KNV-QYIPAL 471 (479) Q Consensus 399 tD~N~~iPgT~~~fvGe~~p~vi~l----~ellPm~k~Pla~~~~~~~~~V~~ygaL~l~aPkk~~~-i-kNV-~~~~~~ 471 (479) +|...... .|+.=| +..| ..|..+..||+.-+ +-..+-..||.=. +.|.-|-. + ..+ ..+-|+ T Consensus 207 ~Dd~s~a~------Lge~Lp-s~~~~~PP~~m~gL~~m~nGi~-agF~Gnev~fsEp--y~P~AWP~~yr~t~~~~Ivai 276 (580) T protein:vir:93 207 VDNVPLSD------QNEPLP-SLEWNAPPDDLTGLISLPNGMM-AAFRGKELWLCEP--WRPHAWPQKYVLTMDYNIVAL 276 (580) T ss_pred eecccccc------cccccc-hhhccCcCCCcceEEeeccceE-EEEeCCEEEEecC--CCCccchhhcCCCCCCCceeE Confidence 99864421 111111 1111 01122223443211 1111112222111 44554432 1 111 134455 Q ss_pred cceeeecC Q lcl|NC_018856. 472 AADVTVKY 479 (479) Q Consensus 472 ~~~~~~~~ 479 (479) |+.=+.-| T Consensus 277 a~~g~~Lv 284 (580) T protein:vir:93 277 GAYGTTIV 284 (580) T ss_pred eeeCceEE Confidence 55544444 No 11 >protein:vir:2792 Length: 567 # NCBI annotation: hypothetical protein # Family: family:all:1544 # MgeID: mge:59 # MgeName: Stx2 converting bacteriophage I # Cross-refs: genbank:acc:NP_612909;genbank:gi:20065826;genbank:GeneID:935648 Probab=99.02 E-value=2e-11 Score=79.28 Aligned_cols=255 Identities=15% Similarity=0.189 Sum_probs=106.5 Q ss_pred CCCHHHHhhhhh--hhhhccCceEEEecChHHhhhH---HHHhhCcceeeeccCCCcceeeee--------------hhh Q lcl|NC_018856. 201 RLDEATLNKAAV--IVGKGYGRATDAFMPIGVQADF---TNNLLDRQRVIQPSTAGGFSTGFS--------------INQ 261 (479) Q Consensus 201 ~l~~~~l~~aa~--~i~~~fG~~td~~mp~~vka~f---~~~~~~~qrv~~~~n~g~~~~G~~--------------I~~ 261 (479) -..+.+|..... .|.|-= -+.-+=||..--..| ..-+.|+ ++|.+....+.+.. +.+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~-~~~~~~M~~i~i~~f~Ge~Prl~p~---lLP~~~a~~A~n~~~~~G~itP~~~~~~~~~ 76 (567) T protein:vir:27 1 MMPIAILANSIINPLIFKPE-AVKGISMPYIDITTMRGMMPRVVTS---MLPEHSAVLAEDCHFRFGVITPERQISGVEK 76 (567) T ss_pred Ccchhhhhhhhccceeeccc-ccccceeeEEeecccccccccchhh---hccccccceEEeeeccCCeeeeeeccccccc Confidence 222222211100 000000 000011111111111 0000000 12222222222211 111 Q ss_pred hcCCC-cceecc-cceecCCCce---------------------------------ecccCCcCCC-CCCCcee----Ee Q lcl|NC_018856. 262 FLSTR-GAINLH-GSTIMENDNI---------------------------------LLEGRNPEPN-APQAPAS----VV 301 (479) Q Consensus 262 ~~s~~-G~I~l~-~s~~m~~~~~---------------------------------L~e~~~~~~~-AP~~pa~----v~ 301 (479) ..+.+ .-|=++ ++.|+.-+.+ +.++..|.+. .=+.|++ ++ T Consensus 77 ~~~~~~~Tif~y~~~~W~~w~~~V~~ir~PvAqD~~~rvY~tgdg~Pk~t~~~iat~G~~~~P~~~y~LgVpaps~aP~~ 156 (567) T protein:vir:27 77 TFTIKPKTIFHYRDDFWFAWPDVVDVIRSPIAQDPHGRIYYTDGRFPKVTDATIATKGDGNHPTSSYRLGIPAPTTAPVC 156 (567) T ss_pred ccccCceeeEEEcCcEEEEeCCceeeccCccccCCcceEEEecCCcceeeeeeeeecCCCCCCcchhhcccCCcccccee Confidence 11111 112122 2223322211 2234444442 1122222 22 Q ss_pred eeeccCCCCCCCcccccceEEEEEEEcCcC-ccccccc-eeeeeecCCceEEEEEEecCCCCcccceEEEEEecCCC--c Q lcl|NC_018856. 302 ASIVDDKKGGFRDEDIKTHSYKVVVHSDDA-ESLPSEA-VTAAVAKKDNTVKLEVKLASLYQAQPQFISVYREGTET--G 377 (479) Q Consensus 302 at~~t~~~G~f~~~d~gty~YkVtavn~~G-ES~pS~~-vt~Tv~~~g~sv~ltIT~~~~~~a~~~~y~IYR~~~~~--G 377 (479) +.+.+++-..-.+.|+.++.|++|.|+++| ||.||.+ ..+++...|++|.|+..+++..+..-+..+|||+..++ + T Consensus 157 a~~~~~~~~~~~~~d~etr~Yv~TfVt~~GeES~PS~~S~~~~v~~pg~~V~ls~~p~~~~~~~i~~~RIYRS~tg~~gt 236 (567) T protein:vir:27 157 TVQQGGDVSDDNPNDDETRFYTETFVSDYGEEGPPGPASLEVTLRTPGTAVQLTLAPVPLQNASIKRRRIYRSASGGGEA 236 (567) T ss_pred eecCCCCCCCCCCcccceeEEEEEEEcCCCCcCCCcccccceeeecCCceEEEeeccCCccccccceEEEEEecCCCCce Confidence 222333333345678899999999999999 7888866 35677778999999999898888888999999987764 4 Q ss_pred ceEEEEeeeeeeecCCceEEEeeccccCCCcccceecCCchhhhhh----hhhcchhhccccccCcchhhhhhhhhhhhe Q lcl|NC_018856. 378 HYFLIARVPVSKVNDQGVIEVLDRNQVIPETTDVFVGELTPNVVSL----LELLPMMKLPLAQMNATTTFTVLWYGALAL 453 (479) Q Consensus 378 ~y~li~rv~vs~~n~~g~T~ftD~N~~iPgT~~~fvGe~~p~vi~l----~ellPm~k~Pla~~~~~~~~~V~~ygaL~l 453 (479) +|+|+++++. ++++|+|.- |.+- .|+.=| +..| ..|..|..||++=+-+ ..+-=.||. -- T Consensus 237 dy~lVael~a------s~~sf~D~~---~~~~---lg~~Lp-s~~w~~PP~~m~GL~~m~NGimAg-F~GneV~Fs--Ep 300 (567) T protein:vir:27 237 DFLLVAELDA------SVLSYTDKI---PAKN---LGPSLA-TWDYLPPPENMTGLCLMANGIAAG-FAGNEVMFS--EA 300 (567) T ss_pred eeEEEEeecc------ceeeeeecc---chhh---cccccc-cccccCcCcccceeeecccceEEe-ecCCEEEEe--cC Confidence 8999999876 457999983 2221 110000 0011 1122222333221110 000000000 01 Q ss_pred ecccccEEEEeccccCccccee-------eecC Q lcl|NC_018856. 454 YAPKKWVRIKNVQYIPALAADV-------TVKY 479 (479) Q Consensus 454 ~aPkk~~~ikNV~~~~~~~~~~-------~~~~ 479 (479) |-|.-| ..+|.-..-.|+ +.-+ T Consensus 301 ylPyAW----P~~Yr~t~~~dIVaiA~~gt~LV 329 (567) T protein:vir:27 301 YLPYAW----PEVNRHTTAEDIVAICPLGTSLV 329 (567) T ss_pred CCCccc----chhhccCCCCCeEEEeecccEEE Confidence 334434 234433333332 1111 No 12 >protein:vir:3306 Length: 567 # NCBI annotation: hypothetical protein # Family: family:all:1544 # MgeID: mge:66 # MgeName: 933W # Cross-refs: genbank:acc:NP_049522;genbank:gi:9632528;genbank:GeneID:1262016 Probab=99.02 E-value=2e-11 Score=79.28 Aligned_cols=255 Identities=15% Similarity=0.189 Sum_probs=106.5 Q ss_pred CCCHHHHhhhhh--hhhhccCceEEEecChHHhhhH---HHHhhCcceeeeccCCCcceeeee--------------hhh Q lcl|NC_018856. 201 RLDEATLNKAAV--IVGKGYGRATDAFMPIGVQADF---TNNLLDRQRVIQPSTAGGFSTGFS--------------INQ 261 (479) Q Consensus 201 ~l~~~~l~~aa~--~i~~~fG~~td~~mp~~vka~f---~~~~~~~qrv~~~~n~g~~~~G~~--------------I~~ 261 (479) -..+.+|..... .|.|-= -+.-+=||..--..| ..-+.|+ ++|.+....+.+.. +.+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~-~~~~~~M~~i~i~~f~Ge~Prl~p~---lLP~~~a~~A~n~~~~~G~itP~~~~~~~~~ 76 (567) T protein:vir:33 1 MMPIAILANSIINPLIFKPE-AVKGISMPYIDITTMRGMMPRVVTS---MLPEHSAVLAEDCHFRFGVITPERQISGVEK 76 (567) T ss_pred Ccchhhhhhhhccceeeccc-ccccceeeEEeecccccccccchhh---hccccccceEEeeeccCCeeeeeeccccccc Confidence 222222211100 000000 000011111111111 0000000 12222222222211 111 Q ss_pred hcCCC-cceecc-cceecCCCce---------------------------------ecccCCcCCC-CCCCcee----Ee Q lcl|NC_018856. 262 FLSTR-GAINLH-GSTIMENDNI---------------------------------LLEGRNPEPN-APQAPAS----VV 301 (479) Q Consensus 262 ~~s~~-G~I~l~-~s~~m~~~~~---------------------------------L~e~~~~~~~-AP~~pa~----v~ 301 (479) ..+.+ .-|=++ ++.|+.-+.+ +.++..|.+. .=+.|++ ++ T Consensus 77 ~~~~~~~Tif~y~~~~W~~w~~~V~~ir~PvAqD~~~rvY~tgdg~Pk~t~~~iat~G~~~~P~~~y~LgVpaps~aP~~ 156 (567) T protein:vir:33 77 TFTIKPKTIFHYRDDFWFAWPDVVDVIRSPIAQDPHGRIYYTDGRFPKVTDATIATKGDGNHPTSSYRLGIPAPTTAPVC 156 (567) T ss_pred ccccCceeeEEEcCcEEEEeCCceeeccCccccCCcceEEEecCCcceeeeeeeeecCCCCCCcchhhcccCCcccccee Confidence 11111 112122 2223322211 2234444442 1122222 22 Q ss_pred eeeccCCCCCCCcccccceEEEEEEEcCcC-ccccccc-eeeeeecCCceEEEEEEecCCCCcccceEEEEEecCCC--c Q lcl|NC_018856. 302 ASIVDDKKGGFRDEDIKTHSYKVVVHSDDA-ESLPSEA-VTAAVAKKDNTVKLEVKLASLYQAQPQFISVYREGTET--G 377 (479) Q Consensus 302 at~~t~~~G~f~~~d~gty~YkVtavn~~G-ES~pS~~-vt~Tv~~~g~sv~ltIT~~~~~~a~~~~y~IYR~~~~~--G 377 (479) +.+.+++-..-.+.|+.++.|++|.|+++| ||.||.+ ..+++...|++|.|+..+++..+..-+..+|||+..++ + T Consensus 157 a~~~~~~~~~~~~~d~etr~Yv~TfVt~~GeES~PS~~S~~~~v~~pg~~V~ls~~p~~~~~~~i~~~RIYRS~tg~~gt 236 (567) T protein:vir:33 157 TVQQGGDVSDDNPNDDETRFYTETFVSDYGEEGPPGPASLEVTLRTPGTAVQLTLAPVPLQNASIKRRRIYRSASGGGEA 236 (567) T ss_pred eecCCCCCCCCCCcccceeEEEEEEEcCCCCcCCCcccccceeeecCCceEEEeeccCCccccccceEEEEEecCCCCce Confidence 222333333345678899999999999999 7888866 35677778999999999898888888999999987764 4 Q ss_pred ceEEEEeeeeeeecCCceEEEeeccccCCCcccceecCCchhhhhh----hhhcchhhccccccCcchhhhhhhhhhhhe Q lcl|NC_018856. 378 HYFLIARVPVSKVNDQGVIEVLDRNQVIPETTDVFVGELTPNVVSL----LELLPMMKLPLAQMNATTTFTVLWYGALAL 453 (479) Q Consensus 378 ~y~li~rv~vs~~n~~g~T~ftD~N~~iPgT~~~fvGe~~p~vi~l----~ellPm~k~Pla~~~~~~~~~V~~ygaL~l 453 (479) +|+|+++++. ++++|+|.- |.+- .|+.=| +..| ..|..|..||++=+-+ ..+-=.||. -- T Consensus 237 dy~lVael~a------s~~sf~D~~---~~~~---lg~~Lp-s~~w~~PP~~m~GL~~m~NGimAg-F~GneV~Fs--Ep 300 (567) T protein:vir:33 237 DFLLVAELDA------SVLSYTDKI---PAKN---LGPSLA-TWDYLPPPENMTGLCLMANGIAAG-FAGNEVMFS--EA 300 (567) T ss_pred eeEEEEeecc------ceeeeeecc---chhh---cccccc-cccccCcCcccceeeecccceEEe-ecCCEEEEe--cC Confidence 8999999876 457999983 2221 110000 0011 1122222333221110 000000000 01 Q ss_pred ecccccEEEEeccccCccccee-------eecC Q lcl|NC_018856. 454 YAPKKWVRIKNVQYIPALAADV-------TVKY 479 (479) Q Consensus 454 ~aPkk~~~ikNV~~~~~~~~~~-------~~~~ 479 (479) |-|.-| ..+|.-..-.|+ +.-+ T Consensus 301 ylPyAW----P~~Yr~t~~~dIVaiA~~gt~LV 329 (567) T protein:vir:33 301 YLPYAW----PEVNRHTTAEDIVAICPLGTSLV 329 (567) T ss_pred CCCccc----chhhccCCCCCeEEEeecccEEE Confidence 334434 234433333332 1111 No 13 >protein:vir:10145 Length: 567 # NCBI annotation: hypothetical protein # Family: family:all:1544 # MgeID: mge:180 # MgeName: Stx2 converting bacteriophage II # Cross-refs: genbank:acc:NP_859275;genbank:gi:32171031;genbank:GeneID:2653447 Probab=99.02 E-value=2e-11 Score=79.28 Aligned_cols=255 Identities=15% Similarity=0.189 Sum_probs=106.5 Q ss_pred CCCHHHHhhhhh--hhhhccCceEEEecChHHhhhH---HHHhhCcceeeeccCCCcceeeee--------------hhh Q lcl|NC_018856. 201 RLDEATLNKAAV--IVGKGYGRATDAFMPIGVQADF---TNNLLDRQRVIQPSTAGGFSTGFS--------------INQ 261 (479) Q Consensus 201 ~l~~~~l~~aa~--~i~~~fG~~td~~mp~~vka~f---~~~~~~~qrv~~~~n~g~~~~G~~--------------I~~ 261 (479) -..+.+|..... .|.|-= -+.-+=||..--..| ..-+.|+ ++|.+....+.+.. +.+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~-~~~~~~M~~i~i~~f~Ge~Prl~p~---lLP~~~a~~A~n~~~~~G~itP~~~~~~~~~ 76 (567) T protein:vir:10 1 MMPIAILANSIINPLIFKPE-AVKGISMPYIDITTMRGMMPRVVTS---MLPEHSAVLAEDCHFRFGVITPERQISGVEK 76 (567) T ss_pred Ccchhhhhhhhccceeeccc-ccccceeeEEeecccccccccchhh---hccccccceEEeeeccCCeeeeeeccccccc Confidence 222222211100 000000 000011111111111 0000000 12222222222211 111 Q ss_pred hcCCC-cceecc-cceecCCCce---------------------------------ecccCCcCCC-CCCCcee----Ee Q lcl|NC_018856. 262 FLSTR-GAINLH-GSTIMENDNI---------------------------------LLEGRNPEPN-APQAPAS----VV 301 (479) Q Consensus 262 ~~s~~-G~I~l~-~s~~m~~~~~---------------------------------L~e~~~~~~~-AP~~pa~----v~ 301 (479) ..+.+ .-|=++ ++.|+.-+.+ +.++..|.+. .=+.|++ ++ T Consensus 77 ~~~~~~~Tif~y~~~~W~~w~~~V~~ir~PvAqD~~~rvY~tgdg~Pk~t~~~iat~G~~~~P~~~y~LgVpaps~aP~~ 156 (567) T protein:vir:10 77 TFTIKPKTIFHYRDDFWFAWPDVVDVIRSPIAQDPHGRIYYTDGRFPKVTDATIATKGDGNHPTSSYRLGIPAPTTAPVC 156 (567) T ss_pred ccccCceeeEEEcCcEEEEeCCceeeccCccccCCcceEEEecCCcceeeeeeeeecCCCCCCcchhhcccCCcccccee Confidence 11111 112122 2223322211 2234444442 1122222 22 Q ss_pred eeeccCCCCCCCcccccceEEEEEEEcCcC-ccccccc-eeeeeecCCceEEEEEEecCCCCcccceEEEEEecCCC--c Q lcl|NC_018856. 302 ASIVDDKKGGFRDEDIKTHSYKVVVHSDDA-ESLPSEA-VTAAVAKKDNTVKLEVKLASLYQAQPQFISVYREGTET--G 377 (479) Q Consensus 302 at~~t~~~G~f~~~d~gty~YkVtavn~~G-ES~pS~~-vt~Tv~~~g~sv~ltIT~~~~~~a~~~~y~IYR~~~~~--G 377 (479) +.+.+++-..-.+.|+.++.|++|.|+++| ||.||.+ ..+++...|++|.|+..+++..+..-+..+|||+..++ + T Consensus 157 a~~~~~~~~~~~~~d~etr~Yv~TfVt~~GeES~PS~~S~~~~v~~pg~~V~ls~~p~~~~~~~i~~~RIYRS~tg~~gt 236 (567) T protein:vir:10 157 TVQQGGDVSDDNPNDDETRFYTETFVSDYGEEGPPGPASLEVTLRTPGTAVQLTLAPVPLQNASIKRRRIYRSASGGGEA 236 (567) T ss_pred eecCCCCCCCCCCcccceeEEEEEEEcCCCCcCCCcccccceeeecCCceEEEeeccCCccccccceEEEEEecCCCCce Confidence 222333333345678899999999999999 7888866 35677778999999999898888888999999987764 4 Q ss_pred ceEEEEeeeeeeecCCceEEEeeccccCCCcccceecCCchhhhhh----hhhcchhhccccccCcchhhhhhhhhhhhe Q lcl|NC_018856. 378 HYFLIARVPVSKVNDQGVIEVLDRNQVIPETTDVFVGELTPNVVSL----LELLPMMKLPLAQMNATTTFTVLWYGALAL 453 (479) Q Consensus 378 ~y~li~rv~vs~~n~~g~T~ftD~N~~iPgT~~~fvGe~~p~vi~l----~ellPm~k~Pla~~~~~~~~~V~~ygaL~l 453 (479) +|+|+++++. ++++|+|.- |.+- .|+.=| +..| ..|..|..||++=+-+ ..+-=.||. -- T Consensus 237 dy~lVael~a------s~~sf~D~~---~~~~---lg~~Lp-s~~w~~PP~~m~GL~~m~NGimAg-F~GneV~Fs--Ep 300 (567) T protein:vir:10 237 DFLLVAELDA------SVLSYTDKI---PAKN---LGPSLA-TWDYLPPPENMTGLCLMANGIAAG-FAGNEVMFS--EA 300 (567) T ss_pred eeEEEEeecc------ceeeeeecc---chhh---cccccc-cccccCcCcccceeeecccceEEe-ecCCEEEEe--cC Confidence 8999999876 457999983 2221 110000 0011 1122222333221110 000000000 01 Q ss_pred ecccccEEEEeccccCccccee-------eecC Q lcl|NC_018856. 454 YAPKKWVRIKNVQYIPALAADV-------TVKY 479 (479) Q Consensus 454 ~aPkk~~~ikNV~~~~~~~~~~-------~~~~ 479 (479) |-|.-| ..+|.-..-.|+ +.-+ T Consensus 301 ylPyAW----P~~Yr~t~~~dIVaiA~~gt~LV 329 (567) T protein:vir:10 301 YLPYAW----PEVNRHTTAEDIVAICPLGTSLV 329 (567) T ss_pred CCCccc----chhhccCCCCCeEEEeecccEEE Confidence 334434 234433333332 1111 No 14 >protein:vir:9979 Length: 567 # NCBI annotation: hypothetical protein # Family: family:all:1544 # MgeID: mge:179 # MgeName: Stx1 converting bacteriophage # Cross-refs: genbank:acc:NP_859109;genbank:gi:32170864;genbank:GeneID:2653256 Probab=99.02 E-value=2e-11 Score=79.28 Aligned_cols=255 Identities=15% Similarity=0.189 Sum_probs=106.5 Q ss_pred CCCHHHHhhhhh--hhhhccCceEEEecChHHhhhH---HHHhhCcceeeeccCCCcceeeee--------------hhh Q lcl|NC_018856. 201 RLDEATLNKAAV--IVGKGYGRATDAFMPIGVQADF---TNNLLDRQRVIQPSTAGGFSTGFS--------------INQ 261 (479) Q Consensus 201 ~l~~~~l~~aa~--~i~~~fG~~td~~mp~~vka~f---~~~~~~~qrv~~~~n~g~~~~G~~--------------I~~ 261 (479) -..+.+|..... .|.|-= -+.-+=||..--..| ..-+.|+ ++|.+....+.+.. +.+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~-~~~~~~M~~i~i~~f~Ge~Prl~p~---lLP~~~a~~A~n~~~~~G~itP~~~~~~~~~ 76 (567) T protein:vir:99 1 MMPIAILANSIINPLIFKPE-AVKGISMPYIDITTMRGMMPRVVTS---MLPEHSAVLAEDCHFRFGVITPERQISGVEK 76 (567) T ss_pred Ccchhhhhhhhccceeeccc-ccccceeeEEeecccccccccchhh---hccccccceEEeeeccCCeeeeeeccccccc Confidence 222222211100 000000 000011111111111 0000000 12222222222211 111 Q ss_pred hcCCC-cceecc-cceecCCCce---------------------------------ecccCCcCCC-CCCCcee----Ee Q lcl|NC_018856. 262 FLSTR-GAINLH-GSTIMENDNI---------------------------------LLEGRNPEPN-APQAPAS----VV 301 (479) Q Consensus 262 ~~s~~-G~I~l~-~s~~m~~~~~---------------------------------L~e~~~~~~~-AP~~pa~----v~ 301 (479) ..+.+ .-|=++ ++.|+.-+.+ +.++..|.+. .=+.|++ ++ T Consensus 77 ~~~~~~~Tif~y~~~~W~~w~~~V~~ir~PvAqD~~~rvY~tgdg~Pk~t~~~iat~G~~~~P~~~y~LgVpaps~aP~~ 156 (567) T protein:vir:99 77 TFTIKPKTIFHYRDDFWFAWPDVVDVIRSPIAQDPHGRIYYTDGRFPKVTDATIATKGDGNHPTSSYRLGIPAPTTAPVC 156 (567) T ss_pred ccccCceeeEEEcCcEEEEeCCceeeccCccccCCcceEEEecCCcceeeeeeeeecCCCCCCcchhhcccCCcccccee Confidence 11111 112122 2223322211 2234444442 1122222 22 Q ss_pred eeeccCCCCCCCcccccceEEEEEEEcCcC-ccccccc-eeeeeecCCceEEEEEEecCCCCcccceEEEEEecCCC--c Q lcl|NC_018856. 302 ASIVDDKKGGFRDEDIKTHSYKVVVHSDDA-ESLPSEA-VTAAVAKKDNTVKLEVKLASLYQAQPQFISVYREGTET--G 377 (479) Q Consensus 302 at~~t~~~G~f~~~d~gty~YkVtavn~~G-ES~pS~~-vt~Tv~~~g~sv~ltIT~~~~~~a~~~~y~IYR~~~~~--G 377 (479) +.+.+++-..-.+.|+.++.|++|.|+++| ||.||.+ ..+++...|++|.|+..+++..+..-+..+|||+..++ + T Consensus 157 a~~~~~~~~~~~~~d~etr~Yv~TfVt~~GeES~PS~~S~~~~v~~pg~~V~ls~~p~~~~~~~i~~~RIYRS~tg~~gt 236 (567) T protein:vir:99 157 TVQQGGDVSDDNPNDDETRFYTETFVSDYGEEGPPGPASLEVTLRTPGTAVQLTLAPVPLQNASIKRRRIYRSASGGGEA 236 (567) T ss_pred eecCCCCCCCCCCcccceeEEEEEEEcCCCCcCCCcccccceeeecCCceEEEeeccCCccccccceEEEEEecCCCCce Confidence 222333333345678899999999999999 7888866 35677778999999999898888888999999987764 4 Q ss_pred ceEEEEeeeeeeecCCceEEEeeccccCCCcccceecCCchhhhhh----hhhcchhhccccccCcchhhhhhhhhhhhe Q lcl|NC_018856. 378 HYFLIARVPVSKVNDQGVIEVLDRNQVIPETTDVFVGELTPNVVSL----LELLPMMKLPLAQMNATTTFTVLWYGALAL 453 (479) Q Consensus 378 ~y~li~rv~vs~~n~~g~T~ftD~N~~iPgT~~~fvGe~~p~vi~l----~ellPm~k~Pla~~~~~~~~~V~~ygaL~l 453 (479) +|+|+++++. ++++|+|.- |.+- .|+.=| +..| ..|..|..||++=+-+ ..+-=.||. -- T Consensus 237 dy~lVael~a------s~~sf~D~~---~~~~---lg~~Lp-s~~w~~PP~~m~GL~~m~NGimAg-F~GneV~Fs--Ep 300 (567) T protein:vir:99 237 DFLLVAELDA------SVLSYTDKI---PAKN---LGPSLA-TWDYLPPPENMTGLCLMANGIAAG-FAGNEVMFS--EA 300 (567) T ss_pred eeEEEEeecc------ceeeeeecc---chhh---cccccc-cccccCcCcccceeeecccceEEe-ecCCEEEEe--cC Confidence 8999999876 457999983 2221 110000 0011 1122222333221110 000000000 01 Q ss_pred ecccccEEEEeccccCccccee-------eecC Q lcl|NC_018856. 454 YAPKKWVRIKNVQYIPALAADV-------TVKY 479 (479) Q Consensus 454 ~aPkk~~~ikNV~~~~~~~~~~-------~~~~ 479 (479) |-|.-| ..+|.-..-.|+ +.-+ T Consensus 301 ylPyAW----P~~Yr~t~~~dIVaiA~~gt~LV 329 (567) T protein:vir:99 301 YLPYAW----PEVNRHTTAEDIVAICPLGTSLV 329 (567) T ss_pred CCCccc----chhhccCCCCCeEEEeecccEEE Confidence 334434 234433333332 1111 No 15 >protein:vir:94933 Length: 330 # NCBI annotation: putative phage structural protein # Family: family:all:1120 # MgeID: mge:1538 # MgeName: Xp15 # Cross-refs: genbank:acc:YP_239278;genbank:gi:66392060;genbank:GeneID:5076578 Probab=98.89 E-value=5.9e-11 Score=76.67 Aligned_cols=318 Identities=14% Similarity=0.141 Sum_probs=169.3 Q ss_pred CCccchhhhhhhhcCCccchHHHHHHHH-----HhhhcCCCcChhhccCccccchhhhhhhhh-hheeccccccchhhcc Q lcl|NC_018856. 1 MTELKKEAEAKNKKLPVEAEAELAELVS-----KSFTTGYGITPDTQLDGAAVRRELLEDQVK-MLAFSSNDFTIYPLIN 74 (479) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~e~~~-----Ks~tag~~~~p~~~~~gaalr~esld~~~~-~l~~~~~~f~f~~~i~ 74 (479) |-.. -.|+ ..--|..+. +||++ -+...++.|-...+...+- .|+.. -.++..++ T Consensus 1 ~~~~---------~~~~--~~~~~~~~~~~~p~l~m~a------lTLaea~~l~~d~~~~~VIE~l~~~---s~iL~~lp 60 (330) T protein:vir:94 1 MVRI---------CTPP--LRGRWRTLTHQFPELKMPT------VTLAESAKLSQDHLVSGLIETIVEV---NPLYEMMP 60 (330) T ss_pred Ccee---------cCCc--cccceeehhccccccchhh------hhhhHHhhcCchhhHHHHHHhhhcc---chHHhhcc Confidence 1100 0011 111111010 22221 1222233333333333332 23222 23445555 Q ss_pred ccchhHHHHhhhhhhccCcccccccccc-cccccccCcceEEEEEEEEeeeehhhhhhhH-hhhcchhhHHHHHHHHHHH Q lcl|NC_018856. 75 KQQVNSTVAKYAVFNQHGRTGHSRFVRE-VGVASINDPNIRQKTVQMKFLSDTKQQSLAA-GLVNNIADPMTILTEDAIA 152 (479) Q Consensus 75 k~~~~stv~eY~~~~~~G~~g~~~fv~E-~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~-~lvn~~~Dp~~~~~~~ai~ 152 (479) =.++++....|++....++ ..|... .+.++....++.|.+..++-++.-..|.... ++-++..|-+..|.+..|. T Consensus 61 f~~ve~~~~~~~r~~~lp~---a~~r~~n~~~~~~~~~Tf~q~t~~l~~l~~~~~Vd~~iadl~g~~~d~~~~q~~~~ie 137 (330) T protein:vir:94 61 FTEIEGNALAYNRENVLGD---VQFLAVGGTITAKNPATFTKVTSELTTLIGDAEVNGLIQATRSDFMDQTSVQVASKAK 137 (330) T ss_pred cccccCCcceeeeeecCCc---ceeeeccccccccCcceeeeeeechhhhhhhHHHHHHHHHhcCCHHHHHHHHHHHHHH Confidence 4556677788888776543 334332 2333334567889999999999988887765 4677789999999999999 Q ss_pred HHHHHHHHHHhhcccccCCCCCCcccchhhhHHHhhccCCCEEEc--cCCCCCHHHHhhhhhhhhhccCceEEEecChHH Q lcl|NC_018856. 153 VIAKSIEWAIFYGDAALSSEADGQAGIEFDGLHKLIDQDTNVIDL--KGARLDEATLNKAAVIVGKGYGRATDAFMPIGV 230 (479) Q Consensus 153 ~~~~~iE~a~f~Gd~~l~~~~~~~~gleFDGl~~~I~~~~NviDa--rG~~l~~~~l~~aa~~i~~~fG~~td~~mp~~v 230 (479) .+.+.+|+.+|+||+.- -|||||.+.++. .|+||+ +|+.|+.+.|.++-..+-+--|.+.-++|+-.. T Consensus 138 al~~~~e~~linGDs~~---------~~F~GL~~~~~~-~q~i~tg~~gg~~T~d~LDeLl~~v~~~~g~~~~~l~n~a~ 207 (330) T protein:vir:94 138 SIGRQYQASMITGDGTG---------NSFQGMMGLVAA-SQTISAGANGGTLTFELLDQLLDLVKDKDGQVDYLMSSFAM 207 (330) T ss_pred HHHHHHHHHhhccCCCC---------ccccchhhcCCc-ccEEecCCCCCCCCHHHHHHHHHHhcCCCCCCcEEEechhH Confidence 99999999999999762 269999999986 599999 789999999999887776666777777777665 Q ss_pred hhhHHHHhhC-cceeeeccCCCcceeeeehhhhcCCCcceecccceecCCCceecccCCcCCCCCCCceeEeeeeccCCC Q lcl|NC_018856. 231 QADFTNNLLD-RQRVIQPSTAGGFSTGFSINQFLSTRGAINLHGSTIMENDNILLEGRNPEPNAPQAPASVVASIVDDKK 309 (479) Q Consensus 231 ka~f~~~~~~-~qrv~~~~n~g~~~~G~~I~~~~s~~G~I~l~~s~~m~~~~~L~e~~~~~~~AP~~pa~v~at~~t~~~ 309 (479) ...+...... ..|-+-+... ..-|-.|..| +.-.++.-... |. .. T Consensus 208 ~r~I~a~~R~~~~~~v~~~~~--~~~G~~v~~~---------------~GvPi~~~d~i-----p~-----~~------- 253 (330) T protein:vir:94 208 RRKYFSLLRALGGAAIGEVMT--LPSGRQIPTY---------------RGVPWFVNDFI-----PS-----NM------- 253 (330) T ss_pred HHHHHHHHHhccCCCCCCccc--ccCCCEEeee---------------CCeEEEecccc-----cC-----CC------- Confidence 5555444321 1111111000 0112222222 11111110000 00 00 Q ss_pred CCCCccccc-ceEEEEEEEcCcCccccccceeeeeecCCceEEEEEEecCCCCcccceEEEEEecCCCcceEEEEeeeee Q lcl|NC_018856. 310 GGFRDEDIK-THSYKVVVHSDDAESLPSEAVTAAVAKKDNTVKLEVKLASLYQAQPQFISVYREGTETGHYFLIARVPVS 388 (479) Q Consensus 310 G~f~~~d~g-ty~YkVtavn~~GES~pS~~vt~Tv~~~g~sv~ltIT~~~~~~a~~~~y~IYR~~~~~G~y~li~rv~vs 388 (479) ++. +..| +-=|.|..-.+... -++++- . T Consensus 254 ~~~--~~~~ttsIyav~~G~~~~~----------------------------------------------qgV~Gl---~ 282 (330) T protein:vir:94 254 TQG--TATNATAIFAGTFDDGSNK----------------------------------------------YGIAGL---T 282 (330) T ss_pred Ccc--cCCCceeEEEEeecccccc----------------------------------------------cceEee---c Confidence 000 0000 01111111100000 000000 0 Q ss_pred eecCCceEEEeeccccCCCcccceecCCchhhhhhhhhcchhhccccccCcchhhhhhhhhhhheecccccEEEEecccc Q lcl|NC_018856. 389 KVNDQGVIEVLDRNQVIPETTDVFVGELTPNVVSLLELLPMMKLPLAQMNATTTFTVLWYGALALYAPKKWVRIKNVQYI 468 (479) Q Consensus 389 ~~n~~g~T~ftD~N~~iPgT~~~fvGe~~p~vi~l~ellPm~k~Pla~~~~~~~~~V~~ygaL~l~aPkk~~~ikNV~~~ 468 (479) ..| .||=.--|+|+.+ -.+.+.|.|.||..+++.-|+...+++||... T Consensus 283 ---~~g----------~~glsVr~~G~~~-------------------~k~v~~~~v~~y~~~av~~~~a~~~L~~V~~g 330 (330) T protein:vir:94 283 ---ARG----------SAGLRVQNVGAKE-------------------NADETITRVKMYCGFANFSQLGLAAIKGLIPG 330 (330) T ss_pred ---CCC----------CCcceeeeCCCcc-------------------ccceeeEEEEEeeeeEEechhheeeeccccCC Confidence 000 1221111222111 02456789999999999999999999999988 No 16 >protein:vir:104388 Length: 566 # NCBI annotation: hypothetical protein # Family: family:all:1544 # MgeID: mge:1471 # MgeName: 86 # Cross-refs: genbank:acc:YP_794072;genbank:gi:116222017;genbank:GeneID:4397450 Probab=98.83 E-value=1.3e-10 Score=74.71 Aligned_cols=288 Identities=16% Similarity=0.162 Sum_probs=132.3 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHhhcccccCCCCCCcccchhh-hHHHhhccC-CCEEEccCCC--CCHHHHhh----hhh Q lcl|NC_018856. 141 DPMTILTEDAIAVIAKSIEWAIFYGDAALSSEADGQAGIEFD-GLHKLIDQD-TNVIDLKGAR--LDEATLNK----AAV 212 (479) Q Consensus 141 Dp~~~~~~~ai~~~~~~iE~a~f~Gd~~l~~~~~~~~gleFD-Gl~~~I~~~-~NviDarG~~--l~~~~l~~----aa~ 212 (479) =|.+ ...|+-++| |-|. --+|-|+.- =.+...+|+- |-..+|-+ .|. T Consensus 1 ~~~~------------------~~~~~~~~~-------~~~~~~~~~~~~M~~i~i~~f~Ge~Pr~~p~lLP~~~a~~A~ 55 (566) T protein:vir:10 1 MPIA------------------ILANSIINP-------LIFKPEAVKGISMPYIDITTMRGMMPRVVTSMLPDHSAVLAE 55 (566) T ss_pred Ccee------------------eehhhhccc-------eeecccccccceeeEEeecccccccccchhhhccccccceEE Confidence 1111 122333333 1111 011222221 1244556644 44555543 335 Q ss_pred hhhhccCceEEEecChHHhhhH----HHHhhCccee--eeccC--------CCcceeeeehhhhcCC---Ccceecccce Q lcl|NC_018856. 213 IVGKGYGRATDAFMPIGVQADF----TNNLLDRQRV--IQPST--------AGGFSTGFSINQFLST---RGAINLHGST 275 (479) Q Consensus 213 ~i~~~fG~~td~~mp~~vka~f----~~~~~~~qrv--~~~~n--------~g~~~~G~~I~~~~s~---~G~I~l~~s~ 275 (479) ...-.+|.-+=...|..+..-| .+.|.=+..+ .-+.. ++...--++..+-..| .|.|...|+. T Consensus 56 n~~~~~G~itP~~~~~~~~~~~~~~~kTif~y~~~~W~~w~~~V~~ir~PvAqD~~~rvY~tg~~~Pk~t~~diAt~g~~ 135 (566) T protein:vir:10 56 DCHFRFGVITPERQISGVEKTFTIKPKTIFHYRDDFWFAWPDVVDVIRSPVAQDNYGRIYYTDGKFPKVTAAEIATKGEG 135 (566) T ss_pred eeeecCCeeeeeecccccccccccCceeeeeecCcEeEEeCCceeeccCccccCCcceEEEeeCCcceeeecceeecccc Confidence 5666678888777776665444 1122101000 00000 0000111222222222 3555555544 Q ss_pred ecCCCceecccCCcCCCCCCCceeEeeeeccCCCCCCCcccccceEEEEEEEcCcC-ccccccc-eeeeeecCCceEEEE Q lcl|NC_018856. 276 IMENDNILLEGRNPEPNAPQAPASVVASIVDDKKGGFRDEDIKTHSYKVVVHSDDA-ESLPSEA-VTAAVAKKDNTVKLE 353 (479) Q Consensus 276 ~m~~~~~L~e~~~~~~~AP~~pa~v~at~~t~~~G~f~~~d~gty~YkVtavn~~G-ES~pS~~-vt~Tv~~~g~sv~lt 353 (479) ..-.. ..+..-| +|.....+..+..+++ ....+.|+++|.|++|.|+++| ||.||.+ ..+++.+.|++|.|+ T Consensus 136 ~~pa~----~y~LgVP-aPs~apv~~~~~~sg~-~~~~~~d~~tr~Yv~TfVt~~GeES~PS~~S~~v~v~~~gs~V~lt 209 (566) T protein:vir:10 136 NFPAA----SYRLGIP-APTTAPVCTVQKGEGA-TDENPNDDETRFYTETFVSAYGEEGPPGPESLEVTVGIPDTPVQLT 209 (566) T ss_pred ccccc----cccccCC-CCcccceeeccCCCcc-cCCCCcccceeEEEEEEEcCCCCcCCCccccceeEecCCCceEEEE Confidence 32211 1122222 3432222122222222 2345789999999999999999 7788755 457777888889999 Q ss_pred EEecCCCCcccceEEEEEecCCC--cceEEEEeeeeeeecCCceEEEeeccccCCCcccceecCCchhhhhh----hhhc Q lcl|NC_018856. 354 VKLASLYQAQPQFISVYREGTET--GHYFLIARVPVSKVNDQGVIEVLDRNQVIPETTDVFVGELTPNVVSL----LELL 427 (479) Q Consensus 354 IT~~~~~~a~~~~y~IYR~~~~~--G~y~li~rv~vs~~n~~g~T~ftD~N~~iPgT~~~fvGe~~p~vi~l----~ell 427 (479) +.+++.++...+.++|||+..++ ++|+|+++++. +.++|+|.-. .+- .|+.=| +..| ..|. T Consensus 210 l~~~p~~~~~i~~~RIYRS~tg~~gtdy~lVael~a------s~~sf~Dd~~---~~~---lg~~Lp-s~~w~~PP~~m~ 276 (566) T protein:vir:10 210 LSPVPLQDANINRRRIYRSVSGGGEADFLLVAELEA------SVLSYTDNIP---AKN---LGPSLA-TWDYLPPPENMT 276 (566) T ss_pred ecCCCcCcCCceeEEEEEecCCCCceeEEEEeeecc------cceeeecccc---ccc---cCcccc-cccccCcCcccc Confidence 99999999889999999987654 58999999876 4578999832 221 111000 0001 1122 Q ss_pred chhhccccccCcchhhhhhhhhhhheecccccEEEEeccccCccccee-------eecC Q lcl|NC_018856. 428 PMMKLPLAQMNATTTFTVLWYGALALYAPKKWVRIKNVQYIPALAADV-------TVKY 479 (479) Q Consensus 428 Pm~k~Pla~~~~~~~~~V~~ygaL~l~aPkk~~~ikNV~~~~~~~~~~-------~~~~ 479 (479) .|..||++=+-+ ..+-=.||. --|-|.-| ..+|+-....|+ +.-+ T Consensus 277 GL~~m~NGimAg-F~GneV~Fs--EpylPyAW----P~~Yr~t~~~dIVaiA~~gt~LV 328 (566) T protein:vir:10 277 GLCLMANGIAAG-FAGNEVMFS--EAYLPYAW----PEVNRHTTAEDIVAVCPLGTSLV 328 (566) T ss_pred eeeecccceEEe-ecCCEEEEe--cCCCCccc----chhhccCCCCCeEEEEeccceEE Confidence 222333221100 000000000 01334433 233433333332 1111 No 17 >protein:vir:827 Length: 567 # NCBI annotation: hypothetical protein # Family: family:all:1544 # MgeID: mge:16 # MgeName: VT2-Sa # Cross-refs: genbank:acc:NP_050560;genbank:gi:9633457;genbank:GeneID:1262210 Probab=98.79 E-value=6.3e-11 Score=76.52 Aligned_cols=291 Identities=17% Similarity=0.215 Sum_probs=132.3 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHhhcccccCCCCCCcccchhh-hHHHhhccC-CCEEEccCCC--CCHHHHhh----hh Q lcl|NC_018856. 140 ADPMTILTEDAIAVIAKSIEWAIFYGDAALSSEADGQAGIEFD-GLHKLIDQD-TNVIDLKGAR--LDEATLNK----AA 211 (479) Q Consensus 140 ~Dp~~~~~~~ai~~~~~~iE~a~f~Gd~~l~~~~~~~~gleFD-Gl~~~I~~~-~NviDarG~~--l~~~~l~~----aa 211 (479) +-|.+| ..|+-++| |-|. --+|-|+.- =.+...+|+- |-+.+|-+ .| T Consensus 1 ~~~~~~------------------~~~~~~~~-------~~~~~~~~~~~~M~~i~i~~f~Ge~Prl~p~lLP~~~a~~A 55 (567) T protein:vir:82 1 MMPIAI------------------LANSIINP-------LIFKPEAVKGISMPYIDITTMRGMMPRVVTSMLPEHSAVLA 55 (567) T ss_pred Ccchhh------------------hhhhhccc-------eeecccccccceeeEEeecccccccccchhhhccccccceE Confidence 222222 22333333 1111 011222221 1244556644 44555543 33 Q ss_pred hhhhhccCceEEEecChHHhhhHH----HHhhCcceeeec-cCCCcceeeeehhhhcCCCcceeccccee--cCCCce-- Q lcl|NC_018856. 212 VIVGKGYGRATDAFMPIGVQADFT----NNLLDRQRVIQP-STAGGFSTGFSINQFLSTRGAINLHGSTI--MENDNI-- 282 (479) Q Consensus 212 ~~i~~~fG~~td~~mp~~vka~f~----~~~~~~qrv~~~-~n~g~~~~G~~I~~~~s~~G~I~l~~s~~--m~~~~~-- 282 (479) ....-..|.-+=...|..+..-|+ +.|.=+...-+. ...-...-|-..+ -++..+=++||.. |....+ T Consensus 56 ~n~~~~~G~itP~~~~~~~~~~~~~~~~Tif~y~~~~W~~w~~~V~~ir~PvAq---D~~~rvY~tgdg~Pk~t~~~iat 132 (567) T protein:vir:82 56 EDCHFRFGVITPERQISGVEKTFTIKPKTIFHYRDDFWFAWPDVVDVIRSPIAQ---DPHGRIYYTDGRFPKVTDATIAT 132 (567) T ss_pred EeeeecCCeeeeeecccccccccccCceeeeeecCcEeEEeCCceeeccCcccc---CCcccEEEecCCcceeeeeeeee Confidence 556666788877777766644441 111101000000 0000000000000 0112222233220 111112 Q ss_pred ecccCCcCCC------CCCCceeEeeeeccCCCCCCCcccccceEEEEEEEcCcC-ccccccc-eeeeeecCCceEEEEE Q lcl|NC_018856. 283 LLEGRNPEPN------APQAPASVVASIVDDKKGGFRDEDIKTHSYKVVVHSDDA-ESLPSEA-VTAAVAKKDNTVKLEV 354 (479) Q Consensus 283 L~e~~~~~~~------AP~~pa~v~at~~t~~~G~f~~~d~gty~YkVtavn~~G-ES~pS~~-vt~Tv~~~g~sv~ltI 354 (479) +.++..|.+. +|++ +++++.+.+++-..-.+.|+.++.|++|.|+++| ||.||.+ ..+++...|++|.|+. T Consensus 133 ~G~~~~P~~~y~LgVpaps~-aP~~a~~~~~~~~~~~p~d~etr~Yv~TfVt~~GeES~PS~~S~~~~v~~pg~~V~ls~ 211 (567) T protein:vir:82 133 KGDGNHPTSSYRLGIPAPTT-APVCTVQQGGDVSDDNPNDDETRFYTETFVSDYGEEGPPGPASLEVTLRTPGTAVQLTL 211 (567) T ss_pred cCCCCCCcchhhcccCCccc-cceeeecCCCCCCCCCCccccceEEEEEEEcCCCCcCCCcccccceeeecCCceEEEee Confidence 2345555553 3331 1222323333333335678899999999999999 7888866 4577777899999999 Q ss_pred EecCCCCcccceEEEEEecCCC--cceEEEEeeeeeeecCCceEEEeeccccCCCcccceecCCchhhhhh----hhhcc Q lcl|NC_018856. 355 KLASLYQAQPQFISVYREGTET--GHYFLIARVPVSKVNDQGVIEVLDRNQVIPETTDVFVGELTPNVVSL----LELLP 428 (479) Q Consensus 355 T~~~~~~a~~~~y~IYR~~~~~--G~y~li~rv~vs~~n~~g~T~ftD~N~~iPgT~~~fvGe~~p~vi~l----~ellP 428 (479) .+++..+..-+..+|||+..++ ++|+|+++++. ++++|+|.- |.+- .|+.=| +..| ..|.. T Consensus 212 ~p~~~~~~~i~~~RIYRS~tg~~gtdy~lVael~a------s~~sf~D~~---~~~~---lg~~Lp-s~~w~~PP~~m~G 278 (567) T protein:vir:82 212 APVPLQNASIKRRRIYRSASGGGEADFLLVAELDA------SVLSYTDKI---PAKN---LGPSLA-TWDYLPPPENMTG 278 (567) T ss_pred ccCCccccccceEEEEEecCCCCceeeEEEEeecc------ceeeeeecc---chhh---cccccc-cccccCcCcccce Confidence 9998888888999999987764 48999999876 457999983 2221 110000 0011 11222 Q ss_pred hhhccccccCcchhhhhhhhhhhheecccccEEEEeccccCccccee-------eecC Q lcl|NC_018856. 429 MMKLPLAQMNATTTFTVLWYGALALYAPKKWVRIKNVQYIPALAADV-------TVKY 479 (479) Q Consensus 429 m~k~Pla~~~~~~~~~V~~ygaL~l~aPkk~~~ikNV~~~~~~~~~~-------~~~~ 479 (479) |..||++=+-+ ..+-=.||. --|-|.-| ..+|.-..-.|+ +.-+ T Consensus 279 L~~m~NGimAg-F~GneV~Fs--EpylPyAW----P~~Yr~t~~~dIVaiA~~gt~LV 329 (567) T protein:vir:82 279 LCLMANGIAAG-FAGNEVMFS--EAYLPYAW----PEVNRHTTAEDIVAICPLRTSLV 329 (567) T ss_pred eeecccceEEe-ecCCEEEEe--cCCCCccc----chhhccCCCCCeEEEEecccEEE Confidence 22333221110 000000000 01334434 234433333332 1111 No 18 >protein:vir:97255 Length: 310 # NCBI annotation: hypothetical protein ORF017 # Family: family:all:1120 # MgeID: mge:1657 # MgeName: M6 # Cross-refs: genbank:acc:YP_001294525;genbank:gi:149408246;genbank:GeneID:5237120 Probab=98.79 E-value=4.4e-10 Score=71.89 Aligned_cols=300 Identities=14% Similarity=0.149 Sum_probs=161.9 Q ss_pred CCccchhhhhhhhcCCccchHHHHHHHHHhhhcCCCcChhhccCccccchhhhhhhhhhheeccccccchhhccccchhH Q lcl|NC_018856. 1 MTELKKEAEAKNKKLPVEAEAELAELVSKSFTTGYGITPDTQLDGAAVRRELLEDQVKMLAFSSNDFTIYPLINKQQVNS 80 (479) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~e~~~Ks~tag~~~~p~~~~~gaalr~esld~~~~~l~~~~~~f~f~~~i~k~~~~s 80 (479) || ...+.+.. |- + -..|-++-+|. ++. +-.+|..++=.++.. T Consensus 1 mp--------------altLaea~----k~-------~------~d~l~~~ViE~----~~~---~s~lL~~LpF~~veg 42 (310) T protein:vir:97 1 MA--------------SVTLAESA----KL-------A------QDELVAGVIEN----IIT---VNRMFDVLPFDSIEG 42 (310) T ss_pred Cc--------------ccchHHHh----hc-------C------cchHHHHHHHH----Hhc---cchHHHhCCcccccC Confidence 43 22211111 10 0 00111111111 111 112233333233444 Q ss_pred HHHhhhhhhccCcccc----cccccccccccccCcceEEEEEEEEeeeehhhhhhh-Hhhh-cchhhHHHHHHHHHHHHH Q lcl|NC_018856. 81 TVAKYAVFNQHGRTGH----SRFVREVGVASINDPNIRQKTVQMKFLSDTKQQSLA-AGLV-NNIADPMTILTEDAIAVI 154 (479) Q Consensus 81 tv~eY~~~~~~G~~g~----~~fv~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~-~~lv-n~~~Dp~~~~~~~ai~~~ 154 (479) ....|+|....++.+- ..+..| |+++ +..+..++...++-++....|... +++. ++..|-.++|.+-.|..+ T Consensus 43 ~~~~ynR~~~~~~~~~~~v~~~~~~~-g~~~-~~~t~~~~~~~L~i~~g~~~Vd~~i~dl~~~~~~dq~~~Ql~~~iea~ 120 (310) T protein:vir:97 43 NSLAYNRENVLGDVIMAGVGTTFSGA-GAGK-AAATFTKVNSNLTTIMGDAEVNGLIQATRSGDGNDQTAVQIASKAKSA 120 (310) T ss_pred CcceeeEeeccCCcccccccccccCC-Cccc-cccccceeeeeeeeeeehhhhhhHHHhhhcCChHHHHHHHHHHHHHHH Confidence 4566777665555431 112222 2222 557788999999999999999875 6776 557899999999999999 Q ss_pred HHHHHHHHhhcccccCCCCCCcccchhhhHHHhhccCCCEEEc--cCCCCCHHHHhhhhhhhhhccCceEEEecChHHhh Q lcl|NC_018856. 155 AKSIEWAIFYGDAALSSEADGQAGIEFDGLHKLIDQDTNVIDL--KGARLDEATLNKAAVIVGKGYGRATDAFMPIGVQA 232 (479) Q Consensus 155 ~~~iE~a~f~Gd~~l~~~~~~~~gleFDGl~~~I~~~~NviDa--rG~~l~~~~l~~aa~~i~~~fG~~td~~mp~~vka 232 (479) ...+|+.+++||++-+ |||||.+.++. .++||+ +|+.|+.+.|.++-..+-+.=|.+.-++||+.... T Consensus 121 ~~~~e~~lINGD~a~n---------~F~GL~~~~~~-~q~i~~~~~gg~~t~d~LDeLl~~v~~~~g~p~~~l~~~~~~r 190 (310) T protein:vir:97 121 GRKYQDQLINGNGAGN---------EFAGLIQLCAS-GQKATTGATGSAISFAILDELMDLVVDKDGQVDYLTMHARTLR 190 (310) T ss_pred HHHHHHHhhccccCCC---------cccchhhcCCc-cceeecCCCCCCCCHHHHHHHHHHHhcCCCCCCEEEecHHHHH Confidence 9999999999999743 59999999997 499998 67999999999977777665677888999998766 Q ss_pred hHHHHhhCcc-eeeeccCCCcceeeeehhhhcCCCcceecccceecCCCceecc-cCCcCCCCCCCceeEeeeeccCCCC Q lcl|NC_018856. 233 DFTNNLLDRQ-RVIQPSTAGGFSTGFSINQFLSTRGAINLHGSTIMENDNILLE-GRNPEPNAPQAPASVVASIVDDKKG 310 (479) Q Consensus 233 ~f~~~~~~~q-rv~~~~n~g~~~~G~~I~~~~s~~G~I~l~~s~~m~~~~~L~e-~~~~~~~AP~~pa~v~at~~t~~~G 310 (479) .+..-....- |-+-+ .....-|-.|..| +..++.+ ...|.. .+...+ .| T Consensus 191 ~i~A~~R~~~~~g~~~--~~~~~~G~~v~~~----------------~GiPi~~~d~ip~~-----~~~~~~------~g 241 (310) T protein:vir:97 191 SYKALLRALGGASINE--VVELPSGAEVPAY----------------SGTPIFRNDYIPTN-----QTKGGT------TG 241 (310) T ss_pred HHHHHHHHhcCCCCCC--ccccCCCCEEeee----------------CCeEEEEeCccCCC-----cccccc------CC Confidence 6654443111 11100 0001122222222 1122211 111110 000000 00 Q ss_pred CCCcccccceEEEEEEEcCcCccccccceeeeeecCCceEEEEEEecCCCCcccceEEEEEecCCCcceEEEEeeeeeee Q lcl|NC_018856. 311 GFRDEDIKTHSYKVVVHSDDAESLPSEAVTAAVAKKDNTVKLEVKLASLYQAQPQFISVYREGTETGHYFLIARVPVSKV 390 (479) Q Consensus 311 ~f~~~d~gty~YkVtavn~~GES~pS~~vt~Tv~~~g~sv~ltIT~~~~~~a~~~~y~IYR~~~~~G~y~li~rv~vs~~ 390 (479) .+--|.+..-.+.. ..++++- . T Consensus 242 -------tTsIya~r~Ge~~~----------------------------------------------~~Gv~Gl---~-- 263 (310) T protein:vir:97 242 -------CTTIFAGTLDDGSR----------------------------------------------THGIAGL---T-- 263 (310) T ss_pred -------ceeEEEEeeCcccc----------------------------------------------ccceecc---c-- Confidence 01122222211100 0011100 0 Q ss_pred cCCceEEEeeccccCCCcccceecCCchhhhhhhhhcchhhccccccCcchhhhhhhhhhhheecccccEEEEeccc Q lcl|NC_018856. 391 NDQGVIEVLDRNQVIPETTDVFVGELTPNVVSLLELLPMMKLPLAQMNATTTFTVLWYGALALYAPKKWVRIKNVQY 467 (479) Q Consensus 391 n~~g~T~ftD~N~~iPgT~~~fvGe~~p~vi~l~ellPm~k~Pla~~~~~~~~~V~~ygaL~l~aPkk~~~ikNV~~ 467 (479) ..+ -||=.--|+|+++ -.+...|.|.||..+++.-|+...+++||-= T Consensus 264 -~~~----------~~glsVr~~G~~~-------------------~~~v~~~~V~~Y~~~av~~~~A~a~L~~V~~ 310 (310) T protein:vir:97 264 -ATQ----------AAGIQVVDVGESE-------------------DSDEHIWRVKWYCGLALFSEKGLACADGITN 310 (310) T ss_pred -cCC----------ccceeEEeCCccc-------------------CCcceeEEEEEeeeEEEecccceeeeccccC Confidence 000 0111101222111 1256678999999999999999999999987 No 19 >protein:vir:5120 Length: 615 # NCBI annotation: unknown # Family: family:all:1544 # MgeID: mge:114 # MgeName: PBC5 # Cross-refs: genbank:acc:NP_542277;genbank:gi:18071220;genbank:GeneID:929342 Probab=98.76 E-value=2.8e-10 Score=72.94 Aligned_cols=252 Identities=19% Similarity=0.220 Sum_probs=103.8 Q ss_pred hhccCCCEEEccCCCCCHHH--Hh-----hhhhhhhhccCceEEEecChHHhhhHHHHhhCcceeeeccCCCcceeeeeh Q lcl|NC_018856. 187 LIDQDTNVIDLKGARLDEAT--LN-----KAAVIVGKGYGRATDAFMPIGVQADFTNNLLDRQRVIQPSTAGGFSTGFSI 259 (479) Q Consensus 187 ~I~~~~NviDarG~~l~~~~--l~-----~aa~~i~~~fG~~td~~mp~~vka~f~~~~~~~qrv~~~~n~g~~~~G~~I 259 (479) +.+.+..-=-+|-+.++-.+ |+ +.++.|+. |.-..... +|+ ++|.+....+.+... T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~M~~I~i~~-f~Ge~Prl-------------~P~---lLP~~~A~~A~N~~~ 63 (615) T protein:vir:51 1 MVSTGTRRGTLRSRAPSRLHCYLKQGYLGMVAIKISA-FAGEQPML-------------LPR---LLPETGATAAMNVRL 63 (615) T ss_pred CcccccccceecccCcceeeeeeecCceeeEEEeecc-cccccccc-------------hhh---hccCcccceEEeeee Confidence 44443222222333333222 22 12222222 22111111 111 111122222222222 Q ss_pred h-hhcCCCcc--------------eecccceecCCCcee--cccCCcC---------------C---CCCCCceeEeeee Q lcl|NC_018856. 260 N-QFLSTRGA--------------INLHGSTIMENDNIL--LEGRNPE---------------P---NAPQAPASVVASI 304 (479) Q Consensus 260 ~-~~~s~~G~--------------I~l~~s~~m~~~~~L--~e~~~~~---------------~---~AP~~pa~v~at~ 304 (479) . +-++|.++ |-+|.+.|+--..+- +.+.+.. | -.=+.|++..+.+ T Consensus 64 ~~G~ltP~~~~~~~~~~~~~~~~Tif~~~~~W~~w~~~V~av~sPvA~DRvy~tgdg~Pkv~~~~~sY~LgVpaPs~ap~ 143 (615) T protein:vir:51 64 NDGGLTPINKPIEVATIATASQKTIYRHQGSWLSWPNVVNAVPGPVAQDRLYFTGDGAPKVKIGGVDYALKVPRPTGALT 143 (615) T ss_pred cCCeeeeecCcccccccccccceeeeeecCceeccCCceeEccCCcccceeEEcCCCcceEeecccCccccccCCCccce Confidence 2 22222221 222222232221110 0011100 0 0001122222222 Q ss_pred ccCCCCCCCcccccceEEEEEEEcCcC-ccccccceeeeeecCCceEEEEEEecCCCCcccceEEEEEecCC--CcceEE Q lcl|NC_018856. 305 VDDKKGGFRDEDIKTHSYKVVVHSDDA-ESLPSEAVTAAVAKKDNTVKLEVKLASLYQAQPQFISVYREGTE--TGHYFL 381 (479) Q Consensus 305 ~t~~~G~f~~~d~gty~YkVtavn~~G-ES~pS~~vt~Tv~~~g~sv~ltIT~~~~~~a~~~~y~IYR~~~~--~G~y~l 381 (479) +...++ ...|+.++.|+.|.|+++| ||.||.+........|++|+|+..+++..+...+..+|||+..+ +++|+| T Consensus 144 ~~~~g~--g~~d~etr~Yv~TfVt~~GeES~PSp~S~~v~v~~g~tVtLs~~pa~~~~~~i~~rRIYRS~tg~~gtdy~l 221 (615) T protein:vir:51 144 AALSGT--GSGDIQSRTYVYTWVTSFGEESAPCPASIIVDWKPGQTVTLSGFAATPGGRSITTQRIYRSQTGKTGTGLYL 221 (615) T ss_pred EEecCC--CCccccceEEEEEEEcCCCCcCCCCccceeeEecCCCeEEEeeccCCcCCCceeeEEEEEeccCCCceeeEE Confidence 222222 2236789999999999988 88888665555556788999999988888888888999998765 459999 Q ss_pred EEeeeeeeecCCceEEEeecccc------CCCcccceecCCchhhhhhhhhcchhhccccccCcchhhhhhhhhhhheec Q lcl|NC_018856. 382 IARVPVSKVNDQGVIEVLDRNQV------IPETTDVFVGELTPNVVSLLELLPMMKLPLAQMNATTTFTVLWYGALALYA 455 (479) Q Consensus 382 i~rv~vs~~n~~g~T~ftD~N~~------iPgT~~~fvGe~~p~vi~l~ellPm~k~Pla~~~~~~~~~V~~ygaL~l~a 455 (479) +++.+. ++++|+|.... ||-.- =+|.|. .|..|..||++=+-+ ..+-=.||. --|- T Consensus 222 VAel~a------s~~sf~D~~~~~~Lg~~Lps~~----w~~PP~-----~l~GL~~m~NGimAg-F~GneV~Fs--Epy~ 283 (615) T protein:vir:51 222 IAERAA------SAGNFTDNIAVDQFQEPLPSAD----WNEPPD-----GLAGLAEMPNGMMAA-FVGRSIYFC--EPYR 283 (615) T ss_pred Eeeecc------cceeeeeccchhhcCccccccc----ccCcCc-----chhhhhccccceEEe-ecCCEEEEe--cCCC Confidence 999766 45789998432 11000 011121 123333333221110 000000000 0023 Q ss_pred ccccEEEEeccccC-------cccceeeecC Q lcl|NC_018856. 456 PKKWVRIKNVQYIP-------ALAADVTVKY 479 (479) Q Consensus 456 Pkk~~~ikNV~~~~-------~~~~~~~~~~ 479 (479) |.-| ..+|.- |+|+==+.-+ T Consensus 284 PyAW----P~~Yr~t~d~dIVaiA~~gt~LV 310 (615) T protein:vir:51 284 PHAW----PEKYSRNVGSDIVGIAALGSILV 310 (615) T ss_pred Cccc----chhcccCcCCCeeEEEecccEEE Confidence 3333 122222 2332212111 No 20 >protein:vir:78830 Length: 324 # NCBI annotation: major head protein # Family: family:all:507 # MgeID: mge:1858 # MgeName: 80alpha # Cross-refs: genbank:acc:YP_001285361;genbank:gi:148717889;genbank:GeneID:5246961 Probab=97.87 E-value=1.4e-06 Score=52.71 Aligned_cols=299 Identities=11% Similarity=0.039 Sum_probs=162.6 Q ss_pred CCccchhhhhhhhcCCccchHHHHHHHH--HhhhcCCCcChhhccCccccchhhhhhhhhhheeccccccchhhccccch Q lcl|NC_018856. 1 MTELKKEAEAKNKKLPVEAEAELAELVS--KSFTTGYGITPDTQLDGAAVRRELLEDQVKMLAFSSNDFTIYPLINKQQV 78 (479) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~e~~~--Ks~tag~~~~p~~~~~gaalr~esld~~~~~l~~~~~~f~f~~~i~k~~~ 78 (479) |-+.++..+.. ++.+.... +.+.+...+ .-.+|+.|-.+.+...|.... .+.-.+++.+.+.++ T Consensus 1 ~~~~~~~~~~~---------~~~~~~~~~~~~~~a~~~~---~~~~~~~~iP~~~~~~ii~~~--~~~s~l~~l~~~~~~ 66 (324) T protein:vir:78 1 MEQTQKLKLNL---------QHFASNNVKPQVFNPDNVM---MHEKKDGTLMNEFTTPILQEV--MENSKIMQLGKYEPM 66 (324) T ss_pred CCcchhhhHHH---------HHHHHHhhhhhhhcccccc---ccCcCccccchhHHHHHHHHH--Hhhchhhhhcceeec Confidence 76665544433 22222211 223333222 223466677777777664333 333345666666666 Q ss_pred hHHHHhhhhhhccCcccccccccccccccccCcceEEEEEEEEeeeehhhhhhhHhhhcchhhHHHHHHHHHHHHHHHHH Q lcl|NC_018856. 79 NSTVAKYAVFNQHGRTGHSRFVREVGVASINDPNIRQKTVQMKFLSDTKQQSLAAGLVNNIADPMTILTEDAIAVIAKSI 158 (479) Q Consensus 79 ~stv~eY~~~~~~G~~g~~~fv~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lvn~~~Dp~~~~~~~ai~~~~~~i 158 (479) .+.-.+|.++.. .+...+++|++..+..++.+.+.....+=++---.+|.-+- .++..|.+....+.--..++..+ T Consensus 67 ~~~~~~~p~~~~---~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~~~is~ell-~ds~~~l~~~i~~~la~ai~~~~ 142 (324) T protein:vir:78 67 EGTEKKFTFWAD---KPGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFL-NYTYSQFFEEMKPMIAEAFYKKF 142 (324) T ss_pred cCCceEEEEEec---CcceeEecCCccccccccceeEEEEeeEEEEEeehhhHHHH-hcchHHHHHHHHHHHHHHHHHHH Confidence 654455666543 34567899999999999999999999999988888887432 24456788888888889999999 Q ss_pred HHHHhhcccccCCCCCCcccchhhhHHHhhccCCCEEEccCCCCCHHHHhhhhhhhhhccCceEEEecChHHhhhHHHHh Q lcl|NC_018856. 159 EWAIFYGDAALSSEADGQAGIEFDGLHKLIDQDTNVIDLKGARLDEATLNKAAVIVGKGYGRATDAFMPIGVQADFTNNL 238 (479) Q Consensus 159 E~a~f~Gd~~l~~~~~~~~gleFDGl~~~I~~~~NviDarG~~l~~~~l~~aa~~i~~~fG~~td~~mp~~vka~f~~~~ 238 (479) |.++|+|+-.=+ +..|+.+.+... +... ...++.+.|.++.-.+..+|..+.-+.|++.+...+...- T Consensus 143 d~a~l~G~g~~~---------~~~gi~~~~~~~-~~~~--~~~~t~~~i~~~~~~l~~~~~~~~~~vmn~~~~~~L~~l~ 210 (324) T protein:vir:78 143 DEAGILNQGNNP---------FGKSIAQSIEKT-NKVI--KGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIV 210 (324) T ss_pred HHHHhccCCCCC---------cCcccccccccc-ceec--cccccHHHHHHHHHhhhhccCCCCEEEEcHHHHHHHHHhh Confidence 999999975421 224565555432 3332 2345677778777777888888888999999999887665 Q ss_pred hCcceeeeccCCCcceeeeehhhhcCCCcceecccceecCCC-c-eecc--cCCcCCCCCCCceeEeeeeccCCCCC--- Q lcl|NC_018856. 239 LDRQRVIQPSTAGGFSTGFSINQFLSTRGAINLHGSTIMEND-N-ILLE--GRNPEPNAPQAPASVVASIVDDKKGG--- 311 (479) Q Consensus 239 ~~~qrv~~~~n~g~~~~G~~I~~~~s~~G~I~l~~s~~m~~~-~-~L~e--~~~~~~~AP~~pa~v~at~~t~~~G~--- 311 (479) ...-|.+.++..+..-.|++|-...+. .+. .+..++.+. . ++.. +..... ... ...+...+.+|. T Consensus 211 d~~G~~~~~~~~~~~l~G~PV~~~~~~--~~~-~~~~~~gd~~~~~~g~~~~~~i~~-~~~----~~~~~~~~~~~~~~~ 282 (324) T protein:vir:78 211 DPETKERIYDRNSDSLDGLPVVNLKSS--NLK-RGELITGDFDKLIYGIPQLIEYKI-DET----AQLSTVKNEDGTPVN 282 (324) T ss_pred ccCCCeeecCCCCCcccceeeEeeCCC--CCC-cceEEEEecceEEEEEecCcEEEE-eec----ccccccccccccchh Confidence 555566665444444556655321111 110 111111110 0 0100 000000 000 000001111111 Q ss_pred -CCcccc-----------c----ceEEEEEEEcCcCccccccc Q lcl|NC_018856. 312 -FRDEDI-----------K----THSYKVVVHSDDAESLPSEA 338 (479) Q Consensus 312 -f~~~d~-----------g----ty~YkVtavn~~GES~pS~~ 338 (479) |.. +. + ...-+++-++...|..|++. T Consensus 283 ~f~~-d~~~~r~~~r~d~~v~~~~A~~~l~~a~~~~~~~~~~~ 324 (324) T protein:vir:78 283 LFEQ-DMVALRATMHVALHIADDKAFAKLVPADKRTDSVPGEV 324 (324) T ss_pred hhhc-CcEEEEEEEEEccEEecccceEEEecccccCCCCCCCC Confidence 100 00 0 01123344444555555554 No 21 >protein:vir:96392 Length: 324 # NCBI annotation: ORF011 # Family: family:all:507 # MgeID: mge:1613 # MgeName: 53 # Cross-refs: genbank:acc:YP_239648;genbank:gi:66395381;genbank:GeneID:5132868 Probab=97.87 E-value=1.4e-06 Score=52.71 Aligned_cols=299 Identities=11% Similarity=0.039 Sum_probs=162.6 Q ss_pred CCccchhhhhhhhcCCccchHHHHHHHH--HhhhcCCCcChhhccCccccchhhhhhhhhhheeccccccchhhccccch Q lcl|NC_018856. 1 MTELKKEAEAKNKKLPVEAEAELAELVS--KSFTTGYGITPDTQLDGAAVRRELLEDQVKMLAFSSNDFTIYPLINKQQV 78 (479) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~e~~~--Ks~tag~~~~p~~~~~gaalr~esld~~~~~l~~~~~~f~f~~~i~k~~~ 78 (479) |-+.++..+.. ++.+.... +.+.+...+ .-.+|+.|-.+.+...|.... .+.-.+++.+.+.++ T Consensus 1 ~~~~~~~~~~~---------~~~~~~~~~~~~~~a~~~~---~~~~~~~~iP~~~~~~ii~~~--~~~s~l~~l~~~~~~ 66 (324) T protein:vir:96 1 MEQTQKLKLNL---------QHFASNNVKPQVFNPDNVM---MHEKKDGTLMNEFTTPILQEV--MENSKIMQLGKYEPM 66 (324) T ss_pred CCcchhhhHHH---------HHHHHHhhhhhhhcccccc---ccCcCccccchhHHHHHHHHH--Hhhchhhhhcceeec Confidence 76665544433 22222211 223333222 223466677777777664333 333345666666666 Q ss_pred hHHHHhhhhhhccCcccccccccccccccccCcceEEEEEEEEeeeehhhhhhhHhhhcchhhHHHHHHHHHHHHHHHHH Q lcl|NC_018856. 79 NSTVAKYAVFNQHGRTGHSRFVREVGVASINDPNIRQKTVQMKFLSDTKQQSLAAGLVNNIADPMTILTEDAIAVIAKSI 158 (479) Q Consensus 79 ~stv~eY~~~~~~G~~g~~~fv~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lvn~~~Dp~~~~~~~ai~~~~~~i 158 (479) .+.-.+|.++.. .+...+++|++..+..++.+.+.....+=++---.+|.-+- .++..|.+....+.--..++..+ T Consensus 67 ~~~~~~~p~~~~---~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~~~is~ell-~ds~~~l~~~i~~~la~ai~~~~ 142 (324) T protein:vir:96 67 EGTEKKFTFWAD---KPGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFL-NYTYSQFFEEMKPMIAEAFYKKF 142 (324) T ss_pred cCCceEEEEEec---CcceeEecCCccccccccceeEEEEeeEEEEEeehhhHHHH-hcchHHHHHHHHHHHHHHHHHHH Confidence 654455666543 34567899999999999999999999999988888887432 24456788888888889999999 Q ss_pred HHHHhhcccccCCCCCCcccchhhhHHHhhccCCCEEEccCCCCCHHHHhhhhhhhhhccCceEEEecChHHhhhHHHHh Q lcl|NC_018856. 159 EWAIFYGDAALSSEADGQAGIEFDGLHKLIDQDTNVIDLKGARLDEATLNKAAVIVGKGYGRATDAFMPIGVQADFTNNL 238 (479) Q Consensus 159 E~a~f~Gd~~l~~~~~~~~gleFDGl~~~I~~~~NviDarG~~l~~~~l~~aa~~i~~~fG~~td~~mp~~vka~f~~~~ 238 (479) |.++|+|+-.=+ +..|+.+.+... +... ...++.+.|.++.-.+..+|..+.-+.|++.+...+...- T Consensus 143 d~a~l~G~g~~~---------~~~gi~~~~~~~-~~~~--~~~~t~~~i~~~~~~l~~~~~~~~~~vmn~~~~~~L~~l~ 210 (324) T protein:vir:96 143 DEAGILNQGNNP---------FGKSIAQSIEKT-NKVI--KGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIV 210 (324) T ss_pred HHHHhccCCCCC---------cCcccccccccc-ceec--cccccHHHHHHHHHhhhhccCCCCEEEEcHHHHHHHHHhh Confidence 999999975421 224565555432 3332 2345677778777777888888888999999999887665 Q ss_pred hCcceeeeccCCCcceeeeehhhhcCCCcceecccceecCCC-c-eecc--cCCcCCCCCCCceeEeeeeccCCCCC--- Q lcl|NC_018856. 239 LDRQRVIQPSTAGGFSTGFSINQFLSTRGAINLHGSTIMEND-N-ILLE--GRNPEPNAPQAPASVVASIVDDKKGG--- 311 (479) Q Consensus 239 ~~~qrv~~~~n~g~~~~G~~I~~~~s~~G~I~l~~s~~m~~~-~-~L~e--~~~~~~~AP~~pa~v~at~~t~~~G~--- 311 (479) ...-|.+.++..+..-.|++|-...+. .+. .+..++.+. . ++.. +..... ... ...+...+.+|. T Consensus 211 d~~G~~~~~~~~~~~l~G~PV~~~~~~--~~~-~~~~~~gd~~~~~~g~~~~~~i~~-~~~----~~~~~~~~~~~~~~~ 282 (324) T protein:vir:96 211 DPETKERIYDRNSDSLDGLPVVNLKSS--NLK-RGELITGDFDKLIYGIPQLIEYKI-DET----AQLSTVKNEDGTPVN 282 (324) T ss_pred ccCCCeeecCCCCCcccceeeEeeCCC--CCC-cceEEEEecceEEEEEecCcEEEE-eec----ccccccccccccchh Confidence 555566665444444556655321111 110 111111110 0 0100 000000 000 000001111111 Q ss_pred -CCcccc-----------c----ceEEEEEEEcCcCccccccc Q lcl|NC_018856. 312 -FRDEDI-----------K----THSYKVVVHSDDAESLPSEA 338 (479) Q Consensus 312 -f~~~d~-----------g----ty~YkVtavn~~GES~pS~~ 338 (479) |.. +. + ...-+++-++...|..|++. T Consensus 283 ~f~~-d~~~~r~~~r~d~~v~~~~A~~~l~~a~~~~~~~~~~~ 324 (324) T protein:vir:96 283 LFEQ-DMVALRATMHVALHIADDKAFAKLVPADKRTDSVPGEV 324 (324) T ss_pred hhhc-CcEEEEEEEEEccEEecccceEEEecccccCCCCCCCC Confidence 100 00 0 01123344444555555554 No 22 >protein:vir:7771 Length: 330 # NCBI annotation: gp17 # Family: family:all:507 # MgeID: mge:149 # MgeName: Bxz2 # Cross-refs: genbank:acc:NP_817605;genbank:gi:29566035;genbank:GeneID:1259229 Probab=97.80 E-value=1.8e-05 Score=46.57 Aligned_cols=323 Identities=11% Similarity=0.026 Sum_probs=160.7 Q ss_pred HHHHHHHhhhcCCCcChhhccCccccchhhhhhhhhhheeccccccchhhccccchhHHHHhhhhhhccCcccccccccc Q lcl|NC_018856. 23 LAELVSKSFTTGYGITPDTQLDGAAVRRELLEDQVKMLAFSSNDFTIYPLINKQQVNSTVAKYAVFNQHGRTGHSRFVRE 102 (479) Q Consensus 23 ~~e~~~Ks~tag~~~~p~~~~~gaalr~esld~~~~~l~~~~~~f~f~~~i~k~~~~stv~eY~~~~~~G~~g~~~fv~E 102 (479) +.. ..+.+...+. +..+|+.+..+..++-+..|.. .-.+++.+.+.+..+.-.+|.+.. +.....|++| T Consensus 1 m~~---~~~~a~~~~~--t~~~g~~i~~~~~~~ii~~~~~---~s~l~~~~~~~~~~~~~~~~p~~~---~~~~a~~v~E 69 (330) T protein:vir:77 1 MAG---STVPSTQVAL--TGDFSAFLTPEQSQDYFAEIEK---TSIVQRIARKVPMGPTGISIPHWT---GAVSASWTGE 69 (330) T ss_pred Ccc---cccchhhccc--cCCCcceechhHHHHHHHHHHh---ccchhhhcceeeccCCceEEEEEc---CCcceeEecC Confidence 000 0011111121 2334666777766655444432 334666666666666555566643 3445679999 Q ss_pred cccccccCcceEEEEEEEEeeeehhhhhhhHhhhcchhhHHHHHHHHHHHHHHHHHHHHHhhcccccCCCCCCcccchhh Q lcl|NC_018856. 103 VGVASINDPNIRQKTVQMKFLSDTKQQSLAAGLVNNIADPMTILTEDAIAVIAKSIEWAIFYGDAALSSEADGQAGIEFD 182 (479) Q Consensus 103 ~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lvn~~~Dp~~~~~~~ai~~~~~~iE~a~f~Gd~~l~~~~~~~~gleFD 182 (479) ++..+.+++.+.+....++=++.--.+|.-+ +.++..|.+....+.-...+++.+|.++|+|+.+ |-+++ T Consensus 70 g~~~~~~~~~f~~i~~~~~k~~~~~~is~el-l~ds~~~~~~~i~~~l~~ai~~~~~~~~l~G~g~---------~~~~~ 139 (330) T protein:vir:77 70 AERKPITKGSFGKQELEPVKITTIFAESAEV-VRLNPLNYLNTMRTKIAEAIALKFDAAAIHGIDK---------PSAFK 139 (330) T ss_pred CCccccccceeeEEEEeEEEEEEeehhhHHH-HhcchHHHHHHHHHHHHHHHHHHHHHHhhcccCC---------CCccc Confidence 9999999999999999999999888888853 3456678889999999999999999999999875 23468 Q ss_pred hHHHhhccCCCEEEccC---CCCC---HHHHhhhhhhhhhccCceEEEecChHHhhhHHHHhhCcceeeeccCCCcceee Q lcl|NC_018856. 183 GLHKLIDQDTNVIDLKG---ARLD---EATLNKAAVIVGKGYGRATDAFMPIGVQADFTNNLLDRQRVIQPSTAGGFSTG 256 (479) Q Consensus 183 Gl~~~I~~~~NviDarG---~~l~---~~~l~~aa~~i~~~fG~~td~~mp~~vka~f~~~~~~~qrv~~~~n~g~~~~G 256 (479) |+.+.+....++.+..+ ...+ .+.|.++-..+.+++...+-.+|+..+.+.+...-....|.+.+.+.+... T Consensus 140 g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~vmn~~~~~~l~~lkd~~G~~l~~~~~~~~~-- 217 (330) T protein:vir:77 140 GYLAETTKVVSLADTNLTTASGPQGNAYLAVNNALSLLVNSGKKWTGTLLDNVTEPILNTAVDGNGRPLFVESTYTEQ-- 217 (330) T ss_pred cccccccccceeecccccccccccchhHHHHHHHHHhhhhcCCCccEEEEcHHHHHHHHHHhccCCceeecCcccccc-- Confidence 88887754433322222 1222 334555555667778888889999999998876555555555443222111 Q ss_pred eehhhhcCCCcceecccceecCCCceecccCCcCCCCCCCceeEeeeeccCCCCCCCcccccceEEEEEEEcCcCcc-cc Q lcl|NC_018856. 257 FSINQFLSTRGAINLHGSTIMENDNILLEGRNPEPNAPQAPASVVASIVDDKKGGFRDEDIKTHSYKVVVHSDDAES-LP 335 (479) Q Consensus 257 ~~I~~~~s~~G~I~l~~s~~m~~~~~L~e~~~~~~~AP~~pa~v~at~~t~~~G~f~~~d~gty~YkVtavn~~GES-~p 335 (479) .....+.+++..+-...+ .+ |. ++.+.-...-.|..++.+ .....|-+ .- T Consensus 218 -----------~~~~~~~~l~G~PV~~~~-~~-----p~-----------~~~~~~~~~~~gd~s~~~-i~~~~~~~i~~ 268 (330) T protein:vir:77 218 -----------VGAIREGRILGRPTYVAD-NV-----VN-----------GTVGNRVVGVMGDFSQVI-WGQIGGLSFDV 268 (330) T ss_pred -----------ccccCCceecceeeEEec-cc-----cC-----------CCCCCccEEEEEecceEE-EEEecCcEEEE Confidence 112233344443322211 11 11 000100111112233222 22222211 11 Q ss_pred ccceeeeeecCCceEEEEEEecCCCCcccceEEEEEecCCCcceEEEEeeeeeeecCCceEEEeeccccCCCcccceecC Q lcl|NC_018856. 336 SEAVTAAVAKKDNTVKLEVKLASLYQAQPQFISVYREGTETGHYFLIARVPVSKVNDQGVIEVLDRNQVIPETTDVFVGE 415 (479) Q Consensus 336 S~~vt~Tv~~~g~sv~ltIT~~~~~~a~~~~y~IYR~~~~~G~y~li~rv~vs~~n~~g~T~ftD~N~~iPgT~~~fvGe 415 (479) +.....+...... .......+.-|.+. --.|+...|+...-.+.......+... +|++ +| T Consensus 269 ~~e~~~~~~~~~~-----------~~~~~~~~~~f~~~--~~~~r~~~r~d~~v~~~~a~~~i~~~~---~~~~----~~ 328 (330) T protein:vir:77 269 TDQATLDFGEEQG-----------GVWVPKLISLWQHN--MVAVRCEAEFAFMVNDKDAFVKLTDQV---AGTD----PE 328 (330) T ss_pred eecceeeeccccc-----------ccccccccchhhcC--cEEEEEEEEeccEEecccceEEEEecc---CCcC----CC Confidence 1111111100000 00000112223211 122233333333322222222222221 1111 00 Q ss_pred Cchh Q lcl|NC_018856. 416 LTPN 419 (479) Q Consensus 416 ~~p~ 419 (479) = + T Consensus 329 ~--~ 330 (330) T protein:vir:77 329 E--E 330 (330) T ss_pred C--C Confidence 0 0 No 23 >protein:vir:105563 Length: 396 # NCBI annotation: hypothetical protein # Family: family:all:27455 # MgeID: mge:1540 # MgeName: F116 # Cross-refs: genbank:acc:YP_164316;genbank:gi:56692963;genbank:GeneID:3197174 Probab=97.62 E-value=3.1e-07 Score=56.29 Aligned_cols=266 Identities=16% Similarity=0.149 Sum_probs=100.1 Q ss_pred HHHHHHHHhhcccccCCCCCCcccchhhhHHHhhccCCCE-EEccCCC--------CCHHHHhh--hhhhhhhccCceEE Q lcl|NC_018856. 155 AKSIEWAIFYGDAALSSEADGQAGIEFDGLHKLIDQDTNV-IDLKGAR--------LDEATLNK--AAVIVGKGYGRATD 223 (479) Q Consensus 155 ~~~iE~a~f~Gd~~l~~~~~~~~gleFDGl~~~I~~~~Nv-iDarG~~--------l~~~~l~~--aa~~i~~~fG~~td 223 (479) |-+.--.-|-|=..+.++..=+.|-|=+++ -...+.|| ||+.|+. ++...|.- .+-...+.|+.- T Consensus 1 ~~~~~~~~~~ginnv~~e~~l~~~~~~~~~--~~r~a~nvdi~~~G~~~~r~~~tr~~~g~l~~~~~~~~~~~~~~~~-- 76 (396) T protein:vir:10 1 MATTSLVPLAGINNVAEDAALQRGGESPRL--YVRDAVNIDLSPAGKAQLRASVRQVTDQPFRQLWQSPLHGDAFGAL-- 76 (396) T ss_pred CcceeeeeeecccccccccccccCCCcccc--eeeeeeeecccCCCchhhhccCcccCCceecccccCccccceeeeC-- Confidence 222222223333333332211111111111 11222343 5665543 22222210 001111111111 Q ss_pred EecChHHhhhHHHHhhCcceeeeccCCCcceeeeehhhhcCCCcceecc---cceecCCC-ce-e-cccCC--cCCCCCC Q lcl|NC_018856. 224 AFMPIGVQADFTNNLLDRQRVIQPSTAGGFSTGFSINQFLSTRGAINLH---GSTIMEND-NI-L-LEGRN--PEPNAPQ 295 (479) Q Consensus 224 ~~mp~~vka~f~~~~~~~qrv~~~~n~g~~~~G~~I~~~~s~~G~I~l~---~s~~m~~~-~~-L-~e~~~--~~~~AP~ 295 (479) ++.-+...++. -+.-..+ .-.+|+|... +-+..... .+ . +.+.. ...++|+ T Consensus 77 ----------------~~tl~~~~~~~--w~~~~~v---~v~~~pva~d~~~~Rvy~t~~~~p~~~~~~~~y~L~vp~P~ 135 (396) T protein:vir:10 77 ----------------GDQWGKVDPHS--WTFEPLA---QIGEGDLSHEVLNNRVCVAGTAGIFTYDGAQAERLTLDTPA 135 (396) T ss_pred ----------------CceEEEEeCCe--EEEEeee---eeccCchhccccCCeEEEEcCCCceeeeCCcceecCcCCCc Confidence 11111111010 0000000 0112222210 00000000 00 1 11110 1122333 Q ss_pred CceeEeeeeccCCCCCCCcccccceEEEEEEEcCcCccccccceeeeeecCCceEEEEEEecCCCCcccceEEEEEecCC Q lcl|NC_018856. 296 APASVVASIVDDKKGGFRDEDIKTHSYKVVVHSDDAESLPSEAVTAAVAKKDNTVKLEVKLASLYQAQPQFISVYREGTE 375 (479) Q Consensus 296 ~pa~v~at~~t~~~G~f~~~d~gty~YkVtavn~~GES~pS~~vt~Tv~~~g~sv~ltIT~~~~~~a~~~~y~IYR~~~~ 375 (479) ++. ..+ ..|. .+.++|.|.++.|+..||+.++.+++..++ .+..++|+++++ .+.+.+.++|||++++ T Consensus 136 ~a~-~~a-----~~Gs---l~~~~~~Y~~t~V~~~gEEs~p~~~S~~v~-~~gg~~vtl~~~--~~~~i~~~RiYrS~~~ 203 (396) T protein:vir:10 136 PPL-LVA-----GAGS---LSQGTYGAAVAWLRGPQESAPSLIAFAEVT-DAGALEVTFPLC--LDASVTGARLYLTRAN 203 (396) T ss_pred ccc-ccc-----ccCc---cCCceEEEEEEEEecCCCcCcccccccccC-CCCCcEEEEEcc--cCCCcceEEEEEeCCC Confidence 221 111 1332 355899999999999998777777666555 455677777754 3445678999999999 Q ss_pred CcceEEEEeeeeeeecCCceEEEeeccccCCCcccceecCCchhhhhhhhhcchhhccccccCcchhh-------hhhhh Q lcl|NC_018856. 376 TGHYFLIARVPVSKVNDQGVIEVLDRNQVIPETTDVFVGELTPNVVSLLELLPMMKLPLAQMNATTTF-------TVLWY 448 (479) Q Consensus 376 ~G~y~li~rv~vs~~n~~g~T~ftD~N~~iPgT~~~fvGe~~p~vi~l~ellPm~k~Pla~~~~~~~~-------~V~~y 448 (479) ++.|+|++..+++ +.+|++.. +| .++.|.. +..|.| +|++.+-+-..+ -.+|| T Consensus 204 G~~~~l~aE~~a~------~~s~vlPs--~~-------w~gpP~~--~~gL~p---mP~G~~~A~faGRi~~A~Gn~V~F 263 (396) T protein:vir:10 204 GGELLLAGDYPLG------AATVILPT--LP-------ELGRPAQ--FRHLSP---MPTGKHLAYWRGRLLIARANVLRF 263 (396) T ss_pred hhhhhheehhccc------eeeeeeec--CC-------CCCCCcc--cccccc---CchhHhhhhhcceEEEEeCCEEEE Confidence 9999999988765 45665532 12 1122221 234444 454433333221 22333 Q ss_pred hhhheecccccEEEEeccc----cCcccceeeecC Q lcl|NC_018856. 449 GALALYAPKKWVRIKNVQY----IPALAADVTVKY 479 (479) Q Consensus 449 gaL~l~aPkk~~~ikNV~~----~~~~~~~~~~~~ 479 (479) .-..+ |.-|-+-++-.. +=+||+=-+--| T Consensus 264 SEp~~--Ph~~~~~~~~~~~~~~Iv~lapv~~gL~ 296 (396) T protein:vir:10 264 SEALA--YHLHDERYGFVQMPQRITFVQPVDGGIW 296 (396) T ss_pred ecCCC--CceecchhccCCCCCceEEEEEecCeEE Confidence 22221 210111111111 111111101111 No 24 >protein:vir:9309 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:165 # MgeName: phi 11 # Cross-refs: genbank:acc:NP_803287;genbank:gi:29028597;genbank:GeneID:1258044 Probab=97.61 E-value=1.2e-05 Score=47.55 Aligned_cols=299 Identities=11% Similarity=0.056 Sum_probs=155.3 Q ss_pred CCccchhhhhhhhcCCccchHHHHHHHHHhhhcCCCcChhhccCccccchhhhhhhhhhheeccccccchhhccccchhH Q lcl|NC_018856. 1 MTELKKEAEAKNKKLPVEAEAELAELVSKSFTTGYGITPDTQLDGAAVRRELLEDQVKMLAFSSNDFTIYPLINKQQVNS 80 (479) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~e~~~Ks~tag~~~~p~~~~~gaalr~esld~~~~~l~~~~~~f~f~~~i~k~~~~s 80 (479) |-|.++.+.. ..+ .+......+.|.|...+..+ +++.|-.+.+..+|..+.... ..+.+...+.++.+ T Consensus 1 ~~~~~~~~~~-~~~------f~~~~~~~~~~~a~~~~~~~---~~~~liP~~~~~~ii~~~~~~--s~l~~l~~~~~~~~ 68 (324) T protein:vir:93 1 MEQTQKLKLN-LQH------FASNNVKPQVFNPDNVMMHE---KKDGTLLNDFTTPILQEVMEN--SKIMQLGKYEPMEG 68 (324) T ss_pred CchhHHHHHH-HHH------HHHhhhhhhhcccccccccC---CCcceechhHHHHHHHHHHhh--chhhhhcceeeccC Confidence 6554443322 111 12222222455555433222 344466777777775444322 23444455555655 Q ss_pred HHHhhhhhhccCcccccccccccccccccCcceEEEEEEEEeeeehhhhhhhHhhhcchhhHHHHHHHHHHHHHHHHHHH Q lcl|NC_018856. 81 TVAKYAVFNQHGRTGHSRFVREVGVASINDPNIRQKTVQMKFLSDTKQQSLAAGLVNNIADPMTILTEDAIAVIAKSIEW 160 (479) Q Consensus 81 tv~eY~~~~~~G~~g~~~fv~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lvn~~~Dp~~~~~~~ai~~~~~~iE~ 160 (479) ...+|.++. +.....+++|++..+..++++.+.....+=++..-.+|.-+- .++..|.+....+.--..+++.+|. T Consensus 69 ~~~~ip~~~---~~~~a~~v~Eg~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell-~ds~~~l~~~i~~~l~~aia~~~d~ 144 (324) T protein:vir:93 69 TEKKFTFWA---DKPGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFL-NYTYSQFFEEMKPMIAEAFYKKFDE 144 (324) T ss_pred CceEEEEEe---cCcceeeecCCccccccccceeEEEEEeEEEEEeehhhHHHH-hcchHHHHHHHHHHHHHHHHHHHHH Confidence 444566644 234567999999999999999999999999998888887432 2455677888888888889999999 Q ss_pred HHhhcccccCCCCCCcccchhhhHHHhhccCCCEEEccCCCCCHHHHhhhhhhhhhccCceEEEecChHHhhhHHHHhhC Q lcl|NC_018856. 161 AIFYGDAALSSEADGQAGIEFDGLHKLIDQDTNVIDLKGARLDEATLNKAAVIVGKGYGRATDAFMPIGVQADFTNNLLD 240 (479) Q Consensus 161 a~f~Gd~~l~~~~~~~~gleFDGl~~~I~~~~NviDarG~~l~~~~l~~aa~~i~~~fG~~td~~mp~~vka~f~~~~~~ 240 (479) ++++|+-.- .+..|+...+.. .+.... ..++.+.|.++.-.+..+|+......|++.+.+.+...-.. T Consensus 145 a~l~G~g~~---------~~~~~~~~~~~~-~~~~~~--~~~~~~~i~~~~~~l~~~~~~~~~~v~n~~~~~~L~~l~d~ 212 (324) T protein:vir:93 145 AGILNQGNN---------PFGKSIAQSIEK-TNKVIK--GDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDP 212 (324) T ss_pred HHhcCCCCC---------CcCccccccccc-cceecc--ccccHHHHHHHHHhhhhccCCCCEEEEcHHHHHHHHHhhCC Confidence 999997541 123455555543 233322 23556667776667777888888899999999988765555 Q ss_pred cceeeeccCCCcceeeeehhhhc---CCCcceecccc---ee-cCCCceecccCCcCCCCCCCceeEeeeeccCCCCC-- Q lcl|NC_018856. 241 RQRVIQPSTAGGFSTGFSINQFL---STRGAINLHGS---TI-MENDNILLEGRNPEPNAPQAPASVVASIVDDKKGG-- 311 (479) Q Consensus 241 ~qrv~~~~n~g~~~~G~~I~~~~---s~~G~I~l~~s---~~-m~~~~~L~e~~~~~~~AP~~pa~v~at~~t~~~G~-- 311 (479) .-|.+.+......-.|++|--.. ...+.+- -|+ .+ ..+..+-++ ....+. .+.....++. T Consensus 213 ~G~~~~~~~~~~~l~G~PVv~~~~~~~~~~~i~-~gdfs~~~~~~~~~~~i~---~~~~~~-------~~~~~~~~~~~~ 281 (324) T protein:vir:93 213 ETKERIYDRNSDSLDGLPVVNLKSSNLKRGELI-TGDFDKLIYGIPQLIEYK---IDETAQ-------LSTVKNEDGTPV 281 (324) T ss_pred CCCeeecCCCCCcccceeeEeecCCCCCcceEE-EEecceEEEEEecCcEEE---Eeeccc-------ccccccccccch Confidence 55666554444445666553211 1111111 111 00 000000000 000000 0000011111 Q ss_pred --CCccccc---ceEE-----------EEEEEcCcCccccccc Q lcl|NC_018856. 312 --FRDEDIK---THSY-----------KVVVHSDDAESLPSEA 338 (479) Q Consensus 312 --f~~~d~g---ty~Y-----------kVtavn~~GES~pS~~ 338 (479) |...-.. ...| +++..+...+..|++. T Consensus 282 ~~f~~n~~~~r~~~r~d~~v~~~~a~~~l~~a~~~~~~~~~~~ 324 (324) T protein:vir:93 282 NLFEQDMVALRATMHVALHIADDKAFAKLVPADKRTDSVPGEV 324 (324) T ss_pred hhhhcCcEEEEEEEEeccEEecccceEEEecccccCCCCCCCC Confidence 1100000 0000 1222223334444443 No 25 >protein:vir:103955 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1662 # MgeName: phiNM # Cross-refs: genbank:acc:YP_873992;genbank:gi:118430767;genbank:GeneID:4525449 Probab=97.57 E-value=1.5e-05 Score=47.08 Aligned_cols=297 Identities=10% Similarity=0.023 Sum_probs=152.0 Q ss_pred CCccchhhhhhhhcCCccchHHHHHHHH--HhhhcCCCcChhhccCccccchhhhhhhhhhheeccccccchhhccccch Q lcl|NC_018856. 1 MTELKKEAEAKNKKLPVEAEAELAELVS--KSFTTGYGITPDTQLDGAAVRRELLEDQVKMLAFSSNDFTIYPLINKQQV 78 (479) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~e~~~--Ks~tag~~~~p~~~~~gaalr~esld~~~~~l~~~~~~f~f~~~i~k~~~ 78 (479) |-+.++.... .++....+. ..|.+..-+. ..+++.|-.+.+...|.... .+...+++.....++ T Consensus 1 ~~~~~~~~~~---------~~~f~~~~~~~~~~~a~~~~~---~~~~~~liP~~~~~~ii~~~--~~~s~l~~~~~~~~~ 66 (324) T protein:vir:10 1 MEQTQKLKLN---------LQHFASNNVKPQVFNPDNVMM---HEKKDGTLLNDFTTPILQEV--MENSKIMQLGKYEPM 66 (324) T ss_pred CCCchHHHHH---------HHHHHHHhhccceecccceec---cCCCcceechhHHHHHHHHH--Hhhchhhhhcceeec Confidence 6555433311 111111111 1123322221 22344466666666664333 222345555555556 Q ss_pred hHHHHhhhhhhccCcccccccccccccccccCcceEEEEEEEEeeeehhhhhhhHhhhcchhhHHHHHHHHHHHHHHHHH Q lcl|NC_018856. 79 NSTVAKYAVFNQHGRTGHSRFVREVGVASINDPNIRQKTVQMKFLSDTKQQSLAAGLVNNIADPMTILTEDAIAVIAKSI 158 (479) Q Consensus 79 ~stv~eY~~~~~~G~~g~~~fv~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lvn~~~Dp~~~~~~~ai~~~~~~i 158 (479) .+.-.+|.++. +.+...+++|++..+..++.+.+.....+=++..-.+|.-+- .++..|.+....+.--..+++.+ T Consensus 67 ~~~~~~~p~~~---~~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell-~ds~~~l~~~i~~~l~~ai~~~~ 142 (324) T protein:vir:10 67 EGTEKKFTFWA---DKPGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFL-NYTYSQFFEEMKPMIAEAFYKKF 142 (324) T ss_pred cCCceEEEEEe---CCcceeEeccCccccccccceeEEEEeeEEEEEeehhhHHHH-hcchHHHHHHHHHHHHHHHHHHH Confidence 55445565543 234578999999999999999999999999998888887432 24456788888888889999999 Q ss_pred HHHHhhcccccCCCCCCcccchhhhHHHhhccCCCEEEccCCCCCHHHHhhhhhhhhhccCceEEEecChHHhhhHHHHh Q lcl|NC_018856. 159 EWAIFYGDAALSSEADGQAGIEFDGLHKLIDQDTNVIDLKGARLDEATLNKAAVIVGKGYGRATDAFMPIGVQADFTNNL 238 (479) Q Consensus 159 E~a~f~Gd~~l~~~~~~~~gleFDGl~~~I~~~~NviDarG~~l~~~~l~~aa~~i~~~fG~~td~~mp~~vka~f~~~~ 238 (479) |.++|+|+-.= .+..|+.+.+... +... ..-++.+.|.++.-.+..++..+.-+.|++.+...+...- T Consensus 143 d~a~l~G~g~~---------~~~~~i~~~~~~~-~~~~--~~~~t~~~i~~~~~~l~~~~~~~~~~v~n~~~~~~L~~l~ 210 (324) T protein:vir:10 143 DEAGILNQGNN---------PFGKSIAQSIEKT-NKVI--KGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIV 210 (324) T ss_pred HHHhhhcCCCC---------ccCcccccccccc-ceec--cccCCHHHHHHHHHhhhhccCCCCEEEEcHHHHHHHHHhh Confidence 99999997541 1224555555432 3332 2346677778777777888888888999999999987655 Q ss_pred hCcceeeeccCCCcceeeeehhhhc---CCCcceecccceecCCCceecccCCcCCCCCCCceeEee----eeccCCCCC Q lcl|NC_018856. 239 LDRQRVIQPSTAGGFSTGFSINQFL---STRGAINLHGSTIMENDNILLEGRNPEPNAPQAPASVVA----SIVDDKKGG 311 (479) Q Consensus 239 ~~~qrv~~~~n~g~~~~G~~I~~~~---s~~G~I~l~~s~~m~~~~~L~e~~~~~~~AP~~pa~v~a----t~~t~~~G~ 311 (479) ...-|.+.+......-.|++|--.. ...+.+-+ |+ +. +.++... . ...-.+.. +...+.+|. T Consensus 211 d~~g~~~~~~~~~~~l~G~PV~~~~~~~~~~~~~~~-gd--~~-~~~~~~~------~-~~~i~~~~~~~~~~~~~~~~~ 279 (324) T protein:vir:10 211 DPETKERIYDRNSDTLDGLPVVNLKSSNLKRGELIT-GD--FD-KLIYGIP------Q-LIEYKIDETAQLSTVKNEDGT 279 (324) T ss_pred ccCCceeecCCCCccccceeEEeecCCCCCcceEEE-Ee--cc-cEEEEEe------c-CcEEEEeeccccccccccccc Confidence 5444454443333334555542211 11121111 11 00 0001000 0 00000000 011111111 Q ss_pred CCc-ccccceEEE-----------------EEEEcCcCccccccc Q lcl|NC_018856. 312 FRD-EDIKTHSYK-----------------VVVHSDDAESLPSEA 338 (479) Q Consensus 312 f~~-~d~gty~Yk-----------------Vtavn~~GES~pS~~ 338 (479) ... -.-+.-.++ ++-...-+|..|.+. T Consensus 280 ~~~~~~~~~~~~r~~~r~d~~v~~~~A~~~l~~a~~~~~~~~~~~ 324 (324) T protein:vir:10 280 PVNLFEQDMVALRATMHVALHIADDKAFAKLVPADKKTDSVPGEV 324 (324) T ss_pred chhhhhcCcEEEEEEEEEccEEecccceEEEEeccCCCCCCCCCC Confidence 000 000111111 222222223222222 No 26 >protein:vir:8102 Length: 543 # NCBI annotation: gp6 # Family: family:all:21 # MgeID: mge:152 # MgeName: Che9c # Cross-refs: genbank:acc:NP_817683;genbank:gi:29566114;genbank:GeneID:1259308 Probab=97.56 E-value=2.2e-05 Score=46.14 Aligned_cols=326 Identities=10% Similarity=-0.021 Sum_probs=146.6 Q ss_pred CCccchhhhhhhhcCCccch-----------------HHHHHHHHHhhhcCCCcChhhccCccccchhhhhhhh-hhhee Q lcl|NC_018856. 1 MTELKKEAEAKNKKLPVEAE-----------------AELAELVSKSFTTGYGITPDTQLDGAAVRRELLEDQV-KMLAF 62 (479) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~-----------------~~~~e~~~Ks~tag~~~~p~~~~~gaalr~esld~~~-~~l~~ 62 (479) +-+..++.+.+..+..-... ..+-..-.+++...-... .+.++|+.|..+.+..++ ..+.. T Consensus 198 ~~~~~d~~e~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~l~~~e~~~~~~~~~~~-~t~~~gg~lip~~~~~~ii~~~~~ 276 (543) T protein:vir:81 198 IIERFDDEDSTLARQCLATSSPAYLRAWSKMARNPHAAILTEEEKRAINEVRAMG-LTKADGGYLVPFQLDPTVIITSNG 276 (543) T ss_pred HHHHHHHHHHHHhhhhhhhhhhhhhhHHHHHHHhhHHHHhhhhhhhhhhhhhhcc-cccccCcccCchhhhhHHHHHHHh Confidence 11111111111111111000 000000112221111111 223456677666655443 22211 Q ss_pred ccccccchhhccccchhHHHHhhhhhhccCcccccccccccccccccCcceEEEEEEEEeeeehhhhhhhHhhhcchhhH Q lcl|NC_018856. 63 SSNDFTIYPLINKQQVNSTVAKYAVFNQHGRTGHSRFVREVGVASINDPNIRQKTVQMKFLSDTKQQSLAAGLVNNIADP 142 (479) Q Consensus 63 ~~~~f~f~~~i~k~~~~stv~eY~~~~~~G~~g~~~fv~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lvn~~~Dp 142 (479) .. ..+.+..........+. |.+ ..+.....+++|++..+.+++.+.+....++-++.-..+|.-+- .++ .|. T Consensus 277 ~~--~~l~~~~~~~~~~g~~~-~~~---~~~~~~a~~v~Eg~~~~~~~~~~~~i~~~~~k~~~~~~is~ell-~d~-~~~ 348 (543) T protein:vir:81 277 SL--NDIRRFARQVVATGDVW-HGV---SSAAVQWSWDAEFEEVSDDSPEFGQPEIPVKKAQGFVPISIEAL-QDE-ANV 348 (543) T ss_pred hh--chhhhhcccccCCcceE-EEE---ecCCcceeecccCccccccccccceeeeeeeeeEeeehhhHHHH-hcc-HHH Confidence 11 12222222222223222 112 22334677999999999999999999999999999989998642 344 588 Q ss_pred HHHHHHHHHHHHHHHHHHHHhhcccccCCCCCCcccchhhhHHHhhccC-CCEEEccCCCCCHHHHhhhhhhhhhccCce Q lcl|NC_018856. 143 MTILTEDAIAVIAKSIEWAIFYGDAALSSEADGQAGIEFDGLHKLIDQD-TNVIDLKGARLDEATLNKAAVIVGKGYGRA 221 (479) Q Consensus 143 ~~~~~~~ai~~~~~~iE~a~f~Gd~~l~~~~~~~~gleFDGl~~~I~~~-~NviDarG~~l~~~~l~~aa~~i~~~fG~~ 221 (479) +....+.-...++..++.++|+||-. |-++.|+.+..... ..+..+.+..++.+.+.++...+..+|... T Consensus 349 ~~~i~~~l~~~~~~~~d~ail~G~Gt---------~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~ 419 (543) T protein:vir:81 349 TETVALLFAEGKDELEAVTLTTGTGQ---------GNQPTGIVTALAGTAAEIAPVTAETFALADVYAVYEQLAARHRRQ 419 (543) T ss_pred HHHHHHHHHHHHHHHHHHHHhccCCC---------CcccccchhhcccccccccccccccccHHHHHHHHHhhhccccCC Confidence 89999999999999999999999743 23578888765432 235566667777777777777777788877 Q ss_pred EEEecChHHhhhHHHHhhCcceeeeccCCCcceeeeehhhhcCCCcceecccceecCCCceecccCCcCCCCCCCceeEe Q lcl|NC_018856. 222 TDAFMPIGVQADFTNNLLDRQRVIQPSTAGGFSTGFSINQFLSTRGAINLHGSTIMENDNILLEGRNPEPNAPQAPASVV 301 (479) Q Consensus 222 td~~mp~~vka~f~~~~~~~qrv~~~~n~g~~~~G~~I~~~~s~~G~I~l~~s~~m~~~~~L~e~~~~~~~AP~~pa~v~ 301 (479) ..++|+..+.+.+...-...-|.+.+...+ | .+.+++..+-...+ .++....+.. T Consensus 420 ~~~v~n~~~~~~l~~lkd~~G~~l~~~~~~---------------g----~~~~l~G~pv~~~~-~~~~~~~~~~----- 474 (543) T protein:vir:81 420 GAWLANNLIYNKIRQFDTQGGAGLWTTIGN---------------G----EPSQLLGRPVGEAE-AMDANWNTSA----- 474 (543) T ss_pred cEEEEcHHHHHHHHHhhcCCCceeccCcCC---------------C----CCccccceeeEEec-cccccccccc----- Confidence 778999999888875444333443321110 0 01233333322211 1111111110 Q ss_pred eeeccCCCCCCCcccccceEEEEEEEcCcCccc-cccceeeeeecCCceEEEEEEecCCCCcccceEEEEEecCCCcceE Q lcl|NC_018856. 302 ASIVDDKKGGFRDEDIKTHSYKVVVHSDDAESL-PSEAVTAAVAKKDNTVKLEVKLASLYQAQPQFISVYREGTETGHYF 380 (479) Q Consensus 302 at~~t~~~G~f~~~d~gty~YkVtavn~~GES~-pS~~vt~Tv~~~g~sv~ltIT~~~~~~a~~~~y~IYR~~~~~G~y~ 380 (479) . .|. .+-..|.+++.+ .....|-+. -+.....+.-...+.+.+-.. ... .+.|.+. ..+ T Consensus 475 ---~---~~~-~~i~~gd~~~~~-i~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~---~r~----d~~v~~~-----~A~ 534 (543) T protein:vir:81 475 ---S---ADN-FVLLYGNFQNYV-IADRIGMTVEFIPHLFGTNRRPNGSRGWFAY---YRM----GADVVNP-----NAF 534 (543) T ss_pred ---c---CCc-ceEEEeecccee-EEeecccEEEEeccccccchhhcCceEEEEE---Eee----ccEeecc-----cce Confidence 0 000 000112222211 111111100 000000000000000111100 000 0122221 112 Q ss_pred EEEeeeeee Q lcl|NC_018856. 381 LIARVPVSK 389 (479) Q Consensus 381 li~rv~vs~ 389 (479) .+..++.+. T Consensus 535 ~~l~~~~~a 543 (543) T protein:vir:81 535 RLLNVETAS 543 (543) T ss_pred EEEEecccC Confidence 222332221 No 27 >protein:vir:96223 Length: 324 # NCBI annotation: ORF011 # Family: family:all:507 # MgeID: mge:1607 # MgeName: 69 # Cross-refs: genbank:acc:YP_239571;genbank:gi:66395304;genbank:GeneID:5132771 Probab=97.51 E-value=2.2e-05 Score=46.07 Aligned_cols=300 Identities=11% Similarity=0.050 Sum_probs=156.3 Q ss_pred CCccchhhhhhhhcCCccchHHHHHHHHH--hhhcCCCcChhhccCccccchhhhhhhhhhheeccccccchhhccccch Q lcl|NC_018856. 1 MTELKKEAEAKNKKLPVEAEAELAELVSK--SFTTGYGITPDTQLDGAAVRRELLEDQVKMLAFSSNDFTIYPLINKQQV 78 (479) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~e~~~K--s~tag~~~~p~~~~~gaalr~esld~~~~~l~~~~~~f~f~~~i~k~~~ 78 (479) |-+.++..+ +.++....+.+ .+.+...+ .-.+++.|-.+.+-.+|..+.. +...+++.+.+.++ T Consensus 1 ~~~~~~~~~---------~~~~f~~~~~~~~~~~a~~~~---~~~~~~~lip~~~~~~ii~~~~--~~s~l~~l~~~~~~ 66 (324) T protein:vir:96 1 MEQTQKLKL---------NLQHFASNNVKPQVFNPDNVM---MHEKKDGTLLNDFTTPILQEVM--ENSKIMQLGKYEPM 66 (324) T ss_pred CCcchhhhH---------HHHHHHHhhhhhhhccccccc---ccCCCcceechhHHHHHHHHHH--hhchhhhhcceeec Confidence 655433321 12222222221 12222222 1123455667767666643332 22335555555666 Q ss_pred hHHHHhhhhhhccCcccccccccccccccccCcceEEEEEEEEeeeehhhhhhhHhhhcchhhHHHHHHHHHHHHHHHHH Q lcl|NC_018856. 79 NSTVAKYAVFNQHGRTGHSRFVREVGVASINDPNIRQKTVQMKFLSDTKQQSLAAGLVNNIADPMTILTEDAIAVIAKSI 158 (479) Q Consensus 79 ~stv~eY~~~~~~G~~g~~~fv~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lvn~~~Dp~~~~~~~ai~~~~~~i 158 (479) .+.-.+|.++.. .+...+++|++..+..++++.+.....+=++-.-.+|.-+- .++..|.+....+.-...+++.+ T Consensus 67 ~~~~~~~p~~~~---~~~a~~v~Eg~~~~~~~~~f~~v~~~~~k~~~~~~is~ell-~ds~~~l~~~i~~~l~~aia~~~ 142 (324) T protein:vir:96 67 EGTEKKFTFWAD---KPGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFL-NYTYSQFFEEMKPMIAEAFYKKF 142 (324) T ss_pred cCCceEEEEEec---CcceeeecCCccccccccceeEEEEEeEEEEEeehhhHHHH-hcchHHHHHHHHHHHHHHHHHHH Confidence 655455666543 23467999999999999999999999999998888887432 24556788888888889999999 Q ss_pred HHHHhhcccccCCCCCCcccchhhhHHHhhccCCCEEEccCCCCCHHHHhhhhhhhhhccCceEEEecChHHhhhHHHHh Q lcl|NC_018856. 159 EWAIFYGDAALSSEADGQAGIEFDGLHKLIDQDTNVIDLKGARLDEATLNKAAVIVGKGYGRATDAFMPIGVQADFTNNL 238 (479) Q Consensus 159 E~a~f~Gd~~l~~~~~~~~gleFDGl~~~I~~~~NviDarG~~l~~~~l~~aa~~i~~~fG~~td~~mp~~vka~f~~~~ 238 (479) |.++|+|+-+=. +-.|+...+... +.... ..++.+.|.++...+..++..++-+.|+....+.+...- T Consensus 143 d~~~l~G~g~~~---------~~~~~~~~~~~~-~~~~~--~~~~~~~i~~~~~~i~~~~~~~~~~i~n~~~~~~L~~lk 210 (324) T protein:vir:96 143 DEAGILNQGNNP---------FGKSIAQSIKKT-NKVIK--GDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIV 210 (324) T ss_pred HHHhhhcCCCCC---------cCcccccccccc-ceecc--cccchHHHHHHHHhhhhccCCCCEEEEcHHHHHHHHHhh Confidence 999999975411 123555544432 33222 234566677666667778888888999999999887665 Q ss_pred hCcceeeeccCCCcceeeeehhhhcCCCcceecccceecCCCceecccCCcCCCCCCCceeEee----eeccCCCC---- Q lcl|NC_018856. 239 LDRQRVIQPSTAGGFSTGFSINQFLSTRGAINLHGSTIMENDNILLEGRNPEPNAPQAPASVVA----SIVDDKKG---- 310 (479) Q Consensus 239 ~~~qrv~~~~n~g~~~~G~~I~~~~s~~G~I~l~~s~~m~~~~~L~e~~~~~~~AP~~pa~v~a----t~~t~~~G---- 310 (479) ...-|.+.++..+..-.|++|--..+.. +. .+..++.+..-+.-+.. -.....+.. +...+..+ T Consensus 211 d~~G~~~~~~~~~~~l~G~PV~~~~~~~--~~-~~~~~~gd~s~~~~~~~-----~~~~i~~~~~~~~~~~~~~~~~~~~ 282 (324) T protein:vir:96 211 DPETKERIYDRNSDSLDGLPVVNLKSSN--LK-RGELITGDFDKLIYGIP-----QLIEYKIDETAQLSTVKNEDGTPVN 282 (324) T ss_pred CCCCCeeecCCCCCcccceeeEeecCCC--CC-cceEEEEecceEEEEEe-----cCcEEEEeecccccccccccccchh Confidence 5555666655544445666553111110 00 01111111100000000 000000000 00000011 Q ss_pred CCCcccc--------------cceEEEEEEEcCcCccccccc Q lcl|NC_018856. 311 GFRDEDI--------------KTHSYKVVVHSDDAESLPSEA 338 (479) Q Consensus 311 ~f~~~d~--------------gty~YkVtavn~~GES~pS~~ 338 (479) .|...-. ...--+++..+..++..|++. T Consensus 283 ~~~~n~v~~r~~~r~d~~v~~~~a~~~l~~a~~~~~~~~~~~ 324 (324) T protein:vir:96 283 LFEQDMVALRATMHVALHIADDKAFAKLVPADKRTDSVPGEV 324 (324) T ss_pred hhhcCcEEEEEEEEeccEEecccceEEEecccccCCCCCCCC Confidence 0100000 001123334444455555554 No 28 >protein:vir:99749 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1497 # MgeName: phiETA2 # Cross-refs: genbank:acc:YP_001004307;genbank:gi:122891761;genbank:GeneID:4712304 Probab=97.49 E-value=9.9e-06 Score=48.01 Aligned_cols=297 Identities=11% Similarity=0.040 Sum_probs=151.5 Q ss_pred CCccchhhhhhhhcCCccchHHHHHHHH--HhhhcCCCcChhhccCccccchhhhhhhhhhheeccccccchhhccccch Q lcl|NC_018856. 1 MTELKKEAEAKNKKLPVEAEAELAELVS--KSFTTGYGITPDTQLDGAAVRRELLEDQVKMLAFSSNDFTIYPLINKQQV 78 (479) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~e~~~--Ks~tag~~~~p~~~~~gaalr~esld~~~~~l~~~~~~f~f~~~i~k~~~ 78 (479) |-|.++.+.. . ++....+. ..|.+..-+.. .+++.|-.+.+...|..+.... ..+.+...+.++ T Consensus 1 ~~k~~~~~~~-~--------~~~~~~~~~~~~~~a~~~~~~---~~~~~lip~~~~~~ii~~~~~~--s~l~~~~~~~~~ 66 (324) T protein:vir:99 1 MEQTQKLKLN-L--------QHFASNNVKPQVFNPDNVMMH---EKKDGTLLNDFTTPILQEVMEN--SKIMRLGKYEPM 66 (324) T ss_pred CCCchHhhHH-H--------HHHHHHhhhhhhccccceecc---CCCcceechhHHHHHHHHHHhh--chhhhhcceeec Confidence 7665443321 1 11111111 12333322222 2344466777766664333222 234444555555 Q ss_pred hHHHHhhhhhhccCcccccccccccccccccCcceEEEEEEEEeeeehhhhhhhHhhhcchhhHHHHHHHHHHHHHHHHH Q lcl|NC_018856. 79 NSTVAKYAVFNQHGRTGHSRFVREVGVASINDPNIRQKTVQMKFLSDTKQQSLAAGLVNNIADPMTILTEDAIAVIAKSI 158 (479) Q Consensus 79 ~stv~eY~~~~~~G~~g~~~fv~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lvn~~~Dp~~~~~~~ai~~~~~~i 158 (479) .+.-.+|.++. +.+...+++|++..+..++.+.+....++=++..-.+|.-+- .++..|.+....+.-...+++.+ T Consensus 67 ~~~~~~~p~~~---~~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell-~ds~~~l~~~i~~~l~~ai~~~~ 142 (324) T protein:vir:99 67 EGTEKKFTFWA---DKPGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFL-NYTYSQFFEEMKPMIAEAFYKKF 142 (324) T ss_pred cCCceEEEEEe---cCcceeEeccCccccccccceeEEEEeeEEEEEeehhhHHHH-hcchHHHHHHHHHHHHHHHHHHH Confidence 54333455533 334578999999999999999999999999998888887432 24456788888888899999999 Q ss_pred HHHHhhcccccCCCCCCcccchhhhHHHhhccCCCEEEccCCCCCHHHHhhhhhhhhhccCceEEEecChHHhhhHHHHh Q lcl|NC_018856. 159 EWAIFYGDAALSSEADGQAGIEFDGLHKLIDQDTNVIDLKGARLDEATLNKAAVIVGKGYGRATDAFMPIGVQADFTNNL 238 (479) Q Consensus 159 E~a~f~Gd~~l~~~~~~~~gleFDGl~~~I~~~~NviDarG~~l~~~~l~~aa~~i~~~fG~~td~~mp~~vka~f~~~~ 238 (479) |.++|+|+-.= .+.-|+.+.+... +... ...++.+.|.++.-.+..++..+.-+.|++.+.+.+...- T Consensus 143 d~~~l~G~g~~---------~~~~~~~~~~~~~-~~~~--~~~~~~~~i~~~~~~l~~~~~~~~~~v~n~~~~~~L~~l~ 210 (324) T protein:vir:99 143 DEAGILNQGNN---------PFGKSIAQSIEKT-NKVI--KGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIV 210 (324) T ss_pred HHHhhhcCCCC---------ccCcccccccccc-ceec--cccCCHHHHHHHHHhhhhccCCCCEEEEcHHHHHHHHHhh Confidence 99999997641 1224555554432 3322 2335667777777777888888888999999999887655 Q ss_pred hCcceeeeccCCCcceeeeehhhhcCC---Ccceecccc----eecCCCceecccCCcCCCCCCCceeEeeeeccCCCCC Q lcl|NC_018856. 239 LDRQRVIQPSTAGGFSTGFSINQFLST---RGAINLHGS----TIMENDNILLEGRNPEPNAPQAPASVVASIVDDKKGG 311 (479) Q Consensus 239 ~~~qrv~~~~n~g~~~~G~~I~~~~s~---~G~I~l~~s----~~m~~~~~L~e~~~~~~~AP~~pa~v~at~~t~~~G~ 311 (479) ...-|.+.+......-.|++|--..+. .|.+-+ |+ .+..+..+-++- ...+ ..+...+.++. T Consensus 211 d~~g~~~~~~~~~~~l~G~PVv~~~~~~~~~~~~i~-gd~~~~~~~~~~~~~i~~---~~~~-------~~~~~~~~~~~ 279 (324) T protein:vir:99 211 DPETKERIYDRNSDTLDGLPVVNLKSSNLKRGELIT-GDFDKLIYGIPQLIEYKI---DETA-------QLSTVKNEDGT 279 (324) T ss_pred cCCCceeecCCCCccccceeEEeecCCCCCcceEEE-EecccEEEEEecCcEEEE---eecc-------ccccccccccc Confidence 444445444333334456654322111 111111 11 000111100000 0000 00000111111 Q ss_pred ----CCccccc--------------ceEEEEEEEcCcCccccccc Q lcl|NC_018856. 312 ----FRDEDIK--------------THSYKVVVHSDDAESLPSEA 338 (479) Q Consensus 312 ----f~~~d~g--------------ty~YkVtavn~~GES~pS~~ 338 (479) |...-.. .-.-+++..+.-.|..|.+. T Consensus 280 ~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~lt~a~~~~~~~~~~~ 324 (324) T protein:vir:99 280 PVNLFEQDMVALRATMHVALHIADDKAFAKLVPADKKTDSVPGEV 324 (324) T ss_pred chhhhhcCcEEEEEEEEEccEEecccceEEEEeccCCCCCCCCCC Confidence 1100000 01112222222333333332 No 29 >protein:vir:191 Length: 385 # NCBI annotation: major head subunit precursor # Family: family:all:585 # MgeID: mge:6 # MgeName: HK97 # Cross-refs: genbank:acc:NP_037701;genbank:gi:9634158;genbank:GeneID:1262530 Probab=97.46 E-value=2.4e-05 Score=45.95 Aligned_cols=313 Identities=14% Similarity=0.110 Sum_probs=142.3 Q ss_pred CCccch---hhhhhhhcCCcc--c----hHHHHHHHHHhhhcCC----------CcChhhccCccccchhhhhhhhhhhe Q lcl|NC_018856. 1 MTELKK---EAEAKNKKLPVE--A----EAELAELVSKSFTTGY----------GITPDTQLDGAAVRRELLEDQVKMLA 61 (479) Q Consensus 1 ~~~~~~---~~~~~~~~~~~~--~----~~~~~e~~~Ks~tag~----------~~~p~~~~~gaalr~esld~~~~~l~ 61 (479) +.+.+. +.+.+..+.... . .+...+.+.|.+...- .....+..+|..+..+. .+.+.... T Consensus 50 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~i~~~~-~~~ii~~~ 128 (385) T protein:vir:19 50 LTKSGTRLFDLEQKLASGAENPGEKKSFSERAAEELIKSWDGKQGTFGAKTFNKSLGSDADSAGSLIQPMQ-IPGIIMPG 128 (385) T ss_pred HHHHHHHHHHHHHHhhccccccchhhhhHHHHHHHHHHHHHHhhccchhhHHHhhhccccccCCceecchh-hhHHHHHh Confidence 111110 011111111111 0 1112222334331110 01112223344455544 44443322 Q ss_pred eccccccchhhccccchhHHHHhhhhhhccCcccccccccccccccccCcceEEEEEEEEeeeehhhhhhhHhhhcchhh Q lcl|NC_018856. 62 FSSNDFTIYPLINKQQVNSTVAKYAVFNQHGRTGHSRFVREVGVASINDPNIRQKTVQMKFLSDTKQQSLAAGLVNNIAD 141 (479) Q Consensus 62 ~~~~~f~f~~~i~k~~~~stv~eY~~~~~~G~~g~~~fv~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lvn~~~D 141 (479) .....+++.++..++.+.-.+|.+.. +..+...+++|++..+..++.+.+....++=++....+|.- +.+...+ T Consensus 129 --~~~~~l~~~~~~~~~~~~~~~~~~~~--~~~~~a~~v~E~~~~~~~~~~~~~~~~~~~k~~~~~~is~e--ll~d~~~ 202 (385) T protein:vir:19 129 --LRRLTIRDLLAQGRTSSNALEYVREE--VFTNNADVVAEKALKPESDITFSKQTANVKTIAHWVQASRQ--VMDDAPM 202 (385) T ss_pred --hhccchhhhcceecccCcceEEEEEe--cCCcceeeeccCccccccccceeEEEEeeeeEEEeehhhHH--HHhhHHH Confidence 23345666666666655434444432 33344668999999999999999999999999998888875 3334356 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhhcccccCCCCCCcccchhhhHHHhhccCCCEEEccCCCCCHHHHhhhhhhhhhccCce Q lcl|NC_018856. 142 PMTILTEDAIAVIAKSIEWAIFYGDAALSSEADGQAGIEFDGLHKLIDQDTNVIDLKGARLDEATLNKAAVIVGKGYGRA 221 (479) Q Consensus 142 p~~~~~~~ai~~~~~~iE~a~f~Gd~~l~~~~~~~~gleFDGl~~~I~~~~NviDarG~~l~~~~l~~aa~~i~~~fG~~ 221 (479) .+....+.-...+...++.++++|+-. |-.+.||.+............ .....+.|.++...+...++.. T Consensus 203 l~~~i~~~la~a~~~~~d~~~l~G~g~---------~~~~~Gi~~~~~~~~~~~~~~-~~~~~d~i~~~~~~l~~~~~~~ 272 (385) T protein:vir:19 203 LQSYINNRLMYGLALKEEGQLLNGDGT---------GDNLEGLNKVATAYDTSLNAT-GDTRADIIAHAIYQVTESEFSA 272 (385) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhccCC---------CCccccccccccccccccccc-ccchHHHHHHHHHhhccccCCC Confidence 778888888899999999999999744 224677776553221222222 2345666777766777889999 Q ss_pred EEEecChHHhhhHHHHhhCcceeeeccCC-C--cceeeeehhhhc-CCCcceecccceecCCCceecccCCcCCCCCCCc Q lcl|NC_018856. 222 TDAFMPIGVQADFTNNLLDRQRVIQPSTA-G--GFSTGFSINQFL-STRGAINLHGSTIMENDNILLEGRNPEPNAPQAP 297 (479) Q Consensus 222 td~~mp~~vka~f~~~~~~~qrv~~~~n~-g--~~~~G~~I~~~~-s~~G~I~l~~s~~m~~~~~L~e~~~~~~~AP~~p 297 (479) +-++|+..+.+.+...-...-|.+.+... + +.-.|++|-... -|.|.+-| |+- ...-.+.. ... . T Consensus 273 ~~~~~~~~~~~~l~~lkd~~G~~l~~~~~~~~~~~l~G~pV~~~~~~p~~~~~~-gd~--~~~~~~~~-------~~~-~ 341 (385) T protein:vir:19 273 SGIVLNPRDWHNIALLKDNEGRYIFGGPQAFTSNIMWGLPVVPTKAQAAGTFTV-GGF--DMASQVWD-------RMD-A 341 (385) T ss_pred CEEEEcHHHHHHHHHhhcCCCceeccCcccCCCceecceeeEEcCcCCCCcEEE-eec--ccEEEEEE-------ecc-e Confidence 99999999988886654444444443210 0 011122211000 00111000 000 00000000 000 0 Q ss_pred eeEeeeeccCCCCCCCcccccceEEEEEEEcCcCccccccceeeeeecCCceEEEEEEecC Q lcl|NC_018856. 298 ASVVASIVDDKKGGFRDEDIKTHSYKVVVHSDDAESLPSEAVTAAVAKKDNTVKLEVKLAS 358 (479) Q Consensus 298 a~v~at~~t~~~G~f~~~d~gty~YkVtavn~~GES~pS~~vt~Tv~~~g~sv~ltIT~~~ 358 (479) ..... . ....-|. .+...|++...-+-. +..+..-+.++++-+. T Consensus 342 ~v~~~--~-~~~~~~~---~~~~~~~~~~r~~~~-----------v~~~~a~~~~~~~aa~ 385 (385) T protein:vir:19 342 TVEVS--R-EDRDNFV---KNMLTILCEERLALA-----------HYRPTAIIKGTFSSGS 385 (385) T ss_pred EEEEe--c-cccchhh---cCcEEEEEEEeeccE-----------EecccceEEEEeccCC Confidence 00000 0 0000000 001112211111111 1111122222222111 No 30 >protein:vir:1886 Length: 385 # NCBI annotation: major capsid subunit precursor # Family: family:all:585 # MgeID: mge:41 # MgeName: HK022 # Cross-refs: genbank:acc:NP_037666;genbank:gi:9634124;genbank:GeneID:1262513 Probab=97.46 E-value=2.4e-05 Score=45.95 Aligned_cols=313 Identities=14% Similarity=0.110 Sum_probs=142.3 Q ss_pred CCccch---hhhhhhhcCCcc--c----hHHHHHHHHHhhhcCC----------CcChhhccCccccchhhhhhhhhhhe Q lcl|NC_018856. 1 MTELKK---EAEAKNKKLPVE--A----EAELAELVSKSFTTGY----------GITPDTQLDGAAVRRELLEDQVKMLA 61 (479) Q Consensus 1 ~~~~~~---~~~~~~~~~~~~--~----~~~~~e~~~Ks~tag~----------~~~p~~~~~gaalr~esld~~~~~l~ 61 (479) +.+.+. +.+.+..+.... . .+...+.+.|.+...- .....+..+|..+..+. .+.+.... T Consensus 50 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~i~~~~-~~~ii~~~ 128 (385) T protein:vir:18 50 LTKSGTRLFDLEQKLASGAENPGEKKSFSERAAEELIKSWDGKQGTFGAKTFNKSLGSDADSAGSLIQPMQ-IPGIIMPG 128 (385) T ss_pred HHHHHHHHHHHHHHhhccccccchhhhhHHHHHHHHHHHHHHhhccchhhHHHhhhccccccCCceecchh-hhHHHHHh Confidence 111110 011111111111 0 1112222334331110 01112223344455544 44443322 Q ss_pred eccccccchhhccccchhHHHHhhhhhhccCcccccccccccccccccCcceEEEEEEEEeeeehhhhhhhHhhhcchhh Q lcl|NC_018856. 62 FSSNDFTIYPLINKQQVNSTVAKYAVFNQHGRTGHSRFVREVGVASINDPNIRQKTVQMKFLSDTKQQSLAAGLVNNIAD 141 (479) Q Consensus 62 ~~~~~f~f~~~i~k~~~~stv~eY~~~~~~G~~g~~~fv~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lvn~~~D 141 (479) .....+++.++..++.+.-.+|.+.. +..+...+++|++..+..++.+.+....++=++....+|.- +.+...+ T Consensus 129 --~~~~~l~~~~~~~~~~~~~~~~~~~~--~~~~~a~~v~E~~~~~~~~~~~~~~~~~~~k~~~~~~is~e--ll~d~~~ 202 (385) T protein:vir:18 129 --LRRLTIRDLLAQGRTSSNALEYVREE--VFTNNADVVAEKALKPESDITFSKQTANVKTIAHWVQASRQ--VMDDAPM 202 (385) T ss_pred --hhccchhhhcceecccCcceEEEEEe--cCCcceeeeccCccccccccceeEEEEeeeeEEEeehhhHH--HHhhHHH Confidence 23345666666666655434444432 33344668999999999999999999999999998888875 3334356 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhhcccccCCCCCCcccchhhhHHHhhccCCCEEEccCCCCCHHHHhhhhhhhhhccCce Q lcl|NC_018856. 142 PMTILTEDAIAVIAKSIEWAIFYGDAALSSEADGQAGIEFDGLHKLIDQDTNVIDLKGARLDEATLNKAAVIVGKGYGRA 221 (479) Q Consensus 142 p~~~~~~~ai~~~~~~iE~a~f~Gd~~l~~~~~~~~gleFDGl~~~I~~~~NviDarG~~l~~~~l~~aa~~i~~~fG~~ 221 (479) .+....+.-...+...++.++++|+-. |-.+.||.+............ .....+.|.++...+...++.. T Consensus 203 l~~~i~~~la~a~~~~~d~~~l~G~g~---------~~~~~Gi~~~~~~~~~~~~~~-~~~~~d~i~~~~~~l~~~~~~~ 272 (385) T protein:vir:18 203 LQSYINNRLMYGLALKEEGQLLNGDGT---------GDNLEGLNKVATAYDTSLNAT-GDTRADIIAHAIYQVTESEFSA 272 (385) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhccCC---------CCccccccccccccccccccc-ccchHHHHHHHHHhhccccCCC Confidence 778888888899999999999999744 224677776553221222222 2345666777766777889999 Q ss_pred EEEecChHHhhhHHHHhhCcceeeeccCC-C--cceeeeehhhhc-CCCcceecccceecCCCceecccCCcCCCCCCCc Q lcl|NC_018856. 222 TDAFMPIGVQADFTNNLLDRQRVIQPSTA-G--GFSTGFSINQFL-STRGAINLHGSTIMENDNILLEGRNPEPNAPQAP 297 (479) Q Consensus 222 td~~mp~~vka~f~~~~~~~qrv~~~~n~-g--~~~~G~~I~~~~-s~~G~I~l~~s~~m~~~~~L~e~~~~~~~AP~~p 297 (479) +-++|+..+.+.+...-...-|.+.+... + +.-.|++|-... -|.|.+-| |+- ...-.+.. ... . T Consensus 273 ~~~~~~~~~~~~l~~lkd~~G~~l~~~~~~~~~~~l~G~pV~~~~~~p~~~~~~-gd~--~~~~~~~~-------~~~-~ 341 (385) T protein:vir:18 273 SGIVLNPRDWHNIALLKDNEGRYIFGGPQAFTSNIMWGLPVVPTKAQAAGTFTV-GGF--DMASQVWD-------RMD-A 341 (385) T ss_pred CEEEEcHHHHHHHHHhhcCCCceeccCcccCCCceecceeeEEcCcCCCCcEEE-eec--ccEEEEEE-------ecc-e Confidence 99999999988886654444444443210 0 011122211000 00111000 000 00000000 000 0 Q ss_pred eeEeeeeccCCCCCCCcccccceEEEEEEEcCcCccccccceeeeeecCCceEEEEEEecC Q lcl|NC_018856. 298 ASVVASIVDDKKGGFRDEDIKTHSYKVVVHSDDAESLPSEAVTAAVAKKDNTVKLEVKLAS 358 (479) Q Consensus 298 a~v~at~~t~~~G~f~~~d~gty~YkVtavn~~GES~pS~~vt~Tv~~~g~sv~ltIT~~~ 358 (479) ..... . ....-|. .+...|++...-+-. +..+..-+.++++-+. T Consensus 342 ~v~~~--~-~~~~~~~---~~~~~~~~~~r~~~~-----------v~~~~a~~~~~~~aa~ 385 (385) T protein:vir:18 342 TVEVS--R-EDRDNFV---KNMLTILCEERLALA-----------HYRPTAIIKGTFSSGS 385 (385) T ss_pred EEEEe--c-cccchhh---cCcEEEEEEEeeccE-----------EecccceEEEEeccCC Confidence 00000 0 0000000 001112211111111 1111122222222111 No 31 >protein:vir:100135 Length: 418 # NCBI annotation: gp5 # Family: family:all:585 # MgeID: mge:1639 # MgeName: phi1026b # Cross-refs: genbank:acc:NP_945035;genbank:gi:38707895;genbank:GeneID:2744182 Probab=97.45 E-value=1.3e-05 Score=47.38 Aligned_cols=313 Identities=12% Similarity=0.016 Sum_probs=152.0 Q ss_pred CCccchhhhhhhhcCCccchHHHHHHHHHhhhcCCCcC-------------hhhccCccccchhhhhhhhhhheeccccc Q lcl|NC_018856. 1 MTELKKEAEAKNKKLPVEAEAELAELVSKSFTTGYGIT-------------PDTQLDGAAVRRELLEDQVKMLAFSSNDF 67 (479) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~e~~~Ks~tag~~~~-------------p~~~~~gaalr~esld~~~~~l~~~~~~f 67 (479) +.........+..+-... ...-...+.+.+..+.... ..+-.+|+.|-.+.+..++..+. .+.- T Consensus 87 ~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~lvp~~~~~~ii~~~--~~~~ 163 (418) T protein:vir:10 87 GGGSAELETPKTLGQLVT-ESEEMKGMDGSARKSVRVRVDRKSIMNVPATVGSGVSGSNSLVVADRQAGIIAPP--QRKM 163 (418) T ss_pred cccccccchhhhhhHHhh-hHHHHHHHHHHHhhhhhhhhHHHHHHHhhhhccCCCCCCccccchhHHHHHHHHH--hhhh Confidence 000000000000000000 0000111111111111110 11233456666777766664333 3334 Q ss_pred cchhhccccchhHHHHhhhhhhccCcccccccccccccccccCcceEEEEEEEEeeeehhhhhhhHhhhcchhhHHHHHH Q lcl|NC_018856. 68 TIYPLINKQQVNSTVAKYAVFNQHGRTGHSRFVREVGVASINDPNIRQKTVQMKFLSDTKQQSLAAGLVNNIADPMTILT 147 (479) Q Consensus 68 ~f~~~i~k~~~~stv~eY~~~~~~G~~g~~~fv~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lvn~~~Dp~~~~~ 147 (479) .+++.+...++.+.-.+|.+.. +......++.|++..+.+++.+......++-++.--.+|.- +.+.-.|.+.... T Consensus 164 ~l~~~~~~~~~~~~~~~~~~~~--~~~~~a~~v~E~~~~~~~~~~f~~v~~~~~k~~~~~~is~e--ll~ds~~l~~~i~ 239 (418) T protein:vir:10 164 TIRDLLMPGQTSSSSIEYTVET--GFTNNAAAVAEGAQKPTSDLKFNLKNQPVRTIAHLFKASRQ--ILDDAPALQSYID 239 (418) T ss_pred hHHhhcceeeccCCceeEEEEe--cCCCceeeeccCccccccccceeeEEEeeeeEEEeehhhHH--HHHhHHHHHHHHH Confidence 5666566555554323344432 33345678999999999999999999999999988778775 3333358888888 Q ss_pred HHHHHHHHHHHHHHHhhcccccCCCCCCcccchhhhHHHhhccCCCEEEccCCCCCHHHHhhhhhhhhhccCceEEEecC Q lcl|NC_018856. 148 EDAIAVIAKSIEWAIFYGDAALSSEADGQAGIEFDGLHKLIDQDTNVIDLKGARLDEATLNKAAVIVGKGYGRATDAFMP 227 (479) Q Consensus 148 ~~ai~~~~~~iE~a~f~Gd~~l~~~~~~~~gleFDGl~~~I~~~~NviDarG~~l~~~~l~~aa~~i~~~fG~~td~~mp 227 (479) +.-...+++.++.++|+|+-.= -+..|+.+............ .....+.|..+.-.+...++..+-++|+ T Consensus 240 ~~l~~a~~~~~d~a~l~G~g~~---------~~p~Gi~~~~~~~~~~~~~~-~~~~~~~i~~~~~~~~~~~~~~~~~v~n 309 (418) T protein:vir:10 240 GRARYGLQLTEEGQILKGDGTG---------ANILGILPQASAFMPSITLA-NATPIDKIRLALLQAVLAEFPATGIVLN 309 (418) T ss_pred HHHHHHHHHHHHHHHhccCCCC---------cccccccccccccccccccc-ccccHHHHHHHHHhhccccCCCCEEEEc Confidence 9999999999999999997641 12467766543221121111 2234555666655667778888889999 Q ss_pred hHHhhhHHHHhhCcceeeeccC---CCcceeeeehhhh-cCCCcceecccceecCCCceecccCCcCCCCCCCceeEeee Q lcl|NC_018856. 228 IGVQADFTNNLLDRQRVIQPST---AGGFSTGFSINQF-LSTRGAINLHGSTIMENDNILLEGRNPEPNAPQAPASVVAS 303 (479) Q Consensus 228 ~~vka~f~~~~~~~qrv~~~~n---~g~~~~G~~I~~~-~s~~G~I~l~~s~~m~~~~~L~e~~~~~~~AP~~pa~v~at 303 (479) ..+...+...-...-|.+.++. .+..-.|++|-.. .-|.|.+-+ || +.+.-.+.. . ....... T Consensus 310 ~~~~~~L~~lkd~~G~~i~~~~~~~~~~~l~G~pV~~~~~~p~~~~~~-gd--~s~~~~~~~-------~-~~~~i~~-- 376 (418) T protein:vir:10 310 PIDWASIELTKDSQGRYIVGNPVNGTTPRLWNLPVVETQAMTANEFLV-GA--FSMAAQIFD-------R-MEIEVLL-- 376 (418) T ss_pred HHHHHHHHHhhcCCCceeccccccCCCceecceeeEEcCCCCCCcEEE-ee--ccceEEEEE-------e-cceEEEE-- Confidence 9999888766555556665531 1112334433211 122333211 11 000001111 0 0011111 Q ss_pred eccCCCCCCCcccccceEEEEEEEcCcCccccccc--eeeeeecCC Q lcl|NC_018856. 304 IVDDKKGGFRDEDIKTHSYKVVVHSDDAESLPSEA--VTAAVAKKD 347 (479) Q Consensus 304 ~~t~~~G~f~~~d~gty~YkVtavn~~GES~pS~~--vt~Tv~~~g 347 (479) ....+..|.. +...|++...=+.+---|... ++.+.++.| T Consensus 377 -~~~~~~~f~~---~~~~~r~~~~~d~~~~~~~a~~~~~~~~~~~g 418 (418) T protein:vir:10 377 -STENVDDFEK---NMVSIRAEERLALAVYRPESFVTGALVEQAGG 418 (418) T ss_pred -ecccchhhhc---CceEEEEEEeeccEEecccceEEEEeccCCCC Confidence 1111122321 234444432222221123333 334444555 No 32 >protein:vir:4339 Length: 395 # NCBI annotation: major head protein # Family: family:all:585 # MgeID: mge:93 # MgeName: D3 # Cross-refs: genbank:acc:NP_061502;genbank:gi:9635591;genbank:GeneID:1262860 Probab=97.43 E-value=5.9e-06 Score=49.26 Aligned_cols=310 Identities=15% Similarity=0.073 Sum_probs=153.8 Q ss_pred CCccchh---hhhh-----hhcCCccc---------hHHHHHHHHHhhhcCCC-------cChhhccCccccchhhhhhh Q lcl|NC_018856. 1 MTELKKE---AEAK-----NKKLPVEA---------EAELAELVSKSFTTGYG-------ITPDTQLDGAAVRRELLEDQ 56 (479) Q Consensus 1 ~~~~~~~---~~~~-----~~~~~~~~---------~~~~~e~~~Ks~tag~~-------~~p~~~~~gaalr~esld~~ 56 (479) +.+.+.+ .+.+ ........ +....+.+.+.+..+.. ++..+..+| .|-...+..+ T Consensus 54 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g-~~vp~~~~~~ 132 (395) T protein:vir:43 54 QGELQARLSAAEQAMLANEKRDGGEEAPKTAGQMVAESLKEQGVTSSLRGSHRVSMPRSAITSIDGSGG-ALVAPDRRPG 132 (395) T ss_pred HHHHHHHHHHHHHHHHhhhccccccchhhhHHHHHHHHHHHHHHHHHhhhhhhhhhhhhhhcccCCCCc-cccchhhHHH Confidence 0000000 0000 00000000 00011111111111111 111222334 4444445555 Q ss_pred hhhheeccccccchhhccccchhHHHHhhhhhhccCcccccccccccccccccCcceEEEEEEEEeeeehhhhhhhHhhh Q lcl|NC_018856. 57 VKMLAFSSNDFTIYPLINKQQVNSTVAKYAVFNQHGRTGHSRFVREVGVASINDPNIRQKTVQMKFLSDTKQQSLAAGLV 136 (479) Q Consensus 57 ~~~l~~~~~~f~f~~~i~k~~~~stv~eY~~~~~~G~~g~~~fv~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lv 136 (479) |..+. .+...+++-+++.++.+...+|.+.. ++.+...+++|++..+.+++.+......++-++-...+|.-+ . T Consensus 133 ii~~~--~~~~~l~~l~~~~~~~~~~~~~~~~~--~~~~~a~~v~E~~~~~~~~~~~~~i~~~~~k~~~~~~is~el--l 206 (395) T protein:vir:43 133 VVAAP--QRRLTIRDLVAPGTTESNSVEYVRET--GFVNNAAPVSEGTQKPYSDLTFELENAPVRTIAHLFKASRQI--L 206 (395) T ss_pred HHHHH--HhhhhHHhhccceecCCCceEEEEEe--cCCCceeeecCCccccccccceeEEEEeeeeEEEeehhhHHH--H Confidence 53332 33445666677777766655565543 333456789999999999999999999999999988888764 3 Q ss_pred cchhhHHHHHHHHHHHHHHHHHHHHHhhcccccCCCCCCcccchhhhHHHhhccCCCEEEccC---CCCCHHHHhhhhhh Q lcl|NC_018856. 137 NNIADPMTILTEDAIAVIAKSIEWAIFYGDAALSSEADGQAGIEFDGLHKLIDQDTNVIDLKG---ARLDEATLNKAAVI 213 (479) Q Consensus 137 n~~~Dp~~~~~~~ai~~~~~~iE~a~f~Gd~~l~~~~~~~~gleFDGl~~~I~~~~NviDarG---~~l~~~~l~~aa~~ 213 (479) +...+.+....+.-...++..++.++++|+-. +-.+.|+.+...- .+.+.-+ .....+.|.++... T Consensus 207 ~d~~~l~~~v~~~la~a~~~~~d~~~l~G~g~---------~~~~~Gi~~~~~~--~~~~~~~~~~~~~~~~~i~~~~~~ 275 (395) T protein:vir:43 207 DDASALQSYIDARARYGLMLVEECQLLYGNGT---------GANLHGIIPQAQA--YAPPSGVVVTAEQRIDRIRLAILQ 275 (395) T ss_pred HhHHHHHHHHHHHHHHHHHHHHHHHHHhccCC---------CCccccccccccc--cccccccccccchhHHHHHHHHHh Confidence 33456778888888889999999999999753 2236777665432 2222222 23345566666667 Q ss_pred hhhccCceEEEecChHHhhhHHHHhhCcceeeeccCCCcc----eeeeehhhhc-CCCcceecccceecCCCceecccCC Q lcl|NC_018856. 214 VGKGYGRATDAFMPIGVQADFTNNLLDRQRVIQPSTAGGF----STGFSINQFL-STRGAINLHGSTIMENDNILLEGRN 288 (479) Q Consensus 214 i~~~fG~~td~~mp~~vka~f~~~~~~~qrv~~~~n~g~~----~~G~~I~~~~-s~~G~I~l~~s~~m~~~~~L~e~~~ 288 (479) +..+|+...-++|++.+...+...-...-|.+.+. ..+. -.|++|-... -+.|.+-+ |+. .+.-.+.. T Consensus 276 ~~~~~~~~~~~vmn~~~~~~l~~lkd~~G~~i~~~-~~~~~~~~l~G~pVv~~~~~~~~~~~~-gd~--~~~~~~~~--- 348 (395) T protein:vir:43 276 AQLAEFPASGIVLNPIDWALIELNKDAENRYIIGS-PQNGTTPTLWRLPVVETQAITQDEFLT-GAF--SLGAQIFD--- 348 (395) T ss_pred hccccCCCcEEEEcHHHHHHHHHhhccCCceeccc-cccCCCceecceeeEEcCCCCCCcEEE-Eec--cceEEEEE--- Confidence 77788888889999999988876554444555442 2112 2344332111 12333211 110 00000110 Q ss_pred cCCCCCCCceeEeeeeccCCCCCCCcccccceEEEEEEEcCcCccccccceeeeeecC Q lcl|NC_018856. 289 PEPNAPQAPASVVASIVDDKKGGFRDEDIKTHSYKVVVHSDDAESLPSEAVTAAVAKK 346 (479) Q Consensus 289 ~~~~AP~~pa~v~at~~t~~~G~f~~~d~gty~YkVtavn~~GES~pS~~vt~Tv~~~ 346 (479) .-+ ...-..+ ..+..|.. +...|++...-+-+---|...+..++++. T Consensus 349 ----~~~-~~i~~~~---~~~~~f~~---~~~~~r~~~r~d~~v~~~~a~~~~~~taa 395 (395) T protein:vir:43 349 ----RMD-IEVLVST---ENDKDFEN---NMVTIRAEERLAFAVYRPEAFVTGSLTAS 395 (395) T ss_pred ----ecc-eEEEEec---cccchhhc---CcEEEEEEEeeccEEecccceEEEEeccC Confidence 001 1111111 11122321 23455544333333223444445444444 No 33 >protein:vir:97148 Length: 324 # NCBI annotation: ORF010 # Family: family:all:507 # MgeID: mge:1654 # MgeName: 85 # Cross-refs: genbank:acc:YP_239726;genbank:gi:66394880;genbank:GeneID:5130881 Probab=97.41 E-value=4.4e-05 Score=44.47 Aligned_cols=299 Identities=11% Similarity=0.045 Sum_probs=157.8 Q ss_pred CCccchhhhhhhhcCCccchHHHHHHHH--HhhhcCCCcChhhccCccccchhhhhhhhhhheeccccccchhhccccch Q lcl|NC_018856. 1 MTELKKEAEAKNKKLPVEAEAELAELVS--KSFTTGYGITPDTQLDGAAVRRELLEDQVKMLAFSSNDFTIYPLINKQQV 78 (479) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~e~~~--Ks~tag~~~~p~~~~~gaalr~esld~~~~~l~~~~~~f~f~~~i~k~~~ 78 (479) |-+.++..+.- . ..+.... ..+.+...+ .-++|+.|-++.+..+|.... .+...+.+...+.+. T Consensus 1 ~~~~~~~~~~~-~--------~f~~~~~~~~~~~a~~~~---~~~~~~~~iP~~~~~~ii~~~--~~~s~l~~~~~~~~~ 66 (324) T protein:vir:97 1 MEQTQKLKLNL-Q--------HFASNNVKPQVFNPDNVM---MHEKKDGTLMNEFTTPILQEV--MENSKIMQLGKYEPM 66 (324) T ss_pred CccchhHHHHH-H--------HHHHhhhhhhhhcccccc---ccCCCcceechhHHHHHHHHH--Hhhcchhhhcceeec Confidence 76664433221 1 1121111 122333222 223456677777766664333 233345555555566 Q ss_pred hHHHHhhhhhhccCcccccccccccccccccCcceEEEEEEEEeeeehhhhhhhHhhhcchhhHHHHHHHHHHHHHHHHH Q lcl|NC_018856. 79 NSTVAKYAVFNQHGRTGHSRFVREVGVASINDPNIRQKTVQMKFLSDTKQQSLAAGLVNNIADPMTILTEDAIAVIAKSI 158 (479) Q Consensus 79 ~stv~eY~~~~~~G~~g~~~fv~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lvn~~~Dp~~~~~~~ai~~~~~~i 158 (479) .+--.+|.++. +.+...+++|++..+..++.+.......+=++---.+|.-+ +.++..|.+....+.-...++..+ T Consensus 67 ~~~~~~ip~~~---~~~~a~~v~Eg~~~~~~~~~f~~v~~~~~k~~~~~~is~el-l~ds~~~l~~~i~~~l~~aia~~~ 142 (324) T protein:vir:97 67 EGTEKKFTFWA---DKPGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEF-LNYTYSQFFEEMKPMIAEAFYKKF 142 (324) T ss_pred cCCceEEEEEe---cCcceeEeccCccccccccceeEEEEeeEEEEEeehhhHHH-HhcchHHHHHHHHHHHHHHHHHHH Confidence 54434455543 23356799999999999999999999999999888888732 234556788888889999999999 Q ss_pred HHHHhhcccccCCCCCCcccchhhhHHHhhccCCCEEEccCCCCCHHHHhhhhhhhhhccCceEEEecChHHhhhHHHHh Q lcl|NC_018856. 159 EWAIFYGDAALSSEADGQAGIEFDGLHKLIDQDTNVIDLKGARLDEATLNKAAVIVGKGYGRATDAFMPIGVQADFTNNL 238 (479) Q Consensus 159 E~a~f~Gd~~l~~~~~~~~gleFDGl~~~I~~~~NviDarG~~l~~~~l~~aa~~i~~~fG~~td~~mp~~vka~f~~~~ 238 (479) |.++|.|+..= .+..|+.+.+... +... ..-++.+.|.++.-.+..++..+.-..|++.+...+...- T Consensus 143 d~a~l~G~g~~---------~~~~gi~~~~~~~-~~~~--~~~~~~~~i~~~~~~l~~~~~~~~~~v~n~~~~~~L~~lk 210 (324) T protein:vir:97 143 DEAGILNQGNN---------PFGKSIAQSIEKT-NKVI--KGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIV 210 (324) T ss_pred HHHhhccCCCC---------ccCcccccccccc-ceec--cccCCHHHHHHHHHhhhhccCCCCEEEEcHHHHHHHHHhh Confidence 99999998641 2335666555443 4332 2345677788888788888888888999999999887665 Q ss_pred hCcceeeeccCCCcceeeeehhhhc---CCCcceecccceecCCCceecc--cCCcCCCCCCCceeEeeeeccCCCCC-- Q lcl|NC_018856. 239 LDRQRVIQPSTAGGFSTGFSINQFL---STRGAINLHGSTIMENDNILLE--GRNPEPNAPQAPASVVASIVDDKKGG-- 311 (479) Q Consensus 239 ~~~qrv~~~~n~g~~~~G~~I~~~~---s~~G~I~l~~s~~m~~~~~L~e--~~~~~~~AP~~pa~v~at~~t~~~G~-- 311 (479) .+.-|.+......+.-.|++|-... ...|.+-+ |+ +. .-++.. +..-.. ... ...+...+.+|. T Consensus 211 d~~g~~~~~~~~~~tl~G~PV~~~~~~~~~~~~~~~-gd--~~-~~~i~~~~~~~i~~-~~~----~~~~~~~~~~~~~~ 281 (324) T protein:vir:97 211 DPETKERIYDRNSDTLDGLPVVNLKSSNLKRGELIT-GD--FD-KLIYGIPQLIEYKI-DET----AQLSTVKNEDGTPV 281 (324) T ss_pred cCCCceeecCCCCccccceeeEeecCCCCCcceEEE-Ee--cc-cEEEEEecCcEEEE-eec----ccccccccccccch Confidence 5443444433333334555442111 11111111 11 00 001100 000000 000 000001111111 Q ss_pred --CCcccc----------c----ceEEEEEEEcCcCccccccc Q lcl|NC_018856. 312 --FRDEDI----------K----THSYKVVVHSDDAESLPSEA 338 (479) Q Consensus 312 --f~~~d~----------g----ty~YkVtavn~~GES~pS~~ 338 (479) |...-. + .-.-+++-.+..++..|++. T Consensus 282 ~~f~~d~~~~r~~~r~d~~v~~~~a~~~l~~~~~~~~~~~~~~ 324 (324) T protein:vir:97 282 NLFEQDMVALRATMHVALHIADDKAFAKLVPADKKTDSVPGEV 324 (324) T ss_pred hhhhcCcEEEEEEEEeccEEecccceEEEEeccCCCCCCCCCC Confidence 100000 0 01123344445555555554 No 34 >protein:vir:8187 Length: 311 # NCBI annotation: gp7 # Family: family:all:966 # MgeID: mge:153 # MgeName: Che9d # Cross-refs: genbank:acc:NP_817980;genbank:gi:29566414;genbank:GeneID:2700968 Probab=97.39 E-value=1.1e-05 Score=47.75 Aligned_cols=287 Identities=11% Similarity=0.019 Sum_probs=144.0 Q ss_pred hhhccCccccchhhhhhhhhhheeccccccchhhccccchhHHHHhhhhhhccCcccccccccccccccccCcceEEEEE Q lcl|NC_018856. 39 PDTQLDGAAVRRELLEDQVKMLAFSSNDFTIYPLINKQQVNSTVAKYAVFNQHGRTGHSRFVREVGVASINDPNIRQKTV 118 (479) Q Consensus 39 p~~~~~gaalr~esld~~~~~l~~~~~~f~f~~~i~k~~~~stv~eY~~~~~~G~~g~~~fv~E~g~~~~~d~~~~r~~~ 118 (479) =.+.+.|+.|-.+.+.++|...... +..+.+..+..+..+---+|.++ .+.....+++|++..+.+++.+...+. T Consensus 1 mat~~~gg~lvP~~~~~~ii~~~~~--~s~i~~~~~~i~~~~~~~~~p~~---~~~~~a~wv~Eg~~~~~~~~~f~~v~l 75 (311) T protein:vir:81 1 MVALATGTFQLPKHLVPGVWQKAQG--QSVLARLSMAEPQEFGEQQYMTL---TAPPRGEVVGEGAQKSESTATFAPVTA 75 (311) T ss_pred CceecCCceEcchhHHHHHHHHHHh--cchhhhhcceeecCCCceEEEEE---eCCceeEEeecCcccccccceeeEEEE Confidence 2333345666666676666433322 22333433333333222334443 233456789999999999999999999 Q ss_pred EEEeeeehhhhhhhHhhh--cchhhHHHHHHHHHHHHHHHHHHHHHhhcccccCCCCCCcccchhhhHHHhhccCCCEEE Q lcl|NC_018856. 119 QMKFLSDTKQQSLAAGLV--NNIADPMTILTEDAIAVIAKSIEWAIFYGDAALSSEADGQAGIEFDGLHKLIDQDTNVID 196 (479) Q Consensus 119 ~~k~l~~~~~vs~~~~lv--n~~~Dp~~~~~~~ai~~~~~~iE~a~f~Gd~~l~~~~~~~~gleFDGl~~~I~~~~NviD 196 (479) ..+=++.--.+|.-+-.. ....+.+....+.....+++.++.++|+|+.+-. |..+.|+.+.+-...+++. T Consensus 76 ~~~kl~~~~~iS~ell~~~~d~~~~l~~~i~~~la~ai~~~~d~a~l~G~~~~~-------~~~~~gi~~~~~~~~~~~~ 148 (311) T protein:vir:81 76 IPRKVQVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLT-------GAALSGSPAKILDTTNIVE 148 (311) T ss_pred eeEEEEEeehhhHHHhhcCcccHHHHHHHHHHHHHHHHHHHHHHhhhccccCCC-------Ccccccccccccccceeee Confidence 999888777777664332 2334677888888889999999999999986432 4567899998876667776 Q ss_pred ccCCC--CCHHHHhhhhhhhhhccCceEEEecChHHhhhHHHHhhCcceeeeccCCC----cceeeeehh--hhcCCCcc Q lcl|NC_018856. 197 LKGAR--LDEATLNKAAVIVGKGYGRATDAFMPIGVQADFTNNLLDRQRVIQPSTAG----GFSTGFSIN--QFLSTRGA 268 (479) Q Consensus 197 arG~~--l~~~~l~~aa~~i~~~fG~~td~~mp~~vka~f~~~~~~~qrv~~~~n~g----~~~~G~~I~--~~~s~~G~ 268 (479) .-+.- .....|.++-..+..+.+.++-..|++.+...+...-...-|.+.+.... .--.|++|- +.+- .+. T Consensus 149 ~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~vmn~~~~~~l~~lkd~~G~~l~~~~~~~~~~~tl~G~Pv~~~~~i~-~~~ 227 (311) T protein:vir:81 149 LTTGTSATPDLAVEAAVGLVLGDNLSPDGVALDNTFSFMLATQRDSQGRKLYPELGFGTDVASFAGLNAAVSDTVR-GGP 227 (311) T ss_pred ecccccchHHHHHHHHHHHhhhcCCCceEEEEcHHHHHHHHhhhccCCCeeecCccccCCCceecceeEEeccccc-ccc Confidence 65533 22344556655666777888889999999988865433333333332211 112333221 1110 111 Q ss_pred eeccc------------ceecCCCceecccCCcCCCCCCCceeEeeee-ccC-CCCCCCcccccceEEEEEEEcCcCccc Q lcl|NC_018856. 269 INLHG------------STIMENDNILLEGRNPEPNAPQAPASVVASI-VDD-KKGGFRDEDIKTHSYKVVVHSDDAESL 334 (479) Q Consensus 269 I~l~~------------s~~m~~~~~L~e~~~~~~~AP~~pa~v~at~-~t~-~~G~f~~~d~gty~YkVtavn~~GES~ 334 (479) +...+ ..++.+..-+.=+. ..........- -.+ ....|.. +...|++...-+-.=-- T Consensus 228 ~~~~~~~~~~~~~~~~~~~~~gDfs~~~i~~------~~~~~~~~~~~~~~~~~~~~~~~---~~v~~r~~~r~d~~v~~ 298 (311) T protein:vir:81 228 EAVTASTGVYRTTNPNVKAIAGDFSAFRWGV------QVSIPLELIEFGDPDGLGDLKRQ---NQIAIRAEVVYGIGIMS 298 (311) T ss_pred cccccccchhcccCCccEEEEEecccEEEEE------eccceEEEeccCCCCcchhhhhc---CcEEEEEEEEeccEeec Confidence 00000 00111100000000 00000000000 000 0011211 12344443322222112 Q ss_pred cccceeeeeecCCce Q lcl|NC_018856. 335 PSEAVTAAVAKKDNT 349 (479) Q Consensus 335 pS~~vt~Tv~~~g~s 349 (479) |...+-.+-.. .+ T Consensus 299 ~~a~~~l~~a~--~~ 311 (311) T protein:vir:81 299 TDAFAVVRDAD--ES 311 (311) T ss_pred ccceEEEEeec--cC Confidence 22222222111 11 No 35 >protein:vir:2430 Length: 318 # NCBI annotation: major head subunit # Family: family:all:507 # MgeID: mge:52 # MgeName: D29 # Cross-refs: genbank:acc:NP_046832;genbank:gi:9630400;genbank:GeneID:1261582 Probab=97.31 E-value=2.5e-05 Score=45.83 Aligned_cols=293 Identities=11% Similarity=0.071 Sum_probs=132.7 Q ss_pred CCccchHHHHHHHHHhhhcCCCcChhhccCccccchhhhhhhhhhheeccccccchhhccccchhHHHHhhhhhhccCcc Q lcl|NC_018856. 15 LPVEAEAELAELVSKSFTTGYGITPDTQLDGAAVRRELLEDQVKMLAFSSNDFTIYPLINKQQVNSTVAKYAVFNQHGRT 94 (479) Q Consensus 15 ~~~~~~~~~~e~~~Ks~tag~~~~p~~~~~gaalr~esld~~~~~l~~~~~~f~f~~~i~k~~~~stv~eY~~~~~~G~~ 94 (479) +-...+-..|+. +..+++ -++++.+-.+.+..++..+. .+.-.+.+.....++.+.-.+|.++. +. T Consensus 1 ~~~~~~~~~e~~--~~~~~~-------~~~~~~~ip~~~~~~ii~~~--~~~~~l~~~~~~~~~~~~~~~ip~~~---~~ 66 (318) T protein:vir:24 1 MAAGTAFAVDHA--QIAQTG-------DTMFKGYLEPEQAKDYFAEA--EKTSIVQQFAQKVPMGTTGQKIPHWV---GD 66 (318) T ss_pred CCCCCCCCHHHH--Hhhccc-------CcccceeechhHHHHHHHHH--HhhchhhhhcceeeccCCceEEEEEe---CC Confidence 111111112211 111111 12344455555555553332 22234555555566655545555544 34 Q ss_pred cccccccccccccccCcceEEEEEEEEeeeehhhhhhhHhhhcchhhHHHHHHHHHHHHHHHHHHHHHhhcccccCCCCC Q lcl|NC_018856. 95 GHSRFVREVGVASINDPNIRQKTVQMKFLSDTKQQSLAAGLVNNIADPMTILTEDAIAVIAKSIEWAIFYGDAALSSEAD 174 (479) Q Consensus 95 g~~~fv~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lvn~~~Dp~~~~~~~ai~~~~~~iE~a~f~Gd~~l~~~~~ 174 (479) +...+++|++..+.+++.+.+.....+=++-.-.+|.-+ +.++..|.+....+.....+++.+|.++++|+.+-.+ T Consensus 67 ~~a~~v~Eg~~~~~~~~~f~~i~~~~~k~~~~~~iS~e~-l~ds~~~~~~~i~~~l~~~~~~~~d~a~l~G~g~~~~--- 142 (318) T protein:vir:24 67 VSAQWIGEGDMKPITKGNMTSQTIAPHKIATIFVASAET-VRANPANYLGTMRTKVATAFAMAFDGAAMHGTDSPFP--- 142 (318) T ss_pred cceEEecCCccccccccceeEEEEeeEEEEEeehhhHHH-hhcChHHHHHHHHHHHHHHHHHHHHHhhhcccCCCCC--- Confidence 557899999999999999999999999998877777732 2345678889999999999999999999999875222 Q ss_pred CcccchhhhHHHhhccCCCEEEccC-CCCCHHHHhhhhhhhhhccCceEEEecChHHhhhHHHHhhCcceeeeccCCCcc Q lcl|NC_018856. 175 GQAGIEFDGLHKLIDQDTNVIDLKG-ARLDEATLNKAAVIVGKGYGRATDAFMPIGVQADFTNNLLDRQRVIQPSTAGGF 253 (479) Q Consensus 175 ~~~gleFDGl~~~I~~~~NviDarG-~~l~~~~l~~aa~~i~~~fG~~td~~mp~~vka~f~~~~~~~qrv~~~~n~g~~ 253 (479) .|+...+... +.-...+ .....+.+.++...+...+....-..|+....+.+...-...-|.+...+..+. T Consensus 143 -------~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~~ 214 (318) T protein:vir:24 143 -------TYIGQTTKAI-SIADTTGATTVYDQVAVNGLSLLVNDGKKWTHTLLDDITEPILNGAKDQNGRPLFIESTYGE 214 (318) T ss_pred -------cccccccccc-cccccccccchHHHHHHHHHHhhccccCCCCEEEEcHHHHHHHHHhhccCCceeecCccccC Confidence 2333333211 1111111 122233344555566667777778999999999887654443344433332211 Q ss_pred ---------eeeeehh--hhcCCCcce-ecccc---eec-CCCceecccCCcCCCCCCCceeEeeeeccCCCCC----CC Q lcl|NC_018856. 254 ---------STGFSIN--QFLSTRGAI-NLHGS---TIM-ENDNILLEGRNPEPNAPQAPASVVASIVDDKKGG----FR 313 (479) Q Consensus 254 ---------~~G~~I~--~~~s~~G~I-~l~~s---~~m-~~~~~L~e~~~~~~~AP~~pa~v~at~~t~~~G~----f~ 313 (479) -.|+++- ... +.|.. -+-|+ .++ .+..+.++- ... . .-..-++.++. |. T Consensus 215 ~~~~~~~~~i~g~pv~~~~~~-~~~~~~~~~gdfs~~~~~~~~~l~i~~--~~~--~------~~~~~~~~~~~~~~~f~ 283 (318) T protein:vir:24 215 AASPFRSGRIVARPTILSDHV-VEGTTVGFMGDFSQLIWGQIGGLSFDV--TDQ--A------TLNLGTVESPNFVSLWQ 283 (318) T ss_pred ccccccCceEEEEeeEEeCCC-CCCccEEEEeecceEEEEEecCeEEEE--eec--c------ceeccccccccchhhhh Confidence 2233222 111 11211 11111 000 000000000 000 0 00000000000 11 Q ss_pred cccccceEEEEE-EEc---CcCccccccceeeeeecCCce Q lcl|NC_018856. 314 DEDIKTHSYKVV-VHS---DDAESLPSEAVTAAVAKKDNT 349 (479) Q Consensus 314 ~~d~gty~YkVt-avn---~~GES~pS~~vt~Tv~~~g~s 349 (479) . +...|++. .++ .+.++. ..++.-..+.+.. T Consensus 284 ~---~~~~~r~~~r~d~~v~~~~a~--~~i~~~~a~~~~~ 318 (318) T protein:vir:24 284 H---NLVAVRVEAEYAFHCNDAEAF--VALTNVVSGGGEG 318 (318) T ss_pred c---CcEEEEEEEEEccEEecccce--EEEEeeccCCCCC Confidence 0 00001000 000 000100 0000000000000 No 36 >protein:vir:94142 Length: 304 # NCBI annotation: ORF013 # Family: family:all:507 # MgeID: mge:1494 # MgeName: 96 # Cross-refs: genbank:acc:YP_240234;genbank:gi:66395898;genbank:GeneID:5133311 Probab=97.29 E-value=1.4e-05 Score=47.11 Aligned_cols=297 Identities=10% Similarity=0.034 Sum_probs=151.8 Q ss_pred HH-HhhhcCCCcChhhccCccccchhhhhhhhhhheeccccccchhhccccchhHHHHhhhhhhccCccccccccccccc Q lcl|NC_018856. 27 VS-KSFTTGYGITPDTQLDGAAVRRELLEDQVKMLAFSSNDFTIYPLINKQQVNSTVAKYAVFNQHGRTGHSRFVREVGV 105 (479) Q Consensus 27 ~~-Ks~tag~~~~p~~~~~gaalr~esld~~~~~l~~~~~~f~f~~~i~k~~~~stv~eY~~~~~~G~~g~~~fv~E~g~ 105 (479) .. ..++++.- .+-.+|++|-.+.+.+++..... +.-.+++...+.++.+-..+|.++. +.....+++|++. T Consensus 1 ma~~~~~~~~~---~~t~~gg~lip~~~~~~ii~~~~--~~~~l~~~~~~~~~~~~~~~ip~~~---~~~~a~~v~E~~~ 72 (304) T protein:vir:94 1 MATPTYTPGNV---ILSDFKNGVIPAEQGTLIMKDIM--ANSAIMKLAKNEPMTAQKKKFTYLA---KGVGAYWVSETER 72 (304) T ss_pred Ccccccccccc---cccCCCceecchhHHHHHHHHHH--hccchhhhcceeeccCCceEEEEEe---CCcceEEeecCcc Confidence 00 11122221 12234556777667666643332 2233556566666665444454433 3445679999999 Q ss_pred ccccCcceEEEEEEEEeeeehhhhhhhHhhhcchhhHHHHHHHHHHHHHHHHHHHHHhhcccccCCCCCCcccchhhhHH Q lcl|NC_018856. 106 ASINDPNIRQKTVQMKFLSDTKQQSLAAGLVNNIADPMTILTEDAIAVIAKSIEWAIFYGDAALSSEADGQAGIEFDGLH 185 (479) Q Consensus 106 ~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lvn~~~Dp~~~~~~~ai~~~~~~iE~a~f~Gd~~l~~~~~~~~gleFDGl~ 185 (479) .+..++.+.......+=++.--.+|.-+ +.++..|.+....+.--..+++.+|.++|+|+.+-.+. |..-+|+. T Consensus 73 ~~~~~~~~~~i~~~~~k~~~~~~iS~el-l~ds~~~l~~~i~~~l~~~ia~~~d~~~l~G~g~~~~~-----~~~~~~~~ 146 (304) T protein:vir:94 73 IQTSKPEYAQAEMEAKKIGVIIPLSKEF-LKWTAKDFFNEVKPLIAEAFYKAFDQAVIFGTKSPYNT-----STSGKPLV 146 (304) T ss_pred cccccceeeEEEEEEEEEEEeehhhHHH-HhcchHHHHHHHHHHHHHHHHHHHHhhheeccCCCccc-----cccccccc Confidence 9999999999999999999888888754 34566788888888888999999999999998764331 22234444 Q ss_pred HhhccCCCEEEccCCCCCHHHHhhhhhhhhhccCceEEEecChHHhhhHHHHhhCcceeeeccCCCcceeeeehh--hhc Q lcl|NC_018856. 186 KLIDQDTNVIDLKGARLDEATLNKAAVIVGKGYGRATDAFMPIGVQADFTNNLLDRQRVIQPSTAGGFSTGFSIN--QFL 263 (479) Q Consensus 186 ~~I~~~~NviDarG~~l~~~~l~~aa~~i~~~fG~~td~~mp~~vka~f~~~~~~~qrv~~~~n~g~~~~G~~I~--~~~ 263 (479) .-... ......+.-..-+.|.++.-.+..+|....-.+|+..+.+.+...-...-|++...+.+ --.|++|- ... T Consensus 147 ~~~~~--~~~~~~~~~~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~L~~lkd~~G~~l~~~~~~-~l~G~PV~~~~~~ 223 (304) T protein:vir:94 147 EGAEE--KGNVVTDTNNLYVDLSALMATIEDEELDPNGVLTTRSFRSKMRNALDANDRPLFDANGN-EIMGLPLSYTGAD 223 (304) T ss_pred ccccc--cccccccccchHHHHHHHHHHhhhccCCcCEEEEcHHHHHHHHHhhccCCcEeecCCCc-cccceeeEEeccc Confidence 44432 23333444455666676666667777777789999999999976555555666554433 23455542 111 Q ss_pred C---CCcceecccceecCCCceecccCCcCCCCCCCceeEeeeeccCCCCCCCcc-cccceEEEEEEEcCcCccccccce Q lcl|NC_018856. 264 S---TRGAINLHGSTIMENDNILLEGRNPEPNAPQAPASVVASIVDDKKGGFRDE-DIKTHSYKVVVHSDDAESLPSEAV 339 (479) Q Consensus 264 s---~~G~I~l~~s~~m~~~~~L~e~~~~~~~AP~~pa~v~at~~t~~~G~f~~~-d~gty~YkVtavn~~GES~pS~~v 339 (479) . ..+. -+-|+ +.+ .++....-..-..--..+... ....+..|+.-+. ..+.-.|++...-+-.=--|...+ T Consensus 224 ~~~~~~~~-~~~gd--~~~-~~~~~~~~~~i~~~~e~~~~~-~~~~~~~g~~~~~f~~~~~~~r~~~r~~~~v~~~~a~~ 298 (304) T protein:vir:94 224 VYDKKKSL-ALMGD--WDY-ARYGILQGIEYAISEDATLTT-LQASDASGQPVSLFERDMFALRATMHIAYMNVKPEAFA 298 (304) T ss_pred ccCCCCcE-EEEEe--hhh-EEEEEecceEEEEeecceeee-ecccccCccchhhhhcCcEEEEEEEEeccEeecccceE Confidence 1 1111 11111 000 111110000000000000000 0111222221100 011223333322222211122223 Q ss_pred eeeeec Q lcl|NC_018856. 340 TAAVAK 345 (479) Q Consensus 340 t~Tv~~ 345 (479) -.+.+- T Consensus 299 ~l~~a~ 304 (304) T protein:vir:94 299 TLKPTE 304 (304) T ss_pred EEEecC Confidence 333322 No 37 >protein:vir:105905 Length: 304 # NCBI annotation: major capsid protein # Family: family:all:507 # MgeID: mge:1514 # MgeName: phiETA3 # Cross-refs: genbank:acc:YP_001004375;genbank:gi:122891830;genbank:GeneID:4712376 Probab=97.29 E-value=1.4e-05 Score=47.11 Aligned_cols=297 Identities=10% Similarity=0.034 Sum_probs=151.8 Q ss_pred HH-HhhhcCCCcChhhccCccccchhhhhhhhhhheeccccccchhhccccchhHHHHhhhhhhccCccccccccccccc Q lcl|NC_018856. 27 VS-KSFTTGYGITPDTQLDGAAVRRELLEDQVKMLAFSSNDFTIYPLINKQQVNSTVAKYAVFNQHGRTGHSRFVREVGV 105 (479) Q Consensus 27 ~~-Ks~tag~~~~p~~~~~gaalr~esld~~~~~l~~~~~~f~f~~~i~k~~~~stv~eY~~~~~~G~~g~~~fv~E~g~ 105 (479) .. ..++++.- .+-.+|++|-.+.+.+++..... +.-.+++...+.++.+-..+|.++. +.....+++|++. T Consensus 1 ma~~~~~~~~~---~~t~~gg~lip~~~~~~ii~~~~--~~~~l~~~~~~~~~~~~~~~ip~~~---~~~~a~~v~E~~~ 72 (304) T protein:vir:10 1 MATPTYTPGNV---ILSDFKNGVIPAEQGTLIMKDIM--ANSAIMKLAKNEPMTAQKKKFTYLA---KGVGAYWVSETER 72 (304) T ss_pred Ccccccccccc---cccCCCceecchhHHHHHHHHHH--hccchhhhcceeeccCCceEEEEEe---CCcceEEeecCcc Confidence 00 11122221 12234556777667666643332 2233556566666665444454433 3445679999999 Q ss_pred ccccCcceEEEEEEEEeeeehhhhhhhHhhhcchhhHHHHHHHHHHHHHHHHHHHHHhhcccccCCCCCCcccchhhhHH Q lcl|NC_018856. 106 ASINDPNIRQKTVQMKFLSDTKQQSLAAGLVNNIADPMTILTEDAIAVIAKSIEWAIFYGDAALSSEADGQAGIEFDGLH 185 (479) Q Consensus 106 ~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lvn~~~Dp~~~~~~~ai~~~~~~iE~a~f~Gd~~l~~~~~~~~gleFDGl~ 185 (479) .+..++.+.......+=++.--.+|.-+ +.++..|.+....+.--..+++.+|.++|+|+.+-.+. |..-+|+. T Consensus 73 ~~~~~~~~~~i~~~~~k~~~~~~iS~el-l~ds~~~l~~~i~~~l~~~ia~~~d~~~l~G~g~~~~~-----~~~~~~~~ 146 (304) T protein:vir:10 73 IQTSKPEYAQAEMEAKKIGVIIPLSKEF-LKWTAKDFFNEVKPLIAEAFYKAFDQAVIFGTKSPYNT-----STSGKPLV 146 (304) T ss_pred cccccceeeEEEEEEEEEEEeehhhHHH-HhcchHHHHHHHHHHHHHHHHHHHHhhheeccCCCccc-----cccccccc Confidence 9999999999999999999888888754 34566788888888888999999999999998764331 22234444 Q ss_pred HhhccCCCEEEccCCCCCHHHHhhhhhhhhhccCceEEEecChHHhhhHHHHhhCcceeeeccCCCcceeeeehh--hhc Q lcl|NC_018856. 186 KLIDQDTNVIDLKGARLDEATLNKAAVIVGKGYGRATDAFMPIGVQADFTNNLLDRQRVIQPSTAGGFSTGFSIN--QFL 263 (479) Q Consensus 186 ~~I~~~~NviDarG~~l~~~~l~~aa~~i~~~fG~~td~~mp~~vka~f~~~~~~~qrv~~~~n~g~~~~G~~I~--~~~ 263 (479) .-... ......+.-..-+.|.++.-.+..+|....-.+|+..+.+.+...-...-|++...+.+ --.|++|- ... T Consensus 147 ~~~~~--~~~~~~~~~~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~L~~lkd~~G~~l~~~~~~-~l~G~PV~~~~~~ 223 (304) T protein:vir:10 147 EGAEE--KGNVVTDTNNLYVDLSALMATIEDEELDPNGVLTTRSFRSKMRNALDANDRPLFDANGN-EIMGLPLSYTGAD 223 (304) T ss_pred ccccc--cccccccccchHHHHHHHHHHhhhccCCcCEEEEcHHHHHHHHHhhccCCcEeecCCCc-cccceeeEEeccc Confidence 44432 23333444455666676666667777777789999999999976555555666554433 23455542 111 Q ss_pred C---CCcceecccceecCCCceecccCCcCCCCCCCceeEeeeeccCCCCCCCcc-cccceEEEEEEEcCcCccccccce Q lcl|NC_018856. 264 S---TRGAINLHGSTIMENDNILLEGRNPEPNAPQAPASVVASIVDDKKGGFRDE-DIKTHSYKVVVHSDDAESLPSEAV 339 (479) Q Consensus 264 s---~~G~I~l~~s~~m~~~~~L~e~~~~~~~AP~~pa~v~at~~t~~~G~f~~~-d~gty~YkVtavn~~GES~pS~~v 339 (479) . ..+. -+-|+ +.+ .++....-..-..--..+... ....+..|+.-+. ..+.-.|++...-+-.=--|...+ T Consensus 224 ~~~~~~~~-~~~gd--~~~-~~~~~~~~~~i~~~~e~~~~~-~~~~~~~g~~~~~f~~~~~~~r~~~r~~~~v~~~~a~~ 298 (304) T protein:vir:10 224 VYDKKKSL-ALMGD--WDY-ARYGILQGIEYAISEDATLTT-LQASDASGQPVSLFERDMFALRATMHIAYMNVKPEAFA 298 (304) T ss_pred ccCCCCcE-EEEEe--hhh-EEEEEecceEEEEeecceeee-ecccccCccchhhhhcCcEEEEEEEEeccEeecccceE Confidence 1 1111 11111 000 111110000000000000000 0111222221100 011223333322222211122223 Q ss_pred eeeeec Q lcl|NC_018856. 340 TAAVAK 345 (479) Q Consensus 340 t~Tv~~ 345 (479) -.+.+- T Consensus 299 ~l~~a~ 304 (304) T protein:vir:10 299 TLKPTE 304 (304) T ss_pred EEEecC Confidence 333322 No 38 >protein:vir:4953 Length: 397 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:108 # MgeName: Sfi19 # Cross-refs: genbank:acc:NP_049929;genbank:gi:9632900;genbank:GeneID:1262076 Probab=97.29 E-value=0.0001 Score=42.41 Aligned_cols=316 Identities=13% Similarity=0.063 Sum_probs=139.2 Q ss_pred CCccc----hhhhhhhhcCCccchHHHHHHHHHhh----hcCCCc-----ChhhccCccccchhhhhhhhhhheeccccc Q lcl|NC_018856. 1 MTELK----KEAEAKNKKLPVEAEAELAELVSKSF----TTGYGI-----TPDTQLDGAAVRRELLEDQVKMLAFSSNDF 67 (479) Q Consensus 1 ~~~~~----~~~~~~~~~~~~~~~~~~~e~~~Ks~----tag~~~-----~p~~~~~gaalr~esld~~~~~l~~~~~~f 67 (479) +.+.. ...+.+..+.....+.+..+...|+| ..+... .-.+.++|+.|.++.+.++|..+..... T Consensus 60 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~t~~~gg~~vP~~~~~~ii~~~~~~~-- 137 (397) T protein:vir:49 60 YTEARANEVANMSEEEKKPLTKSEEEVKAGFVKDFKNLVRGRYQNLLDSKTDASGSDAGLTIPQDIQTAIHTLVSQYD-- 137 (397) T ss_pred HHHHHHHhhhccccccccccccchhHHHHHHHHHHHHHHhcchhHHHHHhhccccccCcccccHhHHHHHHHHHHhhh-- Confidence 00000 00001111111111222111122332 222111 1123356888888888888855444433 Q ss_pred cchhhccccchhHHHHhhhhhhccCcccccccccccccc-cccCcceEEEEEEEEeeeehhhhhhhHhhhcchhhHHHHH Q lcl|NC_018856. 68 TIYPLINKQQVNSTVAKYAVFNQHGRTGHSRFVREVGVA-SINDPNIRQKTVQMKFLSDTKQQSLAAGLVNNIADPMTIL 146 (479) Q Consensus 68 ~f~~~i~k~~~~stv~eY~~~~~~G~~g~~~fv~E~g~~-~~~d~~~~r~~~~~k~l~~~~~vs~~~~lvn~~~Dp~~~~ 146 (479) .+++.+...++.+...+|.......+.+...+++|++.. +.+++.+......++-++.-..+|.-+ +.++..|.+... T Consensus 138 ~l~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~i~~~~~k~~~~~~iS~el-l~ds~~~l~~~i 216 (397) T protein:vir:49 138 SLQEYVNVENVTTLTGSRVYEKWTDITGLANIDDEAGKIADVDDPKLSLIKYTIKRYAGISTVTNSL-LADSAENILAWL 216 (397) T ss_pred hHHhhhceeecccCccceEEEeeccCCcceeeecCccccccccccceeeEEeeeeeEEeeehhHHHH-HhhhHHHHHHHH Confidence 455555555565444444333333445567899999885 578999999999999999988888764 345566788888 Q ss_pred HHHHHHHHHHHHHHHHhhcccccCCCCCCcccchhhhHHHhhccCCCEEEccCCCCCHHHHhhhhhhhhhccCceEEEec Q lcl|NC_018856. 147 TEDAIAVIAKSIEWAIFYGDAALSSEADGQAGIEFDGLHKLIDQDTNVIDLKGARLDEATLNKAAVIVGKGYGRATDAFM 226 (479) Q Consensus 147 ~~~ai~~~~~~iE~a~f~Gd~~l~~~~~~~~gleFDGl~~~I~~~~NviDarG~~l~~~~l~~aa~~i~~~fG~~td~~m 226 (479) .+.-...++..++.+++.|+..-.+... .+-+| .|.++-.-+..+|...+..+| T Consensus 217 ~~~l~~~~~~~~d~ai~~G~g~~~~~~~---~~~~d-----------------------~i~~~~~~l~~~~~~~a~~vm 270 (397) T protein:vir:49 217 SGWIAKKVVVTRNKAILEAIAALPTKPT---LTKWD-----------------------DIIDLEAKVDPAIKQTSFFLT 270 (397) T ss_pred HHHHHHHHHHHHHHHHHhhccccccccc---cccHH-----------------------HHHHHHHhhhhhhcCCCEEEE Confidence 8899999999999999999877543211 12233 333333344456666677899 Q ss_pred ChHHhhhHHHHhhCcceeeeccCCCcceeeeehhhhcCCCcceecccceecCCCceecccCCcCCCCCCCceeEeeeecc Q lcl|NC_018856. 227 PIGVQADFTNNLLDRQRVIQPSTAGGFSTGFSINQFLSTRGAINLHGSTIMENDNILLEGRNPEPNAPQAPASVVASIVD 306 (479) Q Consensus 227 p~~vka~f~~~~~~~qrv~~~~n~g~~~~G~~I~~~~s~~G~I~l~~s~~m~~~~~L~e~~~~~~~AP~~pa~v~at~~t 306 (479) +..+.+.+...-...-|.+.+.+.... .+.+++..+-...+... .| . T Consensus 271 n~~~~~~l~~lkd~~G~~l~~~~~~~~------------------~~~~l~G~PV~~~~~~~----~~-----------~ 317 (397) T protein:vir:49 271 NTSGFTALKKVKNALGDYLMERDVKSP------------------TGYSIDGFAVKEVADRW----LA-----------N 317 (397) T ss_pred cHHHHHHHHHhhcCCCceeeccCcCCC------------------CCceecceeeEEecccc----cc-----------c Confidence 999988776543332233332222110 01223332222111100 00 0 Q ss_pred CCCCCCCcccccceEEEEEEEcCcCccccccceeeeeecCCceEEEEEEec----CCCCcccceEEEEEecC---CCcce Q lcl|NC_018856. 307 DKKGGFRDEDIKTHSYKVVVHSDDAESLPSEAVTAAVAKKDNTVKLEVKLA----SLYQAQPQFISVYREGT---ETGHY 379 (479) Q Consensus 307 ~~~G~f~~~d~gty~YkVtavn~~GES~pS~~vt~Tv~~~g~sv~ltIT~~----~~~~a~~~~y~IYR~~~---~~G~y 379 (479) ++.+. ..--.|.++.-+..+++.|-+ +.+... +..+ ...++.+.|=+- ....| T Consensus 318 ~~~~~-~~i~~gd~~~~~~~~~~~~~~------------------i~~~~~~~~~~~~~-~~~~r~~~r~d~~~~~~~a~ 377 (397) T protein:vir:49 318 GTGGA-MPLYFGDLKQAVTLFDRQHMS------------------LLSTNIGGGAFETD-TTKVRVIDRFDVVATDTEAF 377 (397) T ss_pred ccCCc-eeEEEeeccceEEEEeecceE------------------EEEeccccchhhcC-ceeEEEEeeeCcEEecccce Confidence 00000 000011222112222222211 000000 0000 001111111000 00001 Q ss_pred EEEEeeeeeeecCCceEEEeeccccCCCcccc-ee Q lcl|NC_018856. 380 FLIARVPVSKVNDQGVIEVLDRNQVIPETTDV-FV 413 (479) Q Consensus 380 ~li~rv~vs~~n~~g~T~ftD~N~~iPgT~~~-fv 413 (479) .++ .++..... ||+.-. -| T Consensus 378 ~~~--------------~~~~~~~~-~~~~~~~~~ 397 (397) T protein:vir:49 378 VPA--------------SFKAIADQ-KGNLGSTAV 397 (397) T ss_pred EEE--------------EeecccCC-CCCcccccC Confidence 111 11111111 111100 00 No 39 >protein:vir:97053 Length: 390 # NCBI annotation: putative head protein # Family: family:all:585 # MgeID: mge:1653 # MgeName: OP1 # Cross-refs: genbank:acc:YP_453565;genbank:gi:84662600;genbank:GeneID:5142468 Probab=97.23 E-value=6.8e-05 Score=43.44 Aligned_cols=296 Identities=11% Similarity=0.030 Sum_probs=139.3 Q ss_pred CCccchh-------hhhhhhcCCc-----cchHHHHHHHHHhh---------------hcCCCcChhhccCccccchhhh Q lcl|NC_018856. 1 MTELKKE-------AEAKNKKLPV-----EAEAELAELVSKSF---------------TTGYGITPDTQLDGAAVRRELL 53 (479) Q Consensus 1 ~~~~~~~-------~~~~~~~~~~-----~~~~~~~e~~~Ks~---------------tag~~~~p~~~~~gaalr~esl 53 (479) +.+.+.+ .........+ ...++. +.+.+.+ +++. + .+-.+++.|-.+.+ T Consensus 54 i~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~--~-~~~~~~g~lip~~~ 129 (390) T protein:vir:97 54 VQAARQRVAELEGNGAGGDVQHVSVGDMFVASEQF-QASTGRWNDRSARATMNIKAALNTAS--T-DAAGSAGALTTPNR 129 (390) T ss_pred HHHHHHHHHHHHhcccccccccccchhhhhhhHHH-HHHHHHhhhhhhhhhhHHHHHHHhhh--c-ccccccccccchhh Confidence 0000000 0000000000 000110 1111111 1111 1 12233444554445 Q ss_pred hhhhhhheeccccccchhhccccchhHHHHhhhhhhccCcccccccccccccccccCcceEEEEEEEEeeeehhhhhhhH Q lcl|NC_018856. 54 EDQVKMLAFSSNDFTIYPLINKQQVNSTVAKYAVFNQHGRTGHSRFVREVGVASINDPNIRQKTVQMKFLSDTKQQSLAA 133 (479) Q Consensus 54 d~~~~~l~~~~~~f~f~~~i~k~~~~stv~eY~~~~~~G~~g~~~fv~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~ 133 (479) -+++..+.. ..-.+++.+...++.+....|.++.. +.+...|++|++..+.+++.+.+....++-++..-.+|.-+ T Consensus 130 ~~~ii~~~~--~~~~i~~~~~~~~~~~~~~~~~~~~~--~~~~a~~v~Eg~~~~~~~~~~~~i~~~~~k~~~~~~is~el 205 (390) T protein:vir:97 130 LPGFITPPD--ARLTVRDLIGSGRTDSALIEYVQETG--FVNNAAIVAEGALKPESSLKFAKKTDTTHVIAHTMKATRQI 205 (390) T ss_pred hHHHHHHHh--hhhhhHhhcceeeccCCceEEEEEec--CCcceeeecCCccccccccceeEEEEeeeeEEEeehhhHHH Confidence 555544332 22346666666666655555555443 33456799999999999999999999999999988888864 Q ss_pred hhhcchhhHHHHHHHHHHHHHHHHHHHHHhhcccccCCCCCCcccchhhhHHHhhccCCCEEEccCCCCCHHHHhhhhhh Q lcl|NC_018856. 134 GLVNNIADPMTILTEDAIAVIAKSIEWAIFYGDAALSSEADGQAGIEFDGLHKLIDQDTNVIDLKGARLDEATLNKAAVI 213 (479) Q Consensus 134 ~lvn~~~Dp~~~~~~~ai~~~~~~iE~a~f~Gd~~l~~~~~~~~gleFDGl~~~I~~~~NviDarG~~l~~~~l~~aa~~ 213 (479) +.++ .+.+....+.-...+.+.++.++|+|+-. +-++.||.+.... .+..-...+....+.|..+-.. T Consensus 206 -l~ds-~~l~~~i~~~la~a~~~~~d~a~l~G~g~---------~~~p~Gi~~~~~~-~~~~~~~~~~~~~d~~~~~~~~ 273 (390) T protein:vir:97 206 -LSDA-PQLASYMNNRLIRGLKVKEDAEILRGTGA---------NDGLLGLIPQATT-YAAPTTIAGATRVDQLRLAMLQ 273 (390) T ss_pred -HHhH-HHHHHHHHHHHHHHHHHHHHHHHhhcCCC---------Cccccceeecccc-ccccccccccchHHHHHHHHHh Confidence 3333 57888899999999999999999999643 1235777765432 1222223344455666666666 Q ss_pred hhhccCceEEEecChHHhhhHHHHhhCcceeeeccCCCcceeeeehhhhcCCCcceecccceecCCCceecccCCcCCCC Q lcl|NC_018856. 214 VGKGYGRATDAFMPIGVQADFTNNLLDRQRVIQPSTAGGFSTGFSINQFLSTRGAINLHGSTIMENDNILLEGRNPEPNA 293 (479) Q Consensus 214 i~~~fG~~td~~mp~~vka~f~~~~~~~qrv~~~~n~g~~~~G~~I~~~~s~~G~I~l~~s~~m~~~~~L~e~~~~~~~A 293 (479) +...|...+-++|++.+...+...-...-|++.+...+ .. +.+++..+-.. . +.. T Consensus 274 ~~~~~~~~~~~v~n~~~~~~L~~lkd~~G~~l~~~~~~-~~------------------~~~l~G~pV~~-~-----~~~ 328 (390) T protein:vir:97 274 ASLAEYPASGIVINPIDWAAIELAKDANNQYLIGNARG-TL------------------TPTLWGLPVVA-T-----QAM 328 (390) T ss_pred hccccCCCCEEEEcHHHHHHHHHhhcCCCceeecCccC-CC------------------CceecceeeEE-c-----CCC Confidence 67777778889999999888875444444454432111 10 00111111111 0 001 Q ss_pred CCCceeEee-------------eec-cCCCCCCCcccccceEEEEEEEcCcCccccccceeeeee Q lcl|NC_018856. 294 PQAPASVVA-------------SIV-DDKKGGFRDEDIKTHSYKVVVHSDDAESLPSEAVTAAVA 344 (479) Q Consensus 294 P~~pa~v~a-------------t~~-t~~~G~f~~~d~gty~YkVtavn~~GES~pS~~vt~Tv~ 344 (479) |......-. +.. ......|.. +...|++...-+-.---|...+.++.+ T Consensus 329 ~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~f~~---~~~~~r~~~r~d~~v~~~~a~v~~~~a 390 (390) T protein:vir:97 329 APGEFLVGAFDLAAQIFDQWDARVEIGYVNDDFQR---NMVTVLAEERLALVVYRPEALITGSFA 390 (390) T ss_pred CCCcEEEEeccceEEEEEecceEEEEeeccccccc---CcEEEEEEEeeccEEeccccEEEEEeC Confidence 110000000 000 000001110 011122111111111111111222222 No 40 >protein:vir:98487 Length: 681 # NCBI annotation: hypothetical protein predicted by GeneMark # Family: family:all:780 # MgeID: mge:1592 # MgeName: BMP-1 # Cross-refs: genbank:acc:NP_996575;genbank:gi:45569506;genbank:GeneID:2767815 Probab=97.23 E-value=2.7e-06 Score=51.13 Aligned_cols=291 Identities=18% Similarity=0.136 Sum_probs=150.0 Q ss_pred hhhhHhhhcch----hhHH--HHHHHHHHHHHHHHHHHHH--hhcccccCCCCCCcccchhhhHHHhhcc---------- Q lcl|NC_018856. 129 QSLAAGLVNNI----ADPM--TILTEDAIAVIAKSIEWAI--FYGDAALSSEADGQAGIEFDGLHKLIDQ---------- 190 (479) Q Consensus 129 vs~~~~lvn~~----~Dp~--~~~~~~ai~~~~~~iE~a~--f~Gd~~l~~~~~~~~gleFDGl~~~I~~---------- 190 (479) +++..-+++++ -+|+ .+--.+.-.+-++.+|.++ -+|-..--| |++|-|-++-=.. T Consensus 1 m~~~~~~~~~f~~Ge~~p~l~~r~D~~~y~~~~~~~~N~~~~~~G~~~~R~------g~~~~~~~~~~~~~~rlipf~~~ 74 (681) T protein:vir:98 1 MSNVRVLQRSFGGGEISPEMFGRIDDVKYQSGLAICRNFVVKPQGPAENRA------GFAFVREVKDSAKKVRLIPFTYS 74 (681) T ss_pred CcceeEeeeecCCceeeeeeccchhHHHHHHHHHHhcCcEEEecCCceecC------hhHhhhhcCCCCCcEEEEEEEeC Confidence 11111123333 2555 4444455556666666553 445444433 6788776553211 Q ss_pred CCCEEEccCCCCCHHHHhhhhhhhhhccCceEEEecChHHhhhHHHHhhCcceeeeccCCCcceeeeehhhhcCCCccee Q lcl|NC_018856. 191 DTNVIDLKGARLDEATLNKAAVIVGKGYGRATDAFMPIGVQADFTNNLLDRQRVIQPSTAGGFSTGFSINQFLSTRGAIN 270 (479) Q Consensus 191 ~~NviDarG~~l~~~~l~~aa~~i~~~fG~~td~~mp~~vka~f~~~~~~~qrv~~~~n~g~~~~G~~I~~~~s~~G~I~ 270 (479) ..+.+.++ +-.....+-++-|...+--+|+.+...|....++..++.|.++. +.|-.-..+.-.+. T Consensus 75 ~~~~~~l~--------~g~~~~r~~~~~~~~~~~~~~~~~~tpy~~~~l~~l~~~q~aD~------~~i~h~~~~p~~L~ 140 (681) T protein:vir:98 75 VTQTMVIE--------LGAGYFRFHTNGGTLLDGAVPYEIANPYAEADLFNIHYVQSADV------LTLVHPNYAPRELR 140 (681) T ss_pred CCceEEEE--------EeCCeEEEEeCCcEEeeCcEeEEecCCCChhhhcCceEEEEcCE------EEEECCCCcceEEE Confidence 11211111 01112333445555556567888888898888999999998776 55555666777787 Q ss_pred cccceecCCCceecccCCcCCCCCCCceeEeeeeccCCCCCCCcccccceEEEEEEEcCcC--ccccccceeeeeecCCc Q lcl|NC_018856. 271 LHGSTIMENDNILLEGRNPEPNAPQAPASVVASIVDDKKGGFRDEDIKTHSYKVVVHSDDA--ESLPSEAVTAAVAKKDN 348 (479) Q Consensus 271 l~~s~~m~~~~~L~e~~~~~~~AP~~pa~v~at~~t~~~G~f~~~d~gty~YkVtavn~~G--ES~pS~~vt~Tv~~~g~ 348 (479) .++...|.-..+ ...+.|..|+...++.. .++ ..-+++|.|++++..+ +|.++...+++....+. T Consensus 141 r~~~~~W~l~~~------~f~~~p~~p~~~~at~~--~~~-----~~~t~~~~v~avda~t~~~s~~~~~~tvt~~~~~~ 207 (681) T protein:vir:98 141 RLGATNWQLATI------AFTSPVATPTSVTATSN--NKG-----TDYTYRYVVTALDAEGKTESAPSSAGTCTNNLFTN 207 (681) T ss_pred EccCCceEEEEE------Eeccccccceeeeeecc--CCc-----cceeEeEEEEEeecccceeecCCcceEEeeeeecC Confidence 788777664433 23334444554444322 222 1236899999999887 78888888887666665 Q ss_pred eEEEEEEecCCCCcccceEEEEEecCCCcceEEEEeeeeeeecCCceEEEeecccc------CCCcccceecCCc-hhhh Q lcl|NC_018856. 349 TVKLEVKLASLYQAQPQFISVYREGTETGHYFLIARVPVSKVNDQGVIEVLDRNQV------IPETTDVFVGELT-PNVV 421 (479) Q Consensus 349 sv~ltIT~~~~~~a~~~~y~IYR~~~~~G~y~li~rv~vs~~n~~g~T~ftD~N~~------iPgT~~~fvGe~~-p~vi 421 (479) ....+++|.++.++ .+++|||. .++.+.+++.. ..+.+.|.+.. .|...++|-.... |+.+ T Consensus 208 ~~~~t~~w~a~~g~--~~~~V~~~--~~gi~g~ig~~--------~~~~~~~~~~~~~~~~t~~~~~~~~~~~~gyP~~v 275 (681) T protein:vir:98 208 GGANTIAWSASSGA--SRYNVYKE--QGGLYGYIGQT--------TGTSLVDDNIAPDLSVTPPIYDAVFNAAGDYPAAV 275 (681) T ss_pred CcceeEEEEecCCc--eeeeeccc--ceeEEEEeecc--------ceeeeeecccccCccccccccccccccCCCceEEE Confidence 55667777777766 56899985 35667666543 23455555543 3333333322111 3444 Q ss_pred hhhh-hcchhhccccccCcchhhhhhhhhhhheecccccEEEEeccccCcccceeeecC Q lcl|NC_018856. 422 SLLE-LLPMMKLPLAQMNATTTFTVLWYGALALYAPKKWVRIKNVQYIPALAADVTVKY 479 (479) Q Consensus 422 ~l~e-llPm~k~Pla~~~~~~~~~V~~ygaL~l~aPkk~~~ikNV~~~~~~~~~~~~~~ 479 (479) .++| .|-+.-.+- .+.+-| |-++-. +.|--+.-++..|=.|.+ T Consensus 276 ~f~q~RL~f~~~~~---~p~~v~---------~Srsgd---y~nF~~~~~~~ddD~i~~ 319 (681) T protein:vir:98 276 SYFEQRRCFAGTTN---KPQNIW---------MTRSGT---ESAMSYSLPVRDDDRVAF 319 (681) T ss_pred EEEcceEEEeeCCC---CCcEEE---------EEcccC---cccccccCCCCCCccEEE Confidence 4322 222211000 011111 111111 122222223434433333 No 41 >protein:vir:107423 Length: 681 # NCBI annotation: Bbp13 # Family: family:all:780 # MgeID: mge:1537 # MgeName: BPP-1 # Cross-refs: genbank:acc:NP_958682;genbank:gi:41179374;genbank:GeneID:2717217 Probab=97.23 E-value=2.7e-06 Score=51.13 Aligned_cols=291 Identities=18% Similarity=0.136 Sum_probs=150.0 Q ss_pred hhhhHhhhcch----hhHH--HHHHHHHHHHHHHHHHHHH--hhcccccCCCCCCcccchhhhHHHhhcc---------- Q lcl|NC_018856. 129 QSLAAGLVNNI----ADPM--TILTEDAIAVIAKSIEWAI--FYGDAALSSEADGQAGIEFDGLHKLIDQ---------- 190 (479) Q Consensus 129 vs~~~~lvn~~----~Dp~--~~~~~~ai~~~~~~iE~a~--f~Gd~~l~~~~~~~~gleFDGl~~~I~~---------- 190 (479) +++..-+++++ -+|+ .+--.+.-.+-++.+|.++ -+|-..--| |++|-|-++-=.. T Consensus 1 m~~~~~~~~~f~~Ge~~p~l~~r~D~~~y~~~~~~~~N~~~~~~G~~~~R~------g~~~~~~~~~~~~~~rlipf~~~ 74 (681) T protein:vir:10 1 MSNVRVLQRSFGGGEISPEMFGRIDDVKYQSGLAICRNFVVKPQGPAENRA------GFAFVREVKDSAKKVRLIPFTYS 74 (681) T ss_pred CcceeEeeeecCCceeeeeeccchhHHHHHHHHHHhcCcEEEecCCceecC------hhHhhhhcCCCCCcEEEEEEEeC Confidence 11111123333 2555 4444455556666666553 445444433 6788776553211 Q ss_pred CCCEEEccCCCCCHHHHhhhhhhhhhccCceEEEecChHHhhhHHHHhhCcceeeeccCCCcceeeeehhhhcCCCccee Q lcl|NC_018856. 191 DTNVIDLKGARLDEATLNKAAVIVGKGYGRATDAFMPIGVQADFTNNLLDRQRVIQPSTAGGFSTGFSINQFLSTRGAIN 270 (479) Q Consensus 191 ~~NviDarG~~l~~~~l~~aa~~i~~~fG~~td~~mp~~vka~f~~~~~~~qrv~~~~n~g~~~~G~~I~~~~s~~G~I~ 270 (479) ..+.+.++ +-.....+-++-|...+--+|+.+...|....++..++.|.++. +.|-.-..+.-.+. T Consensus 75 ~~~~~~l~--------~g~~~~r~~~~~~~~~~~~~~~~~~tpy~~~~l~~l~~~q~aD~------~~i~h~~~~p~~L~ 140 (681) T protein:vir:10 75 VTQTMVIE--------LGAGYFRFHTNGGTLLDGAVPYEIANPYAEADLFNIHYVQSADV------LTLVHPNYAPRELR 140 (681) T ss_pred CCceEEEE--------EeCCeEEEEeCCcEEeeCcEeEEecCCCChhhhcCceEEEEcCE------EEEECCCCcceEEE Confidence 11211111 01112333445555556567888888898888999999998776 55555666777787 Q ss_pred cccceecCCCceecccCCcCCCCCCCceeEeeeeccCCCCCCCcccccceEEEEEEEcCcC--ccccccceeeeeecCCc Q lcl|NC_018856. 271 LHGSTIMENDNILLEGRNPEPNAPQAPASVVASIVDDKKGGFRDEDIKTHSYKVVVHSDDA--ESLPSEAVTAAVAKKDN 348 (479) Q Consensus 271 l~~s~~m~~~~~L~e~~~~~~~AP~~pa~v~at~~t~~~G~f~~~d~gty~YkVtavn~~G--ES~pS~~vt~Tv~~~g~ 348 (479) .++...|.-..+ ...+.|..|+...++.. .++ ..-+++|.|++++..+ +|.++...+++....+. T Consensus 141 r~~~~~W~l~~~------~f~~~p~~p~~~~at~~--~~~-----~~~t~~~~v~avda~t~~~s~~~~~~tvt~~~~~~ 207 (681) T protein:vir:10 141 RLGATNWQLATI------AFTSPVATPTSVTATSN--NKG-----TDYTYRYVVTALDAEGKTESAPSSAGTCTNNLFTN 207 (681) T ss_pred EccCCceEEEEE------Eeccccccceeeeeecc--CCc-----cceeEeEEEEEeecccceeecCCcceEEeeeeecC Confidence 788777664433 23334444554444322 222 1236899999999887 78888888887666665 Q ss_pred eEEEEEEecCCCCcccceEEEEEecCCCcceEEEEeeeeeeecCCceEEEeecccc------CCCcccceecCCc-hhhh Q lcl|NC_018856. 349 TVKLEVKLASLYQAQPQFISVYREGTETGHYFLIARVPVSKVNDQGVIEVLDRNQV------IPETTDVFVGELT-PNVV 421 (479) Q Consensus 349 sv~ltIT~~~~~~a~~~~y~IYR~~~~~G~y~li~rv~vs~~n~~g~T~ftD~N~~------iPgT~~~fvGe~~-p~vi 421 (479) ....+++|.++.++ .+++|||. .++.+.+++.. ..+.+.|.+.. .|...++|-.... |+.+ T Consensus 208 ~~~~t~~w~a~~g~--~~~~V~~~--~~gi~g~ig~~--------~~~~~~~~~~~~~~~~t~~~~~~~~~~~~gyP~~v 275 (681) T protein:vir:10 208 GGANTIAWSASSGA--SRYNVYKE--QGGLYGYIGQT--------TGTSLVDDNIAPDLSVTPPIYDAVFNAAGDYPAAV 275 (681) T ss_pred CcceeEEEEecCCc--eeeeeccc--ceeEEEEeecc--------ceeeeeecccccCccccccccccccccCCCceEEE Confidence 55667777777766 56899985 35667666543 23455555543 3333333322111 3444 Q ss_pred hhhh-hcchhhccccccCcchhhhhhhhhhhheecccccEEEEeccccCcccceeeecC Q lcl|NC_018856. 422 SLLE-LLPMMKLPLAQMNATTTFTVLWYGALALYAPKKWVRIKNVQYIPALAADVTVKY 479 (479) Q Consensus 422 ~l~e-llPm~k~Pla~~~~~~~~~V~~ygaL~l~aPkk~~~ikNV~~~~~~~~~~~~~~ 479 (479) .++| .|-+.-.+- .+.+-| |-++-. +.|--+.-++..|=.|.+ T Consensus 276 ~f~q~RL~f~~~~~---~p~~v~---------~Srsgd---y~nF~~~~~~~ddD~i~~ 319 (681) T protein:vir:10 276 SYFEQRRCFAGTTN---KPQNIW---------MTRSGT---ESAMSYSLPVRDDDRVAF 319 (681) T ss_pred EEEcceEEEeeCCC---CCcEEE---------EEcccC---cccccccCCCCCCccEEE Confidence 4322 222211000 011111 111111 122222223434433333 No 42 >protein:vir:107802 Length: 681 # NCBI annotation: hypothetical protein predicted by GeneMark # Family: family:all:780 # MgeID: mge:1673 # MgeName: BIP-1 # Cross-refs: genbank:acc:NP_996623;genbank:gi:45580757;genbank:GeneID:2767878 Probab=97.23 E-value=2.7e-06 Score=51.13 Aligned_cols=291 Identities=18% Similarity=0.136 Sum_probs=150.0 Q ss_pred hhhhHhhhcch----hhHH--HHHHHHHHHHHHHHHHHHH--hhcccccCCCCCCcccchhhhHHHhhcc---------- Q lcl|NC_018856. 129 QSLAAGLVNNI----ADPM--TILTEDAIAVIAKSIEWAI--FYGDAALSSEADGQAGIEFDGLHKLIDQ---------- 190 (479) Q Consensus 129 vs~~~~lvn~~----~Dp~--~~~~~~ai~~~~~~iE~a~--f~Gd~~l~~~~~~~~gleFDGl~~~I~~---------- 190 (479) +++..-+++++ -+|+ .+--.+.-.+-++.+|.++ -+|-..--| |++|-|-++-=.. T Consensus 1 m~~~~~~~~~f~~Ge~~p~l~~r~D~~~y~~~~~~~~N~~~~~~G~~~~R~------g~~~~~~~~~~~~~~rlipf~~~ 74 (681) T protein:vir:10 1 MSNVRVLQRSFGGGEISPEMFGRIDDVKYQSGLAICRNFVVKPQGPAENRA------GFAFVREVKDSAKKVRLIPFTYS 74 (681) T ss_pred CcceeEeeeecCCceeeeeeccchhHHHHHHHHHHhcCcEEEecCCceecC------hhHhhhhcCCCCCcEEEEEEEeC Confidence 11111123333 2555 4444455556666666553 445444433 6788776553211 Q ss_pred CCCEEEccCCCCCHHHHhhhhhhhhhccCceEEEecChHHhhhHHHHhhCcceeeeccCCCcceeeeehhhhcCCCccee Q lcl|NC_018856. 191 DTNVIDLKGARLDEATLNKAAVIVGKGYGRATDAFMPIGVQADFTNNLLDRQRVIQPSTAGGFSTGFSINQFLSTRGAIN 270 (479) Q Consensus 191 ~~NviDarG~~l~~~~l~~aa~~i~~~fG~~td~~mp~~vka~f~~~~~~~qrv~~~~n~g~~~~G~~I~~~~s~~G~I~ 270 (479) ..+.+.++ +-.....+-++-|...+--+|+.+...|....++..++.|.++. +.|-.-..+.-.+. T Consensus 75 ~~~~~~l~--------~g~~~~r~~~~~~~~~~~~~~~~~~tpy~~~~l~~l~~~q~aD~------~~i~h~~~~p~~L~ 140 (681) T protein:vir:10 75 VTQTMVIE--------LGAGYFRFHTNGGTLLDGAVPYEIANPYAEADLFNIHYVQSADV------LTLVHPNYAPRELR 140 (681) T ss_pred CCceEEEE--------EeCCeEEEEeCCcEEeeCcEeEEecCCCChhhhcCceEEEEcCE------EEEECCCCcceEEE Confidence 11211111 01112333445555556567888888898888999999998776 55555666777787 Q ss_pred cccceecCCCceecccCCcCCCCCCCceeEeeeeccCCCCCCCcccccceEEEEEEEcCcC--ccccccceeeeeecCCc Q lcl|NC_018856. 271 LHGSTIMENDNILLEGRNPEPNAPQAPASVVASIVDDKKGGFRDEDIKTHSYKVVVHSDDA--ESLPSEAVTAAVAKKDN 348 (479) Q Consensus 271 l~~s~~m~~~~~L~e~~~~~~~AP~~pa~v~at~~t~~~G~f~~~d~gty~YkVtavn~~G--ES~pS~~vt~Tv~~~g~ 348 (479) .++...|.-..+ ...+.|..|+...++.. .++ ..-+++|.|++++..+ +|.++...+++....+. T Consensus 141 r~~~~~W~l~~~------~f~~~p~~p~~~~at~~--~~~-----~~~t~~~~v~avda~t~~~s~~~~~~tvt~~~~~~ 207 (681) T protein:vir:10 141 RLGATNWQLATI------AFTSPVATPTSVTATSN--NKG-----TDYTYRYVVTALDAEGKTESAPSSAGTCTNNLFTN 207 (681) T ss_pred EccCCceEEEEE------Eeccccccceeeeeecc--CCc-----cceeEeEEEEEeecccceeecCCcceEEeeeeecC Confidence 788777664433 23334444554444322 222 1236899999999887 78888888887666665 Q ss_pred eEEEEEEecCCCCcccceEEEEEecCCCcceEEEEeeeeeeecCCceEEEeecccc------CCCcccceecCCc-hhhh Q lcl|NC_018856. 349 TVKLEVKLASLYQAQPQFISVYREGTETGHYFLIARVPVSKVNDQGVIEVLDRNQV------IPETTDVFVGELT-PNVV 421 (479) Q Consensus 349 sv~ltIT~~~~~~a~~~~y~IYR~~~~~G~y~li~rv~vs~~n~~g~T~ftD~N~~------iPgT~~~fvGe~~-p~vi 421 (479) ....+++|.++.++ .+++|||. .++.+.+++.. ..+.+.|.+.. .|...++|-.... |+.+ T Consensus 208 ~~~~t~~w~a~~g~--~~~~V~~~--~~gi~g~ig~~--------~~~~~~~~~~~~~~~~t~~~~~~~~~~~~gyP~~v 275 (681) T protein:vir:10 208 GGANTIAWSASSGA--SRYNVYKE--QGGLYGYIGQT--------TGTSLVDDNIAPDLSVTPPIYDAVFNAAGDYPAAV 275 (681) T ss_pred CcceeEEEEecCCc--eeeeeccc--ceeEEEEeecc--------ceeeeeecccccCccccccccccccccCCCceEEE Confidence 55667777777766 56899985 35667666543 23455555543 3333333322111 3444 Q ss_pred hhhh-hcchhhccccccCcchhhhhhhhhhhheecccccEEEEeccccCcccceeeecC Q lcl|NC_018856. 422 SLLE-LLPMMKLPLAQMNATTTFTVLWYGALALYAPKKWVRIKNVQYIPALAADVTVKY 479 (479) Q Consensus 422 ~l~e-llPm~k~Pla~~~~~~~~~V~~ygaL~l~aPkk~~~ikNV~~~~~~~~~~~~~~ 479 (479) .++| .|-+.-.+- .+.+-| |-++-. +.|--+.-++..|=.|.+ T Consensus 276 ~f~q~RL~f~~~~~---~p~~v~---------~Srsgd---y~nF~~~~~~~ddD~i~~ 319 (681) T protein:vir:10 276 SYFEQRRCFAGTTN---KPQNIW---------MTRSGT---ESAMSYSLPVRDDDRVAF 319 (681) T ss_pred EEEcceEEEeeCCC---CCcEEE---------EEcccC---cccccccCCCCCCccEEE Confidence 4322 222211000 011111 111111 122222223434433333 No 43 >protein:vir:9574 Length: 300 # NCBI annotation: gp40 # Family: family:all:966 # MgeID: mge:171 # MgeName: SM1 # Cross-refs: genbank:acc:NP_862879;genbank:gi:32469471;genbank:GeneID:1461316 Probab=97.13 E-value=2.5e-05 Score=45.83 Aligned_cols=280 Identities=11% Similarity=0.024 Sum_probs=139.3 Q ss_pred ChhhccCccccchhhhhhhhh-hheeccccccchhhccccchhHHHHhhhhhhccCcccccccccccccccccCcceEEE Q lcl|NC_018856. 38 TPDTQLDGAAVRRELLEDQVK-MLAFSSNDFTIYPLINKQQVNSTVAKYAVFNQHGRTGHSRFVREVGVASINDPNIRQK 116 (479) Q Consensus 38 ~p~~~~~gaalr~esld~~~~-~l~~~~~~f~f~~~i~k~~~~stv~eY~~~~~~G~~g~~~fv~E~g~~~~~d~~~~r~ 116 (479) =.++.++++.|-.+.+..+|- .|... ..+.+-....+..+.--+|.++ .+.+...+++|++..+.+++.+.+. T Consensus 1 ma~~t~~~G~lip~~~~~~ii~~l~~~---s~i~~l~~~~~~~~~~~~~p~~---~~~~~a~wv~Eg~~~~~s~~~f~~v 74 (300) T protein:vir:95 1 MSEAQLSKGNLFNPELVTKVINKVKGH---SSIAKLSPQKPIPFNGQREFVF---DFDSDIDIVAENGKKTHGGVSLDPV 74 (300) T ss_pred CcccccCCcceechhhHHHHHHHHHhh---hhhhhhcceeeccCCceEEEEE---ecCcceEEeeCCcccccccccceee Confidence 111112233333333333332 22211 1122222223333222234443 2334678999999999999999999 Q ss_pred EEEEEeeeehhhhhhhHhhhc--chhhHHHHHHHHHHHHHHHHHHHHHhhcccccCCCCCCcccchhhhHHHhhccCCCE Q lcl|NC_018856. 117 TVQMKFLSDTKQQSLAAGLVN--NIADPMTILTEDAIAVIAKSIEWAIFYGDAALSSEADGQAGIEFDGLHKLIDQDTNV 194 (479) Q Consensus 117 ~~~~k~l~~~~~vs~~~~lvn--~~~Dp~~~~~~~ai~~~~~~iE~a~f~Gd~~l~~~~~~~~gleFDGl~~~I~~~~Nv 194 (479) ....+=++---.+|.-+-... ...|.+....++-...+++.++.++|+|+.+-... +....|....-....++ T Consensus 75 ~l~~~k~~~~~~iS~ell~~~~d~~~~l~~~i~~~l~~aia~~~d~~~l~G~~~~~g~-----~~~~~~~~~~~~~~~~~ 149 (300) T protein:vir:95 75 TIVPLKVEYGARVSDEFLHASEEAKVDMLTDFVEGFSKKLARGLDIMSIHGINPRTKQ-----ASTIIGDNCFDKKVTQT 149 (300) T ss_pred EeeeEEEEEeehhhHHHhccCCCCHHHHHHHHHHHHHHHHHHHHHHhhhhcccCCCCC-----Cccccccccccccccee Confidence 999988888888888765443 35677788888899999999999999997543321 23333332222222234 Q ss_pred EEccCCCCCHHHHhhhhhhhhhccCceEEEecChHHhhhHHHHhhCcceeeeccCC----Ccceeeeehh--hhcCC--- Q lcl|NC_018856. 195 IDLKGARLDEATLNKAAVIVGKGYGRATDAFMPIGVQADFTNNLLDRQRVIQPSTA----GGFSTGFSIN--QFLST--- 265 (479) Q Consensus 195 iDarG~~l~~~~l~~aa~~i~~~fG~~td~~mp~~vka~f~~~~~~~qrv~~~~n~----g~~~~G~~I~--~~~s~--- 265 (479) ....|..+ -+.|.++.-.+...++.++-..|++.....+...-...-|.+.+... ..--.|++|- ..... T Consensus 150 ~~~~~~~~-~~~i~~~~~~~~~~~~~~~~~vmn~~~~~~L~~lkd~~G~~i~~~~~~~~~~~~l~G~Pv~~s~~v~~~~~ 228 (300) T protein:vir:95 150 VPFKDTNP-DESMEDAVGMIDGSERDITGAILDPIFTTALSKMKNAEGGKLYPELAWGGVPDAINGLAVDKNRTVSYSQT 228 (300) T ss_pred ecccccch-HHHHHHHHHHhhhcCCCccEEEECHHHHHHHHHhhccCCCeeccCccccCCCceecceeeEEecCCCCCCC Confidence 44444433 45677776677778888888999999999886655443344443322 1223454442 22211 Q ss_pred -Ccceecccce-----ecCCCceecccCCcCCCCCCCceeEeeeeccCCCC--CCCcccccceEEEEEEEcCcCcccccc Q lcl|NC_018856. 266 -RGAINLHGST-----IMENDNILLEGRNPEPNAPQAPASVVASIVDDKKG--GFRDEDIKTHSYKVVVHSDDAESLPSE 337 (479) Q Consensus 266 -~G~I~l~~s~-----~m~~~~~L~e~~~~~~~AP~~pa~v~at~~t~~~G--~f~~~d~gty~YkVtavn~~GES~pS~ 337 (479) ...+.+-||- |..+..+-++ +..-...+.++ -|.. ....+++...-+-+=-.|.. T Consensus 229 ~~~~~~~~GDf~~~~~~~~~~~~~~~--------------v~~~~~~d~~~~~~f~~---~~v~~r~~~r~d~~v~~~~a 291 (300) T protein:vir:95 229 DPKNTAIVGDFETMFKWGYAKEVPME--------------IIKYGDPDNSGRDLKGY---NQIYIRCEAYIGWGIMDAAS 291 (300) T ss_pred CCccEEEEeeccceEEEEEecccEEE--------------EeeccCCCCcchhhhhc---CcEEEEEEEeecceeecccc Confidence 1112222320 1111111111 00000001111 1211 12334443322222222444 Q ss_pred ceeeeeecC Q lcl|NC_018856. 338 AVTAAVAKK 346 (479) Q Consensus 338 ~vt~Tv~~~ 346 (479) .+..+-.++ T Consensus 292 ~~~l~~~~g 300 (300) T protein:vir:95 292 FARIVKTGG 300 (300) T ss_pred eEEEecCCC Confidence 333333332 No 44 >protein:vir:10364 Length: 390 # NCBI annotation: head protein; major capsid subunit precursor # Family: family:all:585 # MgeID: mge:183 # MgeName: Xp10 # Cross-refs: genbank:acc:NP_858956;genbank:gi:32128421;genbank:GeneID:2648357 Probab=97.13 E-value=0.00013 Score=41.88 Aligned_cols=309 Identities=13% Similarity=0.085 Sum_probs=137.1 Q ss_pred CCc---cchhhhhhhhcCCcc----ch----HHHHHHHHHhhhcCC-------------CcChhhccCccccchhhhhhh Q lcl|NC_018856. 1 MTE---LKKEAEAKNKKLPVE----AE----AELAELVSKSFTTGY-------------GITPDTQLDGAAVRRELLEDQ 56 (479) Q Consensus 1 ~~~---~~~~~~~~~~~~~~~----~~----~~~~e~~~Ks~tag~-------------~~~p~~~~~gaalr~esld~~ 56 (479) +.+ ..++.+.+....+.. .+ .+-...+......+. ..+..+..+|+-+-.+.+.+- T Consensus 54 i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~i 133 (390) T protein:vir:10 54 VQAARQRVAELEGNGAGGDVQHVSVGDLFVASEQFQASAGRWNDRSARATMNIKAALNTASTDAAGSAGALTTPNRLPGF 133 (390) T ss_pred HHHHHHHHHHHHhhcccccccccchhhhhhhhHHHHHHHHhhhhhhhhhhhHHHHHHHhhhcccccccccccchhHHHHH Confidence 000 000000000000000 00 000000000000000 011112233444555555443 Q ss_pred hhhheeccccccchhhccccchhHHHHhhhhhhccCcccccccccccccccccCcceEEEEEEEEeeeehhhhhhhHhhh Q lcl|NC_018856. 57 VKMLAFSSNDFTIYPLINKQQVNSTVAKYAVFNQHGRTGHSRFVREVGVASINDPNIRQKTVQMKFLSDTKQQSLAAGLV 136 (479) Q Consensus 57 ~~~l~~~~~~f~f~~~i~k~~~~stv~eY~~~~~~G~~g~~~fv~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lv 136 (479) +..+ .....+++.+...++.+.-.+|.+. .+..+...+++|++..+..|+++......++-++-...+|.-+ +. T Consensus 134 i~~~---~~~~~l~~~~~~~~~~~~~~~~~~~--~~~~~~a~~v~Eg~~~~~~~~~~~~i~~~~~k~~~~~~is~el-l~ 207 (390) T protein:vir:10 134 ITQP---DARLTVRDLIGSGRTDSALIEYVQE--TGFVNNAAIVAEGALKPESSLKFAKKTDTTHVIAHTMKATRQI-LS 207 (390) T ss_pred HHHH---HhhchhhhhcceeeccCCceEEEEE--ecCCcceeeecCCccccccccceeEEEEeeEEEEEeehhhHHH-HH Confidence 3333 2233556666655555544444443 3334456789999999999999999999999999988888864 33 Q ss_pred cchhhHHHHHHHHHHHHHHHHHHHHHhhcccccCCCCCCcccchhhhHHHhhccCCCEEEccCCCCCHHHHhhhhhhhhh Q lcl|NC_018856. 137 NNIADPMTILTEDAIAVIAKSIEWAIFYGDAALSSEADGQAGIEFDGLHKLIDQDTNVIDLKGARLDEATLNKAAVIVGK 216 (479) Q Consensus 137 n~~~Dp~~~~~~~ai~~~~~~iE~a~f~Gd~~l~~~~~~~~gleFDGl~~~I~~~~NviDarG~~l~~~~l~~aa~~i~~ 216 (479) ++ .+......+.-...++..++.+++.|+-. |-++.|+.+............|.. ..+.|..+-..+.. T Consensus 208 d~-~~l~~~i~~~l~~~~~~~~~~~il~G~G~---------~~~p~Gi~~~~~~~~~~~~~~~~~-~~~~~~~~~~~l~~ 276 (390) T protein:vir:10 208 DA-PQLASYMNNRLIRGLKVKEDAEILRGTGA---------NDGLLGLIPQATTYAAPTTIAGAT-RVDQLRLAMLQASL 276 (390) T ss_pred hH-HHHHHHHHHHHHHHHHHHHHHHHhhcCCC---------Cccccccccccccccccccccccc-hHHHHHHHHHhhcc Confidence 44 47778888888889999999999999743 224678877654322233333433 34555555555667 Q ss_pred ccCceEEEecChHHhhhHHHHhhCcceeeeccCCCc--c-eeeeehhhhcC-CCcceecccceecCCCceecccCCcCCC Q lcl|NC_018856. 217 GYGRATDAFMPIGVQADFTNNLLDRQRVIQPSTAGG--F-STGFSINQFLS-TRGAINLHGSTIMENDNILLEGRNPEPN 292 (479) Q Consensus 217 ~fG~~td~~mp~~vka~f~~~~~~~qrv~~~~n~g~--~-~~G~~I~~~~s-~~G~I~l~~s~~m~~~~~L~e~~~~~~~ 292 (479) +|...+-.+|++...+.+...-...-|.+.+...+. . -.|++|-.... |.|.+-+ |+ +++.-.+. + T Consensus 277 ~~~~~~~~v~n~~~~~~L~~lkd~~g~~l~~~~~~~~~~~l~G~pv~~~~~~p~~~~~~-gd--f~~~~~~~-------~ 346 (390) T protein:vir:10 277 AEYPASGIVINPIDWAAIELAKDANNQYLIGNARGTLTPTLWGLPVVATQAMAPGEFLV-GA--FDLAAQIF-------D 346 (390) T ss_pred ccCCCCEEEEcHHHHHHHHHhhcCCCceeecCCcCcCCceecceeeEEcCCCCCCcEEE-Ee--ccceEEEE-------E Confidence 788888899999998888765444334443321110 0 11222111000 0111000 00 00000000 0 Q ss_pred CCCCceeEeeeeccCCCCCCCcccccceEEEEEEEcCcCccccccceeeeee Q lcl|NC_018856. 293 APQAPASVVASIVDDKKGGFRDEDIKTHSYKVVVHSDDAESLPSEAVTAAVA 344 (479) Q Consensus 293 AP~~pa~v~at~~t~~~G~f~~~d~gty~YkVtavn~~GES~pS~~vt~Tv~ 344 (479) .-. ..... ......|.. +...|++...-+-.=--|...+..|++ T Consensus 347 ~~~-~~i~~----~~~~~~~~~---~~~~~r~~~r~d~~v~~~~a~~~~~~a 390 (390) T protein:vir:10 347 QWD-ARVEI----GYVNDDFQR---NMVTVLAEERLALVVYRPEALISGSFA 390 (390) T ss_pred ecc-eEEEE----eeccccccc---CcEEEEEEEeeccEEeccccEEEEEeC Confidence 000 00000 000000100 011121111111111112222222222 No 45 >protein:vir:4830 Length: 397 # NCBI annotation: MPL-7201 # Family: family:all:21 # MgeID: mge:105 # MgeName: 7201 # Cross-refs: genbank:acc:NP_038327;genbank:gi:9634653;genbank:GeneID:1262632 Probab=97.03 E-value=5.1e-05 Score=44.11 Aligned_cols=306 Identities=14% Similarity=0.121 Sum_probs=129.8 Q ss_pred CCccchhhhhhhhcCCccchHHHHHHHHHhhhcCCC-----cChhhccCccccchhhhhhhhhhheeccccccchhhccc Q lcl|NC_018856. 1 MTELKKEAEAKNKKLPVEAEAELAELVSKSFTTGYG-----ITPDTQLDGAAVRRELLEDQVKMLAFSSNDFTIYPLINK 75 (479) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~e~~~Ks~tag~~-----~~p~~~~~gaalr~esld~~~~~l~~~~~~f~f~~~i~k 75 (479) ..+.+...+.+ ......+....+.+.+..+.. ..-.+.++|+.|..+.+.++|..+... ...+++.+.. T Consensus 72 ~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~~~gg~~iP~~~~~~ii~~~~~--~~~l~~~~~~ 145 (397) T protein:vir:48 72 SEEEKKPLTKS----EEEVKAGFVKDFKNLVRGRYQNLLDSKTDASGSDAGLTIPQDIQTAIHTLVRQ--YDSLQEYVNV 145 (397) T ss_pred hhhccccccch----hhHHHHHHHHHHHHHHhhhhhHHHHHhhccCCccccccccHHHHHHHHHHHHH--HHHHHhhhce Confidence 00000000000 000111111112111111111 111223457888888888887444333 3345666666 Q ss_pred cchhHHHHhhhhhhccCccccccccccccccc-ccCcceEEEEEEEEeeeehhhhhhhHhhhcchhhHHHHHHHHHHHHH Q lcl|NC_018856. 76 QQVNSTVAKYAVFNQHGRTGHSRFVREVGVAS-INDPNIRQKTVQMKFLSDTKQQSLAAGLVNNIADPMTILTEDAIAVI 154 (479) Q Consensus 76 ~~~~stv~eY~~~~~~G~~g~~~fv~E~g~~~-~~d~~~~r~~~~~k~l~~~~~vs~~~~lvn~~~Dp~~~~~~~ai~~~ 154 (479) .++.+...++.....-+..+...+++|++..+ ..++.+.+.+..++-++.-..+|.-+ +.++.-|.+....+.--..+ T Consensus 146 ~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~v~~~~~k~~~~~~iS~el-l~ds~~~l~~~v~~~l~~~~ 224 (397) T protein:vir:48 146 ENVTTLTGSRVYEKWADITGLAKLDDEAGSIGTNDDPKLYPIRYAIKRYAGISTVTNSL-LADSAENILAWLSGWIAKKV 224 (397) T ss_pred eeccCCcceEEEEeecCCCcceeeeccccccccccccceeeEEeeheeeeeehhhHHHH-HhhchHHHHHHHHHHHHHHH Confidence 66665444444333333444567899998765 55799999999999999888888764 34555678888888888999 Q ss_pred HHHHHHHHhhcccccCCCCCCcccchhhhHHHhhccCCCEEEccCCCCCHHHHhhhhhhhhhccCceEEEecChHHhhhH Q lcl|NC_018856. 155 AKSIEWAIFYGDAALSSEADGQAGIEFDGLHKLIDQDTNVIDLKGARLDEATLNKAAVIVGKGYGRATDAFMPIGVQADF 234 (479) Q Consensus 155 ~~~iE~a~f~Gd~~l~~~~~~~~gleFDGl~~~I~~~~NviDarG~~l~~~~l~~aa~~i~~~fG~~td~~mp~~vka~f 234 (479) +..++.+++.|+..-.+.. ..+-+|+|.+++ ..+...|......+|+..+.+.+ T Consensus 225 ~~~~d~~il~G~g~~~~~~---~~~~~d~i~~~~-----------------------~~l~~~~~~~a~~v~n~~~~~~L 278 (397) T protein:vir:48 225 VVTRNKAILEAIATLPTKP---TLTKWDDIIDLQ-----------------------AKVDPAIKQTSFFLTNTSGFTAL 278 (397) T ss_pred HHHHHHHHhhccccccccc---ccccHHHHHHHH-----------------------HHhhhhhcCCCEEEECHHHHHHH Confidence 9999999999998754421 123344444433 22334455555667777777766 Q ss_pred HHHhhCcceeeeccCCCc----ceeeeehh----hhcCC--Ccce-ecccceecCCCceecccCCcCCCCCCCceeEeee Q lcl|NC_018856. 235 TNNLLDRQRVIQPSTAGG----FSTGFSIN----QFLST--RGAI-NLHGSTIMENDNILLEGRNPEPNAPQAPASVVAS 303 (479) Q Consensus 235 ~~~~~~~qrv~~~~n~g~----~~~G~~I~----~~~s~--~G~I-~l~~s~~m~~~~~L~e~~~~~~~AP~~pa~v~at 303 (479) ...=...-|.+.+.+.+. .-.|++|- .+... .+.. -+.|+ +.+...+.. ..+ ....... T Consensus 279 ~~lkd~~G~~i~~~~~~~~~~~~l~G~PV~~~~~~~~~~~~~~~~~~~~gd--~~~~~~~~~-------~~~-~~i~~~~ 348 (397) T protein:vir:48 279 KKVKNAFGDYLMERDVKSPTGYSIDGFAVKEVADRWLANASSGAMPLYFGD--LKQAVTLFD-------RQQ-MSLLSTN 348 (397) T ss_pred HHhhcCCCceeeccCcCCCCCceeccceeEEecccccCCcCCCceEEEEEe--ccceEEEEe-------ecc-eEEEEec Confidence 553333334443323222 22444331 11110 0110 01111 000000000 000 0000000 Q ss_pred eccCCCCCCCcc---cccceEEEEEEEcCcCccccccceeeeeecCCceEEE Q lcl|NC_018856. 304 IVDDKKGGFRDE---DIKTHSYKVVVHSDDAESLPSEAVTAAVAKKDNTVKL 352 (479) Q Consensus 304 ~~t~~~G~f~~~---d~gty~YkVtavn~~GES~pS~~vt~Tv~~~g~sv~l 352 (479) .. ...|... ......+-+...+..+-..-.-..+++.+....++-+ T Consensus 349 ~~---~~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~~~~~~~~~~~~~~~ 397 (397) T protein:vir:48 349 IG---GGAFETDTTKIRVIDRFDVVATDTESFVPASFKAIADQKGNLGSTAV 397 (397) T ss_pred cc---hhhhhcCceeEEEEeeeccEEecccceEEEEecccccCCCCccccCC Confidence 00 0000000 0000001111111111000000000000000000000 No 46 >protein:vir:78523 Length: 338 # NCBI annotation: Putative head structural protein # Family: family:all:507 # MgeID: mge:1853 # MgeName: U2 # Cross-refs: genbank:acc:YP_001491585;genbank:gi:157786408;genbank:GeneID:5625675 Probab=97.02 E-value=9.2e-05 Score=42.72 Aligned_cols=296 Identities=13% Similarity=0.049 Sum_probs=149.5 Q ss_pred ccchHHHHHHHHHhhhcCCCcChhhccCccccchhhhhhhhhhheeccccccchhhccccchhHHHHhhhhhhc-----c Q lcl|NC_018856. 17 VEAEAELAELVSKSFTTGYGITPDTQLDGAAVRRELLEDQVKMLAFSSNDFTIYPLINKQQVNSTVAKYAVFNQ-----H 91 (479) Q Consensus 17 ~~~~~~~~e~~~Ks~tag~~~~p~~~~~gaalr~esld~~~~~l~~~~~~f~f~~~i~k~~~~stv~eY~~~~~-----~ 91 (479) .+...++ ++.++|....-...+.+++|-.+.+..+|..+. .+...+.+...+.++.+.-.+|.++.. | T Consensus 1 ~~~~~e~-----~~~~~~~~~~~~~~~~~~~liP~~~~~~ii~~~--~~~s~l~~l~~~~~~~~~~~~ip~~~~~~~a~~ 73 (338) T protein:vir:78 1 MATLNEL-----APNTAGSNHQGRLAHVPSDLLPKEIVGPIFDKA--QESSLVLRLGENIPISYGETIIPTTVKRPEVGQ 73 (338) T ss_pred CcchHHh-----hhhhcccccccceecccccccchHHHHHHHHHH--HhhchhhhhcceeeccCCceEEEEEecCcccee Confidence 2222332 455555444334445567788888877775444 333446666666666665555555442 2 Q ss_pred CcccccccccccccccccCcceEEEEEEEEeeeehhhhhhhHhhhcchhhHHHHHHHHHHHHHHHHHHHHHhhcccccCC Q lcl|NC_018856. 92 GRTGHSRFVREVGVASINDPNIRQKTVQMKFLSDTKQQSLAAGLVNNIADPMTILTEDAIAVIAKSIEWAIFYGDAALSS 171 (479) Q Consensus 92 G~~g~~~fv~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lvn~~~Dp~~~~~~~ai~~~~~~iE~a~f~Gd~~l~~ 171 (479) .+.+...+++|++..+..++++.......+=++--..+|.-+ +.++..|.+....+.-...+.+.+|.++++|+.+-.+ T Consensus 74 v~~~~~~~~~Eg~~~~~~~~~f~~v~l~~~k~~~~~~is~el-l~ds~~~~~~~i~~~la~a~~~~~d~~~l~G~g~~~~ 152 (338) T protein:vir:78 74 VGVGTSNEQREGGTKPLSGTAWDTRSVAPIKLATIVTVSEEF-ARMNPSGLYTKLQADLAYAIGRGIDLAVFHGKSPLTG 152 (338) T ss_pred ecccccccccccccccccccceeEEEEEEEEEEEeehhhHHH-HhcCHHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcc Confidence 334457789999999999999999999998888777777632 2345678888888999999999999999999987432 Q ss_pred CCCCcccchhhhHHHhhccC-CCEEEccC--CCCCHHHHhhhh-hhhhhccCceEEEecChHHhhhHHHH--hh-Cccee Q lcl|NC_018856. 172 EADGQAGIEFDGLHKLIDQD-TNVIDLKG--ARLDEATLNKAA-VIVGKGYGRATDAFMPIGVQADFTNN--LL-DRQRV 244 (479) Q Consensus 172 ~~~~~~gleFDGl~~~I~~~-~NviDarG--~~l~~~~l~~aa-~~i~~~fG~~td~~mp~~vka~f~~~--~~-~~qrv 244 (479) .++.|+.+..... ....|.-+ .....+.|..+. .++......++-.+|+....+.|... +. ..-|+ T Consensus 153 -------~~~~gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~m~~~~~~~L~~~~~l~d~~g~~ 225 (338) T protein:vir:78 153 -------SALQGIDTNNVIVNTTNVDYLQTGTTPLLDRFLDGYDLVSANTDVDFNGWAADPRYRARLLRSQAYRDANGNV 225 (338) T ss_pred -------ccccccccccccccccccccccccchhhHHHHHHHHHHhhhhccccceEEEEchHHHHHHHHHhhhccCCCce Confidence 4567776654322 12223222 223344555443 44455677888899999998888553 22 33455 Q ss_pred eeccCCCc----ceeeeehh--hhcCC-----Cc--ceecccc----eecCCCceecc----cCCcCCCCCCCceeEeee Q lcl|NC_018856. 245 IQPSTAGG----FSTGFSIN--QFLST-----RG--AINLHGS----TIMENDNILLE----GRNPEPNAPQAPASVVAS 303 (479) Q Consensus 245 ~~~~n~g~----~~~G~~I~--~~~s~-----~G--~I~l~~s----~~m~~~~~L~e----~~~~~~~AP~~pa~v~at 303 (479) +.+..... --.|++|- +++.. .+ .+.+-|+ .|.++..+-++ +.......|... T Consensus 226 l~~~~~~~~~~~~l~G~PV~~~~~ip~~~~~~~~~~~~~~~gdfs~~~~~~~~~~~i~~~~~~~~~~~~~~~~~------ 299 (338) T protein:vir:78 226 DPTRINLAASAGDLLGLPVQFGKAVGGDLGAATDSKVRVVGGDFSQLKYGFADEIRVKMSDTATLTDNTSPTPQ------ 299 (338) T ss_pred eecccccCCCCceeeeeeEEEccccCccccccCCcccEEEEEecceEEEEeecccEEEEeeccccccccccccc------ Confidence 54432221 22454442 11110 00 1111111 11111111000 000000000000 Q ss_pred eccCCCCCCCccccc---ceEEEEEEEcCcCcc------cccc Q lcl|NC_018856. 304 IVDDKKGGFRDEDIK---THSYKVVVHSDDAES------LPSE 337 (479) Q Consensus 304 ~~t~~~G~f~~~d~g---ty~YkVtavn~~GES------~pS~ 337 (479) ....|..--.+ ...+=...++..+-. +|+. T Consensus 300 ----~~~~~~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~~~~~ 338 (338) T protein:vir:78 300 ----TVSMWQTNQIAILIEVTFGWLLGDKQAFVKFVDDEDPDA 338 (338) T ss_pred ----chhhhhcCcEEEEEEEEeccEeecccceEEEecccCCCC Confidence 00001000000 000001111111100 1111 No 47 >protein:vir:1638 Length: 298 # NCBI annotation: Structural protein # Family: family:all:966 # MgeID: mge:33 # MgeName: r1t # Cross-refs: genbank:acc:NP_695059;genbank:gi:23455750;genbank:GeneID:955469 Probab=96.92 E-value=0.00026 Score=40.27 Aligned_cols=293 Identities=11% Similarity=-0.002 Sum_probs=140.3 Q ss_pred hccCccccchhhhhhhhhhheeccccccchhhccccchhHHHHhhhhhhccCcccccccccccccccccCcceEEEEEEE Q lcl|NC_018856. 41 TQLDGAAVRRELLEDQVKMLAFSSNDFTIYPLINKQQVNSTVAKYAVFNQHGRTGHSRFVREVGVASINDPNIRQKTVQM 120 (479) Q Consensus 41 ~~~~gaalr~esld~~~~~l~~~~~~f~f~~~i~k~~~~stv~eY~~~~~~G~~g~~~fv~E~g~~~~~d~~~~r~~~~~ 120 (479) =.++|+.|-.+.+..+|..+... ...+.+...+.++.+.-.+|.++. +.+...+++|++..+.+|+.+.+..... T Consensus 1 ma~~gG~lvp~~~~~~ii~~~~~--~s~i~~l~~~~~~~~~~~~ip~~~---~~~~a~~v~E~~~~~~~~~~f~~v~l~~ 75 (298) T protein:vir:16 1 MVLNKGTLFDPTLVTDLISKVAG--KSSIARLSAQKPIPFNGEKVFTFT---MDSEIDVVAESGKKTHGGVTLAPQTMVP 75 (298) T ss_pred CcccCcceechhHHHHHHHHHHh--hhhhhhhcceeeccCCceEEEEEe---cCcceEEecCCccccccccceeEEEEee Confidence 11234445444445555433322 234555555555665444455533 3345779999999999999999999999 Q ss_pred EeeeehhhhhhhHhhhc--chhhHHHHHHHHHHHHHHHHHHHHHhhcccccCCCCCCcccchhhhHHHhhccCCCEEE-- Q lcl|NC_018856. 121 KFLSDTKQQSLAAGLVN--NIADPMTILTEDAIAVIAKSIEWAIFYGDAALSSEADGQAGIEFDGLHKLIDQDTNVID-- 196 (479) Q Consensus 121 k~l~~~~~vs~~~~lvn--~~~Dp~~~~~~~ai~~~~~~iE~a~f~Gd~~l~~~~~~~~gleFDGl~~~I~~~~NviD-- 196 (479) +=++.--.+|.-+-..+ ...|.+....+.-...+++.+|.++|+|...-.. ....+-|+........+... T Consensus 76 ~k~a~~~~iS~ell~~s~d~~~~l~~~i~~~la~ai~~~~d~~~l~G~~~~~g-----~~~~~~~~~~~~~~~~~~~~~~ 150 (298) T protein:vir:16 76 IKVEYGARISDEFMYASDEEKINILQEFNDGFAKKVARGIDLMAFHGVNPRLG-----TASAVIGTNHFDSKVTQKVEAP 150 (298) T ss_pred eeEEEeehhhHHHhhcCcccHHHHHHHHHHHHHHHHHHHHHHHhhccccCCCC-----cccccccccccccccccccccc Confidence 99998888888764433 3456777788888899999999999999644221 11223443333332222222 Q ss_pred ccCCCCCHHHHhhhhhhhhhccCceEEEecChHHhhhHHHHhhCcceeeeccCCCcceeeeehhhhcCCCcceeccccee Q lcl|NC_018856. 197 LKGARLDEATLNKAAVIVGKGYGRATDAFMPIGVQADFTNNLLDRQRVIQPSTAGGFSTGFSINQFLSTRGAINLHGSTI 276 (479) Q Consensus 197 arG~~l~~~~l~~aa~~i~~~fG~~td~~mp~~vka~f~~~~~~~qrv~~~~n~g~~~~G~~I~~~~s~~G~I~l~~s~~ 276 (479) ..+..+ .+.|.++...+..++..+.-..|++...+.+...-...-|.+.+...... .+.++ T Consensus 151 ~~~~~~-~~~i~~~~~~~~~~~~~~~~~vmn~~~~~~l~~lkd~~G~~i~~~~~~~~------------------~~~~l 211 (298) T protein:vir:16 151 RGIADP-NGAIENAVELLTGVDADVTGIAINPSFRSALAKQKDLQDNALFPELKWGA------------------TPDTI 211 (298) T ss_pred cccccH-HHHHHHHHHHhhhcCCCccEEEEcHHHHHHHHHhhccCCCeeecCcccCC------------------CCcee Confidence 222222 23455555566677888888999999999886644333344433222111 12233 Q ss_pred cCCCceecccCCcCCCCCCCceeEeeeeccCCCCCCCcccccceEEEEEEEcCcCccccccceeeeeecCCceEEEEEEe Q lcl|NC_018856. 277 MENDNILLEGRNPEPNAPQAPASVVASIVDDKKGGFRDEDIKTHSYKVVVHSDDAESLPSEAVTAAVAKKDNTVKLEVKL 356 (479) Q Consensus 277 m~~~~~L~e~~~~~~~AP~~pa~v~at~~t~~~G~f~~~d~gty~YkVtavn~~GES~pS~~vt~Tv~~~g~sv~ltIT~ 356 (479) +..+-.. +....+...+.-...-.|.++.-+...-+.+ +++++.. T Consensus 212 ~G~PV~~-----------------~~~v~~~~~~~~~~~~~GDfs~~~~~~~~~~------------------~~~~~~~ 256 (298) T protein:vir:16 212 NGLPVDV-----------------NKTVSDMSLTQRDRAIIGDFANGFKWGYAKE------------------VPLEVIQ 256 (298) T ss_pred cceeeEE-----------------ecccccccCCCccEEEEeeccceEEEEEecC------------------ceEEEee Confidence 3332221 1111110000000001111211111100111 1111111 Q ss_pred cC-CCCcccceEEEEEecCCCcceEEEEeeeeeeecCCceEEEeecc Q lcl|NC_018856. 357 AS-LYQAQPQFISVYREGTETGHYFLIARVPVSKVNDQGVIEVLDRN 402 (479) Q Consensus 357 ~~-~~~a~~~~y~IYR~~~~~G~y~li~rv~vs~~n~~g~T~ftD~N 402 (479) .. ..+.+ ++-|++. -=.|....|+...-.+..+....++.+ T Consensus 257 ~~~~~~~~---~~~f~~~--~v~~ra~~r~d~~v~~~~a~~~l~~at 298 (298) T protein:vir:16 257 YGDPDNSG---LDLKGYN--QVYIRAELFLGWGILDATKFARVTEAN 298 (298) T ss_pred ccCCcCcc---hhhhhcC--cEEEEEEEEEccEeecccceEEEeecC Confidence 00 00100 1112111 011222233333323333444444444 No 48 >protein:vir:80684 Length: 315 # NCBI annotation: gp6 # Family: family:all:966 # MgeID: mge:1884 # MgeName: PA6 # Cross-refs: genbank:acc:YP_001285582;genbank:gi:148727088;genbank:GeneID:5247055 Probab=96.90 E-value=3.4e-05 Score=45.06 Aligned_cols=289 Identities=12% Similarity=0.075 Sum_probs=127.6 Q ss_pred hhcCCCcChhhccCccccchhhhhhhhhhheeccccccchhhccccchhHHHHhhhhhhccCcccccccccccccccccC Q lcl|NC_018856. 31 FTTGYGITPDTQLDGAAVRRELLEDQVKMLAFSSNDFTIYPLINKQQVNSTVAKYAVFNQHGRTGHSRFVREVGVASIND 110 (479) Q Consensus 31 ~tag~~~~p~~~~~gaalr~esld~~~~~l~~~~~~f~f~~~i~k~~~~stv~eY~~~~~~G~~g~~~fv~E~g~~~~~d 110 (479) |..+. -+.|+.+-.+.+..+|..... +...+.+.....+..+--.+|.+ ..+.+...+++|++..+.++ T Consensus 1 Ma~~~------~~~gg~~vP~~~~~~ii~~l~--~~s~i~~l~~~i~~~~~~~~ip~---~~~~~~a~wv~Eg~~~~~s~ 69 (315) T protein:vir:80 1 MADDF------LSAGKLELPGSMIGAVRDRAI--DSGVLAKLSPEQPTIFGPVKGAV---FSGVPRAKIVGEGEVKPSAS 69 (315) T ss_pred CCCCc------CCcCceEcchHHHHHHHHHHH--hhchhhhhcceeecCCCceEEEE---EeCCcceEEeeCCccccccc Confidence 33221 122444444445444432211 11223332222333322223333 33444667999999999999 Q ss_pred cceEEEEEEEEeeeehhhhhhhHhhhcc---hhhHHHHHHHHHHHHHHHHHHHHHhhcccccCCCCCCcccchhhhHHHh Q lcl|NC_018856. 111 PNIRQKTVQMKFLSDTKQQSLAAGLVNN---IADPMTILTEDAIAVIAKSIEWAIFYGDAALSSEADGQAGIEFDGLHKL 187 (479) Q Consensus 111 ~~~~r~~~~~k~l~~~~~vs~~~~lvn~---~~Dp~~~~~~~ai~~~~~~iE~a~f~Gd~~l~~~~~~~~gleFDGl~~~ 187 (479) +.+.+.....|=|+.--.+|..+-..+. +...+....++-...+++.+|.++|+|+..-. |....|+... T Consensus 70 ~~f~~v~l~~~kl~~~~~iS~ell~~s~~~~~~~l~~~i~~~la~ai~~~~d~a~~~G~~~~~-------~~~~~~~~~~ 142 (315) T protein:vir:80 70 VDVSAFTAQPIKVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPAT-------GKAASAVHTS 142 (315) T ss_pred cceeeeEeeeeeEEeeehhhHHHhhcCchhHHHHHHHHHHHHHHHHHHHHHhhheeeccCCCC-------Cccccccccc Confidence 9999999999988888788876543332 23356777888888999999999999985432 2335677777 Q ss_pred hccCCCEEEccCCCCCHHHHhhhhhhhhhccCceEEEecChHHhhhHHHHhhC------cceeeeccCCC--cceeeeeh Q lcl|NC_018856. 188 IDQDTNVIDLKGARLDEATLNKAAVIVGKGYGRATDAFMPIGVQADFTNNLLD------RQRVIQPSTAG--GFSTGFSI 259 (479) Q Consensus 188 I~~~~NviDarG~~l~~~~l~~aa~~i~~~fG~~td~~mp~~vka~f~~~~~~------~qrv~~~~n~g--~~~~G~~I 259 (479) +....+.+++-+... .+.++.-+.+....+...+-..|++.+...+...-.. ++........+ ..-.|++| T Consensus 143 ~~~~~~~~~~~~~~~-~d~~~~~~~~~~~~~~~~~~~imn~~~~~~L~~l~~~~g~~~~g~~~~~~~~~g~~~tl~G~PV 221 (315) T protein:vir:80 143 LNKTKNIVDATDSAT-ADLVKAVGLIAGAGLQVPNGVALDPAFSFALSTEVYPKGSPLAGQPMYPAAGFAGLDNWRGLNV 221 (315) T ss_pred cccccceeeccccch-HHHHHHHHHHhhccCccceEEEEcHHHHHHHHHHhhccCCcccccccccccccCCCceecceee Confidence 776677888877543 3444443344444455555688999998888544322 12211111111 12345444 Q ss_pred h--hhcCCCc-------ceecccc----eecCCCceecccCCcCCCCCCCceeEeeeeccCCCCCCCcccccceEEEEEE Q lcl|NC_018856. 260 N--QFLSTRG-------AINLHGS----TIMENDNILLEGRNPEPNAPQAPASVVASIVDDKKGGFRDEDIKTHSYKVVV 326 (479) Q Consensus 260 ~--~~~s~~G-------~I~l~~s----~~m~~~~~L~e~~~~~~~AP~~pa~v~at~~t~~~G~f~~~d~gty~YkVta 326 (479) - +.+.... .+.+.|| .+..+..+.++- ....+.-..+ +.. -..+. =.| -....+=..+ T Consensus 222 ~~~~~~~~~~~~~~~~~~~~~~GDfs~~~~g~~~~~~i~i-~~~~~~~~~~--~~~-~~~~~-v~~----r~~~r~~~~v 292 (315) T protein:vir:80 222 GASSTVSGAPEMSPASGVKAIVGDFSRVHWGFQRNFPIEL-IEYGDPDQTG--RDL-KGHNE-VMV----RAEAVLYVAI 292 (315) T ss_pred EecCcCCcccccccccccEEEEeecccEEEEEecCeeEEE-eccccccCcc--cch-hhcCc-EEE----EEEEEeccee Confidence 3 1111000 0111111 010111110000 0000000000 000 00000 000 0000000000 Q ss_pred EcCcCccc--cccceeeeeecCC Q lcl|NC_018856. 327 HSDDAESL--PSEAVTAAVAKKD 347 (479) Q Consensus 327 vn~~GES~--pS~~vt~Tv~~~g 347 (479) .+..+-.. ...+-..+.++.. T Consensus 293 ~~~~a~~~l~~~~a~~~~~~~~~ 315 (315) T protein:vir:80 293 ESLDSFAVVKEKAAPKPNPPAEN 315 (315) T ss_pred ecccceEEEeeccCCCCCCCCCC Confidence 00000000 0000001111111 No 49 >protein:vir:4226 Length: 326 # NCBI annotation: observed 35.2Kd protein # Family: family:all:507 # MgeID: mge:89 # MgeName: L5 # Cross-refs: genbank:acc:NP_039681;swissprot:sw:q05223;genbank:gi:9625447;uniprot:Q05223;genbank:GeneID:2942929 Probab=96.86 E-value=5.5e-05 Score=43.94 Aligned_cols=288 Identities=13% Similarity=0.083 Sum_probs=133.4 Q ss_pred CCccchhhhhhhhcCCccchHHHHHHHHHhhhcCCCcChhhccCccccchhhhhhhhhhheeccccccchhhccccchhH Q lcl|NC_018856. 1 MTELKKEAEAKNKKLPVEAEAELAELVSKSFTTGYGITPDTQLDGAAVRRELLEDQVKMLAFSSNDFTIYPLINKQQVNS 80 (479) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~e~~~Ks~tag~~~~p~~~~~gaalr~esld~~~~~l~~~~~~f~f~~~i~k~~~~s 80 (479) |--|++++... +...-.|+++++.. .+|+-+-.|..++-+..+... -.+.+...+.++.+ T Consensus 1 ~~~~~~r~~~~-----------~~~~e~~a~~~~~~------~~g~~ip~~~~~~ii~~~~~~---s~i~~~~~~~~~~~ 60 (326) T protein:vir:42 1 MAVNPDRTTPF-----------LGVNDPKVAQTGDS------MFEGYLEPEQAQDYFAEAEKI---SIVQQFAQKIPMGT 60 (326) T ss_pred CCCCccchhhh-----------cCcchhhheecccc------CCcceechhhHHHHHHHHHhc---chhhhhcceeeccC Confidence 44333332111 11112356655432 224444444444433333222 23444444444444 Q ss_pred HHHhhhhhhccCcccccccccccccccccCcceEEEEEEEEeeeehhhhhhhHhhhcchhhHHHHHHHHHHHHHHHHHHH Q lcl|NC_018856. 81 TVAKYAVFNQHGRTGHSRFVREVGVASINDPNIRQKTVQMKFLSDTKQQSLAAGLVNNIADPMTILTEDAIAVIAKSIEW 160 (479) Q Consensus 81 tv~eY~~~~~~G~~g~~~fv~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lvn~~~Dp~~~~~~~ai~~~~~~iE~ 160 (479) .-.+|.++. +.+...|++|++..+.+++.+.+....++=++..-.+|.-+ +.++..|.+....+.-...++..+|. T Consensus 61 ~~~~~p~~~---~~~~a~~v~Eg~~~~~~~~~f~~i~~~~~k~~~~v~iS~el-l~~s~~~~~~~i~~~l~~a~~~~~d~ 136 (326) T protein:vir:42 61 TGQKIPHWT---GDVSASWIGEGDMKPITKGNMTSQTIAPHKIATIFVASAET-VRANPANYLGTMRTKVATAFAMAFDN 136 (326) T ss_pred CceEEEEEe---CCcceEEecCCccccccccceeEEEEeeEEEEEeehhhHHH-HhcCHHHHHHHHHHHHHHHHHHHHHH Confidence 333444433 33456799999999999999999999999999999998854 34567788888889999999999999 Q ss_pred HHhhcccccCCCCCCcccchhhhHHHhhccCCCEEEccCCC----CC-HH-HHhhhhhhhhhccCceEEEecChHHhhhH Q lcl|NC_018856. 161 AIFYGDAALSSEADGQAGIEFDGLHKLIDQDTNVIDLKGAR----LD-EA-TLNKAAVIVGKGYGRATDAFMPIGVQADF 234 (479) Q Consensus 161 a~f~Gd~~l~~~~~~~~gleFDGl~~~I~~~~NviDarG~~----l~-~~-~l~~aa~~i~~~fG~~td~~mp~~vka~f 234 (479) ++|+|+.+=.| .|+.+..... ......+.. +. .+ .+..+.......+...+...|+......+ T Consensus 137 a~l~G~gs~~p----------~gi~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~n~~~~~~L 205 (326) T protein:vir:42 137 AAINGTDSPFP----------TFLAQTTKEV-SLVDPDGTGSNADLTVYDAVAVNALSLLVNAGKKWTHTLLDDITEPIL 205 (326) T ss_pred HhhcccCCCcc----------cccccccccc-ceeecccccccccchhHHHHHHHHHhhhhhhccCccEEEEeHHHHHHH Confidence 99999885222 3444333221 222222211 11 11 12233344455566667788999999988 Q ss_pred HHHhhCcceeeeccCCCc---------ceeeeeh--hhhcCCCccee-cccc----eecCCCceeccc---CCcCCCCCC Q lcl|NC_018856. 235 TNNLLDRQRVIQPSTAGG---------FSTGFSI--NQFLSTRGAIN-LHGS----TIMENDNILLEG---RNPEPNAPQ 295 (479) Q Consensus 235 ~~~~~~~qrv~~~~n~g~---------~~~G~~I--~~~~s~~G~I~-l~~s----~~m~~~~~L~e~---~~~~~~AP~ 295 (479) ...-...-|.+.+..... .-.|+++ ..+. +.|.+. +.|+ .+..+..+.++- ....-..+. T Consensus 206 ~~lkd~~G~~l~~~~~~~~~~~~~~~~~l~G~pv~~~~~~-~~~~~~~~~Gd~s~~~~~~~~~~~v~~~~e~~~~~~~~~ 284 (326) T protein:vir:42 206 NGAKDKSGRPLFIESTYTEENSPFRLGRIVARPTILSDHV-ASGTVVGYQGDFRQLVWGQVGGLSFDVTDQATLNLGTPQ 284 (326) T ss_pred HHhhccCCceeeccccccCccccccCceeeeeeEEEcCCC-CCCceEEEEeecceEEEEEecceEEEEeecceeeecccc Confidence 754333333333222111 1234333 2222 233322 1222 010111110000 000000000 Q ss_pred CceeEeeeeccCCCCCCCcccc---cceEEEEEEEcC----------cCcc Q lcl|NC_018856. 296 APASVVASIVDDKKGGFRDEDI---KTHSYKVVVHSD----------DAES 333 (479) Q Consensus 296 ~pa~v~at~~t~~~G~f~~~d~---gty~YkVtavn~----------~GES 333 (479) ....+ ..|...-. ...++-+.+.+. -++| T Consensus 285 ~~~~~---------~~~~~d~~~~r~~~~~d~~v~~~~a~~~l~~~~~~~~ 326 (326) T protein:vir:42 285 APNFV---------SLWQHNLVAVRVEAEYAFHCNDKDAFVKLTNVDATEA 326 (326) T ss_pred cccch---------hhhhcCcEEEEEEEEeccEEecccceEEEeeccccCC Confidence 00000 00000000 000111111111 1222 No 50 >protein:vir:81070 Length: 390 # NCBI annotation: p09 # Family: family:all:585 # MgeID: mge:1889 # MgeName: Xop411 # Cross-refs: genbank:acc:YP_001285679;genbank:gi:148727187;genbank:GeneID:5247115 Probab=96.86 E-value=0.0001 Score=42.46 Aligned_cols=309 Identities=11% Similarity=0.045 Sum_probs=149.0 Q ss_pred CCccch---hhhh-------hhhcC-CccchHHHHHHHHHhhhcCC-------------CcChhhccCccccchhhhhhh Q lcl|NC_018856. 1 MTELKK---EAEA-------KNKKL-PVEAEAELAELVSKSFTTGY-------------GITPDTQLDGAAVRRELLEDQ 56 (479) Q Consensus 1 ~~~~~~---~~~~-------~~~~~-~~~~~~~~~e~~~Ks~tag~-------------~~~p~~~~~gaalr~esld~~ 56 (479) +.+.+. +.+. +.... ....+.+-.+.+.+.+.-+. .....+..+|+.+..|.. +. T Consensus 54 i~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~-~~ 132 (390) T protein:vir:81 54 VQAARQRVAELEGNGAGGDVQHVSVGDMFVASEQFQASAGRWNDRSARATMNIKAALNTASTDAAGSAGALTTPNRL-PG 132 (390) T ss_pred HHHHHHHHHHHHhcccccccccccchhhhhhhHHHHHHHHHHhhhhhhhhhHHHHHHHhhccccccCCcceechhhh-HH Confidence 000000 0000 00000 00000000001111110000 001112233444555544 44 Q ss_pred hhhheeccccccchhhccccchhHHHHhhhhhhccCcccccccccccccccccCcceEEEEEEEEeeeehhhhhhhHhhh Q lcl|NC_018856. 57 VKMLAFSSNDFTIYPLINKQQVNSTVAKYAVFNQHGRTGHSRFVREVGVASINDPNIRQKTVQMKFLSDTKQQSLAAGLV 136 (479) Q Consensus 57 ~~~l~~~~~~f~f~~~i~k~~~~stv~eY~~~~~~G~~g~~~fv~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lv 136 (479) +..+. .+...+.+.+...++.+-..+|.++. +..+...+++|++..+..++.+......++-++---.+|.-+ +. T Consensus 133 ii~~~--~~~~~l~~~~~~~~~~~~~~~~~~~~--~~~~~a~~v~Eg~~~~~~~~~~~~i~~~~~k~~~~~~is~el-l~ 207 (390) T protein:vir:81 133 FITPP--DARLTVRDLIGSGRTDSALIEYVQET--GFVNNAAIVAEGALKPESSLKFAKKTDTTHVIAHTMKATRQI-LS 207 (390) T ss_pred HHHHH--hhhhhhhhhcceeeccCCceEEEEEe--cCCcceeeecCCcccccccceeeEEEEeeeEEEEeehhhHHH-HH Confidence 43222 23344555555555655444455433 333456789999999999999999999999999888888853 33 Q ss_pred cchhhHHHHHHHHHHHHHHHHHHHHHhhcccccCCCCCCcccchhhhHHHhhccCCCEEEccCCCCCHHHHhhhhhhhhh Q lcl|NC_018856. 137 NNIADPMTILTEDAIAVIAKSIEWAIFYGDAALSSEADGQAGIEFDGLHKLIDQDTNVIDLKGARLDEATLNKAAVIVGK 216 (479) Q Consensus 137 n~~~Dp~~~~~~~ai~~~~~~iE~a~f~Gd~~l~~~~~~~~gleFDGl~~~I~~~~NviDarG~~l~~~~l~~aa~~i~~ 216 (479) ++ .+.+....+.-...++..++.++++|+-. |-.+.|+.+.......+.... .....+.|..+--.+.. T Consensus 208 d~-~~~~~~i~~~l~~~~~~~~d~a~l~G~g~---------~~~~~Gi~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~ 276 (390) T protein:vir:81 208 DA-PQLASYMNNRLIRGLKVKEDAEILRGTGA---------NDGLLGLIPQATTYAAPTTIA-GATRVDQLRLAMLQASL 276 (390) T ss_pred hH-HHHHHHHHHHHHHHHHHHHHHHHHhcCCC---------CCcccceeecccccccccccc-cchhHHHHHHHHHhhcc Confidence 44 47888888889999999999999999754 223678776544322233332 33344556655555566 Q ss_pred ccCceEEEecChHHhhhHHHHhhCcceeeeccC--CCcc-eeeeehhhhc-CCCcceecccceecCCCceecccCCcCCC Q lcl|NC_018856. 217 GYGRATDAFMPIGVQADFTNNLLDRQRVIQPST--AGGF-STGFSINQFL-STRGAINLHGSTIMENDNILLEGRNPEPN 292 (479) Q Consensus 217 ~fG~~td~~mp~~vka~f~~~~~~~qrv~~~~n--~g~~-~~G~~I~~~~-s~~G~I~l~~s~~m~~~~~L~e~~~~~~~ 292 (479) .+...+-++|++.+.+.+...-...-|.+.+.. .+.. -.|++|-... -|.|.+-+ |+- ++--.+.. T Consensus 277 ~~~~~~~~v~~~~~~~~l~~lkd~~G~~l~~~~~~~~~~~l~G~pv~~~~~~p~~~~~~-gd~--~~~~~~~~------- 346 (390) T protein:vir:81 277 AEYNPSGIVINPIDWAAIELAKDANNQYLIGNARGTLTPTLWGLPVVATQAMAPGEFLV-GAF--DLAAQIFD------- 346 (390) T ss_pred ccCCCCEEEEcHHHHHHHHHhhcCCCceeecCcccccCceecceeeEEcCCCCCCcEEE-Eeh--hceEEEEE------- Confidence 677778899999999888765544445443321 1111 2354433221 12332211 110 00000110 Q ss_pred CCCCceeEeeeeccCCCCCCCcccccceEEEEEEEcCcCccccccceeeeee Q lcl|NC_018856. 293 APQAPASVVASIVDDKKGGFRDEDIKTHSYKVVVHSDDAESLPSEAVTAAVA 344 (479) Q Consensus 293 AP~~pa~v~at~~t~~~G~f~~~d~gty~YkVtavn~~GES~pS~~vt~Tv~ 344 (479) .-. ... ..... +-.|. .+...|++..--+-.=-.|...+..|.+ T Consensus 347 ~~~-~~v--~~~~~--~~~~~---~~~v~~r~~~r~d~~v~~~~a~v~~t~a 390 (390) T protein:vir:81 347 QWD-ARV--EIGYV--GEDFQ---RNMITVLAEERLALVVYRPEALISGSFA 390 (390) T ss_pred ecc-eEE--EEecc--cchhh---cCcEEEEEEEeeccEEecccceEEEEeC Confidence 001 111 10000 00122 1223455444433333345555566665 No 51 >protein:vir:41 Length: 299 # NCBI annotation: major capsid protein # Family: family:all:507 # MgeID: mge:2 # MgeName: A118 # Cross-refs: genbank:acc:NP_463467;swissprot:trembl:q9t1b7;genbank:gi:16798789;uniprot:Q9T1B7;genbank:GeneID:922353 Probab=96.79 E-value=0.00013 Score=41.92 Aligned_cols=283 Identities=10% Similarity=0.047 Sum_probs=148.5 Q ss_pred cCCCcCh-hhccCccccchhhhhhhhhhheeccccccchhhccccchhHHHHhhhhhhccCcccccccccccccccccCc Q lcl|NC_018856. 33 TGYGITP-DTQLDGAAVRRELLEDQVKMLAFSSNDFTIYPLINKQQVNSTVAKYAVFNQHGRTGHSRFVREVGVASINDP 111 (479) Q Consensus 33 ag~~~~p-~~~~~gaalr~esld~~~~~l~~~~~~f~f~~~i~k~~~~stv~eY~~~~~~G~~g~~~fv~E~g~~~~~d~ 111 (479) =|+..+- .+.++|+.|-.+.+.+++..... +...+.+.....++.+...++.+. .+ ....|++|++..+..++ T Consensus 1 ~g~~a~~~~~~~~~~~~iP~~~~~~ii~~~~--~~s~l~~~~~~~~~~~~~~~~~~~---~~-~~a~~v~E~~~~~~~~~ 74 (299) T protein:vir:41 1 MGFNPDTTTMQSAKTGSIPINISEQIITGVK--NGSAAMKLAKAVPMTKPEEEFTFM---SG-VGAFWVDEAERIQTSKP 74 (299) T ss_pred CCcCCCcccccCCCceecchhHHHHHHHHHH--hcchhhhhceeeecCCCcEEEEEE---cC-CceeeeecCcccccccc Confidence 3333222 22334666777777777754332 223455555666666666655543 33 24679999999999999 Q ss_pred ceEEEEEEEEeeeehhhhhhhHhhhcchhhHHHHHHHHHHHHHHHHHHHHHhhcccccCCCCCCcccchhhhHHHhhccC Q lcl|NC_018856. 112 NIRQKTVQMKFLSDTKQQSLAAGLVNNIADPMTILTEDAIAVIAKSIEWAIFYGDAALSSEADGQAGIEFDGLHKLIDQD 191 (479) Q Consensus 112 ~~~r~~~~~k~l~~~~~vs~~~~lvn~~~Dp~~~~~~~ai~~~~~~iE~a~f~Gd~~l~~~~~~~~gleFDGl~~~I~~~ 191 (479) .+.......|-++---.+|.-+- .++..|.+....+.-...+++.+|.++++|+.+-.+ .|+++..... T Consensus 75 ~f~~v~l~~~k~~~~~~is~ell-~ds~~~~~~~i~~~l~~a~~~~~d~a~l~G~g~~~~----------~gil~~~~~~ 143 (299) T protein:vir:41 75 TFTKAKMRSKKMGVIIPTTKENL-NYSVTNFFSLMQAEIVEAFYKKFDQAVFTGVESPYN----------WNILKSATDA 143 (299) T ss_pred ceeEEEEeeEEEEEeehhhHHHH-hcCHHHHHHHHHHHHHHHHHHHHHHHHhhcccCccc----------cccccccccc Confidence 99999999999999888888432 345668888888999999999999999999975322 3666655544 Q ss_pred CCEEEccCCCCCHHHHhhhhhhhhhccCceEEEecChHHhhhHHHHhhCcceeeeccCCCc---ceeeeeh--hhhcCCC Q lcl|NC_018856. 192 TNVIDLKGARLDEATLNKAAVIVGKGYGRATDAFMPIGVQADFTNNLLDRQRVIQPSTAGG---FSTGFSI--NQFLSTR 266 (479) Q Consensus 192 ~NviDarG~~l~~~~l~~aa~~i~~~fG~~td~~mp~~vka~f~~~~~~~qrv~~~~n~g~---~~~G~~I--~~~~s~~ 266 (479) .+.... ...+.+.|.++.-.+...+...+-++|+......+...-...-|.+....... .-.|++| ...+... T Consensus 144 ~~~~~~--~~~~~~~l~~~~~~l~~~~~~~~~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~~~~~l~G~PV~~~~~~~~~ 221 (299) T protein:vir:41 144 SNLVEE--TANKYDDLNEAIGLIEAEDLEPNGIATIRKQRVKYRSTKDGNGMPIFNTATSNGVDDVLGLPIAYTPKYTFG 221 (299) T ss_pred ceeecc--ccccHHHHHHHHHhhhcccCCcCEEEEcHHHHHHHHHhhccCCceeecCCcCCCCceecceeeEEecccCCC Confidence 444322 33455666666656777788888899999998888765544445554433221 1233332 2222111 Q ss_pred cc--eecccc----eecCCCceecccCCcCCCCCCCceeEeeeeccCCCCC----CCcccccceEEEEEEEcCcCccccc Q lcl|NC_018856. 267 GA--INLHGS----TIMENDNILLEGRNPEPNAPQAPASVVASIVDDKKGG----FRDEDIKTHSYKVVVHSDDAESLPS 336 (479) Q Consensus 267 G~--I~l~~s----~~m~~~~~L~e~~~~~~~AP~~pa~v~at~~t~~~G~----f~~~d~gty~YkVtavn~~GES~pS 336 (479) .. ..+-|+ .+..+.++-++ ..+.. .-....+..|+ |.. +-..|++..--+-.---|. T Consensus 222 ~~~~~~~~gdfs~~~i~~~~~~~i~----~~~~~------~~~~~~~~~~~~~~~~~~---~~~~~r~~~~~d~~v~~~~ 288 (299) T protein:vir:41 222 DKDISELVGDWNQAYYGILRGVEYE----ILTEA------TLTTVADETGKPLNLAER---DMAAIKATFEVGFMVVKDE 288 (299) T ss_pred CCceEEEEEecccEEEEEecCcEEE----Eeecc------cccccccccccchhhhhc---CcEEEEEEEEeccEEeccc Confidence 10 001111 11111111000 00000 00000011111 111 1123333221111111122 Q ss_pred cceeeeeecCC Q lcl|NC_018856. 337 EAVTAAVAKKD 347 (479) Q Consensus 337 ~~vt~Tv~~~g 347 (479) ..+-.+..+.. T Consensus 289 A~~~l~~~aa~ 299 (299) T protein:vir:41 289 AFSAVQPKAGN 299 (299) T ss_pred ceEEEEeccCC Confidence 22222222222 No 52 >protein:vir:3991 Length: 404 # NCBI annotation: major structural protein # Family: family:all:21 # MgeID: mge:319 # MgeName: BK5-T # Cross-refs: genbank:acc:NP_116499;genbank:gi:14251132;genbank:GeneID:921252 Probab=96.76 E-value=0.00019 Score=40.98 Aligned_cols=306 Identities=14% Similarity=0.130 Sum_probs=134.1 Q ss_pred CCccc---hhhhh--------hhhcCCccchHHHHHHHHHhh----hcCCC---------cChhhccCccccchhhhhhh Q lcl|NC_018856. 1 MTELK---KEAEA--------KNKKLPVEAEAELAELVSKSF----TTGYG---------ITPDTQLDGAAVRRELLEDQ 56 (479) Q Consensus 1 ~~~~~---~~~~~--------~~~~~~~~~~~~~~e~~~Ks~----tag~~---------~~p~~~~~gaalr~esld~~ 56 (479) +.+.+ ++.+. ..............+...++| ..+.. ....+.++|+.|-++.+..+ T Consensus 56 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~a~~~~t~~~gg~~iP~~~~~~ 135 (404) T protein:vir:39 56 RDALREQLVEAQAEQVVNMREEEKGPLNKSEYELKDKFVKEFVNMVRNPMAFLNTVSSKTETSGSDSAAGLTIPQDIRTM 135 (404) T ss_pred HHHHHHHHHHHHHHHHhccccccccccccchhhhHHHHHHHHHHHHhcchhhhhhhhhhhhhcccccCCceeccHHHHHH Confidence 00000 00000 000000000111111111222 11111 11233456778888888888 Q ss_pred hhhheeccccccchhhccccchhHHHHhhhhhhccCcccccccccccccc-cccCcceEEEEEEEEeeeehhhhhhhHhh Q lcl|NC_018856. 57 VKMLAFSSNDFTIYPLINKQQVNSTVAKYAVFNQHGRTGHSRFVREVGVA-SINDPNIRQKTVQMKFLSDTKQQSLAAGL 135 (479) Q Consensus 57 ~~~l~~~~~~f~f~~~i~k~~~~stv~eY~~~~~~G~~g~~~fv~E~g~~-~~~d~~~~r~~~~~k~l~~~~~vs~~~~l 135 (479) |..+.... -.++..+...++.+-...|.....-+..+...+++|++.. +.+++.+.+....++-++.-..+|.-+= T Consensus 136 ii~~~~~~--~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell- 212 (404) T protein:vir:39 136 INTLVRQY--DSLQQYVRVESVSTSNGSRVYEKWTDVTPLTVMDAEDGKIPDLDNPRLTIIKYLIKRYAGIITATNTLL- 212 (404) T ss_pred HHHHHHhh--hhHHhhcceeeccCCcceEEEEeecCCccceeeecCccccccccccceeeEEeeeeeEEeeehhHHHHH- Confidence 75444333 3455556666565544444433333344557789999875 4789999999999999998888887432 Q ss_pred hcchhhHHHHHHHHHHHHHHHHHHHHHhhcccccCCCCCCcccchhhhHHHhhcc-CCCEEEccCCC-CCHHHHhhhhhh Q lcl|NC_018856. 136 VNNIADPMTILTEDAIAVIAKSIEWAIFYGDAALSSEADGQAGIEFDGLHKLIDQ-DTNVIDLKGAR-LDEATLNKAAVI 213 (479) Q Consensus 136 vn~~~Dp~~~~~~~ai~~~~~~iE~a~f~Gd~~l~~~~~~~~gleFDGl~~~I~~-~~NviDarG~~-l~~~~l~~aa~~ 213 (479) .++..|.+....+.-...+.+.++.++++|+..-.+.. .++-+|++...|.. ....+...+.. ++...+++...+ T Consensus 213 ~ds~~~l~~~i~~~l~~~~~~~~d~~il~g~g~~~~~~---~~~~~~~i~~~~~~~~~~~~~~~a~~v~n~~~~~~L~~l 289 (404) T protein:vir:39 213 KDTAENILAWLSSWIAKKVVVTRNQAIIAAMGTVPKKP---TIAKFDDVITMINTSVDPAIIATSSLLTNQSGLNKLALV 289 (404) T ss_pred hhchHHHHHHHHHHHHHHHHHHHHHHHHhccccccccc---ccccHHHHHHHHHHhhhhhhccCCEEEEcHHHHHHHHHh Confidence 34556778888888999999999999999998865432 25778888877642 11122211111 233333333222 Q ss_pred hhhccCceEEEecChHHhhhHHHHhhCcceeeecc----CCCcceeeeehhhhcC-----CCcceeccccee----cCCC Q lcl|NC_018856. 214 VGKGYGRATDAFMPIGVQADFTNNLLDRQRVIQPS----TAGGFSTGFSINQFLS-----TRGAINLHGSTI----MEND 280 (479) Q Consensus 214 i~~~fG~~td~~mp~~vka~f~~~~~~~qrv~~~~----n~g~~~~G~~I~~~~s-----~~G~I~l~~s~~----m~~~ 280 (479) . ..-|. .+|.|. ....-...+++..-++..+ +.+....-+.+.+|.. .++.+.+.-+.. ...+ T Consensus 290 k-d~~G~--~l~~~~-~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~ 365 (404) T protein:vir:39 290 K-TAEGK--YLLEPD-PTKPNSYLIKGKKVIVVADRWLPNSGSTVYPLYYGDMSQAITLFDRENMSLLPTNIGAGAFETD 365 (404) T ss_pred h-ccCCc--eeeccC-cCCCCcceecceeEEEecccccCccCCCccEEEEEeccccEEEEeecceEEEEeccchhhhhhc Confidence 1 11122 233321 1111112222221111110 0111111122222211 122222221111 1111 Q ss_pred ce--ecccCC-cCCCCCCCceeEeeeeccCCCCCCCccc Q lcl|NC_018856. 281 NI--LLEGRN-PEPNAPQAPASVVASIVDDKKGGFRDED 316 (479) Q Consensus 281 ~~--L~e~~~-~~~~AP~~pa~v~at~~t~~~G~f~~~d 316 (479) .+ ..+.|+ ..+..|.+-..+.-++++.++|..+.-. T Consensus 366 ~~~~r~~~r~d~~~~~~~a~~~~~~~~~a~~~~~~~~~~ 404 (404) T protein:vir:39 366 TTKIRVIDRFDVKTTDSEALVAGSFTAIADQVGNFTAGK 404 (404) T ss_pred eeeEEEEeeeccEEecccceEEEEeeccccCCCCCCCCC Confidence 11 111111 1112233333333333333333332111 No 53 >protein:vir:104085 Length: 320 # NCBI annotation: gp17 # Family: family:all:507 # MgeID: mge:1656 # MgeName: Che12 # Cross-refs: genbank:acc:YP_655596;genbank:gi:109392467;genbank:GeneID:4156953 Probab=96.72 E-value=0.00015 Score=41.56 Aligned_cols=278 Identities=11% Similarity=0.079 Sum_probs=132.9 Q ss_pred hhcCCCcChh--------hccCccccchhhhhhhhhhheeccccccchhhccccchhHHHHhhhhhhccCcccccccccc Q lcl|NC_018856. 31 FTTGYGITPD--------TQLDGAAVRRELLEDQVKMLAFSSNDFTIYPLINKQQVNSTVAKYAVFNQHGRTGHSRFVRE 102 (479) Q Consensus 31 ~tag~~~~p~--------~~~~gaalr~esld~~~~~l~~~~~~f~f~~~i~k~~~~stv~eY~~~~~~G~~g~~~fv~E 102 (479) |.+|-..+++ +..+|+.+-.+...+-+..+. +...+.+.+...++.+.-.+|.++. ......+++| T Consensus 1 ~~~~~~~~~~~~~~~~t~~~~~~~~ip~~~~~~ii~~~~---~~s~l~~~~~~~~~~~~~~~~p~~~---~~~~a~~v~E 74 (320) T protein:vir:10 1 MAAGTAFQVDHAQIAQTGDTMFKGYLEPEQAKDYFAEAE---KTSIVQQFAQKVPMGTTGQKIPHWI---GDVSAQWIGE 74 (320) T ss_pred CCCCccCCHHHHHhhccccccccccccHHHHHHHHHHHH---hccchhhhcceeeccCCceEEEEEe---CCcceEEecC Confidence 3333332221 111233444444444333333 2234566666666654434555544 2334579999 Q ss_pred cccccccCcceEEEEEEEEeeeehhhhhhhHhhhcchhhHHHHHHHHHHHHHHHHHHHHHhhcccccCCCCCCcccchhh Q lcl|NC_018856. 103 VGVASINDPNIRQKTVQMKFLSDTKQQSLAAGLVNNIADPMTILTEDAIAVIAKSIEWAIFYGDAALSSEADGQAGIEFD 182 (479) Q Consensus 103 ~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lvn~~~Dp~~~~~~~ai~~~~~~iE~a~f~Gd~~l~~~~~~~~gleFD 182 (479) ++..+.+++++.+....++=++..-.+|+-+-. ++..|.+....+.-...+++.+|.++|.|+.+-.+ +.+. T Consensus 75 ~~~~~~~~~~f~~v~~~~~k~~~~~~is~ell~-ds~~~l~~~i~~~l~~a~a~~~d~a~l~G~g~~~~-------~~~~ 146 (320) T protein:vir:10 75 GDMKPITKGNMTSQNIAPHKIATIFVASAETVR-ANPANYLGTMRTKVATAFAMAFDSAALNGTDSPFP-------TYLA 146 (320) T ss_pred CccccccccceeEEEEeeEEEEEeehhhHHHHh-cChHHHHHHHHHHHHHHHHHHHHHHhhcccCCCCC-------cccc Confidence 999999999999999999999988888876433 45568888888888999999999999999975322 2223 Q ss_pred hHHHhhccCCCEEEccCCC---CC--HHHHhhhhhhhhhccCceEEEecChHHhhhHHHHhhCcceeeeccCCC------ Q lcl|NC_018856. 183 GLHKLIDQDTNVIDLKGAR---LD--EATLNKAAVIVGKGYGRATDAFMPIGVQADFTNNLLDRQRVIQPSTAG------ 251 (479) Q Consensus 183 Gl~~~I~~~~NviDarG~~---l~--~~~l~~aa~~i~~~fG~~td~~mp~~vka~f~~~~~~~qrv~~~~n~g------ 251 (479) |. +.. .++....|.- +. .+.+-.+...+..++....-..|++.....+...-...-|.+.+.... T Consensus 147 ~~---~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~ 222 (320) T protein:vir:10 147 QT---TKS-VSLADPGGATASDLTAYDAVAVNGLSLLVNAKKKWTHTLLDDIVEPILNGAKDKNGRPLFIESTYTDENSP 222 (320) T ss_pred cc---ccc-ccceecccccccccccHHHHHHHHHhhhhcccCCCcEEEEcHHHHHHHHHhhccCCceeeccccccCcccc Confidence 32 222 2333333322 11 223444455556667778889999999999975444333443332211 Q ss_pred ---cceeeeehh--hhcCCCcce-ecccc----eecCCCceecccCCcCCCCCCCceeEeeeeccCCCC----CCCcccc Q lcl|NC_018856. 252 ---GFSTGFSIN--QFLSTRGAI-NLHGS----TIMENDNILLEGRNPEPNAPQAPASVVASIVDDKKG----GFRDEDI 317 (479) Q Consensus 252 ---~~~~G~~I~--~~~s~~G~I-~l~~s----~~m~~~~~L~e~~~~~~~AP~~pa~v~at~~t~~~G----~f~~~d~ 317 (479) ..-.|++|- ... +.|.+ -+.|+ .+..+..+-++-. ... .... -+..++ .|...-. T Consensus 223 ~~~~~i~g~pv~~~~~~-~~~~~~~~~gd~~~~~~~~~~~~~i~~~-~~~------~~~~---~~~~~~~~~~~f~~~~~ 291 (320) T protein:vir:10 223 FRAGRIVSRPTILSDHV-ADGTTVGYMGDFRNVIWGQVGGLSFDVT-DQA------TLNL---GTPTEPNFVSLWQHNLV 291 (320) T ss_pred ccCceeeeeeeEecCCC-CCCceEEEEeecceEEEEEecCeEEEEe-ecc------eeee---ccccccccchhhhcCcE Confidence 112343332 222 23322 12221 1111111100000 000 0000 000000 0100000 Q ss_pred ---cceEEEEEEEcCcCc------ccccc Q lcl|NC_018856. 318 ---KTHSYKVVVHSDDAE------SLPSE 337 (479) Q Consensus 318 ---gty~YkVtavn~~GE------S~pS~ 337 (479) ...++-+...+...- -+|.. T Consensus 292 ~~r~~~~~d~~v~~~~a~~~l~~~~ap~~ 320 (320) T protein:vir:10 292 AVRVEAEYAFHNNDKDAFVKLTNVVTPDA 320 (320) T ss_pred EEEEEEeeccEEecccceEEEEeccCCCC Confidence 001111111111110 01111 No 54 >protein:vir:95318 Length: 328 # NCBI annotation: hypothetical protein # Family: family:all:1903 # MgeID: mge:1564 # MgeName: phiV10 # Cross-refs: genbank:acc:YP_512264;genbank:gi:89152431;genbank:GeneID:3952987 Probab=96.70 E-value=1.6e-05 Score=46.85 Aligned_cols=275 Identities=16% Similarity=0.194 Sum_probs=137.1 Q ss_pred CCccchhhhhhhhcCCccchHHHHHHHHHhhhcCCCcChhhccCccccchhhhhhhhhhheeccccccchhhccccchh- Q lcl|NC_018856. 1 MTELKKEAEAKNKKLPVEAEAELAELVSKSFTTGYGITPDTQLDGAAVRRELLEDQVKMLAFSSNDFTIYPLINKQQVN- 79 (479) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~e~~~Ks~tag~~~~p~~~~~gaalr~esld~~~~~l~~~~~~f~f~~~i~k~~~~- 79 (479) |++..-- .. .|.| ++|= .+|| .+..+ -+..|+..+ .+|..++=..++ T Consensus 1 m~~~~~~---------~~---TL~e-~Akr------~~~d------~~~~~----VIE~l~~~n---~IL~~lpf~e~n~ 48 (328) T protein:vir:95 1 MAVKGLT---------AL---TLAD-WGKR------VDPN------GKVDK----IIELLGQTN---PILQDMPFVEGNL 48 (328) T ss_pred CCccccc---------cc---cHHH-HHhh------hCcc------hhHHH----HHHHHhccc---hhHhhcceeeccc Confidence 4443211 11 1111 0010 0000 01111 111222222 234444444454 Q ss_pred HHHHhhhhhhccCcccccccccccccccccCcceEEEEEEEEeeeehhhhhhhHhhhcc-hhhHHHHHHHHHHHHHHHHH Q lcl|NC_018856. 80 STVAKYAVFNQHGRTGHSRFVREVGVASINDPNIRQKTVQMKFLSDTKQQSLAAGLVNN-IADPMTILTEDAIAVIAKSI 158 (479) Q Consensus 80 stv~eY~~~~~~G~~g~~~fv~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lvn~-~~Dp~~~~~~~ai~~~~~~i 158 (479) .|=|.|++. .+.-...|..=....+-+.++..|++..++-|..-..|.+...-.++ ..+-+++|.+.-|..+.+.+ T Consensus 49 gt~~~~~v~---~~LP~~~fR~lN~g~~~s~~tt~q~t~~l~ilgg~~eVDr~la~~~Gn~~~~ra~q~~~~~ka~~~~~ 125 (328) T protein:vir:95 49 PTGHRTTIR---SGLPSATWRLLNYGVQPSKSTTVQVTDSVGMLETYAEVDKSLADLNGNTAEFRLSEDRAFIEAMNQQM 125 (328) T ss_pred CCcceeeEe---eccCCceeeecCCccCcccceeEEEEEEEEEEecceeechHHHhhcCCHHHHHHHHHHHHHHHHHHHH Confidence 445666664 44434444332334456778999999999999999999987666544 66778999999999999999 Q ss_pred HHHHhhcccccCCCCCCcccchhhhHHHhhcc-----CCCEEEccCCCCCHHHHhhhhhhhhhccCceEEEecChHHhhh Q lcl|NC_018856. 159 EWAIFYGDAALSSEADGQAGIEFDGLHKLIDQ-----DTNVIDLKGARLDEATLNKAAVIVGKGYGRATDAFMPIGVQAD 233 (479) Q Consensus 159 E~a~f~Gd~~l~~~~~~~~gleFDGl~~~I~~-----~~NviDarG~~l~~~~l~~aa~~i~~~fG~~td~~mp~~vka~ 233 (479) +..+||||+..+| -+||||.+.+.. ++|+||+.|.--+.-.|+ ++.=+=....=+| |-+-++- T Consensus 126 ~~~~iyGdsa~~p-------~~F~GL~~R~~~~s~~~a~qiidaGgtg~~~TSi~----~v~~g~~~~~giy-PkG~~~G 193 (328) T protein:vir:95 126 AQTLFYGDSSVNP-------QQFMGLSSRYSSLSAGNAQNIIDAGGTGTDNTSIW----LVVWGENTVHGIF-PKGKKAG 193 (328) T ss_pred HHHHhcCCccCCh-------hhhcchhhhcCccccccccceeecccCCCCceEEE----EEEEcCCeEEEec-ccccccC Confidence 9999999999877 489999998842 468999999543332231 1111112333355 8888888 Q ss_pred HHHHhhCcceeeeccCCCcc---------eeeeehhhhcCCCcceecccceec----CCC--ceecccCCcCCC--CCCC Q lcl|NC_018856. 234 FTNNLLDRQRVIQPSTAGGF---------STGFSINQFLSTRGAINLHGSTIM----END--NILLEGRNPEPN--APQA 296 (479) Q Consensus 234 f~~~~~~~qrv~~~~n~g~~---------~~G~~I~~~~s~~G~I~l~~s~~m----~~~--~~L~e~~~~~~~--AP~~ 296 (479) |+-..++.+..... +.+.. ..|+.|.++.++-.-.++.-+..- ..+ +-+.++....|+ ...+ T Consensus 194 l~~~d~g~~~~~~~-~g~~y~~y~~~~~w~~Gl~i~d~r~vvrI~NId~~~l~~~~~~~~l~~lm~~a~~~ip~~~~~~~ 272 (328) T protein:vir:95 194 IQMEDKGQVTLEDA-NGGKYEGYRTHYKWDNGLALRDWRYVVRIANIDVSNLSEPSSAANIAKLMVKALHRIPNRGMGRP 272 (328) T ss_pred ceeeecCceeeecC-CCCeeeEEEEEEEeeeeeEEcCcccEEEEecCcccccccccChhhHHHHHHHHHHHhccCCCCcc Confidence 87777777777633 44432 456666666555332222111000 000 001111111110 0000 Q ss_pred -------------------ceeEeeeeccCCCCCCCcccccc-------eE---EEEE Q lcl|NC_018856. 297 -------------------PASVVASIVDDKKGGFRDEDIKT-------HS---YKVV 325 (479) Q Consensus 297 -------------------pa~v~at~~t~~~G~f~~~d~gt-------y~---YkVt 325 (479) ..... +.....|++...--|. +. =+|+ T Consensus 273 ~~y~n~~v~~~L~~q~~~~~n~~~--~~~~~~g~~~t~~~gipir~~dai~~tE~~vv 328 (328) T protein:vir:95 273 VFYMNRTVGQALDLQSLEKTSLAI--SVKETEGEWWTSFRGVPIRETDALLETEARVV 328 (328) T ss_pred eeehhHHHHHHHHHHHhcCcceee--eeeccCCcceeEECCeEEEEEeeeecCccccC Confidence 00000 0111112111000000 00 1222 No 55 >protein:vir:7324 Length: 335 # NCBI annotation: hypothetical protein # Family: family:all:1903 # MgeID: mge:143 # MgeName: epsilon15 # Cross-refs: genbank:acc:NP_848215;genbank:gi:30387386;genbank:GeneID:2641870 Probab=96.38 E-value=4.2e-05 Score=44.59 Aligned_cols=292 Identities=15% Similarity=0.132 Sum_probs=132.6 Q ss_pred CCccchhhhhhhhcCCccchHHHHHHHHHhhhcCCCcChhhccCccccchhhhhhhhhhheeccccccchhhccccchhH Q lcl|NC_018856. 1 MTELKKEAEAKNKKLPVEAEAELAELVSKSFTTGYGITPDTQLDGAAVRRELLEDQVKMLAFSSNDFTIYPLINKQQVNS 80 (479) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~e~~~Ks~tag~~~~p~~~~~gaalr~esld~~~~~l~~~~~~f~f~~~i~k~~~~s 80 (479) |++..-.. =.|.| ++|-+ ++|... +-..|.|. .++ .+|..++=...++ T Consensus 1 m~~~~~~a------------~TL~E-~Akr~------~~d~~~---~~IIE~l~-------~tn---eIL~~lpf~e~N~ 48 (335) T protein:vir:73 1 MALIGQTL------------PSLLD-IYNRT------DKNGRI---ARIVEQLA-------KTN---DILTDAIYVPCND 48 (335) T ss_pred CCcCCCCc------------hhHHH-HHhhc------CcchhH---HHHHHHHh-------cCc---hHHhhcchhcccC Confidence 54432210 01111 11211 111100 01222222 111 1122222222221 Q ss_pred -HHHhhhhhhccCcccccccccccccccccCcceEEEEEEEEeeeehhhhhhhHhhhcc-hhhHHHHHHHHHHHHHHHHH Q lcl|NC_018856. 81 -TVAKYAVFNQHGRTGHSRFVREVGVASINDPNIRQKTVQMKFLSDTKQQSLAAGLVNN-IADPMTILTEDAIAVIAKSI 158 (479) Q Consensus 81 -tv~eY~~~~~~G~~g~~~fv~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lvn~-~~Dp~~~~~~~ai~~~~~~i 158 (479) |=|.+.+ +-+--...|-.=.....-+.++..|++..++.|..-..|-+...-.++ ..+-+++|.+.-|..+.+.+ T Consensus 49 ~tg~~~~v---rt~LP~~~fR~lN~g~~~s~~tt~qvt~~l~ilgg~~eVDr~La~~~Gn~a~~ra~e~~~~ikam~q~~ 125 (335) T protein:vir:73 49 GSKHKTTI---RAGIPEPVWRRYNQGVQPTKTQTVPVTDTTGMLYDLGFVDKALADRSNNAAAFRVSENMGKLQGFNNKV 125 (335) T ss_pred CcccceeE---EEecCCchhhhcCCccccccceEEEEEEEEEEecchhhhhHHHHhhcCCHHHHHHHHHHHHHHHHHHHH Confidence 1122212 112222333221223344568999999999999999999987665544 67889999999999999999 Q ss_pred HHHHhhcccccCCCCCCcccchhhhHHHhhc--------cCCCEEEccCCCCCHHHHhhhhhhhhhccCceEEEecChHH Q lcl|NC_018856. 159 EWAIFYGDAALSSEADGQAGIEFDGLHKLID--------QDTNVIDLKGARLDEATLNKAAVIVGKGYGRATDAFMPIGV 230 (479) Q Consensus 159 E~a~f~Gd~~l~~~~~~~~gleFDGl~~~I~--------~~~NviDarG~~l~~~~l~~aa~~i~~~fG~~td~~mp~~v 230 (479) +..+||||++.+| -+||||.+.+. .+.|+||+.|.--..-.|+- +.=+=..+ ..+-|-|- T Consensus 126 ~~~~iyGDsa~~p-------~~FdGL~kR~~~~st~~a~~a~~iIdaGGtG~~~TSi~~----v~wg~~~~-~giyPkG~ 193 (335) T protein:vir:73 126 ARYSIYGNTDAEP-------EAFMGLAPRFNTLSTSKAASAENVFSAGGSGSTNTSIWF----MSWGENTA-HMIYPEGM 193 (335) T ss_pred HHHhccCCcCCCh-------hhccchhhhhcCccccccCcccceeeccccccCceEEEE----EEEcCCee-EEEcccCc Confidence 9999999999877 48999999872 23689999885443322221 11111222 33449999 Q ss_pred hhhHHHHhhCcceeeeccCCCcc---------eeeeehhhhcCCCcceeccccee---cCC----Cceeccc----CCcC Q lcl|NC_018856. 231 QADFTNNLLDRQRVIQPSTAGGF---------STGFSINQFLSTRGAINLHGSTI---MEN----DNILLEG----RNPE 290 (479) Q Consensus 231 ka~f~~~~~~~qrv~~~~n~g~~---------~~G~~I~~~~s~~G~I~l~~s~~---m~~----~~~L~e~----~~~~ 290 (479) |+-|+-..++.|..... +.+.+ .+|+.|.++.++-.--++.-+.. ... ...+++. ++|. T Consensus 194 kaGl~~~d~g~~~~~d~-~G~~y~~~~~~~~w~~Gl~i~d~r~vvRI~NIdvs~l~~d~~~~~~l~~lmi~a~~~~~ip~ 272 (335) T protein:vir:73 194 VAGFQHEDLGDDLVSDG-NGGQFRAYRDEFKWDIGLSVRDWRSISRICNIDVTTLTKDASTGADLISMMVDAYYARDVAM 272 (335) T ss_pred cccceeeeccceeeecC-CCCEEeEEEeeeeeeeeeEEeCcccEEEEeecccccccccccchhhHHhhHHHHHHHHhccC Confidence 98887777777777643 33332 56777777766643333321111 000 0011111 1111 Q ss_pred CCCCCCceeEee-ee-------ccCCCC-CCCcccccceEEEEEEEcC----cCccccccceeeee Q lcl|NC_018856. 291 PNAPQAPASVVA-SI-------VDDKKG-GFRDEDIKTHSYKVVVHSD----DAESLPSEAVTAAV 343 (479) Q Consensus 291 ~~AP~~pa~v~a-t~-------~t~~~G-~f~~~d~gty~YkVtavn~----~GES~pS~~vt~Tv 343 (479) . .++-+..=.- ++ ....+. ..+.+.. .-=+|+.++. .-++.-..+..++. T Consensus 273 ~-~~~~~~~y~n~~v~~~L~~q~~~~~n~~l~~~~~--~g~~~t~~~gipir~~Dail~tE~~v~~ 335 (335) T protein:vir:73 273 L-GDGKEVIYANKTIHAWLHKQAMNAKNVNLTIEEY--GGKKIVSFLGIPIRRVDAILNTESAVTA 335 (335) T ss_pred C-CCCceEEEechHHHHHHHHHHhccCceeeeeecc--CCceeEEECCeEEEEEeeeecCcccccC Confidence 1 1110110000 00 000000 0000000 0011111110 00111111111111 No 56 >protein:vir:78223 Length: 333 # NCBI annotation: Putative major head protein # Family: family:all:966 # MgeID: mge:1849 # MgeName: Bethlehem # Cross-refs: genbank:acc:YP_001491666;genbank:gi:157786490;genbank:GeneID:5625701 Probab=96.38 E-value=0.00042 Score=39.12 Aligned_cols=303 Identities=12% Similarity=0.019 Sum_probs=141.2 Q ss_pred hHHHHHHHHHhhhcCCCcChhhccCccccchhhhhhhhhhheeccccccchhhccccchhHHHHhhhhhhcc-----Ccc Q lcl|NC_018856. 20 EAELAELVSKSFTTGYGITPDTQLDGAAVRRELLEDQVKMLAFSSNDFTIYPLINKQQVNSTVAKYAVFNQH-----GRT 94 (479) Q Consensus 20 ~~~~~e~~~Ks~tag~~~~p~~~~~gaalr~esld~~~~~l~~~~~~f~f~~~i~k~~~~stv~eY~~~~~~-----G~~ 94 (479) .+.|.|. .+.++|-..+....+.+++|-.+.+-.+|..+... +..+.+...+.++.+--.+|.+.... .+- T Consensus 1 ~a~l~el--~~~~~~~~~~g~~~~~~~~liP~~~~~~ii~~l~~--~s~l~~~~~~~~~~~~~~~~p~~~~~~~a~~v~e 76 (333) T protein:vir:78 1 MATLNEL--LPNSAGSNHQGRLAHVPSDLLPKEIVGPIFDKAQE--SSLVLRMGEQIPISYGETIIPTTVKRPEVGQVGV 76 (333) T ss_pred CchhHHh--hhhcccccccCceecCCccccchhHHHHHHHHHHh--hchhhhhcceeeccCCceEEEEEeCCceeEeecC Confidence 4444442 23344444444444556667777776666433322 22345555555555433445454432 233 Q ss_pred cccccccccccccccCcceEEEEEEEEeeeehhhhhhhHhhhcchhhHHHHHHHHHHHHHHHHHHHHHhhcccccCCCCC Q lcl|NC_018856. 95 GHSRFVREVGVASINDPNIRQKTVQMKFLSDTKQQSLAAGLVNNIADPMTILTEDAIAVIAKSIEWAIFYGDAALSSEAD 174 (479) Q Consensus 95 g~~~fv~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lvn~~~Dp~~~~~~~ai~~~~~~iE~a~f~Gd~~l~~~~~ 174 (479) |...++.|++..+..++.+.+.....+=++.--.+|.-+- .++..|.+....+.--..+++.+|.++|+|+-+-.+ T Consensus 77 g~~~~~~e~~~~~~~~~~f~~i~l~~~kl~~~~~is~ell-~~s~~~~~~~i~~~la~ai~~~~d~~~l~G~g~~~~--- 152 (333) T protein:vir:78 77 GTSNEQREGGLKPLSGTAWDTRSVSPIKLATIVTVSEEFA-RMNPSGLYTKLQGDLAYAIGRGIDLAVFHGKSPLTG--- 152 (333) T ss_pred cccccccccccccccccceeEEEEeeEEEEEeehhhHHHH-hcCHHHHHHHHHHHHHHHHHHHHHHHHhcccCCCCC--- Confidence 4566777888889999999999999999998888887332 245668888888999999999999999999987543 Q ss_pred CcccchhhhHHHhhccC-CCEEEc--cCCCCCHHHHhhhh-hhhhhccCceEEEecChHHhhhHHHHhh--C-cceeeec Q lcl|NC_018856. 175 GQAGIEFDGLHKLIDQD-TNVIDL--KGARLDEATLNKAA-VIVGKGYGRATDAFMPIGVQADFTNNLL--D-RQRVIQP 247 (479) Q Consensus 175 ~~~gleFDGl~~~I~~~-~NviDa--rG~~l~~~~l~~aa-~~i~~~fG~~td~~mp~~vka~f~~~~~--~-~qrv~~~ 247 (479) ..+.|+.+...-. ...++. .++....+.|.++- .+...++..++-.+|++.....|.+... + .-+.+.+ T Consensus 153 ----~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~vmn~~~~~~L~~~~~~~d~~G~~i~~ 228 (333) T protein:vir:78 153 ----SALQGIDTDNVIANTTNVDYLQETGDPLLDRLLDGYDLVSANTDVEFNGWAVDPRFRAHLLRAQAYRDANGNVDPS 228 (333) T ss_pred ----cccccccccccccccccccccccccchhHHHHHHHHHhhccccccCceEEEEcchHHHHHHHHhhhcCCCCceeec Confidence 4456665533211 111222 22334444444433 3444556677889999988877754321 2 1233333 Q ss_pred cCCCc----ceeeeehhh--hcCCCcceec--ccceecCCC--ceecccCCcCCCCCCCceeEee-eeccCCCCCCCcc- Q lcl|NC_018856. 248 STAGG----FSTGFSINQ--FLSTRGAINL--HGSTIMEND--NILLEGRNPEPNAPQAPASVVA-SIVDDKKGGFRDE- 315 (479) Q Consensus 248 ~n~g~----~~~G~~I~~--~~s~~G~I~l--~~s~~m~~~--~~L~e~~~~~~~AP~~pa~v~a-t~~t~~~G~f~~~- 315 (479) ..... .-.|++|-. ++........ ...+++.+. .++.. .-...-.+.. ++..+.++..... T Consensus 229 ~~~~~~~~~~l~G~Pv~~~~~i~~~~~~~~~~~~~~~~gD~~~~~~g~-------~~~~~i~~~~~~~~~~~~~~~~~~~ 301 (333) T protein:vir:78 229 RINLAAQTGDVLGLPAQFGRAVGGDLGAAVDSKTRIIGGDFSQLKFGF-------ADEIRIKMSDTATLTDSGSATVSMW 301 (333) T ss_pred CccccCCCceeeceeeEEccccCCCccccCCCccEEEEEecccEEEEE-------eeccEEEEeccccccccccceeehh Confidence 22211 123444421 1111100000 001111110 00100 0000000000 0011111110000 Q ss_pred cccceEEEE------EEEcCcCccccccceeeeee Q lcl|NC_018856. 316 DIKTHSYKV------VVHSDDAESLPSEAVTAAVA 344 (479) Q Consensus 316 d~gty~YkV------tavn~~GES~pS~~vt~Tv~ 344 (479) ..+.-.|++ ...+..+ ... ...++.+ T Consensus 302 ~~~~v~~r~~~r~d~~v~~~~a--~~~-l~~~~a~ 333 (333) T protein:vir:78 302 QTNQIAILIEVTFGWLLGDKQA--FVK-FVDDEQP 333 (333) T ss_pred hcCcEEEEEEEEEccEEecccc--eEE-EeccCCC Confidence 001111111 1111111 000 0001111 No 57 >protein:vir:95376 Length: 425 # NCBI annotation: phage major capsid protein # Family: family:all:635 # MgeID: mge:1567 # MgeName: GBSV1 # Cross-refs: genbank:acc:YP_764476;genbank:gi:115334630;genbank:GeneID:5179263 Probab=96.31 E-value=0.00037 Score=39.41 Aligned_cols=304 Identities=15% Similarity=0.162 Sum_probs=131.5 Q ss_pred CCccchh---------------hhhhhhcCCcc---ch--HHHHHH--------------HHHhhhcCCCcChhhccCcc Q lcl|NC_018856. 1 MTELKKE---------------AEAKNKKLPVE---AE--AELAEL--------------VSKSFTTGYGITPDTQLDGA 46 (479) Q Consensus 1 ~~~~~~~---------------~~~~~~~~~~~---~~--~~~~e~--------------~~Ks~tag~~~~p~~~~~ga 46 (479) +.....+ ...+....... .. .+..+. +.+.+.+. .+-++|+ T Consensus 73 le~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~gg 147 (425) T protein:vir:95 73 LEGEIAQLEDELEQINSKQPSNQSRQKMQGSKGDVVEMNRLQVREMLKTGEYYKRSEVVEFYEKFRNL-----RAVAGGE 147 (425) T ss_pred HHHHHHHHHHHHHHhhhhccchhhhhhhhhhhhhHHHHHHHHHHHHHhhhhhhhhhHHHHHHHHHHhh-----cccccCc Confidence 0000000 00000000000 00 000000 00111111 1224677 Q ss_pred ccchhhhhhhhhhheeccccccchhhccccchhHHHHhhhhhhccCcccccccccccccccccC-cceEEEEEEEEeeee Q lcl|NC_018856. 47 AVRRELLEDQVKMLAFSSNDFTIYPLINKQQVNSTVAKYAVFNQHGRTGHSRFVREVGVASIND-PNIRQKTVQMKFLSD 125 (479) Q Consensus 47 alr~esld~~~~~l~~~~~~f~f~~~i~k~~~~stv~eY~~~~~~G~~g~~~fv~E~g~~~~~d-~~~~r~~~~~k~l~~ 125 (479) .+-.+.+.++|...... .-.+++.+...+....+ ++.+ .++.+.+.|+.|++..+..+ +.+.+.....+=++. T Consensus 148 ~~vP~~~~~~Ii~~l~~--~~~i~~~~~~~~~~g~~-~ip~---~~~~~~a~~v~E~~~~~~~~~~~f~~i~l~~~k~~~ 221 (425) T protein:vir:95 148 LTIPEVVVNRIMDIMGD--YTTLYPLVDKIRVKGTT-RILV---DTDTSPATWIEQSGALPTGDVGTIASIDFDGFKVGK 221 (425) T ss_pred eeccHHHHHHHHHHHHh--hhhHHHhhceeecCcee-EEEE---ecCCccccccccccccccccccccceeeeeheeeee Confidence 88888888887533332 22345555444444433 3333 45556688999999865555 789888888887776 Q ss_pred hhhhhhhHhhhcchhhHHHHHHHHHHHHHHHHHHHHHhhcccccCCCCCCcccchhhhHHHhhccCCCEEEccCCCCCHH Q lcl|NC_018856. 126 TKQQSLAAGLVNNIADPMTILTEDAIAVIAKSIEWAIFYGDAALSSEADGQAGIEFDGLHKLIDQDTNVIDLKGARLDEA 205 (479) Q Consensus 126 ~~~vs~~~~lvn~~~Dp~~~~~~~ai~~~~~~iE~a~f~Gd~~l~~~~~~~~gleFDGl~~~I~~~~NviDarG~~l~~~ 205 (479) -..+|.-+ +.++..|.+....+.-...+++.+|.++|+|+..-++ ++.|+.+.+....++. ..+..++.+ T Consensus 222 ~~~iS~el-l~ds~~~l~~~i~~~l~~~i~~~~d~~il~G~G~~~~--------~p~Gil~~~~~~~~~~-~~~~~~~~~ 291 (425) T protein:vir:95 222 VTFVDNYL-LQDSIINLDDYVTKKIARAIAKALDLAIVKGTGAANK--------QPLGIIPSLPPENQVT-VEADNNLLK 291 (425) T ss_pred eehhhHHH-HhccHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCcc--------ccceeecccccccccc-cccccchHH Confidence 55555542 2345567788888888889999999999999865332 4678887776543444 344555666 Q ss_pred HHhhhhhhhhhccCceEE--EecChHHh-hh---HHHHhhCcceee-eccCCCcc-eeeeehh--hhcCCCcceecccc- Q lcl|NC_018856. 206 TLNKAAVIVGKGYGRATD--AFMPIGVQ-AD---FTNNLLDRQRVI-QPSTAGGF-STGFSIN--QFLSTRGAINLHGS- 274 (479) Q Consensus 206 ~l~~aa~~i~~~fG~~td--~~mp~~vk-a~---f~~~~~~~qrv~-~~~n~g~~-~~G~~I~--~~~s~~G~I~l~~s- 274 (479) .|.++.-.+..++..... .+|+..+. +. +...-...-|++ ++.+.+.. -.|++|- .++ +.+.|-| |+ T Consensus 292 ~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~l~~l~~~kd~~g~~i~~~~~~~~~~l~G~pvv~~~~~-~~~~i~~-Gd~ 369 (425) T protein:vir:95 292 NLVKQIGLIDTGDDSVGEIVAVMKRSTYYNRLVEFSIQVDSNGNVVGKLPNLRTPDLLGLRVVFNNFL-DDDTVLF-GEF 369 (425) T ss_pred HHHHHHHhhhhhccccCceEEEEeChHHHHHHHHHHhhcCCCCceeeccCCCCCccccceeeEEcCcC-CCccEEE-Eec Confidence 666665556666654433 34665542 11 111112222333 22222222 2243221 111 2222211 21 Q ss_pred ---eecCCCceecccCCcCCCCCCCceeEeeeeccCCCCCCCcccccceEEEEEEEcCcCccccccceeeeeecCCc Q lcl|NC_018856. 275 ---TIMENDNILLEGRNPEPNAPQAPASVVASIVDDKKGGFRDEDIKTHSYKVVVHSDDAESLPSEAVTAAVAKKDN 348 (479) Q Consensus 275 ---~~m~~~~~L~e~~~~~~~AP~~pa~v~at~~t~~~G~f~~~d~gty~YkVtavn~~GES~pS~~vt~Tv~~~g~ 348 (479) .++.+..+.+. +...+-+.. + .-.......+-...++..+-. ...+|++..|. T Consensus 370 ~~~~~~~~~~~~i~--------------~~~~~~f~~-~--~~~~~~~~r~d~~~~~~~a~~----~~~i~~~~~g~ 425 (425) T protein:vir:95 370 EQYTLVERENITID--------------SSTHVKFTE-D--QTAFRGKGRFDGKPVKPEAFV----LVTITDPVQGA 425 (425) T ss_pred ccEEEEeecceEEE--------------eeccccccc-C--ceEEEEEEeeCcEeecccceE----EEEecCcCCCC Confidence 11111111000 000000000 0 000001112222222222110 11222222121 No 58 >protein:vir:95763 Length: 297 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1578 # MgeName: SMP # Cross-refs: genbank:acc:YP_950590;genbank:gi:119953785;genbank:GeneID:5076833 Probab=96.30 E-value=0.00076 Score=37.68 Aligned_cols=287 Identities=10% Similarity=0.041 Sum_probs=139.4 Q ss_pred CCccchHHHHHHHHHhhhcCCCcChhhccCccccchhhhhhhhhhheeccccccchhhccccchhHHHHhhhhhhccCcc Q lcl|NC_018856. 15 LPVEAEAELAELVSKSFTTGYGITPDTQLDGAAVRRELLEDQVKMLAFSSNDFTIYPLINKQQVNSTVAKYAVFNQHGRT 94 (479) Q Consensus 15 ~~~~~~~~~~e~~~Ks~tag~~~~p~~~~~gaalr~esld~~~~~l~~~~~~f~f~~~i~k~~~~stv~eY~~~~~~G~~ 94 (479) |+ .+.|.+..-+ +-+++++|-++.+.+++..+.... -.+++...+.+..+.-..+.. .-.+. T Consensus 1 m~-----------~~~~~~~~~~---~t~~~~~lvP~~~~~~ii~~~~~~--s~l~~~~~~~~~~~~~~~~~~--~~~~~ 62 (297) T protein:vir:95 1 MT-----------VQTFNPENVL---VSQKKDGTLHKEFTDIIMKEVAQN--SLVMQLGQYQEMEGEQEKTVY--VQTDG 62 (297) T ss_pred CC-----------cccccccccc---ccCCCcceechhHHHHHHHHHHhh--chhhhhcceeecCCCccEEEE--EEcCC Confidence 11 0222222222 223456677777766665443222 245555555555432221211 11223 Q ss_pred cccccccccccccccCcceEEEEEEEEeeeehhhhhhhHhhhcchhhHHHHHHHHHHHHHHHHHHHHHhhcccccCCCCC Q lcl|NC_018856. 95 GHSRFVREVGVASINDPNIRQKTVQMKFLSDTKQQSLAAGLVNNIADPMTILTEDAIAVIAKSIEWAIFYGDAALSSEAD 174 (479) Q Consensus 95 g~~~fv~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lvn~~~Dp~~~~~~~ai~~~~~~iE~a~f~Gd~~l~~~~~ 174 (479) ....+++|++..+..++++.......+=++-.-.+|..+ +.++..|.+....+.--..+.+.+|.++++|+.+-.+ T Consensus 63 ~~a~~v~Eg~~~~~~~~~f~~v~l~~~k~~~~~~is~el-l~ds~~~l~~~i~~~la~ai~~~~d~a~l~G~g~~~~--- 138 (297) T protein:vir:95 63 ISAYWVNETEKIKTDKPEVVPVTLKAHKLGIILVTSREA-LNYTWKKFFEDMKPQIVEAFYKKIDEAGLLGHDTPFA--- 138 (297) T ss_pred ceeEEeecCccccccccceeEEEEeeEEEEEeehhhHHH-HhcCHHHHHHHHHHHHHHHHHHHHHHHHhcccCCccc--- Confidence 346799999999999999999999999999988888742 2345678888888888899999999999999865322 Q ss_pred CcccchhhhHHHhhccCCCEEEccCCCCCHHHHhhhhhhhhhccCceEEEecChHHhhhHHHHhhCcceeeeccCCCcce Q lcl|NC_018856. 175 GQAGIEFDGLHKLIDQDTNVIDLKGARLDEATLNKAAVIVGKGYGRATDAFMPIGVQADFTNNLLDRQRVIQPSTAGGFS 254 (479) Q Consensus 175 ~~~gleFDGl~~~I~~~~NviDarG~~l~~~~l~~aa~~i~~~fG~~td~~mp~~vka~f~~~~~~~qrv~~~~n~g~~~ 254 (479) .|+.+.+... +... +.-++.+.|.++.-.+..++...+-..|+....+.+...-...-|.+.....+ .- T Consensus 139 -------~gi~~~~~~~-~~~~--~~~~t~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~L~~l~d~~G~~i~~~~~~-~l 207 (297) T protein:vir:95 139 -------NSVAKAAKDA-NKVI--GGPINYDNILKLQDALYDADVEPNAFVSKIQNRSALREARDGNKVSIYDKAAN-TI 207 (297) T ss_pred -------cccccccccc-ceec--ccccCHHHHHHHHHHhhhccCCcCEEEEcHHHHHHHHHhhccCCceeecCCCC-cc Confidence 4566655432 3332 33456666666666677788888889999999998876444433444432222 22 Q ss_pred eeeehhhhc---CCCcceecccceecCCC-ce-ecccCCcCCCCCCCceeEee----eeccCCCCCCCc-ccccceEEEE Q lcl|NC_018856. 255 TGFSINQFL---STRGAINLHGSTIMEND-NI-LLEGRNPEPNAPQAPASVVA----SIVDDKKGGFRD-EDIKTHSYKV 324 (479) Q Consensus 255 ~G~~I~~~~---s~~G~I~l~~s~~m~~~-~~-L~e~~~~~~~AP~~pa~v~a----t~~t~~~G~f~~-~d~gty~YkV 324 (479) .|+++-... ...|.+- +.+. .. +... . .....+.. ....+.+|.... -..+.-.+|+ T Consensus 208 ~G~Pv~~~~~~~~~~~~~~------~gd~s~~~~~~~------~-~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~ 274 (297) T protein:vir:95 208 DGITTVDLKSARFEKGDLL------AGDFDNLIYGVP------Y-NITYKISEEGQISTITNADGTPINLFEQEMIAIRA 274 (297) T ss_pred cceeeEeecCCCCCCceEE------EEecccEEEEEe------c-CeEEEEeeccccccccccCccchhhhhcCcEEEEE Confidence 244332111 1111110 0000 00 0000 0 00000000 000000110000 0000011222 Q ss_pred EEEcCcCccccccceeeeeecCC Q lcl|NC_018856. 325 VVHSDDAESLPSEAVTAAVAKKD 347 (479) Q Consensus 325 tavn~~GES~pS~~vt~Tv~~~g 347 (479) ...-+.+---|...+..+.+.+. T Consensus 275 ~~~~d~~v~~~~a~~~l~~at~~ 297 (297) T protein:vir:95 275 TMDIAVMITKTDAFAKLTPAERV 297 (297) T ss_pred EEEeccEeecccceEEEeecCCC Confidence 22111111111111111111111 No 59 >protein:vir:94771 Length: 298 # NCBI annotation: major head protein # Family: family:all:966 # MgeID: mge:1529 # MgeName: phi LC3 # Cross-refs: genbank:acc:NP_996706;genbank:gi:45597421;genbank:GeneID:2769044 Probab=96.26 E-value=0.00081 Score=37.52 Aligned_cols=294 Identities=11% Similarity=-0.006 Sum_probs=136.7 Q ss_pred hccCccccchhhhhhhhhhheeccccccchhhccccchhHHHHhhhhhhccCcccccccccccccccccCcceEEEEEEE Q lcl|NC_018856. 41 TQLDGAAVRRELLEDQVKMLAFSSNDFTIYPLINKQQVNSTVAKYAVFNQHGRTGHSRFVREVGVASINDPNIRQKTVQM 120 (479) Q Consensus 41 ~~~~gaalr~esld~~~~~l~~~~~~f~f~~~i~k~~~~stv~eY~~~~~~G~~g~~~fv~E~g~~~~~d~~~~r~~~~~ 120 (479) =-++|+.|-.+.+..+|..+.. +...+.+.....++.+.-.+|.++... +...+++|++..+.+++.+.+..... T Consensus 1 ma~~gG~lip~~~~~~ii~~~~--~~s~i~~~~~~~~~~~~~~~~p~~~~~---~~a~~v~Eg~~~~~~~~~f~~v~l~~ 75 (298) T protein:vir:94 1 MVLNKGTLFDPELVTDLISKVA--GKSSIARLSAQKPIPFNGEKVFTFTMD---SEIDVVAESGKKTHGGVTLAPQTMVP 75 (298) T ss_pred CeeccccccChhHHHHHHHHHH--hhchhhhhcceeeccCCceEEEEEecC---cceEEeeCCccccccccceeEEEEee Confidence 1123444555555555533222 222344444444455433345554433 34578999999999999999999999 Q ss_pred EeeeehhhhhhhHhhhc--chhhHHHHHHHHHHHHHHHHHHHHHhhcccccCCCCCCcccchhhhHHHhhccCCCEEEc- Q lcl|NC_018856. 121 KFLSDTKQQSLAAGLVN--NIADPMTILTEDAIAVIAKSIEWAIFYGDAALSSEADGQAGIEFDGLHKLIDQDTNVIDL- 197 (479) Q Consensus 121 k~l~~~~~vs~~~~lvn--~~~Dp~~~~~~~ai~~~~~~iE~a~f~Gd~~l~~~~~~~~gleFDGl~~~I~~~~NviDa- 197 (479) +=++.--.+|.-+-..+ ...+.+....++-...+++.+|.++++|...-+. ......|....+....|.... T Consensus 76 ~k~~~~~~iS~ell~~~~~~~~~l~~~i~~~la~ai~~~~d~~~l~G~~~~~g-----~~~~~~~~~~~~~~~~~~~~~~ 150 (298) T protein:vir:94 76 IKVEYGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLG-----TASAVIGTNHFDSKVTQKVEAP 150 (298) T ss_pred eEEEEeeehhHHHhccCCccHHHHHHHHHHHHHHHHHHHHHHHhhcccccCCC-----cccccccccccccccccccccc Confidence 99988888887763222 2346677788889999999999999999543221 112233333333332232221 Q ss_pred -cCCCCCHHHHhhhhhhhhhccCceEEEecChHHhhhHHHHhhCcceeeeccCCCcceeeeehhhhcCCCcceeccccee Q lcl|NC_018856. 198 -KGARLDEATLNKAAVIVGKGYGRATDAFMPIGVQADFTNNLLDRQRVIQPSTAGGFSTGFSINQFLSTRGAINLHGSTI 276 (479) Q Consensus 198 -rG~~l~~~~l~~aa~~i~~~fG~~td~~mp~~vka~f~~~~~~~qrv~~~~n~g~~~~G~~I~~~~s~~G~I~l~~s~~ 276 (479) .+..+ .+.|.++...+..++..+.-..|++...+.+...-...-|.+.+....+. .+.++ T Consensus 151 ~~~~~~-~~~i~~~~~~~~~~~~~~~~~vmn~~~~~~l~~lkd~~G~~l~~~~~~~~------------------~~~tl 211 (298) T protein:vir:94 151 RGIADP-NGAIENAVELLTGVDADVTGIAINPSFRSALAKQKDLQGNALFPELKWGA------------------TPDTI 211 (298) T ss_pred cccccH-HHHHHHHHHhhhhcCCCccEEEEcHHHHHHHHHhhccCCCeeecCcccCC------------------CCcee Confidence 22222 34456666666677778888999999998886644333344433221110 11223 Q ss_pred cCCCceecccCCcCCCCCCCceeEeeeeccCCCCCCCcccccceEEEEEEEcCcCccccccceeeeeecCCceEEEEEEe Q lcl|NC_018856. 277 MENDNILLEGRNPEPNAPQAPASVVASIVDDKKGGFRDEDIKTHSYKVVVHSDDAESLPSEAVTAAVAKKDNTVKLEVKL 356 (479) Q Consensus 277 m~~~~~L~e~~~~~~~AP~~pa~v~at~~t~~~G~f~~~d~gty~YkVtavn~~GES~pS~~vt~Tv~~~g~sv~ltIT~ 356 (479) +..+-.. . ...+.+.++.-...-.|.++.-+...-+.+ +.+++.. T Consensus 212 ~G~PV~~-~----------------~~v~~~~~~~~~~~~~Gdfs~~~~~~~~~~------------------~~~~~~~ 256 (298) T protein:vir:94 212 NGLPVDV-N----------------KTVSDMSLTQRDRAIIGDFANGFKWGYAKE------------------VPLEVIQ 256 (298) T ss_pred cceeeEE-e----------------cccccccCCCccEEEEeeccceEEEEEecC------------------ceEEEee Confidence 3322111 1 111100000000001122221111111111 1111111 Q ss_pred cCCCCcccceEEEEEecCCCcceEEEEeeeeeeecCCceEEEeecc Q lcl|NC_018856. 357 ASLYQAQPQFISVYREGTETGHYFLIARVPVSKVNDQGVIEVLDRN 402 (479) Q Consensus 357 ~~~~~a~~~~y~IYR~~~~~G~y~li~rv~vs~~n~~g~T~ftD~N 402 (479) .+..-. +.++-|++. --.|....|+...-.+.......++.+ T Consensus 257 ~~~~d~--~~~~~f~~~--~v~~r~~~r~~~~~~~~~a~~~l~~~t 298 (298) T protein:vir:94 257 YGDPDN--SGLDLKGYN--QVYIRAELFLGWGILDATKFARVTEAN 298 (298) T ss_pred cCCCcC--cchhhhhcC--cEEEEEEEEeccEeecccceEEEEecC Confidence 000000 001111110 011222233333322333444444444 No 60 >protein:vir:4856 Length: 293 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:106 # MgeName: DT1 # Cross-refs: genbank:acc:NP_049396;genbank:gi:9632424;genbank:GeneID:1258532 Probab=96.24 E-value=0.00035 Score=39.52 Aligned_cols=274 Identities=12% Similarity=0.072 Sum_probs=122.4 Q ss_pred HHHhhhcCCCcChhhccCccccchhhhhhhhhhheeccccccchhhccccchhHHHHhhhhhhccC-ccccccccccccc Q lcl|NC_018856. 27 VSKSFTTGYGITPDTQLDGAAVRRELLEDQVKMLAFSSNDFTIYPLINKQQVNSTVAKYAVFNQHG-RTGHSRFVREVGV 105 (479) Q Consensus 27 ~~Ks~tag~~~~p~~~~~gaalr~esld~~~~~l~~~~~~f~f~~~i~k~~~~stv~eY~~~~~~G-~~g~~~fv~E~g~ 105 (479) +.++++++.. ++|++|-.+-+.++|..+.... ..+.+.....+.....-++.. ..+. ..+...+++|++. T Consensus 1 ~l~~~~~~t~------~~gg~liP~~~~~~Ii~~~~~~--~~l~~~~~~~~~~~~~g~~~~-~~~~~~~~~a~~v~Eg~~ 71 (293) T protein:vir:48 1 MLDSKTDHSG------SDAGLTIPQDIRTAINTLVRQY--DSLQEYVNVENVTTLTGSRVY-EKWTDITGLANIDDEAGK 71 (293) T ss_pred Cceeeccccc------CcCceEechhHHHHHHHHHHhh--hhhhhhceeeeccCCcceEEE-EeecCCCcceeeecCCcc Confidence 5567766544 3577787888877774333322 234444444444433333222 2232 3445779999987 Q ss_pred cc-ccCcceEEEEEEEEeeeehhhhhhhHhhhcchhhHHHHHHHHHHHHHHHHHHHHHhhcccccCCCCCCcccchhhhH Q lcl|NC_018856. 106 AS-INDPNIRQKTVQMKFLSDTKQQSLAAGLVNNIADPMTILTEDAIAVIAKSIEWAIFYGDAALSSEADGQAGIEFDGL 184 (479) Q Consensus 106 ~~-~~d~~~~r~~~~~k~l~~~~~vs~~~~lvn~~~Dp~~~~~~~ai~~~~~~iE~a~f~Gd~~l~~~~~~~~gleFDGl 184 (479) .. .+++.+.+....+|-++....+|.-+- .++.-|.+....+..-..++..++.++|.|..+..... .-+-+|.| T Consensus 72 ~~~~~~~~~~~i~l~~~k~~~~~~iS~ell-~ds~~~l~~~i~~~la~~~~~~~~~~i~~g~~~~~~~~---~~~~~d~i 147 (293) T protein:vir:48 72 IADIDDPKLSLIKYTIKRYAGISTVTNSLL-ADSAENILAWLSGWIAKKVVVTRNKAILGVVDKLPTKP---TLTKWDDI 147 (293) T ss_pred cccccccceeEEEEeeeEEEEeehhhHHHH-hhhhHHHHHHHHHHHHHHHHHHHHhHHhhccccccccc---cccCHHHH Confidence 65 688999999999999998877776442 34455777778888888889999999999887644321 12444554 Q ss_pred HHhhccCCCEEEccCCCCCHHHHhhhhhhhhhccCceEEEecChHHhhhHHHHhhCcceeeeccCCCcc----eeeeehh Q lcl|NC_018856. 185 HKLIDQDTNVIDLKGARLDEATLNKAAVIVGKGYGRATDAFMPIGVQADFTNNLLDRQRVIQPSTAGGF----STGFSIN 260 (479) Q Consensus 185 ~~~I~~~~NviDarG~~l~~~~l~~aa~~i~~~fG~~td~~mp~~vka~f~~~~~~~qrv~~~~n~g~~----~~G~~I~ 260 (479) .+++.. +..+|......+|+..+.+.+...-...-|.+...+..+. -.|++|- T Consensus 148 ~~~~~~-----------------------l~~~~~~~a~~vmn~~~~~~L~~lkd~~g~~l~~~~~~~~~~~~l~G~Pv~ 204 (293) T protein:vir:48 148 IDLEAK-----------------------VDPAIKQTSFFLTNTSGFTALKKVKNALGDYLMERDVKSPTGYSIAGFAVK 204 (293) T ss_pred HHHHHh-----------------------hhhhhcCCCEEEEcHHHHHHHHHhhccCCceEeecCcCCCCCceecceeeE Confidence 444432 2233444445677777777776544443344433332222 2343321 Q ss_pred h----hcCC--Ccce-ecccc-----eecCCCceeccc-CCcCCCCCCCceeEeeeeccCCCCCCCcccccceEEEEEEE Q lcl|NC_018856. 261 Q----FLST--RGAI-NLHGS-----TIMENDNILLEG-RNPEPNAPQAPASVVASIVDDKKGGFRDEDIKTHSYKVVVH 327 (479) Q Consensus 261 ~----~~s~--~G~I-~l~~s-----~~m~~~~~L~e~-~~~~~~AP~~pa~v~at~~t~~~G~f~~~d~gty~YkVtav 327 (479) - +... .|.. .+.|+ .+.++..+-++. +......-..-+..-+..-. +++.. ..-.|+.+ T Consensus 205 ~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~r~~~r~--d~~~~------~~~a~~~l 276 (293) T protein:vir:48 205 EISDRWLPNASSGVMPLYFGDLKQAVTLFDRQQMSLLSTNIGGGAFETDTTKVRVIDRF--DVVAT------DTEAFVPA 276 (293) T ss_pred EecccccCCccCCceEEEEEeccceEEEEEecceEEEEecccchhhhcCeEEEEEEEee--CcEEe------cccceEEE Confidence 0 0000 0000 01010 000101000000 00000000000000000000 00000 00001111 Q ss_pred cCcCccccccceeeeeecC Q lcl|NC_018856. 328 SDDAESLPSEAVTAAVAKK 346 (479) Q Consensus 328 n~~GES~pS~~vt~Tv~~~ 346 (479) .--+ +.+.+.+....+. T Consensus 277 ~~~~--~~~~~~~~~~~~~ 293 (293) T protein:vir:48 277 SFKA--IADQKGNIGSTAV 293 (293) T ss_pred Eeec--cccCCccccccCC Confidence 1000 0000001000000 No 61 >protein:vir:4997 Length: 397 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:109 # MgeName: Sfi21 # Cross-refs: genbank:acc:NP_049971;genbank:gi:9632943;genbank:GeneID:1262106 Probab=96.21 E-value=0.00076 Score=37.68 Aligned_cols=303 Identities=12% Similarity=0.071 Sum_probs=124.0 Q ss_pred CCccchhhhhhhhcCCccchHHHHHHHHHhhhcCCC-----cChhhccCccccchhhhhhhhhhheeccccccchhhccc Q lcl|NC_018856. 1 MTELKKEAEAKNKKLPVEAEAELAELVSKSFTTGYG-----ITPDTQLDGAAVRRELLEDQVKMLAFSSNDFTIYPLINK 75 (479) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~e~~~Ks~tag~~-----~~p~~~~~gaalr~esld~~~~~l~~~~~~f~f~~~i~k 75 (479) +.......... .......+....+.+.+..+-. ..-.+.++|+.+.++.+..+|..+.... -.+++.+.. T Consensus 71 ~~~~~~~~~~~---~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~t~~~gg~~iP~~~~~~ii~~~~~~--~~l~~~~~~ 145 (397) T protein:vir:49 71 MSEEEKKPLTK---NEEEVKANFVKDFKNLVRGRYQNLLDSKTDGSGSDAGLTIPQDIRTAINTLVRQF--DSLQEYVNV 145 (397) T ss_pred ccccccccccc---hhhHHHHHHHHHHHHHhhcchhhHHHhhhccCCccCcceecHHHHHHHHHHHHhh--hhHhhhcce Confidence 00000000000 0000011111112222221111 1112344577888888877774443333 345555555 Q ss_pred cchhHHHHhhhhhhcc-Ccccccccccccccc-cccCcceEEEEEEEEeeeehhhhhhhHhhhcchhhHHHHHHHHHHHH Q lcl|NC_018856. 76 QQVNSTVAKYAVFNQH-GRTGHSRFVREVGVA-SINDPNIRQKTVQMKFLSDTKQQSLAAGLVNNIADPMTILTEDAIAV 153 (479) Q Consensus 76 ~~~~stv~eY~~~~~~-G~~g~~~fv~E~g~~-~~~d~~~~r~~~~~k~l~~~~~vs~~~~lvn~~~Dp~~~~~~~ai~~ 153 (479) .++..-.-.|.. ..+ +..+...+++|++.. +...+.+...+..++-++.-..+|.-+- .++..|.+....+..... T Consensus 146 ~~~~~~~~~~~~-~~~~~~~~~a~~v~E~~~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell-~ds~~~l~~~i~~~l~~~ 223 (397) T protein:vir:49 146 ENVTTLTGSRVY-EKWADITGLAKLDDEGGQIGQNDDPKLSLIRYAIKRYAGISTVTNSLL-ADSAENILAWLSGWIAKK 223 (397) T ss_pred eeccCCcceEEE-EeeccCCcceeeeccccccccccccceeeeEeeeeeeEeehhhHHHHH-hhhhHHHHHHHHHHHHHH Confidence 555433222221 112 223456799999975 4566899999999999998888886432 345667888899999999 Q ss_pred HHHHHHHHHhhcccccCCCCCCcccchhhhHHHhhccCCCEEEccCCCCCHHHHhhhhhhhhhccCceEEEecChHHhhh Q lcl|NC_018856. 154 IAKSIEWAIFYGDAALSSEADGQAGIEFDGLHKLIDQDTNVIDLKGARLDEATLNKAAVIVGKGYGRATDAFMPIGVQAD 233 (479) Q Consensus 154 ~~~~iE~a~f~Gd~~l~~~~~~~~gleFDGl~~~I~~~~NviDarG~~l~~~~l~~aa~~i~~~fG~~td~~mp~~vka~ 233 (479) +++.++.++++|+..-.+... .+-+|+|.+++. .+..+|......+|+....+. T Consensus 224 ~~~~~d~ail~G~g~~~~~~~---~~~~d~i~~~~~-----------------------~l~~~~~~~a~~v~n~~~~~~ 277 (397) T protein:vir:49 224 VVVTRNKAILEAIGTLPNKPT---LAKWDDIIDLQA-----------------------KVDPAIKQTSLFLTNTSGFTA 277 (397) T ss_pred HHHHHHHHHHhcccccccccc---ccCHHHHHHHHH-----------------------hhhhhhcCCCEEEEcHHHHHH Confidence 999999999999987554221 344555554442 233345555566677666666 Q ss_pred HHHHhhCcceeeeccCCCc----ceeeeehhh---hcCCCc---ce-ecccc-----eecCCCceecccC-CcCCCCCCC Q lcl|NC_018856. 234 FTNNLLDRQRVIQPSTAGG----FSTGFSINQ---FLSTRG---AI-NLHGS-----TIMENDNILLEGR-NPEPNAPQA 296 (479) Q Consensus 234 f~~~~~~~qrv~~~~n~g~----~~~G~~I~~---~~s~~G---~I-~l~~s-----~~m~~~~~L~e~~-~~~~~AP~~ 296 (479) +...-...-|.+...+... .-.|++|-- -..+.+ .. -+.|+ .++++...-++.. ......-.. T Consensus 278 l~~lkd~~g~~l~~~~~~~g~~~~l~G~pV~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~ 357 (397) T protein:vir:49 278 LKKVKNAMGDYLMERDVKSPTGYSIDGFVVKEISDRFLPNGTGGAMPLYFGDLKQAVTLFDRQHLSLLSTNIGGGAFETD 357 (397) T ss_pred HHHhhccCCceeecccccCCCCceecceeeEEecccccccccCCceeEEEeeccceEEEEeecccEEEEeccccchhhcC Confidence 6543333333333222211 122322210 000100 00 01110 0111110000000 000000000 Q ss_pred ceeEeeeeccCCCCCCCcccccceEEEEEEEcCcCccccccceeeeeec Q lcl|NC_018856. 297 PASVVASIVDDKKGGFRDEDIKTHSYKVVVHSDDAESLPSEAVTAAVAK 345 (479) Q Consensus 297 pa~v~at~~t~~~G~f~~~d~gty~YkVtavn~~GES~pS~~vt~Tv~~ 345 (479) -...-+..-. ++.....+ +--..++++... +.+.+.+..+ T Consensus 358 ~~~~~~~~r~--d~~~~~~~-a~~~~~~~~~~~------~~~~~~~~~~ 397 (397) T protein:vir:49 358 TTKVRVIDRF--DVVSTDTE-AFVPASFKAIAD------QKAKLSTAGA 397 (397) T ss_pred eeeEEEEEee--ccEEeccc-ceEEEEeccccc------ccCcccccCC Confidence 0000000000 00000000 000001111000 0001111111 No 62 >protein:vir:1025 Length: 408 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:20 # MgeName: bIL286 # Cross-refs: genbank:acc:NP_076679;genbank:gi:13095788;genbank:GeneID:920362 Probab=96.12 E-value=0.00038 Score=39.32 Aligned_cols=288 Identities=12% Similarity=0.102 Sum_probs=121.7 Q ss_pred CCccchh---hhh--------hhhcCCccchHHHHHHHHHhh----hcCCC---------cChhhccCccccchhhhhhh Q lcl|NC_018856. 1 MTELKKE---AEA--------KNKKLPVEAEAELAELVSKSF----TTGYG---------ITPDTQLDGAAVRRELLEDQ 56 (479) Q Consensus 1 ~~~~~~~---~~~--------~~~~~~~~~~~~~~e~~~Ks~----tag~~---------~~p~~~~~gaalr~esld~~ 56 (479) +.+.+.+ .+. ..............+...|+| ..+.+ ....+..+|+.|-.+.+..+ T Consensus 56 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~t~~~gg~~vP~~~~~~ 135 (408) T protein:vir:10 56 RDALREQLVEAQAEQVVNMREEEKGPLNKSENELKDKFVKDFVNMVRNPMAFMNTVSSKTETSGSDSAAGLTIPQDIRTM 135 (408) T ss_pred HHHHHHHHHHHHHHHHhccccccccccccchhhhHHHHHHHHHHHhhcchhhhhhhhhhhhhcccccCCceeccHhHHHH Confidence 0000000 000 000000011111112222322 11110 11123345778888888888 Q ss_pred hhhheeccccccchhhccccchhHHHHhhhhhhccCccccccccccccccc-ccCcceEEEEEEEEeeeehhhhhhhHhh Q lcl|NC_018856. 57 VKMLAFSSNDFTIYPLINKQQVNSTVAKYAVFNQHGRTGHSRFVREVGVAS-INDPNIRQKTVQMKFLSDTKQQSLAAGL 135 (479) Q Consensus 57 ~~~l~~~~~~f~f~~~i~k~~~~stv~eY~~~~~~G~~g~~~fv~E~g~~~-~~d~~~~r~~~~~k~l~~~~~vs~~~~l 135 (479) |..+.... -.+.+.+...++.+..-.+......++.+...+++|++... .+++.+.......+-++.-..+|.-+ + T Consensus 136 Ii~~~~~~--~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~i~~~~~k~~~~~~iS~el-l 212 (408) T protein:vir:10 136 INTLVRQY--DSLQQYVRVESVSTSNGSRVYEKWTDVTPLTVMDAEDGKIPDLDNPQLTIIKYLIKRYAGIITATNTS-L 212 (408) T ss_pred HHHHHHhh--chhhhhcceeeccCCcceEEEeeccccccceeeecCccccccccCcceeeEEeeeeeEEeeehhHHHH-H Confidence 75444333 34555566666665444444433445556788999998765 67799999999999999887777754 2 Q ss_pred hcchhhHHHHHHHHHHHHHHHHHHHHHhhcccccCCCCCCcccchhhhHHHhhcc-CCCEEEccCCCCCHHHHhhhhhhh Q lcl|NC_018856. 136 VNNIADPMTILTEDAIAVIAKSIEWAIFYGDAALSSEADGQAGIEFDGLHKLIDQ-DTNVIDLKGARLDEATLNKAAVIV 214 (479) Q Consensus 136 vn~~~Dp~~~~~~~ai~~~~~~iE~a~f~Gd~~l~~~~~~~~gleFDGl~~~I~~-~~NviDarG~~l~~~~l~~aa~~i 214 (479) .++.-|......+.-...+.+.++.+++.|+.+-.+... ..-+|.+...+.. ...-+...+ T Consensus 213 ~ds~~~l~~~i~~~l~~~~~~~~~~~il~g~g~~~~~~~---~~~~~~l~~~~~~~~~~~~~~~a--------------- 274 (408) T protein:vir:10 213 KDTAENILAWLSSWIAKKVVVTRNQAIIEVMKAAPKKPT---IAKFDDVITMINTAVDPAIIATS--------------- 274 (408) T ss_pred hhchHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccc---cccHHHHHHHHHHhhhhhhccCC--------------- Confidence 345667788888888899999999999999988554221 3456666655421 111111111 Q ss_pred hhccCceEEEecChHHhhhHHHHhhCcceeeeccCCCc----ceeeeeh--------------------hhhc-----CC Q lcl|NC_018856. 215 GKGYGRATDAFMPIGVQADFTNNLLDRQRVIQPSTAGG----FSTGFSI--------------------NQFL-----ST 265 (479) Q Consensus 215 ~~~fG~~td~~mp~~vka~f~~~~~~~qrv~~~~n~g~----~~~G~~I--------------------~~~~-----s~ 265 (479) -.+|+....+.+...-...-|.+.+.+..+ .-.|++| .+|. .. T Consensus 275 --------~~v~n~~~~~~l~~lkd~~G~~i~~~~~~~~~~~~l~G~PV~~~~~~~~~~~~~~~~~i~~gd~~~~~~~~~ 346 (408) T protein:vir:10 275 --------SLLTNQSGLNKLALVKTAEGKYLLEPDPTKPNSYLIKGKQVIVVADRWLPNTGSTVYPLYYGDMSQAITLFD 346 (408) T ss_pred --------EEEEcHHHHHHHHHhhccCCceEeccCcCCCCCceecceeeEEecccccCccCCCceEEEEEehhccEEEEE Confidence 133444444444332221112221111111 1122211 1110 00 Q ss_pred Ccceecccce-----ecCCCce-e----cccCCcCCCCCCCceeEeeeeccCCCCCCCcccccceEEEEEEEcCcCcccc Q lcl|NC_018856. 266 RGAINLHGST-----IMENDNI-L----LEGRNPEPNAPQAPASVVASIVDDKKGGFRDEDIKTHSYKVVVHSDDAESLP 335 (479) Q Consensus 266 ~G~I~l~~s~-----~m~~~~~-L----~e~~~~~~~AP~~pa~v~at~~t~~~G~f~~~d~gty~YkVtavn~~GES~p 335 (479) ++.+.+.-+. |..+... . ..+.+..|+|-. ...-++++...|.. T Consensus 347 ~~~~~v~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~---~~~~~~~~~~~~~~----------------------- 400 (408) T protein:vir:10 347 RENMSLLPTNIGAGAFETDTTKIRVIDRFDVKATDSEALV---AGSFSAIADQVGNF----------------------- 400 (408) T ss_pred ecceEEEEcccccchhhcCceEEEEEEeeccEEeccccEE---EEEeeccccCCCCC----------------------- Confidence 1111111110 0000000 0 011111111110 00000000000000 Q ss_pred ccceeeeeecCCceE Q lcl|NC_018856. 336 SEAVTAAVAKKDNTV 350 (479) Q Consensus 336 S~~vt~Tv~~~g~sv 350 (479) .|. + .++| T Consensus 401 -----~~~-~-~~~~ 408 (408) T protein:vir:10 401 -----KTT-T-STAV 408 (408) T ss_pred -----CCC-C-cccC Confidence 000 0 0000 No 63 >protein:vir:105038 Length: 428 # NCBI annotation: major capsid head protein precursor # Family: family:all:21 # MgeID: mge:1465 # MgeName: phiKO2 # Cross-refs: genbank:acc:YP_006586;genbank:gi:46402092;genbank:GeneID:2777903 Probab=96.11 E-value=0.00043 Score=39.03 Aligned_cols=316 Identities=14% Similarity=0.082 Sum_probs=137.6 Q ss_pred CCccchhhhhhhh---------cCCccchHHHH----HHHHHhhhcCCCcChhhccCccccchhhhhhhhhhheeccccc Q lcl|NC_018856. 1 MTELKKEAEAKNK---------KLPVEAEAELA----ELVSKSFTTGYGITPDTQLDGAAVRRELLEDQVKMLAFSSNDF 67 (479) Q Consensus 1 ~~~~~~~~~~~~~---------~~~~~~~~~~~----e~~~Ks~tag~~~~p~~~~~gaalr~esld~~~~~l~~~~~~f 67 (479) ..+.+......+. +....+..... ....++...+ .+-..|+.|-.+.+..+|..+.. +. T Consensus 82 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~gg~liP~~~~~~ii~~l~---~~ 153 (428) T protein:vir:10 82 KAEPKQYTGAGMTRMVMSIAAAQGNLQDAAKFASDELNDQSVSMAIS-----TAAGSGGVLIPQNIHSEVIELLR---DR 153 (428) T ss_pred ccccchhhhHHHHHHHHHHHHhhhhHHHHHHHhhhhhhhhhHhhhhc-----ccccCCccccchhHHHHHHHHHh---hh Confidence 0011111111110 00110000000 0000111100 11123566777777666644332 12 Q ss_pred cchhhccccch--hHHHHhhhhhhccCcccccccccccccccccCcceEEEEEEEEeeeehhhhhhhHhhhcchhhHHHH Q lcl|NC_018856. 68 TIYPLINKQQV--NSTVAKYAVFNQHGRTGHSRFVREVGVASINDPNIRQKTVQMKFLSDTKQQSLAAGLVNNIADPMTI 145 (479) Q Consensus 68 ~f~~~i~k~~~--~stv~eY~~~~~~G~~g~~~fv~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lvn~~~Dp~~~ 145 (479) ..+..+.-+.+ .+--.+|.++. +.+...+++|++..+.+++.+.+.+...+=++.--.+|..+ +.++..|.+.. T Consensus 154 ~~l~~~~~~~~~~~~g~~~~p~~~---~~~~a~~v~Eg~~~~~~~~~f~~i~~~~~k~~~~v~is~el-l~ds~~~l~~~ 229 (428) T protein:vir:10 154 TIVRKLGARSIPLPNGNMSLPRLA---GGATASYTGENQDAKVSEARFDDVKLTAKTMIAMVPISNAL-IGRAGFNVEQL 229 (428) T ss_pred chhhhhcceeeecCCcceEEEEEe---CCcceeeeccCccccccccceeeEEeeeEEEEEeehhhHHH-HhhhhHHHHHH Confidence 22333211111 12223444433 33456799999999999999999999999888887777765 34556678888 Q ss_pred HHHHHHHHHHHHHHHHHhhcccccCCCCCCcccchhhhHHHhhccCCCEEE-ccCCCCCHHHHhhhhh------hhhhcc Q lcl|NC_018856. 146 LTEDAIAVIAKSIEWAIFYGDAALSSEADGQAGIEFDGLHKLIDQDTNVID-LKGARLDEATLNKAAV------IVGKGY 218 (479) Q Consensus 146 ~~~~ai~~~~~~iE~a~f~Gd~~l~~~~~~~~gleFDGl~~~I~~~~NviD-arG~~l~~~~l~~aa~------~i~~~f 218 (479) ..+.-...+.+.+|.++++||.. |-+++|+.+.......++. ..+...+.+.+..... .....+ T Consensus 230 i~~~l~~ai~~~~d~~~l~G~G~---------~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 300 (428) T protein:vir:10 230 VLQDILTAISVREDKAFMRDDGT---------GDTPIGMKARATQWNRLLPWAADAAVNLDTIDTYLDSIILMSMDGNSN 300 (428) T ss_pred HHHHHHHHHHHHHHHHHhccCCC---------CccccccccccccccccccccccccccHHHHHHHHHHHHHhhhccccc Confidence 89999999999999999999854 3357898887653323333 3345555555443221 112223 Q ss_pred CceEEEecChHHhhhHHHHhhCcceeeeccCCCcceeeeehhhhcCCCcc--------eecccceecCCCceecc-cCCc Q lcl|NC_018856. 219 GRATDAFMPIGVQADFTNNLLDRQRVIQPSTAGGFSTGFSINQFLSTRGA--------INLHGSTIMENDNILLE-GRNP 289 (479) Q Consensus 219 G~~td~~mp~~vka~f~~~~~~~qrv~~~~n~g~~~~G~~I~~~~s~~G~--------I~l~~s~~m~~~~~L~e-~~~~ 289 (479) -...-.+|+......+...=...-|.+.+...++--.|++|-........ .-+.|+ | .+.++.+ +-+. T Consensus 301 ~~~~~~v~n~~~~~~L~~lkd~~G~~i~~~~~~g~l~G~pv~~~~~~p~~~~~~~~~~~i~~gd-~--s~~~i~~~~~i~ 377 (428) T protein:vir:10 301 MISSGWGMSNRTYMKLFGLRDGNGNKVYPEMAQGMLKGYPIQRTSAIPANLGEGGKESEIYFAD-F--NDVVIGEDGNMK 377 (428) T ss_pred cccCEEEEcHHHHHHHHHhhccCCceeccCCCCCeeeceeeEEeccccccccCCCccceEEEEe-c--ceEEEEEecceE Confidence 33344678888888775543333344444333333456554221111000 011111 0 0011111 0000 Q ss_pred CCCCCCCceeEeeeeccCCCCCCCcccccceEEEEEEEcCcCccccccceeeeeecC Q lcl|NC_018856. 290 EPNAPQAPASVVASIVDDKKGGFRDEDIKTHSYKVVVHSDDAESLPSEAVTAAVAKK 346 (479) Q Consensus 290 ~~~AP~~pa~v~at~~t~~~G~f~~~d~gty~YkVtavn~~GES~pS~~vt~Tv~~~ 346 (479) -.-.+.+.- ..+... ....|.- +.-.+++...-+-.---|+..+..|-..= T Consensus 378 i~~~~~~~~--~~~~~~-~~~~f~~---~~~~~R~~~r~d~~v~~p~a~~~~t~~~~ 428 (428) T protein:vir:10 378 VDFSKEASY--IDTDGK-LVSAFSR---NQSLIRVVTEHDIGFRHPEGLVLGTGVLF 428 (428) T ss_pred EEeeccccc--cccccc-ccchhhc---chhheeeeeeeCceeeccceEEEEeccCC Confidence 000000000 000000 0011211 11222222222222222333333221111 No 64 >protein:vir:3845 Length: 395 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:322 # MgeName: phi adh # Cross-refs: genbank:acc:NP_050151;swissprot:trembl:q9t1f6;genbank:gi:9633043;uniprot:Q9T1F6;genbank:GeneID:1262163 Probab=96.10 E-value=0.00015 Score=41.58 Aligned_cols=296 Identities=11% Similarity=0.039 Sum_probs=122.3 Q ss_pred CCccchhhhhhhhcCCccchHHHHHHHHHhhhcCCCcChhhccCccccchhhhhhhhhhheeccccccchhhccccchhH Q lcl|NC_018856. 1 MTELKKEAEAKNKKLPVEAEAELAELVSKSFTTGYGITPDTQLDGAAVRRELLEDQVKMLAFSSNDFTIYPLINKQQVNS 80 (479) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~e~~~Ks~tag~~~~p~~~~~gaalr~esld~~~~~l~~~~~~f~f~~~i~k~~~~s 80 (479) +..++.+.+. ........+....+.|.+...-.....+-.+|+.|-.+.+.++|-.+... .-.+.+-+...++.+ T Consensus 74 ~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~~vP~~~~~~ii~~~~~--~~~l~~~~~~~~~~~ 148 (395) T protein:vir:38 74 EPVNKKPLPV---KDGKPDAQAMKNQFVKDFKNLVTSGTTGTGNAGLTIPEDIQLQIRTLTRS--FTSLESLANVENVTT 148 (395) T ss_pred ccccccccch---hhhhHHHHHHHHHHHHHHHHHHhhccCccCCCceecchhHhhHHHHHHHh--hcchhhhcceeeccC Confidence 1111111111 01111122333444455422211222233457888888887777443333 334555555555554 Q ss_pred HHHhhhhhhccCccccccccccccccc-ccCcceEEEEEEEEeeeehhhhhhhHhhhcchhhHHHHHHHHHHHHHHHHHH Q lcl|NC_018856. 81 TVAKYAVFNQHGRTGHSRFVREVGVAS-INDPNIRQKTVQMKFLSDTKQQSLAAGLVNNIADPMTILTEDAIAVIAKSIE 159 (479) Q Consensus 81 tv~eY~~~~~~G~~g~~~fv~E~g~~~-~~d~~~~r~~~~~k~l~~~~~vs~~~~lvn~~~Dp~~~~~~~ai~~~~~~iE 159 (479) ...+|.....-+..+...+++|++..+ ..++.+.+.....+-++.--.+|.-+= .++..|.+....+.-...+...++ T Consensus 149 ~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~f~~v~~~~~k~~~~~~iS~ell-~ds~~~l~~~i~~~la~~~~~~~~ 227 (395) T protein:vir:38 149 SHGSRVYEKLADITPLKDLDDESALIGDNDDPELTVVKYLIHRYAGITTVTNTLL-KDTVDNIIQWLVNWAAKKDVVTRN 227 (395) T ss_pred CcceEEEEeeccCCccccccccccccccccccceeeEEeeeeeeEeehhhHHHHH-hhhHHHHHHHHHHHHHHHHHHHHH Confidence 444332211111223456899998865 567999999999999988888877432 344557778888888899999999 Q ss_pred HHHhhcccccCCCCCCcccchhhhHHHhhcc-CCCEEEccCCC-CCHHHHhhhhhhhhhccCceEEEecChHHhhhHHHH Q lcl|NC_018856. 160 WAIFYGDAALSSEADGQAGIEFDGLHKLIDQ-DTNVIDLKGAR-LDEATLNKAAVIVGKGYGRATDAFMPIGVQADFTNN 237 (479) Q Consensus 160 ~a~f~Gd~~l~~~~~~~~gleFDGl~~~I~~-~~NviDarG~~-l~~~~l~~aa~~i~~~fG~~td~~mp~~vka~f~~~ 237 (479) .++++|+..-.+.. ...-+|.+.+.+.. ....+...+.. ++...+.....+. ..-|. -+|.|. ........ T Consensus 228 ~~il~g~g~~~~~~---~~~~~~~i~~~~~~~l~~~~~~~a~~v~n~~~~~~L~~lk-d~~G~--~l~~~~-~~~~~~~~ 300 (395) T protein:vir:38 228 AKILEVMGKAPKKP---TISQFDNIKDLENNTLDPAIESTSSFITNQSGYNILSKVK-DADGR--YLMQPD-VTSPDKYL 300 (395) T ss_pred HHHhhccccccccc---ccccHHHHHHHHHHhhhhhhcCCCEEEEcHHHHHHHHHhh-ccCCc--eeeccC-cCCCCcce Confidence 99999998754422 23557777766531 11112111111 2222222222111 11121 223221 11111111 Q ss_pred hhCcceeeecc-----CCCcceeeeehhhhc-----CCCcceecccce-----ecCCCc-eec----ccCCcCCCCCCCc Q lcl|NC_018856. 238 LLDRQRVIQPS-----TAGGFSTGFSINQFL-----STRGAINLHGST-----IMENDN-ILL----EGRNPEPNAPQAP 297 (479) Q Consensus 238 ~~~~qrv~~~~-----n~g~~~~G~~I~~~~-----s~~G~I~l~~s~-----~m~~~~-~L~----e~~~~~~~AP~~p 297 (479) +++...++.++ ..+... +.+.+|. ..++.+.+.-+. |..+.. +.. .+.+..|+|-..- T Consensus 301 l~G~pV~~~~~~~~~~~~~~~~--i~~gd~~~~~~i~~~~~~~i~~~~~~~~~~~~~~~~~r~~~r~d~~~~~~~a~~~~ 378 (395) T protein:vir:38 301 IDGKPVIRIADKWLPDVSGSHP--LYFGDLKQGITLFDRQQMQIDTTNVGAGSFEHDTTKLRFIDRFDVQLIDDGAFAAA 378 (395) T ss_pred eccceeEEecccccCcCCCcce--EEEEeccccEEEEEecceEEEEeccccchhhcCceEEEEEEeeccEEecccceEEE Confidence 22211111110 000000 1111111 001111111110 111000 001 1111222211100 Q ss_pred eeE---eeeeccCCCCC Q lcl|NC_018856. 298 ASV---VASIVDDKKGG 311 (479) Q Consensus 298 a~v---~at~~t~~~G~ 311 (479) ... +.+..++..|+ T Consensus 379 ~~~~~~~~~~~~~~~~~ 395 (395) T protein:vir:38 379 SFKTVANQAQGTAGTGK 395 (395) T ss_pred EeecccCCCCCccCCCC Confidence 000 00111222222 No 65 >protein:vir:99920 Length: 311 # NCBI annotation: gp7 # Family: family:all:966 # MgeID: mge:1611 # MgeName: Halo # Cross-refs: genbank:acc:YP_655524;genbank:gi:109392294;genbank:GeneID:4157089 Probab=96.10 E-value=0.00039 Score=39.26 Aligned_cols=293 Identities=11% Similarity=0.013 Sum_probs=143.0 Q ss_pred hhcCCCcChhhccCccccchhhhhhhhhhheeccccccchhhccccchhHHHHhhhhhhccCcccccccccccccccccC Q lcl|NC_018856. 31 FTTGYGITPDTQLDGAAVRRELLEDQVKMLAFSSNDFTIYPLINKQQVNSTVAKYAVFNQHGRTGHSRFVREVGVASIND 110 (479) Q Consensus 31 ~tag~~~~p~~~~~gaalr~esld~~~~~l~~~~~~f~f~~~i~k~~~~stv~eY~~~~~~G~~g~~~fv~E~g~~~~~d 110 (479) |-+ ..++|+.|-.+.+.++|..+... ...+.+-..+.+..+--.+|.++. +.....+++|++..+.++ T Consensus 1 Mat-------~tt~~g~~vP~~~~~~ii~~~~~--~s~l~~~~~~i~~~~~~~~~p~~~---~~~~a~wv~Eg~~~~~~~ 68 (311) T protein:vir:99 1 MAT-------FGTGNLKNLPRNIADGMVKDVVQ--GSTVAVLSARKPQRFGNEDIITFN---GRPKAEFVGEGQQKSSTT 68 (311) T ss_pred Cce-------ecCCCceeccHHHHHHHHHHHHh--hchhhhhcceeeccCCceEEEEEe---CCceeEEeecCccccccc Confidence 221 11344555566665655433322 223444444555554333455533 334577999999999999 Q ss_pred cceEEEEEEEEeeeehhhhhhhHhhh--cchhhHHHHHHHHHHHHHHHHHHHHHhhcccccCCCCCCcccchhhhHHHhh Q lcl|NC_018856. 111 PNIRQKTVQMKFLSDTKQQSLAAGLV--NNIADPMTILTEDAIAVIAKSIEWAIFYGDAALSSEADGQAGIEFDGLHKLI 188 (479) Q Consensus 111 ~~~~r~~~~~k~l~~~~~vs~~~~lv--n~~~Dp~~~~~~~ai~~~~~~iE~a~f~Gd~~l~~~~~~~~gleFDGl~~~I 188 (479) +++.......|=++.--.+|.-+-.. ++..|.+....+.--..+++.+|.++|+|+.+-. |..+-|+...+ T Consensus 69 ~~f~~v~l~~~k~~~~~~iS~ell~~~~d~~~~l~~~i~~~la~ai~~~~d~~~l~G~g~~~-------g~~~~g~~~~~ 141 (311) T protein:vir:99 69 GEFDFVTSTPKKAQVTMRFNEEVQWADEDYQLGVLQTLSEAGAEALARALDLGLYHRINPLT-------GTVIPGWSNYL 141 (311) T ss_pred ceeeEEEEeeEEEEEeehhhHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHhhcccCccc-------Ccccccccccc Confidence 99999999999888888888776333 3456788899999999999999999999987532 23455666666 Q ss_pred ccCCCEEEccCCCCC--HHHHhhhhhhhhhc--cCceEEEecChHHhhhHHHHhhCcceeeeccCCCcc----eeeeehh Q lcl|NC_018856. 189 DQDTNVIDLKGARLD--EATLNKAAVIVGKG--YGRATDAFMPIGVQADFTNNLLDRQRVIQPSTAGGF----STGFSIN 260 (479) Q Consensus 189 ~~~~NviDarG~~l~--~~~l~~aa~~i~~~--fG~~td~~mp~~vka~f~~~~~~~qrv~~~~n~g~~----~~G~~I~ 260 (479) ....+.+...+.... .+.+..+...+..+ ...++-+.|++.+...+...-...-|.+.+...... -.|+++- T Consensus 142 ~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~vmn~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~l~G~Pv~ 221 (311) T protein:vir:99 142 GAASKRVELTADTIANPDLAIEAAVGLLVANGHPTPVNGLALHPSIAWGLSTARYTDGRKKFPELGLGIGVSSFEGIDAS 221 (311) T ss_pred ccccceeeccccccchhHHHHHHHHHHHhhhccCCCccEEEEcHHHHHHHHhhhccCCCeeecCcccCCCCceecceeeE Confidence 655566666554433 23444443333322 345566899999999996644443355544332221 2344332 Q ss_pred hhcCC-Ccceecccc--eecCCCc-ee-cccC--CcCCCCCCCceeEeeeec-cCCCCCCCcccccceEEEEEEEcCcCc Q lcl|NC_018856. 261 QFLST-RGAINLHGS--TIMENDN-IL-LEGR--NPEPNAPQAPASVVASIV-DDKKGGFRDEDIKTHSYKVVVHSDDAE 332 (479) Q Consensus 261 ~~~s~-~G~I~l~~s--~~m~~~~-~L-~e~~--~~~~~AP~~pa~v~at~~-t~~~G~f~~~d~gty~YkVtavn~~GE 332 (479) -.... .+.+...+. ....+.. .+ .+-. +.--..-.....+..... ......|. .+.--||+...-+-.- T Consensus 222 ~s~~i~~~~~~~~~~~~~~~~~~~~~~~Gdf~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~d~~~~r~~~r~d~~v 298 (311) T protein:vir:99 222 VSDTVNGGDEADPDDEDLDAARAVRGIVGDFANGIHWGVQRDIPVELIKYGDPDGQGDLKR---HNQIALRLEIVYGWYV 298 (311) T ss_pred eecccccccccccccchhhccCcceEEEeeccccEEEEEecCceEEEeecCCCCcchhhhh---cCcEEEEEEEeeccee Confidence 21111 111111000 0010110 00 0000 000000000111111000 11111221 1223344433222221 Q ss_pred cccccceeeeeecC Q lcl|NC_018856. 333 SLPSEAVTAAVAKK 346 (479) Q Consensus 333 S~pS~~vt~Tv~~~ 346 (479) --| ..+..+..+. T Consensus 299 ~~~-~~v~~~~~~A 311 (311) T protein:vir:99 299 FTD-RFVVIENAVA 311 (311) T ss_pred cCh-hHeeeecccC Confidence 111 1222221111 No 66 >protein:vir:9410 Length: 415 # NCBI annotation: head protein # Family: family:all:21 # MgeID: mge:167 # MgeName: phi 13 # Cross-refs: genbank:acc:NP_803388;genbank:gi:29028700;genbank:GeneID:1258136 Probab=96.06 E-value=0.00073 Score=37.79 Aligned_cols=300 Identities=13% Similarity=0.047 Sum_probs=136.7 Q ss_pred CCccchhhhhh----------------hhcCCccchHHHHHHHHHhhhcCCCcCh--hhccCccccchhhhhhhhhhhee Q lcl|NC_018856. 1 MTELKKEAEAK----------------NKKLPVEAEAELAELVSKSFTTGYGITP--DTQLDGAAVRRELLEDQVKMLAF 62 (479) Q Consensus 1 ~~~~~~~~~~~----------------~~~~~~~~~~~~~e~~~Ks~tag~~~~p--~~~~~gaalr~esld~~~~~l~~ 62 (479) +.+.++..+.. ..........+. ..+.+.+..+..... .+-.+|+.+..+.+.+.|..+.. T Consensus 68 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~-~~~~~~~~~~~~~~~~~~~~~~g~~~iP~~~~~~ii~~~~ 146 (415) T protein:vir:94 68 SENNQQSVEVNEASTYRNQANINDLGISIQNTKVTSQEV-RDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKE 146 (415) T ss_pred hhhccccccccchhhHHHHHHHHHHHhhhhhhhhhHHHH-HHHHHHhhhhhhhhhhccccccccccCcHHHHHHHHHHHH Confidence 00000000000 000000001111 111122222222111 22345788888888877754433 Q ss_pred ccccccchhhccccchhHHHHhhhhhhccCccccccccccccccc-ccCcceEEEEEEEEeeeehhhhhhhHhhhcchhh Q lcl|NC_018856. 63 SSNDFTIYPLINKQQVNSTVAKYAVFNQHGRTGHSRFVREVGVAS-INDPNIRQKTVQMKFLSDTKQQSLAAGLVNNIAD 141 (479) Q Consensus 63 ~~~~f~f~~~i~k~~~~stv~eY~~~~~~G~~g~~~fv~E~g~~~-~~d~~~~r~~~~~k~l~~~~~vs~~~~lvn~~~D 141 (479) .. ..+.+.+...++.+--..|.+.. +.+.+...+++|++..+ ..++.+.+....++-++.-..+|.-+ +.++..| T Consensus 147 ~~--~~l~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~v~Eg~~~~~~~~~~~~~i~~~~~k~~~~~~is~el-l~ds~~~ 222 (415) T protein:vir:94 147 VE--FNLDKYVTVKRVTNGSGKYPVVR-QSEVAALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREA-IEDAKVN 222 (415) T ss_pred hh--hhhhhhcceeeccCCceeEEEEe-ecCCccceeccccccccccccccceeeEeeheeeeeechhhHHH-HhhchHH Confidence 32 34555556566654444454443 33334577999998865 67799999999999999888888753 3455668 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhhcccccCCCCCCcccchhhhHHHhhccCCCEEEccCCCCCHHHHhhhhhhhhhccCce Q lcl|NC_018856. 142 PMTILTEDAIAVIAKSIEWAIFYGDAALSSEADGQAGIEFDGLHKLIDQDTNVIDLKGARLDEATLNKAAVIVGKGYGRA 221 (479) Q Consensus 142 p~~~~~~~ai~~~~~~iE~a~f~Gd~~l~~~~~~~~gleFDGl~~~I~~~~NviDarG~~l~~~~l~~aa~~i~~~fG~~ 221 (479) .+....+.-...++..++.+++.|+..-.+... +... ....+.....+.. +.+.|.++--.+...+... T Consensus 223 ~~~~i~~~l~~~~~~~~~~~il~g~g~g~~~~~--------~~~~--~~~~~~~~~~~~~-~~~~i~~~~~~~~~~~~~~ 291 (415) T protein:vir:94 223 VLQELKLWMARTIAATRNKAIIDVITKGSTGST--------SSGF--EKEGKKLEVKKAK-SLDDIKDAINLNVKPNYEH 291 (415) T ss_pred HHHHHHHHHHHHHHHHHHHHHhhccccCccccc--------cccc--ccccccccccccc-chHHHHHHHHhhhhhccCC Confidence 888888999999999999999999876332111 1111 1112333344433 3344444333344555566 Q ss_pred EEEecChHHhhhHHHHhhCcceeeeccCCCc----ceeeeehhhhc-CCCcc---e-ecccc----e-ecCCCceecc-- Q lcl|NC_018856. 222 TDAFMPIGVQADFTNNLLDRQRVIQPSTAGG----FSTGFSINQFL-STRGA---I-NLHGS----T-IMENDNILLE-- 285 (479) Q Consensus 222 td~~mp~~vka~f~~~~~~~qrv~~~~n~g~----~~~G~~I~~~~-s~~G~---I-~l~~s----~-~m~~~~~L~e-- 285 (479) +-.+|++...+.+...-...-|++...+..+ .-.|++|--.. .+-|. . -+.|+ . +.++..+-++ T Consensus 292 ~~~vmn~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~i~~gd~~~~~~~~~~~~~~v~~~ 371 (415) T protein:vir:94 292 NVAIVSQTMFAKLDKMKDKLGNYLIQPDVKEKTQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQYQASWT 371 (415) T ss_pred CEEEEcHHHHHHHHHhhccCCCeeeccCcCCCCCceecceeeEEecccccCCCCccEEEEEehhccEEEEeecceEEEEe Confidence 7799999998888654333334443323222 12344332111 01111 0 01110 1 1111111000 Q ss_pred ------------cCC-cCCCCCCCceeEeeeeccCCCCCCCcccccceEEEEEEEcCcCccccccceeeeeecCCceEEE Q lcl|NC_018856. 286 ------------GRN-PEPNAPQAPASVVASIVDDKKGGFRDEDIKTHSYKVVVHSDDAESLPSEAVTAAVAKKDNTVKL 352 (479) Q Consensus 286 ------------~~~-~~~~AP~~pa~v~at~~t~~~G~f~~~d~gty~YkVtavn~~GES~pS~~vt~Tv~~~g~sv~l 352 (479) .++ ..+--|.+-..+.-++++.+.|.. -| T Consensus 372 ~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~~~~~~~~~~--------------------------------------~~ 413 (415) T protein:vir:94 372 DYMHFGECLMIAVRQDCRILDYKSAIVIEYDDSERGEGDL--------------------------------------GL 413 (415) T ss_pred ccccCceEEEEEEEeccEEeccccEEEEEEeccCCCCCcc--------------------------------------cc Confidence 000 000011111111111121111111 11 Q ss_pred EE Q lcl|NC_018856. 353 EV 354 (479) Q Consensus 353 tI 354 (479) +- T Consensus 414 ~~ 415 (415) T protein:vir:94 414 EA 415 (415) T ss_pred CC Confidence 11 No 67 >protein:vir:103370 Length: 418 # NCBI annotation: hypothetical protein # Family: family:all:11266 # MgeID: mge:1621 # MgeName: PaP2 # Cross-refs: genbank:acc:YP_024741;genbank:gi:48697083;genbank:GeneID:2846038 Probab=96.05 E-value=0.00024 Score=40.45 Aligned_cols=326 Identities=17% Similarity=0.156 Sum_probs=139.9 Q ss_pred CCccchh------hhhhhhcCCccchHHHHHHH-------HHhhhcCCCcCh----------hhccCccccchhhhhhhh Q lcl|NC_018856. 1 MTELKKE------AEAKNKKLPVEAEAELAELV-------SKSFTTGYGITP----------DTQLDGAAVRRELLEDQV 57 (479) Q Consensus 1 ~~~~~~~------~~~~~~~~~~~~~~~~~e~~-------~Ks~tag~~~~p----------~~~~~gaalr~esld~~~ 57 (479) -|-|..| +---+.+.|--+.- +.-.+ .||+|++|.-+- ....|+.-|..|+=+--- T Consensus 10 ~~~~~~~~~~~~~~~~~~~~~PN~~~p-ll~li~~g~~~ta~ast~~w~~d~~~~~~~~~ta~a~a~~T~l~ve~~~~f~ 88 (418) T protein:vir:10 10 TTLNPQELNMKSFAGTILRRVPNGSAP-LLAMTSVVGSTTAKASTHGYFSKTMVFASAVVTAEAAADATVLTVENSDGLT 88 (418) T ss_pred cCCChhhhchhhhhhhhhhhcCCcchh-hhhhhhcccccccceeEEEEEEEEEeeeeEEEEEEEecCceEEEEcCcceec Confidence 1111111 11123333331111 11111 133343333111 111111222222222100 Q ss_pred h-hheeccccccch--hhccccchhHHHHhhhhhhccCcccccc--------cc----cccccccccCcceEEEEEEE-E Q lcl|NC_018856. 58 K-MLAFSSNDFTIY--PLINKQQVNSTVAKYAVFNQHGRTGHSR--------FV----REVGVASINDPNIRQKTVQM-K 121 (479) Q Consensus 58 ~-~l~~~~~~f~f~--~~i~k~~~~stv~eY~~~~~~G~~g~~~--------fv----~E~g~~~~~d~~~~r~~~~~-k 121 (479) + +|.|.+..+..+ -.|+ =..-.+....|++-+.+ |+ -||.+.... +.+.+ +.+ + T Consensus 89 ~~~l~~~~~~~Evirv~sVn-------g~~lTV~Rg~~~t~aaaia~n~~~~~Ig~~~eEGsd~~ta--~~~k~-~~vsN 158 (418) T protein:vir:10 89 KGMIFYNEATGENMRLELVN-------GLNLTVKRQTGRISAAIIAANTKLIVIGTAFEEGSQRPTA--RSIQP-VYVPN 158 (418) T ss_pred cccEEEEccCCeEEEEEEEe-------CCEEEEEEecCCeeEEEEecCceEEEeccccccccccCCc--ceecc-eeccc Confidence 0 222332221111 1111 01111333334433332 22 366554443 33322 222 2 Q ss_pred e---eeehhhhhhhHhhh---cchhhHHHHHHHHHHHHHHHHHHHHHhhcccccCCCCCCcccc--hhhhHHHhhcc--C Q lcl|NC_018856. 122 F---LSDTKQQSLAAGLV---NNIADPMTILTEDAIAVIAKSIEWAIFYGDAALSSEADGQAGI--EFDGLHKLIDQ--D 191 (479) Q Consensus 122 ~---l~~~~~vs~~~~lv---n~~~Dp~~~~~~~ai~~~~~~iE~a~f~Gd~~l~~~~~~~~gl--eFDGl~~~I~~--~ 191 (479) | +.+.-++|.-+..+ -+++|+...+ .+.+.-.+..||+++|+|-..... ..+|+ .++||..+|.. . T Consensus 159 vtQIF~~avsvSgTaqAs~~q~Gvsn~~ese-~drk~~~av~iEkalI~G~~~~~~---~~~g~~R~m~GIl~~vr~~~~ 234 (418) T protein:vir:10 159 FTQIFRNAWALTDTARASYAEAGYSNITESR-RDCMDFHATEQETAIFFGQAFMGT---YNGQPLHTTQGIVDAVRQYAP 234 (418) T ss_pred hhhhhhhhhhhhhhhhhccccccCchHHHHH-HHHHHHHHHHHHHHHhcccccCCC---cCCcchhhHHHHHHHHhhhcc Confidence 3 34455566555542 3567887666 455555566899999999755322 22233 68999877752 2 Q ss_pred CCEEEccCC-CCCHHHHhhhhhhh---hhccCceEE-----EecChHHhhhHHHHhhCcceeeeccCCCcceeeeehhhh Q lcl|NC_018856. 192 TNVIDLKGA-RLDEATLNKAAVIV---GKGYGRATD-----AFMPIGVQADFTNNLLDRQRVIQPSTAGGFSTGFSINQF 262 (479) Q Consensus 192 ~NviDarG~-~l~~~~l~~aa~~i---~~~fG~~td-----~~mp~~vka~f~~~~~~~qrv~~~~n~g~~~~G~~I~~~ 262 (479) +||+|+.+. .++.+.|.++...+ +.+-|..++ +++|...|.+++..+ +..|. +......|+.+..+ T Consensus 235 gnVv~a~~~t~~s~d~l~~a~~~af~~g~~~G~~~q~~~f~~~V~~~~k~~I~k~~-~~I~~----~~~e~~~G~vv~~~ 309 (418) T protein:vir:10 235 DNVNAMPNPTAVTYDDVVDATIDAFKWSVNVGDNTQRVMFCDTVGMRTMQDIGRFF-GEVTV----TQRETSYGMVFTEW 309 (418) T ss_pred cceeccCCCCccCHHHHHHHHHHHhhccCCCcccccceeEEEEeChHHHHHhhhhh-hheee----cccceeeeEEEEEE Confidence 699999997 69999998877665 335677766 555999999998776 33333 45566899999999 Q ss_pred cCCCcceecccc-----eecCCCceec-c-------cCCcCCCCCCCceeEe-----ee-------eccCCCCCCCcccc Q lcl|NC_018856. 263 LSTRGAINLHGS-----TIMENDNILL-E-------GRNPEPNAPQAPASVV-----AS-------IVDDKKGGFRDEDI 317 (479) Q Consensus 263 ~s~~G~I~l~~s-----~~m~~~~~L~-e-------~~~~~~~AP~~pa~v~-----at-------~~t~~~G~f~~~d~ 317 (479) ...+|.|.|.-. .-|..+.+|+ + -...+..+|......= ++ .....+|... T Consensus 310 ~~~~G~I~L~~~p~~~~~~lp~g~mlVvD~~~vkL~~L~~R~~~~E~l~k~G~~~~~~~~~~~~~~~~D~~kG~iv---- 385 (418) T protein:vir:10 310 KFFKGRLILKEHPLFSAIGISPGFAVVVDVPAVKLAYMDGRNAKVENYGQGGGENKSGATDYSYGHGVDAQGGSLT---- 385 (418) T ss_pred EcceEEEEeecccccccccCCCceEEEEccccceEEEeccccccchhcccCCCcccccccccccccccccccceEE---- Confidence 999999955444 2233332221 1 0000111222111100 00 0011111111 Q ss_pred cceEEEEEEEcCcCccccccceeeeeecCCceEEEEEEecCCCCcc Q lcl|NC_018856. 318 KTHSYKVVVHSDDAESLPSEAVTAAVAKKDNTVKLEVKLASLYQAQ 363 (479) Q Consensus 318 gty~YkVtavn~~GES~pS~~vt~Tv~~~g~sv~ltIT~~~~~~a~ 363 (479) +. |.+-+.|..+- +-++. -+.++-++.+++- ++ T Consensus 386 ~E--~tLe~~N~~a~--------avitg-l~~~~~~~~~t~p--~~ 418 (418) T protein:vir:10 386 SE--WALELLNPQGC--------AVITG-LQKAKERVYLTAP--AP 418 (418) T ss_pred EE--eeeeeecccce--------EEeec-cceecccccCCCC--CC Confidence 11 22222232221 01111 1222222221111 11 No 68 >protein:vir:81100 Length: 415 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:1891 # MgeName: tp310-1 # Cross-refs: genbank:acc:YP_001429874;genbank:gi:156603927;genbank:GeneID:5525320 Probab=96.02 E-value=0.0011 Score=36.78 Aligned_cols=301 Identities=13% Similarity=0.020 Sum_probs=138.4 Q ss_pred CCccchhhhh-----------hhhcCCccchHHH----HHHHHHhhhcCCCc--ChhhccCccccchhhhhhhhhhheec Q lcl|NC_018856. 1 MTELKKEAEA-----------KNKKLPVEAEAEL----AELVSKSFTTGYGI--TPDTQLDGAAVRRELLEDQVKMLAFS 63 (479) Q Consensus 1 ~~~~~~~~~~-----------~~~~~~~~~~~~~----~e~~~Ks~tag~~~--~p~~~~~gaalr~esld~~~~~l~~~ 63 (479) +...++..+. ........-.... ...+.+.+..+... ...+-.+|+.|..+.+.+.|..+... T Consensus 68 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~~iP~~~~~~ii~~~~~ 147 (415) T protein:vir:81 68 SENNQQSVEVNEARTYRNQANINDLGISIQNTKVTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEV 147 (415) T ss_pred hhhcccccccchhhhHHHHHHHHHHhhhhhhhhhHHHHHHHHHHHHhhhhhhhhccccccccccccchHHHHHHHHHHHh Confidence 0000000000 0000000000001 11122222222221 11223468889999888888544443 Q ss_pred cccccchhhccccchhHHHHhhhhhhccCccccccccccccccc-ccCcceEEEEEEEEeeeehhhhhhhHhhhcchhhH Q lcl|NC_018856. 64 SNDFTIYPLINKQQVNSTVAKYAVFNQHGRTGHSRFVREVGVAS-INDPNIRQKTVQMKFLSDTKQQSLAAGLVNNIADP 142 (479) Q Consensus 64 ~~~f~f~~~i~k~~~~stv~eY~~~~~~G~~g~~~fv~E~g~~~-~~d~~~~r~~~~~k~l~~~~~vs~~~~lvn~~~Dp 142 (479) .. .+.+.+...++.+....|......++ ....+++|++..+ ..++.+......++-++.-..+|.-+ +.++..|. T Consensus 148 ~~--~l~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~v~E~~~~~~~~~~~~~~v~~~~~k~~~~~~iS~el-l~ds~~~l 223 (415) T protein:vir:81 148 EF--NLDKYVTVKRVTNGSGKYPVVRQSEV-AALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREA-IEDAKVNV 223 (415) T ss_pred hh--hhhhheeeeeccCCceeEEEEeecCC-ccceeeccccccCcccccceeeEEeeeeeeEeeehhhHHH-HhhchHHH Confidence 33 45555666666655555655444443 3566899988765 67799999999999999888888764 24556678 Q ss_pred HHHHHHHHHHHHHHHHHHHHhhcccccCCCCCCcccchhhhHHHhhccCCCEEEccCCCCCHHHHhhhhhhhhhccCceE Q lcl|NC_018856. 143 MTILTEDAIAVIAKSIEWAIFYGDAALSSEADGQAGIEFDGLHKLIDQDTNVIDLKGARLDEATLNKAAVIVGKGYGRAT 222 (479) Q Consensus 143 ~~~~~~~ai~~~~~~iE~a~f~Gd~~l~~~~~~~~gleFDGl~~~I~~~~NviDarG~~l~~~~l~~aa~~i~~~fG~~t 222 (479) +....+.-...+++.++.+++.|+-.=.+.. .++... ...+.....+. .+.+.|.++--.+...|.... T Consensus 224 ~~~i~~~l~~~~~~~~~~~il~g~g~g~~~~--------~~~~~~--~~~~~~~~~~~-~~~~~i~~~~~~~~~~~~~~~ 292 (415) T protein:vir:81 224 LQELKLWMARTIAATRNKAIIDVITKGSTGS--------TSSGFE--KEGKKLEVKKA-KSLDDIKDAINLNVKPNYEHN 292 (415) T ss_pred HHHHHHHHHHHHHHHHHHHHhhccccCcccc--------cccccc--ccccccccccc-cchhHHHHHHHhhhhhccCCC Confidence 8888888888999999999999986522211 111111 11233333343 334444444434445555566 Q ss_pred EEecChHHhhhHHHHhhCcceeeeccCCCc----ceeeeehhhhc-CC---Ccce-ecccc----e-ecCCCceeccc-- Q lcl|NC_018856. 223 DAFMPIGVQADFTNNLLDRQRVIQPSTAGG----FSTGFSINQFL-ST---RGAI-NLHGS----T-IMENDNILLEG-- 286 (479) Q Consensus 223 d~~mp~~vka~f~~~~~~~qrv~~~~n~g~----~~~G~~I~~~~-s~---~G~I-~l~~s----~-~m~~~~~L~e~-- 286 (479) -.+|+....+.+..--...-|++...+..+ .-.|++|--.. .+ .|.+ .+.|+ . ++++..+-++. T Consensus 293 ~~v~n~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~Gd~~~~~~~~~~~~~~v~~~~ 372 (415) T protein:vir:81 293 VAIVSQTMFAKLDKMKDKLGNYLIQPDVKEKTQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQYQASWTD 372 (415) T ss_pred EEEEcHHHHHHHHHhhccCCceeeccCcCCCCCceecceeeEEecccccCCCCccEEEEEehhccEEEEeecceEEEEec Confidence 788999888887653333334444333222 22343332110 01 1111 11111 1 11111111110 Q ss_pred ------------CC-cCCCCCCCceeEeeeeccCCCCCCCcccccceEEEEEEEcCcCccccccceeeeeecCCceEEEE Q lcl|NC_018856. 287 ------------RN-PEPNAPQAPASVVASIVDDKKGGFRDEDIKTHSYKVVVHSDDAESLPSEAVTAAVAKKDNTVKLE 353 (479) Q Consensus 287 ------------~~-~~~~AP~~pa~v~at~~t~~~G~f~~~d~gty~YkVtavn~~GES~pS~~vt~Tv~~~g~sv~lt 353 (479) ++ ..+--|.+-....-+++..+.|... |+ T Consensus 373 ~~~~~~~~~~~~r~d~~v~~~~a~~~~~~~~~~~~~~~~~--------------------------------------~~ 414 (415) T protein:vir:81 373 YMHFGECLMIAVRQDCRILDYKSAIVIEYDDSERGEGDLG--------------------------------------LE 414 (415) T ss_pred cccCceEEEEEEEeccEEeccccEEEEEEeccCCCCCccc--------------------------------------cC Confidence 00 0000111111111111111111111 11 Q ss_pred E Q lcl|NC_018856. 354 V 354 (479) Q Consensus 354 I 354 (479) - T Consensus 415 ~ 415 (415) T protein:vir:81 415 A 415 (415) T ss_pred C Confidence 1 No 69 >protein:vir:98339 Length: 415 # NCBI annotation: putative capsid protein # Family: family:all:21 # MgeID: mge:1581 # MgeName: phiPVL(108) # Cross-refs: genbank:acc:YP_918931;genbank:gi:119443693;genbank:GeneID:4594501 Probab=96.02 E-value=0.0011 Score=36.78 Aligned_cols=301 Identities=13% Similarity=0.020 Sum_probs=138.4 Q ss_pred CCccchhhhh-----------hhhcCCccchHHH----HHHHHHhhhcCCCc--ChhhccCccccchhhhhhhhhhheec Q lcl|NC_018856. 1 MTELKKEAEA-----------KNKKLPVEAEAEL----AELVSKSFTTGYGI--TPDTQLDGAAVRRELLEDQVKMLAFS 63 (479) Q Consensus 1 ~~~~~~~~~~-----------~~~~~~~~~~~~~----~e~~~Ks~tag~~~--~p~~~~~gaalr~esld~~~~~l~~~ 63 (479) +...++..+. ........-.... ...+.+.+..+... ...+-.+|+.|..+.+.+.|..+... T Consensus 68 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~~iP~~~~~~ii~~~~~ 147 (415) T protein:vir:98 68 SENNQQSVEVNEARTYRNQANINDLGISIQNTKVTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEV 147 (415) T ss_pred hhhcccccccchhhhHHHHHHHHHHhhhhhhhhhHHHHHHHHHHHHhhhhhhhhccccccccccccchHHHHHHHHHHHh Confidence 0000000000 0000000000001 11122222222221 11223468889999888888544443 Q ss_pred cccccchhhccccchhHHHHhhhhhhccCccccccccccccccc-ccCcceEEEEEEEEeeeehhhhhhhHhhhcchhhH Q lcl|NC_018856. 64 SNDFTIYPLINKQQVNSTVAKYAVFNQHGRTGHSRFVREVGVAS-INDPNIRQKTVQMKFLSDTKQQSLAAGLVNNIADP 142 (479) Q Consensus 64 ~~~f~f~~~i~k~~~~stv~eY~~~~~~G~~g~~~fv~E~g~~~-~~d~~~~r~~~~~k~l~~~~~vs~~~~lvn~~~Dp 142 (479) .. .+.+.+...++.+....|......++ ....+++|++..+ ..++.+......++-++.-..+|.-+ +.++..|. T Consensus 148 ~~--~l~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~v~E~~~~~~~~~~~~~~v~~~~~k~~~~~~iS~el-l~ds~~~l 223 (415) T protein:vir:98 148 EF--NLDKYVTVKRVTNGSGKYPVVRQSEV-AALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREA-IEDAKVNV 223 (415) T ss_pred hh--hhhhheeeeeccCCceeEEEEeecCC-ccceeeccccccCcccccceeeEEeeeeeeEeeehhhHHH-HhhchHHH Confidence 33 45555666666655555655444443 3566899988765 67799999999999999888888764 24556678 Q ss_pred HHHHHHHHHHHHHHHHHHHHhhcccccCCCCCCcccchhhhHHHhhccCCCEEEccCCCCCHHHHhhhhhhhhhccCceE Q lcl|NC_018856. 143 MTILTEDAIAVIAKSIEWAIFYGDAALSSEADGQAGIEFDGLHKLIDQDTNVIDLKGARLDEATLNKAAVIVGKGYGRAT 222 (479) Q Consensus 143 ~~~~~~~ai~~~~~~iE~a~f~Gd~~l~~~~~~~~gleFDGl~~~I~~~~NviDarG~~l~~~~l~~aa~~i~~~fG~~t 222 (479) +....+.-...+++.++.+++.|+-.=.+.. .++... ...+.....+. .+.+.|.++--.+...|.... T Consensus 224 ~~~i~~~l~~~~~~~~~~~il~g~g~g~~~~--------~~~~~~--~~~~~~~~~~~-~~~~~i~~~~~~~~~~~~~~~ 292 (415) T protein:vir:98 224 LQELKLWMARTIAATRNKAIIDVITKGSTGS--------TSSGFE--KEGKKLEVKKA-KSLDDIKDAINLNVKPNYEHN 292 (415) T ss_pred HHHHHHHHHHHHHHHHHHHHhhccccCcccc--------cccccc--ccccccccccc-cchhHHHHHHHhhhhhccCCC Confidence 8888888888999999999999986522211 111111 11233333343 334444444434445555566 Q ss_pred EEecChHHhhhHHHHhhCcceeeeccCCCc----ceeeeehhhhc-CC---Ccce-ecccc----e-ecCCCceeccc-- Q lcl|NC_018856. 223 DAFMPIGVQADFTNNLLDRQRVIQPSTAGG----FSTGFSINQFL-ST---RGAI-NLHGS----T-IMENDNILLEG-- 286 (479) Q Consensus 223 d~~mp~~vka~f~~~~~~~qrv~~~~n~g~----~~~G~~I~~~~-s~---~G~I-~l~~s----~-~m~~~~~L~e~-- 286 (479) -.+|+....+.+..--...-|++...+..+ .-.|++|--.. .+ .|.+ .+.|+ . ++++..+-++. T Consensus 293 ~~v~n~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~Gd~~~~~~~~~~~~~~v~~~~ 372 (415) T protein:vir:98 293 VAIVSQTMFAKLDKMKDKLGNYLIQPDVKEKTQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQYQASWTD 372 (415) T ss_pred EEEEcHHHHHHHHHhhccCCceeeccCcCCCCCceecceeeEEecccccCCCCccEEEEEehhccEEEEeecceEEEEec Confidence 788999888887653333334444333222 22343332110 01 1111 11111 1 11111111110 Q ss_pred ------------CC-cCCCCCCCceeEeeeeccCCCCCCCcccccceEEEEEEEcCcCccccccceeeeeecCCceEEEE Q lcl|NC_018856. 287 ------------RN-PEPNAPQAPASVVASIVDDKKGGFRDEDIKTHSYKVVVHSDDAESLPSEAVTAAVAKKDNTVKLE 353 (479) Q Consensus 287 ------------~~-~~~~AP~~pa~v~at~~t~~~G~f~~~d~gty~YkVtavn~~GES~pS~~vt~Tv~~~g~sv~lt 353 (479) ++ ..+--|.+-....-+++..+.|... |+ T Consensus 373 ~~~~~~~~~~~~r~d~~v~~~~a~~~~~~~~~~~~~~~~~--------------------------------------~~ 414 (415) T protein:vir:98 373 YMHFGECLMIAVRQDCRILDYKSAIVIEYDDSERGEGDLG--------------------------------------LE 414 (415) T ss_pred cccCceEEEEEEEeccEEeccccEEEEEEeccCCCCCccc--------------------------------------cC Confidence 00 0000111111111111111111111 11 Q ss_pred E Q lcl|NC_018856. 354 V 354 (479) Q Consensus 354 I 354 (479) - T Consensus 415 ~ 415 (415) T protein:vir:98 415 A 415 (415) T ss_pred C Confidence 1 No 70 >protein:vir:79987 Length: 415 # NCBI annotation: head protein # Family: family:all:21 # MgeID: mge:1875 # MgeName: tp310-3 # Cross-refs: genbank:acc:YP_001430002;genbank:gi:156604057;genbank:GeneID:5525447 Probab=96.02 E-value=0.0011 Score=36.78 Aligned_cols=301 Identities=13% Similarity=0.020 Sum_probs=138.4 Q ss_pred CCccchhhhh-----------hhhcCCccchHHH----HHHHHHhhhcCCCc--ChhhccCccccchhhhhhhhhhheec Q lcl|NC_018856. 1 MTELKKEAEA-----------KNKKLPVEAEAEL----AELVSKSFTTGYGI--TPDTQLDGAAVRRELLEDQVKMLAFS 63 (479) Q Consensus 1 ~~~~~~~~~~-----------~~~~~~~~~~~~~----~e~~~Ks~tag~~~--~p~~~~~gaalr~esld~~~~~l~~~ 63 (479) +...++..+. ........-.... ...+.+.+..+... ...+-.+|+.|..+.+.+.|..+... T Consensus 68 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~~iP~~~~~~ii~~~~~ 147 (415) T protein:vir:79 68 SENNQQSVEVNEARTYRNQANINDLGISIQNTKVTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEV 147 (415) T ss_pred hhhcccccccchhhhHHHHHHHHHHhhhhhhhhhHHHHHHHHHHHHhhhhhhhhccccccccccccchHHHHHHHHHHHh Confidence 0000000000 0000000000001 11122222222221 11223468889999888888544443 Q ss_pred cccccchhhccccchhHHHHhhhhhhccCccccccccccccccc-ccCcceEEEEEEEEeeeehhhhhhhHhhhcchhhH Q lcl|NC_018856. 64 SNDFTIYPLINKQQVNSTVAKYAVFNQHGRTGHSRFVREVGVAS-INDPNIRQKTVQMKFLSDTKQQSLAAGLVNNIADP 142 (479) Q Consensus 64 ~~~f~f~~~i~k~~~~stv~eY~~~~~~G~~g~~~fv~E~g~~~-~~d~~~~r~~~~~k~l~~~~~vs~~~~lvn~~~Dp 142 (479) .. .+.+.+...++.+....|......++ ....+++|++..+ ..++.+......++-++.-..+|.-+ +.++..|. T Consensus 148 ~~--~l~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~v~E~~~~~~~~~~~~~~v~~~~~k~~~~~~iS~el-l~ds~~~l 223 (415) T protein:vir:79 148 EF--NLDKYVTVKRVTNGSGKYPVVRQSEV-AALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREA-IEDAKVNV 223 (415) T ss_pred hh--hhhhheeeeeccCCceeEEEEeecCC-ccceeeccccccCcccccceeeEEeeeeeeEeeehhhHHH-HhhchHHH Confidence 33 45555666666655555655444443 3566899988765 67799999999999999888888764 24556678 Q ss_pred HHHHHHHHHHHHHHHHHHHHhhcccccCCCCCCcccchhhhHHHhhccCCCEEEccCCCCCHHHHhhhhhhhhhccCceE Q lcl|NC_018856. 143 MTILTEDAIAVIAKSIEWAIFYGDAALSSEADGQAGIEFDGLHKLIDQDTNVIDLKGARLDEATLNKAAVIVGKGYGRAT 222 (479) Q Consensus 143 ~~~~~~~ai~~~~~~iE~a~f~Gd~~l~~~~~~~~gleFDGl~~~I~~~~NviDarG~~l~~~~l~~aa~~i~~~fG~~t 222 (479) +....+.-...+++.++.+++.|+-.=.+.. .++... ...+.....+. .+.+.|.++--.+...|.... T Consensus 224 ~~~i~~~l~~~~~~~~~~~il~g~g~g~~~~--------~~~~~~--~~~~~~~~~~~-~~~~~i~~~~~~~~~~~~~~~ 292 (415) T protein:vir:79 224 LQELKLWMARTIAATRNKAIIDVITKGSTGS--------TSSGFE--KEGKKLEVKKA-KSLDDIKDAINLNVKPNYEHN 292 (415) T ss_pred HHHHHHHHHHHHHHHHHHHHhhccccCcccc--------cccccc--ccccccccccc-cchhHHHHHHHhhhhhccCCC Confidence 8888888888999999999999986522211 111111 11233333343 334444444434445555566 Q ss_pred EEecChHHhhhHHHHhhCcceeeeccCCCc----ceeeeehhhhc-CC---Ccce-ecccc----e-ecCCCceeccc-- Q lcl|NC_018856. 223 DAFMPIGVQADFTNNLLDRQRVIQPSTAGG----FSTGFSINQFL-ST---RGAI-NLHGS----T-IMENDNILLEG-- 286 (479) Q Consensus 223 d~~mp~~vka~f~~~~~~~qrv~~~~n~g~----~~~G~~I~~~~-s~---~G~I-~l~~s----~-~m~~~~~L~e~-- 286 (479) -.+|+....+.+..--...-|++...+..+ .-.|++|--.. .+ .|.+ .+.|+ . ++++..+-++. T Consensus 293 ~~v~n~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~Gd~~~~~~~~~~~~~~v~~~~ 372 (415) T protein:vir:79 293 VAIVSQTMFAKLDKMKDKLGNYLIQPDVKEKTQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKDAIVLFDRSQYQASWTD 372 (415) T ss_pred EEEEcHHHHHHHHHhhccCCceeeccCcCCCCCceecceeeEEecccccCCCCccEEEEEehhccEEEEeecceEEEEec Confidence 788999888887653333334444333222 22343332110 01 1111 11111 1 11111111110 Q ss_pred ------------CC-cCCCCCCCceeEeeeeccCCCCCCCcccccceEEEEEEEcCcCccccccceeeeeecCCceEEEE Q lcl|NC_018856. 287 ------------RN-PEPNAPQAPASVVASIVDDKKGGFRDEDIKTHSYKVVVHSDDAESLPSEAVTAAVAKKDNTVKLE 353 (479) Q Consensus 287 ------------~~-~~~~AP~~pa~v~at~~t~~~G~f~~~d~gty~YkVtavn~~GES~pS~~vt~Tv~~~g~sv~lt 353 (479) ++ ..+--|.+-....-+++..+.|... |+ T Consensus 373 ~~~~~~~~~~~~r~d~~v~~~~a~~~~~~~~~~~~~~~~~--------------------------------------~~ 414 (415) T protein:vir:79 373 YMHFGECLMIAVRQDCRILDYKSAIVIEYDDSERGEGDLG--------------------------------------LE 414 (415) T ss_pred cccCceEEEEEEEeccEEeccccEEEEEEeccCCCCCccc--------------------------------------cC Confidence 00 0000111111111111111111111 11 Q ss_pred E Q lcl|NC_018856. 354 V 354 (479) Q Consensus 354 I 354 (479) - T Consensus 415 ~ 415 (415) T protein:vir:79 415 A 415 (415) T ss_pred C Confidence 1 No 71 >protein:vir:98525 Length: 331 # NCBI annotation: hypothetical protein predicted by GeneMark # Family: family:all:1903 # MgeID: mge:1592 # MgeName: BMP-1 # Cross-refs: genbank:acc:NP_996579;genbank:gi:45569510;genbank:GeneID:2767853 Probab=96.01 E-value=3.4e-05 Score=45.05 Aligned_cols=304 Identities=17% Similarity=0.201 Sum_probs=133.9 Q ss_pred CCccchhhhhhhhcCCccchHHHHHHHHHhhhcCCCcChhhccCccccchhhhhhhhhhheeccccccchhhccccchh- Q lcl|NC_018856. 1 MTELKKEAEAKNKKLPVEAEAELAELVSKSFTTGYGITPDTQLDGAAVRRELLEDQVKMLAFSSNDFTIYPLINKQQVN- 79 (479) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~e~~~Ks~tag~~~~p~~~~~gaalr~esld~~~~~l~~~~~~f~f~~~i~k~~~~- 79 (479) ||..+.. .- .|.| ++|=+ +|+ +.+..+.+ ..|+..+. +|..++=...+ T Consensus 1 m~~~~~~---------~~---TL~e-~Ak~~------~~~-----~~l~~~II----E~l~~tn~---IL~~lpf~e~N~ 49 (331) T protein:vir:98 1 MPTLSTT---------NP---TLAD-VAARM------TPD-----GKIDPQIV----EMLNETNE---ILDDMTVIEANG 49 (331) T ss_pred CCccccC---------cc---cHHH-HHHhc------Ccc-----hhHHHHHH----HHHhcCch---HHhhceeeeccC Confidence 4443211 00 1111 00100 010 01111111 11222221 23333322233 Q ss_pred HHHHhhhhhhccCcccccccccccccccccCcceEEEEEEEEeeeehhhhhhhHhhh-cchhhHHHHHHHHHHHHHHHHH Q lcl|NC_018856. 80 STVAKYAVFNQHGRTGHSRFVREVGVASINDPNIRQKTVQMKFLSDTKQQSLAAGLV-NNIADPMTILTEDAIAVIAKSI 158 (479) Q Consensus 80 stv~eY~~~~~~G~~g~~~fv~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lv-n~~~Dp~~~~~~~ai~~~~~~i 158 (479) .|=|.|.+ +-+.-...|-.=...-+-+.++..|++..++.|..-..|.+...-. .+..+-.++|.+.-|..+.+.+ T Consensus 50 ~t~~~~~v---rt~LP~~~fR~lN~g~~~s~~tt~q~t~~l~ilgg~~eVDk~la~~~Gn~~~~ra~e~~~~ik~m~~~~ 126 (331) T protein:vir:98 50 FTEHKTTV---RSGLPTGTWRKLNYGVQPEKSRTVQVKDSMGMLETYAEVDKALADLNGNSAAWRLSEDRAFIEGMNQTQ 126 (331) T ss_pred CccceeeE---EeccCCchhhccCCccCcccceeEEEEEEEEEeccceeechHHHhhcCCHHHHHHHHHHHHHHHHHHHH Confidence 22244444 2233334442222334567788999999999999999999876555 4456777999999999999999 Q ss_pred HHHHhhcccccCCCCCCcccchhhhHHHhhcc-----CCCEEEccCCCCCHHHHhhhhhhhhhccCceEEEecChHHhhh Q lcl|NC_018856. 159 EWAIFYGDAALSSEADGQAGIEFDGLHKLIDQ-----DTNVIDLKGARLDEATLNKAAVIVGKGYGRATDAFMPIGVQAD 233 (479) Q Consensus 159 E~a~f~Gd~~l~~~~~~~~gleFDGl~~~I~~-----~~NviDarG~~l~~~~l~~aa~~i~~~fG~~td~~mp~~vka~ 233 (479) +..+||||++.+| -+||||.+.+.. ++|+||+.|.--..-.|+ ++.=+=....=+| |-+-++- T Consensus 127 ~~~~iyGD~a~~p-------~~F~GL~kR~~~~~a~~~~q~IdaGgtG~~~TSI~----~v~~~~~~~~giy-PkG~~~G 194 (331) T protein:vir:98 127 ATTLFYGDSSIDA-------EKFMGLTPRFNSLSAENGQNIIDAGGTGSDNASIW----LTVWGPNTLHTIY-PKGSQAG 194 (331) T ss_pred HHHHhcCCcccCh-------hhhccchhhccccccccccceeecCCCCCCceEEE----EEEEcCCeeEEec-ccccccC Confidence 9999999999977 489999998842 469999999543332222 1111112233355 8888888 Q ss_pred HHHHhhCcceeeeccCCCcc---------eeeeehhhhcCCCcceeccccee-------cCCCceecccCCcCCC-CCCC Q lcl|NC_018856. 234 FTNNLLDRQRVIQPSTAGGF---------STGFSINQFLSTRGAINLHGSTI-------MENDNILLEGRNPEPN-APQA 296 (479) Q Consensus 234 f~~~~~~~qrv~~~~n~g~~---------~~G~~I~~~~s~~G~I~l~~s~~-------m~~~~~L~e~~~~~~~-AP~~ 296 (479) ++-..++.+..... +.+.. ..|+.|.++.++-.-.++.-|-. -+....++++...-|+ .++. T Consensus 195 l~~~d~g~~~~~~~-~G~~y~~y~~~~~w~~Gl~i~d~r~v~ri~NIdvs~l~~~~~~~~dl~~lm~~a~~~ip~~~~~~ 273 (331) T protein:vir:98 195 LQSRDLGEDTLIDA-AGGRYQGYRTHYKWDIGLTLRDWRYVVRIANVDVSELTKNASAGADLIDLMTQAVELIPNVGMGR 273 (331) T ss_pred ceEeecCceeeecC-CCCeeeEEEEEEEeeeeeEEcCcccEEEEeccchhccCCCcchhhhHHHHHHHHHHHhcccCCCC Confidence 87777777777632 44332 45566666655543333321111 0000112222222221 1111 Q ss_pred ceeE-eeeeccCCCCCCCc--ccccceEEEEEEEcCcCccccccceeeeeecCCceEEEEEEecCCCCcccceEE Q lcl|NC_018856. 297 PASV-VASIVDDKKGGFRD--EDIKTHSYKVVVHSDDAESLPSEAVTAAVAKKDNTVKLEVKLASLYQAQPQFIS 368 (479) Q Consensus 297 pa~v-~at~~t~~~G~f~~--~d~gty~YkVtavn~~GES~pS~~vt~Tv~~~g~sv~ltIT~~~~~~a~~~~y~ 368 (479) +..= .-++-+ .... .+.+. -++.+.-...|+- .|.- + .+-+..+ -....++.-+. T Consensus 274 ~~~y~n~~v~~----~L~~q~~~~~~-~~~~~~~~~~g~~-------~t~~--~-gipir~~--dai~~tE~~Vv 331 (331) T protein:vir:98 274 PAFYMPRKIRS----FLRRQITNKVA-ASTLTMEEIAGKK-------VVAF--D-GIPCRRT--DALLLTEARVV 331 (331) T ss_pred eEEEechHHHH----HHHHHHhhccc-eeeeeeeecCCcc-------eeEE--C-CeeEEEe--eeeecCccccC Confidence 1000 000000 0000 00000 0000000111110 0000 0 0000000 00000111111 No 72 >protein:vir:107388 Length: 331 # NCBI annotation: Bbp17 # Family: family:all:1903 # MgeID: mge:1537 # MgeName: BPP-1 # Cross-refs: genbank:acc:NP_958686;genbank:gi:41179378;genbank:GeneID:2717182 Probab=96.01 E-value=3.4e-05 Score=45.05 Aligned_cols=304 Identities=17% Similarity=0.201 Sum_probs=133.9 Q ss_pred CCccchhhhhhhhcCCccchHHHHHHHHHhhhcCCCcChhhccCccccchhhhhhhhhhheeccccccchhhccccchh- Q lcl|NC_018856. 1 MTELKKEAEAKNKKLPVEAEAELAELVSKSFTTGYGITPDTQLDGAAVRRELLEDQVKMLAFSSNDFTIYPLINKQQVN- 79 (479) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~e~~~Ks~tag~~~~p~~~~~gaalr~esld~~~~~l~~~~~~f~f~~~i~k~~~~- 79 (479) ||..+.. .- .|.| ++|=+ +|+ +.+..+.+ ..|+..+. +|..++=...+ T Consensus 1 m~~~~~~---------~~---TL~e-~Ak~~------~~~-----~~l~~~II----E~l~~tn~---IL~~lpf~e~N~ 49 (331) T protein:vir:10 1 MPTLSTT---------NP---TLAD-VAARM------TPD-----GKIDPQIV----EMLNETNE---ILDDMTVIEANG 49 (331) T ss_pred CCccccC---------cc---cHHH-HHHhc------Ccc-----hhHHHHHH----HHHhcCch---HHhhceeeeccC Confidence 4443211 00 1111 00100 010 01111111 11222221 23333322233 Q ss_pred HHHHhhhhhhccCcccccccccccccccccCcceEEEEEEEEeeeehhhhhhhHhhh-cchhhHHHHHHHHHHHHHHHHH Q lcl|NC_018856. 80 STVAKYAVFNQHGRTGHSRFVREVGVASINDPNIRQKTVQMKFLSDTKQQSLAAGLV-NNIADPMTILTEDAIAVIAKSI 158 (479) Q Consensus 80 stv~eY~~~~~~G~~g~~~fv~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lv-n~~~Dp~~~~~~~ai~~~~~~i 158 (479) .|=|.|.+ +-+.-...|-.=...-+-+.++..|++..++.|..-..|.+...-. .+..+-.++|.+.-|..+.+.+ T Consensus 50 ~t~~~~~v---rt~LP~~~fR~lN~g~~~s~~tt~q~t~~l~ilgg~~eVDk~la~~~Gn~~~~ra~e~~~~ik~m~~~~ 126 (331) T protein:vir:10 50 FTEHKTTV---RSGLPTGTWRKLNYGVQPEKSRTVQVKDSMGMLETYAEVDKALADLNGNSAAWRLSEDRAFIEGMNQTQ 126 (331) T ss_pred CccceeeE---EeccCCchhhccCCccCcccceeEEEEEEEEEeccceeechHHHhhcCCHHHHHHHHHHHHHHHHHHHH Confidence 22244444 2233334442222334567788999999999999999999876555 4456777999999999999999 Q ss_pred HHHHhhcccccCCCCCCcccchhhhHHHhhcc-----CCCEEEccCCCCCHHHHhhhhhhhhhccCceEEEecChHHhhh Q lcl|NC_018856. 159 EWAIFYGDAALSSEADGQAGIEFDGLHKLIDQ-----DTNVIDLKGARLDEATLNKAAVIVGKGYGRATDAFMPIGVQAD 233 (479) Q Consensus 159 E~a~f~Gd~~l~~~~~~~~gleFDGl~~~I~~-----~~NviDarG~~l~~~~l~~aa~~i~~~fG~~td~~mp~~vka~ 233 (479) +..+||||++.+| -+||||.+.+.. ++|+||+.|.--..-.|+ ++.=+=....=+| |-+-++- T Consensus 127 ~~~~iyGD~a~~p-------~~F~GL~kR~~~~~a~~~~q~IdaGgtG~~~TSI~----~v~~~~~~~~giy-PkG~~~G 194 (331) T protein:vir:10 127 ATTLFYGDSSIDA-------EKFMGLTPRFNSLSAENGQNIIDAGGTGSDNASIW----LTVWGPNTLHTIY-PKGSQAG 194 (331) T ss_pred HHHHhcCCcccCh-------hhhccchhhccccccccccceeecCCCCCCceEEE----EEEEcCCeeEEec-ccccccC Confidence 9999999999977 489999998842 469999999543332222 1111112233355 8888888 Q ss_pred HHHHhhCcceeeeccCCCcc---------eeeeehhhhcCCCcceeccccee-------cCCCceecccCCcCCC-CCCC Q lcl|NC_018856. 234 FTNNLLDRQRVIQPSTAGGF---------STGFSINQFLSTRGAINLHGSTI-------MENDNILLEGRNPEPN-APQA 296 (479) Q Consensus 234 f~~~~~~~qrv~~~~n~g~~---------~~G~~I~~~~s~~G~I~l~~s~~-------m~~~~~L~e~~~~~~~-AP~~ 296 (479) ++-..++.+..... +.+.. ..|+.|.++.++-.-.++.-|-. -+....++++...-|+ .++. T Consensus 195 l~~~d~g~~~~~~~-~G~~y~~y~~~~~w~~Gl~i~d~r~v~ri~NIdvs~l~~~~~~~~dl~~lm~~a~~~ip~~~~~~ 273 (331) T protein:vir:10 195 LQSRDLGEDTLIDA-AGGRYQGYRTHYKWDIGLTLRDWRYVVRIANVDVSELTKNASAGADLIDLMTQAVELIPNVGMGR 273 (331) T ss_pred ceEeecCceeeecC-CCCeeeEEEEEEEeeeeeEEcCcccEEEEeccchhccCCCcchhhhHHHHHHHHHHHhcccCCCC Confidence 87777777777632 44332 45566666655543333321111 0000112222222221 1111 Q ss_pred ceeE-eeeeccCCCCCCCc--ccccceEEEEEEEcCcCccccccceeeeeecCCceEEEEEEecCCCCcccceEE Q lcl|NC_018856. 297 PASV-VASIVDDKKGGFRD--EDIKTHSYKVVVHSDDAESLPSEAVTAAVAKKDNTVKLEVKLASLYQAQPQFIS 368 (479) Q Consensus 297 pa~v-~at~~t~~~G~f~~--~d~gty~YkVtavn~~GES~pS~~vt~Tv~~~g~sv~ltIT~~~~~~a~~~~y~ 368 (479) +..= .-++-+ .... .+.+. -++.+.-...|+- .|.- + .+-+..+ -....++.-+. T Consensus 274 ~~~y~n~~v~~----~L~~q~~~~~~-~~~~~~~~~~g~~-------~t~~--~-gipir~~--dai~~tE~~Vv 331 (331) T protein:vir:10 274 PAFYMPRKIRS----FLRRQITNKVA-ASTLTMEEIAGKK-------VVAF--D-GIPCRRT--DALLLTEARVV 331 (331) T ss_pred eEEEechHHHH----HHHHHHhhccc-eeeeeeeecCCcc-------eeEE--C-CeeEEEe--eeeecCccccC Confidence 1000 000000 0000 00000 0000000111110 0000 0 0000000 00000111111 No 73 >protein:vir:107826 Length: 331 # NCBI annotation: hypothetical protein predicted by GeneMark # Family: family:all:1903 # MgeID: mge:1673 # MgeName: BIP-1 # Cross-refs: genbank:acc:NP_996627;genbank:gi:45580761;genbank:GeneID:2767902 Probab=96.01 E-value=3.4e-05 Score=45.05 Aligned_cols=304 Identities=17% Similarity=0.201 Sum_probs=133.9 Q ss_pred CCccchhhhhhhhcCCccchHHHHHHHHHhhhcCCCcChhhccCccccchhhhhhhhhhheeccccccchhhccccchh- Q lcl|NC_018856. 1 MTELKKEAEAKNKKLPVEAEAELAELVSKSFTTGYGITPDTQLDGAAVRRELLEDQVKMLAFSSNDFTIYPLINKQQVN- 79 (479) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~e~~~Ks~tag~~~~p~~~~~gaalr~esld~~~~~l~~~~~~f~f~~~i~k~~~~- 79 (479) ||..+.. .- .|.| ++|=+ +|+ +.+..+.+ ..|+..+. +|..++=...+ T Consensus 1 m~~~~~~---------~~---TL~e-~Ak~~------~~~-----~~l~~~II----E~l~~tn~---IL~~lpf~e~N~ 49 (331) T protein:vir:10 1 MPTLSTT---------NP---TLAD-VAARM------TPD-----GKIDPQIV----EMLNETNE---ILDDMTVIEANG 49 (331) T ss_pred CCccccC---------cc---cHHH-HHHhc------Ccc-----hhHHHHHH----HHHhcCch---HHhhceeeeccC Confidence 4443211 00 1111 00100 010 01111111 11222221 23333322233 Q ss_pred HHHHhhhhhhccCcccccccccccccccccCcceEEEEEEEEeeeehhhhhhhHhhh-cchhhHHHHHHHHHHHHHHHHH Q lcl|NC_018856. 80 STVAKYAVFNQHGRTGHSRFVREVGVASINDPNIRQKTVQMKFLSDTKQQSLAAGLV-NNIADPMTILTEDAIAVIAKSI 158 (479) Q Consensus 80 stv~eY~~~~~~G~~g~~~fv~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lv-n~~~Dp~~~~~~~ai~~~~~~i 158 (479) .|=|.|.+ +-+.-...|-.=...-+-+.++..|++..++.|..-..|.+...-. .+..+-.++|.+.-|..+.+.+ T Consensus 50 ~t~~~~~v---rt~LP~~~fR~lN~g~~~s~~tt~q~t~~l~ilgg~~eVDk~la~~~Gn~~~~ra~e~~~~ik~m~~~~ 126 (331) T protein:vir:10 50 FTEHKTTV---RSGLPTGTWRKLNYGVQPEKSRTVQVKDSMGMLETYAEVDKALADLNGNSAAWRLSEDRAFIEGMNQTQ 126 (331) T ss_pred CccceeeE---EeccCCchhhccCCccCcccceeEEEEEEEEEeccceeechHHHhhcCCHHHHHHHHHHHHHHHHHHHH Confidence 22244444 2233334442222334567788999999999999999999876555 4456777999999999999999 Q ss_pred HHHHhhcccccCCCCCCcccchhhhHHHhhcc-----CCCEEEccCCCCCHHHHhhhhhhhhhccCceEEEecChHHhhh Q lcl|NC_018856. 159 EWAIFYGDAALSSEADGQAGIEFDGLHKLIDQ-----DTNVIDLKGARLDEATLNKAAVIVGKGYGRATDAFMPIGVQAD 233 (479) Q Consensus 159 E~a~f~Gd~~l~~~~~~~~gleFDGl~~~I~~-----~~NviDarG~~l~~~~l~~aa~~i~~~fG~~td~~mp~~vka~ 233 (479) +..+||||++.+| -+||||.+.+.. ++|+||+.|.--..-.|+ ++.=+=....=+| |-+-++- T Consensus 127 ~~~~iyGD~a~~p-------~~F~GL~kR~~~~~a~~~~q~IdaGgtG~~~TSI~----~v~~~~~~~~giy-PkG~~~G 194 (331) T protein:vir:10 127 ATTLFYGDSSIDA-------EKFMGLTPRFNSLSAENGQNIIDAGGTGSDNASIW----LTVWGPNTLHTIY-PKGSQAG 194 (331) T ss_pred HHHHhcCCcccCh-------hhhccchhhccccccccccceeecCCCCCCceEEE----EEEEcCCeeEEec-ccccccC Confidence 9999999999977 489999998842 469999999543332222 1111112233355 8888888 Q ss_pred HHHHhhCcceeeeccCCCcc---------eeeeehhhhcCCCcceeccccee-------cCCCceecccCCcCCC-CCCC Q lcl|NC_018856. 234 FTNNLLDRQRVIQPSTAGGF---------STGFSINQFLSTRGAINLHGSTI-------MENDNILLEGRNPEPN-APQA 296 (479) Q Consensus 234 f~~~~~~~qrv~~~~n~g~~---------~~G~~I~~~~s~~G~I~l~~s~~-------m~~~~~L~e~~~~~~~-AP~~ 296 (479) ++-..++.+..... +.+.. ..|+.|.++.++-.-.++.-|-. -+....++++...-|+ .++. T Consensus 195 l~~~d~g~~~~~~~-~G~~y~~y~~~~~w~~Gl~i~d~r~v~ri~NIdvs~l~~~~~~~~dl~~lm~~a~~~ip~~~~~~ 273 (331) T protein:vir:10 195 LQSRDLGEDTLIDA-AGGRYQGYRTHYKWDIGLTLRDWRYVVRIANVDVSELTKNASAGADLIDLMTQAVELIPNVGMGR 273 (331) T ss_pred ceEeecCceeeecC-CCCeeeEEEEEEEeeeeeEEcCcccEEEEeccchhccCCCcchhhhHHHHHHHHHHHhcccCCCC Confidence 87777777777632 44332 45566666655543333321111 0000112222222221 1111 Q ss_pred ceeE-eeeeccCCCCCCCc--ccccceEEEEEEEcCcCccccccceeeeeecCCceEEEEEEecCCCCcccceEE Q lcl|NC_018856. 297 PASV-VASIVDDKKGGFRD--EDIKTHSYKVVVHSDDAESLPSEAVTAAVAKKDNTVKLEVKLASLYQAQPQFIS 368 (479) Q Consensus 297 pa~v-~at~~t~~~G~f~~--~d~gty~YkVtavn~~GES~pS~~vt~Tv~~~g~sv~ltIT~~~~~~a~~~~y~ 368 (479) +..= .-++-+ .... .+.+. -++.+.-...|+- .|.- + .+-+..+ -....++.-+. T Consensus 274 ~~~y~n~~v~~----~L~~q~~~~~~-~~~~~~~~~~g~~-------~t~~--~-gipir~~--dai~~tE~~Vv 331 (331) T protein:vir:10 274 PAFYMPRKIRS----FLRRQITNKVA-ASTLTMEEIAGKK-------VVAF--D-GIPCRRT--DALLLTEARVV 331 (331) T ss_pred eEEEechHHHH----HHHHHHhhccc-eeeeeeeecCCcc-------eeEE--C-CeeEEEe--eeeecCccccC Confidence 1000 000000 0000 00000 0000000111110 0000 0 0000000 00000111111 No 74 >protein:vir:94673 Length: 419 # NCBI annotation: major capsid protein # Family: family:all:585 # MgeID: mge:1527 # MgeName: mu1/6 # Cross-refs: genbank:acc:YP_579208;genbank:gi:93007444;genbank:GeneID:5076792 Probab=95.96 E-value=0.0011 Score=36.79 Aligned_cols=312 Identities=9% Similarity=0.012 Sum_probs=131.7 Q ss_pred CCccchhhhhhhhcC----CccchHHHHHH------------H-HHhhhcCCCcChhhccCccccchhhhhhhhhhheec Q lcl|NC_018856. 1 MTELKKEAEAKNKKL----PVEAEAELAEL------------V-SKSFTTGYGITPDTQLDGAAVRRELLEDQVKMLAFS 63 (479) Q Consensus 1 ~~~~~~~~~~~~~~~----~~~~~~~~~e~------------~-~Ks~tag~~~~p~~~~~gaalr~esld~~~~~l~~~ 63 (479) ............... ........... + ..+.+.-........+++..+..+.+.+.+..+... T Consensus 71 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~i~~~~~~ 150 (419) T protein:vir:94 71 TPLTPAEAGTFRSLAQRFADSDGLREYRARDKRGQFQVEMRDIDPNRLLSRDAPAGTITNPNVPHLPQLVPGIVPTTPDL 150 (419) T ss_pred ccccccccccccchhhhhhhHHHHHHHHHhhhhhhhhHHHHHHHHHHhhccccccccccCCcccccchhhhHHHHHHHhh Confidence 000000000000000 00000000000 0 000001111111222344555555555555433322 Q ss_pred cccccchhhccccchhHHHHhhhhhhc-----cCcccccccccccccccccCcceEEEEEEEEeeeehhhhhhhHhhhcc Q lcl|NC_018856. 64 SNDFTIYPLINKQQVNSTVAKYAVFNQ-----HGRTGHSRFVREVGVASINDPNIRQKTVQMKFLSDTKQQSLAAGLVNN 138 (479) Q Consensus 64 ~~~f~f~~~i~k~~~~stv~eY~~~~~-----~G~~g~~~fv~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lvn~ 138 (479) .. .+.+.+...+..+-...|.+... .++.+...+++|++..+.+++.+.+.+..++-++.--.+|..+ .+. T Consensus 151 ~~--~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~i~~~~~k~~~~~~is~el--l~d 226 (419) T protein:vir:94 151 PL--LVADLLDQQNADYNVLEYIRDTSGTAGAGSTWNKAAVVPEGTAKPQSTLSFDTITTTLKTVAHWLPITRQA--ADD 226 (419) T ss_pred hh--hhhhcceeeeccCCceeeeeeccccccccccCcccceecCCccccccccceeeEEeeeeeEEEeehhhHHH--HHh Confidence 21 12222222223332333333322 2223346799999999999999999999999999888888753 344 Q ss_pred hhhHHHHHHHHHHHHHHHHHHHHHhhcccccCCCCCCcccchhhhHHHhhccCCCEEEccC------CCCCHHHHhhhhh Q lcl|NC_018856. 139 IADPMTILTEDAIAVIAKSIEWAIFYGDAALSSEADGQAGIEFDGLHKLIDQDTNVIDLKG------ARLDEATLNKAAV 212 (479) Q Consensus 139 ~~Dp~~~~~~~ai~~~~~~iE~a~f~Gd~~l~~~~~~~~gleFDGl~~~I~~~~NviDarG------~~l~~~~l~~aa~ 212 (479) ..+.+....+.--..++..++.++++||-+=. .-|+.+.-.- +.+...+ .....+.|.++-. T Consensus 227 ~~~l~~~i~~~la~a~~~~~d~aii~G~G~~~----------p~Gi~~~~~~--~~~~~~~~~~~~t~~~~~~~l~~~~~ 294 (419) T protein:vir:94 227 NSQLMGYIQGRLTYGLRFLRDRQLLNGNGSTE----------MQGILTTPGI--GTYQQPKPTAPATDEPPLVDIRRAKT 294 (419) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhccCccc----------ccceeccccc--ccccccccccccccchhHHHHHHHHH Confidence 45777888888999999999999999987622 3455553211 1111111 1122444555555 Q ss_pred hhhhccCceEEEecChHHhhhHHHHhhC-cceeeeccCCCcc----eeeeehhhhcC-CCcceecccceecCCCceeccc Q lcl|NC_018856. 213 IVGKGYGRATDAFMPIGVQADFTNNLLD-RQRVIQPSTAGGF----STGFSINQFLS-TRGAINLHGSTIMENDNILLEG 286 (479) Q Consensus 213 ~i~~~fG~~td~~mp~~vka~f~~~~~~-~qrv~~~~n~g~~----~~G~~I~~~~s-~~G~I~l~~s~~m~~~~~L~e~ 286 (479) .+...+..++-.+|+......+...... +.+.+.+.+..+. -.|++|-.... +.|.+-+ ||. .+...+.+ T Consensus 295 ~~~~~~~~~~~~v~n~~~~~~l~~~k~~~~~~~~~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~-gd~--~~~~~~~~- 370 (419) T protein:vir:94 295 VAEIAGFPPDGVVVHPQDWESIELDQAPGSGVFRVIANVQGEATPRIWGLNVVSTVAIAQGTALV-GGF--RQGATLWS- 370 (419) T ss_pred hhhhccCCCCEEEEcHHHHHHHHHHhhcCCCceeecCCcccCCCccccceeeEEcCCCCCccEEE-eec--cceEEEEE- Confidence 5556677777899999998888766665 4444433333222 23433321111 2232221 110 00000100 Q ss_pred CCcCCCCCCCceeEeeeeccCCCCCCCcccc---cceEEEEEEEcCcCccccccceeeeee Q lcl|NC_018856. 287 RNPEPNAPQAPASVVASIVDDKKGGFRDEDI---KTHSYKVVVHSDDAESLPSEAVTAAVA 344 (479) Q Consensus 287 ~~~~~~AP~~pa~v~at~~t~~~G~f~~~d~---gty~YkVtavn~~GES~pS~~vt~Tv~ 344 (479) ..+ ...-.... .+.-|..--. ....+=+.+++..+-- ....++.++ T Consensus 371 ------~~~-~~v~~~~~---~~~~~~~~~~~~r~~~r~d~~v~~~~a~~--~~~~~aa~~ 419 (419) T protein:vir:94 371 ------RQG-ITVLMTDS---HADFFTANTLVILAEFRANLAVYQPKAFV--RVTFAAATT 419 (419) T ss_pred ------ecc-eEEEEecc---ccchhhcCcEEEEEEEeeccEEeccccEE--EEEeccCCC Confidence 000 00000000 0000100000 0011222222222211 000000000 No 75 >protein:vir:7409 Length: 408 # NCBI annotation: major structural protein # Family: family:all:21 # MgeID: mge:146 # MgeName: P335 # Cross-refs: genbank:acc:NP_839926;genbank:gi:30089896;genbank:GeneID:1260683 Probab=95.93 E-value=0.00094 Score=37.18 Aligned_cols=304 Identities=13% Similarity=0.131 Sum_probs=127.8 Q ss_pred CCccch---hhhhhh--------hcCCccchHHHHHHHHHhhhc----CCC---------cChhhccCccccchhhhhhh Q lcl|NC_018856. 1 MTELKK---EAEAKN--------KKLPVEAEAELAELVSKSFTT----GYG---------ITPDTQLDGAAVRRELLEDQ 56 (479) Q Consensus 1 ~~~~~~---~~~~~~--------~~~~~~~~~~~~e~~~Ks~ta----g~~---------~~p~~~~~gaalr~esld~~ 56 (479) +.+.++ +.+.+. ........++..+.+.|+|.. +.. ....+..+|+.+-++.+... T Consensus 56 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~gg~~vP~~~~~~ 135 (408) T protein:vir:74 56 RDALREQLVEAQAEQVVNMREEEKGPLNKSENELKDKFVKDFVNMVRNPMAFLNTVSSKTETSGSDSAAGLTIPQDIRTM 135 (408) T ss_pred HHHHHHHHHHHHHHHHhhccccccccccchhhhhHHHHHHHHHHHHhcchhhhhhhhhhhhcccccCCCceeechhHhhH Confidence 110000 000000 000011122222223333311 110 01123345677778888877 Q ss_pred hhhheeccccccchhhccccchhHHHHhhhhhhccCcc-cccccccccccc-cccCcceEEEEEEEEeeeehhhhhhhHh Q lcl|NC_018856. 57 VKMLAFSSNDFTIYPLINKQQVNSTVAKYAVFNQHGRT-GHSRFVREVGVA-SINDPNIRQKTVQMKFLSDTKQQSLAAG 134 (479) Q Consensus 57 ~~~l~~~~~~f~f~~~i~k~~~~stv~eY~~~~~~G~~-g~~~fv~E~g~~-~~~d~~~~r~~~~~k~l~~~~~vs~~~~ 134 (479) |..+.. ..-.+.+.+...++.+....|... .+... ....+++|++.. +.+++.+.+....++-++.--.+|.-+= T Consensus 136 Ii~~~~--~~~~l~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~v~E~~~~~~~~~~~~~~i~~~~~k~~~~~~iS~ell 212 (408) T protein:vir:74 136 INTLVR--QYDSLQQYVRVESVSTSSGSRVYE-KWTDVTPLKAMDEEDGKIPDLDNPRLTIIKYLIKRYAGIITATNTLL 212 (408) T ss_pred HHHHHh--hhcchhhhcceeeccCCcceEEEE-eecCCcccccccccccccccccccceeeEEeeeeeEEeeehhHHHHH Confidence 744333 333466667666666544443322 23222 235688898875 4788999999999999998888887532 Q ss_pred hhcchhhHHHHHHHHHHHHHHHHHHHHHhhcccccCCCCCCcccchhhhHHHhhccC-CCEEEccCCC-CCHHHHhhhhh Q lcl|NC_018856. 135 LVNNIADPMTILTEDAIAVIAKSIEWAIFYGDAALSSEADGQAGIEFDGLHKLIDQD-TNVIDLKGAR-LDEATLNKAAV 212 (479) Q Consensus 135 lvn~~~Dp~~~~~~~ai~~~~~~iE~a~f~Gd~~l~~~~~~~~gleFDGl~~~I~~~-~NviDarG~~-l~~~~l~~aa~ 212 (479) .++..|.+....+.--..+...++.++++|+..-.+. ...+-+|+++..+... ...+-..... .+...+++... T Consensus 213 -~ds~~~l~~~i~~~l~~~~~~~~d~~il~G~G~~~~~---~~~~~~~~i~~~~~~~l~~~~~~~a~~v~n~~~~~~l~~ 288 (408) T protein:vir:74 213 -KDTAENILAWLSSWIAKKVVVTRNQAIIAAMGTVPKK---PTIANFDDVITMINTSVDPAIIATSSLLTNQSGLNKLAL 288 (408) T ss_pred -hhchHHHHHHHHHHHHHHHHHHHHHHHhhcccccccc---cccccHHHHHHHHHHhhhhhhcCCCEEEEcHHHHHHHHH Confidence 3456677888888888999999999999999875442 2357788887766311 0111011111 23333332222 Q ss_pred hhhhccCceEEEecChHHhhhHHHHhhCcceeeeccCCCcceee-----eehhhhc-----CCCcceecccce-----ec Q lcl|NC_018856. 213 IVGKGYGRATDAFMPIGVQADFTNNLLDRQRVIQPSTAGGFSTG-----FSINQFL-----STRGAINLHGST-----IM 277 (479) Q Consensus 213 ~i~~~fG~~td~~mp~~vka~f~~~~~~~qrv~~~~n~g~~~~G-----~~I~~~~-----s~~G~I~l~~s~-----~m 277 (479) + ..+-|. .+|.|.. .......+++. .|+...+....+.| +.+.+|. ..++.+.+.-+. |. T Consensus 289 l-kd~~G~--~l~~~~~-~~~~~~~l~G~-pV~~~~~~~~~~~~~~~~~i~~gd~~~~~~~~~~~~~~i~~~~~~~~~f~ 363 (408) T protein:vir:74 289 V-KTAEGK--YLLEPDP-TKPNSYLIKGK-QVIVVADRWLPNSGSTVYPLYYGDMSQAITLFDRENMSLLPTNIGAGAFE 363 (408) T ss_pred h-hcCCCc--eEeccCc-CCCCCceecce-eeEEecCcccccccCCcceEEEEehhccEEEEEecceEEEEeccccchhh Confidence 1 112222 2343321 11111222222 22211110000000 1111111 112233332221 11 Q ss_pred CCCc-eec----ccCCcCCCCCCCceeEeeeeccCCCCCC-Ccccccc Q lcl|NC_018856. 278 ENDN-ILL----EGRNPEPNAPQAPASVVASIVDDKKGGF-RDEDIKT 319 (479) Q Consensus 278 ~~~~-~L~----e~~~~~~~AP~~pa~v~at~~t~~~G~f-~~~d~gt 319 (479) .+.. +.. .+.+..|+| -..+.-++.+...|.+ +.+.... T Consensus 364 ~~~~~~r~~~r~d~~~~~~~a---~~~~~~~~~~~~~~~~~~~~~~~~ 408 (408) T protein:vir:74 364 TDTTKIRVIDRFDVKATDSEA---LVAGSFTAIADQVGNFKTTTSTAV 408 (408) T ss_pred cceeeEEEEEeeCcEEecccc---eEEEEeecccCCCCCCCCCccccC Confidence 1110 011 122222221 1111111111111111 0100000 No 76 >protein:vir:9759 Length: 303 # NCBI annotation: putative structural protein # Family: family:all:966 # MgeID: mge:175 # MgeName: 315.3 # Cross-refs: genbank:acc:NP_795521;genbank:gi:28876283;genbank:GeneID:1257824 Probab=95.92 E-value=0.0013 Score=36.49 Aligned_cols=301 Identities=10% Similarity=-0.018 Sum_probs=140.6 Q ss_pred hhhccCccccchhhhhhhhhhheeccccccchhhccccchhHHHHhhhhhhccCcccccccccccccccccCcceEEEEE Q lcl|NC_018856. 39 PDTQLDGAAVRRELLEDQVKMLAFSSNDFTIYPLINKQQVNSTVAKYAVFNQHGRTGHSRFVREVGVASINDPNIRQKTV 118 (479) Q Consensus 39 p~~~~~gaalr~esld~~~~~l~~~~~~f~f~~~i~k~~~~stv~eY~~~~~~G~~g~~~fv~E~g~~~~~d~~~~r~~~ 118 (479) =.+.+.|+.|-.+.+..+|..+. .+...+++-..+.+..+---+|.++. +.+...+++|++..+.+++.+.+... T Consensus 1 m~t~t~gg~liP~~~~~~ii~~l--~~~s~i~~l~~~~~~~~~~~~ip~~~---~~~~a~wv~E~~~~~~s~~~f~~v~l 75 (303) T protein:vir:97 1 MGTETSKASLFDKHLVSDLINKV--KGHSSLAKLSSQKPIPFNGSKEFTFT---LDSDIDVVAENGKKTHGGLSLEPVTI 75 (303) T ss_pred CcccCCCCeEcchhHHHHHHHHH--HhhchhhhhcceeecCCCceEEEEEe---cCcceEEeecCccccccccceeeEEe Confidence 22334456666766766664332 23334555555555554323455533 33457899999999999999999999 Q ss_pred EEEeeeehhhhhhhHhhhc--chhhHHHHHHHHHHHHHHHHHHHHHhhcccccCCCCCCcccchhhhHHHhhccCCCEEE Q lcl|NC_018856. 119 QMKFLSDTKQQSLAAGLVN--NIADPMTILTEDAIAVIAKSIEWAIFYGDAALSSEADGQAGIEFDGLHKLIDQDTNVID 196 (479) Q Consensus 119 ~~k~l~~~~~vs~~~~lvn--~~~Dp~~~~~~~ai~~~~~~iE~a~f~Gd~~l~~~~~~~~gleFDGl~~~I~~~~NviD 196 (479) ..|=++.--.+|.-+-.++ ...+.+....+..-..+++.+|.++++|+.+-... +..--|.........++.- T Consensus 76 ~~~kl~~~~~iS~ell~~~~d~~~~l~~~i~~~la~a~~~~ld~a~l~G~~~~~g~-----~~~~~~~~~~~~~~~~~~~ 150 (303) T protein:vir:97 76 VPIKVEYGARLSDEFLYATEEEKIDILKAFNEGFAKKLARGIDLMAMHGINPRTKK-----ASDVIGTNHFDSKVTQVVK 150 (303) T ss_pred eeEEEEEeehhhHHHhhcCccchHHHHHHHHHHHHHHHHHHHHhhhhcccccCCcc-----ccccccccccccccccccc Confidence 9999998888887754433 34467788889999999999999999997542211 1111111111111122222 Q ss_pred ccCCCCCHHHHhhhhhhhhhccCceEEEecChHHhhhHHHHhhCcceeeeccCCCcceeeeehhhhcCCCcceeccccee Q lcl|NC_018856. 197 LKGARLDEATLNKAAVIVGKGYGRATDAFMPIGVQADFTNNLLDRQRVIQPSTAGGFSTGFSINQFLSTRGAINLHGSTI 276 (479) Q Consensus 197 arG~~l~~~~l~~aa~~i~~~fG~~td~~mp~~vka~f~~~~~~~qrv~~~~n~g~~~~G~~I~~~~s~~G~I~l~~s~~ 276 (479) .-+.....+.|.++.-.+..+++.++.+.|++.....+...-...-+.+...+.+. ...+..+ T Consensus 151 ~~~~~~~~~~i~~~~~~~~~~~~~~~~~vmn~~~~~~L~~lkd~~g~~~~~~~~~~-----------------~~~~~~l 213 (303) T protein:vir:97 151 FTESEDADANIEAAVNLIQGAEGVVTGLAMDTEFSTALAKVTNGEMGPKMYPELAW-----------------GANPDSI 213 (303) T ss_pred cccccchHHHHHHHHHHHhhcCCCccEEEEcHHHHHHHHHhhccCCCeEEecCccC-----------------CCCCcee Confidence 22222334556665556667788888899999999888654332223322212110 0011123 Q ss_pred cCCCceecccCCcCCCCCCCceeEeeeeccCCCCCCCcccccceEEEEEEEcCcCccccccceeeeeecCCceEEEEEEe Q lcl|NC_018856. 277 MENDNILLEGRNPEPNAPQAPASVVASIVDDKKGGFRDEDIKTHSYKVVVHSDDAESLPSEAVTAAVAKKDNTVKLEVKL 356 (479) Q Consensus 277 m~~~~~L~e~~~~~~~AP~~pa~v~at~~t~~~G~f~~~d~gty~YkVtavn~~GES~pS~~vt~Tv~~~g~sv~ltIT~ 356 (479) +..+- .+...++.......+. ...-.|.+++.+....+.+-+ +++.. T Consensus 214 ~G~Pv-~~s~~v~~~~~~~~~~--------------~~~~~Gdf~~~~~~~~~~~~~------------------~~~~~ 260 (303) T protein:vir:97 214 NGLKS-SVNTTVGAGADEAESK--------------DLVIIGDFESMFKWGYAKQIP------------------MEIIK 260 (303) T ss_pred cceee-EEecccCCccccCCCc--------------cEEEEeeccccEEEEEecCcE------------------EEEee Confidence 32221 1111111110000000 001112233222222222211 11111 Q ss_pred cCCCCcccceEEEEEecCCCcceEEEEeeeeeeecCCceEEEeeccccC Q lcl|NC_018856. 357 ASLYQAQPQFISVYREGTETGHYFLIARVPVSKVNDQGVIEVLDRNQVI 405 (479) Q Consensus 357 ~~~~~a~~~~y~IYR~~~~~G~y~li~rv~vs~~n~~g~T~ftD~N~~i 405 (479) ....- ...++-|++. ---|+...|+...-.+...-...+|. .+ T Consensus 261 ~~~~d--~~~~~~~~~n--~~~~r~~~r~~~~v~~p~af~~l~~~--~~ 303 (303) T protein:vir:97 261 YGDPD--NSGKDLKGYN--QIYLRAEAYIGWGILDAKSFARVTKG--EV 303 (303) T ss_pred ccCCC--CcchhhhhcC--cEEEEEEEEeccEeecccceEEeeCC--CC Confidence 00000 0001111100 01112222332222222233333332 11 No 77 >protein:vir:1433 Length: 435 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:30 # MgeName: phiE125 # Cross-refs: genbank:acc:NP_536362;genbank:gi:17975167;genbank:GeneID:929171 Probab=95.87 E-value=0.0013 Score=36.35 Aligned_cols=320 Identities=12% Similarity=0.101 Sum_probs=137.9 Q ss_pred CCccchh------------------------hhhhh---hcC-Cccc--hHHHHHHHHHhhhcC---------------C Q lcl|NC_018856. 1 MTELKKE------------------------AEAKN---KKL-PVEA--EAELAELVSKSFTTG---------------Y 35 (479) Q Consensus 1 ~~~~~~~------------------------~~~~~---~~~-~~~~--~~~~~e~~~Ks~tag---------------~ 35 (479) |.++.+| ..... +.. +... ...-...+.|++..+ + T Consensus 45 i~~l~~~I~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 124 (435) T protein:vir:14 45 FSELTAQIERAEAAERMAAAAAVPVDPNPTAVAAPAAAPVHAQPKALEVKGAKMARMVRALAAARGDAQLASKLAIERGF 124 (435) T ss_pred HHHHHHHHHHHHHHHHHHHhhcccccchhhhhhhccccccccccchhhhhHHHHHHHHHHHHhhcchhhHHHHHHHhhhh Confidence 0000000 00000 000 0000 000001122222111 1 Q ss_pred C------cChhhccCccccchhhhhhhhhhheeccccccchhhccccch--hHHHHhhhhhhccCccccccccccccccc Q lcl|NC_018856. 36 G------ITPDTQLDGAAVRRELLEDQVKMLAFSSNDFTIYPLINKQQV--NSTVAKYAVFNQHGRTGHSRFVREVGVAS 107 (479) Q Consensus 36 ~------~~p~~~~~gaalr~esld~~~~~l~~~~~~f~f~~~i~k~~~--~stv~eY~~~~~~G~~g~~~fv~E~g~~~ 107 (479) . .+-.+...|+.|-.+.+..+|..+... ...+..+..+.+ .+---+|.++. +.+...+++|++..+ T Consensus 125 ~~~~~~~~~~~t~~~gg~~vP~~~~~~ii~~l~~---~~~i~~~~~~~~~~~~~~~~~p~~~---~~~~a~~v~E~~~~~ 198 (435) T protein:vir:14 125 GEEVAMSLNTLSPGAGGVLVPENLSSEVIELLRP---KSVVRKLGARTLPLSNGNITIPRLK---GGAIVGYIGADTDIP 198 (435) T ss_pred hhhhhhhcccCCcCCCccccchhHHHHHHHHHhh---hchhhhhcceeeecCCCceEEEEEe---CCcceeeeccCcccc Confidence 1 111222346667788887776544322 222223222222 22223344433 233567899999999 Q ss_pred ccCcceEEEEEEEEeeeehhhhhhhHhhhcchh--hHHHHHHHHHHHHHHHHHHHHHhhcccccCCCCCCcccchhhhHH Q lcl|NC_018856. 108 INDPNIRQKTVQMKFLSDTKQQSLAAGLVNNIA--DPMTILTEDAIAVIAKSIEWAIFYGDAALSSEADGQAGIEFDGLH 185 (479) Q Consensus 108 ~~d~~~~r~~~~~k~l~~~~~vs~~~~lvn~~~--Dp~~~~~~~ai~~~~~~iE~a~f~Gd~~l~~~~~~~~gleFDGl~ 185 (479) ..|+.+.+.+..++=++....+|.-+ +.++.- +.+....+.-...+.+.+|.++++|+-. +-++.||. T Consensus 199 ~~~~~f~~i~~~~~k~~~~~~iS~el-l~ds~~~~~l~~~i~~~l~~ai~~~~d~a~l~G~G~---------~~~p~Gi~ 268 (435) T protein:vir:14 199 TTQQQFDDLKLTAKKMAALVPIANDL-IKYAGVNPNVDQIVVGDLTAAIGAREDKAFIRDDGT---------ANTPKGLR 268 (435) T ss_pred ccccceeEEEeeeEEEEEeehhhHHH-HHhhccCHHHHHHHHHHHHHHHHHHHHHHhhccCCC---------Ccccccee Confidence 99999999999999898888888655 233322 3557778888889999999999999743 12457776 Q ss_pred HhhccCCCEEEcc-CCCCC--HHHHhhhhhhhhh---ccCceEEEecChHHhhhHHHHhhCcceeeeccCCCcceeeeeh Q lcl|NC_018856. 186 KLIDQDTNVIDLK-GARLD--EATLNKAAVIVGK---GYGRATDAFMPIGVQADFTNNLLDRQRVIQPSTAGGFSTGFSI 259 (479) Q Consensus 186 ~~I~~~~NviDar-G~~l~--~~~l~~aa~~i~~---~fG~~td~~mp~~vka~f~~~~~~~qrv~~~~n~g~~~~G~~I 259 (479) +..... ++...- +...+ ...|.++...+.. ++..+ -..|+....+.+...-...-|.+.+...++.-.|++| T Consensus 269 ~~~~~~-~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~-~~v~n~~~~~~L~~lkd~~G~~l~~~~~~g~l~G~Pv 346 (435) T protein:vir:14 269 FWALPS-NVITASDASTLQKIETDLGKVILALENADANLTQP-GWIMAPRTFRFLEGLRDGNGNKVYPELANGMLKGYPV 346 (435) T ss_pred eccccc-ceeccccccchhhHHHHHHHHHHHhhhccccccCC-EEEEcHHHHHHHHHhhccCCceeccCCCCCeeeccee Confidence 554332 333322 22222 2233333333332 23332 3678999888886544444455555444434455443 Q ss_pred hhhcCC---CcceecccceecCCC--ceecccCCcCCCCCCCceeEee-eeccCCC----CCCCcccccceEEEEEEEcC Q lcl|NC_018856. 260 NQFLST---RGAINLHGSTIMEND--NILLEGRNPEPNAPQAPASVVA-SIVDDKK----GGFRDEDIKTHSYKVVVHSD 329 (479) Q Consensus 260 ~~~~s~---~G~I~l~~s~~m~~~--~~L~e~~~~~~~AP~~pa~v~a-t~~t~~~----G~f~~~d~gty~YkVtavn~ 329 (479) --.... -+...-.+.++..+. .++.. .-.....++. +...... ..|.- +.-.+++...-+ T Consensus 347 ~~~~~~p~~~~~~~~~~~i~~gd~s~~~i~~-------~~~~~~~~~~~~~~~~~~~~~~~~f~~---~~~~~r~~~r~d 416 (435) T protein:vir:14 347 GKTTQVPINLGETGKESEIYFTDFGDVFIGE-------EETLEIDYSKEATYKDADGHMVSAFQR---DQTLIRVIAKND 416 (435) T ss_pred EeeccccccccCCCccceEEEeecccEEEEE-------ecccEEEEeccccccccccchhhhhhc---ChhheeeeeeeC Confidence 211110 000000001111100 00100 0000000000 0000000 11211 123344444433 Q ss_pred cCccccccceeeeeecCCc Q lcl|NC_018856. 330 DAESLPSEAVTAAVAKKDN 348 (479) Q Consensus 330 ~GES~pS~~vt~Tv~~~g~ 348 (479) .+--.|...+..+-.+-|. T Consensus 417 ~~~~~~~a~~~l~~~~~~~ 435 (435) T protein:vir:14 417 FGPRHVESIAVLAGVAWGA 435 (435) T ss_pred ceeecccceEEEecCCCCC Confidence 3433444444444333333 No 78 >protein:vir:103759 Length: 330 # NCBI annotation: hypothetical protein # Family: family:all:1903 # MgeID: mge:1645 # MgeName: BcepC6B # Cross-refs: genbank:acc:YP_024928;genbank:gi:48697198;genbank:GeneID:2846083 Probab=95.81 E-value=0.00014 Score=41.79 Aligned_cols=283 Identities=17% Similarity=0.205 Sum_probs=134.6 Q ss_pred CCccchhhhhhhhcCCccchHHHHHHHHHhhhcCCCcChhhccCccccchhhhhhhhhhheeccccccchhhccccchhH Q lcl|NC_018856. 1 MTELKKEAEAKNKKLPVEAEAELAELVSKSFTTGYGITPDTQLDGAAVRRELLEDQVKMLAFSSNDFTIYPLINKQQVNS 80 (479) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~e~~~Ks~tag~~~~p~~~~~gaalr~esld~~~~~l~~~~~~f~f~~~i~k~~~~s 80 (479) |+....+. =.|.| ++|- .+||... +-..|.|. .++ .+|..++=...++ T Consensus 1 m~~~~~~a------------~TL~e-~AKr------~~~d~~~---~~IIE~l~-------~tn---~IL~~lpf~e~N~ 48 (330) T protein:vir:10 1 MATLSTNN------------PTMAD-VAKR------LDPNGKV---DIIVEMLN-------QTN---PVLQDMTAIEGNL 48 (330) T ss_pred CCcCCCCc------------ccHHH-HHhh------cCcchhH---HHHHHHHh-------cCc---hHHhhcchhhccC Confidence 55443321 01111 1121 1111111 11222222 111 1122222222221 Q ss_pred -HHHhhhhhhccCcccccccccccccccccCcceEEEEEEEEeeeehhhhhhhHhhh-cchhhHHHHHHHHHHHHHHHHH Q lcl|NC_018856. 81 -TVAKYAVFNQHGRTGHSRFVREVGVASINDPNIRQKTVQMKFLSDTKQQSLAAGLV-NNIADPMTILTEDAIAVIAKSI 158 (479) Q Consensus 81 -tv~eY~~~~~~G~~g~~~fv~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lv-n~~~Dp~~~~~~~ai~~~~~~i 158 (479) |=|.+.+ +-+--...|-.=.....-+.++..|++..++.|..-..|-+...-. .+..+-+++|.+.-|..+.+.+ T Consensus 49 ~tg~~t~v---rt~LP~~~fR~lN~g~~~s~~tt~qvt~~l~ilgg~~eVDr~la~~~Gn~a~~ra~e~~~~ikam~q~~ 125 (330) T protein:vir:10 49 PTGHRTSV---RTGLPTPTWRKLYGGVLPNKSSTAQVTDNCGMLEAYAEVDKALADLNGNTAAFRLSEDRAQIEGMNQEV 125 (330) T ss_pred CcccceeE---EeecCCchhhhcCCccccccceEEEEEEEeEEecchhhhhhHHHhhcCCHHHHHHHHHHHHHHHHHHHH Confidence 1122211 1122223332212233445689999999999999999998876544 5567778999999999999999 Q ss_pred HHHHhhcccccCCCCCCcccchhhhHHHhhcc-----CCCEEEccCCCCCHHHHhhhhhhhhhccCceEEEecChHHhhh Q lcl|NC_018856. 159 EWAIFYGDAALSSEADGQAGIEFDGLHKLIDQ-----DTNVIDLKGARLDEATLNKAAVIVGKGYGRATDAFMPIGVQAD 233 (479) Q Consensus 159 E~a~f~Gd~~l~~~~~~~~gleFDGl~~~I~~-----~~NviDarG~~l~~~~l~~aa~~i~~~fG~~td~~mp~~vka~ 233 (479) +..+||||++.+| -+||||.+.+.. ++|+||+.|.--....|+ ++.=+=..+ ..+-|-|-|+- T Consensus 126 ~~~~iyGD~a~~p-------~~F~GL~kR~~~~ta~~~~qvIdaGGtG~~~TSi~----~v~wg~~~~-~giyPkG~kaG 193 (330) T protein:vir:10 126 AQTLFYGNDGIAP-------AEFTGLSPRYNSLSAENKDNVIDAGGTGSDNASAW----LVVWGPNTC-HSIYPKGSKAG 193 (330) T ss_pred HHHhccCCCCCCh-------hhccchhhhcCCCCCCchhheeeccccccCceEEE----EEEEcCCeE-EEEcccCcccc Confidence 9999999999877 489999999942 369999999544332222 111111222 33348898888 Q ss_pred HHHHhhCcceeeec-cCCCcc---------eeeeehhhhcCCCcceeccccee---cCCC---ceecccCCcCCC--CCC Q lcl|NC_018856. 234 FTNNLLDRQRVIQP-STAGGF---------STGFSINQFLSTRGAINLHGSTI---MEND---NILLEGRNPEPN--APQ 295 (479) Q Consensus 234 f~~~~~~~qrv~~~-~n~g~~---------~~G~~I~~~~s~~G~I~l~~s~~---m~~~---~~L~e~~~~~~~--AP~ 295 (479) |+-..++.+++.-. .+.|.. .+|+.|.++.++-.--++..+-. .... ..+.++...-|+ ... T Consensus 194 l~~~d~g~~~~~~~dg~gg~y~~~~~~~~w~~Gl~i~d~r~vvRI~NIdvs~l~~~~~~~~li~lm~~A~~~ip~~~~g~ 273 (330) T protein:vir:10 194 LSVEDKGQVTIENADGNGGRMEGYRTHYKWDIGLTLRDWRYVARVCNIDVSDLATSANAQALIKYMIMAAERIPQLGMGR 273 (330) T ss_pred ceeeeccceeeecccCCCCceeEEeeeeeeeeeeEEeCcccEEEEeecccccCCCCccHHHHHHHHHHHHHhccCCCCCc Confidence 87777776666422 121222 57778887777644333322211 0000 001111111111 110 Q ss_pred Cce------------------eEeeeeccCCCCCCCcccccceE--EEEEEEc--CcCccccccceee Q lcl|NC_018856. 296 APA------------------SVVASIVDDKKGGFRDEDIKTHS--YKVVVHS--DDAESLPSEAVTA 341 (479) Q Consensus 296 ~pa------------------~v~at~~t~~~G~f~~~d~gty~--YkVtavn--~~GES~pS~~vt~ 341 (479) +.. -+.-+. -...|+.. ++. =-|.-|+ ...|+. -+ T Consensus 274 ~~~y~n~~v~~~L~~q~~~k~n~~l~~-~~~~g~~~-----t~~~gipir~~Dail~tE~~-----vv 330 (330) T protein:vir:10 274 AVWYMNRNLREKLRLGIVDKIANNLTW-ETVSGERV-----MTFDGIPVQRTDALLNTESR-----VV 330 (330) T ss_pred ceeeechHHHHHHHHHHhhcccceeee-eecCCeee-----EEECCeEEEEEeeeecCccc-----cC Confidence 000 000001 11123321 111 1222232 123321 11 No 79 >protein:vir:81160 Length: 371 # NCBI annotation: major capsid protein # Family: family:all:21 # MgeID: mge:1892 # MgeName: Geobacillus virus E2 # Cross-refs: genbank:acc:YP_001285811;genbank:gi:148747732;genbank:GeneID:5247203 Probab=95.74 E-value=0.00085 Score=37.42 Aligned_cols=299 Identities=10% Similarity=0.034 Sum_probs=138.9 Q ss_pred CCcc--chhhhhhhhcCCccchHHHHHHHHHhhhcCCCcChhhccCccccchhhhhhhhhhheeccccccchhhccccch Q lcl|NC_018856. 1 MTEL--KKEAEAKNKKLPVEAEAELAELVSKSFTTGYGITPDTQLDGAAVRRELLEDQVKMLAFSSNDFTIYPLINKQQV 78 (479) Q Consensus 1 ~~~~--~~~~~~~~~~~~~~~~~~~~e~~~Ks~tag~~~~p~~~~~gaalr~esld~~~~~l~~~~~~f~f~~~i~k~~~ 78 (479) +.+. ..+.+.+.++ +-.+.+.....+++++|. -.+|+.+..+.+.++|..+. .+...+++.+...++ T Consensus 62 ~~~~~~~~~~~~~~~~---~~~~~l~~~~~~a~~~~t------~~~gg~~vP~~~~~~ii~~~--~~~s~i~~~~~~~~~ 130 (371) T protein:vir:81 62 KEPLKPTVQVKENEVE---AFVNHIRTRFRNAMSEGS------NQDGGYTVPQDIQTRINELR--ESKDALQNLITVEPV 130 (371) T ss_pred ccccccchhhHHHHHH---HHHHHHHHHHHHhhccCC------CccCceeecHhHHHHHHHHH--Hhhhhhhhhceeeec Confidence 0000 0000000000 001111112234454442 24577788888877774333 333456666666667 Q ss_pred hHHHHhhhhhhccCcccccccccccccc-cccCcceEEEEEEEEeeeehhhhhhhHhhhcchhhHHHHHHHHHHHHHHHH Q lcl|NC_018856. 79 NSTVAKYAVFNQHGRTGHSRFVREVGVA-SINDPNIRQKTVQMKFLSDTKQQSLAAGLVNNIADPMTILTEDAIAVIAKS 157 (479) Q Consensus 79 ~stv~eY~~~~~~G~~g~~~fv~E~g~~-~~~d~~~~r~~~~~k~l~~~~~vs~~~~lvn~~~Dp~~~~~~~ai~~~~~~ 157 (479) .+...+|......++ +...+++|++.. +.+++++.+.+...+-++....+|.-+ +.++.-|.+....+.-...++.. T Consensus 131 ~~~~~~~~~~~~~~~-~~a~~v~Eg~~~~~~~~~~f~~i~~~~~k~~~~~~iS~el-l~ds~~~l~~~i~~~l~~a~~~~ 208 (371) T protein:vir:81 131 TTLSGSRVFKKRSQQ-TGFVEVAEGAAIGEKATPQFTLLQYQVKKYAGFFRVTNEL-LNDSTEAIVNTLVRWIGDESRVT 208 (371) T ss_pred cCCceeEEEEeecCC-cceeeeccccccccccccceeeEEeeeeEEEEeehhhHHH-HhhhhHHHHHHHHHHHHHHHHHH Confidence 655555555444333 356789999875 578999999999999999988888765 34455577788888888889999 Q ss_pred HHHHHhhcccccCCCCCCcccchhhhHHHhhccCCCEEEccCCCCCHHHHhhhhhhhhhccCceEEEecChHHhhhHHHH Q lcl|NC_018856. 158 IEWAIFYGDAALSSEADGQAGIEFDGLHKLIDQDTNVIDLKGARLDEATLNKAAVIVGKGYGRATDAFMPIGVQADFTNN 237 (479) Q Consensus 158 iE~a~f~Gd~~l~~~~~~~~gleFDGl~~~I~~~~NviDarG~~l~~~~l~~aa~~i~~~fG~~td~~mp~~vka~f~~~ 237 (479) ++.+++.|+....+.. .+-.|++...+... ....|....-.+|++.+.+.+... T Consensus 209 ~~~~i~~g~g~~~~~~----~~~~~~i~~~~~~~----------------------l~~~~~~~a~~vmn~~~~~~L~~l 262 (371) T protein:vir:81 209 RNGLIINVLNTKAKTA----IADLDGLKQIINVQ----------------------LDPVFRSTSSVIVNQDAFNWLDTL 262 (371) T ss_pred HHHHHHhhcccccccc----cccHHHHHHHHHhh----------------------cchhhhcCCEEEEcHHHHHHHHHh Confidence 9999999998755421 24456665554311 111222223467787777776554 Q ss_pred hhCcceeeeccCCCc----ceeeeehhhhc-CCCcceecccceecCCCcee-cccC--CcCCCCCCCceeEeeeeccCCC Q lcl|NC_018856. 238 LLDRQRVIQPSTAGG----FSTGFSINQFL-STRGAINLHGSTIMENDNIL-LEGR--NPEPNAPQAPASVVASIVDDKK 309 (479) Q Consensus 238 ~~~~qrv~~~~n~g~----~~~G~~I~~~~-s~~G~I~l~~s~~m~~~~~L-~e~~--~~~~~AP~~pa~v~at~~t~~~ 309 (479) -...-|.+...+... .-.|++|--.. .|-|... ..+...+...++ ..-. ....+..+ ......... . T Consensus 263 kd~~g~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~-~~~~~~~~~~i~~Gd~~~~~~~~~~~~-~~i~~~~~~---~ 337 (371) T protein:vir:81 263 KDQNGQYLLQPSISSPTGRQLLGLPVVIVSNKVLANRV-DGGTGAQFAPIIVGDLKEAVVMFDRQR-TEIMSSNVA---M 337 (371) T ss_pred hccCCCeeeecccCCCCCceecceeEEEecccccCccc-cccccCCcceEEEEehhceEEEEeecc-eEEEEeccc---c Confidence 333233333222222 12343332111 1111100 000000011010 0000 00000011 111111111 0 Q ss_pred CCCCcccccceEEEEEEEcCcCccccccceeeeeecC Q lcl|NC_018856. 310 GGFRDEDIKTHSYKVVVHSDDAESLPSEAVTAAVAKK 346 (479) Q Consensus 310 G~f~~~d~gty~YkVtavn~~GES~pS~~vt~Tv~~~ 346 (479) ..|. -+...|++...-+-+---|...+..++++. T Consensus 338 ~~f~---~~~v~~~~~~r~d~~~~~~~a~~~~~~~~A 371 (371) T protein:vir:81 338 DAFE---TDATLWRAIERMDVKMRDDEAFVFGEVQLA 371 (371) T ss_pred chhh---cCceEEEEEEeeccEEecccceEEEEEecC Confidence 1121 122334443333323222333433333333 No 80 >protein:vir:1328 Length: 392 # NCBI annotation: gp36 # Family: family:all:21 # MgeID: mge:28 # MgeName: phi-C31 # Cross-refs: genbank:acc:NP_047927;swissprot:trembl:q9zwv6;genbank:gi:9631145;uniprot:Q9ZWV6;genbank:GeneID:2715889 Probab=95.73 E-value=0.00051 Score=38.64 Aligned_cols=308 Identities=9% Similarity=-0.028 Sum_probs=130.1 Q ss_pred CCccch------------------hhhhhhhcCCccchHHHH----HHHHHhhhcC---------CCcChhhccCccccc Q lcl|NC_018856. 1 MTELKK------------------EAEAKNKKLPVEAEAELA----ELVSKSFTTG---------YGITPDTQLDGAAVR 49 (479) Q Consensus 1 ~~~~~~------------------~~~~~~~~~~~~~~~~~~----e~~~Ks~tag---------~~~~p~~~~~gaalr 49 (479) +.|.+. +.+.+..+.+....+... +...++...+ -...-.+..+|+.+- T Consensus 44 ~~e~~~l~~~i~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~g~~~~~~~~~~~~~~~~~t~~~~g~~~~ 123 (392) T protein:vir:13 44 LTAVADFDGRIKRGIDAIKATDAVTSLLSGLQGSGSGAQRSADHDDDAVLRAGNLGEARSFEFAPEKRDGTKAGNPNVLS 123 (392) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhcccCCcccchhhhhhHHHHHHHhccchhhhHHHHhhhhhhcccccCCCcccc Confidence 000000 000000000000000000 0001111111 011111222344444 Q ss_pred hhhhhhhhhhheeccccccchhhccccchhHHHHhhhhhhccCcccccccccccccccccCcceEEEEEEEEeeeehhhh Q lcl|NC_018856. 50 RELLEDQVKMLAFSSNDFTIYPLINKQQVNSTVAKYAVFNQHGRTGHSRFVREVGVASINDPNIRQKTVQMKFLSDTKQQ 129 (479) Q Consensus 50 ~esld~~~~~l~~~~~~f~f~~~i~k~~~~stv~eY~~~~~~G~~g~~~fv~E~g~~~~~d~~~~r~~~~~k~l~~~~~v 129 (479) .+..++.|..+. ..+..++.+...--.+.-..|.....- +.....+++|++..+.+++.+.+....++=++.--.+ T Consensus 124 ~~~~~~~i~~~~---~~~~~l~~~~~~~~~~~~~~~~~~~~~-~~~~a~~v~E~~~~~~~~~~f~~v~~~~~k~~~~~~i 199 (392) T protein:vir:13 124 RTLYGQLIAQAV---ERSAIMRGGASTFTTSDANPMDFTVIT-GRATAGIVGETAEIPESYPATTQRSMGGFKYGFASVV 199 (392) T ss_pred ccchHHHHHHHH---hhhhhhhhcceeeecCCCceeEEEEEc-CCcceeeecccccccccccceeeEEeeeeeEEeeehh Confidence 444444443322 223334433221111122222222222 2335668999999999999999999999888877777 Q ss_pred hhhHhhhcchhhHHHHHHHHHHHHHHHHHHHHHhhcccccCCCCCCcccchhhhHHHhhccCCC-EEEccCCCCCHHHHh Q lcl|NC_018856. 130 SLAAGLVNNIADPMTILTEDAIAVIAKSIEWAIFYGDAALSSEADGQAGIEFDGLHKLIDQDTN-VIDLKGARLDEATLN 208 (479) Q Consensus 130 s~~~~lvn~~~Dp~~~~~~~ai~~~~~~iE~a~f~Gd~~l~~~~~~~~gleFDGl~~~I~~~~N-viDarG~~l~~~~l~ 208 (479) |.-+ +.++..|.+....+.-...+++.++.++|+||-.-. -.|+++....... +..+....++-+.|. T Consensus 200 S~el-l~ds~~~l~~~i~~~l~~~i~~~~d~~~l~G~Gt~~----------p~Gil~~~~~~~~~~~~~~~~~~~~d~l~ 268 (392) T protein:vir:13 200 SYEF-ATDQVLDLVGFLVSDAGPAIGDAMGRHFLTGTGTGQ----------PRGILTDATGANAAFGEADADSKVSDALI 268 (392) T ss_pred HHHH-HhcchHHHHHHHHHHHHHHHHHHHHHHHhcccCCcc----------ccccccccccccccccccccccccHHHHH Confidence 7653 335556778888888889999999999999986421 2567666543222 223334455555555 Q ss_pred hhhhhhhhccCceEEEecChHHhhhHHHHhhCcceeeeccCCCcc----eeeeehhhh-cCCCcceecccc----eecCC Q lcl|NC_018856. 209 KAAVIVGKGYGRATDAFMPIGVQADFTNNLLDRQRVIQPSTAGGF----STGFSINQF-LSTRGAINLHGS----TIMEN 279 (479) Q Consensus 209 ~aa~~i~~~fG~~td~~mp~~vka~f~~~~~~~qrv~~~~n~g~~----~~G~~I~~~-~s~~G~I~l~~s----~~m~~ 279 (479) ++-..+..+|....-..|+....+.+...-...-|++...+.... -.|++|--. ..+.+.|-| || .+..+ T Consensus 269 ~~~~~l~~~~~~~a~~v~n~~~~~~l~~lkd~~G~~l~~~~~~~g~~~~l~G~Pv~~~~~~~~~~i~~-Gdf~~~~i~~~ 347 (392) T protein:vir:13 269 DLFHEVPSAYRKNAKFVVNDLRAAQMRKLKDANGQYLWQSALTVGAPDTFNGKVVETDDGMPADKVLF-ADLSKYRVRFA 347 (392) T ss_pred HHHHhhhhhhhcCCEEEEcHHHHHHHHHhhccCCceeecCCcCCCCCceecceeeEEcCCCCCCcEEE-eeccceeEEee Confidence 544344455655556889999888887655554455543222211 345433211 112333322 22 01111 Q ss_pred CceecccCCcCCCCCCCceeEeeeeccCCCCCCCcccccceEEEEEEEc Q lcl|NC_018856. 280 DNILLEGRNPEPNAPQAPASVVASIVDDKKGGFRDEDIKTHSYKVVVHS 328 (479) Q Consensus 280 ~~~L~e~~~~~~~AP~~pa~v~at~~t~~~G~f~~~d~gty~YkVtavn 328 (479) ..+.++. ...+..-..-+..-+..-. +|+....++ -...+|++.. T Consensus 348 ~~~~i~~-~~~~~~~~~~~~~r~~~r~--d~~~~~~~A-~~~~~~~~aa 392 (392) T protein:vir:13 348 GSLRVDR-SVDAKFSTDQIVYRFLQRA--DGLLVDARG-AKVLTVTPAA 392 (392) T ss_pred cceEEEe-eccccccCCcEEEEEEEEe--ccEEecccc-eEEEEeeccC Confidence 1110000 0000000000000111010 011100000 0111221111 No 81 >protein:vir:80376 Length: 435 # NCBI annotation: gp6, major capsid head protein # Family: family:all:21 # MgeID: mge:1881 # MgeName: phi644-2 # Cross-refs: genbank:acc:YP_001111085;genbank:gi:134288639;genbank:GeneID:4960624 Probab=95.70 E-value=0.0015 Score=36.06 Aligned_cols=317 Identities=12% Similarity=0.102 Sum_probs=138.0 Q ss_pred CC----------c-cc------hhhhhhhhcCCccchH---HHHHHHHHhhhcC---------------CC------cCh Q lcl|NC_018856. 1 MT----------E-LK------KEAEAKNKKLPVEAEA---ELAELVSKSFTTG---------------YG------ITP 39 (479) Q Consensus 1 ~~----------~-~~------~~~~~~~~~~~~~~~~---~~~e~~~Ks~tag---------------~~------~~p 39 (479) +. + .+ +....+.+......+. .-...+.|++..+ +. .+- T Consensus 55 ~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 134 (435) T protein:vir:80 55 AEAAERMAAAAAVPVDPNPAAVTASAAAPVYAQPKAPEVKGAKMARMVRALAAARGDAQLASKLAIERGFGEEVAMSLNT 134 (435) T ss_pred HHHHHHHHHhhcccccchhhhhccccccccccccchhhhhHHHHHHHHHHHHhccchhHHHHHHHHhhhhhhhhhhhhcc Confidence 00 0 00 0000000000000000 0001111222111 00 111 Q ss_pred hhccCccccchhhhhhhhhhheeccccccchhhcccc--chhHHHHhhhhhhccCcccccccccccccccccCcceEEEE Q lcl|NC_018856. 40 DTQLDGAAVRRELLEDQVKMLAFSSNDFTIYPLINKQ--QVNSTVAKYAVFNQHGRTGHSRFVREVGVASINDPNIRQKT 117 (479) Q Consensus 40 ~~~~~gaalr~esld~~~~~l~~~~~~f~f~~~i~k~--~~~stv~eY~~~~~~G~~g~~~fv~E~g~~~~~d~~~~r~~ 117 (479) .+...|+.|-.+.+..+|..+.... ..+..+..+ +..+-..+|.++. +.+...|++|++..+..++.+.+.. T Consensus 135 ~~~~~gg~lvP~~~~~~ii~~l~~~---~~i~~~~~~~v~~~~~~~~~p~~~---~~~~a~~v~E~~~~~~~~~~f~~i~ 208 (435) T protein:vir:80 135 LSPGAGGVLVPENLSSEVIELLRPK---SVVRKLGARTLPLSNGNITIPRLK---GGAIVGYIGADTDIPTTQQQFDDLK 208 (435) T ss_pred cCCCCCccccchhHHHHHHHHHhhh---chhhhccceeeecCCCceEEEEEe---CCcceeeeccCccccccccceeeEE Confidence 2233466677777777765443222 222222211 2223233444433 3345679999999999999999999 Q ss_pred EEEEeeeehhhhhhhHhhhcc-h-hhHHHHHHHHHHHHHHHHHHHHHhhcccccCCCCCCcccchhhhHHHhhccCCCEE Q lcl|NC_018856. 118 VQMKFLSDTKQQSLAAGLVNN-I-ADPMTILTEDAIAVIAKSIEWAIFYGDAALSSEADGQAGIEFDGLHKLIDQDTNVI 195 (479) Q Consensus 118 ~~~k~l~~~~~vs~~~~lvn~-~-~Dp~~~~~~~ai~~~~~~iE~a~f~Gd~~l~~~~~~~~gleFDGl~~~I~~~~Nvi 195 (479) ..++=++....+|.-+ +.++ + -+.+....+.-...+...+|.++|+|+..= -+..||.+..... ++. T Consensus 209 ~~~~k~~~~~~is~el-l~ds~~~~~l~~~i~~~l~~a~~~~~d~a~l~G~G~~---------~~p~Gi~~~~~~~-~~~ 277 (435) T protein:vir:80 209 LTAKKMAALVPIANDL-IKYAGVNPNVDQIVVGDLTAAIGAREDKAFIRDDGTA---------NTPKGLRFWALPG-NVI 277 (435) T ss_pred EeeEEEEEeehhhHHH-HHhhcccHHHHHHHHHHHHHHHHHHHHHHhhccCCCC---------Ccccceeeccccc-cee Confidence 9999898888888765 3333 2 356788899999999999999999997531 1236766655443 444 Q ss_pred Ecc-CCCCC--HHHHhhhhhhhhhc--cCceEEEecChHHhhhHHHHhhCcceeeeccCCCcceeeeehhhhcCCC---- Q lcl|NC_018856. 196 DLK-GARLD--EATLNKAAVIVGKG--YGRATDAFMPIGVQADFTNNLLDRQRVIQPSTAGGFSTGFSINQFLSTR---- 266 (479) Q Consensus 196 Dar-G~~l~--~~~l~~aa~~i~~~--fG~~td~~mp~~vka~f~~~~~~~qrv~~~~n~g~~~~G~~I~~~~s~~---- 266 (479) ..- |.... ...+.++-..+..+ +-...-..|+..+...+...-...-|.+.|...++--.|++|-...... T Consensus 278 ~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~vmn~~~~~~L~~lkd~~G~~l~~~~~~~~l~G~pv~~~~~~p~~~~ 357 (435) T protein:vir:80 278 TASDGSTLQKIETDLGKAILALENADANLTQPGWIMAPRTFRFLEGLRDGNGNKVYPELANGMLKGYPVGKTTQVPINLG 357 (435) T ss_pred ecccccchhhHHHHHHHHHHHhhccccccccCEEEEcHHHHHHHHhhhccCCceeccCCCCCeEeeeeeEEecccccccc Confidence 333 33332 12233433322222 2222346789998888766444444555554444445565442211110 Q ss_pred -----cceecccceecCCCceecccCCcCCCCCCCceeEeee-eccCCCC----CCCcccccceEEEEEEEcCcCccccc Q lcl|NC_018856. 267 -----GAINLHGSTIMENDNILLEGRNPEPNAPQAPASVVAS-IVDDKKG----GFRDEDIKTHSYKVVVHSDDAESLPS 336 (479) Q Consensus 267 -----G~I~l~~s~~m~~~~~L~e~~~~~~~AP~~pa~v~at-~~t~~~G----~f~~~d~gty~YkVtavn~~GES~pS 336 (479) +.| +.|+ +.+ .++.. ..+....++.- .-.+..+ .|.. +.-.+++...=+-.=-.|. T Consensus 358 ~~~~~~~i-~~gd--~s~-~~i~~-------~~~~~i~~~~~~~~~~~~~~~~~~f~~---n~~~~r~~~r~d~~~~~~~ 423 (435) T protein:vir:80 358 EAGKESEI-YFTD--FGD-VFIGE-------EETLEIDYSKEATYKDADGHMVSAFQR---DQTLIRVIAKNDFGPRHVE 423 (435) T ss_pred CCCCcceE-EEEE--ccc-EEEEe-------ecceEEEEeccccccccccchhhhhhc---CcceeeeeeeeCcEeeccc Confidence 111 1111 000 00100 00000000000 0001111 1111 1122333322222222333 Q ss_pred cceeeeeecCCc Q lcl|NC_018856. 337 EAVTAAVAKKDN 348 (479) Q Consensus 337 ~~vt~Tv~~~g~ 348 (479) ..+..+-.+-|. T Consensus 424 a~~~l~~~~~~~ 435 (435) T protein:vir:80 424 SIAVLSGVAWGA 435 (435) T ss_pred ceEEEeccCCCC Confidence 333333222222 No 82 >protein:vir:100247 Length: 425 # NCBI annotation: gp76 # Family: family:all:21 # MgeID: mge:1619 # MgeName: Bcep176 # Cross-refs: genbank:acc:YP_355412;genbank:gi:77864702;genbank:GeneID:3725969 Probab=95.69 E-value=0.00052 Score=38.58 Aligned_cols=304 Identities=13% Similarity=0.077 Sum_probs=138.0 Q ss_pred CCc----cchhhhhhhhcCCccchHHHHHHH---------HHhhhcCCCcChhhccCccccchhhhhhhhhhheeccccc Q lcl|NC_018856. 1 MTE----LKKEAEAKNKKLPVEAEAELAELV---------SKSFTTGYGITPDTQLDGAAVRRELLEDQVKMLAFSSNDF 67 (479) Q Consensus 1 ~~~----~~~~~~~~~~~~~~~~~~~~~e~~---------~Ks~tag~~~~p~~~~~gaalr~esld~~~~~l~~~~~~f 67 (479) +.+ ..++...... .......+..+.| .++++.| +-.+|+.|-.+.+..+|..+.. ... T Consensus 88 ~~~~~~~~~~~~~~~~~-~~~~~~~~~~~af~~~l~~~e~~~al~~~------t~~~gG~lvP~~~~~~ii~~~~--~~s 158 (425) T protein:vir:10 88 VDEANIKIAAAQMGANG-VKPLRDPEYTEAFKAHVKRGDVQAALNKG------EDSEGGYLTPIEWDRTITNKLV--LIS 158 (425) T ss_pred HHHHHHHHHhhhccccc-ccccccHHHHHHHHHHhhhhhhHHHhhcC------cCCCCceeccHhHHHHHHHHHH--hhh Confidence 000 0000000000 0011111111111 2333333 3345777888888877754433 333 Q ss_pred cchhhccccchhHHHHhhhhhhccCccccccccccccc-ccccCcceEEEEEEEEeeeehhhhhhhHhhhcchhhHHHHH Q lcl|NC_018856. 68 TIYPLINKQQVNSTVAKYAVFNQHGRTGHSRFVREVGV-ASINDPNIRQKTVQMKFLSDTKQQSLAAGLVNNIADPMTIL 146 (479) Q Consensus 68 ~f~~~i~k~~~~stv~eY~~~~~~G~~g~~~fv~E~g~-~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lvn~~~Dp~~~~ 146 (479) .+++-....++.+.-..|.+.. + .....+++|++. ++...+.+.+.....+=++.-..+|.-+ +.++.-|.+... T Consensus 159 ~l~~l~~~~~~~~~~~~~~~~~--~-~~~a~wv~E~~~~~~~~~~~f~~v~~~~~k~~~~i~iS~el-l~ds~~~l~~~i 234 (425) T protein:vir:10 159 PMRQLCRVQPVSKAGFSKLFNM--G-GTTSGWVGEASQRPQTNAATFQPLSFASGEIYANPAATQQI-LDDAEIDLESWL 234 (425) T ss_pred hhhhhceeeeccCCceEEEEEc--C-CcceeeeccccccccccccccceeeeeheeeEeehHhHHHH-HhcchhHHHHHH Confidence 4555555556655444444432 2 224678999986 4566689999998888777766666643 234456778888 Q ss_pred HHHHHHHHHHHHHHHHhhcccccCCCCCCcccchhhhHHHhhccCCC-----------EEEccCCCCCHHHHhhhhhhhh Q lcl|NC_018856. 147 TEDAIAVIAKSIEWAIFYGDAALSSEADGQAGIEFDGLHKLIDQDTN-----------VIDLKGARLDEATLNKAAVIVG 215 (479) Q Consensus 147 ~~~ai~~~~~~iE~a~f~Gd~~l~~~~~~~~gleFDGl~~~I~~~~N-----------viDarG~~l~~~~l~~aa~~i~ 215 (479) .+.-...+++.++.++++||-.-. -.|+++.+....+ +.......++.+.|-++...+. T Consensus 235 ~~~la~ai~~~~d~~~l~G~G~~~----------p~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~l~~l~~~l~ 304 (425) T protein:vir:10 235 ATEVQTEFAKQEGKAFLAGDGTNK----------PNGLLTYIAGGANAAKHPFGAIEVVNSGAAADITSDGIIDLVYDLP 304 (425) T ss_pred HHHHHHHHHHHHHhhhhcccCCCC----------cceeeeccccccccccccccccccccccccccccHHHHHHHHhhhh Confidence 999999999999999999986422 3577666542211 1112223344444444333344 Q ss_pred hccCceEEEecChHHhhhHHHHhhCcceeeeccCCCcc----eeeeeh--hhhcCC---CcceecccceecCCCceeccc Q lcl|NC_018856. 216 KGYGRATDAFMPIGVQADFTNNLLDRQRVIQPSTAGGF----STGFSI--NQFLST---RGAINLHGSTIMENDNILLEG 286 (479) Q Consensus 216 ~~fG~~td~~mp~~vka~f~~~~~~~qrv~~~~n~g~~----~~G~~I--~~~~s~---~G~I~l~~s~~m~~~~~L~e~ 286 (479) ..|-...-..|+..+...+...-...-|.+...+.... -.|++| ...+.. .....+.||- .+.-++.+ T Consensus 305 ~~~~~~a~~vmn~~~~~~L~~lkD~~G~~l~~~~~~~g~~~~l~G~PV~~~~~~p~~~~~~~~i~~Gd~--~~~~~i~~- 381 (425) T protein:vir:10 305 SAFTGNARFAMNRNTQRQVRKLKDGQGNYLWQPSYVAGQPATLAGYPVTEVPDMPDVAANSTPILFGDF--QQTYLIID- 381 (425) T ss_pred hhhccCCEEEEchHHHHHHHHhhcCCCceeeccCccCCCCceecceeeEEecCcCCccCCccEEEEEeh--hccEEEEE- Confidence 45544445789998888877544444455543332221 223322 122111 1111122211 01001111 Q ss_pred CCcCCCCCCCceeEeeeeccCCCCCCCcccccceEEEEEEEcCcCccccccceeeeeecCC Q lcl|NC_018856. 287 RNPEPNAPQAPASVVASIVDDKKGGFRDEDIKTHSYKVVVHSDDAESLPSEAVTAAVAKKD 347 (479) Q Consensus 287 ~~~~~~AP~~pa~v~at~~t~~~G~f~~~d~gty~YkVtavn~~GES~pS~~vt~Tv~~~g 347 (479) ..+ ..+.. +. +. ..+...|+...-=+.+=-.|...+...+.+-. T Consensus 382 ------~~~--~~v~~----d~---~~--~~~~~~~~~~~r~d~~v~~~~A~~~l~~~as~ 425 (425) T protein:vir:10 382 ------RIG--VRVLR----DP---YT--AKPYVLFYTTKRVGGGLLNPEPMRAMKVAASE 425 (425) T ss_pred ------ecc--eEEEe----cc---cc--cCCcEEEEEEEEeccEeecccceEEEEeeccC Confidence 001 00110 00 00 01123333322111111112222222222211 No 83 >protein:vir:102119 Length: 404 # NCBI annotation: phage major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1641 # MgeName: phiSM101 # Cross-refs: genbank:acc:YP_699941;genbank:gi:110804052;genbank:GeneID:4206662 Probab=95.61 E-value=0.0017 Score=35.70 Aligned_cols=327 Identities=13% Similarity=0.064 Sum_probs=134.2 Q ss_pred CCccc----------hhhhh--hh-hcCCccch--------HHHHHHHHHhhh-cCCCcC--------hhhccCccccch Q lcl|NC_018856. 1 MTELK----------KEAEA--KN-KKLPVEAE--------AELAELVSKSFT-TGYGIT--------PDTQLDGAAVRR 50 (479) Q Consensus 1 ~~~~~----------~~~~~--~~-~~~~~~~~--------~~~~e~~~Ks~t-ag~~~~--------p~~~~~gaalr~ 50 (479) +.+.+ +..+. +. ........ ...-+.+.|... .+.... ..+.++|+.+-+ T Consensus 44 ~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~a~~~~~~~~gg~~vP 123 (404) T protein:vir:10 44 QAKIEAQKRKENIENNFNEDNVKSLNTGKEENVIYNGALFVRAIADNLLKQKNQRGLNLSEKEINAISENIDEDGGYAVP 123 (404) T ss_pred HHHHHHHHHHHHHHHHHhhhhccccccccchhhHHHHHHHHHHHHHHHHHHHHhhhhcchhhHHhhhccccCCCCceeec Confidence 00000 00000 00 00000000 011111111111 111111 112356777888 Q ss_pred hhhhhhhhhheeccccccchhhccccchhHHHHhhhhhhccCcccccccccccccccc--cCcceEEEEEEEEeeeehhh Q lcl|NC_018856. 51 ELLEDQVKMLAFSSNDFTIYPLINKQQVNSTVAKYAVFNQHGRTGHSRFVREVGVASI--NDPNIRQKTVQMKFLSDTKQ 128 (479) Q Consensus 51 esld~~~~~l~~~~~~f~f~~~i~k~~~~stv~eY~~~~~~G~~g~~~fv~E~g~~~~--~d~~~~r~~~~~k~l~~~~~ 128 (479) +.+..+|-.+... ...+++.+.+.++......|... .+.+.....++.|++..+. .++.+.+.....+=++.-.. T Consensus 124 ~~~~~~ii~~~~~--~~~l~~l~~~~~~~~~~g~~~~~-~~~~~~~~~~v~e~~~~~~~~~~~~f~~i~~~~~k~~~~~~ 200 (404) T protein:vir:10 124 EDIQTKINTRLKD--TTDLYNMVDYEPVFTRSGSRTYE-KRSKQKPMKPLSENQQIPTNGDNGKLERFNFKLKDLADFMS 200 (404) T ss_pred hhHHHHHHHHHhh--hhhHhhhhceeeccCCccceEEE-EecCCcceeeccccccccccccccceeeeEeeheeeEeeeh Confidence 8887777544433 33456666666555433333222 2333445678999988655 36889999999988888888 Q ss_pred hhhhHhhhcchhhHHHHHHHHHHHHHHHHHHHHHhhcccccCCCCCCcccchhhhHHHhhccCCCEEEccCCCCCHHHHh Q lcl|NC_018856. 129 QSLAAGLVNNIADPMTILTEDAIAVIAKSIEWAIFYGDAALSSEADGQAGIEFDGLHKLIDQDTNVIDLKGARLDEATLN 208 (479) Q Consensus 129 vs~~~~lvn~~~Dp~~~~~~~ai~~~~~~iE~a~f~Gd~~l~~~~~~~~gleFDGl~~~I~~~~NviDarG~~l~~~~l~ 208 (479) +|.-+ +.++..+.+....+.-...+...+|.++++|+-. |-.+.|+...-. .+.+-..+. ...+.|. T Consensus 201 iS~el-l~ds~~~l~~~i~~~la~~~~~~~~~~il~G~g~---------~~~~~gi~~~~~--~~~~~~~~~-~~~~~~~ 267 (404) T protein:vir:10 201 IPNDL-LKFADKSLEDWIINWFVDKVRITRNAEILYGAGG---------DEHATGIMTANK--FKKITLPKS-PALKDFK 267 (404) T ss_pred hhHHH-HhhcHHHHHHHHHHHHHHHHHHHHHHHHhhcCCC---------CCcccceeeccc--cceeecccc-ccHHHHH Confidence 88743 3455567788888889999999999999999754 123456554332 233333333 4455555 Q ss_pred hhh-hhhhhccCceEEEecChHHhhhHHHHhhCcceeeeccCCCcceeeeehhhhcCCCcceecccceecCCCceecccC Q lcl|NC_018856. 209 KAA-VIVGKGYGRATDAFMPIGVQADFTNNLLDRQRVIQPSTAGGFSTGFSINQFLSTRGAINLHGSTIMENDNILLEGR 287 (479) Q Consensus 209 ~aa-~~i~~~fG~~td~~mp~~vka~f~~~~~~~qrv~~~~n~g~~~~G~~I~~~~s~~G~I~l~~s~~m~~~~~L~e~~ 287 (479) .+- ..+..+|...--++|++.+.+.+...=...-|++...+.++. .+.+++..+-...+.. T Consensus 268 ~~~~~~l~~~~~~~~~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~~------------------~~~~l~G~PV~~~~~~ 329 (404) T protein:vir:10 268 KCKNVELLNVFKATSSWIVNQDGFNYLDSLEDKTGRPYLQPDPKDP------------------TQYRFLGLPVIELPND 329 (404) T ss_pred HHHHhhhhccccCCCEEEEcHHHHHHHHHhhccCCceeeccCcCCC------------------CCccccceeeEEeccc Confidence 433 334445544334688988887776532223344433222111 1122222221111111 Q ss_pred CcCCCCCCCceeEeeeeccCCCCCCCcccccceEEEEEEEcCcCccc-cccceeeeeecCCceEEEEEEecCCCCcccce Q lcl|NC_018856. 288 NPEPNAPQAPASVVASIVDDKKGGFRDEDIKTHSYKVVVHSDDAESL-PSEAVTAAVAKKDNTVKLEVKLASLYQAQPQF 366 (479) Q Consensus 288 ~~~~~AP~~pa~v~at~~t~~~G~f~~~d~gty~YkVtavn~~GES~-pS~~vt~Tv~~~g~sv~ltIT~~~~~~a~~~~ 366 (479) ++.. +.+. ..--.|.++.-+....+.|-+. .+... ......+.+.+.+..- .. T Consensus 330 ~~~~----------------~~~~-~~~~~gd~s~~~~~~~~~~~~i~~~~~~--~~~~~~~~~~~~~~~r-~d------ 383 (404) T protein:vir:10 330 LLLS----------------TESA-IPVLLGDTKEAYKYVSDGAYELATTNIG--AGAFETNTTKARIIMR-ID------ 383 (404) T ss_pred ccCC----------------CCCc-cEEEEEeccccEEEEEecceEEEEeccc--cchhhcCceEEEEEEe-ec------ Confidence 1100 0000 0001122221111111111000 00000 0000000001111000 00 Q ss_pred EEEEEecCCCcceEEEEeeeeeeecCCc Q lcl|NC_018856. 367 ISVYREGTETGHYFLIARVPVSKVNDQG 394 (479) Q Consensus 367 y~IYR~~~~~G~y~li~rv~vs~~n~~g 394 (479) +.|-|. .- +...+++.+.. -+ T Consensus 384 ~~v~~~----~a-~~~~~~~~aa~--~~ 404 (404) T protein:vir:10 384 GNVKDS----EA-LLIAEIPVESV--QA 404 (404) T ss_pred cEEecc----cc-eEEEEeecccC--CC Confidence 011110 01 11111111110 01 No 84 >protein:vir:105004 Length: 392 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:1490 # MgeName: W Beta # Cross-refs: genbank:acc:YP_459969;genbank:gi:85701384;genbank:GeneID:3882145 Probab=95.40 E-value=0.0007 Score=37.87 Aligned_cols=298 Identities=13% Similarity=0.078 Sum_probs=127.2 Q ss_pred CCcc---c---------hhhh-------hhhhcCCccchHHHHHHHHHhhhcCCC----------------cChhhccCc Q lcl|NC_018856. 1 MTEL---K---------KEAE-------AKNKKLPVEAEAELAELVSKSFTTGYG----------------ITPDTQLDG 45 (479) Q Consensus 1 ~~~~---~---------~~~~-------~~~~~~~~~~~~~~~e~~~Ks~tag~~----------------~~p~~~~~g 45 (479) +.|. + .+.+ .+..+.+.....+..+.+.|.+.-+.. ..-.+-++| T Consensus 35 ~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~t~~~g 114 (392) T protein:vir:10 35 MEEVRSLQKKIDLQRSLDEAETEERNNGREVETRNVDGEMEYRDVFMKALRNKPLNAEEREFLEDDLEQRAMSGLTGEDG 114 (392) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhccccccccCccchHHHHHHHHHHHhcccccHHHHHHHhhhhhhhhccccccCCC Confidence 0000 0 0000 000011111222222333333321111 111122457 Q ss_pred cccchhhhhhhhhhheeccccccchhhccccchhHHHHhhhhhhccCccccccccccccccc-ccCcceEEEEEEEEeee Q lcl|NC_018856. 46 AAVRRELLEDQVKMLAFSSNDFTIYPLINKQQVNSTVAKYAVFNQHGRTGHSRFVREVGVAS-INDPNIRQKTVQMKFLS 124 (479) Q Consensus 46 aalr~esld~~~~~l~~~~~~f~f~~~i~k~~~~stv~eY~~~~~~G~~g~~~fv~E~g~~~-~~d~~~~r~~~~~k~l~ 124 (479) +.|-.+.+..+|..+... .-.+++.+...++.+.-.+|......+ .....+++|++... ...+.+.......+-++ T Consensus 115 g~~vP~~~~~~ii~~~~~--~s~l~~~~~~~~~~~~~~~~~~~~~~~-~~~a~~v~E~~~~~~~~~~~~~~v~l~~~k~~ 191 (392) T protein:vir:10 115 GLVIPQDIQTQINELARS--FDALEQYVTVEPVRTRSGSRVLEKNSD-MIPFAEITEMGEIPETDNPKFSNVQYAVKDRA 191 (392) T ss_pred ceecchhHHHHHHHHHHh--hhhhhhhceeeeccCCceeEEEEeecC-CccceeecccccccccccccceeEEeeeeeEE Confidence 777788887777544333 234555566566655444444433222 23466899998876 56799999999999998 Q ss_pred ehhhhhhhHhhhcchhhHHHHHHHHHHHHHHHHHHHHHhhcccccCCCCCCcccchhhhHHHhhc-cCCCEEEccCC-CC Q lcl|NC_018856. 125 DTKQQSLAAGLVNNIADPMTILTEDAIAVIAKSIEWAIFYGDAALSSEADGQAGIEFDGLHKLID-QDTNVIDLKGA-RL 202 (479) Q Consensus 125 ~~~~vs~~~~lvn~~~Dp~~~~~~~ai~~~~~~iE~a~f~Gd~~l~~~~~~~~gleFDGl~~~I~-~~~NviDarG~-~l 202 (479) --..+|.-+ +.++.-|.+....+.--..+++.++.+++.|+....+. ..+-+|.+...|. ...+.+...+. .+ T Consensus 192 ~~~~iS~el-l~ds~~~l~~~i~~~l~~~i~~~~d~~~~~g~g~~~~~----~~~~~d~i~~~~~~~l~~~~~~~a~~vm 266 (392) T protein:vir:10 192 GILPLSRSL-LQDSDQNILKYVTKWLGKKSKVTRNVLILGVIEKLTKQ----AIKSLDDIKDVLNVKLDPAISPNAILLT 266 (392) T ss_pred EeehhhHHH-HhhhHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccc----CccCHHHHHHHHHHhhhhhhccCCEEEE Confidence 888888754 23455677888888888999999999999999886553 2355777777663 11122211111 12 Q ss_pred CHHHHhhhhhhhhhccCceEEEecChHHhhhHHHHhhCcceeeec-----cC--CCcceeeeehhhhc-----CCCccee Q lcl|NC_018856. 203 DEATLNKAAVIVGKGYGRATDAFMPIGVQADFTNNLLDRQRVIQP-----ST--AGGFSTGFSINQFL-----STRGAIN 270 (479) Q Consensus 203 ~~~~l~~aa~~i~~~fG~~td~~mp~~vka~f~~~~~~~qrv~~~-----~n--~g~~~~G~~I~~~~-----s~~G~I~ 270 (479) +...+.....+ ...-|. .+|.|... ......+++.--|+.. .+ .......+.+.+|. ..++.+. T Consensus 267 ~~~~~~~L~~l-kd~~G~--~l~~~~~~-~~~~~tllG~~~v~~~~~~~~~~~~~~~~~~~~~~gdfs~~~~i~~~~~~~ 342 (392) T protein:vir:10 267 NQDGFNYLDKL-KDKDGK--YILQSDPT-QKNKKLFAGTNPVVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKREDME 342 (392) T ss_pred cHHHHHHHHHh-hccCCC--eEeecCcc-CCccccccCcccEEEecccccCCCcccCCceEEEEEehhceEEEEeecceE Confidence 33332222221 111121 23333211 1111112221111100 00 00011111111111 1123333 Q ss_pred cccceec----CCCce--ecc----cCCcCCCCCCCceeEeeeeccCCCC Q lcl|NC_018856. 271 LHGSTIM----ENDNI--LLE----GRNPEPNAPQAPASVVASIVDDKKG 310 (479) Q Consensus 271 l~~s~~m----~~~~~--L~e----~~~~~~~AP~~pa~v~at~~t~~~G 310 (479) +.-+... .++.+ ..+ +.+..|+|-..-....++++..+-| T Consensus 343 ~~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~~a~~~~~~~ 392 (392) T protein:vir:10 343 LASTDVGGKAFTRNTLDLRAIQRDDVQMWDNEAAVYGEIDLSAPVEQPQG 392 (392) T ss_pred EEEeccccchhhcCceEEEEEEeeccEEecccceEEEEecccccccCCCC Confidence 3222111 11111 111 1122222221111111111111122 No 85 >protein:vir:102082 Length: 392 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:1503 # MgeName: Fah # Cross-refs: genbank:acc:YP_512315;genbank:gi:89152484;genbank:GeneID:3953075 Probab=95.40 E-value=0.0007 Score=37.87 Aligned_cols=298 Identities=13% Similarity=0.078 Sum_probs=127.2 Q ss_pred CCcc---c---------hhhh-------hhhhcCCccchHHHHHHHHHhhhcCCC----------------cChhhccCc Q lcl|NC_018856. 1 MTEL---K---------KEAE-------AKNKKLPVEAEAELAELVSKSFTTGYG----------------ITPDTQLDG 45 (479) Q Consensus 1 ~~~~---~---------~~~~-------~~~~~~~~~~~~~~~e~~~Ks~tag~~----------------~~p~~~~~g 45 (479) +.|. + .+.+ .+..+.+.....+..+.+.|.+.-+.. ..-.+-++| T Consensus 35 ~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~t~~~g 114 (392) T protein:vir:10 35 MEEVRSLQKKIDLQRSLDEAETEERNNGREVETRNVDGEMEYRDVFMKALRNKPLNAEEREFLEDDLEQRAMSGLTGEDG 114 (392) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhccccccccCccchHHHHHHHHHHHhcccccHHHHHHHhhhhhhhhccccccCCC Confidence 0000 0 0000 000011111222222333333321111 111122457 Q ss_pred cccchhhhhhhhhhheeccccccchhhccccchhHHHHhhhhhhccCccccccccccccccc-ccCcceEEEEEEEEeee Q lcl|NC_018856. 46 AAVRRELLEDQVKMLAFSSNDFTIYPLINKQQVNSTVAKYAVFNQHGRTGHSRFVREVGVAS-INDPNIRQKTVQMKFLS 124 (479) Q Consensus 46 aalr~esld~~~~~l~~~~~~f~f~~~i~k~~~~stv~eY~~~~~~G~~g~~~fv~E~g~~~-~~d~~~~r~~~~~k~l~ 124 (479) +.|-.+.+..+|..+... .-.+++.+...++.+.-.+|......+ .....+++|++... ...+.+.......+-++ T Consensus 115 g~~vP~~~~~~ii~~~~~--~s~l~~~~~~~~~~~~~~~~~~~~~~~-~~~a~~v~E~~~~~~~~~~~~~~v~l~~~k~~ 191 (392) T protein:vir:10 115 GLVIPQDIQTQINELARS--FDALEQYVTVEPVRTRSGSRVLEKNSD-MIPFAEITEMGEIPETDNPKFSNVQYAVKDRA 191 (392) T ss_pred ceecchhHHHHHHHHHHh--hhhhhhhceeeeccCCceeEEEEeecC-CccceeecccccccccccccceeEEeeeeeEE Confidence 777788887777544333 234555566566655444444433222 23466899998876 56799999999999998 Q ss_pred ehhhhhhhHhhhcchhhHHHHHHHHHHHHHHHHHHHHHhhcccccCCCCCCcccchhhhHHHhhc-cCCCEEEccCC-CC Q lcl|NC_018856. 125 DTKQQSLAAGLVNNIADPMTILTEDAIAVIAKSIEWAIFYGDAALSSEADGQAGIEFDGLHKLID-QDTNVIDLKGA-RL 202 (479) Q Consensus 125 ~~~~vs~~~~lvn~~~Dp~~~~~~~ai~~~~~~iE~a~f~Gd~~l~~~~~~~~gleFDGl~~~I~-~~~NviDarG~-~l 202 (479) --..+|.-+ +.++.-|.+....+.--..+++.++.+++.|+....+. ..+-+|.+...|. ...+.+...+. .+ T Consensus 192 ~~~~iS~el-l~ds~~~l~~~i~~~l~~~i~~~~d~~~~~g~g~~~~~----~~~~~d~i~~~~~~~l~~~~~~~a~~vm 266 (392) T protein:vir:10 192 GILPLSRSL-LQDSDQNILKYVTKWLGKKSKVTRNVLILGVIEKLTKQ----AIKSLDDIKDVLNVKLDPAISPNAILLT 266 (392) T ss_pred EeehhhHHH-HhhhHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccc----CccCHHHHHHHHHHhhhhhhccCCEEEE Confidence 888888754 23455677888888888999999999999999886553 2355777777663 11122211111 12 Q ss_pred CHHHHhhhhhhhhhccCceEEEecChHHhhhHHHHhhCcceeeec-----cC--CCcceeeeehhhhc-----CCCccee Q lcl|NC_018856. 203 DEATLNKAAVIVGKGYGRATDAFMPIGVQADFTNNLLDRQRVIQP-----ST--AGGFSTGFSINQFL-----STRGAIN 270 (479) Q Consensus 203 ~~~~l~~aa~~i~~~fG~~td~~mp~~vka~f~~~~~~~qrv~~~-----~n--~g~~~~G~~I~~~~-----s~~G~I~ 270 (479) +...+.....+ ...-|. .+|.|... ......+++.--|+.. .+ .......+.+.+|. ..++.+. T Consensus 267 ~~~~~~~L~~l-kd~~G~--~l~~~~~~-~~~~~tllG~~~v~~~~~~~~~~~~~~~~~~~~~~gdfs~~~~i~~~~~~~ 342 (392) T protein:vir:10 267 NQDGFNYLDKL-KDKDGK--YILQSDPT-QKNKKLFAGTNPVVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKREDME 342 (392) T ss_pred cHHHHHHHHHh-hccCCC--eEeecCcc-CCccccccCcccEEEecccccCCCcccCCceEEEEEehhceEEEEeecceE Confidence 33332222221 111121 23333211 1111112221111100 00 00011111111111 1123333 Q ss_pred cccceec----CCCce--ecc----cCCcCCCCCCCceeEeeeeccCCCC Q lcl|NC_018856. 271 LHGSTIM----ENDNI--LLE----GRNPEPNAPQAPASVVASIVDDKKG 310 (479) Q Consensus 271 l~~s~~m----~~~~~--L~e----~~~~~~~AP~~pa~v~at~~t~~~G 310 (479) +.-+... .++.+ ..+ +.+..|+|-..-....++++..+-| T Consensus 343 ~~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~~a~~~~~~~ 392 (392) T protein:vir:10 343 LASTDVGGKAFTRNTLDLRAIQRDDVQMWDNEAAVYGEIDLSAPVEQPQG 392 (392) T ss_pred EEEeccccchhhcCceEEEEEEeeccEEecccceEEEEecccccccCCCC Confidence 3222111 11111 111 1122222221111111111111122 No 86 >protein:vir:102873 Length: 392 # NCBI annotation: major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1492 # MgeName: Cherry # Cross-refs: genbank:acc:YP_338137;genbank:gi:77020198;genbank:GeneID:3703782 Probab=95.40 E-value=0.0007 Score=37.87 Aligned_cols=298 Identities=13% Similarity=0.078 Sum_probs=127.2 Q ss_pred CCcc---c---------hhhh-------hhhhcCCccchHHHHHHHHHhhhcCCC----------------cChhhccCc Q lcl|NC_018856. 1 MTEL---K---------KEAE-------AKNKKLPVEAEAELAELVSKSFTTGYG----------------ITPDTQLDG 45 (479) Q Consensus 1 ~~~~---~---------~~~~-------~~~~~~~~~~~~~~~e~~~Ks~tag~~----------------~~p~~~~~g 45 (479) +.|. + .+.+ .+..+.+.....+..+.+.|.+.-+.. ..-.+-++| T Consensus 35 ~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~t~~~g 114 (392) T protein:vir:10 35 MEEVRSLQKKIDLQRSLDEAETEERNNGREVETRNVDGEMEYRDVFMKALRNKPLNAEEREFLEDDLEQRAMSGLTGEDG 114 (392) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhccccccccCccchHHHHHHHHHHHhcccccHHHHHHHhhhhhhhhccccccCCC Confidence 0000 0 0000 000011111222222333333321111 111122457 Q ss_pred cccchhhhhhhhhhheeccccccchhhccccchhHHHHhhhhhhccCccccccccccccccc-ccCcceEEEEEEEEeee Q lcl|NC_018856. 46 AAVRRELLEDQVKMLAFSSNDFTIYPLINKQQVNSTVAKYAVFNQHGRTGHSRFVREVGVAS-INDPNIRQKTVQMKFLS 124 (479) Q Consensus 46 aalr~esld~~~~~l~~~~~~f~f~~~i~k~~~~stv~eY~~~~~~G~~g~~~fv~E~g~~~-~~d~~~~r~~~~~k~l~ 124 (479) +.|-.+.+..+|..+... .-.+++.+...++.+.-.+|......+ .....+++|++... ...+.+.......+-++ T Consensus 115 g~~vP~~~~~~ii~~~~~--~s~l~~~~~~~~~~~~~~~~~~~~~~~-~~~a~~v~E~~~~~~~~~~~~~~v~l~~~k~~ 191 (392) T protein:vir:10 115 GLVIPQDIQTQINELARS--FDALEQYVTVEPVRTRSGSRVLEKNSD-MIPFAEITEMGEIPETDNPKFSNVQYAVKDRA 191 (392) T ss_pred ceecchhHHHHHHHHHHh--hhhhhhhceeeeccCCceeEEEEeecC-CccceeecccccccccccccceeEEeeeeeEE Confidence 777788887777544333 234555566566655444444433222 23466899998876 56799999999999998 Q ss_pred ehhhhhhhHhhhcchhhHHHHHHHHHHHHHHHHHHHHHhhcccccCCCCCCcccchhhhHHHhhc-cCCCEEEccCC-CC Q lcl|NC_018856. 125 DTKQQSLAAGLVNNIADPMTILTEDAIAVIAKSIEWAIFYGDAALSSEADGQAGIEFDGLHKLID-QDTNVIDLKGA-RL 202 (479) Q Consensus 125 ~~~~vs~~~~lvn~~~Dp~~~~~~~ai~~~~~~iE~a~f~Gd~~l~~~~~~~~gleFDGl~~~I~-~~~NviDarG~-~l 202 (479) --..+|.-+ +.++.-|.+....+.--..+++.++.+++.|+....+. ..+-+|.+...|. ...+.+...+. .+ T Consensus 192 ~~~~iS~el-l~ds~~~l~~~i~~~l~~~i~~~~d~~~~~g~g~~~~~----~~~~~d~i~~~~~~~l~~~~~~~a~~vm 266 (392) T protein:vir:10 192 GILPLSRSL-LQDSDQNILKYVTKWLGKKSKVTRNVLILGVIEKLTKQ----AIKSLDDIKDVLNVKLDPAISPNAILLT 266 (392) T ss_pred EeehhhHHH-HhhhHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccc----CccCHHHHHHHHHHhhhhhhccCCEEEE Confidence 888888754 23455677888888888999999999999999886553 2355777777663 11122211111 12 Q ss_pred CHHHHhhhhhhhhhccCceEEEecChHHhhhHHHHhhCcceeeec-----cC--CCcceeeeehhhhc-----CCCccee Q lcl|NC_018856. 203 DEATLNKAAVIVGKGYGRATDAFMPIGVQADFTNNLLDRQRVIQP-----ST--AGGFSTGFSINQFL-----STRGAIN 270 (479) Q Consensus 203 ~~~~l~~aa~~i~~~fG~~td~~mp~~vka~f~~~~~~~qrv~~~-----~n--~g~~~~G~~I~~~~-----s~~G~I~ 270 (479) +...+.....+ ...-|. .+|.|... ......+++.--|+.. .+ .......+.+.+|. ..++.+. T Consensus 267 ~~~~~~~L~~l-kd~~G~--~l~~~~~~-~~~~~tllG~~~v~~~~~~~~~~~~~~~~~~~~~~gdfs~~~~i~~~~~~~ 342 (392) T protein:vir:10 267 NQDGFNYLDKL-KDKDGK--YILQSDPT-QKNKKLFAGTNPVVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKREDME 342 (392) T ss_pred cHHHHHHHHHh-hccCCC--eEeecCcc-CCccccccCcccEEEecccccCCCcccCCceEEEEEehhceEEEEeecceE Confidence 33332222221 111121 23333211 1111112221111100 00 00011111111111 1123333 Q ss_pred cccceec----CCCce--ecc----cCCcCCCCCCCceeEeeeeccCCCC Q lcl|NC_018856. 271 LHGSTIM----ENDNI--LLE----GRNPEPNAPQAPASVVASIVDDKKG 310 (479) Q Consensus 271 l~~s~~m----~~~~~--L~e----~~~~~~~AP~~pa~v~at~~t~~~G 310 (479) +.-+... .++.+ ..+ +.+..|+|-..-....++++..+-| T Consensus 343 ~~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~~a~~~~~~~ 392 (392) T protein:vir:10 343 LASTDVGGKAFTRNTLDLRAIQRDDVQMWDNEAAVYGEIDLSAPVEQPQG 392 (392) T ss_pred EEEeccccchhhcCceEEEEEEeeccEEecccceEEEEecccccccCCCC Confidence 3222111 11111 111 1122222221111111111111122 No 87 >protein:vir:107593 Length: 392 # NCBI annotation: major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1491 # MgeName: Gamma # Cross-refs: genbank:acc:YP_338188;genbank:gi:77020144;genbank:GeneID:3703724 Probab=95.40 E-value=0.0007 Score=37.87 Aligned_cols=298 Identities=13% Similarity=0.078 Sum_probs=127.2 Q ss_pred CCcc---c---------hhhh-------hhhhcCCccchHHHHHHHHHhhhcCCC----------------cChhhccCc Q lcl|NC_018856. 1 MTEL---K---------KEAE-------AKNKKLPVEAEAELAELVSKSFTTGYG----------------ITPDTQLDG 45 (479) Q Consensus 1 ~~~~---~---------~~~~-------~~~~~~~~~~~~~~~e~~~Ks~tag~~----------------~~p~~~~~g 45 (479) +.|. + .+.+ .+..+.+.....+..+.+.|.+.-+.. ..-.+-++| T Consensus 35 ~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~t~~~g 114 (392) T protein:vir:10 35 MEEVRSLQKKIDLQRSLDEAETEERNNGREVETRNVDGEMEYRDVFMKALRNKPLNAEEREFLEDDLEQRAMSGLTGEDG 114 (392) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhccccccccCccchHHHHHHHHHHHhcccccHHHHHHHhhhhhhhhccccccCCC Confidence 0000 0 0000 000011111222222333333321111 111122457 Q ss_pred cccchhhhhhhhhhheeccccccchhhccccchhHHHHhhhhhhccCccccccccccccccc-ccCcceEEEEEEEEeee Q lcl|NC_018856. 46 AAVRRELLEDQVKMLAFSSNDFTIYPLINKQQVNSTVAKYAVFNQHGRTGHSRFVREVGVAS-INDPNIRQKTVQMKFLS 124 (479) Q Consensus 46 aalr~esld~~~~~l~~~~~~f~f~~~i~k~~~~stv~eY~~~~~~G~~g~~~fv~E~g~~~-~~d~~~~r~~~~~k~l~ 124 (479) +.|-.+.+..+|..+... .-.+++.+...++.+.-.+|......+ .....+++|++... ...+.+.......+-++ T Consensus 115 g~~vP~~~~~~ii~~~~~--~s~l~~~~~~~~~~~~~~~~~~~~~~~-~~~a~~v~E~~~~~~~~~~~~~~v~l~~~k~~ 191 (392) T protein:vir:10 115 GLVIPQDIQTQINELARS--FDALEQYVTVEPVRTRSGSRVLEKNSD-MIPFAEITEMGEIPETDNPKFSNVQYAVKDRA 191 (392) T ss_pred ceecchhHHHHHHHHHHh--hhhhhhhceeeeccCCceeEEEEeecC-CccceeecccccccccccccceeEEeeeeeEE Confidence 777788887777544333 234555566566655444444433222 23466899998876 56799999999999998 Q ss_pred ehhhhhhhHhhhcchhhHHHHHHHHHHHHHHHHHHHHHhhcccccCCCCCCcccchhhhHHHhhc-cCCCEEEccCC-CC Q lcl|NC_018856. 125 DTKQQSLAAGLVNNIADPMTILTEDAIAVIAKSIEWAIFYGDAALSSEADGQAGIEFDGLHKLID-QDTNVIDLKGA-RL 202 (479) Q Consensus 125 ~~~~vs~~~~lvn~~~Dp~~~~~~~ai~~~~~~iE~a~f~Gd~~l~~~~~~~~gleFDGl~~~I~-~~~NviDarG~-~l 202 (479) --..+|.-+ +.++.-|.+....+.--..+++.++.+++.|+....+. ..+-+|.+...|. ...+.+...+. .+ T Consensus 192 ~~~~iS~el-l~ds~~~l~~~i~~~l~~~i~~~~d~~~~~g~g~~~~~----~~~~~d~i~~~~~~~l~~~~~~~a~~vm 266 (392) T protein:vir:10 192 GILPLSRSL-LQDSDQNILKYVTKWLGKKSKVTRNVLILGVIEKLTKQ----AIKSLDDIKDVLNVKLDPAISPNAILLT 266 (392) T ss_pred EeehhhHHH-HhhhHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccc----CccCHHHHHHHHHHhhhhhhccCCEEEE Confidence 888888754 23455677888888888999999999999999886553 2355777777663 11122211111 12 Q ss_pred CHHHHhhhhhhhhhccCceEEEecChHHhhhHHHHhhCcceeeec-----cC--CCcceeeeehhhhc-----CCCccee Q lcl|NC_018856. 203 DEATLNKAAVIVGKGYGRATDAFMPIGVQADFTNNLLDRQRVIQP-----ST--AGGFSTGFSINQFL-----STRGAIN 270 (479) Q Consensus 203 ~~~~l~~aa~~i~~~fG~~td~~mp~~vka~f~~~~~~~qrv~~~-----~n--~g~~~~G~~I~~~~-----s~~G~I~ 270 (479) +...+.....+ ...-|. .+|.|... ......+++.--|+.. .+ .......+.+.+|. ..++.+. T Consensus 267 ~~~~~~~L~~l-kd~~G~--~l~~~~~~-~~~~~tllG~~~v~~~~~~~~~~~~~~~~~~~~~~gdfs~~~~i~~~~~~~ 342 (392) T protein:vir:10 267 NQDGFNYLDKL-KDKDGK--YILQSDPT-QKNKKLFAGTNPVVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKREDME 342 (392) T ss_pred cHHHHHHHHHh-hccCCC--eEeecCcc-CCccccccCcccEEEecccccCCCcccCCceEEEEEehhceEEEEeecceE Confidence 33332222221 111121 23333211 1111112221111100 00 00011111111111 1123333 Q ss_pred cccceec----CCCce--ecc----cCCcCCCCCCCceeEeeeeccCCCC Q lcl|NC_018856. 271 LHGSTIM----ENDNI--LLE----GRNPEPNAPQAPASVVASIVDDKKG 310 (479) Q Consensus 271 l~~s~~m----~~~~~--L~e----~~~~~~~AP~~pa~v~at~~t~~~G 310 (479) +.-+... .++.+ ..+ +.+..|+|-..-....++++..+-| T Consensus 343 ~~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~~a~~~~~~~ 392 (392) T protein:vir:10 343 LASTDVGGKAFTRNTLDLRAIQRDDVQMWDNEAAVYGEIDLSAPVEQPQG 392 (392) T ss_pred EEEeccccchhhcCceEEEEEEeeccEEecccceEEEEecccccccCCCC Confidence 3222111 11111 111 1122222221111111111111122 No 88 >protein:vir:485 Length: 407 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:11 # MgeName: P27 # Cross-refs: genbank:acc:NP_543092;swissprot:trembl:q8w627;genbank:gi:18249904;uniprot:Q8W627;genbank:GeneID:929693 Probab=95.39 E-value=0.0017 Score=35.80 Aligned_cols=313 Identities=13% Similarity=0.080 Sum_probs=138.8 Q ss_pred CCccchhhhhhhh--cC-----CccchHHHHHHHHHhhhcCCCcC----------hhhccCccccchhhhhhhhhhheec Q lcl|NC_018856. 1 MTELKKEAEAKNK--KL-----PVEAEAELAELVSKSFTTGYGIT----------PDTQLDGAAVRRELLEDQVKMLAFS 63 (479) Q Consensus 1 ~~~~~~~~~~~~~--~~-----~~~~~~~~~e~~~Ks~tag~~~~----------p~~~~~gaalr~esld~~~~~l~~~ 63 (479) +.+.+++.+.++. .. ......+..+.|.+.+-.|.... -.+..+|+.|-+|.+.++|..+... T Consensus 53 ~e~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~a~~~~l~~g~~~~~~~~e~~a~~~~t~~~gG~~iP~~~~~~I~~~~~~ 132 (407) T protein:vir:48 53 LENLKSDLEAELAEVKRPAGGTQNKVASEHKEAFIGFMRKGREDGLRELERKALQVGNDEDGGYAIPEELDRTILTLLKD 132 (407) T ss_pred HHHHHHHHHHHHHHhhccccccccchhhHHHHHHHHHHhccchhhhhHHHHHhhhcccCCCCcccccHhHHHHHHHHHHh Confidence 0000000000000 00 00001111122222222221100 0123457788899988888655533 Q ss_pred cccccchhhccccchhHHHHhhhhhhccCcccccccccccccc-cccCcceEEEEEEEEeeeehhhhhhhHhhhcchhhH Q lcl|NC_018856. 64 SNDFTIYPLINKQQVNSTVAKYAVFNQHGRTGHSRFVREVGVA-SINDPNIRQKTVQMKFLSDTKQQSLAAGLVNNIADP 142 (479) Q Consensus 64 ~~~f~f~~~i~k~~~~stv~eY~~~~~~G~~g~~~fv~E~g~~-~~~d~~~~r~~~~~k~l~~~~~vs~~~~lvn~~~Dp 142 (479) ...+++.....++.+--..|.+ ..++ ....+++|++.. +..++.+......++=++.-..+|.-+ +.++..|. T Consensus 133 --~~~l~~~~~~~~~~~~~~~~~~--~~~~-~~a~~v~E~~~~~~~~~~~f~~i~~~~~k~~~~~~iS~el-l~ds~~~l 206 (407) T protein:vir:48 133 --EVVMRQEATVITLGGSDYKKLV--NLGG-TTSGWVGETDARPETATSKLGLIEPFMGEIYGNPQATQKM-LDDAFFNV 206 (407) T ss_pred --hhhhhhhceeeecCCCceEEEE--ecCC-cceeeecccccccccccccceeEEeeeeeeEeehhhHHHH-HhcchHHH Confidence 2345555554555543333333 2232 246689999975 567899999999998777777777653 23556678 Q ss_pred HHHHHHHHHHHHHHHHHHHHhhcccccCCCCCCcccchhhhHHHhhccC-CC----------EEEccCCCCCHHHHhhhh Q lcl|NC_018856. 143 MTILTEDAIAVIAKSIEWAIFYGDAALSSEADGQAGIEFDGLHKLIDQD-TN----------VIDLKGARLDEATLNKAA 211 (479) Q Consensus 143 ~~~~~~~ai~~~~~~iE~a~f~Gd~~l~~~~~~~~gleFDGl~~~I~~~-~N----------viDarG~~l~~~~l~~aa 211 (479) +....+.-...+...+|.++++||-.= +..|+++..... .+ +.-..-..++.+.|.++. T Consensus 207 ~~~i~~~l~~~i~~~~~~a~l~G~G~~----------~p~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~i~~l~ 276 (407) T protein:vir:48 207 EDWINSELALEFAEQEEIAFTSGDGSK----------KPKGFLAYESTDEDDKTRAFGKLQHIASGAASGVTADAIIKLI 276 (407) T ss_pred HHHHHHHHHHHHHHHHHhhhhccCCCC----------ccceeeecccccccccccccccccccccccccccChHHHHHHH Confidence 888888888889999999999998651 236766544311 01 111112335555555444 Q ss_pred hhhhhccCceEEEecChHHhhhHHHHhhCcceeeeccCCCc----ceeeeehh--hhc---CCCcceecccceecCCCce Q lcl|NC_018856. 212 VIVGKGYGRATDAFMPIGVQADFTNNLLDRQRVIQPSTAGG----FSTGFSIN--QFL---STRGAINLHGSTIMENDNI 282 (479) Q Consensus 212 ~~i~~~fG~~td~~mp~~vka~f~~~~~~~qrv~~~~n~g~----~~~G~~I~--~~~---s~~G~I~l~~s~~m~~~~~ 282 (479) ..+..+|-.....+|+..+.+.+...-...-|.+...+... .-.|++|- +.+ .......+.|+- ...-. T Consensus 277 ~~l~~~~~~~a~~v~n~~~~~~L~~lkD~~Gr~l~~~~~~~g~~~~l~G~PV~~~~~~p~~~~~~~~i~~Gd~--~~~~~ 354 (407) T protein:vir:48 277 YTLRKAHRSGAKFMMNNSSLFAIRLLKDNDGNYLWRPGIELGQPSSLAGYGIVENEQMPDIAADAKAIAFGNF--KRGYT 354 (407) T ss_pred HhhchhhhcCCEEEEcHHHHHHHHHhhccCCceeeccCcCCCCCceecceeeEEecCcCCccCCccEEEEEec--cccEE Confidence 34444554444578999988887654444445553322221 23454332 111 112222233321 11111 Q ss_pred ecc--cCC--cCCCCCCCceeEeeeeccCCCCCCCcccccceEEEEEEEcCcCccc Q lcl|NC_018856. 283 LLE--GRN--PEPNAPQAPASVVASIVDDKKGGFRDEDIKTHSYKVVVHSDDAESL 334 (479) Q Consensus 283 L~e--~~~--~~~~AP~~pa~v~at~~t~~~G~f~~~d~gty~YkVtavn~~GES~ 334 (479) +.+ +.. ..+.+-..-....+..-. +|+-.. +.+-..+++.+...+-=++ T Consensus 355 i~~~~~~~i~~d~~~~~~~~~~~~~~r~--d~~v~~-~~a~~~l~~~aa~~~~~~~ 407 (407) T protein:vir:48 355 IVDRIGTRILRDPYTNKPFVGFYTTKRT--GGMLVD-SQAIKLMKIGAATRQKAAA 407 (407) T ss_pred EEEeeceEEEeeccccCCcEEEEEEEEe--ccEEec-ccceEEEEeeccCCCCCCC Confidence 111 000 000000000000110011 111100 0011233333332221111 No 89 >protein:vir:2504 Length: 305 # NCBI annotation: major capsid subunit gp9 # Family: family:all:507 # MgeID: mge:53 # MgeName: TM4 # Cross-refs: genbank:acc:NP_569745;genbank:gi:18496895;genbank:GeneID:932268 Probab=95.38 E-value=0.0022 Score=35.18 Aligned_cols=291 Identities=9% Similarity=0.024 Sum_probs=132.6 Q ss_pred cChhhccCccccchhhhhhhhhhheeccccccchhhccccchhHHHHhhhhhhccCccccccccccccc-----ccccCc Q lcl|NC_018856. 37 ITPDTQLDGAAVRRELLEDQVKMLAFSSNDFTIYPLINKQQVNSTVAKYAVFNQHGRTGHSRFVREVGV-----ASINDP 111 (479) Q Consensus 37 ~~p~~~~~gaalr~esld~~~~~l~~~~~~f~f~~~i~k~~~~stv~eY~~~~~~G~~g~~~fv~E~g~-----~~~~d~ 111 (479) ....+-++|+.|-.+.+.++|....... -.+.+.....+..+.-..|.++. +.....+++|++. .+.+++ T Consensus 1 ma~~t~~~gg~liP~~~~~~Ii~~~~~~--s~l~~l~~~~~~~~~~~~~p~~~---~~~~a~wv~E~~~~~~~~~~~s~~ 75 (305) T protein:vir:25 1 MADISRAEVASLIQEAYSDTLLAAAKQG--STVLSAFQNVNMGTKTTHLPVLA---TLPEADWVGESATDPKGVKPTSKV 75 (305) T ss_pred CCCccCCccceecCHHHHHHHHHHHHhh--chhhhhcceeeccCCcEEEEEEe---CCcceEEeeccccccccccccccc Confidence 3333444577788888877774333322 34555555555554434444433 3345678999874 456799 Q ss_pred ceEEEEEEEEeeeehhhhhhhHhhhcchhhHHHHHHHHHHHHHHHHHHHHHhhcccccCCCCCCcccchhhhHHHhhccC Q lcl|NC_018856. 112 NIRQKTVQMKFLSDTKQQSLAAGLVNNIADPMTILTEDAIAVIAKSIEWAIFYGDAALSSEADGQAGIEFDGLHKLIDQD 191 (479) Q Consensus 112 ~~~r~~~~~k~l~~~~~vs~~~~lvn~~~Dp~~~~~~~ai~~~~~~iE~a~f~Gd~~l~~~~~~~~gleFDGl~~~I~~~ 191 (479) .+.+....++=++.--.+|.-+ +.++..|.+....+.-...+++.+|.++|+|+.+-.+ ++=-+........ T Consensus 76 ~f~~i~~~~~k~~~~~~is~el-l~ds~~~~~~~i~~~l~~~~a~~~d~a~~~G~g~~~~-------~~~~~~~~~~~~~ 147 (305) T protein:vir:25 76 TWANRTLVAEEIAVIIPVHENV-IDDATVAVLTEVAELGGQAIGKKLDQAVIFGTDKPAS-------WVSPALIPAAVTA 147 (305) T ss_pred ceeeEEeeeEEEEEeehhhHHH-HhcchHHHHHHHHHHHHHHHHHHHhhhheeccCCCCC-------ccccccccccccc Confidence 9999999999888888888743 2356678899999999999999999999999864211 1111222222222 Q ss_pred CCEEEccCCCCC-H---HHHhhhhhhhhhccCceEEEecChHHhhhHHHHhhCcceeeeccCCCcceeeeehh--hhcCC Q lcl|NC_018856. 192 TNVIDLKGARLD-E---ATLNKAAVIVGKGYGRATDAFMPIGVQADFTNNLLDRQRVIQPSTAGGFSTGFSIN--QFLST 265 (479) Q Consensus 192 ~NviDarG~~l~-~---~~l~~aa~~i~~~fG~~td~~mp~~vka~f~~~~~~~qrv~~~~n~g~~~~G~~I~--~~~s~ 265 (479) .+.....+.... . +.+.++...+...+..++..+|+....+.+...-....|.+.+.+ .-.|+.+- ..... T Consensus 148 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~lkd~~G~~i~~~~---~l~G~Pv~~~~~~~~ 224 (305) T protein:vir:25 148 GQAVEVVGGVANESDIVGATNRAAKAVASAGWAPDTLLSSLALRYEVANIRDANGNPVFRDD---SFAGFRTFFNRNGAW 224 (305) T ss_pred cccccccccchhhhHHHHHHHHHHHhhhhcccccceeEecHHHHHHHHHhhccCCceeecCC---cccccceEEcCccCC Confidence 233333333322 2 224455556667777888899999988887654443334433211 11222111 11000 Q ss_pred ---CcceecccceecCCCceecccCCcCCCCCCCceeEeeeeccCCCCCCCcccccceEEEEEEEcCcCccccccceeee Q lcl|NC_018856. 266 ---RGAINLHGSTIMENDNILLEGRNPEPNAPQAPASVVASIVDDKKGGFRDEDIKTHSYKVVVHSDDAESLPSEAVTAA 342 (479) Q Consensus 266 ---~G~I~l~~s~~m~~~~~L~e~~~~~~~AP~~pa~v~at~~t~~~G~f~~~d~gty~YkVtavn~~GES~pS~~vt~T 342 (479) .+.+- -|+ | .+ -.+.. .-+..-....-.....++. .......-...+.+..+.|-- T Consensus 225 ~~~~~~~~-~gd-~-s~-~~i~~-------~~~~~i~~~~~~~~~~~~~-~~~~~~~~~~~~R~~~r~~~~--------- 283 (305) T protein:vir:25 225 DADAAIEV-IAD-S-SR-VKIGV-------RQDITVKFLDQATLGTGEN-QINLAERDMVALRLKARFAYV--------- 283 (305) T ss_pred CCCccEEE-EEe-c-ce-EEEEE-------ecCeEEEEeeeeeeecCCc-eeeeeecCcEEEEEEEeecce--------- Confidence 00000 000 0 00 00000 0000000000000000000 000000001112222222211 Q ss_pred eecCCceEEEEEEecCCCCcccc Q lcl|NC_018856. 343 VAKKDNTVKLEVKLASLYQAQPQ 365 (479) Q Consensus 343 v~~~g~sv~ltIT~~~~~~a~~~ 365 (479) +..+..-++++.++.+ ...+.. T Consensus 284 v~~p~a~v~~~~~~~~-~~~pa~ 305 (305) T protein:vir:25 284 LGVSATAQGANKTPVA-VVAPAA 305 (305) T ss_pred eeCcccEEEEcccccc-ccCCCC Confidence 1111112222221110 000001 No 90 >protein:vir:4456 Length: 401 # NCBI annotation: Major capsid protein precursor # Family: family:all:21 # MgeID: mge:96 # MgeName: ST64B # Cross-refs: genbank:acc:NP_700379;genbank:gi:23505451;genbank:GeneID:955658 Probab=95.14 E-value=0.0017 Score=35.76 Aligned_cols=299 Identities=14% Similarity=0.089 Sum_probs=139.1 Q ss_pred CCccchhhhhhh--hc-CCccchHHHHHHHHHhhhcCCCcChhhccCccccchhhhhhhhhhheeccccccchhhccccc Q lcl|NC_018856. 1 MTELKKEAEAKN--KK-LPVEAEAELAELVSKSFTTGYGITPDTQLDGAAVRRELLEDQVKMLAFSSNDFTIYPLINKQQ 77 (479) Q Consensus 1 ~~~~~~~~~~~~--~~-~~~~~~~~~~e~~~Ks~tag~~~~p~~~~~gaalr~esld~~~~~l~~~~~~f~f~~~i~k~~ 77 (479) ..+.+.+.|.+. .+ +......++.+.-.|+++++.. .+|+.|-++.+.++|..+... ...+.+.....+ T Consensus 74 ~~~~~~~~e~~~a~~~~lr~~~~~~~~~~e~~a~~~~~~------~~GG~~iP~~~~~~ii~~~~~--~~~l~~~~~~~~ 145 (401) T protein:vir:44 74 GAQNKVAAEHKDAFVGFLRKGREDGLRDLERKALQVGTD------EDGGYAVPEELDRSILSLLKD--EVVMRQEATVIT 145 (401) T ss_pred ccccchhHHHHHHHHHHHhhhhhhhhHHHHHHHhhcCCC------CCCceeccHhHHHHHHHHHHh--hhhhhhhceeee Confidence 111111111110 00 0001112222222344444422 346677777777777544332 234555555566 Q ss_pred hhHHHHhhhhhhccCccccccccccccc-ccccCcceEEEEEEEEeeeehhhhhhhHhhhcchhhHHHHHHHHHHHHHHH Q lcl|NC_018856. 78 VNSTVAKYAVFNQHGRTGHSRFVREVGV-ASINDPNIRQKTVQMKFLSDTKQQSLAAGLVNNIADPMTILTEDAIAVIAK 156 (479) Q Consensus 78 ~~stv~eY~~~~~~G~~g~~~fv~E~g~-~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lvn~~~Dp~~~~~~~ai~~~~~ 156 (479) +.+....|.+. -++. ...+++|++. ++..++.+.+....++=++.--.+|.-+ +.++..|.+....+.-...+++ T Consensus 146 ~~~~~~~~~~~--~~~~-~a~wv~E~~~~~~~~~~~~~~v~~~~~k~~~~~~iS~el-l~ds~~~l~~~i~~~la~ai~~ 221 (401) T protein:vir:44 146 VGGSDYKKLVN--LGGT-ASGWVGETDTRSQTATSRLGLIEPFMGEIYGNPQATQKM-LDDAFFNVEAWINSELATEFAE 221 (401) T ss_pred cCCCceEEEEe--cCCc-cceeeccccccCccccccceeeeeehhheeeehhhhHHH-HhcchHHHHHHHHHHHHHHHHH Confidence 66554444442 2332 3457999985 5677789999988888777766666642 2355668888888888889999 Q ss_pred HHHHHHhhcccccCCCCCCcccchhhhHHHhhcc-----------CCCEEEccCCCCCHHHHhhhhhhhhhccCceEEEe Q lcl|NC_018856. 157 SIEWAIFYGDAALSSEADGQAGIEFDGLHKLIDQ-----------DTNVIDLKGARLDEATLNKAAVIVGKGYGRATDAF 225 (479) Q Consensus 157 ~iE~a~f~Gd~~l~~~~~~~~gleFDGl~~~I~~-----------~~NviDarG~~l~~~~l~~aa~~i~~~fG~~td~~ 225 (479) .++.++++||-.=.| .|+.+.... .+.+.......++.+.|..+.-.+...|....-.+ T Consensus 222 ~~~~~~l~G~G~~~p----------~Gil~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~d~i~~~~~~l~~~~~~~a~~v 291 (401) T protein:vir:44 222 QEEIAFTTGDGTKKP----------KGFLAYESTEESDKARAFGKLQHIVSGEATAVTADAIIKLIYTLRKAHRTGAKFM 291 (401) T ss_pred HHHhhhhccCCCCcc----------ceeeccccccccccccccccccccccccccccCHHHHHHHHHhcchhhhcCCEEE Confidence 999999999875222 455544321 12233334444555555554444455555545578 Q ss_pred cChHHhhhHHHHhhCcceeeeccCCCc----ceeeeehh--hhc---CCCcceecccce-----ecCCCceecccCCcCC Q lcl|NC_018856. 226 MPIGVQADFTNNLLDRQRVIQPSTAGG----FSTGFSIN--QFL---STRGAINLHGST-----IMENDNILLEGRNPEP 291 (479) Q Consensus 226 mp~~vka~f~~~~~~~qrv~~~~n~g~----~~~G~~I~--~~~---s~~G~I~l~~s~-----~m~~~~~L~e~~~~~~ 291 (479) |+......+...-...-|++...+... .-.|++|- +.. ...+...+.||. +.++..+-+. ..+ T Consensus 292 ~n~~~~~~L~~lkd~~G~~l~~~~~~~g~~~~l~G~PVv~~~~~p~~~~~~~~i~~Gd~~~~~~i~~~~~~~~~---~~~ 368 (401) T protein:vir:44 292 MNNNSLFAIRLLKDTEGNYLWRPGLELGQPSSLAGYGIAENEQMPDIAADAKAIAFGNFKRGYTIVDRIGTRIL---RDP 368 (401) T ss_pred EcHHHHHHHHHhhccCCceeecCCcCCCCCceecceeeEEecCcCCccCCccEEEEeehhccEEEEEecceEEe---eec Confidence 999888777654444446654322211 13455442 111 112222222321 1111100000 000 Q ss_pred CCCCCceeEeeeeccCCCCCCCcccccceEEEEEEE Q lcl|NC_018856. 292 NAPQAPASVVASIVDDKKGGFRDEDIKTHSYKVVVH 327 (479) Q Consensus 292 ~AP~~pa~v~at~~t~~~G~f~~~d~gty~YkVtav 327 (479) ..-..-+..-+..-. +|+... +.+-..+++.+. T Consensus 369 ~~~~~~v~~~a~~r~--d~~~~~-~~a~~~l~~~aa 401 (401) T protein:vir:44 369 YTNKPFVGFYTTKRT--GGMLVD-SQAIKLLKIAAA 401 (401) T ss_pred cccCCcEEEEEEEEe--ccEEec-ccceEEEEeecC Confidence 000000000111000 111111 001122233222 No 91 >protein:vir:3158 Length: 321 # NCBI annotation: capsid protein gpE # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:316 # MgeName: PhiCh1 # Cross-refs: genbank:acc:NP_665929;genbank:gi:22091115;genbank:GeneID:951342 Probab=94.90 E-value=0.0032 Score=34.24 Aligned_cols=301 Identities=13% Similarity=0.079 Sum_probs=141.2 Q ss_pred chHHHHHHHHHhhhcCCCcChhhccCccccchhhhhhhhhhheeccccccchhhccccchhHHHHhhhhhhccCcccccc Q lcl|NC_018856. 19 AEAELAELVSKSFTTGYGITPDTQLDGAAVRRELLEDQVKMLAFSSNDFTIYPLINKQQVNSTVAKYAVFNQHGRTGHSR 98 (479) Q Consensus 19 ~~~~~~e~~~Ks~tag~~~~p~~~~~gaalr~esld~~~~~l~~~~~~f~f~~~i~k~~~~stv~eY~~~~~~G~~g~~~ 98 (479) -.+.+.......+.-...++..+..+|..+..|.-.+-+..+...+ .|++.++-.++++-..+ +...|-.+... T Consensus 1 ~~~k~~~~~l~~~~~~~~~~~~~~~~g~~v~~~~~~~l~~~i~e~s---~~l~~i~v~~v~~~~~~---i~~~~~~~~~~ 74 (321) T protein:vir:31 1 MASRTINNDLSRITEKNALTVDDLDAGGTLPDPLWDEFWTDMIEET---PLLDAIRTETVGAKKTR---IPTLNIGERHR 74 (321) T ss_pred CchHHHHHHHHHHHHhccccccccCCcceeCHHHHHHHHHHHHHhh---hhhhhceeeeccCccee---eeeeccCCccc Confidence 1111111111222212344445556677777776666555554433 47777776666543332 22222111111 Q ss_pred ccc-cc-ccccccCcceEEEEEEEEeeeehhhhhhhHhhhcch--hhHHHHHHHHHHHHHHHHHHHHHhhcccccCCCCC Q lcl|NC_018856. 99 FVR-EV-GVASINDPNIRQKTVQMKFLSDTKQQSLAAGLVNNI--ADPMTILTEDAIAVIAKSIEWAIFYGDAALSSEAD 174 (479) Q Consensus 99 fv~-E~-g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lvn~~--~Dp~~~~~~~ai~~~~~~iE~a~f~Gd~~l~~~~~ 174 (479) ..+ |+ +....++|.+.+....++=+..--.+|.-. |-++. .|-+....+.-..+++.+++.+.|+||..-.+. T Consensus 75 ~~~~e~~~~~~~~~~~~~~~~~~~~k~~~~~~it~e~-L~d~a~~~d~e~~i~~~ia~~~a~~~~~~~~nGd~~~~~~-- 151 (321) T protein:vir:31 75 RPQDEGEWNENESDVSTGTIDISTEKATVAWDLPREV-VQENPEGEALADRILNLMTDAWSADVEDLAANGDEDAEDS-- 151 (321) T ss_pred ccccccccccccccceeeeeeeeeEEEEeehhccHHH-HHhhhcchhHHHHHHHHHHHHHHHHHHhheeeccccCCCc-- Confidence 222 33 344567888887777776665555555432 22332 477888888888899999999999999764331 Q ss_pred CcccchhhhHHHhhccCCCEEEccCCCCCHHHHhhhhhhhhhccCce-E-EEecChHHhhhHHHHhhCcceeeeccCC-- Q lcl|NC_018856. 175 GQAGIEFDGLHKLIDQDTNVIDLKGARLDEATLNKAAVIVGKGYGRA-T-DAFMPIGVQADFTNNLLDRQRVIQPSTA-- 250 (479) Q Consensus 175 ~~~gleFDGl~~~I~~~~NviDarG~~l~~~~l~~aa~~i~~~fG~~-t-d~~mp~~vka~f~~~~~~~qrv~~~~n~-- 250 (479) ++ ...+|+++.+....+.++..+..++.+.|..+-..+-..|-.. + -++|+..+.+.+...+.+++..+..... T Consensus 152 ~~--~~n~G~l~~a~~~~~~~~~~~~~~~~d~l~~l~~~l~~~yr~~~~~v~im~~~~~~~~~~~l~~~~~~~~~~~l~~ 229 (321) T protein:vir:31 152 FE--NQNDGFITVAEGDVETIDAADDILDNDLVIRTIAGLDSKYRARMNPALIVSEDQLLSYHYTLTDRDTPLGDNVIMG 229 (321) T ss_pred cc--ccchhhhhhhccccccccccccccCHHHHHHHHHhccHhHhcCCCeEEEechHHHHHHHHHHhcCCCccccchhhc Confidence 11 2469999999877788999999999888887766676666432 2 2679998887777766665544321100 Q ss_pred -Ccce-eeeehhhh-cCCCcceecccceecCCCceecccCCcCCCCCCCceeEeeeeccCCCCCCC------c-ccccce Q lcl|NC_018856. 251 -GGFS-TGFSINQF-LSTRGAINLHGSTIMENDNILLEGRNPEPNAPQAPASVVASIVDDKKGGFR------D-EDIKTH 320 (479) Q Consensus 251 -g~~~-~G~~I~~~-~s~~G~I~l~~s~~m~~~~~L~e~~~~~~~AP~~pa~v~at~~t~~~G~f~------~-~d~gty 320 (479) ...+ .|+.|-.. ..|.+.|-+.. +....... ..+.... . ++...+ T Consensus 230 ~~~~tl~G~pvv~~~~mP~~~il~t~--------------------~~nl~~~~-----~~~~~~~~~~~~~~~~~~~~~ 284 (321) T protein:vir:31 230 EADVNPFSFPIIGSGLWPDDKAMFTD--------------------PQNLIYAL-----YRDLEIDVLTESDKVSERDLH 284 (321) T ss_pred cccccccceeEEEcCCCCCCcEEEec--------------------cccEEEEE-----eeccEEEEeecCcccccccee Confidence 0001 12211100 00111111111 00000000 0000000 0 001112 Q ss_pred EEEEEEEcCcC--ccccccceeeeeecCCceEEEEEEecCCCCccc Q lcl|NC_018856. 321 SYKVVVHSDDA--ESLPSEAVTAAVAKKDNTVKLEVKLASLYQAQP 364 (479) Q Consensus 321 ~YkVtavn~~G--ES~pS~~vt~Tv~~~g~sv~ltIT~~~~~~a~~ 364 (479) .|.....+.+. |- +...+.++ . +..+....-..+. T Consensus 285 ~~~~~~~~~~~~ve~-~~a~a~~~--~------i~~~~~~~~~~~~ 321 (321) T protein:vir:31 285 ARYFMRGDDDFAIEN-TEAVVLAE--G------LGDPLEHLEEETS 321 (321) T ss_pred eEeeeeeecceeEec-cccEEEEe--c------CCcchhcccCCCC Confidence 22222222211 11 11111111 0 0000001111000 No 92 >protein:vir:5739 Length: 366 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:122 # MgeName: PY54 # Cross-refs: genbank:acc:NP_892050;genbank:gi:33770513;interpro:IPR006444;uniprot:Q7Y410;genbank:GeneID:1732928 Probab=94.46 E-value=0.0044 Score=33.52 Aligned_cols=314 Identities=11% Similarity=0.082 Sum_probs=140.3 Q ss_pred CCccchhhh---hhhh------cCCccchHHHHHHHHHhh-----hcCCCcChhhccCccccchhhhhhhhhhheecccc Q lcl|NC_018856. 1 MTELKKEAE---AKNK------KLPVEAEAELAELVSKSF-----TTGYGITPDTQLDGAAVRRELLEDQVKMLAFSSND 66 (479) Q Consensus 1 ~~~~~~~~~---~~~~------~~~~~~~~~~~e~~~Ks~-----tag~~~~p~~~~~gaalr~esld~~~~~l~~~~~~ 66 (479) -++.++.+. ..++ +.+.... .+.-.+.+ .....++. ..|+.|-.+.+..+|..+... T Consensus 21 ~~~~~~~kg~~~~~~~~a~a~~~g~~~~a---~~~a~~~~~~~~~~~a~~~~~---~~Gg~lvP~~~~~~ii~~l~~--- 91 (366) T protein:vir:57 21 KEELQQYKGAGMTRMVMSIAAGKGNLADA---AKFAATELGDTGLSMAISTAA---GSGGALIPQNMQNEVIELLRD--- 91 (366) T ss_pred ccccccccchhHHHHHHHHHhcccchhHH---HHHHHHhhcchhhhhhccccc---cCCccccchhHHHHHHHHHhh--- Confidence 111111110 0111 2222111 11111222 11111211 246677777777776544432 Q ss_pred ccchhhccccchh--HHHHhhhhhhccCcccccccccccccccccCcceEEEEEEEEeeeehhhhhhhHhhhcchhhHHH Q lcl|NC_018856. 67 FTIYPLINKQQVN--STVAKYAVFNQHGRTGHSRFVREVGVASINDPNIRQKTVQMKFLSDTKQQSLAAGLVNNIADPMT 144 (479) Q Consensus 67 f~f~~~i~k~~~~--stv~eY~~~~~~G~~g~~~fv~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lvn~~~Dp~~ 144 (479) ...+..+.-+.+. +---+|.++. +.....+++|++..+.+++.+.+.....+=++.--.+|.-+- .++.-|.+. T Consensus 92 ~s~l~~lg~~~v~~~~g~~~~p~~t---~~~~a~wv~E~~~~~~s~~~f~~i~~~~~k~~~~~~iS~ell-~ds~~~~~~ 167 (366) T protein:vir:57 92 RTVVRILGARSIPLPNGNLSMPRLS---GGATAGYVGEGKDVVATGATFDDVKLSAKTMIALVPVSNQLI-GRAGFNVEQ 167 (366) T ss_pred hcchhhhceeeeecCCCceEEEEEe---CCcceeeeccCccccccccceeEEEEeeEEEEEeehhhHHHH-hhhhHHHHH Confidence 2233333222221 1111233322 233566899999999999999999999999988777775432 244457788 Q ss_pred HHHHHHHHHHHHHHHHHHhhcccccCCCCCCcccchhhhHHHhhccCCCEEEccCCCCCHHHHhhhhhhhh------hcc Q lcl|NC_018856. 145 ILTEDAIAVIAKSIEWAIFYGDAALSSEADGQAGIEFDGLHKLIDQDTNVIDLKGARLDEATLNKAAVIVG------KGY 218 (479) Q Consensus 145 ~~~~~ai~~~~~~iE~a~f~Gd~~l~~~~~~~~gleFDGl~~~I~~~~NviDarG~~l~~~~l~~aa~~i~------~~f 218 (479) ...++-...+++.++.++++||-. |-+..||.+.........+.-|..++...+......+. ..+ T Consensus 168 ~i~~~l~~a~~~~~d~a~l~G~G~---------~~~p~Gi~~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~ 238 (366) T protein:vir:57 168 LLLGDILSAIATREDKAFLRDDGT---------GDTPKGMKAVATAANRLVAWTGTAINLTTIDEYLDSLILKHMDSNSN 238 (366) T ss_pred HHHHHHHHHHHHHHHHHhhccCCC---------CccccceeeccccccceeeccccccchhhHHHHHHHHHHhhhccccc Confidence 888999999999999999999853 22457887766544446666777766555543222222 223 Q ss_pred CceEEEecChHHhhhHHHHhhCcceeeeccCCCcceeeeehhhhcCCCcceec---ccceecCCC-ce-ecc-cCCcCCC Q lcl|NC_018856. 219 GRATDAFMPIGVQADFTNNLLDRQRVIQPSTAGGFSTGFSINQFLSTRGAINL---HGSTIMEND-NI-LLE-GRNPEPN 292 (479) Q Consensus 219 G~~td~~mp~~vka~f~~~~~~~qrv~~~~n~g~~~~G~~I~~~~s~~G~I~l---~~s~~m~~~-~~-L~e-~~~~~~~ 292 (479) ....-..|+......+...-...-|.+.+...++.-.|++|-........... .+.++..+. +. +.+ +-+.-.- T Consensus 239 ~~~a~~vmn~~~~~~L~~lkd~~G~~l~~~~~~g~l~G~Pvv~s~~ip~~~~~~~~~~~i~~gdfs~~~i~~~~~i~i~~ 318 (366) T protein:vir:57 239 MIRCGWGLSNRTYMTLFGLRDGNGNKVYPEMSQGILKGYPIQRTSAIPANLGDDGNESEIYFCDFNDVVIGEDGMMKVDF 318 (366) T ss_pred cccCEEEecHHHHHHHHhhhccCCceeccCCCCCeecceeeEEccccccccccCCCccEEEEEecceEEEEEecceEEEE Confidence 33444679999888887544443344544444434455544322211110000 001111110 01 100 0000000 Q ss_pred CCCCceeEeeeeccCCC----CCCCcccccceEEEEEEEcCcCccccccceeeeeecC Q lcl|NC_018856. 293 APQAPASVVASIVDDKK----GGFRDEDIKTHSYKVVVHSDDAESLPSEAVTAAVAKK 346 (479) Q Consensus 293 AP~~pa~v~at~~t~~~----G~f~~~d~gty~YkVtavn~~GES~pS~~vt~Tv~~~ 346 (479) .+. ++..+.. +.|.. +..-+++...-+-+---|...+-.|-..= T Consensus 319 ~~e-------a~~~~~~g~~~~~f~~---~~~~iR~~~~~d~~v~~~~a~~~lt~~~~ 366 (366) T protein:vir:57 319 STE-------ATYKDADGQLVSAFAR---NQSLIRVVTEHDIGFRHPEGLVLGTGVIW 366 (366) T ss_pred eec-------cccccccccchhhhhc---CceeEEeeeeeCcEeeccccEEEEecccC Confidence 000 0000111 11211 11222222221111111211111110000 No 93 >protein:vir:4600 Length: 415 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:101 # MgeName: PVL # Cross-refs: genbank:acc:NP_058445;genbank:gi:9635171;genbank:GeneID:1262708 Probab=94.13 E-value=0.0053 Score=33.06 Aligned_cols=300 Identities=13% Similarity=0.044 Sum_probs=135.3 Q ss_pred CC----------ccchhhhh----------------hhhcCCccchHHHHHHHHHhhhcCCCcC--hhhccCccccchhh Q lcl|NC_018856. 1 MT----------ELKKEAEA----------------KNKKLPVEAEAELAELVSKSFTTGYGIT--PDTQLDGAAVRREL 52 (479) Q Consensus 1 ~~----------~~~~~~~~----------------~~~~~~~~~~~~~~e~~~Ks~tag~~~~--p~~~~~gaalr~es 52 (479) +. ..+...+. ..........++.. .+.+....+.... ..+-.+|+.+..+. T Consensus 58 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~t~~g~~~iP~~ 136 (415) T protein:vir:46 58 LDKLKEKDRTSENNQQSVEVNEARTYRNQANINDLGISIQNTKVTSQEVR-DFTEYLETRNDIQGGSLKTDSGFVVIPEE 136 (415) T ss_pred HHHHHHHHHhhhhcccccccchhhhhHHHHHHHHHHHhhhhhhhhHHHHH-HHHHHHhhhhhhhhccccccCCcccccHH Confidence 00 00000000 00000001111111 1222222211111 11223577888988 Q ss_pred hhhhhhhheeccccccchhhccccchhHHHHhhhhhhccCccccccccccccccc-ccCcceEEEEEEEEeeeehhhhhh Q lcl|NC_018856. 53 LEDQVKMLAFSSNDFTIYPLINKQQVNSTVAKYAVFNQHGRTGHSRFVREVGVAS-INDPNIRQKTVQMKFLSDTKQQSL 131 (479) Q Consensus 53 ld~~~~~l~~~~~~f~f~~~i~k~~~~stv~eY~~~~~~G~~g~~~fv~E~g~~~-~~d~~~~r~~~~~k~l~~~~~vs~ 131 (479) +...|..+... ...+++.+...++.+.-..|.+....++ ....+++|++..+ .+++.+.......+-++.-..+|. T Consensus 137 ~~~~ii~~~~~--~~~l~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~v~Eg~~~~~~~~~~~~~v~~~~~k~~~~~~iS~ 213 (415) T protein:vir:46 137 IVTDILKLKEV--EFNLDKYVTVKRVTNGSGKYPVVRQSEV-AALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISR 213 (415) T ss_pred HHHHHHHHHHh--hhhhhhhcceeeccCCceeEEEEEecCC-cceeecccccccccccccceeeEEeeeeeeEeeehhhH Confidence 88888544433 3345555666666655555555444443 3566899998776 678999999999999998877776 Q ss_pred hHhhhcchhhHHHHHHHHHHHHHHHHHHHHHhhcccccCCCCCCcccchhhhHHHhhccCCCEEEccCCCCCHHHHhhhh Q lcl|NC_018856. 132 AAGLVNNIADPMTILTEDAIAVIAKSIEWAIFYGDAALSSEADGQAGIEFDGLHKLIDQDTNVIDLKGARLDEATLNKAA 211 (479) Q Consensus 132 ~~~lvn~~~Dp~~~~~~~ai~~~~~~iE~a~f~Gd~~l~~~~~~~~gleFDGl~~~I~~~~NviDarG~~l~~~~l~~aa 211 (479) -+- .++..|.+....+.....+++.++.+++.|+-.=.+. -+...... ..+.....+.. +.+.|.++- T Consensus 214 ell-~ds~~~l~~~i~~~l~~~i~~~~d~~il~g~g~g~~~---------~~~~~~~~-~~~~~~~~~~~-~~~~i~~~~ 281 (415) T protein:vir:46 214 EAI-EDAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTG---------STSSGFEK-EGKKLEVKKAK-SLDDIKDAI 281 (415) T ss_pred HHH-hhchHHHHHHHHHHHHHHHHHHHHHHHhhccccCCcc---------cccccccc-ccceecccccc-chHHHHHHH Confidence 432 3455678888999999999999999999998552221 11111111 12333333333 333343333 Q ss_pred hhhhhccCceEEEecChHHhhhHHHHhhCcceeeeccCCCcc----eeeeehhhhc-CC---Ccce-ecccc-----eec Q lcl|NC_018856. 212 VIVGKGYGRATDAFMPIGVQADFTNNLLDRQRVIQPSTAGGF----STGFSINQFL-ST---RGAI-NLHGS-----TIM 277 (479) Q Consensus 212 ~~i~~~fG~~td~~mp~~vka~f~~~~~~~qrv~~~~n~g~~----~~G~~I~~~~-s~---~G~I-~l~~s-----~~m 277 (479) -.+...|....-.+|+....+.+...-...-|++...+..+. -.|++|--.. .+ .|.+ .+.|+ .++ T Consensus 282 ~~~~~~~~~~~~~v~n~~~~~~L~~lkd~~G~~i~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~gd~~~~~~~~ 361 (415) T protein:vir:46 282 NLNVKPNYEHNVAIVSQTMFAKLDKMKDKLGNYLIQPDVKEKTQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKDAIVLF 361 (415) T ss_pred HhhhhhccCCCEEEEcHHHHHHHHHhhccCCCeeeccCcCCCCCccccceeeEEeccccccCCCccEEEEEehhccEEEE Confidence 333444555667889998888876533333344433232221 2344432110 01 1110 11111 011 Q ss_pred CCCceecc--------------cCC-cCCCCCCCceeEeeeeccCCCCCCCcccccceEEEEEEEcCcCccccccceeee Q lcl|NC_018856. 278 ENDNILLE--------------GRN-PEPNAPQAPASVVASIVDDKKGGFRDEDIKTHSYKVVVHSDDAESLPSEAVTAA 342 (479) Q Consensus 278 ~~~~~L~e--------------~~~-~~~~AP~~pa~v~at~~t~~~G~f~~~d~gty~YkVtavn~~GES~pS~~vt~T 342 (479) ++..+.++ -|+ ..+--|.+-..+.-++++.+.|. T Consensus 362 ~~~~~~v~~~~~~~~~~~~~~~~r~d~~v~~~~a~~~~~~~~~~~~~~~------------------------------- 410 (415) T protein:vir:46 362 DRSQYQASWTDYMHFGECLMIAVRQDCRILDYKSAIVIEYDDSERGEGD------------------------------- 410 (415) T ss_pred eecceEEEeeccccCceEEEEEEEeccEEeccccEEEEEeeccCCCCCC------------------------------- Confidence 11111000 000 00001111111111111111111 Q ss_pred eecCCceEEEEE Q lcl|NC_018856. 343 VAKKDNTVKLEV 354 (479) Q Consensus 343 v~~~g~sv~ltI 354 (479) .-|+- T Consensus 411 -------~~~~~ 415 (415) T protein:vir:46 411 -------LGLEA 415 (415) T ss_pred -------ccCCC Confidence 11111 No 94 >protein:vir:4700 Length: 415 # NCBI annotation: phi PVL ORF 7 homologue # Family: family:all:21 # MgeID: mge:102 # MgeName: phiPV83 # Cross-refs: genbank:acc:NP_061632;genbank:gi:9635719;genbank:GeneID:1262976 Probab=94.13 E-value=0.0053 Score=33.06 Aligned_cols=300 Identities=13% Similarity=0.044 Sum_probs=135.3 Q ss_pred CC----------ccchhhhh----------------hhhcCCccchHHHHHHHHHhhhcCCCcC--hhhccCccccchhh Q lcl|NC_018856. 1 MT----------ELKKEAEA----------------KNKKLPVEAEAELAELVSKSFTTGYGIT--PDTQLDGAAVRREL 52 (479) Q Consensus 1 ~~----------~~~~~~~~----------------~~~~~~~~~~~~~~e~~~Ks~tag~~~~--p~~~~~gaalr~es 52 (479) +. ..+...+. ..........++.. .+.+....+.... ..+-.+|+.+..+. T Consensus 58 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~t~~g~~~iP~~ 136 (415) T protein:vir:47 58 LDKLKEKDRTSENNQQSVEVNEARTYRNQANINDLGISIQNTKVTSQEVR-DFTEYLETRNDIQGGSLKTDSGFVVIPEE 136 (415) T ss_pred HHHHHHHHHhhhhcccccccchhhhhHHHHHHHHHHHhhhhhhhhHHHHH-HHHHHHhhhhhhhhccccccCCcccccHH Confidence 00 00000000 00000001111111 1222222211111 11223577888988 Q ss_pred hhhhhhhheeccccccchhhccccchhHHHHhhhhhhccCccccccccccccccc-ccCcceEEEEEEEEeeeehhhhhh Q lcl|NC_018856. 53 LEDQVKMLAFSSNDFTIYPLINKQQVNSTVAKYAVFNQHGRTGHSRFVREVGVAS-INDPNIRQKTVQMKFLSDTKQQSL 131 (479) Q Consensus 53 ld~~~~~l~~~~~~f~f~~~i~k~~~~stv~eY~~~~~~G~~g~~~fv~E~g~~~-~~d~~~~r~~~~~k~l~~~~~vs~ 131 (479) +...|..+... ...+++.+...++.+.-..|.+....++ ....+++|++..+ .+++.+.......+-++.-..+|. T Consensus 137 ~~~~ii~~~~~--~~~l~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~v~Eg~~~~~~~~~~~~~v~~~~~k~~~~~~iS~ 213 (415) T protein:vir:47 137 IVTDILKLKEV--EFNLDKYVTVKRVTNGSGKYPVVRQSEV-AALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISR 213 (415) T ss_pred HHHHHHHHHHh--hhhhhhhcceeeccCCceeEEEEEecCC-cceeecccccccccccccceeeEEeeeeeeEeeehhhH Confidence 88888544433 3345555666666655555555444443 3566899998776 678999999999999998877776 Q ss_pred hHhhhcchhhHHHHHHHHHHHHHHHHHHHHHhhcccccCCCCCCcccchhhhHHHhhccCCCEEEccCCCCCHHHHhhhh Q lcl|NC_018856. 132 AAGLVNNIADPMTILTEDAIAVIAKSIEWAIFYGDAALSSEADGQAGIEFDGLHKLIDQDTNVIDLKGARLDEATLNKAA 211 (479) Q Consensus 132 ~~~lvn~~~Dp~~~~~~~ai~~~~~~iE~a~f~Gd~~l~~~~~~~~gleFDGl~~~I~~~~NviDarG~~l~~~~l~~aa 211 (479) -+- .++..|.+....+.....+++.++.+++.|+-.=.+. -+...... ..+.....+.. +.+.|.++- T Consensus 214 ell-~ds~~~l~~~i~~~l~~~i~~~~d~~il~g~g~g~~~---------~~~~~~~~-~~~~~~~~~~~-~~~~i~~~~ 281 (415) T protein:vir:47 214 EAI-EDAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTG---------STSSGFEK-EGKKLEVKKAK-SLDDIKDAI 281 (415) T ss_pred HHH-hhchHHHHHHHHHHHHHHHHHHHHHHHhhccccCCcc---------cccccccc-ccceecccccc-chHHHHHHH Confidence 432 3455678888999999999999999999998552221 11111111 12333333333 333343333 Q ss_pred hhhhhccCceEEEecChHHhhhHHHHhhCcceeeeccCCCcc----eeeeehhhhc-CC---Ccce-ecccc-----eec Q lcl|NC_018856. 212 VIVGKGYGRATDAFMPIGVQADFTNNLLDRQRVIQPSTAGGF----STGFSINQFL-ST---RGAI-NLHGS-----TIM 277 (479) Q Consensus 212 ~~i~~~fG~~td~~mp~~vka~f~~~~~~~qrv~~~~n~g~~----~~G~~I~~~~-s~---~G~I-~l~~s-----~~m 277 (479) -.+...|....-.+|+....+.+...-...-|++...+..+. -.|++|--.. .+ .|.+ .+.|+ .++ T Consensus 282 ~~~~~~~~~~~~~v~n~~~~~~L~~lkd~~G~~i~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~gd~~~~~~~~ 361 (415) T protein:vir:47 282 NLNVKPNYEHNVAIVSQTMFAKLDKMKDKLGNYLIQPDVKEKTQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKDAIVLF 361 (415) T ss_pred HhhhhhccCCCEEEEcHHHHHHHHHhhccCCCeeeccCcCCCCCccccceeeEEeccccccCCCccEEEEEehhccEEEE Confidence 333444555667889998888876533333344433232221 2344432110 01 1110 11111 011 Q ss_pred CCCceecc--------------cCC-cCCCCCCCceeEeeeeccCCCCCCCcccccceEEEEEEEcCcCccccccceeee Q lcl|NC_018856. 278 ENDNILLE--------------GRN-PEPNAPQAPASVVASIVDDKKGGFRDEDIKTHSYKVVVHSDDAESLPSEAVTAA 342 (479) Q Consensus 278 ~~~~~L~e--------------~~~-~~~~AP~~pa~v~at~~t~~~G~f~~~d~gty~YkVtavn~~GES~pS~~vt~T 342 (479) ++..+.++ -|+ ..+--|.+-..+.-++++.+.|. T Consensus 362 ~~~~~~v~~~~~~~~~~~~~~~~r~d~~v~~~~a~~~~~~~~~~~~~~~------------------------------- 410 (415) T protein:vir:47 362 DRSQYQASWTDYMHFGECLMIAVRQDCRILDYKSAIVIEYDDSERGEGD------------------------------- 410 (415) T ss_pred eecceEEEeeccccCceEEEEEEEeccEEeccccEEEEEeeccCCCCCC------------------------------- Confidence 11111000 000 00001111111111111111111 Q ss_pred eecCCceEEEEE Q lcl|NC_018856. 343 VAKKDNTVKLEV 354 (479) Q Consensus 343 v~~~g~sv~ltI 354 (479) .-|+- T Consensus 411 -------~~~~~ 415 (415) T protein:vir:47 411 -------LGLEA 415 (415) T ss_pred -------ccCCC Confidence 11111 No 95 >protein:vir:81227 Length: 413 # NCBI annotation: gp6, major capsid protein # Family: family:all:585 # MgeID: mge:1893 # MgeName: BFK20 # Cross-refs: genbank:acc:YP_001456736;genbank:gi:157168379;hssp:P49861;interpro:IPR006444;uniprot:Q9MBJ9;genbank:GeneID:5580350 Probab=94.13 E-value=0.0053 Score=33.05 Aligned_cols=310 Identities=12% Similarity=0.037 Sum_probs=130.9 Q ss_pred CCccchhh---h--hhhhcCC-------ccchHHHHHHH-------------HHhhhcCCCcChhhccCccccchhhhhh Q lcl|NC_018856. 1 MTELKKEA---E--AKNKKLP-------VEAEAELAELV-------------SKSFTTGYGITPDTQLDGAAVRRELLED 55 (479) Q Consensus 1 ~~~~~~~~---~--~~~~~~~-------~~~~~~~~e~~-------------~Ks~tag~~~~p~~~~~gaalr~esld~ 55 (479) +.....+. + ....... ....++..... .+++..-. .....-++++.+-.+.+.+ T Consensus 58 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~vp~~~~~ 136 (413) T protein:vir:81 58 SVDSEKSGELTRKGEGYKSIGEFFAKRAGDQIKQQAGGAQLNYSVGEYVAPRVKAASDPA-STATLTDEFQGGYGTTWNR 136 (413) T ss_pred HHhHHHhhhHhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHhhhhhhhhhhhHHHhhhhhh-hhcccccccccccchhhHH Confidence 00000000 0 0000000 00000000000 01111000 0111224566666777777 Q ss_pred hhhhheeccccccchhhccccchhHHHHhhhhhhccC-cccccccccccccccccC-cceEEEEEEEEeeeehhhhhhhH Q lcl|NC_018856. 56 QVKMLAFSSNDFTIYPLINKQQVNSTVAKYAVFNQHG-RTGHSRFVREVGVASIND-PNIRQKTVQMKFLSDTKQQSLAA 133 (479) Q Consensus 56 ~~~~l~~~~~~f~f~~~i~k~~~~stv~eY~~~~~~G-~~g~~~fv~E~g~~~~~d-~~~~r~~~~~k~l~~~~~vs~~~ 133 (479) +|..+.... -.+.+.+...+..+.-.+|.+..... ..+...+++|++....++ +.+.+....++=++.-..+|..+ T Consensus 137 ~ii~~~~~~--~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~f~~i~~~~~k~~~~~~iS~el 214 (413) T protein:vir:81 137 NIIYRRREK--LVVADLMDNLTMTNTTIKYLMEKANRVVEGGFKTVAEGGKKPYMRFADFDIVTESLSKIAGLTKITDEM 214 (413) T ss_pred HHHHHHhhh--hhHHhhcceeeccCCceeEEEeccccccccccceecCcccccccCcccceeeEeeeeeEEEeehhhHHH Confidence 764443332 23445555556655444555443222 234577999998876665 78999999998888777888753 Q ss_pred hhhcchhhHHHHHHHHHHHHHHHHHHHHHhhcccccCCCCCCcccchhhhHHHhhccCCCEEEccCCCCCHHHHhhhhhh Q lcl|NC_018856. 134 GLVNNIADPMTILTEDAIAVIAKSIEWAIFYGDAALSSEADGQAGIEFDGLHKLIDQDTNVIDLKGARLDEATLNKAAVI 213 (479) Q Consensus 134 ~lvn~~~Dp~~~~~~~ai~~~~~~iE~a~f~Gd~~l~~~~~~~~gleFDGl~~~I~~~~NviDarG~~l~~~~l~~aa~~ 213 (479) +.++ .+.+....+.-...+++.+|.++++|+-. |-.+.||.+.-.. +..-..+..-..+.|.++... T Consensus 215 -l~ds-~~l~~~i~~~la~~~~~~~d~~~l~G~G~---------~~~~~Gi~~~~~~--~~~~~~~~~~~~~~i~~~~~~ 281 (413) T protein:vir:81 215 -IEDY-DFLVSYINARLLEELAIEEERQLLLGDGT---------GNNLTGLLKRDGI--QTLAVSNKDELADSIYKAMTN 281 (413) T ss_pred -HHHH-HHHHHHHHHHHHHHHHHHHHHHHhccCCC---------CCccccccccccc--ccccccccchhHHHHHHHHHH Confidence 3333 34667777777888999999999999743 1235677665432 222222222224455555544 Q ss_pred hhhc-cCceEEEecChHHhhhHHHHhhCcceeeeccCCCc-----------ceeeeehhhhcC-CCcceecccceecCCC Q lcl|NC_018856. 214 VGKG-YGRATDAFMPIGVQADFTNNLLDRQRVIQPSTAGG-----------FSTGFSINQFLS-TRGAINLHGSTIMEND 280 (479) Q Consensus 214 i~~~-fG~~td~~mp~~vka~f~~~~~~~qrv~~~~n~g~-----------~~~G~~I~~~~s-~~G~I~l~~s~~m~~~ 280 (479) +... ...++-++|++...+.+...-...-|.+.+...+. --.|++|-.... +.|.+-| |+. .+. T Consensus 282 ~~~~~~~~~~~~vmn~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~~~~~~~~l~G~pv~~s~~~~~~~~~~-gd~--~~~ 358 (413) T protein:vir:81 282 ISLATPFQADALVINPLDYQELRLAKDANGQYYGGGVFQGQYGSGGIMLDPAPWGLRTVQSQVVPVGKPVV-GAF--RSA 358 (413) T ss_pred hhhhccCCCcEEEEcHHHHHHHHHhhccCCceeccccccccccccccccCceecceeeEEcCCCCcccEEE-Eec--ccE Confidence 4333 33556689999998887654443334433211110 123443321111 2333211 110 000 Q ss_pred ceecccCCcCCCCCCCceeEeeeeccCCCCCCCcccccceEEE------EEEEcCcCccccccceeeeeecC Q lcl|NC_018856. 281 NILLEGRNPEPNAPQAPASVVASIVDDKKGGFRDEDIKTHSYK------VVVHSDDAESLPSEAVTAAVAKK 346 (479) Q Consensus 281 ~~L~e~~~~~~~AP~~pa~v~at~~t~~~G~f~~~d~gty~Yk------Vtavn~~GES~pS~~vt~Tv~~~ 346 (479) -.+.. . .........-. +.-|+. +...|+ +...+..+--.-.... .+++ T Consensus 359 ~~~~~-------~-~~~~v~~~~~~---~~~~~~---~~~~~r~~~r~d~~~~~~~a~~~l~~~~---~~~p 413 (413) T protein:vir:81 359 ASVLR-------K-GGVRIDSTNTN---VDDFEN---NLITVRAEERVGLMVTFPEAIVQLDVAE---VVTP 413 (413) T ss_pred EEEEE-------e-cceEEEEeccc---cchhhc---CcEEEEEEEeeccEEecccceEEEEecC---CCCC Confidence 00100 0 00111111000 001110 011111 1122222111000000 1111 No 96 >protein:vir:104256 Length: 458 # NCBI annotation: major head protein precursor # Family: family:all:27070 # MgeID: mge:1504 # MgeName: T5 # Cross-refs: genbank:acc:YP_006977;genbank:gi:46401878;genbank:GeneID:2777673 Probab=93.86 E-value=0.0049 Score=33.23 Aligned_cols=309 Identities=13% Similarity=0.107 Sum_probs=137.5 Q ss_pred CCccchh--------------hhhhhhcCCccchHHHHHHHHHhhh-----cCC------------CcChhhccCccccc Q lcl|NC_018856. 1 MTELKKE--------------AEAKNKKLPVEAEAELAELVSKSFT-----TGY------------GITPDTQLDGAAVR 49 (479) Q Consensus 1 ~~~~~~~--------------~~~~~~~~~~~~~~~~~e~~~Ks~t-----ag~------------~~~p~~~~~gaalr 49 (479) +.+.+++ .+...............+.-.|++. .+. .....+..+|+.+- T Consensus 95 ~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~g~~~i 174 (458) T protein:vir:10 95 IVGLQDEIKSLLTAREGRSFVGDSVAKALYGTQENFEDEVEKLVLLSYVMEKGVFETEHGQRHLKAVNQSSSVEVSSESY 174 (458) T ss_pred HHHHHHHHHHHHHHHHhhhhhhhhhhccchhhhhhHHHHHHHHHHHHHHHhhccchhhhhhhhhhhhhhcccCcccccee Confidence 1111000 0000000111100001110001110 000 00001223567777 Q ss_pred hhhhhhhhhhheeccccccchhhccccchhHHHHhhhhhhccCcccccccccccccc------cccCcceEEEEEEEEee Q lcl|NC_018856. 50 RELLEDQVKMLAFSSNDFTIYPLINKQQVNSTVAKYAVFNQHGRTGHSRFVREVGVA------SINDPNIRQKTVQMKFL 123 (479) Q Consensus 50 ~esld~~~~~l~~~~~~f~f~~~i~k~~~~stv~eY~~~~~~G~~g~~~fv~E~g~~------~~~d~~~~r~~~~~k~l 123 (479) .+.+.++|..+.... -.+.+.....++.+....|.+.. +.+...+++|++.. +.+++.+.+.....+=+ T Consensus 175 p~~~~~~ii~~~~~~--~~l~~~~~~~~~~~~~~~~~~~~---~~~~a~~v~e~~~~~~~~~~~~~~~~~~~i~~~~~k~ 249 (458) T protein:vir:10 175 ETIFSQRIIRDLQKE--LVVGALFEELPMSSKILTMLVEP---DAGKATWVAASTYGTDTTTGEEVKGALKEIHFSTYKL 249 (458) T ss_pred hhhHhHHHHHHHHhh--hhHHhhcceeecCCcceEEEEec---CCcceeecccccccccccccccccccceeeEeeeeeE Confidence 777777775444332 34555555556666555555533 33345666766554 45678888888887777 Q ss_pred eehhhhhhhHhhhcchhhHHHHHHHHHHHHHHHHHHHHHhhcccccCCCCCCcccchhhhHHHhhccC-CCE-EEccC-- Q lcl|NC_018856. 124 SDTKQQSLAAGLVNNIADPMTILTEDAIAVIAKSIEWAIFYGDAALSSEADGQAGIEFDGLHKLIDQD-TNV-IDLKG-- 199 (479) Q Consensus 124 ~~~~~vs~~~~lvn~~~Dp~~~~~~~ai~~~~~~iE~a~f~Gd~~l~~~~~~~~gleFDGl~~~I~~~-~Nv-iDarG-- 199 (479) +.--.+|.-+ +.++..|.+....+.....++..++.++|+||-.= +..|+.+..... .++ .+.-+ T Consensus 250 ~~~v~is~el-l~ds~~~~~~~i~~~l~~~i~~~~d~~~l~G~G~~----------~p~Gi~~~~~~~~~~~~~~~~~~~ 318 (458) T protein:vir:10 250 AAKSFITDET-EEDAIFSLLPLLRKRLIEAHAVSIEEAFMTGDGSG----------KPKGLLTLASEDSAKVVTEAKADG 318 (458) T ss_pred EeeehhhHHH-HhcchHHHHHHHHHHHHHHHHHHHHHHhhcCCCCC----------ccceeeecccccccceeecccccc Confidence 7766677653 34455678888889999999999999999998541 235666654321 122 22222 Q ss_pred -CCCCHHHHhhhhhhhhhccCceEEEecChHHhhhHHHHhhCcceeee-cc---CCCc----ceeeeehh--hhcCCC-- Q lcl|NC_018856. 200 -ARLDEATLNKAAVIVGKGYGRATDAFMPIGVQADFTNNLLDRQRVIQ-PS---TAGG----FSTGFSIN--QFLSTR-- 266 (479) Q Consensus 200 -~~l~~~~l~~aa~~i~~~fG~~td~~mp~~vka~f~~~~~~~qrv~~-~~---n~g~----~~~G~~I~--~~~s~~-- 266 (479) ..++.+.|.++-..+..+|......+|+......+...-...-|.+. +. .... .-.|++|- +++... T Consensus 319 ~~~~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~l~~lkd~~G~~i~~~~~~~~~~~~~~~~l~G~pv~~~~~~p~~~~ 398 (458) T protein:vir:10 319 SVLVTAKTISKLRRKLGRHGLKLSKLVLIVSMDAYYDLLEDEEWQDVAQVGNDSVKLQGQVGRIYGLPVVVSEYFPAKAN 398 (458) T ss_pred cccccHHHHHHHHHhhhhhhcCCCEEEEcHHHHHHHHhhcccCCceeeccccccccccCcCceecceeeEEccccccccC Confidence 23455666665555667777777799999988877653332223222 11 1111 01233321 111110 Q ss_pred -cceecccceecCCCceecccCCcCCCCCCCceeEeeeeccCCCCCCCcccccceEEEEEEEcCcCccccccceeeeeec Q lcl|NC_018856. 267 -GAINLHGSTIMENDNILLEGRNPEPNAPQAPASVVASIVDDKKGGFRDEDIKTHSYKVVVHSDDAESLPSEAVTAAVAK 345 (479) Q Consensus 267 -G~I~l~~s~~m~~~~~L~e~~~~~~~AP~~pa~v~at~~t~~~G~f~~~d~gty~YkVtavn~~GES~pS~~vt~Tv~~ 345 (479) +.+.+ ++ |.+. ..+++ ..+. .... -..... +-..|+...--.-.--.|+..+..|.++ T Consensus 399 ~~~~~~-~~-f~~~-~~~~~-------~~~~-~v~~--d~~~~~--------~~~~~~~~~r~~~~v~~~~a~v~~~~aa 457 (458) T protein:vir:10 399 SAEFAV-IV-YKDN-FVMPR-------QRAV-TVER--ERQAGK--------QRDAYYVTQRVNLQRYFANGVVSGTYAA 457 (458) T ss_pred CcceEE-EE-eccc-EEEEE-------eece-EEEe--ecccCC--------CceEEEEEEEecceEecccceEEEeecc Confidence 01111 00 0000 00100 0000 0000 000000 0112222221122222345555555444 Q ss_pred C Q lcl|NC_018856. 346 K 346 (479) Q Consensus 346 ~ 346 (479) - T Consensus 458 ~ 458 (458) T protein:vir:10 458 S 458 (458) T ss_pred C Confidence 4 No 97 >protein:vir:1268 Length: 397 # NCBI annotation: hypothetical protein # Family: family:all:21 # MgeID: mge:329 # MgeName: phi-105 # Cross-refs: genbank:acc:NP_690760;genbank:gi:22855000;genbank:GeneID:955203 Probab=93.45 E-value=0.0075 Score=32.23 Aligned_cols=293 Identities=11% Similarity=0.041 Sum_probs=131.4 Q ss_pred CC---ccchhhhhhhhcCCccchHHHHHHHHHhhhcCCCc---------------ChhhccCccccchhhhhhhhhhhee Q lcl|NC_018856. 1 MT---ELKKEAEAKNKKLPVEAEAELAELVSKSFTTGYGI---------------TPDTQLDGAAVRRELLEDQVKMLAF 62 (479) Q Consensus 1 ~~---~~~~~~~~~~~~~~~~~~~~~~e~~~Ks~tag~~~---------------~p~~~~~gaalr~esld~~~~~l~~ 62 (479) .. ........+. .......++....+.|++..+... +-.+-.+|+.|-++.+.+.|..+.. T Consensus 70 ~~~~~~~~~~~~~~~-~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~gg~lvP~~~~~~ii~~~~ 148 (397) T protein:vir:12 70 VPEQERNPEGQRSQG-QGNEERQQQYSKAFLKGLRGKRLTDEERDLLDSPEFRAMSGINDEDGGILIPEDIGRQIHEFKR 148 (397) T ss_pred hhhhhhhhccccccc-chhhHHHHHHHHHHHHHHhccCCcHHHHHHHhhhhhhhccccccccCcccCchhHHHHHHHhhh Confidence 00 0000000000 000001111222233333322210 0112345777888888887765544 Q ss_pred ccccccchhhccccchhHHHHhhhhhhccCccccccccccccccc-ccCcceEEEEEEEEeeeehhhhhhhHhhhcchhh Q lcl|NC_018856. 63 SSNDFTIYPLINKQQVNSTVAKYAVFNQHGRTGHSRFVREVGVAS-INDPNIRQKTVQMKFLSDTKQQSLAAGLVNNIAD 141 (479) Q Consensus 63 ~~~~f~f~~~i~k~~~~stv~eY~~~~~~G~~g~~~fv~E~g~~~-~~d~~~~r~~~~~k~l~~~~~vs~~~~lvn~~~D 141 (479) .. -.+++.+...++.+...+|....+.++. ...+++|++..+ .+++.+.......+-++.--.+|.-+- .++..| T Consensus 149 ~~--~~l~~~~~~~~~~~~~~~~~~~~~~~~~-~a~~v~Eg~~~~~~~~~~~~~v~~~~~k~~~~~~is~e~l-~ds~~~ 224 (397) T protein:vir:12 149 QF--EPLEQYVTVEPVTTRSGTRLLEKNADMV-PFSPVEELGNLPEIDQPRFTKVSYSIIDYGGIMTLSNSML-NDSDQA 224 (397) T ss_pred hh--hhHHhhcceeeccCCceeEEEEEecCCc-ceeeecccccccccccccceeEEeeheeeEeeehhhHHHH-hhchHH Confidence 33 3456666666666544445444444443 466999998754 678999999999998888777666432 344557 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhhcccccCCCCCCcccchhhhHHHhhccCCCEEEccCCCCCHHHHhhhhhhhhhccCce Q lcl|NC_018856. 142 PMTILTEDAIAVIAKSIEWAIFYGDAALSSEADGQAGIEFDGLHKLIDQDTNVIDLKGARLDEATLNKAAVIVGKGYGRA 221 (479) Q Consensus 142 p~~~~~~~ai~~~~~~iE~a~f~Gd~~l~~~~~~~~gleFDGl~~~I~~~~NviDarG~~l~~~~l~~aa~~i~~~fG~~ 221 (479) .+....+.-...+++.++.++++|+..-.|.. -+-+|++.+.+.. .+...|... T Consensus 225 l~~~i~~~l~~~~~~~~d~~il~G~g~~~~~g----~~~~~~i~~~~~~----------------------~l~~~~~~~ 278 (397) T protein:vir:12 225 IMTYVAKWFAKKSVVTRNNLILAAIASLKKVD----IDGLDGIKKALNV----------------------TLDPMVAPG 278 (397) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhccccccccc----cccHHHHHHHHhh----------------------ccchhhhCC Confidence 78888888999999999999999998865422 2346666655431 111123333 Q ss_pred EEEecChHHhhhHHHHhhCcceeeeccCCCcceeeeehhhhcCCCcceecccceecCCCceecccCCcCCCCCCCceeEe Q lcl|NC_018856. 222 TDAFMPIGVQADFTNNLLDRQRVIQPSTAGGFSTGFSINQFLSTRGAINLHGSTIMENDNILLEGRNPEPNAPQAPASVV 301 (479) Q Consensus 222 td~~mp~~vka~f~~~~~~~qrv~~~~n~g~~~~G~~I~~~~s~~G~I~l~~s~~m~~~~~L~e~~~~~~~AP~~pa~v~ 301 (479) ...+|++...+.+...-...-|.+.+.+..+. .+.+++..+-...+..++...+...+ .+. T Consensus 279 a~~~~n~~~~~~L~~lkd~~G~~l~~~~~~~g------------------~~~~l~G~pv~~~~~~~~~~~~~~~~-~~~ 339 (397) T protein:vir:12 279 SIVLTNQDGYDWLDTLKDGTGRYLLQPDPTNP------------------TKKLLDGRPVVPFTNRVLKTQKGKAP-LII 339 (397) T ss_pred CEEEEcHHHHHHHHHhhccCCceeecccccCC------------------CCccccceeeEEecccccccCCCccE-EEE Confidence 34678888877776543332233322222111 11122222211111111111000000 000 Q ss_pred e--------------ee--ccCCCCCCCcccccceEEEEEEEcCcCccccccceeeeeecCCceEEEEEEec Q lcl|NC_018856. 302 A--------------SI--VDDKKGGFRDEDIKTHSYKVVVHSDDAESLPSEAVTAAVAKKDNTVKLEVKLA 357 (479) Q Consensus 302 a--------------t~--~t~~~G~f~~~d~gty~YkVtavn~~GES~pS~~vt~Tv~~~g~sv~ltIT~~ 357 (479) + +. .......|. -+...|++..--+- .+..+..-+.+++|.- T Consensus 340 gd~~~~~~~~~~~~~~i~~~~~~~~~f~---~~~~~~r~~~r~d~-----------~~~~~~a~~~~~~t~~ 397 (397) T protein:vir:12 340 GNLKEAIVLFDREQQSIASTDTGAGAFE---TNSTKVRGIEREDV-----------RKWDEDAVVFGQITVE 397 (397) T ss_pred EehhceEEEEeecceEEEEeccccchhh---cCceEEEEEEeecc-----------EEecccceEEEEEeeC Confidence 0 00 000000000 00111222111111 1111222222333322 No 98 >protein:vir:4092 Length: 390 # NCBI annotation: major capsid protein a # Family: family:all:635 # MgeID: mge:86 # MgeName: 2389 # Cross-refs: genbank:acc:NP_510986;swissprot:trembl:q8w604;genbank:gi:17488508;uniprot:Q8W604;genbank:GeneID:1260361 Probab=92.70 E-value=0.01 Score=31.47 Aligned_cols=319 Identities=13% Similarity=0.006 Sum_probs=121.4 Q ss_pred CCccchhhhh----hhhcCCccchHHHHHHHHHhhhcCCCcChhhccCccccchhhhhhhhhhheeccccccchhhcccc Q lcl|NC_018856. 1 MTELKKEAEA----KNKKLPVEAEAELAELVSKSFTTGYGITPDTQLDGAAVRRELLEDQVKMLAFSSNDFTIYPLINKQ 76 (479) Q Consensus 1 ~~~~~~~~~~----~~~~~~~~~~~~~~e~~~Ks~tag~~~~p~~~~~gaalr~esld~~~~~l~~~~~~f~f~~~i~k~ 76 (479) +.+.+.+.+. .......... .+-+...|.+.+- ..-.+-++|+.|-.+.+..+|..+.... ..+++.+... T Consensus 47 ~~~~~~~~~~~~~~~~~~~~~~~~-~l~~~~r~~~~~~--~~~~~~~~gg~lvP~~~~~~I~~~~~~~--s~i~~~~~~~ 121 (390) T protein:vir:40 47 IAQARKEVNREMNDNNVLASRGAN-ALTSDESKYYNEV--IAGNGFAGVTALLPPTVFERVFEDLTVE--HPLLSKINFV 121 (390) T ss_pred HHHHHHHHHHHHHHHHHHHhcCch-hccHHHHHHHHHH--HhccCcccCcccccHHHHHHHHHHHHhh--hhhhhhceee Confidence 0000000000 0000000000 0000011222110 0001224678888888888875443322 3455666655 Q ss_pred chhHHHHhhhhhhccCccccccccccccc-ccccCcceEEEEEEEEeeeehhhhhhhHhhhcchhhHHHHHHHHHHHHHH Q lcl|NC_018856. 77 QVNSTVAKYAVFNQHGRTGHSRFVREVGV-ASINDPNIRQKTVQMKFLSDTKQQSLAAGLVNNIADPMTILTEDAIAVIA 155 (479) Q Consensus 77 ~~~stv~eY~~~~~~G~~g~~~fv~E~g~-~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lvn~~~Dp~~~~~~~ai~~~~ 155 (479) ++.+.-. .+....+.+...++.|++. ++..++.+.+.....+=++.-..+|.-+- .++..|.+....+.-...++ T Consensus 122 ~~~~~~~---~i~~~~~~~~a~~~~E~~~~~~~~~~~f~~i~l~~~k~~~~i~iS~ell-~ds~~~l~~~i~~~la~~i~ 197 (390) T protein:vir:40 122 NTTATTE---WIISVGDVATAWWGPLCAEIKEVLDNGFDKIQTGMYKLSAYIPVCNAML-DLGPSWLDQYVRTILGEAMA 197 (390) T ss_pred ecCCcee---EEEEEcCCcceeeeccccccCccccccceeeEeeeeeEEEeehhhHHHH-hcchHHHHHHHHHHHHHHHH Confidence 5555333 2333455566778899776 45789999999999998887777774332 35566888999999999999 Q ss_pred HHHHHHHhhcccccCCCCCCcccchhhhHHHhhccCCC--EEEccCCCCCHHHHhhhhhhhh--------hccCceEEEe Q lcl|NC_018856. 156 KSIEWAIFYGDAALSSEADGQAGIEFDGLHKLIDQDTN--VIDLKGARLDEATLNKAAVIVG--------KGYGRATDAF 225 (479) Q Consensus 156 ~~iE~a~f~Gd~~l~~~~~~~~gleFDGl~~~I~~~~N--viDarG~~l~~~~l~~aa~~i~--------~~fG~~td~~ 225 (479) ..++.++++|+-.=. -.|+++.+..... ..+.....++-..+......+- +.++.+. ++ T Consensus 198 ~~~~~a~l~G~G~~~----------P~Gil~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~l~~~~~~~~~~~~~~a~-~i 266 (390) T protein:vir:40 198 LGLEAGIVNGSGKDQ----------PIGMMRDLNNVTAGEHPVKTATPLTDLTPATLATKVMLPLTDNGKKSVSDAI-LV 266 (390) T ss_pred HHHHhhhhcccCCCc----------cceeeeccccccccccccccccccchhhHHHHHHHHHHHhhcchhhhhcCce-EE Confidence 999999999985411 2466654432111 1111112222221111111111 1222221 34 Q ss_pred cChHHhhhHHH---HhhC-cceeeeccCCCcceeeeeh--hhhcCCCcceecccce----ecCCCceecccCCcCCCCCC Q lcl|NC_018856. 226 MPIGVQADFTN---NLLD-RQRVIQPSTAGGFSTGFSI--NQFLSTRGAINLHGST----IMENDNILLEGRNPEPNAPQ 295 (479) Q Consensus 226 mp~~vka~f~~---~~~~-~qrv~~~~n~g~~~~G~~I--~~~~s~~G~I~l~~s~----~m~~~~~L~e~~~~~~~AP~ 295 (479) |+.....++-. .+.+ .-+.+.+.. -.|+.| ..+ .+.|.|.| |+- +.++..+.++ .......-. T Consensus 267 ~n~~t~~~~l~~~~~~~d~~G~~v~~~~----~~g~pvv~~~~-~p~~~i~~-Gd~s~~~i~~~~~~~v~-~~~~~~f~~ 339 (390) T protein:vir:40 267 INPADYWSKIYAATSYMTPQGVWVTGIL----PVPLEIVQSVA-VPVGKAVA-GRAKDYFMGIGSEQVIR-TSTEYRLLD 339 (390) T ss_pred EcchhHHHHHHHHhhccCCCCccccccC----CCceeEEEcCC-CCCCcEEE-EeeceEEEEeecceEEE-ecchhhhhc Confidence 55444322111 1111 001111000 011111 111 11222211 110 0000000000 000000000 Q ss_pred CceeEeeeeccCCCCCCCcccccceEEEEEEEcCcCccccccceeeeeecCCceE Q lcl|NC_018856. 296 APASVVASIVDDKKGGFRDEDIKTHSYKVVVHSDDAESLPSEAVTAAVAKKDNTV 350 (479) Q Consensus 296 ~pa~v~at~~t~~~G~f~~~d~gty~YkVtavn~~GES~pS~~vt~Tv~~~g~sv 350 (479) .-+...+..-. +|+... +.+-...++++.+.+. .+|...++++.+..+++- T Consensus 340 ~~~~~r~~~r~--dg~v~~-~~A~~~l~~~~~~~~~-~~~~~~~~~~~~~~~~~~ 390 (390) T protein:vir:40 340 DETLYYAKQYA--NGRPKD-NSSFLVFDITGLEGSP-AIDVNVVNNATPSETPAE 390 (390) T ss_pred CcEEEEEEEEe--CCEEec-ccceEEEEeeccCCCC-CCCcceeeCCCCCCCCCC Confidence 00111111111 111100 0011222233222111 122222333333222222 No 99 >protein:vir:1084 Length: 437 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:21 # MgeName: bIL309 # Cross-refs: genbank:acc:NP_076738;genbank:gi:13095848;genbank:GeneID:920418 Probab=92.57 E-value=0.0033 Score=34.17 Aligned_cols=306 Identities=17% Similarity=0.100 Sum_probs=113.4 Q ss_pred CCccchhhh--------------hhhhcCCccchHH----HHHHHHHhhhcCCC--cChhhccCccccchhhhhhhhhhh Q lcl|NC_018856. 1 MTELKKEAE--------------AKNKKLPVEAEAE----LAELVSKSFTTGYG--ITPDTQLDGAAVRRELLEDQVKML 60 (479) Q Consensus 1 ~~~~~~~~~--------------~~~~~~~~~~~~~----~~e~~~Ks~tag~~--~~p~~~~~gaalr~esld~~~~~l 60 (479) ..+.+++.+ ............. -...+.+.+..+-. ....+..+|+.|..+.+...|..+ T Consensus 100 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~g~lvp~~~~~~i~~~ 179 (437) T protein:vir:10 100 KTETKSEAEKDKKTVKDEEKRDAGGLQDMKLKVGGEIADKKVTAFADYLKTGEVRDVTGIALKDGKVIIPETILTPEKEV 179 (437) T ss_pred HHHHHHHHHHHHHHHHHHHHHhHHHHhHHHHHHHHHHHHhhhhhhHHHHHhhhhhhhhhcccccccccchHHHHHHHHHh Confidence 000000000 0000000000000 00011111211111 111233456677777777766554 Q ss_pred eeccccccchhhccccchhHHHHhhhhhhccCccccccccccccccc-ccCcceEEEEEEEEeeeehhhhhhhHhhhcch Q lcl|NC_018856. 61 AFSSNDFTIYPLINKQQVNSTVAKYAVFNQHGRTGHSRFVREVGVAS-INDPNIRQKTVQMKFLSDTKQQSLAAGLVNNI 139 (479) Q Consensus 61 ~~~~~~f~f~~~i~k~~~~stv~eY~~~~~~G~~g~~~fv~E~g~~~-~~d~~~~r~~~~~k~l~~~~~vs~~~~lvn~~ 139 (479) .. .-.+...+...+..+---+|......+ +...+++|++... .+++.+.+.+..++=++.-..+|.-+ +.++. T Consensus 180 ~~---~~~l~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~e~~~~~e~~~~~~~~v~~~~~k~~~~~~is~el-l~ds~ 253 (437) T protein:vir:10 180 HQ---FPRLGSLVRTESVTTTTGKLPIFNNST--DLLTAHTEYGQTTKNATPVITPILWDLKTYTGGYVFSQEL-ISDSS 253 (437) T ss_pred hh---hhhhhhcceeEeeccCceeeEEeeccc--cccccccccccccccccccceeeeeehhheeeehhhhHHH-HhhhH Confidence 22 123344444444444434444433332 3456788888665 78899999999888777766776643 34556 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHhhcccccCCCCCCcccchhhhHHHhhcc-CCCEEEccCCC-CCHHHHhhhhhhhhhc Q lcl|NC_018856. 140 ADPMTILTEDAIAVIAKSIEWAIFYGDAALSSEADGQAGIEFDGLHKLIDQ-DTNVIDLKGAR-LDEATLNKAAVIVGKG 217 (479) Q Consensus 140 ~Dp~~~~~~~ai~~~~~~iE~a~f~Gd~~l~~~~~~~~gleFDGl~~~I~~-~~NviDarG~~-l~~~~l~~aa~~i~~~ 217 (479) .|......+.-...+...++.+++.|+.+-.+. +......|.+...|.. ..+.+...+.. ++...++....+. .+ T Consensus 254 ~~~~~~i~~~l~~~~~~~~~~~i~~g~g~~~~~--~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~l~~lk-d~ 330 (437) T protein:vir:10 254 YDWQAELQSRLIELRDNTDDSLIITALTDGIKK--TTSTYLLGDLKKVLNVTLKPQDSAAASIVMSQSAYNLFDMAT-DA 330 (437) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhhhcccccc--cccccchhhHHHHHHhhhhhhhhcCCEEEEcHHHHHHHHHhh-cc Confidence 677788888888899999999999999775543 2234456666665541 11222222111 2222222222211 11 Q ss_pred cCceEEEecChHHhhhHHHHhhCcceeeeccC----CCcceeeeehhhhcCCCcceecccceecCCCceecccCCcCCCC Q lcl|NC_018856. 218 YGRATDAFMPIGVQADFTNNLLDRQRVIQPST----AGGFSTGFSINQFLSTRGAINLHGSTIMENDNILLEGRNPEPNA 293 (479) Q Consensus 218 fG~~td~~mp~~vka~f~~~~~~~qrv~~~~n----~g~~~~G~~I~~~~s~~G~I~l~~s~~m~~~~~L~e~~~~~~~A 293 (479) -|. .+|.|. +...-...++++..++.++. .+....-+.+.+|... -+++++..+-++ T Consensus 331 ~g~--~~~~~~-~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~gd~~~~--------~~~~~r~~~~~~-------- 391 (437) T protein:vir:10 331 MGR--PLLQPN-VTAATGYTLLGKTVVIVDDKLFPSASAGDVNIVVAPLKKA--------VINFKLTEITGQ-------- 391 (437) T ss_pred CCC--eeeccC-ccCCCCcccccceeEEecccccCCcCCCceEEEEeecccc--------EEEEeeeceEEE-------- Confidence 121 133331 11111112222222211100 0000000111111110 001111100000 Q ss_pred CCCceeEeeeeccCCCCCCCcccccceEEEEEEEcCcCcc-----ccccceeeeeec Q lcl|NC_018856. 294 PQAPASVVASIVDDKKGGFRDEDIKTHSYKVVVHSDDAES-----LPSEAVTAAVAK 345 (479) Q Consensus 294 P~~pa~v~at~~t~~~G~f~~~d~gty~YkVtavn~~GES-----~pS~~vt~Tv~~ 345 (479) ... .... +.-.......|-+.+++..+-. .|...++...++ T Consensus 392 ------~~~--~~~~---~~~~~~~~~r~d~~~~~~~a~~~l~~~~~~~~~~~~~~~ 437 (437) T protein:vir:10 392 ------FQD--TYDI---WYKQLGIFLRQNVVQASKDLIVNLTGKLKAVTVVQSTAV 437 (437) T ss_pred ------Eec--cccc---ccceeeEEEEEccEEecccceEEEEeeccccccCCCCCC Confidence 000 0000 0000000011111222211100 011110000000 No 100 >protein:vir:2344 Length: 397 # NCBI annotation: gp14 # Family: family:all:507 # MgeID: mge:51 # MgeName: Bxb1 # Cross-refs: genbank:acc:NP_075281;genbank:gi:12657868;genbank:GeneID:920118 Probab=91.84 E-value=0.014 Score=30.75 Aligned_cols=329 Identities=11% Similarity=0.059 Sum_probs=142.2 Q ss_pred CCccchHHHHHHHHHhhhcCCCcChhhccCccccchhhhhhhhhhheeccccccchhhccccchhHHHHhhhhhhccCcc Q lcl|NC_018856. 15 LPVEAEAELAELVSKSFTTGYGITPDTQLDGAAVRRELLEDQVKMLAFSSNDFTIYPLINKQQVNSTVAKYAVFNQHGRT 94 (479) Q Consensus 15 ~~~~~~~~~~e~~~Ks~tag~~~~p~~~~~gaalr~esld~~~~~l~~~~~~f~f~~~i~k~~~~stv~eY~~~~~~G~~ 94 (479) |-...+.... ..++-+ .+|+.|-.|...+-|..+. +...+.+...+.+..+.-.+|.++. +. T Consensus 1 ~g~~~e~~~~--~~~~t~----------~~~g~l~~~~~~~ii~~l~---~~s~i~~l~~~~~~~~~~~~ip~~~---~~ 62 (397) T protein:vir:23 1 MGFSADHSQI--AQTKDT----------MFTGYLDPVQAKDYFAEAE---KTSIVQRVAQKIPMGATGIVIPHWT---GD 62 (397) T ss_pred CCcCHHHHHH--hhccCC----------CCccccchhHHHHHHHHHH---hccchhhhcceeeccCCceEEEEEc---CC Confidence 2222221111 101111 1134455555554444443 2234555555555555434455543 34 Q ss_pred cccccccccccccccCcceEEEEEEEEeeeehhhhhhhHhhhcchhhHHHHHHHHHHHHHHHHHHHHHhhcccccCCCCC Q lcl|NC_018856. 95 GHSRFVREVGVASINDPNIRQKTVQMKFLSDTKQQSLAAGLVNNIADPMTILTEDAIAVIAKSIEWAIFYGDAALSSEAD 174 (479) Q Consensus 95 g~~~fv~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lvn~~~Dp~~~~~~~ai~~~~~~iE~a~f~Gd~~l~~~~~ 174 (479) ....|++|++..+.+++.+.+....+|=++--..+|.-+-. ++..|.+....+.-...+++.+|.++++|+..-. T Consensus 63 ~~a~wv~Eg~~~~~s~~~f~~v~l~~~k~~~~v~iS~ell~-ds~~~l~~~i~~~l~~aia~~~d~a~l~G~gt~~---- 137 (397) T protein:vir:23 63 VSAQWIGEGDMKPITKGNMTKRDVHPAKIATIFVASAETVR-ANPANYLGTMRTKVATAIAMAFDNAALHGTNAPS---- 137 (397) T ss_pred cceEEecCCccccccccceeEEEEeeEEEEEeehhhHHHHh-cchHHHHHHHHHHHHHHHHHHHHHHHhhcccCCc---- Confidence 45679999999999999999999999999988888875433 4567888999999999999999999999997621 Q ss_pred CcccchhhhHHHhhccCCCEEEccCCCCCHHHHhhhhhhhhhccCceEEEecChHHhhhHHHHhhCcceeeeccCC-Cc- Q lcl|NC_018856. 175 GQAGIEFDGLHKLIDQDTNVIDLKGARLDEATLNKAAVIVGKGYGRATDAFMPIGVQADFTNNLLDRQRVIQPSTA-GG- 252 (479) Q Consensus 175 ~~~gleFDGl~~~I~~~~NviDarG~~l~~~~l~~aa~~i~~~fG~~td~~mp~~vka~f~~~~~~~qrv~~~~n~-g~- 252 (479) ...|+...-. ...-..+.......+ .+...+...|....-..|+......+...-...-|.+.+.+. ++ T Consensus 138 -----~~~~~~~~~~---~~~~~~~~~~~~~~~-~~~~~l~~~~~~~a~~vmn~~~~~~L~~lkd~~G~~i~~~~~~~~~ 208 (397) T protein:vir:23 138 -----AFQGYLDQSN---KTQSISPNAYQGLGV-SGLTKLVTDGKKWTHTLLDDTVEPVLNGSVDANGRPLFVESTYESL 208 (397) T ss_pred -----cccccccccc---ceeeecccchhHHHH-HHHHhhhhcccCCCEEEEcHHHHHHHHHhhccCCceeecccccccc Confidence 1233332221 222223333333333 444456667788888999999998887644333333332221 11 Q ss_pred -------ceeeeehh--hhcCCCcc------------------eecccce-----------------ecCCCcee----- Q lcl|NC_018856. 253 -------FSTGFSIN--QFLSTRGA------------------INLHGST-----------------IMENDNIL----- 283 (479) Q Consensus 253 -------~~~G~~I~--~~~s~~G~------------------I~l~~s~-----------------~m~~~~~L----- 283 (479) .-.|+++- ... +.|. +.+.-+. |..+-..+ T Consensus 209 ~~~~~~~tl~G~Pv~~s~~~-~~g~~~~~~gDfs~~~i~~~~~i~i~~~~e~~~~~~~~~~~~~~~lf~~d~v~~ra~~r 287 (397) T protein:vir:23 209 TTPFREGRILGRPTILSDHV-AEGDVVGYAGDFSQIIWGQVGGLSFDVTDQATLNLGSQESPNFVSLWQHNLVAVRVEAE 287 (397) T ss_pred cccccCceeeeeeEEEeCCC-CCCceEEEEeecceEEEEEEeceEEEEeeeeeeeeccccccceeeeeeccceeEEEEee Confidence 12233222 111 1222 2211110 00000000 Q ss_pred cccCCcCCCCCC------CceeEeeeeccCCCCCCCccc----ccceEEEEEEEcCcC--ccccc--cceeeeeecCCce Q lcl|NC_018856. 284 LEGRNPEPNAPQ------APASVVASIVDDKKGGFRDED----IKTHSYKVVVHSDDA--ESLPS--EAVTAAVAKKDNT 349 (479) Q Consensus 284 ~e~~~~~~~AP~------~pa~v~at~~t~~~G~f~~~d----~gty~YkVtavn~~G--ES~pS--~~vt~Tv~~~g~s 349 (479) ....+..|.|-. .....+.++.+.++|.|+-.- ...-.|..++.+-.+ |.++. ....+++.+.+. T Consensus 288 ~d~~v~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~- 366 (397) T protein:vir:23 288 YGLLINDVNAFVKLTFDPVLTTYALDLDGASAGNFTLSLDGKTSANIAYNASTATVKSAIVAIDDGVSADDVTVTGSAG- 366 (397) T ss_pred eccceecccceEEEeeccccceeeecccccCcceEEEEecCccccCcccccchhhhHHHhhhcccccccceeeeecCCc- Confidence 111111221110 011111122233344332110 000112111111111 11110 111112222111 Q ss_pred EEEEEEecCCCCcccceEEEEEecCCCcceEEEEeeeeeeecCCceEEEeeccccCCCcccc---eec Q lcl|NC_018856. 350 VKLEVKLASLYQAQPQFISVYREGTETGHYFLIARVPVSKVNDQGVIEVLDRNQVIPETTDV---FVG 414 (479) Q Consensus 350 v~ltIT~~~~~~a~~~~y~IYR~~~~~G~y~li~rv~vs~~n~~g~T~ftD~N~~iPgT~~~---fvG 414 (479) --+|++++...+ .|....-+.+..+ =+| T Consensus 367 -~~~~~~~~~~~~------------------------------------~~~~~~~~~~~~~~~~~~~ 397 (397) T protein:vir:23 367 -DYTITVPGTLTA------------------------------------DFSGLTDGEGASISVVSVG 397 (397) T ss_pred -eeEEEecccccc------------------------------------CccccccCccccceeeecC Confidence 112222211100 1111110000000 011 No 101 >protein:vir:6242 Length: 390 # NCBI annotation: gp36 # Family: family:all:21 # MgeID: mge:131 # MgeName: phi-BT1 # Cross-refs: genbank:acc:NP_813696;swissprot:trembl:q859c1;genbank:gi:29366756;interpro:IPR006444;uniprot:Q859C1;genbank:GeneID:1258897 Probab=91.81 E-value=0.014 Score=30.73 Aligned_cols=305 Identities=11% Similarity=0.020 Sum_probs=125.5 Q ss_pred CCccchhhhhhhhcCCccchHHH---HHHHHHhh-hcCCCcChhhccCccccchhhhhhhhhhheeccccccchhhcccc Q lcl|NC_018856. 1 MTELKKEAEAKNKKLPVEAEAEL---AELVSKSF-TTGYGITPDTQLDGAAVRRELLEDQVKMLAFSSNDFTIYPLINKQ 76 (479) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~---~e~~~Ks~-tag~~~~p~~~~~gaalr~esld~~~~~l~~~~~~f~f~~~i~k~ 76 (479) +...+...............+.+ ...-.+++ .+.....-....+|+.+-.+..++.|..+.. ....++.+..+ T Consensus 71 ~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~r~~~~~~~~~~~t~~~~g~~~~~~~~~~~i~~~~~---~~~~l~~~~~~ 147 (390) T protein:vir:62 71 LSGLQGSGSGAQRSADVDDDATLRAGNLGEARSFEFAPEKRDGTKAGNPNVLSRTLYGQLIAQAVE---RSAIMRGGATT 147 (390) T ss_pred HhhcccccccchhhcchHHHHHHhhhhhhhhHHHHhhhhhhcccccCCCccccccchHHHHHHHHh---hhhhhhhccee Confidence 00000000000000000000000 00000111 0000111112223455555555555543332 22333333221 Q ss_pred chhHHHHhhhhhhccCcccccccccccccccccCcceEEEEEEEEeeeehhhhhhhHhhhcchhhHHHHHHHHHHHHHHH Q lcl|NC_018856. 77 QVNSTVAKYAVFNQHGRTGHSRFVREVGVASINDPNIRQKTVQMKFLSDTKQQSLAAGLVNNIADPMTILTEDAIAVIAK 156 (479) Q Consensus 77 ~~~stv~eY~~~~~~G~~g~~~fv~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lvn~~~Dp~~~~~~~ai~~~~~ 156 (479) --.+.-..+. +....+.....+++|++..+.+++.+.+....++=++.-..+|.-+= .++.-|.+....+.--..+++ T Consensus 148 ~~~~~~~~~~-~p~~~~~~~a~wv~E~~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell-~ds~~~l~~~i~~~l~~~i~~ 225 (390) T protein:vir:62 148 FTTSDANPLD-FTVITGRSSASIVGETAEIPESYPATAQRSMGGFKYGFASVVSYEFA-TDQVLDLVGFLVSDAGPAIGD 225 (390) T ss_pred eecCCCceeE-EEEEcCCcceeeecccccccccccceeeeEeeeeeEEeehHHHHHHH-hhhhHHHHHHHHHHHHHHHHH Confidence 1111111111 11223334677899999999999999999999998887777775443 344557777788888889999 Q ss_pred HHHHHHhhcccccCCCCCCcccchhhhHHHhhccCCCEEEcc-CCCCCHHHHhhhhhhhhhccCceEEEecChHHhhhHH Q lcl|NC_018856. 157 SIEWAIFYGDAALSSEADGQAGIEFDGLHKLIDQDTNVIDLK-GARLDEATLNKAAVIVGKGYGRATDAFMPIGVQADFT 235 (479) Q Consensus 157 ~iE~a~f~Gd~~l~~~~~~~~gleFDGl~~~I~~~~NviDar-G~~l~~~~l~~aa~~i~~~fG~~td~~mp~~vka~f~ 235 (479) .++.++++|+-+ | -||.+......+.+... ...++.+.|.++-.-+..+|-.---.+|+....+.+. T Consensus 226 ~~d~~~l~G~G~--p----------~Gi~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~l~~~~~~~a~~vmn~~~~~~L~ 293 (390) T protein:vir:62 226 AMGRHFITGTGQ--P----------RGILTDASPATATFLATDTDSKVSDALIDLFHEVPSAYRANAKYVVNDLRAAQMR 293 (390) T ss_pred HHHhhhhccCCc--c----------ccccccccccccceecccccccchHHHHHHHHhhhhhhhcCCEEEEchHHHHHHH Confidence 999999999742 2 46766665443433332 2345545444332222334432224789999888876 Q ss_pred HHhhCcceeeeccCCCc----ceeeeehh--hhcCCCcceecccc----eecCCCceecccCCcCCCCCCCceeEeeeec Q lcl|NC_018856. 236 NNLLDRQRVIQPSTAGG----FSTGFSIN--QFLSTRGAINLHGS----TIMENDNILLEGRNPEPNAPQAPASVVASIV 305 (479) Q Consensus 236 ~~~~~~qrv~~~~n~g~----~~~G~~I~--~~~s~~G~I~l~~s----~~m~~~~~L~e~~~~~~~AP~~pa~v~at~~ 305 (479) ..=...-|.+...+... .-.|++|- ... +.+.|-| || .+..+..+.++.. ..+.+-..-+...+... T Consensus 294 ~lkd~~g~~l~~~~~~~g~~~~l~G~Pv~~~~~~-p~~~i~~-gd~s~~~i~~~~~~~v~~~-~~~~~~~~~~~~~~~~r 370 (390) T protein:vir:62 294 KLKDANGQYLWQSGLTVGAPSLFNGKVVETDDGM-PADKILF-ADLSKYRVRFAGSLRVDRS-VDAKFSTDQIVYRFLQR 370 (390) T ss_pred HhhccCCCeeecCCcCCCccceecccceEEecCC-CCccEEE-eeccceeEEeecceEEEee-ccccccCCcEEEEEEEE Confidence 53333334443222211 12343332 211 2223322 22 1111111111100 00000000111111111 Q ss_pred cCCCCCCCcccccceEEEEEEEc Q lcl|NC_018856. 306 DDKKGGFRDEDIKTHSYKVVVHS 328 (479) Q Consensus 306 t~~~G~f~~~d~gty~YkVtavn 328 (479) . +|+.....+ ....+|++.. T Consensus 371 ~--d~~~~~~~A-~~~l~~~~~a 390 (390) T protein:vir:62 371 A--DGLLVDARG-AKVLTVTPGA 390 (390) T ss_pred e--CcEeechhh-eEEEEeecCC Confidence 1 111111000 1122222222 No 102 >protein:vir:4511 Length: 409 # NCBI annotation: capsid # Family: family:all:21 # MgeID: mge:97 # MgeName: V # Cross-refs: genbank:acc:NP_599037;genbank:gi:19548995;genbank:GeneID:935211 Probab=91.46 E-value=0.016 Score=30.46 Aligned_cols=316 Identities=10% Similarity=0.014 Sum_probs=133.0 Q ss_pred CCcc-----chh-hhhhhhcCCccc---hHHHHHHHH---------------HhhhcCCCcChhhccCccccchhhhhhh Q lcl|NC_018856. 1 MTEL-----KKE-AEAKNKKLPVEA---EAELAELVS---------------KSFTTGYGITPDTQLDGAAVRRELLEDQ 56 (479) Q Consensus 1 ~~~~-----~~~-~~~~~~~~~~~~---~~~~~e~~~---------------Ks~tag~~~~p~~~~~gaalr~esld~~ 56 (479) +.+. +.. .+.+....+... ..+..+.|. |.+..--.....+..+|+.|-++-+..+ T Consensus 57 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~l~~~~~~~~~~e~~~~~~~~a~~~~~~~~gg~liP~~~~~~ 136 (409) T protein:vir:45 57 LRRQDQAYIESNEEEQRQNLDPENNSQQDEKRAQVFDKWMRHGASELTSEERKALRELRAQGVAQDEKGGYTVPETFLAK 136 (409) T ss_pred HHHHHHHHHhhhhhhhcccCCCCCcchhhHHHHHHHHHHHHhhhhhccHHHHHHHHHHhhccCccCcCCceeccHhHHHH Confidence 0000 000 000000000000 000001111 1111100111112234667777777777 Q ss_pred hhhheeccccccchhhccccchhHHHHhhhhhhccCc-ccccccccccccccccCcceEEEEEEE-EeeeehhhhhhhHh Q lcl|NC_018856. 57 VKMLAFSSNDFTIYPLINKQQVNSTVAKYAVFNQHGR-TGHSRFVREVGVASINDPNIRQKTVQM-KFLSDTKQQSLAAG 134 (479) Q Consensus 57 ~~~l~~~~~~f~f~~~i~k~~~~stv~eY~~~~~~G~-~g~~~fv~E~g~~~~~d~~~~r~~~~~-k~l~~~~~vs~~~~ 134 (479) |..+.... ..+.+.+...++.+- .+..+...++ ...+.+++|++..+..++.+....... |+.+.--.+|.-+ T Consensus 137 ii~~~~~~--~~l~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~v~E~~~~~~~~~~f~~~~l~~~k~~~~~i~is~el- 211 (409) T protein:vir:45 137 VVEKMKSY--GGIASVAQILTTSDG--RTMEWATADGTSEVGVLLGENEEAGEEDTDFGMGSLGALKMTSKIIRVSNEL- 211 (409) T ss_pred HHHHHHhh--hhhhhhceeeecCCC--ceEEEEeeccCccccccccccccccccccccceeeeeeeeeeeeehhhhHHH- Confidence 65544332 233343333333321 1112222333 334679999999999999999888654 5554333344432 Q ss_pred hhcchhhHHHHHHHHHHHHHHHHHHHHHhhcccccCCCCCCcccchhhhHHHhhccCCCEEEccCCCCCHHHHhhhhhhh Q lcl|NC_018856. 135 LVNNIADPMTILTEDAIAVIAKSIEWAIFYGDAALSSEADGQAGIEFDGLHKLIDQDTNVIDLKGARLDEATLNKAAVIV 214 (479) Q Consensus 135 lvn~~~Dp~~~~~~~ai~~~~~~iE~a~f~Gd~~l~~~~~~~~gleFDGl~~~I~~~~NviDarG~~l~~~~l~~aa~~i 214 (479) +.++.-|.+....+.--..+...++.++++|+..-.+ .+..|+.+..... +.. +....++.+.|.++..-+ T Consensus 212 l~ds~~~l~~~i~~~la~a~~~~~~~a~l~G~G~~~~-------~~p~Gil~~~~~~-~~~-~~~~~~~~d~i~~l~~~l 282 (409) T protein:vir:45 212 LQDSAIDMEAYLARRIAERIGRGEARYLIQGTGAGTP-------KQPKGLAASVTGT-TQT-AAANAVKWQEILALKHSI 282 (409) T ss_pred HhccHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCc-------cccceeeeccccc-ccc-ccccccchHHHHHHHHhh Confidence 1344457778888888888999999999999976432 3457776665432 222 233445555555544334 Q ss_pred hhcc-CceEE-EecChHHhhhHHHHhhCcceeeeccCCCc----ceeeeehh--hhcC---CCcceecccceecCCCcee Q lcl|NC_018856. 215 GKGY-GRATD-AFMPIGVQADFTNNLLDRQRVIQPSTAGG----FSTGFSIN--QFLS---TRGAINLHGSTIMENDNIL 283 (479) Q Consensus 215 ~~~f-G~~td-~~mp~~vka~f~~~~~~~qrv~~~~n~g~----~~~G~~I~--~~~s---~~G~I~l~~s~~m~~~~~L 283 (479) ..+| ..+.- ++|+..+.+.+...-...-|.+...+... .-.|++|- .++- .....-+.|+. .+ -++ T Consensus 283 ~~~~~~~a~~~~~~n~~~~~~l~~lkd~~G~~i~~~~~~~~~~~~l~G~PV~~~~~~p~~~~~~~~i~~Gd~--~~-~~i 359 (409) T protein:vir:45 283 DPAYRRGPKFRLAFNDNTLKLISEMEDGQGRPLWLPDIVGVAPASVLNVPYVIDQEIDDIGAGKKFMFCGDF--DR-FII 359 (409) T ss_pred hhhhccCCeEEEEECHHHHHHHHHhhcCCCceeeccCcCCCCCceecceeeEEecCcCCccCCccEEEEeeh--hh-hhe Confidence 4443 44443 45788887777654444445543322211 23444332 1111 11111122210 00 001 Q ss_pred cccCCcCCCCCCCceeEeeeeccCCCCCCCcccccceEEEEEEEcCcCccccccceeeeeecCCce Q lcl|NC_018856. 284 LEGRNPEPNAPQAPASVVASIVDDKKGGFRDEDIKTHSYKVVVHSDDAESLPSEAVTAAVAKKDNT 349 (479) Q Consensus 284 ~e~~~~~~~AP~~pa~v~at~~t~~~G~f~~~d~gty~YkVtavn~~GES~pS~~vt~Tv~~~g~s 349 (479) .. . .... .... .+. |. .-+...|++..-=+.+=-.|...+..++.+...+ T Consensus 360 ~~-------~-~~~~--~~~~-~d~---~~--~~~~~~~~~~~r~d~~~~~~~A~~~l~~k~s~~~ 409 (409) T protein:vir:45 360 RR-------V-RYMI--LKRL-VER---YA--EYDQTGFLAFHRFDCILEDTSAIKALVGKGSVGG 409 (409) T ss_pred ee-------c-cceE--EEEe-ecc---cc--cCCcEEEEEEEEeccEeechhheEEEEeccCCCC Confidence 00 0 0010 0000 011 00 0012333333322222222333333333222222 No 103 >protein:vir:9643 Length: 377 # NCBI annotation: major coat protein # Family: family:all:635 # MgeID: mge:173 # MgeName: 315.1 # Cross-refs: genbank:acc:NP_795405;genbank:gi:28876178;genbank:GeneID:1257724 Probab=91.04 E-value=0.012 Score=31.21 Aligned_cols=292 Identities=15% Similarity=0.048 Sum_probs=126.9 Q ss_pred CC-----ccchhhhhhhhcCC--ccchHHHHHHHHHhhhcCCCcChhhccCccccchhhhhhhhh-hheeccccccchhh Q lcl|NC_018856. 1 MT-----ELKKEAEAKNKKLP--VEAEAELAELVSKSFTTGYGITPDTQLDGAAVRRELLEDQVK-MLAFSSNDFTIYPL 72 (479) Q Consensus 1 ~~-----~~~~~~~~~~~~~~--~~~~~~~~e~~~Ks~tag~~~~p~~~~~gaalr~esld~~~~-~l~~~~~~f~f~~~ 72 (479) +. +.++|.+......+ ...-.+--+++++..+. .+-.+|+.|.++.+.+.|. .| .....+++. T Consensus 42 ~~~~~~~~~~~e~~~~~~~~~~~~~lt~ee~~~~~~~~~~------~~~~~gg~lvP~~~~~~I~~~l---~~~s~i~~~ 112 (377) T protein:vir:96 42 MGDEILAKNEEEMERMFDLRDKNRELTAEEIKFFNDIDKN------VGGKDKFKLLPEETMVQVFDDL---VAEHPLLKV 112 (377) T ss_pred HHHHHHHHHHHHHHHHHHhccCCcccCHHHHHHHHHHHhc------CCCCCCceecCHHHHHHHHHHH---Hhhhhhhhh Confidence 00 00011111110000 00000001111111111 2345678888887776664 33 223455555 Q ss_pred ccccchhHHHHhhhhhhccCccccccccccccc-ccccCcceEEEEEEEEeeeehhhhhhhHhhhcchhhHHHHHHHHHH Q lcl|NC_018856. 73 INKQQVNSTVAKYAVFNQHGRTGHSRFVREVGV-ASINDPNIRQKTVQMKFLSDTKQQSLAAGLVNNIADPMTILTEDAI 151 (479) Q Consensus 73 i~k~~~~stv~eY~~~~~~G~~g~~~fv~E~g~-~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lvn~~~Dp~~~~~~~ai 151 (479) +...++.+.+ ++......+.+.++.|.+. ++..++.+.+.....+=|+.--.+|..+ |.++..|.+....+.-- T Consensus 113 ~~v~~~~~~~----~i~~~~~~~~a~wv~e~~~~~~~~~~~f~~i~l~~~kl~~~~~is~~l-l~ds~~~le~~i~~~l~ 187 (377) T protein:vir:96 113 INFKNTSLRL----KALTAETSGTAVWGDIFGEIKGQLKQAFKEQDFSQFKLTAFVVIPKDA-LKFGPKWLKQFITEQLK 187 (377) T ss_pred ceeEecCCce----EEEEecCCcceeEeecccccccccCccceeEeeeeeeEEeechhhHHH-hhcchhhHHHHHHHHHH Confidence 5555554432 2333445556778899876 5678999999999999998877777665 56778899999999999 Q ss_pred HHHHHHHHHHHhhcccccCCCCCCcccchhhhHHHhhccCC----------CEEEc---cC--CCCCHHHHhhh-hhh-- Q lcl|NC_018856. 152 AVIAKSIEWAIFYGDAALSSEADGQAGIEFDGLHKLIDQDT----------NVIDL---KG--ARLDEATLNKA-AVI-- 213 (479) Q Consensus 152 ~~~~~~iE~a~f~Gd~~l~~~~~~~~gleFDGl~~~I~~~~----------NviDa---rG--~~l~~~~l~~a-a~~-- 213 (479) .++++.++.+++.||-+=- --||++-+.... -+++. -| ..++.+.+-+. ..+ T Consensus 188 ~~~~~~~~~a~i~G~G~~~----------P~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~ 257 (377) T protein:vir:96 188 EAIAVALELAIVKGNGLLQ----------PVGLLKDLSQPTVDQSTGRDITTYKTDKEAIADLSDLDPDTAVELLVPVMK 257 (377) T ss_pred HHHHHHHhhceEeccCCCc----------ceeeeeccccccccccccccccceeeccccccccccCChhHHHHHHHHHHH Confidence 9999999999999997422 257776553210 01111 11 11333333221 111 Q ss_pred -hh-hccCceEE------EecChHHhhhHHHHhhCcceeeeccCCCccee--eee---hhhhcCCCcceecccc----ee Q lcl|NC_018856. 214 -VG-KGYGRATD------AFMPIGVQADFTNNLLDRQRVIQPSTAGGFST--GFS---INQFLSTRGAINLHGS----TI 276 (479) Q Consensus 214 -i~-~~fG~~td------~~mp~~vka~f~~~~~~~qrv~~~~n~g~~~~--G~~---I~~~~s~~G~I~l~~s----~~ 276 (479) .+ .+.|.+.. +.|+..+..+. .+++..++++ |.... |++ +.+-..|.|.|.| |+ .+ T Consensus 258 ~~~~~~~~~~~~~~~~a~~~mn~~t~~~~-----~~~~~~~~~~-G~~~~~l~~p~~v~~s~~~p~~~i~f-gdf~~Y~i 330 (377) T protein:vir:96 258 HLSVNDKKHPLKIAGQVKLLLNPEDRWTL-----EAKFTSRNQF-GEYVTVLPHGITILESLAVETGKAIA-FVANRYDA 330 (377) T ss_pred hhccccccccccccCceEEEEchhhHHhc-----cccccccCCC-CCceeccCCCceEEecCCCCcccEEE-EEcCcEEE Confidence 11 11233222 44666554432 2333344433 22221 111 1222234444432 22 22 Q ss_pred cCCCceecccCCcCCCCCCCceeEeeeeccCCCCCCCcccccceEEEEEEEcCc Q lcl|NC_018856. 277 MENDNILLEGRNPEPNAPQAPASVVASIVDDKKGGFRDEDIKTHSYKVVVHSDD 330 (479) Q Consensus 277 m~~~~~L~e~~~~~~~AP~~pa~v~at~~t~~~G~f~~~d~gty~YkVtavn~~ 330 (479) .++..+.++.. .+..+-.--+.-.+..-. +|+ +-+ .-..+|.-+.-. T Consensus 331 ~~r~~~~i~~~-~~~~~~~d~~~f~~~~r~--dG~--~~d--~~a~~vl~l~~~ 377 (377) T protein:vir:96 331 FMATASTIEEY-DQTFAMEDLQLYLTKNYF--YGK--AKD--NHTAALLTLAGG 377 (377) T ss_pred EEecccEEEee-hhhhhhcCCeEEEEEEEE--cCE--Eec--CCcEEEEEEecC Confidence 22211111100 000000000000111111 010 000 011222222222 No 104 >protein:vir:3870 Length: 400 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:82 # MgeName: A2 # Cross-refs: genbank:acc:NP_680487;swissprot:trembl:q8ltc0;genbank:gi:22296527;interpro:IPR006444;uniprot:Q8LTC0;genbank:GeneID:951713 Probab=90.99 E-value=0.018 Score=30.15 Aligned_cols=293 Identities=10% Similarity=0.074 Sum_probs=130.7 Q ss_pred CCccchhhhhh-----hhcCCcc----------------chHHHHHHHHHhhhcCCCcChhhccCccccchhhhhhhhhh Q lcl|NC_018856. 1 MTELKKEAEAK-----NKKLPVE----------------AEAELAELVSKSFTTGYGITPDTQLDGAAVRRELLEDQVKM 59 (479) Q Consensus 1 ~~~~~~~~~~~-----~~~~~~~----------------~~~~~~e~~~Ks~tag~~~~p~~~~~gaalr~esld~~~~~ 59 (479) .+..+.+.+.+ ..+.... ......+.....+++| .+..+|+.|.++.+...|.. T Consensus 82 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~gg~~vP~~~~~~ii~ 156 (400) T protein:vir:38 82 KPDHPEEHSYRDALNAYLHTRGRNTDGVNFEKTDVGTFAVLRAVPTDASDAVNAG-----VKAADAASTIPETISNTPQR 156 (400) T ss_pred cccchhhhhHHHHHHHHHhhHHHHHHHHHHHHHHHHHHhhhhhhhHHHHHHHhhc-----ccccCCcccccHHHHHHHHH Confidence 00000000000 0000000 0001111111112222 13455788888888888754 Q ss_pred heeccccccchhhccccchhHHHHhhhhhhccCccccccccccccccc-ccCcceEEEEEEEEeeeehhhhhhhHhhhcc Q lcl|NC_018856. 60 LAFSSNDFTIYPLINKQQVNSTVAKYAVFNQHGRTGHSRFVREVGVAS-INDPNIRQKTVQMKFLSDTKQQSLAAGLVNN 138 (479) Q Consensus 60 l~~~~~~f~f~~~i~k~~~~stv~eY~~~~~~G~~g~~~fv~E~g~~~-~~d~~~~r~~~~~k~l~~~~~vs~~~~lvn~ 138 (479) +.. ....+++.+...++.+.--+|.+.... .+...+++|++... .+++.+.+.+..++-++.-..+|.-+ +.++ T Consensus 157 ~~~--~~~~l~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~E~~~~~~~~~~~f~~i~~~~~k~~~~~~is~el-l~ds 231 (400) T protein:vir:38 157 ELQ--TVVDLKPFTNVFQASTQKGTYPTVANA--TTKMVTVAELEKNPAMAKPEFKPVNWSVETYRQALPVSQES-IDDS 231 (400) T ss_pred HHH--hhhhhhhcceeEeccCcceEEEEEecC--CCccccccccccccccccccceeeEeehhheeeehhhHHHH-Hhhh Confidence 443 333556666655565443345444333 34566788887665 68999999999998888777777632 2345 Q ss_pred hhhHHHHHHHHHHHHHHHHHHHHHhhcccccCCCCCCcccchhhhHHHhhccCCCEEEccCCCCCHHHHhhhhhhhhhcc Q lcl|NC_018856. 139 IADPMTILTEDAIAVIAKSIEWAIFYGDAALSSEADGQAGIEFDGLHKLIDQDTNVIDLKGARLDEATLNKAAVIVGKGY 218 (479) Q Consensus 139 ~~Dp~~~~~~~ai~~~~~~iE~a~f~Gd~~l~~~~~~~~gleFDGl~~~I~~~~NviDarG~~l~~~~l~~aa~~i~~~f 218 (479) ..|.+....+.....+..+++.++++|.....+.. ...+|++...+... +|.. + T Consensus 232 ~~~~~~~i~~~l~~~~~~~~~~~i~~~~~~~~~~~----~~~~~~~~~~~~~~---~~~~-------------------~ 285 (400) T protein:vir:38 232 AIDLVGLIAQNGQQIKVNTTNGAVATLLKGFTAKT----ISSVDDLKHINNVD---LDPA-------------------Y 285 (400) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhhhhccccccccc----cccHHHHHHHHHhh---hhhh-------------------h Confidence 66778888888889999999999999998866532 24577776665421 1111 1 Q ss_pred CceEEEecChHHhhhHHHHhhCcceeeeccCCCcceeeeehhhhcCCCcceecccceecCCCceecccCCcCCCCCCCce Q lcl|NC_018856. 219 GRATDAFMPIGVQADFTNNLLDRQRVIQPSTAGGFSTGFSINQFLSTRGAINLHGSTIMENDNILLEGRNPEPNAPQAPA 298 (479) Q Consensus 219 G~~td~~mp~~vka~f~~~~~~~qrv~~~~n~g~~~~G~~I~~~~s~~G~I~l~~s~~m~~~~~L~e~~~~~~~AP~~pa 298 (479) ..-..|++.+...|...-...-|++...+..+. .+.+++..+-...+ .++.+.+ .. T Consensus 286 --~a~~v~~~~~~~~l~~lkd~~G~~i~~~~~~~~------------------~~~~l~G~pv~~~~-~~~~~~~---g~ 341 (400) T protein:vir:38 286 --SRVIIASQSFYNFLDTVKDGNGRYLLQDSILTP------------------SGKSVLGMPIAVVS-DDTLGAA---GE 341 (400) T ss_pred --CcEEEEcHHHHHHHHHhhccCCCeeeecCcCCC------------------CccccccceeEEec-ccccCCC---Cc Confidence 123678888877776533322233322122111 11234444422222 1111100 00 Q ss_pred eEeeeeccCCCCCCCcccccceEEEEEEEcCcCcccc-ccceeeeeecCCceEEEEEEecCCCCcccceEEEEEecC--- Q lcl|NC_018856. 299 SVVASIVDDKKGGFRDEDIKTHSYKVVVHSDDAESLP-SEAVTAAVAKKDNTVKLEVKLASLYQAQPQFISVYREGT--- 374 (479) Q Consensus 299 ~v~at~~t~~~G~f~~~d~gty~YkVtavn~~GES~p-S~~vt~Tv~~~g~sv~ltIT~~~~~~a~~~~y~IYR~~~--- 374 (479) ... -.|.++.-+..+++.+-+.- +.....+ .-+..+.|=+- T Consensus 342 ---~~~-----------~~gd~s~~~~~~~~~~~~~~~~~~~~~~---------------------~~~~~~~r~d~~~~ 386 (400) T protein:vir:38 342 ---AHA-----------FLGDIKRAILFANRADFMVRWVDDQIYG---------------------QFLQAGMRFGVSVA 386 (400) T ss_pred ---eEE-----------EEEeccccEEEEeecceEEEEecccccc---------------------eeEEEEEEeccEEe Confidence 000 01122211222222221100 0000000 00111111100 Q ss_pred CCcceEEEEeeeee Q lcl|NC_018856. 375 ETGHYFLIARVPVS 388 (479) Q Consensus 375 ~~G~y~li~rv~vs 388 (479) ....|..+.-.|.. T Consensus 387 ~~~a~~~l~~~~~a 400 (400) T protein:vir:38 387 DEKAGYFLTYTPKA 400 (400) T ss_pred cccceEEEEeecCC Confidence 01122222222221 No 105 >protein:vir:9704 Length: 394 # NCBI annotation: hypothetical protein # Family: family:all:21 # MgeID: mge:174 # MgeName: 315.2 # Cross-refs: genbank:acc:NP_795466;genbank:gi:28876225;genbank:GeneID:1257769 Probab=90.93 E-value=0.014 Score=30.73 Aligned_cols=290 Identities=11% Similarity=0.058 Sum_probs=123.1 Q ss_pred CCccchhhhh-----hhhc---------CCccchHHHHHHHHHhhhcCCCcChhhccCccccchhhhhhhhhhheecccc Q lcl|NC_018856. 1 MTELKKEAEA-----KNKK---------LPVEAEAELAELVSKSFTTGYGITPDTQLDGAAVRRELLEDQVKMLAFSSND 66 (479) Q Consensus 1 ~~~~~~~~~~-----~~~~---------~~~~~~~~~~e~~~Ks~tag~~~~p~~~~~gaalr~esld~~~~~l~~~~~~ 66 (479) .++.+.+++. +..+ .......+......+....-...+..+..+|+.|..+.+...|..+..... T Consensus 78 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~~~gg~liP~~~~~~ii~~~~~~~- 156 (394) T protein:vir:97 78 KEVTQEEKTYRESVNDFIRSKGKIVNDSLRFEGKDEVLMPINETTPVEPQKDGIKKENAKPVSSEEILYTPAREVKTVV- 156 (394) T ss_pred cccchhhHHHHHHHHHHHHHHHHHhhhhhhhhhHHHHHHHHHhhhhhhhhccccccccccccChHHHHHHHHHHhhhhh- Confidence 0000000000 0000 000000111110100000001111224456888888888888765544333 Q ss_pred ccchhhccccchhHHHHhhhhhhccCccccccccccccccc-ccCcceEEEEEEEEeeeehhhhhhhHhhhcchhhHHHH Q lcl|NC_018856. 67 FTIYPLINKQQVNSTVAKYAVFNQHGRTGHSRFVREVGVAS-INDPNIRQKTVQMKFLSDTKQQSLAAGLVNNIADPMTI 145 (479) Q Consensus 67 f~f~~~i~k~~~~stv~eY~~~~~~G~~g~~~fv~E~g~~~-~~d~~~~r~~~~~k~l~~~~~vs~~~~lvn~~~Dp~~~ 145 (479) .+.+.+...++.+.-.+|.+.. .+.+...+++|++... .+++.+...+...+-++.--.+|.-+ +.++..|.+.. T Consensus 157 -~l~~~~~~~~~~~~~~~~~~~~--~~~~~~~~v~E~~~~~~~~~~~~~~v~l~~~k~~~~i~is~el-l~ds~~~~~~~ 232 (394) T protein:vir:97 157 -DLKPFTTVYQAKKASGKYPVLQ--RATTKMVTVAELEKNPALAKPDFKDVAWNIDTYRGAIPLSQES-IDDADVDLVGI 232 (394) T ss_pred -hhhhhceeeeccCcceEEEEEe--cCCCccceecccccccccccccceeEEeehhheeeehhhHHHH-HhhhhHHHHHH Confidence 3344444444444434454432 2223456899998765 67899999999999888777777643 23445567777 Q ss_pred HHHHHHHHHHHHHHHHHhhcccccCCCCCCcccchhhhHHHhhccCCCEEEccCCCCCHHHHhhhhhhhhhccCceEEEe Q lcl|NC_018856. 146 LTEDAIAVIAKSIEWAIFYGDAALSSEADGQAGIEFDGLHKLIDQDTNVIDLKGARLDEATLNKAAVIVGKGYGRATDAF 225 (479) Q Consensus 146 ~~~~ai~~~~~~iE~a~f~Gd~~l~~~~~~~~gleFDGl~~~I~~~~NviDarG~~l~~~~l~~aa~~i~~~fG~~td~~ 225 (479) ..+.-...+.++++.++..|.....+. ...-+|+++..+... +|.. ++ ..+. T Consensus 233 i~~~la~~~~~~~~~~i~~g~~~~~~~----~~~~~~~~~~~~~~~---~~~~-------------------~~--a~~v 284 (394) T protein:vir:97 233 VSESISQIKVNTTNDAIAKVLKSFTTK----TVKNLDEIKALLNGG---FDPA-------------------YN--VSLI 284 (394) T ss_pred HHHHHHHHHHHHHHHHHhhcccccccc----ccccHHHHHHHHHhh---hhhh-------------------hC--CEEE Confidence 788888888899999999998765442 245578777766421 1111 10 1356 Q ss_pred cChHHhhhHHHHhhCcceeeeccCCCc----ceeeeehh---hhcCCCcceecccc-----eecCCCceecccCCcCCCC Q lcl|NC_018856. 226 MPIGVQADFTNNLLDRQRVIQPSTAGG----FSTGFSIN---QFLSTRGAINLHGS-----TIMENDNILLEGRNPEPNA 293 (479) Q Consensus 226 mp~~vka~f~~~~~~~qrv~~~~n~g~----~~~G~~I~---~~~s~~G~I~l~~s-----~~m~~~~~L~e~~~~~~~A 293 (479) |++.+.+.+...-...-|++...+..+ .-.|++|- +...+.+.+ +.|| .++++....++ ..+- T Consensus 285 ~n~~~~~~l~~lkd~~G~~i~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~-~~gd~~~~~~~~~~~~~~~~----~~~~ 359 (394) T protein:vir:97 285 VSQSFYQTLDTLKDGNGRYLLQDDITAVSGKVLLGKPVFVLSDEVLGANKA-FIGDFKRGVLFADRKDLGLR----WADN 359 (394) T ss_pred EcHHHHHHHHHhhccCCCeeeecCcCCCCCceeccceeEEecccccCCccE-EEeeccccEEEEEecceEEE----Eecc Confidence 777776666543333223333222211 23333321 111111111 1111 11111111000 0000 Q ss_pred CCCceeEeeeeccCCCCCCCcccccceEEEEEEEcCcCccccc Q lcl|NC_018856. 294 PQAPASVVASIVDDKKGGFRDEDIKTHSYKVVVHSDDAESLPS 336 (479) Q Consensus 294 P~~pa~v~at~~t~~~G~f~~~d~gty~YkVtavn~~GES~pS 336 (479) ........+..-. +|+.. +.-.++.+.-.....|- T Consensus 360 ~~~~~~~~~~~r~--d~~v~------~~~a~~~~~~~~~~~p~ 394 (394) T protein:vir:97 360 EIYGQYLQAVLRF--GVSKV------DDKAGYYVTFTPEPLPL 394 (394) T ss_pred cccceeEEEEEEE--ccEEe------cccceEEEEecccccCC Confidence 0000000000000 00000 01112222222222222 No 106 >protein:vir:98635 Length: 377 # NCBI annotation: major coat protein # Family: family:all:635 # MgeID: mge:1601 # MgeName: phi3396 # Cross-refs: genbank:acc:YP_001039923;genbank:gi:126011098;genbank:GeneID:4818471 Probab=89.82 E-value=0.018 Score=30.17 Aligned_cols=306 Identities=15% Similarity=0.036 Sum_probs=123.6 Q ss_pred CCccchhhhhhhhcCCcc--chHHHHHHHHHhhhcCCCcChhhccCccccchhhhhhhhh-hheeccccccchhhccccc Q lcl|NC_018856. 1 MTELKKEAEAKNKKLPVE--AEAELAELVSKSFTTGYGITPDTQLDGAAVRRELLEDQVK-MLAFSSNDFTIYPLINKQQ 77 (479) Q Consensus 1 ~~~~~~~~~~~~~~~~~~--~~~~~~e~~~Ks~tag~~~~p~~~~~gaalr~esld~~~~-~l~~~~~~f~f~~~i~k~~ 77 (479) ..+.++|.+.....-+.. .-.+-.+++++..+.| +-++|+.|-++.+.+.|. .| .+...+++.+...+ T Consensus 47 ~~~~~~e~~~~~~~~~~~~~lt~ee~~~~~~~~~~~------~~~~gg~~vP~~~~~~I~~~l---~~~s~i~~~~~v~~ 117 (377) T protein:vir:98 47 LAKNEEEMERMFDLRDKNRELTAEEIKFFNDIDKNV------GGKDKFKLLPEETMVQVFDDL---VAEHPLLKVINFKN 117 (377) T ss_pred HHHHHHHHHHHHHhccCCcccCHHHHHHHHHHHhcc------CCCCCccccCHHHHHHHHHHH---HHhhhhhhheeeEe Confidence 111111111111111000 0011112233333322 234567777777766663 33 22245555555555 Q ss_pred hhHHHHhhhhhhccCccccccccccccc-ccccCcceEEEEEEEEeeeehhhhhhhHhhhcchhhHHHHHHHHHHHHHHH Q lcl|NC_018856. 78 VNSTVAKYAVFNQHGRTGHSRFVREVGV-ASINDPNIRQKTVQMKFLSDTKQQSLAAGLVNNIADPMTILTEDAIAVIAK 156 (479) Q Consensus 78 ~~stv~eY~~~~~~G~~g~~~fv~E~g~-~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lvn~~~Dp~~~~~~~ai~~~~~ 156 (479) +.+.+ + +....+.+.+.+++|.+. ++..+|.+.+.....+=|+.--.+|.-+ |.++..|.+....+.--.++++ T Consensus 118 ~~~~~-~---~~~~~~~~~a~w~~e~~~~~~~~~~~f~~i~l~~~kl~a~~~is~el-L~ds~~~ie~~i~~~la~~~a~ 192 (377) T protein:vir:98 118 TSLRL-K---ALTAETSGTAVWGDIFGEIKGQLKQAFKEQDFSQFKLTAFVVIPKDA-LKFGPKWIKQFITEQLKEAIAV 192 (377) T ss_pred cCcce-E---EEEecCCcceeEeecccccCcccCccceeEeecceeEEeeecccHHh-hhccHhHHHHHHHHHHHHHHHH Confidence 54432 3 333445556668899875 4578999999999998888776776655 5567889999999999999999 Q ss_pred HHHHHHhhcccccCCCCCCcccchhhhHHHhhccCC----CEEEccCCCCCHHHHhhhhhhhhhccCceEEEecChHHhh Q lcl|NC_018856. 157 SIEWAIFYGDAALSSEADGQAGIEFDGLHKLIDQDT----NVIDLKGARLDEATLNKAAVIVGKGYGRATDAFMPIGVQA 232 (479) Q Consensus 157 ~iE~a~f~Gd~~l~~~~~~~~gleFDGl~~~I~~~~----NviDarG~~l~~~~l~~aa~~i~~~fG~~td~~mp~~vka 232 (479) .++.+++.||-+-- --||++.+.... ...++-+...+.+.|-++.-.....|..--...|...+.. T Consensus 193 ~~~~a~i~G~G~~q----------P~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~a~~~m~~~t~~ 262 (377) T protein:vir:98 193 ALELAIVKGDGLLQ----------PVGLLKDLSQPTVDQSTGRDITTYKTDKEAIADLSDLTPDNAPKKLVPVMKHLSVN 262 (377) T ss_pred HHhhceEeccCCCc----------ceeeeecccccccccccccccccccchhhhHhhhhhhchhHHHHHHHHHHHHHHHH Confidence 99999999996522 257776653221 1222223222222222111111111111001122222222 Q ss_pred hHHHHhhCcceeeeccCCCcceeeeehhhhcCCCcceecccceecCCCceecccCCcCCCCCCCceeEeeeeccCCCCCC Q lcl|NC_018856. 233 DFTNNLLDRQRVIQPSTAGGFSTGFSINQFLSTRGAINLHGSTIMENDNILLEGRNPEPNAPQAPASVVASIVDDKKGGF 312 (479) Q Consensus 233 ~f~~~~~~~qrv~~~~n~g~~~~G~~I~~~~s~~G~I~l~~s~~m~~~~~L~e~~~~~~~AP~~pa~v~at~~t~~~G~f 312 (479) .....-...-+++..-|+...--.........+.|. ..+.|..+..+++ ..+.|.. .+.. T Consensus 263 ~~~klkd~~G~~i~~~n~~~~~~~~p~~~~~~~~G~----~~t~lg~p~~vv~----s~~~p~~------~i~f------ 322 (377) T protein:vir:98 263 DKKRPLKIAGQVKLILNPEDRWALEAQFTSRNQFGE----YVTVLPHGITILE----SLAVETG------KAIA------ 322 (377) T ss_pred HHhhhhccCCceEEEecccchhhccccccccCCCCc----cccccCCCceEEe----cCCCCcc------cEEE------ Confidence 111111112222221121110000000011111111 0122222211111 1111110 0111 Q ss_pred CcccccceE-EEEEEEcCcCccc-ccccee---------ee------eecCCceEEEEEEecCC Q lcl|NC_018856. 313 RDEDIKTHS-YKVVVHSDDAESL-PSEAVT---------AA------VAKKDNTVKLEVKLASL 359 (479) Q Consensus 313 ~~~d~gty~-YkVtavn~~GES~-pS~~vt---------~T------v~~~g~sv~ltIT~~~~ 359 (479) |..+ |.+ +.+.|-+. .|.... +. +..+..-+.|+|+ .. T Consensus 323 -----gdf~~Y~i--~~r~~~~i~~~~~~~~~~d~~~f~~~~r~dg~~~~~~a~~vl~i~--~~ 377 (377) T protein:vir:98 323 -----FVANRYDA--FMATASTIEEYDQTFAMEDLQLYLTKNYFYGKAKDNHTAALLTLA--GG 377 (377) T ss_pred -----EEecceeE--EeecceEEEeechhhhhcCceEEEEEEEEcCEEeccCcEEEEEEe--cC Confidence 1111 333 22333111 111111 11 1111222333333 11 No 107 >protein:vir:101650 Length: 497 # NCBI annotation: gp13 # Family: family:all:585 # MgeID: mge:1515 # MgeName: 244 # Cross-refs: genbank:acc:YP_654768;genbank:gi:109302766;genbank:GeneID:4156084 Probab=88.32 E-value=0.033 Score=28.70 Aligned_cols=286 Identities=13% Similarity=0.055 Sum_probs=120.8 Q ss_pred CCccchh------hhhhhh---------cC------Ccc---chHHHHH---H---------HHHhhhcCCCcChhhccC Q lcl|NC_018856. 1 MTELKKE------AEAKNK---------KL------PVE---AEAELAE---L---------VSKSFTTGYGITPDTQLD 44 (479) Q Consensus 1 ~~~~~~~------~~~~~~---------~~------~~~---~~~~~~e---~---------~~Ks~tag~~~~p~~~~~ 44 (479) ..+..+. ....+. +. ... ......+ . ..++++.| +-.+ T Consensus 85 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------~~~~ 158 (497) T protein:vir:10 85 LKQIRKHLARAVIMNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFG------STGT 158 (497) T ss_pred hhhHHHHHHHHHhhhHHHHhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHhhhhhhHHHHHhhhcc------cCcc Confidence 0000000 000000 00 000 0000000 0 01111111 1134 Q ss_pred ccccchhhhhhhhhhheeccccccchhhccccchhHHHHhhhhhhccCcccccccccccccccccCcceEEEEEEEEeee Q lcl|NC_018856. 45 GAAVRRELLEDQVKMLAFSSNDFTIYPLINKQQVNSTVAKYAVFNQHGRTGHSRFVREVGVASINDPNIRQKTVQMKFLS 124 (479) Q Consensus 45 gaalr~esld~~~~~l~~~~~~f~f~~~i~k~~~~stv~eY~~~~~~G~~g~~~fv~E~g~~~~~d~~~~r~~~~~k~l~ 124 (479) |+.|-.+.+..+|..+. .+...+.+-++..+..+---.|.+ .-++.+...+|+|++..+.+|+.+.+.....+=++ T Consensus 159 gg~~vp~~~~~~ii~~~--~~~~~i~~l~~~~~~~~~~~~~~~--~~~~~~~a~wv~E~~~~~~s~~~f~~i~~~~~k~a 234 (497) T protein:vir:10 159 FAPGILPTFLPGIVEQL--FYELSLADLISSRPVTSPNLSYLT--ESAAHNNAAAVAEAGTYPFSSEEFARVYEQVGKVA 234 (497) T ss_pred cccccchhhhHHHHHHH--HhhhhHHhhccccccCCCceEEEE--EcCCCCcceeeccCcccccccccceeeEeeeeeeE Confidence 56666677777764433 334456666666666543223333 23444567799999999999999999999999999 Q ss_pred ehhhhhhhHhhhcchhhHHHHHHHHHHHHHHHHHHHHHhhcccccCCCCCCcccchhhhHHHhhccC------------- Q lcl|NC_018856. 125 DTKQQSLAAGLVNNIADPMTILTEDAIAVIAKSIEWAIFYGDAALSSEADGQAGIEFDGLHKLIDQD------------- 191 (479) Q Consensus 125 ~~~~vs~~~~lvn~~~Dp~~~~~~~ai~~~~~~iE~a~f~Gd~~l~~~~~~~~gleFDGl~~~I~~~------------- 191 (479) .--.+|.-+ +.++ .+.+....++-...+++.++.++++|+-.-.| .||.+.-... T Consensus 235 ~~~~iS~el-l~d~-~~l~~~i~~~l~~~i~~~~d~~~l~G~G~~~p----------~Gil~~~~~~~~~~~~~~~~~~~ 302 (497) T protein:vir:10 235 NALTITDEG-LRDA-PELFNFVQGRLLEGIQRKEEVQLLAGGGYPGV----------NGLLQRSTGFTASSASSLFGATS 302 (497) T ss_pred eecHhHHHH-HHhH-HHHHHHHHHHHHHHHHHHHHHHhhcCCCcccc----------cccccccccccccccccchhhhh Confidence 988888764 2333 46778888888899999999999999854221 2332221100 Q ss_pred ------C-CEEEccCCCC---------------------------------CHHHHhhhhhhhhhccC-ceEEEecChHH Q lcl|NC_018856. 192 ------T-NVIDLKGARL---------------------------------DEATLNKAAVIVGKGYG-RATDAFMPIGV 230 (479) Q Consensus 192 ------~-NviDarG~~l---------------------------------~~~~l~~aa~~i~~~fG-~~td~~mp~~v 230 (479) . ++-....... ....+.++-..+...++ .++-..|++.. T Consensus 303 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vmn~~~ 382 (497) T protein:vir:10 303 ATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRD 382 (497) T ss_pred hhhhhhhhhcccccchhhhhhHHHHHHHHHhhhhhhhhccchhccccchhhhhhHHHHHHhhhhhhcccCCCeEEEchHH Confidence 0 0000000001 11122333333333333 34446678777 Q ss_pred hhhHHHHhhCcceeeeccCCC----------cceeeeehhhhcC-CCcceecccc------eecCCCce----------- Q lcl|NC_018856. 231 QADFTNNLLDRQRVIQPSTAG----------GFSTGFSINQFLS-TRGAINLHGS------TIMENDNI----------- 282 (479) Q Consensus 231 ka~f~~~~~~~qrv~~~~n~g----------~~~~G~~I~~~~s-~~G~I~l~~s------~~m~~~~~----------- 282 (479) ...+...-...-|.+.+...+ .--.|++|-.... +.|.+-+ |+ .++++..+ T Consensus 383 ~~~l~~lkd~~G~~i~~~~~~~~~~~~~~~~~~l~G~pV~~t~~~~~~~~~~-Gd~~~~~~~i~~r~~~~v~~~~~~~~~ 461 (497) T protein:vir:10 383 WELLRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGTILV-GHFAPSVIQTARREGVTMQMTNSNGTD 461 (497) T ss_pred HHHHHHhhcCCCceeccCcccccccccccCCceeeceeeEecCCCCCCceEE-eecccceEEEEEecccEEEeecccchh Confidence 766654433333333221111 0112332211000 1111110 11 01111111 Q ss_pred --------ecccCCcC-CCCCCCceeEeeeeccCCCCCCCcccccceEEEEEEEcCcCccccccceeeeeecCCc Q lcl|NC_018856. 283 --------LLEGRNPE-PNAPQAPASVVASIVDDKKGGFRDEDIKTHSYKVVVHSDDAESLPSEAVTAAVAKKDN 348 (479) Q Consensus 283 --------L~e~~~~~-~~AP~~pa~v~at~~t~~~G~f~~~d~gty~YkVtavn~~GES~pS~~vt~Tv~~~g~ 348 (479) ..+.|+.- ...|.+-..+.-+++ ..| + T Consensus 462 f~~n~v~~r~~~r~~~~v~~p~A~~~l~~~~~--~~~-------------------------------------~ 497 (497) T protein:vir:10 462 FVDGKVTVRAEERLGLLVYRPSAFQLIQLKKG--ATG-------------------------------------S 497 (497) T ss_pred hhcCcEEEEEEEeecceeeccccEEEEEecCC--ccC-------------------------------------C Confidence 11111100 111111111111111 111 1 No 108 >protein:vir:7855 Length: 497 # NCBI annotation: gp12 # Family: family:all:585 # MgeID: mge:150 # MgeName: CJW1 # Cross-refs: genbank:acc:NP_817462;genbank:gi:29565891;genbank:GeneID:1259081 Probab=88.32 E-value=0.033 Score=28.70 Aligned_cols=286 Identities=13% Similarity=0.055 Sum_probs=120.8 Q ss_pred CCccchh------hhhhhh---------cC------Ccc---chHHHHH---H---------HHHhhhcCCCcChhhccC Q lcl|NC_018856. 1 MTELKKE------AEAKNK---------KL------PVE---AEAELAE---L---------VSKSFTTGYGITPDTQLD 44 (479) Q Consensus 1 ~~~~~~~------~~~~~~---------~~------~~~---~~~~~~e---~---------~~Ks~tag~~~~p~~~~~ 44 (479) ..+..+. ....+. +. ... ......+ . ..++++.| +-.+ T Consensus 85 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------~~~~ 158 (497) T protein:vir:78 85 LKQIRKHLARAVIMNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFG------STGT 158 (497) T ss_pred hhhHHHHHHHHHhhhHHHHhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHhhhhhhHHHHHhhhcc------cCcc Confidence 0000000 000000 00 000 0000000 0 01111111 1134 Q ss_pred ccccchhhhhhhhhhheeccccccchhhccccchhHHHHhhhhhhccCcccccccccccccccccCcceEEEEEEEEeee Q lcl|NC_018856. 45 GAAVRRELLEDQVKMLAFSSNDFTIYPLINKQQVNSTVAKYAVFNQHGRTGHSRFVREVGVASINDPNIRQKTVQMKFLS 124 (479) Q Consensus 45 gaalr~esld~~~~~l~~~~~~f~f~~~i~k~~~~stv~eY~~~~~~G~~g~~~fv~E~g~~~~~d~~~~r~~~~~k~l~ 124 (479) |+.|-.+.+..+|..+. .+...+.+-++..+..+---.|.+ .-++.+...+|+|++..+.+|+.+.+.....+=++ T Consensus 159 gg~~vp~~~~~~ii~~~--~~~~~i~~l~~~~~~~~~~~~~~~--~~~~~~~a~wv~E~~~~~~s~~~f~~i~~~~~k~a 234 (497) T protein:vir:78 159 FAPGILPTFLPGIVEQL--FYELSLADLISSRPVTSPNLSYLT--ESAAHNNAAAVAEAGTYPFSSEEFARVYEQVGKVA 234 (497) T ss_pred cccccchhhhHHHHHHH--HhhhhHHhhccccccCCCceEEEE--EcCCCCcceeeccCcccccccccceeeEeeeeeeE Confidence 56666677777764433 334456666666666543223333 23444567799999999999999999999999999 Q ss_pred ehhhhhhhHhhhcchhhHHHHHHHHHHHHHHHHHHHHHhhcccccCCCCCCcccchhhhHHHhhccC------------- Q lcl|NC_018856. 125 DTKQQSLAAGLVNNIADPMTILTEDAIAVIAKSIEWAIFYGDAALSSEADGQAGIEFDGLHKLIDQD------------- 191 (479) Q Consensus 125 ~~~~vs~~~~lvn~~~Dp~~~~~~~ai~~~~~~iE~a~f~Gd~~l~~~~~~~~gleFDGl~~~I~~~------------- 191 (479) .--.+|.-+ +.++ .+.+....++-...+++.++.++++|+-.-.| .||.+.-... T Consensus 235 ~~~~iS~el-l~d~-~~l~~~i~~~l~~~i~~~~d~~~l~G~G~~~p----------~Gil~~~~~~~~~~~~~~~~~~~ 302 (497) T protein:vir:78 235 NALTITDEG-LRDA-PELFNFVQGRLLEGIQRKEEVQLLAGGGYPGV----------NGLLQRSTGFTASSASSLFGATS 302 (497) T ss_pred eecHhHHHH-HHhH-HHHHHHHHHHHHHHHHHHHHHHhhcCCCcccc----------cccccccccccccccccchhhhh Confidence 988888764 2333 46778888888899999999999999854221 2332221100 Q ss_pred ------C-CEEEccCCCC---------------------------------CHHHHhhhhhhhhhccC-ceEEEecChHH Q lcl|NC_018856. 192 ------T-NVIDLKGARL---------------------------------DEATLNKAAVIVGKGYG-RATDAFMPIGV 230 (479) Q Consensus 192 ------~-NviDarG~~l---------------------------------~~~~l~~aa~~i~~~fG-~~td~~mp~~v 230 (479) . ++-....... ....+.++-..+...++ .++-..|++.. T Consensus 303 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vmn~~~ 382 (497) T protein:vir:78 303 ATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRD 382 (497) T ss_pred hhhhhhhhhcccccchhhhhhHHHHHHHHHhhhhhhhhccchhccccchhhhhhHHHHHHhhhhhhcccCCCeEEEchHH Confidence 0 0000000001 11122333333333333 34446678777 Q ss_pred hhhHHHHhhCcceeeeccCCC----------cceeeeehhhhcC-CCcceecccc------eecCCCce----------- Q lcl|NC_018856. 231 QADFTNNLLDRQRVIQPSTAG----------GFSTGFSINQFLS-TRGAINLHGS------TIMENDNI----------- 282 (479) Q Consensus 231 ka~f~~~~~~~qrv~~~~n~g----------~~~~G~~I~~~~s-~~G~I~l~~s------~~m~~~~~----------- 282 (479) ...+...-...-|.+.+...+ .--.|++|-.... +.|.+-+ |+ .++++..+ T Consensus 383 ~~~l~~lkd~~G~~i~~~~~~~~~~~~~~~~~~l~G~pV~~t~~~~~~~~~~-Gd~~~~~~~i~~r~~~~v~~~~~~~~~ 461 (497) T protein:vir:78 383 WELLRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGTILV-GHFAPSVIQTARREGVTMQMTNSNGTD 461 (497) T ss_pred HHHHHHhhcCCCceeccCcccccccccccCCceeeceeeEecCCCCCCceEE-eecccceEEEEEecccEEEeecccchh Confidence 766654433333333221111 0112332211000 1111110 11 01111111 Q ss_pred --------ecccCCcC-CCCCCCceeEeeeeccCCCCCCCcccccceEEEEEEEcCcCccccccceeeeeecCCc Q lcl|NC_018856. 283 --------LLEGRNPE-PNAPQAPASVVASIVDDKKGGFRDEDIKTHSYKVVVHSDDAESLPSEAVTAAVAKKDN 348 (479) Q Consensus 283 --------L~e~~~~~-~~AP~~pa~v~at~~t~~~G~f~~~d~gty~YkVtavn~~GES~pS~~vt~Tv~~~g~ 348 (479) ..+.|+.- ...|.+-..+.-+++ ..| + T Consensus 462 f~~n~v~~r~~~r~~~~v~~p~A~~~l~~~~~--~~~-------------------------------------~ 497 (497) T protein:vir:78 462 FVDGKVTVRAEERLGLLVYRPSAFQLIQLKKG--ATG-------------------------------------S 497 (497) T ss_pred hhcCcEEEEEEEeecceeeccccEEEEEecCC--ccC-------------------------------------C Confidence 11111100 111111111111111 111 1 No 109 >protein:vir:4197 Length: 314 # NCBI annotation: putative structural protein # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:88 # MgeName: psiM100 # Cross-refs: genbank:acc:NP_071822;genbank:gi:11863105;genbank:GeneID:1257607 Probab=86.88 E-value=0.042 Score=28.11 Aligned_cols=294 Identities=14% Similarity=0.110 Sum_probs=131.4 Q ss_pred CCccchhhhhhhhcCCccchHHHHHHHHHhhhcCCCcChhhccCccccchhhhhhhhhhheeccccccchhhccccch-h Q lcl|NC_018856. 1 MTELKKEAEAKNKKLPVEAEAELAELVSKSFTTGYGITPDTQLDGAAVRRELLEDQVKMLAFSSNDFTIYPLINKQQV-N 79 (479) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~e~~~Ks~tag~~~~p~~~~~gaalr~esld~~~~~l~~~~~~f~f~~~i~k~~~-~ 79 (479) |. .++.+.+ +.|++| .+|. +|+-|..|-+++-+..|...+ .|.+.+...+. . T Consensus 1 ~~----------------~~~~~~~-~~k~it-----~~d~--~gG~L~P~~~~~~i~~l~e~s---~i~~~a~vi~t~~ 53 (314) T protein:vir:41 1 MD----------------FLNKPFQ-ITPKID-----VPDL--GKGILAVQRFGEFVREVRENS---AIIKDARVLNALK 53 (314) T ss_pred Cc----------------hhhhHHH-hhcccc-----cccC--CCceeChHHHHHHHHHHHhcc---chhhheeeecccC Confidence 21 1222222 224443 2222 467899999987665555322 23333332211 2 Q ss_pred HHHHhhhhhhccCccc-c-cccccccccccccCcceEEEEEEEEeeeehhhhhhhHhhhcch--hhHHHHHHHHHHHHHH Q lcl|NC_018856. 80 STVAKYAVFNQHGRTG-H-SRFVREVGVASINDPNIRQKTVQMKFLSDTKQQSLAAGLVNNI--ADPMTILTEDAIAVIA 155 (479) Q Consensus 80 stv~eY~~~~~~G~~g-~-~~fv~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lvn~~--~Dp~~~~~~~ai~~~~ 155 (479) |.-.+..++ .+|+.- . ..-.+|......+|+++.+....+|=|+.--.+|.-. |.++. .|-+....+.=..++. T Consensus 54 s~~~~i~~i-~~g~~~~~~~~~~~~~~~~~~~~~tf~~~~l~~~kl~~~v~is~e~-L~D~a~~~~le~~i~~~~Ae~~g 131 (314) T protein:vir:41 54 SYEVDISRI-SLGVELEPGRNTSGTKVAPTADEVTVSTNTLEMKELVTKVVLEDEA-LEDNIEQSAFEQTITSLLASGVT 131 (314) T ss_pred ccceeeccc-ccCcccccccccccCCccCCcccccccceeeeeEEEEEeecccHHH-HHhhhchhhHHHHHHHHHHHHHH Confidence 222222222 334321 1 1122455555678899988888888777654444322 34554 3777777777788999 Q ss_pred HHHHHHHhhcccccCCCCCCcccchhhhHHHhhccCCCEEEccC--CCCCHHHHhhhhhhhhhcc-CceE--EEecChHH Q lcl|NC_018856. 156 KSIEWAIFYGDAALSSEADGQAGIEFDGLHKLIDQDTNVIDLKG--ARLDEATLNKAAVIVGKGY-GRAT--DAFMPIGV 230 (479) Q Consensus 156 ~~iE~a~f~Gd~~l~~~~~~~~gleFDGl~~~I~~~~NviDarG--~~l~~~~l~~aa~~i~~~f-G~~t--d~~mp~~v 230 (479) ...|.+.|-||.+..+.... .. +.||+.+... ..+.+..+ ...+.+.+..+-.-+...| -..+ -.+|+..+ T Consensus 132 ~~~~~~~~nGdg~~~s~~~~-~~-~p~G~l~~a~--~~~~~~~~~~~~~~~~~~~~l~~sl~~~yr~~~~~~~~~m~~~t 207 (314) T protein:vir:41 132 YDLECFFLHADSSLTTGREL-YR-INDGWMKLAG--NQYTDAEPEDENWPLNLFDGMMDELDTRYLQLKPRMKFYVSNEI 207 (314) T ss_pred HHHHHHhhccccCCcCcccc-hh-cchhhhhhcc--cceeecCccccccHHHHHHHHHHhcCchhhcCCCceEEEecHHH Confidence 99999999999875332110 01 6799988764 24666554 3456666665554444444 2222 47799998 Q ss_pred hhhHHHHhhCcceeeeccC--CCc-cee-eeehhhhcC------CCcceecccceecCCCceecccCCcCCCCCCCceeE Q lcl|NC_018856. 231 QADFTNNLLDRQRVIQPST--AGG-FST-GFSINQFLS------TRGAINLHGSTIMENDNILLEGRNPEPNAPQAPASV 300 (479) Q Consensus 231 ka~f~~~~~~~qrv~~~~n--~g~-~~~-G~~I~~~~s------~~G~I~l~~s~~m~~~~~L~e~~~~~~~AP~~pa~v 300 (479) ...+-..+..+.+.+.... .+. ..+ |+.|-.... +.+.|-|.. |.....+ T Consensus 208 ~~~~r~~l~~~~~~l~~~~~~~~~~~~l~G~PV~~~~~~~~~~~~~~~i~fgd--------------------~~nlv~~ 267 (314) T protein:vir:41 208 YNGYRKQLLVRETGLGDSALIGATGLQYDGIPIQYVPALDALGDDKARALLTV--------------------PTNLVYG 267 (314) T ss_pred HHHHHHHHhccCCcccchhhhCCCCceecceeeEecccccccCCCCceEEEec--------------------hhheEEE Confidence 8888887777666653211 011 111 333221111 111111110 0000000 Q ss_pred eeeeccCCCCCCCcccccceEEEEEEEcCcCccccccceeeeeecCCceEEEEEEecCCCCcccceEEEEEecCCCc Q lcl|NC_018856. 301 VASIVDDKKGGFRDEDIKTHSYKVVVHSDDAESLPSEAVTAAVAKKDNTVKLEVKLASLYQAQPQFISVYREGTETG 377 (479) Q Consensus 301 ~at~~t~~~G~f~~~d~gty~YkVtavn~~GES~pS~~vt~Tv~~~g~sv~ltIT~~~~~~a~~~~y~IYR~~~~~G 377 (479) ..-..--. ......-+.+.|..+.-=+.+-.....++-+++.. + .+| T Consensus 268 ~~~~ir~~--~~~~a~~~~~~~~~~~r~d~~~~~~~aa~~~~~~~-----------~-----------------~~~ 314 (314) T protein:vir:41 268 FWRNIRIE--PKRDAAMRRTEYIASLRADCNYEDENAAVAAVIDM-----------S-----------------SGG 314 (314) T ss_pred eeceeEEe--ecccCcCCeEEEEEEEEeceEEEEcCcEEEEEeec-----------c-----------------CCC Confidence 00000000 00000001112221110011100011111111111 1 111 No 110 >protein:vir:100884 Length: 389 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:1473 # MgeName: Lc-Nu # Cross-refs: genbank:acc:YP_358764;genbank:gi:78000028;genbank:GeneID:3726155 Probab=86.10 E-value=0.048 Score=27.81 Aligned_cols=308 Identities=11% Similarity=0.115 Sum_probs=131.6 Q ss_pred CCccchhhhhhhhcCCccchHHHHHHHHHhhhcCCC----cChhhccCccccchhhhhhhhhhheeccccccchhhcccc Q lcl|NC_018856. 1 MTELKKEAEAKNKKLPVEAEAELAELVSKSFTTGYG----ITPDTQLDGAAVRRELLEDQVKMLAFSSNDFTIYPLINKQ 76 (479) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~e~~~Ks~tag~~----~~p~~~~~gaalr~esld~~~~~l~~~~~~f~f~~~i~k~ 76 (479) +++...+..... ........-.+.+.+.+..+.. ..-.+-++|+.+-++-+..+|..+.... -.+++.+... T Consensus 71 ~~~~~~~~~~~~--~~~~~~~~~~~~~~~~lr~~~~~~~~~~~~t~~~gg~~vP~~~~~~i~~~~~~~--~~l~~~~~~~ 146 (389) T protein:vir:10 71 EPKDDGSKKGTD--LSKKPIDAKKKAINDFIHSHGKVIDATSKVTSTEAGVLIPEEIIYDPTAEVNSV--VDLSTLVTKT 146 (389) T ss_pred cccccccccccc--cchhHHHHHHHHHHHHhhcchhhhhhhcccccCCcceeehHHHHHHHHHHHHhh--hhHHhhccee Confidence 111111111111 1100001111112222211111 1112334577777888777764443332 3455556666 Q ss_pred chhHHHHhhhhhhccCccccccccccccccc-ccCcceEEEEEEEEeeeehhhhhhhHhhhcchhhHHHHHHHHHHHHHH Q lcl|NC_018856. 77 QVNSTVAKYAVFNQHGRTGHSRFVREVGVAS-INDPNIRQKTVQMKFLSDTKQQSLAAGLVNNIADPMTILTEDAIAVIA 155 (479) Q Consensus 77 ~~~stv~eY~~~~~~G~~g~~~fv~E~g~~~-~~d~~~~r~~~~~k~l~~~~~vs~~~~lvn~~~Dp~~~~~~~ai~~~~ 155 (479) ++.+.--+|.+.... .+...+++|++... .+++.+.+....++-++.-..+|.-+ +.++..|.+....+.-...+. T Consensus 147 ~~~~~~~~~~~~~~~--~~~~~~~~E~~~~~~~~~~~~~~i~~~~~k~~~~~~iS~el-l~ds~~~l~~~i~~~la~~~~ 223 (389) T protein:vir:10 147 PVTTPKGTYPILKRA--TDRFSSVAELAENPKLAEPEFNKVDWSVATYRGAIPLSEEA-IADSAVDLTALVGQSIKEKSV 223 (389) T ss_pred eccCCeeEEEEEecC--CCccccccccccccccccccceeeeeeheeeEeeehhhHHH-HhhhhHHHHHHHHHHHHHHHH Confidence 666555555554332 23445789987554 78999999999999998888888754 345566777888888888889 Q ss_pred HHHHHHHhhcccccCCCCCCcccchhhhHHHhhccCCCEEEccCCCCCHHHHhhhhhhhhhccCceEEEecChHHhhhHH Q lcl|NC_018856. 156 KSIEWAIFYGDAALSSEADGQAGIEFDGLHKLIDQDTNVIDLKGARLDEATLNKAAVIVGKGYGRATDAFMPIGVQADFT 235 (479) Q Consensus 156 ~~iE~a~f~Gd~~l~~~~~~~~gleFDGl~~~I~~~~NviDarG~~l~~~~l~~aa~~i~~~fG~~td~~mp~~vka~f~ 235 (479) ..++.++..|.....+.... ...-.|-|..++. .....+|+ .-.+|+..+...+. T Consensus 224 ~~~~~~i~~g~~~~~~~~~~-~~~~~d~l~~~~~----------------------~~~~~~~~--a~~~~n~~~~~~L~ 278 (389) T protein:vir:10 224 NTYNAMIAPVLQSFTAKKTT-TDTLVDSLKHILN----------------------VDLDPAYS--RALVVTQSLFNTLD 278 (389) T ss_pred HHHHHHHhhhhccccccccc-ccccHHHHHHHHH----------------------hhhhhhhC--cEEEecHHHHHHHH Confidence 99999999888765442111 1222333333222 11112232 24788888877776 Q ss_pred HHhhCcceeeeccCCCcceeeeehhhhcCCCcceecccceecCCCceecccCCcCCCCCCCceeEeeeeccCCCCCCCcc Q lcl|NC_018856. 236 NNLLDRQRVIQPSTAGGFSTGFSINQFLSTRGAINLHGSTIMENDNILLEGRNPEPNAPQAPASVVASIVDDKKGGFRDE 315 (479) Q Consensus 236 ~~~~~~qrv~~~~n~g~~~~G~~I~~~~s~~G~I~l~~s~~m~~~~~L~e~~~~~~~AP~~pa~v~at~~t~~~G~f~~~ 315 (479) ..-...-|.+...+..+...+ -.+.+++..+-...+...+... .|. ..- T Consensus 279 ~lkd~~G~~i~~~~~~~~~~~--------------~~~~~l~G~pV~~~~~~~~~~~----------------~~~-~~~ 327 (389) T protein:vir:10 279 TLKDKNGRYLLHDASDSITDG--------------TAKGTILGVPVYVVGDTLLGSL----------------AGD-QKA 327 (389) T ss_pred HhhccCCCeeeecCccccccc--------------ccccccccceeEEecccccCCC----------------CCc-eEE Confidence 544333344433222211110 0112333333222221111100 000 000 Q ss_pred cccceEEEEEEEcCcCcccc-ccceeeeeecCCceEEEEEEecCCCCcccceEEEEEec-----CCCcceEEEEeeeeee Q lcl|NC_018856. 316 DIKTHSYKVVVHSDDAESLP-SEAVTAAVAKKDNTVKLEVKLASLYQAQPQFISVYREG-----TETGHYFLIARVPVSK 389 (479) Q Consensus 316 d~gty~YkVtavn~~GES~p-S~~vt~Tv~~~g~sv~ltIT~~~~~~a~~~~y~IYR~~-----~~~G~y~li~rv~vs~ 389 (479) -.|.++..+..+++.|-+.- +....-+ -.+..++|=+ .....++-+..+|.++ T Consensus 328 ~~gd~~~~~~~~~~~~~~i~~~~~~~~~---------------------~~~~~~~r~d~~~~~~~a~~~~~~~~~~~~~ 386 (389) T protein:vir:10 328 FVGDLKRGVLFTDRQQVTLAWEDSKIYG---------------------KYLGAAFRFGVQKADSKAGYFVTNTDVPGSA 386 (389) T ss_pred EEeeccccEEEEeecceEEEeecccccc---------------------ceEEEEEEeccEEecccceEEEEeeccCCCC Confidence 11222211122222221100 0000000 0011112210 1111222222233222 Q ss_pred ecCCce Q lcl|NC_018856. 390 VNDQGV 395 (479) Q Consensus 390 ~n~~g~ 395 (479) . +. T Consensus 387 ~---~~ 389 (389) T protein:vir:10 387 L---GK 389 (389) T ss_pred C---CC Confidence 1 11 No 111 >protein:vir:96762 Length: 632 # NCBI annotation: putative phage-related protein # Family: family:all:21 # MgeID: mge:1628 # MgeName: VP882 # Cross-refs: genbank:acc:YP_001039818;genbank:gi:126010917;genbank:GeneID:5076272 Probab=85.75 E-value=0.051 Score=27.69 Aligned_cols=298 Identities=12% Similarity=0.092 Sum_probs=123.5 Q ss_pred CCccch-------hhhh---hh-------hcCCc---cchHHH-------------------HHHHHHhhhcCCCcChhh Q lcl|NC_018856. 1 MTELKK-------EAEA---KN-------KKLPV---EAEAEL-------------------AELVSKSFTTGYGITPDT 41 (479) Q Consensus 1 ~~~~~~-------~~~~---~~-------~~~~~---~~~~~~-------------------~e~~~Ks~tag~~~~p~~ 41 (479) +++... +.+. ++ .+... ..+.+. .+.+.+++.++.. T Consensus 288 ~~~~~~~~~i~~~~re~~~~~l~rai~a~a~~~~~~a~~~~e~a~~~a~~~G~~arg~~~~~~~l~~ra~~~~t~----- 362 (632) T protein:vir:96 288 KPAIHSARDLGIQHKELQQYSLMRAINAAATGDWSKAGFEREVSLAIADASGKEARGFYMPHEVLVQRQLEKKTA----- 362 (632) T ss_pred hhhhhhhhhhhhhHHHHHHHHHHHHHHhhhccchhhhhhhhHHHHHHHHhhhhhhhhhhhhHHHHHHhhhhcccc----- Confidence 110000 0000 00 00000 000000 1112234433322 Q ss_pred ccCccccch-hhhhhhhhhheeccccccchhhccccchhHHHHhhhhhhccCcccccccccccccccccCcceEEEEEEE Q lcl|NC_018856. 42 QLDGAAVRR-ELLEDQVKMLAFSSNDFTIYPLINKQQVNSTVAKYAVFNQHGRTGHSRFVREVGVASINDPNIRQKTVQM 120 (479) Q Consensus 42 ~~~gaalr~-esld~~~~~l~~~~~~f~f~~~i~k~~~~stv~eY~~~~~~G~~g~~~fv~E~g~~~~~d~~~~r~~~~~ 120 (479) .+|+.|-. |.+..++..+.. +-..+..+.-+.+......+. +..+.+.+...+++|++..+-+++.+.+.+... T Consensus 363 -~~gg~lvp~~~~~~~iie~lr---~~s~i~~l~~~~~~~~~g~~~-ip~~~~~~~a~wv~E~~~~~~s~~~f~~i~l~~ 437 (632) T protein:vir:96 363 -GKGGELVATELLSEEFIDILR---NKAIIGQMGARMLPGLVGDVD-IPKKTSGANFYWIGEDEDVQDSDFDFTTLSFSP 437 (632) T ss_pred -cccccccccccchHHHHHHHh---hcchhhhhcceEeecCCcceE-EEEEeCCceeEeecCCccccccccceeeEEeee Confidence 23444444 333333321111 111222222122221111111 112333445679999999999999999999999 Q ss_pred EeeeehhhhhhhHhhhcchhhHHHHHHHHHHHHHHHHHHHHHhhcccccCCCCCCcccchhhhHHHhhccCCCEEEccCC Q lcl|NC_018856. 121 KFLSDTKQQSLAAGLVNNIADPMTILTEDAIAVIAKSIEWAIFYGDAALSSEADGQAGIEFDGLHKLIDQDTNVIDLKGA 200 (479) Q Consensus 121 k~l~~~~~vs~~~~lvn~~~Dp~~~~~~~ai~~~~~~iE~a~f~Gd~~l~~~~~~~~gleFDGl~~~I~~~~NviDarG~ 200 (479) |=++.-..+|..+= .++.-|.+....++-...++..++.++++|+.. +. +--|+.+.-. -+.+...+. T Consensus 438 ~k~~~~v~iS~ell-~ds~~~~~~~i~~~l~~a~~~~~d~a~l~G~G~-~~--------~p~Gi~~~~~--~~~~~~~~~ 505 (632) T protein:vir:96 438 KTIAGAVPVTRKLR-KQSSIHVENLIREDLIEGIGVALDLAMLTGTGL-AN--------DPVGLLNMTG--VPALTYPAG 505 (632) T ss_pred eEEEEehhhHHHHH-hccchHHHHHHHHHHHHHHHHHHHHHhhcccCC-CC--------ccceeeeccc--ccceecccc Confidence 99998888877653 244457788888899999999999999999753 11 1245554332 234555555 Q ss_pred CCCHHHHhhhhhhhhhccCceEE--EecChHHhhhHHHH-hhC-cceeeeccCCCcceeeeehhhhcCCCcceeccccee Q lcl|NC_018856. 201 RLDEATLNKAAVIVGKGYGRATD--AFMPIGVQADFTNN-LLD-RQRVIQPSTAGGFSTGFSINQFLSTRGAINLHGSTI 276 (479) Q Consensus 201 ~l~~~~l~~aa~~i~~~fG~~td--~~mp~~vka~f~~~-~~~-~qrv~~~~n~g~~~~G~~I~~~~s~~G~I~l~~s~~ 276 (479) .++.+.|..+...+...++.... ..|+......+... +.+ .-+.+..+ .++ T Consensus 506 ~~~~~~i~~~~~~i~~~~~~~~~~~~~~~~~~~~~l~~~~l~d~~G~~i~~~-------------------------~~l 560 (632) T protein:vir:96 506 GVDWASVVDMETKISTFNADAGRLAYLTSVTQRGAAKKAQVFDNTGERIWQN-------------------------NEV 560 (632) T ss_pred cCCHHHHHHHHHHHhhcccccCccEEEEchhHHHHHHHHhccCCCCceeecC-------------------------Cee Confidence 56766666666666666665443 45676666555432 211 11222110 111 Q ss_pred cCCCceecccCCcCCCCCCCceeEeeeeccC--CCCCCCcccccceEEEEEEEcCcCccccccceeeeeecCCceEEEEE Q lcl|NC_018856. 277 MENDNILLEGRNPEPNAPQAPASVVASIVDD--KKGGFRDEDIKTHSYKVVVHSDDAESLPSEAVTAAVAKKDNTVKLEV 354 (479) Q Consensus 277 m~~~~~L~e~~~~~~~AP~~pa~v~at~~t~--~~G~f~~~d~gty~YkVtavn~~GES~pS~~vt~Tv~~~g~sv~ltI 354 (479) +.. |..++...+.+ -.|.|..-..+.+.-....++++. -. ..+.+.+.. T Consensus 561 ~G~-----------------pv~~s~~ip~~~~~~gd~s~~~i~~~~~~~i~~~~~~-----------~~-~~~~v~~~~ 611 (632) T protein:vir:96 561 NGY-----------------RAEASNQIPADTWIFGDWSQIVIAMWGVLDLKVDPYT-----------KA-ASDGLVLRV 611 (632) T ss_pred ccc-----------------ceEeccccccCcEEEeecceEEEEEecceEEEEcccc-----------cc-ccCceEEEE Confidence 111 11111111100 012221100111110111111111 00 112222222 Q ss_pred Eec-CCCCcccceEEEEEecC Q lcl|NC_018856. 355 KLA-SLYQAQPQFISVYREGT 374 (479) Q Consensus 355 T~~-~~~~a~~~~y~IYR~~~ 374 (479) ... .....-|..+.+.|..+ T Consensus 612 ~~~~d~~v~~~~af~~~k~~A 632 (632) T protein:vir:96 612 FQDVDAGVRRKEAFCIAKKGA 632 (632) T ss_pred EeecCceeechhhhhheeecC Confidence 111 01111112223333332 No 112 >protein:vir:78350 Length: 383 # NCBI annotation: Cps # Family: family:all:635 # MgeID: mge:1850 # MgeName: B025 # Cross-refs: genbank:acc:YP_001468644;genbank:gi:157325222;genbank:GeneID:5601696 Probab=85.70 E-value=0.03 Score=28.92 Aligned_cols=297 Identities=13% Similarity=0.093 Sum_probs=116.7 Q ss_pred CCccc-hhh--hhh-----hhcCCccchHHHHHHHHH---hhhcCCCcChhhccCccccchhhhhhhhh-hheecccccc Q lcl|NC_018856. 1 MTELK-KEA--EAK-----NKKLPVEAEAELAELVSK---SFTTGYGITPDTQLDGAAVRRELLEDQVK-MLAFSSNDFT 68 (479) Q Consensus 1 ~~~~~-~~~--~~~-----~~~~~~~~~~~~~e~~~K---s~tag~~~~p~~~~~gaalr~esld~~~~-~l~~~~~~f~ 68 (479) +.+.. ++. +.+ ...... ..+.+...-.| ++.+ .+-++|+.|-.+.+.+.|. .|. +.-. T Consensus 43 ~~~~~~~~~~~~~~~~~~~~~~~~~-g~~~lt~~e~~~~~~~~~------~~~~~gg~lvP~~~~~~I~~~l~---~~s~ 112 (383) T protein:vir:78 43 MAADIMEQAKKEARQEADAYISASR-TDKNITNEEIKFFNDINK------EVGYKEETLLPQTVVDEIFEDLT---TEHP 112 (383) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhcC-ChhhhhHHHHHHHHHHhc------cCCCCCccccCHHHHHHHHHHHH---hhcc Confidence 00000 000 000 000000 00000000112 2222 2345677788887777764 332 2234 Q ss_pred chhhccccchhHHHHhhhhhhccCccccccccccccc-ccccCcceEEEEEEEEeeeehhhhhhhHhhhcchhhHHHHHH Q lcl|NC_018856. 69 IYPLINKQQVNSTVAKYAVFNQHGRTGHSRFVREVGV-ASINDPNIRQKTVQMKFLSDTKQQSLAAGLVNNIADPMTILT 147 (479) Q Consensus 69 f~~~i~k~~~~stv~eY~~~~~~G~~g~~~fv~E~g~-~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lvn~~~Dp~~~~~ 147 (479) +++.+...++.+.+ ++. ...+.+.+.+++|.+. +...|+.+.+.....+=|+.-..+|..+ |.++..|.+.... T Consensus 113 l~~~~~v~~~~~~~-~i~---~~~~~~~a~w~~e~~~~~~~~~~~f~~i~l~~~kl~~~i~is~el-l~Ds~~~ie~~i~ 187 (383) T protein:vir:78 113 FLASIGMRTTGLRT-KFL---KSETSGVAVWGKIFGEIKGQLDATFSDEESIQNKLTAFVVVPKDL-EKFGPAWVKRFVV 187 (383) T ss_pred ceeeeeeEecCCce-EEE---EEcCCcceEEeecccccccccCcceeeEeecceeeEeeccchHHH-hhccHHHHHHHHH Confidence 55555555554443 333 3444455668899775 4678999999999999998776666554 4567778999999 Q ss_pred HHHHHHHHHHHHHHHhhcccccCCCCCCcccchhhhHHHhhccCCCEEEc---cC---CCCC---HHHH-hhhhhhhhhc Q lcl|NC_018856. 148 EDAIAVIAKSIEWAIFYGDAALSSEADGQAGIEFDGLHKLIDQDTNVIDL---KG---ARLD---EATL-NKAAVIVGKG 217 (479) Q Consensus 148 ~~ai~~~~~~iE~a~f~Gd~~l~~~~~~~~gleFDGl~~~I~~~~NviDa---rG---~~l~---~~~l-~~aa~~i~~~ 217 (479) +.--.++++.++.+++.||-.-- --||++.+....++... .+ +.++ ...+ +...-.+... T Consensus 188 ~~l~~~~a~~~~~a~i~G~G~~q----------P~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~ 257 (383) T protein:vir:78 188 TQIEEAFAVALESAYIVGDGNDK----------PIGLNRKVGKGSTVVDGVYAEKAATGTLTFANPKTTVNELTDVYKYH 257 (383) T ss_pred HHHHHHHHHHHhhheEeccCCCC----------ceeeeeccCCcccccccccccccccchhhhhhhHHHHHHHHHHHhcc Confidence 99999999999999999996421 25777665433232211 00 1111 1111 1111000000 Q ss_pred -c----------CceEEEecChHHhhhHHHHhhCcceeeeccCCCccee--eee---hhhhcCCCcceecccc----eec Q lcl|NC_018856. 218 -Y----------GRATDAFMPIGVQADFTNNLLDRQRVIQPSTAGGFST--GFS---INQFLSTRGAINLHGS----TIM 277 (479) Q Consensus 218 -f----------G~~td~~mp~~vka~f~~~~~~~qrv~~~~n~g~~~~--G~~---I~~~~s~~G~I~l~~s----~~m 277 (479) | |.+ ...|......+ ..|.... +. ..|.... |+. |.+-..+.|.|.| |+ .+. T Consensus 258 ~~~~~~~~~~~~~~~-~~~~n~~~~~~----~~~~~~~-~~-~~G~~~t~l~~~~~iv~s~~~p~~~iif-gdfs~Y~i~ 329 (383) T protein:vir:78 258 SVKENGHPLNVAGKV-TLLVNPTDAWD----VKKQYTS-LN-ANGVYVTALPFNLNIIESLFVPEKKAIS-YVAERYDAL 329 (383) T ss_pred chhcccchhhhcCce-EEEEcCcchhh----hccchhc-cC-CCCceeeecCCCceEEecCCCCcccEEE-eeccceEEE Confidence 0 111 01222211111 1111111 11 1121111 111 1122233444432 21 111 Q ss_pred CCCceecccCCcCCCCCCCceeEeeeeccCCCCCCCcccccceEEEEEEEc-CcCcccccc Q lcl|NC_018856. 278 ENDNILLEGRNPEPNAPQAPASVVASIVDDKKGGFRDEDIKTHSYKVVVHS-DDAESLPSE 337 (479) Q Consensus 278 ~~~~~L~e~~~~~~~AP~~pa~v~at~~t~~~G~f~~~d~gty~YkVtavn-~~GES~pS~ 337 (479) ++..+-++. ..+..+-.--+.-.+..-. +|+ +-+. -..+|.-++ ..++-.|.. T Consensus 330 ~r~~~~i~~-~~~~~f~~d~~~f~~~~r~--dG~--~~~~--~A~~vl~~~~~~~~~~~~~ 383 (383) T protein:vir:78 330 IGGPLDIGT-YDQTLAIEDLNLYAAKQFA--YGK--AKDD--KAAAVWTLNINPAEQTPEG 383 (383) T ss_pred ecccceEEe-cchhhhhcCceEEEEEEEE--cCE--EecC--CeEEEEEEEecCCCCCCCC Confidence 111110000 0000000000000111111 111 0000 012222122 222222222 No 113 >protein:vir:96442 Length: 418 # NCBI annotation: hypothetical protein # Family: family:all:11266 # MgeID: mge:1616 # MgeName: 119X # Cross-refs: genbank:acc:YP_001218814;genbank:gi:147917331;genbank:GeneID:5142645 Probab=84.52 E-value=0.05 Score=27.73 Aligned_cols=327 Identities=14% Similarity=0.093 Sum_probs=148.7 Q ss_pred CCccchhhhhhhhc---CCccchHHHHHHHHHhhhcCCCcChhhccCccccchhhh---h--hhhhhheeccccccchhh Q lcl|NC_018856. 1 MTELKKEAEAKNKK---LPVEAEAELAELVSKSFTTGYGITPDTQLDGAAVRRELL---E--DQVKMLAFSSNDFTIYPL 72 (479) Q Consensus 1 ~~~~~~~~~~~~~~---~~~~~~~~~~e~~~Ks~tag~~~~p~~~~~gaalr~esl---d--~~~~~l~~~~~~f~f~~~ 72 (479) |+-.-.+.-.+-++ .....+.+.-- +.|.-.++ -+--+-.++..|+.-.| + .++..++.-+.+ + -+ T Consensus 40 ~i~~g~~~~~~~~t~~w~~d~l~~~~~~-~ta~~~a~--~T~i~V~~~~~f~~~~l~~~~~~~EvirVtsVng~-~--lT 113 (418) T protein:vir:96 40 MTSVVGSTTAKASTHGYFSKTMVFASAV-VTAEALAD--ATVLTVENSDGLTKGMIFYNEATGENMRLELVNGL-N--LT 113 (418) T ss_pred hhcccCccccceeEEEEEeeEeeeeeEE-EEEEEecC--ceEEEecCCcccccccEEEEecCCeEEEEEEEeCC-E--EE Confidence 22222221111111 11111110000 00111111 11123334444776665 1 344333332211 1 11 Q ss_pred ccccchhHHHHhhhhhhccCcccc--cccccccccccccCcceE--EEEEEEEeeeehhhhhhhHhh-h--cchhhHHHH Q lcl|NC_018856. 73 INKQQVNSTVAKYAVFNQHGRTGH--SRFVREVGVASINDPNIR--QKTVQMKFLSDTKQQSLAAGL-V--NNIADPMTI 145 (479) Q Consensus 73 i~k~~~~stv~eY~~~~~~G~~g~--~~fv~E~g~~~~~d~~~~--r~~~~~k~l~~~~~vs~~~~l-v--n~~~Dp~~~ 145 (479) +.|... .|..+= .--|.-+. +--+-||.+...+. .+. ++...+--+.+..+||.-++. + -+++|.... T Consensus 114 V~RG~~-~t~aa~---iaag~~~~~ig~~~eEGsd~~ta~-~~k~~~vsN~tQIf~e~vsVSgTAqA~v~qaGvsn~~~~ 188 (418) T protein:vir:96 114 VKRQTG-RIAAAI---IAANTKLIVIGTAFEEGSQRPTAR-SIQPVYVPNFTQIFRNAWALTDTARASYAEAGYSNITES 188 (418) T ss_pred EEEccC-Ceeeee---eecCceEEEeecCcccccccCCcc-eecceeccchhheehhhhhhhhhhhhhhhhcCcchhHHH Confidence 222111 111110 00111111 11224777765544 121 222223334566667766544 2 256677666 Q ss_pred HHHHHHHHHHHHHHHHHhhcccccCCCCCCccc----chhhhHHHhhccCCCEEEccCC-CCCHHHHhhhhhhhhh---c Q lcl|NC_018856. 146 LTEDAIAVIAKSIEWAIFYGDAALSSEADGQAG----IEFDGLHKLIDQDTNVIDLKGA-RLDEATLNKAAVIVGK---G 217 (479) Q Consensus 146 ~~~~ai~~~~~~iE~a~f~Gd~~l~~~~~~~~g----leFDGl~~~I~~~~NviDarG~-~l~~~~l~~aa~~i~~---~ 217 (479) + .|+|.-.+..+|.++++|.+......+.. . =.-|||..-++ +||+++.+. .++++.|..+...+.+ + T Consensus 189 e-~d~l~~~kv~iE~ali~g~~~~~~~ng~p-~~~t~R~m~gI~~f~~--~Nvi~ag~~~~~t~d~L~~~~~~a~~~g~n 264 (418) T protein:vir:96 189 R-RDCMDFHATEQETAIFFGQAFMGTYNGQP-LHTTQGIVDAIRQYAP--DNVNAMPNPTAVTYDDVVDATIDAFKWSVN 264 (418) T ss_pred H-HHHHHHHHHHHHHhhhccccccCCCCCcc-cccccchhHHHHhhcc--ccccccCCCCcCCHHHHHHHHHHHHhhcCC Confidence 6 69999999999999999998773222111 0 01366665553 599999986 6999999988766655 4 Q ss_pred cCceEE-----EecChHHhhhHHHHhhCcceeeeccCCCcceeeeehhhhcCCCcceecccceecCCCce------ecc- Q lcl|NC_018856. 218 YGRATD-----AFMPIGVQADFTNNLLDRQRVIQPSTAGGFSTGFSINQFLSTRGAINLHGSTIMENDNI------LLE- 285 (479) Q Consensus 218 fG~~td-----~~mp~~vka~f~~~~~~~qrv~~~~n~g~~~~G~~I~~~~s~~G~I~l~~s~~m~~~~~------L~e- 285 (479) -|..++ ++.|...|..+...+ ...|. +......|..|+.|.|-.|-|++--..||..+++ +++ T Consensus 265 ~G~~~~~~~y~~~V~a~~k~~I~k~~-~~I~~----~~~en~~G~vv~~~~Td~G~v~ii~n~~~pad~I~~g~mlVvD~ 339 (418) T protein:vir:96 265 VGDNTQRVMFCDTVGMRTMQDIGRFF-GEVTV----TQRETSYGMVFTEWKFFKGRLIIKEHPLFSAIGISPGFAVVVDV 339 (418) T ss_pred CCCcccceEEEEEeChHHHHHHhhhh-ceeEe----ccccceeceEEEEEEeeccEEEEEecCCCCccccCcceEEEEec Confidence 577776 566999999998754 33343 4556689999999999999999988888877763 222 Q ss_pred cCC------cCCCCCCCceeEeeeeccCCCCCCCcccccceEEEEEEEcCcCccccccceeeeeecCCceEEEEEEec-- Q lcl|NC_018856. 286 GRN------PEPNAPQAPASVVASIVDDKKGGFRDEDIKTHSYKVVVHSDDAESLPSEAVTAAVAKKDNTVKLEVKLA-- 357 (479) Q Consensus 286 ~~~------~~~~AP~~pa~v~at~~t~~~G~f~~~d~gty~YkVtavn~~GES~pS~~vt~Tv~~~g~sv~ltIT~~-- 357 (479) ... .++-+|-... .++.+-+| +++..|-- ..+-..++.+.-++|+- T Consensus 340 ~~vkL~yL~~R~~~~E~l~------k~G~~~~~------------------~~~~~~~~--~~~D~~~G~l~~Eltle~~ 393 (418) T protein:vir:96 340 PAVKLAYMDGRNAKVENYG------QGGGENKS------------------GATDYSYG--HGVDAQGGSLTSEWALELL 393 (418) T ss_pred CceEEEEecCCCccchhcc------cCCCcccc------------------cccccccc--cccccccCEEEEEEEEEee Confidence 111 0111111111 11111111 11111100 00001111222222221 Q ss_pred ---------CCCCcccceEEEEEecCCCcceEEEEeeeeeeecCCceEEEeeccccCC Q lcl|NC_018856. 358 ---------SLYQAQPQFISVYREGTETGHYFLIARVPVSKVNDQGVIEVLDRNQVIP 406 (479) Q Consensus 358 ---------~~~~a~~~~y~IYR~~~~~G~y~li~rv~vs~~n~~g~T~ftD~N~~iP 406 (479) +++-+-| .||-..+ -| T Consensus 394 N~~a~a~itgl~~~~~---~~~~~~~------------------------------~~ 418 (418) T protein:vir:96 394 NPQGCAVITGLQKAKE---RVYLTAP------------------------------AP 418 (418) T ss_pred cccccEEeeccccccc---ccccCCC------------------------------CC Confidence 1111111 1111110 11 No 114 >protein:vir:93616 Length: 645 # NCBI annotation: putative major head protein/prohead protease # Family: family:all:21 # MgeID: mge:157 # MgeName: phi 4795 # Cross-refs: genbank:acc:YP_001449293;genbank:gi:157166041;goa:Q6H9U8;interpro:IPR006433;uniprot:Q6H9U8;genbank:GeneID:5580438 Probab=83.57 E-value=0.067 Score=27.01 Aligned_cols=327 Identities=9% Similarity=0.043 Sum_probs=130.1 Q ss_pred CC----ccchhhhh---hh------hcCCccch-----------HHHHHHHHHhhhcCCCcChhhccCccccchhhhhhh Q lcl|NC_018856. 1 MT----ELKKEAEA---KN------KKLPVEAE-----------AELAELVSKSFTTGYGITPDTQLDGAAVRRELLEDQ 56 (479) Q Consensus 1 ~~----~~~~~~~~---~~------~~~~~~~~-----------~~~~e~~~Ks~tag~~~~p~~~~~gaalr~esld~~ 56 (479) .. +.+.++-. +. .+.+.... ......+.+++.+|..+++. .+|+-+-.+.+..+ T Consensus 280 ~~~~~~~~~~~kg~~f~~~~~al~~~~g~~~~a~e~a~~~~~~~~~~~~~~~~a~~~~~~~~~~--~~Gg~~vp~~~~~~ 357 (645) T protein:vir:93 280 APVIRVEQKLDKGIGFARFAKSLAAAKGVRSEALEVARRQYPDDSRLHHVLKSAVGAGTTTDPQ--WAGSLSEYQEYAQD 357 (645) T ss_pred cccccchhhhhhhhhHHHHHHHHHhcccchhHHHHHHHhhcccchhhhhhhhhhhhcccccccc--ccCCccCchhhHHH Confidence 00 00000000 00 01111111 11122244666666655443 34555666665555 Q ss_pred hhhheeccccccchhhccccchhHHHH-hhhh-hhccCcccccccccccccccccCcceEEEEEEEEeeeehhhhhhhHh Q lcl|NC_018856. 57 VKMLAFSSNDFTIYPLINKQQVNSTVA-KYAV-FNQHGRTGHSRFVREVGVASINDPNIRQKTVQMKFLSDTKQQSLAAG 134 (479) Q Consensus 57 ~~~l~~~~~~f~f~~~i~k~~~~stv~-eY~~-~~~~G~~g~~~fv~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~ 134 (479) +..+.... ..+..+..+....... .++. +....+.+...+++|++..+.+++.+...+...|=|+.--.+|.-+= T Consensus 358 ii~~l~~~---svv~~l~~~~~~~~~~~~~~~~ip~~t~~~~a~wv~Eg~~~~~s~~~f~~v~l~~~kla~~~~iS~ell 434 (645) T protein:vir:93 358 FIDYLRPQ---TIIGRFGQGGIPALRQVPFNIRVHAQVSGGAAGWVGEGKTKPLTKFDFESITFSHAKVSAIAVLTEELI 434 (645) T ss_pred HHHhhhhh---hhHHhhccccccccccccCceeeeeeecCcceEEeccCccccccccceeEEEEeeEEEEEeehhHHHHH Confidence 53222111 1122222111111000 1111 11112224577999999999999999999999999988777777543 Q ss_pred hhcchhhHHHHHHHHHHHHHHHHHHHHHhhcccccCCCCCCcccchhhhHHHhhccCCCEEEccCCCCCHHHHhhhhhhh Q lcl|NC_018856. 135 LVNNIADPMTILTEDAIAVIAKSIEWAIFYGDAALSSEADGQAGIEFDGLHKLIDQDTNVIDLKGARLDEATLNKAAVIV 214 (479) Q Consensus 135 lvn~~~Dp~~~~~~~ai~~~~~~iE~a~f~Gd~~l~~~~~~~~gleFDGl~~~I~~~~NviDarG~~l~~~~l~~aa~~i 214 (479) . ++.-|.+....++-...+++.++.++|.|+..-..+. .+.|+ .... ..+...| ....+..+...... T Consensus 435 ~-ds~~~~~~~i~~~l~~aia~~~d~a~l~g~g~~~~~~-~p~gi-----~~~~----~~~~~~~-~~~~d~~~~~~~~~ 502 (645) T protein:vir:93 435 R-FSSPAADALVRNALAEAVVARLDTDFVDPKKAAVADV-SPASI-----THDV----KGTASSG-NPDADAEAAFGQFV 502 (645) T ss_pred h-hchHHHHHHHHHHHHHHHHHHHHHHhhcCCCcccCCc-cccce-----eccc----ccccccc-chHHHHHHHHHHHH Confidence 2 3445677888889999999999999999886532111 11222 1100 1111222 22234333333334 Q ss_pred hhccCceEE-EecChHHhhhHHHHhhCcceeeeccC--CCcceeeeehhhhcCCCcceecc--cceec-CCCceec---- Q lcl|NC_018856. 215 GKGYGRATD-AFMPIGVQADFTNNLLDRQRVIQPST--AGGFSTGFSINQFLSTRGAINLH--GSTIM-ENDNILL---- 284 (479) Q Consensus 215 ~~~fG~~td-~~mp~~vka~f~~~~~~~qrv~~~~n--~g~~~~G~~I~~~~s~~G~I~l~--~s~~m-~~~~~L~---- 284 (479) ..++...+- ..|++.+...+...-...-+.+.+.- .++--.|++|-.......++.|- .++++ ....+.+ T Consensus 503 ~a~~~~~~a~~vmn~~~~~~L~~lkd~~G~~~~~~~~~~~~tL~G~PV~~s~~vp~~~~~gd~s~~~ig~~~~v~i~~s~ 582 (645) T protein:vir:93 503 AANLQPTGAVWLMSSTNALALSMRKNALGQKEYPDMTLLGGSFQGLPVIVSQYVGDQLVLVNAPDIYLADDGGVAVDMSR 582 (645) T ss_pred hcCCCccccEEEEcHHHHHHHHhccccCCceeecCCCCCCceeeceeeEEeccCCcceeEeccccEEEEEecceEEEeec Confidence 444444443 45899888887554443222232321 12234555543332222222210 00000 0000000 Q ss_pred ccCCcCCCCCCCceeEeeeeccCCCCCCCcccccceEEEEEEEcCcCccccccceeee---eecCCce Q lcl|NC_018856. 285 EGRNPEPNAPQAPASVVASIVDDKKGGFRDEDIKTHSYKVVVHSDDAESLPSEAVTAA---VAKKDNT 349 (479) Q Consensus 285 e~~~~~~~AP~~pa~v~at~~t~~~G~f~~~d~gty~YkVtavn~~GES~pS~~vt~T---v~~~g~s 349 (479) +..+---..|.......+ .......|.. .---+++...-+-+---|...+..| .-+..+. T Consensus 583 ~a~~~~~~~~~~~~~~~~--~~~~v~lf~~---d~vaira~~r~d~~~~~p~a~~~lt~~~~g~~~~~ 645 (645) T protein:vir:93 583 EASLEMQSEPTGDSTTPS--PVELVSMFQT---GSVAIRAERWINWRRRRTAAVAVITGVNYGSASGG 645 (645) T ss_pred ceeEEEeecccccccccc--cccchhHhhc---CceEEEEEEEEcceeeCccceEEEecccCCcccCC Confidence 000000000100000000 0000011211 1122333322222211121111111 1110011 No 115 >protein:vir:80128 Length: 466 # NCBI annotation: Phage capsid protein # Family: family:all:635 # MgeID: mge:1877 # MgeName: bacteriophage bv1 # Cross-refs: genbank:acc:YP_001425603;genbank:gi:155042936;genbank:GeneID:5469556 Probab=83.12 E-value=0.071 Score=26.88 Aligned_cols=324 Identities=11% Similarity=0.052 Sum_probs=135.2 Q ss_pred CCccchhhhhhhh-------c-----------CCcc--chHHHHHHHHHhhhc--CCCcChhhccCccccchhhhhhhhh Q lcl|NC_018856. 1 MTELKKEAEAKNK-------K-----------LPVE--AEAELAELVSKSFTT--GYGITPDTQLDGAAVRRELLEDQVK 58 (479) Q Consensus 1 ~~~~~~~~~~~~~-------~-----------~~~~--~~~~~~e~~~Ks~ta--g~~~~p~~~~~gaalr~esld~~~~ 58 (479) ..+.+.+.+.... . +... ....+.+.+.+.+.. .-...-.+..||+++-+|.+-..|. T Consensus 91 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~vP~~~~~~i~ 170 (466) T protein:vir:80 91 EPKNNSEPAQVSGARTQQFVGGETRMKGFFRNMPYEQRAALIARSEVKEFLAQVRTLAQQKRAVSGAELTIPDVMLELLR 170 (466) T ss_pred hhccCchhHHHHhhhhhHHhhHHHHHHHHHHhhhhhhHHHHHHHHHHHHHHHHHHHHhhhhhhhccccccccHHHHHHHH Confidence 1000000000000 0 0000 000000000000000 0001112334566677776655553 Q ss_pred hheeccccccchhhccccchhHHHHhhhhhhccCcccccccccccccccccCcceEEEEEEEEeeeehhhhhhhHhhhcc Q lcl|NC_018856. 59 MLAFSSNDFTIYPLINKQQVNSTVAKYAVFNQHGRTGHSRFVREVGVASINDPNIRQKTVQMKFLSDTKQQSLAAGLVNN 138 (479) Q Consensus 59 ~l~~~~~~f~f~~~i~k~~~~stv~eY~~~~~~G~~g~~~fv~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lvn~ 138 (479) ... .....+.+.+...++..++ ++.+ ++....+.++.|++..+..|+.+.+....++=++.--.+|.-+- .++ T Consensus 171 ~~l--~~~~~l~~~~~v~~~~g~~-~~~~---~~~~~~a~wv~E~~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell-~ds 243 (466) T protein:vir:80 171 DNM--HRYSKLISKVRLRPLKGTA-RQNI---AGAIPEGVWTEAVANLNELSLSFSQIEVDGYKVGGFIPIPNSTL-EDS 243 (466) T ss_pred Hhh--hhhhhhhhheeeeecCcee-Eeee---ecCCcceeecccccccccccccccceeecceeeeeehhhhHHHH-hcc Confidence 221 1223455656555554433 2333 45454577899999999999999998888887777666665432 356 Q ss_pred hhhHHHHHHHHHHHHHHHHHHHHHhhcccccCCCCCCcccchhhhHHHhhccC------------------CCEEEcc-- Q lcl|NC_018856. 139 IADPMTILTEDAIAVIAKSIEWAIFYGDAALSSEADGQAGIEFDGLHKLIDQD------------------TNVIDLK-- 198 (479) Q Consensus 139 ~~Dp~~~~~~~ai~~~~~~iE~a~f~Gd~~l~~~~~~~~gleFDGl~~~I~~~------------------~NviDar-- 198 (479) ..|.+....+.-...++..++.+++.||-.=.| -|+++.+... .+.+++. T Consensus 244 ~~~l~~~i~~~la~~~~~~~~~ail~G~G~~~P----------~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 313 (466) T protein:vir:80 244 DLNLADEILDAIGQAIGFALDKAILYGTGTKMP----------VGIVTRLAQTTQPPNWGTKAPAWTNLSTTNLLKIDPT 313 (466) T ss_pred hHHHHHHHHHHHHHHHHHHHhhheeeccCCCCc----------ceeeecccccccccccccccccccccchhhhhhhhhh Confidence 668888999999999999999999999864222 3555443211 0011110 Q ss_pred CCCCC--HHHHhhhhhhhhhccCceEEEecChHHh-hhHHHHhh----CcceeeeccCCCcceeeeehhhhc-CCCccee Q lcl|NC_018856. 199 GARLD--EATLNKAAVIVGKGYGRATDAFMPIGVQ-ADFTNNLL----DRQRVIQPSTAGGFSTGFSINQFL-STRGAIN 270 (479) Q Consensus 199 G~~l~--~~~l~~aa~~i~~~fG~~td~~mp~~vk-a~f~~~~~----~~qrv~~~~n~g~~~~G~~I~~~~-s~~G~I~ 270 (479) +..-. -..+..+....-..++.+.++|++.... ..+....+ ++..+..+++. ..-.|++|-... .+.|. T Consensus 314 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~w~~~~~~~~~l~~~~~~~~~~g~~~~~~~~~-~~i~G~pvv~s~~~~~~~-- 390 (466) T protein:vir:80 314 GKSAEEFFSELVLKLSKARANYSNGMKFWAMSSNTHAVLMSKAITFNSAGALVASLNNT-MPIVGGDIVILDFIPDND-- 390 (466) T ss_pred ccchhhHHHHHHHHHHhhhccccCCceeEEecchhHHHhhcccccccCCccccccCCCc-ccccccceeecCccCccc-- Confidence 00000 0001111223345578888888775433 22211110 11222222111 111222221110 11111 Q ss_pred cccceecCCC--ceecccCCcCCCCCCCceeEeeeeccCCCCCCCcccccceEEEEEEEcCcCccccccceeeeeecCCc Q lcl|NC_018856. 271 LHGSTIMEND--NILLEGRNPEPNAPQAPASVVASIVDDKKGGFRDEDIKTHSYKVVVHSDDAESLPSEAVTAAVAKKDN 348 (479) Q Consensus 271 l~~s~~m~~~--~~L~e~~~~~~~AP~~pa~v~at~~t~~~G~f~~~d~gty~YkVtavn~~GES~pS~~vt~Tv~~~g~ 348 (479) .+..+. -++.. . ....... +....|. .+..-|+++..-+..=-.|...+..++..... T Consensus 391 ----~~~g~~~~y~i~~-------r-~~~~i~~-----~~~~~f~---~d~~~~r~~~r~dg~~~~~~afv~~~~~~~~~ 450 (466) T protein:vir:80 391 ----IIGGYGSLYLLAE-------R-ADIKLAQ-----SEHVRFI---EDQTVFKGTARYDGKPVFGEGFVAVNIANANP 450 (466) T ss_pred ----eeeeccccEEEEe-------e-cceEEEe-----chhhhhh---cCcEEEEEEEEEccEEeccCceEEEEecCCCc Confidence 111100 00110 0 0000000 0011111 01122444332222211334445566666666 Q ss_pred eEEEEEEecCCCCcccce Q lcl|NC_018856. 349 TVKLEVKLASLYQAQPQF 366 (479) Q Consensus 349 sv~ltIT~~~~~~a~~~~ 366 (479) .|.++.+ +..+..|+- T Consensus 451 ~~~~~~~--~~~~~~~~~ 466 (466) T protein:vir:80 451 TTSITFA--PDEANVPEV 466 (466) T ss_pred ccceeee--cCcCcCCCC Confidence 6666665 445555543 No 116 >protein:vir:6212 Length: 434 # NCBI annotation: prohead protease # Family: family:all:21 # MgeID: mge:128 # MgeName: phBC6A52 # Cross-refs: genbank:acc:NP_852592;genbank:gi:31415852;genbank:GeneID:1489210 Probab=81.19 E-value=0.088 Score=26.37 Aligned_cols=316 Identities=13% Similarity=0.068 Sum_probs=131.3 Q ss_pred CCccch---------hhhhhhh-------cCCccchHHHHHHHHHhhh---cCCC-------cChhhccCccccchhhhh Q lcl|NC_018856. 1 MTELKK---------EAEAKNK-------KLPVEAEAELAELVSKSFT---TGYG-------ITPDTQLDGAAVRRELLE 54 (479) Q Consensus 1 ~~~~~~---------~~~~~~~-------~~~~~~~~~~~e~~~Ks~t---ag~~-------~~p~~~~~gaalr~esld 54 (479) ....+. |....+- ....... .....+.++|. .|.. .+. +-.+|+.|-.+.+. T Consensus 82 ~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~-~~~~e~r~a~~~~l~~~~~~~e~~a~~~-~t~~GG~lvP~~~~ 159 (434) T protein:vir:62 82 PTAKENPNEKTELSEEQRSAISASIAAALSTKGHRT-NKETEIRSVFANYIVGNIDEKEARALGL-VTGNGSVTIPDFLS 159 (434) T ss_pred hhhhcchhhhHHHHHHHHHHHHHHHHhhhhhccccc-hHHHHHHHHHHHHhccccchhhhhhhcc-cccccceecchhhH Confidence 000000 0000000 0000000 01111223321 1110 000 11247788888888 Q ss_pred hhhhhheeccccccchhhccccchhHHHHhhhhhhccCcccccccccccccccccCcceEEEEEEEEeeeehhhhhhhHh Q lcl|NC_018856. 55 DQVKMLAFSSNDFTIYPLINKQQVNSTVAKYAVFNQHGRTGHSRFVREVGVASINDPNIRQKTVQMKFLSDTKQQSLAAG 134 (479) Q Consensus 55 ~~~~~l~~~~~~f~f~~~i~k~~~~stv~eY~~~~~~G~~g~~~fv~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~ 134 (479) ..|..+... ...+.+...+.+..+. ..|.++...+.........|++..+..|+.+.+.....+=++.-..+|.-+ T Consensus 160 ~~Ii~~l~~--~~~i~~~~~~~~~~~~-~~~p~~~~~~~a~~~~~~~e~~~~~~~~~~f~~v~~~~~k~~~~~~iS~el- 235 (434) T protein:vir:62 160 KEIITYAQE--ENFLRRLGTGVKTKEN-IKYPVLVKKAEAQGHKNERTNNEMPETDIEFDEIELSPTEFDALATVTKKL- 235 (434) T ss_pred HHHHHhhhh--hhhhhhhcceeccCCc-eEEEEEecCCcccceecccccccccccccceeeEEeeheeeEeehhhHHHH- Confidence 877544332 2233333333344333 346665544444333455778888899999999999999888877777654 Q ss_pred hhcchhhHHHHHHHHHHHHHHHHHHHHHhhcccccCCCCCCcccchhhhHHHhhccCCCEEEccCCCCCHHHHhhhhhhh Q lcl|NC_018856. 135 LVNNIADPMTILTEDAIAVIAKSIEWAIFYGDAALSSEADGQAGIEFDGLHKLIDQDTNVIDLKGARLDEATLNKAAVIV 214 (479) Q Consensus 135 lvn~~~Dp~~~~~~~ai~~~~~~iE~a~f~Gd~~l~~~~~~~~gleFDGl~~~I~~~~NviDarG~~l~~~~l~~aa~~i 214 (479) +.++.-|.+....+.-...+++.++.+++.|+-.=.+ ..|+.+. ........+...-.++++.-.. + T Consensus 236 l~ds~~~l~~~i~~~la~~~~~~~d~~~l~G~G~~~~---------~~g~~~~---~~~~~~~~~~~~~d~l~~l~~~-l 302 (434) T protein:vir:62 236 LARTGLPIEQIVMDELKKAYVRKETQYMVNGDEANNI---------NDGALAK---KAVEFKTDEKNLYDALVKMKNT-P 302 (434) T ss_pred HhcchHHHHHHHHHHHHHHHHHHHHHHHhccCCCCcc---------ccceeec---ccccccccccchhhHHHHHHhh-c Confidence 2344557788888899999999999999999865222 2343321 1122333333333334432222 2 Q ss_pred hhccCceEEEecChHHhhhHHHHhhCcceeee-ccCCCcceeeeehhhhcCCCcceecccceecCCCceecccCCcCCCC Q lcl|NC_018856. 215 GKGYGRATDAFMPIGVQADFTNNLLDRQRVIQ-PSTAGGFSTGFSINQFLSTRGAINLHGSTIMENDNILLEGRNPEPNA 293 (479) Q Consensus 215 ~~~fG~~td~~mp~~vka~f~~~~~~~qrv~~-~~n~g~~~~G~~I~~~~s~~G~I~l~~s~~m~~~~~L~e~~~~~~~A 293 (479) ...|..-...+|+..+.+.+...-...-|.+. |.+.... | .+.+++..+-.. ...++.+++ T Consensus 303 ~~~~~~~a~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~~~--g---------------~~~tl~G~pV~~-~~~~~~~~~ 364 (434) T protein:vir:62 303 VKEVRKKARWVLNTAALTKIETMKTDDGFPLLRPFNQAEG--G---------------IGYTLLGFPVEE-EDAIDIPDS 364 (434) T ss_pred chhhhcCCEEEEcHHHHHHHHHhhccCCCEeeccCCCccC--C---------------CCceecceeeEE-ecCccCccC Confidence 33443333468888887776543333334443 3221100 0 011122222111 111111211 Q ss_pred CCCceeEeee----eccCCCCC--CC---cccccceEEEEEEEc-CcCccccccceeeeeecCCceEEEEEEecCCC Q lcl|NC_018856. 294 PQAPASVVAS----IVDDKKGG--FR---DEDIKTHSYKVVVHS-DDAESLPSEAVTAAVAKKDNTVKLEVKLASLY 360 (479) Q Consensus 294 P~~pa~v~at----~~t~~~G~--f~---~~d~gty~YkVtavn-~~GES~pS~~vt~Tv~~~g~sv~ltIT~~~~~ 360 (479) ...+....+- ..-.-.|. +. .....+-.-.+.+.. .+|.-.-++...+ -++++++.+..+ T Consensus 365 ~~~~~i~~Gdfs~~~i~~~~g~~~i~~~~~~~~~~~~v~~~~~~r~Dgk~i~~~~~~~-------~~~~~~~~~~~~ 434 (434) T protein:vir:62 365 PDTPVFYFGDFSKFYIQDVIGSLEVQKLVELFSRTNRVGFRIWNLLDAQLIHSPFEVP-------VYKYVLKAPTGA 434 (434) T ss_pred CCceEEEEeeccceEEEEeeceeEEEeehhhhcccCceEEEEEeeecceeecCcccce-------EEEEEeccCCCC Confidence 1111111000 00000000 00 000001111112222 1333221111111 223444433222 No 117 >protein:vir:9820 Length: 272 # NCBI annotation: putative major capsid/head protein # Family: family:all:522 # MgeID: mge:176 # MgeName: 315.4 # Cross-refs: genbank:acc:NP_795582;genbank:gi:28876339;genbank:GeneID:1257858 Probab=79.98 E-value=0.099 Score=26.08 Aligned_cols=256 Identities=14% Similarity=0.122 Sum_probs=126.2 Q ss_pred hhcCCCcChhhccCccccchhhhhhhhhhheeccccccch------hhccccchhHHHHhhhhhhccCcccccccccccc Q lcl|NC_018856. 31 FTTGYGITPDTQLDGAAVRRELLEDQVKMLAFSSNDFTIY------PLINKQQVNSTVAKYAVFNQHGRTGHSRFVREVG 104 (479) Q Consensus 31 ~tag~~~~p~~~~~gaalr~esld~~~~~l~~~~~~f~f~------~~i~k~~~~stv~eY~~~~~~G~~g~~~fv~E~g 104 (479) |. ...| +.+..+.+|.+.+.+.... .+-..|- ..+..+ .-.||+ +..++..|...++.|++ T Consensus 1 MA-~~~T-----~~~~~~iPev~s~~v~~~~--~~~~~~~~~~~~~~~~~g~-~G~tv~----iP~~~~~~~a~~v~eg~ 67 (272) T protein:vir:98 1 MA-VGTT-----KMAQMLDPEVLADMIDAEV--GKAIRFAPLAEVDTTLEGQ-PGTTLT----VPKWDYIGDAEDVAEGE 67 (272) T ss_pred CC-Cccc-----cchheechHHHHHHHHHHH--HHHhhhhccccccccccCC-CCCEEE----EEEecCCCCcccccCCC Confidence 21 1111 1234556655555442111 0001100 011111 011222 23345566777899999 Q ss_pred cccccCcceEEEEEEEEeeeehhhhhhhHhhhcchhhHHHHHHHHHHHHHHHHHHHHHhhcccccCCCCCCcccchhhhH Q lcl|NC_018856. 105 VASINDPNIRQKTVQMKFLSDTKQQSLAAGLVNNIADPMTILTEDAIAVIAKSIEWAIFYGDAALSSEADGQAGIEFDGL 184 (479) Q Consensus 105 ~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lvn~~~Dp~~~~~~~ai~~~~~~iE~a~f~Gd~~l~~~~~~~~gleFDGl 184 (479) .....+.+......+++-++....+|..+.. ++..|++....+.....+++.++-.+|= .+. T Consensus 68 ~i~~~~~~~~~~~~~~~~~~~~~~itd~~~~-~s~~d~~~~~~~~~~~~~a~~~d~~i~~---~~~-------------- 129 (272) T protein:vir:98 68 AIPMTQLGFKKTTMTIKKAGKGVEITDEAIL-SGYGDPVGQAAKQIVEAIDHKVDADVLD---ALS-------------- 129 (272) T ss_pred cccccccccceEEEEeeeeeeeeeecHHHHh-hccccHHHHHHHHHHHHHHHHHHHHHHH---Hhc-------------- Confidence 9999999999999999999999999988754 4567999999999999999999977661 111 Q ss_pred HHhhccCCCEEEccCCCCCHHHHhhhhhhhhhccCceEEEecChHHhhhHHHHhhCcceeeeccCCCc-----c----ee Q lcl|NC_018856. 185 HKLIDQDTNVIDLKGARLDEATLNKAAVIVGKGYGRATDAFMPIGVQADFTNNLLDRQRVIQPSTAGG-----F----ST 255 (479) Q Consensus 185 ~~~I~~~~NviDarG~~l~~~~l~~aa~~i~~~fG~~td~~mp~~vka~f~~~~~~~qrv~~~~n~g~-----~----~~ 255 (479) ...+.++ ...+.+.|..|...........+-++||+.+.+.+-..-+. +++..+..+. . -. T Consensus 130 -----~a~~~~~---~~~t~d~i~da~~~l~~~~~~~~~~vv~p~~~~~L~k~~~~--~~~~~~~~~~~~~~~g~ig~i~ 199 (272) T protein:vir:98 130 -----KSTQTVE---ATATVDGVSKALDIFNDEDDAETVIVMNPADASTLRLDAAK--EWLGATEVGANRVVSGVYGEVL 199 (272) T ss_pred -----ccccccc---cccCHHHHHHHHHHHhccCCCccEEEEcHHHHHHHHHhccc--cccccccccccccccccchhhc Confidence 1112221 12345666777666777777788899999998887433211 1111111110 0 11 Q ss_pred eeehhhhcC-CCcceecccceecCCCceecccCCcCCCCCCCceeEeeeeccCCCCCCCcccccceEEEEEEEcCcCccc Q lcl|NC_018856. 256 GFSINQFLS-TRGAINLHGSTIMENDNILLEGRNPEPNAPQAPASVVASIVDDKKGGFRDEDIKTHSYKVVVHSDDAESL 334 (479) Q Consensus 256 G~~I~~~~s-~~G~I~l~~s~~m~~~~~L~e~~~~~~~AP~~pa~v~at~~t~~~G~f~~~d~gty~YkVtavn~~GES~ 334 (479) |+.|-..+. +.|. .++ +..+.+.--.... +... +-... .++...-.+.+.|-+..++ T Consensus 200 G~~Vi~s~~~p~~t------~~~-----~~~~a~~~~~~~~-~~ve--~~r~~--~~~~~~i~~~~~~~~~v~~------ 257 (272) T protein:vir:98 200 GVQIVRSRKCPKGT------AYM-----VRKGALRIMLKRN-TMVE--TDRDI--TKAINQIVANKHYGVYLYK------ 257 (272) T ss_pred CeeEEEcCCCCcce------EEE-----EcCCeEEEEecCC-ceee--ecccc--ccceeEEEEEEEEEEEEEc------ Confidence 221111111 1111 110 1111111100001 1111 11111 1111111223445444443 Q ss_pred cccceeeeeecCCce Q lcl|NC_018856. 335 PSEAVTAAVAKKDNT 349 (479) Q Consensus 335 pS~~vt~Tv~~~g~s 349 (479) |+..+..|+...++. T Consensus 258 ~~~vv~~t~~~a~~~ 272 (272) T protein:vir:98 258 AEKAVKITLKDAAKK 272 (272) T ss_pred CCceEEEEecccccC Confidence 556666776665655 No 118 >protein:vir:3033 Length: 272 # NCBI annotation: major capsid protein # Family: family:all:522 # MgeID: mge:61 # MgeName: PhiNIH1.1 # Cross-refs: genbank:acc:NP_438146;genbank:gi:16271809;genbank:GeneID:929235 Probab=79.98 E-value=0.099 Score=26.08 Aligned_cols=256 Identities=14% Similarity=0.122 Sum_probs=126.2 Q ss_pred hhcCCCcChhhccCccccchhhhhhhhhhheeccccccch------hhccccchhHHHHhhhhhhccCcccccccccccc Q lcl|NC_018856. 31 FTTGYGITPDTQLDGAAVRRELLEDQVKMLAFSSNDFTIY------PLINKQQVNSTVAKYAVFNQHGRTGHSRFVREVG 104 (479) Q Consensus 31 ~tag~~~~p~~~~~gaalr~esld~~~~~l~~~~~~f~f~------~~i~k~~~~stv~eY~~~~~~G~~g~~~fv~E~g 104 (479) |. ...| +.+..+.+|.+.+.+.... .+-..|- ..+..+ .-.||+ +..++..|...++.|++ T Consensus 1 MA-~~~T-----~~~~~~iPev~s~~v~~~~--~~~~~~~~~~~~~~~~~g~-~G~tv~----iP~~~~~~~a~~v~eg~ 67 (272) T protein:vir:30 1 MA-VGTT-----KMAQMLDPEVLADMIDAEV--GKAIRFAPLAEVDTTLEGQ-PGTTLT----VPKWDYIGDAEDVAEGE 67 (272) T ss_pred CC-Cccc-----cchheechHHHHHHHHHHH--HHHhhhhccccccccccCC-CCCEEE----EEEecCCCCcccccCCC Confidence 21 1111 1234556655555442111 0001100 011111 011222 23345566777899999 Q ss_pred cccccCcceEEEEEEEEeeeehhhhhhhHhhhcchhhHHHHHHHHHHHHHHHHHHHHHhhcccccCCCCCCcccchhhhH Q lcl|NC_018856. 105 VASINDPNIRQKTVQMKFLSDTKQQSLAAGLVNNIADPMTILTEDAIAVIAKSIEWAIFYGDAALSSEADGQAGIEFDGL 184 (479) Q Consensus 105 ~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lvn~~~Dp~~~~~~~ai~~~~~~iE~a~f~Gd~~l~~~~~~~~gleFDGl 184 (479) .....+.+......+++-++....+|..+.. ++..|++....+.....+++.++-.+|= .+. T Consensus 68 ~i~~~~~~~~~~~~~~~~~~~~~~itd~~~~-~s~~d~~~~~~~~~~~~~a~~~d~~i~~---~~~-------------- 129 (272) T protein:vir:30 68 AIPMTQLGFKKTTMTIKKAGKGVEITDEAIL-SGYGDPVGQAAKQIVEAIDHKVDADVLD---ALS-------------- 129 (272) T ss_pred cccccccccceEEEEeeeeeeeeeecHHHHh-hccccHHHHHHHHHHHHHHHHHHHHHHH---Hhc-------------- Confidence 9999999999999999999999999988754 4567999999999999999999977661 111 Q ss_pred HHhhccCCCEEEccCCCCCHHHHhhhhhhhhhccCceEEEecChHHhhhHHHHhhCcceeeeccCCCc-----c----ee Q lcl|NC_018856. 185 HKLIDQDTNVIDLKGARLDEATLNKAAVIVGKGYGRATDAFMPIGVQADFTNNLLDRQRVIQPSTAGG-----F----ST 255 (479) Q Consensus 185 ~~~I~~~~NviDarG~~l~~~~l~~aa~~i~~~fG~~td~~mp~~vka~f~~~~~~~qrv~~~~n~g~-----~----~~ 255 (479) ...+.++ ...+.+.|..|...........+-++||+.+.+.+-..-+. +++..+..+. . -. T Consensus 130 -----~a~~~~~---~~~t~d~i~da~~~l~~~~~~~~~~vv~p~~~~~L~k~~~~--~~~~~~~~~~~~~~~g~ig~i~ 199 (272) T protein:vir:30 130 -----KSTQTVE---ATATVDGVSKALDIFNDEDDAETVIVMNPADASTLRLDAAK--EWLGATEVGANRVVSGVYGEVL 199 (272) T ss_pred -----ccccccc---cccCHHHHHHHHHHHhccCCCccEEEEcHHHHHHHHHhccc--cccccccccccccccccchhhc Confidence 1112221 12345666777666777777788899999998887433211 1111111110 0 11 Q ss_pred eeehhhhcC-CCcceecccceecCCCceecccCCcCCCCCCCceeEeeeeccCCCCCCCcccccceEEEEEEEcCcCccc Q lcl|NC_018856. 256 GFSINQFLS-TRGAINLHGSTIMENDNILLEGRNPEPNAPQAPASVVASIVDDKKGGFRDEDIKTHSYKVVVHSDDAESL 334 (479) Q Consensus 256 G~~I~~~~s-~~G~I~l~~s~~m~~~~~L~e~~~~~~~AP~~pa~v~at~~t~~~G~f~~~d~gty~YkVtavn~~GES~ 334 (479) |+.|-..+. +.|. .++ +..+.+.--.... +... +-... .++...-.+.+.|-+..++ T Consensus 200 G~~Vi~s~~~p~~t------~~~-----~~~~a~~~~~~~~-~~ve--~~r~~--~~~~~~i~~~~~~~~~v~~------ 257 (272) T protein:vir:30 200 GVQIVRSRKCPKGT------AYM-----VRKGALRIMLKRN-TMVE--TDRDI--TKAINQIVANKHYGVYLYK------ 257 (272) T ss_pred CeeEEEcCCCCcce------EEE-----EcCCeEEEEecCC-ceee--ecccc--ccceeEEEEEEEEEEEEEc------ Confidence 221111111 1111 110 1111111100001 1111 11111 1111111223445444443 Q ss_pred cccceeeeeecCCce Q lcl|NC_018856. 335 PSEAVTAAVAKKDNT 349 (479) Q Consensus 335 pS~~vt~Tv~~~g~s 349 (479) |+..+..|+...++. T Consensus 258 ~~~vv~~t~~~a~~~ 272 (272) T protein:vir:30 258 AEKAVKITLKDAAKK 272 (272) T ss_pred CCceEEEEecccccC Confidence 556666776665655 No 119 >protein:vir:9509 Length: 381 # NCBI annotation: hypothetical protein # Family: family:all:635 # MgeID: mge:170 # MgeName: phiN315 # Cross-refs: genbank:acc:NP_835556;genbank:gi:30043951;genbank:GeneID:1260537 Probab=79.46 E-value=0.074 Score=26.78 Aligned_cols=308 Identities=12% Similarity=0.044 Sum_probs=119.5 Q ss_pred CCccc-hhhhhhh----hcC-Cc-c---chHHHHHHH-------------HHhhhcCCCcChhhccCccccchhhhhhhh Q lcl|NC_018856. 1 MTELK-KEAEAKN----KKL-PV-E---AEAELAELV-------------SKSFTTGYGITPDTQLDGAAVRRELLEDQV 57 (479) Q Consensus 1 ~~~~~-~~~~~~~----~~~-~~-~---~~~~~~e~~-------------~Ks~tag~~~~p~~~~~gaalr~esld~~~ 57 (479) +.+.. .+++.+. ... .. . ..++..+++ .|.|++ ..-.+-++|+.|-++.+...| T Consensus 20 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~lt~~e~~~~~~---~~~~~~~~gg~lvP~~~~~~I 96 (381) T protein:vir:95 20 VNNGEPQERQNELYGDMINQLFEETKLQAKAEAERVSSLPKSAQSLSANQRSFFMD---INKNVNYKEEKLLPEETIDRI 96 (381) T ss_pred HhhhhhhHHHHHHHHHHHHhhhhhHHHHHHHHHHHHHHhccCcccccHHHHHHHHH---HhcccCCCCceecCHHHHHHH Confidence 11000 0000000 000 00 0 000001111 111111 011223457778777777777 Q ss_pred h-hheeccccccchhhccccchhHHHHhhhhhhccCccccccccccccc-ccccCcceEEEEEEEEeeeehhhhhhhHhh Q lcl|NC_018856. 58 K-MLAFSSNDFTIYPLINKQQVNSTVAKYAVFNQHGRTGHSRFVREVGV-ASINDPNIRQKTVQMKFLSDTKQQSLAAGL 135 (479) Q Consensus 58 ~-~l~~~~~~f~f~~~i~k~~~~stv~eY~~~~~~G~~g~~~fv~E~g~-~~~~d~~~~r~~~~~k~l~~~~~vs~~~~l 135 (479) . .|... ..+++.+...++.+.+ ++....+.+...++.|.+. ++..++.+.+.....+=|+.--.+|..+ | T Consensus 97 ~~~l~~~---s~i~~~~~v~~~~~~~----~i~~~~~~~~a~w~~e~~~~~~~~~~~f~~i~l~~~kl~~~~~is~el-L 168 (381) T protein:vir:95 97 FEDLTTN---HPLLADLGIKNAGLRL----KFLKSETSGVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDL-N 168 (381) T ss_pred HHHHHhh---ccceeheeeEecCcce----EEEEecCCcceeeecccccccccccccceeeeecceeEEeechhhHHH-h Confidence 4 23222 2444545444443322 3344555566778899775 4577999999999999998877777766 6 Q ss_pred hcchhhHHHHHHHHHHHHHHHHHHHHHhhcccccCCCCCCcccchhhhHHHhhccCCCEEE-------ccCCC--CCH-- Q lcl|NC_018856. 136 VNNIADPMTILTEDAIAVIAKSIEWAIFYGDAALSSEADGQAGIEFDGLHKLIDQDTNVID-------LKGAR--LDE-- 204 (479) Q Consensus 136 vn~~~Dp~~~~~~~ai~~~~~~iE~a~f~Gd~~l~~~~~~~~gleFDGl~~~I~~~~NviD-------arG~~--l~~-- 204 (479) .++..|.+....+.--.++++.++.+++.||-.-.| -||++.+....++-+ +-|.. .+. T Consensus 169 ~Ds~~~ie~~i~~~la~~~a~~~~~a~i~G~G~~qP----------~Gil~~~~~~~~~~~g~~~~~~~~~t~t~~~~~~ 238 (381) T protein:vir:95 169 DFGPAWIERFVRVQIEEAFAVALETAFLKGTGKDQP----------IGLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRA 238 (381) T ss_pred hcCHHHHHHHHHHHHHHHHHHHhhheeEeccCCCCc----------eeeeeccCcccccccccccccccccccccccchh Confidence 677889999999999999999999999999975322 467666543211111 00100 111 Q ss_pred --HHHhhhhhhhhhccC------ceEE-EecChHHhhhHHHHhhCcceeeeccCCCcce----eeee-hhhhcCCCccee Q lcl|NC_018856. 205 --ATLNKAAVIVGKGYG------RATD-AFMPIGVQADFTNNLLDRQRVIQPSTAGGFS----TGFS-INQFLSTRGAIN 270 (479) Q Consensus 205 --~~l~~aa~~i~~~fG------~~td-~~mp~~vka~f~~~~~~~qrv~~~~n~g~~~----~G~~-I~~~~s~~G~I~ 270 (479) +.|..........++ ..+- +.|.+.+...+.. ....+.. .|... .|.. +.+-..+.|+|. T Consensus 239 ~~~~l~~~~~~~~~~~~~~~~~~~~~a~~~mn~~t~~~l~~-----~~~~~~~-~G~~v~~l~~g~~vv~s~~~p~~~ii 312 (381) T protein:vir:95 239 TVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQA-----QYTHLNA-NGVYVTALPFNLNVIESTVQEAGKVL 312 (381) T ss_pred hHHHHHHHHHhhccccccccccccCceEEEEccccHHhhcc-----ccccCCC-CCceeecCCCCceEEecCCCCcCcEE Confidence 111111111111111 1111 2355554443311 1111111 12111 1111 112223344443 Q ss_pred cccc----eecCCCceecccCCcCCCCCCCceeEeeeeccCCCCCCCcccccceEEEEEEEcCcCccccccceeeeeecC Q lcl|NC_018856. 271 LHGS----TIMENDNILLEGRNPEPNAPQAPASVVASIVDDKKGGFRDEDIKTHSYKVVVHSDDAESLPSEAVTAAVAKK 346 (479) Q Consensus 271 l~~s----~~m~~~~~L~e~~~~~~~AP~~pa~v~at~~t~~~G~f~~~d~gty~YkVtavn~~GES~pS~~vt~Tv~~~ 346 (479) | || .+.++..+.++.. ....+-.--+.-.+..-. +|+-. +. --++|.-+.-.| ..|.. ...+ T Consensus 313 f-gDfs~Y~i~~r~~~~i~~~-~~~~~~~d~~~f~a~~r~--dg~~~--~~--~A~~v~~l~~~~-~~~~~-----~~~~ 378 (381) T protein:vir:95 313 T-YVKGLYDGYLAGGINVQKF-KETLALDDMDLYTAKQFA--YGKAK--DN--KVAAVWKLDLKG-HKPAL-----EGTE 378 (381) T ss_pred E-EecccEEEEEecccEEEee-chhHhhcCCeEEEEEEEE--cCEEe--cC--ceEEEEEEEecC-CCcCc-----cccc Confidence 2 22 1111111111000 000000000000000000 11100 00 112221122111 11111 1111 Q ss_pred CceEEEEE Q lcl|NC_018856. 347 DNTVKLEV 354 (479) Q Consensus 347 g~sv~ltI 354 (479) . |+ T Consensus 379 ~-----~~ 381 (381) T protein:vir:95 379 E-----TL 381 (381) T ss_pred c-----cC Confidence 1 11 No 120 >protein:vir:101291 Length: 381 # NCBI annotation: hypothetical protein # Family: family:all:635 # MgeID: mge:1591 # MgeName: phiNM3 # Cross-refs: genbank:acc:YP_908831;genbank:gi:118725095;genbank:GeneID:4555862 Probab=79.46 E-value=0.074 Score=26.78 Aligned_cols=308 Identities=12% Similarity=0.044 Sum_probs=119.5 Q ss_pred CCccc-hhhhhhh----hcC-Cc-c---chHHHHHHH-------------HHhhhcCCCcChhhccCccccchhhhhhhh Q lcl|NC_018856. 1 MTELK-KEAEAKN----KKL-PV-E---AEAELAELV-------------SKSFTTGYGITPDTQLDGAAVRRELLEDQV 57 (479) Q Consensus 1 ~~~~~-~~~~~~~----~~~-~~-~---~~~~~~e~~-------------~Ks~tag~~~~p~~~~~gaalr~esld~~~ 57 (479) +.+.. .+++.+. ... .. . ..++..+++ .|.|++ ..-.+-++|+.|-++.+...| T Consensus 20 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~lt~~e~~~~~~---~~~~~~~~gg~lvP~~~~~~I 96 (381) T protein:vir:10 20 VNNGEPQERQNELYGDMINQLFEETKLQAKAEAERVSSLPKSAQSLSANQRSFFMD---INKNVNYKEEKLLPEETIDRI 96 (381) T ss_pred HhhhhhhHHHHHHHHHHHHhhhhhHHHHHHHHHHHHHHhccCcccccHHHHHHHHH---HhcccCCCCceecCHHHHHHH Confidence 11000 0000000 000 00 0 000001111 111111 011223457778777777777 Q ss_pred h-hheeccccccchhhccccchhHHHHhhhhhhccCccccccccccccc-ccccCcceEEEEEEEEeeeehhhhhhhHhh Q lcl|NC_018856. 58 K-MLAFSSNDFTIYPLINKQQVNSTVAKYAVFNQHGRTGHSRFVREVGV-ASINDPNIRQKTVQMKFLSDTKQQSLAAGL 135 (479) Q Consensus 58 ~-~l~~~~~~f~f~~~i~k~~~~stv~eY~~~~~~G~~g~~~fv~E~g~-~~~~d~~~~r~~~~~k~l~~~~~vs~~~~l 135 (479) . .|... ..+++.+...++.+.+ ++....+.+...++.|.+. ++..++.+.+.....+=|+.--.+|..+ | T Consensus 97 ~~~l~~~---s~i~~~~~v~~~~~~~----~i~~~~~~~~a~w~~e~~~~~~~~~~~f~~i~l~~~kl~~~~~is~el-L 168 (381) T protein:vir:10 97 FEDLTTN---HPLLADLGIKNAGLRL----KFLKSETSGVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDL-N 168 (381) T ss_pred HHHHHhh---ccceeheeeEecCcce----EEEEecCCcceeeecccccccccccccceeeeecceeEEeechhhHHH-h Confidence 4 23222 2444545444443322 3344555566778899775 4577999999999999998877777766 6 Q ss_pred hcchhhHHHHHHHHHHHHHHHHHHHHHhhcccccCCCCCCcccchhhhHHHhhccCCCEEE-------ccCCC--CCH-- Q lcl|NC_018856. 136 VNNIADPMTILTEDAIAVIAKSIEWAIFYGDAALSSEADGQAGIEFDGLHKLIDQDTNVID-------LKGAR--LDE-- 204 (479) Q Consensus 136 vn~~~Dp~~~~~~~ai~~~~~~iE~a~f~Gd~~l~~~~~~~~gleFDGl~~~I~~~~NviD-------arG~~--l~~-- 204 (479) .++..|.+....+.--.++++.++.+++.||-.-.| -||++.+....++-+ +-|.. .+. T Consensus 169 ~Ds~~~ie~~i~~~la~~~a~~~~~a~i~G~G~~qP----------~Gil~~~~~~~~~~~g~~~~~~~~~t~t~~~~~~ 238 (381) T protein:vir:10 169 DFGPAWIERFVRVQIEEAFAVALETAFLKGTGKDQP----------IGLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRA 238 (381) T ss_pred hcCHHHHHHHHHHHHHHHHHHHhhheeEeccCCCCc----------eeeeeccCcccccccccccccccccccccccchh Confidence 677889999999999999999999999999975322 467666543211111 00100 111 Q ss_pred --HHHhhhhhhhhhccC------ceEE-EecChHHhhhHHHHhhCcceeeeccCCCcce----eeee-hhhhcCCCccee Q lcl|NC_018856. 205 --ATLNKAAVIVGKGYG------RATD-AFMPIGVQADFTNNLLDRQRVIQPSTAGGFS----TGFS-INQFLSTRGAIN 270 (479) Q Consensus 205 --~~l~~aa~~i~~~fG------~~td-~~mp~~vka~f~~~~~~~qrv~~~~n~g~~~----~G~~-I~~~~s~~G~I~ 270 (479) +.|..........++ ..+- +.|.+.+...+.. ....+.. .|... .|.. +.+-..+.|+|. T Consensus 239 ~~~~l~~~~~~~~~~~~~~~~~~~~~a~~~mn~~t~~~l~~-----~~~~~~~-~G~~v~~l~~g~~vv~s~~~p~~~ii 312 (381) T protein:vir:10 239 TVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQA-----QYTHLNA-NGVYVTALPFNLNVIESTVQEAGKVL 312 (381) T ss_pred hHHHHHHHHHhhccccccccccccCceEEEEccccHHhhcc-----ccccCCC-CCceeecCCCCceEEecCCCCcCcEE Confidence 111111111111111 1111 2355554443311 1111111 12111 1111 112223344443 Q ss_pred cccc----eecCCCceecccCCcCCCCCCCceeEeeeeccCCCCCCCcccccceEEEEEEEcCcCccccccceeeeeecC Q lcl|NC_018856. 271 LHGS----TIMENDNILLEGRNPEPNAPQAPASVVASIVDDKKGGFRDEDIKTHSYKVVVHSDDAESLPSEAVTAAVAKK 346 (479) Q Consensus 271 l~~s----~~m~~~~~L~e~~~~~~~AP~~pa~v~at~~t~~~G~f~~~d~gty~YkVtavn~~GES~pS~~vt~Tv~~~ 346 (479) | || .+.++..+.++.. ....+-.--+.-.+..-. +|+-. +. --++|.-+.-.| ..|.. ...+ T Consensus 313 f-gDfs~Y~i~~r~~~~i~~~-~~~~~~~d~~~f~a~~r~--dg~~~--~~--~A~~v~~l~~~~-~~~~~-----~~~~ 378 (381) T protein:vir:10 313 T-YVKGLYDGYLAGGINVQKF-KETLALDDMDLYTAKQFA--YGKAK--DN--KVAAVWKLDLKG-HKPAL-----EGTE 378 (381) T ss_pred E-EecccEEEEEecccEEEee-chhHhhcCCeEEEEEEEE--cCEEe--cC--ceEEEEEEEecC-CCcCc-----cccc Confidence 2 22 1111111111000 000000000000000000 11100 00 112221122111 11111 1111 Q ss_pred CceEEEEE Q lcl|NC_018856. 347 DNTVKLEV 354 (479) Q Consensus 347 g~sv~ltI 354 (479) . |+ T Consensus 379 ~-----~~ 381 (381) T protein:vir:10 379 E-----TL 381 (381) T ss_pred c-----cC Confidence 1 11 No 121 >protein:vir:101607 Length: 379 # NCBI annotation: major capsid protein precursor # Family: family:all:585 # MgeID: mge:1646 # MgeName: 11b # Cross-refs: genbank:acc:YP_112497;genbank:gi:53793597;uniprot:Q5ZGF6;genbank:GeneID:3101715 Probab=78.23 E-value=0.12 Score=25.70 Aligned_cols=293 Identities=12% Similarity=0.008 Sum_probs=117.7 Q ss_pred CCccch------------------hhhhhhhcCCccc--hHHHHHHHH------------Hhh---hcCCCcChhhccCc Q lcl|NC_018856. 1 MTELKK------------------EAEAKNKKLPVEA--EAELAELVS------------KSF---TTGYGITPDTQLDG 45 (479) Q Consensus 1 ~~~~~~------------------~~~~~~~~~~~~~--~~~~~e~~~------------Ks~---tag~~~~p~~~~~g 45 (479) |.+..+ +.|.+..+..... .+..-+.+. ++. .+|..+. .+++ T Consensus 39 ~~~~~~~~~~e~~~~~~~l~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~ 115 (379) T protein:vir:10 39 MTSEKDLAVNELKSDMAALQAHADKLDVKLKEKAKSEDKSDSLVKSITENFNDIKEVRNGKSIQVKAVGDMTL---PVNL 115 (379) T ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccchhHHHHHHHHHHhHHHHHhhhhhhhhhhccccc---CCCC Confidence 111000 0011111100000 000000000 111 1122222 2233 Q ss_pred cccchhhhhhhhhhheeccccccchhhccccchhHHHHhhhhhhccCcccccccccccccccccCcceEEEEEEEEeeee Q lcl|NC_018856. 46 AAVRRELLEDQVKMLAFSSNDFTIYPLINKQQVNSTVAKYAVFNQHGRTGHSRFVREVGVASINDPNIRQKTVQMKFLSD 125 (479) Q Consensus 46 aalr~esld~~~~~l~~~~~~f~f~~~i~k~~~~stv~eY~~~~~~G~~g~~~fv~E~g~~~~~d~~~~r~~~~~k~l~~ 125 (479) +.+-.+.+...|..+. .+...+.+-+...++.+.--+|.+....+. +...++.|++..+..++.+.+.....+=++. T Consensus 116 ~~~ip~~~~~~ii~~~--~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~v~Eg~~~~~~~~~f~~i~~~~~k~~~ 192 (379) T protein:vir:10 116 TGAQPKDYNFDVVLNP--SQMLNVSDIVGAVSISGGTYTFVRENGAGE-GAIGAQVEGATKGQKDYDISMIDVNTDFIAG 192 (379) T ss_pred ccccchhhhhHHHHhH--HhhhhHHhhceeeeccCCceEEEEeecCCC-cccccccCCccccccccceeeeEeeeeeEEe Confidence 3344444444442211 222233333444444443334444433322 2345789999999999999999999999998 Q ss_pred hhhhhhhHhhhcchhhHHHHHHHHHHHHHHHHHHHHHhhcccccCCCCCCcccchhhhHHHhhccCCCEEEccCCCCCHH Q lcl|NC_018856. 126 TKQQSLAAGLVNNIADPMTILTEDAIAVIAKSIEWAIFYGDAALSSEADGQAGIEFDGLHKLIDQDTNVIDLKGARLDEA 205 (479) Q Consensus 126 ~~~vs~~~~lvn~~~Dp~~~~~~~ai~~~~~~iE~a~f~Gd~~l~~~~~~~~gleFDGl~~~I~~~~NviDarG~~l~~~ 205 (479) -..+|.-+ +.++ .+.+....+.-...+++.++.+++-|+..-... + .... -...+.+ T Consensus 193 ~~~iS~el-l~D~-~~l~~~i~~~la~~~~~~~~~~~~~g~~~~~~~--~--------~~~~-----------~~~~~~d 249 (379) T protein:vir:10 193 FTRYSKKM-ANNL-PFLTSFIPNALRRDYAKAENAAFNAVLAANATA--S--------TEII-----------TNKNKVE 249 (379) T ss_pred eehhhHHH-HhhH-HHHHHHHHHHHHHHHHHHHHHHHhccccccccc--c--------cccc-----------cCcccHH Confidence 88888765 4443 346666666667788888888888776542110 0 0000 0122345 Q ss_pred HHhhhhhhhhhccCceEEEecChHHhhhHHHHhhCcceee-eccC---CCc--ceeeeehhhhc-CCCcceecccceecC Q lcl|NC_018856. 206 TLNKAAVIVGKGYGRATDAFMPIGVQADFTNNLLDRQRVI-QPST---AGG--FSTGFSINQFL-STRGAINLHGSTIME 278 (479) Q Consensus 206 ~l~~aa~~i~~~fG~~td~~mp~~vka~f~~~~~~~qrv~-~~~n---~g~--~~~G~~I~~~~-s~~G~I~l~~s~~m~ 278 (479) .|.++...+...+-.++-+.|++.....+...-...-|.+ +++. .+. .-.|++|-... .+-|. .+.. T Consensus 250 ~i~~~~~~~~~~~~~~~~~vmn~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~~~l~G~pvv~s~~~~ag~------~~~g 323 (379) T protein:vir:10 250 MLINEIAKQENLDFPVTAIVLRPTDYYDILVTQKSVGAGYGLPGVVTQDNGVLRINGIPLFRATWLAANK------YYVG 323 (379) T ss_pred HHHHHHHhhhhccCCCCEEEEcHHHHHHHHHhhccCCceeccCCccCCCCCcceecceeeEecCCCCCCc------eEEe Confidence 5666555556667777778888877766655444333333 3221 111 12244332111 11222 1211 Q ss_pred CCce--ecccCCcCCCCCCCceeEeeeeccCCCCCCCcccccceEEE----EEEEcCcCccccccceeeeee Q lcl|NC_018856. 279 NDNI--LLEGRNPEPNAPQAPASVVASIVDDKKGGFRDEDIKTHSYK----VVVHSDDAESLPSEAVTAAVA 344 (479) Q Consensus 279 ~~~~--L~e~~~~~~~AP~~pa~v~at~~t~~~G~f~~~d~gty~Yk----Vtavn~~GES~pS~~vt~Tv~ 344 (479) +..- +.. .-. ....... .....|.. + ...++ +-..-.+-+. ..-++.|.. T Consensus 324 df~~~~~~~-------~~~-~~i~~~~---~~~~~f~~-~--~~~~r~~~R~~~~v~~p~a--~v~~~~~~~ 379 (379) T protein:vir:10 324 DWTRVTKVT-------TEG-LSLEFSE---VEGTNFVK-N--NITARIEAQVALAVEQPAA--LIFGDFTAV 379 (379) T ss_pred ecccEEEEE-------Eec-eEEEEee---cccccccC-C--cEEEEEEEEeccEEecCcc--EEEEEecCC Confidence 1100 000 000 0000000 00001110 0 01111 1011111111 111111111 No 122 >protein:vir:8420 Length: 477 # NCBI annotation: gp15 # Family: family:all:21 # MgeID: mge:155 # MgeName: Omega # Cross-refs: genbank:acc:NP_818316;genbank:gi:29566752;genbank:GeneID:1260033 Probab=71.89 E-value=0.19 Score=24.54 Aligned_cols=312 Identities=13% Similarity=0.031 Sum_probs=120.3 Q ss_pred CCccchhhh-----------hhhhc--------CCccchHHHHH------------HHHHhhhcCCCcChhhccCccccc Q lcl|NC_018856. 1 MTELKKEAE-----------AKNKK--------LPVEAEAELAE------------LVSKSFTTGYGITPDTQLDGAAVR 49 (479) Q Consensus 1 ~~~~~~~~~-----------~~~~~--------~~~~~~~~~~e------------~~~Ks~tag~~~~p~~~~~gaalr 49 (479) +.+...+.+ ..... ......+.+.. ...+....--..+..+..||..+. T Consensus 90 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~lv~ 169 (477) T protein:vir:84 90 VRKATVEVNEALTYEKGNGQSYFRDLAMQTVGMADEPAKERLRRHMVDVESDKEIRKIAKVGEEYRDLDRNGGTGGYAVP 169 (477) T ss_pred hcccccccccchhhhhhHHHHHHHHHHHHHhhhhhhHHHHHHHHHHhhhhhhhhHHHHHHhhhhhccccccCCCcceeec Confidence 000000000 00000 00000000000 000011000011112233455555 Q ss_pred hhhhhhhhhhheeccccccchhhccccchhHHHHhhhhhhccCcccccccccccc-----cccccCcceEEEEEEEEeee Q lcl|NC_018856. 50 RELLEDQVKMLAFSSNDFTIYPLINKQQVNSTVAKYAVFNQHGRTGHSRFVREVG-----VASINDPNIRQKTVQMKFLS 124 (479) Q Consensus 50 ~esld~~~~~l~~~~~~f~f~~~i~k~~~~stv~eY~~~~~~G~~g~~~fv~E~g-----~~~~~d~~~~r~~~~~k~l~ 124 (479) .|-+...+..+.. ..-.+.+.+...+...+...+..-...++..-+..++|++ ..+.+|+.+.+....+|=++ T Consensus 170 ~~~~~~~ii~~l~--~~~~i~~~~~~~~~~~~~~~~~ip~~~~~~~~a~~~~Eg~~~~~~~~~~s~~~f~~i~~~~~k~~ 247 (477) T protein:vir:84 170 PLWMMNRFIELAR--AGRTYANLCPTEPLPGGTSSINIPKILTGTSTAIQAADNAALTAPSAHEVDLTDGFVQANVKTIA 247 (477) T ss_pred cchhHHHHHHHhh--hcchHHHhhceeeecCCcceeEEEEEecCcceeeeeccCcccccccccccccceeeEEEeeeeEE Confidence 5544444432221 1122334444444444433322211122222234577775 34678899999999998888 Q ss_pred ehhhhhhhHhhhcchhhHHHHHHHHHHHHHHHHHHHHHhhcccccCCCCCCcccchhhhHHHhhccCCCEEEccCCCCCH Q lcl|NC_018856. 125 DTKQQSLAAGLVNNIADPMTILTEDAIAVIAKSIEWAIFYGDAALSSEADGQAGIEFDGLHKLIDQDTNVIDLKGARLDE 204 (479) Q Consensus 125 ~~~~vs~~~~lvn~~~Dp~~~~~~~ai~~~~~~iE~a~f~Gd~~l~~~~~~~~gleFDGl~~~I~~~~NviDarG~~l~~ 204 (479) .--.+|..+= .++.-|.+....++-...+++.++.++|+|+-. +-+..||.+.-. .+.+++-+...+. T Consensus 248 ~~~~iS~ell-~ds~~~l~~~i~~~l~~~~~~~~d~~~l~G~Gt---------~~~p~Gi~~~~~--~~~~~~~~~~~t~ 315 (477) T protein:vir:84 248 GQQGIAIQLL-DQAAVSVDEFVFRDLAADYANKLNVQVISGTGS---------NNQVVGVRATAG--ITQVTATSAGSAL 315 (477) T ss_pred eeeHHHHHHH-hccchhHHHHHHHHHHHHHHHHHHHHHhccCCC---------CCccceeeeccc--cccccccccccch Confidence 8877876543 333446788888889999999999999999743 113467776532 2444444444332 Q ss_pred HH-------HhhhhhhhhhccCceE-EEecChHHhhhHHHHhhC-cceeeeccCCCccee----------------eeeh Q lcl|NC_018856. 205 AT-------LNKAAVIVGKGYGRAT-DAFMPIGVQADFTNNLLD-RQRVIQPSTAGGFST----------------GFSI 259 (479) Q Consensus 205 ~~-------l~~aa~~i~~~fG~~t-d~~mp~~vka~f~~~~~~-~qrv~~~~n~g~~~~----------------G~~I 259 (479) .. |-.+..-+..+|+... -.+|++...+.+...-.. ++-.++|+..+.... |++| T Consensus 316 ~~~~~~~~~i~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~~~~~~~~~~~~~~l~G~pV 395 (477) T protein:vir:84 316 EKHQIIYQKIADAIQRVHTSRFLEPEVIVMHPRRWASFHAIFAGDDRPLIVPSGPGFNNLGVLTEVASQRVVGQMHGLPV 395 (477) T ss_pred hhHHHHHHHHHHHHhhccccccCCccEEEEcHHHHHHHHHhhccCCCeeeecCcccccccccccccccccccchhcccce Confidence 22 2222233344555544 467798888777665543 333334432221111 2222 Q ss_pred hhhcCCCcceeccc---ceec-C-CCceecccCCcCCCCCCCceeEeeeeccCCCCCCCcccccceEEE----------E Q lcl|NC_018856. 260 NQFLSTRGAINLHG---STIM-E-NDNILLEGRNPEPNAPQAPASVVASIVDDKKGGFRDEDIKTHSYK----------V 324 (479) Q Consensus 260 ~~~~s~~G~I~l~~---s~~m-~-~~~~L~e~~~~~~~AP~~pa~v~at~~t~~~G~f~~~d~gty~Yk----------V 324 (479) -......-...-.+ .++. + .+.++.++.+.....+- .-.+ . .......|.|- + T Consensus 396 v~s~~~p~~~~~~~d~~~i~~gd~~~~~i~~~~~~~~~~~~--------~~~~---~-~~~~~~v~~~~~~~~~r~~~af 463 (477) T protein:vir:84 396 VTDPTLPTTLGTGTDQDVIHVLRASDLALFESSVRMRALQE--------TRAE---N-LSVLLQVYGYLAFTAARFPQSV 463 (477) T ss_pred EecCcccccccccCCcceEEEEEeceEEEEeeceeEEeccc--------cccc---c-ceeeeeehhhhhhhhhccccce Confidence 11100000000000 0000 0 00011110000000000 0000 0 00000011111 1 Q ss_pred EEEcCcCccccccc Q lcl|NC_018856. 325 VVHSDDAESLPSEA 338 (479) Q Consensus 325 tavn~~GES~pS~~ 338 (479) +.+..-+-.+|.=+ T Consensus 464 v~~t~~~~~~~~~~ 477 (477) T protein:vir:84 464 VEIGGTALTAPTFA 477 (477) T ss_pred EEeecccccccccC Confidence 11111222222111 No 123 >protein:vir:4159 Length: 315 # NCBI annotation: structural protein # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:87 # MgeName: psiM2 # Cross-refs: genbank:acc:NP_046968;genbank:gi:9630538;genbank:GeneID:1261712 Probab=66.34 E-value=0.27 Score=23.71 Aligned_cols=291 Identities=12% Similarity=0.056 Sum_probs=133.4 Q ss_pred hhcCCccchHHHHHHHHHhhhcCCCcChhhccCccccchhhhhhhhhhheeccccccchhhccccchhHHHHhhhhhhcc Q lcl|NC_018856. 12 NKKLPVEAEAELAELVSKSFTTGYGITPDTQLDGAAVRRELLEDQVKMLAFSSNDFTIYPLINKQQVNSTVAKYAVFNQH 91 (479) Q Consensus 12 ~~~~~~~~~~~~~e~~~Ks~tag~~~~p~~~~~gaalr~esld~~~~~l~~~~~~f~f~~~i~k~~~~stv~eY~~~~~~ 91 (479) ..+........+.+. .|++++ +| .+|+.|..|.+++-|..+... -.|++.+.-.+. ...+...... T Consensus 1 ~~~~~~~~~~~~~~~-~k~~t~-----~d--~~Gg~l~P~~~~~~i~~~~e~---s~~l~~~~vi~~---~~~~~~~i~~ 66 (315) T protein:vir:41 1 MLTIEDIRGGKPFEI-VPKIDV-----PD--LGRGVLSVDRFGEFVKAVRDS---AVIIPEARIDNA---LKSYEKDISR 66 (315) T ss_pred CcccchhhcCChhhh-hhhcCC-----cC--CCCceechHHHHHHHHHHHhh---hhhhhhceeeec---cccccccccc Confidence 333344444444443 477652 22 268889999998877655443 334554432111 1111121222 Q ss_pred Cccc-----ccccccccccccccCcceEEEEEEEEeeeehhhhhhhHhhhcch--hhHHHHHHHHHHHHHHHHHHHHHhh Q lcl|NC_018856. 92 GRTG-----HSRFVREVGVASINDPNIRQKTVQMKFLSDTKQQSLAAGLVNNI--ADPMTILTEDAIAVIAKSIEWAIFY 164 (479) Q Consensus 92 G~~g-----~~~fv~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lvn~~--~Dp~~~~~~~ai~~~~~~iE~a~f~ 164 (479) .+.| ...-.+|.+.+..++|.+.+....++-+..--.+|.-+ |.++. .|-+......--.+++...|.+.|. T Consensus 67 ~g~~~~~~~g~~~~~~~~~~~~~~~~f~~~~l~~~~l~~~~~it~el-L~D~~~~~~~e~~l~~~~a~~~a~~~~~~~~n 145 (315) T protein:vir:41 67 LSLVLDVGPGRDETGQKLAPPESTAEVKTNTLYMREMVTKVVIHEDA-IEDNIEGKAFEQKIVTLLGEGISYVLEKYYLH 145 (315) T ss_pred cccCcccccccccccCcCCCCCCccccceeeeceeeeeeeccccHHH-HHhhhccccHHHHHHHHHHHHHHHHHHHHhhc Confidence 1111 11234566667778899999998888877654444322 23443 3788888888889999999999999 Q ss_pred cccccCCCCCCcccchhhhHHHhhccC--CCEEEccCCCCCHHHHhhhhhhhhhcc---CceEEEecChHHhhhHHHHhh Q lcl|NC_018856. 165 GDAALSSEADGQAGIEFDGLHKLIDQD--TNVIDLKGARLDEATLNKAAVIVGKGY---GRATDAFMPIGVQADFTNNLL 239 (479) Q Consensus 165 Gd~~l~~~~~~~~gleFDGl~~~I~~~--~NviDarG~~l~~~~l~~aa~~i~~~f---G~~td~~mp~~vka~f~~~~~ 239 (479) ||..-.+ +. =-+.+|+.+.+... ....|.....++.+.|..+.--+-..| +.---++|+..+.+.+..... T Consensus 146 Gdg~s~~-p~---~~~~~G~l~~a~~~~~~~~~~~~a~~~~~d~l~~l~~sl~~~yr~~~~~~~~imn~~t~~~~rklk~ 221 (315) T protein:vir:41 146 GDTSSSD-PL---LRMSDGWLKLASEKLTESDVDPEAEDWPMNLFDTMIESLPTPYRNNLPNMKFYVTWDIYRAYRDALK 221 (315) T ss_pred cCCcCcC-cc---ccccccceecccccccccccccccccccHHHHHHHHHhcChHHhhcCCceEEEEcHHHHHHHHHHhc Confidence 9986211 10 01469999877532 123444555566666655443333333 222247799998888766555 Q ss_pred CcceeeeccC----CCcceeeeehhhhc------CCCcceecccceecCCCceecccCCcCCCCCCCceeEeeeeccCCC Q lcl|NC_018856. 240 DRQRVIQPST----AGGFSTGFSINQFL------STRGAINLHGSTIMENDNILLEGRNPEPNAPQAPASVVASIVDDKK 309 (479) Q Consensus 240 ~~qrv~~~~n----~g~~~~G~~I~~~~------s~~G~I~l~~s~~m~~~~~L~e~~~~~~~AP~~pa~v~at~~t~~~ 309 (479) .+.+.+.+.. ....-.|+.|.... .+.+.|-|..= .+ -+++..+.-. .-. T Consensus 222 ~~g~~lw~~~~~~g~~~tl~G~PV~~~~~m~~~~~~~~~ilf~d~---~n-l~~~~~~~i~-------------i~~--- 281 (315) T protein:vir:41 222 GRETGLGDQALTGANSILYDGRPVQYVPALEALNDGKSRALFVVP---TQ-LVYGFWRNIK-------------VVP--- 281 (315) T ss_pred cCCCccccchhhcCCCceecccceEecccccccCCCCccEEEecc---cc-eEEEeccccE-------------EEe--- Confidence 5444432211 11111133221100 01111111110 00 0000000000 000 Q ss_pred CCCCcccccceEEEEEE-Ec-CcCccccccceeeee Q lcl|NC_018856. 310 GGFRDEDIKTHSYKVVV-HS-DDAESLPSEAVTAAV 343 (479) Q Consensus 310 G~f~~~d~gty~YkVta-vn-~~GES~pS~~vt~Tv 343 (479) .....-+.+.|..+. ++ ..+.+-.......+| T Consensus 282 --~~~a~~~~~~~~~~~r~d~~~~~~~~~a~~~~~v 315 (315) T protein:vir:41 282 --DYDAEMRLTKYVASLRTDNHYEDEEGAVSATITV 315 (315) T ss_pred --eecCCCCceEEEEEEEeceeEEeccceeEeeeeC Confidence 000000112222221 11 112221111111111 No 124 >protein:vir:95963 Length: 395 # NCBI annotation: ORF009 # Family: family:all:635 # MgeID: mge:1594 # MgeName: 2638A # Cross-refs: genbank:acc:YP_239802;genbank:gi:66395459;genbank:GeneID:5132880 Probab=61.32 E-value=0.36 Score=23.05 Aligned_cols=315 Identities=15% Similarity=0.086 Sum_probs=122.0 Q ss_pred CCccchhhhhhhhcCCccchHHHHHHHHHhh---hcC------------CCcChhhccCccccchhhhhhhhhhheeccc Q lcl|NC_018856. 1 MTELKKEAEAKNKKLPVEAEAELAELVSKSF---TTG------------YGITPDTQLDGAAVRRELLEDQVKMLAFSSN 65 (479) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~e~~~Ks~---tag------------~~~~p~~~~~gaalr~esld~~~~~l~~~~~ 65 (479) +.+..++-+.+..+ ...++..+...+.+ ..| ..+...+..+|+.|-.+.+.+.|.... .+ T Consensus 38 ~~~~~~~~~~~~~~---~~~~e~~~~~~~~~~~~~r~~~~l~~ee~~~~~~~~~~t~~~gG~liP~~~~~~Ii~~l--~~ 112 (395) T protein:vir:95 38 FGAMFDALSNDLQE---EITAEINNRVVDNGILAKRSQDPLTSEERKFFNDINYDVGYTDEKILPETVVERVFDDL--QK 112 (395) T ss_pred HHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHhhcCccccchHHHHHHHHHhhccCCCCceeccHHHHHHHHHHH--Hh Confidence 00000000000000 00000000000000 000 012223455677888888777774322 22 Q ss_pred cccchhhccccchhHHHHhhhhhhccCccccccccccccc-ccccCcceEEEEEEEEeeeehhhhhhhHhhhcchhhHHH Q lcl|NC_018856. 66 DFTIYPLINKQQVNSTVAKYAVFNQHGRTGHSRFVREVGV-ASINDPNIRQKTVQMKFLSDTKQQSLAAGLVNNIADPMT 144 (479) Q Consensus 66 ~f~f~~~i~k~~~~stv~eY~~~~~~G~~g~~~fv~E~g~-~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lvn~~~Dp~~ 144 (479) ...+++.+...++..++ .+...++.+...++.|.+. ...+++.+.+.....+=|+.-..+|..+ |.++..|.+. T Consensus 113 ~s~i~~~~~v~~~~~~~----~i~~~~~~~~a~w~~e~~~~~~~~~~~f~~i~l~~~kl~~~~~iS~el-l~ds~~~ie~ 187 (395) T protein:vir:95 113 DHPLLSKINFQNAGIKT----RVIKADPAGQAVWGKVFGEIKGQLDAAFREENFTQYKLTCFVVLPDDL-STFGPAWIER 187 (395) T ss_pred hhhhhhhceeEecCCce----EEEEecCCcceEEeecccccCccccccceeeeeceeeEEEeecccHHH-HhcchhHHHH Confidence 33455555555555543 2333455555667677665 4578999999999999998777777665 5667788899 Q ss_pred HHHHHHHHHHHHHHHHHHhhcccccCCCCCCcccchhhhHHHhhccCCCEE-Ecc-CCCCCH-------HHH----hhhh Q lcl|NC_018856. 145 ILTEDAIAVIAKSIEWAIFYGDAALSSEADGQAGIEFDGLHKLIDQDTNVI-DLK-GARLDE-------ATL----NKAA 211 (479) Q Consensus 145 ~~~~~ai~~~~~~iE~a~f~Gd~~l~~~~~~~~gleFDGl~~~I~~~~Nvi-Dar-G~~l~~-------~~l----~~aa 211 (479) ...+.--..+++.++.+++.|+-.=.. + =-||++.+....... +.. ...+.. ..| ..++ T Consensus 188 ~i~~~la~~ia~~~~~a~i~G~G~~~~--q------P~Gil~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~l~~~~~~~~ 259 (395) T protein:vir:95 188 FVRTQIQEAISVALESAIINGGGAAKT--Q------PVGLMKDVNTNSGAVTDKASSGTLTFADADTTILELNDVLKNLS 259 (395) T ss_pred HHHHHHHHHHHHHHhhheeeccCCCCc--C------ceeeeecccccccccccccccchhhhhhhHhhHHHHHHHHHhhc Confidence 999999999999999999999854211 1 135655443221111 000 001111 111 1111 Q ss_pred hh-hhhc--c-CceEEEecChHHhhhHHHHhhCcceeeeccCCCcce---eeee-hhhhcCCCcceecccceecCCCcee Q lcl|NC_018856. 212 VI-VGKG--Y-GRATDAFMPIGVQADFTNNLLDRQRVIQPSTAGGFS---TGFS-INQFLSTRGAINLHGSTIMENDNIL 283 (479) Q Consensus 212 ~~-i~~~--f-G~~td~~mp~~vka~f~~~~~~~qrv~~~~n~g~~~---~G~~-I~~~~s~~G~I~l~~s~~m~~~~~L 283 (479) +. ..+. | |.. .+.|+.....+.. +....++.+.+..+ .|+. +.+-..|.|.|.| |+ | .+ -++ T Consensus 260 ~~~~~~~~~~~~~~-~~~mn~~t~~~~~-----g~~~~~~~~G~~~~~lg~g~~v~~~~~~p~~~i~f-gd-f-s~-y~i 329 (395) T protein:vir:95 260 VDEKGKELKIDGKV-ALVVNPRDSWDVQ-----ARYTYLTANGGFVTVLPYNVTIITSEFVPEGKLVA-FV-T-DR-YNA 329 (395) T ss_pred cccccchhhhcCce-EEEEcchhhhhcC-----CcceeccCCCcceeccCCcceEEEcCCCCCCcEEE-Ee-c-cc-EEE Confidence 11 0000 1 111 1345555444332 23333443222111 1321 1222234444433 22 1 11 111 Q ss_pred cc-cCCcCCCCCC-----CceeEeeeeccCCCCCCCcccccceEEEEEEEcCcCccccccceeeeeecCCceEEEE Q lcl|NC_018856. 284 LE-GRNPEPNAPQ-----APASVVASIVDDKKGGFRDEDIKTHSYKVVVHSDDAESLPSEAVTAAVAKKDNTVKLE 353 (479) Q Consensus 284 ~e-~~~~~~~AP~-----~pa~v~at~~t~~~G~f~~~d~gty~YkVtavn~~GES~pS~~vt~Tv~~~g~sv~lt 353 (479) +. .-+.-.-.+. --+...+..-. +|+-.. .. ..+|.-+...-+ |... ++..+..+.+--. T Consensus 330 ~~r~~~~i~~~~~~~~~~d~~~f~~~~r~--dg~~~~--~~--A~~~l~i~~~~~--~~~~--~~~~~~~~~~~~~ 395 (395) T protein:vir:95 330 VRGGGLTVKKFDQTLALEDAVLFTAKTFA--YGQPDD--NK--ASAVYDLKVASA--PRRQ--TSAGGTTDGIAEA 395 (395) T ss_pred EEecceEEEeccchhhhCCcEEEEEEEEE--CCEEec--cc--cEEEEEeeccCC--CCCC--CCCCCCCCccccC Confidence 11 0000000000 00111111111 111110 00 112222211000 1000 0000000000000 No 125 >protein:vir:5255 Length: 304 # NCBI annotation: hypothetical protein # Family: family:all:463 # MgeID: mge:117 # MgeName: Aaphi23 # Cross-refs: genbank:acc:NP_852760;genbank:gi:31544035;uniprot:Q7Y5U0;genbank:GeneID:2753552 Probab=60.32 E-value=0.18 Score=24.63 Aligned_cols=267 Identities=10% Similarity=0.002 Sum_probs=119.0 Q ss_pred ccCccccchh--hhhhhhhhheeccccccchhhcc---ccchhHHHHhhhhhhccCccccccccc-ccccccccCcceEE Q lcl|NC_018856. 42 QLDGAAVRRE--LLEDQVKMLAFSSNDFTIYPLIN---KQQVNSTVAKYAVFNQHGRTGHSRFVR-EVGVASINDPNIRQ 115 (479) Q Consensus 42 ~~~gaalr~e--sld~~~~~l~~~~~~f~f~~~i~---k~~~~stv~eY~~~~~~G~~g~~~fv~-E~g~~~~~d~~~~r 115 (479) |+++|=|-+| .+|+++...-+ .+++.-+.|+ .-++-.+.-.|..+..+|+.-++ +++ ..++-+.-|.++.+ T Consensus 1 ~~~lafl~~qL~~id~~vye~~~--~~~~~~~lipv~t~~~~~~~~~~~~~~d~~G~a~~~-~i~~~a~dip~vd~~~~~ 77 (304) T protein:vir:52 1 MSLLAYVKNGLTAVSKDIAETKY--PEIVFPQFVYVDQQTAVGITEKLHYGADEHGSLDDG-LITVGTSTLDQVEVGFTP 77 (304) T ss_pred CchHHHHHHHHHHHhhhhhcccc--ccchhhhhccccCCCCcccceEEEeeeeccCccccc-ccCCcCCccceeecccce Confidence 5555555553 24444432111 2344444443 11122222333333344444322 333 55777889999999 Q ss_pred EEEEEEeeeehhhhhhhHhh---hcchhhHHHHHHHHHHHHHHHHHHHHHhhcccccCCCCCCcccchhhhHHHhhccCC Q lcl|NC_018856. 116 KTVQMKFLSDTKQQSLAAGL---VNNIADPMTILTEDAIAVIAKSIEWAIFYGDAALSSEADGQAGIEFDGLHKLIDQDT 192 (479) Q Consensus 116 ~~~~~k~l~~~~~vs~~~~l---vn~~~Dp~~~~~~~ai~~~~~~iE~a~f~Gd~~l~~~~~~~~gleFDGl~~~I~~~~ 192 (479) ++..|.-.+.++.+|+. ++ +..-.+..+..-+.|.+.+-+.+-.-.||||++. ..+-||.+. ++- T Consensus 78 ~~~~i~~~~~~~~y~~~-El~~a~~~g~~l~~~ka~aa~~a~~~~~n~v~~~Gd~~~---------~g~~GllN~--p~v 145 (304) T protein:vir:52 78 TRSYIVPWAKSVTWTKP-ELEQGKLLGLALNTAKIMALNKNAQQTLQKVAFLGHAKD---------SRLTGLLNN--KSV 145 (304) T ss_pred eEEEEEEEeeeeeecHH-HHHHHHHhCCCcHHHHHHHHHHHHHhhhceEEEEeeccc---------cceEEEEeC--CCc Confidence 99999999999999743 33 2222366677777788889999999999998752 124566543 221 Q ss_pred CEEEccC----CCC---C-HHH---Hhhhhhhh---hhccCceEEEecChHHhhhHHHHhhC-----cceeeeccCCCcc Q lcl|NC_018856. 193 NVIDLKG----ARL---D-EAT---LNKAAVIV---GKGYGRATDAFMPIGVQADFTNNLLD-----RQRVIQPSTAGGF 253 (479) Q Consensus 193 NviDarG----~~l---~-~~~---l~~aa~~i---~~~fG~~td~~mp~~vka~f~~~~~~-----~qrv~~~~n~g~~ 253 (479) .++..-| ... + +++ |+++-.-+ +++.-.++.+.||+.....+..-..+ -..+++.++.... T Consensus 146 ~~~~~~~~~a~~~w~~~T~~eI~~di~~~~~~i~~~s~~~~~p~tl~Lpp~~~~~l~~~~~~~~~~Tvl~~l~~n~~~~~ 225 (304) T protein:vir:52 146 EVYAIKGAAQNTKVQAMDFDKAVAFFKEIFLKGMEKTKRIEAPNTFAIDSLDLAHLALVQRANTDTTALEFLTKHLSAAA 225 (304) T ss_pred ceeeecCCccCCccccCCHHHHHHHHHHHHHHHHhccCceecCceEEeCHHHHHHHhhccCCCCCchHHHHHHHhccccc Confidence 2233222 111 1 222 44444333 44456788999999987777321111 1112222222111 Q ss_pred eeeeehhhh---cCCCcceecccceecCCCceecccCCcCC-----CCCCCc-eeEeeeeccCCCCCCCcccccceEEEE Q lcl|NC_018856. 254 STGFSINQF---LSTRGAINLHGSTIMENDNILLEGRNPEP-----NAPQAP-ASVVASIVDDKKGGFRDEDIKTHSYKV 324 (479) Q Consensus 254 ~~G~~I~~~---~s~~G~I~l~~s~~m~~~~~L~e~~~~~~-----~AP~~p-a~v~at~~t~~~G~f~~~d~gty~YkV 324 (479) -.++.|... ....|.=.-+.-++.+++.--++-.+|.| ..+-.. .-++... .-.+|-+- -|-+.+ T Consensus 226 g~~l~I~~v~~~~~~~g~~g~~r~vvY~~d~~~~~~~vP~p~~~l~~q~~~~~~~~vp~~-~r~gGv~v-----~~P~a~ 299 (304) T protein:vir:52 226 GRQVAIKALPSNYGTRVTDGKTRAMVYVNSKEHVIFDVPMSPTVLDAQPKGLLAFESGLR-MAFGGVTF-----MEPDSA 299 (304) T ss_pred CCcceEEEecccccccCCCCceEEEEEecChhheEEecCccccccchhhcCCceEEecce-eeeeeEEE-----Ecccee Confidence 011222211 11111000000122233311111111111 011001 1111000 00112110 133455 Q ss_pred EEEcC Q lcl|NC_018856. 325 VVHSD 329 (479) Q Consensus 325 tavn~ 329 (479) +.++. T Consensus 300 ~y~D~ 304 (304) T protein:vir:52 300 LYVDY 304 (304) T ss_pred eeecC Confidence 55555 No 126 >protein:vir:103285 Length: 296 # NCBI annotation: hypothetical protein # Family: family:all:463 # MgeID: mge:1605 # MgeName: JK06 # Cross-refs: genbank:acc:YP_277465;genbank:gi:71834107;genbank:GeneID:3562396 Probab=51.98 E-value=0.57 Score=21.94 Aligned_cols=274 Identities=13% Similarity=0.125 Sum_probs=123.5 Q ss_pred cChhhccCccccch---hhhhhhhhhheeccccccchhhccccchhHHHHhhhh---hhccCccccccccc-cccccccc Q lcl|NC_018856. 37 ITPDTQLDGAAVRR---ELLEDQVKMLAFSSNDFTIYPLINKQQVNSTVAKYAV---FNQHGRTGHSRFVR-EVGVASIN 109 (479) Q Consensus 37 ~~p~~~~~gaalr~---esld~~~~~l~~~~~~f~f~~~i~k~~~~stv~eY~~---~~~~G~~g~~~fv~-E~g~~~~~ 109 (479) .+-|...+++++-. |.+|+++....+. +++.-+.++ +.+.+.+|.. +....+.|....++ +....+.. T Consensus 1 ~~~~~a~~~~~f~~~ql~~id~~v~e~~~~--~l~~~~~i~---v~~~~~~~~~~~~~~~~~~~G~a~~~~~~~~dip~v 75 (296) T protein:vir:10 1 MGVDKADAAGIWTVKQLTASLNKAYETEYD--QNSVVNLFP---VSNEIPGYAKYFEYPVFDGVGIAQIVADYTDDLPLV 75 (296) T ss_pred CcccchhhhHHHHHHHHHHHHHHHHhhhhc--ccccceecc---cccCCCCceeEEEeeeeeccCceeEeCCCcccccee Confidence 33344445556655 4456665432222 222222222 1122222211 11233444444444 44556788 Q ss_pred CcceEEEEEEEEeeeehhhhhhhHhh---hcchhhHHHHHHHHHHHHHHHHHHHHHhhcccccCCCCCCcccchhhhHHH Q lcl|NC_018856. 110 DPNIRQKTVQMKFLSDTKQQSLAAGL---VNNIADPMTILTEDAIAVIAKSIEWAIFYGDAALSSEADGQAGIEFDGLHK 186 (479) Q Consensus 110 d~~~~r~~~~~k~l~~~~~vs~~~~l---vn~~~Dp~~~~~~~ai~~~~~~iE~a~f~Gd~~l~~~~~~~~gleFDGl~~ 186 (479) |.++.|++..+..++..+.++.. ++ ...-.+..+.....|.+.+.+.....+|||++.+. +-||++ T Consensus 76 ~~~~~~~~~~i~~~~~~~~~~~~-El~~a~~~g~~l~~~ka~aA~~~~~~~~n~~~f~G~~~~g----------~~GLlN 144 (296) T protein:vir:10 76 DALATERQGKVFRFGNAFLISID-EIKVGQATGQSLSTRKQSLAFEAHDKLLDKLVWSGSTAHG----------IPSVFD 144 (296) T ss_pred eccceeEEEEEEEEEeeeeecHH-HHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEeeccccc----------ceeEee Confidence 99999999999999999888743 33 23344667777888889999999999999988753 456654 Q ss_pred hhccCCCEEEccCC--CCC--HHHHhhhh---hhhhhccCceEEEecChHHhhhHHHHhhCcceeeeccCCCcceeeeeh Q lcl|NC_018856. 187 LIDQDTNVIDLKGA--RLD--EATLNKAA---VIVGKGYGRATDAFMPIGVQADFTNNLLDRQRVIQPSTAGGFSTGFSI 259 (479) Q Consensus 187 ~I~~~~NviDarG~--~l~--~~~l~~aa---~~i~~~fG~~td~~mp~~vka~f~~~~~~~qrv~~~~n~g~~~~G~~I 259 (479) - ++-....+.|. -.+ .+.|+++- ...++++=.++.+.||+.....+..-+ + + .|..+ T Consensus 145 ~--p~v~~~~~~~~W~~~t~i~~Di~~~~~~l~~~s~g~~~p~~l~L~p~~~~~L~~~~--------~-~-----~~~t~ 208 (296) T protein:vir:10 145 Y--PNINNVVSGGSWSQPTTAVSDITSLLDIIETSTNGQHRATHLLLPTTARRIMQNLV--------P-G-----TSVSY 208 (296) T ss_pred c--CCCccccccCCccCHHHHHHHHHHHHHHHHHhhCceecceeEEeCHHHHHHHhhcc--------C-C-----CCccH Confidence 2 11112223231 122 23344433 445667788899999998887764221 1 1 12222 Q ss_pred hhhcCC-CcceecccceecCCCceecccCC-cCCCCCCCceeEeeeeccCCCCCCCcccccceEEEEEEEcCcCcc---c Q lcl|NC_018856. 260 NQFLST-RGAINLHGSTIMENDNILLEGRN-PEPNAPQAPASVVASIVDDKKGGFRDEDIKTHSYKVVVHSDDAES---L 334 (479) Q Consensus 260 ~~~~s~-~G~I~l~~s~~m~~~~~L~e~~~-~~~~AP~~pa~v~at~~t~~~G~f~~~d~gty~YkVtavn~~GES---~ 334 (479) .++... ..++++...-++.....-.+.++ .-.+.|..-.-.. +-+- .+.+.......|++-...+-|.- - T Consensus 209 l~~ik~~~~~l~i~~~~~l~~a~~~g~~~~v~~~~~~~~~~~~v----~~~~-~~~~~e~~~l~~~~~~~~~~~Gv~i~~ 283 (296) T protein:vir:10 209 GEFFRQNNSGVTVEFVQYLNDYNGTGTSAAIAYEKDPNNMAIEI----PEAT-NALPAQPKDLHFKIPVTSKATGLIVYR 283 (296) T ss_pred HHHHHHhcCCceEEEeeeeccCCCCcceEEEEEEcCCceEEEEc----Ccce-eeecccccCceEEEeeEeeEEEEEEEC Confidence 222221 11111111111111100000000 0000010000000 0000 00011112345555544443311 1 Q ss_pred cccceeeeeecCCceEEEEEEec Q lcl|NC_018856. 335 PSEAVTAAVAKKDNTVKLEVKLA 357 (479) Q Consensus 335 pS~~vt~Tv~~~g~sv~ltIT~~ 357 (479) |.. +...++ ||.+ T Consensus 284 P~a-----i~~~dG-----I~~~ 296 (296) T protein:vir:10 284 PLT-----MAVMKG-----ITFA 296 (296) T ss_pred Cce-----eEEEee-----eecC Confidence 221 111122 6655 No 127 >protein:vir:93881 Length: 387 # NCBI annotation: ORF011 # Family: family:all:658 # MgeID: mge:1485 # MgeName: 3A # Cross-refs: genbank:acc:YP_239938;genbank:gi:66395599;genbank:GeneID:5130947 Probab=43.99 E-value=0.82 Score=21.05 Aligned_cols=302 Identities=10% Similarity=0.041 Sum_probs=105.6 Q ss_pred CCccchhhhhh---hhc---CCccchH---HHHHHHHHhhhcCCC--------------cChhhccCccccchhhhhhhh Q lcl|NC_018856. 1 MTELKKEAEAK---NKK---LPVEAEA---ELAELVSKSFTTGYG--------------ITPDTQLDGAAVRRELLEDQV 57 (479) Q Consensus 1 ~~~~~~~~~~~---~~~---~~~~~~~---~~~e~~~Ks~tag~~--------------~~p~~~~~gaalr~esld~~~ 57 (479) +.+...+.+.+ ..+ .....+. ...+ +.+++..+.. ....+-++|+.|.++.+..+| T Consensus 60 ~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~r~~~~~~~~~~~~~~~~~~~~al~~~t~s~gG~~IP~~~~~~I 138 (387) T protein:vir:93 60 VKDIEEKEKAKVKDTGEAYQSLNDHEKMVKAKAE-FYRHAILPNEFEKPSMEAQRLLHALPTGNDSGGDKLLPKTLSKEI 138 (387) T ss_pred HHHHHHHHHHhhhhccccCCCcchhhHHHHHHHH-HHHHHhhhhhhhhhhhhhHHHHHhhccCcCCCCceeechhHHHHH Confidence 00000000000 000 0000000 0011 1122111110 011233567888898888877 Q ss_pred hhheeccccccchhhccccchhHHHHhhhhhhccCcccccccccccccccccCcceEEEEEEEEeeeehhhhhhhHhhhc Q lcl|NC_018856. 58 KMLAFSSNDFTIYPLINKQQVNSTVAKYAVFNQHGRTGHSRFVREVGVASINDPNIRQKTVQMKFLSDTKQQSLAAGLVN 137 (479) Q Consensus 58 ~~l~~~~~~f~f~~~i~k~~~~stv~eY~~~~~~G~~g~~~fv~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lvn 137 (479) ..+...... +.+.+...++.+ + ++.+.. +..+...+++|++..+.+++.+......++=++.--.+|.- -|.+ T Consensus 139 i~~~~~~~~--l~~~~~v~~~~~-~-~~p~~~--~~~~~a~~v~E~~~~~~~~~~f~~v~~~~~k~~~~~~iS~e-ll~D 211 (387) T protein:vir:93 139 VSEPFAKNQ--LREKARLTNIKG-L-EIPRVS--YTLDDDDFITDVETAKELKLKGDTVKFTTNKFKVFAAISDT-VIHG 211 (387) T ss_pred HHHHHhhch--hhhheeeeecCC-c-eEEEEe--ecCCccccccCcccccccccccceeeeeheeeeeechhhHH-HHhh Confidence 644443332 233333333332 1 122211 11223568999999999999999988888877766556633 1245 Q ss_pred chhhHHHHHHHHHHHHHHHHHHHHHhhcccccCCCCCCcccchhhhHHHhhccCCCEEEccCCCCCHHHHhhhhhhhhhc Q lcl|NC_018856. 138 NIADPMTILTEDAIAVIAKSIEWAIFYGDAALSSEADGQAGIEFDGLHKLIDQDTNVIDLKGARLDEATLNKAAVIVGKG 217 (479) Q Consensus 138 ~~~Dp~~~~~~~ai~~~~~~iE~a~f~Gd~~l~~~~~~~~gleFDGl~~~I~~~~NviDarG~~l~~~~l~~aa~~i~~~ 217 (479) +..|.+....+.-...+.+..+..+|-+..+.. +..|+..- . .+-...|..+ .+.|.++---+... T Consensus 212 s~~~l~~~i~~~la~~~~~~e~~~~~~~g~g~g---------~p~g~l~~--~--~~~~v~~~~~-~d~i~~~~~~l~~~ 277 (387) T protein:vir:93 212 SDVDLVNWVENALQSGLAAKERKDALAVSPKSG---------LDHMSFYN--G--SVKEVEGADM-YDAIINALADLHED 277 (387) T ss_pred hHHHHHHHHHHHHHHHHHHHHHHhHhhcCCCcc---------ccceeeec--c--ccccccccch-HHHHHHHHhccChh Confidence 566777777777777777765555554332211 11222110 0 0111112111 12222222122223 Q ss_pred cCceEEEecChHHhhhHHHHhhCcceeeeccCCCcceeeeehhhhcCCCcceecccce---ecCCCceecccCCcCCCCC Q lcl|NC_018856. 218 YGRATDAFMPIGVQADFTNNLLDRQRVIQPSTAGGFSTGFSINQFLSTRGAINLHGST---IMENDNILLEGRNPEPNAP 294 (479) Q Consensus 218 fG~~td~~mp~~vka~f~~~~~~~qrv~~~~n~g~~~~G~~I~~~~s~~G~I~l~~s~---~m~~~~~L~e~~~~~~~AP 294 (479) |-...-.+|...+...+...+-++.+.++...+. --.|++|--.... .++ +.||- +......+.+ +.... .. T Consensus 278 ~~~~a~~~mn~~t~~~~~~~~~d~~~~~~~~~~~-~llG~PV~~~~~~-~~~-~~GDf~~~~~~~~~~~~~-~~~~~-~~ 352 (387) T protein:vir:93 278 YRDNATIYMRYADYVKIISVLSNGTTNFFDTPAE-KVFGKPVVFTDAA-VKP-IVGDFNYFGINYDGTTYD-TDKDV-KK 352 (387) T ss_pred hhcCCEEEEechHHHHHHHHHhcCCCcccccCCc-cccccceEEecCC-Cce-eeeehhhhheehhhheee-ecccc-cC Confidence 3332234555444333333333333332222221 1123222110000 000 11110 0000000000 00000 00 Q ss_pred CCceeEeeeeccCCCCCCCcccccceEEEEEEEcCcCccccc Q lcl|NC_018856. 295 QAPASVVASIVDDKKGGFRDEDIKTHSYKVVVHSDDAESLPS 336 (479) Q Consensus 295 ~~pa~v~at~~t~~~G~f~~~d~gty~YkVtavn~~GES~pS 336 (479) + -.-..+..-. +|+... .-..++.-+-...-|.|| T Consensus 353 ~-~~~~~~~~r~--d~~v~~----~eA~~~l~~k~~~~~~~~ 387 (387) T protein:vir:93 353 G-EYLFVLTAWY--DQQRTL----DSAFRIAKAKENTGSLPS 387 (387) T ss_pred C-ceeEEEEeee--Cceeec----hhheEEEEeecCCCCCCC Confidence 0 0000010000 111100 011222222222333444 No 128 >protein:vir:9361 Length: 402 # NCBI annotation: SLT orf 37-like protein # Family: family:all:658 # MgeID: mge:166 # MgeName: phi 12 # Cross-refs: genbank:acc:NP_803339;genbank:gi:29028650;genbank:GeneID:1258088 Probab=43.50 E-value=0.84 Score=20.99 Aligned_cols=298 Identities=11% Similarity=0.063 Sum_probs=108.1 Q ss_pred CCccchh------------------------------hhhhhhcC-----CccchHHHHHHHHHhhhcCCCcChhhccCc Q lcl|NC_018856. 1 MTELKKE------------------------------AEAKNKKL-----PVEAEAELAELVSKSFTTGYGITPDTQLDG 45 (479) Q Consensus 1 ~~~~~~~------------------------------~~~~~~~~-----~~~~~~~~~e~~~Ks~tag~~~~p~~~~~g 45 (479) +.+.+.+ ...++.+. .............+++++| +-++| T Consensus 68 ~~~l~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~~~~~a~~~~------t~~~G 141 (402) T protein:vir:93 68 FNIVERQVQDIEEKEKAKVKDKGEAYQSLSDNEKMVKAKAEFYRHAILPNEFEKPSMEAQRLLHALPTG------NDSGG 141 (402) T ss_pred HHHHHHHHHHHHHHHHhhhhhccccCCCCchhHHHHHHHHHHHHHHHhhhhHHHHHHhHHHHHhhhccC------CCcCC Confidence 0000000 00000000 0000000001112222222 22457 Q ss_pred cccchhhhhhhhhhheeccccccchhhccccchhHHHHhhhhhhccCcccccccccccccccccCcceEEEEEEEEeeee Q lcl|NC_018856. 46 AAVRRELLEDQVKMLAFSSNDFTIYPLINKQQVNSTVAKYAVFNQHGRTGHSRFVREVGVASINDPNIRQKTVQMKFLSD 125 (479) Q Consensus 46 aalr~esld~~~~~l~~~~~~f~f~~~i~k~~~~stv~eY~~~~~~G~~g~~~fv~E~g~~~~~d~~~~r~~~~~k~l~~ 125 (479) +.|-.+.+..+|..+..... .+.+.+...++.+ + .+.+. +.+ .+...++.|++..+..++++......++=++. T Consensus 142 G~lIP~~~~~~Ii~~~~~~~--~l~~~~~v~~~~~-~-~~p~~-~~~-~~~a~~v~Eg~~~~~~~~~f~~i~~~~~k~~~ 215 (402) T protein:vir:93 142 DKLLPKTLSKEIVSEPFAKN--QLREKARLTNIKG-L-EIPRV-SYT-LDDDDFITDVETAKELKAKGDTVKFTTNKFKV 215 (402) T ss_pred ccccchhHHHHHHHhHHhhh--hhhhhceeeecCC-c-eeeee-ecc-CCccccccccccccccccccceeeecceeeee Confidence 78889998888764443332 2333333344432 1 12221 122 12356899999999999999998888888876 Q ss_pred hhhhhhhHhhhcchhhHHHHHHHHHHHHHHHHHHHHHhhcccccCCCCCCcccchhhhHHHhhccCCCEEEccCCCCCHH Q lcl|NC_018856. 126 TKQQSLAAGLVNNIADPMTILTEDAIAVIAKSIEWAIFYGDAALSSEADGQAGIEFDGLHKLIDQDTNVIDLKGARLDEA 205 (479) Q Consensus 126 ~~~vs~~~~lvn~~~Dp~~~~~~~ai~~~~~~iE~a~f~Gd~~l~~~~~~~~gleFDGl~~~I~~~~NviDarG~~l~~~ 205 (479) -..+|.-+ |.++..|.+....+.-...++...+..+|-+..+.. +..|+..- ..+--..|..+ .+ T Consensus 216 ~i~iS~el-l~Ds~~~l~~~i~~~la~~~~~~e~~~~~~~g~g~g---------~p~g~~~~----~~~~~~~~~~~-~d 280 (402) T protein:vir:93 216 FAAISDTV-IHGSDVDLVNWVENALQSGLAAKERKDALAVSPKSG---------LEHMSFYN----GSVKEVEGADM-YD 280 (402) T ss_pred echhhHHH-HhhhHHHHHHHHHHHHHHHHHHHHHHhHhhcCCCcc---------ccceeeec----cccccccccch-HH Confidence 65566431 234455666666666656666554444453322211 11222110 00101112221 12 Q ss_pred HHhhhhhhhhhccCceEEEecChHHhhhHHHHhhCcceeeeccCCCcceeeeehhhhcCCCcceecccceecCCCceecc Q lcl|NC_018856. 206 TLNKAAVIVGKGYGRATDAFMPIGVQADFTNNLLDRQRVIQPSTAGGFSTGFSINQFLSTRGAINLHGSTIMENDNILLE 285 (479) Q Consensus 206 ~l~~aa~~i~~~fG~~td~~mp~~vka~f~~~~~~~qrv~~~~n~g~~~~G~~I~~~~s~~G~I~l~~s~~m~~~~~L~e 285 (479) .|..+-.-+...|-.-.-.+|...+...+...+-++.+-+....+. .-.|++|-..... .++ +.||- .+-....+ T Consensus 281 ~l~~~~~~l~~~y~~na~~imn~~t~~~~~~~~~d~~~~~~~~~~~-~llG~PV~~t~~~-~~i-~~GDf--~~~~~~~~ 355 (402) T protein:vir:93 281 AIINALADLHEDYRDNATIYMRYADYVKIISVLSNGTTNFFDTPAE-KVFGKPVVFTDAA-VKP-IVGDF--NYFGINYD 355 (402) T ss_pred HHHHHHhccChhhhcCCEEEEechHHHHHHHHHhcCCCcccccCCc-cccccceEEecCC-Cce-eeech--hhhhhhhh Confidence 2222222233334332234555555444433333333333322221 1224333211100 011 11110 00000011 Q ss_pred cCCcCC--CCCCCceeEeeeeccCCCCCCCcccccceEEEEEEEcCcCccccc Q lcl|NC_018856. 286 GRNPEP--NAPQAPASVVASIVDDKKGGFRDEDIKTHSYKVVVHSDDAESLPS 336 (479) Q Consensus 286 ~~~~~~--~AP~~pa~v~at~~t~~~G~f~~~d~gty~YkVtavn~~GES~pS 336 (479) +....+ ++-..-..-.+..-. +|+-... =-.++.-+-..+.|.|| T Consensus 356 ~~~~~~~~~~~~~~~~~~~~~r~--Dg~v~~~----~A~~~l~ik~~~~~~~~ 402 (402) T protein:vir:93 356 GTTYDTDKDVKKGEYLFVLTAWY--DQQRTLD----SAFRIAKAKENTGPLPS 402 (402) T ss_pred hhhhhhhhcccCCceEEEEEEEe--CcEEech----hheEEEEeecCCCCCCC Confidence 000000 000000111111111 1111110 01222333333444555 No 129 >protein:vir:100632 Length: 381 # NCBI annotation: 77ORF006 # Family: family:all:635 # MgeID: mge:1476 # MgeName: 77 # Cross-refs: genbank:acc:NP_958606;genbank:gi:41189521;genbank:GeneID:2743778 Probab=42.53 E-value=0.88 Score=20.88 Aligned_cols=307 Identities=11% Similarity=0.034 Sum_probs=113.5 Q ss_pred CCcc-chhhhhhhh----c--CCccchHHH---HHHHH-------------HhhhcCCCcChhhccCccccchhhhhhhh Q lcl|NC_018856. 1 MTEL-KKEAEAKNK----K--LPVEAEAEL---AELVS-------------KSFTTGYGITPDTQLDGAAVRRELLEDQV 57 (479) Q Consensus 1 ~~~~-~~~~~~~~~----~--~~~~~~~~~---~e~~~-------------Ks~tag~~~~p~~~~~gaalr~esld~~~ 57 (479) |.+. .+|++.+.. + ......++. .+.+. |.|.+ ....+..+|+.|-++.+.+.| T Consensus 20 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~l~~~e~~~~~~---~~~~t~~~Gg~lvP~~~~~~I 96 (381) T protein:vir:10 20 VNNGEPQERQNELYGDMINQLFEETKLQAKAEAERVSSLPKSAQTLSANQRNFFMD---INKSVGYKEEKLLPEETIDRI 96 (381) T ss_pred HHhhhHHHHHHHHHHHHHHhhhhhHHHHHHHHHHHHHHhcccccccCHHHHHHHHH---HhhcCCCCCceecCHHHHHHH Confidence 1110 000000000 0 000000000 01100 11100 112233567778877777777 Q ss_pred h-hheeccccccchhhccccchhHHHHhhhhhhccCccccccccccccc-ccccCcceEEEEEEEEeeeehhhhhhhHhh Q lcl|NC_018856. 58 K-MLAFSSNDFTIYPLINKQQVNSTVAKYAVFNQHGRTGHSRFVREVGV-ASINDPNIRQKTVQMKFLSDTKQQSLAAGL 135 (479) Q Consensus 58 ~-~l~~~~~~f~f~~~i~k~~~~stv~eY~~~~~~G~~g~~~fv~E~g~-~~~~d~~~~r~~~~~k~l~~~~~vs~~~~l 135 (479) . .|... ..+++.+...++.+.+ + +....+.+.+.++.|.+. +...+|.+.+.....+=|+.--.+|..+ | T Consensus 97 ~~~l~~~---spir~~a~v~~~~~~~-~---i~~~~~~~~a~W~~e~~~~~~~~~~~f~~i~l~~~kl~a~i~is~el-L 168 (381) T protein:vir:10 97 FEDLTTN---HPLLADLGIKNAGLRL-K---FLKSETSGVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDL-N 168 (381) T ss_pred HHHHHhh---cceeeeeeeEecCcce-E---EEeecCCcceEEeecccccccccCccceeEeecceeEEeeccccHHH-H Confidence 5 33322 2444545444443332 2 233344445667778765 5678999999999999888766666555 5 Q ss_pred hcchhhHHHHHHHHHHHHHHHHHHHHHhhcccccCCCCCCcccchhhhHHHhhccCCCEEEccCC---------CCCHHH Q lcl|NC_018856. 136 VNNIADPMTILTEDAIAVIAKSIEWAIFYGDAALSSEADGQAGIEFDGLHKLIDQDTNVIDLKGA---------RLDEAT 206 (479) Q Consensus 136 vn~~~Dp~~~~~~~ai~~~~~~iE~a~f~Gd~~l~~~~~~~~gleFDGl~~~I~~~~NviDarG~---------~l~~~~ 206 (479) .++..|.+....+.--.++++.++.++..||-.--| -||++.+....++.+.-.. ..+... T Consensus 169 ~Ds~~~le~~i~~~la~~~a~~~~~afi~GdG~~qP----------~Gil~~~~~~~~~~~g~~~~~~~~~~~t~~~~~~ 238 (381) T protein:vir:10 169 DFGPAWIERFVRVQIEEAFAVALETAFLKGTGKDQP----------IGLNRQVQKGVSVTDGAYPEKEEQGTLTFANPRA 238 (381) T ss_pred hccHHHHHHHHHHHHHHHHHHHhhceeEecccCCCc----------eeeeecCCccccccccccccccccccccccchhh Confidence 667788999999999999999999999999975322 5777666543333221100 011111 Q ss_pred H-hh-------hhhhh---hhcc-CceEEEecChHHhhhHHHHhhCcceeeeccCCCccee----eeeh-hhhcCCCcce Q lcl|NC_018856. 207 L-NK-------AAVIV---GKGY-GRATDAFMPIGVQADFTNNLLDRQRVIQPSTAGGFST----GFSI-NQFLSTRGAI 269 (479) Q Consensus 207 l-~~-------aa~~i---~~~f-G~~td~~mp~~vka~f~~~~~~~qrv~~~~n~g~~~~----G~~I-~~~~s~~G~I 269 (479) + +. .++.- ...| |.+ -+.|.+.+...+. .....++.+ |.... |..| ..-..|.|+| T Consensus 239 ~~~~l~~~~~~~~~~~~~~~~~~~~~~-~~vmn~~t~~~l~-----~~~~~~~~~-G~~v~~lp~g~~vv~~~~~p~~~i 311 (381) T protein:vir:10 239 TVNELTQVFKYHSTNEKGKSVAVKGNV-TMVVNPSDAFEVQ-----AQYTHLNAN-GVYVTALPFNLNVIESTVQEAGKV 311 (381) T ss_pred HHHHHHHHHHhhhhhhccccccccCce-EEEEchhhHHhhc-----cccccCCCC-CceeecCCCCceeEEcCCCCcCcE Confidence 0 11 11100 0001 111 1223433332221 111111111 11110 1000 0000111111 Q ss_pred ecccceecCCCceecccCCcCCCCCCCceeEeeeeccCCCCCCCcccccceEEEEEEEcCcCccccccceeeeeecCCce Q lcl|NC_018856. 270 NLHGSTIMENDNILLEGRNPEPNAPQAPASVVASIVDDKKGGFRDEDIKTHSYKVVVHSDDAESLPSEAVTAAVAKKDNT 349 (479) Q Consensus 270 ~l~~s~~m~~~~~L~e~~~~~~~AP~~pa~v~at~~t~~~G~f~~~d~gty~YkVtavn~~GES~pS~~vt~Tv~~~g~s 349 (479) .| ||- .+ -++++ .-+....+.. .-.|.. | .--|+...-- +|.- ....-- T Consensus 312 ~f-GDf--s~-Y~i~~-------r~~~~i~~~~------~~~~~~-d--~~~f~a~~r~-dG~~----------~~~~A~ 360 (381) T protein:vir:10 312 LT-YVK--GL-YDGYL-------AGGINVQKFK------ETLALD-D--MDLYTAKQFA-YGKA----------KDNKVA 360 (381) T ss_pred EE-EEc--cc-EEEEE-------ecccEEEeec------hhhhhc-C--ceEEEEEEEE-cCEE----------ecCCcE Confidence 11 100 00 00000 0000000000 000000 0 0011111111 1110 000111 Q ss_pred EEEEEE----ecCCCCcccce Q lcl|NC_018856. 350 VKLEVK----LASLYQAQPQF 366 (479) Q Consensus 350 v~ltIT----~~~~~~a~~~~ 366 (479) +.++|+ ++++-...++. T Consensus 361 ~v~~l~~~~~~~~~~~~~~~~ 381 (381) T protein:vir:10 361 AVWKLDLKGHKPALEDTEETL 381 (381) T ss_pred EEEEEeecCCccccccccccC Confidence 122222 12222211121 No 130 >protein:vir:97397 Length: 517 # NCBI annotation: major capsid protein # Family: family:all:11745 # MgeID: mge:1675 # MgeName: Q54 # Cross-refs: genbank:acc:YP_762590;genbank:gi:115304291;genbank:GeneID:5130600 Probab=41.67 E-value=0.92 Score=20.79 Aligned_cols=287 Identities=13% Similarity=0.145 Sum_probs=103.8 Q ss_pred CCccchhhhhhhhcCCccchHH---------------------HHHHH---------HHhhhcC--CCcCh----hhccC Q lcl|NC_018856. 1 MTELKKEAEAKNKKLPVEAEAE---------------------LAELV---------SKSFTTG--YGITP----DTQLD 44 (479) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~---------------------~~e~~---------~Ks~tag--~~~~p----~~~~~ 44 (479) +.+..++.+.+..+.....+.+ ..|.+ .+...++ ...-+ ....+ T Consensus 168 l~~~~~~~~~~~~e~~~~l~a~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 247 (517) T protein:vir:97 168 LKERENGGDNAALKTVSELAANLMKQRESEKILGVEALKVTPEATEFLKTREAEVAYMSASLTKDPKAAWTAELKERGIS 247 (517) T ss_pred HHHHHHHHHHHHHhhhhhhhhhHHHHHHhhhhcccccccccchhhHHHHHHHHHHHHHHhcccccccceeeeeccccccc Confidence 1111111000000000000000 00000 0000111 00001 01111 Q ss_pred ccccchhhhhhhhhhheeccccccchhhccccchhHHHHhhhhhhccCcccccccccccccccccCcceEEEEEEEEeee Q lcl|NC_018856. 45 GAAVRRELLEDQVKMLAFSSNDFTIYPLINKQQVNSTVAKYAVFNQHGRTGHSRFVREVGVASINDPNIRQKTVQMKFLS 124 (479) Q Consensus 45 gaalr~esld~~~~~l~~~~~~f~f~~~i~k~~~~stv~eY~~~~~~G~~g~~~fv~E~g~~~~~d~~~~r~~~~~k~l~ 124 (479) +-....+.+.+-+..+...+ .+.+-+....+ .+.........+...++.|+...+.+|..+..++..++-++ T Consensus 248 ~~~~p~~~~~~i~~~~~~~~---~i~~~~~~~~i-----~~~~~~~~~~~~~a~~~~eG~~kp~s~~tf~~~~~~~~~ia 319 (517) T protein:vir:97 248 GMPAPAGILKRIQDAVNDEG---SLLPFIRHENL-----PTLVVGGDNALTQGTGHTTGTDKTESNITLQTRVLTPQYVY 319 (517) T ss_pred ccccchHHHHHHHHhhhhhc---cceeeeeeccc-----cceeeecccccceeeeeecCCcccccccceeeEEeeHhhhh Confidence 22222222222222222111 11111110000 00010000011124467899999999999999999998888 Q ss_pred ehhhhhhhHhhhcchhh----HHHHHHHHHHHHHHHHHHHHHhhcccccCCCCCCcccchhhhHHHhhccCCCEEEccCC Q lcl|NC_018856. 125 DTKQQSLAAGLVNNIAD----PMTILTEDAIAVIAKSIEWAIFYGDAALSSEADGQAGIEFDGLHKLIDQDTNVIDLKGA 200 (479) Q Consensus 125 ~~~~vs~~~~lvn~~~D----p~~~~~~~ai~~~~~~iE~a~f~Gd~~l~~~~~~~~gleFDGl~~~I~~~~NviDarG~ 200 (479) .-..+|..+ +.++.-| .+....+.-...++...|.++..||-. |....|+..+... .....+.+. T Consensus 320 ~~~~~S~ql-l~Ds~~dd~~~l~s~i~~~l~~~l~~~ee~a~l~GdGt---------g~~~~gi~~~a~~-~~~~~~~~~ 388 (517) T protein:vir:97 320 KYIKLPKIV-MNSNATDIAGAILTYVMNRLPDMVIMAVNRAIIMGGVT---------GVSETQIYPVVGD-AWATNVTGT 388 (517) T ss_pred hhhhhhHHH-HHHhhhccHHHHHHHHHHHHHHHHHHHHHHHHhcccCC---------Ccccccccccccc-ccccccccc Confidence 877777643 2233323 566677778888999999999999853 1222344443321 112222222 Q ss_pred CCCHHHHhhhhhhhhhccCceEEEecChHHhhhHHHHhhCcceeeeccCCCcc--eeee-----------------ehhh Q lcl|NC_018856. 201 RLDEATLNKAAVIVGKGYGRATDAFMPIGVQADFTNNLLDRQRVIQPSTAGGF--STGF-----------------SINQ 261 (479) Q Consensus 201 ~l~~~~l~~aa~~i~~~fG~~td~~mp~~vka~f~~~~~~~qrv~~~~n~g~~--~~G~-----------------~I~~ 261 (479) .-..+++.....-..+.++ .-+.|+..+.+.+.-.=...=|.+.+...+.. ...+ ...+ T Consensus 389 ~~~~d~i~~l~~a~~~a~~--a~~vmn~~t~~~I~klKD~~G~Yl~~~~~~~~~~~~l~G~~~~~~~~~~~~~~~~~~~~ 466 (517) T protein:vir:97 389 TNIQELLEKLSVATPKAAD--STLVIHRNDLAAIRFLKDKNGNYVFPVGVSNQTIATHFGFNRLVQSVAVDEKTAVSLSG 466 (517) T ss_pred chHHHHHHHHHHHhhhccC--CEEEECHHHHHHHHHhhcCCCCeeccCcCCcccccccCCccccccccccCceeEeeccc Confidence 2222333322222222222 22678888877775443332223322211111 0001 0111 Q ss_pred hcC--CCcceecccceec-CCCceecccCCc-CCCCCCCceeEeeeeccCCCC Q lcl|NC_018856. 262 FLS--TRGAINLHGSTIM-ENDNILLEGRNP-EPNAPQAPASVVASIVDDKKG 310 (479) Q Consensus 262 ~~s--~~G~I~l~~s~~m-~~~~~L~e~~~~-~~~AP~~pa~v~at~~t~~~G 310 (479) |.. ..|...+..-.++ +.+.++.+.++. .-.+|...+ .++..+...| T Consensus 467 y~i~~~~g~~~~~~fd~~~n~~~f~~~~~~~g~i~~~~r~a--~~~~~p~~~~ 517 (517) T protein:vir:97 467 YVTNGSRGMEFEQGTILVENNKEYLFEMPISGSLEYKGTTA--YGTYTPPVAG 517 (517) T ss_pred cEEEeecceeeeeeeecccCceeEeeeeeeccccccccceE--EEEEcCCCCC Confidence 110 0111111111111 222334443331 112233222 2222222222 No 131 >protein:vir:107687 Length: 319 # NCBI annotation: hypothetical protein # Family: family:all:463 # MgeID: mge:1518 # MgeName: T1 # Cross-refs: genbank:acc:YP_003898;genbank:gi:45686314;genbank:GeneID:2773027 Probab=40.89 E-value=0.95 Score=20.70 Aligned_cols=272 Identities=13% Similarity=0.158 Sum_probs=124.5 Q ss_pred CCccchhhhhhhhcCCccchHHHHHHHHHhhhcCCCcChhhccCccccch---hhhhhhhhhheecc-ccccchhhcccc Q lcl|NC_018856. 1 MTELKKEAEAKNKKLPVEAEAELAELVSKSFTTGYGITPDTQLDGAAVRR---ELLEDQVKMLAFSS-NDFTIYPLINKQ 76 (479) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~e~~~Ks~tag~~~~p~~~~~gaalr~---esld~~~~~l~~~~-~~f~f~~~i~k~ 76 (479) |.. +.+. ++++.-+ .......+++-|+..+.+.+-. |-+|+.+....+.. .--.|+.-...- T Consensus 1 ~~~-----------~~~~-~~~~~~~--~~~~~~~~~~~da~~~~g~~~~~ql~~id~~v~e~~~~~l~~~~~i~v~~~~ 66 (319) T protein:vir:10 1 MTT-----------KKFD-EADKSNV--EMYLIQAGVKQDAAATMGIWTAQELHRIKSQSYEEDYPVGSALRVFPVTTEL 66 (319) T ss_pred CCC-----------cchh-HHhhHHH--HHHHhhccchhhhhhhhhhHHHHHHHHHHHHHHhhhhcceechhhcccccCC Confidence 332 2222 3333311 2222223466677777777876 45566665433322 111233222222 Q ss_pred chhHHHHhhhhhhccCcccccccccc-cccccccCcceEEEEEEEEeeeehhhhhhhHhh--hcchhhHHHHHHHHHHHH Q lcl|NC_018856. 77 QVNSTVAKYAVFNQHGRTGHSRFVRE-VGVASINDPNIRQKTVQMKFLSDTKQQSLAAGL--VNNIADPMTILTEDAIAV 153 (479) Q Consensus 77 ~~~stv~eY~~~~~~G~~g~~~fv~E-~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~l--vn~~~Dp~~~~~~~ai~~ 153 (479) .+-.....|..+.. .|....++. ....+..|.++.|++..+..++..+.++..--. ...-.+..+..-..|.+. T Consensus 67 ~~~~~~~~~~~~~~---~G~a~~~~d~~~dip~v~~~~~~~~~~i~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aA~~~ 143 (319) T protein:vir:10 67 SPTDKTFEYMTFDK---VGTAQIIADYTDDLPLVDALGTSEFGKVFRLGNAYLISIDEIKAGQATGRPLSTRKASACQLA 143 (319) T ss_pred CCceEEEEeeeecc---ccceeeecCccccccceeccceeeEEEEEEEEeeeeecHHHHHHHHHhCCChHHHHHHHHHHH Confidence 22222222333333 344445554 444578889999999999999999998763222 233346667778888888 Q ss_pred HHHHHHHHHhhcccccCCCCCCcccchhhhHHHhhccCCCEEEc-cCCCC---C-H---HHHhhhh---hhhhhccCceE Q lcl|NC_018856. 154 IAKSIEWAIFYGDAALSSEADGQAGIEFDGLHKLIDQDTNVIDL-KGARL---D-E---ATLNKAA---VIVGKGYGRAT 222 (479) Q Consensus 154 ~~~~iE~a~f~Gd~~l~~~~~~~~gleFDGl~~~I~~~~NviDa-rG~~l---~-~---~~l~~aa---~~i~~~fG~~t 222 (479) +.+.....+||||+.+. +.||.+- ++-..+-+ .+... + + +.|+++- ...++++-.++ T Consensus 144 ~~~~~n~i~f~G~~~~g----------~~GLlN~--p~~~~~~~~~~~~~~t~t~~~i~~di~~~~~~l~~~s~g~~~p~ 211 (319) T protein:vir:10 144 HDQLVNRLVFKGSAPHK----------IVSVFNH--PNITKITSGKWIDVSTMKPETAEAELTQAIETIETITRGQHRAT 211 (319) T ss_pred HHHhhceEEEeeccccc----------ceeEEeC--CCceeeecCCCCCccccCHHHHHHHHHHHHHHHHHhcCceeece Confidence 89999999999988753 4566543 22222222 22222 2 2 2344432 33466788999 Q ss_pred EEecChHHhhhHHHHhhC----cceeeeccCCCcceeeeehhhhcCCCc--------------ceecccceecCCC---- Q lcl|NC_018856. 223 DAFMPIGVQADFTNNLLD----RQRVIQPSTAGGFSTGFSINQFLSTRG--------------AINLHGSTIMEND---- 280 (479) Q Consensus 223 d~~mp~~vka~f~~~~~~----~qrv~~~~n~g~~~~G~~I~~~~s~~G--------------~I~l~~s~~m~~~---- 280 (479) .+.||+.....+..-+.. -..++..++.+ ..+ ..++.+.+..| .+.++=-..++.+ T Consensus 212 ~L~L~p~~~~~L~~~~~~~~~t~l~~lk~~~~~-l~I-~~~pel~~ag~~g~~~~v~y~~~~~~~~~~v~~~~~~~~~e~ 289 (319) T protein:vir:10 212 NILIPPSMRKVLAIRMPETTMSYLDYFKSQNSG-IEI-DSIAELEDIDGAGTKGVLVYEKNPMNMSIEIPEAFNMLPAQP 289 (319) T ss_pred EEEecHHHHHhhhcccCCCCeeHHHHHHHhcCC-ceE-EEeeeecccCCCcceEEEEEecCCceEEEecCcceeeeeeee Confidence 999999998877432211 00011111111 111 11122222111 1111100011100 Q ss_pred ---ceecc--cCCc--CCCCCCCceeEeeee Q lcl|NC_018856. 281 ---NILLE--GRNP--EPNAPQAPASVVASI 304 (479) Q Consensus 281 ---~~L~e--~~~~--~~~AP~~pa~v~at~ 304 (479) ++.++ .++. .-.-|.+-.- ...+ T Consensus 290 ~~l~~~~~~~~r~~Gv~i~~P~ai~~-~dGI 319 (319) T protein:vir:10 290 KDLHFKVPCTSKCTGLTIYRPMTIVL-ITGV 319 (319) T ss_pred cCceEEEeeeeeeEEEEEEccceeEe-eecC Confidence 00000 0000 0011221111 1111 No 132 >protein:vir:962 Length: 397 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:19 # MgeName: bIL285 # Cross-refs: genbank:acc:NP_076616;genbank:gi:13095724;genbank:GeneID:920264 Probab=38.28 E-value=1.1 Score=20.41 Aligned_cols=287 Identities=13% Similarity=0.119 Sum_probs=118.2 Q ss_pred CCccchhhhhhhhcCCccchHHHHHHHHHhh----hcCC--CcChhhccCccccchhhhhhhhhhheeccccccchhhcc Q lcl|NC_018856. 1 MTELKKEAEAKNKKLPVEAEAELAELVSKSF----TTGY--GITPDTQLDGAAVRRELLEDQVKMLAFSSNDFTIYPLIN 74 (479) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~e~~~Ks~----tag~--~~~p~~~~~gaalr~esld~~~~~l~~~~~~f~f~~~i~ 74 (479) ..+...+.+.+.............+. .+++ ...- ..+..+..+|+.+-.+.+...+..+. ..-.+.+.+. T Consensus 91 ~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~vp~~~~~~i~~~~---~~~~l~~~~~ 166 (397) T protein:vir:96 91 TDQKPKDGEKRKMKKFKVTEEELAEK-RSAINAFVKSKGAEKRDGFTSVEGGALIPQELLQPQLEPK---DIVDLSKYVR 166 (397) T ss_pred hhhhhHHHHHHHHHHHhhhhHHHHHH-HHHHHHHHHhhhhhhhhcccccccccchhHHHHHHHHHhh---hhhhHHHhhh Confidence 11111111111111111111111110 0111 1100 01112234456666666666665442 2223444444 Q ss_pred ccchhHHHHhhhhhhccCccccccccccccccc-ccCcceEEEEEEEEeeeehhhhhhhHhhhcchhhHHHHHHHHHHHH Q lcl|NC_018856. 75 KQQVNSTVAKYAVFNQHGRTGHSRFVREVGVAS-INDPNIRQKTVQMKFLSDTKQQSLAAGLVNNIADPMTILTEDAIAV 153 (479) Q Consensus 75 k~~~~stv~eY~~~~~~G~~g~~~fv~E~g~~~-~~d~~~~r~~~~~k~l~~~~~vs~~~~lvn~~~Dp~~~~~~~ai~~ 153 (479) ..++.+.-.+|.....+ .+...+++|++... .+++.+.+....++=++.--.+|.-+ +.++..|.+....+.--.. T Consensus 167 ~~~~~~~~~~~~~~~~~--~~~~~~~~E~~~~~~~~~~~~~~i~~~~~~~~~~~~~s~el-l~ds~~~l~~~i~~~l~~~ 243 (397) T protein:vir:96 167 SVPVNSASGKFPVISKS--GSKMATVQQLEKNPQLANPKMVEIDYSVATRRGYIPISQEM-IDDASYDVTGLIADEIQDQ 243 (397) T ss_pred hccccccceeEEEEecc--CCccccccccccccccccccccceeecHhHhhcchhhHHHH-HhhhHHHHHHHHHHHHHHH Confidence 44444433344433332 23456788888665 78999999999988777655555532 2344556666677777788 Q ss_pred HHHHHHHHHhhcccccCCCCCCcccchhhhHHHhhccCCCEEEccCCCCCHHHHhhhhhhhhhccCceEEEecChHHhhh Q lcl|NC_018856. 154 IAKSIEWAIFYGDAALSSEADGQAGIEFDGLHKLIDQDTNVIDLKGARLDEATLNKAAVIVGKGYGRATDAFMPIGVQAD 233 (479) Q Consensus 154 ~~~~iE~a~f~Gd~~l~~~~~~~~gleFDGl~~~I~~~~NviDarG~~l~~~~l~~aa~~i~~~fG~~td~~mp~~vka~ 233 (479) +..+++.+++.|+..-.+.. .+-+|+|.++|.. ....+++ + -.+|++.+... T Consensus 244 ~~~~~~~~i~~g~g~~~~~~----~~~~d~~~~~~~~----------------------~~~~~~~-a-~~v~n~~~~~~ 295 (397) T protein:vir:96 244 SLNTKNADIAAVLKTATAKS----VVGVDGLKDLINK----------------------EIKKVYD-V-KLFISASMYSE 295 (397) T ss_pred HHHHHHHHHhhccccccccc----ccchHHHHHHHHH----------------------hhhhhcC-c-EEEEcHHHHHH Confidence 88899999999987654421 2446666555431 1111222 2 37899988887 Q ss_pred HHHHhhCcceeeeccCCCcceeeeehhhhcCCCcceecccceecCCCceecccCCcCCCCCCCceeEeeeeccCCCCCCC Q lcl|NC_018856. 234 FTNNLLDRQRVIQPSTAGGFSTGFSINQFLSTRGAINLHGSTIMENDNILLEGRNPEPNAPQAPASVVASIVDDKKGGFR 313 (479) Q Consensus 234 f~~~~~~~qrv~~~~n~g~~~~G~~I~~~~s~~G~I~l~~s~~m~~~~~L~e~~~~~~~AP~~pa~v~at~~t~~~G~f~ 313 (479) +...-...-|++...+..+.. +.+++..+-...+...+...++. ... . T Consensus 296 l~~lkd~~G~~~~~~~~~~~~------------------~~~l~G~pv~~~~~~~~~~~~~~-~~~-----~-------- 343 (397) T protein:vir:96 296 LDKLKDKNGRYLLQDSITAAS------------------GKQLLGKEVVVLDDDVIGKSVGN-VVG-----F-------- 343 (397) T ss_pred HHHhhccCCCeEeccCccCCC------------------cccccccceEEecccccCCCCCc-eEE-----E-------- Confidence 765433322444322222110 11222222111111111000000 000 0 Q ss_pred cccccceEEEEEEEcCcCcc-ccccc------------eeeeeecCCceEEEEEEec Q lcl|NC_018856. 314 DEDIKTHSYKVVVHSDDAES-LPSEA------------VTAAVAKKDNTVKLEVKLA 357 (479) Q Consensus 314 ~~d~gty~YkVtavn~~GES-~pS~~------------vt~Tv~~~g~sv~ltIT~~ 357 (479) .|.++.-+...++.|-+ ..+.. ....+..+..-+.+++|.+ T Consensus 344 ---~gd~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~d~~~~~~~a~~~~~~~~a 397 (397) T protein:vir:96 344 ---IGDAKAFASFFDRKQVSVSWVDNNIYGQLLAGIIRYDVKATDKKAGFYVTFTIG 397 (397) T ss_pred ---EeehhcceEeEeecceEEEEecccccceeEEEEEEEccEEecccceEEEEeecC Confidence 01111000011111100 00000 0011112222333444433 No 133 >protein:vir:100172 Length: 394 # NCBI annotation: putative major head protein # Family: family:all:21 # MgeID: mge:1524 # MgeName: phi AT3 # Cross-refs: genbank:acc:YP_025031;genbank:gi:48697264;genbank:GeneID:2948270 Probab=34.18 E-value=1.3 Score=19.94 Aligned_cols=315 Identities=10% Similarity=0.101 Sum_probs=126.7 Q ss_pred CCccchhhhhhhhcCCc------cch-HHHHHHHHHhh-h-cCCCcChhhccCccccchhhhhhhhhhheeccccccchh Q lcl|NC_018856. 1 MTELKKEAEAKNKKLPV------EAE-AELAELVSKSF-T-TGYGITPDTQLDGAAVRRELLEDQVKMLAFSSNDFTIYP 71 (479) Q Consensus 1 ~~~~~~~~~~~~~~~~~------~~~-~~~~e~~~Ks~-t-ag~~~~p~~~~~gaalr~esld~~~~~l~~~~~~f~f~~ 71 (479) .....+....+...... ... +.+.+ +.|.. . .-......+-++|+.|-++.+...|..+..... .+.+ T Consensus 67 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~l~~~~~~~~~~~~~~t~~~gg~~vP~~~~~~ii~~~~~~~--~l~~ 143 (394) T protein:vir:10 67 NSDPDKPVDNAQPNGTDLKKKPIDAKKKAIND-FIHSHGKVIDNAAGHVTSTEAGVLIPEEIIYDPTAEVNSVV--DLST 143 (394) T ss_pred hcchhhhhhhhcccccchhhhHHHHHHHHHHH-HHhccchhhhhhhcccccccCceeccHHHHHHHHHHHHhhh--hhhh Confidence 00000000000000000 000 11111 11111 0 000011123345677778777777644433332 3455 Q ss_pred hccccchhHHHHhhhhhhccCccccccccccccccc-ccCcceEEEEEEEEeeeehhhhhhhHhhhcchhhHHHHHHHHH Q lcl|NC_018856. 72 LINKQQVNSTVAKYAVFNQHGRTGHSRFVREVGVAS-INDPNIRQKTVQMKFLSDTKQQSLAAGLVNNIADPMTILTEDA 150 (479) Q Consensus 72 ~i~k~~~~stv~eY~~~~~~G~~g~~~fv~E~g~~~-~~d~~~~r~~~~~k~l~~~~~vs~~~~lvn~~~Dp~~~~~~~a 150 (479) .+...++.+.--+|.+.. .+.+...+++|++... .+++.+.+....++=++.-..+|.-+ +.++..|.+....+.- T Consensus 144 ~~~~~~~~~~~~~~~~~~--~~~~~~~~~~E~~~~~~~~~~~~~~v~l~~~k~~~~~~iS~el-l~ds~~~l~~~i~~~l 220 (394) T protein:vir:10 144 LVTKTPVTTPKGTYPILK--RATDRFSSVAELAENPALAEPEFEQVDWSVSTYRGAIPLSEEA-IADSAVDLTSLVGQSI 220 (394) T ss_pred hceeeeccCCceEEEEEe--cCCCccccccccccccccccccceeEEeeeeeeEeeehhHHHH-HhhhhHHHHHHHHHHH Confidence 555555554433444322 2334567899987755 68899999999999888776677653 3345557777788888 Q ss_pred HHHHHHHHHHHHhhcccccCCCCCCcccchhhhHHHhhccCCCEEEccCCCCCHHHHhhhhhhhhhccCceEEEecChHH Q lcl|NC_018856. 151 IAVIAKSIEWAIFYGDAALSSEADGQAGIEFDGLHKLIDQDTNVIDLKGARLDEATLNKAAVIVGKGYGRATDAFMPIGV 230 (479) Q Consensus 151 i~~~~~~iE~a~f~Gd~~l~~~~~~~~gleFDGl~~~I~~~~NviDarG~~l~~~~l~~aa~~i~~~fG~~td~~mp~~v 230 (479) ...++..++.++..|+..-.+... ....-+|-|...+ ......+|. .-++|+..+ T Consensus 221 a~~~~~~~~~~il~g~g~~~~~~~-~~~~~~d~l~~~~----------------------~~~~~~~~~--a~~vmn~~~ 275 (394) T protein:vir:10 221 NEKSVNTYNAMIAPVLQSFTAKAT-TTDTLVDSLKHIL----------------------NVDLDPAYS--RALVVTQSL 275 (394) T ss_pred HHHHHHHHHHHHhhcccccccccc-cccccHHHHHHHH----------------------Hhhhhhhcc--CEEEecHHH Confidence 888999999999999876443211 1112222222222 112222332 248899888 Q ss_pred hhhHHHHhhCcceeeeccCCCcceeeeehhhhcCCCcceecccceecCCCceecccCCcCCCCCCCceeEeeeeccCCCC Q lcl|NC_018856. 231 QADFTNNLLDRQRVIQPSTAGGFSTGFSINQFLSTRGAINLHGSTIMENDNILLEGRNPEPNAPQAPASVVASIVDDKKG 310 (479) Q Consensus 231 ka~f~~~~~~~qrv~~~~n~g~~~~G~~I~~~~s~~G~I~l~~s~~m~~~~~L~e~~~~~~~AP~~pa~v~at~~t~~~G 310 (479) ...+...-...-|.+...+......| -.+.+++..+-...+...+ |. ..| T Consensus 276 ~~~l~~lkd~~G~~i~~~~~~~~~~~--------------~~~~~L~G~PV~~~~~~~~----~~------------~~~ 325 (394) T protein:vir:10 276 FNTLDTLKDKNGRYLLHDASDSITDG--------------TAKGTVLGVPVYVVGDALL----GS------------AAG 325 (394) T ss_pred HHHHHHhhccCCCeeeeccccccccC--------------CcccccccceeEEeccccc----CC------------CCC Confidence 77776543333344433222211110 0111233333222211000 00 000 Q ss_pred CCCcccccceEEEEEEEcCcCccccccceeeeeecCCceEEEEEEecCCCCcccceEEEEEecC---CCcceEEEEeeee Q lcl|NC_018856. 311 GFRDEDIKTHSYKVVVHSDDAESLPSEAVTAAVAKKDNTVKLEVKLASLYQAQPQFISVYREGT---ETGHYFLIARVPV 387 (479) Q Consensus 311 ~f~~~d~gty~YkVtavn~~GES~pS~~vt~Tv~~~g~sv~ltIT~~~~~~a~~~~y~IYR~~~---~~G~y~li~rv~v 387 (479) . .+--.|.++.-+..+...+-+. ... ......--++.++|-.- ....+.++...+. T Consensus 326 ~-~~i~~gd~s~~~~~~~~~~~~v------------------~~~--~~~~~~~~~~~~~r~d~~~~~~~ai~~~~~~~~ 384 (394) T protein:vir:10 326 D-QKAFVGDLKRGVLFADRQQVTL------------------AWE--DSKIYGRYLGAAFRFGVKQADSNAGYFVTNTDA 384 (394) T ss_pred c-eEEEEeeccccEEEEeecceEE------------------EEe--cccccceeEEEEEEeccEEeccccEEEEEeecc Confidence 0 0001112221111122211110 000 00000000122222110 0011122211111 Q ss_pred eeecCCceEEEeec Q lcl|NC_018856. 388 SKVNDQGVIEVLDR 401 (479) Q Consensus 388 s~~n~~g~T~ftD~ 401 (479) . .|.|.++.+ T Consensus 385 ~----~~~~~~~~~ 394 (394) T protein:vir:10 385 A----SGSTSGTGK 394 (394) T ss_pred c----CCCCCCCCC Confidence 1 133333333 No 134 >protein:vir:104342 Length: 314 # NCBI annotation: hypothetical protein # Family: family:all:463 # MgeID: mge:1593 # MgeName: RTP # Cross-refs: genbank:acc:YP_398971;genbank:gi:81343955;genbank:GeneID:3778874 Probab=31.15 E-value=1 Score=20.58 Aligned_cols=296 Identities=10% Similarity=0.087 Sum_probs=123.6 Q ss_pred hcCCccchHHHHHHHHHhhhcCCCcChhhc-cCccccch--hhhhhhhhhheeccccccchhhccccchhHHHHhhh--- Q lcl|NC_018856. 13 KKLPVEAEAELAELVSKSFTTGYGITPDTQ-LDGAAVRR--ELLEDQVKMLAFSSNDFTIYPLINKQQVNSTVAKYA--- 86 (479) Q Consensus 13 ~~~~~~~~~~~~e~~~Ks~tag~~~~p~~~-~~gaalr~--esld~~~~~l~~~~~~f~f~~~i~k~~~~stv~eY~--- 86 (479) ..|+++. ++...-.-.++.+ -.+. ++|+-|-+ |.+|+++....+- +++.-+.|+ +.+.+.+|. T Consensus 1 ~~~~~~~--~~~~~~~~~~~~~----~~~~d~~~~fl~~ql~~id~~v~e~~~~--~~~~~~~i~---v~~~~~~~~et~ 69 (314) T protein:vir:10 1 MAIKFDA--EQAKITTHLEQMG----VEKADAAGIWAVSQLTAALNRAYEKEYA--ENSVVNIFP---VTNEIPGHAKYF 69 (314) T ss_pred CccchHH--HHHHHHHHHHhhc----ccchhhhHHHHHHHHHHHHHHHhhhhcc--ccccceeec---cccCCCCceeEE Confidence 4455542 2221111112222 2222 23444444 4666666532221 223333332 222222221 Q ss_pred hhhccCcccccccccc-cccccccCcceEEEEEEEEeeeehhhhhhhHhh--hcchhhHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_018856. 87 VFNQHGRTGHSRFVRE-VGVASINDPNIRQKTVQMKFLSDTKQQSLAAGL--VNNIADPMTILTEDAIAVIAKSIEWAIF 163 (479) Q Consensus 87 ~~~~~G~~g~~~fv~E-~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~l--vn~~~Dp~~~~~~~ai~~~~~~iE~a~f 163 (479) .+......|....++. ....+..|.++.|++..+..++..+.++..--. ...-.+..+.....|.+.+.+.+-..+| T Consensus 70 ~~~~~e~~G~a~~~~d~~~dip~vd~~~~~~~~~i~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aA~~~~~~~~n~i~f 149 (314) T protein:vir:10 70 EYPEFDGVGIAQIIADYSDDLPLVDAFMTEKQGKVFRFGNAFLISTDEIKAGAATGQSLSARKQALAFEAHDNLLDKLVW 149 (314) T ss_pred EeeeeccccceeeeCCcccccceeecccceeEEEEEEEEeeEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEE Confidence 2223345555555564 444678899999999999999999999754222 2333466677888888888999999999 Q ss_pred hcccccCCCCCCcccchhhhHHHhhccCCCEEEccCCCCC----HHHHhhhh---hhhhhccCceEEEecChHHhhhHHH Q lcl|NC_018856. 164 YGDAALSSEADGQAGIEFDGLHKLIDQDTNVIDLKGARLD----EATLNKAA---VIVGKGYGRATDAFMPIGVQADFTN 236 (479) Q Consensus 164 ~Gd~~l~~~~~~~~gleFDGl~~~I~~~~NviDarG~~l~----~~~l~~aa---~~i~~~fG~~td~~mp~~vka~f~~ 236 (479) ||++.+. +-||++.=+ -...-+.+.--+ .+.|+++- ...++++-.|+.+.||+.-.+.+. T Consensus 150 ~G~~~~g----------~~GLlN~p~--v~~~~~~~~WaT~~ei~~Di~~~~~~l~~~s~g~~~p~~l~Lpp~~~~~L~- 216 (314) T protein:vir:10 150 SGSAPHG----------IVSVFDQPN--INNVVATPNWSVPQNAIDDVTAMIDAVESSTQGLHHVTDILLPASARRVMQ- 216 (314) T ss_pred eeccccc----------ceeEeecCC--CccccCCCCcccHHHHHHHHHHHHHHHHHhcCccccceeEEecHHHHHhhc- Confidence 9988753 456654211 000001111112 22344432 444567788999999988655442 Q ss_pred HhhCcceeeeccCCCcceeeeehhhhcCCC-cceecccceecCCCceecccCC-cCCCCCCCceeEeeeeccCCCCCCCc Q lcl|NC_018856. 237 NLLDRQRVIQPSTAGGFSTGFSINQFLSTR-GAINLHGSTIMENDNILLEGRN-PEPNAPQAPASVVASIVDDKKGGFRD 314 (479) Q Consensus 237 ~~~~~qrv~~~~n~g~~~~G~~I~~~~s~~-G~I~l~~s~~m~~~~~L~e~~~-~~~~AP~~pa~v~at~~t~~~G~f~~ 314 (479) |.. + + .|..+.++..-. -++++.+--++....--...++ .-.+.|..- -.+ .+-+ =.+.+ T Consensus 217 ------~~~-~-~-----~~~tvl~~l~~n~~~l~I~~~~el~~ag~~g~~~~v~y~~~~~~~--~~~--vp~~-~~~l~ 278 (314) T protein:vir:10 217 ------GLV-P-Q-----TNLSYGELFTRNNPGLTIRFLQFLDNYDGAGGKAALAFEKSPLNM--SIE--IPEV-TNVLP 278 (314) T ss_pred ------ccc-c-C-----CCccHHHHHHHhCCCcEEEEcccccccCCCcceEEEEEecCCcEE--EEe--cCcc-ceeec Confidence 111 1 1 123333332211 0111111111111100000000 000000000 000 0000 00001 Q ss_pred ccccceEEEEEEEcCcCccccccceeeeeecCCceEEEEEEec Q lcl|NC_018856. 315 EDIKTHSYKVVVHSDDAESLPSEAVTAAVAKKDNTVKLEVKLA 357 (479) Q Consensus 315 ~d~gty~YkVtavn~~GES~pS~~vt~Tv~~~g~sv~ltIT~~ 357 (479) .......|++-...+-|--.-- .+.++...++ ||.+ T Consensus 279 ~e~~~~~~~~~~~~r~~Gv~i~--~P~ai~~~dG-----I~~~ 314 (314) T protein:vir:10 279 AQPKDLHFRYPVTSKATGLIVY--RPLTMAVIKG-----ITFA 314 (314) T ss_pred ceecCceEEEcceeeeEEEEEE--CcceeEeeee-----eecC Confidence 1111234554443333210000 0111221122 5655 No 135 >protein:vir:93742 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1475 # MgeName: 55 # Cross-refs: genbank:acc:YP_240459;genbank:gi:66396126;genbank:GeneID:5133511 Probab=20.58 E-value=2.7 Score=18.18 Aligned_cols=261 Identities=12% Similarity=0.073 Sum_probs=112.1 Q ss_pred hhcCCCcChhhccCccccchhhhhhhhhhheeccccccc------hhhccccchhHHHHhhhhhhccCcccccccccccc Q lcl|NC_018856. 31 FTTGYGITPDTQLDGAAVRRELLEDQVKMLAFSSNDFTI------YPLINKQQVNSTVAKYAVFNQHGRTGHSRFVREVG 104 (479) Q Consensus 31 ~tag~~~~p~~~~~gaalr~esld~~~~~l~~~~~~f~f------~~~i~k~~~~stv~eY~~~~~~G~~g~~~fv~E~g 104 (479) |-.+ .|. -+.-+.+|.+.+.+. ..-.+...| ..++..++ -.||+ +..|+..|....+.|++ T Consensus 1 ma~~-~T~-----~~~~iiPev~~~~v~--~~~~~~~~~~~~~~~~~~l~g~~-G~tv~----ip~~~~~g~~~~~~eg~ 67 (274) T protein:vir:93 1 MPQG-ITK-----TSNQIIPEVLAPMMQ--AQLEKKLRFASFAEVDSTLQGQP-GDTLT----FPAFVYSGDAQVVAEGE 67 (274) T ss_pred CCcc-cee-----hhheechHHHHHHHH--HHHHhhhhhcccccccccccCCC-CCEEE----EEeeccCCCcccccCCC Confidence 1100 000 011223333333221 111111111 11122111 22332 33455566666778999 Q ss_pred cccccCcceEEEEEEEEeeeehhhhhhhHhhhcchhhHHHHHHHHHHHHHHHHHHHHHhhcccccCCCCCCcccchhhhH Q lcl|NC_018856. 105 VASINDPNIRQKTVQMKFLSDTKQQSLAAGLVNNIADPMTILTEDAIAVIAKSIEWAIFYGDAALSSEADGQAGIEFDGL 184 (479) Q Consensus 105 ~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lvn~~~Dp~~~~~~~ai~~~~~~iE~a~f~Gd~~l~~~~~~~~gleFDGl 184 (479) .....+.+......+++...-...+++...++ +..||+....+.....++..++..++=. + T Consensus 68 ~i~~~~it~~~~~~~i~~~~~~~~i~D~~~~~-~~~d~~~~~~~~~~~~~a~~~d~~~~~~---~--------------- 128 (274) T protein:vir:93 68 KIPTDILETKKREAKIRKIAKGTSITDEALLS-GYGDPQGEQVRQHGLAHANKVDNDVLEA---L--------------- 128 (274) T ss_pred cccccccccceeEEEeeeecccccccHHHHHh-hccchHHHHHHHHHHHHHHHHHHHHHHH---H--------------- Confidence 89999999999999999999899999986655 4679998888888888888888766521 1 Q ss_pred HHhhccCCCEEEccCCCCCHHHHhhhhhhhhhccCceEEEecChHHhhhHHHHhhCcceeeeccCCCccee-eeehhhhc Q lcl|NC_018856. 185 HKLIDQDTNVIDLKGARLDEATLNKAAVIVGKGYGRATDAFMPIGVQADFTNNLLDRQRVIQPSTAGGFST-GFSINQFL 263 (479) Q Consensus 185 ~~~I~~~~NviDarG~~l~~~~l~~aa~~i~~~fG~~td~~mp~~vka~f~~~~~~~qrv~~~~n~g~~~~-G~~I~~~~ 263 (479) .... .+..+..++.+.+.+|....+..-...+-++||+.+.+.+-.+ +..+++.....|...+ .-.|..++ T Consensus 129 ----~~a~--~~~~~~~~~~d~i~dA~~~l~d~~~~~~~ivv~p~~~~~L~k~--~~~~f~~~s~~g~~~~~~G~ig~~~ 200 (274) T protein:vir:93 129 ----MGAK--LTVNADITKLNGLQSAIDKFNDEDLEPMVLFINPLDAGKLRGD--ASTNFTRATELGDDIIVKGAFGEAL 200 (274) T ss_pred ----hccc--ccccccccCHHHHHHHHHHhhhccCCccEEEeCHHHHHHHHhh--hhhcccccccccccceeecccceec Confidence 1111 1122334566667777666666656777899999999888432 2223433222221100 00111111 Q ss_pred CCCcceecccceecCCCce-----e-cccCCcCCCCCCCceeEeeeeccCCCCCCCcccccceEEEEEEEcCcCcccccc Q lcl|NC_018856. 264 STRGAINLHGSTIMENDNI-----L-LEGRNPEPNAPQAPASVVASIVDDKKGGFRDEDIKTHSYKVVVHSDDAESLPSE 337 (479) Q Consensus 264 s~~G~I~l~~s~~m~~~~~-----L-~e~~~~~~~AP~~pa~v~at~~t~~~G~f~~~d~gty~YkVtavn~~GES~pS~ 337 (479) | =.+++++.-+ + ..+.+.--..+. +. + .+--+ ..++...-.+.+.|-+..++.. T Consensus 201 ---G-----~~Vi~s~~~p~~t~~l~~~gai~~~~~~~-~~-v--E~~Rd-~~~~~d~i~~~~~y~~~~~~~~------- 260 (274) T protein:vir:93 201 ---G-----AIIVRTNKLEAGTAILAKKGAVKLILKRD-FF-L--EVARD-ASTKTTALYSDKHYVAYLYDES------- 260 (274) T ss_pred ---C-----eeEEEcCCCCcceEEEEeCCeEEEEecCC-cc-c--ccccc-hhhcccEEEEEEEEEEEEEcCC------- Confidence 1 0111111100 0 000000000000 00 0 00000 0000111111223333333221 Q ss_pred ceeeeeecCCceEEEEEEecCCCC Q lcl|NC_018856. 338 AVTAAVAKKDNTVKLEVKLASLYQ 361 (479) Q Consensus 338 ~vt~Tv~~~g~sv~ltIT~~~~~~ 361 (479) ..++++..-..+.. T Consensus 261 ----------~~v~~t~~~~s~~~ 274 (274) T protein:vir:93 261 ----------KAVKITKGSGSLEM 274 (274) T ss_pred ----------ceEEEeeCccccCC Confidence 11122211111111 Done!