Query lcl|NC_019448.1_cdsid_YP_007002215.1 [gene=F360_gp092] [protein=putative capsid protein] [protein_id=YP_007002215.1] [location=51358..52749] Match_columns 463 No_of_seqs 41 out of 44 Neff 4.9 Searched_HMMs 1612 Date Thu Nov 7 16:40:47 2013 Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_92 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_92_vs_rec_db.hhr No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 protein:vir:95603 Length: 463 100.0 1E-214 8E-218 1193.3 35.9 463 1-463 1-463 (463) 2 protein:vir:99311 Length: 463 100.0 1E-214 8E-218 1193.3 35.9 463 1-463 1-463 (463) 3 protein:vir:96666 Length: 462 100.0 8E-210 5E-213 1167.0 35.5 462 1-462 1-462 (462) 4 protein:vir:80835 Length: 464 100.0 9E-203 5E-206 1128.4 34.4 458 1-463 1-459 (464) 5 protein:vir:63741 Length: 468 100.0 6E-197 4E-200 1096.2 34.8 461 1-463 1-462 (468) 6 protein:vir:80491 Length: 467 100.0 2E-195 1E-198 1088.4 34.8 460 1-463 1-461 (467) 7 protein:vir:100851 Length: 514 100.0 1E-194 8E-198 1083.5 32.4 462 1-463 1-508 (514) 8 protein:vir:102823 Length: 470 100.0 7E-159 4E-162 887.7 29.6 433 1-463 1-469 (470) 9 protein:vir:8843 Length: 317 # 99.6 2.5E-17 1.6E-20 111.6 11.8 293 1-336 1-317 (317) 10 protein:vir:94933 Length: 330 99.2 5.1E-13 3.2E-16 88.0 14.2 320 1-461 5-330 (330) 11 protein:vir:97255 Length: 310 98.9 1.7E-10 1.1E-13 74.2 15.3 283 3-341 1-310 (310) 12 protein:vir:96392 Length: 324 98.3 5.1E-07 3.2E-10 55.1 19.1 315 1-360 1-324 (324) 13 protein:vir:78830 Length: 324 98.3 5.1E-07 3.2E-10 55.1 19.1 315 1-360 1-324 (324) 14 protein:vir:96223 Length: 324 98.1 3.4E-06 2.1E-09 50.6 19.4 315 1-360 1-324 (324) 15 protein:vir:103955 Length: 324 98.0 4.4E-06 2.7E-09 49.9 18.4 313 1-360 1-324 (324) 16 protein:vir:9309 Length: 324 # 98.0 7.5E-06 4.6E-09 48.7 20.1 312 7-360 1-324 (324) 17 protein:vir:104388 Length: 566 98.0 1.5E-06 9E-10 52.6 14.7 266 184-463 1-394 (566) 18 protein:vir:97148 Length: 324 97.9 1.3E-05 8.3E-09 47.3 20.0 314 1-360 1-324 (324) 19 protein:vir:99749 Length: 324 97.8 1.5E-05 9.5E-09 47.0 19.3 311 1-360 1-324 (324) 20 protein:vir:105905 Length: 304 97.8 1.1E-05 6.8E-09 47.8 17.6 296 26-338 1-304 (304) 21 protein:vir:94142 Length: 304 97.8 1.1E-05 6.8E-09 47.8 17.6 296 26-338 1-304 (304) 22 protein:vir:8102 Length: 543 # 97.8 1.8E-05 1.1E-08 46.6 18.6 318 1-351 212-543 (543) 23 protein:vir:7771 Length: 330 # 97.8 6.6E-06 4.1E-09 49.0 15.7 304 18-346 1-330 (330) 24 protein:vir:93631 Length: 580 97.8 1.9E-06 1.2E-09 52.0 12.6 237 202-463 1-278 (580) 25 protein:vir:4953 Length: 397 # 97.8 1.1E-05 6.9E-09 47.7 16.5 291 1-339 77-397 (397) 26 protein:vir:191 Length: 385 # 97.7 1.4E-05 8.7E-09 47.2 17.0 302 1-340 68-385 (385) 27 protein:vir:1886 Length: 385 # 97.7 1.4E-05 8.7E-09 47.2 17.0 302 1-340 68-385 (385) 28 protein:vir:97053 Length: 390 97.7 2.9E-05 1.8E-08 45.4 17.9 303 1-337 71-390 (390) 29 protein:vir:4830 Length: 397 # 97.6 2.3E-05 1.4E-08 46.1 16.0 304 1-347 77-397 (397) 30 protein:vir:95318 Length: 328 97.6 4.7E-06 2.9E-09 49.8 12.2 234 1-292 1-328 (328) 31 protein:vir:4339 Length: 395 # 97.6 4.3E-05 2.7E-08 44.5 17.5 305 1-350 72-395 (395) 32 protein:vir:100135 Length: 418 97.6 4.6E-05 2.8E-08 44.4 19.1 311 1-354 87-418 (418) 33 protein:vir:2344 Length: 397 # 97.6 4.6E-05 2.9E-08 44.4 18.0 322 16-402 1-397 (397) 34 protein:vir:78523 Length: 338 97.6 1.6E-05 1E-08 46.9 14.8 302 15-339 1-338 (338) 35 protein:vir:95763 Length: 297 97.5 5.8E-05 3.6E-08 43.8 17.2 290 1-352 1-297 (297) 36 protein:vir:5120 Length: 615 # 97.5 1.4E-05 8.7E-09 47.2 13.4 246 206-463 1-371 (615) 37 protein:vir:8187 Length: 311 # 97.4 8.4E-05 5.2E-08 42.9 16.5 299 26-386 1-311 (311) 38 protein:vir:104085 Length: 320 97.3 4.1E-05 2.6E-08 44.6 14.4 290 1-330 1-320 (320) 39 protein:vir:3306 Length: 567 # 97.3 1.4E-05 8.5E-09 47.2 11.5 288 135-463 1-395 (567) 40 protein:vir:10145 Length: 567 97.3 1.4E-05 8.5E-09 47.2 11.5 288 135-463 1-395 (567) 41 protein:vir:9979 Length: 567 # 97.3 1.4E-05 8.5E-09 47.2 11.5 288 135-463 1-395 (567) 42 protein:vir:2792 Length: 567 # 97.3 1.4E-05 8.5E-09 47.2 11.5 288 135-463 1-395 (567) 43 protein:vir:827 Length: 567 # 97.2 2.5E-05 1.5E-08 45.8 12.6 298 135-463 1-395 (567) 44 protein:vir:4226 Length: 326 # 97.1 5.8E-05 3.6E-08 43.8 13.4 295 3-340 1-326 (326) 45 protein:vir:1328 Length: 392 # 97.1 6.3E-05 3.9E-08 43.6 13.6 302 1-336 74-392 (392) 46 protein:vir:4856 Length: 293 # 97.1 0.00016 9.7E-08 41.4 15.3 272 22-343 1-293 (293) 47 protein:vir:41 Length: 299 # N 97.0 9.1E-05 5.6E-08 42.7 13.9 269 30-321 1-299 (299) 48 protein:vir:103759 Length: 330 97.0 4.9E-06 3.1E-09 49.7 6.7 275 1-318 1-330 (330) 49 protein:vir:78223 Length: 333 97.0 0.00012 7.5E-08 42.1 14.0 295 15-337 1-333 (333) 50 protein:vir:107826 Length: 331 97.0 5.7E-06 3.6E-09 49.3 6.7 291 1-361 1-331 (331) 51 protein:vir:98525 Length: 331 97.0 5.7E-06 3.6E-09 49.3 6.7 291 1-361 1-331 (331) 52 protein:vir:107388 Length: 331 97.0 5.7E-06 3.6E-09 49.3 6.7 291 1-361 1-331 (331) 53 protein:vir:2430 Length: 318 # 97.0 0.00012 7.6E-08 42.0 14.0 293 1-342 1-318 (318) 54 protein:vir:94673 Length: 419 96.9 0.00026 1.6E-07 40.2 17.2 310 1-341 74-419 (419) 55 protein:vir:10364 Length: 390 96.9 0.00028 1.7E-07 40.1 20.3 302 1-337 69-390 (390) 56 protein:vir:105563 Length: 396 96.9 2.7E-05 1.7E-08 45.6 9.7 241 159-463 1-291 (396) 57 protein:vir:81070 Length: 390 96.9 0.00029 1.8E-07 40.0 18.5 304 1-348 68-390 (390) 58 protein:vir:2504 Length: 305 # 96.8 0.00015 9.3E-08 41.5 13.1 278 32-338 1-305 (305) 59 protein:vir:9574 Length: 300 # 96.7 0.00038 2.3E-07 39.4 18.3 288 26-370 1-300 (300) 60 protein:vir:80684 Length: 315 96.7 0.00037 2.3E-07 39.4 14.8 291 26-343 1-315 (315) 61 protein:vir:4997 Length: 397 # 96.6 0.0005 3.1E-07 38.7 18.2 304 1-371 77-397 (397) 62 protein:vir:9410 Length: 415 # 96.2 0.00083 5.1E-07 37.5 15.0 294 1-308 71-415 (415) 63 protein:vir:1433 Length: 435 # 96.2 0.00086 5.4E-07 37.4 17.0 314 1-341 82-435 (435) 64 protein:vir:3991 Length: 404 # 96.1 0.001 6.2E-07 37.0 17.0 300 1-343 80-404 (404) 65 protein:vir:1638 Length: 298 # 96.1 0.001 6.3E-07 37.0 17.7 284 35-367 1-298 (298) 66 protein:vir:102119 Length: 404 96.1 0.001 6.5E-07 36.9 15.1 309 1-338 67-404 (404) 67 protein:vir:81160 Length: 371 96.1 0.0011 6.6E-07 36.9 16.0 291 1-351 61-371 (371) 68 protein:vir:99920 Length: 311 96.1 0.00085 5.3E-07 37.4 13.2 281 26-336 1-311 (311) 69 protein:vir:98339 Length: 415 95.9 0.0012 7.5E-07 36.6 16.8 313 1-347 73-415 (415) 70 protein:vir:79987 Length: 415 95.9 0.0012 7.5E-07 36.6 16.8 313 1-347 73-415 (415) 71 protein:vir:81100 Length: 415 95.9 0.0012 7.5E-07 36.6 16.8 313 1-347 73-415 (415) 72 protein:vir:80376 Length: 435 95.9 0.0013 7.8E-07 36.5 16.9 309 1-341 85-435 (435) 73 protein:vir:105038 Length: 428 95.9 0.0013 7.9E-07 36.5 14.3 312 1-339 74-428 (428) 74 protein:vir:9759 Length: 303 # 95.9 0.0013 7.9E-07 36.4 17.6 293 30-368 1-303 (303) 75 protein:vir:81227 Length: 413 95.8 0.00053 3.3E-07 38.5 11.2 308 1-338 67-413 (413) 76 protein:vir:95376 Length: 425 95.8 0.00061 3.8E-07 38.2 11.3 310 1-343 91-425 (425) 77 protein:vir:7324 Length: 335 # 95.7 0.00016 9.8E-08 41.4 8.0 264 1-319 1-335 (335) 78 protein:vir:4700 Length: 415 # 95.5 0.002 1.3E-06 35.4 16.2 313 1-347 71-415 (415) 79 protein:vir:4600 Length: 415 # 95.5 0.002 1.3E-06 35.4 16.2 313 1-347 71-415 (415) 80 protein:vir:4197 Length: 314 # 95.4 0.0021 1.3E-06 35.3 15.4 297 19-342 1-314 (314) 81 protein:vir:94771 Length: 298 95.3 0.0023 1.4E-06 35.0 17.8 282 37-367 1-298 (298) 82 protein:vir:1025 Length: 408 # 95.2 0.0026 1.6E-06 34.7 16.8 304 1-352 80-408 (408) 83 protein:vir:5739 Length: 366 # 94.9 0.0031 1.9E-06 34.3 15.5 314 1-339 19-366 (366) 84 protein:vir:103370 Length: 418 94.8 0.00011 7E-08 42.2 4.2 321 1-357 11-418 (418) 85 protein:vir:96442 Length: 418 94.7 0.00018 1.1E-07 41.1 5.2 310 1-328 11-418 (418) 86 protein:vir:3158 Length: 321 # 94.6 0.0041 2.5E-06 33.7 15.3 299 13-358 1-321 (321) 87 protein:vir:3845 Length: 395 # 94.4 0.0045 2.8E-06 33.4 15.6 301 1-343 74-395 (395) 88 protein:vir:7409 Length: 408 # 94.1 0.0053 3.3E-06 33.1 16.5 303 1-339 80-408 (408) 89 protein:vir:3870 Length: 400 # 93.9 0.0061 3.8E-06 32.7 14.6 285 1-351 82-400 (400) 90 protein:vir:485 Length: 407 # 93.8 0.0063 3.9E-06 32.7 16.8 302 1-327 69-407 (407) 91 protein:vir:101650 Length: 497 93.7 0.0066 4.1E-06 32.5 14.6 278 1-303 114-497 (497) 92 protein:vir:7855 Length: 497 # 93.7 0.0066 4.1E-06 32.5 14.6 278 1-303 114-497 (497) 93 protein:vir:96762 Length: 632 93.6 0.007 4.3E-06 32.4 13.6 293 1-334 313-632 (632) 94 protein:vir:4456 Length: 401 # 91.5 0.015 9.6E-06 30.5 13.3 295 1-335 68-401 (401) 95 protein:vir:6242 Length: 390 # 90.9 0.018 1.1E-05 30.1 15.0 301 1-338 74-390 (390) 96 protein:vir:100247 Length: 425 90.7 0.019 1.2E-05 30.0 18.8 306 1-368 104-425 (425) 97 protein:vir:80128 Length: 466 87.6 0.038 2.3E-05 28.4 10.3 288 1-336 95-466 (466) 98 protein:vir:4511 Length: 409 # 87.4 0.039 2.4E-05 28.3 21.0 311 1-355 75-409 (409) 99 protein:vir:6212 Length: 434 # 86.6 0.045 2.8E-05 28.0 14.8 304 1-323 85-434 (434) 100 protein:vir:8420 Length: 477 # 85.8 0.05 3.1E-05 27.7 11.0 306 1-331 93-477 (477) 101 protein:vir:107687 Length: 319 85.0 0.056 3.5E-05 27.4 11.1 280 15-310 1-319 (319) 102 protein:vir:1268 Length: 397 # 84.9 0.057 3.5E-05 27.4 18.7 292 1-350 81-397 (397) 103 protein:vir:100172 Length: 394 82.6 0.075 4.7E-05 26.7 11.3 298 1-343 67-394 (394) 104 protein:vir:104256 Length: 458 82.5 0.076 4.7E-05 26.7 17.7 291 1-339 124-458 (458) 105 protein:vir:9820 Length: 272 # 82.2 0.079 4.9E-05 26.6 13.7 260 1-342 1-272 (272) 106 protein:vir:3033 Length: 272 # 82.2 0.079 4.9E-05 26.6 13.7 260 1-342 1-272 (272) 107 protein:vir:102873 Length: 392 82.1 0.08 4.9E-05 26.6 18.1 296 1-359 64-392 (392) 108 protein:vir:105004 Length: 392 82.1 0.08 4.9E-05 26.6 18.1 296 1-359 64-392 (392) 109 protein:vir:107593 Length: 392 82.1 0.08 4.9E-05 26.6 18.1 296 1-359 64-392 (392) 110 protein:vir:102082 Length: 392 82.1 0.08 4.9E-05 26.6 18.1 296 1-359 64-392 (392) 111 protein:vir:93616 Length: 645 81.8 0.083 5.1E-05 26.5 12.7 305 1-342 278-645 (645) 112 protein:vir:80068 Length: 301 80.8 0.091 5.7E-05 26.3 10.8 261 31-310 1-301 (301) 113 protein:vir:97397 Length: 517 80.5 0.094 5.8E-05 26.2 11.7 278 1-302 200-517 (517) 114 protein:vir:9361 Length: 402 # 80.1 0.098 6.1E-05 26.1 10.2 289 1-329 96-402 (402) 115 protein:vir:95963 Length: 395 80.1 0.028 1.7E-05 29.1 5.6 284 1-308 41-395 (395) 116 protein:vir:93881 Length: 387 78.9 0.11 6.8E-05 25.8 11.6 293 1-329 75-387 (387) 117 protein:vir:4159 Length: 315 # 78.8 0.11 6.9E-05 25.8 18.0 294 1-349 1-315 (315) 118 protein:vir:101607 Length: 379 77.9 0.12 7.5E-05 25.6 19.0 287 1-352 75-379 (379) 119 protein:vir:107423 Length: 681 77.7 0.12 7.6E-05 25.6 12.0 271 124-463 1-311 (681) 120 protein:vir:98487 Length: 681 77.7 0.12 7.6E-05 25.6 12.0 271 124-463 1-311 (681) 121 protein:vir:107802 Length: 681 77.7 0.12 7.6E-05 25.6 12.0 271 124-463 1-311 (681) 122 protein:vir:96978 Length: 387 77.3 0.13 7.8E-05 25.5 9.8 289 1-329 78-387 (387) 123 protein:vir:94424 Length: 387 77.3 0.13 7.8E-05 25.5 9.8 289 1-329 78-387 (387) 124 protein:vir:2685 Length: 387 # 77.3 0.13 7.8E-05 25.5 9.8 289 1-329 78-387 (387) 125 protein:vir:5255 Length: 304 # 74.7 0.068 4.2E-05 27.0 6.1 273 38-359 1-304 (304) 126 protein:vir:9643 Length: 377 # 74.4 0.16 9.8E-05 25.0 13.2 293 1-337 22-377 (377) 127 protein:vir:1084 Length: 437 # 74.0 0.16 0.0001 24.9 15.8 298 1-360 119-437 (437) 128 protein:vir:4092 Length: 390 # 68.8 0.23 0.00014 24.1 12.3 312 1-343 47-390 (390) 129 protein:vir:98635 Length: 377 67.2 0.26 0.00016 23.8 11.8 306 1-387 61-377 (377) 130 protein:vir:104342 Length: 314 66.5 0.27 0.00017 23.7 10.4 280 1-316 1-314 (314) 131 protein:vir:103285 Length: 296 65.4 0.28 0.00018 23.6 11.1 260 32-316 1-296 (296) 132 protein:vir:100884 Length: 389 64.5 0.3 0.00019 23.5 17.7 293 1-357 71-389 (389) 133 protein:vir:9704 Length: 394 # 60.3 0.37 0.00023 22.9 12.4 280 1-329 78-394 (394) 134 protein:vir:102944 Length: 330 55.7 0.47 0.00029 22.4 14.3 295 1-380 1-330 (330) 135 protein:vir:78640 Length: 352 50.6 0.61 0.00038 21.8 12.8 289 1-329 43-352 (352) 136 protein:vir:79642 Length: 329 33.7 1.3 0.00083 19.9 11.0 293 3-313 1-329 (329) 137 protein:vir:962 Length: 397 # 30.5 1.6 0.00098 19.5 15.2 283 1-350 89-397 (397) 138 protein:vir:96833 Length: 275 22.3 2.5 0.0015 18.4 16.5 261 1-355 1-275 (275) 139 protein:vir:105334 Length: 276 20.0 2.8 0.0018 18.1 15.9 266 1-358 1-276 (276) No 1 >protein:vir:95603 Length: 463 # NCBI annotation: ORF016 # Family: family:all:2450 # MgeID: mge:1577 # MgeName: G1 # Cross-refs: genbank:acc:YP_240903;genbank:gi:66394965;genbank:GeneID:5132544 Probab=100.00 E-value=1.2e-214 Score=1193.29 Aligned_cols=463 Identities=100% Similarity=1.390 Sum_probs=461.1 Q ss_pred CCCCCccchHHHHhhhhhhHHHHHHhhcCCccCCccccCccccchhhhhhHhhhhhccccccchhhhcccchhhHHHhhh Q lcl|NC_019448. 1 MTIEKNLSDVQQKYADQFQEDVVKSFQTGYGITPDTQIDAGALRREILDDQITMLTWTNEDLIFYRDISRRPAQSTVVKY 80 (463) Q Consensus 1 ~~~~~~~~~~~~~~~k~~~e~~~Ks~~agy~~~p~~q~~gaalr~esLd~~i~~L~~~~~df~f~~~i~k~~~~stv~ey 80 (463) ||++.|+++.++.++++|+|+++|||+|||+++|++|++|+||||||||++|++|+|+++||+|||+|+|||++|||+|| T Consensus 1 ~~~~~~~~~~~~~~~~~~~e~~~KS~~tg~g~~p~~q~~~~AlR~EsL~~~i~~Lt~~~~~f~~~~~i~k~~a~STV~~y 80 (463) T protein:vir:95 1 MTIEKNLSDVQQKYADQFQEDVVKSFQTGYGITPDTQIDAGALRREILDDQITMLTWTNEDLIFYRDISRRPAQSTVVKY 80 (463) T ss_pred CCcccccchHHHHHHhhhhHHHHHHhhcCCccCCccccCcchhhhhhhhhhhheeeecccchhhhhhcCCchhhhhhhhh Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hhhhccCcccccccccccCcccccCcceEEEEEEEEEeechhhhhhhhhhhcccccHHHHHHHHHHHHHHHHHHHHHhhc Q lcl|NC_019448. 81 DQYLRHGNVGHSRFVKEIGVAPVSDPNIRQKTVSMKYVSDTKNMSIASGLVNNIADPSQILTEDAIAVVAKTIEWASFYG 160 (463) Q Consensus 81 ~~~~~hG~~g~~~fv~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lvn~~~Dp~~~~~~~ai~~~~~~~E~a~fyG 160 (463) +++++||++||++|++|+|+++++||+|+||+++||||+++++||+++++||+++||+++|+++||+++||+|||+|||| T Consensus 81 ~~~~~~G~~g~~~f~~E~g~~~~~d~~~~Rr~~~~K~l~~~~~VS~~~~l~n~~~d~~~~~~~dai~~ia~tiE~a~FyG 160 (463) T protein:vir:95 81 DQYLRHGNVGHSRFVKEIGVAPVSDPNIRQKTVSMKYVSDTKNMSIASGLVNNIADPSQILTEDAIAVVAKTIEWASFYG 160 (463) T ss_pred eeeeccCccccccccccccccccCCCceEEEEEEeeeeehhhhhhhHHHhhcccccHHHHHHHHHHHHHHHHHHHHHhhh Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccccCCCccccccccccceeeecCcceEeccCCCCCHHHHhhhhhhhhhcCCceeEEecCHHHHHHHHHHhcCcceEEee Q lcl|NC_019448. 161 DASLTSEVEGEGLEFDGLAKLIDKNNVINAKGNQLTEKHLNEAAVRIGKGFGTATDAYMPIGVHADFVNSILGRQMQLMQ 240 (463) Q Consensus 161 d~~l~~~~~~~gleFDGl~~lI~~~nviDarG~~ls~~~ln~aa~~i~~~~G~~td~~m~~~vka~f~~~~~~~qrv~~~ 240 (463) |++|+|++++||||||||.+||||||||||||++||+++||+|+++|+++||+|||+|||+++|++|+|+++++|||||+ T Consensus 161 ds~l~~~~~~~gleFDGl~~lId~enviDarG~~Ls~~~ln~Aa~~i~~~fGt~TD~~lp~~vka~f~~~~l~~qrv~~~ 240 (463) T protein:vir:95 161 DASLTSEVEGEGLEFDGLAKLIDKNNVINAKGNQLTEKHLNEAAVRIGKGFGTATDAYMPIGVHADFVNSILGRQMQLMQ 240 (463) T ss_pred hhccCCCcCccccchhhhhhhcCCCCeeecCCCcccHHHHhhhhhhhhcccCChhheecchHHHHHHHHHhcCceEEEEc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cCCCCcccceecCeeeecccccccCCceeccCccccccccccCCCCCCCCeeEEEEeccCCCcCcccccccceEEEEEEE Q lcl|NC_019448. 241 DNSGNVNTGYSVNGFYSSRGFIKLHGSTVMENELILDESLQPLPNAPQPAKVTATVETKQKGAFEDEEDRAGLSYKVVVN 320 (463) Q Consensus 241 ~n~g~~~~G~~v~~~~s~~G~i~l~~s~~~~~d~~l~~~~~~~p~ap~p~~vtat~~~~~~g~~~~~~~~a~ysYkV~a~ 320 (463) +|+|++++|++|++|++++|+|+||||++|++|++++++++.+|+||+||++++|++++.+|.|+++++.++|||+|++| T Consensus 241 ~N~~~~~~G~~v~~f~s~~G~I~L~~s~~m~~~~il~~~~~~~p~ap~~~~~tatv~~~~~~~~~~~~~~a~~~Y~vv~~ 320 (463) T protein:vir:95 241 DNSGNVNTGYSVNGFYSSRGFIKLHGSTVMENELILDESLQPLPNAPQPAKVTATVETKQKGAFENEEDRAGLSYKVVVN 320 (463) T ss_pred CCCCceeeeeeccceeeeeeeeeeCCceecCCcccccchhhcCCCCccCceeEEEEeeccCCCCCCcccccceEEEEEEE Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ecCCccccccceeeeecCCCCceEEEEEecCCCCCCcceEEEEeecCCCceEEEEEEeeeeeecCCceEEEEeccCCCCC Q lcl|NC_019448. 321 SDDAQSAPSEEVTATVSNVDDGVKLSISVNAMYQQQPQFVSIYRQGKETGMYFLIKRVPVKDAQEDGTIVFVDKNETLPE 400 (463) Q Consensus 321 s~~geS~~S~~vt~Tva~~~~gv~ltIt~~a~~g~~~~~y~IYR~~~~~g~~~li~rv~~s~~n~~gtttf~D~N~~iPg 400 (463) |++|||+||++|++|++++++|++|+||++++++.+|+|++|||+++++|+|++|+|||++++|+||||+|+|+|+|||| T Consensus 321 s~~geS~pS~ivtaT~a~~~~gv~l~It~~a~~~~~~~~v~IYR~~~~~g~~~~i~rv~v~~an~~gttt~~D~n~~IPg 400 (463) T protein:vir:95 321 SDDAQSAPSEEVTATVSNVDDGVKLSINVNAMYQQQPQFVSIYRQGKETGMYFLIKRVPVKDAQEDGTIVFVDKNETLPE 400 (463) T ss_pred CCCCCcccchheeeeeeeccceEEEEEEecCCcccceeEEEEEeecCCCCcceeEEEEEecccCCCceEEEeecccccCC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CccceecCCchHHHHhhhhcchhhcCCcccCCcceeeeeeechhheecceeeEEEEEEeEecC Q lcl|NC_019448. 401 TADVFVGEMSPQVVHLFELLPMMKLPLAQINASITFAVLWYGALALRAPKKWARIKNVRYIAV 463 (463) Q Consensus 401 t~~~fvGe~~pqvi~l~ellPm~k~pla~~na~~~~~V~~Yg~L~l~aPkk~~~ikNV~~~~~ 463 (463) |+++|||||||+||+|+|||||||||||++|++++|||+|||+|+|+|||||+|||||+|||| T Consensus 401 t~~vfVgems~~ti~~~ellPm~klpLA~~~~~~~waVl~YGaLal~~Pk~~~~ikNv~~~~v 463 (463) T protein:vir:95 401 TADVFVGEMSPQVVHLFELLPMMKLPLAQINASITFAVLWYGALALRAPKKWARIKNVRYIAV 463 (463) T ss_pred ceeEeeeccCchhhhhHhhhHhhhCCchhccchhhhHHHHhhHHHhhccccceEEEEeeEecC Confidence 999999999999999999999999999999999999999999999999999999999999999 No 2 >protein:vir:99311 Length: 463 # NCBI annotation: putative capsid protein # Family: family:all:2450 # MgeID: mge:1655 # MgeName: K # Cross-refs: genbank:acc:YP_024474;genbank:gi:48696433;genbank:GeneID:2948039 Probab=100.00 E-value=1.2e-214 Score=1193.29 Aligned_cols=463 Identities=100% Similarity=1.390 Sum_probs=461.1 Q ss_pred CCCCCccchHHHHhhhhhhHHHHHHhhcCCccCCccccCccccchhhhhhHhhhhhccccccchhhhcccchhhHHHhhh Q lcl|NC_019448. 1 MTIEKNLSDVQQKYADQFQEDVVKSFQTGYGITPDTQIDAGALRREILDDQITMLTWTNEDLIFYRDISRRPAQSTVVKY 80 (463) Q Consensus 1 ~~~~~~~~~~~~~~~k~~~e~~~Ks~~agy~~~p~~q~~gaalr~esLd~~i~~L~~~~~df~f~~~i~k~~~~stv~ey 80 (463) ||++.|+++.++.++++|+|+++|||+|||+++|++|++|+||||||||++|++|+|+++||+|||+|+|||++|||+|| T Consensus 1 ~~~~~~~~~~~~~~~~~~~e~~~KS~~tg~g~~p~~q~~~~AlR~EsL~~~i~~Lt~~~~~f~~~~~i~k~~a~STV~~y 80 (463) T protein:vir:99 1 MTIEKNLSDVQQKYADQFQEDVVKSFQTGYGITPDTQIDAGALRREILDDQITMLTWTNEDLIFYRDISRRPAQSTVVKY 80 (463) T ss_pred CCcccccchHHHHHHhhhhHHHHHHhhcCCccCCccccCcchhhhhhhhhhhheeeecccchhhhhhcCCchhhhhhhhh Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hhhhccCcccccccccccCcccccCcceEEEEEEEEEeechhhhhhhhhhhcccccHHHHHHHHHHHHHHHHHHHHHhhc Q lcl|NC_019448. 81 DQYLRHGNVGHSRFVKEIGVAPVSDPNIRQKTVSMKYVSDTKNMSIASGLVNNIADPSQILTEDAIAVVAKTIEWASFYG 160 (463) Q Consensus 81 ~~~~~hG~~g~~~fv~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lvn~~~Dp~~~~~~~ai~~~~~~~E~a~fyG 160 (463) +++++||++||++|++|+|+++++||+|+||+++||||+++++||+++++||+++||+++|+++||+++||+|||+|||| T Consensus 81 ~~~~~~G~~g~~~f~~E~g~~~~~d~~~~Rr~~~~K~l~~~~~VS~~~~l~n~~~d~~~~~~~dai~~ia~tiE~a~FyG 160 (463) T protein:vir:99 81 DQYLRHGNVGHSRFVKEIGVAPVSDPNIRQKTVSMKYVSDTKNMSIASGLVNNIADPSQILTEDAIAVVAKTIEWASFYG 160 (463) T ss_pred eeeeccCccccccccccccccccCCCceEEEEEEeeeeehhhhhhhHHHhhcccccHHHHHHHHHHHHHHHHHHHHHhhh Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccccCCCccccccccccceeeecCcceEeccCCCCCHHHHhhhhhhhhhcCCceeEEecCHHHHHHHHHHhcCcceEEee Q lcl|NC_019448. 161 DASLTSEVEGEGLEFDGLAKLIDKNNVINAKGNQLTEKHLNEAAVRIGKGFGTATDAYMPIGVHADFVNSILGRQMQLMQ 240 (463) Q Consensus 161 d~~l~~~~~~~gleFDGl~~lI~~~nviDarG~~ls~~~ln~aa~~i~~~~G~~td~~m~~~vka~f~~~~~~~qrv~~~ 240 (463) |++|+|++++||||||||.+||||||||||||++||+++||+|+++|+++||+|||+|||+++|++|+|+++++|||||+ T Consensus 161 ds~l~~~~~~~gleFDGl~~lId~enviDarG~~Ls~~~ln~Aa~~i~~~fGt~TD~~lp~~vka~f~~~~l~~qrv~~~ 240 (463) T protein:vir:99 161 DASLTSEVEGEGLEFDGLAKLIDKNNVINAKGNQLTEKHLNEAAVRIGKGFGTATDAYMPIGVHADFVNSILGRQMQLMQ 240 (463) T ss_pred hhccCCCcCccccchhhhhhhcCCCCeeecCCCcccHHHHhhhhhhhhcccCChhheecchHHHHHHHHHhcCceEEEEc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cCCCCcccceecCeeeecccccccCCceeccCccccccccccCCCCCCCCeeEEEEeccCCCcCcccccccceEEEEEEE Q lcl|NC_019448. 241 DNSGNVNTGYSVNGFYSSRGFIKLHGSTVMENELILDESLQPLPNAPQPAKVTATVETKQKGAFEDEEDRAGLSYKVVVN 320 (463) Q Consensus 241 ~n~g~~~~G~~v~~~~s~~G~i~l~~s~~~~~d~~l~~~~~~~p~ap~p~~vtat~~~~~~g~~~~~~~~a~ysYkV~a~ 320 (463) +|+|++++|++|++|++++|+|+||||++|++|++++++++.+|+||+||++++|++++.+|.|+++++.++|||+|++| T Consensus 241 ~N~~~~~~G~~v~~f~s~~G~I~L~~s~~m~~~~il~~~~~~~p~ap~~~~~tatv~~~~~~~~~~~~~~a~~~Y~vv~~ 320 (463) T protein:vir:99 241 DNSGNVNTGYSVNGFYSSRGFIKLHGSTVMENELILDESLQPLPNAPQPAKVTATVETKQKGAFENEEDRAGLSYKVVVN 320 (463) T ss_pred CCCCceeeeeeccceeeeeeeeeeCCceecCCcccccchhhcCCCCccCceeEEEEeeccCCCCCCcccccceEEEEEEE Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ecCCccccccceeeeecCCCCceEEEEEecCCCCCCcceEEEEeecCCCceEEEEEEeeeeeecCCceEEEEeccCCCCC Q lcl|NC_019448. 321 SDDAQSAPSEEVTATVSNVDDGVKLSISVNAMYQQQPQFVSIYRQGKETGMYFLIKRVPVKDAQEDGTIVFVDKNETLPE 400 (463) Q Consensus 321 s~~geS~~S~~vt~Tva~~~~gv~ltIt~~a~~g~~~~~y~IYR~~~~~g~~~li~rv~~s~~n~~gtttf~D~N~~iPg 400 (463) |++|||+||++|++|++++++|++|+||++++++.+|+|++|||+++++|+|++|+|||++++|+||||+|+|+|+|||| T Consensus 321 s~~geS~pS~ivtaT~a~~~~gv~l~It~~a~~~~~~~~v~IYR~~~~~g~~~~i~rv~v~~an~~gttt~~D~n~~IPg 400 (463) T protein:vir:99 321 SDDAQSAPSEEVTATVSNVDDGVKLSINVNAMYQQQPQFVSIYRQGKETGMYFLIKRVPVKDAQEDGTIVFVDKNETLPE 400 (463) T ss_pred CCCCCcccchheeeeeeeccceEEEEEEecCCcccceeEEEEEeecCCCCcceeEEEEEecccCCCceEEEeecccccCC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CccceecCCchHHHHhhhhcchhhcCCcccCCcceeeeeeechhheecceeeEEEEEEeEecC Q lcl|NC_019448. 401 TADVFVGEMSPQVVHLFELLPMMKLPLAQINASITFAVLWYGALALRAPKKWARIKNVRYIAV 463 (463) Q Consensus 401 t~~~fvGe~~pqvi~l~ellPm~k~pla~~na~~~~~V~~Yg~L~l~aPkk~~~ikNV~~~~~ 463 (463) |+++|||||||+||+|+|||||||||||++|++++|||+|||+|+|+|||||+|||||+|||| T Consensus 401 t~~vfVgems~~ti~~~ellPm~klpLA~~~~~~~waVl~YGaLal~~Pk~~~~ikNv~~~~v 463 (463) T protein:vir:99 401 TADVFVGEMSPQVVHLFELLPMMKLPLAQINASITFAVLWYGALALRAPKKWARIKNVRYIAV 463 (463) T ss_pred ceeEeeeccCchhhhhHhhhHhhhCCchhccchhhhHHHHhhHHHhhccccceEEEEeeEecC Confidence 999999999999999999999999999999999999999999999999999999999999999 No 3 >protein:vir:96666 Length: 462 # NCBI annotation: ORF016 # Family: family:all:2450 # MgeID: mge:1623 # MgeName: Twort # Cross-refs: genbank:acc:YP_238545;genbank:gi:66391271;genbank:GeneID:5130448 Probab=100.00 E-value=7.9e-210 Score=1166.95 Aligned_cols=462 Identities=82% Similarity=1.227 Sum_probs=459.7 Q ss_pred CCCCCccchHHHHhhhhhhHHHHHHhhcCCccCCccccCccccchhhhhhHhhhhhccccccchhhhcccchhhHHHhhh Q lcl|NC_019448. 1 MTIEKNLSDVQQKYADQFQEDVVKSFQTGYGITPDTQIDAGALRREILDDQITMLTWTNEDLIFYRDISRRPAQSTVVKY 80 (463) Q Consensus 1 ~~~~~~~~~~~~~~~k~~~e~~~Ks~~agy~~~p~~q~~gaalr~esLd~~i~~L~~~~~df~f~~~i~k~~~~stv~ey 80 (463) |||..|.+.++...+++++|+++|||+|||+|+|++|++|+||||||||++|++|+|+++||+|||+|+|||++|||+|| T Consensus 1 ~~~~~~~~~~~~~~~~~~~e~~~KS~~tg~g~~p~~q~~~gAlR~esL~~~i~~Lt~~~~~~~~~~~i~k~~a~sTv~~y 80 (462) T protein:vir:96 1 MHKDTNLTAEQNKYADKFQEEVMKSYQTGYGITPDTQVDAGALRREILDDQITMLTWTQDDLIFYREISRRPAQSTVQKY 80 (462) T ss_pred CccccccchhhhhhhchhhHHHHHHHhcCCCcCCccccccchhhhhhhhhhhheeeecccchhhhhhcCCchhhhhhhhh Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hhhhccCcccccccccccCcccccCcceEEEEEEEEEeechhhhhhhhhhhcccccHHHHHHHHHHHHHHHHHHHHHhhc Q lcl|NC_019448. 81 DQYLRHGNVGHSRFVKEIGVAPVSDPNIRQKTVSMKYVSDTKNMSIASGLVNNIADPSQILTEDAIAVVAKTIEWASFYG 160 (463) Q Consensus 81 ~~~~~hG~~g~~~fv~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lvn~~~Dp~~~~~~~ai~~~~~~~E~a~fyG 160 (463) +++++||++||++|++|+|+++++||+|+||+++||||++++++|++++|||+++||+++|++|||.++||+|||+|||| T Consensus 81 ~~~~~~G~~g~~~f~~E~g~~~~~d~~~~R~~~~~k~l~~t~~vsi~~tl~n~~~d~~~~~~~dai~~~a~tiE~a~Fyg 160 (462) T protein:vir:96 81 DVYLRHGNVGHSRFVREVGVAPVSDPNIRQKTVEMKYVSDTKNLSIASTLVNNIQDPMQILTEDAIAVVAKTIEWASFYG 160 (462) T ss_pred eeeeccCccccccccccccccccCCCceEEEEEEEEEEeeeeeechhhhhccchhhHHHHHHHHHHHHHHHHHHHHHhhh Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccccCCCccccccccccceeeecCcceEeccCCCCCHHHHhhhhhhhhhcCCceeEEecCHHHHHHHHHHhcCcceEEee Q lcl|NC_019448. 161 DASLTSEVEGEGLEFDGLAKLIDKNNVINAKGNQLTEKHLNEAAVRIGKGFGTATDAYMPIGVHADFVNSILGRQMQLMQ 240 (463) Q Consensus 161 d~~l~~~~~~~gleFDGl~~lI~~~nviDarG~~ls~~~ln~aa~~i~~~~G~~td~~m~~~vka~f~~~~~~~qrv~~~ 240 (463) |++|+|++.++|||||||.+|||++|||||||++||+++||+||++|+++||+|||+|||+++|++|+|+++++|||||+ T Consensus 161 ds~l~~~~~~~gleFDGl~~lI~~~NViDarG~~Ls~~~ln~aa~~i~~~fGt~TD~~~p~~v~a~f~~~~l~~qrv~~~ 240 (462) T protein:vir:96 161 DASLTADPTGQGLEFDGLAKLIDKDNVIDAKGESLTETLLNRSAVLIGKSFGTATDAYMPIGVHADFVNSVLGRQMQLMQ 240 (462) T ss_pred hcccCCCccccccchhhhhhhcCCCceeecCCCCccHHHHhhhhhhcccccCChhheecchHHHHHHHHhhcCceEEEEc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cCCCCcccceecCeeeecccccccCCceeccCccccccccccCCCCCCCCeeEEEEeccCCCcCcccccccceEEEEEEE Q lcl|NC_019448. 241 DNSGNVNTGYSVNGFYSSRGFIKLHGSTVMENELILDESLQPLPNAPQPAKVTATVETKQKGAFEDEEDRAGLSYKVVVN 320 (463) Q Consensus 241 ~n~g~~~~G~~v~~~~s~~G~i~l~~s~~~~~d~~l~~~~~~~p~ap~p~~vtat~~~~~~g~~~~~~~~a~ysYkV~a~ 320 (463) +|+|++++|++|++|+|++|+|+||||++|++|+++|+++...|++|+|++|+||+++..+|.|++.++.+.|+|+|++| T Consensus 241 ~n~g~~~~G~~v~~f~s~~G~I~L~~s~~m~~~~i~~~~~~~~p~ap~~~~vsaTv~t~~~g~f~~~~d~~~y~Y~V~av 320 (462) T protein:vir:96 241 DNSGNVNAGYNVQGFYSSRGFIKLHGSTVMENELILDESLQPLPNAPQPATVKATVETGKKGLFTDEHDRAELTYKVVVN 320 (462) T ss_pred CCCCceeeeeeccceeeeeeeeeeCCceecCcccccccccccCCCCCCCCceeEEEEeCCCCCCCCccCceeEEEEEEEE Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ecCCccccccceeeeecCCCCceEEEEEecCCCCCCcceEEEEeecCCCceEEEEEEeeeeeecCCceEEEEeccCCCCC Q lcl|NC_019448. 321 SDDAQSAPSEEVTATVSNVDDGVKLSISVNAMYQQQPQFVSIYRQGKETGMYFLIKRVPVKDAQEDGTIVFVDKNETLPE 400 (463) Q Consensus 321 s~~geS~~S~~vt~Tva~~~~gv~ltIt~~a~~g~~~~~y~IYR~~~~~g~~~li~rv~~s~~n~~gtttf~D~N~~iPg 400 (463) |++|||+||++|++|++++++|++|+|+|+++++++|+||+|||+++++|.|+||+|||++++|++|+++|+|+|++||| T Consensus 321 s~dgeS~PS~~VtaTva~~~~gv~ltIt~~a~~~~~~~~~~IYRk~~~sg~y~li~rv~~~~~n~~gt~tf~D~n~~iPg 400 (462) T protein:vir:96 321 SDDAQSAPSEAVTATVNNATDGVKLEISVNAMYQQQPQFVSIYRQGRKTGDFYLIKRLGMKEVNDEGKLVFYDLNETIPE 400 (462) T ss_pred CCCCccccceeeEeeeecccccceEEEEEcCCccccceEEEEEeecCCccccceeeeeeceeecCCcceeEeeccCCCCC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CccceecCCchHHHHhhhhcchhhcCCcccCCcceeeeeeechhheecceeeEEEEEEeEec Q lcl|NC_019448. 401 TADVFVGEMSPQVVHLFELLPMMKLPLAQINASITFAVLWYGALALRAPKKWARIKNVRYIA 462 (463) Q Consensus 401 t~~~fvGe~~pqvi~l~ellPm~k~pla~~na~~~~~V~~Yg~L~l~aPkk~~~ikNV~~~~ 462 (463) |+++|||||+||+|+|+|||||||||||++|++++|||+|||+|+|+|||||+|||||+||- T Consensus 401 t~~~fVge~~p~vi~~~qllpm~~~plA~~n~~~~waVl~yG~Lal~~Pk~~~~ikNv~~~~ 462 (462) T protein:vir:96 401 TTDVFVGEMSPQVLHLFELLPMMKLPLAQINASVTFAVLWYGALALRAPKKWVRIKNVKYIV 462 (462) T ss_pred cccceeecCCchhhhhhhhhhhhhcCcccccchhhhhhhhhhHHHhhcccccEEEEEEEEeC Confidence 99999999999999999999999999999999999999999999999999999999999998 No 4 >protein:vir:80835 Length: 464 # NCBI annotation: putative major capsid protein # Family: family:all:2450 # MgeID: mge:1885 # MgeName: phiEF24C # Cross-refs: genbank:acc:YP_001504125;genbank:gi:158079312;genbank:GeneID:5666484 Probab=100.00 E-value=8.6e-203 Score=1128.38 Aligned_cols=458 Identities=66% Similarity=1.060 Sum_probs=448.8 Q ss_pred CCCCCccchHHHHhhhhhhHHHHHHhhcCCccCCccccCccccchhhhhhHhhhhhccccccchhhhcccchhhHHHhhh Q lcl|NC_019448. 1 MTIEKNLSDVQQKYADQFQEDVVKSFQTGYGITPDTQIDAGALRREILDDQITMLTWTNEDLIFYRDISRRPAQSTVVKY 80 (463) Q Consensus 1 ~~~~~~~~~~~~~~~k~~~e~~~Ks~~agy~~~p~~q~~gaalr~esLd~~i~~L~~~~~df~f~~~i~k~~~~stv~ey 80 (463) |+.+.| .+.-+++++|+++|||+|||+++|++|++|+||||||||++|++|+|+++||+|||+|+|||++|||+|| T Consensus 1 ~~~~~n----~~~~~~~~~e~~~Ks~ttgy~~~p~~q~~~~AlRrEsL~~~i~~Lt~~~~~f~f~~di~k~~a~STV~~y 76 (464) T protein:vir:80 1 MTEKKN----TERQLTSVQEEVIKGFTTGYGITPESQTDAAALRREFLDDQITMLTWADGDLSFYRDITKRPATSTVAKY 76 (464) T ss_pred CCcchh----hHhhcCcccHHHHHHHHhCCccCcccccCcchhhhhhhhhhhheeeecccchhhhhhcCCchhhhhhhhh Confidence 888777 6777899999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hhhhccCcccccccccccCcccccCcceEEEEEEEEEeechhhhhhhhhhhcccccHHHHHHHHHHHHHHHHHHHHHhhc Q lcl|NC_019448. 81 DQYLRHGNVGHSRFVKEIGVAPVSDPNIRQKTVSMKYVSDTKNMSIASGLVNNIADPSQILTEDAIAVVAKTIEWASFYG 160 (463) Q Consensus 81 ~~~~~hG~~g~~~fv~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lvn~~~Dp~~~~~~~ai~~~~~~~E~a~fyG 160 (463) +++++||++||++|++|+|+++++||+|+||+++||||+++|++|++++||||++||+++|++|||+++||+|||+|||| T Consensus 77 ~~~~~~G~~g~~~f~~E~g~~~~~d~~~~Rr~~~~Kfl~~~r~vsia~~lvn~~~d~~~~~~~dai~~va~tiE~a~FyG 156 (464) T protein:vir:80 77 DVYLAHGRVGHTRFTREIGVAPISDPNLRQKTVNMKYVSDTKNMSIATGLVNNIEDPMRILTDDAISVVAKTIEWASFYG 156 (464) T ss_pred heeeccCccccccccccccccccCCCceEEEEEEeeeeecceeeeeehhhhcchhhHHHHHHHHHHHHHHHHHHHHHhhh Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccccCCC-ccccccccccceeeecCcceEeccCCCCCHHHHhhhhhhhhhcCCceeEEecCHHHHHHHHHHhcCcceEEe Q lcl|NC_019448. 161 DASLTSE-VEGEGLEFDGLAKLIDKNNVINAKGNQLTEKHLNEAAVRIGKGFGTATDAYMPIGVHADFVNSILGRQMQLM 239 (463) Q Consensus 161 d~~l~~~-~~~~gleFDGl~~lI~~~nviDarG~~ls~~~ln~aa~~i~~~~G~~td~~m~~~vka~f~~~~~~~qrv~~ 239 (463) |++|+|+ +.++|||||||.+|||++|||||||++||+++||+|+++|+++||+|||+|||+++|++|++++|++||+++ T Consensus 157 ds~l~~~~~~~~gleFDGl~~lI~~~NViDarG~~Ls~~~ln~Aa~~i~~~fGt~TD~~lp~~v~a~f~n~~l~~q~~~~ 236 (464) T protein:vir:80 157 DSDLSENPDAGSGLEFDGLAKLIDKHNVLDAKGASLTEALLNQASVLVGKGYGTPTDAYMPIGVQADFVNQQLDRQVQVI 236 (464) T ss_pred ccccCCCCCCccccchhhhHhhcCCCceeecCCCCcCHHHHhhhhhhhhcccCChhhcccchhHHHHHHhhhcCceeEEE Confidence 9999997 678999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ecCCCCcccceecCeeeecccccccCCceeccCccccccccccCCCCCCCCeeEEEEeccCCCcCcccccccceEEEEEE Q lcl|NC_019448. 240 QDNSGNVNTGYSVNGFYSSRGFIKLHGSTVMENELILDESLQPLPNAPQPAKVTATVETKQKGAFEDEEDRAGLSYKVVV 319 (463) Q Consensus 240 ~~n~g~~~~G~~v~~~~s~~G~i~l~~s~~~~~d~~l~~~~~~~p~ap~p~~vtat~~~~~~g~~~~~~~~a~ysYkV~a 319 (463) .+|++++++|++|++|+|++|+|+||||++|+++.++|+.+..+|+||+||++++|++++.+|+|++....+.|+|||++ T Consensus 237 ~~n~~~~~~G~~v~~f~sa~G~i~L~~s~~m~~~~~ld~~~~~~~~apaapsvt~tv~~~~~g~f~~~~~~~~~~Ykv~~ 316 (464) T protein:vir:80 237 SDNGQNATMGFNVKGFNSARGFIRLHGSTVMELEQILDENRMQLPNAPQKATVKATLEAGTKGKFRDEDLTIDTEYKVVV 316 (464) T ss_pred cCCCCcceeeeecccccccccceeccCccccCcccccccccccCCCCcCCceeEEEecCCcccCCccccccceeEEEEEE Confidence 99999999999999999999999999999999999999999999999999999999999999999988767779999999 Q ss_pred EecCCccccccceeeeecCCCCceEEEEEecCCCCCCcceEEEEeecCCCceEEEEEEeeeeeecCCceEEEEeccCCCC Q lcl|NC_019448. 320 NSDDAQSAPSEEVTATVSNVDDGVKLSISVNAMYQQQPQFVSIYRQGKETGMYFLIKRVPVKDAQEDGTIVFVDKNETLP 399 (463) Q Consensus 320 ~s~~geS~~S~~vt~Tva~~~~gv~ltIt~~a~~g~~~~~y~IYR~~~~~g~~~li~rv~~s~~n~~gtttf~D~N~~iP 399 (463) ||++|||+||+++++||++++++|+|+||++++++++|+|++|||++.++|+|+||+|||++++ .+|+++|+|+|+||| T Consensus 317 vn~~GeS~ps~~~~~ti~~~~~~V~l~it~~~~~~~~p~yv~IYR~~~~~g~f~~i~rv~~~~~-~~gt~t~vD~n~~IP 395 (464) T protein:vir:80 317 VSDDAESAPSDVASVVIDDKKKQVKLEITINNMYQARPQYVAIYRKGLETGLFYQIARVPASKA-VEGVITFIDVNDEIP 395 (464) T ss_pred ECCCCccccceeeeeeecCcccEEEEEEEeCCccccccceEEEEeecCCCCceeEEEEEeeccc-cCCceEEEecccccC Confidence 9999999999999999999999999999999999999999999999999999999999999997 478999999999999 Q ss_pred CCccceecCCchHHHHhhhhcchhhcCCcccCCcceeeeeeechhheecceeeEEEEEEeEecC Q lcl|NC_019448. 400 ETADVFVGEMSPQVVHLFELLPMMKLPLAQINASITFAVLWYGALALRAPKKWARIKNVRYIAV 463 (463) Q Consensus 400 gt~~~fvGe~~pqvi~l~ellPm~k~pla~~na~~~~~V~~Yg~L~l~aPkk~~~ikNV~~~~~ 463 (463) ||+++|||||||+||+|+|||||||||||++|++++|||+|||+|+|+|||||+|||||+||++ T Consensus 396 gt~~vfVgems~~ti~l~ellPm~rlplA~~n~~~~waVl~YGaLal~aPk~~~~ikNv~~~~~ 459 (464) T protein:vir:80 396 ETADVFVGELTPSVVHLFELLPMMRLPLAQVNASVTFAVLWYGALALRAPKKWARIKNVKYIAT 459 (464) T ss_pred CceeEeeecCCchHHHHHHHHHhhhCCchhcccchhhhhhhhhHHhhhccccceEEEEEEEeec Confidence 9999999999999999999999999999999999999999999999999999999999999999 No 5 >protein:vir:63741 Length: 468 # NCBI annotation: Cps # Family: family:all:2450 # MgeID: mge:1517 # MgeName: P100 # Cross-refs: genbank:gi:82547622;genbank:GeneID:3783474 Probab=100.00 E-value=6.4e-197 Score=1096.19 Aligned_cols=461 Identities=67% Similarity=1.050 Sum_probs=451.6 Q ss_pred CCCCCccchHHHHhhhhhhHHHHHHhhcCCccCCccccCccccchhhhhhHhhhhhccccccchhhhcccchhhHHHhhh Q lcl|NC_019448. 1 MTIEKNLSDVQQKYADQFQEDVVKSFQTGYGITPDTQIDAGALRREILDDQITMLTWTNEDLIFYRDISRRPAQSTVVKY 80 (463) Q Consensus 1 ~~~~~~~~~~~~~~~k~~~e~~~Ks~~agy~~~p~~q~~gaalr~esLd~~i~~L~~~~~df~f~~~i~k~~~~stv~ey 80 (463) ||+.++.+...++.+++..|.++|||+|||+++|++|++|+||||||||++|++|+|+++||+|||+|+|++++|||+|| T Consensus 1 ~~~~~~~~~~~~~~~~~~~e~~~Ks~~agy~~~p~~q~~~~AlR~EsL~~~i~~L~~~~~~f~~~~di~k~~a~stv~~y 80 (468) T protein:vir:63 1 MPKNNKEEEVKEVNLNSVQEDALKSFTTGYGITPDTQTDAGALRREFLDDQISMLTWTENDLTFYKDIAKKPATSTVAKY 80 (468) T ss_pred CCCCcchhhccccChhHHHHHHHHHHHcCcccCCccccCcchhhhhhhhhhhheeeecccchhhhhhcccchhhhhhhhh Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hhhhccCcccccccccccCcccccCcceEEEEEEEEEeechhhhhhhhhhhcccccHHHHHHHHHHHHHHHHHHHHHhhc Q lcl|NC_019448. 81 DQYLRHGNVGHSRFVKEIGVAPVSDPNIRQKTVSMKYVSDTKNMSIASGLVNNIADPSQILTEDAIAVVAKTIEWASFYG 160 (463) Q Consensus 81 ~~~~~hG~~g~~~fv~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lvn~~~Dp~~~~~~~ai~~~~~~~E~a~fyG 160 (463) +++++||++||++|++|+|+++++||+|+||+++||||++++++|+++++||+|+||+++|+++||+++||+|||+|||| T Consensus 81 ~~~~~~G~~g~~~f~~E~g~~~~~~~~~~r~~~~~k~l~~~~~vs~~~~l~n~i~d~~~~~~~~ai~~~a~tiE~a~FyG 160 (468) T protein:vir:63 81 DVYMQHGKVGHTRFTREIGVAPVSDPNIRQKTVNMKFASDTKNISIAAGLVNNIQDPMQILTDDAIVNIAKTIEWASFFG 160 (468) T ss_pred eeeeccCccccccccccccccccCCCceEEEEEEeeeeeeeeeehhhhhhhcchhhHHHHHHHHHHHHHHHHHHHHhhhc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccccCC-CccccccccccceeeecCcceEeccCCCCCHHHHhhhhhhhhhcCCceeEEecCHHHHHHHHHHhcCcceEEe Q lcl|NC_019448. 161 DASLTS-EVEGEGLEFDGLAKLIDKNNVINAKGNQLTEKHLNEAAVRIGKGFGTATDAYMPIGVHADFVNSILGRQMQLM 239 (463) Q Consensus 161 d~~l~~-~~~~~gleFDGl~~lI~~~nviDarG~~ls~~~ln~aa~~i~~~~G~~td~~m~~~vka~f~~~~~~~qrv~~ 239 (463) |++|.+ .++++|||||||.+|||+|||||+||++||+++||+|+++|+++||++||+|||+++|++|++++|.+||+|+ T Consensus 161 ds~l~~s~~~~~glqfDGi~~li~~enviDa~G~~ls~~~lneaa~~i~~gfG~~td~~~~~~v~a~~~~~~L~~q~~v~ 240 (468) T protein:vir:63 161 DSDLSDSPEPQAGLEFDGLAKLINQDNVHDARGASLTESLLNQAAVMISKGYGTPTDAYMPVGVQADFVNQQLSKQTQLV 240 (468) T ss_pred ccccccCCCccccccccceeEEecCCceeccCCCccCHHHHHHHhhhccccccChhhhhcchhHHhhhhhhhcCceEEEE Confidence 999964 5778999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ecCCCCcccceecCeeeecccccccCCceeccCccccccccccCCCCCCCCeeEEEEeccCCCcCcccccccceEEEEEE Q lcl|NC_019448. 240 QDNSGNVNTGYSVNGFYSSRGFIKLHGSTVMENELILDESLQPLPNAPQPAKVTATVETKQKGAFEDEEDRAGLSYKVVV 319 (463) Q Consensus 240 ~~n~g~~~~G~~v~~~~s~~G~i~l~~s~~~~~d~~l~~~~~~~p~ap~p~~vtat~~~~~~g~~~~~~~~a~ysYkV~a 319 (463) .+|.+.+++|++|++|++++|+|+||||+||++++++++....+|+||+|++++||+....+|.+.. .+.++|+|||++ T Consensus 241 ~~n~~~~~~G~~v~g~~sa~G~I~l~gs~il~~~~~l~~~~~~~~~Apsp~~vsaT~~~~~~g~~~~-~~~a~y~Y~v~~ 319 (468) T protein:vir:63 241 RDNGNNVSVGFNIQGFHSARGFIKLHGSTVMENEQILDERILALPTAPQPAKVTATQEAGKKGQFRA-EDLAAHEYKVVV 319 (468) T ss_pred cCCCCceeeeecccceecceeeeeecCceeeccccCCCcccccccccccCCccceeeecccCCcccC-CCcceEEEEEEE Confidence 9999999999999999999999999999999999999999999999999999999998888888754 677899999999 Q ss_pred EecCCccccccceeeeecCCCCceEEEEEecCCCCCCcceEEEEeecCCCceEEEEEEeeeeeecCCceEEEEeccCCCC Q lcl|NC_019448. 320 NSDDAQSAPSEEVTATVSNVDDGVKLSISVNAMYQQQPQFVSIYRQGKETGMYFLIKRVPVKDAQEDGTIVFVDKNETLP 399 (463) Q Consensus 320 ~s~~geS~~S~~vt~Tva~~~~gv~ltIt~~a~~g~~~~~y~IYR~~~~~g~~~li~rv~~s~~n~~gtttf~D~N~~iP 399 (463) ||++|||+||+++++||++.++|++|+|||+++++++|+||+|||++.++|+||||+|||++.+ .+++++|+|+|++|| T Consensus 320 vs~~GES~pS~~vtvTVaa~~dg~~ltIt~~~~~~~~p~yv~IYR~~~gg~~f~li~~va~~~a-~~gt~tf~D~n~~iP 398 (468) T protein:vir:63 320 SSDDAESIASEVATATVTAKDDGVKLEIELAPMYSSRPQFVSIYRKGAETGLFYLIARVPASKA-ENNVITFYDLNDSIP 398 (468) T ss_pred ECCCCccccccceEEEecCcccceeEEEEecCCCCCcceEEEEEEeCCCCcceeEeeeEeeeec-CCCeEEEEcCCcccC Confidence 9999999999999999999999999999999999999999999999999999999999999997 589999999999999 Q ss_pred CCccceecCCchHHHHhhhhcchhhcCCcccCCcceeeeeeechhheecceeeEEEEEEeEecC Q lcl|NC_019448. 400 ETADVFVGEMSPQVVHLFELLPMMKLPLAQINASITFAVLWYGALALRAPKKWARIKNVRYIAV 463 (463) Q Consensus 400 gt~~~fvGe~~pqvi~l~ellPm~k~pla~~na~~~~~V~~Yg~L~l~aPkk~~~ikNV~~~~~ 463 (463) ||+++|||||+|+||+|+|||||||||||++|++++|||+|||+|+|+|||||+|||||+|||| T Consensus 399 gT~~~fVgem~~~~i~~~~llpm~~lplA~~n~~~~~~Vl~Ygalal~~Pk~~~~ikNv~~~~~ 462 (468) T protein:vir:63 399 ETVDVFVGEMSANVVHLFELLPMMRLPLAQINASVTFAVLWYGALALRAPKKWVRIRNVKYIPV 462 (468) T ss_pred CCcceeeeecChhHHHHHHHhccccCChhHhccchhhhhhhhhHHhhhccccceEEEEeeeeee Confidence 9999999999999999999999999999999999999999999999999999999999999999 No 6 >protein:vir:80491 Length: 467 # NCBI annotation: Cps # Family: family:all:2450 # MgeID: mge:1883 # MgeName: A511 # Cross-refs: genbank:acc:YP_001468466;genbank:gi:157325041;genbank:GeneID:5601449 Probab=100.00 E-value=1.7e-195 Score=1088.35 Aligned_cols=460 Identities=67% Similarity=1.055 Sum_probs=444.8 Q ss_pred CCCCCccchHHHHhhhhhhHHHHHHhhcCCccCCccccCccccchhhhhhHhhhhhccccccchhhhcccchhhHHHhhh Q lcl|NC_019448. 1 MTIEKNLSDVQQKYADQFQEDVVKSFQTGYGITPDTQIDAGALRREILDDQITMLTWTNEDLIFYRDISRRPAQSTVVKY 80 (463) Q Consensus 1 ~~~~~~~~~~~~~~~k~~~e~~~Ks~~agy~~~p~~q~~gaalr~esLd~~i~~L~~~~~df~f~~~i~k~~~~stv~ey 80 (463) ||. .|.++...+.++...|+++|||+|||+++|++|++|+||||||||++|++|+|+++||+|||+|+|++++|||+|| T Consensus 1 ~~~-~~~~~~~~~n~~~~~e~~~Ks~~agy~~~p~tq~~~~AlR~EsL~~~i~~Lt~~~~~f~~~~di~k~~a~stv~~y 79 (467) T protein:vir:80 1 MPK-NNKEEVKEVNLNSVQEDALKSFTTGYGITPDTQTDAGALRREFLDDQISMLTWTENDLTFYKDIAKKPATSTVAKY 79 (467) T ss_pred CCC-cchhhhhhcccccCHHHHHHHHHcccccCCccccCcchhhhhhhhhhhheeeccccchhhhhhcccchhhhhhhhh Confidence 653 4556666777777779999999999999999999999999999999999999999999999999999999999999 Q ss_pred hhhhccCcccccccccccCcccccCcceEEEEEEEEEeechhhhhhhhhhhcccccHHHHHHHHHHHHHHHHHHHHHhhc Q lcl|NC_019448. 81 DQYLRHGNVGHSRFVKEIGVAPVSDPNIRQKTVSMKYVSDTKNMSIASGLVNNIADPSQILTEDAIAVVAKTIEWASFYG 160 (463) Q Consensus 81 ~~~~~hG~~g~~~fv~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lvn~~~Dp~~~~~~~ai~~~~~~~E~a~fyG 160 (463) +++++||++||++|++|+|+++++||+|+||+++||||++++++|+++++||+|+||+++|+++||+++||+|||+|||| T Consensus 80 ~~~~~~G~~g~~~f~~E~g~~~~~~~~~~r~~~~~k~l~~~~~vs~~~~l~n~i~d~~~~~~~~ai~~~a~tiE~a~FyG 159 (467) T protein:vir:80 80 DVYMQHGKVGHTRFTREIGVAPVSDPNIRQKTVNMKFASDTKNISIAAGLVNNIQDPMQILTDDAIVNIAKTIEWASFFG 159 (467) T ss_pred eeeeccCccccccccccccccccCCCceEEEEEEeeeeeeeeeehhhhhhhcchhhHHHHHHHHHHHHHHHHHHHHhhhc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccccCC-CccccccccccceeeecCcceEeccCCCCCHHHHhhhhhhhhhcCCceeEEecCHHHHHHHHHHhcCcceEEe Q lcl|NC_019448. 161 DASLTS-EVEGEGLEFDGLAKLIDKNNVINAKGNQLTEKHLNEAAVRIGKGFGTATDAYMPIGVHADFVNSILGRQMQLM 239 (463) Q Consensus 161 d~~l~~-~~~~~gleFDGl~~lI~~~nviDarG~~ls~~~ln~aa~~i~~~~G~~td~~m~~~vka~f~~~~~~~qrv~~ 239 (463) |++|.+ .++++|||||||.+|||+|||||+||++||+++||+|+++|+++||++||+|||+++|++|++++|.+||+|+ T Consensus 160 ds~l~~s~~~~~glqfDGi~~li~~enviDa~G~~ls~~~lneaa~~i~~gfG~~td~~~p~~v~a~~~~~~L~~q~~v~ 239 (467) T protein:vir:80 160 DSDLSDSPEPQAGLEFDGLAKLINQDNVHDARGASLTESLLNQAAVMISKGYGTPTDAYMPVGVQADFVNQQLSKQTQLV 239 (467) T ss_pred ccccccCCCccccccccceeEEecCCceeccCCCccCHHHHHHHhhhccccccChhhhhcchhHHhhhhhhhcCceEEEE Confidence 999964 5778999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ecCCCCcccceecCeeeecccccccCCceeccCccccccccccCCCCCCCCeeEEEEeccCCCcCcccccccceEEEEEE Q lcl|NC_019448. 240 QDNSGNVNTGYSVNGFYSSRGFIKLHGSTVMENELILDESLQPLPNAPQPAKVTATVETKQKGAFEDEEDRAGLSYKVVV 319 (463) Q Consensus 240 ~~n~g~~~~G~~v~~~~s~~G~i~l~~s~~~~~d~~l~~~~~~~p~ap~p~~vtat~~~~~~g~~~~~~~~a~ysYkV~a 319 (463) .+|.+.+++|++|++|++++|+|+||||+||++++++++....+|+||+|++++||+....+|.+.. .+.++|+|||++ T Consensus 240 ~~n~~~~~~G~~v~g~~sa~G~I~l~gs~il~~~~~l~~~~~~~~~Apsp~~vsaT~~~~~~g~~~~-~~~a~y~Y~v~~ 318 (467) T protein:vir:80 240 RDNGNNVSVGFNIQGFHSARGFIKLHGSTVMENEQILDERILALPTAPQPAKVTATQEAGKKGQFRA-EDLAAHEYKVVV 318 (467) T ss_pred cCCCCceeeeecccceecceeeeeecCceeeccccCCCcccccccccccCCccceeeecccCCcccC-CCcceEEEEEEE Confidence 9999999999999999999999999999999999999999999999999999999998888888754 677899999999 Q ss_pred EecCCccccccceeeeecCCCCceEEEEEecCCCCCCcceEEEEeecCCCceEEEEEEeeeeeecCCceEEEEeccCCCC Q lcl|NC_019448. 320 NSDDAQSAPSEEVTATVSNVDDGVKLSISVNAMYQQQPQFVSIYRQGKETGMYFLIKRVPVKDAQEDGTIVFVDKNETLP 399 (463) Q Consensus 320 ~s~~geS~~S~~vt~Tva~~~~gv~ltIt~~a~~g~~~~~y~IYR~~~~~g~~~li~rv~~s~~n~~gtttf~D~N~~iP 399 (463) ||++|||+||+++++||++.++|++|+|||+++++++|+||+|||++.++|+||||+|||++.+ .+++++|+|+|++|| T Consensus 319 vs~~GES~pS~~vtvTVaa~~dg~~ltIt~~~~~~~~p~yv~IYR~~~gg~~f~li~~va~~~a-~~gt~tf~D~n~~iP 397 (467) T protein:vir:80 319 SSDDAESIASEVATATVTAKDDGVKLEIELAPMYSSRPQFVSIYRKGAETGLFYLIARVPASKA-ENNVITFYDLNDSIP 397 (467) T ss_pred ECCCCccccccceEEEecCcccceeEEEEecCCCCCcceEEEEEEeCCCCcceeEeeeEeeeec-CCCeEEEEcCCcccC Confidence 9999999999999999999999999999999999999999999999999999999999999997 589999999999999 Q ss_pred CCccceecCCchHHHHhhhhcchhhcCCcccCCcceeeeeeechhheecceeeEEEEEEeEecC Q lcl|NC_019448. 400 ETADVFVGEMSPQVVHLFELLPMMKLPLAQINASITFAVLWYGALALRAPKKWARIKNVRYIAV 463 (463) Q Consensus 400 gt~~~fvGe~~pqvi~l~ellPm~k~pla~~na~~~~~V~~Yg~L~l~aPkk~~~ikNV~~~~~ 463 (463) ||+++|||||+|+||+|+|||||||||||++|++++|||+|||+|+|+|||||+|||||+|||| T Consensus 398 gT~~~fVgem~~~~i~~~~llpm~~lplA~~n~~~~~~Vl~Ygalal~~Pk~~~~ikNv~~~~~ 461 (467) T protein:vir:80 398 ETVDVFVGEMSANVVHLFELLPMMRLPLAQINASVTFAVLWYGALALRAPKKWVRIRNVKYIPV 461 (467) T ss_pred CCcceeeeecChhHHHHHHHhccccCChhHhccchhhhhhhhhHHhhhccccceEEEEeeeeee Confidence 9999999999999999999999999999999999999999999999999999999999999999 No 7 >protein:vir:100851 Length: 514 # NCBI annotation: hypothetical protein # Family: family:all:2450 # MgeID: mge:1633 # MgeName: LP65 # Cross-refs: genbank:acc:YP_164744;genbank:gi:56693157;genbank:GeneID:3197484 Probab=100.00 E-value=1.3e-194 Score=1083.45 Aligned_cols=462 Identities=43% Similarity=0.667 Sum_probs=437.2 Q ss_pred CC-----------------CCCccchHHHHhhh-hhhHHHHHH-hhcCCccCCccccCccccchhhhhhHhhhhhccccc Q lcl|NC_019448. 1 MT-----------------IEKNLSDVQQKYAD-QFQEDVVKS-FQTGYGITPDTQIDAGALRREILDDQITMLTWTNED 61 (463) Q Consensus 1 ~~-----------------~~~~~~~~~~~~~k-~~~e~~~Ks-~~agy~~~p~~q~~gaalr~esLd~~i~~L~~~~~d 61 (463) |- .-.-|.+-++..+| .+.+++.|| |++||+++|++|++||||||||||++|++|+|++++ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~k~a~t~gy~~~~~~~t~gaAlR~EsLd~~l~~Lt~~~~~ 80 (514) T protein:vir:10 1 MYTQDKTKDIMKKSFFGGDRAVAFDTNKEDILNENLPENVKKSAFTAGHSITPDTQTDGAANRIESLNRDLKVTTWGERD 80 (514) T ss_pred CCccchhhHHHhhhhcccceeeeecCcHHHHHHHhcchhhhhhhhccccccCCccccCccchhhhhhccceeEeeecCcc Confidence 11 12234444544444 445889999 999999999999999999999999999999999999 Q ss_pred cchhhhcccchhhHHHhhhhhhhccCcccccccccccCcccccCcceEEEEEEEEEeechhhhhhhhhhhcccccHHHHH Q lcl|NC_019448. 62 LIFYRDISRRPAQSTVVKYDQYLRHGNVGHSRFVKEIGVAPVSDPNIRQKTVSMKYVSDTKNMSIASGLVNNIADPSQIL 141 (463) Q Consensus 62 f~f~~~i~k~~~~stv~ey~~~~~hG~~g~~~fv~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lvn~~~Dp~~~~ 141 (463) |+|||+|+||+++|||+||+++++||++||++|++|+|+++++||+|+||+++|||+++++++|+++++||++.||++++ T Consensus 81 ftf~~~i~k~~a~STV~ey~~~~~~G~~G~~~f~~E~gi~~~~d~~~~rk~~~~k~l~~~~~vS~~~~l~n~i~d~~~~~ 160 (514) T protein:vir:10 81 FTLYNDIAKQPVDNTVLKYTQYYSHGRTGHSLFQPEIGIGDVNNPNERQRTINIKYIVDTHVTSIALQRANTIVDSLKVQ 160 (514) T ss_pred hhhhhhcCCchhhHHHhhhhhhcccCcccccccccccccCcCCCcceEEEEEeeeeeeeeeeeeehhhhccchhhHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHhhcccccCCCccccccccccceeeecCcceEeccCCCCCHHHHhhhhhhhhhcCCceeEEecCH Q lcl|NC_019448. 142 TEDAIAVVAKTIEWASFYGDASLTSEVEGEGLEFDGLAKLIDKNNVINAKGNQLTEKHLNEAAVRIGKGFGTATDAYMPI 221 (463) Q Consensus 142 ~~~ai~~~~~~~E~a~fyGd~~l~~~~~~~gleFDGl~~lI~~~nviDarG~~ls~~~ln~aa~~i~~~~G~~td~~m~~ 221 (463) +++||+++||+|||+|||||++|+|+..++|||||||.+|||++|||||||++||+++||+||++|+++||+|||+|||+ T Consensus 161 ~~dai~~ia~tiE~a~FyGDs~L~s~~~~~gleFDGl~~lI~~~NvIDarG~~Ls~~~ln~aA~~i~~gfGt~TD~ylp~ 240 (514) T protein:vir:10 161 EYAAISTVIKTDEWAMFYGDADLTSGQKGEGLQFDGLFKLIAPENHIDLRGGRLSPAALNMAARKIGEGFGTPTDAYMPI 240 (514) T ss_pred HHHHHHHHHHHHHHHHhhhcccCCCccccCcchhhhHHHhhcCCCeEecCCCCccHHHHhhhhhhhhcccCChhheeCch Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHhcCcceEEeecCCCCcccceecCeeeecccccccCCceeccCccccccccccCCCCCCCCeeEEEEeccCC Q lcl|NC_019448. 222 GVHADFVNSILGRQMQLMQDNSGNVNTGYSVNGFYSSRGFIKLHGSTVMENELILDESLQPLPNAPQPAKVTATVETKQK 301 (463) Q Consensus 222 ~vka~f~~~~~~~qrv~~~~n~g~~~~G~~v~~~~s~~G~i~l~~s~~~~~d~~l~~~~~~~p~ap~p~~vtat~~~~~~ 301 (463) ++|++|+|+++++|||+|++|+|++++|+++++|++++|+|+||||++|+.+++|++.....|+||+|+++++++++.++ T Consensus 241 ~vka~f~~~~~~~qRV~~~~n~~~~~~G~~v~~f~s~~G~I~L~gs~im~~~n~L~~~~~~~~~Ap~~~~va~svT~~~~ 320 (514) T protein:vir:10 241 GIKADFVNQHLNGQRVMLPGQTGGMTTGLDIDKFLSAHGSIRIQGSTIMDSDNKLDFDRPVSPTAPTAPQLSATVTPDGG 320 (514) T ss_pred HHHHHHhhcccCcceEEeecCccceeeeeeccceeEeccceeecCCeeecccccCccCCccCCcCCCCCcceEEEecCcc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999887 Q ss_pred CcCcc--cc----------ccc-ceEEEEEEEecCCccccccceeeeecCCCCceEEEEEecCCCCCCcceEEEEeecC- Q lcl|NC_019448. 302 GAFED--EE----------DRA-GLSYKVVVNSDDAQSAPSEEVTATVSNVDDGVKLSISVNAMYQQQPQFVSIYRQGK- 367 (463) Q Consensus 302 g~~~~--~~----------~~a-~ysYkV~a~s~~geS~~S~~vt~Tva~~~~gv~ltIt~~a~~g~~~~~y~IYR~~~- 367 (463) |.+.. .+ +.+ .|+|+|++||++|||+||+++++|+++++++++|+||++++++..|+|++|||++. T Consensus 321 g~~~~ad~t~~~g~~~~~~~~g~~~sYaVv~~n~~GeS~ps~~vtaT~a~~~~~i~ltItp~~~~~~~p~yv~IYR~~~~ 400 (514) T protein:vir:10 321 GLWHEADKTDSKGEVILNKEVGVEQSYVAVMVSRHGDSRPSLVQTATPTKKDDAITLTITPNAMQNVIPDYVAIYRKSNF 400 (514) T ss_pred cccCcccccccccccccccccceeEEEEEEEECCCCcccccceeeeeeeccCceEEEEEEeccCcccccceEEEEeccCC Confidence 75541 11 122 47899999999999999999999999999999999999999999999999999974 Q ss_pred -------------CCceEEEEEEeeeeeecCCceEEEEeccCCCCCCccceecCCchHHHHhhhhcchhhcCCcccCCcc Q lcl|NC_019448. 368 -------------ETGMYFLIKRVPVKDAQEDGTIVFVDKNETLPETADVFVGEMSPQVVHLFELLPMMKLPLAQINASI 434 (463) Q Consensus 368 -------------~~g~~~li~rv~~s~~n~~gtttf~D~N~~iPgt~~~fvGe~~pqvi~l~ellPm~k~pla~~na~~ 434 (463) ++|+|++|+|||+++ +++++|+|+|+|+|||||+++|||||+|+||+|+|||||||||||++|+.+ T Consensus 401 ~s~~~~~~~~~~~~tGdf~li~rv~~~~-~~~gttt~~D~n~~IPgT~~vfVgemspevi~l~ellPm~klpLA~~na~~ 479 (514) T protein:vir:10 401 DSDALEANTDASGNRGSYYLIGKVAVRE-QEGATITFVDTNARIAGCGDVFVIENRPETVALQEFIPLSKLNLAVTTTAT 479 (514) T ss_pred CcchhhhhccccccccceeEEEEEeeec-CCCCeEEEeccccccCCcceeEEeeCchHHHHHHHHhhhhhcChhhhcchH Confidence 569999999999976 679999999999999999999999999999999999999999999999999 Q ss_pred eeeeeeechhheecceeeEEEEEEeEecC Q lcl|NC_019448. 435 TFAVLWYGALALRAPKKWARIKNVRYIAV 463 (463) Q Consensus 435 ~~~V~~Yg~L~l~aPkk~~~ikNV~~~~~ 463 (463) +|||+|||+|+|+|||||+|||||+|+|| T Consensus 480 ~waVlwYGaLal~aPkr~~~IkNv~~~~v 508 (514) T protein:vir:10 480 SFVVLNYVALALYYPKRGAVLENVVYSRV 508 (514) T ss_pred HHHHHHHhHHHhhccccceEEEeeeeeec Confidence 99999999999999999999999999999 No 8 >protein:vir:102823 Length: 470 # NCBI annotation: major structural protein # Family: family:all:2450 # MgeID: mge:1610 # MgeName: YS40 # Cross-refs: genbank:acc:YP_874086;genbank:gi:118197693;genbank:GeneID:4496015 Probab=100.00 E-value=6.7e-159 Score=887.73 Aligned_cols=433 Identities=20% Similarity=0.267 Sum_probs=383.8 Q ss_pred CCCCCccchHHHHhhhhhhHHHHHHhhcCCccCCccccCccccchhhhhhHhhhhhccccccchhhhcccchhhHHHhhh Q lcl|NC_019448. 1 MTIEKNLSDVQQKYADQFQEDVVKSFQTGYGITPDTQIDAGALRREILDDQITMLTWTNEDLIFYRDISRRPAQSTVVKY 80 (463) Q Consensus 1 ~~~~~~~~~~~~~~~k~~~e~~~Ks~~agy~~~p~~q~~gaalr~esLd~~i~~L~~~~~df~f~~~i~k~~~~stv~ey 80 (463) ||- .-+|++.|+++|++.+ +.+.|+||||||||++|++|+|++++|+|||+|+|||++|||+|| T Consensus 1 ~~~---------~~~~~~~~a~~~al~~-------a~~~g~AlR~EsLd~~l~~lt~~~~~ftf~~~i~k~~a~STV~ey 64 (470) T protein:vir:10 1 MPY---------EHLKHLDEATLKALNA-------AGQVAESLEREDLEPEVTQLNVLDTPLTDLLSKNAVKAKAYEHEY 64 (470) T ss_pred CCh---------hHhhhhhHHHHHHHHH-------hhhcchhhhhhhhccceeEeeecCccchhhhhcCCchhhhHhhhh Confidence 442 3578999999999998 777889999999999999999999999999999999999999999 Q ss_pred hhhhc-cCcccccccccccCcccccCcceEEEEEEEEEeechhhhhhhhh--hhcccccHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019448. 81 DQYLR-HGNVGHSRFVKEIGVAPVSDPNIRQKTVSMKYVSDTKNMSIASG--LVNNIADPSQILTEDAIAVVAKTIEWAS 157 (463) Q Consensus 81 ~~~~~-hG~~g~~~fv~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~--lvn~~~Dp~~~~~~~ai~~~~~~~E~a~ 157 (463) +++++ ||+.||++| +|+|+++++||+|+||+++||||+++++||+++. ++|+++||+++++++||+++||+|||+| T Consensus 65 ~~~~~rhG~~g~s~~-~E~~l~~~~d~~~~Rr~v~~K~l~~~~~VT~~a~~~~~n~v~d~~~~~~~dai~~ia~tiE~a~ 143 (470) T protein:vir:10 65 NVVTARHDKIGYAAF-REGGLPRTVEVNVVRRRIRPMLVGHRITVTELATRTTQNGVMQIDELVKREKMIAVANEFEYLA 143 (470) T ss_pred hhhccccccccceee-cccccCccCCCceEEEEEEEEEEeecchhhhhhhhhhhccccchHHHHHHHHHHHHHHHHHhhh Confidence 99886 888888866 9999999999999999999999999999999974 5788999999999999999999999999 Q ss_pred hhcccccCCC--ccccccccccceeeec---CcceEeccCCCCCHHHHhhhhhhh--hhcCCceeEEecCHHHHHHHHHH Q lcl|NC_019448. 158 FYGDASLTSE--VEGEGLEFDGLAKLID---KNNVINAKGNQLTEKHLNEAAVRI--GKGFGTATDAYMPIGVHADFVNS 230 (463) Q Consensus 158 fyGd~~l~~~--~~~~gleFDGl~~lI~---~~nviDarG~~ls~~~ln~aa~~i--~~~~G~~td~~m~~~vka~f~~~ 230 (463) ||||++|+++ ++++|||||||.+||| |+|||||||++||+++||+|++.| +++||+|||+|||+++|++|+|+ T Consensus 144 FyGDs~l~s~~~g~~~gleFDGl~~lId~~~~~NViDarG~~Ls~~~L~~aa~~I~~~~~fGt~TD~~lp~~vka~f~~~ 223 (470) T protein:vir:10 144 FYGDNLLGDDVPGSPNNLQQDGIINIIKRGAPQNVLDAGGRPLSIDLLWEAESRVVSTQAFANPTAVFISYVDKLNLQAS 223 (470) T ss_pred hhhccccccccCcccCceeccchhhhccCCCCccccccCCCCccHHHHHHHHhhhcccccccChhhhccchhHHHHHHHh Confidence 9999999974 5569999999999998 689999999999999999999766 68999999999999999999999 Q ss_pred hcCcceEEeecCCCCcccceecCeeeecccccccCCceecc-----CccccccccccCCCCCCCCeeEEEEeccCCCcC- Q lcl|NC_019448. 231 ILGRQMQLMQDNSGNVNTGYSVNGFYSSRGFIKLHGSTVME-----NELILDESLQPLPNAPQPAKVTATVETKQKGAF- 304 (463) Q Consensus 231 ~~~~qrv~~~~n~g~~~~G~~v~~~~s~~G~i~l~~s~~~~-----~d~~l~~~~~~~p~ap~p~~vtat~~~~~~g~~- 304 (463) |+++|||||++|+|++++|++|++|++++|+|+||||++|+ ++++|++.....+ .|++++|++++.++.. T Consensus 224 ~~~~qRv~~~~N~~~~~~G~~v~~f~sa~G~I~L~~s~~m~~~~k~~p~~l~~~v~~~a----AP~~~~tv~~t~~~~a~ 299 (470) T protein:vir:10 224 FYQISRVMTTADRRAGLLGADAQSYIGVRGEHSLYPSQFLGDFHKFNPARFGAEVGDFA----APSNSWTVSTTDNFVTL 299 (470) T ss_pred hcCceEEEEecCCCceeeeeeccceeeeeeeeeecccccccchhhcCcccCCcccCCcc----cCceeEEeecCCCceee Confidence 99999999999999999999999999999999999999999 6888888654333 2234455444432221 Q ss_pred ------cccccccceEEEEEEEecCCccccccce--eeeecCCCCceEEEEEecCCCCCCcceEEEEeecCCCceEEEEE Q lcl|NC_019448. 305 ------EDEEDRAGLSYKVVVNSDDAQSAPSEEV--TATVSNVDDGVKLSISVNAMYQQQPQFVSIYRQGKETGMYFLIK 376 (463) Q Consensus 305 ------~~~~~~a~ysYkV~a~s~~geS~~S~~v--t~Tva~~~~gv~ltIt~~a~~g~~~~~y~IYR~~~~~g~~~li~ 376 (463) ++..+...|+|.+++|+.+|||. |++| ++|++.+++|++++|+.. ..++|++|||+++++|.|+||+ T Consensus 300 ~~~sk~g~~~~~~v~sy~y~v~~~~gds~-s~~v~vt~t~~~v~kgv~ltI~~~----~~v~yv~IYRk~~~s~~~~li~ 374 (470) T protein:vir:10 300 PYNSGLGDPANTTVYSYAFKAANFYGESA-AKYIDVYIDSTEAGKGVRFQFHGL----VNVKWLDVYRKDPGSQEYKFYK 374 (470) T ss_pred cccCCCCcccCcceeEEEEEEEEecCCCC-cceEEEEEeeehhcceeEEEEecC----CCCcEEEEEeecCCCCceeEEE Confidence 12224446788888999999994 4555 556777778888888743 2379999999999999999999 Q ss_pred EeeeeeecCCceEEEEeccCCCCCCccc----------eecCCchHHHHhhhhcch--hhcCCcccCCcceeeeeeechh Q lcl|NC_019448. 377 RVPVKDAQEDGTIVFVDKNETLPETADV----------FVGEMSPQVVHLFELLPM--MKLPLAQINASITFAVLWYGAL 444 (463) Q Consensus 377 rv~~s~~n~~gtttf~D~N~~iPgt~~~----------fvGe~~pqvi~l~ellPm--~k~pla~~na~~~~~V~~Yg~L 444 (463) ||+++++| |+.++|+|.|++||+|+.+ |||||||++++|++|||| +||||+..++...|+| |+| T Consensus 375 rv~v~~~n-g~~~~~~D~~e~i~tt~~v~~~~~~Pgt~~Vgemsp~v~sl~~~l~m~l~klp~a~~~~~v~~~v---gal 450 (470) T protein:vir:10 375 RVKVSTVN-GDFTWIDDGHETVTTPSGVYRWKKIPGTGVVVGIDPNVTTMAVWIGMELYRLPPALTHDYVIWKV---ASV 450 (470) T ss_pred EEeeeecc-CCEEEEecccccCCCcceeeeecccCcceeccccCcchhhhhhhhhhhhhhcCHHHHHHHHHHHH---HHH Confidence 99999987 7888999999999999976 999999999999999999 8999999998888988 999 Q ss_pred heecceeeEEEEEEeEecC Q lcl|NC_019448. 445 ALRAPKKWARIKNVRYIAV 463 (463) Q Consensus 445 ~l~aPkk~~~ikNV~~~~~ 463 (463) +|+|||||++||||+|||| T Consensus 451 al~aPKr~~~IkNV~~~~~ 469 (470) T protein:vir:10 451 FSRAPEFNFLIVNVGQEPI 469 (470) T ss_pred HHhccccceEEEEeeeeec Confidence 9999999999999999999 No 9 >protein:vir:8843 Length: 317 # NCBI annotation: major head protein # Family: family:all:3919 # MgeID: mge:158 # MgeName: PaP3 # Cross-refs: genbank:acc:NP_775251;genbank:gi:27476049;genbank:GeneID:2700597 Probab=99.56 E-value=2.5e-17 Score=111.58 Aligned_cols=293 Identities=15% Similarity=0.132 Sum_probs=193.3 Q ss_pred CCCCCccchHHHHhhhhhhHHHHHHhhcCCccCCccccCccccchhhhhhHhhhhhccccccchhhhcccchhhHHHhhh Q lcl|NC_019448. 1 MTIEKNLSDVQQKYADQFQEDVVKSFQTGYGITPDTQIDAGALRREILDDQITMLTWTNEDLIFYRDISRRPAQSTVVKY 80 (463) Q Consensus 1 ~~~~~~~~~~~~~~~k~~~e~~~Ks~~agy~~~p~~q~~gaalr~esLd~~i~~L~~~~~df~f~~~i~k~~~~stv~ey 80 (463) |..|++.-.+ --+..-||+|.++|.++.-.+.. |+..|.|.+++||.||| T Consensus 1 ma~~~~~~~t----------------------------~~~~g~~~dl~~~I~~isp~dTP--f~S~i~~~~a~~~~~~W 50 (317) T protein:vir:88 1 MATPTNAVST----------------------------VEINGKREDLIDIIYNIAPYDTP--FMSAIGKGVATAITHEW 50 (317) T ss_pred CCccccceEe----------------------------eeeeeeeechhhhheecCCccCc--ceeeecCceecccEEEE Confidence 5555543322 23345789999999887766554 56677788999999999 Q ss_pred hhhhccCcccccccccccCcccccC-cceEEEEEEEEEeechhhhhhhhhhhcc--cccHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019448. 81 DQYLRHGNVGHSRFVKEIGVAPVSD-PNIRQKTVSMKYVSDTKNMSIASGLVNN--IADPSQILTEDAIAVVAKTIEWAS 157 (463) Q Consensus 81 ~~~~~hG~~g~~~fv~E~g~~~~~d-~~~~r~~~~~k~l~~~~~vs~~~~lvn~--~~Dp~~~~~~~ai~~~~~~~E~a~ 157 (463) ....=..- . ..-..||+++.... ..-.|+...+--+.+..+||.-++.++. ++|-++.|...++.-+..++|+++ T Consensus 51 ~~d~l~~~-~-~~~~~EG~da~~~~~~~r~~~~N~tQIf~k~v~VSgTa~av~~~G~~~ela~q~~kk~~EikrdmE~~l 128 (317) T protein:vir:88 51 QTDELRQP-G-KNTRVEGEDATIKAGSFTTMLNNYCQISDETLQVTGTADRVKKAGRKNELAYQLAKKSKELKLDMEYAL 128 (317) T ss_pred EeeecCCc-c-ccccccCcccccccccCCEEeccEEEEEEeEEEEeehhhhhhhcCccchhHHHHHHHHHHHHHHHHHHH Confidence 97552211 1 12334887544333 3344555555666777777777777665 469999999999999999999999 Q ss_pred hhcccccCCCccccccccccceeeecCcceEeccCC----------------CCCHHHHhhhhhhhhhcCCceeEEecCH Q lcl|NC_019448. 158 FYGDASLTSEVEGEGLEFDGLAKLIDKNNVINAKGN----------------QLTEKHLNEAAVRIGKGFGTATDAYMPI 221 (463) Q Consensus 158 fyGd~~l~~~~~~~gleFDGl~~lI~~~nviDarG~----------------~ls~~~ln~aa~~i~~~~G~~td~~m~~ 221 (463) ++|.+....+..+.-=+.+|+...|+..++..+.|. .|+|+.|+++..+|=.+.|.++.+|+++ T Consensus 129 i~g~~a~~~~~~t~~r~~~Gl~~~i~t~~~~~~~g~~~~~~~~~~~t~~t~~~lte~~l~~~l~~i~~~Gg~~~~i~v~a 208 (317) T protein:vir:88 129 VGAPQAKVQRNTTTPGQMANIFAYYKTNGSLGANGVAPVGDGSNTGTAGDLRLLTEDMLLNASESIWRNGGQANSIQTSS 208 (317) T ss_pred hcCeeeccCCCCccchhhhhHHHHhccCceeccCccccccCCCccccccccccccHHHHHHHHHHHHhcCCCCCEEEeCh Confidence 999987654333332288999999988877765555 6999999999999999999999999999 Q ss_pred HHHHHHHHHhcCcceEEeecCCCCcccceecCeeeecccccccCCceeccCcccc--ccccccCCCCC-CCCeeEEEEec Q lcl|NC_019448. 222 GVHADFVNSILGRQMQLMQDNSGNVNTGYSVNGFYSSRGFIKLHGSTVMENELIL--DESLQPLPNAP-QPAKVTATVET 298 (463) Q Consensus 222 ~vka~f~~~~~~~qrv~~~~n~g~~~~G~~v~~~~s~~G~i~l~~s~~~~~d~~l--~~~~~~~p~ap-~p~~vtat~~~ 298 (463) ..|..|...+-++...+-. ......+|..|+.|.+..|.|++..+.+|..|..+ |...|.+. + -|. -.+. - T Consensus 209 ~~k~~i~~~~~~~~~~i~~-~~~~~~~g~~v~~~~tdfG~v~ii~~r~lp~~~~~~~D~~~~~l~--~Lr~~--~~e~-l 282 (317) T protein:vir:88 209 SIKKAISKNMKGRATEITL-DASDNRIAQTVDVYESDFGKYTIRANRWFHENTLFVFDPKMHSLC--YLRPF--FQHE-L 282 (317) T ss_pred HHHHHHHHHhcCCceeEEE-cccCeEEEEEEEEEEeCCeEEEEEeCCCCCCCeEEEEccccccee--ecccc--eeec-c Confidence 9999998776655544322 23345889999999999999999999998755543 33322111 1 111 1111 1 Q ss_pred cCCCcCcccccccc--eEEEEEEEecCCccccccceeeee Q lcl|NC_019448. 299 KQKGAFEDEEDRAG--LSYKVVVNSDDAQSAPSEEVTATV 336 (463) Q Consensus 299 ~~~g~~~~~~~~a~--ysYkV~a~s~~geS~~S~~vt~Tv 336 (463) ..+| |. +..- ..|-+.+.|..+-. --...++++ T Consensus 283 aKtG---d~-~k~~i~~E~tLe~~N~~a~a-~i~~l~~~~ 317 (317) T protein:vir:88 283 AKTG---DS-EKRQLLVEYTFRVNNEKSGA-LIRDVVAQL 317 (317) T ss_pred CCCc---cc-ceeEEEEEEEEEEcCcccee-EEEEecccC Confidence 1112 11 1111 23333344544322 122344444 No 10 >protein:vir:94933 Length: 330 # NCBI annotation: putative phage structural protein # Family: family:all:1120 # MgeID: mge:1538 # MgeName: Xp15 # Cross-refs: genbank:acc:YP_239278;genbank:gi:66392060;genbank:GeneID:5076578 Probab=99.22 E-value=5.1e-13 Score=87.96 Aligned_cols=320 Identities=13% Similarity=0.124 Sum_probs=181.2 Q ss_pred CCCCCccchHHHHhhhhhhHHHHHHhhcCCccCCccccCccccchhhhhhHhhhhhccccccchhhhcccchhhHHHhhh Q lcl|NC_019448. 1 MTIEKNLSDVQQKYADQFQEDVVKSFQTGYGITPDTQIDAGALRREILDDQITMLTWTNEDLIFYRDISRRPAQSTVVKY 80 (463) Q Consensus 1 ~~~~~~~~~~~~~~~k~~~e~~~Ks~~agy~~~p~~q~~gaalr~esLd~~i~~L~~~~~df~f~~~i~k~~~~stv~ey 80 (463) .+.|..+ -|..+..+|.+ +||.+ + |...++.|-...+...+ +..-.+.-.++..++=.++++..+.| T Consensus 5 ~~~~~~~--~~~~~~~~~p~---l~m~a---l---TLaea~~l~~d~~~~~V--IE~l~~~s~iL~~lpf~~ve~~~~~~ 71 (330) T protein:vir:94 5 CTPPLRG--RWRTLTHQFPE---LKMPT---V---TLAESAKLSQDHLVSGL--IETIVEVNPLYEMMPFTEIEGNALAY 71 (330) T ss_pred cCCcccc--ceeehhccccc---cchhh---h---hhhHHhhcCchhhHHHH--HHhhhccchHHhhcccccccCCccee Confidence 1222211 24344444432 23332 1 12223333333333322 22222333455666656677888999 Q ss_pred hhhhccCccccccccccc-CcccccCcceEEEEEEEEEeechhhh-hhhhhhhcccccHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_019448. 81 DQYLRHGNVGHSRFVKEI-GVAPVSDPNIRQKTVSMKYVSDTKNM-SIASGLVNNIADPSQILTEDAIAVVAKTIEWASF 158 (463) Q Consensus 81 ~~~~~hG~~g~~~fv~E~-g~~~~~d~~~~r~~~~~k~l~~~~~v-s~~~~lvn~~~Dp~~~~~~~ai~~~~~~~E~a~f 158 (463) +|....++ ..|.... +.++....++.|.+..++-++.-..| +.++++-++..|-+..|.+..|..+.+.+|+.+| T Consensus 72 ~r~~~lp~---a~~r~~n~~~~~~~~~Tf~q~t~~l~~l~~~~~Vd~~iadl~g~~~d~~~~q~~~~ieal~~~~e~~li 148 (330) T protein:vir:94 72 NRENVLGD---VQFLAVGGTITAKNPATFTKVTSELTTLIGDAEVNGLIQATRSDFMDQTSVQVASKAKSIGRQYQASMI 148 (330) T ss_pred eeeecCCc---ceeeeccccccccCcceeeeeeechhhhhhhHHHHHHHHHhcCCHHHHHHHHHHHHHHHHHHHHHHHhh Confidence 99887643 3454422 33334456789999999999999999 5556788889999999999999999999999999 Q ss_pred hcccccCCCccccccccccceeeecCcceEec--cCCCCCHHHHhhhhhhhhhcCCceeEEecCHHHHHHHHHHhcCcce Q lcl|NC_019448. 159 YGDASLTSEVEGEGLEFDGLAKLIDKNNVINA--KGNQLTEKHLNEAAVRIGKGFGTATDAYMPIGVHADFVNSILGRQM 236 (463) Q Consensus 159 yGd~~l~~~~~~~gleFDGl~~lI~~~nviDa--rG~~ls~~~ln~aa~~i~~~~G~~td~~m~~~vka~f~~~~~~~qr 236 (463) |||+.- -+||||.+.++++++||+ +|+.||.+.|.++-..+...-|.+.-++|+......+...-...-+ T Consensus 149 nGDs~~--------~~F~GL~~~~~~~q~i~tg~~gg~~T~d~LDeLl~~v~~~~g~~~~~l~n~a~~r~I~a~~R~~~~ 220 (330) T protein:vir:94 149 TGDGTG--------NSFQGMMGLVAASQTISAGANGGTLTFELLDQLLDLVKDKDGQVDYLMSSFAMRRKYFSLLRALGG 220 (330) T ss_pred ccCCCC--------ccccchhhcCCcccEEecCCCCCCCCHHHHHHHHHHhcCCCCCCcEEEechhHHHHHHHHHHhccC Confidence 999772 179999999999999999 8899999999999999977778888888877766666543221100 Q ss_pred EEeecCCCCcccceecCeeeecccccccCCcee-ccCccccccccccCCCCCCCCeeEEEEeccCCCcCcccccccceEE Q lcl|NC_019448. 237 QLMQDNSGNVNTGYSVNGFYSSRGFIKLHGSTV-MENELILDESLQPLPNAPQPAKVTATVETKQKGAFEDEEDRAGLSY 315 (463) Q Consensus 237 v~~~~n~g~~~~G~~v~~~~s~~G~i~l~~s~~-~~~d~~l~~~~~~~p~ap~p~~vtat~~~~~~g~~~~~~~~a~ysY 315 (463) +|..-+ .+.+.|..+ ..+...+. |+.+-|.. .++ T Consensus 221 -----------~~v~~~-------~~~~~G~~v~~~~GvPi~------~~d~ip~~---------~~~------------ 255 (330) T protein:vir:94 221 -----------AAIGEV-------MTLPSGRQIPTYRGVPWF------VNDFIPSN---------MTQ------------ 255 (330) T ss_pred -----------CCCCCc-------ccccCCCEEeeeCCeEEE------ecccccCC---------CCc------------ Confidence 000000 001112211 11111111 11111100 000 Q ss_pred EEEEEecCCccccccceeeeecCCCCceEEEEEecCCCCCCcceEEEEeecCC-CceEEEEEEeeeeeecCCceEEEEec Q lcl|NC_019448. 316 KVVVNSDDAQSAPSEEVTATVSNVDDGVKLSISVNAMYQQQPQFVSIYRQGKE-TGMYFLIKRVPVKDAQEDGTIVFVDK 394 (463) Q Consensus 316 kV~a~s~~geS~~S~~vt~Tva~~~~gv~ltIt~~a~~g~~~~~y~IYR~~~~-~g~~~li~rv~~s~~n~~gtttf~D~ 394 (463) ++ ..+.+.-|..-+.+... -|.-++ ...| T Consensus 256 --------~~--------------------------~~~ttsIyav~~G~~~~~qgV~Gl---------~~~g------- 285 (330) T protein:vir:94 256 --------GT--------------------------ATNATAIFAGTFDDGSNKYGIAGL---------TARG------- 285 (330) T ss_pred --------cc--------------------------CCCceeEEEEeecccccccceEee---------cCCC------- Confidence 00 01111001111111100 021111 1111 Q ss_pred cCCCCCCccceecCCchHHHHhhhhcchhhcCCcccCCcceeeeeeechhheecceeeEEEEEEeEe Q lcl|NC_019448. 395 NETLPETADVFVGEMSPQVVHLFELLPMMKLPLAQINASITFAVLWYGALALRAPKKWARIKNVRYI 461 (463) Q Consensus 395 N~~iPgt~~~fvGe~~pqvi~l~ellPm~k~pla~~na~~~~~V~~Yg~L~l~aPkk~~~ikNV~~~ 461 (463) .||-.--|+|+.+ -.+..+|.|.||-.+++.-|+...+++||.-= T Consensus 286 ---~~glsVr~~G~~~-------------------~k~v~~~~v~~y~~~av~~~~a~~~L~~V~~g 330 (330) T protein:vir:94 286 ---SAGLRVQNVGAKE-------------------NADETITRVKMYCGFANFSQLGLAAIKGLIPG 330 (330) T ss_pred ---CCcceeeeCCCcc-------------------ccceeeEEEEEeeeeEEechhheeeeccccCC Confidence 1332222333211 11457889999999999999999999999866 No 11 >protein:vir:97255 Length: 310 # NCBI annotation: hypothetical protein ORF017 # Family: family:all:1120 # MgeID: mge:1657 # MgeName: M6 # Cross-refs: genbank:acc:YP_001294525;genbank:gi:149408246;genbank:GeneID:5237120 Probab=98.90 E-value=1.7e-10 Score=74.15 Aligned_cols=283 Identities=11% Similarity=0.125 Sum_probs=145.8 Q ss_pred CCCccchHHHHhhhhhhHHHHHHhhcCCccCCccccCccccchhhhhhHhhhhhccccccchhhhcccchhhHHHhhhhh Q lcl|NC_019448. 3 IEKNLSDVQQKYADQFQEDVVKSFQTGYGITPDTQIDAGALRREILDDQITMLTWTNEDLIFYRDISRRPAQSTVVKYDQ 82 (463) Q Consensus 3 ~~~~~~~~~~~~~k~~~e~~~Ks~~agy~~~p~~q~~gaalr~esLd~~i~~L~~~~~df~f~~~i~k~~~~stv~ey~~ 82 (463) ||+ .|-.+.+.. .-..|-++.+|... ++-.++..+|=..++.....|+| T Consensus 1 mpa-ltLaea~k~-----------------------~~d~l~~~ViE~~~-------~~s~lL~~LpF~~veg~~~~ynR 49 (310) T protein:vir:97 1 MAS-VTLAESAKL-----------------------AQDELVAGVIENII-------TVNRMFDVLPFDSIEGNSLAYNR 49 (310) T ss_pred Ccc-cchHHHhhc-----------------------CcchHHHHHHHHHh-------ccchHHHhCCcccccCCcceeeE Confidence 332 111111000 00111222222221 23334455554556666778888 Q ss_pred hhccCcccc----cccccccCcccccCcceEEEEEEEEEeechhhhhh-hhhhh-cccccHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019448. 83 YLRHGNVGH----SRFVKEIGVAPVSDPNIRQKTVSMKYVSDTKNMSI-ASGLV-NNIADPSQILTEDAIAVVAKTIEWA 156 (463) Q Consensus 83 ~~~hG~~g~----~~fv~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs~-~~~lv-n~~~Dp~~~~~~~ai~~~~~~~E~a 156 (463) ....++.+- ..+..| |++ -+..++.++..+++-++....|.. ++++. ++..|-.++|.+-.|..+...+|+. T Consensus 50 ~~~~~~~~~~~v~~~~~~~-g~~-~~~~t~~~~~~~L~i~~g~~~Vd~~i~dl~~~~~~dq~~~Ql~~~iea~~~~~e~~ 127 (310) T protein:vir:97 50 ENVLGDVIMAGVGTTFSGA-GAG-KAAATFTKVNSNLTTIMGDAEVNGLIQATRSGDGNDQTAVQIASKAKSAGRKYQDQ 127 (310) T ss_pred eeccCCcccccccccccCC-Ccc-ccccccceeeeeeeeeeehhhhhhHHHhhhcCChHHHHHHHHHHHHHHHHHHHHHH Confidence 876655531 122222 222 256788999999999999999975 47886 5578999999999999999999999 Q ss_pred HhhcccccCCCccccccccccceeeecCcceEec--cCCCCCHHHHhhhhhhhhhcCCceeEEecCHHHHHHHHHHhcCc Q lcl|NC_019448. 157 SFYGDASLTSEVEGEGLEFDGLAKLIDKNNVINA--KGNQLTEKHLNEAAVRIGKGFGTATDAYMPIGVHADFVNSILGR 234 (463) Q Consensus 157 ~fyGd~~l~~~~~~~gleFDGl~~lI~~~nviDa--rG~~ls~~~ln~aa~~i~~~~G~~td~~m~~~vka~f~~~~~~~ 234 (463) +++||++.+ +||||.+.+++.++||+ +|+.||.+.|.++-..+.+.=|.+.-++||+.+...+...-..- T Consensus 128 lINGD~a~n--------~F~GL~~~~~~~q~i~~~~~gg~~t~d~LDeLl~~v~~~~g~p~~~l~~~~~~r~i~A~~R~~ 199 (310) T protein:vir:97 128 LINGNGAGN--------EFAGLIQLCASGQKATTGATGSAISFAILDELMDLVVDKDGQVDYLTMHARTLRSYKALLRAL 199 (310) T ss_pred hhccccCCC--------cccchhhcCCccceeecCCCCCCCCHHHHHHHHHHHhcCCCCCCEEEecHHHHHHHHHHHHHh Confidence 999999843 59999999999999998 77999999999999999776788999999998766665433211 Q ss_pred ceEEeecCCCCcccceecCeeeecccccccCCceeccCccccccccccCCCCCCCCe-eEEEEecc-----CCCc----- Q lcl|NC_019448. 235 QMQLMQDNSGNVNTGYSVNGFYSSRGFIKLHGSTVMENELILDESLQPLPNAPQPAK-VTATVETK-----QKGA----- 303 (463) Q Consensus 235 qrv~~~~n~g~~~~G~~v~~~~s~~G~i~l~~s~~~~~d~~l~~~~~~~p~ap~p~~-vtat~~~~-----~~g~----- 303 (463) -+.=+.+-.. ...|-.|. +-+| =-|+..|.+-.... .. .+--.++ -.....-+ -.|. T Consensus 200 ~~~g~~~~~~-~~~G~~v~---~~~G------iPi~~~d~ip~~~~-~~-~~~gtTsIya~r~Ge~~~~~Gv~Gl~~~~~ 267 (310) T protein:vir:97 200 GGASINEVVE-LPSGAEVP---AYSG------TPIFRNDYIPTNQT-KG-GTTGCTTIFAGTLDDGSRTHGIAGLTATQA 267 (310) T ss_pred cCCCCCCccc-cCCCCEEe---eeCC------eEEEEeCccCCCcc-cc-ccCCceeEEEEeeCccccccceeccccCCc Confidence 1100000000 01122221 2222 11222221110000 00 0000000 00000000 0000 Q ss_pred -------CcccccccceEEEEEEE-ecCCccccccceeeeecCCCC Q lcl|NC_019448. 304 -------FEDEEDRAGLSYKVVVN-SDDAQSAPSEEVTATVSNVDD 341 (463) Q Consensus 304 -------~~~~~~~a~ysYkV~a~-s~~geS~~S~~vt~Tva~~~~ 341 (463) .+...+.+..+|.|..- +---.+ +-..+.+-++.. T Consensus 268 ~glsVr~~G~~~~~~v~~~~V~~Y~~~av~~---~~A~a~L~~V~~ 310 (310) T protein:vir:97 268 AGIQVVDVGESEDSDEHIWRVKWYCGLALFS---EKGLACADGITN 310 (310) T ss_pred cceeEEeCCcccCCcceeEEEEEeeeEEEec---ccceeeeccccC Confidence 00112334455554421 100000 000111111111 No 12 >protein:vir:96392 Length: 324 # NCBI annotation: ORF011 # Family: family:all:507 # MgeID: mge:1613 # MgeName: 53 # Cross-refs: genbank:acc:YP_239648;genbank:gi:66395381;genbank:GeneID:5132868 Probab=98.33 E-value=5.1e-07 Score=55.06 Aligned_cols=315 Identities=11% Similarity=0.038 Sum_probs=160.3 Q ss_pred CCCCCccchHHHHhhhhhhHHHHHHhhcCCccCCccccCc-cccchhhhhhHhhhhhccccccchhhhcccchhhHHHhh Q lcl|NC_019448. 1 MTIEKNLSDVQQKYADQFQEDVVKSFQTGYGITPDTQIDA-GALRREILDDQITMLTWTNEDLIFYRDISRRPAQSTVVK 79 (463) Q Consensus 1 ~~~~~~~~~~~~~~~k~~~e~~~Ks~~agy~~~p~~q~~g-aalr~esLd~~i~~L~~~~~df~f~~~i~k~~~~stv~e 79 (463) |-.+.+.......++....+ .+.+.+.- ..+..+| +.+..+..++-+..+ .+.-.+++.+.+.++.+.-.+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~--~~~~~a~~---~~~~~~~~~~iP~~~~~~ii~~~---~~~s~l~~l~~~~~~~~~~~~ 72 (324) T protein:vir:96 1 MEQTQKLKLNLQHFASNNVK--PQVFNPDN---VMMHEKKDGTLMNEFTTPILQEV---MENSKIMQLGKYEPMEGTEKK 72 (324) T ss_pred CCcchhhhHHHHHHHHHhhh--hhhhcccc---ccccCcCccccchhHHHHHHHHH---HhhchhhhhcceeeccCCceE Confidence 65554444444444432211 12333321 1122233 344455443333333 333345555555666655456 Q ss_pred hhhhhccCcccccccccccCcccccCcceEEEEEEEEEeechhhhhhhhhhhcccccHHHHHHHHHHHHHHHHHHHHHhh Q lcl|NC_019448. 80 YDQYLRHGNVGHSRFVKEIGVAPVSDPNIRQKTVSMKYVSDTKNMSIASGLVNNIADPSQILTEDAIAVVAKTIEWASFY 159 (463) Q Consensus 80 y~~~~~hG~~g~~~fv~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lvn~~~Dp~~~~~~~ai~~~~~~~E~a~fy 159 (463) |.++... +...+++|++..+.+++.+.+.....+=++---.+|.-+- .++..|.+....+.-...++..+|.++|+ T Consensus 73 ~p~~~~~---~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~~~is~ell-~ds~~~l~~~i~~~la~ai~~~~d~a~l~ 148 (324) T protein:vir:96 73 FTFWADK---PGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFL-NYTYSQFFEEMKPMIAEAFYKKFDEAGIL 148 (324) T ss_pred EEEEecC---cceeEecCCccccccccceeEEEEeeEEEEEeehhhHHHH-hcchHHHHHHHHHHHHHHHHHHHHHHHhc Confidence 6666533 4567999999999999999999999999988888877432 23445788888888889999999999999 Q ss_pred cccccCCCccccccccccceeeecCcceEeccCCCCCHHHHhhhhhhhhhcCCceeEEecCHHHHHHHHHHhcCcceEEe Q lcl|NC_019448. 160 GDASLTSEVEGEGLEFDGLAKLIDKNNVINAKGNQLTEKHLNEAAVRIGKGFGTATDAYMPIGVHADFVNSILGRQMQLM 239 (463) Q Consensus 160 Gd~~l~~~~~~~gleFDGl~~lI~~~nviDarG~~ls~~~ln~aa~~i~~~~G~~td~~m~~~vka~f~~~~~~~qrv~~ 239 (463) |+-.=+ +..|+.+.+...+.... ..++.+.|.++...+..+|..+.-+.|++.+.+.+.+.--..-|.+. T Consensus 149 G~g~~~--------~~~gi~~~~~~~~~~~~--~~~t~~~i~~~~~~l~~~~~~~~~~vmn~~~~~~L~~l~d~~G~~~~ 218 (324) T protein:vir:96 149 NQGNNP--------FGKSIAQSIEKTNKVIK--GDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKERI 218 (324) T ss_pred cCCCCC--------cCccccccccccceecc--ccccHHHHHHHHHhhhhccCCCCEEEEcHHHHHHHHHhhccCCCeee Confidence 975421 34566666665554443 45678888888888999998888899999999999865444445555 Q ss_pred ecCCCCcccceecCeeeecccccccCCceeccCccccccccccCCCCCCCCeeEEEEeccC--------CCcCccccccc Q lcl|NC_019448. 240 QDNSGNVNTGYSVNGFYSSRGFIKLHGSTVMENELILDESLQPLPNAPQPAKVTATVETKQ--------KGAFEDEEDRA 311 (463) Q Consensus 240 ~~n~g~~~~G~~v~~~~s~~G~i~l~~s~~~~~d~~l~~~~~~~p~ap~p~~vtat~~~~~--------~g~~~~~~~~a 311 (463) ++..+..-.|++|- .+....+. .+.+++-+ ......- -... ++....... .+...+.-..- T Consensus 219 ~~~~~~~l~G~PV~--~~~~~~~~-~~~~~~gd-----~~~~~~g-~~~~--~~i~~~~~~~~~~~~~~~~~~~~~f~~d 287 (324) T protein:vir:96 219 YDRNSDSLDGLPVV--NLKSSNLK-RGELITGD-----FDKLIYG-IPQL--IEYKIDETAQLSTVKNEDGTPVNLFEQD 287 (324) T ss_pred cCCCCCcccceeeE--eeCCCCCC-cceEEEEe-----cceEEEE-EecC--cEEEEeecccccccccccccchhhhhcC Confidence 54444444555542 11111111 01111110 0000000 0001 111111111 11100000001 Q ss_pred ceEEEEEEEecCCccccccceeeeecCCCCceEEEEEecCCCCCCcceE Q lcl|NC_019448. 312 GLSYKVVVNSDDAQSAPSEEVTATVSNVDDGVKLSISVNAMYQQQPQFV 360 (463) Q Consensus 312 ~ysYkV~a~s~~geS~~S~~vt~Tva~~~~gv~ltIt~~a~~g~~~~~y 360 (463) -..|++...-+.+=--|...+ .|+.. .+...++|.=+ T Consensus 288 ~~~~r~~~r~d~~v~~~~A~~-----------~l~~a-~~~~~~~~~~~ 324 (324) T protein:vir:96 288 MVALRATMHVALHIADDKAFA-----------KLVPA-DKRTDSVPGEV 324 (324) T ss_pred cEEEEEEEEEccEEecccceE-----------EEecc-cccCCCCCCCC Confidence 122222222111111121111 11111 11222233111 No 13 >protein:vir:78830 Length: 324 # NCBI annotation: major head protein # Family: family:all:507 # MgeID: mge:1858 # MgeName: 80alpha # Cross-refs: genbank:acc:YP_001285361;genbank:gi:148717889;genbank:GeneID:5246961 Probab=98.33 E-value=5.1e-07 Score=55.06 Aligned_cols=315 Identities=11% Similarity=0.038 Sum_probs=160.3 Q ss_pred CCCCCccchHHHHhhhhhhHHHHHHhhcCCccCCccccCc-cccchhhhhhHhhhhhccccccchhhhcccchhhHHHhh Q lcl|NC_019448. 1 MTIEKNLSDVQQKYADQFQEDVVKSFQTGYGITPDTQIDA-GALRREILDDQITMLTWTNEDLIFYRDISRRPAQSTVVK 79 (463) Q Consensus 1 ~~~~~~~~~~~~~~~k~~~e~~~Ks~~agy~~~p~~q~~g-aalr~esLd~~i~~L~~~~~df~f~~~i~k~~~~stv~e 79 (463) |-.+.+.......++....+ .+.+.+.- ..+..+| +.+..+..++-+..+ .+.-.+++.+.+.++.+.-.+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~--~~~~~a~~---~~~~~~~~~~iP~~~~~~ii~~~---~~~s~l~~l~~~~~~~~~~~~ 72 (324) T protein:vir:78 1 MEQTQKLKLNLQHFASNNVK--PQVFNPDN---VMMHEKKDGTLMNEFTTPILQEV---MENSKIMQLGKYEPMEGTEKK 72 (324) T ss_pred CCcchhhhHHHHHHHHHhhh--hhhhcccc---ccccCcCccccchhHHHHHHHHH---HhhchhhhhcceeeccCCceE Confidence 65554444444444432211 12333321 1122233 344455443333333 333345555555666655456 Q ss_pred hhhhhccCcccccccccccCcccccCcceEEEEEEEEEeechhhhhhhhhhhcccccHHHHHHHHHHHHHHHHHHHHHhh Q lcl|NC_019448. 80 YDQYLRHGNVGHSRFVKEIGVAPVSDPNIRQKTVSMKYVSDTKNMSIASGLVNNIADPSQILTEDAIAVVAKTIEWASFY 159 (463) Q Consensus 80 y~~~~~hG~~g~~~fv~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lvn~~~Dp~~~~~~~ai~~~~~~~E~a~fy 159 (463) |.++... +...+++|++..+.+++.+.+.....+=++---.+|.-+- .++..|.+....+.-...++..+|.++|+ T Consensus 73 ~p~~~~~---~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~~~is~ell-~ds~~~l~~~i~~~la~ai~~~~d~a~l~ 148 (324) T protein:vir:78 73 FTFWADK---PGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFL-NYTYSQFFEEMKPMIAEAFYKKFDEAGIL 148 (324) T ss_pred EEEEecC---cceeEecCCccccccccceeEEEEeeEEEEEeehhhHHHH-hcchHHHHHHHHHHHHHHHHHHHHHHHhc Confidence 6666533 4567999999999999999999999999988888877432 23445788888888889999999999999 Q ss_pred cccccCCCccccccccccceeeecCcceEeccCCCCCHHHHhhhhhhhhhcCCceeEEecCHHHHHHHHHHhcCcceEEe Q lcl|NC_019448. 160 GDASLTSEVEGEGLEFDGLAKLIDKNNVINAKGNQLTEKHLNEAAVRIGKGFGTATDAYMPIGVHADFVNSILGRQMQLM 239 (463) Q Consensus 160 Gd~~l~~~~~~~gleFDGl~~lI~~~nviDarG~~ls~~~ln~aa~~i~~~~G~~td~~m~~~vka~f~~~~~~~qrv~~ 239 (463) |+-.=+ +..|+.+.+...+.... ..++.+.|.++...+..+|..+.-+.|++.+.+.+.+.--..-|.+. T Consensus 149 G~g~~~--------~~~gi~~~~~~~~~~~~--~~~t~~~i~~~~~~l~~~~~~~~~~vmn~~~~~~L~~l~d~~G~~~~ 218 (324) T protein:vir:78 149 NQGNNP--------FGKSIAQSIEKTNKVIK--GDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKERI 218 (324) T ss_pred cCCCCC--------cCccccccccccceecc--ccccHHHHHHHHHhhhhccCCCCEEEEcHHHHHHHHHhhccCCCeee Confidence 975421 34566666665554443 45678888888888999998888899999999999865444445555 Q ss_pred ecCCCCcccceecCeeeecccccccCCceeccCccccccccccCCCCCCCCeeEEEEeccC--------CCcCccccccc Q lcl|NC_019448. 240 QDNSGNVNTGYSVNGFYSSRGFIKLHGSTVMENELILDESLQPLPNAPQPAKVTATVETKQ--------KGAFEDEEDRA 311 (463) Q Consensus 240 ~~n~g~~~~G~~v~~~~s~~G~i~l~~s~~~~~d~~l~~~~~~~p~ap~p~~vtat~~~~~--------~g~~~~~~~~a 311 (463) ++..+..-.|++|- .+....+. .+.+++-+ ......- -... ++....... .+...+.-..- T Consensus 219 ~~~~~~~l~G~PV~--~~~~~~~~-~~~~~~gd-----~~~~~~g-~~~~--~~i~~~~~~~~~~~~~~~~~~~~~f~~d 287 (324) T protein:vir:78 219 YDRNSDSLDGLPVV--NLKSSNLK-RGELITGD-----FDKLIYG-IPQL--IEYKIDETAQLSTVKNEDGTPVNLFEQD 287 (324) T ss_pred cCCCCCcccceeeE--eeCCCCCC-cceEEEEe-----cceEEEE-EecC--cEEEEeecccccccccccccchhhhhcC Confidence 54444444555542 11111111 01111110 0000000 0001 111111111 11100000001 Q ss_pred ceEEEEEEEecCCccccccceeeeecCCCCceEEEEEecCCCCCCcceE Q lcl|NC_019448. 312 GLSYKVVVNSDDAQSAPSEEVTATVSNVDDGVKLSISVNAMYQQQPQFV 360 (463) Q Consensus 312 ~ysYkV~a~s~~geS~~S~~vt~Tva~~~~gv~ltIt~~a~~g~~~~~y 360 (463) -..|++...-+.+=--|...+ .|+.. .+...++|.=+ T Consensus 288 ~~~~r~~~r~d~~v~~~~A~~-----------~l~~a-~~~~~~~~~~~ 324 (324) T protein:vir:78 288 MVALRATMHVALHIADDKAFA-----------KLVPA-DKRTDSVPGEV 324 (324) T ss_pred cEEEEEEEEEccEEecccceE-----------EEecc-cccCCCCCCCC Confidence 122222222111111121111 11111 11222233111 No 14 >protein:vir:96223 Length: 324 # NCBI annotation: ORF011 # Family: family:all:507 # MgeID: mge:1607 # MgeName: 69 # Cross-refs: genbank:acc:YP_239571;genbank:gi:66395304;genbank:GeneID:5132771 Probab=98.11 E-value=3.4e-06 Score=50.58 Aligned_cols=315 Identities=12% Similarity=0.053 Sum_probs=156.3 Q ss_pred CCCCCccchHHHHhhhhhhHHHHHHhhcCCccCCccccCccccchhhhhhHhhhhhccccccchhhhcccchhhHHHhhh Q lcl|NC_019448. 1 MTIEKNLSDVQQKYADQFQEDVVKSFQTGYGITPDTQIDAGALRREILDDQITMLTWTNEDLIFYRDISRRPAQSTVVKY 80 (463) Q Consensus 1 ~~~~~~~~~~~~~~~k~~~e~~~Ks~~agy~~~p~~q~~gaalr~esLd~~i~~L~~~~~df~f~~~i~k~~~~stv~ey 80 (463) |-.+.+..-.-..|++...+. ..+.+...+ .+...++-+..+. -.+|..+. .+...+.+.....++.+.-.+| T Consensus 1 ~~~~~~~~~~~~~f~~~~~~~--~~~~a~~~~--~~~~~~~lip~~~-~~~ii~~~--~~~s~l~~l~~~~~~~~~~~~~ 73 (324) T protein:vir:96 1 MEQTQKLKLNLQHFASNNVKP--QVFNPDNVM--MHEKKDGTLLNDF-TTPILQEV--MENSKIMQLGKYEPMEGTEKKF 73 (324) T ss_pred CCcchhhhHHHHHHHHhhhhh--hhccccccc--ccCCCcceechhH-HHHHHHHH--HhhchhhhhcceeeccCCceEE Confidence 665555553333343322110 112222111 1112233444444 44443332 2233345545555555555567 Q ss_pred hhhhccCcccccccccccCcccccCcceEEEEEEEEEeechhhhhhhhhhhcccccHHHHHHHHHHHHHHHHHHHHHhhc Q lcl|NC_019448. 81 DQYLRHGNVGHSRFVKEIGVAPVSDPNIRQKTVSMKYVSDTKNMSIASGLVNNIADPSQILTEDAIAVVAKTIEWASFYG 160 (463) Q Consensus 81 ~~~~~hG~~g~~~fv~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lvn~~~Dp~~~~~~~ai~~~~~~~E~a~fyG 160 (463) .++... +...+++|++..+..++++.+.+...+=++-.-.+|.-+- .++..|.+....+.-...+++.+|.++|+| T Consensus 74 p~~~~~---~~a~~v~Eg~~~~~~~~~f~~v~~~~~k~~~~~~is~ell-~ds~~~l~~~i~~~l~~aia~~~d~~~l~G 149 (324) T protein:vir:96 74 TFWADK---PGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFL-NYTYSQFFEEMKPMIAEAFYKKFDEAGILN 149 (324) T ss_pred EEEecC---cceeeecCCccccccccceeEEEEEeEEEEEeehhhHHHH-hcchHHHHHHHHHHHHHHHHHHHHHHhhhc Confidence 776543 3567999999999999999999999999998888877432 234567888888888999999999999999 Q ss_pred ccccCCCccccccccccceeeecCcceEeccCCCCCHHHHhhhhhhhhhcCCceeEEecCHHHHHHHHHHhcCcceEEee Q lcl|NC_019448. 161 DASLTSEVEGEGLEFDGLAKLIDKNNVINAKGNQLTEKHLNEAAVRIGKGFGTATDAYMPIGVHADFVNSILGRQMQLMQ 240 (463) Q Consensus 161 d~~l~~~~~~~gleFDGl~~lI~~~nviDarG~~ls~~~ln~aa~~i~~~~G~~td~~m~~~vka~f~~~~~~~qrv~~~ 240 (463) +-+= .+-.|+...+...+.... ..++.+.|..+...+..+|+.++-+.|++.+.+.+...--..-|.+.+ T Consensus 150 ~g~~--------~~~~~~~~~~~~~~~~~~--~~~~~~~i~~~~~~i~~~~~~~~~~i~n~~~~~~L~~lkd~~G~~~~~ 219 (324) T protein:vir:96 150 QGNN--------PFGKSIAQSIKKTNKVIK--GDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKERIY 219 (324) T ss_pred CCCC--------CcCccccccccccceecc--cccchHHHHHHHHhhhhccCCCCEEEEcHHHHHHHHHhhCCCCCeeec Confidence 7541 123455555544444333 346677788888888888988889999999999988644344455555 Q ss_pred cCCCCcccceecCeeeecccccccCCceeccCccccccccccCCCCCCCCeeEEEEeccCC-CcCcccccccc------- Q lcl|NC_019448. 241 DNSGNVNTGYSVNGFYSSRGFIKLHGSTVMENELILDESLQPLPNAPQPAKVTATVETKQK-GAFEDEEDRAG------- 312 (463) Q Consensus 241 ~n~g~~~~G~~v~~~~s~~G~i~l~~s~~~~~d~~l~~~~~~~p~ap~p~~vtat~~~~~~-g~~~~~~~~a~------- 312 (463) +..+..-.|++|- .+....+. .+..++.+ ...... .-... ++.......+ ..+.+. +... T Consensus 220 ~~~~~~l~G~PV~--~~~~~~~~-~~~~~~gd-----~s~~~~-~~~~~--~~i~~~~~~~~~~~~~~-~~~~~~~~~~n 287 (324) T protein:vir:96 220 DRNSDSLDGLPVV--NLKSSNLK-RGELITGD-----FDKLIY-GIPQL--IEYKIDETAQLSTVKNE-DGTPVNLFEQD 287 (324) T ss_pred CCCCCcccceeeE--eecCCCCC-cceEEEEe-----cceEEE-EEecC--cEEEEeecccccccccc-cccchhhhhcC Confidence 5444445566552 11111111 01111110 000000 00011 1111111111 000000 1111 Q ss_pred -eEEEEEEEecCCccccccceeeeecCCCCceEEEEEecCCCCCCcceE Q lcl|NC_019448. 313 -LSYKVVVNSDDAQSAPSEEVTATVSNVDDGVKLSISVNAMYQQQPQFV 360 (463) Q Consensus 313 -ysYkV~a~s~~geS~~S~~vt~Tva~~~~gv~ltIt~~a~~g~~~~~y 360 (463) -.+++...-+.+=--|... +.|+.. .++..++|.-+ T Consensus 288 ~v~~r~~~r~d~~v~~~~a~-----------~~l~~a-~~~~~~~~~~~ 324 (324) T protein:vir:96 288 MVALRATMHVALHIADDKAF-----------AKLVPA-DKRTDSVPGEV 324 (324) T ss_pred cEEEEEEEEeccEEecccce-----------EEEecc-cccCCCCCCCC Confidence 1222221111110011111 122221 12222222111 No 15 >protein:vir:103955 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1662 # MgeName: phiNM # Cross-refs: genbank:acc:YP_873992;genbank:gi:118430767;genbank:GeneID:4525449 Probab=98.02 E-value=4.4e-06 Score=49.95 Aligned_cols=313 Identities=12% Similarity=0.047 Sum_probs=155.1 Q ss_pred CCCCCccchHHHHhhhhhhHHHHHH--hhcCCccCCccccCccccchhhhhhHhhhhhccccccchhhhcccchhhHHHh Q lcl|NC_019448. 1 MTIEKNLSDVQQKYADQFQEDVVKS--FQTGYGITPDTQIDAGALRREILDDQITMLTWTNEDLIFYRDISRRPAQSTVV 78 (463) Q Consensus 1 ~~~~~~~~~~~~~~~k~~~e~~~Ks--~~agy~~~p~~q~~gaalr~esLd~~i~~L~~~~~df~f~~~i~k~~~~stv~ 78 (463) |-.+.+.... +++|...+.+. |.+..-. .+...++-+..+..++-+..+... ..+.+.....++.+.-. T Consensus 1 ~~~~~~~~~~----~~~f~~~~~~~~~~~a~~~~--~~~~~~~liP~~~~~~ii~~~~~~---s~l~~~~~~~~~~~~~~ 71 (324) T protein:vir:10 1 MEQTQKLKLN----LQHFASNNVKPQVFNPDNVM--MHEKKDGTLLNDFTTPILQEVMEN---SKIMQLGKYEPMEGTEK 71 (324) T ss_pred CCCchHHHHH----HHHHHHHhhccceeccccee--ccCCCcceechhHHHHHHHHHHhh---chhhhhcceeeccCCce Confidence 6555444433 23332222222 2221111 111223456665554444444332 23555555555555445 Q ss_pred hhhhhhccCcccccccccccCcccccCcceEEEEEEEEEeechhhhhhhhhhhcccccHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_019448. 79 KYDQYLRHGNVGHSRFVKEIGVAPVSDPNIRQKTVSMKYVSDTKNMSIASGLVNNIADPSQILTEDAIAVVAKTIEWASF 158 (463) Q Consensus 79 ey~~~~~hG~~g~~~fv~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lvn~~~Dp~~~~~~~ai~~~~~~~E~a~f 158 (463) +|.++.. .+...+++|++..+.+++.+.+.....+=++..-.+|.-+- .++..|.+....+.-...+++.+|.++| T Consensus 72 ~~p~~~~---~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell-~ds~~~l~~~i~~~l~~ai~~~~d~a~l 147 (324) T protein:vir:10 72 KFTFWAD---KPGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFL-NYTYSQFFEEMKPMIAEAFYKKFDEAGI 147 (324) T ss_pred EEEEEeC---CcceeEeccCccccccccceeEEEEeeEEEEEeehhhHHHH-hcchHHHHHHHHHHHHHHHHHHHHHHhh Confidence 5666542 24578999999999999999999999999998888877432 3445678888888999999999999999 Q ss_pred hcccccCCCccccccccccceeeecCcceEeccCCCCCHHHHhhhhhhhhhcCCceeEEecCHHHHHHHHHHhcCcceEE Q lcl|NC_019448. 159 YGDASLTSEVEGEGLEFDGLAKLIDKNNVINAKGNQLTEKHLNEAAVRIGKGFGTATDAYMPIGVHADFVNSILGRQMQL 238 (463) Q Consensus 159 yGd~~l~~~~~~~gleFDGl~~lI~~~nviDarG~~ls~~~ln~aa~~i~~~~G~~td~~m~~~vka~f~~~~~~~qrv~ 238 (463) +|+-.= .+..|+.+.+...+.... ..++.+.|.++...+..+|..+.-++|++.+.+.+.+.--..-|.+ T Consensus 148 ~G~g~~--------~~~~~i~~~~~~~~~~~~--~~~t~~~i~~~~~~l~~~~~~~~~~v~n~~~~~~L~~l~d~~g~~~ 217 (324) T protein:vir:10 148 LNQGNN--------PFGKSIAQSIEKTNKVIK--GDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKER 217 (324) T ss_pred hcCCCC--------ccCccccccccccceecc--ccCCHHHHHHHHHhhhhccCCCCEEEEcHHHHHHHHHhhccCCcee Confidence 997541 133456666665555543 4577888888888899999888889999999999885432332334 Q ss_pred eecCCCCcccceecCeeeecccccccCCc-eeccC--ccccccccccCCCCCCCCeeEEE------EeccCCCcCccccc Q lcl|NC_019448. 239 MQDNSGNVNTGYSVNGFYSSRGFIKLHGS-TVMEN--ELILDESLQPLPNAPQPAKVTAT------VETKQKGAFEDEED 309 (463) Q Consensus 239 ~~~n~g~~~~G~~v~~~~s~~G~i~l~~s-~~~~~--d~~l~~~~~~~p~ap~p~~vtat------~~~~~~g~~~~~~~ 309 (463) .+...++.-.|.+|- .+.. ...... .++-+ ..++- . .....+... ...+..+...+-.. T Consensus 218 ~~~~~~~~l~G~PV~--~~~~--~~~~~~~~~~gd~~~~~~~-~-------~~~~~i~~~~~~~~~~~~~~~~~~~~~~~ 285 (324) T protein:vir:10 218 IYDRNSDTLDGLPVV--NLKS--SNLKRGELITGDFDKLIYG-I-------PQLIEYKIDETAQLSTVKNEDGTPVNLFE 285 (324) T ss_pred ecCCCCccccceeEE--eecC--CCCCcceEEEEecccEEEE-E-------ecCcEEEEeecccccccccccccchhhhh Confidence 333333334455441 1110 001100 11100 00000 0 001111000 00000110000000 Q ss_pred ccceEEEEEEEecCCccccccceeeeecCCCCceEEEEEecCCCCCCcceE Q lcl|NC_019448. 310 RAGLSYKVVVNSDDAQSAPSEEVTATVSNVDDGVKLSISVNAMYQQQPQFV 360 (463) Q Consensus 310 ~a~ysYkV~a~s~~geS~~S~~vt~Tva~~~~gv~ltIt~~a~~g~~~~~y 360 (463) ..-..+++...-+. -+....+=+.|+.. .+...++|.=+ T Consensus 286 ~~~~~~r~~~r~d~-----------~v~~~~A~~~l~~a-~~~~~~~~~~~ 324 (324) T protein:vir:10 286 QDMVALRATMHVAL-----------HIADDKAFAKLVPA-DKKTDSVPGEV 324 (324) T ss_pred cCcEEEEEEEEEcc-----------EEecccceEEEEec-cCCCCCCCCCC Confidence 00112222211111 11111111222221 12222222112 No 16 >protein:vir:9309 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:165 # MgeName: phi 11 # Cross-refs: genbank:acc:NP_803287;genbank:gi:29028597;genbank:GeneID:1258044 Probab=98.00 E-value=7.5e-06 Score=48.69 Aligned_cols=312 Identities=12% Similarity=0.055 Sum_probs=153.5 Q ss_pred cchHH--HHhhhhhhHHHH--HHhhcCCccCCccccCccccchhhhhhHhhhhhccccccchhhhcccchhhHHHhhhhh Q lcl|NC_019448. 7 LSDVQ--QKYADQFQEDVV--KSFQTGYGITPDTQIDAGALRREILDDQITMLTWTNEDLIFYRDISRRPAQSTVVKYDQ 82 (463) Q Consensus 7 ~~~~~--~~~~k~~~e~~~--Ks~~agy~~~p~~q~~gaalr~esLd~~i~~L~~~~~df~f~~~i~k~~~~stv~ey~~ 82 (463) ++..+ +...++|..+.. +.|.|...+. +...++.+..+.. .+|..+.. +...+.+...+.+..+...+|.+ T Consensus 1 ~~~~~~~~~~~~~f~~~~~~~~~~~a~~~~~--~~~~~~liP~~~~-~~ii~~~~--~~s~l~~l~~~~~~~~~~~~ip~ 75 (324) T protein:vir:93 1 MEQTQKLKLNLQHFASNNVKPQVFNPDNVMM--HEKKDGTLLNDFT-TPILQEVM--ENSKIMQLGKYEPMEGTEKKFTF 75 (324) T ss_pred CchhHHHHHHHHHHHHhhhhhhhcccccccc--cCCCcceechhHH-HHHHHHHH--hhchhhhhcceeeccCCceEEEE Confidence 22222 122334433322 3343432111 1122344555554 44433322 22234444444455555456766 Q ss_pred hhccCcccccccccccCcccccCcceEEEEEEEEEeechhhhhhhhhhhcccccHHHHHHHHHHHHHHHHHHHHHhhccc Q lcl|NC_019448. 83 YLRHGNVGHSRFVKEIGVAPVSDPNIRQKTVSMKYVSDTKNMSIASGLVNNIADPSQILTEDAIAVVAKTIEWASFYGDA 162 (463) Q Consensus 83 ~~~hG~~g~~~fv~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lvn~~~Dp~~~~~~~ai~~~~~~~E~a~fyGd~ 162 (463) +.. .....+++|++..+..++++.+.....+=++..-.+|.-+- .++..|.+....+.--..+++.+|.++++|+- T Consensus 76 ~~~---~~~a~~v~Eg~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell-~ds~~~l~~~i~~~l~~aia~~~d~a~l~G~g 151 (324) T protein:vir:93 76 WAD---KPGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFL-NYTYSQFFEEMKPMIAEAFYKKFDEAGILNQG 151 (324) T ss_pred Eec---CcceeeecCCccccccccceeEEEEEeEEEEEeehhhHHHH-hcchHHHHHHHHHHHHHHHHHHHHHHHhcCCC Confidence 653 23567999999999999999999999999998888876332 33456778888888889999999999999975 Q ss_pred ccCCCccccccccccceeeecCcceEeccCCCCCHHHHhhhhhhhhhcCCceeEEecCHHHHHHHHHHhcCcceEEeecC Q lcl|NC_019448. 163 SLTSEVEGEGLEFDGLAKLIDKNNVINAKGNQLTEKHLNEAAVRIGKGFGTATDAYMPIGVHADFVNSILGRQMQLMQDN 242 (463) Q Consensus 163 ~l~~~~~~~gleFDGl~~lI~~~nviDarG~~ls~~~ln~aa~~i~~~~G~~td~~m~~~vka~f~~~~~~~qrv~~~~n 242 (463) +- .+..|+...+...+.... ..++.+.|.++...+..+|+.++...|++.+.+.+...--..-|.+.+.. T Consensus 152 ~~--------~~~~~~~~~~~~~~~~~~--~~~~~~~i~~~~~~l~~~~~~~~~~v~n~~~~~~L~~l~d~~G~~~~~~~ 221 (324) T protein:vir:93 152 NN--------PFGKSIAQSIEKTNKVIK--GDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKERIYDR 221 (324) T ss_pred CC--------CcCccccccccccceecc--ccccHHHHHHHHHhhhhccCCCCEEEEcHHHHHHHHHhhCCCCCeeecCC Confidence 41 134556555555454443 34677888888888898998888999999999999754333444455444 Q ss_pred CCCcccceecCeeeecccccccCCceeccCccccccccccCCCCCCCCeeEEEEeccCCCcCcccccccc--------eE Q lcl|NC_019448. 243 SGNVNTGYSVNGFYSSRGFIKLHGSTVMENELILDESLQPLPNAPQPAKVTATVETKQKGAFEDEEDRAG--------LS 314 (463) Q Consensus 243 ~g~~~~G~~v~~~~s~~G~i~l~~s~~~~~d~~l~~~~~~~p~ap~p~~vtat~~~~~~g~~~~~~~~a~--------ys 314 (463) .+..-.|++|- ++...+.. .+.+++- | ...... .-..... ...............+... -. T Consensus 222 ~~~~l~G~PVv--~~~~~~~~-~~~i~~g-d----fs~~~~-~~~~~~~--i~~~~~~~~~~~~~~~~~~~~~f~~n~~~ 290 (324) T protein:vir:93 222 NSDSLDGLPVV--NLKSSNLK-RGELITG-D----FDKLIY-GIPQLIE--YKIDETAQLSTVKNEDGTPVNLFEQDMVA 290 (324) T ss_pred CCCcccceeeE--eecCCCCC-cceEEEE-e----cceEEE-EEecCcE--EEEeecccccccccccccchhhhhcCcEE Confidence 44444566542 12111111 0011111 0 000000 0000111 1111111000000001111 11 Q ss_pred EEEEEEecCCccccccceeeeecCCCCceEEEEEecCCCCCCcceE Q lcl|NC_019448. 315 YKVVVNSDDAQSAPSEEVTATVSNVDDGVKLSISVNAMYQQQPQFV 360 (463) Q Consensus 315 YkV~a~s~~geS~~S~~vt~Tva~~~~gv~ltIt~~a~~g~~~~~y 360 (463) +++...-+.+ +....+=+.|+.. .+++.++|.-+ T Consensus 291 ~r~~~r~d~~-----------v~~~~a~~~l~~a-~~~~~~~~~~~ 324 (324) T protein:vir:93 291 LRATMHVALH-----------IADDKAFAKLVPA-DKRTDSVPGEV 324 (324) T ss_pred EEEEEEeccE-----------EecccceEEEecc-cccCCCCCCCC Confidence 2222111111 1111111222221 22223333212 No 17 >protein:vir:104388 Length: 566 # NCBI annotation: hypothetical protein # Family: family:all:1544 # MgeID: mge:1471 # MgeName: 86 # Cross-refs: genbank:acc:YP_794072;genbank:gi:116222017;genbank:GeneID:4397450 Probab=97.96 E-value=1.5e-06 Score=52.59 Aligned_cols=266 Identities=16% Similarity=0.197 Sum_probs=106.0 Q ss_pred CcceEeccCCCCCHHHHhhhh-------------hhhhhcC-CceeEE---ecCHHHHHHHHHHhcCcceEEeecCC-CC Q lcl|NC_019448. 184 KNNVINAKGNQLTEKHLNEAA-------------VRIGKGF-GTATDA---YMPIGVHADFVNSILGRQMQLMQDNS-GN 245 (463) Q Consensus 184 ~~nviDarG~~ls~~~ln~aa-------------~~i~~~~-G~~td~---~m~~~vka~f~~~~~~~qrv~~~~n~-g~ 245 (463) +--. .|...+||-+. ..| ..| |...-+ .||......=.|-.+.+- +|.|=+. -. T Consensus 1 ~~~~------~~~~~~~~~~~~~~~~~~~~~M~~i~i-~~f~Ge~Pr~~p~lLP~~~a~~A~n~~~~~G-~itP~~~~~~ 72 (566) T protein:vir:10 1 MPIA------ILANSIINPLIFKPEAVKGISMPYIDI-TTMRGMMPRVVTSMLPDHSAVLAEDCHFRFG-VITPERQISG 72 (566) T ss_pred Ccee------eehhhhccceeecccccccceeeEEee-cccccccccchhhhccccccceEEeeeecCC-eeeeeecccc Confidence 1111 11112222111 011 111 111110 111111111111111100 0000000 00 Q ss_pred cccceecC--e---------------eeecccccccC--CceeccCcc---cccc-----ccccCC------CCCCCCee Q lcl|NC_019448. 246 VNTGYSVN--G---------------FYSSRGFIKLH--GSTVMENEL---ILDE-----SLQPLP------NAPQPAKV 292 (463) Q Consensus 246 ~~~G~~v~--~---------------~~s~~G~i~l~--~s~~~~~d~---~l~~-----~~~~~p------~ap~p~~v 292 (463) ...-+.++ . +.-++|.|+-+ ..+.+.+|. +..- ...+.| ..|+|.+. T Consensus 73 ~~~~~~~~~kTif~y~~~~W~~w~~~V~~ir~PvAqD~~~rvY~tg~~~Pk~t~~diAt~g~~~~pa~~y~LgVPaPs~a 152 (566) T protein:vir:10 73 VEKTFTIKPKTIFHYRDDFWFAWPDVVDVIRSPVAQDNYGRIYYTDGKFPKVTAAEIATKGEGNFPAASYRLGIPAPTTA 152 (566) T ss_pred cccccccCceeeeeecCcEeEEeCCceeeccCccccCCcceEEEeeCCcceeeecceeeccccccccccccccCCCCccc Confidence 00000000 0 01122222221 111111111 0000 000001 22233221 Q ss_pred EEEEeccCCCc-CcccccccceEEEEEEEecCC-ccccccce-eeeecCCCCceEEEEEecCCCCCCcceEEEEeecCC- Q lcl|NC_019448. 293 TATVETKQKGA-FEDEEDRAGLSYKVVVNSDDA-QSAPSEEV-TATVSNVDDGVKLSISVNAMYQQQPQFVSIYRQGKE- 368 (463) Q Consensus 293 tat~~~~~~g~-~~~~~~~a~ysYkV~a~s~~g-eS~~S~~v-t~Tva~~~~gv~ltIt~~a~~g~~~~~y~IYR~~~~- 368 (463) ..+..+++.|. ..++.+-.++.|+++.+...| ||.||.+- ..++...+..|.|++.+.+..+..-+...|||+..+ T Consensus 153 pv~~~~~~sg~~~~~~~d~~tr~Yv~TfVt~~GeES~PS~~S~~v~v~~~gs~V~ltl~~~p~~~~~i~~~RIYRS~tg~ 232 (566) T protein:vir:10 153 PVCTVQKGEGATDENPNDDETRFYTETFVSAYGEEGPPGPESLEVTVGIPDTPVQLTLSPVPLQDANINRRRIYRSVSGG 232 (566) T ss_pred ceeeccCCCcccCCCCcccceeEEEEEEEcCCCCcCCCccccceeEecCCCceEEEEecCCCcCcCCceeEEEEEecCCC Confidence 11122323321 234456678999999998888 78887543 345544444577777665555555577999998755 Q ss_pred -CceEEEEEEeeeeeecCCceEEEEec------cCCCC------------CCccc-------eecC-------Cch---- Q lcl|NC_019448. 369 -TGMYFLIKRVPVKDAQEDGTIVFVDK------NETLP------------ETADV-------FVGE-------MSP---- 411 (463) Q Consensus 369 -~g~~~li~rv~~s~~n~~gtttf~D~------N~~iP------------gt~~~-------fvGe-------~~p---- 411 (463) +++|+|+++.+. +.++|+|. ++.|| |-..| |.|+ +.| T Consensus 233 ~gtdy~lVael~a------s~~sf~Dd~~~~~lg~~Lps~~w~~PP~~m~GL~~m~NGimAgF~GneV~FsEpylPyAWP 306 (566) T protein:vir:10 233 GEADFLLVAELEA------SVLSYTDNIPAKNLGPSLATWDYLPPPENMTGLCLMANGIAAGFAGNEVMFSEAYLPYAWP 306 (566) T ss_pred CceeEEEEeeecc------cceeeeccccccccCcccccccccCcCcccceeeecccceEEeecCCEEEEecCCCCcccc Confidence 369999998753 56789987 44444 11111 2221 111 Q ss_pred ----HH-----HHh----------------------hhhcchhhcCCcccCCcceeeeeeechhheecceeeEEEE---E Q lcl|NC_019448. 412 ----QV-----VHL----------------------FELLPMMKLPLAQINASITFAVLWYGALALRAPKKWARIK---N 457 (463) Q Consensus 412 ----qv-----i~l----------------------~ellPm~k~pla~~na~~~~~V~~Yg~L~l~aPkk~~~ik---N 457 (463) ++ +.+ -.-+-++||+..+.=-+-+-+|.+=|...--.|.-.+.|. | T Consensus 307 ~~Yr~t~~~dIVaiA~~gt~LVV~TkG~PYl~sG~sP~sms~~kL~~~qaCvS~rsiV~~~g~v~Yas~dGLv~v~a~g~ 386 (566) T protein:vir:10 307 EVNRHTTAEDIVAVCPLGTSLVVATKGEPYLFSGVSPSTISGSKIPSMQACLSRQSMVAMEGFVLYAGTNGLVSVDANGN 386 (566) T ss_pred hhhccCCCCCeEEEEeccceEEEEEcCceEEEEcCChhhccccccccccccccccceeeecceEEeecCCceEEEecCCC Confidence 11 100 1122234555444444666677776666666777777764 2 Q ss_pred Ee--EecC Q lcl|NC_019448. 458 VR--YIAV 463 (463) Q Consensus 458 V~--~~~~ 463 (463) .+ -.-+ T Consensus 387 a~vvT~~l 394 (566) T protein:vir:10 387 AALATEQI 394 (566) T ss_pred hhhhhhhh Confidence 21 1111 No 18 >protein:vir:97148 Length: 324 # NCBI annotation: ORF010 # Family: family:all:507 # MgeID: mge:1654 # MgeName: 85 # Cross-refs: genbank:acc:YP_239726;genbank:gi:66394880;genbank:GeneID:5130881 Probab=97.87 E-value=1.3e-05 Score=47.31 Aligned_cols=314 Identities=13% Similarity=0.084 Sum_probs=154.2 Q ss_pred CCCCCccchHHHHhhhhhhHHHHHH--hhcCCccCCccccCccccchhhhhhHhhhhhccccccchhhhcccchhhHHHh Q lcl|NC_019448. 1 MTIEKNLSDVQQKYADQFQEDVVKS--FQTGYGITPDTQIDAGALRREILDDQITMLTWTNEDLIFYRDISRRPAQSTVV 78 (463) Q Consensus 1 ~~~~~~~~~~~~~~~k~~~e~~~Ks--~~agy~~~p~~q~~gaalr~esLd~~i~~L~~~~~df~f~~~i~k~~~~stv~ 78 (463) |-.+.+....- ++|...+.+. +.+.- + -.+..++..+..+..++-+..+ .+...+.+...+.+..+--. T Consensus 1 ~~~~~~~~~~~----~~f~~~~~~~~~~~a~~-~-~~~~~~~~~iP~~~~~~ii~~~---~~~s~l~~~~~~~~~~~~~~ 71 (324) T protein:vir:97 1 MEQTQKLKLNL----QHFASNNVKPQVFNPDN-V-MMHEKKDGTLMNEFTTPILQEV---MENSKIMQLGKYEPMEGTEK 71 (324) T ss_pred CccchhHHHHH----HHHHHhhhhhhhhcccc-c-cccCCCcceechhHHHHHHHHH---HhhcchhhhcceeeccCCce Confidence 44443222222 2333332322 22221 1 1122233344544444333333 23334555555555554434 Q ss_pred hhhhhhccCcccccccccccCcccccCcceEEEEEEEEEeechhhhhhhhhhhcccccHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_019448. 79 KYDQYLRHGNVGHSRFVKEIGVAPVSDPNIRQKTVSMKYVSDTKNMSIASGLVNNIADPSQILTEDAIAVVAKTIEWASF 158 (463) Q Consensus 79 ey~~~~~hG~~g~~~fv~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lvn~~~Dp~~~~~~~ai~~~~~~~E~a~f 158 (463) +|.++.. .+...+++|++..+.+++.+.......+=++---.+|.-+ +.++..|.+....+.-...++..+|.++| T Consensus 72 ~ip~~~~---~~~a~~v~Eg~~~~~~~~~f~~v~~~~~k~~~~~~is~el-l~ds~~~l~~~i~~~l~~aia~~~d~a~l 147 (324) T protein:vir:97 72 KFTFWAD---KPGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEF-LNYTYSQFFEEMKPMIAEAFYKKFDEAGI 147 (324) T ss_pred EEEEEec---CcceeEeccCccccccccceeEEEEeeEEEEEeehhhHHH-HhcchHHHHHHHHHHHHHHHHHHHHHHhh Confidence 5555553 2356799999999999999999999999999888888732 23445678888999999999999999999 Q ss_pred hcccccCCCccccccccccceeeecCcceEeccCCCCCHHHHhhhhhhhhhcCCceeEEecCHHHHHHHHHHhcCcceEE Q lcl|NC_019448. 159 YGDASLTSEVEGEGLEFDGLAKLIDKNNVINAKGNQLTEKHLNEAAVRIGKGFGTATDAYMPIGVHADFVNSILGRQMQL 238 (463) Q Consensus 159 yGd~~l~~~~~~~gleFDGl~~lI~~~nviDarG~~ls~~~ln~aa~~i~~~~G~~td~~m~~~vka~f~~~~~~~qrv~ 238 (463) .|+..= .+..|+.+.+...+.... ..++.+.|.++...+..+|..+.-..|++.+.+.+.+.--+.-|-+ T Consensus 148 ~G~g~~--------~~~~gi~~~~~~~~~~~~--~~~~~~~i~~~~~~l~~~~~~~~~~v~n~~~~~~L~~lkd~~g~~~ 217 (324) T protein:vir:97 148 LNQGNN--------PFGKSIAQSIEKTNKVIK--GDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKER 217 (324) T ss_pred ccCCCC--------ccCccccccccccceecc--ccCCHHHHHHHHHhhhhccCCCCEEEEcHHHHHHHHHhhcCCCcee Confidence 998642 245677777776665544 4567888888988999999888889999999999875432222223 Q ss_pred eecCCCCcccceecCeeeecccccccCCceeccC--ccccccccccCCCCCCCCeeE------EEEeccCCCcCcccccc Q lcl|NC_019448. 239 MQDNSGNVNTGYSVNGFYSSRGFIKLHGSTVMEN--ELILDESLQPLPNAPQPAKVT------ATVETKQKGAFEDEEDR 310 (463) Q Consensus 239 ~~~n~g~~~~G~~v~~~~s~~G~i~l~~s~~~~~--d~~l~~~~~~~p~ap~p~~vt------at~~~~~~g~~~~~~~~ 310 (463) .+...++.-.|++|- .+..-++. .+.+++-+ ..++-. .....+. .+...+..+...+.-.. T Consensus 218 ~~~~~~~tl~G~PV~--~~~~~~~~-~~~~~~gd~~~~~i~~--------~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~ 286 (324) T protein:vir:97 218 IYDRNSDTLDGLPVV--NLKSSNLK-RGELITGDFDKLIYGI--------PQLIEYKIDETAQLSTVKNEDGTPVNLFEQ 286 (324) T ss_pred ecCCCCccccceeeE--eecCCCCC-cceEEEEecccEEEEE--------ecCcEEEEeecccccccccccccchhhhhc Confidence 332222223444431 11100000 00011100 000000 0000000 00000000000000000 Q ss_pred cceEEEEEEEecCCccccccceeeeecCCCCceEEEEEecCCCCCCcceE Q lcl|NC_019448. 311 AGLSYKVVVNSDDAQSAPSEEVTATVSNVDDGVKLSISVNAMYQQQPQFV 360 (463) Q Consensus 311 a~ysYkV~a~s~~geS~~S~~vt~Tva~~~~gv~ltIt~~a~~g~~~~~y 360 (463) .--.+++... ...-+....+=+.|+++ .+...++|.=+ T Consensus 287 d~~~~r~~~r-----------~d~~v~~~~a~~~l~~~-~~~~~~~~~~~ 324 (324) T protein:vir:97 287 DMVALRATMH-----------VALHIADDKAFAKLVPA-DKKTDSVPGEV 324 (324) T ss_pred CcEEEEEEEE-----------eccEEecccceEEEEec-cCCCCCCCCCC Confidence 0011111111 11112112222334443 22234444222 No 19 >protein:vir:99749 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1497 # MgeName: phiETA2 # Cross-refs: genbank:acc:YP_001004307;genbank:gi:122891761;genbank:GeneID:4712304 Probab=97.84 E-value=1.5e-05 Score=46.99 Aligned_cols=311 Identities=13% Similarity=0.070 Sum_probs=152.6 Q ss_pred CCCCCccchHHHHhhhhhhHHHHHH--hhcCCccCCccccC-ccccchhhhhhHhhhhhccccccchhhhcccchhhHHH Q lcl|NC_019448. 1 MTIEKNLSDVQQKYADQFQEDVVKS--FQTGYGITPDTQID-AGALRREILDDQITMLTWTNEDLIFYRDISRRPAQSTV 77 (463) Q Consensus 1 ~~~~~~~~~~~~~~~k~~~e~~~Ks--~~agy~~~p~~q~~-gaalr~esLd~~i~~L~~~~~df~f~~~i~k~~~~stv 77 (463) |-.+.+..-.- ++|...+.+. |.+.. -....+ ++-+..+..++-+..+... ..+.+.....++.+.- T Consensus 1 ~~k~~~~~~~~----~~~~~~~~~~~~~~a~~---~~~~~~~~~lip~~~~~~ii~~~~~~---s~l~~~~~~~~~~~~~ 70 (324) T protein:vir:99 1 MEQTQKLKLNL----QHFASNNVKPQVFNPDN---VMMHEKKDGTLLNDFTTPILQEVMEN---SKIMRLGKYEPMEGTE 70 (324) T ss_pred CCCchHhhHHH----HHHHHHhhhhhhccccc---eeccCCCcceechhHHHHHHHHHHhh---chhhhhcceeeccCCc Confidence 65554444332 2233332222 22221 111123 3345555543333333222 2344444444444433 Q ss_pred hhhhhhhccCcccccccccccCcccccCcceEEEEEEEEEeechhhhhhhhhhhcccccHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019448. 78 VKYDQYLRHGNVGHSRFVKEIGVAPVSDPNIRQKTVSMKYVSDTKNMSIASGLVNNIADPSQILTEDAIAVVAKTIEWAS 157 (463) Q Consensus 78 ~ey~~~~~hG~~g~~~fv~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lvn~~~Dp~~~~~~~ai~~~~~~~E~a~ 157 (463) .+|.++.. .+...+++|++..+..++.+.+.....+=++..-.+|.-+- .++..|.+....+.-..++++.+|.++ T Consensus 71 ~~~p~~~~---~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell-~ds~~~l~~~i~~~l~~ai~~~~d~~~ 146 (324) T protein:vir:99 71 KKFTFWAD---KPGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFL-NYTYSQFFEEMKPMIAEAFYKKFDEAG 146 (324) T ss_pred eEEEEEec---CcceeEeccCccccccccceeEEEEeeEEEEEeehhhHHHH-hcchHHHHHHHHHHHHHHHHHHHHHHh Confidence 44555442 34578999999999999999999999999998888877432 234457888888999999999999999 Q ss_pred hhcccccCCCccccccccccceeeecCcceEeccCCCCCHHHHhhhhhhhhhcCCceeEEecCHHHHHHHHHHhcCcceE Q lcl|NC_019448. 158 FYGDASLTSEVEGEGLEFDGLAKLIDKNNVINAKGNQLTEKHLNEAAVRIGKGFGTATDAYMPIGVHADFVNSILGRQMQ 237 (463) Q Consensus 158 fyGd~~l~~~~~~~gleFDGl~~lI~~~nviDarG~~ls~~~ln~aa~~i~~~~G~~td~~m~~~vka~f~~~~~~~qrv 237 (463) ++|+-.= .+..|+.+.+...+.... ..++.+.|.++...+..+|..+.-+.|++.+.+.+...--..-|. T Consensus 147 l~G~g~~--------~~~~~~~~~~~~~~~~~~--~~~~~~~i~~~~~~l~~~~~~~~~~v~n~~~~~~L~~l~d~~g~~ 216 (324) T protein:vir:99 147 ILNQGNN--------PFGKSIAQSIEKTNKVIK--GDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDPETKE 216 (324) T ss_pred hhcCCCC--------ccCccccccccccceecc--ccCCHHHHHHHHHhhhhccCCCCEEEEcHHHHHHHHHhhcCCCce Confidence 9997641 234456666655555443 456778888888889989988888999999999987543222233 Q ss_pred EeecCCCCcccceecCeeeecccccccCCceeccC--ccccccccccCCCCCCCCeeEEEEeccCCCcCcccccccc--- Q lcl|NC_019448. 238 LMQDNSGNVNTGYSVNGFYSSRGFIKLHGSTVMEN--ELILDESLQPLPNAPQPAKVTATVETKQKGAFEDEEDRAG--- 312 (463) Q Consensus 238 ~~~~n~g~~~~G~~v~~~~s~~G~i~l~~s~~~~~--d~~l~~~~~~~p~ap~p~~vtat~~~~~~g~~~~~~~~a~--- 312 (463) +.+...++.-.|.+|- .+..-+.. .+.+++-+ ..++ .. .....+ ..............+... T Consensus 217 ~~~~~~~~~l~G~PVv--~~~~~~~~-~~~~i~gd~~~~~~-~~-------~~~~~i--~~~~~~~~~~~~~~~~~~~~~ 283 (324) T protein:vir:99 217 RIYDRNSDTLDGLPVV--NLKSSNLK-RGELITGDFDKLIY-GI-------PQLIEY--KIDETAQLSTVKNEDGTPVNL 283 (324) T ss_pred eecCCCCccccceeEE--eecCCCCC-cceEEEEecccEEE-EE-------ecCcEE--EEeecccccccccccccchhh Confidence 3333333334455441 11110000 00111110 0000 00 011111 110110000000000001 Q ss_pred -----eEEEEEEEecCCccccccceeeeecCCCCceEEEEEecCCCCCCcceE Q lcl|NC_019448. 313 -----LSYKVVVNSDDAQSAPSEEVTATVSNVDDGVKLSISVNAMYQQQPQFV 360 (463) Q Consensus 313 -----ysYkV~a~s~~geS~~S~~vt~Tva~~~~gv~ltIt~~a~~g~~~~~y 360 (463) ..+++...- ...+....+=+.|+.. .+...++|.=+ T Consensus 284 f~~~~~~~r~~~r~-----------d~~v~~~~a~~~lt~a-~~~~~~~~~~~ 324 (324) T protein:vir:99 284 FEQDMVALRATMHV-----------ALHIADDKAFAKLVPA-DKKTDSVPGEV 324 (324) T ss_pred hhcCcEEEEEEEEE-----------ccEEecccceEEEEec-cCCCCCCCCCC Confidence 111111111 1111111112233332 22223333112 No 20 >protein:vir:105905 Length: 304 # NCBI annotation: major capsid protein # Family: family:all:507 # MgeID: mge:1514 # MgeName: phiETA3 # Cross-refs: genbank:acc:YP_001004375;genbank:gi:122891830;genbank:GeneID:4712376 Probab=97.83 E-value=1.1e-05 Score=47.76 Aligned_cols=296 Identities=12% Similarity=0.040 Sum_probs=145.4 Q ss_pred hhcCCccCC----ccccCccccchhhhhhHhhhhhccccccchhhhcccchhhHHHhhhhhhhccCcccccccccccCcc Q lcl|NC_019448. 26 FQTGYGITP----DTQIDAGALRREILDDQITMLTWTNEDLIFYRDISRRPAQSTVVKYDQYLRHGNVGHSRFVKEIGVA 101 (463) Q Consensus 26 ~~agy~~~p----~~q~~gaalr~esLd~~i~~L~~~~~df~f~~~i~k~~~~stv~ey~~~~~hG~~g~~~fv~E~g~~ 101 (463) |..+. .++ .+..+|..+..+..++-+..+... -.+.+.....+..+-..+|.++. +.....+++|++.. T Consensus 1 ma~~~-~~~~~~~~t~~gg~lip~~~~~~ii~~~~~~---~~l~~~~~~~~~~~~~~~ip~~~---~~~~a~~v~E~~~~ 73 (304) T protein:vir:10 1 MATPT-YTPGNVILSDFKNGVIPAEQGTLIMKDIMAN---SAIMKLAKNEPMTAQKKKFTYLA---KGVGAYWVSETERI 73 (304) T ss_pred Ccccc-cccccccccCCCceecchhHHHHHHHHHHhc---cchhhhcceeeccCCceEEEEEe---CCcceEEeecCccc Confidence 22221 111 122334446666544444344332 33555555555555444455554 23456799999999 Q ss_pred cccCcceEEEEEEEEEeechhhhhhhhhhhcccccHHHHHHHHHHHHHHHHHHHHHhhcccccCCCccccccccccceee Q lcl|NC_019448. 102 PVSDPNIRQKTVSMKYVSDTKNMSIASGLVNNIADPSQILTEDAIAVVAKTIEWASFYGDASLTSEVEGEGLEFDGLAKL 181 (463) Q Consensus 102 ~~~d~~~~r~~~~~k~l~~~~~vs~~~~lvn~~~Dp~~~~~~~ai~~~~~~~E~a~fyGd~~l~~~~~~~gleFDGl~~l 181 (463) +.+++.+.......+=++.--.+|.-+ +.++..|.+....+.-...+++.+|.++|+|+.+-.+ .|..-.|+..- T Consensus 74 ~~~~~~~~~i~~~~~k~~~~~~iS~el-l~ds~~~l~~~i~~~l~~~ia~~~d~~~l~G~g~~~~----~~~~~~~~~~~ 148 (304) T protein:vir:10 74 QTSKPEYAQAEMEAKKIGVIIPLSKEF-LKWTAKDFFNEVKPLIAEAFYKAFDQAVIFGTKSPYN----TSTSGKPLVEG 148 (304) T ss_pred ccccceeeEEEEEEEEEEEeehhhHHH-HhcchHHHHHHHHHHHHHHHHHHHHhhheeccCCCcc----ccccccccccc Confidence 999999999999999999888887754 4455678888888888899999999999999876433 22223333332 Q ss_pred ecCcceEeccCCCCCHHHHhhhhhhhhhcCCceeEEecCHHHHHHHHHHhcCcceEEeecCCCCcccceecCeeeecccc Q lcl|NC_019448. 182 IDKNNVINAKGNQLTEKHLNEAAVRIGKGFGTATDAYMPIGVHADFVNSILGRQMQLMQDNSGNVNTGYSVNGFYSSRGF 261 (463) Q Consensus 182 I~~~nviDarG~~ls~~~ln~aa~~i~~~~G~~td~~m~~~vka~f~~~~~~~qrv~~~~n~g~~~~G~~v~~~~s~~G~ 261 (463) +.. ......+.-.+-+.|.++...+..+|....-.+|++.+.+.|...--...|.+.+++.+ .-.|++|- .+.... T Consensus 149 ~~~-~~~~~~~~~~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~L~~lkd~~G~~l~~~~~~-~l~G~PV~--~~~~~~ 224 (304) T protein:vir:10 149 AEE-KGNVVTDTNNLYVDLSALMATIEDEELDPNGVLTTRSFRSKMRNALDANDRPLFDANGN-EIMGLPLS--YTGADV 224 (304) T ss_pred ccc-cccccccccchHHHHHHHHHHhhhccCCcCEEEEcHHHHHHHHHhhccCCcEeecCCCc-cccceeeE--Eecccc Confidence 221 22333344556677777888888888888889999999999985433333555555443 23355441 111111 Q ss_pred cccCCc-eeccC--ccc-cccccccCCCCCCCCeeEEEEeccCCCcCcccccccceEEEEEEEecCCccccccceeeeec Q lcl|NC_019448. 262 IKLHGS-TVMEN--ELI-LDESLQPLPNAPQPAKVTATVETKQKGAFEDEEDRAGLSYKVVVNSDDAQSAPSEEVTATVS 337 (463) Q Consensus 262 i~l~~s-~~~~~--d~~-l~~~~~~~p~ap~p~~vtat~~~~~~g~~~~~~~~a~ysYkV~a~s~~geS~~S~~vt~Tva 337 (463) ...... .++.+ ..+ -++.....-- ---+++.-.......|+..+.-..-.-.|++...-+..=--|...+-.+.+ T Consensus 225 ~~~~~~~~~~gd~~~~~~~~~~~~~i~~-~~e~~~~~~~~~~~~g~~~~~f~~~~~~~r~~~r~~~~v~~~~a~~~l~~a 303 (304) T protein:vir:10 225 YDKKKSLALMGDWDYARYGILQGIEYAI-SEDATLTTLQASDASGQPVSLFERDMFALRATMHIAYMNVKPEAFATLKPT 303 (304) T ss_pred cCCCCcEEEEEehhhEEEEEecceEEEE-eecceeeeecccccCccchhhhhcCcEEEEEEEEeccEeecccceEEEEec Confidence 111111 11110 000 0000000000 000000000000011110000000012222222211111112222222222 Q ss_pred C Q lcl|NC_019448. 338 N 338 (463) Q Consensus 338 ~ 338 (463) - T Consensus 304 ~ 304 (304) T protein:vir:10 304 E 304 (304) T ss_pred C Confidence 1 No 21 >protein:vir:94142 Length: 304 # NCBI annotation: ORF013 # Family: family:all:507 # MgeID: mge:1494 # MgeName: 96 # Cross-refs: genbank:acc:YP_240234;genbank:gi:66395898;genbank:GeneID:5133311 Probab=97.83 E-value=1.1e-05 Score=47.76 Aligned_cols=296 Identities=12% Similarity=0.040 Sum_probs=145.4 Q ss_pred hhcCCccCC----ccccCccccchhhhhhHhhhhhccccccchhhhcccchhhHHHhhhhhhhccCcccccccccccCcc Q lcl|NC_019448. 26 FQTGYGITP----DTQIDAGALRREILDDQITMLTWTNEDLIFYRDISRRPAQSTVVKYDQYLRHGNVGHSRFVKEIGVA 101 (463) Q Consensus 26 ~~agy~~~p----~~q~~gaalr~esLd~~i~~L~~~~~df~f~~~i~k~~~~stv~ey~~~~~hG~~g~~~fv~E~g~~ 101 (463) |..+. .++ .+..+|..+..+..++-+..+... -.+.+.....+..+-..+|.++. +.....+++|++.. T Consensus 1 ma~~~-~~~~~~~~t~~gg~lip~~~~~~ii~~~~~~---~~l~~~~~~~~~~~~~~~ip~~~---~~~~a~~v~E~~~~ 73 (304) T protein:vir:94 1 MATPT-YTPGNVILSDFKNGVIPAEQGTLIMKDIMAN---SAIMKLAKNEPMTAQKKKFTYLA---KGVGAYWVSETERI 73 (304) T ss_pred Ccccc-cccccccccCCCceecchhHHHHHHHHHHhc---cchhhhcceeeccCCceEEEEEe---CCcceEEeecCccc Confidence 22221 111 122334446666544444344332 33555555555555444455554 23456799999999 Q ss_pred cccCcceEEEEEEEEEeechhhhhhhhhhhcccccHHHHHHHHHHHHHHHHHHHHHhhcccccCCCccccccccccceee Q lcl|NC_019448. 102 PVSDPNIRQKTVSMKYVSDTKNMSIASGLVNNIADPSQILTEDAIAVVAKTIEWASFYGDASLTSEVEGEGLEFDGLAKL 181 (463) Q Consensus 102 ~~~d~~~~r~~~~~k~l~~~~~vs~~~~lvn~~~Dp~~~~~~~ai~~~~~~~E~a~fyGd~~l~~~~~~~gleFDGl~~l 181 (463) +.+++.+.......+=++.--.+|.-+ +.++..|.+....+.-...+++.+|.++|+|+.+-.+ .|..-.|+..- T Consensus 74 ~~~~~~~~~i~~~~~k~~~~~~iS~el-l~ds~~~l~~~i~~~l~~~ia~~~d~~~l~G~g~~~~----~~~~~~~~~~~ 148 (304) T protein:vir:94 74 QTSKPEYAQAEMEAKKIGVIIPLSKEF-LKWTAKDFFNEVKPLIAEAFYKAFDQAVIFGTKSPYN----TSTSGKPLVEG 148 (304) T ss_pred ccccceeeEEEEEEEEEEEeehhhHHH-HhcchHHHHHHHHHHHHHHHHHHHHhhheeccCCCcc----ccccccccccc Confidence 999999999999999999888887754 4455678888888888899999999999999876433 22223333332 Q ss_pred ecCcceEeccCCCCCHHHHhhhhhhhhhcCCceeEEecCHHHHHHHHHHhcCcceEEeecCCCCcccceecCeeeecccc Q lcl|NC_019448. 182 IDKNNVINAKGNQLTEKHLNEAAVRIGKGFGTATDAYMPIGVHADFVNSILGRQMQLMQDNSGNVNTGYSVNGFYSSRGF 261 (463) Q Consensus 182 I~~~nviDarG~~ls~~~ln~aa~~i~~~~G~~td~~m~~~vka~f~~~~~~~qrv~~~~n~g~~~~G~~v~~~~s~~G~ 261 (463) +.. ......+.-.+-+.|.++...+..+|....-.+|++.+.+.|...--...|.+.+++.+ .-.|++|- .+.... T Consensus 149 ~~~-~~~~~~~~~~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~L~~lkd~~G~~l~~~~~~-~l~G~PV~--~~~~~~ 224 (304) T protein:vir:94 149 AEE-KGNVVTDTNNLYVDLSALMATIEDEELDPNGVLTTRSFRSKMRNALDANDRPLFDANGN-EIMGLPLS--YTGADV 224 (304) T ss_pred ccc-cccccccccchHHHHHHHHHHhhhccCCcCEEEEcHHHHHHHHHhhccCCcEeecCCCc-cccceeeE--Eecccc Confidence 221 22333344556677777888888888888889999999999985433333555555443 23355441 111111 Q ss_pred cccCCc-eeccC--ccc-cccccccCCCCCCCCeeEEEEeccCCCcCcccccccceEEEEEEEecCCccccccceeeeec Q lcl|NC_019448. 262 IKLHGS-TVMEN--ELI-LDESLQPLPNAPQPAKVTATVETKQKGAFEDEEDRAGLSYKVVVNSDDAQSAPSEEVTATVS 337 (463) Q Consensus 262 i~l~~s-~~~~~--d~~-l~~~~~~~p~ap~p~~vtat~~~~~~g~~~~~~~~a~ysYkV~a~s~~geS~~S~~vt~Tva 337 (463) ...... .++.+ ..+ -++.....-- ---+++.-.......|+..+.-..-.-.|++...-+..=--|...+-.+.+ T Consensus 225 ~~~~~~~~~~gd~~~~~~~~~~~~~i~~-~~e~~~~~~~~~~~~g~~~~~f~~~~~~~r~~~r~~~~v~~~~a~~~l~~a 303 (304) T protein:vir:94 225 YDKKKSLALMGDWDYARYGILQGIEYAI-SEDATLTTLQASDASGQPVSLFERDMFALRATMHIAYMNVKPEAFATLKPT 303 (304) T ss_pred cCCCCcEEEEEehhhEEEEEecceEEEE-eecceeeeecccccCccchhhhhcCcEEEEEEEEeccEeecccceEEEEec Confidence 111111 11110 000 0000000000 000000000000011110000000012222222211111112222222222 Q ss_pred C Q lcl|NC_019448. 338 N 338 (463) Q Consensus 338 ~ 338 (463) - T Consensus 304 ~ 304 (304) T protein:vir:94 304 E 304 (304) T ss_pred C Confidence 1 No 22 >protein:vir:8102 Length: 543 # NCBI annotation: gp6 # Family: family:all:21 # MgeID: mge:152 # MgeName: Che9c # Cross-refs: genbank:acc:NP_817683;genbank:gi:29566114;genbank:GeneID:1259308 Probab=97.80 E-value=1.8e-05 Score=46.56 Aligned_cols=318 Identities=10% Similarity=0.012 Sum_probs=142.8 Q ss_pred CCCCCccchHHHHh---hhh-----hhHHHHHHhhcCCccCCccccCccc-cchhhhhhHhhhhhccccccchhhhcccc Q lcl|NC_019448. 1 MTIEKNLSDVQQKY---ADQ-----FQEDVVKSFQTGYGITPDTQIDAGA-LRREILDDQITMLTWTNEDLIFYRDISRR 71 (463) Q Consensus 1 ~~~~~~~~~~~~~~---~k~-----~~e~~~Ks~~agy~~~p~~q~~gaa-lr~esLd~~i~~L~~~~~df~f~~~i~k~ 71 (463) +....+-.....++ ++. +...-.+++...-..+ .+..+||. +..+.....|..+..... .+.+..... T Consensus 212 ~~~~~~~~~~~~a~~~~~~~~~~~~l~~~e~~~~~~~~~~~-~t~~~gg~lip~~~~~~ii~~~~~~~~--~l~~~~~~~ 288 (543) T protein:vir:81 212 QCLATSSPAYLRAWSKMARNPHAAILTEEEKRAINEVRAMG-LTKADGGYLVPFQLDPTVIITSNGSLN--DIRRFARQV 288 (543) T ss_pred hhhhhhhhhhhhHHHHHHHhhHHHHhhhhhhhhhhhhhhcc-cccccCcccCchhhhhHHHHHHHhhhc--hhhhhcccc Confidence 11111100000111 111 1111122332221111 12334444 444544444423222211 122222222 Q ss_pred hhhHHHhhhhhhhccCcccccccccccCcccccCcceEEEEEEEEEeechhhhhhhhhhhcccccHHHHHHHHHHHHHHH Q lcl|NC_019448. 72 PAQSTVVKYDQYLRHGNVGHSRFVKEIGVAPVSDPNIRQKTVSMKYVSDTKNMSIASGLVNNIADPSQILTEDAIAVVAK 151 (463) Q Consensus 72 ~~~stv~ey~~~~~hG~~g~~~fv~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lvn~~~Dp~~~~~~~ai~~~~~ 151 (463) .....+. |.+.. +.....+++|++..+.+++.+.+....++-++.-..+|.-+ +.++ .|.+....+.-...++. T Consensus 289 ~~~g~~~-~~~~~---~~~~a~~v~Eg~~~~~~~~~~~~i~~~~~k~~~~~~is~el-l~d~-~~~~~~i~~~l~~~~~~ 362 (543) T protein:vir:81 289 VATGDVW-HGVSS---AAVQWSWDAEFEEVSDDSPEFGQPEIPVKKAQGFVPISIEA-LQDE-ANVTETVALLFAEGKDE 362 (543) T ss_pred cCCcceE-EEEec---CCcceeecccCccccccccccceeeeeeeeeEeeehhhHHH-Hhcc-HHHHHHHHHHHHHHHHH Confidence 2223322 22222 23467799999999999999999999999999988888854 2344 58999999999999999 Q ss_pred HHHHHHhhcccccCCCccccccccccceeeec--CcceEeccCCCCCHHHHhhhhhhhhhcCCceeEEecCHHHHHHHHH Q lcl|NC_019448. 152 TIEWASFYGDASLTSEVEGEGLEFDGLAKLID--KNNVINAKGNQLTEKHLNEAAVRIGKGFGTATDAYMPIGVHADFVN 229 (463) Q Consensus 152 ~~E~a~fyGd~~l~~~~~~~gleFDGl~~lI~--~~nviDarG~~ls~~~ln~aa~~i~~~~G~~td~~m~~~vka~f~~ 229 (463) .++.++|+||-. +-++.|+.+.-. ...+..+.+..++.+.+..+...+..+|.....++|++.+.+.+.. T Consensus 363 ~~d~ail~G~Gt--------~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~v~n~~~~~~l~~ 434 (543) T protein:vir:81 363 LEAVTLTTGTGQ--------GNQPTGIVTALAGTAAEIAPVTAETFALADVYAVYEQLAARHRRQGAWLANNLIYNKIRQ 434 (543) T ss_pred HHHHHHhccCCC--------CcccccchhhcccccccccccccccccHHHHHHHHHhhhccccCCcEEEEcHHHHHHHHH Confidence 999999999843 126788876433 2345566677788888888888888888777789999999999985 Q ss_pred HhcCcceEEeec-CCC--CcccceecCeeeecccccccCCceeccCccccccccccCCCCCCCCeeEEEEeccCCCcCcc Q lcl|NC_019448. 230 SILGRQMQLMQD-NSG--NVNTGYSVNGFYSSRGFIKLHGSTVMENELILDESLQPLPNAPQPAKVTATVETKQKGAFED 306 (463) Q Consensus 230 ~~~~~qrv~~~~-n~g--~~~~G~~v~~~~s~~G~i~l~~s~~~~~d~~l~~~~~~~p~ap~p~~vtat~~~~~~g~~~~ 306 (463) .--..-|.+.++ ..| +.-.|++|- .+.. .........-.++..+...++-.--.-.-..++....+...+... T Consensus 435 lkd~~G~~l~~~~~~g~~~~l~G~pv~--~~~~-~~~~~~~~~~~~~~~i~~gd~~~~~i~~~~~~~i~~~~~~~~~~~- 510 (543) T protein:vir:81 435 FDTQGGAGLWTTIGNGEPSQLLGRPVG--EAEA-MDANWNTSASADNFVLLYGNFQNYVIADRIGMTVEFIPHLFGTNR- 510 (543) T ss_pred hhcCCCceeccCcCCCCCccccceeeE--Eecc-ccccccccccCCcceEEEeeccceeEEeecccEEEEeccccccch- Confidence 332222333332 111 111222110 0000 000000000000000000000000000000011111111110000 Q ss_pred cccccceEEEEEEEecCCccccccceeeeecCCCCceEEEEEecC Q lcl|NC_019448. 307 EEDRAGLSYKVVVNSDDAQSAPSEEVTATVSNVDDGVKLSISVNA 351 (463) Q Consensus 307 ~~~~a~ysYkV~a~s~~geS~~S~~vt~Tva~~~~gv~ltIt~~a 351 (463) .......|++... +..-+...++=+.++++..+ T Consensus 511 -~~~~~~~~~~~~r-----------~d~~v~~~~A~~~l~~~~~a 543 (543) T protein:vir:81 511 -RPNGSRGWFAYYR-----------MGADVVNPNAFRLLNVETAS 543 (543) T ss_pred -hhcCceEEEEEEe-----------eccEeecccceEEEEecccC Confidence 0000111222111 11122222222334443222 No 23 >protein:vir:7771 Length: 330 # NCBI annotation: gp17 # Family: family:all:507 # MgeID: mge:149 # MgeName: Bxz2 # Cross-refs: genbank:acc:NP_817605;genbank:gi:29566035;genbank:GeneID:1259229 Probab=97.78 E-value=6.6e-06 Score=48.97 Aligned_cols=304 Identities=11% Similarity=0.011 Sum_probs=150.6 Q ss_pred hhHHHHHHhhcCCccCCccccCccccchhhhhhHhhhhhccccccchhhhcccchhhHHHhhhhhhhccCcccccccccc Q lcl|NC_019448. 18 FQEDVVKSFQTGYGITPDTQIDAGALRREILDDQITMLTWTNEDLIFYRDISRRPAQSTVVKYDQYLRHGNVGHSRFVKE 97 (463) Q Consensus 18 ~~e~~~Ks~~agy~~~p~~q~~gaalr~esLd~~i~~L~~~~~df~f~~~i~k~~~~stv~ey~~~~~hG~~g~~~fv~E 97 (463) +.-+..|+..+ . .+...|+.+..+..++-+..|.. .-.+.+.....+..+.-.+|.+... .....+++| T Consensus 1 m~~~~~~a~~~---~--~t~~~g~~i~~~~~~~ii~~~~~---~s~l~~~~~~~~~~~~~~~~p~~~~---~~~a~~v~E 69 (330) T protein:vir:77 1 MAGSTVPSTQV---A--LTGDFSAFLTPEQSQDYFAEIEK---TSIVQRIARKVPMGPTGISIPHWTG---AVSASWTGE 69 (330) T ss_pred Ccccccchhhc---c--ccCCCcceechhHHHHHHHHHHh---ccchhhhcceeeccCCceEEEEEcC---CcceeEecC Confidence 11111222222 1 13345677888877665555543 3346666666666665556766653 345679999 Q ss_pred cCcccccCcceEEEEEEEEEeechhhhhhhhhhhcccccHHHHHHHHHHHHHHHHHHHHHhhcccccCCCcccccccccc Q lcl|NC_019448. 98 IGVAPVSDPNIRQKTVSMKYVSDTKNMSIASGLVNNIADPSQILTEDAIAVVAKTIEWASFYGDASLTSEVEGEGLEFDG 177 (463) Q Consensus 98 ~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lvn~~~Dp~~~~~~~ai~~~~~~~E~a~fyGd~~l~~~~~~~gleFDG 177 (463) ++..+.+++++.+....++=++.--.+|.-+ +.++..|.+....+.-...+++.+|.++|+|+.+ +-+++| T Consensus 70 g~~~~~~~~~f~~i~~~~~k~~~~~~is~el-l~ds~~~~~~~i~~~l~~ai~~~~~~~~l~G~g~--------~~~~~g 140 (330) T protein:vir:77 70 AERKPITKGSFGKQELEPVKITTIFAESAEV-VRLNPLNYLNTMRTKIAEAIALKFDAAAIHGIDK--------PSAFKG 140 (330) T ss_pred CCccccccceeeEEEEeEEEEEEeehhhHHH-HhcchHHHHHHHHHHHHHHHHHHHHHHhhcccCC--------CCcccc Confidence 9999999999999999999999888888743 3445678899999999999999999999999875 235788 Q ss_pred ceeeecCc-ceEec---cCCCCC---HHHHhhhhhhhhhcCCceeEEecCHHHHHHHHHHhcCcceEEeecCCCCc---- Q lcl|NC_019448. 178 LAKLIDKN-NVINA---KGNQLT---EKHLNEAAVRIGKGFGTATDAYMPIGVHADFVNSILGRQMQLMQDNSGNV---- 246 (463) Q Consensus 178 l~~lI~~~-nviDa---rG~~ls---~~~ln~aa~~i~~~~G~~td~~m~~~vka~f~~~~~~~qrv~~~~n~g~~---- 246 (463) +.+.+... .+.+. .+...+ .+.|.++-..+.+++...+-.+|++.+.+.+...--...|.+.+++.+.. T Consensus 141 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~vmn~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~ 220 (330) T protein:vir:77 141 YLAETTKVVSLADTNLTTASGPQGNAYLAVNNALSLLVNSGKKWTGTLLDNVTEPILNTAVDGNGRPLFVESTYTEQVGA 220 (330) T ss_pred ccccccccceeecccccccccccchhHHHHHHHHHhhhhcCCCccEEEEcHHHHHHHHHHhccCCceeecCccccccccc Confidence 87765432 22221 122222 34455555667778888888999999999998544334455555433221 Q ss_pred -----ccceecCeeeecccc-ccc-CCc-eecc--Ccccc-ccccccCCCCCCCCeeEEEEec--cC--CCcCccccccc Q lcl|NC_019448. 247 -----NTGYSVNGFYSSRGF-IKL-HGS-TVME--NELIL-DESLQPLPNAPQPAKVTATVET--KQ--KGAFEDEEDRA 311 (463) Q Consensus 247 -----~~G~~v~~~~s~~G~-i~l-~~s-~~~~--~d~~l-~~~~~~~p~ap~p~~vtat~~~--~~--~g~~~~~~~~a 311 (463) -.|++| +.+..-+ ... +.. .++- +..++ ++.....- .-.....+... .. .+...+.-... T Consensus 221 ~~~~~l~G~PV--~~~~~~p~~~~~~~~~~~~gd~s~~~i~~~~~~~i~---~~~e~~~~~~~~~~~~~~~~~~~~f~~~ 295 (330) T protein:vir:77 221 IREGRILGRPT--YVADNVVNGTVGNRVVGVMGDFSQVIWGQIGGLSFD---VTDQATLDFGEEQGGVWVPKLISLWQHN 295 (330) T ss_pred cCCceecceee--EEeccccCCCCCCccEEEEEecceEEEEEecCcEEE---EeecceeeecccccccccccccchhhcC Confidence 234443 1111100 000 000 0110 00000 00000000 00000000000 00 00000000111 Q ss_pred ceEEEEEEEecCCccccccceeeeecCCCCceEEE Q lcl|NC_019448. 312 GLSYKVVVNSDDAQSAPSEEVTATVSNVDDGVKLS 346 (463) Q Consensus 312 ~ysYkV~a~s~~geS~~S~~vt~Tva~~~~gv~lt 346 (463) -..+++...-+..=--|...+-.+.....+.-+=. T Consensus 296 ~~~~r~~~r~d~~v~~~~a~~~i~~~~~~~~~~~~ 330 (330) T protein:vir:77 296 MVAVRCEAEFAFMVNDKDAFVKLTDQVAGTDPEEE 330 (330) T ss_pred cEEEEEEEEeccEEecccceEEEEeccCCcCCCCC Confidence 23333332222111111111111111111110001 No 24 >protein:vir:93631 Length: 580 # NCBI annotation: Bcep22gp67 # Family: family:all:1544 # MgeID: mge:1470 # MgeName: Bcep22 # Cross-refs: genbank:acc:NP_944296;genbank:gi:38640373;genbank:GeneID:2658280 Probab=97.77 E-value=1.9e-06 Score=51.95 Aligned_cols=237 Identities=19% Similarity=0.216 Sum_probs=84.1 Q ss_pred hhhhhhhhcCCceeEE---ecCHHHHHHHHHHhcCcceEEeecCC------------CCcccce-------ecCe-eeec Q lcl|NC_019448. 202 EAAVRIGKGFGTATDA---YMPIGVHADFVNSILGRQMQLMQDNS------------GNVNTGY-------SVNG-FYSS 258 (463) Q Consensus 202 ~aa~~i~~~~G~~td~---~m~~~vka~f~~~~~~~qrv~~~~n~------------g~~~~G~-------~v~~-~~s~ 258 (463) .-+..|..=-|...-+ .||......=.|-.+.+- +|.|=+. +...+=+ .-++ +.-+ T Consensus 1 M~~i~i~~f~Ge~Prl~p~lLP~~~a~~a~n~~~~~G-~i~P~~~~~~~~~~~~i~~~~~~t~~~~~~~W~~w~~~V~~i 79 (580) T protein:vir:93 1 MTIIKITGFSGEIPRLVPRLLPDTAAQNATNARLESG-GLTPYRKPKFITRISTIPAGQIETIYRNGETWMAWDKPVYAA 79 (580) T ss_pred CeeEeecccccccccchhhhccccccceEEeeeccCC-eeeeeeCchhhccccccCcCcceEEEecCceeEEeCCceeee Confidence 2222221111222211 122221111111111111 0111100 0000000 0000 1122 Q ss_pred ccccccCCceeccCccccccccc--cCC-CCCCCCe-eEEEEeccCCCcCcccccccceEEEEEEEecCC-cccccccee Q lcl|NC_019448. 259 RGFIKLHGSTVMENELILDESLQ--PLP-NAPQPAK-VTATVETKQKGAFEDEEDRAGLSYKVVVNSDDA-QSAPSEEVT 333 (463) Q Consensus 259 ~G~i~l~~s~~~~~d~~l~~~~~--~~p-~ap~p~~-vtat~~~~~~g~~~~~~~~a~ysYkV~a~s~~g-eS~~S~~vt 333 (463) +|.|.-+ -+.+-+|..-+-... ..+ ..|.|.. +++.. .+.+ ..+..+|+|.++.+...| ||.||.... T Consensus 80 ~~PvA~D-Rvy~Td~g~Pkvt~~g~sy~lgVpaPs~Apt~~~--~g~g----~l~~~~y~Yv~TfVt~~GeES~PS~~S~ 152 (580) T protein:vir:93 80 PGPVAAD-RLYVMGDGAPKMIVGGTTYPLAVPMPSAALTAAT--SGTG----TGDVFSRVYVYTFVTGFGEESEPSAISN 152 (580) T ss_pred cCccccc-eeEEcCCcccceecCCccccccCCCcccCceeee--cCCC----CcCccceEEEEEEEcCCCCcCCCccccc Confidence 2332222 111111111000000 000 0122221 22221 1112 235678999999998888 899876433 Q ss_pred e-eecCCCCceEEEEEecCCCCCCcceEEEEeecCC--CceEEEEEEeeeeeecCCceEEEEeccCCCCCCccceecCCc Q lcl|NC_019448. 334 A-TVSNVDDGVKLSISVNAMYQQQPQFVSIYRQGKE--TGMYFLIKRVPVKDAQEDGTIVFVDKNETLPETADVFVGEMS 410 (463) Q Consensus 334 ~-Tva~~~~gv~ltIt~~a~~g~~~~~y~IYR~~~~--~g~~~li~rv~~s~~n~~gtttf~D~N~~iPgt~~~fvGe~~ 410 (463) . ++. .+.+|.|+..+.+..+..=+...|||+..+ +++|+|+++++ -++++|+|...... +|+.= T Consensus 153 ~vtv~-~g~tVtLs~~p~p~~~~~i~~~RIYRS~tG~~gtdy~lVAel~------Ag~~sF~Dd~s~a~------Lge~L 219 (580) T protein:vir:93 153 EVNWQ-AGQTVTLSGFQAAPAGRNITKQRIYRSQTSLSGTDLYFIAERD------ASAANFVDNVPLSD------QNEPL 219 (580) T ss_pred ceeeC-CCCeEEEEecCCCCCCCccceEEEEEeccCCCceeEEEEeeec------cceeeeeecccccc------ccccc Confidence 3 332 344566665443333322244799998765 46999999864 36789999875421 11111 Q ss_pred hHHHHh----hhhcchhhcCCcccCCcceeeeeeechhheecceeeEE-----EE-EEeEecC Q lcl|NC_019448. 411 PQVVHL----FELLPMMKLPLAQINASITFAVLWYGALALRAPKKWAR-----IK-NVRYIAV 463 (463) Q Consensus 411 pqvi~l----~ellPm~k~pla~~na~~~~~V~~Yg~L~l~aPkk~~~-----ik-NV~~~~~ 463 (463) | +..| ..|..+..||+- +=+...+-..||.=. +.|.-|-. +. +|.=|.. T Consensus 220 p-s~~~~~PP~~m~gL~~m~nG-i~agF~Gnev~fsEp--y~P~AWP~~yr~t~~~~Ivaia~ 278 (580) T protein:vir:93 220 P-SLEWNAPPDDLTGLISLPNG-MMAAFRGKELWLCEP--WRPHAWPQKYVLTMDYNIVALGA 278 (580) T ss_pred c-hhhccCcCCCcceEEeeccc-eEEEEeCCEEEEecC--CCCccchhhcCCCCCCCceeEee Confidence 1 1111 001111222222 112222222232221 34444432 11 1211111 No 25 >protein:vir:4953 Length: 397 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:108 # MgeName: Sfi19 # Cross-refs: genbank:acc:NP_049929;genbank:gi:9632900;genbank:GeneID:1262076 Probab=97.75 E-value=1.1e-05 Score=47.73 Aligned_cols=291 Identities=12% Similarity=0.095 Sum_probs=134.5 Q ss_pred CCCCCccchHHHHhhhhhhHHHHHHhhcCCc-----cCCccccCccccchhhhhhHhhhhhccccccchhhhcccchhhH Q lcl|NC_019448. 1 MTIEKNLSDVQQKYADQFQEDVVKSFQTGYG-----ITPDTQIDAGALRREILDDQITMLTWTNEDLIFYRDISRRPAQS 75 (463) Q Consensus 1 ~~~~~~~~~~~~~~~k~~~e~~~Ks~~agy~-----~~p~~q~~gaalr~esLd~~i~~L~~~~~df~f~~~i~k~~~~s 75 (463) -+.+.+.........+.|.+ .+..+.. .+-.+..+|+.|--+.+.++|..+..... .+++.+...++.+ T Consensus 77 ~~~~~~~~~~~~~~~~~~~~----~l~~~~~~~~~~~~~~t~~~gg~~vP~~~~~~ii~~~~~~~--~l~~~~~~~~~~~ 150 (397) T protein:vir:49 77 KPLTKSEEEVKAGFVKDFKN----LVRGRYQNLLDSKTDASGSDAGLTIPQDIQTAIHTLVSQYD--SLQEYVNVENVTT 150 (397) T ss_pred cccccchhHHHHHHHHHHHH----HHhcchhHHHHHhhccccccCcccccHhHHHHHHHHHHhhh--hHHhhhceeeccc Confidence 11111111111122222221 1111110 11123345666655555566643333332 3444455555544 Q ss_pred HHhhhhhhhccCcccccccccccCccc-ccCcceEEEEEEEEEeechhhhhhhhhhhcccccHHHHHHHHHHHHHHHHHH Q lcl|NC_019448. 76 TVVKYDQYLRHGNVGHSRFVKEIGVAP-VSDPNIRQKTVSMKYVSDTKNMSIASGLVNNIADPSQILTEDAIAVVAKTIE 154 (463) Q Consensus 76 tv~ey~~~~~hG~~g~~~fv~E~g~~~-~~d~~~~r~~~~~k~l~~~~~vs~~~~lvn~~~Dp~~~~~~~ai~~~~~~~E 154 (463) ...+|.......+.+...+++|++..+ .+++.+......++-++.-..+|.-+ +.++..|.+....+.-...++..++ T Consensus 151 ~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~i~~~~~k~~~~~~iS~el-l~ds~~~l~~~i~~~l~~~~~~~~d 229 (397) T protein:vir:49 151 LTGSRVYEKWTDITGLANIDDEAGKIADVDDPKLSLIKYTIKRYAGISTVTNSL-LADSAENILAWLSGWIAKKVVVTRN 229 (397) T ss_pred CccceEEEeeccCCcceeeecCccccccccccceeeEEeeeeeEEeeehhHHHH-HhhhHHHHHHHHHHHHHHHHHHHHH Confidence 333333222223345678999998854 68999999999999999888888654 2445567888889999999999999 Q ss_pred HHHhhcccccCCCccccccccccceeeecCcceEeccCCCCCHHHHhhhhhhhhhcCCceeEEecCHHHHHHHHHHhcCc Q lcl|NC_019448. 155 WASFYGDASLTSEVEGEGLEFDGLAKLIDKNNVINAKGNQLTEKHLNEAAVRIGKGFGTATDAYMPIGVHADFVNSILGR 234 (463) Q Consensus 155 ~a~fyGd~~l~~~~~~~gleFDGl~~lI~~~nviDarG~~ls~~~ln~aa~~i~~~~G~~td~~m~~~vka~f~~~~~~~ 234 (463) .+++.|+..-.+. +...+-+.|.++...+...|...+..+|++.+.+.+...--.. T Consensus 230 ~ai~~G~g~~~~~------------------------~~~~~~d~i~~~~~~l~~~~~~~a~~vmn~~~~~~l~~lkd~~ 285 (397) T protein:vir:49 230 KAILEAIAALPTK------------------------PTLTKWDDIIDLEAKVDPAIKQTSFFLTNTSGFTALKKVKNAL 285 (397) T ss_pred HHHHhhccccccc------------------------cccccHHHHHHHHHhhhhhhcCCCEEEEcHHHHHHHHHhhcCC Confidence 9999998775431 1122445566666677778877888999999999987532122 Q ss_pred ceEEeecCC----CCcccceecC----eeeecccccccCC-ceecc--CccccccccccCCCCCCCCeeEEEEeccCCCc Q lcl|NC_019448. 235 QMQLMQDNS----GNVNTGYSVN----GFYSSRGFIKLHG-STVME--NELILDESLQPLPNAPQPAKVTATVETKQKGA 303 (463) Q Consensus 235 qrv~~~~n~----g~~~~G~~v~----~~~s~~G~i~l~~-s~~~~--~d~~l~~~~~~~p~ap~p~~vtat~~~~~~g~ 303 (463) -|.+.+++. +..-.|++|- .++...+ ... .+++. .+.+....+ ..++....... +. T Consensus 286 G~~l~~~~~~~~~~~~l~G~PV~~~~~~~~~~~~---~~~~~i~~gd~~~~~~~~~~---------~~~~i~~~~~~-~~ 352 (397) T protein:vir:49 286 GDYLMERDVKSPTGYSIDGFAVKEVADRWLANGT---GGAMPLYFGDLKQAVTLFDR---------QHMSLLSTNIG-GG 352 (397) T ss_pred CceeeccCcCCCCCceecceeeEEeccccccccc---CCceeEEEeeccceEEEEee---------cceEEEEeccc-cc Confidence 234443332 2234455441 1111110 000 01111 000000000 00011111110 00 Q ss_pred CcccccccceEEEEE------EEecCC------cccccccee-eeecCC Q lcl|NC_019448. 304 FEDEEDRAGLSYKVV------VNSDDA------QSAPSEEVT-ATVSNV 339 (463) Q Consensus 304 ~~~~~~~a~ysYkV~------a~s~~g------eS~~S~~vt-~Tva~~ 339 (463) .+ ..+ ...|++. +.+..+ ..+++++.+ .|.+ + T Consensus 353 ~~-~~~--~~~~r~~~r~d~~~~~~~a~~~~~~~~~~~~~~~~~~~~-~ 397 (397) T protein:vir:49 353 AF-ETD--TTKVRVIDRFDVVATDTEAFVPASFKAIADQKGNLGSTA-V 397 (397) T ss_pred hh-hcC--ceeEEEEeeeCcEEecccceEEEEeecccCCCCCccccc-C Confidence 00 000 0111111 111100 000111111 1111 1 No 26 >protein:vir:191 Length: 385 # NCBI annotation: major head subunit precursor # Family: family:all:585 # MgeID: mge:6 # MgeName: HK97 # Cross-refs: genbank:acc:NP_037701;genbank:gi:9634158;genbank:GeneID:1262530 Probab=97.74 E-value=1.4e-05 Score=47.18 Aligned_cols=302 Identities=13% Similarity=0.109 Sum_probs=150.9 Q ss_pred CCCCCccchHHHHhhhhhhHHHHHHhhcC----------CccCCccccCccccchhhhhhHhhhhhccccccchhhhccc Q lcl|NC_019448. 1 MTIEKNLSDVQQKYADQFQEDVVKSFQTG----------YGITPDTQIDAGALRREILDDQITMLTWTNEDLIFYRDISR 70 (463) Q Consensus 1 ~~~~~~~~~~~~~~~k~~~e~~~Ks~~ag----------y~~~p~~q~~gaalr~esLd~~i~~L~~~~~df~f~~~i~k 70 (463) -..+..... ..+.+.+++.|.+... -..+..+..+|+.+.-+....-+..+ .+...+++.++. T Consensus 68 ~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~i~~~~~~~ii~~~---~~~~~l~~~~~~ 140 (385) T protein:vir:19 68 AENPGEKKS----FSERAAEELIKSWDGKQGTFGAKTFNKSLGSDADSAGSLIQPMQIPGIIMPG---LRRLTIRDLLAQ 140 (385) T ss_pred ccccchhhh----hHHHHHHHHHHHHHHhhccchhhHHHhhhccccccCCceecchhhhHHHHHh---hhccchhhhcce Confidence 111111111 1112223333333211 01111223345556655544433333 334456666666 Q ss_pred chhhHHHhhhhhhhccCcccccccccccCcccccCcceEEEEEEEEEeechhhhhhhhhhhcccccHHHHHHHHHHHHHH Q lcl|NC_019448. 71 RPAQSTVVKYDQYLRHGNVGHSRFVKEIGVAPVSDPNIRQKTVSMKYVSDTKNMSIASGLVNNIADPSQILTEDAIAVVA 150 (463) Q Consensus 71 ~~~~stv~ey~~~~~hG~~g~~~fv~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lvn~~~Dp~~~~~~~ai~~~~ 150 (463) .++.+.-.+|.+... ..+...+++|++..+.+++++.+....++=++.-..+|.-+ .+...+.+....+.-...+. T Consensus 141 ~~~~~~~~~~~~~~~--~~~~a~~v~E~~~~~~~~~~~~~~~~~~~k~~~~~~is~el--l~d~~~l~~~i~~~la~a~~ 216 (385) T protein:vir:19 141 GRTSSNALEYVREEV--FTNNADVVAEKALKPESDITFSKQTANVKTIAHWVQASRQV--MDDAPMLQSYINNRLMYGLA 216 (385) T ss_pred ecccCcceEEEEEec--CCcceeeeccCccccccccceeEEEEeeeeEEEeehhhHHH--HhhHHHHHHHHHHHHHHHHH Confidence 666554445555543 23345688999999999999999999999999888888753 33334678888888899999 Q ss_pred HHHHHHHhhcccccCCCccccccccccceeeecCc-ceEeccCCCCCHHHHhhhhhhhhhcCCceeEEecCHHHHHHHHH Q lcl|NC_019448. 151 KTIEWASFYGDASLTSEVEGEGLEFDGLAKLIDKN-NVINAKGNQLTEKHLNEAAVRIGKGFGTATDAYMPIGVHADFVN 229 (463) Q Consensus 151 ~~~E~a~fyGd~~l~~~~~~~gleFDGl~~lI~~~-nviDarG~~ls~~~ln~aa~~i~~~~G~~td~~m~~~vka~f~~ 229 (463) ..+|.++++|+-. +-.+.|+.+..... ..... +...+.+.|-.+...+...|+..+-++||+.+.+.+.. T Consensus 217 ~~~d~~~l~G~g~--------~~~~~Gi~~~~~~~~~~~~~-~~~~~~d~i~~~~~~l~~~~~~~~~~~~~~~~~~~l~~ 287 (385) T protein:vir:19 217 LKEEGQLLNGDGT--------GDNLEGLNKVATAYDTSLNA-TGDTRADIIAHAIYQVTESEFSASGIVLNPRDWHNIAL 287 (385) T ss_pred HHHHHHHHhccCC--------CCcccccccccccccccccc-cccchHHHHHHHHHhhccccCCCCEEEEcHHHHHHHHH Confidence 9999999999744 23577877755422 22222 23345677777777888889999999999999998875 Q ss_pred HhcCcceEEeec-CC--CCcccceecCeeeecccccccCCceeccC--ccccccccccCCCCCCCCeeEEEEeccCCCcC Q lcl|NC_019448. 230 SILGRQMQLMQD-NS--GNVNTGYSVNGFYSSRGFIKLHGSTVMEN--ELILDESLQPLPNAPQPAKVTATVETKQKGAF 304 (463) Q Consensus 230 ~~~~~qrv~~~~-n~--g~~~~G~~v~~~~s~~G~i~l~~s~~~~~--d~~l~~~~~~~p~ap~p~~vtat~~~~~~g~~ 304 (463) .--..-|.+.++ .. ++.-.|++| +.+.. +. .+.+++-+ .......+ ..++..........| T Consensus 288 lkd~~G~~l~~~~~~~~~~~l~G~pV--~~~~~--~p-~~~~~~gd~~~~~~~~~~---------~~~~v~~~~~~~~~~ 353 (385) T protein:vir:19 288 LKDNEGRYIFGGPQAFTSNIMWGLPV--VPTKA--QA-AGTFTVGGFDMASQVWDR---------MDATVEVSREDRDNF 353 (385) T ss_pred hhcCCCceeccCcccCCCceecceee--EEcCc--CC-CCcEEEeecccEEEEEEe---------cceEEEEeccccchh Confidence 332222333332 11 112234432 11111 00 11122110 00000000 011111111111000 Q ss_pred cccccccceEEEEEEEecCCccccccceeeeecCCC Q lcl|NC_019448. 305 EDEEDRAGLSYKVVVNSDDAQSAPSEEVTATVSNVD 340 (463) Q Consensus 305 ~~~~~~a~ysYkV~a~s~~geS~~S~~vt~Tva~~~ 340 (463) ......|++...-+..=--|...+-.+++... T Consensus 354 ----~~~~~~~~~~~r~~~~v~~~~a~~~~~~~aa~ 385 (385) T protein:vir:19 354 ----VKNMLTILCEERLALAHYRPTAIIKGTFSSGS 385 (385) T ss_pred ----hcCcEEEEEEEeeccEEecccceEEEEeccCC Confidence 01112333332222221223333333333221 No 27 >protein:vir:1886 Length: 385 # NCBI annotation: major capsid subunit precursor # Family: family:all:585 # MgeID: mge:41 # MgeName: HK022 # Cross-refs: genbank:acc:NP_037666;genbank:gi:9634124;genbank:GeneID:1262513 Probab=97.74 E-value=1.4e-05 Score=47.18 Aligned_cols=302 Identities=13% Similarity=0.109 Sum_probs=150.9 Q ss_pred CCCCCccchHHHHhhhhhhHHHHHHhhcC----------CccCCccccCccccchhhhhhHhhhhhccccccchhhhccc Q lcl|NC_019448. 1 MTIEKNLSDVQQKYADQFQEDVVKSFQTG----------YGITPDTQIDAGALRREILDDQITMLTWTNEDLIFYRDISR 70 (463) Q Consensus 1 ~~~~~~~~~~~~~~~k~~~e~~~Ks~~ag----------y~~~p~~q~~gaalr~esLd~~i~~L~~~~~df~f~~~i~k 70 (463) -..+..... ..+.+.+++.|.+... -..+..+..+|+.+.-+....-+..+ .+...+++.++. T Consensus 68 ~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~i~~~~~~~ii~~~---~~~~~l~~~~~~ 140 (385) T protein:vir:18 68 AENPGEKKS----FSERAAEELIKSWDGKQGTFGAKTFNKSLGSDADSAGSLIQPMQIPGIIMPG---LRRLTIRDLLAQ 140 (385) T ss_pred ccccchhhh----hHHHHHHHHHHHHHHhhccchhhHHHhhhccccccCCceecchhhhHHHHHh---hhccchhhhcce Confidence 111111111 1112223333333211 01111223345556655544433333 334456666666 Q ss_pred chhhHHHhhhhhhhccCcccccccccccCcccccCcceEEEEEEEEEeechhhhhhhhhhhcccccHHHHHHHHHHHHHH Q lcl|NC_019448. 71 RPAQSTVVKYDQYLRHGNVGHSRFVKEIGVAPVSDPNIRQKTVSMKYVSDTKNMSIASGLVNNIADPSQILTEDAIAVVA 150 (463) Q Consensus 71 ~~~~stv~ey~~~~~hG~~g~~~fv~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lvn~~~Dp~~~~~~~ai~~~~ 150 (463) .++.+.-.+|.+... ..+...+++|++..+.+++++.+....++=++.-..+|.-+ .+...+.+....+.-...+. T Consensus 141 ~~~~~~~~~~~~~~~--~~~~a~~v~E~~~~~~~~~~~~~~~~~~~k~~~~~~is~el--l~d~~~l~~~i~~~la~a~~ 216 (385) T protein:vir:18 141 GRTSSNALEYVREEV--FTNNADVVAEKALKPESDITFSKQTANVKTIAHWVQASRQV--MDDAPMLQSYINNRLMYGLA 216 (385) T ss_pred ecccCcceEEEEEec--CCcceeeeccCccccccccceeEEEEeeeeEEEeehhhHHH--HhhHHHHHHHHHHHHHHHHH Confidence 666554445555543 23345688999999999999999999999999888888753 33334678888888899999 Q ss_pred HHHHHHHhhcccccCCCccccccccccceeeecCc-ceEeccCCCCCHHHHhhhhhhhhhcCCceeEEecCHHHHHHHHH Q lcl|NC_019448. 151 KTIEWASFYGDASLTSEVEGEGLEFDGLAKLIDKN-NVINAKGNQLTEKHLNEAAVRIGKGFGTATDAYMPIGVHADFVN 229 (463) Q Consensus 151 ~~~E~a~fyGd~~l~~~~~~~gleFDGl~~lI~~~-nviDarG~~ls~~~ln~aa~~i~~~~G~~td~~m~~~vka~f~~ 229 (463) ..+|.++++|+-. +-.+.|+.+..... ..... +...+.+.|-.+...+...|+..+-++||+.+.+.+.. T Consensus 217 ~~~d~~~l~G~g~--------~~~~~Gi~~~~~~~~~~~~~-~~~~~~d~i~~~~~~l~~~~~~~~~~~~~~~~~~~l~~ 287 (385) T protein:vir:18 217 LKEEGQLLNGDGT--------GDNLEGLNKVATAYDTSLNA-TGDTRADIIAHAIYQVTESEFSASGIVLNPRDWHNIAL 287 (385) T ss_pred HHHHHHHHhccCC--------CCcccccccccccccccccc-cccchHHHHHHHHHhhccccCCCCEEEEcHHHHHHHHH Confidence 9999999999744 23577877755422 22222 23345677777777888889999999999999998875 Q ss_pred HhcCcceEEeec-CC--CCcccceecCeeeecccccccCCceeccC--ccccccccccCCCCCCCCeeEEEEeccCCCcC Q lcl|NC_019448. 230 SILGRQMQLMQD-NS--GNVNTGYSVNGFYSSRGFIKLHGSTVMEN--ELILDESLQPLPNAPQPAKVTATVETKQKGAF 304 (463) Q Consensus 230 ~~~~~qrv~~~~-n~--g~~~~G~~v~~~~s~~G~i~l~~s~~~~~--d~~l~~~~~~~p~ap~p~~vtat~~~~~~g~~ 304 (463) .--..-|.+.++ .. ++.-.|++| +.+.. +. .+.+++-+ .......+ ..++..........| T Consensus 288 lkd~~G~~l~~~~~~~~~~~l~G~pV--~~~~~--~p-~~~~~~gd~~~~~~~~~~---------~~~~v~~~~~~~~~~ 353 (385) T protein:vir:18 288 LKDNEGRYIFGGPQAFTSNIMWGLPV--VPTKA--QA-AGTFTVGGFDMASQVWDR---------MDATVEVSREDRDNF 353 (385) T ss_pred hhcCCCceeccCcccCCCceecceee--EEcCc--CC-CCcEEEeecccEEEEEEe---------cceEEEEeccccchh Confidence 332222333332 11 112234432 11111 00 11122110 00000000 011111111111000 Q ss_pred cccccccceEEEEEEEecCCccccccceeeeecCCC Q lcl|NC_019448. 305 EDEEDRAGLSYKVVVNSDDAQSAPSEEVTATVSNVD 340 (463) Q Consensus 305 ~~~~~~a~ysYkV~a~s~~geS~~S~~vt~Tva~~~ 340 (463) ......|++...-+..=--|...+-.+++... T Consensus 354 ----~~~~~~~~~~~r~~~~v~~~~a~~~~~~~aa~ 385 (385) T protein:vir:18 354 ----VKNMLTILCEERLALAHYRPTAIIKGTFSSGS 385 (385) T ss_pred ----hcCcEEEEEEEeeccEEecccceEEEEeccCC Confidence 01112333332222221223333333333221 No 28 >protein:vir:97053 Length: 390 # NCBI annotation: putative head protein # Family: family:all:585 # MgeID: mge:1653 # MgeName: OP1 # Cross-refs: genbank:acc:YP_453565;genbank:gi:84662600;genbank:GeneID:5142468 Probab=97.68 E-value=2.9e-05 Score=45.44 Aligned_cols=303 Identities=12% Similarity=0.049 Sum_probs=148.7 Q ss_pred CCCCCccch----HH---HHhhhhhh-------HHHHHHhhcCCccCCccccCccccchhhhhhHhhhhhccccccchhh Q lcl|NC_019448. 1 MTIEKNLSD----VQ---QKYADQFQ-------EDVVKSFQTGYGITPDTQIDAGALRREILDDQITMLTWTNEDLIFYR 66 (463) Q Consensus 1 ~~~~~~~~~----~~---~~~~k~~~-------e~~~Ks~~agy~~~p~~q~~gaalr~esLd~~i~~L~~~~~df~f~~ 66 (463) +..+.+... .. .++.+... .+......++ .+..+-.+|+.+..+.+..-+..+ .+.-.+++ T Consensus 71 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~g~lip~~~~~~ii~~~---~~~~~i~~ 145 (390) T protein:vir:97 71 GDVQHVSVGDMFVASEQFQASTGRWNDRSARATMNIKAALNTA--STDAAGSAGALTTPNRLPGFITPP---DARLTVRD 145 (390) T ss_pred cccccccchhhhhhhHHHHHHHHHhhhhhhhhhhHHHHHHHhh--hcccccccccccchhhhHHHHHHH---hhhhhhHh Confidence 111111110 00 11111111 0111111121 122223334456666555443333 22334566 Q ss_pred hcccchhhHHHhhhhhhhccCcccccccccccCcccccCcceEEEEEEEEEeechhhhhhhhhhhcccccHHHHHHHHHH Q lcl|NC_019448. 67 DISRRPAQSTVVKYDQYLRHGNVGHSRFVKEIGVAPVSDPNIRQKTVSMKYVSDTKNMSIASGLVNNIADPSQILTEDAI 146 (463) Q Consensus 67 ~i~k~~~~stv~ey~~~~~hG~~g~~~fv~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lvn~~~Dp~~~~~~~ai 146 (463) -+...++.+....|.+.... .+...+++|++..+.+++.+.+....++-++..-.+|.-+ +.++ .+.+....+.-. T Consensus 146 ~~~~~~~~~~~~~~~~~~~~--~~~a~~v~Eg~~~~~~~~~~~~i~~~~~k~~~~~~is~el-l~ds-~~l~~~i~~~la 221 (390) T protein:vir:97 146 LIGSGRTDSALIEYVQETGF--VNNAAIVAEGALKPESSLKFAKKTDTTHVIAHTMKATRQI-LSDA-PQLASYMNNRLI 221 (390) T ss_pred hcceeeccCCceEEEEEecC--CcceeeecCCccccccccceeEEEEeeeeEEEeehhhHHH-HHhH-HHHHHHHHHHHH Confidence 66666666655566666533 3456799999999999999999999999999988888853 3333 578899999999 Q ss_pred HHHHHHHHHHHhhcccccCCCccccccccccceeeecCcceEeccCCCCCHHHHhhhhhhhhhcCCceeEEecCHHHHHH Q lcl|NC_019448. 147 AVVAKTIEWASFYGDASLTSEVEGEGLEFDGLAKLIDKNNVINAKGNQLTEKHLNEAAVRIGKGFGTATDAYMPIGVHAD 226 (463) Q Consensus 147 ~~~~~~~E~a~fyGd~~l~~~~~~~gleFDGl~~lI~~~nviDarG~~ls~~~ln~aa~~i~~~~G~~td~~m~~~vka~ 226 (463) ..+.+.+++++|+|+-. +-++.|+.+........-........+.|..+...+...|...+-++|++.+.+. T Consensus 222 ~a~~~~~d~a~l~G~g~--------~~~p~Gi~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~v~n~~~~~~ 293 (390) T protein:vir:97 222 RGLKVKEDAEILRGTGA--------NDGLLGLIPQATTYAAPTTIAGATRVDQLRLAMLQASLAEYPASGIVINPIDWAA 293 (390) T ss_pred HHHHHHHHHHHhhcCCC--------CccccceeeccccccccccccccchHHHHHHHHHhhccccCCCCEEEEcHHHHHH Confidence 99999999999999643 1247788776544444334445555667777777888888888889999999999 Q ss_pred HHHHhcCcceEEeecCCC---CcccceecCeeeecccccccCCceeccCccccccccccCCCCCCCCeeEEEEeccCCCc Q lcl|NC_019448. 227 FVNSILGRQMQLMQDNSG---NVNTGYSVNGFYSSRGFIKLHGSTVMENELILDESLQPLPNAPQPAKVTATVETKQKGA 303 (463) Q Consensus 227 f~~~~~~~qrv~~~~n~g---~~~~G~~v~~~~s~~G~i~l~~s~~~~~d~~l~~~~~~~p~ap~p~~vtat~~~~~~g~ 303 (463) |...-=..-|.+.++..+ ..-.|++| +.+.. +. .+.+++. |...... -.....++... +.... T Consensus 294 L~~lkd~~G~~l~~~~~~~~~~~l~G~pV--~~~~~--~~-~~~~~~g-----d~~~~~~--~~~~~~~~i~~--~~~~~ 359 (390) T protein:vir:97 294 IELAKDANNQYLIGNARGTLTPTLWGLPV--VATQA--MA-PGEFLVG-----AFDLAAQ--IFDQWDARVEI--GYVND 359 (390) T ss_pred HHHhhcCCCceeecCccCCCCceecceee--EEcCC--CC-CCcEEEE-----eccceEE--EEEecceEEEE--eeccc Confidence 884321222333332111 01122222 11100 00 0011100 0000000 00000001110 00000 Q ss_pred CcccccccceEEEEEEEecCCccccccceeeeec Q lcl|NC_019448. 304 FEDEEDRAGLSYKVVVNSDDAQSAPSEEVTATVS 337 (463) Q Consensus 304 ~~~~~~~a~ysYkV~a~s~~geS~~S~~vt~Tva 337 (463) .+ . .....|++...-+..=--|...+-++++ T Consensus 360 ~f-~--~~~~~~r~~~r~d~~v~~~~a~v~~~~a 390 (390) T protein:vir:97 360 DF-Q--RNMVTVLAEERLALVVYRPEALITGSFA 390 (390) T ss_pred cc-c--cCcEEEEEEEeeccEEeccccEEEEEeC Confidence 00 0 0111222222222222223333333333 No 29 >protein:vir:4830 Length: 397 # NCBI annotation: MPL-7201 # Family: family:all:21 # MgeID: mge:105 # MgeName: 7201 # Cross-refs: genbank:acc:NP_038327;genbank:gi:9634653;genbank:GeneID:1262632 Probab=97.59 E-value=2.3e-05 Score=46.05 Aligned_cols=304 Identities=11% Similarity=0.041 Sum_probs=135.2 Q ss_pred CCCCCccchHHHHhhhhhhHHHHHHhhcC-CccCCccccCccccchhhhhhHhhhhhccccccchhhhcccchhhHHHhh Q lcl|NC_019448. 1 MTIEKNLSDVQQKYADQFQEDVVKSFQTG-YGITPDTQIDAGALRREILDDQITMLTWTNEDLIFYRDISRRPAQSTVVK 79 (463) Q Consensus 1 ~~~~~~~~~~~~~~~k~~~e~~~Ks~~ag-y~~~p~~q~~gaalr~esLd~~i~~L~~~~~df~f~~~i~k~~~~stv~e 79 (463) -+.+.+.........+.+.+.+.+....- -..+..+..+||.|--+.+.++|..+.. +...+++.+...++.+...+ T Consensus 77 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~~~gg~~iP~~~~~~ii~~~~--~~~~l~~~~~~~~~~~~~~~ 154 (397) T protein:vir:48 77 KPLTKSEEEVKAGFVKDFKNLVRGRYQNLLDSKTDASGSDAGLTIPQDIQTAIHTLVR--QYDSLQEYVNVENVTTLTGS 154 (397) T ss_pred ccccchhhHHHHHHHHHHHHHHhhhhhHHHHHhhccCCccccccccHHHHHHHHHHHH--HHHHHHhhhceeeccCCcce Confidence 22222222222333333332222221100 0011122345666555555555533322 23345555555555544334 Q ss_pred hhhhhccCcccccccccccCccc-ccCcceEEEEEEEEEeechhhhhhhhhhhcccccHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_019448. 80 YDQYLRHGNVGHSRFVKEIGVAP-VSDPNIRQKTVSMKYVSDTKNMSIASGLVNNIADPSQILTEDAIAVVAKTIEWASF 158 (463) Q Consensus 80 y~~~~~hG~~g~~~fv~E~g~~~-~~d~~~~r~~~~~k~l~~~~~vs~~~~lvn~~~Dp~~~~~~~ai~~~~~~~E~a~f 158 (463) +.....-...+...+++|++..+ ..++.+.+.+..++-++.-..+|.-+ +.++.-|.+....+.--..++..++.+++ T Consensus 155 ~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~v~~~~~k~~~~~~iS~el-l~ds~~~l~~~v~~~l~~~~~~~~d~~il 233 (397) T protein:vir:48 155 RVYEKWADITGLAKLDDEAGSIGTNDDPKLYPIRYAIKRYAGISTVTNSL-LADSAENILAWLSGWIAKKVVVTRNKAIL 233 (397) T ss_pred EEEEeecCCCcceeeeccccccccccccceeeEEeeheeeeeehhhHHHH-HhhchHHHHHHHHHHHHHHHHHHHHHHHh Confidence 33222222334567899998875 45799999999999999888888754 33455678888888889999999999999 Q ss_pred hcccccCCCccccccccccceeeecCcceEeccCCCCCHHHHhhhhhhhhhcCCceeEEecCHHHHHHHHHHhcCcceEE Q lcl|NC_019448. 159 YGDASLTSEVEGEGLEFDGLAKLIDKNNVINAKGNQLTEKHLNEAAVRIGKGFGTATDAYMPIGVHADFVNSILGRQMQL 238 (463) Q Consensus 159 yGd~~l~~~~~~~gleFDGl~~lI~~~nviDarG~~ls~~~ln~aa~~i~~~~G~~td~~m~~~vka~f~~~~~~~qrv~ 238 (463) .|+..-.+. |...+-+.|..+...+...|......+|++.+.+.|...==..-|.+ T Consensus 234 ~G~g~~~~~------------------------~~~~~~d~i~~~~~~l~~~~~~~a~~v~n~~~~~~L~~lkd~~G~~i 289 (397) T protein:vir:48 234 EAIATLPTK------------------------PTLTKWDDIIDLQAKVDPAIKQTSFFLTNTSGFTALKKVKNAFGDYL 289 (397) T ss_pred hcccccccc------------------------cccccHHHHHHHHHHhhhhhcCCCEEEECHHHHHHHHHhhcCCCcee Confidence 998775431 11123344455555666777777889999999999875322222344 Q ss_pred eecCCC----CcccceecCe----eeecccccccCC-ceecc--CccccccccccCCCCCCCCeeEEEEeccCCCcCccc Q lcl|NC_019448. 239 MQDNSG----NVNTGYSVNG----FYSSRGFIKLHG-STVME--NELILDESLQPLPNAPQPAKVTATVETKQKGAFEDE 307 (463) Q Consensus 239 ~~~n~g----~~~~G~~v~~----~~s~~G~i~l~~-s~~~~--~d~~l~~~~~~~p~ap~p~~vtat~~~~~~g~~~~~ 307 (463) .+++.+ ..-.|++|-- +....+ ... ..++. .+..... .-..++........ ..+ T Consensus 290 ~~~~~~~~~~~~l~G~PV~~~~~~~~~~~~---~~~~~~~~gd~~~~~~~~---------~~~~~~i~~~~~~~-~~~-- 354 (397) T protein:vir:48 290 MERDVKSPTGYSIDGFAVKEVADRWLANAS---SGAMPLYFGDLKQAVTLF---------DRQQMSLLSTNIGG-GAF-- 354 (397) T ss_pred eccCcCCCCCceeccceeEEecccccCCcC---CCceEEEEEeccceEEEE---------eecceEEEEeccch-hhh-- Confidence 444322 2234554421 110000 000 01110 0000000 00011111111110 000 Q ss_pred ccccceEEEEEE----EecCCccccccceeeeecCCCCceEEEE Q lcl|NC_019448. 308 EDRAGLSYKVVV----NSDDAQSAPSEEVTATVSNVDDGVKLSI 347 (463) Q Consensus 308 ~~~a~ysYkV~a----~s~~geS~~S~~vt~Tva~~~~gv~ltI 347 (463) ......|++.. .-.+.++.....++++.+...+...+-+ T Consensus 355 -~~~~~~~r~~~r~d~~~~~~~a~~~~~~~~~~~~~~~~~~~~~ 397 (397) T protein:vir:48 355 -ETDTTKIRVIDRFDVVATDTESFVPASFKAIADQKGNLGSTAV 397 (397) T ss_pred -hcCceeEEEEeeeccEEecccceEEEEecccccCCCCccccCC Confidence 00011111110 0111122111111111110000000000 No 30 >protein:vir:95318 Length: 328 # NCBI annotation: hypothetical protein # Family: family:all:1903 # MgeID: mge:1564 # MgeName: phiV10 # Cross-refs: genbank:acc:YP_512264;genbank:gi:89152431;genbank:GeneID:3952987 Probab=97.58 E-value=4.7e-06 Score=49.80 Aligned_cols=234 Identities=18% Similarity=0.145 Sum_probs=113.6 Q ss_pred CCCCCccchHHHHhhhhhhHHHHHHhhcCCccCCccccCccccchhhhhhHhhhhhccccccchhhhcccchhh-HHHhh Q lcl|NC_019448. 1 MTIEKNLSDVQQKYADQFQEDVVKSFQTGYGITPDTQIDAGALRREILDDQITMLTWTNEDLIFYRDISRRPAQ-STVVK 79 (463) Q Consensus 1 ~~~~~~~~~~~~~~~k~~~e~~~Ks~~agy~~~p~~q~~gaalr~esLd~~i~~L~~~~~df~f~~~i~k~~~~-stv~e 79 (463) ||.=+..--+-..++|.+.. ..+..+ -|..|+..+ .++..+|=..++ .|=|. T Consensus 1 m~~~~~~~~TL~e~Akr~~~--------------------d~~~~~----VIE~l~~~n---~IL~~lpf~e~n~gt~~~ 53 (328) T protein:vir:95 1 MAVKGLTALTLADWGKRVDP--------------------NGKVDK----IIELLGQTN---PILQDMPFVEGNLPTGHR 53 (328) T ss_pred CCccccccccHHHHHhhhCc--------------------chhHHH----HHHHHhccc---hhHhhcceeecccCCcce Confidence 44332222222222221110 001111 122222222 234455444554 45578 Q ss_pred hhhhhccCcccccccccccCcccccCcceEEEEEEEEEeechhhhhhhhhhhc-ccccHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_019448. 80 YDQYLRHGNVGHSRFVKEIGVAPVSDPNIRQKTVSMKYVSDTKNMSIASGLVN-NIADPSQILTEDAIAVVAKTIEWASF 158 (463) Q Consensus 80 y~~~~~hG~~g~~~fv~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lvn-~~~Dp~~~~~~~ai~~~~~~~E~a~f 158 (463) |+++..--+. .|..=....+-+.++..|++..++.|..-..|.+...-.+ +..+-+++|.+..+..+.+.++..+| T Consensus 54 ~~v~~~LP~~---~fR~lN~g~~~s~~tt~q~t~~l~ilgg~~eVDr~la~~~Gn~~~~ra~q~~~~~ka~~~~~~~~~i 130 (328) T protein:vir:95 54 TTIRSGLPSA---TWRLLNYGVQPSKSTTVQVTDSVGMLETYAEVDKSLADLNGNTAEFRLSEDRAFIEAMNQQMAQTLF 130 (328) T ss_pred eeEeeccCCc---eeeecCCccCcccceeEEEEEEEEEEecceeechHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHHh Confidence 8887755333 3433333456678899999999999999999977655444 46777999999999999999999999 Q ss_pred hcccccCCCccccccccccceeeec------CcceEeccCCCCCHHHHh------------------------------- Q lcl|NC_019448. 159 YGDASLTSEVEGEGLEFDGLAKLID------KNNVINAKGNQLTEKHLN------------------------------- 201 (463) Q Consensus 159 yGd~~l~~~~~~~gleFDGl~~lI~------~~nviDarG~~ls~~~ln------------------------------- 201 (463) |||+..+|. +||||.+... ..|+||+.|.--+.-.|+ T Consensus 131 yGdsa~~p~------~F~GL~~R~~~~s~~~a~qiidaGgtg~~~TSi~~v~~g~~~~~giyPkG~~~Gl~~~d~g~~~~ 204 (328) T protein:vir:95 131 YGDSSVNPQ------QFMGLSSRYSSLSAGNAQNIIDAGGTGTDNTSIWLVVWGENTVHGIFPKGKKAGIQMEDKGQVTL 204 (328) T ss_pred cCCccCChh------hhcchhhhcCccccccccceeecccCCCCceEEEEEEEcCCeEEEecccccccCceeeecCceee Confidence 999999884 8999998664 347888765433322211 Q ss_pred ------------------------------------------------------hhhhhhhhcCCce-eEEecCHHHHHH Q lcl|NC_019448. 202 ------------------------------------------------------EAAVRIGKGFGTA-TDAYMPIGVHAD 226 (463) Q Consensus 202 ------------------------------------------------------~aa~~i~~~~G~~-td~~m~~~vka~ 226 (463) +|..+|- +-|.. +-+||+-.++.. T Consensus 205 ~~~~g~~y~~y~~~~~w~~Gl~i~d~r~vvrI~NId~~~l~~~~~~~~l~~lm~~a~~~ip-~~~~~~~~~y~n~~v~~~ 283 (328) T protein:vir:95 205 EDANGGKYEGYRTHYKWDNGLALRDWRYVVRIANIDVSNLSEPSSAANIAKLMVKALHRIP-NRGMGRPVFYMNRTVGQA 283 (328) T ss_pred ecCCCCeeeEEEEEEEeeeeeEEcCcccEEEEecCcccccccccChhhHHHHHHHHHHHhc-cCCCCcceeehhHHHHHH Confidence 0001100 00111 113333333333 Q ss_pred HHHHhcCcceEEeecCCCCcccceecCeeeecccccccCCceeccCccccccccccCCCCCCCCee Q lcl|NC_019448. 227 FVNSILGRQMQLMQDNSGNVNTGYSVNGFYSSRGFIKLHGSTVMENELILDESLQPLPNAPQPAKV 292 (463) Q Consensus 227 f~~~~~~~qrv~~~~n~g~~~~G~~v~~~~s~~G~i~l~~s~~~~~d~~l~~~~~~~p~ap~p~~v 292 (463) |+-......-+.++. ..++-+.+-+.+| =-+..-|.++..+.. .| T Consensus 284 L~~q~~~~~n~~~~~------~~~~g~~~t~~~g------ipir~~dai~~tE~~---------vv 328 (328) T protein:vir:95 284 LDLQSLEKTSLAISV------KETEGEWWTSFRG------VPIRETDALLETEAR---------VV 328 (328) T ss_pred HHHHHhcCcceeeee------eccCCcceeEECC------eEEEEEeeeecCccc---------cC Confidence 322111111111000 0000111111111 112222333322211 11 No 31 >protein:vir:4339 Length: 395 # NCBI annotation: major head protein # Family: family:all:585 # MgeID: mge:93 # MgeName: D3 # Cross-refs: genbank:acc:NP_061502;genbank:gi:9635591;genbank:GeneID:1262860 Probab=97.57 E-value=4.3e-05 Score=44.50 Aligned_cols=305 Identities=12% Similarity=0.026 Sum_probs=143.8 Q ss_pred CCCCCccchH--H-HHhhhh-hhHHHHHHhhcCCc-------cCCccccCccccchhhhhhHhhhhhccccccchhhhcc Q lcl|NC_019448. 1 MTIEKNLSDV--Q-QKYADQ-FQEDVVKSFQTGYG-------ITPDTQIDAGALRREILDDQITMLTWTNEDLIFYRDIS 69 (463) Q Consensus 1 ~~~~~~~~~~--~-~~~~k~-~~e~~~Ks~~agy~-------~~p~~q~~gaalr~esLd~~i~~L~~~~~df~f~~~i~ 69 (463) .......... + +...+. ....+.+.+..+.. .+..+..+|+.+.-+.. .+|..+ -.+...+++-++ T Consensus 72 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~vp~~~~-~~ii~~--~~~~~~l~~l~~ 148 (395) T protein:vir:43 72 EKRDGGEEAPKTAGQMVAESLKEQGVTSSLRGSHRVSMPRSAITSIDGSGGALVAPDRR-PGVVAA--PQRRLTIRDLVA 148 (395) T ss_pred hccccccchhhhHHHHHHHHHHHHHHHHHhhhhhhhhhhhhhhcccCCCCccccchhhH-HHHHHH--HHhhhhHHhhcc Confidence 1111111111 1 000000 00111122212211 11223334555555544 445332 234445666677 Q ss_pred cchhhHHHhhhhhhhccCcccccccccccCcccccCcceEEEEEEEEEeechhhhhhhhhhhcccccHHHHHHHHHHHHH Q lcl|NC_019448. 70 RRPAQSTVVKYDQYLRHGNVGHSRFVKEIGVAPVSDPNIRQKTVSMKYVSDTKNMSIASGLVNNIADPSQILTEDAIAVV 149 (463) Q Consensus 70 k~~~~stv~ey~~~~~hG~~g~~~fv~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lvn~~~Dp~~~~~~~ai~~~ 149 (463) ..++.+...+|.+.... .+...+++|++..+.+++.+......++-++....+|.-+ +. ...+.+....+.-...+ T Consensus 149 ~~~~~~~~~~~~~~~~~--~~~a~~v~E~~~~~~~~~~~~~i~~~~~k~~~~~~is~el-l~-d~~~l~~~v~~~la~a~ 224 (395) T protein:vir:43 149 PGTTESNSVEYVRETGF--VNNAAPVSEGTQKPYSDLTFELENAPVRTIAHLFKASRQI-LD-DASALQSYIDARARYGL 224 (395) T ss_pred ceecCCCceEEEEEecC--CCceeeecCCccccccccceeEEEEeeeeEEEeehhhHHH-HH-hHHHHHHHHHHHHHHHH Confidence 77776666667666533 3346789999999999999999999999999888888764 33 33467888888888999 Q ss_pred HHHHHHHHhhcccccCCCccccccccccceeeecCcc--eEeccCCCCCHHHHhhhhhhhhhcCCceeEEecCHHHHHHH Q lcl|NC_019448. 150 AKTIEWASFYGDASLTSEVEGEGLEFDGLAKLIDKNN--VINAKGNQLTEKHLNEAAVRIGKGFGTATDAYMPIGVHADF 227 (463) Q Consensus 150 ~~~~E~a~fyGd~~l~~~~~~~gleFDGl~~lI~~~n--viDarG~~ls~~~ln~aa~~i~~~~G~~td~~m~~~vka~f 227 (463) +..++.++++|+-. +-.+.|+.+...... ....-......+.|..+...+..+|+.++-++||+.+.+.+ T Consensus 225 ~~~~d~~~l~G~g~--------~~~~~Gi~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~vmn~~~~~~l 296 (395) T protein:vir:43 225 MLVEECQLLYGNGT--------GANLHGIIPQAQAYAPPSGVVVTAEQRIDRIRLAILQAQLAEFPASGIVLNPIDWALI 296 (395) T ss_pred HHHHHHHHHhccCC--------CCccccccccccccccccccccccchhHHHHHHHHHhhccccCCCcEEEEcHHHHHHH Confidence 99999999999743 124777766443211 11122233445667777778888898888999999999988 Q ss_pred HHHhcCcceEEeecCCCCc----ccceecCeeeecccccccCCceeccC--ccccccccccCCCCCCCCeeEEEEeccCC Q lcl|NC_019448. 228 VNSILGRQMQLMQDNSGNV----NTGYSVNGFYSSRGFIKLHGSTVMEN--ELILDESLQPLPNAPQPAKVTATVETKQK 301 (463) Q Consensus 228 ~~~~~~~qrv~~~~n~g~~----~~G~~v~~~~s~~G~i~l~~s~~~~~--d~~l~~~~~~~p~ap~p~~vtat~~~~~~ 301 (463) ...--..-|.+.++ +.+. -.|++| +.+.. + -.+.+++.+ ...+... -..++........ T Consensus 297 ~~lkd~~G~~i~~~-~~~~~~~~l~G~pV--v~~~~--~-~~~~~~~gd~~~~~~~~~---------~~~~~i~~~~~~~ 361 (395) T protein:vir:43 297 ELNKDAENRYIIGS-PQNGTTPTLWRLPV--VETQA--I-TQDEFLTGAFSLGAQIFD---------RMDIEVLVSTEND 361 (395) T ss_pred HHhhccCCceeccc-cccCCCceecceee--EEcCC--C-CCCcEEEEeccceEEEEE---------ecceEEEEecccc Confidence 75332222333332 1111 122221 01100 0 000000000 0000000 0000111111100 Q ss_pred CcCcccccccceEEEEEEEecCCccccccceeeeecCCCCceEEEEEec Q lcl|NC_019448. 302 GAFEDEEDRAGLSYKVVVNSDDAQSAPSEEVTATVSNVDDGVKLSISVN 350 (463) Q Consensus 302 g~~~~~~~~a~ysYkV~a~s~~geS~~S~~vt~Tva~~~~gv~ltIt~~ 350 (463) ..|. .....|++...-+.+=--|... +.+++|.. T Consensus 362 ~~f~----~~~~~~r~~~r~d~~v~~~~a~-----------~~~~~taa 395 (395) T protein:vir:43 362 KDFE----NNMVTIRAEERLAFAVYRPEAF-----------VTGSLTAS 395 (395) T ss_pred chhh----cCcEEEEEEEeeccEEecccce-----------EEEEeccC Confidence 0000 0011222221111111112222 22333211 No 32 >protein:vir:100135 Length: 418 # NCBI annotation: gp5 # Family: family:all:585 # MgeID: mge:1639 # MgeName: phi1026b # Cross-refs: genbank:acc:NP_945035;genbank:gi:38707895;genbank:GeneID:2744182 Probab=97.56 E-value=4.6e-05 Score=44.38 Aligned_cols=311 Identities=11% Similarity=-0.005 Sum_probs=143.0 Q ss_pred CCCCCccchHH---HHhhhhh-hHHHHHHhhcCCc-------------cCCccccCccccchhhhhhHhhhhhccccccc Q lcl|NC_019448. 1 MTIEKNLSDVQ---QKYADQF-QEDVVKSFQTGYG-------------ITPDTQIDAGALRREILDDQITMLTWTNEDLI 63 (463) Q Consensus 1 ~~~~~~~~~~~---~~~~k~~-~e~~~Ks~~agy~-------------~~p~~q~~gaalr~esLd~~i~~L~~~~~df~ 63 (463) ...+.+..... +.+.+.- ...+.+.+..+.. ....+-.+|+.|--+.+..+|..+. .+.-. T Consensus 87 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~lvp~~~~~~ii~~~--~~~~~ 164 (418) T protein:vir:10 87 GGGSAELETPKTLGQLVTESEEMKGMDGSARKSVRVRVDRKSIMNVPATVGSGVSGSNSLVVADRQAGIIAPP--QRKMT 164 (418) T ss_pred cccccccchhhhhhHHhhhHHHHHHHHHHHhhhhhhhhHHHHHHHhhhhccCCCCCCccccchhHHHHHHHHH--hhhhh Confidence 22222221111 1111000 0111111111100 0011122334343334444443222 33444 Q ss_pred hhhhcccchhhHHHhhhhhhhccCcccccccccccCcccccCcceEEEEEEEEEeechhhhhhhhhhhcccccHHHHHHH Q lcl|NC_019448. 64 FYRDISRRPAQSTVVKYDQYLRHGNVGHSRFVKEIGVAPVSDPNIRQKTVSMKYVSDTKNMSIASGLVNNIADPSQILTE 143 (463) Q Consensus 64 f~~~i~k~~~~stv~ey~~~~~hG~~g~~~fv~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lvn~~~Dp~~~~~~ 143 (463) +++-++..++.+.-.+|.+.... .....++.|++..+.+++.+......++-++.--.+|.- +.+.-.|.+....+ T Consensus 165 l~~~~~~~~~~~~~~~~~~~~~~--~~~a~~v~E~~~~~~~~~~f~~v~~~~~k~~~~~~is~e--ll~ds~~l~~~i~~ 240 (418) T protein:vir:10 165 IRDLLMPGQTSSSSIEYTVETGF--TNNAAAVAEGAQKPTSDLKFNLKNQPVRTIAHLFKASRQ--ILDDAPALQSYIDG 240 (418) T ss_pred HHhhcceeeccCCceeEEEEecC--CCceeeeccCccccccccceeeEEEeeeeEEEeehhhHH--HHHhHHHHHHHHHH Confidence 55555555555433345554433 235678999999999999999999999999987777765 33333578889999 Q ss_pred HHHHHHHHHHHHHHhhcccccCCCccccccccccceeeecCcceEeccCCCCCHHHHhhhhhhhhhcCCceeEEecCHHH Q lcl|NC_019448. 144 DAIAVVAKTIEWASFYGDASLTSEVEGEGLEFDGLAKLIDKNNVINAKGNQLTEKHLNEAAVRIGKGFGTATDAYMPIGV 223 (463) Q Consensus 144 ~ai~~~~~~~E~a~fyGd~~l~~~~~~~gleFDGl~~lI~~~nviDarG~~ls~~~ln~aa~~i~~~~G~~td~~m~~~v 223 (463) .-...+++.++.++|+|+-.= -+..|+.+.......--.-....+.+.|..+...+..+|+..+-++|++.+ T Consensus 241 ~l~~a~~~~~d~a~l~G~g~~--------~~p~Gi~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~v~n~~~ 312 (418) T protein:vir:10 241 RARYGLQLTEEGQILKGDGTG--------ANILGILPQASAFMPSITLANATPIDKIRLALLQAVLAEFPATGIVLNPID 312 (418) T ss_pred HHHHHHHHHHHHHHhccCCCC--------ccccccccccccccccccccccccHHHHHHHHHhhccccCCCCEEEEcHHH Confidence 999999999999999997641 135677665443222222223345566677777788888888889999999 Q ss_pred HHHHHHHhcCcceEEeecC---CCCcccceecCeeeecccccccCCceeccCcccccccc-ccCCCCCCCCeeEEEEecc Q lcl|NC_019448. 224 HADFVNSILGRQMQLMQDN---SGNVNTGYSVNGFYSSRGFIKLHGSTVMENELILDESL-QPLPNAPQPAKVTATVETK 299 (463) Q Consensus 224 ka~f~~~~~~~qrv~~~~n---~g~~~~G~~v~~~~s~~G~i~l~~s~~~~~d~~l~~~~-~~~p~ap~p~~vtat~~~~ 299 (463) .+.+...-=..-|.+.++. .++.-.|++| +.+.. + -.|.+++- |... ...- .-..++....+. T Consensus 313 ~~~L~~lkd~~G~~i~~~~~~~~~~~l~G~pV--~~~~~--~-p~~~~~~g-----d~s~~~~~~---~~~~~~i~~~~~ 379 (418) T protein:vir:10 313 WASIELTKDSQGRYIVGNPVNGTTPRLWNLPV--VETQA--M-TANEFLVG-----AFSMAAQIF---DRMEIEVLLSTE 379 (418) T ss_pred HHHHHHhhcCCCceeccccccCCCceecceee--EEcCC--C-CCCcEEEe-----eccceEEEE---EecceEEEEecc Confidence 9998754322233343321 1111223222 11110 0 00111110 0000 0000 000111111111 Q ss_pred CCCcCcccccccceEEEEEEEecCCccccccceeeeecCCCCceEEEEEecCCCC Q lcl|NC_019448. 300 QKGAFEDEEDRAGLSYKVVVNSDDAQSAPSEEVTATVSNVDDGVKLSISVNAMYQ 354 (463) Q Consensus 300 ~~g~~~~~~~~a~ysYkV~a~s~~geS~~S~~vt~Tva~~~~gv~ltIt~~a~~g 354 (463) ....|. .....|++...-+ .-+....+=+.++++..+ .| T Consensus 380 ~~~~f~----~~~~~~r~~~~~d-----------~~~~~~~a~~~~~~~~~~-~g 418 (418) T protein:vir:10 380 NVDDFE----KNMVSIRAEERLA-----------LAVYRPESFVTGALVEQA-GG 418 (418) T ss_pred cchhhh----cCceEEEEEEeec-----------cEEecccceEEEEeccCC-CC Confidence 111000 0112222221111 111112222334444222 23 No 33 >protein:vir:2344 Length: 397 # NCBI annotation: gp14 # Family: family:all:507 # MgeID: mge:51 # MgeName: Bxb1 # Cross-refs: genbank:acc:NP_075281;genbank:gi:12657868;genbank:GeneID:920118 Probab=97.55 E-value=4.6e-05 Score=44.36 Aligned_cols=322 Identities=13% Similarity=0.093 Sum_probs=154.8 Q ss_pred hhhhHHHHHHhhcCCccCCccccCccccchhhhhhHhhhhhccccccchhhhcccchhhHHHhhhhhhhccCcccccccc Q lcl|NC_019448. 16 DQFQEDVVKSFQTGYGITPDTQIDAGALRREILDDQITMLTWTNEDLIFYRDISRRPAQSTVVKYDQYLRHGNVGHSRFV 95 (463) Q Consensus 16 k~~~e~~~Ks~~agy~~~p~~q~~gaalr~esLd~~i~~L~~~~~df~f~~~i~k~~~~stv~ey~~~~~hG~~g~~~fv 95 (463) =-++++..++.+++ +..+|+.|..|..++-|..|.. ...+.+-..+.+..+.-.+|.++.. .....|+ T Consensus 1 ~g~~~e~~~~~~~~------t~~~~g~l~~~~~~~ii~~l~~---~s~i~~l~~~~~~~~~~~~ip~~~~---~~~a~wv 68 (397) T protein:vir:23 1 MGFSADHSQIAQTK------DTMFTGYLDPVQAKDYFAEAEK---TSIVQRVAQKIPMGATGIVIPHWTG---DVSAQWI 68 (397) T ss_pred CCcCHHHHHHhhcc------CCCCccccchhHHHHHHHHHHh---ccchhhhcceeeccCCceEEEEEcC---CcceEEe Confidence 22344444444442 2234566777777766665543 3445555555555554445665553 3356799 Q ss_pred cccCcccccCcceEEEEEEEEEeechhhhhhhhhhhcccccHHHHHHHHHHHHHHHHHHHHHhhcccccCCCcccccccc Q lcl|NC_019448. 96 KEIGVAPVSDPNIRQKTVSMKYVSDTKNMSIASGLVNNIADPSQILTEDAIAVVAKTIEWASFYGDASLTSEVEGEGLEF 175 (463) Q Consensus 96 ~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lvn~~~Dp~~~~~~~ai~~~~~~~E~a~fyGd~~l~~~~~~~gleF 175 (463) +|++..+.+++.+.+....+|=++---.+|.-+-. ++..|.+....+.-...+++.+|.++++|+.+ + ... T Consensus 69 ~Eg~~~~~s~~~f~~v~l~~~k~~~~v~iS~ell~-ds~~~l~~~i~~~l~~aia~~~d~a~l~G~gt--~------~~~ 139 (397) T protein:vir:23 69 GEGDMKPITKGNMTKRDVHPAKIATIFVASAETVR-ANPANYLGTMRTKVATAIAMAFDNAALHGTNA--P------SAF 139 (397) T ss_pred cCCccccccccceeEEEEeeEEEEEeehhhHHHHh-cchHHHHHHHHHHHHHHHHHHHHHHHhhcccC--C------ccc Confidence 99999999999999999999999888888765433 44578899999999999999999999999976 2 124 Q ss_pred ccceeeecCcceEeccCCCCCHHHHhhhhhhhhhcCCceeEEecCHHHHHHHHHHhcCcceEEeecCCCCc--------- Q lcl|NC_019448. 176 DGLAKLIDKNNVINAKGNQLTEKHLNEAAVRIGKGFGTATDAYMPIGVHADFVNSILGRQMQLMQDNSGNV--------- 246 (463) Q Consensus 176 DGl~~lI~~~nviDarG~~ls~~~ln~aa~~i~~~~G~~td~~m~~~vka~f~~~~~~~qrv~~~~n~g~~--------- 246 (463) .|+.......+. ..+.. ..+.+-.+...+...|....-..|++.+.+.|...--..-|.+.+++.+.. T Consensus 140 ~~~~~~~~~~~~--~~~~~-~~~~~~~~~~~l~~~~~~~a~~vmn~~~~~~L~~lkd~~G~~i~~~~~~~~~~~~~~~~t 216 (397) T protein:vir:23 140 QGYLDQSNKTQS--ISPNA-YQGLGVSGLTKLVTDGKKWTHTLLDDTVEPVLNGSVDANGRPLFVESTYESLTTPFREGR 216 (397) T ss_pred ccccccccceee--ecccc-hhHHHHHHHHhhhhcccCCCEEEEcHHHHHHHHHhhccCCceeecccccccccccccCce Confidence 444443322222 22222 334444555567778888888999999999998533222233333322111 Q ss_pred ccceecC------------------e-eeecccccccCCc----------------eeccCcccccc----ccc--cCCC Q lcl|NC_019448. 247 NTGYSVN------------------G-FYSSRGFIKLHGS----------------TVMENELILDE----SLQ--PLPN 285 (463) Q Consensus 247 ~~G~~v~------------------~-~~s~~G~i~l~~s----------------~~~~~d~~l~~----~~~--~~p~ 285 (463) -.|+++- + ++..++.+.+.-+ ..+..|....| -.+ ..|. T Consensus 217 l~G~Pv~~s~~~~~g~~~~~~gDfs~~~i~~~~~i~i~~~~e~~~~~~~~~~~~~~~lf~~d~v~~ra~~r~d~~v~~~~ 296 (397) T protein:vir:23 217 ILGRPTILSDHVAEGDVVGYAGDFSQIIWGQVGGLSFDVTDQATLNLGSQESPNFVSLWQHNLVAVRVEAEYGLLINDVN 296 (397) T ss_pred eeeeeEEEeCCCCCCceEEEEeecceEEEEEEeceEEEEeeeeeeeeccccccceeeeeeccceeEEEEeeeccceeccc Confidence 1233310 0 1111122221100 00122222111 111 1122 Q ss_pred CC-------CCCeeEEEEeccCCCcCcccccccceEEEEEEE--------------ecCC--ccccc--cceeeeecCCC Q lcl|NC_019448. 286 AP-------QPAKVTATVETKQKGAFEDEEDRAGLSYKVVVN--------------SDDA--QSAPS--EEVTATVSNVD 340 (463) Q Consensus 286 ap-------~p~~vtat~~~~~~g~~~~~~~~a~ysYkV~a~--------------s~~g--eS~~S--~~vt~Tva~~~ 340 (463) +. ...+.+.++.+. .+++|+++.. .-.+ |.++- ..-+.+|.. T Consensus 297 a~~~~~~~~~~~~~~~~~~~~-----------~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~~-- 363 (397) T protein:vir:23 297 AFVKLTFDPVLTTYALDLDGA-----------SAGNFTLSLDGKTSANIAYNASTATVKSAIVAIDDGVSADDVTVTG-- 363 (397) T ss_pred ceEEEeeccccceeeeccccc-----------CcceEEEEecCccccCcccccchhhhHHHhhhcccccccceeeeec-- Confidence 11 112222222222 2344555442 1111 11110 111112222 Q ss_pred CceEEEEEecCCCCCCcceEEEEeecCCCceEEEEEEeeeeeecCCceEEEEeccCCCCCCc Q lcl|NC_019448. 341 DGVKLSISVNAMYQQQPQFVSIYRQGKETGMYFLIKRVPVKDAQEDGTIVFVDKNETLPETA 402 (463) Q Consensus 341 ~gv~ltIt~~a~~g~~~~~y~IYR~~~~~g~~~li~rv~~s~~n~~gtttf~D~N~~iPgt~ 402 (463) +|.--+|++++-..+ +.-.++ .. ++ ...++-+|. T Consensus 364 ~~~~~~~~~~~~~~~--------------~~~~~~------~~--~~------~~~~~~~~~ 397 (397) T protein:vir:23 364 SAGDYTITVPGTLTA--------------DFSGLT------DG--EG------ASISVVSVG 397 (397) T ss_pred CCceeEEEecccccc--------------Cccccc------cC--cc------ccceeeecC Confidence 122333333211111 000010 00 00 001111111 No 34 >protein:vir:78523 Length: 338 # NCBI annotation: Putative head structural protein # Family: family:all:507 # MgeID: mge:1853 # MgeName: U2 # Cross-refs: genbank:acc:YP_001491585;genbank:gi:157786408;genbank:GeneID:5625675 Probab=97.55 E-value=1.6e-05 Score=46.85 Aligned_cols=302 Identities=13% Similarity=0.029 Sum_probs=143.2 Q ss_pred hhhhhHHHHHHhhcCCccCCccccCcc-ccchhhhhhHhhhhhccccccchhhhcccchhhHHHhhhhhhhc-----cCc Q lcl|NC_019448. 15 ADQFQEDVVKSFQTGYGITPDTQIDAG-ALRREILDDQITMLTWTNEDLIFYRDISRRPAQSTVVKYDQYLR-----HGN 88 (463) Q Consensus 15 ~k~~~e~~~Ks~~agy~~~p~~q~~ga-alr~esLd~~i~~L~~~~~df~f~~~i~k~~~~stv~ey~~~~~-----hG~ 88 (463) +-.++|- ++.++|-.........++ -+..|..++-+..+ .+.-.+.+..++.+..+--.+|.++.. |-+ T Consensus 1 ~~~~~e~--~~~~~~~~~~~~~~~~~~~liP~~~~~~ii~~~---~~~s~l~~l~~~~~~~~~~~~ip~~~~~~~a~~v~ 75 (338) T protein:vir:78 1 MATLNEL--APNTAGSNHQGRLAHVPSDLLPKEIVGPIFDKA---QESSLVLRLGENIPISYGETIIPTTVKRPEVGQVG 75 (338) T ss_pred CcchHHh--hhhhcccccccceecccccccchHHHHHHHHHH---HhhchhhhhcceeeccCCceEEEEEecCccceeec Confidence 3233332 555555322222222333 45555555444443 333345555566666665555555442 223 Q ss_pred ccccccccccCcccccCcceEEEEEEEEEeechhhhhhhhhhhcccccHHHHHHHHHHHHHHHHHHHHHhhcccccCCCc Q lcl|NC_019448. 89 VGHSRFVKEIGVAPVSDPNIRQKTVSMKYVSDTKNMSIASGLVNNIADPSQILTEDAIAVVAKTIEWASFYGDASLTSEV 168 (463) Q Consensus 89 ~g~~~fv~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lvn~~~Dp~~~~~~~ai~~~~~~~E~a~fyGd~~l~~~~ 168 (463) .+...+++|++..+..++++.......+=++--..+|.-+ +.++..|.+....+.-...+.+.+|.++++|+.+-.+ T Consensus 76 ~~~~~~~~Eg~~~~~~~~~f~~v~l~~~k~~~~~~is~el-l~ds~~~~~~~i~~~la~a~~~~~d~~~l~G~g~~~~-- 152 (338) T protein:vir:78 76 VGTSNEQREGGTKPLSGTAWDTRSVAPIKLATIVTVSEEF-ARMNPSGLYTKLQADLAYAIGRGIDLAVFHGKSPLTG-- 152 (338) T ss_pred ccccccccccccccccccceeEEEEEEEEEEEeehhhHHH-HhcCHHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcc-- Confidence 3457789999999999999999999998888777777632 2345578888999999999999999999999987544 Q ss_pred cccccccccceeeecCc--ceEec--cCCCCCHHHHhhhhhhhhh-cCCceeEEecCHHHHHHHHHH--hcC-cceEEee Q lcl|NC_019448. 169 EGEGLEFDGLAKLIDKN--NVINA--KGNQLTEKHLNEAAVRIGK-GFGTATDAYMPIGVHADFVNS--ILG-RQMQLMQ 240 (463) Q Consensus 169 ~~~gleFDGl~~lI~~~--nviDa--rG~~ls~~~ln~aa~~i~~-~~G~~td~~m~~~vka~f~~~--~~~-~qrv~~~ 240 (463) .++.|+.+..... ...|. .+.....+.|..+...+.. ....++-.+|++.+.+.|... +.+ ..|.+.+ T Consensus 153 ----~~~~gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~m~~~~~~~L~~~~~l~d~~g~~l~~ 228 (338) T protein:vir:78 153 ----SALQGIDTNNVIVNTTNVDYLQTGTTPLLDRFLDGYDLVSANTDVDFNGWAADPRYRARLLRSQAYRDANGNVDPT 228 (338) T ss_pred ----ccccccccccccccccccccccccchhhHHHHHHHHHHhhhhccccceEEEEchHHHHHHHHHhhhccCCCceeec Confidence 3577777643322 22222 1222334556666655654 457888899999999988653 223 3344544 Q ss_pred cCCCC----cccceecC--eeeec-ccccccC-Cceecc----------CccccccccccCCCCCCCCeeEEEEeccCCC Q lcl|NC_019448. 241 DNSGN----VNTGYSVN--GFYSS-RGFIKLH-GSTVME----------NELILDESLQPLPNAPQPAKVTATVETKQKG 302 (463) Q Consensus 241 ~n~g~----~~~G~~v~--~~~s~-~G~i~l~-~s~~~~----------~d~~l~~~~~~~p~ap~p~~vtat~~~~~~g 302 (463) ..... .-.|++|- ..+-. .+...-. +-.++. .+.-+...+.... .--..+.+.+ T Consensus 229 ~~~~~~~~~~l~G~PV~~~~~ip~~~~~~~~~~~~~~~gdfs~~~~~~~~~~~i~~~~~~~~--------~~~~~~~~~~ 300 (338) T protein:vir:78 229 RINLAASAGDLLGLPVQFGKAVGGDLGAATDSKVRVVGGDFSQLKYGFADEIRVKMSDTATL--------TDNTSPTPQT 300 (338) T ss_pred ccccCCCCceeeeeeEEEccccCccccccCCcccEEEEEecceEEEEeecccEEEEeecccc--------cccccccccc Confidence 32221 23455431 00000 0000000 001111 0000100000000 0000000000 Q ss_pred cCcccccccce----EEEEEEEecCCccccccceeeeecCC Q lcl|NC_019448. 303 AFEDEEDRAGL----SYKVVVNSDDAQSAPSEEVTATVSNV 339 (463) Q Consensus 303 ~~~~~~~~a~y----sYkV~a~s~~geS~~S~~vt~Tva~~ 339 (463) --.+..+.-.+ .+=..+.+..+ .. ....++-+.. T Consensus 301 ~~~~~~~~~~~r~~~r~d~~v~~~~a--~~-~l~~~~~~~~ 338 (338) T protein:vir:78 301 VSMWQTNQIAILIEVTFGWLLGDKQA--FV-KFVDDEDPDA 338 (338) T ss_pred hhhhhcCcEEEEEEEEeccEeecccc--eE-EEecccCCCC Confidence 00000111110 11101111100 00 0000000000 No 35 >protein:vir:95763 Length: 297 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1578 # MgeName: SMP # Cross-refs: genbank:acc:YP_950590;genbank:gi:119953785;genbank:GeneID:5076833 Probab=97.49 E-value=5.8e-05 Score=43.83 Aligned_cols=290 Identities=10% Similarity=0.044 Sum_probs=141.1 Q ss_pred CCCCCccchHHHHhhhhhhHHHHHHhhcCCccCCccccCccccchhhhhhHhhhhhccccccchhhhcccchhhHHHh-h Q lcl|NC_019448. 1 MTIEKNLSDVQQKYADQFQEDVVKSFQTGYGITPDTQIDAGALRREILDDQITMLTWTNEDLIFYRDISRRPAQSTVV-K 79 (463) Q Consensus 1 ~~~~~~~~~~~~~~~k~~~e~~~Ks~~agy~~~p~~q~~gaalr~esLd~~i~~L~~~~~df~f~~~i~k~~~~stv~-e 79 (463) |+ ....+++.. ++ +..+|+.+..|..++-+..+.... .+.+...+.+..+.-. . T Consensus 1 m~-----------------~~~~~~~~~---~~--t~~~~~lvP~~~~~~ii~~~~~~s---~l~~~~~~~~~~~~~~~~ 55 (297) T protein:vir:95 1 MT-----------------VQTFNPENV---LV--SQKKDGTLHKEFTDIIMKEVAQNS---LVMQLGQYQEMEGEQEKT 55 (297) T ss_pred CC-----------------ccccccccc---cc--cCCCcceechhHHHHHHHHHHhhc---hhhhhcceeecCCCccEE Confidence 11 111112111 11 223344455555544444443322 3455555544433222 2 Q ss_pred hhhhhccCcccccccccccCcccccCcceEEEEEEEEEeechhhhhhhhhhhcccccHHHHHHHHHHHHHHHHHHHHHhh Q lcl|NC_019448. 80 YDQYLRHGNVGHSRFVKEIGVAPVSDPNIRQKTVSMKYVSDTKNMSIASGLVNNIADPSQILTEDAIAVVAKTIEWASFY 159 (463) Q Consensus 80 y~~~~~hG~~g~~~fv~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lvn~~~Dp~~~~~~~ai~~~~~~~E~a~fy 159 (463) +.+.. +.....+++|++..+..++++.......+=++..-.+|..+ +.++..|.+....+.-...+.+.+|.++++ T Consensus 56 ~~~~~---~~~~a~~v~Eg~~~~~~~~~f~~v~l~~~k~~~~~~is~el-l~ds~~~l~~~i~~~la~ai~~~~d~a~l~ 131 (297) T protein:vir:95 56 VYVQT---DGISAYWVNETEKIKTDKPEVVPVTLKAHKLGIILVTSREA-LNYTWKKFFEDMKPQIVEAFYKKIDEAGLL 131 (297) T ss_pred EEEEc---CCceeEEeecCccccccccceeEEEEeeEEEEEeehhhHHH-HhcCHHHHHHHHHHHHHHHHHHHHHHHHhc Confidence 22222 22356799999999999999999999999999888887732 234556888888899999999999999999 Q ss_pred cccccCCCccccccccccceeeecCcceEeccCCCCCHHHHhhhhhhhhhcCCceeEEecCHHHHHHHHHHhcCcceEEe Q lcl|NC_019448. 160 GDASLTSEVEGEGLEFDGLAKLIDKNNVINAKGNQLTEKHLNEAAVRIGKGFGTATDAYMPIGVHADFVNSILGRQMQLM 239 (463) Q Consensus 160 Gd~~l~~~~~~~gleFDGl~~lI~~~nviDarG~~ls~~~ln~aa~~i~~~~G~~td~~m~~~vka~f~~~~~~~qrv~~ 239 (463) |+.+-. -.|+.+.+...+...+ .-++.+.|-++...+..+|...+-.+|++...+.+.+.--..-|.+. T Consensus 132 G~g~~~---------~~gi~~~~~~~~~~~~--~~~t~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~L~~l~d~~G~~i~ 200 (297) T protein:vir:95 132 GHDTPF---------ANSVAKAAKDANKVIG--GPINYDNILKLQDALYDADVEPNAFVSKIQNRSALREARDGNKVSIY 200 (297) T ss_pred ccCCcc---------cccccccccccceecc--cccCHHHHHHHHHHhhhccCCcCEEEEcHHHHHHHHHhhccCCceee Confidence 986532 2355555554444443 45677777788888888998888999999999999854322334455 Q ss_pred ecCCCCcccceecCeeeecccccccCCceeccCccccccccccCCCCCCCCeeEE------EEeccCCCcCcccccccce Q lcl|NC_019448. 240 QDNSGNVNTGYSVNGFYSSRGFIKLHGSTVMENELILDESLQPLPNAPQPAKVTA------TVETKQKGAFEDEEDRAGL 313 (463) Q Consensus 240 ~~n~g~~~~G~~v~~~~s~~G~i~l~~s~~~~~d~~l~~~~~~~p~ap~p~~vta------t~~~~~~g~~~~~~~~a~y 313 (463) +.+.+ .-.|.++- .+..... -.+..++.+ ...... .......+.. +...+..|...+.....-- T Consensus 201 ~~~~~-~l~G~Pv~--~~~~~~~-~~~~~~~gd-----~s~~~~-~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~ 270 (297) T protein:vir:95 201 DKAAN-TIDGITTV--DLKSARF-EKGDLLAGD-----FDNLIY-GVPYNITYKISEEGQISTITNADGTPINLFEQEMI 270 (297) T ss_pred cCCCC-cccceeeE--eecCCCC-CCceEEEEe-----cccEEE-EEecCeEEEEeeccccccccccCccchhhhhcCcE Confidence 44333 23355431 1111110 111111110 000000 0000000000 0000001110000000111 Q ss_pred EEEEEEEecCCccccccceeeeecCCCCceEEEEEecCC Q lcl|NC_019448. 314 SYKVVVNSDDAQSAPSEEVTATVSNVDDGVKLSISVNAM 352 (463) Q Consensus 314 sYkV~a~s~~geS~~S~~vt~Tva~~~~gv~ltIt~~a~ 352 (463) .+++...-+.+=--|...+ .|+.. +++ T Consensus 271 ~~r~~~~~d~~v~~~~a~~-----------~l~~a-t~~ 297 (297) T protein:vir:95 271 AIRATMDIAVMITKTDAFA-----------KLTPA-ERV 297 (297) T ss_pred EEEEEEEeccEeecccceE-----------EEeec-CCC Confidence 2222222211111111111 12211 111 No 36 >protein:vir:5120 Length: 615 # NCBI annotation: unknown # Family: family:all:1544 # MgeID: mge:114 # MgeName: PBC5 # Cross-refs: genbank:acc:NP_542277;genbank:gi:18071220;genbank:GeneID:929342 Probab=97.46 E-value=1.4e-05 Score=47.18 Aligned_cols=246 Identities=18% Similarity=0.188 Sum_probs=106.8 Q ss_pred hhhhc--CCceeE---EecCHHHHHHHHH-------HhcCcc-e---EEeecCCCCcccceecC-eeeecccccc----- Q lcl|NC_019448. 206 RIGKG--FGTATD---AYMPIGVHADFVN-------SILGRQ-M---QLMQDNSGNVNTGYSVN-GFYSSRGFIK----- 263 (463) Q Consensus 206 ~i~~~--~G~~td---~~m~~~vka~f~~-------~~~~~q-r---v~~~~n~g~~~~G~~v~-~~~s~~G~i~----- 263 (463) |+.++ -|+--. --+|-..|+-+-- .|.+.. | -++|.+..........+ +.++..+... T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~M~~I~i~~f~Ge~Prl~P~lLP~~~A~~A~N~~~~~G~ltP~~~~~~~~~~ 80 (615) T protein:vir:51 1 MVSTGTRRGTLRSRAPSRLHCYLKQGYLGMVAIKISAFAGEQPMLLPRLLPETGATAAMNVRLNDGGLTPINKPIEVATI 80 (615) T ss_pred CcccccccceecccCcceeeeeeecCceeeEEEeecccccccccchhhhccCcccceEEeeeecCCeeeeecCccccccc Confidence 44433 344321 1222222222100 000000 0 02222222222222222 2333222211 Q ss_pred --cCCcee-ccCccccccccc------cCC-----------------------CCCCCCe-eEEEEeccCCCcCcccccc Q lcl|NC_019448. 264 --LHGSTV-MENELILDESLQ------PLP-----------------------NAPQPAK-VTATVETKQKGAFEDEEDR 310 (463) Q Consensus 264 --l~~s~~-~~~d~~l~~~~~------~~p-----------------------~ap~p~~-vtat~~~~~~g~~~~~~~~ 310 (463) +...++ ...+-|+.-... |.. ..|+|.. +++. ..+.| ..+. T Consensus 81 ~~~~~~Tif~~~~~W~~w~~~V~av~sPvA~DRvy~tgdg~Pkv~~~~~sY~LgVpaPs~ap~~~--~~g~g----~~d~ 154 (615) T protein:vir:51 81 ATASQKTIYRHQGSWLSWPNVVNAVPGPVAQDRLYFTGDGAPKVKIGGVDYALKVPRPTGALTAA--LSGTG----SGDI 154 (615) T ss_pred ccccceeeeeecCceeccCCceeEccCCcccceeEEcCCCcceEeecccCccccccCCCccceEE--ecCCC----Cccc Confidence 122222 112225322211 111 1223322 2211 12222 1245 Q ss_pred cceEEEEEEEecCC-ccccccceeeeecCCCCceEEEEEecCCCCCCcceEEEEeecCC--CceEEEEEEeeeeeecCCc Q lcl|NC_019448. 311 AGLSYKVVVNSDDA-QSAPSEEVTATVSNVDDGVKLSISVNAMYQQQPQFVSIYRQGKE--TGMYFLIKRVPVKDAQEDG 387 (463) Q Consensus 311 a~ysYkV~a~s~~g-eS~~S~~vt~Tva~~~~gv~ltIt~~a~~g~~~~~y~IYR~~~~--~g~~~li~rv~~s~~n~~g 387 (463) ..+.|+++.+.+.| ||+||+......-..+..|.|+..+.+..+..-..-.|||+..+ +++|+|+++.+ -+ T Consensus 155 etr~Yv~TfVt~~GeES~PSp~S~~v~v~~g~tVtLs~~pa~~~~~~i~~rRIYRS~tg~~gtdy~lVAel~------as 228 (615) T protein:vir:51 155 QSRTYVYTWVTSFGEESAPCPASIIVDWKPGQTVTLSGFAATPGGRSITTQRIYRSQTGKTGTGLYLIAERA------AS 228 (615) T ss_pred cceEEEEEEEcCCCCcCCCCccceeeEecCCCeEEEeeccCCcCCCceeeEEEEEeccCCCceeeEEEeeec------cc Confidence 67889999888877 79998554332223556677776654444433345689998655 57999999864 36 Q ss_pred eEEEEec------cCCCC------------CCccc-------eecC-------Cch----H----H-----HHh------ Q lcl|NC_019448. 388 TIVFVDK------NETLP------------ETADV-------FVGE-------MSP----Q----V-----VHL------ 416 (463) Q Consensus 388 tttf~D~------N~~iP------------gt~~~-------fvGe-------~~p----q----v-----i~l------ 416 (463) +++|+|. ++.|| |-..| |.|+ +.| + + +.+ T Consensus 229 ~~sf~D~~~~~~Lg~~Lps~~w~~PP~~l~GL~~m~NGimAgF~GneV~FsEpy~PyAWP~~Yr~t~d~dIVaiA~~gt~ 308 (615) T protein:vir:51 229 AGNFTDNIAVDQFQEPLPSADWNEPPDGLAGLAEMPNGMMAAFVGRSIYFCEPYRPHAWPEKYSRNVGSDIVGIAALGSI 308 (615) T ss_pred ceeeeeccchhhcCcccccccccCcCcchhhhhccccceEEeecCCEEEEecCCCCcccchhcccCcCCCeeEEEecccE Confidence 6789998 34444 11112 2221 111 0 0 000 Q ss_pred ----------------hhhcchhhcCCcccCCcceeeeeeechhheecceeeEEEEEEeEecC Q lcl|NC_019448. 417 ----------------FELLPMMKLPLAQINASITFAVLWYGALALRAPKKWARIKNVRYIAV 463 (463) Q Consensus 417 ----------------~ellPm~k~pla~~na~~~~~V~~Yg~L~l~aPkk~~~ikNV~~~~~ 463 (463) -.-+-++||+..+.=-+-+-+|.+=|...--.|.-.+.|.+=.=..| T Consensus 309 LVV~TkG~PYl~sG~sP~sms~~kL~~~qpCvS~rsiV~~~~~v~Yas~dGLV~v~~~G~a~v 371 (615) T protein:vir:51 309 LVVVTKGKPYLLAGTHPDSMQQQQLEENLPCINARSIVDLGHAVCYASNDGLVAVRGDGSIRL 371 (615) T ss_pred EEEEEcCceEEEEcCChhhccccccccccccccccceeEecceEEeecCCceEEEecCCchhh Confidence 12224455554444445556666656555566666666644332222 No 37 >protein:vir:8187 Length: 311 # NCBI annotation: gp7 # Family: family:all:966 # MgeID: mge:153 # MgeName: Che9d # Cross-refs: genbank:acc:NP_817980;genbank:gi:29566414;genbank:GeneID:2700968 Probab=97.35 E-value=8.4e-05 Score=42.94 Aligned_cols=299 Identities=11% Similarity=0.019 Sum_probs=134.6 Q ss_pred hhcCCccCCccccCccccchhhhhhHhhhhhccccccchhhhcccchhhHHHhhhhhhhccCcccccccccccCcccccC Q lcl|NC_019448. 26 FQTGYGITPDTQIDAGALRREILDDQITMLTWTNEDLIFYRDISRRPAQSTVVKYDQYLRHGNVGHSRFVKEIGVAPVSD 105 (463) Q Consensus 26 ~~agy~~~p~~q~~gaalr~esLd~~i~~L~~~~~df~f~~~i~k~~~~stv~ey~~~~~hG~~g~~~fv~E~g~~~~~d 105 (463) |- +...|+.|--+.+.++|-.... ++..+.+-.+..+..+---+|.++. +.....+++|++..+.++ T Consensus 1 ma--------t~~~gg~lvP~~~~~~ii~~~~--~~s~i~~~~~~i~~~~~~~~~p~~~---~~~~a~wv~Eg~~~~~~~ 67 (311) T protein:vir:81 1 MV--------ALATGTFQLPKHLVPGVWQKAQ--GQSVLARLSMAEPQEFGEQQYMTLT---APPRGEVVGEGAQKSEST 67 (311) T ss_pred Cc--------eecCCceEcchhHHHHHHHHHH--hcchhhhhcceeecCCCceEEEEEe---CCceeEEeecCccccccc Confidence 21 2223444433334444422222 2223333333333332223444443 234567899999999999 Q ss_pred cceEEEEEEEEEeechhhhhhhhhhhc--ccccHHHHHHHHHHHHHHHHHHHHHhhcccccCCCccccccccccceeeec Q lcl|NC_019448. 106 PNIRQKTVSMKYVSDTKNMSIASGLVN--NIADPSQILTEDAIAVVAKTIEWASFYGDASLTSEVEGEGLEFDGLAKLID 183 (463) Q Consensus 106 ~~~~r~~~~~k~l~~~~~vs~~~~lvn--~~~Dp~~~~~~~ai~~~~~~~E~a~fyGd~~l~~~~~~~gleFDGl~~lI~ 183 (463) +++.+.+...+=++.--.+|.-+-..+ ...+.+....+.....+++.++.++++|+.+- .|..+.|+.+.+- T Consensus 68 ~~f~~v~l~~~kl~~~~~iS~ell~~~~d~~~~l~~~i~~~la~ai~~~~d~a~l~G~~~~------~~~~~~gi~~~~~ 141 (311) T protein:vir:81 68 ATFAPVTAIPRKVQVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPL------TGAALSGSPAKIL 141 (311) T ss_pred ceeeEEEEeeEEEEEeehhhHHHhhcCcccHHHHHHHHHHHHHHHHHHHHHHhhhccccCC------CCccccccccccc Confidence 999999999998887777766543322 23456888888899999999999999998643 2456888888663 Q ss_pred -CcceEeccCCCCC--HHHHhhhhhhhhhcCCceeEEecCHHHHHHHHHHhcCcceEEeecCCCCcccceecCeeeeccc Q lcl|NC_019448. 184 -KNNVINAKGNQLT--EKHLNEAAVRIGKGFGTATDAYMPIGVHADFVNSILGRQMQLMQDNSGNVNTGYSVNGFYSSRG 260 (463) Q Consensus 184 -~~nviDarG~~ls--~~~ln~aa~~i~~~~G~~td~~m~~~vka~f~~~~~~~qrv~~~~n~g~~~~G~~v~~~~s~~G 260 (463) ..+++...+.-.. ...|..+...+....+.++-..||+.+...+...--..-|.+.++..... T Consensus 142 ~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~vmn~~~~~~l~~lkd~~G~~l~~~~~~~~-------------- 207 (311) T protein:vir:81 142 DTTNIVELTTGTSATPDLAVEAAVGLVLGDNLSPDGVALDNTFSFMLATQRDSQGRKLYPELGFGT-------------- 207 (311) T ss_pred ccceeeeecccccchHHHHHHHHHHHhhhcCCCceEEEEcHHHHHHHHhhhccCCCeeecCccccC-------------- Confidence 4555554333222 34466666777776788888999999999886432112222222211110 Q ss_pred ccccCCceeccCccccccccccCCCCCCCCeeE-EEEeccCCCcCcccccccceEEEEE-----EEecCCccccccceee Q lcl|NC_019448. 261 FIKLHGSTVMENELILDESLQPLPNAPQPAKVT-ATVETKQKGAFEDEEDRAGLSYKVV-----VNSDDAQSAPSEEVTA 334 (463) Q Consensus 261 ~i~l~~s~~~~~d~~l~~~~~~~p~ap~p~~vt-at~~~~~~g~~~~~~~~a~ysYkV~-----a~s~~geS~~S~~vt~ 334 (463) .+.+++..+....+.....|....+.... +........-++|.. .+-|.+. -+++++.. .. T Consensus 208 ----~~~tl~G~Pv~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~gDfs---~~~i~~~~~~~~~~~~~~~~------~~ 274 (311) T protein:vir:81 208 ----DVASFAGLNAAVSDTVRGGPEAVTASTGVYRTTNPNVKAIAGDFS---AFRWGVQVSIPLELIEFGDP------DG 274 (311) T ss_pred ----CCceecceeEEecccccccccccccccchhcccCCccEEEEEecc---cEEEEEeccceEEEeccCCC------Cc Confidence 11112222222212111111111111111 111111111122221 1111110 00111100 00 Q ss_pred eecCCC-CceEEEEEecCCCCCCcceEEEEeecCCCceEEEEEEeeeeeecCC Q lcl|NC_019448. 335 TVSNVD-DGVKLSISVNAMYQQQPQFVSIYRQGKETGMYFLIKRVPVKDAQED 386 (463) Q Consensus 335 Tva~~~-~gv~ltIt~~a~~g~~~~~y~IYR~~~~~g~~~li~rv~~s~~n~~ 386 (463) ++.... +.+. +..+.|=+ .....-=+-+.+..+... T Consensus 275 ~~~~~~~~~v~--------------~r~~~r~d--~~v~~~~a~~~l~~a~~~ 311 (311) T protein:vir:81 275 LGDLKRQNQIA--------------IRAEVVYG--IGIMSTDAFAVVRDADES 311 (311) T ss_pred chhhhhcCcEE--------------EEEEEEec--cEeecccceEEEEeeccC Confidence 000000 0011 11111111 000000000001111111 No 38 >protein:vir:104085 Length: 320 # NCBI annotation: gp17 # Family: family:all:507 # MgeID: mge:1656 # MgeName: Che12 # Cross-refs: genbank:acc:YP_655596;genbank:gi:109392467;genbank:GeneID:4156953 Probab=97.31 E-value=4.1e-05 Score=44.62 Aligned_cols=290 Identities=13% Similarity=0.053 Sum_probs=133.9 Q ss_pred CCCCCccchHHHHhhhhhhHHHHHHhhcCCccCCccccCccccchhhhhhHhhhhhccccccchhhhcccchhhHHHhhh Q lcl|NC_019448. 1 MTIEKNLSDVQQKYADQFQEDVVKSFQTGYGITPDTQIDAGALRREILDDQITMLTWTNEDLIFYRDISRRPAQSTVVKY 80 (463) Q Consensus 1 ~~~~~~~~~~~~~~~k~~~e~~~Ks~~agy~~~p~~q~~gaalr~esLd~~i~~L~~~~~df~f~~~i~k~~~~stv~ey 80 (463) |.--..+.... ....+++ +..+|+.+..+...+.+..+.... .+.+..+..+..+.-.+| T Consensus 1 ~~~~~~~~~~~-----------~~~~~t~------~~~~~~~ip~~~~~~ii~~~~~~s---~l~~~~~~~~~~~~~~~~ 60 (320) T protein:vir:10 1 MAAGTAFQVDH-----------AQIAQTG------DTMFKGYLEPEQAKDYFAEAEKTS---IVQQFAQKVPMGTTGQKI 60 (320) T ss_pred CCCCccCCHHH-----------HHhhccc------cccccccccHHHHHHHHHHHHhcc---chhhhcceeeccCCceEE Confidence 21111111111 0111111 223455677777766665555433 456666666665544455 Q ss_pred hhhhccCcccccccccccCcccccCcceEEEEEEEEEeechhhhhhhhhhhcccccHHHHHHHHHHHHHHHHHHHHHhhc Q lcl|NC_019448. 81 DQYLRHGNVGHSRFVKEIGVAPVSDPNIRQKTVSMKYVSDTKNMSIASGLVNNIADPSQILTEDAIAVVAKTIEWASFYG 160 (463) Q Consensus 81 ~~~~~hG~~g~~~fv~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lvn~~~Dp~~~~~~~ai~~~~~~~E~a~fyG 160 (463) .+... .....+++|++..+.+++++.+....++=++..-.+|.-+-. ++..|.+....+.-...+++.+|.++|+| T Consensus 61 p~~~~---~~~a~~v~E~~~~~~~~~~f~~v~~~~~k~~~~~~is~ell~-ds~~~l~~~i~~~l~~a~a~~~d~a~l~G 136 (320) T protein:vir:10 61 PHWIG---DVSAQWIGEGDMKPITKGNMTSQNIAPHKIATIFVASAETVR-ANPANYLGTMRTKVATAFAMAFDSAALNG 136 (320) T ss_pred EEEeC---CcceEEecCCccccccccceeEEEEeeEEEEEeehhhHHHHh-cChHHHHHHHHHHHHHHHHHHHHHHhhcc Confidence 55552 234679999999999999999999999999988888876433 44568888999999999999999999999 Q ss_pred ccccCCCccccccccccceeeecC--cceEeccCCCCCHHHHhhhhhhhhhcCCceeEEecCHHHHHHHHHHhcCcceEE Q lcl|NC_019448. 161 DASLTSEVEGEGLEFDGLAKLIDK--NNVINAKGNQLTEKHLNEAAVRIGKGFGTATDAYMPIGVHADFVNSILGRQMQL 238 (463) Q Consensus 161 d~~l~~~~~~~gleFDGl~~lI~~--~nviDarG~~ls~~~ln~aa~~i~~~~G~~td~~m~~~vka~f~~~~~~~qrv~ 238 (463) +.+-.+ +.+.|+.+.... .....+.+-..-.+.+-.+...+...+....-.+||+.+.+.+...--..-|.+ T Consensus 137 ~g~~~~------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~n~~~~~~L~~lkd~~G~~l 210 (320) T protein:vir:10 137 TDSPFP------TYLAQTTKSVSLADPGGATASDLTAYDAVAVNGLSLLVNAKKKWTHTLLDDIVEPILNGAKDKNGRPL 210 (320) T ss_pred cCCCCC------cccccccccccceecccccccccccHHHHHHHHHhhhhcccCCCcEEEEcHHHHHHHHHhhccCCcee Confidence 975322 334444433321 111111111111233445555566677778889999999999975322222333 Q ss_pred eecCCCC---------cccceecCeeeecccccccCCceec-----------cCccccc--cccccCCCCCCCCeeEEEE Q lcl|NC_019448. 239 MQDNSGN---------VNTGYSVNGFYSSRGFIKLHGSTVM-----------ENELILD--ESLQPLPNAPQPAKVTATV 296 (463) Q Consensus 239 ~~~n~g~---------~~~G~~v~~~~s~~G~i~l~~s~~~-----------~~d~~l~--~~~~~~p~ap~p~~vtat~ 296 (463) .++..+. .-.|+++ +.+.. +......++ ..+.-+. ++....-..+.-..+ T Consensus 211 ~~~~~~~~~~~~~~~~~i~g~pv--~~~~~--~~~~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~---- 282 (320) T protein:vir:10 211 FIESTYTDENSPFRAGRIVSRPT--ILSDH--VADGTTVGYMGDFRNVIWGQVGGLSFDVTDQATLNLGTPTEPNF---- 282 (320) T ss_pred eccccccCccccccCceeeeeee--EecCC--CCCCceEEEEeecceEEEEEecCeEEEEeecceeeecccccccc---- Confidence 3322111 1122221 11110 000000000 0111110 000000000000000 Q ss_pred eccCCCcCcccccccceEEEEEEEecCC------cccccc Q lcl|NC_019448. 297 ETKQKGAFEDEEDRAGLSYKVVVNSDDA------QSAPSE 330 (463) Q Consensus 297 ~~~~~g~~~~~~~~a~ysYkV~a~s~~g------eS~~S~ 330 (463) ..-+.-.-..-.+..++-+.+.+... --+|-. T Consensus 283 --~~~f~~~~~~~r~~~~~d~~v~~~~a~~~l~~~~ap~~ 320 (320) T protein:vir:10 283 --VSLWQHNLVAVRVEAEYAFHNNDKDAFVKLTNVVTPDA 320 (320) T ss_pred --chhhhcCcEEEEEEEeeccEEecccceEEEEeccCCCC Confidence 00000000000001111111111000 001100 No 39 >protein:vir:3306 Length: 567 # NCBI annotation: hypothetical protein # Family: family:all:1544 # MgeID: mge:66 # MgeName: 933W # Cross-refs: genbank:acc:NP_049522;genbank:gi:9632528;genbank:GeneID:1262016 Probab=97.28 E-value=1.4e-05 Score=47.24 Aligned_cols=288 Identities=17% Similarity=0.197 Sum_probs=120.5 Q ss_pred ccHHHHHHHHHHHHHHHHHHHHHhhcccccCCC-ccccccccccce-eeecCcceEeccCCCCC--HHHHh----hhhhh Q lcl|NC_019448. 135 ADPSQILTEDAIAVVAKTIEWASFYGDASLTSE-VEGEGLEFDGLA-KLIDKNNVINAKGNQLT--EKHLN----EAAVR 206 (463) Q Consensus 135 ~Dp~~~~~~~ai~~~~~~~E~a~fyGd~~l~~~-~~~~gleFDGl~-~lI~~~nviDarG~~ls--~~~ln----~aa~~ 206 (463) .-|.+|+ .|+-++|- +.++. -.||- ..| .+..-+|+-+- ..+|- ..|.. T Consensus 1 ~~~~~~~------------------~~~~~~~~~~~~~~--~~~~~M~~i---~i~~f~Ge~Prl~p~lLP~~~a~~A~n 57 (567) T protein:vir:33 1 MMPIAIL------------------ANSIINPLIFKPEA--VKGISMPYI---DITTMRGMMPRVVTSMLPEHSAVLAED 57 (567) T ss_pred Ccchhhh------------------hhhhccceeecccc--cccceeeEE---eecccccccccchhhhccccccceEEe Confidence 2222222 23333331 11111 01110 011 11122333221 22222 12233 Q ss_pred hhhcCCceeEEecCHHHHHHHH----HHhcCcceEEeecCCCCcccceecCeeeecccccccC--Cceec---------c Q lcl|NC_019448. 207 IGKGFGTATDAYMPIGVHADFV----NSILGRQMQLMQDNSGNVNTGYSVNGFYSSRGFIKLH--GSTVM---------E 271 (463) Q Consensus 207 i~~~~G~~td~~m~~~vka~f~----~~~~~~qrv~~~~n~g~~~~G~~v~~~~s~~G~i~l~--~s~~~---------~ 271 (463) .+-+.|..+=...|..+..-|. +.|+=+...-+.= ++ +|+ -++|.|+-+ ..+.+ . T Consensus 58 ~~~~~G~itP~~~~~~~~~~~~~~~~Tif~y~~~~W~~w-~~------~V~---~ir~PvAqD~~~rvY~tgdg~Pk~t~ 127 (567) T protein:vir:33 58 CHFRFGVITPERQISGVEKTFTIKPKTIFHYRDDFWFAW-PD------VVD---VIRSPIAQDPHGRIYYTDGRFPKVTD 127 (567) T ss_pred eeccCCeeeeeecccccccccccCceeeEEEcCcEEEEe-CC------cee---eccCccccCCcceEEEecCCcceeee Confidence 3344466555555544433221 1111110000000 00 011 112222211 01110 1 Q ss_pred Ccccc------ccc--cccCCCCCCCCeeEEEEeccCCCcCcccccccceEEEEEEEecCC-ccccccce-eeeecCCCC Q lcl|NC_019448. 272 NELIL------DES--LQPLPNAPQPAKVTATVETKQKGAFEDEEDRAGLSYKVVVNSDDA-QSAPSEEV-TATVSNVDD 341 (463) Q Consensus 272 ~d~~l------~~~--~~~~p~ap~p~~vtat~~~~~~g~~~~~~~~a~ysYkV~a~s~~g-eS~~S~~v-t~Tva~~~~ 341 (463) .++.+ .+. +..+|.+.+++.+ ++. .++.-.-.++.+..++.|+++.+.+.| ||+||.+- ..++...+. T Consensus 128 ~~iat~G~~~~P~~~y~LgVpaps~aP~~-a~~-~~~~~~~~~~~d~etr~Yv~TfVt~~GeES~PS~~S~~~~v~~pg~ 205 (567) T protein:vir:33 128 ATIATKGDGNHPTSSYRLGIPAPTTAPVC-TVQ-QGGDVSDDNPNDDETRFYTETFVSDYGEEGPPGPASLEVTLRTPGT 205 (567) T ss_pred eeeeecCCCCCCcchhhcccCCcccccee-eec-CCCCCCCCCCcccceeEEEEEEEcCCCCcCCCcccccceeeecCCc Confidence 11111 111 1122221122221 111 111111224456678899999998888 79887654 335544445 Q ss_pred ceEEEEEecCCCCCCcceEEEEeecCCC--ceEEEEEEeeeeeecCCceEEEEec------cCCCC------------CC Q lcl|NC_019448. 342 GVKLSISVNAMYQQQPQFVSIYRQGKET--GMYFLIKRVPVKDAQEDGTIVFVDK------NETLP------------ET 401 (463) Q Consensus 342 gv~ltIt~~a~~g~~~~~y~IYR~~~~~--g~~~li~rv~~s~~n~~gtttf~D~------N~~iP------------gt 401 (463) .|.|++.+.+..+..=....|||+..++ ++|+|+++.+ -++++|+|. ++.|| |- T Consensus 206 ~V~ls~~p~~~~~~~i~~~RIYRS~tg~~gtdy~lVael~------as~~sf~D~~~~~~lg~~Lps~~w~~PP~~m~GL 279 (567) T protein:vir:33 206 AVQLTLAPVPLQNASIKRRRIYRSASGGGEADFLLVAELD------ASVLSYTDKIPAKNLGPSLATWDYLPPPENMTGL 279 (567) T ss_pred eEEEeeccCCccccccceEEEEEecCCCCceeeEEEEeec------cceeeeeeccchhhcccccccccccCcCccccee Confidence 5777776555444434679999987653 6999999875 366799998 44444 11 Q ss_pred ccc-------eecC-------Cch--------HH-----HHh----------------------hhhcchhhcCCcccCC Q lcl|NC_019448. 402 ADV-------FVGE-------MSP--------QV-----VHL----------------------FELLPMMKLPLAQINA 432 (463) Q Consensus 402 ~~~-------fvGe-------~~p--------qv-----i~l----------------------~ellPm~k~pla~~na 432 (463) ..| |.|+ +.| ++ +.+ -.-+-++||+..+.=- T Consensus 280 ~~m~NGimAgF~GneV~FsEpylPyAWP~~Yr~t~~~dIVaiA~~gt~LVV~TkG~PYl~sG~sP~sms~~kL~~~qpCv 359 (567) T protein:vir:33 280 CLMANGIAAGFAGNEVMFSEAYLPYAWPEVNRHTTAEDIVAICPLGTSLVVATKGEPYLFSGVSPSTISGSKIPSMQACL 359 (567) T ss_pred eecccceEEeecCCEEEEecCCCCcccchhhccCCCCCeEEEeecccEEEEEEcCceEEEEcCChhhccccccccccccc Confidence 112 2221 111 11 110 1223455665555545 Q ss_pred cceeeeeeechhheecceeeEEEE-E--EeE--ecC Q lcl|NC_019448. 433 SITFAVLWYGALALRAPKKWARIK-N--VRY--IAV 463 (463) Q Consensus 433 ~~~~~V~~Yg~L~l~aPkk~~~ik-N--V~~--~~~ 463 (463) +-+-+|.+=|...--.|.-.+.|. | ++. .-+ T Consensus 360 S~rsiV~~~g~v~Yas~dGLv~i~a~G~a~vvT~~l 395 (567) T protein:vir:33 360 SRRSMVAMEGFVLYAGTNGLVSVDANGNVALATEQI 395 (567) T ss_pred cccceeEeccEEEeecCCcEEEEecCCchhhhhhhc Confidence 666777776666667777777774 2 221 112 No 40 >protein:vir:10145 Length: 567 # NCBI annotation: hypothetical protein # Family: family:all:1544 # MgeID: mge:180 # MgeName: Stx2 converting bacteriophage II # Cross-refs: genbank:acc:NP_859275;genbank:gi:32171031;genbank:GeneID:2653447 Probab=97.28 E-value=1.4e-05 Score=47.24 Aligned_cols=288 Identities=17% Similarity=0.197 Sum_probs=120.5 Q ss_pred ccHHHHHHHHHHHHHHHHHHHHHhhcccccCCC-ccccccccccce-eeecCcceEeccCCCCC--HHHHh----hhhhh Q lcl|NC_019448. 135 ADPSQILTEDAIAVVAKTIEWASFYGDASLTSE-VEGEGLEFDGLA-KLIDKNNVINAKGNQLT--EKHLN----EAAVR 206 (463) Q Consensus 135 ~Dp~~~~~~~ai~~~~~~~E~a~fyGd~~l~~~-~~~~gleFDGl~-~lI~~~nviDarG~~ls--~~~ln----~aa~~ 206 (463) .-|.+|+ .|+-++|- +.++. -.||- ..| .+..-+|+-+- ..+|- ..|.. T Consensus 1 ~~~~~~~------------------~~~~~~~~~~~~~~--~~~~~M~~i---~i~~f~Ge~Prl~p~lLP~~~a~~A~n 57 (567) T protein:vir:10 1 MMPIAIL------------------ANSIINPLIFKPEA--VKGISMPYI---DITTMRGMMPRVVTSMLPEHSAVLAED 57 (567) T ss_pred Ccchhhh------------------hhhhccceeecccc--cccceeeEE---eecccccccccchhhhccccccceEEe Confidence 2222222 23333331 11111 01110 011 11122333221 22222 12233 Q ss_pred hhhcCCceeEEecCHHHHHHHH----HHhcCcceEEeecCCCCcccceecCeeeecccccccC--Cceec---------c Q lcl|NC_019448. 207 IGKGFGTATDAYMPIGVHADFV----NSILGRQMQLMQDNSGNVNTGYSVNGFYSSRGFIKLH--GSTVM---------E 271 (463) Q Consensus 207 i~~~~G~~td~~m~~~vka~f~----~~~~~~qrv~~~~n~g~~~~G~~v~~~~s~~G~i~l~--~s~~~---------~ 271 (463) .+-+.|..+=...|..+..-|. +.|+=+...-+.= ++ +|+ -++|.|+-+ ..+.+ . T Consensus 58 ~~~~~G~itP~~~~~~~~~~~~~~~~Tif~y~~~~W~~w-~~------~V~---~ir~PvAqD~~~rvY~tgdg~Pk~t~ 127 (567) T protein:vir:10 58 CHFRFGVITPERQISGVEKTFTIKPKTIFHYRDDFWFAW-PD------VVD---VIRSPIAQDPHGRIYYTDGRFPKVTD 127 (567) T ss_pred eeccCCeeeeeecccccccccccCceeeEEEcCcEEEEe-CC------cee---eccCccccCCcceEEEecCCcceeee Confidence 3344466555555544433221 1111110000000 00 011 112222211 01110 1 Q ss_pred Ccccc------ccc--cccCCCCCCCCeeEEEEeccCCCcCcccccccceEEEEEEEecCC-ccccccce-eeeecCCCC Q lcl|NC_019448. 272 NELIL------DES--LQPLPNAPQPAKVTATVETKQKGAFEDEEDRAGLSYKVVVNSDDA-QSAPSEEV-TATVSNVDD 341 (463) Q Consensus 272 ~d~~l------~~~--~~~~p~ap~p~~vtat~~~~~~g~~~~~~~~a~ysYkV~a~s~~g-eS~~S~~v-t~Tva~~~~ 341 (463) .++.+ .+. +..+|.+.+++.+ ++. .++.-.-.++.+..++.|+++.+.+.| ||+||.+- ..++...+. T Consensus 128 ~~iat~G~~~~P~~~y~LgVpaps~aP~~-a~~-~~~~~~~~~~~d~etr~Yv~TfVt~~GeES~PS~~S~~~~v~~pg~ 205 (567) T protein:vir:10 128 ATIATKGDGNHPTSSYRLGIPAPTTAPVC-TVQ-QGGDVSDDNPNDDETRFYTETFVSDYGEEGPPGPASLEVTLRTPGT 205 (567) T ss_pred eeeeecCCCCCCcchhhcccCCcccccee-eec-CCCCCCCCCCcccceeEEEEEEEcCCCCcCCCcccccceeeecCCc Confidence 11111 111 1122221122221 111 111111224456678899999998888 79887654 335544445 Q ss_pred ceEEEEEecCCCCCCcceEEEEeecCCC--ceEEEEEEeeeeeecCCceEEEEec------cCCCC------------CC Q lcl|NC_019448. 342 GVKLSISVNAMYQQQPQFVSIYRQGKET--GMYFLIKRVPVKDAQEDGTIVFVDK------NETLP------------ET 401 (463) Q Consensus 342 gv~ltIt~~a~~g~~~~~y~IYR~~~~~--g~~~li~rv~~s~~n~~gtttf~D~------N~~iP------------gt 401 (463) .|.|++.+.+..+..=....|||+..++ ++|+|+++.+ -++++|+|. ++.|| |- T Consensus 206 ~V~ls~~p~~~~~~~i~~~RIYRS~tg~~gtdy~lVael~------as~~sf~D~~~~~~lg~~Lps~~w~~PP~~m~GL 279 (567) T protein:vir:10 206 AVQLTLAPVPLQNASIKRRRIYRSASGGGEADFLLVAELD------ASVLSYTDKIPAKNLGPSLATWDYLPPPENMTGL 279 (567) T ss_pred eEEEeeccCCccccccceEEEEEecCCCCceeeEEEEeec------cceeeeeeccchhhcccccccccccCcCccccee Confidence 5777776555444434679999987653 6999999875 366799998 44444 11 Q ss_pred ccc-------eecC-------Cch--------HH-----HHh----------------------hhhcchhhcCCcccCC Q lcl|NC_019448. 402 ADV-------FVGE-------MSP--------QV-----VHL----------------------FELLPMMKLPLAQINA 432 (463) Q Consensus 402 ~~~-------fvGe-------~~p--------qv-----i~l----------------------~ellPm~k~pla~~na 432 (463) ..| |.|+ +.| ++ +.+ -.-+-++||+..+.=- T Consensus 280 ~~m~NGimAgF~GneV~FsEpylPyAWP~~Yr~t~~~dIVaiA~~gt~LVV~TkG~PYl~sG~sP~sms~~kL~~~qpCv 359 (567) T protein:vir:10 280 CLMANGIAAGFAGNEVMFSEAYLPYAWPEVNRHTTAEDIVAICPLGTSLVVATKGEPYLFSGVSPSTISGSKIPSMQACL 359 (567) T ss_pred eecccceEEeecCCEEEEecCCCCcccchhhccCCCCCeEEEeecccEEEEEEcCceEEEEcCChhhccccccccccccc Confidence 112 2221 111 11 110 1223455665555545 Q ss_pred cceeeeeeechhheecceeeEEEE-E--EeE--ecC Q lcl|NC_019448. 433 SITFAVLWYGALALRAPKKWARIK-N--VRY--IAV 463 (463) Q Consensus 433 ~~~~~V~~Yg~L~l~aPkk~~~ik-N--V~~--~~~ 463 (463) +-+-+|.+=|...--.|.-.+.|. | ++. .-+ T Consensus 360 S~rsiV~~~g~v~Yas~dGLv~i~a~G~a~vvT~~l 395 (567) T protein:vir:10 360 SRRSMVAMEGFVLYAGTNGLVSVDANGNVALATEQI 395 (567) T ss_pred cccceeEeccEEEeecCCcEEEEecCCchhhhhhhc Confidence 666777776666667777777774 2 221 112 No 41 >protein:vir:9979 Length: 567 # NCBI annotation: hypothetical protein # Family: family:all:1544 # MgeID: mge:179 # MgeName: Stx1 converting bacteriophage # Cross-refs: genbank:acc:NP_859109;genbank:gi:32170864;genbank:GeneID:2653256 Probab=97.28 E-value=1.4e-05 Score=47.24 Aligned_cols=288 Identities=17% Similarity=0.197 Sum_probs=120.5 Q ss_pred ccHHHHHHHHHHHHHHHHHHHHHhhcccccCCC-ccccccccccce-eeecCcceEeccCCCCC--HHHHh----hhhhh Q lcl|NC_019448. 135 ADPSQILTEDAIAVVAKTIEWASFYGDASLTSE-VEGEGLEFDGLA-KLIDKNNVINAKGNQLT--EKHLN----EAAVR 206 (463) Q Consensus 135 ~Dp~~~~~~~ai~~~~~~~E~a~fyGd~~l~~~-~~~~gleFDGl~-~lI~~~nviDarG~~ls--~~~ln----~aa~~ 206 (463) .-|.+|+ .|+-++|- +.++. -.||- ..| .+..-+|+-+- ..+|- ..|.. T Consensus 1 ~~~~~~~------------------~~~~~~~~~~~~~~--~~~~~M~~i---~i~~f~Ge~Prl~p~lLP~~~a~~A~n 57 (567) T protein:vir:99 1 MMPIAIL------------------ANSIINPLIFKPEA--VKGISMPYI---DITTMRGMMPRVVTSMLPEHSAVLAED 57 (567) T ss_pred Ccchhhh------------------hhhhccceeecccc--cccceeeEE---eecccccccccchhhhccccccceEEe Confidence 2222222 23333331 11111 01110 011 11122333221 22222 12233 Q ss_pred hhhcCCceeEEecCHHHHHHHH----HHhcCcceEEeecCCCCcccceecCeeeecccccccC--Cceec---------c Q lcl|NC_019448. 207 IGKGFGTATDAYMPIGVHADFV----NSILGRQMQLMQDNSGNVNTGYSVNGFYSSRGFIKLH--GSTVM---------E 271 (463) Q Consensus 207 i~~~~G~~td~~m~~~vka~f~----~~~~~~qrv~~~~n~g~~~~G~~v~~~~s~~G~i~l~--~s~~~---------~ 271 (463) .+-+.|..+=...|..+..-|. +.|+=+...-+.= ++ +|+ -++|.|+-+ ..+.+ . T Consensus 58 ~~~~~G~itP~~~~~~~~~~~~~~~~Tif~y~~~~W~~w-~~------~V~---~ir~PvAqD~~~rvY~tgdg~Pk~t~ 127 (567) T protein:vir:99 58 CHFRFGVITPERQISGVEKTFTIKPKTIFHYRDDFWFAW-PD------VVD---VIRSPIAQDPHGRIYYTDGRFPKVTD 127 (567) T ss_pred eeccCCeeeeeecccccccccccCceeeEEEcCcEEEEe-CC------cee---eccCccccCCcceEEEecCCcceeee Confidence 3344466555555544433221 1111110000000 00 011 112222211 01110 1 Q ss_pred Ccccc------ccc--cccCCCCCCCCeeEEEEeccCCCcCcccccccceEEEEEEEecCC-ccccccce-eeeecCCCC Q lcl|NC_019448. 272 NELIL------DES--LQPLPNAPQPAKVTATVETKQKGAFEDEEDRAGLSYKVVVNSDDA-QSAPSEEV-TATVSNVDD 341 (463) Q Consensus 272 ~d~~l------~~~--~~~~p~ap~p~~vtat~~~~~~g~~~~~~~~a~ysYkV~a~s~~g-eS~~S~~v-t~Tva~~~~ 341 (463) .++.+ .+. +..+|.+.+++.+ ++. .++.-.-.++.+..++.|+++.+.+.| ||+||.+- ..++...+. T Consensus 128 ~~iat~G~~~~P~~~y~LgVpaps~aP~~-a~~-~~~~~~~~~~~d~etr~Yv~TfVt~~GeES~PS~~S~~~~v~~pg~ 205 (567) T protein:vir:99 128 ATIATKGDGNHPTSSYRLGIPAPTTAPVC-TVQ-QGGDVSDDNPNDDETRFYTETFVSDYGEEGPPGPASLEVTLRTPGT 205 (567) T ss_pred eeeeecCCCCCCcchhhcccCCcccccee-eec-CCCCCCCCCCcccceeEEEEEEEcCCCCcCCCcccccceeeecCCc Confidence 11111 111 1122221122221 111 111111224456678899999998888 79887654 335544445 Q ss_pred ceEEEEEecCCCCCCcceEEEEeecCCC--ceEEEEEEeeeeeecCCceEEEEec------cCCCC------------CC Q lcl|NC_019448. 342 GVKLSISVNAMYQQQPQFVSIYRQGKET--GMYFLIKRVPVKDAQEDGTIVFVDK------NETLP------------ET 401 (463) Q Consensus 342 gv~ltIt~~a~~g~~~~~y~IYR~~~~~--g~~~li~rv~~s~~n~~gtttf~D~------N~~iP------------gt 401 (463) .|.|++.+.+..+..=....|||+..++ ++|+|+++.+ -++++|+|. ++.|| |- T Consensus 206 ~V~ls~~p~~~~~~~i~~~RIYRS~tg~~gtdy~lVael~------as~~sf~D~~~~~~lg~~Lps~~w~~PP~~m~GL 279 (567) T protein:vir:99 206 AVQLTLAPVPLQNASIKRRRIYRSASGGGEADFLLVAELD------ASVLSYTDKIPAKNLGPSLATWDYLPPPENMTGL 279 (567) T ss_pred eEEEeeccCCccccccceEEEEEecCCCCceeeEEEEeec------cceeeeeeccchhhcccccccccccCcCccccee Confidence 5777776555444434679999987653 6999999875 366799998 44444 11 Q ss_pred ccc-------eecC-------Cch--------HH-----HHh----------------------hhhcchhhcCCcccCC Q lcl|NC_019448. 402 ADV-------FVGE-------MSP--------QV-----VHL----------------------FELLPMMKLPLAQINA 432 (463) Q Consensus 402 ~~~-------fvGe-------~~p--------qv-----i~l----------------------~ellPm~k~pla~~na 432 (463) ..| |.|+ +.| ++ +.+ -.-+-++||+..+.=- T Consensus 280 ~~m~NGimAgF~GneV~FsEpylPyAWP~~Yr~t~~~dIVaiA~~gt~LVV~TkG~PYl~sG~sP~sms~~kL~~~qpCv 359 (567) T protein:vir:99 280 CLMANGIAAGFAGNEVMFSEAYLPYAWPEVNRHTTAEDIVAICPLGTSLVVATKGEPYLFSGVSPSTISGSKIPSMQACL 359 (567) T ss_pred eecccceEEeecCCEEEEecCCCCcccchhhccCCCCCeEEEeecccEEEEEEcCceEEEEcCChhhccccccccccccc Confidence 112 2221 111 11 110 1223455665555545 Q ss_pred cceeeeeeechhheecceeeEEEE-E--EeE--ecC Q lcl|NC_019448. 433 SITFAVLWYGALALRAPKKWARIK-N--VRY--IAV 463 (463) Q Consensus 433 ~~~~~V~~Yg~L~l~aPkk~~~ik-N--V~~--~~~ 463 (463) +-+-+|.+=|...--.|.-.+.|. | ++. .-+ T Consensus 360 S~rsiV~~~g~v~Yas~dGLv~i~a~G~a~vvT~~l 395 (567) T protein:vir:99 360 SRRSMVAMEGFVLYAGTNGLVSVDANGNVALATEQI 395 (567) T ss_pred cccceeEeccEEEeecCCcEEEEecCCchhhhhhhc Confidence 666777776666667777777774 2 221 112 No 42 >protein:vir:2792 Length: 567 # NCBI annotation: hypothetical protein # Family: family:all:1544 # MgeID: mge:59 # MgeName: Stx2 converting bacteriophage I # Cross-refs: genbank:acc:NP_612909;genbank:gi:20065826;genbank:GeneID:935648 Probab=97.28 E-value=1.4e-05 Score=47.24 Aligned_cols=288 Identities=17% Similarity=0.197 Sum_probs=120.5 Q ss_pred ccHHHHHHHHHHHHHHHHHHHHHhhcccccCCC-ccccccccccce-eeecCcceEeccCCCCC--HHHHh----hhhhh Q lcl|NC_019448. 135 ADPSQILTEDAIAVVAKTIEWASFYGDASLTSE-VEGEGLEFDGLA-KLIDKNNVINAKGNQLT--EKHLN----EAAVR 206 (463) Q Consensus 135 ~Dp~~~~~~~ai~~~~~~~E~a~fyGd~~l~~~-~~~~gleFDGl~-~lI~~~nviDarG~~ls--~~~ln----~aa~~ 206 (463) .-|.+|+ .|+-++|- +.++. -.||- ..| .+..-+|+-+- ..+|- ..|.. T Consensus 1 ~~~~~~~------------------~~~~~~~~~~~~~~--~~~~~M~~i---~i~~f~Ge~Prl~p~lLP~~~a~~A~n 57 (567) T protein:vir:27 1 MMPIAIL------------------ANSIINPLIFKPEA--VKGISMPYI---DITTMRGMMPRVVTSMLPEHSAVLAED 57 (567) T ss_pred Ccchhhh------------------hhhhccceeecccc--cccceeeEE---eecccccccccchhhhccccccceEEe Confidence 2222222 23333331 11111 01110 011 11122333221 22222 12233 Q ss_pred hhhcCCceeEEecCHHHHHHHH----HHhcCcceEEeecCCCCcccceecCeeeecccccccC--Cceec---------c Q lcl|NC_019448. 207 IGKGFGTATDAYMPIGVHADFV----NSILGRQMQLMQDNSGNVNTGYSVNGFYSSRGFIKLH--GSTVM---------E 271 (463) Q Consensus 207 i~~~~G~~td~~m~~~vka~f~----~~~~~~qrv~~~~n~g~~~~G~~v~~~~s~~G~i~l~--~s~~~---------~ 271 (463) .+-+.|..+=...|..+..-|. +.|+=+...-+.= ++ +|+ -++|.|+-+ ..+.+ . T Consensus 58 ~~~~~G~itP~~~~~~~~~~~~~~~~Tif~y~~~~W~~w-~~------~V~---~ir~PvAqD~~~rvY~tgdg~Pk~t~ 127 (567) T protein:vir:27 58 CHFRFGVITPERQISGVEKTFTIKPKTIFHYRDDFWFAW-PD------VVD---VIRSPIAQDPHGRIYYTDGRFPKVTD 127 (567) T ss_pred eeccCCeeeeeecccccccccccCceeeEEEcCcEEEEe-CC------cee---eccCccccCCcceEEEecCCcceeee Confidence 3344466555555544433221 1111110000000 00 011 112222211 01110 1 Q ss_pred Ccccc------ccc--cccCCCCCCCCeeEEEEeccCCCcCcccccccceEEEEEEEecCC-ccccccce-eeeecCCCC Q lcl|NC_019448. 272 NELIL------DES--LQPLPNAPQPAKVTATVETKQKGAFEDEEDRAGLSYKVVVNSDDA-QSAPSEEV-TATVSNVDD 341 (463) Q Consensus 272 ~d~~l------~~~--~~~~p~ap~p~~vtat~~~~~~g~~~~~~~~a~ysYkV~a~s~~g-eS~~S~~v-t~Tva~~~~ 341 (463) .++.+ .+. +..+|.+.+++.+ ++. .++.-.-.++.+..++.|+++.+.+.| ||+||.+- ..++...+. T Consensus 128 ~~iat~G~~~~P~~~y~LgVpaps~aP~~-a~~-~~~~~~~~~~~d~etr~Yv~TfVt~~GeES~PS~~S~~~~v~~pg~ 205 (567) T protein:vir:27 128 ATIATKGDGNHPTSSYRLGIPAPTTAPVC-TVQ-QGGDVSDDNPNDDETRFYTETFVSDYGEEGPPGPASLEVTLRTPGT 205 (567) T ss_pred eeeeecCCCCCCcchhhcccCCcccccee-eec-CCCCCCCCCCcccceeEEEEEEEcCCCCcCCCcccccceeeecCCc Confidence 11111 111 1122221122221 111 111111224456678899999998888 79887654 335544445 Q ss_pred ceEEEEEecCCCCCCcceEEEEeecCCC--ceEEEEEEeeeeeecCCceEEEEec------cCCCC------------CC Q lcl|NC_019448. 342 GVKLSISVNAMYQQQPQFVSIYRQGKET--GMYFLIKRVPVKDAQEDGTIVFVDK------NETLP------------ET 401 (463) Q Consensus 342 gv~ltIt~~a~~g~~~~~y~IYR~~~~~--g~~~li~rv~~s~~n~~gtttf~D~------N~~iP------------gt 401 (463) .|.|++.+.+..+..=....|||+..++ ++|+|+++.+ -++++|+|. ++.|| |- T Consensus 206 ~V~ls~~p~~~~~~~i~~~RIYRS~tg~~gtdy~lVael~------as~~sf~D~~~~~~lg~~Lps~~w~~PP~~m~GL 279 (567) T protein:vir:27 206 AVQLTLAPVPLQNASIKRRRIYRSASGGGEADFLLVAELD------ASVLSYTDKIPAKNLGPSLATWDYLPPPENMTGL 279 (567) T ss_pred eEEEeeccCCccccccceEEEEEecCCCCceeeEEEEeec------cceeeeeeccchhhcccccccccccCcCccccee Confidence 5777776555444434679999987653 6999999875 366799998 44444 11 Q ss_pred ccc-------eecC-------Cch--------HH-----HHh----------------------hhhcchhhcCCcccCC Q lcl|NC_019448. 402 ADV-------FVGE-------MSP--------QV-----VHL----------------------FELLPMMKLPLAQINA 432 (463) Q Consensus 402 ~~~-------fvGe-------~~p--------qv-----i~l----------------------~ellPm~k~pla~~na 432 (463) ..| |.|+ +.| ++ +.+ -.-+-++||+..+.=- T Consensus 280 ~~m~NGimAgF~GneV~FsEpylPyAWP~~Yr~t~~~dIVaiA~~gt~LVV~TkG~PYl~sG~sP~sms~~kL~~~qpCv 359 (567) T protein:vir:27 280 CLMANGIAAGFAGNEVMFSEAYLPYAWPEVNRHTTAEDIVAICPLGTSLVVATKGEPYLFSGVSPSTISGSKIPSMQACL 359 (567) T ss_pred eecccceEEeecCCEEEEecCCCCcccchhhccCCCCCeEEEeecccEEEEEEcCceEEEEcCChhhccccccccccccc Confidence 112 2221 111 11 110 1223455665555545 Q ss_pred cceeeeeeechhheecceeeEEEE-E--EeE--ecC Q lcl|NC_019448. 433 SITFAVLWYGALALRAPKKWARIK-N--VRY--IAV 463 (463) Q Consensus 433 ~~~~~V~~Yg~L~l~aPkk~~~ik-N--V~~--~~~ 463 (463) +-+-+|.+=|...--.|.-.+.|. | ++. .-+ T Consensus 360 S~rsiV~~~g~v~Yas~dGLv~i~a~G~a~vvT~~l 395 (567) T protein:vir:27 360 SRRSMVAMEGFVLYAGTNGLVSVDANGNVALATEQI 395 (567) T ss_pred cccceeEeccEEEeecCCcEEEEecCCchhhhhhhc Confidence 666777776666667777777774 2 221 112 No 43 >protein:vir:827 Length: 567 # NCBI annotation: hypothetical protein # Family: family:all:1544 # MgeID: mge:16 # MgeName: VT2-Sa # Cross-refs: genbank:acc:NP_050560;genbank:gi:9633457;genbank:GeneID:1262210 Probab=97.25 E-value=2.5e-05 Score=45.83 Aligned_cols=298 Identities=17% Similarity=0.193 Sum_probs=118.8 Q ss_pred ccHHHHHHHHHHHHHHHHHHHHHhhcccccCCC-ccccccccccce-eeecCcceEeccCCCCC--HHHHh----hhhhh Q lcl|NC_019448. 135 ADPSQILTEDAIAVVAKTIEWASFYGDASLTSE-VEGEGLEFDGLA-KLIDKNNVINAKGNQLT--EKHLN----EAAVR 206 (463) Q Consensus 135 ~Dp~~~~~~~ai~~~~~~~E~a~fyGd~~l~~~-~~~~gleFDGl~-~lI~~~nviDarG~~ls--~~~ln----~aa~~ 206 (463) .-|.+|+ .|+-++|- +.++. -.||- ..| .+..-+|+-+- ..+|- ..|.. T Consensus 1 ~~~~~~~------------------~~~~~~~~~~~~~~--~~~~~M~~i---~i~~f~Ge~Prl~p~lLP~~~a~~A~n 57 (567) T protein:vir:82 1 MMPIAIL------------------ANSIINPLIFKPEA--VKGISMPYI---DITTMRGMMPRVVTSMLPEHSAVLAED 57 (567) T ss_pred Ccchhhh------------------hhhhccceeecccc--cccceeeEE---eecccccccccchhhhccccccceEEe Confidence 2222222 23333331 11111 00110 011 11122232221 22222 12233 Q ss_pred hhhcCCceeEEecCHHHHHHH----HHHhcCcceE-EeecCCCCcccceecC-----eeeecccccccCCcee-ccCccc Q lcl|NC_019448. 207 IGKGFGTATDAYMPIGVHADF----VNSILGRQMQ-LMQDNSGNVNTGYSVN-----GFYSSRGFIKLHGSTV-MENELI 275 (463) Q Consensus 207 i~~~~G~~td~~m~~~vka~f----~~~~~~~qrv-~~~~n~g~~~~G~~v~-----~~~s~~G~i~l~~s~~-~~~d~~ 275 (463) .+-+.|..+=...|..+..-| .+.|+=+... |.=++.=+.--|--++ -|++-.|..+.....+ .-.|-. T Consensus 58 ~~~~~G~itP~~~~~~~~~~~~~~~~Tif~y~~~~W~~w~~~V~~ir~PvAqD~~~rvY~tgdg~Pk~t~~~iat~G~~~ 137 (567) T protein:vir:82 58 CHFRFGVITPERQISGVEKTFTIKPKTIFHYRDDFWFAWPDVVDVIRSPIAQDPHGRIYYTDGRFPKVTDATIATKGDGN 137 (567) T ss_pred eeecCCeeeeeecccccccccccCceeeeeecCcEeEEeCCceeeccCccccCCcccEEEecCCcceeeeeeeeecCCCC Confidence 333445555555554433322 1111100000 0000000000000000 0111111111111100 000111 Q ss_pred cccc--cccCCCCCCCCeeEEEEeccCCCcCcccccccceEEEEEEEecCC-ccccccce-eeeecCCCCceEEEEEecC Q lcl|NC_019448. 276 LDES--LQPLPNAPQPAKVTATVETKQKGAFEDEEDRAGLSYKVVVNSDDA-QSAPSEEV-TATVSNVDDGVKLSISVNA 351 (463) Q Consensus 276 l~~~--~~~~p~ap~p~~vtat~~~~~~g~~~~~~~~a~ysYkV~a~s~~g-eS~~S~~v-t~Tva~~~~gv~ltIt~~a 351 (463) ..+. +..+|.+.+++.+ ++. .++.-.-.++.+..++.|+++.+.+.| ||+||.+- ..++...+..|.|++.+.+ T Consensus 138 ~P~~~y~LgVpaps~aP~~-a~~-~~~~~~~~~p~d~etr~Yv~TfVt~~GeES~PS~~S~~~~v~~pg~~V~ls~~p~~ 215 (567) T protein:vir:82 138 HPTSSYRLGIPAPTTAPVC-TVQ-QGGDVSDDNPNDDETRFYTETFVSDYGEEGPPGPASLEVTLRTPGTAVQLTLAPVP 215 (567) T ss_pred CCcchhhcccCCcccccee-eec-CCCCCCCCCCccccceEEEEEEEcCCCCcCCCcccccceeeecCCceEEEeeccCC Confidence 1111 1122221122221 111 111111223456667889999998888 79887654 3355444445777776555 Q ss_pred CCCCCcceEEEEeecCCC--ceEEEEEEeeeeeecCCceEEEEec------cCCCC------------CCccc------- Q lcl|NC_019448. 352 MYQQQPQFVSIYRQGKET--GMYFLIKRVPVKDAQEDGTIVFVDK------NETLP------------ETADV------- 404 (463) Q Consensus 352 ~~g~~~~~y~IYR~~~~~--g~~~li~rv~~s~~n~~gtttf~D~------N~~iP------------gt~~~------- 404 (463) ..+..=....|||+..++ ++|+|+++.+ -++++|+|. ++.|| |-..| T Consensus 216 ~~~~~i~~~RIYRS~tg~~gtdy~lVael~------as~~sf~D~~~~~~lg~~Lps~~w~~PP~~m~GL~~m~NGimAg 289 (567) T protein:vir:82 216 LQNASIKRRRIYRSASGGGEADFLLVAELD------ASVLSYTDKIPAKNLGPSLATWDYLPPPENMTGLCLMANGIAAG 289 (567) T ss_pred ccccccceEEEEEecCCCCceeeEEEEeec------cceeeeeeccchhhcccccccccccCcCcccceeeecccceEEe Confidence 444434679999987653 6999999875 366799998 44444 11112 Q ss_pred eecC-------Cch--------HH-----HHh----------------------hhhcchhhcCCcccCCcceeeeeeec Q lcl|NC_019448. 405 FVGE-------MSP--------QV-----VHL----------------------FELLPMMKLPLAQINASITFAVLWYG 442 (463) Q Consensus 405 fvGe-------~~p--------qv-----i~l----------------------~ellPm~k~pla~~na~~~~~V~~Yg 442 (463) |.|+ +.| ++ +.+ -.-+-++||+..+.=-+-+-+|.+=| T Consensus 290 F~GneV~FsEpylPyAWP~~Yr~t~~~dIVaiA~~gt~LVV~TkG~PYl~sG~sP~sms~~kL~~~qpCvS~rsiV~~~g 369 (567) T protein:vir:82 290 FAGNEVMFSEAYLPYAWPEVNRHTTAEDIVAICPLRTSLVVATKGEPYLFSGVSPSTISGSKIPSMQACLSRRSMVAMEG 369 (567) T ss_pred ecCCEEEEecCCCCcccchhhccCCCCCeEEEEecccEEEEEEcCceEEEEcCChhhccccccccccccccccceeeecc Confidence 2221 111 11 110 12222345554444446666777766 Q ss_pred hhheecceeeEEEE-E--EeE--ecC Q lcl|NC_019448. 443 ALALRAPKKWARIK-N--VRY--IAV 463 (463) Q Consensus 443 ~L~l~aPkk~~~ik-N--V~~--~~~ 463 (463) ...--.|.-.+.|. | ++. .-+ T Consensus 370 ~v~Yas~dGLv~i~a~G~a~vvT~~l 395 (567) T protein:vir:82 370 FVLYAGTNGLVSVDANGNVALATEQI 395 (567) T ss_pred eEEeecCCcEEEEecCCchhhhhhhc Confidence 66666777777774 2 221 112 No 44 >protein:vir:4226 Length: 326 # NCBI annotation: observed 35.2Kd protein # Family: family:all:507 # MgeID: mge:89 # MgeName: L5 # Cross-refs: genbank:acc:NP_039681;swissprot:sw:q05223;genbank:gi:9625447;uniprot:Q05223;genbank:GeneID:2942929 Probab=97.12 E-value=5.8e-05 Score=43.80 Aligned_cols=295 Identities=13% Similarity=0.070 Sum_probs=137.1 Q ss_pred CCCccchHHHHhhhhhhHHHHHHhhcCCccCCccccCccccchhhhhhHhhhhhccccccchhhhcccchhhHHHhhhhh Q lcl|NC_019448. 3 IEKNLSDVQQKYADQFQEDVVKSFQTGYGITPDTQIDAGALRREILDDQITMLTWTNEDLIFYRDISRRPAQSTVVKYDQ 82 (463) Q Consensus 3 ~~~~~~~~~~~~~k~~~e~~~Ks~~agy~~~p~~q~~gaalr~esLd~~i~~L~~~~~df~f~~~i~k~~~~stv~ey~~ 82 (463) |=.|..... +.+..+..|+++++. ..+|+-|..|..++-|..+.... .+.+...+.+..+.-.+|.+ T Consensus 1 ~~~~~~r~~----~~~~~~e~~a~~~~~------~~~g~~ip~~~~~~ii~~~~~~s---~i~~~~~~~~~~~~~~~~p~ 67 (326) T protein:vir:42 1 MAVNPDRTT----PFLGVNDPKVAQTGD------SMFEGYLEPEQAQDYFAEAEKIS---IVQQFAQKIPMGTTGQKIPH 67 (326) T ss_pred CCCCccchh----hhcCcchhhheeccc------cCCcceechhhHHHHHHHHHhcc---hhhhhcceeeccCCceEEEE Confidence 222332222 223345567776642 23456677777777665554443 34443444444433344555 Q ss_pred hhccCcccccccccccCcccccCcceEEEEEEEEEeechhhhhhhhhhhcccccHHHHHHHHHHHHHHHHHHHHHhhccc Q lcl|NC_019448. 83 YLRHGNVGHSRFVKEIGVAPVSDPNIRQKTVSMKYVSDTKNMSIASGLVNNIADPSQILTEDAIAVVAKTIEWASFYGDA 162 (463) Q Consensus 83 ~~~hG~~g~~~fv~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lvn~~~Dp~~~~~~~ai~~~~~~~E~a~fyGd~ 162 (463) .. +.+...|++|++..+.+++.+.+.....+=++..-.+|.-+ +.++..|.+....+.-...++..+|.++|+|+. T Consensus 68 ~~---~~~~a~~v~Eg~~~~~~~~~f~~i~~~~~k~~~~v~iS~el-l~~s~~~~~~~i~~~l~~a~~~~~d~a~l~G~g 143 (326) T protein:vir:42 68 WT---GDVSASWIGEGDMKPITKGNMTSQTIAPHKIATIFVASAET-VRANPANYLGTMRTKVATAFAMAFDNAAINGTD 143 (326) T ss_pred Ee---CCcceEEecCCccccccccceeEEEEeeEEEEEeehhhHHH-HhcCHHHHHHHHHHHHHHHHHHHHHHHhhcccC Confidence 44 23456799999999999999999999999999988888754 345667888999999999999999999999988 Q ss_pred ccCCCccccccccccceeeecCcceEe-----ccCCCCCHHH-HhhhhhhhhhcCCceeEEecCHHHHHHHHHHhcCcce Q lcl|NC_019448. 163 SLTSEVEGEGLEFDGLAKLIDKNNVIN-----AKGNQLTEKH-LNEAAVRIGKGFGTATDAYMPIGVHADFVNSILGRQM 236 (463) Q Consensus 163 ~l~~~~~~~gleFDGl~~lI~~~nviD-----arG~~ls~~~-ln~aa~~i~~~~G~~td~~m~~~vka~f~~~~~~~qr 236 (463) +=.| .|+.+......... +.+.....+. +..+.......+...+...|++.+.+.|...--..-| T Consensus 144 s~~p---------~gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~n~~~~~~L~~lkd~~G~ 214 (326) T protein:vir:42 144 SPFP---------TFLAQTTKEVSLVDPDGTGSNADLTVYDAVAVNALSLLVNAGKKWTHTLLDDITEPILNGAKDKSGR 214 (326) T ss_pred CCcc---------ccccccccccceeecccccccccchhHHHHHHHHHhhhhhhccCccEEEEeHHHHHHHHHhhccCCc Confidence 5222 23333222111111 1112222222 2233344445566677789999999999853222223 Q ss_pred EEeecCCCC---------cccceecCeeeecccccccCCcee-ccCc---cccccccccCCCCCCCCeeEE------EEe Q lcl|NC_019448. 237 QLMQDNSGN---------VNTGYSVNGFYSSRGFIKLHGSTV-MENE---LILDESLQPLPNAPQPAKVTA------TVE 297 (463) Q Consensus 237 v~~~~n~g~---------~~~G~~v~~~~s~~G~i~l~~s~~-~~~d---~~l~~~~~~~p~ap~p~~vta------t~~ 297 (463) .+.++.... .-.|+++- .+- .+.. +..+ +-.| .++ .. .....+.. +.. T Consensus 215 ~l~~~~~~~~~~~~~~~~~l~G~pv~--~~~--~~~~-~~~~~~~Gd~s~~~~-~~-------~~~~~v~~~~e~~~~~~ 281 (326) T protein:vir:42 215 PLFIESTYTEENSPFRLGRIVARPTI--LSD--HVAS-GTVVGYQGDFRQLVW-GQ-------VGGLSFDVTDQATLNLG 281 (326) T ss_pred eeeccccccCccccccCceeeeeeEE--EcC--CCCC-CceEEEEeecceEEE-EE-------ecceEEEEeecceeeec Confidence 333332211 12233221 110 0000 1110 0000 000 00 00000000 000 Q ss_pred ccCCCcCc--ccccccc----eEEEEEEEecCCccccccceeeeecCCC Q lcl|NC_019448. 298 TKQKGAFE--DEEDRAG----LSYKVVVNSDDAQSAPSEEVTATVSNVD 340 (463) Q Consensus 298 ~~~~g~~~--~~~~~a~----ysYkV~a~s~~geS~~S~~vt~Tva~~~ 340 (463) ....+..- ...+... .++-..+.+. ++... ....+.+ .. T Consensus 282 ~~~~~~~~~~~~~d~~~~r~~~~~d~~v~~~--~a~~~-l~~~~~~-~~ 326 (326) T protein:vir:42 282 TPQAPNFVSLWQHNLVAVRVEAEYAFHCNDK--DAFVK-LTNVDAT-EA 326 (326) T ss_pred ccccccchhhhhcCcEEEEEEEEeccEEecc--cceEE-Eeecccc-CC Confidence 00000000 0001111 1111111111 11110 0000000 00 No 45 >protein:vir:1328 Length: 392 # NCBI annotation: gp36 # Family: family:all:21 # MgeID: mge:28 # MgeName: phi-C31 # Cross-refs: genbank:acc:NP_047927;swissprot:trembl:q9zwv6;genbank:gi:9631145;uniprot:Q9ZWV6;genbank:GeneID:2715889 Probab=97.12 E-value=6.3e-05 Score=43.62 Aligned_cols=302 Identities=12% Similarity=0.032 Sum_probs=133.5 Q ss_pred CCCCCccch-----HHHHhhhhhhHHHHHHhhcCCc-cCCccccCccccchhhhhhHhhhhhccccccchhhhcccchhh Q lcl|NC_019448. 1 MTIEKNLSD-----VQQKYADQFQEDVVKSFQTGYG-ITPDTQIDAGALRREILDDQITMLTWTNEDLIFYRDISRRPAQ 74 (463) Q Consensus 1 ~~~~~~~~~-----~~~~~~k~~~e~~~Ks~~agy~-~~p~~q~~gaalr~esLd~~i~~L~~~~~df~f~~~i~k~~~~ 74 (463) ...+..... ..+++.+.....-.++++..-. ....+..+|+.+..+..++.|..+.. .+..++.+...--. T Consensus 74 ~~~~~~~~~~~~~~~~~~~~r~g~~~~~~~~~~~~~~~~~t~~~~g~~~~~~~~~~~i~~~~~---~~~~l~~~~~~~~~ 150 (392) T protein:vir:13 74 LQGSGSGAQRSADHDDDAVLRAGNLGEARSFEFAPEKRDGTKAGNPNVLSRTLYGQLIAQAVE---RSAIMRGGASTFTT 150 (392) T ss_pred cCCcccchhhhhhHHHHHHHhccchhhhHHHHhhhhhhcccccCCCccccccchHHHHHHHHh---hhhhhhhcceeeec Confidence 111111110 0111111111111122221111 11112234555666655555544332 23333333222111 Q ss_pred HHHhhhhhhhccCcccccccccccCcccccCcceEEEEEEEEEeechhhhhhhhhhhcccccHHHHHHHHHHHHHHHHHH Q lcl|NC_019448. 75 STVVKYDQYLRHGNVGHSRFVKEIGVAPVSDPNIRQKTVSMKYVSDTKNMSIASGLVNNIADPSQILTEDAIAVVAKTIE 154 (463) Q Consensus 75 stv~ey~~~~~hG~~g~~~fv~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lvn~~~Dp~~~~~~~ai~~~~~~~E 154 (463) +.-..|.....-| .....+++|++..+.+++.+.+....++=++.--.+|.-+ +.++..|.+....+.-...+++.++ T Consensus 151 ~~~~~~~~~~~~~-~~~a~~v~E~~~~~~~~~~f~~v~~~~~k~~~~~~iS~el-l~ds~~~l~~~i~~~l~~~i~~~~d 228 (392) T protein:vir:13 151 SDANPMDFTVITG-RATAGIVGETAEIPESYPATTQRSMGGFKYGFASVVSYEF-ATDQVLDLVGFLVSDAGPAIGDAMG 228 (392) T ss_pred CCCceeEEEEEcC-CcceeeecccccccccccceeeEEeeeeeEEeeehhHHHH-HhcchHHHHHHHHHHHHHHHHHHHH Confidence 2222232222222 2356689999999999999999999998888777777653 3345557788888888999999999 Q ss_pred HHHhhcccccCCCccccccccccceeeecC--cceEeccCCCCCHHHHhhhhhhhhhcCCceeEEecCHHHHHHHHHHhc Q lcl|NC_019448. 155 WASFYGDASLTSEVEGEGLEFDGLAKLIDK--NNVINAKGNQLTEKHLNEAAVRIGKGFGTATDAYMPIGVHADFVNSIL 232 (463) Q Consensus 155 ~a~fyGd~~l~~~~~~~gleFDGl~~lI~~--~nviDarG~~ls~~~ln~aa~~i~~~~G~~td~~m~~~vka~f~~~~~ 232 (463) .++|+||-.- +-.|+.+.... ..+..+....++-+.|.++-..+...|....-..|++.+.+.|...-- T Consensus 229 ~~~l~G~Gt~---------~p~Gil~~~~~~~~~~~~~~~~~~~~d~l~~~~~~l~~~~~~~a~~v~n~~~~~~l~~lkd 299 (392) T protein:vir:13 229 RHFLTGTGTG---------QPRGILTDATGANAAFGEADADSKVSDALIDLFHEVPSAYRKNAKFVVNDLRAAQMRKLKD 299 (392) T ss_pred HHHhcccCCc---------cccccccccccccccccccccccccHHHHHHHHHhhhhhhhcCCEEEEcHHHHHHHHHhhc Confidence 9999998541 23455554321 122233345566555555554555667555568999999998875333 Q ss_pred CcceEEeecCCCC----cccceecCeeeecccccccCCceeccC-ccccccccccCCCCCCCCeeEEEEeccCCCcCccc Q lcl|NC_019448. 233 GRQMQLMQDNSGN----VNTGYSVNGFYSSRGFIKLHGSTVMEN-ELILDESLQPLPNAPQPAKVTATVETKQKGAFEDE 307 (463) Q Consensus 233 ~~qrv~~~~n~g~----~~~G~~v~~~~s~~G~i~l~~s~~~~~-d~~l~~~~~~~p~ap~p~~vtat~~~~~~g~~~~~ 307 (463) ..-|.+.+++... .-.|++|- .+.. +. .+.+++.+ ...+... -..++......... . T Consensus 300 ~~G~~l~~~~~~~g~~~~l~G~Pv~--~~~~--~~-~~~i~~Gdf~~~~i~~---------~~~~~i~~~~~~~~----~ 361 (392) T protein:vir:13 300 ANGQYLWQSALTVGAPDTFNGKVVE--TDDG--MP-ADKVLFADLSKYRVRF---------AGSLRVDRSVDAKF----S 361 (392) T ss_pred cCCceeecCCcCCCCCceecceeeE--EcCC--CC-CCcEEEeeccceeEEe---------ecceEEEeeccccc----c Confidence 3344444443221 23555531 1111 00 11122110 0000000 00111111111110 0 Q ss_pred ccccceEEEEE----EEecCCccccccceeeee Q lcl|NC_019448. 308 EDRAGLSYKVV----VNSDDAQSAPSEEVTATV 336 (463) Q Consensus 308 ~~~a~ysYkV~----a~s~~geS~~S~~vt~Tv 336 (463) ++. ..|++. ..-.+-++.....+++.. T Consensus 362 ~~~--~~~r~~~r~d~~~~~~~A~~~~~~~~aa 392 (392) T protein:vir:13 362 TDQ--IVYRFLQRADGLLVDARGAKVLTVTPAA 392 (392) T ss_pred CCc--EEEEEEEEeccEEecccceEEEEeeccC Confidence 000 111111 001111111111111111 No 46 >protein:vir:4856 Length: 293 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:106 # MgeName: DT1 # Cross-refs: genbank:acc:NP_049396;genbank:gi:9632424;genbank:GeneID:1258532 Probab=97.06 E-value=0.00016 Score=41.44 Aligned_cols=272 Identities=11% Similarity=0.047 Sum_probs=127.6 Q ss_pred HHHHhhcCCccCCccccCccc-cchhhhhhHhhhhhccccccchhhhcccchhhHHHhhhhhhhccC-cccccccccccC Q lcl|NC_019448. 22 VVKSFQTGYGITPDTQIDAGA-LRREILDDQITMLTWTNEDLIFYRDISRRPAQSTVVKYDQYLRHG-NVGHSRFVKEIG 99 (463) Q Consensus 22 ~~Ks~~agy~~~p~~q~~gaa-lr~esLd~~i~~L~~~~~df~f~~~i~k~~~~stv~ey~~~~~hG-~~g~~~fv~E~g 99 (463) ++++++++. ..+|++ +..|...+-+..+.... .+.+.....+..+..-++. +..+. +.+...+++|++ T Consensus 1 ~l~~~~~~t------~~~gg~liP~~~~~~Ii~~~~~~~---~l~~~~~~~~~~~~~g~~~-~~~~~~~~~~a~~v~Eg~ 70 (293) T protein:vir:48 1 MLDSKTDHS------GSDAGLTIPQDIRTAINTLVRQYD---SLQEYVNVENVTTLTGSRV-YEKWTDITGLANIDDEAG 70 (293) T ss_pred Cceeecccc------cCcCceEechhHHHHHHHHHHhhh---hhhhhceeeeccCCcceEE-EEeecCCCcceeeecCCc Confidence 667777754 234454 45555544443333222 2333333333333222222 22232 234567999998 Q ss_pred ccc-ccCcceEEEEEEEEEeechhhhhhhhhhhcccccHHHHHHHHHHHHHHHHHHHHHhhcccccCCCccccccccccc Q lcl|NC_019448. 100 VAP-VSDPNIRQKTVSMKYVSDTKNMSIASGLVNNIADPSQILTEDAIAVVAKTIEWASFYGDASLTSEVEGEGLEFDGL 178 (463) Q Consensus 100 ~~~-~~d~~~~r~~~~~k~l~~~~~vs~~~~lvn~~~Dp~~~~~~~ai~~~~~~~E~a~fyGd~~l~~~~~~~gleFDGl 178 (463) ..+ .+++.+.+....+|-++.-..+|.-+- .++.-|.+....+.....++..++.+++.|..+.... T Consensus 71 ~~~~~~~~~~~~i~l~~~k~~~~~~iS~ell-~ds~~~l~~~i~~~la~~~~~~~~~~i~~g~~~~~~~----------- 138 (293) T protein:vir:48 71 KIADIDDPKLSLIKYTIKRYAGISTVTNSLL-ADSAENILAWLSGWIAKKVVVTRNKAILGVVDKLPTK----------- 138 (293) T ss_pred ccccccccceeEEEEeeeEEEEeehhhHHHH-hhhhHHHHHHHHHHHHHHHHHHHHhHHhhcccccccc----------- Confidence 865 678999999999999998877775432 3444577888888888889999999999887764321 Q ss_pred eeeecCcceEeccCCCCCHHHHhhhhhhhhhcCCceeEEecCHHHHHHHHHHhcCcceEEeecCCCCc----ccceecCe Q lcl|NC_019448. 179 AKLIDKNNVINAKGNQLTEKHLNEAAVRIGKGFGTATDAYMPIGVHADFVNSILGRQMQLMQDNSGNV----NTGYSVNG 254 (463) Q Consensus 179 ~~lI~~~nviDarG~~ls~~~ln~aa~~i~~~~G~~td~~m~~~vka~f~~~~~~~qrv~~~~n~g~~----~~G~~v~~ 254 (463) +...+-+.|.++-..+..+|......+||+.+.+.+...--..-|.+.+++..+. -.|++|-- T Consensus 139 -------------~~~~~~d~i~~~~~~l~~~~~~~a~~vmn~~~~~~L~~lkd~~g~~l~~~~~~~~~~~~l~G~Pv~~ 205 (293) T protein:vir:48 139 -------------PTLTKWDDIIDLEAKVDPAIKQTSFFLTNTSGFTALKKVKNALGDYLMERDVKSPTGYSIAGFAVKE 205 (293) T ss_pred -------------ccccCHHHHHHHHHhhhhhhcCCCEEEEcHHHHHHHHHhhccCCceEeecCcCCCCCceecceeeEE Confidence 1233455566666667777776677899999999988543233345555443322 34544310 Q ss_pred eee-cccccccCC-ceecc--Cc-ccc-ccccccCCCCCCCCeeEEEEeccCCCcCcccccccceEEEEEEE----ecCC Q lcl|NC_019448. 255 FYS-SRGFIKLHG-STVME--NE-LIL-DESLQPLPNAPQPAKVTATVETKQKGAFEDEEDRAGLSYKVVVN----SDDA 324 (463) Q Consensus 255 ~~s-~~G~i~l~~-s~~~~--~d-~~l-~~~~~~~p~ap~p~~vtat~~~~~~g~~~~~~~~a~ysYkV~a~----s~~g 324 (463) .-+ .-+.....- ..++. .+ .++ ++. .++....... +..+. .+ ...|++..- -.+. T Consensus 206 ~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~-----------~~~i~~~~~~-~~~~~-~~--~~~~r~~~r~d~~~~~~ 270 (293) T protein:vir:48 206 ISDRWLPNASSGVMPLYFGDLKQAVTLFDRQ-----------QMSLLSTNIG-GGAFE-TD--TTKVRVIDRFDVVATDT 270 (293) T ss_pred ecccccCCccCCceEEEEEeccceEEEEEec-----------ceEEEEeccc-chhhh-cC--eEEEEEEEeeCcEEecc Confidence 000 000000000 01111 00 010 000 0111111110 11110 00 011111100 0111 Q ss_pred ccccccceeee----ecCCCCce Q lcl|NC_019448. 325 QSAPSEEVTAT----VSNVDDGV 343 (463) Q Consensus 325 eS~~S~~vt~T----va~~~~gv 343 (463) ++...-..+++ .+....+| T Consensus 271 ~a~~~l~~~~~~~~~~~~~~~~~ 293 (293) T protein:vir:48 271 EAFVPASFKAIADQKGNIGSTAV 293 (293) T ss_pred cceEEEEeeccccCCccccccCC Confidence 11110000000 00011112 No 47 >protein:vir:41 Length: 299 # NCBI annotation: major capsid protein # Family: family:all:507 # MgeID: mge:2 # MgeName: A118 # Cross-refs: genbank:acc:NP_463467;swissprot:trembl:q9t1b7;genbank:gi:16798789;uniprot:Q9T1B7;genbank:GeneID:922353 Probab=97.05 E-value=9.1e-05 Score=42.74 Aligned_cols=269 Identities=13% Similarity=0.079 Sum_probs=140.3 Q ss_pred CccCCccc----cCccccchhhhhhHhhhhhccccccchhhhcccchhhHHHhhhhhhhccCcccccccccccCcccccC Q lcl|NC_019448. 30 YGITPDTQ----IDAGALRREILDDQITMLTWTNEDLIFYRDISRRPAQSTVVKYDQYLRHGNVGHSRFVKEIGVAPVSD 105 (463) Q Consensus 30 y~~~p~~q----~~gaalr~esLd~~i~~L~~~~~df~f~~~i~k~~~~stv~ey~~~~~hG~~g~~~fv~E~g~~~~~d 105 (463) -|.++++- .+|+.+..+..++-+..|.. ...+.+.....+..+...++.+.. + ....|++|++..+..+ T Consensus 1 ~g~~a~~~~~~~~~~~~iP~~~~~~ii~~~~~---~s~l~~~~~~~~~~~~~~~~~~~~---~-~~a~~v~E~~~~~~~~ 73 (299) T protein:vir:41 1 MGFNPDTTTMQSAKTGSIPINISEQIITGVKN---GSAAMKLAKAVPMTKPEEEFTFMS---G-VGAFWVDEAERIQTSK 73 (299) T ss_pred CCcCCCcccccCCCceecchhHHHHHHHHHHh---cchhhhhceeeecCCCcEEEEEEc---C-CceeeeecCccccccc Confidence 44444433 22344544444443333332 223555555556666666665443 2 2467999999999999 Q ss_pred cceEEEEEEEEEeechhhhhhhhhhhcccccHHHHHHHHHHHHHHHHHHHHHhhcccccCCCcccccccccccee-eecC Q lcl|NC_019448. 106 PNIRQKTVSMKYVSDTKNMSIASGLVNNIADPSQILTEDAIAVVAKTIEWASFYGDASLTSEVEGEGLEFDGLAK-LIDK 184 (463) Q Consensus 106 ~~~~r~~~~~k~l~~~~~vs~~~~lvn~~~Dp~~~~~~~ai~~~~~~~E~a~fyGd~~l~~~~~~~gleFDGl~~-lI~~ 184 (463) +.+.......|-++.--.+|.-+- .++..|.+....+.-...+++.+|.++++|+.+-.+ .|+.+ .... T Consensus 74 ~~f~~v~l~~~k~~~~~~is~ell-~ds~~~~~~~i~~~l~~a~~~~~d~a~l~G~g~~~~---------~gil~~~~~~ 143 (299) T protein:vir:41 74 PTFTKAKMRSKKMGVIIPTTKENL-NYSVTNFFSLMQAEIVEAFYKKFDQAVFTGVESPYN---------WNILKSATDA 143 (299) T ss_pred cceeEEEEeeEEEEEeehhhHHHH-hcCHHHHHHHHHHHHHHHHHHHHHHHHhhcccCccc---------cccccccccc Confidence 999999999999999888887432 345567888899999999999999999999965322 24444 3334 Q ss_pred cceEeccCCCCCHHHHhhhhhhhhhcCCceeEEecCHHHHHHHHHHhcCcceEEeecCCC---CcccceecCeeeecccc Q lcl|NC_019448. 185 NNVINAKGNQLTEKHLNEAAVRIGKGFGTATDAYMPIGVHADFVNSILGRQMQLMQDNSG---NVNTGYSVNGFYSSRGF 261 (463) Q Consensus 185 ~nviDarG~~ls~~~ln~aa~~i~~~~G~~td~~m~~~vka~f~~~~~~~qrv~~~~n~g---~~~~G~~v~~~~s~~G~ 261 (463) .+.... .-.+.+.|.++...+..++...+-++|++.+.+.+...--...|.+.++... ..-.|++| +.+..-. T Consensus 144 ~~~~~~--~~~~~~~l~~~~~~l~~~~~~~~~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~~~~~l~G~PV--~~~~~~~ 219 (299) T protein:vir:41 144 SNLVEE--TANKYDDLNEAIGLIEAEDLEPNGIATIRKQRVKYRSTKDGNGMPIFNTATSNGVDDVLGLPI--AYTPKYT 219 (299) T ss_pred ceeecc--ccccHHHHHHHHHhhhcccCCcCEEEEcHHHHHHHHHhhccCCceeecCCcCCCCceecceee--EEecccC Confidence 444333 3456677777777888888888889999999999885333333445544322 12345543 1111100 Q ss_pred cc----------cCCcee-ccCcccccccccc------CC-----CCCCCCeeEEEEeccCCCcCcccccccceEEEEEE Q lcl|NC_019448. 262 IK----------LHGSTV-MENELILDESLQP------LP-----NAPQPAKVTATVETKQKGAFEDEEDRAGLSYKVVV 319 (463) Q Consensus 262 i~----------l~~s~~-~~~d~~l~~~~~~------~p-----~ap~p~~vtat~~~~~~g~~~~~~~~a~ysYkV~a 319 (463) .. +..-.+ ...+.-+...+.. -+ +...--.+..-+..--.+...+++ +--.-+..+ T Consensus 220 ~~~~~~~~~~gdfs~~~i~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~~--A~~~l~~~a 297 (299) T protein:vir:41 220 FGDKDISELVGDWNQAYYGILRGVEYEILTEATLTTVADETGKPLNLAERDMAAIKATFEVGFMVVKDE--AFSAVQPKA 297 (299) T ss_pred CCCCceEEEEEecccEEEEEecCcEEEEeecccccccccccccchhhhhcCcEEEEEEEEeccEEeccc--ceEEEEecc Confidence 00 000000 0001000000000 00 001111111111111111111111 111111112 Q ss_pred Ee Q lcl|NC_019448. 320 NS 321 (463) Q Consensus 320 ~s 321 (463) .| T Consensus 298 a~ 299 (299) T protein:vir:41 298 GN 299 (299) T ss_pred CC Confidence 22 No 48 >protein:vir:103759 Length: 330 # NCBI annotation: hypothetical protein # Family: family:all:1903 # MgeID: mge:1645 # MgeName: BcepC6B # Cross-refs: genbank:acc:YP_024928;genbank:gi:48697198;genbank:GeneID:2846083 Probab=97.02 E-value=4.9e-06 Score=49.68 Aligned_cols=275 Identities=19% Similarity=0.204 Sum_probs=126.3 Q ss_pred CCCCCccchHHHHhhhhhhHHHHHHhhcCCccCCccccCccccchhhhhhHhhhhhccccccchhhhcccchhhH-HHhh Q lcl|NC_019448. 1 MTIEKNLSDVQQKYADQFQEDVVKSFQTGYGITPDTQIDAGALRREILDDQITMLTWTNEDLIFYRDISRRPAQS-TVVK 79 (463) Q Consensus 1 ~~~~~~~~~~~~~~~k~~~e~~~Ks~~agy~~~p~~q~~gaalr~esLd~~i~~L~~~~~df~f~~~i~k~~~~s-tv~e 79 (463) ||.=+..-.+-..++|.+..+ .. +..-|..|+..+. ++.++|=...++ |=|. T Consensus 1 m~~~~~~a~TL~e~AKr~~~d--------------~~----------~~~IIE~l~~tn~---IL~~lpf~e~N~~tg~~ 53 (330) T protein:vir:10 1 MATLSTNNPTMADVAKRLDPN--------------GK----------VDIIVEMLNQTNP---VLQDMTAIEGNLPTGHR 53 (330) T ss_pred CCcCCCCcccHHHHHhhcCcc--------------hh----------HHHHHHHHhcCch---HHhhcchhhccCCcccc Confidence 553333233333333322110 00 0111111222221 122222222222 1133 Q ss_pred hhhhhccCcccccccccccCcccccCcceEEEEEEEEEeechhhhhhhh-hhhcccccHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_019448. 80 YDQYLRHGNVGHSRFVKEIGVAPVSDPNIRQKTVSMKYVSDTKNMSIAS-GLVNNIADPSQILTEDAIAVVAKTIEWASF 158 (463) Q Consensus 80 y~~~~~hG~~g~~~fv~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~-~lvn~~~Dp~~~~~~~ai~~~~~~~E~a~f 158 (463) +..++.-. ...|-.=....+-+.++..|++..++.|..-..|-+.. ++..+..+-+++|.+.-|..+.++++..+| T Consensus 54 t~vrt~LP---~~~fR~lN~g~~~s~~tt~qvt~~l~ilgg~~eVDr~la~~~Gn~a~~ra~e~~~~ikam~q~~~~~~i 130 (330) T protein:vir:10 54 TSVRTGLP---TPTWRKLYGGVLPNKSSTAQVTDNCGMLEAYAEVDKALADLNGNTAAFRLSEDRAQIEGMNQEVAQTLF 130 (330) T ss_pred eeEEeecC---CchhhhcCCccccccceEEEEEEEeEEecchhhhhhHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHhc Confidence 33333222 23332222334456699999999999999999996654 445567788999999999999999999999 Q ss_pred hcccccCCCccccccccccceeeec------CcceEeccCCCCCHHHHhhhhhhhhhcCCceeEEecCHHHHHHHHHHhc Q lcl|NC_019448. 159 YGDASLTSEVEGEGLEFDGLAKLID------KNNVINAKGNQLTEKHLNEAAVRIGKGFGTATDAYMPIGVHADFVNSIL 232 (463) Q Consensus 159 yGd~~l~~~~~~~gleFDGl~~lI~------~~nviDarG~~ls~~~ln~aa~~i~~~~G~~td~~m~~~vka~f~~~~~ 232 (463) |||++.+|. +||||.+... +.|+||+.|.--..-+|+- +..+. ..+.=+| |-+-|+-|+-.=+ T Consensus 131 yGD~a~~p~------~F~GL~kR~~~~ta~~~~qvIdaGGtG~~~TSi~~--v~wg~--~~~~giy-PkG~kaGl~~~d~ 199 (330) T protein:vir:10 131 YGNDGIAPA------EFTGLSPRYNSLSAENKDNVIDAGGTGSDNASAWL--VVWGP--NTCHSIY-PKGSKAGLSVEDK 199 (330) T ss_pred cCCCCCChh------hccchhhhcCCCCCCchhheeeccccccCceEEEE--EEEcC--CeEEEEc-ccCccccceeeec Confidence 999999884 8999999774 4699999887665444331 11111 2233344 8899988876667 Q ss_pred CcceEEeecCCCCcccceecCeeeeccccc-----------ccCCceecc----Cccc--cccccccCC----------- Q lcl|NC_019448. 233 GRQMQLMQDNSGNVNTGYSVNGFYSSRGFI-----------KLHGSTVME----NELI--LDESLQPLP----------- 284 (463) Q Consensus 233 ~~qrv~~~~n~g~~~~G~~v~~~~s~~G~i-----------~l~~s~~~~----~d~~--l~~~~~~~p----------- 284 (463) +.+++.-....|+.-.|+ .++|..--|.- +++-+-.-. .|+| |.......| T Consensus 200 g~~~~~~~dg~gg~y~~~-~~~~~w~~Gl~i~d~r~vvRI~NIdvs~l~~~~~~~~li~lm~~A~~~ip~~~~g~~~~y~ 278 (330) T protein:vir:10 200 GQVTIENADGNGGRMEGY-RTHYKWDIGLTLRDWRYVARVCNIDVSDLATSANAQALIKYMIMAAERIPQLGMGRAVWYM 278 (330) T ss_pred cceeeecccCCCCceeEE-eeeeeeeeeeEEeCcccEEEEeecccccCCCCccHHHHHHHHHHHHHhccCCCCCcceeee Confidence 777654333333222222 12211111110 111010000 0000 111101111 Q ss_pred -------------CCCCCCeeEEE-EeccCCCcCcccccc-----cceEEEEE Q lcl|NC_019448. 285 -------------NAPQPAKVTAT-VETKQKGAFEDEEDR-----AGLSYKVV 318 (463) Q Consensus 285 -------------~ap~p~~vtat-~~~~~~g~~~~~~~~-----a~ysYkV~ 318 (463) +. .-..++-+ +...-.-.|....-. -.-.=+|| T Consensus 279 n~~v~~~L~~q~~~k-~n~~l~~~~~~g~~~t~~~gipir~~Dail~tE~~vv 330 (330) T protein:vir:10 279 NRNLREKLRLGIVDK-IANNLTWETVSGERVMTFDGIPVQRTDALLNTESRVV 330 (330) T ss_pred chHHHHHHHHHHhhc-ccceeeeeecCCeeeEEECCeEEEEEeeeecCccccC Confidence 00 00111110 000000111111000 00001122 No 49 >protein:vir:78223 Length: 333 # NCBI annotation: Putative major head protein # Family: family:all:966 # MgeID: mge:1849 # MgeName: Bethlehem # Cross-refs: genbank:acc:YP_001491666;genbank:gi:157786490;genbank:GeneID:5625701 Probab=96.98 E-value=0.00012 Score=42.08 Aligned_cols=295 Identities=12% Similarity=0.032 Sum_probs=138.0 Q ss_pred hhhhhHHHHHHhhcCCccCCcccc-CccccchhhhhhHhhhhhccccccchhhhcccchhhHHHhhhhhhhcc-----Cc Q lcl|NC_019448. 15 ADQFQEDVVKSFQTGYGITPDTQI-DAGALRREILDDQITMLTWTNEDLIFYRDISRRPAQSTVVKYDQYLRH-----GN 88 (463) Q Consensus 15 ~k~~~e~~~Ks~~agy~~~p~~q~-~gaalr~esLd~~i~~L~~~~~df~f~~~i~k~~~~stv~ey~~~~~h-----G~ 88 (463) +-.++|- .+.++|-........ +++-+..|..++-+..|.... .+.+...+.+..+--.+|.+.... .+ T Consensus 1 ~a~l~el--~~~~~~~~~~g~~~~~~~~liP~~~~~~ii~~l~~~s---~l~~~~~~~~~~~~~~~~p~~~~~~~a~~v~ 75 (333) T protein:vir:78 1 MATLNEL--LPNSAGSNHQGRLAHVPSDLLPKEIVGPIFDKAQESS---LVLRMGEQIPISYGETIIPTTVKRPEVGQVG 75 (333) T ss_pred CchhHHh--hhhcccccccCceecCCccccchhHHHHHHHHHHhhc---hhhhhcceeeccCCceEEEEEeCCceeEeec Confidence 3334333 233333222222222 233455555555554444333 344444444454443455555432 23 Q ss_pred ccccccccccCcccccCcceEEEEEEEEEeechhhhhhhhhhhcccccHHHHHHHHHHHHHHHHHHHHHhhcccccCCCc Q lcl|NC_019448. 89 VGHSRFVKEIGVAPVSDPNIRQKTVSMKYVSDTKNMSIASGLVNNIADPSQILTEDAIAVVAKTIEWASFYGDASLTSEV 168 (463) Q Consensus 89 ~g~~~fv~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lvn~~~Dp~~~~~~~ai~~~~~~~E~a~fyGd~~l~~~~ 168 (463) -|...++.|++..+.+++.+.+.+...+=++.--.+|.-+- .++..|.+....+.-...+++.+|.++|+|+-+..+ T Consensus 76 eg~~~~~~e~~~~~~~~~~f~~i~l~~~kl~~~~~is~ell-~~s~~~~~~~i~~~la~ai~~~~d~~~l~G~g~~~~-- 152 (333) T protein:vir:78 76 VGTSNEQREGGLKPLSGTAWDTRSVSPIKLATIVTVSEEFA-RMNPSGLYTKLQGDLAYAIGRGIDLAVFHGKSPLTG-- 152 (333) T ss_pred CcccccccccccccccccceeEEEEeeEEEEEeehhhHHHH-hcCHHHHHHHHHHHHHHHHHHHHHHHHhcccCCCCC-- Confidence 34566778888899999999999999999998888877332 245668888889999999999999999999987544 Q ss_pred cccccccccceeeec---Cc-ceEeccCCCCCHHHHhhhhhhhhhc-CCceeEEecCHHHHHHHHHHh--cCc-ceEEee Q lcl|NC_019448. 169 EGEGLEFDGLAKLID---KN-NVINAKGNQLTEKHLNEAAVRIGKG-FGTATDAYMPIGVHADFVNSI--LGR-QMQLMQ 240 (463) Q Consensus 169 ~~~gleFDGl~~lI~---~~-nviDarG~~ls~~~ln~aa~~i~~~-~G~~td~~m~~~vka~f~~~~--~~~-qrv~~~ 240 (463) ..+.|+.+... .. -+.-..+..++.+.|-.+-..+..+ +..++-.+|++...+.|.+.- .+. -+.+.+ T Consensus 153 ----~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~vmn~~~~~~L~~~~~~~d~~G~~i~~ 228 (333) T protein:vir:78 153 ----SALQGIDTDNVIANTTNVDYLQETGDPLLDRLLDGYDLVSANTDVEFNGWAVDPRFRAHLLRAQAYRDANGNVDPS 228 (333) T ss_pred ----cccccccccccccccccccccccccchhHHHHHHHHHhhccccccCceEEEEcchHHHHHHHHhhhcCCCCceeec Confidence 35677665221 11 1111233344555555555445444 567778999999988886532 122 233333 Q ss_pred cCCC----CcccceecCeeeecc---ccc-ccCC--ceecc--Ccccc-ccccccCCCCCCCCeeEEEEec-----cCCC Q lcl|NC_019448. 241 DNSG----NVNTGYSVNGFYSSR---GFI-KLHG--STVME--NELIL-DESLQPLPNAPQPAKVTATVET-----KQKG 302 (463) Q Consensus 241 ~n~g----~~~~G~~v~~~~s~~---G~i-~l~~--s~~~~--~d~~l-~~~~~~~p~ap~p~~vtat~~~-----~~~g 302 (463) .... ..-.|++|- .+.. +.. ...+ -+++- ++.++ ++.. ++..... +..+ T Consensus 229 ~~~~~~~~~~l~G~Pv~--~~~~i~~~~~~~~~~~~~~~~gD~~~~~~g~~~~-----------~~i~~~~~~~~~~~~~ 295 (333) T protein:vir:78 229 RINLAAQTGDVLGLPAQ--FGRAVGGDLGAAVDSKTRIIGGDFSQLKFGFADE-----------IRIKMSDTATLTDSGS 295 (333) T ss_pred CccccCCCceeeceeeE--EccccCCCccccCCCccEEEEEecccEEEEEeec-----------cEEEEecccccccccc Confidence 3221 123455441 1110 000 0000 01111 00110 1100 0011000 0111 Q ss_pred cCc--cccc----ccceEEEEEEEecCCccccccceeeeec Q lcl|NC_019448. 303 AFE--DEED----RAGLSYKVVVNSDDAQSAPSEEVTATVS 337 (463) Q Consensus 303 ~~~--~~~~----~a~ysYkV~a~s~~geS~~S~~vt~Tva 337 (463) ... +..+ ++...+-..+... ++.. ...-++.+ T Consensus 296 ~~~~~~~~~~v~~r~~~r~d~~v~~~--~a~~-~l~~~~a~ 333 (333) T protein:vir:78 296 ATVSMWQTNQIAILIEVTFGWLLGDK--QAFV-KFVDDEQP 333 (333) T ss_pred ceeehhhcCcEEEEEEEEEccEEecc--cceE-EEeccCCC Confidence 100 0011 1111111111111 1100 00111111 No 50 >protein:vir:107826 Length: 331 # NCBI annotation: hypothetical protein predicted by GeneMark # Family: family:all:1903 # MgeID: mge:1673 # MgeName: BIP-1 # Cross-refs: genbank:acc:NP_996627;genbank:gi:45580761;genbank:GeneID:2767902 Probab=96.98 E-value=5.7e-06 Score=49.32 Aligned_cols=291 Identities=20% Similarity=0.187 Sum_probs=134.1 Q ss_pred CCCCCccchHHHHhhhhhhHHHHHHhhcCCccCCccccCccccchhhhhhHhhhhhccccccchhhhcccchhhH-HHhh Q lcl|NC_019448. 1 MTIEKNLSDVQQKYADQFQEDVVKSFQTGYGITPDTQIDAGALRREILDDQITMLTWTNEDLIFYRDISRRPAQS-TVVK 79 (463) Q Consensus 1 ~~~~~~~~~~~~~~~k~~~e~~~Ks~~agy~~~p~~q~~gaalr~esLd~~i~~L~~~~~df~f~~~i~k~~~~s-tv~e 79 (463) ||+=...-.+-...+|.+..+ +.+..+.+ ..|+..+ .++..+|=...++ |=|. T Consensus 1 m~~~~~~~~TL~e~Ak~~~~~-------------------~~l~~~II----E~l~~tn---~IL~~lpf~e~N~~t~~~ 54 (331) T protein:vir:10 1 MPTLSTTNPTLADVAARMTPD-------------------GKIDPQIV----EMLNETN---EILDDMTVIEANGFTEHK 54 (331) T ss_pred CCccccCcccHHHHHHhcCcc-------------------hhHHHHHH----HHHhcCc---hHHhhceeeeccCCccce Confidence 554322222222222211000 01111111 1122222 1233333223332 3355 Q ss_pred hhhhhccCcccccccccccCcccccCcceEEEEEEEEEeechhhhhhhhhhh-cccccHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_019448. 80 YDQYLRHGNVGHSRFVKEIGVAPVSDPNIRQKTVSMKYVSDTKNMSIASGLV-NNIADPSQILTEDAIAVVAKTIEWASF 158 (463) Q Consensus 80 y~~~~~hG~~g~~~fv~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lv-n~~~Dp~~~~~~~ai~~~~~~~E~a~f 158 (463) |.+++.-- ...|-.=....+-+.++..|++..++.|..-..|.+...-. .+..+-+++|.+.-|..+.++++..+| T Consensus 55 ~~vrt~LP---~~~fR~lN~g~~~s~~tt~q~t~~l~ilgg~~eVDk~la~~~Gn~~~~ra~e~~~~ik~m~~~~~~~~i 131 (331) T protein:vir:10 55 TTVRSGLP---TGTWRKLNYGVQPEKSRTVQVKDSMGMLETYAEVDKALADLNGNSAAWRLSEDRAFIEGMNQTQATTLF 131 (331) T ss_pred eeEEeccC---CchhhccCCccCcccceeEEEEEEEEEeccceeechHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHHh Confidence 65655332 33443323345667889999999999999999997765444 446677999999999999999999999 Q ss_pred hcccccCCCccccccccccceeeec------CcceEeccCCCCCHHHHhhhhhhhhhcCCceeEEecCHHHHHHHHHHhc Q lcl|NC_019448. 159 YGDASLTSEVEGEGLEFDGLAKLID------KNNVINAKGNQLTEKHLNEAAVRIGKGFGTATDAYMPIGVHADFVNSIL 232 (463) Q Consensus 159 yGd~~l~~~~~~~gleFDGl~~lI~------~~nviDarG~~ls~~~ln~aa~~i~~~~G~~td~~m~~~vka~f~~~~~ 232 (463) |||++.+|. +||||.+..+ ..|+||+.|.--+.-+|+- +..+. ....=+| |-+-++-|+-.=+ T Consensus 132 yGD~a~~p~------~F~GL~kR~~~~~a~~~~q~IdaGgtG~~~TSI~~--v~~~~--~~~~giy-PkG~~~Gl~~~d~ 200 (331) T protein:vir:10 132 YGDSSIDAE------KFMGLTPRFNSLSAENGQNIIDAGGTGSDNASIWL--TVWGP--NTLHTIY-PKGSQAGLQSRDL 200 (331) T ss_pred cCCcccChh------hhccchhhccccccccccceeecCCCCCCceEEEE--EEEcC--CeeEEec-ccccccCceEeec Confidence 999999884 8999999774 3599999887654433321 11111 2333356 8888888876667 Q ss_pred CcceEEeecCCCCcccceecCeeeecccccccCCcee------------------ccCccccccccccCCCCCCCCeeEE Q lcl|NC_019448. 233 GRQMQLMQDNSGNVNTGYSVNGFYSSRGFIKLHGSTV------------------MENELILDESLQPLPNAPQPAKVTA 294 (463) Q Consensus 233 ~~qrv~~~~n~g~~~~G~~v~~~~s~~G~i~l~~s~~------------------~~~d~~l~~~~~~~p~ap~p~~vta 294 (463) +.|... ..+.|.+ .|+ ...|..--|.--.+..-+ -+....|.+-....|+ T Consensus 201 g~~~~~-~~~G~~y-~~y-~~~~~w~~Gl~i~d~r~v~ri~NIdvs~l~~~~~~~~dl~~lm~~a~~~ip~--------- 268 (331) T protein:vir:10 201 GEDTLI-DAAGGRY-QGY-RTHYKWDIGLTLRDWRYVVRIANVDVSELTKNASAGADLIDLMTQAVELIPN--------- 268 (331) T ss_pred Cceeee-cCCCCee-eEE-EEEEEeeeeeEEcCcccEEEEeccchhccCCCcchhhhHHHHHHHHHHHhcc--------- Confidence 777644 4444332 132 222333333211111000 0000111111111111 Q ss_pred EEeccCCCcCcccccccceEEEEEEEecCCcc----ccc---cceeeeecCC-------CCceEEEEEecCCCCCCcceE Q lcl|NC_019448. 295 TVETKQKGAFEDEEDRAGLSYKVVVNSDDAQS----APS---EEVTATVSNV-------DDGVKLSISVNAMYQQQPQFV 360 (463) Q Consensus 295 t~~~~~~g~~~~~~~~a~ysYkV~a~s~~geS----~~S---~~vt~Tva~~-------~~gv~ltIt~~a~~g~~~~~y 360 (463) .+.|+ .+- .||+.-.. ..+ +....++... =.|+.|-.+ -+... +++-+ T Consensus 269 ----~~~~~--------~~~----y~n~~v~~~L~~q~~~~~~~~~~~~~~~~g~~~t~~~gipir~~-dai~~-tE~~V 330 (331) T protein:vir:10 269 ----VGMGR--------PAF----YMPRKIRSFLRRQITNKVAASTLTMEEIAGKKVVAFDGIPCRRT-DALLL-TEARV 330 (331) T ss_pred ----cCCCC--------eEE----EechHHHHHHHHHHhhccceeeeeeeecCCcceeEECCeeEEEe-eeeec-Ccccc Confidence 00111 011 12211100 000 0000111000 011221111 11101 11111 Q ss_pred E Q lcl|NC_019448. 361 S 361 (463) Q Consensus 361 ~ 361 (463) + T Consensus 331 v 331 (331) T protein:vir:10 331 V 331 (331) T ss_pred C Confidence 1 No 51 >protein:vir:98525 Length: 331 # NCBI annotation: hypothetical protein predicted by GeneMark # Family: family:all:1903 # MgeID: mge:1592 # MgeName: BMP-1 # Cross-refs: genbank:acc:NP_996579;genbank:gi:45569510;genbank:GeneID:2767853 Probab=96.98 E-value=5.7e-06 Score=49.32 Aligned_cols=291 Identities=20% Similarity=0.187 Sum_probs=134.1 Q ss_pred CCCCCccchHHHHhhhhhhHHHHHHhhcCCccCCccccCccccchhhhhhHhhhhhccccccchhhhcccchhhH-HHhh Q lcl|NC_019448. 1 MTIEKNLSDVQQKYADQFQEDVVKSFQTGYGITPDTQIDAGALRREILDDQITMLTWTNEDLIFYRDISRRPAQS-TVVK 79 (463) Q Consensus 1 ~~~~~~~~~~~~~~~k~~~e~~~Ks~~agy~~~p~~q~~gaalr~esLd~~i~~L~~~~~df~f~~~i~k~~~~s-tv~e 79 (463) ||+=...-.+-...+|.+..+ +.+..+.+ ..|+..+ .++..+|=...++ |=|. T Consensus 1 m~~~~~~~~TL~e~Ak~~~~~-------------------~~l~~~II----E~l~~tn---~IL~~lpf~e~N~~t~~~ 54 (331) T protein:vir:98 1 MPTLSTTNPTLADVAARMTPD-------------------GKIDPQIV----EMLNETN---EILDDMTVIEANGFTEHK 54 (331) T ss_pred CCccccCcccHHHHHHhcCcc-------------------hhHHHHHH----HHHhcCc---hHHhhceeeeccCCccce Confidence 554322222222222211000 01111111 1122222 1233333223332 3355 Q ss_pred hhhhhccCcccccccccccCcccccCcceEEEEEEEEEeechhhhhhhhhhh-cccccHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_019448. 80 YDQYLRHGNVGHSRFVKEIGVAPVSDPNIRQKTVSMKYVSDTKNMSIASGLV-NNIADPSQILTEDAIAVVAKTIEWASF 158 (463) Q Consensus 80 y~~~~~hG~~g~~~fv~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lv-n~~~Dp~~~~~~~ai~~~~~~~E~a~f 158 (463) |.+++.-- ...|-.=....+-+.++..|++..++.|..-..|.+...-. .+..+-+++|.+.-|..+.++++..+| T Consensus 55 ~~vrt~LP---~~~fR~lN~g~~~s~~tt~q~t~~l~ilgg~~eVDk~la~~~Gn~~~~ra~e~~~~ik~m~~~~~~~~i 131 (331) T protein:vir:98 55 TTVRSGLP---TGTWRKLNYGVQPEKSRTVQVKDSMGMLETYAEVDKALADLNGNSAAWRLSEDRAFIEGMNQTQATTLF 131 (331) T ss_pred eeEEeccC---CchhhccCCccCcccceeEEEEEEEEEeccceeechHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHHh Confidence 65655332 33443323345667889999999999999999997765444 446677999999999999999999999 Q ss_pred hcccccCCCccccccccccceeeec------CcceEeccCCCCCHHHHhhhhhhhhhcCCceeEEecCHHHHHHHHHHhc Q lcl|NC_019448. 159 YGDASLTSEVEGEGLEFDGLAKLID------KNNVINAKGNQLTEKHLNEAAVRIGKGFGTATDAYMPIGVHADFVNSIL 232 (463) Q Consensus 159 yGd~~l~~~~~~~gleFDGl~~lI~------~~nviDarG~~ls~~~ln~aa~~i~~~~G~~td~~m~~~vka~f~~~~~ 232 (463) |||++.+|. +||||.+..+ ..|+||+.|.--+.-+|+- +..+. ....=+| |-+-++-|+-.=+ T Consensus 132 yGD~a~~p~------~F~GL~kR~~~~~a~~~~q~IdaGgtG~~~TSI~~--v~~~~--~~~~giy-PkG~~~Gl~~~d~ 200 (331) T protein:vir:98 132 YGDSSIDAE------KFMGLTPRFNSLSAENGQNIIDAGGTGSDNASIWL--TVWGP--NTLHTIY-PKGSQAGLQSRDL 200 (331) T ss_pred cCCcccChh------hhccchhhccccccccccceeecCCCCCCceEEEE--EEEcC--CeeEEec-ccccccCceEeec Confidence 999999884 8999999774 3599999887654433321 11111 2333356 8888888876667 Q ss_pred CcceEEeecCCCCcccceecCeeeecccccccCCcee------------------ccCccccccccccCCCCCCCCeeEE Q lcl|NC_019448. 233 GRQMQLMQDNSGNVNTGYSVNGFYSSRGFIKLHGSTV------------------MENELILDESLQPLPNAPQPAKVTA 294 (463) Q Consensus 233 ~~qrv~~~~n~g~~~~G~~v~~~~s~~G~i~l~~s~~------------------~~~d~~l~~~~~~~p~ap~p~~vta 294 (463) +.|... ..+.|.+ .|+ ...|..--|.--.+..-+ -+....|.+-....|+ T Consensus 201 g~~~~~-~~~G~~y-~~y-~~~~~w~~Gl~i~d~r~v~ri~NIdvs~l~~~~~~~~dl~~lm~~a~~~ip~--------- 268 (331) T protein:vir:98 201 GEDTLI-DAAGGRY-QGY-RTHYKWDIGLTLRDWRYVVRIANVDVSELTKNASAGADLIDLMTQAVELIPN--------- 268 (331) T ss_pred Cceeee-cCCCCee-eEE-EEEEEeeeeeEEcCcccEEEEeccchhccCCCcchhhhHHHHHHHHHHHhcc--------- Confidence 777644 4444332 132 222333333211111000 0000111111111111 Q ss_pred EEeccCCCcCcccccccceEEEEEEEecCCcc----ccc---cceeeeecCC-------CCceEEEEEecCCCCCCcceE Q lcl|NC_019448. 295 TVETKQKGAFEDEEDRAGLSYKVVVNSDDAQS----APS---EEVTATVSNV-------DDGVKLSISVNAMYQQQPQFV 360 (463) Q Consensus 295 t~~~~~~g~~~~~~~~a~ysYkV~a~s~~geS----~~S---~~vt~Tva~~-------~~gv~ltIt~~a~~g~~~~~y 360 (463) .+.|+ .+- .||+.-.. ..+ +....++... =.|+.|-.+ -+... +++-+ T Consensus 269 ----~~~~~--------~~~----y~n~~v~~~L~~q~~~~~~~~~~~~~~~~g~~~t~~~gipir~~-dai~~-tE~~V 330 (331) T protein:vir:98 269 ----VGMGR--------PAF----YMPRKIRSFLRRQITNKVAASTLTMEEIAGKKVVAFDGIPCRRT-DALLL-TEARV 330 (331) T ss_pred ----cCCCC--------eEE----EechHHHHHHHHHHhhccceeeeeeeecCCcceeEECCeeEEEe-eeeec-Ccccc Confidence 00111 011 12211100 000 0000111000 011221111 11101 11111 Q ss_pred E Q lcl|NC_019448. 361 S 361 (463) Q Consensus 361 ~ 361 (463) + T Consensus 331 v 331 (331) T protein:vir:98 331 V 331 (331) T ss_pred C Confidence 1 No 52 >protein:vir:107388 Length: 331 # NCBI annotation: Bbp17 # Family: family:all:1903 # MgeID: mge:1537 # MgeName: BPP-1 # Cross-refs: genbank:acc:NP_958686;genbank:gi:41179378;genbank:GeneID:2717182 Probab=96.98 E-value=5.7e-06 Score=49.32 Aligned_cols=291 Identities=20% Similarity=0.187 Sum_probs=134.1 Q ss_pred CCCCCccchHHHHhhhhhhHHHHHHhhcCCccCCccccCccccchhhhhhHhhhhhccccccchhhhcccchhhH-HHhh Q lcl|NC_019448. 1 MTIEKNLSDVQQKYADQFQEDVVKSFQTGYGITPDTQIDAGALRREILDDQITMLTWTNEDLIFYRDISRRPAQS-TVVK 79 (463) Q Consensus 1 ~~~~~~~~~~~~~~~k~~~e~~~Ks~~agy~~~p~~q~~gaalr~esLd~~i~~L~~~~~df~f~~~i~k~~~~s-tv~e 79 (463) ||+=...-.+-...+|.+..+ +.+..+.+ ..|+..+ .++..+|=...++ |=|. T Consensus 1 m~~~~~~~~TL~e~Ak~~~~~-------------------~~l~~~II----E~l~~tn---~IL~~lpf~e~N~~t~~~ 54 (331) T protein:vir:10 1 MPTLSTTNPTLADVAARMTPD-------------------GKIDPQIV----EMLNETN---EILDDMTVIEANGFTEHK 54 (331) T ss_pred CCccccCcccHHHHHHhcCcc-------------------hhHHHHHH----HHHhcCc---hHHhhceeeeccCCccce Confidence 554322222222222211000 01111111 1122222 1233333223332 3355 Q ss_pred hhhhhccCcccccccccccCcccccCcceEEEEEEEEEeechhhhhhhhhhh-cccccHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_019448. 80 YDQYLRHGNVGHSRFVKEIGVAPVSDPNIRQKTVSMKYVSDTKNMSIASGLV-NNIADPSQILTEDAIAVVAKTIEWASF 158 (463) Q Consensus 80 y~~~~~hG~~g~~~fv~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lv-n~~~Dp~~~~~~~ai~~~~~~~E~a~f 158 (463) |.+++.-- ...|-.=....+-+.++..|++..++.|..-..|.+...-. .+..+-+++|.+.-|..+.++++..+| T Consensus 55 ~~vrt~LP---~~~fR~lN~g~~~s~~tt~q~t~~l~ilgg~~eVDk~la~~~Gn~~~~ra~e~~~~ik~m~~~~~~~~i 131 (331) T protein:vir:10 55 TTVRSGLP---TGTWRKLNYGVQPEKSRTVQVKDSMGMLETYAEVDKALADLNGNSAAWRLSEDRAFIEGMNQTQATTLF 131 (331) T ss_pred eeEEeccC---CchhhccCCccCcccceeEEEEEEEEEeccceeechHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHHh Confidence 65655332 33443323345667889999999999999999997765444 446677999999999999999999999 Q ss_pred hcccccCCCccccccccccceeeec------CcceEeccCCCCCHHHHhhhhhhhhhcCCceeEEecCHHHHHHHHHHhc Q lcl|NC_019448. 159 YGDASLTSEVEGEGLEFDGLAKLID------KNNVINAKGNQLTEKHLNEAAVRIGKGFGTATDAYMPIGVHADFVNSIL 232 (463) Q Consensus 159 yGd~~l~~~~~~~gleFDGl~~lI~------~~nviDarG~~ls~~~ln~aa~~i~~~~G~~td~~m~~~vka~f~~~~~ 232 (463) |||++.+|. +||||.+..+ ..|+||+.|.--+.-+|+- +..+. ....=+| |-+-++-|+-.=+ T Consensus 132 yGD~a~~p~------~F~GL~kR~~~~~a~~~~q~IdaGgtG~~~TSI~~--v~~~~--~~~~giy-PkG~~~Gl~~~d~ 200 (331) T protein:vir:10 132 YGDSSIDAE------KFMGLTPRFNSLSAENGQNIIDAGGTGSDNASIWL--TVWGP--NTLHTIY-PKGSQAGLQSRDL 200 (331) T ss_pred cCCcccChh------hhccchhhccccccccccceeecCCCCCCceEEEE--EEEcC--CeeEEec-ccccccCceEeec Confidence 999999884 8999999774 3599999887654433321 11111 2333356 8888888876667 Q ss_pred CcceEEeecCCCCcccceecCeeeecccccccCCcee------------------ccCccccccccccCCCCCCCCeeEE Q lcl|NC_019448. 233 GRQMQLMQDNSGNVNTGYSVNGFYSSRGFIKLHGSTV------------------MENELILDESLQPLPNAPQPAKVTA 294 (463) Q Consensus 233 ~~qrv~~~~n~g~~~~G~~v~~~~s~~G~i~l~~s~~------------------~~~d~~l~~~~~~~p~ap~p~~vta 294 (463) +.|... ..+.|.+ .|+ ...|..--|.--.+..-+ -+....|.+-....|+ T Consensus 201 g~~~~~-~~~G~~y-~~y-~~~~~w~~Gl~i~d~r~v~ri~NIdvs~l~~~~~~~~dl~~lm~~a~~~ip~--------- 268 (331) T protein:vir:10 201 GEDTLI-DAAGGRY-QGY-RTHYKWDIGLTLRDWRYVVRIANVDVSELTKNASAGADLIDLMTQAVELIPN--------- 268 (331) T ss_pred Cceeee-cCCCCee-eEE-EEEEEeeeeeEEcCcccEEEEeccchhccCCCcchhhhHHHHHHHHHHHhcc--------- Confidence 777644 4444332 132 222333333211111000 0000111111111111 Q ss_pred EEeccCCCcCcccccccceEEEEEEEecCCcc----ccc---cceeeeecCC-------CCceEEEEEecCCCCCCcceE Q lcl|NC_019448. 295 TVETKQKGAFEDEEDRAGLSYKVVVNSDDAQS----APS---EEVTATVSNV-------DDGVKLSISVNAMYQQQPQFV 360 (463) Q Consensus 295 t~~~~~~g~~~~~~~~a~ysYkV~a~s~~geS----~~S---~~vt~Tva~~-------~~gv~ltIt~~a~~g~~~~~y 360 (463) .+.|+ .+- .||+.-.. ..+ +....++... =.|+.|-.+ -+... +++-+ T Consensus 269 ----~~~~~--------~~~----y~n~~v~~~L~~q~~~~~~~~~~~~~~~~g~~~t~~~gipir~~-dai~~-tE~~V 330 (331) T protein:vir:10 269 ----VGMGR--------PAF----YMPRKIRSFLRRQITNKVAASTLTMEEIAGKKVVAFDGIPCRRT-DALLL-TEARV 330 (331) T ss_pred ----cCCCC--------eEE----EechHHHHHHHHHHhhccceeeeeeeecCCcceeEECCeeEEEe-eeeec-Ccccc Confidence 00111 011 12211100 000 0000111000 011221111 11101 11111 Q ss_pred E Q lcl|NC_019448. 361 S 361 (463) Q Consensus 361 ~ 361 (463) + T Consensus 331 v 331 (331) T protein:vir:10 331 V 331 (331) T ss_pred C Confidence 1 No 53 >protein:vir:2430 Length: 318 # NCBI annotation: major head subunit # Family: family:all:507 # MgeID: mge:52 # MgeName: D29 # Cross-refs: genbank:acc:NP_046832;genbank:gi:9630400;genbank:GeneID:1261582 Probab=96.97 E-value=0.00012 Score=42.03 Aligned_cols=293 Identities=14% Similarity=0.099 Sum_probs=136.2 Q ss_pred CCCCCccchHHHHhhhhhhHHHHHHhhcCCccCCccccCccccchhhhhhHhhhhhccccccchhhhcccchhhHHHhhh Q lcl|NC_019448. 1 MTIEKNLSDVQQKYADQFQEDVVKSFQTGYGITPDTQIDAGALRREILDDQITMLTWTNEDLIFYRDISRRPAQSTVVKY 80 (463) Q Consensus 1 ~~~~~~~~~~~~~~~k~~~e~~~Ks~~agy~~~p~~q~~gaalr~esLd~~i~~L~~~~~df~f~~~i~k~~~~stv~ey 80 (463) |---+ .++.+-.+..+++ +...|+.+..+...+-|..|.. .-.+.+-....+..+.-.+| T Consensus 1 ~~~~~-----------~~~~e~~~~~~~~------~~~~~~~ip~~~~~~ii~~~~~---~~~l~~~~~~~~~~~~~~~i 60 (318) T protein:vir:24 1 MAAGT-----------AFAVDHAQIAQTG------DTMFKGYLEPEQAKDYFAEAEK---TSIVQQFAQKVPMGTTGQKI 60 (318) T ss_pred CCCCC-----------CCCHHHHHhhccc------CcccceeechhHHHHHHHHHHh---hchhhhhcceeeccCCceEE Confidence 11111 1122222223332 1223444666555444443322 23455555555555555556 Q ss_pred hhhhccCcccccccccccCcccccCcceEEEEEEEEEeechhhhhhhhhhhcccccHHHHHHHHHHHHHHHHHHHHHhhc Q lcl|NC_019448. 81 DQYLRHGNVGHSRFVKEIGVAPVSDPNIRQKTVSMKYVSDTKNMSIASGLVNNIADPSQILTEDAIAVVAKTIEWASFYG 160 (463) Q Consensus 81 ~~~~~hG~~g~~~fv~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lvn~~~Dp~~~~~~~ai~~~~~~~E~a~fyG 160 (463) .+... .+...+++|++..+.+++.+.+.....+=++.--.+|.-+ +.++..|.+....+.....+++.+|.++++| T Consensus 61 p~~~~---~~~a~~v~Eg~~~~~~~~~f~~i~~~~~k~~~~~~iS~e~-l~ds~~~~~~~i~~~l~~~~~~~~d~a~l~G 136 (318) T protein:vir:24 61 PHWVG---DVSAQWIGEGDMKPITKGNMTSQTIAPHKIATIFVASAET-VRANPANYLGTMRTKVATAFAMAFDGAAMHG 136 (318) T ss_pred EEEeC---CcceEEecCCccccccccceeEEEEeeEEEEEeehhhHHH-hhcChHHHHHHHHHHHHHHHHHHHHHhhhcc Confidence 66653 3457899999999999999999999999988877777632 2345568899999999999999999999999 Q ss_pred ccccCCCccccccccccceeeecCcceEecc-CCCCCHHHHhhhhhhhhhcCCceeEEecCHHHHHHHHHHhcCcceEEe Q lcl|NC_019448. 161 DASLTSEVEGEGLEFDGLAKLIDKNNVINAK-GNQLTEKHLNEAAVRIGKGFGTATDAYMPIGVHADFVNSILGRQMQLM 239 (463) Q Consensus 161 d~~l~~~~~~~gleFDGl~~lI~~~nviDar-G~~ls~~~ln~aa~~i~~~~G~~td~~m~~~vka~f~~~~~~~qrv~~ 239 (463) +.+-.+ .|+...+..-..-... +.....+.+.++...+...+....-..|++.+.+.+...-=..-|.+. T Consensus 137 ~g~~~~---------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~n~~~~~~L~~lkd~~G~~l~ 207 (318) T protein:vir:24 137 TDSPFP---------TYIGQTTKAISIADTTGATTVYDQVAVNGLSLLVNDGKKWTHTLLDDITEPILNGAKDQNGRPLF 207 (318) T ss_pred cCCCCC---------cccccccccccccccccccchHHHHHHHHHHhhccccCCCCEEEEcHHHHHHHHHhhccCCceee Confidence 875222 2333333221111111 122233445566667777788788899999999999853322223344 Q ss_pred ecCCCCccc----ceecCe---eeecccccccCCceecc-----------CccccccccccCCCCCCCCeeEEEEeccCC Q lcl|NC_019448. 240 QDNSGNVNT----GYSVNG---FYSSRGFIKLHGSTVME-----------NELILDESLQPLPNAPQPAKVTATVETKQK 301 (463) Q Consensus 240 ~~n~g~~~~----G~~v~~---~~s~~G~i~l~~s~~~~-----------~d~~l~~~~~~~p~ap~p~~vtat~~~~~~ 301 (463) +++..+... |..+-+ +.+.. +......++- .+..+...+. + +.+...+.. T Consensus 208 ~~~~~~~~~~~~~~~~i~g~pv~~~~~--~~~~~~~~~~gdfs~~~~~~~~~l~i~~~~~--------~--~~~~~~~~~ 275 (318) T protein:vir:24 208 IESTYGEAASPFRSGRIVARPTILSDH--VVEGTTVGFMGDFSQLIWGQIGGLSFDVTDQ--------A--TLNLGTVES 275 (318) T ss_pred cCccccCccccccCceEEEEeeEEeCC--CCCCccEEEEeecceEEEEEecCeEEEEeec--------c--ceecccccc Confidence 443322111 111111 11110 0000011000 0111100000 0 000000000 Q ss_pred CcCc--ccccccc----eEEEEEEEecCCccccccceeeeecCCCCc Q lcl|NC_019448. 302 GAFE--DEEDRAG----LSYKVVVNSDDAQSAPSEEVTATVSNVDDG 342 (463) Q Consensus 302 g~~~--~~~~~a~----ysYkV~a~s~~geS~~S~~vt~Tva~~~~g 342 (463) +... +..+.-. .++-..+.+ .++. ..++.-.+....| T Consensus 276 ~~~~~~f~~~~~~~r~~~r~d~~v~~--~~a~--~~i~~~~a~~~~~ 318 (318) T protein:vir:24 276 PNFVSLWQHNLVAVRVEAEYAFHCND--AEAF--VALTNVVSGGGEG 318 (318) T ss_pred ccchhhhhcCcEEEEEEEEEccEEec--ccce--EEEEeeccCCCCC Confidence 0000 0001111 111111111 1111 1122222222222 No 54 >protein:vir:94673 Length: 419 # NCBI annotation: major capsid protein # Family: family:all:585 # MgeID: mge:1527 # MgeName: mu1/6 # Cross-refs: genbank:acc:YP_579208;genbank:gi:93007444;genbank:GeneID:5076792 Probab=96.91 E-value=0.00026 Score=40.22 Aligned_cols=310 Identities=10% Similarity=0.032 Sum_probs=136.6 Q ss_pred CC-CCCccchHHHHhh-----hhhh------------HHHH-HHhhcCCccCCccccCccccchhhhhhHhhhhhccccc Q lcl|NC_019448. 1 MT-IEKNLSDVQQKYA-----DQFQ------------EDVV-KSFQTGYGITPDTQIDAGALRREILDDQITMLTWTNED 61 (463) Q Consensus 1 ~~-~~~~~~~~~~~~~-----k~~~------------e~~~-Ks~~agy~~~p~~q~~gaalr~esLd~~i~~L~~~~~d 61 (463) -+ .+.......+.+. +.+. ..+. .+.+.-.........+++.+--+.+...+..+....- T Consensus 74 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~i~~~~~~~~- 152 (419) T protein:vir:94 74 TPAEAGTFRSLAQRFADSDGLREYRARDKRGQFQVEMRDIDPNRLLSRDAPAGTITNPNVPHLPQLVPGIVPTTPDLPL- 152 (419) T ss_pred cccccccccchhhhhhhHHHHHHHHHhhhhhhhhHHHHHHHHHHhhccccccccccCCcccccchhhhHHHHHHHhhhh- Confidence 00 0001111111100 0000 0000 0001111111112233444444444444443332221 Q ss_pred cchhhhcccchhhHHHhhhhhhhcc-----CcccccccccccCcccccCcceEEEEEEEEEeechhhhhhhhhhhccccc Q lcl|NC_019448. 62 LIFYRDISRRPAQSTVVKYDQYLRH-----GNVGHSRFVKEIGVAPVSDPNIRQKTVSMKYVSDTKNMSIASGLVNNIAD 136 (463) Q Consensus 62 f~f~~~i~k~~~~stv~ey~~~~~h-----G~~g~~~fv~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lvn~~~D 136 (463) .+.+-+...+..+-...|.+.+.+ ++.+...+++|++..+.+++.+.+.+...+-++.--.+|.-+ .+...+ T Consensus 153 -~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~i~~~~~k~~~~~~is~el--l~d~~~ 229 (419) T protein:vir:94 153 -LVADLLDQQNADYNVLEYIRDTSGTAGAGSTWNKAAVVPEGTAKPQSTLSFDTITTTLKTVAHWLPITRQA--ADDNSQ 229 (419) T ss_pred -hhhhcceeeeccCCceeeeeeccccccccccCcccceecCCccccccccceeeEEeeeeeEEEeehhhHHH--HHhHHH Confidence 122222222333333444443322 223356799999999999999999999999999888888653 344457 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhhcccccCCCccccccccccceeeecCc--ceEecc---CCCCCHHHHhhhhhhhhhcC Q lcl|NC_019448. 137 PSQILTEDAIAVVAKTIEWASFYGDASLTSEVEGEGLEFDGLAKLIDKN--NVINAK---GNQLTEKHLNEAAVRIGKGF 211 (463) Q Consensus 137 p~~~~~~~ai~~~~~~~E~a~fyGd~~l~~~~~~~gleFDGl~~lI~~~--nviDar---G~~ls~~~ln~aa~~i~~~~ 211 (463) .+....+.--..++.+++.++++||-+= +.-|+.+.-... ++.... ......+.|.++-..+...| T Consensus 230 l~~~i~~~la~a~~~~~d~aii~G~G~~---------~p~Gi~~~~~~~~~~~~~~~~~~t~~~~~~~l~~~~~~~~~~~ 300 (419) T protein:vir:94 230 LMGYIQGRLTYGLRFLRDRQLLNGNGST---------EMQGILTTPGIGTYQQPKPTAPATDEPPLVDIRRAKTVAEIAG 300 (419) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhccCcc---------cccceecccccccccccccccccccchhHHHHHHHHHhhhhcc Confidence 7888888899999999999999998762 344554432211 111111 11122455666666777778 Q ss_pred CceeEEecCHHHHHHHHHHhcC-cceEEeecCCCCc----ccceecCeeeecccccccCCceeccCccccccccc--cCC Q lcl|NC_019448. 212 GTATDAYMPIGVHADFVNSILG-RQMQLMQDNSGNV----NTGYSVNGFYSSRGFIKLHGSTVMENELILDESLQ--PLP 284 (463) Q Consensus 212 G~~td~~m~~~vka~f~~~~~~-~qrv~~~~n~g~~----~~G~~v~~~~s~~G~i~l~~s~~~~~d~~l~~~~~--~~p 284 (463) ..++-.+|++.+...+...--. ..+.+.+++..+. -.|++| +.+.. +. .+.+++. |.... ... T Consensus 301 ~~~~~~v~n~~~~~~l~~~k~~~~~~~~~~~~~~~~~~~~l~G~pV--~~~~~--~~-~~~~~~g-----d~~~~~~~~~ 370 (419) T protein:vir:94 301 FPPDGVVVHPQDWESIELDQAPGSGVFRVIANVQGEATPRIWGLNV--VSTVA--IA-QGTALVG-----GFRQGATLWS 370 (419) T ss_pred CCCCEEEEcHHHHHHHHHHhhcCCCceeecCCcccCCCccccceee--EEcCC--CC-CccEEEe-----eccceEEEEE Confidence 7788899999999998765544 3333445443322 122221 01100 00 0011110 00000 000 Q ss_pred CCCCCCeeEEEEeccCCCcCcccccccceEEEEEEEecCCccccccceeeeecCCCC Q lcl|NC_019448. 285 NAPQPAKVTATVETKQKGAFEDEEDRAGLSYKVVVNSDDAQSAPSEEVTATVSNVDD 341 (463) Q Consensus 285 ~ap~p~~vtat~~~~~~g~~~~~~~~a~ysYkV~a~s~~geS~~S~~vt~Tva~~~~ 341 (463) + . .++..........|. .....|++...-+.+=--|...+-.+++..+. T Consensus 371 ~--~--~~~v~~~~~~~~~~~----~~~~~~r~~~r~d~~v~~~~a~~~~~~~aa~~ 419 (419) T protein:vir:94 371 R--Q--GITVLMTDSHADFFT----ANTLVILAEFRANLAVYQPKAFVRVTFAAATT 419 (419) T ss_pred e--c--ceEEEEeccccchhh----cCcEEEEEEEeeccEEeccccEEEEEeccCCC Confidence 0 0 011111111110000 01122333322222222233333333332222 No 55 >protein:vir:10364 Length: 390 # NCBI annotation: head protein; major capsid subunit precursor # Family: family:all:585 # MgeID: mge:183 # MgeName: Xp10 # Cross-refs: genbank:acc:NP_858956;genbank:gi:32128421;genbank:GeneID:2648357 Probab=96.88 E-value=0.00028 Score=40.07 Aligned_cols=302 Identities=12% Similarity=0.043 Sum_probs=143.5 Q ss_pred CCCCCccchHHHHhhh---------hhhHH------HHHHhhcCCccCCccccCccccchhhhhhHhhhhhccccccchh Q lcl|NC_019448. 1 MTIEKNLSDVQQKYAD---------QFQED------VVKSFQTGYGITPDTQIDAGALRREILDDQITMLTWTNEDLIFY 65 (463) Q Consensus 1 ~~~~~~~~~~~~~~~k---------~~~e~------~~Ks~~agy~~~p~~q~~gaalr~esLd~~i~~L~~~~~df~f~ 65 (463) -..+.......+...+ ..... -.|++.. ...+..+..+|+-+.-+.+.+-|..+ .+...++ T Consensus 69 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~g~~~~~~~~~~ii~~~---~~~~~l~ 144 (390) T protein:vir:10 69 AGGDVQHVSVGDLFVASEQFQASAGRWNDRSARATMNIKAALN-TASTDAAGSAGALTTPNRLPGFITQP---DARLTVR 144 (390) T ss_pred ccccccccchhhhhhhhHHHHHHHHhhhhhhhhhhhHHHHHHH-hhhcccccccccccchhHHHHHHHHH---Hhhchhh Confidence 1111111111111111 00000 0011110 01112223345556666655444433 2333556 Q ss_pred hhcccchhhHHHhhhhhhhccCcccccccccccCcccccCcceEEEEEEEEEeechhhhhhhhhhhcccccHHHHHHHHH Q lcl|NC_019448. 66 RDISRRPAQSTVVKYDQYLRHGNVGHSRFVKEIGVAPVSDPNIRQKTVSMKYVSDTKNMSIASGLVNNIADPSQILTEDA 145 (463) Q Consensus 66 ~~i~k~~~~stv~ey~~~~~hG~~g~~~fv~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lvn~~~Dp~~~~~~~a 145 (463) +-+...++.+.-.+|.+... ..+...+++|++..+..|+++......++-++....+|.-+ +.++ .+......+.- T Consensus 145 ~~~~~~~~~~~~~~~~~~~~--~~~~a~~v~Eg~~~~~~~~~~~~i~~~~~k~~~~~~is~el-l~d~-~~l~~~i~~~l 220 (390) T protein:vir:10 145 DLIGSGRTDSALIEYVQETG--FVNNAAIVAEGALKPESSLKFAKKTDTTHVIAHTMKATRQI-LSDA-PQLASYMNNRL 220 (390) T ss_pred hhcceeeccCCceEEEEEec--CCcceeeecCCccccccccceeEEEEeeEEEEEeehhhHHH-HHhH-HHHHHHHHHHH Confidence 65655555554445555443 33456789999999999999999999999999988888854 3344 47788888888 Q ss_pred HHHHHHHHHHHHhhcccccCCCccccccccccceeeecCcc-eEeccCCCCCHHHHhhhhhhhhhcCCceeEEecCHHHH Q lcl|NC_019448. 146 IAVVAKTIEWASFYGDASLTSEVEGEGLEFDGLAKLIDKNN-VINAKGNQLTEKHLNEAAVRIGKGFGTATDAYMPIGVH 224 (463) Q Consensus 146 i~~~~~~~E~a~fyGd~~l~~~~~~~gleFDGl~~lI~~~n-viDarG~~ls~~~ln~aa~~i~~~~G~~td~~m~~~vk 224 (463) ...++..++.++++|+-. +-++.|+.+...... .....|. .+.+.|..+...+..+|...+-.+|++.+. T Consensus 221 ~~~~~~~~~~~il~G~G~--------~~~p~Gi~~~~~~~~~~~~~~~~-~~~~~~~~~~~~l~~~~~~~~~~v~n~~~~ 291 (390) T protein:vir:10 221 IRGLKVKEDAEILRGTGA--------NDGLLGLIPQATTYAAPTTIAGA-TRVDQLRLAMLQASLAEYPASGIVINPIDW 291 (390) T ss_pred HHHHHHHHHHHHhhcCCC--------Ccccccccccccccccccccccc-chHHHHHHHHHhhccccCCCCEEEEcHHHH Confidence 899999999999999743 125778877654332 2333333 345566666677778888888899999999 Q ss_pred HHHHHHhcCcceEEeec-CCCC--cccceecCeeeecccccccCCceeccCcccccccc-ccCCCCCCCCeeEEEEeccC Q lcl|NC_019448. 225 ADFVNSILGRQMQLMQD-NSGN--VNTGYSVNGFYSSRGFIKLHGSTVMENELILDESL-QPLPNAPQPAKVTATVETKQ 300 (463) Q Consensus 225 a~f~~~~~~~qrv~~~~-n~g~--~~~G~~v~~~~s~~G~i~l~~s~~~~~d~~l~~~~-~~~p~ap~p~~vtat~~~~~ 300 (463) +.+...--..-|.+.++ ..+. .-.|.+| +.+.. +. .+.+++. |... ...- -...++..... T Consensus 292 ~~L~~lkd~~g~~l~~~~~~~~~~~l~G~pv--~~~~~--~p-~~~~~~g-----df~~~~~~~---~~~~~~i~~~~-- 356 (390) T protein:vir:10 292 AAIELAKDANNQYLIGNARGTLTPTLWGLPV--VATQA--MA-PGEFLVG-----AFDLAAQIF---DQWDARVEIGY-- 356 (390) T ss_pred HHHHHhhcCCCceeecCCcCcCCceecceee--EEcCC--CC-CCcEEEE-----eccceEEEE---EecceEEEEee-- Confidence 98885332222333332 1110 0122221 00000 00 0011100 0000 0000 00000111000 Q ss_pred CCcCcccccccceEEEEEEEecCCccccccceeeeec Q lcl|NC_019448. 301 KGAFEDEEDRAGLSYKVVVNSDDAQSAPSEEVTATVS 337 (463) Q Consensus 301 ~g~~~~~~~~a~ysYkV~a~s~~geS~~S~~vt~Tva 337 (463) ....+ ......|++...-+..=--|...+.+|++ T Consensus 357 ~~~~~---~~~~~~~r~~~r~d~~v~~~~a~~~~~~a 390 (390) T protein:vir:10 357 VNDDF---QRNMVTVLAEERLALVVYRPEALISGSFA 390 (390) T ss_pred ccccc---ccCcEEEEEEEeeccEEeccccEEEEEeC Confidence 00000 00011222222212111222333333333 No 56 >protein:vir:105563 Length: 396 # NCBI annotation: hypothetical protein # Family: family:all:27455 # MgeID: mge:1540 # MgeName: F116 # Cross-refs: genbank:acc:YP_164316;genbank:gi:56692963;genbank:GeneID:3197174 Probab=96.88 E-value=2.7e-05 Score=45.61 Aligned_cols=241 Identities=15% Similarity=0.151 Sum_probs=101.8 Q ss_pred hcccccCCCccccccccccceeeec-----------------Ccce-EeccCCCCCHHHHhhhhhhhhhcCCceeEEecC Q lcl|NC_019448. 159 YGDASLTSEVEGEGLEFDGLAKLID-----------------KNNV-INAKGNQLTEKHLNEAAVRIGKGFGTATDAYMP 220 (463) Q Consensus 159 yGd~~l~~~~~~~gleFDGl~~lI~-----------------~~nv-iDarG~~ls~~~ln~aa~~i~~~~G~~td~~m~ 220 (463) -..-+|. -|-|+.++.+ +.|| ||+.|+.=- |.+|=...|--+. T Consensus 1 ~~~~~~~--------~~~ginnv~~e~~l~~~~~~~~~~~r~a~nvdi~~~G~~~~-----------r~~~tr~~~g~l~ 61 (396) T protein:vir:10 1 MATTSLV--------PLAGINNVAEDAALQRGGESPRLYVRDAVNIDLSPAGKAQL-----------RASVRQVTDQPFR 61 (396) T ss_pred Ccceeee--------eeecccccccccccccCCCcccceeeeeeeecccCCCchhh-----------hccCcccCCceec Confidence 1222332 3555555443 2222 444443210 0111111111111 Q ss_pred HHHHHHHHHHhcCcce--------EEeecCCCCcccceecCeeeeccccccc---CCceeccCcc---cc---ccccccC Q lcl|NC_019448. 221 IGVHADFVNSILGRQM--------QLMQDNSGNVNTGYSVNGFYSSRGFIKL---HGSTVMENEL---IL---DESLQPL 283 (463) Q Consensus 221 ~~vka~f~~~~~~~qr--------v~~~~n~g~~~~G~~v~~~~s~~G~i~l---~~s~~~~~d~---~l---~~~~~~~ 283 (463) + +-|+.+.+.- +...++.- .. +....-.+|.+.. ++-+.+-+|. +. ...++.+ T Consensus 62 ~-----~~~~~~~~~~~~~~~~tl~~~~~~~w--~~---~~~v~v~~~pva~d~~~~Rvy~t~~~~p~~~~~~~~y~L~v 131 (396) T protein:vir:10 62 Q-----LWQSPLHGDAFGALGDQWGKVDPHSW--TF---EPLAQIGEGDLSHEVLNNRVCVAGTAGIFTYDGAQAERLTL 131 (396) T ss_pred c-----cccCccccceeeeCCceEEEEeCCeE--EE---EeeeeeccCchhccccCCeEEEEcCCCceeeeCCcceecCc Confidence 1 1111111111 11111110 00 0000112233321 1122211111 11 1222222 Q ss_pred CCCCCCCeeEEEEeccCCCcCcccccccceEEEEEEEecCCccccccceeeeecCCCCceEEEEEecCCCCCCcceEEEE Q lcl|NC_019448. 284 PNAPQPAKVTATVETKQKGAFEDEEDRAGLSYKVVVNSDDAQSAPSEEVTATVSNVDDGVKLSISVNAMYQQQPQFVSIY 363 (463) Q Consensus 284 p~ap~p~~vtat~~~~~~g~~~~~~~~a~ysYkV~a~s~~geS~~S~~vt~Tva~~~~gv~ltIt~~a~~g~~~~~y~IY 363 (463) |. |+|+...+ ..|+ -+.+.|.|.++.+...||..++.+++..++ ...++.++|++.. +..-..+.|| T Consensus 132 p~-P~~a~~~a-----~~Gs----l~~~~~~Y~~t~V~~~gEEs~p~~~S~~v~-~~gg~~vtl~~~~--~~~i~~~RiY 198 (396) T protein:vir:10 132 DT-PAPPLLVA-----GAGS----LSQGTYGAAVAWLRGPQESAPSLIAFAEVT-DAGALEVTFPLCL--DASVTGARLY 198 (396) T ss_pred CC-Cccccccc-----ccCc----cCCceEEEEEEEEecCCCcCcccccccccC-CCCCcEEEEEccc--CCCcceEEEE Confidence 22 33332222 2232 134578899999999997555555666666 6678888888632 2233578999 Q ss_pred eecCCCceEEEEEEeeeeeecCCceEEEEeccCCCCCCccceecCCchHHHHhhhhcchhhcCCcccCCcc-------ee Q lcl|NC_019448. 364 RQGKETGMYFLIKRVPVKDAQEDGTIVFVDKNETLPETADVFVGEMSPQVVHLFELLPMMKLPLAQINASI-------TF 436 (463) Q Consensus 364 R~~~~~g~~~li~rv~~s~~n~~gtttf~D~N~~iPgt~~~fvGe~~pqvi~l~ellPm~k~pla~~na~~-------~~ 436 (463) |++.+++.|+|++..+. ++.+|++.-. | .++.|.. +..|.| +|+..+-+.. .. T Consensus 199 rS~~~G~~~~l~aE~~a------~~~s~vlPs~--~-------w~gpP~~--~~gL~p---mP~G~~~A~faGRi~~A~G 258 (396) T protein:vir:10 199 LTRANGGELLLAGDYPL------GAATVILPTL--P-------ELGRPAQ--FRHLSP---MPTGKHLAYWRGRLLIARA 258 (396) T ss_pred EeCCChhhhhheehhcc------ceeeeeeecC--C-------CCCCCcc--cccccc---CchhHhhhhhcceEEEEeC Confidence 99999999999997763 5556665221 1 1222322 344454 4555454444 22 Q ss_pred eeeeechhheecceeeEEEEEEe-------Eec-C Q lcl|NC_019448. 437 AVLWYGALALRAPKKWARIKNVR-------YIA-V 463 (463) Q Consensus 437 ~V~~Yg~L~l~aPkk~~~ikNV~-------~~~-~ 463 (463) -.+||.-..+ |..|-+-++-. -+. | T Consensus 259 n~V~FSEp~~--Ph~~~~~~~~~~~~~~Iv~lapv 291 (396) T protein:vir:10 259 NVLRFSEALA--YHLHDERYGFVQMPQRITFVQPV 291 (396) T ss_pred CEEEEecCCC--CceecchhccCCCCCceEEEEEe Confidence 5556655443 43223222222 111 1 No 57 >protein:vir:81070 Length: 390 # NCBI annotation: p09 # Family: family:all:585 # MgeID: mge:1889 # MgeName: Xop411 # Cross-refs: genbank:acc:YP_001285679;genbank:gi:148727187;genbank:GeneID:5247115 Probab=96.86 E-value=0.00029 Score=39.96 Aligned_cols=304 Identities=12% Similarity=-0.008 Sum_probs=142.4 Q ss_pred CCCC-Ccc-----chHHHHhhhhhhHHH----------HHHhhcCCccCCccccCccccchhhhhhHhhhhhccccccch Q lcl|NC_019448. 1 MTIE-KNL-----SDVQQKYADQFQEDV----------VKSFQTGYGITPDTQIDAGALRREILDDQITMLTWTNEDLIF 64 (463) Q Consensus 1 ~~~~-~~~-----~~~~~~~~k~~~e~~----------~Ks~~agy~~~p~~q~~gaalr~esLd~~i~~L~~~~~df~f 64 (463) ...+ ... ........+.+.... .|.+.. ...+..+..+|+.+.-|....-+..+ .+...+ T Consensus 68 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~g~~~~~~~~~~ii~~~---~~~~~l 143 (390) T protein:vir:81 68 GAGGDVQHVSVGDMFVASEQFQASAGRWNDRSARATMNIKAALN-TASTDAAGSAGALTTPNRLPGFITPP---DARLTV 143 (390) T ss_pred ccccccccccchhhhhhhHHHHHHHHHHhhhhhhhhhHHHHHHH-hhccccccCCcceechhhhHHHHHHH---hhhhhh Confidence 0000 000 000111111111000 011110 11122233445556666555444333 333445 Q ss_pred hhhcccchhhHHHhhhhhhhccCcccccccccccCcccccCcceEEEEEEEEEeechhhhhhhhhhhcccccHHHHHHHH Q lcl|NC_019448. 65 YRDISRRPAQSTVVKYDQYLRHGNVGHSRFVKEIGVAPVSDPNIRQKTVSMKYVSDTKNMSIASGLVNNIADPSQILTED 144 (463) Q Consensus 65 ~~~i~k~~~~stv~ey~~~~~hG~~g~~~fv~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lvn~~~Dp~~~~~~~ 144 (463) .+-+...+..+--.+|.++.. ..+...+++|++..+.+++.+......++-++.--.+|.-+ +.++ .+.+....+. T Consensus 144 ~~~~~~~~~~~~~~~~~~~~~--~~~~a~~v~Eg~~~~~~~~~~~~i~~~~~k~~~~~~is~el-l~d~-~~~~~~i~~~ 219 (390) T protein:vir:81 144 RDLIGSGRTDSALIEYVQETG--FVNNAAIVAEGALKPESSLKFAKKTDTTHVIAHTMKATRQI-LSDA-PQLASYMNNR 219 (390) T ss_pred hhhcceeeccCCceEEEEEec--CCcceeeecCCcccccccceeeEEEEeeeEEEEeehhhHHH-HHhH-HHHHHHHHHH Confidence 555555555555555665552 33456789999999999999999999999999888888753 3344 4788888899 Q ss_pred HHHHHHHHHHHHHhhcccccCCCccccccccccceeeecCcceEeccCCCCCHHHHhhhhhhhhhcCCceeEEecCHHHH Q lcl|NC_019448. 145 AIAVVAKTIEWASFYGDASLTSEVEGEGLEFDGLAKLIDKNNVINAKGNQLTEKHLNEAAVRIGKGFGTATDAYMPIGVH 224 (463) Q Consensus 145 ai~~~~~~~E~a~fyGd~~l~~~~~~~gleFDGl~~lI~~~nviDarG~~ls~~~ln~aa~~i~~~~G~~td~~m~~~vk 224 (463) -...++..++.++++|+-. |-++.|+.+......+--.-+...+.+.|..+...+...|...+-++||+.+. T Consensus 220 l~~~~~~~~d~a~l~G~g~--------~~~~~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~ 291 (390) T protein:vir:81 220 LIRGLKVKEDAEILRGTGA--------NDGLLGLIPQATTYAAPTTIAGATRVDQLRLAMLQASLAEYNPSGIVINPIDW 291 (390) T ss_pred HHHHHHHHHHHHHHhcCCC--------CCcccceeecccccccccccccchhHHHHHHHHHhhccccCCCCEEEEcHHHH Confidence 9999999999999999754 12478887765433332233344456667777777788888888899999999 Q ss_pred HHHHHHhcCcceEEeec-CCCC--cccceecCeeeecccccccCCceeccCccccccccccCCCCCCCCeeEEEEeccCC Q lcl|NC_019448. 225 ADFVNSILGRQMQLMQD-NSGN--VNTGYSVNGFYSSRGFIKLHGSTVMENELILDESLQPLPNAPQPAKVTATVETKQK 301 (463) Q Consensus 225 a~f~~~~~~~qrv~~~~-n~g~--~~~G~~v~~~~s~~G~i~l~~s~~~~~d~~l~~~~~~~p~ap~p~~vtat~~~~~~ 301 (463) +.|...-=..-|.+.++ ..+. .-.|++|- .+.. +. .+.+++. |...... -.--..++.. .+.. T Consensus 292 ~~l~~lkd~~G~~l~~~~~~~~~~~l~G~pv~--~~~~--~p-~~~~~~g-----d~~~~~~--~~~~~~~~v~--~~~~ 357 (390) T protein:vir:81 292 AAIELAKDANNQYLIGNARGTLTPTLWGLPVV--ATQA--MA-PGEFLVG-----AFDLAAQ--IFDQWDARVE--IGYV 357 (390) T ss_pred HHHHHhhcCCCceeecCcccccCceecceeeE--EcCC--CC-CCcEEEE-----ehhceEE--EEEecceEEE--Eecc Confidence 98875332222333322 1110 01122210 0000 00 0000000 0000000 0000000000 0000 Q ss_pred CcCcccccccceEEEEEEEecCCccccccceeeeecCCCCceEEEEE Q lcl|NC_019448. 302 GAFEDEEDRAGLSYKVVVNSDDAQSAPSEEVTATVSNVDDGVKLSIS 348 (463) Q Consensus 302 g~~~~~~~~a~ysYkV~a~s~~geS~~S~~vt~Tva~~~~gv~ltIt 348 (463) +..+ . .....|++...-+. -+....+=+.++|. T Consensus 358 ~~~~-~--~~~v~~r~~~r~d~-----------~v~~~~a~v~~t~a 390 (390) T protein:vir:81 358 GEDF-Q--RNMITVLAEERLAL-----------VVYRPEALISGSFA 390 (390) T ss_pred cchh-h--cCcEEEEEEEeecc-----------EEecccceEEEEeC Confidence 0000 0 00011111111111 11112222233332 No 58 >protein:vir:2504 Length: 305 # NCBI annotation: major capsid subunit gp9 # Family: family:all:507 # MgeID: mge:53 # MgeName: TM4 # Cross-refs: genbank:acc:NP_569745;genbank:gi:18496895;genbank:GeneID:932268 Probab=96.78 E-value=0.00015 Score=41.55 Aligned_cols=278 Identities=12% Similarity=0.144 Sum_probs=126.6 Q ss_pred cCCccccCccccchhhhhhHh-hhhhccccccchhhhcccchhhHHHhhhhhhhccCcccccccccccCc-----ccccC Q lcl|NC_019448. 32 ITPDTQIDAGALRREILDDQI-TMLTWTNEDLIFYRDISRRPAQSTVVKYDQYLRHGNVGHSRFVKEIGV-----APVSD 105 (463) Q Consensus 32 ~~p~~q~~gaalr~esLd~~i-~~L~~~~~df~f~~~i~k~~~~stv~ey~~~~~hG~~g~~~fv~E~g~-----~~~~d 105 (463) ....+-.+|+.|--+.+.++| ..+.. .-.+.+.....+..+.-..|.+... .....+++|++. .+.++ T Consensus 1 ma~~t~~~gg~liP~~~~~~Ii~~~~~---~s~l~~l~~~~~~~~~~~~~p~~~~---~~~a~wv~E~~~~~~~~~~~s~ 74 (305) T protein:vir:25 1 MADISRAEVASLIQEAYSDTLLAAAKQ---GSTVLSAFQNVNMGTKTTHLPVLAT---LPEADWVGESATDPKGVKPTSK 74 (305) T ss_pred CCCccCCccceecCHHHHHHHHHHHHh---hchhhhhcceeeccCCcEEEEEEeC---CcceEEeecccccccccccccc Confidence 222233445554444444554 33332 2345555555555544344544442 235678999874 55789 Q ss_pred cceEEEEEEEEEeechhhhhhhhhhhcccccHHHHHHHHHHHHHHHHHHHHHhhcccccCCCcccccccccccee-eecC Q lcl|NC_019448. 106 PNIRQKTVSMKYVSDTKNMSIASGLVNNIADPSQILTEDAIAVVAKTIEWASFYGDASLTSEVEGEGLEFDGLAK-LIDK 184 (463) Q Consensus 106 ~~~~r~~~~~k~l~~~~~vs~~~~lvn~~~Dp~~~~~~~ai~~~~~~~E~a~fyGd~~l~~~~~~~gleFDGl~~-lI~~ 184 (463) +.+.+....++=++.--.+|.-+ +.++..|.+....+.-...+++.+|.++|+|+.+- .|+.=-++.. .... T Consensus 75 ~~f~~i~~~~~k~~~~~~is~el-l~ds~~~~~~~i~~~l~~~~a~~~d~a~~~G~g~~------~~~~~~~~~~~~~~~ 147 (305) T protein:vir:25 75 VTWANRTLVAEEIAVIIPVHENV-IDDATVAVLTEVAELGGQAIGKKLDQAVIFGTDKP------ASWVSPALIPAAVTA 147 (305) T ss_pred cceeeEEeeeEEEEEeehhhHHH-HhcchHHHHHHHHHHHHHHHHHHHhhhheeccCCC------CCccccccccccccc Confidence 99999999998888887777733 23456788999999999999999999999998652 1111111111 1112 Q ss_pred cceEeccCCCCCH----HHHhhhhhhhhhcCCceeEEecCHHHHHHHHHHhcCcceEEeecCCCCcccceecCeeeeccc Q lcl|NC_019448. 185 NNVINAKGNQLTE----KHLNEAAVRIGKGFGTATDAYMPIGVHADFVNSILGRQMQLMQDNSGNVNTGYSVNGFYSSRG 260 (463) Q Consensus 185 ~nviDarG~~ls~----~~ln~aa~~i~~~~G~~td~~m~~~vka~f~~~~~~~qrv~~~~n~g~~~~G~~v~~~~s~~G 260 (463) .+.....+..... +.+.++...+...+..++..+||+...+.+...--...|.+.+++ .-.|+++ +++... T Consensus 148 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~lkd~~G~~i~~~~---~l~G~Pv--~~~~~~ 222 (305) T protein:vir:25 148 GQAVEVVGGVANESDIVGATNRAAKAVASAGWAPDTLLSSLALRYEVANIRDANGNPVFRDD---SFAGFRT--FFNRNG 222 (305) T ss_pred cccccccccchhhhHHHHHHHHHHHhhhhcccccceeEecHHHHHHHHHhhccCCceeecCC---cccccce--EEcCcc Confidence 2333333333332 234555566666777788899999999988653322333343331 2334432 111111 Q ss_pred ccccCC-ceeccC--ccccccccccCCCCCCCCeeE----EEEeccCC-CcCcccccc----cceEEEEEEEecCC---- Q lcl|NC_019448. 261 FIKLHG-STVMEN--ELILDESLQPLPNAPQPAKVT----ATVETKQK-GAFEDEEDR----AGLSYKVVVNSDDA---- 324 (463) Q Consensus 261 ~i~l~~-s~~~~~--d~~l~~~~~~~p~ap~p~~vt----at~~~~~~-g~~~~~~~~----a~ysYkV~a~s~~g---- 324 (463) ...... .+++.+ +.++-. .....+. ++...... ...+ ..+. +...+-+.+.+..+ T Consensus 223 ~~~~~~~~~~~gd~s~~~i~~--------~~~~~i~~~~~~~~~~~~~~~~~~-~~~~~~~R~~~r~~~~v~~p~a~v~~ 293 (305) T protein:vir:25 223 AWDADAAIEVIADSSRVKIGV--------RQDITVKFLDQATLGTGENQINLA-ERDMVALRLKARFAYVLGVSATAQGA 293 (305) T ss_pred CCCCCccEEEEEecceEEEEE--------ecCeEEEEeeeeeeecCCceeeee-ecCcEEEEEEEeecceeeCcccEEEE Confidence 100000 111100 000000 0000000 00000000 0000 0000 00111111111111 Q ss_pred ccccccceeeeecC Q lcl|NC_019448. 325 QSAPSEEVTATVSN 338 (463) Q Consensus 325 eS~~S~~vt~Tva~ 338 (463) ...|.. .++-+. T Consensus 294 ~~~~~~--~~~pa~ 305 (305) T protein:vir:25 294 NKTPVA--VVAPAA 305 (305) T ss_pred cccccc--ccCCCC Confidence 001100 111111 No 59 >protein:vir:9574 Length: 300 # NCBI annotation: gp40 # Family: family:all:966 # MgeID: mge:171 # MgeName: SM1 # Cross-refs: genbank:acc:NP_862879;genbank:gi:32469471;genbank:GeneID:1461316 Probab=96.73 E-value=0.00038 Score=39.35 Aligned_cols=288 Identities=9% Similarity=-0.016 Sum_probs=134.0 Q ss_pred hhcCCccCCccccCccccchhhhhhHhhhhhccccccchhhhcccchhhHHHhhhhhhhccCcccccccccccCcccccC Q lcl|NC_019448. 26 FQTGYGITPDTQIDAGALRREILDDQITMLTWTNEDLIFYRDISRRPAQSTVVKYDQYLRHGNVGHSRFVKEIGVAPVSD 105 (463) Q Consensus 26 ~~agy~~~p~~q~~gaalr~esLd~~i~~L~~~~~df~f~~~i~k~~~~stv~ey~~~~~hG~~g~~~fv~E~g~~~~~d 105 (463) |.. .+..+|.-+..|..++-|..|...+ .+.+-...++..+.--.|.++. +.+...+++|++..+.++ T Consensus 1 ma~------~t~~~G~lip~~~~~~ii~~l~~~s---~i~~l~~~~~~~~~~~~~p~~~---~~~~a~wv~Eg~~~~~s~ 68 (300) T protein:vir:95 1 MSE------AQLSKGNLFNPELVTKVINKVKGHS---SIAKLSPQKPIPFNGQREFVFD---FDSDIDIVAENGKKTHGG 68 (300) T ss_pred Ccc------cccCCcceechhhHHHHHHHHHhhh---hhhhhcceeeccCCceEEEEEe---cCcceEEeeCCccccccc Confidence 222 1223344556665555444443332 1222222233333222455544 224678999999999999 Q ss_pred cceEEEEEEEEEeechhhhhhhhhhhc--ccccHHHHHHHHHHHHHHHHHHHHHhhcccccCCCccccccccccceeeec Q lcl|NC_019448. 106 PNIRQKTVSMKYVSDTKNMSIASGLVN--NIADPSQILTEDAIAVVAKTIEWASFYGDASLTSEVEGEGLEFDGLAKLID 183 (463) Q Consensus 106 ~~~~r~~~~~k~l~~~~~vs~~~~lvn--~~~Dp~~~~~~~ai~~~~~~~E~a~fyGd~~l~~~~~~~gleFDGl~~lI~ 183 (463) +.+.+.....+=++---.+|.-+-..+ ...|.++...++-...+++.++.++|+|+.+-.. .+....|....-. T Consensus 69 ~~f~~v~l~~~k~~~~~~iS~ell~~~~d~~~~l~~~i~~~l~~aia~~~d~~~l~G~~~~~g----~~~~~~~~~~~~~ 144 (300) T protein:vir:95 69 VSLDPVTIVPLKVEYGARVSDEFLHASEEAKVDMLTDFVEGFSKKLARGLDIMSIHGINPRTK----QASTIIGDNCFDK 144 (300) T ss_pred ccceeeEeeeEEEEEeehhhHHHhccCCCCHHHHHHHHHHHHHHHHHHHHHHhhhhcccCCCC----CCccccccccccc Confidence 999999999988888888888765443 3457788888899999999999999999754322 2223333222111 Q ss_pred -CcceEeccCCCCCHHHHhhhhhhhhhcCCceeEEecCHHHHHHHHHHhcCcceEEeecCCCCcccceecCeeeeccccc Q lcl|NC_019448. 184 -KNNVINAKGNQLTEKHLNEAAVRIGKGFGTATDAYMPIGVHADFVNSILGRQMQLMQDNSGNVNTGYSVNGFYSSRGFI 262 (463) Q Consensus 184 -~~nviDarG~~ls~~~ln~aa~~i~~~~G~~td~~m~~~vka~f~~~~~~~qrv~~~~n~g~~~~G~~v~~~~s~~G~i 262 (463) ..++....|. .+-+.|.++..++...++.++-..||+.+.+.+...-=..-|.+.++...+. T Consensus 145 ~~~~~~~~~~~-~~~~~i~~~~~~~~~~~~~~~~~vmn~~~~~~L~~lkd~~G~~i~~~~~~~~---------------- 207 (300) T protein:vir:95 145 KVTQTVPFKDT-NPDESMEDAVGMIDGSERDITGAILDPIFTTALSKMKNAEGGKLYPELAWGG---------------- 207 (300) T ss_pred ccceeeccccc-chHHHHHHHHHHhhhcCCCccEEEECHHHHHHHHHhhccCCCeeccCccccC---------------- Confidence 1233333333 3356778888888888899888999999999887533222222322211100 Q ss_pred ccCCceeccCccccccccccCCCCCCCCeeEEEEeccCCCcCcccc------cccceEEEEEEEecCCccccccceeeee Q lcl|NC_019448. 263 KLHGSTVMENELILDESLQPLPNAPQPAKVTATVETKQKGAFEDEE------DRAGLSYKVVVNSDDAQSAPSEEVTATV 336 (463) Q Consensus 263 ~l~~s~~~~~d~~l~~~~~~~p~ap~p~~vtat~~~~~~g~~~~~~------~~a~ysYkV~a~s~~geS~~S~~vt~Tv 336 (463) .+..++..+....+. .|.... .+....-++|.. .....++++. +++.+- ...+ T Consensus 208 --~~~~l~G~Pv~~s~~------v~~~~~-----~~~~~~~~GDf~~~~~~~~~~~~~~~v~---~~~~~d-----~~~~ 266 (300) T protein:vir:95 208 --VPDAINGLAVDKNRT------VSYSQT-----DPKNTAIVGDFETMFKWGYAKEVPMEII---KYGDPD-----NSGR 266 (300) T ss_pred --CCceecceeeEEecC------CCCCCC-----CCccEEEEeeccceEEEEEecccEEEEe---eccCCC-----Ccch Confidence 011112211111111 000000 000000001110 0111122221 111100 0000 Q ss_pred cC-CCCce--EEEEEecCCCCCCcceEEEEeecCCCc Q lcl|NC_019448. 337 SN-VDDGV--KLSISVNAMYQQQPQFVSIYRQGKETG 370 (463) Q Consensus 337 a~-~~~gv--~ltIt~~a~~g~~~~~y~IYR~~~~~g 370 (463) .- ..+-+ ....-.... =..|+.+......+ | T Consensus 267 ~~f~~~~v~~r~~~r~d~~-v~~~~a~~~l~~~~--g 300 (300) T protein:vir:95 267 DLKGYNQIYIRCEAYIGWG-IMDAASFARIVKTG--G 300 (300) T ss_pred hhhhcCcEEEEEEEeecce-eecccceEEEecCC--C Confidence 00 00001 111111000 01122233332221 2 No 60 >protein:vir:80684 Length: 315 # NCBI annotation: gp6 # Family: family:all:966 # MgeID: mge:1884 # MgeName: PA6 # Cross-refs: genbank:acc:YP_001285582;genbank:gi:148727088;genbank:GeneID:5247055 Probab=96.71 E-value=0.00037 Score=39.39 Aligned_cols=291 Identities=11% Similarity=0.059 Sum_probs=130.3 Q ss_pred hhcCCccCCccccCccccchhhhhhHhhhhhccccccchhhhcccchhhHHHhhhhhhhccCcccccccccccCcccccC Q lcl|NC_019448. 26 FQTGYGITPDTQIDAGALRREILDDQITMLTWTNEDLIFYRDISRRPAQSTVVKYDQYLRHGNVGHSRFVKEIGVAPVSD 105 (463) Q Consensus 26 ~~agy~~~p~~q~~gaalr~esLd~~i~~L~~~~~df~f~~~i~k~~~~stv~ey~~~~~hG~~g~~~fv~E~g~~~~~d 105 (463) |..+ .+..+|..+..|.-++-|..|...+ .+.+-....+..+--.+|.++. +.+...+++|++..+.++ T Consensus 1 Ma~~-----~~~~gg~~vP~~~~~~ii~~l~~~s---~i~~l~~~i~~~~~~~~ip~~~---~~~~a~wv~Eg~~~~~s~ 69 (315) T protein:vir:80 1 MADD-----FLSAGKLELPGSMIGAVRDRAIDSG---VLAKLSPEQPTIFGPVKGAVFS---GVPRAKIVGEGEVKPSAS 69 (315) T ss_pred CCCC-----cCCcCceEcchHHHHHHHHHHHhhc---hhhhhcceeecCCCceEEEEEe---CCcceEEeeCCccccccc Confidence 4332 2344556666666555554444332 2222222223332223344443 334667999999999999 Q ss_pred cceEEEEEEEEEeechhhhhhhhhhhccc---ccHHHHHHHHHHHHHHHHHHHHHhhcccccCCCccccccccccceeee Q lcl|NC_019448. 106 PNIRQKTVSMKYVSDTKNMSIASGLVNNI---ADPSQILTEDAIAVVAKTIEWASFYGDASLTSEVEGEGLEFDGLAKLI 182 (463) Q Consensus 106 ~~~~r~~~~~k~l~~~~~vs~~~~lvn~~---~Dp~~~~~~~ai~~~~~~~E~a~fyGd~~l~~~~~~~gleFDGl~~lI 182 (463) +++.+.+...|=++.--.+|.-+-..+.. ...+....++-...+++.+|.++|+|+..-.. ....|+.+.+ T Consensus 70 ~~f~~v~l~~~kl~~~~~iS~ell~~s~~~~~~~l~~~i~~~la~ai~~~~d~a~~~G~~~~~~------~~~~~~~~~~ 143 (315) T protein:vir:80 70 VDVSAFTAQPIKVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATG------KAASAVHTSL 143 (315) T ss_pred cceeeeEeeeeeEEeeehhhHHHhhcCchhHHHHHHHHHHHHHHHHHHHHHhhheeeccCCCCC------cccccccccc Confidence 99999999999888877787764433332 23577788888899999999999999864322 2456666655 Q ss_pred c-CcceEeccCCCCCHHHHhhhhhhhh-hcCCceeEEecCHHHHHHHHHHhcC------cceEEeecCCC--Ccccceec Q lcl|NC_019448. 183 D-KNNVINAKGNQLTEKHLNEAAVRIG-KGFGTATDAYMPIGVHADFVNSILG------RQMQLMQDNSG--NVNTGYSV 252 (463) Q Consensus 183 ~-~~nviDarG~~ls~~~ln~aa~~i~-~~~G~~td~~m~~~vka~f~~~~~~------~qrv~~~~n~g--~~~~G~~v 252 (463) . ..+.+++-+.. ..+. .++...+. ..+...+-..|++.+...+...-.. .+..+.....+ +.-.|++| T Consensus 144 ~~~~~~~~~~~~~-~~d~-~~~~~~~~~~~~~~~~~~imn~~~~~~L~~l~~~~g~~~~g~~~~~~~~~g~~~tl~G~PV 221 (315) T protein:vir:80 144 NKTKNIVDATDSA-TADL-VKAVGLIAGAGLQVPNGVALDPAFSFALSTEVYPKGSPLAGQPMYPAAGFAGLDNWRGLNV 221 (315) T ss_pred ccccceeeccccc-hHHH-HHHHHHHhhccCccceEEEEcHHHHHHHHHHhhccCCcccccccccccccCCCceecceee Confidence 5 35667765543 2333 34444443 4455555688999999998644211 22211111111 12445553 Q ss_pred C--eeeecccccccCC---ceeccC--cccc-ccccccCCCCCCCCeeEEEEeccCCCcCcccccccceE--EEEEEEec Q lcl|NC_019448. 253 N--GFYSSRGFIKLHG---STVMEN--ELIL-DESLQPLPNAPQPAKVTATVETKQKGAFEDEEDRAGLS--YKVVVNSD 322 (463) Q Consensus 253 ~--~~~s~~G~i~l~~---s~~~~~--d~~l-~~~~~~~p~ap~p~~vtat~~~~~~g~~~~~~~~a~ys--YkV~a~s~ 322 (463) - ..+.... ...+. -.++.+ ..++ .+..... .+.-.+.+++.+...+..+.-.+. .++=..-. T Consensus 222 ~~~~~~~~~~-~~~~~~~~~~~~GDfs~~~~g~~~~~~i-------~i~~~~~~~~~~~~~~~~~~v~~r~~~r~~~~v~ 293 (315) T protein:vir:80 222 GASSTVSGAP-EMSPASGVKAIVGDFSRVHWGFQRNFPI-------ELIEYGDPDQTGRDLKGHNEVMVRAEAVLYVAIE 293 (315) T ss_pred EecCcCCccc-ccccccccEEEEeecccEEEEEecCeeE-------EEeccccccCcccchhhcCcEEEEEEEEecceee Confidence 2 0010000 00000 001100 0000 0000000 000000011111000011111110 01111111 Q ss_pred CCccccccc-eeeeecCCCCce Q lcl|NC_019448. 323 DAQSAPSEE-VTATVSNVDDGV 343 (463) Q Consensus 323 ~geS~~S~~-vt~Tva~~~~gv 343 (463) +.++...-. .++..+....+. T Consensus 294 ~~~a~~~l~~~~a~~~~~~~~~ 315 (315) T protein:vir:80 294 SLDSFAVVKEKAAPKPNPPAEN 315 (315) T ss_pred cccceEEEeeccCCCCCCCCCC Confidence 111111000 011111111121 No 61 >protein:vir:4997 Length: 397 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:109 # MgeName: Sfi21 # Cross-refs: genbank:acc:NP_049971;genbank:gi:9632943;genbank:GeneID:1262106 Probab=96.58 E-value=0.0005 Score=38.70 Aligned_cols=304 Identities=13% Similarity=0.083 Sum_probs=128.8 Q ss_pred CCCCCccchHHHHhhhhhhHHHHHHhhcCC-----ccCCccccCccccchhhhhhHhhhhhccccccchhhhcccchhhH Q lcl|NC_019448. 1 MTIEKNLSDVQQKYADQFQEDVVKSFQTGY-----GITPDTQIDAGALRREILDDQITMLTWTNEDLIFYRDISRRPAQS 75 (463) Q Consensus 1 ~~~~~~~~~~~~~~~k~~~e~~~Ks~~agy-----~~~p~~q~~gaalr~esLd~~i~~L~~~~~df~f~~~i~k~~~~s 75 (463) -+...+.........+.| .+.+..+- ..+..+..+|+.+--+.+..+|..+.. +.-.+++.+...++.+ T Consensus 77 ~~~~~~~~~~~~~~~~~~----~~~l~~~~~~~~~~~~~~t~~~gg~~iP~~~~~~ii~~~~--~~~~l~~~~~~~~~~~ 150 (397) T protein:vir:49 77 KPLTKNEEEVKANFVKDF----KNLVRGRYQNLLDSKTDGSGSDAGLTIPQDIRTAINTLVR--QFDSLQEYVNVENVTT 150 (397) T ss_pred ccccchhhHHHHHHHHHH----HHHhhcchhhHHHhhhccCCccCcceecHHHHHHHHHHHH--hhhhHhhhcceeeccC Confidence 111111111111111111 11111110 011122334555544444455533322 2233444444444443 Q ss_pred HHhh--hhhhhccCcccccccccccCccc-ccCcceEEEEEEEEEeechhhhhhhhhhhcccccHHHHHHHHHHHHHHHH Q lcl|NC_019448. 76 TVVK--YDQYLRHGNVGHSRFVKEIGVAP-VSDPNIRQKTVSMKYVSDTKNMSIASGLVNNIADPSQILTEDAIAVVAKT 152 (463) Q Consensus 76 tv~e--y~~~~~hG~~g~~~fv~E~g~~~-~~d~~~~r~~~~~k~l~~~~~vs~~~~lvn~~~Dp~~~~~~~ai~~~~~~ 152 (463) ---. |.+.. +..+...+++|++..+ ...+.+...+..++-++.-..+|.-+- .++..|.+....+....++++. T Consensus 151 ~~~~~~~~~~~--~~~~~a~~v~E~~~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell-~ds~~~l~~~i~~~l~~~~~~~ 227 (397) T protein:vir:49 151 LTGSRVYEKWA--DITGLAKLDDEGGQIGQNDDPKLSLIRYAIKRYAGISTVTNSLL-ADSAENILAWLSGWIAKKVVVT 227 (397) T ss_pred CcceEEEEeec--cCCcceeeeccccccccccccceeeeEeeeeeeEeehhhHHHHH-hhhhHHHHHHHHHHHHHHHHHH Confidence 2222 22222 2224567999999755 556899999999999998888876432 3455678889999999999999 Q ss_pred HHHHHhhcccccCCCccccccccccceeeecCcceEeccCCCCCHHHHhhhhhhhhhcCCceeEEecCHHHHHHHHHHhc Q lcl|NC_019448. 153 IEWASFYGDASLTSEVEGEGLEFDGLAKLIDKNNVINAKGNQLTEKHLNEAAVRIGKGFGTATDAYMPIGVHADFVNSIL 232 (463) Q Consensus 153 ~E~a~fyGd~~l~~~~~~~gleFDGl~~lI~~~nviDarG~~ls~~~ln~aa~~i~~~~G~~td~~m~~~vka~f~~~~~ 232 (463) ++.++++|+..-.+. +..++-+.|.++...+...|......+|++.+.+.+...== T Consensus 228 ~d~ail~G~g~~~~~------------------------~~~~~~d~i~~~~~~l~~~~~~~a~~v~n~~~~~~l~~lkd 283 (397) T protein:vir:49 228 RNKAILEAIGTLPNK------------------------PTLAKWDDIIDLQAKVDPAIKQTSLFLTNTSGFTALKKVKN 283 (397) T ss_pred HHHHHHhcccccccc------------------------ccccCHHHHHHHHHhhhhhhcCCCEEEEcHHHHHHHHHhhc Confidence 999999998764431 12233455666666677788888899999999998875321 Q ss_pred CcceEEeecCCCCc----ccceecCeeeecc-cccccC-CceeccC---ccccccccccCCCCCCCCeeEEEEeccCCCc Q lcl|NC_019448. 233 GRQMQLMQDNSGNV----NTGYSVNGFYSSR-GFIKLH-GSTVMEN---ELILDESLQPLPNAPQPAKVTATVETKQKGA 303 (463) Q Consensus 233 ~~qrv~~~~n~g~~----~~G~~v~~~~s~~-G~i~l~-~s~~~~~---d~~l~~~~~~~p~ap~p~~vtat~~~~~~g~ 303 (463) ..-|.+.+++..+. -.|++|--.-+.. ...... ...++.+ -.++.. -..++....+..... T Consensus 284 ~~g~~l~~~~~~~g~~~~l~G~pV~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~----------~~~~~i~~~~~~~~~ 353 (397) T protein:vir:49 284 AMGDYLMERDVKSPTGYSIDGFVVKEISDRFLPNGTGGAMPLYFGDLKQAVTLFD----------RQHLSLLSTNIGGGA 353 (397) T ss_pred cCCceeecccccCCCCceecceeeEEecccccccccCCceeEEEeeccceEEEEe----------ecccEEEEeccccch Confidence 22233443322211 1222211000000 000000 0000000 000000 000001110100000 Q ss_pred CcccccccceEEEEEEEecCCccccccceeeeecCCCCceEEEEEecCCCCCCcceEEEEeecCCCce Q lcl|NC_019448. 304 FEDEEDRAGLSYKVVVNSDDAQSAPSEEVTATVSNVDDGVKLSISVNAMYQQQPQFVSIYRQGKETGM 371 (463) Q Consensus 304 ~~~~~~~a~ysYkV~a~s~~geS~~S~~vt~Tva~~~~gv~ltIt~~a~~g~~~~~y~IYR~~~~~g~ 371 (463) |. ....-|++...-+. -+....+-+.++++-.+ ..+| .+..+|- T Consensus 354 ~~----~~~~~~~~~~r~d~-----------~~~~~~a~~~~~~~~~~--~~~~-------~~~~~~~ 397 (397) T protein:vir:49 354 FE----TDTTKVRVIDRFDV-----------VSTDTEAFVPASFKAIA--DQKA-------KLSTAGA 397 (397) T ss_pred hh----cCeeeEEEEEeecc-----------EEecccceEEEEecccc--cccC-------cccccCC Confidence 00 00111222211111 11111122233332111 1111 0011111 No 62 >protein:vir:9410 Length: 415 # NCBI annotation: head protein # Family: family:all:21 # MgeID: mge:167 # MgeName: phi 13 # Cross-refs: genbank:acc:NP_803388;genbank:gi:29028700;genbank:GeneID:1258136 Probab=96.24 E-value=0.00083 Score=37.48 Aligned_cols=294 Identities=12% Similarity=0.011 Sum_probs=134.8 Q ss_pred CC--CCCccchHHH------Hhhh-----hhhHHHHHHhh----cCCccCC--ccccCccccchhhhhhHhhhhhccccc Q lcl|NC_019448. 1 MT--IEKNLSDVQQ------KYAD-----QFQEDVVKSFQ----TGYGITP--DTQIDAGALRREILDDQITMLTWTNED 61 (463) Q Consensus 1 ~~--~~~~~~~~~~------~~~k-----~~~e~~~Ks~~----agy~~~p--~~q~~gaalr~esLd~~i~~L~~~~~d 61 (463) .. ...+...... ...+ .....-.++|. .+..... .+..+|+.+--+.+..+|..+.. +. T Consensus 71 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~g~~~iP~~~~~~ii~~~~--~~ 148 (415) T protein:vir:94 71 NQQSVEVNEASTYRNQANINDLGISIQNTKVTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKE--VE 148 (415) T ss_pred ccccccccchhhHHHHHHHHHHHhhhhhhhhhHHHHHHHHHHhhhhhhhhhhccccccccccCcHHHHHHHHHHHH--hh Confidence 00 0011000000 0000 01111123322 1111111 11223444434444444533322 22 Q ss_pred cchhhhcccchhhHHHhhhhhhhccCcccccccccccCccc-ccCcceEEEEEEEEEeechhhhhhhhhhhcccccHHHH Q lcl|NC_019448. 62 LIFYRDISRRPAQSTVVKYDQYLRHGNVGHSRFVKEIGVAP-VSDPNIRQKTVSMKYVSDTKNMSIASGLVNNIADPSQI 140 (463) Q Consensus 62 f~f~~~i~k~~~~stv~ey~~~~~hG~~g~~~fv~E~g~~~-~~d~~~~r~~~~~k~l~~~~~vs~~~~lvn~~~Dp~~~ 140 (463) ..+.+-+...++.+--..|...... +.+...+++|++..+ .+++.+.+....++-++.-..+|.-+ +.++..|.+.. T Consensus 149 ~~l~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~v~Eg~~~~~~~~~~~~~i~~~~~k~~~~~~is~el-l~ds~~~~~~~ 226 (415) T protein:vir:94 149 FNLDKYVTVKRVTNGSGKYPVVRQS-EVAALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREA-IEDAKVNVLQE 226 (415) T ss_pred hhhhhhcceeeccCCceeEEEEeec-CCccceeccccccccccccccceeeEeeheeeeeechhhHHH-HhhchHHHHHH Confidence 3344445555555444445444333 334577999998876 67799999999999999887777653 34456688888 Q ss_pred HHHHHHHHHHHHHHHHHhhcccccCCCccccccccccceeeecCcceEeccCCCCCHHHHhhhhhhhhhcCCceeEEecC Q lcl|NC_019448. 141 LTEDAIAVVAKTIEWASFYGDASLTSEVEGEGLEFDGLAKLIDKNNVINAKGNQLTEKHLNEAAVRIGKGFGTATDAYMP 220 (463) Q Consensus 141 ~~~~ai~~~~~~~E~a~fyGd~~l~~~~~~~gleFDGl~~lI~~~nviDarG~~ls~~~ln~aa~~i~~~~G~~td~~m~ 220 (463) ..+.-...++..++.+++.|+..-.+.+ ++.......+...+.+. .+.+.|.++-..+...|...+-.+|| T Consensus 227 i~~~l~~~~~~~~~~~il~g~g~g~~~~--------~~~~~~~~~~~~~~~~~-~~~~~i~~~~~~~~~~~~~~~~~vmn 297 (415) T protein:vir:94 227 LKLWMARTIAATRNKAIIDVITKGSTGS--------TSSGFEKEGKKLEVKKA-KSLDDIKDAINLNVKPNYEHNVAIVS 297 (415) T ss_pred HHHHHHHHHHHHHHHHHhhccccCcccc--------ccccccccccccccccc-cchHHHHHHHHhhhhhccCCCEEEEc Confidence 8999999999999999999977533311 11111222233333333 34455555555566677667789999 Q ss_pred HHHHHHHHHHhcCcceEEeecCCCC----cccceecCeeee-cccccccCCceecc--Cc-ccc--------ccc----- Q lcl|NC_019448. 221 IGVHADFVNSILGRQMQLMQDNSGN----VNTGYSVNGFYS-SRGFIKLHGSTVME--NE-LIL--------DES----- 279 (463) Q Consensus 221 ~~vka~f~~~~~~~qrv~~~~n~g~----~~~G~~v~~~~s-~~G~i~l~~s~~~~--~d-~~l--------~~~----- 279 (463) +.+.+.|...--..-|.+.+++..+ .-.|++|--.-. .-|... ...+++. .+ .++ ... T Consensus 298 ~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~-~~~i~~gd~~~~~~~~~~~~~~v~~~~~~~~ 376 (415) T protein:vir:94 298 QTMFAKLDKMKDKLGNYLIQPDVKEKTQQRLLGAKIEILPDEVLGQKG-NNTLIIGNLKDAIVLFDRSQYQASWTDYMHF 376 (415) T ss_pred HHHHHHHHHhhccCCCeeeccCcCCCCCceecceeeEEecccccCCCC-ccEEEEEehhccEEEEeecceEEEEeccccC Confidence 9999998753222223344433221 234554311000 000000 0001110 01 010 000 Q ss_pred ----------cccCCCCCCCCeeEEEEeccCCCcCcccc Q lcl|NC_019448. 280 ----------LQPLPNAPQPAKVTATVETKQKGAFEDEE 308 (463) Q Consensus 280 ----------~~~~p~ap~p~~vtat~~~~~~g~~~~~~ 308 (463) .....++.+-..++.+.++++.|+++-.. T Consensus 377 ~~~~r~~~r~d~~~~~~~a~~~~~~~~~~~~~~~~~~~~ 415 (415) T protein:vir:94 377 GECLMIAVRQDCRILDYKSAIVIEYDDSERGEGDLGLEA 415 (415) T ss_pred ceEEEEEEEeccEEeccccEEEEEEeccCCCCCccccCC Confidence 00011111112223333333333333111 No 63 >protein:vir:1433 Length: 435 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:30 # MgeName: phiE125 # Cross-refs: genbank:acc:NP_536362;genbank:gi:17975167;genbank:GeneID:929171 Probab=96.22 E-value=0.00086 Score=37.38 Aligned_cols=314 Identities=12% Similarity=0.096 Sum_probs=144.7 Q ss_pred CCCCCccchHHHHhhhhhhHHHHHHhhcC---------------------CccCCccccCccccchhhhhhHhhhhhccc Q lcl|NC_019448. 1 MTIEKNLSDVQQKYADQFQEDVVKSFQTG---------------------YGITPDTQIDAGALRREILDDQITMLTWTN 59 (463) Q Consensus 1 ~~~~~~~~~~~~~~~k~~~e~~~Ks~~ag---------------------y~~~p~~q~~gaalr~esLd~~i~~L~~~~ 59 (463) -+...+.......-. .+ ..+.|++..+ ...+-.+...||.|--+.+..+|..+... T Consensus 82 ~~~~~~~~~~~~~~~-~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~~~gg~~vP~~~~~~ii~~l~~- 158 (435) T protein:vir:14 82 APVHAQPKALEVKGA-KM-ARMVRALAAARGDAQLASKLAIERGFGEEVAMSLNTLSPGAGGVLVPENLSSEVIELLRP- 158 (435) T ss_pred cccccccchhhhhHH-HH-HHHHHHHHhhcchhhHHHHHHHhhhhhhhhhhhcccCCcCCCccccchhHHHHHHHHHhh- Confidence 001110000000000 00 1111211111 01111222334544444444554333322 Q ss_pred cccchhhhcccch--hhHHHhhhhhhhccCcccccccccccCcccccCcceEEEEEEEEEeechhhhhhhhhhhccccc- Q lcl|NC_019448. 60 EDLIFYRDISRRP--AQSTVVKYDQYLRHGNVGHSRFVKEIGVAPVSDPNIRQKTVSMKYVSDTKNMSIASGLVNNIAD- 136 (463) Q Consensus 60 ~df~f~~~i~k~~--~~stv~ey~~~~~hG~~g~~~fv~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lvn~~~D- 136 (463) ...+..+..+. ..+--.+|.++... +...+++|++..+.+|+.+.+.+..++=++....+|.-+ +.++.-| T Consensus 159 --~~~i~~~~~~~~~~~~~~~~~p~~~~~---~~a~~v~E~~~~~~~~~~f~~i~~~~~k~~~~~~iS~el-l~ds~~~~ 232 (435) T protein:vir:14 159 --KSVVRKLGARTLPLSNGNITIPRLKGG---AIVGYIGADTDIPTTQQQFDDLKLTAKKMAALVPIANDL-IKYAGVNP 232 (435) T ss_pred --hchhhhhcceeeecCCCceEEEEEeCC---cceeeeccCccccccccceeEEEeeeEEEEEeehhhHHH-HHhhccCH Confidence 22222222222 22222345555432 356789999999999999999999999888888887655 3333323 Q ss_pred -HHHHHHHHHHHHHHHHHHHHHhhcccccCCCccccccccccceeeecCcceEeccC-CCCC--HHHHhhhhhhhhhcCC Q lcl|NC_019448. 137 -PSQILTEDAIAVVAKTIEWASFYGDASLTSEVEGEGLEFDGLAKLIDKNNVINAKG-NQLT--EKHLNEAAVRIGKGFG 212 (463) Q Consensus 137 -p~~~~~~~ai~~~~~~~E~a~fyGd~~l~~~~~~~gleFDGl~~lI~~~nviDarG-~~ls--~~~ln~aa~~i~~~~G 212 (463) .+....+.-..++.+.+|.++++|+-.- -+..|+.+...+.++...-+ ...+ ...|.++...+....+ T Consensus 233 ~l~~~i~~~l~~ai~~~~d~a~l~G~G~~--------~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~ 304 (435) T protein:vir:14 233 NVDQIVVGDLTAAIGAREDKAFIRDDGTA--------NTPKGLRFWALPSNVITASDASTLQKIETDLGKVILALENADA 304 (435) T ss_pred HHHHHHHHHHHHHHHHHHHHHhhccCCCC--------ccccceeecccccceeccccccchhhHHHHHHHHHHHhhhccc Confidence 5677888888999999999999997431 25778776655554444322 2222 2334444444444322 Q ss_pred c--eeEEecCHHHHHHHHHHhcCcceEEeecCCCCcccceecC--eeeec-ccccccCCceecc--CccccccccccCCC Q lcl|NC_019448. 213 T--ATDAYMPIGVHADFVNSILGRQMQLMQDNSGNVNTGYSVN--GFYSS-RGFIKLHGSTVME--NELILDESLQPLPN 285 (463) Q Consensus 213 ~--~td~~m~~~vka~f~~~~~~~qrv~~~~n~g~~~~G~~v~--~~~s~-~G~i~l~~s~~~~--~d~~l~~~~~~~p~ 285 (463) . ..-..|++.+.+.+...--..-|.+.+...++.-.|++|- .++-. -|...-.+.+++. .+.++ .. T Consensus 305 ~~~~~~~v~n~~~~~~L~~lkd~~G~~l~~~~~~g~l~G~Pv~~~~~~p~~~~~~~~~~~i~~gd~s~~~i-~~------ 377 (435) T protein:vir:14 305 NLTQPGWIMAPRTFRFLEGLRDGNGNKVYPELANGMLKGYPVGKTTQVPINLGETGKESEIYFTDFGDVFI-GE------ 377 (435) T ss_pred cccCCEEEEcHHHHHHHHHhhccCCceeccCCCCCeeecceeEeeccccccccCCCccceEEEeecccEEE-EE------ Confidence 1 2237899999998875332333445554444444565531 11100 0110001112211 11111 00 Q ss_pred CCCCCeeEEEEeccCC-----CcCcccccccceEEEEEEEecCCccccccceeeeecCCCC Q lcl|NC_019448. 286 APQPAKVTATVETKQK-----GAFEDEEDRAGLSYKVVVNSDDAQSAPSEEVTATVSNVDD 341 (463) Q Consensus 286 ap~p~~vtat~~~~~~-----g~~~~~~~~a~ysYkV~a~s~~geS~~S~~vt~Tva~~~~ 341 (463) -..++....+.+. +...+.-......+++...-+.+---|...+..+-+++++ T Consensus 378 ---~~~~~~~~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~~~~~~a~~~l~~~~~~~ 435 (435) T protein:vir:14 378 ---EETLEIDYSKEATYKDADGHMVSAFQRDQTLIRVIAKNDFGPRHVESIAVLAGVAWGA 435 (435) T ss_pred ---ecccEEEEeccccccccccchhhhhhcChhheeeeeeeCceeecccceEEEecCCCCC Confidence 0011122212111 1111111122345555555555544555566666655655 No 64 >protein:vir:3991 Length: 404 # NCBI annotation: major structural protein # Family: family:all:21 # MgeID: mge:319 # MgeName: BK5-T # Cross-refs: genbank:acc:NP_116499;genbank:gi:14251132;genbank:GeneID:921252 Probab=96.11 E-value=0.001 Score=37.04 Aligned_cols=300 Identities=11% Similarity=0.044 Sum_probs=123.3 Q ss_pred CCCCCccchHHHHhhhhhhHHHHHHhhcCC---------ccCCccccCccccchhhhhhHhhhhhccccccchhhhcccc Q lcl|NC_019448. 1 MTIEKNLSDVQQKYADQFQEDVVKSFQTGY---------GITPDTQIDAGALRREILDDQITMLTWTNEDLIFYRDISRR 71 (463) Q Consensus 1 ~~~~~~~~~~~~~~~k~~~e~~~Ks~~agy---------~~~p~~q~~gaalr~esLd~~i~~L~~~~~df~f~~~i~k~ 71 (463) .+.............+.| .+.+..+. ..+..+..+|+.+--+.+..+|..+... .-.+.+.+... T Consensus 80 ~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~e~~a~~~~t~~~gg~~iP~~~~~~ii~~~~~--~~~l~~~~~~~ 153 (404) T protein:vir:39 80 GPLNKSEYELKDKFVKEF----VNMVRNPMAFLNTVSSKTETSGSDSAAGLTIPQDIRTMINTLVRQ--YDSLQQYVRVE 153 (404) T ss_pred cccccchhhhHHHHHHHH----HHHHhcchhhhhhhhhhhhhcccccCCceeccHHHHHHHHHHHHh--hhhHHhhccee Confidence 111111111111111111 11111110 0112233455555445555555333322 23455555555 Q ss_pred hhhHHHhhhhhhhccCcccccccccccCccc-ccCcceEEEEEEEEEeechhhhhhhhhhhcccccHHHHHHHHHHHHHH Q lcl|NC_019448. 72 PAQSTVVKYDQYLRHGNVGHSRFVKEIGVAP-VSDPNIRQKTVSMKYVSDTKNMSIASGLVNNIADPSQILTEDAIAVVA 150 (463) Q Consensus 72 ~~~stv~ey~~~~~hG~~g~~~fv~E~g~~~-~~d~~~~r~~~~~k~l~~~~~vs~~~~lvn~~~Dp~~~~~~~ai~~~~ 150 (463) +..+-...|.....-+..+...+++|++..+ .+++.+.+....++-++.-..+|.-+= .++..|.+....+.-...+. T Consensus 154 ~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell-~ds~~~l~~~i~~~l~~~~~ 232 (404) T protein:vir:39 154 SVSTSNGSRVYEKWTDVTPLTVMDAEDGKIPDLDNPRLTIIKYLIKRYAGIITATNTLL-KDTAENILAWLSSWIAKKVV 232 (404) T ss_pred eccCCcceEEEEeecCCccceeeecCccccccccccceeeEEeeeeeEEeeehhHHHHH-hhchHHHHHHHHHHHHHHHH Confidence 5544444443222223334567899998864 689999999999999998888876432 33456778888899999999 Q ss_pred HHHHHHHhhcccccCCCccccccccccceeeecCcceEeccCCCCCHHHHhhhhhhhhhcCCceeEEecCHHHHHHHHHH Q lcl|NC_019448. 151 KTIEWASFYGDASLTSEVEGEGLEFDGLAKLIDKNNVINAKGNQLTEKHLNEAAVRIGKGFGTATDAYMPIGVHADFVNS 230 (463) Q Consensus 151 ~~~E~a~fyGd~~l~~~~~~~gleFDGl~~lI~~~nviDarG~~ls~~~ln~aa~~i~~~~G~~td~~m~~~vka~f~~~ 230 (463) ..+|.++++|+..-.+. +.++.+|++..++ ...+..+|....-.+||+.+.+.|... T Consensus 233 ~~~d~~il~g~g~~~~~--~~~~~~~~i~~~~---------------------~~~~~~~~~~~a~~v~n~~~~~~L~~l 289 (404) T protein:vir:39 233 VTRNQAIIAAMGTVPKK--PTIAKFDDVITMI---------------------NTSVDPAIIATSSLLTNQSGLNKLALV 289 (404) T ss_pred HHHHHHHHhcccccccc--cccccHHHHHHHH---------------------HHhhhhhhccCCEEEEcHHHHHHHHHh Confidence 99999999998775432 1122333332221 122333443333478899888888743 Q ss_pred hcCcceEEeecCCCC----cccceecCeeee-cccccccCC-ceeccC--ccccccccccCCCCCCCCeeEEEEeccCCC Q lcl|NC_019448. 231 ILGRQMQLMQDNSGN----VNTGYSVNGFYS-SRGFIKLHG-STVMEN--ELILDESLQPLPNAPQPAKVTATVETKQKG 302 (463) Q Consensus 231 ~~~~qrv~~~~n~g~----~~~G~~v~~~~s-~~G~i~l~~-s~~~~~--d~~l~~~~~~~p~ap~p~~vtat~~~~~~g 302 (463) =-..-|.+.+++... .-.|++|--.-+ .-+...... .+++.+ +..+...+ ..+.......... T Consensus 290 kd~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~---------~~~~i~~~~~~~~ 360 (404) T protein:vir:39 290 KTAEGKYLLEPDPTKPNSYLIKGKKVIVVADRWLPNSGSTVYPLYYGDMSQAITLFDR---------ENMSLLPTNIGAG 360 (404) T ss_pred hccCCceeeccCcCCCCcceecceeEEEecccccCccCCCccEEEEEeccccEEEEee---------cceEEEEeccchh Confidence 222233444433221 223443310000 000000000 000000 00000000 0000111010000 Q ss_pred cCcccccccceEEEEEEE----ecCCccccc---cceeeeecCCCCce Q lcl|NC_019448. 303 AFEDEEDRAGLSYKVVVN----SDDAQSAPS---EEVTATVSNVDDGV 343 (463) Q Consensus 303 ~~~~~~~~a~ysYkV~a~----s~~geS~~S---~~vt~Tva~~~~gv 343 (463) .|. .....|++... -.+-++... ..++......+.|. T Consensus 361 ~~~----~~~~~~r~~~r~d~~~~~~~a~~~~~~~~~a~~~~~~~~~~ 404 (404) T protein:vir:39 361 AFE----TDTTKIRVIDRFDVKTTDSEALVAGSFTAIADQVGNFTAGK 404 (404) T ss_pred hhh----hceeeEEEEeeeccEEecccceEEEEeeccccCCCCCCCCC Confidence 000 00011111100 011111110 00000111112232 No 65 >protein:vir:1638 Length: 298 # NCBI annotation: Structural protein # Family: family:all:966 # MgeID: mge:33 # MgeName: r1t # Cross-refs: genbank:acc:NP_695059;genbank:gi:23455750;genbank:GeneID:955469 Probab=96.09 E-value=0.001 Score=36.99 Aligned_cols=284 Identities=10% Similarity=0.027 Sum_probs=135.8 Q ss_pred ccccCccccchhhhhhHhhhhhccccccchhhhcccchhhHHHhhhhhhhccCcccccccccccCcccccCcceEEEEEE Q lcl|NC_019448. 35 DTQIDAGALRREILDDQITMLTWTNEDLIFYRDISRRPAQSTVVKYDQYLRHGNVGHSRFVKEIGVAPVSDPNIRQKTVS 114 (463) Q Consensus 35 ~~q~~gaalr~esLd~~i~~L~~~~~df~f~~~i~k~~~~stv~ey~~~~~hG~~g~~~fv~E~g~~~~~d~~~~r~~~~ 114 (463) -+-.+|.-+..|...+-+..|. +...+.+-..+.+..+--.+|.++.. .+...+++|++..+.+|+.+.+.... T Consensus 1 ma~~gG~lvp~~~~~~ii~~~~---~~s~i~~l~~~~~~~~~~~~ip~~~~---~~~a~~v~E~~~~~~~~~~f~~v~l~ 74 (298) T protein:vir:16 1 MVLNKGTLFDPTLVTDLISKVA---GKSSIARLSAQKPIPFNGEKVFTFTM---DSEIDVVAESGKKTHGGVTLAPQTMV 74 (298) T ss_pred CcccCcceechhHHHHHHHHHH---hhhhhhhhcceeeccCCceEEEEEec---CcceEEecCCccccccccceeEEEEe Confidence 1112233345555444333332 22345555555666655455666553 34567999999999999999999999 Q ss_pred EEEeechhhhhhhhhhhcc--cccHHHHHHHHHHHHHHHHHHHHHhhcccccCCCccccccccccceeeecC-cceE--e Q lcl|NC_019448. 115 MKYVSDTKNMSIASGLVNN--IADPSQILTEDAIAVVAKTIEWASFYGDASLTSEVEGEGLEFDGLAKLIDK-NNVI--N 189 (463) Q Consensus 115 ~k~l~~~~~vs~~~~lvn~--~~Dp~~~~~~~ai~~~~~~~E~a~fyGd~~l~~~~~~~gleFDGl~~lI~~-~nvi--D 189 (463) .+=++.--.+|.-+-..+. ..|.+....+.-...+++.+|.++|+|...-+ +....+-|+...... .+.. + T Consensus 75 ~~k~a~~~~iS~ell~~s~d~~~~l~~~i~~~la~ai~~~~d~~~l~G~~~~~----g~~~~~~~~~~~~~~~~~~~~~~ 150 (298) T protein:vir:16 75 PIKVEYGARISDEFMYASDEEKINILQEFNDGFAKKVARGIDLMAFHGVNPRL----GTASAVIGTNHFDSKVTQKVEAP 150 (298) T ss_pred eeeEEEeehhhHHHhhcCcccHHHHHHHHHHHHHHHHHHHHHHHhhccccCCC----Ccccccccccccccccccccccc Confidence 9999988888877644333 34677778888999999999999999965422 122234444333331 1211 1 Q ss_pred ccCCCCCHHHHhhhhhhhhhcCCceeEEecCHHHHHHHHHHhcCcceEEeecCCCCcccceecCeeeecccccccCCcee Q lcl|NC_019448. 190 AKGNQLTEKHLNEAAVRIGKGFGTATDAYMPIGVHADFVNSILGRQMQLMQDNSGNVNTGYSVNGFYSSRGFIKLHGSTV 269 (463) Q Consensus 190 arG~~ls~~~ln~aa~~i~~~~G~~td~~m~~~vka~f~~~~~~~qrv~~~~n~g~~~~G~~v~~~~s~~G~i~l~~s~~ 269 (463) ..+..+ ...|..+...+..++..+.-..|++.+.+.+...-=..-|.+.++...... +.++ T Consensus 151 ~~~~~~-~~~i~~~~~~~~~~~~~~~~~vmn~~~~~~l~~lkd~~G~~i~~~~~~~~~------------------~~~l 211 (298) T protein:vir:16 151 RGIADP-NGAIENAVELLTGVDADVTGIAINPSFRSALAKQKDLQDNALFPELKWGAT------------------PDTI 211 (298) T ss_pred cccccH-HHHHHHHHHHhhhcCCCccEEEEcHHHHHHHHHhhccCCCeeecCcccCCC------------------Ccee Confidence 111111 335667777777788888889999999999875322223334433221111 1122 Q ss_pred ccCccccccccccCCCCCCCCeeEEEEeccCCCcCcccc------cccceEEEEEEEecCCccccccceeeeecCCC-Cc Q lcl|NC_019448. 270 MENELILDESLQPLPNAPQPAKVTATVETKQKGAFEDEE------DRAGLSYKVVVNSDDAQSAPSEEVTATVSNVD-DG 342 (463) Q Consensus 270 ~~~d~~l~~~~~~~p~ap~p~~vtat~~~~~~g~~~~~~------~~a~ysYkV~a~s~~geS~~S~~vt~Tva~~~-~g 342 (463) +..+-+.... .|..... +....-++|.+ .....++++. +++.+-- ..+.... +- T Consensus 212 ~G~PV~~~~~---v~~~~~~--------~~~~~~~GDfs~~~~~~~~~~~~~~~~---~~~~~~~-----~~~~~f~~~~ 272 (298) T protein:vir:16 212 NGLPVDVNKT---VSDMSLT--------QRDRAIIGDFANGFKWGYAKEVPLEVI---QYGDPDN-----SGLDLKGYNQ 272 (298) T ss_pred cceeeEEecc---cccccCC--------CccEEEEeeccceEEEEEecCceEEEe---eccCCcC-----cchhhhhcCc Confidence 2222211111 0000000 00001111111 1112222222 2221100 0000000 11 Q ss_pred e--EEEEEecCCCCCCcceEEEEeecC Q lcl|NC_019448. 343 V--KLSISVNAMYQQQPQFVSIYRQGK 367 (463) Q Consensus 343 v--~ltIt~~a~~g~~~~~y~IYR~~~ 367 (463) + ....-+... =..|+-+...+... T Consensus 273 v~~ra~~r~d~~-v~~~~a~~~l~~at 298 (298) T protein:vir:16 273 VYIRAELFLGWG-ILDATKFARVTEAN 298 (298) T ss_pred EEEEEEEEEccE-eecccceEEEeecC Confidence 1 111111111 01111222222222 No 66 >protein:vir:102119 Length: 404 # NCBI annotation: phage major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1641 # MgeName: phiSM101 # Cross-refs: genbank:acc:YP_699941;genbank:gi:110804052;genbank:GeneID:4206662 Probab=96.07 E-value=0.001 Score=36.94 Aligned_cols=309 Identities=11% Similarity=0.054 Sum_probs=134.9 Q ss_pred CCCCCccch---HHHHhhhhhhHHHHHHhhcC-Cc--------cCCccccCccccchhhhhhHhhhhhccccccchhhhc Q lcl|NC_019448. 1 MTIEKNLSD---VQQKYADQFQEDVVKSFQTG-YG--------ITPDTQIDAGALRREILDDQITMLTWTNEDLIFYRDI 68 (463) Q Consensus 1 ~~~~~~~~~---~~~~~~k~~~e~~~Ks~~ag-y~--------~~p~~q~~gaalr~esLd~~i~~L~~~~~df~f~~~i 68 (463) .....+... ....+.+.+-+..+|..... .. .+..+..+|+.+--+.+..+|-.+.. +...+++.+ T Consensus 67 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~a~~~~~~~~gg~~vP~~~~~~ii~~~~--~~~~l~~l~ 144 (404) T protein:vir:10 67 SLNTGKEENVIYNGALFVRAIADNLLKQKNQRGLNLSEKEINAISENIDEDGGYAVPEDIQTKINTRLK--DTTDLYNMV 144 (404) T ss_pred ccccccchhhHHHHHHHHHHHHHHHHHHHHhhhhcchhhHHhhhccccCCCCceeechhHHHHHHHHHh--hhhhHhhhh Confidence 111111111 11233333334444433221 11 11112234555544444455533332 333456666 Q ss_pred ccchhhHHHhh--hhhhhccCcccccccccccCccccc--CcceEEEEEEEEEeechhhhhhhhhhhcccccHHHHHHHH Q lcl|NC_019448. 69 SRRPAQSTVVK--YDQYLRHGNVGHSRFVKEIGVAPVS--DPNIRQKTVSMKYVSDTKNMSIASGLVNNIADPSQILTED 144 (463) Q Consensus 69 ~k~~~~stv~e--y~~~~~hG~~g~~~fv~E~g~~~~~--d~~~~r~~~~~k~l~~~~~vs~~~~lvn~~~Dp~~~~~~~ 144 (463) ...++...... |.+.. +.....++.|++..+.+ ++.+.+.....+=++.--.+|.-+ +.++..+.+....+. T Consensus 145 ~~~~~~~~~g~~~~~~~~---~~~~~~~v~e~~~~~~~~~~~~f~~i~~~~~k~~~~~~iS~el-l~ds~~~l~~~i~~~ 220 (404) T protein:vir:10 145 DYEPVFTRSGSRTYEKRS---KQKPMKPLSENQQIPTNGDNGKLERFNFKLKDLADFMSIPNDL-LKFADKSLEDWIINW 220 (404) T ss_pred ceeeccCCccceEEEEec---CCcceeeccccccccccccccceeeeEeeheeeEeeehhhHHH-HhhcHHHHHHHHHHH Confidence 66555443333 33322 23356789999886653 688999999998888877787743 344455778888889 Q ss_pred HHHHHHHHHHHHHhhcccccCCCccccccccccceeeecCcceEeccCCCCCHHHHhhhhh-hhhhcCCceeEEecCHHH Q lcl|NC_019448. 145 AIAVVAKTIEWASFYGDASLTSEVEGEGLEFDGLAKLIDKNNVINAKGNQLTEKHLNEAAV-RIGKGFGTATDAYMPIGV 223 (463) Q Consensus 145 ai~~~~~~~E~a~fyGd~~l~~~~~~~gleFDGl~~lI~~~nviDarG~~ls~~~ln~aa~-~i~~~~G~~td~~m~~~v 223 (463) -...+...+|.++++|+-.= ..+.|+.+.-.. +.+-.. ...+.+.|..+.. .+-.+|...--++|++.+ T Consensus 221 la~~~~~~~~~~il~G~g~~--------~~~~gi~~~~~~-~~~~~~-~~~~~~~~~~~~~~~l~~~~~~~~~~v~n~~~ 290 (404) T protein:vir:10 221 FVDKVRITRNAEILYGAGGD--------EHATGIMTANKF-KKITLP-KSPALKDFKKCKNVELLNVFKATSSWIVNQDG 290 (404) T ss_pred HHHHHHHHHHHHHhhcCCCC--------Ccccceeecccc-ceeecc-ccccHHHHHHHHHhhhhccccCCCEEEEcHHH Confidence 99999999999999997541 245666553332 222232 3345555554433 444556443347899999 Q ss_pred HHHHHHHhcCcceEEeecCCC----CcccceecCeeeecccccccC-Cceecc--CccccccccccCCCCCCCCeeEEEE Q lcl|NC_019448. 224 HADFVNSILGRQMQLMQDNSG----NVNTGYSVNGFYSSRGFIKLH-GSTVME--NELILDESLQPLPNAPQPAKVTATV 296 (463) Q Consensus 224 ka~f~~~~~~~qrv~~~~n~g----~~~~G~~v~~~~s~~G~i~l~-~s~~~~--~d~~l~~~~~~~p~ap~p~~vtat~ 296 (463) .+.|...==..-|.+.+++.+ +.-.|++|--+-+.--..... ..+++. .+...... . ..++... T Consensus 291 ~~~L~~lkd~~G~~l~~~~~~~~~~~~l~G~PV~~~~~~~~~~~~~~~~~~~gd~s~~~~~~~-----~----~~~~i~~ 361 (404) T protein:vir:10 291 FNYLDSLEDKTGRPYLQPDPKDPTQYRFLGLPVIELPNDLLLSTESAIPVLLGDTKEAYKYVS-----D----GAYELAT 361 (404) T ss_pred HHHHHHhhccCCceeeccCcCCCCCccccceeeEEecccccCCCCCccEEEEEeccccEEEEE-----e----cceEEEE Confidence 998875321233445544322 223455542111100000000 001111 00000000 0 0111111 Q ss_pred eccCCCcCcccccccceEEEEE----EEecCCccccc-cceeeeecC Q lcl|NC_019448. 297 ETKQKGAFEDEEDRAGLSYKVV----VNSDDAQSAPS-EEVTATVSN 338 (463) Q Consensus 297 ~~~~~g~~~~~~~~a~ysYkV~----a~s~~geS~~S-~~vt~Tva~ 338 (463) .......|. .+ ...|++. ..-.+.+.... ...+++.++ T Consensus 362 ~~~~~~~~~--~~--~~~~~~~~r~d~~v~~~~a~~~~~~~~aa~~~ 404 (404) T protein:vir:10 362 TNIGAGAFE--TN--TTKARIIMRIDGNVKDSEALLIAEIPVESVQA 404 (404) T ss_pred eccccchhh--cC--ceEEEEEEeeccEEecccceEEEEeecccCCC Confidence 111100000 00 1111111 00111111110 000111111 No 67 >protein:vir:81160 Length: 371 # NCBI annotation: major capsid protein # Family: family:all:21 # MgeID: mge:1892 # MgeName: Geobacillus virus E2 # Cross-refs: genbank:acc:YP_001285811;genbank:gi:148747732;genbank:GeneID:5247203 Probab=96.06 E-value=0.0011 Score=36.89 Aligned_cols=291 Identities=10% Similarity=0.009 Sum_probs=138.0 Q ss_pred CCCCCccc---h--HHHHhhhhhhHHHHHHhhcCCccCCccccCccccchhhhhhHhhhhhccccccchhhhcccchhhH Q lcl|NC_019448. 1 MTIEKNLS---D--VQQKYADQFQEDVVKSFQTGYGITPDTQIDAGALRREILDDQITMLTWTNEDLIFYRDISRRPAQS 75 (463) Q Consensus 1 ~~~~~~~~---~--~~~~~~k~~~e~~~Ks~~agy~~~p~~q~~gaalr~esLd~~i~~L~~~~~df~f~~~i~k~~~~s 75 (463) ...+.... . ...++++.+.....+++++|. +..+|..+..+. .++| +..-.+...+++.+...++.+ T Consensus 61 ~~~~~~~~~~~~~~~~~~~~~~l~~~~~~a~~~~t-----~~~gg~~vP~~~-~~~i--i~~~~~~s~i~~~~~~~~~~~ 132 (371) T protein:vir:81 61 DKEPLKPTVQVKENEVEAFVNHIRTRFRNAMSEGS-----NQDGGYTVPQDI-QTRI--NELRESKDALQNLITVEPVTT 132 (371) T ss_pred cccccccchhhHHHHHHHHHHHHHHHHHHhhccCC-----CccCceeecHhH-HHHH--HHHHHhhhhhhhhceeeeccC Confidence 11111111 1 113555555555667777654 223344444444 4444 222233334666666666665 Q ss_pred HHhhhhhhhccCcccccccccccCcc-cccCcceEEEEEEEEEeechhhhhhhhhhhcccccHHHHHHHHHHHHHHHHHH Q lcl|NC_019448. 76 TVVKYDQYLRHGNVGHSRFVKEIGVA-PVSDPNIRQKTVSMKYVSDTKNMSIASGLVNNIADPSQILTEDAIAVVAKTIE 154 (463) Q Consensus 76 tv~ey~~~~~hG~~g~~~fv~E~g~~-~~~d~~~~r~~~~~k~l~~~~~vs~~~~lvn~~~Dp~~~~~~~ai~~~~~~~E 154 (463) ...+|......++ +...+++|++.. +.+++++.+.+...+-++....+|.-+ +.++.-|.+....+.-...++..++ T Consensus 133 ~~~~~~~~~~~~~-~~a~~v~Eg~~~~~~~~~~f~~i~~~~~k~~~~~~iS~el-l~ds~~~l~~~i~~~l~~a~~~~~~ 210 (371) T protein:vir:81 133 LSGSRVFKKRSQQ-TGFVEVAEGAAIGEKATPQFTLLQYQVKKYAGFFRVTNEL-LNDSTEAIVNTLVRWIGDESRVTRN 210 (371) T ss_pred CceeEEEEeecCC-cceeeeccccccccccccceeeEEeeeeEEEEeehhhHHH-HhhhhHHHHHHHHHHHHHHHHHHHH Confidence 5555544443333 356789999875 578999999999999999888888764 3444557788888888889999999 Q ss_pred HHHhhcccccCCCccccccccccceeeecCcceEeccCCCCCHHHHhhhh-hhhhhcCCceeEEecCHHHHHHHHHHhcC Q lcl|NC_019448. 155 WASFYGDASLTSEVEGEGLEFDGLAKLIDKNNVINAKGNQLTEKHLNEAA-VRIGKGFGTATDAYMPIGVHADFVNSILG 233 (463) Q Consensus 155 ~a~fyGd~~l~~~~~~~gleFDGl~~lI~~~nviDarG~~ls~~~ln~aa-~~i~~~~G~~td~~m~~~vka~f~~~~~~ 233 (463) .+++.|+..-.+.+. +..| .|..+. ..+-..|....-.+|++.+.+.+...--. T Consensus 211 ~~i~~g~g~~~~~~~---~~~~----------------------~i~~~~~~~l~~~~~~~a~~vmn~~~~~~L~~lkd~ 265 (371) T protein:vir:81 211 GLIINVLNTKAKTAI---ADLD----------------------GLKQIINVQLDPVFRSTSSVIVNQDAFNWLDTLKDQ 265 (371) T ss_pred HHHHhhccccccccc---ccHH----------------------HHHHHHHhhcchhhhcCCEEEEcHHHHHHHHHhhcc Confidence 999999987554221 1222 222222 22333454445689999999988753222 Q ss_pred cceEEeecCCCC----cccceecCeeeecc---cc-----cccCCc-eeccCccccccccccCCCCCCCCeeEEEEeccC Q lcl|NC_019448. 234 RQMQLMQDNSGN----VNTGYSVNGFYSSR---GF-----IKLHGS-TVMENELILDESLQPLPNAPQPAKVTATVETKQ 300 (463) Q Consensus 234 ~qrv~~~~n~g~----~~~G~~v~~~~s~~---G~-----i~l~~s-~~~~~d~~l~~~~~~~p~ap~p~~vtat~~~~~ 300 (463) .-|.+.+++..+ .-.|++| +.+.. |. +..+.. +++. |.- ...+. .....++....... T Consensus 266 ~g~~l~~~~~~~~~~~~l~G~pV--~~~~~~~~~~~~~~~~~~~~~~i~~G-d~~--~~~~~----~~~~~~~i~~~~~~ 336 (371) T protein:vir:81 266 NGQYLLQPSISSPTGRQLLGLPV--VIVSNKVLANRVDGGTGAQFAPIIVG-DLK--EAVVM----FDRQRTEIMSSNVA 336 (371) T ss_pred CCCeeeecccCCCCCceecceeE--EEecccccCccccccccCCcceEEEE-ehh--ceEEE----EeecceEEEEeccc Confidence 223444433322 1223332 11111 00 000000 0111 000 00000 00001111111111 Q ss_pred CCcCcccccccceEEEEEEEecCCccccccceeeeecCCCCceEEEEEecC Q lcl|NC_019448. 301 KGAFEDEEDRAGLSYKVVVNSDDAQSAPSEEVTATVSNVDDGVKLSISVNA 351 (463) Q Consensus 301 ~g~~~~~~~~a~ysYkV~a~s~~geS~~S~~vt~Tva~~~~gv~ltIt~~a 351 (463) .-.| ......|++...-+.+ +...+.-+.++++. + T Consensus 337 ~~~f----~~~~v~~~~~~r~d~~-----------~~~~~a~~~~~~~~-A 371 (371) T protein:vir:81 337 MDAF----ETDATLWRAIERMDVK-----------MRDDEAFVFGEVQL-A 371 (371) T ss_pred cchh----hcCceEEEEEEeeccE-----------EecccceEEEEEec-C Confidence 1000 0112233333222222 11222223333332 1 No 68 >protein:vir:99920 Length: 311 # NCBI annotation: gp7 # Family: family:all:966 # MgeID: mge:1611 # MgeName: Halo # Cross-refs: genbank:acc:YP_655524;genbank:gi:109392294;genbank:GeneID:4157089 Probab=96.05 E-value=0.00085 Score=37.42 Aligned_cols=281 Identities=11% Similarity=0.042 Sum_probs=136.1 Q ss_pred hhcCCccCCccccCccccchhhhhhHhhhhhccccccchhhhcccchhhHHHhhhhhhhccCcccccccccccCcccccC Q lcl|NC_019448. 26 FQTGYGITPDTQIDAGALRREILDDQITMLTWTNEDLIFYRDISRRPAQSTVVKYDQYLRHGNVGHSRFVKEIGVAPVSD 105 (463) Q Consensus 26 ~~agy~~~p~~q~~gaalr~esLd~~i~~L~~~~~df~f~~~i~k~~~~stv~ey~~~~~hG~~g~~~fv~E~g~~~~~d 105 (463) |.+. +-.+|+.+..|..++-+..+... ..+.+-..+.+..+--.+|.++.+ .....+++|++..+.++ T Consensus 1 Mat~------tt~~g~~vP~~~~~~ii~~~~~~---s~l~~~~~~i~~~~~~~~~p~~~~---~~~a~wv~Eg~~~~~~~ 68 (311) T protein:vir:99 1 MATF------GTGNLKNLPRNIADGMVKDVVQG---STVAVLSARKPQRFGNEDIITFNG---RPKAEFVGEGQQKSSTT 68 (311) T ss_pred Ccee------cCCCceeccHHHHHHHHHHHHhh---chhhhhcceeeccCCceEEEEEeC---CceeEEeecCccccccc Confidence 3321 11234556656554444433322 234444444445443335555543 24577999999999999 Q ss_pred cceEEEEEEEEEeechhhhhhhhhhhc--ccccHHHHHHHHHHHHHHHHHHHHHhhcccccCCCccccccccccceeeec Q lcl|NC_019448. 106 PNIRQKTVSMKYVSDTKNMSIASGLVN--NIADPSQILTEDAIAVVAKTIEWASFYGDASLTSEVEGEGLEFDGLAKLID 183 (463) Q Consensus 106 ~~~~r~~~~~k~l~~~~~vs~~~~lvn--~~~Dp~~~~~~~ai~~~~~~~E~a~fyGd~~l~~~~~~~gleFDGl~~lI~ 183 (463) +++.......|=++.--.+|.-+-..+ +..|.+....+.-...+++.+|.++|+|+.+-. |...-|+.+.+. T Consensus 69 ~~f~~v~l~~~k~~~~~~iS~ell~~~~d~~~~l~~~i~~~la~ai~~~~d~~~l~G~g~~~------g~~~~g~~~~~~ 142 (311) T protein:vir:99 69 GEFDFVTSTPKKAQVTMRFNEEVQWADEDYQLGVLQTLSEAGAEALARALDLGLYHRINPLT------GTVIPGWSNYLG 142 (311) T ss_pred ceeeEEEEeeEEEEEeehhhHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHhhcccCccc------Cccccccccccc Confidence 999999999988888777777663332 345778999999999999999999999987532 234556666554 Q ss_pred -CcceEeccCCCCC--HHHHhhhhhhhhhc--CCceeEEecCHHHHHHHHHHhcCcceEEeecCCCCc----ccceecCe Q lcl|NC_019448. 184 -KNNVINAKGNQLT--EKHLNEAAVRIGKG--FGTATDAYMPIGVHADFVNSILGRQMQLMQDNSGNV----NTGYSVNG 254 (463) Q Consensus 184 -~~nviDarG~~ls--~~~ln~aa~~i~~~--~G~~td~~m~~~vka~f~~~~~~~qrv~~~~n~g~~----~~G~~v~~ 254 (463) ..+.+...+...+ ...|..+...+... ...++-+.||+.+.+.|...--..-|.+.++..... -.|+++- T Consensus 143 ~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~vmn~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~l~G~Pv~- 221 (311) T protein:vir:99 143 AASKRVELTADTIANPDLAIEAAVGLLVANGHPTPVNGLALHPSIAWGLSTARYTDGRKKFPELGLGIGVSSFEGIDAS- 221 (311) T ss_pred cccceeeccccccchhHHHHHHHHHHHhhhccCCCccEEEEcHHHHHHHHhhhccCCCeeecCcccCCCCceecceeeE- Confidence 2355444333332 34455555554433 245566899999999997532222244444422221 2344432 Q ss_pred eeecc---cccccCC-ce-ec-cCcccc--cccc---ccCCCCCCCCeeEEEEe----ccCCCcCcccccccce----EE Q lcl|NC_019448. 255 FYSSR---GFIKLHG-ST-VM-ENELIL--DESL---QPLPNAPQPAKVTATVE----TKQKGAFEDEEDRAGL----SY 315 (463) Q Consensus 255 ~~s~~---G~i~l~~-s~-~~-~~d~~l--~~~~---~~~p~ap~p~~vtat~~----~~~~g~~~~~~~~a~y----sY 315 (463) .+.. +.+...+ +. ++ +.+..+ |... +.... ..+.... ++..+..+ ..+--.+ .+ T Consensus 222 -~s~~i~~~~~~~~~~~~~~~~~~~~~~~Gdf~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~-~~d~~~~r~~~r~ 294 (311) T protein:vir:99 222 -VSDTVNGGDEADPDDEDLDAARAVRGIVGDFANGIHWGVQR-----DIPVELIKYGDPDGQGDLK-RHNQIALRLEIVY 294 (311) T ss_pred -eecccccccccccccchhhccCcceEEEeeccccEEEEEec-----CceEEEeecCCCCcchhhh-hcCcEEEEEEEee Confidence 1110 0000000 00 00 000000 0000 00000 0011111 11112111 1112122 12 Q ss_pred EEEEEecCCccccccceeeee Q lcl|NC_019448. 316 KVVVNSDDAQSAPSEEVTATV 336 (463) Q Consensus 316 kV~a~s~~geS~~S~~vt~Tv 336 (463) -. .+-+. +.+. ...++. T Consensus 295 d~-~v~~~-~~v~--~~~~~A 311 (311) T protein:vir:99 295 GW-YVFTD-RFVV--IENAVA 311 (311) T ss_pred cc-eecCh-hHee--eecccC Confidence 11 12111 1111 111111 No 69 >protein:vir:98339 Length: 415 # NCBI annotation: putative capsid protein # Family: family:all:21 # MgeID: mge:1581 # MgeName: phiPVL(108) # Cross-refs: genbank:acc:YP_918931;genbank:gi:119443693;genbank:GeneID:4594501 Probab=95.95 E-value=0.0012 Score=36.57 Aligned_cols=313 Identities=12% Similarity=0.007 Sum_probs=137.0 Q ss_pred CCCCCccchHH--H----Hhh-----hhhhHHHHHHh----hcCCccC--CccccCccccchhhhhhHhhhhhccccccc Q lcl|NC_019448. 1 MTIEKNLSDVQ--Q----KYA-----DQFQEDVVKSF----QTGYGIT--PDTQIDAGALRREILDDQITMLTWTNEDLI 63 (463) Q Consensus 1 ~~~~~~~~~~~--~----~~~-----k~~~e~~~Ks~----~agy~~~--p~~q~~gaalr~esLd~~i~~L~~~~~df~ 63 (463) -..+....... . ... +.......++| ..+.... ..+-.+|+.|--+.+.+.|..+.... -. T Consensus 73 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~~iP~~~~~~ii~~~~~~--~~ 150 (415) T protein:vir:98 73 QSVEVNEARTYRNQANINDLGISIQNTKVTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVE--FN 150 (415) T ss_pred cccccchhhhHHHHHHHHHHhhhhhhhhhHHHHHHHHHHHHhhhhhhhhccccccccccccchHHHHHHHHHHHhh--hh Confidence 00000000000 0 000 00011112222 1221111 11223455555555555554333333 33 Q ss_pred hhhhcccchhhHHHhhhhhhhccCcccccccccccCccc-ccCcceEEEEEEEEEeechhhhhhhhhhhcccccHHHHHH Q lcl|NC_019448. 64 FYRDISRRPAQSTVVKYDQYLRHGNVGHSRFVKEIGVAP-VSDPNIRQKTVSMKYVSDTKNMSIASGLVNNIADPSQILT 142 (463) Q Consensus 64 f~~~i~k~~~~stv~ey~~~~~hG~~g~~~fv~E~g~~~-~~d~~~~r~~~~~k~l~~~~~vs~~~~lvn~~~Dp~~~~~ 142 (463) +.+-+...++.+.-..|......++ ....+++|++..+ .+++.+......++-++.-..+|.-+ +.++.-|.+.... T Consensus 151 l~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~v~E~~~~~~~~~~~~~~v~~~~~k~~~~~~iS~el-l~ds~~~l~~~i~ 228 (415) T protein:vir:98 151 LDKYVTVKRVTNGSGKYPVVRQSEV-AALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREA-IEDAKVNVLQELK 228 (415) T ss_pred hhhheeeeeccCCceeEEEEeecCC-ccceeeccccccCcccccceeeEEeeeeeeEeeehhhHHH-HhhchHHHHHHHH Confidence 5555555556555555555544433 3566899998875 66799999999999999887777764 2445567888888 Q ss_pred HHHHHHHHHHHHHHHhhcccccCCCccccccccccceeeecCcceEeccCCCCCHHHHhhhhhhhhhcCCceeEEecCHH Q lcl|NC_019448. 143 EDAIAVVAKTIEWASFYGDASLTSEVEGEGLEFDGLAKLIDKNNVINAKGNQLTEKHLNEAAVRIGKGFGTATDAYMPIG 222 (463) Q Consensus 143 ~~ai~~~~~~~E~a~fyGd~~l~~~~~~~gleFDGl~~lI~~~nviDarG~~ls~~~ln~aa~~i~~~~G~~td~~m~~~ 222 (463) +.-...+++.+|.+++.|+-.=++. . ++.......+.....+. .+.+.|-++-..+...|....-.+||+. T Consensus 229 ~~l~~~~~~~~~~~il~g~g~g~~~-------~-~~~~~~~~~~~~~~~~~-~~~~~i~~~~~~~~~~~~~~~~~v~n~~ 299 (415) T protein:vir:98 229 LWMARTIAATRNKAIIDVITKGSTG-------S-TSSGFEKEGKKLEVKKA-KSLDDIKDAINLNVKPNYEHNVAIVSQT 299 (415) T ss_pred HHHHHHHHHHHHHHHhhccccCccc-------c-ccccccccccccccccc-cchhHHHHHHHhhhhhccCCCEEEEcHH Confidence 8888999999999999998652221 0 11111122233333333 4455555555566666766777999999 Q ss_pred HHHHHHHHhcCcceEEeecCCCC----cccceecCeeeecccc-cccCCc-eecc--CccccccccccCCCCCCCCeeEE Q lcl|NC_019448. 223 VHADFVNSILGRQMQLMQDNSGN----VNTGYSVNGFYSSRGF-IKLHGS-TVME--NELILDESLQPLPNAPQPAKVTA 294 (463) Q Consensus 223 vka~f~~~~~~~qrv~~~~n~g~----~~~G~~v~~~~s~~G~-i~l~~s-~~~~--~d~~l~~~~~~~p~ap~p~~vta 294 (463) +.+.+...=-..-|.+.+++..+ .-.|++|- .+..-. ...... +++. .+..+...+. .++. T Consensus 300 ~~~~l~~lkd~~G~~l~~~~~~~~~~~~l~G~pV~--~~~~~~~~~~~~~~~~~Gd~~~~~~~~~~~---------~~~v 368 (415) T protein:vir:98 300 MFAKLDKMKDKLGNYLIQPDVKEKTQQRLLGAKIE--ILPDEVLGQKGNNTLIIGNLKDAIVLFDRS---------QYQA 368 (415) T ss_pred HHHHHHHhhccCCceeeccCcCCCCCceecceeeE--EecccccCCCCccEEEEEehhccEEEEeec---------ceEE Confidence 99998753222234455443322 23444431 100000 000000 1111 0100000000 0000 Q ss_pred EEeccCCCcCcccccccceEEEEEEEe----cCCccccccceeeeecCCCCceEEEE Q lcl|NC_019448. 295 TVETKQKGAFEDEEDRAGLSYKVVVNS----DDAQSAPSEEVTATVSNVDDGVKLSI 347 (463) Q Consensus 295 t~~~~~~g~~~~~~~~a~ysYkV~a~s----~~geS~~S~~vt~Tva~~~~gv~ltI 347 (463) ....... ....|++...- .+-++...-..+.++. ......|.- T Consensus 369 ~~~~~~~---------~~~~~~~~~r~d~~v~~~~a~~~~~~~~~~~-~~~~~~~~~ 415 (415) T protein:vir:98 369 SWTDYMH---------FGECLMIAVRQDCRILDYKSAIVIEYDDSER-GEGDLGLEA 415 (415) T ss_pred EEecccc---------CceEEEEEEEeccEEeccccEEEEEEeccCC-CCCccccCC Confidence 0000000 00011111111 1112211111111111 111112222 No 70 >protein:vir:79987 Length: 415 # NCBI annotation: head protein # Family: family:all:21 # MgeID: mge:1875 # MgeName: tp310-3 # Cross-refs: genbank:acc:YP_001430002;genbank:gi:156604057;genbank:GeneID:5525447 Probab=95.95 E-value=0.0012 Score=36.57 Aligned_cols=313 Identities=12% Similarity=0.007 Sum_probs=137.0 Q ss_pred CCCCCccchHH--H----Hhh-----hhhhHHHHHHh----hcCCccC--CccccCccccchhhhhhHhhhhhccccccc Q lcl|NC_019448. 1 MTIEKNLSDVQ--Q----KYA-----DQFQEDVVKSF----QTGYGIT--PDTQIDAGALRREILDDQITMLTWTNEDLI 63 (463) Q Consensus 1 ~~~~~~~~~~~--~----~~~-----k~~~e~~~Ks~----~agy~~~--p~~q~~gaalr~esLd~~i~~L~~~~~df~ 63 (463) -..+....... . ... +.......++| ..+.... ..+-.+|+.|--+.+.+.|..+.... -. T Consensus 73 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~~iP~~~~~~ii~~~~~~--~~ 150 (415) T protein:vir:79 73 QSVEVNEARTYRNQANINDLGISIQNTKVTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVE--FN 150 (415) T ss_pred cccccchhhhHHHHHHHHHHhhhhhhhhhHHHHHHHHHHHHhhhhhhhhccccccccccccchHHHHHHHHHHHhh--hh Confidence 00000000000 0 000 00011112222 1221111 11223455555555555554333333 33 Q ss_pred hhhhcccchhhHHHhhhhhhhccCcccccccccccCccc-ccCcceEEEEEEEEEeechhhhhhhhhhhcccccHHHHHH Q lcl|NC_019448. 64 FYRDISRRPAQSTVVKYDQYLRHGNVGHSRFVKEIGVAP-VSDPNIRQKTVSMKYVSDTKNMSIASGLVNNIADPSQILT 142 (463) Q Consensus 64 f~~~i~k~~~~stv~ey~~~~~hG~~g~~~fv~E~g~~~-~~d~~~~r~~~~~k~l~~~~~vs~~~~lvn~~~Dp~~~~~ 142 (463) +.+-+...++.+.-..|......++ ....+++|++..+ .+++.+......++-++.-..+|.-+ +.++.-|.+.... T Consensus 151 l~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~v~E~~~~~~~~~~~~~~v~~~~~k~~~~~~iS~el-l~ds~~~l~~~i~ 228 (415) T protein:vir:79 151 LDKYVTVKRVTNGSGKYPVVRQSEV-AALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREA-IEDAKVNVLQELK 228 (415) T ss_pred hhhheeeeeccCCceeEEEEeecCC-ccceeeccccccCcccccceeeEEeeeeeeEeeehhhHHH-HhhchHHHHHHHH Confidence 5555555556555555555544433 3566899998875 66799999999999999887777764 2445567888888 Q ss_pred HHHHHHHHHHHHHHHhhcccccCCCccccccccccceeeecCcceEeccCCCCCHHHHhhhhhhhhhcCCceeEEecCHH Q lcl|NC_019448. 143 EDAIAVVAKTIEWASFYGDASLTSEVEGEGLEFDGLAKLIDKNNVINAKGNQLTEKHLNEAAVRIGKGFGTATDAYMPIG 222 (463) Q Consensus 143 ~~ai~~~~~~~E~a~fyGd~~l~~~~~~~gleFDGl~~lI~~~nviDarG~~ls~~~ln~aa~~i~~~~G~~td~~m~~~ 222 (463) +.-...+++.+|.+++.|+-.=++. . ++.......+.....+. .+.+.|-++-..+...|....-.+||+. T Consensus 229 ~~l~~~~~~~~~~~il~g~g~g~~~-------~-~~~~~~~~~~~~~~~~~-~~~~~i~~~~~~~~~~~~~~~~~v~n~~ 299 (415) T protein:vir:79 229 LWMARTIAATRNKAIIDVITKGSTG-------S-TSSGFEKEGKKLEVKKA-KSLDDIKDAINLNVKPNYEHNVAIVSQT 299 (415) T ss_pred HHHHHHHHHHHHHHHhhccccCccc-------c-ccccccccccccccccc-cchhHHHHHHHhhhhhccCCCEEEEcHH Confidence 8888999999999999998652221 0 11111122233333333 4455555555566666766777999999 Q ss_pred HHHHHHHHhcCcceEEeecCCCC----cccceecCeeeecccc-cccCCc-eecc--CccccccccccCCCCCCCCeeEE Q lcl|NC_019448. 223 VHADFVNSILGRQMQLMQDNSGN----VNTGYSVNGFYSSRGF-IKLHGS-TVME--NELILDESLQPLPNAPQPAKVTA 294 (463) Q Consensus 223 vka~f~~~~~~~qrv~~~~n~g~----~~~G~~v~~~~s~~G~-i~l~~s-~~~~--~d~~l~~~~~~~p~ap~p~~vta 294 (463) +.+.+...=-..-|.+.+++..+ .-.|++|- .+..-. ...... +++. .+..+...+. .++. T Consensus 300 ~~~~l~~lkd~~G~~l~~~~~~~~~~~~l~G~pV~--~~~~~~~~~~~~~~~~~Gd~~~~~~~~~~~---------~~~v 368 (415) T protein:vir:79 300 MFAKLDKMKDKLGNYLIQPDVKEKTQQRLLGAKIE--ILPDEVLGQKGNNTLIIGNLKDAIVLFDRS---------QYQA 368 (415) T ss_pred HHHHHHHhhccCCceeeccCcCCCCCceecceeeE--EecccccCCCCccEEEEEehhccEEEEeec---------ceEE Confidence 99998753222234455443322 23444431 100000 000000 1111 0100000000 0000 Q ss_pred EEeccCCCcCcccccccceEEEEEEEe----cCCccccccceeeeecCCCCceEEEE Q lcl|NC_019448. 295 TVETKQKGAFEDEEDRAGLSYKVVVNS----DDAQSAPSEEVTATVSNVDDGVKLSI 347 (463) Q Consensus 295 t~~~~~~g~~~~~~~~a~ysYkV~a~s----~~geS~~S~~vt~Tva~~~~gv~ltI 347 (463) ....... ....|++...- .+-++...-..+.++. ......|.- T Consensus 369 ~~~~~~~---------~~~~~~~~~r~d~~v~~~~a~~~~~~~~~~~-~~~~~~~~~ 415 (415) T protein:vir:79 369 SWTDYMH---------FGECLMIAVRQDCRILDYKSAIVIEYDDSER-GEGDLGLEA 415 (415) T ss_pred EEecccc---------CceEEEEEEEeccEEeccccEEEEEEeccCC-CCCccccCC Confidence 0000000 00011111111 1112211111111111 111112222 No 71 >protein:vir:81100 Length: 415 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:1891 # MgeName: tp310-1 # Cross-refs: genbank:acc:YP_001429874;genbank:gi:156603927;genbank:GeneID:5525320 Probab=95.95 E-value=0.0012 Score=36.57 Aligned_cols=313 Identities=12% Similarity=0.007 Sum_probs=137.0 Q ss_pred CCCCCccchHH--H----Hhh-----hhhhHHHHHHh----hcCCccC--CccccCccccchhhhhhHhhhhhccccccc Q lcl|NC_019448. 1 MTIEKNLSDVQ--Q----KYA-----DQFQEDVVKSF----QTGYGIT--PDTQIDAGALRREILDDQITMLTWTNEDLI 63 (463) Q Consensus 1 ~~~~~~~~~~~--~----~~~-----k~~~e~~~Ks~----~agy~~~--p~~q~~gaalr~esLd~~i~~L~~~~~df~ 63 (463) -..+....... . ... +.......++| ..+.... ..+-.+|+.|--+.+.+.|..+.... -. T Consensus 73 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~~iP~~~~~~ii~~~~~~--~~ 150 (415) T protein:vir:81 73 QSVEVNEARTYRNQANINDLGISIQNTKVTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVE--FN 150 (415) T ss_pred cccccchhhhHHHHHHHHHHhhhhhhhhhHHHHHHHHHHHHhhhhhhhhccccccccccccchHHHHHHHHHHHhh--hh Confidence 00000000000 0 000 00011112222 1221111 11223455555555555554333333 33 Q ss_pred hhhhcccchhhHHHhhhhhhhccCcccccccccccCccc-ccCcceEEEEEEEEEeechhhhhhhhhhhcccccHHHHHH Q lcl|NC_019448. 64 FYRDISRRPAQSTVVKYDQYLRHGNVGHSRFVKEIGVAP-VSDPNIRQKTVSMKYVSDTKNMSIASGLVNNIADPSQILT 142 (463) Q Consensus 64 f~~~i~k~~~~stv~ey~~~~~hG~~g~~~fv~E~g~~~-~~d~~~~r~~~~~k~l~~~~~vs~~~~lvn~~~Dp~~~~~ 142 (463) +.+-+...++.+.-..|......++ ....+++|++..+ .+++.+......++-++.-..+|.-+ +.++.-|.+.... T Consensus 151 l~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~v~E~~~~~~~~~~~~~~v~~~~~k~~~~~~iS~el-l~ds~~~l~~~i~ 228 (415) T protein:vir:81 151 LDKYVTVKRVTNGSGKYPVVRQSEV-AALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREA-IEDAKVNVLQELK 228 (415) T ss_pred hhhheeeeeccCCceeEEEEeecCC-ccceeeccccccCcccccceeeEEeeeeeeEeeehhhHHH-HhhchHHHHHHHH Confidence 5555555556555555555544433 3566899998875 66799999999999999887777764 2445567888888 Q ss_pred HHHHHHHHHHHHHHHhhcccccCCCccccccccccceeeecCcceEeccCCCCCHHHHhhhhhhhhhcCCceeEEecCHH Q lcl|NC_019448. 143 EDAIAVVAKTIEWASFYGDASLTSEVEGEGLEFDGLAKLIDKNNVINAKGNQLTEKHLNEAAVRIGKGFGTATDAYMPIG 222 (463) Q Consensus 143 ~~ai~~~~~~~E~a~fyGd~~l~~~~~~~gleFDGl~~lI~~~nviDarG~~ls~~~ln~aa~~i~~~~G~~td~~m~~~ 222 (463) +.-...+++.+|.+++.|+-.=++. . ++.......+.....+. .+.+.|-++-..+...|....-.+||+. T Consensus 229 ~~l~~~~~~~~~~~il~g~g~g~~~-------~-~~~~~~~~~~~~~~~~~-~~~~~i~~~~~~~~~~~~~~~~~v~n~~ 299 (415) T protein:vir:81 229 LWMARTIAATRNKAIIDVITKGSTG-------S-TSSGFEKEGKKLEVKKA-KSLDDIKDAINLNVKPNYEHNVAIVSQT 299 (415) T ss_pred HHHHHHHHHHHHHHHhhccccCccc-------c-ccccccccccccccccc-cchhHHHHHHHhhhhhccCCCEEEEcHH Confidence 8888999999999999998652221 0 11111122233333333 4455555555566666766777999999 Q ss_pred HHHHHHHHhcCcceEEeecCCCC----cccceecCeeeecccc-cccCCc-eecc--CccccccccccCCCCCCCCeeEE Q lcl|NC_019448. 223 VHADFVNSILGRQMQLMQDNSGN----VNTGYSVNGFYSSRGF-IKLHGS-TVME--NELILDESLQPLPNAPQPAKVTA 294 (463) Q Consensus 223 vka~f~~~~~~~qrv~~~~n~g~----~~~G~~v~~~~s~~G~-i~l~~s-~~~~--~d~~l~~~~~~~p~ap~p~~vta 294 (463) +.+.+...=-..-|.+.+++..+ .-.|++|- .+..-. ...... +++. .+..+...+. .++. T Consensus 300 ~~~~l~~lkd~~G~~l~~~~~~~~~~~~l~G~pV~--~~~~~~~~~~~~~~~~~Gd~~~~~~~~~~~---------~~~v 368 (415) T protein:vir:81 300 MFAKLDKMKDKLGNYLIQPDVKEKTQQRLLGAKIE--ILPDEVLGQKGNNTLIIGNLKDAIVLFDRS---------QYQA 368 (415) T ss_pred HHHHHHHhhccCCceeeccCcCCCCCceecceeeE--EecccccCCCCccEEEEEehhccEEEEeec---------ceEE Confidence 99998753222234455443322 23444431 100000 000000 1111 0100000000 0000 Q ss_pred EEeccCCCcCcccccccceEEEEEEEe----cCCccccccceeeeecCCCCceEEEE Q lcl|NC_019448. 295 TVETKQKGAFEDEEDRAGLSYKVVVNS----DDAQSAPSEEVTATVSNVDDGVKLSI 347 (463) Q Consensus 295 t~~~~~~g~~~~~~~~a~ysYkV~a~s----~~geS~~S~~vt~Tva~~~~gv~ltI 347 (463) ....... ....|++...- .+-++...-..+.++. ......|.- T Consensus 369 ~~~~~~~---------~~~~~~~~~r~d~~v~~~~a~~~~~~~~~~~-~~~~~~~~~ 415 (415) T protein:vir:81 369 SWTDYMH---------FGECLMIAVRQDCRILDYKSAIVIEYDDSER-GEGDLGLEA 415 (415) T ss_pred EEecccc---------CceEEEEEEEeccEEeccccEEEEEEeccCC-CCCccccCC Confidence 0000000 00011111111 1112211111111111 111112222 No 72 >protein:vir:80376 Length: 435 # NCBI annotation: gp6, major capsid head protein # Family: family:all:21 # MgeID: mge:1881 # MgeName: phi644-2 # Cross-refs: genbank:acc:YP_001111085;genbank:gi:134288639;genbank:GeneID:4960624 Probab=95.91 E-value=0.0013 Score=36.48 Aligned_cols=309 Identities=13% Similarity=0.111 Sum_probs=144.0 Q ss_pred CCCCCccchHHHHhhhhhhHHHHHHhhcC---------------------CccCCccccCcccc-chhhhhhHhhhhhcc Q lcl|NC_019448. 1 MTIEKNLSDVQQKYADQFQEDVVKSFQTG---------------------YGITPDTQIDAGAL-RREILDDQITMLTWT 58 (463) Q Consensus 1 ~~~~~~~~~~~~~~~k~~~e~~~Ks~~ag---------------------y~~~p~~q~~gaal-r~esLd~~i~~L~~~ 58 (463) ...+.........+ ..+.|++..+ ...+-.+...||.| ..+ +..+|..+... T Consensus 85 ~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~lvP~~-~~~~ii~~l~~ 158 (435) T protein:vir:80 85 YAQPKAPEVKGAKM-----ARMVRALAAARGDAQLASKLAIERGFGEEVAMSLNTLSPGAGGVLVPEN-LSSEVIELLRP 158 (435) T ss_pred ccccchhhhhHHHH-----HHHHHHHHhccchhHHHHHHHHhhhhhhhhhhhhcccCCCCCccccchh-HHHHHHHHHhh Confidence 11111111111111 1111222111 00111223334544 444 44444333222 Q ss_pred ccccchhhhcccc--hhhHHHhhhhhhhccCcccccccccccCcccccCcceEEEEEEEEEeechhhhhhhhhhhcc-c- Q lcl|NC_019448. 59 NEDLIFYRDISRR--PAQSTVVKYDQYLRHGNVGHSRFVKEIGVAPVSDPNIRQKTVSMKYVSDTKNMSIASGLVNN-I- 134 (463) Q Consensus 59 ~~df~f~~~i~k~--~~~stv~ey~~~~~hG~~g~~~fv~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lvn~-~- 134 (463) . ..+..+..+ +..+--.+|.++.. .+...|++|++..+..++.+.+....++=++....+|.-+ +.++ + T Consensus 159 ~---~~i~~~~~~~v~~~~~~~~~p~~~~---~~~a~~v~E~~~~~~~~~~f~~i~~~~~k~~~~~~is~el-l~ds~~~ 231 (435) T protein:vir:80 159 K---SVVRKLGARTLPLSNGNITIPRLKG---GAIVGYIGADTDIPTTQQQFDDLKLTAKKMAALVPIANDL-IKYAGVN 231 (435) T ss_pred h---chhhhccceeeecCCCceEEEEEeC---CcceeeeccCccccccccceeeEEEeeEEEEEeehhhHHH-HHhhccc Confidence 1 122222111 22222345555542 2456789999999999999999999999888888888765 3334 2 Q ss_pred ccHHHHHHHHHHHHHHHHHHHHHhhcccccCCCccccccccccceeeecCcceEecc-CCCCC--HHHHhhhhhhhhhcC Q lcl|NC_019448. 135 ADPSQILTEDAIAVVAKTIEWASFYGDASLTSEVEGEGLEFDGLAKLIDKNNVINAK-GNQLT--EKHLNEAAVRIGKGF 211 (463) Q Consensus 135 ~Dp~~~~~~~ai~~~~~~~E~a~fyGd~~l~~~~~~~gleFDGl~~lI~~~nviDar-G~~ls--~~~ln~aa~~i~~~~ 211 (463) -+.+....+.-..++...+|.++|+|+..= -+..||.+.....++.... |.... ...+.++...+..+. T Consensus 232 ~~l~~~i~~~l~~a~~~~~d~a~l~G~G~~--------~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~ 303 (435) T protein:vir:80 232 PNVDQIVVGDLTAAIGAREDKAFIRDDGTA--------NTPKGLRFWALPGNVITASDGSTLQKIETDLGKAILALENAD 303 (435) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhhccCCCC--------CcccceeecccccceeecccccchhhHHHHHHHHHHHhhccc Confidence 256888999999999999999999997431 1456776655544444432 22222 223445544444433 Q ss_pred --CceeEEecCHHHHHHHHHHhcCcceEEeecCCCCcccceecCe--eeec-ccccccCCc-eecc--CccccccccccC Q lcl|NC_019448. 212 --GTATDAYMPIGVHADFVNSILGRQMQLMQDNSGNVNTGYSVNG--FYSS-RGFIKLHGS-TVME--NELILDESLQPL 283 (463) Q Consensus 212 --G~~td~~m~~~vka~f~~~~~~~qrv~~~~n~g~~~~G~~v~~--~~s~-~G~i~l~~s-~~~~--~d~~l~~~~~~~ 283 (463) -...-..|++.+.+.+...--..-|.+.+...++.-.|++|-- .+.. -|. .-+.. +++. .+.++- .+ T Consensus 304 ~~~~~~~~vmn~~~~~~L~~lkd~~G~~l~~~~~~~~l~G~pv~~~~~~p~~~~~-~~~~~~i~~gd~s~~~i~-~~--- 378 (435) T protein:vir:80 304 ANLTQPGWIMAPRTFRFLEGLRDGNGNKVYPELANGMLKGYPVGKTTQVPINLGE-AGKESEIYFTDFGDVFIG-EE--- 378 (435) T ss_pred cccccCEEEEcHHHHHHHHhhhccCCceeccCCCCCeEeeeeeEEeccccccccC-CCCcceEEEEEcccEEEE-ee--- Confidence 2233467999999988753323333455554454556665411 1000 000 00111 1111 111110 00 Q ss_pred CCCCCCCeeEEEEeccCC-----CcCcccccccceEEEEEEEecCCccccccceeeeecCCCC Q lcl|NC_019448. 284 PNAPQPAKVTATVETKQK-----GAFEDEEDRAGLSYKVVVNSDDAQSAPSEEVTATVSNVDD 341 (463) Q Consensus 284 p~ap~p~~vtat~~~~~~-----g~~~~~~~~a~ysYkV~a~s~~geS~~S~~vt~Tva~~~~ 341 (463) ..++......++ +...+.-......+++...-+.+=--|...+..+-..+.+ T Consensus 379 ------~~~~i~~~~~~~~~~~~~~~~~~f~~n~~~~r~~~r~d~~~~~~~a~~~l~~~~~~~ 435 (435) T protein:vir:80 379 ------ETLEIDYSKEATYKDADGHMVSAFQRDQTLIRVIAKNDFGPRHVESIAVLSGVAWGA 435 (435) T ss_pred ------cceEEEEeccccccccccchhhhhhcCcceeeeeeeeCcEeecccceEEEeccCCCC Confidence 011122212111 1110000111234455544444444455555555555555 No 73 >protein:vir:105038 Length: 428 # NCBI annotation: major capsid head protein precursor # Family: family:all:21 # MgeID: mge:1465 # MgeName: phiKO2 # Cross-refs: genbank:acc:YP_006586;genbank:gi:46402092;genbank:GeneID:2777903 Probab=95.91 E-value=0.0013 Score=36.46 Aligned_cols=312 Identities=13% Similarity=0.109 Sum_probs=138.5 Q ss_pred CC--------CCCccchHH-HHhhhhhh---HHHHHHhh---cCCc-------cCCccccCccccchhhhhhHhhhhhcc Q lcl|NC_019448. 1 MT--------IEKNLSDVQ-QKYADQFQ---EDVVKSFQ---TGYG-------ITPDTQIDAGALRREILDDQITMLTWT 58 (463) Q Consensus 1 ~~--------~~~~~~~~~-~~~~k~~~---e~~~Ks~~---agy~-------~~p~~q~~gaalr~esLd~~i~~L~~~ 58 (463) .. .|....... ...+..+. .++..+.. -.+. .+..+ ..||.|--+.+..+|..+.. T Consensus 74 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~gg~liP~~~~~~ii~~l~- 151 (428) T protein:vir:10 74 QHGPAVIVKAEPKQYTGAGMTRMVMSIAAAQGNLQDAAKFASDELNDQSVSMAISTAA-GSGGVLIPQNIHSEVIELLR- 151 (428) T ss_pred hhccccccccccchhhhHHHHHHHHHHHHhhhhHHHHHHHhhhhhhhhhHhhhhcccc-cCCccccchhHHHHHHHHHh- Confidence 00 000000000 00111000 00000000 0010 11111 13444433333344422221 Q ss_pred ccccchhhhcccch--hhHHHhhhhhhhccCcccccccccccCcccccCcceEEEEEEEEEeechhhhhhhhhhhccccc Q lcl|NC_019448. 59 NEDLIFYRDISRRP--AQSTVVKYDQYLRHGNVGHSRFVKEIGVAPVSDPNIRQKTVSMKYVSDTKNMSIASGLVNNIAD 136 (463) Q Consensus 59 ~~df~f~~~i~k~~--~~stv~ey~~~~~hG~~g~~~fv~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lvn~~~D 136 (463) +...+.++..+. ..+--.+|.++.. .+...+++|++..+.+++.+.+.+...+=++.-..+|.-+ +.++..| T Consensus 152 --~~~~l~~~~~~~~~~~~g~~~~p~~~~---~~~a~~v~Eg~~~~~~~~~f~~i~~~~~k~~~~v~is~el-l~ds~~~ 225 (428) T protein:vir:10 152 --DRTIVRKLGARSIPLPNGNMSLPRLAG---GATASYTGENQDAKVSEARFDDVKLTAKTMIAMVPISNAL-IGRAGFN 225 (428) T ss_pred --hhchhhhhcceeeecCCcceEEEEEeC---CcceeeeccCccccccccceeeEEeeeEEEEEeehhhHHH-HhhhhHH Confidence 122222221111 1122234555543 2456799999999999999999999999888877777765 3445567 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhhcccccCCCccccccccccceeeecCc-ceE-eccCCCCCHHHHhhhhhhh------h Q lcl|NC_019448. 137 PSQILTEDAIAVVAKTIEWASFYGDASLTSEVEGEGLEFDGLAKLIDKN-NVI-NAKGNQLTEKHLNEAAVRI------G 208 (463) Q Consensus 137 p~~~~~~~ai~~~~~~~E~a~fyGd~~l~~~~~~~gleFDGl~~lI~~~-nvi-DarG~~ls~~~ln~aa~~i------~ 208 (463) .+....+.-...+.+.+|.++++||.. |-+++|+.+..... .++ ...+...+.+.+......+ . T Consensus 226 l~~~i~~~l~~ai~~~~d~~~l~G~G~--------~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 297 (428) T protein:vir:10 226 VEQLVLQDILTAISVREDKAFMRDDGT--------GDTPIGMKARATQWNRLLPWAADAAVNLDTIDTYLDSIILMSMDG 297 (428) T ss_pred HHHHHHHHHHHHHHHHHHHHHhccCCC--------CccccccccccccccccccccccccccHHHHHHHHHHHHHhhhcc Confidence 888999999999999999999999853 12678888755432 222 2334555555444332221 2 Q ss_pred hcCCceeEEecCHHHHHHHHHHhcCcceEEeecCCCCcccceecC--eeeec-ccccccCCcee-cc--Ccccccccccc Q lcl|NC_019448. 209 KGFGTATDAYMPIGVHADFVNSILGRQMQLMQDNSGNVNTGYSVN--GFYSS-RGFIKLHGSTV-ME--NELILDESLQP 282 (463) Q Consensus 209 ~~~G~~td~~m~~~vka~f~~~~~~~qrv~~~~n~g~~~~G~~v~--~~~s~-~G~i~l~~s~~-~~--~d~~l~~~~~~ 282 (463) ..+-...-.+|++.+...+...=-..-|.+.++..++.-.|++|- ..+-. -|. ..+...+ +. .+.++- .+ T Consensus 298 ~~~~~~~~~v~n~~~~~~L~~lkd~~G~~i~~~~~~g~l~G~pv~~~~~~p~~~~~-~~~~~~i~~gd~s~~~i~-~~-- 373 (428) T protein:vir:10 298 NSNMISSGWGMSNRTYMKLFGLRDGNGNKVYPEMAQGMLKGYPIQRTSAIPANLGE-GGKESEIYFADFNDVVIG-ED-- 373 (428) T ss_pred ccccccCEEEEcHHHHHHHHHhhccCCceeccCCCCCeeeceeeEEeccccccccC-CCccceEEEEecceEEEE-Ee-- Confidence 223333446899999988875322222444444444344566541 00000 000 0000111 11 011110 00 Q ss_pred CCCCCCCCeeEEEEecc-----CCCcCcccccccceEEEEEEEecCCccccccceeeeecCC Q lcl|NC_019448. 283 LPNAPQPAKVTATVETK-----QKGAFEDEEDRAGLSYKVVVNSDDAQSAPSEEVTATVSNV 339 (463) Q Consensus 283 ~p~ap~p~~vtat~~~~-----~~g~~~~~~~~a~ysYkV~a~s~~geS~~S~~vt~Tva~~ 339 (463) ..+....... ..+.....-..-...+++...-+-+=--|...+.+|-..+ T Consensus 374 -------~~i~i~~~~~~~~~~~~~~~~~~f~~~~~~~R~~~r~d~~v~~p~a~~~~t~~~~ 428 (428) T protein:vir:10 374 -------GNMKVDFSKEASYIDTDGKLVSAFSRNQSLIRVVTEHDIGFRHPEGLVLGTGVLF 428 (428) T ss_pred -------cceEEEeecccccccccccccchhhcchhheeeeeeeCceeeccceEEEEeccCC Confidence 0011111111 1111000001112233444333333334555666665556 No 74 >protein:vir:9759 Length: 303 # NCBI annotation: putative structural protein # Family: family:all:966 # MgeID: mge:175 # MgeName: 315.3 # Cross-refs: genbank:acc:NP_795521;genbank:gi:28876283;genbank:GeneID:1257824 Probab=95.90 E-value=0.0013 Score=36.45 Aligned_cols=293 Identities=12% Similarity=0.055 Sum_probs=131.9 Q ss_pred CccCCccccCccccchhhhhhHhhhhhccccccchhhhcccchhhHHHhhhhhhhccCcccccccccccCcccccCcceE Q lcl|NC_019448. 30 YGITPDTQIDAGALRREILDDQITMLTWTNEDLIFYRDISRRPAQSTVVKYDQYLRHGNVGHSRFVKEIGVAPVSDPNIR 109 (463) Q Consensus 30 y~~~p~~q~~gaalr~esLd~~i~~L~~~~~df~f~~~i~k~~~~stv~ey~~~~~hG~~g~~~fv~E~g~~~~~d~~~~ 109 (463) -++ +..+|.-+..|..++-|..|.. ...+++-....+..+--.+|.++... +...+++|++..+.+++.+. T Consensus 1 m~t---~t~gg~liP~~~~~~ii~~l~~---~s~i~~l~~~~~~~~~~~~ip~~~~~---~~a~wv~E~~~~~~s~~~f~ 71 (303) T protein:vir:97 1 MGT---ETSKASLFDKHLVSDLINKVKG---HSSLAKLSSQKPIPFNGSKEFTFTLD---SDIDVVAENGKKTHGGLSLE 71 (303) T ss_pred Ccc---cCCCCeEcchhHHHHHHHHHHh---hchhhhhcceeecCCCceEEEEEecC---cceEEeecCcccccccccee Confidence 111 1233444555554444433333 33344444444454433355555532 35789999999999999999 Q ss_pred EEEEEEEEeechhhhhhhhhhhcc--cccHHHHHHHHHHHHHHHHHHHHHhhcccccCCCccccccccccceeeec-Ccc Q lcl|NC_019448. 110 QKTVSMKYVSDTKNMSIASGLVNN--IADPSQILTEDAIAVVAKTIEWASFYGDASLTSEVEGEGLEFDGLAKLID-KNN 186 (463) Q Consensus 110 r~~~~~k~l~~~~~vs~~~~lvn~--~~Dp~~~~~~~ai~~~~~~~E~a~fyGd~~l~~~~~~~gleFDGl~~lI~-~~n 186 (463) +.....|=++.--.+|.-+=.++. ..+.++...+.....+++.+|.++++|+.+-.. . +..--|...... ..+ T Consensus 72 ~v~l~~~kl~~~~~iS~ell~~~~d~~~~l~~~i~~~la~a~~~~ld~a~l~G~~~~~g--~--~~~~~~~~~~~~~~~~ 147 (303) T protein:vir:97 72 PVTIVPIKVEYGARLSDEFLYATEEEKIDILKAFNEGFAKKLARGIDLMAMHGINPRTK--K--ASDVIGTNHFDSKVTQ 147 (303) T ss_pred eEEeeeEEEEEeehhhHHHhhcCccchHHHHHHHHHHHHHHHHHHHHhhhhcccccCCc--c--cccccccccccccccc Confidence 999999999988888876544333 346788889999999999999999999754221 1 111111111111 122 Q ss_pred eEeccCCCCCHHHHhhhhhhhhhcCCceeEEecCHHHHHHHHHHhcCcc-eEEeecCCCCcccceecCeeeecccccccC Q lcl|NC_019448. 187 VINAKGNQLTEKHLNEAAVRIGKGFGTATDAYMPIGVHADFVNSILGRQ-MQLMQDNSGNVNTGYSVNGFYSSRGFIKLH 265 (463) Q Consensus 187 viDarG~~ls~~~ln~aa~~i~~~~G~~td~~m~~~vka~f~~~~~~~q-rv~~~~n~g~~~~G~~v~~~~s~~G~i~l~ 265 (463) +.-.-+...+.+.|.++...+..+++.++.+.||+.+...+... .+.+ +.+..++.+. |...+ T Consensus 148 ~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~vmn~~~~~~L~~l-kd~~g~~~~~~~~~~---~~~~~------------ 211 (303) T protein:vir:97 148 VVKFTESEDADANIEAAVNLIQGAEGVVTGLAMDTEFSTALAKV-TNGEMGPKMYPELAW---GANPD------------ 211 (303) T ss_pred ccccccccchHHHHHHHHHHHhhcCCCccEEEEcHHHHHHHHHh-hccCCCeEEecCccC---CCCCc------------ Confidence 22222223345667777778888888889999999999998743 2322 3333332110 00011 Q ss_pred CceeccCccccccccccCCCCCCCCeeEEEEeccCCCcCcccc------cccceEEEEEEEecCCccccccceeeeecCC Q lcl|NC_019448. 266 GSTVMENELILDESLQPLPNAPQPAKVTATVETKQKGAFEDEE------DRAGLSYKVVVNSDDAQSAPSEEVTATVSNV 339 (463) Q Consensus 266 ~s~~~~~d~~l~~~~~~~p~ap~p~~vtat~~~~~~g~~~~~~------~~a~ysYkV~a~s~~geS~~S~~vt~Tva~~ 339 (463) .++..+.....+ .|. . ..+..+....-++|.. .+....+++. +++..--+. ++.- ... T Consensus 212 --~l~G~Pv~~s~~---v~~-----~-~~~~~~~~~~~~Gdf~~~~~~~~~~~~~~~~~---~~~~~d~~~-~~~~-~~n 275 (303) T protein:vir:97 212 --SINGLKSSVNTT---VGA-----G-ADEAESKDLVIIGDFESMFKWGYAKQIPMEII---KYGDPDNSG-KDLK-GYN 275 (303) T ss_pred --eecceeeEEecc---cCC-----c-cccCCCccEEEEeeccccEEEEEecCcEEEEe---eccCCCCcc-hhhh-hcC Confidence 111111111111 000 0 0000000000111110 0111111111 111000000 0000 000 Q ss_pred CCceEEEEEecCCCCCCcceEEEEeecCC Q lcl|NC_019448. 340 DDGVKLSISVNAMYQQQPQFVSIYRQGKE 368 (463) Q Consensus 340 ~~gv~ltIt~~a~~g~~~~~y~IYR~~~~ 368 (463) .-++..+.-+... =..|+-+...++++= T Consensus 276 ~~~~r~~~r~~~~-v~~p~af~~l~~~~~ 303 (303) T protein:vir:97 276 QIYLRAEAYIGWG-ILDAKSFARVTKGEV 303 (303) T ss_pred cEEEEEEEEeccE-eecccceEEeeCCCC Confidence 0000001111000 000111222222211 No 75 >protein:vir:81227 Length: 413 # NCBI annotation: gp6, major capsid protein # Family: family:all:585 # MgeID: mge:1893 # MgeName: BFK20 # Cross-refs: genbank:acc:YP_001456736;genbank:gi:157168379;hssp:P49861;interpro:IPR006444;uniprot:Q9MBJ9;genbank:GeneID:5580350 Probab=95.83 E-value=0.00053 Score=38.53 Aligned_cols=308 Identities=12% Similarity=0.054 Sum_probs=129.1 Q ss_pred CCCCCc-cchHHHHhhhhhh--------------------HHHHHHhhcCCccCCccccCccccchhhhhhHhhhhhccc Q lcl|NC_019448. 1 MTIEKN-LSDVQQKYADQFQ--------------------EDVVKSFQTGYGITPDTQIDAGALRREILDDQITMLTWTN 59 (463) Q Consensus 1 ~~~~~~-~~~~~~~~~k~~~--------------------e~~~Ks~~agy~~~p~~q~~gaalr~esLd~~i~~L~~~~ 59 (463) +..+.. .....+...+... ..-.+++..-..... +..+++.+--+.+.++|..+.. T Consensus 67 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~vp~~~~~~ii~~~~-- 143 (413) T protein:vir:81 67 LTRKGEGYKSIGEFFAKRAGDQIKQQAGGAQLNYSVGEYVAPRVKAASDPASTAT-LTDEFQGGYGTTWNRNIIYRRR-- 143 (413) T ss_pred HhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHhhhhhhhhhhhHHHhhhhhhhhcc-cccccccccchhhHHHHHHHHh-- Confidence 000000 0000000000000 000122211111111 1233444333334444433322 Q ss_pred cccchhhhcccchhhHHHhhhhhhhccC-cccccccccccCcccccC-cceEEEEEEEEEeechhhhhhhhhhhcccccH Q lcl|NC_019448. 60 EDLIFYRDISRRPAQSTVVKYDQYLRHG-NVGHSRFVKEIGVAPVSD-PNIRQKTVSMKYVSDTKNMSIASGLVNNIADP 137 (463) Q Consensus 60 ~df~f~~~i~k~~~~stv~ey~~~~~hG-~~g~~~fv~E~g~~~~~d-~~~~r~~~~~k~l~~~~~vs~~~~lvn~~~Dp 137 (463) +.-.+.+-+...+..+.-.+|.+..... ..+...+++|++..+.++ +.+.+.+..++=++.-..+|..+ +.++ .+. T Consensus 144 ~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~f~~i~~~~~k~~~~~~iS~el-l~ds-~~l 221 (413) T protein:vir:81 144 EKLVVADLMDNLTMTNTTIKYLMEKANRVVEGGFKTVAEGGKKPYMRFADFDIVTESLSKIAGLTKITDEM-IEDY-DFL 221 (413) T ss_pred hhhhHHhhcceeeccCCceeEEEeccccccccccceecCcccccccCcccceeeEeeeeeEEEeehhhHHH-HHHH-HHH Confidence 2223445555556655545555554332 234577999998877666 78999999998888777787753 3333 346 Q ss_pred HHHHHHHHHHHHHHHHHHHHhhcccccCCCccccccccccceeeecCcceEeccCCCCCHHHHhhhhhhhhhcC-CceeE Q lcl|NC_019448. 138 SQILTEDAIAVVAKTIEWASFYGDASLTSEVEGEGLEFDGLAKLIDKNNVINAKGNQLTEKHLNEAAVRIGKGF-GTATD 216 (463) Q Consensus 138 ~~~~~~~ai~~~~~~~E~a~fyGd~~l~~~~~~~gleFDGl~~lI~~~nviDarG~~ls~~~ln~aa~~i~~~~-G~~td 216 (463) +....+.-...+++.+|.++++|+-. +-++.||.+......+ -..+..-..+.|.++...+.... ..++- T Consensus 222 ~~~i~~~la~~~~~~~d~~~l~G~G~--------~~~~~Gi~~~~~~~~~-~~~~~~~~~~~i~~~~~~~~~~~~~~~~~ 292 (413) T protein:vir:81 222 VSYINARLLEELAIEEERQLLLGDGT--------GNNLTGLLKRDGIQTL-AVSNKDELADSIYKAMTNISLATPFQADA 292 (413) T ss_pred HHHHHHHHHHHHHHHHHHHHhccCCC--------CCcccccccccccccc-cccccchhHHHHHHHHHHhhhhccCCCcE Confidence 77777778889999999999999742 1246777665443211 11112223455666666655443 35566 Q ss_pred EecCHHHHHHHHHHhcCcceEEeecCC----CC-------cccceecCeeeecccccccCCceeccCccccccccccCCC Q lcl|NC_019448. 217 AYMPIGVHADFVNSILGRQMQLMQDNS----GN-------VNTGYSVNGFYSSRGFIKLHGSTVMENELILDESLQPLPN 285 (463) Q Consensus 217 ~~m~~~vka~f~~~~~~~qrv~~~~n~----g~-------~~~G~~v~~~~s~~G~i~l~~s~~~~~d~~l~~~~~~~p~ 285 (463) ++|++.+.+.+...-=..-|.+.++.. ++ .-.|++| +.+.. +. .+.+++. |...... T Consensus 293 ~vmn~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~~~~~~~~l~G~pv--~~s~~--~~-~~~~~~g-----d~~~~~~-- 360 (413) T protein:vir:81 293 LVINPLDYQELRLAKDANGQYYGGGVFQGQYGSGGIMLDPAPWGLRT--VQSQV--VP-VGKPVVG-----AFRSAAS-- 360 (413) T ss_pred EEEcHHHHHHHHHhhccCCceeccccccccccccccccCceecceee--EEcCC--CC-cccEEEE-----ecccEEE-- Confidence 899999999887432222222322211 10 1123332 11111 00 1111111 0000000 Q ss_pred CCCCCeeEEEEeccCCCcCcccccccceEEEEE----EEecCCccccccceeeeecC Q lcl|NC_019448. 286 APQPAKVTATVETKQKGAFEDEEDRAGLSYKVV----VNSDDAQSAPSEEVTATVSN 338 (463) Q Consensus 286 ap~p~~vtat~~~~~~g~~~~~~~~a~ysYkV~----a~s~~geS~~S~~vt~Tva~ 338 (463) -...-.++..........| .++ ...|++. ..-.+.++...-.++.+++- T Consensus 361 ~~~~~~~~v~~~~~~~~~~--~~~--~~~~r~~~r~d~~~~~~~a~~~l~~~~~~~p 413 (413) T protein:vir:81 361 VLRKGGVRIDSTNTNVDDF--ENN--LITVRAEERVGLMVTFPEAIVQLDVAEVVTP 413 (413) T ss_pred EEEecceEEEEeccccchh--hcC--cEEEEEEEeeccEEecccceEEEEecCCCCC Confidence 0000011111111111000 001 1122221 11112232211111111111 No 76 >protein:vir:95376 Length: 425 # NCBI annotation: phage major capsid protein # Family: family:all:635 # MgeID: mge:1567 # MgeName: GBSV1 # Cross-refs: genbank:acc:YP_764476;genbank:gi:115334630;genbank:GeneID:5179263 Probab=95.79 E-value=0.00061 Score=38.20 Aligned_cols=310 Identities=13% Similarity=0.110 Sum_probs=139.8 Q ss_pred CCC--CCccc-hHHHHhhhhhh---HHHHHHhh---cCCc-------cCCccccCccccchhhhhhHhhhhhccccccch Q lcl|NC_019448. 1 MTI--EKNLS-DVQQKYADQFQ---EDVVKSFQ---TGYG-------ITPDTQIDAGALRREILDDQITMLTWTNEDLIF 64 (463) Q Consensus 1 ~~~--~~~~~-~~~~~~~k~~~---e~~~Ks~~---agy~-------~~p~~q~~gaalr~esLd~~i~~L~~~~~df~f 64 (463) -+. +..+. ..+......-. .+..+... .+.. .+-.+-.+|+.+--+.+.++|...... .-.+ T Consensus 91 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~~vP~~~~~~Ii~~l~~--~~~i 168 (425) T protein:vir:95 91 QPSNQSRQKMQGSKGDVVEMNRLQVREMLKTGEYYKRSEVVEFYEKFRNLRAVAGGELTIPEVVVNRIMDIMGD--YTTL 168 (425) T ss_pred ccchhhhhhhhhhhhhHHHHHHHHHHHHHhhhhhhhhhHHHHHHHHHHhhcccccCceeccHHHHHHHHHHHHh--hhhH Confidence 000 00000 00000000000 00111110 0000 000112345554444455555322222 2234 Q ss_pred hhhcccchhhHHHhhhhhhhccCcccccccccccCcccccC-cceEEEEEEEEEeechhhhhhhhhhhcccccHHHHHHH Q lcl|NC_019448. 65 YRDISRRPAQSTVVKYDQYLRHGNVGHSRFVKEIGVAPVSD-PNIRQKTVSMKYVSDTKNMSIASGLVNNIADPSQILTE 143 (463) Q Consensus 65 ~~~i~k~~~~stv~ey~~~~~hG~~g~~~fv~E~g~~~~~d-~~~~r~~~~~k~l~~~~~vs~~~~lvn~~~Dp~~~~~~ 143 (463) ++.+...+....+ ++.+ .++.+...|+.|++..+.++ +.+.+.....+=++.-..+|.-+ +.++..|.+....+ T Consensus 169 ~~~~~~~~~~g~~-~ip~---~~~~~~a~~v~E~~~~~~~~~~~f~~i~l~~~k~~~~~~iS~el-l~ds~~~l~~~i~~ 243 (425) T protein:vir:95 169 YPLVDKIRVKGTT-RILV---DTDTSPATWIEQSGALPTGDVGTIASIDFDGFKVGKVTFVDNYL-LQDSIINLDDYVTK 243 (425) T ss_pred HHhhceeecCcee-EEEE---ecCCccccccccccccccccccccceeeeeheeeeeeehhhHHH-HhccHHHHHHHHHH Confidence 4444444444333 3443 33445678999999866555 78998888887777655555532 23345577888888 Q ss_pred HHHHHHHHHHHHHHhhcccccCCCccccccccccceeeecCcceEeccCCCCCHHHHhhhhhhhhhcCCceeE--EecCH Q lcl|NC_019448. 144 DAIAVVAKTIEWASFYGDASLTSEVEGEGLEFDGLAKLIDKNNVINAKGNQLTEKHLNEAAVRIGKGFGTATD--AYMPI 221 (463) Q Consensus 144 ~ai~~~~~~~E~a~fyGd~~l~~~~~~~gleFDGl~~lI~~~nviDarG~~ls~~~ln~aa~~i~~~~G~~td--~~m~~ 221 (463) .-...+++.+|.++|+|+..-+. ++.|+.+-+...+.....+..++.+.|.++...+..++..... .+|++ T Consensus 244 ~l~~~i~~~~d~~il~G~G~~~~-------~p~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~ 316 (425) T protein:vir:95 244 KIARAIAKALDLAIVKGTGAANK-------QPLGIIPSLPPENQVTVEADNNLLKNLVKQIGLIDTGDDSVGEIVAVMKR 316 (425) T ss_pred HHHHHHHHHHHHHhhccCCCCcc-------ccceeecccccccccccccccchHHHHHHHHHhhhhhccccCceEEEEeC Confidence 88899999999999999866432 6778877555443344455666777777777777777754433 45776 Q ss_pred HH-HHHHHH--HhcC-cceEE-eecCCCCc-ccceecCeeeecccccccCCceeccCccccccccccCCCCCCCCeeEEE Q lcl|NC_019448. 222 GV-HADFVN--SILG-RQMQL-MQDNSGNV-NTGYSVNGFYSSRGFIKLHGSTVMENELILDESLQPLPNAPQPAKVTAT 295 (463) Q Consensus 222 ~v-ka~f~~--~~~~-~qrv~-~~~n~g~~-~~G~~v~~~~s~~G~i~l~~s~~~~~d~~l~~~~~~~p~ap~p~~vtat 295 (463) .+ .+.+.. .+-+ .-|.+ +.++.+.+ -.|++| +.+..-. .+.+++. | ...... . .-..++.. T Consensus 317 ~~~~~~l~~l~~~kd~~g~~i~~~~~~~~~~l~G~pv--v~~~~~~---~~~i~~G-d----~~~~~~-~--~~~~~~i~ 383 (425) T protein:vir:95 317 STYYNRLVEFSIQVDSNGNVVGKLPNLRTPDLLGLRV--VFNNFLD---DDTVLFG-E----FEQYTL-V--ERENITID 383 (425) T ss_pred hHHHHHHHHHHhhcCCCCceeeccCCCCCccccceee--EEcCcCC---CccEEEE-e----cccEEE-E--eecceEEE Confidence 65 222211 1112 22223 32333222 335543 1111110 1112221 0 000000 0 00112223 Q ss_pred EeccCCCcCcccccccceEEEEEEEecCCccccccceeeeecCCCCce Q lcl|NC_019448. 296 VETKQKGAFEDEEDRAGLSYKVVVNSDDAQSAPSEEVTATVSNVDDGV 343 (463) Q Consensus 296 ~~~~~~g~~~~~~~~a~ysYkV~a~s~~geS~~S~~vt~Tva~~~~gv 343 (463) ...... | ......|++...-+..=--|-..+..+|+....|- T Consensus 384 ~~~~~~--f----~~~~~~~~~~~r~d~~~~~~~a~~~~~i~~~~~g~ 425 (425) T protein:vir:95 384 SSTHVK--F----TEDQTAFRGKGRFDGKPVKPEAFVLVTITDPVQGA 425 (425) T ss_pred eecccc--c----ccCceEEEEEEeeCcEeecccceEEEEecCcCCCC Confidence 222221 1 11234455554444332334444455555533344 No 77 >protein:vir:7324 Length: 335 # NCBI annotation: hypothetical protein # Family: family:all:1903 # MgeID: mge:143 # MgeName: epsilon15 # Cross-refs: genbank:acc:NP_848215;genbank:gi:30387386;genbank:GeneID:2641870 Probab=95.75 E-value=0.00016 Score=41.41 Aligned_cols=264 Identities=17% Similarity=0.150 Sum_probs=121.9 Q ss_pred CCCCCccchHHHHhhhhhhHHHHHHhhcCCccCCccccCccccchhhhhhHhhhhhccccccchhhhcccchhhH-HHhh Q lcl|NC_019448. 1 MTIEKNLSDVQQKYADQFQEDVVKSFQTGYGITPDTQIDAGALRREILDDQITMLTWTNEDLIFYRDISRRPAQS-TVVK 79 (463) Q Consensus 1 ~~~~~~~~~~~~~~~k~~~e~~~Ks~~agy~~~p~~q~~gaalr~esLd~~i~~L~~~~~df~f~~~i~k~~~~s-tv~e 79 (463) ||.=...-.+-...+|.+..+ . + +..-|..|+..+. ++.++|=...++ |=|. T Consensus 1 m~~~~~~a~TL~E~Akr~~~d--------------~------~----~~~IIE~l~~tne---IL~~lpf~e~N~~tg~~ 53 (335) T protein:vir:73 1 MALIGQTLPSLLDIYNRTDKN--------------G------R----IARIVEQLAKTND---ILTDAIYVPCNDGSKHK 53 (335) T ss_pred CCcCCCCchhHHHHHhhcCcc--------------h------h----HHHHHHHHhcCch---HHhhcchhcccCCcccc Confidence 443333323332233322100 0 0 1111111222221 122222222222 1133 Q ss_pred hhhhhccCcccccccccccCcccccCcceEEEEEEEEEeechhhhhhhhhh-hcccccHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_019448. 80 YDQYLRHGNVGHSRFVKEIGVAPVSDPNIRQKTVSMKYVSDTKNMSIASGL-VNNIADPSQILTEDAIAVVAKTIEWASF 158 (463) Q Consensus 80 y~~~~~hG~~g~~~fv~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~l-vn~~~Dp~~~~~~~ai~~~~~~~E~a~f 158 (463) +..++.-. ...|-.=....+-+.++..|++..++.|..-..|-+...- ..+..+-+++|.+.-|..+.++++..+| T Consensus 54 ~~vrt~LP---~~~fR~lN~g~~~s~~tt~qvt~~l~ilgg~~eVDr~La~~~Gn~a~~ra~e~~~~ikam~q~~~~~~i 130 (335) T protein:vir:73 54 TTIRAGIP---EPVWRRYNQGVQPTKTQTVPVTDTTGMLYDLGFVDKALADRSNNAAAFRVSENMGKLQGFNNKVARYSI 130 (335) T ss_pred eeEEEecC---CchhhhcCCccccccceEEEEEEEEEEecchhhhhHHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHhc Confidence 33333222 2333222233445669999999999999999999765544 4446788999999999999999999999 Q ss_pred hcccccCCCccccccccccceeeec---------CcceEeccCCCCCHHHHhhhhhhhhhcCCceeEEecCHHHHHHHHH Q lcl|NC_019448. 159 YGDASLTSEVEGEGLEFDGLAKLID---------KNNVINAKGNQLTEKHLNEAAVRIGKGFGTATDAYMPIGVHADFVN 229 (463) Q Consensus 159 yGd~~l~~~~~~~gleFDGl~~lI~---------~~nviDarG~~ls~~~ln~aa~~i~~~~G~~td~~m~~~vka~f~~ 229 (463) |||++.+|. +||||.+..+ ..|+||+.|.--..-+|+- +..++ ..+.=+| |-+-|+-|+- T Consensus 131 yGDsa~~p~------~FdGL~kR~~~~st~~a~~a~~iIdaGGtG~~~TSi~~--v~wg~--~~~~giy-PkG~kaGl~~ 199 (335) T protein:vir:73 131 YGNTDAEPE------AFMGLAPRFNTLSTSKAASAENVFSAGGSGSTNTSIWF--MSWGE--NTAHMIY-PEGMVAGFQH 199 (335) T ss_pred cCCcCCChh------hccchhhhhcCccccccCcccceeeccccccCceEEEE--EEEcC--CeeEEEc-ccCcccccee Confidence 999999874 8999998652 4699999887655444331 11111 2222344 8888888876 Q ss_pred HhcCcceEEeecCCCCcc---------cceecCeee--eccccccc-------------------------CCce----- Q lcl|NC_019448. 230 SILGRQMQLMQDNSGNVN---------TGYSVNGFY--SSRGFIKL-------------------------HGST----- 268 (463) Q Consensus 230 ~~~~~qrv~~~~n~g~~~---------~G~~v~~~~--s~~G~i~l-------------------------~~s~----- 268 (463) .=++.|... ..+.+.+. .|..|...- .-..+|-. -+++ T Consensus 200 ~d~g~~~~~-d~~G~~y~~~~~~~~w~~Gl~i~d~r~vvRI~NIdvs~l~~d~~~~~~l~~lmi~a~~~~~ip~~~~~~~ 278 (335) T protein:vir:73 200 EDLGDDLVS-DGNGGQFRAYRDEFKWDIGLSVRDWRSISRICNIDVTTLTKDASTGADLISMMVDAYYARDVAMLGDGKE 278 (335) T ss_pred eeccceeee-cCCCCEEeEEEeeeeeeeeeEEeCcccEEEEeecccccccccccchhhHHhhHHHHHHHHhccCCCCCce Confidence 666776543 33333221 121111100 00111100 0000 Q ss_pred -e-ccCcc--ccccccccCC------CCCCCCeeE---------EEEeccCCCcCcccccccceEEEEEE Q lcl|NC_019448. 269 -V-MENEL--ILDESLQPLP------NAPQPAKVT---------ATVETKQKGAFEDEEDRAGLSYKVVV 319 (463) Q Consensus 269 -~-~~~d~--~l~~~~~~~p------~ap~p~~vt---------at~~~~~~g~~~~~~~~a~ysYkV~a 319 (463) | |+.+. +|+...+.-. .-+.+-.++ ..+..+.- . +||+ T Consensus 279 ~~y~n~~v~~~L~~q~~~~~n~~l~~~~~~g~~~t~~~gipir~~Dail~tE-----~--------~v~~ 335 (335) T protein:vir:73 279 VIYANKTIHAWLHKQAMNAKNVNLTIEEYGGKKIVSFLGIPIRRVDAILNTE-----S--------AVTA 335 (335) T ss_pred EEEechHHHHHHHHHHhccCceeeeeeccCCceeEEECCeEEEEEeeeecCc-----c--------cccC Confidence 0 11100 0111000000 001111111 01111110 0 1122 No 78 >protein:vir:4700 Length: 415 # NCBI annotation: phi PVL ORF 7 homologue # Family: family:all:21 # MgeID: mge:102 # MgeName: phiPV83 # Cross-refs: genbank:acc:NP_061632;genbank:gi:9635719;genbank:GeneID:1262976 Probab=95.46 E-value=0.002 Score=35.35 Aligned_cols=313 Identities=12% Similarity=-0.006 Sum_probs=135.0 Q ss_pred CCCCCccchHH----H----Hhh-----hhhhHHHHHHhhc----CCccC--CccccCccccchhhhhhHhhhhhccccc Q lcl|NC_019448. 1 MTIEKNLSDVQ----Q----KYA-----DQFQEDVVKSFQT----GYGIT--PDTQIDAGALRREILDDQITMLTWTNED 61 (463) Q Consensus 1 ~~~~~~~~~~~----~----~~~-----k~~~e~~~Ks~~a----gy~~~--p~~q~~gaalr~esLd~~i~~L~~~~~d 61 (463) ...+....... . ... ......-.++|.. +.... ..+-.+|+.+--+.+..+|..+.. +. T Consensus 71 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~~g~~~iP~~~~~~ii~~~~--~~ 148 (415) T protein:vir:47 71 NQQSVEVNEARTYRNQANINDLGISIQNTKVTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKE--VE 148 (415) T ss_pred cccccccchhhhhHHHHHHHHHHHhhhhhhhhHHHHHHHHHHHhhhhhhhhccccccCCcccccHHHHHHHHHHHH--hh Confidence 00000000000 0 000 0011112233321 11111 111124555555555555533332 33 Q ss_pred cchhhhcccchhhHHHhhhhhhhccCcccccccccccCccc-ccCcceEEEEEEEEEeechhhhhhhhhhhcccccHHHH Q lcl|NC_019448. 62 LIFYRDISRRPAQSTVVKYDQYLRHGNVGHSRFVKEIGVAP-VSDPNIRQKTVSMKYVSDTKNMSIASGLVNNIADPSQI 140 (463) Q Consensus 62 f~f~~~i~k~~~~stv~ey~~~~~hG~~g~~~fv~E~g~~~-~~d~~~~r~~~~~k~l~~~~~vs~~~~lvn~~~Dp~~~ 140 (463) ..+++.+...++.+.-..|......++ ....+++|++..+ .+++.+.......+-++.-..+|.-+- .++..|.+.. T Consensus 149 ~~l~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~v~Eg~~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell-~ds~~~l~~~ 226 (415) T protein:vir:47 149 FNLDKYVTVKRVTNGSGKYPVVRQSEV-AALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREAI-EDAKVNVLQE 226 (415) T ss_pred hhhhhhcceeeccCCceeEEEEEecCC-cceeecccccccccccccceeeEEeeeeeeEeeehhhHHHH-hhchHHHHHH Confidence 345555555556555555555544433 3566899998876 578999999999999998877776432 3445678888 Q ss_pred HHHHHHHHHHHHHHHHHhhcccccCCCccccccccccceeeecCcceEeccCCCCCHHHHhhhhhhhhhcCCceeEEecC Q lcl|NC_019448. 141 LTEDAIAVVAKTIEWASFYGDASLTSEVEGEGLEFDGLAKLIDKNNVINAKGNQLTEKHLNEAAVRIGKGFGTATDAYMP 220 (463) Q Consensus 141 ~~~~ai~~~~~~~E~a~fyGd~~l~~~~~~~gleFDGl~~lI~~~nviDarG~~ls~~~ln~aa~~i~~~~G~~td~~m~ 220 (463) ..+.....++..++.+++.|+-.=.+ .-++.......+.....+. .+.+.|-.+-..+...|....-.+|| T Consensus 227 i~~~l~~~i~~~~d~~il~g~g~g~~--------~~~~~~~~~~~~~~~~~~~-~~~~~i~~~~~~~~~~~~~~~~~v~n 297 (415) T protein:vir:47 227 LKLWMARTIAATRNKAIIDVITKGST--------GSTSSGFEKEGKKLEVKKA-KSLDDIKDAINLNVKPNYEHNVAIVS 297 (415) T ss_pred HHHHHHHHHHHHHHHHHhhccccCCc--------cccccccccccceeccccc-cchHHHHHHHHhhhhhccCCCEEEEc Confidence 99999999999999999999754222 1112222222333333333 34455555555556667667778999 Q ss_pred HHHHHHHHHHhcCcceEEeecCCCC----cccceecCeeeecccc-cccCC-ceecc--CccccccccccCCCCCCCCee Q lcl|NC_019448. 221 IGVHADFVNSILGRQMQLMQDNSGN----VNTGYSVNGFYSSRGF-IKLHG-STVME--NELILDESLQPLPNAPQPAKV 292 (463) Q Consensus 221 ~~vka~f~~~~~~~qrv~~~~n~g~----~~~G~~v~~~~s~~G~-i~l~~-s~~~~--~d~~l~~~~~~~p~ap~p~~v 292 (463) +.+.+.|...-=..-|.+.+++..+ .-.|++|- .+..-. ..... .+++. .+.++...+. . + T Consensus 298 ~~~~~~L~~lkd~~G~~i~~~~~~~~~~~~l~G~pV~--~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~-------~--~ 366 (415) T protein:vir:47 298 QTMFAKLDKMKDKLGNYLIQPDVKEKTQQRLLGAKIE--ILPDEVLGQKGNNTLIIGNLKDAIVLFDRS-------Q--Y 366 (415) T ss_pred HHHHHHHHHhhccCCCeeeccCcCCCCCccccceeeE--EeccccccCCCccEEEEEehhccEEEEeec-------c--e Confidence 9999988753212223444433222 23355431 110000 00000 00110 0000000000 0 0 Q ss_pred EEEEeccCCCcCcccccccceEEEEEEEec----CCccccccceeeeecCCCCceEEEE Q lcl|NC_019448. 293 TATVETKQKGAFEDEEDRAGLSYKVVVNSD----DAQSAPSEEVTATVSNVDDGVKLSI 347 (463) Q Consensus 293 tat~~~~~~g~~~~~~~~a~ysYkV~a~s~----~geS~~S~~vt~Tva~~~~gv~ltI 347 (463) +..... + ..+ ...+++...-+ +-++...-.+++++. ......|.- T Consensus 367 ~v~~~~------~-~~~--~~~~~~~~r~d~~v~~~~a~~~~~~~~~~~-~~~~~~~~~ 415 (415) T protein:vir:47 367 QASWTD------Y-MHF--GECLMIAVRQDCRILDYKSAIVIEYDDSER-GEGDLGLEA 415 (415) T ss_pred EEEeec------c-ccC--ceEEEEEEEeccEEeccccEEEEEeeccCC-CCCCccCCC Confidence 000000 0 000 00111111110 111111111111111 111112222 No 79 >protein:vir:4600 Length: 415 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:101 # MgeName: PVL # Cross-refs: genbank:acc:NP_058445;genbank:gi:9635171;genbank:GeneID:1262708 Probab=95.46 E-value=0.002 Score=35.35 Aligned_cols=313 Identities=12% Similarity=-0.006 Sum_probs=135.0 Q ss_pred CCCCCccchHH----H----Hhh-----hhhhHHHHHHhhc----CCccC--CccccCccccchhhhhhHhhhhhccccc Q lcl|NC_019448. 1 MTIEKNLSDVQ----Q----KYA-----DQFQEDVVKSFQT----GYGIT--PDTQIDAGALRREILDDQITMLTWTNED 61 (463) Q Consensus 1 ~~~~~~~~~~~----~----~~~-----k~~~e~~~Ks~~a----gy~~~--p~~q~~gaalr~esLd~~i~~L~~~~~d 61 (463) ...+....... . ... ......-.++|.. +.... ..+-.+|+.+--+.+..+|..+.. +. T Consensus 71 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~~g~~~iP~~~~~~ii~~~~--~~ 148 (415) T protein:vir:46 71 NQQSVEVNEARTYRNQANINDLGISIQNTKVTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKE--VE 148 (415) T ss_pred cccccccchhhhhHHHHHHHHHHHhhhhhhhhHHHHHHHHHHHhhhhhhhhccccccCCcccccHHHHHHHHHHHH--hh Confidence 00000000000 0 000 0011112233321 11111 111124555555555555533332 33 Q ss_pred cchhhhcccchhhHHHhhhhhhhccCcccccccccccCccc-ccCcceEEEEEEEEEeechhhhhhhhhhhcccccHHHH Q lcl|NC_019448. 62 LIFYRDISRRPAQSTVVKYDQYLRHGNVGHSRFVKEIGVAP-VSDPNIRQKTVSMKYVSDTKNMSIASGLVNNIADPSQI 140 (463) Q Consensus 62 f~f~~~i~k~~~~stv~ey~~~~~hG~~g~~~fv~E~g~~~-~~d~~~~r~~~~~k~l~~~~~vs~~~~lvn~~~Dp~~~ 140 (463) ..+++.+...++.+.-..|......++ ....+++|++..+ .+++.+.......+-++.-..+|.-+- .++..|.+.. T Consensus 149 ~~l~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~v~Eg~~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell-~ds~~~l~~~ 226 (415) T protein:vir:46 149 FNLDKYVTVKRVTNGSGKYPVVRQSEV-AALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREAI-EDAKVNVLQE 226 (415) T ss_pred hhhhhhcceeeccCCceeEEEEEecCC-cceeecccccccccccccceeeEEeeeeeeEeeehhhHHHH-hhchHHHHHH Confidence 345555555556555555555544433 3566899998876 578999999999999998877776432 3445678888 Q ss_pred HHHHHHHHHHHHHHHHHhhcccccCCCccccccccccceeeecCcceEeccCCCCCHHHHhhhhhhhhhcCCceeEEecC Q lcl|NC_019448. 141 LTEDAIAVVAKTIEWASFYGDASLTSEVEGEGLEFDGLAKLIDKNNVINAKGNQLTEKHLNEAAVRIGKGFGTATDAYMP 220 (463) Q Consensus 141 ~~~~ai~~~~~~~E~a~fyGd~~l~~~~~~~gleFDGl~~lI~~~nviDarG~~ls~~~ln~aa~~i~~~~G~~td~~m~ 220 (463) ..+.....++..++.+++.|+-.=.+ .-++.......+.....+. .+.+.|-.+-..+...|....-.+|| T Consensus 227 i~~~l~~~i~~~~d~~il~g~g~g~~--------~~~~~~~~~~~~~~~~~~~-~~~~~i~~~~~~~~~~~~~~~~~v~n 297 (415) T protein:vir:46 227 LKLWMARTIAATRNKAIIDVITKGST--------GSTSSGFEKEGKKLEVKKA-KSLDDIKDAINLNVKPNYEHNVAIVS 297 (415) T ss_pred HHHHHHHHHHHHHHHHHhhccccCCc--------cccccccccccceeccccc-cchHHHHHHHHhhhhhccCCCEEEEc Confidence 99999999999999999999754222 1112222222333333333 34455555555556667667778999 Q ss_pred HHHHHHHHHHhcCcceEEeecCCCC----cccceecCeeeecccc-cccCC-ceecc--CccccccccccCCCCCCCCee Q lcl|NC_019448. 221 IGVHADFVNSILGRQMQLMQDNSGN----VNTGYSVNGFYSSRGF-IKLHG-STVME--NELILDESLQPLPNAPQPAKV 292 (463) Q Consensus 221 ~~vka~f~~~~~~~qrv~~~~n~g~----~~~G~~v~~~~s~~G~-i~l~~-s~~~~--~d~~l~~~~~~~p~ap~p~~v 292 (463) +.+.+.|...-=..-|.+.+++..+ .-.|++|- .+..-. ..... .+++. .+.++...+. . + T Consensus 298 ~~~~~~L~~lkd~~G~~i~~~~~~~~~~~~l~G~pV~--~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~-------~--~ 366 (415) T protein:vir:46 298 QTMFAKLDKMKDKLGNYLIQPDVKEKTQQRLLGAKIE--ILPDEVLGQKGNNTLIIGNLKDAIVLFDRS-------Q--Y 366 (415) T ss_pred HHHHHHHHHhhccCCCeeeccCcCCCCCccccceeeE--EeccccccCCCccEEEEEehhccEEEEeec-------c--e Confidence 9999988753212223444433222 23355431 110000 00000 00110 0000000000 0 0 Q ss_pred EEEEeccCCCcCcccccccceEEEEEEEec----CCccccccceeeeecCCCCceEEEE Q lcl|NC_019448. 293 TATVETKQKGAFEDEEDRAGLSYKVVVNSD----DAQSAPSEEVTATVSNVDDGVKLSI 347 (463) Q Consensus 293 tat~~~~~~g~~~~~~~~a~ysYkV~a~s~----~geS~~S~~vt~Tva~~~~gv~ltI 347 (463) +..... + ..+ ...+++...-+ +-++...-.+++++. ......|.- T Consensus 367 ~v~~~~------~-~~~--~~~~~~~~r~d~~v~~~~a~~~~~~~~~~~-~~~~~~~~~ 415 (415) T protein:vir:46 367 QASWTD------Y-MHF--GECLMIAVRQDCRILDYKSAIVIEYDDSER-GEGDLGLEA 415 (415) T ss_pred EEEeec------c-ccC--ceEEEEEEEeccEEeccccEEEEEeeccCC-CCCCccCCC Confidence 000000 0 000 00111111110 111111111111111 111112222 No 80 >protein:vir:4197 Length: 314 # NCBI annotation: putative structural protein # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:88 # MgeName: psiM100 # Cross-refs: genbank:acc:NP_071822;genbank:gi:11863105;genbank:GeneID:1257607 Probab=95.43 E-value=0.0021 Score=35.28 Aligned_cols=297 Identities=14% Similarity=0.095 Sum_probs=147.7 Q ss_pred hHHHHHHhhcCCccCCccccCccccchhhhhhHhhhhhccccccchhhhcccch-hhHHHhhhhhhhccCccc-c-cccc Q lcl|NC_019448. 19 QEDVVKSFQTGYGITPDTQIDAGALRREILDDQITMLTWTNEDLIFYRDISRRP-AQSTVVKYDQYLRHGNVG-H-SRFV 95 (463) Q Consensus 19 ~e~~~Ks~~agy~~~p~~q~~gaalr~esLd~~i~~L~~~~~df~f~~~i~k~~-~~stv~ey~~~~~hG~~g-~-~~fv 95 (463) -|++.|.++.=-++|-. -.+||-|.-|-+++.|..|...+ .|.+.+...+ ..|.-.+..+. ++|+.- . ..-. T Consensus 1 ~~~~~~~~~~~k~it~~-d~~gG~L~P~~~~~~i~~l~e~s---~i~~~a~vi~t~~s~~~~i~~i-~~g~~~~~~~~~~ 75 (314) T protein:vir:41 1 MDFLNKPFQITPKIDVP-DLGKGILAVQRFGEFVREVRENS---AIIKDARVLNALKSYEVDISRI-SLGVELEPGRNTS 75 (314) T ss_pred CchhhhHHHhhcccccc-cCCCceeChHHHHHHHHHHHhcc---chhhheeeecccCccceeeccc-ccCcccccccccc Confidence 24444544432223322 23577899888887776665332 2333332211 12222222222 333321 1 1122 Q ss_pred cccCcccccCcceEEEEEEEEEeechhhhhhhhhhhccc--ccHHHHHHHHHHHHHHHHHHHHHhhcccccCCCcccccc Q lcl|NC_019448. 96 KEIGVAPVSDPNIRQKTVSMKYVSDTKNMSIASGLVNNI--ADPSQILTEDAIAVVAKTIEWASFYGDASLTSEVEGEGL 173 (463) Q Consensus 96 ~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lvn~~--~Dp~~~~~~~ai~~~~~~~E~a~fyGd~~l~~~~~~~gl 173 (463) +|......+|+++.+....+|=|+.--.+|.- -|.++. .|-+....+.=..++....|.+.|-||.+..+.. +.-- T Consensus 76 ~~~~~~~~~~~tf~~~~l~~~kl~~~v~is~e-~L~D~a~~~~le~~i~~~~Ae~~g~~~~~~~~nGdg~~~s~~-~~~~ 153 (314) T protein:vir:41 76 GTKVAPTADEVTVSTNTLEMKELVTKVVLEDE-ALEDNIEQSAFEQTITSLLASGVTYDLECFFLHADSSLTTGR-ELYR 153 (314) T ss_pred cCCccCCcccccccceeeeeEEEEEeecccHH-HHHhhhchhhHHHHHHHHHHHHHHHHHHHHhhccccCCcCcc-cchh Confidence 45555677889999988888877765444432 234554 3788888888889999999999999998754321 1112 Q ss_pred ccccceeeecCcceEecc--CCCCCHHHHhhhhhhhhhcC-Cce--eEEecCHHHHHHHHHHhcCcceEEeecC---CCC Q lcl|NC_019448. 174 EFDGLAKLIDKNNVINAK--GNQLTEKHLNEAAVRIGKGF-GTA--TDAYMPIGVHADFVNSILGRQMQLMQDN---SGN 245 (463) Q Consensus 174 eFDGl~~lI~~~nviDar--G~~ls~~~ln~aa~~i~~~~-G~~--td~~m~~~vka~f~~~~~~~qrv~~~~n---~g~ 245 (463) +.||+.+... ..+.++. +...+.+.|..+-.-+...| -.. --.+|+..+...+...+-++++-+.++. .+. T Consensus 154 ~p~G~l~~a~-~~~~~~~~~~~~~~~~~~~~l~~sl~~~yr~~~~~~~~~m~~~t~~~~r~~l~~~~~~l~~~~~~~~~~ 232 (314) T protein:vir:41 154 INDGWMKLAG-NQYTDAEPEDENWPLNLFDGMMDELDTRYLQLKPRMKFYVSNEIYNGYRKQLLVRETGLGDSALIGATG 232 (314) T ss_pred cchhhhhhcc-cceeecCccccccHHHHHHHHHHhcCchhhcCCCceEEEecHHHHHHHHHHHhccCCcccchhhhCCCC Confidence 7899988654 2344433 34466777777766666666 223 3478999999888876666665543331 111 Q ss_pred -cccceecCeeeecccccccCCceeccCc-cccccccccCCCCCCCCeeEEEEeccCCCcCcccccccceEEEEEEEecC Q lcl|NC_019448. 246 -VNTGYSVNGFYSSRGFIKLHGSTVMENE-LILDESLQPLPNAPQPAKVTATVETKQKGAFEDEEDRAGLSYKVVVNSDD 323 (463) Q Consensus 246 -~~~G~~v~~~~s~~G~i~l~~s~~~~~d-~~l~~~~~~~p~ap~p~~vtat~~~~~~g~~~~~~~~a~ysYkV~a~s~~ 323 (463) .-.|++|-..-...+ +......+|-.| ..+.-.. .-.+........ ....+.|..+.--+. T Consensus 233 ~~l~G~PV~~~~~~~~-~~~~~~~i~fgd~~nlv~~~--------~~~ir~~~~~~a--------~~~~~~~~~~~r~d~ 295 (314) T protein:vir:41 233 LQYDGIPIQYVPALDA-LGDDKARALLTVPTNLVYGF--------WRNIRIEPKRDA--------AMRRTEYIASLRADC 295 (314) T ss_pred ceecceeeEecccccc-cCCCCceEEEechhheEEEe--------eceeEEeecccC--------cCCeEEEEEEEEece Confidence 123776543222111 111222222111 1110000 000111111111 112333334322221 Q ss_pred C--ccccccceeeeecCCCCc Q lcl|NC_019448. 324 A--QSAPSEEVTATVSNVDDG 342 (463) Q Consensus 324 g--eS~~S~~vt~Tva~~~~g 342 (463) + ++ ..++-+++-..++| T Consensus 296 ~~~~~--~aa~~~~~~~~~~~ 314 (314) T protein:vir:41 296 NYEDE--NAAVAAVIDMSSGG 314 (314) T ss_pred EEEEc--CcEEEEEeeccCCC Confidence 1 22 12234444445555 No 81 >protein:vir:94771 Length: 298 # NCBI annotation: major head protein # Family: family:all:966 # MgeID: mge:1529 # MgeName: phi LC3 # Cross-refs: genbank:acc:NP_996706;genbank:gi:45597421;genbank:GeneID:2769044 Probab=95.30 E-value=0.0023 Score=35.02 Aligned_cols=282 Identities=11% Similarity=0.042 Sum_probs=128.1 Q ss_pred c-cCccc-cchhhhhhHhhhhhccccccchhhhcccchhhHHHhhhhhhhccCcccccccccccCcccccCcceEEEEEE Q lcl|NC_019448. 37 Q-IDAGA-LRREILDDQITMLTWTNEDLIFYRDISRRPAQSTVVKYDQYLRHGNVGHSRFVKEIGVAPVSDPNIRQKTVS 114 (463) Q Consensus 37 q-~~gaa-lr~esLd~~i~~L~~~~~df~f~~~i~k~~~~stv~ey~~~~~hG~~g~~~fv~E~g~~~~~d~~~~r~~~~ 114 (463) | .+||. +..|...+-+..|.. ...+.+-.+..+..+.-.+|.++... +...+++|++..+.+++.+.+.... T Consensus 1 ma~~gG~lip~~~~~~ii~~~~~---~s~i~~~~~~~~~~~~~~~~p~~~~~---~~a~~v~Eg~~~~~~~~~f~~v~l~ 74 (298) T protein:vir:94 1 MVLNKGTLFDPELVTDLISKVAG---KSSIARLSAQKPIPFNGEKVFTFTMD---SEIDVVAESGKKTHGGVTLAPQTMV 74 (298) T ss_pred CeeccccccChhHHHHHHHHHHh---hchhhhhcceeeccCCceEEEEEecC---cceEEeeCCccccccccceeEEEEe Confidence 2 23333 444443333333322 22344444444455433456666543 3457899999999999999999999 Q ss_pred EEEeechhhhhhhhhhhc--ccccHHHHHHHHHHHHHHHHHHHHHhhcccccCCCccccccccccceeeecC-cce--Ee Q lcl|NC_019448. 115 MKYVSDTKNMSIASGLVN--NIADPSQILTEDAIAVVAKTIEWASFYGDASLTSEVEGEGLEFDGLAKLIDK-NNV--IN 189 (463) Q Consensus 115 ~k~l~~~~~vs~~~~lvn--~~~Dp~~~~~~~ai~~~~~~~E~a~fyGd~~l~~~~~~~gleFDGl~~lI~~-~nv--iD 189 (463) .+=++.--.+|.-+-..+ ...+.++...++-...+++.+|.++++|...-+ +......|....+.. .+. .+ T Consensus 75 ~~k~~~~~~iS~ell~~~~~~~~~l~~~i~~~la~ai~~~~d~~~l~G~~~~~----g~~~~~~~~~~~~~~~~~~~~~~ 150 (298) T protein:vir:94 75 PIKVEYGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRL----GTASAVIGTNHFDSKVTQKVEAP 150 (298) T ss_pred eeEEEEeeehhHHHhccCCccHHHHHHHHHHHHHHHHHHHHHHHhhcccccCC----Ccccccccccccccccccccccc Confidence 998988777777653222 234667888889999999999999999954321 112223333333321 121 11 Q ss_pred ccCCCCCHHHHhhhhhhhhhcCCceeEEecCHHHHHHHHHHhcCcceEEeecCCCCcccceecCeeeecccccccCCcee Q lcl|NC_019448. 190 AKGNQLTEKHLNEAAVRIGKGFGTATDAYMPIGVHADFVNSILGRQMQLMQDNSGNVNTGYSVNGFYSSRGFIKLHGSTV 269 (463) Q Consensus 190 arG~~ls~~~ln~aa~~i~~~~G~~td~~m~~~vka~f~~~~~~~qrv~~~~n~g~~~~G~~v~~~~s~~G~i~l~~s~~ 269 (463) ..+. ...+.|.++...+...+..+.-..|++.+.+.+...--..-|.+.++...+... .++ T Consensus 151 ~~~~-~~~~~i~~~~~~~~~~~~~~~~~vmn~~~~~~l~~lkd~~G~~l~~~~~~~~~~------------------~tl 211 (298) T protein:vir:94 151 RGIA-DPNGAIENAVELLTGVDADVTGIAINPSFRSALAKQKDLQGNALFPELKWGATP------------------DTI 211 (298) T ss_pred cccc-cHHHHHHHHHHhhhhcCCCccEEEEcHHHHHHHHHhhccCCCeeecCcccCCCC------------------cee Confidence 1111 123456777778888888888899999999999753222223343332211110 111 Q ss_pred ccCccccccccccCCCCCCCCeeEEEEeccCCCcCcccc------cccceEEEEEEEecCCccccccceeeeecCCC-Cc Q lcl|NC_019448. 270 MENELILDESLQPLPNAPQPAKVTATVETKQKGAFEDEE------DRAGLSYKVVVNSDDAQSAPSEEVTATVSNVD-DG 342 (463) Q Consensus 270 ~~~d~~l~~~~~~~p~ap~p~~vtat~~~~~~g~~~~~~------~~a~ysYkV~a~s~~geS~~S~~vt~Tva~~~-~g 342 (463) +..+...... .|..... +....-++|.. ....+.+++ +++++.--+ .+.... +- T Consensus 212 ~G~PV~~~~~---v~~~~~~--------~~~~~~~Gdfs~~~~~~~~~~~~~~~---~~~~~~d~~-----~~~~f~~~~ 272 (298) T protein:vir:94 212 NGLPVDVNKT---VSDMSLT--------QRDRAIIGDFANGFKWGYAKEVPLEV---IQYGDPDNS-----GLDLKGYNQ 272 (298) T ss_pred cceeeEEecc---cccccCC--------CccEEEEeeccceEEEEEecCceEEE---eecCCCcCc-----chhhhhcCc Confidence 1111111000 0000000 00000011111 001111111 111110000 000000 00 Q ss_pred eEE--EEEecCCCCCCcceEEEEeecC Q lcl|NC_019448. 343 VKL--SISVNAMYQQQPQFVSIYRQGK 367 (463) Q Consensus 343 v~l--tIt~~a~~g~~~~~y~IYR~~~ 367 (463) +.+ ..-+... =..|+-+....... T Consensus 273 v~~r~~~r~~~~-~~~~~a~~~l~~~t 298 (298) T protein:vir:94 273 VYIRAELFLGWG-ILDATKFARVTEAN 298 (298) T ss_pred EEEEEEEEeccE-eecccceEEEEecC Confidence 111 1111000 01111122221111 No 82 >protein:vir:1025 Length: 408 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:20 # MgeName: bIL286 # Cross-refs: genbank:acc:NP_076679;genbank:gi:13095788;genbank:GeneID:920362 Probab=95.15 E-value=0.0026 Score=34.71 Aligned_cols=304 Identities=11% Similarity=0.014 Sum_probs=130.0 Q ss_pred CCCCCccchHHHHhhhhhhHHHH-----------HHhhcCCccCCccccCccccchhhhhhHhhhhhccccccchhhhcc Q lcl|NC_019448. 1 MTIEKNLSDVQQKYADQFQEDVV-----------KSFQTGYGITPDTQIDAGALRREILDDQITMLTWTNEDLIFYRDIS 69 (463) Q Consensus 1 ~~~~~~~~~~~~~~~k~~~e~~~-----------Ks~~agy~~~p~~q~~gaalr~esLd~~i~~L~~~~~df~f~~~i~ 69 (463) .+.+.......+...+.|.+.+. |++.+| +..+|+.|--+.+..+|-.+.. +.-.+.+-+. T Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~------t~~~gg~~vP~~~~~~Ii~~~~--~~~~l~~~~~ 151 (408) T protein:vir:10 80 GPLNKSENELKDKFVKDFVNMVRNPMAFMNTVSSKTETSG------SDSAAGLTIPQDIRTMINTLVR--QYDSLQQYVR 151 (408) T ss_pred cccccchhhhHHHHHHHHHHHhhcchhhhhhhhhhhhhcc------cccCCceeccHhHHHHHHHHHH--hhchhhhhcc Confidence 22222222222333333321111 222221 2234555444555556633333 3334555555 Q ss_pred cchhhHHHhhhhhhhccCcccccccccccCccc-ccCcceEEEEEEEEEeechhhhhhhhhhhcccccHHHHHHHHHHHH Q lcl|NC_019448. 70 RRPAQSTVVKYDQYLRHGNVGHSRFVKEIGVAP-VSDPNIRQKTVSMKYVSDTKNMSIASGLVNNIADPSQILTEDAIAV 148 (463) Q Consensus 70 k~~~~stv~ey~~~~~hG~~g~~~fv~E~g~~~-~~d~~~~r~~~~~k~l~~~~~vs~~~~lvn~~~Dp~~~~~~~ai~~ 148 (463) ..++.+....+......++.+...+++|++..+ .+++.+.......+-++.-..+|.-+ +.++.-|......+.-... T Consensus 152 ~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~i~~~~~k~~~~~~iS~el-l~ds~~~l~~~i~~~l~~~ 230 (408) T protein:vir:10 152 VESVSTSNGSRVYEKWTDVTPLTVMDAEDGKIPDLDNPQLTIIKYLIKRYAGIITATNTS-LKDTAENILAWLSSWIAKK 230 (408) T ss_pred eeeccCCcceEEEeeccccccceeeecCccccccccCcceeeEEeeeeeEEeeehhHHHH-HhhchHHHHHHHHHHHHHH Confidence 555555444443333334456778999998765 67799999999999998877777653 2345567788888888999 Q ss_pred HHHHHHHHHhhcccccCCCccccccccccceeeecCcceEeccCCCCCHHHHhh-hhhhhhhcCCceeEEecCHHHHHHH Q lcl|NC_019448. 149 VAKTIEWASFYGDASLTSEVEGEGLEFDGLAKLIDKNNVINAKGNQLTEKHLNE-AAVRIGKGFGTATDAYMPIGVHADF 227 (463) Q Consensus 149 ~~~~~E~a~fyGd~~l~~~~~~~gleFDGl~~lI~~~nviDarG~~ls~~~ln~-aa~~i~~~~G~~td~~m~~~vka~f 227 (463) +..+++.+++.|+.+-.+.. ...+.+.|.. ....+..+|-..--.+|++.+.+.+ T Consensus 231 ~~~~~~~~il~g~g~~~~~~------------------------~~~~~~~l~~~~~~~~~~~~~~~a~~v~n~~~~~~l 286 (408) T protein:vir:10 231 VVVTRNQAIIEVMKAAPKKP------------------------TIAKFDDVITMINTAVDPAIIATSSLLTNQSGLNKL 286 (408) T ss_pred HHHHHHHHHhhccccccccc------------------------ccccHHHHHHHHHHhhhhhhccCCEEEEcHHHHHHH Confidence 99999999999998754321 1112222222 2233334442222467899999888 Q ss_pred HHHhcCcceEEeecCCCC----cccceecCeeee-cccccccCCc-eecc--CccccccccccCCCCCCCCeeEEEEecc Q lcl|NC_019448. 228 VNSILGRQMQLMQDNSGN----VNTGYSVNGFYS-SRGFIKLHGS-TVME--NELILDESLQPLPNAPQPAKVTATVETK 299 (463) Q Consensus 228 ~~~~~~~qrv~~~~n~g~----~~~G~~v~~~~s-~~G~i~l~~s-~~~~--~d~~l~~~~~~~p~ap~p~~vtat~~~~ 299 (463) ...--..-|.+.+++..+ .-.|++|--.-+ .-+....+.. .++. .+.++... + ..++...... T Consensus 287 ~~lkd~~G~~i~~~~~~~~~~~~l~G~PV~~~~~~~~~~~~~~~~~i~~gd~~~~~~~~~-----~----~~~~v~~~~~ 357 (408) T protein:vir:10 287 ALVKTAEGKYLLEPDPTKPNSYLIKGKQVIVVADRWLPNTGSTVYPLYYGDMSQAITLFD-----R----ENMSLLPTNI 357 (408) T ss_pred HHhhccCCceEeccCcCCCCCceecceeeEEecccccCccCCCceEEEEEehhccEEEEE-----e----cceEEEEccc Confidence 753322224444443222 233444310000 0011100000 1111 00000000 0 0011111111 Q ss_pred CCCcCcccccccceEEEEEEE----ecCCccccccceeeeecCCCCceEEEEEecCC Q lcl|NC_019448. 300 QKGAFEDEEDRAGLSYKVVVN----SDDAQSAPSEEVTATVSNVDDGVKLSISVNAM 352 (463) Q Consensus 300 ~~g~~~~~~~~a~ysYkV~a~----s~~geS~~S~~vt~Tva~~~~gv~ltIt~~a~ 352 (463) ....|. .+ ...|++... -.+-++. ..++.+.+....|.+-+-+..++ T Consensus 358 ~~~~f~--~~--~~~~r~~~r~d~~v~~~~a~--~~~~~~~~~~~~~~~~~~~~~~~ 408 (408) T protein:vir:10 358 GAGAFE--TD--TTKIRVIDRFDVKATDSEAL--VAGSFSAIADQVGNFKTTTSTAV 408 (408) T ss_pred ccchhh--cC--ceEEEEEEeeccEEeccccE--EEEEeeccccCCCCCCCCCcccC Confidence 111110 00 111111110 0111111 11111111111111111111122 No 83 >protein:vir:5739 Length: 366 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:122 # MgeName: PY54 # Cross-refs: genbank:acc:NP_892050;genbank:gi:33770513;interpro:IPR006444;uniprot:Q7Y410;genbank:GeneID:1732928 Probab=94.94 E-value=0.0031 Score=34.31 Aligned_cols=314 Identities=13% Similarity=0.082 Sum_probs=140.6 Q ss_pred CCCCCccchHHH---Hhhhhhh---HHHHHHhh---cCCc-------cCCccccCcccc-chhhhhhHhhhhhccccccc Q lcl|NC_019448. 1 MTIEKNLSDVQQ---KYADQFQ---EDVVKSFQ---TGYG-------ITPDTQIDAGAL-RREILDDQITMLTWTNEDLI 63 (463) Q Consensus 1 ~~~~~~~~~~~~---~~~k~~~---e~~~Ks~~---agy~-------~~p~~q~~gaal-r~esLd~~i~~L~~~~~df~ 63 (463) ...+........ .+++.+. -+...+.+ ..++ ++..+. .||.| ..+ +..+|..+... .. T Consensus 19 ~~~~~~~~~kg~~~~~~~~a~a~~~g~~~~a~~~a~~~~~~~~~~~a~~~~~~-~Gg~lvP~~-~~~~ii~~l~~---~s 93 (366) T protein:vir:57 19 IIKEELQQYKGAGMTRMVMSIAAGKGNLADAAKFAATELGDTGLSMAISTAAG-SGGALIPQN-MQNEVIELLRD---RT 93 (366) T ss_pred ccccccccccchhHHHHHHHHHhcccchhHHHHHHHHhhcchhhhhhcccccc-CCccccchh-HHHHHHHHHhh---hc Confidence 111111111111 1111000 01111110 0011 111122 34444 544 44444333322 22 Q ss_pred hhhhcccchh--hHHHhhhhhhhccCcccccccccccCcccccCcceEEEEEEEEEeechhhhhhhhhhhcccccHHHHH Q lcl|NC_019448. 64 FYRDISRRPA--QSTVVKYDQYLRHGNVGHSRFVKEIGVAPVSDPNIRQKTVSMKYVSDTKNMSIASGLVNNIADPSQIL 141 (463) Q Consensus 64 f~~~i~k~~~--~stv~ey~~~~~hG~~g~~~fv~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lvn~~~Dp~~~~ 141 (463) .+..+..+.+ .+--.+|.+++. .....+++|++..+.+++.+.+.....+=++.--.+|.-+- .++.-|.+... T Consensus 94 ~l~~lg~~~v~~~~g~~~~p~~t~---~~~a~wv~E~~~~~~s~~~f~~i~~~~~k~~~~~~iS~ell-~ds~~~~~~~i 169 (366) T protein:vir:57 94 VVRILGARSIPLPNGNLSMPRLSG---GATAGYVGEGKDVVATGATFDDVKLSAKTMIALVPVSNQLI-GRAGFNVEQLL 169 (366) T ss_pred chhhhceeeeecCCCceEEEEEeC---CcceeeeccCccccccccceeEEEEeeEEEEEeehhhHHHH-hhhhHHHHHHH Confidence 2233222221 111123344442 23566899999999999999999999998887777765432 23445778889 Q ss_pred HHHHHHHHHHHHHHHHhhcccccCCCccccccccccceeeecC-cceEeccCCCCCHHHHhhhhhhhhh------cCCce Q lcl|NC_019448. 142 TEDAIAVVAKTIEWASFYGDASLTSEVEGEGLEFDGLAKLIDK-NNVINAKGNQLTEKHLNEAAVRIGK------GFGTA 214 (463) Q Consensus 142 ~~~ai~~~~~~~E~a~fyGd~~l~~~~~~~gleFDGl~~lI~~-~nviDarG~~ls~~~ln~aa~~i~~------~~G~~ 214 (463) .++-...+++.++.++++||-. + -+..||.+.... ....+..|..++...+......+.. .+... T Consensus 170 ~~~l~~a~~~~~d~a~l~G~G~-~-------~~p~Gi~~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~ 241 (366) T protein:vir:57 170 LGDILSAIATREDKAFLRDDGT-G-------DTPKGMKAVATAANRLVAWTGTAINLTTIDEYLDSLILKHMDSNSNMIR 241 (366) T ss_pred HHHHHHHHHHHHHHHhhccCCC-C-------ccccceeeccccccceeeccccccchhhHHHHHHHHHHhhhcccccccc Confidence 9999999999999999999852 1 257788776653 4556666667766655543333322 23334 Q ss_pred eEEecCHHHHHHHHHHhcCcceEEeecCCCCcccceecCe--eeec-ccccccCCceecc--CccccccccccCCCCCCC Q lcl|NC_019448. 215 TDAYMPIGVHADFVNSILGRQMQLMQDNSGNVNTGYSVNG--FYSS-RGFIKLHGSTVME--NELILDESLQPLPNAPQP 289 (463) Q Consensus 215 td~~m~~~vka~f~~~~~~~qrv~~~~n~g~~~~G~~v~~--~~s~-~G~i~l~~s~~~~--~d~~l~~~~~~~p~ap~p 289 (463) .-..|++.+.+.+...--..-|.+.++..++.-.|++|-- .+-. .|.-.-.+.+++. .+.++-.. .. T Consensus 242 a~~vmn~~~~~~L~~lkd~~G~~l~~~~~~g~l~G~Pvv~s~~ip~~~~~~~~~~~i~~gdfs~~~i~~~--------~~ 313 (366) T protein:vir:57 242 CGWGLSNRTYMTLFGLRDGNGNKVYPEMSQGILKGYPIQRTSAIPANLGDDGNESEIYFCDFNDVVIGED--------GM 313 (366) T ss_pred CEEEecHHHHHHHHhhhccCCceeccCCCCCeecceeeEEccccccccccCCCccEEEEEecceEEEEEe--------cc Confidence 4468999999988753322223444444444455665421 1000 0000000111111 11111000 00 Q ss_pred CeeEEE---EeccCCCcCcccccccceEEEEEEEecCCccccccceeeeecCC Q lcl|NC_019448. 290 AKVTAT---VETKQKGAFEDEEDRAGLSYKVVVNSDDAQSAPSEEVTATVSNV 339 (463) Q Consensus 290 ~~vtat---~~~~~~g~~~~~~~~a~ysYkV~a~s~~geS~~S~~vt~Tva~~ 339 (463) ..+... ....+.|...+--..-..-+++...-+-+=--|...+-.|-..+ T Consensus 314 i~i~~~~ea~~~~~~g~~~~~f~~~~~~iR~~~~~d~~v~~~~a~~~lt~~~~ 366 (366) T protein:vir:57 314 MKVDFSTEATYKDADGQLVSAFARNQSLIRVVTEHDIGFRHPEGLVLGTGVIW 366 (366) T ss_pred eEEEEeeccccccccccchhhhhcCceeEEeeeeeCcEeeccccEEEEecccC Confidence 000000 00111221110001112233333322222222333444454455 No 84 >protein:vir:103370 Length: 418 # NCBI annotation: hypothetical protein # Family: family:all:11266 # MgeID: mge:1621 # MgeName: PaP2 # Cross-refs: genbank:acc:YP_024741;genbank:gi:48697083;genbank:GeneID:2846038 Probab=94.79 E-value=0.00011 Score=42.24 Aligned_cols=321 Identities=12% Similarity=0.129 Sum_probs=134.1 Q ss_pred CCCCCccchHH------HHhhhhh----------hHHHHHHhhcCCc---------c---------CCccccCccccchh Q lcl|NC_019448. 1 MTIEKNLSDVQ------QKYADQF----------QEDVVKSFQTGYG---------I---------TPDTQIDAGALRRE 46 (463) Q Consensus 1 ~~~~~~~~~~~------~~~~k~~----------~e~~~Ks~~agy~---------~---------~p~~q~~gaalr~e 46 (463) --.|+.+...- .++.+.- .+--.||.+++|- + |..+-.++..|+.- T Consensus 11 ~~~~~~~~~~~~~~~~~~~~PN~~~pll~li~~g~~~ta~ast~~w~~d~~~~~~~~~ta~a~a~~T~l~ve~~~~f~~~ 90 (418) T protein:vir:10 11 TLNPQELNMKSFAGTILRRVPNGSAPLLAMTSVVGSTTAKASTHGYFSKTMVFASAVVTAEAAADATVLTVENSDGLTKG 90 (418) T ss_pred CCChhhhchhhhhhhhhhhcCCcchhhhhhhhcccccccceeEEEEEEEEEeeeeEEEEEEEecCceEEEEcCcceeccc Confidence 11111111000 0000000 0111122332220 0 00111123333333 Q ss_pred hhhhHhhhhhccccccc--hhhhcccchhhHHHhhhhhhhccCcccccc--------cc----cccCcccccCcceEEEE Q lcl|NC_019448. 47 ILDDQITMLTWTNEDLI--FYRDISRRPAQSTVVKYDQYLRHGNVGHSR--------FV----KEIGVAPVSDPNIRQKT 112 (463) Q Consensus 47 sLd~~i~~L~~~~~df~--f~~~i~k~~~~stv~ey~~~~~hG~~g~~~--------fv----~E~g~~~~~d~~~~r~~ 112 (463) .| .|.+..+. ....|+- ..-.+....|++-+.+ |+ -||.+..... .+.+ T Consensus 91 ~l-------~~~~~~~Evirv~sVng-------~~lTV~Rg~~~t~aaaia~n~~~~~Ig~~~eEGsd~~ta~--~~k~- 153 (418) T protein:vir:10 91 MI-------FYNEATGENMRLELVNG-------LNLTVKRQTGRISAAIIAANTKLIVIGTAFEEGSQRPTAR--SIQP- 153 (418) T ss_pred cE-------EEEccCCeEEEEEEEeC-------CEEEEEEecCCeeEEEEecCceEEEeccccccccccCCcc--eecc- Confidence 22 22221111 1111110 1111222222222222 22 3666555443 2222 Q ss_pred EEE-E---Eeechhhhhhhhhhh---cccccHHHHHHHHHHHHHHHHHHHHHhhcccccCCCccccc--cccccceeeec Q lcl|NC_019448. 113 VSM-K---YVSDTKNMSIASGLV---NNIADPSQILTEDAIAVVAKTIEWASFYGDASLTSEVEGEG--LEFDGLAKLID 183 (463) Q Consensus 113 ~~~-k---~l~~~~~vs~~~~lv---n~~~Dp~~~~~~~ai~~~~~~~E~a~fyGd~~l~~~~~~~g--leFDGl~~lI~ 183 (463) +.+ + -+.+-.++|.-+..+ -+++|+...+ .+.+.-.+.++|+++|+|-...... ..| =.++||..+|. T Consensus 154 ~~vsNvtQIF~~avsvSgTaqAs~~q~Gvsn~~ese-~drk~~~av~iEkalI~G~~~~~~~--~~g~~R~m~GIl~~vr 230 (418) T protein:vir:10 154 VYVPNFTQIFRNAWALTDTARASYAEAGYSNITESR-RDCMDFHATEQETAIFFGQAFMGTY--NGQPLHTTQGIVDAVR 230 (418) T ss_pred eeccchhhhhhhhhhhhhhhhhccccccCchHHHHH-HHHHHHHHHHHHHHHhcccccCCCc--CCcchhhHHHHHHHHh Confidence 222 2 234555566554442 3467886666 4444444568999999997553221 122 25889987775 Q ss_pred ---CcceEeccCC-CCCHHHHhhhhhhhhh---cCCceeE-----EecCHHHHHHHHHHhcCcceEEeecCCCCccccee Q lcl|NC_019448. 184 ---KNNVINAKGN-QLTEKHLNEAAVRIGK---GFGTATD-----AYMPIGVHADFVNSILGRQMQLMQDNSGNVNTGYS 251 (463) Q Consensus 184 ---~~nviDarG~-~ls~~~ln~aa~~i~~---~~G~~td-----~~m~~~vka~f~~~~~~~qrv~~~~n~g~~~~G~~ 251 (463) |+||+|+.+. .++.+.|.++...+.. +-|..++ +++|...|.++...+ +.. +.......+|.. T Consensus 231 ~~~~gnVv~a~~~t~~s~d~l~~a~~~af~~g~~~G~~~q~~~f~~~V~~~~k~~I~k~~-~~I----~~~~~e~~~G~v 305 (418) T protein:vir:10 231 QYAPDNVNAMPNPTAVTYDDVVDATIDAFKWSVNVGDNTQRVMFCDTVGMRTMQDIGRFF-GEV----TVTQRETSYGMV 305 (418) T ss_pred hhcccceeccCCCCccCHHHHHHHHHHHhhccCCCcccccceeEEEEeChHHHHHhhhhh-hhe----eecccceeeeEE Confidence 7999999998 6999999988877743 3477765 556999999999765 432 233444578999 Q ss_pred cCeeeecccccccCCcee-----ccCccccccccccCC------CCCCCCeeEEEEeccCCCcCcccccccceEEEEEEE Q lcl|NC_019448. 252 VNGFYSSRGFIKLHGSTV-----MENELILDESLQPLP------NAPQPAKVTATVETKQKGAFEDEEDRAGLSYKVVVN 320 (463) Q Consensus 252 v~~~~s~~G~i~l~~s~~-----~~~d~~l~~~~~~~p------~ap~p~~vtat~~~~~~g~~~~~~~~a~ysYkV~a~ 320 (463) +.++...+|.|-|+-+-+ |-.|..|.-+...+. +-+++--.. -.+++.|+... -|||---+. T Consensus 306 v~~~~~~~G~I~L~~~p~~~~~~lp~g~mlVvD~~~vkL~~L~~R~~~~E~l~----k~G~~~~~~~~---~~~~~~~~D 378 (418) T protein:vir:10 306 FTEWKFFKGRLILKEHPLFSAIGISPGFAVVVDVPAVKLAYMDGRNAKVENYG----QGGGENKSGAT---DYSYGHGVD 378 (418) T ss_pred EEEEEcceEEEEeecccccccccCCCceEEEEccccceEEEeccccccchhcc----cCCCccccccc---ccccccccc Confidence 999999999997765522 222222211111000 111111100 01111111010 011111111 Q ss_pred ecCCccc-------cccceeeeecCCCCceEEEEEecCCCCCCc Q lcl|NC_019448. 321 SDDAQSA-------PSEEVTATVSNVDDGVKLSISVNAMYQQQP 357 (463) Q Consensus 321 s~~geS~-------~S~~vt~Tva~~~~gv~ltIt~~a~~g~~~ 357 (463) +..|.-. --+..-|-+++. ..++=+..+++ ..| T Consensus 379 ~~kG~iv~E~tLe~~N~~a~avitgl-~~~~~~~~~t~---p~~ 418 (418) T protein:vir:10 379 AQGGSLTSEWALELLNPQGCAVITGL-QKAKERVYLTA---PAP 418 (418) T ss_pred cccceEEEEeeeeeecccceEEeecc-ceecccccCCC---CCC Confidence 1111000 000011122222 11121221111 111 No 85 >protein:vir:96442 Length: 418 # NCBI annotation: hypothetical protein # Family: family:all:11266 # MgeID: mge:1616 # MgeName: 119X # Cross-refs: genbank:acc:YP_001218814;genbank:gi:147917331;genbank:GeneID:5142645 Probab=94.74 E-value=0.00018 Score=41.11 Aligned_cols=310 Identities=11% Similarity=0.094 Sum_probs=138.6 Q ss_pred CCCCCccchHH------HHhhhhh----------hHHHHHHhhcCC---------cc------CC---ccccCccccchh Q lcl|NC_019448. 1 MTIEKNLSDVQ------QKYADQF----------QEDVVKSFQTGY---------GI------TP---DTQIDAGALRRE 46 (463) Q Consensus 1 ~~~~~~~~~~~------~~~~k~~----------~e~~~Ks~~agy---------~~------~p---~~q~~gaalr~e 46 (463) --.|+.+...- .++.+.- .+--.||.+++| .+ .+ -+-.++..++.- T Consensus 11 ~~~~~~~~~~~~~~~~~~~~PN~~~p~l~~i~~g~~~~~~~~t~~w~~d~l~~~~~~~ta~~~a~~T~i~V~~~~~f~~~ 90 (418) T protein:vir:96 11 TLNPQELNMKSFAGTILRRVPNGSAPLLAMTSVVGSTTAKASTHGYFSKTMVFASAVVTAEALADATVLTVENSDGLTKG 90 (418) T ss_pred CCChhhhchhhhhhhhhhhcCCcccchhhhhcccCccccceeEEEEEeeEeeeeeEEEEEEEecCceEEEecCCcccccc Confidence 11111111000 0000000 000011111111 00 00 001122234433 Q ss_pred hhhhHhhhhhccc--cccchhhhcccchhhHHHhhhhhhh--cc--Ccccc--cccccccCcccccCcce--EEEEEEEE Q lcl|NC_019448. 47 ILDDQITMLTWTN--EDLIFYRDISRRPAQSTVVKYDQYL--RH--GNVGH--SRFVKEIGVAPVSDPNI--RQKTVSMK 116 (463) Q Consensus 47 sLd~~i~~L~~~~--~df~f~~~i~k~~~~stv~ey~~~~--~h--G~~g~--~~fv~E~g~~~~~d~~~--~r~~~~~k 116 (463) .| .|.+ +..-...+|+-.+ =..+.-|..-. .| |.-+. +--+-||.+.+.+. .+ +++...+- T Consensus 91 ~l-------~~~~~~~EvirVtsVng~~-lTV~RG~~~t~aa~iaag~~~~~ig~~~eEGsd~~ta~-~~k~~~vsN~tQ 161 (418) T protein:vir:96 91 MI-------FYNEATGENMRLELVNGLN-LTVKRQTGRIAAAIIAANTKLIVIGTAFEEGSQRPTAR-SIQPVYVPNFTQ 161 (418) T ss_pred cE-------EEEecCCeEEEEEEEeCCE-EEEEEccCCeeeeeeecCceEEEeecCcccccccCCcc-eecceeccchhh Confidence 32 1111 0000011111000 00111111110 01 11000 11224777665544 11 22223333 Q ss_pred Eeechhhhhhhhhh---hcccccHHHHHHHHHHHHHHHHHHHHHhhcccccCCC-ccc----cccccccceeeecCcceE Q lcl|NC_019448. 117 YVSDTKNMSIASGL---VNNIADPSQILTEDAIAVVAKTIEWASFYGDASLTSE-VEG----EGLEFDGLAKLIDKNNVI 188 (463) Q Consensus 117 ~l~~~~~vs~~~~l---vn~~~Dp~~~~~~~ai~~~~~~~E~a~fyGd~~l~~~-~~~----~gleFDGl~~lI~~~nvi 188 (463) -+.+..+||.-++. +-+++|....+ .|+|.-.+..+|.++++|.+..... +.+ .+ .-||+..-+ ++||+ T Consensus 162 If~e~vsVSgTAqA~v~qaGvsn~~~~e-~d~l~~~kv~iE~ali~g~~~~~~~ng~p~~~t~R-~m~gI~~f~-~~Nvi 238 (418) T protein:vir:96 162 IFRNAWALTDTARASYAEAGYSNITESR-RDCMDFHATEQETAIFFGQAFMGTYNGQPLHTTQG-IVDAIRQYA-PDNVN 238 (418) T ss_pred eehhhhhhhhhhhhhhhhcCcchhHHHH-HHHHHHHHHHHHHhhhccccccCCCCCcccccccc-hhHHHHhhc-ccccc Confidence 45566677766543 22566776666 6999999999999999999977321 111 12 235555443 78999 Q ss_pred eccCC-CCCHHHHhhhhhhhhh---cCCceeE-----EecCHHHHHHHHHHhcCcceEEeecCCCCcccceecCeeeecc Q lcl|NC_019448. 189 NAKGN-QLTEKHLNEAAVRIGK---GFGTATD-----AYMPIGVHADFVNSILGRQMQLMQDNSGNVNTGYSVNGFYSSR 259 (463) Q Consensus 189 DarG~-~ls~~~ln~aa~~i~~---~~G~~td-----~~m~~~vka~f~~~~~~~qrv~~~~n~g~~~~G~~v~~~~s~~ 259 (463) ++.+. .++++.|..+...+-+ +-|..++ ++++...|..+...+ ..-|. ......+|..++.|.|-. T Consensus 239 ~ag~~~~~t~d~L~~~~~~a~~~g~n~G~~~~~~~y~~~V~a~~k~~I~k~~-~~I~~----~~~en~~G~vv~~~~Td~ 313 (418) T protein:vir:96 239 AMPNPTAVTYDDVVDATIDAFKWSVNVGDNTQRVMFCDTVGMRTMQDIGRFF-GEVTV----TQRETSYGMVFTEWKFFK 313 (418) T ss_pred ccCCCCcCCHHHHHHHHHHHHhhcCCCCCcccceEEEEEeChHHHHHHhhhh-ceeEe----ccccceeceEEEEEEeec Confidence 99888 6999999988877754 3477766 566999999999654 43333 344457799999999999 Q ss_pred cccccCCceeccCccc-------ccccccc---CC-CCCCCCee---E-----EEEeccCCCcCcccccc---cceEEEE Q lcl|NC_019448. 260 GFIKLHGSTVMENELI-------LDESLQP---LP-NAPQPAKV---T-----ATVETKQKGAFEDEEDR---AGLSYKV 317 (463) Q Consensus 260 G~i~l~~s~~~~~d~~-------l~~~~~~---~p-~ap~p~~v---t-----at~~~~~~g~~~~~~~~---a~ysYkV 317 (463) |-|.+--+.+|..|.+ +|...+- +. +.+.+--. - +++ ....|..-|...+ +.|.-++ T Consensus 314 G~v~ii~n~~~pad~I~~g~mlVvD~~~vkL~yL~~R~~~~E~l~k~G~~~~~~~~-~~~~~~~~D~~~G~l~~Eltle~ 392 (418) T protein:vir:96 314 GRLIIKEHPLFSAIGISPGFAVVVDVPAVKLAYMDGRNAKVENYGQGGGENKSGAT-DYSYGHGVDAQGGSLTSEWALEL 392 (418) T ss_pred cEEEEEecCCCCccccCcceEEEEecCceEEEEecCCCccchhcccCCCccccccc-ccccccccccccCEEEEEEEEEe Confidence 9998866666655553 2222210 00 11111111 1 111 1111111111100 1122222 Q ss_pred E-----EEe----------cCCcccc Q lcl|NC_019448. 318 V-----VNS----------DDAQSAP 328 (463) Q Consensus 318 ~-----a~s----------~~geS~~ 328 (463) . ++. .-.+-+| T Consensus 393 ~N~~a~a~itgl~~~~~~~~~~~~~~ 418 (418) T protein:vir:96 393 LNPQGCAVITGLQKAKERVYLTAPAP 418 (418) T ss_pred ecccccEEeecccccccccccCCCCC Confidence 1 111 2237777 No 86 >protein:vir:3158 Length: 321 # NCBI annotation: capsid protein gpE # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:316 # MgeName: PhiCh1 # Cross-refs: genbank:acc:NP_665929;genbank:gi:22091115;genbank:GeneID:951342 Probab=94.56 E-value=0.0041 Score=33.69 Aligned_cols=299 Identities=11% Similarity=0.065 Sum_probs=141.3 Q ss_pred HhhhhhhHHHHHHhhcCCccCCccccCccccchhhhhhHhhhhhccccccchhhhcccchhhHHHhhhhhhhccCccccc Q lcl|NC_019448. 13 KYADQFQEDVVKSFQTGYGITPDTQIDAGALRREILDDQITMLTWTNEDLIFYRDISRRPAQSTVVKYDQYLRHGNVGHS 92 (463) Q Consensus 13 ~~~k~~~e~~~Ks~~agy~~~p~~q~~gaalr~esLd~~i~~L~~~~~df~f~~~i~k~~~~stv~ey~~~~~hG~~g~~ 92 (463) -..|.|+..+.--.++ .+++..+..+|..+.-|...+-+..+...+ .|++.++-.++++--.+-. ..|-.+.. T Consensus 1 ~~~k~~~~~l~~~~~~-~~~~~~~~~~g~~v~~~~~~~l~~~i~e~s---~~l~~i~v~~v~~~~~~i~---~~~~~~~~ 73 (321) T protein:vir:31 1 MASRTINNDLSRITEK-NALTVDDLDAGGTLPDPLWDEFWTDMIEET---PLLDAIRTETVGAKKTRIP---TLNIGERH 73 (321) T ss_pred CchHHHHHHHHHHHHh-ccccccccCCcceeCHHHHHHHHHHHHHhh---hhhhhceeeeccCcceeee---eeccCCcc Confidence 1122232222111122 234444556667787777776666655443 4777776666554332222 12111111 Q ss_pred cccc-cc-CcccccCcceEEEEEEEEEeechhhhhhhhhhhcc--cccHHHHHHHHHHHHHHHHHHHHHhhcccccCCCc Q lcl|NC_019448. 93 RFVK-EI-GVAPVSDPNIRQKTVSMKYVSDTKNMSIASGLVNN--IADPSQILTEDAIAVVAKTIEWASFYGDASLTSEV 168 (463) Q Consensus 93 ~fv~-E~-g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lvn~--~~Dp~~~~~~~ai~~~~~~~E~a~fyGd~~l~~~~ 168 (463) ...+ |+ +....++|.+.+....++=+..--.+|.-. |-++ ..|-+....+.-..+++.+++.+.|+||..-.+.+ T Consensus 74 ~~~~~e~~~~~~~~~~~~~~~~~~~~k~~~~~~it~e~-L~d~a~~~d~e~~i~~~ia~~~a~~~~~~~~nGd~~~~~~~ 152 (321) T protein:vir:31 74 RRPQDEGEWNENESDVSTGTIDISTEKATVAWDLPREV-VQENPEGEALADRILNLMTDAWSADVEDLAANGDEDAEDSF 152 (321) T ss_pred cccccccccccccccceeeeeeeeeEEEEeehhccHHH-HHhhhcchhHHHHHHHHHHHHHHHHHHhheeeccccCCCcc Confidence 1222 33 345567888888777776666555555432 2333 24788888888889999999999999997644321 Q ss_pred cccccccccceeeec-CcceEeccCCCCCHHHHhhhhhhhhhcCC-cee-EEecCHHHHHHHHHHhcCcceEEeecCCCC Q lcl|NC_019448. 169 EGEGLEFDGLAKLID-KNNVINAKGNQLTEKHLNEAAVRIGKGFG-TAT-DAYMPIGVHADFVNSILGRQMQLMQDNSGN 245 (463) Q Consensus 169 ~~~gleFDGl~~lI~-~~nviDarG~~ls~~~ln~aa~~i~~~~G-~~t-d~~m~~~vka~f~~~~~~~qrv~~~~n~g~ 245 (463) -...+|+.+++. ..+.++..+..++.+.|..+-..|-..|- ..+ -++|+..+.+.+...+.+++.-+.++.- T Consensus 153 ---~~~n~G~l~~a~~~~~~~~~~~~~~~~d~l~~l~~~l~~~yr~~~~~v~im~~~~~~~~~~~l~~~~~~~~~~~l-- 227 (321) T protein:vir:31 153 ---ENQNDGFITVAEGDVETIDAADDILDNDLVIRTIAGLDSKYRARMNPALIVSEDQLLSYHYTLTDRDTPLGDNVI-- 227 (321) T ss_pred ---cccchhhhhhhccccccccccccccCHHHHHHHHHhccHhHhcCCCeEEEechHHHHHHHHHHhcCCCccccchh-- Confidence 124689998775 35678888999999988888888887773 333 2579999888877666665543221100 Q ss_pred cccceecCeeeecccccccCCceeccCccccccccccCCCCCCCCe-eEEEEeccC------CCcCc---c--cc-cccc Q lcl|NC_019448. 246 VNTGYSVNGFYSSRGFIKLHGSTVMENELILDESLQPLPNAPQPAK-VTATVETKQ------KGAFE---D--EE-DRAG 312 (463) Q Consensus 246 ~~~G~~v~~~~s~~G~i~l~~s~~~~~d~~l~~~~~~~p~ap~p~~-vtat~~~~~------~g~~~---~--~~-~~a~ 312 (463) ..... .+++..+... .|. .|.. .-.|.-.+- ..... + .. +... T Consensus 228 -----------~~~~~-----~tl~G~pvv~------~~~--mP~~~il~t~~~nl~~~~~~~~~~~~~~~~~~~~~~~~ 283 (321) T protein:vir:31 228 -----------MGEAD-----VNPFSFPIIG------SGL--WPDDKAMFTDPQNLIYALYRDLEIDVLTESDKVSERDL 283 (321) T ss_pred -----------hcccc-----ccccceeEEE------cCC--CCCCcEEEeccccEEEEEeeccEEEEeecCccccccce Confidence 00000 1111111111 010 1110 111100000 00000 0 00 0001 Q ss_pred eEEEEEEEecCC--ccccccceeeeecCCCCceEEEEEecCCCCCCcc Q lcl|NC_019448. 313 LSYKVVVNSDDA--QSAPSEEVTATVSNVDDGVKLSISVNAMYQQQPQ 358 (463) Q Consensus 313 ysYkV~a~s~~g--eS~~S~~vt~Tva~~~~gv~ltIt~~a~~g~~~~ 358 (463) ..|.....+.+. |- ....|-+ .+++..+.. + ...++ T Consensus 284 ~~~~~~~~~~~~~ve~---~~a~a~~----~~i~~~~~~--~-~~~~~ 321 (321) T protein:vir:31 284 HARYFMRGDDDFAIEN---TEAVVLA----EGLGDPLEH--L-EEETS 321 (321) T ss_pred eeEeeeeeecceeEec---cccEEEE----ecCCcchhc--c-cCCCC Confidence 111111111111 10 0011111 122222211 1 00011 No 87 >protein:vir:3845 Length: 395 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:322 # MgeName: phi adh # Cross-refs: genbank:acc:NP_050151;swissprot:trembl:q9t1f6;genbank:gi:9633043;uniprot:Q9T1F6;genbank:GeneID:1262163 Probab=94.40 E-value=0.0045 Score=33.43 Aligned_cols=301 Identities=11% Similarity=-0.006 Sum_probs=122.5 Q ss_pred CCCCCccchHH--HHhhhhhhHHHHHHhhcCCccCCccccCccccchhhhhhHhhhhhccccccchhhhcccchhhHHHh Q lcl|NC_019448. 1 MTIEKNLSDVQ--QKYADQFQEDVVKSFQTGYGITPDTQIDAGALRREILDDQITMLTWTNEDLIFYRDISRRPAQSTVV 78 (463) Q Consensus 1 ~~~~~~~~~~~--~~~~k~~~e~~~Ks~~agy~~~p~~q~~gaalr~esLd~~i~~L~~~~~df~f~~~i~k~~~~stv~ 78 (463) .+.+....... ....+.+...+.|++...-.....+-.+||.|--+.+.++|-.+. .+.-.+.+-+...++.+... T Consensus 74 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~~vP~~~~~~ii~~~--~~~~~l~~~~~~~~~~~~~~ 151 (395) T protein:vir:38 74 EPVNKKPLPVKDGKPDAQAMKNQFVKDFKNLVTSGTTGTGNAGLTIPEDIQLQIRTLT--RSFTSLESLANVENVTTSHG 151 (395) T ss_pred ccccccccchhhhhHHHHHHHHHHHHHHHHHHhhccCccCCCceecchhHhhHHHHHH--HhhcchhhhcceeeccCCcc Confidence 22222221111 122233334445554322111111223355544444444553222 22333444454445444433 Q ss_pred hhh--hhhccCcccccccccccCccc-ccCcceEEEEEEEEEeechhhhhhhhhhhcccccHHHHHHHHHHHHHHHHHHH Q lcl|NC_019448. 79 KYD--QYLRHGNVGHSRFVKEIGVAP-VSDPNIRQKTVSMKYVSDTKNMSIASGLVNNIADPSQILTEDAIAVVAKTIEW 155 (463) Q Consensus 79 ey~--~~~~hG~~g~~~fv~E~g~~~-~~d~~~~r~~~~~k~l~~~~~vs~~~~lvn~~~Dp~~~~~~~ai~~~~~~~E~ 155 (463) .|. +..+++ +...+++|++..+ .+++.+.+.....+-++.--.+|.-+= .++..|.+....+.-...+...++. T Consensus 152 ~~~~~~~~~~~--~~a~~v~E~~~~~~~~~~~f~~v~~~~~k~~~~~~iS~ell-~ds~~~l~~~i~~~la~~~~~~~~~ 228 (395) T protein:vir:38 152 SRVYEKLADIT--PLKDLDDESALIGDNDDPELTVVKYLIHRYAGITTVTNTLL-KDTVDNIIQWLVNWAAKKDVVTRNA 228 (395) T ss_pred eEEEEeeccCC--ccccccccccccccccccceeeEEeeeeeeEeehhhHHHHH-hhhHHHHHHHHHHHHHHHHHHHHHH Confidence 332 222322 3456899998866 566999999999999988877776432 3345577888888889999999999 Q ss_pred HHhhcccccCCCccccccccccceeeecCcceEeccCCCCCHHHHhhhh-hhhhhcCCceeEEecCHHHHHHHHHHhcCc Q lcl|NC_019448. 156 ASFYGDASLTSEVEGEGLEFDGLAKLIDKNNVINAKGNQLTEKHLNEAA-VRIGKGFGTATDAYMPIGVHADFVNSILGR 234 (463) Q Consensus 156 a~fyGd~~l~~~~~~~gleFDGl~~lI~~~nviDarG~~ls~~~ln~aa-~~i~~~~G~~td~~m~~~vka~f~~~~~~~ 234 (463) ++++|+..-.+. +....+|.+ ..+. ..+...|....-.+|++.+.+.+...--.. T Consensus 229 ~il~g~g~~~~~--~~~~~~~~i----------------------~~~~~~~l~~~~~~~a~~v~n~~~~~~L~~lkd~~ 284 (395) T protein:vir:38 229 KILEVMGKAPKK--PTISQFDNI----------------------KDLENNTLDPAIESTSSFITNQSGYNILSKVKDAD 284 (395) T ss_pred HHhhcccccccc--cccccHHHH----------------------HHHHHHhhhhhhcCCCEEEEcHHHHHHHHHhhccC Confidence 999998764331 111122222 2222 123334444444778988888886533233 Q ss_pred ceEEeecCCC----CcccceecCeeee-cccccccCCceecc---Ccccc-ccccccC--CC----CCCCCeeEEEEecc Q lcl|NC_019448. 235 QMQLMQDNSG----NVNTGYSVNGFYS-SRGFIKLHGSTVME---NELIL-DESLQPL--PN----APQPAKVTATVETK 299 (463) Q Consensus 235 qrv~~~~n~g----~~~~G~~v~~~~s-~~G~i~l~~s~~~~---~d~~l-~~~~~~~--p~----ap~p~~vtat~~~~ 299 (463) -|.+.+++.+ ..-.|++|--.-. .-+...-...+++. +..++ ++..... .+ .+.-..++.=+.-- T Consensus 285 G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~i~~gd~~~~~~i~~~~~~~i~~~~~~~~~~~~~~~~~r~~~r 364 (395) T protein:vir:38 285 GRYLMQPDVTSPDKYLIDGKPVIRIADKWLPDVSGSHPLYFGDLKQGITLFDRQQMQIDTTNVGAGSFEHDTTKLRFIDR 364 (395) T ss_pred CceeeccCcCCCCcceeccceeEEecccccCcCCCcceEEEEeccccEEEEEecceEEEEeccccchhhcCceEEEEEEe Confidence 3344433222 2234554311000 00000000001111 00110 0000000 00 00000000000000 Q ss_pred CCCcCcccccccceEEEEEEEecCCccccccceeeeecCCCCce Q lcl|NC_019448. 300 QKGAFEDEEDRAGLSYKVVVNSDDAQSAPSEEVTATVSNVDDGV 343 (463) Q Consensus 300 ~~g~~~~~~~~a~ysYkV~a~s~~geS~~S~~vt~Tva~~~~gv 343 (463) -.+...+++. --..++++. ++...+|+. .|. T Consensus 365 ~d~~~~~~~a--~~~~~~~~~--------~~~~~~~~~---~~~ 395 (395) T protein:vir:38 365 FDVQLIDDGA--FAAASFKTV--------ANQAQGTAG---TGK 395 (395) T ss_pred eccEEecccc--eEEEEeecc--------cCCCCCccC---CCC Confidence 0111111100 000111110 010111110 111 No 88 >protein:vir:7409 Length: 408 # NCBI annotation: major structural protein # Family: family:all:21 # MgeID: mge:146 # MgeName: P335 # Cross-refs: genbank:acc:NP_839926;genbank:gi:30089896;genbank:GeneID:1260683 Probab=94.13 E-value=0.0053 Score=33.06 Aligned_cols=303 Identities=12% Similarity=0.042 Sum_probs=124.8 Q ss_pred CCCCCccchHHHHhhhhhhHHHHH------HhhcCCccCCccccCccccchhhhhhHhhhhhccccccchhhhcccchhh Q lcl|NC_019448. 1 MTIEKNLSDVQQKYADQFQEDVVK------SFQTGYGITPDTQIDAGALRREILDDQITMLTWTNEDLIFYRDISRRPAQ 74 (463) Q Consensus 1 ~~~~~~~~~~~~~~~k~~~e~~~K------s~~agy~~~p~~q~~gaalr~esLd~~i~~L~~~~~df~f~~~i~k~~~~ 74 (463) .+.+.........+.+.|...+.+ ..... ..+..+..+|+.+--+.+..+|-.+. .+.-.+.+.+...++. T Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-a~~~~~~~~gg~~vP~~~~~~Ii~~~--~~~~~l~~~~~~~~~~ 156 (408) T protein:vir:74 80 GPLNKSENELKDKFVKDFVNMVRNPMAFLNTVSSK-TETSGSDSAAGLTIPQDIRTMINTLV--RQYDSLQQYVRVESVS 156 (408) T ss_pred ccccchhhhhHHHHHHHHHHHHhcchhhhhhhhhh-hhcccccCCCceeechhHhhHHHHHH--hhhcchhhhcceeecc Confidence 222222222223333322211110 00000 01111223455544445555553332 2333456666666665 Q ss_pred HHHhhhhhhhccCcccccccccccCccc-ccCcceEEEEEEEEEeechhhhhhhhhhhcccccHHHHHHHHHHHHHHHHH Q lcl|NC_019448. 75 STVVKYDQYLRHGNVGHSRFVKEIGVAP-VSDPNIRQKTVSMKYVSDTKNMSIASGLVNNIADPSQILTEDAIAVVAKTI 153 (463) Q Consensus 75 stv~ey~~~~~hG~~g~~~fv~E~g~~~-~~d~~~~r~~~~~k~l~~~~~vs~~~~lvn~~~Dp~~~~~~~ai~~~~~~~ 153 (463) +....|......+......+++|++..+ .+++.+.+....++-++.--.+|.-+= .++..|.+....+.--..+...+ T Consensus 157 ~~~~~~~~~~~~~~~~~~~~v~E~~~~~~~~~~~~~~i~~~~~k~~~~~~iS~ell-~ds~~~l~~~i~~~l~~~~~~~~ 235 (408) T protein:vir:74 157 TSSGSRVYEKWTDVTPLKAMDEEDGKIPDLDNPRLTIIKYLIKRYAGIITATNTLL-KDTAENILAWLSSWIAKKVVVTR 235 (408) T ss_pred CCcceEEEEeecCCcccccccccccccccccccceeeEEeeeeeEEeeehhHHHHH-hhchHHHHHHHHHHHHHHHHHHH Confidence 5444332222112222456888988754 788999999999999998888877532 34556788888888899999999 Q ss_pred HHHHhhcccccCCCccccccccccceeeecCcceEeccCCCCCHHHHhhhhhhhhhcCCceeEEecCHHHHHHHHHHhcC Q lcl|NC_019448. 154 EWASFYGDASLTSEVEGEGLEFDGLAKLIDKNNVINAKGNQLTEKHLNEAAVRIGKGFGTATDAYMPIGVHADFVNSILG 233 (463) Q Consensus 154 E~a~fyGd~~l~~~~~~~gleFDGl~~lI~~~nviDarG~~ls~~~ln~aa~~i~~~~G~~td~~m~~~vka~f~~~~~~ 233 (463) +.++++|+..-.+. +.++.+|++. +.....+-.+|....-.+|++.+.+.|...=-. T Consensus 236 d~~il~G~G~~~~~--~~~~~~~~i~---------------------~~~~~~l~~~~~~~a~~v~n~~~~~~l~~lkd~ 292 (408) T protein:vir:74 236 NQAIIAAMGTVPKK--PTIANFDDVI---------------------TMINTSVDPAIIATSSLLTNQSGLNKLALVKTA 292 (408) T ss_pred HHHHhhcccccccc--cccccHHHHH---------------------HHHHHhhhhhhcCCCEEEEcHHHHHHHHHhhcC Confidence 99999998764331 1122222222 212233444453333467899998888742212 Q ss_pred cceEEeecCCC----CcccceecC----eeeecccccccCCc-eeccC--c-ccc-ccccccC---C---CCCCCCeeEE Q lcl|NC_019448. 234 RQMQLMQDNSG----NVNTGYSVN----GFYSSRGFIKLHGS-TVMEN--E-LIL-DESLQPL---P---NAPQPAKVTA 294 (463) Q Consensus 234 ~qrv~~~~n~g----~~~~G~~v~----~~~s~~G~i~l~~s-~~~~~--d-~~l-~~~~~~~---p---~ap~p~~vta 294 (463) .-|.+.+++.. ..-.|++|- .+....| .... +++.+ + .++ ++..... + +.+.--.++. T Consensus 293 ~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~---~~~~~i~~gd~~~~~~~~~~~~~~i~~~~~~~~~f~~~~~~~ 369 (408) T protein:vir:74 293 EGKYLLEPDPTKPNSYLIKGKQVIVVADRWLPNSG---STVYPLYYGDMSQAITLFDRENMSLLPTNIGAGAFETDTTKI 369 (408) T ss_pred CCceEeccCcCCCCCceecceeeEEecCccccccc---CCcceEEEEehhccEEEEEecceEEEEeccccchhhcceeeE Confidence 22334433221 223455431 1111111 0000 11110 0 000 0000000 0 0000000100 Q ss_pred EEeccCCCcCcccccccceEEEEEEEecCCccccccceeeeecCC Q lcl|NC_019448. 295 TVETKQKGAFEDEEDRAGLSYKVVVNSDDAQSAPSEEVTATVSNV 339 (463) Q Consensus 295 t~~~~~~g~~~~~~~~a~ysYkV~a~s~~geS~~S~~vt~Tva~~ 339 (463) -+.--..|...+++ +--..+++.+. +.|..-.+.|.+++ T Consensus 370 r~~~r~d~~~~~~~--a~~~~~~~~~~----~~~~~~~~~~~~~~ 408 (408) T protein:vir:74 370 RVIDRFDVKATDSE--ALVAGSFTAIA----DQVGNFKTTTSTAV 408 (408) T ss_pred EEEEeeCcEEeccc--ceEEEEeeccc----CCCCCCCCCccccC Confidence 00000011111000 00111111110 00101011111111 No 89 >protein:vir:3870 Length: 400 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:82 # MgeName: A2 # Cross-refs: genbank:acc:NP_680487;swissprot:trembl:q8ltc0;genbank:gi:22296527;interpro:IPR006444;uniprot:Q8LTC0;genbank:GeneID:951713 Probab=93.88 E-value=0.0061 Score=32.73 Aligned_cols=285 Identities=13% Similarity=0.078 Sum_probs=127.0 Q ss_pred CCCCCccchHHHHhh--------------------------hhhhHHHHHHhhcCCccCCccccCccccchhhhhhHhhh Q lcl|NC_019448. 1 MTIEKNLSDVQQKYA--------------------------DQFQEDVVKSFQTGYGITPDTQIDAGALRREILDDQITM 54 (463) Q Consensus 1 ~~~~~~~~~~~~~~~--------------------------k~~~e~~~Ks~~agy~~~p~~q~~gaalr~esLd~~i~~ 54 (463) .+.........+.+. ....++...++.+|- +..+|+.|.-+-+..+|-. T Consensus 82 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~gg~~vP~~~~~~ii~ 156 (400) T protein:vir:38 82 KPDHPEEHSYRDALNAYLHTRGRNTDGVNFEKTDVGTFAVLRAVPTDASDAVNAGV-----KAADAASTIPETISNTPQR 156 (400) T ss_pred cccchhhhhHHHHHHHHHhhHHHHHHHHHHHHHHHHHHhhhhhhhHHHHHHHhhcc-----cccCCcccccHHHHHHHHH Confidence 111111111110000 000111112222221 3345666655555666633 Q ss_pred hhccccccchhhhcccchhhHHHhhhhhhhccCcccccccccccCccc-ccCcceEEEEEEEEEeechhhhhhhhhhhcc Q lcl|NC_019448. 55 LTWTNEDLIFYRDISRRPAQSTVVKYDQYLRHGNVGHSRFVKEIGVAP-VSDPNIRQKTVSMKYVSDTKNMSIASGLVNN 133 (463) Q Consensus 55 L~~~~~df~f~~~i~k~~~~stv~ey~~~~~hG~~g~~~fv~E~g~~~-~~d~~~~r~~~~~k~l~~~~~vs~~~~lvn~ 133 (463) +.. +.-.+.+.+...++.+.--+|...... .+...+++|++..+ .+++.+.+.+..++-++.-..+|.-+ +.++ T Consensus 157 ~~~--~~~~l~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~E~~~~~~~~~~~f~~i~~~~~k~~~~~~is~el-l~ds 231 (400) T protein:vir:38 157 ELQ--TVVDLKPFTNVFQASTQKGTYPTVANA--TTKMVTVAELEKNPAMAKPEFKPVNWSVETYRQALPVSQES-IDDS 231 (400) T ss_pred HHH--hhhhhhhcceeEeccCcceEEEEEecC--CCccccccccccccccccccceeeEeehhheeeehhhHHHH-Hhhh Confidence 333 333455555555555443445554433 34566788888765 68999999999998888777776631 2344 Q ss_pred cccHHHHHHHHHHHHHHHHHHHHHhhcccccCCCccccccccccceeeecCcceEeccCCCCCHHHHhhhhhhhhhcCCc Q lcl|NC_019448. 134 IADPSQILTEDAIAVVAKTIEWASFYGDASLTSEVEGEGLEFDGLAKLIDKNNVINAKGNQLTEKHLNEAAVRIGKGFGT 213 (463) Q Consensus 134 ~~Dp~~~~~~~ai~~~~~~~E~a~fyGd~~l~~~~~~~gleFDGl~~lI~~~nviDarG~~ls~~~ln~aa~~i~~~~G~ 213 (463) ..|.+....+.....+..+++.++++|.....+.. ...+|++..++. ..+-.++ T Consensus 232 ~~~~~~~i~~~l~~~~~~~~~~~i~~~~~~~~~~~---~~~~~~~~~~~~---------------------~~~~~~~-- 285 (400) T protein:vir:38 232 AIDLVGLIAQNGQQIKVNTTNGAVATLLKGFTAKT---ISSVDDLKHINN---------------------VDLDPAY-- 285 (400) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhhhhccccccccc---cccHHHHHHHHH---------------------hhhhhhh-- Confidence 56778888888899999999999999998765432 123444432221 1111111 Q ss_pred eeEEecCHHHHHHHHHHhcCcceEEeecCCCCcccceecCeeeecccccccCCceeccCccccccccccCCCCCCCCeeE Q lcl|NC_019448. 214 ATDAYMPIGVHADFVNSILGRQMQLMQDNSGNVNTGYSVNGFYSSRGFIKLHGSTVMENELILDESLQPLPNAPQPAKVT 293 (463) Q Consensus 214 ~td~~m~~~vka~f~~~~~~~qrv~~~~n~g~~~~G~~v~~~~s~~G~i~l~~s~~~~~d~~l~~~~~~~p~ap~p~~vt 293 (463) -.-..|++.+.+.|...-=..-|.+.+++..+.. +.+++..+.++.. +.|.+.. + T Consensus 286 ~a~~v~~~~~~~~l~~lkd~~G~~i~~~~~~~~~------------------~~~l~G~pv~~~~------~~~~~~~-g 340 (400) T protein:vir:38 286 SRVIIASQSFYNFLDTVKDGNGRYLLQDSILTPS------------------GKSVLGMPIAVVS------DDTLGAA-G 340 (400) T ss_pred CcEEEEcHHHHHHHHHhhccCCCeeeecCcCCCC------------------ccccccceeEEec------ccccCCC-C Confidence 1247899999988874321122333333222111 1122222222111 1111100 0 Q ss_pred EEEeccCCCcCccccc------ccceEEEEEEEecCCcc-ccccceeeeecCCCCceEEEEEecC Q lcl|NC_019448. 294 ATVETKQKGAFEDEED------RAGLSYKVVVNSDDAQS-APSEEVTATVSNVDDGVKLSISVNA 351 (463) Q Consensus 294 at~~~~~~g~~~~~~~------~a~ysYkV~a~s~~geS-~~S~~vt~Tva~~~~gv~ltIt~~a 351 (463) .. .-.|+|... ....+.++.-....... ....-+.+.+.....-+.|++++.| T Consensus 341 ~~-----~~~~gd~s~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~d~~~~~~~a~~~l~~~~~a 400 (400) T protein:vir:38 341 EA-----HAFLGDIKRAILFANRADFMVRWVDDQIYGQFLQAGMRFGVSVADEKAGYFLTYTPKA 400 (400) T ss_pred ce-----EEEEEeccccEEEEeecceEEEEecccccceeEEEEEEeccEEecccceEEEEeecCC Confidence 00 001111100 00111111000000000 0011123334334444566666544 No 90 >protein:vir:485 Length: 407 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:11 # MgeName: P27 # Cross-refs: genbank:acc:NP_543092;swissprot:trembl:q8w627;genbank:gi:18249904;uniprot:Q8W627;genbank:GeneID:929693 Probab=93.81 E-value=0.0063 Score=32.65 Aligned_cols=302 Identities=14% Similarity=0.080 Sum_probs=136.7 Q ss_pred CCCCCccchHH----HH---hhhhhh-----HHHHHHhhcCCccCCccccCccccchhhhhhHhhhhhccccccchhhhc Q lcl|NC_019448. 1 MTIEKNLSDVQ----QK---YADQFQ-----EDVVKSFQTGYGITPDTQIDAGALRREILDDQITMLTWTNEDLIFYRDI 68 (463) Q Consensus 1 ~~~~~~~~~~~----~~---~~k~~~-----e~~~Ks~~agy~~~p~~q~~gaalr~esLd~~i~~L~~~~~df~f~~~i 68 (463) .+...+..... .+ ++++.. +.=.|+++++. ..+||.|--|.+.++|..+... ...+++.. T Consensus 69 ~~~~~~~~~~~~e~~~a~~~~l~~g~~~~~~~~e~~a~~~~t------~~~gG~~iP~~~~~~I~~~~~~--~~~l~~~~ 140 (407) T protein:vir:48 69 RPAGGTQNKVASEHKEAFIGFMRKGREDGLRELERKALQVGN------DEDGGYAIPEELDRTILTLLKD--EVVMRQEA 140 (407) T ss_pred ccccccccchhhHHHHHHHHHHhccchhhhhHHHHHhhhccc------CCCCcccccHhHHHHHHHHHHh--hhhhhhhc Confidence 11111111111 11 111111 11124444432 2345655455566666444432 33455544 Q ss_pred ccchhhHHHhhhhhhhccCcccccccccccCcc-cccCcceEEEEEEEEEeechhhhhhhhhhhcccccHHHHHHHHHHH Q lcl|NC_019448. 69 SRRPAQSTVVKYDQYLRHGNVGHSRFVKEIGVA-PVSDPNIRQKTVSMKYVSDTKNMSIASGLVNNIADPSQILTEDAIA 147 (463) Q Consensus 69 ~k~~~~stv~ey~~~~~hG~~g~~~fv~E~g~~-~~~d~~~~r~~~~~k~l~~~~~vs~~~~lvn~~~Dp~~~~~~~ai~ 147 (463) ...+..+--..|.+.. ++ ....+++|++.. +.+++.+.+....++=++.-..+|.-+ +.++..|.+....+.-.. T Consensus 141 ~~~~~~~~~~~~~~~~--~~-~~a~~v~E~~~~~~~~~~~f~~i~~~~~k~~~~~~iS~el-l~ds~~~l~~~i~~~l~~ 216 (407) T protein:vir:48 141 TVITLGGSDYKKLVNL--GG-TTSGWVGETDARPETATSKLGLIEPFMGEIYGNPQATQKM-LDDAFFNVEDWINSELAL 216 (407) T ss_pred eeeecCCCceEEEEec--CC-cceeeecccccccccccccceeEEeeeeeeEeehhhHHHH-HhcchHHHHHHHHHHHHH Confidence 4445544433343333 22 246689999975 467799999999998777777777653 234566788888888888 Q ss_pred HHHHHHHHHHhhcccccCCCccccccccccceeeecC------------cceEeccCCCCCHHHHhhhhhhhhhcCCcee Q lcl|NC_019448. 148 VVAKTIEWASFYGDASLTSEVEGEGLEFDGLAKLIDK------------NNVINAKGNQLTEKHLNEAAVRIGKGFGTAT 215 (463) Q Consensus 148 ~~~~~~E~a~fyGd~~l~~~~~~~gleFDGl~~lI~~------------~nviDarG~~ls~~~ln~aa~~i~~~~G~~t 215 (463) .+...+|.++++||-. + +..|+.+.... ..+.-..-..++.+.|-.+...+..+|-... T Consensus 217 ~i~~~~~~a~l~G~G~-~--------~p~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~i~~l~~~l~~~~~~~a 287 (407) T protein:vir:48 217 EFAEQEEIAFTSGDGS-K--------KPKGFLAYESTDEDDKTRAFGKLQHIASGAASGVTADAIIKLIYTLRKAHRSGA 287 (407) T ss_pred HHHHHHHhhhhccCCC-C--------ccceeeecccccccccccccccccccccccccccChHHHHHHHHhhchhhhcCC Confidence 9999999999999755 1 45666653321 1111222234556666666666667774444 Q ss_pred EEecCHHHHHHHHHHhcCcceEEeecCCC----CcccceecCeeeecc-cccccCCceeccCc----cccc-cccccCCC Q lcl|NC_019448. 216 DAYMPIGVHADFVNSILGRQMQLMQDNSG----NVNTGYSVNGFYSSR-GFIKLHGSTVMENE----LILD-ESLQPLPN 285 (463) Q Consensus 216 d~~m~~~vka~f~~~~~~~qrv~~~~n~g----~~~~G~~v~~~~s~~-G~i~l~~s~~~~~d----~~l~-~~~~~~p~ 285 (463) ..+|++.+.+.+...=-..-|.+.+++.. ..-.|++|- .+.. -.+......++-.| .++- +.....-. T Consensus 288 ~~v~n~~~~~~L~~lkD~~Gr~l~~~~~~~g~~~~l~G~PV~--~~~~~p~~~~~~~~i~~Gd~~~~~~i~~~~~~~i~~ 365 (407) T protein:vir:48 288 KFMMNNSSLFAIRLLKDNDGNYLWRPGIELGQPSSLAGYGIV--ENEQMPDIAADAKAIAFGNFKRGYTIVDRIGTRILR 365 (407) T ss_pred EEEEcHHHHHHHHHhhccCCceeeccCcCCCCCceecceeeE--EecCcCCccCCccEEEEEeccccEEEEEeeceEEEe Confidence 58999999998875322233555444322 124565531 1110 00001111111111 1110 00000000 Q ss_pred CC--CCCeeEEEEeccCCCcCcccccccceEEEEEEEecCCccc Q lcl|NC_019448. 286 AP--QPAKVTATVETKQKGAFEDEEDRAGLSYKVVVNSDDAQSA 327 (463) Q Consensus 286 ap--~p~~vtat~~~~~~g~~~~~~~~a~ysYkV~a~s~~geS~ 327 (463) .+ .-..+.--+..--.|...++ .+-..+++.+...+.-++ T Consensus 366 d~~~~~~~~~~~~~~r~d~~v~~~--~a~~~l~~~aa~~~~~~~ 407 (407) T protein:vir:48 366 DPYTNKPFVGFYTTKRTGGMLVDS--QAIKLMKIGAATRQKAAA 407 (407) T ss_pred eccccCCcEEEEEEEEeccEEecc--cceEEEEeeccCCCCCCC Confidence 00 00000000000001111110 011222222222211111 No 91 >protein:vir:101650 Length: 497 # NCBI annotation: gp13 # Family: family:all:585 # MgeID: mge:1515 # MgeName: 244 # Cross-refs: genbank:acc:YP_654768;genbank:gi:109302766;genbank:GeneID:4156084 Probab=93.72 E-value=0.0066 Score=32.54 Aligned_cols=278 Identities=14% Similarity=0.057 Sum_probs=120.2 Q ss_pred CCCCCc--cch-----HHH----HhhhhhhH-HHHHHhhcCCccCCccccCccccchhhhhhHhhhhhccccccchhhhc Q lcl|NC_019448. 1 MTIEKN--LSD-----VQQ----KYADQFQE-DVVKSFQTGYGITPDTQIDAGALRREILDDQITMLTWTNEDLIFYRDI 68 (463) Q Consensus 1 ~~~~~~--~~~-----~~~----~~~k~~~e-~~~Ks~~agy~~~p~~q~~gaalr~esLd~~i~~L~~~~~df~f~~~i 68 (463) +..+-+ ... ... ++.+..++ ...+++.+| .+..+|..+..+ +..+|..+. .+...+.+-+ T Consensus 114 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~gg~~vp~~-~~~~ii~~~--~~~~~i~~l~ 185 (497) T protein:vir:10 114 FDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFG-----STGTFAPGILPT-FLPGIVEQL--FYELSLADLI 185 (497) T ss_pred hhhhhhhhhhhhhhHHHHHHHHHHHhhhhhhHHHHHhhhcc-----cCcccccccchh-hhHHHHHHH--HhhhhHHhhc Confidence 100000 000 000 00000000 011222221 122334444544 444443222 3344556666 Q ss_pred ccchhhHHHhhhhhhhccCcccccccccccCcccccCcceEEEEEEEEEeechhhhhhhhhhhcccccHHHHHHHHHHHH Q lcl|NC_019448. 69 SRRPAQSTVVKYDQYLRHGNVGHSRFVKEIGVAPVSDPNIRQKTVSMKYVSDTKNMSIASGLVNNIADPSQILTEDAIAV 148 (463) Q Consensus 69 ~k~~~~stv~ey~~~~~hG~~g~~~fv~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lvn~~~Dp~~~~~~~ai~~ 148 (463) +..+..+--..|.+.+ ++.+...+++|++..+.+|+.+.+.....+=++.--.+|.-+ +.++ .+.+....+.-... T Consensus 186 ~~~~~~~~~~~~~~~~--~~~~~a~wv~E~~~~~~s~~~f~~i~~~~~k~a~~~~iS~el-l~d~-~~l~~~i~~~l~~~ 261 (497) T protein:vir:10 186 SSRPVTSPNLSYLTES--AAHNNAAAVAEAGTYPFSSEEFARVYEQVGKVANALTITDEG-LRDA-PELFNFVQGRLLEG 261 (497) T ss_pred cccccCCCceEEEEEc--CCCCcceeeccCcccccccccceeeEeeeeeeEeecHhHHHH-HHhH-HHHHHHHHHHHHHH Confidence 6666555433444443 334466799999999999999999999999999887777754 2333 45788888889999 Q ss_pred HHHHHHHHHhhcccccCCCccccccccccceeeecCc----------------ceE------------------------ Q lcl|NC_019448. 149 VAKTIEWASFYGDASLTSEVEGEGLEFDGLAKLIDKN----------------NVI------------------------ 188 (463) Q Consensus 149 ~~~~~E~a~fyGd~~l~~~~~~~gleFDGl~~lI~~~----------------nvi------------------------ 188 (463) +++.++.++++|+-.= +..|+.+..... +.+ T Consensus 262 i~~~~d~~~l~G~G~~---------~p~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 332 (497) T protein:vir:10 262 IQRKEEVQLLAGGGYP---------GVNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGR 332 (497) T ss_pred HHHHHHHHhhcCCCcc---------cccccccccccccccccccchhhhhhhhhhhhhhcccccchhhhhhHHHHHHHHH Confidence 9999999999998542 233333321100 000 Q ss_pred eccCC----------C----CCHHHHhhhhhhhhhcC-CceeEEecCHHHHHHHHHHhcCcceEEeecCCC--------- Q lcl|NC_019448. 189 NAKGN----------Q----LTEKHLNEAAVRIGKGF-GTATDAYMPIGVHADFVNSILGRQMQLMQDNSG--------- 244 (463) Q Consensus 189 DarG~----------~----ls~~~ln~aa~~i~~~~-G~~td~~m~~~vka~f~~~~~~~qrv~~~~n~g--------- 244 (463) +..|. . .....+.++...+...+ -.++-..|++.+.+.+...-=..-|.+.++..+ T Consensus 333 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vmn~~~~~~l~~lkd~~G~~i~~~~~~~~~~~~~~~ 412 (497) T protein:vir:10 333 VVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLRLTKDANGQYMGGNFFGNAYGNPVNG 412 (497) T ss_pred hhhhhhhhccchhccccchhhhhhHHHHHHhhhhhhcccCCCeEEEchHHHHHHHHhhcCCCceeccCcccccccccccC Confidence 00000 0 01122333444444343 344457789988888764321112222222111 Q ss_pred -Ccccceec--------CeeeecccccccCCcee-----------------ccCccccc--ccc--ccCCCCCCCCeeEE Q lcl|NC_019448. 245 -NVNTGYSV--------NGFYSSRGFIKLHGSTV-----------------MENELILD--ESL--QPLPNAPQPAKVTA 294 (463) Q Consensus 245 -~~~~G~~v--------~~~~s~~G~i~l~~s~~-----------------~~~d~~l~--~~~--~~~p~ap~p~~vta 294 (463) .--.|++| ..++ -|+.+.-.-.+ +..|.+-. +.+ +.+.++ -+-+.. T Consensus 413 ~~~l~G~pV~~t~~~~~~~~~--~Gd~~~~~~~i~~r~~~~v~~~~~~~~~f~~n~v~~r~~~r~~~~v~~p--~A~~~l 488 (497) T protein:vir:10 413 GKNIWGVPVVTTPLIPLGTIL--VGHFAPSVIQTARREGVTMQMTNSNGTDFVDGKVTVRAEERLGLLVYRP--SAFQLI 488 (497) T ss_pred CceeeceeeEecCCCCCCceE--EeecccceEEEEEecccEEEeecccchhhhcCcEEEEEEEeecceeecc--ccEEEE Confidence 01123332 0111 12111100001 11111110 011 111110 011112 Q ss_pred EEeccCCCc Q lcl|NC_019448. 295 TVETKQKGA 303 (463) Q Consensus 295 t~~~~~~g~ 303 (463) +..+..+++ T Consensus 489 ~~~~~~~~~ 497 (497) T protein:vir:10 489 QLKKGATGS 497 (497) T ss_pred EecCCccCC Confidence 222222221 No 92 >protein:vir:7855 Length: 497 # NCBI annotation: gp12 # Family: family:all:585 # MgeID: mge:150 # MgeName: CJW1 # Cross-refs: genbank:acc:NP_817462;genbank:gi:29565891;genbank:GeneID:1259081 Probab=93.72 E-value=0.0066 Score=32.54 Aligned_cols=278 Identities=14% Similarity=0.057 Sum_probs=120.2 Q ss_pred CCCCCc--cch-----HHH----HhhhhhhH-HHHHHhhcCCccCCccccCccccchhhhhhHhhhhhccccccchhhhc Q lcl|NC_019448. 1 MTIEKN--LSD-----VQQ----KYADQFQE-DVVKSFQTGYGITPDTQIDAGALRREILDDQITMLTWTNEDLIFYRDI 68 (463) Q Consensus 1 ~~~~~~--~~~-----~~~----~~~k~~~e-~~~Ks~~agy~~~p~~q~~gaalr~esLd~~i~~L~~~~~df~f~~~i 68 (463) +..+-+ ... ... ++.+..++ ...+++.+| .+..+|..+..+ +..+|..+. .+...+.+-+ T Consensus 114 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~gg~~vp~~-~~~~ii~~~--~~~~~i~~l~ 185 (497) T protein:vir:78 114 FDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFG-----STGTFAPGILPT-FLPGIVEQL--FYELSLADLI 185 (497) T ss_pred hhhhhhhhhhhhhhHHHHHHHHHHHhhhhhhHHHHHhhhcc-----cCcccccccchh-hhHHHHHHH--HhhhhHHhhc Confidence 100000 000 000 00000000 011222221 122334444544 444443222 3344556666 Q ss_pred ccchhhHHHhhhhhhhccCcccccccccccCcccccCcceEEEEEEEEEeechhhhhhhhhhhcccccHHHHHHHHHHHH Q lcl|NC_019448. 69 SRRPAQSTVVKYDQYLRHGNVGHSRFVKEIGVAPVSDPNIRQKTVSMKYVSDTKNMSIASGLVNNIADPSQILTEDAIAV 148 (463) Q Consensus 69 ~k~~~~stv~ey~~~~~hG~~g~~~fv~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lvn~~~Dp~~~~~~~ai~~ 148 (463) +..+..+--..|.+.+ ++.+...+++|++..+.+|+.+.+.....+=++.--.+|.-+ +.++ .+.+....+.-... T Consensus 186 ~~~~~~~~~~~~~~~~--~~~~~a~wv~E~~~~~~s~~~f~~i~~~~~k~a~~~~iS~el-l~d~-~~l~~~i~~~l~~~ 261 (497) T protein:vir:78 186 SSRPVTSPNLSYLTES--AAHNNAAAVAEAGTYPFSSEEFARVYEQVGKVANALTITDEG-LRDA-PELFNFVQGRLLEG 261 (497) T ss_pred cccccCCCceEEEEEc--CCCCcceeeccCcccccccccceeeEeeeeeeEeecHhHHHH-HHhH-HHHHHHHHHHHHHH Confidence 6666555433444443 334466799999999999999999999999999887777754 2333 45788888889999 Q ss_pred HHHHHHHHHhhcccccCCCccccccccccceeeecCc----------------ceE------------------------ Q lcl|NC_019448. 149 VAKTIEWASFYGDASLTSEVEGEGLEFDGLAKLIDKN----------------NVI------------------------ 188 (463) Q Consensus 149 ~~~~~E~a~fyGd~~l~~~~~~~gleFDGl~~lI~~~----------------nvi------------------------ 188 (463) +++.++.++++|+-.= +..|+.+..... +.+ T Consensus 262 i~~~~d~~~l~G~G~~---------~p~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 332 (497) T protein:vir:78 262 IQRKEEVQLLAGGGYP---------GVNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGR 332 (497) T ss_pred HHHHHHHHhhcCCCcc---------cccccccccccccccccccchhhhhhhhhhhhhhcccccchhhhhhHHHHHHHHH Confidence 9999999999998542 233333321100 000 Q ss_pred eccCC----------C----CCHHHHhhhhhhhhhcC-CceeEEecCHHHHHHHHHHhcCcceEEeecCCC--------- Q lcl|NC_019448. 189 NAKGN----------Q----LTEKHLNEAAVRIGKGF-GTATDAYMPIGVHADFVNSILGRQMQLMQDNSG--------- 244 (463) Q Consensus 189 DarG~----------~----ls~~~ln~aa~~i~~~~-G~~td~~m~~~vka~f~~~~~~~qrv~~~~n~g--------- 244 (463) +..|. . .....+.++...+...+ -.++-..|++.+.+.+...-=..-|.+.++..+ T Consensus 333 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vmn~~~~~~l~~lkd~~G~~i~~~~~~~~~~~~~~~ 412 (497) T protein:vir:78 333 VVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLRLTKDANGQYMGGNFFGNAYGNPVNG 412 (497) T ss_pred hhhhhhhhccchhccccchhhhhhHHHHHHhhhhhhcccCCCeEEEchHHHHHHHHhhcCCCceeccCcccccccccccC Confidence 00000 0 01122333444444343 344457789988888764321112222222111 Q ss_pred -Ccccceec--------CeeeecccccccCCcee-----------------ccCccccc--ccc--ccCCCCCCCCeeEE Q lcl|NC_019448. 245 -NVNTGYSV--------NGFYSSRGFIKLHGSTV-----------------MENELILD--ESL--QPLPNAPQPAKVTA 294 (463) Q Consensus 245 -~~~~G~~v--------~~~~s~~G~i~l~~s~~-----------------~~~d~~l~--~~~--~~~p~ap~p~~vta 294 (463) .--.|++| ..++ -|+.+.-.-.+ +..|.+-. +.+ +.+.++ -+-+.. T Consensus 413 ~~~l~G~pV~~t~~~~~~~~~--~Gd~~~~~~~i~~r~~~~v~~~~~~~~~f~~n~v~~r~~~r~~~~v~~p--~A~~~l 488 (497) T protein:vir:78 413 GKNIWGVPVVTTPLIPLGTIL--VGHFAPSVIQTARREGVTMQMTNSNGTDFVDGKVTVRAEERLGLLVYRP--SAFQLI 488 (497) T ss_pred CceeeceeeEecCCCCCCceE--EeecccceEEEEEecccEEEeecccchhhhcCcEEEEEEEeecceeecc--ccEEEE Confidence 01123332 0111 12111100001 11111110 011 111110 011112 Q ss_pred EEeccCCCc Q lcl|NC_019448. 295 TVETKQKGA 303 (463) Q Consensus 295 t~~~~~~g~ 303 (463) +..+..+++ T Consensus 489 ~~~~~~~~~ 497 (497) T protein:vir:78 489 QLKKGATGS 497 (497) T ss_pred EecCCccCC Confidence 222222221 No 93 >protein:vir:96762 Length: 632 # NCBI annotation: putative phage-related protein # Family: family:all:21 # MgeID: mge:1628 # MgeName: VP882 # Cross-refs: genbank:acc:YP_001039818;genbank:gi:126010917;genbank:GeneID:5076272 Probab=93.60 E-value=0.007 Score=32.40 Aligned_cols=293 Identities=11% Similarity=0.073 Sum_probs=130.3 Q ss_pred CCCCCccchHHH----Hhhhh---------------hhHHHHHHhhcCCccCCccccCccccchhhhhhHhhhhhccccc Q lcl|NC_019448. 1 MTIEKNLSDVQQ----KYADQ---------------FQEDVVKSFQTGYGITPDTQIDAGALRREILDDQITMLTWTNED 61 (463) Q Consensus 1 ~~~~~~~~~~~~----~~~k~---------------~~e~~~Ks~~agy~~~p~~q~~gaalr~esLd~~i~~L~~~~~d 61 (463) +..-++...... .+.+. .++-..+++.++. +..+|..+.-|.+..++-.+-.. T Consensus 313 i~a~a~~~~~~a~~~~e~a~~~a~~~G~~arg~~~~~~~l~~ra~~~~t-----~~~gg~lvp~~~~~~~iie~lr~--- 384 (632) T protein:vir:96 313 INAAATGDWSKAGFEREVSLAIADASGKEARGFYMPHEVLVQRQLEKKT-----AGKGGELVATELLSEEFIDILRN--- 384 (632) T ss_pred HHhhhccchhhhhhhhHHHHHHHHhhhhhhhhhhhhHHHHHHhhhhccc-----ccccccccccccchHHHHHHHhh--- Confidence 000000000000 01100 1111223444432 11223233334443333222111 Q ss_pred cchhhhcccchhhHHHh--hhhhhhccCcccccccccccCcccccCcceEEEEEEEEEeechhhhhhhhhhhcccccHHH Q lcl|NC_019448. 62 LIFYRDISRRPAQSTVV--KYDQYLRHGNVGHSRFVKEIGVAPVSDPNIRQKTVSMKYVSDTKNMSIASGLVNNIADPSQ 139 (463) Q Consensus 62 f~f~~~i~k~~~~stv~--ey~~~~~hG~~g~~~fv~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lvn~~~Dp~~ 139 (463) -..+.++..+.+..... .|.+++ +.+...+++|++..+.+++.+.+.+...|=++.-..+|..+= .++.-|.+. T Consensus 385 ~s~i~~l~~~~~~~~~g~~~ip~~~---~~~~a~wv~E~~~~~~s~~~f~~i~l~~~k~~~~v~iS~ell-~ds~~~~~~ 460 (632) T protein:vir:96 385 KAIIGQMGARMLPGLVGDVDIPKKT---SGANFYWIGEDEDVQDSDFDFTTLSFSPKTIAGAVPVTRKLR-KQSSIHVEN 460 (632) T ss_pred cchhhhhcceEeecCCcceEEEEEe---CCceeEeecCCccccccccceeeEEeeeeEEEEehhhHHHHH-hccchHHHH Confidence 11222222222211111 233333 234567899999999999999999999999998877777653 233447788 Q ss_pred HHHHHHHHHHHHHHHHHHhhcccccCCCccccccccccceeeecCcceEeccCCCCCHHHHhhhhhhhhhcCCceeE--E Q lcl|NC_019448. 140 ILTEDAIAVVAKTIEWASFYGDASLTSEVEGEGLEFDGLAKLIDKNNVINAKGNQLTEKHLNEAAVRIGKGFGTATD--A 217 (463) Q Consensus 140 ~~~~~ai~~~~~~~E~a~fyGd~~l~~~~~~~gleFDGl~~lI~~~nviDarG~~ls~~~ln~aa~~i~~~~G~~td--~ 217 (463) ...++-...++..++.++++|+.. +. +-.|+.+.... +.+...+..++-+.|..+...+...++.... . T Consensus 461 ~i~~~l~~a~~~~~d~a~l~G~G~-~~-------~p~Gi~~~~~~-~~~~~~~~~~~~~~i~~~~~~i~~~~~~~~~~~~ 531 (632) T protein:vir:96 461 LIREDLIEGIGVALDLAMLTGTGL-AN-------DPVGLLNMTGV-PALTYPAGGVDWASVVDMETKISTFNADAGRLAY 531 (632) T ss_pred HHHHHHHHHHHHHHHHHhhcccCC-CC-------ccceeeecccc-cceecccccCCHHHHHHHHHHHhhcccccCccEE Confidence 888999999999999999999753 11 33455554332 2334444557777777777777777765543 4 Q ss_pred ecCHHHHHHHHHH-hcC-cceEEeecCCCCcccceecCeeeecccccccCCceeccC--ccccccccccCCCCCCCCeeE Q lcl|NC_019448. 218 YMPIGVHADFVNS-ILG-RQMQLMQDNSGNVNTGYSVNGFYSSRGFIKLHGSTVMEN--ELILDESLQPLPNAPQPAKVT 293 (463) Q Consensus 218 ~m~~~vka~f~~~-~~~-~qrv~~~~n~g~~~~G~~v~~~~s~~G~i~l~~s~~~~~--d~~l~~~~~~~p~ap~p~~vt 293 (463) .|++.+...+..- +.+ .-+.+.+++ .-.|+++- .+.. +. .+.+++.+ +.++ .. + ..+. T Consensus 532 ~~~~~~~~~l~~~~l~d~~G~~i~~~~---~l~G~pv~--~s~~--ip-~~~~~~gd~s~~~i-~~-~--------~~~~ 593 (632) T protein:vir:96 532 LTSVTQRGAAKKAQVFDNTGERIWQNN---EVNGYRAE--ASNQ--IP-ADTWIFGDWSQIVI-AM-W--------GVLD 593 (632) T ss_pred EEchhHHHHHHHHhccCCCCceeecCC---eecccceE--eccc--cc-cCcEEEeecceEEE-EE-e--------cceE Confidence 6888877766542 222 223444432 23354431 1111 00 12223221 1111 00 0 0111 Q ss_pred EEEeccCCCcCcccccccceEEEEEEEecCCccccccceee Q lcl|NC_019448. 294 ATVETKQKGAFEDEEDRAGLSYKVVVNSDDAQSAPSEEVTA 334 (463) Q Consensus 294 at~~~~~~g~~~~~~~~a~ysYkV~a~s~~geS~~S~~vt~ 334 (463) ..+.+-.....+...-.+...+-+.+. +-|+..-.-..| T Consensus 594 i~~~~~~~~~~~~v~~~~~~~~d~~v~--~~~af~~~k~~A 632 (632) T protein:vir:96 594 LKVDPYTKAASDGLVLRVFQDVDAGVR--RKEAFCIAKKGA 632 (632) T ss_pred EEEccccccccCceEEEEEeecCceee--chhhhhheeecC Confidence 222111111000000111122222222 222211111111 No 94 >protein:vir:4456 Length: 401 # NCBI annotation: Major capsid protein precursor # Family: family:all:21 # MgeID: mge:96 # MgeName: ST64B # Cross-refs: genbank:acc:NP_700379;genbank:gi:23505451;genbank:GeneID:955658 Probab=91.51 E-value=0.015 Score=30.51 Aligned_cols=295 Identities=12% Similarity=0.075 Sum_probs=134.9 Q ss_pred CCCC-----CccchHH-HH---hhhhhh-----HHHHHHhhcCCccCCccccCccccchhhhhhHhhhhhccccccchhh Q lcl|NC_019448. 1 MTIE-----KNLSDVQ-QK---YADQFQ-----EDVVKSFQTGYGITPDTQIDAGALRREILDDQITMLTWTNEDLIFYR 66 (463) Q Consensus 1 ~~~~-----~~~~~~~-~~---~~k~~~-----e~~~Ks~~agy~~~p~~q~~gaalr~esLd~~i~~L~~~~~df~f~~ 66 (463) +..| .+....+ .+ ++++.. +.-.|++++|. +..+|-.+..+ +.++|..+.. +...+.+ T Consensus 68 ~~~~~~~~~~~~~~e~~~a~~~~lr~~~~~~~~~~e~~a~~~~~-----~~~GG~~iP~~-~~~~ii~~~~--~~~~l~~ 139 (401) T protein:vir:44 68 LKRPARGAQNKVAAEHKDAFVGFLRKGREDGLRDLERKALQVGT-----DEDGGYAVPEE-LDRSILSLLK--DEVVMRQ 139 (401) T ss_pred hhccccccccchhHHHHHHHHHHHhhhhhhhhHHHHHHHhhcCC-----CCCCceeccHh-HHHHHHHHHH--hhhhhhh Confidence 2212 1111111 12 222111 11134454432 12223334444 4444432222 2223555 Q ss_pred hcccchhhHHHhhhhhhhccCcccccccccccCc-ccccCcceEEEEEEEEEeechhhhhhhhhhhcccccHHHHHHHHH Q lcl|NC_019448. 67 DISRRPAQSTVVKYDQYLRHGNVGHSRFVKEIGV-APVSDPNIRQKTVSMKYVSDTKNMSIASGLVNNIADPSQILTEDA 145 (463) Q Consensus 67 ~i~k~~~~stv~ey~~~~~hG~~g~~~fv~E~g~-~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lvn~~~Dp~~~~~~~a 145 (463) -....++.+....|.+.. ++. ...+++|++. ++..++.+.+....++=++.--.+|.-+ +.++..|.+....+.- T Consensus 140 ~~~~~~~~~~~~~~~~~~--~~~-~a~wv~E~~~~~~~~~~~~~~v~~~~~k~~~~~~iS~el-l~ds~~~l~~~i~~~l 215 (401) T protein:vir:44 140 EATVITVGGSDYKKLVNL--GGT-ASGWVGETDTRSQTATSRLGLIEPFMGEIYGNPQATQKM-LDDAFFNVEAWINSEL 215 (401) T ss_pred hceeeecCCCceEEEEec--CCc-cceeeccccccCccccccceeeeeehhheeeehhhhHHH-HhcchHHHHHHHHHHH Confidence 555556655544454443 222 3457999986 5577789999988888777766666642 2344668888888888 Q ss_pred HHHHHHHHHHHHhhcccccCCCccccccccccceeeec------------CcceEeccCCCCCHHHHhhhhhhhhhcCCc Q lcl|NC_019448. 146 IAVVAKTIEWASFYGDASLTSEVEGEGLEFDGLAKLID------------KNNVINAKGNQLTEKHLNEAAVRIGKGFGT 213 (463) Q Consensus 146 i~~~~~~~E~a~fyGd~~l~~~~~~~gleFDGl~~lI~------------~~nviDarG~~ls~~~ln~aa~~i~~~~G~ 213 (463) ...++..++.++++||-.= +-.|+.+... ...+.......++.+.|-.+...+...|.. T Consensus 216 a~ai~~~~~~~~l~G~G~~---------~p~Gil~~~~~~~~~~~~~~~~~~~~~t~~~~~~~~d~i~~~~~~l~~~~~~ 286 (401) T protein:vir:44 216 ATEFAEQEEIAFTTGDGTK---------KPKGFLAYESTEESDKARAFGKLQHIVSGEATAVTADAIIKLIYTLRKAHRT 286 (401) T ss_pred HHHHHHHHHhhhhccCCCC---------ccceeeccccccccccccccccccccccccccccCHHHHHHHHHhcchhhhc Confidence 8999999999999998652 2334443222 233444445556666666666666677755 Q ss_pred eeEEecCHHHHHHHHHHhcCcceEEeecCC--C--CcccceecCeeeeccc-ccccCCceeccCcc---ccccccccCCC Q lcl|NC_019448. 214 ATDAYMPIGVHADFVNSILGRQMQLMQDNS--G--NVNTGYSVNGFYSSRG-FIKLHGSTVMENEL---ILDESLQPLPN 285 (463) Q Consensus 214 ~td~~m~~~vka~f~~~~~~~qrv~~~~n~--g--~~~~G~~v~~~~s~~G-~i~l~~s~~~~~d~---~l~~~~~~~p~ 285 (463) ..-.+|++...+.+...-=...|.+.+++. | ..-.|++|- ++-.= ...-....++-.|. +....+. T Consensus 287 ~a~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~g~~~~l~G~PVv--~~~~~p~~~~~~~~i~~Gd~~~~~~i~~~~---- 360 (401) T protein:vir:44 287 GAKFMMNNNSLFAIRLLKDTEGNYLWRPGLELGQPSSLAGYGIA--ENEQMPDIAADAKAIAFGNFKRGYTIVDRI---- 360 (401) T ss_pred CCEEEEcHHHHHHHHHhhccCCceeecCCcCCCCCceecceeeE--EecCcCCccCCccEEEEeehhccEEEEEec---- Confidence 556899999998887532223355554432 2 124566642 11110 00011111110110 1000000 Q ss_pred CCCCCeeEEEEeccCCCcCcccccccceEEEEE----EEecCCccccccceeee Q lcl|NC_019448. 286 APQPAKVTATVETKQKGAFEDEEDRAGLSYKVV----VNSDDAQSAPSEEVTAT 335 (463) Q Consensus 286 ap~p~~vtat~~~~~~g~~~~~~~~a~ysYkV~----a~s~~geS~~S~~vt~T 335 (463) .++....+.. ...-..|++. ..-.+.+...--.+.+. T Consensus 361 -----~~~~~~~~~~--------~~~~v~~~a~~r~d~~~~~~~a~~~l~~~aa 401 (401) T protein:vir:44 361 -----GTRILRDPYT--------NKPFVGFYTTKRTGGMLVDSQAIKLLKIAAA 401 (401) T ss_pred -----ceEEeeeccc--------cCCcEEEEEEEEeccEEecccceEEEEeecC Confidence 0111110100 0001111111 00111111110111111 No 95 >protein:vir:6242 Length: 390 # NCBI annotation: gp36 # Family: family:all:21 # MgeID: mge:131 # MgeName: phi-BT1 # Cross-refs: genbank:acc:NP_813696;swissprot:trembl:q859c1;genbank:gi:29366756;interpro:IPR006444;uniprot:Q859C1;genbank:GeneID:1258897 Probab=90.90 E-value=0.018 Score=30.09 Aligned_cols=301 Identities=11% Similarity=0.029 Sum_probs=126.4 Q ss_pred CCCCCccchH-----HHHhhhhhhHHHHHHhhc-CCccCCccccCccccchhhhhhHhhhhhccccccchhhhcccchhh Q lcl|NC_019448. 1 MTIEKNLSDV-----QQKYADQFQEDVVKSFQT-GYGITPDTQIDAGALRREILDDQITMLTWTNEDLIFYRDISRRPAQ 74 (463) Q Consensus 1 ~~~~~~~~~~-----~~~~~k~~~e~~~Ks~~a-gy~~~p~~q~~gaalr~esLd~~i~~L~~~~~df~f~~~i~k~~~~ 74 (463) ......-... .+++.......-.+++.. .-.....+..+|+-+..+...+.|..+... ...++.+..+--. T Consensus 74 ~~~~~~~~~~~~~~~~~~~~r~~~~~~~r~~~~~~~~~~~t~~~~g~~~~~~~~~~~i~~~~~~---~~~l~~~~~~~~~ 150 (390) T protein:vir:62 74 LQGSGSGAQRSADVDDDATLRAGNLGEARSFEFAPEKRDGTKAGNPNVLSRTLYGQLIAQAVER---SAIMRGGATTFTT 150 (390) T ss_pred cccccccchhhcchHHHHHHhhhhhhhhHHHHhhhhhhcccccCCCccccccchHHHHHHHHhh---hhhhhhcceeeec Confidence 1111100000 011111111111122111 111111223355666766666666554432 2333333222111 Q ss_pred HHHh--hhhhhhccCcccccccccccCcccccCcceEEEEEEEEEeechhhhhhhhhhhcccccHHHHHHHHHHHHHHHH Q lcl|NC_019448. 75 STVV--KYDQYLRHGNVGHSRFVKEIGVAPVSDPNIRQKTVSMKYVSDTKNMSIASGLVNNIADPSQILTEDAIAVVAKT 152 (463) Q Consensus 75 stv~--ey~~~~~hG~~g~~~fv~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lvn~~~Dp~~~~~~~ai~~~~~~ 152 (463) +.-. .+.+.. +.....+++|++..+.+++.+.+.....+=++.--.+|.-+= .++.-|.+....+.-...+++. T Consensus 151 ~~~~~~~~p~~~---~~~~a~wv~E~~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell-~ds~~~l~~~i~~~l~~~i~~~ 226 (390) T protein:vir:62 151 SDANPLDFTVIT---GRSSASIVGETAEIPESYPATAQRSMGGFKYGFASVVSYEFA-TDQVLDLVGFLVSDAGPAIGDA 226 (390) T ss_pred CCCceeEEEEEc---CCcceeeecccccccccccceeeeEeeeeeEEeehHHHHHHH-hhhhHHHHHHHHHHHHHHHHHH Confidence 1111 223332 234577899999999999999999999988887777775442 3444577888888888999999 Q ss_pred HHHHHhhcccccCCCccccccccccceeeecC-cc-eEeccCCCCCHHHHhhhhhhhhhcCCceeEEecCHHHHHHHHHH Q lcl|NC_019448. 153 IEWASFYGDASLTSEVEGEGLEFDGLAKLIDK-NN-VINAKGNQLTEKHLNEAAVRIGKGFGTATDAYMPIGVHADFVNS 230 (463) Q Consensus 153 ~E~a~fyGd~~l~~~~~~~gleFDGl~~lI~~-~n-viDarG~~ls~~~ln~aa~~i~~~~G~~td~~m~~~vka~f~~~ 230 (463) ++.++++|+-+ | -|+.+.... .+ +.......++.+.|-.+-..+..+|-.---.+|++.+.+.++.. T Consensus 227 ~d~~~l~G~G~--p---------~Gi~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~l~~~~~~~a~~vmn~~~~~~L~~l 295 (390) T protein:vir:62 227 MGRHFITGTGQ--P---------RGILTDASPATATFLATDTDSKVSDALIDLFHEVPSAYRANAKYVVNDLRAAQMRKL 295 (390) T ss_pred HHhhhhccCCc--c---------ccccccccccccceecccccccchHHHHHHHHhhhhhhhcCCEEEEchHHHHHHHHh Confidence 99999999742 2 355554432 12 22222234555554444334455663222479999999988742 Q ss_pred hcCcceEEeecCC--C--CcccceecCeeeecccccccCCceeccC--ccccccccccCCCCCCCCeeEEEEeccCCCcC Q lcl|NC_019448. 231 ILGRQMQLMQDNS--G--NVNTGYSVNGFYSSRGFIKLHGSTVMEN--ELILDESLQPLPNAPQPAKVTATVETKQKGAF 304 (463) Q Consensus 231 ~~~~qrv~~~~n~--g--~~~~G~~v~~~~s~~G~i~l~~s~~~~~--d~~l~~~~~~~p~ap~p~~vtat~~~~~~g~~ 304 (463) ==..-|.+.+++. | ..-.|++|- .+.. +- .+.+++.+ ..++... ..+............ T Consensus 296 kd~~g~~l~~~~~~~g~~~~l~G~Pv~--~~~~--~p-~~~i~~gd~s~~~i~~~----------~~~~v~~~~~~~~~~ 360 (390) T protein:vir:62 296 KDANGQYLWQSGLTVGAPSLFNGKVVE--TDDG--MP-ADKILFADLSKYRVRFA----------GSLRVDRSVDAKFST 360 (390) T ss_pred hccCCCeeecCCcCCCccceecccceE--EecC--CC-CccEEEeeccceeEEee----------cceEEEeeccccccC Confidence 2122234443322 1 123455431 1100 00 00111110 0000000 000111101100000 Q ss_pred cccccccceEEEEEEEecCCccccccceeeeecC Q lcl|NC_019448. 305 EDEEDRAGLSYKVVVNSDDAQSAPSEEVTATVSN 338 (463) Q Consensus 305 ~~~~~~a~ysYkV~a~s~~geS~~S~~vt~Tva~ 338 (463) ......+.+.+=..+++..+ .. ..+++.++ T Consensus 361 ~~~~~~~~~r~d~~~~~~~A--~~--~l~~~~~a 390 (390) T protein:vir:62 361 DQIVYRFLQRADGLLVDARG--AK--VLTVTPGA 390 (390) T ss_pred CcEEEEEEEEeCcEeechhh--eE--EEEeecCC Confidence 00000011111111111111 11 11111111 No 96 >protein:vir:100247 Length: 425 # NCBI annotation: gp76 # Family: family:all:21 # MgeID: mge:1619 # MgeName: Bcep176 # Cross-refs: genbank:acc:YP_355412;genbank:gi:77864702;genbank:GeneID:3725969 Probab=90.73 E-value=0.019 Score=29.98 Aligned_cols=306 Identities=13% Similarity=0.043 Sum_probs=136.1 Q ss_pred CCCCCccchHHHHhhhhhh-HHHHHHhhcCCccCCccccCccccchhhhhhHhhhhhccccccchhhhcccchhhHHHhh Q lcl|NC_019448. 1 MTIEKNLSDVQQKYADQFQ-EDVVKSFQTGYGITPDTQIDAGALRREILDDQITMLTWTNEDLIFYRDISRRPAQSTVVK 79 (463) Q Consensus 1 ~~~~~~~~~~~~~~~k~~~-e~~~Ks~~agy~~~p~~q~~gaalr~esLd~~i~~L~~~~~df~f~~~i~k~~~~stv~e 79 (463) -..+.+-.....+|...+. .++.+++++|. ..+||.|--+.+..+|..+. .+...+++-....++.+.-.. T Consensus 104 ~~~~~~~~~~~~af~~~l~~~e~~~al~~~t------~~~gG~lvP~~~~~~ii~~~--~~~s~l~~l~~~~~~~~~~~~ 175 (425) T protein:vir:10 104 GVKPLRDPEYTEAFKAHVKRGDVQAALNKGE------DSEGGYLTPIEWDRTITNKL--VLISPMRQLCRVQPVSKAGFS 175 (425) T ss_pred cccccccHHHHHHHHHHhhhhhhHHHhhcCc------CCCCceeccHhHHHHHHHHH--HhhhhhhhhceeeeccCCceE Confidence 1111111111122222221 33455666542 34455554455555553333 233345555555555554445 Q ss_pred hhhhhccCcccccccccccCcc-cccCcceEEEEEEEEEeechhhhhhhhhhhcccccHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_019448. 80 YDQYLRHGNVGHSRFVKEIGVA-PVSDPNIRQKTVSMKYVSDTKNMSIASGLVNNIADPSQILTEDAIAVVAKTIEWASF 158 (463) Q Consensus 80 y~~~~~hG~~g~~~fv~E~g~~-~~~d~~~~r~~~~~k~l~~~~~vs~~~~lvn~~~Dp~~~~~~~ai~~~~~~~E~a~f 158 (463) |.+.... ....+++|++.. +...+.+.+.....+=++.-..+|.-+ +.++.-|.+....+.-...++..++.+++ T Consensus 176 ~~~~~~~---~~a~wv~E~~~~~~~~~~~f~~v~~~~~k~~~~i~iS~el-l~ds~~~l~~~i~~~la~ai~~~~d~~~l 251 (425) T protein:vir:10 176 KLFNMGG---TTSGWVGEASQRPQTNAATFQPLSFASGEIYANPAATQQI-LDDAEIDLESWLATEVQTEFAKQEGKAFL 251 (425) T ss_pred EEEEcCC---cceeeeccccccccccccccceeeeeheeeEeehHhHHHH-HhcchhHHHHHHHHHHHHHHHHHHHhhhh Confidence 5554432 356789999874 456689999988888777766666543 23345577889999999999999999999 Q ss_pred hcccccCCCccccccccccceeeecC------------cceEeccCCCCCHHHHhhhhhhhhhcCCceeEEecCHHHHHH Q lcl|NC_019448. 159 YGDASLTSEVEGEGLEFDGLAKLIDK------------NNVINAKGNQLTEKHLNEAAVRIGKGFGTATDAYMPIGVHAD 226 (463) Q Consensus 159 yGd~~l~~~~~~~gleFDGl~~lI~~------------~nviDarG~~ls~~~ln~aa~~i~~~~G~~td~~m~~~vka~ 226 (463) +||-.- +-.|+.+.+.. +.+.......++-+.|-++...+...|-...-.+|++.+.+. T Consensus 252 ~G~G~~---------~p~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~l~~l~~~l~~~~~~~a~~vmn~~~~~~ 322 (425) T protein:vir:10 252 AGDGTN---------KPNGLLTYIAGGANAAKHPFGAIEVVNSGAAADITSDGIIDLVYDLPSAFTGNARFAMNRNTQRQ 322 (425) T ss_pred cccCCC---------CcceeeeccccccccccccccccccccccccccccHHHHHHHHhhhhhhhccCCEEEEchHHHHH Confidence 998642 34566664431 111222334455555555555566677444457899999988 Q ss_pred HHHHhcCcceEEeecCCCCcccceecCeeeecccccccCCceeccCccccccccccCCCCCCCCeeEEEEeccCCCcCcc Q lcl|NC_019448. 227 FVNSILGRQMQLMQDNSGNVNTGYSVNGFYSSRGFIKLHGSTVMENELILDESLQPLPNAPQPAKVTATVETKQKGAFED 306 (463) Q Consensus 227 f~~~~~~~qrv~~~~n~g~~~~G~~v~~~~s~~G~i~l~~s~~~~~d~~l~~~~~~~p~ap~p~~vtat~~~~~~g~~~~ 306 (463) +...-=..-|.+.+++..... +.+++..+...... .|... +... .-.|+| T Consensus 323 L~~lkD~~G~~l~~~~~~~g~------------------~~~l~G~PV~~~~~------~p~~~---~~~~---~i~~Gd 372 (425) T protein:vir:10 323 VRKLKDGQGNYLWQPSYVAGQ------------------PATLAGYPVTEVPD------MPDVA---ANST---PILFGD 372 (425) T ss_pred HHHhhcCCCceeeccCccCCC------------------CceecceeeEEecC------cCCcc---CCcc---EEEEEe Confidence 874332222444433222110 11122222211111 11000 0000 001111 Q ss_pred cccccceEEEEEEEecCCccccccceeeeecCCCCceEEEE--EecCCCCCCcceEEEEeecCC Q lcl|NC_019448. 307 EEDRAGLSYKVVVNSDDAQSAPSEEVTATVSNVDDGVKLSI--SVNAMYQQQPQFVSIYRQGKE 368 (463) Q Consensus 307 ~~~~a~ysYkV~a~s~~geS~~S~~vt~Tva~~~~gv~ltI--t~~a~~g~~~~~y~IYR~~~~ 368 (463) .. ..|.+ +.+.+-...... .. ..+-+.+.. -.....-.+..+..+--++.. T Consensus 373 ~~----~~~~i--~~~~~~~v~~d~----~~-~~~~~~~~~~~r~d~~v~~~~A~~~l~~~as~ 425 (425) T protein:vir:10 373 FQ----QTYLI--IDRIGVRVLRDP----YT-AKPYVLFYTTKRVGGGLLNPEPMRAMKVAASE 425 (425) T ss_pred hh----ccEEE--EEecceEEEecc----cc-cCCcEEEEEEEEeccEeecccceEEEEeeccC Confidence 11 00221 111111100000 00 001111111 111100000011111111111 No 97 >protein:vir:80128 Length: 466 # NCBI annotation: Phage capsid protein # Family: family:all:635 # MgeID: mge:1877 # MgeName: bacteriophage bv1 # Cross-refs: genbank:acc:YP_001425603;genbank:gi:155042936;genbank:GeneID:5469556 Probab=87.57 E-value=0.038 Score=28.38 Aligned_cols=288 Identities=10% Similarity=0.069 Sum_probs=112.9 Q ss_pred CCCCCccchHHHHh-----------hhhh----------hHHHHHHhhcC--CccCCccccCcccc-chhhhhhHhhhhh Q lcl|NC_019448. 1 MTIEKNLSDVQQKY-----------ADQF----------QEDVVKSFQTG--YGITPDTQIDAGAL-RREILDDQITMLT 56 (463) Q Consensus 1 ~~~~~~~~~~~~~~-----------~k~~----------~e~~~Ks~~ag--y~~~p~~q~~gaal-r~esLd~~i~~L~ 56 (463) -+.|+......... .+.. .+++.+.+... -.....+-.+|+.+ .-+..+.-+..|. T Consensus 95 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~vP~~~~~~i~~~l~ 174 (466) T protein:vir:80 95 NSEPAQVSGARTQQFVGGETRMKGFFRNMPYEQRAALIARSEVKEFLAQVRTLAQQKRAVSGAELTIPDVMLELLRDNMH 174 (466) T ss_pred CchhHHHHhhhhhHHhhHHHHHHHHHHhhhhhhHHHHHHHHHHHHHHHHHHHHhhhhhhhccccccccHHHHHHHHHhhh Confidence 11111111111000 0000 00000000000 00111122334443 3333332222232 Q ss_pred ccccccchhhhcccchhhHHHhhhhhhhccCcccccccccccCcccccCcceEEEEEEEEEeechhhhhhhhhhhccccc Q lcl|NC_019448. 57 WTNEDLIFYRDISRRPAQSTVVKYDQYLRHGNVGHSRFVKEIGVAPVSDPNIRQKTVSMKYVSDTKNMSIASGLVNNIAD 136 (463) Q Consensus 57 ~~~~df~f~~~i~k~~~~stv~ey~~~~~hG~~g~~~fv~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lvn~~~D 136 (463) +...+.+.+...++..++ .+.. ++....+.+++|++..+..|+.+.+....++=++.--.+|.-+- .++..| T Consensus 175 ---~~~~l~~~~~v~~~~g~~-~~~~---~~~~~~a~wv~E~~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell-~ds~~~ 246 (466) T protein:vir:80 175 ---RYSKLISKVRLRPLKGTA-RQNI---AGAIPEGVWTEAVANLNELSLSFSQIEVDGYKVGGFIPIPNSTL-EDSDLN 246 (466) T ss_pred ---hhhhhhhheeeeecCcee-Eeee---ecCCcceeecccccccccccccccceeecceeeeeehhhhHHHH-hcchHH Confidence 222455555555554332 3332 33444567899999999999999998888887776666655432 345568 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhhcccccCCCccccccccccceeeec-------------------CcceEeccCCCCC- Q lcl|NC_019448. 137 PSQILTEDAIAVVAKTIEWASFYGDASLTSEVEGEGLEFDGLAKLID-------------------KNNVINAKGNQLT- 196 (463) Q Consensus 137 p~~~~~~~ai~~~~~~~E~a~fyGd~~l~~~~~~~gleFDGl~~lI~-------------------~~nviDarG~~ls- 196 (463) .+....+.-...++..++.+++.||-.=. --|+.+-.. ....+++..-.-+ T Consensus 247 l~~~i~~~la~~~~~~~~~ail~G~G~~~---------P~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 317 (466) T protein:vir:80 247 LADEILDAIGQAIGFALDKAILYGTGTKM---------PVGIVTRLAQTTQPPNWGTKAPAWTNLSTTNLLKIDPTGKSA 317 (466) T ss_pred HHHHHHHHHHHHHHHHHhhheeeccCCCC---------cceeeecccccccccccccccccccccchhhhhhhhhhccch Confidence 89999999999999999999999986422 224443221 1111111000000 Q ss_pred HHHHhh---hhhhhhhcCCceeEEecCHHH-HHHHHHHh----cCcceEEeecCCCCcccceecC--------------- Q lcl|NC_019448. 197 EKHLNE---AAVRIGKGFGTATDAYMPIGV-HADFVNSI----LGRQMQLMQDNSGNVNTGYSVN--------------- 253 (463) Q Consensus 197 ~~~ln~---aa~~i~~~~G~~td~~m~~~v-ka~f~~~~----~~~qrv~~~~n~g~~~~G~~v~--------------- 253 (463) ...+.. +.-....+++.+.++|+++.. ...+.... -+.+.+..+ +.+..-.|.+|- T Consensus 318 ~~~~~~~~~~~~~~~~~~~~~~~~w~~~~~~~~~l~~~~~~~~~~g~~~~~~-~~~~~i~G~pvv~s~~~~~~~~~~g~~ 396 (466) T protein:vir:80 318 EEFFSELVLKLSKARANYSNGMKFWAMSSNTHAVLMSKAITFNSAGALVASL-NNTMPIVGGDIVILDFIPDNDIIGGYG 396 (466) T ss_pred hhHHHHHHHHHHhhhccccCCceeEEecchhHHHhhcccccccCCccccccC-CCcccccccceeecCccCccceeeecc Confidence 000111 112223457888888876533 33332211 111222111 222222343320 Q ss_pred -e-eeecccccccC--CceeccCcccccccccc------CCCCCCCC-------eeEEEEeccCCCcCcccccccceEEE Q lcl|NC_019448. 254 -G-FYSSRGFIKLH--GSTVMENELILDESLQP------LPNAPQPA-------KVTATVETKQKGAFEDEEDRAGLSYK 316 (463) Q Consensus 254 -~-~~s~~G~i~l~--~s~~~~~d~~l~~~~~~------~p~ap~p~-------~vtat~~~~~~g~~~~~~~~a~ysYk 316 (463) . ++.-++.+.+. ....+..|.+.-+.... .|+|+... .++.++.|+-.-. T Consensus 397 ~~y~i~~r~~~~i~~~~~~~f~~d~~~~r~~~r~dg~~~~~~afv~~~~~~~~~~~~~~~~~~~~~~------------- 463 (466) T protein:vir:80 397 SLYLLAERADIKLAQSEHVRFIEDQTVFKGTARYDGKPVFGEGFVAVNIANANPTTSITFAPDEANV------------- 463 (466) T ss_pred ccEEEEeecceEEEechhhhhhcCcEEEEEEEEEccEEeccCceEEEEecCCCcccceeeecCcCcC------------- Confidence 0 00111111111 11111122221111111 11111110 0111111221111 Q ss_pred EEEEecCCccccccceeeee Q lcl|NC_019448. 317 VVVNSDDAQSAPSEEVTATV 336 (463) Q Consensus 317 V~a~s~~geS~~S~~vt~Tv 336 (463) --| T Consensus 464 -----------------~~~ 466 (466) T protein:vir:80 464 -----------------PEV 466 (466) T ss_pred -----------------CCC Confidence 111 No 98 >protein:vir:4511 Length: 409 # NCBI annotation: capsid # Family: family:all:21 # MgeID: mge:97 # MgeName: V # Cross-refs: genbank:acc:NP_599037;genbank:gi:19548995;genbank:GeneID:935211 Probab=87.37 E-value=0.039 Score=28.30 Aligned_cols=311 Identities=9% Similarity=-0.018 Sum_probs=128.9 Q ss_pred CCCCCccc----hHHHHhhhhh-------hHHHHHHhhcCCccCCccccCcc-ccchhhhhhHhhhhhccccccchhhhc Q lcl|NC_019448. 1 MTIEKNLS----DVQQKYADQF-------QEDVVKSFQTGYGITPDTQIDAG-ALRREILDDQITMLTWTNEDLIFYRDI 68 (463) Q Consensus 1 ~~~~~~~~----~~~~~~~k~~-------~e~~~Ks~~agy~~~p~~q~~ga-alr~esLd~~i~~L~~~~~df~f~~~i 68 (463) -..+.+.. ....++.+.+ ..+-.|.+..--+....+..+|| .+..|...+-+..| ... ..+.+-+ T Consensus 75 ~~~~~~~~~~~~~~~~a~~~~l~~~~~~~~~~e~~~~~~~~a~~~~~~~~gg~liP~~~~~~ii~~~-~~~--~~l~~~~ 151 (409) T protein:vir:45 75 NLDPENNSQQDEKRAQVFDKWMRHGASELTSEERKALRELRAQGVAQDEKGGYTVPETFLAKVVEKM-KSY--GGIASVA 151 (409) T ss_pred cCCCCCcchhhHHHHHHHHHHHHhhhhhccHHHHHHHHHHhhccCccCcCCceeccHhHHHHHHHHH-Hhh--hhhhhhc Confidence 11111111 1111222111 11122333211111111222344 46666554444333 222 2333333 Q ss_pred ccchhhHHHhhhhhhhccC-cccccccccccCcccccCcceEEEEEEE-EEeechhhhhhhhhhhcccccHHHHHHHHHH Q lcl|NC_019448. 69 SRRPAQSTVVKYDQYLRHG-NVGHSRFVKEIGVAPVSDPNIRQKTVSM-KYVSDTKNMSIASGLVNNIADPSQILTEDAI 146 (463) Q Consensus 69 ~k~~~~stv~ey~~~~~hG-~~g~~~fv~E~g~~~~~d~~~~r~~~~~-k~l~~~~~vs~~~~lvn~~~Dp~~~~~~~ai 146 (463) ...+..+- .+..+...+ ....+.+++|++..+.+++.+....... |+.+.--.+|.-+ +.++.-|.+....+.-- T Consensus 152 ~~~~~~~~--~~~~~~~~~~~~~~~~~v~E~~~~~~~~~~f~~~~l~~~k~~~~~i~is~el-l~ds~~~l~~~i~~~la 228 (409) T protein:vir:45 152 QILTTSDG--RTMEWATADGTSEVGVLLGENEEAGEEDTDFGMGSLGALKMTSKIIRVSNEL-LQDSAIDMEAYLARRIA 228 (409) T ss_pred eeeecCCC--ceEEEEeeccCccccccccccccccccccccceeeeeeeeeeeeehhhhHHH-HhccHHHHHHHHHHHHH Confidence 33333221 111122222 2334679999999999999999888654 5544323344332 13344577888888888 Q ss_pred HHHHHHHHHHHhhcccccCCCccccccccccceeeecCcceEeccCCCCCHHHHhhhhhhhhhcC-CceeE-EecCHHHH Q lcl|NC_019448. 147 AVVAKTIEWASFYGDASLTSEVEGEGLEFDGLAKLIDKNNVINAKGNQLTEKHLNEAAVRIGKGF-GTATD-AYMPIGVH 224 (463) Q Consensus 147 ~~~~~~~E~a~fyGd~~l~~~~~~~gleFDGl~~lI~~~nviDarG~~ls~~~ln~aa~~i~~~~-G~~td-~~m~~~vk 224 (463) ..+...++.++++|+..-.+ .+..|+.+.......... ...++.+.|-++...+..+| ..+.- ++|+..+. T Consensus 229 ~a~~~~~~~a~l~G~G~~~~------~~p~Gil~~~~~~~~~~~-~~~~~~d~i~~l~~~l~~~~~~~a~~~~~~n~~~~ 301 (409) T protein:vir:45 229 ERIGRGEARYLIQGTGAGTP------KQPKGLAASVTGTTQTAA-ANAVKWQEILALKHSIDPAYRRGPKFRLAFNDNTL 301 (409) T ss_pred HHHHHHHHHHhhccCCCCCc------cccceeeecccccccccc-ccccchHHHHHHHHhhhhhhccCCeEEEEECHHHH Confidence 89999999999999976433 366788775554444433 34566666666666666666 44444 46799998 Q ss_pred HHHHHHhcCcceEEeecCCCC----cccceecCeeeecc-cccccCCcee-ccC--ccccccccccCCCCCCCCeeEEEE Q lcl|NC_019448. 225 ADFVNSILGRQMQLMQDNSGN----VNTGYSVNGFYSSR-GFIKLHGSTV-MEN--ELILDESLQPLPNAPQPAKVTATV 296 (463) Q Consensus 225 a~f~~~~~~~qrv~~~~n~g~----~~~G~~v~~~~s~~-G~i~l~~s~~-~~~--d~~l~~~~~~~p~ap~p~~vtat~ 296 (463) +.+...-=..-|.+.+++... .-.|++|= .+.. -.+.....++ +.+ +.++.+ ..... ... T Consensus 302 ~~l~~lkd~~G~~i~~~~~~~~~~~~l~G~PV~--~~~~~p~~~~~~~~i~~Gd~~~~~i~~--------~~~~~--~~~ 369 (409) T protein:vir:45 302 KLISEMEDGQGRPLWLPDIVGVAPASVLNVPYV--IDQEIDDIGAGKKFMFCGDFDRFIIRR--------VRYMI--LKR 369 (409) T ss_pred HHHHHhhcCCCceeeccCcCCCCCceecceeeE--EecCcCCccCCccEEEEeehhhhheee--------ccceE--EEE Confidence 888743222333343332211 12233221 1000 0000000000 000 000000 00000 000 Q ss_pred eccCCCcCcccccccceEEEEEEEecCCccccccceeeeecCCCCceEEEEEecCCCCC Q lcl|NC_019448. 297 ETKQKGAFEDEEDRAGLSYKVVVNSDDAQSAPSEEVTATVSNVDDGVKLSISVNAMYQQ 355 (463) Q Consensus 297 ~~~~~g~~~~~~~~a~ysYkV~a~s~~geS~~S~~vt~Tva~~~~gv~ltIt~~a~~g~ 355 (463) .... ++ ......|++...-+.+ +...++=+.+++.. .+|+ T Consensus 370 ~~d~---~~---~~~~~~~~~~~r~d~~-----------~~~~~A~~~l~~k~--s~~~ 409 (409) T protein:vir:45 370 LVER---YA---EYDQTGFLAFHRFDCI-----------LEDTSAIKALVGKG--SVGG 409 (409) T ss_pred eecc---cc---cCCcEEEEEEEEeccE-----------eechhheEEEEecc--CCCC Confidence 0000 00 0001122222211111 11111112222221 1122 No 99 >protein:vir:6212 Length: 434 # NCBI annotation: prohead protease # Family: family:all:21 # MgeID: mge:128 # MgeName: phBC6A52 # Cross-refs: genbank:acc:NP_852592;genbank:gi:31415852;genbank:GeneID:1489210 Probab=86.56 E-value=0.045 Score=27.98 Aligned_cols=304 Identities=10% Similarity=0.030 Sum_probs=131.3 Q ss_pred CCCCCccch-H-H--HHhhh-------------hhhHHHHHHhh---cCC-------ccCCccccCccccchhhhhhHhh Q lcl|NC_019448. 1 MTIEKNLSD-V-Q--QKYAD-------------QFQEDVVKSFQ---TGY-------GITPDTQIDAGALRREILDDQIT 53 (463) Q Consensus 1 ~~~~~~~~~-~-~--~~~~k-------------~~~e~~~Ks~~---agy-------~~~p~~q~~gaalr~esLd~~i~ 53 (463) ...+..... . . .++.+ ....+..++|. +|. ..+-.+ .+||.|--+.+...|. T Consensus 85 ~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~e~r~a~~~~l~~~~~~~e~~a~~~~t-~~GG~lvP~~~~~~Ii 163 (434) T protein:vir:62 85 KENPNEKTELSEEQRSAISASIAAALSTKGHRTNKETEIRSVFANYIVGNIDEKEARALGLVT-GNGSVTIPDFLSKEII 163 (434) T ss_pred hcchhhhHHHHHHHHHHHHHHHHhhhhhccccchHHHHHHHHHHHHhccccchhhhhhhcccc-cccceecchhhHHHHH Confidence 000000000 0 0 00000 01122233332 110 000011 2456654455555553 Q ss_pred hhhccccccchhhhcccchhhHHHhhhhhhhccCcccccccccccCcccccCcceEEEEEEEEEeechhhhhhhhhhhcc Q lcl|NC_019448. 54 MLTWTNEDLIFYRDISRRPAQSTVVKYDQYLRHGNVGHSRFVKEIGVAPVSDPNIRQKTVSMKYVSDTKNMSIASGLVNN 133 (463) Q Consensus 54 ~L~~~~~df~f~~~i~k~~~~stv~ey~~~~~hG~~g~~~fv~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lvn~ 133 (463) .+.. ....+.+-..+.+..+. ..|.++...+.........|++..+.+|+.+.+.....+=++.-..+|.-+ +.++ T Consensus 164 ~~l~--~~~~i~~~~~~~~~~~~-~~~p~~~~~~~a~~~~~~~e~~~~~~~~~~f~~v~~~~~k~~~~~~iS~el-l~ds 239 (434) T protein:vir:62 164 TYAQ--EENFLRRLGTGVKTKEN-IKYPVLVKKAEAQGHKNERTNNEMPETDIEFDEIELSPTEFDALATVTKKL-LART 239 (434) T ss_pred Hhhh--hhhhhhhhcceeccCCc-eEEEEEecCCcccceecccccccccccccceeeEEeeheeeEeehhhHHHH-Hhcc Confidence 3332 22223333333333333 456666655444333445678888899999999999999888877776653 2344 Q ss_pred cccHHHHHHHHHHHHHHHHHHHHHhhcccccCCCccccccccccceeeecCcceEeccCCCCCHHHHhhhhhhhhhcCCc Q lcl|NC_019448. 134 IADPSQILTEDAIAVVAKTIEWASFYGDASLTSEVEGEGLEFDGLAKLIDKNNVINAKGNQLTEKHLNEAAVRIGKGFGT 213 (463) Q Consensus 134 ~~Dp~~~~~~~ai~~~~~~~E~a~fyGd~~l~~~~~~~gleFDGl~~lI~~~nviDarG~~ls~~~ln~aa~~i~~~~G~ 213 (463) .-|.+....+.-...+++.+|.+++.||-.=. ...|+.+ .+.......+. .+.+.|-++-..+...|.. T Consensus 240 ~~~l~~~i~~~la~~~~~~~d~~~l~G~G~~~--------~~~g~~~--~~~~~~~~~~~-~~~d~l~~l~~~l~~~~~~ 308 (434) T protein:vir:62 240 GLPIEQIVMDELKKAYVRKETQYMVNGDEANN--------INDGALA--KKAVEFKTDEK-NLYDALVKMKNTPVKEVRK 308 (434) T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHhccCCCCc--------cccceee--ccccccccccc-chhhHHHHHHhhcchhhhc Confidence 55788888999999999999999999986422 2344433 12222222222 2334444444455666644 Q ss_pred eeEEecCHHHHHHHHHHhcCcceEEeec-C---CCC--cccceecCeeeecccc-cccC-Ccee-cc--Ccccc-ccc-c Q lcl|NC_019448. 214 ATDAYMPIGVHADFVNSILGRQMQLMQD-N---SGN--VNTGYSVNGFYSSRGF-IKLH-GSTV-ME--NELIL-DES-L 280 (463) Q Consensus 214 ~td~~m~~~vka~f~~~~~~~qrv~~~~-n---~g~--~~~G~~v~~~~s~~G~-i~l~-~s~~-~~--~d~~l-~~~-~ 280 (463) -...+|++.+.+.+...=-..-|.+.++ + .|. .-.|++|- ++..-+ .... -.+| +. ...++ ++. . T Consensus 309 ~a~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~~~g~~~tl~G~pV~--~~~~~~~~~~~~~~~i~~Gdfs~~~i~~~~g~ 386 (434) T protein:vir:62 309 KARWVLNTAALTKIETMKTDDGFPLLRPFNQAEGGIGYTLLGFPVE--EEDAIDIPDSPDTPVFYFGDFSKFYIQDVIGS 386 (434) T ss_pred CCEEEEcHHHHHHHHHhhccCCCEeeccCCCccCCCCceecceeeE--EecCccCccCCCceEEEEeeccceEEEEeece Confidence 4457899999988874322222444432 2 121 24455531 110000 0000 0001 10 01010 000 0 Q ss_pred --ccC-CCCCC-CCe--eEEEEeccCCCcCcccccccceEEEEEEEecC Q lcl|NC_019448. 281 --QPL-PNAPQ-PAK--VTATVETKQKGAFEDEEDRAGLSYKVVVNSDD 323 (463) Q Consensus 281 --~~~-p~ap~-p~~--vtat~~~~~~g~~~~~~~~a~ysYkV~a~s~~ 323 (463) ... ..... --. .-+..--+++-- ..+...+.+++.+.+.+.. T Consensus 387 ~~i~~~~~~~~~~~~v~~~~~~r~Dgk~i-~~~~~~~~~~~~~~~~~~~ 434 (434) T protein:vir:62 387 LEVQKLVELFSRTNRVGFRIWNLLDAQLI-HSPFEVPVYKYVLKAPTGA 434 (434) T ss_pred eEEEeehhhhcccCceEEEEEeeecceee-cCcccceEEEEEeccCCCC Confidence 000 00010 011 122222222211 0122223333333322222 No 100 >protein:vir:8420 Length: 477 # NCBI annotation: gp15 # Family: family:all:21 # MgeID: mge:155 # MgeName: Omega # Cross-refs: genbank:acc:NP_818316;genbank:gi:29566752;genbank:GeneID:1260033 Probab=85.75 E-value=0.05 Score=27.69 Aligned_cols=306 Identities=12% Similarity=0.031 Sum_probs=121.5 Q ss_pred CCCCCccchHH-----HHhhhhh------------hHHHHHHh----------------hcCCccCCccccCccccchhh Q lcl|NC_019448. 1 MTIEKNLSDVQ-----QKYADQF------------QEDVVKSF----------------QTGYGITPDTQIDAGALRREI 47 (463) Q Consensus 1 ~~~~~~~~~~~-----~~~~k~~------------~e~~~Ks~----------------~agy~~~p~~q~~gaalr~es 47 (463) .+.+.+..... ..++..+ .+.+.+.. ..--..+..+..+|..+.-|. T Consensus 93 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~lv~~~~ 172 (477) T protein:vir:84 93 ATVEVNEALTYEKGNGQSYFRDLAMQTVGMADEPAKERLRRHMVDVESDKEIRKIAKVGEEYRDLDRNGGTGGYAVPPLW 172 (477) T ss_pred cccccccchhhhhhHHHHHHHHHHHHHhhhhhhHHHHHHHHHHhhhhhhhhHHHHHHhhhhhccccccCCCcceeeccch Confidence 11111111000 0000000 01111100 000111222233455555554 Q ss_pred hhhHhhhhhccccccchhhhcccchhhHHHhhhhhhhccCcccccccccccC-----cccccCcceEEEEEEEEEeechh Q lcl|NC_019448. 48 LDDQITMLTWTNEDLIFYRDISRRPAQSTVVKYDQYLRHGNVGHSRFVKEIG-----VAPVSDPNIRQKTVSMKYVSDTK 122 (463) Q Consensus 48 Ld~~i~~L~~~~~df~f~~~i~k~~~~stv~ey~~~~~hG~~g~~~fv~E~g-----~~~~~d~~~~r~~~~~k~l~~~~ 122 (463) +..+|..+.. ..-.+.+-+...+.......+..-...++...+..++|++ ..+.+|+.+.+....+|=++.-- T Consensus 173 ~~~~ii~~l~--~~~~i~~~~~~~~~~~~~~~~~ip~~~~~~~~a~~~~Eg~~~~~~~~~~s~~~f~~i~~~~~k~~~~~ 250 (477) T protein:vir:84 173 MMNRFIELAR--AGRTYANLCPTEPLPGGTSSINIPKILTGTSTAIQAADNAALTAPSAHEVDLTDGFVQANVKTIAGQQ 250 (477) T ss_pred hHHHHHHHhh--hcchHHHhhceeeecCCcceeEEEEEecCcceeeeeccCcccccccccccccceeeEEEeeeeEEeee Confidence 4444322221 1222333344444444333222111112222233577775 34678899999999998888877 Q ss_pred hhhhhhhhhcccccHHHHHHHHHHHHHHHHHHHHHhhcccccCCCccccccccccceeeecCcceEeccCCCCCHHH--- Q lcl|NC_019448. 123 NMSIASGLVNNIADPSQILTEDAIAVVAKTIEWASFYGDASLTSEVEGEGLEFDGLAKLIDKNNVINAKGNQLTEKH--- 199 (463) Q Consensus 123 ~vs~~~~lvn~~~Dp~~~~~~~ai~~~~~~~E~a~fyGd~~l~~~~~~~gleFDGl~~lI~~~nviDarG~~ls~~~--- 199 (463) .+|..+= .++.-|.+....+.-...+++.+|.++++|+-. + -+..||.+.-.. +.+++-+...+... T Consensus 251 ~iS~ell-~ds~~~l~~~i~~~l~~~~~~~~d~~~l~G~Gt---~-----~~p~Gi~~~~~~-~~~~~~~~~~t~~~~~~ 320 (477) T protein:vir:84 251 GIAIQLL-DQAAVSVDEFVFRDLAADYANKLNVQVISGTGS---N-----NQVVGVRATAGI-TQVTATSAGSALEKHQI 320 (477) T ss_pred HHHHHHH-hccchhHHHHHHHHHHHHHHHHHHHHHhccCCC---C-----Cccceeeecccc-ccccccccccchhhHHH Confidence 7776542 333446788888999999999999999999843 1 145677664322 22333333333222 Q ss_pred ----HhhhhhhhhhcCCcee-EEecCHHHHHHHHHHhc-CcceEEeecCCCCc----------------ccceecCeeee Q lcl|NC_019448. 200 ----LNEAAVRIGKGFGTAT-DAYMPIGVHADFVNSIL-GRQMQLMQDNSGNV----------------NTGYSVNGFYS 257 (463) Q Consensus 200 ----ln~aa~~i~~~~G~~t-d~~m~~~vka~f~~~~~-~~qrv~~~~n~g~~----------------~~G~~v~~~~s 257 (463) |-.+...+..+|+... -.+|++.+.+.+...-= +.+..+++++++.. -.|++| +.+ T Consensus 321 ~~~~i~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~~~~~~~~~~~~~~l~G~pV--v~s 398 (477) T protein:vir:84 321 IYQKIADAIQRVHTSRFLEPEVIVMHPRRWASFHAIFAGDDRPLIVPSGPGFNNLGVLTEVASQRVVGQMHGLPV--VTD 398 (477) T ss_pred HHHHHHHHHhhccccccCCccEEEEcHHHHHHHHHhhccCCCeeeecCcccccccccccccccccccchhcccce--Eec Confidence 2233334445665444 47789999888875432 22333333322211 123322 111 Q ss_pred ccccc---c-cCC-ceecc--CccccccccccCCCCCCCCeeEEEEecc-----CCCcCc---ccccccc-eEEEEEEEe Q lcl|NC_019448. 258 SRGFI---K-LHG-STVME--NELILDESLQPLPNAPQPAKVTATVETK-----QKGAFE---DEEDRAG-LSYKVVVNS 321 (463) Q Consensus 258 ~~G~i---~-l~~-s~~~~--~d~~l~~~~~~~p~ap~p~~vtat~~~~-----~~g~~~---~~~~~a~-ysYkV~a~s 321 (463) ..=+. . -+. .+++. .+.++.+... ...+.+. ....|. .....+. |-=.++.+. T Consensus 399 ~~~p~~~~~~~d~~~i~~gd~~~~~i~~~~~-----------~~~~~~~~~~~~~~~~~~v~~~~~~~~~r~~~afv~~t 467 (477) T protein:vir:84 399 PTLPTTLGTGTDQDVIHVLRASDLALFESSV-----------RMRALQETRAENLSVLLQVYGYLAFTAARFPQSVVEIG 467 (477) T ss_pred CcccccccccCCcceEEEEEeceEEEEeece-----------eEEeccccccccceeeeeehhhhhhhhhccccceEEee Confidence 00000 0 000 01111 1111111100 0000000 000000 0000000 000111222 Q ss_pred cCCccccccc Q lcl|NC_019448. 322 DDAQSAPSEE 331 (463) Q Consensus 322 ~~geS~~S~~ 331 (463) ..+-.+|.-. T Consensus 468 ~~~~~~~~~~ 477 (477) T protein:vir:84 468 GTALTAPTFA 477 (477) T ss_pred cccccccccC Confidence 2222222111 No 101 >protein:vir:107687 Length: 319 # NCBI annotation: hypothetical protein # Family: family:all:463 # MgeID: mge:1518 # MgeName: T1 # Cross-refs: genbank:acc:YP_003898;genbank:gi:45686314;genbank:GeneID:2773027 Probab=84.99 E-value=0.056 Score=27.44 Aligned_cols=280 Identities=13% Similarity=0.112 Sum_probs=124.6 Q ss_pred hhhhh-HHHHH----HhhcCCccCCccccCccccch---hhhhhHhhhhhcccccc-chhhhcccchhhHHHhhhhhhhc Q lcl|NC_019448. 15 ADQFQ-EDVVK----SFQTGYGITPDTQIDAGALRR---EILDDQITMLTWTNEDL-IFYRDISRRPAQSTVVKYDQYLR 85 (463) Q Consensus 15 ~k~~~-e~~~K----s~~agy~~~p~~q~~gaalr~---esLd~~i~~L~~~~~df-~f~~~i~k~~~~stv~ey~~~~~ 85 (463) +|+.+ ++..+ ....-.+..-|++.+.|.+-. |-+|+.+....+..-.+ .|+.-...-.+--....|..+.. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~da~~~~g~~~~~ql~~id~~v~e~~~~~l~~~~~i~v~~~~~~~~~~~~~~~~~~ 80 (319) T protein:vir:10 1 MTTKKFDEADKSNVEMYLIQAGVKQDAAATMGIWTAQELHRIKSQSYEEDYPVGSALRVFPVTTELSPTDKTFEYMTFDK 80 (319) T ss_pred CCCcchhHHhhHHHHHHHhhccchhhhhhhhhhHHHHHHHHHHHHHHhhhhcceechhhcccccCCCCceEEEEeeeecc Confidence 33322 11111 111123355566666777766 44555554443333111 13322112222222233344443 Q ss_pred cCcccccccccc-cCcccccCcceEEEEEEEEEeechhhhhhhh--hhhcccccHHHHHHHHHHHHHHHHHHHHHhhccc Q lcl|NC_019448. 86 HGNVGHSRFVKE-IGVAPVSDPNIRQKTVSMKYVSDTKNMSIAS--GLVNNIADPSQILTEDAIAVVAKTIEWASFYGDA 162 (463) Q Consensus 86 hG~~g~~~fv~E-~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~--~lvn~~~Dp~~~~~~~ai~~~~~~~E~a~fyGd~ 162 (463) . |....++. .+..+..|.++.|++..+..++.++.++..- .....-.+..+..-..|.+.+.+.....+||||+ T Consensus 81 ~---G~a~~~~d~~~dip~v~~~~~~~~~~i~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aA~~~~~~~~n~i~f~G~~ 157 (319) T protein:vir:10 81 V---GTAQIIADYTDDLPLVDALGTSEFGKVFRLGNAYLISIDEIKAGQATGRPLSTRKASACQLAHDQLVNRLVFKGSA 157 (319) T ss_pred c---cceeeecCccccccceeccceeeEEEEEEEEeeeeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEeecc Confidence 3 44445554 4445788999999999999999999987632 2222334667778888889999999999999998 Q ss_pred ccCCCccccccccccceeeecCcceEeccC---CCCCH----HHHhhhhhhhh---hcCCceeEEecCHHHHHHHHHHhc Q lcl|NC_019448. 163 SLTSEVEGEGLEFDGLAKLIDKNNVINAKG---NQLTE----KHLNEAAVRIG---KGFGTATDAYMPIGVHADFVNSIL 232 (463) Q Consensus 163 ~l~~~~~~~gleFDGl~~lI~~~nviDarG---~~ls~----~~ln~aa~~i~---~~~G~~td~~m~~~vka~f~~~~~ 232 (463) .+. +.||.+-=+-...-...+ .--+. +.|+++-..+. +++-.|+.+.||+.....+..-+- T Consensus 158 ~~g---------~~GLlN~p~~~~~~~~~~~~~~t~t~~~i~~di~~~~~~l~~~s~g~~~p~~L~L~p~~~~~L~~~~~ 228 (319) T protein:vir:10 158 PHK---------IVSVFNHPNITKITSGKWIDVSTMKPETAEAELTQAIETIETITRGQHRATNILIPPSMRKVLAIRMP 228 (319) T ss_pred ccc---------ceeEEeCCCceeeecCCCCCccccCHHHHHHHHHHHHHHHHHhcCceeeceEEEecHHHHHhhhcccC Confidence 763 455544211000111111 11222 34666655443 456789999999999888753211 Q ss_pred Ccc----eEEeecCCCCcccceecCeeeecccccccCCceeccC--ccc---cccccccCC---CCCCCCeeEEE----- Q lcl|NC_019448. 233 GRQ----MQLMQDNSGNVNTGYSVNGFYSSRGFIKLHGSTVMEN--ELI---LDESLQPLP---NAPQPAKVTAT----- 295 (463) Q Consensus 233 ~~q----rv~~~~n~g~~~~G~~v~~~~s~~G~i~l~~s~~~~~--d~~---l~~~~~~~p---~ap~p~~vtat----- 295 (463) +.- ..|..++++ +. =..++.+....|.=. +-.++..+ |.+ +......+| .... -.+... T Consensus 229 ~~~~t~l~~lk~~~~~-l~-I~~~pel~~ag~~g~-~~~v~y~~~~~~~~~~v~~~~~~~~~e~~~l~-~~~~~~~r~~G 304 (319) T protein:vir:10 229 ETTMSYLDYFKSQNSG-IE-IDSIAELEDIDGAGT-KGVLVYEKNPMNMSIEIPEAFNMLPAQPKDLH-FKVPCTSKCTG 304 (319) T ss_pred CCCeeHHHHHHHhcCC-ce-EEEeeeecccCCCcc-eEEEEEecCCceEEEecCcceeeeeeeecCce-EEEeeeeeeEE Confidence 000 001111111 00 011223333222100 00122221 111 101111111 1110 011111 Q ss_pred EeccCCCcCcccccc Q lcl|NC_019448. 296 VETKQKGAFEDEEDR 310 (463) Q Consensus 296 ~~~~~~g~~~~~~~~ 310 (463) +.---..+..--++. T Consensus 305 v~i~~P~ai~~~dGI 319 (319) T protein:vir:10 305 LTIYRPMTIVLITGV 319 (319) T ss_pred EEEEccceeEeeecC Confidence 111111111101011 No 102 >protein:vir:1268 Length: 397 # NCBI annotation: hypothetical protein # Family: family:all:21 # MgeID: mge:329 # MgeName: phi-105 # Cross-refs: genbank:acc:NP_690760;genbank:gi:22855000;genbank:GeneID:955203 Probab=84.91 E-value=0.057 Score=27.41 Aligned_cols=292 Identities=15% Similarity=0.068 Sum_probs=125.7 Q ss_pred CCCCCccchHHHHhhhhhhHHHHHHhhcCC---------------ccCCccccCccccchhhhhhHhhhhhccccccchh Q lcl|NC_019448. 1 MTIEKNLSDVQQKYADQFQEDVVKSFQTGY---------------GITPDTQIDAGALRREILDDQITMLTWTNEDLIFY 65 (463) Q Consensus 1 ~~~~~~~~~~~~~~~k~~~e~~~Ks~~agy---------------~~~p~~q~~gaalr~esLd~~i~~L~~~~~df~f~ 65 (463) -..+........... ..+.|++..+. ..+..+-.+|+.|--+-+.+.|..+.. ..-.++ T Consensus 81 ~~~~~~~~~~~~~~~----~a~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~gg~lvP~~~~~~ii~~~~--~~~~l~ 154 (397) T protein:vir:12 81 RSQGQGNEERQQQYS----KAFLKGLRGKRLTDEERDLLDSPEFRAMSGINDEDGGILIPEDIGRQIHEFKR--QFEPLE 154 (397) T ss_pred ccccchhhHHHHHHH----HHHHHHHhccCCcHHHHHHHhhhhhhhccccccccCcccCchhHHHHHHHhhh--hhhhHH Confidence 000111111111111 11222222111 011112234555444444444533333 223355 Q ss_pred hhcccchhhHHHhhhhhhhccCcccccccccccCccc-ccCcceEEEEEEEEEeechhhhhhhhhhhcccccHHHHHHHH Q lcl|NC_019448. 66 RDISRRPAQSTVVKYDQYLRHGNVGHSRFVKEIGVAP-VSDPNIRQKTVSMKYVSDTKNMSIASGLVNNIADPSQILTED 144 (463) Q Consensus 66 ~~i~k~~~~stv~ey~~~~~hG~~g~~~fv~E~g~~~-~~d~~~~r~~~~~k~l~~~~~vs~~~~lvn~~~Dp~~~~~~~ 144 (463) +.+...++.+...+|......++. ...+++|++..+ .+++.+.......+-++.--.+|.-+- .++..|.+....+. T Consensus 155 ~~~~~~~~~~~~~~~~~~~~~~~~-~a~~v~Eg~~~~~~~~~~~~~v~~~~~k~~~~~~is~e~l-~ds~~~l~~~i~~~ 232 (397) T protein:vir:12 155 QYVTVEPVTTRSGTRLLEKNADMV-PFSPVEELGNLPEIDQPRFTKVSYSIIDYGGIMTLSNSML-NDSDQAIMTYVAKW 232 (397) T ss_pred hhcceeeccCCceeEEEEEecCCc-ceeeecccccccccccccceeEEeeheeeEeeehhhHHHH-hhchHHHHHHHHHH Confidence 555555555444444443333333 466999998865 678999999999988887766665432 33455778888899 Q ss_pred HHHHHHHHHHHHHhhcccccCCCccccccccccceeeecCcceEeccCCCCCHHHHhhhhhhhhhcCCceeEEecCHHHH Q lcl|NC_019448. 145 AIAVVAKTIEWASFYGDASLTSEVEGEGLEFDGLAKLIDKNNVINAKGNQLTEKHLNEAAVRIGKGFGTATDAYMPIGVH 224 (463) Q Consensus 145 ai~~~~~~~E~a~fyGd~~l~~~~~~~gleFDGl~~lI~~~nviDarG~~ls~~~ln~aa~~i~~~~G~~td~~m~~~vk 224 (463) ....+++.++.++++|+..-.|.+. +-+|++.+++ ...+...|......+|++.+. T Consensus 233 l~~~~~~~~d~~il~G~g~~~~~g~---~~~~~i~~~~---------------------~~~l~~~~~~~a~~~~n~~~~ 288 (397) T protein:vir:12 233 FAKKSVVTRNNLILAAIASLKKVDI---DGLDGIKKAL---------------------NVTLDPMVAPGSIVLTNQDGY 288 (397) T ss_pred HHHHHHHHHHHHHHhcccccccccc---ccHHHHHHHH---------------------hhccchhhhCCCEEEEcHHHH Confidence 9999999999999999987554321 2233322211 123334454445588999999 Q ss_pred HHHHHHhcCcceEEeecCCCC----cccceecCeeeeccccc--ccCC-ceeccC--ccccccccccCCCCCCCCeeEEE Q lcl|NC_019448. 225 ADFVNSILGRQMQLMQDNSGN----VNTGYSVNGFYSSRGFI--KLHG-STVMEN--ELILDESLQPLPNAPQPAKVTAT 295 (463) Q Consensus 225 a~f~~~~~~~qrv~~~~n~g~----~~~G~~v~~~~s~~G~i--~l~~-s~~~~~--d~~l~~~~~~~p~ap~p~~vtat 295 (463) +.|...--..-|.+.+++..+ .-.|++|- .+..... .... .+++.+ +.++... + . .++.. T Consensus 289 ~~L~~lkd~~G~~l~~~~~~~g~~~~l~G~pv~--~~~~~~~~~~~~~~~~~~gd~~~~~~~~~-----~--~--~~~i~ 357 (397) T protein:vir:12 289 DWLDTLKDGTGRYLLQPDPTNPTKKLLDGRPVV--PFTNRVLKTQKGKAPLIIGNLKEAIVLFD-----R--E--QQSIA 357 (397) T ss_pred HHHHHhhccCCceeecccccCCCCccccceeeE--EecccccccCCCccEEEEEehhceEEEEe-----e--c--ceEEE Confidence 888753222223333332211 12233321 0000000 0000 000000 0000000 0 0 00010 Q ss_pred EeccCCCcCcccccccceEEEEEEEecCCccccccceeeeecCCCCceEEEEEec Q lcl|NC_019448. 296 VETKQKGAFEDEEDRAGLSYKVVVNSDDAQSAPSEEVTATVSNVDDGVKLSISVN 350 (463) Q Consensus 296 ~~~~~~g~~~~~~~~a~ysYkV~a~s~~geS~~S~~vt~Tva~~~~gv~ltIt~~ 350 (463) ........| ......|++...- ..-+...+.-+.+++|.. T Consensus 358 ~~~~~~~~f----~~~~~~~r~~~r~-----------d~~~~~~~a~~~~~~t~~ 397 (397) T protein:vir:12 358 STDTGAGAF----ETNSTKVRGIERE-----------DVRKWDEDAVVFGQITVE 397 (397) T ss_pred Eeccccchh----hcCceEEEEEEee-----------ccEEecccceEEEEEeeC Confidence 000000000 0011122222111 222222333344444433 No 103 >protein:vir:100172 Length: 394 # NCBI annotation: putative major head protein # Family: family:all:21 # MgeID: mge:1524 # MgeName: phi AT3 # Cross-refs: genbank:acc:YP_025031;genbank:gi:48697264;genbank:GeneID:2948270 Probab=82.64 E-value=0.075 Score=26.75 Aligned_cols=298 Identities=12% Similarity=0.065 Sum_probs=117.0 Q ss_pred CCCCCccch-----HHH-------HhhhhhhHHHHHHhhc-CCccCCccccCccccchhhhhhHhhhhhccccccchhhh Q lcl|NC_019448. 1 MTIEKNLSD-----VQQ-------KYADQFQEDVVKSFQT-GYGITPDTQIDAGALRREILDDQITMLTWTNEDLIFYRD 67 (463) Q Consensus 1 ~~~~~~~~~-----~~~-------~~~k~~~e~~~Ks~~a-gy~~~p~~q~~gaalr~esLd~~i~~L~~~~~df~f~~~ 67 (463) .+.+..... ..+ ...+.|.+.+..-... --.....+..+|+.|--+.+..+|..+.... -.+.+. T Consensus 67 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~t~~~gg~~vP~~~~~~ii~~~~~~--~~l~~~ 144 (394) T protein:vir:10 67 NSDPDKPVDNAQPNGTDLKKKPIDAKKKAINDFIHSHGKVIDNAAGHVTSTEAGVLIPEEIIYDPTAEVNSV--VDLSTL 144 (394) T ss_pred hcchhhhhhhhcccccchhhhHHHHHHHHHHHHHhccchhhhhhhcccccccCceeccHHHHHHHHHHHHhh--hhhhhh Confidence 111111100 000 0001111111100000 0000011223344443344444443232222 234444 Q ss_pred cccchhhHHHhhhhhhhccCcccccccccccCccc-ccCcceEEEEEEEEEeechhhhhhhhhhhcccccHHHHHHHHHH Q lcl|NC_019448. 68 ISRRPAQSTVVKYDQYLRHGNVGHSRFVKEIGVAP-VSDPNIRQKTVSMKYVSDTKNMSIASGLVNNIADPSQILTEDAI 146 (463) Q Consensus 68 i~k~~~~stv~ey~~~~~hG~~g~~~fv~E~g~~~-~~d~~~~r~~~~~k~l~~~~~vs~~~~lvn~~~Dp~~~~~~~ai 146 (463) +...++.+.--+|.... .+.+...+++|++..+ .+++.+.+....++=++.-..+|.-+ +.++..|.+....+.-. T Consensus 145 ~~~~~~~~~~~~~~~~~--~~~~~~~~~~E~~~~~~~~~~~~~~v~l~~~k~~~~~~iS~el-l~ds~~~l~~~i~~~la 221 (394) T protein:vir:10 145 VTKTPVTTPKGTYPILK--RATDRFSSVAELAENPALAEPEFEQVDWSVSTYRGAIPLSEEA-IADSAVDLTSLVGQSIN 221 (394) T ss_pred ceeeeccCCceEEEEEe--cCCCccccccccccccccccccceeEEeeeeeeEeeehhHHHH-HhhhhHHHHHHHHHHHH Confidence 55555544433444333 2334567899987766 68899999999999888776666643 33445577888888888 Q ss_pred HHHHHHHHHHHhhcccccCCCccccccccccceeeecCcceEeccCCCCCHHHHhhh-hhhhhhcCCceeEEecCHHHHH Q lcl|NC_019448. 147 AVVAKTIEWASFYGDASLTSEVEGEGLEFDGLAKLIDKNNVINAKGNQLTEKHLNEA-AVRIGKGFGTATDAYMPIGVHA 225 (463) Q Consensus 147 ~~~~~~~E~a~fyGd~~l~~~~~~~gleFDGl~~lI~~~nviDarG~~ls~~~ln~a-a~~i~~~~G~~td~~m~~~vka 225 (463) ..++.+++.++..|+..-.+...... .+.+.|.++ ...+-.+|. .-++||+.+.+ T Consensus 222 ~~~~~~~~~~il~g~g~~~~~~~~~~----------------------~~~d~l~~~~~~~~~~~~~--a~~vmn~~~~~ 277 (394) T protein:vir:10 222 EKSVNTYNAMIAPVLQSFTAKATTTD----------------------TLVDSLKHILNVDLDPAYS--RALVVTQSLFN 277 (394) T ss_pred HHHHHHHHHHHhhccccccccccccc----------------------ccHHHHHHHHHhhhhhhcc--CEEEecHHHHH Confidence 89999999999999876443211111 122223322 222323342 34899999988 Q ss_pred HHHHHhcCcceEEeecCCCCc--------ccceecCeeeeccc-ccccCCc--eecc--CccccccccccCCCCCCCCee Q lcl|NC_019448. 226 DFVNSILGRQMQLMQDNSGNV--------NTGYSVNGFYSSRG-FIKLHGS--TVME--NELILDESLQPLPNAPQPAKV 292 (463) Q Consensus 226 ~f~~~~~~~qrv~~~~n~g~~--------~~G~~v~~~~s~~G-~i~l~~s--~~~~--~d~~l~~~~~~~p~ap~p~~v 292 (463) .|...--..-|.+.+++..+. -.|++|- ++... .-...|+ +++. .+.++...+ ..+ T Consensus 278 ~l~~lkd~~G~~i~~~~~~~~~~~~~~~~L~G~PV~--~~~~~~~~~~~~~~~i~~gd~s~~~~~~~~---------~~~ 346 (394) T protein:vir:10 278 TLDTLKDKNGRYLLHDASDSITDGTAKGTVLGVPVY--VVGDALLGSAAGDQKAFVGDLKRGVLFADR---------QQV 346 (394) T ss_pred HHHHhhccCCCeeeeccccccccCCcccccccceeE--EecccccCCCCCceEEEEeeccccEEEEee---------cce Confidence 887533222344444433221 2344431 10000 0000000 1110 000000000 000 Q ss_pred EEEEeccCCCcCcccccccceEEEEEEEecCCccccccceeeeecC--CCCce Q lcl|NC_019448. 293 TATVETKQKGAFEDEEDRAGLSYKVVVNSDDAQSAPSEEVTATVSN--VDDGV 343 (463) Q Consensus 293 tat~~~~~~g~~~~~~~~a~ysYkV~a~s~~geS~~S~~vt~Tva~--~~~gv 343 (463) +......... ...-..+ +++=..-.+-++...-.++.+++. ..+|. T Consensus 347 ~v~~~~~~~~----~~~~~~~-~r~d~~~~~~~ai~~~~~~~~~~~~~~~~~~ 394 (394) T protein:vir:10 347 TLAWEDSKIY----GRYLGAA-FRFGVKQADSNAGYFVTNTDAASGSTSGTGK 394 (394) T ss_pred EEEEeccccc----ceeEEEE-EEeccEEeccccEEEEEeecccCCCCCCCCC Confidence 0000000000 0000000 011111111221111111111111 11222 No 104 >protein:vir:104256 Length: 458 # NCBI annotation: major head protein precursor # Family: family:all:27070 # MgeID: mge:1504 # MgeName: T5 # Cross-refs: genbank:acc:YP_006977;genbank:gi:46401878;genbank:GeneID:2777673 Probab=82.54 E-value=0.076 Score=26.72 Aligned_cols=291 Identities=14% Similarity=0.095 Sum_probs=127.0 Q ss_pred CCCCCccchH--HHHhhhhhh-----HH-----HHHHhhcCCccCCccccCccccchhhhhhHhhhhhccccccchhhhc Q lcl|NC_019448. 1 MTIEKNLSDV--QQKYADQFQ-----ED-----VVKSFQTGYGITPDTQIDAGALRREILDDQITMLTWTNEDLIFYRDI 68 (463) Q Consensus 1 ~~~~~~~~~~--~~~~~k~~~-----e~-----~~Ks~~agy~~~p~~q~~gaalr~esLd~~i~~L~~~~~df~f~~~i 68 (463) -....+.... +.++.+.+. ++ -.|+..+++ +..+|+.+--+.+.++|..+... .-.+.+-. T Consensus 124 ~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~-----~~~~g~~~ip~~~~~~ii~~~~~--~~~l~~~~ 196 (458) T protein:vir:10 124 YGTQENFEDEVEKLVLLSYVMEKGVFETEHGQRHLKAVNQSS-----SVEVSSESYETIFSQRIIRDLQK--ELVVGALF 196 (458) T ss_pred hhhhhhHHHHHHHHHHHHHHHhhccchhhhhhhhhhhhhhcc-----cCccccceehhhHhHHHHHHHHh--hhhHHhhc Confidence 0000000000 001111000 00 011111111 12234544444555555333322 22345555 Q ss_pred ccchhhHHHhhhhhhhccCcccccccccccCcc------cccCcceEEEEEEEEEeechhhhhhhhhhhcccccHHHHHH Q lcl|NC_019448. 69 SRRPAQSTVVKYDQYLRHGNVGHSRFVKEIGVA------PVSDPNIRQKTVSMKYVSDTKNMSIASGLVNNIADPSQILT 142 (463) Q Consensus 69 ~k~~~~stv~ey~~~~~hG~~g~~~fv~E~g~~------~~~d~~~~r~~~~~k~l~~~~~vs~~~~lvn~~~Dp~~~~~ 142 (463) ...++.+....|.+.... +...+++|++.. +.+++.+.+.....+=++.--.+|.-+ +.++..|.+.... T Consensus 197 ~~~~~~~~~~~~~~~~~~---~~a~~v~e~~~~~~~~~~~~~~~~~~~i~~~~~k~~~~v~is~el-l~ds~~~~~~~i~ 272 (458) T protein:vir:10 197 EELPMSSKILTMLVEPDA---GKATWVAASTYGTDTTTGEEVKGALKEIHFSTYKLAAKSFITDET-EEDAIFSLLPLLR 272 (458) T ss_pred ceeecCCcceEEEEecCC---cceeecccccccccccccccccccceeeEeeeeeEEeeehhhHHH-HhcchHHHHHHHH Confidence 555666655566655533 345666766554 456788988888877677666666653 3445567888899 Q ss_pred HHHHHHHHHHHHHHHhhcccccCCCccccccccccceeeecC--cceE-ec---cCCCCCHHHHhhhhhhhhhcCCceeE Q lcl|NC_019448. 143 EDAIAVVAKTIEWASFYGDASLTSEVEGEGLEFDGLAKLIDK--NNVI-NA---KGNQLTEKHLNEAAVRIGKGFGTATD 216 (463) Q Consensus 143 ~~ai~~~~~~~E~a~fyGd~~l~~~~~~~gleFDGl~~lI~~--~nvi-Da---rG~~ls~~~ln~aa~~i~~~~G~~td 216 (463) +.....++..++.++|+||-+ + +-.|+.+-..- .+++ +. ....++.+.|-++-..+..+|..... T Consensus 273 ~~l~~~i~~~~d~~~l~G~G~---~------~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~l~~~~~~~~~ 343 (458) T protein:vir:10 273 KRLIEAHAVSIEEAFMTGDGS---G------KPKGLLTLASEDSAKVVTEAKADGSVLVTAKTISKLRRKLGRHGLKLSK 343 (458) T ss_pred HHHHHHHHHHHHHHhhcCCCC---C------ccceeeecccccccceeecccccccccccHHHHHHHHHhhhhhhcCCCE Confidence 999999999999999999853 1 45566664321 1111 11 12345566666666677778877778 Q ss_pred EecCHHHHHHHHHHhcCcceEEeecCCC-CcccceecCeeeecccccccCCceeccCccccccccccCCCCCCCCeeEEE Q lcl|NC_019448. 217 AYMPIGVHADFVNSILGRQMQLMQDNSG-NVNTGYSVNGFYSSRGFIKLHGSTVMENELILDESLQPLPNAPQPAKVTAT 295 (463) Q Consensus 217 ~~m~~~vka~f~~~~~~~qrv~~~~n~g-~~~~G~~v~~~~s~~G~i~l~~s~~~~~d~~l~~~~~~~p~ap~p~~vtat 295 (463) .+||+...+.|...--..-|.+.++... ....|. +.+++..+...... .|. -+ T Consensus 344 ~v~~~~~~~~l~~lkd~~G~~i~~~~~~~~~~~~~---------------~~~l~G~pv~~~~~---~p~---~~----- 397 (458) T protein:vir:10 344 LVLIVSMDAYYDLLEDEEWQDVAQVGNDSVKLQGQ---------------VGRIYGLPVVVSEY---FPA---KA----- 397 (458) T ss_pred EEEcHHHHHHHHhhcccCCceeeccccccccccCc---------------CceecceeeEEccc---ccc---cc----- Confidence 9999999888874221112223222111 110000 01111111111111 000 00 Q ss_pred EeccCCCcCccccc-------------------ccceEEEEEEEecCCccccccceeeeecCC Q lcl|NC_019448. 296 VETKQKGAFEDEED-------------------RAGLSYKVVVNSDDAQSAPSEEVTATVSNV 339 (463) Q Consensus 296 ~~~~~~g~~~~~~~-------------------~a~ysYkV~a~s~~geS~~S~~vt~Tva~~ 339 (463) ..+.-.+++..+ ..-..|+...--+-.--.|+..|..|.++- T Consensus 398 --~~~~~~~~~f~~~~~~~~~~~~~v~~d~~~~~~~~~~~~~~r~~~~v~~~~a~v~~~~aa~ 458 (458) T protein:vir:10 398 --NSAEFAVIVYKDNFVMPRQRAVTVERERQAGKQRDAYYVTQRVNLQRYFANGVVSGTYAAS 458 (458) T ss_pred --CCcceEEEEecccEEEEEeeceEEEeecccCCCceEEEEEEEecceEecccceEEEeeccC Confidence 000000110000 000111111111111112222222222211 No 105 >protein:vir:9820 Length: 272 # NCBI annotation: putative major capsid/head protein # Family: family:all:522 # MgeID: mge:176 # MgeName: 315.4 # Cross-refs: genbank:acc:NP_795582;genbank:gi:28876339;genbank:GeneID:1257858 Probab=82.20 E-value=0.079 Score=26.63 Aligned_cols=260 Identities=13% Similarity=0.135 Sum_probs=123.2 Q ss_pred CCCCCccchHHHHhhhhhhHHHHHHhhcCCccCCccccCccccchhhhhhHhhhhhccccccchh------hhcccchhh Q lcl|NC_019448. 1 MTIEKNLSDVQQKYADQFQEDVVKSFQTGYGITPDTQIDAGALRREILDDQITMLTWTNEDLIFY------RDISRRPAQ 74 (463) Q Consensus 1 ~~~~~~~~~~~~~~~k~~~e~~~Ks~~agy~~~p~~q~~gaalr~esLd~~i~~L~~~~~df~f~------~~i~k~~~~ 74 (463) |. ...|. .+..|--|.+.+.+.... .+-..|- ..+..++ - T Consensus 1 MA--~~~T~-----------------------------~~~~~iPev~s~~v~~~~--~~~~~~~~~~~~~~~~~g~~-G 46 (272) T protein:vir:98 1 MA--VGTTK-----------------------------MAQMLDPEVLADMIDAEV--GKAIRFAPLAEVDTTLEGQP-G 46 (272) T ss_pred CC--Ccccc-----------------------------chheechHHHHHHHHHHH--HHHhhhhccccccccccCCC-C Confidence 11 11111 122233333333321110 0000000 0011100 1 Q ss_pred HHHhhhhhhhccCcccccccccccCcccccCcceEEEEEEEEEeechhhhhhhhhhhcccccHHHHHHHHHHHHHHHHHH Q lcl|NC_019448. 75 STVVKYDQYLRHGNVGHSRFVKEIGVAPVSDPNIRQKTVSMKYVSDTKNMSIASGLVNNIADPSQILTEDAIAVVAKTIE 154 (463) Q Consensus 75 stv~ey~~~~~hG~~g~~~fv~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lvn~~~Dp~~~~~~~ai~~~~~~~E 154 (463) .|| + +-.++..|...++.|++..+.++.+......+++-++....+|..+.. .+..|++....+.....+++.++ T Consensus 47 ~tv-~---iP~~~~~~~a~~v~eg~~i~~~~~~~~~~~~~~~~~~~~~~itd~~~~-~s~~d~~~~~~~~~~~~~a~~~d 121 (272) T protein:vir:98 47 TTL-T---VPKWDYIGDAEDVAEGEAIPMTQLGFKKTTMTIKKAGKGVEITDEAIL-SGYGDPVGQAAKQIVEAIDHKVD 121 (272) T ss_pred CEE-E---EEEecCCCCcccccCCCcccccccccceEEEEeeeeeeeeeecHHHHh-hccccHHHHHHHHHHHHHHHHHH Confidence 111 1 112334466778999999999999999999999999998899988754 45679999999999999999999 Q ss_pred HHHhhcccccCCCccccccccccceeeecCcceEeccCCCCCHHHHhhhhhhhhhcCCceeEEecCHHHHHHHHHHhcCc Q lcl|NC_019448. 155 WASFYGDASLTSEVEGEGLEFDGLAKLIDKNNVINAKGNQLTEKHLNEAAVRIGKGFGTATDAYMPIGVHADFVNSILGR 234 (463) Q Consensus 155 ~a~fyGd~~l~~~~~~~gleFDGl~~lI~~~nviDarG~~ls~~~ln~aa~~i~~~~G~~td~~m~~~vka~f~~~~~~~ 234 (463) -.+|= .+.. ..+.++ ...+.+.|..|...+...+...+-++||+.+.+.|...-+.. T Consensus 122 ~~i~~---~~~~-----------------a~~~~~---~~~t~d~i~da~~~l~~~~~~~~~~vv~p~~~~~L~k~~~~~ 178 (272) T protein:vir:98 122 ADVLD---ALSK-----------------STQTVE---ATATVDGVSKALDIFNDEDDAETVIVMNPADASTLRLDAAKE 178 (272) T ss_pred HHHHH---Hhcc-----------------cccccc---cccCHHHHHHHHHHHhccCCCccEEEEcHHHHHHHHHhcccc Confidence 87761 1110 011111 234567788888888888888889999999998885432221 Q ss_pred ceEEeecCCCCc--ccceecCeeeecccccccCCceeccCccccccccc-cCCCCCCC---CeeEEEEeccCCCcCcccc Q lcl|NC_019448. 235 QMQLMQDNSGNV--NTGYSVNGFYSSRGFIKLHGSTVMENELILDESLQ-PLPNAPQP---AKVTATVETKQKGAFEDEE 308 (463) Q Consensus 235 qrv~~~~n~g~~--~~G~~v~~~~s~~G~i~l~~s~~~~~d~~l~~~~~-~~p~ap~p---~~vtat~~~~~~g~~~~~~ 308 (463) +...+..+.. ..|. +. .+.|-.+...+..=..+.+ ..+.+..- ..++.+..-+.. .+... T Consensus 179 --~~~~~~~~~~~~~~g~-ig---------~i~G~~Vi~s~~~p~~t~~~~~~~a~~~~~~~~~~ve~~r~~~--~~~~~ 244 (272) T protein:vir:98 179 --WLGATEVGANRVVSGV-YG---------EVLGVQIVRSRKCPKGTAYMVRKGALRIMLKRNTMVETDRDIT--KAINQ 244 (272) T ss_pred --cccccccccccccccc-ch---------hhcCeeEEEcCCCCcceEEEEcCCeEEEEecCCceeeeccccc--cceeE Confidence 1111111110 0010 01 1222222222221100000 11111100 011222211111 11111 Q ss_pred cccceEEEEEEEecCCccccccceeeeecCCCCc Q lcl|NC_019448. 309 DRAGLSYKVVVNSDDAQSAPSEEVTATVSNVDDG 342 (463) Q Consensus 309 ~~a~ysYkV~a~s~~geS~~S~~vt~Tva~~~~g 342 (463) -...+.|-+-+.+ |+.+|-.|+....+. T Consensus 245 i~~~~~~~~~v~~------~~~vv~~t~~~a~~~ 272 (272) T protein:vir:98 245 IVANKHYGVYLYK------AEKAVKITLKDAAKK 272 (272) T ss_pred EEEEEEEEEEEEc------CCceEEEEecccccC Confidence 1112223222222 445556666555555 No 106 >protein:vir:3033 Length: 272 # NCBI annotation: major capsid protein # Family: family:all:522 # MgeID: mge:61 # MgeName: PhiNIH1.1 # Cross-refs: genbank:acc:NP_438146;genbank:gi:16271809;genbank:GeneID:929235 Probab=82.20 E-value=0.079 Score=26.63 Aligned_cols=260 Identities=13% Similarity=0.135 Sum_probs=123.2 Q ss_pred CCCCCccchHHHHhhhhhhHHHHHHhhcCCccCCccccCccccchhhhhhHhhhhhccccccchh------hhcccchhh Q lcl|NC_019448. 1 MTIEKNLSDVQQKYADQFQEDVVKSFQTGYGITPDTQIDAGALRREILDDQITMLTWTNEDLIFY------RDISRRPAQ 74 (463) Q Consensus 1 ~~~~~~~~~~~~~~~k~~~e~~~Ks~~agy~~~p~~q~~gaalr~esLd~~i~~L~~~~~df~f~------~~i~k~~~~ 74 (463) |. ...|. .+..|--|.+.+.+.... .+-..|- ..+..++ - T Consensus 1 MA--~~~T~-----------------------------~~~~~iPev~s~~v~~~~--~~~~~~~~~~~~~~~~~g~~-G 46 (272) T protein:vir:30 1 MA--VGTTK-----------------------------MAQMLDPEVLADMIDAEV--GKAIRFAPLAEVDTTLEGQP-G 46 (272) T ss_pred CC--Ccccc-----------------------------chheechHHHHHHHHHHH--HHHhhhhccccccccccCCC-C Confidence 11 11111 122233333333321110 0000000 0011100 1 Q ss_pred HHHhhhhhhhccCcccccccccccCcccccCcceEEEEEEEEEeechhhhhhhhhhhcccccHHHHHHHHHHHHHHHHHH Q lcl|NC_019448. 75 STVVKYDQYLRHGNVGHSRFVKEIGVAPVSDPNIRQKTVSMKYVSDTKNMSIASGLVNNIADPSQILTEDAIAVVAKTIE 154 (463) Q Consensus 75 stv~ey~~~~~hG~~g~~~fv~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lvn~~~Dp~~~~~~~ai~~~~~~~E 154 (463) .|| + +-.++..|...++.|++..+.++.+......+++-++....+|..+.. .+..|++....+.....+++.++ T Consensus 47 ~tv-~---iP~~~~~~~a~~v~eg~~i~~~~~~~~~~~~~~~~~~~~~~itd~~~~-~s~~d~~~~~~~~~~~~~a~~~d 121 (272) T protein:vir:30 47 TTL-T---VPKWDYIGDAEDVAEGEAIPMTQLGFKKTTMTIKKAGKGVEITDEAIL-SGYGDPVGQAAKQIVEAIDHKVD 121 (272) T ss_pred CEE-E---EEEecCCCCcccccCCCcccccccccceEEEEeeeeeeeeeecHHHHh-hccccHHHHHHHHHHHHHHHHHH Confidence 111 1 112334466778999999999999999999999999998899988754 45679999999999999999999 Q ss_pred HHHhhcccccCCCccccccccccceeeecCcceEeccCCCCCHHHHhhhhhhhhhcCCceeEEecCHHHHHHHHHHhcCc Q lcl|NC_019448. 155 WASFYGDASLTSEVEGEGLEFDGLAKLIDKNNVINAKGNQLTEKHLNEAAVRIGKGFGTATDAYMPIGVHADFVNSILGR 234 (463) Q Consensus 155 ~a~fyGd~~l~~~~~~~gleFDGl~~lI~~~nviDarG~~ls~~~ln~aa~~i~~~~G~~td~~m~~~vka~f~~~~~~~ 234 (463) -.+|= .+.. ..+.++ ...+.+.|..|...+...+...+-++||+.+.+.|...-+.. T Consensus 122 ~~i~~---~~~~-----------------a~~~~~---~~~t~d~i~da~~~l~~~~~~~~~~vv~p~~~~~L~k~~~~~ 178 (272) T protein:vir:30 122 ADVLD---ALSK-----------------STQTVE---ATATVDGVSKALDIFNDEDDAETVIVMNPADASTLRLDAAKE 178 (272) T ss_pred HHHHH---Hhcc-----------------cccccc---cccCHHHHHHHHHHHhccCCCccEEEEcHHHHHHHHHhcccc Confidence 87761 1110 011111 234567788888888888888889999999998885432221 Q ss_pred ceEEeecCCCCc--ccceecCeeeecccccccCCceeccCccccccccc-cCCCCCCC---CeeEEEEeccCCCcCcccc Q lcl|NC_019448. 235 QMQLMQDNSGNV--NTGYSVNGFYSSRGFIKLHGSTVMENELILDESLQ-PLPNAPQP---AKVTATVETKQKGAFEDEE 308 (463) Q Consensus 235 qrv~~~~n~g~~--~~G~~v~~~~s~~G~i~l~~s~~~~~d~~l~~~~~-~~p~ap~p---~~vtat~~~~~~g~~~~~~ 308 (463) +...+..+.. ..|. +. .+.|-.+...+..=..+.+ ..+.+..- ..++.+..-+.. .+... T Consensus 179 --~~~~~~~~~~~~~~g~-ig---------~i~G~~Vi~s~~~p~~t~~~~~~~a~~~~~~~~~~ve~~r~~~--~~~~~ 244 (272) T protein:vir:30 179 --WLGATEVGANRVVSGV-YG---------EVLGVQIVRSRKCPKGTAYMVRKGALRIMLKRNTMVETDRDIT--KAINQ 244 (272) T ss_pred --cccccccccccccccc-ch---------hhcCeeEEEcCCCCcceEEEEcCCeEEEEecCCceeeeccccc--cceeE Confidence 1111111110 0010 01 1222222222221100000 11111100 011222211111 11111 Q ss_pred cccceEEEEEEEecCCccccccceeeeecCCCCc Q lcl|NC_019448. 309 DRAGLSYKVVVNSDDAQSAPSEEVTATVSNVDDG 342 (463) Q Consensus 309 ~~a~ysYkV~a~s~~geS~~S~~vt~Tva~~~~g 342 (463) -...+.|-+-+.+ |+.+|-.|+....+. T Consensus 245 i~~~~~~~~~v~~------~~~vv~~t~~~a~~~ 272 (272) T protein:vir:30 245 IVANKHYGVYLYK------AEKAVKITLKDAAKK 272 (272) T ss_pred EEEEEEEEEEEEc------CCceEEEEecccccC Confidence 1112223222222 445556666555555 No 107 >protein:vir:102873 Length: 392 # NCBI annotation: major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1492 # MgeName: Cherry # Cross-refs: genbank:acc:YP_338137;genbank:gi:77020198;genbank:GeneID:3703782 Probab=82.13 E-value=0.08 Score=26.61 Aligned_cols=296 Identities=12% Similarity=0.054 Sum_probs=129.2 Q ss_pred CCC---CCccchHHHHhhhhhhHH----HHHHhh----cCCccCCccccCccccchhhhhhHhhhhhccccccchhhhcc Q lcl|NC_019448. 1 MTI---EKNLSDVQQKYADQFQED----VVKSFQ----TGYGITPDTQIDAGALRREILDDQITMLTWTNEDLIFYRDIS 69 (463) Q Consensus 1 ~~~---~~~~~~~~~~~~k~~~e~----~~Ks~~----agy~~~p~~q~~gaalr~esLd~~i~~L~~~~~df~f~~~i~ 69 (463) ..- .........++.+.+... -.+.+. .....+..+-.+|+.|--+.+..+|..+.. +.-.+++.+. T Consensus 64 ~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~t~~~gg~~vP~~~~~~ii~~~~--~~s~l~~~~~ 141 (392) T protein:vir:10 64 EVETRNVDGEMEYRDVFMKALRNKPLNAEEREFLEDDLEQRAMSGLTGEDGGLVIPQDIQTQINELAR--SFDALEQYVT 141 (392) T ss_pred cccccCccchHHHHHHHHHHHhcccccHHHHHHHhhhhhhhhccccccCCCceecchhHHHHHHHHHH--hhhhhhhhce Confidence 000 001111111222222100 001111 011111112234554433344444422222 2334555555 Q ss_pred cchhhHHHhhhhhhhccCcccccccccccCccc-ccCcceEEEEEEEEEeechhhhhhhhhhhcccccHHHHHHHHHHHH Q lcl|NC_019448. 70 RRPAQSTVVKYDQYLRHGNVGHSRFVKEIGVAP-VSDPNIRQKTVSMKYVSDTKNMSIASGLVNNIADPSQILTEDAIAV 148 (463) Q Consensus 70 k~~~~stv~ey~~~~~hG~~g~~~fv~E~g~~~-~~d~~~~r~~~~~k~l~~~~~vs~~~~lvn~~~Dp~~~~~~~ai~~ 148 (463) ..++.+.-.+|...... +.....+++|++... ...+.+.......+-++.-..+|.-+ +.++.-|.+....+.--.. T Consensus 142 ~~~~~~~~~~~~~~~~~-~~~~a~~v~E~~~~~~~~~~~~~~v~l~~~k~~~~~~iS~el-l~ds~~~l~~~i~~~l~~~ 219 (392) T protein:vir:10 142 VEPVRTRSGSRVLEKNS-DMIPFAEITEMGEIPETDNPKFSNVQYAVKDRAGILPLSRSL-LQDSDQNILKYVTKWLGKK 219 (392) T ss_pred eeeccCCceeEEEEeec-CCccceeecccccccccccccceeEEeeeeeEEEeehhhHHH-HhhhHHHHHHHHHHHHHHH Confidence 55665544444333322 223566899999876 55699999999999998888887753 2344557788888999999 Q ss_pred HHHHHHHHHhhcccccCCCccccccccccceeeecCcceEeccCCCCCHHHHhhhh-hhhhhcCCceeEEecCHHHHHHH Q lcl|NC_019448. 149 VAKTIEWASFYGDASLTSEVEGEGLEFDGLAKLIDKNNVINAKGNQLTEKHLNEAA-VRIGKGFGTATDAYMPIGVHADF 227 (463) Q Consensus 149 ~~~~~E~a~fyGd~~l~~~~~~~gleFDGl~~lI~~~nviDarG~~ls~~~ln~aa-~~i~~~~G~~td~~m~~~vka~f 227 (463) +++.++.+++.|+....+.. .+-+ +.|..+- ..+...|-.....+||+.+.+.+ T Consensus 220 i~~~~d~~~~~g~g~~~~~~---~~~~----------------------d~i~~~~~~~l~~~~~~~a~~vm~~~~~~~L 274 (392) T protein:vir:10 220 SKVTRNVLILGVIEKLTKQA---IKSL----------------------DDIKDVLNVKLDPAISPNAILLTNQDGFNYL 274 (392) T ss_pred HHHHHHHHHhhccccccccC---ccCH----------------------HHHHHHHHHhhhhhhccCCEEEEcHHHHHHH Confidence 99999999999998765421 1222 2222221 23344454445589999999999 Q ss_pred HHHhcCcceEEeecCCCCcccceecCeeeecccccccCCceeccCccccccccccCCC--CCCCC--------------- Q lcl|NC_019448. 228 VNSILGRQMQLMQDNSGNVNTGYSVNGFYSSRGFIKLHGSTVMENELILDESLQPLPN--APQPA--------------- 290 (463) Q Consensus 228 ~~~~~~~qrv~~~~n~g~~~~G~~v~~~~s~~G~i~l~~s~~~~~d~~l~~~~~~~p~--ap~p~--------------- 290 (463) ...=-..-|.+.+++..+... . .+.|. +.++..+....++ ..... T Consensus 275 ~~lkd~~G~~l~~~~~~~~~~----~---------tllG~-----~~v~~~~~~~~~~~~~~~~~~~~~~gdfs~~~~i~ 336 (392) T protein:vir:10 275 DKLKDKDGKYILQSDPTQKNK----K---------LFAGT-----NPVVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLF 336 (392) T ss_pred HHhhccCCCeEeecCccCCcc----c---------cccCc-----ccEEEecccccCCCcccCCceEEEEEehhceEEEE Confidence 743211223344333221110 0 11111 0000000000000 00000 Q ss_pred ---eeEEEEeccCCCcCcccccccceEEEEEEEecCCccccccceeeeecCCCCceEEEEEecCCCCCCcce Q lcl|NC_019448. 291 ---KVTATVETKQKGAFEDEEDRAGLSYKVVVNSDDAQSAPSEEVTATVSNVDDGVKLSISVNAMYQQQPQF 359 (463) Q Consensus 291 ---~vtat~~~~~~g~~~~~~~~a~ysYkV~a~s~~geS~~S~~vt~Tva~~~~gv~ltIt~~a~~g~~~~~ 359 (463) .++....+.....|. .....|++. .+ +..-+.....=+.++++..+ +..+|+. T Consensus 337 ~~~~~~~~~~~~~~~~f~----~~~~~~r~~--~r---------~d~~v~~~~a~~~l~~~~~a-~~~~~~~ 392 (392) T protein:vir:10 337 KREDMELASTDVGGKAFT----RNTLDLRAI--QR---------DDVQMWDNEAAVYGEIDLSA-PVEQPQG 392 (392) T ss_pred eecceEEEEeccccchhh----cCceEEEEE--Ee---------eccEEecccceEEEEecccc-cccCCCC Confidence 000000000000000 000111111 11 12233333344566666433 3555544 No 108 >protein:vir:105004 Length: 392 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:1490 # MgeName: W Beta # Cross-refs: genbank:acc:YP_459969;genbank:gi:85701384;genbank:GeneID:3882145 Probab=82.13 E-value=0.08 Score=26.61 Aligned_cols=296 Identities=12% Similarity=0.054 Sum_probs=129.2 Q ss_pred CCC---CCccchHHHHhhhhhhHH----HHHHhh----cCCccCCccccCccccchhhhhhHhhhhhccccccchhhhcc Q lcl|NC_019448. 1 MTI---EKNLSDVQQKYADQFQED----VVKSFQ----TGYGITPDTQIDAGALRREILDDQITMLTWTNEDLIFYRDIS 69 (463) Q Consensus 1 ~~~---~~~~~~~~~~~~k~~~e~----~~Ks~~----agy~~~p~~q~~gaalr~esLd~~i~~L~~~~~df~f~~~i~ 69 (463) ..- .........++.+.+... -.+.+. .....+..+-.+|+.|--+.+..+|..+.. +.-.+++.+. T Consensus 64 ~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~t~~~gg~~vP~~~~~~ii~~~~--~~s~l~~~~~ 141 (392) T protein:vir:10 64 EVETRNVDGEMEYRDVFMKALRNKPLNAEEREFLEDDLEQRAMSGLTGEDGGLVIPQDIQTQINELAR--SFDALEQYVT 141 (392) T ss_pred cccccCccchHHHHHHHHHHHhcccccHHHHHHHhhhhhhhhccccccCCCceecchhHHHHHHHHHH--hhhhhhhhce Confidence 000 001111111222222100 001111 011111112234554433344444422222 2334555555 Q ss_pred cchhhHHHhhhhhhhccCcccccccccccCccc-ccCcceEEEEEEEEEeechhhhhhhhhhhcccccHHHHHHHHHHHH Q lcl|NC_019448. 70 RRPAQSTVVKYDQYLRHGNVGHSRFVKEIGVAP-VSDPNIRQKTVSMKYVSDTKNMSIASGLVNNIADPSQILTEDAIAV 148 (463) Q Consensus 70 k~~~~stv~ey~~~~~hG~~g~~~fv~E~g~~~-~~d~~~~r~~~~~k~l~~~~~vs~~~~lvn~~~Dp~~~~~~~ai~~ 148 (463) ..++.+.-.+|...... +.....+++|++... ...+.+.......+-++.-..+|.-+ +.++.-|.+....+.--.. T Consensus 142 ~~~~~~~~~~~~~~~~~-~~~~a~~v~E~~~~~~~~~~~~~~v~l~~~k~~~~~~iS~el-l~ds~~~l~~~i~~~l~~~ 219 (392) T protein:vir:10 142 VEPVRTRSGSRVLEKNS-DMIPFAEITEMGEIPETDNPKFSNVQYAVKDRAGILPLSRSL-LQDSDQNILKYVTKWLGKK 219 (392) T ss_pred eeeccCCceeEEEEeec-CCccceeecccccccccccccceeEEeeeeeEEEeehhhHHH-HhhhHHHHHHHHHHHHHHH Confidence 55665544444333322 223566899999876 55699999999999998888887753 2344557788888999999 Q ss_pred HHHHHHHHHhhcccccCCCccccccccccceeeecCcceEeccCCCCCHHHHhhhh-hhhhhcCCceeEEecCHHHHHHH Q lcl|NC_019448. 149 VAKTIEWASFYGDASLTSEVEGEGLEFDGLAKLIDKNNVINAKGNQLTEKHLNEAA-VRIGKGFGTATDAYMPIGVHADF 227 (463) Q Consensus 149 ~~~~~E~a~fyGd~~l~~~~~~~gleFDGl~~lI~~~nviDarG~~ls~~~ln~aa-~~i~~~~G~~td~~m~~~vka~f 227 (463) +++.++.+++.|+....+.. .+-+ +.|..+- ..+...|-.....+||+.+.+.+ T Consensus 220 i~~~~d~~~~~g~g~~~~~~---~~~~----------------------d~i~~~~~~~l~~~~~~~a~~vm~~~~~~~L 274 (392) T protein:vir:10 220 SKVTRNVLILGVIEKLTKQA---IKSL----------------------DDIKDVLNVKLDPAISPNAILLTNQDGFNYL 274 (392) T ss_pred HHHHHHHHHhhccccccccC---ccCH----------------------HHHHHHHHHhhhhhhccCCEEEEcHHHHHHH Confidence 99999999999998765421 1222 2222221 23344454445589999999999 Q ss_pred HHHhcCcceEEeecCCCCcccceecCeeeecccccccCCceeccCccccccccccCCC--CCCCC--------------- Q lcl|NC_019448. 228 VNSILGRQMQLMQDNSGNVNTGYSVNGFYSSRGFIKLHGSTVMENELILDESLQPLPN--APQPA--------------- 290 (463) Q Consensus 228 ~~~~~~~qrv~~~~n~g~~~~G~~v~~~~s~~G~i~l~~s~~~~~d~~l~~~~~~~p~--ap~p~--------------- 290 (463) ...=-..-|.+.+++..+... . .+.|. +.++..+....++ ..... T Consensus 275 ~~lkd~~G~~l~~~~~~~~~~----~---------tllG~-----~~v~~~~~~~~~~~~~~~~~~~~~~gdfs~~~~i~ 336 (392) T protein:vir:10 275 DKLKDKDGKYILQSDPTQKNK----K---------LFAGT-----NPVVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLF 336 (392) T ss_pred HHhhccCCCeEeecCccCCcc----c---------cccCc-----ccEEEecccccCCCcccCCceEEEEEehhceEEEE Confidence 743211223344333221110 0 11111 0000000000000 00000 Q ss_pred ---eeEEEEeccCCCcCcccccccceEEEEEEEecCCccccccceeeeecCCCCceEEEEEecCCCCCCcce Q lcl|NC_019448. 291 ---KVTATVETKQKGAFEDEEDRAGLSYKVVVNSDDAQSAPSEEVTATVSNVDDGVKLSISVNAMYQQQPQF 359 (463) Q Consensus 291 ---~vtat~~~~~~g~~~~~~~~a~ysYkV~a~s~~geS~~S~~vt~Tva~~~~gv~ltIt~~a~~g~~~~~ 359 (463) .++....+.....|. .....|++. .+ +..-+.....=+.++++..+ +..+|+. T Consensus 337 ~~~~~~~~~~~~~~~~f~----~~~~~~r~~--~r---------~d~~v~~~~a~~~l~~~~~a-~~~~~~~ 392 (392) T protein:vir:10 337 KREDMELASTDVGGKAFT----RNTLDLRAI--QR---------DDVQMWDNEAAVYGEIDLSA-PVEQPQG 392 (392) T ss_pred eecceEEEEeccccchhh----cCceEEEEE--Ee---------eccEEecccceEEEEecccc-cccCCCC Confidence 000000000000000 000111111 11 12233333344566666433 3555544 No 109 >protein:vir:107593 Length: 392 # NCBI annotation: major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1491 # MgeName: Gamma # Cross-refs: genbank:acc:YP_338188;genbank:gi:77020144;genbank:GeneID:3703724 Probab=82.13 E-value=0.08 Score=26.61 Aligned_cols=296 Identities=12% Similarity=0.054 Sum_probs=129.2 Q ss_pred CCC---CCccchHHHHhhhhhhHH----HHHHhh----cCCccCCccccCccccchhhhhhHhhhhhccccccchhhhcc Q lcl|NC_019448. 1 MTI---EKNLSDVQQKYADQFQED----VVKSFQ----TGYGITPDTQIDAGALRREILDDQITMLTWTNEDLIFYRDIS 69 (463) Q Consensus 1 ~~~---~~~~~~~~~~~~k~~~e~----~~Ks~~----agy~~~p~~q~~gaalr~esLd~~i~~L~~~~~df~f~~~i~ 69 (463) ..- .........++.+.+... -.+.+. .....+..+-.+|+.|--+.+..+|..+.. +.-.+++.+. T Consensus 64 ~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~t~~~gg~~vP~~~~~~ii~~~~--~~s~l~~~~~ 141 (392) T protein:vir:10 64 EVETRNVDGEMEYRDVFMKALRNKPLNAEEREFLEDDLEQRAMSGLTGEDGGLVIPQDIQTQINELAR--SFDALEQYVT 141 (392) T ss_pred cccccCccchHHHHHHHHHHHhcccccHHHHHHHhhhhhhhhccccccCCCceecchhHHHHHHHHHH--hhhhhhhhce Confidence 000 001111111222222100 001111 011111112234554433344444422222 2334555555 Q ss_pred cchhhHHHhhhhhhhccCcccccccccccCccc-ccCcceEEEEEEEEEeechhhhhhhhhhhcccccHHHHHHHHHHHH Q lcl|NC_019448. 70 RRPAQSTVVKYDQYLRHGNVGHSRFVKEIGVAP-VSDPNIRQKTVSMKYVSDTKNMSIASGLVNNIADPSQILTEDAIAV 148 (463) Q Consensus 70 k~~~~stv~ey~~~~~hG~~g~~~fv~E~g~~~-~~d~~~~r~~~~~k~l~~~~~vs~~~~lvn~~~Dp~~~~~~~ai~~ 148 (463) ..++.+.-.+|...... +.....+++|++... ...+.+.......+-++.-..+|.-+ +.++.-|.+....+.--.. T Consensus 142 ~~~~~~~~~~~~~~~~~-~~~~a~~v~E~~~~~~~~~~~~~~v~l~~~k~~~~~~iS~el-l~ds~~~l~~~i~~~l~~~ 219 (392) T protein:vir:10 142 VEPVRTRSGSRVLEKNS-DMIPFAEITEMGEIPETDNPKFSNVQYAVKDRAGILPLSRSL-LQDSDQNILKYVTKWLGKK 219 (392) T ss_pred eeeccCCceeEEEEeec-CCccceeecccccccccccccceeEEeeeeeEEEeehhhHHH-HhhhHHHHHHHHHHHHHHH Confidence 55665544444333322 223566899999876 55699999999999998888887753 2344557788888999999 Q ss_pred HHHHHHHHHhhcccccCCCccccccccccceeeecCcceEeccCCCCCHHHHhhhh-hhhhhcCCceeEEecCHHHHHHH Q lcl|NC_019448. 149 VAKTIEWASFYGDASLTSEVEGEGLEFDGLAKLIDKNNVINAKGNQLTEKHLNEAA-VRIGKGFGTATDAYMPIGVHADF 227 (463) Q Consensus 149 ~~~~~E~a~fyGd~~l~~~~~~~gleFDGl~~lI~~~nviDarG~~ls~~~ln~aa-~~i~~~~G~~td~~m~~~vka~f 227 (463) +++.++.+++.|+....+.. .+-+ +.|..+- ..+...|-.....+||+.+.+.+ T Consensus 220 i~~~~d~~~~~g~g~~~~~~---~~~~----------------------d~i~~~~~~~l~~~~~~~a~~vm~~~~~~~L 274 (392) T protein:vir:10 220 SKVTRNVLILGVIEKLTKQA---IKSL----------------------DDIKDVLNVKLDPAISPNAILLTNQDGFNYL 274 (392) T ss_pred HHHHHHHHHhhccccccccC---ccCH----------------------HHHHHHHHHhhhhhhccCCEEEEcHHHHHHH Confidence 99999999999998765421 1222 2222221 23344454445589999999999 Q ss_pred HHHhcCcceEEeecCCCCcccceecCeeeecccccccCCceeccCccccccccccCCC--CCCCC--------------- Q lcl|NC_019448. 228 VNSILGRQMQLMQDNSGNVNTGYSVNGFYSSRGFIKLHGSTVMENELILDESLQPLPN--APQPA--------------- 290 (463) Q Consensus 228 ~~~~~~~qrv~~~~n~g~~~~G~~v~~~~s~~G~i~l~~s~~~~~d~~l~~~~~~~p~--ap~p~--------------- 290 (463) ...=-..-|.+.+++..+... . .+.|. +.++..+....++ ..... T Consensus 275 ~~lkd~~G~~l~~~~~~~~~~----~---------tllG~-----~~v~~~~~~~~~~~~~~~~~~~~~~gdfs~~~~i~ 336 (392) T protein:vir:10 275 DKLKDKDGKYILQSDPTQKNK----K---------LFAGT-----NPVVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLF 336 (392) T ss_pred HHhhccCCCeEeecCccCCcc----c---------cccCc-----ccEEEecccccCCCcccCCceEEEEEehhceEEEE Confidence 743211223344333221110 0 11111 0000000000000 00000 Q ss_pred ---eeEEEEeccCCCcCcccccccceEEEEEEEecCCccccccceeeeecCCCCceEEEEEecCCCCCCcce Q lcl|NC_019448. 291 ---KVTATVETKQKGAFEDEEDRAGLSYKVVVNSDDAQSAPSEEVTATVSNVDDGVKLSISVNAMYQQQPQF 359 (463) Q Consensus 291 ---~vtat~~~~~~g~~~~~~~~a~ysYkV~a~s~~geS~~S~~vt~Tva~~~~gv~ltIt~~a~~g~~~~~ 359 (463) .++....+.....|. .....|++. .+ +..-+.....=+.++++..+ +..+|+. T Consensus 337 ~~~~~~~~~~~~~~~~f~----~~~~~~r~~--~r---------~d~~v~~~~a~~~l~~~~~a-~~~~~~~ 392 (392) T protein:vir:10 337 KREDMELASTDVGGKAFT----RNTLDLRAI--QR---------DDVQMWDNEAAVYGEIDLSA-PVEQPQG 392 (392) T ss_pred eecceEEEEeccccchhh----cCceEEEEE--Ee---------eccEEecccceEEEEecccc-cccCCCC Confidence 000000000000000 000111111 11 12233333344566666433 3555544 No 110 >protein:vir:102082 Length: 392 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:1503 # MgeName: Fah # Cross-refs: genbank:acc:YP_512315;genbank:gi:89152484;genbank:GeneID:3953075 Probab=82.13 E-value=0.08 Score=26.61 Aligned_cols=296 Identities=12% Similarity=0.054 Sum_probs=129.2 Q ss_pred CCC---CCccchHHHHhhhhhhHH----HHHHhh----cCCccCCccccCccccchhhhhhHhhhhhccccccchhhhcc Q lcl|NC_019448. 1 MTI---EKNLSDVQQKYADQFQED----VVKSFQ----TGYGITPDTQIDAGALRREILDDQITMLTWTNEDLIFYRDIS 69 (463) Q Consensus 1 ~~~---~~~~~~~~~~~~k~~~e~----~~Ks~~----agy~~~p~~q~~gaalr~esLd~~i~~L~~~~~df~f~~~i~ 69 (463) ..- .........++.+.+... -.+.+. .....+..+-.+|+.|--+.+..+|..+.. +.-.+++.+. T Consensus 64 ~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~t~~~gg~~vP~~~~~~ii~~~~--~~s~l~~~~~ 141 (392) T protein:vir:10 64 EVETRNVDGEMEYRDVFMKALRNKPLNAEEREFLEDDLEQRAMSGLTGEDGGLVIPQDIQTQINELAR--SFDALEQYVT 141 (392) T ss_pred cccccCccchHHHHHHHHHHHhcccccHHHHHHHhhhhhhhhccccccCCCceecchhHHHHHHHHHH--hhhhhhhhce Confidence 000 001111111222222100 001111 011111112234554433344444422222 2334555555 Q ss_pred cchhhHHHhhhhhhhccCcccccccccccCccc-ccCcceEEEEEEEEEeechhhhhhhhhhhcccccHHHHHHHHHHHH Q lcl|NC_019448. 70 RRPAQSTVVKYDQYLRHGNVGHSRFVKEIGVAP-VSDPNIRQKTVSMKYVSDTKNMSIASGLVNNIADPSQILTEDAIAV 148 (463) Q Consensus 70 k~~~~stv~ey~~~~~hG~~g~~~fv~E~g~~~-~~d~~~~r~~~~~k~l~~~~~vs~~~~lvn~~~Dp~~~~~~~ai~~ 148 (463) ..++.+.-.+|...... +.....+++|++... ...+.+.......+-++.-..+|.-+ +.++.-|.+....+.--.. T Consensus 142 ~~~~~~~~~~~~~~~~~-~~~~a~~v~E~~~~~~~~~~~~~~v~l~~~k~~~~~~iS~el-l~ds~~~l~~~i~~~l~~~ 219 (392) T protein:vir:10 142 VEPVRTRSGSRVLEKNS-DMIPFAEITEMGEIPETDNPKFSNVQYAVKDRAGILPLSRSL-LQDSDQNILKYVTKWLGKK 219 (392) T ss_pred eeeccCCceeEEEEeec-CCccceeecccccccccccccceeEEeeeeeEEEeehhhHHH-HhhhHHHHHHHHHHHHHHH Confidence 55665544444333322 223566899999876 55699999999999998888887753 2344557788888999999 Q ss_pred HHHHHHHHHhhcccccCCCccccccccccceeeecCcceEeccCCCCCHHHHhhhh-hhhhhcCCceeEEecCHHHHHHH Q lcl|NC_019448. 149 VAKTIEWASFYGDASLTSEVEGEGLEFDGLAKLIDKNNVINAKGNQLTEKHLNEAA-VRIGKGFGTATDAYMPIGVHADF 227 (463) Q Consensus 149 ~~~~~E~a~fyGd~~l~~~~~~~gleFDGl~~lI~~~nviDarG~~ls~~~ln~aa-~~i~~~~G~~td~~m~~~vka~f 227 (463) +++.++.+++.|+....+.. .+-+ +.|..+- ..+...|-.....+||+.+.+.+ T Consensus 220 i~~~~d~~~~~g~g~~~~~~---~~~~----------------------d~i~~~~~~~l~~~~~~~a~~vm~~~~~~~L 274 (392) T protein:vir:10 220 SKVTRNVLILGVIEKLTKQA---IKSL----------------------DDIKDVLNVKLDPAISPNAILLTNQDGFNYL 274 (392) T ss_pred HHHHHHHHHhhccccccccC---ccCH----------------------HHHHHHHHHhhhhhhccCCEEEEcHHHHHHH Confidence 99999999999998765421 1222 2222221 23344454445589999999999 Q ss_pred HHHhcCcceEEeecCCCCcccceecCeeeecccccccCCceeccCccccccccccCCC--CCCCC--------------- Q lcl|NC_019448. 228 VNSILGRQMQLMQDNSGNVNTGYSVNGFYSSRGFIKLHGSTVMENELILDESLQPLPN--APQPA--------------- 290 (463) Q Consensus 228 ~~~~~~~qrv~~~~n~g~~~~G~~v~~~~s~~G~i~l~~s~~~~~d~~l~~~~~~~p~--ap~p~--------------- 290 (463) ...=-..-|.+.+++..+... . .+.|. +.++..+....++ ..... T Consensus 275 ~~lkd~~G~~l~~~~~~~~~~----~---------tllG~-----~~v~~~~~~~~~~~~~~~~~~~~~~gdfs~~~~i~ 336 (392) T protein:vir:10 275 DKLKDKDGKYILQSDPTQKNK----K---------LFAGT-----NPVVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLF 336 (392) T ss_pred HHhhccCCCeEeecCccCCcc----c---------cccCc-----ccEEEecccccCCCcccCCceEEEEEehhceEEEE Confidence 743211223344333221110 0 11111 0000000000000 00000 Q ss_pred ---eeEEEEeccCCCcCcccccccceEEEEEEEecCCccccccceeeeecCCCCceEEEEEecCCCCCCcce Q lcl|NC_019448. 291 ---KVTATVETKQKGAFEDEEDRAGLSYKVVVNSDDAQSAPSEEVTATVSNVDDGVKLSISVNAMYQQQPQF 359 (463) Q Consensus 291 ---~vtat~~~~~~g~~~~~~~~a~ysYkV~a~s~~geS~~S~~vt~Tva~~~~gv~ltIt~~a~~g~~~~~ 359 (463) .++....+.....|. .....|++. .+ +..-+.....=+.++++..+ +..+|+. T Consensus 337 ~~~~~~~~~~~~~~~~f~----~~~~~~r~~--~r---------~d~~v~~~~a~~~l~~~~~a-~~~~~~~ 392 (392) T protein:vir:10 337 KREDMELASTDVGGKAFT----RNTLDLRAI--QR---------DDVQMWDNEAAVYGEIDLSA-PVEQPQG 392 (392) T ss_pred eecceEEEEeccccchhh----cCceEEEEE--Ee---------eccEEecccceEEEEecccc-cccCCCC Confidence 000000000000000 000111111 11 12233333344566666433 3555544 No 111 >protein:vir:93616 Length: 645 # NCBI annotation: putative major head protein/prohead protease # Family: family:all:21 # MgeID: mge:157 # MgeName: phi 4795 # Cross-refs: genbank:acc:YP_001449293;genbank:gi:157166041;goa:Q6H9U8;interpro:IPR006433;uniprot:Q6H9U8;genbank:GeneID:5580438 Probab=81.77 E-value=0.083 Score=26.52 Aligned_cols=305 Identities=11% Similarity=0.056 Sum_probs=115.9 Q ss_pred CCCCC-----ccchHH--H-----------------Hhhhh-------hhHHHHHHhhcCCccCCccccCccccchhhhh Q lcl|NC_019448. 1 MTIEK-----NLSDVQ--Q-----------------KYADQ-------FQEDVVKSFQTGYGITPDTQIDAGALRREILD 49 (463) Q Consensus 1 ~~~~~-----~~~~~~--~-----------------~~~k~-------~~e~~~Ks~~agy~~~p~~q~~gaalr~esLd 49 (463) ...+. +..... . .+++. +...+.+++.+|..+++ ...|+-+--+.+. T Consensus 278 ~~~~~~~~~~~~~kg~~f~~~~~al~~~~g~~~~a~e~a~~~~~~~~~~~~~~~~a~~~~~~~~~--~~~Gg~~vp~~~~ 355 (645) T protein:vir:93 278 ASAPVIRVEQKLDKGIGFARFAKSLAAAKGVRSEALEVARRQYPDDSRLHHVLKSAVGAGTTTDP--QWAGSLSEYQEYA 355 (645) T ss_pred cccccccchhhhhhhhhHHHHHHHHHhcccchhHHHHHHHhhcccchhhhhhhhhhhhccccccc--cccCCccCchhhH Confidence 00000 000000 0 00000 11223455655553332 2334443333333 Q ss_pred hHhhhhhccccccchhhhcccchhhHHHh-hhhhhh--ccCcccccccccccCcccccCcceEEEEEEEEEeechhhhhh Q lcl|NC_019448. 50 DQITMLTWTNEDLIFYRDISRRPAQSTVV-KYDQYL--RHGNVGHSRFVKEIGVAPVSDPNIRQKTVSMKYVSDTKNMSI 126 (463) Q Consensus 50 ~~i~~L~~~~~df~f~~~i~k~~~~stv~-ey~~~~--~hG~~g~~~fv~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs~ 126 (463) .+|-.+.... ..+.++..+....... .++... .-|+ +...+++|++..+.+++.+...+...|=|+.--.+|. T Consensus 356 ~~ii~~l~~~---svv~~l~~~~~~~~~~~~~~~~ip~~t~~-~~a~wv~Eg~~~~~s~~~f~~v~l~~~kla~~~~iS~ 431 (645) T protein:vir:93 356 QDFIDYLRPQ---TIIGRFGQGGIPALRQVPFNIRVHAQVSG-GAAGWVGEGKTKPLTKFDFESITFSHAKVSAIAVLTE 431 (645) T ss_pred HHHHHhhhhh---hhHHhhccccccccccccCceeeeeeecC-cceEEeccCccccccccceeEEEEeeEEEEEeehhHH Confidence 3332111111 1111111111111000 111111 1122 4577999999999999999999999998888777776 Q ss_pred hhhhhcccccHHHHHHHHHHHHHHHHHHHHHhhcccccCCCccccccccccceeeecCcceEeccCCCCCHHHHhhhhhh Q lcl|NC_019448. 127 ASGLVNNIADPSQILTEDAIAVVAKTIEWASFYGDASLTSEVEGEGLEFDGLAKLIDKNNVINAKGNQLTEKHLNEAAVR 206 (463) Q Consensus 127 ~~~lvn~~~Dp~~~~~~~ai~~~~~~~E~a~fyGd~~l~~~~~~~gleFDGl~~lI~~~nviDarG~~ls~~~ln~aa~~ 206 (463) -+=. ++.-|.+....++-...+++.++.++|.|+..-..+..+.|+- .|.. .+...| ....+..+..... T Consensus 432 ell~-ds~~~~~~~i~~~l~~aia~~~d~a~l~g~g~~~~~~~p~gi~-~~~~-------~~~~~~-~~~~d~~~~~~~~ 501 (645) T protein:vir:93 432 ELIR-FSSPAADALVRNALAEAVVARLDTDFVDPKKAAVADVSPASIT-HDVK-------GTASSG-NPDADAEAAFGQF 501 (645) T ss_pred HHHh-hchHHHHHHHHHHHHHHHHHHHHHHhhcCCCcccCCcccccee-cccc-------cccccc-chHHHHHHHHHHH Confidence 5432 3345678888899999999999999999986643333344431 1111 111212 1222333322222 Q ss_pred hhhcCCceeE-EecCHHHHHHHHHHhcC-cceEEeecC--CCCcccceecCeeeecc--ccc-ccCCc-eec--cCcccc Q lcl|NC_019448. 207 IGKGFGTATD-AYMPIGVHADFVNSILG-RQMQLMQDN--SGNVNTGYSVNGFYSSR--GFI-KLHGS-TVM--ENELIL 276 (463) Q Consensus 207 i~~~~G~~td-~~m~~~vka~f~~~~~~-~qrv~~~~n--~g~~~~G~~v~~~~s~~--G~i-~l~~s-~~~--~~d~~l 276 (463) +..++...+- ..|++.+.+.+...--. .+..+ +.. .++.-.|++|- .+-. +.+ ..+.+ +++ ..+..+ T Consensus 502 ~~a~~~~~~a~~vmn~~~~~~L~~lkd~~G~~~~-~~~~~~~~tL~G~PV~--~s~~vp~~~~~gd~s~~~ig~~~~v~i 578 (645) T protein:vir:93 502 VAANLQPTGAVWLMSSTNALALSMRKNALGQKEY-PDMTLLGGSFQGLPVI--VSQYVGDQLVLVNAPDIYLADDGGVAV 578 (645) T ss_pred HhcCCCccccEEEEcHHHHHHHHhccccCCceee-cCCCCCCceeeceeeE--EeccCCcceeEeccccEEEEEecceEE Confidence 3333333333 45899999888643222 22222 221 12234555541 1100 000 00000 000 001111 Q ss_pred ccccc------cCC--C-----------CCCCCeeEEEEeccCCCcCcccccccceEEEEEEEecCCccccccceeeeec Q lcl|NC_019448. 277 DESLQ------PLP--N-----------APQPAKVTATVETKQKGAFEDEEDRAGLSYKVVVNSDDAQSAPSEEVTATVS 337 (463) Q Consensus 277 ~~~~~------~~p--~-----------ap~p~~vtat~~~~~~g~~~~~~~~a~ysYkV~a~s~~geS~~S~~vt~Tva 337 (463) +..+. ..| + .++--.++.=+.---.|....++ . |+++.+=-+..+|- T Consensus 579 ~~s~~a~~~~~~~~~~~~~~~~~~~~v~lf~~d~vaira~~r~d~~~~~p~-----a--~~~lt~~~~g~~~~------- 644 (645) T protein:vir:93 579 DMSREASLEMQSEPTGDSTTPSPVELVSMFQTGSVAIRAERWINWRRRRTA-----A--VAVITGVNYGSASG------- 644 (645) T ss_pred EeecceeEEEeecccccccccccccchhHhhcCceEEEEEEEEcceeeCcc-----c--eEEEecccCCcccC------- Confidence 10000 000 0 00000000000000001000000 0 00111000111110 Q ss_pred CCCCc Q lcl|NC_019448. 338 NVDDG 342 (463) Q Consensus 338 ~~~~g 342 (463) | T Consensus 645 ----~ 645 (645) T protein:vir:93 645 ----G 645 (645) T ss_pred ----C Confidence 0 No 112 >protein:vir:80068 Length: 301 # NCBI annotation: gp8 # Family: family:all:463 # MgeID: mge:1876 # MgeName: B054 # Cross-refs: genbank:acc:YP_001468712;genbank:gi:157325292;genbank:GeneID:5601759 Probab=80.80 E-value=0.091 Score=26.28 Aligned_cols=261 Identities=13% Similarity=0.055 Sum_probs=115.3 Q ss_pred ccCCccccCccccch---hhhhhHhhhhhccccccc-hhhhcccchhhHHHhhhhhhhccCccccccccccc-CcccccC Q lcl|NC_019448. 31 GITPDTQIDAGALRR---EILDDQITMLTWTNEDLI-FYRDISRRPAQSTVVKYDQYLRHGNVGHSRFVKEI-GVAPVSD 105 (463) Q Consensus 31 ~~~p~~q~~gaalr~---esLd~~i~~L~~~~~df~-f~~~i~k~~~~stv~ey~~~~~hG~~g~~~fv~E~-g~~~~~d 105 (463) -++ .+.|++-- |-+|+.+....+..-.+. |+.-...-.+--....|...... |....++.. ...+..| T Consensus 1 ~~~----~~~g~f~~~~l~~id~~v~e~~~~~l~~r~l~~v~~~~~~~~~~~~~~~~~~~---G~~~~~~~~~~dip~~~ 73 (301) T protein:vir:80 1 MQG----KITATIEARDLQAIDNVIYEPKQEELTARSVFPQKFDVNEGAESYSFDVMTRS---GAAKIIANGADDLPLVD 73 (301) T ss_pred CCc----cccchhhHHHHHHHHHHHHHhhhhhhhhhhhcccccCCCCceEEEEEeeeccc---eeEEEecCccccccccc Confidence 001 12334444 334555544433332221 22222222232223334333333 333444433 3357789 Q ss_pred cceEEEEEEEEEeechhhhhhh--hhhhcccccHHHHHHHHHHHHHHHHHHHHHhhcccccCCCccccccccccceeeec Q lcl|NC_019448. 106 PNIRQKTVSMKYVSDTKNMSIA--SGLVNNIADPSQILTEDAIAVVAKTIEWASFYGDASLTSEVEGEGLEFDGLAKLID 183 (463) Q Consensus 106 ~~~~r~~~~~k~l~~~~~vs~~--~~lvn~~~Dp~~~~~~~ai~~~~~~~E~a~fyGd~~l~~~~~~~gleFDGl~~lI~ 183 (463) .++.|++..+.-+..++.++.. ......-.+..+.....|.+.+.+.....+||||+.+. +.||.+-=+ T Consensus 74 ~~~~~~~~~i~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aa~~~~~~~~n~~~f~G~~~~g---------~~GLlN~p~ 144 (301) T protein:vir:80 74 VDMVRKSVPIYSIGIGLSYTIQDLRAARMQGTTVDAAKATTVRRAIAEKENSIAFRGEKKYA---------IKGAFEATG 144 (301) T ss_pred ccceeEEEEEEEEEeeeeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEeeeccccc---------ceeeecCCC Confidence 9999999999999998887653 33344445778888889999999999999999999863 555555221 Q ss_pred C---cceEeccCCC-----CC----HHHHhhhhhhhhh---cCCceeEEecCHHHHHHHHHHhcCcc------eEEeecC Q lcl|NC_019448. 184 K---NNVINAKGNQ-----LT----EKHLNEAAVRIGK---GFGTATDAYMPIGVHADFVNSILGRQ------MQLMQDN 242 (463) Q Consensus 184 ~---~nviDarG~~-----ls----~~~ln~aa~~i~~---~~G~~td~~m~~~vka~f~~~~~~~q------rv~~~~n 242 (463) - ...-+..|.. -+ .+.|+++...+.. ++=.++.+.||+.....+..-+...+ ..+.+++ T Consensus 145 ~~~~~~~~~~~~~~~~w~~~t~~ei~~di~~~~~~l~~~s~g~~~p~~L~L~p~~~~~L~~~~~~~~~~~tvl~~l~~~~ 224 (301) T protein:vir:80 145 IQIDVSPTTGVGNVSKWEKKTAEQIIDEIGEAHTKITVLPGYGTASLKLCLPPKQFELINKKRYSNEDSRSVLKVLQDNA 224 (301) T ss_pred cccccccCcccccccccccCCHHHHHHHHHHHHHHHHHhcCceecccEEEecHHHHHhhhhccccCCCCeeHHHHHHHHc Confidence 0 1111111111 11 3557777777653 33468899999999998864322110 0111121 Q ss_pred CCCcccceecCeeeecccccccCCceecc--Ccc-----ccccc-cccCCCCCC---CCee-EEEEeccCCCcCcccccc Q lcl|NC_019448. 243 SGNVNTGYSVNGFYSSRGFIKLHGSTVME--NEL-----ILDES-LQPLPNAPQ---PAKV-TATVETKQKGAFEDEEDR 310 (463) Q Consensus 243 ~g~~~~G~~v~~~~s~~G~i~l~~s~~~~--~d~-----~l~~~-~~~~p~ap~---p~~v-tat~~~~~~g~~~~~~~~ 310 (463) .. .. =..++.+.++.+. .-+-.++.. .|. .++.. ....+..+. |-.- ++.+.---.....--++- T Consensus 225 ~~-~~-I~~~p~L~~~g~~-g~~~~v~~~~~~d~~~~~v~~~~~~~~~e~~~~~~~~~~~~r~~Gv~i~~P~ai~~~~GI 301 (301) T protein:vir:80 225 WF-SA-IVRVPDLAGMGTA-GSDSFAVIHDSNETAELIIPMDITRHPEEYSFPRTKVPFEERTAGVVVRFPAAIVRVDGI 301 (301) T ss_pred Cc-ce-EEEcceeccCCCC-cccEEEEEecCCcEEEEEecCceeeecceecCceeEeeeeeeeEEEEEEccceEEEEecC Confidence 11 00 0112222221110 000011111 111 11110 011111111 1110 000000001110000000 No 113 >protein:vir:97397 Length: 517 # NCBI annotation: major capsid protein # Family: family:all:11745 # MgeID: mge:1675 # MgeName: Q54 # Cross-refs: genbank:acc:YP_762590;genbank:gi:115304291;genbank:GeneID:5130600 Probab=80.51 E-value=0.094 Score=26.21 Aligned_cols=278 Identities=12% Similarity=0.064 Sum_probs=111.8 Q ss_pred CC-CCCccchHHHHhhhhhhHH--HHHHhhcCCc--c---CCccc-cCccccchhhhhhHhhhhhccccccchhhhcccc Q lcl|NC_019448. 1 MT-IEKNLSDVQQKYADQFQED--VVKSFQTGYG--I---TPDTQ-IDAGALRREILDDQITMLTWTNEDLIFYRDISRR 71 (463) Q Consensus 1 ~~-~~~~~~~~~~~~~k~~~e~--~~Ks~~agy~--~---~p~~q-~~gaalr~esLd~~i~~L~~~~~df~f~~~i~k~ 71 (463) +. .+.........+.+..+.. ..+...++.+ . .-... .++-..+.+.+..-+..+..... +.+-++.. T Consensus 200 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~---i~~~~~~~ 276 (517) T protein:vir:97 200 LGVEALKVTPEATEFLKTREAEVAYMSASLTKDPKAAWTAELKERGISGMPAPAGILKRIQDAVNDEGS---LLPFIRHE 276 (517) T ss_pred cccccccccchhhHHHHHHHHHHHHHHhcccccccceeeeecccccccccccchHHHHHHHHhhhhhcc---ceeeeeec Confidence 00 0111111111222222111 1111111111 1 10111 12323344444443333332221 11111111 Q ss_pred hhhHHHhhhhhhhccCcccccccccccCcccccCcceEEEEEEEEEeechhhhhhhhhhhccccc----HHHHHHHHHHH Q lcl|NC_019448. 72 PAQSTVVKYDQYLRHGNVGHSRFVKEIGVAPVSDPNIRQKTVSMKYVSDTKNMSIASGLVNNIAD----PSQILTEDAIA 147 (463) Q Consensus 72 ~~~stv~ey~~~~~hG~~g~~~fv~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lvn~~~D----p~~~~~~~ai~ 147 (463) .+. +.+....-..+...++.|+...+.+|..+..++..+|-++.-..+|..+ +.++.-| .+....+.-.. T Consensus 277 ~i~-----~~~~~~~~~~~~a~~~~eG~~kp~s~~tf~~~~~~~~~ia~~~~~S~ql-l~Ds~~dd~~~l~s~i~~~l~~ 350 (517) T protein:vir:97 277 NLP-----TLVVGGDNALTQGTGHTTGTDKTESNITLQTRVLTPQYVYKYIKLPKIV-MNSNATDIAGAILTYVMNRLPD 350 (517) T ss_pred ccc-----ceeeecccccceeeeeecCCcccccccceeeEEeeHhhhhhhhhhhHHH-HHHhhhccHHHHHHHHHHHHHH Confidence 110 0011000001124467899999999999999999998888877776643 2233333 56777778888 Q ss_pred HHHHHHHHHHhhcccccCCCccccccccccceeeecCcceEeccCCCCCHHHHhhhhhhhhhcCCceeEEecCHHHHHHH Q lcl|NC_019448. 148 VVAKTIEWASFYGDASLTSEVEGEGLEFDGLAKLIDKNNVINAKGNQLTEKHLNEAAVRIGKGFGTATDAYMPIGVHADF 227 (463) Q Consensus 148 ~~~~~~E~a~fyGd~~l~~~~~~~gleFDGl~~lI~~~nviDarG~~ls~~~ln~aa~~i~~~~G~~td~~m~~~vka~f 227 (463) .++...|.++..||-. |.+..|+..+........+.+..--.+++.....-+.+.++ .-+.|++.+.+.+ T Consensus 351 ~l~~~ee~a~l~GdGt--------g~~~~gi~~~a~~~~~~~~~~~~~~~d~i~~l~~a~~~a~~--a~~vmn~~t~~~I 420 (517) T protein:vir:97 351 MVIMAVNRAIIMGGVT--------GVSETQIYPVVGDAWATNVTGTTNIQELLEKLSVATPKAAD--STLVIHRNDLAAI 420 (517) T ss_pred HHHHHHHHHHhcccCC--------CcccccccccccccccccccccchHHHHHHHHHHHhhhccC--CEEEECHHHHHHH Confidence 9999999999999853 22344555443322222222222222333333333333222 2378999999988 Q ss_pred HHHh-cCcceEEeecCCCCc--c--ccee--cC-------------ee--eecccccccCCcee-ccCcccccccc---- Q lcl|NC_019448. 228 VNSI-LGRQMQLMQDNSGNV--N--TGYS--VN-------------GF--YSSRGFIKLHGSTV-MENELILDESL---- 280 (463) Q Consensus 228 ~~~~-~~~qrv~~~~n~g~~--~--~G~~--v~-------------~~--~s~~G~i~l~~s~~-~~~d~~l~~~~---- 280 (463) .-.= -+.+..+ ++..+.. . .|.. ++ +| ....|-..+..-.. .+.+.++.+.+ T Consensus 421 ~klKD~~G~Yl~-~~~~~~~~~~~l~G~~~~~~~~~~~~~~~~~~~~y~i~~~~g~~~~~~fd~~~n~~~f~~~~~~~g~ 499 (517) T protein:vir:97 421 RFLKDKNGNYVF-PVGVSNQTIATHFGFNRLVQSVAVDEKTAVSLSGYVTNGSRGMEFEQGTILVENNKEYLFEMPISGS 499 (517) T ss_pred HHhhcCCCCeec-cCcCCcccccccCCccccccccccCceeEeeccccEEEeecceeeeeeeecccCceeEeeeeeeccc Confidence 6322 1233333 3211111 1 1110 01 11 01111111111000 12333333333 Q ss_pred ccCCCCCCCCeeEEEEeccCCC Q lcl|NC_019448. 281 QPLPNAPQPAKVTATVETKQKG 302 (463) Q Consensus 281 ~~~p~ap~p~~vtat~~~~~~g 302 (463) +..|.+.+- ++..|...| T Consensus 500 i~~~~r~a~----~~~~p~~~~ 517 (517) T protein:vir:97 500 LEYKGTTAY----GTYTPPVAG 517 (517) T ss_pred cccccceEE----EEEcCCCCC Confidence 222332221 223333333 No 114 >protein:vir:9361 Length: 402 # NCBI annotation: SLT orf 37-like protein # Family: family:all:658 # MgeID: mge:166 # MgeName: phi 12 # Cross-refs: genbank:acc:NP_803339;genbank:gi:29028650;genbank:GeneID:1258088 Probab=80.15 E-value=0.098 Score=26.12 Aligned_cols=289 Identities=8% Similarity=0.088 Sum_probs=117.9 Q ss_pred CCCCCccchHHHHhhhhh------h------HHHHHHhhcCCccCCccccCccccchhhhhhHhhhhhccccccchhhhc Q lcl|NC_019448. 1 MTIEKNLSDVQQKYADQF------Q------EDVVKSFQTGYGITPDTQIDAGALRREILDDQITMLTWTNEDLIFYRDI 68 (463) Q Consensus 1 ~~~~~~~~~~~~~~~k~~------~------e~~~Ks~~agy~~~p~~q~~gaalr~esLd~~i~~L~~~~~df~f~~~i 68 (463) ...+.+....+..+++.+ . ....+++.+| +..+||.|-=+.+..+|..+..... .+++.+ T Consensus 96 ~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~~~~~a~~~~------t~~~GG~lIP~~~~~~Ii~~~~~~~--~l~~~~ 167 (402) T protein:vir:93 96 LSDNEKMVKAKAEFYRHAILPNEFEKPSMEAQRLLHALPTG------NDSGGDKLLPKTLSKEIVSEPFAKN--QLREKA 167 (402) T ss_pred CchhHHHHHHHHHHHHHHHhhhhHHHHHHhHHHHHhhhccC------CCcCCccccchhHHHHHHHhHHhhh--hhhhhc Confidence 111111111111111111 0 1111222222 2234555544545555543333222 233333 Q ss_pred ccchhhHHHhhhhhhhccCcccccccccccCcccccCcceEEEEEEEEEeechhhhhhhhhhhcccccHHHHHHHHHHHH Q lcl|NC_019448. 69 SRRPAQSTVVKYDQYLRHGNVGHSRFVKEIGVAPVSDPNIRQKTVSMKYVSDTKNMSIASGLVNNIADPSQILTEDAIAV 148 (463) Q Consensus 69 ~k~~~~stv~ey~~~~~hG~~g~~~fv~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lvn~~~Dp~~~~~~~ai~~ 148 (463) ...++.+ + .+.+. +.+ .+...+++|++..+..++++......++=++.-..+|.- -+.++..|.+....+.-... T Consensus 168 ~v~~~~~-~-~~p~~-~~~-~~~a~~v~Eg~~~~~~~~~f~~i~~~~~k~~~~i~iS~e-ll~Ds~~~l~~~i~~~la~~ 242 (402) T protein:vir:93 168 RLTNIKG-L-EIPRV-SYT-LDDDDFITDVETAKELKAKGDTVKFTTNKFKVFAAISDT-VIHGSDVDLVNWVENALQSG 242 (402) T ss_pred eeeecCC-c-eeeee-ecc-CCccccccccccccccccccceeeecceeeeeechhhHH-HHhhhHHHHHHHHHHHHHHH Confidence 3333332 1 12222 221 123568999999999999999988888877765555533 12344556666666666666 Q ss_pred HHHHHHHHHhhcccccCCCccccccccccceeeecCcceEeccCCCCCHHHHhhhhhhhhhcCCceeEEecCHHHHHHHH Q lcl|NC_019448. 149 VAKTIEWASFYGDASLTSEVEGEGLEFDGLAKLIDKNNVINAKGNQLTEKHLNEAAVRIGKGFGTATDAYMPIGVHADFV 228 (463) Q Consensus 149 ~~~~~E~a~fyGd~~l~~~~~~~gleFDGl~~lI~~~nviDarG~~ls~~~ln~aa~~i~~~~G~~td~~m~~~vka~f~ 228 (463) ++.+.+..+|-+..+. | +..|+.. + ..+--..|..+ .+.|..+-..+...|-.-.-.+|+..+...+. T Consensus 243 ~~~~e~~~~~~~g~g~-------g-~p~g~~~--~-~~~~~~~~~~~-~d~l~~~~~~l~~~y~~na~~imn~~t~~~~~ 310 (402) T protein:vir:93 243 LAAKERKDALAVSPKS-------G-LEHMSFY--N-GSVKEVEGADM-YDAIINALADLHEDYRDNATIYMRYADYVKII 310 (402) T ss_pred HHHHHHHhHhhcCCCc-------c-ccceeee--c-cccccccccch-HHHHHHHHhccChhhhcCCEEEEechHHHHHH Confidence 6665444455332221 1 2333321 1 11111222222 23344444456667744344678887766665 Q ss_pred HHhcCcceEEeecCCCCcccceecCeeeecccccccCCceecc--CccccccccccC--CCCCCCCe--eEEEEeccCCC Q lcl|NC_019448. 229 NSILGRQMQLMQDNSGNVNTGYSVNGFYSSRGFIKLHGSTVME--NELILDESLQPL--PNAPQPAK--VTATVETKQKG 302 (463) Q Consensus 229 ~~~~~~qrv~~~~n~g~~~~G~~v~~~~s~~G~i~l~~s~~~~--~d~~l~~~~~~~--p~ap~p~~--vtat~~~~~~g 302 (463) ...-+..+-+....++ .-.|.+|- .+. + -+.+++. ...|++...... .+-+-... .-+..-- .| T Consensus 311 ~~~~d~~~~~~~~~~~-~llG~PV~--~t~-~----~~~i~~GDf~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~--Dg 380 (402) T protein:vir:93 311 SVLSNGTTNFFDTPAE-KVFGKPVV--FTD-A----AVKPIVGDFNYFGINYDGTTYDTDKDVKKGEYLFVLTAWY--DQ 380 (402) T ss_pred HHHhcCCCcccccCCc-cccccceE--Eec-C----CCceeeechhhhhhhhhhhhhhhhhcccCCceEEEEEEEe--Cc Confidence 5554445545543333 33466642 111 0 0011111 001111111100 00000011 1111111 12 Q ss_pred cCcccccccceEEEEEEEecCCccccc Q lcl|NC_019448. 303 AFEDEEDRAGLSYKVVVNSDDAQSAPS 329 (463) Q Consensus 303 ~~~~~~~~a~ysYkV~a~s~~geS~~S 329 (463) +..++ -..++.-+...+-|.|| T Consensus 381 ~v~~~-----~A~~~l~ik~~~~~~~~ 402 (402) T protein:vir:93 381 QRTLD-----SAFRIAKAKENTGPLPS 402 (402) T ss_pred EEech-----hheEEEEeecCCCCCCC Confidence 11111 12223333333555565 No 115 >protein:vir:95963 Length: 395 # NCBI annotation: ORF009 # Family: family:all:635 # MgeID: mge:1594 # MgeName: 2638A # Cross-refs: genbank:acc:YP_239802;genbank:gi:66395459;genbank:GeneID:5132880 Probab=80.12 E-value=0.028 Score=29.10 Aligned_cols=284 Identities=13% Similarity=0.092 Sum_probs=107.6 Q ss_pred CCCCCccchHHHHhhhhhh---------------------HHHHHHhhcCCccCCccccCccccchhhhhhHh-hhhhcc Q lcl|NC_019448. 1 MTIEKNLSDVQQKYADQFQ---------------------EDVVKSFQTGYGITPDTQIDAGALRREILDDQI-TMLTWT 58 (463) Q Consensus 1 ~~~~~~~~~~~~~~~k~~~---------------------e~~~Ks~~agy~~~p~~q~~gaalr~esLd~~i-~~L~~~ 58 (463) |-. +-.....+...++.+ ..+..++.+ .+..+|+.|--+.+.++| ..|. T Consensus 41 ~~~-~~~~~~~~~~~~e~~~~~~~~~~~~~r~~~~l~~ee~~~~~~~~~------~t~~~gG~liP~~~~~~Ii~~l~-- 111 (395) T protein:vir:95 41 MFD-ALSNDLQEEITAEINNRVVDNGILAKRSQDPLTSEERKFFNDINY------DVGYTDEKILPETVVERVFDDLQ-- 111 (395) T ss_pred HHH-HHHHHHHHHHHHHHHHHHHHHHHHhhcCccccchHHHHHHHHHhh------ccCCCCceeccHHHHHHHHHHHH-- Confidence 000 000000000000000 111112222 233445554444444444 3333 Q ss_pred ccccchhhhcccchhhHHHhhhhhhhccCcccccccccccCc-ccccCcceEEEEEEEEEeechhhhhhhhhhhcccccH Q lcl|NC_019448. 59 NEDLIFYRDISRRPAQSTVVKYDQYLRHGNVGHSRFVKEIGV-APVSDPNIRQKTVSMKYVSDTKNMSIASGLVNNIADP 137 (463) Q Consensus 59 ~~df~f~~~i~k~~~~stv~ey~~~~~hG~~g~~~fv~E~g~-~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lvn~~~Dp 137 (463) +...+++.+...++..++ .+....+.+...++.|.+. ...+++.+.+.....+=|+.--.+|.-+ |.++..|. T Consensus 112 -~~s~i~~~~~v~~~~~~~----~i~~~~~~~~a~w~~e~~~~~~~~~~~f~~i~l~~~kl~~~~~iS~el-l~ds~~~i 185 (395) T protein:vir:95 112 -KDHPLLSKINFQNAGIKT----RVIKADPAGQAVWGKVFGEIKGQLDAAFREENFTQYKLTCFVVLPDDL-STFGPAWI 185 (395) T ss_pred -hhhhhhhhceeEecCCce----EEEEecCCcceEEeecccccCccccccceeeeeceeeEEEeecccHHH-HhcchhHH Confidence 223445555555554443 2233334455667667665 4578999999999999998777777655 56677788 Q ss_pred HHHHHHHHHHHHHHHHHHHHhhcccccCCCccccccccccceeeecCcce-Eec--cCCCCC-------HHHHh----hh Q lcl|NC_019448. 138 SQILTEDAIAVVAKTIEWASFYGDASLTSEVEGEGLEFDGLAKLIDKNNV-INA--KGNQLT-------EKHLN----EA 203 (463) Q Consensus 138 ~~~~~~~ai~~~~~~~E~a~fyGd~~l~~~~~~~gleFDGl~~lI~~~nv-iDa--rG~~ls-------~~~ln----~a 203 (463) +....+.-..++++.+|.+++.|+-.-.. |=-||.+-....+. ... ....++ ...|. .+ T Consensus 186 e~~i~~~la~~ia~~~~~a~i~G~G~~~~-------qP~Gil~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~l~~~~~~~ 258 (395) T protein:vir:95 186 ERFVRTQIQEAISVALESAIINGGGAAKT-------QPVGLMKDVNTNSGAVTDKASSGTLTFADADTTILELNDVLKNL 258 (395) T ss_pred HHHHHHHHHHHHHHHHhhheeeccCCCCc-------CceeeeecccccccccccccccchhhhhhhHhhHHHHHHHHHhh Confidence 99999999999999999999999854210 11244432221100 000 000011 11111 11 Q ss_pred hhh-hhh--cC-CceeEEecCHHHHHHHHHHh------------cCcc-eEEeecCC--CCcccceecCe-eeecccccc Q lcl|NC_019448. 204 AVR-IGK--GF-GTATDAYMPIGVHADFVNSI------------LGRQ-MQLMQDNS--GNVNTGYSVNG-FYSSRGFIK 263 (463) Q Consensus 204 a~~-i~~--~~-G~~td~~m~~~vka~f~~~~------------~~~q-rv~~~~n~--g~~~~G~~v~~-~~s~~G~i~ 263 (463) +.. ..+ .| |... +.|++.+..+..-.+ ++.- .++..++- +..-.| +-.. ++..++.+. T Consensus 259 ~~~~~~~~~~~~~~~~-~~mn~~t~~~~~g~~~~~~~~G~~~~~lg~g~~v~~~~~~p~~~i~fg-dfs~y~i~~r~~~~ 336 (395) T protein:vir:95 259 SVDEKGKELKIDGKVA-LVVNPRDSWDVQARYTYLTANGGFVTVLPYNVTIITSEFVPEGKLVAF-VTDRYNAVRGGGLT 336 (395) T ss_pred ccccccchhhhcCceE-EEEcchhhhhcCCcceeccCCCcceeccCCcceEEEcCCCCCCcEEEE-ecccEEEEEecceE Confidence 111 000 11 2221 345555444432111 1111 11111110 000111 1111 111222222 Q ss_pred cC--CceeccCccccccccc------cCCCCCCCCeeE------EEEeccCCCcCcccc Q lcl|NC_019448. 264 LH--GSTVMENELILDESLQ------PLPNAPQPAKVT------ATVETKQKGAFEDEE 308 (463) Q Consensus 264 l~--~s~~~~~d~~l~~~~~------~~p~ap~p~~vt------at~~~~~~g~~~~~~ 308 (463) +. ....+..|...-+... ..++|..--+++ +++.+.+.-++.... T Consensus 337 i~~~~~~~~~~d~~~f~~~~r~dg~~~~~~A~~~l~i~~~~~~~~~~~~~~~~~~~~~~ 395 (395) T protein:vir:95 337 VKKFDQTLALEDAVLFTAKTFAYGQPDDNKASAVYDLKVASAPRRQTSAGGTTDGIAEA 395 (395) T ss_pred EEeccchhhhCCcEEEEEEEEECCEEeccccEEEEEeeccCCCCCCCCCCCCCCccccC Confidence 21 1111222322111111 111111100111 001111111111111 No 116 >protein:vir:93881 Length: 387 # NCBI annotation: ORF011 # Family: family:all:658 # MgeID: mge:1485 # MgeName: 3A # Cross-refs: genbank:acc:YP_239938;genbank:gi:66395599;genbank:GeneID:5130947 Probab=78.86 E-value=0.11 Score=25.84 Aligned_cols=293 Identities=9% Similarity=0.030 Sum_probs=118.3 Q ss_pred CCCCCccchHHHHhhhhhhHHHHHHhhcC-C-------------ccCCccccCccccchhhhhhHhhhhhccccccchhh Q lcl|NC_019448. 1 MTIEKNLSDVQQKYADQFQEDVVKSFQTG-Y-------------GITPDTQIDAGALRREILDDQITMLTWTNEDLIFYR 66 (463) Q Consensus 1 ~~~~~~~~~~~~~~~k~~~e~~~Ks~~ag-y-------------~~~p~~q~~gaalr~esLd~~i~~L~~~~~df~f~~ 66 (463) -+.|.+.. ..+...+.+.+. .+++..+ . ..+-.+..+||.|-=+.+..+|..+...... +.+ T Consensus 75 ~~~~~~~~-~~~~~~~~~~~~-~r~~~~~~~~~~~~~~~~~~~~al~~~t~s~gG~~IP~~~~~~Ii~~~~~~~~--l~~ 150 (387) T protein:vir:93 75 GEAYQSLN-DHEKMVKAKAEF-YRHAILPNEFEKPSMEAQRLLHALPTGNDSGGDKLLPKTLSKEIVSEPFAKNQ--LRE 150 (387) T ss_pred cccCCCcc-hhhHHHHHHHHH-HHHHhhhhhhhhhhhhhHHHHHhhccCcCCCCceeechhHHHHHHHHHHhhch--hhh Confidence 11111111 111111222111 1111111 0 0111123445654444455555433333322 233 Q ss_pred hcccchhhHHHhhhhhhhccCcccccccccccCcccccCcceEEEEEEEEEeechhhhhhhhhhhcccccHHHHHHHHHH Q lcl|NC_019448. 67 DISRRPAQSTVVKYDQYLRHGNVGHSRFVKEIGVAPVSDPNIRQKTVSMKYVSDTKNMSIASGLVNNIADPSQILTEDAI 146 (463) Q Consensus 67 ~i~k~~~~stv~ey~~~~~hG~~g~~~fv~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lvn~~~Dp~~~~~~~ai 146 (463) -+...++.+ + ++.+.. - ..+...+++|++..+.+++.+......++=++.--.+|.- -+.++..|.+....+.-. T Consensus 151 ~~~v~~~~~-~-~~p~~~-~-~~~~a~~v~E~~~~~~~~~~f~~v~~~~~k~~~~~~iS~e-ll~Ds~~~l~~~i~~~la 225 (387) T protein:vir:93 151 KARLTNIKG-L-EIPRVS-Y-TLDDDDFITDVETAKELKLKGDTVKFTTNKFKVFAAISDT-VIHGSDVDLVNWVENALQ 225 (387) T ss_pred heeeeecCC-c-eEEEEe-e-cCCccccccCcccccccccccceeeeeheeeeeechhhHH-HHhhhHHHHHHHHHHHHH Confidence 233333322 1 122221 1 1224568999999999999999988888777765555532 124456677777777777 Q ss_pred HHHHHHHHHHHhhcccccCCCccccccccccceeeecCcceEeccCCCCCHHHHhhhhhhhhhcCCceeEEecCHHHHHH Q lcl|NC_019448. 147 AVVAKTIEWASFYGDASLTSEVEGEGLEFDGLAKLIDKNNVINAKGNQLTEKHLNEAAVRIGKGFGTATDAYMPIGVHAD 226 (463) Q Consensus 147 ~~~~~~~E~a~fyGd~~l~~~~~~~gleFDGl~~lI~~~nviDarG~~ls~~~ln~aa~~i~~~~G~~td~~m~~~vka~ 226 (463) .++.+..+..+|-+..+. | +..|+.. + ..+-...|..+ .+.|..+--.+...|-...-.+|+..+... T Consensus 226 ~~~~~~e~~~~~~~g~g~-------g-~p~g~l~--~-~~~~~v~~~~~-~d~i~~~~~~l~~~~~~~a~~~mn~~t~~~ 293 (387) T protein:vir:93 226 SGLAAKERKDALAVSPKS-------G-LDHMSFY--N-GSVKEVEGADM-YDAIINALADLHEDYRDNATIYMRYADYVK 293 (387) T ss_pred HHHHHHHHHhHhhcCCCc-------c-ccceeee--c-cccccccccch-HHHHHHHHhccChhhhcCCEEEEechHHHH Confidence 777776555555333221 1 2334321 1 11111222222 233444444566666444457888877666 Q ss_pred HHHHhcCcceEEeecCCCCcccceecCeee----ecccccccCCceeccCccccccccccCCCCCCCCe--eEEEEeccC Q lcl|NC_019448. 227 FVNSILGRQMQLMQDNSGNVNTGYSVNGFY----SSRGFIKLHGSTVMENELILDESLQPLPNAPQPAK--VTATVETKQ 300 (463) Q Consensus 227 f~~~~~~~qrv~~~~n~g~~~~G~~v~~~~----s~~G~i~l~~s~~~~~d~~l~~~~~~~p~ap~p~~--vtat~~~~~ 300 (463) +...+-+..+-+....+. .-.|.+|---. -.-|+.+.. .+......+++... ..... .-++..- T Consensus 294 ~~~~~~d~~~~~~~~~~~-~llG~PV~~~~~~~~~~~GDf~~~--~~~~~~~~~~~~~~-----~~~~~~~~~~~~r~-- 363 (387) T protein:vir:93 294 IISVLSNGTTNFFDTPAE-KVFGKPVVFTDAAVKPIVGDFNYF--GINYDGTTYDTDKD-----VKKGEYLFVLTAWY-- 363 (387) T ss_pred HHHHHhcCCCcccccCCc-cccccceEEecCCCceeeeehhhh--heehhhheeeeccc-----ccCCceeEEEEeee-- Confidence 654444444444443333 23466542100 012222110 00010111111000 00011 1111111 Q ss_pred CCcCcccccccceEEEEEEEecCCccccc Q lcl|NC_019448. 301 KGAFEDEEDRAGLSYKVVVNSDDAQSAPS 329 (463) Q Consensus 301 ~g~~~~~~~~a~ysYkV~a~s~~geS~~S 329 (463) .|...+ ....++.-+-...-|.|| T Consensus 364 d~~v~~-----~eA~~~l~~k~~~~~~~~ 387 (387) T protein:vir:93 364 DQQRTL-----DSAFRIAKAKENTGSLPS 387 (387) T ss_pred Cceeec-----hhheEEEEeecCCCCCCC Confidence 111111 112222233333334444 No 117 >protein:vir:4159 Length: 315 # NCBI annotation: structural protein # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:87 # MgeName: psiM2 # Cross-refs: genbank:acc:NP_046968;genbank:gi:9630538;genbank:GeneID:1261712 Probab=78.80 E-value=0.11 Score=25.82 Aligned_cols=294 Identities=11% Similarity=0.125 Sum_probs=135.6 Q ss_pred CCCCCccchHHHHhhhhhhHHHHHHhhcCCccCCccccCccccchhhhhhHhhhhhccccccchhhhcccchhhHHHhhh Q lcl|NC_019448. 1 MTIEKNLSDVQQKYADQFQEDVVKSFQTGYGITPDTQIDAGALRREILDDQITMLTWTNEDLIFYRDISRRPAQSTVVKY 80 (463) Q Consensus 1 ~~~~~~~~~~~~~~~k~~~e~~~Ks~~agy~~~p~~q~~gaalr~esLd~~i~~L~~~~~df~f~~~i~k~~~~stv~ey 80 (463) |-++... ...++ .++.|++++ + -.+|+.|.-|.+++.|..+... -.|.+.+.-.+ +...+ T Consensus 1 ~~~~~~~------~~~~~-~~~~k~~t~-----~--d~~Gg~l~P~~~~~~i~~~~e~---s~~l~~~~vi~---~~~~~ 60 (315) T protein:vir:41 1 MLTIEDI------RGGKP-FEIVPKIDV-----P--DLGRGVLSVDRFGEFVKAVRDS---AVIIPEARIDN---ALKSY 60 (315) T ss_pred Ccccchh------hcCCh-hhhhhhcCC-----c--CCCCceechHHHHHHHHHHHhh---hhhhhhceeee---ccccc Confidence 3333221 12222 444677654 2 2368889988888877665543 23444433211 11112 Q ss_pred hhhhccCccc-----ccccccccCcccccCcceEEEEEEEEEeechhhhhhhhhhhccc--ccHHHHHHHHHHHHHHHHH Q lcl|NC_019448. 81 DQYLRHGNVG-----HSRFVKEIGVAPVSDPNIRQKTVSMKYVSDTKNMSIASGLVNNI--ADPSQILTEDAIAVVAKTI 153 (463) Q Consensus 81 ~~~~~hG~~g-----~~~fv~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lvn~~--~Dp~~~~~~~ai~~~~~~~ 153 (463) ....+..+.| ...-.+|.+.+..++|.+.+....++-+..--.+|.-. |.++. .|.+......-..+++... T Consensus 61 ~~~i~~~g~~~~~~~g~~~~~~~~~~~~~~~~f~~~~l~~~~l~~~~~it~el-L~D~~~~~~~e~~l~~~~a~~~a~~~ 139 (315) T protein:vir:41 61 EKDISRLSLVLDVGPGRDETGQKLAPPESTAEVKTNTLYMREMVTKVVIHEDA-IEDNIEGKAFEQKIVTLLGEGISYVL 139 (315) T ss_pred cccccccccCcccccccccccCcCCCCCCccccceeeeceeeeeeeccccHHH-HHhhhccccHHHHHHHHHHHHHHHHH Confidence 2222221111 11234566677788999999999888887654443322 23443 4789999999999999999 Q ss_pred HHHHhhcccccCCCccccccccccceeeecC---cceEeccCCCCCHHHHhhhhhhhhhcC---CceeEEecCHHHHHHH Q lcl|NC_019448. 154 EWASFYGDASLTSEVEGEGLEFDGLAKLIDK---NNVINAKGNQLTEKHLNEAAVRIGKGF---GTATDAYMPIGVHADF 227 (463) Q Consensus 154 E~a~fyGd~~l~~~~~~~gleFDGl~~lI~~---~nviDarG~~ls~~~ln~aa~~i~~~~---G~~td~~m~~~vka~f 227 (463) |.+.|.||..-.. +.--+.+|+.+.+.. ....+.....++.+.|..+.--+-..| +.---.+|+..+.+.+ T Consensus 140 ~~~~~nGdg~s~~---p~~~~~~G~l~~a~~~~~~~~~~~~a~~~~~d~l~~l~~sl~~~yr~~~~~~~~imn~~t~~~~ 216 (315) T protein:vir:41 140 EKYYLHGDTSSSD---PLLRMSDGWLKLASEKLTESDVDPEAEDWPMNLFDTMIESLPTPYRNNLPNMKFYVTWDIYRAY 216 (315) T ss_pred HHHhhccCCcCcC---ccccccccceecccccccccccccccccccHHHHHHHHHhcChHHhhcCCceEEEEcHHHHHHH Confidence 9999999986221 111267999987753 233555555666777766665555555 2222478999999888 Q ss_pred HHHhcCcceEEeecC----CCCcccceecCeeeecccccccC-CceeccCc-cccccccccCCCCCCCCeeEEEEeccCC Q lcl|NC_019448. 228 VNSILGRQMQLMQDN----SGNVNTGYSVNGFYSSRGFIKLH-GSTVMENE-LILDESLQPLPNAPQPAKVTATVETKQK 301 (463) Q Consensus 228 ~~~~~~~qrv~~~~n----~g~~~~G~~v~~~~s~~G~i~l~-~s~~~~~d-~~l~~~~~~~p~ap~p~~vtat~~~~~~ 301 (463) ....-.+.+-+-++. ....-.|++|-..-. -..+... +.+++-+- +....-. -.+........ T Consensus 217 rklk~~~g~~lw~~~~~~g~~~tl~G~PV~~~~~-m~~~~~~~~~ilf~d~~nl~~~~~---------~~i~i~~~~~a- 285 (315) T protein:vir:41 217 RDALKGRETGLGDQALTGANSILYDGRPVQYVPA-LEALNDGKSRALFVVPTQLVYGFW---------RNIKVVPDYDA- 285 (315) T ss_pred HHHhccCCCccccchhhcCCCceecccceEeccc-ccccCCCCccEEEecccceEEEec---------cccEEEeeecC- Confidence 654444433332221 111122333311000 0000000 00000000 0000000 00000000000 Q ss_pred CcCcccccccceEEEEEEEecC--CccccccceeeeecCCCCceEEEEEe Q lcl|NC_019448. 302 GAFEDEEDRAGLSYKVVVNSDD--AQSAPSEEVTATVSNVDDGVKLSISV 349 (463) Q Consensus 302 g~~~~~~~~a~ysYkV~a~s~~--geS~~S~~vt~Tva~~~~gv~ltIt~ 349 (463) ....+.|..+.--+. +.+-.. +-.+|++ T Consensus 286 -------~~~~~~~~~~~r~d~~~~~~~~~-------------a~~~~~v 315 (315) T protein:vir:41 286 -------EMRLTKYVASLRTDNHYEDEEGA-------------VSATITV 315 (315) T ss_pred -------CCCceEEEEEEEeceeEEeccce-------------eEeeeeC Confidence 001122222111111 111110 1111111 No 118 >protein:vir:101607 Length: 379 # NCBI annotation: major capsid protein precursor # Family: family:all:585 # MgeID: mge:1646 # MgeName: 11b # Cross-refs: genbank:acc:YP_112497;genbank:gi:53793597;uniprot:Q5ZGF6;genbank:GeneID:3101715 Probab=77.88 E-value=0.12 Score=25.63 Aligned_cols=287 Identities=14% Similarity=-0.001 Sum_probs=123.5 Q ss_pred CCCCCccchHHHHhhhhhhHH-HHHHhh---cCCccCCccccCccccchhhhhhHhhhhhccccccchhhhcccchhhHH Q lcl|NC_019448. 1 MTIEKNLSDVQQKYADQFQED-VVKSFQ---TGYGITPDTQIDAGALRREILDDQITMLTWTNEDLIFYRDISRRPAQST 76 (463) Q Consensus 1 ~~~~~~~~~~~~~~~k~~~e~-~~Ks~~---agy~~~p~~q~~gaalr~esLd~~i~~L~~~~~df~f~~~i~k~~~~st 76 (463) ...................+. ..++.. +|..+++... ++....+....-+..+ .+...+.+-+...+..+. T Consensus 75 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~ip~~~~~~ii~~~---~~~~~i~~~~~~~~~~~~ 149 (379) T protein:vir:10 75 EDKSDSLVKSITENFNDIKEVRNGKSIQVKAVGDMTLPVNL--TGAQPKDYNFDVVLNP---SQMLNVSDIVGAVSISGG 149 (379) T ss_pred cccchhHHHHHHHHHHhHHHHHhhhhhhhhhhcccccCCCC--ccccchhhhhHHHHhH---HhhhhHHhhceeeeccCC Confidence 111111111111111101000 011111 2222222221 2223333322222222 222234444444445444 Q ss_pred HhhhhhhhccCcccccccccccCcccccCcceEEEEEEEEEeechhhhhhhhhhhcccccHHHHHHHHHHHHHHHHHHHH Q lcl|NC_019448. 77 VVKYDQYLRHGNVGHSRFVKEIGVAPVSDPNIRQKTVSMKYVSDTKNMSIASGLVNNIADPSQILTEDAIAVVAKTIEWA 156 (463) Q Consensus 77 v~ey~~~~~hG~~g~~~fv~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lvn~~~Dp~~~~~~~ai~~~~~~~E~a 156 (463) --+|.+.+..++. ...++.|++..+.+++.+.+.....+=++.-..+|.-+ +.+. .+.+....+.-...+++.++.+ T Consensus 150 ~~~~~~~~~~~~~-~~~~v~Eg~~~~~~~~~f~~i~~~~~k~~~~~~iS~el-l~D~-~~l~~~i~~~la~~~~~~~~~~ 226 (379) T protein:vir:10 150 TYTFVRENGAGEG-AIGAQVEGATKGQKDYDISMIDVNTDFIAGFTRYSKKM-ANNL-PFLTSFIPNALRRDYAKAENAA 226 (379) T ss_pred ceEEEEeecCCCc-ccccccCCccccccccceeeeEeeeeeEEeeehhhHHH-HhhH-HHHHHHHHHHHHHHHHHHHHHH Confidence 4456665544332 34578999999999999999999999999888888765 4333 3466666676777888888888 Q ss_pred HhhcccccCCCccccccccccceeeecCcceEeccCCCCCHHHHhhhhhhhhhcCCceeEEecCHHHHHHHHHHhcCcce Q lcl|NC_019448. 157 SFYGDASLTSEVEGEGLEFDGLAKLIDKNNVINAKGNQLTEKHLNEAAVRIGKGFGTATDAYMPIGVHADFVNSILGRQM 236 (463) Q Consensus 157 ~fyGd~~l~~~~~~~gleFDGl~~lI~~~nviDarG~~ls~~~ln~aa~~i~~~~G~~td~~m~~~vka~f~~~~~~~qr 236 (463) ++-|+..-... +.... -...+.+.|..+...+...|-.++-+.|++.+.+.+...--..-| T Consensus 227 ~~~g~~~~~~~---------~~~~~----------~~~~~~d~i~~~~~~~~~~~~~~~~~vmn~~~~~~l~~lkd~~G~ 287 (379) T protein:vir:10 227 FNAVLAANATA---------STEII----------TNKNKVEMLINEIAKQENLDFPVTAIVLRPTDYYDILVTQKSVGA 287 (379) T ss_pred Hhccccccccc---------ccccc----------cCcccHHHHHHHHHhhhhccCCCCEEEEcHHHHHHHHHhhccCCc Confidence 88776442110 11111 112234567777777777887888899999988888753322223 Q ss_pred EEeecCCCCcccceecCeeeecccccccCCceeccCccccccccccCCCCCCCCeeEEEEeccCCCcCccccccc----- Q lcl|NC_019448. 237 QLMQDNSGNVNTGYSVNGFYSSRGFIKLHGSTVMENELILDESLQPLPNAPQPAKVTATVETKQKGAFEDEEDRA----- 311 (463) Q Consensus 237 v~~~~n~g~~~~G~~v~~~~s~~G~i~l~~s~~~~~d~~l~~~~~~~p~ap~p~~vtat~~~~~~g~~~~~~~~a----- 311 (463) .+.+++..... .+...|.|-.+..++ .+ |+ ++.-++|..... T Consensus 288 ~l~~~~~~~~~-----------~~~~~l~G~pvv~s~-~~------------~a---------g~~~~gdf~~~~~~~~~ 334 (379) T protein:vir:10 288 GYGLPGVVTQD-----------NGVLRINGIPLFRAT-WL------------AA---------NKYYVGDWTRVTKVTTE 334 (379) T ss_pred eeccCCccCCC-----------CCcceecceeeEecC-CC------------CC---------CceEEeecccEEEEEEe Confidence 33333221000 000111111111100 00 00 011111111100 Q ss_pred ceEEEEEEEecCCcc-cccc--------ceeeeecCCCCceEEEEEecCC Q lcl|NC_019448. 312 GLSYKVVVNSDDAQS-APSE--------EVTATVSNVDDGVKLSISVNAM 352 (463) Q Consensus 312 ~ysYkV~a~s~~geS-~~S~--------~vt~Tva~~~~gv~ltIt~~a~ 352 (463) ..+.. ++.+... .-.. -+.+-|.....-|++++| ++ T Consensus 335 ~~~i~---~~~~~~~~f~~~~~~~r~~~R~~~~v~~p~a~v~~~~~--~~ 379 (379) T protein:vir:10 335 GLSLE---FSEVEGTNFVKNNITARIEAQVALAVEQPAALIFGDFT--AV 379 (379) T ss_pred ceEEE---EeecccccccCCcEEEEEEEEeccEEecCccEEEEEec--CC Confidence 00000 0000000 0000 012222223233333333 44 No 119 >protein:vir:107423 Length: 681 # NCBI annotation: Bbp13 # Family: family:all:780 # MgeID: mge:1537 # MgeName: BPP-1 # Cross-refs: genbank:acc:NP_958682;genbank:gi:41179374;genbank:GeneID:2717217 Probab=77.69 E-value=0.12 Score=25.59 Aligned_cols=271 Identities=16% Similarity=0.110 Sum_probs=117.1 Q ss_pred hhhhhhhhccc----ccH--HHHHHHHHHHHHHHHHHHHH--hhcccccCCCccccccccccce-------eee----cC Q lcl|NC_019448. 124 MSIASGLVNNI----ADP--SQILTEDAIAVVAKTIEWAS--FYGDASLTSEVEGEGLEFDGLA-------KLI----DK 184 (463) Q Consensus 124 vs~~~~lvn~~----~Dp--~~~~~~~ai~~~~~~~E~a~--fyGd~~l~~~~~~~gleFDGl~-------~lI----~~ 184 (463) ++...-+++++ -+| ..+.-.+.-.+-++.+|.++ -+|-..--| |++|=|-. .|| +. T Consensus 1 m~~~~~~~~~f~~Ge~~p~l~~r~D~~~y~~~~~~~~N~~~~~~G~~~~R~-----g~~~~~~~~~~~~~~rlipf~~~~ 75 (681) T protein:vir:10 1 MSNVRVLQRSFGGGEISPEMFGRIDDVKYQSGLAICRNFVVKPQGPAENRA-----GFAFVREVKDSAKKVRLIPFTYSV 75 (681) T ss_pred CcceeEeeeecCCceeeeeeccchhHHHHHHHHHHhcCcEEEecCCceecC-----hhHhhhhcCCCCCcEEEEEEEeCC Confidence 22222224442 255 33444445555666666543 333333222 45554422 244 34 Q ss_pred cceEeccCCCCCHHHHhhhhhhhhhcC------CceeEEecCHHHHHHHHHHhcCcceEEeecCCCCcccceecCeeeec Q lcl|NC_019448. 185 NNVINAKGNQLTEKHLNEAAVRIGKGF------GTATDAYMPIGVHADFVNSILGRQMQLMQDNSGNVNTGYSVNGFYSS 258 (463) Q Consensus 185 ~nviDarG~~ls~~~ln~aa~~i~~~~------G~~td~~m~~~vka~f~~~~~~~qrv~~~~n~g~~~~G~~v~~~~s~ 258 (463) ++.+.+.=. ..- .++-++. |.+-++=-|+... ++ ..-+ +.|. .+-+.-+ T Consensus 76 ~~~~~l~~g---~~~-----~r~~~~~~~~~~~~~~~~~~tpy~~~-~l-----~~l~-~~q~----------aD~~~i~ 130 (681) T protein:vir:10 76 TQTMVIELG---AGY-----FRFHTNGGTLLDGAVPYEIANPYAEA-DL-----FNIH-YVQS----------ADVLTLV 130 (681) T ss_pred CceEEEEEe---CCe-----EEEEeCCcEEeeCcEeEEecCCCChh-hh-----cCce-EEEE----------cCEEEEE Confidence 444444111 000 1111112 2333333343332 22 2111 1111 1222222 Q ss_pred ccccccCCcee--ccCccccccccccCCCCCCCCeeEEEEeccCCCcCcccccccceEEEEEEEecCC--ccccccceee Q lcl|NC_019448. 259 RGFIKLHGSTV--MENELILDESLQPLPNAPQPAKVTATVETKQKGAFEDEEDRAGLSYKVVVNSDDA--QSAPSEEVTA 334 (463) Q Consensus 259 ~G~i~l~~s~~--~~~d~~l~~~~~~~p~ap~p~~vtat~~~~~~g~~~~~~~~a~ysYkV~a~s~~g--eS~~S~~vt~ 334 (463) ++++ .+..+ ...+.|-.+.....+....|..++++. ..++ ....++|.|+++...+ +|.++..+++ T Consensus 131 h~~~--~p~~L~r~~~~~W~l~~~~f~~~p~~p~~~~at~--~~~~------~~~t~~~~v~avda~t~~~s~~~~~~tv 200 (681) T protein:vir:10 131 HPNY--APRELRRLGATNWQLATIAFTSPVATPTSVTATS--NNKG------TDYTYRYVVTALDAEGKTESAPSSAGTC 200 (681) T ss_pred CCCC--cceEEEEccCCceEEEEEEeccccccceeeeeec--cCCc------cceeEeEEEEEeecccceeecCCcceEE Confidence 2222 11122 235667655544444433444555442 2222 2346889999887766 6888888888 Q ss_pred eecCCCCceEEEEEecCCCCCCcceEEEEeecCCCceEEEEEEeeeeeecCCceEEEEeccCCCCCCccceecCCchHHH Q lcl|NC_019448. 335 TVSNVDDGVKLSISVNAMYQQQPQFVSIYRQGKETGMYFLIKRVPVKDAQEDGTIVFVDKNETLPETADVFVGEMSPQVV 414 (463) Q Consensus 335 Tva~~~~gv~ltIt~~a~~g~~~~~y~IYR~~~~~g~~~li~rv~~s~~n~~gtttf~D~N~~iPgt~~~fvGe~~pqvi 414 (463) +....+.+...+++|.+..++. +|.|||+. +|.+..++. ...+.+.|.|.. |.+...+=.+++ T Consensus 201 t~~~~~~~~~~t~~w~a~~g~~--~~~V~~~~--~gi~g~ig~--------~~~~~~~~~~~~-~~~~~t~~~~~~---- 263 (681) T protein:vir:10 201 TNNLFTNGGANTIAWSASSGAS--RYNVYKEQ--GGLYGYIGQ--------TTGTSLVDDNIA-PDLSVTPPIYDA---- 263 (681) T ss_pred eeeeecCCcceeEEEEecCCce--eeeecccc--eeEEEEeec--------cceeeeeecccc-cCcccccccccc---- Confidence 8777777778899998887764 68999853 577776542 234566666654 333321111110 Q ss_pred HhhhhcchhhcCCcccCCcceeeeeeec-hhhee----cceeeEE-----EEEEe-EecC Q lcl|NC_019448. 415 HLFELLPMMKLPLAQINASITFAVLWYG-ALALR----APKKWAR-----IKNVR-YIAV 463 (463) Q Consensus 415 ~l~ellPm~k~pla~~na~~~~~V~~Yg-~L~l~----aPkk~~~-----ikNV~-~~~~ 463 (463) .......++-.|.+|- =|.+. .|+.... +.|-. -.++ T Consensus 264 ------------~~~~~~gyP~~v~f~q~RL~f~~~~~~p~~v~~Srsgdy~nF~~~~~~ 311 (681) T protein:vir:10 264 ------------VFNAAGDYPAAVSYFEQRRCFAGTTNKPQNIWMTRSGTESAMSYSLPV 311 (681) T ss_pred ------------ccccCCCceEEEEEEcceEEEeeCCCCCcEEEEEcccCcccccccCCC Confidence 0111112222222221 11111 1221100 11111 0122 No 120 >protein:vir:98487 Length: 681 # NCBI annotation: hypothetical protein predicted by GeneMark # Family: family:all:780 # MgeID: mge:1592 # MgeName: BMP-1 # Cross-refs: genbank:acc:NP_996575;genbank:gi:45569506;genbank:GeneID:2767815 Probab=77.69 E-value=0.12 Score=25.59 Aligned_cols=271 Identities=16% Similarity=0.110 Sum_probs=117.1 Q ss_pred hhhhhhhhccc----ccH--HHHHHHHHHHHHHHHHHHHH--hhcccccCCCccccccccccce-------eee----cC Q lcl|NC_019448. 124 MSIASGLVNNI----ADP--SQILTEDAIAVVAKTIEWAS--FYGDASLTSEVEGEGLEFDGLA-------KLI----DK 184 (463) Q Consensus 124 vs~~~~lvn~~----~Dp--~~~~~~~ai~~~~~~~E~a~--fyGd~~l~~~~~~~gleFDGl~-------~lI----~~ 184 (463) ++...-+++++ -+| ..+.-.+.-.+-++.+|.++ -+|-..--| |++|=|-. .|| +. T Consensus 1 m~~~~~~~~~f~~Ge~~p~l~~r~D~~~y~~~~~~~~N~~~~~~G~~~~R~-----g~~~~~~~~~~~~~~rlipf~~~~ 75 (681) T protein:vir:98 1 MSNVRVLQRSFGGGEISPEMFGRIDDVKYQSGLAICRNFVVKPQGPAENRA-----GFAFVREVKDSAKKVRLIPFTYSV 75 (681) T ss_pred CcceeEeeeecCCceeeeeeccchhHHHHHHHHHHhcCcEEEecCCceecC-----hhHhhhhcCCCCCcEEEEEEEeCC Confidence 22222224442 255 33444445555666666543 333333222 45554422 244 34 Q ss_pred cceEeccCCCCCHHHHhhhhhhhhhcC------CceeEEecCHHHHHHHHHHhcCcceEEeecCCCCcccceecCeeeec Q lcl|NC_019448. 185 NNVINAKGNQLTEKHLNEAAVRIGKGF------GTATDAYMPIGVHADFVNSILGRQMQLMQDNSGNVNTGYSVNGFYSS 258 (463) Q Consensus 185 ~nviDarG~~ls~~~ln~aa~~i~~~~------G~~td~~m~~~vka~f~~~~~~~qrv~~~~n~g~~~~G~~v~~~~s~ 258 (463) ++.+.+.=. ..- .++-++. |.+-++=-|+... ++ ..-+ +.|. .+-+.-+ T Consensus 76 ~~~~~l~~g---~~~-----~r~~~~~~~~~~~~~~~~~~tpy~~~-~l-----~~l~-~~q~----------aD~~~i~ 130 (681) T protein:vir:98 76 TQTMVIELG---AGY-----FRFHTNGGTLLDGAVPYEIANPYAEA-DL-----FNIH-YVQS----------ADVLTLV 130 (681) T ss_pred CceEEEEEe---CCe-----EEEEeCCcEEeeCcEeEEecCCCChh-hh-----cCce-EEEE----------cCEEEEE Confidence 444444111 000 1111112 2333333343332 22 2111 1111 1222222 Q ss_pred ccccccCCcee--ccCccccccccccCCCCCCCCeeEEEEeccCCCcCcccccccceEEEEEEEecCC--ccccccceee Q lcl|NC_019448. 259 RGFIKLHGSTV--MENELILDESLQPLPNAPQPAKVTATVETKQKGAFEDEEDRAGLSYKVVVNSDDA--QSAPSEEVTA 334 (463) Q Consensus 259 ~G~i~l~~s~~--~~~d~~l~~~~~~~p~ap~p~~vtat~~~~~~g~~~~~~~~a~ysYkV~a~s~~g--eS~~S~~vt~ 334 (463) ++++ .+..+ ...+.|-.+.....+....|..++++. ..++ ....++|.|+++...+ +|.++..+++ T Consensus 131 h~~~--~p~~L~r~~~~~W~l~~~~f~~~p~~p~~~~at~--~~~~------~~~t~~~~v~avda~t~~~s~~~~~~tv 200 (681) T protein:vir:98 131 HPNY--APRELRRLGATNWQLATIAFTSPVATPTSVTATS--NNKG------TDYTYRYVVTALDAEGKTESAPSSAGTC 200 (681) T ss_pred CCCC--cceEEEEccCCceEEEEEEeccccccceeeeeec--cCCc------cceeEeEEEEEeecccceeecCCcceEE Confidence 2222 11122 235667655544444433444555442 2222 2346889999887766 6888888888 Q ss_pred eecCCCCceEEEEEecCCCCCCcceEEEEeecCCCceEEEEEEeeeeeecCCceEEEEeccCCCCCCccceecCCchHHH Q lcl|NC_019448. 335 TVSNVDDGVKLSISVNAMYQQQPQFVSIYRQGKETGMYFLIKRVPVKDAQEDGTIVFVDKNETLPETADVFVGEMSPQVV 414 (463) Q Consensus 335 Tva~~~~gv~ltIt~~a~~g~~~~~y~IYR~~~~~g~~~li~rv~~s~~n~~gtttf~D~N~~iPgt~~~fvGe~~pqvi 414 (463) +....+.+...+++|.+..++. +|.|||+. +|.+..++. ...+.+.|.|.. |.+...+=.+++ T Consensus 201 t~~~~~~~~~~t~~w~a~~g~~--~~~V~~~~--~gi~g~ig~--------~~~~~~~~~~~~-~~~~~t~~~~~~---- 263 (681) T protein:vir:98 201 TNNLFTNGGANTIAWSASSGAS--RYNVYKEQ--GGLYGYIGQ--------TTGTSLVDDNIA-PDLSVTPPIYDA---- 263 (681) T ss_pred eeeeecCCcceeEEEEecCCce--eeeecccc--eeEEEEeec--------cceeeeeecccc-cCcccccccccc---- Confidence 8777777778899998887764 68999853 577776542 234566666654 333321111110 Q ss_pred HhhhhcchhhcCCcccCCcceeeeeeec-hhhee----cceeeEE-----EEEEe-EecC Q lcl|NC_019448. 415 HLFELLPMMKLPLAQINASITFAVLWYG-ALALR----APKKWAR-----IKNVR-YIAV 463 (463) Q Consensus 415 ~l~ellPm~k~pla~~na~~~~~V~~Yg-~L~l~----aPkk~~~-----ikNV~-~~~~ 463 (463) .......++-.|.+|- =|.+. .|+.... +.|-. -.++ T Consensus 264 ------------~~~~~~gyP~~v~f~q~RL~f~~~~~~p~~v~~Srsgdy~nF~~~~~~ 311 (681) T protein:vir:98 264 ------------VFNAAGDYPAAVSYFEQRRCFAGTTNKPQNIWMTRSGTESAMSYSLPV 311 (681) T ss_pred ------------ccccCCCceEEEEEEcceEEEeeCCCCCcEEEEEcccCcccccccCCC Confidence 0111112222222221 11111 1221100 11111 0122 No 121 >protein:vir:107802 Length: 681 # NCBI annotation: hypothetical protein predicted by GeneMark # Family: family:all:780 # MgeID: mge:1673 # MgeName: BIP-1 # Cross-refs: genbank:acc:NP_996623;genbank:gi:45580757;genbank:GeneID:2767878 Probab=77.69 E-value=0.12 Score=25.59 Aligned_cols=271 Identities=16% Similarity=0.110 Sum_probs=117.1 Q ss_pred hhhhhhhhccc----ccH--HHHHHHHHHHHHHHHHHHHH--hhcccccCCCccccccccccce-------eee----cC Q lcl|NC_019448. 124 MSIASGLVNNI----ADP--SQILTEDAIAVVAKTIEWAS--FYGDASLTSEVEGEGLEFDGLA-------KLI----DK 184 (463) Q Consensus 124 vs~~~~lvn~~----~Dp--~~~~~~~ai~~~~~~~E~a~--fyGd~~l~~~~~~~gleFDGl~-------~lI----~~ 184 (463) ++...-+++++ -+| ..+.-.+.-.+-++.+|.++ -+|-..--| |++|=|-. .|| +. T Consensus 1 m~~~~~~~~~f~~Ge~~p~l~~r~D~~~y~~~~~~~~N~~~~~~G~~~~R~-----g~~~~~~~~~~~~~~rlipf~~~~ 75 (681) T protein:vir:10 1 MSNVRVLQRSFGGGEISPEMFGRIDDVKYQSGLAICRNFVVKPQGPAENRA-----GFAFVREVKDSAKKVRLIPFTYSV 75 (681) T ss_pred CcceeEeeeecCCceeeeeeccchhHHHHHHHHHHhcCcEEEecCCceecC-----hhHhhhhcCCCCCcEEEEEEEeCC Confidence 22222224442 255 33444445555666666543 333333222 45554422 244 34 Q ss_pred cceEeccCCCCCHHHHhhhhhhhhhcC------CceeEEecCHHHHHHHHHHhcCcceEEeecCCCCcccceecCeeeec Q lcl|NC_019448. 185 NNVINAKGNQLTEKHLNEAAVRIGKGF------GTATDAYMPIGVHADFVNSILGRQMQLMQDNSGNVNTGYSVNGFYSS 258 (463) Q Consensus 185 ~nviDarG~~ls~~~ln~aa~~i~~~~------G~~td~~m~~~vka~f~~~~~~~qrv~~~~n~g~~~~G~~v~~~~s~ 258 (463) ++.+.+.=. ..- .++-++. |.+-++=-|+... ++ ..-+ +.|. .+-+.-+ T Consensus 76 ~~~~~l~~g---~~~-----~r~~~~~~~~~~~~~~~~~~tpy~~~-~l-----~~l~-~~q~----------aD~~~i~ 130 (681) T protein:vir:10 76 TQTMVIELG---AGY-----FRFHTNGGTLLDGAVPYEIANPYAEA-DL-----FNIH-YVQS----------ADVLTLV 130 (681) T ss_pred CceEEEEEe---CCe-----EEEEeCCcEEeeCcEeEEecCCCChh-hh-----cCce-EEEE----------cCEEEEE Confidence 444444111 000 1111112 2333333343332 22 2111 1111 1222222 Q ss_pred ccccccCCcee--ccCccccccccccCCCCCCCCeeEEEEeccCCCcCcccccccceEEEEEEEecCC--ccccccceee Q lcl|NC_019448. 259 RGFIKLHGSTV--MENELILDESLQPLPNAPQPAKVTATVETKQKGAFEDEEDRAGLSYKVVVNSDDA--QSAPSEEVTA 334 (463) Q Consensus 259 ~G~i~l~~s~~--~~~d~~l~~~~~~~p~ap~p~~vtat~~~~~~g~~~~~~~~a~ysYkV~a~s~~g--eS~~S~~vt~ 334 (463) ++++ .+..+ ...+.|-.+.....+....|..++++. ..++ ....++|.|+++...+ +|.++..+++ T Consensus 131 h~~~--~p~~L~r~~~~~W~l~~~~f~~~p~~p~~~~at~--~~~~------~~~t~~~~v~avda~t~~~s~~~~~~tv 200 (681) T protein:vir:10 131 HPNY--APRELRRLGATNWQLATIAFTSPVATPTSVTATS--NNKG------TDYTYRYVVTALDAEGKTESAPSSAGTC 200 (681) T ss_pred CCCC--cceEEEEccCCceEEEEEEeccccccceeeeeec--cCCc------cceeEeEEEEEeecccceeecCCcceEE Confidence 2222 11122 235667655544444433444555442 2222 2346889999887766 6888888888 Q ss_pred eecCCCCceEEEEEecCCCCCCcceEEEEeecCCCceEEEEEEeeeeeecCCceEEEEeccCCCCCCccceecCCchHHH Q lcl|NC_019448. 335 TVSNVDDGVKLSISVNAMYQQQPQFVSIYRQGKETGMYFLIKRVPVKDAQEDGTIVFVDKNETLPETADVFVGEMSPQVV 414 (463) Q Consensus 335 Tva~~~~gv~ltIt~~a~~g~~~~~y~IYR~~~~~g~~~li~rv~~s~~n~~gtttf~D~N~~iPgt~~~fvGe~~pqvi 414 (463) +....+.+...+++|.+..++. +|.|||+. +|.+..++. ...+.+.|.|.. |.+...+=.+++ T Consensus 201 t~~~~~~~~~~t~~w~a~~g~~--~~~V~~~~--~gi~g~ig~--------~~~~~~~~~~~~-~~~~~t~~~~~~---- 263 (681) T protein:vir:10 201 TNNLFTNGGANTIAWSASSGAS--RYNVYKEQ--GGLYGYIGQ--------TTGTSLVDDNIA-PDLSVTPPIYDA---- 263 (681) T ss_pred eeeeecCCcceeEEEEecCCce--eeeecccc--eeEEEEeec--------cceeeeeecccc-cCcccccccccc---- Confidence 8777777778899998887764 68999853 577776542 234566666654 333321111110 Q ss_pred HhhhhcchhhcCCcccCCcceeeeeeec-hhhee----cceeeEE-----EEEEe-EecC Q lcl|NC_019448. 415 HLFELLPMMKLPLAQINASITFAVLWYG-ALALR----APKKWAR-----IKNVR-YIAV 463 (463) Q Consensus 415 ~l~ellPm~k~pla~~na~~~~~V~~Yg-~L~l~----aPkk~~~-----ikNV~-~~~~ 463 (463) .......++-.|.+|- =|.+. .|+.... +.|-. -.++ T Consensus 264 ------------~~~~~~gyP~~v~f~q~RL~f~~~~~~p~~v~~Srsgdy~nF~~~~~~ 311 (681) T protein:vir:10 264 ------------VFNAAGDYPAAVSYFEQRRCFAGTTNKPQNIWMTRSGTESAMSYSLPV 311 (681) T ss_pred ------------ccccCCCceEEEEEEcceEEEeeCCCCCcEEEEEcccCcccccccCCC Confidence 0111112222222221 11111 1221100 11111 0122 No 122 >protein:vir:96978 Length: 387 # NCBI annotation: ORF009 # Family: family:all:658 # MgeID: mge:1643 # MgeName: 42e # Cross-refs: genbank:acc:YP_239859;genbank:gi:66395517;genbank:GeneID:5133011 Probab=77.28 E-value=0.13 Score=25.51 Aligned_cols=289 Identities=8% Similarity=0.074 Sum_probs=121.5 Q ss_pred CCCCCccchHHHHhhhhhhH---------------HHHHHhhcCCccCCccccCccccchhhhhhHhhhhhccccccchh Q lcl|NC_019448. 1 MTIEKNLSDVQQKYADQFQE---------------DVVKSFQTGYGITPDTQIDAGALRREILDDQITMLTWTNEDLIFY 65 (463) Q Consensus 1 ~~~~~~~~~~~~~~~k~~~e---------------~~~Ks~~agy~~~p~~q~~gaalr~esLd~~i~~L~~~~~df~f~ 65 (463) ...+.+......++++.+.. ...+++++| +..+||.|-=+.+..+|..+..... .++ T Consensus 78 ~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~~~~~a~~~~------~~~~gG~lIP~~~~~~Ii~~~~~~~--~l~ 149 (387) T protein:vir:96 78 YQSLSDNEKMVKAKAEFYRHAILPNEFEKPSMEAQRLLHALPTG------NDSGGDKLLPKTLSKEIVSEPFAKN--QLR 149 (387) T ss_pred CCCCchhHHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHhhhccC------CCCCCceeechhHHHHHHHHHHhhc--hhh Confidence 11111111111122211111 111223332 2334555545555666644443332 233 Q ss_pred hhcccchhhHHHhhhhhhhccCcccccccccccCcccccCcceEEEEEEEEEeechhhhhhhhhhhcccccHHHHHHHHH Q lcl|NC_019448. 66 RDISRRPAQSTVVKYDQYLRHGNVGHSRFVKEIGVAPVSDPNIRQKTVSMKYVSDTKNMSIASGLVNNIADPSQILTEDA 145 (463) Q Consensus 66 ~~i~k~~~~stv~ey~~~~~hG~~g~~~fv~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lvn~~~Dp~~~~~~~a 145 (463) +.+...++.+ + .+.+.. .+ .+...+++|++..+..++.+.+....++=++.-..+|.-+ |.++..|.+....+.- T Consensus 150 ~~~~~~~~~~-~-~~p~~~-~~-~~~a~~v~Eg~~~~~~~~~f~~v~l~~~k~~~~i~iS~el-l~ds~~~l~~~i~~~l 224 (387) T protein:vir:96 150 EKARLTNIKG-L-EIPRVS-YT-LDDDDFITDVETAKELKAKGDTVKFTTNKFKVFAAISDTV-IHGSDVDLVNWVENAL 224 (387) T ss_pred hhceeeecCC-c-eeeeee-cc-CCccccccccccccccccccceeeechheeeeechhhHHH-HhhhHHHHHHHHHHHH Confidence 4343344332 1 222222 11 1235689999999999999999988888887766666431 2334556666666666 Q ss_pred HHHHHHHHHHHHhhcccccCCCccccccccccceeeecCcceEeccCCCCCHHHHhhhhhhhhhcCCceeEEecCHHHHH Q lcl|NC_019448. 146 IAVVAKTIEWASFYGDASLTSEVEGEGLEFDGLAKLIDKNNVINAKGNQLTEKHLNEAAVRIGKGFGTATDAYMPIGVHA 225 (463) Q Consensus 146 i~~~~~~~E~a~fyGd~~l~~~~~~~gleFDGl~~lI~~~nviDarG~~ls~~~ln~aa~~i~~~~G~~td~~m~~~vka 225 (463) ...++...+..+| ++-.- .| +..|+.. .+ .+--..|..+ .+.|..+-..+...|-...-.+|...+.. T Consensus 225 a~~~~~~e~~~~~-~~g~g------~g-~~~g~~~--~~-~~~~~~~~~~-~d~i~~~~~~l~~~y~~na~~imn~~t~~ 292 (387) T protein:vir:96 225 QSGLAAKERKDAL-AVSPK------SG-LEHMSFY--NG-SVKEVEGADM-YDAIINALADLHEDYRDNATIYMRYADYV 292 (387) T ss_pred HHHHHHHHHHhHh-hcCCC------cc-ccceeee--cc-ccccccccch-HHHHHHHHhccChhhhcCCEEEEechHHH Confidence 6666665344444 33221 11 2233321 11 0111112222 33344444456666644445788887776 Q ss_pred HHHHHhcCcceEEeecCCCCcccceecCeeeecccccccCCceecc--CccccccccccC--CCCCCCCee--EEEEecc Q lcl|NC_019448. 226 DFVNSILGRQMQLMQDNSGNVNTGYSVNGFYSSRGFIKLHGSTVME--NELILDESLQPL--PNAPQPAKV--TATVETK 299 (463) Q Consensus 226 ~f~~~~~~~qrv~~~~n~g~~~~G~~v~~~~s~~G~i~l~~s~~~~--~d~~l~~~~~~~--p~ap~p~~v--tat~~~~ 299 (463) .+....-+..+-+....++ .-.|.+|- .+. +. +..++. ...+++...... .+-.....+ -+...- T Consensus 293 ~~~~~~~~~~~~~~~~~~~-~llG~PV~--~~~-~~----~~~~~GDf~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~- 363 (387) T protein:vir:96 293 KIISVLSNGTTNFFDTPAE-KVFGKPVV--FTD-AA----VKPIVGDFNYFGINYDGTTYDTDKDVKKGEYLFVLTAWY- 363 (387) T ss_pred HHHHHHhcCCCcccccCCc-cccccceE--Eec-CC----CceeeechhhhhhhhhhhhheecccccCCceEEEEEEEe- Confidence 6665554555555544333 33466542 111 00 011111 001111111000 000001111 111111 Q ss_pred CCCcCcccccccceEEEEEEEecCCccccc Q lcl|NC_019448. 300 QKGAFEDEEDRAGLSYKVVVNSDDAQSAPS 329 (463) Q Consensus 300 ~~g~~~~~~~~a~ysYkV~a~s~~geS~~S 329 (463) .|...+ .-..++.-+.-.+.|.|| T Consensus 364 -Dg~v~~-----~~A~~~l~~ka~~~~~~~ 387 (387) T protein:vir:96 364 -DQQRTL-----DSAFRIAKAKENTGPLPS 387 (387) T ss_pred -CcEeec-----hhheEEEEeecCCCCCCC Confidence 111111 123344455555566666 No 123 >protein:vir:94424 Length: 387 # NCBI annotation: ORF010 # Family: family:all:658 # MgeID: mge:1506 # MgeName: 47 # Cross-refs: genbank:acc:YP_240005;genbank:gi:66395666;genbank:GeneID:5133084 Probab=77.28 E-value=0.13 Score=25.51 Aligned_cols=289 Identities=8% Similarity=0.074 Sum_probs=121.5 Q ss_pred CCCCCccchHHHHhhhhhhH---------------HHHHHhhcCCccCCccccCccccchhhhhhHhhhhhccccccchh Q lcl|NC_019448. 1 MTIEKNLSDVQQKYADQFQE---------------DVVKSFQTGYGITPDTQIDAGALRREILDDQITMLTWTNEDLIFY 65 (463) Q Consensus 1 ~~~~~~~~~~~~~~~k~~~e---------------~~~Ks~~agy~~~p~~q~~gaalr~esLd~~i~~L~~~~~df~f~ 65 (463) ...+.+......++++.+.. ...+++++| +..+||.|-=+.+..+|..+..... .++ T Consensus 78 ~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~~~~~a~~~~------~~~~gG~lIP~~~~~~Ii~~~~~~~--~l~ 149 (387) T protein:vir:94 78 YQSLSDNEKMVKAKAEFYRHAILPNEFEKPSMEAQRLLHALPTG------NDSGGDKLLPKTLSKEIVSEPFAKN--QLR 149 (387) T ss_pred CCCCchhHHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHhhhccC------CCCCCceeechhHHHHHHHHHHhhc--hhh Confidence 11111111111122211111 111223332 2334555545555666644443332 233 Q ss_pred hhcccchhhHHHhhhhhhhccCcccccccccccCcccccCcceEEEEEEEEEeechhhhhhhhhhhcccccHHHHHHHHH Q lcl|NC_019448. 66 RDISRRPAQSTVVKYDQYLRHGNVGHSRFVKEIGVAPVSDPNIRQKTVSMKYVSDTKNMSIASGLVNNIADPSQILTEDA 145 (463) Q Consensus 66 ~~i~k~~~~stv~ey~~~~~hG~~g~~~fv~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lvn~~~Dp~~~~~~~a 145 (463) +.+...++.+ + .+.+.. .+ .+...+++|++..+..++.+.+....++=++.-..+|.-+ |.++..|.+....+.- T Consensus 150 ~~~~~~~~~~-~-~~p~~~-~~-~~~a~~v~Eg~~~~~~~~~f~~v~l~~~k~~~~i~iS~el-l~ds~~~l~~~i~~~l 224 (387) T protein:vir:94 150 EKARLTNIKG-L-EIPRVS-YT-LDDDDFITDVETAKELKAKGDTVKFTTNKFKVFAAISDTV-IHGSDVDLVNWVENAL 224 (387) T ss_pred hhceeeecCC-c-eeeeee-cc-CCccccccccccccccccccceeeechheeeeechhhHHH-HhhhHHHHHHHHHHHH Confidence 4343344332 1 222222 11 1235689999999999999999988888887766666431 2334556666666666 Q ss_pred HHHHHHHHHHHHhhcccccCCCccccccccccceeeecCcceEeccCCCCCHHHHhhhhhhhhhcCCceeEEecCHHHHH Q lcl|NC_019448. 146 IAVVAKTIEWASFYGDASLTSEVEGEGLEFDGLAKLIDKNNVINAKGNQLTEKHLNEAAVRIGKGFGTATDAYMPIGVHA 225 (463) Q Consensus 146 i~~~~~~~E~a~fyGd~~l~~~~~~~gleFDGl~~lI~~~nviDarG~~ls~~~ln~aa~~i~~~~G~~td~~m~~~vka 225 (463) ...++...+..+| ++-.- .| +..|+.. .+ .+--..|..+ .+.|..+-..+...|-...-.+|...+.. T Consensus 225 a~~~~~~e~~~~~-~~g~g------~g-~~~g~~~--~~-~~~~~~~~~~-~d~i~~~~~~l~~~y~~na~~imn~~t~~ 292 (387) T protein:vir:94 225 QSGLAAKERKDAL-AVSPK------SG-LEHMSFY--NG-SVKEVEGADM-YDAIINALADLHEDYRDNATIYMRYADYV 292 (387) T ss_pred HHHHHHHHHHhHh-hcCCC------cc-ccceeee--cc-ccccccccch-HHHHHHHHhccChhhhcCCEEEEechHHH Confidence 6666665344444 33221 11 2233321 11 0111112222 33344444456666644445788887776 Q ss_pred HHHHHhcCcceEEeecCCCCcccceecCeeeecccccccCCceecc--CccccccccccC--CCCCCCCee--EEEEecc Q lcl|NC_019448. 226 DFVNSILGRQMQLMQDNSGNVNTGYSVNGFYSSRGFIKLHGSTVME--NELILDESLQPL--PNAPQPAKV--TATVETK 299 (463) Q Consensus 226 ~f~~~~~~~qrv~~~~n~g~~~~G~~v~~~~s~~G~i~l~~s~~~~--~d~~l~~~~~~~--p~ap~p~~v--tat~~~~ 299 (463) .+....-+..+-+....++ .-.|.+|- .+. +. +..++. ...+++...... .+-.....+ -+...- T Consensus 293 ~~~~~~~~~~~~~~~~~~~-~llG~PV~--~~~-~~----~~~~~GDf~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~- 363 (387) T protein:vir:94 293 KIISVLSNGTTNFFDTPAE-KVFGKPVV--FTD-AA----VKPIVGDFNYFGINYDGTTYDTDKDVKKGEYLFVLTAWY- 363 (387) T ss_pred HHHHHHhcCCCcccccCCc-cccccceE--Eec-CC----CceeeechhhhhhhhhhhhheecccccCCceEEEEEEEe- Confidence 6665554555555544333 33466542 111 00 011111 001111111000 000001111 111111 Q ss_pred CCCcCcccccccceEEEEEEEecCCccccc Q lcl|NC_019448. 300 QKGAFEDEEDRAGLSYKVVVNSDDAQSAPS 329 (463) Q Consensus 300 ~~g~~~~~~~~a~ysYkV~a~s~~geS~~S 329 (463) .|...+ .-..++.-+.-.+.|.|| T Consensus 364 -Dg~v~~-----~~A~~~l~~ka~~~~~~~ 387 (387) T protein:vir:94 364 -DQQRTL-----DSAFRIAKAKENTGPLPS 387 (387) T ss_pred -CcEeec-----hhheEEEEeecCCCCCCC Confidence 111111 123344455555566666 No 124 >protein:vir:2685 Length: 387 # NCBI annotation: hypothetical protein # Family: family:all:658 # MgeID: mge:57 # MgeName: phiSLT # Cross-refs: genbank:acc:NP_075504;genbank:gi:12719433;genbank:GeneID:920169 Probab=77.28 E-value=0.13 Score=25.51 Aligned_cols=289 Identities=8% Similarity=0.074 Sum_probs=121.5 Q ss_pred CCCCCccchHHHHhhhhhhH---------------HHHHHhhcCCccCCccccCccccchhhhhhHhhhhhccccccchh Q lcl|NC_019448. 1 MTIEKNLSDVQQKYADQFQE---------------DVVKSFQTGYGITPDTQIDAGALRREILDDQITMLTWTNEDLIFY 65 (463) Q Consensus 1 ~~~~~~~~~~~~~~~k~~~e---------------~~~Ks~~agy~~~p~~q~~gaalr~esLd~~i~~L~~~~~df~f~ 65 (463) ...+.+......++++.+.. ...+++++| +..+||.|-=+.+..+|..+..... .++ T Consensus 78 ~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~~~~~a~~~~------~~~~gG~lIP~~~~~~Ii~~~~~~~--~l~ 149 (387) T protein:vir:26 78 YQSLSDNEKMVKAKAEFYRHAILPNEFEKPSMEAQRLLHALPTG------NDSGGDKLLPKTLSKEIVSEPFAKN--QLR 149 (387) T ss_pred CCCCchhHHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHhhhccC------CCCCCceeechhHHHHHHHHHHhhc--hhh Confidence 11111111111122211111 111223332 2334555545555666644443332 233 Q ss_pred hhcccchhhHHHhhhhhhhccCcccccccccccCcccccCcceEEEEEEEEEeechhhhhhhhhhhcccccHHHHHHHHH Q lcl|NC_019448. 66 RDISRRPAQSTVVKYDQYLRHGNVGHSRFVKEIGVAPVSDPNIRQKTVSMKYVSDTKNMSIASGLVNNIADPSQILTEDA 145 (463) Q Consensus 66 ~~i~k~~~~stv~ey~~~~~hG~~g~~~fv~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lvn~~~Dp~~~~~~~a 145 (463) +.+...++.+ + .+.+.. .+ .+...+++|++..+..++.+.+....++=++.-..+|.-+ |.++..|.+....+.- T Consensus 150 ~~~~~~~~~~-~-~~p~~~-~~-~~~a~~v~Eg~~~~~~~~~f~~v~l~~~k~~~~i~iS~el-l~ds~~~l~~~i~~~l 224 (387) T protein:vir:26 150 EKARLTNIKG-L-EIPRVS-YT-LDDDDFITDVETAKELKAKGDTVKFTTNKFKVFAAISDTV-IHGSDVDLVNWVENAL 224 (387) T ss_pred hhceeeecCC-c-eeeeee-cc-CCccccccccccccccccccceeeechheeeeechhhHHH-HhhhHHHHHHHHHHHH Confidence 4343344332 1 222222 11 1235689999999999999999988888887766666431 2334556666666666 Q ss_pred HHHHHHHHHHHHhhcccccCCCccccccccccceeeecCcceEeccCCCCCHHHHhhhhhhhhhcCCceeEEecCHHHHH Q lcl|NC_019448. 146 IAVVAKTIEWASFYGDASLTSEVEGEGLEFDGLAKLIDKNNVINAKGNQLTEKHLNEAAVRIGKGFGTATDAYMPIGVHA 225 (463) Q Consensus 146 i~~~~~~~E~a~fyGd~~l~~~~~~~gleFDGl~~lI~~~nviDarG~~ls~~~ln~aa~~i~~~~G~~td~~m~~~vka 225 (463) ...++...+..+| ++-.- .| +..|+.. .+ .+--..|..+ .+.|..+-..+...|-...-.+|...+.. T Consensus 225 a~~~~~~e~~~~~-~~g~g------~g-~~~g~~~--~~-~~~~~~~~~~-~d~i~~~~~~l~~~y~~na~~imn~~t~~ 292 (387) T protein:vir:26 225 QSGLAAKERKDAL-AVSPK------SG-LEHMSFY--NG-SVKEVEGADM-YDAIINALADLHEDYRDNATIYMRYADYV 292 (387) T ss_pred HHHHHHHHHHhHh-hcCCC------cc-ccceeee--cc-ccccccccch-HHHHHHHHhccChhhhcCCEEEEechHHH Confidence 6666665344444 33221 11 2233321 11 0111112222 33344444456666644445788887776 Q ss_pred HHHHHhcCcceEEeecCCCCcccceecCeeeecccccccCCceecc--CccccccccccC--CCCCCCCee--EEEEecc Q lcl|NC_019448. 226 DFVNSILGRQMQLMQDNSGNVNTGYSVNGFYSSRGFIKLHGSTVME--NELILDESLQPL--PNAPQPAKV--TATVETK 299 (463) Q Consensus 226 ~f~~~~~~~qrv~~~~n~g~~~~G~~v~~~~s~~G~i~l~~s~~~~--~d~~l~~~~~~~--p~ap~p~~v--tat~~~~ 299 (463) .+....-+..+-+....++ .-.|.+|- .+. +. +..++. ...+++...... .+-.....+ -+...- T Consensus 293 ~~~~~~~~~~~~~~~~~~~-~llG~PV~--~~~-~~----~~~~~GDf~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~- 363 (387) T protein:vir:26 293 KIISVLSNGTTNFFDTPAE-KVFGKPVV--FTD-AA----VKPIVGDFNYFGINYDGTTYDTDKDVKKGEYLFVLTAWY- 363 (387) T ss_pred HHHHHHhcCCCcccccCCc-cccccceE--Eec-CC----CceeeechhhhhhhhhhhhheecccccCCceEEEEEEEe- Confidence 6665554555555544333 33466542 111 00 011111 001111111000 000001111 111111 Q ss_pred CCCcCcccccccceEEEEEEEecCCccccc Q lcl|NC_019448. 300 QKGAFEDEEDRAGLSYKVVVNSDDAQSAPS 329 (463) Q Consensus 300 ~~g~~~~~~~~a~ysYkV~a~s~~geS~~S 329 (463) .|...+ .-..++.-+.-.+.|.|| T Consensus 364 -Dg~v~~-----~~A~~~l~~ka~~~~~~~ 387 (387) T protein:vir:26 364 -DQQRTL-----DSAFRIAKAKENTGPLPS 387 (387) T ss_pred -CcEeec-----hhheEEEEeecCCCCCCC Confidence 111111 123344455555566666 No 125 >protein:vir:5255 Length: 304 # NCBI annotation: hypothetical protein # Family: family:all:463 # MgeID: mge:117 # MgeName: Aaphi23 # Cross-refs: genbank:acc:NP_852760;genbank:gi:31544035;uniprot:Q7Y5U0;genbank:GeneID:2753552 Probab=74.74 E-value=0.068 Score=26.99 Aligned_cols=273 Identities=11% Similarity=0.031 Sum_probs=118.8 Q ss_pred cCccc-cchh--hhhhHhhhhhccccccchhhhccc---chhhHHHhhhhhhhccCcccccccc-cccCcccccCcceEE Q lcl|NC_019448. 38 IDAGA-LRRE--ILDDQITMLTWTNEDLIFYRDISR---RPAQSTVVKYDQYLRHGNVGHSRFV-KEIGVAPVSDPNIRQ 110 (463) Q Consensus 38 ~~gaa-lr~e--sLd~~i~~L~~~~~df~f~~~i~k---~~~~stv~ey~~~~~hG~~g~~~fv-~E~g~~~~~d~~~~r 110 (463) +.|.| |-+| .+|++|...- ..+++.-+.|+= -++.-+.-.|..+..+|+.-++ ++ ...++-+.-|.++.+ T Consensus 1 ~~~lafl~~qL~~id~~vye~~--~~~~~~~~lipv~t~~~~~~~~~~~~~~d~~G~a~~~-~i~~~a~dip~vd~~~~~ 77 (304) T protein:vir:52 1 MSLLAYVKNGLTAVSKDIAETK--YPEIVFPQFVYVDQQTAVGITEKLHYGADEHGSLDDG-LITVGTSTLDQVEVGFTP 77 (304) T ss_pred CchHHHHHHHHHHHhhhhhccc--cccchhhhhccccCCCCcccceEEEeeeeccCccccc-ccCCcCCccceeecccce Confidence 44444 3332 2333332111 123443334431 1222222334444455544322 33 355777889999999 Q ss_pred EEEEEEEeechhhhhhhhhh---hcccccHHHHHHHHHHHHHHHHHHHHHhhcccccCCCccccccccccceeeecCcce Q lcl|NC_019448. 111 KTVSMKYVSDTKNMSIASGL---VNNIADPSQILTEDAIAVVAKTIEWASFYGDASLTSEVEGEGLEFDGLAKLIDKNNV 187 (463) Q Consensus 111 ~~~~~k~l~~~~~vs~~~~l---vn~~~Dp~~~~~~~ai~~~~~~~E~a~fyGd~~l~~~~~~~gleFDGl~~lI~~~nv 187 (463) ++..+.-.+.++.+++. ++ +..-.+..+..-+.|.+.+-+.+-...||||++.. .+-||.+-=+-..+ T Consensus 78 ~~~~i~~~~~~~~y~~~-El~~a~~~g~~l~~~ka~aa~~a~~~~~n~v~~~Gd~~~~--------g~~GllN~p~v~~~ 148 (304) T protein:vir:52 78 TRSYIVPWAKSVTWTKP-ELEQGKLLGLALNTAKIMALNKNAQQTLQKVAFLGHAKDS--------RLTGLLNNKSVEVY 148 (304) T ss_pred eEEEEEEEeeeeeecHH-HHHHHHHhCCCcHHHHHHHHHHHHHhhhceEEEEeecccc--------ceEEEEeCCCccee Confidence 99999999999999743 33 22223667777788888999999999999987631 35577653221111 Q ss_pred ---EeccCCCC---C----HHHHhhhhhhhhhc--C-CceeEEecCHHHHHHHHHHhcC--c---ceEEeecCCCCcccc Q lcl|NC_019448. 188 ---INAKGNQL---T----EKHLNEAAVRIGKG--F-GTATDAYMPIGVHADFVNSILG--R---QMQLMQDNSGNVNTG 249 (463) Q Consensus 188 ---iDarG~~l---s----~~~ln~aa~~i~~~--~-G~~td~~m~~~vka~f~~~~~~--~---qrv~~~~n~g~~~~G 249 (463) -|.-|... | .++|+++-..|..+ | -.|+.+.||+.....+..-..+ . -.+|.++|......+ T Consensus 149 ~~~~~~a~~~w~~~T~~eI~~di~~~~~~i~~~s~~~~~p~tl~Lpp~~~~~l~~~~~~~~~~Tvl~~l~~n~~~~~g~~ 228 (304) T protein:vir:52 149 AIKGAAQNTKVQAMDFDKAVAFFKEIFLKGMEKTKRIEAPNTFAIDSLDLAHLALVQRANTDTTALEFLTKHLSAAAGRQ 228 (304) T ss_pred eecCCccCCccccCCHHHHHHHHHHHHHHHHhccCceecCceEEeCHHHHHHHhhccCCCCCchHHHHHHHhcccccCCc Confidence 12122111 2 23466677666533 3 5688999999988877421100 0 011222332211111 Q ss_pred eecCee---eecccccccCCceeccCccccccccccCCCCCCCCeeEEEEeccCCCcCcccccccceEEEEEEEecCCcc Q lcl|NC_019448. 250 YSVNGF---YSSRGFIKLHGSTVMENELILDESLQPLPNAPQPAKVTATVETKQKGAFEDEEDRAGLSYKVVVNSDDAQS 326 (463) Q Consensus 250 ~~v~~~---~s~~G~i~l~~s~~~~~d~~l~~~~~~~p~ap~p~~vtat~~~~~~g~~~~~~~~a~ysYkV~a~s~~geS 326 (463) +++..+ ....|.=.-+-.++.++|.=-.+-..|.|--.-|++ + .+. ..|+|=..+ T Consensus 229 l~I~~v~~~~~~~g~~g~~r~vvY~~d~~~~~~~vP~p~~~l~~q------~--~~~---------~~~~vp~~~----- 286 (304) T protein:vir:52 229 VAIKALPSNYGTRVTDGKTRAMVYVNSKEHVIFDVPMSPTVLDAQ------P--KGL---------LAFESGLRM----- 286 (304) T ss_pred ceEEEecccccccCCCCceEEEEEecChhheEEecCccccccchh------h--cCC---------ceEEeccee----- Confidence 222221 111221111112344433322222222221111111 0 010 011110000 Q ss_pred ccccceeeeecCCCCceEEEEEecCCCCCCcce Q lcl|NC_019448. 327 APSEEVTATVSNVDDGVKLSISVNAMYQQQPQF 359 (463) Q Consensus 327 ~~S~~vt~Tva~~~~gv~ltIt~~a~~g~~~~~ 359 (463) ...||.+.-+....|- +| T Consensus 287 ------------r~gGv~v~~P~a~~y~---D~ 304 (304) T protein:vir:52 287 ------------AFGGVTFMEPDSALYV---DY 304 (304) T ss_pred ------------eeeeEEEEccceeeee---cC Confidence 0011111111111100 01 No 126 >protein:vir:9643 Length: 377 # NCBI annotation: major coat protein # Family: family:all:635 # MgeID: mge:173 # MgeName: 315.1 # Cross-refs: genbank:acc:NP_795405;genbank:gi:28876178;genbank:GeneID:1257724 Probab=74.41 E-value=0.16 Score=24.97 Aligned_cols=293 Identities=11% Similarity=-0.013 Sum_probs=122.1 Q ss_pred CCCCCcc---chHHHHhhhhhhHH-------------------------HHHHhhcCCccCCccccCccc-cchhhhhhH Q lcl|NC_019448. 1 MTIEKNL---SDVQQKYADQFQED-------------------------VVKSFQTGYGITPDTQIDAGA-LRREILDDQ 51 (463) Q Consensus 1 ~~~~~~~---~~~~~~~~k~~~e~-------------------------~~Ks~~agy~~~p~~q~~gaa-lr~esLd~~ 51 (463) +..-.+. ....++..+.++++ -.|.|++. ++..+..+|+. +..|..++- T Consensus 22 ~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~lt~ee~~~~~~~--~~~~~~~~gg~lvP~~~~~~I 99 (377) T protein:vir:96 22 ISAGATPEEQEKLFEAAFTTMGDEILAKNEEEMERMFDLRDKNRELTAEEIKFFNDI--DKNVGGKDKFKLLPEETMVQV 99 (377) T ss_pred HhhcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccCHHHHHHHHHH--HhcCCCCCCceecCHHHHHHH Confidence 0000000 00000111111110 01111111 11112345555 444444444 Q ss_pred hhhhhccccccchhhhcccchhhHHHhhhhhhhccCcccccccccccCc-ccccCcceEEEEEEEEEeechhhhhhhhhh Q lcl|NC_019448. 52 ITMLTWTNEDLIFYRDISRRPAQSTVVKYDQYLRHGNVGHSRFVKEIGV-APVSDPNIRQKTVSMKYVSDTKNMSIASGL 130 (463) Q Consensus 52 i~~L~~~~~df~f~~~i~k~~~~stv~ey~~~~~hG~~g~~~fv~E~g~-~~~~d~~~~r~~~~~k~l~~~~~vs~~~~l 130 (463) +..|.. ...+++.+...++.+.+ ++. ...+.+...++.|.+. ++.+++.+.+.....+=|+.--.+|..+ | T Consensus 100 ~~~l~~---~s~i~~~~~v~~~~~~~-~i~---~~~~~~~a~wv~e~~~~~~~~~~~f~~i~l~~~kl~~~~~is~~l-l 171 (377) T protein:vir:96 100 FDDLVA---EHPLLKVINFKNTSLRL-KAL---TAETSGTAVWGDIFGEIKGQLKQAFKEQDFSQFKLTAFVVIPKDA-L 171 (377) T ss_pred HHHHHh---hhhhhhhceeEecCCce-EEE---EecCCcceeEeecccccccccCccceeEeeeeeeEEeechhhHHH-h Confidence 444432 33455555444444432 222 2334456678899876 5678999999999999998777777655 5 Q ss_pred hcccccHHHHHHHHHHHHHHHHHHHHHhhcccccCCCccccccccccceeeecCcc-----------eEec---cC--CC Q lcl|NC_019448. 131 VNNIADPSQILTEDAIAVVAKTIEWASFYGDASLTSEVEGEGLEFDGLAKLIDKNN-----------VINA---KG--NQ 194 (463) Q Consensus 131 vn~~~Dp~~~~~~~ai~~~~~~~E~a~fyGd~~l~~~~~~~gleFDGl~~lI~~~n-----------viDa---rG--~~ 194 (463) .++..|.+....+.-..++++.++.+++.||-+= +--||.+-+.... +++. -| .. T Consensus 172 ~ds~~~le~~i~~~l~~~~~~~~~~a~i~G~G~~---------~P~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 242 (377) T protein:vir:96 172 KFGPKWLKQFITEQLKEAIAVALELAIVKGNGLL---------QPVGLLKDLSQPTVDQSTGRDITTYKTDKEAIADLSD 242 (377) T ss_pred hcchhhHHHHHHHHHHHHHHHHHhhceEeccCCC---------cceeeeeccccccccccccccccceeecccccccccc Confidence 6678899999999999999999999999999642 3345655332111 1110 01 11 Q ss_pred CCHHHHh-hhhh---hhhhc-CCceeE------EecCHHHHHHHHHHhcCcceEEeecCCCCcc--cceecCeeeecccc Q lcl|NC_019448. 195 LTEKHLN-EAAV---RIGKG-FGTATD------AYMPIGVHADFVNSILGRQMQLMQDNSGNVN--TGYSVNGFYSSRGF 261 (463) Q Consensus 195 ls~~~ln-~aa~---~i~~~-~G~~td------~~m~~~vka~f~~~~~~~qrv~~~~n~g~~~--~G~~v~~~~s~~G~ 261 (463) ++.+.+- .... ..+.+ .|.+.. +.|++.+..+.. .++..++ ..|.+. +|+++.-+.+...+ T Consensus 243 ~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~a~~~mn~~t~~~~~-----~~~~~~~-~~G~~~~~l~~p~~v~~s~~~p 316 (377) T protein:vir:96 243 LDPDTAVELLVPVMKHLSVNDKKHPLKIAGQVKLLLNPEDRWTLE-----AKFTSRN-QFGEYVTVLPHGITILESLAVE 316 (377) T ss_pred CChhHHHHHHHHHHHhhccccccccccccCceEEEEchhhHHhcc-----ccccccC-CCCCceeccCCCceEEecCCCC Confidence 2223222 2221 12211 222222 558877765532 2332333 233332 22222222211110 Q ss_pred cccCCceecc--CccccccccccCCCCCCCCeeEEEEeccCCCcCcccccccceEEEEEEEecCC--ccccccceeeeec Q lcl|NC_019448. 262 IKLHGSTVME--NELILDESLQPLPNAPQPAKVTATVETKQKGAFEDEEDRAGLSYKVVVNSDDA--QSAPSEEVTATVS 337 (463) Q Consensus 262 i~l~~s~~~~--~d~~l~~~~~~~p~ap~p~~vtat~~~~~~g~~~~~~~~a~ysYkV~a~s~~g--eS~~S~~vt~Tva 337 (463) .+++++- ..-+|... ..+......... + ..|. --|++....+.. ..-+..+.+.++- T Consensus 317 ---~~~i~fgdf~~Y~i~~r----------~~~~i~~~~~~~---~-~~d~--~~f~~~~r~dG~~~d~~a~~vl~l~~~ 377 (377) T protein:vir:96 317 ---TGKAIAFVANRYDAFMA----------TASTIEEYDQTF---A-MEDL--QLYLTKNYFYGKAKDNHTAALLTLAGG 377 (377) T ss_pred ---cccEEEEEcCcEEEEEe----------cccEEEeehhhh---h-hcCC--eEEEEEEEEcCEEecCCcEEEEEEecC Confidence 0111111 00111000 000111100000 0 0011 112222222111 1112234444444 No 127 >protein:vir:1084 Length: 437 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:21 # MgeName: bIL309 # Cross-refs: genbank:acc:NP_076738;genbank:gi:13095848;genbank:GeneID:920418 Probab=73.95 E-value=0.16 Score=24.89 Aligned_cols=298 Identities=14% Similarity=0.036 Sum_probs=119.7 Q ss_pred CCCCCccchHHHHhhhh----hhHHHHHHhhcCCc--cCCccccCccccchhhhhhHhhhhhccccccchhhhcccchhh Q lcl|NC_019448. 1 MTIEKNLSDVQQKYADQ----FQEDVVKSFQTGYG--ITPDTQIDAGALRREILDDQITMLTWTNEDLIFYRDISRRPAQ 74 (463) Q Consensus 1 ~~~~~~~~~~~~~~~k~----~~e~~~Ks~~agy~--~~p~~q~~gaalr~esLd~~i~~L~~~~~df~f~~~i~k~~~~ 74 (463) ................. -...+.+.+..+-. ....+..+|+.|.-+.+...|..+.. .-.+...+...+.. T Consensus 119 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~g~lvp~~~~~~i~~~~~---~~~l~~~~~~~~~~ 195 (437) T protein:vir:10 119 KRDAGGLQDMKLKVGGEIADKKVTAFADYLKTGEVRDVTGIALKDGKVIIPETILTPEKEVHQ---FPRLGSLVRTESVT 195 (437) T ss_pred HHhHHHHhHHHHHHHHHHHHhhhhhhHHHHHhhhhhhhhhcccccccccchHHHHHHHHHhhh---hhhhhhcceeEeec Confidence 00000000000000000 00111122211110 11112334555554555555544321 22344444444444 Q ss_pred HHHhhhhhhhccCcccccccccccCccc-ccCcceEEEEEEEEEeechhhhhhhhhhhcccccHHHHHHHHHHHHHHHHH Q lcl|NC_019448. 75 STVVKYDQYLRHGNVGHSRFVKEIGVAP-VSDPNIRQKTVSMKYVSDTKNMSIASGLVNNIADPSQILTEDAIAVVAKTI 153 (463) Q Consensus 75 stv~ey~~~~~hG~~g~~~fv~E~g~~~-~~d~~~~r~~~~~k~l~~~~~vs~~~~lvn~~~Dp~~~~~~~ai~~~~~~~ 153 (463) +---+|......+ +...+++|++..+ .+++.+.+.+..++=++.-..+|.-+ +.++..|......+.-...+..++ T Consensus 196 ~~~~~~~~~~~~~--~~~~~~~e~~~~~e~~~~~~~~v~~~~~k~~~~~~is~el-l~ds~~~~~~~i~~~l~~~~~~~~ 272 (437) T protein:vir:10 196 TTTGKLPIFNNST--DLLTAHTEYGQTTKNATPVITPILWDLKTYTGGYVFSQEL-ISDSSYDWQAELQSRLIELRDNTD 272 (437) T ss_pred cCceeeEEeeccc--cccccccccccccccccccceeeeeehhheeeehhhhHHH-HhhhHHHHHHHHHHHHHHHHHHHH Confidence 4444455444332 3456788888765 68899999999888777766666543 345556778888888889999999 Q ss_pred HHHHhhcccccCCCccccccccccceeeecCcceEeccCCCCCHHHHhhhhhhhhhcCCceeEEecCHHHHHHHHHHhcC Q lcl|NC_019448. 154 EWASFYGDASLTSEVEGEGLEFDGLAKLIDKNNVINAKGNQLTEKHLNEAAVRIGKGFGTATDAYMPIGVHADFVNSILG 233 (463) Q Consensus 154 E~a~fyGd~~l~~~~~~~gleFDGl~~lI~~~nviDarG~~ls~~~ln~aa~~i~~~~G~~td~~m~~~vka~f~~~~~~ 233 (463) +.+++.|+.+-.+...+. ...|.+..+ ++ ..+-.+|....-.+||+.+.+.|...-=. T Consensus 273 ~~~i~~g~g~~~~~~~~~-~~~~~~~~~------------------~~---~~l~~~~~~~~~~~~~~~~~~~l~~lkd~ 330 (437) T protein:vir:10 273 DSLIITALTDGIKKTTST-YLLGDLKKV------------------LN---VTLKPQDSAAASIVMSQSAYNLFDMATDA 330 (437) T ss_pred HHHHhhhhcccccccccc-cchhhHHHH------------------HH---hhhhhhhhcCCEEEEcHHHHHHHHHhhcc Confidence 999999987644322111 112222111 11 12223443333478999998888643211 Q ss_pred cceEEeecCCCCcccceecCeeeecccccccCCceeccCccccccccccCCCCCCCCeeEEEEeccCCCcCcccccccce Q lcl|NC_019448. 234 RQMQLMQDNSGNVNTGYSVNGFYSSRGFIKLHGSTVMENELILDESLQPLPNAPQPAKVTATVETKQKGAFEDEEDRAGL 313 (463) Q Consensus 234 ~qrv~~~~n~g~~~~G~~v~~~~s~~G~i~l~~s~~~~~d~~l~~~~~~~p~ap~p~~vtat~~~~~~g~~~~~~~~a~y 313 (463) .-|.+.+++.++.. +.+++..+..+..+. ..|+. ++. ...--|+|.. T Consensus 331 ~g~~~~~~~~~~~~------------------~~~l~G~pv~~~~~~-~~~~~------~~~---~~~~~~gd~~----- 377 (437) T protein:vir:10 331 MGRPLLQPNVTAAT------------------GYTLLGKTVVIVDDK-LFPSA------SAG---DVNIVVAPLK----- 377 (437) T ss_pred CCCeeeccCccCCC------------------CcccccceeEEeccc-ccCCc------CCC---ceEEEEeecc----- Confidence 11333333222110 112222222221110 11110 000 0000111111 Q ss_pred EEEEEEEecCCccccc--------------cceeeeecCCCCceEEEEEecCCCCCCcceE Q lcl|NC_019448. 314 SYKVVVNSDDAQSAPS--------------EEVTATVSNVDDGVKLSISVNAMYQQQPQFV 360 (463) Q Consensus 314 sYkV~a~s~~geS~~S--------------~~vt~Tva~~~~gv~ltIt~~a~~g~~~~~y 360 (463) +| ++.+.+.+-+.-+ .-..+.+...++-+.|+.++++.....|.-+ T Consensus 378 ~~-~~~~~r~~~~~~~~~~~~~~~~~~~~~~r~d~~~~~~~a~~~l~~~~~~~~~~~~~~~ 437 (437) T protein:vir:10 378 KA-VINFKLTEITGQFQDTYDIWYKQLGIFLRQNVVQASKDLIVNLTGKLKAVTVVQSTAV 437 (437) T ss_pred cc-EEEEeeeceEEEEecccccccceeeEEEEEccEEecccceEEEEeeccccccCCCCCC Confidence 00 1111111110000 0011122222233333333322222222111 No 128 >protein:vir:4092 Length: 390 # NCBI annotation: major capsid protein a # Family: family:all:635 # MgeID: mge:86 # MgeName: 2389 # Cross-refs: genbank:acc:NP_510986;swissprot:trembl:q8w604;genbank:gi:17488508;uniprot:Q8W604;genbank:GeneID:1260361 Probab=68.85 E-value=0.23 Score=24.07 Aligned_cols=312 Identities=11% Similarity=-0.019 Sum_probs=125.4 Q ss_pred CCCCCcc--chHH------HHhhhhhhHHHHHHhhcCCccCCccccCccccchhhhhhHhhhhhccccccchhhhcccch Q lcl|NC_019448. 1 MTIEKNL--SDVQ------QKYADQFQEDVVKSFQTGYGITPDTQIDAGALRREILDDQITMLTWTNEDLIFYRDISRRP 72 (463) Q Consensus 1 ~~~~~~~--~~~~------~~~~k~~~e~~~Ks~~agy~~~p~~q~~gaalr~esLd~~i~~L~~~~~df~f~~~i~k~~ 72 (463) ...+... .... ....+.+.++..|.+++- ....+-.+|+.|--+.+..+|..+.. +...+++.+...+ T Consensus 47 ~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~r~~~~~~--~~~~~~~~gg~lvP~~~~~~I~~~~~--~~s~i~~~~~~~~ 122 (390) T protein:vir:40 47 IAQARKEVNREMNDNNVLASRGANALTSDESKYYNEV--IAGNGFAGVTALLPPTVFERVFEDLT--VEHPLLSKINFVN 122 (390) T ss_pred HHHHHHHHHHHHHHHHHHHhcCchhccHHHHHHHHHH--HhccCcccCcccccHHHHHHHHHHHH--hhhhhhhhceeee Confidence 0000000 0000 000001111222322211 01112234555544444444433322 2223455555555 Q ss_pred hhHHHhhhhhhhccCcccccccccccCcc-cccCcceEEEEEEEEEeechhhhhhhhhhhcccccHHHHHHHHHHHHHHH Q lcl|NC_019448. 73 AQSTVVKYDQYLRHGNVGHSRFVKEIGVA-PVSDPNIRQKTVSMKYVSDTKNMSIASGLVNNIADPSQILTEDAIAVVAK 151 (463) Q Consensus 73 ~~stv~ey~~~~~hG~~g~~~fv~E~g~~-~~~d~~~~r~~~~~k~l~~~~~vs~~~~lvn~~~Dp~~~~~~~ai~~~~~ 151 (463) +.+.-..+. ...+.+...++.|++.. +..++.+.+.....+=++.-..+|.-+ +.++..|.+....+.-...++. T Consensus 123 ~~~~~~~i~---~~~~~~~a~~~~E~~~~~~~~~~~f~~i~l~~~k~~~~i~iS~el-l~ds~~~l~~~i~~~la~~i~~ 198 (390) T protein:vir:40 123 TTATTEWII---SVGDVATAWWGPLCAEIKEVLDNGFDKIQTGMYKLSAYIPVCNAM-LDLGPSWLDQYVRTILGEAMAL 198 (390) T ss_pred cCCceeEEE---EEcCCcceeeeccccccCccccccceeeEeeeeeEEEeehhhHHH-HhcchHHHHHHHHHHHHHHHHH Confidence 544322233 33344567788897764 578999999999999888776776433 2345568899999999999999 Q ss_pred HHHHHHhhcccccCCCccccccccccceeeecCc---ceEeccCCCCCHHHHhhhhhhhhhc--------CCceeEEecC Q lcl|NC_019448. 152 TIEWASFYGDASLTSEVEGEGLEFDGLAKLIDKN---NVINAKGNQLTEKHLNEAAVRIGKG--------FGTATDAYMP 220 (463) Q Consensus 152 ~~E~a~fyGd~~l~~~~~~~gleFDGl~~lI~~~---nviDarG~~ls~~~ln~aa~~i~~~--------~G~~td~~m~ 220 (463) .++.++++|+-.= +-.|+.+..... .........++...+..+...+... ++.+ -++|+ T Consensus 199 ~~~~a~l~G~G~~---------~P~Gil~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~l~~~~~~~~~~~~~~a-~~i~n 268 (390) T protein:vir:40 199 GLEAGIVNGSGKD---------QPIGMMRDLNNVTAGEHPVKTATPLTDLTPATLATKVMLPLTDNGKKSVSDA-ILVIN 268 (390) T ss_pred HHHhhhhcccCCC---------ccceeeeccccccccccccccccccchhhHHHHHHHHHHHhhcchhhhhcCc-eEEEc Confidence 9999999998531 234555533210 1111112233322222222222222 2222 14677 Q ss_pred HHHHHHHHH---HhcCcceEEeecCCCCcccceec--Ceeee----cccccccCCceec-cCccccccccccCCCCCCCC Q lcl|NC_019448. 221 IGVHADFVN---SILGRQMQLMQDNSGNVNTGYSV--NGFYS----SRGFIKLHGSTVM-ENELILDESLQPLPNAPQPA 290 (463) Q Consensus 221 ~~vka~f~~---~~~~~qrv~~~~n~g~~~~G~~v--~~~~s----~~G~i~l~~s~~~-~~d~~l~~~~~~~p~ap~p~ 290 (463) +.+...+-. .+.+..-..+.+ ....|.+| +..+. .-|+.+. -.+. ..+.-++.+. . ..+.-. T Consensus 269 ~~t~~~~l~~~~~~~d~~G~~v~~---~~~~g~pvv~~~~~p~~~i~~Gd~s~--~~i~~~~~~~v~~~~--~-~~f~~~ 340 (390) T protein:vir:40 269 PADYWSKIYAATSYMTPQGVWVTG---ILPVPLEIVQSVAVPVGKAVAGRAKD--YFMGIGSEQVIRTST--E-YRLLDD 340 (390) T ss_pred chhHHHHHHHHhhccCCCCccccc---cCCCceeEEEcCCCCCCcEEEEeece--EEEEeecceEEEecc--h-hhhhcC Confidence 766543221 111111000000 01122221 00000 0111110 0111 1111111110 0 001111 Q ss_pred --eeEEEEeccCCCcCcccccccceEEEEEEEecCCccccccceeeeecCCCCce Q lcl|NC_019448. 291 --KVTATVETKQKGAFEDEEDRAGLSYKVVVNSDDAQSAPSEEVTATVSNVDDGV 343 (463) Q Consensus 291 --~vtat~~~~~~g~~~~~~~~a~ysYkV~a~s~~geS~~S~~vt~Tva~~~~gv 343 (463) ..-++..-+++ ..+ ..+-...++.+.+... .+|-.+++.+.+..+.+- T Consensus 341 ~~~~r~~~r~dg~--v~~--~~A~~~l~~~~~~~~~-~~~~~~~~~~~~~~~~~~ 390 (390) T protein:vir:40 341 ETLYYAKQYANGR--PKD--NSSFLVFDITGLEGSP-AIDVNVVNNATPSETPAE 390 (390) T ss_pred cEEEEEEEEeCCE--Eec--ccceEEEEeeccCCCC-CCCcceeeCCCCCCCCCC Confidence 12233222222 111 2233445555554222 223334444444444443 No 129 >protein:vir:98635 Length: 377 # NCBI annotation: major coat protein # Family: family:all:635 # MgeID: mge:1601 # MgeName: phi3396 # Cross-refs: genbank:acc:YP_001039923;genbank:gi:126011098;genbank:GeneID:4818471 Probab=67.20 E-value=0.26 Score=23.83 Aligned_cols=306 Identities=13% Similarity=-0.005 Sum_probs=122.0 Q ss_pred CCCCCccchHHHHhhhhhhHHHHHHhhcCCccCCccccCcc-ccchhhhhhHhhhhhccccccchhhhcccchhhHHHhh Q lcl|NC_019448. 1 MTIEKNLSDVQQKYADQFQEDVVKSFQTGYGITPDTQIDAG-ALRREILDDQITMLTWTNEDLIFYRDISRRPAQSTVVK 79 (463) Q Consensus 1 ~~~~~~~~~~~~~~~k~~~e~~~Ks~~agy~~~p~~q~~ga-alr~esLd~~i~~L~~~~~df~f~~~i~k~~~~stv~e 79 (463) -+.....+..+. +.+.+.+++| +..+|+ .+..|..++-+..|... ..+++.+...++.+.+ + T Consensus 61 ~~~~~~lt~ee~-------~~~~~~~~~~------~~~~gg~~vP~~~~~~I~~~l~~~---s~i~~~~~v~~~~~~~-~ 123 (377) T protein:vir:98 61 RDKNRELTAEEI-------KFFNDIDKNV------GGKDKFKLLPEETMVQVFDDLVAE---HPLLKVINFKNTSLRL-K 123 (377) T ss_pred ccCCcccCHHHH-------HHHHHHHhcc------CCCCCccccCHHHHHHHHHHHHHh---hhhhhheeeEecCcce-E Confidence 001111111111 1111222221 223444 45555555555544332 3444444444444332 3 Q ss_pred hhhhhccCcccccccccccCc-ccccCcceEEEEEEEEEeechhhhhhhhhhhcccccHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_019448. 80 YDQYLRHGNVGHSRFVKEIGV-APVSDPNIRQKTVSMKYVSDTKNMSIASGLVNNIADPSQILTEDAIAVVAKTIEWASF 158 (463) Q Consensus 80 y~~~~~hG~~g~~~fv~E~g~-~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lvn~~~Dp~~~~~~~ai~~~~~~~E~a~f 158 (463) +.+ ..+.+.+.+++|.+. .+..+|.+.+.....+=|+.--.+|.-+ |.++..|.+....+.-..++++.++.+++ T Consensus 124 ~~~---~~~~~~a~w~~e~~~~~~~~~~~f~~i~l~~~kl~a~~~is~el-L~ds~~~ie~~i~~~la~~~a~~~~~a~i 199 (377) T protein:vir:98 124 ALT---AETSGTAVWGDIFGEIKGQLKQAFKEQDFSQFKLTAFVVIPKDA-LKFGPKWIKQFITEQLKEAIAVALELAIV 199 (377) T ss_pred EEE---ecCCcceeEeecccccCcccCccceeEeecceeEEeeecccHHh-hhccHhHHHHHHHHHHHHHHHHHHhhceE Confidence 333 234445667889876 4578999999999998888766666554 56677899999999999999999999999 Q ss_pred hcccccCCCccccccccccceeeecC-----cceEeccCCCCCHHHHhhhhhhhhhcCCceeEEecCHHHHHHHHHHhcC Q lcl|NC_019448. 159 YGDASLTSEVEGEGLEFDGLAKLIDK-----NNVINAKGNQLTEKHLNEAAVRIGKGFGTATDAYMPIGVHADFVNSILG 233 (463) Q Consensus 159 yGd~~l~~~~~~~gleFDGl~~lI~~-----~nviDarG~~ls~~~ln~aa~~i~~~~G~~td~~m~~~vka~f~~~~~~ 233 (463) .||-+- |--||.+-+.. ....++.+.....+.|-.+.-.....|..--...|...+.+.....--. T Consensus 200 ~G~G~~---------qP~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~a~~~m~~~t~~~~~klkd~ 270 (377) T protein:vir:98 200 KGDGLL---------QPVGLLKDLSQPTVDQSTGRDITTYKTDKEAIADLSDLTPDNAPKKLVPVMKHLSVNDKKRPLKI 270 (377) T ss_pred eccCCC---------cceeeeecccccccccccccccccccchhhhHhhhhhhchhHHHHHHHHHHHHHHHHHHhhhhcc Confidence 999642 34566554321 2223333333332333222222222221111123333443333322112 Q ss_pred cceEEeecCCCCcccceecCeeeecccccccCCceeccCccccccccccCCCCCCCCeeEEEEeccCCCcCcccccccce Q lcl|NC_019448. 234 RQMQLMQDNSGNVNTGYSVNGFYSSRGFIKLHGSTVMENELILDESLQPLPNAPQPAKVTATVETKQKGAFEDEEDRAGL 313 (463) Q Consensus 234 ~qrv~~~~n~g~~~~G~~v~~~~s~~G~i~l~~s~~~~~d~~l~~~~~~~p~ap~p~~vtat~~~~~~g~~~~~~~~a~y 313 (463) .-+.+...|+...--..+......+.|. ..++|..+..+.+. ...| .++-.|+|.+ + T Consensus 271 ~G~~i~~~n~~~~~~~~p~~~~~~~~G~----~~t~lg~p~~vv~s------~~~p---------~~~i~fgdf~----~ 327 (377) T protein:vir:98 271 AGQVKLILNPEDRWALEAQFTSRNQFGE----YVTVLPHGITILES------LAVE---------TGKAIAFVAN----R 327 (377) T ss_pred CCceEEEecccchhhccccccccCCCCc----cccccCCCceEEec------CCCC---------cccEEEEEec----c Confidence 2223332222211000000000111111 01122111111010 0111 1122233321 1 Q ss_pred EEEEEEEecCCccc-cccceeeeecCCCCceEEEEEecCCCCCCcceEEEEeec-C--CCceEEEEEEeeeeeecCCc Q lcl|NC_019448. 314 SYKVVVNSDDAQSA-PSEEVTATVSNVDDGVKLSISVNAMYQQQPQFVSIYRQG-K--ETGMYFLIKRVPVKDAQEDG 387 (463) Q Consensus 314 sYkV~a~s~~geS~-~S~~vt~Tva~~~~gv~ltIt~~a~~g~~~~~y~IYR~~-~--~~g~~~li~rv~~s~~n~~g 387 (463) |.+.. +.+-+. .|... .+..+.+ -|+.+.|=. + ....|. +-.| . +| T Consensus 328 -Y~i~~--r~~~~i~~~~~~-----------------~~~~d~~-~f~~~~r~dg~~~~~~a~~-vl~i-----~-~~ 377 (377) T protein:vir:98 328 -YDAFM--ATASTIEEYDQT-----------------FAMEDLQ-LYLTKNYFYGKAKDNHTAA-LLTL-----A-GG 377 (377) T ss_pred -eeEEe--ecceEEEeechh-----------------hhhcCce-EEEEEEEEcCEEeccCcEE-EEEE-----e-cC Confidence 33322 222111 11111 0111111 122222211 0 001111 1111 1 12 No 130 >protein:vir:104342 Length: 314 # NCBI annotation: hypothetical protein # Family: family:all:463 # MgeID: mge:1593 # MgeName: RTP # Cross-refs: genbank:acc:YP_398971;genbank:gi:81343955;genbank:GeneID:3778874 Probab=66.47 E-value=0.27 Score=23.73 Aligned_cols=280 Identities=11% Similarity=0.074 Sum_probs=118.3 Q ss_pred CCCCCccchHHHHhhhhhhHHHHHHhhcCCccCCccccCccc-cch--hhhhhHhhhhhccccccchhhhcccchhhHHH Q lcl|NC_019448. 1 MTIEKNLSDVQQKYADQFQEDVVKSFQTGYGITPDTQIDAGA-LRR--EILDDQITMLTWTNEDLIFYRDISRRPAQSTV 77 (463) Q Consensus 1 ~~~~~~~~~~~~~~~k~~~e~~~Ks~~agy~~~p~~q~~gaa-lr~--esLd~~i~~L~~~~~df~f~~~i~k~~~~stv 77 (463) |.| |+.. -+..++..++....+ +...+++ |-+ |-+|+++....+ .+++.-+.|+ +.+.+ T Consensus 1 ~~~--~~~~----~~~~~~~~~~~~~~~-------~~d~~~~fl~~ql~~id~~v~e~~~--~~~~~~~~i~---v~~~~ 62 (314) T protein:vir:10 1 MAI--KFDA----EQAKITTHLEQMGVE-------KADAAGIWAVSQLTAALNRAYEKEY--AENSVVNIFP---VTNEI 62 (314) T ss_pred Ccc--chHH----HHHHHHHHHHhhccc-------chhhhHHHHHHHHHHHHHHHhhhhc--cccccceeec---cccCC Confidence 322 2221 111112222222211 2223444 333 456666643222 2233333332 22222 Q ss_pred hhhhh---hhccCcccccccccc-cCcccccCcceEEEEEEEEEeechhhhhhhhh--hhcccccHHHHHHHHHHHHHHH Q lcl|NC_019448. 78 VKYDQ---YLRHGNVGHSRFVKE-IGVAPVSDPNIRQKTVSMKYVSDTKNMSIASG--LVNNIADPSQILTEDAIAVVAK 151 (463) Q Consensus 78 ~ey~~---~~~hG~~g~~~fv~E-~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~--lvn~~~Dp~~~~~~~ai~~~~~ 151 (463) .+|.+ +......|....++. .+..+..|.++.|++..+...+....++..-= ....-.+..+.....|.+.+.+ T Consensus 63 ~~~~et~~~~~~e~~G~a~~~~d~~~dip~vd~~~~~~~~~i~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aA~~~~~~ 142 (314) T protein:vir:10 63 PGHAKYFEYPEFDGVGIAQIIADYSDDLPLVDAFMTEKQGKVFRFGNAFLISTDEIKAGAATGQSLSARKQALAFEAHDN 142 (314) T ss_pred CCceeEEEeeeeccccceeeeCCcccccceeecccceeEEEEEEEEeeEEecHHHHHHHHHhCCChHHHHHHHHHHHHHH Confidence 22221 112234455555554 44467889999999999999999999975322 2222347778888888899999 Q ss_pred HHHHHHhhcccccCCCccccccccccceeeecCcceEeccCCCCC----HHHHhhhhhhhh---hcCCceeEEecCHHHH Q lcl|NC_019448. 152 TIEWASFYGDASLTSEVEGEGLEFDGLAKLIDKNNVINAKGNQLT----EKHLNEAAVRIG---KGFGTATDAYMPIGVH 224 (463) Q Consensus 152 ~~E~a~fyGd~~l~~~~~~~gleFDGl~~lI~~~nviDarG~~ls----~~~ln~aa~~i~---~~~G~~td~~m~~~vk 224 (463) ..-..+||||+.+ .+-||.+-=+- ...-+.++--+ .++|+++-..+. +++-.|+.+.||+.-. T Consensus 143 ~~n~i~f~G~~~~---------g~~GLlN~p~v-~~~~~~~~WaT~~ei~~Di~~~~~~l~~~s~g~~~p~~l~Lpp~~~ 212 (314) T protein:vir:10 143 LLDKLVWSGSAPH---------GIVSVFDQPNI-NNVVATPNWSVPQNAIDDVTAMIDAVESSTQGLHHVTDILLPASAR 212 (314) T ss_pred hhceEEEeecccc---------cceeEeecCCC-ccccCCCCcccHHHHHHHHHHHHHHHHHhcCccccceeEEecHHHH Confidence 9999999998885 35566542110 00111111112 344666654444 3567889999999877 Q ss_pred HHHHHHhcCc-c----eEEeecCCCCcccceecCeeeecccccccCCceeccCc--cc---cccccccCC---CCCCCCe Q lcl|NC_019448. 225 ADFVNSILGR-Q----MQLMQDNSGNVNTGYSVNGFYSSRGFIKLHGSTVMENE--LI---LDESLQPLP---NAPQPAK 291 (463) Q Consensus 225 a~f~~~~~~~-q----rv~~~~n~g~~~~G~~v~~~~s~~G~i~l~~s~~~~~d--~~---l~~~~~~~p---~ap~p~~ 291 (463) +.+..- .+. . ..|.+++++ .. =..++.+.+..|.-+ +-.++..++ .+ +......+| .... -. T Consensus 213 ~~L~~~-~~~~~~tvl~~l~~n~~~-l~-I~~~~el~~ag~~g~-~~~v~y~~~~~~~~~~vp~~~~~l~~e~~~~~-~~ 287 (314) T protein:vir:10 213 RVMQGL-VPQTNLSYGELFTRNNPG-LT-IRFLQFLDNYDGAGG-KAALAFEKSPLNMSIEIPEVTNVLPAQPKDLH-FR 287 (314) T ss_pred Hhhccc-ccCCCccHHHHHHHhCCC-cE-EEEcccccccCCCcc-eEEEEEecCCcEEEEecCccceeecceecCce-EE Confidence 655321 110 0 001111110 10 011333333222100 000222211 11 111111111 1110 01 Q ss_pred eEEEEecc-----CCCcCcccccccceEEE Q lcl|NC_019448. 292 VTATVETK-----QKGAFEDEEDRAGLSYK 316 (463) Q Consensus 292 vtat~~~~-----~~g~~~~~~~~a~ysYk 316 (463) +....... -..... ..-+..+. T Consensus 288 ~~~~~r~~Gv~i~~P~ai~---~~dGI~~~ 314 (314) T protein:vir:10 288 YPVTSKATGLIVYRPLTMA---VIKGITFA 314 (314) T ss_pred EcceeeeEEEEEECcceeE---eeeeeecC Confidence 11110111 011100 11112222 No 131 >protein:vir:103285 Length: 296 # NCBI annotation: hypothetical protein # Family: family:all:463 # MgeID: mge:1605 # MgeName: JK06 # Cross-refs: genbank:acc:YP_277465;genbank:gi:71834107;genbank:GeneID:3562396 Probab=65.40 E-value=0.28 Score=23.58 Aligned_cols=260 Identities=15% Similarity=0.120 Sum_probs=117.3 Q ss_pred cCCccccCccccch---hhhhhHhhhhhccccccchhhhcccchhhHHHh------hhhhhhccCccccccccc-ccCcc Q lcl|NC_019448. 32 ITPDTQIDAGALRR---EILDDQITMLTWTNEDLIFYRDISRRPAQSTVV------KYDQYLRHGNVGHSRFVK-EIGVA 101 (463) Q Consensus 32 ~~p~~q~~gaalr~---esLd~~i~~L~~~~~df~f~~~i~k~~~~stv~------ey~~~~~hG~~g~~~fv~-E~g~~ 101 (463) .+-|...+++++-. |.+|+.+....+. +++.-+.++ +.+.+. .|..+. +.|....++ +.+.. T Consensus 1 ~~~~~a~~~~~f~~~ql~~id~~v~e~~~~--~l~~~~~i~---v~~~~~~~~~~~~~~~~~---~~G~a~~~~~~~~di 72 (296) T protein:vir:10 1 MGVDKADAAGIWTVKQLTASLNKAYETEYD--QNSVVNLFP---VSNEIPGYAKYFEYPVFD---GVGIAQIVADYTDDL 72 (296) T ss_pred CcccchhhhHHHHHHHHHHHHHHHHhhhhc--ccccceecc---cccCCCCceeEEEeeeee---ccCceeEeCCCcccc Confidence 23333345566555 3455555432222 232222222 122222 233333 334444444 45556 Q ss_pred cccCcceEEEEEEEEEeechhhhhhhh--hhhcccccHHHHHHHHHHHHHHHHHHHHHhhcccccCCCccccccccccce Q lcl|NC_019448. 102 PVSDPNIRQKTVSMKYVSDTKNMSIAS--GLVNNIADPSQILTEDAIAVVAKTIEWASFYGDASLTSEVEGEGLEFDGLA 179 (463) Q Consensus 102 ~~~d~~~~r~~~~~k~l~~~~~vs~~~--~lvn~~~Dp~~~~~~~ai~~~~~~~E~a~fyGd~~l~~~~~~~gleFDGl~ 179 (463) +..|.++.|++..+...+..+.++..- .....-.+..+.....|.+.+.+.....+||||+.+. +-||. T Consensus 73 p~v~~~~~~~~~~i~~~~~~~~~~~~El~~a~~~g~~l~~~ka~aA~~~~~~~~n~~~f~G~~~~g---------~~GLl 143 (296) T protein:vir:10 73 PLVDALATERQGKVFRFGNAFLISIDEIKVGQATGQSLSTRKQSLAFEAHDKLLDKLVWSGSTAHG---------IPSVF 143 (296) T ss_pred ceeeccceeEEEEEEEEEeeeeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEeeccccc---------ceeEe Confidence 788999999999999999998887432 2233344777888888889999999999999988853 55665 Q ss_pred eeecCcceEeccCCC--CC--HHHHhhhhhhh---hhcCCceeEEecCHHHHHHHHHHhcC-----cceEEeecCCCCcc Q lcl|NC_019448. 180 KLIDKNNVINAKGNQ--LT--EKHLNEAAVRI---GKGFGTATDAYMPIGVHADFVNSILG-----RQMQLMQDNSGNVN 247 (463) Q Consensus 180 ~lI~~~nviDarG~~--ls--~~~ln~aa~~i---~~~~G~~td~~m~~~vka~f~~~~~~-----~qrv~~~~n~g~~~ 247 (463) +-=+- ..+.+.|.- .+ .++|+++-..+ .+++=.|+.+.||+.....+..-+-+ .+ .|..++++ .. T Consensus 144 N~p~v-~~~~~~~~W~~~t~i~~Di~~~~~~l~~~s~g~~~p~~l~L~p~~~~~L~~~~~~~~~t~l~-~ik~~~~~-l~ 220 (296) T protein:vir:10 144 DYPNI-NNVVSGGSWSQPTTAVSDITSLLDIIETSTNGQHRATHLLLPTTARRIMQNLVPGTSVSYGE-FFRQNNSG-VT 220 (296) T ss_pred ecCCC-ccccccCCccCHHHHHHHHHHHHHHHHHhhCceecceeEEeCHHHHHHHhhccCCCCccHHH-HHHHhcCC-ce Confidence 42110 122232221 11 34466665433 34567789999999998888532210 01 01111111 00 Q ss_pred cceecCeeeecccccccCCceeccCc-cccc----cccccCCCCCCC--CeeEEEEeccC-----CCcCcccccccceEE Q lcl|NC_019448. 248 TGYSVNGFYSSRGFIKLHGSTVMENE-LILD----ESLQPLPNAPQP--AKVTATVETKQ-----KGAFEDEEDRAGLSY 315 (463) Q Consensus 248 ~G~~v~~~~s~~G~i~l~~s~~~~~d-~~l~----~~~~~~p~ap~p--~~vtat~~~~~-----~g~~~~~~~~a~ysY 315 (463) =..++.+.+..|. .-+..++.+++ ..++ .....+|--+.. -.+.......+ ..... ..-+..+ T Consensus 221 -i~~~~~l~~a~~~-g~~~~v~~~~~~~~~~~~v~~~~~~~~~e~~~l~~~~~~~~~~~Gv~i~~P~ai~---~~dGI~~ 295 (296) T protein:vir:10 221 -VEFVQYLNDYNGT-GTSAAIAYEKDPNNMAIEIPEATNALPAQPKDLHFKIPVTSKATGLIVYRPLTMA---VMKGITF 295 (296) T ss_pred -EEEeeeeccCCCC-cceEEEEEEcCCceEEEEcCcceeeecccccCceEEEeeEeeEEEEEEECCceeE---EEeeeec Confidence 0012222222221 00001222211 1111 111111210000 01111110110 11100 1111111 Q ss_pred E Q lcl|NC_019448. 316 K 316 (463) Q Consensus 316 k 316 (463) . T Consensus 296 ~ 296 (296) T protein:vir:10 296 A 296 (296) T ss_pred C Confidence 1 No 132 >protein:vir:100884 Length: 389 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:1473 # MgeName: Lc-Nu # Cross-refs: genbank:acc:YP_358764;genbank:gi:78000028;genbank:GeneID:3726155 Probab=64.47 E-value=0.3 Score=23.45 Aligned_cols=293 Identities=12% Similarity=0.084 Sum_probs=126.3 Q ss_pred CCCCCc------cchHH--------HHhhhhhhHHHHHHhhcCCccCCccccCccccchhhhhhHhhhhhccccccchhh Q lcl|NC_019448. 1 MTIEKN------LSDVQ--------QKYADQFQEDVVKSFQTGYGITPDTQIDAGALRREILDDQITMLTWTNEDLIFYR 66 (463) Q Consensus 1 ~~~~~~------~~~~~--------~~~~k~~~e~~~Ks~~agy~~~p~~q~~gaalr~esLd~~i~~L~~~~~df~f~~ 66 (463) ++.+.. ..... .++++.. ....+++.+ .+..+|+.+--+-+..+|..+.. +.-.+++ T Consensus 71 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~lr~~-~~~~~~~~~------~t~~~gg~~vP~~~~~~i~~~~~--~~~~l~~ 141 (389) T protein:vir:10 71 EPKDDGSKKGTDLSKKPIDAKKKAINDFIHSH-GKVIDATSK------VTSTEAGVLIPEEIIYDPTAEVN--SVVDLST 141 (389) T ss_pred cccccccccccccchhHHHHHHHHHHHHhhcc-hhhhhhhcc------cccCCcceeehHHHHHHHHHHHH--hhhhHHh Confidence 111111 11000 0111111 111222222 22234555444444455433322 2233455 Q ss_pred hcccchhhHHHhhhhhhhccCcccccccccccCccc-ccCcceEEEEEEEEEeechhhhhhhhhhhcccccHHHHHHHHH Q lcl|NC_019448. 67 DISRRPAQSTVVKYDQYLRHGNVGHSRFVKEIGVAP-VSDPNIRQKTVSMKYVSDTKNMSIASGLVNNIADPSQILTEDA 145 (463) Q Consensus 67 ~i~k~~~~stv~ey~~~~~hG~~g~~~fv~E~g~~~-~~d~~~~r~~~~~k~l~~~~~vs~~~~lvn~~~Dp~~~~~~~a 145 (463) .+...++.+.--+|.+.... .+...+++|++..+ .+++.+.+.....+-++.-..+|.-+ +.++..|.+....+.- T Consensus 142 ~~~~~~~~~~~~~~~~~~~~--~~~~~~~~E~~~~~~~~~~~~~~i~~~~~k~~~~~~iS~el-l~ds~~~l~~~i~~~l 218 (389) T protein:vir:10 142 LVTKTPVTTPKGTYPILKRA--TDRFSSVAELAENPKLAEPEFNKVDWSVATYRGAIPLSEEA-IADSAVDLTALVGQSI 218 (389) T ss_pred hcceeeccCCeeEEEEEecC--CCccccccccccccccccccceeeeeeheeeEeeehhhHHH-HhhhhHHHHHHHHHHH Confidence 55555665555555555432 23445789987665 78999999999999998877777754 3455567788888888 Q ss_pred HHHHHHHHHHHHhhcccccCCCccccccccccceeeecCcceEeccCCCCCHHHHhhh-hhhhhhcCCceeEEecCHHHH Q lcl|NC_019448. 146 IAVVAKTIEWASFYGDASLTSEVEGEGLEFDGLAKLIDKNNVINAKGNQLTEKHLNEA-AVRIGKGFGTATDAYMPIGVH 224 (463) Q Consensus 146 i~~~~~~~E~a~fyGd~~l~~~~~~~gleFDGl~~lI~~~nviDarG~~ls~~~ln~a-a~~i~~~~G~~td~~m~~~vk 224 (463) ...+..+++.++..|.....+....... +.+.|..+ ...+..+|+ .-.+||+.+. T Consensus 219 a~~~~~~~~~~i~~g~~~~~~~~~~~~~----------------------~~d~l~~~~~~~~~~~~~--a~~~~n~~~~ 274 (389) T protein:vir:10 219 KEKSVNTYNAMIAPVLQSFTAKKTTTDT----------------------LVDSLKHILNVDLDPAYS--RALVVTQSLF 274 (389) T ss_pred HHHHHHHHHHHHhhhhcccccccccccc----------------------cHHHHHHHHHhhhhhhhC--cEEEecHHHH Confidence 8889999999998888765442211111 12222221 122223342 3478999998 Q ss_pred HHHHHHhcCcceEEeecCCCCcccceecCeeeecccccccCCceeccCccccccccccCCCCCCCCeeEEEEeccCCCcC Q lcl|NC_019448. 225 ADFVNSILGRQMQLMQDNSGNVNTGYSVNGFYSSRGFIKLHGSTVMENELILDESLQPLPNAPQPAKVTATVETKQKGAF 304 (463) Q Consensus 225 a~f~~~~~~~qrv~~~~n~g~~~~G~~v~~~~s~~G~i~l~~s~~~~~d~~l~~~~~~~p~ap~p~~vtat~~~~~~g~~ 304 (463) +.+...--..-|.+.+++..+...+- +. .+++..+....++.+ .|.. ++ ...--| T Consensus 275 ~~L~~lkd~~G~~i~~~~~~~~~~~~---------~~-----~~l~G~pV~~~~~~~------~~~~--~~---~~~~~~ 329 (389) T protein:vir:10 275 NTLDTLKDKNGRYLLHDASDSITDGT---------AK-----GTILGVPVYVVGDTL------LGSL--AG---DQKAFV 329 (389) T ss_pred HHHHHhhccCCCeeeecCcccccccc---------cc-----cccccceeEEecccc------cCCC--CC---ceEEEE Confidence 88875332222344443332211110 00 111221111111100 0000 00 000011 Q ss_pred ccccc------ccceEEEEEEEecCCc-c---ccccceeeeecCCCCceEEEEEecCCCCCCc Q lcl|NC_019448. 305 EDEED------RAGLSYKVVVNSDDAQ-S---APSEEVTATVSNVDDGVKLSISVNAMYQQQP 357 (463) Q Consensus 305 ~~~~~------~a~ysYkV~a~s~~ge-S---~~S~~vt~Tva~~~~gv~ltIt~~a~~g~~~ 357 (463) +|... ....+..+ +++.. . ....-+..-+....+-+.++|+..+.+++.. T Consensus 330 gd~~~~~~~~~~~~~~i~~---~~~~~~~~~~~~~~r~d~~~~~~~a~~~~~~~~~~~~~~~~ 389 (389) T protein:vir:10 330 GDLKRGVLFTDRQQVTLAW---EDSKIYGKYLGAAFRFGVQKADSKAGYFVTNTDVPGSALGK 389 (389) T ss_pred eeccccEEEEeecceEEEe---eccccccceEEEEEEeccEEecccceEEEEeeccCCCCCCC Confidence 11100 00111110 00000 0 0001122233334444556665433323221 No 133 >protein:vir:9704 Length: 394 # NCBI annotation: hypothetical protein # Family: family:all:21 # MgeID: mge:174 # MgeName: 315.2 # Cross-refs: genbank:acc:NP_795466;genbank:gi:28876225;genbank:GeneID:1257769 Probab=60.33 E-value=0.37 Score=22.92 Aligned_cols=280 Identities=13% Similarity=0.043 Sum_probs=116.4 Q ss_pred CCCCCccchHHH---Hhhhhhh------------HHHHHHhhcC----CccCCccccCccccchhhhhhHhhhhhccccc Q lcl|NC_019448. 1 MTIEKNLSDVQQ---KYADQFQ------------EDVVKSFQTG----YGITPDTQIDAGALRREILDDQITMLTWTNED 61 (463) Q Consensus 1 ~~~~~~~~~~~~---~~~k~~~------------e~~~Ks~~ag----y~~~p~~q~~gaalr~esLd~~i~~L~~~~~d 61 (463) .+.......... .+.+... ++........ -.....+..+|+.|--+.+...|-.+..... T Consensus 78 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~~~gg~liP~~~~~~ii~~~~~~~- 156 (394) T protein:vir:97 78 KEVTQEEKTYRESVNDFIRSKGKIVNDSLRFEGKDEVLMPINETTPVEPQKDGIKKENAKPVSSEEILYTPAREVKTVV- 156 (394) T ss_pred cccchhhHHHHHHHHHHHHHHHHHhhhhhhhhhHHHHHHHHHhhhhhhhhccccccccccccChHHHHHHHHHHhhhhh- Confidence 111110000000 1111110 0011111000 0011123344666555555555544443332 Q ss_pred cchhhhcccchhhHHHhhhhhhhccCcccccccccccCccc-ccCcceEEEEEEEEEeechhhhhhhhhhhcccccHHHH Q lcl|NC_019448. 62 LIFYRDISRRPAQSTVVKYDQYLRHGNVGHSRFVKEIGVAP-VSDPNIRQKTVSMKYVSDTKNMSIASGLVNNIADPSQI 140 (463) Q Consensus 62 f~f~~~i~k~~~~stv~ey~~~~~hG~~g~~~fv~E~g~~~-~~d~~~~r~~~~~k~l~~~~~vs~~~~lvn~~~Dp~~~ 140 (463) .+.+-+...++.+.-.+|.... . +.+...+++|++..+ .+++.+...+...+-++.--.+|.-+ +.++..|.+.. T Consensus 157 -~l~~~~~~~~~~~~~~~~~~~~-~-~~~~~~~v~E~~~~~~~~~~~~~~v~l~~~k~~~~i~is~el-l~ds~~~~~~~ 232 (394) T protein:vir:97 157 -DLKPFTTVYQAKKASGKYPVLQ-R-ATTKMVTVAELEKNPALAKPDFKDVAWNIDTYRGAIPLSQES-IDDADVDLVGI 232 (394) T ss_pred -hhhhhceeeeccCcceEEEEEe-c-CCCccceecccccccccccccceeEEeehhheeeehhhHHHH-HhhhhHHHHHH Confidence 2333333334443333444332 2 223456899998775 67899999999999888777776642 23344567778 Q ss_pred HHHHHHHHHHHHHHHHHhhcccccCCCccccccccccceeeecCcceEeccCCCCCHHHHhhhhhhhhhcCCceeEEecC Q lcl|NC_019448. 141 LTEDAIAVVAKTIEWASFYGDASLTSEVEGEGLEFDGLAKLIDKNNVINAKGNQLTEKHLNEAAVRIGKGFGTATDAYMP 220 (463) Q Consensus 141 ~~~~ai~~~~~~~E~a~fyGd~~l~~~~~~~gleFDGl~~lI~~~nviDarG~~ls~~~ln~aa~~i~~~~G~~td~~m~ 220 (463) ..+.-...+..+++.++..|.....+. ...-+|++.++++ .....+|+ ..++|+ T Consensus 233 i~~~la~~~~~~~~~~i~~g~~~~~~~---~~~~~~~~~~~~~---------------------~~~~~~~~--a~~v~n 286 (394) T protein:vir:97 233 VSESISQIKVNTTNDAIAKVLKSFTTK---TVKNLDEIKALLN---------------------GGFDPAYN--VSLIVS 286 (394) T ss_pred HHHHHHHHHHHHHHHHHhhcccccccc---ccccHHHHHHHHH---------------------hhhhhhhC--CEEEEc Confidence 888888889999999999987765442 1223443332221 11111221 347899 Q ss_pred HHHHHHHHHHhcCcceEEeecCCC----CcccceecCeeeecccccccCCc-eeccC--c-cc-cccccccCCCCCCCCe Q lcl|NC_019448. 221 IGVHADFVNSILGRQMQLMQDNSG----NVNTGYSVNGFYSSRGFIKLHGS-TVMEN--E-LI-LDESLQPLPNAPQPAK 291 (463) Q Consensus 221 ~~vka~f~~~~~~~qrv~~~~n~g----~~~~G~~v~~~~s~~G~i~l~~s-~~~~~--d-~~-l~~~~~~~p~ap~p~~ 291 (463) +.+.+.+...-=..-|.+.+++.. ..-.|++|-- +.. ..+... .++.+ . .+ .++.. T Consensus 287 ~~~~~~l~~lkd~~G~~i~~~~~~~~~~~~l~G~pv~~--~~~--~~~~~~~~~~gd~~~~~~~~~~~~----------- 351 (394) T protein:vir:97 287 QSFYQTLDTLKDGNGRYLLQDDITAVSGKVLLGKPVFV--LSD--EVLGANKAFIGDFKRGVLFADRKD----------- 351 (394) T ss_pred HHHHHHHHHhhccCCCeeeecCcCCCCCceeccceeEE--ecc--cccCCccEEEeeccccEEEEEecc----------- Confidence 999988875332222444443322 2345554411 000 001100 11110 0 00 00100 Q ss_pred eEEEEeccCCCcCcccccccceE--------EEEEEEecCCccccc Q lcl|NC_019448. 292 VTATVETKQKGAFEDEEDRAGLS--------YKVVVNSDDAQSAPS 329 (463) Q Consensus 292 vtat~~~~~~g~~~~~~~~a~ys--------YkV~a~s~~geS~~S 329 (463) ++.......... ..-.+... -.++.+.......|- T Consensus 352 ~~~~~~~~~~~~---~~~~~~~r~d~~v~~~~a~~~~~~~~~~~p~ 394 (394) T protein:vir:97 352 LGLRWADNEIYG---QYLQAVLRFGVSKVDDKAGYYVTFTPEPLPL 394 (394) T ss_pred eEEEEecccccc---eeEEEEEEEccEEecccceEEEEecccccCC Confidence 000000000000 00000011 111111222222221 No 134 >protein:vir:102944 Length: 330 # NCBI annotation: major head protein # Family: family:all:1522 # MgeID: mge:1461 # MgeName: EJ-1 # Cross-refs: genbank:acc:NP_945286;genbank:gi:39653721;uniprot:Q708M6;genbank:GeneID:2672858 Probab=55.74 E-value=0.47 Score=22.37 Aligned_cols=295 Identities=15% Similarity=0.170 Sum_probs=119.4 Q ss_pred CCCCCccchHH-----HHhhhhhhHHHHHH--hhcCCccCCccccCccccchhhhhhHhhhhhccccccchhhhcccchh Q lcl|NC_019448. 1 MTIEKNLSDVQ-----QKYADQFQEDVVKS--FQTGYGITPDTQIDAGALRREILDDQITMLTWTNEDLIFYRDISRRPA 73 (463) Q Consensus 1 ~~~~~~~~~~~-----~~~~k~~~e~~~Ks--~~agy~~~p~~q~~gaalr~esLd~~i~~L~~~~~df~f~~~i~k~~~ 73 (463) |. .+.|..- |.|..-+++.+.|- |- +.|+.-+...|+..+.. --..-.+.||+.+- T Consensus 1 Ma--~~~T~l~d~i~pevf~~yv~~~~~~~~~l~----------qSG~i~~~~~i~~~~~~-~G~~i~~P~~~~l~---- 63 (330) T protein:vir:10 1 MA--NELTKILDTITPQQYNAYMQQYTAAKSAFV----------QSGIAVSDERVSKNITS-GGLLVNMPFWNDLT---- 63 (330) T ss_pred CC--CCceEeeeeechhHHHHHHHHHhHHhhhhh----------hcccccccHHHHHHhhc-CCCEEEecccccCC---- Confidence 32 2222211 11111112222111 11 12334455556665522 11223445777651 Q ss_pred hHHHhhhhhhhccCcccccccccccC-cccccCcceEEEEEEEEEeechhhhhhhhhhhcccccHHHHH-HH--HHHHHH Q lcl|NC_019448. 74 QSTVVKYDQYLRHGNVGHSRFVKEIG-VAPVSDPNIRQKTVSMKYVSDTKNMSIASGLVNNIADPSQIL-TE--DAIAVV 149 (463) Q Consensus 74 ~stv~ey~~~~~hG~~g~~~fv~E~g-~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lvn~~~Dp~~~~-~~--~ai~~~ 149 (463) |...-+.|+. ......-.--....++++.+.+..+++.+.+. +-+||+... .+ ..+.+. T Consensus 64 ----------------G~~~~~~dg~~~i~~~ki~t~~~~a~i~~~~k~~~~tD~a~~~-~g~dp~~~i~~q~a~~w~~~ 126 (330) T protein:vir:10 64 ----------------GDSEVLGNGDKALETGKITAGADIACVLYRGRGWAANELTGVV-AGSDPVRAILNRIGAYWLRE 126 (330) T ss_pred ----------------CcccccCCCccccchhhcccceeEEEEEeecceeeehhhhhhh-cchhHHHHHHHHHHHHhhhh Confidence 2333334443 23444445556778888888899999988665 567885432 21 112222 Q ss_pred HHHHHHHH---hhcccccCCCccccccccccceeeecCcceEeccC--CCCCHHHHhhhhhhhhhcCCceeEEecCHHHH Q lcl|NC_019448. 150 AKTIEWAS---FYGDASLTSEVEGEGLEFDGLAKLIDKNNVINAKG--NQLTEKHLNEAAVRIGKGFGTATDAYMPIGVH 224 (463) Q Consensus 150 ~~~~E~a~---fyGd~~l~~~~~~~gleFDGl~~lI~~~nviDarG--~~ls~~~ln~aa~~i~~~~G~~td~~m~~~vk 224 (463) .+..=.++ .|++...... ..++- ....|..| ..++.+.|+.|..+.+...+..+-++||+.+. T Consensus 127 ~q~~lla~l~gvf~~~~~~~~-----~~~~~-------~~~~~~~~~~a~~s~~~l~~A~~~~GD~~~~~~~ivmhS~v~ 194 (330) T protein:vir:10 127 DQKALIATLNGIFATGTAGEK-----GALEE-------THVSDQSKASTGIDAGMVLDAKQLLGDSADQVTAIAMHSAVY 194 (330) T ss_pred HHHHHHHHHHhhhhhhhcccc-----hhhhh-------hheecccccccccCHHHHHHHHHHhccccccceEEEEcHHHH Confidence 22211111 1222221110 01111 11222222 23567889999888888888888999999999 Q ss_pred HHHHHH-hcCcceEEeecCCCCcccceecCeeeecccccccCCceeccCccccccc-----cccCCCCC-----CCC-ee Q lcl|NC_019448. 225 ADFVNS-ILGRQMQLMQDNSGNVNTGYSVNGFYSSRGFIKLHGSTVMENELILDES-----LQPLPNAP-----QPA-KV 292 (463) Q Consensus 225 a~f~~~-~~~~qrv~~~~n~g~~~~G~~v~~~~s~~G~i~l~~s~~~~~d~~l~~~-----~~~~p~ap-----~p~-~v 292 (463) .++.+. ++.- +.+. .+...+| .+.|..++.+|..-... -...+.|- .|+ .+ T Consensus 195 ~~L~~~~li~~---~~~s-~~~~~i~-------------~~~G~~VivdD~~p~~~~~yt~yl~~~GAi~~~~~~~~~~v 257 (330) T protein:vir:10 195 TKLQKDNLIQY---IQPT-TATINIP-------------TYLGYRVIIDDGIAPTGDIYTSYLFRTGSIGLNTGNPSGLT 257 (330) T ss_pred HHHHHhhhhhh---hccc-ccCcccc-------------cccceEEEEeCCCCCCCCceeEEEEecCceeeecccCCccc Confidence 999764 3221 1122 2212221 22344444444331110 01111111 011 11 Q ss_pred EEEEeccC-CCc------CcccccccceEEEEEEEecCCccccccceeeeecCCCCceEEEEEecCCCCCCcceEEEEee Q lcl|NC_019448. 293 TATVETKQ-KGA------FEDEEDRAGLSYKVVVNSDDAQSAPSEEVTATVSNVDDGVKLSISVNAMYQQQPQFVSIYRQ 365 (463) Q Consensus 293 tat~~~~~-~g~------~~~~~~~a~ysYkV~a~s~~geS~~S~~vt~Tva~~~~gv~ltIt~~a~~g~~~~~y~IYR~ 365 (463) ..+..=.. +|. -+..-+..++||...+++-.|+| |+...-++.++++ .+| + T Consensus 258 ~~EtdRd~~~g~~~l~~r~~~~~hp~G~s~~~~~~~~~~~s-Pt~~~L~~~~NW~--------------------~v~-~ 315 (330) T protein:vir:10 258 TFETSREAAKGNDMIYTRRALVMHPYGVKWTGAEVDAGNIT-PSNADLAKFKNWK--------------------RVY-E 315 (330) T ss_pred cccccCCccccceEEEEeeEEEeeeeeeeecccccccCcCC-cChHHhcCCcCcc--------------------ccc-C Confidence 11111111 110 00000112233333333333333 3333333444442 222 2 Q ss_pred cCCCceEEEEEEeee Q lcl|NC_019448. 366 GKETGMYFLIKRVPV 380 (463) Q Consensus 366 ~~~~g~~~li~rv~~ 380 (463) -+.=.+-+++-++.+ T Consensus 316 ~k~i~iv~~~~~~~~ 330 (330) T protein:vir:10 316 PKNIGIIALKHKIGK 330 (330) T ss_pred hhhcceEEEEEecCC Confidence 222234444444433 No 135 >protein:vir:78640 Length: 352 # NCBI annotation: phage capsid # Family: family:all:658 # MgeID: mge:1855 # MgeName: tp310-2 # Cross-refs: genbank:acc:YP_001429943;genbank:gi:156603997;genbank:GeneID:5525386 Probab=50.59 E-value=0.61 Score=21.78 Aligned_cols=289 Identities=8% Similarity=0.066 Sum_probs=121.0 Q ss_pred CCCCCccchHHHHhhhhhhHH---------------HHHHhhcCCccCCccccCccccchhhhhhHhhhhhccccccchh Q lcl|NC_019448. 1 MTIEKNLSDVQQKYADQFQED---------------VVKSFQTGYGITPDTQIDAGALRREILDDQITMLTWTNEDLIFY 65 (463) Q Consensus 1 ~~~~~~~~~~~~~~~k~~~e~---------------~~Ks~~agy~~~p~~q~~gaalr~esLd~~i~~L~~~~~df~f~ 65 (463) ...+.+......++.+.+... ..++++. .+..+||.|-=+.+..+|..+.... -.+. T Consensus 43 ~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~~~~~al~~------~~~~~gG~lIP~~~~~~Ii~~l~~~--s~l~ 114 (352) T protein:vir:78 43 YQSLNDNEKLVKAKAEFYRHAILPNEFEKPSMEAQRLLHALPT------GNDSGGDKLLPKTLSKEIVSEPFAK--NQLR 114 (352) T ss_pred ccccchhhhHHHHHHHHHHHHhhhhHHHHHHhhHHHHHHHhcc------CCCCCCceeccHhHHHHHHHHHHhh--cchh Confidence 122222222222222111111 1122222 2334556555555555543322222 1222 Q ss_pred hhcccchhhHHHhhhhhhhccCcccccccccccCcccccCcceEEEEEEEEEeechhhhhhhhhhhcccccHHHHHHHHH Q lcl|NC_019448. 66 RDISRRPAQSTVVKYDQYLRHGNVGHSRFVKEIGVAPVSDPNIRQKTVSMKYVSDTKNMSIASGLVNNIADPSQILTEDA 145 (463) Q Consensus 66 ~~i~k~~~~stv~ey~~~~~hG~~g~~~fv~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lvn~~~Dp~~~~~~~a 145 (463) +-....++.+ . .+-++. +..+...+++|++..+.+++++.+....++=++.-..+|.-+ +.++..|.+....+.- T Consensus 115 ~~~~v~~~~~-~-~~p~~~--~~~~~a~~v~E~~~~~~~~~~f~~v~~~~~k~~~~i~is~el-l~Ds~~~l~~~i~~~l 189 (352) T protein:vir:78 115 EKARLTNIKG-L-EIPRVS--YTLDDDDFITDVETAKELKLKGDTVKFTTNKFKVFAAISDTV-IHGSDVDLVNWVENAL 189 (352) T ss_pred hheeeEecCC-c-eEEEEe--cCCCcccccccccccccccccceeeeecceeEEeechhhHHH-HhhhhHHHHHHHHHHH Confidence 2222222222 1 122222 222356789999999999999999999888787766666542 3344566666666665 Q ss_pred HHHHHHHHHHHHhhcccccCCCccccccccccceeeecCcceEeccCCCCCHHHHhhhhhhhhhcCCceeEEecCHHHHH Q lcl|NC_019448. 146 IAVVAKTIEWASFYGDASLTSEVEGEGLEFDGLAKLIDKNNVINAKGNQLTEKHLNEAAVRIGKGFGTATDAYMPIGVHA 225 (463) Q Consensus 146 i~~~~~~~E~a~fyGd~~l~~~~~~~gleFDGl~~lI~~~nviDarG~~ls~~~ln~aa~~i~~~~G~~td~~m~~~vka 225 (463) ...+.+. |-..+||+-.= .| +..|+.. ...+--..|..+ .+.|..+-..+...|-.-.-.+|.+.+.. T Consensus 190 a~~~~~~-e~~~~~~~g~g------~~-~~~g~l~---~~~~~~~t~~~~-~d~i~~~~~~l~~~~~~~a~~~mn~~t~~ 257 (352) T protein:vir:78 190 QSGLAAK-ERKDALAVSPK------SG-LEHMSFY---NGSVKEVEGANM-YDAIINALADLHEDYRDNATIYMRYADYV 257 (352) T ss_pred HHHHHHH-HHHhhhhcCCC------Cc-cccccee---ccccccccccch-HHHHHHHHhccChhhhcCCEEEEehHHHH Confidence 5666554 55555554321 11 1223221 111111222222 33444444455566644344778887777 Q ss_pred HHHHHhcCcceEEeecCCCCcccceecCeeeecccccccCCceecc--CccccccccccC-C-CCCCCCe--eEEEEecc Q lcl|NC_019448. 226 DFVNSILGRQMQLMQDNSGNVNTGYSVNGFYSSRGFIKLHGSTVME--NELILDESLQPL-P-NAPQPAK--VTATVETK 299 (463) Q Consensus 226 ~f~~~~~~~qrv~~~~n~g~~~~G~~v~~~~s~~G~i~l~~s~~~~--~d~~l~~~~~~~-p-~ap~p~~--vtat~~~~ 299 (463) .+....-+..+-+....+. .-.|.+|- .+. +. +..++. ...|+.+..... + +-+--.. ..+...-+ T Consensus 258 ~l~~~~~~~~~~~~~~~~~-~llG~PV~--~~~-~~----~~~~~Gdf~~~~~~~~~~~~~~~~~~~~g~~~f~~~~r~D 329 (352) T protein:vir:78 258 KIISVLSNGTTNFFDTPAE-KVFGKPVV--FTD-AA----VKPIVGDFNYFGINYDGTTYDTDKDVKKGEYLFVLTAWYD 329 (352) T ss_pred HHHHHHhccCCcccccCCc-cccccceE--Eec-CC----CceeEeehhhhhhhhhhheeeeeccccCCeeEEEEEeeeC Confidence 7665544455555544333 23466542 111 00 111111 011221111100 0 0000001 11111111 Q ss_pred CCCcCcccccccceEEEEEEEecCCccccc Q lcl|NC_019448. 300 QKGAFEDEEDRAGLSYKVVVNSDDAQSAPS 329 (463) Q Consensus 300 ~~g~~~~~~~~a~ysYkV~a~s~~geS~~S 329 (463) |..-+ ....++..+.-...+.|| T Consensus 330 --g~~~~-----~eA~~~l~~~a~~~~~~~ 352 (352) T protein:vir:78 330 --QQRTL-----DSAFRIAKAKESTGSLPS 352 (352) T ss_pred --ceeec-----hhheEEEEeecccCCCCC Confidence 11111 123444445555555565 No 136 >protein:vir:79642 Length: 329 # NCBI annotation: HsbB # Family: family:all:463 # MgeID: mge:1872 # MgeName: TLS # Cross-refs: genbank:acc:YP_001285525;genbank:gi:148734508;genbank:GeneID:5220000 Probab=33.68 E-value=1.3 Score=19.89 Aligned_cols=293 Identities=13% Similarity=0.094 Sum_probs=121.0 Q ss_pred CCCccchHHHHhhhhhhHHHHHHhhcC-CccCCccccCccccch---hhhhhHhhhhhcccccc-chhhhcccchhhHHH Q lcl|NC_019448. 3 IEKNLSDVQQKYADQFQEDVVKSFQTG-YGITPDTQIDAGALRR---EILDDQITMLTWTNEDL-IFYRDISRRPAQSTV 77 (463) Q Consensus 3 ~~~~~~~~~~~~~k~~~e~~~Ks~~ag-y~~~p~~q~~gaalr~---esLd~~i~~L~~~~~df-~f~~~i~k~~~~stv 77 (463) +--+.-.. +-.-+++ +....++-+. -+.+-+ ..+++++-- |-+|+.+....+..-.+ .|+.-...-.+--.. T Consensus 1 ~~~~~~~~-~~~~d~~-~~~~~a~~~~~~~~~~~-~~~~~~f~~~ql~~id~~v~e~~~~~l~~~~~i~i~~~~~~~~~~ 77 (329) T protein:vir:79 1 MRGNIMSK-EMKYDEF-EANVIANHMQLRGAKND-ASDMGIWTSQELHKIKAQAYEKEYPAGSALRVFPVTSELSDTDKT 77 (329) T ss_pred Cccchhhh-hhccchh-hhhhHhhhcccccceec-cchhhHHHHHHHHHHHHHHHhhhhcccchhhhcccccCCCCceeE Confidence 11111100 1111111 1222333333 222222 233444433 34677775544333221 133222222222222 Q ss_pred hhhhhhhccCcccccccccc-cCcccccCcceEEEEEEEEEeechhhhhhhhh--hhcccccHHHHHHHHHHHHHHHHHH Q lcl|NC_019448. 78 VKYDQYLRHGNVGHSRFVKE-IGVAPVSDPNIRQKTVSMKYVSDTKNMSIASG--LVNNIADPSQILTEDAIAVVAKTIE 154 (463) Q Consensus 78 ~ey~~~~~hG~~g~~~fv~E-~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~--lvn~~~Dp~~~~~~~ai~~~~~~~E 154 (463) ..|..+... |....++. .+..+..|.++.+++..+.-++.++.++..-- ....-.+..+.....|.+.+.+... T Consensus 78 ~t~~~~~~~---G~a~~~~d~~~dip~vd~~~~~~~~~i~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aA~~~~~~~~n 154 (329) T protein:vir:79 78 FEYQTFDKV---GHAKIIADYTDDLSTVDALMTSEFGKVFRLGNAFLISIDEIKAGQRTGKSLSTRKANAAQNAHDQLVN 154 (329) T ss_pred EEeeeeecc---eeeeeecCcccccceeecccceeEEEEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhc Confidence 344444433 44444453 45567889999999999999999988865422 2222336677788888889999999 Q ss_pred HHHhhcccccCCCccccccccccceeeecCcceEeccCCCC-----C----HHHHhhhhhhhhhc---CCceeEEecCHH Q lcl|NC_019448. 155 WASFYGDASLTSEVEGEGLEFDGLAKLIDKNNVINAKGNQL-----T----EKHLNEAAVRIGKG---FGTATDAYMPIG 222 (463) Q Consensus 155 ~a~fyGd~~l~~~~~~~gleFDGl~~lI~~~nviDarG~~l-----s----~~~ln~aa~~i~~~---~G~~td~~m~~~ 222 (463) ..+||||+.+ .+-||.+-=+-..+....+... + .+.|+++-..+..+ .-.|+.+.||+. T Consensus 155 ~i~f~G~~~~---------g~~GLlN~p~v~~~~~~~~~~~~w~~kt~~ei~~di~~~~~~l~~~s~g~~~p~~L~Lpp~ 225 (329) T protein:vir:79 155 HLVFKGSKPH---------KIISVFEHPNLTTINSAGWNNAAGTGKKPETAQDELEQAIEKIETLTNGQHRANMILIPPS 225 (329) T ss_pred cEEEeecccc---------cceeeecCCCccccccCCCCCccccccCHHHHHHHHHHHHHHHHHhcCceecccEEEecHH Confidence 9999998875 3556654211111111111111 1 34577776666532 345788999999 Q ss_pred HHHHHHHHhcCcc----eEEeecCCCCcccceecCeeeecccccccCCceeccCcc--c---cccccccCC---CCC--- Q lcl|NC_019448. 223 VHADFVNSILGRQ----MQLMQDNSGNVNTGYSVNGFYSSRGFIKLHGSTVMENEL--I---LDESLQPLP---NAP--- 287 (463) Q Consensus 223 vka~f~~~~~~~q----rv~~~~n~g~~~~G~~v~~~~s~~G~i~l~~s~~~~~d~--~---l~~~~~~~p---~ap--- 287 (463) ....+..-+.... .+|.++++ ++.+ ..++.+.++.|. .-+-.+++++|. + +......+| ... T Consensus 226 ~~~~L~~~~~~~~~tvl~~lk~~~~-~l~I-~~~~el~~ag~~-g~~~~v~y~~~~~~~~~~vp~~~~~l~~q~~~~~~~ 302 (329) T protein:vir:79 226 MRKVLMVRMPETTMSYLDYFKQQNG-GITI-ESISELEDIDGA-GTKAALVYEKDPMNMSIEIPEAFNMLTAQPKDLHFK 302 (329) T ss_pred HHHHhhcccCCCCccHHHHHHHhCC-CcEE-EEcccccccCCC-CceEEEEEecCCceEEEecCcceeeeeceecCceEE Confidence 8777743211100 00111111 1111 112333222211 001112222111 1 111111111 100 Q ss_pred CCCee-EEEEeccCCCcCcccccccce Q lcl|NC_019448. 288 QPAKV-TATVETKQKGAFEDEEDRAGL 313 (463) Q Consensus 288 ~p~~v-tat~~~~~~g~~~~~~~~a~y 313 (463) .|-.- ++.+.---.....--++--.. T Consensus 303 v~~~~r~~Gv~i~~P~ai~~~dGI~~~ 329 (329) T protein:vir:79 303 VPCTSKCTGLTIYRPLTLVLIKGLVVG 329 (329) T ss_pred EceeeeEEEEEEECcceeeeeeeeeeC Confidence 01000 000000000000000000000 No 137 >protein:vir:962 Length: 397 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:19 # MgeName: bIL285 # Cross-refs: genbank:acc:NP_076616;genbank:gi:13095724;genbank:GeneID:920264 Probab=30.45 E-value=1.6 Score=19.50 Aligned_cols=283 Identities=13% Similarity=0.120 Sum_probs=112.1 Q ss_pred CCCCCccchHHHHhhhhh---h---HHHHHHhhc----C--CccCCccccCccccchhhhhhHhhhhhccccccchhhhc Q lcl|NC_019448. 1 MTIEKNLSDVQQKYADQF---Q---EDVVKSFQT----G--YGITPDTQIDAGALRREILDDQITMLTWTNEDLIFYRDI 68 (463) Q Consensus 1 ~~~~~~~~~~~~~~~k~~---~---e~~~Ks~~a----g--y~~~p~~q~~gaalr~esLd~~i~~L~~~~~df~f~~~i 68 (463) ...+.............+ . .+..+++.. . -.....+..+|+.+--+.+...|..+. +.-.+.+.+ T Consensus 89 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vp~~~~~~i~~~~---~~~~l~~~~ 165 (397) T protein:vir:96 89 DPTDQKPKDGEKRKMKKFKVTEEELAEKRSAINAFVKSKGAEKRDGFTSVEGGALIPQELLQPQLEPK---DIVDLSKYV 165 (397) T ss_pred hhhhhhhHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHhhhhhhhhcccccccccchhHHHHHHHHHhh---hhhhHHHhh Confidence 111111111111000000 0 011111110 0 001111223344433333444443332 222234444 Q ss_pred ccchhhHHHhhhhhhhccCcccccccccccCccc-ccCcceEEEEEEEEEeechhhhhhhhhhhcccccHHHHHHHHHHH Q lcl|NC_019448. 69 SRRPAQSTVVKYDQYLRHGNVGHSRFVKEIGVAP-VSDPNIRQKTVSMKYVSDTKNMSIASGLVNNIADPSQILTEDAIA 147 (463) Q Consensus 69 ~k~~~~stv~ey~~~~~hG~~g~~~fv~E~g~~~-~~d~~~~r~~~~~k~l~~~~~vs~~~~lvn~~~Dp~~~~~~~ai~ 147 (463) ...++.+.-.+|.....+ .+...+++|++..+ .+++.+.+....++=++.--.+|.-+ +.++..|.+....+.--. T Consensus 166 ~~~~~~~~~~~~~~~~~~--~~~~~~~~E~~~~~~~~~~~~~~i~~~~~~~~~~~~~s~el-l~ds~~~l~~~i~~~l~~ 242 (397) T protein:vir:96 166 RSVPVNSASGKFPVISKS--GSKMATVQQLEKNPQLANPKMVEIDYSVATRRGYIPISQEM-IDDASYDVTGLIADEIQD 242 (397) T ss_pred hhccccccceeEEEEecc--CCccccccccccccccccccccceeecHhHhhcchhhHHHH-HhhhHHHHHHHHHHHHHH Confidence 444444433344443333 23456788888665 68999999999988777655555432 234455667777777778 Q ss_pred HHHHHHHHHHhhcccccCCCccccccccccceeeecCcceEeccCCCCCHHHHhhhhhhhhhcCCceeEEecCHHHHHHH Q lcl|NC_019448. 148 VVAKTIEWASFYGDASLTSEVEGEGLEFDGLAKLIDKNNVINAKGNQLTEKHLNEAAVRIGKGFGTATDAYMPIGVHADF 227 (463) Q Consensus 148 ~~~~~~E~a~fyGd~~l~~~~~~~gleFDGl~~lI~~~nviDarG~~ls~~~ln~aa~~i~~~~G~~td~~m~~~vka~f 227 (463) .+..+++.+++.|+..-.+.+ .+-+|++.++| .......|+ + -.+||+.+.+.+ T Consensus 243 ~~~~~~~~~i~~g~g~~~~~~---~~~~d~~~~~~---------------------~~~~~~~~~-a-~~v~n~~~~~~l 296 (397) T protein:vir:96 243 QSLNTKNADIAAVLKTATAKS---VVGVDGLKDLI---------------------NKEIKKVYD-V-KLFISASMYSEL 296 (397) T ss_pred HHHHHHHHHHhhccccccccc---ccchHHHHHHH---------------------HHhhhhhcC-c-EEEEcHHHHHHH Confidence 888899999998877654421 12233222211 122222332 2 389999999998 Q ss_pred HHHhcCcceEEeecCCCCc----ccceecCeeeecccccccCCceeccCccccccccccCCCCCCCCeeEEEEeccCCCc Q lcl|NC_019448. 228 VNSILGRQMQLMQDNSGNV----NTGYSVNGFYSSRGFIKLHGSTVMENELILDESLQPLPNAPQPAKVTATVETKQKGA 303 (463) Q Consensus 228 ~~~~~~~qrv~~~~n~g~~----~~G~~v~~~~s~~G~i~l~~s~~~~~d~~l~~~~~~~p~ap~p~~vtat~~~~~~g~ 303 (463) ...-=..-|.+.+++..+. -.|++|- ... ++..+... .....- T Consensus 297 ~~lkd~~G~~~~~~~~~~~~~~~l~G~pv~----------------------~~~------~~~~~~~~-----~~~~~~ 343 (397) T protein:vir:96 297 DKLKDKNGRYLLQDSITAASGKQLLGKEVV----------------------VLD------DDVIGKSV-----GNVVGF 343 (397) T ss_pred HHhhccCCCeEeccCccCCCcccccccceE----------------------Eec------ccccCCCC-----CceEEE Confidence 7532112234443322211 1222211 000 00000000 000000 Q ss_pred Cccccc------ccceEEEEEEEecCC-cc--ccccceeeeecCCCCceEEEEEec Q lcl|NC_019448. 304 FEDEED------RAGLSYKVVVNSDDA-QS--APSEEVTATVSNVDDGVKLSISVN 350 (463) Q Consensus 304 ~~~~~~------~a~ysYkV~a~s~~g-eS--~~S~~vt~Tva~~~~gv~ltIt~~ 350 (463) |+|... ....+ |....... .. ....-+..-+.....-+.+++|.. T Consensus 344 ~gd~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~r~d~~~~~~~a~~~~~~~~a 397 (397) T protein:vir:96 344 IGDAKAFASFFDRKQVS--VSWVDNNIYGQLLAGIIRYDVKATDKKAGFYVTFTIG 397 (397) T ss_pred EeehhcceEeEeecceE--EEEecccccceeEEEEEEEccEEecccceEEEEeecC Confidence 111100 00000 00000000 00 000112223333334455555532 No 138 >protein:vir:96833 Length: 275 # NCBI annotation: ORF015 # Family: family:all:522 # MgeID: mge:1642 # MgeName: EW # Cross-refs: genbank:acc:YP_240157;genbank:gi:66395822;genbank:GeneID:5133174 Probab=22.29 E-value=2.5 Score=18.43 Aligned_cols=261 Identities=13% Similarity=0.123 Sum_probs=115.7 Q ss_pred CCCCCccchHH-----HHhhhhhhHHHHHHh-hcCCccCCccccCccccchhhhhhHhhhhhccccccchhhhcccchhh Q lcl|NC_019448. 1 MTIEKNLSDVQ-----QKYADQFQEDVVKSF-QTGYGITPDTQIDAGALRREILDDQITMLTWTNEDLIFYRDISRRPAQ 74 (463) Q Consensus 1 ~~~~~~~~~~~-----~~~~k~~~e~~~Ks~-~agy~~~p~~q~~gaalr~esLd~~i~~L~~~~~df~f~~~i~k~~~~ 74 (463) |-+.+. |..- |.+..-+.|.+.|++ -++-.+...+ |+-. .-..=+|.+|+.+ T Consensus 1 ~~~~~~-T~l~d~i~PEv~~~~v~~~~~~~~~~~~~~~~~~~-----------l~g~----~G~tv~iP~~~~i------ 58 (275) T protein:vir:96 1 MALENM-TKLANMVNPEVLAPMMQAELDKKLKFAQFADIDNT-----------LVGQ----PGNTITFPAFVYS------ 58 (275) T ss_pred CCCccc-chhhhhhchHHHHHHHHHHHHHhhhhcccceeccc-----------ccCC----CCCEEEeeeeccC------ Confidence 555543 2222 222222334444442 2222111100 0000 0001133344333 Q ss_pred HHHhhhhhhhccCcccccccccccCcccccCcceEEEEEEEEEeechhhhhhhhhhhcccccHHHHHHHHHHHHHHHHHH Q lcl|NC_019448. 75 STVVKYDQYLRHGNVGHSRFVKEIGVAPVSDPNIRQKTVSMKYVSDTKNMSIASGLVNNIADPSQILTEDAIAVVAKTIE 154 (463) Q Consensus 75 stv~ey~~~~~hG~~g~~~fv~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lvn~~~Dp~~~~~~~ai~~~~~~~E 154 (463) |....+.|++......-+......+++..+-...+++...++ ..+||+....+..-..+++.++ T Consensus 59 ---------------g~a~~~~~g~~i~~~~lt~~~~~~~i~~~~~~~~i~D~~~~~-~~~d~~~~~~~~~a~~~a~~~d 122 (275) T protein:vir:96 59 ---------------GDAKVVPEGEEIPIDLIETKKRQATIRKIGKGTVLTDEALLS-GYGDPKGEAVRQHGLAIANKVD 122 (275) T ss_pred ---------------CccccccCCCCcchhhcccceeeEEeehhcccccccHHHHHh-hccchHHHHHHHHHHHHHHHHH Confidence 344456677777777778888889999999899999986544 4678877777777777777777 Q ss_pred HHHhhcccccCCCccccccccccceeeecCcceEeccCCCCCHHHHhhhhhhhhhcCCceeEEecCHHHHHHHHHHhcCc Q lcl|NC_019448. 155 WASFYGDASLTSEVEGEGLEFDGLAKLIDKNNVINAKGNQLTEKHLNEAAVRIGKGFGTATDAYMPIGVHADFVNSILGR 234 (463) Q Consensus 155 ~a~fyGd~~l~~~~~~~gleFDGl~~lI~~~nviDarG~~ls~~~ln~aa~~i~~~~G~~td~~m~~~vka~f~~~~~~~ 234 (463) -.++ ..+. + ..+. .....++.+.|..|..+.+..-...+-++||+.+.+.|...- . T Consensus 123 ~~ll---~~l~-----------~------a~~~--~~~~~~~~d~i~dA~~~lgd~~~~~~~ivv~p~~~~~L~k~~--~ 178 (275) T protein:vir:96 123 NDVL---EALQ-----------G------ATLK--VEADITKLAGLQTAIDKFNDEDLEPMVLFVNPLDAGKLRASA--T 178 (275) T ss_pred HHHH---HHHh-----------c------cccc--ccccccCHHHHHHHHHHhccccCCccEEEeCHHHHHHHHhcc--c Confidence 6655 1111 1 0111 112446677788888877755556677999999999885431 1 Q ss_pred ceEEeecCCCCcccceecCeeeeccccc-ccCCceeccCccccccccccCCCCCCCCeeEEEEeccCCCcCcccccccce Q lcl|NC_019448. 235 QMQLMQDNSGNVNTGYSVNGFYSSRGFI-KLHGSTVMENELILDESLQPLPNAPQPAKVTATVETKQKGAFEDEEDRAGL 313 (463) Q Consensus 235 qrv~~~~n~g~~~~G~~v~~~~s~~G~i-~l~~s~~~~~d~~l~~~~~~~p~ap~p~~vtat~~~~~~g~~~~~~~~a~y 313 (463) .+++-.+..|..- +. .|.| .+.|-.+..+|.. |.- ++---++|.++...... T Consensus 179 ~~f~~~~~~g~~~--------~~-~G~ig~~~G~~Vi~s~~~-----------p~~-----t~~i~~~gA~~~~~~~~-- 231 (275) T protein:vir:96 179 DNFTRATLLGDNV--------IV-KGAFGEALGAIIVRSNKI-----------KEG-----EAILAKRGAVKLITKRD-- 231 (275) T ss_pred ccccccccccccc--------ee-ccccceecCeeEEEeCCC-----------Ccc-----eEEEEeccceeeeecCC-- Confidence 2233233222211 00 1111 2344444443322 000 00000111111000000 Q ss_pred EEEEEEE-ecCCccccccce------eeeecCCCCceEEEEEecCCCCC Q lcl|NC_019448. 314 SYKVVVN-SDDAQSAPSEEV------TATVSNVDDGVKLSISVNAMYQQ 355 (463) Q Consensus 314 sYkV~a~-s~~geS~~S~~v------t~Tva~~~~gv~ltIt~~a~~g~ 355 (463) +.+= .|+..+ =+..+ .+-+...++-+++++++..+ |. T Consensus 232 ---~~vE~~Rd~~~-~~d~i~~~~~y~~~~~~~~~vv~~t~~~~~~-~~ 275 (275) T protein:vir:96 232 ---FFLETERHASH-KSTALFSDKHYVAYLYDESKVVKITKSASGL-GV 275 (275) T ss_pred ---cccccccchhh-cCcEEEEeEEEEEEEEcCccEEEEEeccccc-CC Confidence 0000 000000 00000 11122233334555554333 32 No 139 >protein:vir:105334 Length: 276 # NCBI annotation: putative phage major capsid protein # Family: family:all:522 # MgeID: mge:1679 # MgeName: PH15 # Cross-refs: genbank:acc:YP_950669;genbank:gi:119967839;genbank:GeneID:4643213 Probab=20.04 E-value=2.8 Score=18.10 Aligned_cols=266 Identities=11% Similarity=0.077 Sum_probs=110.5 Q ss_pred CCCCCccchHHH-----HhhhhhhHHHHHHh-hcCCccCCccccCccccchhhhhhHhhhhhccccccchhhhcccchhh Q lcl|NC_019448. 1 MTIEKNLSDVQQ-----KYADQFQEDVVKSF-QTGYGITPDTQIDAGALRREILDDQITMLTWTNEDLIFYRDISRRPAQ 74 (463) Q Consensus 1 ~~~~~~~~~~~~-----~~~k~~~e~~~Ks~-~agy~~~p~~q~~gaalr~esLd~~i~~L~~~~~df~f~~~i~k~~~~ 74 (463) |. ...|..-+ -+..-+.+.+.|++ -.+.++.... |+ .-....=+|.+|+.+ T Consensus 1 Ma--~~~T~l~d~i~Pev~~~~v~~~~~~~~~~~~~~~~~~~-----------l~----g~~G~ti~iP~~~~i------ 57 (276) T protein:vir:10 1 MA--QGTTTKSTQIVPEVLAPMMQAELDKKLRFAQFADIDST-----------LV----GQPGDTLTFPAFVYS------ 57 (276) T ss_pred CC--cceeehhhhhchHHHHHHHHHHHHhhhhhcccceeccc-----------cc----CCCCCEEEeeeecCC------ Confidence 32 22221111 11111223333332 2222211110 00 000111133444443 Q ss_pred HHHhhhhhhhccCcccccccccccCcccccCcceEEEEEEEEEeechhhhhhhhhhhcccccHHHHHHHHHHHHHHHHHH Q lcl|NC_019448. 75 STVVKYDQYLRHGNVGHSRFVKEIGVAPVSDPNIRQKTVSMKYVSDTKNMSIASGLVNNIADPSQILTEDAIAVVAKTIE 154 (463) Q Consensus 75 stv~ey~~~~~hG~~g~~~fv~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lvn~~~Dp~~~~~~~ai~~~~~~~E 154 (463) |...-+.|++......-+.......++..+-...+++.+.++. .+||+....+..-..++..+. T Consensus 58 ---------------gda~~~~eg~~i~~~~lt~~~~~a~i~~~~k~~~~tD~a~~~~-~~dp~~~~~~~~~~~~a~~~d 121 (276) T protein:vir:10 58 ---------------GDATVVPEGQKIPVDKIETNRREAKIHKIGKGTDITDEALLSG-YGDPQGEAVRQHGLAIANKVD 121 (276) T ss_pred ---------------CccccccCCCccCccccccceeeEEeehccccccccHHHHHhh-ccchHHHHHHHHHHHHHHHHH Confidence 4444567888888888888899999999888888888886654 668877766666666666655 Q ss_pred HHHhhcccccCCCccccccccccceeeecCcceEeccCCCCCHHHHhhhhhhhhhcCCceeEEecCHHHHHHHHHHhcCc Q lcl|NC_019448. 155 WASFYGDASLTSEVEGEGLEFDGLAKLIDKNNVINAKGNQLTEKHLNEAAVRIGKGFGTATDAYMPIGVHADFVNSILGR 234 (463) Q Consensus 155 ~a~fyGd~~l~~~~~~~gleFDGl~~lI~~~nviDarG~~ls~~~ln~aa~~i~~~~G~~td~~m~~~vka~f~~~~~~~ 234 (463) ..++ +- +.| .... ..+..++.+.|..|..+.+..-...+-+.||+.+.+.+..... T Consensus 122 ~~~~---~~-----------l~~------~~~~--~~~~~~t~d~i~~A~~~lgd~~~~~~~ivv~p~~~~~L~k~~~-- 177 (276) T protein:vir:10 122 NDVL---EA-----------LRG------TKLT--VSADIGTLAGLEAAIDTFDDEDLEPMVLFINPKDAGKLRSSAS-- 177 (276) T ss_pred HHHH---HH-----------Hhc------cccc--ccccccCHHHHHHHHHHhccccCcccEEEEcHHHHHHHHHhcc-- Confidence 4443 01 111 1111 1234567788888888887655567779999999999964322 Q ss_pred ceEEeecCCCCcccceecCeeeecccccccCCceeccCcccccc-ccccCCCCCC---CCeeEEEEeccCCCcCcccccc Q lcl|NC_019448. 235 QMQLMQDNSGNVNTGYSVNGFYSSRGFIKLHGSTVMENELILDE-SLQPLPNAPQ---PAKVTATVETKQKGAFEDEEDR 310 (463) Q Consensus 235 qrv~~~~n~g~~~~G~~v~~~~s~~G~i~l~~s~~~~~d~~l~~-~~~~~p~ap~---p~~vtat~~~~~~g~~~~~~~~ 310 (463) .+++-.++.|..-+ .++ ..| .+.|-.++.+|..-.- .-...+.|-. ...++.+ +.-.-+.+...-. T Consensus 178 ~~f~~~s~~g~~~~---~~G---~ig--~~~G~~Vi~s~~~p~~t~~l~~~gAi~~~~~~~~~vE--~dRd~~~~~d~i~ 247 (276) T protein:vir:10 178 DNFTRATELGDNII---VKG---AFG--EALGAVIVRSKKLDEGEAILAKRGAVKLITKRDFFLE--TDRDPSTKTTALY 247 (276) T ss_pred ccccccccccccce---ecc---ccc--eecceeEEEcCCCCcceEEEEeccceeeeecCCceee--cccchhhcccEEE Confidence 12222222221100 000 011 2233333333322000 0000011000 0001111 1111000000001 Q ss_pred cceEEEEEEEecCCccccccceeeeecCCCCceEEEEEecCCCCCCcc Q lcl|NC_019448. 311 AGLSYKVVVNSDDAQSAPSEEVTATVSNVDDGVKLSISVNAMYQQQPQ 358 (463) Q Consensus 311 a~ysYkV~a~s~~geS~~S~~vt~Tva~~~~gv~ltIt~~a~~g~~~~ 358 (463) +-+.|-+.+.+ |+. -++||..+++..+-. T Consensus 248 ~~~~y~~~~~~------~~~-------------vv~~t~~~~~~~~~~ 276 (276) T protein:vir:10 248 SDKHYVAYLYD------ESK-------------AVKVTKGAGTTDSGA 276 (276) T ss_pred EeeEEEEEEEc------Ccc-------------eEEEecCCcCCcCCC Confidence 11222221211 111 122222222111100 Done!