Query lcl|NC_018863.1_cdsid_YP_006908495.1 [gene=BCB4_0266] [protein=putative capsid protein] [protein_id=YP_006908495.1] [location=complement(153803..155242)] Match_columns 479 No_of_seqs 44 out of 47 Neff 5.0 Searched_HMMs 1612 Date Thu Nov 7 15:27:25 2013 Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_266 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_266_vs_rec_db.hhr No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 protein:vir:95603 Length: 463 100.0 1E-198 9E-202 1105.1 35.4 462 1-470 1-463 (463) 2 protein:vir:99311 Length: 463 100.0 1E-198 9E-202 1105.1 35.4 462 1-470 1-463 (463) 3 protein:vir:96666 Length: 462 100.0 6E-197 4E-200 1096.4 34.4 461 4-469 1-462 (462) 4 protein:vir:80835 Length: 464 100.0 5E-192 3E-195 1069.4 34.3 462 1-479 1-463 (464) 5 protein:vir:63741 Length: 468 100.0 1E-189 9E-193 1056.0 33.8 467 1-478 1-468 (468) 6 protein:vir:80491 Length: 467 100.0 5E-189 3E-192 1053.1 32.6 466 1-478 1-467 (467) 7 protein:vir:100851 Length: 514 100.0 3E-187 2E-190 1043.5 35.8 472 1-476 7-514 (514) 8 protein:vir:102823 Length: 470 100.0 3E-157 2E-160 878.8 31.6 436 15-471 1-470 (470) 9 protein:vir:8843 Length: 317 # 99.4 1.9E-14 1.2E-17 95.9 13.9 293 31-343 1-317 (317) 10 protein:vir:94933 Length: 330 99.1 8.6E-13 5.3E-16 86.7 11.9 323 1-468 1-330 (330) 11 protein:vir:97255 Length: 310 99.0 2.3E-11 1.4E-14 78.9 12.7 300 15-467 1-310 (310) 12 protein:vir:93631 Length: 580 98.5 4.4E-09 2.7E-12 66.4 11.3 261 185-479 1-291 (580) 13 protein:vir:5120 Length: 615 # 98.1 1.7E-07 1.1E-10 57.7 12.2 264 169-479 1-317 (615) 14 protein:vir:7771 Length: 330 # 97.8 1.1E-05 6.8E-09 47.8 17.8 321 15-409 1-330 (330) 15 protein:vir:8102 Length: 543 # 97.8 7.6E-06 4.7E-09 48.6 15.6 324 1-389 195-543 (543) 16 protein:vir:8187 Length: 311 # 97.6 1.5E-05 9.4E-09 47.0 15.7 307 39-404 1-311 (311) 17 protein:vir:96392 Length: 324 97.6 2.5E-05 1.5E-08 45.9 15.9 317 1-359 1-324 (324) 18 protein:vir:78830 Length: 324 97.6 2.5E-05 1.5E-08 45.9 15.9 317 1-359 1-324 (324) 19 protein:vir:78523 Length: 338 97.5 3.3E-05 2E-08 45.2 15.7 322 20-378 1-338 (338) 20 protein:vir:10145 Length: 567 97.5 1.4E-05 8.6E-09 47.2 13.5 280 140-479 1-336 (567) 21 protein:vir:3306 Length: 567 # 97.5 1.4E-05 8.6E-09 47.2 13.5 280 140-479 1-336 (567) 22 protein:vir:2792 Length: 567 # 97.5 1.4E-05 8.6E-09 47.2 13.5 280 140-479 1-336 (567) 23 protein:vir:9979 Length: 567 # 97.5 1.4E-05 8.6E-09 47.2 13.5 280 140-479 1-336 (567) 24 protein:vir:827 Length: 567 # 97.4 1.7E-05 1.1E-08 46.7 13.5 276 140-479 1-336 (567) 25 protein:vir:104388 Length: 566 97.4 1.2E-05 7.3E-09 47.6 12.4 281 141-479 1-335 (566) 26 protein:vir:103955 Length: 324 97.4 3.1E-05 1.9E-08 45.3 14.4 304 1-338 1-324 (324) 27 protein:vir:4953 Length: 397 # 97.3 4.1E-05 2.5E-08 44.6 14.8 307 1-339 60-397 (397) 28 protein:vir:4339 Length: 395 # 97.3 2.5E-05 1.6E-08 45.8 13.0 297 1-357 68-395 (395) 29 protein:vir:96223 Length: 324 97.2 8.2E-05 5.1E-08 43.0 15.4 312 1-359 1-324 (324) 30 protein:vir:97148 Length: 324 97.2 4.3E-05 2.7E-08 44.5 13.9 304 1-338 1-324 (324) 31 protein:vir:9309 Length: 324 # 97.2 2.7E-05 1.7E-08 45.6 12.7 315 1-359 1-324 (324) 32 protein:vir:100135 Length: 418 97.2 5.1E-05 3.1E-08 44.1 14.1 315 1-349 77-418 (418) 33 protein:vir:99749 Length: 324 97.1 0.00012 7.4E-08 42.1 15.2 315 1-359 1-324 (324) 34 protein:vir:9574 Length: 300 # 97.1 0.00017 1E-07 41.3 15.9 294 31-377 1-300 (300) 35 protein:vir:4226 Length: 326 # 96.8 0.00023 1.4E-07 40.6 14.5 300 12-362 1-326 (326) 36 protein:vir:78223 Length: 333 96.8 0.00031 1.9E-07 39.8 15.2 314 20-376 1-333 (333) 37 protein:vir:105905 Length: 304 96.8 0.00032 2E-07 39.8 15.9 301 31-402 1-304 (304) 38 protein:vir:94142 Length: 304 96.8 0.00032 2E-07 39.8 15.9 301 31-402 1-304 (304) 39 protein:vir:81160 Length: 371 96.8 0.00033 2.1E-07 39.7 15.6 312 1-388 55-371 (371) 40 protein:vir:95318 Length: 328 96.8 2.4E-05 1.5E-08 45.9 8.7 279 1-336 1-328 (328) 41 protein:vir:191 Length: 385 # 96.8 0.00021 1.3E-07 40.8 13.8 311 1-358 57-385 (385) 42 protein:vir:1886 Length: 385 # 96.8 0.00021 1.3E-07 40.8 13.8 311 1-358 57-385 (385) 43 protein:vir:1433 Length: 435 # 96.7 0.00035 2.2E-07 39.5 14.8 317 1-348 45-435 (435) 44 protein:vir:103759 Length: 330 96.6 9.1E-05 5.7E-08 42.7 11.0 263 1-319 1-330 (330) 45 protein:vir:81070 Length: 390 96.6 0.00015 9.5E-08 41.5 11.9 308 1-344 61-390 (390) 46 protein:vir:97053 Length: 390 96.6 0.00048 3E-07 38.8 16.2 299 1-344 61-390 (390) 47 protein:vir:105038 Length: 428 96.5 0.00014 8.5E-08 41.8 11.1 318 1-346 70-428 (428) 48 protein:vir:10364 Length: 390 96.5 0.00049 3.1E-07 38.7 14.0 312 1-383 61-390 (390) 49 protein:vir:80684 Length: 315 96.5 0.00026 1.6E-07 40.3 12.4 289 31-338 1-315 (315) 50 protein:vir:4856 Length: 293 # 96.4 0.0002 1.3E-07 40.8 11.7 254 27-339 1-293 (293) 51 protein:vir:7324 Length: 335 # 96.4 0.00011 7.1E-08 42.2 10.3 273 1-326 1-335 (335) 52 protein:vir:95763 Length: 297 96.4 0.00065 4E-07 38.0 14.5 291 26-359 1-297 (297) 53 protein:vir:104085 Length: 320 96.4 0.00066 4.1E-07 38.0 15.0 286 31-349 1-320 (320) 54 protein:vir:2504 Length: 305 # 96.4 0.00056 3.5E-07 38.4 13.9 287 37-358 1-305 (305) 55 protein:vir:9759 Length: 303 # 96.4 0.00068 4.2E-07 38.0 16.7 297 39-387 1-303 (303) 56 protein:vir:2430 Length: 318 # 96.3 0.00065 4E-07 38.1 13.7 302 15-350 1-318 (318) 57 protein:vir:4830 Length: 397 # 96.3 0.00079 4.9E-07 37.6 16.5 313 1-354 66-397 (397) 58 protein:vir:1638 Length: 298 # 96.2 0.00091 5.6E-07 37.3 15.5 290 41-374 1-298 (298) 59 protein:vir:80376 Length: 435 96.1 0.00096 6E-07 37.1 15.2 319 1-348 55-435 (435) 60 protein:vir:4997 Length: 397 # 96.1 0.00098 6.1E-07 37.1 14.4 280 1-317 71-397 (397) 61 protein:vir:94771 Length: 298 95.9 0.0013 8.2E-07 36.4 14.9 290 41-374 1-298 (298) 62 protein:vir:98339 Length: 415 95.8 0.0014 8.7E-07 36.2 13.8 317 1-354 68-415 (415) 63 protein:vir:79987 Length: 415 95.8 0.0014 8.7E-07 36.2 13.8 317 1-354 68-415 (415) 64 protein:vir:81100 Length: 415 95.8 0.0014 8.7E-07 36.2 13.8 317 1-354 68-415 (415) 65 protein:vir:99920 Length: 311 95.8 0.0014 8.9E-07 36.2 14.7 296 31-390 1-311 (311) 66 protein:vir:41 Length: 299 # N 95.8 0.0015 9.4E-07 36.0 16.2 293 35-375 1-299 (299) 67 protein:vir:1025 Length: 408 # 95.7 0.00089 5.5E-07 37.3 11.8 304 1-344 63-408 (408) 68 protein:vir:105563 Length: 396 95.5 0.00018 1.1E-07 41.1 7.6 265 155-479 1-280 (396) 69 protein:vir:9410 Length: 415 # 95.5 0.0019 1.2E-06 35.5 13.0 321 1-354 68-415 (415) 70 protein:vir:3991 Length: 404 # 95.5 0.002 1.2E-06 35.4 17.3 316 1-367 63-404 (404) 71 protein:vir:95376 Length: 425 95.0 0.0014 8.8E-07 36.2 10.9 315 1-350 84-425 (425) 72 protein:vir:3845 Length: 395 # 95.0 0.003 1.9E-06 34.4 18.6 315 1-403 74-395 (395) 73 protein:vir:103370 Length: 418 94.8 0.00049 3E-07 38.7 7.7 325 1-364 27-418 (418) 74 protein:vir:5739 Length: 366 # 94.7 0.0035 2.2E-06 34.1 12.2 317 1-346 21-366 (366) 75 protein:vir:102119 Length: 404 94.1 0.0054 3.4E-06 33.0 16.0 326 1-394 57-404 (404) 76 protein:vir:98525 Length: 331 94.0 0.004 2.5E-06 33.7 11.0 313 1-368 1-331 (331) 77 protein:vir:107388 Length: 331 94.0 0.004 2.5E-06 33.7 11.0 313 1-368 1-331 (331) 78 protein:vir:107826 Length: 331 94.0 0.004 2.5E-06 33.7 11.0 313 1-368 1-331 (331) 79 protein:vir:4600 Length: 415 # 93.8 0.0064 3.9E-06 32.6 14.1 320 1-354 58-415 (415) 80 protein:vir:4700 Length: 415 # 93.8 0.0064 3.9E-06 32.6 14.1 320 1-354 58-415 (415) 81 protein:vir:1268 Length: 397 # 93.6 0.007 4.3E-06 32.4 17.8 292 1-357 70-397 (397) 82 protein:vir:2344 Length: 397 # 92.6 0.011 6.6E-06 31.4 16.2 319 15-393 1-397 (397) 83 protein:vir:81227 Length: 413 92.6 0.0082 5.1E-06 32.0 10.4 313 1-344 58-413 (413) 84 protein:vir:94673 Length: 419 92.0 0.013 8.3E-06 30.8 15.6 316 1-359 71-419 (419) 85 protein:vir:4197 Length: 314 # 91.7 0.014 9E-06 30.7 16.2 294 24-360 1-314 (314) 86 protein:vir:104256 Length: 458 91.5 0.015 9.6E-06 30.5 12.7 308 1-346 95-458 (458) 87 protein:vir:1328 Length: 392 # 91.3 0.017 1E-05 30.3 12.3 311 1-358 54-392 (392) 88 protein:vir:9704 Length: 394 # 90.6 0.02 1.2E-05 29.9 11.2 292 1-346 53-394 (394) 89 protein:vir:3033 Length: 272 # 90.4 0.021 1.3E-05 29.8 14.3 259 31-360 1-272 (272) 90 protein:vir:9820 Length: 272 # 90.4 0.021 1.3E-05 29.8 14.3 259 31-360 1-272 (272) 91 protein:vir:96762 Length: 632 90.0 0.023 1.4E-05 29.5 11.6 300 1-345 288-632 (632) 92 protein:vir:7409 Length: 408 # 89.8 0.024 1.5E-05 29.4 17.6 312 1-363 63-408 (408) 93 protein:vir:4511 Length: 409 # 89.7 0.025 1.5E-05 29.4 17.7 309 1-362 67-409 (409) 94 protein:vir:1084 Length: 437 # 89.0 0.0072 4.4E-06 32.3 6.7 310 1-342 100-437 (437) 95 protein:vir:4456 Length: 401 # 88.6 0.031 1.9E-05 28.8 14.9 309 1-388 51-401 (401) 96 protein:vir:100247 Length: 425 88.3 0.033 2.1E-05 28.7 15.2 311 1-375 92-425 (425) 97 protein:vir:98635 Length: 377 88.1 0.034 2.1E-05 28.6 10.6 320 1-394 39-377 (377) 98 protein:vir:7855 Length: 497 # 88.1 0.035 2.1E-05 28.6 12.2 311 1-349 98-497 (497) 99 protein:vir:101650 Length: 497 88.1 0.035 2.1E-05 28.6 12.2 311 1-349 98-497 (497) 100 protein:vir:3870 Length: 400 # 87.8 0.036 2.3E-05 28.5 12.3 283 1-358 82-400 (400) 101 protein:vir:96442 Length: 418 87.8 0.031 1.9E-05 28.9 9.4 325 1-376 27-418 (418) 102 protein:vir:102873 Length: 392 86.7 0.044 2.7E-05 28.0 15.2 315 1-366 51-392 (392) 103 protein:vir:102082 Length: 392 86.7 0.044 2.7E-05 28.0 15.2 315 1-366 51-392 (392) 104 protein:vir:107593 Length: 392 86.7 0.044 2.7E-05 28.0 15.2 315 1-366 51-392 (392) 105 protein:vir:105004 Length: 392 86.7 0.044 2.7E-05 28.0 15.2 315 1-366 51-392 (392) 106 protein:vir:485 Length: 407 # 86.2 0.048 2.9E-05 27.8 17.3 320 1-395 50-407 (407) 107 protein:vir:8420 Length: 477 # 85.8 0.05 3.1E-05 27.7 12.7 323 1-367 90-477 (477) 108 protein:vir:3158 Length: 321 # 85.2 0.054 3.4E-05 27.5 15.8 303 15-359 1-321 (321) 109 protein:vir:4159 Length: 315 # 84.9 0.057 3.5E-05 27.4 15.4 280 9-354 1-315 (315) 110 protein:vir:962 Length: 397 # 83.9 0.065 4E-05 27.1 12.3 297 1-390 91-397 (397) 111 protein:vir:95963 Length: 395 81.8 0.0073 4.5E-06 32.3 3.0 287 1-314 38-395 (395) 112 protein:vir:4092 Length: 390 # 75.8 0.14 8.9E-05 25.2 11.2 321 1-351 35-390 (390) 113 protein:vir:104342 Length: 314 75.5 0.089 5.5E-05 26.3 7.0 269 13-313 1-314 (314) 114 protein:vir:6212 Length: 434 # 75.4 0.15 9.1E-05 25.1 13.4 307 1-330 75-434 (434) 115 protein:vir:103285 Length: 296 73.9 0.16 0.0001 24.9 8.6 254 37-313 1-296 (296) 116 protein:vir:9643 Length: 377 # 73.7 0.14 8.7E-05 25.3 7.6 292 1-327 39-377 (377) 117 protein:vir:80128 Length: 466 72.7 0.18 0.00011 24.7 11.3 324 1-366 84-466 (466) 118 protein:vir:6242 Length: 390 # 71.8 0.19 0.00012 24.5 13.8 304 1-358 71-390 (390) 119 protein:vir:93616 Length: 645 70.4 0.21 0.00013 24.3 15.2 313 1-362 280-645 (645) 120 protein:vir:100884 Length: 389 66.0 0.27 0.00017 23.7 18.9 309 1-395 71-389 (389) 121 protein:vir:107423 Length: 681 59.0 0.22 0.00014 24.2 5.6 302 129-479 1-377 (681) 122 protein:vir:98487 Length: 681 59.0 0.22 0.00014 24.2 5.6 302 129-479 1-377 (681) 123 protein:vir:107802 Length: 681 59.0 0.22 0.00014 24.2 5.6 302 129-479 1-377 (681) 124 protein:vir:93742 Length: 274 52.2 0.56 0.00035 22.0 13.5 264 1-361 1-274 (274) 125 protein:vir:107687 Length: 319 50.6 0.6 0.00037 21.8 9.9 275 1-317 1-319 (319) 126 protein:vir:93881 Length: 387 39.0 1 0.00064 20.5 8.4 303 1-336 60-387 (387) 127 protein:vir:80068 Length: 301 37.8 1.1 0.00068 20.4 9.0 253 40-317 1-301 (301) 128 protein:vir:101607 Length: 379 34.4 1.3 0.0008 20.0 16.4 308 1-385 61-379 (379) 129 protein:vir:100172 Length: 394 28.7 1.7 0.0011 19.3 15.5 315 1-401 67-394 (394) 130 protein:vir:78640 Length: 352 26.8 1.9 0.0012 19.0 10.5 301 1-336 25-352 (352) 131 protein:vir:94494 Length: 274 26.3 2 0.0012 19.0 13.6 262 42-361 1-274 (274) 132 protein:vir:97433 Length: 274 26.3 2 0.0012 19.0 13.6 262 42-361 1-274 (274) 133 protein:vir:105334 Length: 276 22.6 2.4 0.0015 18.5 14.5 260 31-338 1-276 (276) 134 protein:vir:3613 Length: 272 # 22.5 2.4 0.0015 18.5 13.3 249 31-359 1-272 (272) 135 protein:vir:78350 Length: 383 21.6 2.6 0.0016 18.3 11.0 317 1-400 43-383 (383) 136 protein:vir:101291 Length: 381 20.2 2.8 0.0017 18.1 5.5 306 1-343 25-381 (381) 137 protein:vir:9509 Length: 381 # 20.2 2.8 0.0017 18.1 5.5 306 1-343 25-381 (381) No 1 >protein:vir:95603 Length: 463 # NCBI annotation: ORF016 # Family: family:all:2450 # MgeID: mge:1577 # MgeName: G1 # Cross-refs: genbank:acc:YP_240903;genbank:gi:66394965;genbank:GeneID:5132544 Probab=100.00 E-value=1.5e-198 Score=1105.14 Aligned_cols=462 Identities=59% Similarity=0.946 Sum_probs=446.3 Q ss_pred CcccccccceeeeecCchhHHHHHHHHHHHhhcCcccCcccccCccccchhhhHHHHHHHhhccccccchhhhccchhHH Q lcl|NC_018863. 1 MTELQKEQKVEARKLPAGAEAELAELVSKSFTTGTGITPDTQHDAAALRRELLDDQVKMLAFTNGDFTIYPLINKQQVNS 80 (479) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~e~~~Ksf~ag~~~~~~~~~~gaAlr~esld~~i~~l~~~~~~f~~~~~i~k~~~~s 80 (479) ||..+|+..++.+ .++++.|+++|||+|||+|+|++|+||+||||||||++|++|+|+++||+|||+|+|++++| T Consensus 1 ~~~~~~~~~~~~~-----~~~~~~e~~~KS~~tg~g~~p~~q~~~~AlR~EsL~~~i~~Lt~~~~~f~~~~~i~k~~a~S 75 (463) T protein:vir:95 1 MTIEKNLSDVQQK-----YADQFQEDVVKSFQTGYGITPDTQIDAGALRREILDDQITMLTWTNEDLIFYRDISRRPAQS 75 (463) T ss_pred CCcccccchHHHH-----HHhhhhHHHHHHhhcCCccCCccccCcchhhhhhhhhhhheeeecccchhhhhhcCCchhhh Confidence 9988887776643 46788899999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHhhhhhccCcccccccccccccccccCcceEEEEEEEEeeeehhhhhhhHhhhcchhhHHHHHHHHHHHHHHHHHHH Q lcl|NC_018863. 81 TVAKYAVFNQHGRTGHSRFVREVGVASINDPNIRQKTVQMKFLSDTKQQSLAAGLVNNIADPMTILTEDAISVIAKSIEW 160 (479) Q Consensus 81 tv~~y~~~~~~G~~g~~~fv~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lv~~~~dp~~~~~~~ai~~~~~~~e~ 160 (479) ||+||+++++||++||++|++|+|+++++||+|+||+++||||+++++||+++++||+++||+++|++|||+++||+||| T Consensus 76 TV~~y~~~~~~G~~g~~~f~~E~g~~~~~d~~~~Rr~~~~K~l~~~~~VS~~~~l~n~~~d~~~~~~~dai~~ia~tiE~ 155 (463) T protein:vir:95 76 TVVKYDQYLRHGNVGHSRFVKEIGVAPVSDPNIRQKTVSMKYVSDTKNMSIASGLVNNIADPSQILTEDAIAVVAKTIEW 155 (463) T ss_pred hhhhheeeeccCccccccccccccccccCCCceEEEEEEeeeeehhhhhhhHHHhhcccccHHHHHHHHHHHHHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHhhcccccCCCCCCcccchhhhHHHhhccCCcEEEccCCCCCHHHhhhhhheeecccCceeeeecChHHHhhHHHhhcC Q lcl|NC_018863. 161 AIFYGDAALAAEADNQAGIEFDGLTKLIDEATNVIDLKGERLDEATLNKAAVIVGKGYGRATDAFMPIGVQADFTNNLLD 240 (479) Q Consensus 161 a~f~Gd~~l~~~~~~~~gleFDGl~~~I~~~~NviDarG~~l~~~~l~~aa~~i~~~fG~atd~~mp~~vka~f~q~~~~ 240 (479) +|||||++|+|. +.++|||||||.++|+ ++|||||||++||+++||+|+++|+++||+|||+|||+++|++|+|++++ T Consensus 156 a~FyGds~l~~~-~~~~gleFDGl~~lId-~enviDarG~~Ls~~~ln~Aa~~i~~~fGt~TD~~lp~~vka~f~~~~l~ 233 (463) T protein:vir:95 156 ASFYGDASLTSE-VEGEGLEFDGLAKLID-KNNVINAKGNQLTEKHLNEAAVRIGKGFGTATDAYMPIGVHADFVNSILG 233 (463) T ss_pred HHhhhhhccCCC-cCccccchhhhhhhcC-CCCeeecCCCcccHHHHhhhhhhhhcccCChhheecchHHHHHHHHHhcC Confidence 999999999996 4578999999999997 79999999999999999999999999999999999999999999999999 Q ss_pred ceeEEeecCCCccccCccccceecCceeEEecCCcccCCCccccCc-ccCCCCCcccceEEEeecccccCccccccccee Q lcl|NC_018863. 241 RQRVIQPSQAGGFSTGFSINQFLSTRGAINLHGSTIMENDNILVDR-IPEPNAPQAPASVVATVKVNDKGAFRPVKDIKT 319 (479) Q Consensus 241 ~qrv~~~~n~g~~~~G~~V~~~~ss~g~I~L~~s~v~~a~~~lver-~~s~~aP~~P~~vta~~~~~~~g~~~~~sd~g~ 319 (479) +|||++++|+|++.+|++|++|.+++|.|+||++++|+.+.++.+. ...+++| +|..++++.+++.+|+|+..++.+. T Consensus 234 ~qrv~~~~N~~~~~~G~~v~~f~s~~G~I~L~~s~~m~~~~il~~~~~~~p~ap-~~~~~tatv~~~~~~~~~~~~~~a~ 312 (463) T protein:vir:95 234 RQMQLMQDNSGNVNTGYSVNGFYSSRGFIKLHGSTVMENELILDESLQPLPNAP-QPAKVTATVETKQKGAFENEEDRAG 312 (463) T ss_pred ceEEEEcCCCCceeeeeeccceeeeeeeeeeCCceecCCcccccchhhcCCCCc-cCceeEEEEeeccCCCCCCcccccc Confidence 9999999999999999999999999999999999999999999974 4456666 5666888888999999999999999 Q ss_pred eEEEEEEEcCCCCcccccceeeeeecCCCeEEEEEeecCCccccceEEEEEeccCCCCcEEEEEEeeeeeccCCCeeEEE Q lcl|NC_018863. 320 HSYKVVVHSDDAESLASEAVTAVVANPTDSVSLAVKLQSLYQAKPQFISVYRQGNETGHYFLVARVPLSKADENGVITFV 399 (479) Q Consensus 320 Y~YkV~a~n~~GES~~S~~VtaT~a~~~~~V~LtIt~~~~~~~~~~y~~IYR~t~~~g~~~~i~rV~~s~~n~~~tttf~ 399 (479) |||||+++|++|||+||++|++|+++++++|+|+|+++++++.+|+|++||||++++|+|++|+|||++.+|+++||+|+ T Consensus 313 ~~Y~vv~~s~~geS~pS~ivtaT~a~~~~gv~l~It~~a~~~~~~~~v~IYR~~~~~g~~~~i~rv~v~~an~~gttt~~ 392 (463) T protein:vir:95 313 LSYKVVVNSDDAQSAPSEEVTATVSNVDDGVKLSINVNAMYQQQPQFVSIYRQGKETGMYFLIKRVPVKDAQEDGTIVFV 392 (463) T ss_pred eEEEEEEECCCCCcccchheeeeeeeccceEEEEEEecCCcccceeEEEEEeecCCCCcceeEEEEEecccCCCceEEEe Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eccCCCCCccceeeccccHHHHHHHHhccccccCccccCchhHHHHHhhhhhheeccceeEEEEeccccCc Q lcl|NC_018863. 400 DRNQVIPETTDVFIGELTPQVISLLELLPMMKLPLAQMNATTTFTVLWYGALALYAPKKWVRIKNVQYIPA 470 (479) Q Consensus 400 D~N~~iPgT~~~fvge~~~q~i~l~ellPm~k~Pla~~~~~~~~~V~~yg~L~l~aPkk~~~ikNV~~~~~ 470 (479) |+|+|||||+++|||||||++|+|+|||||+|||||+.|++++|||+|||+|+|+|||||++||||+|+|- T Consensus 393 D~n~~IPgt~~vfVgems~~ti~~~ellPm~klpLA~~~~~~~waVl~YGaLal~~Pk~~~~ikNv~~~~v 463 (463) T protein:vir:95 393 DKNETLPETADVFVGEMSPQVVHLFELLPMMKLPLAQINASITFAVLWYGALALRAPKKWARIKNVRYIAV 463 (463) T ss_pred ecccccCCceeEeeeccCchhhhhHhhhHhhhCCchhccchhhhHHHHhhHHHhhccccceEEEEeeEecC Confidence 99999999999999999999999999999999999999999999999999999999999999999999998 No 2 >protein:vir:99311 Length: 463 # NCBI annotation: putative capsid protein # Family: family:all:2450 # MgeID: mge:1655 # MgeName: K # Cross-refs: genbank:acc:YP_024474;genbank:gi:48696433;genbank:GeneID:2948039 Probab=100.00 E-value=1.5e-198 Score=1105.14 Aligned_cols=462 Identities=59% Similarity=0.946 Sum_probs=446.3 Q ss_pred CcccccccceeeeecCchhHHHHHHHHHHHhhcCcccCcccccCccccchhhhHHHHHHHhhccccccchhhhccchhHH Q lcl|NC_018863. 1 MTELQKEQKVEARKLPAGAEAELAELVSKSFTTGTGITPDTQHDAAALRRELLDDQVKMLAFTNGDFTIYPLINKQQVNS 80 (479) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~e~~~Ksf~ag~~~~~~~~~~gaAlr~esld~~i~~l~~~~~~f~~~~~i~k~~~~s 80 (479) ||..+|+..++.+ .++++.|+++|||+|||+|+|++|+||+||||||||++|++|+|+++||+|||+|+|++++| T Consensus 1 ~~~~~~~~~~~~~-----~~~~~~e~~~KS~~tg~g~~p~~q~~~~AlR~EsL~~~i~~Lt~~~~~f~~~~~i~k~~a~S 75 (463) T protein:vir:99 1 MTIEKNLSDVQQK-----YADQFQEDVVKSFQTGYGITPDTQIDAGALRREILDDQITMLTWTNEDLIFYRDISRRPAQS 75 (463) T ss_pred CCcccccchHHHH-----HHhhhhHHHHHHhhcCCccCCccccCcchhhhhhhhhhhheeeecccchhhhhhcCCchhhh Confidence 9988887776643 46788899999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHhhhhhccCcccccccccccccccccCcceEEEEEEEEeeeehhhhhhhHhhhcchhhHHHHHHHHHHHHHHHHHHH Q lcl|NC_018863. 81 TVAKYAVFNQHGRTGHSRFVREVGVASINDPNIRQKTVQMKFLSDTKQQSLAAGLVNNIADPMTILTEDAISVIAKSIEW 160 (479) Q Consensus 81 tv~~y~~~~~~G~~g~~~fv~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lv~~~~dp~~~~~~~ai~~~~~~~e~ 160 (479) ||+||+++++||++||++|++|+|+++++||+|+||+++||||+++++||+++++||+++||+++|++|||+++||+||| T Consensus 76 TV~~y~~~~~~G~~g~~~f~~E~g~~~~~d~~~~Rr~~~~K~l~~~~~VS~~~~l~n~~~d~~~~~~~dai~~ia~tiE~ 155 (463) T protein:vir:99 76 TVVKYDQYLRHGNVGHSRFVKEIGVAPVSDPNIRQKTVSMKYVSDTKNMSIASGLVNNIADPSQILTEDAIAVVAKTIEW 155 (463) T ss_pred hhhhheeeeccCccccccccccccccccCCCceEEEEEEeeeeehhhhhhhHHHhhcccccHHHHHHHHHHHHHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHhhcccccCCCCCCcccchhhhHHHhhccCCcEEEccCCCCCHHHhhhhhheeecccCceeeeecChHHHhhHHHhhcC Q lcl|NC_018863. 161 AIFYGDAALAAEADNQAGIEFDGLTKLIDEATNVIDLKGERLDEATLNKAAVIVGKGYGRATDAFMPIGVQADFTNNLLD 240 (479) Q Consensus 161 a~f~Gd~~l~~~~~~~~gleFDGl~~~I~~~~NviDarG~~l~~~~l~~aa~~i~~~fG~atd~~mp~~vka~f~q~~~~ 240 (479) +|||||++|+|. +.++|||||||.++|+ ++|||||||++||+++||+|+++|+++||+|||+|||+++|++|+|++++ T Consensus 156 a~FyGds~l~~~-~~~~gleFDGl~~lId-~enviDarG~~Ls~~~ln~Aa~~i~~~fGt~TD~~lp~~vka~f~~~~l~ 233 (463) T protein:vir:99 156 ASFYGDASLTSE-VEGEGLEFDGLAKLID-KNNVINAKGNQLTEKHLNEAAVRIGKGFGTATDAYMPIGVHADFVNSILG 233 (463) T ss_pred HHhhhhhccCCC-cCccccchhhhhhhcC-CCCeeecCCCcccHHHHhhhhhhhhcccCChhheecchHHHHHHHHHhcC Confidence 999999999996 4578999999999997 79999999999999999999999999999999999999999999999999 Q ss_pred ceeEEeecCCCccccCccccceecCceeEEecCCcccCCCccccCc-ccCCCCCcccceEEEeecccccCccccccccee Q lcl|NC_018863. 241 RQRVIQPSQAGGFSTGFSINQFLSTRGAINLHGSTIMENDNILVDR-IPEPNAPQAPASVVATVKVNDKGAFRPVKDIKT 319 (479) Q Consensus 241 ~qrv~~~~n~g~~~~G~~V~~~~ss~g~I~L~~s~v~~a~~~lver-~~s~~aP~~P~~vta~~~~~~~g~~~~~sd~g~ 319 (479) +|||++++|+|++.+|++|++|.+++|.|+||++++|+.+.++.+. ...+++| +|..++++.+++.+|+|+..++.+. T Consensus 234 ~qrv~~~~N~~~~~~G~~v~~f~s~~G~I~L~~s~~m~~~~il~~~~~~~p~ap-~~~~~tatv~~~~~~~~~~~~~~a~ 312 (463) T protein:vir:99 234 RQMQLMQDNSGNVNTGYSVNGFYSSRGFIKLHGSTVMENELILDESLQPLPNAP-QPAKVTATVETKQKGAFENEEDRAG 312 (463) T ss_pred ceEEEEcCCCCceeeeeeccceeeeeeeeeeCCceecCCcccccchhhcCCCCc-cCceeEEEEeeccCCCCCCcccccc Confidence 9999999999999999999999999999999999999999999974 4456666 5666888888999999999999999 Q ss_pred eEEEEEEEcCCCCcccccceeeeeecCCCeEEEEEeecCCccccceEEEEEeccCCCCcEEEEEEeeeeeccCCCeeEEE Q lcl|NC_018863. 320 HSYKVVVHSDDAESLASEAVTAVVANPTDSVSLAVKLQSLYQAKPQFISVYRQGNETGHYFLVARVPLSKADENGVITFV 399 (479) Q Consensus 320 Y~YkV~a~n~~GES~~S~~VtaT~a~~~~~V~LtIt~~~~~~~~~~y~~IYR~t~~~g~~~~i~rV~~s~~n~~~tttf~ 399 (479) |||||+++|++|||+||++|++|+++++++|+|+|+++++++.+|+|++||||++++|+|++|+|||++.+|+++||+|+ T Consensus 313 ~~Y~vv~~s~~geS~pS~ivtaT~a~~~~gv~l~It~~a~~~~~~~~v~IYR~~~~~g~~~~i~rv~v~~an~~gttt~~ 392 (463) T protein:vir:99 313 LSYKVVVNSDDAQSAPSEEVTATVSNVDDGVKLSINVNAMYQQQPQFVSIYRQGKETGMYFLIKRVPVKDAQEDGTIVFV 392 (463) T ss_pred eEEEEEEECCCCCcccchheeeeeeeccceEEEEEEecCCcccceeEEEEEeecCCCCcceeEEEEEecccCCCceEEEe Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eccCCCCCccceeeccccHHHHHHHHhccccccCccccCchhHHHHHhhhhhheeccceeEEEEeccccCc Q lcl|NC_018863. 400 DRNQVIPETTDVFIGELTPQVISLLELLPMMKLPLAQMNATTTFTVLWYGALALYAPKKWVRIKNVQYIPA 470 (479) Q Consensus 400 D~N~~iPgT~~~fvge~~~q~i~l~ellPm~k~Pla~~~~~~~~~V~~yg~L~l~aPkk~~~ikNV~~~~~ 470 (479) |+|+|||||+++|||||||++|+|+|||||+|||||+.|++++|||+|||+|+|+|||||++||||+|+|- T Consensus 393 D~n~~IPgt~~vfVgems~~ti~~~ellPm~klpLA~~~~~~~waVl~YGaLal~~Pk~~~~ikNv~~~~v 463 (463) T protein:vir:99 393 DKNETLPETADVFVGEMSPQVVHLFELLPMMKLPLAQINASITFAVLWYGALALRAPKKWARIKNVRYIAV 463 (463) T ss_pred ecccccCCceeEeeeccCchhhhhHhhhHhhhCCchhccchhhhHHHHhhHHHhhccccceEEEEeeEecC Confidence 99999999999999999999999999999999999999999999999999999999999999999999998 No 3 >protein:vir:96666 Length: 462 # NCBI annotation: ORF016 # Family: family:all:2450 # MgeID: mge:1623 # MgeName: Twort # Cross-refs: genbank:acc:YP_238545;genbank:gi:66391271;genbank:GeneID:5130448 Probab=100.00 E-value=5.9e-197 Score=1096.38 Aligned_cols=461 Identities=57% Similarity=0.928 Sum_probs=444.4 Q ss_pred cccccceeeeecCchhHHHHHHHHHHHhhcCcccCcccccCccccchhhhHHHHHHHhhccccccchhhhccchhHHHHH Q lcl|NC_018863. 4 LQKEQKVEARKLPAGAEAELAELVSKSFTTGTGITPDTQHDAAALRRELLDDQVKMLAFTNGDFTIYPLINKQQVNSTVA 83 (479) Q Consensus 4 ~~~~~~~~~~~~~~~~~~~~~e~~~Ksf~ag~~~~~~~~~~gaAlr~esld~~i~~l~~~~~~f~~~~~i~k~~~~stv~ 83 (479) .-++.+++.++.|++++.+ |+++|||+|||+|+|++|++|+||||||||++|++|+|+++||+|||+|+|++++|||+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~--e~~~KS~~tg~g~~p~~q~~~gAlR~esL~~~i~~Lt~~~~~~~~~~~i~k~~a~sTv~ 78 (462) T protein:vir:96 1 MHKDTNLTAEQNKYADKFQ--EEVMKSYQTGYGITPDTQVDAGALRREILDDQITMLTWTQDDLIFYREISRRPAQSTVQ 78 (462) T ss_pred Cccccccchhhhhhhchhh--HHHHHHHhcCCCcCCccccccchhhhhhhhhhhheeeecccchhhhhhcCCchhhhhhh Confidence 4478899999999999886 99999999999999999999999999999999999999999999999999999999999 Q ss_pred HhhhhhccCcccccccccccccccccCcceEEEEEEEEeeeehhhhhhhHhhhcchhhHHHHHHHHHHHHHHHHHHHHHh Q lcl|NC_018863. 84 KYAVFNQHGRTGHSRFVREVGVASINDPNIRQKTVQMKFLSDTKQQSLAAGLVNNIADPMTILTEDAISVIAKSIEWAIF 163 (479) Q Consensus 84 ~y~~~~~~G~~g~~~fv~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lv~~~~dp~~~~~~~ai~~~~~~~e~a~f 163 (479) ||+++++||++||++|++|+|+++++||+|+||+++||||++++++|++++|||+++||+++|++|||+++||+|||+|| T Consensus 79 ~y~~~~~~G~~g~~~f~~E~g~~~~~d~~~~R~~~~~k~l~~t~~vsi~~tl~n~~~d~~~~~~~dai~~~a~tiE~a~F 158 (462) T protein:vir:96 79 KYDVYLRHGNVGHSRFVREVGVAPVSDPNIRQKTVEMKYVSDTKNLSIASTLVNNIQDPMQILTEDAIAVVAKTIEWASF 158 (462) T ss_pred hheeeeccCccccccccccccccccCCCceEEEEEEEEEEeeeeeechhhhhccchhhHHHHHHHHHHHHHHHHHHHHHh Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hcccccCCCCCCcccchhhhHHHhhccCCcEEEccCCCCCHHHhhhhhheeecccCceeeeecChHHHhhHHHhhcCcee Q lcl|NC_018863. 164 YGDAALAAEADNQAGIEFDGLTKLIDEATNVIDLKGERLDEATLNKAAVIVGKGYGRATDAFMPIGVQADFTNNLLDRQR 243 (479) Q Consensus 164 ~Gd~~l~~~~~~~~gleFDGl~~~I~~~~NviDarG~~l~~~~l~~aa~~i~~~fG~atd~~mp~~vka~f~q~~~~~qr 243 (479) |||++|+|+++ ++|||||||.+||+ ++|||||||++||+++||+||++|+++||+|||+|||+++|++|+|+++++|| T Consensus 159 ygds~l~~~~~-~~gleFDGl~~lI~-~~NViDarG~~Ls~~~ln~aa~~i~~~fGt~TD~~~p~~v~a~f~~~~l~~qr 236 (462) T protein:vir:96 159 YGDASLTADPT-GQGLEFDGLAKLID-KDNVIDAKGESLTETLLNRSAVLIGKSFGTATDAYMPIGVHADFVNSVLGRQM 236 (462) T ss_pred hhhcccCCCcc-ccccchhhhhhhcC-CCceeecCCCCccHHHHhhhhhhcccccCChhheecchHHHHHHHHhhcCceE Confidence 99999999876 56999999999997 79999999999999999999999999999999999999999999999999999 Q ss_pred EEeecCCCccccCccccceecCceeEEecCCcccCCCccccCc-ccCCCCCcccceEEEeecccccCcccccccceeeEE Q lcl|NC_018863. 244 VIQPSQAGGFSTGFSINQFLSTRGAINLHGSTIMENDNILVDR-IPEPNAPQAPASVVATVKVNDKGAFRPVKDIKTHSY 322 (479) Q Consensus 244 v~~~~n~g~~~~G~~V~~~~ss~g~I~L~~s~v~~a~~~lver-~~s~~aP~~P~~vta~~~~~~~g~~~~~sd~g~Y~Y 322 (479) |+|++|+|++.+|++|++|.+++|.|+||++++|+++.++.+. ...|++| +|+.+++++.+..+|.|....|.++|+| T Consensus 237 v~~~~n~g~~~~G~~v~~f~s~~G~I~L~~s~~m~~~~i~~~~~~~~p~ap-~~~~vsaTv~t~~~g~f~~~~d~~~y~Y 315 (462) T protein:vir:96 237 QLMQDNSGNVNAGYNVQGFYSSRGFIKLHGSTVMENELILDESLQPLPNAP-QPATVKATVETGKKGLFTDEHDRAELTY 315 (462) T ss_pred EEEcCCCCceeeeeeccceeeeeeeeeeCCceecCcccccccccccCCCCC-CCCceeEEEEeCCCCCCCCccCceeEEE Confidence 9999999999999999999999999999999999999999974 4445655 5567888888888887755448999999 Q ss_pred EEEEEcCCCCcccccceeeeeecCCCeEEEEEeecCCccccceEEEEEeccCCCCcEEEEEEeeeeeccCCCeeEEEecc Q lcl|NC_018863. 323 KVVVHSDDAESLASEAVTAVVANPTDSVSLAVKLQSLYQAKPQFISVYRQGNETGHYFLVARVPLSKADENGVITFVDRN 402 (479) Q Consensus 323 kV~a~n~~GES~~S~~VtaT~a~~~~~V~LtIt~~~~~~~~~~y~~IYR~t~~~g~~~~i~rV~~s~~n~~~tttf~D~N 402 (479) ||+++|.+|||.||++|++|++++.++|+|+|+|+++++++|+||+|||+++++|+|++|+|||++.+|++++++|+|+| T Consensus 316 ~V~avs~dgeS~PS~~VtaTva~~~~gv~ltIt~~a~~~~~~~~~~IYRk~~~sg~y~li~rv~~~~~n~~gt~tf~D~n 395 (462) T protein:vir:96 316 KVVVNSDDAQSAPSEAVTATVNNATDGVKLEISVNAMYQQQPQFVSIYRQGRKTGDFYLIKRLGMKEVNDEGKLVFYDLN 395 (462) T ss_pred EEEEECCCCccccceeeEeeeecccccceEEEEEcCCccccceEEEEEeecCCccccceeeeeeceeecCCcceeEeecc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CCCCCccceeeccccHHHHHHHHhccccccCccccCchhHHHHHhhhhhheeccceeEEEEeccccC Q lcl|NC_018863. 403 QVIPETTDVFIGELTPQVISLLELLPMMKLPLAQMNATTTFTVLWYGALALYAPKKWVRIKNVQYIP 469 (479) Q Consensus 403 ~~iPgT~~~fvge~~~q~i~l~ellPm~k~Pla~~~~~~~~~V~~yg~L~l~aPkk~~~ikNV~~~~ 469 (479) ++||||+++|||||+||+|+|+|||||||||||+.+++++|||+|||+|+|+|||||+|||||+|+- T Consensus 396 ~~iPgt~~~fVge~~p~vi~~~qllpm~~~plA~~n~~~~waVl~yG~Lal~~Pk~~~~ikNv~~~~ 462 (462) T protein:vir:96 396 ETIPETTDVFVGEMSPQVLHLFELLPMMKLPLAQINASVTFAVLWYGALALRAPKKWVRIKNVKYIV 462 (462) T ss_pred CCCCCcccceeecCCchhhhhhhhhhhhhcCcccccchhhhhhhhhhHHHhhcccccEEEEEEEEeC Confidence 9999999999999999999999999999999999999999999999999999999999999999998 No 4 >protein:vir:80835 Length: 464 # NCBI annotation: putative major capsid protein # Family: family:all:2450 # MgeID: mge:1885 # MgeName: phiEF24C # Cross-refs: genbank:acc:YP_001504125;genbank:gi:158079312;genbank:GeneID:5666484 Probab=100.00 E-value=4.9e-192 Score=1069.39 Aligned_cols=462 Identities=57% Similarity=0.914 Sum_probs=430.9 Q ss_pred CcccccccceeeeecCchhHHHHHHHHHHHhhcCcccCcccccCccccchhhhHHHHHHHhhccccccchhhhccchhHH Q lcl|NC_018863. 1 MTELQKEQKVEARKLPAGAEAELAELVSKSFTTGTGITPDTQHDAAALRRELLDDQVKMLAFTNGDFTIYPLINKQQVNS 80 (479) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~e~~~Ksf~ag~~~~~~~~~~gaAlr~esld~~i~~l~~~~~~f~~~~~i~k~~~~s 80 (479) |++ |++++ .++++ ..|.++|||+|||+++|++|+||+||||||||++|++|+|+++||+|||+|+|++++| T Consensus 1 ~~~---~~n~~-~~~~~-----~~e~~~Ks~ttgy~~~p~~q~~~~AlRrEsL~~~i~~Lt~~~~~f~f~~di~k~~a~S 71 (464) T protein:vir:80 1 MTE---KKNTE-RQLTS-----VQEEVIKGFTTGYGITPESQTDAAALRREFLDDQITMLTWADGDLSFYRDITKRPATS 71 (464) T ss_pred CCc---chhhH-hhcCc-----ccHHHHHHHHhCCccCcccccCcchhhhhhhhhhhheeeecccchhhhhhcCCchhhh Confidence 654 22322 22322 2345679999999999999999999999999999999999999999999999999999 Q ss_pred HHHHhhhhhccCcccccccccccccccccCcceEEEEEEEEeeeehhhhhhhHhhhcchhhHHHHHHHHHHHHHHHHHHH Q lcl|NC_018863. 81 TVAKYAVFNQHGRTGHSRFVREVGVASINDPNIRQKTVQMKFLSDTKQQSLAAGLVNNIADPMTILTEDAISVIAKSIEW 160 (479) Q Consensus 81 tv~~y~~~~~~G~~g~~~fv~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lv~~~~dp~~~~~~~ai~~~~~~~e~ 160 (479) ||+||+++++||++||++|++|+|+++++||+|+||+++||||+++|++|++++|||+++||+.+|++|||+++||+||| T Consensus 72 TV~~y~~~~~~G~~g~~~f~~E~g~~~~~d~~~~Rr~~~~Kfl~~~r~vsia~~lvn~~~d~~~~~~~dai~~va~tiE~ 151 (464) T protein:vir:80 72 TVAKYDVYLAHGRVGHTRFTREIGVAPISDPNLRQKTVNMKYVSDTKNMSIATGLVNNIEDPMRILTDDAISVVAKTIEW 151 (464) T ss_pred hhhhhheeeccCccccccccccccccccCCCceEEEEEEeeeeecceeeeeehhhhcchhhHHHHHHHHHHHHHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHhhcccccCCCCCCcccchhhhHHHhhccCCcEEEccCCCCCHHHhhhhhheeecccCceeeeecChHHHhhHHHhhcC Q lcl|NC_018863. 161 AIFYGDAALAAEADNQAGIEFDGLTKLIDEATNVIDLKGERLDEATLNKAAVIVGKGYGRATDAFMPIGVQADFTNNLLD 240 (479) Q Consensus 161 a~f~Gd~~l~~~~~~~~gleFDGl~~~I~~~~NviDarG~~l~~~~l~~aa~~i~~~fG~atd~~mp~~vka~f~q~~~~ 240 (479) +|||||++|++++++|+|||||||++||+ ++|||||||++||+++||+||++|+++||+|||+|||+++|++|.+.+++ T Consensus 152 a~FyGds~l~~~~~~~~gleFDGl~~lI~-~~NViDarG~~Ls~~~ln~Aa~~i~~~fGt~TD~~lp~~v~a~f~n~~l~ 230 (464) T protein:vir:80 152 ASFYGDSDLSENPDAGSGLEFDGLAKLID-KHNVLDAKGASLTEALLNQASVLVGKGYGTPTDAYMPIGVQADFVNQQLD 230 (464) T ss_pred HHhhhccccCCCCCCccccchhhhHhhcC-CCceeecCCCCcCHHHHhhhhhhhhcccCChhhcccchhHHHHHHhhhcC Confidence 99999999999999999999999999997 79999999999999999999999999999999999999999999999999 Q ss_pred ceeEEeecCCCccccCccccceecCceeEEecCCcccCCCccccCcccC-CCCCcccceEEEeecccccCccccccccee Q lcl|NC_018863. 241 RQRVIQPSQAGGFSTGFSINQFLSTRGAINLHGSTIMENDNILVDRIPE-PNAPQAPASVVATVKVNDKGAFRPVKDIKT 319 (479) Q Consensus 241 ~qrv~~~~n~g~~~~G~~V~~~~ss~g~I~L~~s~v~~a~~~lver~~s-~~aP~~P~~vta~~~~~~~g~~~~~sd~g~ 319 (479) +||+++.+|.++..+|++|++|.|++|.|+||+|++|+.+.++++.... +++|.+|+ ++++.++..+|.|++....+. T Consensus 231 ~q~~~~~~n~~~~~~G~~v~~f~sa~G~i~L~~s~~m~~~~~ld~~~~~~~~apaaps-vt~tv~~~~~g~f~~~~~~~~ 309 (464) T protein:vir:80 231 RQVQVISDNGQNATMGFNVKGFNSARGFIRLHGSTVMELEQILDENRMQLPNAPQKAT-VKATLEAGTKGKFRDEDLTID 309 (464) T ss_pred ceeEEEcCCCCcceeeeecccccccccceeccCccccCcccccccccccCCCCcCCce-eEEEecCCcccCCccccccce Confidence 9999999999999999999999999999999999999999999985444 77776555 556777888899887665678 Q ss_pred eEEEEEEEcCCCCcccccceeeeeecCCCeEEEEEeecCCccccceEEEEEeccCCCCcEEEEEEeeeeeccCCCeeEEE Q lcl|NC_018863. 320 HSYKVVVHSDDAESLASEAVTAVVANPTDSVSLAVKLQSLYQAKPQFISVYRQGNETGHYFLVARVPLSKADENGVITFV 399 (479) Q Consensus 320 Y~YkV~a~n~~GES~~S~~VtaT~a~~~~~V~LtIt~~~~~~~~~~y~~IYR~t~~~g~~~~i~rV~~s~~n~~~tttf~ 399 (479) |+|||+++|++|||+||+++++|++...++|+|+|++++++++.|+|++||||+.++|+||+|+|||++++ .+|+++|+ T Consensus 310 ~~Ykv~~vn~~GeS~ps~~~~~ti~~~~~~V~l~it~~~~~~~~p~yv~IYR~~~~~g~f~~i~rv~~~~~-~~gt~t~v 388 (464) T protein:vir:80 310 TEYKVVVVSDDAESAPSDVASVVIDDKKKQVKLEITINNMYQARPQYVAIYRKGLETGLFYQIARVPASKA-VEGVITFI 388 (464) T ss_pred eEEEEEEECCCCccccceeeeeeecCcccEEEEEEEeCCccccccceEEEEeecCCCCceeEEEEEeeccc-cCCceEEE Confidence 99999999999999999999999999999999999999999999999999999999999999999999997 48889999 Q ss_pred eccCCCCCccceeeccccHHHHHHHHhccccccCccccCchhHHHHHhhhhhheeccceeEEEEeccccCccccccccCC Q lcl|NC_018863. 400 DRNQVIPETTDVFIGELTPQVISLLELLPMMKLPLAQMNATTTFTVLWYGALALYAPKKWVRIKNVQYIPALAADVTYRP 479 (479) Q Consensus 400 D~N~~iPgT~~~fvge~~~q~i~l~ellPm~k~Pla~~~~~~~~~V~~yg~L~l~aPkk~~~ikNV~~~~~~~~~~~~~~ 479 (479) |+|+|||||+++|||||||+||+|+|||||+|||||+.|++++|||+|||+|+|+|||||+|||||+|++-- |-- T Consensus 389 D~n~~IPgt~~vfVgems~~ti~l~ellPm~rlplA~~n~~~~waVl~YGaLal~aPk~~~~ikNv~~~~~~-----~~~ 463 (464) T protein:vir:80 389 DVNDEIPETADVFVGELTPSVVHLFELLPMMRLPLAQVNASVTFAVLWYGALALRAPKKWARIKNVKYIATG-----NVF 463 (464) T ss_pred ecccccCCceeEeeecCCchHHHHHHHHHhhhCCchhcccchhhhhhhhhHHhhhccccceEEEEEEEeecc-----cCC Confidence 999999999999999999999999999999999999999999999999999999999999999999998521 111 No 5 >protein:vir:63741 Length: 468 # NCBI annotation: Cps # Family: family:all:2450 # MgeID: mge:1517 # MgeName: P100 # Cross-refs: genbank:gi:82547622;genbank:GeneID:3783474 Probab=100.00 E-value=1.4e-189 Score=1055.96 Aligned_cols=467 Identities=58% Similarity=0.920 Sum_probs=442.1 Q ss_pred CcccccccceeeeecCchhHHHHHHHHHHHhhcCcccCcccccCccccchhhhHHHHHHHhhccccccchhhhccchhHH Q lcl|NC_018863. 1 MTELQKEQKVEARKLPAGAEAELAELVSKSFTTGTGITPDTQHDAAALRRELLDDQVKMLAFTNGDFTIYPLINKQQVNS 80 (479) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~e~~~Ksf~ag~~~~~~~~~~gaAlr~esld~~i~~l~~~~~~f~~~~~i~k~~~~s 80 (479) ||+.+||..| +++++.+|||+++|||+|||+|+|++|+||+||||||||++|++|+|+++||+|||+|+|++++| T Consensus 1 ~~~~~~~~~~-----~~~~~~~~~e~~~Ks~~agy~~~p~~q~~~~AlR~EsL~~~i~~L~~~~~~f~~~~di~k~~a~s 75 (468) T protein:vir:63 1 MPKNNKEEEV-----KEVNLNSVQEDALKSFTTGYGITPDTQTDAGALRREFLDDQISMLTWTENDLTFYKDIAKKPATS 75 (468) T ss_pred CCCCcchhhc-----cccChhHHHHHHHHHHHcCcccCCccccCcchhhhhhhhhhhheeeecccchhhhhhcccchhhh Confidence 9999999864 44555566799999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHhhhhhccCcccccccccccccccccCcceEEEEEEEEeeeehhhhhhhHhhhcchhhHHHHHHHHHHHHHHHHHHH Q lcl|NC_018863. 81 TVAKYAVFNQHGRTGHSRFVREVGVASINDPNIRQKTVQMKFLSDTKQQSLAAGLVNNIADPMTILTEDAISVIAKSIEW 160 (479) Q Consensus 81 tv~~y~~~~~~G~~g~~~fv~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lv~~~~dp~~~~~~~ai~~~~~~~e~ 160 (479) ||+||+++++||++||++|++|+|+++++||+|+||+++||||++++++|+++++||+|+||+++|+++||+++||+||| T Consensus 76 tv~~y~~~~~~G~~g~~~f~~E~g~~~~~~~~~~r~~~~~k~l~~~~~vs~~~~l~n~i~d~~~~~~~~ai~~~a~tiE~ 155 (468) T protein:vir:63 76 TVAKYDVYMQHGKVGHTRFTREIGVAPVSDPNIRQKTVNMKFASDTKNISIAAGLVNNIQDPMQILTDDAIVNIAKTIEW 155 (468) T ss_pred hhhhheeeeccCccccccccccccccccCCCceEEEEEEeeeeeeeeeehhhhhhhcchhhHHHHHHHHHHHHHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHhhcccccCCCCCCcccchhhhHHHhhccCCcEEEccCCCCCHHHhhhhhheeecccCceeeeecChHHHhhHHHhhcC Q lcl|NC_018863. 161 AIFYGDAALAAEADNQAGIEFDGLTKLIDEATNVIDLKGERLDEATLNKAAVIVGKGYGRATDAFMPIGVQADFTNNLLD 240 (479) Q Consensus 161 a~f~Gd~~l~~~~~~~~gleFDGl~~~I~~~~NviDarG~~l~~~~l~~aa~~i~~~fG~atd~~mp~~vka~f~q~~~~ 240 (479) +|||||++|.+.+++++|||||||+++|+ ++||||+||++|++++||+|+++|+++||++||+|||+++|++|++.++. T Consensus 156 a~FyGds~l~~s~~~~~glqfDGi~~li~-~enviDa~G~~ls~~~lneaa~~i~~gfG~~td~~~~~~v~a~~~~~~L~ 234 (468) T protein:vir:63 156 ASFFGDSDLSDSPEPQAGLEFDGLAKLIN-QDNVHDARGASLTESLLNQAAVMISKGYGTPTDAYMPVGVQADFVNQQLS 234 (468) T ss_pred HhhhcccccccCCCccccccccceeEEec-CCceeccCCCccCHHHHHHHhhhccccccChhhhhcchhHHhhhhhhhcC Confidence 99999999998888899999999999997 79999999999999999999999999999999999999999999999999 Q ss_pred ceeEEeecCCCccccCccccceecCceeEEecCCcccCCCccccCcccC-CCCCcccceEEEeecccccCccccccccee Q lcl|NC_018863. 241 RQRVIQPSQAGGFSTGFSINQFLSTRGAINLHGSTIMENDNILVDRIPE-PNAPQAPASVVATVKVNDKGAFRPVKDIKT 319 (479) Q Consensus 241 ~qrv~~~~n~g~~~~G~~V~~~~ss~g~I~L~~s~v~~a~~~lver~~s-~~aP~~P~~vta~~~~~~~g~~~~~sd~g~ 319 (479) .||+++.+|.+...+|++|++|.|++|.|+||+++||++++++.+..+. +++| +|..+++++....+|.+ .+++.++ T Consensus 235 ~q~~v~~~n~~~~~~G~~v~g~~sa~G~I~l~gs~il~~~~~l~~~~~~~~~Ap-sp~~vsaT~~~~~~g~~-~~~~~a~ 312 (468) T protein:vir:63 235 KQTQLVRDNGNNVSVGFNIQGFHSARGFIKLHGSTVMENEQILDERILALPTAP-QPAKVTATQEAGKKGQF-RAEDLAA 312 (468) T ss_pred ceEEEEcCCCCceeeeecccceecceeeeeecCceeeccccCCCcccccccccc-cCCccceeeecccCCcc-cCCCcce Confidence 9999988989999999999999999999999999999999999987655 4555 56667788777777775 6788899 Q ss_pred eEEEEEEEcCCCCcccccceeeeeecCCCeEEEEEeecCCccccceEEEEEeccCCCCcEEEEEEeeeeeccCCCeeEEE Q lcl|NC_018863. 320 HSYKVVVHSDDAESLASEAVTAVVANPTDSVSLAVKLQSLYQAKPQFISVYRQGNETGHYFLVARVPLSKADENGVITFV 399 (479) Q Consensus 320 Y~YkV~a~n~~GES~~S~~VtaT~a~~~~~V~LtIt~~~~~~~~~~y~~IYR~t~~~g~~~~i~rV~~s~~n~~~tttf~ 399 (479) |+|||+++|++|||+||+++++|+++..++++|+|++++++++.|+|++|||+++++|+||+|+|||++.+ .+++++|+ T Consensus 313 y~Y~v~~vs~~GES~pS~~vtvTVaa~~dg~~ltIt~~~~~~~~p~yv~IYR~~~gg~~f~li~~va~~~a-~~gt~tf~ 391 (468) T protein:vir:63 313 HEYKVVVSSDDAESIASEVATATVTAKDDGVKLEIELAPMYSSRPQFVSIYRKGAETGLFYLIARVPASKA-ENNVITFY 391 (468) T ss_pred EEEEEEEECCCCccccccceEEEecCcccceeEEEEecCCCCCcceEEEEEEeCCCCcceeEeeeEeeeec-CCCeEEEE Confidence 99999999999999999999999999999999999999999999999999999999999999999999987 58999999 Q ss_pred eccCCCCCccceeeccccHHHHHHHHhccccccCccccCchhHHHHHhhhhhheeccceeEEEEeccccCccccccccC Q lcl|NC_018863. 400 DRNQVIPETTDVFIGELTPQVISLLELLPMMKLPLAQMNATTTFTVLWYGALALYAPKKWVRIKNVQYIPALAADVTYR 478 (479) Q Consensus 400 D~N~~iPgT~~~fvge~~~q~i~l~ellPm~k~Pla~~~~~~~~~V~~yg~L~l~aPkk~~~ikNV~~~~~~~~~~~~~ 478 (479) |+|++||||+++|||||||+||+|+|||||+|||||+.|++++|||+|||+|+|+|||||+|||||+|+|-- |+.-. T Consensus 392 D~n~~iPgT~~~fVgem~~~~i~~~~llpm~~lplA~~n~~~~~~Vl~Ygalal~~Pk~~~~ikNv~~~~~~--~~~~~ 468 (468) T protein:vir:63 392 DLNDSIPETVDVFVGEMSANVVHLFELLPMMRLPLAQINASVTFAVLWYGALALRAPKKWVRIRNVKYIPVK--NVHSN 468 (468) T ss_pred cCCcccCCCcceeeeecChhHHHHHHHhccccCChhHhccchhhhhhhhhHHhhhccccceEEEEeeeeeec--cccCC Confidence 999999999999999999999999999999999999999999999999999999999999999999999853 33333 No 6 >protein:vir:80491 Length: 467 # NCBI annotation: Cps # Family: family:all:2450 # MgeID: mge:1883 # MgeName: A511 # Cross-refs: genbank:acc:YP_001468466;genbank:gi:157325041;genbank:GeneID:5601449 Probab=100.00 E-value=4.6e-189 Score=1053.12 Aligned_cols=466 Identities=58% Similarity=0.919 Sum_probs=440.8 Q ss_pred CcccccccceeeeecCchhHHHHHHHHHHHhhcCcccCcccccCccccchhhhHHHHHHHhhccccccchhhhccchhHH Q lcl|NC_018863. 1 MTELQKEQKVEARKLPAGAEAELAELVSKSFTTGTGITPDTQHDAAALRRELLDDQVKMLAFTNGDFTIYPLINKQQVNS 80 (479) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~e~~~Ksf~ag~~~~~~~~~~gaAlr~esld~~i~~l~~~~~~f~~~~~i~k~~~~s 80 (479) ||+.+||. .+|+++++. ||+|+|||+|||+|+|++|+||+||||||||++|++|+|+++||+|||+|+|++++| T Consensus 1 ~~~~~~~~-----~~~~n~~~~-~e~~~Ks~~agy~~~p~tq~~~~AlR~EsL~~~i~~Lt~~~~~f~~~~di~k~~a~s 74 (467) T protein:vir:80 1 MPKNNKEE-----VKEVNLNSV-QEDALKSFTTGYGITPDTQTDAGALRREFLDDQISMLTWTENDLTFYKDIAKKPATS 74 (467) T ss_pred CCCcchhh-----hhhcccccC-HHHHHHHHHcccccCCccccCcchhhhhhhhhhhheeeccccchhhhhhcccchhhh Confidence 99999985 345666666 599999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHhhhhhccCcccccccccccccccccCcceEEEEEEEEeeeehhhhhhhHhhhcchhhHHHHHHHHHHHHHHHHHHH Q lcl|NC_018863. 81 TVAKYAVFNQHGRTGHSRFVREVGVASINDPNIRQKTVQMKFLSDTKQQSLAAGLVNNIADPMTILTEDAISVIAKSIEW 160 (479) Q Consensus 81 tv~~y~~~~~~G~~g~~~fv~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lv~~~~dp~~~~~~~ai~~~~~~~e~ 160 (479) ||+||+++++||++||++|++|+|+++++||+|+||+++||||++++++|+++++||+|+||+++|+++||+++||+||| T Consensus 75 tv~~y~~~~~~G~~g~~~f~~E~g~~~~~~~~~~r~~~~~k~l~~~~~vs~~~~l~n~i~d~~~~~~~~ai~~~a~tiE~ 154 (467) T protein:vir:80 75 TVAKYDVYMQHGKVGHTRFTREIGVAPVSDPNIRQKTVNMKFASDTKNISIAAGLVNNIQDPMQILTDDAIVNIAKTIEW 154 (467) T ss_pred hhhhheeeeccCccccccccccccccccCCCceEEEEEEeeeeeeeeeehhhhhhhcchhhHHHHHHHHHHHHHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHhhcccccCCCCCCcccchhhhHHHhhccCCcEEEccCCCCCHHHhhhhhheeecccCceeeeecChHHHhhHHHhhcC Q lcl|NC_018863. 161 AIFYGDAALAAEADNQAGIEFDGLTKLIDEATNVIDLKGERLDEATLNKAAVIVGKGYGRATDAFMPIGVQADFTNNLLD 240 (479) Q Consensus 161 a~f~Gd~~l~~~~~~~~gleFDGl~~~I~~~~NviDarG~~l~~~~l~~aa~~i~~~fG~atd~~mp~~vka~f~q~~~~ 240 (479) +|||||++|.+.+++++|||||||+++|+ ++||||+||++|++++||+|+++|+++||++||+|||+++|++|++.++. T Consensus 155 a~FyGds~l~~s~~~~~glqfDGi~~li~-~enviDa~G~~ls~~~lneaa~~i~~gfG~~td~~~p~~v~a~~~~~~L~ 233 (467) T protein:vir:80 155 ASFFGDSDLSDSPEPQAGLEFDGLAKLIN-QDNVHDARGASLTESLLNQAAVMISKGYGTPTDAYMPVGVQADFVNQQLS 233 (467) T ss_pred HhhhcccccccCCCccccccccceeEEec-CCceeccCCCccCHHHHHHHhhhccccccChhhhhcchhHHhhhhhhhcC Confidence 99999999998888899999999999997 79999999999999999999999999999999999999999999999999 Q ss_pred ceeEEeecCCCccccCccccceecCceeEEecCCcccCCCccccCcccC-CCCCcccceEEEeecccccCccccccccee Q lcl|NC_018863. 241 RQRVIQPSQAGGFSTGFSINQFLSTRGAINLHGSTIMENDNILVDRIPE-PNAPQAPASVVATVKVNDKGAFRPVKDIKT 319 (479) Q Consensus 241 ~qrv~~~~n~g~~~~G~~V~~~~ss~g~I~L~~s~v~~a~~~lver~~s-~~aP~~P~~vta~~~~~~~g~~~~~sd~g~ 319 (479) .||+++.+|.+...+|++|++|.|++|.|+||+++||++++++.+..+. +++| +|..+++++....+|.+ .+++.++ T Consensus 234 ~q~~v~~~n~~~~~~G~~v~g~~sa~G~I~l~gs~il~~~~~l~~~~~~~~~Ap-sp~~vsaT~~~~~~g~~-~~~~~a~ 311 (467) T protein:vir:80 234 KQTQLVRDNGNNVSVGFNIQGFHSARGFIKLHGSTVMENEQILDERILALPTAP-QPAKVTATQEAGKKGQF-RAEDLAA 311 (467) T ss_pred ceEEEEcCCCCceeeeecccceecceeeeeecCceeeccccCCCcccccccccc-cCCccceeeecccCCcc-cCCCcce Confidence 9999988989999999999999999999999999999999999987655 4555 56667788777777775 6788889 Q ss_pred eEEEEEEEcCCCCcccccceeeeeecCCCeEEEEEeecCCccccceEEEEEeccCCCCcEEEEEEeeeeeccCCCeeEEE Q lcl|NC_018863. 320 HSYKVVVHSDDAESLASEAVTAVVANPTDSVSLAVKLQSLYQAKPQFISVYRQGNETGHYFLVARVPLSKADENGVITFV 399 (479) Q Consensus 320 Y~YkV~a~n~~GES~~S~~VtaT~a~~~~~V~LtIt~~~~~~~~~~y~~IYR~t~~~g~~~~i~rV~~s~~n~~~tttf~ 399 (479) |+|||+++|++|||+||+++++|+++..++++|+|++++++++.|+|++|||+++++|+||+|+|||++.+ .+++++|+ T Consensus 312 y~Y~v~~vs~~GES~pS~~vtvTVaa~~dg~~ltIt~~~~~~~~p~yv~IYR~~~gg~~f~li~~va~~~a-~~gt~tf~ 390 (467) T protein:vir:80 312 HEYKVVVSSDDAESIASEVATATVTAKDDGVKLEIELAPMYSSRPQFVSIYRKGAETGLFYLIARVPASKA-ENNVITFY 390 (467) T ss_pred EEEEEEEECCCCccccccceEEEecCcccceeEEEEecCCCCCcceEEEEEEeCCCCcceeEeeeEeeeec-CCCeEEEE Confidence 99999999999999999999999999999999999999999999999999999999999999999999987 58999999 Q ss_pred eccCCCCCccceeeccccHHHHHHHHhccccccCccccCchhHHHHHhhhhhheeccceeEEEEeccccCccccccccC Q lcl|NC_018863. 400 DRNQVIPETTDVFIGELTPQVISLLELLPMMKLPLAQMNATTTFTVLWYGALALYAPKKWVRIKNVQYIPALAADVTYR 478 (479) Q Consensus 400 D~N~~iPgT~~~fvge~~~q~i~l~ellPm~k~Pla~~~~~~~~~V~~yg~L~l~aPkk~~~ikNV~~~~~~~~~~~~~ 478 (479) |+|++||||+++|||||||+||+|+|||||+|||||+.|++++|||+|||+|+|+|||||+|||||+|+|-- |+.-. T Consensus 391 D~n~~iPgT~~~fVgem~~~~i~~~~llpm~~lplA~~n~~~~~~Vl~Ygalal~~Pk~~~~ikNv~~~~~~--~~~~~ 467 (467) T protein:vir:80 391 DLNDSIPETVDVFVGEMSANVVHLFELLPMMRLPLAQINASVTFAVLWYGALALRAPKKWVRIRNVKYIPVK--NVHSN 467 (467) T ss_pred cCCcccCCCcceeeeecChhHHHHHHHhccccCChhHhccchhhhhhhhhHHhhhccccceEEEEeeeeeec--cccCC Confidence 999999999999999999999999999999999999999999999999999999999999999999999853 33323 No 7 >protein:vir:100851 Length: 514 # NCBI annotation: hypothetical protein # Family: family:all:2450 # MgeID: mge:1633 # MgeName: LP65 # Cross-refs: genbank:acc:YP_164744;genbank:gi:56693157;genbank:GeneID:3197484 Probab=100.00 E-value=2.6e-187 Score=1043.54 Aligned_cols=472 Identities=42% Similarity=0.673 Sum_probs=430.3 Q ss_pred Ccccccccce---eeeecCchhHH----HHHHHHHHH-hhcCcccCcccccCccccchhhhHHHHHHHhhccccccchhh Q lcl|NC_018863. 1 MTELQKEQKV---EARKLPAGAEA----ELAELVSKS-FTTGTGITPDTQHDAAALRRELLDDQVKMLAFTNGDFTIYPL 72 (479) Q Consensus 1 ~~~~~~~~~~---~~~~~~~~~~~----~~~e~~~Ks-f~ag~~~~~~~~~~gaAlr~esld~~i~~l~~~~~~f~~~~~ 72 (479) -..|.|++-| ..+..--+.|+ ...|+++|| |+|||+|+|++|+||+||||||||++|++|+|+++||+|||+ T Consensus 7 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~k~a~t~gy~~~~~~~t~gaAlR~EsLd~~l~~Lt~~~~~ftf~~~ 86 (514) T protein:vir:10 7 TKDIMKKSFFGGDRAVAFDTNKEDILNENLPENVKKSAFTAGHSITPDTQTDGAANRIESLNRDLKVTTWGERDFTLYND 86 (514) T ss_pred hhHHHhhhhcccceeeeecCcHHHHHHHhcchhhhhhhhccccccCCccccCccchhhhhhccceeEeeecCcchhhhhh Confidence 1222333211 12233333333 345689999 999999999999999999999999999999999999999999 Q ss_pred hccchhHHHHHHhhhhhccCcccccccccccccccccCcceEEEEEEEEeeeehhhhhhhHhhhcchhhHHHHHHHHHHH Q lcl|NC_018863. 73 INKQQVNSTVAKYAVFNQHGRTGHSRFVREVGVASINDPNIRQKTVQMKFLSDTKQQSLAAGLVNNIADPMTILTEDAIS 152 (479) Q Consensus 73 i~k~~~~stv~~y~~~~~~G~~g~~~fv~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lv~~~~dp~~~~~~~ai~ 152 (479) |+|++++|||+||+++++||++||++|++|+|+++++||+|+||+++||||++++++|+++++||++.||+++++++||+ T Consensus 87 i~k~~a~STV~ey~~~~~~G~~G~~~f~~E~gi~~~~d~~~~rk~~~~k~l~~~~~vS~~~~l~n~i~d~~~~~~~dai~ 166 (514) T protein:vir:10 87 IAKQPVDNTVLKYTQYYSHGRTGHSLFQPEIGIGDVNNPNERQRTINIKYIVDTHVTSIALQRANTIVDSLKVQEYAAIS 166 (514) T ss_pred cCCchhhHHHhhhhhhcccCcccccccccccccCcCCCcceEEEEEeeeeeeeeeeeeehhhhccchhhHHHHHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHhhcccccCCCCCCcccchhhhHHHhhccCCcEEEccCCCCCHHHhhhhhheeecccCceeeeecChHHHh Q lcl|NC_018863. 153 VIAKSIEWAIFYGDAALAAEADNQAGIEFDGLTKLIDEATNVIDLKGERLDEATLNKAAVIVGKGYGRATDAFMPIGVQA 232 (479) Q Consensus 153 ~~~~~~e~a~f~Gd~~l~~~~~~~~gleFDGl~~~I~~~~NviDarG~~l~~~~l~~aa~~i~~~fG~atd~~mp~~vka 232 (479) ++||+|||+|||||++|+++ ..++|||||||+++|+ ++|||||||++||+++||+||++|+++||+|||+|||+++|+ T Consensus 167 ~ia~tiE~a~FyGDs~L~s~-~~~~gleFDGl~~lI~-~~NvIDarG~~Ls~~~ln~aA~~i~~gfGt~TD~ylp~~vka 244 (514) T protein:vir:10 167 TVIKTDEWAMFYGDADLTSG-QKGEGLQFDGLFKLIA-PENHIDLRGGRLSPAALNMAARKIGEGFGTPTDAYMPIGIKA 244 (514) T ss_pred HHHHHHHHHHhhhcccCCCc-cccCcchhhhHHHhhc-CCCeEecCCCCccHHHHhhhhhhhhcccCChhheeCchHHHH Confidence 99999999999999999965 4589999999999997 799999999999999999999999999999999999999999 Q ss_pred hHHHhhcCceeEEeecCCCccccCccccceecCceeEEecCCcccCCCccccCccc-CCCCCcccceEEEeecc------ Q lcl|NC_018863. 233 DFTNNLLDRQRVIQPSQAGGFSTGFSINQFLSTRGAINLHGSTIMENDNILVDRIP-EPNAPQAPASVVATVKV------ 305 (479) Q Consensus 233 ~f~q~~~~~qrv~~~~n~g~~~~G~~V~~~~ss~g~I~L~~s~v~~a~~~lver~~-s~~aP~~P~~vta~~~~------ 305 (479) +|+|+++++|||+|++|++++.+|++|++|.+++|+|+||||+||+.+++|.++.. +++||.+|. ++++.++ T Consensus 245 ~f~~~~~~~qRV~~~~n~~~~~~G~~v~~f~s~~G~I~L~gs~im~~~n~L~~~~~~~~~Ap~~~~-va~svT~~~~g~~ 323 (514) T protein:vir:10 245 DFVNQHLNGQRVMLPGQTGGMTTGLDIDKFLSAHGSIRIQGSTIMDSDNKLDFDRPVSPTAPTAPQ-LSATVTPDGGGLW 323 (514) T ss_pred HHhhcccCcceEEeecCccceeeeeeccceeEeccceeecCCeeecccccCccCCccCCcCCCCCc-ceEEEecCccccc Confidence 99999999999999999999999999999999999999999999999999998655 678886654 3333322 Q ss_pred ------cccCccccccccee-eEEEEEEEcCCCCcccccceeeeeecCCCeEEEEEeecCCccccceEEEEEeccC---- Q lcl|NC_018863. 306 ------NDKGAFRPVKDIKT-HSYKVVVHSDDAESLASEAVTAVVANPTDSVSLAVKLQSLYQAKPQFISVYRQGN---- 374 (479) Q Consensus 306 ------~~~g~~~~~sd~g~-Y~YkV~a~n~~GES~~S~~VtaT~a~~~~~V~LtIt~~~~~~~~~~y~~IYR~t~---- 374 (479) +.+|+.+.++++|+ |+|||+++|++|||+||+++++|+++++++|+|+|+|+++++..|+|++||||+. T Consensus 324 ~~ad~t~~~g~~~~~~~~g~~~sYaVv~~n~~GeS~ps~~vtaT~a~~~~~i~ltItp~~~~~~~p~yv~IYR~~~~~s~ 403 (514) T protein:vir:10 324 HEADKTDSKGEVILNKEVGVEQSYVAVMVSRHGDSRPSLVQTATPTKKDDAITLTITPNAMQNVIPDYVAIYRKSNFDSD 403 (514) T ss_pred CcccccccccccccccccceeEEEEEEEECCCCcccccceeeeeeeccCceEEEEEEeccCcccccceEEEEeccCCCcc Confidence 23344345677886 7799999999999999999999999999999999999999999999999999974 Q ss_pred ----------CCCcEEEEEEeeeeeccCCCeeEEEeccCCCCCccceeeccccHHHHHHHHhccccccCccccCchhHHH Q lcl|NC_018863. 375 ----------ETGHYFLVARVPLSKADENGVITFVDRNQVIPETTDVFIGELTPQVISLLELLPMMKLPLAQMNATTTFT 444 (479) Q Consensus 375 ----------~~g~~~~i~rV~~s~~n~~~tttf~D~N~~iPgT~~~fvge~~~q~i~l~ellPm~k~Pla~~~~~~~~~ 444 (479) ++|+||+|+|||+++ +++++|+|+|+|+|||||+++|||||+||||+|+|||||+|||||+.|++++|| T Consensus 404 ~~~~~~~~~~~tGdf~li~rv~~~~-~~~gttt~~D~n~~IPgT~~vfVgemspevi~l~ellPm~klpLA~~na~~~wa 482 (514) T protein:vir:10 404 ALEANTDASGNRGSYYLIGKVAVRE-QEGATITFVDTNARIAGCGDVFVIENRPETVALQEFIPLSKLNLAVTTTATSFV 482 (514) T ss_pred hhhhhccccccccceeEEEEEeeec-CCCCeEEEeccccccCCcceeEEeeCchHHHHHHHHhhhhhcChhhhcchHHHH Confidence 789999999999955 679999999999999999999999999999999999999999999999999999 Q ss_pred HHhhhhhheeccceeEEEEeccccCccccccc Q lcl|NC_018863. 445 VLWYGALALYAPKKWVRIKNVQYIPALAADVT 476 (479) Q Consensus 445 V~~yg~L~l~aPkk~~~ikNV~~~~~~~~~~~ 476 (479) |+|||+|+|+|||||++||||+|+|----+.+ T Consensus 483 VlwYGaLal~aPkr~~~IkNv~~~~v~~~~~~ 514 (514) T protein:vir:10 483 VLNYVALALYYPKRGAVLENVVYSRVEDLELS 514 (514) T ss_pred HHHHhHHHhhccccceEEEeeeeeeccccccC Confidence 99999999999999999999999998766666 No 8 >protein:vir:102823 Length: 470 # NCBI annotation: major structural protein # Family: family:all:2450 # MgeID: mge:1610 # MgeName: YS40 # Cross-refs: genbank:acc:YP_874086;genbank:gi:118197693;genbank:GeneID:4496015 Probab=100.00 E-value=2.8e-157 Score=878.84 Aligned_cols=436 Identities=19% Similarity=0.261 Sum_probs=385.4 Q ss_pred cCchhHHHHHHHHHHHhhcCcccCcccccCccccchhhhHHHHHHHhhccccccchhhhccchhHHHHHHhhhhhc-cCc Q lcl|NC_018863. 15 LPAGAEAELAELVSKSFTTGTGITPDTQHDAAALRRELLDDQVKMLAFTNGDFTIYPLINKQQVNSTVAKYAVFNQ-HGR 93 (479) Q Consensus 15 ~~~~~~~~~~e~~~Ksf~ag~~~~~~~~~~gaAlr~esld~~i~~l~~~~~~f~~~~~i~k~~~~stv~~y~~~~~-~G~ 93 (479) ||.+|+++|.|+.+|++++ ....|+||||||||++|++|+|+++||+|||+|+|++++|||+||+++++ ||+ T Consensus 1 ~~~~~~~~~~~a~~~al~~-------a~~~g~AlR~EsLd~~l~~lt~~~~~ftf~~~i~k~~a~STV~ey~~~~~rhG~ 73 (470) T protein:vir:10 1 MPYEHLKHLDEATLKALNA-------AGQVAESLEREDLEPEVTQLNVLDTPLTDLLSKNAVKAKAYEHEYNVVTARHDK 73 (470) T ss_pred CChhHhhhhhHHHHHHHHH-------hhhcchhhhhhhhccceeEeeecCccchhhhhcCCchhhhHhhhhhhhcccccc Confidence 9999999999999999999 77778999999999999999999999999999999999999999999886 788 Q ss_pred ccccccccccccccccCcceEEEEEEEEeeeehhhhhhhHh--hhcchhhHHHHHHHHHHHHHHHHHHHHHhhcccccCC Q lcl|NC_018863. 94 TGHSRFVREVGVASINDPNIRQKTVQMKFLSDTKQQSLAAG--LVNNIADPMTILTEDAISVIAKSIEWAIFYGDAALAA 171 (479) Q Consensus 94 ~g~~~fv~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~--lv~~~~dp~~~~~~~ai~~~~~~~e~a~f~Gd~~l~~ 171 (479) .||++| +|+|+++++||+|+||+++||||+++++||+++. ++|+++||+++++++||+++||+|||+|||||++|++ T Consensus 74 ~g~s~~-~E~~l~~~~d~~~~Rr~v~~K~l~~~~~VT~~a~~~~~n~v~d~~~~~~~dai~~ia~tiE~a~FyGDs~l~s 152 (470) T protein:vir:10 74 IGYAAF-REGGLPRTVEVNVVRRRIRPMLVGHRITVTELATRTTQNGVMQIDELVKREKMIAVANEFEYLAFYGDNLLGD 152 (470) T ss_pred ccceee-cccccCccCCCceEEEEEEEEEEeecchhhhhhhhhhhccccchHHHHHHHHHHHHHHHHHhhhhhhcccccc Confidence 888866 9999999999999999999999999999999974 5688999999999999999999999999999999985 Q ss_pred C-CCCcccchhhhHHHhhc--cCCcEEEccCCCCCHHHhhhhhh--eeecccCceeeeecChHHHhhHHHhhcCceeEEe Q lcl|NC_018863. 172 E-ADNQAGIEFDGLTKLID--EATNVIDLKGERLDEATLNKAAV--IVGKGYGRATDAFMPIGVQADFTNNLLDRQRVIQ 246 (479) Q Consensus 172 ~-~~~~~gleFDGl~~~I~--~~~NviDarG~~l~~~~l~~aa~--~i~~~fG~atd~~mp~~vka~f~q~~~~~qrv~~ 246 (479) . +..++|||||||.++|| .|+|||||||++||+++||+|+. +++++||+|||+|||+++|++|+|+++++|||+| T Consensus 153 ~~~g~~~gleFDGl~~lId~~~~~NViDarG~~Ls~~~L~~aa~~I~~~~~fGt~TD~~lp~~vka~f~~~~~~~qRv~~ 232 (470) T protein:vir:10 153 DVPGSPNNLQQDGIINIIKRGAPQNVLDAGGRPLSIDLLWEAESRVVSTQAFANPTAVFISYVDKLNLQASFYQISRVMT 232 (470) T ss_pred ccCcccCceeccchhhhccCCCCccccccCCCCccHHHHHHHHhhhcccccccChhhhccchhHHHHHHHhhcCceEEEE Confidence 4 44579999999999998 48999999999999999999995 5589999999999999999999999999999999 Q ss_pred ecCCCccccCccccceecCceeEEecCCcccCC-----CccccCcccCCCCCcccceEEEeec--------ccccCcccc Q lcl|NC_018863. 247 PSQAGGFSTGFSINQFLSTRGAINLHGSTIMEN-----DNILVDRIPEPNAPQAPASVVATVK--------VNDKGAFRP 313 (479) Q Consensus 247 ~~n~g~~~~G~~V~~~~ss~g~I~L~~s~v~~a-----~~~lver~~s~~aP~~P~~vta~~~--------~~~~g~~~~ 313 (479) ++|+|++.+|++|++|.|++|+|+||++++|+. ++++.++. ++.+ +|..+++.++ ..++++.|. T Consensus 233 ~~N~~~~~~G~~v~~f~sa~G~I~L~~s~~m~~~~k~~p~~l~~~v--~~~a-AP~~~~tv~~t~~~~a~~~~sk~g~~~ 309 (470) T protein:vir:10 233 TADRRAGLLGADAQSYIGVRGEHSLYPSQFLGDFHKFNPARFGAEV--GDFA-APSNSWTVSTTDNFVTLPYNSGLGDPA 309 (470) T ss_pred ecCCCceeeeeeccceeeeeeeeeecccccccchhhcCcccCCccc--CCcc-cCceeEEeecCCCceeecccCCCCccc Confidence 999999999999999999999999999999994 56666642 2222 4433333222 235555578 Q ss_pred cccceeeEEEEEEEcCCCCcccccc-eeeeeecCCCeEEEEEeecCCccccceEEEEEeccCCCCcEEEEEEeeeeeccC Q lcl|NC_018863. 314 VKDIKTHSYKVVVHSDDAESLASEA-VTAVVANPTDSVSLAVKLQSLYQAKPQFISVYRQGNETGHYFLVARVPLSKADE 392 (479) Q Consensus 314 ~sd~g~Y~YkV~a~n~~GES~~S~~-VtaT~a~~~~~V~LtIt~~~~~~~~~~y~~IYR~t~~~g~~~~i~rV~~s~~n~ 392 (479) +++++.|+||||. .+|||+++.+ +++|++.++++|+|+|+.+ ..++|++||||++++|+||+|+|||++.+| T Consensus 310 ~~~v~sy~y~v~~--~~gds~s~~v~vt~t~~~v~kgv~ltI~~~----~~v~yv~IYRk~~~s~~~~li~rv~v~~~n- 382 (470) T protein:vir:10 310 NTTVYSYAFKAAN--FYGESAAKYIDVYIDSTEAGKGVRFQFHGL----VNVKWLDVYRKDPGSQEYKFYKRVKVSTVN- 382 (470) T ss_pred CcceeEEEEEEEE--ecCCCCcceEEEEEeeehhcceeEEEEecC----CCCcEEEEEeecCCCCceeEEEEEeeeecc- Confidence 8887666666654 4577754443 4777888999999999865 347999999999999999999999999987 Q ss_pred CCeeEEEeccCCCCCccce----------eeccccHHHHHHHHhccc--cccCccccCchhHHHHHhhhhhheeccceeE Q lcl|NC_018863. 393 NGVITFVDRNQVIPETTDV----------FIGELTPQVISLLELLPM--MKLPLAQMNATTTFTVLWYGALALYAPKKWV 460 (479) Q Consensus 393 ~~tttf~D~N~~iPgT~~~----------fvge~~~q~i~l~ellPm--~k~Pla~~~~~~~~~V~~yg~L~l~aPkk~~ 460 (479) ++.++|+|.|++||+|+++ |||||||++++|.+|||| +|||+++.++...|.| |+|+|+|||||+ T Consensus 383 g~~~~~~D~~e~i~tt~~v~~~~~~Pgt~~Vgemsp~v~sl~~~l~m~l~klp~a~~~~~v~~~v---galal~aPKr~~ 459 (470) T protein:vir:10 383 GDFTWIDDGHETVTTPSGVYRWKKIPGTGVVVGIDPNVTTMAVWIGMELYRLPPALTHDYVIWKV---ASVFSRAPEFNF 459 (470) T ss_pred CCEEEEecccccCCCcceeeeecccCcceeccccCcchhhhhhhhhhhhhhcCHHHHHHHHHHHH---HHHHHhccccce Confidence 7888888888888888876 999999999999999999 7899888887667777 999999999999 Q ss_pred EEEeccccCcc Q lcl|NC_018863. 461 RIKNVQYIPAL 471 (479) Q Consensus 461 ~ikNV~~~~~~ 471 (479) +||||+|+|-. T Consensus 460 ~IkNV~~~~~~ 470 (470) T protein:vir:10 460 LIVNVGQEPIV 470 (470) T ss_pred EEEEeeeeecC Confidence 99999999988 No 9 >protein:vir:8843 Length: 317 # NCBI annotation: major head protein # Family: family:all:3919 # MgeID: mge:158 # MgeName: PaP3 # Cross-refs: genbank:acc:NP_775251;genbank:gi:27476049;genbank:GeneID:2700597 Probab=99.37 E-value=1.9e-14 Score=95.86 Aligned_cols=293 Identities=14% Similarity=0.117 Sum_probs=189.1 Q ss_pred hhcCcccCcc-cccCccccchhhhHHHHHHHhhccccccchhhhccchhHHHHHHhhhhhccCccccccccccccccccc Q lcl|NC_018863. 31 FTTGTGITPD-TQHDAAALRRELLDDQVKMLAFTNGDFTIYPLINKQQVNSTVAKYAVFNQHGRTGHSRFVREVGVASIN 109 (479) Q Consensus 31 f~ag~~~~~~-~~~~gaAlr~esld~~i~~l~~~~~~f~~~~~i~k~~~~stv~~y~~~~~~G~~g~~~fv~E~g~~~~~ 109 (479) |.+ -.. -++--+-.-+|+|.++|.++.-.+.+| +..|.|.+++||.++|....=..- .. .-..||+++... T Consensus 1 ma~----~~~~~~t~~~~g~~~dl~~~I~~isp~dTPf--~S~i~~~~a~~~~~~W~~d~l~~~-~~-~~~~EG~da~~~ 72 (317) T protein:vir:88 1 MAT----PTNAVSTVEINGKREDLIDIIYNIAPYDTPF--MSAIGKGVATAITHEWQTDELRQP-GK-NTRVEGEDATIK 72 (317) T ss_pred CCc----cccceEeeeeeeeeechhhhheecCCccCcc--eeeecCceecccEEEEEeeecCCc-cc-cccccCcccccc Confidence 111 001 123456678999999998888777655 567788899999999975442211 11 233488654333 Q ss_pred C-cceEEEEEEEEeeeehhhhhhhHhhhcc--hhhHHHHHHHHHHHHHHHHHHHHHhhcccccCCCCCCcccchhhhHHH Q lcl|NC_018863. 110 D-PNIRQKTVQMKFLSDTKQQSLAAGLVNN--IADPMTILTEDAISVIAKSIEWAIFYGDAALAAEADNQAGIEFDGLTK 186 (479) Q Consensus 110 d-~~~~r~~~~~k~l~~~~~vs~~~~lv~~--~~dp~~~~~~~ai~~~~~~~e~a~f~Gd~~l~~~~~~~~gleFDGl~~ 186 (479) . ..-.|+...+--+.+..+||.-++.++. ++|-++.|...++.-|..++|+++++|.+....++... -=+.+||.. T Consensus 73 ~~~~r~~~~N~tQIf~k~v~VSgTa~av~~~G~~~ela~q~~kk~~EikrdmE~~li~g~~a~~~~~~t~-~r~~~Gl~~ 151 (317) T protein:vir:88 73 AGSFTTMLNNYCQISDETLQVTGTADRVKKAGRKNELAYQLAKKSKELKLDMEYALVGAPQAKVQRNTTT-PGQMANIFA 151 (317) T ss_pred cccCCEEeccEEEEEEeEEEEeehhhhhhhcCccchhHHHHHHHHHHHHHHHHHHHhcCeeeccCCCCcc-chhhhhHHH Confidence 2 2233444444556666777777777644 56999999999999999999999999998765432210 127899999 Q ss_pred hhccCCcEEEccC----------------CCCCHHHhhhhhheeecccCceeeeecChHHHhhHHHhhcCceeEEeecCC Q lcl|NC_018863. 187 LIDEATNVIDLKG----------------ERLDEATLNKAAVIVGKGYGRATDAFMPIGVQADFTNNLLDRQRVIQPSQA 250 (479) Q Consensus 187 ~I~~~~NviDarG----------------~~l~~~~l~~aa~~i~~~fG~atd~~mp~~vka~f~q~~~~~qrv~~~~n~ 250 (479) .|+ ..|+..+.| ..|+|+.|+++...+-.+.|.++.+|+++..|..|...+-++...+.. .. T Consensus 152 ~i~-t~~~~~~~g~~~~~~~~~~~t~~t~~~lte~~l~~~l~~i~~~Gg~~~~i~v~a~~k~~i~~~~~~~~~~i~~-~~ 229 (317) T protein:vir:88 152 YYK-TNGSLGANGVAPVGDGSNTGTAGDLRLLTEDMLLNASESIWRNGGQANSIQTSSSIKKAISKNMKGRATEITL-DA 229 (317) T ss_pred Hhc-cCceeccCccccccCCCccccccccccccHHHHHHHHHHHHhcCCCCCEEEeChHHHHHHHHHhcCCceeEEE-cc Confidence 996 355554443 469999999999888888899999999999999998887665544421 22 Q ss_pred CccccCccccceecCceeEEecCCcccCCCccccCc----ccCCCCCcccceEEEeecccccCcccccccceeeEEEEEE Q lcl|NC_018863. 251 GGFSTGFSINQFLSTRGAINLHGSTIMENDNILVDR----IPEPNAPQAPASVVATVKVNDKGAFRPVKDIKTHSYKVVV 326 (479) Q Consensus 251 g~~~~G~~V~~~~ss~g~I~L~~s~v~~a~~~lver----~~s~~aP~~P~~vta~~~~~~~g~~~~~sd~g~Y~YkV~a 326 (479) .....|..|..|.|..|.|++..+..|.++..++-. .....-|+. ....|..+...++.. .--|.+.+ T Consensus 230 ~~~~~g~~v~~~~tdfG~v~ii~~r~lp~~~~~~~D~~~~~l~~Lr~~~-~e~laKtGd~~k~~i-------~~E~tLe~ 301 (317) T protein:vir:88 230 SDNRIAQTVDVYESDFGKYTIRANRWFHENTLFVFDPKMHSLCYLRPFF-QHELAKTGDSEKRQL-------LVEYTFRV 301 (317) T ss_pred cCeEEEEEEEEEEeCCeEEEEEeCCCCCCCeEEEEcccccceeecccce-eeccCCCcccceeEE-------EEEEEEEE Confidence 334789999999999999999999998876666532 111122221 111111111111110 12466667 Q ss_pred EcCCCCcccccceeeee Q lcl|NC_018863. 327 HSDDAESLASEAVTAVV 343 (479) Q Consensus 327 ~n~~GES~~S~~VtaT~ 343 (479) .|..+-...... +++. T Consensus 302 ~N~~a~a~i~~l-~~~~ 317 (317) T protein:vir:88 302 NNEKSGALIRDV-VAQL 317 (317) T ss_pred cCccceeEEEEe-cccC Confidence 776644333321 2222 No 10 >protein:vir:94933 Length: 330 # NCBI annotation: putative phage structural protein # Family: family:all:1120 # MgeID: mge:1538 # MgeName: Xp15 # Cross-refs: genbank:acc:YP_239278;genbank:gi:66392060;genbank:GeneID:5076578 Probab=99.15 E-value=8.6e-13 Score=86.74 Aligned_cols=323 Identities=15% Similarity=0.179 Sum_probs=181.3 Q ss_pred Ccccccc-cceeeeecCchhHHHHHHHHHHHhhcCcccCcccccCccccchhhhHHHHHHHhhccccccchhhhccchhH Q lcl|NC_018863. 1 MTELQKE-QKVEARKLPAGAEAELAELVSKSFTTGTGITPDTQHDAAALRRELLDDQVKMLAFTNGDFTIYPLINKQQVN 79 (479) Q Consensus 1 ~~~~~~~-~~~~~~~~~~~~~~~~~e~~~Ksf~ag~~~~~~~~~~gaAlr~esld~~i~~l~~~~~~f~~~~~i~k~~~~ 79 (479) |-++..- +.+..+.+.- +.. .+||.| -|...++.|-...+...+...-... -.++..++=..++ T Consensus 1 ~~~~~~~~~~~~~~~~~~----~~p---~l~m~a------lTLaea~~l~~d~~~~~VIE~l~~~--s~iL~~lpf~~ve 65 (330) T protein:vir:94 1 MVRICTPPLRGRWRTLTH----QFP---ELKMPT------VTLAESAKLSQDHLVSGLIETIVEV--NPLYEMMPFTEIE 65 (330) T ss_pred CceecCCccccceeehhc----ccc---ccchhh------hhhhHHhhcCchhhHHHHHHhhhcc--chHHhhccccccc Confidence 4433211 1111111110 000 122222 2233344444444444442222222 2355666655567 Q ss_pred HHHHHhhhhhccCcccccccccc-cccccccCcceEEEEEEEEeeeehhhhhhhH-hhhcchhhHHHHHHHHHHHHHHHH Q lcl|NC_018863. 80 STVAKYAVFNQHGRTGHSRFVRE-VGVASINDPNIRQKTVQMKFLSDTKQQSLAA-GLVNNIADPMTILTEDAISVIAKS 157 (479) Q Consensus 80 stv~~y~~~~~~G~~g~~~fv~E-~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~-~lv~~~~dp~~~~~~~ai~~~~~~ 157 (479) +....|++....++ ..|... .+.++....++.|.+..++-++.-..|.... ++-++..|-+..|.+..|..+.+. T Consensus 66 ~~~~~~~r~~~lp~---a~~r~~n~~~~~~~~~Tf~q~t~~l~~l~~~~~Vd~~iadl~g~~~d~~~~q~~~~ieal~~~ 142 (330) T protein:vir:94 66 GNALAYNRENVLGD---VQFLAVGGTITAKNPATFTKVTSELTTLIGDAEVNGLIQATRSDFMDQTSVQVASKAKSIGRQ 142 (330) T ss_pred CCcceeeeeecCCc---ceeeeccccccccCcceeeeeeechhhhhhhHHHHHHHHHhcCCHHHHHHHHHHHHHHHHHHH Confidence 77788877666543 344332 2333334567889999999999988887765 467778999999999999999999 Q ss_pred HHHHHhhcccccCCCCCCcccchhhhHHHhhccCCcEEEc--cCCCCCHHHhhhhhheeecccCceeeeecChHHHhhHH Q lcl|NC_018863. 158 IEWAIFYGDAALAAEADNQAGIEFDGLTKLIDEATNVIDL--KGERLDEATLNKAAVIVGKGYGRATDAFMPIGVQADFT 235 (479) Q Consensus 158 ~e~a~f~Gd~~l~~~~~~~~gleFDGl~~~I~~~~NviDa--rG~~l~~~~l~~aa~~i~~~fG~atd~~mp~~vka~f~ 235 (479) +|+.+|+||+.- -|||||.+.++ ++|+||+ +|+.|+.+.|.++-..+-+--|.+.-++|+......+. T Consensus 143 ~e~~linGDs~~---------~~F~GL~~~~~-~~q~i~tg~~gg~~T~d~LDeLl~~v~~~~g~~~~~l~n~a~~r~I~ 212 (330) T protein:vir:94 143 YQASMITGDGTG---------NSFQGMMGLVA-ASQTISAGANGGTLTFELLDQLLDLVKDKDGQVDYLMSSFAMRRKYF 212 (330) T ss_pred HHHHhhccCCCC---------ccccchhhcCC-cccEEecCCCCCCCCHHHHHHHHHHhcCCCCCCcEEEechhHHHHHH Confidence 999999999761 26999999997 7999999 78999999999988777666677777777777666665 Q ss_pred Hhhc--CceeEEeecCCCccccCccccceecCceeEEecCCcccCCCccccCcccCCCCCcccceEEEeecccccCcccc Q lcl|NC_018863. 236 NNLL--DRQRVIQPSQAGGFSTGFSINQFLSTRGAINLHGSTIMENDNILVDRIPEPNAPQAPASVVATVKVNDKGAFRP 313 (479) Q Consensus 236 q~~~--~~qrv~~~~n~g~~~~G~~V~~~~ss~g~I~L~~s~v~~a~~~lver~~s~~aP~~P~~vta~~~~~~~g~~~~ 313 (479) ..-. +++.+..+. . ..-|..|.+|.+.. |-. .+. .|..- .++ T Consensus 213 a~~R~~~~~~v~~~~--~-~~~G~~v~~~~GvP--i~~------------~d~--------ip~~~-------~~~---- 256 (330) T protein:vir:94 213 SLLRALGGAAIGEVM--T-LPSGRQIPTYRGVP--WFV------------NDF--------IPSNM-------TQG---- 256 (330) T ss_pred HHHHhccCCCCCCcc--c-ccCCCEEeeeCCeE--EEe------------ccc--------ccCCC-------Ccc---- Confidence 5542 332222110 0 12355555554431 110 000 00000 000 Q ss_pred cccceeeEEEEEEEcCCCCcccccceeeeeecCCCeEEEEEeecCCccccceEEEEEeccCCCCcEEEEEEeeeeeccCC Q lcl|NC_018863. 314 VKDIKTHSYKVVVHSDDAESLASEAVTAVVANPTDSVSLAVKLQSLYQAKPQFISVYRQGNETGHYFLVARVPLSKADEN 393 (479) Q Consensus 314 ~sd~g~Y~YkV~a~n~~GES~~S~~VtaT~a~~~~~V~LtIt~~~~~~~~~~y~~IYR~t~~~g~~~~i~rV~~s~~n~~ 393 (479) ++ ++...+|..-+.+..+.. +-+.+. .. T Consensus 257 ------------------~~--------------------------~~ttsIyav~~G~~~~~q-----gV~Gl~---~~ 284 (330) T protein:vir:94 257 ------------------TA--------------------------TNATAIFAGTFDDGSNKY-----GIAGLT---AR 284 (330) T ss_pred ------------------cC--------------------------CCceeEEEEeeccccccc-----ceEeec---CC Confidence 00 111111111111111000 111111 01 Q ss_pred CeeEEEeccCCCCCccceeeccccHHHHHHHHhccccccCccccCchhHHHHHhhhhhheeccceeEEEEecccc Q lcl|NC_018863. 394 GVITFVDRNQVIPETTDVFIGELTPQVISLLELLPMMKLPLAQMNATTTFTVLWYGALALYAPKKWVRIKNVQYI 468 (479) Q Consensus 394 ~tttf~D~N~~iPgT~~~fvge~~~q~i~l~ellPm~k~Pla~~~~~~~~~V~~yg~L~l~aPkk~~~ikNV~~~ 468 (479) + .||-.-=|+|+.+.. +.+.|.|.||..+++.-|+...+++||... T Consensus 285 g----------~~glsVr~~G~~~~k-------------------~v~~~~v~~y~~~av~~~~a~~~L~~V~~g 330 (330) T protein:vir:94 285 G----------SAGLRVQNVGAKENA-------------------DETITRVKMYCGFANFSQLGLAAIKGLIPG 330 (330) T ss_pred C----------CCcceeeeCCCcccc-------------------ceeeEEEEEeeeeEEechhheeeeccccCC Confidence 1 244332245533321 346678899999999999999999999988 No 11 >protein:vir:97255 Length: 310 # NCBI annotation: hypothetical protein ORF017 # Family: family:all:1120 # MgeID: mge:1657 # MgeName: M6 # Cross-refs: genbank:acc:YP_001294525;genbank:gi:149408246;genbank:GeneID:5237120 Probab=98.96 E-value=2.3e-11 Score=78.88 Aligned_cols=300 Identities=15% Similarity=0.197 Sum_probs=172.4 Q ss_pred cCchhHHHHHHHHHHHhhcCcccCcccccCccccchhhhHHHHHHHhhccccccchhhhccchhHHHHHHhhhhhccCcc Q lcl|NC_018863. 15 LPAGAEAELAELVSKSFTTGTGITPDTQHDAAALRRELLDDQVKMLAFTNGDFTIYPLINKQQVNSTVAKYAVFNQHGRT 94 (479) Q Consensus 15 ~~~~~~~~~~e~~~Ksf~ag~~~~~~~~~~gaAlr~esld~~i~~l~~~~~~f~~~~~i~k~~~~stv~~y~~~~~~G~~ 94 (479) ||+-.++++ +| +.. -.|-++-+|. +...+ .+|..++=..++.....|++....++. T Consensus 1 mpaltLaea----~k-~~~------------d~l~~~ViE~----~~~~s---~lL~~LpF~~veg~~~~ynR~~~~~~~ 56 (310) T protein:vir:97 1 MASVTLAES----AK-LAQ------------DELVAGVIEN----IITVN---RMFDVLPFDSIEGNSLAYNRENVLGDV 56 (310) T ss_pred CcccchHHH----hh-cCc------------chHHHHHHHH----Hhccc---hHHHhCCcccccCCcceeeEeeccCCc Confidence 555555554 12 111 0111122221 11222 234444444455555667766665555 Q ss_pred cc----cccccccccccccCcceEEEEEEEEeeeehhhhhhh-Hhhh-cchhhHHHHHHHHHHHHHHHHHHHHHhhcccc Q lcl|NC_018863. 95 GH----SRFVREVGVASINDPNIRQKTVQMKFLSDTKQQSLA-AGLV-NNIADPMTILTEDAISVIAKSIEWAIFYGDAA 168 (479) Q Consensus 95 g~----~~fv~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~-~~lv-~~~~dp~~~~~~~ai~~~~~~~e~a~f~Gd~~ 168 (479) +- ..+..| |+++ +..+..++...++-++....|... +++. ++..|-.++|.+-.|..+...+|+.+++||++ T Consensus 57 ~~~~v~~~~~~~-g~~~-~~~t~~~~~~~L~i~~g~~~Vd~~i~dl~~~~~~dq~~~Ql~~~iea~~~~~e~~lINGD~a 134 (310) T protein:vir:97 57 IMAGVGTTFSGA-GAGK-AAATFTKVNSNLTTIMGDAEVNGLIQATRSGDGNDQTAVQIASKAKSAGRKYQDQLINGNGA 134 (310) T ss_pred ccccccccccCC-Cccc-cccccceeeeeeeeeeehhhhhhHHHhhhcCChHHHHHHHHHHHHHHHHHHHHHHhhccccC Confidence 31 112222 2222 557788999999999999999875 6776 55789999999999999999999999999987 Q ss_pred cCCCCCCcccchhhhHHHhhccCCcEEEc--cCCCCCHHHhhhhhheeecccCceeeeecChHHHhhHHHhhc--CceeE Q lcl|NC_018863. 169 LAAEADNQAGIEFDGLTKLIDEATNVIDL--KGERLDEATLNKAAVIVGKGYGRATDAFMPIGVQADFTNNLL--DRQRV 244 (479) Q Consensus 169 l~~~~~~~~gleFDGl~~~I~~~~NviDa--rG~~l~~~~l~~aa~~i~~~fG~atd~~mp~~vka~f~q~~~--~~qrv 244 (479) -+ |||||.+.++ +.++||+ +|+.|+.+.|.++-..+-+.=|.+.-++||+.+...+..... .+.-+ T Consensus 135 ~n---------~F~GL~~~~~-~~q~i~~~~~gg~~t~d~LDeLl~~v~~~~g~p~~~l~~~~~~r~i~A~~R~~~~~g~ 204 (310) T protein:vir:97 135 GN---------EFAGLIQLCA-SGQKATTGATGSAISFAILDELMDLVVDKDGQVDYLTMHARTLRSYKALLRALGGASI 204 (310) T ss_pred CC---------cccchhhcCC-ccceeecCCCCCCCCHHHHHHHHHHHhcCCCCCCEEEecHHHHHHHHHHHHHhcCCCC Confidence 32 5999999997 6899998 779999999998776665556778889999987666655442 22222 Q ss_pred EeecCCCccccCccccceecCceeEEecCCcccCCCccccCcccCCCCCcccceEEEeecccccCcccccccceeeEEEE Q lcl|NC_018863. 245 IQPSQAGGFSTGFSINQFLSTRGAINLHGSTIMENDNILVDRIPEPNAPQAPASVVATVKVNDKGAFRPVKDIKTHSYKV 324 (479) Q Consensus 245 ~~~~n~g~~~~G~~V~~~~ss~g~I~L~~s~v~~a~~~lver~~s~~aP~~P~~vta~~~~~~~g~~~~~sd~g~Y~YkV 324 (479) ..+. -...|..|.+|.+.. |- . .+..+ ++-++.. + .| -+--|.| T Consensus 205 ~~~~---~~~~G~~v~~~~GiP--i~-------~-----~d~ip--------~~~~~~~-~--~g--------tTsIya~ 248 (310) T protein:vir:97 205 NEVV---ELPSGAEVPAYSGTP--IF-------R-----NDYIP--------TNQTKGG-T--TG--------CTTIFAG 248 (310) T ss_pred CCcc---ccCCCCEEeeeCCeE--EE-------E-----eCccC--------CCccccc-c--CC--------ceeEEEE Confidence 1110 112355555555531 11 0 01111 0000000 0 00 0112222 Q ss_pred EEEcCCCCcccccceeeeeecCCCeEEEEEeecCCccccceEEEEEeccCCCCcEEEEEEeeeeeccCCCeeEEEeccCC Q lcl|NC_018863. 325 VVHSDDAESLASEAVTAVVANPTDSVSLAVKLQSLYQAKPQFISVYRQGNETGHYFLVARVPLSKADENGVITFVDRNQV 404 (479) Q Consensus 325 ~a~n~~GES~~S~~VtaT~a~~~~~V~LtIt~~~~~~~~~~y~~IYR~t~~~g~~~~i~rV~~s~~n~~~tttf~D~N~~ 404 (479) .. .+.... . +-++.. ..+ T Consensus 249 r~--------------------------------------------Ge~~~~--~---Gv~Gl~---~~~---------- 266 (310) T protein:vir:97 249 TL--------------------------------------------DDGSRT--H---GIAGLT---ATQ---------- 266 (310) T ss_pred ee--------------------------------------------Cccccc--c---ceeccc---cCC---------- Confidence 21 111000 0 000000 000 Q ss_pred CCCccceeeccccHHHHHHHHhccccccCccccCchhHHHHHhhhhhheeccceeEEEEeccc Q lcl|NC_018863. 405 IPETTDVFIGELTPQVISLLELLPMMKLPLAQMNATTTFTVLWYGALALYAPKKWVRIKNVQY 467 (479) Q Consensus 405 iPgT~~~fvge~~~q~i~l~ellPm~k~Pla~~~~~~~~~V~~yg~L~l~aPkk~~~ikNV~~ 467 (479) -||-.-=|||+++. .+...|.|.||..+++.-|+...+++||-= T Consensus 267 ~~glsVr~~G~~~~-------------------~~v~~~~V~~Y~~~av~~~~A~a~L~~V~~ 310 (310) T protein:vir:97 267 AAGIQVVDVGESED-------------------SDEHIWRVKWYCGLALFSEKGLACADGITN 310 (310) T ss_pred ccceeEEeCCcccC-------------------CcceeEEEEEeeeEEEecccceeeeccccC Confidence 13322225554332 245678899999999999999999999987 No 12 >protein:vir:93631 Length: 580 # NCBI annotation: Bcep22gp67 # Family: family:all:1544 # MgeID: mge:1470 # MgeName: Bcep22 # Cross-refs: genbank:acc:NP_944296;genbank:gi:38640373;genbank:GeneID:2658280 Probab=98.48 E-value=4.4e-09 Score=66.41 Aligned_cols=261 Identities=16% Similarity=0.130 Sum_probs=111.5 Q ss_pred HHhhccCCcEEEccCCCC--CHHHhhh----hhheeecccCceeeeecChHHHhhHHHhhcCceeEEeecCCCccccCcc Q lcl|NC_018863. 185 TKLIDEATNVIDLKGERL--DEATLNK----AAVIVGKGYGRATDAFMPIGVQADFTNNLLDRQRVIQPSQAGGFSTGFS 258 (479) Q Consensus 185 ~~~I~~~~NviDarG~~l--~~~~l~~----aa~~i~~~fG~atd~~mp~~vka~f~q~~~~~qrv~~~~n~g~~~~G~~ 258 (479) -.. =.+...+|+-+ -+.+|-+ .|....-.+|..+=.-=|-.+.... ..-.+..+-+-..+. T Consensus 1 M~~----i~i~~f~Ge~Prl~p~lLP~~~a~~a~n~~~~~G~i~P~~~~~~~~~~~-~i~~~~~~t~~~~~~-------- 67 (580) T protein:vir:93 1 MTI----IKITGFSGEIPRLVPRLLPDTAAQNATNARLESGGLTPYRKPKFITRIS-TIPAGQIETIYRNGE-------- 67 (580) T ss_pred Cee----EeecccccccccchhhhccccccceEEeeeccCCeeeeeeCchhhcccc-ccCcCcceEEEecCc-------- Confidence 222 23566777654 4555553 2345555567666443321110000 001111111111110 Q ss_pred ccceecCceeEEecCCcccCCCccccC-c---ccCCCCCcc---cceEEEeecccccCcccccccceeeEEEEEEEcCCC Q lcl|NC_018863. 259 INQFLSTRGAINLHGSTIMENDNILVD-R---IPEPNAPQA---PASVVATVKVNDKGAFRPVKDIKTHSYKVVVHSDDA 331 (479) Q Consensus 259 V~~~~ss~g~I~L~~s~v~~a~~~lve-r---~~s~~aP~~---P~~vta~~~~~~~g~~~~~sd~g~Y~YkV~a~n~~G 331 (479) .|.+=.+.++.-.++|-++..|+-. + .+.+.+++. |.-..|-..+...++ ..+.++|+|+++-++.+| T Consensus 68 --~W~~w~~~V~~i~~PvA~DRvy~Td~g~Pkvt~~g~sy~lgVpaPs~Apt~~~~g~g---~l~~~~y~Yv~TfVt~~G 142 (580) T protein:vir:93 68 --TWMAWDKPVYAAPGPVAADRLYVMGDGAPKMIVGGTTYPLAVPMPSAALTAATSGTG---TGDVFSRVYVYTFVTGFG 142 (580) T ss_pred --eeEEeCCceeeecCccccceeEEcCCcccceecCCccccccCCCcccCceeeecCCC---CcCccceEEEEEEEcCCC Confidence 1111111222222223222222221 1 122222220 111111111111111 345689999999999888 Q ss_pred -CcccccceeeeeecCCCeEEEEEeecCCccccceEEEEEeccCC--CCcEEEEEEeeeeeccCCCeeEEEeccCCCCCc Q lcl|NC_018863. 332 -ESLASEAVTAVVANPTDSVSLAVKLQSLYQAKPQFISVYRQGNE--TGHYFLVARVPLSKADENGVITFVDRNQVIPET 408 (479) Q Consensus 332 -ES~~S~~VtaT~a~~~~~V~LtIt~~~~~~~~~~y~~IYR~t~~--~g~~~~i~rV~~s~~n~~~tttf~D~N~~iPgT 408 (479) ||.||.+...++...+++|+|+-.+.+.++..-+-+.|||+..+ ++.|++++.++. ++++|+|..... T Consensus 143 eES~PS~~S~~vtv~~g~tVtLs~~p~p~~~~~i~~~RIYRS~tG~~gtdy~lVAel~A------g~~sF~Dd~s~a--- 213 (580) T protein:vir:93 143 EESEPSAISNEVNWQAGQTVTLSGFQAAPAGRNITKQRIYRSQTSLSGTDLYFIAERDA------SAANFVDNVPLS--- 213 (580) T ss_pred CcCCCcccccceeeCCCCeEEEEecCCCCCCCccceEEEEEeccCCCceeEEEEeeecc------ceeeeeeccccc--- Confidence 99999888888778888999997777676666677899998865 359999999973 578999987431 Q ss_pred cceeeccccHHHHHH----HHhccccccCccccCchhHHHHHhhhhhheeccceeEE-E-Eec-cccCccccccc----- Q lcl|NC_018863. 409 TDVFIGELTPQVISL----LELLPMMKLPLAQMNATTTFTVLWYGALALYAPKKWVR-I-KNV-QYIPALAADVT----- 476 (479) Q Consensus 409 ~~~fvge~~~q~i~l----~ellPm~k~Pla~~~~~~~~~V~~yg~L~l~aPkk~~~-i-kNV-~~~~~~~~~~~----- 476 (479) =+||.=| +-.| ..|..|.-||.+-+ +-..+-..||.=. +.|.-|-. + ..+ ..+-|+|+.-+ T Consensus 214 ---~Lge~Lp-s~~~~~PP~~m~gL~~m~nGi~-agF~Gnev~fsEp--y~P~AWP~~yr~t~~~~Ivaia~~g~~LvV~ 286 (580) T protein:vir:93 214 ---DQNEPLP-SLEWNAPPDDLTGLISLPNGMM-AAFRGKELWLCEP--WRPHAWPQKYVLTMDYNIVALGAYGTTIVVA 286 (580) T ss_pred ---ccccccc-hhhccCcCCCcceEEeeccceE-EEEeCCEEEEecC--CCCccchhhcCCCCCCCceeEeeeCceEEEE Confidence 1121111 1111 01111222332100 1000111111100 22222211 0 000 01112222111 Q ss_pred --cCC Q lcl|NC_018863. 477 --YRP 479 (479) Q Consensus 477 --~~~ 479 (479) =+| T Consensus 287 T~g~p 291 (580) T protein:vir:93 287 TDGQP 291 (580) T ss_pred EcCce Confidence 111 No 13 >protein:vir:5120 Length: 615 # NCBI annotation: unknown # Family: family:all:1544 # MgeID: mge:114 # MgeName: PBC5 # Cross-refs: genbank:acc:NP_542277;genbank:gi:18071220;genbank:GeneID:929342 Probab=98.11 E-value=1.7e-07 Score=57.70 Aligned_cols=264 Identities=17% Similarity=0.174 Sum_probs=112.0 Q ss_pred cCCCCCCcccc-hh---hhHHHhhc-----c-CCcEEEccCCCCC--HHHhhh----hhheeecccCceeeeecChHHHh Q lcl|NC_018863. 169 LAAEADNQAGI-EF---DGLTKLID-----E-ATNVIDLKGERLD--EATLNK----AAVIVGKGYGRATDAFMPIGVQA 232 (479) Q Consensus 169 l~~~~~~~~gl-eF---DGl~~~I~-----~-~~NviDarG~~l~--~~~l~~----aa~~i~~~fG~atd~~mp~~vka 232 (479) ...-+.. .|+ .- .-|+--++ + .=.+...+|+-+- +.+|-+ .|....-.+|..+=...|-.+.. T Consensus 1 ~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~M~~I~i~~f~Ge~Prl~P~lLP~~~A~~A~N~~~~~G~ltP~~~~~~~~~ 79 (615) T protein:vir:51 1 MVSTGTR-RGTLRSRAPSRLHCYLKQGYLGMVAIKISAFAGEQPMLLPRLLPETGATAAMNVRLNDGGLTPINKPIEVAT 79 (615) T ss_pred Ccccccc-cceecccCcceeeeeeecCceeeEEEeecccccccccchhhhccCcccceEEeeeecCCeeeeecCcccccc Confidence 1111100 000 00 00000111 0 0123445554432 333332 12333333454444333332222 Q ss_pred hHHHhhcCceeEEeecCCCccccCccccceecCceeEEecCCcccCCCcccc-Cc--------ccCCCCCcccceEEEee Q lcl|NC_018863. 233 DFTNNLLDRQRVIQPSQAGGFSTGFSINQFLSTRGAINLHGSTIMENDNILV-DR--------IPEPNAPQAPASVVATV 303 (479) Q Consensus 233 ~f~q~~~~~qrv~~~~n~g~~~~G~~V~~~~ss~g~I~L~~s~v~~a~~~lv-er--------~~s~~aP~~P~~vta~~ 303 (479) .++- +.+....-. ..|.+-++.++.-.++|-++..|+. ++ ..-..+-+.|+.+..++ T Consensus 80 ~~~~---~~~Tif~~~-----------~~W~~w~~~V~av~sPvA~DRvy~tgdg~Pkv~~~~~sY~LgVpaPs~ap~~~ 145 (615) T protein:vir:51 80 IATA---SQKTIYRHQ-----------GSWLSWPNVVNAVPGPVAQDRLYFTGDGAPKVKIGGVDYALKVPRPTGALTAA 145 (615) T ss_pred cccc---cceeeeeec-----------CceeccCCceeEccCCcccceeEEcCCCcceEeecccCccccccCCCccceEE Confidence 2110 000000000 1122323333433344443322222 11 11112222222221111 Q ss_pred cccccCcccccccceeeEEEEEEEcC-CCCcccccceeeeeecCCCeEEEEEeecCCccccceEEEEEeccCC--CCcEE Q lcl|NC_018863. 304 KVNDKGAFRPVKDIKTHSYKVVVHSD-DAESLASEAVTAVVANPTDSVSLAVKLQSLYQAKPQFISVYRQGNE--TGHYF 380 (479) Q Consensus 304 ~~~~~g~~~~~sd~g~Y~YkV~a~n~-~GES~~S~~VtaT~a~~~~~V~LtIt~~~~~~~~~~y~~IYR~t~~--~g~~~ 380 (479) ....| ..|..+++|+++-++. +.||+||++....+...+++|+|+-.+++.++..-+.+.|||+..+ +..|+ T Consensus 146 -~~g~g----~~d~etr~Yv~TfVt~~GeES~PSp~S~~v~v~~g~tVtLs~~pa~~~~~~i~~rRIYRS~tg~~gtdy~ 220 (615) T protein:vir:51 146 -LSGTG----SGDIQSRTYVYTWVTSFGEESAPCPASIIVDWKPGQTVTLSGFAATPGGRSITTQRIYRSQTGKTGTGLY 220 (615) T ss_pred -ecCCC----CccccceEEEEEEEcCCCCcCCCCccceeeEecCCCeEEEeeccCCcCCCceeeEEEEEeccCCCceeeE Confidence 11112 2356789999997776 7789999888888888899999998888888777778899998865 45899 Q ss_pred EEEEeeeeeccCCCeeEEEeccCCCCCccceeeccccHHHHHHHHhccccc--cCccccCchhHHHHHhhhhhhe-eccc Q lcl|NC_018863. 381 LVARVPLSKADENGVITFVDRNQVIPETTDVFIGELTPQVISLLELLPMMK--LPLAQMNATTTFTVLWYGALAL-YAPK 457 (479) Q Consensus 381 ~i~rV~~s~~n~~~tttf~D~N~~iPgT~~~fvge~~~q~i~l~ellPm~k--~Pla~~~~~~~~~V~~yg~L~l-~aPk 457 (479) +++.++. ++++|+|.... ..|-+.||=.- .| +....|++++=...+- ++=+ T Consensus 221 lVAel~a------s~~sf~D~~~~----------------~~Lg~~Lps~~w~~P----P~~l~GL~~m~NGimAgF~Gn 274 (615) T protein:vir:51 221 LIAERAA------SAGNFTDNIAV----------------DQFQEPLPSADWNEP----PDGLAGLAEMPNGMMAAFVGR 274 (615) T ss_pred EEeeecc------cceeeeeccch----------------hhcCcccccccccCc----CcchhhhhccccceEEeecCC Confidence 9999984 47889998621 11222222111 12 2334444443332222 2211 Q ss_pred eeEEEE--------eccccC-------ccccc-------cccCC Q lcl|NC_018863. 458 KWVRIK--------NVQYIP-------ALAAD-------VTYRP 479 (479) Q Consensus 458 k~~~ik--------NV~~~~-------~~~~~-------~~~~~ 479 (479) =++|. ..+|.- |+|+= +.=.| T Consensus 275 -eV~FsEpy~PyAWP~~Yr~t~d~dIVaiA~~gt~LVV~TkG~P 317 (615) T protein:vir:51 275 -SIYFCEPYRPHAWPEKYSRNVGSDIVGIAALGSILVVVTKGKP 317 (615) T ss_pred -EEEEecCCCCcccchhcccCcCCCeeEEEecccEEEEEEcCce Confidence 11111 011111 11110 00011 No 14 >protein:vir:7771 Length: 330 # NCBI annotation: gp17 # Family: family:all:507 # MgeID: mge:149 # MgeName: Bxz2 # Cross-refs: genbank:acc:NP_817605;genbank:gi:29566035;genbank:GeneID:1259229 Probab=97.85 E-value=1.1e-05 Score=47.78 Aligned_cols=321 Identities=11% Similarity=0.054 Sum_probs=150.4 Q ss_pred cCchhHHHHHHHHHHHhhcCcccCcccccCccccchhhhHHHHHHHhhccccccchhhhccchhHHHHHHhhhhhccCcc Q lcl|NC_018863. 15 LPAGAEAELAELVSKSFTTGTGITPDTQHDAAALRRELLDDQVKMLAFTNGDFTIYPLINKQQVNSTVAKYAVFNQHGRT 94 (479) Q Consensus 15 ~~~~~~~~~~e~~~Ksf~ag~~~~~~~~~~gaAlr~esld~~i~~l~~~~~~f~~~~~i~k~~~~stv~~y~~~~~~G~~ 94 (479) |.... .|+.+ .+ .+..+|+.+..|..++ +..+.... -.+.+.+...+..+.-.+|.+. .+. T Consensus 1 m~~~~--------~~a~~---~~--~t~~~g~~i~~~~~~~-ii~~~~~~--s~l~~~~~~~~~~~~~~~~p~~---~~~ 61 (330) T protein:vir:77 1 MAGST--------VPSTQ---VA--LTGDFSAFLTPEQSQD-YFAEIEKT--SIVQRIARKVPMGPTGISIPHW---TGA 61 (330) T ss_pred Ccccc--------cchhh---cc--ccCCCcceechhHHHH-HHHHHHhc--cchhhhcceeeccCCceEEEEE---cCC Confidence 11100 11111 11 1334466666666554 43443332 2355555555555544445444 344 Q ss_pred cccccccccccccccCcceEEEEEEEEeeeehhhhhhhHhhhcchhhHHHHHHHHHHHHHHHHHHHHHhhcccccCCCCC Q lcl|NC_018863. 95 GHSRFVREVGVASINDPNIRQKTVQMKFLSDTKQQSLAAGLVNNIADPMTILTEDAISVIAKSIEWAIFYGDAALAAEAD 174 (479) Q Consensus 95 g~~~fv~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lv~~~~dp~~~~~~~ai~~~~~~~e~a~f~Gd~~l~~~~~ 174 (479) +...|++|++..+.+++.+.+....++=++.--.+|.-+ +.++..|.+....+.-...+++.+|.++|+|+.+ T Consensus 62 ~~a~~v~Eg~~~~~~~~~f~~i~~~~~k~~~~~~is~el-l~ds~~~~~~~i~~~l~~ai~~~~~~~~l~G~g~------ 134 (330) T protein:vir:77 62 VSASWTGEAERKPITKGSFGKQELEPVKITTIFAESAEV-VRLNPLNYLNTMRTKIAEAIALKFDAAAIHGIDK------ 134 (330) T ss_pred cceeEecCCCccccccceeeEEEEeEEEEEEeehhhHHH-HhcchHHHHHHHHHHHHHHHHHHHHHHhhcccCC------ Confidence 456799999999999999999999999999888888853 3456678889999999999999999999999875 Q ss_pred CcccchhhhHHHhhccCCcEEEccC---CCCC---HHHhhhhhheeecccCceeeeecChHHHhhHHHhhcCceeEEeec Q lcl|NC_018863. 175 NQAGIEFDGLTKLIDEATNVIDLKG---ERLD---EATLNKAAVIVGKGYGRATDAFMPIGVQADFTNNLLDRQRVIQPS 248 (479) Q Consensus 175 ~~~gleFDGl~~~I~~~~NviDarG---~~l~---~~~l~~aa~~i~~~fG~atd~~mp~~vka~f~q~~~~~qrv~~~~ 248 (479) |-+++|+.+.+....++.+..+ ...+ .+.|.++-..+.+.+...+-.+|++.+.+.+...-....|.+.+. T Consensus 135 ---~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~vmn~~~~~~l~~lkd~~G~~l~~~ 211 (330) T protein:vir:77 135 ---PSAFKGYLAETTKVVSLADTNLTTASGPQGNAYLAVNNALSLLVNSGKKWTGTLLDNVTEPILNTAVDGNGRPLFVE 211 (330) T ss_pred ---CCccccccccccccceeecccccccccccchhHHHHHHHHHhhhhcCCCccEEEEcHHHHHHHHHHhccCCceeecC Confidence 2346788877753333332222 1222 334555555566778888889999999999887655444444332 Q ss_pred CCCccccCccccceecCceeEEecCCcccCCCccccCcccCCCCCcccceEEEeecccccCcccccccceeeEEEEEEEc Q lcl|NC_018863. 249 QAGGFSTGFSINQFLSTRGAINLHGSTIMENDNILVDRIPEPNAPQAPASVVATVKVNDKGAFRPVKDIKTHSYKVVVHS 328 (479) Q Consensus 249 n~g~~~~G~~V~~~~ss~g~I~L~~s~v~~a~~~lver~~s~~aP~~P~~vta~~~~~~~g~~~~~sd~g~Y~YkV~a~n 328 (479) +...... ....+.++++.+.+..+..+...+ ..+.-.+ .|..++.+. .. T Consensus 212 ~~~~~~~-------------~~~~~~~l~G~PV~~~~~~p~~~~-------------~~~~~~~----~gd~s~~~i-~~ 260 (330) T protein:vir:77 212 STYTEQV-------------GAIREGRILGRPTYVADNVVNGTV-------------GNRVVGV----MGDFSQVIW-GQ 260 (330) T ss_pred ccccccc-------------cccCCceecceeeEEeccccCCCC-------------CCccEEE----EEecceEEE-EE Confidence 2211110 011223334444333332221000 0000001 122222221 11 Q ss_pred CCCCccc-ccceeeeeecCCCeEEEEEeecCCccccceEEEEEeccCCCCcEEEEEEeeeeeccCCCeeEEEec--cCCC Q lcl|NC_018863. 329 DDAESLA-SEAVTAVVANPTDSVSLAVKLQSLYQAKPQFISVYRQGNETGHYFLVARVPLSKADENGVITFVDR--NQVI 405 (479) Q Consensus 329 ~~GES~~-S~~VtaT~a~~~~~V~LtIt~~~~~~~~~~y~~IYR~t~~~g~~~~i~rV~~s~~n~~~tttf~D~--N~~i 405 (479) ..|-+.- +.......-...+..... ..+.-|.+.. -.|+.+.|+...-.+......-+.. +.+ T Consensus 261 ~~~~~i~~~~e~~~~~~~~~~~~~~~-----------~~~~~f~~~~--~~~r~~~r~d~~v~~~~a~~~i~~~~~~~~- 326 (330) T protein:vir:77 261 IGGLSFDVTDQATLDFGEEQGGVWVP-----------KLISLWQHNM--VAVRCEAEFAFMVNDKDAFVKLTDQVAGTD- 326 (330) T ss_pred ecCcEEEEeecceeeecccccccccc-----------cccchhhcCc--EEEEEEEEeccEEecccceEEEEeccCCcC- Confidence 1111111 111111100000000000 0011111100 1111112221111111111110000 001 Q ss_pred CCcc Q lcl|NC_018863. 406 PETT 409 (479) Q Consensus 406 PgT~ 409 (479) |+-- T Consensus 327 ~~~~ 330 (330) T protein:vir:77 327 PEEE 330 (330) T ss_pred CCCC Confidence 1000 No 15 >protein:vir:8102 Length: 543 # NCBI annotation: gp6 # Family: family:all:21 # MgeID: mge:152 # MgeName: Che9c # Cross-refs: genbank:acc:NP_817683;genbank:gi:29566114;genbank:GeneID:1259308 Probab=97.75 E-value=7.6e-06 Score=48.64 Aligned_cols=324 Identities=12% Similarity=0.026 Sum_probs=147.0 Q ss_pred Ccccccc---cceeeee--cCch---------------hHHHHHHHHHHHhhcCcccCcccccCccccchhhhHHHH-HH Q lcl|NC_018863. 1 MTELQKE---QKVEARK--LPAG---------------AEAELAELVSKSFTTGTGITPDTQHDAAALRRELLDDQV-KM 59 (479) Q Consensus 1 ~~~~~~~---~~~~~~~--~~~~---------------~~~~~~e~~~Ksf~ag~~~~~~~~~~gaAlr~esld~~i-~~ 59 (479) ..++.+. .+-...+ .... ....+-..-.+++...-. ...+..+|+.|..+.+..++ .. T Consensus 195 ~~~~~~~~d~~e~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~l~~~e~~~~~~~~~-~~~t~~~gg~lip~~~~~~ii~~ 273 (543) T protein:vir:81 195 ATKIIERFDDEDSTLARQCLATSSPAYLRAWSKMARNPHAAILTEEEKRAINEVRA-MGLTKADGGYLVPFQLDPTVIIT 273 (543) T ss_pred HHHHHHHHHHHHHHHhhhhhhhhhhhhhhHHHHHHHhhHHHHhhhhhhhhhhhhhh-cccccccCcccCchhhhhHHHHH Confidence 0000000 0000000 0000 000110111122221111 11234567777777766554 23 Q ss_pred Hhhccccccchhhhcc-chhHHHHHHhhhhhccCcccccccccccccccccCcceEEEEEEEEeeeehhhhhhhHhhhcc Q lcl|NC_018863. 60 LAFTNGDFTIYPLINK-QQVNSTVAKYAVFNQHGRTGHSRFVREVGVASINDPNIRQKTVQMKFLSDTKQQSLAAGLVNN 138 (479) Q Consensus 60 l~~~~~~f~~~~~i~k-~~~~stv~~y~~~~~~G~~g~~~fv~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lv~~ 138 (479) +... ...+..+.+ ......+. | ....+.+...+++|++..+.+++.+......++-++.-..+|.-+ +.++ T Consensus 274 ~~~~---~~~l~~~~~~~~~~g~~~-~---~~~~~~~~a~~v~Eg~~~~~~~~~~~~i~~~~~k~~~~~~is~el-l~d~ 345 (543) T protein:vir:81 274 SNGS---LNDIRRFARQVVATGDVW-H---GVSSAAVQWSWDAEFEEVSDDSPEFGQPEIPVKKAQGFVPISIEA-LQDE 345 (543) T ss_pred HHhh---hchhhhhcccccCCcceE-E---EEecCCcceeecccCccccccccccceeeeeeeeeEeeehhhHHH-Hhcc Confidence 2222 222222222 22222222 1 122234467799999999999999999999999999998999864 2344 Q ss_pred hhhHHHHHHHHHHHHHHHHHHHHHhhcccccCCCCCCcccchhhhHHHhhcc-CCcEEEccCCCCCHHHhhhhhheeecc Q lcl|NC_018863. 139 IADPMTILTEDAISVIAKSIEWAIFYGDAALAAEADNQAGIEFDGLTKLIDE-ATNVIDLKGERLDEATLNKAAVIVGKG 217 (479) Q Consensus 139 ~~dp~~~~~~~ai~~~~~~~e~a~f~Gd~~l~~~~~~~~gleFDGl~~~I~~-~~NviDarG~~l~~~~l~~aa~~i~~~ 217 (479) .|.+....+.-...++..++.++|+||-. |-++.|+.+.... ...+..+.+..++.+.+.++...+..+ T Consensus 346 -~~~~~~i~~~l~~~~~~~~d~ail~G~Gt---------~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~ 415 (543) T protein:vir:81 346 -ANVTETVALLFAEGKDELEAVTLTTGTGQ---------GNQPTGIVTALAGTAAEIAPVTAETFALADVYAVYEQLAAR 415 (543) T ss_pred -HHHHHHHHHHHHHHHHHHHHHHHhccCCC---------CcccccchhhcccccccccccccccccHHHHHHHHHhhhcc Confidence 58899999999999999999999999743 2256788876532 233556666777777777776667777 Q ss_pred cCceeeeecChHHHhhHHHhhcCceeEEeec-CCCccccCccccceecCceeEEecCCcccCCCccccCcccCCCCCccc Q lcl|NC_018863. 218 YGRATDAFMPIGVQADFTNNLLDRQRVIQPS-QAGGFSTGFSINQFLSTRGAINLHGSTIMENDNILVDRIPEPNAPQAP 296 (479) Q Consensus 218 fG~atd~~mp~~vka~f~q~~~~~qrv~~~~-n~g~~~~G~~V~~~~ss~g~I~L~~s~v~~a~~~lver~~s~~aP~~P 296 (479) |.....++|++.+.+.+...-...-+.+.++ ..|. +.++++.+.+..+..+....+... T Consensus 416 ~~~~~~~v~n~~~~~~l~~lkd~~G~~l~~~~~~g~--------------------~~~l~G~pv~~~~~~~~~~~~~~~ 475 (543) T protein:vir:81 416 HRRQGAWLANNLIYNKIRQFDTQGGAGLWTTIGNGE--------------------PSQLLGRPVGEAEAMDANWNTSAS 475 (543) T ss_pred ccCCcEEEEcHHHHHHHHHhhcCCCceeccCcCCCC--------------------CccccceeeEEecccccccccccc Confidence 8777778999999998887554332333211 1110 112233333333322221111100 Q ss_pred ceEEEeecccccCcccccccceeeEEEEEEEcCCCCccc-ccceeeeeecCCCeEEEEEeecCCccccceEEEEEeccCC Q lcl|NC_018863. 297 ASVVATVKVNDKGAFRPVKDIKTHSYKVVVHSDDAESLA-SEAVTAVVANPTDSVSLAVKLQSLYQAKPQFISVYRQGNE 375 (479) Q Consensus 297 ~~vta~~~~~~~g~~~~~sd~g~Y~YkV~a~n~~GES~~-S~~VtaT~a~~~~~V~LtIt~~~~~~~~~~y~~IYR~t~~ 375 (479) .... .-.. |..++.+... ..|-+.- +.....+.-...+.+.+.+.-- .+ +.|.+..+ T Consensus 476 ~~~~----~i~~---------gd~~~~~i~~-~~~~~i~~~~~~~~~~~~~~~~~~~~~~~r-~d------~~v~~~~A- 533 (543) T protein:vir:81 476 ADNF----VLLY---------GNFQNYVIAD-RIGMTVEFIPHLFGTNRRPNGSRGWFAYYR-MG------ADVVNPNA- 533 (543) T ss_pred CCcc----eEEE---------eeccceeEEe-ecccEEEEeccccccchhhcCceEEEEEEe-ec------cEeecccc- Confidence 0000 0000 1112111111 1110000 0000000001111121111100 00 01111111 Q ss_pred CCcEEEEEEeeeee Q lcl|NC_018863. 376 TGHYFLVARVPLSK 389 (479) Q Consensus 376 ~g~~~~i~rV~~s~ 389 (479) +.+.+++.+. T Consensus 534 ----~~~l~~~~~a 543 (543) T protein:vir:81 534 ----FRLLNVETAS 543 (543) T ss_pred ----eEEEEecccC Confidence 1122222211 No 16 >protein:vir:8187 Length: 311 # NCBI annotation: gp7 # Family: family:all:966 # MgeID: mge:153 # MgeName: Che9d # Cross-refs: genbank:acc:NP_817980;genbank:gi:29566414;genbank:GeneID:2700968 Probab=97.64 E-value=1.5e-05 Score=47.01 Aligned_cols=307 Identities=11% Similarity=-0.006 Sum_probs=149.3 Q ss_pred cccccCccccchhhhHHHHHHHhhccccccchhhhccchhHHHHHHhhhhhccCcccccccccccccccccCcceEEEEE Q lcl|NC_018863. 39 PDTQHDAAALRRELLDDQVKMLAFTNGDFTIYPLINKQQVNSTVAKYAVFNQHGRTGHSRFVREVGVASINDPNIRQKTV 118 (479) Q Consensus 39 ~~~~~~gaAlr~esld~~i~~l~~~~~~f~~~~~i~k~~~~stv~~y~~~~~~G~~g~~~fv~E~g~~~~~d~~~~r~~~ 118 (479) =.+...|+.|-.+.+.++|........ .+.+-....+..+---+|.++ .+.....+++|++..+.+++.+..... T Consensus 1 mat~~~gg~lvP~~~~~~ii~~~~~~s--~i~~~~~~i~~~~~~~~~p~~---~~~~~a~wv~Eg~~~~~~~~~f~~v~l 75 (311) T protein:vir:81 1 MVALATGTFQLPKHLVPGVWQKAQGQS--VLARLSMAEPQEFGEQQYMTL---TAPPRGEVVGEGAQKSESTATFAPVTA 75 (311) T ss_pred CceecCCceEcchhHHHHHHHHHHhcc--hhhhhcceeecCCCceEEEEE---eCCceeEEeecCcccccccceeeEEEE Confidence 334566778888888888866655543 233333333333222233332 334457799999999999999999999 Q ss_pred EEEeeeehhhhhhhHhhh--cchhhHHHHHHHHHHHHHHHHHHHHHhhcccccCCCCCCcccchhhhHHHhhccCCcEEE Q lcl|NC_018863. 119 QMKFLSDTKQQSLAAGLV--NNIADPMTILTEDAISVIAKSIEWAIFYGDAALAAEADNQAGIEFDGLTKLIDEATNVID 196 (479) Q Consensus 119 ~~k~l~~~~~vs~~~~lv--~~~~dp~~~~~~~ai~~~~~~~e~a~f~Gd~~l~~~~~~~~gleFDGl~~~I~~~~NviD 196 (479) ..+=++.--.+|.-+-.. ....+.+....+.....+++.++.++|+|+.+-. |..+.|+.+.+-...+++. T Consensus 76 ~~~kl~~~~~iS~ell~~~~d~~~~l~~~i~~~la~ai~~~~d~a~l~G~~~~~-------~~~~~gi~~~~~~~~~~~~ 148 (311) T protein:vir:81 76 IPRKVQVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLT-------GAALSGSPAKILDTTNIVE 148 (311) T ss_pred eeEEEEEeehhhHHHhhcCcccHHHHHHHHHHHHHHHHHHHHHHhhhccccCCC-------Ccccccccccccccceeee Confidence 999888777777664332 2344677888888889999999999999986422 4567899988866667776 Q ss_pred ccCCCC--CHHHhhhhhheeecccCceeeeecChHHHhhHHHhhcCceeEEeecCCCccccCccccceecCceeEEecCC Q lcl|NC_018863. 197 LKGERL--DEATLNKAAVIVGKGYGRATDAFMPIGVQADFTNNLLDRQRVIQPSQAGGFSTGFSINQFLSTRGAINLHGS 274 (479) Q Consensus 197 arG~~l--~~~~l~~aa~~i~~~fG~atd~~mp~~vka~f~q~~~~~qrv~~~~n~g~~~~G~~V~~~~ss~g~I~L~~s 274 (479) ..+.-. ....|.++-..+..+.+.++...|++.+...+...-...-|.+.+.... + -.+. T Consensus 149 ~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~vmn~~~~~~l~~lkd~~G~~l~~~~~~----~--------------~~~~ 210 (311) T protein:vir:81 149 LTTGTSATPDLAVEAAVGLVLGDNLSPDGVALDNTFSFMLATQRDSQGRKLYPELGF----G--------------TDVA 210 (311) T ss_pred ecccccchHHHHHHHHHHHhhhcCCCceEEEEcHHHHHHHHhhhccCCCeeecCccc----c--------------CCCc Confidence 655332 2344555555566667788889999999988866543322222222111 0 0011 Q ss_pred cccCCCccccCcccCCCCCcccceEEEeecccccCcccccccceeeEEEEEEEcCCCCcccccceeeeeecCCCeEEEEE Q lcl|NC_018863. 275 TIMENDNILVDRIPEPNAPQAPASVVATVKVNDKGAFRPVKDIKTHSYKVVVHSDDAESLASEAVTAVVANPTDSVSLAV 354 (479) Q Consensus 275 ~v~~a~~~lver~~s~~aP~~P~~vta~~~~~~~g~~~~~sd~g~Y~YkV~a~n~~GES~~S~~VtaT~a~~~~~V~LtI 354 (479) ++++.+.+..++.+.......+....... ...++..+ .|..++-+... -+.++|++ T Consensus 211 tl~G~Pv~~~~~i~~~~~~~~~~~~~~~~-~~~~~~~~----~gDfs~~~i~~-------------------~~~~~~~~ 266 (311) T protein:vir:81 211 SFAGLNAAVSDTVRGGPEAVTASTGVYRT-TNPNVKAI----AGDFSAFRWGV-------------------QVSIPLEL 266 (311) T ss_pred eecceeEEecccccccccccccccchhcc-cCCccEEE----EEecccEEEEE-------------------eccceEEE Confidence 22222333223222211111111111110 11111100 12222111100 00122222 Q ss_pred eecCCccccceEEEEEeccCCCCcEEEEEEeeeeeccCCCeeEEEeccCC Q lcl|NC_018863. 355 KLQSLYQAKPQFISVYRQGNETGHYFLVARVPLSKADENGVITFVDRNQV 404 (479) Q Consensus 355 t~~~~~~~~~~y~~IYR~t~~~g~~~~i~rV~~s~~n~~~tttf~D~N~~ 404 (479) ....-.... .+.|.+. --.|+-+.|+...-.+..+-...+|.++- T Consensus 267 ~~~~~~~~~---~~~~~~~--~v~~r~~~r~d~~v~~~~a~~~l~~a~~~ 311 (311) T protein:vir:81 267 IEFGDPDGL---GDLKRQN--QIAIRAEVVYGIGIMSTDAFAVVRDADES 311 (311) T ss_pred eccCCCCcc---hhhhhcC--cEEEEEEEEeccEeecccceEEEEeeccC Confidence 211000000 0000000 00111111111111111122222222221 No 17 >protein:vir:96392 Length: 324 # NCBI annotation: ORF011 # Family: family:all:507 # MgeID: mge:1613 # MgeName: 53 # Cross-refs: genbank:acc:YP_239648;genbank:gi:66395381;genbank:GeneID:5132868 Probab=97.56 E-value=2.5e-05 Score=45.85 Aligned_cols=317 Identities=11% Similarity=0.039 Sum_probs=152.9 Q ss_pred CcccccccceeeeecCchhHHHHHHHHH--HHhhcCcccCcccccCccccchhhhHHHHHHHhhccccccchhhhccchh Q lcl|NC_018863. 1 MTELQKEQKVEARKLPAGAEAELAELVS--KSFTTGTGITPDTQHDAAALRRELLDDQVKMLAFTNGDFTIYPLINKQQV 78 (479) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~e~~~--Ksf~ag~~~~~~~~~~gaAlr~esld~~i~~l~~~~~~f~~~~~i~k~~~ 78 (479) |-+.+|.+ .++.+++.... +.+++. +..+..+|+.|..+.+...|........ .++..+.+.++ T Consensus 1 ~~~~~~~~---------~~~~~~~~~~~~~~~~~a~---~~~~~~~~~~~iP~~~~~~ii~~~~~~s--~l~~l~~~~~~ 66 (324) T protein:vir:96 1 MEQTQKLK---------LNLQHFASNNVKPQVFNPD---NVMMHEKKDGTLMNEFTTPILQEVMENS--KIMQLGKYEPM 66 (324) T ss_pred CCcchhhh---------HHHHHHHHHhhhhhhhccc---cccccCcCccccchhHHHHHHHHHHhhc--hhhhhcceeec Confidence 54443222 11222222221 223332 2223345677888888887755554333 34555555555 Q ss_pred HHHHHHhhhhhccCcccccccccccccccccCcceEEEEEEEEeeeehhhhhhhHhhhcchhhHHHHHHHHHHHHHHHHH Q lcl|NC_018863. 79 NSTVAKYAVFNQHGRTGHSRFVREVGVASINDPNIRQKTVQMKFLSDTKQQSLAAGLVNNIADPMTILTEDAISVIAKSI 158 (479) Q Consensus 79 ~stv~~y~~~~~~G~~g~~~fv~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lv~~~~dp~~~~~~~ai~~~~~~~ 158 (479) .+.-.+|.++.. .+...+++|++..+..++.+.+.....+=++---.+|.-+- .++..|.+....+.--..++..+ T Consensus 67 ~~~~~~~p~~~~---~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~~~is~ell-~ds~~~l~~~i~~~la~ai~~~~ 142 (324) T protein:vir:96 67 EGTEKKFTFWAD---KPGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFL-NYTYSQFFEEMKPMIAEAFYKKF 142 (324) T ss_pred cCCceEEEEEec---CcceeEecCCccccccccceeEEEEeeEEEEEeehhhHHHH-hcchHHHHHHHHHHHHHHHHHHH Confidence 544344555442 34567999999999999999999999999998888887432 24456788888888889999999 Q ss_pred HHHHhhcccccCCCCCCcccchhhhHHHhhccCCcEEEccCCCCCHHHhhhhhheeecccCceeeeecChHHHhhHHHhh Q lcl|NC_018863. 159 EWAIFYGDAALAAEADNQAGIEFDGLTKLIDEATNVIDLKGERLDEATLNKAAVIVGKGYGRATDAFMPIGVQADFTNNL 238 (479) Q Consensus 159 e~a~f~Gd~~l~~~~~~~~gleFDGl~~~I~~~~NviDarG~~l~~~~l~~aa~~i~~~fG~atd~~mp~~vka~f~q~~ 238 (479) |.++|+|+-.= + +..|+.+.+.. .+... ...++.+.|.++.-.+..++..+.-..|++.+...+.+.- T Consensus 143 d~a~l~G~g~~---~------~~~gi~~~~~~-~~~~~--~~~~t~~~i~~~~~~l~~~~~~~~~~vmn~~~~~~L~~l~ 210 (324) T protein:vir:96 143 DEAGILNQGNN---P------FGKSIAQSIEK-TNKVI--KGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIV 210 (324) T ss_pred HHHHhccCCCC---C------cCccccccccc-cceec--cccccHHHHHHHHHhhhhccCCCCEEEEcHHHHHHHHHhh Confidence 99999997542 1 22466666542 33332 2345677777777677788888888999999999988776 Q ss_pred cCceeEEeecCCCccccCccccceecCceeEEecCCcccCCCccccCcccCCCCCcccceEEE--eecccccCc---ccc Q lcl|NC_018863. 239 LDRQRVIQPSQAGGFSTGFSINQFLSTRGAINLHGSTIMENDNILVDRIPEPNAPQAPASVVA--TVKVNDKGA---FRP 313 (479) Q Consensus 239 ~~~qrv~~~~n~g~~~~G~~V~~~~ss~g~I~L~~s~v~~a~~~lver~~s~~aP~~P~~vta--~~~~~~~g~---~~~ 313 (479) ...-|.+.++..+..-.|.+|....+. .+. .+..+..+....+...-. ......+.-+ +......+. .|. T Consensus 211 d~~G~~~~~~~~~~~l~G~PV~~~~~~--~~~-~~~~~~gd~~~~~~g~~~--~~~i~~~~~~~~~~~~~~~~~~~~~f~ 285 (324) T protein:vir:96 211 DPETKERIYDRNSDSLDGLPVVNLKSS--NLK-RGELITGDFDKLIYGIPQ--LIEYKIDETAQLSTVKNEDGTPVNLFE 285 (324) T ss_pred ccCCCeeecCCCCCcccceeeEeeCCC--CCC-cceEEEEecceEEEEEec--CcEEEEeecccccccccccccchhhhh Confidence 554455544433333344443211000 000 001111111111000000 0000000000 000000000 000 Q ss_pred cccceeeEEEEEEEcCCCCcccccceeeeeecCCCeEEEEEeecCC Q lcl|NC_018863. 314 VKDIKTHSYKVVVHSDDAESLASEAVTAVVANPTDSVSLAVKLQSL 359 (479) Q Consensus 314 ~sd~g~Y~YkV~a~n~~GES~~S~~VtaT~a~~~~~V~LtIt~~~~ 359 (479) .+ --.|++...=+.+--.+...+..+.+.....+ ||... T Consensus 286 -~d--~~~~r~~~r~d~~v~~~~A~~~l~~a~~~~~~----~~~~~ 324 (324) T protein:vir:96 286 -QD--MVALRATMHVALHIADDKAFAKLVPADKRTDS----VPGEV 324 (324) T ss_pred -cC--cEEEEEEEEEccEEecccceEEEecccccCCC----CCCCC Confidence 00 01111111000000001111111111100000 11100 No 18 >protein:vir:78830 Length: 324 # NCBI annotation: major head protein # Family: family:all:507 # MgeID: mge:1858 # MgeName: 80alpha # Cross-refs: genbank:acc:YP_001285361;genbank:gi:148717889;genbank:GeneID:5246961 Probab=97.56 E-value=2.5e-05 Score=45.85 Aligned_cols=317 Identities=11% Similarity=0.039 Sum_probs=152.9 Q ss_pred CcccccccceeeeecCchhHHHHHHHHH--HHhhcCcccCcccccCccccchhhhHHHHHHHhhccccccchhhhccchh Q lcl|NC_018863. 1 MTELQKEQKVEARKLPAGAEAELAELVS--KSFTTGTGITPDTQHDAAALRRELLDDQVKMLAFTNGDFTIYPLINKQQV 78 (479) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~e~~~--Ksf~ag~~~~~~~~~~gaAlr~esld~~i~~l~~~~~~f~~~~~i~k~~~ 78 (479) |-+.+|.+ .++.+++.... +.+++. +..+..+|+.|..+.+...|........ .++..+.+.++ T Consensus 1 ~~~~~~~~---------~~~~~~~~~~~~~~~~~a~---~~~~~~~~~~~iP~~~~~~ii~~~~~~s--~l~~l~~~~~~ 66 (324) T protein:vir:78 1 MEQTQKLK---------LNLQHFASNNVKPQVFNPD---NVMMHEKKDGTLMNEFTTPILQEVMENS--KIMQLGKYEPM 66 (324) T ss_pred CCcchhhh---------HHHHHHHHHhhhhhhhccc---cccccCcCccccchhHHHHHHHHHHhhc--hhhhhcceeec Confidence 54443222 11222222221 223332 2223345677888888887755554333 34555555555 Q ss_pred HHHHHHhhhhhccCcccccccccccccccccCcceEEEEEEEEeeeehhhhhhhHhhhcchhhHHHHHHHHHHHHHHHHH Q lcl|NC_018863. 79 NSTVAKYAVFNQHGRTGHSRFVREVGVASINDPNIRQKTVQMKFLSDTKQQSLAAGLVNNIADPMTILTEDAISVIAKSI 158 (479) Q Consensus 79 ~stv~~y~~~~~~G~~g~~~fv~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lv~~~~dp~~~~~~~ai~~~~~~~ 158 (479) .+.-.+|.++.. .+...+++|++..+..++.+.+.....+=++---.+|.-+- .++..|.+....+.--..++..+ T Consensus 67 ~~~~~~~p~~~~---~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~~~is~ell-~ds~~~l~~~i~~~la~ai~~~~ 142 (324) T protein:vir:78 67 EGTEKKFTFWAD---KPGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFL-NYTYSQFFEEMKPMIAEAFYKKF 142 (324) T ss_pred cCCceEEEEEec---CcceeEecCCccccccccceeEEEEeeEEEEEeehhhHHHH-hcchHHHHHHHHHHHHHHHHHHH Confidence 544344555442 34567999999999999999999999999998888887432 24456788888888889999999 Q ss_pred HHHHhhcccccCCCCCCcccchhhhHHHhhccCCcEEEccCCCCCHHHhhhhhheeecccCceeeeecChHHHhhHHHhh Q lcl|NC_018863. 159 EWAIFYGDAALAAEADNQAGIEFDGLTKLIDEATNVIDLKGERLDEATLNKAAVIVGKGYGRATDAFMPIGVQADFTNNL 238 (479) Q Consensus 159 e~a~f~Gd~~l~~~~~~~~gleFDGl~~~I~~~~NviDarG~~l~~~~l~~aa~~i~~~fG~atd~~mp~~vka~f~q~~ 238 (479) |.++|+|+-.= + +..|+.+.+.. .+... ...++.+.|.++.-.+..++..+.-..|++.+...+.+.- T Consensus 143 d~a~l~G~g~~---~------~~~gi~~~~~~-~~~~~--~~~~t~~~i~~~~~~l~~~~~~~~~~vmn~~~~~~L~~l~ 210 (324) T protein:vir:78 143 DEAGILNQGNN---P------FGKSIAQSIEK-TNKVI--KGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIV 210 (324) T ss_pred HHHHhccCCCC---C------cCccccccccc-cceec--cccccHHHHHHHHHhhhhccCCCCEEEEcHHHHHHHHHhh Confidence 99999997542 1 22466666542 33332 2345677777777677788888888999999999988776 Q ss_pred cCceeEEeecCCCccccCccccceecCceeEEecCCcccCCCccccCcccCCCCCcccceEEE--eecccccCc---ccc Q lcl|NC_018863. 239 LDRQRVIQPSQAGGFSTGFSINQFLSTRGAINLHGSTIMENDNILVDRIPEPNAPQAPASVVA--TVKVNDKGA---FRP 313 (479) Q Consensus 239 ~~~qrv~~~~n~g~~~~G~~V~~~~ss~g~I~L~~s~v~~a~~~lver~~s~~aP~~P~~vta--~~~~~~~g~---~~~ 313 (479) ...-|.+.++..+..-.|.+|....+. .+. .+..+..+....+...-. ......+.-+ +......+. .|. T Consensus 211 d~~G~~~~~~~~~~~l~G~PV~~~~~~--~~~-~~~~~~gd~~~~~~g~~~--~~~i~~~~~~~~~~~~~~~~~~~~~f~ 285 (324) T protein:vir:78 211 DPETKERIYDRNSDSLDGLPVVNLKSS--NLK-RGELITGDFDKLIYGIPQ--LIEYKIDETAQLSTVKNEDGTPVNLFE 285 (324) T ss_pred ccCCCeeecCCCCCcccceeeEeeCCC--CCC-cceEEEEecceEEEEEec--CcEEEEeecccccccccccccchhhhh Confidence 554455544433333344443211000 000 001111111111000000 0000000000 000000000 000 Q ss_pred cccceeeEEEEEEEcCCCCcccccceeeeeecCCCeEEEEEeecCC Q lcl|NC_018863. 314 VKDIKTHSYKVVVHSDDAESLASEAVTAVVANPTDSVSLAVKLQSL 359 (479) Q Consensus 314 ~sd~g~Y~YkV~a~n~~GES~~S~~VtaT~a~~~~~V~LtIt~~~~ 359 (479) .+ --.|++...=+.+--.+...+..+.+.....+ ||... T Consensus 286 -~d--~~~~r~~~r~d~~v~~~~A~~~l~~a~~~~~~----~~~~~ 324 (324) T protein:vir:78 286 -QD--MVALRATMHVALHIADDKAFAKLVPADKRTDS----VPGEV 324 (324) T ss_pred -cC--cEEEEEEEEEccEEecccceEEEecccccCCC----CCCCC Confidence 00 01111111000000001111111111100000 11100 No 19 >protein:vir:78523 Length: 338 # NCBI annotation: Putative head structural protein # Family: family:all:507 # MgeID: mge:1853 # MgeName: U2 # Cross-refs: genbank:acc:YP_001491585;genbank:gi:157786408;genbank:GeneID:5625675 Probab=97.49 E-value=3.3e-05 Score=45.18 Aligned_cols=322 Identities=13% Similarity=0.038 Sum_probs=150.7 Q ss_pred HHHHHHHHHHHhhcCcccCcccccCccccchhhhHHHHHHHhhccccccchhhhccchhHHHHHHhhhhhc-----cCcc Q lcl|NC_018863. 20 EAELAELVSKSFTTGTGITPDTQHDAAALRRELLDDQVKMLAFTNGDFTIYPLINKQQVNSTVAKYAVFNQ-----HGRT 94 (479) Q Consensus 20 ~~~~~e~~~Ksf~ag~~~~~~~~~~gaAlr~esld~~i~~l~~~~~~f~~~~~i~k~~~~stv~~y~~~~~-----~G~~ 94 (479) -+.+-| +++.++|........+.+++|-.+.+..+|..+..... .+.+...+.+..+--.+|.++.. |.+. T Consensus 1 ~~~~~e--~~~~~~~~~~~~~~~~~~~~liP~~~~~~ii~~~~~~s--~l~~l~~~~~~~~~~~~ip~~~~~~~a~~v~~ 76 (338) T protein:vir:78 1 MATLNE--LAPNTAGSNHQGRLAHVPSDLLPKEIVGPIFDKAQESS--LVLRLGENIPISYGETIIPTTVKRPEVGQVGV 76 (338) T ss_pred CcchHH--hhhhhcccccccceecccccccchHHHHHHHHHHHhhc--hhhhhcceeeccCCceEEEEEecCccceeecc Confidence 333322 46666665554445566778998888888866665444 34555555555554444444332 2334 Q ss_pred cccccccccccccccCcceEEEEEEEEeeeehhhhhhhHhhhcchhhHHHHHHHHHHHHHHHHHHHHHhhcccccCCCCC Q lcl|NC_018863. 95 GHSRFVREVGVASINDPNIRQKTVQMKFLSDTKQQSLAAGLVNNIADPMTILTEDAISVIAKSIEWAIFYGDAALAAEAD 174 (479) Q Consensus 95 g~~~fv~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lv~~~~dp~~~~~~~ai~~~~~~~e~a~f~Gd~~l~~~~~ 174 (479) +...+++|++.....++++.......+=++--..+|.-+ +.++..|.+....+.-...+.+.+|.++++|+..-.+ T Consensus 77 ~~~~~~~Eg~~~~~~~~~f~~v~l~~~k~~~~~~is~el-l~ds~~~~~~~i~~~la~a~~~~~d~~~l~G~g~~~~--- 152 (338) T protein:vir:78 77 GTSNEQREGGTKPLSGTAWDTRSVAPIKLATIVTVSEEF-ARMNPSGLYTKLQADLAYAIGRGIDLAVFHGKSPLTG--- 152 (338) T ss_pred cccccccccccccccccceeEEEEEEEEEEEeehhhHHH-HhcCHHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcc--- Confidence 557789999999999999999999998888777777632 2345678888889999999999999999999987443 Q ss_pred CcccchhhhHHHhhcc-CCcEEEccC--CCCCHHHhhhhh-heeecccCceeeeecChHHHhhHHHhh---cCceeEEee Q lcl|NC_018863. 175 NQAGIEFDGLTKLIDE-ATNVIDLKG--ERLDEATLNKAA-VIVGKGYGRATDAFMPIGVQADFTNNL---LDRQRVIQP 247 (479) Q Consensus 175 ~~~gleFDGl~~~I~~-~~NviDarG--~~l~~~~l~~aa-~~i~~~fG~atd~~mp~~vka~f~q~~---~~~qrv~~~ 247 (479) .++.|+.+.... .....|.-+ .....+.|..+. .+.......++-.+|++.+.+.|.... ....|.+.+ T Consensus 153 ----~~~~gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~m~~~~~~~L~~~~~l~d~~g~~l~~ 228 (338) T protein:vir:78 153 ----SALQGIDTNNVIVNTTNVDYLQTGTTPLLDRFLDGYDLVSANTDVDFNGWAADPRYRARLLRSQAYRDANGNVDPT 228 (338) T ss_pred ----ccccccccccccccccccccccccchhhHHHHHHHHHHhhhhccccceEEEEchHHHHHHHHHhhhccCCCceeec Confidence 456777765432 112233222 222344555443 444556778888999999998886643 123344433 Q ss_pred cCCCccccCccccceecCceeEEecCCcccCCCccccCcccCCC-CCcccceEEEeecccccCcccccccceeeEEEEEE Q lcl|NC_018863. 248 SQAGGFSTGFSINQFLSTRGAINLHGSTIMENDNILVDRIPEPN-APQAPASVVATVKVNDKGAFRPVKDIKTHSYKVVV 326 (479) Q Consensus 248 ~n~g~~~~G~~V~~~~ss~g~I~L~~s~v~~a~~~lver~~s~~-aP~~P~~vta~~~~~~~g~~~~~sd~g~Y~YkV~a 326 (479) ....+.. +.++++.+.+..+..+... ++..+. ..+.- ..+..++. .+.+.+.+++.- T Consensus 229 ~~~~~~~------------------~~~l~G~PV~~~~~ip~~~~~~~~~~-~~~~~--gdfs~~~~-~~~~~~~i~~~~ 286 (338) T protein:vir:78 229 RINLAAS------------------AGDLLGLPVQFGKAVGGDLGAATDSK-VRVVG--GDFSQLKY-GFADEIRVKMSD 286 (338) T ss_pred ccccCCC------------------CceeeeeeEEEccccCccccccCCcc-cEEEE--EecceEEE-EeecccEEEEee Confidence 2211111 0111111222111111100 000000 00000 00000000 000001111100 Q ss_pred --EcCCCCcccccceeeeeecCCCeEEEEEeec-CCccccceEEEEEeccCCCCc Q lcl|NC_018863. 327 --HSDDAESLASEAVTAVVANPTDSVSLAVKLQ-SLYQAKPQFISVYRQGNETGH 378 (479) Q Consensus 327 --~n~~GES~~S~~VtaT~a~~~~~V~LtIt~~-~~~~~~~~y~~IYR~t~~~g~ 378 (479) .-..+.......+. ....+.+.+.+..- ...-..|+-+..--....+.- T Consensus 287 ~~~~~~~~~~~~~~~~---~~~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~~~~~ 338 (338) T protein:vir:78 287 TATLTDNTSPTPQTVS---MWQTNQIAILIEVTFGWLLGDKQAFVKFVDDEDPDA 338 (338) T ss_pred cccccccccccccchh---hhhcCcEEEEEEEEeccEeecccceEEEecccCCCC Confidence 00000000000000 00001111110000 000000111111111111110 No 20 >protein:vir:10145 Length: 567 # NCBI annotation: hypothetical protein # Family: family:all:1544 # MgeID: mge:180 # MgeName: Stx2 converting bacteriophage II # Cross-refs: genbank:acc:NP_859275;genbank:gi:32171031;genbank:GeneID:2653447 Probab=97.47 E-value=1.4e-05 Score=47.22 Aligned_cols=280 Identities=19% Similarity=0.204 Sum_probs=118.3 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHhhcccccCCCCCCcccchhh-hHHHhhccC-CcEEEccCCCC--CHHHhhh----hh Q lcl|NC_018863. 140 ADPMTILTEDAISVIAKSIEWAIFYGDAALAAEADNQAGIEFD-GLTKLIDEA-TNVIDLKGERL--DEATLNK----AA 211 (479) Q Consensus 140 ~dp~~~~~~~ai~~~~~~~e~a~f~Gd~~l~~~~~~~~gleFD-Gl~~~I~~~-~NviDarG~~l--~~~~l~~----aa 211 (479) +-|.+| ..|+-++| |-|. --+|-|..+ =.+...+|+-+ -+.+|-+ .| T Consensus 1 ~~~~~~------------------~~~~~~~~-------~~~~~~~~~~~~M~~i~i~~f~Ge~Prl~p~lLP~~~a~~A 55 (567) T protein:vir:10 1 MMPIAI------------------LANSIINP-------LIFKPEAVKGISMPYIDITTMRGMMPRVVTSMLPEHSAVLA 55 (567) T ss_pred Ccchhh------------------hhhhhccc-------eeecccccccceeeEEeecccccccccchhhhccccccceE Confidence 222222 23333443 1111 012223221 13566777554 4555553 33 Q ss_pred heeecccCceeeeecChHHHhhHHHhhcCceeEEeecCCCccccCccccceecC-----ceeEEecCCcc--cCCCcc-- Q lcl|NC_018863. 212 VIVGKGYGRATDAFMPIGVQADFTNNLLDRQRVIQPSQAGGFSTGFSINQFLST-----RGAINLHGSTI--MENDNI-- 282 (479) Q Consensus 212 ~~i~~~fG~atd~~mp~~vka~f~q~~~~~qrv~~~~n~g~~~~G~~V~~~~ss-----~g~I~L~~s~v--~~a~~~-- 282 (479) ....-.+|..+=...|..+..-|+- +.+..+.-.+.-=.+-.-+|..+.+. +.-+=++|+.. +....+ T Consensus 56 ~n~~~~~G~itP~~~~~~~~~~~~~---~~~Tif~y~~~~W~~w~~~V~~ir~PvAqD~~~rvY~tgdg~Pk~t~~~iat 132 (567) T protein:vir:10 56 EDCHFRFGVITPERQISGVEKTFTI---KPKTIFHYRDDFWFAWPDVVDVIRSPIAQDPHGRIYYTDGRFPKVTDATIAT 132 (567) T ss_pred EeeeccCCeeeeeeccccccccccc---CceeeEEEcCcEEEEeCCceeeccCccccCCcceEEEecCCcceeeeeeeee Confidence 5666668888877776555333311 11111111111000001111111110 11111112111 000000 Q ss_pred -------ccCcccCCCCCcccceEEEeecccccCcccccccceeeEEEEEEEcC-CCCcccccceeee-eecCCCeEEEE Q lcl|NC_018863. 283 -------LVDRIPEPNAPQAPASVVATVKVNDKGAFRPVKDIKTHSYKVVVHSD-DAESLASEAVTAV-VANPTDSVSLA 353 (479) Q Consensus 283 -------lver~~s~~aP~~P~~vta~~~~~~~g~~~~~sd~g~Y~YkV~a~n~-~GES~~S~~VtaT-~a~~~~~V~Lt 353 (479) -........+|-.+.++ +++.+.....- ...|..++.|+++-++. +.||+||.+-... +...++.|.|+ T Consensus 133 ~G~~~~P~~~y~LgVpaps~aP~~-a~~~~~~~~~~-~~~d~etr~Yv~TfVt~~GeES~PS~~S~~~~v~~pg~~V~ls 210 (567) T protein:vir:10 133 KGDGNHPTSSYRLGIPAPTTAPVC-TVQQGGDVSDD-NPNDDETRFYTETFVSDYGEEGPPGPASLEVTLRTPGTAVQLT 210 (567) T ss_pred cCCCCCCcchhhcccCCcccccee-eecCCCCCCCC-CCcccceeEEEEEEEcCCCCcCCCcccccceeeecCCceEEEe Confidence 00001111222222222 22222221111 23566789999997765 5578888775454 34577889999 Q ss_pred EeecCCccccceEEEEEeccCCC--CcEEEEEEeeeeeccCCCeeEEEeccCCCCCccceeeccccHHHHHHHHhccccc Q lcl|NC_018863. 354 VKLQSLYQAKPQFISVYRQGNET--GHYFLVARVPLSKADENGVITFVDRNQVIPETTDVFIGELTPQVISLLELLPMMK 431 (479) Q Consensus 354 It~~~~~~~~~~y~~IYR~t~~~--g~~~~i~rV~~s~~n~~~tttf~D~N~~iPgT~~~fvge~~~q~i~l~ellPm~k 431 (479) ..+++..+..-.-+.|||+..++ ..|++++.++. ++++|+|--.. + .|-+.||=.- T Consensus 211 ~~p~~~~~~~i~~~RIYRS~tg~~gtdy~lVael~a------s~~sf~D~~~~---~-------------~lg~~Lps~~ 268 (567) T protein:vir:10 211 LAPVPLQNASIKRRRIYRSASGGGEADFLLVAELDA------SVLSYTDKIPA---K-------------NLGPSLATWD 268 (567) T ss_pred eccCCccccccceEEEEEecCCCCceeeEEEEeecc------ceeeeeeccch---h-------------hccccccccc Confidence 88888877777899999988654 48999999984 57899997422 1 1122222111 Q ss_pred --cCccccCchhHHHHHh-hhhhhee-----------ccceeEEEEeccccCcccccc--------------ccCC Q lcl|NC_018863. 432 --LPLAQMNATTTFTVLW-YGALALY-----------APKKWVRIKNVQYIPALAADV--------------TYRP 479 (479) Q Consensus 432 --~Pla~~~~~~~~~V~~-yg~L~l~-----------aPkk~~~ikNV~~~~~~~~~~--------------~~~~ 479 (479) .| +...++.+++ =|-+|=+ -|.-| ..+|.-..-.|+ +=.| T Consensus 269 w~~P----P~~m~GL~~m~NGimAgF~GneV~FsEpylPyAW----P~~Yr~t~~~dIVaiA~~gt~LVV~TkG~P 336 (567) T protein:vir:10 269 YLPP----PENMTGLCLMANGIAAGFAGNEVMFSEAYLPYAW----PEVNRHTTAEDIVAICPLGTSLVVATKGEP 336 (567) T ss_pred ccCc----CcccceeeecccceEEeecCCEEEEecCCCCccc----chhhccCCCCCeEEEeecccEEEEEEcCce Confidence 11 1111222211 1111111 22222 112211111111 0111 No 21 >protein:vir:3306 Length: 567 # NCBI annotation: hypothetical protein # Family: family:all:1544 # MgeID: mge:66 # MgeName: 933W # Cross-refs: genbank:acc:NP_049522;genbank:gi:9632528;genbank:GeneID:1262016 Probab=97.47 E-value=1.4e-05 Score=47.22 Aligned_cols=280 Identities=19% Similarity=0.204 Sum_probs=118.3 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHhhcccccCCCCCCcccchhh-hHHHhhccC-CcEEEccCCCC--CHHHhhh----hh Q lcl|NC_018863. 140 ADPMTILTEDAISVIAKSIEWAIFYGDAALAAEADNQAGIEFD-GLTKLIDEA-TNVIDLKGERL--DEATLNK----AA 211 (479) Q Consensus 140 ~dp~~~~~~~ai~~~~~~~e~a~f~Gd~~l~~~~~~~~gleFD-Gl~~~I~~~-~NviDarG~~l--~~~~l~~----aa 211 (479) +-|.+| ..|+-++| |-|. --+|-|..+ =.+...+|+-+ -+.+|-+ .| T Consensus 1 ~~~~~~------------------~~~~~~~~-------~~~~~~~~~~~~M~~i~i~~f~Ge~Prl~p~lLP~~~a~~A 55 (567) T protein:vir:33 1 MMPIAI------------------LANSIINP-------LIFKPEAVKGISMPYIDITTMRGMMPRVVTSMLPEHSAVLA 55 (567) T ss_pred Ccchhh------------------hhhhhccc-------eeecccccccceeeEEeecccccccccchhhhccccccceE Confidence 222222 23333443 1111 012223221 13566777554 4555553 33 Q ss_pred heeecccCceeeeecChHHHhhHHHhhcCceeEEeecCCCccccCccccceecC-----ceeEEecCCcc--cCCCcc-- Q lcl|NC_018863. 212 VIVGKGYGRATDAFMPIGVQADFTNNLLDRQRVIQPSQAGGFSTGFSINQFLST-----RGAINLHGSTI--MENDNI-- 282 (479) Q Consensus 212 ~~i~~~fG~atd~~mp~~vka~f~q~~~~~qrv~~~~n~g~~~~G~~V~~~~ss-----~g~I~L~~s~v--~~a~~~-- 282 (479) ....-.+|..+=...|..+..-|+- +.+..+.-.+.-=.+-.-+|..+.+. +.-+=++|+.. +....+ T Consensus 56 ~n~~~~~G~itP~~~~~~~~~~~~~---~~~Tif~y~~~~W~~w~~~V~~ir~PvAqD~~~rvY~tgdg~Pk~t~~~iat 132 (567) T protein:vir:33 56 EDCHFRFGVITPERQISGVEKTFTI---KPKTIFHYRDDFWFAWPDVVDVIRSPIAQDPHGRIYYTDGRFPKVTDATIAT 132 (567) T ss_pred EeeeccCCeeeeeeccccccccccc---CceeeEEEcCcEEEEeCCceeeccCccccCCcceEEEecCCcceeeeeeeee Confidence 5666668888877776555333311 11111111111000001111111110 11111112111 000000 Q ss_pred -------ccCcccCCCCCcccceEEEeecccccCcccccccceeeEEEEEEEcC-CCCcccccceeee-eecCCCeEEEE Q lcl|NC_018863. 283 -------LVDRIPEPNAPQAPASVVATVKVNDKGAFRPVKDIKTHSYKVVVHSD-DAESLASEAVTAV-VANPTDSVSLA 353 (479) Q Consensus 283 -------lver~~s~~aP~~P~~vta~~~~~~~g~~~~~sd~g~Y~YkV~a~n~-~GES~~S~~VtaT-~a~~~~~V~Lt 353 (479) -........+|-.+.++ +++.+.....- ...|..++.|+++-++. +.||+||.+-... +...++.|.|+ T Consensus 133 ~G~~~~P~~~y~LgVpaps~aP~~-a~~~~~~~~~~-~~~d~etr~Yv~TfVt~~GeES~PS~~S~~~~v~~pg~~V~ls 210 (567) T protein:vir:33 133 KGDGNHPTSSYRLGIPAPTTAPVC-TVQQGGDVSDD-NPNDDETRFYTETFVSDYGEEGPPGPASLEVTLRTPGTAVQLT 210 (567) T ss_pred cCCCCCCcchhhcccCCcccccee-eecCCCCCCCC-CCcccceeEEEEEEEcCCCCcCCCcccccceeeecCCceEEEe Confidence 00001111222222222 22222221111 23566789999997765 5578888775454 34577889999 Q ss_pred EeecCCccccceEEEEEeccCCC--CcEEEEEEeeeeeccCCCeeEEEeccCCCCCccceeeccccHHHHHHHHhccccc Q lcl|NC_018863. 354 VKLQSLYQAKPQFISVYRQGNET--GHYFLVARVPLSKADENGVITFVDRNQVIPETTDVFIGELTPQVISLLELLPMMK 431 (479) Q Consensus 354 It~~~~~~~~~~y~~IYR~t~~~--g~~~~i~rV~~s~~n~~~tttf~D~N~~iPgT~~~fvge~~~q~i~l~ellPm~k 431 (479) ..+++..+..-.-+.|||+..++ ..|++++.++. ++++|+|--.. + .|-+.||=.- T Consensus 211 ~~p~~~~~~~i~~~RIYRS~tg~~gtdy~lVael~a------s~~sf~D~~~~---~-------------~lg~~Lps~~ 268 (567) T protein:vir:33 211 LAPVPLQNASIKRRRIYRSASGGGEADFLLVAELDA------SVLSYTDKIPA---K-------------NLGPSLATWD 268 (567) T ss_pred eccCCccccccceEEEEEecCCCCceeeEEEEeecc------ceeeeeeccch---h-------------hccccccccc Confidence 88888877777899999988654 48999999984 57899997422 1 1122222111 Q ss_pred --cCccccCchhHHHHHh-hhhhhee-----------ccceeEEEEeccccCcccccc--------------ccCC Q lcl|NC_018863. 432 --LPLAQMNATTTFTVLW-YGALALY-----------APKKWVRIKNVQYIPALAADV--------------TYRP 479 (479) Q Consensus 432 --~Pla~~~~~~~~~V~~-yg~L~l~-----------aPkk~~~ikNV~~~~~~~~~~--------------~~~~ 479 (479) .| +...++.+++ =|-+|=+ -|.-| ..+|.-..-.|+ +=.| T Consensus 269 w~~P----P~~m~GL~~m~NGimAgF~GneV~FsEpylPyAW----P~~Yr~t~~~dIVaiA~~gt~LVV~TkG~P 336 (567) T protein:vir:33 269 YLPP----PENMTGLCLMANGIAAGFAGNEVMFSEAYLPYAW----PEVNRHTTAEDIVAICPLGTSLVVATKGEP 336 (567) T ss_pred ccCc----CcccceeeecccceEEeecCCEEEEecCCCCccc----chhhccCCCCCeEEEeecccEEEEEEcCce Confidence 11 1111222211 1111111 22222 112211111111 0111 No 22 >protein:vir:2792 Length: 567 # NCBI annotation: hypothetical protein # Family: family:all:1544 # MgeID: mge:59 # MgeName: Stx2 converting bacteriophage I # Cross-refs: genbank:acc:NP_612909;genbank:gi:20065826;genbank:GeneID:935648 Probab=97.47 E-value=1.4e-05 Score=47.22 Aligned_cols=280 Identities=19% Similarity=0.204 Sum_probs=118.3 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHhhcccccCCCCCCcccchhh-hHHHhhccC-CcEEEccCCCC--CHHHhhh----hh Q lcl|NC_018863. 140 ADPMTILTEDAISVIAKSIEWAIFYGDAALAAEADNQAGIEFD-GLTKLIDEA-TNVIDLKGERL--DEATLNK----AA 211 (479) Q Consensus 140 ~dp~~~~~~~ai~~~~~~~e~a~f~Gd~~l~~~~~~~~gleFD-Gl~~~I~~~-~NviDarG~~l--~~~~l~~----aa 211 (479) +-|.+| ..|+-++| |-|. --+|-|..+ =.+...+|+-+ -+.+|-+ .| T Consensus 1 ~~~~~~------------------~~~~~~~~-------~~~~~~~~~~~~M~~i~i~~f~Ge~Prl~p~lLP~~~a~~A 55 (567) T protein:vir:27 1 MMPIAI------------------LANSIINP-------LIFKPEAVKGISMPYIDITTMRGMMPRVVTSMLPEHSAVLA 55 (567) T ss_pred Ccchhh------------------hhhhhccc-------eeecccccccceeeEEeecccccccccchhhhccccccceE Confidence 222222 23333443 1111 012223221 13566777554 4555553 33 Q ss_pred heeecccCceeeeecChHHHhhHHHhhcCceeEEeecCCCccccCccccceecC-----ceeEEecCCcc--cCCCcc-- Q lcl|NC_018863. 212 VIVGKGYGRATDAFMPIGVQADFTNNLLDRQRVIQPSQAGGFSTGFSINQFLST-----RGAINLHGSTI--MENDNI-- 282 (479) Q Consensus 212 ~~i~~~fG~atd~~mp~~vka~f~q~~~~~qrv~~~~n~g~~~~G~~V~~~~ss-----~g~I~L~~s~v--~~a~~~-- 282 (479) ....-.+|..+=...|..+..-|+- +.+..+.-.+.-=.+-.-+|..+.+. +.-+=++|+.. +....+ T Consensus 56 ~n~~~~~G~itP~~~~~~~~~~~~~---~~~Tif~y~~~~W~~w~~~V~~ir~PvAqD~~~rvY~tgdg~Pk~t~~~iat 132 (567) T protein:vir:27 56 EDCHFRFGVITPERQISGVEKTFTI---KPKTIFHYRDDFWFAWPDVVDVIRSPIAQDPHGRIYYTDGRFPKVTDATIAT 132 (567) T ss_pred EeeeccCCeeeeeeccccccccccc---CceeeEEEcCcEEEEeCCceeeccCccccCCcceEEEecCCcceeeeeeeee Confidence 5666668888877776555333311 11111111111000001111111110 11111112111 000000 Q ss_pred -------ccCcccCCCCCcccceEEEeecccccCcccccccceeeEEEEEEEcC-CCCcccccceeee-eecCCCeEEEE Q lcl|NC_018863. 283 -------LVDRIPEPNAPQAPASVVATVKVNDKGAFRPVKDIKTHSYKVVVHSD-DAESLASEAVTAV-VANPTDSVSLA 353 (479) Q Consensus 283 -------lver~~s~~aP~~P~~vta~~~~~~~g~~~~~sd~g~Y~YkV~a~n~-~GES~~S~~VtaT-~a~~~~~V~Lt 353 (479) -........+|-.+.++ +++.+.....- ...|..++.|+++-++. +.||+||.+-... +...++.|.|+ T Consensus 133 ~G~~~~P~~~y~LgVpaps~aP~~-a~~~~~~~~~~-~~~d~etr~Yv~TfVt~~GeES~PS~~S~~~~v~~pg~~V~ls 210 (567) T protein:vir:27 133 KGDGNHPTSSYRLGIPAPTTAPVC-TVQQGGDVSDD-NPNDDETRFYTETFVSDYGEEGPPGPASLEVTLRTPGTAVQLT 210 (567) T ss_pred cCCCCCCcchhhcccCCcccccee-eecCCCCCCCC-CCcccceeEEEEEEEcCCCCcCCCcccccceeeecCCceEEEe Confidence 00001111222222222 22222221111 23566789999997765 5578888775454 34577889999 Q ss_pred EeecCCccccceEEEEEeccCCC--CcEEEEEEeeeeeccCCCeeEEEeccCCCCCccceeeccccHHHHHHHHhccccc Q lcl|NC_018863. 354 VKLQSLYQAKPQFISVYRQGNET--GHYFLVARVPLSKADENGVITFVDRNQVIPETTDVFIGELTPQVISLLELLPMMK 431 (479) Q Consensus 354 It~~~~~~~~~~y~~IYR~t~~~--g~~~~i~rV~~s~~n~~~tttf~D~N~~iPgT~~~fvge~~~q~i~l~ellPm~k 431 (479) ..+++..+..-.-+.|||+..++ ..|++++.++. ++++|+|--.. + .|-+.||=.- T Consensus 211 ~~p~~~~~~~i~~~RIYRS~tg~~gtdy~lVael~a------s~~sf~D~~~~---~-------------~lg~~Lps~~ 268 (567) T protein:vir:27 211 LAPVPLQNASIKRRRIYRSASGGGEADFLLVAELDA------SVLSYTDKIPA---K-------------NLGPSLATWD 268 (567) T ss_pred eccCCccccccceEEEEEecCCCCceeeEEEEeecc------ceeeeeeccch---h-------------hccccccccc Confidence 88888877777899999988654 48999999984 57899997422 1 1122222111 Q ss_pred --cCccccCchhHHHHHh-hhhhhee-----------ccceeEEEEeccccCcccccc--------------ccCC Q lcl|NC_018863. 432 --LPLAQMNATTTFTVLW-YGALALY-----------APKKWVRIKNVQYIPALAADV--------------TYRP 479 (479) Q Consensus 432 --~Pla~~~~~~~~~V~~-yg~L~l~-----------aPkk~~~ikNV~~~~~~~~~~--------------~~~~ 479 (479) .| +...++.+++ =|-+|=+ -|.-| ..+|.-..-.|+ +=.| T Consensus 269 w~~P----P~~m~GL~~m~NGimAgF~GneV~FsEpylPyAW----P~~Yr~t~~~dIVaiA~~gt~LVV~TkG~P 336 (567) T protein:vir:27 269 YLPP----PENMTGLCLMANGIAAGFAGNEVMFSEAYLPYAW----PEVNRHTTAEDIVAICPLGTSLVVATKGEP 336 (567) T ss_pred ccCc----CcccceeeecccceEEeecCCEEEEecCCCCccc----chhhccCCCCCeEEEeecccEEEEEEcCce Confidence 11 1111222211 1111111 22222 112211111111 0111 No 23 >protein:vir:9979 Length: 567 # NCBI annotation: hypothetical protein # Family: family:all:1544 # MgeID: mge:179 # MgeName: Stx1 converting bacteriophage # Cross-refs: genbank:acc:NP_859109;genbank:gi:32170864;genbank:GeneID:2653256 Probab=97.47 E-value=1.4e-05 Score=47.22 Aligned_cols=280 Identities=19% Similarity=0.204 Sum_probs=118.3 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHhhcccccCCCCCCcccchhh-hHHHhhccC-CcEEEccCCCC--CHHHhhh----hh Q lcl|NC_018863. 140 ADPMTILTEDAISVIAKSIEWAIFYGDAALAAEADNQAGIEFD-GLTKLIDEA-TNVIDLKGERL--DEATLNK----AA 211 (479) Q Consensus 140 ~dp~~~~~~~ai~~~~~~~e~a~f~Gd~~l~~~~~~~~gleFD-Gl~~~I~~~-~NviDarG~~l--~~~~l~~----aa 211 (479) +-|.+| ..|+-++| |-|. --+|-|..+ =.+...+|+-+ -+.+|-+ .| T Consensus 1 ~~~~~~------------------~~~~~~~~-------~~~~~~~~~~~~M~~i~i~~f~Ge~Prl~p~lLP~~~a~~A 55 (567) T protein:vir:99 1 MMPIAI------------------LANSIINP-------LIFKPEAVKGISMPYIDITTMRGMMPRVVTSMLPEHSAVLA 55 (567) T ss_pred Ccchhh------------------hhhhhccc-------eeecccccccceeeEEeecccccccccchhhhccccccceE Confidence 222222 23333443 1111 012223221 13566777554 4555553 33 Q ss_pred heeecccCceeeeecChHHHhhHHHhhcCceeEEeecCCCccccCccccceecC-----ceeEEecCCcc--cCCCcc-- Q lcl|NC_018863. 212 VIVGKGYGRATDAFMPIGVQADFTNNLLDRQRVIQPSQAGGFSTGFSINQFLST-----RGAINLHGSTI--MENDNI-- 282 (479) Q Consensus 212 ~~i~~~fG~atd~~mp~~vka~f~q~~~~~qrv~~~~n~g~~~~G~~V~~~~ss-----~g~I~L~~s~v--~~a~~~-- 282 (479) ....-.+|..+=...|..+..-|+- +.+..+.-.+.-=.+-.-+|..+.+. +.-+=++|+.. +....+ T Consensus 56 ~n~~~~~G~itP~~~~~~~~~~~~~---~~~Tif~y~~~~W~~w~~~V~~ir~PvAqD~~~rvY~tgdg~Pk~t~~~iat 132 (567) T protein:vir:99 56 EDCHFRFGVITPERQISGVEKTFTI---KPKTIFHYRDDFWFAWPDVVDVIRSPIAQDPHGRIYYTDGRFPKVTDATIAT 132 (567) T ss_pred EeeeccCCeeeeeeccccccccccc---CceeeEEEcCcEEEEeCCceeeccCccccCCcceEEEecCCcceeeeeeeee Confidence 5666668888877776555333311 11111111111000001111111110 11111112111 000000 Q ss_pred -------ccCcccCCCCCcccceEEEeecccccCcccccccceeeEEEEEEEcC-CCCcccccceeee-eecCCCeEEEE Q lcl|NC_018863. 283 -------LVDRIPEPNAPQAPASVVATVKVNDKGAFRPVKDIKTHSYKVVVHSD-DAESLASEAVTAV-VANPTDSVSLA 353 (479) Q Consensus 283 -------lver~~s~~aP~~P~~vta~~~~~~~g~~~~~sd~g~Y~YkV~a~n~-~GES~~S~~VtaT-~a~~~~~V~Lt 353 (479) -........+|-.+.++ +++.+.....- ...|..++.|+++-++. +.||+||.+-... +...++.|.|+ T Consensus 133 ~G~~~~P~~~y~LgVpaps~aP~~-a~~~~~~~~~~-~~~d~etr~Yv~TfVt~~GeES~PS~~S~~~~v~~pg~~V~ls 210 (567) T protein:vir:99 133 KGDGNHPTSSYRLGIPAPTTAPVC-TVQQGGDVSDD-NPNDDETRFYTETFVSDYGEEGPPGPASLEVTLRTPGTAVQLT 210 (567) T ss_pred cCCCCCCcchhhcccCCcccccee-eecCCCCCCCC-CCcccceeEEEEEEEcCCCCcCCCcccccceeeecCCceEEEe Confidence 00001111222222222 22222221111 23566789999997765 5578888775454 34577889999 Q ss_pred EeecCCccccceEEEEEeccCCC--CcEEEEEEeeeeeccCCCeeEEEeccCCCCCccceeeccccHHHHHHHHhccccc Q lcl|NC_018863. 354 VKLQSLYQAKPQFISVYRQGNET--GHYFLVARVPLSKADENGVITFVDRNQVIPETTDVFIGELTPQVISLLELLPMMK 431 (479) Q Consensus 354 It~~~~~~~~~~y~~IYR~t~~~--g~~~~i~rV~~s~~n~~~tttf~D~N~~iPgT~~~fvge~~~q~i~l~ellPm~k 431 (479) ..+++..+..-.-+.|||+..++ ..|++++.++. ++++|+|--.. + .|-+.||=.- T Consensus 211 ~~p~~~~~~~i~~~RIYRS~tg~~gtdy~lVael~a------s~~sf~D~~~~---~-------------~lg~~Lps~~ 268 (567) T protein:vir:99 211 LAPVPLQNASIKRRRIYRSASGGGEADFLLVAELDA------SVLSYTDKIPA---K-------------NLGPSLATWD 268 (567) T ss_pred eccCCccccccceEEEEEecCCCCceeeEEEEeecc------ceeeeeeccch---h-------------hccccccccc Confidence 88888877777899999988654 48999999984 57899997422 1 1122222111 Q ss_pred --cCccccCchhHHHHHh-hhhhhee-----------ccceeEEEEeccccCcccccc--------------ccCC Q lcl|NC_018863. 432 --LPLAQMNATTTFTVLW-YGALALY-----------APKKWVRIKNVQYIPALAADV--------------TYRP 479 (479) Q Consensus 432 --~Pla~~~~~~~~~V~~-yg~L~l~-----------aPkk~~~ikNV~~~~~~~~~~--------------~~~~ 479 (479) .| +...++.+++ =|-+|=+ -|.-| ..+|.-..-.|+ +=.| T Consensus 269 w~~P----P~~m~GL~~m~NGimAgF~GneV~FsEpylPyAW----P~~Yr~t~~~dIVaiA~~gt~LVV~TkG~P 336 (567) T protein:vir:99 269 YLPP----PENMTGLCLMANGIAAGFAGNEVMFSEAYLPYAW----PEVNRHTTAEDIVAICPLGTSLVVATKGEP 336 (567) T ss_pred ccCc----CcccceeeecccceEEeecCCEEEEecCCCCccc----chhhccCCCCCeEEEeecccEEEEEEcCce Confidence 11 1111222211 1111111 22222 112211111111 0111 No 24 >protein:vir:827 Length: 567 # NCBI annotation: hypothetical protein # Family: family:all:1544 # MgeID: mge:16 # MgeName: VT2-Sa # Cross-refs: genbank:acc:NP_050560;genbank:gi:9633457;genbank:GeneID:1262210 Probab=97.42 E-value=1.7e-05 Score=46.67 Aligned_cols=276 Identities=19% Similarity=0.199 Sum_probs=117.7 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHhhcccccCCCCCCcccchhh-hHHHhhccC-CcEEEccCCCC--CHHHhhh----hh Q lcl|NC_018863. 140 ADPMTILTEDAISVIAKSIEWAIFYGDAALAAEADNQAGIEFD-GLTKLIDEA-TNVIDLKGERL--DEATLNK----AA 211 (479) Q Consensus 140 ~dp~~~~~~~ai~~~~~~~e~a~f~Gd~~l~~~~~~~~gleFD-Gl~~~I~~~-~NviDarG~~l--~~~~l~~----aa 211 (479) +-|.+| ..|+-++| |-|. --+|-|..+ =.+...+|+-+ -+.+|-+ .| T Consensus 1 ~~~~~~------------------~~~~~~~~-------~~~~~~~~~~~~M~~i~i~~f~Ge~Prl~p~lLP~~~a~~A 55 (567) T protein:vir:82 1 MMPIAI------------------LANSIINP-------LIFKPEAVKGISMPYIDITTMRGMMPRVVTSMLPEHSAVLA 55 (567) T ss_pred Ccchhh------------------hhhhhccc-------eeecccccccceeeEEeecccccccccchhhhccccccceE Confidence 222222 23333443 1111 012223221 14566777554 4555553 33 Q ss_pred heeecccCceeeeecChHHHhhH----HHhhcCceeEEeecCCCccccCccccceecC-----ceeEEecCCcc--cCCC Q lcl|NC_018863. 212 VIVGKGYGRATDAFMPIGVQADF----TNNLLDRQRVIQPSQAGGFSTGFSINQFLST-----RGAINLHGSTI--MEND 280 (479) Q Consensus 212 ~~i~~~fG~atd~~mp~~vka~f----~q~~~~~qrv~~~~n~g~~~~G~~V~~~~ss-----~g~I~L~~s~v--~~a~ 280 (479) ....-.+|..+=...|..+..-| ...|+-+...-+ +-.-+|..+.+. +.-+=++|+.. +... T Consensus 56 ~n~~~~~G~itP~~~~~~~~~~~~~~~~Tif~y~~~~W~-------~w~~~V~~ir~PvAqD~~~rvY~tgdg~Pk~t~~ 128 (567) T protein:vir:82 56 EDCHFRFGVITPERQISGVEKTFTIKPKTIFHYRDDFWF-------AWPDVVDVIRSPIAQDPHGRIYYTDGRFPKVTDA 128 (567) T ss_pred EeeeecCCeeeeeecccccccccccCceeeeeecCcEeE-------EeCCceeeccCccccCCcccEEEecCCcceeeee Confidence 55666688888777765553333 222221111000 000111111110 11111112111 0000 Q ss_pred cc---------ccCcccCCCCCcccceEEEeecccccCcccccccceeeEEEEEEEcC-CCCcccccceeeee-ecCCCe Q lcl|NC_018863. 281 NI---------LVDRIPEPNAPQAPASVVATVKVNDKGAFRPVKDIKTHSYKVVVHSD-DAESLASEAVTAVV-ANPTDS 349 (479) Q Consensus 281 ~~---------lver~~s~~aP~~P~~vta~~~~~~~g~~~~~sd~g~Y~YkV~a~n~-~GES~~S~~VtaT~-a~~~~~ 349 (479) .+ -........+|-.+.++ +++.+.....- ...|..+++|+++-++. +.||+||.+-...+ ...++. T Consensus 129 ~iat~G~~~~P~~~y~LgVpaps~aP~~-a~~~~~~~~~~-~p~d~etr~Yv~TfVt~~GeES~PS~~S~~~~v~~pg~~ 206 (567) T protein:vir:82 129 TIATKGDGNHPTSSYRLGIPAPTTAPVC-TVQQGGDVSDD-NPNDDETRFYTETFVSDYGEEGPPGPASLEVTLRTPGTA 206 (567) T ss_pred eeeecCCCCCCcchhhcccCCcccccee-eecCCCCCCCC-CCccccceEEEEEEEcCCCCcCCCcccccceeeecCCce Confidence 00 00001111222222222 22222211111 23566788999997765 55788887764543 346778 Q ss_pred EEEEEeecCCccccceEEEEEeccCCC--CcEEEEEEeeeeeccCCCeeEEEeccCCCCCccceeeccccHHHHHHHHhc Q lcl|NC_018863. 350 VSLAVKLQSLYQAKPQFISVYRQGNET--GHYFLVARVPLSKADENGVITFVDRNQVIPETTDVFIGELTPQVISLLELL 427 (479) Q Consensus 350 V~LtIt~~~~~~~~~~y~~IYR~t~~~--g~~~~i~rV~~s~~n~~~tttf~D~N~~iPgT~~~fvge~~~q~i~l~ell 427 (479) |.|+..+++..+..-.-+.|||+..++ ..|++++.++. ++++|+|--.. + .|-+.| T Consensus 207 V~ls~~p~~~~~~~i~~~RIYRS~tg~~gtdy~lVael~a------s~~sf~D~~~~---~-------------~lg~~L 264 (567) T protein:vir:82 207 VQLTLAPVPLQNASIKRRRIYRSASGGGEADFLLVAELDA------SVLSYTDKIPA---K-------------NLGPSL 264 (567) T ss_pred EEEeeccCCccccccceEEEEEecCCCCceeeEEEEeecc------ceeeeeeccch---h-------------hccccc Confidence 999988888877777899999988654 48999999984 57899997422 1 112222 Q ss_pred cccc--cCccccCchhHHHHHh-hhhhhee-----------ccceeEEEEeccccCcccccc--------------ccCC Q lcl|NC_018863. 428 PMMK--LPLAQMNATTTFTVLW-YGALALY-----------APKKWVRIKNVQYIPALAADV--------------TYRP 479 (479) Q Consensus 428 Pm~k--~Pla~~~~~~~~~V~~-yg~L~l~-----------aPkk~~~ikNV~~~~~~~~~~--------------~~~~ 479 (479) |=.- .| +...++.+++ =|-+|=+ -|.-| ..+|.-..-.|+ +=.| T Consensus 265 ps~~w~~P----P~~m~GL~~m~NGimAgF~GneV~FsEpylPyAW----P~~Yr~t~~~dIVaiA~~gt~LVV~TkG~P 336 (567) T protein:vir:82 265 ATWDYLPP----PENMTGLCLMANGIAAGFAGNEVMFSEAYLPYAW----PEVNRHTTAEDIVAICPLRTSLVVATKGEP 336 (567) T ss_pred ccccccCc----CcccceeeecccceEEeecCCEEEEecCCCCccc----chhhccCCCCCeEEEEecccEEEEEEcCce Confidence 2111 11 1111222211 1111111 22222 112211111111 0111 No 25 >protein:vir:104388 Length: 566 # NCBI annotation: hypothetical protein # Family: family:all:1544 # MgeID: mge:1471 # MgeName: 86 # Cross-refs: genbank:acc:YP_794072;genbank:gi:116222017;genbank:GeneID:4397450 Probab=97.41 E-value=1.2e-05 Score=47.60 Aligned_cols=281 Identities=15% Similarity=0.150 Sum_probs=115.9 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHhhcccccCCCCCCcccchhh-hHHHhhccC-CcEEEccCCCC--CHHHhhh----hhh Q lcl|NC_018863. 141 DPMTILTEDAISVIAKSIEWAIFYGDAALAAEADNQAGIEFD-GLTKLIDEA-TNVIDLKGERL--DEATLNK----AAV 212 (479) Q Consensus 141 dp~~~~~~~ai~~~~~~~e~a~f~Gd~~l~~~~~~~~gleFD-Gl~~~I~~~-~NviDarG~~l--~~~~l~~----aa~ 212 (479) -|.+ ...|+-++| |-|. --+|-|..+ =.+...+|+-+ -..+|-+ .|. T Consensus 1 ~~~~------------------~~~~~~~~~-------~~~~~~~~~~~~M~~i~i~~f~Ge~Pr~~p~lLP~~~a~~A~ 55 (566) T protein:vir:10 1 MPIA------------------ILANSIINP-------LIFKPEAVKGISMPYIDITTMRGMMPRVVTSMLPDHSAVLAE 55 (566) T ss_pred Ccee------------------eehhhhccc-------eeecccccccceeeEEeecccccccccchhhhccccccceEE Confidence 1111 122333333 1111 012223221 13556777654 4555553 335 Q ss_pred eeecccCceeeeecChHHHhhH----HHhhcCceeEEeecCCCccccCccccceec-----CceeEEecCCc-------- Q lcl|NC_018863. 213 IVGKGYGRATDAFMPIGVQADF----TNNLLDRQRVIQPSQAGGFSTGFSINQFLS-----TRGAINLHGST-------- 275 (479) Q Consensus 213 ~i~~~fG~atd~~mp~~vka~f----~q~~~~~qrv~~~~n~g~~~~G~~V~~~~s-----s~g~I~L~~s~-------- 275 (479) ...-.+|..+=...|..+..-| ...|+.+...-+ +-.-+|..+.+ .+..+-++|+- T Consensus 56 n~~~~~G~itP~~~~~~~~~~~~~~~kTif~y~~~~W~-------~w~~~V~~ir~PvAqD~~~rvY~tg~~~Pk~t~~d 128 (566) T protein:vir:10 56 DCHFRFGVITPERQISGVEKTFTIKPKTIFHYRDDFWF-------AWPDVVDVIRSPVAQDNYGRIYYTDGKFPKVTAAE 128 (566) T ss_pred eeeecCCeeeeeecccccccccccCceeeeeecCcEeE-------EeCCceeeccCccccCCcceEEEeeCCcceeeecc Confidence 6666788888777775554333 222221111100 00111212111 11112222111 Q ss_pred -------ccCCCccccCcccCCCCCcccceEEEeecccccCcccccccceeeEEEEEEEcC-CCCcccccceeeeeecCC Q lcl|NC_018863. 276 -------IMENDNILVDRIPEPNAPQAPASVVATVKVNDKGAFRPVKDIKTHSYKVVVHSD-DAESLASEAVTAVVANPT 347 (479) Q Consensus 276 -------v~~a~~~lver~~s~~aP~~P~~vta~~~~~~~g~~~~~sd~g~Y~YkV~a~n~-~GES~~S~~VtaT~a~~~ 347 (479) ..-+..|. ..-.+|.++..+.....+...+. -..+.++|.|+++-++. +.||+||.+-........ T Consensus 129 iAt~g~~~~pa~~y~----LgVPaPs~apv~~~~~~sg~~~~--~~~d~~tr~Yv~TfVt~~GeES~PS~~S~~v~v~~~ 202 (566) T protein:vir:10 129 IATKGEGNFPAASYR----LGIPAPTTAPVCTVQKGEGATDE--NPNDDETRFYTETFVSAYGEEGPPGPESLEVTVGIP 202 (566) T ss_pred eeecccccccccccc----ccCCCCcccceeeccCCCcccCC--CCcccceeEEEEEEEcCCCCcCCCccccceeEecCC Confidence 11111111 11111211111111111111111 23466899999997775 557888877555554444 Q ss_pred C-eEEEEEeecCCccccceEEEEEeccCCC--CcEEEEEEeeeeeccCCCeeEEEeccCCCCCccceeeccccHHHHHH- Q lcl|NC_018863. 348 D-SVSLAVKLQSLYQAKPQFISVYRQGNET--GHYFLVARVPLSKADENGVITFVDRNQVIPETTDVFIGELTPQVISL- 423 (479) Q Consensus 348 ~-~V~LtIt~~~~~~~~~~y~~IYR~t~~~--g~~~~i~rV~~s~~n~~~tttf~D~N~~iPgT~~~fvge~~~q~i~l- 423 (479) | .|.|+..+.+.++..-+-+.|||+..++ ..|++++.++. +.++|+|--.. + -+|+.=| +..| T Consensus 203 gs~V~ltl~~~p~~~~~i~~~RIYRS~tg~~gtdy~lVael~a------s~~sf~Dd~~~---~---~lg~~Lp-s~~w~ 269 (566) T protein:vir:10 203 DTPVQLTLSPVPLQDANINRRRIYRSVSGGGEADFLLVAELEA------SVLSYTDNIPA---K---NLGPSLA-TWDYL 269 (566) T ss_pred CceEEEEecCCCcCcCCceeEEEEEecCCCCceeEEEEeeecc------cceeeeccccc---c---ccCcccc-ccccc Confidence 4 6999998888888888899999988644 58999999985 47899986422 1 1111100 0000 Q ss_pred ---HHhccccccCccccCchhHHHHHhhhhhheeccceeEEEEeccccCcccccc--------------ccCC Q lcl|NC_018863. 424 ---LELLPMMKLPLAQMNATTTFTVLWYGALALYAPKKWVRIKNVQYIPALAADV--------------TYRP 479 (479) Q Consensus 424 ---~ellPm~k~Pla~~~~~~~~~V~~yg~L~l~aPkk~~~ikNV~~~~~~~~~~--------------~~~~ 479 (479) ..|..|.-||++=+-+ ..+-=.||. --|-|.-| ..+|+-....|+ .=.| T Consensus 270 ~PP~~m~GL~~m~NGimAg-F~GneV~Fs--EpylPyAW----P~~Yr~t~~~dIVaiA~~gt~LVV~TkG~P 335 (566) T protein:vir:10 270 PPPENMTGLCLMANGIAAG-FAGNEVMFS--EAYLPYAW----PEVNRHTTAEDIVAVCPLGTSLVVATKGEP 335 (566) T ss_pred CcCcccceeeecccceEEe-ecCCEEEEe--cCCCCccc----chhhccCCCCCeEEEEeccceEEEEEcCce Confidence 1112222233211100 000000000 00122222 112221111111 0111 No 26 >protein:vir:103955 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1662 # MgeName: phiNM # Cross-refs: genbank:acc:YP_873992;genbank:gi:118430767;genbank:GeneID:4525449 Probab=97.38 E-value=3.1e-05 Score=45.31 Aligned_cols=304 Identities=11% Similarity=0.024 Sum_probs=158.3 Q ss_pred CcccccccceeeeecCchhHHHHHHHHHHHhhcCcccCc---ccccCccccchhhhHHHHHHHhhccccccchhhhccch Q lcl|NC_018863. 1 MTELQKEQKVEARKLPAGAEAELAELVSKSFTTGTGITP---DTQHDAAALRRELLDDQVKMLAFTNGDFTIYPLINKQQ 77 (479) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~e~~~Ksf~ag~~~~~---~~~~~gaAlr~esld~~i~~l~~~~~~f~~~~~i~k~~ 77 (479) |-+.+|.+ .++ -.|.+.+..|...++ ....+++.|-.+.+..+|........ .++......+ T Consensus 1 ~~~~~~~~------------~~~-~~f~~~~~~~~~~~a~~~~~~~~~~~liP~~~~~~ii~~~~~~s--~l~~~~~~~~ 65 (324) T protein:vir:10 1 MEQTQKLK------------LNL-QHFASNNVKPQVFNPDNVMMHEKKDGTLLNDFTTPILQEVMENS--KIMQLGKYEP 65 (324) T ss_pred CCCchHHH------------HHH-HHHHHHhhccceecccceeccCCCcceechhHHHHHHHHHHhhc--hhhhhcceee Confidence 43332222 111 223444444332222 22244555778877777755554433 3444444444 Q ss_pred hHHHHHHhhhhhccCcccccccccccccccccCcceEEEEEEEEeeeehhhhhhhHhhhcchhhHHHHHHHHHHHHHHHH Q lcl|NC_018863. 78 VNSTVAKYAVFNQHGRTGHSRFVREVGVASINDPNIRQKTVQMKFLSDTKQQSLAAGLVNNIADPMTILTEDAISVIAKS 157 (479) Q Consensus 78 ~~stv~~y~~~~~~G~~g~~~fv~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lv~~~~dp~~~~~~~ai~~~~~~ 157 (479) +.+.-.+|.++. +.+...+++|++..+..++.+.+.....+=++..-.+|.-+- .++..|.+....+.-...+++. T Consensus 66 ~~~~~~~~p~~~---~~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell-~ds~~~l~~~i~~~l~~ai~~~ 141 (324) T protein:vir:10 66 MEGTEKKFTFWA---DKPGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFL-NYTYSQFFEEMKPMIAEAFYKK 141 (324) T ss_pred ccCCceEEEEEe---CCcceeEeccCccccccccceeEEEEeeEEEEEeehhhHHHH-hcchHHHHHHHHHHHHHHHHHH Confidence 544334454433 334578999999999999999999999999998888887432 2455678888888888999999 Q ss_pred HHHHHhhcccccCCCCCCcccchhhhHHHhhccCCcEEEccCCCCCHHHhhhhhheeecccCceeeeecChHHHhhHHHh Q lcl|NC_018863. 158 IEWAIFYGDAALAAEADNQAGIEFDGLTKLIDEATNVIDLKGERLDEATLNKAAVIVGKGYGRATDAFMPIGVQADFTNN 237 (479) Q Consensus 158 ~e~a~f~Gd~~l~~~~~~~~gleFDGl~~~I~~~~NviDarG~~l~~~~l~~aa~~i~~~fG~atd~~mp~~vka~f~q~ 237 (479) +|.++|+|+-. ++ +..|+.+.+.. .+... ..-++.+.|.++.-.+..++..+.-+.|++.+.+.+... T Consensus 142 ~d~a~l~G~g~---~~------~~~~i~~~~~~-~~~~~--~~~~t~~~i~~~~~~l~~~~~~~~~~v~n~~~~~~L~~l 209 (324) T protein:vir:10 142 FDEAGILNQGN---NP------FGKSIAQSIEK-TNKVI--KGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKI 209 (324) T ss_pred HHHHhhhcCCC---Cc------cCccccccccc-cceec--cccCCHHHHHHHHHhhhhccCCCCEEEEcHHHHHHHHHh Confidence 99999999754 11 22356655542 34333 234667777777777778888888899999999999876 Q ss_pred hcCceeEEeecCCCccccCccccceec---CceeEEe-cCCcc--cCCCccccCcccCCCC-----Cc-cc-----ceEE Q lcl|NC_018863. 238 LLDRQRVIQPSQAGGFSTGFSINQFLS---TRGAINL-HGSTI--MENDNILVDRIPEPNA-----PQ-AP-----ASVV 300 (479) Q Consensus 238 ~~~~qrv~~~~n~g~~~~G~~V~~~~s---s~g~I~L-~~s~v--~~a~~~lver~~s~~a-----P~-~P-----~~vt 300 (479) -...-|.+.+...++.-.|.+|....+ ..+.+-+ +++.+ ....++.++....... +. .+ ..-+ T Consensus 210 ~d~~g~~~~~~~~~~~l~G~PV~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~ 289 (324) T protein:vir:10 210 VDPETKERIYDRNSDTLDGLPVVNLKSSNLKRGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMV 289 (324) T ss_pred hccCCceeecCCCCccccceeEEeecCCCCCcceEEEEecccEEEEEecCcEEEEeecccccccccccccchhhhhcCcE Confidence 654434444433333345555421111 1111111 01000 0111111111111000 00 00 0000 Q ss_pred EeecccccCcccccccceeeEEEEEEEcCCCCcccccc Q lcl|NC_018863. 301 ATVKVNDKGAFRPVKDIKTHSYKVVVHSDDAESLASEA 338 (479) Q Consensus 301 a~~~~~~~g~~~~~sd~g~Y~YkV~a~n~~GES~~S~~ 338 (479) +.-..-..|...... .-=-+++.....+|..|.++ T Consensus 290 ~~r~~~r~d~~v~~~---~A~~~l~~a~~~~~~~~~~~ 324 (324) T protein:vir:10 290 ALRATMHVALHIADD---KAFAKLVPADKKTDSVPGEV 324 (324) T ss_pred EEEEEEEEccEEecc---cceEEEEeccCCCCCCCCCC Confidence 000000001000110 01123445555555566665 No 27 >protein:vir:4953 Length: 397 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:108 # MgeName: Sfi19 # Cross-refs: genbank:acc:NP_049929;genbank:gi:9632900;genbank:GeneID:1262076 Probab=97.35 E-value=4.1e-05 Score=44.64 Aligned_cols=307 Identities=12% Similarity=0.102 Sum_probs=132.2 Q ss_pred Ccc--------c-ccccceeeeecCchhHHHHHHHHHHHhhcCcc-----cCcccccCccccchhhhHHHHHHHhhcccc Q lcl|NC_018863. 1 MTE--------L-QKEQKVEARKLPAGAEAELAELVSKSFTTGTG-----ITPDTQHDAAALRRELLDDQVKMLAFTNGD 66 (479) Q Consensus 1 ~~~--------~-~~~~~~~~~~~~~~~~~~~~e~~~Ksf~ag~~-----~~~~~~~~gaAlr~esld~~i~~l~~~~~~ 66 (479) +.+ . .++.+..... ........-..|.+.+..+.. .+-.+.++|+.|..+.+..+|..+...... T Consensus 60 ~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~ 138 (397) T protein:vir:49 60 YTEARANEVANMSEEEKKPLTKS-EEEVKAGFVKDFKNLVRGRYQNLLDSKTDASGSDAGLTIPQDIQTAIHTLVSQYDS 138 (397) T ss_pred HHHHHHHhhhccccccccccccc-hhHHHHHHHHHHHHHHhcchhHHHHHhhccccccCcccccHhHHHHHHHHHHhhhh Confidence 000 0 0000000000 000111111223333332221 122344678889999999988777666553 Q ss_pred ccchhhhccchhHHHHHHhhhhhccCcccccccccccccc-cccCcceEEEEEEEEeeeehhhhhhhHhhhcchhhHHHH Q lcl|NC_018863. 67 FTIYPLINKQQVNSTVAKYAVFNQHGRTGHSRFVREVGVA-SINDPNIRQKTVQMKFLSDTKQQSLAAGLVNNIADPMTI 145 (479) Q Consensus 67 f~~~~~i~k~~~~stv~~y~~~~~~G~~g~~~fv~E~g~~-~~~d~~~~r~~~~~k~l~~~~~vs~~~~lv~~~~dp~~~ 145 (479) +++.+....+.+...+|.......+.+...+++|++.. +.+++.+......++-++.-..+|.-+ +.++..|.+.. T Consensus 139 --l~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~i~~~~~k~~~~~~iS~el-l~ds~~~l~~~ 215 (397) T protein:vir:49 139 --LQEYVNVENVTTLTGSRVYEKWTDITGLANIDDEAGKIADVDDPKLSLIKYTIKRYAGISTVTNSL-LADSAENILAW 215 (397) T ss_pred --HHhhhceeecccCccceEEEeeccCCcceeeecCccccccccccceeeEEeeeeeEEeeehhHHHH-HhhhHHHHHHH Confidence 34444444444333334222233444567899999885 578999999999999999988888764 34556678888 Q ss_pred HHHHHHHHHHHHHHHHHhhcccccCCCCCCcccchhhhHHHhhccCCcEEEccCCCCCHHHhhhhhheeecccCceeeee Q lcl|NC_018863. 146 LTEDAISVIAKSIEWAIFYGDAALAAEADNQAGIEFDGLTKLIDEATNVIDLKGERLDEATLNKAAVIVGKGYGRATDAF 225 (479) Q Consensus 146 ~~~~ai~~~~~~~e~a~f~Gd~~l~~~~~~~~gleFDGl~~~I~~~~NviDarG~~l~~~~l~~aa~~i~~~fG~atd~~ 225 (479) ..+.-...++..++.+++.|+..-.+.+ ..+-+|++.+++.. +..+|...+..+ T Consensus 216 i~~~l~~~~~~~~d~ai~~G~g~~~~~~---~~~~~d~i~~~~~~-----------------------l~~~~~~~a~~v 269 (397) T protein:vir:49 216 LSGWIAKKVVVTRNKAILEAIAALPTKP---TLTKWDDIIDLEAK-----------------------VDPAIKQTSFFL 269 (397) T ss_pred HHHHHHHHHHHHHHHHHHhhcccccccc---ccccHHHHHHHHHh-----------------------hhhhhcCCCEEE Confidence 8899999999999999999988755432 23456666555532 222333334455 Q ss_pred cChHHHhhHHHhhcC-ceeEEeec---CCCccccCccccc----eecC----ceeEEe-cCCc---ccCCCccccCcccC Q lcl|NC_018863. 226 MPIGVQADFTNNLLD-RQRVIQPS---QAGGFSTGFSINQ----FLST----RGAINL-HGST---IMENDNILVDRIPE 289 (479) Q Consensus 226 mp~~vka~f~q~~~~-~qrv~~~~---n~g~~~~G~~V~~----~~ss----~g~I~L-~~s~---v~~a~~~lver~~s 289 (479) |++.+.+.+...-.. ++.+++++ .....-.|.+|.- +... ...|-+ .++. +.+..+..+++... T Consensus 270 mn~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~l~G~PV~~~~~~~~~~~~~~~~~i~~gd~~~~~~~~~~~~~~i~~~~~ 349 (397) T protein:vir:49 270 TNTSGFTALKKVKNALGDYLMERDVKSPTGYSIDGFAVKEVADRWLANGTGGAMPLYFGDLKQAVTLFDRQHMSLLSTNI 349 (397) T ss_pred EcHHHHHHHHHhhcCCCceeeccCcCCCCCceecceeeEEecccccccccCCceeEEEeeccceEEEEeecceEEEEecc Confidence 666665555443322 22222222 1111223333310 0000 000000 0010 00111111110000 Q ss_pred CCCCcccceEEEeecccccCcccccccceeeEEEEEEEcCCCCcccccce Q lcl|NC_018863. 290 PNAPQAPASVVATVKVNDKGAFRPVKDIKTHSYKVVVHSDDAESLASEAV 339 (479) Q Consensus 290 ~~aP~~P~~vta~~~~~~~g~~~~~sd~g~Y~YkV~a~n~~GES~~S~~V 339 (479) ....+..=.+..-...-..+...-.+ +.=.-++.+....---.+|.+| T Consensus 350 ~~~~~~~~~~~~r~~~r~d~~~~~~~--a~~~~~~~~~~~~~~~~~~~~~ 397 (397) T protein:vir:49 350 GGGAFETDTTKVRVIDRFDVVATDTE--AFVPASFKAIADQKGNLGSTAV 397 (397) T ss_pred ccchhhcCceeEEEEeeeCcEEeccc--ceEEEEeecccCCCCCcccccC Confidence 00000000000000000000000000 0001111111111111111111 No 28 >protein:vir:4339 Length: 395 # NCBI annotation: major head protein # Family: family:all:585 # MgeID: mge:93 # MgeName: D3 # Cross-refs: genbank:acc:NP_061502;genbank:gi:9635591;genbank:GeneID:1262860 Probab=97.29 E-value=2.5e-05 Score=45.78 Aligned_cols=297 Identities=14% Similarity=0.047 Sum_probs=141.3 Q ss_pred CcccccccceeeeecCchh-----HHHHHHHHHHHhhcCcc-------cCcccccCccccchhhhHHHHHHHhhcccccc Q lcl|NC_018863. 1 MTELQKEQKVEARKLPAGA-----EAELAELVSKSFTTGTG-------ITPDTQHDAAALRRELLDDQVKMLAFTNGDFT 68 (479) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~-----~~~~~e~~~Ksf~ag~~-------~~~~~~~~gaAlr~esld~~i~~l~~~~~~f~ 68 (479) +.+..+.. ...+.+... +..-.+.+.+.+..+.. ++..+ ..++.|..+.+..+|..+... ... T Consensus 68 ~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~g~~vp~~~~~~ii~~~~~--~~~ 142 (395) T protein:vir:43 68 MLANEKRD--GGEEAPKTAGQMVAESLKEQGVTSSLRGSHRVSMPRSAITSID-GSGGALVAPDRRPGVVAAPQR--RLT 142 (395) T ss_pred HHhhhccc--cccchhhhHHHHHHHHHHHHHHHHHhhhhhhhhhhhhhhcccC-CCCccccchhhHHHHHHHHHh--hhh Confidence 11110000 000111000 00001122222222221 11222 334445555566666554443 334 Q ss_pred chhhhccchhHHHHHHhhhhhccCcccccccccccccccccCcceEEEEEEEEeeeehhhhhhhHhhhcchhhHHHHHHH Q lcl|NC_018863. 69 IYPLINKQQVNSTVAKYAVFNQHGRTGHSRFVREVGVASINDPNIRQKTVQMKFLSDTKQQSLAAGLVNNIADPMTILTE 148 (479) Q Consensus 69 ~~~~i~k~~~~stv~~y~~~~~~G~~g~~~fv~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lv~~~~dp~~~~~~ 148 (479) +++-+...++.+...+|.+.. ++.+...+++|++..+..++.+......++=++....+|.-+ .+...+.+....+ T Consensus 143 l~~l~~~~~~~~~~~~~~~~~--~~~~~a~~v~E~~~~~~~~~~~~~i~~~~~k~~~~~~is~el--l~d~~~l~~~v~~ 218 (395) T protein:vir:43 143 IRDLVAPGTTESNSVEYVRET--GFVNNAAPVSEGTQKPYSDLTFELENAPVRTIAHLFKASRQI--LDDASALQSYIDA 218 (395) T ss_pred HHhhccceecCCCceEEEEEe--cCCCceeeecCCccccccccceeEEEEeeeeEEEeehhhHHH--HHhHHHHHHHHHH Confidence 566666666655544453333 334456789999999999999999999999999988888764 3334567788888 Q ss_pred HHHHHHHHHHHHHHhhcccccCCCCCCcccchhhhHHHhhccCCcEEEccC---CCCCHHHhhhhhheeecccCceeeee Q lcl|NC_018863. 149 DAISVIAKSIEWAIFYGDAALAAEADNQAGIEFDGLTKLIDEATNVIDLKG---ERLDEATLNKAAVIVGKGYGRATDAF 225 (479) Q Consensus 149 ~ai~~~~~~~e~a~f~Gd~~l~~~~~~~~gleFDGl~~~I~~~~NviDarG---~~l~~~~l~~aa~~i~~~fG~atd~~ 225 (479) .-...++..++.++++|+-. +-.+.|+.+.... .+.+.-+ .....+.|.++...+..+++..+-.. T Consensus 219 ~la~a~~~~~d~~~l~G~g~---------~~~~~Gi~~~~~~--~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~v 287 (395) T protein:vir:43 219 RARYGLMLVEECQLLYGNGT---------GANLHGIIPQAQA--YAPPSGVVVTAEQRIDRIRLAILQAQLAEFPASGIV 287 (395) T ss_pred HHHHHHHHHHHHHHHhccCC---------CCccccccccccc--cccccccccccchhHHHHHHHHHhhccccCCCcEEE Confidence 88899999999999999743 1236777765531 2222222 23345566666666777788888899 Q ss_pred cChHHHhhHHHhhcCceeEEeecCCCccccCccccceecCceeEEecCCcccCCCccccCcccCCCCCcccceEE----- Q lcl|NC_018863. 226 MPIGVQADFTNNLLDRQRVIQPSQAGGFSTGFSINQFLSTRGAINLHGSTIMENDNILVDRIPEPNAPQAPASVV----- 300 (479) Q Consensus 226 mp~~vka~f~q~~~~~qrv~~~~n~g~~~~G~~V~~~~ss~g~I~L~~s~v~~a~~~lver~~s~~aP~~P~~vt----- 300 (479) |++.+...+.......-|.+.+. +.+... .++++.+.+..+. .|... ... T Consensus 288 mn~~~~~~l~~lkd~~G~~i~~~-~~~~~~------------------~~l~G~pVv~~~~-----~~~~~-~~~gd~~~ 342 (395) T protein:vir:43 288 LNPIDWALIELNKDAENRYIIGS-PQNGTT------------------PTLWRLPVVETQA-----ITQDE-FLTGAFSL 342 (395) T ss_pred EcHHHHHHHHHhhccCCceeccc-cccCCC------------------ceecceeeEEcCC-----CCCCc-EEEEeccc Confidence 99999988876654333333321 211111 1111111111110 01000 000 Q ss_pred -----------EeecccccCcccccccceeeEEEEEEEcCCCCcccccceeeeeecCCCeEEEEEeec Q lcl|NC_018863. 301 -----------ATVKVNDKGAFRPVKDIKTHSYKVVVHSDDAESLASEAVTAVVANPTDSVSLAVKLQ 357 (479) Q Consensus 301 -----------a~~~~~~~g~~~~~sd~g~Y~YkV~a~n~~GES~~S~~VtaT~a~~~~~V~LtIt~~ 357 (479) -..... .+..|. . +...|++...=+.+- ..+..-+.|+++.+ T Consensus 343 ~~~~~~~~~~~i~~~~~-~~~~f~-~--~~~~~r~~~r~d~~v-----------~~~~a~~~~~~taa 395 (395) T protein:vir:43 343 GAQIFDRMDIEVLVSTE-NDKDFE-N--NMVTIRAEERLAFAV-----------YRPEAFVTGSLTAS 395 (395) T ss_pred eEEEEEecceEEEEecc-ccchhh-c--CcEEEEEEEeeccEE-----------ecccceEEEEeccC Confidence 000000 000000 0 011122111111111 11111222333322 No 29 >protein:vir:96223 Length: 324 # NCBI annotation: ORF011 # Family: family:all:507 # MgeID: mge:1607 # MgeName: 69 # Cross-refs: genbank:acc:YP_239571;genbank:gi:66395304;genbank:GeneID:5132771 Probab=97.25 E-value=8.2e-05 Score=42.98 Aligned_cols=312 Identities=12% Similarity=0.042 Sum_probs=149.3 Q ss_pred CcccccccceeeeecCchhHHHHHHHHHHH--hhcCcccCcccccCccccchhhhHHHHHHHhhccccccchhhhccchh Q lcl|NC_018863. 1 MTELQKEQKVEARKLPAGAEAELAELVSKS--FTTGTGITPDTQHDAAALRRELLDDQVKMLAFTNGDFTIYPLINKQQV 78 (479) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~e~~~Ks--f~ag~~~~~~~~~~gaAlr~esld~~i~~l~~~~~~f~~~~~i~k~~~ 78 (479) |-+.+|.+ -++.++...+.+- +.+... ....+++.|..+.+..+|..+..... .++..+...++ T Consensus 1 ~~~~~~~~---------~~~~~f~~~~~~~~~~~a~~~---~~~~~~~~lip~~~~~~ii~~~~~~s--~l~~l~~~~~~ 66 (324) T protein:vir:96 1 MEQTQKLK---------LNLQHFASNNVKPQVFNPDNV---MMHEKKDGTLLNDFTTPILQEVMENS--KIMQLGKYEPM 66 (324) T ss_pred CCcchhhh---------HHHHHHHHhhhhhhhcccccc---cccCCCcceechhHHHHHHHHHHhhc--hhhhhcceeec Confidence 33332221 1222222222221 222111 11234556778888787766554443 24444444455 Q ss_pred HHHHHHhhhhhccCcccccccccccccccccCcceEEEEEEEEeeeehhhhhhhHhhhcchhhHHHHHHHHHHHHHHHHH Q lcl|NC_018863. 79 NSTVAKYAVFNQHGRTGHSRFVREVGVASINDPNIRQKTVQMKFLSDTKQQSLAAGLVNNIADPMTILTEDAISVIAKSI 158 (479) Q Consensus 79 ~stv~~y~~~~~~G~~g~~~fv~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lv~~~~dp~~~~~~~ai~~~~~~~ 158 (479) .+.-.+|.++.. .+...+++|++..+..++++.+.....+=++-.-.+|.-+- .++..|.+....+.-...+++.+ T Consensus 67 ~~~~~~~p~~~~---~~~a~~v~Eg~~~~~~~~~f~~v~~~~~k~~~~~~is~ell-~ds~~~l~~~i~~~l~~aia~~~ 142 (324) T protein:vir:96 67 EGTEKKFTFWAD---KPGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFL-NYTYSQFFEEMKPMIAEAFYKKF 142 (324) T ss_pred cCCceEEEEEec---CcceeeecCCccccccccceeEEEEEeEEEEEeehhhHHHH-hcchHHHHHHHHHHHHHHHHHHH Confidence 544345555443 33567999999999999999999999999998888887432 24556788888888889999999 Q ss_pred HHHHhhcccccCCCCCCcccchhhhHHHhhccCCcEEEccCCCCCHHHhhhhhheeecccCceeeeecChHHHhhHHHhh Q lcl|NC_018863. 159 EWAIFYGDAALAAEADNQAGIEFDGLTKLIDEATNVIDLKGERLDEATLNKAAVIVGKGYGRATDAFMPIGVQADFTNNL 238 (479) Q Consensus 159 e~a~f~Gd~~l~~~~~~~~gleFDGl~~~I~~~~NviDarG~~l~~~~l~~aa~~i~~~fG~atd~~mp~~vka~f~q~~ 238 (479) |.++|+|+-+ ++ +-.|+...+.. .+.... ..++.+.|.++...+..++..++-+.|++.+.+.+...- T Consensus 143 d~~~l~G~g~---~~------~~~~~~~~~~~-~~~~~~--~~~~~~~i~~~~~~i~~~~~~~~~~i~n~~~~~~L~~lk 210 (324) T protein:vir:96 143 DEAGILNQGN---NP------FGKSIAQSIKK-TNKVIK--GDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIV 210 (324) T ss_pred HHHhhhcCCC---CC------cCccccccccc-cceecc--cccchHHHHHHHHhhhhccCCCCEEEEcHHHHHHHHHhh Confidence 9999999754 11 12355555542 233222 234566666666666778888888999999999988776 Q ss_pred cCceeEEeecCCCccccCccccceecCceeEEecCCcccCCCccccCcccCCCCCcccceEEEeec----ccccCc---c Q lcl|NC_018863. 239 LDRQRVIQPSQAGGFSTGFSINQFLSTRGAINLHGSTIMENDNILVDRIPEPNAPQAPASVVATVK----VNDKGA---F 311 (479) Q Consensus 239 ~~~qrv~~~~n~g~~~~G~~V~~~~ss~g~I~L~~s~v~~a~~~lver~~s~~aP~~P~~vta~~~----~~~~g~---~ 311 (479) ...-|.+.++..+..-.|.+|.-..+. .+. .+..+..+......+.-. ... ..+.-.+. ....+. . T Consensus 211 d~~G~~~~~~~~~~~l~G~PV~~~~~~--~~~-~~~~~~gd~s~~~~~~~~--~~~--i~~~~~~~~~~~~~~~~~~~~~ 283 (324) T protein:vir:96 211 DPETKERIYDRNSDSLDGLPVVNLKSS--NLK-RGELITGDFDKLIYGIPQ--LIE--YKIDETAQLSTVKNEDGTPVNL 283 (324) T ss_pred CCCCCeeecCCCCCcccceeeEeecCC--CCC-cceEEEEecceEEEEEec--CcE--EEEeecccccccccccccchhh Confidence 555555555444433444444210000 000 001111111111110000 000 00000000 000000 0 Q ss_pred c-cccc--ceeeEEEEEEEcCCCCcccccceeeeeecCCCeEEEEEeecCC Q lcl|NC_018863. 312 R-PVKD--IKTHSYKVVVHSDDAESLASEAVTAVVANPTDSVSLAVKLQSL 359 (479) Q Consensus 312 ~-~~sd--~g~Y~YkV~a~n~~GES~~S~~VtaT~a~~~~~V~LtIt~~~~ 359 (479) | .+.- .....+-..+.+..+ .+..+.+.....+ +|... T Consensus 284 ~~~n~v~~r~~~r~d~~v~~~~a------~~~l~~a~~~~~~----~~~~~ 324 (324) T protein:vir:96 284 FEQDMVALRATMHVALHIADDKA------FAKLVPADKRTDS----VPGEV 324 (324) T ss_pred hhcCcEEEEEEEEeccEEecccc------eEEEecccccCCC----CCCCC Confidence 0 0000 000011111111111 1111111111111 11111 No 30 >protein:vir:97148 Length: 324 # NCBI annotation: ORF010 # Family: family:all:507 # MgeID: mge:1654 # MgeName: 85 # Cross-refs: genbank:acc:YP_239726;genbank:gi:66394880;genbank:GeneID:5130881 Probab=97.24 E-value=4.3e-05 Score=44.50 Aligned_cols=304 Identities=12% Similarity=0.042 Sum_probs=161.8 Q ss_pred CcccccccceeeeecCchhHHHHHHHHHHH--hhcCcccCcccccCccccchhhhHHHHHHHhhccccccchhhhccchh Q lcl|NC_018863. 1 MTELQKEQKVEARKLPAGAEAELAELVSKS--FTTGTGITPDTQHDAAALRRELLDDQVKMLAFTNGDFTIYPLINKQQV 78 (479) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~e~~~Ks--f~ag~~~~~~~~~~gaAlr~esld~~i~~l~~~~~~f~~~~~i~k~~~ 78 (479) |-+.+|. +.+. .+++....+. +.+.. ....++|+.|.++.+..+|........ .+++.+.+.+. T Consensus 1 ~~~~~~~-~~~~--------~~f~~~~~~~~~~~a~~---~~~~~~~~~~iP~~~~~~ii~~~~~~s--~l~~~~~~~~~ 66 (324) T protein:vir:97 1 MEQTQKL-KLNL--------QHFASNNVKPQVFNPDN---VMMHEKKDGTLMNEFTTPILQEVMENS--KIMQLGKYEPM 66 (324) T ss_pred CccchhH-HHHH--------HHHHHhhhhhhhhcccc---ccccCCCcceechhHHHHHHHHHHhhc--chhhhcceeec Confidence 5443222 1111 1222222222 22221 122345677888888777755554333 34444444445 Q ss_pred HHHHHHhhhhhccCcccccccccccccccccCcceEEEEEEEEeeeehhhhhhhHhhhcchhhHHHHHHHHHHHHHHHHH Q lcl|NC_018863. 79 NSTVAKYAVFNQHGRTGHSRFVREVGVASINDPNIRQKTVQMKFLSDTKQQSLAAGLVNNIADPMTILTEDAISVIAKSI 158 (479) Q Consensus 79 ~stv~~y~~~~~~G~~g~~~fv~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lv~~~~dp~~~~~~~ai~~~~~~~ 158 (479) .+--.+|.++. +.+...+++|++..+..++.+.......+=++---.+|.-+ +.++..|.+....+.-...++..+ T Consensus 67 ~~~~~~ip~~~---~~~~a~~v~Eg~~~~~~~~~f~~v~~~~~k~~~~~~is~el-l~ds~~~l~~~i~~~l~~aia~~~ 142 (324) T protein:vir:97 67 EGTEKKFTFWA---DKPGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEF-LNYTYSQFFEEMKPMIAEAFYKKF 142 (324) T ss_pred cCCceEEEEEe---cCcceeEeccCccccccccceeEEEEeeEEEEEeehhhHHH-HhcchHHHHHHHHHHHHHHHHHHH Confidence 43323443433 33456799999999999999999999999999888888732 234556788888899999999999 Q ss_pred HHHHhhcccccCCCCCCcccchhhhHHHhhccCCcEEEccCCCCCHHHhhhhhheeecccCceeeeecChHHHhhHHHhh Q lcl|NC_018863. 159 EWAIFYGDAALAAEADNQAGIEFDGLTKLIDEATNVIDLKGERLDEATLNKAAVIVGKGYGRATDAFMPIGVQADFTNNL 238 (479) Q Consensus 159 e~a~f~Gd~~l~~~~~~~~gleFDGl~~~I~~~~NviDarG~~l~~~~l~~aa~~i~~~fG~atd~~mp~~vka~f~q~~ 238 (479) |.++|.|+..= .+..|+...+.. .+.... .-++.+.|.++...+..++..+.-..|++.+.+.+.+.- T Consensus 143 d~a~l~G~g~~---------~~~~gi~~~~~~-~~~~~~--~~~~~~~i~~~~~~l~~~~~~~~~~v~n~~~~~~L~~lk 210 (324) T protein:vir:97 143 DEAGILNQGNN---------PFGKSIAQSIEK-TNKVIK--GDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIV 210 (324) T ss_pred HHHhhccCCCC---------ccCccccccccc-cceecc--ccCCHHHHHHHHHhhhhccCCCCEEEEcHHHHHHHHHhh Confidence 99999998641 223466666653 444332 345677788877777888888888999999999988766 Q ss_pred cCceeEEeecCCCccccCccccceec---CceeEEe-cCCcc--cCCCccccCcccCCC--C---Cc-cc-----ceEEE Q lcl|NC_018863. 239 LDRQRVIQPSQAGGFSTGFSINQFLS---TRGAINL-HGSTI--MENDNILVDRIPEPN--A---PQ-AP-----ASVVA 301 (479) Q Consensus 239 ~~~qrv~~~~n~g~~~~G~~V~~~~s---s~g~I~L-~~s~v--~~a~~~lver~~s~~--a---P~-~P-----~~vta 301 (479) .+.-|-+......+.-.|.+|....+ ..+.+-+ .++.+ ....++.+++..... . +. ++ ..-++ T Consensus 211 d~~g~~~~~~~~~~tl~G~PV~~~~~~~~~~~~~~~gd~~~~~i~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~d~~~ 290 (324) T protein:vir:97 211 DPETKERIYDRNSDTLDGLPVVNLKSSNLKRGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVA 290 (324) T ss_pred cCCCceeecCCCCccccceeeEeecCCCCCcceEEEEecccEEEEEecCcEEEEeecccccccccccccchhhhhcCcEE Confidence 44333333333333344554421111 1111110 00000 011111111111100 0 00 00 00011 Q ss_pred e-ecccccCcccccccceeeEEEEEEEcCCCCcccccc Q lcl|NC_018863. 302 T-VKVNDKGAFRPVKDIKTHSYKVVVHSDDAESLASEA 338 (479) Q Consensus 302 ~-~~~~~~g~~~~~sd~g~Y~YkV~a~n~~GES~~S~~ 338 (479) . ...-..+.....+ . --+++-.+..++..|++. T Consensus 291 ~r~~~r~d~~v~~~~---a-~~~l~~~~~~~~~~~~~~ 324 (324) T protein:vir:97 291 LRATMHVALHIADDK---A-FAKLVPADKKTDSVPGEV 324 (324) T ss_pred EEEEEEeccEEeccc---c-eEEEEeccCCCCCCCCCC Confidence 0 0000111111111 1 124566777778888887 No 31 >protein:vir:9309 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:165 # MgeName: phi 11 # Cross-refs: genbank:acc:NP_803287;genbank:gi:29028597;genbank:GeneID:1258044 Probab=97.24 E-value=2.7e-05 Score=45.62 Aligned_cols=315 Identities=11% Similarity=0.064 Sum_probs=149.3 Q ss_pred CcccccccceeeeecCchhHHHHHHHHHHHhhcCcccCcccccCccccchhhhHHHHHHHhhccccccchhhhccchhHH Q lcl|NC_018863. 1 MTELQKEQKVEARKLPAGAEAELAELVSKSFTTGTGITPDTQHDAAALRRELLDDQVKMLAFTNGDFTIYPLINKQQVNS 80 (479) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~e~~~Ksf~ag~~~~~~~~~~gaAlr~esld~~i~~l~~~~~~f~~~~~i~k~~~~s 80 (479) |-+.+|.+. ...+. +......+.|.|...+ ...+++.|..+.+..+|..+...... +.+...+.+..+ T Consensus 1 ~~~~~~~~~-~~~~f------~~~~~~~~~~~a~~~~---~~~~~~~liP~~~~~~ii~~~~~~s~--l~~l~~~~~~~~ 68 (324) T protein:vir:93 1 MEQTQKLKL-NLQHF------ASNNVKPQVFNPDNVM---MHEKKDGTLLNDFTTPILQEVMENSK--IMQLGKYEPMEG 68 (324) T ss_pred CchhHHHHH-HHHHH------HHhhhhhhhccccccc---ccCCCcceechhHHHHHHHHHHhhch--hhhhcceeeccC Confidence 554444331 11110 1111123445543322 12234557788888888666554443 333333444544 Q ss_pred HHHHhhhhhccCcccccccccccccccccCcceEEEEEEEEeeeehhhhhhhHhhhcchhhHHHHHHHHHHHHHHHHHHH Q lcl|NC_018863. 81 TVAKYAVFNQHGRTGHSRFVREVGVASINDPNIRQKTVQMKFLSDTKQQSLAAGLVNNIADPMTILTEDAISVIAKSIEW 160 (479) Q Consensus 81 tv~~y~~~~~~G~~g~~~fv~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lv~~~~dp~~~~~~~ai~~~~~~~e~ 160 (479) ...+|.++. +.....+++|++..+..++++.+.....+=++..-.+|.-+- .++..|.+....+.--..+++.+|. T Consensus 69 ~~~~ip~~~---~~~~a~~v~Eg~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell-~ds~~~l~~~i~~~l~~aia~~~d~ 144 (324) T protein:vir:93 69 TEKKFTFWA---DKPGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFL-NYTYSQFFEEMKPMIAEAFYKKFDE 144 (324) T ss_pred CceEEEEEe---cCcceeeecCCccccccccceeEEEEEeEEEEEeehhhHHHH-hcchHHHHHHHHHHHHHHHHHHHHH Confidence 434455443 334567999999999999999999999999998888887432 2455677888888888899999999 Q ss_pred HHhhcccccCCCCCCcccchhhhHHHhhccCCcEEEccCCCCCHHHhhhhhheeecccCceeeeecChHHHhhHHHhhcC Q lcl|NC_018863. 161 AIFYGDAALAAEADNQAGIEFDGLTKLIDEATNVIDLKGERLDEATLNKAAVIVGKGYGRATDAFMPIGVQADFTNNLLD 240 (479) Q Consensus 161 a~f~Gd~~l~~~~~~~~gleFDGl~~~I~~~~NviDarG~~l~~~~l~~aa~~i~~~fG~atd~~mp~~vka~f~q~~~~ 240 (479) ++|+|+-. + .+..|+...+.. .+.... ..++.+.|.++.-.+..+++..+...|++.+.+.+...-.. T Consensus 145 a~l~G~g~---~------~~~~~~~~~~~~-~~~~~~--~~~~~~~i~~~~~~l~~~~~~~~~~v~n~~~~~~L~~l~d~ 212 (324) T protein:vir:93 145 AGILNQGN---N------PFGKSIAQSIEK-TNKVIK--GDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIVDP 212 (324) T ss_pred HHhcCCCC---C------CcCccccccccc-cceecc--ccccHHHHHHHHHhhhhccCCCCEEEEcHHHHHHHHHhhCC Confidence 99999754 1 123455555542 333322 23456667766666777888888899999999998876544 Q ss_pred ceeEEeecCCCccccCccccceecCceeEEecCCcccCCCccccCcccCCCCCcccceEEEe--ecccccC---cccccc Q lcl|NC_018863. 241 RQRVIQPSQAGGFSTGFSINQFLSTRGAINLHGSTIMENDNILVDRIPEPNAPQAPASVVAT--VKVNDKG---AFRPVK 315 (479) Q Consensus 241 ~qrv~~~~n~g~~~~G~~V~~~~ss~g~I~L~~s~v~~a~~~lver~~s~~aP~~P~~vta~--~~~~~~g---~~~~~s 315 (479) .-|.+.+......-.|.+|....+. .+. .+..+..+........-. ......+.-+. ......+ ..|. . T Consensus 213 ~G~~~~~~~~~~~l~G~PVv~~~~~--~~~-~~~i~~gdfs~~~~~~~~--~~~i~~~~~~~~~~~~~~~~~~~~~f~-~ 286 (324) T protein:vir:93 213 ETKERIYDRNSDSLDGLPVVNLKSS--NLK-RGELITGDFDKLIYGIPQ--LIEYKIDETAQLSTVKNEDGTPVNLFE-Q 286 (324) T ss_pred CCCeeecCCCCCcccceeeEeecCC--CCC-cceEEEEecceEEEEEec--CcEEEEeecccccccccccccchhhhh-c Confidence 4445544333333344443211100 000 000111111111000000 00000000000 0000000 0000 0 Q ss_pred cc----eeeEEEEEEEcCCCCcccccceeeeeecCCCeEEEEEeecCC Q lcl|NC_018863. 316 DI----KTHSYKVVVHSDDAESLASEAVTAVVANPTDSVSLAVKLQSL 359 (479) Q Consensus 316 d~----g~Y~YkV~a~n~~GES~~S~~VtaT~a~~~~~V~LtIt~~~~ 359 (479) +. ....|-..+.+..+ .+-.+.+.....+ ||... T Consensus 287 n~~~~r~~~r~d~~v~~~~a------~~~l~~a~~~~~~----~~~~~ 324 (324) T protein:vir:93 287 DMVALRATMHVALHIADDKA------FAKLVPADKRTDS----VPGEV 324 (324) T ss_pred CcEEEEEEEEeccEEecccc------eEEEecccccCCC----CCCCC Confidence 00 00111111111111 1111111111111 11111 No 32 >protein:vir:100135 Length: 418 # NCBI annotation: gp5 # Family: family:all:585 # MgeID: mge:1639 # MgeName: phi1026b # Cross-refs: genbank:acc:NP_945035;genbank:gi:38707895;genbank:GeneID:2744182 Probab=97.23 E-value=5.1e-05 Score=44.13 Aligned_cols=315 Identities=13% Similarity=0.009 Sum_probs=153.5 Q ss_pred Cccccc-------ccceeeeecCchh--HHHHHHHHHHHhhcCcc-------------cCcccccCccccchhhhHHHHH Q lcl|NC_018863. 1 MTELQK-------EQKVEARKLPAGA--EAELAELVSKSFTTGTG-------------ITPDTQHDAAALRRELLDDQVK 58 (479) Q Consensus 1 ~~~~~~-------~~~~~~~~~~~~~--~~~~~e~~~Ksf~ag~~-------------~~~~~~~~gaAlr~esld~~i~ 58 (479) +.++.+ ..+....+.-... +......+.+.+..+.. ....+-.+|+.|-.+.+..+|. T Consensus 77 ~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~lvp~~~~~~ii 156 (418) T protein:vir:10 77 LLEAEQKLARGGGSAELETPKTLGQLVTESEEMKGMDGSARKSVRVRVDRKSIMNVPATVGSGVSGSNSLVVADRQAGII 156 (418) T ss_pred HHHHHHHHhhcccccccchhhhhhHHhhhHHHHHHHHHHHhhhhhhhhHHHHHHHhhhhccCCCCCCccccchhHHHHHH Confidence 111110 0011000000000 11111223332222211 1112234566678888877776 Q ss_pred HHhhccccccchhhhccchhHHHHHHhhhhhccCcccccccccccccccccCcceEEEEEEEEeeeehhhhhhhHhhhcc Q lcl|NC_018863. 59 MLAFTNGDFTIYPLINKQQVNSTVAKYAVFNQHGRTGHSRFVREVGVASINDPNIRQKTVQMKFLSDTKQQSLAAGLVNN 138 (479) Q Consensus 59 ~l~~~~~~f~~~~~i~k~~~~stv~~y~~~~~~G~~g~~~fv~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lv~~ 138 (479) .+..... .+++-+...++.+.-.+|.+. .+......++.|++..+.+++.+......++-++.--.+|.- +.+. T Consensus 157 ~~~~~~~--~l~~~~~~~~~~~~~~~~~~~--~~~~~~a~~v~E~~~~~~~~~~f~~v~~~~~k~~~~~~is~e--ll~d 230 (418) T protein:vir:10 157 APPQRKM--TIRDLLMPGQTSSSSIEYTVE--TGFTNNAAAVAEGAQKPTSDLKFNLKNQPVRTIAHLFKASRQ--ILDD 230 (418) T ss_pred HHHhhhh--hHHhhcceeeccCCceeEEEE--ecCCCceeeeccCccccccccceeeEEEeeeeEEEeehhhHH--HHHh Confidence 5554433 345555544444322223222 233345678999999999999999999999999988778775 3333 Q ss_pred hhhHHHHHHHHHHHHHHHHHHHHHhhcccccCCCCCCcccchhhhHHHhhccCCcEEEccCCCCCHHHhhhhhheeeccc Q lcl|NC_018863. 139 IADPMTILTEDAISVIAKSIEWAIFYGDAALAAEADNQAGIEFDGLTKLIDEATNVIDLKGERLDEATLNKAAVIVGKGY 218 (479) Q Consensus 139 ~~dp~~~~~~~ai~~~~~~~e~a~f~Gd~~l~~~~~~~~gleFDGl~~~I~~~~NviDarG~~l~~~~l~~aa~~i~~~f 218 (479) -.|.+....+.-...+++.++.++|+|+-.= -+..|+.+........... ......+.|..+.-.+..++ T Consensus 231 s~~l~~~i~~~l~~a~~~~~d~a~l~G~g~~---------~~p~Gi~~~~~~~~~~~~~-~~~~~~~~i~~~~~~~~~~~ 300 (418) T protein:vir:10 231 APALQSYIDGRARYGLQLTEEGQILKGDGTG---------ANILGILPQASAFMPSITL-ANATPIDKIRLALLQAVLAE 300 (418) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhccCCCC---------ccccccccccccccccccc-cccccHHHHHHHHHhhcccc Confidence 3588888889999999999999999997641 1245776654321111111 12334555666555667778 Q ss_pred CceeeeecChHHHhhHHHhhcCceeEEeecC---CCccccCccccceecCceeEEecCCcccCCC--ccccCcccCCCCC Q lcl|NC_018863. 219 GRATDAFMPIGVQADFTNNLLDRQRVIQPSQ---AGGFSTGFSINQFLSTRGAINLHGSTIMEND--NILVDRIPEPNAP 293 (479) Q Consensus 219 G~atd~~mp~~vka~f~q~~~~~qrv~~~~n---~g~~~~G~~V~~~~ss~g~I~L~~s~v~~a~--~~lver~~s~~aP 293 (479) +..+-++|++.+...+...-...-|.+.++. .+..-.|.+| +.+...+ .+..+..+. .+++.. . T Consensus 301 ~~~~~~v~n~~~~~~L~~lkd~~G~~i~~~~~~~~~~~l~G~pV--~~~~~~p---~~~~~~gd~s~~~~~~~---~--- 369 (418) T protein:vir:10 301 FPATGIVLNPIDWASIELTKDSQGRYIVGNPVNGTTPRLWNLPV--VETQAMT---ANEFLVGAFSMAAQIFD---R--- 369 (418) T ss_pred CCCCEEEEcHHHHHHHHHhhcCCCceeccccccCCCceecceee--EEcCCCC---CCcEEEeeccceEEEEE---e--- Confidence 8888899999999998876654444554431 1112223333 1111000 011122211 121110 0 Q ss_pred cccceEEEeecccccCcccccccceeeEEEEEEEcCCCCcccccceeeeeecCCCe Q lcl|NC_018863. 294 QAPASVVATVKVNDKGAFRPVKDIKTHSYKVVVHSDDAESLASEAVTAVVANPTDS 349 (479) Q Consensus 294 ~~P~~vta~~~~~~~g~~~~~sd~g~Y~YkV~a~n~~GES~~S~~VtaT~a~~~~~ 349 (479) . ..+...... .+..|. . +...|++...=+.+-=.+...+..+.+.+.++ T Consensus 370 -~--~~~i~~~~~-~~~~f~-~--~~~~~r~~~~~d~~~~~~~a~~~~~~~~~~~g 418 (418) T protein:vir:10 370 -M--EIEVLLSTE-NVDDFE-K--NMVSIRAEERLALAVYRPESFVTGALVEQAGG 418 (418) T ss_pred -c--ceEEEEecc-cchhhh-c--CceEEEEEEeeccEEecccceEEEEeccCCCC Confidence 0 011111111 111222 1 23334433222222222344444444444444 No 33 >protein:vir:99749 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1497 # MgeName: phiETA2 # Cross-refs: genbank:acc:YP_001004307;genbank:gi:122891761;genbank:GeneID:4712304 Probab=97.13 E-value=0.00012 Score=42.08 Aligned_cols=315 Identities=11% Similarity=0.038 Sum_probs=146.2 Q ss_pred CcccccccceeeeecCchhHHHHHHHHHH--HhhcCcccCcccccCccccchhhhHHHHHHHhhccccccchhhhccchh Q lcl|NC_018863. 1 MTELQKEQKVEARKLPAGAEAELAELVSK--SFTTGTGITPDTQHDAAALRRELLDDQVKMLAFTNGDFTIYPLINKQQV 78 (479) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~e~~~K--sf~ag~~~~~~~~~~gaAlr~esld~~i~~l~~~~~~f~~~~~i~k~~~ 78 (479) |-+..|.+ .+..++..+..+ .|.+..- ....+++.|-.+.+..+|..+...... +.+.....++ T Consensus 1 ~~k~~~~~---------~~~~~~~~~~~~~~~~~a~~~---~~~~~~~~lip~~~~~~ii~~~~~~s~--l~~~~~~~~~ 66 (324) T protein:vir:99 1 MEQTQKLK---------LNLQHFASNNVKPQVFNPDNV---MMHEKKDGTLLNDFTTPILQEVMENSK--IMRLGKYEPM 66 (324) T ss_pred CCCchHhh---------HHHHHHHHHhhhhhhccccce---eccCCCcceechhHHHHHHHHHHhhch--hhhhcceeec Confidence 54443322 111222111111 1333221 122445567788888887666544442 3343443444 Q ss_pred HHHHHHhhhhhccCcccccccccccccccccCcceEEEEEEEEeeeehhhhhhhHhhhcchhhHHHHHHHHHHHHHHHHH Q lcl|NC_018863. 79 NSTVAKYAVFNQHGRTGHSRFVREVGVASINDPNIRQKTVQMKFLSDTKQQSLAAGLVNNIADPMTILTEDAISVIAKSI 158 (479) Q Consensus 79 ~stv~~y~~~~~~G~~g~~~fv~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lv~~~~dp~~~~~~~ai~~~~~~~ 158 (479) .+.-.+|.++. +.+...+++|++..+..++.+.+.....+=++..-.+|.-+- .++..|.+....+.-...+++.+ T Consensus 67 ~~~~~~~p~~~---~~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell-~ds~~~l~~~i~~~l~~ai~~~~ 142 (324) T protein:vir:99 67 EGTEKKFTFWA---DKPGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFL-NYTYSQFFEEMKPMIAEAFYKKF 142 (324) T ss_pred cCCceEEEEEe---cCcceeEeccCccccccccceeEEEEeeEEEEEeehhhHHHH-hcchHHHHHHHHHHHHHHHHHHH Confidence 43223343332 344578999999999999999999999999998888887432 24456788888889999999999 Q ss_pred HHHHhhcccccCCCCCCcccchhhhHHHhhccCCcEEEccCCCCCHHHhhhhhheeecccCceeeeecChHHHhhHHHhh Q lcl|NC_018863. 159 EWAIFYGDAALAAEADNQAGIEFDGLTKLIDEATNVIDLKGERLDEATLNKAAVIVGKGYGRATDAFMPIGVQADFTNNL 238 (479) Q Consensus 159 e~a~f~Gd~~l~~~~~~~~gleFDGl~~~I~~~~NviDarG~~l~~~~l~~aa~~i~~~fG~atd~~mp~~vka~f~q~~ 238 (479) |.++|+|+-. ++ +.-|+.+.+.. .+.... .-++.+.|.++.-.+..++..+.-+.|++.+.+.+...- T Consensus 143 d~~~l~G~g~---~~------~~~~~~~~~~~-~~~~~~--~~~~~~~i~~~~~~l~~~~~~~~~~v~n~~~~~~L~~l~ 210 (324) T protein:vir:99 143 DEAGILNQGN---NP------FGKSIAQSIEK-TNKVIK--GDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRKIV 210 (324) T ss_pred HHHhhhcCCC---Cc------cCccccccccc-cceecc--ccCCHHHHHHHHHhhhhccCCCCEEEEcHHHHHHHHHhh Confidence 9999999764 11 22355555542 333322 335667777777777788888888999999999988765 Q ss_pred cCceeEEeecCCCccccCccccceecCceeEEecCCcccCCCccccCcccCCCCCcccceEEEeec---c-cccCc---c Q lcl|NC_018863. 239 LDRQRVIQPSQAGGFSTGFSINQFLSTRGAINLHGSTIMENDNILVDRIPEPNAPQAPASVVATVK---V-NDKGA---F 311 (479) Q Consensus 239 ~~~qrv~~~~n~g~~~~G~~V~~~~ss~g~I~L~~s~v~~a~~~lver~~s~~aP~~P~~vta~~~---~-~~~g~---~ 311 (479) ...-|.+.+...++.-.|.+|....+.... .+..+..+....+.+.-. ... ....-... . ...+. . T Consensus 211 d~~g~~~~~~~~~~~l~G~PVv~~~~~~~~---~~~~i~gd~~~~~~~~~~--~~~--i~~~~~~~~~~~~~~~~~~~~~ 283 (324) T protein:vir:99 211 DPETKERIYDRNSDTLDGLPVVNLKSSNLK---RGELITGDFDKLIYGIPQ--LIE--YKIDETAQLSTVKNEDGTPVNL 283 (324) T ss_pred cCCCceeecCCCCccccceeEEeecCCCCC---cceEEEEecccEEEEEec--CcE--EEEeecccccccccccccchhh Confidence 433333333222222233333110000000 000111111111000000 000 00000000 0 00000 0 Q ss_pred cccccceeeEEEEEEEcCCCCcccccceeeeeecCCCeEEEEEeecCC Q lcl|NC_018863. 312 RPVKDIKTHSYKVVVHSDDAESLASEAVTAVVANPTDSVSLAVKLQSL 359 (479) Q Consensus 312 ~~~sd~g~Y~YkV~a~n~~GES~~S~~VtaT~a~~~~~V~LtIt~~~~ 359 (479) |. . +.-.+++...=+.+-=.+...+..|.+..+..+ +|+.. T Consensus 284 f~-~--~~~~~r~~~r~d~~v~~~~a~~~lt~a~~~~~~----~~~~~ 324 (324) T protein:vir:99 284 FE-Q--DMVALRATMHVALHIADDKAFAKLVPADKKTDS----VPGEV 324 (324) T ss_pred hh-c--CcEEEEEEEEEccEEecccceEEEEeccCCCCC----CCCCC Confidence 00 0 001111110000000001111111111111111 11111 No 34 >protein:vir:9574 Length: 300 # NCBI annotation: gp40 # Family: family:all:966 # MgeID: mge:171 # MgeName: SM1 # Cross-refs: genbank:acc:NP_862879;genbank:gi:32469471;genbank:GeneID:1461316 Probab=97.11 E-value=0.00017 Score=41.29 Aligned_cols=294 Identities=12% Similarity=0.028 Sum_probs=141.6 Q ss_pred hhcCcccCcccccCccccchhhhHHHHHHHhhccccccchhhhccchhHHHHHHhhhhhccCcccccccccccccccccC Q lcl|NC_018863. 31 FTTGTGITPDTQHDAAALRRELLDDQVKMLAFTNGDFTIYPLINKQQVNSTVAKYAVFNQHGRTGHSRFVREVGVASIND 110 (479) Q Consensus 31 f~ag~~~~~~~~~~gaAlr~esld~~i~~l~~~~~~f~~~~~i~k~~~~stv~~y~~~~~~G~~g~~~fv~E~g~~~~~d 110 (479) |. ++.++++.|-.+.+..+|........- +.+-....+..+.--+|.++ .+.+...+++|++..+.++ T Consensus 1 ma-------~~t~~~G~lip~~~~~~ii~~l~~~s~--i~~l~~~~~~~~~~~~~p~~---~~~~~a~wv~Eg~~~~~s~ 68 (300) T protein:vir:95 1 MS-------EAQLSKGNLFNPELVTKVINKVKGHSS--IAKLSPQKPIPFNGQREFVF---DFDSDIDIVAENGKKTHGG 68 (300) T ss_pred Cc-------ccccCCcceechhhHHHHHHHHHhhhh--hhhhcceeeccCCceEEEEE---ecCcceEEeeCCccccccc Confidence 22 233444555555566666444333321 22222222233221233333 3335678999999999999 Q ss_pred cceEEEEEEEEeeeehhhhhhhHhhhc--chhhHHHHHHHHHHHHHHHHHHHHHhhcccccCCCCCCcccchhhhHHHhh Q lcl|NC_018863. 111 PNIRQKTVQMKFLSDTKQQSLAAGLVN--NIADPMTILTEDAISVIAKSIEWAIFYGDAALAAEADNQAGIEFDGLTKLI 188 (479) Q Consensus 111 ~~~~r~~~~~k~l~~~~~vs~~~~lv~--~~~dp~~~~~~~ai~~~~~~~e~a~f~Gd~~l~~~~~~~~gleFDGl~~~I 188 (479) +.+.+.....+=++---.+|.-+-... ...|.+....++-...+++.++.++|+|+.+-.. .+..+.|....- T Consensus 69 ~~f~~v~l~~~k~~~~~~iS~ell~~~~d~~~~l~~~i~~~l~~aia~~~d~~~l~G~~~~~g-----~~~~~~~~~~~~ 143 (300) T protein:vir:95 69 VSLDPVTIVPLKVEYGARVSDEFLHASEEAKVDMLTDFVEGFSKKLARGLDIMSIHGINPRTK-----QASTIIGDNCFD 143 (300) T ss_pred ccceeeEeeeEEEEEeehhhHHHhccCCCCHHHHHHHHHHHHHHHHHHHHHHhhhhcccCCCC-----CCcccccccccc Confidence 999999999988888888888765443 3567778888899999999999999999754332 233344433322 Q ss_pred ccCCcEEEccCCCCCHHHhhhhhheeecccCceeeeecChHHHhhHHHhhcCceeEEeecCCCccccCccccceecCcee Q lcl|NC_018863. 189 DEATNVIDLKGERLDEATLNKAAVIVGKGYGRATDAFMPIGVQADFTNNLLDRQRVIQPSQAGGFSTGFSINQFLSTRGA 268 (479) Q Consensus 189 ~~~~NviDarG~~l~~~~l~~aa~~i~~~fG~atd~~mp~~vka~f~q~~~~~qrv~~~~n~g~~~~G~~V~~~~ss~g~ 268 (479) ....++....|..+ -+.|.++...+...++.++-..|++.+...+...-...-|-+.++.+.+. T Consensus 144 ~~~~~~~~~~~~~~-~~~i~~~~~~~~~~~~~~~~~vmn~~~~~~L~~lkd~~G~~i~~~~~~~~--------------- 207 (300) T protein:vir:95 144 KKVTQTVPFKDTNP-DESMEDAVGMIDGSERDITGAILDPIFTTALSKMKNAEGGKLYPELAWGG--------------- 207 (300) T ss_pred cccceeecccccch-HHHHHHHHHHhhhcCCCccEEEECHHHHHHHHHhhccCCCeeccCccccC--------------- Confidence 22234444444433 45666666666777888888999999999887765433333322221110 Q ss_pred EEecCCcccCCCccccCcccCCCCCcccceEEEeecccccCccccc---ccceeeEEEEEEEcCCCCcccccceeeeeec Q lcl|NC_018863. 269 INLHGSTIMENDNILVDRIPEPNAPQAPASVVATVKVNDKGAFRPV---KDIKTHSYKVVVHSDDAESLASEAVTAVVAN 345 (479) Q Consensus 269 I~L~~s~v~~a~~~lver~~s~~aP~~P~~vta~~~~~~~g~~~~~---sd~g~Y~YkV~a~n~~GES~~S~~VtaT~a~ 345 (479) .+.++++.+.+..+..+.... .+. +.+. .|.|-.. .......++| +.++++.-+. + -.. T Consensus 208 ---~~~~l~G~Pv~~s~~v~~~~~--~~~-~~~~-----~GDf~~~~~~~~~~~~~~~v---~~~~~~d~~~-~---~~f 269 (300) T protein:vir:95 208 ---VPDAINGLAVDKNRTVSYSQT--DPK-NTAI-----VGDFETMFKWGYAKEVPMEI---IKYGDPDNSG-R---DLK 269 (300) T ss_pred ---CCceecceeeEEecCCCCCCC--CCc-cEEE-----EeeccceEEEEEecccEEEE---eeccCCCCcc-h---hhh Confidence 011222222222222211100 000 0000 0111000 0000111222 1222211110 0 001 Q ss_pred CCCeEEEEEeec-CCccccceEEEEEeccCCCC Q lcl|NC_018863. 346 PTDSVSLAVKLQ-SLYQAKPQFISVYRQGNETG 377 (479) Q Consensus 346 ~~~~V~LtIt~~-~~~~~~~~y~~IYR~t~~~g 377 (479) ..+.+.+.+.-- ...-..|+.+...-..+ | T Consensus 270 ~~~~v~~r~~~r~d~~v~~~~a~~~l~~~~--g 300 (300) T protein:vir:95 270 GYNQIYIRCEAYIGWGIMDAASFARIVKTG--G 300 (300) T ss_pred hcCcEEEEEEEeecceeecccceEEEecCC--C Confidence 122222221111 11112233333332223 2 No 35 >protein:vir:4226 Length: 326 # NCBI annotation: observed 35.2Kd protein # Family: family:all:507 # MgeID: mge:89 # MgeName: L5 # Cross-refs: genbank:acc:NP_039681;swissprot:sw:q05223;genbank:gi:9625447;uniprot:Q05223;genbank:GeneID:2942929 Probab=96.85 E-value=0.00023 Score=40.55 Aligned_cols=300 Identities=12% Similarity=0.031 Sum_probs=132.6 Q ss_pred eeecCchhHHHHHHHHHHHhhcCcccCcccccCccccchhhhHHHHHHHhhccccccchhhhccchhHHHHHHhhhhhcc Q lcl|NC_018863. 12 ARKLPAGAEAELAELVSKSFTTGTGITPDTQHDAAALRRELLDDQVKMLAFTNGDFTIYPLINKQQVNSTVAKYAVFNQH 91 (479) Q Consensus 12 ~~~~~~~~~~~~~e~~~Ksf~ag~~~~~~~~~~gaAlr~esld~~i~~l~~~~~~f~~~~~i~k~~~~stv~~y~~~~~~ 91 (479) ..-+|.-....+-..-.|+++++. .+++.|-.+.+-.+|........ .+.+...+.+..+.-.+|.++. T Consensus 1 ~~~~~~r~~~~~~~~e~~a~~~~~-------~~~g~~ip~~~~~~ii~~~~~~s--~i~~~~~~~~~~~~~~~~p~~~-- 69 (326) T protein:vir:42 1 MAVNPDRTTPFLGVNDPKVAQTGD-------SMFEGYLEPEQAQDYFAEAEKIS--IVQQFAQKIPMGTTGQKIPHWT-- 69 (326) T ss_pred CCCCccchhhhcCcchhhheeccc-------cCCcceechhhHHHHHHHHHhcc--hhhhhcceeeccCCceEEEEEe-- Confidence 111111111121112356666532 22333445555555544433332 2334333333333223343333 Q ss_pred CcccccccccccccccccCcceEEEEEEEEeeeehhhhhhhHhhhcchhhHHHHHHHHHHHHHHHHHHHHHhhcccccCC Q lcl|NC_018863. 92 GRTGHSRFVREVGVASINDPNIRQKTVQMKFLSDTKQQSLAAGLVNNIADPMTILTEDAISVIAKSIEWAIFYGDAALAA 171 (479) Q Consensus 92 G~~g~~~fv~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lv~~~~dp~~~~~~~ai~~~~~~~e~a~f~Gd~~l~~ 171 (479) +.+...|++|++..+.+++.+.+....++=++..-.+|.-+ +.++..|.+....+.-...++..+|.++|+|+.+=.| T Consensus 70 -~~~~a~~v~Eg~~~~~~~~~f~~i~~~~~k~~~~v~iS~el-l~~s~~~~~~~i~~~l~~a~~~~~d~a~l~G~gs~~p 147 (326) T protein:vir:42 70 -GDVSASWIGEGDMKPITKGNMTSQTIAPHKIATIFVASAET-VRANPANYLGTMRTKVATAFAMAFDNAAINGTDSPFP 147 (326) T ss_pred -CCcceEEecCCccccccccceeEEEEeeEEEEEeehhhHHH-HhcCHHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcc Confidence 33456799999999999999999999999999999998854 3456778888889999999999999999999885221 Q ss_pred CCCCcccchhhhHHHhhccCCcEEEccCC-----CCCHH-HhhhhhheeecccCceeeeecChHHHhhHHHhhcCceeEE Q lcl|NC_018863. 172 EADNQAGIEFDGLTKLIDEATNVIDLKGE-----RLDEA-TLNKAAVIVGKGYGRATDAFMPIGVQADFTNNLLDRQRVI 245 (479) Q Consensus 172 ~~~~~~gleFDGl~~~I~~~~NviDarG~-----~l~~~-~l~~aa~~i~~~fG~atd~~mp~~vka~f~q~~~~~qrv~ 245 (479) .|+.+.... .......+. ....+ .+..+.......+...+...|++.+.+.+...-...-|-+ T Consensus 148 ----------~gi~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~n~~~~~~L~~lkd~~G~~l 216 (326) T protein:vir:42 148 ----------TFLAQTTKE-VSLVDPDGTGSNADLTVYDAVAVNALSLLVNAGKKWTHTLLDDITEPILNGAKDKSGRPL 216 (326) T ss_pred ----------ccccccccc-cceeecccccccccchhHHHHHHHHHhhhhhhccCccEEEEeHHHHHHHHHhhccCCcee Confidence 244443322 222222221 11112 1223333344556666678899999999987544322322 Q ss_pred -eecCCCccccCccccceecCceeEEecCCcccCCCccccCcccCCCC-------------CcccceEEE--e----ecc Q lcl|NC_018863. 246 -QPSQAGGFSTGFSINQFLSTRGAINLHGSTIMENDNILVDRIPEPNA-------------PQAPASVVA--T----VKV 305 (479) Q Consensus 246 -~~~n~g~~~~G~~V~~~~ss~g~I~L~~s~v~~a~~~lver~~s~~a-------------P~~P~~vta--~----~~~ 305 (479) ++........ ...+.++.+.+.+..+.....+. -.....+.. . ... T Consensus 217 ~~~~~~~~~~~--------------~~~~~~l~G~pv~~~~~~~~~~~~~~~Gd~s~~~~~~~~~~~v~~~~e~~~~~~~ 282 (326) T protein:vir:42 217 FIESTYTEENS--------------PFRLGRIVARPTILSDHVASGTVVGYQGDFRQLVWGQVGGLSFDVTDQATLNLGT 282 (326) T ss_pred eccccccCccc--------------cccCceeeeeeEEEcCCCCCCceEEEEeecceEEEEEecceEEEEeecceeeecc Confidence 3221111110 11111122222221111100000 000000000 0 000 Q ss_pred cccCcccccccceeeEEEEEEEcCCCCcccccceeeeeecCCCeEEEEEeecCCccc Q lcl|NC_018863. 306 NDKGAFRPVKDIKTHSYKVVVHSDDAESLASEAVTAVVANPTDSVSLAVKLQSLYQA 362 (479) Q Consensus 306 ~~~g~~~~~sd~g~Y~YkV~a~n~~GES~~S~~VtaT~a~~~~~V~LtIt~~~~~~~ 362 (479) ...+.....-.-+...|++...-+.+--.+... +.|+-. ..+.+ T Consensus 283 ~~~~~~~~~~~~d~~~~r~~~~~d~~v~~~~a~-----------~~l~~~--~~~~~ 326 (326) T protein:vir:42 283 PQAPNFVSLWQHNLVAVRVEAEYAFHCNDKDAF-----------VKLTNV--DATEA 326 (326) T ss_pred cccccchhhhhcCcEEEEEEEEeccEEecccce-----------EEEeec--cccCC Confidence 000000000000012222222211111111111 122211 11111 No 36 >protein:vir:78223 Length: 333 # NCBI annotation: Putative major head protein # Family: family:all:966 # MgeID: mge:1849 # MgeName: Bethlehem # Cross-refs: genbank:acc:YP_001491666;genbank:gi:157786490;genbank:GeneID:5625701 Probab=96.83 E-value=0.00031 Score=39.79 Aligned_cols=314 Identities=13% Similarity=0.057 Sum_probs=144.2 Q ss_pred HHHHHHHHHHHhhcCcccCcccccCccccchhhhHHHHHHHhhccccccchhhhccchhHHHHHHhhhhhc-----cCcc Q lcl|NC_018863. 20 EAELAELVSKSFTTGTGITPDTQHDAAALRRELLDDQVKMLAFTNGDFTIYPLINKQQVNSTVAKYAVFNQ-----HGRT 94 (479) Q Consensus 20 ~~~~~e~~~Ksf~ag~~~~~~~~~~gaAlr~esld~~i~~l~~~~~~f~~~~~i~k~~~~stv~~y~~~~~-----~G~~ 94 (479) .+.+-| +.+..+|........+.+++|-.+.+..+|..+...... +.+...+.+..+--.+|.+... |.+- T Consensus 1 ~a~l~e--l~~~~~~~~~~g~~~~~~~~liP~~~~~~ii~~l~~~s~--l~~~~~~~~~~~~~~~~p~~~~~~~a~~v~e 76 (333) T protein:vir:78 1 MATLNE--LLPNSAGSNHQGRLAHVPSDLLPKEIVGPIFDKAQESSL--VLRMGEQIPISYGETIIPTTVKRPEVGQVGV 76 (333) T ss_pred CchhHH--hhhhcccccccCceecCCccccchhHHHHHHHHHHhhch--hhhhcceeeccCCceEEEEEeCCceeEeecC Confidence 555544 344555555555555667778888888888666555442 3444444444443233434332 2233 Q ss_pred cccccccccccccccCcceEEEEEEEEeeeehhhhhhhHhhhcchhhHHHHHHHHHHHHHHHHHHHHHhhcccccCCCCC Q lcl|NC_018863. 95 GHSRFVREVGVASINDPNIRQKTVQMKFLSDTKQQSLAAGLVNNIADPMTILTEDAISVIAKSIEWAIFYGDAALAAEAD 174 (479) Q Consensus 95 g~~~fv~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lv~~~~dp~~~~~~~ai~~~~~~~e~a~f~Gd~~l~~~~~ 174 (479) |...++.|++..+..++.+.+.....+=++.--.+|.-+- .++..|.+....+.-...+++.+|.++|+|+-+..+ T Consensus 77 g~~~~~~e~~~~~~~~~~f~~i~l~~~kl~~~~~is~ell-~~s~~~~~~~i~~~la~ai~~~~d~~~l~G~g~~~~--- 152 (333) T protein:vir:78 77 GTSNEQREGGLKPLSGTAWDTRSVSPIKLATIVTVSEEFA-RMNPSGLYTKLQGDLAYAIGRGIDLAVFHGKSPLTG--- 152 (333) T ss_pred cccccccccccccccccceeEEEEeeEEEEEeehhhHHHH-hcCHHHHHHHHHHHHHHHHHHHHHHHHhcccCCCCC--- Confidence 4556777888889999999999999999998888887332 245678888888999999999999999999987543 Q ss_pred CcccchhhhHHHhhccCC-cEEE--ccCCCCCHHHhhhhh-heeecccCceeeeecChHHHhhHHHhhc--Cc-eeEEee Q lcl|NC_018863. 175 NQAGIEFDGLTKLIDEAT-NVID--LKGERLDEATLNKAA-VIVGKGYGRATDAFMPIGVQADFTNNLL--DR-QRVIQP 247 (479) Q Consensus 175 ~~~gleFDGl~~~I~~~~-NviD--arG~~l~~~~l~~aa-~~i~~~fG~atd~~mp~~vka~f~q~~~--~~-qrv~~~ 247 (479) ..+.|+.+....+. ..++ ..+..+..+.|.++- .+...++..++...|++...+.+.+... +. -+.+.+ T Consensus 153 ----~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~vmn~~~~~~L~~~~~~~d~~G~~i~~ 228 (333) T protein:vir:78 153 ----SALQGIDTDNVIANTTNVDYLQETGDPLLDRLLDGYDLVSANTDVEFNGWAVDPRFRAHLLRAQAYRDANGNVDPS 228 (333) T ss_pred ----cccccccccccccccccccccccccchhHHHHHHHHHhhccccccCceEEEEcchHHHHHHHHhhhcCCCCceeec Confidence 34667655332111 1122 222334444444433 3444556677789999988887765432 11 122222 Q ss_pred cCCCccccCccccceecCceeEEecCCcccCCCccccCcccCCCCCcccceEEEeecccccCcccccccceeeEEEEE-- Q lcl|NC_018863. 248 SQAGGFSTGFSINQFLSTRGAINLHGSTIMENDNILVDRIPEPNAPQAPASVVATVKVNDKGAFRPVKDIKTHSYKVV-- 325 (479) Q Consensus 248 ~n~g~~~~G~~V~~~~ss~g~I~L~~s~v~~a~~~lver~~s~~aP~~P~~vta~~~~~~~g~~~~~sd~g~Y~YkV~-- 325 (479) ....... ..++++.+.+..+..+..... .......-..|. -..|.+.+. T Consensus 229 ~~~~~~~------------------~~~l~G~Pv~~~~~i~~~~~~-----~~~~~~~~~~gD------~~~~~~g~~~~ 279 (333) T protein:vir:78 229 RINLAAQ------------------TGDVLGLPAQFGRAVGGDLGA-----AVDSKTRIIGGD------FSQLKFGFADE 279 (333) T ss_pred CccccCC------------------CceeeceeeEEccccCCCccc-----cCCCccEEEEEe------cccEEEEEeec Confidence 1111000 011111222111111100000 000000000000 000111000 Q ss_pred ---EEcCCCCcccccceeeee-ecCCCeEEEEEee-cCCccccceEEEEEeccCCC Q lcl|NC_018863. 326 ---VHSDDAESLASEAVTAVV-ANPTDSVSLAVKL-QSLYQAKPQFISVYRQGNET 376 (479) Q Consensus 326 ---a~n~~GES~~S~~VtaT~-a~~~~~V~LtIt~-~~~~~~~~~y~~IYR~t~~~ 376 (479) -+++++ .....-.... ....+.+.+.+.- -...-..|+-+........+ T Consensus 280 ~~i~~~~~~--~~~~~~~~~~~~~~~~~v~~r~~~r~d~~v~~~~a~~~l~~~~a~ 333 (333) T protein:vir:78 280 IRIKMSDTA--TLTDSGSATVSMWQTNQIAILIEVTFGWLLGDKQAFVKFVDDEQP 333 (333) T ss_pred cEEEEeccc--cccccccceeehhhcCcEEEEEEEEEccEEecccceEEEeccCCC Confidence 000000 0000000000 0001111111100 00011111112222222212 No 37 >protein:vir:105905 Length: 304 # NCBI annotation: major capsid protein # Family: family:all:507 # MgeID: mge:1514 # MgeName: phiETA3 # Cross-refs: genbank:acc:YP_001004375;genbank:gi:122891830;genbank:GeneID:4712376 Probab=96.82 E-value=0.00032 Score=39.75 Aligned_cols=301 Identities=10% Similarity=0.019 Sum_probs=151.9 Q ss_pred hhcCc--ccCcccccCccccchhhhHHHHHHHhhccccccchhhhccchhHHHHHHhhhhhccCcccccccccccccccc Q lcl|NC_018863. 31 FTTGT--GITPDTQHDAAALRRELLDDQVKMLAFTNGDFTIYPLINKQQVNSTVAKYAVFNQHGRTGHSRFVREVGVASI 108 (479) Q Consensus 31 f~ag~--~~~~~~~~~gaAlr~esld~~i~~l~~~~~~f~~~~~i~k~~~~stv~~y~~~~~~G~~g~~~fv~E~g~~~~ 108 (479) |..+. ..+..+-.+|++|-.+.+.+++........ .+++.+.+.+..+-..+|.++ .+.....+++|++..+. T Consensus 1 ma~~~~~~~~~~~t~~gg~lip~~~~~~ii~~~~~~~--~l~~~~~~~~~~~~~~~ip~~---~~~~~a~~v~E~~~~~~ 75 (304) T protein:vir:10 1 MATPTYTPGNVILSDFKNGVIPAEQGTLIMKDIMANS--AIMKLAKNEPMTAQKKKFTYL---AKGVGAYWVSETERIQT 75 (304) T ss_pred CcccccccccccccCCCceecchhHHHHHHHHHHhcc--chhhhcceeeccCCceEEEEE---eCCcceEEeecCccccc Confidence 33222 111122245667888888777755554443 345555555555433334333 33445679999999999 Q ss_pred cCcceEEEEEEEEeeeehhhhhhhHhhhcchhhHHHHHHHHHHHHHHHHHHHHHhhcccccCCCCCCcccchhhhHHHhh Q lcl|NC_018863. 109 NDPNIRQKTVQMKFLSDTKQQSLAAGLVNNIADPMTILTEDAISVIAKSIEWAIFYGDAALAAEADNQAGIEFDGLTKLI 188 (479) Q Consensus 109 ~d~~~~r~~~~~k~l~~~~~vs~~~~lv~~~~dp~~~~~~~ai~~~~~~~e~a~f~Gd~~l~~~~~~~~gleFDGl~~~I 188 (479) .++.+.......+=++.--.+|.-+ +.++..|.+....++-...+++.+|.++|+|+.+-.+. |..-+|+..-. T Consensus 76 ~~~~~~~i~~~~~k~~~~~~iS~el-l~ds~~~l~~~i~~~l~~~ia~~~d~~~l~G~g~~~~~-----~~~~~~~~~~~ 149 (304) T protein:vir:10 76 SKPEYAQAEMEAKKIGVIIPLSKEF-LKWTAKDFFNEVKPLIAEAFYKAFDQAVIFGTKSPYNT-----STSGKPLVEGA 149 (304) T ss_pred ccceeeEEEEEEEEEEEeehhhHHH-HhcchHHHHHHHHHHHHHHHHHHHHhhheeccCCCccc-----ccccccccccc Confidence 9999999999999999888888754 44566788888888888999999999999999764332 22234444443 Q ss_pred ccCCcEEEccCCCCCHHHhhhhhheeecccCceeeeecChHHHhhHHHhhcCceeEEeecCCCccccCccccceecCcee Q lcl|NC_018863. 189 DEATNVIDLKGERLDEATLNKAAVIVGKGYGRATDAFMPIGVQADFTNNLLDRQRVIQPSQAGGFSTGFSINQFLSTRGA 268 (479) Q Consensus 189 ~~~~NviDarG~~l~~~~l~~aa~~i~~~fG~atd~~mp~~vka~f~q~~~~~qrv~~~~n~g~~~~G~~V~~~~ss~g~ 268 (479) . .......+.-..-+.|.++.-.+..++....-..|++.+.+.+...-...-|.+...++ T Consensus 150 ~--~~~~~~~~~~~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~L~~lkd~~G~~l~~~~~------------------ 209 (304) T protein:vir:10 150 E--EKGNVVTDTNNLYVDLSALMATIEDEELDPNGVLTTRSFRSKMRNALDANDRPLFDANG------------------ 209 (304) T ss_pred c--ccccccccccchHHHHHHHHHHhhhccCCcCEEEEcHHHHHHHHHhhccCCcEeecCCC------------------ Confidence 3 22334444455566666666666677777778999999999998654332232221111 Q ss_pred EEecCCcccCCCccccCcccCCCCCcccceEEEeecccccCcccccccceeeEEEEEEEcCCCCcc-cccceeeeeecCC Q lcl|NC_018863. 269 INLHGSTIMENDNILVDRIPEPNAPQAPASVVATVKVNDKGAFRPVKDIKTHSYKVVVHSDDAESL-ASEAVTAVVANPT 347 (479) Q Consensus 269 I~L~~s~v~~a~~~lver~~s~~aP~~P~~vta~~~~~~~g~~~~~sd~g~Y~YkV~a~n~~GES~-~S~~VtaT~a~~~ 347 (479) .++++.+.+..+..+. ...++..+ .|.+++.+.. -..+-+. .+... T Consensus 210 -----~~l~G~PV~~~~~~~~---------------~~~~~~~~----~gd~~~~~~~-~~~~~~i~~~~e~-------- 256 (304) T protein:vir:10 210 -----NEIMGLPLSYTGADVY---------------DKKKSLAL----MGDWDYARYG-ILQGIEYAISEDA-------- 256 (304) T ss_pred -----ccccceeeEEeccccc---------------CCCCcEEE----EEehhhEEEE-EecceEEEEeecc-------- Confidence 1222222222221111 00111111 1334432221 1111100 00000 Q ss_pred CeEEEEEeecCCccccceEEEEEeccCCCCcEEEEEEeeeeeccCCCeeEEEecc Q lcl|NC_018863. 348 DSVSLAVKLQSLYQAKPQFISVYRQGNETGHYFLVARVPLSKADENGVITFVDRN 402 (479) Q Consensus 348 ~~V~LtIt~~~~~~~~~~y~~IYR~t~~~g~~~~i~rV~~s~~n~~~tttf~D~N 402 (479) .+.. ....-..+ ..++-|.+.. -.|+.+.|+...-.+...-..-+++. T Consensus 257 -~~~~-~~~~~~~g---~~~~~f~~~~--~~~r~~~r~~~~v~~~~a~~~l~~a~ 304 (304) T protein:vir:10 257 -TLTT-LQASDASG---QPVSLFERDM--FALRATMHIAYMNVKPEAFATLKPTE 304 (304) T ss_pred -eeee-ecccccCc---cchhhhhcCc--EEEEEEEEeccEeecccceEEEEecC Confidence 0000 00000000 0111111111 22333344433322333333333333 No 38 >protein:vir:94142 Length: 304 # NCBI annotation: ORF013 # Family: family:all:507 # MgeID: mge:1494 # MgeName: 96 # Cross-refs: genbank:acc:YP_240234;genbank:gi:66395898;genbank:GeneID:5133311 Probab=96.82 E-value=0.00032 Score=39.75 Aligned_cols=301 Identities=10% Similarity=0.019 Sum_probs=151.9 Q ss_pred hhcCc--ccCcccccCccccchhhhHHHHHHHhhccccccchhhhccchhHHHHHHhhhhhccCcccccccccccccccc Q lcl|NC_018863. 31 FTTGT--GITPDTQHDAAALRRELLDDQVKMLAFTNGDFTIYPLINKQQVNSTVAKYAVFNQHGRTGHSRFVREVGVASI 108 (479) Q Consensus 31 f~ag~--~~~~~~~~~gaAlr~esld~~i~~l~~~~~~f~~~~~i~k~~~~stv~~y~~~~~~G~~g~~~fv~E~g~~~~ 108 (479) |..+. ..+..+-.+|++|-.+.+.+++........ .+++.+.+.+..+-..+|.++ .+.....+++|++..+. T Consensus 1 ma~~~~~~~~~~~t~~gg~lip~~~~~~ii~~~~~~~--~l~~~~~~~~~~~~~~~ip~~---~~~~~a~~v~E~~~~~~ 75 (304) T protein:vir:94 1 MATPTYTPGNVILSDFKNGVIPAEQGTLIMKDIMANS--AIMKLAKNEPMTAQKKKFTYL---AKGVGAYWVSETERIQT 75 (304) T ss_pred CcccccccccccccCCCceecchhHHHHHHHHHHhcc--chhhhcceeeccCCceEEEEE---eCCcceEEeecCccccc Confidence 33222 111122245667888888777755554443 345555555555433334333 33445679999999999 Q ss_pred cCcceEEEEEEEEeeeehhhhhhhHhhhcchhhHHHHHHHHHHHHHHHHHHHHHhhcccccCCCCCCcccchhhhHHHhh Q lcl|NC_018863. 109 NDPNIRQKTVQMKFLSDTKQQSLAAGLVNNIADPMTILTEDAISVIAKSIEWAIFYGDAALAAEADNQAGIEFDGLTKLI 188 (479) Q Consensus 109 ~d~~~~r~~~~~k~l~~~~~vs~~~~lv~~~~dp~~~~~~~ai~~~~~~~e~a~f~Gd~~l~~~~~~~~gleFDGl~~~I 188 (479) .++.+.......+=++.--.+|.-+ +.++..|.+....++-...+++.+|.++|+|+.+-.+. |..-+|+..-. T Consensus 76 ~~~~~~~i~~~~~k~~~~~~iS~el-l~ds~~~l~~~i~~~l~~~ia~~~d~~~l~G~g~~~~~-----~~~~~~~~~~~ 149 (304) T protein:vir:94 76 SKPEYAQAEMEAKKIGVIIPLSKEF-LKWTAKDFFNEVKPLIAEAFYKAFDQAVIFGTKSPYNT-----STSGKPLVEGA 149 (304) T ss_pred ccceeeEEEEEEEEEEEeehhhHHH-HhcchHHHHHHHHHHHHHHHHHHHHhhheeccCCCccc-----ccccccccccc Confidence 9999999999999999888888754 44566788888888888999999999999999764332 22234444443 Q ss_pred ccCCcEEEccCCCCCHHHhhhhhheeecccCceeeeecChHHHhhHHHhhcCceeEEeecCCCccccCccccceecCcee Q lcl|NC_018863. 189 DEATNVIDLKGERLDEATLNKAAVIVGKGYGRATDAFMPIGVQADFTNNLLDRQRVIQPSQAGGFSTGFSINQFLSTRGA 268 (479) Q Consensus 189 ~~~~NviDarG~~l~~~~l~~aa~~i~~~fG~atd~~mp~~vka~f~q~~~~~qrv~~~~n~g~~~~G~~V~~~~ss~g~ 268 (479) . .......+.-..-+.|.++.-.+..++....-..|++.+.+.+...-...-|.+...++ T Consensus 150 ~--~~~~~~~~~~~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~L~~lkd~~G~~l~~~~~------------------ 209 (304) T protein:vir:94 150 E--EKGNVVTDTNNLYVDLSALMATIEDEELDPNGVLTTRSFRSKMRNALDANDRPLFDANG------------------ 209 (304) T ss_pred c--ccccccccccchHHHHHHHHHHhhhccCCcCEEEEcHHHHHHHHHhhccCCcEeecCCC------------------ Confidence 3 22334444455566666666666677777778999999999998654332232221111 Q ss_pred EEecCCcccCCCccccCcccCCCCCcccceEEEeecccccCcccccccceeeEEEEEEEcCCCCcc-cccceeeeeecCC Q lcl|NC_018863. 269 INLHGSTIMENDNILVDRIPEPNAPQAPASVVATVKVNDKGAFRPVKDIKTHSYKVVVHSDDAESL-ASEAVTAVVANPT 347 (479) Q Consensus 269 I~L~~s~v~~a~~~lver~~s~~aP~~P~~vta~~~~~~~g~~~~~sd~g~Y~YkV~a~n~~GES~-~S~~VtaT~a~~~ 347 (479) .++++.+.+..+..+. ...++..+ .|.+++.+.. -..+-+. .+... T Consensus 210 -----~~l~G~PV~~~~~~~~---------------~~~~~~~~----~gd~~~~~~~-~~~~~~i~~~~e~-------- 256 (304) T protein:vir:94 210 -----NEIMGLPLSYTGADVY---------------DKKKSLAL----MGDWDYARYG-ILQGIEYAISEDA-------- 256 (304) T ss_pred -----ccccceeeEEeccccc---------------CCCCcEEE----EEehhhEEEE-EecceEEEEeecc-------- Confidence 1222222222221111 00111111 1334432221 1111100 00000 Q ss_pred CeEEEEEeecCCccccceEEEEEeccCCCCcEEEEEEeeeeeccCCCeeEEEecc Q lcl|NC_018863. 348 DSVSLAVKLQSLYQAKPQFISVYRQGNETGHYFLVARVPLSKADENGVITFVDRN 402 (479) Q Consensus 348 ~~V~LtIt~~~~~~~~~~y~~IYR~t~~~g~~~~i~rV~~s~~n~~~tttf~D~N 402 (479) .+.. ....-..+ ..++-|.+.. -.|+.+.|+...-.+...-..-+++. T Consensus 257 -~~~~-~~~~~~~g---~~~~~f~~~~--~~~r~~~r~~~~v~~~~a~~~l~~a~ 304 (304) T protein:vir:94 257 -TLTT-LQASDASG---QPVSLFERDM--FALRATMHIAYMNVKPEAFATLKPTE 304 (304) T ss_pred -eeee-ecccccCc---cchhhhhcCc--EEEEEEEEeccEeecccceEEEEecC Confidence 0000 00000000 0111111111 22333344433322333333333333 No 39 >protein:vir:81160 Length: 371 # NCBI annotation: major capsid protein # Family: family:all:21 # MgeID: mge:1892 # MgeName: Geobacillus virus E2 # Cross-refs: genbank:acc:YP_001285811;genbank:gi:148747732;genbank:GeneID:5247203 Probab=96.80 E-value=0.00033 Score=39.66 Aligned_cols=312 Identities=11% Similarity=0.056 Sum_probs=139.1 Q ss_pred CcccccccceeeeecCchhHHHHHHHHHHHhhcCc--ccCcccccCccccchhhhHHHHHHHhhccccccchhhhccchh Q lcl|NC_018863. 1 MTELQKEQKVEARKLPAGAEAELAELVSKSFTTGT--GITPDTQHDAAALRRELLDDQVKMLAFTNGDFTIYPLINKQQV 78 (479) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~e~~~Ksf~ag~--~~~~~~~~~gaAlr~esld~~i~~l~~~~~~f~~~~~i~k~~~ 78 (479) ..+..+..+-.... .+.+.+--..|.|.+..|. ..+-.+-.+|+.+..+.+.++|..+..... .+++.+...++ T Consensus 55 ~~~~~~~~~~~~~~--~~~~~~~~~~~~~~l~~~~~~a~~~~t~~~gg~~vP~~~~~~ii~~~~~~s--~i~~~~~~~~~ 130 (371) T protein:vir:81 55 QKQTIEDKEPLKPT--VQVKENEVEAFVNHIRTRFRNAMSEGSNQDGGYTVPQDIQTRINELRESKD--ALQNLITVEPV 130 (371) T ss_pred HHHhhccccccccc--hhhHHHHHHHHHHHHHHHHHHhhccCCCccCceeecHhHHHHHHHHHHhhh--hhhhhceeeec Confidence 00000000000000 0011111122333322221 011112356788888888888865555444 35555555556 Q ss_pred HHHHHHhhhhhccCcccccccccccccc-cccCcceEEEEEEEEeeeehhhhhhhHhhhcchhhHHHHHHHHHHHHHHHH Q lcl|NC_018863. 79 NSTVAKYAVFNQHGRTGHSRFVREVGVA-SINDPNIRQKTVQMKFLSDTKQQSLAAGLVNNIADPMTILTEDAISVIAKS 157 (479) Q Consensus 79 ~stv~~y~~~~~~G~~g~~~fv~E~g~~-~~~d~~~~r~~~~~k~l~~~~~vs~~~~lv~~~~dp~~~~~~~ai~~~~~~ 157 (479) .+...+|...... ..+...+++|++.. +.+++.+.+.+...+-++....+|.-+ +.++.-|.+....+.-...++.. T Consensus 131 ~~~~~~~~~~~~~-~~~~a~~v~Eg~~~~~~~~~~f~~i~~~~~k~~~~~~iS~el-l~ds~~~l~~~i~~~l~~a~~~~ 208 (371) T protein:vir:81 131 TTLSGSRVFKKRS-QQTGFVEVAEGAAIGEKATPQFTLLQYQVKKYAGFFRVTNEL-LNDSTEAIVNTLVRWIGDESRVT 208 (371) T ss_pred cCCceeEEEEeec-CCcceeeeccccccccccccceeeEEeeeeEEEEeehhhHHH-HhhhhHHHHHHHHHHHHHHHHHH Confidence 5544455333333 33456789999875 578999999999999999988888765 34555677788888888899999 Q ss_pred HHHHHhhcccccCCCCCCcccchhhhHHHhhccCCcEEEccCCCCCHHHhhhhhheeecccCceeeeecChHHHhhHHHh Q lcl|NC_018863. 158 IEWAIFYGDAALAAEADNQAGIEFDGLTKLIDEATNVIDLKGERLDEATLNKAAVIVGKGYGRATDAFMPIGVQADFTNN 237 (479) Q Consensus 158 ~e~a~f~Gd~~l~~~~~~~~gleFDGl~~~I~~~~NviDarG~~l~~~~l~~aa~~i~~~fG~atd~~mp~~vka~f~q~ 237 (479) ++.+++.|+....+.+ .+-.|++...+.. .....|....-.+|++.+.+.+... T Consensus 209 ~~~~i~~g~g~~~~~~----~~~~~~i~~~~~~----------------------~l~~~~~~~a~~vmn~~~~~~L~~l 262 (371) T protein:vir:81 209 RNGLIINVLNTKAKTA----IADLDGLKQIINV----------------------QLDPVFRSTSSVIVNQDAFNWLDTL 262 (371) T ss_pred HHHHHHhhcccccccc----cccHHHHHHHHHh----------------------hcchhhhcCCEEEEcHHHHHHHHHh Confidence 9999999998766533 3456666655531 1111222223478999998888765 Q ss_pred hcCceeEEeecCCCccccCccccceecCceeEEecCCcccCCCccccCcccCCCCCcccceEEEeecccccCcccccccc Q lcl|NC_018863. 238 LLDRQRVIQPSQAGGFSTGFSINQFLSTRGAINLHGSTIMENDNILVDRIPEPNAPQAPASVVATVKVNDKGAFRPVKDI 317 (479) Q Consensus 238 ~~~~qrv~~~~n~g~~~~G~~V~~~~ss~g~I~L~~s~v~~a~~~lver~~s~~aP~~P~~vta~~~~~~~g~~~~~sd~ 317 (479) -...-|-+...+..+ -.+.++++.+.+..+..+. +....... ....+.++ . T Consensus 263 kd~~g~~l~~~~~~~------------------~~~~~l~G~pV~~~~~~~~------~~~~~~~~-~~~~~~i~----~ 313 (371) T protein:vir:81 263 KDQNGQYLLQPSISS------------------PTGRQLLGLPVVIVSNKVL------ANRVDGGT-GAQFAPII----V 313 (371) T ss_pred hccCCCeeeecccCC------------------CCCceecceeEEEeccccc------Cccccccc-cCCcceEE----E Confidence 433222222221111 1112223333333332111 00000000 11111111 1 Q ss_pred eeeEEEEEEEcCCCCcccccceeeeeecCCCeEEEEEeecCCccccceEEEEEeccC--CCCcEEEEEEeeee Q lcl|NC_018863. 318 KTHSYKVVVHSDDAESLASEAVTAVVANPTDSVSLAVKLQSLYQAKPQFISVYRQGN--ETGHYFLVARVPLS 388 (479) Q Consensus 318 g~Y~YkV~a~n~~GES~~S~~VtaT~a~~~~~V~LtIt~~~~~~~~~~y~~IYR~t~--~~g~~~~i~rV~~s 388 (479) |.++.-+....+.|-+..... ........+.+. |+..+|=.- -....+...++..+ T Consensus 314 Gd~~~~~~~~~~~~~~i~~~~-~~~~~f~~~~v~--------------~~~~~r~d~~~~~~~a~~~~~~~~A 371 (371) T protein:vir:81 314 GDLKEAVVMFDRQRTEIMSSN-VAMDAFETDATL--------------WRAIERMDVKMRDDEAFVFGEVQLA 371 (371) T ss_pred EehhceEEEEeecceEEEEec-cccchhhcCceE--------------EEEEEeeccEEecccceEEEEEecC Confidence 222211111111111110000 000000011222 222222210 00111222222222 No 40 >protein:vir:95318 Length: 328 # NCBI annotation: hypothetical protein # Family: family:all:1903 # MgeID: mge:1564 # MgeName: phiV10 # Cross-refs: genbank:acc:YP_512264;genbank:gi:89152431;genbank:GeneID:3952987 Probab=96.79 E-value=2.4e-05 Score=45.93 Aligned_cols=279 Identities=16% Similarity=0.146 Sum_probs=141.1 Q ss_pred CcccccccceeeeecCchhHHHHHHHHHHHhhcCcccCcccccCccccchhhhHH-HHHHHhhccccccchhhhccchhH Q lcl|NC_018863. 1 MTELQKEQKVEARKLPAGAEAELAELVSKSFTTGTGITPDTQHDAAALRRELLDD-QVKMLAFTNGDFTIYPLINKQQVN 79 (479) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~e~~~Ksf~ag~~~~~~~~~~gaAlr~esld~-~i~~l~~~~~~f~~~~~i~k~~~~ 79 (479) ||+| .+.+.-+.++ +|=+.. ..+.. -|..|+.++. +|..++=..++ T Consensus 1 m~~~---------~~~~~TL~e~----Akr~~~-----------------d~~~~~VIE~l~~~n~---IL~~lpf~e~n 47 (328) T protein:vir:95 1 MAVK---------GLTALTLADW----GKRVDP-----------------NGKVDKIIELLGQTNP---ILQDMPFVEGN 47 (328) T ss_pred CCcc---------ccccccHHHH----HhhhCc-----------------chhHHHHHHHHhccch---hHhhcceeecc Confidence 5554 2344444443 221111 11111 2222333332 34445444444 Q ss_pred -HHHHHhhhhhccCcccccccccccccccccCcceEEEEEEEEeeeehhhhhhhHhhhcc-hhhHHHHHHHHHHHHHHHH Q lcl|NC_018863. 80 -STVAKYAVFNQHGRTGHSRFVREVGVASINDPNIRQKTVQMKFLSDTKQQSLAAGLVNN-IADPMTILTEDAISVIAKS 157 (479) Q Consensus 80 -stv~~y~~~~~~G~~g~~~fv~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lv~~-~~dp~~~~~~~ai~~~~~~ 157 (479) .|=|.|+ .+.+.-...|..=....+-+.++..|++..++-|..-.+|.+...-.++ ..+-+++|.+.-|..+.+. T Consensus 48 ~gt~~~~~---v~~~LP~~~fR~lN~g~~~s~~tt~q~t~~l~ilgg~~eVDr~la~~~Gn~~~~ra~q~~~~~ka~~~~ 124 (328) T protein:vir:95 48 LPTGHRTT---IRSGLPSATWRLLNYGVQPSKSTTVQVTDSVGMLETYAEVDKSLADLNGNTAEFRLSEDRAFIEAMNQQ 124 (328) T ss_pred cCCcceee---EeeccCCceeeecCCccCcccceeEEEEEEEEEEecceeechHHHhhcCCHHHHHHHHHHHHHHHHHHH Confidence 3446554 4444444445332334556778999999999999999999987665544 6677899999999999999 Q ss_pred HHHHHhhcccccCCCCCCcccchhhhHHHhhcc-----CCcEEEccCCCCCHHHhhhhhheeecccCceeeeecChHHHh Q lcl|NC_018863. 158 IEWAIFYGDAALAAEADNQAGIEFDGLTKLIDE-----ATNVIDLKGERLDEATLNKAAVIVGKGYGRATDAFMPIGVQA 232 (479) Q Consensus 158 ~e~a~f~Gd~~l~~~~~~~~gleFDGl~~~I~~-----~~NviDarG~~l~~~~l~~aa~~i~~~fG~atd~~mp~~vka 232 (479) ++..+||||+..+| -+||||.+.+.. ++|+||+.|.--+.-.|+ ++.=+=....=+| |-+-++ T Consensus 125 ~~~~~iyGdsa~~p-------~~F~GL~~R~~~~s~~~a~qiidaGgtg~~~TSi~----~v~~g~~~~~giy-PkG~~~ 192 (328) T protein:vir:95 125 MAQTLFYGDSSVNP-------QQFMGLSSRYSSLSAGNAQNIIDAGGTGTDNTSIW----LVVWGENTVHGIF-PKGKKA 192 (328) T ss_pred HHHHHhcCCccCCh-------hhhcchhhhcCccccccccceeecccCCCCceEEE----EEEEcCCeEEEec-cccccc Confidence 99999999999887 379999998842 469999998543332221 2211112333355 888888 Q ss_pred hHHHhhcCceeEEeecCCCc-ccc--------CccccceecCce--eEE---ecC-------------------CcccCC Q lcl|NC_018863. 233 DFTNNLLDRQRVIQPSQAGG-FST--------GFSINQFLSTRG--AIN---LHG-------------------STIMEN 279 (479) Q Consensus 233 ~f~q~~~~~qrv~~~~n~g~-~~~--------G~~V~~~~ss~g--~I~---L~~-------------------s~v~~a 279 (479) -++-.-++.+.+...+ .+- ..+ |.-|..+.++-= +|+ |.. +..++. T Consensus 193 Gl~~~d~g~~~~~~~~-g~~y~~y~~~~~w~~Gl~i~d~r~vvrI~NId~~~l~~~~~~~~l~~lm~~a~~~ip~~~~~~ 271 (328) T protein:vir:95 193 GIQMEDKGQVTLEDAN-GGKYEGYRTHYKWDNGLALRDWRYVVRIANIDVSNLSEPSSAANIAKLMVKALHRIPNRGMGR 271 (328) T ss_pred CceeeecCceeeecCC-CCeeeEEEEEEEeeeeeEEcCcccEEEEecCcccccccccChhhHHHHHHHHHHHhccCCCCc Confidence 8777777777776332 221 112 222222222100 110 000 011111 Q ss_pred CccccCc-------ccCCCCCcccceEEEeecccccCcccccccceeeEEEEE-EEcCCCCcccc Q lcl|NC_018863. 280 DNILVDR-------IPEPNAPQAPASVVATVKVNDKGAFRPVKDIKTHSYKVV-VHSDDAESLAS 336 (479) Q Consensus 280 ~~~lver-------~~s~~aP~~P~~vta~~~~~~~g~~~~~sd~g~Y~YkV~-a~n~~GES~~S 336 (479) +.++.-| .... -+. -+........|+..+.- -|.= -+.| ++. ..|++-- T Consensus 272 ~~~y~n~~v~~~L~~q~~----~~~-n~~~~~~~~~g~~~t~~-~gip-ir~~dai~-~tE~~vv 328 (328) T protein:vir:95 272 PVFYMNRTVGQALDLQSL----EKT-SLAISVKETEGEWWTSF-RGVP-IRETDALL-ETEARVV 328 (328) T ss_pred ceeehhHHHHHHHHHHHh----cCc-ceeeeeeccCCcceeEE-CCeE-EEEEeeee-cCccccC Confidence 1111111 0000 011 11122233334433221 1211 2222 322 3444444 No 41 >protein:vir:191 Length: 385 # NCBI annotation: major head subunit precursor # Family: family:all:585 # MgeID: mge:6 # MgeName: HK97 # Cross-refs: genbank:acc:NP_037701;genbank:gi:9634158;genbank:GeneID:1262530 Probab=96.78 E-value=0.00021 Score=40.77 Aligned_cols=311 Identities=14% Similarity=0.096 Sum_probs=146.3 Q ss_pred CcccccccceeeeecCchh---HHHHHHHHHHHhhcCc----------ccCcccccCccccchhhhHHHHHHHhhccccc Q lcl|NC_018863. 1 MTELQKEQKVEARKLPAGA---EAELAELVSKSFTTGT----------GITPDTQHDAAALRRELLDDQVKMLAFTNGDF 67 (479) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~---~~~~~e~~~Ksf~ag~----------~~~~~~~~~gaAlr~esld~~i~~l~~~~~~f 67 (479) +.++.++. ......+... .....+.+.|.+...- .....+..+|..+..+ +...|..+.... . T Consensus 57 ~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~i~~~-~~~~ii~~~~~~--~ 132 (385) T protein:vir:19 57 LFDLEQKL-ASGAENPGEKKSFSERAAEELIKSWDGKQGTFGAKTFNKSLGSDADSAGSLIQPM-QIPGIIMPGLRR--L 132 (385) T ss_pred HHHHHHHh-hccccccchhhhhHHHHHHHHHHHHHHhhccchhhHHHhhhccccccCCceecch-hhhHHHHHhhhc--c Confidence 11111100 1111111111 1112233445443211 1111222334444444 555564444333 3 Q ss_pred cchhhhccchhHHHHHHhhhhhccCcccccccccccccccccCcceEEEEEEEEeeeehhhhhhhHhhhcchhhHHHHHH Q lcl|NC_018863. 68 TIYPLINKQQVNSTVAKYAVFNQHGRTGHSRFVREVGVASINDPNIRQKTVQMKFLSDTKQQSLAAGLVNNIADPMTILT 147 (479) Q Consensus 68 ~~~~~i~k~~~~stv~~y~~~~~~G~~g~~~fv~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lv~~~~dp~~~~~ 147 (479) .+++.++..++.+.-.+|.+. -+..+...+++|++..+..++.+.+....++=++....+|.- +.+...+.+.... T Consensus 133 ~l~~~~~~~~~~~~~~~~~~~--~~~~~~a~~v~E~~~~~~~~~~~~~~~~~~~k~~~~~~is~e--ll~d~~~l~~~i~ 208 (385) T protein:vir:19 133 TIRDLLAQGRTSSNALEYVRE--EVFTNNADVVAEKALKPESDITFSKQTANVKTIAHWVQASRQ--VMDDAPMLQSYIN 208 (385) T ss_pred chhhhcceecccCcceEEEEE--ecCCcceeeeccCccccccccceeEEEEeeeeEEEeehhhHH--HHhhHHHHHHHHH Confidence 456656655554432334332 233345678999999999999999999999999998888875 3333356778888 Q ss_pred HHHHHHHHHHHHHHHhhcccccCCCCCCcccchhhhHHHhhccCCcEEEccCCCCCHHHhhhhhheeecccCceeeeecC Q lcl|NC_018863. 148 EDAISVIAKSIEWAIFYGDAALAAEADNQAGIEFDGLTKLIDEATNVIDLKGERLDEATLNKAAVIVGKGYGRATDAFMP 227 (479) Q Consensus 148 ~~ai~~~~~~~e~a~f~Gd~~l~~~~~~~~gleFDGl~~~I~~~~NviDarG~~l~~~~l~~aa~~i~~~fG~atd~~mp 227 (479) +.-...+...++.++++|+-. |-.+.||.+............ .....+.|.++...+...++..+-++|| T Consensus 209 ~~la~a~~~~~d~~~l~G~g~---------~~~~~Gi~~~~~~~~~~~~~~-~~~~~d~i~~~~~~l~~~~~~~~~~~~~ 278 (385) T protein:vir:19 209 NRLMYGLALKEEGQLLNGDGT---------GDNLEGLNKVATAYDTSLNAT-GDTRADIIAHAIYQVTESEFSASGIVLN 278 (385) T ss_pred HHHHHHHHHHHHHHHHhccCC---------CCccccccccccccccccccc-ccchHHHHHHHHHhhccccCCCCEEEEc Confidence 888899999999999999744 224678776653212222222 2345666777666677888889999999 Q ss_pred hHHHhhHHHhhcCceeEEeec-CCC--ccccCccccceecCceeEEecCCcccCCC--ccccCcccCCCCCcccceEEEe Q lcl|NC_018863. 228 IGVQADFTNNLLDRQRVIQPS-QAG--GFSTGFSINQFLSTRGAINLHGSTIMEND--NILVDRIPEPNAPQAPASVVAT 302 (479) Q Consensus 228 ~~vka~f~q~~~~~qrv~~~~-n~g--~~~~G~~V~~~~ss~g~I~L~~s~v~~a~--~~lver~~s~~aP~~P~~vta~ 302 (479) +.+.+.+...-...-|.+.+. ..+ +.-.|.+|- .+..-+ .+..+..+. .+++. ...+. ++ . T Consensus 279 ~~~~~~l~~lkd~~G~~l~~~~~~~~~~~l~G~pV~--~~~~~p---~~~~~~gd~~~~~~~~---~~~~~-----~v-~ 344 (385) T protein:vir:19 279 PRDWHNIALLKDNEGRYIFGGPQAFTSNIMWGLPVV--PTKAQA---AGTFTVGGFDMASQVW---DRMDA-----TV-E 344 (385) T ss_pred HHHHHHHHHhhcCCCceeccCcccCCCceecceeeE--EcCcCC---CCcEEEeecccEEEEE---Eecce-----EE-E Confidence 999998877654333334322 111 111232221 111000 001111110 01110 00000 00 0 Q ss_pred ecccccCcccccccceeeEEEEEEEcCCCCcccccceeeeeecCCCeEEEEEeecC Q lcl|NC_018863. 303 VKVNDKGAFRPVKDIKTHSYKVVVHSDDAESLASEAVTAVVANPTDSVSLAVKLQS 358 (479) Q Consensus 303 ~~~~~~g~~~~~sd~g~Y~YkV~a~n~~GES~~S~~VtaT~a~~~~~V~LtIt~~~ 358 (479) .... .+..|. . +...|++...=+..--.+...+ .|+++.+. T Consensus 345 ~~~~-~~~~~~-~--~~~~~~~~~r~~~~v~~~~a~~-----------~~~~~aa~ 385 (385) T protein:vir:19 345 VSRE-DRDNFV-K--NMLTILCEERLALAHYRPTAII-----------KGTFSSGS 385 (385) T ss_pred Eecc-ccchhh-c--CcEEEEEEEeeccEEecccceE-----------EEEeccCC Confidence 0000 000011 1 1123333211111111122222 23333221 No 42 >protein:vir:1886 Length: 385 # NCBI annotation: major capsid subunit precursor # Family: family:all:585 # MgeID: mge:41 # MgeName: HK022 # Cross-refs: genbank:acc:NP_037666;genbank:gi:9634124;genbank:GeneID:1262513 Probab=96.78 E-value=0.00021 Score=40.77 Aligned_cols=311 Identities=14% Similarity=0.096 Sum_probs=146.3 Q ss_pred CcccccccceeeeecCchh---HHHHHHHHHHHhhcCc----------ccCcccccCccccchhhhHHHHHHHhhccccc Q lcl|NC_018863. 1 MTELQKEQKVEARKLPAGA---EAELAELVSKSFTTGT----------GITPDTQHDAAALRRELLDDQVKMLAFTNGDF 67 (479) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~---~~~~~e~~~Ksf~ag~----------~~~~~~~~~gaAlr~esld~~i~~l~~~~~~f 67 (479) +.++.++. ......+... .....+.+.|.+...- .....+..+|..+..+ +...|..+.... . T Consensus 57 ~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~i~~~-~~~~ii~~~~~~--~ 132 (385) T protein:vir:18 57 LFDLEQKL-ASGAENPGEKKSFSERAAEELIKSWDGKQGTFGAKTFNKSLGSDADSAGSLIQPM-QIPGIIMPGLRR--L 132 (385) T ss_pred HHHHHHHh-hccccccchhhhhHHHHHHHHHHHHHHhhccchhhHHHhhhccccccCCceecch-hhhHHHHHhhhc--c Confidence 11111100 1111111111 1112233445443211 1111222334444444 555564444333 3 Q ss_pred cchhhhccchhHHHHHHhhhhhccCcccccccccccccccccCcceEEEEEEEEeeeehhhhhhhHhhhcchhhHHHHHH Q lcl|NC_018863. 68 TIYPLINKQQVNSTVAKYAVFNQHGRTGHSRFVREVGVASINDPNIRQKTVQMKFLSDTKQQSLAAGLVNNIADPMTILT 147 (479) Q Consensus 68 ~~~~~i~k~~~~stv~~y~~~~~~G~~g~~~fv~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lv~~~~dp~~~~~ 147 (479) .+++.++..++.+.-.+|.+. -+..+...+++|++..+..++.+.+....++=++....+|.- +.+...+.+.... T Consensus 133 ~l~~~~~~~~~~~~~~~~~~~--~~~~~~a~~v~E~~~~~~~~~~~~~~~~~~~k~~~~~~is~e--ll~d~~~l~~~i~ 208 (385) T protein:vir:18 133 TIRDLLAQGRTSSNALEYVRE--EVFTNNADVVAEKALKPESDITFSKQTANVKTIAHWVQASRQ--VMDDAPMLQSYIN 208 (385) T ss_pred chhhhcceecccCcceEEEEE--ecCCcceeeeccCccccccccceeEEEEeeeeEEEeehhhHH--HHhhHHHHHHHHH Confidence 456656655554432334332 233345678999999999999999999999999998888875 3333356778888 Q ss_pred HHHHHHHHHHHHHHHhhcccccCCCCCCcccchhhhHHHhhccCCcEEEccCCCCCHHHhhhhhheeecccCceeeeecC Q lcl|NC_018863. 148 EDAISVIAKSIEWAIFYGDAALAAEADNQAGIEFDGLTKLIDEATNVIDLKGERLDEATLNKAAVIVGKGYGRATDAFMP 227 (479) Q Consensus 148 ~~ai~~~~~~~e~a~f~Gd~~l~~~~~~~~gleFDGl~~~I~~~~NviDarG~~l~~~~l~~aa~~i~~~fG~atd~~mp 227 (479) +.-...+...++.++++|+-. |-.+.||.+............ .....+.|.++...+...++..+-++|| T Consensus 209 ~~la~a~~~~~d~~~l~G~g~---------~~~~~Gi~~~~~~~~~~~~~~-~~~~~d~i~~~~~~l~~~~~~~~~~~~~ 278 (385) T protein:vir:18 209 NRLMYGLALKEEGQLLNGDGT---------GDNLEGLNKVATAYDTSLNAT-GDTRADIIAHAIYQVTESEFSASGIVLN 278 (385) T ss_pred HHHHHHHHHHHHHHHHhccCC---------CCccccccccccccccccccc-ccchHHHHHHHHHhhccccCCCCEEEEc Confidence 888899999999999999744 224678776653212222222 2345666777666677888889999999 Q ss_pred hHHHhhHHHhhcCceeEEeec-CCC--ccccCccccceecCceeEEecCCcccCCC--ccccCcccCCCCCcccceEEEe Q lcl|NC_018863. 228 IGVQADFTNNLLDRQRVIQPS-QAG--GFSTGFSINQFLSTRGAINLHGSTIMEND--NILVDRIPEPNAPQAPASVVAT 302 (479) Q Consensus 228 ~~vka~f~q~~~~~qrv~~~~-n~g--~~~~G~~V~~~~ss~g~I~L~~s~v~~a~--~~lver~~s~~aP~~P~~vta~ 302 (479) +.+.+.+...-...-|.+.+. ..+ +.-.|.+|- .+..-+ .+..+..+. .+++. ...+. ++ . T Consensus 279 ~~~~~~l~~lkd~~G~~l~~~~~~~~~~~l~G~pV~--~~~~~p---~~~~~~gd~~~~~~~~---~~~~~-----~v-~ 344 (385) T protein:vir:18 279 PRDWHNIALLKDNEGRYIFGGPQAFTSNIMWGLPVV--PTKAQA---AGTFTVGGFDMASQVW---DRMDA-----TV-E 344 (385) T ss_pred HHHHHHHHHhhcCCCceeccCcccCCCceecceeeE--EcCcCC---CCcEEEeecccEEEEE---Eecce-----EE-E Confidence 999998877654333334322 111 111232221 111000 001111110 01110 00000 00 0 Q ss_pred ecccccCcccccccceeeEEEEEEEcCCCCcccccceeeeeecCCCeEEEEEeecC Q lcl|NC_018863. 303 VKVNDKGAFRPVKDIKTHSYKVVVHSDDAESLASEAVTAVVANPTDSVSLAVKLQS 358 (479) Q Consensus 303 ~~~~~~g~~~~~sd~g~Y~YkV~a~n~~GES~~S~~VtaT~a~~~~~V~LtIt~~~ 358 (479) .... .+..|. . +...|++...=+..--.+...+ .|+++.+. T Consensus 345 ~~~~-~~~~~~-~--~~~~~~~~~r~~~~v~~~~a~~-----------~~~~~aa~ 385 (385) T protein:vir:18 345 VSRE-DRDNFV-K--NMLTILCEERLALAHYRPTAII-----------KGTFSSGS 385 (385) T ss_pred Eecc-ccchhh-c--CcEEEEEEEeeccEEecccceE-----------EEEeccCC Confidence 0000 000011 1 1123333211111111122222 23333221 No 43 >protein:vir:1433 Length: 435 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:30 # MgeName: phiE125 # Cross-refs: genbank:acc:NP_536362;genbank:gi:17975167;genbank:GeneID:929171 Probab=96.74 E-value=0.00035 Score=39.51 Aligned_cols=317 Identities=13% Similarity=0.099 Sum_probs=143.7 Q ss_pred Cccccccc-------------------------c-----e-eeeecCchhHHHHHHHHHHHhhcCc-------------- Q lcl|NC_018863. 1 MTELQKEQ-------------------------K-----V-EARKLPAGAEAELAELVSKSFTTGT-------------- 35 (479) Q Consensus 1 ~~~~~~~~-------------------------~-----~-~~~~~~~~~~~~~~e~~~Ksf~ag~-------------- 35 (479) |.+|.++. . + ...+......... ..+.|++..+. T Consensus 45 i~~l~~~I~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~ 123 (435) T protein:vir:14 45 FSELTAQIERAEAAERMAAAAAVPVDPNPTAVAAPAAAPVHAQPKALEVKGAKM-ARMVRALAAARGDAQLASKLAIERG 123 (435) T ss_pred HHHHHHHHHHHHHHHHHHHhhcccccchhhhhhhccccccccccchhhhhHHHH-HHHHHHHHhhcchhhHHHHHHHhhh Confidence 00000000 0 0 0000011111111 22333333221 Q ss_pred -------ccCcccccCccccchhhhHHHHHHHhhccccccchhhhccchhH--HHHHHhhhhhccCcccccccccccccc Q lcl|NC_018863. 36 -------GITPDTQHDAAALRRELLDDQVKMLAFTNGDFTIYPLINKQQVN--STVAKYAVFNQHGRTGHSRFVREVGVA 106 (479) Q Consensus 36 -------~~~~~~~~~gaAlr~esld~~i~~l~~~~~~f~~~~~i~k~~~~--stv~~y~~~~~~G~~g~~~fv~E~g~~ 106 (479) ..+..+...|+.|-.+.+..+|..+..... .+..+..+.+. +---+|.++. +.+...+++|++.. T Consensus 124 ~~~~~~~~~~~~t~~~gg~~vP~~~~~~ii~~l~~~~---~i~~~~~~~~~~~~~~~~~p~~~---~~~~a~~v~E~~~~ 197 (435) T protein:vir:14 124 FGEEVAMSLNTLSPGAGGVLVPENLSSEVIELLRPKS---VVRKLGARTLPLSNGNITIPRLK---GGAIVGYIGADTDI 197 (435) T ss_pred hhhhhhhhcccCCcCCCccccchhHHHHHHHHHhhhc---hhhhhcceeeecCCCceEEEEEe---CCcceeeeccCccc Confidence 112223345677888888888866664433 22333222222 2112333333 33456789999999 Q ss_pred cccCcceEEEEEEEEeeeehhhhhhhHhhhcchh--hHHHHHHHHHHHHHHHHHHHHHhhcccccCCCCCCcccchhhhH Q lcl|NC_018863. 107 SINDPNIRQKTVQMKFLSDTKQQSLAAGLVNNIA--DPMTILTEDAISVIAKSIEWAIFYGDAALAAEADNQAGIEFDGL 184 (479) Q Consensus 107 ~~~d~~~~r~~~~~k~l~~~~~vs~~~~lv~~~~--dp~~~~~~~ai~~~~~~~e~a~f~Gd~~l~~~~~~~~gleFDGl 184 (479) +..|+.+.+.+..++=++....+|.-+ +.++.- +.+....+.-...+.+.+|.++++|+-. +-++.|| T Consensus 198 ~~~~~~f~~i~~~~~k~~~~~~iS~el-l~ds~~~~~l~~~i~~~l~~ai~~~~d~a~l~G~G~---------~~~p~Gi 267 (435) T protein:vir:14 198 PTTQQQFDDLKLTAKKMAALVPIANDL-IKYAGVNPNVDQIVVGDLTAAIGAREDKAFIRDDGT---------ANTPKGL 267 (435) T ss_pred cccccceeEEEeeeEEEEEeehhhHHH-HHhhccCHHHHHHHHHHHHHHHHHHHHHHhhccCCC---------Cccccce Confidence 999999999999999898888888655 333322 3557778888889999999999999743 1245677 Q ss_pred HHhhccCCcEEEcc-CCCCC--HHHhhhhhheeec---ccCceeeeecChHHHhhHHHhhcCceeEEeecCCCccccCcc Q lcl|NC_018863. 185 TKLIDEATNVIDLK-GERLD--EATLNKAAVIVGK---GYGRATDAFMPIGVQADFTNNLLDRQRVIQPSQAGGFSTGFS 258 (479) Q Consensus 185 ~~~I~~~~NviDar-G~~l~--~~~l~~aa~~i~~---~fG~atd~~mp~~vka~f~q~~~~~qrv~~~~n~g~~~~G~~ 258 (479) .+... +.++...- +...+ ...|.++...+.. +++.+ -..|++.+.+.+...-...-|.+.+...++.-.|.+ T Consensus 268 ~~~~~-~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~-~~v~n~~~~~~L~~lkd~~G~~l~~~~~~g~l~G~P 345 (435) T protein:vir:14 268 RFWAL-PSNVITASDASTLQKIETDLGKVILALENADANLTQP-GWIMAPRTFRFLEGLRDGNGNKVYPELANGMLKGYP 345 (435) T ss_pred eeccc-ccceeccccccchhhHHHHHHHHHHHhhhccccccCC-EEEEcHHHHHHHHHhhccCCceeccCCCCCeeecce Confidence 65443 23333322 22222 2223333222222 23332 367899999888776554434454543333333443 Q ss_pred ccceecCceeEEecC-----CcccCCCc-ccc-Cc-----ccCCCCCcccceEEEeecccccCcccccccceeeEEEEEE Q lcl|NC_018863. 259 INQFLSTRGAINLHG-----STIMENDN-ILV-DR-----IPEPNAPQAPASVVATVKVNDKGAFRPVKDIKTHSYKVVV 326 (479) Q Consensus 259 V~~~~ss~g~I~L~~-----s~v~~a~~-~lv-er-----~~s~~aP~~P~~vta~~~~~~~g~~~~~sd~g~Y~YkV~a 326 (479) |.. +..-+.++.+ ..+.++.. +++ +| ..++.+.+.-. .++...-|.. +.-.+++.. T Consensus 346 v~~--~~~~p~~~~~~~~~~~i~~gd~s~~~i~~~~~~~~~~~~~~~~~~~------~~~~~~~f~~----~~~~~r~~~ 413 (435) T protein:vir:14 346 VGK--TTQVPINLGETGKESEIYFTDFGDVFIGEEETLEIDYSKEATYKDA------DGHMVSAFQR----DQTLIRVIA 413 (435) T ss_pred eEe--eccccccccCCCccceEEEeecccEEEEEecccEEEEecccccccc------ccchhhhhhc----Chhheeeee Confidence 311 1100001000 01111111 111 11 00000100000 0000011111 123444444 Q ss_pred EcCCCCcccccceeeeeecCCC Q lcl|NC_018863. 327 HSDDAESLASEAVTAVVANPTD 348 (479) Q Consensus 327 ~n~~GES~~S~~VtaT~a~~~~ 348 (479) .-+.+--.|+..+..+-++-+- T Consensus 414 r~d~~~~~~~a~~~l~~~~~~~ 435 (435) T protein:vir:14 414 KNDFGPRHVESIAVLAGVAWGA 435 (435) T ss_pred eeCceeecccceEEEecCCCCC Confidence 4444545555555555444333 No 44 >protein:vir:103759 Length: 330 # NCBI annotation: hypothetical protein # Family: family:all:1903 # MgeID: mge:1645 # MgeName: BcepC6B # Cross-refs: genbank:acc:YP_024928;genbank:gi:48697198;genbank:GeneID:2846083 Probab=96.65 E-value=9.1e-05 Score=42.73 Aligned_cols=263 Identities=19% Similarity=0.208 Sum_probs=124.7 Q ss_pred CcccccccceeeeecCchhHHHHHHHHHHHhhcCcccCcccccCccccchhhhHHHHHHHhhccccccchhhhccchhHH Q lcl|NC_018863. 1 MTELQKEQKVEARKLPAGAEAELAELVSKSFTTGTGITPDTQHDAAALRRELLDDQVKMLAFTNGDFTIYPLINKQQVNS 80 (479) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~e~~~Ksf~ag~~~~~~~~~~gaAlr~esld~~i~~l~~~~~~f~~~~~i~k~~~~s 80 (479) ||.+. ++++-+.++ +|-+-. +.. .+-..|. |+.++. +|.+++=...++ T Consensus 1 m~~~~---------~~a~TL~e~----AKr~~~------d~~---~~~IIE~-------l~~tn~---IL~~lpf~e~N~ 48 (330) T protein:vir:10 1 MATLS---------TNNPTMADV----AKRLDP------NGK---VDIIVEM-------LNQTNP---VLQDMTAIEGNL 48 (330) T ss_pred CCcCC---------CCcccHHHH----HhhcCc------chh---HHHHHHH-------HhcCch---HHhhcchhhccC Confidence 66553 223333332 332211 111 0112222 222221 233333222221 Q ss_pred -HHHHhhhhhccCcccccccccccccccccCcceEEEEEEEEeeeehhhhhhhHhhh-cchhhHHHHHHHHHHHHHHHHH Q lcl|NC_018863. 81 -TVAKYAVFNQHGRTGHSRFVREVGVASINDPNIRQKTVQMKFLSDTKQQSLAAGLV-NNIADPMTILTEDAISVIAKSI 158 (479) Q Consensus 81 -tv~~y~~~~~~G~~g~~~fv~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lv-~~~~dp~~~~~~~ai~~~~~~~ 158 (479) |=|.+.+ +-+--...|-.=.....-+.++..|++..++.|..-..|-+...-. .+..+-+++|.+.-|..+.+.+ T Consensus 49 ~tg~~t~v---rt~LP~~~fR~lN~g~~~s~~tt~qvt~~l~ilgg~~eVDr~la~~~Gn~a~~ra~e~~~~ikam~q~~ 125 (330) T protein:vir:10 49 PTGHRTSV---RTGLPTPTWRKLYGGVLPNKSSTAQVTDNCGMLEAYAEVDKALADLNGNTAAFRLSEDRAQIEGMNQEV 125 (330) T ss_pred CcccceeE---EeecCCchhhhcCCccccccceEEEEEEEeEEecchhhhhhHHHhhcCCHHHHHHHHHHHHHHHHHHHH Confidence 1121111 1122223342222233445699999999999999999998866544 5567778999999999999999 Q ss_pred HHHHhhcccccCCCCCCcccchhhhHHHhhc-----cCCcEEEccCCCCCHHHhhhhhheeecccCceeeeecChHHHhh Q lcl|NC_018863. 159 EWAIFYGDAALAAEADNQAGIEFDGLTKLID-----EATNVIDLKGERLDEATLNKAAVIVGKGYGRATDAFMPIGVQAD 233 (479) Q Consensus 159 e~a~f~Gd~~l~~~~~~~~gleFDGl~~~I~-----~~~NviDarG~~l~~~~l~~aa~~i~~~fG~atd~~mp~~vka~ 233 (479) +..+||||++.+| -+||||.+.+. .++|+||+.|.--..-.|+ ++.=+=..+ ..+-|-+-|+- T Consensus 126 ~~~~iyGD~a~~p-------~~F~GL~kR~~~~ta~~~~qvIdaGGtG~~~TSi~----~v~wg~~~~-~giyPkG~kaG 193 (330) T protein:vir:10 126 AQTLFYGNDGIAP-------AEFTGLSPRYNSLSAENKDNVIDAGGTGSDNASAW----LVVWGPNTC-HSIYPKGSKAG 193 (330) T ss_pred HHHhccCCCCCCh-------hhccchhhhcCCCCCCchhheeeccccccCceEEE----EEEEcCCeE-EEEcccCcccc Confidence 9999999999887 47999999994 1469999999544332222 111111222 33348888888 Q ss_pred HHHhhcCceeEEeecCCCc----------cccCccccceecCce--eEE----------------------ecCCcccCC Q lcl|NC_018863. 234 FTNNLLDRQRVIQPSQAGG----------FSTGFSINQFLSTRG--AIN----------------------LHGSTIMEN 279 (479) Q Consensus 234 f~q~~~~~qrv~~~~n~g~----------~~~G~~V~~~~ss~g--~I~----------------------L~~s~v~~a 279 (479) |+-.-++.+++....-.|+ --.|..|..|.++-= +|+ +-++..+.. T Consensus 194 l~~~d~g~~~~~~~dg~gg~y~~~~~~~~w~~Gl~i~d~r~vvRI~NIdvs~l~~~~~~~~li~lm~~A~~~ip~~~~g~ 273 (330) T protein:vir:10 194 LSVEDKGQVTIENADGNGGRMEGYRTHYKWDIGLTLRDWRYVARVCNIDVSDLATSANAQALIKYMIMAAERIPQLGMGR 273 (330) T ss_pred ceeeeccceeeecccCCCCceeEEeeeeeeeeeeEEeCcccEEEEeecccccCCCCccHHHHHHHHHHHHHhccCCCCCc Confidence 7777777766653221111 122333322222100 010 001111222 Q ss_pred CccccCc-----------------c-cCCCCCc-------ccceEE-EeecccccCccccccccee Q lcl|NC_018863. 280 DNILVDR-----------------I-PEPNAPQ-------APASVV-ATVKVNDKGAFRPVKDIKT 319 (479) Q Consensus 280 ~~~lver-----------------~-~s~~aP~-------~P~~vt-a~~~~~~~g~~~~~sd~g~ 319 (479) ..++.-| . ...-++. .|+-.+ |...+ +..=+ T Consensus 274 ~~~y~n~~v~~~L~~q~~~k~n~~l~~~~~~g~~~t~~~gipir~~Dail~t---------E~~vv 330 (330) T protein:vir:10 274 AVWYMNRNLREKLRLGIVDKIANNLTWETVSGERVMTFDGIPVQRTDALLNT---------ESRVV 330 (330) T ss_pred ceeeechHHHHHHHHHHhhcccceeeeeecCCeeeEEECCeEEEEEeeeecC---------ccccC Confidence 2222211 0 0000111 111111 01000 00001 No 45 >protein:vir:81070 Length: 390 # NCBI annotation: p09 # Family: family:all:585 # MgeID: mge:1889 # MgeName: Xop411 # Cross-refs: genbank:acc:YP_001285679;genbank:gi:148727187;genbank:GeneID:5247115 Probab=96.60 E-value=0.00015 Score=41.51 Aligned_cols=308 Identities=10% Similarity=0.022 Sum_probs=147.2 Q ss_pred Ccccccccc--eeeeecCchh--HHHHHHHHHHHhhcCc-------------ccCcccccCccccchhhhHHHHHHHhhc Q lcl|NC_018863. 1 MTELQKEQK--VEARKLPAGA--EAELAELVSKSFTTGT-------------GITPDTQHDAAALRRELLDDQVKMLAFT 63 (479) Q Consensus 1 ~~~~~~~~~--~~~~~~~~~~--~~~~~e~~~Ksf~ag~-------------~~~~~~~~~gaAlr~esld~~i~~l~~~ 63 (479) +.+..++.. ....+-.... +....+.+.+.+.-+. ..+..+..+|+.+..| +.+.|..+... T Consensus 61 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~-~~~~ii~~~~~ 139 (390) T protein:vir:81 61 VAELEGNGAGGDVQHVSVGDMFVASEQFQASAGRWNDRSARATMNIKAALNTASTDAAGSAGALTTPN-RLPGFITPPDA 139 (390) T ss_pred HHHHHhcccccccccccchhhhhhhHHHHHHHHHHhhhhhhhhhHHHHHHHhhccccccCCcceechh-hhHHHHHHHhh Confidence 111111110 0000111111 1111111221111111 1111223334444444 44555444333 Q ss_pred cccccchhhhccchhHHHHHHhhhhhccCcccccccccccccccccCcceEEEEEEEEeeeehhhhhhhHhhhcchhhHH Q lcl|NC_018863. 64 NGDFTIYPLINKQQVNSTVAKYAVFNQHGRTGHSRFVREVGVASINDPNIRQKTVQMKFLSDTKQQSLAAGLVNNIADPM 143 (479) Q Consensus 64 ~~~f~~~~~i~k~~~~stv~~y~~~~~~G~~g~~~fv~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lv~~~~dp~ 143 (479) . ..+.+-+...+..+-..+|.++. +..+...+++|++..+..++.+......++-++---.+|.-+ +.++ .+.+ T Consensus 140 ~--~~l~~~~~~~~~~~~~~~~~~~~--~~~~~a~~v~Eg~~~~~~~~~~~~i~~~~~k~~~~~~is~el-l~d~-~~~~ 213 (390) T protein:vir:81 140 R--LTVRDLIGSGRTDSALIEYVQET--GFVNNAAIVAEGALKPESSLKFAKKTDTTHVIAHTMKATRQI-LSDA-PQLA 213 (390) T ss_pred h--hhhhhhcceeeccCCceEEEEEe--cCCcceeeecCCcccccccceeeEEEEeeeEEEEeehhhHHH-HHhH-HHHH Confidence 2 33454444444444333443332 333456789999999999999999999999999888888853 3344 4788 Q ss_pred HHHHHHHHHHHHHHHHHHHhhcccccCCCCCCcccchhhhHHHhhccCCcEEEccCCCCCHHHhhhhhheeecccCceee Q lcl|NC_018863. 144 TILTEDAISVIAKSIEWAIFYGDAALAAEADNQAGIEFDGLTKLIDEATNVIDLKGERLDEATLNKAAVIVGKGYGRATD 223 (479) Q Consensus 144 ~~~~~~ai~~~~~~~e~a~f~Gd~~l~~~~~~~~gleFDGl~~~I~~~~NviDarG~~l~~~~l~~aa~~i~~~fG~atd 223 (479) ....+.-...++..++.++++||-. |-.+.|+.+.......+....+ ....+.|..+--.+...+...+- T Consensus 214 ~~i~~~l~~~~~~~~d~a~l~G~g~---------~~~~~Gi~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~ 283 (390) T protein:vir:81 214 SYMNNRLIRGLKVKEDAEILRGTGA---------NDGLLGLIPQATTYAAPTTIAG-ATRVDQLRLAMLQASLAEYNPSG 283 (390) T ss_pred HHHHHHHHHHHHHHHHHHHHhcCCC---------CCcccceeeccccccccccccc-chhHHHHHHHHHhhccccCCCCE Confidence 8888889999999999999999754 1236788766543223333333 33445555555555566777778 Q ss_pred eecChHHHhhHHHhhcCceeEE-eecCCC-c-cccCccccceecCceeEEecCCcccCCC--ccccCcccCCCCCcccce Q lcl|NC_018863. 224 AFMPIGVQADFTNNLLDRQRVI-QPSQAG-G-FSTGFSINQFLSTRGAINLHGSTIMEND--NILVDRIPEPNAPQAPAS 298 (479) Q Consensus 224 ~~mp~~vka~f~q~~~~~qrv~-~~~n~g-~-~~~G~~V~~~~ss~g~I~L~~s~v~~a~--~~lver~~s~~aP~~P~~ 298 (479) ++|++.+.+.+...-...-|.+ ++...+ . .-.|.+|.. +...+ .+..+.++. .++.. ... . T Consensus 284 ~v~~~~~~~~l~~lkd~~G~~l~~~~~~~~~~~l~G~pv~~--~~~~p---~~~~~~gd~~~~~~~~---~~~------~ 349 (390) T protein:vir:81 284 IVINPIDWAAIELAKDANNQYLIGNARGTLTPTLWGLPVVA--TQAMA---PGEFLVGAFDLAAQIF---DQW------D 349 (390) T ss_pred EEEcHHHHHHHHHhhcCCCceeecCcccccCceecceeeEE--cCCCC---CCcEEEEehhceEEEE---Eec------c Confidence 9999999998887654333333 222111 1 112333311 11000 001111111 01110 000 0 Q ss_pred EEEeecccccCcccccccceeeEEEEEEEcCCCCcccccceeeeee Q lcl|NC_018863. 299 VVATVKVNDKGAFRPVKDIKTHSYKVVVHSDDAESLASEAVTAVVA 344 (479) Q Consensus 299 vta~~~~~~~g~~~~~sd~g~Y~YkV~a~n~~GES~~S~~VtaT~a 344 (479) ..... +..+..|. . +...|++...=+..--.+...|..|.+ T Consensus 350 ~~v~~--~~~~~~~~-~--~~v~~r~~~r~d~~v~~~~a~v~~t~a 390 (390) T protein:vir:81 350 ARVEI--GYVGEDFQ-R--NMITVLAEERLALVVYRPEALISGSFA 390 (390) T ss_pred eEEEE--ecccchhh-c--CcEEEEEEEeeccEEecccceEEEEeC Confidence 00010 11111111 1 223444444444444445555566666 No 46 >protein:vir:97053 Length: 390 # NCBI annotation: putative head protein # Family: family:all:585 # MgeID: mge:1653 # MgeName: OP1 # Cross-refs: genbank:acc:YP_453565;genbank:gi:84662600;genbank:GeneID:5142468 Probab=96.59 E-value=0.00048 Score=38.76 Aligned_cols=299 Identities=13% Similarity=0.038 Sum_probs=139.8 Q ss_pred CcccccccceeeeecC--chh--HHHHHHHHHHHhhcCc-------------ccCcccccCccccchhhhHHHHHHHhhc Q lcl|NC_018863. 1 MTELQKEQKVEARKLP--AGA--EAELAELVSKSFTTGT-------------GITPDTQHDAAALRRELLDDQVKMLAFT 63 (479) Q Consensus 1 ~~~~~~~~~~~~~~~~--~~~--~~~~~e~~~Ksf~ag~-------------~~~~~~~~~gaAlr~esld~~i~~l~~~ 63 (479) +-+..++.+-...... ... ..+-.+.+.+.+.-+. ..+ .+-.+++.|-.+.+...|..+... T Consensus 61 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~g~lip~~~~~~ii~~~~~ 139 (390) T protein:vir:97 61 VAELEGNGAGGDVQHVSVGDMFVASEQFQASTGRWNDRSARATMNIKAALNTAST-DAAGSAGALTTPNRLPGFITPPDA 139 (390) T ss_pred HHHHHhcccccccccccchhhhhhhHHHHHHHHHhhhhhhhhhhHHHHHHHhhhc-ccccccccccchhhhHHHHHHHhh Confidence 1111111000000000 000 0011112222211111 111 123445555555566666555544 Q ss_pred cccccchhhhccchhHHHHHHhhhhhccCcccccccccccccccccCcceEEEEEEEEeeeehhhhhhhHhhhcchhhHH Q lcl|NC_018863. 64 NGDFTIYPLINKQQVNSTVAKYAVFNQHGRTGHSRFVREVGVASINDPNIRQKTVQMKFLSDTKQQSLAAGLVNNIADPM 143 (479) Q Consensus 64 ~~~f~~~~~i~k~~~~stv~~y~~~~~~G~~g~~~fv~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lv~~~~dp~ 143 (479) .. .+++-+....+.+-...|.++. ++.+...|++|++..+.+++.+.+....++-++..-.+|.-+ +.++ .+.+ T Consensus 140 ~~--~i~~~~~~~~~~~~~~~~~~~~--~~~~~a~~v~Eg~~~~~~~~~~~~i~~~~~k~~~~~~is~el-l~ds-~~l~ 213 (390) T protein:vir:97 140 RL--TVRDLIGSGRTDSALIEYVQET--GFVNNAAIVAEGALKPESSLKFAKKTDTTHVIAHTMKATRQI-LSDA-PQLA 213 (390) T ss_pred hh--hhHhhcceeeccCCceEEEEEe--cCCcceeeecCCccccccccceeEEEEeeeeEEEeehhhHHH-HHhH-HHHH Confidence 43 3555555555554444454433 333456799999999999999999999999999988888854 3333 5788 Q ss_pred HHHHHHHHHHHHHHHHHHHhhcccccCCCCCCcccchhhhHHHhhccCCcEEEccCCCCCHHHhhhhhheeecccCceee Q lcl|NC_018863. 144 TILTEDAISVIAKSIEWAIFYGDAALAAEADNQAGIEFDGLTKLIDEATNVIDLKGERLDEATLNKAAVIVGKGYGRATD 223 (479) Q Consensus 144 ~~~~~~ai~~~~~~~e~a~f~Gd~~l~~~~~~~~gleFDGl~~~I~~~~NviDarG~~l~~~~l~~aa~~i~~~fG~atd 223 (479) ....+.-...+.+.++.++|+|+-. +-++.||.+... ..+..-........+.|..+-..+...|...+. T Consensus 214 ~~i~~~la~a~~~~~d~a~l~G~g~---------~~~p~Gi~~~~~-~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~ 283 (390) T protein:vir:97 214 SYMNNRLIRGLKVKEDAEILRGTGA---------NDGLLGLIPQAT-TYAAPTTIAGATRVDQLRLAMLQASLAEYPASG 283 (390) T ss_pred HHHHHHHHHHHHHHHHHHHhhcCCC---------Cccccceeeccc-cccccccccccchHHHHHHHHHhhccccCCCCE Confidence 8899999999999999999999643 123578876553 223322333444556666666666677777788 Q ss_pred eecChHHHhhHHHhhcCceeEEeecCCCccccCccccceecCceeEEecCCcccCCCccccCcccCCCCCcccceEEEee Q lcl|NC_018863. 224 AFMPIGVQADFTNNLLDRQRVIQPSQAGGFSTGFSINQFLSTRGAINLHGSTIMENDNILVDRIPEPNAPQAPASVVATV 303 (479) Q Consensus 224 ~~mp~~vka~f~q~~~~~qrv~~~~n~g~~~~G~~V~~~~ss~g~I~L~~s~v~~a~~~lver~~s~~aP~~P~~vta~~ 303 (479) ++|++.+...+...-...-|.+.+. +..... . .|.|-+| +..+. .|.. ......- T Consensus 284 ~v~n~~~~~~L~~lkd~~G~~l~~~-~~~~~~----~---------~l~G~pV-----~~~~~-----~~~~-~~~~gd~ 338 (390) T protein:vir:97 284 IVINPIDWAAIELAKDANNQYLIGN-ARGTLT----P---------TLWGLPV-----VATQA-----MAPG-EFLVGAF 338 (390) T ss_pred EEEcHHHHHHHHHhhcCCCceeecC-ccCCCC----c---------eecceee-----EEcCC-----CCCC-cEEEEec Confidence 9999999988886553333333222 110000 0 1111111 11110 0100 0000000 Q ss_pred --------------cccccCcccccccceeeEEEEEEEcCCCCcccccceeeeee Q lcl|NC_018863. 304 --------------KVNDKGAFRPVKDIKTHSYKVVVHSDDAESLASEAVTAVVA 344 (479) Q Consensus 304 --------------~~~~~g~~~~~sd~g~Y~YkV~a~n~~GES~~S~~VtaT~a 344 (479) ..+..+..|. . +.-.|++...=+.+--.+...+..+.+ T Consensus 339 ~~~~~~~~~~~~~i~~~~~~~~f~-~--~~~~~r~~~r~d~~v~~~~a~v~~~~a 390 (390) T protein:vir:97 339 DLAAQIFDQWDARVEIGYVNDDFQ-R--NMVTVLAEERLALVVYRPEALITGSFA 390 (390) T ss_pred cceEEEEEecceEEEEeecccccc-c--CcEEEEEEEeeccEEeccccEEEEEeC Confidence 0000000000 0 001111111111111111111111111 No 47 >protein:vir:105038 Length: 428 # NCBI annotation: major capsid head protein precursor # Family: family:all:21 # MgeID: mge:1465 # MgeName: phiKO2 # Cross-refs: genbank:acc:YP_006586;genbank:gi:46402092;genbank:GeneID:2777903 Probab=96.49 E-value=0.00014 Score=41.75 Aligned_cols=318 Identities=11% Similarity=0.088 Sum_probs=141.9 Q ss_pred Cccccccc----ceeeeecCchhHHHHHHHHHH-------H--hh-cCccc------CcccccCccccchhhhHHHHHHH Q lcl|NC_018863. 1 MTELQKEQ----KVEARKLPAGAEAELAELVSK-------S--FT-TGTGI------TPDTQHDAAALRRELLDDQVKML 60 (479) Q Consensus 1 ~~~~~~~~----~~~~~~~~~~~~~~~~e~~~K-------s--f~-ag~~~------~~~~~~~gaAlr~esld~~i~~l 60 (479) +-...... +.................+.. . +. -.+.. ...+-..|+.|..+.+..+|..+ T Consensus 70 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~liP~~~~~~ii~~ 149 (428) T protein:vir:10 70 VKATQHGPAVIVKAEPKQYTGAGMTRMVMSIAAAQGNLQDAAKFASDELNDQSVSMAISTAAGSGGVLIPQNIHSEVIEL 149 (428) T ss_pred hhchhhccccccccccchhhhHHHHHHHHHHHHhhhhHHHHHHHhhhhhhhhhHhhhhcccccCCccccchhHHHHHHHH Confidence 00000000 000000100011111010000 0 00 00000 00112246778888887777665 Q ss_pred hhccccccchhhhccchhH--HHHHHhhhhhccCcccccccccccccccccCcceEEEEEEEEeeeehhhhhhhHhhhcc Q lcl|NC_018863. 61 AFTNGDFTIYPLINKQQVN--STVAKYAVFNQHGRTGHSRFVREVGVASINDPNIRQKTVQMKFLSDTKQQSLAAGLVNN 138 (479) Q Consensus 61 ~~~~~~f~~~~~i~k~~~~--stv~~y~~~~~~G~~g~~~fv~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lv~~ 138 (479) ..... .+..+..+.+. +--.+|.++ .+.+...+++|++..+.+++.+.+.+...+=++.-..+|..+ +.++ T Consensus 150 l~~~~---~l~~~~~~~~~~~~g~~~~p~~---~~~~~a~~v~Eg~~~~~~~~~f~~i~~~~~k~~~~v~is~el-l~ds 222 (428) T protein:vir:10 150 LRDRT---IVRKLGARSIPLPNGNMSLPRL---AGGATASYTGENQDAKVSEARFDDVKLTAKTMIAMVPISNAL-IGRA 222 (428) T ss_pred Hhhhc---hhhhhcceeeecCCcceEEEEE---eCCcceeeeccCccccccccceeeEEeeeEEEEEeehhhHHH-Hhhh Confidence 54332 23333212111 111233333 233457799999999999999999999999888887777765 3455 Q ss_pred hhhHHHHHHHHHHHHHHHHHHHHHhhcccccCCCCCCcccchhhhHHHhhccCCcEEEc-cCCCCCHHHhhhh------h Q lcl|NC_018863. 139 IADPMTILTEDAISVIAKSIEWAIFYGDAALAAEADNQAGIEFDGLTKLIDEATNVIDL-KGERLDEATLNKA------A 211 (479) Q Consensus 139 ~~dp~~~~~~~ai~~~~~~~e~a~f~Gd~~l~~~~~~~~gleFDGl~~~I~~~~NviDa-rG~~l~~~~l~~a------a 211 (479) ..|.+....+.-...+.+.+|.++++||.. |-+++|+.+.......++.. .+...+.+.+... . T Consensus 223 ~~~l~~~i~~~l~~ai~~~~d~~~l~G~G~---------~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 293 (428) T protein:vir:10 223 GFNVEQLVLQDILTAISVREDKAFMRDDGT---------GDTPIGMKARATQWNRLLPWAADAAVNLDTIDTYLDSIILM 293 (428) T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHhccCCC---------CccccccccccccccccccccccccccHHHHHHHHHHHHHh Confidence 667888889999999999999999999753 23568998876533333332 3445554444321 1 Q ss_pred heeecccCceeeeecChHHHhhHHHhhcCceeEEeecCCCccccCccccceecCceeEEecC-----CcccCC-Ccccc- Q lcl|NC_018863. 212 VIVGKGYGRATDAFMPIGVQADFTNNLLDRQRVIQPSQAGGFSTGFSINQFLSTRGAINLHG-----STIMEN-DNILV- 284 (479) Q Consensus 212 ~~i~~~fG~atd~~mp~~vka~f~q~~~~~qrv~~~~n~g~~~~G~~V~~~~ss~g~I~L~~-----s~v~~a-~~~lv- 284 (479) ......+-.....+|++.+...+...-...-|-+.+...++.-.|.+|.. +..-+.++.. ..+.++ ..+++ T Consensus 294 ~~~~~~~~~~~~~v~n~~~~~~L~~lkd~~G~~i~~~~~~g~l~G~pv~~--~~~~p~~~~~~~~~~~i~~gd~s~~~i~ 371 (428) T protein:vir:10 294 SMDGNSNMISSGWGMSNRTYMKLFGLRDGNGNKVYPEMAQGMLKGYPIQR--TSAIPANLGEGGKESEIYFADFNDVVIG 371 (428) T ss_pred hhccccccccCEEEEcHHHHHHHHHhhccCCceeccCCCCCeeeceeeEE--eccccccccCCCccceEEEEecceEEEE Confidence 22223333334567999888887765543223333322222333444411 1000000000 000000 01111 Q ss_pred Cc-----ccCCCCCcccceEEEeecccccCcccccccceeeEEEEEEEcCCCCcccccceeeeeecC Q lcl|NC_018863. 285 DR-----IPEPNAPQAPASVVATVKVNDKGAFRPVKDIKTHSYKVVVHSDDAESLASEAVTAVVANP 346 (479) Q Consensus 285 er-----~~s~~aP~~P~~vta~~~~~~~g~~~~~sd~g~Y~YkV~a~n~~GES~~S~~VtaT~a~~ 346 (479) ++ ..+..+.+.. .......-|.. ..-.+++...-+-+--.|+..+..|..+- T Consensus 372 ~~~~i~i~~~~~~~~~~------~~~~~~~~f~~----~~~~~R~~~r~d~~v~~p~a~~~~t~~~~ 428 (428) T protein:vir:10 372 EDGNMKVDFSKEASYID------TDGKLVSAFSR----NQSLIRVVTEHDIGFRHPEGLVLGTGVLF 428 (428) T ss_pred EecceEEEeeccccccc------ccccccchhhc----chhheeeeeeeCceeeccceEEEEeccCC Confidence 00 0000010000 00000011111 12233333333333333444444444333 No 48 >protein:vir:10364 Length: 390 # NCBI annotation: head protein; major capsid subunit precursor # Family: family:all:585 # MgeID: mge:183 # MgeName: Xp10 # Cross-refs: genbank:acc:NP_858956;genbank:gi:32128421;genbank:GeneID:2648357 Probab=96.48 E-value=0.00049 Score=38.72 Aligned_cols=312 Identities=13% Similarity=0.065 Sum_probs=136.3 Q ss_pred Ccccccccceeee--ecCchh--HHHHHHHHHHHhhcCc-------------ccCcccccCccccchhhhHHHHHHHhhc Q lcl|NC_018863. 1 MTELQKEQKVEAR--KLPAGA--EAELAELVSKSFTTGT-------------GITPDTQHDAAALRRELLDDQVKMLAFT 63 (479) Q Consensus 1 ~~~~~~~~~~~~~--~~~~~~--~~~~~e~~~Ksf~ag~-------------~~~~~~~~~gaAlr~esld~~i~~l~~~ 63 (479) +-++.+..+.... +..... +.+....+......+. ..+..+..+|+-+-.+.+. .+..+... T Consensus 61 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~-~ii~~~~~ 139 (390) T protein:vir:10 61 VAELEGNGAGGDVQHVSVGDLFVASEQFQASAGRWNDRSARATMNIKAALNTASTDAAGSAGALTTPNRLP-GFITQPDA 139 (390) T ss_pred HHHHHhhcccccccccchhhhhhhhHHHHHHHHhhhhhhhhhhhHHHHHHHhhhcccccccccccchhHHH-HHHHHHHh Confidence 1111111100000 000000 0000011111111110 1111223344445555544 44333333 Q ss_pred cccccchhhhccchhHHHHHHhhhhhccCcccccccccccccccccCcceEEEEEEEEeeeehhhhhhhHhhhcchhhHH Q lcl|NC_018863. 64 NGDFTIYPLINKQQVNSTVAKYAVFNQHGRTGHSRFVREVGVASINDPNIRQKTVQMKFLSDTKQQSLAAGLVNNIADPM 143 (479) Q Consensus 64 ~~~f~~~~~i~k~~~~stv~~y~~~~~~G~~g~~~fv~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lv~~~~dp~ 143 (479) . ..+++.+...++.+.-.+|.++ .+..+...+++|++..+..|+++......++-++-...+|.-+ +.++ .+.. T Consensus 140 ~--~~l~~~~~~~~~~~~~~~~~~~--~~~~~~a~~v~Eg~~~~~~~~~~~~i~~~~~k~~~~~~is~el-l~d~-~~l~ 213 (390) T protein:vir:10 140 R--LTVRDLIGSGRTDSALIEYVQE--TGFVNNAAIVAEGALKPESSLKFAKKTDTTHVIAHTMKATRQI-LSDA-PQLA 213 (390) T ss_pred h--chhhhhcceeeccCCceEEEEE--ecCCcceeeecCCccccccccceeEEEEeeEEEEEeehhhHHH-HHhH-HHHH Confidence 2 3455555544444433344332 3334456789999999999999999999999999988888864 3444 4777 Q ss_pred HHHHHHHHHHHHHHHHHHHhhcccccCCCCCCcccchhhhHHHhhccCCcEEEccCCCCCHHHhhhhhheeecccCceee Q lcl|NC_018863. 144 TILTEDAISVIAKSIEWAIFYGDAALAAEADNQAGIEFDGLTKLIDEATNVIDLKGERLDEATLNKAAVIVGKGYGRATD 223 (479) Q Consensus 144 ~~~~~~ai~~~~~~~e~a~f~Gd~~l~~~~~~~~gleFDGl~~~I~~~~NviDarG~~l~~~~l~~aa~~i~~~fG~atd 223 (479) ....+.-...++..++.+++.|+-. |-++.|+.+............|.. ..+.|..+-..+..+|...+- T Consensus 214 ~~i~~~l~~~~~~~~~~~il~G~G~---------~~~p~Gi~~~~~~~~~~~~~~~~~-~~~~~~~~~~~l~~~~~~~~~ 283 (390) T protein:vir:10 214 SYMNNRLIRGLKVKEDAEILRGTGA---------NDGLLGLIPQATTYAAPTTIAGAT-RVDQLRLAMLQASLAEYPASG 283 (390) T ss_pred HHHHHHHHHHHHHHHHHHHhhcCCC---------Cccccccccccccccccccccccc-hHHHHHHHHHhhccccCCCCE Confidence 8888888889999999999999743 224678877654222233333433 344555555555667778888 Q ss_pred eecChHHHhhHHHhhcCceeEE-eecCCCccccCccccceecCceeEEecCCcccCCCccccCcccCCCCCcccceEEEe Q lcl|NC_018863. 224 AFMPIGVQADFTNNLLDRQRVI-QPSQAGGFSTGFSINQFLSTRGAINLHGSTIMENDNILVDRIPEPNAPQAPASVVAT 302 (479) Q Consensus 224 ~~mp~~vka~f~q~~~~~qrv~-~~~n~g~~~~G~~V~~~~ss~g~I~L~~s~v~~a~~~lver~~s~~aP~~P~~vta~ 302 (479) .+|++.+.+.+...-...-|.+ ++...+.. + .|+|-+| +..+.. |.. T Consensus 284 ~v~n~~~~~~L~~lkd~~g~~l~~~~~~~~~------~---------~l~G~pv-----~~~~~~-----p~~------- 331 (390) T protein:vir:10 284 IVINPIDWAAIELAKDANNQYLIGNARGTLT------P---------TLWGLPV-----VATQAM-----APG------- 331 (390) T ss_pred EEEcHHHHHHHHHhhcCCCceeecCCcCcCC------c---------eecceee-----EEcCCC-----CCC------- Confidence 9999999988887554332333 22211100 0 1122221 111111 100 Q ss_pred ecccccCcccccccceeeEEEEEEEcCCCCcccccceeeeeecCCCeEEEEEeecCCccccceEEEEEeccCCCCcEEEE Q lcl|NC_018863. 303 VKVNDKGAFRPVKDIKTHSYKVVVHSDDAESLASEAVTAVVANPTDSVSLAVKLQSLYQAKPQFISVYRQGNETGHYFLV 382 (479) Q Consensus 303 ~~~~~~g~~~~~sd~g~Y~YkV~a~n~~GES~~S~~VtaT~a~~~~~V~LtIt~~~~~~~~~~y~~IYR~t~~~g~~~~i 382 (479) ..+ .|.+++.+..+.+.|-+....-. ..-...+.+.+.+.-- .+ ..|+|..+ -....+ T Consensus 332 -------~~~----~gdf~~~~~~~~~~~~~i~~~~~--~~~~~~~~~~~r~~~r-~d------~~v~~~~a--~~~~~~ 389 (390) T protein:vir:10 332 -------EFL----VGAFDLAAQIFDQWDARVEIGYV--NDDFQRNMVTVLAEER-LA------LVVYRPEA--LISGSF 389 (390) T ss_pred -------cEE----EEeccceEEEEEecceEEEEeec--ccccccCcEEEEEEEe-ec------cEEecccc--EEEEEe Confidence 000 01111111111111111100000 0000011111110000 00 00000000 000000 Q ss_pred E Q lcl|NC_018863. 383 A 383 (479) Q Consensus 383 ~ 383 (479) + T Consensus 390 a 390 (390) T protein:vir:10 390 A 390 (390) T ss_pred C Confidence 0 No 49 >protein:vir:80684 Length: 315 # NCBI annotation: gp6 # Family: family:all:966 # MgeID: mge:1884 # MgeName: PA6 # Cross-refs: genbank:acc:YP_001285582;genbank:gi:148727088;genbank:GeneID:5247055 Probab=96.46 E-value=0.00026 Score=40.28 Aligned_cols=289 Identities=11% Similarity=0.054 Sum_probs=132.4 Q ss_pred hhcCcccCcccccCccccchhhhHHHHHHHhhccccccchhhhccchhHHHHHHhhhhhccCcccccccccccccccccC Q lcl|NC_018863. 31 FTTGTGITPDTQHDAAALRRELLDDQVKMLAFTNGDFTIYPLINKQQVNSTVAKYAVFNQHGRTGHSRFVREVGVASIND 110 (479) Q Consensus 31 f~ag~~~~~~~~~~gaAlr~esld~~i~~l~~~~~~f~~~~~i~k~~~~stv~~y~~~~~~G~~g~~~fv~E~g~~~~~d 110 (479) |..| +-+.|+.|-.+.+..+|........ .+.+-....+..+--.+| ....+.+...+++|++..+.++ T Consensus 1 Ma~~------~~~~gg~~vP~~~~~~ii~~l~~~s--~i~~l~~~i~~~~~~~~i---p~~~~~~~a~wv~Eg~~~~~s~ 69 (315) T protein:vir:80 1 MADD------FLSAGKLELPGSMIGAVRDRAIDSG--VLAKLSPEQPTIFGPVKG---AVFSGVPRAKIVGEGEVKPSAS 69 (315) T ss_pred CCCC------cCCcCceEcchHHHHHHHHHHHhhc--hhhhhcceeecCCCceEE---EEEeCCcceEEeeCCccccccc Confidence 3332 2334556667777666644444332 222222222222221223 2334445677999999999999 Q ss_pred cceEEEEEEEEeeeehhhhhhhHhhhcc---hhhHHHHHHHHHHHHHHHHHHHHHhhcccccCCCCCCcccchhhhHHHh Q lcl|NC_018863. 111 PNIRQKTVQMKFLSDTKQQSLAAGLVNN---IADPMTILTEDAISVIAKSIEWAIFYGDAALAAEADNQAGIEFDGLTKL 187 (479) Q Consensus 111 ~~~~r~~~~~k~l~~~~~vs~~~~lv~~---~~dp~~~~~~~ai~~~~~~~e~a~f~Gd~~l~~~~~~~~gleFDGl~~~ 187 (479) +.+.+.....|=|+.--.+|..+-..+. +...+....++-...+++.+|.++|+|+..-. |....|+... T Consensus 70 ~~f~~v~l~~~kl~~~~~iS~ell~~s~~~~~~~l~~~i~~~la~ai~~~~d~a~~~G~~~~~-------~~~~~~~~~~ 142 (315) T protein:vir:80 70 VDVSAFTAQPIKVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPAT-------GKAASAVHTS 142 (315) T ss_pred cceeeeEeeeeeEEeeehhhHHHhhcCchhHHHHHHHHHHHHHHHHHHHHHhhheeeccCCCC-------Cccccccccc Confidence 9999999999989888788876543332 23356777888888999999999999985432 2235677777 Q ss_pred hccCCcEEEccCCCCCHHHhhhhhheeecccCceeeeecChHHHhhHHHhhcC------ceeEEeecCCC--ccccCccc Q lcl|NC_018863. 188 IDEATNVIDLKGERLDEATLNKAAVIVGKGYGRATDAFMPIGVQADFTNNLLD------RQRVIQPSQAG--GFSTGFSI 259 (479) Q Consensus 188 I~~~~NviDarG~~l~~~~l~~aa~~i~~~fG~atd~~mp~~vka~f~q~~~~------~qrv~~~~n~g--~~~~G~~V 259 (479) +....+.+++-+... .+.++.-+.+....+...+-..|++.+...+...... .+........+ +.-.|.+| T Consensus 143 ~~~~~~~~~~~~~~~-~d~~~~~~~~~~~~~~~~~~~imn~~~~~~L~~l~~~~g~~~~g~~~~~~~~~g~~~tl~G~PV 221 (315) T protein:vir:80 143 LNKTKNIVDATDSAT-ADLVKAVGLIAGAGLQVPNGVALDPAFSFALSTEVYPKGSPLAGQPMYPAAGFAGLDNWRGLNV 221 (315) T ss_pred cccccceeeccccch-HHHHHHHHHHhhccCccceEEEEcHHHHHHHHHHhhccCCcccccccccccccCCCceecceee Confidence 766677888877643 3444433344444455555688999998888655321 11111111111 12333333 Q ss_pred c--ceecCc--------eeEEe-cCCc--ccCCCccccCcccCCCCCcccceEEEeecccccCcccccccceeeEEEEE- Q lcl|NC_018863. 260 N--QFLSTR--------GAINL-HGST--IMENDNILVDRIPEPNAPQAPASVVATVKVNDKGAFRPVKDIKTHSYKVV- 325 (479) Q Consensus 260 ~--~~~ss~--------g~I~L-~~s~--v~~a~~~lver~~s~~aP~~P~~vta~~~~~~~g~~~~~sd~g~Y~YkV~- 325 (479) - ...... ..+-+ .++. +....+..++.....+.-..+.+....-........+-+..+-.-.--|+ T Consensus 222 ~~~~~~~~~~~~~~~~~~~~~~GDfs~~~~g~~~~~~i~i~~~~~~~~~~~~~~~~~~v~~r~~~r~~~~v~~~~a~~~l 301 (315) T protein:vir:80 222 GASSTVSGAPEMSPASGVKAIVGDFSRVHWGFQRNFPIELIEYGDPDQTGRDLKGHNEVMVRAEAVLYVAIESLDSFAVV 301 (315) T ss_pred EecCcCCcccccccccccEEEEeecccEEEEEecCeeEEEeccccccCcccchhhcCcEEEEEEEEecceeecccceEEE Confidence 1 101000 00000 0110 00011111111111100000000000000000000000000000000000 Q ss_pred -EEcCCCCcccccc Q lcl|NC_018863. 326 -VHSDDAESLASEA 338 (479) Q Consensus 326 -a~n~~GES~~S~~ 338 (479) .....-.++|-+- T Consensus 302 ~~~~a~~~~~~~~~ 315 (315) T protein:vir:80 302 KEKAAPKPNPPAEN 315 (315) T ss_pred eeccCCCCCCCCCC Confidence 0001111111111 No 50 >protein:vir:4856 Length: 293 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:106 # MgeName: DT1 # Cross-refs: genbank:acc:NP_049396;genbank:gi:9632424;genbank:GeneID:1258532 Probab=96.44 E-value=0.0002 Score=40.84 Aligned_cols=254 Identities=11% Similarity=0.091 Sum_probs=125.0 Q ss_pred HHHHhhcCcccCcccccCccccchhhhHHHHHHHhhccccccchhhhccchhHHHHHHhhhhhccC-ccccccccccccc Q lcl|NC_018863. 27 VSKSFTTGTGITPDTQHDAAALRRELLDDQVKMLAFTNGDFTIYPLINKQQVNSTVAKYAVFNQHG-RTGHSRFVREVGV 105 (479) Q Consensus 27 ~~Ksf~ag~~~~~~~~~~gaAlr~esld~~i~~l~~~~~~f~~~~~i~k~~~~stv~~y~~~~~~G-~~g~~~fv~E~g~ 105 (479) ++++++++ +.++|++|-.+-+..+|..+...... +.+.....+.....-++ .+..+. +.+...+++|++. T Consensus 1 ~l~~~~~~------t~~~gg~liP~~~~~~Ii~~~~~~~~--l~~~~~~~~~~~~~g~~-~~~~~~~~~~~a~~v~Eg~~ 71 (293) T protein:vir:48 1 MLDSKTDH------SGSDAGLTIPQDIRTAINTLVRQYDS--LQEYVNVENVTTLTGSR-VYEKWTDITGLANIDDEAGK 71 (293) T ss_pred Cceeeccc------ccCcCceEechhHHHHHHHHHHhhhh--hhhhceeeeccCCcceE-EEEeecCCCcceeeecCCcc Confidence 66777764 33567888888888888655554443 33333333333332333 233343 3445789999988 Q ss_pred cc-ccCcceEEEEEEEEeeeehhhhhhhHhhhcchhhHHHHHHHHHHHHHHHHHHHHHhhcccccCCCCCCcccchhhhH Q lcl|NC_018863. 106 AS-INDPNIRQKTVQMKFLSDTKQQSLAAGLVNNIADPMTILTEDAISVIAKSIEWAIFYGDAALAAEADNQAGIEFDGL 184 (479) Q Consensus 106 ~~-~~d~~~~r~~~~~k~l~~~~~vs~~~~lv~~~~dp~~~~~~~ai~~~~~~~e~a~f~Gd~~l~~~~~~~~gleFDGl 184 (479) .. .+++.+.+....+|-++....+|.-+- .++.-|.+....+.--..++..++.++|.|..+.... ..-+-+|.| T Consensus 72 ~~~~~~~~~~~i~l~~~k~~~~~~iS~ell-~ds~~~l~~~i~~~la~~~~~~~~~~i~~g~~~~~~~---~~~~~~d~i 147 (293) T protein:vir:48 72 IADIDDPKLSLIKYTIKRYAGISTVTNSLL-ADSAENILAWLSGWIAKKVVVTRNKAILGVVDKLPTK---PTLTKWDDI 147 (293) T ss_pred cccccccceeEEEEeeeEEEEeehhhHHHH-hhhhHHHHHHHHHHHHHHHHHHHHhHHhhcccccccc---ccccCHHHH Confidence 65 678999999999999998877776442 3445577778888888889999999999998775543 234556666 Q ss_pred HHhhccCCcEEEccCCCCCHHHhhhhhheeecccCceeeeecChHHHhhHHHhhcCceeEEeecCCCc----cccCcccc Q lcl|NC_018863. 185 TKLIDEATNVIDLKGERLDEATLNKAAVIVGKGYGRATDAFMPIGVQADFTNNLLDRQRVIQPSQAGG----FSTGFSIN 260 (479) Q Consensus 185 ~~~I~~~~NviDarG~~l~~~~l~~aa~~i~~~fG~atd~~mp~~vka~f~q~~~~~qrv~~~~n~g~----~~~G~~V~ 260 (479) .+++.+ +..+|......+||..+.+.+...-...-|.+...+..+ .-.|.+|. T Consensus 148 ~~~~~~-----------------------l~~~~~~~a~~vmn~~~~~~L~~lkd~~g~~l~~~~~~~~~~~~l~G~Pv~ 204 (293) T protein:vir:48 148 IDLEAK-----------------------VDPAIKQTSFFLTNTSGFTALKKVKNALGDYLMERDVKSPTGYSIAGFAVK 204 (293) T ss_pred HHHHHh-----------------------hhhhhcCCCEEEEcHHHHHHHHHhhccCCceEeecCcCCCCCceecceeeE Confidence 666542 112233334556677776666654433223222222111 12333331 Q ss_pred cee----c--C--ceeEE-ecCCc-c--cCCCccccCcccC------------------CCCCcccc---eEEEeecccc Q lcl|NC_018863. 261 QFL----S--T--RGAIN-LHGST-I--MENDNILVDRIPE------------------PNAPQAPA---SVVATVKVND 307 (479) Q Consensus 261 ~~~----s--s--~g~I~-L~~s~-v--~~a~~~lver~~s------------------~~aP~~P~---~vta~~~~~~ 307 (479) -.. . . ...+- -.++. + .+..+..+++... .-.+.-|. ...-+..++. T Consensus 205 ~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~r~~~r~d~~~~~~~a~~~l~~~~~~~~ 284 (293) T protein:vir:48 205 EISDRWLPNASSGVMPLYFGDLKQAVTLFDRQQMSLLSTNIGGGAFETDTTKVRVIDRFDVVATDTEAFVPASFKAIADQ 284 (293) T ss_pred EecccccCCccCCceEEEEEeccceEEEEEecceEEEEecccchhhhcCeEEEEEEEeeCcEEecccceEEEEeeccccC Confidence 100 0 0 00000 00110 0 0011111110000 00000010 0111111222 Q ss_pred cCcccccccceeeEEEEEEEcCCCCcccccce Q lcl|NC_018863. 308 KGAFRPVKDIKTHSYKVVVHSDDAESLASEAV 339 (479) Q Consensus 308 ~g~~~~~sd~g~Y~YkV~a~n~~GES~~S~~V 339 (479) ++++.+ .+| T Consensus 285 ~~~~~~-----------------------~~~ 293 (293) T protein:vir:48 285 KGNIGS-----------------------TAV 293 (293) T ss_pred Cccccc-----------------------cCC Confidence 222211 111 No 51 >protein:vir:7324 Length: 335 # NCBI annotation: hypothetical protein # Family: family:all:1903 # MgeID: mge:143 # MgeName: epsilon15 # Cross-refs: genbank:acc:NP_848215;genbank:gi:30387386;genbank:GeneID:2641870 Probab=96.43 E-value=0.00011 Score=42.19 Aligned_cols=273 Identities=15% Similarity=0.061 Sum_probs=127.1 Q ss_pred CcccccccceeeeecCchhHHHHHHHHHHHhhcCcccCcccccCccccchhhhHHHHHHHhhccccccchhhhccchhHH Q lcl|NC_018863. 1 MTELQKEQKVEARKLPAGAEAELAELVSKSFTTGTGITPDTQHDAAALRRELLDDQVKMLAFTNGDFTIYPLINKQQVNS 80 (479) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~e~~~Ksf~ag~~~~~~~~~~gaAlr~esld~~i~~l~~~~~~f~~~~~i~k~~~~s 80 (479) ||.+. ++++-+.++ +|-+-+ +. -.+-..|. |+.++. +|.+++=...++ T Consensus 1 m~~~~---------~~a~TL~E~----Akr~~~------d~---~~~~IIE~-------l~~tne---IL~~lpf~e~N~ 48 (335) T protein:vir:73 1 MALIG---------QTLPSLLDI----YNRTDK------NG---RIARIVEQ-------LAKTND---ILTDAIYVPCND 48 (335) T ss_pred CCcCC---------CCchhHHHH----HhhcCc------ch---hHHHHHHH-------HhcCch---HHhhcchhcccC Confidence 66552 333333333 333321 00 00112222 222221 233333222221 Q ss_pred -HHHHhhhhhccCcccccccccccccccccCcceEEEEEEEEeeeehhhhhhhHhhhcc-hhhHHHHHHHHHHHHHHHHH Q lcl|NC_018863. 81 -TVAKYAVFNQHGRTGHSRFVREVGVASINDPNIRQKTVQMKFLSDTKQQSLAAGLVNN-IADPMTILTEDAISVIAKSI 158 (479) Q Consensus 81 -tv~~y~~~~~~G~~g~~~fv~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lv~~-~~dp~~~~~~~ai~~~~~~~ 158 (479) |=|++.+ +-+--...|-.=.....-+.++..|++..++.|..-..|-+...-.++ ..+-+++|.+.-|..+.+.+ T Consensus 49 ~tg~~~~v---rt~LP~~~fR~lN~g~~~s~~tt~qvt~~l~ilgg~~eVDr~La~~~Gn~a~~ra~e~~~~ikam~q~~ 125 (335) T protein:vir:73 49 GSKHKTTI---RAGIPEPVWRRYNQGVQPTKTQTVPVTDTTGMLYDLGFVDKALADRSNNAAAFRVSENMGKLQGFNNKV 125 (335) T ss_pred CcccceeE---EEecCCchhhhcCCccccccceEEEEEEEEEEecchhhhhHHHHhhcCCHHHHHHHHHHHHHHHHHHHH Confidence 1121111 112222334221223344569999999999999999999887665544 67889999999999999999 Q ss_pred HHHHhhcccccCCCCCCcccchhhhHHHhhc--------cCCcEEEccCCCCCHHHhhhhhheeecccCceeeeecChHH Q lcl|NC_018863. 159 EWAIFYGDAALAAEADNQAGIEFDGLTKLID--------EATNVIDLKGERLDEATLNKAAVIVGKGYGRATDAFMPIGV 230 (479) Q Consensus 159 e~a~f~Gd~~l~~~~~~~~gleFDGl~~~I~--------~~~NviDarG~~l~~~~l~~aa~~i~~~fG~atd~~mp~~v 230 (479) +..+||||++.+| -+||||.+.+. .+.|+||+.|.--..-.|+ ++.=+=..+ ..+-|-+- T Consensus 126 ~~~~iyGDsa~~p-------~~FdGL~kR~~~~st~~a~~a~~iIdaGGtG~~~TSi~----~v~wg~~~~-~giyPkG~ 193 (335) T protein:vir:73 126 ARYSIYGNTDAEP-------EAFMGLAPRFNTLSTSKAASAENVFSAGGSGSTNTSIW----FMSWGENTA-HMIYPEGM 193 (335) T ss_pred HHHhccCCcCCCh-------hhccchhhhhcCccccccCcccceeeccccccCceEEE----EEEEcCCee-EEEcccCc Confidence 9999999999887 37999999872 2469999998544332222 111111222 33448888 Q ss_pred HhhHHHhhcCceeEEeecCCCc--------cccCccccceecCce--eEEec-------------------------CCc Q lcl|NC_018863. 231 QADFTNNLLDRQRVIQPSQAGG--------FSTGFSINQFLSTRG--AINLH-------------------------GST 275 (479) Q Consensus 231 ka~f~q~~~~~qrv~~~~n~g~--------~~~G~~V~~~~ss~g--~I~L~-------------------------~s~ 275 (479) |+-|+-.-++.|.+...+...- --.|..|..|.++-= +|+.. .+. T Consensus 194 kaGl~~~d~g~~~~~d~~G~~y~~~~~~~~w~~Gl~i~d~r~vvRI~NIdvs~l~~d~~~~~~l~~lmi~a~~~~~ip~~ 273 (335) T protein:vir:73 194 VAGFQHEDLGDDLVSDGNGGQFRAYRDEFKWDIGLSVRDWRSISRICNIDVTTLTKDASTGADLISMMVDAYYARDVAML 273 (335) T ss_pred cccceeeeccceeeecCCCCEEeEEEeeeeeeeeeEEeCcccEEEEeecccccccccccchhhHHhhHHHHHHHHhccCC Confidence 8888777777777663331110 122333333322100 11100 010 Q ss_pred ccCCCccccCc-------c--cC--------CCCCcccceEEEeecccccCcccccccceeeEEEEEE Q lcl|NC_018863. 276 IMENDNILVDR-------I--PE--------PNAPQAPASVVATVKVNDKGAFRPVKDIKTHSYKVVV 326 (479) Q Consensus 276 v~~a~~~lver-------~--~s--------~~aP~~P~~vta~~~~~~~g~~~~~sd~g~Y~YkV~a 326 (479) -+..+.++..| . .+ .+....+....-. -+--.-.-..+. .=+|+| T Consensus 274 ~~~~~~~y~n~~v~~~L~~q~~~~~n~~l~~~~~~g~~~t~~~g-ipir~~Dail~t-----E~~v~~ 335 (335) T protein:vir:73 274 GDGKEVIYANKTIHAWLHKQAMNAKNVNLTIEEYGGKKIVSFLG-IPIRRVDAILNT-----ESAVTA 335 (335) T ss_pred CCCceEEEechHHHHHHHHHHhccCceeeeeeccCCceeEEECC-eEEEEEeeeecC-----cccccC Confidence 11111122211 0 00 0000011000000 000000000011 113334 No 52 >protein:vir:95763 Length: 297 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1578 # MgeName: SMP # Cross-refs: genbank:acc:YP_950590;genbank:gi:119953785;genbank:GeneID:5076833 Probab=96.41 E-value=0.00065 Score=38.05 Aligned_cols=291 Identities=12% Similarity=0.070 Sum_probs=138.2 Q ss_pred HHHHHhhcCcccCcccccCccccchhhhHHHHHHHhhccccccchhhhccchhHHHHHHhhhhhccCccccccccccccc Q lcl|NC_018863. 26 LVSKSFTTGTGITPDTQHDAAALRRELLDDQVKMLAFTNGDFTIYPLINKQQVNSTVAKYAVFNQHGRTGHSRFVREVGV 105 (479) Q Consensus 26 ~~~Ksf~ag~~~~~~~~~~gaAlr~esld~~i~~l~~~~~~f~~~~~i~k~~~~stv~~y~~~~~~G~~g~~~fv~E~g~ 105 (479) +-.+.|.+. +..+.+++++|-++.+.++|..+..... .+++...+.+..+.-..+ +..-.+.....+++|++. T Consensus 1 m~~~~~~~~---~~~~t~~~~~lvP~~~~~~ii~~~~~~s--~l~~~~~~~~~~~~~~~~--~~~~~~~~~a~~v~Eg~~ 73 (297) T protein:vir:95 1 MTVQTFNPE---NVLVSQKKDGTLHKEFTDIIMKEVAQNS--LVMQLGQYQEMEGEQEKT--VYVQTDGISAYWVNETEK 73 (297) T ss_pred CCccccccc---cccccCCCcceechhHHHHHHHHHHhhc--hhhhhcceeecCCCccEE--EEEEcCCceeEEeecCcc Confidence 111222222 1122346667888888888766665444 344444444443221111 111222335679999999 Q ss_pred ccccCcceEEEEEEEEeeeehhhhhhhHhhhcchhhHHHHHHHHHHHHHHHHHHHHHhhcccccCCCCCCcccchhhhHH Q lcl|NC_018863. 106 ASINDPNIRQKTVQMKFLSDTKQQSLAAGLVNNIADPMTILTEDAISVIAKSIEWAIFYGDAALAAEADNQAGIEFDGLT 185 (479) Q Consensus 106 ~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lv~~~~dp~~~~~~~ai~~~~~~~e~a~f~Gd~~l~~~~~~~~gleFDGl~ 185 (479) .+..++++.......+=++-.-.+|..+ +.++..|.+....+.--..+.+.+|.++|+|+.+-.+ .|+. T Consensus 74 ~~~~~~~f~~v~l~~~k~~~~~~is~el-l~ds~~~l~~~i~~~la~ai~~~~d~a~l~G~g~~~~----------~gi~ 142 (297) T protein:vir:95 74 IKTDKPEVVPVTLKAHKLGIILVTSREA-LNYTWKKFFEDMKPQIVEAFYKKIDEAGLLGHDTPFA----------NSVA 142 (297) T ss_pred ccccccceeEEEEeeEEEEEeehhhHHH-HhcCHHHHHHHHHHHHHHHHHHHHHHHHhcccCCccc----------cccc Confidence 9999999999999999999988888742 2345678888888888999999999999999865322 3566 Q ss_pred HhhccCCcEEEccCCCCCHHHhhhhhheeecccCceeeeecChHHHhhHHHhhcCceeEEeecCCCccccCccccceecC Q lcl|NC_018863. 186 KLIDEATNVIDLKGERLDEATLNKAAVIVGKGYGRATDAFMPIGVQADFTNNLLDRQRVIQPSQAGGFSTGFSINQFLST 265 (479) Q Consensus 186 ~~I~~~~NviDarG~~l~~~~l~~aa~~i~~~fG~atd~~mp~~vka~f~q~~~~~qrv~~~~n~g~~~~G~~V~~~~ss 265 (479) +.+.. .+... +.-++.+.|.++.-.+..++...+-..|++.+.+.+.......-|.+...+++ .-.|.+|....+. T Consensus 143 ~~~~~-~~~~~--~~~~t~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~L~~l~d~~G~~i~~~~~~-~l~G~Pv~~~~~~ 218 (297) T protein:vir:95 143 KAAKD-ANKVI--GGPINYDNILKLQDALYDADVEPNAFVSKIQNRSALREARDGNKVSIYDKAAN-TIDGITTVDLKSA 218 (297) T ss_pred ccccc-cceec--ccccCHHHHHHHHHHhhhccCCcCEEEEcHHHHHHHHHhhccCCceeecCCCC-cccceeeEeecCC Confidence 65542 23322 34456666666666667778888889999999999987554333333322221 1223322100000 Q ss_pred ceeEEecCCcccCCCccccCcccCCCCCcccceEEEe------ecccccCcccccccceeeEEEEEEEcCCCCcccccce Q lcl|NC_018863. 266 RGAINLHGSTIMENDNILVDRIPEPNAPQAPASVVAT------VKVNDKGAFRPVKDIKTHSYKVVVHSDDAESLASEAV 339 (479) Q Consensus 266 ~g~I~L~~s~v~~a~~~lver~~s~~aP~~P~~vta~------~~~~~~g~~~~~sd~g~Y~YkV~a~n~~GES~~S~~V 339 (479) ... .+..+..+........- ........ ...+..|..+..-..+.=.+++...-+.+-- T Consensus 219 --~~~-~~~~~~gd~s~~~~~~~------~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~------ 283 (297) T protein:vir:95 219 --RFE-KGDLLAGDFDNLIYGVP------YNITYKISEEGQISTITNADGTPINLFEQEMIAIRATMDIAVMIT------ 283 (297) T ss_pred --CCC-CceEEEEecccEEEEEe------cCeEEEEeeccccccccccCccchhhhhcCcEEEEEEEEeccEee------ Confidence 000 00000000000000000 00000000 0000000000000000001111111111111 Q ss_pred eeeeecCCCeEEEEEeecCC Q lcl|NC_018863. 340 TAVVANPTDSVSLAVKLQSL 359 (479) Q Consensus 340 taT~a~~~~~V~LtIt~~~~ 359 (479) ....-+.|+... ++ T Consensus 284 -----~~~a~~~l~~at-~~ 297 (297) T protein:vir:95 284 -----KTDAFAKLTPAE-RV 297 (297) T ss_pred -----cccceEEEeecC-CC Confidence 111111222111 11 No 53 >protein:vir:104085 Length: 320 # NCBI annotation: gp17 # Family: family:all:507 # MgeID: mge:1656 # MgeName: Che12 # Cross-refs: genbank:acc:YP_655596;genbank:gi:109392467;genbank:GeneID:4156953 Probab=96.40 E-value=0.00066 Score=38.01 Aligned_cols=286 Identities=11% Similarity=0.044 Sum_probs=129.1 Q ss_pred hhcCcccCcc-------cccCccccchhhhHHHHHHHhhccccccchhhhccchhHHHHHHhhhhhccCccccccccccc Q lcl|NC_018863. 31 FTTGTGITPD-------TQHDAAALRRELLDDQVKMLAFTNGDFTIYPLINKQQVNSTVAKYAVFNQHGRTGHSRFVREV 103 (479) Q Consensus 31 f~ag~~~~~~-------~~~~gaAlr~esld~~i~~l~~~~~~f~~~~~i~k~~~~stv~~y~~~~~~G~~g~~~fv~E~ 103 (479) |.+|...+++ +-++++.+-.+.+..++..+... ...+.+.+...++.+--.+|.++. +.....+++|+ T Consensus 1 ~~~~~~~~~~~~~~~~t~~~~~~~~ip~~~~~~ii~~~~~--~s~l~~~~~~~~~~~~~~~~p~~~---~~~~a~~v~E~ 75 (320) T protein:vir:10 1 MAAGTAFQVDHAQIAQTGDTMFKGYLEPEQAKDYFAEAEK--TSIVQQFAQKVPMGTTGQKIPHWI---GDVSAQWIGEG 75 (320) T ss_pred CCCCccCCHHHHHhhccccccccccccHHHHHHHHHHHHh--ccchhhhcceeeccCCceEEEEEe---CCcceEEecCC Confidence 3333332221 11223334444444555333332 233555555555544333444443 23346799999 Q ss_pred ccccccCcceEEEEEEEEeeeehhhhhhhHhhhcchhhHHHHHHHHHHHHHHHHHHHHHhhcccccCCCCCCcccchhhh Q lcl|NC_018863. 104 GVASINDPNIRQKTVQMKFLSDTKQQSLAAGLVNNIADPMTILTEDAISVIAKSIEWAIFYGDAALAAEADNQAGIEFDG 183 (479) Q Consensus 104 g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lv~~~~dp~~~~~~~ai~~~~~~~e~a~f~Gd~~l~~~~~~~~gleFDG 183 (479) +..+.+++++.+....++=++..-.+|+-+-. ++..|.+....+.-...+++.+|.++|.|+.+-.+ +.+.| T Consensus 76 ~~~~~~~~~f~~v~~~~~k~~~~~~is~ell~-ds~~~l~~~i~~~l~~a~a~~~d~a~l~G~g~~~~-------~~~~~ 147 (320) T protein:vir:10 76 DMKPITKGNMTSQNIAPHKIATIFVASAETVR-ANPANYLGTMRTKVATAFAMAFDSAALNGTDSPFP-------TYLAQ 147 (320) T ss_pred ccccccccceeEEEEeeEEEEEeehhhHHHHh-cChHHHHHHHHHHHHHHHHHHHHHHhhcccCCCCC-------ccccc Confidence 99999999999999999999988888876433 45568888888999999999999999999975322 22333 Q ss_pred HHHhhccCCcEEEccCCC---C--CHHHhhhhhheeecccCceeeeecChHHHhhHHHhhcCceeEEeecCCCccccCcc Q lcl|NC_018863. 184 LTKLIDEATNVIDLKGER---L--DEATLNKAAVIVGKGYGRATDAFMPIGVQADFTNNLLDRQRVIQPSQAGGFSTGFS 258 (479) Q Consensus 184 l~~~I~~~~NviDarG~~---l--~~~~l~~aa~~i~~~fG~atd~~mp~~vka~f~q~~~~~qrv~~~~n~g~~~~G~~ 258 (479) ..+. .++....|.- + ..+.+-.+...+...+....-..|++.+...+...-...-+.+.+........ . T Consensus 148 ~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~~~~-~- 221 (320) T protein:vir:10 148 TTKS----VSLADPGGATASDLTAYDAVAVNGLSLLVNAKKKWTHTLLDDIVEPILNGAKDKNGRPLFIESTYTDEN-S- 221 (320) T ss_pred cccc----ccceecccccccccccHHHHHHHHHhhhhcccCCCcEEEEcHHHHHHHHHhhccCCceeeccccccCcc-c- Confidence 3222 2333333321 1 12234444555556677778899999999999865543323332221111000 0 Q ss_pred ccceecCceeEEecCCcccCCCccccCcccCCC-----------------CCcccceE--EEeecccccCc---cccccc Q lcl|NC_018863. 259 INQFLSTRGAINLHGSTIMENDNILVDRIPEPN-----------------APQAPASV--VATVKVNDKGA---FRPVKD 316 (479) Q Consensus 259 V~~~~ss~g~I~L~~s~v~~a~~~lver~~s~~-----------------aP~~P~~v--ta~~~~~~~g~---~~~~sd 316 (479) .+.+.++++.+.+..+..+..+ ......+. .........+. .|.. + T Consensus 222 -----------~~~~~~i~g~pv~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~-~ 289 (320) T protein:vir:10 222 -----------PFRAGRIVSRPTILSDHVADGTTVGYMGDFRNVIWGQVGGLSFDVTDQATLNLGTPTEPNFVSLWQH-N 289 (320) T ss_pred -----------cccCceeeeeeeEecCCCCCCceEEEEeecceEEEEEecCeEEEEeecceeeeccccccccchhhhc-C Confidence 0111111111111111100000 00000000 00000000000 0000 0 Q ss_pred ceeeEEEEEEEcCCCCcccccceeeeeecCCCe Q lcl|NC_018863. 317 IKTHSYKVVVHSDDAESLASEAVTAVVANPTDS 349 (479) Q Consensus 317 ~g~Y~YkV~a~n~~GES~~S~~VtaT~a~~~~~ 349 (479) .-.+++...-+..--.+...+..+.....++ T Consensus 290 --~~~~r~~~~~d~~v~~~~a~~~l~~~~ap~~ 320 (320) T protein:vir:10 290 --LVAVRVEAEYAFHNNDKDAFVKLTNVVTPDA 320 (320) T ss_pred --cEEEEEEEeeccEEecccceEEEEeccCCCC Confidence 0011111110000000111111111111111 No 54 >protein:vir:2504 Length: 305 # NCBI annotation: major capsid subunit gp9 # Family: family:all:507 # MgeID: mge:53 # MgeName: TM4 # Cross-refs: genbank:acc:NP_569745;genbank:gi:18496895;genbank:GeneID:932268 Probab=96.39 E-value=0.00056 Score=38.42 Aligned_cols=287 Identities=12% Similarity=0.125 Sum_probs=134.3 Q ss_pred cCcccccCccccchhhhHHHHHHHhhccccccchhhhccchhHHHHHHhhhhhccCccccccccccccc-----ccccCc Q lcl|NC_018863. 37 ITPDTQHDAAALRRELLDDQVKMLAFTNGDFTIYPLINKQQVNSTVAKYAVFNQHGRTGHSRFVREVGV-----ASINDP 111 (479) Q Consensus 37 ~~~~~~~~gaAlr~esld~~i~~l~~~~~~f~~~~~i~k~~~~stv~~y~~~~~~G~~g~~~fv~E~g~-----~~~~d~ 111 (479) ....+-++|+.|-.+.+.++|........ .+.+.+...+..+.-.+|.+ ..+.....+++|++. .+.+++ T Consensus 1 ma~~t~~~gg~liP~~~~~~Ii~~~~~~s--~l~~l~~~~~~~~~~~~~p~---~~~~~~a~wv~E~~~~~~~~~~~s~~ 75 (305) T protein:vir:25 1 MADISRAEVASLIQEAYSDTLLAAAKQGS--TVLSAFQNVNMGTKTTHLPV---LATLPEADWVGESATDPKGVKPTSKV 75 (305) T ss_pred CCCccCCccceecCHHHHHHHHHHHHhhc--hhhhhcceeeccCCcEEEEE---EeCCcceEEeeccccccccccccccc Confidence 33334566888889999888865555544 34555544444433233323 333445678999874 456789 Q ss_pred ceEEEEEEEEeeeehhhhhhhHhhhcchhhHHHHHHHHHHHHHHHHHHHHHhhcccccCCCCCCcccchhhhHHHhhccC Q lcl|NC_018863. 112 NIRQKTVQMKFLSDTKQQSLAAGLVNNIADPMTILTEDAISVIAKSIEWAIFYGDAALAAEADNQAGIEFDGLTKLIDEA 191 (479) Q Consensus 112 ~~~r~~~~~k~l~~~~~vs~~~~lv~~~~dp~~~~~~~ai~~~~~~~e~a~f~Gd~~l~~~~~~~~gleFDGl~~~I~~~ 191 (479) .+.+....++=++.--.+|.-+ +.++..|.+....+.-...+++.+|.++|+|+.+-. |++=-+........ T Consensus 76 ~f~~i~~~~~k~~~~~~is~el-l~ds~~~~~~~i~~~l~~~~a~~~d~a~~~G~g~~~-------~~~~~~~~~~~~~~ 147 (305) T protein:vir:25 76 TWANRTLVAEEIAVIIPVHENV-IDDATVAVLTEVAELGGQAIGKKLDQAVIFGTDKPA-------SWVSPALIPAAVTA 147 (305) T ss_pred ceeeEEeeeEEEEEeehhhHHH-HhcchHHHHHHHHHHHHHHHHHHHhhhheeccCCCC-------Cccccccccccccc Confidence 9999999999888888888743 235667889999999999999999999999986411 11111222222223 Q ss_pred CcEEEccCCCCC-H---HHhhhhhheeecccCceeeeecChHHHhhHHHhhcCceeEEeecCCCccccCccccceecCce Q lcl|NC_018863. 192 TNVIDLKGERLD-E---ATLNKAAVIVGKGYGRATDAFMPIGVQADFTNNLLDRQRVIQPSQAGGFSTGFSINQFLSTRG 267 (479) Q Consensus 192 ~NviDarG~~l~-~---~~l~~aa~~i~~~fG~atd~~mp~~vka~f~q~~~~~qrv~~~~n~g~~~~G~~V~~~~ss~g 267 (479) .+.....+.... . +.+.++...+...+..++..+|++...+.+...-....|.+.+.+ .-.|.+|. .+... T Consensus 148 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~lkd~~G~~i~~~~---~l~G~Pv~--~~~~~ 222 (305) T protein:vir:25 148 GQAVEVVGGVANESDIVGATNRAAKAVASAGWAPDTLLSSLALRYEVANIRDANGNPVFRDD---SFAGFRTF--FNRNG 222 (305) T ss_pred cccccccccchhhhHHHHHHHHHHHhhhhcccccceeEecHHHHHHHHHhhccCCceeecCC---cccccceE--EcCcc Confidence 344444443332 2 224455556667777788899999998888765443223332211 11222220 00000 Q ss_pred eEEec-CCcccCCC-cccc-CcccCCCCCcccceEEEeecccccCcccccccceeeEEEEEEEcCCCCcccccceeeeee Q lcl|NC_018863. 268 AINLH-GSTIMEND-NILV-DRIPEPNAPQAPASVVATVKVNDKGAFRPVKDIKTHSYKVVVHSDDAESLASEAVTAVVA 344 (479) Q Consensus 268 ~I~L~-~s~v~~a~-~~lv-er~~s~~aP~~P~~vta~~~~~~~g~~~~~sd~g~Y~YkV~a~n~~GES~~S~~VtaT~a 344 (479) +.... +..+.++. .++. .+ . + ......-..+- ..+... ......-...+.+..+.|.-.. T Consensus 223 ~~~~~~~~~~~gd~s~~~i~~~--~--~--~~i~~~~~~~~-~~~~~~-~~~~~~~~~~~R~~~r~~~~v~--------- 285 (305) T protein:vir:25 223 AWDADAAIEVIADSSRVKIGVR--Q--D--ITVKFLDQATL-GTGENQ-INLAERDMVALRLKARFAYVLG--------- 285 (305) T ss_pred CCCCCccEEEEEecceEEEEEe--c--C--eEEEEeeeeee-ecCCce-eeeeecCcEEEEEEEeecceee--------- Confidence 00000 00000000 0000 00 0 0 00000000000 000000 0000000111222222221111 Q ss_pred cCCCeEEEE------EeecC Q lcl|NC_018863. 345 NPTDSVSLA------VKLQS 358 (479) Q Consensus 345 ~~~~~V~Lt------It~~~ 358 (479) .+..-+.++ ++|+. T Consensus 286 ~p~a~v~~~~~~~~~~~pa~ 305 (305) T protein:vir:25 286 VSATAQGANKTPVAVVAPAA 305 (305) T ss_pred CcccEEEEccccccccCCCC Confidence 111111221 12221 No 55 >protein:vir:9759 Length: 303 # NCBI annotation: putative structural protein # Family: family:all:966 # MgeID: mge:175 # MgeName: 315.3 # Cross-refs: genbank:acc:NP_795521;genbank:gi:28876283;genbank:GeneID:1257824 Probab=96.39 E-value=0.00068 Score=37.96 Aligned_cols=297 Identities=11% Similarity=0.056 Sum_probs=139.3 Q ss_pred cccccCccccchhhhHHHHHHHhhccccccchhhhccchhHHHHHHhhhhhccCcccccccccccccccccCcceEEEEE Q lcl|NC_018863. 39 PDTQHDAAALRRELLDDQVKMLAFTNGDFTIYPLINKQQVNSTVAKYAVFNQHGRTGHSRFVREVGVASINDPNIRQKTV 118 (479) Q Consensus 39 ~~~~~~gaAlr~esld~~i~~l~~~~~~f~~~~~i~k~~~~stv~~y~~~~~~G~~g~~~fv~E~g~~~~~d~~~~r~~~ 118 (479) =.+.+.|+.|-.+.+..+|..+.... ..+.+-....+..+--.+|.++. +.+...+++|++..+.+++.+.+... T Consensus 1 m~t~t~gg~liP~~~~~~ii~~l~~~--s~i~~l~~~~~~~~~~~~ip~~~---~~~~a~wv~E~~~~~~s~~~f~~v~l 75 (303) T protein:vir:97 1 MGTETSKASLFDKHLVSDLINKVKGH--SSLAKLSSQKPIPFNGSKEFTFT---LDSDIDVVAENGKKTHGGLSLEPVTI 75 (303) T ss_pred CcccCCCCeEcchhHHHHHHHHHHhh--chhhhhcceeecCCCceEEEEEe---cCcceEEeecCccccccccceeeEEe Confidence 22445677788888888885554443 33444444444443222443333 33457899999999999999999999 Q ss_pred EEEeeeehhhhhhhHhhhc--chhhHHHHHHHHHHHHHHHHHHHHHhhcccccCCCCCCcccchhhhHHHhhccCCcEEE Q lcl|NC_018863. 119 QMKFLSDTKQQSLAAGLVN--NIADPMTILTEDAISVIAKSIEWAIFYGDAALAAEADNQAGIEFDGLTKLIDEATNVID 196 (479) Q Consensus 119 ~~k~l~~~~~vs~~~~lv~--~~~dp~~~~~~~ai~~~~~~~e~a~f~Gd~~l~~~~~~~~gleFDGl~~~I~~~~NviD 196 (479) ..|=++.--.+|.-+-.++ ...+.+....+..-..+++.+|.++++|+.+-... +..--|.........++.- T Consensus 76 ~~~kl~~~~~iS~ell~~~~d~~~~l~~~i~~~la~a~~~~ld~a~l~G~~~~~g~-----~~~~~~~~~~~~~~~~~~~ 150 (303) T protein:vir:97 76 VPIKVEYGARLSDEFLYATEEEKIDILKAFNEGFAKKLARGIDLMAMHGINPRTKK-----ASDVIGTNHFDSKVTQVVK 150 (303) T ss_pred eeEEEEEeehhhHHHhhcCccchHHHHHHHHHHHHHHHHHHHHhhhhcccccCCcc-----ccccccccccccccccccc Confidence 9999998888887654433 34477788899999999999999999997542211 1111111111111223222 Q ss_pred ccCCCCCHHHhhhhhheeecccCceeeeecChHHHhhHHHhhcCceeEE-eecCCCccccCccccceecCceeEEecCCc Q lcl|NC_018863. 197 LKGERLDEATLNKAAVIVGKGYGRATDAFMPIGVQADFTNNLLDRQRVI-QPSQAGGFSTGFSINQFLSTRGAINLHGST 275 (479) Q Consensus 197 arG~~l~~~~l~~aa~~i~~~fG~atd~~mp~~vka~f~q~~~~~qrv~-~~~n~g~~~~G~~V~~~~ss~g~I~L~~s~ 275 (479) .-+.....+.|.++.-.+..+++.++...|++.+...+...-...-+-+ ++.-+. |... .+ T Consensus 151 ~~~~~~~~~~i~~~~~~~~~~~~~~~~~vmn~~~~~~L~~lkd~~g~~~~~~~~~~----~~~~--------------~~ 212 (303) T protein:vir:97 151 FTESEDADANIEAAVNLIQGAEGVVTGLAMDTEFSTALAKVTNGEMGPKMYPELAW----GANP--------------DS 212 (303) T ss_pred cccccchHHHHHHHHHHHhhcCCCccEEEEcHHHHHHHHHhhccCCCeEEecCccC----CCCC--------------ce Confidence 2222233455565555566678888889999999998876543221222 222111 1111 11 Q ss_pred ccCCCccccCcccCCCCCcccceEEEeecccccCccccccccee---eEEEEEEEcCCCCcccccceeeeeecCCCeEEE Q lcl|NC_018863. 276 IMENDNILVDRIPEPNAPQAPASVVATVKVNDKGAFRPVKDIKT---HSYKVVVHSDDAESLASEAVTAVVANPTDSVSL 352 (479) Q Consensus 276 v~~a~~~lver~~s~~aP~~P~~vta~~~~~~~g~~~~~sd~g~---Y~YkV~a~n~~GES~~S~~VtaT~a~~~~~V~L 352 (479) +++.+.+..+..+.......+... -..|.|...-..+. ..+++ ..++....+.. . ....+-+.+ T Consensus 213 l~G~Pv~~s~~v~~~~~~~~~~~~------~~~Gdf~~~~~~~~~~~~~~~~---~~~~~~d~~~~-~---~~~~n~~~~ 279 (303) T protein:vir:97 213 INGLKSSVNTTVGAGADEAESKDL------VIIGDFESMFKWGYAKQIPMEI---IKYGDPDNSGK-D---LKGYNQIYL 279 (303) T ss_pred ecceeeEEecccCCccccCCCccE------EEEeeccccEEEEEecCcEEEE---eeccCCCCcch-h---hhhcCcEEE Confidence 222222222221111110001000 11122110000000 11111 11221111100 0 001111111 Q ss_pred EEeecCCccccceEEEEEeccCCCCcEEEEEEeee Q lcl|NC_018863. 353 AVKLQSLYQAKPQFISVYRQGNETGHYFLVARVPL 387 (479) Q Consensus 353 tIt~~~~~~~~~~y~~IYR~t~~~g~~~~i~rV~~ 387 (479) ...-- .+ ..|.| +.-|-.+...++ T Consensus 280 r~~~r-~~------~~v~~----p~af~~l~~~~~ 303 (303) T protein:vir:97 280 RAEAY-IG------WGILD----AKSFARVTKGEV 303 (303) T ss_pred EEEEE-ec------cEeec----ccceEEeeCCCC Confidence 11100 00 01111 222333333222 No 56 >protein:vir:2430 Length: 318 # NCBI annotation: major head subunit # Family: family:all:507 # MgeID: mge:52 # MgeName: D29 # Cross-refs: genbank:acc:NP_046832;genbank:gi:9630400;genbank:GeneID:1261582 Probab=96.30 E-value=0.00065 Score=38.07 Aligned_cols=302 Identities=11% Similarity=0.017 Sum_probs=136.1 Q ss_pred cCchhHHHHHHHHHHHhhcCcccCcccccCccccchhhhHHHHHHHhhccccccchhhhccchhHHHHHHhhhhhccCcc Q lcl|NC_018863. 15 LPAGAEAELAELVSKSFTTGTGITPDTQHDAAALRRELLDDQVKMLAFTNGDFTIYPLINKQQVNSTVAKYAVFNQHGRT 94 (479) Q Consensus 15 ~~~~~~~~~~e~~~Ksf~ag~~~~~~~~~~gaAlr~esld~~i~~l~~~~~~f~~~~~i~k~~~~stv~~y~~~~~~G~~ 94 (479) |-...+-..++ .+..++ +-++++.+-.+.+..++..+..... .+.+.....+..+.-.+|.++. +. T Consensus 1 ~~~~~~~~~e~--~~~~~~-------~~~~~~~~ip~~~~~~ii~~~~~~~--~l~~~~~~~~~~~~~~~ip~~~---~~ 66 (318) T protein:vir:24 1 MAAGTAFAVDH--AQIAQT-------GDTMFKGYLEPEQAKDYFAEAEKTS--IVQQFAQKVPMGTTGQKIPHWV---GD 66 (318) T ss_pred CCCCCCCCHHH--HHhhcc-------cCcccceeechhHHHHHHHHHHhhc--hhhhhcceeeccCCceEEEEEe---CC Confidence 11111101001 111222 1244556667777777755554443 3455555555544434444433 44 Q ss_pred cccccccccccccccCcceEEEEEEEEeeeehhhhhhhHhhhcchhhHHHHHHHHHHHHHHHHHHHHHhhcccccCCCCC Q lcl|NC_018863. 95 GHSRFVREVGVASINDPNIRQKTVQMKFLSDTKQQSLAAGLVNNIADPMTILTEDAISVIAKSIEWAIFYGDAALAAEAD 174 (479) Q Consensus 95 g~~~fv~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lv~~~~dp~~~~~~~ai~~~~~~~e~a~f~Gd~~l~~~~~ 174 (479) +...+++|++..+.+++.+.+.....+=++..-.+|.-+ +.++..|.+....+.-...+++.+|.++++|+.+-.+ T Consensus 67 ~~a~~v~Eg~~~~~~~~~f~~i~~~~~k~~~~~~iS~e~-l~ds~~~~~~~i~~~l~~~~~~~~d~a~l~G~g~~~~--- 142 (318) T protein:vir:24 67 VSAQWIGEGDMKPITKGNMTSQTIAPHKIATIFVASAET-VRANPANYLGTMRTKVATAFAMAFDGAAMHGTDSPFP--- 142 (318) T ss_pred cceEEecCCccccccccceeEEEEeeEEEEEeehhhHHH-hhcChHHHHHHHHHHHHHHHHHHHHHhhhcccCCCCC--- Confidence 567899999999999999999999999998877777732 2356678889999999999999999999999864221 Q ss_pred CcccchhhhHHHhhccCCcEEEccC-CCCCHHHhhhhhheeecccCceeeeecChHHHhhHHHhhcCceeEEeecCCCcc Q lcl|NC_018863. 175 NQAGIEFDGLTKLIDEATNVIDLKG-ERLDEATLNKAAVIVGKGYGRATDAFMPIGVQADFTNNLLDRQRVIQPSQAGGF 253 (479) Q Consensus 175 ~~~gleFDGl~~~I~~~~NviDarG-~~l~~~~l~~aa~~i~~~fG~atd~~mp~~vka~f~q~~~~~qrv~~~~n~g~~ 253 (479) .|+...+.. .+.-...+ .....+.+.++...+...+.......|++.+.+.+...-...-|-+...+..+. T Consensus 143 -------~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~~ 214 (318) T protein:vir:24 143 -------TYIGQTTKA-ISIADTTGATTVYDQVAVNGLSLLVNDGKKWTHTLLDDITEPILNGAKDQNGRPLFIESTYGE 214 (318) T ss_pred -------ccccccccc-ccccccccccchHHHHHHHHHHhhccccCCCCEEEEcHHHHHHHHHhhccCCceeecCccccC Confidence 233333321 11111111 122233344455556667777778999999999998655433333322222211 Q ss_pred cc----CccccceecCceeEEecC-----Cc--ccCCCccccCcccC---CCCCcccceEEEeecccccCccccccccee Q lcl|NC_018863. 254 ST----GFSINQFLSTRGAINLHG-----ST--IMENDNILVDRIPE---PNAPQAPASVVATVKVNDKGAFRPVKDIKT 319 (479) Q Consensus 254 ~~----G~~V~~~~ss~g~I~L~~-----s~--v~~a~~~lver~~s---~~aP~~P~~vta~~~~~~~g~~~~~sd~g~ 319 (479) .. |..+..+ ++.... .. +..+....+.+... -...........+.........|. . +. T Consensus 215 ~~~~~~~~~i~g~-----pv~~~~~~~~~~~~~~~gdfs~~~~~~~~~l~i~~~~~~~~~~~~~~~~~~~~~f~-~--~~ 286 (318) T protein:vir:24 215 AASPFRSGRIVAR-----PTILSDHVVEGTTVGFMGDFSQLIWGQIGGLSFDVTDQATLNLGTVESPNFVSLWQ-H--NL 286 (318) T ss_pred ccccccCceEEEE-----eeEEeCCCCCCccEEEEeecceEEEEEecCeEEEEeeccceeccccccccchhhhh-c--Cc Confidence 11 1111100 001000 00 01111001000000 000000000000000000001111 1 11 Q ss_pred eEEEEEEEcCCCCcccccceeeeeecCCC-eE Q lcl|NC_018863. 320 HSYKVVVHSDDAESLASEAVTAVVANPTD-SV 350 (479) Q Consensus 320 Y~YkV~a~n~~GES~~S~~VtaT~a~~~~-~V 350 (479) ..+++...-+..--.+...+..+....++ .. T Consensus 287 ~~~r~~~r~d~~v~~~~a~~~i~~~~a~~~~~ 318 (318) T protein:vir:24 287 VAVRVEAEYAFHCNDAEAFVALTNVVSGGGEG 318 (318) T ss_pred EEEEEEEEEccEEecccceEEEEeeccCCCCC Confidence 22222221111111111111111111111 11 No 57 >protein:vir:4830 Length: 397 # NCBI annotation: MPL-7201 # Family: family:all:21 # MgeID: mge:105 # MgeName: 7201 # Cross-refs: genbank:acc:NP_038327;genbank:gi:9634653;genbank:GeneID:1262632 Probab=96.28 E-value=0.00079 Score=37.58 Aligned_cols=313 Identities=12% Similarity=0.046 Sum_probs=137.1 Q ss_pred Cccc---ccccceeeeecCchhHHHHHHHHHHHhhcCcc-----cCcccccCccccchhhhHHHHHHHhhccccccchhh Q lcl|NC_018863. 1 MTEL---QKEQKVEARKLPAGAEAELAELVSKSFTTGTG-----ITPDTQHDAAALRRELLDDQVKMLAFTNGDFTIYPL 72 (479) Q Consensus 1 ~~~~---~~~~~~~~~~~~~~~~~~~~e~~~Ksf~ag~~-----~~~~~~~~gaAlr~esld~~i~~l~~~~~~f~~~~~ 72 (479) ..+. ....+.... ......++....+.+.+..+-. .+..+.++|+.|..+.+..+|..+..... .+++. T Consensus 66 ~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~~~gg~~iP~~~~~~ii~~~~~~~--~l~~~ 142 (397) T protein:vir:48 66 NEVVNMSEEEKKPLTK-SEEEVKAGFVKDFKNLVRGRYQNLLDSKTDASGSDAGLTIPQDIQTAIHTLVRQYD--SLQEY 142 (397) T ss_pred hhhhhhhhhccccccc-hhhHHHHHHHHHHHHHHhhhhhHHHHHhhccCCccccccccHHHHHHHHHHHHHHH--HHHhh Confidence 0000 000011111 1111222222333333322211 11223456888999999888866655544 34555 Q ss_pred hccchhHHHHHHhhhhhccCccccccccccccccc-ccCcceEEEEEEEEeeeehhhhhhhHhhhcchhhHHHHHHHHHH Q lcl|NC_018863. 73 INKQQVNSTVAKYAVFNQHGRTGHSRFVREVGVAS-INDPNIRQKTVQMKFLSDTKQQSLAAGLVNNIADPMTILTEDAI 151 (479) Q Consensus 73 i~k~~~~stv~~y~~~~~~G~~g~~~fv~E~g~~~-~~d~~~~r~~~~~k~l~~~~~vs~~~~lv~~~~dp~~~~~~~ai 151 (479) +....+.+...++.....-+..+...+++|++..+ ..++.+.+....++-++.-..+|.-+ +.++..|.+....+.-- T Consensus 143 ~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~v~~~~~k~~~~~~iS~el-l~ds~~~l~~~v~~~l~ 221 (397) T protein:vir:48 143 VNVENVTTLTGSRVYEKWADITGLAKLDDEAGSIGTNDDPKLYPIRYAIKRYAGISTVTNSL-LADSAENILAWLSGWIA 221 (397) T ss_pred hceeeccCCcceEEEEeecCCCcceeeeccccccccccccceeeEEeeheeeeeehhhHHHH-HhhchHHHHHHHHHHHH Confidence 55555554434443322233444567899998865 55799999999999999888888754 34556678888888888 Q ss_pred HHHHHHHHHHHhhcccccCCCCCCcccchhhhHHHhhccCCcEEEccCCCCCHHHhhhhhheeecccCceeeeecChHHH Q lcl|NC_018863. 152 SVIAKSIEWAIFYGDAALAAEADNQAGIEFDGLTKLIDEATNVIDLKGERLDEATLNKAAVIVGKGYGRATDAFMPIGVQ 231 (479) Q Consensus 152 ~~~~~~~e~a~f~Gd~~l~~~~~~~~gleFDGl~~~I~~~~NviDarG~~l~~~~l~~aa~~i~~~fG~atd~~mp~~vk 231 (479) ..++..++.+++.|+..-.+.+ ..+-+|+|.+++. .+...|......+|++.+. T Consensus 222 ~~~~~~~d~~il~G~g~~~~~~---~~~~~d~i~~~~~-----------------------~l~~~~~~~a~~v~n~~~~ 275 (397) T protein:vir:48 222 KKVVVTRNKAILEAIATLPTKP---TLTKWDDIIDLQA-----------------------KVDPAIKQTSFFLTNTSGF 275 (397) T ss_pred HHHHHHHHHHHhhccccccccc---ccccHHHHHHHHH-----------------------HhhhhhcCCCEEEECHHHH Confidence 9999999999999998755432 2344555544442 2233445556778899888 Q ss_pred hhHHHhhcCceeEEeecCCCc----cccCccccceecCceeEEecCCcccCCCccccCcccCCCC--CcccceEEEeecc Q lcl|NC_018863. 232 ADFTNNLLDRQRVIQPSQAGG----FSTGFSINQFLSTRGAINLHGSTIMENDNILVDRIPEPNA--PQAPASVVATVKV 305 (479) Q Consensus 232 a~f~q~~~~~qrv~~~~n~g~----~~~G~~V~~~~ss~g~I~L~~s~v~~a~~~lver~~s~~a--P~~P~~vta~~~~ 305 (479) +.+...-...-|-+.+.+..+ .-.|.+|.-..+. ++. +.-.++..++ .+-++... ... ......... T Consensus 276 ~~L~~lkd~~G~~i~~~~~~~~~~~~l~G~PV~~~~~~--~~~---~~~~~~~~~~-~gd~~~~~~~~~~-~~~~i~~~~ 348 (397) T protein:vir:48 276 TALKKVKNAFGDYLMERDVKSPTGYSIDGFAVKEVADR--WLA---NASSGAMPLY-FGDLKQAVTLFDR-QQMSLLSTN 348 (397) T ss_pred HHHHHhhcCCCceeeccCcCCCCCceeccceeEEeccc--ccC---CcCCCceEEE-EEeccceEEEEee-cceEEEEec Confidence 888765433333332222211 2233333110000 000 0000000000 00000000 000 000000000 Q ss_pred cccCcccccccceeeEEEEEEEcCCCCcccccceeeee----ecCCCeEEEEE Q lcl|NC_018863. 306 NDKGAFRPVKDIKTHSYKVVVHSDDAESLASEAVTAVV----ANPTDSVSLAV 354 (479) Q Consensus 306 ~~~g~~~~~sd~g~Y~YkV~a~n~~GES~~S~~VtaT~----a~~~~~V~LtI 354 (479) ..+..|. . +...|++...=+..-=.+...+..+. +......++-+ T Consensus 349 -~~~~~~~-~--~~~~~r~~~r~d~~~~~~~a~~~~~~~~~~~~~~~~~~~~~ 397 (397) T protein:vir:48 349 -IGGGAFE-T--DTTKIRVIDRFDVVATDTESFVPASFKAIADQKGNLGSTAV 397 (397) T ss_pred -cchhhhh-c--CceeEEEEeeeccEEecccceEEEEecccccCCCCccccCC Confidence 0001110 0 11122221111111001111111111 11111110000 No 58 >protein:vir:1638 Length: 298 # NCBI annotation: Structural protein # Family: family:all:966 # MgeID: mge:33 # MgeName: r1t # Cross-refs: genbank:acc:NP_695059;genbank:gi:23455750;genbank:GeneID:955469 Probab=96.18 E-value=0.00091 Score=37.27 Aligned_cols=290 Identities=11% Similarity=0.018 Sum_probs=141.6 Q ss_pred cccCccccchhhhHHHHHHHhhccccccchhhhccchhHHHHHHhhhhhccCcccccccccccccccccCcceEEEEEEE Q lcl|NC_018863. 41 TQHDAAALRRELLDDQVKMLAFTNGDFTIYPLINKQQVNSTVAKYAVFNQHGRTGHSRFVREVGVASINDPNIRQKTVQM 120 (479) Q Consensus 41 ~~~~gaAlr~esld~~i~~l~~~~~~f~~~~~i~k~~~~stv~~y~~~~~~G~~g~~~fv~E~g~~~~~d~~~~r~~~~~ 120 (479) =.+.|+.|-.+.+..+|..+.... ..+.+-..+.+..+--.+|.++ .+.+...+++|++..+.+|+.+.+..... T Consensus 1 ma~~gG~lvp~~~~~~ii~~~~~~--s~i~~l~~~~~~~~~~~~ip~~---~~~~~a~~v~E~~~~~~~~~~f~~v~l~~ 75 (298) T protein:vir:16 1 MVLNKGTLFDPTLVTDLISKVAGK--SSIARLSAQKPIPFNGEKVFTF---TMDSEIDVVAESGKKTHGGVTLAPQTMVP 75 (298) T ss_pred CcccCcceechhHHHHHHHHHHhh--hhhhhhcceeeccCCceEEEEE---ecCcceEEecCCccccccccceeEEEEee Confidence 114556666666677776555543 2344444445555433344333 33455789999999999999999999999 Q ss_pred EeeeehhhhhhhHhhhc--chhhHHHHHHHHHHHHHHHHHHHHHhhcccccCCCCCCcccchhhhHHHhhccCCcEEEc- Q lcl|NC_018863. 121 KFLSDTKQQSLAAGLVN--NIADPMTILTEDAISVIAKSIEWAIFYGDAALAAEADNQAGIEFDGLTKLIDEATNVIDL- 197 (479) Q Consensus 121 k~l~~~~~vs~~~~lv~--~~~dp~~~~~~~ai~~~~~~~e~a~f~Gd~~l~~~~~~~~gleFDGl~~~I~~~~NviDa- 197 (479) +=++..-.+|.-+-..+ ...|.+....+.-...+++.+|.++|+|...-. +....+.|+........+.... T Consensus 76 ~k~a~~~~iS~ell~~s~d~~~~l~~~i~~~la~ai~~~~d~~~l~G~~~~~-----g~~~~~~~~~~~~~~~~~~~~~~ 150 (298) T protein:vir:16 76 IKVEYGARISDEFMYASDEEKINILQEFNDGFAKKVARGIDLMAFHGVNPRL-----GTASAVIGTNHFDSKVTQKVEAP 150 (298) T ss_pred eeEEEeehhhHHHhhcCcccHHHHHHHHHHHHHHHHHHHHHHHhhccccCCC-----Ccccccccccccccccccccccc Confidence 99998888888764433 345677778888899999999999999964422 1122334443333322232222 Q ss_pred -cCCCCCHHHhhhhhheeecccCceeeeecChHHHhhHHHhhcCceeEEeecCCCccccCccccceecCceeEEecCCcc Q lcl|NC_018863. 198 -KGERLDEATLNKAAVIVGKGYGRATDAFMPIGVQADFTNNLLDRQRVIQPSQAGGFSTGFSINQFLSTRGAINLHGSTI 276 (479) Q Consensus 198 -rG~~l~~~~l~~aa~~i~~~fG~atd~~mp~~vka~f~q~~~~~qrv~~~~n~g~~~~G~~V~~~~ss~g~I~L~~s~v 276 (479) .+..+ .+.|.++...+..++.......|++.+.+.+...-...-|.+.+..+.+.. +.++ T Consensus 151 ~~~~~~-~~~i~~~~~~~~~~~~~~~~~vmn~~~~~~l~~lkd~~G~~i~~~~~~~~~------------------~~~l 211 (298) T protein:vir:16 151 RGIADP-NGAIENAVELLTGVDADVTGIAINPSFRSALAKQKDLQDNALFPELKWGAT------------------PDTI 211 (298) T ss_pred cccccH-HHHHHHHHHHhhhcCCCccEEEEcHHHHHHHHHhhccCCCeeecCcccCCC------------------Ccee Confidence 22222 234555555566677888889999999999877554333444332211110 1122 Q ss_pred cCCCccccCcccCCCCCcccceEEEeecccccCcccc---cccceeeEEEEEEEcCCCCcccccceeeeeecCCCeEEEE Q lcl|NC_018863. 277 MENDNILVDRIPEPNAPQAPASVVATVKVNDKGAFRP---VKDIKTHSYKVVVHSDDAESLASEAVTAVVANPTDSVSLA 353 (479) Q Consensus 277 ~~a~~~lver~~s~~aP~~P~~vta~~~~~~~g~~~~---~sd~g~Y~YkV~a~n~~GES~~S~~VtaT~a~~~~~V~Lt 353 (479) ++.+.+..+..+.... .+... -..|.|-. -...+.+.+++ ++++.+..+.. + ....+.+.+. T Consensus 212 ~G~PV~~~~~v~~~~~--~~~~~------~~~GDfs~~~~~~~~~~~~~~~---~~~~~~~~~~~-~---~f~~~~v~~r 276 (298) T protein:vir:16 212 NGLPVDVNKTVSDMSL--TQRDR------AIIGDFANGFKWGYAKEVPLEV---IQYGDPDNSGL-D---LKGYNQVYIR 276 (298) T ss_pred cceeeEEecccccccC--CCccE------EEEeeccceEEEEEecCceEEE---eeccCCcCcch-h---hhhcCcEEEE Confidence 2333332222221000 00000 01111100 00001122222 22222211110 0 0111222222 Q ss_pred Eeec-CCccccceEEEEEeccC Q lcl|NC_018863. 354 VKLQ-SLYQAKPQFISVYRQGN 374 (479) Q Consensus 354 It~~-~~~~~~~~y~~IYR~t~ 374 (479) +.-- ...-..|.-+....... T Consensus 277 a~~r~d~~v~~~~a~~~l~~at 298 (298) T protein:vir:16 277 AELFLGWGILDATKFARVTEAN 298 (298) T ss_pred EEEEEccEeecccceEEEeecC Confidence 1110 11111222222222222 No 59 >protein:vir:80376 Length: 435 # NCBI annotation: gp6, major capsid head protein # Family: family:all:21 # MgeID: mge:1881 # MgeName: phi644-2 # Cross-refs: genbank:acc:YP_001111085;genbank:gi:134288639;genbank:GeneID:4960624 Probab=96.13 E-value=0.00096 Score=37.12 Aligned_cols=319 Identities=12% Similarity=0.079 Sum_probs=141.7 Q ss_pred Cccc-----------cccc--------ceeeeecCchhHH-HHHHHHHHHhhcCcc---------------------cCc Q lcl|NC_018863. 1 MTEL-----------QKEQ--------KVEARKLPAGAEA-ELAELVSKSFTTGTG---------------------ITP 39 (479) Q Consensus 1 ~~~~-----------~~~~--------~~~~~~~~~~~~~-~~~e~~~Ksf~ag~~---------------------~~~ 39 (479) +.++ .... +....+....+.. .-...+.|++..+-+ .+. T Consensus 55 ~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 134 (435) T protein:vir:80 55 AEAAERMAAAAAVPVDPNPAAVTASAAAPVYAQPKAPEVKGAKMARMVRALAAARGDAQLASKLAIERGFGEEVAMSLNT 134 (435) T ss_pred HHHHHHHHHhhcccccchhhhhccccccccccccchhhhhHHHHHHHHHHHHhccchhHHHHHHHHhhhhhhhhhhhhcc Confidence 0000 0000 0000000111111 111223333332211 122 Q ss_pred ccccCccccchhhhHHHHHHHhhccccccchhhhccchh--HHHHHHhhhhhccCcccccccccccccccccCcceEEEE Q lcl|NC_018863. 40 DTQHDAAALRRELLDDQVKMLAFTNGDFTIYPLINKQQV--NSTVAKYAVFNQHGRTGHSRFVREVGVASINDPNIRQKT 117 (479) Q Consensus 40 ~~~~~gaAlr~esld~~i~~l~~~~~~f~~~~~i~k~~~--~stv~~y~~~~~~G~~g~~~fv~E~g~~~~~d~~~~r~~ 117 (479) .+...|+.|-.+.+..+|..+..... .+..+..+.+ .+--.+|.++. +.+...|++|++..+..|+.+.+.. T Consensus 135 ~~~~~gg~lvP~~~~~~ii~~l~~~~---~i~~~~~~~v~~~~~~~~~p~~~---~~~~a~~v~E~~~~~~~~~~f~~i~ 208 (435) T protein:vir:80 135 LSPGAGGVLVPENLSSEVIELLRPKS---VVRKLGARTLPLSNGNITIPRLK---GGAIVGYIGADTDIPTTQQQFDDLK 208 (435) T ss_pred cCCCCCccccchhHHHHHHHHHhhhc---hhhhccceeeecCCCceEEEEEe---CCcceeeeccCccccccccceeeEE Confidence 33445777888888888866654333 2223321122 22223343332 3345679999999999999999999 Q ss_pred EEEEeeeehhhhhhhHhhhcc-h-hhHHHHHHHHHHHHHHHHHHHHHhhcccccCCCCCCcccchhhhHHHhhccCCcEE Q lcl|NC_018863. 118 VQMKFLSDTKQQSLAAGLVNN-I-ADPMTILTEDAISVIAKSIEWAIFYGDAALAAEADNQAGIEFDGLTKLIDEATNVI 195 (479) Q Consensus 118 ~~~k~l~~~~~vs~~~~lv~~-~-~dp~~~~~~~ai~~~~~~~e~a~f~Gd~~l~~~~~~~~gleFDGl~~~I~~~~Nvi 195 (479) ..++=++....+|.-+ +.++ + -+.+....+.-...+...+|.++|+|+..= -+..||.+... ..++. T Consensus 209 ~~~~k~~~~~~is~el-l~ds~~~~~l~~~i~~~l~~a~~~~~d~a~l~G~G~~---------~~p~Gi~~~~~-~~~~~ 277 (435) T protein:vir:80 209 LTAKKMAALVPIANDL-IKYAGVNPNVDQIVVGDLTAAIGAREDKAFIRDDGTA---------NTPKGLRFWAL-PGNVI 277 (435) T ss_pred EeeEEEEEeehhhHHH-HHhhcccHHHHHHHHHHHHHHHHHHHHHHhhccCCCC---------Ccccceeeccc-cccee Confidence 9999898888888765 3333 2 356788899999999999999999997431 12357766554 24444 Q ss_pred Ecc-CCCCC--HHHhhhhhheeecc--cCceeeeecChHHHhhHHHhhcCceeEEeecCCCccccCccccceecCceeEE Q lcl|NC_018863. 196 DLK-GERLD--EATLNKAAVIVGKG--YGRATDAFMPIGVQADFTNNLLDRQRVIQPSQAGGFSTGFSINQFLSTRGAIN 270 (479) Q Consensus 196 Dar-G~~l~--~~~l~~aa~~i~~~--fG~atd~~mp~~vka~f~q~~~~~qrv~~~~n~g~~~~G~~V~~~~ss~g~I~ 270 (479) ..- |.... ...+.++-.....+ +-...-..|++.+...+...-...-+-+.+...++.-.|.+|-. +..-+.+ T Consensus 278 ~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~vmn~~~~~~L~~lkd~~G~~l~~~~~~~~l~G~pv~~--~~~~p~~ 355 (435) T protein:vir:80 278 TASDGSTLQKIETDLGKAILALENADANLTQPGWIMAPRTFRFLEGLRDGNGNKVYPELANGMLKGYPVGK--TTQVPIN 355 (435) T ss_pred ecccccchhhHHHHHHHHHHHhhccccccccCEEEEcHHHHHHHHhhhccCCceeccCCCCCeEeeeeeEE--ecccccc Confidence 333 33332 12233332222222 22223467899999888775544334444443343344444411 1110101 Q ss_pred ecC-----CcccCCCc-ccc-CcccCCCCCcccceEEEeec-----ccccCcccccccceeeEEEEEEEcCCCCcccccc Q lcl|NC_018863. 271 LHG-----STIMENDN-ILV-DRIPEPNAPQAPASVVATVK-----VNDKGAFRPVKDIKTHSYKVVVHSDDAESLASEA 338 (479) Q Consensus 271 L~~-----s~v~~a~~-~lv-er~~s~~aP~~P~~vta~~~-----~~~~g~~~~~sd~g~Y~YkV~a~n~~GES~~S~~ 338 (479) +.. ..+.++.. +++ +| .. ....+.-.++ ...-.-|..+ .-.+++...=+.+--.|+.. T Consensus 356 ~~~~~~~~~i~~gd~s~~~i~~~-----~~-~~i~~~~~~~~~~~~~~~~~~f~~n----~~~~r~~~r~d~~~~~~~a~ 425 (435) T protein:vir:80 356 LGEAGKESEIYFTDFGDVFIGEE-----ET-LEIDYSKEATYKDADGHMVSAFQRD----QTLIRVIAKNDFGPRHVESI 425 (435) T ss_pred ccCCCCcceEEEEEcccEEEEee-----cc-eEEEEeccccccccccchhhhhhcC----cceeeeeeeeCcEeecccce Confidence 000 00111111 111 10 00 0000000000 0000001111 12233333333333333333 Q ss_pred eeeeeecCCC Q lcl|NC_018863. 339 VTAVVANPTD 348 (479) Q Consensus 339 VtaT~a~~~~ 348 (479) +..+...-+- T Consensus 426 ~~l~~~~~~~ 435 (435) T protein:vir:80 426 AVLSGVAWGA 435 (435) T ss_pred EEEeccCCCC Confidence 3333333222 No 60 >protein:vir:4997 Length: 397 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:109 # MgeName: Sfi21 # Cross-refs: genbank:acc:NP_049971;genbank:gi:9632943;genbank:GeneID:1262106 Probab=96.12 E-value=0.00098 Score=37.09 Aligned_cols=280 Identities=14% Similarity=0.134 Sum_probs=126.0 Q ss_pred CcccccccceeeeecCchhHHHHHHHHHHHhhcCc-----ccCcccccCccccchhhhHHHHHHHhhccccccchhhhcc Q lcl|NC_018863. 1 MTELQKEQKVEARKLPAGAEAELAELVSKSFTTGT-----GITPDTQHDAAALRRELLDDQVKMLAFTNGDFTIYPLINK 75 (479) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~e~~~Ksf~ag~-----~~~~~~~~~gaAlr~esld~~i~~l~~~~~~f~~~~~i~k 75 (479) +++..+ +... .......+.....|.+.+..+- .....+..+|+.|.++.+..+|..+..... .+++.+.. T Consensus 71 ~~~~~~--~~~~-~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~t~~~gg~~iP~~~~~~ii~~~~~~~--~l~~~~~~ 145 (397) T protein:vir:49 71 MSEEEK--KPLT-KNEEEVKANFVKDFKNLVRGRYQNLLDSKTDGSGSDAGLTIPQDIRTAINTLVRQFD--SLQEYVNV 145 (397) T ss_pred cccccc--cccc-chhhHHHHHHHHHHHHHhhcchhhHHHhhhccCCccCcceecHHHHHHHHHHHHhhh--hHhhhcce Confidence 111100 1000 0011112222233444333321 122234556788889998888866555544 34444444 Q ss_pred chhHHHHHHhhhhhccC-cccccccccccccc-cccCcceEEEEEEEEeeeehhhhhhhHhhhcchhhHHHHHHHHHHHH Q lcl|NC_018863. 76 QQVNSTVAKYAVFNQHG-RTGHSRFVREVGVA-SINDPNIRQKTVQMKFLSDTKQQSLAAGLVNNIADPMTILTEDAISV 153 (479) Q Consensus 76 ~~~~stv~~y~~~~~~G-~~g~~~fv~E~g~~-~~~d~~~~r~~~~~k~l~~~~~vs~~~~lv~~~~dp~~~~~~~ai~~ 153 (479) ..+..-.-++. +..+. ..+.+.+++|++.. +...+.+......++-++.-..+|.-+- .++..|.+....+..... T Consensus 146 ~~~~~~~~~~~-~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell-~ds~~~l~~~i~~~l~~~ 223 (397) T protein:vir:49 146 ENVTTLTGSRV-YEKWADITGLAKLDDEGGQIGQNDDPKLSLIRYAIKRYAGISTVTNSLL-ADSAENILAWLSGWIAKK 223 (397) T ss_pred eeccCCcceEE-EEeeccCCcceeeeccccccccccccceeeeEeeeeeeEeehhhHHHHH-hhhhHHHHHHHHHHHHHH Confidence 44443222221 12222 23456799999975 4566899999999999998888886432 345668888899999999 Q ss_pred HHHHHHHHHhhcccccCCCCCCcccchhhhHHHhhccCCcEEEccCCCCCHHHhhhhhheeecccCceeeeecChHHHhh Q lcl|NC_018863. 154 IAKSIEWAIFYGDAALAAEADNQAGIEFDGLTKLIDEATNVIDLKGERLDEATLNKAAVIVGKGYGRATDAFMPIGVQAD 233 (479) Q Consensus 154 ~~~~~e~a~f~Gd~~l~~~~~~~~gleFDGl~~~I~~~~NviDarG~~l~~~~l~~aa~~i~~~fG~atd~~mp~~vka~ 233 (479) +++.++.++++|+..-.+.+ ..+-+|++.+++.. +..+|......+|++.+.+. T Consensus 224 ~~~~~d~ail~G~g~~~~~~---~~~~~d~i~~~~~~-----------------------l~~~~~~~a~~v~n~~~~~~ 277 (397) T protein:vir:49 224 VVVTRNKAILEAIGTLPNKP---TLAKWDDIIDLQAK-----------------------VDPAIKQTSLFLTNTSGFTA 277 (397) T ss_pred HHHHHHHHHHhccccccccc---cccCHHHHHHHHHh-----------------------hhhhhcCCCEEEEcHHHHHH Confidence 99999999999998765532 24567777666642 12223333444555555544 Q ss_pred HHHhhcCceeEE-eec---CCCccccCccccce--------------------ec-----CceeEEecCCcccC------ Q lcl|NC_018863. 234 FTNNLLDRQRVI-QPS---QAGGFSTGFSINQF--------------------LS-----TRGAINLHGSTIME------ 278 (479) Q Consensus 234 f~q~~~~~qrv~-~~~---n~g~~~~G~~V~~~--------------------~s-----s~g~I~L~~s~v~~------ 278 (479) +...-...-|.+ +++ .....-.|.+|.-. .. .++.+.+..+...+ T Consensus 278 l~~lkd~~g~~l~~~~~~~g~~~~l~G~pV~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~ 357 (397) T protein:vir:49 278 LKKVKNAMGDYLMERDVKSPTGYSIDGFVVKEISDRFLPNGTGGAMPLYFGDLKQAVTLFDRQHLSLLSTNIGGGAFETD 357 (397) T ss_pred HHHhhccCCceeecccccCCCCceecceeeEEecccccccccCCceeEEEeeccceEEEEeecccEEEEeccccchhhcC Confidence 443332211211 111 00111222222100 00 01111111111000 Q ss_pred CCccccC-c----ccCCCCCcccceEEEeecccccCcccccccc Q lcl|NC_018863. 279 NDNILVD-R----IPEPNAPQAPASVVATVKVNDKGAFRPVKDI 317 (479) Q Consensus 279 a~~~lve-r----~~s~~aP~~P~~vta~~~~~~~g~~~~~sd~ 317 (479) ...+..+ | +..+++ ......+++ ++..... ..+++ T Consensus 358 ~~~~~~~~r~d~~~~~~~a-~~~~~~~~~--~~~~~~~-~~~~~ 397 (397) T protein:vir:49 358 TTKVRVIDRFDVVSTDTEA-FVPASFKAI--ADQKAKL-STAGA 397 (397) T ss_pred eeeEEEEEeeccEEecccc-eEEEEeccc--ccccCcc-cccCC Confidence 0001111 0 111111 011111111 1111110 11111 No 61 >protein:vir:94771 Length: 298 # NCBI annotation: major head protein # Family: family:all:966 # MgeID: mge:1529 # MgeName: phi LC3 # Cross-refs: genbank:acc:NP_996706;genbank:gi:45597421;genbank:GeneID:2769044 Probab=95.87 E-value=0.0013 Score=36.37 Aligned_cols=290 Identities=11% Similarity=0.022 Sum_probs=136.8 Q ss_pred cccCccccchhhhHHHHHHHhhccccccchhhhccchhHHHHHHhhhhhccCcccccccccccccccccCcceEEEEEEE Q lcl|NC_018863. 41 TQHDAAALRRELLDDQVKMLAFTNGDFTIYPLINKQQVNSTVAKYAVFNQHGRTGHSRFVREVGVASINDPNIRQKTVQM 120 (479) Q Consensus 41 ~~~~gaAlr~esld~~i~~l~~~~~~f~~~~~i~k~~~~stv~~y~~~~~~G~~g~~~fv~E~g~~~~~d~~~~r~~~~~ 120 (479) =.++|+.|-.+.+..+|..+..... .+.+.....+..+.-.+|.++.. .+...+++|++..+.+++.+.+..... T Consensus 1 ma~~gG~lip~~~~~~ii~~~~~~s--~i~~~~~~~~~~~~~~~~p~~~~---~~~a~~v~Eg~~~~~~~~~f~~v~l~~ 75 (298) T protein:vir:94 1 MVLNKGTLFDPELVTDLISKVAGKS--SIARLSAQKPIPFNGEKVFTFTM---DSEIDVVAESGKKTHGGVTLAPQTMVP 75 (298) T ss_pred CeeccccccChhHHHHHHHHHHhhc--hhhhhcceeeccCCceEEEEEec---CcceEEeeCCccccccccceeEEEEee Confidence 1245667777777777755554433 23443443444433234544433 234578999999999999999999999 Q ss_pred EeeeehhhhhhhHhhhc--chhhHHHHHHHHHHHHHHHHHHHHHhhcccccCCCCCCcccchhhhHHHhhccCCcEEEc- Q lcl|NC_018863. 121 KFLSDTKQQSLAAGLVN--NIADPMTILTEDAISVIAKSIEWAIFYGDAALAAEADNQAGIEFDGLTKLIDEATNVIDL- 197 (479) Q Consensus 121 k~l~~~~~vs~~~~lv~--~~~dp~~~~~~~ai~~~~~~~e~a~f~Gd~~l~~~~~~~~gleFDGl~~~I~~~~NviDa- 197 (479) +=++.--.+|.-+-..+ ...+.++...++-...+++.+|.++++|...-+ +....+.|....+....+.... T Consensus 76 ~k~~~~~~iS~ell~~~~~~~~~l~~~i~~~la~ai~~~~d~~~l~G~~~~~-----g~~~~~~~~~~~~~~~~~~~~~~ 150 (298) T protein:vir:94 76 IKVEYGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRL-----GTASAVIGTNHFDSKVTQKVEAP 150 (298) T ss_pred eEEEEeeehhHHHhccCCccHHHHHHHHHHHHHHHHHHHHHHHhhcccccCC-----Ccccccccccccccccccccccc Confidence 99988888887763222 344667778888999999999999999954322 1122333433333322332222 Q ss_pred -cCCCCCHHHhhhhhheeecccCceeeeecChHHHhhHHHhhcCceeEEeecCCCccccCccccceecCceeEEecCCcc Q lcl|NC_018863. 198 -KGERLDEATLNKAAVIVGKGYGRATDAFMPIGVQADFTNNLLDRQRVIQPSQAGGFSTGFSINQFLSTRGAINLHGSTI 276 (479) Q Consensus 198 -rG~~l~~~~l~~aa~~i~~~fG~atd~~mp~~vka~f~q~~~~~qrv~~~~n~g~~~~G~~V~~~~ss~g~I~L~~s~v 276 (479) .+..+ .+.|.++...+..++.......|++.+.+.+...-...-|.+.+....+.. ..++ T Consensus 151 ~~~~~~-~~~i~~~~~~~~~~~~~~~~~vmn~~~~~~l~~lkd~~G~~l~~~~~~~~~------------------~~tl 211 (298) T protein:vir:94 151 RGIADP-NGAIENAVELLTGVDADVTGIAINPSFRSALAKQKDLQGNALFPELKWGAT------------------PDTI 211 (298) T ss_pred cccccH-HHHHHHHHHhhhhcCCCccEEEEcHHHHHHHHHhhccCCCeeecCcccCCC------------------Ccee Confidence 22222 334556655666677778889999999999877553333333222111100 1122 Q ss_pred cCCCccccCcccCCCCCcccceEEEeecccccCcccc---cccceeeEEEEEEEcCCCCcccccceeeeeecCCCeEEEE Q lcl|NC_018863. 277 MENDNILVDRIPEPNAPQAPASVVATVKVNDKGAFRP---VKDIKTHSYKVVVHSDDAESLASEAVTAVVANPTDSVSLA 353 (479) Q Consensus 277 ~~a~~~lver~~s~~aP~~P~~vta~~~~~~~g~~~~---~sd~g~Y~YkV~a~n~~GES~~S~~VtaT~a~~~~~V~Lt 353 (479) ++.+.+..+..+.. ...+... + ..|.|-. -...+.+.++| .++++...+.. .....+.+.+. T Consensus 212 ~G~PV~~~~~v~~~--~~~~~~~-~-----~~Gdfs~~~~~~~~~~~~~~~---~~~~~~d~~~~----~~f~~~~v~~r 276 (298) T protein:vir:94 212 NGLPVDVNKTVSDM--SLTQRDR-A-----IIGDFANGFKWGYAKEVPLEV---IQYGDPDNSGL----DLKGYNQVYIR 276 (298) T ss_pred cceeeEEecccccc--cCCCccE-E-----EEeeccceEEEEEecCceEEE---eecCCCcCcch----hhhhcCcEEEE Confidence 22222222211110 0000000 0 0111100 00001111221 12222111100 00112222222 Q ss_pred Eeec-CCccccceEEEEEeccC Q lcl|NC_018863. 354 VKLQ-SLYQAKPQFISVYRQGN 374 (479) Q Consensus 354 It~~-~~~~~~~~y~~IYR~t~ 374 (479) +..- +..-..|.-+...-... T Consensus 277 ~~~r~~~~~~~~~a~~~l~~~t 298 (298) T protein:vir:94 277 AELFLGWGILDATKFARVTEAN 298 (298) T ss_pred EEEEeccEeecccceEEEEecC Confidence 1110 01111111112111111 No 62 >protein:vir:98339 Length: 415 # NCBI annotation: putative capsid protein # Family: family:all:21 # MgeID: mge:1581 # MgeName: phiPVL(108) # Cross-refs: genbank:acc:YP_918931;genbank:gi:119443693;genbank:GeneID:4594501 Probab=95.82 E-value=0.0014 Score=36.23 Aligned_cols=317 Identities=13% Similarity=0.038 Sum_probs=142.2 Q ss_pred CcccccccceeeeecCch----------------hHHHHHHHHHHHhhcCccc--CcccccCccccchhhhHHHHHHHhh Q lcl|NC_018863. 1 MTELQKEQKVEARKLPAG----------------AEAELAELVSKSFTTGTGI--TPDTQHDAAALRRELLDDQVKMLAF 62 (479) Q Consensus 1 ~~~~~~~~~~~~~~~~~~----------------~~~~~~e~~~Ksf~ag~~~--~~~~~~~gaAlr~esld~~i~~l~~ 62 (479) +....+..+....+.... ..++ ...|.+.+..+... ...+-.+|+.|..+.+.+.|..+.. T Consensus 68 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~gg~~iP~~~~~~ii~~~~ 146 (415) T protein:vir:98 68 SENNQQSVEVNEARTYRNQANINDLGISIQNTKVTSQE-VRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKE 146 (415) T ss_pred hhhcccccccchhhhHHHHHHHHHHhhhhhhhhhHHHH-HHHHHHHHhhhhhhhhccccccccccccchHHHHHHHHHHH Confidence 111111111100000000 0011 12233333233211 1123346888999999988876666 Q ss_pred ccccccchhhhccchhHHHHHHhhhhhccCccccccccccccccc-ccCcceEEEEEEEEeeeehhhhhhhHhhhcchhh Q lcl|NC_018863. 63 TNGDFTIYPLINKQQVNSTVAKYAVFNQHGRTGHSRFVREVGVAS-INDPNIRQKTVQMKFLSDTKQQSLAAGLVNNIAD 141 (479) Q Consensus 63 ~~~~f~~~~~i~k~~~~stv~~y~~~~~~G~~g~~~fv~E~g~~~-~~d~~~~r~~~~~k~l~~~~~vs~~~~lv~~~~d 141 (479) .... +.+.+...++.+.-.+|......+ .....+++|++..+ ..++.+......++-++.-..+|.-+ +.++..| T Consensus 147 ~~~~--l~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~v~E~~~~~~~~~~~~~~v~~~~~k~~~~~~iS~el-l~ds~~~ 222 (415) T protein:vir:98 147 VEFN--LDKYVTVKRVTNGSGKYPVVRQSE-VAALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREA-IEDAKVN 222 (415) T ss_pred hhhh--hhhheeeeeccCCceeEEEEeecC-CccceeeccccccCcccccceeeEEeeeeeeEeeehhhHHH-HhhchHH Confidence 5543 444444445554444554443333 34566899988765 67799999999999999888888764 2455667 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhhcccccCCCCCCcccchhhhHHHhhccCCcEEEccCCCCCHHHhhhhhheeecccCce Q lcl|NC_018863. 142 PMTILTEDAISVIAKSIEWAIFYGDAALAAEADNQAGIEFDGLTKLIDEATNVIDLKGERLDEATLNKAAVIVGKGYGRA 221 (479) Q Consensus 142 p~~~~~~~ai~~~~~~~e~a~f~Gd~~l~~~~~~~~gleFDGl~~~I~~~~NviDarG~~l~~~~l~~aa~~i~~~fG~a 221 (479) .+....+.-...+++.++.+++.|+-.=.+.+ .++.... ..+.....+. .+.+.|.++--.+...|... T Consensus 223 l~~~i~~~l~~~~~~~~~~~il~g~g~g~~~~--------~~~~~~~--~~~~~~~~~~-~~~~~i~~~~~~~~~~~~~~ 291 (415) T protein:vir:98 223 VLQELKLWMARTIAATRNKAIIDVITKGSTGS--------TSSGFEK--EGKKLEVKKA-KSLDDIKDAINLNVKPNYEH 291 (415) T ss_pred HHHHHHHHHHHHHHHHHHHHHhhccccCcccc--------ccccccc--cccccccccc-cchhHHHHHHHhhhhhccCC Confidence 88888888889999999999999986522211 0111111 1233344443 33444444333334455556 Q ss_pred eeeecChHHHhhHHHhhcCceeEEeecCCCc----cccCccccceecCceeEEecCC--cccCC--Ccccc-CcccCCCC Q lcl|NC_018863. 222 TDAFMPIGVQADFTNNLLDRQRVIQPSQAGG----FSTGFSINQFLSTRGAINLHGS--TIMEN--DNILV-DRIPEPNA 292 (479) Q Consensus 222 td~~mp~~vka~f~q~~~~~qrv~~~~n~g~----~~~G~~V~~~~ss~g~I~L~~s--~v~~a--~~~lv-er~~s~~a 292 (479) +-.+||+.+.+.+...-...-|.+...+..+ .-.|++|..... .+.--.|+ .+.++ ..++. +|. . T Consensus 292 ~~~v~n~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~l~G~pV~~~~~--~~~~~~~~~~~~~Gd~~~~~~~~~~~----~ 365 (415) T protein:vir:98 292 NVAIVSQTMFAKLDKMKDKLGNYLIQPDVKEKTQQRLLGAKIEILPD--EVLGQKGNNTLIIGNLKDAIVLFDRS----Q 365 (415) T ss_pred CEEEEcHHHHHHHHHhhccCCceeeccCcCCCCCceecceeeEEecc--cccCCCCccEEEEEehhccEEEEeec----c Confidence 6789999998888764433333333222221 223333311110 00000000 01110 01111 110 0 Q ss_pred CcccceEEEeecccccCcccccccceeeEEEEEEEcCCCCcccccceeee---eecCCCeEEEEE Q lcl|NC_018863. 293 PQAPASVVATVKVNDKGAFRPVKDIKTHSYKVVVHSDDAESLASEAVTAV---VANPTDSVSLAV 354 (479) Q Consensus 293 P~~P~~vta~~~~~~~g~~~~~sd~g~Y~YkV~a~n~~GES~~S~~VtaT---~a~~~~~V~LtI 354 (479) ..+. .+....+.+.- .+...+=+.+.+ +...+..+ ++...|..-|.. T Consensus 366 ----~~v~----~~~~~~~~~~~-~~~~r~d~~v~~------~~a~~~~~~~~~~~~~~~~~~~~ 415 (415) T protein:vir:98 366 ----YQAS----WTDYMHFGECL-MIAVRQDCRILD------YKSAIVIEYDDSERGEGDLGLEA 415 (415) T ss_pred ----eEEE----EeccccCceEE-EEEEEeccEEec------cccEEEEEEeccCCCCCccccCC Confidence 0000 00111111100 011111111221 21121111 112222333333 No 63 >protein:vir:79987 Length: 415 # NCBI annotation: head protein # Family: family:all:21 # MgeID: mge:1875 # MgeName: tp310-3 # Cross-refs: genbank:acc:YP_001430002;genbank:gi:156604057;genbank:GeneID:5525447 Probab=95.82 E-value=0.0014 Score=36.23 Aligned_cols=317 Identities=13% Similarity=0.038 Sum_probs=142.2 Q ss_pred CcccccccceeeeecCch----------------hHHHHHHHHHHHhhcCccc--CcccccCccccchhhhHHHHHHHhh Q lcl|NC_018863. 1 MTELQKEQKVEARKLPAG----------------AEAELAELVSKSFTTGTGI--TPDTQHDAAALRRELLDDQVKMLAF 62 (479) Q Consensus 1 ~~~~~~~~~~~~~~~~~~----------------~~~~~~e~~~Ksf~ag~~~--~~~~~~~gaAlr~esld~~i~~l~~ 62 (479) +....+..+....+.... ..++ ...|.+.+..+... ...+-.+|+.|..+.+.+.|..+.. T Consensus 68 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~gg~~iP~~~~~~ii~~~~ 146 (415) T protein:vir:79 68 SENNQQSVEVNEARTYRNQANINDLGISIQNTKVTSQE-VRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKE 146 (415) T ss_pred hhhcccccccchhhhHHHHHHHHHHhhhhhhhhhHHHH-HHHHHHHHhhhhhhhhccccccccccccchHHHHHHHHHHH Confidence 111111111100000000 0011 12233333233211 1123346888999999988876666 Q ss_pred ccccccchhhhccchhHHHHHHhhhhhccCccccccccccccccc-ccCcceEEEEEEEEeeeehhhhhhhHhhhcchhh Q lcl|NC_018863. 63 TNGDFTIYPLINKQQVNSTVAKYAVFNQHGRTGHSRFVREVGVAS-INDPNIRQKTVQMKFLSDTKQQSLAAGLVNNIAD 141 (479) Q Consensus 63 ~~~~f~~~~~i~k~~~~stv~~y~~~~~~G~~g~~~fv~E~g~~~-~~d~~~~r~~~~~k~l~~~~~vs~~~~lv~~~~d 141 (479) .... +.+.+...++.+.-.+|......+ .....+++|++..+ ..++.+......++-++.-..+|.-+ +.++..| T Consensus 147 ~~~~--l~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~v~E~~~~~~~~~~~~~~v~~~~~k~~~~~~iS~el-l~ds~~~ 222 (415) T protein:vir:79 147 VEFN--LDKYVTVKRVTNGSGKYPVVRQSE-VAALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREA-IEDAKVN 222 (415) T ss_pred hhhh--hhhheeeeeccCCceeEEEEeecC-CccceeeccccccCcccccceeeEEeeeeeeEeeehhhHHH-HhhchHH Confidence 5543 444444445554444554443333 34566899988765 67799999999999999888888764 2455667 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhhcccccCCCCCCcccchhhhHHHhhccCCcEEEccCCCCCHHHhhhhhheeecccCce Q lcl|NC_018863. 142 PMTILTEDAISVIAKSIEWAIFYGDAALAAEADNQAGIEFDGLTKLIDEATNVIDLKGERLDEATLNKAAVIVGKGYGRA 221 (479) Q Consensus 142 p~~~~~~~ai~~~~~~~e~a~f~Gd~~l~~~~~~~~gleFDGl~~~I~~~~NviDarG~~l~~~~l~~aa~~i~~~fG~a 221 (479) .+....+.-...+++.++.+++.|+-.=.+.+ .++.... ..+.....+. .+.+.|.++--.+...|... T Consensus 223 l~~~i~~~l~~~~~~~~~~~il~g~g~g~~~~--------~~~~~~~--~~~~~~~~~~-~~~~~i~~~~~~~~~~~~~~ 291 (415) T protein:vir:79 223 VLQELKLWMARTIAATRNKAIIDVITKGSTGS--------TSSGFEK--EGKKLEVKKA-KSLDDIKDAINLNVKPNYEH 291 (415) T ss_pred HHHHHHHHHHHHHHHHHHHHHhhccccCcccc--------ccccccc--cccccccccc-cchhHHHHHHHhhhhhccCC Confidence 88888888889999999999999986522211 0111111 1233344443 33444444333334455556 Q ss_pred eeeecChHHHhhHHHhhcCceeEEeecCCCc----cccCccccceecCceeEEecCC--cccCC--Ccccc-CcccCCCC Q lcl|NC_018863. 222 TDAFMPIGVQADFTNNLLDRQRVIQPSQAGG----FSTGFSINQFLSTRGAINLHGS--TIMEN--DNILV-DRIPEPNA 292 (479) Q Consensus 222 td~~mp~~vka~f~q~~~~~qrv~~~~n~g~----~~~G~~V~~~~ss~g~I~L~~s--~v~~a--~~~lv-er~~s~~a 292 (479) +-.+||+.+.+.+...-...-|.+...+..+ .-.|++|..... .+.--.|+ .+.++ ..++. +|. . T Consensus 292 ~~~v~n~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~l~G~pV~~~~~--~~~~~~~~~~~~~Gd~~~~~~~~~~~----~ 365 (415) T protein:vir:79 292 NVAIVSQTMFAKLDKMKDKLGNYLIQPDVKEKTQQRLLGAKIEILPD--EVLGQKGNNTLIIGNLKDAIVLFDRS----Q 365 (415) T ss_pred CEEEEcHHHHHHHHHhhccCCceeeccCcCCCCCceecceeeEEecc--cccCCCCccEEEEEehhccEEEEeec----c Confidence 6789999998888764433333333222221 223333311110 00000000 01110 01111 110 0 Q ss_pred CcccceEEEeecccccCcccccccceeeEEEEEEEcCCCCcccccceeee---eecCCCeEEEEE Q lcl|NC_018863. 293 PQAPASVVATVKVNDKGAFRPVKDIKTHSYKVVVHSDDAESLASEAVTAV---VANPTDSVSLAV 354 (479) Q Consensus 293 P~~P~~vta~~~~~~~g~~~~~sd~g~Y~YkV~a~n~~GES~~S~~VtaT---~a~~~~~V~LtI 354 (479) ..+. .+....+.+.- .+...+=+.+.+ +...+..+ ++...|..-|.. T Consensus 366 ----~~v~----~~~~~~~~~~~-~~~~r~d~~v~~------~~a~~~~~~~~~~~~~~~~~~~~ 415 (415) T protein:vir:79 366 ----YQAS----WTDYMHFGECL-MIAVRQDCRILD------YKSAIVIEYDDSERGEGDLGLEA 415 (415) T ss_pred ----eEEE----EeccccCceEE-EEEEEeccEEec------cccEEEEEEeccCCCCCccccCC Confidence 0000 00111111100 011111111221 21121111 112222333333 No 64 >protein:vir:81100 Length: 415 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:1891 # MgeName: tp310-1 # Cross-refs: genbank:acc:YP_001429874;genbank:gi:156603927;genbank:GeneID:5525320 Probab=95.82 E-value=0.0014 Score=36.23 Aligned_cols=317 Identities=13% Similarity=0.038 Sum_probs=142.2 Q ss_pred CcccccccceeeeecCch----------------hHHHHHHHHHHHhhcCccc--CcccccCccccchhhhHHHHHHHhh Q lcl|NC_018863. 1 MTELQKEQKVEARKLPAG----------------AEAELAELVSKSFTTGTGI--TPDTQHDAAALRRELLDDQVKMLAF 62 (479) Q Consensus 1 ~~~~~~~~~~~~~~~~~~----------------~~~~~~e~~~Ksf~ag~~~--~~~~~~~gaAlr~esld~~i~~l~~ 62 (479) +....+..+....+.... ..++ ...|.+.+..+... ...+-.+|+.|..+.+.+.|..+.. T Consensus 68 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~gg~~iP~~~~~~ii~~~~ 146 (415) T protein:vir:81 68 SENNQQSVEVNEARTYRNQANINDLGISIQNTKVTSQE-VRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKE 146 (415) T ss_pred hhhcccccccchhhhHHHHHHHHHHhhhhhhhhhHHHH-HHHHHHHHhhhhhhhhccccccccccccchHHHHHHHHHHH Confidence 111111111100000000 0011 12233333233211 1123346888999999988876666 Q ss_pred ccccccchhhhccchhHHHHHHhhhhhccCccccccccccccccc-ccCcceEEEEEEEEeeeehhhhhhhHhhhcchhh Q lcl|NC_018863. 63 TNGDFTIYPLINKQQVNSTVAKYAVFNQHGRTGHSRFVREVGVAS-INDPNIRQKTVQMKFLSDTKQQSLAAGLVNNIAD 141 (479) Q Consensus 63 ~~~~f~~~~~i~k~~~~stv~~y~~~~~~G~~g~~~fv~E~g~~~-~~d~~~~r~~~~~k~l~~~~~vs~~~~lv~~~~d 141 (479) .... +.+.+...++.+.-.+|......+ .....+++|++..+ ..++.+......++-++.-..+|.-+ +.++..| T Consensus 147 ~~~~--l~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~v~E~~~~~~~~~~~~~~v~~~~~k~~~~~~iS~el-l~ds~~~ 222 (415) T protein:vir:81 147 VEFN--LDKYVTVKRVTNGSGKYPVVRQSE-VAALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREA-IEDAKVN 222 (415) T ss_pred hhhh--hhhheeeeeccCCceeEEEEeecC-CccceeeccccccCcccccceeeEEeeeeeeEeeehhhHHH-HhhchHH Confidence 5543 444444445554444554443333 34566899988765 67799999999999999888888764 2455667 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhhcccccCCCCCCcccchhhhHHHhhccCCcEEEccCCCCCHHHhhhhhheeecccCce Q lcl|NC_018863. 142 PMTILTEDAISVIAKSIEWAIFYGDAALAAEADNQAGIEFDGLTKLIDEATNVIDLKGERLDEATLNKAAVIVGKGYGRA 221 (479) Q Consensus 142 p~~~~~~~ai~~~~~~~e~a~f~Gd~~l~~~~~~~~gleFDGl~~~I~~~~NviDarG~~l~~~~l~~aa~~i~~~fG~a 221 (479) .+....+.-...+++.++.+++.|+-.=.+.+ .++.... ..+.....+. .+.+.|.++--.+...|... T Consensus 223 l~~~i~~~l~~~~~~~~~~~il~g~g~g~~~~--------~~~~~~~--~~~~~~~~~~-~~~~~i~~~~~~~~~~~~~~ 291 (415) T protein:vir:81 223 VLQELKLWMARTIAATRNKAIIDVITKGSTGS--------TSSGFEK--EGKKLEVKKA-KSLDDIKDAINLNVKPNYEH 291 (415) T ss_pred HHHHHHHHHHHHHHHHHHHHHhhccccCcccc--------ccccccc--cccccccccc-cchhHHHHHHHhhhhhccCC Confidence 88888888889999999999999986522211 0111111 1233344443 33444444333334455556 Q ss_pred eeeecChHHHhhHHHhhcCceeEEeecCCCc----cccCccccceecCceeEEecCC--cccCC--Ccccc-CcccCCCC Q lcl|NC_018863. 222 TDAFMPIGVQADFTNNLLDRQRVIQPSQAGG----FSTGFSINQFLSTRGAINLHGS--TIMEN--DNILV-DRIPEPNA 292 (479) Q Consensus 222 td~~mp~~vka~f~q~~~~~qrv~~~~n~g~----~~~G~~V~~~~ss~g~I~L~~s--~v~~a--~~~lv-er~~s~~a 292 (479) +-.+||+.+.+.+...-...-|.+...+..+ .-.|++|..... .+.--.|+ .+.++ ..++. +|. . T Consensus 292 ~~~v~n~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~l~G~pV~~~~~--~~~~~~~~~~~~~Gd~~~~~~~~~~~----~ 365 (415) T protein:vir:81 292 NVAIVSQTMFAKLDKMKDKLGNYLIQPDVKEKTQQRLLGAKIEILPD--EVLGQKGNNTLIIGNLKDAIVLFDRS----Q 365 (415) T ss_pred CEEEEcHHHHHHHHHhhccCCceeeccCcCCCCCceecceeeEEecc--cccCCCCccEEEEEehhccEEEEeec----c Confidence 6789999998888764433333333222221 223333311110 00000000 01110 01111 110 0 Q ss_pred CcccceEEEeecccccCcccccccceeeEEEEEEEcCCCCcccccceeee---eecCCCeEEEEE Q lcl|NC_018863. 293 PQAPASVVATVKVNDKGAFRPVKDIKTHSYKVVVHSDDAESLASEAVTAV---VANPTDSVSLAV 354 (479) Q Consensus 293 P~~P~~vta~~~~~~~g~~~~~sd~g~Y~YkV~a~n~~GES~~S~~VtaT---~a~~~~~V~LtI 354 (479) ..+. .+....+.+.- .+...+=+.+.+ +...+..+ ++...|..-|.. T Consensus 366 ----~~v~----~~~~~~~~~~~-~~~~r~d~~v~~------~~a~~~~~~~~~~~~~~~~~~~~ 415 (415) T protein:vir:81 366 ----YQAS----WTDYMHFGECL-MIAVRQDCRILD------YKSAIVIEYDDSERGEGDLGLEA 415 (415) T ss_pred ----eEEE----EeccccCceEE-EEEEEeccEEec------cccEEEEEEeccCCCCCccccCC Confidence 0000 00111111100 011111111221 21121111 112222333333 No 65 >protein:vir:99920 Length: 311 # NCBI annotation: gp7 # Family: family:all:966 # MgeID: mge:1611 # MgeName: Halo # Cross-refs: genbank:acc:YP_655524;genbank:gi:109392294;genbank:GeneID:4157089 Probab=95.81 E-value=0.0014 Score=36.18 Aligned_cols=296 Identities=11% Similarity=0.052 Sum_probs=138.9 Q ss_pred hhcCcccCcccccCccccchhhhHHHHHHHhhccccccchhhhccchhHHHHHHhhhhhccCcccccccccccccccccC Q lcl|NC_018863. 31 FTTGTGITPDTQHDAAALRRELLDDQVKMLAFTNGDFTIYPLINKQQVNSTVAKYAVFNQHGRTGHSRFVREVGVASIND 110 (479) Q Consensus 31 f~ag~~~~~~~~~~gaAlr~esld~~i~~l~~~~~~f~~~~~i~k~~~~stv~~y~~~~~~G~~g~~~fv~E~g~~~~~d 110 (479) |-+ ..++|+.|-.+.+..+|..+..... .+.+-..+.+..+--.+|.++. +.....+++|++..+.++ T Consensus 1 Mat-------~tt~~g~~vP~~~~~~ii~~~~~~s--~l~~~~~~i~~~~~~~~~p~~~---~~~~a~wv~Eg~~~~~~~ 68 (311) T protein:vir:99 1 MAT-------FGTGNLKNLPRNIADGMVKDVVQGS--TVAVLSARKPQRFGNEDIITFN---GRPKAEFVGEGQQKSSTT 68 (311) T ss_pred Cce-------ecCCCceeccHHHHHHHHHHHHhhc--hhhhhcceeeccCCceEEEEEe---CCceeEEeecCccccccc Confidence 332 2356667778888777766655443 2344444444443222443333 334577999999999999 Q ss_pred cceEEEEEEEEeeeehhhhhhhHhhh--cchhhHHHHHHHHHHHHHHHHHHHHHhhcccccCCCCCCcccchhhhHHHhh Q lcl|NC_018863. 111 PNIRQKTVQMKFLSDTKQQSLAAGLV--NNIADPMTILTEDAISVIAKSIEWAIFYGDAALAAEADNQAGIEFDGLTKLI 188 (479) Q Consensus 111 ~~~~r~~~~~k~l~~~~~vs~~~~lv--~~~~dp~~~~~~~ai~~~~~~~e~a~f~Gd~~l~~~~~~~~gleFDGl~~~I 188 (479) +++.......|=++.--.+|.-+-.. ++..|.+....+.-...+++.+|.++|+|+..-. |..+-|+...+ T Consensus 69 ~~f~~v~l~~~k~~~~~~iS~ell~~~~d~~~~l~~~i~~~la~ai~~~~d~~~l~G~g~~~-------g~~~~g~~~~~ 141 (311) T protein:vir:99 69 GEFDFVTSTPKKAQVTMRFNEEVQWADEDYQLGVLQTLSEAGAEALARALDLGLYHRINPLT-------GTVIPGWSNYL 141 (311) T ss_pred ceeeEEEEeeEEEEEeehhhHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHhhcccCccc-------Ccccccccccc Confidence 99999999999888888888776333 3456788899999999999999999999987533 23445666666 Q ss_pred ccCCcEEEccCCCCC--HHHhhhhhheeec--ccCceeeeecChHHHhhHHHhhcCceeEEeecCCCccccCccccceec Q lcl|NC_018863. 189 DEATNVIDLKGERLD--EATLNKAAVIVGK--GYGRATDAFMPIGVQADFTNNLLDRQRVIQPSQAGGFSTGFSINQFLS 264 (479) Q Consensus 189 ~~~~NviDarG~~l~--~~~l~~aa~~i~~--~fG~atd~~mp~~vka~f~q~~~~~qrv~~~~n~g~~~~G~~V~~~~s 264 (479) ....+.+...+.... .+.+..+...+.. ....++.+.|++.+...+...-...-|-+.+..+..... T Consensus 142 ~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~vmn~~~~~~L~~lkd~~G~~l~~~~~~~~~~--------- 212 (311) T protein:vir:99 142 GAASKRVELTADTIANPDLAIEAAVGLLVANGHPTPVNGLALHPSIAWGLSTARYTDGRKKFPELGLGIGV--------- 212 (311) T ss_pred ccccceeeccccccchhHHHHHHHHHHHhhhccCCCccEEEEcHHHHHHHHhhhccCCCeeecCcccCCCC--------- Confidence 655676666554433 2334443332222 234455689999999999775443223332222111110 Q ss_pred CceeEEecCCcccCCCccccCcccCCCCCcccceEEEeecccccCccccccccee-eEEEEEE-----EcCCCCcccccc Q lcl|NC_018863. 265 TRGAINLHGSTIMENDNILVDRIPEPNAPQAPASVVATVKVNDKGAFRPVKDIKT-HSYKVVV-----HSDDAESLASEA 338 (479) Q Consensus 265 s~g~I~L~~s~v~~a~~~lver~~s~~aP~~P~~vta~~~~~~~g~~~~~sd~g~-Y~YkV~a-----~n~~GES~~S~~ 338 (479) .++++.+.+..+..+.-.... ......... .+...+ -+|-.. +.|.+.- .+.+++.. .. T Consensus 213 ---------~~l~G~Pv~~s~~i~~~~~~~--~~~~~~~~~-~~~~~~-~Gdf~~~~~~~~~~~~~~~~~~~~~~~--~~ 277 (311) T protein:vir:99 213 ---------SSFEGIDASVSDTVNGGDEAD--PDDEDLDAA-RAVRGI-VGDFANGIHWGVQRDIPVELIKYGDPD--GQ 277 (311) T ss_pred ---------ceecceeeEeecccccccccc--cccchhhcc-CcceEE-EeeccccEEEEEecCceEEEeecCCCC--cc Confidence 111111211111111100000 000000000 000000 000000 1111110 01111100 00 Q ss_pred eeeeeecCCCeEEEEEeecCCccccceEEEEEeccC---CCCcEEEEEEeeeeec Q lcl|NC_018863. 339 VTAVVANPTDSVSLAVKLQSLYQAKPQFISVYRQGN---ETGHYFLVARVPLSKA 390 (479) Q Consensus 339 VtaT~a~~~~~V~LtIt~~~~~~~~~~y~~IYR~t~---~~g~~~~i~rV~~s~~ 390 (479) + -....+.+.+ +.+.|=+- ..... ++-...+ T Consensus 278 ~---~~~~~d~~~~--------------r~~~r~d~~v~~~~~v----~~~~~~A 311 (311) T protein:vir:99 278 G---DLKRHNQIAL--------------RLEIVYGWYVFTDRFV----VIENAVA 311 (311) T ss_pred h---hhhhcCcEEE--------------EEEEeecceecChhHe----eeecccC Confidence 0 0000111111 11111110 00000 0110011 No 66 >protein:vir:41 Length: 299 # NCBI annotation: major capsid protein # Family: family:all:507 # MgeID: mge:2 # MgeName: A118 # Cross-refs: genbank:acc:NP_463467;swissprot:trembl:q9t1b7;genbank:gi:16798789;uniprot:Q9T1B7;genbank:GeneID:922353 Probab=95.75 E-value=0.0015 Score=36.05 Aligned_cols=293 Identities=10% Similarity=0.090 Sum_probs=142.4 Q ss_pred cccCccc---ccCccccchhhhHHHHHHHhhccccccchhhhccchhHHHHHHhhhhhccCcccccccccccccccccCc Q lcl|NC_018863. 35 TGITPDT---QHDAAALRRELLDDQVKMLAFTNGDFTIYPLINKQQVNSTVAKYAVFNQHGRTGHSRFVREVGVASINDP 111 (479) Q Consensus 35 ~~~~~~~---~~~gaAlr~esld~~i~~l~~~~~~f~~~~~i~k~~~~stv~~y~~~~~~G~~g~~~fv~E~g~~~~~d~ 111 (479) -|.+++. ..+|+.|-.+.+.++|........ .+.+.....++.+...++.+ ..+ ....|++|++..+..++ T Consensus 1 ~g~~a~~~~~~~~~~~~iP~~~~~~ii~~~~~~s--~l~~~~~~~~~~~~~~~~~~---~~~-~~a~~v~E~~~~~~~~~ 74 (299) T protein:vir:41 1 MGFNPDTTTMQSAKTGSIPINISEQIITGVKNGS--AAMKLAKAVPMTKPEEEFTF---MSG-VGAFWVDEAERIQTSKP 74 (299) T ss_pred CCcCCCcccccCCCceecchhHHHHHHHHHHhcc--hhhhhceeeecCCCcEEEEE---EcC-CceeeeecCcccccccc Confidence 4444433 345677888888888866655444 34444455555555555533 233 24679999999999999 Q ss_pred ceEEEEEEEEeeeehhhhhhhHhhhcchhhHHHHHHHHHHHHHHHHHHHHHhhcccccCCCCCCcccchhhhHHHhhccC Q lcl|NC_018863. 112 NIRQKTVQMKFLSDTKQQSLAAGLVNNIADPMTILTEDAISVIAKSIEWAIFYGDAALAAEADNQAGIEFDGLTKLIDEA 191 (479) Q Consensus 112 ~~~r~~~~~k~l~~~~~vs~~~~lv~~~~dp~~~~~~~ai~~~~~~~e~a~f~Gd~~l~~~~~~~~gleFDGl~~~I~~~ 191 (479) .+.......|-++---.+|.-+- .++..|.+....+.-...+++.+|.++++|+.+-.+ .|+.+..... T Consensus 75 ~f~~v~l~~~k~~~~~~is~ell-~ds~~~~~~~i~~~l~~a~~~~~d~a~l~G~g~~~~----------~gil~~~~~~ 143 (299) T protein:vir:41 75 TFTKAKMRSKKMGVIIPTTKENL-NYSVTNFFSLMQAEIVEAFYKKFDQAVFTGVESPYN----------WNILKSATDA 143 (299) T ss_pred ceeEEEEeeEEEEEeehhhHHHH-hcCHHHHHHHHHHHHHHHHHHHHHHHHhhcccCccc----------cccccccccc Confidence 99999999999999988888432 345678888888999999999999999999965222 2566655444 Q ss_pred CcEEEccCCCCCHHHhhhhhheeecccCceeeeecChHHHhhHHHhhcCceeEEeecCCCccccCccccceecCceeEEe Q lcl|NC_018863. 192 TNVIDLKGERLDEATLNKAAVIVGKGYGRATDAFMPIGVQADFTNNLLDRQRVIQPSQAGGFSTGFSINQFLSTRGAINL 271 (479) Q Consensus 192 ~NviDarG~~l~~~~l~~aa~~i~~~fG~atd~~mp~~vka~f~q~~~~~qrv~~~~n~g~~~~G~~V~~~~ss~g~I~L 271 (479) .+.... ...+.+.|.++.-.+...+...+...|++.+...+...-...-|-+...+..+. .+ .| T Consensus 144 ~~~~~~--~~~~~~~l~~~~~~l~~~~~~~~~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~~-~~-------------~l 207 (299) T protein:vir:41 144 SNLVEE--TANKYDDLNEAIGLIEAEDLEPNGIATIRKQRVKYRSTKDGNGMPIFNTATSNG-VD-------------DV 207 (299) T ss_pred ceeecc--ccccHHHHHHHHHhhhcccCCcCEEEEcHHHHHHHHHhhccCCceeecCCcCCC-Cc-------------ee Confidence 454432 334556666665556677778888999999988888755433333332211110 00 11 Q ss_pred cCCcccCCCccccCcccCCCCCcccceEEEeecccccCcccccccceeeEEEEEEEcCCCCcccccceeeee-ecCCCeE Q lcl|NC_018863. 272 HGSTIMENDNILVDRIPEPNAPQAPASVVATVKVNDKGAFRPVKDIKTHSYKVVVHSDDAESLASEAVTAVV-ANPTDSV 350 (479) Q Consensus 272 ~~s~v~~a~~~lver~~s~~aP~~P~~vta~~~~~~~g~~~~~sd~g~Y~YkV~a~n~~GES~~S~~VtaT~-a~~~~~V 350 (479) ++.+.+..+..+. .+. . +...- .+ ....+. .+.+...+++. ................ ....+.+ T Consensus 208 -----~G~PV~~~~~~~~-~~~-~---~~~~~-gd-fs~~~i-~~~~~~~i~~~--~~~~~~~~~~~~~~~~~~~~~~~~ 272 (299) T protein:vir:41 208 -----LGLPIAYTPKYTF-GDK-D---ISELV-GD-WNQAYY-GILRGVEYEIL--TEATLTTVADETGKPLNLAERDMA 272 (299) T ss_pred -----cceeeEEecccCC-CCC-c---eEEEE-Ee-cccEEE-EEecCcEEEEe--ecccccccccccccchhhhhcCcE Confidence 1111111111110 000 0 00000 00 000000 00001111111 0000000000000000 0000111 Q ss_pred EEEEeec-CCccccc-eEEEEEeccCC Q lcl|NC_018863. 351 SLAVKLQ-SLYQAKP-QFISVYRQGNE 375 (479) Q Consensus 351 ~LtIt~~-~~~~~~~-~y~~IYR~t~~ 375 (479) .+.+..- ...-..| -+..|=-++++ T Consensus 273 ~~r~~~~~d~~v~~~~A~~~l~~~aa~ 299 (299) T protein:vir:41 273 AIKATFEVGFMVVKDEAFSAVQPKAGN 299 (299) T ss_pred EEEEEEEeccEEecccceEEEEeccCC Confidence 1100000 0000000 01111111111 No 67 >protein:vir:1025 Length: 408 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:20 # MgeName: bIL286 # Cross-refs: genbank:acc:NP_076679;genbank:gi:13095788;genbank:GeneID:920362 Probab=95.66 E-value=0.00089 Score=37.31 Aligned_cols=304 Identities=16% Similarity=0.144 Sum_probs=123.1 Q ss_pred Cccccccc---ceeeeecCch-hHHHHHHHHHHHhh----cCcc---------cCcccccCccccchhhhHHHHHHHhhc Q lcl|NC_018863. 1 MTELQKEQ---KVEARKLPAG-AEAELAELVSKSFT----TGTG---------ITPDTQHDAAALRRELLDDQVKMLAFT 63 (479) Q Consensus 1 ~~~~~~~~---~~~~~~~~~~-~~~~~~e~~~Ksf~----ag~~---------~~~~~~~~gaAlr~esld~~i~~l~~~ 63 (479) +.+..++. .-...+.+.+ .++..-+...|+|. .+.+ ....+..+|+.|-.+.+..+|..+... T Consensus 63 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~t~~~gg~~vP~~~~~~Ii~~~~~ 142 (408) T protein:vir:10 63 LVEAQAEQVVNMREEEKGPLNKSENELKDKFVKDFVNMVRNPMAFMNTVSSKTETSGSDSAAGLTIPQDIRTMINTLVRQ 142 (408) T ss_pred HHHHHHHHHhccccccccccccchhhhHHHHHHHHHHHhhcchhhhhhhhhhhhhcccccCCceeccHhHHHHHHHHHHh Confidence 01110000 0000011100 01111122223322 1111 112234567888899999888666655 Q ss_pred cccccchhhhccchhHHHHHHhhhhhccCccccccccccccccc-ccCcceEEEEEEEEeeeehhhhhhhHhhhcchhhH Q lcl|NC_018863. 64 NGDFTIYPLINKQQVNSTVAKYAVFNQHGRTGHSRFVREVGVAS-INDPNIRQKTVQMKFLSDTKQQSLAAGLVNNIADP 142 (479) Q Consensus 64 ~~~f~~~~~i~k~~~~stv~~y~~~~~~G~~g~~~fv~E~g~~~-~~d~~~~r~~~~~k~l~~~~~vs~~~~lv~~~~dp 142 (479) ... +++-+...++.+..-++......++.+...+++|++... .+++.+.......+-++.-..+|.-+ +.++.-|. T Consensus 143 ~~~--l~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~i~~~~~k~~~~~~iS~el-l~ds~~~l 219 (408) T protein:vir:10 143 YDS--LQQYVRVESVSTSNGSRVYEKWTDVTPLTVMDAEDGKIPDLDNPQLTIIKYLIKRYAGIITATNTS-LKDTAENI 219 (408) T ss_pred hch--hhhhcceeeccCCcceEEEeeccccccceeeecCccccccccCcceeeEEeeeeeEEeeehhHHHH-HhhchHHH Confidence 543 444455455544444443333334556788999998765 67799999999999999887777754 33456677 Q ss_pred HHHHHHHHHHHHHHHHHHHHhhcccccCCCCCCcccchhhhHHHhhcc-CCcEEEccCCCC-CHHHhhhhhheeecccCc Q lcl|NC_018863. 143 MTILTEDAISVIAKSIEWAIFYGDAALAAEADNQAGIEFDGLTKLIDE-ATNVIDLKGERL-DEATLNKAAVIVGKGYGR 220 (479) Q Consensus 143 ~~~~~~~ai~~~~~~~e~a~f~Gd~~l~~~~~~~~gleFDGl~~~I~~-~~NviDarG~~l-~~~~l~~aa~~i~~~fG~ 220 (479) .....+.-...+.+.++.+++.|+.+-.+.. ...-+|.+...+.. -..-+...+..+ +...++....+- ..-|. T Consensus 220 ~~~i~~~l~~~~~~~~~~~il~g~g~~~~~~---~~~~~~~l~~~~~~~~~~~~~~~a~~v~n~~~~~~l~~lk-d~~G~ 295 (408) T protein:vir:10 220 LAWLSSWIAKKVVVTRNQAIIEVMKAAPKKP---TIAKFDDVITMINTAVDPAIIATSSLLTNQSGLNKLALVK-TAEGK 295 (408) T ss_pred HHHHHHHHHHHHHHHHHHHHhhccccccccc---ccccHHHHHHHHHHhhhhhhccCCEEEEcHHHHHHHHHhh-ccCCc Confidence 8888888889999999999999998865432 24567777665531 111111111111 222222211110 11111 Q ss_pred eeeeecChHHHhhHHHhhcCceeEEeecCCCccccCcccc-cee---------cCceeEEecCCcccC------CCcccc Q lcl|NC_018863. 221 ATDAFMPIGVQADFTNNLLDRQRVIQPSQAGGFSTGFSIN-QFL---------STRGAINLHGSTIME------NDNILV 284 (479) Q Consensus 221 atd~~mp~~vka~f~q~~~~~qrv~~~~n~g~~~~G~~V~-~~~---------ss~g~I~L~~s~v~~------a~~~lv 284 (479) .+|.|.. .+.-...+++...++..+ ...++.+.+-. -+. ..++.+.+.++.... ...+.. T Consensus 296 --~i~~~~~-~~~~~~~l~G~PV~~~~~-~~~~~~~~~~~~i~~gd~~~~~~~~~~~~~~v~~~~~~~~~f~~~~~~~r~ 371 (408) T protein:vir:10 296 --YLLEPDP-TKPNSYLIKGKQVIVVAD-RWLPNTGSTVYPLYYGDMSQAITLFDRENMSLLPTNIGAGAFETDTTKIRV 371 (408) T ss_pred --eEeccCc-CCCCCceecceeeEEecc-cccCccCCCceEEEEEehhccEEEEEecceEEEEcccccchhhcCceEEEE Confidence 1222211 111111222222222111 01011110000 000 001111111111000 000000 Q ss_pred C-c----ccCCCCCcccceEEEeecccccCcccccccceeeEEEEEEEcCCCCccc-ccceeeeee Q lcl|NC_018863. 285 D-R----IPEPNAPQAPASVVATVKVNDKGAFRPVKDIKTHSYKVVVHSDDAESLA-SEAVTAVVA 344 (479) Q Consensus 285 e-r----~~s~~aP~~P~~vta~~~~~~~g~~~~~sd~g~Y~YkV~a~n~~GES~~-S~~VtaT~a 344 (479) + | ...+++ +++.-+..-....+ +...+++.. T Consensus 372 ~~r~d~~v~~~~a-----------------------------~~~~~~~~~~~~~~~~~~~~~~~~ 408 (408) T protein:vir:10 372 IDRFDVKATDSEA-----------------------------LVAGSFSAIADQVGNFKTTTSTAV 408 (408) T ss_pred EEeeccEEecccc-----------------------------EEEEEeeccccCCCCCCCCCcccC Confidence 0 0 000000 00001110001111 111011110 No 68 >protein:vir:105563 Length: 396 # NCBI annotation: hypothetical protein # Family: family:all:27455 # MgeID: mge:1540 # MgeName: F116 # Cross-refs: genbank:acc:YP_164316;genbank:gi:56692963;genbank:GeneID:3197174 Probab=95.54 E-value=0.00018 Score=41.06 Aligned_cols=265 Identities=18% Similarity=0.180 Sum_probs=100.4 Q ss_pred HHHHHHHHhhcccccCCCCCCcccchhhhHHHhhccCCcE-EEccCCCC--------CHHHhhh--hhheeecccCceee Q lcl|NC_018863. 155 AKSIEWAIFYGDAALAAEADNQAGIEFDGLTKLIDEATNV-IDLKGERL--------DEATLNK--AAVIVGKGYGRATD 223 (479) Q Consensus 155 ~~~~e~a~f~Gd~~l~~~~~~~~gleFDGl~~~I~~~~Nv-iDarG~~l--------~~~~l~~--aa~~i~~~fG~atd 223 (479) |-+.--.-|-|=..+.++..-+.|-|=|++ -...+.|| ||+.|+.= +..-|.- .+-...+.|+ T Consensus 1 ~~~~~~~~~~ginnv~~e~~l~~~~~~~~~--~~r~a~nvdi~~~G~~~~r~~~tr~~~g~l~~~~~~~~~~~~~~---- 74 (396) T protein:vir:10 1 MATTSLVPLAGINNVAEDAALQRGGESPRL--YVRDAVNIDLSPAGKAQLRASVRQVTDQPFRQLWQSPLHGDAFG---- 74 (396) T ss_pred CcceeeeeeecccccccccccccCCCcccc--eeeeeeeecccCCCchhhhccCcccCCceecccccCccccceee---- Confidence 222222223333333332211111111111 11123453 66666431 2111110 0000111111 Q ss_pred eecChHHHhhHHHhhcCceeEEeecCCCccccCccccc---eecCceeEEecCCcccCCCccccCcccCCCCCcccceEE Q lcl|NC_018863. 224 AFMPIGVQADFTNNLLDRQRVIQPSQAGGFSTGFSINQ---FLSTRGAINLHGSTIMENDNILVDRIPEPNAPQAPASVV 300 (479) Q Consensus 224 ~~mp~~vka~f~q~~~~~qrv~~~~n~g~~~~G~~V~~---~~ss~g~I~L~~s~v~~a~~~lver~~s~~aP~~P~~vt 300 (479) ++..+--.+++.. -++....+.+..-+-++... |.+-.+.+..+ ++--.+| ....+|.+|. .. T Consensus 75 --~~~~tl~~~~~~~---w~~~~~v~v~~~pva~d~~~~Rvy~t~~~~p~~~-------~~~~~y~-L~vp~P~~a~-~~ 140 (396) T protein:vir:10 75 --ALGDQWGKVDPHS---WTFEPLAQIGEGDLSHEVLNNRVCVAGTAGIFTY-------DGAQAER-LTLDTPAPPL-LV 140 (396) T ss_pred --eCCceEEEEeCCe---EEEEeeeeeccCchhccccCCeEEEEcCCCceee-------eCCccee-cCcCCCcccc-cc Confidence 1111111111111 11121112111111111110 00001111100 0000011 1112222222 11 Q ss_pred EeecccccCcccccccceeeEEEEEEEcCCCCcccccceeeeeecCCCeEEEEEeecCCccccceEEEEEeccCCCCcEE Q lcl|NC_018863. 301 ATVKVNDKGAFRPVKDIKTHSYKVVVHSDDAESLASEAVTAVVANPTDSVSLAVKLQSLYQAKPQFISVYRQGNETGHYF 380 (479) Q Consensus 301 a~~~~~~~g~~~~~sd~g~Y~YkV~a~n~~GES~~S~~VtaT~a~~~~~V~LtIt~~~~~~~~~~y~~IYR~t~~~g~~~ 380 (479) + ..|+ .+-++|.|.++-++..||..++.+++.-++ .+++++|++++. ++ .+...+.|||+.++++.|+ T Consensus 141 a-----~~Gs----l~~~~~~Y~~t~V~~~gEEs~p~~~S~~v~-~~gg~~vtl~~~-~~-~~i~~~RiYrS~~~G~~~~ 208 (396) T protein:vir:10 141 A-----GAGS----LSQGTYGAAVAWLRGPQESAPSLIAFAEVT-DAGALEVTFPLC-LD-ASVTGARLYLTRANGGELL 208 (396) T ss_pred c-----ccCc----cCCceEEEEEEEEecCCCcCcccccccccC-CCCCcEEEEEcc-cC-CCcceEEEEEeCCChhhhh Confidence 1 2233 223689999999998888877776666654 778888888864 33 3456789999999999999 Q ss_pred EEEEeeeeeccCCCeeEEEeccCCCCCccceeeccccHHHHHHHHhccccccCccccCchhHHHHHhhhhhheeccceeE Q lcl|NC_018863. 381 LVARVPLSKADENGVITFVDRNQVIPETTDVFIGELTPQVISLLELLPMMKLPLAQMNATTTFTVLWYGALALYAPKKWV 460 (479) Q Consensus 381 ~i~rV~~s~~n~~~tttf~D~N~~iPgT~~~fvge~~~q~i~l~ellPm~k~Pla~~~~~~~~~V~~yg~L~l~aPkk~~ 460 (479) +++-++. ++.+|++.- +| .++.|. .|+.|.| +|.+.+-+-..+ =.+.|=.+=+ T Consensus 209 l~aE~~a------~~~s~vlPs--~~-------w~gpP~--~~~gL~p---mP~G~~~A~faG-------Ri~~A~Gn~V 261 (396) T protein:vir:10 209 LAGDYPL------GAATVILPT--LP-------ELGRPA--QFRHLSP---MPTGKHLAYWRG-------RLLIARANVL 261 (396) T ss_pred heehhcc------ceeeeeeec--CC-------CCCCCc--ccccccc---CchhHhhhhhcc-------eEEEEeCCEE Confidence 9998885 356665422 11 112222 2344444 454332222211 1111111111 Q ss_pred EEEeccccCcccccc-ccCC Q lcl|NC_018863. 461 RIKNVQYIPALAADV-TYRP 479 (479) Q Consensus 461 ~ikNV~~~~~~~~~~-~~~~ 479 (479) ++.- .|.|-|.+-. .|+| T Consensus 262 ~FSE-p~~Ph~~~~~~~~~~ 280 (396) T protein:vir:10 262 RFSE-ALAYHLHDERYGFVQ 280 (396) T ss_pred EEec-CCCCceecchhccCC Confidence 1111 1122222222 2333 No 69 >protein:vir:9410 Length: 415 # NCBI annotation: head protein # Family: family:all:21 # MgeID: mge:167 # MgeName: phi 13 # Cross-refs: genbank:acc:NP_803388;genbank:gi:29028700;genbank:GeneID:1258136 Probab=95.51 E-value=0.0019 Score=35.54 Aligned_cols=321 Identities=12% Similarity=0.034 Sum_probs=143.2 Q ss_pred CcccccccceeeeecCchhH-----------HHHH----HHHHHHhhcCccc--CcccccCccccchhhhHHHHHHHhhc Q lcl|NC_018863. 1 MTELQKEQKVEARKLPAGAE-----------AELA----ELVSKSFTTGTGI--TPDTQHDAAALRRELLDDQVKMLAFT 63 (479) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~-----------~~~~----e~~~Ksf~ag~~~--~~~~~~~gaAlr~esld~~i~~l~~~ 63 (479) ..+.++..++...+...... ..+. ..|.+.+..+... ...+..+|+.+..+.+...|..+... T Consensus 68 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~g~~~iP~~~~~~ii~~~~~ 147 (415) T protein:vir:94 68 SENNQQSVEVNEASTYRNQANINDLGISIQNTKVTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEV 147 (415) T ss_pred hhhccccccccchhhHHHHHHHHHHHhhhhhhhhhHHHHHHHHHHhhhhhhhhhhccccccccccCcHHHHHHHHHHHHh Confidence 11111111111111110000 0000 1122222222211 11223568888899888888666655 Q ss_pred cccccchhhhccchhHHHHHHhhhhhccCccccccccccccccc-ccCcceEEEEEEEEeeeehhhhhhhHhhhcchhhH Q lcl|NC_018863. 64 NGDFTIYPLINKQQVNSTVAKYAVFNQHGRTGHSRFVREVGVAS-INDPNIRQKTVQMKFLSDTKQQSLAAGLVNNIADP 142 (479) Q Consensus 64 ~~~f~~~~~i~k~~~~stv~~y~~~~~~G~~g~~~fv~E~g~~~-~~d~~~~r~~~~~k~l~~~~~vs~~~~lv~~~~dp 142 (479) ... +.+-+...++.+--.+|.+. ...+.+...+++|++..+ ..++.+.+....++-++.-..+|.-+ +.++..|. T Consensus 148 ~~~--l~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~v~Eg~~~~~~~~~~~~~i~~~~~k~~~~~~is~el-l~ds~~~~ 223 (415) T protein:vir:94 148 EFN--LDKYVTVKRVTNGSGKYPVV-RQSEVAALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREA-IEDAKVNV 223 (415) T ss_pred hhh--hhhhcceeeccCCceeEEEE-eecCCccceeccccccccccccccceeeEeeheeeeeechhhHHH-HhhchHHH Confidence 443 34444444454433444333 333445677999998865 67799999999999999888888753 34556788 Q ss_pred HHHHHHHHHHHHHHHHHHHHhhcccccCCCCCCcccchhhhHHHhhccCCcEEEccCCCCCHHHhhhhhheeecccCcee Q lcl|NC_018863. 143 MTILTEDAISVIAKSIEWAIFYGDAALAAEADNQAGIEFDGLTKLIDEATNVIDLKGERLDEATLNKAAVIVGKGYGRAT 222 (479) Q Consensus 143 ~~~~~~~ai~~~~~~~e~a~f~Gd~~l~~~~~~~~gleFDGl~~~I~~~~NviDarG~~l~~~~l~~aa~~i~~~fG~at 222 (479) +....+.-...++..++.+++.|+..-.+.+. +.. .. ...+.....+.. +.+.|.++-..+...+...+ T Consensus 224 ~~~i~~~l~~~~~~~~~~~il~g~g~g~~~~~--------~~~-~~-~~~~~~~~~~~~-~~~~i~~~~~~~~~~~~~~~ 292 (415) T protein:vir:94 224 LQELKLWMARTIAATRNKAIIDVITKGSTGST--------SSG-FE-KEGKKLEVKKAK-SLDDIKDAINLNVKPNYEHN 292 (415) T ss_pred HHHHHHHHHHHHHHHHHHHHhhccccCccccc--------ccc-cc-cccccccccccc-chHHHHHHHHhhhhhccCCC Confidence 88888999999999999999999875332111 111 11 112333444433 33333333333345555567 Q ss_pred eeecChHHHhhHHHhhcCceeEEeecCCCc----cccCccccceecCceeEEecCC--cccCC--Ccccc-CcccCCCCC Q lcl|NC_018863. 223 DAFMPIGVQADFTNNLLDRQRVIQPSQAGG----FSTGFSINQFLSTRGAINLHGS--TIMEN--DNILV-DRIPEPNAP 293 (479) Q Consensus 223 d~~mp~~vka~f~q~~~~~qrv~~~~n~g~----~~~G~~V~~~~ss~g~I~L~~s--~v~~a--~~~lv-er~~s~~aP 293 (479) -.+|++.+.+.+...-...-|.+...+..+ .-.|.+|.-... .+.--.++ .+.++ ..++. +|. . T Consensus 293 ~~vmn~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~l~G~pV~~~~~--~~~~~~~~~~i~~gd~~~~~~~~~~~----~- 365 (415) T protein:vir:94 293 VAIVSQTMFAKLDKMKDKLGNYLIQPDVKEKTQQRLLGAKIEILPD--EVLGQKGNNTLIIGNLKDAIVLFDRS----Q- 365 (415) T ss_pred EEEEcHHHHHHHHHhhccCCCeeeccCcCCCCCceecceeeEEecc--cccCCCCccEEEEEehhccEEEEeec----c- Confidence 799999999888765433223333222221 122333311110 00000000 01111 11111 110 0 Q ss_pred cccceEEEeecccccCcccccccceeeEEEEEEEcCCCCcccccceeeeeecCCCeEEEEE Q lcl|NC_018863. 294 QAPASVVATVKVNDKGAFRPVKDIKTHSYKVVVHSDDAESLASEAVTAVVANPTDSVSLAV 354 (479) Q Consensus 294 ~~P~~vta~~~~~~~g~~~~~sd~g~Y~YkV~a~n~~GES~~S~~VtaT~a~~~~~V~LtI 354 (479) ..+. .+....+.+. ..+...+-+.+.+.. +.....-.+++...|..-|.. T Consensus 366 ---~~v~----~~~~~~~~~~-~r~~~r~d~~~~~~~---a~~~~~~~~~~~~~~~~~~~~ 415 (415) T protein:vir:94 366 ---YQAS----WTDYMHFGEC-LMIAVRQDCRILDYK---SAIVIEYDDSERGEGDLGLEA 415 (415) T ss_pred ---eEEE----EeccccCceE-EEEEEEeccEEeccc---cEEEEEEeccCCCCCccccCC Confidence 0000 0011111111 011222222233211 111111111222233333333 No 70 >protein:vir:3991 Length: 404 # NCBI annotation: major structural protein # Family: family:all:21 # MgeID: mge:319 # MgeName: BK5-T # Cross-refs: genbank:acc:NP_116499;genbank:gi:14251132;genbank:GeneID:921252 Probab=95.50 E-value=0.002 Score=35.43 Aligned_cols=316 Identities=9% Similarity=-0.010 Sum_probs=133.4 Q ss_pred Ccccccccce--------eeeecCchhHHHHHHHHHHHhhcCcc---------cCcccccCccccchhhhHHHHHHHhhc Q lcl|NC_018863. 1 MTELQKEQKV--------EARKLPAGAEAELAELVSKSFTTGTG---------ITPDTQHDAAALRRELLDDQVKMLAFT 63 (479) Q Consensus 1 ~~~~~~~~~~--------~~~~~~~~~~~~~~e~~~Ksf~ag~~---------~~~~~~~~gaAlr~esld~~i~~l~~~ 63 (479) +-+...+... ....-.....++-...|.+.+..+.. ....+..+|+.|-++.+..+|..+... T Consensus 63 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~a~~~~t~~~gg~~iP~~~~~~ii~~~~~ 142 (404) T protein:vir:39 63 LVEAQAEQVVNMREEEKGPLNKSEYELKDKFVKEFVNMVRNPMAFLNTVSSKTETSGSDSAAGLTIPQDIRTMINTLVRQ 142 (404) T ss_pred HHHHHHHHHhccccccccccccchhhhHHHHHHHHHHHHhcchhhhhhhhhhhhhcccccCCceeccHHHHHHHHHHHHh Confidence 1110000000 00000011111111223332222211 223345678889999998888666655 Q ss_pred cccccchhhhccchhHHHHHHhhhhhccCcccccccccccccc-cccCcceEEEEEEEEeeeehhhhhhhHhhhcchhhH Q lcl|NC_018863. 64 NGDFTIYPLINKQQVNSTVAKYAVFNQHGRTGHSRFVREVGVA-SINDPNIRQKTVQMKFLSDTKQQSLAAGLVNNIADP 142 (479) Q Consensus 64 ~~~f~~~~~i~k~~~~stv~~y~~~~~~G~~g~~~fv~E~g~~-~~~d~~~~r~~~~~k~l~~~~~vs~~~~lv~~~~dp 142 (479) .. .++..+...+..+-...|.....-+..+...+++|++.. +.+++.+.+....++-++.-..+|.-+= .++..|. T Consensus 143 ~~--~l~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell-~ds~~~l 219 (404) T protein:vir:39 143 YD--SLQQYVRVESVSTSNGSRVYEKWTDVTPLTVMDAEDGKIPDLDNPRLTIIKYLIKRYAGIITATNTLL-KDTAENI 219 (404) T ss_pred hh--hHHhhcceeeccCCcceEEEEeecCCccceeeecCccccccccccceeeEEeeeeeEEeeehhHHHHH-hhchHHH Confidence 54 344445544444433444322222334457789999875 4789999999999999998888887432 3455677 Q ss_pred HHHHHHHHHHHHHHHHHHHHhhcccccCCCCCCcccchhhhHHHhhccCCcEEEccCCCCCHHHhhhhhheeecccCcee Q lcl|NC_018863. 143 MTILTEDAISVIAKSIEWAIFYGDAALAAEADNQAGIEFDGLTKLIDEATNVIDLKGERLDEATLNKAAVIVGKGYGRAT 222 (479) Q Consensus 143 ~~~~~~~ai~~~~~~~e~a~f~Gd~~l~~~~~~~~gleFDGl~~~I~~~~NviDarG~~l~~~~l~~aa~~i~~~fG~at 222 (479) +....+.-...+...++.++++|+..-.+. +.++-+|++...|.. ....+|.... T Consensus 220 ~~~i~~~l~~~~~~~~d~~il~g~g~~~~~---~~~~~~~~i~~~~~~----------------------~~~~~~~~~a 274 (404) T protein:vir:39 220 LAWLSSWIAKKVVVTRNQAIIAAMGTVPKK---PTIAKFDDVITMINT----------------------SVDPAIIATS 274 (404) T ss_pred HHHHHHHHHHHHHHHHHHHHHhcccccccc---cccccHHHHHHHHHH----------------------hhhhhhccCC Confidence 888888999999999999999999886553 335778887776641 0011111112 Q ss_pred eeecChHHHhhHHHhhcCceeEEeecCCCc----cccCccccceecCceeEEecCCcccCCCccccCcccCCC---CCcc Q lcl|NC_018863. 223 DAFMPIGVQADFTNNLLDRQRVIQPSQAGG----FSTGFSINQFLSTRGAINLHGSTIMENDNILVDRIPEPN---APQA 295 (479) Q Consensus 223 d~~mp~~vka~f~q~~~~~qrv~~~~n~g~----~~~G~~V~~~~ss~g~I~L~~s~v~~a~~~lver~~s~~---aP~~ 295 (479) -.+|++.+.+.+...-...-|.+...+..+ .-.|.+|.-..+. .+ +. ........+..-++.. .-.. T Consensus 275 ~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~--~~---~~-~~~~~~~~~~gd~~~~~~~~~~~ 348 (404) T protein:vir:39 275 SLLTNQSGLNKLALVKTAEGKYLLEPDPTKPNSYLIKGKKVIVVADR--WL---PN-SGSTVYPLYYGDMSQAITLFDRE 348 (404) T ss_pred EEEEcHHHHHHHHHhhccCCceeeccCcCCCCcceecceeEEEeccc--cc---Cc-cCCCccEEEEEeccccEEEEeec Confidence 356777777666654322222232222111 1122222100000 00 00 0000000000000000 0000 Q ss_pred cceEEEeecccccCcccccccceeeEEEEEEEcCCCCcccccceeeeeecCCCeEEEEEeec-CCccccceEE Q lcl|NC_018863. 296 PASVVATVKVNDKGAFRPVKDIKTHSYKVVVHSDDAESLASEAVTAVVANPTDSVSLAVKLQ-SLYQAKPQFI 367 (479) Q Consensus 296 P~~vta~~~~~~~g~~~~~sd~g~Y~YkV~a~n~~GES~~S~~VtaT~a~~~~~V~LtIt~~-~~~~~~~~y~ 367 (479) ...+... .. .+..|. .+...+.+.-+.|-...- ...-+.|+++.. +..+..+..- T Consensus 349 ~~~i~~~--~~-~~~~~~-----~~~~~~r~~~r~d~~~~~---------~~a~~~~~~~~~a~~~~~~~~~~ 404 (404) T protein:vir:39 349 NMSLLPT--NI-GAGAFE-----TDTTKIRVIDRFDVKTTD---------SEALVAGSFTAIADQVGNFTAGK 404 (404) T ss_pred ceEEEEe--cc-chhhhh-----hceeeEEEEeeeccEEec---------ccceEEEEeeccccCCCCCCCCC Confidence 0000000 00 000000 011111111111111110 001111111111 0011111111 No 71 >protein:vir:95376 Length: 425 # NCBI annotation: phage major capsid protein # Family: family:all:635 # MgeID: mge:1567 # MgeName: GBSV1 # Cross-refs: genbank:acc:YP_764476;genbank:gi:115334630;genbank:GeneID:5179263 Probab=95.04 E-value=0.0014 Score=36.21 Aligned_cols=315 Identities=13% Similarity=0.144 Sum_probs=143.3 Q ss_pred Cccccccc--ceeeeecCchhHHHHHH---HHHHHhhcC-c----c--------cCcccccCccccchhhhHHHHHHHhh Q lcl|NC_018863. 1 MTELQKEQ--KVEARKLPAGAEAELAE---LVSKSFTTG-T----G--------ITPDTQHDAAALRRELLDDQVKMLAF 62 (479) Q Consensus 1 ~~~~~~~~--~~~~~~~~~~~~~~~~e---~~~Ksf~ag-~----~--------~~~~~~~~gaAlr~esld~~i~~l~~ 62 (479) +-+++..+ +-...+......+...+ .+.+.+.++ + . ..-.+-++|+.+-.+.+.++|..... T Consensus 84 l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~~vP~~~~~~Ii~~l~ 163 (425) T protein:vir:95 84 LEQINSKQPSNQSRQKMQGSKGDVVEMNRLQVREMLKTGEYYKRSEVVEFYEKFRNLRAVAGGELTIPEVVVNRIMDIMG 163 (425) T ss_pred HHHhhhhccchhhhhhhhhhhhhHHHHHHHHHHHHHhhhhhhhhhHHHHHHHHHHhhcccccCceeccHHHHHHHHHHHH Confidence 11111000 00000000000000000 000111111 0 0 01112356888999999988865555 Q ss_pred ccccccchhhhccchhHHHHHHhhhhhccCcccccccccccccccccC-cceEEEEEEEEeeeehhhhhhhHhhhcchhh Q lcl|NC_018863. 63 TNGDFTIYPLINKQQVNSTVAKYAVFNQHGRTGHSRFVREVGVASIND-PNIRQKTVQMKFLSDTKQQSLAAGLVNNIAD 141 (479) Q Consensus 63 ~~~~f~~~~~i~k~~~~stv~~y~~~~~~G~~g~~~fv~E~g~~~~~d-~~~~r~~~~~k~l~~~~~vs~~~~lv~~~~d 141 (479) ... .+++.+........+ ++ ...++.+.+.|+.|++..+..| +.+.+.....+=++.-..+|.-+ +.++..| T Consensus 164 ~~~--~i~~~~~~~~~~g~~-~i---p~~~~~~~a~~v~E~~~~~~~~~~~f~~i~l~~~k~~~~~~iS~el-l~ds~~~ 236 (425) T protein:vir:95 164 DYT--TLYPLVDKIRVKGTT-RI---LVDTDTSPATWIEQSGALPTGDVGTIASIDFDGFKVGKVTFVDNYL-LQDSIIN 236 (425) T ss_pred hhh--hHHHhhceeecCcee-EE---EEecCCccccccccccccccccccccceeeeeheeeeeeehhhHHH-HhccHHH Confidence 543 244444433343332 23 3455566788999999865555 78988888888777655555542 2345567 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhhcccccCCCCCCcccchhhhHHHhhccCCcEEEccCCCCCHHHhhhhhheeecccCce Q lcl|NC_018863. 142 PMTILTEDAISVIAKSIEWAIFYGDAALAAEADNQAGIEFDGLTKLIDEATNVIDLKGERLDEATLNKAAVIVGKGYGRA 221 (479) Q Consensus 142 p~~~~~~~ai~~~~~~~e~a~f~Gd~~l~~~~~~~~gleFDGl~~~I~~~~NviDarG~~l~~~~l~~aa~~i~~~fG~a 221 (479) .+....+.-...+++.+|.++|+|+..-++ ++.|+.+.+....++. ..+..++.+.|.++.-.+..++... T Consensus 237 l~~~i~~~l~~~i~~~~d~~il~G~G~~~~--------~p~Gil~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~ 307 (425) T protein:vir:95 237 LDDYVTKKIARAIAKALDLAIVKGTGAANK--------QPLGIIPSLPPENQVT-VEADNNLLKNLVKQIGLIDTGDDSV 307 (425) T ss_pred HHHHHHHHHHHHHHHHHHHHhhccCCCCcc--------ccceeecccccccccc-cccccchHHHHHHHHHhhhhhcccc Confidence 788888888889999999999999865332 4568887775434444 3445556666666655555666543 Q ss_pred ee--eecChHHH-hh---HHHhhcCceeEE-e-ecCCCccccCccccceecCceeEEecCCcccCCCccccCcccCCCCC Q lcl|NC_018863. 222 TD--AFMPIGVQ-AD---FTNNLLDRQRVI-Q-PSQAGGFSTGFSINQFLSTRGAINLHGSTIMENDNILVDRIPEPNAP 293 (479) Q Consensus 222 td--~~mp~~vk-a~---f~q~~~~~qrv~-~-~~n~g~~~~G~~V~~~~ss~g~I~L~~s~v~~a~~~lver~~s~~aP 293 (479) .. .+|++.+. +. ++..-...-|.+ + ++.....-.|.+|- .+..-+ .+..+.++...+.-+.- T Consensus 308 ~~~~~v~~~~~~~~~l~~l~~~kd~~g~~i~~~~~~~~~~l~G~pvv--~~~~~~---~~~i~~Gd~~~~~~~~~----- 377 (425) T protein:vir:95 308 GEIVAVMKRSTYYNRLVEFSIQVDSNGNVVGKLPNLRTPDLLGLRVV--FNNFLD---DDTVLFGEFEQYTLVER----- 377 (425) T ss_pred CceEEEEeChHHHHHHHHHHhhcCCCCceeeccCCCCCccccceeeE--EcCcCC---CccEEEEecccEEEEee----- Confidence 33 45666542 21 221112222222 2 22222222344431 111000 00111111111000000 Q ss_pred cccceEEEeecccccCcccccccceeeEEEEEEEcCCCCcccccceeeeeecCCCeE Q lcl|NC_018863. 294 QAPASVVATVKVNDKGAFRPVKDIKTHSYKVVVHSDDAESLASEAVTAVVANPTDSV 350 (479) Q Consensus 294 ~~P~~vta~~~~~~~g~~~~~sd~g~Y~YkV~a~n~~GES~~S~~VtaT~a~~~~~V 350 (479) ... ....+ .. ..|-. +.-.|++...=+..==.|...+..+++++..+. T Consensus 378 -~~~-~i~~~-~~--~~f~~----~~~~~~~~~r~d~~~~~~~a~~~~~i~~~~~g~ 425 (425) T protein:vir:95 378 -ENI-TIDSS-TH--VKFTE----DQTAFRGKGRFDGKPVKPEAFVLVTITDPVQGA 425 (425) T ss_pred -cce-EEEee-cc--ccccc----CceEEEEEEeeCcEeecccceEEEEecCcCCCC Confidence 000 11111 11 11111 234444444444444445555555555533322 No 72 >protein:vir:3845 Length: 395 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:322 # MgeName: phi adh # Cross-refs: genbank:acc:NP_050151;swissprot:trembl:q9t1f6;genbank:gi:9633043;uniprot:Q9T1F6;genbank:GeneID:1262163 Probab=95.00 E-value=0.003 Score=34.42 Aligned_cols=315 Identities=9% Similarity=-0.002 Sum_probs=135.2 Q ss_pred CcccccccceeeeecCchhHHHHHHHHHHHhhcCcccCcccccCccccchhhhHHHHHHHhhccccccchhhhccchhHH Q lcl|NC_018863. 1 MTELQKEQKVEARKLPAGAEAELAELVSKSFTTGTGITPDTQHDAAALRRELLDDQVKMLAFTNGDFTIYPLINKQQVNS 80 (479) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~e~~~Ksf~ag~~~~~~~~~~gaAlr~esld~~i~~l~~~~~~f~~~~~i~k~~~~s 80 (479) +++.....+ ..-...........+.|++...-.....+-.+|+.|..+.+..+|..+...... +.+-+...++.+ T Consensus 74 ~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~~vP~~~~~~ii~~~~~~~~--l~~~~~~~~~~~ 148 (395) T protein:vir:38 74 EPVNKKPLP---VKDGKPDAQAMKNQFVKDFKNLVTSGTTGTGNAGLTIPEDIQLQIRTLTRSFTS--LESLANVENVTT 148 (395) T ss_pred ccccccccc---hhhhhHHHHHHHHHHHHHHHHHHhhccCccCCCceecchhHhhHHHHHHHhhcc--hhhhcceeeccC Confidence 111111111 111111222333455555543322222334468889999998888666555443 444444444443 Q ss_pred HHHHhhhhhccCccccccccccccccc-ccCcceEEEEEEEEeeeehhhhhhhHhhhcchhhHHHHHHHHHHHHHHHHHH Q lcl|NC_018863. 81 TVAKYAVFNQHGRTGHSRFVREVGVAS-INDPNIRQKTVQMKFLSDTKQQSLAAGLVNNIADPMTILTEDAISVIAKSIE 159 (479) Q Consensus 81 tv~~y~~~~~~G~~g~~~fv~E~g~~~-~~d~~~~r~~~~~k~l~~~~~vs~~~~lv~~~~dp~~~~~~~ai~~~~~~~e 159 (479) ...+|.....-+..+...+++|++..+ ..++.+.+.....+-++.--.+|.-+= .++..|.+....+.-...+...++ T Consensus 149 ~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~f~~v~~~~~k~~~~~~iS~ell-~ds~~~l~~~i~~~la~~~~~~~~ 227 (395) T protein:vir:38 149 SHGSRVYEKLADITPLKDLDDESALIGDNDDPELTVVKYLIHRYAGITTVTNTLL-KDTVDNIIQWLVNWAAKKDVVTRN 227 (395) T ss_pred CcceEEEEeeccCCccccccccccccccccccceeeEEeeeeeeEeehhhHHHHH-hhhHHHHHHHHHHHHHHHHHHHHH Confidence 333332111111223456899998865 567999999999999988888877432 344557778888888999999999 Q ss_pred HHHhhcccccCCCCCCcccchhhhHHHhhccCCcEEEccCCCCCHHHhhhhhheeecccCceeeeecChHHHhhHHHhhc Q lcl|NC_018863. 160 WAIFYGDAALAAEADNQAGIEFDGLTKLIDEATNVIDLKGERLDEATLNKAAVIVGKGYGRATDAFMPIGVQADFTNNLL 239 (479) Q Consensus 160 ~a~f~Gd~~l~~~~~~~~gleFDGl~~~I~~~~NviDarG~~l~~~~l~~aa~~i~~~fG~atd~~mp~~vka~f~q~~~ 239 (479) .++++|+..-.+.+ ...-+|.+.+.+.. .+...|....-.+|++.+.+.+...-. T Consensus 228 ~~il~g~g~~~~~~---~~~~~~~i~~~~~~----------------------~l~~~~~~~a~~v~n~~~~~~L~~lkd 282 (395) T protein:vir:38 228 AKILEVMGKAPKKP---TISQFDNIKDLENN----------------------TLDPAIESTSSFITNQSGYNILSKVKD 282 (395) T ss_pred HHHhhccccccccc---ccccHHHHHHHHHH----------------------hhhhhhcCCCEEEEcHHHHHHHHHhhc Confidence 99999987754432 23456666555431 111122222336789888888776544 Q ss_pred CceeEEeecCCCccccCccccceecCceeEEecCCcccCCCccccCcccCCCCCcccceEEEeecccccCccccccccee Q lcl|NC_018863. 240 DRQRVIQPSQAGGFSTGFSINQFLSTRGAINLHGSTIMENDNILVDRIPEPNAPQAPASVVATVKVNDKGAFRPVKDIKT 319 (479) Q Consensus 240 ~~qrv~~~~n~g~~~~G~~V~~~~ss~g~I~L~~s~v~~a~~~lver~~s~~aP~~P~~vta~~~~~~~g~~~~~sd~g~ 319 (479) ..-|.+...+..+. .+.++++.+.+..+....+.+.. +... -.|. T Consensus 283 ~~G~~l~~~~~~~~------------------~~~~l~G~pV~~~~~~~~~~~~~-------------~~~i----~~gd 327 (395) T protein:vir:38 283 ADGRYLMQPDVTSP------------------DKYLIDGKPVIRIADKWLPDVSG-------------SHPL----YFGD 327 (395) T ss_pred cCCceeeccCcCCC------------------CcceeccceeEEecccccCcCCC-------------cceE----EEEe Confidence 33333322211110 01112222222111100000000 0000 0122 Q ss_pred eEEEEEEEcCCCCcccccceeeeeecCCCeEEEEEeecCCc---cccceEEEEEecc---CCCCcEEEEEEeeeeeccCC Q lcl|NC_018863. 320 HSYKVVVHSDDAESLASEAVTAVVANPTDSVSLAVKLQSLY---QAKPQFISVYRQG---NETGHYFLVARVPLSKADEN 393 (479) Q Consensus 320 Y~YkV~a~n~~GES~~S~~VtaT~a~~~~~V~LtIt~~~~~---~~~~~y~~IYR~t---~~~g~~~~i~rV~~s~~n~~ 393 (479) ++..+....+.| ++|.+...... .....|+.+.|=. ..+..|..+.--+++. +. T Consensus 328 ~~~~~~i~~~~~------------------~~i~~~~~~~~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~~~~~--~~ 387 (395) T protein:vir:38 328 LKQGITLFDRQQ------------------MQIDTTNVGAGSFEHDTTKLRFIDRFDVQLIDDGAFAAASFKTVAN--QA 387 (395) T ss_pred ccccEEEEEecc------------------eEEEEeccccchhhcCceEEEEEEeeccEEecccceEEEEeecccC--CC Confidence 221111111111 11111110000 0001111111111 0111111111111110 11 Q ss_pred CeeEEEeccC Q lcl|NC_018863. 394 GVITFVDRNQ 403 (479) Q Consensus 394 ~tttf~D~N~ 403 (479) ..|+ ++.. T Consensus 388 ~~~~--~~~~ 395 (395) T protein:vir:38 388 QGTA--GTGK 395 (395) T ss_pred CCcc--CCCC Confidence 1111 1111 No 73 >protein:vir:103370 Length: 418 # NCBI annotation: hypothetical protein # Family: family:all:11266 # MgeID: mge:1621 # MgeName: PaP2 # Cross-refs: genbank:acc:YP_024741;genbank:gi:48697083;genbank:GeneID:2846038 Probab=94.80 E-value=0.00049 Score=38.75 Aligned_cols=325 Identities=15% Similarity=0.142 Sum_probs=141.1 Q ss_pred Ccccccccce--e---eee-cCc-hhHHHHHHHHHHHhhcCcc-cCcccccCccccchhhhHHHHH--HHhhcccccc-- Q lcl|NC_018863. 1 MTELQKEQKV--E---ARK-LPA-GAEAELAELVSKSFTTGTG-ITPDTQHDAAALRRELLDDQVK--MLAFTNGDFT-- 68 (479) Q Consensus 1 ~~~~~~~~~~--~---~~~-~~~-~~~~~~~e~~~Ksf~ag~~-~~~~~~~~gaAlr~esld~~i~--~l~~~~~~f~-- 68 (479) +-|+--...+ . ... .++ ..+..||+ |++..=.. ++.....++.-|..|+=+- ++ +|.|.+..+. T Consensus 27 ~~~~PN~~~pll~li~~g~~~ta~ast~~w~~---d~~~~~~~~~ta~a~a~~T~l~ve~~~~-f~~~~l~~~~~~~Evi 102 (418) T protein:vir:10 27 LRRVPNGSAPLLAMTSVVGSTTAKASTHGYFS---KTMVFASAVVTAEAAADATVLTVENSDG-LTKGMIFYNEATGENM 102 (418) T ss_pred hhhcCCcchhhhhhhhcccccccceeEEEEEE---EEEeeeeEEEEEEEecCceEEEEcCcce-eccccEEEEccCCeEE Confidence 1111100000 0 000 000 01122322 32221111 1111112222222222211 00 1222222111 Q ss_pred chhhhccchhHHHHHHhhhhhccCcccccc--------cc----cccccccccCcceEEEEEEE-Ee---eeehhhhhhh Q lcl|NC_018863. 69 IYPLINKQQVNSTVAKYAVFNQHGRTGHSR--------FV----REVGVASINDPNIRQKTVQM-KF---LSDTKQQSLA 132 (479) Q Consensus 69 ~~~~i~k~~~~stv~~y~~~~~~G~~g~~~--------fv----~E~g~~~~~d~~~~r~~~~~-k~---l~~~~~vs~~ 132 (479) ....|+ .. +-.+....|++-+.+ |+ -||.+.... +.+.+ +.+ +| +.+.-++|.- T Consensus 103 rv~sVn----g~---~lTV~Rg~~~t~aaaia~n~~~~~Ig~~~eEGsd~~ta--~~~k~-~~vsNvtQIF~~avsvSgT 172 (418) T protein:vir:10 103 RLELVN----GL---NLTVKRQTGRISAAIIAANTKLIVIGTAFEEGSQRPTA--RSIQP-VYVPNFTQIFRNAWALTDT 172 (418) T ss_pred EEEEEe----CC---EEEEEEecCCeeEEEEecCceEEEeccccccccccCCc--ceecc-eeccchhhhhhhhhhhhhh Confidence 111221 01 111223444443332 22 366554443 33322 222 23 3445556655 Q ss_pred Hhhh---cchhhHHHHHHHHHHHHHHHHHHHHHhhcccccCCCCCCcccc--hhhhHHHhhcc--CCcEEEccCC-CCCH Q lcl|NC_018863. 133 AGLV---NNIADPMTILTEDAISVIAKSIEWAIFYGDAALAAEADNQAGI--EFDGLTKLIDE--ATNVIDLKGE-RLDE 204 (479) Q Consensus 133 ~~lv---~~~~dp~~~~~~~ai~~~~~~~e~a~f~Gd~~l~~~~~~~~gl--eFDGl~~~I~~--~~NviDarG~-~l~~ 204 (479) +..+ -+++|+...+ .+.+.-.+.+||+++|+|-.....+ .+|+ .++||..+|.. |+||+|+.+. .++. T Consensus 173 aqAs~~q~Gvsn~~ese-~drk~~~av~iEkalI~G~~~~~~~---~~g~~R~m~GIl~~vr~~~~gnVv~a~~~t~~s~ 248 (418) T protein:vir:10 173 ARASYAEAGYSNITESR-RDCMDFHATEQETAIFFGQAFMGTY---NGQPLHTTQGIVDAVRQYAPDNVNAMPNPTAVTY 248 (418) T ss_pred hhhccccccCchHHHHH-HHHHHHHHHHHHHHHhcccccCCCc---CCcchhhHHHHHHHHhhhcccceeccCCCCccCH Confidence 5542 3567887666 4555555669999999997553322 2243 68999987753 7999999997 6899 Q ss_pred HHhhhhhhee---ecccCceee-----eecChHHHhhHHHhhcCceeEEeecCCCccccCccccceecCceeEEecCCcc Q lcl|NC_018863. 205 ATLNKAAVIV---GKGYGRATD-----AFMPIGVQADFTNNLLDRQRVIQPSQAGGFSTGFSINQFLSTRGAINLHGSTI 276 (479) Q Consensus 205 ~~l~~aa~~i---~~~fG~atd-----~~mp~~vka~f~q~~~~~qrv~~~~n~g~~~~G~~V~~~~ss~g~I~L~~s~v 276 (479) +.|.++...+ +.+-|..++ +++|...|.+++..+- .-|.. ......|..+..|...+|.|.|.-..+ T Consensus 249 d~l~~a~~~af~~g~~~G~~~q~~~f~~~V~~~~k~~I~k~~~-~I~~~----~~e~~~G~vv~~~~~~~G~I~L~~~p~ 323 (418) T protein:vir:10 249 DDVVDATIDAFKWSVNVGDNTQRVMFCDTVGMRTMQDIGRFFG-EVTVT----QRETSYGMVFTEWKFFKGRLILKEHPL 323 (418) T ss_pred HHHHHHHHHHhhccCCCcccccceeEEEEeChHHHHHhhhhhh-heeec----ccceeeeEEEEEEEcceEEEEeecccc Confidence 9988776554 445677766 4559999999988863 32333 333577999999999999997765542 Q ss_pred cCCCccccCcccCCCCCcccc-----------------------eEEEeecccccCcccccccceeeEEEEEEEcCCCCc Q lcl|NC_018863. 277 MENDNILVDRIPEPNAPQAPA-----------------------SVVATVKVNDKGAFRPVKDIKTHSYKVVVHSDDAES 333 (479) Q Consensus 277 ~~a~~~lver~~s~~aP~~P~-----------------------~vta~~~~~~~g~~~~~sd~g~Y~YkV~a~n~~GES 333 (479) ..+=+.-..++.=-+-+...- .+.+..+...+.. .+.= +=.|.+...|..|- T Consensus 324 ~~~~~lp~g~mlVvD~~~vkL~~L~~R~~~~E~l~k~G~~~~~~~~~~~~~~~~D~~--kG~i--v~E~tLe~~N~~a~- 398 (418) T protein:vir:10 324 FSAIGISPGFAVVVDVPAVKLAYMDGRNAKVENYGQGGGENKSGATDYSYGHGVDAQ--GGSL--TSEWALELLNPQGC- 398 (418) T ss_pred cccccCCCceEEEEccccceEEEeccccccchhcccCCCcccccccccccccccccc--cceE--EEEeeeeeecccce- Confidence 111111111100001111100 0111100110110 0000 11345555554432 Q ss_pred ccccceeeeeecCCCeEEEEEeecCCccccc Q lcl|NC_018863. 334 LASEAVTAVVANPTDSVSLAVKLQSLYQAKP 364 (479) Q Consensus 334 ~~S~~VtaT~a~~~~~V~LtIt~~~~~~~~~ 364 (479) +.+++. +.++-++.+.+ ..| T Consensus 399 -------avitgl-~~~~~~~~~t~---p~~ 418 (418) T protein:vir:10 399 -------AVITGL-QKAKERVYLTA---PAP 418 (418) T ss_pred -------EEeecc-ceecccccCCC---CCC Confidence 222232 33333322211 111 No 74 >protein:vir:5739 Length: 366 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:122 # MgeName: PY54 # Cross-refs: genbank:acc:NP_892050;genbank:gi:33770513;interpro:IPR006444;uniprot:Q7Y410;genbank:GeneID:1732928 Probab=94.70 E-value=0.0035 Score=34.06 Aligned_cols=317 Identities=11% Similarity=0.087 Sum_probs=143.6 Q ss_pred Ccccccccceee---ee-cC---chhHHHHHHHHHHHhhcCc---ccCcccccCccccchhhhHHHHHHHhhccccccch Q lcl|NC_018863. 1 MTELQKEQKVEA---RK-LP---AGAEAELAELVSKSFTTGT---GITPDTQHDAAALRRELLDDQVKMLAFTNGDFTIY 70 (479) Q Consensus 1 ~~~~~~~~~~~~---~~-~~---~~~~~~~~e~~~Ksf~ag~---~~~~~~~~~gaAlr~esld~~i~~l~~~~~~f~~~ 70 (479) -+|..+.+.... ++ +. .+.... .+.-.+.+...- .+.. +-..|+.|-.+.+..+|..+..... .+ T Consensus 21 ~~~~~~~kg~~~~~~~~a~a~~~g~~~~a-~~~a~~~~~~~~~~~a~~~-~~~~Gg~lvP~~~~~~ii~~l~~~s---~l 95 (366) T protein:vir:57 21 KEELQQYKGAGMTRMVMSIAAGKGNLADA-AKFAATELGDTGLSMAIST-AAGSGGALIPQNMQNEVIELLRDRT---VV 95 (366) T ss_pred ccccccccchhHHHHHHHHHhcccchhHH-HHHHHHhhcchhhhhhccc-cccCCccccchhHHHHHHHHHhhhc---ch Confidence 111111111110 00 00 000000 011112211000 0111 2234777878888888866665433 33 Q ss_pred hhhccchhHHHHH--HhhhhhccCcccccccccccccccccCcceEEEEEEEEeeeehhhhhhhHhhhcchhhHHHHHHH Q lcl|NC_018863. 71 PLINKQQVNSTVA--KYAVFNQHGRTGHSRFVREVGVASINDPNIRQKTVQMKFLSDTKQQSLAAGLVNNIADPMTILTE 148 (479) Q Consensus 71 ~~i~k~~~~stv~--~y~~~~~~G~~g~~~fv~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lv~~~~dp~~~~~~ 148 (479) ..+.-+.+.-.-. +|.++ .+.....+++|++..+.+++.+.+.....+=++.--.+|.-+- .++.-|.+....+ T Consensus 96 ~~lg~~~v~~~~g~~~~p~~---t~~~~a~wv~E~~~~~~s~~~f~~i~~~~~k~~~~~~iS~ell-~ds~~~~~~~i~~ 171 (366) T protein:vir:57 96 RILGARSIPLPNGNLSMPRL---SGGATAGYVGEGKDVVATGATFDDVKLSAKTMIALVPVSNQLI-GRAGFNVEQLLLG 171 (366) T ss_pred hhhceeeeecCCCceEEEEE---eCCcceeeeccCccccccccceeEEEEeeEEEEEeehhhHHHH-hhhhHHHHHHHHH Confidence 3332222211111 22222 2334567899999999999999999999999988777775432 2445577888889 Q ss_pred HHHHHHHHHHHHHHhhcccccCCCCCCcccchhhhHHHhhccCCcEEEccCCCCCHHHhhhhh------heeecccCcee Q lcl|NC_018863. 149 DAISVIAKSIEWAIFYGDAALAAEADNQAGIEFDGLTKLIDEATNVIDLKGERLDEATLNKAA------VIVGKGYGRAT 222 (479) Q Consensus 149 ~ai~~~~~~~e~a~f~Gd~~l~~~~~~~~gleFDGl~~~I~~~~NviDarG~~l~~~~l~~aa------~~i~~~fG~at 222 (479) +-...+++.++.++++||-. |-+..||.+.........+.-|..++...+.... ...-..+.... T Consensus 172 ~l~~a~~~~~d~a~l~G~G~---------~~~p~Gi~~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~~~~~~~~~~~~a 242 (366) T protein:vir:57 172 DILSAIATREDKAFLRDDGT---------GDTPKGMKAVATAANRLVAWTGTAINLTTIDEYLDSLILKHMDSNSNMIRC 242 (366) T ss_pred HHHHHHHHHHHHHhhccCCC---------CccccceeeccccccceeeccccccchhhHHHHHHHHHHhhhccccccccC Confidence 99999999999999999843 1245688877654445667677666655443222 12222333444 Q ss_pred eeecChHHHhhHHHhhcCceeEEeecCCCccccCccccceecCceeEEec-----CCcccCC-Ccccc-CcccCCCCCcc Q lcl|NC_018863. 223 DAFMPIGVQADFTNNLLDRQRVIQPSQAGGFSTGFSINQFLSTRGAINLH-----GSTIMEN-DNILV-DRIPEPNAPQA 295 (479) Q Consensus 223 d~~mp~~vka~f~q~~~~~qrv~~~~n~g~~~~G~~V~~~~ss~g~I~L~-----~s~v~~a-~~~lv-er~~s~~aP~~ 295 (479) -..|++.+...+...-...-|-+.+...++.-.|++|.. +..-+-++- ...+.++ ..+++ ++ .. . T Consensus 243 ~~vmn~~~~~~L~~lkd~~G~~l~~~~~~g~l~G~Pvv~--s~~ip~~~~~~~~~~~i~~gdfs~~~i~~~-----~~-i 314 (366) T protein:vir:57 243 GWGLSNRTYMTLFGLRDGNGNKVYPEMSQGILKGYPIQR--TSAIPANLGDDGNESEIYFCDFNDVVIGED-----GM-M 314 (366) T ss_pred EEEecHHHHHHHHhhhccCCceeccCCCCCeecceeeEE--ccccccccccCCCccEEEEEecceEEEEEe-----cc-e Confidence 567999998888775543223333333333333444311 110000000 0000111 11110 00 00 0 Q ss_pred cceEEEeec-ccccCc---ccccccceeeEEEEEEEcCCCCcccccceeeeeecC Q lcl|NC_018863. 296 PASVVATVK-VNDKGA---FRPVKDIKTHSYKVVVHSDDAESLASEAVTAVVANP 346 (479) Q Consensus 296 P~~vta~~~-~~~~g~---~~~~sd~g~Y~YkV~a~n~~GES~~S~~VtaT~a~~ 346 (479) -..++...+ ....|. .|.. +..-+++...=+-+--.+...+..|.++= T Consensus 315 ~i~~~~ea~~~~~~g~~~~~f~~---~~~~iR~~~~~d~~v~~~~a~~~lt~~~~ 366 (366) T protein:vir:57 315 KVDFSTEATYKDADGQLVSAFAR---NQSLIRVVTEHDIGFRHPEGLVLGTGVIW 366 (366) T ss_pred EEEEeeccccccccccchhhhhc---CceeEEeeeeeCcEeeccccEEEEecccC Confidence 000000000 000000 0100 11233333333333333333333333333 No 75 >protein:vir:102119 Length: 404 # NCBI annotation: phage major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1641 # MgeName: phiSM101 # Cross-refs: genbank:acc:YP_699941;genbank:gi:110804052;genbank:GeneID:4206662 Probab=94.09 E-value=0.0054 Score=33.01 Aligned_cols=326 Identities=10% Similarity=0.063 Sum_probs=137.7 Q ss_pred CcccccccceeeeecCchh--H--------HHHHHHHHHHhhcCc-c--------cCcccccCccccchhhhHHHHHHHh Q lcl|NC_018863. 1 MTELQKEQKVEARKLPAGA--E--------AELAELVSKSFTTGT-G--------ITPDTQHDAAALRRELLDDQVKMLA 61 (479) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~--~--------~~~~e~~~Ksf~ag~-~--------~~~~~~~~gaAlr~esld~~i~~l~ 61 (479) ..++.+. ..... .+... . ..+-+.+.|...... . .+..+..+|+.+-++.+..+|..+. T Consensus 57 ~~~~~~~-~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~a~~~~~~~~gg~~vP~~~~~~ii~~~ 134 (404) T protein:vir:10 57 ENNFNED-NVKSL-NTGKEENVIYNGALFVRAIADNLLKQKNQRGLNLSEKEINAISENIDEDGGYAVPEDIQTKINTRL 134 (404) T ss_pred HHHHhhh-hcccc-ccccchhhHHHHHHHHHHHHHHHHHHHHhhhhcchhhHHhhhccccCCCCceeechhHHHHHHHHH Confidence 0000000 00000 00000 0 011112222221111 0 1112335677888888888886666 Q ss_pred hccccccchhhhccchhHHHHHHhhhhhccCcccccccccccccccc--cCcceEEEEEEEEeeeehhhhhhhHhhhcch Q lcl|NC_018863. 62 FTNGDFTIYPLINKQQVNSTVAKYAVFNQHGRTGHSRFVREVGVASI--NDPNIRQKTVQMKFLSDTKQQSLAAGLVNNI 139 (479) Q Consensus 62 ~~~~~f~~~~~i~k~~~~stv~~y~~~~~~G~~g~~~fv~E~g~~~~--~d~~~~r~~~~~k~l~~~~~vs~~~~lv~~~ 139 (479) .... .+++.+.+..+.....+|. +..+.+.....++.|++..+. .++.+.+.....+=++.-..+|.-+ +.++. T Consensus 135 ~~~~--~l~~l~~~~~~~~~~g~~~-~~~~~~~~~~~~v~e~~~~~~~~~~~~f~~i~~~~~k~~~~~~iS~el-l~ds~ 210 (404) T protein:vir:10 135 KDTT--DLYNMVDYEPVFTRSGSRT-YEKRSKQKPMKPLSENQQIPTNGDNGKLERFNFKLKDLADFMSIPNDL-LKFAD 210 (404) T ss_pred hhhh--hHhhhhceeeccCCccceE-EEEecCCcceeeccccccccccccccceeeeEeeheeeEeeehhhHHH-HhhcH Confidence 5544 3455555554543333332 233344445678999988655 3688999999998888888888743 34555 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHhhcccccCCCCCCcccchhhhHHHhhccCCcEEEccCCCCCHHHhhhhh-heeeccc Q lcl|NC_018863. 140 ADPMTILTEDAISVIAKSIEWAIFYGDAALAAEADNQAGIEFDGLTKLIDEATNVIDLKGERLDEATLNKAA-VIVGKGY 218 (479) Q Consensus 140 ~dp~~~~~~~ai~~~~~~~e~a~f~Gd~~l~~~~~~~~gleFDGl~~~I~~~~NviDarG~~l~~~~l~~aa-~~i~~~f 218 (479) .+.+....+.-...+...+|.++++|+-. |-.+.|+...-. .+.+-..+. ...+.|..+- ..+..+| T Consensus 211 ~~l~~~i~~~la~~~~~~~~~~il~G~g~---------~~~~~gi~~~~~--~~~~~~~~~-~~~~~~~~~~~~~l~~~~ 278 (404) T protein:vir:10 211 KSLEDWIINWFVDKVRITRNAEILYGAGG---------DEHATGIMTANK--FKKITLPKS-PALKDFKKCKNVELLNVF 278 (404) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhcCCC---------CCcccceeeccc--cceeecccc-ccHHHHHHHHHhhhhccc Confidence 67888888889999999999999999754 123456654432 233333333 4455555433 2344455 Q ss_pred CceeeeecChHHHhhHHHhhcCceeEEeecCCCccccCccccceecCceeEEecCCcccCCCccccCcccCCCCCcccce Q lcl|NC_018863. 219 GRATDAFMPIGVQADFTNNLLDRQRVIQPSQAGGFSTGFSINQFLSTRGAINLHGSTIMENDNILVDRIPEPNAPQAPAS 298 (479) Q Consensus 219 G~atd~~mp~~vka~f~q~~~~~qrv~~~~n~g~~~~G~~V~~~~ss~g~I~L~~s~v~~a~~~lver~~s~~aP~~P~~ 298 (479) ...-..+|++.+.+.+...-...-|.+...+..+. .++++++.+.+.++. .-|. . T Consensus 279 ~~~~~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~~------------------~~~~l~G~PV~~~~~----~~~~--~- 333 (404) T protein:vir:10 279 KATSSWIVNQDGFNYLDSLEDKTGRPYLQPDPKDP------------------TQYRFLGLPVIELPN----DLLL--S- 333 (404) T ss_pred cCCCEEEEcHHHHHHHHHhhccCCceeeccCcCCC------------------CCccccceeeEEecc----cccC--C- Confidence 44344789999988887654332233322221110 011122222111110 0000 0 Q ss_pred EEEeecccccCcccccccceeeEEEEEEEcCCCCcccccceeeeeecCCCeEEEEEeecCCccccceEEEEEeccCCCCc Q lcl|NC_018863. 299 VVATVKVNDKGAFRPVKDIKTHSYKVVVHSDDAESLASEAVTAVVANPTDSVSLAVKLQSLYQAKPQFISVYRQGNETGH 378 (479) Q Consensus 299 vta~~~~~~~g~~~~~sd~g~Y~YkV~a~n~~GES~~S~~VtaT~a~~~~~V~LtIt~~~~~~~~~~y~~IYR~t~~~g~ 378 (479) +......+ .|.++..+....+.|-+..... ........+.+.+.+.-- .+ +.+-|. . T Consensus 334 ------~~~~~~~~----~gd~s~~~~~~~~~~~~i~~~~-~~~~~~~~~~~~~~~~~r-~d------~~v~~~-----~ 390 (404) T protein:vir:10 334 ------TESAIPVL----LGDTKEAYKYVSDGAYELATTN-IGAGAFETNTTKARIIMR-ID------GNVKDS-----E 390 (404) T ss_pred ------CCCccEEE----EEeccccEEEEEecceEEEEec-cccchhhcCceEEEEEEe-ec------cEEecc-----c Confidence 00000000 1222211111111111100000 000000111221111110 00 000000 0 Q ss_pred EEEEEEeeeeeccCCC Q lcl|NC_018863. 379 YFLVARVPLSKADENG 394 (479) Q Consensus 379 ~~~i~rV~~s~~n~~~ 394 (479) .+...+++.+.. .+ T Consensus 391 a~~~~~~~~aa~--~~ 404 (404) T protein:vir:10 391 ALLIAEIPVESV--QA 404 (404) T ss_pred ceEEEEeecccC--CC Confidence 111111111110 11 No 76 >protein:vir:98525 Length: 331 # NCBI annotation: hypothetical protein predicted by GeneMark # Family: family:all:1903 # MgeID: mge:1592 # MgeName: BMP-1 # Cross-refs: genbank:acc:NP_996579;genbank:gi:45569510;genbank:GeneID:2767853 Probab=93.98 E-value=0.004 Score=33.72 Aligned_cols=313 Identities=17% Similarity=0.152 Sum_probs=138.0 Q ss_pred CcccccccceeeeecCchhHHHHHHHHHHHhhcCcccCcccccCccccchhhhHHHHHHHhhccccccchhhhccchhH- Q lcl|NC_018863. 1 MTELQKEQKVEARKLPAGAEAELAELVSKSFTTGTGITPDTQHDAAALRRELLDDQVKMLAFTNGDFTIYPLINKQQVN- 79 (479) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~e~~~Ksf~ag~~~~~~~~~~gaAlr~esld~~i~~l~~~~~~f~~~~~i~k~~~~- 79 (479) ||.|. ++++-+.++ +|=+.. + +.+..+ -|..|+.++. +|.+++=...+ T Consensus 1 m~~~~---------~~~~TL~e~----Ak~~~~------~-----~~l~~~----IIE~l~~tn~---IL~~lpf~e~N~ 49 (331) T protein:vir:98 1 MPTLS---------TTNPTLADV----AARMTP------D-----GKIDPQ----IVEMLNETNE---ILDDMTVIEANG 49 (331) T ss_pred CCccc---------cCcccHHHH----HHhcCc------c-----hhHHHH----HHHHHhcCch---HHhhceeeeccC Confidence 66552 333444443 221100 0 111111 1112232222 23333332233 Q ss_pred HHHHHhhhhhccCcccccccccccccccccCcceEEEEEEEEeeeehhhhhhhHhhh-cchhhHHHHHHHHHHHHHHHHH Q lcl|NC_018863. 80 STVAKYAVFNQHGRTGHSRFVREVGVASINDPNIRQKTVQMKFLSDTKQQSLAAGLV-NNIADPMTILTEDAISVIAKSI 158 (479) Q Consensus 80 stv~~y~~~~~~G~~g~~~fv~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lv-~~~~dp~~~~~~~ai~~~~~~~ 158 (479) .|=|.|.+ +-+.-...|-.=...-+-+.++..|++..++.|..-..|.+...-. .+..+-.++|.+.-|..+.+.+ T Consensus 50 ~t~~~~~v---rt~LP~~~fR~lN~g~~~s~~tt~q~t~~l~ilgg~~eVDk~la~~~Gn~~~~ra~e~~~~ik~m~~~~ 126 (331) T protein:vir:98 50 FTEHKTTV---RSGLPTGTWRKLNYGVQPEKSRTVQVKDSMGMLETYAEVDKALADLNGNSAAWRLSEDRAFIEGMNQTQ 126 (331) T ss_pred CccceeeE---EeccCCchhhccCCccCcccceeEEEEEEEEEeccceeechHHHhhcCCHHHHHHHHHHHHHHHHHHHH Confidence 22244433 2333334452222334567788999999999999999999876554 4456778999999999999999 Q ss_pred HHHHhhcccccCCCCCCcccchhhhHHHhhcc-----CCcEEEccCCCCCHHHhhhhhheeecccCceeeeecChHHHhh Q lcl|NC_018863. 159 EWAIFYGDAALAAEADNQAGIEFDGLTKLIDE-----ATNVIDLKGERLDEATLNKAAVIVGKGYGRATDAFMPIGVQAD 233 (479) Q Consensus 159 e~a~f~Gd~~l~~~~~~~~gleFDGl~~~I~~-----~~NviDarG~~l~~~~l~~aa~~i~~~fG~atd~~mp~~vka~ 233 (479) +..+||||+..+| -+||||.+.+.. .+|+||+.|.--..-.|+ ++.=+=....=+| |-+-++- T Consensus 127 ~~~~iyGD~a~~p-------~~F~GL~kR~~~~~a~~~~q~IdaGgtG~~~TSI~----~v~~~~~~~~giy-PkG~~~G 194 (331) T protein:vir:98 127 ATTLFYGDSSIDA-------EKFMGLTPRFNSLSAENGQNIIDAGGTGSDNASIW----LTVWGPNTLHTIY-PKGSQAG 194 (331) T ss_pred HHHHhcCCcccCh-------hhhccchhhccccccccccceeecCCCCCCceEEE----EEEEcCCeeEEec-ccccccC Confidence 9999999999887 379999998842 469999999543332222 1211112233355 8888888 Q ss_pred HHHhhcCceeEEeecCCCccccCccccceecCceeEEecCCcccCCCccccCcccCCCCCcccceEEEeecccccCcccc Q lcl|NC_018863. 234 FTNNLLDRQRVIQPSQAGGFSTGFSINQFLSTRGAINLHGSTIMENDNILVDRIPEPNAPQAPASVVATVKVNDKGAFRP 313 (479) Q Consensus 234 f~q~~~~~qrv~~~~n~g~~~~G~~V~~~~ss~g~I~L~~s~v~~a~~~lver~~s~~aP~~P~~vta~~~~~~~g~~~~ 313 (479) ++-.-++.+.+...+ .+ .=-|+ ...|..--|.--++|.-+-+=-|+-+.....+.++...- .-...-+.. ..+ T Consensus 195 l~~~d~g~~~~~~~~-G~-~y~~y-~~~~~w~~Gl~i~d~r~v~ri~NIdvs~l~~~~~~~~dl-~~lm~~a~~---~ip 267 (331) T protein:vir:98 195 LQSRDLGEDTLIDAA-GG-RYQGY-RTHYKWDIGLTLRDWRYVVRIANVDVSELTKNASAGADL-IDLMTQAVE---LIP 267 (331) T ss_pred ceEeecCceeeecCC-CC-eeeEE-EEEEEeeeeeEEcCcccEEEEeccchhccCCCcchhhhH-HHHHHHHHH---Hhc Confidence 887778887777333 22 11111 113333333333444433333333221110000000000 000000000 000 Q ss_pred cccc-eeeEEEEEEEc---CCCCcccccceeeeeecCCC-------eEEEEEeecCCccccceEEE Q lcl|NC_018863. 314 VKDI-KTHSYKVVVHS---DDAESLASEAVTAVVANPTD-------SVSLAVKLQSLYQAKPQFIS 368 (479) Q Consensus 314 ~sd~-g~Y~YkV~a~n---~~GES~~S~~VtaT~a~~~~-------~V~LtIt~~~~~~~~~~y~~ 368 (479) +-.. .++.|.=.-+- +.--+........+.-...+ +|.+..+-+ -...+.-+. T Consensus 268 ~~~~~~~~~y~n~~v~~~L~~q~~~~~~~~~~~~~~~~g~~~t~~~gipir~~da--i~~tE~~Vv 331 (331) T protein:vir:98 268 NVGMGRPAFYMPRKIRSFLRRQITNKVAASTLTMEEIAGKKVVAFDGIPCRRTDA--LLLTEARVV 331 (331) T ss_pred ccCCCCeEEEechHHHHHHHHHHhhccceeeeeeeecCCcceeEECCeeEEEeee--eecCccccC Confidence 0001 13444221110 00000000111111111100 111110000 000000000 No 77 >protein:vir:107388 Length: 331 # NCBI annotation: Bbp17 # Family: family:all:1903 # MgeID: mge:1537 # MgeName: BPP-1 # Cross-refs: genbank:acc:NP_958686;genbank:gi:41179378;genbank:GeneID:2717182 Probab=93.98 E-value=0.004 Score=33.72 Aligned_cols=313 Identities=17% Similarity=0.152 Sum_probs=138.0 Q ss_pred CcccccccceeeeecCchhHHHHHHHHHHHhhcCcccCcccccCccccchhhhHHHHHHHhhccccccchhhhccchhH- Q lcl|NC_018863. 1 MTELQKEQKVEARKLPAGAEAELAELVSKSFTTGTGITPDTQHDAAALRRELLDDQVKMLAFTNGDFTIYPLINKQQVN- 79 (479) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~e~~~Ksf~ag~~~~~~~~~~gaAlr~esld~~i~~l~~~~~~f~~~~~i~k~~~~- 79 (479) ||.|. ++++-+.++ +|=+.. + +.+..+ -|..|+.++. +|.+++=...+ T Consensus 1 m~~~~---------~~~~TL~e~----Ak~~~~------~-----~~l~~~----IIE~l~~tn~---IL~~lpf~e~N~ 49 (331) T protein:vir:10 1 MPTLS---------TTNPTLADV----AARMTP------D-----GKIDPQ----IVEMLNETNE---ILDDMTVIEANG 49 (331) T ss_pred CCccc---------cCcccHHHH----HHhcCc------c-----hhHHHH----HHHHHhcCch---HHhhceeeeccC Confidence 66552 333444443 221100 0 111111 1112232222 23333332233 Q ss_pred HHHHHhhhhhccCcccccccccccccccccCcceEEEEEEEEeeeehhhhhhhHhhh-cchhhHHHHHHHHHHHHHHHHH Q lcl|NC_018863. 80 STVAKYAVFNQHGRTGHSRFVREVGVASINDPNIRQKTVQMKFLSDTKQQSLAAGLV-NNIADPMTILTEDAISVIAKSI 158 (479) Q Consensus 80 stv~~y~~~~~~G~~g~~~fv~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lv-~~~~dp~~~~~~~ai~~~~~~~ 158 (479) .|=|.|.+ +-+.-...|-.=...-+-+.++..|++..++.|..-..|.+...-. .+..+-.++|.+.-|..+.+.+ T Consensus 50 ~t~~~~~v---rt~LP~~~fR~lN~g~~~s~~tt~q~t~~l~ilgg~~eVDk~la~~~Gn~~~~ra~e~~~~ik~m~~~~ 126 (331) T protein:vir:10 50 FTEHKTTV---RSGLPTGTWRKLNYGVQPEKSRTVQVKDSMGMLETYAEVDKALADLNGNSAAWRLSEDRAFIEGMNQTQ 126 (331) T ss_pred CccceeeE---EeccCCchhhccCCccCcccceeEEEEEEEEEeccceeechHHHhhcCCHHHHHHHHHHHHHHHHHHHH Confidence 22244433 2333334452222334567788999999999999999999876554 4456778999999999999999 Q ss_pred HHHHhhcccccCCCCCCcccchhhhHHHhhcc-----CCcEEEccCCCCCHHHhhhhhheeecccCceeeeecChHHHhh Q lcl|NC_018863. 159 EWAIFYGDAALAAEADNQAGIEFDGLTKLIDE-----ATNVIDLKGERLDEATLNKAAVIVGKGYGRATDAFMPIGVQAD 233 (479) Q Consensus 159 e~a~f~Gd~~l~~~~~~~~gleFDGl~~~I~~-----~~NviDarG~~l~~~~l~~aa~~i~~~fG~atd~~mp~~vka~ 233 (479) +..+||||+..+| -+||||.+.+.. .+|+||+.|.--..-.|+ ++.=+=....=+| |-+-++- T Consensus 127 ~~~~iyGD~a~~p-------~~F~GL~kR~~~~~a~~~~q~IdaGgtG~~~TSI~----~v~~~~~~~~giy-PkG~~~G 194 (331) T protein:vir:10 127 ATTLFYGDSSIDA-------EKFMGLTPRFNSLSAENGQNIIDAGGTGSDNASIW----LTVWGPNTLHTIY-PKGSQAG 194 (331) T ss_pred HHHHhcCCcccCh-------hhhccchhhccccccccccceeecCCCCCCceEEE----EEEEcCCeeEEec-ccccccC Confidence 9999999999887 379999998842 469999999543332222 1211112233355 8888888 Q ss_pred HHHhhcCceeEEeecCCCccccCccccceecCceeEEecCCcccCCCccccCcccCCCCCcccceEEEeecccccCcccc Q lcl|NC_018863. 234 FTNNLLDRQRVIQPSQAGGFSTGFSINQFLSTRGAINLHGSTIMENDNILVDRIPEPNAPQAPASVVATVKVNDKGAFRP 313 (479) Q Consensus 234 f~q~~~~~qrv~~~~n~g~~~~G~~V~~~~ss~g~I~L~~s~v~~a~~~lver~~s~~aP~~P~~vta~~~~~~~g~~~~ 313 (479) ++-.-++.+.+...+ .+ .=-|+ ...|..--|.--++|.-+-+=-|+-+.....+.++...- .-...-+.. ..+ T Consensus 195 l~~~d~g~~~~~~~~-G~-~y~~y-~~~~~w~~Gl~i~d~r~v~ri~NIdvs~l~~~~~~~~dl-~~lm~~a~~---~ip 267 (331) T protein:vir:10 195 LQSRDLGEDTLIDAA-GG-RYQGY-RTHYKWDIGLTLRDWRYVVRIANVDVSELTKNASAGADL-IDLMTQAVE---LIP 267 (331) T ss_pred ceEeecCceeeecCC-CC-eeeEE-EEEEEeeeeeEEcCcccEEEEeccchhccCCCcchhhhH-HHHHHHHHH---Hhc Confidence 887778887777333 22 11111 113333333333444433333333221110000000000 000000000 000 Q ss_pred cccc-eeeEEEEEEEc---CCCCcccccceeeeeecCCC-------eEEEEEeecCCccccceEEE Q lcl|NC_018863. 314 VKDI-KTHSYKVVVHS---DDAESLASEAVTAVVANPTD-------SVSLAVKLQSLYQAKPQFIS 368 (479) Q Consensus 314 ~sd~-g~Y~YkV~a~n---~~GES~~S~~VtaT~a~~~~-------~V~LtIt~~~~~~~~~~y~~ 368 (479) +-.. .++.|.=.-+- +.--+........+.-...+ +|.+..+-+ -...+.-+. T Consensus 268 ~~~~~~~~~y~n~~v~~~L~~q~~~~~~~~~~~~~~~~g~~~t~~~gipir~~da--i~~tE~~Vv 331 (331) T protein:vir:10 268 NVGMGRPAFYMPRKIRSFLRRQITNKVAASTLTMEEIAGKKVVAFDGIPCRRTDA--LLLTEARVV 331 (331) T ss_pred ccCCCCeEEEechHHHHHHHHHHhhccceeeeeeeecCCcceeEECCeeEEEeee--eecCccccC Confidence 0001 13444221110 00000000111111111100 111110000 000000000 No 78 >protein:vir:107826 Length: 331 # NCBI annotation: hypothetical protein predicted by GeneMark # Family: family:all:1903 # MgeID: mge:1673 # MgeName: BIP-1 # Cross-refs: genbank:acc:NP_996627;genbank:gi:45580761;genbank:GeneID:2767902 Probab=93.98 E-value=0.004 Score=33.72 Aligned_cols=313 Identities=17% Similarity=0.152 Sum_probs=138.0 Q ss_pred CcccccccceeeeecCchhHHHHHHHHHHHhhcCcccCcccccCccccchhhhHHHHHHHhhccccccchhhhccchhH- Q lcl|NC_018863. 1 MTELQKEQKVEARKLPAGAEAELAELVSKSFTTGTGITPDTQHDAAALRRELLDDQVKMLAFTNGDFTIYPLINKQQVN- 79 (479) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~e~~~Ksf~ag~~~~~~~~~~gaAlr~esld~~i~~l~~~~~~f~~~~~i~k~~~~- 79 (479) ||.|. ++++-+.++ +|=+.. + +.+..+ -|..|+.++. +|.+++=...+ T Consensus 1 m~~~~---------~~~~TL~e~----Ak~~~~------~-----~~l~~~----IIE~l~~tn~---IL~~lpf~e~N~ 49 (331) T protein:vir:10 1 MPTLS---------TTNPTLADV----AARMTP------D-----GKIDPQ----IVEMLNETNE---ILDDMTVIEANG 49 (331) T ss_pred CCccc---------cCcccHHHH----HHhcCc------c-----hhHHHH----HHHHHhcCch---HHhhceeeeccC Confidence 66552 333444443 221100 0 111111 1112232222 23333332233 Q ss_pred HHHHHhhhhhccCcccccccccccccccccCcceEEEEEEEEeeeehhhhhhhHhhh-cchhhHHHHHHHHHHHHHHHHH Q lcl|NC_018863. 80 STVAKYAVFNQHGRTGHSRFVREVGVASINDPNIRQKTVQMKFLSDTKQQSLAAGLV-NNIADPMTILTEDAISVIAKSI 158 (479) Q Consensus 80 stv~~y~~~~~~G~~g~~~fv~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lv-~~~~dp~~~~~~~ai~~~~~~~ 158 (479) .|=|.|.+ +-+.-...|-.=...-+-+.++..|++..++.|..-..|.+...-. .+..+-.++|.+.-|..+.+.+ T Consensus 50 ~t~~~~~v---rt~LP~~~fR~lN~g~~~s~~tt~q~t~~l~ilgg~~eVDk~la~~~Gn~~~~ra~e~~~~ik~m~~~~ 126 (331) T protein:vir:10 50 FTEHKTTV---RSGLPTGTWRKLNYGVQPEKSRTVQVKDSMGMLETYAEVDKALADLNGNSAAWRLSEDRAFIEGMNQTQ 126 (331) T ss_pred CccceeeE---EeccCCchhhccCCccCcccceeEEEEEEEEEeccceeechHHHhhcCCHHHHHHHHHHHHHHHHHHHH Confidence 22244433 2333334452222334567788999999999999999999876554 4456778999999999999999 Q ss_pred HHHHhhcccccCCCCCCcccchhhhHHHhhcc-----CCcEEEccCCCCCHHHhhhhhheeecccCceeeeecChHHHhh Q lcl|NC_018863. 159 EWAIFYGDAALAAEADNQAGIEFDGLTKLIDE-----ATNVIDLKGERLDEATLNKAAVIVGKGYGRATDAFMPIGVQAD 233 (479) Q Consensus 159 e~a~f~Gd~~l~~~~~~~~gleFDGl~~~I~~-----~~NviDarG~~l~~~~l~~aa~~i~~~fG~atd~~mp~~vka~ 233 (479) +..+||||+..+| -+||||.+.+.. .+|+||+.|.--..-.|+ ++.=+=....=+| |-+-++- T Consensus 127 ~~~~iyGD~a~~p-------~~F~GL~kR~~~~~a~~~~q~IdaGgtG~~~TSI~----~v~~~~~~~~giy-PkG~~~G 194 (331) T protein:vir:10 127 ATTLFYGDSSIDA-------EKFMGLTPRFNSLSAENGQNIIDAGGTGSDNASIW----LTVWGPNTLHTIY-PKGSQAG 194 (331) T ss_pred HHHHhcCCcccCh-------hhhccchhhccccccccccceeecCCCCCCceEEE----EEEEcCCeeEEec-ccccccC Confidence 9999999999887 379999998842 469999999543332222 1211112233355 8888888 Q ss_pred HHHhhcCceeEEeecCCCccccCccccceecCceeEEecCCcccCCCccccCcccCCCCCcccceEEEeecccccCcccc Q lcl|NC_018863. 234 FTNNLLDRQRVIQPSQAGGFSTGFSINQFLSTRGAINLHGSTIMENDNILVDRIPEPNAPQAPASVVATVKVNDKGAFRP 313 (479) Q Consensus 234 f~q~~~~~qrv~~~~n~g~~~~G~~V~~~~ss~g~I~L~~s~v~~a~~~lver~~s~~aP~~P~~vta~~~~~~~g~~~~ 313 (479) ++-.-++.+.+...+ .+ .=-|+ ...|..--|.--++|.-+-+=-|+-+.....+.++...- .-...-+.. ..+ T Consensus 195 l~~~d~g~~~~~~~~-G~-~y~~y-~~~~~w~~Gl~i~d~r~v~ri~NIdvs~l~~~~~~~~dl-~~lm~~a~~---~ip 267 (331) T protein:vir:10 195 LQSRDLGEDTLIDAA-GG-RYQGY-RTHYKWDIGLTLRDWRYVVRIANVDVSELTKNASAGADL-IDLMTQAVE---LIP 267 (331) T ss_pred ceEeecCceeeecCC-CC-eeeEE-EEEEEeeeeeEEcCcccEEEEeccchhccCCCcchhhhH-HHHHHHHHH---Hhc Confidence 887778887777333 22 11111 113333333333444433333333221110000000000 000000000 000 Q ss_pred cccc-eeeEEEEEEEc---CCCCcccccceeeeeecCCC-------eEEEEEeecCCccccceEEE Q lcl|NC_018863. 314 VKDI-KTHSYKVVVHS---DDAESLASEAVTAVVANPTD-------SVSLAVKLQSLYQAKPQFIS 368 (479) Q Consensus 314 ~sd~-g~Y~YkV~a~n---~~GES~~S~~VtaT~a~~~~-------~V~LtIt~~~~~~~~~~y~~ 368 (479) +-.. .++.|.=.-+- +.--+........+.-...+ +|.+..+-+ -...+.-+. T Consensus 268 ~~~~~~~~~y~n~~v~~~L~~q~~~~~~~~~~~~~~~~g~~~t~~~gipir~~da--i~~tE~~Vv 331 (331) T protein:vir:10 268 NVGMGRPAFYMPRKIRSFLRRQITNKVAASTLTMEEIAGKKVVAFDGIPCRRTDA--LLLTEARVV 331 (331) T ss_pred ccCCCCeEEEechHHHHHHHHHHhhccceeeeeeeecCCcceeEECCeeEEEeee--eecCccccC Confidence 0001 13444221110 00000000111111111100 111110000 000000000 No 79 >protein:vir:4600 Length: 415 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:101 # MgeName: PVL # Cross-refs: genbank:acc:NP_058445;genbank:gi:9635171;genbank:GeneID:1262708 Probab=93.79 E-value=0.0064 Score=32.63 Aligned_cols=320 Identities=13% Similarity=0.026 Sum_probs=139.9 Q ss_pred Cccc----------ccccceeeeecCc----------------hhHHHHHHHHHHHhhcCccc--CcccccCccccchhh Q lcl|NC_018863. 1 MTEL----------QKEQKVEARKLPA----------------GAEAELAELVSKSFTTGTGI--TPDTQHDAAALRREL 52 (479) Q Consensus 1 ~~~~----------~~~~~~~~~~~~~----------------~~~~~~~e~~~Ksf~ag~~~--~~~~~~~gaAlr~es 52 (479) +.++ +++.+....+... ....+. ..|.+....+... ...+-.+|+.+..+. T Consensus 58 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~t~~g~~~iP~~ 136 (415) T protein:vir:46 58 LDKLKEKDRTSENNQQSVEVNEARTYRNQANINDLGISIQNTKVTSQEV-RDFTEYLETRNDIQGGSLKTDSGFVVIPEE 136 (415) T ss_pred HHHHHHHHHhhhhcccccccchhhhhHHHHHHHHHHHhhhhhhhhHHHH-HHHHHHHhhhhhhhhccccccCCcccccHH Confidence 1000 0000100000000 000011 1222222222111 111224678899999 Q ss_pred hHHHHHHHhhccccccchhhhccchhHHHHHHhhhhhccCccccccccccccccc-ccCcceEEEEEEEEeeeehhhhhh Q lcl|NC_018863. 53 LDDQVKMLAFTNGDFTIYPLINKQQVNSTVAKYAVFNQHGRTGHSRFVREVGVAS-INDPNIRQKTVQMKFLSDTKQQSL 131 (479) Q Consensus 53 ld~~i~~l~~~~~~f~~~~~i~k~~~~stv~~y~~~~~~G~~g~~~fv~E~g~~~-~~d~~~~r~~~~~k~l~~~~~vs~ 131 (479) +...|..+...... +++.+....+.+.-..|......+ .....+++|++..+ .+++.+.......+-++.-..+|. T Consensus 137 ~~~~ii~~~~~~~~--l~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~v~Eg~~~~~~~~~~~~~v~~~~~k~~~~~~iS~ 213 (415) T protein:vir:46 137 IVTDILKLKEVEFN--LDKYVTVKRVTNGSGKYPVVRQSE-VAALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISR 213 (415) T ss_pred HHHHHHHHHHhhhh--hhhhcceeeccCCceeEEEEEecC-CcceeecccccccccccccceeeEEeeeeeeEeeehhhH Confidence 99988666655543 444455445554444454443333 34566899998776 578999999999999998887776 Q ss_pred hHhhhcchhhHHHHHHHHHHHHHHHHHHHHHhhcccccCCCCCCcccchhhhHHHhhccCCcEEEccCCCCCHHHhhhhh Q lcl|NC_018863. 132 AAGLVNNIADPMTILTEDAISVIAKSIEWAIFYGDAALAAEADNQAGIEFDGLTKLIDEATNVIDLKGERLDEATLNKAA 211 (479) Q Consensus 132 ~~~lv~~~~dp~~~~~~~ai~~~~~~~e~a~f~Gd~~l~~~~~~~~gleFDGl~~~I~~~~NviDarG~~l~~~~l~~aa 211 (479) -+- .++..|.+....+.....++..++.+++.|+-.=.+.+ +...... ..+.....+... .+.|.++- T Consensus 214 ell-~ds~~~l~~~i~~~l~~~i~~~~d~~il~g~g~g~~~~---------~~~~~~~-~~~~~~~~~~~~-~~~i~~~~ 281 (415) T protein:vir:46 214 EAI-EDAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTGS---------TSSGFEK-EGKKLEVKKAKS-LDDIKDAI 281 (415) T ss_pred HHH-hhchHHHHHHHHHHHHHHHHHHHHHHHhhccccCCccc---------ccccccc-ccceeccccccc-hHHHHHHH Confidence 432 34556788889999999999999999999985422211 1111111 233443333333 33333333 Q ss_pred heeecccCceeeeecChHHHhhHHHhhcCceeEEeecCCCc----cccCccccceecCceeEEecCC--cccCC-C-ccc Q lcl|NC_018863. 212 VIVGKGYGRATDAFMPIGVQADFTNNLLDRQRVIQPSQAGG----FSTGFSINQFLSTRGAINLHGS--TIMEN-D-NIL 283 (479) Q Consensus 212 ~~i~~~fG~atd~~mp~~vka~f~q~~~~~qrv~~~~n~g~----~~~G~~V~~~~ss~g~I~L~~s--~v~~a-~-~~l 283 (479) -.+...+...+-.+|++.+.+.+...-...-|.+...+..+ .-.|.+|.-... -+.--.++ .+.++ . .++ T Consensus 282 ~~~~~~~~~~~~~v~n~~~~~~L~~lkd~~G~~i~~~~~~~~~~~~l~G~pV~~~~~--~~~~~~~~~~~~~gd~~~~~~ 359 (415) T protein:vir:46 282 NLNVKPNYEHNVAIVSQTMFAKLDKMKDKLGNYLIQPDVKEKTQQRLLGAKIEILPD--EVLGQKGNNTLIIGNLKDAIV 359 (415) T ss_pred HhhhhhccCCCEEEEcHHHHHHHHHhhccCCCeeeccCcCCCCCccccceeeEEecc--ccccCCCccEEEEEehhccEE Confidence 33334455566788999998888764433223333222221 223444311100 00000000 01110 0 011 Q ss_pred -cCcccCCCCCcccceEEEeecccccCcccccccceeeEEEEEEEcCCCCcccccceeeeeecCCCeEEEEE Q lcl|NC_018863. 284 -VDRIPEPNAPQAPASVVATVKVNDKGAFRPVKDIKTHSYKVVVHSDDAESLASEAVTAVVANPTDSVSLAV 354 (479) Q Consensus 284 -ver~~s~~aP~~P~~vta~~~~~~~g~~~~~sd~g~Y~YkV~a~n~~GES~~S~~VtaT~a~~~~~V~LtI 354 (479) .+|. ...+.. +....+.+. -.+...+=+.+.+. .+.....-++++...|..-|.. T Consensus 360 ~~~~~--------~~~v~~----~~~~~~~~~-~~~~~r~d~~v~~~---~a~~~~~~~~~~~~~~~~~~~~ 415 (415) T protein:vir:46 360 LFDRS--------QYQASW----TDYMHFGEC-LMIAVRQDCRILDY---KSAIVIEYDDSERGEGDLGLEA 415 (415) T ss_pred EEeec--------ceEEEe----eccccCceE-EEEEEEeccEEecc---ccEEEEEeeccCCCCCCccCCC Confidence 0110 000000 001111110 00111111122221 1111111111222233333333 No 80 >protein:vir:4700 Length: 415 # NCBI annotation: phi PVL ORF 7 homologue # Family: family:all:21 # MgeID: mge:102 # MgeName: phiPV83 # Cross-refs: genbank:acc:NP_061632;genbank:gi:9635719;genbank:GeneID:1262976 Probab=93.79 E-value=0.0064 Score=32.63 Aligned_cols=320 Identities=13% Similarity=0.026 Sum_probs=139.9 Q ss_pred Cccc----------ccccceeeeecCc----------------hhHHHHHHHHHHHhhcCccc--CcccccCccccchhh Q lcl|NC_018863. 1 MTEL----------QKEQKVEARKLPA----------------GAEAELAELVSKSFTTGTGI--TPDTQHDAAALRREL 52 (479) Q Consensus 1 ~~~~----------~~~~~~~~~~~~~----------------~~~~~~~e~~~Ksf~ag~~~--~~~~~~~gaAlr~es 52 (479) +.++ +++.+....+... ....+. ..|.+....+... ...+-.+|+.+..+. T Consensus 58 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~t~~g~~~iP~~ 136 (415) T protein:vir:47 58 LDKLKEKDRTSENNQQSVEVNEARTYRNQANINDLGISIQNTKVTSQEV-RDFTEYLETRNDIQGGSLKTDSGFVVIPEE 136 (415) T ss_pred HHHHHHHHHhhhhcccccccchhhhhHHHHHHHHHHHhhhhhhhhHHHH-HHHHHHHhhhhhhhhccccccCCcccccHH Confidence 1000 0000100000000 000011 1222222222111 111224678899999 Q ss_pred hHHHHHHHhhccccccchhhhccchhHHHHHHhhhhhccCccccccccccccccc-ccCcceEEEEEEEEeeeehhhhhh Q lcl|NC_018863. 53 LDDQVKMLAFTNGDFTIYPLINKQQVNSTVAKYAVFNQHGRTGHSRFVREVGVAS-INDPNIRQKTVQMKFLSDTKQQSL 131 (479) Q Consensus 53 ld~~i~~l~~~~~~f~~~~~i~k~~~~stv~~y~~~~~~G~~g~~~fv~E~g~~~-~~d~~~~r~~~~~k~l~~~~~vs~ 131 (479) +...|..+...... +++.+....+.+.-..|......+ .....+++|++..+ .+++.+.......+-++.-..+|. T Consensus 137 ~~~~ii~~~~~~~~--l~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~v~Eg~~~~~~~~~~~~~v~~~~~k~~~~~~iS~ 213 (415) T protein:vir:47 137 IVTDILKLKEVEFN--LDKYVTVKRVTNGSGKYPVVRQSE-VAALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISR 213 (415) T ss_pred HHHHHHHHHHhhhh--hhhhcceeeccCCceeEEEEEecC-CcceeecccccccccccccceeeEEeeeeeeEeeehhhH Confidence 99988666655543 444455445554444454443333 34566899998776 578999999999999998887776 Q ss_pred hHhhhcchhhHHHHHHHHHHHHHHHHHHHHHhhcccccCCCCCCcccchhhhHHHhhccCCcEEEccCCCCCHHHhhhhh Q lcl|NC_018863. 132 AAGLVNNIADPMTILTEDAISVIAKSIEWAIFYGDAALAAEADNQAGIEFDGLTKLIDEATNVIDLKGERLDEATLNKAA 211 (479) Q Consensus 132 ~~~lv~~~~dp~~~~~~~ai~~~~~~~e~a~f~Gd~~l~~~~~~~~gleFDGl~~~I~~~~NviDarG~~l~~~~l~~aa 211 (479) -+- .++..|.+....+.....++..++.+++.|+-.=.+.+ +...... ..+.....+... .+.|.++- T Consensus 214 ell-~ds~~~l~~~i~~~l~~~i~~~~d~~il~g~g~g~~~~---------~~~~~~~-~~~~~~~~~~~~-~~~i~~~~ 281 (415) T protein:vir:47 214 EAI-EDAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTGS---------TSSGFEK-EGKKLEVKKAKS-LDDIKDAI 281 (415) T ss_pred HHH-hhchHHHHHHHHHHHHHHHHHHHHHHHhhccccCCccc---------ccccccc-ccceeccccccc-hHHHHHHH Confidence 432 34556788889999999999999999999985422211 1111111 233443333333 33333333 Q ss_pred heeecccCceeeeecChHHHhhHHHhhcCceeEEeecCCCc----cccCccccceecCceeEEecCC--cccCC-C-ccc Q lcl|NC_018863. 212 VIVGKGYGRATDAFMPIGVQADFTNNLLDRQRVIQPSQAGG----FSTGFSINQFLSTRGAINLHGS--TIMEN-D-NIL 283 (479) Q Consensus 212 ~~i~~~fG~atd~~mp~~vka~f~q~~~~~qrv~~~~n~g~----~~~G~~V~~~~ss~g~I~L~~s--~v~~a-~-~~l 283 (479) -.+...+...+-.+|++.+.+.+...-...-|.+...+..+ .-.|.+|.-... -+.--.++ .+.++ . .++ T Consensus 282 ~~~~~~~~~~~~~v~n~~~~~~L~~lkd~~G~~i~~~~~~~~~~~~l~G~pV~~~~~--~~~~~~~~~~~~~gd~~~~~~ 359 (415) T protein:vir:47 282 NLNVKPNYEHNVAIVSQTMFAKLDKMKDKLGNYLIQPDVKEKTQQRLLGAKIEILPD--EVLGQKGNNTLIIGNLKDAIV 359 (415) T ss_pred HhhhhhccCCCEEEEcHHHHHHHHHhhccCCCeeeccCcCCCCCccccceeeEEecc--ccccCCCccEEEEEehhccEE Confidence 33334455566788999998888764433223333222221 223444311100 00000000 01110 0 011 Q ss_pred -cCcccCCCCCcccceEEEeecccccCcccccccceeeEEEEEEEcCCCCcccccceeeeeecCCCeEEEEE Q lcl|NC_018863. 284 -VDRIPEPNAPQAPASVVATVKVNDKGAFRPVKDIKTHSYKVVVHSDDAESLASEAVTAVVANPTDSVSLAV 354 (479) Q Consensus 284 -ver~~s~~aP~~P~~vta~~~~~~~g~~~~~sd~g~Y~YkV~a~n~~GES~~S~~VtaT~a~~~~~V~LtI 354 (479) .+|. ...+.. +....+.+. -.+...+=+.+.+. .+.....-++++...|..-|.. T Consensus 360 ~~~~~--------~~~v~~----~~~~~~~~~-~~~~~r~d~~v~~~---~a~~~~~~~~~~~~~~~~~~~~ 415 (415) T protein:vir:47 360 LFDRS--------QYQASW----TDYMHFGEC-LMIAVRQDCRILDY---KSAIVIEYDDSERGEGDLGLEA 415 (415) T ss_pred EEeec--------ceEEEe----eccccCceE-EEEEEEeccEEecc---ccEEEEEeeccCCCCCCccCCC Confidence 0110 000000 001111110 00111111122221 1111111111222233333333 No 81 >protein:vir:1268 Length: 397 # NCBI annotation: hypothetical protein # Family: family:all:21 # MgeID: mge:329 # MgeName: phi-105 # Cross-refs: genbank:acc:NP_690760;genbank:gi:22855000;genbank:GeneID:955203 Probab=93.61 E-value=0.007 Score=32.41 Aligned_cols=292 Identities=10% Similarity=0.043 Sum_probs=133.2 Q ss_pred Cccccc---ccceeeeecCchhHHHHHHHHHHHhhcCcc---------------cCcccccCccccchhhhHHHHHHHhh Q lcl|NC_018863. 1 MTELQK---EQKVEARKLPAGAEAELAELVSKSFTTGTG---------------ITPDTQHDAAALRRELLDDQVKMLAF 62 (479) Q Consensus 1 ~~~~~~---~~~~~~~~~~~~~~~~~~e~~~Ksf~ag~~---------------~~~~~~~~gaAlr~esld~~i~~l~~ 62 (479) .+...+ ..+...... .....+...+|.|++..+.. .+..+-.+|+.|-.+.+.+.|..+.. T Consensus 70 ~~~~~~~~~~~~~~~~~~-~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~gg~lvP~~~~~~ii~~~~ 148 (397) T protein:vir:12 70 VPEQERNPEGQRSQGQGN-EERQQQYSKAFLKGLRGKRLTDEERDLLDSPEFRAMSGINDEDGGILIPEDIGRQIHEFKR 148 (397) T ss_pred hhhhhhhhcccccccchh-hHHHHHHHHHHHHHHhccCCcHHHHHHHhhhhhhhccccccccCcccCchhHHHHHHHhhh Confidence 010000 000000000 11111222344555443321 11123456788889999888877766 Q ss_pred ccccccchhhhccchhHHHHHHhhhhhccCccccccccccccccc-ccCcceEEEEEEEEeeeehhhhhhhHhhhcchhh Q lcl|NC_018863. 63 TNGDFTIYPLINKQQVNSTVAKYAVFNQHGRTGHSRFVREVGVAS-INDPNIRQKTVQMKFLSDTKQQSLAAGLVNNIAD 141 (479) Q Consensus 63 ~~~~f~~~~~i~k~~~~stv~~y~~~~~~G~~g~~~fv~E~g~~~-~~d~~~~r~~~~~k~l~~~~~vs~~~~lv~~~~d 141 (479) .... +++.+....+.+...+|....+.++. ...+++|++..+ .+++.+.......+-++.--.+|.-+- .++..| T Consensus 149 ~~~~--l~~~~~~~~~~~~~~~~~~~~~~~~~-~a~~v~Eg~~~~~~~~~~~~~v~~~~~k~~~~~~is~e~l-~ds~~~ 224 (397) T protein:vir:12 149 QFEP--LEQYVTVEPVTTRSGTRLLEKNADMV-PFSPVEELGNLPEIDQPRFTKVSYSIIDYGGIMTLSNSML-NDSDQA 224 (397) T ss_pred hhhh--HHhhcceeeccCCceeEEEEEecCCc-ceeeecccccccccccccceeEEeeheeeEeeehhhHHHH-hhchHH Confidence 5543 45555555555433444333333333 466999998755 678999999999998888777666432 344567 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhhcccccCCCCCCcccchhhhHHHhhccCCcEEEccCCCCCHHHhhhhhheeecccCce Q lcl|NC_018863. 142 PMTILTEDAISVIAKSIEWAIFYGDAALAAEADNQAGIEFDGLTKLIDEATNVIDLKGERLDEATLNKAAVIVGKGYGRA 221 (479) Q Consensus 142 p~~~~~~~ai~~~~~~~e~a~f~Gd~~l~~~~~~~~gleFDGl~~~I~~~~NviDarG~~l~~~~l~~aa~~i~~~fG~a 221 (479) .+....+.-...+++.++.++++|+..-.|.+. +-+|++.+.+.. .+...|... T Consensus 225 l~~~i~~~l~~~~~~~~d~~il~G~g~~~~~g~----~~~~~i~~~~~~----------------------~l~~~~~~~ 278 (397) T protein:vir:12 225 IMTYVAKWFAKKSVVTRNNLILAAIASLKKVDI----DGLDGIKKALNV----------------------TLDPMVAPG 278 (397) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhcccccccccc----ccHHHHHHHHhh----------------------ccchhhhCC Confidence 788888899999999999999999988665432 446777665531 111123333 Q ss_pred eeeecChHHHhhHHHhhcCceeEEeecCCCccccCccccceecCceeEEecCCcccCCCccccCc-ccCCCCCcccceEE Q lcl|NC_018863. 222 TDAFMPIGVQADFTNNLLDRQRVIQPSQAGGFSTGFSINQFLSTRGAINLHGSTIMENDNILVDR-IPEPNAPQAPASVV 300 (479) Q Consensus 222 td~~mp~~vka~f~q~~~~~qrv~~~~n~g~~~~G~~V~~~~ss~g~I~L~~s~v~~a~~~lver-~~s~~aP~~P~~vt 300 (479) ...+|++.+.+.+...-...-|.+...+..+.. +.++++.+.+.++. .+...+...+ ... T Consensus 279 a~~~~n~~~~~~L~~lkd~~G~~l~~~~~~~g~------------------~~~l~G~pv~~~~~~~~~~~~~~~~-~~~ 339 (397) T protein:vir:12 279 SIVLTNQDGYDWLDTLKDGTGRYLLQPDPTNPT------------------KKLLDGRPVVPFTNRVLKTQKGKAP-LII 339 (397) T ss_pred CEEEEcHHHHHHHHHhhccCCceeecccccCCC------------------CccccceeeEEecccccccCCCccE-EEE Confidence 347788888888876543322222222211110 01111111111110 0000000000 000 Q ss_pred ----------------EeecccccCcccccccceeeEEEEEEEcCCCCcccccceeeeeecCCCeEEEEEeec Q lcl|NC_018863. 301 ----------------ATVKVNDKGAFRPVKDIKTHSYKVVVHSDDAESLASEAVTAVVANPTDSVSLAVKLQ 357 (479) Q Consensus 301 ----------------a~~~~~~~g~~~~~sd~g~Y~YkV~a~n~~GES~~S~~VtaT~a~~~~~V~LtIt~~ 357 (479) ..........|.. +...|++...-+.+ +.....-+.|++|-- T Consensus 340 gd~~~~~~~~~~~~~~i~~~~~~~~~f~~----~~~~~r~~~r~d~~-----------~~~~~a~~~~~~t~~ 397 (397) T protein:vir:12 340 GNLKEAIVLFDREQQSIASTDTGAGAFET----NSTKVRGIEREDVR-----------KWDEDAVVFGQITVE 397 (397) T ss_pred EehhceEEEEeecceEEEEeccccchhhc----CceEEEEEEeeccE-----------EecccceEEEEEeeC Confidence 0000000000000 00111111111110 111111111222211 No 82 >protein:vir:2344 Length: 397 # NCBI annotation: gp14 # Family: family:all:507 # MgeID: mge:51 # MgeName: Bxb1 # Cross-refs: genbank:acc:NP_075281;genbank:gi:12657868;genbank:GeneID:920118 Probab=92.63 E-value=0.011 Score=31.41 Aligned_cols=319 Identities=11% Similarity=0.071 Sum_probs=146.2 Q ss_pred cCchhHHHHHHHHHHHhhcCcccCcccccCccccchhhhHHHHHHHhhccccccchhhhccchhHHHHHHhhhhhccCcc Q lcl|NC_018863. 15 LPAGAEAELAELVSKSFTTGTGITPDTQHDAAALRRELLDDQVKMLAFTNGDFTIYPLINKQQVNSTVAKYAVFNQHGRT 94 (479) Q Consensus 15 ~~~~~~~~~~e~~~Ksf~ag~~~~~~~~~~gaAlr~esld~~i~~l~~~~~~f~~~~~i~k~~~~stv~~y~~~~~~G~~ 94 (479) |-.++|.+. +.+++ +..+|+.|-.|...+-|..|. +...+.+...+.+..+.-.+|.++ .+. T Consensus 1 ~g~~~e~~~------~~~~~------t~~~~g~l~~~~~~~ii~~l~---~~s~i~~l~~~~~~~~~~~~ip~~---~~~ 62 (397) T protein:vir:23 1 MGFSADHSQ------IAQTK------DTMFTGYLDPVQAKDYFAEAE---KTSIVQRVAQKIPMGATGIVIPHW---TGD 62 (397) T ss_pred CCcCHHHHH------Hhhcc------CCCCccccchhHHHHHHHHHH---hccchhhhcceeeccCCceEEEEE---cCC Confidence 888887663 12221 223345566666555454443 223345544444554433344333 334 Q ss_pred cccccccccccccccCcceEEEEEEEEeeeehhhhhhhHhhhcchhhHHHHHHHHHHHHHHHHHHHHHhhcccccCCCCC Q lcl|NC_018863. 95 GHSRFVREVGVASINDPNIRQKTVQMKFLSDTKQQSLAAGLVNNIADPMTILTEDAISVIAKSIEWAIFYGDAALAAEAD 174 (479) Q Consensus 95 g~~~fv~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lv~~~~dp~~~~~~~ai~~~~~~~e~a~f~Gd~~l~~~~~ 174 (479) ....|++|++..+.+++.+.+....+|=++--..+|.-+-. ++..|.+....+.-...+++.+|.++++|+..-. T Consensus 63 ~~a~wv~Eg~~~~~s~~~f~~v~l~~~k~~~~v~iS~ell~-ds~~~l~~~i~~~l~~aia~~~d~a~l~G~gt~~---- 137 (397) T protein:vir:23 63 VSAQWIGEGDMKPITKGNMTKRDVHPAKIATIFVASAETVR-ANPANYLGTMRTKVATAIAMAFDNAALHGTNAPS---- 137 (397) T ss_pred cceEEecCCccccccccceeEEEEeeEEEEEeehhhHHHHh-cchHHHHHHHHHHHHHHHHHHHHHHHhhcccCCc---- Confidence 45679999999999999999999999999988888875433 4567888999999999999999999999997511 Q ss_pred CcccchhhhHHHhhccCCcEEEccCCCCCHHHhhhhhheeecccCceeeeecChHHHhhHHHhhcCceeEE-eecCCCc- Q lcl|NC_018863. 175 NQAGIEFDGLTKLIDEATNVIDLKGERLDEATLNKAAVIVGKGYGRATDAFMPIGVQADFTNNLLDRQRVI-QPSQAGG- 252 (479) Q Consensus 175 ~~~gleFDGl~~~I~~~~NviDarG~~l~~~~l~~aa~~i~~~fG~atd~~mp~~vka~f~q~~~~~qrv~-~~~n~g~- 252 (479) ...|+...- ....-..+.......++ +...+...+....-..|++.+...+...-...-|-+ +++..+. T Consensus 138 -----~~~~~~~~~---~~~~~~~~~~~~~~~~~-~~~~l~~~~~~~a~~vmn~~~~~~L~~lkd~~G~~i~~~~~~~~~ 208 (397) T protein:vir:23 138 -----AFQGYLDQS---NKTQSISPNAYQGLGVS-GLTKLVTDGKKWTHTLLDDTVEPVLNGSVDANGRPLFVESTYESL 208 (397) T ss_pred -----ccccccccc---cceeeecccchhHHHHH-HHHhhhhcccCCCEEEEcHHHHHHHHHhhccCCceeecccccccc Confidence 123433322 22222333333333343 333455677777889999999998887543322222 2221111 Q ss_pred -------cccCcccc--ceec-----------C------ceeEEecCC--------------c---cc-CCCccccC-c- Q lcl|NC_018863. 253 -------FSTGFSIN--QFLS-----------T------RGAINLHGS--------------T---IM-ENDNILVD-R- 286 (479) Q Consensus 253 -------~~~G~~V~--~~~s-----------s------~g~I~L~~s--------------~---v~-~a~~~lve-r- 286 (479) .-.|.+|. .... + ++.|.+.-+ + +. +...+..+ | T Consensus 209 ~~~~~~~tl~G~Pv~~s~~~~~g~~~~~~gDfs~~~i~~~~~i~i~~~~e~~~~~~~~~~~~~~~lf~~d~v~~ra~~r~ 288 (397) T protein:vir:23 209 TTPFREGRILGRPTILSDHVAEGDVVGYAGDFSQIIWGQVGGLSFDVTDQATLNLGSQESPNFVSLWQHNLVAVRVEAEY 288 (397) T ss_pred cccccCceeeeeeEEEeCCCCCCceEEEEeecceEEEEEEeceEEEEeeeeeeeeccccccceeeeeeccceeEEEEeee Confidence 01122211 0000 0 111111100 0 00 00011110 0 Q ss_pred ---ccCCCC------CcccceEEEeecccccCcccccccceeeEEEEEEEcCCCCcccccceeee--------------- Q lcl|NC_018863. 287 ---IPEPNA------PQAPASVVATVKVNDKGAFRPVKDIKTHSYKVVVHSDDAESLASEAVTAV--------------- 342 (479) Q Consensus 287 ---~~s~~a------P~~P~~vta~~~~~~~g~~~~~sd~g~Y~YkV~a~n~~GES~~S~~VtaT--------------- 342 (479) +..+++ .......+.+.++.. ..+|+++.- |++...-+-.|| T Consensus 289 d~~v~~~~a~~~~~~~~~~~~~~~~~~~~~-----------~~~~~~~~~---~~~~~~~~~~a~~~~~~~~~~~~~~~~ 354 (397) T protein:vir:23 289 GLLINDVNAFVKLTFDPVLTTYALDLDGAS-----------AGNFTLSLD---GKTSANIAYNASTATVKSAIVAIDDGV 354 (397) T ss_pred ccceecccceEEEeeccccceeeecccccC-----------cceEEEEec---CccccCcccccchhhhHHHhhhccccc Confidence 111111 000111111111111 123333321 222222211122 Q ss_pred ------eecCCCeEEEEEeecCCccccceEEEEEeccCCCCcEEEEEEeeeeeccCC Q lcl|NC_018863. 343 ------VANPTDSVSLAVKLQSLYQAKPQFISVYRQGNETGHYFLVARVPLSKADEN 393 (479) Q Consensus 343 ------~a~~~~~V~LtIt~~~~~~~~~~y~~IYR~t~~~g~~~~i~rV~~s~~n~~ 393 (479) ++..++.+ +|+..+..... .+.=.+. ..+++.-++.| T Consensus 355 ~~~~~~~~~~~~~~--~~~~~~~~~~~----~~~~~~~--------~~~~~~~~~~~ 397 (397) T protein:vir:23 355 SADDVTVTGSAGDY--TITVPGTLTAD----FSGLTDG--------EGASISVVSVG 397 (397) T ss_pred ccceeeeecCCcee--EEEeccccccC----ccccccC--------ccccceeeecC Confidence 22212222 33333221111 0100111 11122211112 No 83 >protein:vir:81227 Length: 413 # NCBI annotation: gp6, major capsid protein # Family: family:all:585 # MgeID: mge:1893 # MgeName: BFK20 # Cross-refs: genbank:acc:YP_001456736;genbank:gi:157168379;hssp:P49861;interpro:IPR006444;uniprot:Q9MBJ9;genbank:GeneID:5580350 Probab=92.55 E-value=0.0082 Score=32.01 Aligned_cols=313 Identities=12% Similarity=0.039 Sum_probs=135.5 Q ss_pred Ccccccccce--------eeeecC----chhHHHHHHHH-------------HHHhhcCcccCcccccCccccchhhhHH Q lcl|NC_018863. 1 MTELQKEQKV--------EARKLP----AGAEAELAELV-------------SKSFTTGTGITPDTQHDAAALRRELLDD 55 (479) Q Consensus 1 ~~~~~~~~~~--------~~~~~~----~~~~~~~~e~~-------------~Ksf~ag~~~~~~~~~~gaAlr~esld~ 55 (479) +.+..+++.- ...... ....++..... .+++.. .....-+..+++.+-.+.+.+ T Consensus 58 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~vp~~~~~ 136 (413) T protein:vir:81 58 SVDSEKSGELTRKGEGYKSIGEFFAKRAGDQIKQQAGGAQLNYSVGEYVAPRVKAASD-PASTATLTDEFQGGYGTTWNR 136 (413) T ss_pred HHhHHHhhhHhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHhhhhhhhhhhhHHHhhhh-hhhhcccccccccccchhhHH Confidence 1010000000 000000 00000000000 011110 011112235677777888888 Q ss_pred HHHHHhhccccccchhhhccchhHHHHHHhhhhhccC-cccccccccccccccccC-cceEEEEEEEEeeeehhhhhhhH Q lcl|NC_018863. 56 QVKMLAFTNGDFTIYPLINKQQVNSTVAKYAVFNQHG-RTGHSRFVREVGVASIND-PNIRQKTVQMKFLSDTKQQSLAA 133 (479) Q Consensus 56 ~i~~l~~~~~~f~~~~~i~k~~~~stv~~y~~~~~~G-~~g~~~fv~E~g~~~~~d-~~~~r~~~~~k~l~~~~~vs~~~ 133 (479) +|..+...... +.+.+...+..+.--+|.+..... ..+...+++|++....++ +.+.+....++=++.-..+|..+ T Consensus 137 ~ii~~~~~~~~--l~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~f~~i~~~~~k~~~~~~iS~el 214 (413) T protein:vir:81 137 NIIYRRREKLV--VADLMDNLTMTNTTIKYLMEKANRVVEGGFKTVAEGGKKPYMRFADFDIVTESLSKIAGLTKITDEM 214 (413) T ss_pred HHHHHHhhhhh--HHhhcceeeccCCceeEEEeccccccccccceecCcccccccCcccceeeEeeeeeEEEeehhhHHH Confidence 87665554442 344444445554433443333221 234577999998876665 78999999998888777888753 Q ss_pred hhhcchhhHHHHHHHHHHHHHHHHHHHHHhhcccccCCCCCCcccchhhhHHHhhccCCcEEEccCCCCCHHHhhhhhhe Q lcl|NC_018863. 134 GLVNNIADPMTILTEDAISVIAKSIEWAIFYGDAALAAEADNQAGIEFDGLTKLIDEATNVIDLKGERLDEATLNKAAVI 213 (479) Q Consensus 134 ~lv~~~~dp~~~~~~~ai~~~~~~~e~a~f~Gd~~l~~~~~~~~gleFDGl~~~I~~~~NviDarG~~l~~~~l~~aa~~ 213 (479) +.++ .+.+....+.-...+++.+|.++++|+-. |-.+.||.+.-.. +..-..+..-..+.|.++... T Consensus 215 -l~ds-~~l~~~i~~~la~~~~~~~d~~~l~G~G~---------~~~~~Gi~~~~~~--~~~~~~~~~~~~~~i~~~~~~ 281 (413) T protein:vir:81 215 -IEDY-DFLVSYINARLLEELAIEEERQLLLGDGT---------GNNLTGLLKRDGI--QTLAVSNKDELADSIYKAMTN 281 (413) T ss_pred -HHHH-HHHHHHHHHHHHHHHHHHHHHHHhccCCC---------CCccccccccccc--ccccccccchhHHHHHHHHHH Confidence 3333 44667777777888999999999999732 1235677665431 222222222224455555443 Q ss_pred eecc-cCceeeeecChHHHhhHHHhhcCceeEEeec-C-C--Cc-------cccCccccceecCceeEEecCCcccCCC- Q lcl|NC_018863. 214 VGKG-YGRATDAFMPIGVQADFTNNLLDRQRVIQPS-Q-A--GG-------FSTGFSINQFLSTRGAINLHGSTIMEND- 280 (479) Q Consensus 214 i~~~-fG~atd~~mp~~vka~f~q~~~~~qrv~~~~-n-~--g~-------~~~G~~V~~~~ss~g~I~L~~s~v~~a~- 280 (479) +... ...++-++|++.+.+.+...-...-|.+... . . ++ .-.|.+|. .+.. +. .+..+.++. T Consensus 282 ~~~~~~~~~~~~vmn~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~~~~~~~~l~G~pv~--~s~~--~~-~~~~~~gd~~ 356 (413) T protein:vir:81 282 ISLATPFQADALVINPLDYQELRLAKDANGQYYGGGVFQGQYGSGGIMLDPAPWGLRTV--QSQV--VP-VGKPVVGAFR 356 (413) T ss_pred hhhhccCCCcEEEEcHHHHHHHHHhhccCCceeccccccccccccccccCceecceeeE--EcCC--CC-cccEEEEecc Confidence 3333 3355668999999888876554333333211 1 1 11 11233331 1110 00 011111111 Q ss_pred -ccccCcccCCCCCcccceEEEeecccccCcccccc--cceeeEEEEEEEcCCCCcccccceeeeee Q lcl|NC_018863. 281 -NILVDRIPEPNAPQAPASVVATVKVNDKGAFRPVK--DIKTHSYKVVVHSDDAESLASEAVTAVVA 344 (479) Q Consensus 281 -~~lver~~s~~aP~~P~~vta~~~~~~~g~~~~~s--d~g~Y~YkV~a~n~~GES~~S~~VtaT~a 344 (479) .++... ..+. .+-.. .....-|-.+. -.....|-+.+.+..+--..+-. +++++ T Consensus 357 ~~~~~~~---~~~~----~v~~~--~~~~~~~~~~~~~~r~~~r~d~~~~~~~a~~~l~~~-~~~~p 413 (413) T protein:vir:81 357 SAASVLR---KGGV----RIDST--NTNVDDFENNLITVRAEERVGLMVTFPEAIVQLDVA-EVVTP 413 (413) T ss_pred cEEEEEE---ecce----EEEEe--ccccchhhcCcEEEEEEEeeccEEecccceEEEEec-CCCCC Confidence 111100 0010 01000 00000010110 00112334444444444333322 11111 No 84 >protein:vir:94673 Length: 419 # NCBI annotation: major capsid protein # Family: family:all:585 # MgeID: mge:1527 # MgeName: mu1/6 # Cross-refs: genbank:acc:YP_579208;genbank:gi:93007444;genbank:GeneID:5076792 Probab=91.96 E-value=0.013 Score=30.84 Aligned_cols=316 Identities=9% Similarity=-0.008 Sum_probs=132.5 Q ss_pred Cccccccccee----eeecCchhHHHHHHHH--------HHHh----hcC-cccCcccccCccccchhhhHHHHHHHhhc Q lcl|NC_018863. 1 MTELQKEQKVE----ARKLPAGAEAELAELV--------SKSF----TTG-TGITPDTQHDAAALRRELLDDQVKMLAFT 63 (479) Q Consensus 1 ~~~~~~~~~~~----~~~~~~~~~~~~~e~~--------~Ksf----~ag-~~~~~~~~~~gaAlr~esld~~i~~l~~~ 63 (479) .+......... ................ .+.+ ..+ ......+..++..+..+.+...+..+... T Consensus 71 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~i~~~~~~ 150 (419) T protein:vir:94 71 TPLTPAEAGTFRSLAQRFADSDGLREYRARDKRGQFQVEMRDIDPNRLLSRDAPAGTITNPNVPHLPQLVPGIVPTTPDL 150 (419) T ss_pred ccccccccccccchhhhhhhHHHHHHHHHhhhhhhhhHHHHHHHHHHhhccccccccccCCcccccchhhhHHHHHHHhh Confidence 11000000000 0000000000000000 0000 011 11111223455566666666666554433 Q ss_pred cccccchhhhccchhHHHHHHhhhhhc-----cCcccccccccccccccccCcceEEEEEEEEeeeehhhhhhhHhhhcc Q lcl|NC_018863. 64 NGDFTIYPLINKQQVNSTVAKYAVFNQ-----HGRTGHSRFVREVGVASINDPNIRQKTVQMKFLSDTKQQSLAAGLVNN 138 (479) Q Consensus 64 ~~~f~~~~~i~k~~~~stv~~y~~~~~-----~G~~g~~~fv~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lv~~ 138 (479) .-. +.+.+...+..+-...|.+... .++.+...+++|++..+.+++.+.+.+..++-++.--.+|..+ .+. T Consensus 151 ~~~--i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~i~~~~~k~~~~~~is~el--l~d 226 (419) T protein:vir:94 151 PLL--VADLLDQQNADYNVLEYIRDTSGTAGAGSTWNKAAVVPEGTAKPQSTLSFDTITTTLKTVAHWLPITRQA--ADD 226 (419) T ss_pred hhh--hhhcceeeeccCCceeeeeeccccccccccCcccceecCCccccccccceeeEEeeeeeEEEeehhhHHH--HHh Confidence 321 1111222222222233333222 2223346799999999999999999999999999888888753 344 Q ss_pred hhhHHHHHHHHHHHHHHHHHHHHHhhcccccCCCCCCcccchhhhHHHhhccCCcEEEcc-C-----CCCCHHHhhhhhh Q lcl|NC_018863. 139 IADPMTILTEDAISVIAKSIEWAIFYGDAALAAEADNQAGIEFDGLTKLIDEATNVIDLK-G-----ERLDEATLNKAAV 212 (479) Q Consensus 139 ~~dp~~~~~~~ai~~~~~~~e~a~f~Gd~~l~~~~~~~~gleFDGl~~~I~~~~NviDar-G-----~~l~~~~l~~aa~ 212 (479) ..+.+....+.--..++..++.++++||-+=. .-|+.+.-. -+.+... + .....+.|.++-. T Consensus 227 ~~~l~~~i~~~la~a~~~~~d~aii~G~G~~~----------p~Gi~~~~~--~~~~~~~~~~~~~t~~~~~~~l~~~~~ 294 (419) T protein:vir:94 227 NSQLMGYIQGRLTYGLRFLRDRQLLNGNGSTE----------MQGILTTPG--IGTYQQPKPTAPATDEPPLVDIRRAKT 294 (419) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhccCccc----------ccceecccc--cccccccccccccccchhHHHHHHHHH Confidence 45778888888999999999999999987622 236655321 1111111 1 1112444555555 Q ss_pred eeecccCceeeeecChHHHhhHHHhhcC-ceeEEeecCCCccc----cCccccceecCceeEEecCCcccCCCccccCcc Q lcl|NC_018863. 213 IVGKGYGRATDAFMPIGVQADFTNNLLD-RQRVIQPSQAGGFS----TGFSINQFLSTRGAINLHGSTIMENDNILVDRI 287 (479) Q Consensus 213 ~i~~~fG~atd~~mp~~vka~f~q~~~~-~qrv~~~~n~g~~~----~G~~V~~~~ss~g~I~L~~s~v~~a~~~lver~ 287 (479) .+...+..++-.+|++.+...+...... ..+.+.+.+..+.. .|.+|- .+.. +. .+..+.++..... .. T Consensus 295 ~~~~~~~~~~~~v~n~~~~~~l~~~k~~~~~~~~~~~~~~~~~~~~l~G~pV~--~~~~--~~-~~~~~~gd~~~~~-~~ 368 (419) T protein:vir:94 295 VAEIAGFPPDGVVVHPQDWESIELDQAPGSGVFRVIANVQGEATPRIWGLNVV--STVA--IA-QGTALVGGFRQGA-TL 368 (419) T ss_pred hhhhccCCCCEEEEcHHHHHHHHHHhhcCCCceeecCCcccCCCccccceeeE--EcCC--CC-CccEEEeeccceE-EE Confidence 5556666777899999999888776654 33333333322211 111110 0000 00 0000000000000 00 Q ss_pred cCCCCCcccceEEEeecccccCcccccccceeeEEEEEEEcCCCCcccccceeeeeecCCCeEEEEEeecCC Q lcl|NC_018863. 288 PEPNAPQAPASVVATVKVNDKGAFRPVKDIKTHSYKVVVHSDDAESLASEAVTAVVANPTDSVSLAVKLQSL 359 (479) Q Consensus 288 ~s~~aP~~P~~vta~~~~~~~g~~~~~sd~g~Y~YkV~a~n~~GES~~S~~VtaT~a~~~~~V~LtIt~~~~ 359 (479) ..-.+. ++ .... ..+..|. . +...|++...-+.+--.+ ..-+.|+++.+.+ T Consensus 369 ~~~~~~----~v--~~~~-~~~~~~~-~--~~~~~r~~~r~d~~v~~~-----------~a~~~~~~~aa~~ 419 (419) T protein:vir:94 369 WSRQGI----TV--LMTD-SHADFFT-A--NTLVILAEFRANLAVYQP-----------KAFVRVTFAAATT 419 (419) T ss_pred EEecce----EE--EEec-cccchhh-c--CcEEEEEEEeeccEEecc-----------ccEEEEEeccCCC Confidence 000000 00 0000 0000000 0 111222221111111111 1122233333333 No 85 >protein:vir:4197 Length: 314 # NCBI annotation: putative structural protein # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:88 # MgeName: psiM100 # Cross-refs: genbank:acc:NP_071822;genbank:gi:11863105;genbank:GeneID:1257607 Probab=91.74 E-value=0.014 Score=30.67 Aligned_cols=294 Identities=14% Similarity=0.108 Sum_probs=130.6 Q ss_pred HHHHHHHhhcCcccCcccccCccccchhhhHHHHHHHhhccccccchhhhccch-hHHHHHHhhhhhccCccc-c-cccc Q lcl|NC_018863. 24 AELVSKSFTTGTGITPDTQHDAAALRRELLDDQVKMLAFTNGDFTIYPLINKQQ-VNSTVAKYAVFNQHGRTG-H-SRFV 100 (479) Q Consensus 24 ~e~~~Ksf~ag~~~~~~~~~~gaAlr~esld~~i~~l~~~~~~f~~~~~i~k~~-~~stv~~y~~~~~~G~~g-~-~~fv 100 (479) .|.+.|.|+.--.++- +-.+|+-|..|-+++-|..|. .... |++.+...+ ..|.-.+...+ .+|+.- . ..-. T Consensus 1 ~~~~~~~~~~~k~it~-~d~~gG~L~P~~~~~~i~~l~-e~s~--i~~~a~vi~t~~s~~~~i~~i-~~g~~~~~~~~~~ 75 (314) T protein:vir:41 1 MDFLNKPFQITPKIDV-PDLGKGILAVQRFGEFVREVR-ENSA--IIKDARVLNALKSYEVDISRI-SLGVELEPGRNTS 75 (314) T ss_pred CchhhhHHHhhccccc-ccCCCceeChHHHHHHHHHHH-hccc--hhhheeeecccCccceeeccc-ccCcccccccccc Confidence 2334444443322332 233577899999986554443 3222 333332211 12222222222 333321 1 1122 Q ss_pred cccccccccCcceEEEEEEEEeeeehhhhhhhHhhhcch--hhHHHHHHHHHHHHHHHHHHHHHhhcccccCCCCCCccc Q lcl|NC_018863. 101 REVGVASINDPNIRQKTVQMKFLSDTKQQSLAAGLVNNI--ADPMTILTEDAISVIAKSIEWAIFYGDAALAAEADNQAG 178 (479) Q Consensus 101 ~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lv~~~--~dp~~~~~~~ai~~~~~~~e~a~f~Gd~~l~~~~~~~~g 178 (479) +|......+|+++.+....+|=|+.--.+|.- -|.++. .|-+....+.=..++....|.+.|-||.+..++... .. T Consensus 76 ~~~~~~~~~~~tf~~~~l~~~kl~~~v~is~e-~L~D~a~~~~le~~i~~~~Ae~~g~~~~~~~~nGdg~~~s~~~~-~~ 153 (314) T protein:vir:41 76 GTKVAPTADEVTVSTNTLEMKELVTKVVLEDE-ALEDNIEQSAFEQTITSLLASGVTYDLECFFLHADSSLTTGREL-YR 153 (314) T ss_pred cCCccCCcccccccceeeeeEEEEEeecccHH-HHHhhhchhhHHHHHHHHHHHHHHHHHHHHhhccccCCcCcccc-hh Confidence 45555567889998888888877765444432 234554 377777777788899999999999999875432110 01 Q ss_pred chhhhHHHhhccCCcEEEccC--CCCCHHHhhhhhheeeccc-Ccee--eeecChHHHhhHHHhhcCceeEEeecCCCcc Q lcl|NC_018863. 179 IEFDGLTKLIDEATNVIDLKG--ERLDEATLNKAAVIVGKGY-GRAT--DAFMPIGVQADFTNNLLDRQRVIQPSQAGGF 253 (479) Q Consensus 179 leFDGl~~~I~~~~NviDarG--~~l~~~~l~~aa~~i~~~f-G~at--d~~mp~~vka~f~q~~~~~qrv~~~~n~g~~ 253 (479) +.||+.+... ..+.+..+ ...+.+.+..+-..+...| -..+ ..+|+..+...+-..+.++.+-+-... . T Consensus 154 -~p~G~l~~a~--~~~~~~~~~~~~~~~~~~~~l~~sl~~~yr~~~~~~~~~m~~~t~~~~r~~l~~~~~~l~~~~---~ 227 (314) T protein:vir:41 154 -INDGWMKLAG--NQYTDAEPEDENWPLNLFDGMMDELDTRYLQLKPRMKFYVSNEIYNGYRKQLLVRETGLGDSA---L 227 (314) T ss_pred -cchhhhhhcc--cceeecCccccccHHHHHHHHHHhcCchhhcCCCceEEEecHHHHHHHHHHHhccCCcccchh---h Confidence 6899988764 34666544 4456677766554444444 2222 477999998888877766544331110 0 Q ss_pred ccCccccceecCceeEEecCCcccCCCccccCcccCCCCCcccceEEEeecccccCcccccccceeeEE----EEEEE-c Q lcl|NC_018863. 254 STGFSINQFLSTRGAINLHGSTIMENDNILVDRIPEPNAPQAPASVVATVKVNDKGAFRPVKDIKTHSY----KVVVH-S 328 (479) Q Consensus 254 ~~G~~V~~~~ss~g~I~L~~s~v~~a~~~lver~~s~~aP~~P~~vta~~~~~~~g~~~~~sd~g~Y~Y----kV~a~-n 328 (479) . ..++..+.|= +.+.++-.....+|. .+..- |.| ++|-| -|... . T Consensus 228 ~----------~~~~~~l~G~-----PV~~~~~~~~~~~~~---~~i~f------gd~------~nlv~~~~~~ir~~~~ 277 (314) T protein:vir:41 228 I----------GATGLQYDGI-----PIQYVPALDALGDDK---ARALL------TVP------TNLVYGFWRNIRIEPK 277 (314) T ss_pred h----------CCCCceecce-----eeEecccccccCCCC---ceEEE------ech------hheEEEeeceeEEeec Confidence 0 0011111111 111111111111111 00000 000 11111 11100 0 Q ss_pred CCCCcccccc-----eeeeeecCCCeEEEEEeecCCc Q lcl|NC_018863. 329 DDAESLASEA-----VTAVVANPTDSVSLAVKLQSLY 360 (479) Q Consensus 329 ~~GES~~S~~-----VtaT~a~~~~~V~LtIt~~~~~ 360 (479) ++.+..--.. +.....-....+...|..+..+ T Consensus 278 ~~a~~~~~~~~~~~r~d~~~~~~~aa~~~~~~~~~~~ 314 (314) T protein:vir:41 278 RDAAMRRTEYIASLRADCNYEDENAAVAAVIDMSSGG 314 (314) T ss_pred ccCcCCeEEEEEEEEeceEEEEcCcEEEEEeeccCCC Confidence 0000000000 0000000011111112222111 No 86 >protein:vir:104256 Length: 458 # NCBI annotation: major head protein precursor # Family: family:all:27070 # MgeID: mge:1504 # MgeName: T5 # Cross-refs: genbank:acc:YP_006977;genbank:gi:46401878;genbank:GeneID:2777673 Probab=91.52 E-value=0.015 Score=30.51 Aligned_cols=308 Identities=15% Similarity=0.136 Sum_probs=136.0 Q ss_pred Ccccccc--------------cceeeeecCchhHHHHHHHHHHHhhc-----Cc------------ccCcccccCccccc Q lcl|NC_018863. 1 MTELQKE--------------QKVEARKLPAGAEAELAELVSKSFTT-----GT------------GITPDTQHDAAALR 49 (479) Q Consensus 1 ~~~~~~~--------------~~~~~~~~~~~~~~~~~e~~~Ksf~a-----g~------------~~~~~~~~~gaAlr 49 (479) +.+..++ .................+.-.|+|.. +. .....+..+|+.+. T Consensus 95 ~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~g~~~i 174 (458) T protein:vir:10 95 IVGLQDEIKSLLTAREGRSFVGDSVAKALYGTQENFEDEVEKLVLLSYVMEKGVFETEHGQRHLKAVNQSSSVEVSSESY 174 (458) T ss_pred HHHHHHHHHHHHHHHHhhhhhhhhhhccchhhhhhHHHHHHHHHHHHHHHhhccchhhhhhhhhhhhhhcccCcccccee Confidence 0110000 00000011111111110111111110 00 00011234577788 Q ss_pred hhhhHHHHHHHhhccccccchhhhccchhHHHHHHhhhhhccCcccccccccccccc------cccCcceEEEEEEEEee Q lcl|NC_018863. 50 RELLDDQVKMLAFTNGDFTIYPLINKQQVNSTVAKYAVFNQHGRTGHSRFVREVGVA------SINDPNIRQKTVQMKFL 123 (479) Q Consensus 50 ~esld~~i~~l~~~~~~f~~~~~i~k~~~~stv~~y~~~~~~G~~g~~~fv~E~g~~------~~~d~~~~r~~~~~k~l 123 (479) .+.+.++|..+..... .+.+.....++.+....|.+. .+.+...+++|++.. +.+++.+.+.....+=+ T Consensus 175 p~~~~~~ii~~~~~~~--~l~~~~~~~~~~~~~~~~~~~---~~~~~a~~v~e~~~~~~~~~~~~~~~~~~~i~~~~~k~ 249 (458) T protein:vir:10 175 ETIFSQRIIRDLQKEL--VVGALFEELPMSSKILTMLVE---PDAGKATWVAASTYGTDTTTGEEVKGALKEIHFSTYKL 249 (458) T ss_pred hhhHhHHHHHHHHhhh--hHHhhcceeecCCcceEEEEe---cCCcceeecccccccccccccccccccceeeEeeeeeE Confidence 8888888876665544 344444444555544444333 333345667766554 35678888888887777 Q ss_pred eehhhhhhhHhhhcchhhHHHHHHHHHHHHHHHHHHHHHhhcccccCCCCCCcccchhhhHHHhhcc-CCc-EEEccC-- Q lcl|NC_018863. 124 SDTKQQSLAAGLVNNIADPMTILTEDAISVIAKSIEWAIFYGDAALAAEADNQAGIEFDGLTKLIDE-ATN-VIDLKG-- 199 (479) Q Consensus 124 ~~~~~vs~~~~lv~~~~dp~~~~~~~ai~~~~~~~e~a~f~Gd~~l~~~~~~~~gleFDGl~~~I~~-~~N-viDarG-- 199 (479) +.--.+|.-+ +.++..|.+....+.....++..++.++|+||-. + +..|+.+.... ..+ +.+.-+ T Consensus 250 ~~~v~is~el-l~ds~~~~~~~i~~~l~~~i~~~~d~~~l~G~G~----~------~p~Gi~~~~~~~~~~~~~~~~~~~ 318 (458) T protein:vir:10 250 AAKSFITDET-EEDAIFSLLPLLRKRLIEAHAVSIEEAFMTGDGS----G------KPKGLLTLASEDSAKVVTEAKADG 318 (458) T ss_pred EeeehhhHHH-HhcchHHHHHHHHHHHHHHHHHHHHHHhhcCCCC----C------ccceeeecccccccceeecccccc Confidence 7766677653 3445667888888999999999999999999853 1 23466665431 112 222222 Q ss_pred -CCCCHHHhhhhhheeecccCceeeeecChHHHhhHHHhhcC-ceeEEeecCCCc-------cccCccccce--ecCcee Q lcl|NC_018863. 200 -ERLDEATLNKAAVIVGKGYGRATDAFMPIGVQADFTNNLLD-RQRVIQPSQAGG-------FSTGFSINQF--LSTRGA 268 (479) Q Consensus 200 -~~l~~~~l~~aa~~i~~~fG~atd~~mp~~vka~f~q~~~~-~qrv~~~~n~g~-------~~~G~~V~~~--~ss~g~ 268 (479) ..++.+.|.++-..+..+|......+||+.+.+.+...-.. ++-+.++..... .-.|.+|-.. +...+ T Consensus 319 ~~~~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~l~~lkd~~G~~i~~~~~~~~~~~~~~~~l~G~pv~~~~~~p~~~- 397 (458) T protein:vir:10 319 SVLVTAKTISKLRRKLGRHGLKLSKLVLIVSMDAYYDLLEDEEWQDVAQVGNDSVKLQGQVGRIYGLPVVVSEYFPAKA- 397 (458) T ss_pred cccccHHHHHHHHHhhhhhhcCCCEEEEcHHHHHHHHhhcccCCceeeccccccccccCcCceecceeeEEcccccccc- Confidence 23455555555555566776777789999998887765432 222322221111 1123333111 10000 Q ss_pred EEecCCcccC--CCcccc-CcccCCCCCcccceEEEeec-ccccCcccccccceeeEEEEEEEcCCCCcccccceeeeee Q lcl|NC_018863. 269 INLHGSTIME--NDNILV-DRIPEPNAPQAPASVVATVK-VNDKGAFRPVKDIKTHSYKVVVHSDDAESLASEAVTAVVA 344 (479) Q Consensus 269 I~L~~s~v~~--a~~~lv-er~~s~~aP~~P~~vta~~~-~~~~g~~~~~sd~g~Y~YkV~a~n~~GES~~S~~VtaT~a 344 (479) -....+.. .+++++ +|. . ..+....- ....-.|+.-..+|-.-| -|+..|..|.+ T Consensus 398 --~~~~~~~~~f~~~~~~~~~~-~-------~~v~~d~~~~~~~~~~~~~~r~~~~v~-----------~~~a~v~~~~a 456 (458) T protein:vir:10 398 --NSAEFAVIVYKDNFVMPRQR-A-------VTVERERQAGKQRDAYYVTQRVNLQRY-----------FANGVVSGTYA 456 (458) T ss_pred --CCcceEEEEecccEEEEEee-c-------eEEEeecccCCCceEEEEEEEecceEe-----------cccceEEEeec Confidence 00000110 011111 110 0 00100000 000001111111111111 22333444444 Q ss_pred cC Q lcl|NC_018863. 345 NP 346 (479) Q Consensus 345 ~~ 346 (479) .. T Consensus 457 a~ 458 (458) T protein:vir:10 457 AS 458 (458) T ss_pred cC Confidence 44 No 87 >protein:vir:1328 Length: 392 # NCBI annotation: gp36 # Family: family:all:21 # MgeID: mge:28 # MgeName: phi-C31 # Cross-refs: genbank:acc:NP_047927;swissprot:trembl:q9zwv6;genbank:gi:9631145;uniprot:Q9ZWV6;genbank:GeneID:2715889 Probab=91.26 E-value=0.017 Score=30.33 Aligned_cols=311 Identities=13% Similarity=0.049 Sum_probs=133.8 Q ss_pred Ccc---ccc-----ccceeeeecCchhH---HHH-HHHHHHHhhcCc---------ccCcccccCccccchhhhHHHHHH Q lcl|NC_018863. 1 MTE---LQK-----EQKVEARKLPAGAE---AEL-AELVSKSFTTGT---------GITPDTQHDAAALRRELLDDQVKM 59 (479) Q Consensus 1 ~~~---~~~-----~~~~~~~~~~~~~~---~~~-~e~~~Ksf~ag~---------~~~~~~~~~gaAlr~esld~~i~~ 59 (479) +-+ ..+ +.++...+-+.... ... -+.+.++...|. .....+..+|+.+-.+..++.|.. T Consensus 54 i~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~g~~~~~~~~~~~~~~~~~t~~~~g~~~~~~~~~~~i~~ 133 (392) T protein:vir:13 54 IKRGIDAIKATDAVTSLLSGLQGSGSGAQRSADHDDDAVLRAGNLGEARSFEFAPEKRDGTKAGNPNVLSRTLYGQLIAQ 133 (392) T ss_pred HHHHHHHHHHHHHHHHHhcccCCcccchhhhhhHHHHHHHhccchhhhHHHHhhhhhhcccccCCCccccccchHHHHHH Confidence 000 000 00000000000000 000 011122222111 111112233445555555554444 Q ss_pred HhhccccccchhhhccchhHHHHHHhhhhhccCcccccccccccccccccCcceEEEEEEEEeeeehhhhhhhHhhhcch Q lcl|NC_018863. 60 LAFTNGDFTIYPLINKQQVNSTVAKYAVFNQHGRTGHSRFVREVGVASINDPNIRQKTVQMKFLSDTKQQSLAAGLVNNI 139 (479) Q Consensus 60 l~~~~~~f~~~~~i~k~~~~stv~~y~~~~~~G~~g~~~fv~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lv~~~ 139 (479) +.. .+..++.+....-.+.-.+|..... .+.....+++|++..+.+++.+.+....++=++.--.+|.-+ |.++. T Consensus 134 ~~~---~~~~l~~~~~~~~~~~~~~~~~~~~-~~~~~a~~v~E~~~~~~~~~~f~~v~~~~~k~~~~~~iS~el-l~ds~ 208 (392) T protein:vir:13 134 AVE---RSAIMRGGASTFTTSDANPMDFTVI-TGRATAGIVGETAEIPESYPATTQRSMGGFKYGFASVVSYEF-ATDQV 208 (392) T ss_pred HHh---hhhhhhhcceeeecCCCceeEEEEE-cCCcceeeecccccccccccceeeEEeeeeeEEeeehhHHHH-Hhcch Confidence 332 2334444432211122222322222 223456789999999999999999999998888777777653 33555 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHhhcccccCCCCCCcccchhhhHHHhhccCCc-EEEccCCCCCHHHhhhhhheeeccc Q lcl|NC_018863. 140 ADPMTILTEDAISVIAKSIEWAIFYGDAALAAEADNQAGIEFDGLTKLIDEATN-VIDLKGERLDEATLNKAAVIVGKGY 218 (479) Q Consensus 140 ~dp~~~~~~~ai~~~~~~~e~a~f~Gd~~l~~~~~~~~gleFDGl~~~I~~~~N-viDarG~~l~~~~l~~aa~~i~~~f 218 (479) .|.+....+.-...+++.++.++|+||-.-. ..|+.+....... +..+....++-+.|.++-..+..+| T Consensus 209 ~~l~~~i~~~l~~~i~~~~d~~~l~G~Gt~~----------p~Gil~~~~~~~~~~~~~~~~~~~~d~l~~~~~~l~~~~ 278 (392) T protein:vir:13 209 LDLVGFLVSDAGPAIGDAMGRHFLTGTGTGQ----------PRGILTDATGANAAFGEADADSKVSDALIDLFHEVPSAY 278 (392) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhcccCCcc----------ccccccccccccccccccccccccHHHHHHHHHhhhhhh Confidence 6778888888889999999999999985411 2367666542222 2233345555555554433344556 Q ss_pred CceeeeecChHHHhhHHHhhcCceeEE-eecCC-Cc--cccCccccceecCceeEEecCCcccCCCccccCcccCC--CC Q lcl|NC_018863. 219 GRATDAFMPIGVQADFTNNLLDRQRVI-QPSQA-GG--FSTGFSINQFLSTRGAINLHGSTIMENDNILVDRIPEP--NA 292 (479) Q Consensus 219 G~atd~~mp~~vka~f~q~~~~~qrv~-~~~n~-g~--~~~G~~V~~~~ss~g~I~L~~s~v~~a~~~lver~~s~--~a 292 (479) .......|++.+.+.+...-...-|.+ +++.. +. .-.|.+|. .+. .+.++.++. +-++. -+ T Consensus 279 ~~~a~~v~n~~~~~~l~~lkd~~G~~l~~~~~~~g~~~~l~G~Pv~--~~~----------~~~~~~i~~-Gdf~~~~i~ 345 (392) T protein:vir:13 279 RKNAKFVVNDLRAAQMRKLKDANGQYLWQSALTVGAPDTFNGKVVE--TDD----------GMPADKVLF-ADLSKYRVR 345 (392) T ss_pred hcCCEEEEcHHHHHHHHHhhccCCceeecCCcCCCCCceecceeeE--EcC----------CCCCCcEEE-eeccceeEE Confidence 555568899999988887555444444 33311 11 12233321 110 001111110 00000 00 Q ss_pred CcccceEEEeecccccCcccccccceeeEEEEEEEcCCCCcccccceeeeeecCCCeEEEEEeecC Q lcl|NC_018863. 293 PQAPASVVATVKVNDKGAFRPVKDIKTHSYKVVVHSDDAESLASEAVTAVVANPTDSVSLAVKLQS 358 (479) Q Consensus 293 P~~P~~vta~~~~~~~g~~~~~sd~g~Y~YkV~a~n~~GES~~S~~VtaT~a~~~~~V~LtIt~~~ 358 (479) --....+ .... ..++ .. +...|++...=+ | -+....--+.|+++.+. T Consensus 346 ~~~~~~i--~~~~---~~~~-~~--~~~~~r~~~r~d-~----------~~~~~~A~~~~~~~~aa 392 (392) T protein:vir:13 346 FAGSLRV--DRSV---DAKF-ST--DQIVYRFLQRAD-G----------LLVDARGAKVLTVTPAA 392 (392) T ss_pred eecceEE--Eeec---cccc-cC--CcEEEEEEEEec-c----------EEecccceEEEEeeccC Confidence 0000000 0000 0001 01 111222221111 1 11111222334555443 No 88 >protein:vir:9704 Length: 394 # NCBI annotation: hypothetical protein # Family: family:all:21 # MgeID: mge:174 # MgeName: 315.2 # Cross-refs: genbank:acc:NP_795466;genbank:gi:28876225;genbank:GeneID:1257769 Probab=90.58 E-value=0.02 Score=29.88 Aligned_cols=292 Identities=12% Similarity=0.094 Sum_probs=126.0 Q ss_pred Ccc-------c------ccccceeeeec--------------------------CchhHHHHHHHHHHHhhcCcccCccc Q lcl|NC_018863. 1 MTE-------L------QKEQKVEARKL--------------------------PAGAEAELAELVSKSFTTGTGITPDT 41 (479) Q Consensus 1 ~~~-------~------~~~~~~~~~~~--------------------------~~~~~~~~~e~~~Ksf~ag~~~~~~~ 41 (479) +-+ + .++........ ......++.....+.....-.....+ T Consensus 53 i~~l~~~~~~~e~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~t 132 (394) T protein:vir:97 53 LVEAENDLKLYESSVEVGGAENIGGKEVTQEEKTYRESVNDFIRSKGKIVNDSLRFEGKDEVLMPINETTPVEPQKDGIK 132 (394) T ss_pred HHHHHHHHHHHHHHhhhhccccccccccchhhHHHHHHHHHHHHHHHHHhhhhhhhhhHHHHHHHHHhhhhhhhhccccc Confidence 000 0 00000000000 00000011000000000001122234 Q ss_pred ccCccccchhhhHHHHHHHhhccccccchhhhccchhHHHHHHhhhhhccCccccccccccccccc-ccCcceEEEEEEE Q lcl|NC_018863. 42 QHDAAALRRELLDDQVKMLAFTNGDFTIYPLINKQQVNSTVAKYAVFNQHGRTGHSRFVREVGVAS-INDPNIRQKTVQM 120 (479) Q Consensus 42 ~~~gaAlr~esld~~i~~l~~~~~~f~~~~~i~k~~~~stv~~y~~~~~~G~~g~~~fv~E~g~~~-~~d~~~~r~~~~~ 120 (479) ..+|+.|..+.+...|..+......+ .+.+...++.+.-.+|.... .+.+...+++|++... .+++.+...+... T Consensus 133 ~~~gg~liP~~~~~~ii~~~~~~~~l--~~~~~~~~~~~~~~~~~~~~--~~~~~~~~v~E~~~~~~~~~~~~~~v~l~~ 208 (394) T protein:vir:97 133 KENAKPVSSEEILYTPAREVKTVVDL--KPFTTVYQAKKASGKYPVLQ--RATTKMVTVAELEKNPALAKPDFKDVAWNI 208 (394) T ss_pred cccccccChHHHHHHHHHHhhhhhhh--hhhceeeeccCcceEEEEEe--cCCCccceecccccccccccccceeEEeeh Confidence 56688899999988887666554433 33333333333323343322 2223456899998765 6789999999999 Q ss_pred EeeeehhhhhhhHhhhcchhhHHHHHHHHHHHHHHHHHHHHHhhcccccCCCCCCcccchhhhHHHhhccCCcEEEccCC Q lcl|NC_018863. 121 KFLSDTKQQSLAAGLVNNIADPMTILTEDAISVIAKSIEWAIFYGDAALAAEADNQAGIEFDGLTKLIDEATNVIDLKGE 200 (479) Q Consensus 121 k~l~~~~~vs~~~~lv~~~~dp~~~~~~~ai~~~~~~~e~a~f~Gd~~l~~~~~~~~gleFDGl~~~I~~~~NviDarG~ 200 (479) +-++.--.+|.-+ +.++..|.+....+.-...+..+++.++..|.....+.+ ..-+|+++..+.. .+|.. T Consensus 209 ~k~~~~i~is~el-l~ds~~~~~~~i~~~la~~~~~~~~~~i~~g~~~~~~~~----~~~~~~~~~~~~~---~~~~~-- 278 (394) T protein:vir:97 209 DTYRGAIPLSQES-IDDADVDLVGIVSESISQIKVNTTNDAIAKVLKSFTTKT----VKNLDEIKALLNG---GFDPA-- 278 (394) T ss_pred hheeeehhhHHHH-HhhhhHHHHHHHHHHHHHHHHHHHHHHHhhccccccccc----cccHHHHHHHHHh---hhhhh-- Confidence 9888777777643 234455677778888888889999999999987765432 4568888777742 01111 Q ss_pred CCCHHHhhhhhheeecccCceeeeecChHHHhhHHHhhcCceeEEeecCCC----ccccCccccceecC---ceeEEecC Q lcl|NC_018863. 201 RLDEATLNKAAVIVGKGYGRATDAFMPIGVQADFTNNLLDRQRVIQPSQAG----GFSTGFSINQFLST---RGAINLHG 273 (479) Q Consensus 201 ~l~~~~l~~aa~~i~~~fG~atd~~mp~~vka~f~q~~~~~qrv~~~~n~g----~~~~G~~V~~~~ss---~g~I~L~~ 273 (479) + .....|++.+.+.+...-...-|.+...+.. ..-.|.+|.-..+. .+.| T Consensus 279 -----------------~--~a~~v~n~~~~~~l~~lkd~~G~~i~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~---- 335 (394) T protein:vir:97 279 -----------------Y--NVSLIVSQSFYQTLDTLKDGNGRYLLQDDITAVSGKVLLGKPVFVLSDEVLGANKA---- 335 (394) T ss_pred -----------------h--CCEEEEcHHHHHHHHHhhccCCCeeeecCcCCCCCceeccceeEEecccccCCccE---- Confidence 0 1236777777777766543322323221221 12334443211110 0000 Q ss_pred CcccCC--Ccccc-CcccCCCCCcccceEEEeecccccCcccccccceeeEEEEEEEcCCCCcccccceeeeeecC Q lcl|NC_018863. 274 STIMEN--DNILV-DRIPEPNAPQAPASVVATVKVNDKGAFRPVKDIKTHSYKVVVHSDDAESLASEAVTAVVANP 346 (479) Q Consensus 274 s~v~~a--~~~lv-er~~s~~aP~~P~~vta~~~~~~~g~~~~~sd~g~Y~YkV~a~n~~GES~~S~~VtaT~a~~ 346 (479) +.++ ..++. +|.- . .+ ....-......+ .+...+-+.+.+..+--...- +.+++-. T Consensus 336 --~~gd~~~~~~~~~~~~----~----~~-~~~~~~~~~~~~----~~~~r~d~~v~~~~a~~~~~~--~~~~~p~ 394 (394) T protein:vir:97 336 --FIGDFKRGVLFADRKD----L----GL-RWADNEIYGQYL----QAVLRFGVSKVDDKAGYYVTF--TPEPLPL 394 (394) T ss_pred --EEeeccccEEEEEecc----e----EE-EEecccccceeE----EEEEEEccEEecccceEEEEe--cccccCC Confidence 1111 11111 1100 0 00 000000000000 011122222222222221111 1111111 No 89 >protein:vir:3033 Length: 272 # NCBI annotation: major capsid protein # Family: family:all:522 # MgeID: mge:61 # MgeName: PhiNIH1.1 # Cross-refs: genbank:acc:NP_438146;genbank:gi:16271809;genbank:GeneID:929235 Probab=90.41 E-value=0.021 Score=29.78 Aligned_cols=259 Identities=14% Similarity=0.150 Sum_probs=122.3 Q ss_pred hhcCcccCcccccCccccchhhhHHHHHHHhhccccccchhhhccchhHHHHHH----hhhhhccCcccccccccccccc Q lcl|NC_018863. 31 FTTGTGITPDTQHDAAALRRELLDDQVKMLAFTNGDFTIYPLINKQQVNSTVAK----YAVFNQHGRTGHSRFVREVGVA 106 (479) Q Consensus 31 f~ag~~~~~~~~~~gaAlr~esld~~i~~l~~~~~~f~~~~~i~k~~~~stv~~----y~~~~~~G~~g~~~fv~E~g~~ 106 (479) |.. +. -+.+..+.+|.+.+.+....... ..| ..+.. .+.++.. --.+..++..|...++.|++.. T Consensus 1 MA~-~~-----T~~~~~~iPev~s~~v~~~~~~~--~~~-~~~~~--~~~~~~g~~G~tv~iP~~~~~~~a~~v~eg~~i 69 (272) T protein:vir:30 1 MAV-GT-----TKMAQMLDPEVLADMIDAEVGKA--IRF-APLAE--VDTTLEGQPGTTLTVPKWDYIGDAEDVAEGEAI 69 (272) T ss_pred CCC-cc-----ccchheechHHHHHHHHHHHHHH--hhh-hcccc--ccccccCCCCCEEEEEEecCCCCcccccCCCcc Confidence 221 11 13455777777777663322111 111 11111 1111110 0012345566777789999999 Q ss_pred cccCcceEEEEEEEEeeeehhhhhhhHhhhcchhhHHHHHHHHHHHHHHHHHHHHHhhcccccCCCCCCcccchhhhHHH Q lcl|NC_018863. 107 SINDPNIRQKTVQMKFLSDTKQQSLAAGLVNNIADPMTILTEDAISVIAKSIEWAIFYGDAALAAEADNQAGIEFDGLTK 186 (479) Q Consensus 107 ~~~d~~~~r~~~~~k~l~~~~~vs~~~~lv~~~~dp~~~~~~~ai~~~~~~~e~a~f~Gd~~l~~~~~~~~gleFDGl~~ 186 (479) ...+........+++-++....+|..+.. ++..|++....+.....+++.++-.+|= .+. T Consensus 70 ~~~~~~~~~~~~~~~~~~~~~~itd~~~~-~s~~d~~~~~~~~~~~~~a~~~d~~i~~---~~~---------------- 129 (272) T protein:vir:30 70 PMTQLGFKKTTMTIKKAGKGVEITDEAIL-SGYGDPVGQAAKQIVEAIDHKVDADVLD---ALS---------------- 129 (272) T ss_pred cccccccceEEEEeeeeeeeeeecHHHHh-hccccHHHHHHHHHHHHHHHHHHHHHHH---Hhc---------------- Confidence 99999999999999999999999988754 4567999999999999999999977662 111 Q ss_pred hhccCCcEEEccCCCCCHHHhhhhhheeecccCceeeeecChHHHhhHHHhhcCceeEEeecCCCc--cccCccccceec Q lcl|NC_018863. 187 LIDEATNVIDLKGERLDEATLNKAAVIVGKGYGRATDAFMPIGVQADFTNNLLDRQRVIQPSQAGG--FSTGFSINQFLS 264 (479) Q Consensus 187 ~I~~~~NviDarG~~l~~~~l~~aa~~i~~~fG~atd~~mp~~vka~f~q~~~~~qrv~~~~n~g~--~~~G~~V~~~~s 264 (479) ...+.++ ...+.+.|..|...........+-++||+.+.+.+...-+.. ....++.+. +..|. +.. T Consensus 130 ---~a~~~~~---~~~t~d~i~da~~~l~~~~~~~~~~vv~p~~~~~L~k~~~~~--~~~~~~~~~~~~~~g~-ig~--- 197 (272) T protein:vir:30 130 ---KSTQTVE---ATATVDGVSKALDIFNDEDDAETVIVMNPADASTLRLDAAKE--WLGATEVGANRVVSGV-YGE--- 197 (272) T ss_pred ---ccccccc---cccCHHHHHHHHHHHhccCCCccEEEEcHHHHHHHHHhcccc--cccccccccccccccc-chh--- Confidence 1111111 122445556555555666666778999999988875443221 111111111 01110 111 Q ss_pred CceeEEecCCcccCC------CccccCc-ccCCCCCcccceEEEeecccccCcccccccceeeEEEEEEEcCCCCccccc Q lcl|NC_018863. 265 TRGAINLHGSTIMEN------DNILVDR-IPEPNAPQAPASVVATVKVNDKGAFRPVKDIKTHSYKVVVHSDDAESLASE 337 (479) Q Consensus 265 s~g~I~L~~s~v~~a------~~~lver-~~s~~aP~~P~~vta~~~~~~~g~~~~~sd~g~Y~YkV~a~n~~GES~~S~ 337 (479) +.|-+|... +.++... ...- .-..++.+ +. .-.-.++...=.+.+.|-+... T Consensus 198 ------i~G~~Vi~s~~~p~~t~~~~~~~a~~~-~~~~~~~v--e~--~r~~~~~~~~i~~~~~~~~~v~---------- 256 (272) T protein:vir:30 198 ------VLGVQIVRSRKCPKGTAYMVRKGALRI-MLKRNTMV--ET--DRDITKAINQIVANKHYGVYLY---------- 256 (272) T ss_pred ------hcCeeEEEcCCCCcceEEEEcCCeEEE-EecCCcee--ee--ccccccceeEEEEEEEEEEEEE---------- Confidence 112111111 1111110 0000 00000000 00 0000000000011233333333 Q ss_pred ceeeeeecCCCeEEEEEeecCCc Q lcl|NC_018863. 338 AVTAVVANPTDSVSLAVKLQSLY 360 (479) Q Consensus 338 ~VtaT~a~~~~~V~LtIt~~~~~ 360 (479) ...+.|.+|+.+++-. T Consensus 257 -------~~~~vv~~t~~~a~~~ 272 (272) T protein:vir:30 257 -------KAEKAVKITLKDAAKK 272 (272) T ss_pred -------cCCceEEEEecccccC Confidence 2333444444433222 No 90 >protein:vir:9820 Length: 272 # NCBI annotation: putative major capsid/head protein # Family: family:all:522 # MgeID: mge:176 # MgeName: 315.4 # Cross-refs: genbank:acc:NP_795582;genbank:gi:28876339;genbank:GeneID:1257858 Probab=90.41 E-value=0.021 Score=29.78 Aligned_cols=259 Identities=14% Similarity=0.150 Sum_probs=122.3 Q ss_pred hhcCcccCcccccCccccchhhhHHHHHHHhhccccccchhhhccchhHHHHHH----hhhhhccCcccccccccccccc Q lcl|NC_018863. 31 FTTGTGITPDTQHDAAALRRELLDDQVKMLAFTNGDFTIYPLINKQQVNSTVAK----YAVFNQHGRTGHSRFVREVGVA 106 (479) Q Consensus 31 f~ag~~~~~~~~~~gaAlr~esld~~i~~l~~~~~~f~~~~~i~k~~~~stv~~----y~~~~~~G~~g~~~fv~E~g~~ 106 (479) |.. +. -+.+..+.+|.+.+.+....... ..| ..+.. .+.++.. --.+..++..|...++.|++.. T Consensus 1 MA~-~~-----T~~~~~~iPev~s~~v~~~~~~~--~~~-~~~~~--~~~~~~g~~G~tv~iP~~~~~~~a~~v~eg~~i 69 (272) T protein:vir:98 1 MAV-GT-----TKMAQMLDPEVLADMIDAEVGKA--IRF-APLAE--VDTTLEGQPGTTLTVPKWDYIGDAEDVAEGEAI 69 (272) T ss_pred CCC-cc-----ccchheechHHHHHHHHHHHHHH--hhh-hcccc--ccccccCCCCCEEEEEEecCCCCcccccCCCcc Confidence 221 11 13455777777777663322111 111 11111 1111110 0012345566777789999999 Q ss_pred cccCcceEEEEEEEEeeeehhhhhhhHhhhcchhhHHHHHHHHHHHHHHHHHHHHHhhcccccCCCCCCcccchhhhHHH Q lcl|NC_018863. 107 SINDPNIRQKTVQMKFLSDTKQQSLAAGLVNNIADPMTILTEDAISVIAKSIEWAIFYGDAALAAEADNQAGIEFDGLTK 186 (479) Q Consensus 107 ~~~d~~~~r~~~~~k~l~~~~~vs~~~~lv~~~~dp~~~~~~~ai~~~~~~~e~a~f~Gd~~l~~~~~~~~gleFDGl~~ 186 (479) ...+........+++-++....+|..+.. ++..|++....+.....+++.++-.+|= .+. T Consensus 70 ~~~~~~~~~~~~~~~~~~~~~~itd~~~~-~s~~d~~~~~~~~~~~~~a~~~d~~i~~---~~~---------------- 129 (272) T protein:vir:98 70 PMTQLGFKKTTMTIKKAGKGVEITDEAIL-SGYGDPVGQAAKQIVEAIDHKVDADVLD---ALS---------------- 129 (272) T ss_pred cccccccceEEEEeeeeeeeeeecHHHHh-hccccHHHHHHHHHHHHHHHHHHHHHHH---Hhc---------------- Confidence 99999999999999999999999988754 4567999999999999999999977662 111 Q ss_pred hhccCCcEEEccCCCCCHHHhhhhhheeecccCceeeeecChHHHhhHHHhhcCceeEEeecCCCc--cccCccccceec Q lcl|NC_018863. 187 LIDEATNVIDLKGERLDEATLNKAAVIVGKGYGRATDAFMPIGVQADFTNNLLDRQRVIQPSQAGG--FSTGFSINQFLS 264 (479) Q Consensus 187 ~I~~~~NviDarG~~l~~~~l~~aa~~i~~~fG~atd~~mp~~vka~f~q~~~~~qrv~~~~n~g~--~~~G~~V~~~~s 264 (479) ...+.++ ...+.+.|..|...........+-++||+.+.+.+...-+.. ....++.+. +..|. +.. T Consensus 130 ---~a~~~~~---~~~t~d~i~da~~~l~~~~~~~~~~vv~p~~~~~L~k~~~~~--~~~~~~~~~~~~~~g~-ig~--- 197 (272) T protein:vir:98 130 ---KSTQTVE---ATATVDGVSKALDIFNDEDDAETVIVMNPADASTLRLDAAKE--WLGATEVGANRVVSGV-YGE--- 197 (272) T ss_pred ---ccccccc---cccCHHHHHHHHHHHhccCCCccEEEEcHHHHHHHHHhcccc--cccccccccccccccc-chh--- Confidence 1111111 122445556555555666666778999999988875443221 111111111 01110 111 Q ss_pred CceeEEecCCcccCC------CccccCc-ccCCCCCcccceEEEeecccccCcccccccceeeEEEEEEEcCCCCccccc Q lcl|NC_018863. 265 TRGAINLHGSTIMEN------DNILVDR-IPEPNAPQAPASVVATVKVNDKGAFRPVKDIKTHSYKVVVHSDDAESLASE 337 (479) Q Consensus 265 s~g~I~L~~s~v~~a------~~~lver-~~s~~aP~~P~~vta~~~~~~~g~~~~~sd~g~Y~YkV~a~n~~GES~~S~ 337 (479) +.|-+|... +.++... ...- .-..++.+ +. .-.-.++...=.+.+.|-+... T Consensus 198 ------i~G~~Vi~s~~~p~~t~~~~~~~a~~~-~~~~~~~v--e~--~r~~~~~~~~i~~~~~~~~~v~---------- 256 (272) T protein:vir:98 198 ------VLGVQIVRSRKCPKGTAYMVRKGALRI-MLKRNTMV--ET--DRDITKAINQIVANKHYGVYLY---------- 256 (272) T ss_pred ------hcCeeEEEcCCCCcceEEEEcCCeEEE-EecCCcee--ee--ccccccceeEEEEEEEEEEEEE---------- Confidence 112111111 1111110 0000 00000000 00 0000000000011233333333 Q ss_pred ceeeeeecCCCeEEEEEeecCCc Q lcl|NC_018863. 338 AVTAVVANPTDSVSLAVKLQSLY 360 (479) Q Consensus 338 ~VtaT~a~~~~~V~LtIt~~~~~ 360 (479) ...+.|.+|+.+++-. T Consensus 257 -------~~~~vv~~t~~~a~~~ 272 (272) T protein:vir:98 257 -------KAEKAVKITLKDAAKK 272 (272) T ss_pred -------cCCceEEEEecccccC Confidence 2333444444433222 No 91 >protein:vir:96762 Length: 632 # NCBI annotation: putative phage-related protein # Family: family:all:21 # MgeID: mge:1628 # MgeName: VP882 # Cross-refs: genbank:acc:YP_001039818;genbank:gi:126010917;genbank:GeneID:5076272 Probab=90.01 E-value=0.023 Score=29.55 Aligned_cols=300 Identities=12% Similarity=0.090 Sum_probs=130.3 Q ss_pred Ccccccccce----------ee--------------eecCchhHHHHH---------------HHHHHHhhcCcccCccc Q lcl|NC_018863. 1 MTELQKEQKV----------EA--------------RKLPAGAEAELA---------------ELVSKSFTTGTGITPDT 41 (479) Q Consensus 1 ~~~~~~~~~~----------~~--------------~~~~~~~~~~~~---------------e~~~Ksf~ag~~~~~~~ 41 (479) ++.+.....+ .. .........++. +...+++.++. T Consensus 288 ~~~~~~~~~i~~~~re~~~~~l~rai~a~a~~~~~~a~~~~e~a~~~a~~~G~~arg~~~~~~~l~~ra~~~~t------ 361 (632) T protein:vir:96 288 KPAIHSARDLGIQHKELQQYSLMRAINAAATGDWSKAGFEREVSLAIADASGKEARGFYMPHEVLVQRQLEKKT------ 361 (632) T ss_pred hhhhhhhhhhhhhHHHHHHHHHHHHHHhhhccchhhhhhhhHHHHHHHHhhhhhhhhhhhhHHHHHHhhhhccc------ Confidence 1111110000 00 000000000100 01233444432 Q ss_pred ccCccccch-hhhHHHHHHHhhccccccchhhhccchhHHHHHHhhhhhccCcccccccccccccccccCcceEEEEEEE Q lcl|NC_018863. 42 QHDAAALRR-ELLDDQVKMLAFTNGDFTIYPLINKQQVNSTVAKYAVFNQHGRTGHSRFVREVGVASINDPNIRQKTVQM 120 (479) Q Consensus 42 ~~~gaAlr~-esld~~i~~l~~~~~~f~~~~~i~k~~~~stv~~y~~~~~~G~~g~~~fv~E~g~~~~~d~~~~r~~~~~ 120 (479) ..+|+.|-. |.+..++..+.... ..+..+.-+.+......+ .+..+.+.+...+++|++..+.+++.+.+.+... T Consensus 362 ~~~gg~lvp~~~~~~~iie~lr~~---s~i~~l~~~~~~~~~g~~-~ip~~~~~~~a~wv~E~~~~~~s~~~f~~i~l~~ 437 (632) T protein:vir:96 362 AGKGGELVATELLSEEFIDILRNK---AIIGQMGARMLPGLVGDV-DIPKKTSGANFYWIGEDEDVQDSDFDFTTLSFSP 437 (632) T ss_pred ccccccccccccchHHHHHHHhhc---chhhhhcceEeecCCcce-EEEEEeCCceeEeecCCccccccccceeeEEeee Confidence 234555555 33445543333221 122233222222222222 1223334445779999999999999999999999 Q ss_pred EeeeehhhhhhhHhhhcchhhHHHHHHHHHHHHHHHHHHHHHhhcccccCCCCCCcccchhhhHHHhhccCCcEEEccCC Q lcl|NC_018863. 121 KFLSDTKQQSLAAGLVNNIADPMTILTEDAISVIAKSIEWAIFYGDAALAAEADNQAGIEFDGLTKLIDEATNVIDLKGE 200 (479) Q Consensus 121 k~l~~~~~vs~~~~lv~~~~dp~~~~~~~ai~~~~~~~e~a~f~Gd~~l~~~~~~~~gleFDGl~~~I~~~~NviDarG~ 200 (479) |=++.-..+|..+= .++.-|.+....++-...++..++.++++|+.. + ++ --|+.+.-. -+.+...+. T Consensus 438 ~k~~~~v~iS~ell-~ds~~~~~~~i~~~l~~a~~~~~d~a~l~G~G~-~--~~------p~Gi~~~~~--~~~~~~~~~ 505 (632) T protein:vir:96 438 KTIAGAVPVTRKLR-KQSSIHVENLIREDLIEGIGVALDLAMLTGTGL-A--ND------PVGLLNMTG--VPALTYPAG 505 (632) T ss_pred eEEEEehhhHHHHH-hccchHHHHHHHHHHHHHHHHHHHHHhhcccCC-C--Cc------cceeeeccc--ccceecccc Confidence 99998888877652 244457788888999999999999999999753 1 11 236654432 234555555 Q ss_pred CCCHHHhhhhhheeecccCceee--eecChHHHhhHHHhh--cCceeEEeecCCCccccCccccceecCceeEEecCCcc Q lcl|NC_018863. 201 RLDEATLNKAAVIVGKGYGRATD--AFMPIGVQADFTNNL--LDRQRVIQPSQAGGFSTGFSINQFLSTRGAINLHGSTI 276 (479) Q Consensus 201 ~l~~~~l~~aa~~i~~~fG~atd--~~mp~~vka~f~q~~--~~~qrv~~~~n~g~~~~G~~V~~~~ss~g~I~L~~s~v 276 (479) .++.+.|..+...+...++.... ..|++.+...+..-. ...-+.+..++ .-.|++|-. +.. |. .+..+ T Consensus 506 ~~~~~~i~~~~~~i~~~~~~~~~~~~~~~~~~~~~l~~~~l~d~~G~~i~~~~---~l~G~pv~~--s~~--ip-~~~~~ 577 (632) T protein:vir:96 506 GVDWASVVDMETKISTFNADAGRLAYLTSVTQRGAAKKAQVFDNTGERIWQNN---EVNGYRAEA--SNQ--IP-ADTWI 577 (632) T ss_pred cCCHHHHHHHHHHHhhcccccCccEEEEchhHHHHHHHHhccCCCCceeecCC---eecccceEe--ccc--cc-cCcEE Confidence 56666666555555555554432 457777666655422 22223333221 222443311 110 00 01122 Q ss_pred cCCCccccCcccCCCCCcccceEEEeecc-cccCcccccccceeeEEEEEEEcCCCCcccccceeeeeec Q lcl|NC_018863. 277 MENDNILVDRIPEPNAPQAPASVVATVKV-NDKGAFRPVKDIKTHSYKVVVHSDDAESLASEAVTAVVAN 345 (479) Q Consensus 277 ~~a~~~lver~~s~~aP~~P~~vta~~~~-~~~g~~~~~sd~g~Y~YkV~a~n~~GES~~S~~VtaT~a~ 345 (479) .++.....-+... + ........+ ..+|.- .-.....+-+.+....+--...- .. T Consensus 578 ~gd~s~~~i~~~~--~----~~i~~~~~~~~~~~~v---~~~~~~~~d~~v~~~~af~~~k~------~A 632 (632) T protein:vir:96 578 FGDWSQIVIAMWG--V----LDLKVDPYTKAASDGL---VLRVFQDVDAGVRRKEAFCIAKK------GA 632 (632) T ss_pred EeecceEEEEEec--c----eEEEEccccccccCce---EEEEEeecCceeechhhhhheee------cC Confidence 2222111111000 0 001100000 000100 00011233333333222222211 11 No 92 >protein:vir:7409 Length: 408 # NCBI annotation: major structural protein # Family: family:all:21 # MgeID: mge:146 # MgeName: P335 # Cross-refs: genbank:acc:NP_839926;genbank:gi:30089896;genbank:GeneID:1260683 Probab=89.80 E-value=0.024 Score=29.44 Aligned_cols=312 Identities=13% Similarity=0.090 Sum_probs=130.8 Q ss_pred Ccccccccceee-----eecCchhHHHHHHHHHHHhhc----Ccc---------cCcccccCccccchhhhHHHHHHHhh Q lcl|NC_018863. 1 MTELQKEQKVEA-----RKLPAGAEAELAELVSKSFTT----GTG---------ITPDTQHDAAALRRELLDDQVKMLAF 62 (479) Q Consensus 1 ~~~~~~~~~~~~-----~~~~~~~~~~~~e~~~Ksf~a----g~~---------~~~~~~~~gaAlr~esld~~i~~l~~ 62 (479) +.+..++..... ...+.. ..+.-+.+.|+|.. +.. ....+..+|+.+-.+.+...|..+.. T Consensus 63 ~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~gg~~vP~~~~~~Ii~~~~ 141 (408) T protein:vir:74 63 LVEAQAEQVVNMREEEKGPLNKS-ENELKDKFVKDFVNMVRNPMAFLNTVSSKTETSGSDSAAGLTIPQDIRTMINTLVR 141 (408) T ss_pred HHHHHHHHHhhccccccccccch-hhhhHHHHHHHHHHHHhcchhhhhhhhhhhhcccccCCCceeechhHhhHHHHHHh Confidence 111111110000 000111 11111223333321 110 11223455778888999888865555 Q ss_pred ccccccchhhhccchhHHHHHHhhhhhccCccc-ccccccccccc-cccCcceEEEEEEEEeeeehhhhhhhHhhhcchh Q lcl|NC_018863. 63 TNGDFTIYPLINKQQVNSTVAKYAVFNQHGRTG-HSRFVREVGVA-SINDPNIRQKTVQMKFLSDTKQQSLAAGLVNNIA 140 (479) Q Consensus 63 ~~~~f~~~~~i~k~~~~stv~~y~~~~~~G~~g-~~~fv~E~g~~-~~~d~~~~r~~~~~k~l~~~~~vs~~~~lv~~~~ 140 (479) ... .+.+.+....+.+....|. +..+...+ ...+++|++.. +.+++.+.+....++-++.--.+|.-+= .++.. T Consensus 142 ~~~--~l~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~v~E~~~~~~~~~~~~~~i~~~~~k~~~~~~iS~ell-~ds~~ 217 (408) T protein:vir:74 142 QYD--SLQQYVRVESVSTSSGSRV-YEKWTDVTPLKAMDEEDGKIPDLDNPRLTIIKYLIKRYAGIITATNTLL-KDTAE 217 (408) T ss_pred hhc--chhhhcceeeccCCcceEE-EEeecCCcccccccccccccccccccceeeEEeeeeeEEeeehhHHHHH-hhchH Confidence 544 3555565555554433332 23333332 35688898875 4788999999999999998888887532 34566 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHhhcccccCCCCCCcccchhhhHHHhhccCCcEEEccCCCCCHHHhhhhhheeecccCc Q lcl|NC_018863. 141 DPMTILTEDAISVIAKSIEWAIFYGDAALAAEADNQAGIEFDGLTKLIDEATNVIDLKGERLDEATLNKAAVIVGKGYGR 220 (479) Q Consensus 141 dp~~~~~~~ai~~~~~~~e~a~f~Gd~~l~~~~~~~~gleFDGl~~~I~~~~NviDarG~~l~~~~l~~aa~~i~~~fG~ 220 (479) |.+....+.--..+...++.+++.|+..-.+. +..+-+|+++..+.. .+-.+|.. T Consensus 218 ~l~~~i~~~l~~~~~~~~d~~il~G~G~~~~~---~~~~~~~~i~~~~~~----------------------~l~~~~~~ 272 (408) T protein:vir:74 218 NILAWLSSWIAKKVVVTRNQAIIAAMGTVPKK---PTIANFDDVITMINT----------------------SVDPAIIA 272 (408) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhhcccccccc---cccccHHHHHHHHHH----------------------hhhhhhcC Confidence 77888888888999999999999999875443 335677777666531 01111211 Q ss_pred eeeeecChHHHhhHHHhhcC-ceeEEeecCCC----ccccCccccceecCceeEEecCCcccCCCccccCcccCC---CC Q lcl|NC_018863. 221 ATDAFMPIGVQADFTNNLLD-RQRVIQPSQAG----GFSTGFSINQFLSTRGAINLHGSTIMENDNILVDRIPEP---NA 292 (479) Q Consensus 221 atd~~mp~~vka~f~q~~~~-~qrv~~~~n~g----~~~~G~~V~~~~ss~g~I~L~~s~v~~a~~~lver~~s~---~a 292 (479) ..-.+|++.+.+.+...-.. ++-.++++ .. ..-.|.+|.-..+ ..-+.+.......+..-++. .. T Consensus 273 ~a~~v~n~~~~~~l~~lkd~~G~~l~~~~-~~~~~~~~l~G~pV~~~~~------~~~~~~~~~~~~i~~gd~~~~~~~~ 345 (408) T protein:vir:74 273 TSSLLTNQSGLNKLALVKTAEGKYLLEPD-PTKPNSYLIKGKQVIVVAD------RWLPNSGSTVYPLYYGDMSQAITLF 345 (408) T ss_pred CCEEEEcHHHHHHHHHhhcCCCceEeccC-cCCCCCceecceeeEEecC------cccccccCCcceEEEEehhccEEEE Confidence 12256777777777654322 22222222 11 1122332210000 00000000000000000000 00 Q ss_pred CcccceEEEeecccccCcccccccceeeEEEE------EEEcCCCCcccccceeeeeecCCCeEEEEEeecCCcccc Q lcl|NC_018863. 293 PQAPASVVATVKVNDKGAFRPVKDIKTHSYKV------VVHSDDAESLASEAVTAVVANPTDSVSLAVKLQSLYQAK 363 (479) Q Consensus 293 P~~P~~vta~~~~~~~g~~~~~sd~g~Y~YkV------~a~n~~GES~~S~~VtaT~a~~~~~V~LtIt~~~~~~~~ 363 (479) ......+. ........|.. +...|++ .+.+..+ -..-.+++++..++...+. +.++. T Consensus 346 ~~~~~~i~--~~~~~~~~f~~----~~~~~r~~~r~d~~~~~~~a--~~~~~~~~~~~~~~~~~~~------~~~~~ 408 (408) T protein:vir:74 346 DRENMSLL--PTNIGAGAFET----DTTKIRVIDRFDVKATDSEA--LVAGSFTAIADQVGNFKTT------TSTAV 408 (408) T ss_pred EecceEEE--Eeccccchhhc----ceeeEEEEEeeCcEEecccc--eEEEEeecccCCCCCCCCC------ccccC Confidence 00000000 00000000100 0111111 1111110 0000011111111111111 11111 No 93 >protein:vir:4511 Length: 409 # NCBI annotation: capsid # Family: family:all:21 # MgeID: mge:97 # MgeName: V # Cross-refs: genbank:acc:NP_599037;genbank:gi:19548995;genbank:GeneID:935211 Probab=89.69 E-value=0.025 Score=29.37 Aligned_cols=309 Identities=11% Similarity=0.047 Sum_probs=129.0 Q ss_pred CcccccccceeeeecCchhHHHHHHHHHHHhhcC---------------cccCcccccCccccchhhhHHHHHHHhhccc Q lcl|NC_018863. 1 MTELQKEQKVEARKLPAGAEAELAELVSKSFTTG---------------TGITPDTQHDAAALRRELLDDQVKMLAFTNG 65 (479) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~e~~~Ksf~ag---------------~~~~~~~~~~gaAlr~esld~~i~~l~~~~~ 65 (479) ..+-....++ ..+.+.....+-.++|.+.+..| -.....+..+|+.|-++-+..+|..+..... T Consensus 67 ~~~~~~~~~~-~~~~~~~~~~~~~~a~~~~l~~~~~~~~~~e~~~~~~~~a~~~~~~~~gg~liP~~~~~~ii~~~~~~~ 145 (409) T protein:vir:45 67 SNEEEQRQNL-DPENNSQQDEKRAQVFDKWMRHGASELTSEERKALRELRAQGVAQDEKGGYTVPETFLAKVVEKMKSYG 145 (409) T ss_pred hhhhhhcccC-CCCCcchhhHHHHHHHHHHHHhhhhhccHHHHHHHHHHhhccCccCcCCceeccHhHHHHHHHHHHhhh Confidence 0000000000 00111111011011222222111 1111123345777888888888877765444 Q ss_pred cccchhhhccchhHHHHHHhhhhhccCc-ccccccccccccccccCcceEEEEEEE-EeeeehhhhhhhHhhhcchhhHH Q lcl|NC_018863. 66 DFTIYPLINKQQVNSTVAKYAVFNQHGR-TGHSRFVREVGVASINDPNIRQKTVQM-KFLSDTKQQSLAAGLVNNIADPM 143 (479) Q Consensus 66 ~f~~~~~i~k~~~~stv~~y~~~~~~G~-~g~~~fv~E~g~~~~~d~~~~r~~~~~-k~l~~~~~vs~~~~lv~~~~dp~ 143 (479) . +.+.+...+..+ ..+..+...++ ...+.+++|++..+..++.+....... |+.+.--.+|.-+ +.++.-|.+ T Consensus 146 ~--l~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~v~E~~~~~~~~~~f~~~~l~~~k~~~~~i~is~el-l~ds~~~l~ 220 (409) T protein:vir:45 146 G--IASVAQILTTSD--GRTMEWATADGTSEVGVLLGENEEAGEEDTDFGMGSLGALKMTSKIIRVSNEL-LQDSAIDME 220 (409) T ss_pred h--hhhhceeeecCC--CceEEEEeeccCccccccccccccccccccccceeeeeeeeeeeeehhhhHHH-HhccHHHHH Confidence 3 333232222222 11112223333 334679999999999999999888654 5554333344432 234455777 Q ss_pred HHHHHHHHHHHHHHHHHHHhhcccccCCCCCCcccchhhhHHHhhccCCcEEEccCCCCCHHHhhhhhheeeccc-Ccee Q lcl|NC_018863. 144 TILTEDAISVIAKSIEWAIFYGDAALAAEADNQAGIEFDGLTKLIDEATNVIDLKGERLDEATLNKAAVIVGKGY-GRAT 222 (479) Q Consensus 144 ~~~~~~ai~~~~~~~e~a~f~Gd~~l~~~~~~~~gleFDGl~~~I~~~~NviDarG~~l~~~~l~~aa~~i~~~f-G~at 222 (479) ....+.--..+...++.++++|+..-.+ .++.|+.+.... .+.. +....++.+.|.++..-+..+| ..+. T Consensus 221 ~~i~~~la~a~~~~~~~a~l~G~G~~~~-------~~p~Gil~~~~~-~~~~-~~~~~~~~d~i~~l~~~l~~~~~~~a~ 291 (409) T protein:vir:45 221 AYLARRIAERIGRGEARYLIQGTGAGTP-------KQPKGLAASVTG-TTQT-AAANAVKWQEILALKHSIDPAYRRGPK 291 (409) T ss_pred HHHHHHHHHHHHHHHHHHhhccCCCCCc-------cccceeeecccc-cccc-ccccccchHHHHHHHHhhhhhhccCCe Confidence 8888888888999999999999976332 345687766542 2322 3334455555554443344443 3444 Q ss_pred e-eecChHHHhhHHHhhcCceeEEeecCCCccccCccccceecCceeEEecCCcccCCCccccCcccCCCCCcccceE-- Q lcl|NC_018863. 223 D-AFMPIGVQADFTNNLLDRQRVIQPSQAGGFSTGFSINQFLSTRGAINLHGSTIMENDNILVDRIPEPNAPQAPASV-- 299 (479) Q Consensus 223 d-~~mp~~vka~f~q~~~~~qrv~~~~n~g~~~~G~~V~~~~ss~g~I~L~~s~v~~a~~~lver~~s~~aP~~P~~v-- 299 (479) - ++|+..+.+.+...-...-|.+...+..+... .++++.+.+..+..+...+ ...+.. T Consensus 292 ~~~~~n~~~~~~l~~lkd~~G~~i~~~~~~~~~~------------------~~l~G~PV~~~~~~p~~~~-~~~~i~~G 352 (409) T protein:vir:45 292 FRLAFNDNTLKLISEMEDGQGRPLWLPDIVGVAP------------------ASVLNVPYVIDQEIDDIGA-GKKFMFCG 352 (409) T ss_pred EEEEECHHHHHHHHHhhcCCCceeeccCcCCCCC------------------ceecceeeEEecCcCCccC-CccEEEEe Confidence 3 35788888887765443334332222111100 0111122222221111000 000000 Q ss_pred -------------EEeecccccCcccccccceeeEEEEEEEcCCCCcccccceeeeeecCCCeEEEEEeecCCccc Q lcl|NC_018863. 300 -------------VATVKVNDKGAFRPVKDIKTHSYKVVVHSDDAESLASEAVTAVVANPTDSVSLAVKLQSLYQA 362 (479) Q Consensus 300 -------------ta~~~~~~~g~~~~~sd~g~Y~YkV~a~n~~GES~~S~~VtaT~a~~~~~V~LtIt~~~~~~~ 362 (479) +...... .++ .. +...|++...=+.+ +....--+.|++..+.. + T Consensus 353 d~~~~~i~~~~~~~~~~~~d---~~~-~~--~~~~~~~~~r~d~~-----------~~~~~A~~~l~~k~s~~--~ 409 (409) T protein:vir:45 353 DFDRFIIRRVRYMILKRLVE---RYA-EY--DQTGFLAFHRFDCI-----------LEDTSAIKALVGKGSVG--G 409 (409) T ss_pred ehhhhheeeccceEEEEeec---ccc-cC--CcEEEEEEEEeccE-----------eechhheEEEEeccCCC--C Confidence 0000000 000 00 01112221111111 11111112223332211 1 No 94 >protein:vir:1084 Length: 437 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:21 # MgeName: bIL309 # Cross-refs: genbank:acc:NP_076738;genbank:gi:13095848;genbank:GeneID:920418 Probab=89.04 E-value=0.0072 Score=32.34 Aligned_cols=310 Identities=15% Similarity=0.065 Sum_probs=114.4 Q ss_pred Ccccccc--------------c----ceeeeecCchhHHHHHHHHHHHhhcCcc--cCcccccCccccchhhhHHHHHHH Q lcl|NC_018863. 1 MTELQKE--------------Q----KVEARKLPAGAEAELAELVSKSFTTGTG--ITPDTQHDAAALRRELLDDQVKML 60 (479) Q Consensus 1 ~~~~~~~--------------~----~~~~~~~~~~~~~~~~e~~~Ksf~ag~~--~~~~~~~~gaAlr~esld~~i~~l 60 (479) ..++.++ . +....+........-.+.+.+.+..+-. ....+..+|+.|..+.+...|..+ T Consensus 100 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~g~lvp~~~~~~i~~~ 179 (437) T protein:vir:10 100 KTETKSEAEKDKKTVKDEEKRDAGGLQDMKLKVGGEIADKKVTAFADYLKTGEVRDVTGIALKDGKVIIPETILTPEKEV 179 (437) T ss_pred HHHHHHHHHHHHHHHHHHHHHhHHHHhHHHHHHHHHHHHhhhhhhHHHHHhhhhhhhhhcccccccccchHHHHHHHHHh Confidence 0000000 0 0000000000000000122232222211 111234567778888887777554 Q ss_pred hhccccccchhhhccchhHHHHHHhhhhhccCccccccccccccccc-ccCcceEEEEEEEEeeeehhhhhhhHhhhcch Q lcl|NC_018863. 61 AFTNGDFTIYPLINKQQVNSTVAKYAVFNQHGRTGHSRFVREVGVAS-INDPNIRQKTVQMKFLSDTKQQSLAAGLVNNI 139 (479) Q Consensus 61 ~~~~~~f~~~~~i~k~~~~stv~~y~~~~~~G~~g~~~fv~E~g~~~-~~d~~~~r~~~~~k~l~~~~~vs~~~~lv~~~ 139 (479) ... -.+...+...+..+---+|......+ +...+++|++... .+++.+.+....++=++.-..+|.-+ +.++. T Consensus 180 ~~~---~~l~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~e~~~~~e~~~~~~~~v~~~~~k~~~~~~is~el-l~ds~ 253 (437) T protein:vir:10 180 HQF---PRLGSLVRTESVTTTTGKLPIFNNST--DLLTAHTEYGQTTKNATPVITPILWDLKTYTGGYVFSQEL-ISDSS 253 (437) T ss_pred hhh---hhhhhcceeEeeccCceeeEEeeccc--cccccccccccccccccccceeeeeehhheeeehhhhHHH-HhhhH Confidence 221 12333333333333333343332222 3456788888665 78899999999888787766777643 34556 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHhhcccccCCCCCCcccchhhhHHHhhcc-CCcEEEccCCCC-CHHHhhhhhheeecc Q lcl|NC_018863. 140 ADPMTILTEDAISVIAKSIEWAIFYGDAALAAEADNQAGIEFDGLTKLIDE-ATNVIDLKGERL-DEATLNKAAVIVGKG 217 (479) Q Consensus 140 ~dp~~~~~~~ai~~~~~~~e~a~f~Gd~~l~~~~~~~~gleFDGl~~~I~~-~~NviDarG~~l-~~~~l~~aa~~i~~~ 217 (479) .|......+.-...+...++.+++.|+.+-.+.+. .....|.+...|.. -...+...+..+ +...++....+. .+ T Consensus 254 ~~~~~~i~~~l~~~~~~~~~~~i~~g~g~~~~~~~--~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~l~~lk-d~ 330 (437) T protein:vir:10 254 YDWQAELQSRLIELRDNTDDSLIITALTDGIKKTT--STYLLGDLKKVLNVTLKPQDSAAASIVMSQSAYNLFDMAT-DA 330 (437) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhhhcccccccc--cccchhhHHHHHHhhhhhhhhcCCEEEEcHHHHHHHHHhh-cc Confidence 67778888888899999999999999987555432 23456777666541 112222222211 222222222211 11 Q ss_pred cCceeeeecChHHHhhHHHhhcCceeEEeecCC-CccccCc---cccceecCceeEEecCCcccCCCccccCcccCCCCC Q lcl|NC_018863. 218 YGRATDAFMPIGVQADFTNNLLDRQRVIQPSQA-GGFSTGF---SINQFLSTRGAINLHGSTIMENDNILVDRIPEPNAP 293 (479) Q Consensus 218 fG~atd~~mp~~vka~f~q~~~~~qrv~~~~n~-g~~~~G~---~V~~~~ss~g~I~L~~s~v~~a~~~lver~~s~~aP 293 (479) -|. .+|.|.. +..-...++|+..++..+.. .....|. -+.-|...+...++.+-.+.-.+.+ .. T Consensus 331 ~g~--~~~~~~~-~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~r~~~~~~~~~~~---------~~ 398 (437) T protein:vir:10 331 MGR--PLLQPNV-TAATGYTLLGKTVVIVDDKLFPSASAGDVNIVVAPLKKAVINFKLTEITGQFQDTY---------DI 398 (437) T ss_pred CCC--eeeccCc-cCCCCcccccceeEEecccccCCcCCCceEEEEeeccccEEEEeeeceEEEEeccc---------cc Confidence 121 1333321 11111223333222211100 0000110 0000000000000111111000000 00 Q ss_pred cccceEEEeecccccCcccccccc-eeeEEEEEEEcCCCCcccccceeee Q lcl|NC_018863. 294 QAPASVVATVKVNDKGAFRPVKDI-KTHSYKVVVHSDDAESLASEAVTAV 342 (479) Q Consensus 294 ~~P~~vta~~~~~~~g~~~~~sd~-g~Y~YkV~a~n~~GES~~S~~VtaT 342 (479) .....-+.. =..+... ..++ .....++.++ .....+++ T Consensus 399 ~~~~~~~~~---r~d~~~~-~~~a~~~l~~~~~~~-------~~~~~~~~ 437 (437) T protein:vir:10 399 WYKQLGIFL---RQNVVQA-SKDLIVNLTGKLKAV-------TVVQSTAV 437 (437) T ss_pred ccceeeEEE---EEccEEe-cccceEEEEeecccc-------ccCCCCCC Confidence 000000000 0000000 0000 0011111111 10000000 No 95 >protein:vir:4456 Length: 401 # NCBI annotation: Major capsid protein precursor # Family: family:all:21 # MgeID: mge:96 # MgeName: ST64B # Cross-refs: genbank:acc:NP_700379;genbank:gi:23505451;genbank:GeneID:955658 Probab=88.60 E-value=0.031 Score=28.83 Aligned_cols=309 Identities=13% Similarity=0.095 Sum_probs=142.4 Q ss_pred Ccc----------cccc-cceeeeecCchhHHHHHHHH----------------HHHhhcCcccCcccccCccccchhhh Q lcl|NC_018863. 1 MTE----------LQKE-QKVEARKLPAGAEAELAELV----------------SKSFTTGTGITPDTQHDAAALRRELL 53 (479) Q Consensus 1 ~~~----------~~~~-~~~~~~~~~~~~~~~~~e~~----------------~Ksf~ag~~~~~~~~~~gaAlr~esl 53 (479) +.+ ...+ .+... ....+..++.-++| .|++++| +..+|+.|-++.+ T Consensus 51 ~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~e~~~a~~~~lr~~~~~~~~~~e~~a~~~~------~~~~GG~~iP~~~ 123 (401) T protein:vir:44 51 LSELENLKSDLEKELLELKRPAR-GAQNKVAAEHKDAFVGFLRKGREDGLRDLERKALQVG------TDEDGGYAVPEEL 123 (401) T ss_pred HHHHHHHHHHHHHHHHHhhcccc-ccccchhHHHHHHHHHHHhhhhhhhhHHHHHHHhhcC------CCCCCceeccHhH Confidence 000 0000 00000 00011111111112 2223322 2345677888888 Q ss_pred HHHHHHHhhccccccchhhhccchhHHHHHHhhhhhccCccccccccccccc-ccccCcceEEEEEEEEeeeehhhhhhh Q lcl|NC_018863. 54 DDQVKMLAFTNGDFTIYPLINKQQVNSTVAKYAVFNQHGRTGHSRFVREVGV-ASINDPNIRQKTVQMKFLSDTKQQSLA 132 (479) Q Consensus 54 d~~i~~l~~~~~~f~~~~~i~k~~~~stv~~y~~~~~~G~~g~~~fv~E~g~-~~~~d~~~~r~~~~~k~l~~~~~vs~~ 132 (479) .++|..+..... .+.+-+...++.+....|.+ .-++. ...+++|++. ++..++.+.+....++=++.--.+|.- T Consensus 124 ~~~ii~~~~~~~--~l~~~~~~~~~~~~~~~~~~--~~~~~-~a~wv~E~~~~~~~~~~~~~~v~~~~~k~~~~~~iS~e 198 (401) T protein:vir:44 124 DRSILSLLKDEV--VMRQEATVITVGGSDYKKLV--NLGGT-ASGWVGETDTRSQTATSRLGLIEPFMGEIYGNPQATQK 198 (401) T ss_pred HHHHHHHHHhhh--hhhhhceeeecCCCceEEEE--ecCCc-cceeeccccccCccccccceeeeeehhheeeehhhhHH Confidence 888866665443 23444444455544333333 22333 3457999985 567778999998888877776666664 Q ss_pred HhhhcchhhHHHHHHHHHHHHHHHHHHHHHhhcccccCCCCCCcccchhhhHHHhhcc-----------CCcEEEccCCC Q lcl|NC_018863. 133 AGLVNNIADPMTILTEDAISVIAKSIEWAIFYGDAALAAEADNQAGIEFDGLTKLIDE-----------ATNVIDLKGER 201 (479) Q Consensus 133 ~~lv~~~~dp~~~~~~~ai~~~~~~~e~a~f~Gd~~l~~~~~~~~gleFDGl~~~I~~-----------~~NviDarG~~ 201 (479) + +.++..|.+....+.-...++..++.++|+||-. + . -.|+.+.... .+.+....... T Consensus 199 l-l~ds~~~l~~~i~~~la~ai~~~~~~~~l~G~G~-~-~--------p~Gil~~~~~~~~~~~~~~~~~~~~~t~~~~~ 267 (401) T protein:vir:44 199 M-LDDAFFNVEAWINSELATEFAEQEEIAFTTGDGT-K-K--------PKGFLAYESTEESDKARAFGKLQHIVSGEATA 267 (401) T ss_pred H-HhcchHHHHHHHHHHHHHHHHHHHHhhhhccCCC-C-c--------cceeeccccccccccccccccccccccccccc Confidence 2 2355668888888888889999999999999875 2 1 1355544321 12333344455 Q ss_pred CCHHHhhhhhheeecccCceeeeecChHHHhhHHHhhcCceeEEeecCCCccccCccccceecCceeEEecCCcccCCCc Q lcl|NC_018863. 202 LDEATLNKAAVIVGKGYGRATDAFMPIGVQADFTNNLLDRQRVIQPSQAGGFSTGFSINQFLSTRGAINLHGSTIMENDN 281 (479) Q Consensus 202 l~~~~l~~aa~~i~~~fG~atd~~mp~~vka~f~q~~~~~qrv~~~~n~g~~~~G~~V~~~~ss~g~I~L~~s~v~~a~~ 281 (479) ++.+.|..+.-.+...|....-.+|+..+...+...-...-|.+...+... | .+.++++.+. T Consensus 268 ~~~d~i~~~~~~l~~~~~~~a~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~---g---------------~~~~l~G~PV 329 (401) T protein:vir:44 268 VTADAIIKLIYTLRKAHRTGAKFMMNNNSLFAIRLLKDTEGNYLWRPGLEL---G---------------QPSSLAGYGI 329 (401) T ss_pred cCHHHHHHHHHhcchhhhcCCEEEEcHHHHHHHHHhhccCCceeecCCcCC---C---------------CCceecceee Confidence 555555544444455555545588999888888765443334432221111 1 0112333333 Q ss_pred cccCcccCCCCCcccceEEEeecccccCcccccccceeeEEEEEEEcCCCCcccccceeeeeecCCCeEEEEEeecCCcc Q lcl|NC_018863. 282 ILVDRIPEPNAPQAPASVVATVKVNDKGAFRPVKDIKTHSYKVVVHSDDAESLASEAVTAVVANPTDSVSLAVKLQSLYQ 361 (479) Q Consensus 282 ~lver~~s~~aP~~P~~vta~~~~~~~g~~~~~sd~g~Y~YkV~a~n~~GES~~S~~VtaT~a~~~~~V~LtIt~~~~~~ 361 (479) +..+..+.. +...+ +. . .|.+++.+....+.|-+...+ .-...+-+. T Consensus 330 v~~~~~p~~-~~~~~--~i------~---------~Gd~~~~~~i~~~~~~~~~~~-----~~~~~~~v~---------- 376 (401) T protein:vir:44 330 AENEQMPDI-AADAK--AI------A---------FGNFKRGYTIVDRIGTRILRD-----PYTNKPFVG---------- 376 (401) T ss_pred EEecCcCCc-cCCcc--EE------E---------EeehhccEEEEEecceEEeee-----ccccCCcEE---------- Confidence 333322110 00000 00 0 123332222333333221100 000112222 Q ss_pred ccceEEEEEeccC---CCCcEEEEEEeeee Q lcl|NC_018863. 362 AKPQFISVYRQGN---ETGHYFLVARVPLS 388 (479) Q Consensus 362 ~~~~y~~IYR~t~---~~g~~~~i~rV~~s 388 (479) |+.+.|=+- .+.-| .+-+++.+ T Consensus 377 ----~~a~~r~d~~~~~~~a~-~~l~~~aa 401 (401) T protein:vir:44 377 ----FYTTKRTGGMLVDSQAI-KLLKIAAA 401 (401) T ss_pred ----EEEEEEeccEEecccce-EEEEeecC Confidence 333333331 11112 22223322 No 96 >protein:vir:100247 Length: 425 # NCBI annotation: gp76 # Family: family:all:21 # MgeID: mge:1619 # MgeName: Bcep176 # Cross-refs: genbank:acc:YP_355412;genbank:gi:77864702;genbank:GeneID:3725969 Probab=88.30 E-value=0.033 Score=28.69 Aligned_cols=311 Identities=12% Similarity=0.044 Sum_probs=140.4 Q ss_pred CcccccccceeeeecCchhHHHHHHHH---------HHHhhcCcccCcccccCccccchhhhHHHHHHHhhccccccchh Q lcl|NC_018863. 1 MTELQKEQKVEARKLPAGAEAELAELV---------SKSFTTGTGITPDTQHDAAALRRELLDDQVKMLAFTNGDFTIYP 71 (479) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~e~~---------~Ksf~ag~~~~~~~~~~gaAlr~esld~~i~~l~~~~~~f~~~~ 71 (479) ..+..+++.......+... .+..++| .++++.| +..+|+.|-.+.+..+|..+..... .+++ T Consensus 92 ~~~~~~~~~~~~~~~~~~~-~~~~~af~~~l~~~e~~~al~~~------t~~~gG~lvP~~~~~~ii~~~~~~s--~l~~ 162 (425) T protein:vir:10 92 NIKIAAAQMGANGVKPLRD-PEYTEAFKAHVKRGDVQAALNKG------EDSEGGYLTPIEWDRTITNKLVLIS--PMRQ 162 (425) T ss_pred HHHHHhhhccccccccccc-HHHHHHHHHHhhhhhhHHHhhcC------cCCCCceeccHhHHHHHHHHHHhhh--hhhh Confidence 1111111111111111111 1111222 2333332 3456788889999888866655443 3444 Q ss_pred hhccchhHHHHHHhhhhhccCccccccccccccc-ccccCcceEEEEEEEEeeeehhhhhhhHhhhcchhhHHHHHHHHH Q lcl|NC_018863. 72 LINKQQVNSTVAKYAVFNQHGRTGHSRFVREVGV-ASINDPNIRQKTVQMKFLSDTKQQSLAAGLVNNIADPMTILTEDA 150 (479) Q Consensus 72 ~i~k~~~~stv~~y~~~~~~G~~g~~~fv~E~g~-~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lv~~~~dp~~~~~~~a 150 (479) -....++.+.-..|.+.. + .....+++|++. ++...+.+.+.....+=++.-..+|.-+ +.++.-|.+....+.- T Consensus 163 l~~~~~~~~~~~~~~~~~--~-~~~a~wv~E~~~~~~~~~~~f~~v~~~~~k~~~~i~iS~el-l~ds~~~l~~~i~~~l 238 (425) T protein:vir:10 163 LCRVQPVSKAGFSKLFNM--G-GTTSGWVGEASQRPQTNAATFQPLSFASGEIYANPAATQQI-LDDAEIDLESWLATEV 238 (425) T ss_pred hceeeeccCCceEEEEEc--C-CcceeeeccccccccccccccceeeeeheeeEeehHhHHHH-HhcchhHHHHHHHHHH Confidence 444444444333333322 2 234678999987 4566689999988888777766676643 2344567888889999 Q ss_pred HHHHHHHHHHHHhhcccccCCCCCCcccchhhhHHHhhccCCc-----------EEEccCCCCCHHHhhhhhheeecccC Q lcl|NC_018863. 151 ISVIAKSIEWAIFYGDAALAAEADNQAGIEFDGLTKLIDEATN-----------VIDLKGERLDEATLNKAAVIVGKGYG 219 (479) Q Consensus 151 i~~~~~~~e~a~f~Gd~~l~~~~~~~~gleFDGl~~~I~~~~N-----------viDarG~~l~~~~l~~aa~~i~~~fG 219 (479) ...++..++.++++||-.-. -.|+++.+....+ +.......++.+.|-++...+...|- T Consensus 239 a~ai~~~~d~~~l~G~G~~~----------p~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~l~~l~~~l~~~~~ 308 (425) T protein:vir:10 239 QTEFAKQEGKAFLAGDGTNK----------PNGLLTYIAGGANAAKHPFGAIEVVNSGAAADITSDGIIDLVYDLPSAFT 308 (425) T ss_pred HHHHHHHHHhhhhcccCCCC----------cceeeeccccccccccccccccccccccccccccHHHHHHHHhhhhhhhc Confidence 99999999999999986421 2477766542111 11122233444444433333344454 Q ss_pred ceeeeecChHHHhhHHHhhcCceeEEeecCCCccccCccccceecCceeEEecCCcccCCCccccCcccCCCCCcccceE Q lcl|NC_018863. 220 RATDAFMPIGVQADFTNNLLDRQRVIQPSQAGGFSTGFSINQFLSTRGAINLHGSTIMENDNILVDRIPEPNAPQAPASV 299 (479) Q Consensus 220 ~atd~~mp~~vka~f~q~~~~~qrv~~~~n~g~~~~G~~V~~~~ss~g~I~L~~s~v~~a~~~lver~~s~~aP~~P~~v 299 (479) ...-..|++.+...+...-...-|.+...+..+ |. +.++++.+.+..+..+...+. ... T Consensus 309 ~~a~~vmn~~~~~~L~~lkD~~G~~l~~~~~~~---g~---------------~~~l~G~PV~~~~~~p~~~~~-~~~-- 367 (425) T protein:vir:10 309 GNARFAMNRNTQRQVRKLKDGQGNYLWQPSYVA---GQ---------------PATLAGYPVTEVPDMPDVAAN-STP-- 367 (425) T ss_pred cCCEEEEchHHHHHHHHhhcCCCceeeccCccC---CC---------------CceecceeeEEecCcCCccCC-ccE-- Confidence 444578999998888765543334332221111 10 112222333222221110000 000 Q ss_pred EEeecccccCcccccccceeeEEEEEEEcCCCCcccccceeeeeecCCCeEEEEEe-ecCCccccceEEEEE-eccCC Q lcl|NC_018863. 300 VATVKVNDKGAFRPVKDIKTHSYKVVVHSDDAESLASEAVTAVVANPTDSVSLAVK-LQSLYQAKPQFISVY-RQGNE 375 (479) Q Consensus 300 ta~~~~~~~g~~~~~sd~g~Y~YkV~a~n~~GES~~S~~VtaT~a~~~~~V~LtIt-~~~~~~~~~~y~~IY-R~t~~ 375 (479) . .. |.+++.+..+.+.|-....+. -...+-+.+.+. --...-..|.-+.+. -++++ T Consensus 368 i------~~---------Gd~~~~~~i~~~~~~~v~~d~-----~~~~~~~~~~~~~r~d~~v~~~~A~~~l~~~as~ 425 (425) T protein:vir:10 368 I------LF---------GDFQQTYLIIDRIGVRVLRDP-----YTAKPYVLFYTTKRVGGGLLNPEPMRAMKVAASE 425 (425) T ss_pred E------EE---------EehhccEEEEEecceEEEecc-----cccCCcEEEEEEEEeccEeecccceEEEEeeccC Confidence 0 01 222211112222221111110 011222222211 111111122222222 22232 No 97 >protein:vir:98635 Length: 377 # NCBI annotation: major coat protein # Family: family:all:635 # MgeID: mge:1601 # MgeName: phi3396 # Cross-refs: genbank:acc:YP_001039923;genbank:gi:126011098;genbank:GeneID:4818471 Probab=88.10 E-value=0.034 Score=28.61 Aligned_cols=320 Identities=13% Similarity=0.017 Sum_probs=126.2 Q ss_pred Cccccccc------ce-----eeeecCchhHHHHHHHHHHHhhcCcccCcccccCccccchhhhHHHHHHHhhccccccc Q lcl|NC_018863. 1 MTELQKEQ------KV-----EARKLPAGAEAELAELVSKSFTTGTGITPDTQHDAAALRRELLDDQVKMLAFTNGDFTI 69 (479) Q Consensus 1 ~~~~~~~~------~~-----~~~~~~~~~~~~~~e~~~Ksf~ag~~~~~~~~~~gaAlr~esld~~i~~l~~~~~~f~~ 69 (479) +..+.++. +. .......-...+ .+++.+..++| +-.+|+.|-.+.+.+.|....... ..+ T Consensus 39 ~~~~~~~~~~~~~~e~~~~~~~~~~~~~lt~ee-~~~~~~~~~~~------~~~~gg~~vP~~~~~~I~~~l~~~--s~i 109 (377) T protein:vir:98 39 FTTMGDEILAKNEEEMERMFDLRDKNRELTAEE-IKFFNDIDKNV------GGKDKFKLLPEETMVQVFDDLVAE--HPL 109 (377) T ss_pred HHhHHHHHHHHHHHHHHHHHHhccCCcccCHHH-HHHHHHHHhcc------CCCCCccccCHHHHHHHHHHHHHh--hhh Confidence 11100000 00 000000000011 12333333332 335677788887777764433222 234 Q ss_pred hhhhccchhHHHHHHhhhhhccCccccccccccccc-ccccCcceEEEEEEEEeeeehhhhhhhHhhhcchhhHHHHHHH Q lcl|NC_018863. 70 YPLINKQQVNSTVAKYAVFNQHGRTGHSRFVREVGV-ASINDPNIRQKTVQMKFLSDTKQQSLAAGLVNNIADPMTILTE 148 (479) Q Consensus 70 ~~~i~k~~~~stv~~y~~~~~~G~~g~~~fv~E~g~-~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lv~~~~dp~~~~~~ 148 (479) ++.+....+.+.+ .+....+.+.+.+++|.+. .+..++.+.+.....+=|+.--.+|.-+ |.++..|.+....+ T Consensus 110 ~~~~~v~~~~~~~----~~~~~~~~~~a~w~~e~~~~~~~~~~~f~~i~l~~~kl~a~~~is~el-L~ds~~~ie~~i~~ 184 (377) T protein:vir:98 110 LKVINFKNTSLRL----KALTAETSGTAVWGDIFGEIKGQLKQAFKEQDFSQFKLTAFVVIPKDA-LKFGPKWIKQFITE 184 (377) T ss_pred hhheeeEecCcce----EEEEecCCcceeEeecccccCcccCccceeEeecceeEEeeecccHHh-hhccHhHHHHHHHH Confidence 4544444443332 2344445556678899876 4578999999999998888776776655 55678899999999 Q ss_pred HHHHHHHHHHHHHHhhcccccCCCCCCcccchhhhHHHhhcc----CCcEEEccCCCCCHHHhhhhhheeecccCceeee Q lcl|NC_018863. 149 DAISVIAKSIEWAIFYGDAALAAEADNQAGIEFDGLTKLIDE----ATNVIDLKGERLDEATLNKAAVIVGKGYGRATDA 224 (479) Q Consensus 149 ~ai~~~~~~~e~a~f~Gd~~l~~~~~~~~gleFDGl~~~I~~----~~NviDarG~~l~~~~l~~aa~~i~~~fG~atd~ 224 (479) .--.++++.++.+++.||-.-- --||++.+.. .....++.+...+.+.|-++.-.....|..--.. T Consensus 185 ~la~~~a~~~~~a~i~G~G~~q----------P~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~a~~ 254 (377) T protein:vir:98 185 QLKEAIAVALELAIVKGDGLLQ----------PVGLLKDLSQPTVDQSTGRDITTYKTDKEAIADLSDLTPDNAPKKLVP 254 (377) T ss_pred HHHHHHHHHHhhceEeccCCCc----------ceeeeecccccccccccccccccccchhhhHhhhhhhchhHHHHHHHH Confidence 9999999999999999996421 2477766531 1112233333333223322221111222221123 Q ss_pred ecChHHHhhHHHhhcCceeEEeecCCCccccCccccceecCceeEEecCCcccCCCccccCcccCCCCCcccceEEEeec Q lcl|NC_018863. 225 FMPIGVQADFTNNLLDRQRVIQPSQAGGFSTGFSINQFLSTRGAINLHGSTIMENDNILVDRIPEPNAPQAPASVVATVK 304 (479) Q Consensus 225 ~mp~~vka~f~q~~~~~qrv~~~~n~g~~~~G~~V~~~~ss~g~I~L~~s~v~~a~~~lver~~s~~aP~~P~~vta~~~ 304 (479) .|...+.......-...-+.+..-|+.....-........+.|. +.+++..+...++ +...|. T Consensus 255 ~m~~~t~~~~~klkd~~G~~i~~~n~~~~~~~~p~~~~~~~~G~----~~t~lg~p~~vv~---s~~~p~---------- 317 (377) T protein:vir:98 255 VMKHLSVNDKKRPLKIAGQVKLILNPEDRWALEAQFTSRNQFGE----YVTVLPHGITILE---SLAVET---------- 317 (377) T ss_pred HHHHHHHHHHhhhhccCCceEEEecccchhhccccccccCCCCc----cccccCCCceEEe---cCCCCc---------- Confidence 33444433333222222222221112110000000000001010 1112211111111 111111 Q ss_pred ccccCcccccccceeeEEEEEEEcCCCCcccccceeeeeecCCCeEEEEEeecCCccccceEEEEEeccC---CCCcEEE Q lcl|NC_018863. 305 VNDKGAFRPVKDIKTHSYKVVVHSDDAESLASEAVTAVVANPTDSVSLAVKLQSLYQAKPQFISVYRQGN---ETGHYFL 381 (479) Q Consensus 305 ~~~~g~~~~~sd~g~Y~YkV~a~n~~GES~~S~~VtaT~a~~~~~V~LtIt~~~~~~~~~~y~~IYR~t~---~~g~~~~ 381 (479) ++- .-.|-+ + |.+. .+.|-+..... -+-...+.+ -|+.+.|=.- +...+ . T Consensus 318 ----~~i-~fgdf~-~-Y~i~--~r~~~~i~~~~---~~~~~~d~~--------------~f~~~~r~dg~~~~~~a~-~ 370 (377) T protein:vir:98 318 ----GKA-IAFVAN-R-YDAF--MATASTIEEYD---QTFAMEDLQ--------------LYLTKNYFYGKAKDNHTA-A 370 (377) T ss_pred ----ccE-EEEEec-c-eeEE--eecceEEEeec---hhhhhcCce--------------EEEEEEEEcCEEeccCcE-E Confidence 000 001111 1 3322 12222211100 000000100 1222222221 11111 1 Q ss_pred EEEeeeeeccCCC Q lcl|NC_018863. 382 VARVPLSKADENG 394 (479) Q Consensus 382 i~rV~~s~~n~~~ 394 (479) +-.|. +| T Consensus 371 vl~i~------~~ 377 (377) T protein:vir:98 371 LLTLA------GG 377 (377) T ss_pred EEEEe------cC Confidence 11111 11 No 98 >protein:vir:7855 Length: 497 # NCBI annotation: gp12 # Family: family:all:585 # MgeID: mge:150 # MgeName: CJW1 # Cross-refs: genbank:acc:NP_817462;genbank:gi:29565891;genbank:GeneID:1259081 Probab=88.07 E-value=0.035 Score=28.59 Aligned_cols=311 Identities=14% Similarity=0.088 Sum_probs=119.9 Q ss_pred Ccccccc-cceeeeec----------CchhHHHHHHHHHHHhhcCcc-------cCcccccCccccchhhhHHHHHHHhh Q lcl|NC_018863. 1 MTELQKE-QKVEARKL----------PAGAEAELAELVSKSFTTGTG-------ITPDTQHDAAALRRELLDDQVKMLAF 62 (479) Q Consensus 1 ~~~~~~~-~~~~~~~~----------~~~~~~~~~e~~~Ksf~ag~~-------~~~~~~~~gaAlr~esld~~i~~l~~ 62 (479) +.+-.++ ..+..... .........| ..+.+..+.. ..-.+-.+|+.|-.+.+..+|..+.. T Consensus 98 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~gg~~vp~~~~~~ii~~~~ 176 (497) T protein:vir:78 98 MNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAAE-LMGAFADGETAPAAIGQNPFGSTGTFAPGILPTFLPGIVEQLF 176 (497) T ss_pred hhHHHHhhhhhhhhhhhhhhhhhhhhhhhhHHHHHH-HHHHHhhhhhhHHHHHhhhcccCcccccccchhhhHHHHHHHH Confidence 0000000 00000000 0000000000 0111111100 00012234666777777778765554 Q ss_pred ccccccchhhhccchhHHHHHHhhhhhccCcccccccccccccccccCcceEEEEEEEEeeeehhhhhhhHhhhcchhhH Q lcl|NC_018863. 63 TNGDFTIYPLINKQQVNSTVAKYAVFNQHGRTGHSRFVREVGVASINDPNIRQKTVQMKFLSDTKQQSLAAGLVNNIADP 142 (479) Q Consensus 63 ~~~~f~~~~~i~k~~~~stv~~y~~~~~~G~~g~~~fv~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lv~~~~dp 142 (479) . ...+.+-+...+..+---.|.+ .-++.+...+++|++..+.+|+.+.+.....+=++.--.+|.-+ |.++ .+. T Consensus 177 ~--~~~i~~l~~~~~~~~~~~~~~~--~~~~~~~a~wv~E~~~~~~s~~~f~~i~~~~~k~a~~~~iS~el-l~d~-~~l 250 (497) T protein:vir:78 177 Y--ELSLADLISSRPVTSPNLSYLT--ESAAHNNAAAVAEAGTYPFSSEEFARVYEQVGKVANALTITDEG-LRDA-PEL 250 (497) T ss_pred h--hhhHHhhccccccCCCceEEEE--EcCCCCcceeeccCcccccccccceeeEeeeeeeEeecHhHHHH-HHhH-HHH Confidence 3 3344555554444432222322 23444567799999999999999999999999999988888764 2333 467 Q ss_pred HHHHHHHHHHHHHHHHHHHHhhcccccCCCCCCcccchhhhHHHhhc-------------------cCC-cEEEccCCCC Q lcl|NC_018863. 143 MTILTEDAISVIAKSIEWAIFYGDAALAAEADNQAGIEFDGLTKLID-------------------EAT-NVIDLKGERL 202 (479) Q Consensus 143 ~~~~~~~ai~~~~~~~e~a~f~Gd~~l~~~~~~~~gleFDGl~~~I~-------------------~~~-NviDarG~~l 202 (479) +....++-...+++.++.++++||-.-.| .||.+.-. ... ++-...+... T Consensus 251 ~~~i~~~l~~~i~~~~d~~~l~G~G~~~p----------~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 320 (497) T protein:vir:78 251 FNFVQGRLLEGIQRKEEVQLLAGGGYPGV----------NGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTNGAFV 320 (497) T ss_pred HHHHHHHHHHHHHHHHHHHhhcCCCcccc----------cccccccccccccccccchhhhhhhhhhhhhhcccccchhh Confidence 78888888899999999999999854221 23322211 000 0000000000 Q ss_pred C---------------------------------HHHhhhhhheeecc-cCceeeeecChHHHhhHHHhhcCceeEEeec Q lcl|NC_018863. 203 D---------------------------------EATLNKAAVIVGKG-YGRATDAFMPIGVQADFTNNLLDRQRVIQPS 248 (479) Q Consensus 203 ~---------------------------------~~~l~~aa~~i~~~-fG~atd~~mp~~vka~f~q~~~~~qrv~~~~ 248 (479) . ...+.++-..+... +-.++-..|++.+-..+...-...-|.+.++ T Consensus 321 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vmn~~~~~~l~~lkd~~G~~i~~~ 400 (497) T protein:vir:78 321 GQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLRLTKDANGQYMGGN 400 (497) T ss_pred hhhHHHHHHHHHhhhhhhhhccchhccccchhhhhhHHHHHHhhhhhhcccCCCeEEEchHHHHHHHHhhcCCCceeccC Confidence 1 11222333333333 3334446688887777765543322333222 Q ss_pred CCC----------ccccCccccceecCceeEEecCCcccCCCc---ccc-CcccCCCCCcccceEEEeecccccCccccc Q lcl|NC_018863. 249 QAG----------GFSTGFSINQFLSTRGAINLHGSTIMENDN---ILV-DRIPEPNAPQAPASVVATVKVNDKGAFRPV 314 (479) Q Consensus 249 n~g----------~~~~G~~V~~~~ss~g~I~L~~s~v~~a~~---~lv-er~~s~~aP~~P~~vta~~~~~~~g~~~~~ 314 (479) ..+ ..-.|.+|-.... |. .+..+.++.. +++ +| .+. ++... .-...-|..+ T Consensus 401 ~~~~~~~~~~~~~~~l~G~pV~~t~~----~~-~~~~~~Gd~~~~~~~i~~r----~~~----~v~~~--~~~~~~f~~n 465 (497) T protein:vir:78 401 FFGNAYGNPVNGGKNIWGVPVVTTPL----IP-LGTILVGHFAPSVIQTARR----EGV----TMQMT--NSNGTDFVDG 465 (497) T ss_pred cccccccccccCCceeeceeeEecCC----CC-CCceEEeecccceEEEEEe----ccc----EEEee--cccchhhhcC Confidence 111 0111222210000 00 0000000000 000 00 000 00000 0000001000 Q ss_pred cccee---eEEEEEEEcCCCCcccccceeeeeecCCCe Q lcl|NC_018863. 315 KDIKT---HSYKVVVHSDDAESLASEAVTAVVANPTDS 349 (479) Q Consensus 315 sd~g~---Y~YkV~a~n~~GES~~S~~VtaT~a~~~~~ 349 (479) . ++. -.+-..+.+..+--..+-. +++ .++ T Consensus 466 ~-v~~r~~~r~~~~v~~p~A~~~l~~~--~~~---~~~ 497 (497) T protein:vir:78 466 K-VTVRAEERLGLLVYRPSAFQLIQLK--KGA---TGS 497 (497) T ss_pred c-EEEEEEEeecceeeccccEEEEEec--CCc---cCC Confidence 0 000 0000111111111000000 000 001 No 99 >protein:vir:101650 Length: 497 # NCBI annotation: gp13 # Family: family:all:585 # MgeID: mge:1515 # MgeName: 244 # Cross-refs: genbank:acc:YP_654768;genbank:gi:109302766;genbank:GeneID:4156084 Probab=88.07 E-value=0.035 Score=28.59 Aligned_cols=311 Identities=14% Similarity=0.088 Sum_probs=119.9 Q ss_pred Ccccccc-cceeeeec----------CchhHHHHHHHHHHHhhcCcc-------cCcccccCccccchhhhHHHHHHHhh Q lcl|NC_018863. 1 MTELQKE-QKVEARKL----------PAGAEAELAELVSKSFTTGTG-------ITPDTQHDAAALRRELLDDQVKMLAF 62 (479) Q Consensus 1 ~~~~~~~-~~~~~~~~----------~~~~~~~~~e~~~Ksf~ag~~-------~~~~~~~~gaAlr~esld~~i~~l~~ 62 (479) +.+-.++ ..+..... .........| ..+.+..+.. ..-.+-.+|+.|-.+.+..+|..+.. T Consensus 98 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~gg~~vp~~~~~~ii~~~~ 176 (497) T protein:vir:10 98 MNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAAE-LMGAFADGETAPAAIGQNPFGSTGTFAPGILPTFLPGIVEQLF 176 (497) T ss_pred hhHHHHhhhhhhhhhhhhhhhhhhhhhhhhHHHHHH-HHHHHhhhhhhHHHHHhhhcccCcccccccchhhhHHHHHHHH Confidence 0000000 00000000 0000000000 0111111100 00012234666777777778765554 Q ss_pred ccccccchhhhccchhHHHHHHhhhhhccCcccccccccccccccccCcceEEEEEEEEeeeehhhhhhhHhhhcchhhH Q lcl|NC_018863. 63 TNGDFTIYPLINKQQVNSTVAKYAVFNQHGRTGHSRFVREVGVASINDPNIRQKTVQMKFLSDTKQQSLAAGLVNNIADP 142 (479) Q Consensus 63 ~~~~f~~~~~i~k~~~~stv~~y~~~~~~G~~g~~~fv~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lv~~~~dp 142 (479) . ...+.+-+...+..+---.|.+ .-++.+...+++|++..+.+|+.+.+.....+=++.--.+|.-+ |.++ .+. T Consensus 177 ~--~~~i~~l~~~~~~~~~~~~~~~--~~~~~~~a~wv~E~~~~~~s~~~f~~i~~~~~k~a~~~~iS~el-l~d~-~~l 250 (497) T protein:vir:10 177 Y--ELSLADLISSRPVTSPNLSYLT--ESAAHNNAAAVAEAGTYPFSSEEFARVYEQVGKVANALTITDEG-LRDA-PEL 250 (497) T ss_pred h--hhhHHhhccccccCCCceEEEE--EcCCCCcceeeccCcccccccccceeeEeeeeeeEeecHhHHHH-HHhH-HHH Confidence 3 3344555554444432222322 23444567799999999999999999999999999988888764 2333 467 Q ss_pred HHHHHHHHHHHHHHHHHHHHhhcccccCCCCCCcccchhhhHHHhhc-------------------cCC-cEEEccCCCC Q lcl|NC_018863. 143 MTILTEDAISVIAKSIEWAIFYGDAALAAEADNQAGIEFDGLTKLID-------------------EAT-NVIDLKGERL 202 (479) Q Consensus 143 ~~~~~~~ai~~~~~~~e~a~f~Gd~~l~~~~~~~~gleFDGl~~~I~-------------------~~~-NviDarG~~l 202 (479) +....++-...+++.++.++++||-.-.| .||.+.-. ... ++-...+... T Consensus 251 ~~~i~~~l~~~i~~~~d~~~l~G~G~~~p----------~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 320 (497) T protein:vir:10 251 FNFVQGRLLEGIQRKEEVQLLAGGGYPGV----------NGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTNGAFV 320 (497) T ss_pred HHHHHHHHHHHHHHHHHHHhhcCCCcccc----------cccccccccccccccccchhhhhhhhhhhhhhcccccchhh Confidence 78888888899999999999999854221 23322211 000 0000000000 Q ss_pred C---------------------------------HHHhhhhhheeecc-cCceeeeecChHHHhhHHHhhcCceeEEeec Q lcl|NC_018863. 203 D---------------------------------EATLNKAAVIVGKG-YGRATDAFMPIGVQADFTNNLLDRQRVIQPS 248 (479) Q Consensus 203 ~---------------------------------~~~l~~aa~~i~~~-fG~atd~~mp~~vka~f~q~~~~~qrv~~~~ 248 (479) . ...+.++-..+... +-.++-..|++.+-..+...-...-|.+.++ T Consensus 321 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vmn~~~~~~l~~lkd~~G~~i~~~ 400 (497) T protein:vir:10 321 GQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLRLTKDANGQYMGGN 400 (497) T ss_pred hhhHHHHHHHHHhhhhhhhhccchhccccchhhhhhHHHHHHhhhhhhcccCCCeEEEchHHHHHHHHhhcCCCceeccC Confidence 1 11222333333333 3334446688887777765543322333222 Q ss_pred CCC----------ccccCccccceecCceeEEecCCcccCCCc---ccc-CcccCCCCCcccceEEEeecccccCccccc Q lcl|NC_018863. 249 QAG----------GFSTGFSINQFLSTRGAINLHGSTIMENDN---ILV-DRIPEPNAPQAPASVVATVKVNDKGAFRPV 314 (479) Q Consensus 249 n~g----------~~~~G~~V~~~~ss~g~I~L~~s~v~~a~~---~lv-er~~s~~aP~~P~~vta~~~~~~~g~~~~~ 314 (479) ..+ ..-.|.+|-.... |. .+..+.++.. +++ +| .+. ++... .-...-|..+ T Consensus 401 ~~~~~~~~~~~~~~~l~G~pV~~t~~----~~-~~~~~~Gd~~~~~~~i~~r----~~~----~v~~~--~~~~~~f~~n 465 (497) T protein:vir:10 401 FFGNAYGNPVNGGKNIWGVPVVTTPL----IP-LGTILVGHFAPSVIQTARR----EGV----TMQMT--NSNGTDFVDG 465 (497) T ss_pred cccccccccccCCceeeceeeEecCC----CC-CCceEEeecccceEEEEEe----ccc----EEEee--cccchhhhcC Confidence 111 0111222210000 00 0000000000 000 00 000 00000 0000001000 Q ss_pred cccee---eEEEEEEEcCCCCcccccceeeeeecCCCe Q lcl|NC_018863. 315 KDIKT---HSYKVVVHSDDAESLASEAVTAVVANPTDS 349 (479) Q Consensus 315 sd~g~---Y~YkV~a~n~~GES~~S~~VtaT~a~~~~~ 349 (479) . ++. -.+-..+.+..+--..+-. +++ .++ T Consensus 466 ~-v~~r~~~r~~~~v~~p~A~~~l~~~--~~~---~~~ 497 (497) T protein:vir:10 466 K-VTVRAEERLGLLVYRPSAFQLIQLK--KGA---TGS 497 (497) T ss_pred c-EEEEEEEeecceeeccccEEEEEec--CCc---cCC Confidence 0 000 0000111111111000000 000 001 No 100 >protein:vir:3870 Length: 400 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:82 # MgeName: A2 # Cross-refs: genbank:acc:NP_680487;swissprot:trembl:q8ltc0;genbank:gi:22296527;interpro:IPR006444;uniprot:Q8LTC0;genbank:GeneID:951713 Probab=87.81 E-value=0.036 Score=28.48 Aligned_cols=283 Identities=11% Similarity=0.052 Sum_probs=132.0 Q ss_pred Ccccccccceeee-ecCchhHHHHH--------------------HHHHHHhhcCcccCcccccCccccchhhhHHHHHH Q lcl|NC_018863. 1 MTELQKEQKVEAR-KLPAGAEAELA--------------------ELVSKSFTTGTGITPDTQHDAAALRRELLDDQVKM 59 (479) Q Consensus 1 ~~~~~~~~~~~~~-~~~~~~~~~~~--------------------e~~~Ksf~ag~~~~~~~~~~gaAlr~esld~~i~~ 59 (479) .+........... ........+.. +.....+++ ..+..+|+.|..+.+..+|.. T Consensus 82 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~gg~~vP~~~~~~ii~ 156 (400) T protein:vir:38 82 KPDHPEEHSYRDALNAYLHTRGRNTDGVNFEKTDVGTFAVLRAVPTDASDAVNA-----GVKAADAASTIPETISNTPQR 156 (400) T ss_pred cccchhhhhHHHHHHHHHhhHHHHHHHHHHHHHHHHHHhhhhhhhHHHHHHHhh-----cccccCCcccccHHHHHHHHH Confidence 0000000000000 00000000000 001111112 123466888999999998866 Q ss_pred HhhccccccchhhhccchhHHHHHHhhhhhccCccccccccccccccc-ccCcceEEEEEEEEeeeehhhhhhhHhhhcc Q lcl|NC_018863. 60 LAFTNGDFTIYPLINKQQVNSTVAKYAVFNQHGRTGHSRFVREVGVAS-INDPNIRQKTVQMKFLSDTKQQSLAAGLVNN 138 (479) Q Consensus 60 l~~~~~~f~~~~~i~k~~~~stv~~y~~~~~~G~~g~~~fv~E~g~~~-~~d~~~~r~~~~~k~l~~~~~vs~~~~lv~~ 138 (479) +..... .+++.+...++.+.--+|.+... ..+...+++|++... .+++.+.+....++-++.-..+|.-+ +.++ T Consensus 157 ~~~~~~--~l~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~E~~~~~~~~~~~f~~i~~~~~k~~~~~~is~el-l~ds 231 (400) T protein:vir:38 157 ELQTVV--DLKPFTNVFQASTQKGTYPTVAN--ATTKMVTVAELEKNPAMAKPEFKPVNWSVETYRQALPVSQES-IDDS 231 (400) T ss_pred HHHhhh--hhhhcceeEeccCcceEEEEEec--CCCccccccccccccccccccceeeEeehhheeeehhhHHHH-Hhhh Confidence 665443 34454544444433233433332 234566888887665 68999999999998888777777631 2355 Q ss_pred hhhHHHHHHHHHHHHHHHHHHHHHhhcccccCCCCCCcccchhhhHHHhhccCCcEEEccCCCCCHHHhhhhhheeeccc Q lcl|NC_018863. 139 IADPMTILTEDAISVIAKSIEWAIFYGDAALAAEADNQAGIEFDGLTKLIDEATNVIDLKGERLDEATLNKAAVIVGKGY 218 (479) Q Consensus 139 ~~dp~~~~~~~ai~~~~~~~e~a~f~Gd~~l~~~~~~~~gleFDGl~~~I~~~~NviDarG~~l~~~~l~~aa~~i~~~f 218 (479) ..|.+....+.....+..+++.++++|.....+.+ ...+|++...+.. .+|... T Consensus 232 ~~~~~~~i~~~l~~~~~~~~~~~i~~~~~~~~~~~----~~~~~~~~~~~~~---~~~~~~------------------- 285 (400) T protein:vir:38 232 AIDLVGLIAQNGQQIKVNTTNGAVATLLKGFTAKT----ISSVDDLKHINNV---DLDPAY------------------- 285 (400) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhhhhccccccccc----cccHHHHHHHHHh---hhhhhh------------------- Confidence 66788888888889999999999999998876543 3557888776642 112110 Q ss_pred CceeeeecChHHHhhHHHhhcCcee-EEeecCCCccccCccccceecCceeEEecCCcccCCCccccCcccCCCCCcccc Q lcl|NC_018863. 219 GRATDAFMPIGVQADFTNNLLDRQR-VIQPSQAGGFSTGFSINQFLSTRGAINLHGSTIMENDNILVDRIPEPNAPQAPA 297 (479) Q Consensus 219 G~atd~~mp~~vka~f~q~~~~~qr-v~~~~n~g~~~~G~~V~~~~ss~g~I~L~~s~v~~a~~~lver~~s~~aP~~P~ 297 (479) ..-..|++.+...+...-...-| +++|+ ..+. .+.++++.+.+.++..+...+... . T Consensus 286 --~a~~v~~~~~~~~l~~lkd~~G~~i~~~~-~~~~------------------~~~~l~G~pv~~~~~~~~~~~g~~-~ 343 (400) T protein:vir:38 286 --SRVIIASQSFYNFLDTVKDGNGRYLLQDS-ILTP------------------SGKSVLGMPIAVVSDDTLGAAGEA-H 343 (400) T ss_pred --CcEEEEcHHHHHHHHHhhccCCCeeeecC-cCCC------------------CccccccceeEEecccccCCCCce-E Confidence 12368899888888765432222 22332 2111 112233333333332211111100 0 Q ss_pred eEEEeecccccCcccccccceeeEEEEEEEcCCCCcccc-c------------ceeeeeecCCCeEEEEEeecC Q lcl|NC_018863. 298 SVVATVKVNDKGAFRPVKDIKTHSYKVVVHSDDAESLAS-E------------AVTAVVANPTDSVSLAVKLQS 358 (479) Q Consensus 298 ~vta~~~~~~~g~~~~~sd~g~Y~YkV~a~n~~GES~~S-~------------~VtaT~a~~~~~V~LtIt~~~ 358 (479) .. .|.++..+....+.+-+... . .....+.....-+.|++++.. T Consensus 344 ~~-----------------~gd~s~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~d~~~~~~~a~~~l~~~~~a 400 (400) T protein:vir:38 344 AF-----------------LGDIKRAILFANRADFMVRWVDDQIYGQFLQAGMRFGVSVADEKAGYFLTYTPKA 400 (400) T ss_pred EE-----------------EEeccccEEEEeecceEEEEecccccceeEEEEEEeccEEecccceEEEEeecCC Confidence 00 01221111111111111100 0 001111112222233444332 No 101 >protein:vir:96442 Length: 418 # NCBI annotation: hypothetical protein # Family: family:all:11266 # MgeID: mge:1616 # MgeName: 119X # Cross-refs: genbank:acc:YP_001218814;genbank:gi:147917331;genbank:GeneID:5142645 Probab=87.81 E-value=0.031 Score=28.85 Aligned_cols=325 Identities=15% Similarity=0.123 Sum_probs=143.5 Q ss_pred Ccccccccc-----eeeeec-Cch-hHHHHHHHHHHHhhcCcccC-cccccCccccchhhhHHHHH--HHhhcc--cccc Q lcl|NC_018863. 1 MTELQKEQK-----VEARKL-PAG-AEAELAELVSKSFTTGTGIT-PDTQHDAAALRRELLDDQVK--MLAFTN--GDFT 68 (479) Q Consensus 1 ~~~~~~~~~-----~~~~~~-~~~-~~~~~~e~~~Ksf~ag~~~~-~~~~~~gaAlr~esld~~i~--~l~~~~--~~f~ 68 (479) +-|+--... .-++.- +.. .+..||+ |.++.-..+. .+...++.-|..|+=|- ++ +|.|.+ .+.- T Consensus 27 ~~~~PN~~~p~l~~i~~g~~~~~~~~t~~w~~---d~l~~~~~~~ta~~~a~~T~i~V~~~~~-f~~~~l~~~~~~~Evi 102 (418) T protein:vir:96 27 LRRVPNGSAPLLAMTSVVGSTTAKASTHGYFS---KTMVFASAVVTAEALADATVLTVENSDG-LTKGMIFYNEATGENM 102 (418) T ss_pred hhhcCCcccchhhhhcccCccccceeEEEEEe---eEeeeeeEEEEEEEecCceEEEecCCcc-cccccEEEEecCCeEE Confidence 111110000 000000 000 1222322 3333222111 11112222233332221 10 111211 1111 Q ss_pred chhhhccchhHHHHHHhhhhhccCcc--------ccccc----ccccccccccCcceE--EEEEEEEeeeehhhhhhhHh Q lcl|NC_018863. 69 IYPLINKQQVNSTVAKYAVFNQHGRT--------GHSRF----VREVGVASINDPNIR--QKTVQMKFLSDTKQQSLAAG 134 (479) Q Consensus 69 ~~~~i~k~~~~stv~~y~~~~~~G~~--------g~~~f----v~E~g~~~~~d~~~~--r~~~~~k~l~~~~~vs~~~~ 134 (479) -..+|+- .+ -++....|++ ..-.+ +-||.+...+. .+. ++...+--+.+..+||.-++ T Consensus 103 rVtsVng----~~---lTV~RG~~~t~aa~iaag~~~~~ig~~~eEGsd~~ta~-~~k~~~vsN~tQIf~e~vsVSgTAq 174 (418) T protein:vir:96 103 RLELVNG----LN---LTVKRQTGRIAAAIIAANTKLIVIGTAFEEGSQRPTAR-SIQPVYVPNFTQIFRNAWALTDTAR 174 (418) T ss_pred EEEEEeC----CE---EEEEEccCCeeeeeeecCceEEEeecCcccccccCCcc-eecceeccchhheehhhhhhhhhhh Confidence 1112210 00 1112222221 10112 24776665543 111 22222333456667776654 Q ss_pred h-h--cchhhHHHHHHHHHHHHHHHHHHHHHhhcccccCCCCCCcc-c--chhhhHHHhhccCCcEEEccCC-CCCHHHh Q lcl|NC_018863. 135 L-V--NNIADPMTILTEDAISVIAKSIEWAIFYGDAALAAEADNQA-G--IEFDGLTKLIDEATNVIDLKGE-RLDEATL 207 (479) Q Consensus 135 l-v--~~~~dp~~~~~~~ai~~~~~~~e~a~f~Gd~~l~~~~~~~~-g--leFDGl~~~I~~~~NviDarG~-~l~~~~l 207 (479) . + -+++|....+ .|+|.-.+..+|.++++|.+.....+...- . =.-|||..-+ ++||+++.+. .++++.| T Consensus 175 A~v~qaGvsn~~~~e-~d~l~~~kv~iE~ali~g~~~~~~~ng~p~~~t~R~m~gI~~f~--~~Nvi~ag~~~~~t~d~L 251 (418) T protein:vir:96 175 ASYAEAGYSNITESR-RDCMDFHATEQETAIFFGQAFMGTYNGQPLHTTQGIVDAIRQYA--PDNVNAMPNPTAVTYDDV 251 (418) T ss_pred hhhhhcCcchhHHHH-HHHHHHHHHHHHHhhhccccccCCCCCcccccccchhHHHHhhc--cccccccCCCCcCCHHHH Confidence 4 2 2566776666 699999999999999999997742211000 0 0235665555 5899999986 6899999 Q ss_pred hhhhheeec---ccCceee-----eecChHHHhhHHHhhcCceeEEeecCCCccccCccccceecCceeEEecCCcccCC Q lcl|NC_018863. 208 NKAAVIVGK---GYGRATD-----AFMPIGVQADFTNNLLDRQRVIQPSQAGGFSTGFSINQFLSTRGAINLHGSTIMEN 279 (479) Q Consensus 208 ~~aa~~i~~---~fG~atd-----~~mp~~vka~f~q~~~~~qrv~~~~n~g~~~~G~~V~~~~ss~g~I~L~~s~v~~a 279 (479) ..+...+-+ +-|..++ ++++...|.++...+ ..-|..++ ....|..|+.|.|-.|-|++--+.-+.+ T Consensus 252 ~~~~~~a~~~g~n~G~~~~~~~y~~~V~a~~k~~I~k~~-~~I~~~~~----en~~G~vv~~~~Td~G~v~ii~n~~~pa 326 (418) T protein:vir:96 252 VDATIDAFKWSVNVGDNTQRVMFCDTVGMRTMQDIGRFF-GEVTVTQR----ETSYGMVFTEWKFFKGRLIIKEHPLFSA 326 (418) T ss_pred HHHHHHHHhhcCCCCCcccceEEEEEeChHHHHHHhhhh-ceeEeccc----cceeceEEEEEEeeccEEEEEecCCCCc Confidence 887655544 4577766 566999999998865 33344433 3566999999999999999877776666 Q ss_pred Ccc------ccCcccCCCC-----Ccccc------------eEEEeecccccCcccccccceeeEEEEEEEcCCCCcccc Q lcl|NC_018863. 280 DNI------LVDRIPEPNA-----PQAPA------------SVVATVKVNDKGAFRPVKDIKTHSYKVVVHSDDAESLAS 336 (479) Q Consensus 280 ~~~------lver~~s~~a-----P~~P~------------~vta~~~~~~~g~~~~~sd~g~Y~YkV~a~n~~GES~~S 336 (479) +.+ +++..--.-. +..+- .+.+..+...+... +.= .=.|.+...|..+- T Consensus 327 d~I~~g~mlVvD~~~vkL~yL~~R~~~~E~l~k~G~~~~~~~~~~~~~~~~D~~~--G~l--~~Eltle~~N~~a~---- 398 (418) T protein:vir:96 327 IGISPGFAVVVDVPAVKLAYMDGRNAKVENYGQGGGENKSGATDYSYGHGVDAQG--GSL--TSEWALELLNPQGC---- 398 (418) T ss_pred cccCcceEEEEecCceEEEEecCCCccchhcccCCCccccccccccccccccccc--CEE--EEEEEEEeeccccc---- Confidence 664 2232110000 10000 00000000001100 000 11344444554332 Q ss_pred cceeeeeecCC---CeEEEEEeecCCccccceEEEEEeccCCC Q lcl|NC_018863. 337 EAVTAVVANPT---DSVSLAVKLQSLYQAKPQFISVYRQGNET 376 (479) Q Consensus 337 ~~VtaT~a~~~---~~V~LtIt~~~~~~~~~~y~~IYR~t~~~ 376 (479) +.+++.+ ..|.|+.+ .+ T Consensus 399 ----a~itgl~~~~~~~~~~~~-------------------~~ 418 (418) T protein:vir:96 399 ----AVITGLQKAKERVYLTAP-------------------AP 418 (418) T ss_pred ----EEeecccccccccccCCC-------------------CC Confidence 1122211 12222211 11 No 102 >protein:vir:102873 Length: 392 # NCBI annotation: major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1492 # MgeName: Cherry # Cross-refs: genbank:acc:YP_338137;genbank:gi:77020198;genbank:GeneID:3703782 Probab=86.68 E-value=0.044 Score=28.03 Aligned_cols=315 Identities=11% Similarity=0.040 Sum_probs=135.0 Q ss_pred Cccccccc---ceeeeecCchhHHHHHHHHHHHhhcCcc----------------cCcccccCccccchhhhHHHHHHHh Q lcl|NC_018863. 1 MTELQKEQ---KVEARKLPAGAEAELAELVSKSFTTGTG----------------ITPDTQHDAAALRRELLDDQVKMLA 61 (479) Q Consensus 1 ~~~~~~~~---~~~~~~~~~~~~~~~~e~~~Ksf~ag~~----------------~~~~~~~~gaAlr~esld~~i~~l~ 61 (479) +.+...+. .....+-+.....+.-+++.|.+.-+.. .+..+-.+|+.|-.+.+..+|..+. T Consensus 51 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~t~~~gg~~vP~~~~~~ii~~~ 130 (392) T protein:vir:10 51 LDEAETEERNNGREVETRNVDGEMEYRDVFMKALRNKPLNAEEREFLEDDLEQRAMSGLTGEDGGLVIPQDIQTQINELA 130 (392) T ss_pred HHHHHHHHhhccccccccCccchHHHHHHHHHHHhcccccHHHHHHHhhhhhhhhccccccCCCceecchhHHHHHHHHH Confidence 00000000 0000111111111222334444432221 1112335678888888888886665 Q ss_pred hccccccchhhhccchhHHHHHHhhhhhccCccccccccccccccc-ccCcceEEEEEEEEeeeehhhhhhhHhhhcchh Q lcl|NC_018863. 62 FTNGDFTIYPLINKQQVNSTVAKYAVFNQHGRTGHSRFVREVGVAS-INDPNIRQKTVQMKFLSDTKQQSLAAGLVNNIA 140 (479) Q Consensus 62 ~~~~~f~~~~~i~k~~~~stv~~y~~~~~~G~~g~~~fv~E~g~~~-~~d~~~~r~~~~~k~l~~~~~vs~~~~lv~~~~ 140 (479) .... .+++.+....+.+.-.+|..... .......+++|++... ...+.+.......+-++--..+|.-+ +.++.- T Consensus 131 ~~~s--~l~~~~~~~~~~~~~~~~~~~~~-~~~~~a~~v~E~~~~~~~~~~~~~~v~l~~~k~~~~~~iS~el-l~ds~~ 206 (392) T protein:vir:10 131 RSFD--ALEQYVTVEPVRTRSGSRVLEKN-SDMIPFAEITEMGEIPETDNPKFSNVQYAVKDRAGILPLSRSL-LQDSDQ 206 (392) T ss_pred Hhhh--hhhhhceeeeccCCceeEEEEee-cCCccceeecccccccccccccceeEEeeeeeEEEeehhhHHH-HhhhHH Confidence 5543 34444554555544334433322 2233566999999876 56799999999999998888888754 234556 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHhhcccccCCCCCCcccchhhhHHHhhccCCcEEEccCCCCCHHHhhhhhheeecccCc Q lcl|NC_018863. 141 DPMTILTEDAISVIAKSIEWAIFYGDAALAAEADNQAGIEFDGLTKLIDEATNVIDLKGERLDEATLNKAAVIVGKGYGR 220 (479) Q Consensus 141 dp~~~~~~~ai~~~~~~~e~a~f~Gd~~l~~~~~~~~gleFDGl~~~I~~~~NviDarG~~l~~~~l~~aa~~i~~~fG~ 220 (479) |.+....+.--..+++.++.+++.|+....+. ..+-+|.++..|.. ....+|-. T Consensus 207 ~l~~~i~~~l~~~i~~~~d~~~~~g~g~~~~~----~~~~~d~i~~~~~~----------------------~l~~~~~~ 260 (392) T protein:vir:10 207 NILKYVTKWLGKKSKVTRNVLILGVIEKLTKQ----AIKSLDDIKDVLNV----------------------KLDPAISP 260 (392) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhhcccccccc----CccCHHHHHHHHHH----------------------hhhhhhcc Confidence 77888888889999999999999999886653 24556776665531 01112222 Q ss_pred eeeeecChHHHhhHHHhhcCceeEEeecCCCccc----cCccccceecCceeEEecCCcccCCCccccCcccCC---CCC Q lcl|NC_018863. 221 ATDAFMPIGVQADFTNNLLDRQRVIQPSQAGGFS----TGFSINQFLSTRGAINLHGSTIMENDNILVDRIPEP---NAP 293 (479) Q Consensus 221 atd~~mp~~vka~f~q~~~~~qrv~~~~n~g~~~----~G~~V~~~~ss~g~I~L~~s~v~~a~~~lver~~s~---~aP 293 (479) ....+|++.+.+.+...-...-|.+...+..+.. .|..+--..+.. .--......+...++ ..-++. ..- T Consensus 261 ~a~~vm~~~~~~~L~~lkd~~G~~l~~~~~~~~~~~tllG~~~v~~~~~~--~~~~~~~~~~~~~~~-~gdfs~~~~i~~ 337 (392) T protein:vir:10 261 NAILLTNQDGFNYLDKLKDKDGKYILQSDPTQKNKKLFAGTNPVVVVSNR--FLKSKGTTAKKAPLI-IGDLKEAIVLFK 337 (392) T ss_pred CCEEEEcHHHHHHHHHhhccCCCeEeecCccCCccccccCcccEEEeccc--ccCCCcccCCceEEE-EEehhceEEEEe Confidence 2337888888888876543222333222221111 111110000000 000000000000000 000000 000 Q ss_pred cccceEEEeecccccCcccccccceeeEEEEEEEcCCCCcccccceeeeeecCCCeEEEEEeecCCccccceE Q lcl|NC_018863. 294 QAPASVVATVKVNDKGAFRPVKDIKTHSYKVVVHSDDAESLASEAVTAVVANPTDSVSLAVKLQSLYQAKPQF 366 (479) Q Consensus 294 ~~P~~vta~~~~~~~g~~~~~sd~g~Y~YkV~a~n~~GES~~S~~VtaT~a~~~~~V~LtIt~~~~~~~~~~y 366 (479) -...... . .. ..+..|. . +...|++.. +.|-- +.....-+.|++++.... .+|+. T Consensus 338 ~~~~~~~-~-~~-~~~~~f~-~--~~~~~r~~~--r~d~~---------v~~~~a~~~l~~~~~a~~-~~~~~ 392 (392) T protein:vir:10 338 REDMELA-S-TD-VGGKAFT-R--NTLDLRAIQ--RDDVQ---------MWDNEAAVYGEIDLSAPV-EQPQG 392 (392) T ss_pred ecceEEE-E-ec-cccchhh-c--CceEEEEEE--eeccE---------EecccceEEEEecccccc-cCCCC Confidence 0000000 0 00 0000010 0 011122221 11111 111222222333322111 11122 No 103 >protein:vir:102082 Length: 392 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:1503 # MgeName: Fah # Cross-refs: genbank:acc:YP_512315;genbank:gi:89152484;genbank:GeneID:3953075 Probab=86.68 E-value=0.044 Score=28.03 Aligned_cols=315 Identities=11% Similarity=0.040 Sum_probs=135.0 Q ss_pred Cccccccc---ceeeeecCchhHHHHHHHHHHHhhcCcc----------------cCcccccCccccchhhhHHHHHHHh Q lcl|NC_018863. 1 MTELQKEQ---KVEARKLPAGAEAELAELVSKSFTTGTG----------------ITPDTQHDAAALRRELLDDQVKMLA 61 (479) Q Consensus 1 ~~~~~~~~---~~~~~~~~~~~~~~~~e~~~Ksf~ag~~----------------~~~~~~~~gaAlr~esld~~i~~l~ 61 (479) +.+...+. .....+-+.....+.-+++.|.+.-+.. .+..+-.+|+.|-.+.+..+|..+. T Consensus 51 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~t~~~gg~~vP~~~~~~ii~~~ 130 (392) T protein:vir:10 51 LDEAETEERNNGREVETRNVDGEMEYRDVFMKALRNKPLNAEEREFLEDDLEQRAMSGLTGEDGGLVIPQDIQTQINELA 130 (392) T ss_pred HHHHHHHHhhccccccccCccchHHHHHHHHHHHhcccccHHHHHHHhhhhhhhhccccccCCCceecchhHHHHHHHHH Confidence 00000000 0000111111111222334444432221 1112335678888888888886665 Q ss_pred hccccccchhhhccchhHHHHHHhhhhhccCccccccccccccccc-ccCcceEEEEEEEEeeeehhhhhhhHhhhcchh Q lcl|NC_018863. 62 FTNGDFTIYPLINKQQVNSTVAKYAVFNQHGRTGHSRFVREVGVAS-INDPNIRQKTVQMKFLSDTKQQSLAAGLVNNIA 140 (479) Q Consensus 62 ~~~~~f~~~~~i~k~~~~stv~~y~~~~~~G~~g~~~fv~E~g~~~-~~d~~~~r~~~~~k~l~~~~~vs~~~~lv~~~~ 140 (479) .... .+++.+....+.+.-.+|..... .......+++|++... ...+.+.......+-++--..+|.-+ +.++.- T Consensus 131 ~~~s--~l~~~~~~~~~~~~~~~~~~~~~-~~~~~a~~v~E~~~~~~~~~~~~~~v~l~~~k~~~~~~iS~el-l~ds~~ 206 (392) T protein:vir:10 131 RSFD--ALEQYVTVEPVRTRSGSRVLEKN-SDMIPFAEITEMGEIPETDNPKFSNVQYAVKDRAGILPLSRSL-LQDSDQ 206 (392) T ss_pred Hhhh--hhhhhceeeeccCCceeEEEEee-cCCccceeecccccccccccccceeEEeeeeeEEEeehhhHHH-HhhhHH Confidence 5543 34444554555544334433322 2233566999999876 56799999999999998888888754 234556 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHhhcccccCCCCCCcccchhhhHHHhhccCCcEEEccCCCCCHHHhhhhhheeecccCc Q lcl|NC_018863. 141 DPMTILTEDAISVIAKSIEWAIFYGDAALAAEADNQAGIEFDGLTKLIDEATNVIDLKGERLDEATLNKAAVIVGKGYGR 220 (479) Q Consensus 141 dp~~~~~~~ai~~~~~~~e~a~f~Gd~~l~~~~~~~~gleFDGl~~~I~~~~NviDarG~~l~~~~l~~aa~~i~~~fG~ 220 (479) |.+....+.--..+++.++.+++.|+....+. ..+-+|.++..|.. ....+|-. T Consensus 207 ~l~~~i~~~l~~~i~~~~d~~~~~g~g~~~~~----~~~~~d~i~~~~~~----------------------~l~~~~~~ 260 (392) T protein:vir:10 207 NILKYVTKWLGKKSKVTRNVLILGVIEKLTKQ----AIKSLDDIKDVLNV----------------------KLDPAISP 260 (392) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhhcccccccc----CccCHHHHHHHHHH----------------------hhhhhhcc Confidence 77888888889999999999999999886653 24556776665531 01112222 Q ss_pred eeeeecChHHHhhHHHhhcCceeEEeecCCCccc----cCccccceecCceeEEecCCcccCCCccccCcccCC---CCC Q lcl|NC_018863. 221 ATDAFMPIGVQADFTNNLLDRQRVIQPSQAGGFS----TGFSINQFLSTRGAINLHGSTIMENDNILVDRIPEP---NAP 293 (479) Q Consensus 221 atd~~mp~~vka~f~q~~~~~qrv~~~~n~g~~~----~G~~V~~~~ss~g~I~L~~s~v~~a~~~lver~~s~---~aP 293 (479) ....+|++.+.+.+...-...-|.+...+..+.. .|..+--..+.. .--......+...++ ..-++. ..- T Consensus 261 ~a~~vm~~~~~~~L~~lkd~~G~~l~~~~~~~~~~~tllG~~~v~~~~~~--~~~~~~~~~~~~~~~-~gdfs~~~~i~~ 337 (392) T protein:vir:10 261 NAILLTNQDGFNYLDKLKDKDGKYILQSDPTQKNKKLFAGTNPVVVVSNR--FLKSKGTTAKKAPLI-IGDLKEAIVLFK 337 (392) T ss_pred CCEEEEcHHHHHHHHHhhccCCCeEeecCccCCccccccCcccEEEeccc--ccCCCcccCCceEEE-EEehhceEEEEe Confidence 2337888888888876543222333222221111 111110000000 000000000000000 000000 000 Q ss_pred cccceEEEeecccccCcccccccceeeEEEEEEEcCCCCcccccceeeeeecCCCeEEEEEeecCCccccceE Q lcl|NC_018863. 294 QAPASVVATVKVNDKGAFRPVKDIKTHSYKVVVHSDDAESLASEAVTAVVANPTDSVSLAVKLQSLYQAKPQF 366 (479) Q Consensus 294 ~~P~~vta~~~~~~~g~~~~~sd~g~Y~YkV~a~n~~GES~~S~~VtaT~a~~~~~V~LtIt~~~~~~~~~~y 366 (479) -...... . .. ..+..|. . +...|++.. +.|-- +.....-+.|++++.... .+|+. T Consensus 338 ~~~~~~~-~-~~-~~~~~f~-~--~~~~~r~~~--r~d~~---------v~~~~a~~~l~~~~~a~~-~~~~~ 392 (392) T protein:vir:10 338 REDMELA-S-TD-VGGKAFT-R--NTLDLRAIQ--RDDVQ---------MWDNEAAVYGEIDLSAPV-EQPQG 392 (392) T ss_pred ecceEEE-E-ec-cccchhh-c--CceEEEEEE--eeccE---------EecccceEEEEecccccc-cCCCC Confidence 0000000 0 00 0000010 0 011122221 11111 111222222333322111 11122 No 104 >protein:vir:107593 Length: 392 # NCBI annotation: major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1491 # MgeName: Gamma # Cross-refs: genbank:acc:YP_338188;genbank:gi:77020144;genbank:GeneID:3703724 Probab=86.68 E-value=0.044 Score=28.03 Aligned_cols=315 Identities=11% Similarity=0.040 Sum_probs=135.0 Q ss_pred Cccccccc---ceeeeecCchhHHHHHHHHHHHhhcCcc----------------cCcccccCccccchhhhHHHHHHHh Q lcl|NC_018863. 1 MTELQKEQ---KVEARKLPAGAEAELAELVSKSFTTGTG----------------ITPDTQHDAAALRRELLDDQVKMLA 61 (479) Q Consensus 1 ~~~~~~~~---~~~~~~~~~~~~~~~~e~~~Ksf~ag~~----------------~~~~~~~~gaAlr~esld~~i~~l~ 61 (479) +.+...+. .....+-+.....+.-+++.|.+.-+.. .+..+-.+|+.|-.+.+..+|..+. T Consensus 51 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~t~~~gg~~vP~~~~~~ii~~~ 130 (392) T protein:vir:10 51 LDEAETEERNNGREVETRNVDGEMEYRDVFMKALRNKPLNAEEREFLEDDLEQRAMSGLTGEDGGLVIPQDIQTQINELA 130 (392) T ss_pred HHHHHHHHhhccccccccCccchHHHHHHHHHHHhcccccHHHHHHHhhhhhhhhccccccCCCceecchhHHHHHHHHH Confidence 00000000 0000111111111222334444432221 1112335678888888888886665 Q ss_pred hccccccchhhhccchhHHHHHHhhhhhccCccccccccccccccc-ccCcceEEEEEEEEeeeehhhhhhhHhhhcchh Q lcl|NC_018863. 62 FTNGDFTIYPLINKQQVNSTVAKYAVFNQHGRTGHSRFVREVGVAS-INDPNIRQKTVQMKFLSDTKQQSLAAGLVNNIA 140 (479) Q Consensus 62 ~~~~~f~~~~~i~k~~~~stv~~y~~~~~~G~~g~~~fv~E~g~~~-~~d~~~~r~~~~~k~l~~~~~vs~~~~lv~~~~ 140 (479) .... .+++.+....+.+.-.+|..... .......+++|++... ...+.+.......+-++--..+|.-+ +.++.- T Consensus 131 ~~~s--~l~~~~~~~~~~~~~~~~~~~~~-~~~~~a~~v~E~~~~~~~~~~~~~~v~l~~~k~~~~~~iS~el-l~ds~~ 206 (392) T protein:vir:10 131 RSFD--ALEQYVTVEPVRTRSGSRVLEKN-SDMIPFAEITEMGEIPETDNPKFSNVQYAVKDRAGILPLSRSL-LQDSDQ 206 (392) T ss_pred Hhhh--hhhhhceeeeccCCceeEEEEee-cCCccceeecccccccccccccceeEEeeeeeEEEeehhhHHH-HhhhHH Confidence 5543 34444554555544334433322 2233566999999876 56799999999999998888888754 234556 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHhhcccccCCCCCCcccchhhhHHHhhccCCcEEEccCCCCCHHHhhhhhheeecccCc Q lcl|NC_018863. 141 DPMTILTEDAISVIAKSIEWAIFYGDAALAAEADNQAGIEFDGLTKLIDEATNVIDLKGERLDEATLNKAAVIVGKGYGR 220 (479) Q Consensus 141 dp~~~~~~~ai~~~~~~~e~a~f~Gd~~l~~~~~~~~gleFDGl~~~I~~~~NviDarG~~l~~~~l~~aa~~i~~~fG~ 220 (479) |.+....+.--..+++.++.+++.|+....+. ..+-+|.++..|.. ....+|-. T Consensus 207 ~l~~~i~~~l~~~i~~~~d~~~~~g~g~~~~~----~~~~~d~i~~~~~~----------------------~l~~~~~~ 260 (392) T protein:vir:10 207 NILKYVTKWLGKKSKVTRNVLILGVIEKLTKQ----AIKSLDDIKDVLNV----------------------KLDPAISP 260 (392) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhhcccccccc----CccCHHHHHHHHHH----------------------hhhhhhcc Confidence 77888888889999999999999999886653 24556776665531 01112222 Q ss_pred eeeeecChHHHhhHHHhhcCceeEEeecCCCccc----cCccccceecCceeEEecCCcccCCCccccCcccCC---CCC Q lcl|NC_018863. 221 ATDAFMPIGVQADFTNNLLDRQRVIQPSQAGGFS----TGFSINQFLSTRGAINLHGSTIMENDNILVDRIPEP---NAP 293 (479) Q Consensus 221 atd~~mp~~vka~f~q~~~~~qrv~~~~n~g~~~----~G~~V~~~~ss~g~I~L~~s~v~~a~~~lver~~s~---~aP 293 (479) ....+|++.+.+.+...-...-|.+...+..+.. .|..+--..+.. .--......+...++ ..-++. ..- T Consensus 261 ~a~~vm~~~~~~~L~~lkd~~G~~l~~~~~~~~~~~tllG~~~v~~~~~~--~~~~~~~~~~~~~~~-~gdfs~~~~i~~ 337 (392) T protein:vir:10 261 NAILLTNQDGFNYLDKLKDKDGKYILQSDPTQKNKKLFAGTNPVVVVSNR--FLKSKGTTAKKAPLI-IGDLKEAIVLFK 337 (392) T ss_pred CCEEEEcHHHHHHHHHhhccCCCeEeecCccCCccccccCcccEEEeccc--ccCCCcccCCceEEE-EEehhceEEEEe Confidence 2337888888888876543222333222221111 111110000000 000000000000000 000000 000 Q ss_pred cccceEEEeecccccCcccccccceeeEEEEEEEcCCCCcccccceeeeeecCCCeEEEEEeecCCccccceE Q lcl|NC_018863. 294 QAPASVVATVKVNDKGAFRPVKDIKTHSYKVVVHSDDAESLASEAVTAVVANPTDSVSLAVKLQSLYQAKPQF 366 (479) Q Consensus 294 ~~P~~vta~~~~~~~g~~~~~sd~g~Y~YkV~a~n~~GES~~S~~VtaT~a~~~~~V~LtIt~~~~~~~~~~y 366 (479) -...... . .. ..+..|. . +...|++.. +.|-- +.....-+.|++++.... .+|+. T Consensus 338 ~~~~~~~-~-~~-~~~~~f~-~--~~~~~r~~~--r~d~~---------v~~~~a~~~l~~~~~a~~-~~~~~ 392 (392) T protein:vir:10 338 REDMELA-S-TD-VGGKAFT-R--NTLDLRAIQ--RDDVQ---------MWDNEAAVYGEIDLSAPV-EQPQG 392 (392) T ss_pred ecceEEE-E-ec-cccchhh-c--CceEEEEEE--eeccE---------EecccceEEEEecccccc-cCCCC Confidence 0000000 0 00 0000010 0 011122221 11111 111222222333322111 11122 No 105 >protein:vir:105004 Length: 392 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:1490 # MgeName: W Beta # Cross-refs: genbank:acc:YP_459969;genbank:gi:85701384;genbank:GeneID:3882145 Probab=86.68 E-value=0.044 Score=28.03 Aligned_cols=315 Identities=11% Similarity=0.040 Sum_probs=135.0 Q ss_pred Cccccccc---ceeeeecCchhHHHHHHHHHHHhhcCcc----------------cCcccccCccccchhhhHHHHHHHh Q lcl|NC_018863. 1 MTELQKEQ---KVEARKLPAGAEAELAELVSKSFTTGTG----------------ITPDTQHDAAALRRELLDDQVKMLA 61 (479) Q Consensus 1 ~~~~~~~~---~~~~~~~~~~~~~~~~e~~~Ksf~ag~~----------------~~~~~~~~gaAlr~esld~~i~~l~ 61 (479) +.+...+. .....+-+.....+.-+++.|.+.-+.. .+..+-.+|+.|-.+.+..+|..+. T Consensus 51 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~t~~~gg~~vP~~~~~~ii~~~ 130 (392) T protein:vir:10 51 LDEAETEERNNGREVETRNVDGEMEYRDVFMKALRNKPLNAEEREFLEDDLEQRAMSGLTGEDGGLVIPQDIQTQINELA 130 (392) T ss_pred HHHHHHHHhhccccccccCccchHHHHHHHHHHHhcccccHHHHHHHhhhhhhhhccccccCCCceecchhHHHHHHHHH Confidence 00000000 0000111111111222334444432221 1112335678888888888886665 Q ss_pred hccccccchhhhccchhHHHHHHhhhhhccCccccccccccccccc-ccCcceEEEEEEEEeeeehhhhhhhHhhhcchh Q lcl|NC_018863. 62 FTNGDFTIYPLINKQQVNSTVAKYAVFNQHGRTGHSRFVREVGVAS-INDPNIRQKTVQMKFLSDTKQQSLAAGLVNNIA 140 (479) Q Consensus 62 ~~~~~f~~~~~i~k~~~~stv~~y~~~~~~G~~g~~~fv~E~g~~~-~~d~~~~r~~~~~k~l~~~~~vs~~~~lv~~~~ 140 (479) .... .+++.+....+.+.-.+|..... .......+++|++... ...+.+.......+-++--..+|.-+ +.++.- T Consensus 131 ~~~s--~l~~~~~~~~~~~~~~~~~~~~~-~~~~~a~~v~E~~~~~~~~~~~~~~v~l~~~k~~~~~~iS~el-l~ds~~ 206 (392) T protein:vir:10 131 RSFD--ALEQYVTVEPVRTRSGSRVLEKN-SDMIPFAEITEMGEIPETDNPKFSNVQYAVKDRAGILPLSRSL-LQDSDQ 206 (392) T ss_pred Hhhh--hhhhhceeeeccCCceeEEEEee-cCCccceeecccccccccccccceeEEeeeeeEEEeehhhHHH-HhhhHH Confidence 5543 34444554555544334433322 2233566999999876 56799999999999998888888754 234556 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHhhcccccCCCCCCcccchhhhHHHhhccCCcEEEccCCCCCHHHhhhhhheeecccCc Q lcl|NC_018863. 141 DPMTILTEDAISVIAKSIEWAIFYGDAALAAEADNQAGIEFDGLTKLIDEATNVIDLKGERLDEATLNKAAVIVGKGYGR 220 (479) Q Consensus 141 dp~~~~~~~ai~~~~~~~e~a~f~Gd~~l~~~~~~~~gleFDGl~~~I~~~~NviDarG~~l~~~~l~~aa~~i~~~fG~ 220 (479) |.+....+.--..+++.++.+++.|+....+. ..+-+|.++..|.. ....+|-. T Consensus 207 ~l~~~i~~~l~~~i~~~~d~~~~~g~g~~~~~----~~~~~d~i~~~~~~----------------------~l~~~~~~ 260 (392) T protein:vir:10 207 NILKYVTKWLGKKSKVTRNVLILGVIEKLTKQ----AIKSLDDIKDVLNV----------------------KLDPAISP 260 (392) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhhcccccccc----CccCHHHHHHHHHH----------------------hhhhhhcc Confidence 77888888889999999999999999886653 24556776665531 01112222 Q ss_pred eeeeecChHHHhhHHHhhcCceeEEeecCCCccc----cCccccceecCceeEEecCCcccCCCccccCcccCC---CCC Q lcl|NC_018863. 221 ATDAFMPIGVQADFTNNLLDRQRVIQPSQAGGFS----TGFSINQFLSTRGAINLHGSTIMENDNILVDRIPEP---NAP 293 (479) Q Consensus 221 atd~~mp~~vka~f~q~~~~~qrv~~~~n~g~~~----~G~~V~~~~ss~g~I~L~~s~v~~a~~~lver~~s~---~aP 293 (479) ....+|++.+.+.+...-...-|.+...+..+.. .|..+--..+.. .--......+...++ ..-++. ..- T Consensus 261 ~a~~vm~~~~~~~L~~lkd~~G~~l~~~~~~~~~~~tllG~~~v~~~~~~--~~~~~~~~~~~~~~~-~gdfs~~~~i~~ 337 (392) T protein:vir:10 261 NAILLTNQDGFNYLDKLKDKDGKYILQSDPTQKNKKLFAGTNPVVVVSNR--FLKSKGTTAKKAPLI-IGDLKEAIVLFK 337 (392) T ss_pred CCEEEEcHHHHHHHHHhhccCCCeEeecCccCCccccccCcccEEEeccc--ccCCCcccCCceEEE-EEehhceEEEEe Confidence 2337888888888876543222333222221111 111110000000 000000000000000 000000 000 Q ss_pred cccceEEEeecccccCcccccccceeeEEEEEEEcCCCCcccccceeeeeecCCCeEEEEEeecCCccccceE Q lcl|NC_018863. 294 QAPASVVATVKVNDKGAFRPVKDIKTHSYKVVVHSDDAESLASEAVTAVVANPTDSVSLAVKLQSLYQAKPQF 366 (479) Q Consensus 294 ~~P~~vta~~~~~~~g~~~~~sd~g~Y~YkV~a~n~~GES~~S~~VtaT~a~~~~~V~LtIt~~~~~~~~~~y 366 (479) -...... . .. ..+..|. . +...|++.. +.|-- +.....-+.|++++.... .+|+. T Consensus 338 ~~~~~~~-~-~~-~~~~~f~-~--~~~~~r~~~--r~d~~---------v~~~~a~~~l~~~~~a~~-~~~~~ 392 (392) T protein:vir:10 338 REDMELA-S-TD-VGGKAFT-R--NTLDLRAIQ--RDDVQ---------MWDNEAAVYGEIDLSAPV-EQPQG 392 (392) T ss_pred ecceEEE-E-ec-cccchhh-c--CceEEEEEE--eeccE---------EecccceEEEEecccccc-cCCCC Confidence 0000000 0 00 0000010 0 011122221 11111 111222222333322111 11122 No 106 >protein:vir:485 Length: 407 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:11 # MgeName: P27 # Cross-refs: genbank:acc:NP_543092;swissprot:trembl:q8w627;genbank:gi:18249904;uniprot:Q8W627;genbank:GeneID:929693 Probab=86.16 E-value=0.048 Score=27.84 Aligned_cols=320 Identities=12% Similarity=0.070 Sum_probs=141.9 Q ss_pred Ccccc-----------ccc-ceeeeecCchhHHHHHHHHHHHhhcCccc----------CcccccCccccchhhhHHHHH Q lcl|NC_018863. 1 MTELQ-----------KEQ-KVEARKLPAGAEAELAELVSKSFTTGTGI----------TPDTQHDAAALRRELLDDQVK 58 (479) Q Consensus 1 ~~~~~-----------~~~-~~~~~~~~~~~~~~~~e~~~Ksf~ag~~~----------~~~~~~~gaAlr~esld~~i~ 58 (479) +.++. +.. .....+. ...++.-++|.+.+-.|-.. .-.+..+|+.|-+|.+.++|. T Consensus 50 ~~~~e~~~~~~~~~~~~~~~~~~~~~~--~~~~e~~~a~~~~l~~g~~~~~~~~e~~a~~~~t~~~gG~~iP~~~~~~I~ 127 (407) T protein:vir:48 50 LAELENLKSDLEAELAEVKRPAGGTQN--KVASEHKEAFIGFMRKGREDGLRELERKALQVGNDEDGGYAIPEELDRTIL 127 (407) T ss_pred HHHHHHHHHHHHHHHHHhhcccccccc--chhhHHHHHHHHHHhccchhhhhHHHHHhhhcccCCCCcccccHhHHHHHH Confidence 00000 000 0000111 11112223333333333211 112335678899999999987 Q ss_pred HHhhccccccchhhhccchhHHHHHHhhhhhccCcccccccccccccc-cccCcceEEEEEEEEeeeehhhhhhhHhhhc Q lcl|NC_018863. 59 MLAFTNGDFTIYPLINKQQVNSTVAKYAVFNQHGRTGHSRFVREVGVA-SINDPNIRQKTVQMKFLSDTKQQSLAAGLVN 137 (479) Q Consensus 59 ~l~~~~~~f~~~~~i~k~~~~stv~~y~~~~~~G~~g~~~fv~E~g~~-~~~d~~~~r~~~~~k~l~~~~~vs~~~~lv~ 137 (479) .+..... .+++.+...+..+--..|. ...++ ....+++|++.. +..++.+......++=++.-..+|.-+ +.+ T Consensus 128 ~~~~~~~--~l~~~~~~~~~~~~~~~~~--~~~~~-~~a~~v~E~~~~~~~~~~~f~~i~~~~~k~~~~~~iS~el-l~d 201 (407) T protein:vir:48 128 TLLKDEV--VMRQEATVITLGGSDYKKL--VNLGG-TTSGWVGETDARPETATSKLGLIEPFMGEIYGNPQATQKM-LDD 201 (407) T ss_pred HHHHhhh--hhhhhceeeecCCCceEEE--EecCC-cceeeecccccccccccccceeEEeeeeeeEeehhhHHHH-Hhc Confidence 7765543 3444444344443322222 22233 346689999975 567799999999998777777777653 235 Q ss_pred chhhHHHHHHHHHHHHHHHHHHHHHhhcccccCCCCCCcccchhhhHHHhhccCC-c----------EEEccCCCCCHHH Q lcl|NC_018863. 138 NIADPMTILTEDAISVIAKSIEWAIFYGDAALAAEADNQAGIEFDGLTKLIDEAT-N----------VIDLKGERLDEAT 206 (479) Q Consensus 138 ~~~dp~~~~~~~ai~~~~~~~e~a~f~Gd~~l~~~~~~~~gleFDGl~~~I~~~~-N----------viDarG~~l~~~~ 206 (479) +..|.+....+.-...+...+|.++++||-. + +..|+++...... + +.-..-..++.+. T Consensus 202 s~~~l~~~i~~~l~~~i~~~~~~a~l~G~G~-~---------~p~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~ 271 (407) T protein:vir:48 202 AFFNVEDWINSELALEFAEQEEIAFTSGDGS-K---------KPKGFLAYESTDEDDKTRAFGKLQHIASGAASGVTADA 271 (407) T ss_pred chHHHHHHHHHHHHHHHHHHHHhhhhccCCC-C---------ccceeeecccccccccccccccccccccccccccChHH Confidence 5667888888888888999999999999765 1 2357765543111 1 1111223345555 Q ss_pred hhhhhheeecccCceeeeecChHHHhhHHHhhcCceeEE-eecCCCccccCccccceecCceeEEecCCcccCCCccccC Q lcl|NC_018863. 207 LNKAAVIVGKGYGRATDAFMPIGVQADFTNNLLDRQRVI-QPSQAGGFSTGFSINQFLSTRGAINLHGSTIMENDNILVD 285 (479) Q Consensus 207 l~~aa~~i~~~fG~atd~~mp~~vka~f~q~~~~~qrv~-~~~n~g~~~~G~~V~~~~ss~g~I~L~~s~v~~a~~~lve 285 (479) |.++...+..+|-.....+|+..+.+.+...-...-|-+ +|+ .. .|. +.++++.+.+..+ T Consensus 272 i~~l~~~l~~~~~~~a~~v~n~~~~~~L~~lkD~~Gr~l~~~~-~~---~g~---------------~~~l~G~PV~~~~ 332 (407) T protein:vir:48 272 IIKLIYTLRKAHRSGAKFMMNNSSLFAIRLLKDNDGNYLWRPG-IE---LGQ---------------PSSLAGYGIVENE 332 (407) T ss_pred HHHHHHhhchhhhcCCEEEEcHHHHHHHHHhhccCCceeeccC-cC---CCC---------------CceecceeeEEec Confidence 554443344455444457899999888876544333333 333 11 010 0122333333222 Q ss_pred cccCCCCCcccceEEEeecccccCcccccccceeeEEEEEEEcCCCCcccccceeeeeecCCCeEEEEEeecCCccccce Q lcl|NC_018863. 286 RIPEPNAPQAPASVVATVKVNDKGAFRPVKDIKTHSYKVVVHSDDAESLASEAVTAVVANPTDSVSLAVKLQSLYQAKPQ 365 (479) Q Consensus 286 r~~s~~aP~~P~~vta~~~~~~~g~~~~~sd~g~Y~YkV~a~n~~GES~~S~~VtaT~a~~~~~V~LtIt~~~~~~~~~~ 365 (479) ..+...+. .+.. . .|.+++.+....+.|-....+ --...+.+ . T Consensus 333 ~~p~~~~~-~~~i--------~---------~Gd~~~~~~i~~~~~~~i~~d-----~~~~~~~~--------------~ 375 (407) T protein:vir:48 333 QMPDIAAD-AKAI--------A---------FGNFKRGYTIVDRIGTRILRD-----PYTNKPFV--------------G 375 (407) T ss_pred CcCCccCC-ccEE--------E---------EEeccccEEEEEeeceEEEee-----ccccCCcE--------------E Confidence 21110000 0000 0 122222111122211110000 00001111 2 Q ss_pred EEEEEecc---CCCCcEEEEEEeeeeeccCCCe Q lcl|NC_018863. 366 FISVYRQG---NETGHYFLVARVPLSKADENGV 395 (479) Q Consensus 366 y~~IYR~t---~~~g~~~~i~rV~~s~~n~~~t 395 (479) |+...|=. ..+.-| ...+++.+....... T Consensus 376 ~~~~~r~d~~v~~~~a~-~~l~~~aa~~~~~~~ 407 (407) T protein:vir:48 376 FYTTKRTGGMLVDSQAI-KLMKIGAATRQKAAA 407 (407) T ss_pred EEEEEEeccEEecccce-EEEEeeccCCCCCCC Confidence 22222222 011112 122222221111111 No 107 >protein:vir:8420 Length: 477 # NCBI annotation: gp15 # Family: family:all:21 # MgeID: mge:155 # MgeName: Omega # Cross-refs: genbank:acc:NP_818316;genbank:gi:29566752;genbank:GeneID:1260033 Probab=85.75 E-value=0.05 Score=27.69 Aligned_cols=323 Identities=11% Similarity=0.025 Sum_probs=131.9 Q ss_pred Cccccccc--ceeeeecCchh-----------------HHHHHH------------HHHHHhhcCcccCcccccCccccc Q lcl|NC_018863. 1 MTELQKEQ--KVEARKLPAGA-----------------EAELAE------------LVSKSFTTGTGITPDTQHDAAALR 49 (479) Q Consensus 1 ~~~~~~~~--~~~~~~~~~~~-----------------~~~~~e------------~~~Ksf~ag~~~~~~~~~~gaAlr 49 (479) +.+...+. +....+....+ ..++.. ...+....--..+..+..||.... T Consensus 90 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~lv~ 169 (477) T protein:vir:84 90 VRKATVEVNEALTYEKGNGQSYFRDLAMQTVGMADEPAKERLRRHMVDVESDKEIRKIAKVGEEYRDLDRNGGTGGYAVP 169 (477) T ss_pred hcccccccccchhhhhhHHHHHHHHHHHHHhhhhhhHHHHHHHHHHhhhhhhhhHHHHHHhhhhhccccccCCCcceeec Confidence 11000000 00000000000 000000 000111111111222334555555 Q ss_pred hhhhHHHHHHHhhccccccchhhhccchhHHHHHHhhhhhccCcccccccccccc-----cccccCcceEEEEEEEEeee Q lcl|NC_018863. 50 RELLDDQVKMLAFTNGDFTIYPLINKQQVNSTVAKYAVFNQHGRTGHSRFVREVG-----VASINDPNIRQKTVQMKFLS 124 (479) Q Consensus 50 ~esld~~i~~l~~~~~~f~~~~~i~k~~~~stv~~y~~~~~~G~~g~~~fv~E~g-----~~~~~d~~~~r~~~~~k~l~ 124 (479) .|-+..+|..+..... .+.+-+...........+..-...++..-+..++|++ ..+.+|+.+.+....+|=++ T Consensus 170 ~~~~~~~ii~~l~~~~--~i~~~~~~~~~~~~~~~~~ip~~~~~~~~a~~~~Eg~~~~~~~~~~s~~~f~~i~~~~~k~~ 247 (477) T protein:vir:84 170 PLWMMNRFIELARAGR--TYANLCPTEPLPGGTSSINIPKILTGTSTAIQAADNAALTAPSAHEVDLTDGFVQANVKTIA 247 (477) T ss_pred cchhHHHHHHHhhhcc--hHHHhhceeeecCCcceeEEEEEecCcceeeeeccCcccccccccccccceeeEEEeeeeEE Confidence 6655555544443322 2333334344443333221111122222234577775 34678899999999998888 Q ss_pred ehhhhhhhHhhhcchhhHHHHHHHHHHHHHHHHHHHHHhhcccccCCCCCCcccchhhhHHHhhccCCcEEEccCCCCCH Q lcl|NC_018863. 125 DTKQQSLAAGLVNNIADPMTILTEDAISVIAKSIEWAIFYGDAALAAEADNQAGIEFDGLTKLIDEATNVIDLKGERLDE 204 (479) Q Consensus 125 ~~~~vs~~~~lv~~~~dp~~~~~~~ai~~~~~~~e~a~f~Gd~~l~~~~~~~~gleFDGl~~~I~~~~NviDarG~~l~~ 204 (479) .--.+|..+= .++.-|.+....++-...+++.+|.++|+|+-. + -+..||.+.-. .+.+++-+...+. T Consensus 248 ~~~~iS~ell-~ds~~~l~~~i~~~l~~~~~~~~d~~~l~G~Gt---~------~~p~Gi~~~~~--~~~~~~~~~~~t~ 315 (477) T protein:vir:84 248 GQQGIAIQLL-DQAAVSVDEFVFRDLAADYANKLNVQVISGTGS---N------NQVVGVRATAG--ITQVTATSAGSAL 315 (477) T ss_pred eeeHHHHHHH-hccchhHHHHHHHHHHHHHHHHHHHHHhccCCC---C------Cccceeeeccc--cccccccccccch Confidence 8877876543 344446788888889999999999999999743 1 13457776532 3444444443332 Q ss_pred HH-------hhhhhheeecccCcee-eeecChHHHhhHHHhhc-CceeEEeecCCCccccCccccceecCceeEEecCCc Q lcl|NC_018863. 205 AT-------LNKAAVIVGKGYGRAT-DAFMPIGVQADFTNNLL-DRQRVIQPSQAGGFSTGFSINQFLSTRGAINLHGST 275 (479) Q Consensus 205 ~~-------l~~aa~~i~~~fG~at-d~~mp~~vka~f~q~~~-~~qrv~~~~n~g~~~~G~~V~~~~ss~g~I~L~~s~ 275 (479) .. |-.+...+..+|+... -.+|++.+.+.+...-. +++-+.+|+.++....+.....+. .....+ T Consensus 316 ~~~~~~~~~i~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~~~~~~~~------~~~~~~ 389 (477) T protein:vir:84 316 EKHQIIYQKIADAIQRVHTSRFLEPEVIVMHPRRWASFHAIFAGDDRPLIVPSGPGFNNLGVLTEVAS------QRVVGQ 389 (477) T ss_pred hhHHHHHHHHHHHHhhccccccCCccEEEEcHHHHHHHHHhhccCCCeeeecCccccccccccccccc------ccccch Confidence 22 2233334445666544 47779998888877664 344455665443322222211100 111122 Q ss_pred ccCCCccccCcccCCCCCcccceEEEeecccccCcccccccceeeEEEEEEEcCCCCcccccceeeeeecC--------- Q lcl|NC_018863. 276 IMENDNILVDRIPEPNAPQAPASVVATVKVNDKGAFRPVKDIKTHSYKVVVHSDDAESLASEAVTAVVANP--------- 346 (479) Q Consensus 276 v~~a~~~lver~~s~~aP~~P~~vta~~~~~~~g~~~~~sd~g~Y~YkV~a~n~~GES~~S~~VtaT~a~~--------- 346 (479) +++.+.+..+..+.-.+... ..... -.|..++-+.. ++..+.. ..+-... T Consensus 390 l~G~pVv~s~~~p~~~~~~~-----------d~~~i----~~gd~~~~~i~-----~~~~~~~-~~~~~~~~~~~~~~~v 448 (477) T protein:vir:84 390 MHGLPVVTDPTLPTTLGTGT-----------DQDVI----HVLRASDLALF-----ESSVRMR-ALQETRAENLSVLLQV 448 (477) T ss_pred hcccceEecCcccccccccC-----------CcceE----EEEEeceEEEE-----eeceeEE-eccccccccceeeeee Confidence 23333322221111000000 00000 01111111111 0000000 0000000 Q ss_pred -----------CCeEEEEEeecCCccccceEE Q lcl|NC_018863. 347 -----------TDSVSLAVKLQSLYQAKPQFI 367 (479) Q Consensus 347 -----------~~~V~LtIt~~~~~~~~~~y~ 367 (479) ..++. .|| +.+-..|+|- T Consensus 449 ~~~~~~~~~r~~~afv-~~t--~~~~~~~~~~ 477 (477) T protein:vir:84 449 YGYLAFTAARFPQSVV-EIG--GTALTAPTFA 477 (477) T ss_pred hhhhhhhhhccccceE-Eee--cccccccccC Confidence 11110 111 2223333332 No 108 >protein:vir:3158 Length: 321 # NCBI annotation: capsid protein gpE # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:316 # MgeName: PhiCh1 # Cross-refs: genbank:acc:NP_665929;genbank:gi:22091115;genbank:GeneID:951342 Probab=85.24 E-value=0.054 Score=27.52 Aligned_cols=303 Identities=12% Similarity=0.067 Sum_probs=139.2 Q ss_pred cCchhHHHHHHHHHHHhhcCcccCcccccCccccchhhhHHHHHHHhhccccccchhhhccchhHHHHHHhhhhhccCcc Q lcl|NC_018863. 15 LPAGAEAELAELVSKSFTTGTGITPDTQHDAAALRRELLDDQVKMLAFTNGDFTIYPLINKQQVNSTVAKYAVFNQHGRT 94 (479) Q Consensus 15 ~~~~~~~~~~e~~~Ksf~ag~~~~~~~~~~gaAlr~esld~~i~~l~~~~~~f~~~~~i~k~~~~stv~~y~~~~~~G~~ 94 (479) |+.-..++-+..+.|. -.++..+..+|..+..|.-.+-+..+. ... .|++.++-.++++-.. ++...|-. T Consensus 1 ~~~k~~~~~l~~~~~~----~~~~~~~~~~g~~v~~~~~~~l~~~i~-e~s--~~l~~i~v~~v~~~~~---~i~~~~~~ 70 (321) T protein:vir:31 1 MASRTINNDLSRITEK----NALTVDDLDAGGTLPDPLWDEFWTDMI-EET--PLLDAIRTETVGAKKT---RIPTLNIG 70 (321) T ss_pred CchHHHHHHHHHHHHh----ccccccccCCcceeCHHHHHHHHHHHH-Hhh--hhhhhceeeeccCcce---eeeeeccC Confidence 4444433322223321 133334455566776665555444433 332 3677776555543222 22222211 Q ss_pred ccccccc-cc-ccccccCcceEEEEEEEEeeeehhhhhhhHhhhcch--hhHHHHHHHHHHHHHHHHHHHHHhhcccccC Q lcl|NC_018863. 95 GHSRFVR-EV-GVASINDPNIRQKTVQMKFLSDTKQQSLAAGLVNNI--ADPMTILTEDAISVIAKSIEWAIFYGDAALA 170 (479) Q Consensus 95 g~~~fv~-E~-g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lv~~~--~dp~~~~~~~ai~~~~~~~e~a~f~Gd~~l~ 170 (479) +.....+ |+ +....++|.+.+....++=+..--.+|.-. |-++. .|-+....+.-..+++.+++.+.|+||..-. T Consensus 71 ~~~~~~~~e~~~~~~~~~~~~~~~~~~~~k~~~~~~it~e~-L~d~a~~~d~e~~i~~~ia~~~a~~~~~~~~nGd~~~~ 149 (321) T protein:vir:31 71 ERHRRPQDEGEWNENESDVSTGTIDISTEKATVAWDLPREV-VQENPEGEALADRILNLMTDAWSADVEDLAANGDEDAE 149 (321) T ss_pred CcccccccccccccccccceeeeeeeeeEEEEeehhccHHH-HHhhhcchhHHHHHHHHHHHHHHHHHHhheeeccccCC Confidence 1111222 33 344567888877777776665555555432 22332 4788888888889999999999999997644 Q ss_pred CCCCCcccchhhhHHHhhccCCcEEEccCCCCCHHHhhhhhheeecccCc-ee-eeecChHHHhhHHHhhcCceeEEeec Q lcl|NC_018863. 171 AEADNQAGIEFDGLTKLIDEATNVIDLKGERLDEATLNKAAVIVGKGYGR-AT-DAFMPIGVQADFTNNLLDRQRVIQPS 248 (479) Q Consensus 171 ~~~~~~~gleFDGl~~~I~~~~NviDarG~~l~~~~l~~aa~~i~~~fG~-at-d~~mp~~vka~f~q~~~~~qrv~~~~ 248 (479) +..+ ...+|+++.+....+.++..+..++.+.|..+-..+-..|-. .+ -.+|+..+.+++...+.+++.-+ T Consensus 150 ~~~~----~~n~G~l~~a~~~~~~~~~~~~~~~~d~l~~l~~~l~~~yr~~~~~v~im~~~~~~~~~~~l~~~~~~~--- 222 (321) T protein:vir:31 150 DSFE----NQNDGFITVAEGDVETIDAADDILDNDLVIRTIAGLDSKYRARMNPALIVSEDQLLSYHYTLTDRDTPL--- 222 (321) T ss_pred Cccc----ccchhhhhhhccccccccccccccCHHHHHHHHHhccHhHhcCCCeEEEechHHHHHHHHHHhcCCCcc--- Confidence 4222 246999999976667889999999988888766667666633 22 25699888777777665543211 Q ss_pred CCCccccCccccceecCceeEEecCCcccCCCccccCcccCCCCCcccceEEEeecccccCcccc------------ccc Q lcl|NC_018863. 249 QAGGFSTGFSINQFLSTRGAINLHGSTIMENDNILVDRIPEPNAPQAPASVVATVKVNDKGAFRP------------VKD 316 (479) Q Consensus 249 n~g~~~~G~~V~~~~ss~g~I~L~~s~v~~a~~~lver~~s~~aP~~P~~vta~~~~~~~g~~~~------------~sd 316 (479) +..+. ...... ++++.+.+.++..+. - ....+.-.+..-.++. .++ T Consensus 223 -------~~~~l---~~~~~~-----tl~G~pvv~~~~mP~-----~--~il~t~~~nl~~~~~~~~~~~~~~~~~~~~~ 280 (321) T protein:vir:31 223 -------GDNVI---MGEADV-----NPFSFPIIGSGLWPD-----D--KAMFTDPQNLIYALYRDLEIDVLTESDKVSE 280 (321) T ss_pred -------ccchh---hccccc-----cccceeEEEcCCCCC-----C--cEEEeccccEEEEEeeccEEEEeecCccccc Confidence 10000 000001 112222211111110 0 0000000000000000 000 Q ss_pred ceeeEEEEEEEcCCCCcccccceeeeeecCCCeEEEEEeecCC Q lcl|NC_018863. 317 IKTHSYKVVVHSDDAESLASEAVTAVVANPTDSVSLAVKLQSL 359 (479) Q Consensus 317 ~g~Y~YkV~a~n~~GES~~S~~VtaT~a~~~~~V~LtIt~~~~ 359 (479) -...+|.....|.+.-=-- ....+.+.+.....+. +-...+ T Consensus 281 ~~~~~~~~~~~~~~~~ve~-~~a~a~~~~i~~~~~~-~~~~~~ 321 (321) T protein:vir:31 281 RDLHARYFMRGDDDFAIEN-TEAVVLAEGLGDPLEH-LEEETS 321 (321) T ss_pred cceeeEeeeeeecceeEec-cccEEEEecCCcchhc-ccCCCC Confidence 0111111111111100000 1111112222222221 111111 No 109 >protein:vir:4159 Length: 315 # NCBI annotation: structural protein # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:87 # MgeName: psiM2 # Cross-refs: genbank:acc:NP_046968;genbank:gi:9630538;genbank:GeneID:1261712 Probab=84.93 E-value=0.057 Score=27.42 Aligned_cols=280 Identities=11% Similarity=0.040 Sum_probs=129.6 Q ss_pred ceeeeecCchhHHHHHHHHHHHhhcCcccCcccccCccccchhhhHHHHHHHhhccccccchhhhccchhHHHHHHhhhh Q lcl|NC_018863. 9 KVEARKLPAGAEAELAELVSKSFTTGTGITPDTQHDAAALRRELLDDQVKMLAFTNGDFTIYPLINKQQVNSTVAKYAVF 88 (479) Q Consensus 9 ~~~~~~~~~~~~~~~~e~~~Ksf~ag~~~~~~~~~~gaAlr~esld~~i~~l~~~~~~f~~~~~i~k~~~~stv~~y~~~ 88 (479) -.+. ...-...+ ..+.|++++ +-.+|+.|..|.+++-|.. ..... .|++.+.-.+ +...+... T Consensus 1 ~~~~---~~~~~~~~-~~~~k~~t~-------~d~~Gg~l~P~~~~~~i~~-~~e~s--~~l~~~~vi~---~~~~~~~~ 63 (315) T protein:vir:41 1 MLTI---EDIRGGKP-FEIVPKIDV-------PDLGRGVLSVDRFGEFVKA-VRDSA--VIIPEARIDN---ALKSYEKD 63 (315) T ss_pred Cccc---chhhcCCh-hhhhhhcCC-------cCCCCceechHHHHHHHHH-HHhhh--hhhhhceeee---cccccccc Confidence 0000 11112222 224577764 2347888999999886654 44433 2444443211 11111111 Q ss_pred hccCccc-----ccccccccccccccCcceEEEEEEEEeeeehhhhhhhHhhhcch--hhHHHHHHHHHHHHHHHHHHHH Q lcl|NC_018863. 89 NQHGRTG-----HSRFVREVGVASINDPNIRQKTVQMKFLSDTKQQSLAAGLVNNI--ADPMTILTEDAISVIAKSIEWA 161 (479) Q Consensus 89 ~~~G~~g-----~~~fv~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lv~~~--~dp~~~~~~~ai~~~~~~~e~a 161 (479) ....+.| ...-.+|.+.+..++|.+.+....++-+..--.+|.-+ |.++. .|-+......--.+++...|.+ T Consensus 64 i~~~g~~~~~~~g~~~~~~~~~~~~~~~~f~~~~l~~~~l~~~~~it~el-L~D~~~~~~~e~~l~~~~a~~~a~~~~~~ 142 (315) T protein:vir:41 64 ISRLSLVLDVGPGRDETGQKLAPPESTAEVKTNTLYMREMVTKVVIHEDA-IEDNIEGKAFEQKIVTLLGEGISYVLEKY 142 (315) T ss_pred ccccccCcccccccccccCcCCCCCCccccceeeeceeeeeeeccccHHH-HHhhhccccHHHHHHHHHHHHHHHHHHHH Confidence 2221111 11234566667778899999998888877654444322 23443 3788888888889999999999 Q ss_pred HhhcccccCCCCCCcccchhhhHHHhhccC--CcEEEccCCCCCHHHhhhhhheeeccc-Cc--eeeeecChHHHhhHHH Q lcl|NC_018863. 162 IFYGDAALAAEADNQAGIEFDGLTKLIDEA--TNVIDLKGERLDEATLNKAAVIVGKGY-GR--ATDAFMPIGVQADFTN 236 (479) Q Consensus 162 ~f~Gd~~l~~~~~~~~gleFDGl~~~I~~~--~NviDarG~~l~~~~l~~aa~~i~~~f-G~--atd~~mp~~vka~f~q 236 (479) .|.||..-.+..- -+.||+.+.+... ....|.....++.+.|..+.--+-..| -. --..+|+..+.+.+.. T Consensus 143 ~~nGdg~s~~p~~----~~~~G~l~~a~~~~~~~~~~~~a~~~~~d~l~~l~~sl~~~yr~~~~~~~~imn~~t~~~~rk 218 (315) T protein:vir:41 143 YLHGDTSSSDPLL----RMSDGWLKLASEKLTESDVDPEAEDWPMNLFDTMIESLPTPYRNNLPNMKFYVTWDIYRAYRD 218 (315) T ss_pred hhccCCcCcCccc----cccccceecccccccccccccccccccHHHHHHHHHhcChHHhhcCCceEEEEcHHHHHHHHH Confidence 9999986221110 1468999877521 223455555566776665443333333 22 2247799999888766 Q ss_pred hhcCceeEEeecCCCccccCccccceecCceeEEecCCcccCCCccccCcccCCCCCcccceEEEeecccccCccccccc Q lcl|NC_018863. 237 NLLDRQRVIQPSQAGGFSTGFSINQFLSTRGAINLHGSTIMENDNILVDRIPEPNAPQAPASVVATVKVNDKGAFRPVKD 316 (479) Q Consensus 237 ~~~~~qrv~~~~n~g~~~~G~~V~~~~ss~g~I~L~~s~v~~a~~~lver~~s~~aP~~P~~vta~~~~~~~g~~~~~sd 316 (479) ....+.+-+-... ...| .++++++.+...++-......|..+. . .|. T Consensus 219 lk~~~g~~lw~~~---~~~g---------------~~~tl~G~PV~~~~~m~~~~~~~~~i---l------f~d------ 265 (315) T protein:vir:41 219 ALKGRETGLGDQA---LTGA---------------NSILYDGRPVQYVPALEALNDGKSRA---L------FVV------ 265 (315) T ss_pred HhccCCCccccch---hhcC---------------CCceecccceEecccccccCCCCccE---E------Eec------ Confidence 6554433221110 0000 01122222222222222211121110 0 000 Q ss_pred ceeeEE---------------------EEE--EEcCCCCcccccceeeeeecCCCeEEEEE Q lcl|NC_018863. 317 IKTHSY---------------------KVV--VHSDDAESLASEAVTAVVANPTDSVSLAV 354 (479) Q Consensus 317 ~g~Y~Y---------------------kV~--a~n~~GES~~S~~VtaT~a~~~~~V~LtI 354 (479) -++|.| ... +....+.+....+ ..|+| T Consensus 266 ~~nl~~~~~~~i~i~~~~~a~~~~~~~~~~~r~d~~~~~~~~~a~-----------~~~~v 315 (315) T protein:vir:41 266 PTQLVYGFWRNIKVVPDYDAEMRLTKYVASLRTDNHYEDEEGAVS-----------ATITV 315 (315) T ss_pred ccceEEEeccccEEEeeecCCCCceEEEEEEEeceeEEeccceeE-----------eeeeC Confidence 011211 111 1111111111100 00112 No 110 >protein:vir:962 Length: 397 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:19 # MgeName: bIL285 # Cross-refs: genbank:acc:NP_076616;genbank:gi:13095724;genbank:GeneID:920264 Probab=83.91 E-value=0.065 Score=27.10 Aligned_cols=297 Identities=14% Similarity=0.084 Sum_probs=119.3 Q ss_pred Cccccccc---ceeeeecCchhHHHH---HHHHHHHhhcCcccCcccccCccccchhhhHHHHHHHhhccccccchhhhc Q lcl|NC_018863. 1 MTELQKEQ---KVEARKLPAGAEAEL---AELVSKSFTTGTGITPDTQHDAAALRRELLDDQVKMLAFTNGDFTIYPLIN 74 (479) Q Consensus 1 ~~~~~~~~---~~~~~~~~~~~~~~~---~e~~~Ksf~ag~~~~~~~~~~gaAlr~esld~~i~~l~~~~~~f~~~~~i~ 74 (479) .++..++. .....+.......+. .+.+.++... -.....+..+|+.+-.+.+...|..+-.. -.+.+.+. T Consensus 91 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~vp~~~~~~i~~~~~~---~~l~~~~~ 166 (397) T protein:vir:96 91 TDQKPKDGEKRKMKKFKVTEEELAEKRSAINAFVKSKGA-EKRDGFTSVEGGALIPQELLQPQLEPKDI---VDLSKYVR 166 (397) T ss_pred hhhhhHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHhhhh-hhhhcccccccccchhHHHHHHHHHhhhh---hhHHHhhh Confidence 00000000 000000000000000 0111111110 01112234556667777777777654221 12333333 Q ss_pred cchhHHHHHHhhhhhccCccccccccccccccc-ccCcceEEEEEEEEeeeehhhhhhhHhhhcchhhHHHHHHHHHHHH Q lcl|NC_018863. 75 KQQVNSTVAKYAVFNQHGRTGHSRFVREVGVAS-INDPNIRQKTVQMKFLSDTKQQSLAAGLVNNIADPMTILTEDAISV 153 (479) Q Consensus 75 k~~~~stv~~y~~~~~~G~~g~~~fv~E~g~~~-~~d~~~~r~~~~~k~l~~~~~vs~~~~lv~~~~dp~~~~~~~ai~~ 153 (479) ...+.+.-.+|..... ..+...+++|++... .+++.+.+....++=++.--.+|.-+ +.++..|.+....+.--.. T Consensus 167 ~~~~~~~~~~~~~~~~--~~~~~~~~~E~~~~~~~~~~~~~~i~~~~~~~~~~~~~s~el-l~ds~~~l~~~i~~~l~~~ 243 (397) T protein:vir:96 167 SVPVNSASGKFPVISK--SGSKMATVQQLEKNPQLANPKMVEIDYSVATRRGYIPISQEM-IDDASYDVTGLIADEIQDQ 243 (397) T ss_pred hccccccceeEEEEec--cCCccccccccccccccccccccceeecHhHhhcchhhHHHH-HhhhHHHHHHHHHHHHHHH Confidence 3333332233332222 224456788888665 68999999999988777655555532 2345556667777777788 Q ss_pred HHHHHHHHHhhcccccCCCCCCcccchhhhHHHhhccCCcEEEccCCCCCHHHhhhhhheeecccCceeeeecChHHHhh Q lcl|NC_018863. 154 IAKSIEWAIFYGDAALAAEADNQAGIEFDGLTKLIDEATNVIDLKGERLDEATLNKAAVIVGKGYGRATDAFMPIGVQAD 233 (479) Q Consensus 154 ~~~~~e~a~f~Gd~~l~~~~~~~~gleFDGl~~~I~~~~NviDarG~~l~~~~l~~aa~~i~~~fG~atd~~mp~~vka~ 233 (479) +...++.+++.|+..-.+.+ .+-+|++..+|.. .++ .+++ .-.+||+.+... T Consensus 244 ~~~~~~~~i~~g~g~~~~~~----~~~~d~~~~~~~~---~~~-------------------~~~~--a~~v~n~~~~~~ 295 (397) T protein:vir:96 244 SLNTKNADIAAVLKTATAKS----VVGVDGLKDLINK---EIK-------------------KVYD--VKLFISASMYSE 295 (397) T ss_pred HHHHHHHHHhhccccccccc----ccchHHHHHHHHH---hhh-------------------hhcC--cEEEEcHHHHHH Confidence 88899999999988765543 3457777766531 011 1111 237999999888 Q ss_pred HHHhhcCceeEEeecCCCccccCccccceecCceeEEecCCcccCCCccccCcccCCCCCcccceEEEeecccccCcccc Q lcl|NC_018863. 234 FTNNLLDRQRVIQPSQAGGFSTGFSINQFLSTRGAINLHGSTIMENDNILVDRIPEPNAPQAPASVVATVKVNDKGAFRP 313 (479) Q Consensus 234 f~q~~~~~qrv~~~~n~g~~~~G~~V~~~~ss~g~I~L~~s~v~~a~~~lver~~s~~aP~~P~~vta~~~~~~~g~~~~ 313 (479) +...-...-|.+...+..+. .+.++++.+.+..+-.....+... .. .+ T Consensus 296 l~~lkd~~G~~~~~~~~~~~------------------~~~~l~G~pv~~~~~~~~~~~~~~---~~----------~~- 343 (397) T protein:vir:96 296 LDKLKDKNGRYLLQDSITAA------------------SGKQLLGKEVVVLDDDVIGKSVGN---VV----------GF- 343 (397) T ss_pred HHHhhccCCCeEeccCccCC------------------CcccccccceEEecccccCCCCCc---eE----------EE- Confidence 87654332233322111110 011222222222111000000000 00 00 Q ss_pred cccceeeEEEEEEEcCCCCcccccceeeeeecCCCeEEEEEeecCCccccceEEEEEeccC---CCCcEEEEEEeeeeec Q lcl|NC_018863. 314 VKDIKTHSYKVVVHSDDAESLASEAVTAVVANPTDSVSLAVKLQSLYQAKPQFISVYRQGN---ETGHYFLVARVPLSKA 390 (479) Q Consensus 314 ~sd~g~Y~YkV~a~n~~GES~~S~~VtaT~a~~~~~V~LtIt~~~~~~~~~~y~~IYR~t~---~~g~~~~i~rV~~s~~ 390 (479) .|.++.-+....+.|-+.... ..... .-.++.++|=.- .+..| ..+.+..+ T Consensus 344 ---~gd~~~~~~~~~~~~~~~~~~------~~~~~--------------~~~~~~~~r~d~~~~~~~a~---~~~~~~~a 397 (397) T protein:vir:96 344 ---IGDAKAFASFFDRKQVSVSWV------DNNIY--------------GQLLAGIIRYDVKATDKKAG---FYVTFTIG 397 (397) T ss_pred ---EeehhcceEeEeecceEEEEe------ccccc--------------ceeEEEEEEEccEEecccce---EEEEeecC Confidence 122221111111111111000 00000 001222222221 01111 11111111 No 111 >protein:vir:95963 Length: 395 # NCBI annotation: ORF009 # Family: family:all:635 # MgeID: mge:1594 # MgeName: 2638A # Cross-refs: genbank:acc:YP_239802;genbank:gi:66395459;genbank:GeneID:5132880 Probab=81.77 E-value=0.0073 Score=32.30 Aligned_cols=287 Identities=15% Similarity=0.112 Sum_probs=110.5 Q ss_pred Cc----ccccccceeeeecCchhHHHHHHHHH-------------HHhhcCcccCcccccCccccchhhhHHHHHHHhhc Q lcl|NC_018863. 1 MT----ELQKEQKVEARKLPAGAEAELAELVS-------------KSFTTGTGITPDTQHDAAALRRELLDDQVKMLAFT 63 (479) Q Consensus 1 ~~----~~~~~~~~~~~~~~~~~~~~~~e~~~-------------Ksf~ag~~~~~~~~~~gaAlr~esld~~i~~l~~~ 63 (479) +. ++..+.+ . ......+.+..+.+. ++|.. .+...+..+|+.|-.+.+.+.|...... T Consensus 38 ~~~~~~~~~~~~~--~-~~~~e~~~~~~~~~~~~~r~~~~l~~ee~~~~~--~~~~~t~~~gG~liP~~~~~~Ii~~l~~ 112 (395) T protein:vir:95 38 FGAMFDALSNDLQ--E-EITAEINNRVVDNGILAKRSQDPLTSEERKFFN--DINYDVGYTDEKILPETVVERVFDDLQK 112 (395) T ss_pred HHHHHHHHHHHHH--H-HHHHHHHHHHHHHHHHhhcCccccchHHHHHHH--HHhhccCCCCceeccHHHHHHHHHHHHh Confidence 00 1100000 0 000000001101000 11100 1222355678888888888888554443 Q ss_pred cccccchhhhccchhHHHHHHhhhhhccCccccccccccccc-ccccCcceEEEEEEEEeeeehhhhhhhHhhhcchhhH Q lcl|NC_018863. 64 NGDFTIYPLINKQQVNSTVAKYAVFNQHGRTGHSRFVREVGV-ASINDPNIRQKTVQMKFLSDTKQQSLAAGLVNNIADP 142 (479) Q Consensus 64 ~~~f~~~~~i~k~~~~stv~~y~~~~~~G~~g~~~fv~E~g~-~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lv~~~~dp 142 (479) .. .+++.+....+..++ .+...++.+.+.++.|.+. ...+++.+.+.....+=|+.-..+|..+ |.++..|. T Consensus 113 ~s--~i~~~~~v~~~~~~~----~i~~~~~~~~a~w~~e~~~~~~~~~~~f~~i~l~~~kl~~~~~iS~el-l~ds~~~i 185 (395) T protein:vir:95 113 DH--PLLSKINFQNAGIKT----RVIKADPAGQAVWGKVFGEIKGQLDAAFREENFTQYKLTCFVVLPDDL-STFGPAWI 185 (395) T ss_pred hh--hhhhhceeEecCCce----EEEEecCCcceEEeecccccCccccccceeeeeceeeEEEeecccHHH-HhcchhHH Confidence 32 345555444454432 2344455556667777665 4578999999999999998777777665 56677888 Q ss_pred HHHHHHHHHHHHHHHHHHHHhhcccccCCCCCCcccchhhhHHHhhccCCcEE-Ecc-CCCCC-------HHH----hhh Q lcl|NC_018863. 143 MTILTEDAISVIAKSIEWAIFYGDAALAAEADNQAGIEFDGLTKLIDEATNVI-DLK-GERLD-------EAT----LNK 209 (479) Q Consensus 143 ~~~~~~~ai~~~~~~~e~a~f~Gd~~l~~~~~~~~gleFDGl~~~I~~~~Nvi-Dar-G~~l~-------~~~----l~~ 209 (479) +....+.--..+++.+|.+++.|+-.=...| -||++.+....... +.. ...+. ... +.. T Consensus 186 e~~i~~~la~~ia~~~~~a~i~G~G~~~~qP--------~Gil~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~l~~~~~~ 257 (395) T protein:vir:95 186 ERFVRTQIQEAISVALESAIINGGGAAKTQP--------VGLMKDVNTNSGAVTDKASSGTLTFADADTTILELNDVLKN 257 (395) T ss_pred HHHHHHHHHHHHHHHHhhheeeccCCCCcCc--------eeeeecccccccccccccccchhhhhhhHhhHHHHHHHHHh Confidence 9999999999999999999999985421111 13333221100000 000 00000 000 011 Q ss_pred hhhe-eecc---cCceeeeecChHHHhhHHHhh------------cC-ceeEEeecC--CCccccCccccc-eecCceeE Q lcl|NC_018863. 210 AAVI-VGKG---YGRATDAFMPIGVQADFTNNL------------LD-RQRVIQPSQ--AGGFSTGFSINQ-FLSTRGAI 269 (479) Q Consensus 210 aa~~-i~~~---fG~atd~~mp~~vka~f~q~~------------~~-~qrv~~~~n--~g~~~~G~~V~~-~~ss~g~I 269 (479) +++. ..+. -|.+ ...|++.+..+..-.+ ++ .-.|+..++ .+....|. -.. +..-++.+ T Consensus 258 ~~~~~~~~~~~~~~~~-~~~mn~~t~~~~~g~~~~~~~~G~~~~~lg~g~~v~~~~~~p~~~i~fgd-fs~y~i~~r~~~ 335 (395) T protein:vir:95 258 LSVDEKGKELKIDGKV-ALVVNPRDSWDVQARYTYLTANGGFVTVLPYNVTIITSEFVPEGKLVAFV-TDRYNAVRGGGL 335 (395) T ss_pred hccccccchhhhcCce-EEEEcchhhhhcCCcceeccCCCcceeccCCcceEEEcCCCCCCcEEEEe-cccEEEEEecce Confidence 1100 0000 0111 0122222222211110 00 001111110 00000000 000 00001111 Q ss_pred Ee--------------------cCCcccCCCccccCcccCCCCCcccceEEEeecccccCccccc Q lcl|NC_018863. 270 NL--------------------HGSTIMENDNILVDRIPEPNAPQAPASVVATVKVNDKGAFRPV 314 (479) Q Consensus 270 ~L--------------------~~s~v~~a~~~lver~~s~~aP~~P~~vta~~~~~~~g~~~~~ 314 (479) ++ .+-.+.+..-+.+-...-..+|..++.+- ++.+| |+-+ T Consensus 336 ~i~~~~~~~~~~d~~~f~~~~r~dg~~~~~~A~~~l~i~~~~~~~~~~~~~----~~~~~-~~~~ 395 (395) T protein:vir:95 336 TVKKFDQTLALEDAVLFTAKTFAYGQPDDNKASAVYDLKVASAPRRQTSAG----GTTDG-IAEA 395 (395) T ss_pred EEEeccchhhhCCcEEEEEEEEECCEEeccccEEEEEeeccCCCCCCCCCC----CCCCc-cccC Confidence 11 11111112211111122222232222111 11111 1111 No 112 >protein:vir:4092 Length: 390 # NCBI annotation: major capsid protein a # Family: family:all:635 # MgeID: mge:86 # MgeName: 2389 # Cross-refs: genbank:acc:NP_510986;swissprot:trembl:q8w604;genbank:gi:17488508;uniprot:Q8W604;genbank:GeneID:1260361 Probab=75.78 E-value=0.14 Score=25.22 Aligned_cols=321 Identities=12% Similarity=-0.009 Sum_probs=120.1 Q ss_pred CcccccccceeeeecCc-hhHHH--------------HHHHHHHHhhcCcccCcccccCccccchhhhHHHHHHHhhccc Q lcl|NC_018863. 1 MTELQKEQKVEARKLPA-GAEAE--------------LAELVSKSFTTGTGITPDTQHDAAALRRELLDDQVKMLAFTNG 65 (479) Q Consensus 1 ~~~~~~~~~~~~~~~~~-~~~~~--------------~~e~~~Ksf~ag~~~~~~~~~~gaAlr~esld~~i~~l~~~~~ 65 (479) +.+..+...=...+-+. ....+ +-+...|.+++- ....+-++|+.|-.+.+..+|..+..... T Consensus 35 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~r~~~~~~--~~~~~~~~gg~lvP~~~~~~I~~~~~~~s 112 (390) T protein:vir:40 35 FTNMAEQIQNNIIAQARKEVNREMNDNNVLASRGANALTSDESKYYNEV--IAGNGFAGVTALLPPTVFERVFEDLTVEH 112 (390) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCchhccHHHHHHHHHH--HhccCcccCcccccHHHHHHHHHHHHhhh Confidence 00000000000000000 00000 000012222110 01112357888999999988866655544 Q ss_pred cccchhhhccchhHHHHHHhhhhhccCccccccccccccc-ccccCcceEEEEEEEEeeeehhhhhhhHhhhcchhhHHH Q lcl|NC_018863. 66 DFTIYPLINKQQVNSTVAKYAVFNQHGRTGHSRFVREVGV-ASINDPNIRQKTVQMKFLSDTKQQSLAAGLVNNIADPMT 144 (479) Q Consensus 66 ~f~~~~~i~k~~~~stv~~y~~~~~~G~~g~~~fv~E~g~-~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lv~~~~dp~~ 144 (479) . +++.+...++.+.-. .+....+.+...++.|++. ++..++.+.+.....+=++.-..+|.-+ +.++..|.+. T Consensus 113 ~--i~~~~~~~~~~~~~~---~i~~~~~~~~a~~~~E~~~~~~~~~~~f~~i~l~~~k~~~~i~iS~el-l~ds~~~l~~ 186 (390) T protein:vir:40 113 P--LLSKINFVNTTATTE---WIISVGDVATAWWGPLCAEIKEVLDNGFDKIQTGMYKLSAYIPVCNAM-LDLGPSWLDQ 186 (390) T ss_pred h--hhhhceeeecCCcee---EEEEEcCCcceeeeccccccCccccccceeeEeeeeeEEEeehhhHHH-HhcchHHHHH Confidence 3 455555544444222 2334555666778899776 4578999999999999888877777433 2355668889 Q ss_pred HHHHHHHHHHHHHHHHHHhhcccccCCCCCCcccchhhhHHHhhccCCc--EEEccCCCCCHHHh-------hhhh-hee Q lcl|NC_018863. 145 ILTEDAISVIAKSIEWAIFYGDAALAAEADNQAGIEFDGLTKLIDEATN--VIDLKGERLDEATL-------NKAA-VIV 214 (479) Q Consensus 145 ~~~~~ai~~~~~~~e~a~f~Gd~~l~~~~~~~~gleFDGl~~~I~~~~N--viDarG~~l~~~~l-------~~aa-~~i 214 (479) ...+.-...++..++.++++|+-.=. -.|+++.+..... ..+.....++-..+ ..+- ..- T Consensus 187 ~i~~~la~~i~~~~~~a~l~G~G~~~----------P~Gil~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~l~~~~~~~~ 256 (390) T protein:vir:40 187 YVRTILGEAMALGLEAGIVNGSGKDQ----------PIGMMRDLNNVTAGEHPVKTATPLTDLTPATLATKVMLPLTDNG 256 (390) T ss_pred HHHHHHHHHHHHHHHhhhhcccCCCc----------cceeeeccccccccccccccccccchhhHHHHHHHHHHHhhcch Confidence 99999999999999999999985311 1366654421110 11111122221111 1111 111 Q ss_pred ecccCceeeeecChHHHhhHHH---hhcCce-eEEeecCCCccccCccc--cceecCceeEEe-cCC--cccCCCccccC Q lcl|NC_018863. 215 GKGYGRATDAFMPIGVQADFTN---NLLDRQ-RVIQPSQAGGFSTGFSI--NQFLSTRGAINL-HGS--TIMENDNILVD 285 (479) Q Consensus 215 ~~~fG~atd~~mp~~vka~f~q---~~~~~q-rv~~~~n~g~~~~G~~V--~~~~ss~g~I~L-~~s--~v~~a~~~lve 285 (479) .+.++.+. .+|++.+...+-. .+.+.. +-+.+.. ..|..| ...+.. +.|-+ .++ .+.+..++.++ T Consensus 257 ~~~~~~a~-~i~n~~t~~~~l~~~~~~~d~~G~~v~~~~----~~g~pvv~~~~~p~-~~i~~Gd~s~~~i~~~~~~~v~ 330 (390) T protein:vir:40 257 KKSVSDAI-LVINPADYWSKIYAATSYMTPQGVWVTGIL----PVPLEIVQSVAVPV-GKAVAGRAKDYFMGIGSEQVIR 330 (390) T ss_pred hhhhcCce-EEEcchhHHHHHHHHhhccCCCCccccccC----CCceeEEEcCCCCC-CcEEEEeeceEEEEeecceEEE Confidence 11222221 3456554322111 111100 0000000 011111 000000 00000 000 00000111111 Q ss_pred cccCCCCCcccceEEEeecccccCcccccccceeeEEEEEEEcCCCCcccccceeeeeecCCCeEE Q lcl|NC_018863. 286 RIPEPNAPQAPASVVATVKVNDKGAFRPVKDIKTHSYKVVVHSDDAESLASEAVTAVVANPTDSVS 351 (479) Q Consensus 286 r~~s~~aP~~P~~vta~~~~~~~g~~~~~sd~g~Y~YkV~a~n~~GES~~S~~VtaT~a~~~~~V~ 351 (479) +..+ .-+..-.+.--..--..|.....+ +....++++.+.+.- .+-..++.++.....+ + T Consensus 331 ~~~~--~~f~~~~~~~r~~~r~dg~v~~~~--A~~~l~~~~~~~~~~-~~~~~~~~~~~~~~~~-~ 390 (390) T protein:vir:40 331 TSTE--YRLLDDETLYYAKQYANGRPKDNS--SFLVFDITGLEGSPA-IDVNVVNNATPSETPA-E 390 (390) T ss_pred ecch--hhhhcCcEEEEEEEEeCCEEeccc--ceEEEEeeccCCCCC-CCcceeeCCCCCCCCC-C Confidence 0000 000000000000011111111110 123333443332111 1111111111111111 1 No 113 >protein:vir:104342 Length: 314 # NCBI annotation: hypothetical protein # Family: family:all:463 # MgeID: mge:1593 # MgeName: RTP # Cross-refs: genbank:acc:YP_398971;genbank:gi:81343955;genbank:GeneID:3778874 Probab=75.54 E-value=0.089 Score=26.34 Aligned_cols=269 Identities=8% Similarity=0.033 Sum_probs=117.5 Q ss_pred eecCchh-HHHHHHHHHHHhhcCcccCcccccC-ccccch--hhhHHHHHHHhhccccccchhhhccchhHHHHHHhh-- Q lcl|NC_018863. 13 RKLPAGA-EAELAELVSKSFTTGTGITPDTQHD-AAALRR--ELLDDQVKMLAFTNGDFTIYPLINKQQVNSTVAKYA-- 86 (479) Q Consensus 13 ~~~~~~~-~~~~~e~~~Ksf~ag~~~~~~~~~~-gaAlr~--esld~~i~~l~~~~~~f~~~~~i~k~~~~stv~~y~-- 86 (479) ..|.... ++.+..+++.... .+... |+-|-+ |.+|+++....+. +++.-+.|+ +.+.+.+|. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~-------~~~d~~~~fl~~ql~~id~~v~e~~~~--~~~~~~~i~---v~~~~~~~~et 68 (314) T protein:vir:10 1 MAIKFDAEQAKITTHLEQMGV-------EKADAAGIWAVSQLTAALNRAYEKEYA--ENSVVNIFP---VTNEIPGHAKY 68 (314) T ss_pred CccchHHHHHHHHHHHHhhcc-------cchhhhHHHHHHHHHHHHHHHhhhhcc--ccccceeec---cccCCCCceeE Confidence 3333332 2222222222221 23333 344444 4666666443322 233333333 112222211 Q ss_pred -hhhccCcccccccccc-cccccccCcceEEEEEEEEeeeehhhhhhhHhh--hcchhhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_018863. 87 -VFNQHGRTGHSRFVRE-VGVASINDPNIRQKTVQMKFLSDTKQQSLAAGL--VNNIADPMTILTEDAISVIAKSIEWAI 162 (479) Q Consensus 87 -~~~~~G~~g~~~fv~E-~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~l--v~~~~dp~~~~~~~ai~~~~~~~e~a~ 162 (479) .+......|....++. .+..+..|.++.|++..+..++....++..--. ...-.+..+.....|.+.+.+.+-..+ T Consensus 69 ~~~~~~e~~G~a~~~~d~~~dip~vd~~~~~~~~~i~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aA~~~~~~~~n~i~ 148 (314) T protein:vir:10 69 FEYPEFDGVGIAQIIADYSDDLPLVDAFMTEKQGKVFRFGNAFLISTDEIKAGAATGQSLSARKQALAFEAHDNLLDKLV 148 (314) T ss_pred EEeeeeccccceeeeCCcccccceeecccceeEEEEEEEEeeEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEE Confidence 2223345555555564 444678899999999999999999999754222 233346677888888888899999999 Q ss_pred hhcccccCCCCCCcccchhhhHHHhhccCCcEEEccCCCCC----HHHhhhh---hheeecccCceeeeecChHHHhhHH Q lcl|NC_018863. 163 FYGDAALAAEADNQAGIEFDGLTKLIDEATNVIDLKGERLD----EATLNKA---AVIVGKGYGRATDAFMPIGVQADFT 235 (479) Q Consensus 163 f~Gd~~l~~~~~~~~gleFDGl~~~I~~~~NviDarG~~l~----~~~l~~a---a~~i~~~fG~atd~~mp~~vka~f~ 235 (479) ||||+.++ +-||++-=. -...-+.+.--+ .+.|+++ ....++++-.|+.+.||+.-.+.+. T Consensus 149 f~G~~~~g----------~~GLlN~p~--v~~~~~~~~WaT~~ei~~Di~~~~~~l~~~s~g~~~p~~l~Lpp~~~~~L~ 216 (314) T protein:vir:10 149 WSGSAPHG----------IVSVFDQPN--INNVVATPNWSVPQNAIDDVTAMIDAVESSTQGLHHVTDILLPASARRVMQ 216 (314) T ss_pred Eeeccccc----------ceeEeecCC--CccccCCCCcccHHHHHHHHHHHHHHHHHhcCccccceeEEecHHHHHhhc Confidence 99988754 345553110 000001111111 2234432 3445667888999999998777665 Q ss_pred HhhcCcee-------------EEeec----CCCccccCcc-ccceecCceeEEecCCccc-------CCCccccCcccCC Q lcl|NC_018863. 236 NNLLDRQR-------------VIQPS----QAGGFSTGFS-INQFLSTRGAINLHGSTIM-------ENDNILVDRIPEP 290 (479) Q Consensus 236 q~~~~~qr-------------v~~~~----n~g~~~~G~~-V~~~~ss~g~I~L~~s~v~-------~a~~~lver~~s~ 290 (479) ........ .|... +.|. -|.+ .-.+....-.+.++-+.-+ +.-.+.+.. ... T Consensus 217 ~~~~~~~~tvl~~l~~n~~~l~I~~~~el~~ag~--~g~~~~v~y~~~~~~~~~~vp~~~~~l~~e~~~~~~~~~~-~~r 293 (314) T protein:vir:10 217 GLVPQTNLSYGELFTRNNPGLTIRFLQFLDNYDG--AGGKAALAFEKSPLNMSIEIPEVTNVLPAQPKDLHFRYPV-TSK 293 (314) T ss_pred ccccCCCccHHHHHHHhCCCcEEEEcccccccCC--CcceEEEEEecCCcEEEEecCccceeecceecCceEEEcc-eee Confidence 33211000 01110 0110 0000 0011111111222211000 011111100 000 Q ss_pred CC---CcccceEEEeecccccCcccc Q lcl|NC_018863. 291 NA---PQAPASVVATVKVNDKGAFRP 313 (479) Q Consensus 291 ~a---P~~P~~vta~~~~~~~g~~~~ 313 (479) .+ -.-|-+..-.. |=-|. T Consensus 294 ~~Gv~i~~P~ai~~~d-----GI~~~ 314 (314) T protein:vir:10 294 ATGLIVYRPLTMAVIK-----GITFA 314 (314) T ss_pred eEEEEEECcceeEeee-----eeecC Confidence 00 01122222111 11111 No 114 >protein:vir:6212 Length: 434 # NCBI annotation: prohead protease # Family: family:all:21 # MgeID: mge:128 # MgeName: phBC6A52 # Cross-refs: genbank:acc:NP_852592;genbank:gi:31415852;genbank:GeneID:1489210 Probab=75.40 E-value=0.15 Score=25.15 Aligned_cols=307 Identities=11% Similarity=0.057 Sum_probs=129.2 Q ss_pred Ccc-----cccccceee------------------eecCchhHHHHHHHHHHHhh---cC-cc------cCcccccCccc Q lcl|NC_018863. 1 MTE-----LQKEQKVEA------------------RKLPAGAEAELAELVSKSFT---TG-TG------ITPDTQHDAAA 47 (479) Q Consensus 1 ~~~-----~~~~~~~~~------------------~~~~~~~~~~~~e~~~Ksf~---ag-~~------~~~~~~~~gaA 47 (479) .++ ..+++.-+. .....+....- ..+.++|. .| .. .+. +-.+|+. T Consensus 75 ~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~-~e~r~a~~~~l~~~~~~~e~~a~~~-~t~~GG~ 152 (434) T protein:vir:62 75 DPEKKEDPTAKENPNEKTELSEEQRSAISASIAAALSTKGHRTNKE-TEIRSVFANYIVGNIDEKEARALGL-VTGNGSV 152 (434) T ss_pred hhhhhcchhhhcchhhhHHHHHHHHHHHHHHHHhhhhhccccchHH-HHHHHHHHHHhccccchhhhhhhcc-cccccce Confidence 000 000000000 00000000000 01223322 11 00 011 1135788 Q ss_pred cchhhhHHHHHHHhhccccccchhhhccchhHHHHHHhhhhhccCcccccccccccccccccCcceEEEEEEEEeeeehh Q lcl|NC_018863. 48 LRRELLDDQVKMLAFTNGDFTIYPLINKQQVNSTVAKYAVFNQHGRTGHSRFVREVGVASINDPNIRQKTVQMKFLSDTK 127 (479) Q Consensus 48 lr~esld~~i~~l~~~~~~f~~~~~i~k~~~~stv~~y~~~~~~G~~g~~~fv~E~g~~~~~d~~~~r~~~~~k~l~~~~ 127 (479) |..+.+...|..+...... +.+-..+....+. .+|.++...+.........|++..+..|+.+.+.....+=++.-. T Consensus 153 lvP~~~~~~Ii~~l~~~~~--i~~~~~~~~~~~~-~~~p~~~~~~~a~~~~~~~e~~~~~~~~~~f~~v~~~~~k~~~~~ 229 (434) T protein:vir:62 153 TIPDFLSKEIITYAQEENF--LRRLGTGVKTKEN-IKYPVLVKKAEAQGHKNERTNNEMPETDIEFDEIELSPTEFDALA 229 (434) T ss_pred ecchhhHHHHHHhhhhhhh--hhhhcceeccCCc-eEEEEEecCCcccceecccccccccccccceeeEEeeheeeEeeh Confidence 8999998888766554432 2232333233333 345555444443333445678888899999999999999988877 Q ss_pred hhhhhHhhhcchhhHHHHHHHHHHHHHHHHHHHHHhhcccccCCCCCCcccchhhhHHHhhccCCcEEEccCCCCCHHHh Q lcl|NC_018863. 128 QQSLAAGLVNNIADPMTILTEDAISVIAKSIEWAIFYGDAALAAEADNQAGIEFDGLTKLIDEATNVIDLKGERLDEATL 207 (479) Q Consensus 128 ~vs~~~~lv~~~~dp~~~~~~~ai~~~~~~~e~a~f~Gd~~l~~~~~~~~gleFDGl~~~I~~~~NviDarG~~l~~~~l 207 (479) .+|.-+ +.++.-|.+....+.-...+++.++.+++.||-.=.+ ..|+... +.......+...-.+++ T Consensus 230 ~iS~el-l~ds~~~l~~~i~~~la~~~~~~~d~~~l~G~G~~~~---------~~g~~~~---~~~~~~~~~~~~~d~l~ 296 (434) T protein:vir:62 230 TVTKKL-LARTGLPIEQIVMDELKKAYVRKETQYMVNGDEANNI---------NDGALAK---KAVEFKTDEKNLYDALV 296 (434) T ss_pred hhHHHH-HhcchHHHHHHHHHHHHHHHHHHHHHHHhccCCCCcc---------ccceeec---ccccccccccchhhHHH Confidence 777653 2344557888888899999999999999999865222 2343321 12222233333223333 Q ss_pred hhhhheeecccCceeeeecChHHHhhHHHhhcC-ceeEEeecCC---Cc--cccCccccceecC------ceeEEecCCc Q lcl|NC_018863. 208 NKAAVIVGKGYGRATDAFMPIGVQADFTNNLLD-RQRVIQPSQA---GG--FSTGFSINQFLST------RGAINLHGST 275 (479) Q Consensus 208 ~~aa~~i~~~fG~atd~~mp~~vka~f~q~~~~-~qrv~~~~n~---g~--~~~G~~V~~~~ss------~g~I~L~~s~ 275 (479) +.-.. +...|..-...+|++.+.+.+...-.. ++-+++|.+. |. .-.|.+|.-.... .-.+-+.| T Consensus 297 ~l~~~-l~~~~~~~a~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~~~g~~~tl~G~pV~~~~~~~~~~~~~~~~i~~G-- 373 (434) T protein:vir:62 297 KMKNT-PVKEVRKKARWVLNTAALTKIETMKTDDGFPLLRPFNQAEGGIGYTLLGFPVEEEDAIDIPDSPDTPVFYFG-- 373 (434) T ss_pred HHHhh-cchhhhcCCEEEEcHHHHHHHHHhhccCCCEeeccCCCccCCCCceecceeeEEecCccCccCCCceEEEEe-- Confidence 32222 233443333468888888877654433 2333343221 11 1234444111000 00001100 Q ss_pred ccCCCcc-ccCccc-----CCCCCcccceEE-EeecccccCcccc-cccceeeEEEEEEEcCC Q lcl|NC_018863. 276 IMENDNI-LVDRIP-----EPNAPQAPASVV-ATVKVNDKGAFRP-VKDIKTHSYKVVVHSDD 330 (479) Q Consensus 276 v~~a~~~-lver~~-----s~~aP~~P~~vt-a~~~~~~~g~~~~-~sd~g~Y~YkV~a~n~~ 330 (479) |-..+ ..+|.- ..+..+.-..-+ --..--..|+... ......|++++++.... T Consensus 374 --dfs~~~i~~~~g~~~i~~~~~~~~~~~~v~~~~~~r~Dgk~i~~~~~~~~~~~~~~~~~~~ 434 (434) T protein:vir:62 374 --DFSKFYIQDVIGSLEVQKLVELFSRTNRVGFRIWNLLDAQLIHSPFEVPVYKYVLKAPTGA 434 (434) T ss_pred --eccceEEEEeeceeEEEeehhhhcccCceEEEEEeeecceeecCcccceEEEEEeccCCCC Confidence 00010 111100 000011000000 0000000111100 11112233333322222 No 115 >protein:vir:103285 Length: 296 # NCBI annotation: hypothetical protein # Family: family:all:463 # MgeID: mge:1605 # MgeName: JK06 # Cross-refs: genbank:acc:YP_277465;genbank:gi:71834107;genbank:GeneID:3562396 Probab=73.95 E-value=0.16 Score=24.89 Aligned_cols=254 Identities=12% Similarity=0.095 Sum_probs=117.1 Q ss_pred cCcccccCccccch---hhhHHHHHHHhhccccccchhhhccchhHHHHHHhhh---hhccCccccccccc-cccccccc Q lcl|NC_018863. 37 ITPDTQHDAAALRR---ELLDDQVKMLAFTNGDFTIYPLINKQQVNSTVAKYAV---FNQHGRTGHSRFVR-EVGVASIN 109 (479) Q Consensus 37 ~~~~~~~~gaAlr~---esld~~i~~l~~~~~~f~~~~~i~k~~~~stv~~y~~---~~~~G~~g~~~fv~-E~g~~~~~ 109 (479) .+-|...+++++-. |.+|+++....+. +++.-+.++ +.+.+..|.+ +....+.|....++ +....+.. T Consensus 1 ~~~~~a~~~~~f~~~ql~~id~~v~e~~~~--~l~~~~~i~---v~~~~~~~~~~~~~~~~~~~G~a~~~~~~~~dip~v 75 (296) T protein:vir:10 1 MGVDKADAAGIWTVKQLTASLNKAYETEYD--QNSVVNLFP---VSNEIPGYAKYFEYPVFDGVGIAQIVADYTDDLPLV 75 (296) T ss_pred CcccchhhhHHHHHHHHHHHHHHHHhhhhc--ccccceecc---cccCCCCceeEEEeeeeeccCceeEeCCCcccccee Confidence 44444456666666 4556666443332 233222222 1111222111 12233444444444 44556788 Q ss_pred CcceEEEEEEEEeeeehhhhhhhHhh---hcchhhHHHHHHHHHHHHHHHHHHHHHhhcccccCCCCCCcccchhhhHHH Q lcl|NC_018863. 110 DPNIRQKTVQMKFLSDTKQQSLAAGL---VNNIADPMTILTEDAISVIAKSIEWAIFYGDAALAAEADNQAGIEFDGLTK 186 (479) Q Consensus 110 d~~~~r~~~~~k~l~~~~~vs~~~~l---v~~~~dp~~~~~~~ai~~~~~~~e~a~f~Gd~~l~~~~~~~~gleFDGl~~ 186 (479) |.++.|++..+..++..+.++.. ++ ...-.+..+.....|.+.+.+.....+||||+.+. +-||++ T Consensus 76 ~~~~~~~~~~i~~~~~~~~~~~~-El~~a~~~g~~l~~~ka~aA~~~~~~~~n~~~f~G~~~~g----------~~GLlN 144 (296) T protein:vir:10 76 DALATERQGKVFRFGNAFLISID-EIKVGQATGQSLSTRKQSLAFEAHDKLLDKLVWSGSTAHG----------IPSVFD 144 (296) T ss_pred eccceeEEEEEEEEEeeeeecHH-HHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEeeccccc----------ceeEee Confidence 99999999999999999888743 33 33344677778888889999999999999988754 335544 Q ss_pred hhccCCcEEEccCC--CCC--HHHhhhhh---heeecccCceeeeecChHHHhhHHHhhcCce----eEEeecCCCc--- Q lcl|NC_018863. 187 LIDEATNVIDLKGE--RLD--EATLNKAA---VIVGKGYGRATDAFMPIGVQADFTNNLLDRQ----RVIQPSQAGG--- 252 (479) Q Consensus 187 ~I~~~~NviDarG~--~l~--~~~l~~aa---~~i~~~fG~atd~~mp~~vka~f~q~~~~~q----rv~~~~n~g~--- 252 (479) -=. -....+.|. -.+ .+.|+++- ...++++=.++.+.||+.....+..-+.+.- ..+..++++. T Consensus 145 ~p~--v~~~~~~~~W~~~t~i~~Di~~~~~~l~~~s~g~~~p~~l~L~p~~~~~L~~~~~~~~~t~l~~ik~~~~~l~i~ 222 (296) T protein:vir:10 145 YPN--INNVVSGGSWSQPTTAVSDITSLLDIIETSTNGQHRATHLLLPTTARRIMQNLVPGTSVSYGEFFRQNNSGVTVE 222 (296) T ss_pred cCC--CccccccCCccCHHHHHHHHHHHHHHHHHhhCceecceeEEeCHHHHHHHhhccCCCCccHHHHHHHhcCCceEE Confidence 210 012222221 111 22344433 4456678888999999999888865432110 0001111110 Q ss_pred --------cccCccc-cceecCceeEEecCCcccCC------CccccCcccCCCC---CcccceEEEeecccccCcccc Q lcl|NC_018863. 253 --------FSTGFSI-NQFLSTRGAINLHGSTIMEN------DNILVDRIPEPNA---PQAPASVVATVKVNDKGAFRP 313 (479) Q Consensus 253 --------~~~G~~V-~~~~ss~g~I~L~~s~v~~a------~~~lver~~s~~a---P~~P~~vta~~~~~~~g~~~~ 313 (479) ...|-+. -.+....-.+.++-..-+.. +-.+..+.....+ -.-|-+..... |=-|. T Consensus 223 ~~~~l~~a~~~g~~~~v~~~~~~~~~~~~v~~~~~~~~~e~~~l~~~~~~~~~~~Gv~i~~P~ai~~~d-----GI~~~ 296 (296) T protein:vir:10 223 FVQYLNDYNGTGTSAAIAYEKDPNNMAIEIPEATNALPAQPKDLHFKIPVTSKATGLIVYRPLTMAVMK-----GITFA 296 (296) T ss_pred EeeeeccCCCCcceEEEEEEcCCceEEEEcCcceeeecccccCceEEEeeEeeEEEEEEECCceeEEEe-----eeecC Confidence 0001000 01111111222221111000 0000011111111 00122222111 11111 No 116 >protein:vir:9643 Length: 377 # NCBI annotation: major coat protein # Family: family:all:635 # MgeID: mge:173 # MgeName: 315.1 # Cross-refs: genbank:acc:NP_795405;genbank:gi:28876178;genbank:GeneID:1257724 Probab=73.74 E-value=0.14 Score=25.27 Aligned_cols=292 Identities=13% Similarity=0.008 Sum_probs=122.7 Q ss_pred Ccccccc----cc--e--eee---ecCchhHHHHHHHHHHHhhcCcccCcccccCccccchhhhHHHHHHHhhccccccc Q lcl|NC_018863. 1 MTELQKE----QK--V--EAR---KLPAGAEAELAELVSKSFTTGTGITPDTQHDAAALRRELLDDQVKMLAFTNGDFTI 69 (479) Q Consensus 1 ~~~~~~~----~~--~--~~~---~~~~~~~~~~~e~~~Ksf~ag~~~~~~~~~~gaAlr~esld~~i~~l~~~~~~f~~ 69 (479) +.++.++ .+ . ... ..+.-...+ -+++.+..+. .+-.+|+.|.++.+.+.|...... ...+ T Consensus 39 ~~~~~~~~~~~~~~e~~~~~~~~~~~~~lt~ee-~~~~~~~~~~------~~~~~gg~lvP~~~~~~I~~~l~~--~s~i 109 (377) T protein:vir:96 39 FTTMGDEILAKNEEEMERMFDLRDKNRELTAEE-IKFFNDIDKN------VGGKDKFKLLPEETMVQVFDDLVA--EHPL 109 (377) T ss_pred HHHHHHHHHHHHHHHHHHHHHhccCCcccCHHH-HHHHHHHHhc------CCCCCCceecCHHHHHHHHHHHHh--hhhh Confidence 1111000 00 0 000 000000000 0112222221 234678888888887777543322 2344 Q ss_pred hhhhccchhHHHHHHhhhhhccCccccccccccccc-ccccCcceEEEEEEEEeeeehhhhhhhHhhhcchhhHHHHHHH Q lcl|NC_018863. 70 YPLINKQQVNSTVAKYAVFNQHGRTGHSRFVREVGV-ASINDPNIRQKTVQMKFLSDTKQQSLAAGLVNNIADPMTILTE 148 (479) Q Consensus 70 ~~~i~k~~~~stv~~y~~~~~~G~~g~~~fv~E~g~-~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lv~~~~dp~~~~~~ 148 (479) ++.+....+.+.+ ++......+.+.++.|.+. ++..++.+.+.....+=|+.--.+|..+ |.++..|.+....+ T Consensus 110 ~~~~~v~~~~~~~----~i~~~~~~~~a~wv~e~~~~~~~~~~~f~~i~l~~~kl~~~~~is~~l-l~ds~~~le~~i~~ 184 (377) T protein:vir:96 110 LKVINFKNTSLRL----KALTAETSGTAVWGDIFGEIKGQLKQAFKEQDFSQFKLTAFVVIPKDA-LKFGPKWLKQFITE 184 (377) T ss_pred hhhceeEecCCce----EEEEecCCcceeEeecccccccccCccceeEeeeeeeEEeechhhHHH-hhcchhhHHHHHHH Confidence 5544444443332 2344455556778899876 5678999999999999998877777665 56778899999999 Q ss_pred HHHHHHHHHHHHHHhhcccccCCCCCCcccchhhhHHHhhccCC----------cEEEc---cC--CCCCHHHhhh-hh- Q lcl|NC_018863. 149 DAISVIAKSIEWAIFYGDAALAAEADNQAGIEFDGLTKLIDEAT----------NVIDL---KG--ERLDEATLNK-AA- 211 (479) Q Consensus 149 ~ai~~~~~~~e~a~f~Gd~~l~~~~~~~~gleFDGl~~~I~~~~----------NviDa---rG--~~l~~~~l~~-aa- 211 (479) .--.++++.++.+++.||-.=- --||++-+.... -+++. -| ..++.+.+-+ .. T Consensus 185 ~l~~~~~~~~~~a~i~G~G~~~----------P~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 254 (377) T protein:vir:96 185 QLKEAIAVALELAIVKGNGLLQ----------PVGLLKDLSQPTVDQSTGRDITTYKTDKEAIADLSDLDPDTAVELLVP 254 (377) T ss_pred HHHHHHHHHHhhceEeccCCCc----------ceeeeeccccccccccccccccceeeccccccccccCChhHHHHHHHH Confidence 9999999999999999996421 237776553111 01111 11 1123332221 11 Q ss_pred --hee-ecccCceee------eecChHHHhhHHHhhcCceeEEeecCCCccc--cCccccceecC---ceeEEecCC--- Q lcl|NC_018863. 212 --VIV-GKGYGRATD------AFMPIGVQADFTNNLLDRQRVIQPSQAGGFS--TGFSINQFLST---RGAINLHGS--- 274 (479) Q Consensus 212 --~~i-~~~fG~atd------~~mp~~vka~f~q~~~~~qrv~~~~n~g~~~--~G~~V~~~~ss---~g~I~L~~s--- 274 (479) ... ..+.|.+.. +.|++.+..+.. .+...+++ .|.+. .|+++.-..+. .|.|-+ |+ T Consensus 255 l~~~~~~~~~~~~~~~~~~a~~~mn~~t~~~~~-----~~~~~~~~-~G~~~~~l~~p~~v~~s~~~p~~~i~f-gdf~~ 327 (377) T protein:vir:96 255 VMKHLSVNDKKHPLKIAGQVKLLLNPEDRWTLE-----AKFTSRNQ-FGEYVTVLPHGITILESLAVETGKAIA-FVANR 327 (377) T ss_pred HHHhhccccccccccccCceEEEEchhhHHhcc-----ccccccCC-CCCceeccCCCceEEecCCCCcccEEE-EEcCc Confidence 111 112233322 447776655432 22222222 22221 22222111111 111111 10 Q ss_pred -cccCCCccccCcccCCCCCcccceEEEeecccccCcccccccceeeEEEEEEE Q lcl|NC_018863. 275 -TIMENDNILVDRIPEPNAPQAPASVVATVKVNDKGAFRPVKDIKTHSYKVVVH 327 (479) Q Consensus 275 -~v~~a~~~lver~~s~~aP~~P~~vta~~~~~~~g~~~~~sd~g~Y~YkV~a~ 327 (479) .+.++.+..+++..+.-.-.--+.-.+. .=..|.-...+ + --.+.|+.. T Consensus 328 Y~i~~r~~~~i~~~~~~~~~~d~~~f~~~--~r~dG~~~d~~-a-~~vl~l~~~ 377 (377) T protein:vir:96 328 YDAFMATASTIEEYDQTFAMEDLQLYLTK--NYFYGKAKDNH-T-AALLTLAGG 377 (377) T ss_pred EEEEEecccEEEeehhhhhhcCCeEEEEE--EEEcCEEecCC-c-EEEEEEecC Confidence 1111222222211110000000000000 00000000000 0 011111111 No 117 >protein:vir:80128 Length: 466 # NCBI annotation: Phage capsid protein # Family: family:all:635 # MgeID: mge:1877 # MgeName: bacteriophage bv1 # Cross-refs: genbank:acc:YP_001425603;genbank:gi:155042936;genbank:GeneID:5469556 Probab=72.75 E-value=0.18 Score=24.68 Aligned_cols=324 Identities=11% Similarity=0.072 Sum_probs=138.6 Q ss_pred Ccccc-------cccceeeeecC---chhHHHHHHHHHHH--------------hhcCc------ccCcccccCccccch Q lcl|NC_018863. 1 MTELQ-------KEQKVEARKLP---AGAEAELAELVSKS--------------FTTGT------GITPDTQHDAAALRR 50 (479) Q Consensus 1 ~~~~~-------~~~~~~~~~~~---~~~~~~~~e~~~Ks--------------f~ag~------~~~~~~~~~gaAlr~ 50 (479) +.+++ .+......... ...+... ..+.|. +...+ .....+..+|+++-+ T Consensus 84 l~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~vP 162 (466) T protein:vir:80 84 LEQLNNKEPKNNSEPAQVSGARTQQFVGGETRM-KGFFRNMPYEQRAALIARSEVKEFLAQVRTLAQQKRAVSGAELTIP 162 (466) T ss_pred HHHHHHhhhccCchhHHHHhhhhhHHhhHHHHH-HHHHHhhhhhhHHHHHHHHHHHHHHHHHHHHhhhhhhhcccccccc Confidence 11111 00000000000 0000000 000010 00000 111223355667777 Q ss_pred hhhHHHHHHHhhccccccchhhhccchhHHHHHHhhhhhccCcccccccccccccccccCcceEEEEEEEEeeeehhhhh Q lcl|NC_018863. 51 ELLDDQVKMLAFTNGDFTIYPLINKQQVNSTVAKYAVFNQHGRTGHSRFVREVGVASINDPNIRQKTVQMKFLSDTKQQS 130 (479) Q Consensus 51 esld~~i~~l~~~~~~f~~~~~i~k~~~~stv~~y~~~~~~G~~g~~~fv~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs 130 (479) |.+-..|....... ..+.+.+...++..++ ++ ..++....+.++.|++..+..|+.+.+....++=++.--.+| T Consensus 163 ~~~~~~i~~~l~~~--~~l~~~~~v~~~~g~~-~~---~~~~~~~~a~wv~E~~~~~~~~~~f~~i~~~~~k~~~~~~iS 236 (466) T protein:vir:80 163 DVMLELLRDNMHRY--SKLISKVRLRPLKGTA-RQ---NIAGAIPEGVWTEAVANLNELSLSFSQIEVDGYKVGGFIPIP 236 (466) T ss_pred HHHHHHHHHhhhhh--hhhhhheeeeecCcee-Ee---eeecCCcceeecccccccccccccccceeecceeeeeehhhh Confidence 77766664443322 2345555444444332 22 334444457789999999999999999888888777766666 Q ss_pred hhHhhhcchhhHHHHHHHHHHHHHHHHHHHHHhhcccccCCCCCCcccchhhhHHHhhccC------------------C Q lcl|NC_018863. 131 LAAGLVNNIADPMTILTEDAISVIAKSIEWAIFYGDAALAAEADNQAGIEFDGLTKLIDEA------------------T 192 (479) Q Consensus 131 ~~~~lv~~~~dp~~~~~~~ai~~~~~~~e~a~f~Gd~~l~~~~~~~~gleFDGl~~~I~~~------------------~ 192 (479) .-+- .++..|.+....+.-...++..++.+++.||-.=.| -|+++.+... . T Consensus 237 ~ell-~ds~~~l~~~i~~~la~~~~~~~~~ail~G~G~~~P----------~Gil~~~~~~~~~~~~~~~~~~~~~~~~~ 305 (466) T protein:vir:80 237 NSTL-EDSDLNLADEILDAIGQAIGFALDKAILYGTGTKMP----------VGIVTRLAQTTQPPNWGTKAPAWTNLSTT 305 (466) T ss_pred HHHH-hcchHHHHHHHHHHHHHHHHHHHhhheeeccCCCCc----------ceeeecccccccccccccccccccccchh Confidence 5432 356678888999999999999999999999864211 2655443210 0 Q ss_pred cEEEcc--CCCCC--HHHhhhhhheeecccCceeeeecChHHH-hhHHHhhc----CceeEEeecCCCccccCcccccee Q lcl|NC_018863. 193 NVIDLK--GERLD--EATLNKAAVIVGKGYGRATDAFMPIGVQ-ADFTNNLL----DRQRVIQPSQAGGFSTGFSINQFL 263 (479) Q Consensus 193 NviDar--G~~l~--~~~l~~aa~~i~~~fG~atd~~mp~~vk-a~f~q~~~----~~qrv~~~~n~g~~~~G~~V~~~~ 263 (479) ..+++. +..-. -..+..+....-..++.+.++|+++... ..+....+ +++.+..+++ +.+-.|.+|-... T Consensus 306 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~w~~~~~~~~~l~~~~~~~~~~g~~~~~~~~-~~~i~G~pvv~s~ 384 (466) T protein:vir:80 306 NLLKIDPTGKSAEEFFSELVLKLSKARANYSNGMKFWAMSSNTHAVLMSKAITFNSAGALVASLNN-TMPIVGGDIVILD 384 (466) T ss_pred hhhhhhhhccchhhHHHHHHHHHHhhhccccCCceeEEecchhHHHhhcccccccCCccccccCCC-cccccccceeecC Confidence 111110 00000 0001111222344567777877765432 22211111 1111111111 1112222221000 Q ss_pred cCceeEEecCCcccCCCc-ccc-CcccCCCCCcccceEEEeecccccCcccccccceeeEEEEEEEcCCCCcccccceee Q lcl|NC_018863. 264 STRGAINLHGSTIMENDN-ILV-DRIPEPNAPQAPASVVATVKVNDKGAFRPVKDIKTHSYKVVVHSDDAESLASEAVTA 341 (479) Q Consensus 264 ss~g~I~L~~s~v~~a~~-~lv-er~~s~~aP~~P~~vta~~~~~~~g~~~~~sd~g~Y~YkV~a~n~~GES~~S~~Vta 341 (479) .. + .+..+.+... |.. +|. +. ....+ ....|..+ ...|++...=+..==.+...+.. T Consensus 385 ~~--~---~~~~~~g~~~~y~i~~r~----~~-----~i~~~---~~~~f~~d----~~~~r~~~r~dg~~~~~~afv~~ 443 (466) T protein:vir:80 385 FI--P---DNDIIGGYGSLYLLAERA----DI-----KLAQS---EHVRFIED----QTVFKGTARYDGKPVFGEGFVAV 443 (466) T ss_pred cc--C---ccceeeeccccEEEEeec----ce-----EEEec---hhhhhhcC----cEEEEEEEEEccEEeccCceEEE Confidence 00 0 0111111111 111 110 00 01111 11112111 12344443333333344555677 Q ss_pred eeecCCCeEEEEEeecCCccccceE Q lcl|NC_018863. 342 VVANPTDSVSLAVKLQSLYQAKPQF 366 (479) Q Consensus 342 T~a~~~~~V~LtIt~~~~~~~~~~y 366 (479) ++++....+.+...|.. +..+. - T Consensus 444 ~~~~~~~~~~~~~~~~~-~~~~~-~ 466 (466) T protein:vir:80 444 NIANANPTTSITFAPDE-ANVPE-V 466 (466) T ss_pred EecCCCcccceeeecCc-CcCCC-C Confidence 77788788888777642 22332 2 No 118 >protein:vir:6242 Length: 390 # NCBI annotation: gp36 # Family: family:all:21 # MgeID: mge:131 # MgeName: phi-BT1 # Cross-refs: genbank:acc:NP_813696;swissprot:trembl:q859c1;genbank:gi:29366756;interpro:IPR006444;uniprot:Q859C1;genbank:GeneID:1258897 Probab=71.82 E-value=0.19 Score=24.53 Aligned_cols=304 Identities=14% Similarity=0.067 Sum_probs=132.2 Q ss_pred CcccccccceeeeecCchhHHHHHHHHHHHhhcCc---------ccCcccccCccccchhhhHHHHHHHhhccccccchh Q lcl|NC_018863. 1 MTELQKEQKVEARKLPAGAEAELAELVSKSFTTGT---------GITPDTQHDAAALRRELLDDQVKMLAFTNGDFTIYP 71 (479) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~e~~~Ksf~ag~---------~~~~~~~~~gaAlr~esld~~i~~l~~~~~~f~~~~ 71 (479) +.+............. ... +.+.++...+. .....+..+|+-+-.+..++.|..+... ...++ T Consensus 71 ~~~~~~~~~~~~~~~~-~~~----~~~~r~~~~~~~r~~~~~~~~~~~t~~~~g~~~~~~~~~~~i~~~~~~---~~~l~ 142 (390) T protein:vir:62 71 LSGLQGSGSGAQRSAD-VDD----DATLRAGNLGEARSFEFAPEKRDGTKAGNPNVLSRTLYGQLIAQAVER---SAIMR 142 (390) T ss_pred Hhhcccccccchhhcc-hHH----HHHHhhhhhhhhHHHHhhhhhhcccccCCCccccccchHHHHHHHHhh---hhhhh Confidence 1111100000000000 000 11112221111 1111223345566666666666555433 22344 Q ss_pred hhccchhHHHHHHhhhhhccCcccccccccccccccccCcceEEEEEEEEeeeehhhhhhhHhhhcchhhHHHHHHHHHH Q lcl|NC_018863. 72 LINKQQVNSTVAKYAVFNQHGRTGHSRFVREVGVASINDPNIRQKTVQMKFLSDTKQQSLAAGLVNNIADPMTILTEDAI 151 (479) Q Consensus 72 ~i~k~~~~stv~~y~~~~~~G~~g~~~fv~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lv~~~~dp~~~~~~~ai 151 (479) .+....-.+.-..+ .+....+.....+++|++..+.+++.+.+....++=++.-..+|.-+= .++.-|.+....+.-- T Consensus 143 ~~~~~~~~~~~~~~-~~p~~~~~~~a~wv~E~~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell-~ds~~~l~~~i~~~l~ 220 (390) T protein:vir:62 143 GGATTFTTSDANPL-DFTVITGRSSASIVGETAEIPESYPATAQRSMGGFKYGFASVVSYEFA-TDQVLDLVGFLVSDAG 220 (390) T ss_pred hcceeeecCCCcee-EEEEEcCCcceeeecccccccccccceeeeEeeeeeEEeehHHHHHHH-hhhhHHHHHHHHHHHH Confidence 33322111111212 122333444677899999999999999999999998888777775442 3455577788888888 Q ss_pred HHHHHHHHHHHhhcccccCCCCCCcccchhhhHHHhhccCCcEEEcc-CCCCCHHHhhhhhheeecccCceeeeecChHH Q lcl|NC_018863. 152 SVIAKSIEWAIFYGDAALAAEADNQAGIEFDGLTKLIDEATNVIDLK-GERLDEATLNKAAVIVGKGYGRATDAFMPIGV 230 (479) Q Consensus 152 ~~~~~~~e~a~f~Gd~~l~~~~~~~~gleFDGl~~~I~~~~NviDar-G~~l~~~~l~~aa~~i~~~fG~atd~~mp~~v 230 (479) ..+++.++.++++|+-+ | -||.+......+.+... ...++.+.|.++-.-+..+|-.---.+|+..+ T Consensus 221 ~~i~~~~d~~~l~G~G~--p----------~Gi~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~l~~~~~~~a~~vmn~~~ 288 (390) T protein:vir:62 221 PAIGDAMGRHFITGTGQ--P----------RGILTDASPATATFLATDTDSKVSDALIDLFHEVPSAYRANAKYVVNDLR 288 (390) T ss_pred HHHHHHHHhhhhccCCc--c----------ccccccccccccceecccccccchHHHHHHHHhhhhhhhcCCEEEEchHH Confidence 99999999999999742 2 26766664333433332 24455444443322223344332247899999 Q ss_pred HhhHHHhhcCceeEE-eecC-CCc--cccCccccceecCceeEEecCCcccCCCccccCcccCC--CCCcccceEEEeec Q lcl|NC_018863. 231 QADFTNNLLDRQRVI-QPSQ-AGG--FSTGFSINQFLSTRGAINLHGSTIMENDNILVDRIPEP--NAPQAPASVVATVK 304 (479) Q Consensus 231 ka~f~q~~~~~qrv~-~~~n-~g~--~~~G~~V~~~~ss~g~I~L~~s~v~~a~~~lver~~s~--~aP~~P~~vta~~~ 304 (479) .+.+...-...-|-+ +++- .|. .-.|.+|. .+. .+.++.++. +-++. -..-....+- .. T Consensus 289 ~~~L~~lkd~~g~~l~~~~~~~g~~~~l~G~Pv~--~~~----------~~p~~~i~~-gd~s~~~i~~~~~~~v~-~~- 353 (390) T protein:vir:62 289 AAQMRKLKDANGQYLWQSGLTVGAPSLFNGKVVE--TDD----------GMPADKILF-ADLSKYRVRFAGSLRVD-RS- 353 (390) T ss_pred HHHHHHhhccCCCeeecCCcCCCccceecccceE--Eec----------CCCCccEEE-eeccceeEEeecceEEE-ee- Confidence 888876443332333 3331 111 11222221 000 000000000 00000 0000000000 00 Q ss_pred ccccCcccccccceeeEEEEEEEcCCCCcccccceeeeeecCCCeEEEEEeecC Q lcl|NC_018863. 305 VNDKGAFRPVKDIKTHSYKVVVHSDDAESLASEAVTAVVANPTDSVSLAVKLQS 358 (479) Q Consensus 305 ~~~~g~~~~~sd~g~Y~YkV~a~n~~GES~~S~~VtaT~a~~~~~V~LtIt~~~ 358 (479) . ..++. . +-..|++...=+ | -+....--+.|+++++. T Consensus 354 ~---~~~~~-~--~~~~~~~~~r~d-~----------~~~~~~A~~~l~~~~~a 390 (390) T protein:vir:62 354 V---DAKFS-T--DQIVYRFLQRAD-G----------LLVDARGAKVLTVTPGA 390 (390) T ss_pred c---ccccc-C--CcEEEEEEEEeC-c----------EeechhheEEEEeecCC Confidence 0 00000 0 111222221111 1 12233334556777665 No 119 >protein:vir:93616 Length: 645 # NCBI annotation: putative major head protein/prohead protease # Family: family:all:21 # MgeID: mge:157 # MgeName: phi 4795 # Cross-refs: genbank:acc:YP_001449293;genbank:gi:157166041;goa:Q6H9U8;interpro:IPR006433;uniprot:Q6H9U8;genbank:GeneID:5580438 Probab=70.36 E-value=0.21 Score=24.30 Aligned_cols=313 Identities=13% Similarity=0.085 Sum_probs=126.1 Q ss_pred Ccccccccc--------------------------eeeeecCchhHHHHHHHHHHHhhcCcccCcccccCccccchhhhH Q lcl|NC_018863. 1 MTELQKEQK--------------------------VEARKLPAGAEAELAELVSKSFTTGTGITPDTQHDAAALRRELLD 54 (479) Q Consensus 1 ~~~~~~~~~--------------------------~~~~~~~~~~~~~~~e~~~Ksf~ag~~~~~~~~~~gaAlr~esld 54 (479) .+.+..+.+ ++...... .......+.+++.+|..+++ ...|+-+-.+.+. T Consensus 280 ~~~~~~~~~~~kg~~f~~~~~al~~~~g~~~~a~e~a~~~~~~--~~~~~~~~~~a~~~~~~~~~--~~~Gg~~vp~~~~ 355 (645) T protein:vir:93 280 APVIRVEQKLDKGIGFARFAKSLAAAKGVRSEALEVARRQYPD--DSRLHHVLKSAVGAGTTTDP--QWAGSLSEYQEYA 355 (645) T ss_pred cccccchhhhhhhhhHHHHHHHHHhcccchhHHHHHHHhhccc--chhhhhhhhhhhhccccccc--cccCCccCchhhH Confidence 111111110 00000111 11111123455666554443 3456667777777 Q ss_pred HHHHHHhhccccccchhhhccchhHHHHH-Hhhh-hhccCcccccccccccccccccCcceEEEEEEEEeeeehhhhhhh Q lcl|NC_018863. 55 DQVKMLAFTNGDFTIYPLINKQQVNSTVA-KYAV-FNQHGRTGHSRFVREVGVASINDPNIRQKTVQMKFLSDTKQQSLA 132 (479) Q Consensus 55 ~~i~~l~~~~~~f~~~~~i~k~~~~stv~-~y~~-~~~~G~~g~~~fv~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~ 132 (479) .+|..+..... .+..+..+....... .++. +....+.+...+++|++..+.+++.+...+...|=|+.--.+|.- T Consensus 356 ~~ii~~l~~~s---vv~~l~~~~~~~~~~~~~~~~ip~~t~~~~a~wv~Eg~~~~~s~~~f~~v~l~~~kla~~~~iS~e 432 (645) T protein:vir:93 356 QDFIDYLRPQT---IIGRFGQGGIPALRQVPFNIRVHAQVSGGAAGWVGEGKTKPLTKFDFESITFSHAKVSAIAVLTEE 432 (645) T ss_pred HHHHHhhhhhh---hHHhhccccccccccccCceeeeeeecCcceEEeccCccccccccceeEEEEeeEEEEEeehhHHH Confidence 66654443322 222222211111000 1111 112222345779999999999999999999999999887777775 Q ss_pred HhhhcchhhHHHHHHHHHHHHHHHHHHHHHhhcccccCCCCCCcccchhhhHHHhhccCCcEEEccCCCCCHHHhhhhhh Q lcl|NC_018863. 133 AGLVNNIADPMTILTEDAISVIAKSIEWAIFYGDAALAAEADNQAGIEFDGLTKLIDEATNVIDLKGERLDEATLNKAAV 212 (479) Q Consensus 133 ~~lv~~~~dp~~~~~~~ai~~~~~~~e~a~f~Gd~~l~~~~~~~~gleFDGl~~~I~~~~NviDarG~~l~~~~l~~aa~ 212 (479) +=. ++.-|.+....++-...+++.++.++|.|+..-..+. .+.|+ .... ..+...| ....+..+.... T Consensus 433 ll~-ds~~~~~~~i~~~l~~aia~~~d~a~l~g~g~~~~~~-~p~gi-----~~~~----~~~~~~~-~~~~d~~~~~~~ 500 (645) T protein:vir:93 433 LIR-FSSPAADALVRNALAEAVVARLDTDFVDPKKAAVADV-SPASI-----THDV----KGTASSG-NPDADAEAAFGQ 500 (645) T ss_pred HHh-hchHHHHHHHHHHHHHHHHHHHHHHhhcCCCcccCCc-cccce-----eccc----ccccccc-chHHHHHHHHHH Confidence 432 3445677888889999999999999999886533221 12232 1111 1111212 222343333333 Q ss_pred eeecccCceee-eecChHHHhhHHHhhcCceeEEeecCCCccccCccccceecCceeEEecCCcccCCCccccCcccCCC Q lcl|NC_018863. 213 IVGKGYGRATD-AFMPIGVQADFTNNLLDRQRVIQPSQAGGFSTGFSINQFLSTRGAINLHGSTIMENDNILVDRIPEPN 291 (479) Q Consensus 213 ~i~~~fG~atd-~~mp~~vka~f~q~~~~~qrv~~~~n~g~~~~G~~V~~~~ss~g~I~L~~s~v~~a~~~lver~~s~~ 291 (479) ....++...+- ..|++.+...+...-...-+-+.++-. ..+.++++.+.+..+.. T Consensus 501 ~~~a~~~~~~a~~vmn~~~~~~L~~lkd~~G~~~~~~~~--------------------~~~~tL~G~PV~~s~~v---- 556 (645) T protein:vir:93 501 FVAANLQPTGAVWLMSSTNALALSMRKNALGQKEYPDMT--------------------LLGGSFQGLPVIVSQYV---- 556 (645) T ss_pred HHhcCCCccccEEEEcHHHHHHHHhccccCCceeecCCC--------------------CCCceeeceeeEEeccC---- Confidence 33344444444 458999988886654321111212100 01111111111111100 Q ss_pred CCcccceEE-EeecccccCccccccccee---eEEEEEEEcCCCCcccccceeeeeecCCCeEE---------------- Q lcl|NC_018863. 292 APQAPASVV-ATVKVNDKGAFRPVKDIKT---HSYKVVVHSDDAESLASEAVTAVVANPTDSVS---------------- 351 (479) Q Consensus 292 aP~~P~~vt-a~~~~~~~g~~~~~sd~g~---Y~YkV~a~n~~GES~~S~~VtaT~a~~~~~V~---------------- 351 (479) | .... +..+.-.-|. +-+-.++. -.|++..... +........+..-..-.+.+. T Consensus 557 -p---~~~~~gd~s~~~ig~-~~~v~i~~s~~a~~~~~~~~~-~~~~~~~~~~~v~lf~~d~vaira~~r~d~~~~~p~a 630 (645) T protein:vir:93 557 -G---DQLVLVNAPDIYLAD-DGGVAVDMSREASLEMQSEPT-GDSTTPSPVELVSMFQTGSVAIRAERWINWRRRRTAA 630 (645) T ss_pred -C---cceeEeccccEEEEE-ecceEEEeecceeEEEeeccc-ccccccccccchhHhhcCceEEEEEEEEcceeeCccc Confidence 1 0000 0000000000 00000000 0112111000 000000000000001111122 Q ss_pred ---EE-EeecCCccc Q lcl|NC_018863. 352 ---LA-VKLQSLYQA 362 (479) Q Consensus 352 ---Lt-It~~~~~~~ 362 (479) |+ ++|....+. T Consensus 631 ~~~lt~~~~g~~~~~ 645 (645) T protein:vir:93 631 VAVITGVNYGSASGG 645 (645) T ss_pred eEEEecccCCcccCC Confidence 22 333322221 No 120 >protein:vir:100884 Length: 389 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:1473 # MgeName: Lc-Nu # Cross-refs: genbank:acc:YP_358764;genbank:gi:78000028;genbank:GeneID:3726155 Probab=66.04 E-value=0.27 Score=23.67 Aligned_cols=309 Identities=12% Similarity=0.092 Sum_probs=134.4 Q ss_pred CcccccccceeeeecCchh---HHHHHHHHHHHhhc-CcccCcccccCccccchhhhHHHHHHHhhccccccchhhhccc Q lcl|NC_018863. 1 MTELQKEQKVEARKLPAGA---EAELAELVSKSFTT-GTGITPDTQHDAAALRRELLDDQVKMLAFTNGDFTIYPLINKQ 76 (479) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~---~~~~~e~~~Ksf~a-g~~~~~~~~~~gaAlr~esld~~i~~l~~~~~~f~~~~~i~k~ 76 (479) +++...+.... ...... +.+-+..++|.... =......+..+|+.+-++-+..+|..+...... +++.+... T Consensus 71 ~~~~~~~~~~~--~~~~~~~~~~~~~~~~~lr~~~~~~~~~~~~t~~~gg~~vP~~~~~~i~~~~~~~~~--l~~~~~~~ 146 (389) T protein:vir:10 71 EPKDDGSKKGT--DLSKKPIDAKKKAINDFIHSHGKVIDATSKVTSTEAGVLIPEEIIYDPTAEVNSVVD--LSTLVTKT 146 (389) T ss_pred ccccccccccc--ccchhHHHHHHHHHHHHhhcchhhhhhhcccccCCcceeehHHHHHHHHHHHHhhhh--HHhhccee Confidence 22222221111 111111 11112222222110 001222334567888888888887666555443 34444445 Q ss_pred hhHHHHHHhhhhhccCccccccccccccccc-ccCcceEEEEEEEEeeeehhhhhhhHhhhcchhhHHHHHHHHHHHHHH Q lcl|NC_018863. 77 QVNSTVAKYAVFNQHGRTGHSRFVREVGVAS-INDPNIRQKTVQMKFLSDTKQQSLAAGLVNNIADPMTILTEDAISVIA 155 (479) Q Consensus 77 ~~~stv~~y~~~~~~G~~g~~~fv~E~g~~~-~~d~~~~r~~~~~k~l~~~~~vs~~~~lv~~~~dp~~~~~~~ai~~~~ 155 (479) ++.+.--+|.+... ..+...+++|++... .+++.+.+....++-++.-..+|.-+ +.++..|.+....+.-...+. T Consensus 147 ~~~~~~~~~~~~~~--~~~~~~~~~E~~~~~~~~~~~~~~i~~~~~k~~~~~~iS~el-l~ds~~~l~~~i~~~la~~~~ 223 (389) T protein:vir:10 147 PVTTPKGTYPILKR--ATDRFSSVAELAENPKLAEPEFNKVDWSVATYRGAIPLSEEA-IADSAVDLTALVGQSIKEKSV 223 (389) T ss_pred eccCCeeEEEEEec--CCCccccccccccccccccccceeeeeeheeeEeeehhhHHH-HhhhhHHHHHHHHHHHHHHHH Confidence 55544444543333 223445789987654 78999999999999998888888754 345566777888888888899 Q ss_pred HHHHHHHhhcccccCCCCCCcccchhhhHHHhhccCCcEEEccCCCCCHHHhhhhhheeecccCceeeeecChHHHhhHH Q lcl|NC_018863. 156 KSIEWAIFYGDAALAAEADNQAGIEFDGLTKLIDEATNVIDLKGERLDEATLNKAAVIVGKGYGRATDAFMPIGVQADFT 235 (479) Q Consensus 156 ~~~e~a~f~Gd~~l~~~~~~~~gleFDGl~~~I~~~~NviDarG~~l~~~~l~~aa~~i~~~fG~atd~~mp~~vka~f~ 235 (479) ..++.++..|.....+.+.. ...-.|-|..++.. ....+|+ .-.+|++.+...+. T Consensus 224 ~~~~~~i~~g~~~~~~~~~~-~~~~~d~l~~~~~~----------------------~~~~~~~--a~~~~n~~~~~~L~ 278 (389) T protein:vir:10 224 NTYNAMIAPVLQSFTAKKTT-TDTLVDSLKHILNV----------------------DLDPAYS--RALVVTQSLFNTLD 278 (389) T ss_pred HHHHHHHhhhhccccccccc-ccccHHHHHHHHHh----------------------hhhhhhC--cEEEecHHHHHHHH Confidence 99999999988776544322 12334444443321 1111221 24689998888887 Q ss_pred HhhcCceeEEeecCCCccccCccccceecCceeEEecCCcccCCCccccCcccCCCCCcccceEEEeecccccCcccccc Q lcl|NC_018863. 236 NNLLDRQRVIQPSQAGGFSTGFSINQFLSTRGAINLHGSTIMENDNILVDRIPEPNAPQAPASVVATVKVNDKGAFRPVK 315 (479) Q Consensus 236 q~~~~~qrv~~~~n~g~~~~G~~V~~~~ss~g~I~L~~s~v~~a~~~lver~~s~~aP~~P~~vta~~~~~~~g~~~~~s 315 (479) ..-...-|-+...+..+.+.+-. ..++++.+.+.++....+... .... -. T Consensus 279 ~lkd~~G~~i~~~~~~~~~~~~~--------------~~~l~G~pV~~~~~~~~~~~~---~~~~-----~~-------- 328 (389) T protein:vir:10 279 TLKDKNGRYLLHDASDSITDGTA--------------KGTILGVPVYVVGDTLLGSLA---GDQK-----AF-------- 328 (389) T ss_pred HhhccCCCeeeecCccccccccc--------------ccccccceeEEecccccCCCC---CceE-----EE-------- Confidence 65543223332222221111100 112222332222210000000 0000 00 Q ss_pred cceeeEEEEEEEcCCCCcccccceeeeeecCCCeEEEEEeecCCccccceEEEEEecc-----CCCCcEEEEEEeeeeec Q lcl|NC_018863. 316 DIKTHSYKVVVHSDDAESLASEAVTAVVANPTDSVSLAVKLQSLYQAKPQFISVYRQG-----NETGHYFLVARVPLSKA 390 (479) Q Consensus 316 d~g~Y~YkV~a~n~~GES~~S~~VtaT~a~~~~~V~LtIt~~~~~~~~~~y~~IYR~t-----~~~g~~~~i~rV~~s~~ 390 (479) .|.++..+....+.|-+.. .+.... ....++.++|=. ++...+..+..+|.++. T Consensus 329 -~gd~~~~~~~~~~~~~~i~------------------~~~~~~--~~~~~~~~~r~d~~~~~~~a~~~~~~~~~~~~~~ 387 (389) T protein:vir:10 329 -VGDLKRGVLFTDRQQVTLA------------------WEDSKI--YGKYLGAAFRFGVQKADSKAGYFVTNTDVPGSAL 387 (389) T ss_pred -EeeccccEEEEeecceEEE------------------eecccc--ccceEEEEEEeccEEecccceEEEEeeccCCCCC Confidence 1122211111111111111 000000 000112222211 01122222223333221 Q ss_pred cCCCe Q lcl|NC_018863. 391 DENGV 395 (479) Q Consensus 391 n~~~t 395 (479) . . T Consensus 388 ~---~ 389 (389) T protein:vir:10 388 G---K 389 (389) T ss_pred C---C Confidence 1 1 No 121 >protein:vir:107423 Length: 681 # NCBI annotation: Bbp13 # Family: family:all:780 # MgeID: mge:1537 # MgeName: BPP-1 # Cross-refs: genbank:acc:NP_958682;genbank:gi:41179374;genbank:GeneID:2717217 Probab=59.01 E-value=0.22 Score=24.16 Aligned_cols=302 Identities=19% Similarity=0.179 Sum_probs=133.6 Q ss_pred hhhhHhhhcch----hhHH--HHHHHHHHHHHHHHHHHHH--hhcccccCCCCCCcccchhhhHHHhhc----------c Q lcl|NC_018863. 129 QSLAAGLVNNI----ADPM--TILTEDAISVIAKSIEWAI--FYGDAALAAEADNQAGIEFDGLTKLID----------E 190 (479) Q Consensus 129 vs~~~~lv~~~----~dp~--~~~~~~ai~~~~~~~e~a~--f~Gd~~l~~~~~~~~gleFDGl~~~I~----------~ 190 (479) ++...-+++++ -+|+ .+.-.+.-.+-++.+|.++ -+|--.--| |++|-|-++-=. . T Consensus 1 m~~~~~~~~~f~~Ge~~p~l~~r~D~~~y~~~~~~~~N~~~~~~G~~~~R~------g~~~~~~~~~~~~~~rlipf~~~ 74 (681) T protein:vir:10 1 MSNVRVLQRSFGGGEISPEMFGRIDDVKYQSGLAICRNFVVKPQGPAENRA------GFAFVREVKDSAKKVRLIPFTYS 74 (681) T ss_pred CcceeEeeeecCCceeeeeeccchhHHHHHHHHHHhcCcEEEecCCceecC------hhHhhhhcCCCCCcEEEEEEEeC Confidence 11111123333 2555 4444455556666666553 344444332 677777655321 1 Q ss_pred CCcEEEccCCCCCHHHhhhhhheeecccCceeeeecChHHHhhHHHhhcCceeEEeecCCCccccCccccceecCceeEE Q lcl|NC_018863. 191 ATNVIDLKGERLDEATLNKAAVIVGKGYGRATDAFMPIGVQADFTNNLLDRQRVIQPSQAGGFSTGFSINQFLSTRGAIN 270 (479) Q Consensus 191 ~~NviDarG~~l~~~~l~~aa~~i~~~fG~atd~~mp~~vka~f~q~~~~~qrv~~~~n~g~~~~G~~V~~~~ss~g~I~ 270 (479) .++.+.++= -.....+-++-|...+.-+|..+...|....+..-+..|.+|.--..-+...++ .+. T Consensus 75 ~~~~~~l~~--------g~~~~r~~~~~~~~~~~~~~~~~~tpy~~~~l~~l~~~q~aD~~~i~h~~~~p~------~L~ 140 (681) T protein:vir:10 75 VTQTMVIEL--------GAGYFRFHTNGGTLLDGAVPYEIANPYAEADLFNIHYVQSADVLTLVHPNYAPR------ELR 140 (681) T ss_pred CCceEEEEE--------eCCeEEEEeCCcEEeeCcEeEEecCCCChhhhcCceEEEEcCEEEEECCCCcce------EEE Confidence 122221110 111233334445544445566556666666677777777766433333333222 233 Q ss_pred ecCCcccCCCccccCcccCCCCCcccceEEEeecccccCcccccccceeeEEEEEEEcCCC--CcccccceeeeeecCCC Q lcl|NC_018863. 271 LHGSTIMENDNILVDRIPEPNAPQAPASVVATVKVNDKGAFRPVKDIKTHSYKVVVHSDDA--ESLASEAVTAVVANPTD 348 (479) Q Consensus 271 L~~s~v~~a~~~lver~~s~~aP~~P~~vta~~~~~~~g~~~~~sd~g~Y~YkV~a~n~~G--ES~~S~~VtaT~a~~~~ 348 (479) +++.+-+ -.+...=..+|..|++.+++... .+. --+|+|.|.++...+ +|.++..++.+...... T Consensus 141 r~~~~~W-----~l~~~~f~~~p~~p~~~~at~~~--~~~------~~t~~~~v~avda~t~~~s~~~~~~tvt~~~~~~ 207 (681) T protein:vir:10 141 RLGATNW-----QLATIAFTSPVATPTSVTATSNN--KGT------DYTYRYVVTALDAEGKTESAPSSAGTCTNNLFTN 207 (681) T ss_pred EccCCce-----EEEEEEeccccccceeeeeeccC--Ccc------ceeEeEEEEEeecccceeecCCcceEEeeeeecC Confidence 3432221 12222334566677766655311 111 126899999888776 67777776666544444 Q ss_pred eEEEEEeecCCccccceEEEEEeccCCCCcEEEEEEeeeeeccCCCeeEEEeccC------C-------------CCCcc Q lcl|NC_018863. 349 SVSLAVKLQSLYQAKPQFISVYRQGNETGHYFLVARVPLSKADENGVITFVDRNQ------V-------------IPETT 409 (479) Q Consensus 349 ~V~LtIt~~~~~~~~~~y~~IYR~t~~~g~~~~i~rV~~s~~n~~~tttf~D~N~------~-------------iPgT~ 409 (479) ...-++.|.++.++. ++.|||... +.+-.++. ...+.+.|.+. + -|.|. T Consensus 208 ~~~~t~~w~a~~g~~--~~~V~~~~~--gi~g~ig~--------~~~~~~~~~~~~~~~~~t~~~~~~~~~~~~gyP~~v 275 (681) T protein:vir:10 208 GGANTIAWSASSGAS--RYNVYKEQG--GLYGYIGQ--------TTGTSLVDDNIAPDLSVTPPIYDAVFNAAGDYPAAV 275 (681) T ss_pred CcceeEEEEecCCce--eeeecccce--eEEEEeec--------cceeeeeecccccCccccccccccccccCCCceEEE Confidence 334455666666654 678998654 33333322 11222323221 1 13333 Q ss_pred ceeeccc-------cHHHHHH------HHhc---ccc---c--cCccccC-chhHHHHHhhhhhheecccee-------- Q lcl|NC_018863. 410 DVFIGEL-------TPQVISL------LELL---PMM---K--LPLAQMN-ATTTFTVLWYGALALYAPKKW-------- 459 (479) Q Consensus 410 ~~fvge~-------~~q~i~l------~ell---Pm~---k--~Pla~~~-~~~~~~V~~yg~L~l~aPkk~-------- 459 (479) ..|-+-| +||.|-| .-|- |+. . +.++... ..+.|+|-.=..|.+.+=.-| T Consensus 276 ~f~q~RL~f~~~~~~p~~v~~Srsgdy~nF~~~~~~~ddD~i~~~~~~~~~~~i~~~v~~~~lli~t~~~e~~l~~~~~~ 355 (681) T protein:vir:10 276 SYFEQRRCFAGTTNKPQNIWMTRSGTESAMSYSLPVRDDDRVAFRVAAREANAIRHIVPLTELLLLTSSGEWRVASVNSD 355 (681) T ss_pred EEEcceEEEeeCCCCCcEEEEEcccCcccccccCCCCCCccEEEEEcCCcceeEEEEEecCcEEEEEcCcEEEEecCCCc Confidence 2222111 4443222 2222 211 0 1111111 245566654333333333333 Q ss_pred ------EEEEeccccCccccccccCC Q lcl|NC_018863. 460 ------VRIKNVQYIPALAADVTYRP 479 (479) Q Consensus 460 ------~~ikNV~~~~~~~~~~~~~~ 479 (479) +.|+-+...++ +++ +| T Consensus 356 ~lTP~~~~~~~~s~~g~--~~~--~P 377 (681) T protein:vir:10 356 AVTPTTISVRPQSYVGA--TDV--QP 377 (681) T ss_pred cccceeEEEEEeeeecc--ccc--cc Confidence 34444444443 332 56 No 122 >protein:vir:98487 Length: 681 # NCBI annotation: hypothetical protein predicted by GeneMark # Family: family:all:780 # MgeID: mge:1592 # MgeName: BMP-1 # Cross-refs: genbank:acc:NP_996575;genbank:gi:45569506;genbank:GeneID:2767815 Probab=59.01 E-value=0.22 Score=24.16 Aligned_cols=302 Identities=19% Similarity=0.179 Sum_probs=133.6 Q ss_pred hhhhHhhhcch----hhHH--HHHHHHHHHHHHHHHHHHH--hhcccccCCCCCCcccchhhhHHHhhc----------c Q lcl|NC_018863. 129 QSLAAGLVNNI----ADPM--TILTEDAISVIAKSIEWAI--FYGDAALAAEADNQAGIEFDGLTKLID----------E 190 (479) Q Consensus 129 vs~~~~lv~~~----~dp~--~~~~~~ai~~~~~~~e~a~--f~Gd~~l~~~~~~~~gleFDGl~~~I~----------~ 190 (479) ++...-+++++ -+|+ .+.-.+.-.+-++.+|.++ -+|--.--| |++|-|-++-=. . T Consensus 1 m~~~~~~~~~f~~Ge~~p~l~~r~D~~~y~~~~~~~~N~~~~~~G~~~~R~------g~~~~~~~~~~~~~~rlipf~~~ 74 (681) T protein:vir:98 1 MSNVRVLQRSFGGGEISPEMFGRIDDVKYQSGLAICRNFVVKPQGPAENRA------GFAFVREVKDSAKKVRLIPFTYS 74 (681) T ss_pred CcceeEeeeecCCceeeeeeccchhHHHHHHHHHHhcCcEEEecCCceecC------hhHhhhhcCCCCCcEEEEEEEeC Confidence 11111123333 2555 4444455556666666553 344444332 677777655321 1 Q ss_pred CCcEEEccCCCCCHHHhhhhhheeecccCceeeeecChHHHhhHHHhhcCceeEEeecCCCccccCccccceecCceeEE Q lcl|NC_018863. 191 ATNVIDLKGERLDEATLNKAAVIVGKGYGRATDAFMPIGVQADFTNNLLDRQRVIQPSQAGGFSTGFSINQFLSTRGAIN 270 (479) Q Consensus 191 ~~NviDarG~~l~~~~l~~aa~~i~~~fG~atd~~mp~~vka~f~q~~~~~qrv~~~~n~g~~~~G~~V~~~~ss~g~I~ 270 (479) .++.+.++= -.....+-++-|...+.-+|..+...|....+..-+..|.+|.--..-+...++ .+. T Consensus 75 ~~~~~~l~~--------g~~~~r~~~~~~~~~~~~~~~~~~tpy~~~~l~~l~~~q~aD~~~i~h~~~~p~------~L~ 140 (681) T protein:vir:98 75 VTQTMVIEL--------GAGYFRFHTNGGTLLDGAVPYEIANPYAEADLFNIHYVQSADVLTLVHPNYAPR------ELR 140 (681) T ss_pred CCceEEEEE--------eCCeEEEEeCCcEEeeCcEeEEecCCCChhhhcCceEEEEcCEEEEECCCCcce------EEE Confidence 122221110 111233334445544445566556666666677777777766433333333222 233 Q ss_pred ecCCcccCCCccccCcccCCCCCcccceEEEeecccccCcccccccceeeEEEEEEEcCCC--CcccccceeeeeecCCC Q lcl|NC_018863. 271 LHGSTIMENDNILVDRIPEPNAPQAPASVVATVKVNDKGAFRPVKDIKTHSYKVVVHSDDA--ESLASEAVTAVVANPTD 348 (479) Q Consensus 271 L~~s~v~~a~~~lver~~s~~aP~~P~~vta~~~~~~~g~~~~~sd~g~Y~YkV~a~n~~G--ES~~S~~VtaT~a~~~~ 348 (479) +++.+-+ -.+...=..+|..|++.+++... .+. --+|+|.|.++...+ +|.++..++.+...... T Consensus 141 r~~~~~W-----~l~~~~f~~~p~~p~~~~at~~~--~~~------~~t~~~~v~avda~t~~~s~~~~~~tvt~~~~~~ 207 (681) T protein:vir:98 141 RLGATNW-----QLATIAFTSPVATPTSVTATSNN--KGT------DYTYRYVVTALDAEGKTESAPSSAGTCTNNLFTN 207 (681) T ss_pred EccCCce-----EEEEEEeccccccceeeeeeccC--Ccc------ceeEeEEEEEeecccceeecCCcceEEeeeeecC Confidence 3432221 12222334566677766655311 111 126899999888776 67777776666544444 Q ss_pred eEEEEEeecCCccccceEEEEEeccCCCCcEEEEEEeeeeeccCCCeeEEEeccC------C-------------CCCcc Q lcl|NC_018863. 349 SVSLAVKLQSLYQAKPQFISVYRQGNETGHYFLVARVPLSKADENGVITFVDRNQ------V-------------IPETT 409 (479) Q Consensus 349 ~V~LtIt~~~~~~~~~~y~~IYR~t~~~g~~~~i~rV~~s~~n~~~tttf~D~N~------~-------------iPgT~ 409 (479) ...-++.|.++.++. ++.|||... +.+-.++. ...+.+.|.+. + -|.|. T Consensus 208 ~~~~t~~w~a~~g~~--~~~V~~~~~--gi~g~ig~--------~~~~~~~~~~~~~~~~~t~~~~~~~~~~~~gyP~~v 275 (681) T protein:vir:98 208 GGANTIAWSASSGAS--RYNVYKEQG--GLYGYIGQ--------TTGTSLVDDNIAPDLSVTPPIYDAVFNAAGDYPAAV 275 (681) T ss_pred CcceeEEEEecCCce--eeeecccce--eEEEEeec--------cceeeeeecccccCccccccccccccccCCCceEEE Confidence 334455666666654 678998654 33333322 11222323221 1 13333 Q ss_pred ceeeccc-------cHHHHHH------HHhc---ccc---c--cCccccC-chhHHHHHhhhhhheecccee-------- Q lcl|NC_018863. 410 DVFIGEL-------TPQVISL------LELL---PMM---K--LPLAQMN-ATTTFTVLWYGALALYAPKKW-------- 459 (479) Q Consensus 410 ~~fvge~-------~~q~i~l------~ell---Pm~---k--~Pla~~~-~~~~~~V~~yg~L~l~aPkk~-------- 459 (479) ..|-+-| +||.|-| .-|- |+. . +.++... ..+.|+|-.=..|.+.+=.-| T Consensus 276 ~f~q~RL~f~~~~~~p~~v~~Srsgdy~nF~~~~~~~ddD~i~~~~~~~~~~~i~~~v~~~~lli~t~~~e~~l~~~~~~ 355 (681) T protein:vir:98 276 SYFEQRRCFAGTTNKPQNIWMTRSGTESAMSYSLPVRDDDRVAFRVAAREANAIRHIVPLTELLLLTSSGEWRVASVNSD 355 (681) T ss_pred EEEcceEEEeeCCCCCcEEEEEcccCcccccccCCCCCCccEEEEEcCCcceeEEEEEecCcEEEEEcCcEEEEecCCCc Confidence 2222111 4443222 2222 211 0 1111111 245566654333333333333 Q ss_pred ------EEEEeccccCccccccccCC Q lcl|NC_018863. 460 ------VRIKNVQYIPALAADVTYRP 479 (479) Q Consensus 460 ------~~ikNV~~~~~~~~~~~~~~ 479 (479) +.|+-+...++ +++ +| T Consensus 356 ~lTP~~~~~~~~s~~g~--~~~--~P 377 (681) T protein:vir:98 356 AVTPTTISVRPQSYVGA--TDV--QP 377 (681) T ss_pred cccceeEEEEEeeeecc--ccc--cc Confidence 34444444443 332 56 No 123 >protein:vir:107802 Length: 681 # NCBI annotation: hypothetical protein predicted by GeneMark # Family: family:all:780 # MgeID: mge:1673 # MgeName: BIP-1 # Cross-refs: genbank:acc:NP_996623;genbank:gi:45580757;genbank:GeneID:2767878 Probab=59.01 E-value=0.22 Score=24.16 Aligned_cols=302 Identities=19% Similarity=0.179 Sum_probs=133.6 Q ss_pred hhhhHhhhcch----hhHH--HHHHHHHHHHHHHHHHHHH--hhcccccCCCCCCcccchhhhHHHhhc----------c Q lcl|NC_018863. 129 QSLAAGLVNNI----ADPM--TILTEDAISVIAKSIEWAI--FYGDAALAAEADNQAGIEFDGLTKLID----------E 190 (479) Q Consensus 129 vs~~~~lv~~~----~dp~--~~~~~~ai~~~~~~~e~a~--f~Gd~~l~~~~~~~~gleFDGl~~~I~----------~ 190 (479) ++...-+++++ -+|+ .+.-.+.-.+-++.+|.++ -+|--.--| |++|-|-++-=. . T Consensus 1 m~~~~~~~~~f~~Ge~~p~l~~r~D~~~y~~~~~~~~N~~~~~~G~~~~R~------g~~~~~~~~~~~~~~rlipf~~~ 74 (681) T protein:vir:10 1 MSNVRVLQRSFGGGEISPEMFGRIDDVKYQSGLAICRNFVVKPQGPAENRA------GFAFVREVKDSAKKVRLIPFTYS 74 (681) T ss_pred CcceeEeeeecCCceeeeeeccchhHHHHHHHHHHhcCcEEEecCCceecC------hhHhhhhcCCCCCcEEEEEEEeC Confidence 11111123333 2555 4444455556666666553 344444332 677777655321 1 Q ss_pred CCcEEEccCCCCCHHHhhhhhheeecccCceeeeecChHHHhhHHHhhcCceeEEeecCCCccccCccccceecCceeEE Q lcl|NC_018863. 191 ATNVIDLKGERLDEATLNKAAVIVGKGYGRATDAFMPIGVQADFTNNLLDRQRVIQPSQAGGFSTGFSINQFLSTRGAIN 270 (479) Q Consensus 191 ~~NviDarG~~l~~~~l~~aa~~i~~~fG~atd~~mp~~vka~f~q~~~~~qrv~~~~n~g~~~~G~~V~~~~ss~g~I~ 270 (479) .++.+.++= -.....+-++-|...+.-+|..+...|....+..-+..|.+|.--..-+...++ .+. T Consensus 75 ~~~~~~l~~--------g~~~~r~~~~~~~~~~~~~~~~~~tpy~~~~l~~l~~~q~aD~~~i~h~~~~p~------~L~ 140 (681) T protein:vir:10 75 VTQTMVIEL--------GAGYFRFHTNGGTLLDGAVPYEIANPYAEADLFNIHYVQSADVLTLVHPNYAPR------ELR 140 (681) T ss_pred CCceEEEEE--------eCCeEEEEeCCcEEeeCcEeEEecCCCChhhhcCceEEEEcCEEEEECCCCcce------EEE Confidence 122221110 111233334445544445566556666666677777777766433333333222 233 Q ss_pred ecCCcccCCCccccCcccCCCCCcccceEEEeecccccCcccccccceeeEEEEEEEcCCC--CcccccceeeeeecCCC Q lcl|NC_018863. 271 LHGSTIMENDNILVDRIPEPNAPQAPASVVATVKVNDKGAFRPVKDIKTHSYKVVVHSDDA--ESLASEAVTAVVANPTD 348 (479) Q Consensus 271 L~~s~v~~a~~~lver~~s~~aP~~P~~vta~~~~~~~g~~~~~sd~g~Y~YkV~a~n~~G--ES~~S~~VtaT~a~~~~ 348 (479) +++.+-+ -.+...=..+|..|++.+++... .+. --+|+|.|.++...+ +|.++..++.+...... T Consensus 141 r~~~~~W-----~l~~~~f~~~p~~p~~~~at~~~--~~~------~~t~~~~v~avda~t~~~s~~~~~~tvt~~~~~~ 207 (681) T protein:vir:10 141 RLGATNW-----QLATIAFTSPVATPTSVTATSNN--KGT------DYTYRYVVTALDAEGKTESAPSSAGTCTNNLFTN 207 (681) T ss_pred EccCCce-----EEEEEEeccccccceeeeeeccC--Ccc------ceeEeEEEEEeecccceeecCCcceEEeeeeecC Confidence 3432221 12222334566677766655311 111 126899999888776 67777776666544444 Q ss_pred eEEEEEeecCCccccceEEEEEeccCCCCcEEEEEEeeeeeccCCCeeEEEeccC------C-------------CCCcc Q lcl|NC_018863. 349 SVSLAVKLQSLYQAKPQFISVYRQGNETGHYFLVARVPLSKADENGVITFVDRNQ------V-------------IPETT 409 (479) Q Consensus 349 ~V~LtIt~~~~~~~~~~y~~IYR~t~~~g~~~~i~rV~~s~~n~~~tttf~D~N~------~-------------iPgT~ 409 (479) ...-++.|.++.++. ++.|||... +.+-.++. ...+.+.|.+. + -|.|. T Consensus 208 ~~~~t~~w~a~~g~~--~~~V~~~~~--gi~g~ig~--------~~~~~~~~~~~~~~~~~t~~~~~~~~~~~~gyP~~v 275 (681) T protein:vir:10 208 GGANTIAWSASSGAS--RYNVYKEQG--GLYGYIGQ--------TTGTSLVDDNIAPDLSVTPPIYDAVFNAAGDYPAAV 275 (681) T ss_pred CcceeEEEEecCCce--eeeecccce--eEEEEeec--------cceeeeeecccccCccccccccccccccCCCceEEE Confidence 334455666666654 678998654 33333322 11222323221 1 13333 Q ss_pred ceeeccc-------cHHHHHH------HHhc---ccc---c--cCccccC-chhHHHHHhhhhhheecccee-------- Q lcl|NC_018863. 410 DVFIGEL-------TPQVISL------LELL---PMM---K--LPLAQMN-ATTTFTVLWYGALALYAPKKW-------- 459 (479) Q Consensus 410 ~~fvge~-------~~q~i~l------~ell---Pm~---k--~Pla~~~-~~~~~~V~~yg~L~l~aPkk~-------- 459 (479) ..|-+-| +||.|-| .-|- |+. . +.++... ..+.|+|-.=..|.+.+=.-| T Consensus 276 ~f~q~RL~f~~~~~~p~~v~~Srsgdy~nF~~~~~~~ddD~i~~~~~~~~~~~i~~~v~~~~lli~t~~~e~~l~~~~~~ 355 (681) T protein:vir:10 276 SYFEQRRCFAGTTNKPQNIWMTRSGTESAMSYSLPVRDDDRVAFRVAAREANAIRHIVPLTELLLLTSSGEWRVASVNSD 355 (681) T ss_pred EEEcceEEEeeCCCCCcEEEEEcccCcccccccCCCCCCccEEEEEcCCcceeEEEEEecCcEEEEEcCcEEEEecCCCc Confidence 2222111 4443222 2222 211 0 1111111 245566654333333333333 Q ss_pred ------EEEEeccccCccccccccCC Q lcl|NC_018863. 460 ------VRIKNVQYIPALAADVTYRP 479 (479) Q Consensus 460 ------~~ikNV~~~~~~~~~~~~~~ 479 (479) +.|+-+...++ +++ +| T Consensus 356 ~lTP~~~~~~~~s~~g~--~~~--~P 377 (681) T protein:vir:10 356 AVTPTTISVRPQSYVGA--TDV--QP 377 (681) T ss_pred cccceeEEEEEeeeecc--ccc--cc Confidence 34444444443 332 56 No 124 >protein:vir:93742 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1475 # MgeName: 55 # Cross-refs: genbank:acc:YP_240459;genbank:gi:66396126;genbank:GeneID:5133511 Probab=52.20 E-value=0.56 Score=21.96 Aligned_cols=264 Identities=11% Similarity=0.065 Sum_probs=115.6 Q ss_pred CcccccccceeeeecCchhHHHHHHHHHHHhhcCcccCccccc-CccccchhhhHHHHHHHhhccccccchh------hh Q lcl|NC_018863. 1 MTELQKEQKVEARKLPAGAEAELAELVSKSFTTGTGITPDTQH-DAAALRRELLDDQVKMLAFTNGDFTIYP------LI 73 (479) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~e~~~Ksf~ag~~~~~~~~~-~gaAlr~esld~~i~~l~~~~~~f~~~~------~i 73 (479) |+ . +++ -+.-+.+|.+.+.+ ++...+...|.. ++ T Consensus 1 ma------------------------------~-------~~T~~~~~iiPev~~~~v--~~~~~~~~~~~~~~~~~~~l 41 (274) T protein:vir:93 1 MP------------------------------Q-------GITKTSNQIIPEVLAPMM--QAQLEKKLRFASFAEVDSTL 41 (274) T ss_pred CC------------------------------c-------cceehhheechHHHHHHH--HHHHHhhhhhcccccccccc Confidence 11 1 110 11123334443333 111111111111 11 Q ss_pred ccchhHHHHHHhhhhhccCcccccccccccccccccCcceEEEEEEEEeeeehhhhhhhHhhhcchhhHHHHHHHHHHHH Q lcl|NC_018863. 74 NKQQVNSTVAKYAVFNQHGRTGHSRFVREVGVASINDPNIRQKTVQMKFLSDTKQQSLAAGLVNNIADPMTILTEDAISV 153 (479) Q Consensus 74 ~k~~~~stv~~y~~~~~~G~~g~~~fv~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lv~~~~dp~~~~~~~ai~~ 153 (479) ..+ .-+||+ +..|+..|....+.|++.....+.+......+++...-...+++...++ +..||+....+..... T Consensus 42 ~g~-~G~tv~----ip~~~~~g~~~~~~eg~~i~~~~it~~~~~~~i~~~~~~~~i~D~~~~~-~~~d~~~~~~~~~~~~ 115 (274) T protein:vir:93 42 QGQ-PGDTLT----FPAFVYSGDAQVVAEGEKIPTDILETKKREAKIRKIAKGTSITDEALLS-GYGDPQGEQVRQHGLA 115 (274) T ss_pred cCC-CCCEEE----EEeeccCCCcccccCCCcccccccccceeEEEeeeecccccccHHHHHh-hccchHHHHHHHHHHH Confidence 111 122332 3345555666677888888999999999999999999899999986655 4679998888888888 Q ss_pred HHHHHHHHHhhcccccCCCCCCcccchhhhHHHhhccCCcEEEccCCCCCHHHhhhhhheeecccCceeeeecChHHHhh Q lcl|NC_018863. 154 IAKSIEWAIFYGDAALAAEADNQAGIEFDGLTKLIDEATNVIDLKGERLDEATLNKAAVIVGKGYGRATDAFMPIGVQAD 233 (479) Q Consensus 154 ~~~~~e~a~f~Gd~~l~~~~~~~~gleFDGl~~~I~~~~NviDarG~~l~~~~l~~aa~~i~~~fG~atd~~mp~~vka~ 233 (479) ++..++..++=. +.. +. .+..+..++.+.+.+|....+..-...+-++||+.+.+. T Consensus 116 ~a~~~d~~~~~~---~~~-------------------a~--~~~~~~~~~~d~i~dA~~~l~d~~~~~~~ivv~p~~~~~ 171 (274) T protein:vir:93 116 HANKVDNDVLEA---LMG-------------------AK--LTVNADITKLNGLQSAIDKFNDEDLEPMVLFINPLDAGK 171 (274) T ss_pred HHHHHHHHHHHH---Hhc-------------------cc--ccccccccCHHHHHHHHHHhhhccCCccEEEeCHHHHHH Confidence 888888766521 111 11 111223345555665555545544566779999999988 Q ss_pred HHHhhcCceeEEeecCCCcccc-CccccceecCceeEEecCCcccCCCccccCc-ccC-CCCCcccceEEEeecccccCc Q lcl|NC_018863. 234 FTNNLLDRQRVIQPSQAGGFST-GFSINQFLSTRGAINLHGSTIMENDNILVDR-IPE-PNAPQAPASVVATVKVNDKGA 310 (479) Q Consensus 234 f~q~~~~~qrv~~~~n~g~~~~-G~~V~~~~ss~g~I~L~~s~v~~a~~~lver-~~s-~~aP~~P~~vta~~~~~~~g~ 310 (479) +...-. .+++..+..|+... .-.+..+.+.+--++ +.+-..+.++..+ .+. .... +..+ + +.-.-+ T Consensus 172 L~k~~~--~~f~~~s~~g~~~~~~G~ig~~~G~~Vi~s---~~~p~~t~~l~~~gai~~~~~~--~~~v--E--~~Rd~~ 240 (274) T protein:vir:93 172 LRGDAS--TNFTRATELGDDIIVKGAFGEALGAIIVRT---NKLEAGTAILAKKGAVKLILKR--DFFL--E--VARDAS 240 (274) T ss_pred HHhhhh--hcccccccccccceeecccceecCeeEEEc---CCCCcceEEEEeCCeEEEEecC--Cccc--c--cccchh Confidence 864321 22232222221100 001111111100000 0000111111110 000 0000 0000 0 111111 Q ss_pred ccccccceeeEEEEEEEcCCCCcccccceeeeeecCCCeEEEEEeecCCcc Q lcl|NC_018863. 311 FRPVKDIKTHSYKVVVHSDDAESLASEAVTAVVANPTDSVSLAVKLQSLYQ 361 (479) Q Consensus 311 ~~~~sd~g~Y~YkV~a~n~~GES~~S~~VtaT~a~~~~~V~LtIt~~~~~~ 361 (479) .+...=.+.+.|.+...+.. +.|.|+..-+++.. T Consensus 241 ~~~d~i~~~~~y~~~~~~~~-----------------~~v~~t~~~~s~~~ 274 (274) T protein:vir:93 241 TKTTALYSDKHYVAYLYDES-----------------KAVKITKGSGSLEM 274 (274) T ss_pred hcccEEEEEEEEEEEEEcCC-----------------ceEEEeeCccccCC Confidence 11111123345555444432 23333322222222 No 125 >protein:vir:107687 Length: 319 # NCBI annotation: hypothetical protein # Family: family:all:463 # MgeID: mge:1518 # MgeName: T1 # Cross-refs: genbank:acc:YP_003898;genbank:gi:45686314;genbank:GeneID:2773027 Probab=50.65 E-value=0.6 Score=21.79 Aligned_cols=275 Identities=13% Similarity=0.142 Sum_probs=123.1 Q ss_pred CcccccccceeeeecCchhHHHHHHHHHHHhhcCcccCcccccCccccch---hhhHHHHHHHhhccc-cccchhhhccc Q lcl|NC_018863. 1 MTELQKEQKVEARKLPAGAEAELAELVSKSFTTGTGITPDTQHDAAALRR---ELLDDQVKMLAFTNG-DFTIYPLINKQ 76 (479) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~e~~~Ksf~ag~~~~~~~~~~gaAlr~---esld~~i~~l~~~~~-~f~~~~~i~k~ 76 (479) |. .+++ .|+++ -.+. ....-.+..-|++.+.+.+-. |-+|+.+....+..- --.|+.-...- T Consensus 1 ~~----~~~~--------~~~~~-~~~~-~~~~~~~~~~da~~~~g~~~~~ql~~id~~v~e~~~~~l~~~~~i~v~~~~ 66 (319) T protein:vir:10 1 MT----TKKF--------DEADK-SNVE-MYLIQAGVKQDAAATMGIWTAQELHRIKSQSYEEDYPVGSALRVFPVTTEL 66 (319) T ss_pred CC----Ccch--------hHHhh-HHHH-HHHhhccchhhhhhhhhhHHHHHHHHHHHHHHhhhhcceechhhcccccCC Confidence 22 1111 12222 1111 111113356667777777876 456666655544331 11223222121 Q ss_pred hhHHHHHHhhhhhccCcccccccccc-cccccccCcceEEEEEEEEeeeehhhhhhhHhh--hcchhhHHHHHHHHHHHH Q lcl|NC_018863. 77 QVNSTVAKYAVFNQHGRTGHSRFVRE-VGVASINDPNIRQKTVQMKFLSDTKQQSLAAGL--VNNIADPMTILTEDAISV 153 (479) Q Consensus 77 ~~~stv~~y~~~~~~G~~g~~~fv~E-~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~l--v~~~~dp~~~~~~~ai~~ 153 (479) .+-.....|..+. ..|....++. ....+..|.++.|++..+..++..+.++..--. ...-.+..+..-..|.+. T Consensus 67 ~~~~~~~~~~~~~---~~G~a~~~~d~~~dip~v~~~~~~~~~~i~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aA~~~ 143 (319) T protein:vir:10 67 SPTDKTFEYMTFD---KVGTAQIIADYTDDLPLVDALGTSEFGKVFRLGNAYLISIDEIKAGQATGRPLSTRKASACQLA 143 (319) T ss_pred CCceEEEEeeeec---cccceeeecCccccccceeccceeeEEEEEEEEeeeeecHHHHHHHHHhCCChHHHHHHHHHHH Confidence 2221112232333 3444445554 444578889999999999999999998763222 233346667778888888 Q ss_pred HHHHHHHHHhhcccccCCCCCCcccchhhhHHHhhccCCcEEE-ccCCCC---C-H---HHhhhh---hheeecccCcee Q lcl|NC_018863. 154 IAKSIEWAIFYGDAALAAEADNQAGIEFDGLTKLIDEATNVID-LKGERL---D-E---ATLNKA---AVIVGKGYGRAT 222 (479) Q Consensus 154 ~~~~~e~a~f~Gd~~l~~~~~~~~gleFDGl~~~I~~~~NviD-arG~~l---~-~---~~l~~a---a~~i~~~fG~at 222 (479) +.+.....+||||+.+. +.||.+-= .-..+- ..|... + + +.|+++ ....++++-.++ T Consensus 144 ~~~~~n~i~f~G~~~~g----------~~GLlN~p--~~~~~~~~~~~~~~t~t~~~i~~di~~~~~~l~~~s~g~~~p~ 211 (319) T protein:vir:10 144 HDQLVNRLVFKGSAPHK----------IVSVFNHP--NITKITSGKWIDVSTMKPETAEAELTQAIETIETITRGQHRAT 211 (319) T ss_pred HHHhhceEEEeeccccc----------ceeEEeCC--CceeeecCCCCCccccCHHHHHHHHHHHHHHHHHhcCceeece Confidence 99999999999988754 34554421 111222 222222 2 2 224443 234567788999 Q ss_pred eeecChHHHhhHHHhhcCcee----EEeecCCC-----------ccccCccc-cceecCceeEEecCCccc-------CC Q lcl|NC_018863. 223 DAFMPIGVQADFTNNLLDRQR----VIQPSQAG-----------GFSTGFSI-NQFLSTRGAINLHGSTIM-------EN 279 (479) Q Consensus 223 d~~mp~~vka~f~q~~~~~qr----v~~~~n~g-----------~~~~G~~V-~~~~ss~g~I~L~~s~v~-------~a 279 (479) .+.||+.....+..-+.+.-. .+..++++ ....|.+. --+....-.+.++-..-+ .. T Consensus 212 ~L~L~p~~~~~L~~~~~~~~~t~l~~lk~~~~~l~I~~~pel~~ag~~g~~~~v~y~~~~~~~~~~v~~~~~~~~~e~~~ 291 (319) T protein:vir:10 212 NILIPPSMRKVLAIRMPETTMSYLDYFKSQNSGIEIDSIAELEDIDGAGTKGVLVYEKNPMNMSIEIPEAFNMLPAQPKD 291 (319) T ss_pred EEEecHHHHHhhhcccCCCCeeHHHHHHHhcCCceEEEeeeecccCCCcceEEEEEecCCceEEEecCcceeeeeeeecC Confidence 999999998888643321100 00111110 00001000 011111112222210000 00 Q ss_pred CccccCcccCCCC---CcccceEEEeecccccCcccccccc Q lcl|NC_018863. 280 DNILVDRIPEPNA---PQAPASVVATVKVNDKGAFRPVKDI 317 (479) Q Consensus 280 ~~~lver~~s~~a---P~~P~~vta~~~~~~~g~~~~~sd~ 317 (479) -++.+. .....+ -.-|-+.... .++ T Consensus 292 l~~~~~-~~~r~~Gv~i~~P~ai~~~------------dGI 319 (319) T protein:vir:10 292 LHFKVP-CTSKCTGLTIYRPMTIVLI------------TGV 319 (319) T ss_pred ceEEEe-eeeeeEEEEEEccceeEee------------ecC Confidence 111110 000000 0112222211 112 No 126 >protein:vir:93881 Length: 387 # NCBI annotation: ORF011 # Family: family:all:658 # MgeID: mge:1485 # MgeName: 3A # Cross-refs: genbank:acc:YP_239938;genbank:gi:66395599;genbank:GeneID:5130947 Probab=39.00 E-value=1 Score=20.49 Aligned_cols=303 Identities=9% Similarity=0.028 Sum_probs=117.6 Q ss_pred Cccccccc--ceeeee---cCchhHHHHHH---HHHHHhhcCc-c-------------cCcccccCccccchhhhHHHHH Q lcl|NC_018863. 1 MTELQKEQ--KVEARK---LPAGAEAELAE---LVSKSFTTGT-G-------------ITPDTQHDAAALRRELLDDQVK 58 (479) Q Consensus 1 ~~~~~~~~--~~~~~~---~~~~~~~~~~e---~~~Ksf~ag~-~-------------~~~~~~~~gaAlr~esld~~i~ 58 (479) +.++..+. ++.... ...+.+..-.. .+.+++..+. . .+..+-++|+.|.++.+..+|. T Consensus 60 ~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~~~~~al~~~t~s~gG~~IP~~~~~~Ii 139 (387) T protein:vir:93 60 VKDIEEKEKAKVKDTGEAYQSLNDHEKMVKAKAEFYRHAILPNEFEKPSMEAQRLLHALPTGNDSGGDKLLPKTLSKEIV 139 (387) T ss_pred HHHHHHHHHHhhhhccccCCCcchhhHHHHHHHHHHHHHhhhhhhhhhhhhhHHHHHhhccCcCCCCceeechhHHHHHH Confidence 11110000 000000 00011111111 1222222111 0 1112346688899999988887 Q ss_pred HHhhccccccchhhhccchhHHHHHHhhhhhccCcccccccccccccccccCcceEEEEEEEEeeeehhhhhhhHhhhcc Q lcl|NC_018863. 59 MLAFTNGDFTIYPLINKQQVNSTVAKYAVFNQHGRTGHSRFVREVGVASINDPNIRQKTVQMKFLSDTKQQSLAAGLVNN 138 (479) Q Consensus 59 ~l~~~~~~f~~~~~i~k~~~~stv~~y~~~~~~G~~g~~~fv~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lv~~ 138 (479) .+......+ .+-+...++.+ +. +.+. . +..+...+++|++..+.+++.+......++=++.-..+|.- -|.++ T Consensus 140 ~~~~~~~~l--~~~~~v~~~~~-~~-~p~~-~-~~~~~a~~v~E~~~~~~~~~~f~~v~~~~~k~~~~~~iS~e-ll~Ds 212 (387) T protein:vir:93 140 SEPFAKNQL--REKARLTNIKG-LE-IPRV-S-YTLDDDDFITDVETAKELKLKGDTVKFTTNKFKVFAAISDT-VIHGS 212 (387) T ss_pred HHHHhhchh--hhheeeeecCC-ce-EEEE-e-ecCCccccccCcccccccccccceeeeeheeeeeechhhHH-HHhhh Confidence 666655533 22222222222 11 2111 1 12223568999999999999999988888877776566633 12455 Q ss_pred hhhHHHHHHHHHHHHHHHHHHHHHhhcccccCCCCCCcccchhhhHHHhhccCCcEEEccCCCCCHHHhhhhhheeeccc Q lcl|NC_018863. 139 IADPMTILTEDAISVIAKSIEWAIFYGDAALAAEADNQAGIEFDGLTKLIDEATNVIDLKGERLDEATLNKAAVIVGKGY 218 (479) Q Consensus 139 ~~dp~~~~~~~ai~~~~~~~e~a~f~Gd~~l~~~~~~~~gleFDGl~~~I~~~~NviDarG~~l~~~~l~~aa~~i~~~f 218 (479) ..|.+....+.-...+.+..+..+|-+..+.+ +..|+..- ..+-...|..+ .+.|.++---+...| T Consensus 213 ~~~l~~~i~~~la~~~~~~e~~~~~~~g~g~g---------~p~g~l~~----~~~~~v~~~~~-~d~i~~~~~~l~~~~ 278 (387) T protein:vir:93 213 DVDLVNWVENALQSGLAAKERKDALAVSPKSG---------LDHMSFYN----GSVKEVEGADM-YDAIINALADLHEDY 278 (387) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhHhhcCCCcc---------ccceeeec----cccccccccch-HHHHHHHHhccChhh Confidence 66777777777777777765555553332211 12232210 00111122222 222332222233344 Q ss_pred CceeeeecChHHHhhHHHhhcCceeEEeecCCCccccCccccceecCceeEEecCCc--c-cCCCccccCcccCCCCCcc Q lcl|NC_018863. 219 GRATDAFMPIGVQADFTNNLLDRQRVIQPSQAGGFSTGFSINQFLSTRGAINLHGST--I-MENDNILVDRIPEPNAPQA 295 (479) Q Consensus 219 G~atd~~mp~~vka~f~q~~~~~qrv~~~~n~g~~~~G~~V~~~~ss~g~I~L~~s~--v-~~a~~~lver~~s~~aP~~ 295 (479) -.....+|+..+...+...+-+..+-++..++. .-.|.+|....+. ..+ +.|+- . ....+...++....... T Consensus 279 ~~~a~~~mn~~t~~~~~~~~~d~~~~~~~~~~~-~llG~PV~~~~~~-~~~-~~GDf~~~~~~~~~~~~~~~~~~~~~-- 353 (387) T protein:vir:93 279 RDNATIYMRYADYVKIISVLSNGTTNFFDTPAE-KVFGKPVVFTDAA-VKP-IVGDFNYFGINYDGTTYDTDKDVKKG-- 353 (387) T ss_pred hcCCEEEEechHHHHHHHHHhcCCCcccccCCc-cccccceEEecCC-Cce-eeeehhhhheehhhheeeecccccCC-- Confidence 433345676666555544443333333333332 2335555322111 010 11110 0 00001111111000000 Q ss_pred cceEEEeecccccCcccccccceeeEEEEEEEcCCCCcccc Q lcl|NC_018863. 296 PASVVATVKVNDKGAFRPVKDIKTHSYKVVVHSDDAESLAS 336 (479) Q Consensus 296 P~~vta~~~~~~~g~~~~~sd~g~Y~YkV~a~n~~GES~~S 336 (479) .+.-....-..|... .. -..++.-+-...-|.|| T Consensus 354 --~~~~~~~~r~d~~v~-~~----eA~~~l~~k~~~~~~~~ 387 (387) T protein:vir:93 354 --EYLFVLTAWYDQQRT-LD----SAFRIAKAKENTGSLPS 387 (387) T ss_pred --ceeEEEEeeeCceee-ch----hheEEEEeecCCCCCCC Confidence 000000000011100 00 01222233333334444 No 127 >protein:vir:80068 Length: 301 # NCBI annotation: gp8 # Family: family:all:463 # MgeID: mge:1876 # MgeName: B054 # Cross-refs: genbank:acc:YP_001468712;genbank:gi:157325292;genbank:GeneID:5601759 Probab=37.81 E-value=1.1 Score=20.36 Aligned_cols=253 Identities=11% Similarity=0.054 Sum_probs=111.1 Q ss_pred ccccCccccch---hhhHHHHHHHhhccccc-cchhhhccchhHHHHHHhhhhhccCccccccccccc-ccccccCcceE Q lcl|NC_018863. 40 DTQHDAAALRR---ELLDDQVKMLAFTNGDF-TIYPLINKQQVNSTVAKYAVFNQHGRTGHSRFVREV-GVASINDPNIR 114 (479) Q Consensus 40 ~~~~~gaAlr~---esld~~i~~l~~~~~~f-~~~~~i~k~~~~stv~~y~~~~~~G~~g~~~fv~E~-g~~~~~d~~~~ 114 (479) --..+.|++-. |-+|+++....+..-.+ .|+.-..+-.+-.....|... ...|....+++. ...+..|.++. T Consensus 1 ~~~~~~g~f~~~~l~~id~~v~e~~~~~l~~r~l~~v~~~~~~~~~~~~~~~~---~~~G~~~~~~~~~~dip~~~~~~~ 77 (301) T protein:vir:80 1 MQGKITATIEARDLQAIDNVIYEPKQEELTARSVFPQKFDVNEGAESYSFDVM---TRSGAAKIIANGADDLPLVDVDMV 77 (301) T ss_pred CCccccchhhHHHHHHHHHHHHHhhhhhhhhhhhcccccCCCCceEEEEEeee---ccceeEEEecCcccccccccccce Confidence 01112234333 34555555444433222 122222222222222233222 233444444443 33477889999 Q ss_pred EEEEEEEeeeehhhhhhh--HhhhcchhhHHHHHHHHHHHHHHHHHHHHHhhcccccCCCCCCcccchhhhHHHhhc--c Q lcl|NC_018863. 115 QKTVQMKFLSDTKQQSLA--AGLVNNIADPMTILTEDAISVIAKSIEWAIFYGDAALAAEADNQAGIEFDGLTKLID--E 190 (479) Q Consensus 115 r~~~~~k~l~~~~~vs~~--~~lv~~~~dp~~~~~~~ai~~~~~~~e~a~f~Gd~~l~~~~~~~~gleFDGl~~~I~--~ 190 (479) |++..+.-+...+.++.. ......-.+..+.....|.+.+.+.....+||||+.++ +.||.+-=. . T Consensus 78 ~~~~~i~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aa~~~~~~~~n~~~f~G~~~~g----------~~GLlN~p~~~~ 147 (301) T protein:vir:80 78 RKSVPIYSIGIGLSYTIQDLRAARMQGTTVDAAKATTVRRAIAEKENSIAFRGEKKYA----------IKGAFEATGIQI 147 (301) T ss_pred eEEEEEEEEEeeeeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEeeeccccc----------ceeeecCCCccc Confidence 999999999998887754 33334445777888888999999999999999998865 346655321 0 Q ss_pred CCcEEEccCCCC-------C--HHHhhhhhhe---eecccCceeeeecChHHHhhHHHhhcC----------------ce Q lcl|NC_018863. 191 ATNVIDLKGERL-------D--EATLNKAAVI---VGKGYGRATDAFMPIGVQADFTNNLLD----------------RQ 242 (479) Q Consensus 191 ~~NviDarG~~l-------~--~~~l~~aa~~---i~~~fG~atd~~mp~~vka~f~q~~~~----------------~q 242 (479) ....-+..|+.- + .+.|+++-.. .++++-.+..+.||+.....+..-+.. .- T Consensus 148 ~~~~~~~~~~~~~w~~~t~~ei~~di~~~~~~l~~~s~g~~~p~~L~L~p~~~~~L~~~~~~~~~~~tvl~~l~~~~~~~ 227 (301) T protein:vir:80 148 DVSPTTGVGNVSKWEKKTAEQIIDEIGEAHTKITVLPGYGTASLKLCLPPKQFELINKKRYSNEDSRSVLKVLQDNAWFS 227 (301) T ss_pred ccccCcccccccccccCCHHHHHHHHHHHHHHHHHhcCceecccEEEecHHHHHhhhhccccCCCCeeHHHHHHHHcCcc Confidence 001112222221 1 2345554422 344555788999999998888643211 11 Q ss_pred eEEe-ecCCCccccCcccc-ceecCceeEEecCCcc-------cCCCccccCcccCCCC--CcccceEEEeecccccCcc Q lcl|NC_018863. 243 RVIQ-PSQAGGFSTGFSIN-QFLSTRGAINLHGSTI-------MENDNILVDRIPEPNA--PQAPASVVATVKVNDKGAF 311 (479) Q Consensus 243 rv~~-~~n~g~~~~G~~V~-~~~ss~g~I~L~~s~v-------~~a~~~lver~~s~~a--P~~P~~vta~~~~~~~g~~ 311 (479) ++.. |-=.+....|.+.- .+....=.+.++-..- .....+.+.....--+ -.-|-+.... T Consensus 228 ~I~~~p~L~~~g~~g~~~~v~~~~~~d~~~~~v~~~~~~~~~e~~~~~~~~~~~~r~~Gv~i~~P~ai~~~--------- 298 (301) T protein:vir:80 228 AIVRVPDLAGMGTAGSDSFAVIHDSNETAELIIPMDITRHPEEYSFPRTKVPFEERTAGVVVRFPAAIVRV--------- 298 (301) T ss_pred eEEEcceeccCCCCcccEEEEEecCCcEEEEEecCceeeecceecCceeEeeeeeeeEEEEEEccceEEEE--------- Confidence 1111 11001000111100 0110000122221100 0011111110000000 0012222211 Q ss_pred cccccc Q lcl|NC_018863. 312 RPVKDI 317 (479) Q Consensus 312 ~~~sd~ 317 (479) .++ T Consensus 299 ---~GI 301 (301) T protein:vir:80 299 ---DGI 301 (301) T ss_pred ---ecC Confidence 112 No 128 >protein:vir:101607 Length: 379 # NCBI annotation: major capsid protein precursor # Family: family:all:585 # MgeID: mge:1646 # MgeName: 11b # Cross-refs: genbank:acc:YP_112497;genbank:gi:53793597;uniprot:Q5ZGF6;genbank:GeneID:3101715 Probab=34.42 E-value=1.3 Score=19.97 Aligned_cols=308 Identities=11% Similarity=0.002 Sum_probs=117.9 Q ss_pred Cccccccc-ceeeeecCchh-HHHHHHH--H------HHHhhcCcccCcccccCccccchhhhHHHHHHHhhccccccch Q lcl|NC_018863. 1 MTELQKEQ-KVEARKLPAGA-EAELAEL--V------SKSFTTGTGITPDTQHDAAALRRELLDDQVKMLAFTNGDFTIY 70 (479) Q Consensus 1 ~~~~~~~~-~~~~~~~~~~~-~~~~~e~--~------~Ksf~ag~~~~~~~~~~gaAlr~esld~~i~~l~~~~~~f~~~ 70 (479) +.++..+. +-...+..... .....+. . .++..+...-...+.++++.+-.+.+...|..+... ...+. T Consensus 61 ~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ip~~~~~~ii~~~~~--~~~i~ 138 (379) T protein:vir:10 61 ADKLDVKLKEKAKSEDKSDSLVKSITENFNDIKEVRNGKSIQVKAVGDMTLPVNLTGAQPKDYNFDVVLNPSQ--MLNVS 138 (379) T ss_pred HHHHHHHHHhcccccccchhHHHHHHHHHHhHHHHHhhhhhhhhhhcccccCCCCccccchhhhhHHHHhHHh--hhhHH Confidence 11100000 00000000000 0000000 0 011111111111223344445555555555333322 22233 Q ss_pred hhhccchhHHHHHHhhhhhccCcccccccccccccccccCcceEEEEEEEEeeeehhhhhhhHhhhcchhhHHHHHHHHH Q lcl|NC_018863. 71 PLINKQQVNSTVAKYAVFNQHGRTGHSRFVREVGVASINDPNIRQKTVQMKFLSDTKQQSLAAGLVNNIADPMTILTEDA 150 (479) Q Consensus 71 ~~i~k~~~~stv~~y~~~~~~G~~g~~~fv~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lv~~~~dp~~~~~~~a 150 (479) +-+...+..+.--+|.+....++ +...++.|++..+..++.+.+....++=++.-..+|.-+ +.++ .+.+....+.- T Consensus 139 ~~~~~~~~~~~~~~~~~~~~~~~-~~~~~v~Eg~~~~~~~~~f~~i~~~~~k~~~~~~iS~el-l~D~-~~l~~~i~~~l 215 (379) T protein:vir:10 139 DIVGAVSISGGTYTFVRENGAGE-GAIGAQVEGATKGQKDYDISMIDVNTDFIAGFTRYSKKM-ANNL-PFLTSFIPNAL 215 (379) T ss_pred hhceeeeccCCceEEEEeecCCC-cccccccCCccccccccceeeeEeeeeeEEeeehhhHHH-HhhH-HHHHHHHHHHH Confidence 33333333333234433332222 234578999999999999999999999999888888765 4443 35666666667 Q ss_pred HHHHHHHHHHHHhhcccccCCCCCCcccchhhhHHHhhccCCcEEEccCCCCCHHHhhhhhheeecccCceeeeecChHH Q lcl|NC_018863. 151 ISVIAKSIEWAIFYGDAALAAEADNQAGIEFDGLTKLIDEATNVIDLKGERLDEATLNKAAVIVGKGYGRATDAFMPIGV 230 (479) Q Consensus 151 i~~~~~~~e~a~f~Gd~~l~~~~~~~~gleFDGl~~~I~~~~NviDarG~~l~~~~l~~aa~~i~~~fG~atd~~mp~~v 230 (479) ...+++.++.+++-|+..-...+ .... -...+.+.|.++...+...+-.++-..|++.+ T Consensus 216 a~~~~~~~~~~~~~g~~~~~~~~----------~~~~-----------~~~~~~d~i~~~~~~~~~~~~~~~~~vmn~~~ 274 (379) T protein:vir:10 216 RRDYAKAENAAFNAVLAANATAS----------TEII-----------TNKNKVEMLINEIAKQENLDFPVTAIVLRPTD 274 (379) T ss_pred HHHHHHHHHHHHhcccccccccc----------cccc-----------cCcccHHHHHHHHHhhhhccCCCCEEEEcHHH Confidence 77888888988887765422111 0000 01223445555555555667777778999988 Q ss_pred HhhHHHhhcCceeE-EeecCCCccccCccccceecCceeEEecCCcccCCCccccCcccCCCCCcccceEEEeecccccC Q lcl|NC_018863. 231 QADFTNNLLDRQRV-IQPSQAGGFSTGFSINQFLSTRGAINLHGSTIMENDNILVDRIPEPNAPQAPASVVATVKVNDKG 309 (479) Q Consensus 231 ka~f~q~~~~~qrv-~~~~n~g~~~~G~~V~~~~ss~g~I~L~~s~v~~a~~~lver~~s~~aP~~P~~vta~~~~~~~g 309 (479) ...+...-...-+. .+++..... .+.-.|.|=+|...+ .-| .| T Consensus 275 ~~~l~~lkd~~G~~l~~~~~~~~~------------~~~~~l~G~pvv~s~----------~~~--------------ag 318 (379) T protein:vir:10 275 YYDILVTQKSVGAGYGLPGVVTQD------------NGVLRINGIPLFRAT----------WLA--------------AN 318 (379) T ss_pred HHHHHHhhccCCceeccCCccCCC------------CCcceecceeeEecC----------CCC--------------CC Confidence 77776654332222 222211000 011122222221110 000 01 Q ss_pred cccccccceeeEEEEEEEcCCCCcccccceeeeeecCCCeEEEEEeecCCccccceEEEEEeccCCCCcEEEEEEe Q lcl|NC_018863. 310 AFRPVKDIKTHSYKVVVHSDDAESLASEAVTAVVANPTDSVSLAVKLQSLYQAKPQFISVYRQGNETGHYFLVARV 385 (479) Q Consensus 310 ~~~~~sd~g~Y~YkV~a~n~~GES~~S~~VtaT~a~~~~~V~LtIt~~~~~~~~~~y~~IYR~t~~~g~~~~i~rV 385 (479) +.+ .|.+++-+... +.|-+......... ....+.+.+.+.-- .+ ..|.|..+ -.+..+.-| T Consensus 319 ~~~----~gdf~~~~~~~-~~~~~i~~~~~~~~-~f~~~~~~~r~~~R-~~------~~v~~p~a--~v~~~~~~~ 379 (379) T protein:vir:10 319 KYY----VGDWTRVTKVT-TEGLSLEFSEVEGT-NFVKNNITARIEAQ-VA------LAVEQPAA--LIFGDFTAV 379 (379) T ss_pred ceE----EeecccEEEEE-EeceEEEEeecccc-cccCCcEEEEEEEE-ec------cEEecCcc--EEEEEecCC Confidence 110 01111111111 11111000000000 00111111110000 00 00000000 000000011 No 129 >protein:vir:100172 Length: 394 # NCBI annotation: putative major head protein # Family: family:all:21 # MgeID: mge:1524 # MgeName: phi AT3 # Cross-refs: genbank:acc:YP_025031;genbank:gi:48697264;genbank:GeneID:2948270 Probab=28.74 E-value=1.7 Score=19.29 Aligned_cols=315 Identities=13% Similarity=0.122 Sum_probs=129.6 Q ss_pred Cccccccc--ceeee-ecCc---hh-HHHHHHHHHHHhhcCc--ccCcccccCccccchhhhHHHHHHHhhccccccchh Q lcl|NC_018863. 1 MTELQKEQ--KVEAR-KLPA---GA-EAELAELVSKSFTTGT--GITPDTQHDAAALRRELLDDQVKMLAFTNGDFTIYP 71 (479) Q Consensus 1 ~~~~~~~~--~~~~~-~~~~---~~-~~~~~e~~~Ksf~ag~--~~~~~~~~~gaAlr~esld~~i~~l~~~~~~f~~~~ 71 (479) .+...+.. +.... .... .. .+.+ ..+.|.....- .....+..+|+.|-.+.+...|..+...... +.+ T Consensus 67 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~l~~~~~~~~~~~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~--l~~ 143 (394) T protein:vir:10 67 NSDPDKPVDNAQPNGTDLKKKPIDAKKKAI-NDFIHSHGKVIDNAAGHVTSTEAGVLIPEEIIYDPTAEVNSVVD--LST 143 (394) T ss_pred hcchhhhhhhhcccccchhhhHHHHHHHHH-HHHHhccchhhhhhhcccccccCceeccHHHHHHHHHHHHhhhh--hhh Confidence 11111100 00000 0001 01 1122 22222211000 0111234567788888888887666555443 344 Q ss_pred hhccchhHHHHHHhhhhhccCccccccccccccccc-ccCcceEEEEEEEEeeeehhhhhhhHhhhcchhhHHHHHHHHH Q lcl|NC_018863. 72 LINKQQVNSTVAKYAVFNQHGRTGHSRFVREVGVAS-INDPNIRQKTVQMKFLSDTKQQSLAAGLVNNIADPMTILTEDA 150 (479) Q Consensus 72 ~i~k~~~~stv~~y~~~~~~G~~g~~~fv~E~g~~~-~~d~~~~r~~~~~k~l~~~~~vs~~~~lv~~~~dp~~~~~~~a 150 (479) .+...++.+.--+|... ..+.+...+++|++... .+++.+.+....++=++.-..+|.-+ +.++..|.+....+.- T Consensus 144 ~~~~~~~~~~~~~~~~~--~~~~~~~~~~~E~~~~~~~~~~~~~~v~l~~~k~~~~~~iS~el-l~ds~~~l~~~i~~~l 220 (394) T protein:vir:10 144 LVTKTPVTTPKGTYPIL--KRATDRFSSVAELAENPALAEPEFEQVDWSVSTYRGAIPLSEEA-IADSAVDLTSLVGQSI 220 (394) T ss_pred hceeeeccCCceEEEEE--ecCCCccccccccccccccccccceeEEeeeeeeEeeehhHHHH-HhhhhHHHHHHHHHHH Confidence 44444444333333222 22334567899987765 68899999999999888777777653 3345567777888888 Q ss_pred HHHHHHHHHHHHhhcccccCCCCCCcccchhhhHHHhhccCCcEEEccCCCCCHHHhhhhhheeecccCceeeeecChHH Q lcl|NC_018863. 151 ISVIAKSIEWAIFYGDAALAAEADNQAGIEFDGLTKLIDEATNVIDLKGERLDEATLNKAAVIVGKGYGRATDAFMPIGV 230 (479) Q Consensus 151 i~~~~~~~e~a~f~Gd~~l~~~~~~~~gleFDGl~~~I~~~~NviDarG~~l~~~~l~~aa~~i~~~fG~atd~~mp~~v 230 (479) ...++..++.++..|+..-.+... ....-+|-|...+. .....+|. .-.+|++.+ T Consensus 221 a~~~~~~~~~~il~g~g~~~~~~~-~~~~~~d~l~~~~~----------------------~~~~~~~~--a~~vmn~~~ 275 (394) T protein:vir:10 221 NEKSVNTYNAMIAPVLQSFTAKAT-TTDTLVDSLKHILN----------------------VDLDPAYS--RALVVTQSL 275 (394) T ss_pred HHHHHHHHHHHHhhcccccccccc-cccccHHHHHHHHH----------------------hhhhhhcc--CEEEecHHH Confidence 888999999999999876443211 11223333333321 11112222 248899988 Q ss_pred HhhHHHhhcCceeEEeecCCCccccCccccceecCceeEEecCCcccCCCccccCcccCCCCCcccceEEEeecccccCc Q lcl|NC_018863. 231 QADFTNNLLDRQRVIQPSQAGGFSTGFSINQFLSTRGAINLHGSTIMENDNILVDRIPEPNAPQAPASVVATVKVNDKGA 310 (479) Q Consensus 231 ka~f~q~~~~~qrv~~~~n~g~~~~G~~V~~~~ss~g~I~L~~s~v~~a~~~lver~~s~~aP~~P~~vta~~~~~~~g~ 310 (479) ...+...-...-|-+...+..+...+.. ..++++.+.+.++...-+.+...+... T Consensus 276 ~~~l~~lkd~~G~~i~~~~~~~~~~~~~--------------~~~L~G~PV~~~~~~~~~~~~~~~~i~----------- 330 (394) T protein:vir:10 276 FNTLDTLKDKNGRYLLHDASDSITDGTA--------------KGTVLGVPVYVVGDALLGSAAGDQKAF----------- 330 (394) T ss_pred HHHHHHhhccCCCeeeeccccccccCCc--------------ccccccceeEEecccccCCCCCceEEE----------- Confidence 8887765543333333222222111110 012222232222210000000000000 Q ss_pred ccccccceeeEEEEEEEcCCCCcccccceeeeeecCCCeEEEEEeecCCccccceEEEEEecc---CCCCcEEEEEEeee Q lcl|NC_018863. 311 FRPVKDIKTHSYKVVVHSDDAESLASEAVTAVVANPTDSVSLAVKLQSLYQAKPQFISVYRQG---NETGHYFLVARVPL 387 (479) Q Consensus 311 ~~~~sd~g~Y~YkV~a~n~~GES~~S~~VtaT~a~~~~~V~LtIt~~~~~~~~~~y~~IYR~t---~~~g~~~~i~rV~~ 387 (479) .|..+..+....+.+-+.. ....... ..-++.++|=. ..+..+.++.--+. T Consensus 331 ------~gd~s~~~~~~~~~~~~v~------------------~~~~~~~--~~~~~~~~r~d~~~~~~~ai~~~~~~~~ 384 (394) T protein:vir:10 331 ------VGDLKRGVLFADRQQVTLA------------------WEDSKIY--GRYLGAAFRFGVKQADSNAGYFVTNTDA 384 (394) T ss_pred ------EeeccccEEEEeecceEEE------------------Eeccccc--ceeEEEEEEeccEEeccccEEEEEeecc Confidence 0111100011111111100 0000000 00011111211 01122222211111 Q ss_pred eeccCCCeeEEEec Q lcl|NC_018863. 388 SKADENGVITFVDR 401 (479) Q Consensus 388 s~~n~~~tttf~D~ 401 (479) + .+.+.++.. T Consensus 385 ~----~~~~~~~~~ 394 (394) T protein:vir:10 385 A----SGSTSGTGK 394 (394) T ss_pred c----CCCCCCCCC Confidence 1 233333332 No 130 >protein:vir:78640 Length: 352 # NCBI annotation: phage capsid # Family: family:all:658 # MgeID: mge:1855 # MgeName: tp310-2 # Cross-refs: genbank:acc:YP_001429943;genbank:gi:156603997;genbank:GeneID:5525386 Probab=26.75 E-value=1.9 Score=19.04 Aligned_cols=301 Identities=10% Similarity=0.046 Sum_probs=117.4 Q ss_pred Ccccccccc--eeee----ecCchhHHH---HHHHHHHHhhcCc-------------ccCcccccCccccchhhhHHHHH Q lcl|NC_018863. 1 MTELQKEQK--VEAR----KLPAGAEAE---LAELVSKSFTTGT-------------GITPDTQHDAAALRRELLDDQVK 58 (479) Q Consensus 1 ~~~~~~~~~--~~~~----~~~~~~~~~---~~e~~~Ksf~ag~-------------~~~~~~~~~gaAlr~esld~~i~ 58 (479) +.++..|++ .... ..+...+.. ..|.+.+.+.... ..+..+.++|+.|.++.+..+|. T Consensus 25 ~d~~e~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~~~~~al~~~~~~~gG~lIP~~~~~~Ii 104 (352) T protein:vir:78 25 VQDIEEKEKAKVKDKGEAYQSLNDNEKLVKAKAEFYRHAILPNEFEKPSMEAQRLLHALPTGNDSGGDKLLPKTLSKEIV 104 (352) T ss_pred HHHHHHHHHHHhhhccccccccchhhhHHHHHHHHHHHHhhhhHHHHHHhhHHHHHHHhccCCCCCCceeccHhHHHHHH Confidence 111111110 0000 011111111 1111111111000 01122456788899999988876 Q ss_pred HHhhccccccchhhhccchhHHHHHHhhhhhccCcccccccccccccccccCcceEEEEEEEEeeeehhhhhhhHhhhcc Q lcl|NC_018863. 59 MLAFTNGDFTIYPLINKQQVNSTVAKYAVFNQHGRTGHSRFVREVGVASINDPNIRQKTVQMKFLSDTKQQSLAAGLVNN 138 (479) Q Consensus 59 ~l~~~~~~f~~~~~i~k~~~~stv~~y~~~~~~G~~g~~~fv~E~g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lv~~ 138 (479) .+...... +.+-+....+.+ .. +.. .-+..+...+++|++..+..++++.+....++=++.-..+|.-+ |.++ T Consensus 105 ~~l~~~s~--l~~~~~v~~~~~-~~-~p~--~~~~~~~a~~v~E~~~~~~~~~~f~~v~~~~~k~~~~i~is~el-l~Ds 177 (352) T protein:vir:78 105 SEPFAKNQ--LREKARLTNIKG-LE-IPR--VSYTLDDDDFITDVETAKELKLKGDTVKFTTNKFKVFAAISDTV-IHGS 177 (352) T ss_pred HHHHhhcc--hhhheeeEecCC-ce-EEE--EecCCCcccccccccccccccccceeeeecceeEEeechhhHHH-Hhhh Confidence 55444332 222222222222 11 111 11223356789999999999999999999988887766666552 2345 Q ss_pred hhhHHHHHHHHHHHHHHHHHHHHHhhcccccCCCCCCcccchhhhHHHhhccCCcEEEccCCCCCHHHhhhhhheeeccc Q lcl|NC_018863. 139 IADPMTILTEDAISVIAKSIEWAIFYGDAALAAEADNQAGIEFDGLTKLIDEATNVIDLKGERLDEATLNKAAVIVGKGY 218 (479) Q Consensus 139 ~~dp~~~~~~~ai~~~~~~~e~a~f~Gd~~l~~~~~~~~gleFDGl~~~I~~~~NviDarG~~l~~~~l~~aa~~i~~~f 218 (479) ..|.+....+.-...+++. |-.++||+-.=. ++ ..|+..- ..+--..|..+ .+.|..+---+...| T Consensus 178 ~~~l~~~i~~~la~~~~~~-e~~~~~~~g~g~--~~------~~g~l~~----~~~~~~t~~~~-~d~i~~~~~~l~~~~ 243 (352) T protein:vir:78 178 DVDLVNWVENALQSGLAAK-ERKDALAVSPKS--GL------EHMSFYN----GSVKEVEGANM-YDAIINALADLHEDY 243 (352) T ss_pred hHHHHHHHHHHHHHHHHHH-HHHhhhhcCCCC--cc------cccceec----cccccccccch-HHHHHHHHhccChhh Confidence 5666666666555556554 555555543211 11 1122110 00111122222 223332222233334 Q ss_pred CceeeeecChHHHhhHHHhhcCceeEEeecCCCccccCccccceecCceeEEecCCcccCCCcccc--CcccCCCCCcc- Q lcl|NC_018863. 219 GRATDAFMPIGVQADFTNNLLDRQRVIQPSQAGGFSTGFSINQFLSTRGAINLHGSTIMENDNILV--DRIPEPNAPQA- 295 (479) Q Consensus 219 G~atd~~mp~~vka~f~q~~~~~qrv~~~~n~g~~~~G~~V~~~~ss~g~I~L~~s~v~~a~~~lv--er~~s~~aP~~- 295 (479) -.-.-.+|...+...+.....+..+-+...++.. -.|.+|....+. ..| +.|+ -..++. ++..- -++. T Consensus 244 ~~~a~~~mn~~t~~~l~~~~~~~~~~~~~~~~~~-llG~PV~~~~~~-~~~-~~Gd----f~~~~~~~~~~~~--~~~~~ 314 (352) T protein:vir:78 244 RDNATIYMRYADYVKIISVLSNGTTNFFDTPAEK-VFGKPVVFTDAA-VKP-IVGD----FNYFGINYDGTTY--DTDKD 314 (352) T ss_pred hcCCEEEEehHHHHHHHHHHhccCCcccccCCcc-ccccceEEecCC-Cce-eEee----hhhhhhhhhhhee--eeecc Confidence 3333466777666666555544444444443332 235555322111 111 1111 111111 11100 0000 Q ss_pred --cceEEEeecccccCcccccccceeeEEEEEEEcCCCCcccc Q lcl|NC_018863. 296 --PASVVATVKVNDKGAFRPVKDIKTHSYKVVVHSDDAESLAS 336 (479) Q Consensus 296 --P~~vta~~~~~~~g~~~~~sd~g~Y~YkV~a~n~~GES~~S 336 (479) .-.+.-....-..|.... .-..++.-+.....+.|+ T Consensus 315 ~~~g~~~f~~~~r~Dg~~~~-----~eA~~~l~~~a~~~~~~~ 352 (352) T protein:vir:78 315 VKKGEYLFVLTAWYDQQRTL-----DSAFRIAKAKESTGSLPS 352 (352) T ss_pred ccCCeeEEEEEeeeCceeec-----hhheEEEEeecccCCCCC Confidence 000000000000010000 001233333333344444 No 131 >protein:vir:94494 Length: 274 # NCBI annotation: ORF015 # Family: family:all:522 # MgeID: mge:1508 # MgeName: 88 # Cross-refs: genbank:acc:YP_240676;genbank:gi:66396348;genbank:GeneID:5133758 Probab=26.32 E-value=2 Score=18.98 Aligned_cols=262 Identities=10% Similarity=0.054 Sum_probs=112.4 Q ss_pred ccCc-----cccchhhhHHHHHHHhhccccccchhhhccc-----hhHHHHHHhhhhhccCcccccccccccccccccCc Q lcl|NC_018863. 42 QHDA-----AALRRELLDDQVKMLAFTNGDFTIYPLINKQ-----QVNSTVAKYAVFNQHGRTGHSRFVREVGVASINDP 111 (479) Q Consensus 42 ~~~g-----aAlr~esld~~i~~l~~~~~~f~~~~~i~k~-----~~~stv~~y~~~~~~G~~g~~~fv~E~g~~~~~d~ 111 (479) |.++ .-+.+|-+.+.+ ++...+.+.|-.-.... +.=.||+ +..|+..|...-+.|+........ T Consensus 1 ma~~~T~~~d~iiPev~~~~v--~~~~~~~l~~~~~~~~d~~l~g~~G~tv~----iP~~~~~g~a~~~~~g~~i~~~~l 74 (274) T protein:vir:94 1 MPQGLTKTSDQIIPEVLAPMM--QAQLEKKLRFASFAEVDSTLQGQPGDTLT----FPAFVYSGDAQVVAEGEKIPTDIL 74 (274) T ss_pred CCccceehhheechHHHHHHH--HHhhhhhhhhcccceecccccCCCCCEEE----EeeecCCCccccccCCCccccccc Confidence 3332 234455555554 22222222221111110 0012221 223444455545567777788888 Q ss_pred ceEEEEEEEEeeeehhhhhhhHhhhcchhhHHHHHHHHHHHHHHHHHHHHHhhcccccCCCCCCcccchhhhHHHhhccC Q lcl|NC_018863. 112 NIRQKTVQMKFLSDTKQQSLAAGLVNNIADPMTILTEDAISVIAKSIEWAIFYGDAALAAEADNQAGIEFDGLTKLIDEA 191 (479) Q Consensus 112 ~~~r~~~~~k~l~~~~~vs~~~~lv~~~~dp~~~~~~~ai~~~~~~~e~a~f~Gd~~l~~~~~~~~gleFDGl~~~I~~~ 191 (479) +......+++..+-.+.+++...++ +..||+....+..-..++..++..++ +.+..+ T Consensus 75 t~~~~~~~i~~~~~~~~i~D~~~~~-~~~dp~~~~~~~~a~a~a~~vd~~~~----------------------~~l~~a 131 (274) T protein:vir:94 75 ETKKREAKIRKIAKGTSITDEALLS-GYGDPQGEQVRQHGLAHANKVDNDVL----------------------EALMGA 131 (274) T ss_pred ccceeEEEeeeecceecccHHHHHh-ccchHHHHHHHHHHHHHHHHHHHHHH----------------------HHHhcc Confidence 8888999999988899999987655 56789877777776777777766554 111111 Q ss_pred CcEEEccCCCCCHHHhhhhhheeecccCceeeeecChHHHhhHHHhhcCceeEEeecCCCccc-cCccccceecCceeEE Q lcl|NC_018863. 192 TNVIDLKGERLDEATLNKAAVIVGKGYGRATDAFMPIGVQADFTNNLLDRQRVIQPSQAGGFS-TGFSINQFLSTRGAIN 270 (479) Q Consensus 192 ~NviDarG~~l~~~~l~~aa~~i~~~fG~atd~~mp~~vka~f~q~~~~~qrv~~~~n~g~~~-~G~~V~~~~ss~g~I~ 270 (479) ... ..+..++.+.+.+|....+..-...+-++||+.+.+.+...-+ -+++..++.|+.. ..-.+..+.+.+--++ T Consensus 132 ~~~--~~~~~~~~d~i~dA~~~l~d~~~~~~~ivv~p~~~~~L~k~~~--~~f~~~s~~g~~~~~~G~ig~~~G~~Vi~s 207 (274) T protein:vir:94 132 KLT--VNADITKLNGLQSAIDKFNDEDLEPMVLFVNPLDAGKLRGDAS--TNFTRATELGDDIIVKGAFGEALGAIIVRT 207 (274) T ss_pred Ccc--ccccccCHHHHHHHHHHhhccCCCceEEEeCHHHHHHHHhhhh--hhccccCcccccceeccccceecCeeEEEc Confidence 111 1233445556666555544444456779999999988865322 1233222222110 0001111111100000 Q ss_pred ecCCcccCCCccccCc-ccCCCCCcccceEEEeecccccCcccccccceeeEEEEEEEcCCCCcccccceeeeeecCCCe Q lcl|NC_018863. 271 LHGSTIMENDNILVDR-IPEPNAPQAPASVVATVKVNDKGAFRPVKDIKTHSYKVVVHSDDAESLASEAVTAVVANPTDS 349 (479) Q Consensus 271 L~~s~v~~a~~~lver-~~s~~aP~~P~~vta~~~~~~~g~~~~~sd~g~Y~YkV~a~n~~GES~~S~~VtaT~a~~~~~ 349 (479) +.+-..+.++..+ ...- .-..+..+ + +.-.-..+...=.+.+.|.|...+.. +. T Consensus 208 ---~~~p~~t~~l~~~gA~~~-~~~~~~~v--E--~~Rd~~~~~d~i~~~~~y~~~~~~~~-----------------~v 262 (274) T protein:vir:94 208 ---NKLEAGTAILAKKGAVKL-ILKRDFFL--E--VARDASTKTTALYSDKHYVAYLYDES-----------------KA 262 (274) T ss_pred ---CCCCcceEEEEeCcceEe-eecCCcee--c--cccchhhcccEEEEEEEEEEEEEcCC-----------------ce Confidence 0011111111110 0000 00000000 1 11111111111123345555554432 22 Q ss_pred EEEEEeecCCcc Q lcl|NC_018863. 350 VSLAVKLQSLYQ 361 (479) Q Consensus 350 V~LtIt~~~~~~ 361 (479) |.++-+-+++.. T Consensus 263 v~~t~~~~~~~~ 274 (274) T protein:vir:94 263 VKITKGSGSLEM 274 (274) T ss_pred EEEecCcccccC Confidence 222222111111 No 132 >protein:vir:97433 Length: 274 # NCBI annotation: ORF014 # Family: family:all:522 # MgeID: mge:1676 # MgeName: 92 # Cross-refs: genbank:acc:YP_240749;genbank:gi:66396420;genbank:GeneID:5133789 Probab=26.32 E-value=2 Score=18.98 Aligned_cols=262 Identities=10% Similarity=0.054 Sum_probs=112.4 Q ss_pred ccCc-----cccchhhhHHHHHHHhhccccccchhhhccc-----hhHHHHHHhhhhhccCcccccccccccccccccCc Q lcl|NC_018863. 42 QHDA-----AALRRELLDDQVKMLAFTNGDFTIYPLINKQ-----QVNSTVAKYAVFNQHGRTGHSRFVREVGVASINDP 111 (479) Q Consensus 42 ~~~g-----aAlr~esld~~i~~l~~~~~~f~~~~~i~k~-----~~~stv~~y~~~~~~G~~g~~~fv~E~g~~~~~d~ 111 (479) |.++ .-+.+|-+.+.+ ++...+.+.|-.-.... +.=.||+ +..|+..|...-+.|+........ T Consensus 1 ma~~~T~~~d~iiPev~~~~v--~~~~~~~l~~~~~~~~d~~l~g~~G~tv~----iP~~~~~g~a~~~~~g~~i~~~~l 74 (274) T protein:vir:97 1 MPQGLTKTSDQIIPEVLAPMM--QAQLEKKLRFASFAEVDSTLQGQPGDTLT----FPAFVYSGDAQVVAEGEKIPTDIL 74 (274) T ss_pred CCccceehhheechHHHHHHH--HHhhhhhhhhcccceecccccCCCCCEEE----EeeecCCCccccccCCCccccccc Confidence 3332 234455555554 22222222221111110 0012221 223444455545567777788888 Q ss_pred ceEEEEEEEEeeeehhhhhhhHhhhcchhhHHHHHHHHHHHHHHHHHHHHHhhcccccCCCCCCcccchhhhHHHhhccC Q lcl|NC_018863. 112 NIRQKTVQMKFLSDTKQQSLAAGLVNNIADPMTILTEDAISVIAKSIEWAIFYGDAALAAEADNQAGIEFDGLTKLIDEA 191 (479) Q Consensus 112 ~~~r~~~~~k~l~~~~~vs~~~~lv~~~~dp~~~~~~~ai~~~~~~~e~a~f~Gd~~l~~~~~~~~gleFDGl~~~I~~~ 191 (479) +......+++..+-.+.+++...++ +..||+....+..-..++..++..++ +.+..+ T Consensus 75 t~~~~~~~i~~~~~~~~i~D~~~~~-~~~dp~~~~~~~~a~a~a~~vd~~~~----------------------~~l~~a 131 (274) T protein:vir:97 75 ETKKREAKIRKIAKGTSITDEALLS-GYGDPQGEQVRQHGLAHANKVDNDVL----------------------EALMGA 131 (274) T ss_pred ccceeEEEeeeecceecccHHHHHh-ccchHHHHHHHHHHHHHHHHHHHHHH----------------------HHHhcc Confidence 8888999999988899999987655 56789877777776777777766554 111111 Q ss_pred CcEEEccCCCCCHHHhhhhhheeecccCceeeeecChHHHhhHHHhhcCceeEEeecCCCccc-cCccccceecCceeEE Q lcl|NC_018863. 192 TNVIDLKGERLDEATLNKAAVIVGKGYGRATDAFMPIGVQADFTNNLLDRQRVIQPSQAGGFS-TGFSINQFLSTRGAIN 270 (479) Q Consensus 192 ~NviDarG~~l~~~~l~~aa~~i~~~fG~atd~~mp~~vka~f~q~~~~~qrv~~~~n~g~~~-~G~~V~~~~ss~g~I~ 270 (479) ... ..+..++.+.+.+|....+..-...+-++||+.+.+.+...-+ -+++..++.|+.. ..-.+..+.+.+--++ T Consensus 132 ~~~--~~~~~~~~d~i~dA~~~l~d~~~~~~~ivv~p~~~~~L~k~~~--~~f~~~s~~g~~~~~~G~ig~~~G~~Vi~s 207 (274) T protein:vir:97 132 KLT--VNADITKLNGLQSAIDKFNDEDLEPMVLFVNPLDAGKLRGDAS--TNFTRATELGDDIIVKGAFGEALGAIIVRT 207 (274) T ss_pred Ccc--ccccccCHHHHHHHHHHhhccCCCceEEEeCHHHHHHHHhhhh--hhccccCcccccceeccccceecCeeEEEc Confidence 111 1233445556666555544444456779999999988865322 1233222222110 0001111111100000 Q ss_pred ecCCcccCCCccccCc-ccCCCCCcccceEEEeecccccCcccccccceeeEEEEEEEcCCCCcccccceeeeeecCCCe Q lcl|NC_018863. 271 LHGSTIMENDNILVDR-IPEPNAPQAPASVVATVKVNDKGAFRPVKDIKTHSYKVVVHSDDAESLASEAVTAVVANPTDS 349 (479) Q Consensus 271 L~~s~v~~a~~~lver-~~s~~aP~~P~~vta~~~~~~~g~~~~~sd~g~Y~YkV~a~n~~GES~~S~~VtaT~a~~~~~ 349 (479) +.+-..+.++..+ ...- .-..+..+ + +.-.-..+...=.+.+.|.|...+.. +. T Consensus 208 ---~~~p~~t~~l~~~gA~~~-~~~~~~~v--E--~~Rd~~~~~d~i~~~~~y~~~~~~~~-----------------~v 262 (274) T protein:vir:97 208 ---NKLEAGTAILAKKGAVKL-ILKRDFFL--E--VARDASTKTTALYSDKHYVAYLYDES-----------------KA 262 (274) T ss_pred ---CCCCcceEEEEeCcceEe-eecCCcee--c--cccchhhcccEEEEEEEEEEEEEcCC-----------------ce Confidence 0011111111110 0000 00000000 1 11111111111123345555554432 22 Q ss_pred EEEEEeecCCcc Q lcl|NC_018863. 350 VSLAVKLQSLYQ 361 (479) Q Consensus 350 V~LtIt~~~~~~ 361 (479) |.++-+-+++.. T Consensus 263 v~~t~~~~~~~~ 274 (274) T protein:vir:97 263 VKITKGSGSLEM 274 (274) T ss_pred EEEecCcccccC Confidence 222222111111 No 133 >protein:vir:105334 Length: 276 # NCBI annotation: putative phage major capsid protein # Family: family:all:522 # MgeID: mge:1679 # MgeName: PH15 # Cross-refs: genbank:acc:YP_950669;genbank:gi:119967839;genbank:GeneID:4643213 Probab=22.64 E-value=2.4 Score=18.48 Aligned_cols=260 Identities=11% Similarity=0.098 Sum_probs=111.2 Q ss_pred hhcCcccCcccc-cCccccchhhhHHHHHHHhhccccccchhhhcc-----chhHHHHHHhhhhhccCcccccccccccc Q lcl|NC_018863. 31 FTTGTGITPDTQ-HDAAALRRELLDDQVKMLAFTNGDFTIYPLINK-----QQVNSTVAKYAVFNQHGRTGHSRFVREVG 104 (479) Q Consensus 31 f~ag~~~~~~~~-~~gaAlr~esld~~i~~l~~~~~~f~~~~~i~k-----~~~~stv~~y~~~~~~G~~g~~~fv~E~g 104 (479) |-.+ + +-..-+-+|-+.+.+.+-. .+-..|-+-... .+.=.||+ +..|+..|...-+.|++ T Consensus 1 Ma~~-------~T~l~d~i~Pev~~~~v~~~~--~~~~~~~~~~~~~~~l~g~~G~ti~----iP~~~~igda~~~~eg~ 67 (276) T protein:vir:10 1 MAQG-------TTTKSTQIVPEVLAPMMQAEL--DKKLRFAQFADIDSTLVGQPGDTLT----FPAFVYSGDATVVPEGQ 67 (276) T ss_pred CCcc-------eeehhhhhchHHHHHHHHHHH--HhhhhhcccceecccccCCCCCEEE----eeeecCCCccccccCCC Confidence 1110 1 1122234555555543222 111111111110 00111221 22344455555678888 Q ss_pred cccccCcceEEEEEEEEeeeehhhhhhhHhhhcchhhHHHHHHHHHHHHHHHHHHHHHhhcccccCCCCCCcccchhhhH Q lcl|NC_018863. 105 VASINDPNIRQKTVQMKFLSDTKQQSLAAGLVNNIADPMTILTEDAISVIAKSIEWAIFYGDAALAAEADNQAGIEFDGL 184 (479) Q Consensus 105 ~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lv~~~~dp~~~~~~~ai~~~~~~~e~a~f~Gd~~l~~~~~~~~gleFDGl 184 (479) ......-+......+++..+-...+++.+.++. .+||+....+..-..++..+...++ T Consensus 68 ~i~~~~lt~~~~~a~i~~~~k~~~~tD~a~~~~-~~dp~~~~~~~~~~~~a~~~d~~~~--------------------- 125 (276) T protein:vir:10 68 KIPVDKIETNRREAKIHKIGKGTDITDEALLSG-YGDPQGEAVRQHGLAIANKVDNDVL--------------------- 125 (276) T ss_pred ccCccccccceeeEEeehccccccccHHHHHhh-ccchHHHHHHHHHHHHHHHHHHHHH--------------------- Confidence 888888888999999999998899998886654 5688866555544445554443322 Q ss_pred HHhhccCCcEEEccCCCCCHHHhhhhhheeecccCceeeeecChHHHhhHHHhhcCceeEEeecCCCccc-cCcccccee Q lcl|NC_018863. 185 TKLIDEATNVIDLKGERLDEATLNKAAVIVGKGYGRATDAFMPIGVQADFTNNLLDRQRVIQPSQAGGFS-TGFSINQFL 263 (479) Q Consensus 185 ~~~I~~~~NviDarG~~l~~~~l~~aa~~i~~~fG~atd~~mp~~vka~f~q~~~~~qrv~~~~n~g~~~-~G~~V~~~~ 263 (479) +.++.....+ -+..++.+.|..|....+..-...+-+.||+.+.+.+....... ++..++.|+.- ..-.+..+. T Consensus 126 -~~l~~~~~~~--~~~~~t~d~i~~A~~~lgd~~~~~~~ivv~p~~~~~L~k~~~~~--f~~~s~~g~~~~~~G~ig~~~ 200 (276) T protein:vir:10 126 -EALRGTKLTV--SADIGTLAGLEAAIDTFDDEDLEPMVLFINPKDAGKLRSSASDN--FTRATELGDNIIVKGAFGEAL 200 (276) T ss_pred -HHHhcccccc--cccccCHHHHHHHHHHhccccCcccEEEEcHHHHHHHHHhcccc--ccccccccccceeccccceec Confidence 1222111111 23345566666666555444335667999999999886532221 22222222110 000011111 Q ss_pred cCceeEEecCCcccCCCccccC-cccCCCCCcccceEEEeecccccCcccccccceeeEEEEEEEcCC--------CCcc Q lcl|NC_018863. 264 STRGAINLHGSTIMENDNILVD-RIPEPNAPQAPASVVATVKVNDKGAFRPVKDIKTHSYKVVVHSDD--------AESL 334 (479) Q Consensus 264 ss~g~I~L~~s~v~~a~~~lve-r~~s~~aP~~P~~vta~~~~~~~g~~~~~sd~g~Y~YkV~a~n~~--------GES~ 334 (479) +.+ -..-+.+-..+.|+.. .+..- .-..+.. .+ +.-.-..+...-.+.+.|.|...+.. +-|. T Consensus 201 G~~---Vi~s~~~p~~t~~l~~~gAi~~-~~~~~~~--vE--~dRd~~~~~d~i~~~~~y~~~~~~~~~vv~~t~~~~~~ 272 (276) T protein:vir:10 201 GAV---IVRSKKLDEGEAILAKRGAVKL-ITKRDFF--LE--TDRDPSTKTTALYSDKHYVAYLYDESKAVKVTKGAGTT 272 (276) T ss_pred cee---EEEcCCCCcceEEEEeccceee-eecCCce--ee--cccchhhcccEEEEeeEEEEEEEcCcceEEEecCCcCC Confidence 111 0111111112222221 01100 0001111 11 11111112222224456666665542 3333 Q ss_pred cccc Q lcl|NC_018863. 335 ASEA 338 (479) Q Consensus 335 ~S~~ 338 (479) |+.+ T Consensus 273 ~~~~ 276 (276) T protein:vir:10 273 DSGA 276 (276) T ss_pred cCCC Confidence 3332 No 134 >protein:vir:3613 Length: 272 # NCBI annotation: MHP # Family: family:all:522 # MgeID: mge:74 # MgeName: TP901-1 # Cross-refs: genbank:acc:NP_112699;genbank:gi:13786567;genbank:GeneID:921035 Probab=22.45 E-value=2.4 Score=18.45 Aligned_cols=249 Identities=14% Similarity=0.133 Sum_probs=106.3 Q ss_pred hhcCcccCcccccC-ccccchhhhHHHHHHHhhccccccchh------hhccchhHHHHHHhhhhhccCccccccccccc Q lcl|NC_018863. 31 FTTGTGITPDTQHD-AAALRRELLDDQVKMLAFTNGDFTIYP------LINKQQVNSTVAKYAVFNQHGRTGHSRFVREV 103 (479) Q Consensus 31 f~ag~~~~~~~~~~-gaAlr~esld~~i~~l~~~~~~f~~~~------~i~k~~~~stv~~y~~~~~~G~~g~~~fv~E~ 103 (479) |.. +++- +.-+.+|-+.+.+ +....+...|-. ++..+ .=+||+ +..|+..|..-.+.|+ T Consensus 1 ma~-------~~T~~~d~iiPev~~~~v--~~~~~~~~~~~~~~~~~~~l~g~-~G~ti~----iP~~~~~gda~~~~eg 66 (272) T protein:vir:36 1 MSK-------QKTTLADLVNPEVLAPIV--SYELNKALRFAPLAQVDTTLQGQ-PGNTLK----FPAFTYIGDAADVAEG 66 (272) T ss_pred CCC-------cceehhhhhchHHHHHHH--HHHHHhhhhhccccccccccccC-CCCEEE----EeeeccCccccccCCC Confidence 111 0111 1122233333332 111111111111 11111 112221 3355566666678889 Q ss_pred ccccccCcceEEEEEEEEeeeehhhhhhhHhhhcchhhHHHHHHHHHHHHHHHHHHHHHhhcccccCCCCCCcccchhhh Q lcl|NC_018863. 104 GVASINDPNIRQKTVQMKFLSDTKQQSLAAGLVNNIADPMTILTEDAISVIAKSIEWAIFYGDAALAAEADNQAGIEFDG 183 (479) Q Consensus 104 g~~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lv~~~~dp~~~~~~~ai~~~~~~~e~a~f~Gd~~l~~~~~~~~gleFDG 183 (479) +.....+-+......++|..+-...+++.+.++ +..||+..-.+..-..++..++-.++=.=++ T Consensus 67 ~~i~~~~lt~~~~~~~i~~~~k~~~vtD~~~~~-~~~d~~~~~~~~~a~~~a~~~d~~i~~~l~~--------------- 130 (272) T protein:vir:36 67 GEISLDKIGTTTKSVTIKKAAKGTEITDEAALS-GYGDPIGESNKQLGLSLANKVDDDLLSAAKT--------------- 130 (272) T ss_pred CccChhhcCCcceeEeeehhhccccccHHHHhh-ccchHHHHHHHHHHHHHHHHHHHHHHHHhcc--------------- Confidence 988888888999999999999889999977665 6789998777777777777777655411111 Q ss_pred HHHhhccCCcEEEccCCCCCHHHhhhhhheeecccCceeeeecChHHHhhHHHhhcCceeEEeec-CCCc--cccCcccc Q lcl|NC_018863. 184 LTKLIDEATNVIDLKGERLDEATLNKAAVIVGKGYGRATDAFMPIGVQADFTNNLLDRQRVIQPS-QAGG--FSTGFSIN 260 (479) Q Consensus 184 l~~~I~~~~NviDarG~~l~~~~l~~aa~~i~~~fG~atd~~mp~~vka~f~q~~~~~qrv~~~~-n~g~--~~~G~~V~ 260 (479) ..+ +. ....+.+.|.+|....+..-...+-++||+.+.+.+...- ++.-.. ..++ ...| .|. T Consensus 131 -------~~~--~~-~~~~~~d~i~~A~~~lgd~~~~~~~ivv~p~~~~~L~k~~----~~~~~~~~~~~~~~~~G-~ig 195 (272) T protein:vir:36 131 -------TSQ--TV-STKANVDGVQAALDIFNDEDAQAYVLIVNPKDAAKIRKDA----NAKNIGSEVGANALING-TYA 195 (272) T ss_pred -------ccc--cc-cccccHHHHHHHHHHhhhcCCCceEEEEcHHHHHHHhccc----ccccccccccccceeee-ccc Confidence 000 00 1122344455544444444445667999999888764321 111111 1110 0011 011 Q ss_pred ceecCceeEEecCCcccCCCccccCcccCCCCCcc-------------cceEEEeecccccCcccccccceeeEEEEEEE Q lcl|NC_018863. 261 QFLSTRGAINLHGSTIMENDNILVDRIPEPNAPQA-------------PASVVATVKVNDKGAFRPVKDIKTHSYKVVVH 327 (479) Q Consensus 261 ~~~ss~g~I~L~~s~v~~a~~~lver~~s~~aP~~-------------P~~vta~~~~~~~g~~~~~sd~g~Y~YkV~a~ 327 (479) .+.+ -+.+..+..+..++-+. +..+..+ +.-.-..+...=.+.+.|.+... T Consensus 196 ~~~G--------------~~Vv~s~~~p~~~~~~~~~~~~~gA~~~~~~~~~~vE--~~R~~~~~~d~i~~~~~y~~~v~ 259 (272) T protein:vir:36 196 DVLG--------------AQIVRSKKLAEGSALMFKIVSNSPALKLVLKRGVQVE--TDRDIVTKTTVITADEHYAAYLY 259 (272) T ss_pred eecC--------------eeEEEeCCCCCCceeEEEEEecccceeeeecCCcccc--cccchhhcCcEEEEEEEEEEEEE Confidence 1111 11111111111111000 0000001 11111111111112344544444 Q ss_pred cCCCCcccccceeeeeecCCCeEEEEEeecCC Q lcl|NC_018863. 328 SDDAESLASEAVTAVVANPTDSVSLAVKLQSL 359 (479) Q Consensus 328 n~~GES~~S~~VtaT~a~~~~~V~LtIt~~~~ 359 (479) +..+ .|..|.++ . T Consensus 260 ~~~~------vv~~t~~g-------------~ 272 (272) T protein:vir:36 260 DLTK------VVNITFTG-------------V 272 (272) T ss_pred cCcc------EEEEeecC-------------C Confidence 4322 22233322 2 No 135 >protein:vir:78350 Length: 383 # NCBI annotation: Cps # Family: family:all:635 # MgeID: mge:1850 # MgeName: B025 # Cross-refs: genbank:acc:YP_001468644;genbank:gi:157325222;genbank:GeneID:5601696 Probab=21.65 E-value=2.6 Score=18.34 Aligned_cols=317 Identities=11% Similarity=0.055 Sum_probs=115.2 Q ss_pred Cc-ccccccceeeeecCchh---------HHHHHH---HHHHHhhcCcccCcccccCccccchhhhHHHHHHHhhccccc Q lcl|NC_018863. 1 MT-ELQKEQKVEARKLPAGA---------EAELAE---LVSKSFTTGTGITPDTQHDAAALRRELLDDQVKMLAFTNGDF 67 (479) Q Consensus 1 ~~-~~~~~~~~~~~~~~~~~---------~~~~~e---~~~Ksf~ag~~~~~~~~~~gaAlr~esld~~i~~l~~~~~~f 67 (479) +- ++.++.+ .+.+... ...+.. .+.+++.++ +.++|+.|-.+.+.+.|........ T Consensus 43 ~~~~~~~~~~---~~~~~~~~~~~~~~~g~~~lt~~e~~~~~~~~~~------~~~~gg~lvP~~~~~~I~~~l~~~s-- 111 (383) T protein:vir:78 43 MAADIMEQAK---KEARQEADAYISASRTDKNITNEEIKFFNDINKE------VGYKEETLLPQTVVDEIFEDLTTEH-- 111 (383) T ss_pred HHHHHHHHHH---HHHHHHHHHHHHhcCChhhhhHHHHHHHHHHhcc------CCCCCccccCHHHHHHHHHHHHhhc-- Confidence 00 0000000 0000000 000000 111223232 3457788888888887754443332 Q ss_pred cchhhhccchhHHHHHHhhhhhccCccccccccccccc-ccccCcceEEEEEEEEeeeehhhhhhhHhhhcchhhHHHHH Q lcl|NC_018863. 68 TIYPLINKQQVNSTVAKYAVFNQHGRTGHSRFVREVGV-ASINDPNIRQKTVQMKFLSDTKQQSLAAGLVNNIADPMTIL 146 (479) Q Consensus 68 ~~~~~i~k~~~~stv~~y~~~~~~G~~g~~~fv~E~g~-~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lv~~~~dp~~~~ 146 (479) .+++.+....+.+.+ + +....+.+.+.+++|.+. +...|+.+.+.....+=|+.-..+|..+ |.++..|.+... T Consensus 112 ~l~~~~~v~~~~~~~-~---i~~~~~~~~a~w~~e~~~~~~~~~~~f~~i~l~~~kl~~~i~is~el-l~Ds~~~ie~~i 186 (383) T protein:vir:78 112 PFLASIGMRTTGLRT-K---FLKSETSGVAVWGKIFGEIKGQLDATFSDEESIQNKLTAFVVVPKDL-EKFGPAWVKRFV 186 (383) T ss_pred cceeeeeeEecCCce-E---EEEEcCCcceEEeecccccccccCcceeeEeecceeeEeeccchHHH-hhccHHHHHHHH Confidence 345544444444332 2 344445556668899775 4678999999999999998776666554 456777899999 Q ss_pred HHHHHHHHHHHHHHHHhhcccccCCCCCCcccchhhhHHHhhccCCcEEEc---cC---CCCCHHHhhhhhheeecccCc Q lcl|NC_018863. 147 TEDAISVIAKSIEWAIFYGDAALAAEADNQAGIEFDGLTKLIDEATNVIDL---KG---ERLDEATLNKAAVIVGKGYGR 220 (479) Q Consensus 147 ~~~ai~~~~~~~e~a~f~Gd~~l~~~~~~~~gleFDGl~~~I~~~~NviDa---rG---~~l~~~~l~~aa~~i~~~fG~ 220 (479) .+.--.++++.++.+++.||-.- + --||++.+....++... .+ +.++.+.+......+...+-. T Consensus 187 ~~~l~~~~a~~~~~a~i~G~G~~----q------P~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~ 256 (383) T protein:vir:78 187 VTQIEEAFAVALESAYIVGDGND----K------PIGLNRKVGKGSTVVDGVYAEKAATGTLTFANPKTTVNELTDVYKY 256 (383) T ss_pred HHHHHHHHHHHHhhheEeccCCC----C------ceeeeeccCCcccccccccccccccchhhhhhhHHHHHHHHHHHhc Confidence 99999999999999999998631 1 23777766432222210 00 001000000000000000000 Q ss_pred eeeeecChHHHhhHHHhhcCceeEEeecCCCccccCccccceecCceeEEecCCcccCCCccccCcccCCCCCcccceEE Q lcl|NC_018863. 221 ATDAFMPIGVQADFTNNLLDRQRVIQPSQAGGFSTGFSINQFLSTRGAINLHGSTIMENDNILVDRIPEPNAPQAPASVV 300 (479) Q Consensus 221 atd~~mp~~vka~f~q~~~~~qrv~~~~n~g~~~~G~~V~~~~ss~g~I~L~~s~v~~a~~~lver~~s~~aP~~P~~vt 300 (479) +. ..|. ......++.-+.++.. .. -+++.. +....+.++.++.- -| -|..+. T Consensus 257 ~~-~~~~-----~~~~~~~~~~~~~~n~--~~---~~~~~~-----------~~~~~~~~G~~~t~-----l~-~~~~iv 308 (383) T protein:vir:78 257 HS-VKEN-----GHPLNVAGKVTLLVNP--TD---AWDVKK-----------QYTSLNANGVYVTA-----LP-FNLNII 308 (383) T ss_pred cc-hhcc-----cchhhhcCceEEEEcC--cc---hhhhcc-----------chhccCCCCceeee-----cC-CCceEE Confidence 00 0010 0000001111111111 00 000000 00000001100000 00 011000 Q ss_pred EeecccccCcccccccceeeE-EEEEEEcCCCCcccccceeeeeecCCCeEEEEEeecCCccccceEEEEEeccC---CC Q lcl|NC_018863. 301 ATVKVNDKGAFRPVKDIKTHS-YKVVVHSDDAESLASEAVTAVVANPTDSVSLAVKLQSLYQAKPQFISVYRQGN---ET 376 (479) Q Consensus 301 a~~~~~~~g~~~~~sd~g~Y~-YkV~a~n~~GES~~S~~VtaT~a~~~~~V~LtIt~~~~~~~~~~y~~IYR~t~---~~ 376 (479) .+.....++- .- |.++ |.+ +.+.|-+.... -..-...+. .-|+.+.|-.- +. T Consensus 309 -~s~~~p~~~i-if---gdfs~Y~i--~~r~~~~i~~~---~~~~f~~d~--------------~~f~~~~r~dG~~~~~ 364 (383) T protein:vir:78 309 -ESLFVPEKKA-IS---YVAERYDA--LIGGPLDIGTY---DQTLAIEDL--------------NLYAAKQFAYGKAKDD 364 (383) T ss_pred -ecCCCCcccE-EE---eeccceEE--EecccceEEec---chhhhhcCc--------------eEEEEEEEEcCEEecC Confidence 0000000110 11 1222 333 22323222110 000000000 01222223221 11 Q ss_pred CcEEEEEEeeeeeccCCCeeEEEe Q lcl|NC_018863. 377 GHYFLVARVPLSKADENGVITFVD 400 (479) Q Consensus 377 g~~~~i~rV~~s~~n~~~tttf~D 400 (479) ..| .+..+.+.. ..++.-. T Consensus 365 ~A~-~vl~~~~~~----~~~~~~~ 383 (383) T protein:vir:78 365 KAA-AVWTLNINP----AEQTPEG 383 (383) T ss_pred CeE-EEEEEEecC----CCCCCCC Confidence 111 111111110 0111111 No 136 >protein:vir:101291 Length: 381 # NCBI annotation: hypothetical protein # Family: family:all:635 # MgeID: mge:1591 # MgeName: phiNM3 # Cross-refs: genbank:acc:YP_908831;genbank:gi:118725095;genbank:GeneID:4555862 Probab=20.24 E-value=2.8 Score=18.13 Aligned_cols=306 Identities=13% Similarity=0.031 Sum_probs=119.9 Q ss_pred Cccccccccee--eeecCchh----HHHHHHHH-------------HHHhhcCcccCcccccCccccchhhhHHHHHHHh Q lcl|NC_018863. 1 MTELQKEQKVE--ARKLPAGA----EAELAELV-------------SKSFTTGTGITPDTQHDAAALRRELLDDQVKMLA 61 (479) Q Consensus 1 ~~~~~~~~~~~--~~~~~~~~----~~~~~e~~-------------~Ksf~ag~~~~~~~~~~gaAlr~esld~~i~~l~ 61 (479) -.+.+.+. +. ...+..+. .++..+++ .|.|++ ..-.+-++|+.|-++.+...|.... T Consensus 25 ~~~~~~~~-~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~lt~~e~~~~~~---~~~~~~~~gg~lvP~~~~~~I~~~l 100 (381) T protein:vir:10 25 PQERQNEL-YGDMINQLFEETKLQAKAEAERVSSLPKSAQSLSANQRSFFMD---INKNVNYKEEKLLPEETIDRIFEDL 100 (381) T ss_pred hhHHHHHH-HHHHHHhhhhhHHHHHHHHHHHHHHhccCcccccHHHHHHHHH---HhcccCCCCceecCHHHHHHHHHHH Confidence 00000000 00 00000000 00000111 111111 0112335678888888888886544 Q ss_pred hccccccchhhhccchhHHHHHHhhhhhccCccccccccccccc-ccccCcceEEEEEEEEeeeehhhhhhhHhhhcchh Q lcl|NC_018863. 62 FTNGDFTIYPLINKQQVNSTVAKYAVFNQHGRTGHSRFVREVGV-ASINDPNIRQKTVQMKFLSDTKQQSLAAGLVNNIA 140 (479) Q Consensus 62 ~~~~~f~~~~~i~k~~~~stv~~y~~~~~~G~~g~~~fv~E~g~-~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lv~~~~ 140 (479) .... .+++.+....+.+.+ .+....+.+...++.|.+. ++..++.+.+.....+=|+.--.+|..+ |.++.. T Consensus 101 ~~~s--~i~~~~~v~~~~~~~----~i~~~~~~~~a~w~~e~~~~~~~~~~~f~~i~l~~~kl~~~~~is~el-L~Ds~~ 173 (381) T protein:vir:10 101 TTNH--PLLADLGIKNAGLRL----KFLKSETSGVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDL-NDFGPA 173 (381) T ss_pred Hhhc--cceeheeeEecCcce----EEEEecCCcceeeecccccccccccccceeeeecceeEEeechhhHHH-hhcCHH Confidence 4443 234444333333221 3445555566778899775 4577999999999999998877777766 667788 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHhhcccccCCCCCCcccchhhhHHHhhccCCc--------------EEEccCCCCCHHH Q lcl|NC_018863. 141 DPMTILTEDAISVIAKSIEWAIFYGDAALAAEADNQAGIEFDGLTKLIDEATN--------------VIDLKGERLDEAT 206 (479) Q Consensus 141 dp~~~~~~~ai~~~~~~~e~a~f~Gd~~l~~~~~~~~gleFDGl~~~I~~~~N--------------viDarG~~l~~~~ 206 (479) |.+....+.--.++++.+|.+++.||-.-.| -||++.+....+ +.++....+ .+. T Consensus 174 ~ie~~i~~~la~~~a~~~~~a~i~G~G~~qP----------~Gil~~~~~~~~~~~g~~~~~~~~~t~t~~~~~~~-~~~ 242 (381) T protein:vir:10 174 WIERFVRVQIEEAFAVALETAFLKGTGKDQP----------IGLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRAT-VNE 242 (381) T ss_pred HHHHHHHHHHHHHHHHHhhheeEeccCCCCc----------eeeeeccCcccccccccccccccccccccccchhh-HHH Confidence 9999999999999999999999999864222 266665532111 111111110 111 Q ss_pred ----hhhhhheee---cc-cCceeeeecChHHHhhHHHhhcCceeEEeecCCCccc--cCccccceec---CceeEEecC Q lcl|NC_018863. 207 ----LNKAAVIVG---KG-YGRATDAFMPIGVQADFTNNLLDRQRVIQPSQAGGFS--TGFSINQFLS---TRGAINLHG 273 (479) Q Consensus 207 ----l~~aa~~i~---~~-fG~atd~~mp~~vka~f~q~~~~~qrv~~~~n~g~~~--~G~~V~~~~s---s~g~I~L~~ 273 (479) +...++... .. -|.+ -+.|.+.+...+..... .+. ..|.+. .|+.+.-+.+ ..|.|-+ | T Consensus 243 l~~~~~~~~~~~~~~~~~~~~~a-~~~mn~~t~~~l~~~~~-----~~~-~~G~~v~~l~~g~~vv~s~~~p~~~iif-g 314 (381) T protein:vir:10 243 LTQVFKYHSTNEKGKSVAVKGNV-TMVVNPSDAFEVQAQYT-----HLN-ANGVYVTALPFNLNVIESTVQEAGKVLT-Y 314 (381) T ss_pred HHHHHHhhccccccccccccCce-EEEEccccHHhhccccc-----cCC-CCCceeecCCCCceEEecCCCCcCcEEE-E Confidence 112221110 01 1111 12456665554421111 111 111111 1111100111 0111111 1 Q ss_pred C----cccCCCccccCcccCCCCCcccceEEEeecccccCcccccccceeeEEEEEEEcCCCCcccccceeeee Q lcl|NC_018863. 274 S----TIMENDNILVDRIPEPNAPQAPASVVATVKVNDKGAFRPVKDIKTHSYKVVVHSDDAESLASEAVTAVV 343 (479) Q Consensus 274 s----~v~~a~~~lver~~s~~aP~~P~~vta~~~~~~~g~~~~~sd~g~Y~YkV~a~n~~GES~~S~~VtaT~ 343 (479) + .+.+..+..+++..+. ....--+.--...=..|.-.. .--.+|.-+.--|-....+..--|. T Consensus 315 Dfs~Y~i~~r~~~~i~~~~~~--~~~~d~~~f~a~~r~dg~~~~-----~~A~~v~~l~~~~~~~~~~~~~~~~ 381 (381) T protein:vir:10 315 VKGLYDGYLAGGINVQKFKET--LALDDMDLYTAKQFAYGKAKD-----NKVAAVWKLDLKGHKPALEGTEETL 381 (381) T ss_pred ecccEEEEEecccEEEeechh--HhhcCCeEEEEEEEEcCEEec-----CceEEEEEEEecCCCcCcccccccC Confidence 0 1111111111111110 000000000000000010000 0112222122111111111111111 No 137 >protein:vir:9509 Length: 381 # NCBI annotation: hypothetical protein # Family: family:all:635 # MgeID: mge:170 # MgeName: phiN315 # Cross-refs: genbank:acc:NP_835556;genbank:gi:30043951;genbank:GeneID:1260537 Probab=20.24 E-value=2.8 Score=18.13 Aligned_cols=306 Identities=13% Similarity=0.031 Sum_probs=119.9 Q ss_pred Cccccccccee--eeecCchh----HHHHHHHH-------------HHHhhcCcccCcccccCccccchhhhHHHHHHHh Q lcl|NC_018863. 1 MTELQKEQKVE--ARKLPAGA----EAELAELV-------------SKSFTTGTGITPDTQHDAAALRRELLDDQVKMLA 61 (479) Q Consensus 1 ~~~~~~~~~~~--~~~~~~~~----~~~~~e~~-------------~Ksf~ag~~~~~~~~~~gaAlr~esld~~i~~l~ 61 (479) -.+.+.+. +. ...+..+. .++..+++ .|.|++ ..-.+-++|+.|-++.+...|.... T Consensus 25 ~~~~~~~~-~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~lt~~e~~~~~~---~~~~~~~~gg~lvP~~~~~~I~~~l 100 (381) T protein:vir:95 25 PQERQNEL-YGDMINQLFEETKLQAKAEAERVSSLPKSAQSLSANQRSFFMD---INKNVNYKEEKLLPEETIDRIFEDL 100 (381) T ss_pred hhHHHHHH-HHHHHHhhhhhHHHHHHHHHHHHHHhccCcccccHHHHHHHHH---HhcccCCCCceecCHHHHHHHHHHH Confidence 00000000 00 00000000 00000111 111111 0112335678888888888886544 Q ss_pred hccccccchhhhccchhHHHHHHhhhhhccCccccccccccccc-ccccCcceEEEEEEEEeeeehhhhhhhHhhhcchh Q lcl|NC_018863. 62 FTNGDFTIYPLINKQQVNSTVAKYAVFNQHGRTGHSRFVREVGV-ASINDPNIRQKTVQMKFLSDTKQQSLAAGLVNNIA 140 (479) Q Consensus 62 ~~~~~f~~~~~i~k~~~~stv~~y~~~~~~G~~g~~~fv~E~g~-~~~~d~~~~r~~~~~k~l~~~~~vs~~~~lv~~~~ 140 (479) .... .+++.+....+.+.+ .+....+.+...++.|.+. ++..++.+.+.....+=|+.--.+|..+ |.++.. T Consensus 101 ~~~s--~i~~~~~v~~~~~~~----~i~~~~~~~~a~w~~e~~~~~~~~~~~f~~i~l~~~kl~~~~~is~el-L~Ds~~ 173 (381) T protein:vir:95 101 TTNH--PLLADLGIKNAGLRL----KFLKSETSGVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDL-NDFGPA 173 (381) T ss_pred Hhhc--cceeheeeEecCcce----EEEEecCCcceeeecccccccccccccceeeeecceeEEeechhhHHH-hhcCHH Confidence 4443 234444333333221 3445555566778899775 4577999999999999998877777766 667788 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHhhcccccCCCCCCcccchhhhHHHhhccCCc--------------EEEccCCCCCHHH Q lcl|NC_018863. 141 DPMTILTEDAISVIAKSIEWAIFYGDAALAAEADNQAGIEFDGLTKLIDEATN--------------VIDLKGERLDEAT 206 (479) Q Consensus 141 dp~~~~~~~ai~~~~~~~e~a~f~Gd~~l~~~~~~~~gleFDGl~~~I~~~~N--------------viDarG~~l~~~~ 206 (479) |.+....+.--.++++.+|.+++.||-.-.| -||++.+....+ +.++....+ .+. T Consensus 174 ~ie~~i~~~la~~~a~~~~~a~i~G~G~~qP----------~Gil~~~~~~~~~~~g~~~~~~~~~t~t~~~~~~~-~~~ 242 (381) T protein:vir:95 174 WIERFVRVQIEEAFAVALETAFLKGTGKDQP----------IGLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRAT-VNE 242 (381) T ss_pred HHHHHHHHHHHHHHHHHhhheeEeccCCCCc----------eeeeeccCcccccccccccccccccccccccchhh-HHH Confidence 9999999999999999999999999864222 266665532111 111111110 111 Q ss_pred ----hhhhhheee---cc-cCceeeeecChHHHhhHHHhhcCceeEEeecCCCccc--cCccccceec---CceeEEecC Q lcl|NC_018863. 207 ----LNKAAVIVG---KG-YGRATDAFMPIGVQADFTNNLLDRQRVIQPSQAGGFS--TGFSINQFLS---TRGAINLHG 273 (479) Q Consensus 207 ----l~~aa~~i~---~~-fG~atd~~mp~~vka~f~q~~~~~qrv~~~~n~g~~~--~G~~V~~~~s---s~g~I~L~~ 273 (479) +...++... .. -|.+ -+.|.+.+...+..... .+. ..|.+. .|+.+.-+.+ ..|.|-+ | T Consensus 243 l~~~~~~~~~~~~~~~~~~~~~a-~~~mn~~t~~~l~~~~~-----~~~-~~G~~v~~l~~g~~vv~s~~~p~~~iif-g 314 (381) T protein:vir:95 243 LTQVFKYHSTNEKGKSVAVKGNV-TMVVNPSDAFEVQAQYT-----HLN-ANGVYVTALPFNLNVIESTVQEAGKVLT-Y 314 (381) T ss_pred HHHHHHhhccccccccccccCce-EEEEccccHHhhccccc-----cCC-CCCceeecCCCCceEEecCCCCcCcEEE-E Confidence 112221110 01 1111 12456665554421111 111 111111 1111100111 0111111 1 Q ss_pred C----cccCCCccccCcccCCCCCcccceEEEeecccccCcccccccceeeEEEEEEEcCCCCcccccceeeee Q lcl|NC_018863. 274 S----TIMENDNILVDRIPEPNAPQAPASVVATVKVNDKGAFRPVKDIKTHSYKVVVHSDDAESLASEAVTAVV 343 (479) Q Consensus 274 s----~v~~a~~~lver~~s~~aP~~P~~vta~~~~~~~g~~~~~sd~g~Y~YkV~a~n~~GES~~S~~VtaT~ 343 (479) + .+.+..+..+++..+. ....--+.--...=..|.-.. .--.+|.-+.--|-....+..--|. T Consensus 315 Dfs~Y~i~~r~~~~i~~~~~~--~~~~d~~~f~a~~r~dg~~~~-----~~A~~v~~l~~~~~~~~~~~~~~~~ 381 (381) T protein:vir:95 315 VKGLYDGYLAGGINVQKFKET--LALDDMDLYTAKQFAYGKAKD-----NKVAAVWKLDLKGHKPALEGTEETL 381 (381) T ss_pred ecccEEEEEecccEEEeechh--HhhcCCeEEEEEEEEcCEEec-----CceEEEEEEEecCCCcCcccccccC Confidence 0 1111111111111110 000000000000000010000 0112222122111111111111111 Done!