Query lcl|NC_019522.1_cdsid_YP_007006955.1 [gene=F396_gp46] [protein=putative major capsid protein] [protein_id=YP_007006955.1] [location=29502..30437] Match_columns 311 No_of_seqs 110 out of 163 Neff 8.0 Searched_HMMs 1612 Date Thu Nov 7 18:07:15 2013 Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_46 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_46_vs_rec_db.hhr No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 protein:vir:79642 Length: 329 100.0 6E-94 3.7E-97 531.7 32.5 302 1-311 23-326 (329) 2 protein:vir:104342 Length: 314 100.0 6.4E-93 4E-96 526.0 29.9 296 1-311 14-311 (314) 3 protein:vir:107687 Length: 319 100.0 4.1E-92 2.6E-95 521.6 31.8 300 1-311 18-319 (319) 4 protein:vir:80068 Length: 301 100.0 1.6E-91 9.8E-95 518.4 32.1 300 6-311 1-301 (301) 5 protein:vir:5255 Length: 304 # 100.0 5E-91 3.1E-94 515.6 29.9 293 11-310 1-304 (304) 6 protein:vir:103285 Length: 296 100.0 1.9E-90 1.2E-93 512.5 30.8 292 4-311 1-293 (296) 7 protein:vir:94070 Length: 339 100.0 2.3E-87 1.4E-90 495.6 28.6 296 1-311 32-339 (339) 8 protein:vir:78558 Length: 336 100.0 1.6E-84 9.9E-88 480.0 26.7 296 1-311 28-336 (336) 9 protein:vir:99576 Length: 388 100.0 3E-84 1.9E-87 478.5 26.9 303 1-311 62-388 (388) 10 protein:vir:106734 Length: 336 100.0 3.7E-84 2.3E-87 478.0 26.4 293 1-311 28-336 (336) 11 protein:vir:107732 Length: 379 100.0 2.5E-83 1.5E-86 473.4 29.4 299 1-311 51-379 (379) 12 protein:vir:101557 Length: 336 100.0 1.6E-83 9.8E-87 474.5 27.5 296 1-311 31-336 (336) 13 protein:vir:3643 Length: 336 # 100.0 1.6E-83 9.8E-87 474.5 27.0 296 1-311 31-336 (336) 14 protein:vir:96079 Length: 382 100.0 3.2E-80 2E-83 456.4 26.6 300 1-311 58-382 (382) 15 protein:vir:105778 Length: 358 99.9 9.3E-27 5.8E-30 163.3 13.8 299 1-311 30-357 (358) 16 protein:vir:1433 Length: 435 # 99.1 1.8E-11 1.1E-14 79.5 17.2 285 1-311 127-431 (435) 17 protein:vir:80376 Length: 435 99.1 2.7E-11 1.7E-14 78.5 17.8 286 1-311 127-431 (435) 18 protein:vir:7771 Length: 330 # 99.0 1.8E-10 1.1E-13 74.0 19.3 283 1-311 1-321 (330) 19 protein:vir:8420 Length: 477 # 99.0 1.4E-10 8.5E-14 74.7 16.9 295 1-311 152-469 (477) 20 protein:vir:5739 Length: 366 # 98.9 4.1E-10 2.5E-13 72.1 18.6 285 1-311 61-364 (366) 21 protein:vir:108211 Length: 318 98.9 1.5E-10 9.2E-14 74.5 15.2 277 1-311 1-315 (318) 22 protein:vir:9574 Length: 300 # 98.9 5E-10 3.1E-13 71.6 17.8 275 1-311 1-298 (300) 23 protein:vir:2504 Length: 305 # 98.8 2.6E-09 1.6E-12 67.6 19.0 278 6-311 1-296 (305) 24 protein:vir:94142 Length: 304 98.8 2.7E-09 1.6E-12 67.6 17.7 276 1-311 1-303 (304) 25 protein:vir:105905 Length: 304 98.8 2.7E-09 1.6E-12 67.6 17.7 276 1-311 1-303 (304) 26 protein:vir:95763 Length: 297 98.7 8E-09 4.9E-12 65.0 19.5 272 1-311 1-294 (297) 27 protein:vir:104085 Length: 320 98.7 6.6E-09 4.1E-12 65.4 18.6 280 1-311 2-315 (320) 28 protein:vir:94771 Length: 298 98.7 3.5E-09 2.2E-12 66.9 16.8 274 9-311 1-297 (298) 29 protein:vir:96392 Length: 324 98.7 7.2E-09 4.5E-12 65.2 18.0 273 1-311 19-313 (324) 30 protein:vir:78830 Length: 324 98.7 7.2E-09 4.5E-12 65.2 18.0 273 1-311 19-313 (324) 31 protein:vir:1638 Length: 298 # 98.7 5E-09 3.1E-12 66.1 16.9 274 1-311 1-297 (298) 32 protein:vir:94673 Length: 419 98.7 8.8E-09 5.4E-12 64.8 18.1 278 1-311 116-415 (419) 33 protein:vir:105038 Length: 428 98.7 7.7E-09 4.8E-12 65.1 17.5 287 1-311 113-426 (428) 34 protein:vir:103955 Length: 324 98.6 2.1E-08 1.3E-11 62.7 18.5 270 1-311 19-313 (324) 35 protein:vir:99749 Length: 324 98.6 2.3E-08 1.4E-11 62.5 18.6 268 1-311 19-313 (324) 36 protein:vir:8187 Length: 311 # 98.6 1.7E-08 1E-11 63.2 17.6 277 3-311 1-308 (311) 37 protein:vir:97148 Length: 324 98.6 3.2E-08 2E-11 61.6 18.4 273 1-311 19-313 (324) 38 protein:vir:7855 Length: 497 # 98.6 3.3E-08 2.1E-11 61.6 17.9 283 1-311 146-491 (497) 39 protein:vir:101650 Length: 497 98.6 3.3E-08 2.1E-11 61.6 17.9 283 1-311 146-491 (497) 40 protein:vir:9309 Length: 324 # 98.5 4.4E-08 2.8E-11 60.9 18.1 270 1-311 19-313 (324) 41 protein:vir:80684 Length: 315 98.5 3.9E-08 2.4E-11 61.2 17.8 280 6-311 1-304 (315) 42 protein:vir:41 Length: 299 # N 98.5 3.5E-08 2.2E-11 61.5 17.5 270 4-311 1-296 (299) 43 protein:vir:191 Length: 385 # 98.5 3.2E-08 2E-11 61.7 17.1 270 1-311 100-382 (385) 44 protein:vir:1886 Length: 385 # 98.5 3.2E-08 2E-11 61.7 17.1 270 1-311 100-382 (385) 45 protein:vir:96223 Length: 324 98.5 6E-08 3.7E-11 60.2 18.4 270 1-311 19-313 (324) 46 protein:vir:100135 Length: 418 98.5 6.3E-08 3.9E-11 60.1 18.5 272 1-311 131-413 (418) 47 protein:vir:99920 Length: 311 98.5 3.7E-08 2.3E-11 61.4 17.1 279 1-311 1-310 (311) 48 protein:vir:4456 Length: 401 # 98.5 8.7E-09 5.4E-12 64.8 13.1 289 1-311 102-399 (401) 49 protein:vir:104256 Length: 458 98.5 4E-08 2.5E-11 61.1 16.7 283 1-311 156-456 (458) 50 protein:vir:4339 Length: 395 # 98.5 8.9E-08 5.5E-11 59.2 18.5 275 1-311 109-393 (395) 51 protein:vir:485 Length: 407 # 98.5 2.6E-08 1.6E-11 62.1 15.5 289 1-311 101-398 (407) 52 protein:vir:100247 Length: 425 98.5 3.4E-08 2.1E-11 61.5 15.9 289 1-311 121-422 (425) 53 protein:vir:78223 Length: 333 98.5 5.3E-08 3.3E-11 60.5 17.0 286 1-311 1-330 (333) 54 protein:vir:9759 Length: 303 # 98.4 5.5E-08 3.4E-11 60.4 16.3 275 6-311 1-301 (303) 55 protein:vir:81227 Length: 413 98.4 1.5E-07 9.6E-11 57.9 18.2 273 1-311 113-408 (413) 56 protein:vir:8102 Length: 543 # 98.4 1.2E-07 7.7E-11 58.4 16.8 280 1-311 245-540 (543) 57 protein:vir:10364 Length: 390 98.4 2.4E-07 1.5E-10 56.8 18.3 270 1-311 109-390 (390) 58 protein:vir:4226 Length: 326 # 98.4 1.9E-07 1.2E-10 57.5 17.5 284 1-311 15-321 (326) 59 protein:vir:2430 Length: 318 # 98.3 2.3E-07 1.4E-10 57.0 16.7 279 1-311 1-311 (318) 60 protein:vir:78523 Length: 338 98.3 5E-07 3.1E-10 55.1 17.6 287 1-311 7-333 (338) 61 protein:vir:81070 Length: 390 98.2 6.6E-07 4.1E-10 54.5 17.5 270 1-311 109-390 (390) 62 protein:vir:6212 Length: 434 # 98.2 2.1E-07 1.3E-10 57.2 14.5 277 1-311 131-427 (434) 63 protein:vir:102119 Length: 404 98.2 4E-07 2.5E-10 55.6 15.7 281 1-311 105-398 (404) 64 protein:vir:1328 Length: 392 # 98.2 4.2E-07 2.6E-10 55.6 15.7 281 1-311 106-389 (392) 65 protein:vir:98339 Length: 415 98.2 1.3E-06 7.9E-10 52.9 18.1 277 1-311 117-402 (415) 66 protein:vir:81100 Length: 415 98.2 1.3E-06 7.9E-10 52.9 18.1 277 1-311 117-402 (415) 67 protein:vir:79987 Length: 415 98.2 1.3E-06 7.9E-10 52.9 18.1 277 1-311 117-402 (415) 68 protein:vir:6242 Length: 390 # 98.2 4.6E-07 2.9E-10 55.3 15.7 280 1-311 106-387 (390) 69 protein:vir:93616 Length: 645 98.1 1.2E-06 7.6E-10 53.0 17.7 275 1-311 332-637 (645) 70 protein:vir:97053 Length: 390 98.1 1E-06 6.3E-10 53.4 17.0 271 1-311 108-390 (390) 71 protein:vir:4700 Length: 415 # 98.1 1.2E-06 7.6E-10 53.0 17.0 279 1-311 114-402 (415) 72 protein:vir:4600 Length: 415 # 98.1 1.2E-06 7.6E-10 53.0 17.0 279 1-311 114-402 (415) 73 protein:vir:96762 Length: 632 98.1 6.5E-07 4E-10 54.5 15.4 274 1-311 352-631 (632) 74 protein:vir:2344 Length: 397 # 98.0 3E-06 1.8E-09 50.9 18.0 271 4-311 1-304 (397) 75 protein:vir:80930 Length: 278 98.0 5.7E-06 3.5E-09 49.3 18.3 268 1-311 1-275 (278) 76 protein:vir:9410 Length: 415 # 98.0 2.7E-06 1.7E-09 51.1 16.1 279 1-311 114-402 (415) 77 protein:vir:8843 Length: 317 # 97.9 9.5E-07 5.9E-10 53.6 12.8 286 1-311 1-313 (317) 78 protein:vir:3158 Length: 321 # 97.9 8.7E-06 5.4E-09 48.3 17.9 282 1-311 1-309 (321) 79 protein:vir:4159 Length: 315 # 97.9 8.5E-06 5.2E-09 48.4 17.8 287 1-311 14-315 (315) 80 protein:vir:102873 Length: 392 97.9 1.1E-05 7E-09 47.7 18.4 269 1-311 101-382 (392) 81 protein:vir:105004 Length: 392 97.9 1.1E-05 7E-09 47.7 18.4 269 1-311 101-382 (392) 82 protein:vir:102082 Length: 392 97.9 1.1E-05 7E-09 47.7 18.4 269 1-311 101-382 (392) 83 protein:vir:107593 Length: 392 97.9 1.1E-05 7E-09 47.7 18.4 269 1-311 101-382 (392) 84 protein:vir:4197 Length: 314 # 97.9 1.3E-05 7.8E-09 47.5 18.7 283 1-311 9-310 (314) 85 protein:vir:4092 Length: 390 # 97.9 7.8E-06 4.9E-09 48.6 17.4 281 1-311 79-368 (390) 86 protein:vir:97255 Length: 310 97.9 7.5E-06 4.7E-09 48.7 17.1 276 1-311 1-308 (310) 87 protein:vir:3991 Length: 404 # 97.8 4.9E-06 3.1E-09 49.7 15.7 268 1-311 111-391 (404) 88 protein:vir:80128 Length: 466 97.8 9E-06 5.6E-09 48.2 16.8 283 1-311 144-446 (466) 89 protein:vir:93742 Length: 274 97.8 1.8E-05 1.1E-08 46.6 18.4 262 1-311 1-268 (274) 90 protein:vir:9820 Length: 272 # 97.8 2E-05 1.2E-08 46.4 20.1 260 1-311 1-267 (272) 91 protein:vir:3033 Length: 272 # 97.8 2E-05 1.2E-08 46.4 20.1 260 1-311 1-267 (272) 92 protein:vir:1268 Length: 397 # 97.8 5E-06 3.1E-09 49.7 14.7 266 1-311 118-395 (397) 93 protein:vir:4953 Length: 397 # 97.7 9.3E-06 5.8E-09 48.2 15.5 265 1-311 104-383 (397) 94 protein:vir:4511 Length: 409 # 97.7 9E-06 5.6E-09 48.2 14.8 279 1-311 112-404 (409) 95 protein:vir:4856 Length: 293 # 97.7 2.3E-05 1.4E-08 46.0 17.0 265 1-311 2-279 (293) 96 protein:vir:96833 Length: 275 97.6 2.7E-05 1.7E-08 45.6 17.2 262 1-311 1-275 (275) 97 protein:vir:9509 Length: 381 # 97.6 2.6E-05 1.6E-08 45.8 16.4 284 1-311 60-365 (381) 98 protein:vir:101291 Length: 381 97.6 2.6E-05 1.6E-08 45.8 16.4 284 1-311 60-365 (381) 99 protein:vir:1025 Length: 408 # 97.6 2.6E-05 1.6E-08 45.7 16.3 267 1-311 111-391 (408) 100 protein:vir:4997 Length: 397 # 97.6 1.7E-05 1.1E-08 46.7 15.3 267 1-311 104-383 (397) 101 protein:vir:94933 Length: 330 97.5 2.3E-05 1.4E-08 46.0 15.4 281 1-311 20-327 (330) 102 protein:vir:3845 Length: 395 # 97.5 1.8E-05 1.1E-08 46.6 14.7 269 1-311 102-381 (395) 103 protein:vir:9643 Length: 377 # 97.5 6.3E-05 3.9E-08 43.6 18.4 286 1-311 74-377 (377) 104 protein:vir:93881 Length: 387 97.5 1.1E-05 6.6E-09 47.8 12.7 265 1-311 113-379 (387) 105 protein:vir:96123 Length: 274 97.4 7.3E-05 4.5E-08 43.3 19.6 261 1-311 1-268 (274) 106 protein:vir:7409 Length: 408 # 97.4 3.6E-05 2.2E-08 45.0 15.0 267 1-311 111-391 (408) 107 protein:vir:100172 Length: 394 97.4 7.2E-05 4.4E-08 43.3 16.6 264 1-311 106-382 (394) 108 protein:vir:3613 Length: 272 # 97.4 7.7E-05 4.8E-08 43.1 16.8 266 1-311 1-272 (272) 109 protein:vir:97433 Length: 274 97.4 8E-05 5E-08 43.0 18.3 261 1-311 1-268 (274) 110 protein:vir:94494 Length: 274 97.4 8E-05 5E-08 43.0 18.3 261 1-311 1-268 (274) 111 protein:vir:4830 Length: 397 # 97.4 4.6E-05 2.9E-08 44.4 15.1 268 1-311 104-385 (397) 112 protein:vir:95376 Length: 425 97.3 3.3E-05 2.1E-08 45.1 14.2 274 1-311 133-418 (425) 113 protein:vir:78640 Length: 352 97.3 2.2E-05 1.4E-08 46.1 12.8 265 1-311 78-344 (352) 114 protein:vir:81160 Length: 371 97.2 0.00012 7.5E-08 42.0 15.6 267 1-311 86-368 (371) 115 protein:vir:9361 Length: 402 # 97.1 3.9E-05 2.4E-08 44.7 12.7 264 1-311 128-396 (402) 116 protein:vir:96978 Length: 387 97.1 3.7E-05 2.3E-08 44.9 11.8 265 1-311 113-379 (387) 117 protein:vir:2685 Length: 387 # 97.1 3.7E-05 2.3E-08 44.9 11.8 265 1-311 113-379 (387) 118 protein:vir:94424 Length: 387 97.1 3.7E-05 2.3E-08 44.9 11.8 265 1-311 113-379 (387) 119 protein:vir:78350 Length: 383 97.0 0.00012 7.2E-08 42.1 14.4 283 1-311 64-372 (383) 120 protein:vir:3870 Length: 400 # 96.7 0.00023 1.4E-07 40.5 13.4 263 1-311 123-397 (400) 121 protein:vir:1383 Length: 421 # 96.7 0.00042 2.6E-07 39.1 14.8 265 1-311 109-392 (421) 122 protein:vir:100632 Length: 381 96.5 0.00056 3.5E-07 38.4 16.7 284 1-311 65-367 (381) 123 protein:vir:96262 Length: 274 96.4 0.00064 4E-07 38.1 17.5 257 1-311 1-268 (274) 124 protein:vir:95898 Length: 274 96.4 0.00064 4E-07 38.1 17.5 257 1-311 1-268 (274) 125 protein:vir:95963 Length: 395 96.4 0.00067 4.1E-07 38.0 16.6 284 1-311 71-373 (395) 126 protein:vir:101607 Length: 379 96.3 0.0008 5E-07 37.6 17.5 258 1-311 101-379 (379) 127 protein:vir:105334 Length: 276 96.2 0.00088 5.5E-07 37.3 17.6 262 1-311 1-268 (276) 128 protein:vir:100884 Length: 389 96.1 0.00096 5.9E-07 37.1 16.1 265 1-311 104-382 (389) 129 protein:vir:1239 Length: 274 # 96.1 0.001 6.2E-07 37.0 17.8 262 1-311 1-268 (274) 130 protein:vir:739 Length: 231 # 96.1 0.001 6.2E-07 37.0 14.3 224 45-311 1-231 (231) 131 protein:vir:98635 Length: 377 96.0 0.0011 7E-07 36.7 18.2 280 1-310 74-377 (377) 132 protein:vir:9704 Length: 394 # 95.9 0.0013 8.3E-07 36.3 14.5 261 1-311 123-388 (394) 133 protein:vir:962 Length: 397 # 95.8 0.001 6.5E-07 36.9 12.5 262 1-311 127-395 (397) 134 protein:vir:102655 Length: 322 95.3 0.0023 1.4E-06 35.0 16.3 281 1-311 7-319 (322) 135 protein:vir:95107 Length: 270 94.4 0.0046 2.9E-06 33.4 16.8 262 1-311 1-263 (270) 136 protein:vir:99888 Length: 309 94.2 0.005 3.1E-06 33.2 13.6 272 1-311 1-301 (309) 137 protein:vir:1084 Length: 437 # 94.0 0.0058 3.6E-06 32.9 14.0 265 1-311 148-425 (437) 138 protein:vir:1541 Length: 347 # 92.6 0.011 6.6E-06 31.4 15.1 292 1-311 1-345 (347) 139 protein:vir:3364 Length: 347 # 92.5 0.011 6.8E-06 31.3 14.6 290 1-311 1-345 (347) 140 protein:vir:107882 Length: 307 92.1 0.013 8E-06 30.9 15.6 274 3-311 1-305 (307) 141 protein:vir:79928 Length: 393 89.5 0.026 1.6E-05 29.3 13.8 286 1-311 59-376 (393) 142 protein:vir:8885 Length: 347 # 86.9 0.042 2.6E-05 28.1 15.5 291 1-311 1-345 (347) 143 protein:vir:102823 Length: 470 85.5 0.053 3.3E-05 27.6 9.8 272 1-311 9-305 (470) 144 protein:vir:10450 Length: 344 83.1 0.071 4.4E-05 26.9 13.5 292 1-311 1-342 (344) 145 protein:vir:94622 Length: 341 82.4 0.077 4.8E-05 26.7 16.6 279 1-311 1-337 (341) 146 protein:vir:79078 Length: 307 82.1 0.08 4.9E-05 26.6 15.8 271 3-311 1-300 (307) 147 protein:vir:80213 Length: 334 81.6 0.084 5.2E-05 26.5 14.7 292 1-311 1-332 (334) 148 protein:vir:97031 Length: 402 79.7 0.1 6.3E-05 26.0 13.3 283 1-311 1-342 (402) 149 protein:vir:78739 Length: 332 79.0 0.11 6.7E-05 25.9 14.1 286 1-311 1-332 (332) 150 protein:vir:97331 Length: 319 76.3 0.14 8.5E-05 25.3 14.9 277 1-311 1-292 (319) 151 protein:vir:94800 Length: 319 76.3 0.14 8.5E-05 25.3 14.9 277 1-311 1-292 (319) 152 protein:vir:105822 Length: 273 75.1 0.15 9.4E-05 25.1 16.0 262 1-311 1-271 (273) 153 protein:vir:102605 Length: 273 75.1 0.15 9.4E-05 25.1 16.0 262 1-311 1-271 (273) 154 protein:vir:2201 Length: 345 # 74.7 0.16 9.6E-05 25.0 14.5 288 1-311 1-344 (345) 155 protein:vir:6324 Length: 335 # 72.2 0.19 0.00012 24.6 15.1 287 1-311 1-328 (335) 156 protein:vir:105645 Length: 400 66.3 0.27 0.00017 23.7 13.5 284 1-311 1-331 (400) 157 protein:vir:99311 Length: 463 66.1 0.27 0.00017 23.7 10.4 260 1-311 27-299 (463) 158 protein:vir:95603 Length: 463 66.1 0.27 0.00017 23.7 10.4 260 1-311 27-299 (463) 159 protein:vir:94576 Length: 347 65.5 0.28 0.00018 23.6 14.1 293 1-311 1-347 (347) 160 protein:vir:100851 Length: 514 65.2 0.19 0.00012 24.6 6.4 271 1-311 46-334 (514) 161 protein:vir:94711 Length: 347 63.8 0.31 0.00019 23.4 13.3 283 1-311 1-344 (347) 162 protein:vir:99675 Length: 324 63.6 0.31 0.00019 23.3 12.1 250 44-311 1-301 (324) 163 protein:vir:97397 Length: 517 63.1 0.32 0.0002 23.3 13.0 265 1-311 203-512 (517) 164 protein:vir:107120 Length: 329 62.1 0.34 0.00021 23.2 13.7 273 1-311 12-303 (329) 165 protein:vir:103323 Length: 364 59.0 0.4 0.00025 22.8 17.0 290 1-311 1-337 (364) 166 protein:vir:7990 Length: 273 # 58.9 0.4 0.00025 22.7 18.3 262 1-311 1-271 (273) 167 protein:vir:78935 Length: 335 53.4 0.53 0.00033 22.1 15.9 282 1-311 1-328 (335) 168 protein:vir:96666 Length: 462 50.9 0.6 0.00037 21.8 11.4 265 1-311 27-310 (462) 169 protein:vir:4902 Length: 348 # 40.3 0.98 0.00061 20.6 16.0 284 1-311 1-345 (348) 170 protein:vir:2736 Length: 348 # 39.8 1 0.00062 20.6 16.4 281 1-311 1-345 (348) 171 protein:vir:98480 Length: 348 30.8 1.5 0.00096 19.5 20.1 285 6-311 1-347 (348) 172 protein:vir:103886 Length: 302 30.0 1.6 0.001 19.4 15.8 267 6-311 1-286 (302) 173 protein:vir:100057 Length: 375 28.3 1.8 0.0011 19.2 18.5 289 1-311 9-368 (375) 174 protein:vir:80835 Length: 464 26.7 1.9 0.0012 19.0 8.3 276 1-311 23-340 (464) 175 protein:vir:107826 Length: 331 25.6 2 0.0013 18.9 14.9 228 1-311 1-240 (331) 176 protein:vir:107388 Length: 331 25.6 2 0.0013 18.9 14.9 228 1-311 1-240 (331) 177 protein:vir:98525 Length: 331 25.6 2 0.0013 18.9 14.9 228 1-311 1-240 (331) 178 protein:vir:7019 Length: 401 # 25.2 2.1 0.0013 18.8 11.9 287 1-311 1-335 (401) 179 protein:vir:95318 Length: 328 22.4 2.4 0.0015 18.4 15.8 225 1-311 1-239 (328) No 1 >protein:vir:79642 Length: 329 # NCBI annotation: HsbB # Family: family:all:463 # MgeID: mge:1872 # MgeName: TLS # Cross-refs: genbank:acc:YP_001285525;genbank:gi:148734508;genbank:GeneID:5220000 Probab=100.00 E-value=6e-94 Score=531.66 Aligned_cols=302 Identities=19% Similarity=0.242 Sum_probs=282.2 Q ss_pred CCccccc-ccchhhhhhhHHHHHHHHHHHHhhhhhhhhhhhhccccCCCCcceeEEEEEEeecccceEEecCccccccee Q lcl|NC_019522. 1 MAKSVFD-VSPVSALSFLVNQAAHIESEIYRIEYPQFKYGTLLPLDNSAPDWAQAVMFRSIDARGELQLFGPNSTDVPTV 79 (311) Q Consensus 1 ~~~~~~~-~~~~~~~~fl~~~L~~id~~v~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~G~a~~~~~~a~dip~v 79 (311) .+++.++ .+.++.++|+++||++||++|||+++++++++++||+.++++||+++++|+++|.+|++++|+++++|+|++ T Consensus 23 ~~~~~~~~~~~~~~~~f~~~ql~~id~~v~e~~~~~l~~~~~i~i~~~~~~~~~~~t~~~~~~~G~a~~~~d~~~dip~v 102 (329) T protein:vir:79 23 HMQLRGAKNDASDMGIWTSQELHKIKAQAYEKEYPAGSALRVFPVTSELSDTDKTFEYQTFDKVGHAKIIADYTDDLSTV 102 (329) T ss_pred hcccccceeccchhhHHHHHHHHHHHHHHHhhhhcccchhhhcccccCCCCceeEEEeeeeecceeeeeecCccccccee Confidence 3343333 334566789999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eeeccceeEEEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhhheeeeeccccc-eeeeecCCcceeeccC Q lcl|NC_019522. 80 DIAMSQGFKDINTAALGYTYSIEEIGFAMLNNVNLDAERGQAVRDVVEQGLNKIYLLGDKGVG-EGLYTSPNVSVEAATS 158 (311) Q Consensus 80 ~~~~~~~~~~v~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aa~~~~~~~~n~~~~~G~~~~g-~GllN~p~v~~~~~~~ 158 (311) |++++++.+|++.|+.+|+|+++||++|+++|+||+++|+.+|++++++++|+++|+|++++| |||||+||+++.. T Consensus 103 d~~~~~~~~~i~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aA~~~~~~~~n~i~f~G~~~~g~~GLlN~p~v~~~~--- 179 (329) T protein:vir:79 103 DALMTSEFGKVFRLGNAFLISIDEIKAGQRTGKSLSTRKANAAQNAHDQLVNHLVFKGSKPHKIISVFEHPNLTTIN--- 179 (329) T ss_pred ecccceeEEEEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhccEEEeecccccceeeecCCCccccc--- Confidence 999999999999999999999999999999999999999999999999999999999999999 8999999997543 Q ss_pred CccccCcccccCCHHHHHHHHHHHHHHHHhccCCceecceEEEeCHHHHHHHhcccccCCCCCcchHHHHHHHhCCceEE Q lcl|NC_019522. 159 TFVALVAAIPTNGTQPIIDFFGNAYNTVYLDNTLTVHRPNTFVLPPAQFQLLARTLLSTQNASNVTLLQFLRTNFPDITF 238 (311) Q Consensus 159 ~~~~~~t~w~~~t~~ei~~di~~~~~~~~~~~~~~~~~p~~l~lpp~~~~~L~~~~~~~~~~~~~Tvl~~l~~n~~~l~i 238 (311) .++++++.|++||++||++||++++++++.++++ ++.|++|+|||++|.+|++++ +++++|+++||++|||+++| T Consensus 180 ~~~~~~~~w~~kt~~ei~~di~~~~~~l~~~s~g-~~~p~~L~Lpp~~~~~L~~~~----~~~~~tvl~~lk~~~~~l~I 254 (329) T protein:vir:79 180 SAGWNNAAGTGKKPETAQDELEQAIEKIETLTNG-QHRANMILIPPSMRKVLMVRM----PETTMSYLDYFKQQNGGITI 254 (329) T ss_pred cCCCCCccccccCHHHHHHHHHHHHHHHHHhcCc-eecccEEEecHHHHHHhhccc----CCCCccHHHHHHHhCCCcEE Confidence 3455667899999999999999999999998765 789999999999999998765 34579999999999999999 Q ss_pred EEchhcccCCCCcccEEEEEEcCcceeEEeecchhhhccceeeCCceEEEeeeeeeeeEEEECCeEEEEeecC Q lcl|NC_019522. 239 EDDILLKGAGVAGADRMAVYKKEIRIVKGHDVMPLRFLAPATADNVNFKVPAILRTGGTEWRIPKAGHYVDGV 311 (311) Q Consensus 239 ~~~~~l~~ag~~g~~~~v~y~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~gGv~i~~P~ai~~~dGI 311 (311) +++|||+++|.+|+|+|++|+++++++++++|||++++ |+|+++++|++||++|+|||+||||++|+|+||| T Consensus 255 ~~~~el~~ag~~g~~~~v~y~~~~~~~~~~vp~~~~~l-~~q~~~~~~~v~~~~r~~Gv~i~~P~ai~~~dGI 326 (329) T protein:vir:79 255 ESISELEDIDGAGTKAALVYEKDPMNMSIEIPEAFNML-TAQPKDLHFKVPCTSKCTGLTIYRPLTLVLIKGL 326 (329) T ss_pred EEcccccccCCCCceEEEEEecCCceEEEecCcceeee-eceecCceEEEceeeeEEEEEEECcceeeeeeee Confidence 99999999999999999999999999999999999999 5799999999999999999999999999999999 No 2 >protein:vir:104342 Length: 314 # NCBI annotation: hypothetical protein # Family: family:all:463 # MgeID: mge:1593 # MgeName: RTP # Cross-refs: genbank:acc:YP_398971;genbank:gi:81343955;genbank:GeneID:3778874 Probab=100.00 E-value=6.4e-93 Score=526.01 Aligned_cols=296 Identities=19% Similarity=0.226 Sum_probs=275.8 Q ss_pred CCccccc-ccchhhhhhhHHHHHHHHHHHHhhhhhhhhhhhhccccCCCCcceeEEEEEEeecccceEEecCccccccee Q lcl|NC_019522. 1 MAKSVFD-VSPVSALSFLVNQAAHIESEIYRIEYPQFKYGTLLPLDNSAPDWAQAVMFRSIDARGELQLFGPNSTDVPTV 79 (311) Q Consensus 1 ~~~~~~~-~~~~~~~~fl~~~L~~id~~v~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~G~a~~~~~~a~dip~v 79 (311) =..+.|+ .+.+++++|+.+||++||++|||+++++++++++||+.++++||+++++|+++|.+|++++|+++++|+|++ T Consensus 14 ~~~~~~~~~~~d~~~~fl~~ql~~id~~v~e~~~~~~~~~~~i~v~~~~~~~~et~~~~~~e~~G~a~~~~d~~~dip~v 93 (314) T protein:vir:10 14 THLEQMGVEKADAAGIWAVSQLTAALNRAYEKEYAENSVVNIFPVTNEIPGHAKYFEYPEFDGVGIAQIIADYSDDLPLV 93 (314) T ss_pred HHHHhhcccchhhhHHHHHHHHHHHHHHHhhhhccccccceeeccccCCCCceeEEEeeeeccccceeeeCCccccccee Confidence 1124555 334567899999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eeeccceeEEEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhhheeeeeccccc-eeeeecCCcceeeccC Q lcl|NC_019522. 80 DIAMSQGFKDINTAALGYTYSIEEIGFAMLNNVNLDAERGQAVRDVVEQGLNKIYLLGDKGVG-EGLYTSPNVSVEAATS 158 (311) Q Consensus 80 ~~~~~~~~~~v~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aa~~~~~~~~n~~~~~G~~~~g-~GllN~p~v~~~~~~~ 158 (311) |++++++++|++.|+.+|+|+++||++|++.|+||+++|+.+|++++++++|+++|+|++++| +||||+||++..++. T Consensus 94 d~~~~~~~~~i~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aA~~~~~~~~n~i~f~G~~~~g~~GLlN~p~v~~~~~~- 172 (314) T protein:vir:10 94 DAFMTEKQGKVFRFGNAFLISTDEIKAGAATGQSLSARKQALAFEAHDNLLDKLVWSGSAPHGIVSVFDQPNINNVVAT- 172 (314) T ss_pred ecccceeEEEEEEEEeeEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEeecccccceeEeecCCCccccCC- Confidence 999999999999999999999999999999999999999999999999999999999999999 899999999754332 Q ss_pred CccccCcccccCCHHHHHHHHHHHHHHHHhccCCceecceEEEeCHHHHHHHhcccccCCCCCcchHHHHHHHhCCceEE Q lcl|NC_019522. 159 TFVALVAAIPTNGTQPIIDFFGNAYNTVYLDNTLTVHRPNTFVLPPAQFQLLARTLLSTQNASNVTLLQFLRTNFPDITF 238 (311) Q Consensus 159 ~~~~~~t~w~~~t~~ei~~di~~~~~~~~~~~~~~~~~p~~l~lpp~~~~~L~~~~~~~~~~~~~Tvl~~l~~n~~~l~i 238 (311) +.| +|++||++||+++++++++++++ ++.|++|+|||+.|.+|+++. +++++|+++||++|||+++| T Consensus 173 ------~~W--aT~~ei~~Di~~~~~~l~~~s~g-~~~p~~l~Lpp~~~~~L~~~~----~~~~~tvl~~l~~n~~~l~I 239 (314) T protein:vir:10 173 ------PNW--SVPQNAIDDVTAMIDAVESSTQG-LHHVTDILLPASARRVMQGLV----PQTNLSYGELFTRNNPGLTI 239 (314) T ss_pred ------CCc--ccHHHHHHHHHHHHHHHHHhcCc-cccceeEEecHHHHHhhcccc----cCCCccHHHHHHHhCCCcEE Confidence 348 58999999999999999998765 789999999999999997653 35689999999999999999 Q ss_pred EEchhcccCCCCcccEEEEEEcCcceeEEeecchhhhccceeeCCceEEEeeeeeeeeEEEECCeEEEEeecC Q lcl|NC_019522. 239 EDDILLKGAGVAGADRMAVYKKEIRIVKGHDVMPLRFLAPATADNVNFKVPAILRTGGTEWRIPKAGHYVDGV 311 (311) Q Consensus 239 ~~~~~l~~ag~~g~~~~v~y~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~gGv~i~~P~ai~~~dGI 311 (311) +++|||+++|.+|++||++|+++++++++++|+|++++ |+|+++++|++||++|+|||+||||++|+|+||| T Consensus 240 ~~~~el~~ag~~g~~~~v~y~~~~~~~~~~vp~~~~~l-~~e~~~~~~~~~~~~r~~Gv~i~~P~ai~~~dGI 311 (314) T protein:vir:10 240 RFLQFLDNYDGAGGKAALAFEKSPLNMSIEIPEVTNVL-PAQPKDLHFRYPVTSKATGLIVYRPLTMAVIKGI 311 (314) T ss_pred EEcccccccCCCcceEEEEEecCCcEEEEecCccceee-cceecCceEEEcceeeeEEEEEECcceeEeeeee Confidence 99999999999999999999999999999999999998 5899999999999999999999999999999999 No 3 >protein:vir:107687 Length: 319 # NCBI annotation: hypothetical protein # Family: family:all:463 # MgeID: mge:1518 # MgeName: T1 # Cross-refs: genbank:acc:YP_003898;genbank:gi:45686314;genbank:GeneID:2773027 Probab=100.00 E-value=4.1e-92 Score=521.57 Aligned_cols=300 Identities=18% Similarity=0.259 Sum_probs=282.2 Q ss_pred CCcccccccchh-hhhhhHHHHHHHHHHHHhhhhhhhhhhhhccccCCCCcceeEEEEEEeecccceEEecCccccccee Q lcl|NC_019522. 1 MAKSVFDVSPVS-ALSFLVNQAAHIESEIYRIEYPQFKYGTLLPLDNSAPDWAQAVMFRSIDARGELQLFGPNSTDVPTV 79 (311) Q Consensus 1 ~~~~~~~~~~~~-~~~fl~~~L~~id~~v~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~G~a~~~~~~a~dip~v 79 (311) +++++|+.++.. .+.|+.+||++||++|||+++++++++++||+.++++||+++++|.++|.+|++++|+|+++|+|++ T Consensus 18 ~~~~~~~~da~~~~g~~~~~ql~~id~~v~e~~~~~l~~~~~i~v~~~~~~~~~~~~~~~~~~~G~a~~~~d~~~dip~v 97 (319) T protein:vir:10 18 LIQAGVKQDAAATMGIWTAQELHRIKSQSYEEDYPVGSALRVFPVTTELSPTDKTFEYMTFDKVGTAQIIADYTDDLPLV 97 (319) T ss_pred HhhccchhhhhhhhhhHHHHHHHHHHHHHHhhhhcceechhhcccccCCCCceEEEEeeeeccccceeeecCccccccce Confidence 666777777644 4568899999999999999999999999999999999999999999999999999999999999999 Q ss_pred eeeccceeEEEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhhheeeeeccccc-eeeeecCCcceeeccC Q lcl|NC_019522. 80 DIAMSQGFKDINTAALGYTYSIEEIGFAMLNNVNLDAERGQAVRDVVEQGLNKIYLLGDKGVG-EGLYTSPNVSVEAATS 158 (311) Q Consensus 80 ~~~~~~~~~~v~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aa~~~~~~~~n~~~~~G~~~~g-~GllN~p~v~~~~~~~ 158 (311) +++.+++.+|++.|+.+|+|+++||++|++.|+||+++|+.+|++++++++|+++|+|++++| +||||+||++..+++. T Consensus 98 ~~~~~~~~~~i~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aA~~~~~~~~n~i~f~G~~~~g~~GLlN~p~~~~~~~~~ 177 (319) T protein:vir:10 98 DALGTSEFGKVFRLGNAYLISIDEIKAGQATGRPLSTRKASACQLAHDQLVNRLVFKGSAPHKIVSVFNHPNITKITSGK 177 (319) T ss_pred eccceeeEEEEEEEEeeeeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEeecccccceeEEeCCCceeeecCC Confidence 999999999999999999999999999999999999999999999999999999999999999 8999999998876654 Q ss_pred CccccCcccccCCHHHHHHHHHHHHHHHHhccCCceecceEEEeCHHHHHHHhcccccCCCCCcchHHHHHHHhCCceEE Q lcl|NC_019522. 159 TFVALVAAIPTNGTQPIIDFFGNAYNTVYLDNTLTVHRPNTFVLPPAQFQLLARTLLSTQNASNVTLLQFLRTNFPDITF 238 (311) Q Consensus 159 ~~~~~~t~w~~~t~~ei~~di~~~~~~~~~~~~~~~~~p~~l~lpp~~~~~L~~~~~~~~~~~~~Tvl~~l~~n~~~l~i 238 (311) + +.|.+||++||++||+++++++++++++ ++.|++|+|||++|.+|++++ +++++|+++||++|||+++| T Consensus 178 ~-----~~~~t~t~~~i~~di~~~~~~l~~~s~g-~~~p~~L~L~p~~~~~L~~~~----~~~~~t~l~~lk~~~~~l~I 247 (319) T protein:vir:10 178 W-----IDVSTMKPETAEAELTQAIETIETITRG-QHRATNILIPPSMRKVLAIRM----PETTMSYLDYFKSQNSGIEI 247 (319) T ss_pred C-----CCccccCHHHHHHHHHHHHHHHHHhcCc-eeeceEEEecHHHHHhhhccc----CCCCeeHHHHHHHhcCCceE Confidence 3 2388999999999999999999988765 789999999999999998764 35689999999999999999 Q ss_pred EEchhcccCCCCcccEEEEEEcCcceeEEeecchhhhccceeeCCceEEEeeeeeeeeEEEECCeEEEEeecC Q lcl|NC_019522. 239 EDDILLKGAGVAGADRMAVYKKEIRIVKGHDVMPLRFLAPATADNVNFKVPAILRTGGTEWRIPKAGHYVDGV 311 (311) Q Consensus 239 ~~~~~l~~ag~~g~~~~v~y~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~gGv~i~~P~ai~~~dGI 311 (311) +++|||+++|++|+++|++|+++++++++++|+|++++ |+|+++++|++||++|+|||+||||+||+|+||| T Consensus 248 ~~~pel~~ag~~g~~~~v~y~~~~~~~~~~v~~~~~~~-~~e~~~l~~~~~~~~r~~Gv~i~~P~ai~~~dGI 319 (319) T protein:vir:10 248 DSIAELEDIDGAGTKGVLVYEKNPMNMSIEIPEAFNML-PAQPKDLHFKVPCTSKCTGLTIYRPMTIVLITGV 319 (319) T ss_pred EEeeeecccCCCcceEEEEEecCCceEEEecCcceeee-eeeecCceEEEeeeeeeEEEEEEccceeEeeecC Confidence 99999999999999999999999999999999999998 5799999999999999999999999999999999 No 4 >protein:vir:80068 Length: 301 # NCBI annotation: gp8 # Family: family:all:463 # MgeID: mge:1876 # MgeName: B054 # Cross-refs: genbank:acc:YP_001468712;genbank:gi:157325292;genbank:GeneID:5601759 Probab=100.00 E-value=1.6e-91 Score=518.38 Aligned_cols=300 Identities=22% Similarity=0.309 Sum_probs=288.2 Q ss_pred ccccchhhhhhhHHHHHHHHHHHHhhhhhhhhhhhhccccCCCCcceeEEEEEEeecccceEEecCcccccceeeeeccc Q lcl|NC_019522. 6 FDVSPVSALSFLVNQAAHIESEIYRIEYPQFKYGTLLPLDNSAPDWAQAVMFRSIDARGELQLFGPNSTDVPTVDIAMSQ 85 (311) Q Consensus 6 ~~~~~~~~~~fl~~~L~~id~~v~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~G~a~~~~~~a~dip~v~~~~~~ 85 (311) |. .++.++|+.+||++||++|+|++++++.+|+++|+.++++||+++++|+++|.+|++++|+++++|+|++++++++ T Consensus 1 ~~--~~~~g~f~~~~l~~id~~v~e~~~~~l~~r~l~~v~~~~~~~~~~~~~~~~~~~G~~~~~~~~~~dip~~~~~~~~ 78 (301) T protein:vir:80 1 MQ--GKITATIEARDLQAIDNVIYEPKQEELTARSVFPQKFDVNEGAESYSFDVMTRSGAAKIIANGADDLPLVDVDMVR 78 (301) T ss_pred CC--ccccchhhHHHHHHHHHHHHHhhhhhhhhhhhcccccCCCCceEEEEEeeeccceeEEEecCccccccccccccee Confidence 33 3456689999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eeEEEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhhheeeeeccccc-eeeeecCCcceeeccCCccccC Q lcl|NC_019522. 86 GFKDINTAALGYTYSIEEIGFAMLNNVNLDAERGQAVRDVVEQGLNKIYLLGDKGVG-EGLYTSPNVSVEAATSTFVALV 164 (311) Q Consensus 86 ~~~~v~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aa~~~~~~~~n~~~~~G~~~~g-~GllN~p~v~~~~~~~~~~~~~ 164 (311) +.+|++.|+.+|+|+++||++|++.|+||+++|+.+|++++++++|+++|+|++++| +||||+||++...+++++++++ T Consensus 79 ~~~~i~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aa~~~~~~~~n~~~f~G~~~~g~~GLlN~p~~~~~~~~~~~~~~~ 158 (301) T protein:vir:80 79 KSVPIYSIGIGLSYTIQDLRAARMQGTTVDAAKATTVRRAIAEKENSIAFRGEKKYAIKGAFEATGIQIDVSPTTGVGNV 158 (301) T ss_pred EEEEEEEEEeeeeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEeeecccccceeeecCCCcccccccCcccccc Confidence 999999999999999999999999999999999999999999999999999999999 8999999999999999999999 Q ss_pred cccccCCHHHHHHHHHHHHHHHHhccCCceecceEEEeCHHHHHHHhcccccCCCCCcchHHHHHHHhCCceEEEEchhc Q lcl|NC_019522. 165 AAIPTNGTQPIIDFFGNAYNTVYLDNTLTVHRPNTFVLPPAQFQLLARTLLSTQNASNVTLLQFLRTNFPDITFEDDILL 244 (311) Q Consensus 165 t~w~~~t~~ei~~di~~~~~~~~~~~~~~~~~p~~l~lpp~~~~~L~~~~~~~~~~~~~Tvl~~l~~n~~~l~i~~~~~l 244 (311) +.|++||++||++||++++++++.++++ ++.|++|+|||++|.+|++++++ +++++|+++||++|+|+++|+++||| T Consensus 159 ~~w~~~t~~ei~~di~~~~~~l~~~s~g-~~~p~~L~L~p~~~~~L~~~~~~--~~~~~tvl~~l~~~~~~~~I~~~p~L 235 (301) T protein:vir:80 159 SKWEKKTAEQIIDEIGEAHTKITVLPGY-GTASLKLCLPPKQFELINKKRYS--NEDSRSVLKVLQDNAWFSAIVRVPDL 235 (301) T ss_pred cccccCCHHHHHHHHHHHHHHHHHhcCc-eecccEEEecHHHHHhhhhcccc--CCCCeeHHHHHHHHcCcceEEEccee Confidence 9999999999999999999999988755 78999999999999999998876 45689999999999999999999999 Q ss_pred ccCCCCcccEEEEEEcCcceeEEeecchhhhccceeeCCceEEEeeeeeeeeEEEECCeEEEEeecC Q lcl|NC_019522. 245 KGAGVAGADRMAVYKKEIRIVKGHDVMPLRFLAPATADNVNFKVPAILRTGGTEWRIPKAGHYVDGV 311 (311) Q Consensus 245 ~~ag~~g~~~~v~y~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~gGv~i~~P~ai~~~dGI 311 (311) +++|.+|+++|++|+++++++++++|||++++ |+|+++++|+++|++|+|||+||||+||+|+||| T Consensus 236 ~~~g~~g~~~~v~~~~~~d~~~~~v~~~~~~~-~~e~~~~~~~~~~~~r~~Gv~i~~P~ai~~~~GI 301 (301) T protein:vir:80 236 AGMGTAGSDSFAVIHDSNETAELIIPMDITRH-PEEYSFPRTKVPFEERTAGVVVRFPAAIVRVDGI 301 (301) T ss_pred ccCCCCcccEEEEEecCCcEEEEEecCceeee-cceecCceeEeeeeeeeEEEEEEccceEEEEecC Confidence 99999999999999999999999999999998 5799999999999999999999999999999999 No 5 >protein:vir:5255 Length: 304 # NCBI annotation: hypothetical protein # Family: family:all:463 # MgeID: mge:117 # MgeName: Aaphi23 # Cross-refs: genbank:acc:NP_852760;genbank:gi:31544035;uniprot:Q7Y5U0;genbank:GeneID:2753552 Probab=100.00 E-value=5e-91 Score=515.64 Aligned_cols=293 Identities=19% Similarity=0.264 Sum_probs=275.5 Q ss_pred hhhhhhhHHHHHHHHHHHHhhhhhhhhhhhhccccCCCCcceeEEEEEEeecccceE--EecCcccccceeeeeccceeE Q lcl|NC_019522. 11 VSALSFLVNQAAHIESEIYRIEYPQFKYGTLLPLDNSAPDWAQAVMFRSIDARGELQ--LFGPNSTDVPTVDIAMSQGFK 88 (311) Q Consensus 11 ~~~~~fl~~~L~~id~~v~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~G~a~--~~~~~a~dip~v~~~~~~~~~ 88 (311) .|+++||++||++||++|||+++++++++++||+.++++||+++++|+++|.+|+++ +++++++|||++|++++++.. T Consensus 1 ~~~lafl~~qL~~id~~vye~~~~~~~~~~lipv~t~~~~~~~~~~~~~~d~~G~a~~~~i~~~a~dip~vd~~~~~~~~ 80 (304) T protein:vir:52 1 MSLLAYVKNGLTAVSKDIAETKYPEIVFPQFVYVDQQTAVGITEKLHYGADEHGSLDDGLITVGTSTLDQVEVGFTPTRS 80 (304) T ss_pred CchHHHHHHHHHHHhhhhhccccccchhhhhccccCCCCcccceEEEeeeeccCcccccccCCcCCccceeecccceeEE Confidence 789999999999999999999999999999999999999999999999999999999 889999999999999999999 Q ss_pred EEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhhheeeeeccc-cc-eeeeecCCcceeeccCCccccCcc Q lcl|NC_019522. 89 DINTAALGYTYSIEEIGFAMLNNVNLDAERGQAVRDVVEQGLNKIYLLGDKG-VG-EGLYTSPNVSVEAATSTFVALVAA 166 (311) Q Consensus 89 ~v~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aa~~~~~~~~n~~~~~G~~~-~g-~GllN~p~v~~~~~~~~~~~~~t~ 166 (311) |++.|+.+|+|+++||++|+++|++|+++|+.+|++++++++|+++|+|++. .| +||||||||+..++++++++ +. T Consensus 81 ~i~~~~~~~~y~~~El~~a~~~g~~l~~~ka~aa~~a~~~~~n~v~~~Gd~~~~g~~GllN~p~v~~~~~~~~~a~--~~ 158 (304) T protein:vir:52 81 YIVPWAKSVTWTKPELEQGKLLGLALNTAKIMALNKNAQQTLQKVAFLGHAKDSRLTGLLNNKSVEVYAIKGAAQN--TK 158 (304) T ss_pred EEEEEeeeeeecHHHHHHHHHhCCCcHHHHHHHHHHHHHhhhceEEEEeeccccceEEEEeCCCcceeeecCCccC--Cc Confidence 9999999999999999999999999999999999999999999999999985 67 89999999999888776654 45 Q ss_pred cccCCHHHHHHHHHHHHHHHHhccCCceecceEEEeCHHHHHHHhcccccCCCCCcchHHHHHHHhCC-----ceEEEEc Q lcl|NC_019522. 167 IPTNGTQPIIDFFGNAYNTVYLDNTLTVHRPNTFVLPPAQFQLLARTLLSTQNASNVTLLQFLRTNFP-----DITFEDD 241 (311) Q Consensus 167 w~~~t~~ei~~di~~~~~~~~~~~~~~~~~p~~l~lpp~~~~~L~~~~~~~~~~~~~Tvl~~l~~n~~-----~l~i~~~ 241 (311) |++||++||++||+++++++|.++++ ++.|++|+|||++|.+|++++++ ++++|+|+||++||| +|+|+.+ T Consensus 159 w~~~T~~eI~~di~~~~~~i~~~s~~-~~~p~tl~Lpp~~~~~l~~~~~~---~~~~Tvl~~l~~n~~~~~g~~l~I~~v 234 (304) T protein:vir:52 159 VQAMDFDKAVAFFKEIFLKGMEKTKR-IEAPNTFAIDSLDLAHLALVQRA---NTDTTALEFLTKHLSAAAGRQVAIKAL 234 (304) T ss_pred cccCCHHHHHHHHHHHHHHHHhccCc-eecCceEEeCHHHHHHHhhccCC---CCCchHHHHHHHhcccccCCcceEEEe Confidence 99999999999999999999998765 78999999999999999988765 357899999999987 7889999 Q ss_pred hh-cccCCCCcccEEEEEEcCcceeEEeecchhhhccceeeCCc-eEEEeeeeeeeeEEEECCeEEEEeec Q lcl|NC_019522. 242 IL-LKGAGVAGADRMAVYKKEIRIVKGHDVMPLRFLAPATADNV-NFKVPAILRTGGTEWRIPKAGHYVDG 310 (311) Q Consensus 242 ~~-l~~ag~~g~~~~v~y~~~~~~~~~~~~~~~~~~~p~~~~~~-~~~~~~~~~~gGv~i~~P~ai~~~dG 310 (311) ++ +.++|.+|+|||++|++++++++|++|||+++++ .|++++ .|++||++|+|||+||||.+++|+|- T Consensus 235 ~~~~~~~g~~g~~r~vvY~~d~~~~~~~vP~p~~~l~-~q~~~~~~~~vp~~~r~gGv~v~~P~a~~y~D~ 304 (304) T protein:vir:52 235 PSNYGTRVTDGKTRAMVYVNSKEHVIFDVPMSPTVLD-AQPKGLLAFESGLRMAFGGVTFMEPDSALYVDY 304 (304) T ss_pred cccccccCCCCceEEEEEecChhheEEecCccccccc-hhhcCCceEEecceeeeeeEEEEccceeeeecC Confidence 84 6778999999999999999999999999999996 466665 79999999999999999999999999 No 6 >protein:vir:103285 Length: 296 # NCBI annotation: hypothetical protein # Family: family:all:463 # MgeID: mge:1605 # MgeName: JK06 # Cross-refs: genbank:acc:YP_277465;genbank:gi:71834107;genbank:GeneID:3562396 Probab=100.00 E-value=1.9e-90 Score=512.50 Aligned_cols=292 Identities=18% Similarity=0.258 Sum_probs=276.7 Q ss_pred ccccccchhhhhhhHHHHHHHHHHHHhhhhhhhhhhhhccccCCCCcceeEEEEEEeecccceEEecCcccccceeeeec Q lcl|NC_019522. 4 SVFDVSPVSALSFLVNQAAHIESEIYRIEYPQFKYGTLLPLDNSAPDWAQAVMFRSIDARGELQLFGPNSTDVPTVDIAM 83 (311) Q Consensus 4 ~~~~~~~~~~~~fl~~~L~~id~~v~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~G~a~~~~~~a~dip~v~~~~ 83 (311) ++|| +++++++|+++||++||++|||+++++++++++||+.++++||+++++|+++|.+|++++|+++++|+|+++++. T Consensus 1 ~~~~-~a~~~~~f~~~ql~~id~~v~e~~~~~l~~~~~i~v~~~~~~~~~~~~~~~~~~~G~a~~~~~~~~dip~v~~~~ 79 (296) T protein:vir:10 1 MGVD-KADAAGIWTVKQLTASLNKAYETEYDQNSVVNLFPVSNEIPGYAKYFEYPVFDGVGIAQIVADYTDDLPLVDALA 79 (296) T ss_pred Cccc-chhhhHHHHHHHHHHHHHHHHhhhhcccccceecccccCCCCceeEEEeeeeeccCceeEeCCCccccceeeccc Confidence 7887 557889999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cceeEEEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhhheeeeeccccc-eeeeecCCcceeeccCCccc Q lcl|NC_019522. 84 SQGFKDINTAALGYTYSIEEIGFAMLNNVNLDAERGQAVRDVVEQGLNKIYLLGDKGVG-EGLYTSPNVSVEAATSTFVA 162 (311) Q Consensus 84 ~~~~~~v~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aa~~~~~~~~n~~~~~G~~~~g-~GllN~p~v~~~~~~~~~~~ 162 (311) +++.+|++.|+.+|+|+++||++|++.|+||+++|+.+|++++++++|+++|+|++++| +||||+||++..++.+ T Consensus 80 ~~~~~~i~~~~~~~~~~~~El~~a~~~g~~l~~~ka~aA~~~~~~~~n~~~f~G~~~~g~~GLlN~p~v~~~~~~~---- 155 (296) T protein:vir:10 80 TERQGKVFRFGNAFLISIDEIKVGQATGQSLSTRKQSLAFEAHDKLLDKLVWSGSTAHGIPSVFDYPNINNVVSGG---- 155 (296) T ss_pred eeEEEEEEEEEeeeeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEeecccccceeEeecCCCccccccC---- Confidence 99999999999999999999999999999999999999999999999999999999999 7999999998765443 Q ss_pred cCcccccCCHHHHHHHHHHHHHHHHhccCCceecceEEEeCHHHHHHHhcccccCCCCCcchHHHHHHHhCCceEEEEch Q lcl|NC_019522. 163 LVAAIPTNGTQPIIDFFGNAYNTVYLDNTLTVHRPNTFVLPPAQFQLLARTLLSTQNASNVTLLQFLRTNFPDITFEDDI 242 (311) Q Consensus 163 ~~t~w~~~t~~ei~~di~~~~~~~~~~~~~~~~~p~~l~lpp~~~~~L~~~~~~~~~~~~~Tvl~~l~~n~~~l~i~~~~ 242 (311) .|+++| +|++||++++++++.++++ ++.|++|+|||++|.+|++++ +++++|+++||++|||+++|+++| T Consensus 156 ---~W~~~t--~i~~Di~~~~~~l~~~s~g-~~~p~~l~L~p~~~~~L~~~~----~~~~~t~l~~ik~~~~~l~i~~~~ 225 (296) T protein:vir:10 156 ---SWSQPT--TAVSDITSLLDIIETSTNG-QHRATHLLLPTTARRIMQNLV----PGTSVSYGEFFRQNNSGVTVEFVQ 225 (296) T ss_pred ---CccCHH--HHHHHHHHHHHHHHHhhCc-eecceeEEeCHHHHHHHhhcc----CCCCccHHHHHHHhcCCceEEEee Confidence 386554 9999999999999988654 789999999999999998765 356899999999999999999999 Q ss_pred hcccCCCCcccEEEEEEcCcceeEEeecchhhhccceeeCCceEEEeeeeeeeeEEEECCeEEEEeecC Q lcl|NC_019522. 243 LLKGAGVAGADRMAVYKKEIRIVKGHDVMPLRFLAPATADNVNFKVPAILRTGGTEWRIPKAGHYVDGV 311 (311) Q Consensus 243 ~l~~ag~~g~~~~v~y~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~gGv~i~~P~ai~~~dGI 311 (311) ||+++|++|+++|++|+++++++++++|+|++++ |+|+++++|+++|++|+|||+||||+||+|+||| T Consensus 226 ~l~~a~~~g~~~~v~~~~~~~~~~~~v~~~~~~~-~~e~~~l~~~~~~~~~~~Gv~i~~P~ai~~~dGI 293 (296) T protein:vir:10 226 YLNDYNGTGTSAAIAYEKDPNNMAIEIPEATNAL-PAQPKDLHFKIPVTSKATGLIVYRPLTMAVMKGI 293 (296) T ss_pred eeccCCCCcceEEEEEEcCCceEEEEcCcceeee-cccccCceEEEeeEeeEEEEEEECCceeEEEeee Confidence 9999999999999999999999999999999998 5899999999999999999999999999999999 No 7 >protein:vir:94070 Length: 339 # NCBI annotation: putative structural protein # Family: family:all:1653 # MgeID: mge:1493 # MgeName: OP2 # Cross-refs: genbank:acc:YP_453625;genbank:gi:84662661;genbank:GeneID:5142580 Probab=100.00 E-value=2.3e-87 Score=495.59 Aligned_cols=296 Identities=13% Similarity=0.069 Sum_probs=271.4 Q ss_pred CCccccccc---------chhhhhhhHHHHHHHHHHHHhhhhhhhhhhhhccccCCCCcceeEEEEEEeecccceEEecC Q lcl|NC_019522. 1 MAKSVFDVS---------PVSALSFLVNQAAHIESEIYRIEYPQFKYGTLLPLDNSAPDWAQAVMFRSIDARGELQLFGP 71 (311) Q Consensus 1 ~~~~~~~~~---------~~~~~~fl~~~L~~id~~v~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~G~a~~~~~ 71 (311) ..+++||++ +.+++ ..++|++||++|||+++++++++++||+.++++|++++++|+++|.+|+|++||| T Consensus 32 ~~~~a~d~~~~~~~~~~~~~~~i--~a~~~~~i~~~vy~~~~~~~~~~~l~pv~t~g~w~~~t~~y~~~e~~G~a~~ygd 109 (339) T protein:vir:94 32 VSAYAMDAVNLTPTLQTTANAGI--PAWMTTFVDRRVIDIQLAPMAAAKIFPEVKKGDWTTTYGVFIIAEPVGQVATYSD 109 (339) T ss_pred hHhhhccccccccccccccccch--hhhhhhhhchhheeecccccchhhhcccccCCCCcccEEEEeeeecccceEEccc Confidence 334455543 23333 4679999999999999999999999999999999999999999999999999999 Q ss_pred cccccceeeeeccceeEEEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhhheeeeeccccc-eeeeecCC Q lcl|NC_019522. 72 NSTDVPTVDIAMSQGFKDINTAALGYTYSIEEIGFAMLNNVNLDAERGQAVRDVVEQGLNKIYLLGDKGVG-EGLYTSPN 150 (311) Q Consensus 72 ~a~dip~v~~~~~~~~~~v~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aa~~~~~~~~n~~~~~G~~~~g-~GllN~p~ 150 (311) ++++ |+++++++++++++++++.||+|+++|+++|+++|++|+++|+.+|++++++++|+++|+|+++++ |||||||| T Consensus 110 ~ad~-Pl~~~~v~~~~~~v~~~~~g~~y~~~E~~~A~~~g~~l~~~Ka~aA~~al~~~~N~i~~~Gd~~~~~~GLlN~P~ 188 (339) T protein:vir:94 110 WSAN-GMSKANVNFESRQNYRYQTWTEYGDLEMATYGEAGIDYVARQEISASLVMAKFANSSYLLGVAGIANYGLMNDPS 188 (339) T ss_pred ccCC-CcccccceeeEEeEEEEEEEEeecHHHHHHHHhhCCChHHHHHHHHHHHHHHhhceEEeeeecccceEEEEeCCC Confidence 9866 999999999999999999999999999999999999999999999999999999999999999998 89999999 Q ss_pred cceeeccCCccccCcccccCCHHHHHHHHHHHHHHHHhccCCce--ecceEEEeCHHHHHHHhcccccCCCCCcchHHHH Q lcl|NC_019522. 151 VSVEAATSTFVALVAAIPTNGTQPIIDFFGNAYNTVYLDNTLTV--HRPNTFVLPPAQFQLLARTLLSTQNASNVTLLQF 228 (311) Q Consensus 151 v~~~~~~~~~~~~~t~w~~~t~~ei~~di~~~~~~~~~~~~~~~--~~p~~l~lpp~~~~~L~~~~~~~~~~~~~Tvl~~ 228 (311) +++.+.+ ++.|++||++||++||++++++++.++++.+ +.|++|+|||+.+.+|+++ +.+++|+++| T Consensus 189 l~~~v~~------s~~Wa~kT~~eI~~Di~~~~~~l~~~s~g~~~~~~~~~L~LP~~~~~~L~~~-----n~~~~Tvl~~ 257 (339) T protein:vir:94 189 LPAPVAA------TVNWATAAPEDIANDVVAMVGRLISQSGGLITGQERMVMALAPSALNNVNRT-----NNFGLSAGAK 257 (339) T ss_pred ccccccC------CCCcccCCHHHHHHHHHHHHHHHHHhcCCeeeeccCcEEEecHHHHHhcccC-----CcCCccHHHH Confidence 9765433 3469999999999999999999999988754 5789999999999999875 4578999999 Q ss_pred HHHhCCceEEEEchhcccCCCCcccEEEEEEcCcceeEEeecchhhhccceeeCCceEEEeeeeeeeeEEEECCeEEEEe Q lcl|NC_019522. 229 LRTNFPDITFEDDILLKGAGVAGADRMAVYKKEIRIVKGHDVMPLRFLAPATADNVNFKVPAILRTGGTEWRIPKAGHYV 308 (311) Q Consensus 229 l~~n~~~l~i~~~~~l~~ag~~g~~~~v~y~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~gGv~i~~P~ai~~~ 308 (311) ||+|||||+|+++|||+++|+++..+|+.+.++++++++++|||++++ |+|+++++|++||++|||||+||||+||+|+ T Consensus 258 lk~n~pnl~i~~~~el~~a~g~~~~~~~~~~~~~~~~~~~~p~~~~~l-pvq~~~~~~~v~~~~rt~Gv~i~~P~ai~~~ 336 (339) T protein:vir:94 258 IAQTYPNIQFVAVPEFDTASGRLVQLWVPEVNGQPTGEVAFAEKLRSH-SIERYSTTTRQKHSGATFGAVIYQPWAVTQE 336 (339) T ss_pred HHHhcCCcEEEEccccccCCCceEEEEEEeccCCcceEEEcchhhhcc-ccEEcCceEEecceeeeeeEEEEccceeeee Confidence 999999999999999999988888888888999999999999999999 5899999999999999999999999999999 Q ss_pred ecC Q lcl|NC_019522. 309 DGV 311 (311) Q Consensus 309 dGI 311 (311) +|| T Consensus 337 ~GI 339 (339) T protein:vir:94 337 LGV 339 (339) T ss_pred ecC Confidence 999 No 8 >protein:vir:78558 Length: 336 # NCBI annotation: major capsid protein # Family: family:all:1653 # MgeID: mge:1854 # MgeName: BcepNY3 # Cross-refs: genbank:acc:YP_001294848;genbank:gi:149882911;genbank:GeneID:5291029 Probab=100.00 E-value=1.6e-84 Score=479.99 Aligned_cols=296 Identities=16% Similarity=0.133 Sum_probs=262.9 Q ss_pred CCccccc---------ccchhhh-hhhHHHHHHHHHHHHhhhhhhhhhhhhccccCCCCcceeEEEEEEeecccceEEec Q lcl|NC_019522. 1 MAKSVFD---------VSPVSAL-SFLVNQAAHIESEIYRIEYPQFKYGTLLPLDNSAPDWAQAVMFRSIDARGELQLFG 70 (311) Q Consensus 1 ~~~~~~~---------~~~~~~~-~fl~~~L~~id~~v~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~G~a~~~~ 70 (311) ..+++|| +.+++++ .||+ ++|||++||++++++++.+|+|+.+.++|.+++++|.++|.+|++++|| T Consensus 28 ~~~~a~da~d~~~~~~t~~~~g~~~~l~---~~i~p~~~~~~~~~~~~~~l~~v~t~g~W~~~~~~~~~~e~~G~a~~yg 104 (336) T protein:vir:78 28 LAEYAMDAADLSPHLSSTGSSGIPNYLT---TYVDPSVIDILVAPMKAAELVGESKKGDWTTLVAAFITAEPTTTVATYG 104 (336) T ss_pred HHHHHHhhhhhccccccCCCcchHHHHH---HhcccceeeehhhhhhhhhhcccccCCCccccEEEEeeeecceeeEEee Confidence 2223333 2233443 5777 7999999999999999999999999988888999999999999999999 Q ss_pred CcccccceeeeeccceeEEEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhhheeeeeccccc-eeeeecC Q lcl|NC_019522. 71 PNSTDVPTVDIAMSQGFKDINTAALGYTYSIEEIGFAMLNNVNLDAERGQAVRDVVEQGLNKIYLLGDKGVG-EGLYTSP 149 (311) Q Consensus 71 ~~a~dip~v~~~~~~~~~~v~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aa~~~~~~~~n~~~~~G~~~~g-~GllN~p 149 (311) |++|+ |++|++++++++++++++.+|+|+++|+++|+++|++|+++|+.+||+++++++|+++|+|+++++ ||||||| T Consensus 105 d~~D~-P~vd~~~~~~~~~v~~~~~g~~yg~~El~~A~~~g~~l~~~Ka~aA~~ale~~~N~~~~~Gd~~~~~~GllN~P 183 (336) T protein:vir:78 105 DYSSD-GDSGTNINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASELNYSSALGLAKFLNGSYLFGVAGLENYGLINDP 183 (336) T ss_pred cccCC-CeeecceeeEEEEEEEEEeeeeecHHHHHHHHHhCCCcHHHHHHHHHHHHHHhhCeEEEEeccccceEEEEeCC Confidence 98665 999999999999999999999999999999999999999999999999999999999999999998 8999999 Q ss_pred CcceeeccCCccccCcccccCCHHHHHHHHHHHHHHHHhccCCce--ecceEEEeCHHHHHHHhcccccCCCCCcchHHH Q lcl|NC_019522. 150 NVSVEAATSTFVALVAAIPTNGTQPIIDFFGNAYNTVYLDNTLTV--HRPNTFVLPPAQFQLLARTLLSTQNASNVTLLQ 227 (311) Q Consensus 150 ~v~~~~~~~~~~~~~t~w~~~t~~ei~~di~~~~~~~~~~~~~~~--~~p~~l~lpp~~~~~L~~~~~~~~~~~~~Tvl~ 227 (311) ++++..+++ +++|.+||++||++||+.++++++.++++.+ +.|++|+|||+++.+|+++ +.+++|+++ T Consensus 184 ~l~a~~t~~-----~~~w~~~T~~~I~~Di~~~~~~l~~qt~g~~~~~~~~tL~Lp~~~~~~L~~~-----n~~g~tv~~ 253 (336) T protein:vir:78 184 SLSAPITAT-----TPWSGSPAVEAVVNEVVTLFQVLQTQSQGIITQEAVLHMGLPPTAMSDLSKT-----NQYGLSAAA 253 (336) T ss_pred CCCcccccC-----cCcccccCHHHHHHHHHHHHHHHHHhcCCeeeeccceEEEechHHHHhccCC-----CccCccHHH Confidence 998765443 3459999999999999999999999998754 6799999999999999864 567899999 Q ss_pred HHHHhCCceEEEEchhcccCCCCcccEEEEEEcCcceeEEeecchhhhccceeeCCceEEEeeeeeeeeEEEECCeEEEE Q lcl|NC_019522. 228 FLRTNFPDITFEDDILLKGAGVAGADRMAVYKKEIRIVKGHDVMPLRFLAPATADNVNFKVPAILRTGGTEWRIPKAGHY 307 (311) Q Consensus 228 ~l~~n~~~l~i~~~~~l~~ag~~g~~~~v~y~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~gGv~i~~P~ai~~ 307 (311) |||+||||++|+++|||+++|++....++-...+++++++++|++|++| |+|+++++|++||++|||||+||||++|+| T Consensus 254 ~lk~n~Pnl~i~t~pel~~Agg~~~~~~~~~~~~~~t~~~~~p~~f~~l-pvq~~~~~~~v~~~~rt~Gv~i~~P~ai~~ 332 (336) T protein:vir:78 254 KLKEIFPKLEFVTIPEYDTASGRLVQLWAPRVEGKDTATCGFTEKMRAH-SIERYSSYFRQKKSAGTWGAVIFRPFAVAQ 332 (336) T ss_pred HHHHhcCccEEEEcccccccCcceEEEEEeeccCCcceeeecchhhhcc-ceeecCceeEeccccceeeeeeeccchhee Confidence 9999999999999999998765443333344555788999999999999 689999999999999999999999999999 Q ss_pred eecC Q lcl|NC_019522. 308 VDGV 311 (311) Q Consensus 308 ~dGI 311 (311) ++|| T Consensus 333 ~~GI 336 (336) T protein:vir:78 333 MIGV 336 (336) T ss_pred eccC Confidence 9999 No 9 >protein:vir:99576 Length: 388 # NCBI annotation: hypothetical protein # Family: family:all:1653 # MgeID: mge:1544 # MgeName: BcepF1 # Cross-refs: genbank:acc:YP_001039801;genbank:gi:126011051;genbank:GeneID:4818271 Probab=100.00 E-value=3e-84 Score=478.49 Aligned_cols=303 Identities=16% Similarity=0.140 Sum_probs=274.0 Q ss_pred CCcccccccc-----hhhhhhhHHHHHHHHHHHHhhhhhhhhhhhhccccCCCCcceeEEEEEEeecccceEEecCcccc Q lcl|NC_019522. 1 MAKSVFDVSP-----VSALSFLVNQAAHIESEIYRIEYPQFKYGTLLPLDNSAPDWAQAVMFRSIDARGELQLFGPNSTD 75 (311) Q Consensus 1 ~~~~~~~~~~-----~~~~~fl~~~L~~id~~v~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~G~a~~~~~~a~d 75 (311) ++..+||++. ..++++...+|++|||+||++.++++++.+|||+.++++|.+++++|.++|.+|++++|||++| T Consensus 62 ~~~~a~da~~~~~~t~~~~gip~~~~~~~~p~~~~~~~~p~~~~~l~pv~t~g~W~~~~~~f~v~e~~G~A~~ygd~~D- 140 (388) T protein:vir:99 62 VATQAFDSAYVAPTTQASIPTPIQFLQQWLPGFVKVLTSARKIDEILGVKTVGSWEDQEIVQGIVEPAGTAMEYGDLTN- 140 (388) T ss_pred hhhcccCcccccccccCcccHHHHHhhhhccceeeeeechhhhhhhccccccCCccceeEEEeeeecceeEEEeecccC- Confidence 7788999762 3456788999999999999999999999999999999998899999999999999999999865 Q ss_pred cceeeeeccceeEEEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhhheeeeecccc---c-eeeeecCCc Q lcl|NC_019522. 76 VPTVDIAMSQGFKDINTAALGYTYSIEEIGFAMLNNVNLDAERGQAVRDVVEQGLNKIYLLGDKGV---G-EGLYTSPNV 151 (311) Q Consensus 76 ip~v~~~~~~~~~~v~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aa~~~~~~~~n~~~~~G~~~~---g-~GllN~p~v 151 (311) +|++|++++++++++++++.+|+|+++|+++|+++|++|+++|+.+|++++++++|+++|||+++. + ||||||||+ T Consensus 141 ~Pl~d~~~~~~~r~v~~~~~g~~yg~~El~~A~~~g~~l~~~Ka~AA~~ale~~~N~i~f~G~~g~~~~~~yGllNdP~l 220 (388) T protein:vir:99 141 IPLSSWNVNFERRTIVRGEMGIQVGLLEEGRASAMRINSAEVKRQGAAVQLEIMRNAIGFYGWEGKNGNRTFGFLNDPSL 220 (388) T ss_pred CCceeccceeeeeeEEEEEeeeeecHHHHHHHHhhCCCcHHHHHHHHHHHHHhhhceEEEEeecCCCccceEEEeeCCCc Confidence 599999999999999999999999999999999999999999999999999999999999998753 4 899999999 Q ss_pred ceeeccCCccccCcccccCCHHHHHHHHHHHHHHHHhccCCce---ecceEEEeCHHHHHHHhcccccCCCCCcchHHHH Q lcl|NC_019522. 152 SVEAATSTFVALVAAIPTNGTQPIIDFFGNAYNTVYLDNTLTV---HRPNTFVLPPAQFQLLARTLLSTQNASNVTLLQF 228 (311) Q Consensus 152 ~~~~~~~~~~~~~t~w~~~t~~ei~~di~~~~~~~~~~~~~~~---~~p~~l~lpp~~~~~L~~~~~~~~~~~~~Tvl~~ 228 (311) ++...+ ++.++++.|++||++||++||+.++++++.++++.+ +.|.+|+|||+++.+|+++ +.+++|+++| T Consensus 221 ~a~v~a-t~~~~~~~Wa~kT~~eI~~Di~~~~~~i~~qs~g~~~~~~~~~tL~LP~~~~~~Ls~~-----n~~g~Tvl~~ 294 (388) T protein:vir:99 221 LPAIAS-TTPGGWVSGGANAFQGIVGDLRLMLITLRVQSEDNIDPEDVDITLVLPMNKVDMLSVV-----TDLGISVRDW 294 (388) T ss_pred cccccc-ccCCcCcccccCCHHHHHHHHHHHHHHHHHhcCCeeeecccceEEEechHHHHhcccc-----CcCCccHHHH Confidence 987664 567777889999999999999999999999987643 2456999999999999864 4568999999 Q ss_pred HHHhCCceEEEEchhcccCC-CCcccEEEEEEcCcc-----------eeEEeecchhhhccceeeCCceEEEeeeeeeee Q lcl|NC_019522. 229 LRTNFPDITFEDDILLKGAG-VAGADRMAVYKKEIR-----------IVKGHDVMPLRFLAPATADNVNFKVPAILRTGG 296 (311) Q Consensus 229 l~~n~~~l~i~~~~~l~~ag-~~g~~~~v~y~~~~~-----------~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~gG 296 (311) ||+||||++|+++|||++++ .+|.+.+++|.++.+ .....+|++|+++ |+|+++++|++||++|||| T Consensus 295 lk~n~Pnl~i~t~pEl~~a~~tgg~~~~~~~~~~~~~~~~~~~~~~~t~~~~~p~~~~~l-~vq~~~~~~~~~~~~rt~G 373 (388) T protein:vir:99 295 LKQTYPRVRVMSAPELQGGNPDDGKDIAYMFLDSVDTAVDGSTDGGDTWAQLVQSKFVTL-GVEKRVKNYVEAYSNATAG 373 (388) T ss_pred HHHhcCCcEEEEecccccccccCCceeEEEEecccccccccCccCcceeEEecccccccc-cceecCceeEeccccceee Confidence 99999999999999999985 577788888877654 3556789999888 6899999999999999999 Q ss_pred EEEECCeEEEEeecC Q lcl|NC_019522. 297 TEWRIPKAGHYVDGV 311 (311) Q Consensus 297 v~i~~P~ai~~~dGI 311 (311) |+||||+||+|++|| T Consensus 374 v~ir~P~Ai~~~~GI 388 (388) T protein:vir:99 374 VMLKRPWAVVRLIGL 388 (388) T ss_pred eEEeccchhheeccC Confidence 999999999999999 No 10 >protein:vir:106734 Length: 336 # NCBI annotation: gp13 # Family: family:all:1653 # MgeID: mge:1599 # MgeName: Bcep1 # Cross-refs: genbank:acc:NP_944321;genbank:gi:38638620;genbank:GeneID:2657363 Probab=100.00 E-value=3.7e-84 Score=477.97 Aligned_cols=293 Identities=15% Similarity=0.141 Sum_probs=263.2 Q ss_pred CCccccc---------ccchhhh-hhhHHHHHHHHHHHHhhhhhhhhhhhhccccCCCCcceeEEEEEEeecccceEEec Q lcl|NC_019522. 1 MAKSVFD---------VSPVSAL-SFLVNQAAHIESEIYRIEYPQFKYGTLLPLDNSAPDWAQAVMFRSIDARGELQLFG 70 (311) Q Consensus 1 ~~~~~~~---------~~~~~~~-~fl~~~L~~id~~v~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~G~a~~~~ 70 (311) ..+++|| +.+++++ .||+ ++|||++||.+++++++.+|+|+.++++|++++++|.++|.+|++++|| T Consensus 28 ~~~~a~da~d~~~~~~t~~~~g~~~~l~---~~i~p~~~~~~~~~~~~~~l~~v~t~g~w~~~~~~~~~~e~~G~a~~yg 104 (336) T protein:vir:10 28 LAEYAMDAADLSPHLSSTGSSGIPNYLT---TYVDPSVIDILVAPMKAAELVGESKKGDWTTLVAAFITAEPTTKVATYG 104 (336) T ss_pred HHHHHHhhhhhccccccCCCcchHHHHH---hhcCcceeeeeechhchhhhcccccCCCcceeeEEEEeeeeeeeEEEcc Confidence 2223333 2233343 5777 7999999999999999999999999999999999999999999999999 Q ss_pred CcccccceeeeeccceeEEEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhhheeeeeccccc-eeeeecC Q lcl|NC_019522. 71 PNSTDVPTVDIAMSQGFKDINTAALGYTYSIEEIGFAMLNNVNLDAERGQAVRDVVEQGLNKIYLLGDKGVG-EGLYTSP 149 (311) Q Consensus 71 ~~a~dip~v~~~~~~~~~~v~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aa~~~~~~~~n~~~~~G~~~~g-~GllN~p 149 (311) |+ +|+|++|++.+++++++++++.+|+|+.+|+++|+++|++|+++|+.+||+++++++|+++|+|+++++ ||||||| T Consensus 105 d~-~d~P~~d~~~~~~~~~v~~~~~g~~yg~~El~~A~~~g~~l~~~Ka~aA~~ale~~~N~~~~~Gd~~~~~~GllN~P 183 (336) T protein:vir:10 105 DY-SSDGDSGTNINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASELNYSSALGLAKFLNGSYLFGVAGLENYGLINDP 183 (336) T ss_pred cc-CCCcceeeeeeeeeeeEEEEEEEEeeCHHHHHHHHHhCCCcHHHHHHHHHHHHHHhhCeEEEEeecccceEEEeecC Confidence 97 788999999999999999999999999999999999999999999999999999999999999999998 8999999 Q ss_pred CcceeeccCCccccCcccccCCHHHHHHHHHHHHHHHHhccCCce--ecceEEEeCHHHHHHHhcccccCCCCCcchHHH Q lcl|NC_019522. 150 NVSVEAATSTFVALVAAIPTNGTQPIIDFFGNAYNTVYLDNTLTV--HRPNTFVLPPAQFQLLARTLLSTQNASNVTLLQ 227 (311) Q Consensus 150 ~v~~~~~~~~~~~~~t~w~~~t~~ei~~di~~~~~~~~~~~~~~~--~~p~~l~lpp~~~~~L~~~~~~~~~~~~~Tvl~ 227 (311) ++++..+++ +++|.+||++||++||++++++++.++++.+ +.|++|+|||+++.+|+++ +.+++|+++ T Consensus 184 ~l~a~~t~~-----~~~w~~~T~~eI~~Di~~~~~~l~~qt~g~i~~~~~~tL~Lp~~~~~~L~~~-----n~~g~tv~~ 253 (336) T protein:vir:10 184 SLSAPITAT-----TPWSGSPAVEAVVNEVVTLFQVLQTQSQGIITQEAVLHMGLPPTAMSDLSKT-----NQYGLSAAA 253 (336) T ss_pred CCCcccccC-----cCcccccCHHHHHHHHHHHHHHHHHhcCCeeeeccceEEEechHHHHhccCC-----CccCccHHH Confidence 998766443 3459999999999999999999999987754 6799999999999999864 567899999 Q ss_pred HHHHhCCceEEEEchhcccCCCCcccEEEEEE---cCcceeEEeecchhhhccceeeCCceEEEeeeeeeeeEEEECCeE Q lcl|NC_019522. 228 FLRTNFPDITFEDDILLKGAGVAGADRMAVYK---KEIRIVKGHDVMPLRFLAPATADNVNFKVPAILRTGGTEWRIPKA 304 (311) Q Consensus 228 ~l~~n~~~l~i~~~~~l~~ag~~g~~~~v~y~---~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~gGv~i~~P~a 304 (311) |||+||||++|+++|||.++|+ +++.+|+ .+++++++++|++|++| |+|+++++|++||++|||||+||||++ T Consensus 254 ~lk~n~Pnl~i~t~pel~~Agg---~~~~~~~~~~~~~~t~~~~~P~~f~~l-pvq~~~~~~~v~~~~rt~Gv~i~rP~a 329 (336) T protein:vir:10 254 KLKEIFPKLEFVTIPEYDTASG---RLVQLWAPRVEGKDTATCGFTEKMRAH-SIERYSSYFRQKKSAGTWGAVIFRPFA 329 (336) T ss_pred HHHHhCCccEEEEcccccccCC---ceEEEEEecccCCcceeeecChhhhcc-ceeecCceeEeccccceeeeeeeccch Confidence 9999999999999999998764 4555554 44678999999999999 689999999999999999999999999 Q ss_pred EEEeecC Q lcl|NC_019522. 305 GHYVDGV 311 (311) Q Consensus 305 i~~~dGI 311 (311) |+|++|| T Consensus 330 i~~~~GI 336 (336) T protein:vir:10 330 VAQMLGV 336 (336) T ss_pred heeeccC Confidence 9999999 No 11 >protein:vir:107732 Length: 379 # NCBI annotation: gp23 # Family: family:all:1653 # MgeID: mge:1520 # MgeName: BcepB1A # Cross-refs: genbank:acc:YP_024871;genbank:gi:48697513;genbank:GeneID:2948349 Probab=100.00 E-value=2.5e-83 Score=473.44 Aligned_cols=299 Identities=15% Similarity=0.142 Sum_probs=266.4 Q ss_pred CCccccccc---------------chhhh-hhhHHHHHHHHHHHHhhhhhhhhhhhhccccCCCCcceeEEEEEEeeccc Q lcl|NC_019522. 1 MAKSVFDVS---------------PVSAL-SFLVNQAAHIESEIYRIEYPQFKYGTLLPLDNSAPDWAQAVMFRSIDARG 64 (311) Q Consensus 1 ~~~~~~~~~---------------~~~~~-~fl~~~L~~id~~v~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~G 64 (311) -...+||++ +++++ .||+ +++ |++++..++++++.+|||+.++++|++++++|.++|.+| T Consensus 51 ~~~~amd~~~~~~~~~~~~~l~~~~~~g~~~~l~---~~~-p~~i~~~tap~~a~~l~pv~t~g~W~~~~~~~~v~e~~G 126 (379) T protein:vir:10 51 LMQFAMDSNDIGPIPTPLSPLSPVSIPGLIQFLQ---NWL-PGHVRILTAVREADEFLGLSTVGQWDDEQIVQRVLEGLG 126 (379) T ss_pred hhhhhhccccccccccccCccccccccchHHHHH---hhc-chHHHHHhhhhhhhhhcccccCCCceeeeEEEeeeeeee Confidence 111245544 34443 4555 344 999999999999999999999999999999999999999 Q ss_pred ceEEecCcccccceeeeeccceeEEEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhhheeeeec--cccc Q lcl|NC_019522. 65 ELQLFGPNSTDVPTVDIAMSQGFKDINTAALGYTYSIEEIGFAMLNNVNLDAERGQAVRDVVEQGLNKIYLLGD--KGVG 142 (311) Q Consensus 65 ~a~~~~~~a~dip~v~~~~~~~~~~v~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aa~~~~~~~~n~~~~~G~--~~~g 142 (311) +|++|+|++++ |+++++++++++++++++.+|+|+++|+++|+++|++|+++|+.+||+++++++|+++|||+ ++++ T Consensus 127 ~A~~ygd~~d~-pl~d~~~~~~~r~v~~~~~g~~yg~~El~~Aa~~g~~l~~~Ka~aA~~ale~~~N~i~f~G~~d~~~~ 205 (379) T protein:vir:10 127 TAQPYTDGGNM-ALMSWTPTFETRTVVRFEAGLQVAPLEEARSSRVQVSSADEKRAMVGEALEVQRNRVAFYGYNDGSGR 205 (379) T ss_pred eeEEeccccCC-CeeeeeeeeeeeeeEEEEEEEeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEEeecCCCcc Confidence 99999998655 99999999999999999999999999999999999999999999999999999999999995 5666 Q ss_pred -eeeeecCCcceeeccCCccccCcccccCCHHHHHHHHHHHHHHHHhccCCce---ecceEEEeCHHHHHHHhcccccCC Q lcl|NC_019522. 143 -EGLYTSPNVSVEAATSTFVALVAAIPTNGTQPIIDFFGNAYNTVYLDNTLTV---HRPNTFVLPPAQFQLLARTLLSTQ 218 (311) Q Consensus 143 -~GllN~p~v~~~~~~~~~~~~~t~w~~~t~~ei~~di~~~~~~~~~~~~~~~---~~p~~l~lpp~~~~~L~~~~~~~~ 218 (311) ||||||||+++..++++++++++.|++||++||++||+.++++++.++++.+ +.|++|+|||+++.+|+++ T Consensus 206 ~yGllNdP~l~a~~t~atg~~~~t~Wa~kT~~eI~~Di~~~~~~l~~qs~g~~~~~~~~~tL~LP~~~~~~L~~~----- 280 (379) T protein:vir:10 206 TFGFLNDPNLPAYVAVPNGAGGSPLWAQKTTLEIIADLRNGLTALQVQSMGRIKSNKTPITIGIPNAYENYITTP----- 280 (379) T ss_pred eEEEEeCCCCcccccccCCcccccccccCCHHHHHHHHHHHHHHHHHhhCCeecccccceeEEecHHHHHhhccc----- Confidence 8999999999988888999999999999999999999999999999987643 5677999999999999864 Q ss_pred CCCcchHHHHHHHhCCceEEEEchhcccCCCCcccEEEEEEcCcc--------eeEEeecchhhhccceeeCCceEEEee Q lcl|NC_019522. 219 NASNVTLLQFLRTNFPDITFEDDILLKGAGVAGADRMAVYKKEIR--------IVKGHDVMPLRFLAPATADNVNFKVPA 290 (311) Q Consensus 219 ~~~~~Tvl~~l~~n~~~l~i~~~~~l~~ag~~g~~~~v~y~~~~~--------~~~~~~~~~~~~~~p~~~~~~~~~~~~ 290 (311) +.+++|+++||++||||++|+++|||+++|++|+. +++|.++++ .+.+++|++++++ |+|+++++|++|| T Consensus 281 n~~g~Tvl~~lk~n~Pnl~i~t~pEL~~aggg~~~-~~~~~~~~~~~~t~~~~~~~~~~p~k~~~l-~ve~~~~~~~~~~ 358 (379) T protein:vir:10 281 TELGYSVAQYMRESYPNVTFVSAPELNDANGGSSA-IYYYADAVENNGTDDGRTWLQVVPTKMFTL-GVEKKIKGYAEGY 358 (379) T ss_pred cccCccHHHHHHHhcCCcEEEEcccccccCCCccE-EEEEeeccCCCccCCcceEEEecchhhhhc-cceecCceeEecc Confidence 56789999999999999999999999999866555 555555444 5778899999998 5799999999999 Q ss_pred eeeeeeEEEECCeEEEEeecC Q lcl|NC_019522. 291 ILRTGGTEWRIPKAGHYVDGV 311 (311) Q Consensus 291 ~~~~gGv~i~~P~ai~~~dGI 311 (311) ++|||||+||||+||+|++|- T Consensus 359 ~~rt~Gv~ir~P~Ai~~~~G~ 379 (379) T protein:vir:10 359 TNATAGAMLKRPFATYRQTGA 379 (379) T ss_pred ccceeeeeeecchhhheecCC Confidence 999999999999999999999 No 12 >protein:vir:101557 Length: 336 # NCBI annotation: gp12 # Family: family:all:1653 # MgeID: mge:1477 # MgeName: Bcep43 # Cross-refs: genbank:acc:NP_958117;genbank:gi:41057663;genbank:GeneID:2716814 Probab=100.00 E-value=1.6e-83 Score=474.54 Aligned_cols=296 Identities=16% Similarity=0.115 Sum_probs=265.0 Q ss_pred CCccccccc------chhhh-hhhHHHHHHHHHHHHhhhhhhhhhhhhccccCCCCcceeEEEEEEeecccceEEecCcc Q lcl|NC_019522. 1 MAKSVFDVS------PVSAL-SFLVNQAAHIESEIYRIEYPQFKYGTLLPLDNSAPDWAQAVMFRSIDARGELQLFGPNS 73 (311) Q Consensus 1 ~~~~~~~~~------~~~~~-~fl~~~L~~id~~v~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~G~a~~~~~~a 73 (311) |++-++|+. +++++ .||+ ++|||++|+++++++++.+|+|+.+.++|.+++++|.++|.+|++++|||++ T Consensus 31 ~~~da~d~~~~~~~~~~~~i~~~l~---~~i~p~~~~~~~~p~~a~~l~pv~t~g~W~~~~~~~~~~e~~G~a~~ygd~~ 107 (336) T protein:vir:10 31 YAMDAADLSPHLSSTGSSGIPNYLT---TYVDPAVIDILVAPMKAAELVGESKKGDWTTLVAAFITAEPTTKVATYGDYS 107 (336) T ss_pred hhhhhhhccCccccCCCchhHHHHH---hhcccceeeehhhhhhhhhhccccccCCccceeEEEeeeeceeeEEEeeccC Confidence 333333332 34554 5676 8999999999999999999999999999999999999999999999999986 Q ss_pred cccceeeeeccceeEEEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhhheeeeeccccc-eeeeecCCcc Q lcl|NC_019522. 74 TDVPTVDIAMSQGFKDINTAALGYTYSIEEIGFAMLNNVNLDAERGQAVRDVVEQGLNKIYLLGDKGVG-EGLYTSPNVS 152 (311) Q Consensus 74 ~dip~v~~~~~~~~~~v~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aa~~~~~~~~n~~~~~G~~~~g-~GllN~p~v~ 152 (311) | +|++|++.+++++++++++.+|+|+++|+++|+++|++|+++|+.+||+++++++|+++|+|+++++ |||||||+++ T Consensus 108 D-~P~~d~~~~~~~~~v~~~~~g~~yg~~El~~A~~~g~~l~~~Ka~aA~~ale~~~N~i~~~Gd~~~~~yGllN~P~l~ 186 (336) T protein:vir:10 108 S-DGDSGANINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASELNYSSALGLAKFLNGSYLFGVAGLENYGLINDPSLS 186 (336) T ss_pred C-CceeecccceeeeeEEEEEeeeeeCHHHHHHHHHhCCCcHHHHHHHHHHHHHHhhCcEEEEeccccceEEEEeCCCCc Confidence 5 5999999999999999999999999999999999999999999999999999999999999999998 8999999998 Q ss_pred eeeccCCccccCcccccCCHHHHHHHHHHHHHHHHhccCCc--eecceEEEeCHHHHHHHhcccccCCCCCcchHHHHHH Q lcl|NC_019522. 153 VEAATSTFVALVAAIPTNGTQPIIDFFGNAYNTVYLDNTLT--VHRPNTFVLPPAQFQLLARTLLSTQNASNVTLLQFLR 230 (311) Q Consensus 153 ~~~~~~~~~~~~t~w~~~t~~ei~~di~~~~~~~~~~~~~~--~~~p~~l~lpp~~~~~L~~~~~~~~~~~~~Tvl~~l~ 230 (311) +..+++ +.+|.++|++||++||++++++|+.++++. .+.|++|+|||+++.+|+++ +.+++|+++||| T Consensus 187 a~~t~~-----t~~~~~~t~eei~~Di~~~~~~l~~qs~G~i~~~~~~tL~LP~~~~~~Ls~~-----n~~g~Tvl~~lk 256 (336) T protein:vir:10 187 APITAT-----TPWSGSPAVEAVVNEVVALFQVLQTQSQGIITQEDVLRMGLPPTAMSDLSKT-----NQYGLAAAAKLK 256 (336) T ss_pred cccccC-----CCcccccCHHHHHHHHHHHHHHHHHhcCCeecccCcceEEecHHHHHhccCC-----CccCccHHHHHH Confidence 765443 335889999999999999999999998764 36799999999999999864 467899999999 Q ss_pred HhCCceEEEEchhcccCCCCcccEEEEEEcCcceeEEeecchhhhccceeeCCceEEEeeeeeeeeEEEECCeEEEEeec Q lcl|NC_019522. 231 TNFPDITFEDDILLKGAGVAGADRMAVYKKEIRIVKGHDVMPLRFLAPATADNVNFKVPAILRTGGTEWRIPKAGHYVDG 310 (311) Q Consensus 231 ~n~~~l~i~~~~~l~~ag~~g~~~~v~y~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~gGv~i~~P~ai~~~dG 310 (311) +||||++|+++|||+++|+.....|+-+..+++..++.+|++|++| |+|+++++|++||++|||||+||||++|+|++| T Consensus 257 ~n~Pnl~i~t~pEl~~a~G~~~~l~~~~~~~~~t~~~~~p~~~~~l-~vq~~~~~~~v~~~~rt~Gv~i~~P~ai~~~~G 335 (336) T protein:vir:10 257 DIFPKLEFVTIPEYDTASGRLVQLWAPRVEGKDTATCGFTEKMRAH-SIERYSSYFRQKKSAGTWGAVIFRPFAVAQMIG 335 (336) T ss_pred HhcCccEEEEccccccCCCceEEEEEEecCCCcceeeecchhhhcc-ceeecCceeEeccccceeeeeeeccchheeeec Confidence 9999999999999999886555555556677888999999999998 579999999999999999999999999999999 Q ss_pred C Q lcl|NC_019522. 311 V 311 (311) Q Consensus 311 I 311 (311) | T Consensus 336 I 336 (336) T protein:vir:10 336 V 336 (336) T ss_pred C Confidence 9 No 13 >protein:vir:3643 Length: 336 # NCBI annotation: gp12 # Family: family:all:1653 # MgeID: mge:75 # MgeName: Bcep781 # Cross-refs: genbank:acc:NP_705638;genbank:gi:23752323;genbank:GeneID:955719 Probab=100.00 E-value=1.6e-83 Score=474.54 Aligned_cols=296 Identities=16% Similarity=0.115 Sum_probs=264.7 Q ss_pred CCccccccc------chhhh-hhhHHHHHHHHHHHHhhhhhhhhhhhhccccCCCCcceeEEEEEEeecccceEEecCcc Q lcl|NC_019522. 1 MAKSVFDVS------PVSAL-SFLVNQAAHIESEIYRIEYPQFKYGTLLPLDNSAPDWAQAVMFRSIDARGELQLFGPNS 73 (311) Q Consensus 1 ~~~~~~~~~------~~~~~-~fl~~~L~~id~~v~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~G~a~~~~~~a 73 (311) |++-++|+. .++++ .||+ ++|||++||++++++++.+|+|+.+.++|.+++++|.++|.+|++++|||++ T Consensus 31 ~~~da~d~~~~~~~~~~~~~~~~l~---~~i~p~~~~~~~~~~~~~~l~pv~t~g~W~~~~~~~~~~e~~G~a~~ygd~~ 107 (336) T protein:vir:36 31 YAMDAADLSPHLSSTGSSGIPNYLT---TYVDPSVIDILVAPMKAAELVGESKKGDWTTLVAAFITAEPTTKVATYGDYS 107 (336) T ss_pred hhhhhhhccCccccCCCcchHHHHH---HhhccceEeeecchhhhhhhccccccCCccceeEEEeeeeceeeEEEeeccC Confidence 333333333 23443 5666 7999999999999999999999999999999999999999999999999986 Q ss_pred cccceeeeeccceeEEEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhhheeeeeccccc-eeeeecCCcc Q lcl|NC_019522. 74 TDVPTVDIAMSQGFKDINTAALGYTYSIEEIGFAMLNNVNLDAERGQAVRDVVEQGLNKIYLLGDKGVG-EGLYTSPNVS 152 (311) Q Consensus 74 ~dip~v~~~~~~~~~~v~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aa~~~~~~~~n~~~~~G~~~~g-~GllN~p~v~ 152 (311) | +|++|++.+++++++++++.+|+|+++|+++|+++|++|.++|+.+||+++++++|+++|+|+++++ |||||||+++ T Consensus 108 D-~P~~d~~~~~~~~~v~~~~~g~~yg~~E~~~Aa~~~~~l~~~Ka~aA~~ale~~~N~i~~~Gd~~~~~yGllNdP~l~ 186 (336) T protein:vir:36 108 S-DGDSGANINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASELNYSSALGLAKFLNGSYLFGVAGLENYGLINDPSLS 186 (336) T ss_pred C-CceeecccceeeeeEEEEEeeeeeCHHHHHHHHHhCCCcHHHHHHHHHHHHHHhhCcEEEEeccccceEEEEecCCCc Confidence 5 4999999999999999999999999999999999999999999999999999999999999999998 8999999998 Q ss_pred eeeccCCccccCcccccCCHHHHHHHHHHHHHHHHhccCCc--eecceEEEeCHHHHHHHhcccccCCCCCcchHHHHHH Q lcl|NC_019522. 153 VEAATSTFVALVAAIPTNGTQPIIDFFGNAYNTVYLDNTLT--VHRPNTFVLPPAQFQLLARTLLSTQNASNVTLLQFLR 230 (311) Q Consensus 153 ~~~~~~~~~~~~t~w~~~t~~ei~~di~~~~~~~~~~~~~~--~~~p~~l~lpp~~~~~L~~~~~~~~~~~~~Tvl~~l~ 230 (311) +..+++ +.+|.++|++||++||++++++++.++++. .+.|++|+|||+++.+|+++ +.+++|+++||| T Consensus 187 a~~t~~-----t~~~~~~t~~ei~~Di~~~~~~l~~qt~G~i~~~~~~tL~LP~~~~~~Ls~~-----n~~g~Tvl~~lk 256 (336) T protein:vir:36 187 APITAT-----TPWSGSPAVEAVVNEVVALFQVLQTQSQGIITQEDVLRMGLPPTAMSDLSKT-----NQYGLAAAAKLK 256 (336) T ss_pred cccccC-----CCcccccCHHHHHHHHHHHHHHHHHhcCCeeeeccccEEEechHHHHhccCC-----CccCccHHHHHH Confidence 765443 335889999999999999999999998774 47899999999999999864 467899999999 Q ss_pred HhCCceEEEEchhcccCCCCcccEEEEEEcCcceeEEeecchhhhccceeeCCceEEEeeeeeeeeEEEECCeEEEEeec Q lcl|NC_019522. 231 TNFPDITFEDDILLKGAGVAGADRMAVYKKEIRIVKGHDVMPLRFLAPATADNVNFKVPAILRTGGTEWRIPKAGHYVDG 310 (311) Q Consensus 231 ~n~~~l~i~~~~~l~~ag~~g~~~~v~y~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~gGv~i~~P~ai~~~dG 310 (311) +||||++|+++|||+++|+.....|+-+..+++..++.+|++|++| |+|+++++|++||++|||||+||||++|+|++| T Consensus 257 ~n~Pnl~i~t~pEl~~a~g~~~~l~~~~~~~~~t~~~~~p~~~~~l-~vq~~~~~~~v~~~~rt~Gv~i~~P~ai~~~~G 335 (336) T protein:vir:36 257 DIFPKLEFVTIPEYDTASGRLVQLWAPRVEGKDTATCGFTEKMRAH-SIERYSSYFRQKKSAGTWGAVIFRPFAVAQMIG 335 (336) T ss_pred HhcCccEEEEccccccCCCceEEEEEEecCCCcceeeecchhhhcc-ceeecCceeEeccccceeeeeeeccchheeeec Confidence 9999999999999999876555555555677888999999999998 579999999999999999999999999999999 Q ss_pred C Q lcl|NC_019522. 311 V 311 (311) Q Consensus 311 I 311 (311) | T Consensus 336 I 336 (336) T protein:vir:36 336 V 336 (336) T ss_pred C Confidence 9 No 14 >protein:vir:96079 Length: 382 # NCBI annotation: hypothetical protein ORF023 # Family: family:all:1653 # MgeID: mge:1597 # MgeName: F8 # Cross-refs: genbank:acc:YP_001294440;genbank:gi:149408337;genbank:GeneID:5237198 Probab=100.00 E-value=3.2e-80 Score=456.39 Aligned_cols=300 Identities=14% Similarity=0.124 Sum_probs=262.8 Q ss_pred CCccccccc-----chhhhhhhHHHHHHHHHHHHhhhhhhhhhhhhccccCCCCcceeEEEEEEeecccceEEecCcccc Q lcl|NC_019522. 1 MAKSVFDVS-----PVSALSFLVNQAAHIESEIYRIEYPQFKYGTLLPLDNSAPDWAQAVMFRSIDARGELQLFGPNSTD 75 (311) Q Consensus 1 ~~~~~~~~~-----~~~~~~fl~~~L~~id~~v~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~G~a~~~~~~a~d 75 (311) ..+.+||++ +..+.+.+..+|++|||++|+++++++++++|||+.++++|.+++++|.++|.+|+|++|||++|+ T Consensus 58 ~~~~amDa~~~~~~t~~~~g~p~~~l~~~~p~~~~~~~~p~~~~~l~pv~t~g~W~~~t~ty~~~e~~G~A~~ygd~~D~ 137 (382) T protein:vir:96 58 RSGSAMDSNFTAPVTTPSIPTPIQFLQTWLPGFVKVMTAARKIDEIIGIDTVGSWEDQEIVQGIVEPAGTAVEYGDHTNI 137 (382) T ss_pred hhhcccccccCCccccCCccHHHHHHhhhhhhhhhhhhhhhhhhhhccccccCCccceEEEEeeeecccceEEeecccCC Confidence 344689976 223445678899999999999999999999999999998888899999999999999999998665 Q ss_pred cceeeeeccceeEEEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhhheeeeeccc---cc-eeeeecCCc Q lcl|NC_019522. 76 VPTVDIAMSQGFKDINTAALGYTYSIEEIGFAMLNNVNLDAERGQAVRDVVEQGLNKIYLLGDKG---VG-EGLYTSPNV 151 (311) Q Consensus 76 ip~v~~~~~~~~~~v~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aa~~~~~~~~n~~~~~G~~~---~g-~GllN~p~v 151 (311) |++|++++++++++++++.+|+|+.+|+++|+++|++|+++|+.+||+++++++|+++|+|+.+ ++ |||||||++ T Consensus 138 -Pl~d~~~~~~~r~v~~~~~g~~yg~lE~~rAa~~~~~l~~~Ka~aA~~ale~~~N~i~f~G~~~g~~~~~yGllNdP~l 216 (382) T protein:vir:96 138 -PLTSWNANFERRTIVRGELGLLVGTLEEGRASAIRLNSAETKRQQAAIGLEIFRNAIGFYGWQSGLGNRTYGFLNDPNL 216 (382) T ss_pred -CccccccceeEEEEEEEEEeeeecHHHHHHHHhhCCCcHHHHHHHHHHHHHHhhceEEEEeeecCcCcceEEEEeCCCc Confidence 9999999999999999999999999999999999999999999999999999999999999743 45 899999999 Q ss_pred ceeeccCCccccCcccccCCHHHHHHHHHHHHHHHHhccCCcee---cceEEEeCHHHHHHHhcccccCCCCCcchHHHH Q lcl|NC_019522. 152 SVEAATSTFVALVAAIPTNGTQPIIDFFGNAYNTVYLDNTLTVH---RPNTFVLPPAQFQLLARTLLSTQNASNVTLLQF 228 (311) Q Consensus 152 ~~~~~~~~~~~~~t~w~~~t~~ei~~di~~~~~~~~~~~~~~~~---~p~~l~lpp~~~~~L~~~~~~~~~~~~~Tvl~~ 228 (311) ++..+++ ++.|++||++||++||++++++++.++++.+. .|.+|+|||+.+.+|+++ +.+++|+++| T Consensus 217 ~a~~t~a-----~~~Wa~kT~~eI~~Di~~l~~~i~~qt~G~~~~~~~~~~L~LP~~~~~~Ls~~-----n~~g~Tvl~~ 286 (382) T protein:vir:96 217 PPFQTPP-----SQGWATADWAGIIGDIREAVRQLRIQSQDQIDPKAEKITMALATSKVDYLSVT-----TPYGISVSDW 286 (382) T ss_pred ccccccC-----CCCcccccHHHHHHHHHHHHHHHHhccCCeeeecccceEEeechHHHhhcccc-----CccCccHHHH Confidence 8665443 24599999999999999999999999866432 366899999999999763 5678999999 Q ss_pred HHHhCCceEEEEchhcccCCC---CcccEEEEEEcCcc---eeEEeecchhhhc-------cceeeCCceEEEeeeeeee Q lcl|NC_019522. 229 LRTNFPDITFEDDILLKGAGV---AGADRMAVYKKEIR---IVKGHDVMPLRFL-------APATADNVNFKVPAILRTG 295 (311) Q Consensus 229 l~~n~~~l~i~~~~~l~~ag~---~g~~~~v~y~~~~~---~~~~~~~~~~~~~-------~p~~~~~~~~~~~~~~~~g 295 (311) |++||||++|+++|||++++. ++.+++++|.++.+ +.+...|++|++. .|+|++.++|++||++||| T Consensus 287 lk~n~Pnl~i~t~peL~~a~~~g~g~~~~~~~~~~e~~~~~~~s~~~p~~f~q~~p~~~~~l~ve~~~~~~~~~~s~~t~ 366 (382) T protein:vir:96 287 IEQTYPKMRIVSAPELSGVQMQGKTPEDALVLFVEEVDASVDGSTDGGSVFSQLVQSKFITLGVEKRAKSYVEDFSNGTA 366 (382) T ss_pred HHHhcCCcEEEEccccccccCCCccceeEEEEecchhhhhcccccccCcceeccccceeeeccceeecceeEecccccee Confidence 999999999999999987654 35678888888766 4667777776532 2578899999999999999 Q ss_pred eEEEECCeEEEEeecC Q lcl|NC_019522. 296 GTEWRIPKAGHYVDGV 311 (311) Q Consensus 296 Gv~i~~P~ai~~~dGI 311 (311) ||+||||++|+|++|| T Consensus 367 Gv~i~~P~ai~~~~GI 382 (382) T protein:vir:96 367 GALCKRPWAVVRYLGI 382 (382) T ss_pred eeEEEcchhhhhccCC Confidence 9999999999999999 No 15 >protein:vir:105778 Length: 358 # NCBI annotation: gp9 # Family: family:all:10995 # MgeID: mge:1501 # MgeName: ES18 # Cross-refs: genbank:acc:YP_224147;genbank:gi:62362222;genbank:GeneID:3342531 Probab=99.90 E-value=9.3e-27 Score=163.29 Aligned_cols=299 Identities=14% Similarity=0.157 Sum_probs=228.1 Q ss_pred CCcccc----------cccc-hhhhhhhHHHHHHHHHHHHhhhhhh---hhhhhhccccCCCCcceeEEEEEEeec-ccc Q lcl|NC_019522. 1 MAKSVF----------DVSP-VSALSFLVNQAAHIESEIYRIEYPQ---FKYGTLLPLDNSAPDWAQAVMFRSIDA-RGE 65 (311) Q Consensus 1 ~~~~~~----------~~~~-~~~~~fl~~~L~~id~~v~~~~~~~---~~~~~~~~v~~~~~~~~~~~~~~~~~~-~G~ 65 (311) ++..+| +..+ ++...|-...+..+|.++++.-+++ --.-+|+++.+.++.+.....|.+... .|+ T Consensus 30 ~~~~a~maan~a~~~~~~~~~NAv~~v~~D~wr~~D~~~~q~fr~e~~~~l~NDLm~ls~sv~Igktv~~y~~~gd~~~~ 109 (358) T protein:vir:10 30 AQHDAMIAANRSNMTPEWLAVNAVGGFTRDFWAEIDRQVLQLRDQEVGMEIVNDLIGVQTVLPVGKTAKLYNVIGDIADD 109 (358) T ss_pred hhhhhHHhhhHHHhhhhhheecccccCCHHHHHHHhhhhhhhcccchhHHHHhhhhhccccccHHHHHHHHhhhcCCCce Confidence 222111 1111 1112233345678999998877764 345568999999999999999988766 887 Q ss_pred eEE-ecCcc-cccceeeeeccceeEEEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhhheeeeecccc-- Q lcl|NC_019522. 66 LQL-FGPNS-TDVPTVDIAMSQGFKDINTAALGYTYSIEEIGFAMLNNVNLDAERGQAVRDVVEQGLNKIYLLGDKGV-- 141 (311) Q Consensus 66 a~~-~~~~a-~dip~v~~~~~~~~~~v~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aa~~~~~~~~n~~~~~G~~~~-- 141 (311) +.. +++.+ ..+..+. ++++..||+.+..||+++|||++..+-.|+++.++-+....+++.+++-+++|+|+.+. T Consensus 110 v~~SmsGQ~~~~lD~~~--y~~dGtpiPIfdsg~~f~WR~~~~~~~~g~d~~~daQ~~~~~kv~~~~vdy~lNG~~~I~v 187 (358) T protein:vir:10 110 VSVSIDGQAPFSFDHTE--YASDGDPIPVFTAGYGVNWRHAAGLNSLGIDLVLDSQMAKMRKFNQKRVNYYLNGDPNIQV 187 (358) T ss_pred EEEEecccCccccccee--eeccCCEeeeeccCccccccchhhcCccccchhHHHHHHHHHHHHHHHHhhhhccCCceee Confidence 764 44443 3444444 45666677777899999999999999999999999999999999999999999998873 Q ss_pred ----ceeeeecCCcceeeccCCccccCcccccCCHHHHHHHH-HHHHHHHHhccCCceecceEEEeCHHHHHHHhccccc Q lcl|NC_019522. 142 ----GEGLYTSPNVSVEAATSTFVALVAAIPTNGTQPIIDFF-GNAYNTVYLDNTLTVHRPNTFVLPPAQFQLLARTLLS 216 (311) Q Consensus 142 ----g~GllN~p~v~~~~~~~~~~~~~t~w~~~t~~ei~~di-~~~~~~~~~~~~~~~~~p~~l~lpp~~~~~L~~~~~~ 216 (311) +|||-|||++...+..+...+.+-+++++|+++++..+ .+++.++-.. +....-.+++++|+.++.|.++|+. T Consensus 188 ~g~t~~Glrn~~n~~qv~l~~~s~g~NiDlttat~~a~~~~f~~~l~~~~~~~--N~~~~~~~~~vs~ei~~n~~r~Y~~ 265 (358) T protein:vir:10 188 QSYPAQGIKNHRNTKKINLGSGSGGANIDLTTADMTALFAFFGKGAFGTLARA--NKVAQYDVMWVSPEIWANLAQPYVV 265 (358) T ss_pred cCcccccccCCcceeEEEeccCCCcceeeeccCCHHHHHHHHHHHHHHHHHhh--cccceeeEEEEcHHHHhhhhccccc Confidence 36999999999888887666667789999999999998 6678888543 3455668999999999999999985 Q ss_pred CCCCCcchHHHHHHHhCCceEEEEchhcccCCCCcccEEEEEEcCcceeEEeecchh-hhccceeeCCceEEEeeeeeee Q lcl|NC_019522. 217 TQNASNVTLLQFLRTNFPDITFEDDILLKGAGVAGADRMAVYKKEIRIVKGHDVMPL-RFLAPATADNVNFKVPAILRTG 295 (311) Q Consensus 217 ~~~~~~~Tvl~~l~~n~~~l~i~~~~~l~~ag~~g~~~~v~y~~~~~~~~~~~~~~~-~~~~p~~~~~~~~~~~~~~~~g 295 (311) . +.-..|||+++++--+--+|++.+.|+| +.+++|++..+++...+.||+ ++..|....+..|....++.. T Consensus 266 ~-~~~~gTIl~~vl~~~~va~I~~~~~Lsg------Neii~~~~~~~vi~plvG~~~gt~~~pR~~p~ddY~f~vwsA~- 337 (358) T protein:vir:10 266 N-GVVSGNVLNAVLPFAPVREIRQTFALSG------NEFIAYVRRQDIISPLVGMAVGVVPLPRPLPNVNYNFQIMSAE- 337 (358) T ss_pred c-cccchhhHHHhhcccCcccccccccCCC------ccEEEEEeCCceeeeeecceeeeecCCCCCCCcchhhhhhhhh- Confidence 3 4557899999998555457999999987 889999999999999999998 555566655566777777775 Q ss_pred eEEEECCe----EEEEeecC Q lcl|NC_019522. 296 GTEWRIPK----AGHYVDGV 311 (311) Q Consensus 296 Gv~i~~P~----ai~~~dGI 311 (311) |++||.-. .+.|..-+ T Consensus 338 glqik~D~~Gks~Vv~~~~~ 357 (358) T protein:vir:10 338 GLQITADDQGLSGVVYGANL 357 (358) T ss_pred ceeeeeccccceeeEeeccc Confidence 68887753 34444444 No 16 >protein:vir:1433 Length: 435 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:30 # MgeName: phiE125 # Cross-refs: genbank:acc:NP_536362;genbank:gi:17975167;genbank:GeneID:929171 Probab=99.10 E-value=1.8e-11 Score=79.52 Aligned_cols=285 Identities=13% Similarity=0.073 Sum_probs=165.8 Q ss_pred CCcccccccchhhhhhhHHHHHHHHHHHHhhhhhhhhhhhhccccCCCCcceeEEEEEEeecccceEEecCcccccceee Q lcl|NC_019522. 1 MAKSVFDVSPVSALSFLVNQAAHIESEIYRIEYPQFKYGTLLPLDNSAPDWAQAVMFRSIDARGELQLFGPNSTDVPTVD 80 (311) Q Consensus 1 ~~~~~~~~~~~~~~~fl~~~L~~id~~v~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~G~a~~~~~~a~dip~v~ 80 (311) --..++.+.+...+++++. +.+...|++..++....+.+.. ...+.....+.+.+.+..+.+.|++.. ..+|..+ T Consensus 127 ~~~~~~~~~t~~~gg~~vP--~~~~~~ii~~l~~~~~i~~~~~--~~~~~~~~~~~~p~~~~~~~a~~v~E~-~~~~~~~ 201 (435) T protein:vir:14 127 EVAMSLNTLSPGAGGVLVP--ENLSSEVIELLRPKSVVRKLGA--RTLPLSNGNITIPRLKGGAIVGYIGAD-TDIPTTQ 201 (435) T ss_pred hhhhhcccCCcCCCccccc--hhHHHHHHHHHhhhchhhhhcc--eeeecCCCceEEEEEeCCcceeeeccC-ccccccc Confidence 0112233333344456665 4566778887766555555422 122222334667777777788888765 5678888 Q ss_pred eeccceeEEEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhhheeeeeccccc--eeeeecCCcceeeccC Q lcl|NC_019522. 81 IAMSQGFKDINTAALGYTYSIEEIGFAMLNNVNLDAERGQAVRDVVEQGLNKIYLLGDKGVG--EGLYTSPNVSVEAATS 158 (311) Q Consensus 81 ~~~~~~~~~v~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aa~~~~~~~~n~~~~~G~~~~g--~GllN~p~v~~~~~~~ 158 (311) ..+...+...+.++..+.+|. |+..-...+.+|...-.....+++.+.+|+.+++|+...+ .|+++........... T Consensus 202 ~~f~~i~~~~~k~~~~~~iS~-ell~ds~~~~~l~~~i~~~l~~ai~~~~d~a~l~G~G~~~~p~Gi~~~~~~~~~~~~~ 280 (435) T protein:vir:14 202 QQFDDLKLTAKKMAALVPIAN-DLIKYAGVNPNVDQIVVGDLTAAIGAREDKAFIRDDGTANTPKGLRFWALPSNVITAS 280 (435) T ss_pred cceeEEEeeeEEEEEeehhhH-HHHHhhccCHHHHHHHHHHHHHHHHHHHHHHhhccCCCCccccceeecccccceeccc Confidence 888999999999999998884 4433222234587888889999999999999999986543 5998876654443332 Q ss_pred CccccCcccccCCHHHHHHHHHHHHHHHHhccCCceecceEEEeCHHHHHHHhcccccCCCCCcchHHHHHHHhC--Cce Q lcl|NC_019522. 159 TFVALVAAIPTNGTQPIIDFFGNAYNTVYLDNTLTVHRPNTFVLPPAQFQLLARTLLSTQNASNVTLLQFLRTNF--PDI 236 (311) Q Consensus 159 ~~~~~~t~w~~~t~~ei~~di~~~~~~~~~~~~~~~~~p~~l~lpp~~~~~L~~~~~~~~~~~~~Tvl~~l~~n~--~~l 236 (311) + ..|.+.+..|+.+++..+.....+ ..+..++|+|..+..|.+.. +..|.-++.-+. .+ ..+ T Consensus 281 ~---------~~~~~~~~~~~~~l~~~~~~~~~~--~~~~~~v~n~~~~~~L~~lk----d~~G~~l~~~~~-~g~l~G~ 344 (435) T protein:vir:14 281 D---------ASTLQKIETDLGKVILALENADAN--LTQPGWIMAPRTFRFLEGLR----DGNGNKVYPELA-NGMLKGY 344 (435) T ss_pred c---------ccchhhHHHHHHHHHHHhhhcccc--ccCCEEEEcHHHHHHHHHhh----ccCCceeccCCC-CCeeecc Confidence 2 235667788999998888654322 23457899999999986533 111222211000 00 011 Q ss_pred EEEEchhccc-CCCCcccEEEEEEcCcceeEEeecchhhhcc-c--------------eeeCCceEEEeeeeeeeeEEEE Q lcl|NC_019522. 237 TFEDDILLKG-AGVAGADRMAVYKKEIRIVKGHDVMPLRFLA-P--------------ATADNVNFKVPAILRTGGTEWR 300 (311) Q Consensus 237 ~i~~~~~l~~-ag~~g~~~~v~y~~~~~~~~~~~~~~~~~~~-p--------------~~~~~~~~~~~~~~~~gGv~i~ 300 (311) -++.+..+.. .+.+++...++|.+=.+++ +..-.++++.. + .+. + ...+.+..|++ ..++ T Consensus 345 Pv~~~~~~p~~~~~~~~~~~i~~gd~s~~~-i~~~~~~~~~~~~~~~~~~~~~~~~~~f~~-~-~~~~r~~~r~d-~~~~ 420 (435) T protein:vir:14 345 PVGKTTQVPINLGETGKESEIYFTDFGDVF-IGEEETLEIDYSKEATYKDADGHMVSAFQR-D-QTLIRVIAKND-FGPR 420 (435) T ss_pred eeEeeccccccccCCCccceEEEeecccEE-EEEecccEEEEeccccccccccchhhhhhc-C-hhheeeeeeeC-ceee Confidence 2333333321 2223333334443333332 22222222210 0 111 1 13456677874 5899 Q ss_pred CCeEEEEeecC Q lcl|NC_019522. 301 IPKAGHYVDGV 311 (311) Q Consensus 301 ~P~ai~~~dGI 311 (311) +|.|+++++|+ T Consensus 421 ~~~a~~~l~~~ 431 (435) T protein:vir:14 421 HVESIAVLAGV 431 (435) T ss_pred cccceEEEecC Confidence 99999999999 No 17 >protein:vir:80376 Length: 435 # NCBI annotation: gp6, major capsid head protein # Family: family:all:21 # MgeID: mge:1881 # MgeName: phi644-2 # Cross-refs: genbank:acc:YP_001111085;genbank:gi:134288639;genbank:GeneID:4960624 Probab=99.09 E-value=2.7e-11 Score=78.55 Aligned_cols=286 Identities=13% Similarity=0.079 Sum_probs=164.6 Q ss_pred CCcccccccchhhhhhhHHHHHHHHHHHHhhhhhhhhhhhhccccCCCCcceeEEEEEEeecccceEEecCcccccceee Q lcl|NC_019522. 1 MAKSVFDVSPVSALSFLVNQAAHIESEIYRIEYPQFKYGTLLPLDNSAPDWAQAVMFRSIDARGELQLFGPNSTDVPTVD 80 (311) Q Consensus 1 ~~~~~~~~~~~~~~~fl~~~L~~id~~v~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~G~a~~~~~~a~dip~v~ 80 (311) -....+.+.+...+++++. +.+...|++...+....+.+-. +..+.....+.+.+.+..+.+.|++.+ ..+|..+ T Consensus 127 ~~~~~~~~~~~~~gg~lvP--~~~~~~ii~~l~~~~~i~~~~~--~~v~~~~~~~~~p~~~~~~~a~~v~E~-~~~~~~~ 201 (435) T protein:vir:80 127 EVAMSLNTLSPGAGGVLVP--ENLSSEVIELLRPKSVVRKLGA--RTLPLSNGNITIPRLKGGAIVGYIGAD-TDIPTTQ 201 (435) T ss_pred hhhhhhcccCCCCCccccc--hhHHHHHHHHHhhhchhhhccc--eeeecCCCceEEEEEeCCcceeeeccC-ccccccc Confidence 0001122222233455554 3456678776665545555421 122222334667777777788888765 5689889 Q ss_pred eeccceeEEEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhhheeeeeccccc--eeeeecCCcceeeccC Q lcl|NC_019522. 81 IAMSQGFKDINTAALGYTYSIEEIGFAMLNNVNLDAERGQAVRDVVEQGLNKIYLLGDKGVG--EGLYTSPNVSVEAATS 158 (311) Q Consensus 81 ~~~~~~~~~v~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aa~~~~~~~~n~~~~~G~~~~g--~GllN~p~v~~~~~~~ 158 (311) ..++......+.++..+.+|.+ +..-...+.+|..--......++.+.+++.+++|+...+ .|++++..+....... T Consensus 202 ~~f~~i~~~~~k~~~~~~is~e-ll~ds~~~~~l~~~i~~~l~~a~~~~~d~a~l~G~G~~~~p~Gi~~~~~~~~~~~~~ 280 (435) T protein:vir:80 202 QQFDDLKLTAKKMAALVPIAND-LIKYAGVNPNVDQIVVGDLTAAIGAREDKAFIRDDGTANTPKGLRFWALPGNVITAS 280 (435) T ss_pred cceeeEEEeeEEEEEeehhhHH-HHHhhcccHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCcccceeecccccceeecc Confidence 9999999999999999998844 433333355688888999999999999999999986543 5999887654433332 Q ss_pred CccccCcccccCCHHHHHHHHHHHHHHHHhccCCceecceEEEeCHHHHHHHhcccccCCCCCcchHHHHHHHh-CCceE Q lcl|NC_019522. 159 TFVALVAAIPTNGTQPIIDFFGNAYNTVYLDNTLTVHRPNTFVLPPAQFQLLARTLLSTQNASNVTLLQFLRTN-FPDIT 237 (311) Q Consensus 159 ~~~~~~t~w~~~t~~ei~~di~~~~~~~~~~~~~~~~~p~~l~lpp~~~~~L~~~~~~~~~~~~~Tvl~~l~~n-~~~l~ 237 (311) . ..+.+.+..|+.+++..+.....+ ..+..++|+|..+..|.+.. +..|.-++.-+..+ ...+- T Consensus 281 ~---------~~~~~~~~~d~~~~~~~~~~~~~~--~~~~~~vmn~~~~~~L~~lk----d~~G~~l~~~~~~~~l~G~p 345 (435) T protein:vir:80 281 D---------GSTLQKIETDLGKAILALENADAN--LTQPGWIMAPRTFRFLEGLR----DGNGNKVYPELANGMLKGYP 345 (435) T ss_pred c---------ccchhhHHHHHHHHHHHhhccccc--cccCEEEEcHHHHHHHHhhh----ccCCceeccCCCCCeEeeee Confidence 2 235667778898888887543222 23567899999999986633 11122222101000 00112 Q ss_pred EEEchhccc-CCCCcccEEEEEEcCcceeEEeecchhhhcc-c--------------eeeCCceEEEeeeeeeeeEEEEC Q lcl|NC_019522. 238 FEDDILLKG-AGVAGADRMAVYKKEIRIVKGHDVMPLRFLA-P--------------ATADNVNFKVPAILRTGGTEWRI 301 (311) Q Consensus 238 i~~~~~l~~-ag~~g~~~~v~y~~~~~~~~~~~~~~~~~~~-p--------------~~~~~~~~~~~~~~~~gGv~i~~ 301 (311) ++....+.. .+.++....++|.+=.+++ +..-.++++.. + .+. + ...+.+..|+ ++.+++ T Consensus 346 v~~~~~~p~~~~~~~~~~~i~~gd~s~~~-i~~~~~~~i~~~~~~~~~~~~~~~~~~f~~-n-~~~~r~~~r~-d~~~~~ 421 (435) T protein:vir:80 346 VGKTTQVPINLGEAGKESEIYFTDFGDVF-IGEEETLEIDYSKEATYKDADGHMVSAFQR-D-QTLIRVIAKN-DFGPRH 421 (435) T ss_pred eEEeccccccccCCCCcceEEEEEcccEE-EEeecceEEEEeccccccccccchhhhhhc-C-cceeeeeeee-CcEeec Confidence 333333321 1223333233333222222 11111111110 0 111 2 2455677777 688999 Q ss_pred CeEEEEeecC Q lcl|NC_019522. 302 PKAGHYVDGV 311 (311) Q Consensus 302 P~ai~~~dGI 311 (311) |.||+++.|+ T Consensus 422 ~~a~~~l~~~ 431 (435) T protein:vir:80 422 VESIAVLSGV 431 (435) T ss_pred ccceEEEecc Confidence 9999999999 No 18 >protein:vir:7771 Length: 330 # NCBI annotation: gp17 # Family: family:all:507 # MgeID: mge:149 # MgeName: Bxz2 # Cross-refs: genbank:acc:NP_817605;genbank:gi:29566035;genbank:GeneID:1259229 Probab=99.01 E-value=1.8e-10 Score=74.01 Aligned_cols=283 Identities=11% Similarity=-0.008 Sum_probs=171.3 Q ss_pred CCccccccc----chhhhhhhHHHHHHHHHHHHhhhhhhhhhhhhccccCCCCcceeEEEEEEeecccceEEecCccccc Q lcl|NC_019522. 1 MAKSVFDVS----PVSALSFLVNQAAHIESEIYRIEYPQFKYGTLLPLDNSAPDWAQAVMFRSIDARGELQLFGPNSTDV 76 (311) Q Consensus 1 ~~~~~~~~~----~~~~~~fl~~~L~~id~~v~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~G~a~~~~~~a~di 76 (311) |++..+.+. +.++.+++..++ . .++++..++....++++++.. .....+.|.+.+..+.+.|++.+ ..+ T Consensus 1 m~~~~~~a~~~~~t~~~g~~i~~~~--~-~~ii~~~~~~s~l~~~~~~~~---~~~~~~~~p~~~~~~~a~~v~Eg-~~~ 73 (330) T protein:vir:77 1 MAGSTVPSTQVALTGDFSAFLTPEQ--S-QDYFAEIEKTSIVQRIARKVP---MGPTGISIPHWTGAVSASWTGEA-ERK 73 (330) T ss_pred CcccccchhhccccCCCcceechhH--H-HHHHHHHHhccchhhhcceee---ccCCceEEEEEcCCcceeEecCC-Ccc Confidence 998877755 344555666532 3 457777777777777776533 22334667777777888898764 678 Q ss_pred ceeeeeccceeEEEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhhheeeeeccccc--eeeeecCCccee Q lcl|NC_019522. 77 PTVDIAMSQGFKDINTAALGYTYSIEEIGFAMLNNVNLDAERGQAVRDVVEQGLNKIYLLGDKGVG--EGLYTSPNVSVE 154 (311) Q Consensus 77 p~v~~~~~~~~~~v~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aa~~~~~~~~n~~~~~G~~~~g--~GllN~p~v~~~ 154 (311) |..+..+++.....+.++.-..++.+=|+ ....++...-.....+++++.+++.+|+|+...+ .|++++...... T Consensus 74 ~~~~~~f~~i~~~~~k~~~~~~is~ell~---ds~~~~~~~i~~~l~~ai~~~~~~~~l~G~g~~~~~~g~~~~~~~~~~ 150 (330) T protein:vir:77 74 PITKGSFGKQELEPVKITTIFAESAEVVR---LNPLNYLNTMRTKIAEAIALKFDAAAIHGIDKPSAFKGYLAETTKVVS 150 (330) T ss_pred ccccceeeEEEEeEEEEEEeehhhHHHHh---cchHHHHHHHHHHHHHHHHHHHHHHhhcccCCCCccccccccccccce Confidence 98888899999999999999998864333 3456788999999999999999999999987654 499988653322 Q ss_pred eccCCccccCcccccCCHHHHHHHHHHHHHHHHhccCCceecceEEEeCHHHHHHHhcccccCCCCCcchHHHH-HHHh- Q lcl|NC_019522. 155 AATSTFVALVAAIPTNGTQPIIDFFGNAYNTVYLDNTLTVHRPNTFVLPPAQFQLLARTLLSTQNASNVTLLQF-LRTN- 232 (311) Q Consensus 155 ~~~~~~~~~~t~w~~~t~~ei~~di~~~~~~~~~~~~~~~~~p~~l~lpp~~~~~L~~~~~~~~~~~~~Tvl~~-l~~n- 232 (311) ....+.. -...+....++||.+++..+... + ..+..++|+++.+..|.+-. . ..+.-++.- +... T Consensus 151 ~~~~~~~-----~~~~~~~~~~~~l~~~~~~~~~~--~--~~~~~~vmn~~~~~~l~~lk-d---~~G~~l~~~~~~~~~ 217 (330) T protein:vir:77 151 LADTNLT-----TASGPQGNAYLAVNNALSLLVNS--G--KKWTGTLLDNVTEPILNTAV-D---GNGRPLFVESTYTEQ 217 (330) T ss_pred eeccccc-----ccccccchhHHHHHHHHHhhhhc--C--CCccEEEEcHHHHHHHHHHh-c---cCCceeecCcccccc Confidence 2221111 11234555688899988888543 2 23567999999999886522 1 111111110 0000 Q ss_pred ---CC-----ceEEEEchhcccCCCCcccEEEEEEcCcceeEEeecchhhhcc----------------------ceeeC Q lcl|NC_019522. 233 ---FP-----DITFEDDILLKGAGVAGADRMAVYKKEIRIVKGHDVMPLRFLA----------------------PATAD 282 (311) Q Consensus 233 ---~~-----~l~i~~~~~l~~ag~~g~~~~v~y~~~~~~~~~~~~~~~~~~~----------------------p~~~~ 282 (311) .. .+-++....+.. +..+....+++.+-.+.+ +.....++... ..+ + T Consensus 218 ~~~~~~~~l~G~PV~~~~~~p~-~~~~~~~~~~~gd~s~~~-i~~~~~~~i~~~~e~~~~~~~~~~~~~~~~~~~~f~-~ 294 (330) T protein:vir:77 218 VGAIREGRILGRPTYVADNVVN-GTVGNRVVGVMGDFSQVI-WGQIGGLSFDVTDQATLDFGEEQGGVWVPKLISLWQ-H 294 (330) T ss_pred ccccCCceecceeeEEeccccC-CCCCCccEEEEEecceEE-EEEecCcEEEEeecceeeecccccccccccccchhh-c Confidence 01 122333333332 223323333332222221 11111111100 011 1 Q ss_pred CceEEEeeeeeeeeEEEECCeEEEEeecC Q lcl|NC_019522. 283 NVNFKVPAILRTGGTEWRIPKAGHYVDGV 311 (311) Q Consensus 283 ~~~~~~~~~~~~gGv~i~~P~ai~~~dGI 311 (311) + ...+.++.|++ +.+++|.|++.+.+. T Consensus 295 ~-~~~~r~~~r~d-~~v~~~~a~~~i~~~ 321 (330) T protein:vir:77 295 N-MVAVRCEAEFA-FMVNDKDAFVKLTDQ 321 (330) T ss_pred C-cEEEEEEEEec-cEEecccceEEEEec Confidence 2 26677888884 667889999999999 No 19 >protein:vir:8420 Length: 477 # NCBI annotation: gp15 # Family: family:all:21 # MgeID: mge:155 # MgeName: Omega # Cross-refs: genbank:acc:NP_818316;genbank:gi:29566752;genbank:GeneID:1260033 Probab=98.96 E-value=1.4e-10 Score=74.67 Aligned_cols=295 Identities=15% Similarity=0.121 Sum_probs=157.2 Q ss_pred CCcccccccchhhhhhhHHHHHHHHHHHHhhhhhhhhhhhhccccCCCCcceeEEEEEEeecccc-eEEecCcc----cc Q lcl|NC_019522. 1 MAKSVFDVSPVSALSFLVNQAAHIESEIYRIEYPQFKYGTLLPLDNSAPDWAQAVMFRSIDARGE-LQLFGPNS----TD 75 (311) Q Consensus 1 ~~~~~~~~~~~~~~~fl~~~L~~id~~v~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~G~-a~~~~~~a----~d 75 (311) .....+.+ +.+.+++++.. +.+...|++...+....++++.... .+-....+.+...+..+. +.|.+.++ .. T Consensus 152 ~~~~~~~~-~~~~gg~lv~~-~~~~~~ii~~l~~~~~i~~~~~~~~-~~~~~~~~~ip~~~~~~~~a~~~~Eg~~~~~~~ 228 (477) T protein:vir:84 152 EEYRDLDR-NGGTGGYAVPP-LWMMNRFIELARAGRTYANLCPTEP-LPGGTSSINIPKILTGTSTAIQAADNAALTAPS 228 (477) T ss_pred hhhccccc-cCCCcceeecc-chhHHHHHHHhhhcchHHHhhceee-ecCCcceeEEEEEecCcceeeeeccCccccccc Confidence 11122222 22233344321 2344567777777666677666432 233334455555544333 34555542 35 Q ss_pred cceeeeeccceeEEEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhhheeeeeccccc--eeeeecCCcce Q lcl|NC_019522. 76 VPTVDIAMSQGFKDINTAALGYTYSIEEIGFAMLNNVNLDAERGQAVRDVVEQGLNKIYLLGDKGVG--EGLYTSPNVSV 153 (311) Q Consensus 76 ip~v~~~~~~~~~~v~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aa~~~~~~~~n~~~~~G~~~~g--~GllN~p~v~~ 153 (311) .|..+..++....+.+.++.-+.+|.+ + ...+..++.+--....+.++...+|.-+++|+...+ .|++|.+++.. T Consensus 229 ~~~s~~~f~~i~~~~~k~~~~~~iS~e-l--l~ds~~~l~~~i~~~l~~~~~~~~d~~~l~G~Gt~~~p~Gi~~~~~~~~ 305 (477) T protein:vir:84 229 AHEVDLTDGFVQANVKTIAGQQGIAIQ-L--LDQAAVSVDEFVFRDLAADYANKLNVQVISGTGSNNQVVGVRATAGITQ 305 (477) T ss_pred ccccccceeeEEEeeeeEEeeeHHHHH-H--HhccchhHHHHHHHHHHHHHHHHHHHHHhccCCCCCccceeeecccccc Confidence 677788888899999999888888744 2 234456889999999999999999999999987544 59999998765 Q ss_pred eeccCCccccCcccccCCHHHHHHHHHHHHHHHHhccCCceecceEEEeCHHHHHHHhcccccCCCC-------CcchHH Q lcl|NC_019522. 154 EAATSTFVALVAAIPTNGTQPIIDFFGNAYNTVYLDNTLTVHRPNTFVLPPAQFQLLARTLLSTQNA-------SNVTLL 226 (311) Q Consensus 154 ~~~~~~~~~~~t~w~~~t~~ei~~di~~~~~~~~~~~~~~~~~p~~l~lpp~~~~~L~~~~~~~~~~-------~~~Tvl 226 (311) .+.... .+.|. ..+.++.+|.+++..+. ......+...+|.|+.+..|.+-. ..++. .+.+-. T Consensus 306 ~~~~~~----~~t~~--~~~~~~~~i~~~~~~~~---~~~~~~~~~~v~~~~~~~~l~~lk-d~~G~~l~~~~~~~~~~~ 375 (477) T protein:vir:84 306 VTATSA----GSALE--KHQIIYQKIADAIQRVH---TSRFLEPEVIVMHPRRWASFHAIF-AGDDRPLIVPSGPGFNNL 375 (477) T ss_pred cccccc----ccchh--hHHHHHHHHHHHHhhcc---ccccCCccEEEEcHHHHHHHHHhh-ccCCCeeeecCccccccc Confidence 544321 11222 23445666666666553 222223456888999988875522 11110 000000 Q ss_pred HHHHH---hCC-----ceEEEEchhcc-cCCCCcccEEEEEEcCcceeEEeecchhhhccceeeCCceEEEeeeeeeeeE Q lcl|NC_019522. 227 QFLRT---NFP-----DITFEDDILLK-GAGVAGADRMAVYKKEIRIVKGHDVMPLRFLAPATADNVNFKVPAILRTGGT 297 (311) Q Consensus 227 ~~l~~---n~~-----~l~i~~~~~l~-~ag~~g~~~~v~y~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~gGv 297 (311) .++.. +.+ ...++..+.+. +.|.++....++|-+-.+.+-..-.+.+....-.........+......... T Consensus 376 ~~~~~~~~~~~~~~l~G~pVv~s~~~p~~~~~~~d~~~i~~gd~~~~~i~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~ 455 (477) T protein:vir:84 376 GVLTEVASQRVVGQMHGLPVVTDPTLPTTLGTGTDQDVIHVLRASDLALFESSVRMRALQETRAENLSVLLQVYGYLAFT 455 (477) T ss_pred ccccccccccccchhcccceEecCcccccccccCCcceEEEEEeceEEEEeeceeEEeccccccccceeeeeehhhhhhh Confidence 00000 011 12333344442 2333333334555554554433323333222111111222222222223335 Q ss_pred EEECCeEEEEeecC Q lcl|NC_019522. 298 EWRIPKAGHYVDGV 311 (311) Q Consensus 298 ~i~~P~ai~~~dGI 311 (311) -+|+|.|++.++|. T Consensus 456 ~~r~~~afv~~t~~ 469 (477) T protein:vir:84 456 AARFPQSVVEIGGT 469 (477) T ss_pred hhccccceEEeecc Confidence 67889999999999 No 20 >protein:vir:5739 Length: 366 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:122 # MgeName: PY54 # Cross-refs: genbank:acc:NP_892050;genbank:gi:33770513;interpro:IPR006444;uniprot:Q7Y410;genbank:GeneID:1732928 Probab=98.94 E-value=4.1e-10 Score=72.06 Aligned_cols=285 Identities=10% Similarity=0.029 Sum_probs=158.1 Q ss_pred CCcccccccchhhhhhhHHHHHHHHHHHHhhhhhhhhhhhhccccCCCCcceeEEEEEEeecccceEEecCcccccceee Q lcl|NC_019522. 1 MAKSVFDVSPVSALSFLVNQAAHIESEIYRIEYPQFKYGTLLPLDNSAPDWAQAVMFRSIDARGELQLFGPNSTDVPTVD 80 (311) Q Consensus 1 ~~~~~~~~~~~~~~~fl~~~L~~id~~v~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~G~a~~~~~~a~dip~v~ 80 (311) -...++.+.+. .+++++. +.+..+|++..++....+.+ +. +..+.....+.+.+.+..+.+.|++.. .++|..+ T Consensus 61 ~~~~a~~~~~~-~Gg~lvP--~~~~~~ii~~l~~~s~l~~l-g~-~~v~~~~g~~~~p~~t~~~~a~wv~E~-~~~~~s~ 134 (366) T protein:vir:57 61 GLSMAISTAAG-SGGALIP--QNMQNEVIELLRDRTVVRIL-GA-RSIPLPNGNLSMPRLSGGATAGYVGEG-KDVVATG 134 (366) T ss_pred hhhhhcccccc-CCccccc--hhHHHHHHHHHhhhcchhhh-ce-eeeecCCCceEEEEEeCCcceeeeccC-ccccccc Confidence 01122333332 3456654 34567788876665444444 11 112222334666777777788888765 6689989 Q ss_pred eeccceeEEEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhhheeeeecccc-c-eeeeecCCcceeeccC Q lcl|NC_019522. 81 IAMSQGFKDINTAALGYTYSIEEIGFAMLNNVNLDAERGQAVRDVVEQGLNKIYLLGDKGV-G-EGLYTSPNVSVEAATS 158 (311) Q Consensus 81 ~~~~~~~~~v~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aa~~~~~~~~n~~~~~G~~~~-g-~GllN~p~v~~~~~~~ 158 (311) ..+++...+.+.++....+| +|+.. ....++..--.....+++.+.+|+-+++|+..- . .|++|..+........ T Consensus 135 ~~f~~i~~~~~k~~~~~~iS-~ell~--ds~~~~~~~i~~~l~~a~~~~~d~a~l~G~G~~~~p~Gi~~~~~~~~~~~~~ 211 (366) T protein:vir:57 135 ATFDDVKLSAKTMIALVPVS-NQLIG--RAGFNVEQLLLGDILSAIATREDKAFLRDDGTGDTPKGMKAVATAANRLVAW 211 (366) T ss_pred cceeEEEEeeEEEEEeehhh-HHHHh--hhhHHHHHHHHHHHHHHHHHHHHHHhhccCCCCccccceeeccccccceeec Confidence 99999999999999999888 44432 345678888899999999999999999998643 3 6999988764433332 Q ss_pred CccccCcccccCCHHHHHHHHHHHHHHHHhccCCceecceEEEeCHHHHHHHhcccccCCCCCcchHHHHHHHh-CCceE Q lcl|NC_019522. 159 TFVALVAAIPTNGTQPIIDFFGNAYNTVYLDNTLTVHRPNTFVLPPAQFQLLARTLLSTQNASNVTLLQFLRTN-FPDIT 237 (311) Q Consensus 159 ~~~~~~t~w~~~t~~ei~~di~~~~~~~~~~~~~~~~~p~~l~lpp~~~~~L~~~~~~~~~~~~~Tvl~~l~~n-~~~l~ 237 (311) +++ ..+...+..++..+.........+ ......+|+|..+..|.+.. +..|..++.-+... .-..- T Consensus 212 ~~t-------~~~~~~~~~~~~~~~~~~~~~~~~--~~~a~~vmn~~~~~~L~~lk----d~~G~~l~~~~~~g~l~G~P 278 (366) T protein:vir:57 212 TGT-------AINLTTIDEYLDSLILKHMDSNSN--MIRCGWGLSNRTYMTLFGLR----DGNGNKVYPEMSQGILKGYP 278 (366) T ss_pred ccc-------ccchhhHHHHHHHHHHhhhccccc--cccCEEEecHHHHHHHHhhh----ccCCceeccCCCCCeeccee Confidence 221 223334444444433332211111 12346889999999987643 22233332111110 00122 Q ss_pred EEEchhccc-CCCCcccEEEEEEcCcceeEEeecchhhhcc---------------ceeeCCceEEEeeeeeeeeEEEEC Q lcl|NC_019522. 238 FEDDILLKG-AGVAGADRMAVYKKEIRIVKGHDVMPLRFLA---------------PATADNVNFKVPAILRTGGTEWRI 301 (311) Q Consensus 238 i~~~~~l~~-ag~~g~~~~v~y~~~~~~~~~~~~~~~~~~~---------------p~~~~~~~~~~~~~~~~gGv~i~~ 301 (311) ++.+..+.. .+..+....++|.+=.+++ +..-..++... -.|. + ...+.+..++ ++.+++ T Consensus 279 vv~s~~ip~~~~~~~~~~~i~~gdfs~~~-i~~~~~i~i~~~~ea~~~~~~g~~~~~f~~-~-~~~iR~~~~~-d~~v~~ 354 (366) T protein:vir:57 279 IQRTSAIPANLGDDGNESEIYFCDFNDVV-IGEDGMMKVDFSTEATYKDADGQLVSAFAR-N-QSLIRVVTEH-DIGFRH 354 (366) T ss_pred eEEccccccccccCCCccEEEEEecceEE-EEEecceEEEEeeccccccccccchhhhhc-C-ceeEEeeeee-CcEeec Confidence 444444432 2222222233343322222 21112211100 0111 1 2456677777 577899 Q ss_pred CeEEEEeecC Q lcl|NC_019522. 302 PKAGHYVDGV 311 (311) Q Consensus 302 P~ai~~~dGI 311 (311) |.++++++|| T Consensus 355 ~~a~~~lt~~ 364 (366) T protein:vir:57 355 PEGLVLGTGV 364 (366) T ss_pred cccEEEEecc Confidence 9999999999 No 21 >protein:vir:108211 Length: 318 # NCBI annotation: gp9 # Family: family:all:6420 # MgeID: mge:2004 # MgeName: Giles # Cross-refs: genbank:acc:YP_001552338;genbank:gi:160700658;genbank:GeneID:5758931 Probab=98.91 E-value=1.5e-10 Score=74.46 Aligned_cols=277 Identities=17% Similarity=0.127 Sum_probs=161.3 Q ss_pred CCcccccccchhhhhhhHHHH----HHHHHHHHhhhhhhhhhhhhccccCCCCcceeEEEEEEeecc---cceEEecCcc Q lcl|NC_019522. 1 MAKSVFDVSPVSALSFLVNQA----AHIESEIYRIEYPQFKYGTLLPLDNSAPDWAQAVMFRSIDAR---GELQLFGPNS 73 (311) Q Consensus 1 ~~~~~~~~~~~~~~~fl~~~L----~~id~~v~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~---G~a~~~~~~a 73 (311) |..|+=-+.+..+...-.++| +.|..+|.+...+.+-+..||--. +.....++.|...... |.+..+.-++ T Consensus 1 ~~~~~~i~s~~~~~~itv~~ll~~P~~I~~~i~e~~~~~~iad~lf~~~--~a~~~~~v~f~~~~p~~~~~d~e~VaEgg 78 (318) T protein:vir:10 1 MTAPTGIVSVSDGPAITVRELVGNPLWIPTALKKMMVNQFISESLFRNG--GANPNGVVAYNEGNPSFLEDDVADVAEFG 78 (318) T ss_pred CCCCCcceeeecCCceehHHhhCCchhHHHHHHHHHhccchhhhhhhcc--cccccceeEEEecccccccCcHhhccCcc Confidence 777654444444444334443 467777777777777777777632 2334556777654443 5666565554 Q ss_pred cccceeeeeccceeE-EEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhhheeeeeccccceeeeecCCcc Q lcl|NC_019522. 74 TDVPTVDIAMSQGFK-DINTAALGYTYSIEEIGFAMLNNVNLDAERGQAVRDVVEQGLNKIYLLGDKGVGEGLYTSPNVS 152 (311) Q Consensus 74 ~dip~v~~~~~~~~~-~v~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aa~~~~~~~~n~~~~~G~~~~g~GllN~p~v~ 152 (311) + +|.++...+..+. .+..++.++++|.+.+. ..+.+.-.+...++++.+.++.|+.++ ..|.+++++ T Consensus 79 E-iP~~~~~~G~~~ia~~~K~G~~~~vS~Em~~---~n~~~~v~r~~~~l~Nti~r~~d~~a~--------dal~sa~t~ 146 (318) T protein:vir:10 79 E-IPVSAGARGLPRTAFAVKKALGVRVSKEMID---ENRVGAVNDQMLQLRNTFIRANDRSAK--------ALLQSPIVP 146 (318) T ss_pred c-ccccCCCCCchhhhhhehhccceeccHHHHh---hcChhHHHHHHHHHHHHHHHHHHHHHH--------HHHhccccc Confidence 4 7888877755555 55799999999965433 345667788888888888888777644 346777766 Q ss_pred eeeccCCccccCccc-ccCCHHHHH----HHHHHHHHHHHhccCCceecceEEEeCHHHHHHHhcccccCCCCCcchHHH Q lcl|NC_019522. 153 VEAATSTFVALVAAI-PTNGTQPII----DFFGNAYNTVYLDNTLTVHRPNTFVLPPAQFQLLARTLLSTQNASNVTLLQ 227 (311) Q Consensus 153 ~~~~~~~~~~~~t~w-~~~t~~ei~----~di~~~~~~~~~~~~~~~~~p~~l~lpp~~~~~L~~~~~~~~~~~~~Tvl~ 227 (311) ...+++.|.+++..- ....+.|.+ .|++.+...-... .++ +.|++|+|+|..+..|.+- ..+.+ T Consensus 147 ~~~~s~~w~~~~~~~~d~~~A~e~v~~a~~~~~~a~~~~~~~-~~G-Y~pdtIVlhP~~~~~l~~n---------~~~~~ 215 (318) T protein:vir:10 147 TLAVPTAWDNGGKVRTDIAIAIEQISTAAPTAYPAGVGSSDE-YFG-FIPDTIVMHYALLPILMDN---------ENFMK 215 (318) T ss_pred cccCCcCCCCcccccccchhhhhhhhhhhhhhhhhhhhhhhh-ccC-ccceeeEECHHHHHHHhcc---------hhhhh Confidence 665555444322110 000111111 1222211111111 122 5799999999999998541 12222 Q ss_pred HHH-------------HhCC----ceEEEEchhcccCCCCcccEEEEEEcCcceeEE-eecchhhhcccee-------eC Q lcl|NC_019522. 228 FLR-------------TNFP----DITFEDDILLKGAGVAGADRMAVYKKEIRIVKG-HDVMPLRFLAPAT-------AD 282 (311) Q Consensus 228 ~l~-------------~n~~----~l~i~~~~~l~~ag~~g~~~~v~y~~~~~~~~~-~~~~~~~~~~p~~-------~~ 282 (311) ++. .++| .++++..|-+.. ++..+.. ..++.+ ..++|++...-.+ -. T Consensus 216 ~y~~~a~~~~~~~~~tg~~~g~~lGl~vi~s~~~p~------~~alvlq--~g~vG~~~d~~pl~~t~~~~egg~~~g~~ 287 (318) T protein:vir:10 216 VYERNANYVSTAPDWTGNFPGSVMGLNVIRSRTFPI------DRVLIME--RGTVGFYSDTRPLQFTALYPEGNGPNGGP 287 (318) T ss_pred hhhccchhhhhcccccccccceeeceEEeecCccCC------CeeEEEe--cCCcceeeccccceeeecccCCCCCCCCc Confidence 222 1222 367777777753 3344433 344544 3456665442111 14 Q ss_pred CceEEEeeeeeeeeEEEECCeEEEEeecC Q lcl|NC_019522. 283 NVNFKVPAILRTGGTEWRIPKAGHYVDGV 311 (311) Q Consensus 283 ~~~~~~~~~~~~gGv~i~~P~ai~~~dGI 311 (311) +.+|...+...+ ..-|.+|+|++.++|| T Consensus 288 ~~s~~~~~~~~~-~~~V~~PkA~~~itgi 315 (318) T protein:vir:10 288 TESYRADASHKR-ALAVDQPKAALWLTGI 315 (318) T ss_pred chhhheehheee-eeeeeCcceeEEEeec Confidence 556888877666 5889999999999999 No 22 >protein:vir:9574 Length: 300 # NCBI annotation: gp40 # Family: family:all:966 # MgeID: mge:171 # MgeName: SM1 # Cross-refs: genbank:acc:NP_862879;genbank:gi:32469471;genbank:GeneID:1461316 Probab=98.90 E-value=5e-10 Score=71.56 Aligned_cols=275 Identities=10% Similarity=-0.024 Sum_probs=159.6 Q ss_pred CCcccccccchhhhhhhHHHHHHHHHHHHhhhhhhhhhhhhccccCCCCcceeEEEEEEeecccceEEecCcccccceee Q lcl|NC_019522. 1 MAKSVFDVSPVSALSFLVNQAAHIESEIYRIEYPQFKYGTLLPLDNSAPDWAQAVMFRSIDARGELQLFGPNSTDVPTVD 80 (311) Q Consensus 1 ~~~~~~~~~~~~~~~fl~~~L~~id~~v~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~G~a~~~~~~a~dip~v~ 80 (311) || .++.+++.++- +.+.+.|++...+.-..+++.++.. .+. ....+.+.+..+.|.|++.. ..+|..+ T Consensus 1 ma-----~~t~~~G~lip---~~~~~~ii~~l~~~s~i~~l~~~~~-~~~--~~~~~p~~~~~~~a~wv~Eg-~~~~~s~ 68 (300) T protein:vir:95 1 MS-----EAQLSKGNLFN---PELVTKVINKVKGHSSIAKLSPQKP-IPF--NGQREFVFDFDSDIDIVAEN-GKKTHGG 68 (300) T ss_pred Cc-----ccccCCcceec---hhhHHHHHHHHHhhhhhhhhcceee-ccC--CceEEEEEecCcceEEeeCC-ccccccc Confidence 33 33333443333 3456778887777777777776543 221 23566677777888999865 6789999 Q ss_pred eeccceeEEEEEEEEEEEecHHHHHHH-HHhCCChHHHHHHHHHHHHHHhhhheeeeeccc-cc-----eeeeecCCcce Q lcl|NC_019522. 81 IAMSQGFKDINTAALGYTYSIEEIGFA-MLNNVNLDAERGQAVRDVVEQGLNKIYLLGDKG-VG-----EGLYTSPNVSV 153 (311) Q Consensus 81 ~~~~~~~~~v~~~~~~~~~~~~El~~a-~~~g~~l~~~k~~aa~~~~~~~~n~~~~~G~~~-~g-----~GllN~p~v~~ 153 (311) ..+++...+.+.++....+|. ||.+. .-...+|...-....++++++.+++.+|+|+.. .| .|..+.++... T Consensus 69 ~~f~~v~l~~~k~~~~~~iS~-ell~~~~d~~~~l~~~i~~~l~~aia~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~ 147 (300) T protein:vir:95 69 VSLDPVTIVPLKVEYGARVSD-EFLHASEEAKVDMLTDFVEGFSKKLARGLDIMSIHGINPRTKQASTIIGDNCFDKKVT 147 (300) T ss_pred ccceeeEeeeEEEEEeehhhH-HHhccCCCCHHHHHHHHHHHHHHHHHHHHHHhhhhcccCCCCCCcccccccccccccc Confidence 999999999999999999984 45432 234577888889999999999999999999532 11 35555554433 Q ss_pred eeccCCccccCcccccCCHHHHHHHHHHHHHHHHhccCCceecceEEEeCHHHHHHHhcccccCCCCCcchHHHHHHHhC Q lcl|NC_019522. 154 EAATSTFVALVAAIPTNGTQPIIDFFGNAYNTVYLDNTLTVHRPNTFVLPPAQFQLLARTLLSTQNASNVTLLQFLRTNF 233 (311) Q Consensus 154 ~~~~~~~~~~~t~w~~~t~~ei~~di~~~~~~~~~~~~~~~~~p~~l~lpp~~~~~L~~~~~~~~~~~~~Tvl~~l~~n~ 233 (311) .++..+ ....+++|.+++..+... . ..|..++|+|+.+..|.+-. +..|..++.-..... T Consensus 148 ~~~~~~------------~~~~~~~i~~~~~~~~~~-~---~~~~~~vmn~~~~~~L~~lk----d~~G~~i~~~~~~~~ 207 (300) T protein:vir:95 148 QTVPFK------------DTNPDESMEDAVGMIDGS-E---RDITGAILDPIFTTALSKMK----NAEGGKLYPELAWGG 207 (300) T ss_pred eeeccc------------ccchHHHHHHHHHHhhhc-C---CCccEEEECHHHHHHHHHhh----ccCCCeeccCccccC Confidence 322221 111256788888877432 2 23568999999999986532 222332321111111 Q ss_pred C-----ceEEEEchhcccCCCCcccEEEEEEcC-------cceeEEeecchhhhc-cc---eeeCCceEEEeeeeeeeeE Q lcl|NC_019522. 234 P-----DITFEDDILLKGAGVAGADRMAVYKKE-------IRIVKGHDVMPLRFL-AP---ATADNVNFKVPAILRTGGT 297 (311) Q Consensus 234 ~-----~l~i~~~~~l~~ag~~g~~~~v~y~~~-------~~~~~~~~~~~~~~~-~p---~~~~~~~~~~~~~~~~gGv 297 (311) . .+.++....+.......+..+++-+-+ .+.+++.+...-..- .+ .+.. ..-+.++.|+ |+ T Consensus 208 ~~~~l~G~Pv~~s~~v~~~~~~~~~~~~~GDf~~~~~~~~~~~~~~~v~~~~~~d~~~~~~f~~~--~v~~r~~~r~-d~ 284 (300) T protein:vir:95 208 VPDAINGLAVDKNRTVSYSQTDPKNTAIVGDFETMFKWGYAKEVPMEIIKYGDPDNSGRDLKGYN--QIYIRCEAYI-GW 284 (300) T ss_pred CCceecceeeEEecCCCCCCCCCccEEEEeeccceEEEEEecccEEEEeeccCCCCcchhhhhcC--cEEEEEEEee-cc Confidence 1 123333333333222333333322211 112222221100000 00 1111 1455667777 67 Q ss_pred EEECCeEEEEeecC Q lcl|NC_019522. 298 EWRIPKAGHYVDGV 311 (311) Q Consensus 298 ~i~~P~ai~~~dGI 311 (311) .+++|.+++.+.|. T Consensus 285 ~v~~~~a~~~l~~~ 298 (300) T protein:vir:95 285 GIMDAASFARIVKT 298 (300) T ss_pred eeecccceEEEecC Confidence 88999999999999 No 23 >protein:vir:2504 Length: 305 # NCBI annotation: major capsid subunit gp9 # Family: family:all:507 # MgeID: mge:53 # MgeName: TM4 # Cross-refs: genbank:acc:NP_569745;genbank:gi:18496895;genbank:GeneID:932268 Probab=98.81 E-value=2.6e-09 Score=67.64 Aligned_cols=278 Identities=10% Similarity=0.000 Sum_probs=155.2 Q ss_pred ccccchhhhhhhHHHHHHHHHHHHhhhhhhhhhhhhccccCCCCcceeEEEEEEeecccceEEecCccc----ccceeee Q lcl|NC_019522. 6 FDVSPVSALSFLVNQAAHIESEIYRIEYPQFKYGTLLPLDNSAPDWAQAVMFRSIDARGELQLFGPNST----DVPTVDI 81 (311) Q Consensus 6 ~~~~~~~~~~fl~~~L~~id~~v~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~G~a~~~~~~a~----dip~v~~ 81 (311) |-..+.++++.++. +.+.+.|++...+.-..+++..+.+- ...+..+.+....+.+.|++.++. ++|..+. T Consensus 1 ma~~t~~~gg~liP--~~~~~~Ii~~~~~~s~l~~l~~~~~~---~~~~~~~p~~~~~~~a~wv~E~~~~~~~~~~~s~~ 75 (305) T protein:vir:25 1 MADISRAEVASLIQ--EAYSDTLLAAAKQGSTVLSAFQNVNM---GTKTTHLPVLATLPEADWVGESATDPKGVKPTSKV 75 (305) T ss_pred CCCccCCccceecC--HHHHHHHHHHHHhhchhhhhcceeec---cCCcEEEEEEeCCcceEEeeccccccccccccccc Confidence 44444444555554 44667888888887777777765432 233466666777778888876543 4777788 Q ss_pred eccceeEEEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhhheeeeeccccceeeeecCCcceeeccCCcc Q lcl|NC_019522. 82 AMSQGFKDINTAALGYTYSIEEIGFAMLNNVNLDAERGQAVRDVVEQGLNKIYLLGDKGVGEGLYTSPNVSVEAATSTFV 161 (311) Q Consensus 82 ~~~~~~~~v~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aa~~~~~~~~n~~~~~G~~~~g~GllN~p~v~~~~~~~~~~ 161 (311) .+++.....+.++....++. |+. .....++...-.....+++++.+++.+|+|+.. +.|+.+...++....... T Consensus 76 ~f~~i~~~~~k~~~~~~is~-ell--~ds~~~~~~~i~~~l~~~~a~~~d~a~~~G~g~-~~~~~~~~~~~~~~~~~~-- 149 (305) T protein:vir:25 76 TWANRTLVAEEIAVIIPVHE-NVI--DDATVAVLTEVAELGGQAIGKKLDQAVIFGTDK-PASWVSPALIPAAVTAGQ-- 149 (305) T ss_pred ceeeEEeeeEEEEEeehhhH-HHH--hcchHHHHHHHHHHHHHHHHHHHhhhheeccCC-CCCccccccccccccccc-- Confidence 88888999999999999985 443 234567889999999999999999999999864 233333322222211111 Q ss_pred ccCcccccCCHHHHHHHHHHHHHHHHhccCCceecceEEEeCHHHHHHHhcccccCCCCCcchHHHHHHHhCCceEEEEc Q lcl|NC_019522. 162 ALVAAIPTNGTQPIIDFFGNAYNTVYLDNTLTVHRPNTFVLPPAQFQLLARTLLSTQNASNVTLLQFLRTNFPDITFEDD 241 (311) Q Consensus 162 ~~~t~w~~~t~~ei~~di~~~~~~~~~~~~~~~~~p~~l~lpp~~~~~L~~~~~~~~~~~~~Tvl~~l~~n~~~l~i~~~ 241 (311) ...+.-...+.++++.++..+...+.. . ...++.++++|..+..|.+.. +..+.-++. -.....+.+.-. T Consensus 150 ~~~~~~~~~~~~~~~~~~~~~~~~~~~-~---~~~~~~~v~~~~~~~~l~~lk----d~~G~~i~~--~~~l~G~Pv~~~ 219 (305) T protein:vir:25 150 AVEVVGGVANESDIVGATNRAAKAVAS-A---GWAPDTLLSSLALRYEVANIR----DANGNPVFR--DDSFAGFRTFFN 219 (305) T ss_pred cccccccchhhhHHHHHHHHHHHhhhh-c---ccccceeEecHHHHHHHHHhh----ccCCceeec--CCcccccceEEc Confidence 111111222345567777776666532 2 234567999999999986532 222222210 000000111111 Q ss_pred hhcccCCCCcccEEEEEEcCcceeEEeecchhhh--------ccc------eeeCCceEEEeeeeeeeeEEEECCeEEEE Q lcl|NC_019522. 242 ILLKGAGVAGADRMAVYKKEIRIVKGHDVMPLRF--------LAP------ATADNVNFKVPAILRTGGTEWRIPKAGHY 307 (311) Q Consensus 242 ~~l~~ag~~g~~~~v~y~~~~~~~~~~~~~~~~~--------~~p------~~~~~~~~~~~~~~~~gGv~i~~P~ai~~ 307 (311) ..+.. ..++.. +++. +...+.+.....++. ... .|. + .+.+.++.|+ |+.+.+|.+++. T Consensus 220 ~~~~~--~~~~~~-~~~g-d~s~~~i~~~~~~~i~~~~~~~~~~~~~~~~~~~~-~-~~~~R~~~r~-~~~v~~p~a~v~ 292 (305) T protein:vir:25 220 RNGAW--DADAAI-EVIA-DSSRVKIGVRQDITVKFLDQATLGTGENQINLAER-D-MVALRLKARF-AYVLGVSATAQG 292 (305) T ss_pred CccCC--CCCccE-EEEE-ecceEEEEEecCeEEEEeeeeeeecCCceeeeeec-C-cEEEEEEEee-cceeeCcccEEE Confidence 11111 111111 2221 112221111111111 000 121 1 2456677888 467999999999 Q ss_pred eecC Q lcl|NC_019522. 308 VDGV 311 (311) Q Consensus 308 ~dGI 311 (311) ++|+ T Consensus 293 ~~~~ 296 (305) T protein:vir:25 293 ANKT 296 (305) T ss_pred Eccc Confidence 9999 No 24 >protein:vir:94142 Length: 304 # NCBI annotation: ORF013 # Family: family:all:507 # MgeID: mge:1494 # MgeName: 96 # Cross-refs: genbank:acc:YP_240234;genbank:gi:66395898;genbank:GeneID:5133311 Probab=98.77 E-value=2.7e-09 Score=67.60 Aligned_cols=276 Identities=11% Similarity=0.016 Sum_probs=160.6 Q ss_pred CCcccccccc---hhhhhhhHHHHHHHHHHHHhhhhhhhhhhhhccccCCCCcceeEEEEEEeecccceEEecCcccccc Q lcl|NC_019522. 1 MAKSVFDVSP---VSALSFLVNQAAHIESEIYRIEYPQFKYGTLLPLDNSAPDWAQAVMFRSIDARGELQLFGPNSTDVP 77 (311) Q Consensus 1 ~~~~~~~~~~---~~~~~fl~~~L~~id~~v~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~G~a~~~~~~a~dip 77 (311) ||....++.. .+.++++.. +.+.+.|++...+....++++.+.. .......+.+.+..+.++|++.. ..+| T Consensus 1 ma~~~~~~~~~~~t~~gg~lip--~~~~~~ii~~~~~~~~l~~~~~~~~---~~~~~~~ip~~~~~~~a~~v~E~-~~~~ 74 (304) T protein:vir:94 1 MATPTYTPGNVILSDFKNGVIP--AEQGTLIMKDIMANSAIMKLAKNEP---MTAQKKKFTYLAKGVGAYWVSET-ERIQ 74 (304) T ss_pred CcccccccccccccCCCceecc--hhHHHHHHHHHHhccchhhhcceee---ccCCceEEEEEeCCcceEEeecC-cccc Confidence 9988877552 223345554 3456778887777777777665543 23344567777777788898765 4588 Q ss_pred eeeeeccceeEEEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhhheeeeeccccc-eeeeecCCcceeec Q lcl|NC_019522. 78 TVDIAMSQGFKDINTAALGYTYSIEEIGFAMLNNVNLDAERGQAVRDVVEQGLNKIYLLGDKGVG-EGLYTSPNVSVEAA 156 (311) Q Consensus 78 ~v~~~~~~~~~~v~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aa~~~~~~~~n~~~~~G~~~~g-~GllN~p~v~~~~~ 156 (311) ..+..++......+.++..+.++.+ +. .....+|...-.....+++++.+++.+++|+...+ .|.+....+..... T Consensus 75 ~~~~~~~~i~~~~~k~~~~~~iS~e-ll--~ds~~~l~~~i~~~l~~~ia~~~d~~~l~G~g~~~~~~~~~~~~~~~~~~ 151 (304) T protein:vir:94 75 TSKPEYAQAEMEAKKIGVIIPLSKE-FL--KWTAKDFFNEVKPLIAEAFYKAFDQAVIFGTKSPYNTSTSGKPLVEGAEE 151 (304) T ss_pred cccceeeEEEEEEEEEEEeehhhHH-HH--hcchHHHHHHHHHHHHHHHHHHHHhhheeccCCCcccccccccccccccc Confidence 8888999999999999999999854 33 23457788888999999999999999999987643 34443333322211 Q ss_pred cCCccccCcccccCCHHHHHHHHHHHHHHHHhccCCceecceEEEeCHHHHHHHhcccccCCCCCcchHHHHHHHhC--- Q lcl|NC_019522. 157 TSTFVALVAAIPTNGTQPIIDFFGNAYNTVYLDNTLTVHRPNTFVLPPAQFQLLARTLLSTQNASNVTLLQFLRTNF--- 233 (311) Q Consensus 157 ~~~~~~~~t~w~~~t~~ei~~di~~~~~~~~~~~~~~~~~p~~l~lpp~~~~~L~~~~~~~~~~~~~Tvl~~l~~n~--- 233 (311) ... ...+....++||.+++.++... + ..+.+++|.|+.+..|.+.. . ..+.-++ ..+. T Consensus 152 ~~~--------~~~~~~~~~~~i~~~~~~l~~~--~--~~~~~~v~~~~~~~~L~~lk-d---~~G~~l~---~~~~~~l 212 (304) T protein:vir:94 152 KGN--------VVTDTNNLYVDLSALMATIEDE--E--LDPNGVLTTRSFRSKMRNAL-D---ANDRPLF---DANGNEI 212 (304) T ss_pred ccc--------ccccccchHHHHHHHHHHhhhc--c--CCcCEEEEcHHHHHHHHHhh-c---cCCcEee---cCCCccc Confidence 111 0112233478888888888532 2 23568999999999996532 1 1122111 1110 Q ss_pred CceEEEEchhcccCCCCcccEEEEEEcCcceeEEeecchhhh----------ccc----------eeeCCceEEEeeeee Q lcl|NC_019522. 234 PDITFEDDILLKGAGVAGADRMAVYKKEIRIVKGHDVMPLRF----------LAP----------ATADNVNFKVPAILR 293 (311) Q Consensus 234 ~~l~i~~~~~l~~ag~~g~~~~v~y~~~~~~~~~~~~~~~~~----------~~p----------~~~~~~~~~~~~~~~ 293 (311) -.+.++..+.+..... +-.+++.+ .+++-+..-.+++. ... .+. + ...+.++.| T Consensus 213 ~G~PV~~~~~~~~~~~---~~~~~~gd-~~~~~~~~~~~~~i~~~~e~~~~~~~~~~~~g~~~~~f~~-~-~~~~r~~~r 286 (304) T protein:vir:94 213 MGLPLSYTGADVYDKK---KSLALMGD-WDYARYGILQGIEYAISEDATLTTLQASDASGQPVSLFER-D-MFALRATMH 286 (304) T ss_pred cceeeEEecccccCCC---CcEEEEEe-hhhEEEEEecceEEEEeecceeeeecccccCccchhhhhc-C-cEEEEEEEE Confidence 0122333333322111 11122221 11111111111110 000 011 1 144566788 Q ss_pred eeeEEEECCeEEEEeecC Q lcl|NC_019522. 294 TGGTEWRIPKAGHYVDGV 311 (311) Q Consensus 294 ~gGv~i~~P~ai~~~dGI 311 (311) + |..+++|.||+.+..- T Consensus 287 ~-~~~v~~~~a~~~l~~a 303 (304) T protein:vir:94 287 I-AYMNVKPEAFATLKPT 303 (304) T ss_pred e-ccEeecccceEEEEec Confidence 7 5667789999999999 No 25 >protein:vir:105905 Length: 304 # NCBI annotation: major capsid protein # Family: family:all:507 # MgeID: mge:1514 # MgeName: phiETA3 # Cross-refs: genbank:acc:YP_001004375;genbank:gi:122891830;genbank:GeneID:4712376 Probab=98.77 E-value=2.7e-09 Score=67.60 Aligned_cols=276 Identities=11% Similarity=0.016 Sum_probs=160.6 Q ss_pred CCcccccccc---hhhhhhhHHHHHHHHHHHHhhhhhhhhhhhhccccCCCCcceeEEEEEEeecccceEEecCcccccc Q lcl|NC_019522. 1 MAKSVFDVSP---VSALSFLVNQAAHIESEIYRIEYPQFKYGTLLPLDNSAPDWAQAVMFRSIDARGELQLFGPNSTDVP 77 (311) Q Consensus 1 ~~~~~~~~~~---~~~~~fl~~~L~~id~~v~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~G~a~~~~~~a~dip 77 (311) ||....++.. .+.++++.. +.+.+.|++...+....++++.+.. .......+.+.+..+.++|++.. ..+| T Consensus 1 ma~~~~~~~~~~~t~~gg~lip--~~~~~~ii~~~~~~~~l~~~~~~~~---~~~~~~~ip~~~~~~~a~~v~E~-~~~~ 74 (304) T protein:vir:10 1 MATPTYTPGNVILSDFKNGVIP--AEQGTLIMKDIMANSAIMKLAKNEP---MTAQKKKFTYLAKGVGAYWVSET-ERIQ 74 (304) T ss_pred CcccccccccccccCCCceecc--hhHHHHHHHHHHhccchhhhcceee---ccCCceEEEEEeCCcceEEeecC-cccc Confidence 9988877552 223345554 3456778887777777777665543 23344567777777788898765 4588 Q ss_pred eeeeeccceeEEEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhhheeeeeccccc-eeeeecCCcceeec Q lcl|NC_019522. 78 TVDIAMSQGFKDINTAALGYTYSIEEIGFAMLNNVNLDAERGQAVRDVVEQGLNKIYLLGDKGVG-EGLYTSPNVSVEAA 156 (311) Q Consensus 78 ~v~~~~~~~~~~v~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aa~~~~~~~~n~~~~~G~~~~g-~GllN~p~v~~~~~ 156 (311) ..+..++......+.++..+.++.+ +. .....+|...-.....+++++.+++.+++|+...+ .|.+....+..... T Consensus 75 ~~~~~~~~i~~~~~k~~~~~~iS~e-ll--~ds~~~l~~~i~~~l~~~ia~~~d~~~l~G~g~~~~~~~~~~~~~~~~~~ 151 (304) T protein:vir:10 75 TSKPEYAQAEMEAKKIGVIIPLSKE-FL--KWTAKDFFNEVKPLIAEAFYKAFDQAVIFGTKSPYNTSTSGKPLVEGAEE 151 (304) T ss_pred cccceeeEEEEEEEEEEEeehhhHH-HH--hcchHHHHHHHHHHHHHHHHHHHHhhheeccCCCcccccccccccccccc Confidence 8888999999999999999999854 33 23457788888999999999999999999987643 34443333322211 Q ss_pred cCCccccCcccccCCHHHHHHHHHHHHHHHHhccCCceecceEEEeCHHHHHHHhcccccCCCCCcchHHHHHHHhC--- Q lcl|NC_019522. 157 TSTFVALVAAIPTNGTQPIIDFFGNAYNTVYLDNTLTVHRPNTFVLPPAQFQLLARTLLSTQNASNVTLLQFLRTNF--- 233 (311) Q Consensus 157 ~~~~~~~~t~w~~~t~~ei~~di~~~~~~~~~~~~~~~~~p~~l~lpp~~~~~L~~~~~~~~~~~~~Tvl~~l~~n~--- 233 (311) ... ...+....++||.+++.++... + ..+.+++|.|+.+..|.+.. . ..+.-++ ..+. T Consensus 152 ~~~--------~~~~~~~~~~~i~~~~~~l~~~--~--~~~~~~v~~~~~~~~L~~lk-d---~~G~~l~---~~~~~~l 212 (304) T protein:vir:10 152 KGN--------VVTDTNNLYVDLSALMATIEDE--E--LDPNGVLTTRSFRSKMRNAL-D---ANDRPLF---DANGNEI 212 (304) T ss_pred ccc--------ccccccchHHHHHHHHHHhhhc--c--CCcCEEEEcHHHHHHHHHhh-c---cCCcEee---cCCCccc Confidence 111 0112233478888888888532 2 23568999999999996532 1 1122111 1110 Q ss_pred CceEEEEchhcccCCCCcccEEEEEEcCcceeEEeecchhhh----------ccc----------eeeCCceEEEeeeee Q lcl|NC_019522. 234 PDITFEDDILLKGAGVAGADRMAVYKKEIRIVKGHDVMPLRF----------LAP----------ATADNVNFKVPAILR 293 (311) Q Consensus 234 ~~l~i~~~~~l~~ag~~g~~~~v~y~~~~~~~~~~~~~~~~~----------~~p----------~~~~~~~~~~~~~~~ 293 (311) -.+.++..+.+..... +-.+++.+ .+++-+..-.+++. ... .+. + ...+.++.| T Consensus 213 ~G~PV~~~~~~~~~~~---~~~~~~gd-~~~~~~~~~~~~~i~~~~e~~~~~~~~~~~~g~~~~~f~~-~-~~~~r~~~r 286 (304) T protein:vir:10 213 MGLPLSYTGADVYDKK---KSLALMGD-WDYARYGILQGIEYAISEDATLTTLQASDASGQPVSLFER-D-MFALRATMH 286 (304) T ss_pred cceeeEEecccccCCC---CcEEEEEe-hhhEEEEEecceEEEEeecceeeeecccccCccchhhhhc-C-cEEEEEEEE Confidence 0122333333322111 11122221 11111111111110 000 011 1 144566788 Q ss_pred eeeEEEECCeEEEEeecC Q lcl|NC_019522. 294 TGGTEWRIPKAGHYVDGV 311 (311) Q Consensus 294 ~gGv~i~~P~ai~~~dGI 311 (311) + |..+++|.||+.+..- T Consensus 287 ~-~~~v~~~~a~~~l~~a 303 (304) T protein:vir:10 287 I-AYMNVKPEAFATLKPT 303 (304) T ss_pred e-ccEeecccceEEEEec Confidence 7 5667789999999999 No 26 >protein:vir:95763 Length: 297 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1578 # MgeName: SMP # Cross-refs: genbank:acc:YP_950590;genbank:gi:119953785;genbank:GeneID:5076833 Probab=98.74 E-value=8e-09 Score=64.99 Aligned_cols=272 Identities=8% Similarity=-0.048 Sum_probs=159.3 Q ss_pred CCcccccccc----hhhhhhhHHHHHHHHHHHHhhhhhhhhhhhhccccCCCCcceeEEEEEEeecccceEEecCccccc Q lcl|NC_019522. 1 MAKSVFDVSP----VSALSFLVNQAAHIESEIYRIEYPQFKYGTLLPLDNSAPDWAQAVMFRSIDARGELQLFGPNSTDV 76 (311) Q Consensus 1 ~~~~~~~~~~----~~~~~fl~~~L~~id~~v~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~G~a~~~~~~a~di 76 (311) |+...||+.. .++.+ +.. +.+..+|++...+.-..++++++..-.. .....+.+......+.+++.+ ..+ T Consensus 1 m~~~~~~~~~~~~t~~~~~-lvP--~~~~~~ii~~~~~~s~l~~~~~~~~~~~--~~~~~~~~~~~~~~a~~v~Eg-~~~ 74 (297) T protein:vir:95 1 MTVQTFNPENVLVSQKKDG-TLH--KEFTDIIMKEVAQNSLVMQLGQYQEMEG--EQEKTVYVQTDGISAYWVNET-EKI 74 (297) T ss_pred CCccccccccccccCCCcc-eec--hhHHHHHHHHHHhhchhhhhcceeecCC--CccEEEEEEcCCceeEEeecC-ccc Confidence 9888888773 22333 333 4556778888877777777777643222 223445556666778888765 568 Q ss_pred ceeeeeccceeEEEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhhheeeeeccccc-eeeeecCCcceee Q lcl|NC_019522. 77 PTVDIAMSQGFKDINTAALGYTYSIEEIGFAMLNNVNLDAERGQAVRDVVEQGLNKIYLLGDKGVG-EGLYTSPNVSVEA 155 (311) Q Consensus 77 p~v~~~~~~~~~~v~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aa~~~~~~~~n~~~~~G~~~~g-~GllN~p~v~~~~ 155 (311) |..+..++......+.++....++.+-++.+ ..++...-....++++.+.+++-+++|+...+ .|+++........ T Consensus 75 ~~~~~~f~~v~l~~~k~~~~~~is~ell~ds---~~~l~~~i~~~la~ai~~~~d~a~l~G~g~~~~~gi~~~~~~~~~~ 151 (297) T protein:vir:95 75 KTDKPEVVPVTLKAHKLGIILVTSREALNYT---WKKFFEDMKPQIVEAFYKKIDEAGLLGHDTPFANSVAKAAKDANKV 151 (297) T ss_pred cccccceeEEEEeeEEEEEeehhhHHHHhcC---HHHHHHHHHHHHHHHHHHHHHHHHhcccCCccccccccccccccee Confidence 9889999999999999999999996544433 35788889999999999999999999987654 6887764421111 Q ss_pred ccCCccccCcccccCCHHHHHHHHHHHHHHHHhccCCceecceEEEeCHHHHHHHhcccccCCCCCcchHHHHHHHhCCc Q lcl|NC_019522. 156 ATSTFVALVAAIPTNGTQPIIDFFGNAYNTVYLDNTLTVHRPNTFVLPPAQFQLLARTLLSTQNASNVTLLQFLRTNFPD 235 (311) Q Consensus 156 ~~~~~~~~~t~w~~~t~~ei~~di~~~~~~~~~~~~~~~~~p~~l~lpp~~~~~L~~~~~~~~~~~~~Tvl~~l~~n~~~ 235 (311) .. +. -| ++||.+++.++... + ..+..+++.|+.+..|.+-. .. .|.-++ ... . T Consensus 152 ~~-----~~-----~t----~~~i~~~~~~l~~~--~--~~~~~~v~~~~~~~~L~~l~-d~---~G~~i~---~~~--~ 204 (297) T protein:vir:95 152 IG-----GP-----IN----YDNILKLQDALYDA--D--VEPNAFVSKIQNRSALREAR-DG---NKVSIY---DKA--A 204 (297) T ss_pred cc-----cc-----cC----HHHHHHHHHHhhhc--c--CCcCEEEEcHHHHHHHHHhh-cc---CCceee---cCC--C Confidence 11 11 12 56778888887532 2 23568999999999986532 11 121111 111 1 Q ss_pred eEEEEchhccc-CCCCcccEEEEEEcC------cceeEEeecchhhhccc----------eeeCCceEEEeeeeeeeeEE Q lcl|NC_019522. 236 ITFEDDILLKG-AGVAGADRMAVYKKE------IRIVKGHDVMPLRFLAP----------ATADNVNFKVPAILRTGGTE 298 (311) Q Consensus 236 l~i~~~~~l~~-ag~~g~~~~v~y~~~------~~~~~~~~~~~~~~~~p----------~~~~~~~~~~~~~~~~gGv~ 298 (311) -++...|-... ........+++-+.+ .+.+++.+-.+...... .+. + ...+.+..|+ |.. T Consensus 205 ~~l~G~Pv~~~~~~~~~~~~~~~gd~s~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~-~-~~~~r~~~~~-d~~ 281 (297) T protein:vir:95 205 NTIDGITTVDLKSARFEKGDLLAGDFDNLIYGVPYNITYKISEEGQISTITNADGTPINLFEQ-E-MIAIRATMDI-AVM 281 (297) T ss_pred CcccceeeEeecCCCCCCceEEEEecccEEEEEecCeEEEEeeccccccccccCccchhhhhc-C-cEEEEEEEEe-ccE Confidence 11211121100 011112222221111 11122222111111000 111 1 2456667777 577 Q ss_pred EECCeEEEEeecC Q lcl|NC_019522. 299 WRIPKAGHYVDGV 311 (311) Q Consensus 299 i~~P~ai~~~dGI 311 (311) +.+|.+++.+..- T Consensus 282 v~~~~a~~~l~~a 294 (297) T protein:vir:95 282 ITKTDAFAKLTPA 294 (297) T ss_pred eecccceEEEeec Confidence 8889999999887 No 27 >protein:vir:104085 Length: 320 # NCBI annotation: gp17 # Family: family:all:507 # MgeID: mge:1656 # MgeName: Che12 # Cross-refs: genbank:acc:YP_655596;genbank:gi:109392467;genbank:GeneID:4156953 Probab=98.73 E-value=6.6e-09 Score=65.43 Aligned_cols=280 Identities=12% Similarity=-0.025 Sum_probs=153.9 Q ss_pred CCcccccccc--------hhhhhhhHHHHHHHHHHHHhhhhhhhhhhhhccccCCCCcceeEEEEEEeecccceEEecCc Q lcl|NC_019522. 1 MAKSVFDVSP--------VSALSFLVNQAAHIESEIYRIEYPQFKYGTLLPLDNSAPDWAQAVMFRSIDARGELQLFGPN 72 (311) Q Consensus 1 ~~~~~~~~~~--------~~~~~fl~~~L~~id~~v~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~G~a~~~~~~ 72 (311) +++.+||... ..+.+++- +.+.++|++........++++++..- ......+.+.+..+.+.|++.+ T Consensus 2 ~~~~~~~~~~~~~~~t~~~~~~~~ip---~~~~~~ii~~~~~~s~l~~~~~~~~~---~~~~~~~p~~~~~~~a~~v~E~ 75 (320) T protein:vir:10 2 AAGTAFQVDHAQIAQTGDTMFKGYLE---PEQAKDYFAEAEKTSIVQQFAQKVPM---GTTGQKIPHWIGDVSAQWIGEG 75 (320) T ss_pred CCCccCCHHHHHhhcccccccccccc---HHHHHHHHHHHHhccchhhhcceeec---cCCceEEEEEeCCcceEEecCC Confidence 4446676441 11223344 33456677777777677777665432 2334667777777888898864 Q ss_pred ccccceeeeeccceeEEEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhhheeeeeccccc-eeeeecCC- Q lcl|NC_019522. 73 STDVPTVDIAMSQGFKDINTAALGYTYSIEEIGFAMLNNVNLDAERGQAVRDVVEQGLNKIYLLGDKGVG-EGLYTSPN- 150 (311) Q Consensus 73 a~dip~v~~~~~~~~~~v~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aa~~~~~~~~n~~~~~G~~~~g-~GllN~p~- 150 (311) ..+|..+..+++...+.+.++..+.++.+=|+ ....++...-.....+++++.+|+.+|+|+.... .|++...+ T Consensus 76 -~~~~~~~~~f~~v~~~~~k~~~~~~is~ell~---ds~~~l~~~i~~~l~~a~a~~~d~a~l~G~g~~~~~~~~~~~~~ 151 (320) T protein:vir:10 76 -DMKPITKGNMTSQNIAPHKIATIFVASAETVR---ANPANYLGTMRTKVATAFAMAFDSAALNGTDSPFPTYLAQTTKS 151 (320) T ss_pred -ccccccccceeEEEEeeEEEEEeehhhHHHHh---cChHHHHHHHHHHHHHHHHHHHHHHhhcccCCCCCccccccccc Confidence 66899999999999999999999999965444 3346788899999999999999999999987532 34433322 Q ss_pred cceeeccCCccccCcccccCCHHHHHHHHHHHHHHHHhccCCceecceEEEeCHHHHHHHhcccccCCCCCcchHHHH-H Q lcl|NC_019522. 151 VSVEAATSTFVALVAAIPTNGTQPIIDFFGNAYNTVYLDNTLTVHRPNTFVLPPAQFQLLARTLLSTQNASNVTLLQF-L 229 (311) Q Consensus 151 v~~~~~~~~~~~~~t~w~~~t~~ei~~di~~~~~~~~~~~~~~~~~p~~l~lpp~~~~~L~~~~~~~~~~~~~Tvl~~-l 229 (311) +........ .... .....+++.+++..+.. . ...+..++++|+.+..|.+-.- ..+..++.- + T Consensus 152 ~~~~~~~~~------~~~~--~~~~~~~~~~~~~~~~~---~-~~~~~~~v~n~~~~~~L~~lkd----~~G~~l~~~~~ 215 (320) T protein:vir:10 152 VSLADPGGA------TASD--LTAYDAVAVNGLSLLVN---A-KKKWTHTLLDDIVEPILNGAKD----KNGRPLFIEST 215 (320) T ss_pred ccceecccc------cccc--cccHHHHHHHHHhhhhc---c-cCCCcEEEEcHHHHHHHHHhhc----cCCceeecccc Confidence 111111110 0111 11112334444444422 1 2246789999999999965321 111111110 0 Q ss_pred H----HhCCceEEEEchhcccCC-CCcccEEEEEEcCcceeEEeecchhhhc----------cc--------eeeCCceE Q lcl|NC_019522. 230 R----TNFPDITFEDDILLKGAG-VAGADRMAVYKKEIRIVKGHDVMPLRFL----------AP--------ATADNVNF 286 (311) Q Consensus 230 ~----~n~~~l~i~~~~~l~~ag-~~g~~~~v~y~~~~~~~~~~~~~~~~~~----------~p--------~~~~~~~~ 286 (311) . .+++..++...|-..... ..++. .++|-+-.+ +-+.....++.. .+ .+. + .. T Consensus 216 ~~~~~~~~~~~~i~g~pv~~~~~~~~~~~-~~~~gd~~~-~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~-~-~~ 291 (320) T protein:vir:10 216 YTDENSPFRAGRIVSRPTILSDHVADGTT-VGYMGDFRN-VIWGQVGGLSFDVTDQATLNLGTPTEPNFVSLWQH-N-LV 291 (320) T ss_pred ccCccccccCceeeeeeeEecCCCCCCce-EEEEeecce-EEEEEecCeEEEEeecceeeeccccccccchhhhc-C-cE Confidence 0 111223444444332211 12221 222222111 111111111110 00 111 1 13 Q ss_pred EEeeeeeeeeEEEECCeEEEEeecC Q lcl|NC_019522. 287 KVPAILRTGGTEWRIPKAGHYVDGV 311 (311) Q Consensus 287 ~~~~~~~~gGv~i~~P~ai~~~dGI 311 (311) .+.+..++ ++.+.+|.|++.+.|+ T Consensus 292 ~~r~~~~~-d~~v~~~~a~~~l~~~ 315 (320) T protein:vir:10 292 AVRVEAEY-AFHNNDKDAFVKLTNV 315 (320) T ss_pred EEEEEEee-ccEEecccceEEEEec Confidence 45666776 6888999999999999 No 28 >protein:vir:94771 Length: 298 # NCBI annotation: major head protein # Family: family:all:966 # MgeID: mge:1529 # MgeName: phi LC3 # Cross-refs: genbank:acc:NP_996706;genbank:gi:45597421;genbank:GeneID:2769044 Probab=98.72 E-value=3.5e-09 Score=66.91 Aligned_cols=274 Identities=11% Similarity=0.010 Sum_probs=154.8 Q ss_pred cchhhhhhhHHHHHHHHHHHHhhhhhhhhhhhhccccCCCCcceeEEEEEEeecccceEEecCcccccceeeeeccceeE Q lcl|NC_019522. 9 SPVSALSFLVNQAAHIESEIYRIEYPQFKYGTLLPLDNSAPDWAQAVMFRSIDARGELQLFGPNSTDVPTVDIAMSQGFK 88 (311) Q Consensus 9 ~~~~~~~fl~~~L~~id~~v~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~G~a~~~~~~a~dip~v~~~~~~~~~ 88 (311) .+.+++ ++.. +.+..+|++...+.-..+++.++..-.+ ....+.+....+.+.|++.+ ..+|..+..++.... T Consensus 1 ma~~gG-~lip--~~~~~~ii~~~~~~s~i~~~~~~~~~~~---~~~~~p~~~~~~~a~~v~Eg-~~~~~~~~~f~~v~l 73 (298) T protein:vir:94 1 MVLNKG-TLFD--PELVTDLISKVAGKSSIARLSAQKPIPF---NGEKVFTFTMDSEIDVVAES-GKKTHGGVTLAPQTM 73 (298) T ss_pred Ceeccc-cccC--hhHHHHHHHHHHhhchhhhhcceeeccC---CceEEEEEecCcceEEeeCC-ccccccccceeEEEE Confidence 222333 3343 3456678887777777777776543222 23566677777888998765 678998999999999 Q ss_pred EEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhhheeeeecccc-c-----eeeeecCCcceeeccCCccc Q lcl|NC_019522. 89 DINTAALGYTYSIEEIGFAMLNNVNLDAERGQAVRDVVEQGLNKIYLLGDKGV-G-----EGLYTSPNVSVEAATSTFVA 162 (311) Q Consensus 89 ~v~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aa~~~~~~~~n~~~~~G~~~~-g-----~GllN~p~v~~~~~~~~~~~ 162 (311) ..+.++.-..+|.+=|+...-...+|...-+...++++.+.+++.+++|.... | .|..+..+.... ... T Consensus 74 ~~~k~~~~~~iS~ell~~~~~~~~~l~~~i~~~la~ai~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~-~~~---- 148 (298) T protein:vir:94 74 VPIKVEYGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVTQ-KVE---- 148 (298) T ss_pred eeeEEEEeeehhHHHhccCCccHHHHHHHHHHHHHHHHHHHHHHHhhcccccCCCccccccccccccccccc-ccc---- Confidence 99999998888844333223345568888899999999999999999995321 1 122221111110 000 Q ss_pred cCcccccCCHHHHHHHHHHHHHHHHhccCCceecceEEEeCHHHHHHHhcccccCCCCCcchHHHHHHHhCC-----ceE Q lcl|NC_019522. 163 LVAAIPTNGTQPIIDFFGNAYNTVYLDNTLTVHRPNTFVLPPAQFQLLARTLLSTQNASNVTLLQFLRTNFP-----DIT 237 (311) Q Consensus 163 ~~t~w~~~t~~ei~~di~~~~~~~~~~~~~~~~~p~~l~lpp~~~~~L~~~~~~~~~~~~~Tvl~~l~~n~~-----~l~ 237 (311) .......+++||.+++.++... + ..+..++|+|+.+..|.+.. +..|.-++.=...+.. .+- T Consensus 149 -----~~~~~~~~~~~i~~~~~~~~~~--~--~~~~~~vmn~~~~~~l~~lk----d~~G~~l~~~~~~~~~~~tl~G~P 215 (298) T protein:vir:94 149 -----APRGIADPNGAIENAVELLTGV--D--ADVTGIAINPSFRSALAKQK----DLQGNALFPELKWGATPDTINGLP 215 (298) T ss_pred -----cccccccHHHHHHHHHHhhhhc--C--CCccEEEEcHHHHHHHHHhh----ccCCCeeecCcccCCCCceeccee Confidence 1112345678999999988542 2 24568999999999986532 1112212111111111 122 Q ss_pred EEEchhcccCCCCcccEEEEEEcCcceeEEeecchhh--hcc---c-------eeeCCceEEEeeeeeeeeEEEECCeEE Q lcl|NC_019522. 238 FEDDILLKGAGVAGADRMAVYKKEIRIVKGHDVMPLR--FLA---P-------ATADNVNFKVPAILRTGGTEWRIPKAG 305 (311) Q Consensus 238 i~~~~~l~~ag~~g~~~~v~y~~~~~~~~~~~~~~~~--~~~---p-------~~~~~~~~~~~~~~~~gGv~i~~P~ai 305 (311) ++....+.+.....++.+++-+-+ +.+.+.+-..++ +.. + .+. + ...+.++.|+ |+.+++|.|+ T Consensus 216 V~~~~~v~~~~~~~~~~~~~Gdfs-~~~~~~~~~~~~~~~~~~~~~d~~~~~~f~~-~-~v~~r~~~r~-~~~~~~~~a~ 291 (298) T protein:vir:94 216 VDVNKTVSDMSLTQRDRAIIGDFA-NGFKWGYAKEVPLEVIQYGDPDNSGLDLKGY-N-QVYIRAELFL-GWGILDATKF 291 (298) T ss_pred eEEecccccccCCCccEEEEeecc-ceEEEEEecCceEEEeecCCCcCcchhhhhc-C-cEEEEEEEEe-ccEeecccce Confidence 333333333222333333322211 112121111211 110 0 111 1 1345566776 6888999999 Q ss_pred EEeecC Q lcl|NC_019522. 306 HYVDGV 311 (311) Q Consensus 306 ~~~dGI 311 (311) +++.|. T Consensus 292 ~~l~~~ 297 (298) T protein:vir:94 292 ARVTEA 297 (298) T ss_pred EEEEec Confidence 999999 No 29 >protein:vir:96392 Length: 324 # NCBI annotation: ORF011 # Family: family:all:507 # MgeID: mge:1613 # MgeName: 53 # Cross-refs: genbank:acc:YP_239648;genbank:gi:66395381;genbank:GeneID:5132868 Probab=98.70 E-value=7.2e-09 Score=65.23 Aligned_cols=273 Identities=8% Similarity=-0.013 Sum_probs=153.8 Q ss_pred CCccccccc---chhhhhhhHHHHHHHHHHHHhhhhhhhhhhhhccccCCCCcceeEEEEEEeecccceEEecCcccccc Q lcl|NC_019522. 1 MAKSVFDVS---PVSALSFLVNQAAHIESEIYRIEYPQFKYGTLLPLDNSAPDWAQAVMFRSIDARGELQLFGPNSTDVP 77 (311) Q Consensus 1 ~~~~~~~~~---~~~~~~fl~~~L~~id~~v~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~G~a~~~~~~a~dip 77 (311) ....++++. ..+..+.+.. +.+...|++.....-..+.++++.+ ....++.+.+.+..+.+.|++.+ ..+| T Consensus 19 ~~~~~~~a~~~~~~~~~~~~iP--~~~~~~ii~~~~~~s~l~~l~~~~~---~~~~~~~~p~~~~~~~a~~v~Eg-~~~~ 92 (324) T protein:vir:96 19 VKPQVFNPDNVMMHEKKDGTLM--NEFTTPILQEVMENSKIMQLGKYEP---MEGTEKKFTFWADKPGAYWVGEG-QKIE 92 (324) T ss_pred hhhhhhccccccccCcCccccc--hhHHHHHHHHHHhhchhhhhcceee---ccCCceEEEEEecCcceeEecCC-cccc Confidence 111222221 1122333443 3455677777777666777766543 22334667777778888998774 6689 Q ss_pred eeeeeccceeEEEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhhheeeeeccccc--eeeeecCCcceee Q lcl|NC_019522. 78 TVDIAMSQGFKDINTAALGYTYSIEEIGFAMLNNVNLDAERGQAVRDVVEQGLNKIYLLGDKGVG--EGLYTSPNVSVEA 155 (311) Q Consensus 78 ~v~~~~~~~~~~v~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aa~~~~~~~~n~~~~~G~~~~g--~GllN~p~v~~~~ 155 (311) ..+..+++.....+.++....++.+=++. ...++...-.....+++.+.+++.+|+|+...+ .|+++..+..... T Consensus 93 ~~~~~~~~v~~~~~k~~~~~~is~ell~d---s~~~l~~~i~~~la~ai~~~~d~a~l~G~g~~~~~~gi~~~~~~~~~~ 169 (324) T protein:vir:96 93 TSKATWVNATMRAFKLGVILPVTKEFLNY---TYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGKSIAQSIEKTNKV 169 (324) T ss_pred ccccceeEEEEeeEEEEEeehhhHHHHhc---chHHHHHHHHHHHHHHHHHHHHHHHhccCCCCCcCcccccccccccee Confidence 99999999999999999999998643433 346788888999999999999999999986543 4666554432211 Q ss_pred ccCCccccCcccccCCHHHHHHHHHHHHHHHHhccCCceecceEEEeCHHHHHHHhcccccCCCCCcchHHHHHHHhCCc Q lcl|NC_019522. 156 ATSTFVALVAAIPTNGTQPIIDFFGNAYNTVYLDNTLTVHRPNTFVLPPAQFQLLARTLLSTQNASNVTLLQFLRTNFPD 235 (311) Q Consensus 156 ~~~~~~~~~t~w~~~t~~ei~~di~~~~~~~~~~~~~~~~~p~~l~lpp~~~~~L~~~~~~~~~~~~~Tvl~~l~~n~~~ 235 (311) ..+ + .-++||.+++.++... + ..+..++|+|+.+..|.+.. . ..+..++ ..... T Consensus 170 ~~~----------~----~t~~~i~~~~~~l~~~--~--~~~~~~vmn~~~~~~L~~l~-d---~~G~~~~----~~~~~ 223 (324) T protein:vir:96 170 IKG----------D----FTQDNIIDLEALLEDD--E--LEANAFISKTQNRSLLRKIV-D---PETKERI----YDRNS 223 (324) T ss_pred ccc----------c----ccHHHHHHHHHhhhhc--c--CCCCEEEEcHHHHHHHHHhh-c---cCCCeee----cCCCC Confidence 110 1 1267788888877432 2 34678999999999986532 1 1122111 11111 Q ss_pred eEEEEchhcc-cCCCCcccEEEEEEcC------cceeEEeecchhhhcc-------c---eeeCCceEEEeeeeeeeeEE Q lcl|NC_019522. 236 ITFEDDILLK-GAGVAGADRMAVYKKE------IRIVKGHDVMPLRFLA-------P---ATADNVNFKVPAILRTGGTE 298 (311) Q Consensus 236 l~i~~~~~l~-~ag~~g~~~~v~y~~~------~~~~~~~~~~~~~~~~-------p---~~~~~~~~~~~~~~~~gGv~ 298 (311) -++...|-.. .+...++..+++-+.+ .+.+.+.+-....... + .+. + ...+.+..|+ ++. T Consensus 224 ~~l~G~PV~~~~~~~~~~~~~~~gd~~~~~~g~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~-d-~~~~r~~~r~-d~~ 300 (324) T protein:vir:96 224 DSLDGLPVVNLKSSNLKRGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQ-D-MVALRATMHV-ALH 300 (324) T ss_pred CcccceeeEeeCCCCCCcceEEEEecceEEEEEecCcEEEEeecccccccccccccchhhhhc-C-cEEEEEEEEE-ccE Confidence 1222222111 1111222222221111 1122222211100000 0 111 1 2455666776 577 Q ss_pred EECCeEEEEeecC Q lcl|NC_019522. 299 WRIPKAGHYVDGV 311 (311) Q Consensus 299 i~~P~ai~~~dGI 311 (311) +.+|.|++++.|. T Consensus 301 v~~~~A~~~l~~a 313 (324) T protein:vir:96 301 IADDKAFAKLVPA 313 (324) T ss_pred EecccceEEEecc Confidence 7889999999999 No 30 >protein:vir:78830 Length: 324 # NCBI annotation: major head protein # Family: family:all:507 # MgeID: mge:1858 # MgeName: 80alpha # Cross-refs: genbank:acc:YP_001285361;genbank:gi:148717889;genbank:GeneID:5246961 Probab=98.70 E-value=7.2e-09 Score=65.23 Aligned_cols=273 Identities=8% Similarity=-0.013 Sum_probs=153.8 Q ss_pred CCccccccc---chhhhhhhHHHHHHHHHHHHhhhhhhhhhhhhccccCCCCcceeEEEEEEeecccceEEecCcccccc Q lcl|NC_019522. 1 MAKSVFDVS---PVSALSFLVNQAAHIESEIYRIEYPQFKYGTLLPLDNSAPDWAQAVMFRSIDARGELQLFGPNSTDVP 77 (311) Q Consensus 1 ~~~~~~~~~---~~~~~~fl~~~L~~id~~v~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~G~a~~~~~~a~dip 77 (311) ....++++. ..+..+.+.. +.+...|++.....-..+.++++.+ ....++.+.+.+..+.+.|++.+ ..+| T Consensus 19 ~~~~~~~a~~~~~~~~~~~~iP--~~~~~~ii~~~~~~s~l~~l~~~~~---~~~~~~~~p~~~~~~~a~~v~Eg-~~~~ 92 (324) T protein:vir:78 19 VKPQVFNPDNVMMHEKKDGTLM--NEFTTPILQEVMENSKIMQLGKYEP---MEGTEKKFTFWADKPGAYWVGEG-QKIE 92 (324) T ss_pred hhhhhhccccccccCcCccccc--hhHHHHHHHHHHhhchhhhhcceee---ccCCceEEEEEecCcceeEecCC-cccc Confidence 111222221 1122333443 3455677777777666777766543 22334667777778888998774 6689 Q ss_pred eeeeeccceeEEEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhhheeeeeccccc--eeeeecCCcceee Q lcl|NC_019522. 78 TVDIAMSQGFKDINTAALGYTYSIEEIGFAMLNNVNLDAERGQAVRDVVEQGLNKIYLLGDKGVG--EGLYTSPNVSVEA 155 (311) Q Consensus 78 ~v~~~~~~~~~~v~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aa~~~~~~~~n~~~~~G~~~~g--~GllN~p~v~~~~ 155 (311) ..+..+++.....+.++....++.+=++. ...++...-.....+++.+.+++.+|+|+...+ .|+++..+..... T Consensus 93 ~~~~~~~~v~~~~~k~~~~~~is~ell~d---s~~~l~~~i~~~la~ai~~~~d~a~l~G~g~~~~~~gi~~~~~~~~~~ 169 (324) T protein:vir:78 93 TSKATWVNATMRAFKLGVILPVTKEFLNY---TYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGKSIAQSIEKTNKV 169 (324) T ss_pred ccccceeEEEEeeEEEEEeehhhHHHHhc---chHHHHHHHHHHHHHHHHHHHHHHHhccCCCCCcCcccccccccccee Confidence 99999999999999999999998643433 346788888999999999999999999986543 4666554432211 Q ss_pred ccCCccccCcccccCCHHHHHHHHHHHHHHHHhccCCceecceEEEeCHHHHHHHhcccccCCCCCcchHHHHHHHhCCc Q lcl|NC_019522. 156 ATSTFVALVAAIPTNGTQPIIDFFGNAYNTVYLDNTLTVHRPNTFVLPPAQFQLLARTLLSTQNASNVTLLQFLRTNFPD 235 (311) Q Consensus 156 ~~~~~~~~~t~w~~~t~~ei~~di~~~~~~~~~~~~~~~~~p~~l~lpp~~~~~L~~~~~~~~~~~~~Tvl~~l~~n~~~ 235 (311) ..+ + .-++||.+++.++... + ..+..++|+|+.+..|.+.. . ..+..++ ..... T Consensus 170 ~~~----------~----~t~~~i~~~~~~l~~~--~--~~~~~~vmn~~~~~~L~~l~-d---~~G~~~~----~~~~~ 223 (324) T protein:vir:78 170 IKG----------D----FTQDNIIDLEALLEDD--E--LEANAFISKTQNRSLLRKIV-D---PETKERI----YDRNS 223 (324) T ss_pred ccc----------c----ccHHHHHHHHHhhhhc--c--CCCCEEEEcHHHHHHHHHhh-c---cCCCeee----cCCCC Confidence 110 1 1267788888877432 2 34678999999999986532 1 1122111 11111 Q ss_pred eEEEEchhcc-cCCCCcccEEEEEEcC------cceeEEeecchhhhcc-------c---eeeCCceEEEeeeeeeeeEE Q lcl|NC_019522. 236 ITFEDDILLK-GAGVAGADRMAVYKKE------IRIVKGHDVMPLRFLA-------P---ATADNVNFKVPAILRTGGTE 298 (311) Q Consensus 236 l~i~~~~~l~-~ag~~g~~~~v~y~~~------~~~~~~~~~~~~~~~~-------p---~~~~~~~~~~~~~~~~gGv~ 298 (311) -++...|-.. .+...++..+++-+.+ .+.+.+.+-....... + .+. + ...+.+..|+ ++. T Consensus 224 ~~l~G~PV~~~~~~~~~~~~~~~gd~~~~~~g~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~-d-~~~~r~~~r~-d~~ 300 (324) T protein:vir:78 224 DSLDGLPVVNLKSSNLKRGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQ-D-MVALRATMHV-ALH 300 (324) T ss_pred CcccceeeEeeCCCCCCcceEEEEecceEEEEEecCcEEEEeecccccccccccccchhhhhc-C-cEEEEEEEEE-ccE Confidence 1222222111 1111222222221111 1122222211100000 0 111 1 2455666776 577 Q ss_pred EECCeEEEEeecC Q lcl|NC_019522. 299 WRIPKAGHYVDGV 311 (311) Q Consensus 299 i~~P~ai~~~dGI 311 (311) +.+|.|++++.|. T Consensus 301 v~~~~A~~~l~~a 313 (324) T protein:vir:78 301 IADDKAFAKLVPA 313 (324) T ss_pred EecccceEEEecc Confidence 7889999999999 No 31 >protein:vir:1638 Length: 298 # NCBI annotation: Structural protein # Family: family:all:966 # MgeID: mge:33 # MgeName: r1t # Cross-refs: genbank:acc:NP_695059;genbank:gi:23455750;genbank:GeneID:955469 Probab=98.69 E-value=5e-09 Score=66.09 Aligned_cols=274 Identities=11% Similarity=0.027 Sum_probs=156.7 Q ss_pred CCcccccccchhhhhhhHHHHHHHHHHHHhhhhhhhhhhhhccccCCCCcceeEEEEEEeecccceEEecCcccccceee Q lcl|NC_019522. 1 MAKSVFDVSPVSALSFLVNQAAHIESEIYRIEYPQFKYGTLLPLDNSAPDWAQAVMFRSIDARGELQLFGPNSTDVPTVD 80 (311) Q Consensus 1 ~~~~~~~~~~~~~~~fl~~~L~~id~~v~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~G~a~~~~~~a~dip~v~ 80 (311) || .+++ ++.. +.+..+|++...+.-..+++.++.... .....+.+.+..+.+.|++.. .++|..+ T Consensus 1 ma--------~~gG-~lvp--~~~~~~ii~~~~~~s~i~~l~~~~~~~---~~~~~ip~~~~~~~a~~v~E~-~~~~~~~ 65 (298) T protein:vir:16 1 MV--------LNKG-TLFD--PTLVTDLISKVAGKSSIARLSAQKPIP---FNGEKVFTFTMDSEIDVVAES-GKKTHGG 65 (298) T ss_pred Cc--------ccCc-ceec--hhHHHHHHHHHHhhhhhhhhcceeecc---CCceEEEEEecCcceEEecCC-ccccccc Confidence 22 2233 2332 334567788777777777777654322 233456667777889999765 6789999 Q ss_pred eeccceeEEEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhhheeeeeccc-cc-----eeeeecCCccee Q lcl|NC_019522. 81 IAMSQGFKDINTAALGYTYSIEEIGFAMLNNVNLDAERGQAVRDVVEQGLNKIYLLGDKG-VG-----EGLYTSPNVSVE 154 (311) Q Consensus 81 ~~~~~~~~~v~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aa~~~~~~~~n~~~~~G~~~-~g-----~GllN~p~v~~~ 154 (311) ..++......+.++.-..+|.+=|+.+.....+|...-+...++++.+.+++-+++|... .| .|+....+.... T Consensus 66 ~~f~~v~l~~~k~a~~~~iS~ell~~s~d~~~~l~~~i~~~la~ai~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~ 145 (298) T protein:vir:16 66 VTLAPQTMVPIKVEYGARISDEFMYASDEEKINILQEFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVTQ 145 (298) T ss_pred cceeEEEEeeeeEEEeehhhHHHhhcCcccHHHHHHHHHHHHHHHHHHHHHHHhhccccCCCCccccccccccccccccc Confidence 999999999999999999885544444455678888899999999999999999999532 11 233222221111 Q ss_pred eccCCccccCcccccCCHHHHHHHHHHHHHHHHhccCCceecceEEEeCHHHHHHHhcccccCCCCCcchHHHHHHHhCC Q lcl|NC_019522. 155 AATSTFVALVAAIPTNGTQPIIDFFGNAYNTVYLDNTLTVHRPNTFVLPPAQFQLLARTLLSTQNASNVTLLQFLRTNFP 234 (311) Q Consensus 155 ~~~~~~~~~~t~w~~~t~~ei~~di~~~~~~~~~~~~~~~~~p~~l~lpp~~~~~L~~~~~~~~~~~~~Tvl~~l~~n~~ 234 (311) ... . .......+.||.+++.++... + ..+..++|+|+.+..|.+-. +..|.-++.-...+.. T Consensus 146 ~~~---------~-~~~~~~~~~~i~~~~~~~~~~--~--~~~~~~vmn~~~~~~l~~lk----d~~G~~i~~~~~~~~~ 207 (298) T protein:vir:16 146 KVE---------A-PRGIADPNGAIENAVELLTGV--D--ADVTGIAINPSFRSALAKQK----DLQDNALFPELKWGAT 207 (298) T ss_pred ccc---------c-ccccccHHHHHHHHHHHhhhc--C--CCccEEEEcHHHHHHHHHhh----ccCCCeeecCcccCCC Confidence 111 1 111233477899999988542 1 23567999999999886532 2223322211111111 Q ss_pred -----ceEEEEchhcccCCCCcccEEEEEEcCcceeEEeecchhh--hcc---c-------eeeCCceEEEeeeeeeeeE Q lcl|NC_019522. 235 -----DITFEDDILLKGAGVAGADRMAVYKKEIRIVKGHDVMPLR--FLA---P-------ATADNVNFKVPAILRTGGT 297 (311) Q Consensus 235 -----~l~i~~~~~l~~ag~~g~~~~v~y~~~~~~~~~~~~~~~~--~~~---p-------~~~~~~~~~~~~~~~~gGv 297 (311) .+.++....+.+.....++.+++-+-+ +.+.+.+...++ +.. + .+. ++ ..+.++.|+ |. T Consensus 208 ~~~l~G~PV~~~~~v~~~~~~~~~~~~~GDfs-~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~-~~-v~~ra~~r~-d~ 283 (298) T protein:vir:16 208 PDTINGLPVDVNKTVSDMSLTQRDRAIIGDFA-NGFKWGYAKEVPLEVIQYGDPDNSGLDLKGY-NQ-VYIRAELFL-GW 283 (298) T ss_pred CceecceeeEEecccccccCCCccEEEEeecc-ceEEEEEecCceEEEeeccCCcCcchhhhhc-Cc-EEEEEEEEE-cc Confidence 122333333333333334444432221 112121111111 110 0 111 11 345666776 68 Q ss_pred EEECCeEEEEeecC Q lcl|NC_019522. 298 EWRIPKAGHYVDGV 311 (311) Q Consensus 298 ~i~~P~ai~~~dGI 311 (311) .+++|.+++++.|. T Consensus 284 ~v~~~~a~~~l~~a 297 (298) T protein:vir:16 284 GILDATKFARVTEA 297 (298) T ss_pred EeecccceEEEeec Confidence 89999999999999 No 32 >protein:vir:94673 Length: 419 # NCBI annotation: major capsid protein # Family: family:all:585 # MgeID: mge:1527 # MgeName: mu1/6 # Cross-refs: genbank:acc:YP_579208;genbank:gi:93007444;genbank:GeneID:5076792 Probab=98.69 E-value=8.8e-09 Score=64.76 Aligned_cols=278 Identities=10% Similarity=0.017 Sum_probs=153.0 Q ss_pred CCcccc--ccc-chhhhhhhHHHHHHHHHHHHhhhhhhhhhhhhccccCCCCcceeEEEEEE--------eecccceEEe Q lcl|NC_019522. 1 MAKSVF--DVS-PVSALSFLVNQAAHIESEIYRIEYPQFKYGTLLPLDNSAPDWAQAVMFRS--------IDARGELQLF 69 (311) Q Consensus 1 ~~~~~~--~~~-~~~~~~fl~~~L~~id~~v~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~--------~~~~G~a~~~ 69 (311) .....+ .+. ...+...+.. +.+...+......+...+.++.+..... ..+.|.+ ....+.+.|+ T Consensus 116 ~~~~~~~~~~~~~~~~~~~~~p--~~~~~~i~~~~~~~~~i~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~a~~v 190 (419) T protein:vir:94 116 NRLLSRDAPAGTITNPNVPHLP--QLVPGIVPTTPDLPLLVADLLDQQNADY---NVLEYIRDTSGTAGAGSTWNKAAVV 190 (419) T ss_pred HHhhccccccccccCCcccccc--hhhhHHHHHHHhhhhhhhhcceeeeccC---CceeeeeeccccccccccCccccee Confidence 000000 000 0111112222 2344445555555556666666543222 2222222 2233456787 Q ss_pred cCcccccceeeeeccceeEEEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhhheeeeeccccc-eeeeec Q lcl|NC_019522. 70 GPNSTDVPTVDIAMSQGFKDINTAALGYTYSIEEIGFAMLNNVNLDAERGQAVRDVVEQGLNKIYLLGDKGVG-EGLYTS 148 (311) Q Consensus 70 ~~~a~dip~v~~~~~~~~~~v~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aa~~~~~~~~n~~~~~G~~~~g-~GllN~ 148 (311) +.++ .+|..+..++......+.++..+.++.+=++.+ .++.+.-....++++...+|+.+++|+.... .|++|. T Consensus 191 ~Eg~-~~~~~~~~~~~i~~~~~k~~~~~~is~ell~d~----~~l~~~i~~~la~a~~~~~d~aii~G~G~~~p~Gi~~~ 265 (419) T protein:vir:94 191 PEGT-AKPQSTLSFDTITTTLKTVAHWLPITRQAADDN----SQLMGYIQGRLTYGLRFLRDRQLLNGNGSTEMQGILTT 265 (419) T ss_pred cCCc-cccccccceeeEEeeeeeEEEeehhhHHHHHhH----HHHHHHHHHHHHHHHHHHHHHHHHhccCcccccceecc Confidence 7764 478888889999999999999999996544432 2478888888999999999999999998755 699999 Q ss_pred CCcceeeccCCccccCcccccCCHHHHHHHHHHHHHHHHhccCCceecceEEEeCHHHHHHHhcccccCCCCCcchHHH- Q lcl|NC_019522. 149 PNVSVEAATSTFVALVAAIPTNGTQPIIDFFGNAYNTVYLDNTLTVHRPNTFVLPPAQFQLLARTLLSTQNASNVTLLQ- 227 (311) Q Consensus 149 p~v~~~~~~~~~~~~~t~w~~~t~~ei~~di~~~~~~~~~~~~~~~~~p~~l~lpp~~~~~L~~~~~~~~~~~~~Tvl~- 227 (311) +++.....+.. +...|....++||.+++..+... + ..+..++|+|+.+..|.+...... +.-++. T Consensus 266 ~~~~~~~~~~~-------~~~~t~~~~~~~l~~~~~~~~~~--~--~~~~~~v~n~~~~~~l~~~k~~~~---~~~~~~~ 331 (419) T protein:vir:94 266 PGIGTYQQPKP-------TAPATDEPPLVDIRRAKTVAEIA--G--FPPDGVVVHPQDWESIELDQAPGS---GVFRVIA 331 (419) T ss_pred ccccccccccc-------ccccccchhHHHHHHHHHhhhhc--c--CCCCEEEEcHHHHHHHHHHhhcCC---CceeecC Confidence 99866554432 34556667799999999998532 2 246689999999998865432211 110000 Q ss_pred HHHHhCC-----ceEEEEchhcccCCCCcccEEEEEEcCcceeEEeecchhhhccceeeC----CceEEEeeeeeeeeEE Q lcl|NC_019522. 228 FLRTNFP-----DITFEDDILLKGAGVAGADRMAVYKKEIRIVKGHDVMPLRFLAPATAD----NVNFKVPAILRTGGTE 298 (311) Q Consensus 228 ~l~~n~~-----~l~i~~~~~l~~ag~~g~~~~v~y~~~~~~~~~~~~~~~~~~~p~~~~----~~~~~~~~~~~~gGv~ 298 (311) .+. +.. .+.++.+..+.. + . ++|-+-.+..-+..-..++...-.+.+ .-...+.++.|++ +. T Consensus 332 ~~~-~~~~~~l~G~pV~~~~~~~~-~-----~-~~~gd~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~r~~~r~d-~~ 402 (419) T protein:vir:94 332 NVQ-GEATPRIWGLNVVSTVAIAQ-G-----T-ALVGGFRQGATLWSRQGITVLMTDSHADFFTANTLVILAEFRAN-LA 402 (419) T ss_pred Ccc-cCCCccccceeeEEcCCCCC-c-----c-EEEeeccceEEEEEecceEEEEeccccchhhcCcEEEEEEEeec-cE Confidence 000 000 112333333221 1 1 111111111111111122111000100 1124456677774 66 Q ss_pred EECCeEEEEeecC Q lcl|NC_019522. 299 WRIPKAGHYVDGV 311 (311) Q Consensus 299 i~~P~ai~~~dGI 311 (311) +++|.||++++.- T Consensus 403 v~~~~a~~~~~~~ 415 (419) T protein:vir:94 403 VYQPKAFVRVTFA 415 (419) T ss_pred EeccccEEEEEec Confidence 7889999998888 No 33 >protein:vir:105038 Length: 428 # NCBI annotation: major capsid head protein precursor # Family: family:all:21 # MgeID: mge:1465 # MgeName: phiKO2 # Cross-refs: genbank:acc:YP_006586;genbank:gi:46402092;genbank:GeneID:2777903 Probab=98.67 E-value=7.7e-09 Score=65.07 Aligned_cols=287 Identities=11% Similarity=0.041 Sum_probs=152.6 Q ss_pred CC---------cccccccchhhhhhhHHHHHHHHHHHHhhhhhhhhhhhhccccCCCCcceeEEEEEEeecccceEEecC Q lcl|NC_019522. 1 MA---------KSVFDVSPVSALSFLVNQAAHIESEIYRIEYPQFKYGTLLPLDNSAPDWAQAVMFRSIDARGELQLFGP 71 (311) Q Consensus 1 ~~---------~~~~~~~~~~~~~fl~~~L~~id~~v~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~G~a~~~~~ 71 (311) |+ ..++.+.+. .+++++. +.+.++|++........+++..- .++.....+.+.+....+.+.|++. T Consensus 113 ~~~~~~~~~~~~~~~~~~~~-~gg~liP--~~~~~~ii~~l~~~~~l~~~~~~--~~~~~~g~~~~p~~~~~~~a~~v~E 187 (428) T protein:vir:10 113 FASDELNDQSVSMAISTAAG-SGGVLIP--QNIHSEVIELLRDRTIVRKLGAR--SIPLPNGNMSLPRLAGGATASYTGE 187 (428) T ss_pred HhhhhhhhhhHhhhhccccc-CCccccc--hhHHHHHHHHHhhhchhhhhcce--eeecCCcceEEEEEeCCcceeeecc Confidence 11 111112222 2345554 34556788777666555555221 1222222355666666677888876 Q ss_pred cccccceeeeeccceeEEEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhhheeeeeccccc--eeeeecC Q lcl|NC_019522. 72 NSTDVPTVDIAMSQGFKDINTAALGYTYSIEEIGFAMLNNVNLDAERGQAVRDVVEQGLNKIYLLGDKGVG--EGLYTSP 149 (311) Q Consensus 72 ~a~dip~v~~~~~~~~~~v~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aa~~~~~~~~n~~~~~G~~~~g--~GllN~p 149 (311) + ..+|..+..++......+.++.-+.+|.+=|.. ...+|..--.....+++...+|+.+++|+...+ .|++|.. T Consensus 188 g-~~~~~~~~~f~~i~~~~~k~~~~v~is~ell~d---s~~~l~~~i~~~l~~ai~~~~d~~~l~G~G~~~~p~Gi~~~~ 263 (428) T protein:vir:10 188 N-QDAKVSEARFDDVKLTAKTMIAMVPISNALIGR---AGFNVEQLVLQDILTAISVREDKAFMRDDGTGDTPIGMKARA 263 (428) T ss_pred C-ccccccccceeeEEeeeEEEEEeehhhHHHHhh---hhHHHHHHHHHHHHHHHHHHHHHHHhccCCCCcccccccccc Confidence 5 567888888999999999999999998654443 345788888899999999999999999987543 5999876 Q ss_pred CcceeeccCCccccCcccccCCHHHHHHHHHHHHHHHHhccCCceecceEEEeCHHHHHHHhcccccCCCCCcchHHHHH Q lcl|NC_019522. 150 NVSVEAATSTFVALVAAIPTNGTQPIIDFFGNAYNTVYLDNTLTVHRPNTFVLPPAQFQLLARTLLSTQNASNVTLLQFL 229 (311) Q Consensus 150 ~v~~~~~~~~~~~~~t~w~~~t~~ei~~di~~~~~~~~~~~~~~~~~p~~l~lpp~~~~~L~~~~~~~~~~~~~Tvl~~l 229 (311) ............ ...+.+.+-.++..+.... ............+|.|..+..|.+.. +..|.-++.-. T Consensus 264 ~~~~~~~~~~~~------~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~v~n~~~~~~L~~lk----d~~G~~i~~~~ 331 (428) T protein:vir:10 264 TQWNRLLPWAAD------AAVNLDTIDTYLDSIILMS--MDGNSNMISSGWGMSNRTYMKLFGLR----DGNGNKVYPEM 331 (428) T ss_pred cccccccccccc------ccccHHHHHHHHHHHHHhh--hccccccccCEEEEcHHHHHHHHHhh----ccCCceeccCC Confidence 543322221111 1222333222333222211 11111223457889999999886532 12222222100 Q ss_pred HH-hCCceEEEEchhcc-cCCCCcccEEEEEEcCcceeEEeecchhhhccceee--------------CCceEEEeeeee Q lcl|NC_019522. 230 RT-NFPDITFEDDILLK-GAGVAGADRMAVYKKEIRIVKGHDVMPLRFLAPATA--------------DNVNFKVPAILR 293 (311) Q Consensus 230 ~~-n~~~l~i~~~~~l~-~ag~~g~~~~v~y~~~~~~~~~~~~~~~~~~~p~~~--------------~~~~~~~~~~~~ 293 (311) .. ....+.++....+. +.+.++....++|.+=.+++ +..-..+++..-.+. ++ ...+.++.| T Consensus 332 ~~g~l~G~pv~~~~~~p~~~~~~~~~~~i~~gd~s~~~-i~~~~~i~i~~~~~~~~~~~~~~~~~~f~~~-~~~~R~~~r 409 (428) T protein:vir:10 332 AQGMLKGYPIQRTSAIPANLGEGGKESEIYFADFNDVV-IGEDGNMKVDFSKEASYIDTDGKLVSAFSRN-QSLIRVVTE 409 (428) T ss_pred CCCeeeceeeEEeccccccccCCCccceEEEEecceEE-EEEecceEEEeecccccccccccccchhhcc-hhheeeeee Confidence 00 00012233333332 22333333334443322222 222222221100000 11 134567778 Q ss_pred eeeEEEECCeEEEEeecC Q lcl|NC_019522. 294 TGGTEWRIPKAGHYVDGV 311 (311) Q Consensus 294 ~gGv~i~~P~ai~~~dGI 311 (311) + ++.+++|.||++++|| T Consensus 410 ~-d~~v~~p~a~~~~t~~ 426 (428) T protein:vir:10 410 H-DIGFRHPEGLVLGTGV 426 (428) T ss_pred e-CceeeccceEEEEecc Confidence 7 6899999999999999 No 34 >protein:vir:103955 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1662 # MgeName: phiNM # Cross-refs: genbank:acc:YP_873992;genbank:gi:118430767;genbank:GeneID:4525449 Probab=98.62 E-value=2.1e-08 Score=62.67 Aligned_cols=270 Identities=9% Similarity=-0.011 Sum_probs=152.2 Q ss_pred CCccccccc----chhhhhhhHHHHHHHHHHHHhhhhhhhhhhhhccccCCCCcceeEEEEEEeecccceEEecCccccc Q lcl|NC_019522. 1 MAKSVFDVS----PVSALSFLVNQAAHIESEIYRIEYPQFKYGTLLPLDNSAPDWAQAVMFRSIDARGELQLFGPNSTDV 76 (311) Q Consensus 1 ~~~~~~~~~----~~~~~~fl~~~L~~id~~v~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~G~a~~~~~~a~di 76 (311) ..+.+++++ ++++.+ +.. +.+...|++.....-..++++++.. ....++.+.+.+..+.+.|++.+ ..+ T Consensus 19 ~~~~~~~a~~~~~~~~~~~-liP--~~~~~~ii~~~~~~s~l~~~~~~~~---~~~~~~~~p~~~~~~~a~~v~Eg-~~~ 91 (324) T protein:vir:10 19 VKPQVFNPDNVMMHEKKDG-TLL--NDFTTPILQEVMENSKIMQLGKYEP---MEGTEKKFTFWADKPGAYWVGEG-QKI 91 (324) T ss_pred hccceecccceeccCCCcc-eec--hhHHHHHHHHHHhhchhhhhcceee---ccCCceEEEEEeCCcceeEeccC-ccc Confidence 333343332 122222 232 3455667777766666666666543 22335677777777889998865 568 Q ss_pred ceeeeeccceeEEEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhhheeeeeccccc--eeeeecCCccee Q lcl|NC_019522. 77 PTVDIAMSQGFKDINTAALGYTYSIEEIGFAMLNNVNLDAERGQAVRDVVEQGLNKIYLLGDKGVG--EGLYTSPNVSVE 154 (311) Q Consensus 77 p~v~~~~~~~~~~v~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aa~~~~~~~~n~~~~~G~~~~g--~GllN~p~v~~~ 154 (311) |..+..++......+.++....++.+-++.+ ..++...-.....+++.+.+++.+++|+...+ .|+++....... T Consensus 92 ~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds---~~~l~~~i~~~l~~ai~~~~d~a~l~G~g~~~~~~~i~~~~~~~~~ 168 (324) T protein:vir:10 92 ETSKATWVNATMRAFKLGVILPVTKEFLNYT---YSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGKSIAQSIEKTNK 168 (324) T ss_pred cccccceeEEEEeeEEEEEeehhhHHHHhcc---hHHHHHHHHHHHHHHHHHHHHHHhhhcCCCCccCccccccccccce Confidence 9989999999999999999999986544432 35788888899999999999999999976543 466654332111 Q ss_pred eccCCccccCcccccCCHHHHHHHHHHHHHHHHhccCCceecceEEEeCHHHHHHHhcccccCCCCCcchHHHHHHHhCC Q lcl|NC_019522. 155 AATSTFVALVAAIPTNGTQPIIDFFGNAYNTVYLDNTLTVHRPNTFVLPPAQFQLLARTLLSTQNASNVTLLQFLRTNFP 234 (311) Q Consensus 155 ~~~~~~~~~~t~w~~~t~~ei~~di~~~~~~~~~~~~~~~~~p~~l~lpp~~~~~L~~~~~~~~~~~~~Tvl~~l~~n~~ 234 (311) ... ...-++||.+++..+... + ..+.+++++|+.+..|.+-. . ..+.-++ .-.+ + T Consensus 169 ~~~--------------~~~t~~~i~~~~~~l~~~--~--~~~~~~v~n~~~~~~L~~l~-d---~~g~~~~--~~~~-~ 223 (324) T protein:vir:10 169 VIK--------------GDFTQDNIIDLEALLEDD--E--LEANAFISKTQNRSLLRKIV-D---PETKERI--YDRN-S 223 (324) T ss_pred ecc--------------ccCCHHHHHHHHHhhhhc--c--CCCCEEEEcHHHHHHHHHhh-c---cCCceee--cCCC-C Confidence 110 011257778888877432 2 24668999999999986532 1 1111111 1111 1 Q ss_pred ceEEEEchhcc-cCCCCcccEEEEEEcCcceeEEeecchhhhc--------cc----------eeeCCceEEEeeeeeee Q lcl|NC_019522. 235 DITFEDDILLK-GAGVAGADRMAVYKKEIRIVKGHDVMPLRFL--------AP----------ATADNVNFKVPAILRTG 295 (311) Q Consensus 235 ~l~i~~~~~l~-~ag~~g~~~~v~y~~~~~~~~~~~~~~~~~~--------~p----------~~~~~~~~~~~~~~~~g 295 (311) -++...|-.. .+...++..+++ .+ ...+-+.+..++++- .. .+. + ...+.++.|+ T Consensus 224 -~~l~G~PV~~~~~~~~~~~~~~~-gd-~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~-~-~~~~r~~~r~- 297 (324) T protein:vir:10 224 -DTLDGLPVVNLKSSNLKRGELIT-GD-FDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQ-D-MVALRATMHV- 297 (324) T ss_pred -ccccceeEEeecCCCCCcceEEE-Ee-cccEEEEEecCcEEEEeecccccccccccccchhhhhc-C-cEEEEEEEEE- Confidence 1122222111 111122222222 11 111112122221110 00 111 1 2455667777 Q ss_pred eEEEECCeEEEEeecC Q lcl|NC_019522. 296 GTEWRIPKAGHYVDGV 311 (311) Q Consensus 296 Gv~i~~P~ai~~~dGI 311 (311) |..+.+|.|++.+.|. T Consensus 298 d~~v~~~~A~~~l~~a 313 (324) T protein:vir:10 298 ALHIADDKAFAKLVPA 313 (324) T ss_pred ccEEecccceEEEEec Confidence 4666689999999999 No 35 >protein:vir:99749 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1497 # MgeName: phiETA2 # Cross-refs: genbank:acc:YP_001004307;genbank:gi:122891761;genbank:GeneID:4712304 Probab=98.62 E-value=2.3e-08 Score=62.50 Aligned_cols=268 Identities=8% Similarity=-0.035 Sum_probs=153.0 Q ss_pred CCcccccccc----hhhhhhhHHHHHHHHHHHHhhhhhhhhhhhhccccCCCCcceeEEEEEEeecccceEEecCccccc Q lcl|NC_019522. 1 MAKSVFDVSP----VSALSFLVNQAAHIESEIYRIEYPQFKYGTLLPLDNSAPDWAQAVMFRSIDARGELQLFGPNSTDV 76 (311) Q Consensus 1 ~~~~~~~~~~----~~~~~fl~~~L~~id~~v~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~G~a~~~~~~a~di 76 (311) ....+++++. +.+.+ +.. +.+...|++.....-..++++.+.. ....++.+.+.+..+.+.|++.+ ..+ T Consensus 19 ~~~~~~~a~~~~~~~~~~~-lip--~~~~~~ii~~~~~~s~l~~~~~~~~---~~~~~~~~p~~~~~~~a~~v~Eg-~~~ 91 (324) T protein:vir:99 19 VKPQVFNPDNVMMHEKKDG-TLL--NDFTTPILQEVMENSKIMRLGKYEP---MEGTEKKFTFWADKPGAYWVGEG-QKI 91 (324) T ss_pred hhhhhccccceeccCCCcc-eec--hhHHHHHHHHHHhhchhhhhcceee---ccCCceEEEEEecCcceeEeccC-ccc Confidence 3334444332 22222 232 3455667777766666666665443 22334667777777888998765 668 Q ss_pred ceeeeeccceeEEEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhhheeeeeccccc--eeeeecCCccee Q lcl|NC_019522. 77 PTVDIAMSQGFKDINTAALGYTYSIEEIGFAMLNNVNLDAERGQAVRDVVEQGLNKIYLLGDKGVG--EGLYTSPNVSVE 154 (311) Q Consensus 77 p~v~~~~~~~~~~v~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aa~~~~~~~~n~~~~~G~~~~g--~GllN~p~v~~~ 154 (311) |..+..++......+.++....++.+-++.+ ..++...-.....+++.+.+++.+++|+...+ .|+++....... T Consensus 92 ~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds---~~~l~~~i~~~l~~ai~~~~d~~~l~G~g~~~~~~~~~~~~~~~~~ 168 (324) T protein:vir:99 92 ETSKATWVNATMRAFKLGVILPVTKEFLNYT---YSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGKSIAQSIEKTNK 168 (324) T ss_pred cccccceeEEEEeeEEEEEeehhhHHHHhcc---hHHHHHHHHHHHHHHHHHHHHHHhhhcCCCCccCccccccccccce Confidence 9999999999999999999999996544433 35788888899999999999999999976643 466654332111 Q ss_pred eccCCccccCcccccCCHHHHHHHHHHHHHHHHhccCCceecceEEEeCHHHHHHHhcccccCCCCCcchHHHHHHHhCC Q lcl|NC_019522. 155 AATSTFVALVAAIPTNGTQPIIDFFGNAYNTVYLDNTLTVHRPNTFVLPPAQFQLLARTLLSTQNASNVTLLQFLRTNFP 234 (311) Q Consensus 155 ~~~~~~~~~~t~w~~~t~~ei~~di~~~~~~~~~~~~~~~~~p~~l~lpp~~~~~L~~~~~~~~~~~~~Tvl~~l~~n~~ 234 (311) ...+ + .-++||.+++..+.. .+ ..+..++++|+.+..|.+-. . ..+..++ .-.... T Consensus 169 ~~~~----------~----~~~~~i~~~~~~l~~--~~--~~~~~~v~n~~~~~~L~~l~-d---~~g~~~~--~~~~~~ 224 (324) T protein:vir:99 169 VIKG----------D----FTQDNIIDLEALLED--DE--LEANAFISKTQNRSLLRKIV-D---PETKERI--YDRNSD 224 (324) T ss_pred eccc----------c----CCHHHHHHHHHhhhh--cc--CCCCEEEEcHHHHHHHHHhh-c---CCCceee--cCCCCc Confidence 1110 1 115778888887743 22 24568999999999986532 1 1111111 111111 Q ss_pred c---eEEEEchhcccCCCCcccEEEEEEcCcceeEEeecchhhh--------ccc----------eeeCCceEEEeeeee Q lcl|NC_019522. 235 D---ITFEDDILLKGAGVAGADRMAVYKKEIRIVKGHDVMPLRF--------LAP----------ATADNVNFKVPAILR 293 (311) Q Consensus 235 ~---l~i~~~~~l~~ag~~g~~~~v~y~~~~~~~~~~~~~~~~~--------~~p----------~~~~~~~~~~~~~~~ 293 (311) . +.++..+.. ..++..++ +.+ ...+-+.+..++++ ... .+. + ...+.++.| T Consensus 225 ~l~G~PVv~~~~~----~~~~~~~i-~gd-~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~-~-~~~~r~~~r 296 (324) T protein:vir:99 225 TLDGLPVVNLKSS----NLKRGELI-TGD-FDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQ-D-MVALRATMH 296 (324) T ss_pred cccceeEEeecCC----CCCcceEE-EEe-cccEEEEEecCcEEEEeecccccccccccccchhhhhc-C-cEEEEEEEE Confidence 0 112222211 12222222 211 11122222222211 100 111 1 255666778 Q ss_pred eeeEEEECCeEEEEeecC Q lcl|NC_019522. 294 TGGTEWRIPKAGHYVDGV 311 (311) Q Consensus 294 ~gGv~i~~P~ai~~~dGI 311 (311) + |+.+.+|.|++.++|. T Consensus 297 ~-d~~v~~~~a~~~lt~a 313 (324) T protein:vir:99 297 V-ALHIADDKAFAKLVPA 313 (324) T ss_pred E-ccEEecccceEEEEec Confidence 7 5667789999999999 No 36 >protein:vir:8187 Length: 311 # NCBI annotation: gp7 # Family: family:all:966 # MgeID: mge:153 # MgeName: Che9d # Cross-refs: genbank:acc:NP_817980;genbank:gi:29566414;genbank:GeneID:2700968 Probab=98.61 E-value=1.7e-08 Score=63.23 Aligned_cols=277 Identities=10% Similarity=-0.006 Sum_probs=154.1 Q ss_pred cccccccchhhhhhhHHHHHHHHHHHHhhhhhhhhhhhhccccCCCCcceeEEEEEEeecccceEEecCcccccceeeee Q lcl|NC_019522. 3 KSVFDVSPVSALSFLVNQAAHIESEIYRIEYPQFKYGTLLPLDNSAPDWAQAVMFRSIDARGELQLFGPNSTDVPTVDIA 82 (311) Q Consensus 3 ~~~~~~~~~~~~~fl~~~L~~id~~v~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~G~a~~~~~~a~dip~v~~~ 82 (311) +++. +.++++.. +.+...|++...+.-..+++.++.. .+- ....+.+.+..+.+.|++.+ ..+|..+.. T Consensus 1 mat~-----~~gg~lvP--~~~~~~ii~~~~~~s~i~~~~~~i~-~~~--~~~~~p~~~~~~~a~wv~Eg-~~~~~~~~~ 69 (311) T protein:vir:81 1 MVAL-----ATGTFQLP--KHLVPGVWQKAQGQSVLARLSMAEP-QEF--GEQQYMTLTAPPRGEVVGEG-AQKSESTAT 69 (311) T ss_pred Ccee-----cCCceEcc--hhHHHHHHHHHHhcchhhhhcceee-cCC--CceEEEEEeCCceeEEeecC-cccccccce Confidence 2222 23345554 4466778888887777788776543 222 24567777778888998765 668888888 Q ss_pred ccceeEEEEEEEEEEEecHHHHHH-HHHhCCChHHHHHHHHHHHHHHhhhheeeeecccc---c-eeeeecCCcceeecc Q lcl|NC_019522. 83 MSQGFKDINTAALGYTYSIEEIGF-AMLNNVNLDAERGQAVRDVVEQGLNKIYLLGDKGV---G-EGLYTSPNVSVEAAT 157 (311) Q Consensus 83 ~~~~~~~v~~~~~~~~~~~~El~~-a~~~g~~l~~~k~~aa~~~~~~~~n~~~~~G~~~~---g-~GllN~p~v~~~~~~ 157 (311) +++.....+.++.-..+|. |+.+ ......+|...-+...++++.+.+++.+++|+... + .|+++...-...... T Consensus 70 f~~v~l~~~kl~~~~~iS~-ell~~~~d~~~~l~~~i~~~la~ai~~~~d~a~l~G~~~~~~~~~~gi~~~~~~~~~~~~ 148 (311) T protein:vir:81 70 FAPVTAIPRKVQVTQRFSQ-EVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSPAKILDTTNIVE 148 (311) T ss_pred eeEEEEeeEEEEEeehhhH-HHhhcCcccHHHHHHHHHHHHHHHHHHHHHHhhhccccCCCCcccccccccccccceeee Confidence 9999999999998888874 4443 33455678888899999999999999999997432 2 255544211111111 Q ss_pred CCccccCcccccCCHHHHHHHHHHHHHHHHhccCCceecceEEEeCHHHHHHHhcccccCCCCCcchHHHHHHHhCCceE Q lcl|NC_019522. 158 STFVALVAAIPTNGTQPIIDFFGNAYNTVYLDNTLTVHRPNTFVLPPAQFQLLARTLLSTQNASNVTLLQFLRTNFPDIT 237 (311) Q Consensus 158 ~~~~~~~t~w~~~t~~ei~~di~~~~~~~~~~~~~~~~~p~~l~lpp~~~~~L~~~~~~~~~~~~~Tvl~~l~~n~~~l~ 237 (311) ....+...+..+|.+++..+... . ..|+.++|.|+.+..|.+-. ..+ +.-++.-.......-+ T Consensus 149 ---------~~~~~~~~~~~~i~~~~~~~~~~-~---~~~~~~vmn~~~~~~l~~lk-d~~---G~~l~~~~~~~~~~~t 211 (311) T protein:vir:81 149 ---------LTTGTSATPDLAVEAAVGLVLGD-N---LSPDGVALDNTFSFMLATQR-DSQ---GRKLYPELGFGTDVAS 211 (311) T ss_pred ---------ecccccchHHHHHHHHHHHhhhc-C---CCceEEEEcHHHHHHHHhhh-ccC---CCeeecCccccCCCce Confidence 11122223456677777776432 2 24677999999999986522 111 1111110000000011 Q ss_pred -----EEEchhccc------------CCCCcccEEEEEEcCc------ceeEEeecchhh---hccceeeCCceEEEeee Q lcl|NC_019522. 238 -----FEDDILLKG------------AGVAGADRMAVYKKEI------RIVKGHDVMPLR---FLAPATADNVNFKVPAI 291 (311) Q Consensus 238 -----i~~~~~l~~------------ag~~g~~~~v~y~~~~------~~~~~~~~~~~~---~~~p~~~~~~~~~~~~~ 291 (311) ++....+.+ ....+..++++-+.+. +.+.+.+-.... ...-.+. + ...+.+. T Consensus 212 l~G~Pv~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~gDfs~~~i~~~~~~~~~~~~~~~~~~~~~~~~~-~-~v~~r~~ 289 (311) T protein:vir:81 212 FAGLNAAVSDTVRGGPEAVTASTGVYRTTNPNVKAIAGDFSAFRWGVQVSIPLELIEFGDPDGLGDLKRQ-N-QIAIRAE 289 (311) T ss_pred ecceeEEecccccccccccccccchhcccCCccEEEEEecccEEEEEeccceEEEeccCCCCcchhhhhc-C-cEEEEEE Confidence 111111111 0112223333322221 122222211100 0000111 1 2556677 Q ss_pred eeeeeEEEECCeEEEEeecC Q lcl|NC_019522. 292 LRTGGTEWRIPKAGHYVDGV 311 (311) Q Consensus 292 ~~~gGv~i~~P~ai~~~dGI 311 (311) .|+ |..+.+|.||+++.|. T Consensus 290 ~r~-d~~v~~~~a~~~l~~a 308 (311) T protein:vir:81 290 VVY-GIGIMSTDAFAVVRDA 308 (311) T ss_pred EEe-ccEeecccceEEEEee Confidence 887 5788899999999999 No 37 >protein:vir:97148 Length: 324 # NCBI annotation: ORF010 # Family: family:all:507 # MgeID: mge:1654 # MgeName: 85 # Cross-refs: genbank:acc:YP_239726;genbank:gi:66394880;genbank:GeneID:5130881 Probab=98.58 E-value=3.2e-08 Score=61.64 Aligned_cols=273 Identities=8% Similarity=-0.031 Sum_probs=152.2 Q ss_pred CCccccccc---chhhhhhhHHHHHHHHHHHHhhhhhhhhhhhhccccCCCCcceeEEEEEEeecccceEEecCcccccc Q lcl|NC_019522. 1 MAKSVFDVS---PVSALSFLVNQAAHIESEIYRIEYPQFKYGTLLPLDNSAPDWAQAVMFRSIDARGELQLFGPNSTDVP 77 (311) Q Consensus 1 ~~~~~~~~~---~~~~~~fl~~~L~~id~~v~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~G~a~~~~~~a~dip 77 (311) .....+++. ..+..+.+.. +.+...|++........++++.+.. ....++.+.+.+..+.+.|++.+ ..+| T Consensus 19 ~~~~~~~a~~~~~~~~~~~~iP--~~~~~~ii~~~~~~s~l~~~~~~~~---~~~~~~~ip~~~~~~~a~~v~Eg-~~~~ 92 (324) T protein:vir:97 19 VKPQVFNPDNVMMHEKKDGTLM--NEFTTPILQEVMENSKIMQLGKYEP---MEGTEKKFTFWADKPGAYWVGEG-QKIE 92 (324) T ss_pred hhhhhhccccccccCCCcceec--hhHHHHHHHHHHhhcchhhhcceee---ccCCceEEEEEecCcceeEeccC-cccc Confidence 111111111 1112222332 3455667777776666666665433 33345677777788889999876 5689 Q ss_pred eeeeeccceeEEEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhhheeeeeccccc--eeeeecCCcceee Q lcl|NC_019522. 78 TVDIAMSQGFKDINTAALGYTYSIEEIGFAMLNNVNLDAERGQAVRDVVEQGLNKIYLLGDKGVG--EGLYTSPNVSVEA 155 (311) Q Consensus 78 ~v~~~~~~~~~~v~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aa~~~~~~~~n~~~~~G~~~~g--~GllN~p~v~~~~ 155 (311) ..+..++......+.++....++.+=++. ...++...-.....+++.+.+++.+++|+...+ .|+++........ T Consensus 93 ~~~~~f~~v~~~~~k~~~~~~is~ell~d---s~~~l~~~i~~~l~~aia~~~d~a~l~G~g~~~~~~gi~~~~~~~~~~ 169 (324) T protein:vir:97 93 TSKATWVNATMRAFKLGVILPVTKEFLNY---TYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGKSIAQSIEKTNKV 169 (324) T ss_pred ccccceeEEEEeeEEEEEeehhhHHHHhc---chHHHHHHHHHHHHHHHHHHHHHHhhccCCCCccCcccccccccccee Confidence 99999999999999999999999643333 346788999999999999999999999987644 4777654432211 Q ss_pred ccCCccccCcccccCCHHHHHHHHHHHHHHHHhccCCceecceEEEeCHHHHHHHhcccccCCCCCcchHHHHHHHhCCc Q lcl|NC_019522. 156 ATSTFVALVAAIPTNGTQPIIDFFGNAYNTVYLDNTLTVHRPNTFVLPPAQFQLLARTLLSTQNASNVTLLQFLRTNFPD 235 (311) Q Consensus 156 ~~~~~~~~~t~w~~~t~~ei~~di~~~~~~~~~~~~~~~~~p~~l~lpp~~~~~L~~~~~~~~~~~~~Tvl~~l~~n~~~ 235 (311) ..+ +.+ ++||.+++.++... + ..+.+++|+|+.+..|.+.. +..+..++. ..+. T Consensus 170 ~~~----------~~~----~~~i~~~~~~l~~~--~--~~~~~~v~n~~~~~~L~~lk----d~~g~~~~~----~~~~ 223 (324) T protein:vir:97 170 IKG----------DFT----QDNIIDLEALLEDD--E--LEANAFISKTQNRSLLRKIV----DPETKERIY----DRNS 223 (324) T ss_pred ccc----------cCC----HHHHHHHHHhhhhc--c--CCCCEEEEcHHHHHHHHHhh----cCCCceeec----CCCC Confidence 111 112 56778888887432 2 24568999999999987532 111222111 0111 Q ss_pred eEEEEchhc-ccCCCCcccEEEEEEc------CcceeEEeecchhhhccc----------eeeCCceEEEeeeeeeeeEE Q lcl|NC_019522. 236 ITFEDDILL-KGAGVAGADRMAVYKK------EIRIVKGHDVMPLRFLAP----------ATADNVNFKVPAILRTGGTE 298 (311) Q Consensus 236 l~i~~~~~l-~~ag~~g~~~~v~y~~------~~~~~~~~~~~~~~~~~p----------~~~~~~~~~~~~~~~~gGv~ 298 (311) -++...|-. ..+...+...+++-+. ..+.+.+.+-........ .+. + ...+.+..|+ ++. T Consensus 224 ~tl~G~PV~~~~~~~~~~~~~~~gd~~~~~i~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~-d-~~~~r~~~r~-d~~ 300 (324) T protein:vir:97 224 DTLDGLPVVNLKSSNLKRGELITGDFDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQ-D-MVALRATMHV-ALH 300 (324) T ss_pred ccccceeeEeecCCCCCcceEEEEecccEEEEEecCcEEEEeecccccccccccccchhhhhc-C-cEEEEEEEEe-ccE Confidence 111111111 0111111112221111 111222222111100000 111 1 2445666777 566 Q ss_pred EECCeEEEEeecC Q lcl|NC_019522. 299 WRIPKAGHYVDGV 311 (311) Q Consensus 299 i~~P~ai~~~dGI 311 (311) +.+|.|++.+.+. T Consensus 301 v~~~~a~~~l~~~ 313 (324) T protein:vir:97 301 IADDKAFAKLVPA 313 (324) T ss_pred EecccceEEEEec Confidence 6789999999999 No 38 >protein:vir:7855 Length: 497 # NCBI annotation: gp12 # Family: family:all:585 # MgeID: mge:150 # MgeName: CJW1 # Cross-refs: genbank:acc:NP_817462;genbank:gi:29565891;genbank:GeneID:1259081 Probab=98.55 E-value=3.3e-08 Score=61.57 Aligned_cols=283 Identities=13% Similarity=0.052 Sum_probs=160.5 Q ss_pred CCcccccccchhhhhhhHHHHHHHHHHHHhhhhhhhhhhhhccccCCCCcceeEEEEEEee-cccceEEecCccccccee Q lcl|NC_019522. 1 MAKSVFDVSPVSALSFLVNQAAHIESEIYRIEYPQFKYGTLLPLDNSAPDWAQAVMFRSID-ARGELQLFGPNSTDVPTV 79 (311) Q Consensus 1 ~~~~~~~~~~~~~~~fl~~~L~~id~~v~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~-~~G~a~~~~~~a~dip~v 79 (311) .....+...+.+.+++++. +.+.+.|++..++....+.++++..-.+ .++.|.... ..+.+.|++.+ ..+|.. T Consensus 146 ~~~~~~~~~~~~~gg~~vp--~~~~~~ii~~~~~~~~i~~l~~~~~~~~---~~~~~~~~~~~~~~a~wv~E~-~~~~~s 219 (497) T protein:vir:78 146 AAIGQNPFGSTGTFAPGIL--PTFLPGIVEQLFYELSLADLISSRPVTS---PNLSYLTESAAHNNAAAVAEA-GTYPFS 219 (497) T ss_pred HHHHhhhcccCcccccccc--hhhhHHHHHHHHhhhhHHhhccccccCC---CceEEEEEcCCCCcceeeccC-cccccc Confidence 0111111222334445554 4566789998888888888887644332 245555433 34578888765 568999 Q ss_pred eeeccceeEEEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhhheeeeeccccc-eeeeecCCcceeeccC Q lcl|NC_019522. 80 DIAMSQGFKDINTAALGYTYSIEEIGFAMLNNVNLDAERGQAVRDVVEQGLNKIYLLGDKGVG-EGLYTSPNVSVEAATS 158 (311) Q Consensus 80 ~~~~~~~~~~v~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aa~~~~~~~~n~~~~~G~~~~g-~GllN~p~v~~~~~~~ 158 (311) +..++......+.++.-..+|. ||..-. . .|.+--.....+++.+.+|+-+++|+...+ .|++|.+.+....... T Consensus 220 ~~~f~~i~~~~~k~a~~~~iS~-ell~d~--~-~l~~~i~~~l~~~i~~~~d~~~l~G~G~~~p~Gil~~~~~~~~~~~~ 295 (497) T protein:vir:78 220 SEEFARVYEQVGKVANALTITD-EGLRDA--P-ELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSAS 295 (497) T ss_pred cccceeeEeeeeeeEeecHhHH-HHHHhH--H-HHHHHHHHHHHHHHHHHHHHHhhcCCCcccccccccccccccccccc Confidence 9999999999999999888885 555422 2 488888899999999999999999987655 5999998765443322 Q ss_pred CccccC------------------------------------------cccccCCHHHHHHHHHHHHHHHHhccCCceec Q lcl|NC_019522. 159 TFVALV------------------------------------------AAIPTNGTQPIIDFFGNAYNTVYLDNTLTVHR 196 (311) Q Consensus 159 ~~~~~~------------------------------------------t~w~~~t~~ei~~di~~~~~~~~~~~~~~~~~ 196 (311) ...... ..-...+....+.++..++..+... .... T Consensus 296 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~ 372 (497) T protein:vir:78 296 SLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLT---LFQT 372 (497) T ss_pred cchhhhhhhhhhhhhhcccccchhhhhhHHHHHHHHHhhhhhhhhccchhccccchhhhhhHHHHHHhhhhhh---cccC Confidence 111100 0001123445566666666666442 2345 Q ss_pred ceEEEeCHHHHHHHhcccccCCCCCcchHHH---------HHHHh--CCceEEEEchhcccCCCCcccEEEEEEcCc--- Q lcl|NC_019522. 197 PNTFVLPPAQFQLLARTLLSTQNASNVTLLQ---------FLRTN--FPDITFEDDILLKGAGVAGADRMAVYKKEI--- 262 (311) Q Consensus 197 p~~l~lpp~~~~~L~~~~~~~~~~~~~Tvl~---------~l~~n--~~~l~i~~~~~l~~ag~~g~~~~v~y~~~~--- 262 (311) |+.++|.|..+..|.+-. +..|.-++. ..... .....++..+.+. +| + .++-+-+. T Consensus 373 ~~~~vmn~~~~~~l~~lk----d~~G~~i~~~~~~~~~~~~~~~~~~l~G~pV~~t~~~~-~~---~--~~~Gd~~~~~~ 442 (497) T protein:vir:78 373 PNAVVMNPRDWELLRLTK----DANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIP-LG---T--ILVGHFAPSVI 442 (497) T ss_pred CCeEEEchHHHHHHHHhh----cCCCceeccCcccccccccccCCceeeceeeEecCCCC-CC---c--eEEeecccceE Confidence 778999999998875422 111221110 00000 0011222222221 11 1 11111111 Q ss_pred -----ceeEEeecchhhhccceeeCCceEEEeeeeeeeeEEEECCeEEEEeecC Q lcl|NC_019522. 263 -----RIVKGHDVMPLRFLAPATADNVNFKVPAILRTGGTEWRIPKAGHYVDGV 311 (311) Q Consensus 263 -----~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~gGv~i~~P~ai~~~dGI 311 (311) ..+++.... .-....+. + ...+.++.|++ +.+++|.||++++-. T Consensus 443 ~i~~r~~~~v~~~~--~~~~~f~~-n-~v~~r~~~r~~-~~v~~p~A~~~l~~~ 491 (497) T protein:vir:78 443 QTARREGVTMQMTN--SNGTDFVD-G-KVTVRAEERLG-LLVYRPSAFQLIQLK 491 (497) T ss_pred EEEEecccEEEeec--ccchhhhc-C-cEEEEEEEeec-ceeeccccEEEEEec Confidence 112221110 00011222 2 35677788885 588999999999988 No 39 >protein:vir:101650 Length: 497 # NCBI annotation: gp13 # Family: family:all:585 # MgeID: mge:1515 # MgeName: 244 # Cross-refs: genbank:acc:YP_654768;genbank:gi:109302766;genbank:GeneID:4156084 Probab=98.55 E-value=3.3e-08 Score=61.57 Aligned_cols=283 Identities=13% Similarity=0.052 Sum_probs=160.5 Q ss_pred CCcccccccchhhhhhhHHHHHHHHHHHHhhhhhhhhhhhhccccCCCCcceeEEEEEEee-cccceEEecCccccccee Q lcl|NC_019522. 1 MAKSVFDVSPVSALSFLVNQAAHIESEIYRIEYPQFKYGTLLPLDNSAPDWAQAVMFRSID-ARGELQLFGPNSTDVPTV 79 (311) Q Consensus 1 ~~~~~~~~~~~~~~~fl~~~L~~id~~v~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~-~~G~a~~~~~~a~dip~v 79 (311) .....+...+.+.+++++. +.+.+.|++..++....+.++++..-.+ .++.|.... ..+.+.|++.+ ..+|.. T Consensus 146 ~~~~~~~~~~~~~gg~~vp--~~~~~~ii~~~~~~~~i~~l~~~~~~~~---~~~~~~~~~~~~~~a~wv~E~-~~~~~s 219 (497) T protein:vir:10 146 AAIGQNPFGSTGTFAPGIL--PTFLPGIVEQLFYELSLADLISSRPVTS---PNLSYLTESAAHNNAAAVAEA-GTYPFS 219 (497) T ss_pred HHHHhhhcccCcccccccc--hhhhHHHHHHHHhhhhHHhhccccccCC---CceEEEEEcCCCCcceeeccC-cccccc Confidence 0111111222334445554 4566789998888888888887644332 245555433 34578888765 568999 Q ss_pred eeeccceeEEEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhhheeeeeccccc-eeeeecCCcceeeccC Q lcl|NC_019522. 80 DIAMSQGFKDINTAALGYTYSIEEIGFAMLNNVNLDAERGQAVRDVVEQGLNKIYLLGDKGVG-EGLYTSPNVSVEAATS 158 (311) Q Consensus 80 ~~~~~~~~~~v~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aa~~~~~~~~n~~~~~G~~~~g-~GllN~p~v~~~~~~~ 158 (311) +..++......+.++.-..+|. ||..-. . .|.+--.....+++.+.+|+-+++|+...+ .|++|.+.+....... T Consensus 220 ~~~f~~i~~~~~k~a~~~~iS~-ell~d~--~-~l~~~i~~~l~~~i~~~~d~~~l~G~G~~~p~Gil~~~~~~~~~~~~ 295 (497) T protein:vir:10 220 SEEFARVYEQVGKVANALTITD-EGLRDA--P-ELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSAS 295 (497) T ss_pred cccceeeEeeeeeeEeecHhHH-HHHHhH--H-HHHHHHHHHHHHHHHHHHHHHhhcCCCcccccccccccccccccccc Confidence 9999999999999999888885 555422 2 488888899999999999999999987655 5999998765443322 Q ss_pred CccccC------------------------------------------cccccCCHHHHHHHHHHHHHHHHhccCCceec Q lcl|NC_019522. 159 TFVALV------------------------------------------AAIPTNGTQPIIDFFGNAYNTVYLDNTLTVHR 196 (311) Q Consensus 159 ~~~~~~------------------------------------------t~w~~~t~~ei~~di~~~~~~~~~~~~~~~~~ 196 (311) ...... ..-...+....+.++..++..+... .... T Consensus 296 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~ 372 (497) T protein:vir:10 296 SLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLT---LFQT 372 (497) T ss_pred cchhhhhhhhhhhhhhcccccchhhhhhHHHHHHHHHhhhhhhhhccchhccccchhhhhhHHHHHHhhhhhh---cccC Confidence 111100 0001123445566666666666442 2345 Q ss_pred ceEEEeCHHHHHHHhcccccCCCCCcchHHH---------HHHHh--CCceEEEEchhcccCCCCcccEEEEEEcCc--- Q lcl|NC_019522. 197 PNTFVLPPAQFQLLARTLLSTQNASNVTLLQ---------FLRTN--FPDITFEDDILLKGAGVAGADRMAVYKKEI--- 262 (311) Q Consensus 197 p~~l~lpp~~~~~L~~~~~~~~~~~~~Tvl~---------~l~~n--~~~l~i~~~~~l~~ag~~g~~~~v~y~~~~--- 262 (311) |+.++|.|..+..|.+-. +..|.-++. ..... .....++..+.+. +| + .++-+-+. T Consensus 373 ~~~~vmn~~~~~~l~~lk----d~~G~~i~~~~~~~~~~~~~~~~~~l~G~pV~~t~~~~-~~---~--~~~Gd~~~~~~ 442 (497) T protein:vir:10 373 PNAVVMNPRDWELLRLTK----DANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIP-LG---T--ILVGHFAPSVI 442 (497) T ss_pred CCeEEEchHHHHHHHHhh----cCCCceeccCcccccccccccCCceeeceeeEecCCCC-CC---c--eEEeecccceE Confidence 778999999998875422 111221110 00000 0011222222221 11 1 11111111 Q ss_pred -----ceeEEeecchhhhccceeeCCceEEEeeeeeeeeEEEECCeEEEEeecC Q lcl|NC_019522. 263 -----RIVKGHDVMPLRFLAPATADNVNFKVPAILRTGGTEWRIPKAGHYVDGV 311 (311) Q Consensus 263 -----~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~gGv~i~~P~ai~~~dGI 311 (311) ..+++.... .-....+. + ...+.++.|++ +.+++|.||++++-. T Consensus 443 ~i~~r~~~~v~~~~--~~~~~f~~-n-~v~~r~~~r~~-~~v~~p~A~~~l~~~ 491 (497) T protein:vir:10 443 QTARREGVTMQMTN--SNGTDFVD-G-KVTVRAEERLG-LLVYRPSAFQLIQLK 491 (497) T ss_pred EEEEecccEEEeec--ccchhhhc-C-cEEEEEEEeec-ceeeccccEEEEEec Confidence 112221110 00011222 2 35677788885 588999999999988 No 40 >protein:vir:9309 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:165 # MgeName: phi 11 # Cross-refs: genbank:acc:NP_803287;genbank:gi:29028597;genbank:GeneID:1258044 Probab=98.54 E-value=4.4e-08 Score=60.89 Aligned_cols=270 Identities=8% Similarity=-0.020 Sum_probs=151.7 Q ss_pred CCccccccc----chhhhhhhHHHHHHHHHHHHhhhhhhhhhhhhccccCCCCcceeEEEEEEeecccceEEecCccccc Q lcl|NC_019522. 1 MAKSVFDVS----PVSALSFLVNQAAHIESEIYRIEYPQFKYGTLLPLDNSAPDWAQAVMFRSIDARGELQLFGPNSTDV 76 (311) Q Consensus 1 ~~~~~~~~~----~~~~~~fl~~~L~~id~~v~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~G~a~~~~~~a~di 76 (311) .....++++ +..+.+.+- +.+...|++.....-..++++.+.. .....+.|.+.+..+.+.+++.+ ..+ T Consensus 19 ~~~~~~~a~~~~~~~~~~~liP---~~~~~~ii~~~~~~s~l~~l~~~~~---~~~~~~~ip~~~~~~~a~~v~Eg-~~~ 91 (324) T protein:vir:93 19 VKPQVFNPDNVMMHEKKDGTLL---NDFTTPILQEVMENSKIMQLGKYEP---MEGTEKKFTFWADKPGAYWVGEG-QKI 91 (324) T ss_pred hhhhhcccccccccCCCcceec---hhHHHHHHHHHHhhchhhhhcceee---ccCCceEEEEEecCcceeeecCC-ccc Confidence 333333322 122223333 3345667776666666666655432 22334567777777888898765 668 Q ss_pred ceeeeeccceeEEEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhhheeeeeccccc--eeeeecCCccee Q lcl|NC_019522. 77 PTVDIAMSQGFKDINTAALGYTYSIEEIGFAMLNNVNLDAERGQAVRDVVEQGLNKIYLLGDKGVG--EGLYTSPNVSVE 154 (311) Q Consensus 77 p~v~~~~~~~~~~v~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aa~~~~~~~~n~~~~~G~~~~g--~GllN~p~v~~~ 154 (311) |..+..++......+.++..+.++.+=++. +..++...-.....+++++.+++.+++|+...+ .|+++....... T Consensus 92 ~~~~~~f~~i~~~~~k~~~~~~iS~ell~d---s~~~l~~~i~~~l~~aia~~~d~a~l~G~g~~~~~~~~~~~~~~~~~ 168 (324) T protein:vir:93 92 ETSKATWVNATMRAFKLGVILPVTKEFLNY---TYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGKSIAQSIEKTNK 168 (324) T ss_pred cccccceeEEEEEeEEEEEeehhhHHHHhc---chHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCcCccccccccccce Confidence 988899999999999999999998644433 235788888899999999999999999976543 466654432211 Q ss_pred eccCCccccCcccccCCHHHHHHHHHHHHHHHHhccCCceecceEEEeCHHHHHHHhcccccCCCCCcchHHHHHHHhCC Q lcl|NC_019522. 155 AATSTFVALVAAIPTNGTQPIIDFFGNAYNTVYLDNTLTVHRPNTFVLPPAQFQLLARTLLSTQNASNVTLLQFLRTNFP 234 (311) Q Consensus 155 ~~~~~~~~~~t~w~~~t~~ei~~di~~~~~~~~~~~~~~~~~p~~l~lpp~~~~~L~~~~~~~~~~~~~Tvl~~l~~n~~ 234 (311) ...+ + .-++||.+++..+... + ..+.+++++|+.+..|.+.. . ..+.-++ .... T Consensus 169 ~~~~----------~----~~~~~i~~~~~~l~~~--~--~~~~~~v~n~~~~~~L~~l~-d---~~G~~~~----~~~~ 222 (324) T protein:vir:93 169 VIKG----------D----FTQDNIIDLEALLEDD--E--LEANAFISKTQNRSLLRKIV-D---PETKERI----YDRN 222 (324) T ss_pred eccc----------c----ccHHHHHHHHHhhhhc--c--CCCCEEEEcHHHHHHHHHhh-C---CCCCeee----cCCC Confidence 1110 1 1157788888887432 2 24568999999999996532 1 1122111 1111 Q ss_pred ceEEEEchhc-ccCCCCcccEEEEEEcCcceeEEeecchhhhcc-----------c-------eeeCCceEEEeeeeeee Q lcl|NC_019522. 235 DITFEDDILL-KGAGVAGADRMAVYKKEIRIVKGHDVMPLRFLA-----------P-------ATADNVNFKVPAILRTG 295 (311) Q Consensus 235 ~l~i~~~~~l-~~ag~~g~~~~v~y~~~~~~~~~~~~~~~~~~~-----------p-------~~~~~~~~~~~~~~~~g 295 (311) .-++...|-. ..+...++..+++-+ ...+-+....++++.. + .+. + ...+.+..|+ T Consensus 223 ~~~l~G~PVv~~~~~~~~~~~i~~gd--fs~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~-n-~~~~r~~~r~- 297 (324) T protein:vir:93 223 SDSLDGLPVVNLKSSNLKRGELITGD--FDKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQ-D-MVALRATMHV- 297 (324) T ss_pred CCcccceeeEeecCCCCCcceEEEEe--cceEEEEEecCcEEEEeecccccccccccccchhhhhc-C-cEEEEEEEEe- Confidence 1111111111 011112222222211 1112122222221110 0 111 1 2456667777 Q ss_pred eEEEECCeEEEEeecC Q lcl|NC_019522. 296 GTEWRIPKAGHYVDGV 311 (311) Q Consensus 296 Gv~i~~P~ai~~~dGI 311 (311) |+.+.+|.|++++.+. T Consensus 298 d~~v~~~~a~~~l~~a 313 (324) T protein:vir:93 298 ALHIADDKAFAKLVPA 313 (324) T ss_pred ccEEecccceEEEecc Confidence 5778899999999999 No 41 >protein:vir:80684 Length: 315 # NCBI annotation: gp6 # Family: family:all:966 # MgeID: mge:1884 # MgeName: PA6 # Cross-refs: genbank:acc:YP_001285582;genbank:gi:148727088;genbank:GeneID:5247055 Probab=98.54 E-value=3.9e-08 Score=61.23 Aligned_cols=280 Identities=12% Similarity=-0.029 Sum_probs=151.1 Q ss_pred ccccchhhhhhhHHHHHHHHHHHHhhhhhhhhhhhhccccCCCCcceeEEEEEEeecccceEEecCcccccceeeeeccc Q lcl|NC_019522. 6 FDVSPVSALSFLVNQAAHIESEIYRIEYPQFKYGTLLPLDNSAPDWAQAVMFRSIDARGELQLFGPNSTDVPTVDIAMSQ 85 (311) Q Consensus 6 ~~~~~~~~~~fl~~~L~~id~~v~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~G~a~~~~~~a~dip~v~~~~~~ 85 (311) |-..+.+.++++.. +.+..+|++.....-..+++..+.. .....+.+.+....+.|.|++.+ ..+|..+..+++ T Consensus 1 Ma~~~~~~gg~~vP--~~~~~~ii~~l~~~s~i~~l~~~i~---~~~~~~~ip~~~~~~~a~wv~Eg-~~~~~s~~~f~~ 74 (315) T protein:vir:80 1 MADDFLSAGKLELP--GSMIGAVRDRAIDSGVLAKLSPEQP---TIFGPVKGAVFSGVPRAKIVGEG-EVKPSASVDVSA 74 (315) T ss_pred CCCCcCCcCceEcc--hHHHHHHHHHHHhhchhhhhcceee---cCCCceEEEEEeCCcceEEeeCC-ccccccccceee Confidence 33333334445554 4456778887777766777655432 23345677777888889999875 568998999999 Q ss_pred eeEEEEEEEEEEEecHHHHHHHHHhC-C-ChHHHHHHHHHHHHHHhhhheeeeecccc-ce---eeeecCCcceeeccCC Q lcl|NC_019522. 86 GFKDINTAALGYTYSIEEIGFAMLNN-V-NLDAERGQAVRDVVEQGLNKIYLLGDKGV-GE---GLYTSPNVSVEAATST 159 (311) Q Consensus 86 ~~~~v~~~~~~~~~~~~El~~a~~~g-~-~l~~~k~~aa~~~~~~~~n~~~~~G~~~~-g~---GllN~p~v~~~~~~~~ 159 (311) .....+.++.-..+|.+ +.+..... . .|.+.-....++++++.+++-+|+|+... +. |+.+.-+.... T Consensus 75 v~l~~~kl~~~~~iS~e-ll~~s~~~~~~~l~~~i~~~la~ai~~~~d~a~~~G~~~~~~~~~~~~~~~~~~~~~----- 148 (315) T protein:vir:80 75 FTAQPIKVVTQQRVSDE-FMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATGKAASAVHTSLNKTKN----- 148 (315) T ss_pred eEeeeeeEEeeehhhHH-HhhcCchhHHHHHHHHHHHHHHHHHHHHHhhheeeccCCCCCccccccccccccccc----- Confidence 99999999988888844 44322111 1 26677788899999999999999997532 11 33222111000 Q ss_pred ccccCcccccCCHHHHHHHHHHHHHHHHhccCCceecceEEEeCHHHHHHHhcccccCC-CCCcchHHHHHHHhCC---- Q lcl|NC_019522. 160 FVALVAAIPTNGTQPIIDFFGNAYNTVYLDNTLTVHRPNTFVLPPAQFQLLARTLLSTQ-NASNVTLLQFLRTNFP---- 234 (311) Q Consensus 160 ~~~~~t~w~~~t~~ei~~di~~~~~~~~~~~~~~~~~p~~l~lpp~~~~~L~~~~~~~~-~~~~~Tvl~~l~~n~~---- 234 (311) ........++||.+++.++... ....++..+|.|+.+..|.+-..... +..+..+..=+....+ T Consensus 149 --------~~~~~~~~~~d~~~~~~~~~~~---~~~~~~~~imn~~~~~~L~~l~~~~g~~~~g~~~~~~~~~g~~~tl~ 217 (315) T protein:vir:80 149 --------IVDATDSATADLVKAVGLIAGA---GLQVPNGVALDPAFSFALSTEVYPKGSPLAGQPMYPAAGFAGLDNWR 217 (315) T ss_pred --------eeeccccchHHHHHHHHHHhhc---cCccceEEEEcHHHHHHHHHHhhccCCcccccccccccccCCCceec Confidence 1111223467888888777432 22345679999999988865432110 1111111100001111 Q ss_pred ceEEEEchhcc---cCCCCcccEEEEEEcC------cceeEEeecchhhhc-cc---eeeCCceEEEeeeeeeeeEEEEC Q lcl|NC_019522. 235 DITFEDDILLK---GAGVAGADRMAVYKKE------IRIVKGHDVMPLRFL-AP---ATADNVNFKVPAILRTGGTEWRI 301 (311) Q Consensus 235 ~l~i~~~~~l~---~ag~~g~~~~v~y~~~------~~~~~~~~~~~~~~~-~p---~~~~~~~~~~~~~~~~gGv~i~~ 301 (311) .+-+.....+. ..+...+..++.-+-+ .+.+++.+...-.-. .+ .|. + ...+.+..|+ |..|++ T Consensus 218 G~PV~~~~~~~~~~~~~~~~~~~~~~GDfs~~~~g~~~~~~i~i~~~~~~~~~~~~~~~~-~-~v~~r~~~r~-~~~v~~ 294 (315) T protein:vir:80 218 GLNVGASSTVSGAPEMSPASGVKAIVGDFSRVHWGFQRNFPIELIEYGDPDQTGRDLKGH-N-EVMVRAEAVL-YVAIES 294 (315) T ss_pred ceeeEecCcCCcccccccccccEEEEeecccEEEEEecCeeEEEeccccccCcccchhhc-C-cEEEEEEEEe-cceeec Confidence 11222222221 1111112222221111 122222221100000 00 111 1 2556677887 688999 Q ss_pred CeEEEEeecC Q lcl|NC_019522. 302 PKAGHYVDGV 311 (311) Q Consensus 302 P~ai~~~dGI 311 (311) |.|++++.+. T Consensus 295 ~~a~~~l~~~ 304 (315) T protein:vir:80 295 LDSFAVVKEK 304 (315) T ss_pred ccceEEEeec Confidence 9999999999 No 42 >protein:vir:41 Length: 299 # NCBI annotation: major capsid protein # Family: family:all:507 # MgeID: mge:2 # MgeName: A118 # Cross-refs: genbank:acc:NP_463467;swissprot:trembl:q9t1b7;genbank:gi:16798789;uniprot:Q9T1B7;genbank:GeneID:922353 Probab=98.54 E-value=3.5e-08 Score=61.47 Aligned_cols=270 Identities=10% Similarity=-0.023 Sum_probs=157.0 Q ss_pred ccccccc---hhhhhhhHHHHHHHHHHHHhhhhhhhhhhhhccccCCCCcceeEEEEEEeecccceEEecCcccccceee Q lcl|NC_019522. 4 SVFDVSP---VSALSFLVNQAAHIESEIYRIEYPQFKYGTLLPLDNSAPDWAQAVMFRSIDARGELQLFGPNSTDVPTVD 80 (311) Q Consensus 4 ~~~~~~~---~~~~~fl~~~L~~id~~v~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~G~a~~~~~~a~dip~v~ 80 (311) +.++++. .+.++.+.. +.+..+|++.....-..+++..+.. .+ .....+.+.+. ..+.+++. +..+|..+ T Consensus 1 ~g~~a~~~~~~~~~~~~iP--~~~~~~ii~~~~~~s~l~~~~~~~~-~~--~~~~~~~~~~~-~~a~~v~E-~~~~~~~~ 73 (299) T protein:vir:41 1 MGFNPDTTTMQSAKTGSIP--INISEQIITGVKNGSAAMKLAKAVP-MT--KPEEEFTFMSG-VGAFWVDE-AERIQTSK 73 (299) T ss_pred CCcCCCcccccCCCceecc--hhHHHHHHHHHHhcchhhhhceeee-cC--CCcEEEEEEcC-Cceeeeec-Cccccccc Confidence 5555553 222233443 4566778887777767777665433 22 23334444443 45778865 46689889 Q ss_pred eeccceeEEEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhhheeeeeccccc-eeeeecCCcceeeccCC Q lcl|NC_019522. 81 IAMSQGFKDINTAALGYTYSIEEIGFAMLNNVNLDAERGQAVRDVVEQGLNKIYLLGDKGVG-EGLYTSPNVSVEAATST 159 (311) Q Consensus 81 ~~~~~~~~~v~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aa~~~~~~~~n~~~~~G~~~~g-~GllN~p~v~~~~~~~~ 159 (311) ..++........++..+.++.+=++ ....++...-.....+++.+.+|+-+++|+.... .|+++........+.. T Consensus 74 ~~f~~v~l~~~k~~~~~~is~ell~---ds~~~~~~~i~~~l~~a~~~~~d~a~l~G~g~~~~~gil~~~~~~~~~~~~- 149 (299) T protein:vir:41 74 PTFTKAKMRSKKMGVIIPTTKENLN---YSVTNFFSLMQAEIVEAFYKKFDQAVFTGVESPYNWNILKSATDASNLVEE- 149 (299) T ss_pred cceeEEEEeeEEEEEeehhhHHHHh---cCHHHHHHHHHHHHHHHHHHHHHHHHhhcccCcccccccccccccceeecc- Confidence 9999999999999999999964333 3346788999999999999999999999987644 6888765432222111 Q ss_pred ccccCcccccCCHHHHHHHHHHHHHHHHhccCCceecceEEEeCHHHHHHHhcccccCCCCCcchHHHHHHHhCC----c Q lcl|NC_019522. 160 FVALVAAIPTNGTQPIIDFFGNAYNTVYLDNTLTVHRPNTFVLPPAQFQLLARTLLSTQNASNVTLLQFLRTNFP----D 235 (311) Q Consensus 160 ~~~~~t~w~~~t~~ei~~di~~~~~~~~~~~~~~~~~p~~l~lpp~~~~~L~~~~~~~~~~~~~Tvl~~l~~n~~----~ 235 (311) ... -++||.+++.++... + ..+..+++.|+.+..|.+-. +..|.-++.=-..+.. . T Consensus 150 --------~~~----~~~~l~~~~~~l~~~--~--~~~~~~v~n~~~~~~L~~lk----d~~G~~l~~~~~~~~~~~l~G 209 (299) T protein:vir:41 150 --------TAN----KYDDLNEAIGLIEAE--D--LEPNGIATIRKQRVKYRSTK----DGNGMPIFNTATSNGVDDVLG 209 (299) T ss_pred --------ccc----cHHHHHHHHHhhhcc--c--CCcCEEEEcHHHHHHHHHhh----ccCCceeecCCcCCCCceecc Confidence 111 267888888887532 2 24568999999999997632 1112211110000110 1 Q ss_pred eEEEEchhcccCCCCcccEEEEEEcCcceeEEeecchhhhccc------------------eeeCCceEEEeeeeeeeeE Q lcl|NC_019522. 236 ITFEDDILLKGAGVAGADRMAVYKKEIRIVKGHDVMPLRFLAP------------------ATADNVNFKVPAILRTGGT 297 (311) Q Consensus 236 l~i~~~~~l~~ag~~g~~~~v~y~~~~~~~~~~~~~~~~~~~p------------------~~~~~~~~~~~~~~~~gGv 297 (311) ..++..+.+. ++ ..+..+++-+-.+ +.+.+..++++..- .+ ++ ...+.+..|+ |. T Consensus 210 ~PV~~~~~~~-~~--~~~~~~~~gdfs~-~~i~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~-~~-~~~~r~~~~~-d~ 282 (299) T protein:vir:41 210 LPIAYTPKYT-FG--DKDISELVGDWNQ-AYYGILRGVEYEILTEATLTTVADETGKPLNLAE-RD-MAAIKATFEV-GF 282 (299) T ss_pred eeeEEecccC-CC--CCceEEEEEeccc-EEEEEecCcEEEEeecccccccccccccchhhhh-cC-cEEEEEEEEe-cc Confidence 2233333332 11 1222232222222 22222222211100 11 11 2456777887 67 Q ss_pred EEECCeEEEEeecC Q lcl|NC_019522. 298 EWRIPKAGHYVDGV 311 (311) Q Consensus 298 ~i~~P~ai~~~dGI 311 (311) .+++|.||+.+.+- T Consensus 283 ~v~~~~A~~~l~~~ 296 (299) T protein:vir:41 283 MVVKDEAFSAVQPK 296 (299) T ss_pred EEecccceEEEEec Confidence 78899999999999 No 43 >protein:vir:191 Length: 385 # NCBI annotation: major head subunit precursor # Family: family:all:585 # MgeID: mge:6 # MgeName: HK97 # Cross-refs: genbank:acc:NP_037701;genbank:gi:9634158;genbank:GeneID:1262530 Probab=98.53 E-value=3.2e-08 Score=61.69 Aligned_cols=270 Identities=13% Similarity=0.001 Sum_probs=159.4 Q ss_pred CCcccccccchhhhhhhHHHHHHHHHHHHhhhhhhhhhhhhccccCCCCcceeEEEEEEeec-ccceEEecCccccccee Q lcl|NC_019522. 1 MAKSVFDVSPVSALSFLVNQAAHIESEIYRIEYPQFKYGTLLPLDNSAPDWAQAVMFRSIDA-RGELQLFGPNSTDVPTV 79 (311) Q Consensus 1 ~~~~~~~~~~~~~~~fl~~~L~~id~~v~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~-~G~a~~~~~~a~dip~v 79 (311) --..++.+.+.+++.++. +.+.+.|++........+.++++..-. ...+.|.+.+. .+.+.|++.+ ..+|.. T Consensus 100 ~~~~~~~~~~~~~g~~i~---~~~~~~ii~~~~~~~~l~~~~~~~~~~---~~~~~~~~~~~~~~~a~~v~E~-~~~~~~ 172 (385) T protein:vir:19 100 TFNKSLGSDADSAGSLIQ---PMQIPGIIMPGLRRLTIRDLLAQGRTS---SNALEYVREEVFTNNADVVAEK-ALKPES 172 (385) T ss_pred HHHhhhccccccCCceec---chhhhHHHHHhhhccchhhhcceeccc---CcceEEEEEecCCcceeeeccC-cccccc Confidence 111233444444444554 335567888888888888888775422 23455666654 4567777765 568999 Q ss_pred eeeccceeEEEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhhheeeeeccccc--eeeeecCCcceeecc Q lcl|NC_019522. 80 DIAMSQGFKDINTAALGYTYSIEEIGFAMLNNVNLDAERGQAVRDVVEQGLNKIYLLGDKGVG--EGLYTSPNVSVEAAT 157 (311) Q Consensus 80 ~~~~~~~~~~v~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aa~~~~~~~~n~~~~~G~~~~g--~GllN~p~v~~~~~~ 157 (311) +..++......+.++..+.++. |+.... ..+...-....++++...+|+-+++|+...+ .|+++.+++...... T Consensus 173 ~~~~~~~~~~~~k~~~~~~is~-ell~d~---~~l~~~i~~~la~a~~~~~d~~~l~G~g~~~~~~Gi~~~~~~~~~~~~ 248 (385) T protein:vir:19 173 DITFSKQTANVKTIAHWVQASR-QVMDDA---PMLQSYINNRLMYGLALKEEGQLLNGDGTGDNLEGLNKVATAYDTSLN 248 (385) T ss_pred ccceeEEEEeeeeEEEeehhhH-HHHhhH---HHHHHHHHHHHHHHHHHHHHHHHHhccCCCCccccccccccccccccc Confidence 9999999999999999999995 554422 2577888888899999999999999986654 499988775443322 Q ss_pred CCccccCcccccCCHHHHHHHHHHHHHHHHhccCCceecceEEEeCHHHHHHHhcccccCCCCCcchHHHHHHHhC---- Q lcl|NC_019522. 158 STFVALVAAIPTNGTQPIIDFFGNAYNTVYLDNTLTVHRPNTFVLPPAQFQLLARTLLSTQNASNVTLLQFLRTNF---- 233 (311) Q Consensus 158 ~~~~~~~t~w~~~t~~ei~~di~~~~~~~~~~~~~~~~~p~~l~lpp~~~~~L~~~~~~~~~~~~~Tvl~~l~~n~---- 233 (311) . +.+..+++|.+++.++... + ..+..++|+|+.+..|.+-. +..|.-++.-..... T Consensus 249 ~------------~~~~~~d~i~~~~~~l~~~--~--~~~~~~~~~~~~~~~l~~lk----d~~G~~l~~~~~~~~~~~l 308 (385) T protein:vir:19 249 A------------TGDTRADIIAHAIYQVTES--E--FSASGIVLNPRDWHNIALLK----DNEGRYIFGGPQAFTSNIM 308 (385) T ss_pred c------------cccchHHHHHHHHHhhccc--c--CCCCEEEEcHHHHHHHHHhh----cCCCceeccCcccCCCcee Confidence 1 1222467788888877432 2 23568999999999886532 111222211000000 Q ss_pred CceEEEEchhcccCCCCcccEEEEEEcCcceeEEeecchhhhc------cceeeCCceEEEeeeeeeeeEEEECCeEEEE Q lcl|NC_019522. 234 PDITFEDDILLKGAGVAGADRMAVYKKEIRIVKGHDVMPLRFL------APATADNVNFKVPAILRTGGTEWRIPKAGHY 307 (311) Q Consensus 234 ~~l~i~~~~~l~~ag~~g~~~~v~y~~~~~~~~~~~~~~~~~~------~p~~~~~~~~~~~~~~~~gGv~i~~P~ai~~ 307 (311) ..+.++..+.+.. +.++ +-+-.+.+.+.....+++. ...+ ++ .+.+.++.|++ +.+++|.+|+. T Consensus 309 ~G~pV~~~~~~p~------~~~~-~gd~~~~~~~~~~~~~~v~~~~~~~~~~~-~~-~~~~~~~~r~~-~~v~~~~a~~~ 378 (385) T protein:vir:19 309 WGLPVVPTKAQAA------GTFT-VGGFDMASQVWDRMDATVEVSREDRDNFV-KN-MLTILCEERLA-LAHYRPTAIIK 378 (385) T ss_pred cceeeEEcCcCCC------CcEE-EeecccEEEEEEecceEEEEeccccchhh-cC-cEEEEEEEeec-cEEecccceEE Confidence 1123333343321 1112 2111121222111222111 1112 12 24566678875 66799999999 Q ss_pred eecC Q lcl|NC_019522. 308 VDGV 311 (311) Q Consensus 308 ~dGI 311 (311) ++.- T Consensus 379 ~~~~ 382 (385) T protein:vir:19 379 GTFS 382 (385) T ss_pred EEec Confidence 9998 No 44 >protein:vir:1886 Length: 385 # NCBI annotation: major capsid subunit precursor # Family: family:all:585 # MgeID: mge:41 # MgeName: HK022 # Cross-refs: genbank:acc:NP_037666;genbank:gi:9634124;genbank:GeneID:1262513 Probab=98.53 E-value=3.2e-08 Score=61.69 Aligned_cols=270 Identities=13% Similarity=0.001 Sum_probs=159.4 Q ss_pred CCcccccccchhhhhhhHHHHHHHHHHHHhhhhhhhhhhhhccccCCCCcceeEEEEEEeec-ccceEEecCccccccee Q lcl|NC_019522. 1 MAKSVFDVSPVSALSFLVNQAAHIESEIYRIEYPQFKYGTLLPLDNSAPDWAQAVMFRSIDA-RGELQLFGPNSTDVPTV 79 (311) Q Consensus 1 ~~~~~~~~~~~~~~~fl~~~L~~id~~v~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~-~G~a~~~~~~a~dip~v 79 (311) --..++.+.+.+++.++. +.+.+.|++........+.++++..-. ...+.|.+.+. .+.+.|++.+ ..+|.. T Consensus 100 ~~~~~~~~~~~~~g~~i~---~~~~~~ii~~~~~~~~l~~~~~~~~~~---~~~~~~~~~~~~~~~a~~v~E~-~~~~~~ 172 (385) T protein:vir:18 100 TFNKSLGSDADSAGSLIQ---PMQIPGIIMPGLRRLTIRDLLAQGRTS---SNALEYVREEVFTNNADVVAEK-ALKPES 172 (385) T ss_pred HHHhhhccccccCCceec---chhhhHHHHHhhhccchhhhcceeccc---CcceEEEEEecCCcceeeeccC-cccccc Confidence 111233444444444554 335567888888888888888775422 23455666654 4567777765 568999 Q ss_pred eeeccceeEEEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhhheeeeeccccc--eeeeecCCcceeecc Q lcl|NC_019522. 80 DIAMSQGFKDINTAALGYTYSIEEIGFAMLNNVNLDAERGQAVRDVVEQGLNKIYLLGDKGVG--EGLYTSPNVSVEAAT 157 (311) Q Consensus 80 ~~~~~~~~~~v~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aa~~~~~~~~n~~~~~G~~~~g--~GllN~p~v~~~~~~ 157 (311) +..++......+.++..+.++. |+.... ..+...-....++++...+|+-+++|+...+ .|+++.+++...... T Consensus 173 ~~~~~~~~~~~~k~~~~~~is~-ell~d~---~~l~~~i~~~la~a~~~~~d~~~l~G~g~~~~~~Gi~~~~~~~~~~~~ 248 (385) T protein:vir:18 173 DITFSKQTANVKTIAHWVQASR-QVMDDA---PMLQSYINNRLMYGLALKEEGQLLNGDGTGDNLEGLNKVATAYDTSLN 248 (385) T ss_pred ccceeEEEEeeeeEEEeehhhH-HHHhhH---HHHHHHHHHHHHHHHHHHHHHHHHhccCCCCccccccccccccccccc Confidence 9999999999999999999995 554422 2577888888899999999999999986654 499988775443322 Q ss_pred CCccccCcccccCCHHHHHHHHHHHHHHHHhccCCceecceEEEeCHHHHHHHhcccccCCCCCcchHHHHHHHhC---- Q lcl|NC_019522. 158 STFVALVAAIPTNGTQPIIDFFGNAYNTVYLDNTLTVHRPNTFVLPPAQFQLLARTLLSTQNASNVTLLQFLRTNF---- 233 (311) Q Consensus 158 ~~~~~~~t~w~~~t~~ei~~di~~~~~~~~~~~~~~~~~p~~l~lpp~~~~~L~~~~~~~~~~~~~Tvl~~l~~n~---- 233 (311) . +.+..+++|.+++.++... + ..+..++|+|+.+..|.+-. +..|.-++.-..... T Consensus 249 ~------------~~~~~~d~i~~~~~~l~~~--~--~~~~~~~~~~~~~~~l~~lk----d~~G~~l~~~~~~~~~~~l 308 (385) T protein:vir:18 249 A------------TGDTRADIIAHAIYQVTES--E--FSASGIVLNPRDWHNIALLK----DNEGRYIFGGPQAFTSNIM 308 (385) T ss_pred c------------cccchHHHHHHHHHhhccc--c--CCCCEEEEcHHHHHHHHHhh----cCCCceeccCcccCCCcee Confidence 1 1222467788888877432 2 23568999999999886532 111222211000000 Q ss_pred CceEEEEchhcccCCCCcccEEEEEEcCcceeEEeecchhhhc------cceeeCCceEEEeeeeeeeeEEEECCeEEEE Q lcl|NC_019522. 234 PDITFEDDILLKGAGVAGADRMAVYKKEIRIVKGHDVMPLRFL------APATADNVNFKVPAILRTGGTEWRIPKAGHY 307 (311) Q Consensus 234 ~~l~i~~~~~l~~ag~~g~~~~v~y~~~~~~~~~~~~~~~~~~------~p~~~~~~~~~~~~~~~~gGv~i~~P~ai~~ 307 (311) ..+.++..+.+.. +.++ +-+-.+.+.+.....+++. ...+ ++ .+.+.++.|++ +.+++|.+|+. T Consensus 309 ~G~pV~~~~~~p~------~~~~-~gd~~~~~~~~~~~~~~v~~~~~~~~~~~-~~-~~~~~~~~r~~-~~v~~~~a~~~ 378 (385) T protein:vir:18 309 WGLPVVPTKAQAA------GTFT-VGGFDMASQVWDRMDATVEVSREDRDNFV-KN-MLTILCEERLA-LAHYRPTAIIK 378 (385) T ss_pred cceeeEEcCcCCC------CcEE-EeecccEEEEEEecceEEEEeccccchhh-cC-cEEEEEEEeec-cEEecccceEE Confidence 1123333343321 1112 2111121222111222111 1112 12 24566678875 66799999999 Q ss_pred eecC Q lcl|NC_019522. 308 VDGV 311 (311) Q Consensus 308 ~dGI 311 (311) ++.- T Consensus 379 ~~~~ 382 (385) T protein:vir:18 379 GTFS 382 (385) T ss_pred EEec Confidence 9998 No 45 >protein:vir:96223 Length: 324 # NCBI annotation: ORF011 # Family: family:all:507 # MgeID: mge:1607 # MgeName: 69 # Cross-refs: genbank:acc:YP_239571;genbank:gi:66395304;genbank:GeneID:5132771 Probab=98.52 E-value=6e-08 Score=60.17 Aligned_cols=270 Identities=8% Similarity=-0.029 Sum_probs=150.4 Q ss_pred CCcccccccc----hhhhhhhHHHHHHHHHHHHhhhhhhhhhhhhccccCCCCcceeEEEEEEeecccceEEecCccccc Q lcl|NC_019522. 1 MAKSVFDVSP----VSALSFLVNQAAHIESEIYRIEYPQFKYGTLLPLDNSAPDWAQAVMFRSIDARGELQLFGPNSTDV 76 (311) Q Consensus 1 ~~~~~~~~~~----~~~~~fl~~~L~~id~~v~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~G~a~~~~~~a~di 76 (311) .....+++.. ..+.+.+- +.+-.+|++.....-..++++++.. .....+.|.+.+..+.+.|++.+ ..+ T Consensus 19 ~~~~~~~a~~~~~~~~~~~lip---~~~~~~ii~~~~~~s~l~~l~~~~~---~~~~~~~~p~~~~~~~a~~v~Eg-~~~ 91 (324) T protein:vir:96 19 VKPQVFNPDNVMMHEKKDGTLL---NDFTTPILQEVMENSKIMQLGKYEP---MEGTEKKFTFWADKPGAYWVGEG-QKI 91 (324) T ss_pred hhhhhcccccccccCCCcceec---hhHHHHHHHHHHhhchhhhhcceee---ccCCceEEEEEecCcceeeecCC-ccc Confidence 3333343332 22333333 3345667776666666666665543 22234677777777888998775 668 Q ss_pred ceeeeeccceeEEEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhhheeeeeccccc--eeeeecCCccee Q lcl|NC_019522. 77 PTVDIAMSQGFKDINTAALGYTYSIEEIGFAMLNNVNLDAERGQAVRDVVEQGLNKIYLLGDKGVG--EGLYTSPNVSVE 154 (311) Q Consensus 77 p~v~~~~~~~~~~v~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aa~~~~~~~~n~~~~~G~~~~g--~GllN~p~v~~~ 154 (311) |..+..++......+.++....++.+=++. ...++...-.....+++.+.+|+.+|+|+...+ .|+++....... T Consensus 92 ~~~~~~f~~v~~~~~k~~~~~~is~ell~d---s~~~l~~~i~~~l~~aia~~~d~~~l~G~g~~~~~~~~~~~~~~~~~ 168 (324) T protein:vir:96 92 ETSKATWVNATMRAFKLGVILPVTKEFLNY---TYSQFFEEMKPMIAEAFYKKFDEAGILNQGNNPFGKSIAQSIKKTNK 168 (324) T ss_pred cccccceeEEEEEeEEEEEeehhhHHHHhc---chHHHHHHHHHHHHHHHHHHHHHHhhhcCCCCCcCccccccccccce Confidence 998999999999999999999988644443 336788888999999999999999999976543 355443221111 Q ss_pred eccCCccccCcccccCCHHHHHHHHHHHHHHHHhccCCceecceEEEeCHHHHHHHhcccccCCCCCcchHHHHHHHhCC Q lcl|NC_019522. 155 AATSTFVALVAAIPTNGTQPIIDFFGNAYNTVYLDNTLTVHRPNTFVLPPAQFQLLARTLLSTQNASNVTLLQFLRTNFP 234 (311) Q Consensus 155 ~~~~~~~~~~t~w~~~t~~ei~~di~~~~~~~~~~~~~~~~~p~~l~lpp~~~~~L~~~~~~~~~~~~~Tvl~~l~~n~~ 234 (311) .. ..+.| ++||.+++.++... + ..+..++++|+.+..|.+.. . ..+..++ .... T Consensus 169 ~~----------~~~~~----~~~i~~~~~~i~~~--~--~~~~~~i~n~~~~~~L~~lk-d---~~G~~~~----~~~~ 222 (324) T protein:vir:96 169 VI----------KGDFT----QDNIIDLEALLEDD--E--LEANAFISKTQNRSLLRKIV-D---PETKERI----YDRN 222 (324) T ss_pred ec----------ccccc----hHHHHHHHHhhhhc--c--CCCCEEEEcHHHHHHHHHhh-C---CCCCeee----cCCC Confidence 00 11112 56677777777432 2 34678999999999986532 1 1122111 1111 Q ss_pred ceEEEEchhc-ccCCCCcccEEEEEEcCcceeEEeecchhhhc--------cc----------eeeCCceEEEeeeeeee Q lcl|NC_019522. 235 DITFEDDILL-KGAGVAGADRMAVYKKEIRIVKGHDVMPLRFL--------AP----------ATADNVNFKVPAILRTG 295 (311) Q Consensus 235 ~l~i~~~~~l-~~ag~~g~~~~v~y~~~~~~~~~~~~~~~~~~--------~p----------~~~~~~~~~~~~~~~~g 295 (311) .-++...|-. ..+...++..+++-+. .++-+.+..++++. .. .+ ++ ...+.+..|+ T Consensus 223 ~~~l~G~PV~~~~~~~~~~~~~~~gd~--s~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~-~n-~v~~r~~~r~- 297 (324) T protein:vir:96 223 SDSLDGLPVVNLKSSNLKRGELITGDF--DKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFE-QD-MVALRATMHV- 297 (324) T ss_pred CCcccceeeEeecCCCCCcceEEEEec--ceEEEEEecCcEEEEeecccccccccccccchhhhh-cC-cEEEEEEEEe- Confidence 1111111111 1111112222222111 11112122222110 00 11 11 2455667777 Q ss_pred eEEEECCeEEEEeecC Q lcl|NC_019522. 296 GTEWRIPKAGHYVDGV 311 (311) Q Consensus 296 Gv~i~~P~ai~~~dGI 311 (311) |+.+.+|.|++++.+- T Consensus 298 d~~v~~~~a~~~l~~a 313 (324) T protein:vir:96 298 ALHIADDKAFAKLVPA 313 (324) T ss_pred ccEEecccceEEEecc Confidence 5778889999999988 No 46 >protein:vir:100135 Length: 418 # NCBI annotation: gp5 # Family: family:all:585 # MgeID: mge:1639 # MgeName: phi1026b # Cross-refs: genbank:acc:NP_945035;genbank:gi:38707895;genbank:GeneID:2744182 Probab=98.52 E-value=6.3e-08 Score=60.06 Aligned_cols=272 Identities=14% Similarity=0.046 Sum_probs=152.5 Q ss_pred CCcccccccchhhhhhhHHHHHHHHHHHHhhhhhhhhhhhhccccCCCCcceeEEEEEEeec-ccceEEecCccccccee Q lcl|NC_019522. 1 MAKSVFDVSPVSALSFLVNQAAHIESEIYRIEYPQFKYGTLLPLDNSAPDWAQAVMFRSIDA-RGELQLFGPNSTDVPTV 79 (311) Q Consensus 1 ~~~~~~~~~~~~~~~fl~~~L~~id~~v~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~-~G~a~~~~~~a~dip~v 79 (311) -....+.+. .++.+++.. +.+.+.|++........++++++..-. ..++.+..... .+.+.|++.+ ..+|.. T Consensus 131 ~~~~~~~~~-~~~~g~lvp--~~~~~~ii~~~~~~~~l~~~~~~~~~~---~~~~~~~~~~~~~~~a~~v~E~-~~~~~~ 203 (418) T protein:vir:10 131 NVPATVGSG-VSGSNSLVV--ADRQAGIIAPPQRKMTIRDLLMPGQTS---SSSIEYTVETGFTNNAAAVAEG-AQKPTS 203 (418) T ss_pred HhhhhccCC-CCCCccccc--hhHHHHHHHHHhhhhhHHhhcceeecc---CCceeEEEEecCCCceeeeccC-cccccc Confidence 000111122 222333443 446667888888777777777654322 22344554444 4567787765 457888 Q ss_pred eeeccceeEEEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhhheeeeeccccc--eeeeecCCcceeecc Q lcl|NC_019522. 80 DIAMSQGFKDINTAALGYTYSIEEIGFAMLNNVNLDAERGQAVRDVVEQGLNKIYLLGDKGVG--EGLYTSPNVSVEAAT 157 (311) Q Consensus 80 ~~~~~~~~~~v~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aa~~~~~~~~n~~~~~G~~~~g--~GllN~p~v~~~~~~ 157 (311) +..++......+.++..+.+|.+ +.... .++..--.....+++.+.+|+.+++|+..-+ .|++|..++...+.. T Consensus 204 ~~~f~~v~~~~~k~~~~~~is~e-ll~ds---~~l~~~i~~~l~~a~~~~~d~a~l~G~g~~~~p~Gi~~~~~~~~~~~~ 279 (418) T protein:vir:10 204 DLKFNLKNQPVRTIAHLFKASRQ-ILDDA---PALQSYIDGRARYGLQLTEEGQILKGDGTGANILGILPQASAFMPSIT 279 (418) T ss_pred ccceeeEEEeeeeEEEeehhhHH-HHHhH---HHHHHHHHHHHHHHHHHHHHHHHhccCCCCcccccccccccccccccc Confidence 88999999999999999999854 54422 2688888888999999999999999986543 599999876544332 Q ss_pred CCccccCcccccCCHHHHHHHHHHHHHHHHhccCCceecceEEEeCHHHHHHHhcccccCCCCCcchHHHHHHHhC---- Q lcl|NC_019522. 158 STFVALVAAIPTNGTQPIIDFFGNAYNTVYLDNTLTVHRPNTFVLPPAQFQLLARTLLSTQNASNVTLLQFLRTNF---- 233 (311) Q Consensus 158 ~~~~~~~t~w~~~t~~ei~~di~~~~~~~~~~~~~~~~~p~~l~lpp~~~~~L~~~~~~~~~~~~~Tvl~~l~~n~---- 233 (311) .+ .+ .-++||.+++..+.. .+ ..+..++|+|+.+..|.+.. +..|.-++.=..... T Consensus 280 ~~------~~------~~~~~i~~~~~~~~~--~~--~~~~~~v~n~~~~~~L~~lk----d~~G~~i~~~~~~~~~~~l 339 (418) T protein:vir:10 280 LA------NA------TPIDKIRLALLQAVL--AE--FPATGIVLNPIDWASIELTK----DSQGRYIVGNPVNGTTPRL 339 (418) T ss_pred cc------cc------ccHHHHHHHHHhhcc--cc--CCCCEEEEcHHHHHHHHHhh----cCCCceeccccccCCCcee Confidence 21 11 125677777777642 22 23567999999999986532 111222221000000 Q ss_pred CceEEEEchhcccCCCCcccEEEEEEcCcceeEEeecchhhhccceeeC----CceEEEeeeeeeeeEEEECCeEEEEee Q lcl|NC_019522. 234 PDITFEDDILLKGAGVAGADRMAVYKKEIRIVKGHDVMPLRFLAPATAD----NVNFKVPAILRTGGTEWRIPKAGHYVD 309 (311) Q Consensus 234 ~~l~i~~~~~l~~ag~~g~~~~v~y~~~~~~~~~~~~~~~~~~~p~~~~----~~~~~~~~~~~~gGv~i~~P~ai~~~d 309 (311) ..+.++..+.+.. | + +++-+-.+.+-+.....+++..-.+.+ .-.....++.+++ +.+++|.|+++++ T Consensus 340 ~G~pV~~~~~~p~-~----~--~~~gd~s~~~~~~~~~~~~i~~~~~~~~~f~~~~~~~r~~~~~d-~~~~~~~a~~~~~ 411 (418) T protein:vir:10 340 WNLPVVETQAMTA-N----E--FLVGAFSMAAQIFDRMEIEVLLSTENVDDFEKNMVSIRAEERLA-LAVYRPESFVTGA 411 (418) T ss_pred cceeeEEcCCCCC-C----c--EEEeeccceEEEEEecceEEEEecccchhhhcCceEEEEEEeec-cEEecccceEEEE Confidence 0122333333321 1 1 111111111111111122111000111 1123455667775 5699999999999 Q ss_pred cC Q lcl|NC_019522. 310 GV 311 (311) Q Consensus 310 GI 311 (311) .. T Consensus 412 ~~ 413 (418) T protein:vir:10 412 LV 413 (418) T ss_pred ec Confidence 88 No 47 >protein:vir:99920 Length: 311 # NCBI annotation: gp7 # Family: family:all:966 # MgeID: mge:1611 # MgeName: Halo # Cross-refs: genbank:acc:YP_655524;genbank:gi:109392294;genbank:GeneID:4157089 Probab=98.51 E-value=3.7e-08 Score=61.36 Aligned_cols=279 Identities=11% Similarity=-0.006 Sum_probs=155.0 Q ss_pred CCcccccccchhhhhhhHHHHHHHHHHHHhhhhhhhhhhhhccccCCCCcceeEEEEEEeecccceEEecCcccccceee Q lcl|NC_019522. 1 MAKSVFDVSPVSALSFLVNQAAHIESEIYRIEYPQFKYGTLLPLDNSAPDWAQAVMFRSIDARGELQLFGPNSTDVPTVD 80 (311) Q Consensus 1 ~~~~~~~~~~~~~~~fl~~~L~~id~~v~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~G~a~~~~~~a~dip~v~ 80 (311) || ++++ +.+++.. +.+..+|++...+.-..+++..+.. .......|.+....+.+.|++.+ ..+|..+ T Consensus 1 Ma--t~tt----~~g~~vP--~~~~~~ii~~~~~~s~l~~~~~~i~---~~~~~~~~p~~~~~~~a~wv~Eg-~~~~~~~ 68 (311) T protein:vir:99 1 MA--TFGT----GNLKNLP--RNIADGMVKDVVQGSTVAVLSARKP---QRFGNEDIITFNGRPKAEFVGEG-QQKSSTT 68 (311) T ss_pred Cc--eecC----CCceecc--HHHHHHHHHHHHhhchhhhhcceee---ccCCceEEEEEeCCceeEEeecC-ccccccc Confidence 55 3332 2233443 3455678888777777777765432 22234567777778889998765 5689889 Q ss_pred eeccceeEEEEEEEEEEEecHHHHHHH-HHhCCChHHHHHHHHHHHHHHhhhheeeeecccc-ce---eeeecCCcceee Q lcl|NC_019522. 81 IAMSQGFKDINTAALGYTYSIEEIGFA-MLNNVNLDAERGQAVRDVVEQGLNKIYLLGDKGV-GE---GLYTSPNVSVEA 155 (311) Q Consensus 81 ~~~~~~~~~v~~~~~~~~~~~~El~~a-~~~g~~l~~~k~~aa~~~~~~~~n~~~~~G~~~~-g~---GllN~p~v~~~~ 155 (311) ..+++.....+.++.-..+|. ||.++ .....+|...-.....+++++.+|+-+|+|+... +. |+.+..+..... T Consensus 69 ~~f~~v~l~~~k~~~~~~iS~-ell~~~~d~~~~l~~~i~~~la~ai~~~~d~~~l~G~g~~~g~~~~g~~~~~~~~~~~ 147 (311) T protein:vir:99 69 GEFDFVTSTPKKAQVTMRFNE-EVQWADEDYQLGVLQTLSEAGAEALARALDLGLYHRINPLTGTVIPGWSNYLGAASKR 147 (311) T ss_pred ceeeEEEEeeEEEEEeehhhH-HHhhcccccHHHHHHHHHHHHHHHHHHHHHHHhhcccCcccCccccccccccccccce Confidence 999999999999999988884 45433 3456788999999999999999999999997642 22 333332221111 Q ss_pred ccCCccccCcccccCCHHHHHHHHHHHHHHHHhccCCceecceEEEeCHHHHHHHhcccccCCCCCcchHHHHHHHhC-C Q lcl|NC_019522. 156 ATSTFVALVAAIPTNGTQPIIDFFGNAYNTVYLDNTLTVHRPNTFVLPPAQFQLLARTLLSTQNASNVTLLQFLRTNF-P 234 (311) Q Consensus 156 ~~~~~~~~~t~w~~~t~~ei~~di~~~~~~~~~~~~~~~~~p~~l~lpp~~~~~L~~~~~~~~~~~~~Tvl~~l~~n~-~ 234 (311) .. -...+......|+..++..+.... ....++.++|.|+.+..|.+-. +..|.-+++-..... + T Consensus 148 ~~---------~~~~~~~~~~~~i~~~~~~~~~~~--~~~~~~~~vmn~~~~~~L~~lk----d~~G~~l~~~~~~~~~~ 212 (311) T protein:vir:99 148 VE---------LTADTIANPDLAIEAAVGLLVANG--HPTPVNGLALHPSIAWGLSTAR----YTDGRKKFPELGLGIGV 212 (311) T ss_pred ee---------ccccccchhHHHHHHHHHHHhhhc--cCCCccEEEEcHHHHHHHHhhh----ccCCCeeecCcccCCCC Confidence 11 112233344567777777765332 2234567999999999986522 111221211110000 0 Q ss_pred ----ceEEEEchhccc-CC---------CCcccEEEEEEcCcceeEEeecchhhhcc-----c------eeeCCceEEEe Q lcl|NC_019522. 235 ----DITFEDDILLKG-AG---------VAGADRMAVYKKEIRIVKGHDVMPLRFLA-----P------ATADNVNFKVP 289 (311) Q Consensus 235 ----~l~i~~~~~l~~-ag---------~~g~~~~v~y~~~~~~~~~~~~~~~~~~~-----p------~~~~~~~~~~~ 289 (311) .+.+.....+.+ .+ .+..+.+++ -+-.+.+.+.+...+++.. + .+. + -.-+. T Consensus 213 ~~l~G~Pv~~s~~i~~~~~~~~~~~~~~~~~~~~~~~-Gdf~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-d-~~~~r 289 (311) T protein:vir:99 213 SSFEGIDASVSDTVNGGDEADPDDEDLDAARAVRGIV-GDFANGIHWGVQRDIPVELIKYGDPDGQGDLKRH-N-QIALR 289 (311) T ss_pred ceecceeeEeecccccccccccccchhhccCcceEEE-eeccccEEEEEecCceEEEeecCCCCcchhhhhc-C-cEEEE Confidence 011111111111 00 011122221 1111223333222221110 0 111 1 14567 Q ss_pred eeeeeeeEEEECCeEEEEeecC Q lcl|NC_019522. 290 AILRTGGTEWRIPKAGHYVDGV 311 (311) Q Consensus 290 ~~~~~gGv~i~~P~ai~~~dGI 311 (311) +..|+++. +++|.++...++. T Consensus 290 ~~~r~d~~-v~~~~~v~~~~~~ 310 (311) T protein:vir:99 290 LEIVYGWY-VFTDRFVVIENAV 310 (311) T ss_pred EEEeecce-ecChhHeeeeccc Confidence 78898775 6789999988888 No 48 >protein:vir:4456 Length: 401 # NCBI annotation: Major capsid protein precursor # Family: family:all:21 # MgeID: mge:96 # MgeName: ST64B # Cross-refs: genbank:acc:NP_700379;genbank:gi:23505451;genbank:GeneID:955658 Probab=98.49 E-value=8.7e-09 Score=64.76 Aligned_cols=289 Identities=9% Similarity=-0.024 Sum_probs=156.3 Q ss_pred CCcccccccchhhhhhhHHHHHHHHHHHHhhhhhhhhhhhhccccCCCCcceeEEEEEEeecccceEEecCcccccceee Q lcl|NC_019522. 1 MAKSVFDVSPVSALSFLVNQAAHIESEIYRIEYPQFKYGTLLPLDNSAPDWAQAVMFRSIDARGELQLFGPNSTDVPTVD 80 (311) Q Consensus 1 ~~~~~~~~~~~~~~~fl~~~L~~id~~v~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~G~a~~~~~~a~dip~v~ 80 (311) ....+|.+.+.+.+++++. +.+.+.|++........+.++.+..- ......+.+......+.|.+.+ ...|..+ T Consensus 102 ~e~~a~~~~~~~~GG~~iP--~~~~~~ii~~~~~~~~l~~~~~~~~~---~~~~~~~~~~~~~~~a~wv~E~-~~~~~~~ 175 (401) T protein:vir:44 102 LERKALQVGTDEDGGYAVP--EELDRSILSLLKDEVVMRQEATVITV---GGSDYKKLVNLGGTASGWVGET-DTRSQTA 175 (401) T ss_pred HHHHHhhcCCCCCCceecc--HhHHHHHHHHHHhhhhhhhhceeeec---CCCceEEEEecCCccceeeccc-cccCccc Confidence 1112233333444566665 56778888887776666666654332 2234455555555566776654 3456444 Q ss_pred -eeccceeEEEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhhheeeeeccccc-eeeeecCCcceeeccC Q lcl|NC_019522. 81 -IAMSQGFKDINTAALGYTYSIEEIGFAMLNNVNLDAERGQAVRDVVEQGLNKIYLLGDKGVG-EGLYTSPNVSVEAATS 158 (311) Q Consensus 81 -~~~~~~~~~v~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aa~~~~~~~~n~~~~~G~~~~g-~GllN~p~v~~~~~~~ 158 (311) ..+++.....+.++.-+.+|.+=|. ....+|...-....+.++.+.++..+++|+.... .|+||.+.....+... T Consensus 176 ~~~~~~v~~~~~k~~~~~~iS~ell~---ds~~~l~~~i~~~la~ai~~~~~~~~l~G~G~~~p~Gil~~~~~~~~~~~~ 252 (401) T protein:vir:44 176 TSRLGLIEPFMGEIYGNPQATQKMLD---DAFFNVEAWINSELATEFAEQEEIAFTTGDGTKKPKGFLAYESTEESDKAR 252 (401) T ss_pred cccceeeeeehhheeeehhhhHHHHh---cchHHHHHHHHHHHHHHHHHHHHhhhhccCCCCccceeecccccccccccc Confidence 3678888888888888888854333 3466888889999999999999999999987644 6999998865543321 Q ss_pred CccccCcccccCCHHH-HHHHHHHHHHHHHhccCCceecceEEEeCHHHHHHHhcccccCCCCCcchHHHHHHHhCCc-- Q lcl|NC_019522. 159 TFVALVAAIPTNGTQP-IIDFFGNAYNTVYLDNTLTVHRPNTFVLPPAQFQLLARTLLSTQNASNVTLLQFLRTNFPD-- 235 (311) Q Consensus 159 ~~~~~~t~w~~~t~~e-i~~di~~~~~~~~~~~~~~~~~p~~l~lpp~~~~~L~~~~~~~~~~~~~Tvl~~l~~n~~~-- 235 (311) .... ...-.+.+... -++||.+++..|.. .+ ...-.++++++.+..|.+-. +..|.-++.--..++.. T Consensus 253 ~~~~-~~~~~t~~~~~~~~d~i~~~~~~l~~--~~--~~~a~~v~n~~~~~~L~~lk----d~~G~~l~~~~~~~g~~~~ 323 (401) T protein:vir:44 253 AFGK-LQHIVSGEATAVTADAIIKLIYTLRK--AH--RTGAKFMMNNNSLFAIRLLK----DTEGNYLWRPGLELGQPSS 323 (401) T ss_pred cccc-ccccccccccccCHHHHHHHHHhcch--hh--hcCCEEEEcHHHHHHHHHhh----ccCCceeecCCcCCCCCce Confidence 1111 11111111111 26677777777632 11 12236899999999986422 11122221100011111 Q ss_pred ---eEEEEchhcccCCCCcccEEEEEEcCcceeEEeecchhhhcc-ceeeCCceEEEeeeeeeeeEEEECCeEEEEeecC Q lcl|NC_019522. 236 ---ITFEDDILLKGAGVAGADRMAVYKKEIRIVKGHDVMPLRFLA-PATADNVNFKVPAILRTGGTEWRIPKAGHYVDGV 311 (311) Q Consensus 236 ---l~i~~~~~l~~ag~~g~~~~v~y~~~~~~~~~~~~~~~~~~~-p~~~~~~~~~~~~~~~~gGv~i~~P~ai~~~dGI 311 (311) .-++.+..+...+ .+.+ .++|-+=.+.+.+..-..++.+. +.-..+ ...+.++.|++ +.+..|.|++.+..= T Consensus 324 l~G~PVv~~~~~p~~~-~~~~-~i~~Gd~~~~~~i~~~~~~~~~~~~~~~~~-~v~~~a~~r~d-~~~~~~~a~~~l~~~ 399 (401) T protein:vir:44 324 LAGYGIAENEQMPDIA-ADAK-AIAFGNFKRGYTIVDRIGTRILRDPYTNKP-FVGFYTTKRTG-GMLVDSQAIKLLKIA 399 (401) T ss_pred ecceeeEEecCcCCcc-CCcc-EEEEeehhccEEEEEecceEEeeeccccCC-cEEEEEEEEec-cEEecccceEEEEee Confidence 1233333333222 2222 23333222322222122222211 111112 35566778885 556669999886655 No 49 >protein:vir:104256 Length: 458 # NCBI annotation: major head protein precursor # Family: family:all:27070 # MgeID: mge:1504 # MgeName: T5 # Cross-refs: genbank:acc:YP_006977;genbank:gi:46401878;genbank:GeneID:2777673 Probab=98.49 E-value=4e-08 Score=61.12 Aligned_cols=283 Identities=8% Similarity=-0.096 Sum_probs=148.8 Q ss_pred CCcccccc-cchhhhhhhHHHHHHHHHHHHhhhhhhhhhhhhccccCCCCcceeEEEEEEeecccceEEecCcccccc-- Q lcl|NC_019522. 1 MAKSVFDV-SPVSALSFLVNQAAHIESEIYRIEYPQFKYGTLLPLDNSAPDWAQAVMFRSIDARGELQLFGPNSTDVP-- 77 (311) Q Consensus 1 ~~~~~~~~-~~~~~~~fl~~~L~~id~~v~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~G~a~~~~~~a~dip-- 77 (311) -+..++.. .+.+.++++.. +.+.+.|++...+....++++.+.. .......|.+....+.+.|++.+.. .| T Consensus 156 ~~~~a~~~~~~~~~g~~~ip--~~~~~~ii~~~~~~~~l~~~~~~~~---~~~~~~~~~~~~~~~~a~~v~e~~~-~~~~ 229 (458) T protein:vir:10 156 RHLKAVNQSSSVEVSSESYE--TIFSQRIIRDLQKELVVGALFEELP---MSSKILTMLVEPDAGKATWVAASTY-GTDT 229 (458) T ss_pred hhhhhhhhcccCccccceeh--hhHhHHHHHHHHhhhhHHhhcceee---cCCcceEEEEecCCcceeecccccc-cccc Confidence 01111111 12334445554 4567778888777777777766432 2223445555555667777765421 22 Q ss_pred ----eeeeeccceeEEEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhhheeeeeccccc-eeeeecCCcc Q lcl|NC_019522. 78 ----TVDIAMSQGFKDINTAALGYTYSIEEIGFAMLNNVNLDAERGQAVRDVVEQGLNKIYLLGDKGVG-EGLYTSPNVS 152 (311) Q Consensus 78 ----~v~~~~~~~~~~v~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aa~~~~~~~~n~~~~~G~~~~g-~GllN~p~v~ 152 (311) ..+..++......+.++..+.+|.+ +. .....++.+--......++...+|+-+++|+.... .|++|+++.. T Consensus 230 ~~~~~~~~~~~~i~~~~~k~~~~v~is~e-ll--~ds~~~~~~~i~~~l~~~i~~~~d~~~l~G~G~~~p~Gi~~~~~~~ 306 (458) T protein:vir:10 230 TTGEEVKGALKEIHFSTYKLAAKSFITDE-TE--EDAIFSLLPLLRKRLIEAHAVSIEEAFMTGDGSGKPKGLLTLASED 306 (458) T ss_pred cccccccccceeeEeeeeeEEeeehhhHH-HH--hcchHHHHHHHHHHHHHHHHHHHHHHhhcCCCCCccceeeeccccc Confidence 2344567777788888888888855 32 22335788888899999999999999999986544 6999999876 Q ss_pred eeeccCCccccCcccccCCHHHHHHHHHHHHHHHHhccCCceecceEEEeCHHHHHHHhcccccCCCCCcchHHHH-HH- Q lcl|NC_019522. 153 VEAATSTFVALVAAIPTNGTQPIIDFFGNAYNTVYLDNTLTVHRPNTFVLPPAQFQLLARTLLSTQNASNVTLLQF-LR- 230 (311) Q Consensus 153 ~~~~~~~~~~~~t~w~~~t~~ei~~di~~~~~~~~~~~~~~~~~p~~l~lpp~~~~~L~~~~~~~~~~~~~Tvl~~-l~- 230 (311) ........++.. -...| +++|.+++..+.. .+ ..+..++|+|+.+..|..-. +..|.-++.. +. T Consensus 307 ~~~~~~~~~~~~--~~~~~----~~~i~~~~~~l~~--~~--~~~~~~v~~~~~~~~l~~lk----d~~G~~i~~~~~~~ 372 (458) T protein:vir:10 307 SAKVVTEAKADG--SVLVT----AKTISKLRRKLGR--HG--LKLSKLVLIVSMDAYYDLLE----DEEWQDVAQVGNDS 372 (458) T ss_pred ccceeecccccc--ccccc----HHHHHHHHHhhhh--hh--cCCCEEEEcHHHHHHHHhhc----ccCCceeecccccc Confidence 544433222211 11223 4566667776632 22 13457999999999886422 1111111110 00 Q ss_pred --HhCC-----ceEEEEchhcccCCCCcccEEEEEEcCcceeEEeecchhhhc-cceeeCCceEEEeeeeeeeeEEEECC Q lcl|NC_019522. 231 --TNFP-----DITFEDDILLKGAGVAGADRMAVYKKEIRIVKGHDVMPLRFL-APATADNVNFKVPAILRTGGTEWRIP 302 (311) Q Consensus 231 --~n~~-----~l~i~~~~~l~~ag~~g~~~~v~y~~~~~~~~~~~~~~~~~~-~p~~~~~~~~~~~~~~~~gGv~i~~P 302 (311) .++. ...|+....+.. +++.++ +++-+-.+.+.+..-..+++. .|.-..+ ...+-...|+ |..+++| T Consensus 373 ~~~~~~~~~l~G~pv~~~~~~p~-~~~~~~--~~~~~f~~~~~~~~~~~~~v~~d~~~~~~-~~~~~~~~r~-~~~v~~~ 447 (458) T protein:vir:10 373 VKLQGQVGRIYGLPVVVSEYFPA-KANSAE--FAVIVYKDNFVMPRQRAVTVERERQAGKQ-RDAYYVTQRV-NLQRYFA 447 (458) T ss_pred ccccCcCceecceeeEEcccccc-ccCCcc--eEEEEecccEEEEEeeceEEEeecccCCC-ceEEEEEEEe-cceEecc Confidence 0011 122333333322 111122 222222222222222222221 1111112 2445566786 6888999 Q ss_pred eEEEEeecC Q lcl|NC_019522. 303 KAGHYVDGV 311 (311) Q Consensus 303 ~ai~~~dGI 311 (311) .+|+..+== T Consensus 448 ~a~v~~~~a 456 (458) T protein:vir:10 448 NGVVSGTYA 456 (458) T ss_pred cceEEEeec Confidence 999772211 No 50 >protein:vir:4339 Length: 395 # NCBI annotation: major head protein # Family: family:all:585 # MgeID: mge:93 # MgeName: D3 # Cross-refs: genbank:acc:NP_061502;genbank:gi:9635591;genbank:GeneID:1262860 Probab=98.49 E-value=8.9e-08 Score=59.25 Aligned_cols=275 Identities=12% Similarity=0.033 Sum_probs=156.0 Q ss_pred CCcccccccchhhhhhhHHHHHHHHHHHHhhhhhhhhhhhhccccCCCCcceeEEEEEEeec-ccceEEecCccccccee Q lcl|NC_019522. 1 MAKSVFDVSPVSALSFLVNQAAHIESEIYRIEYPQFKYGTLLPLDNSAPDWAQAVMFRSIDA-RGELQLFGPNSTDVPTV 79 (311) Q Consensus 1 ~~~~~~~~~~~~~~~fl~~~L~~id~~v~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~-~G~a~~~~~~a~dip~v 79 (311) +....+...+.+++ .+.. +.+.+.|++........+.++++..-. ..++.|..... .+.+.|++.+ ..+|.. T Consensus 109 ~~~~~~~~~~~~~g-~~vp--~~~~~~ii~~~~~~~~l~~l~~~~~~~---~~~~~~~~~~~~~~~a~~v~E~-~~~~~~ 181 (395) T protein:vir:43 109 MPRSAITSIDGSGG-ALVA--PDRRPGVVAAPQRRLTIRDLVAPGTTE---SNSVEYVRETGFVNNAAPVSEG-TQKPYS 181 (395) T ss_pred hhhhhhcccCCCCc-cccc--hhhHHHHHHHHHhhhhHHhhccceecC---CCceEEEEEecCCCceeeecCC-cccccc Confidence 22222222222333 3332 234567888888777777777755432 23455555433 5678888775 468998 Q ss_pred eeeccceeEEEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhhheeeeeccccc--eeeeecCCcceeecc Q lcl|NC_019522. 80 DIAMSQGFKDINTAALGYTYSIEEIGFAMLNNVNLDAERGQAVRDVVEQGLNKIYLLGDKGVG--EGLYTSPNVSVEAAT 157 (311) Q Consensus 80 ~~~~~~~~~~v~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aa~~~~~~~~n~~~~~G~~~~g--~GllN~p~v~~~~~~ 157 (311) +..++......+.++..+.++.+ +... .. .|..--....++++...+|+.+++|+...+ .|+++..++...... T Consensus 182 ~~~~~~i~~~~~k~~~~~~is~e-ll~d--~~-~l~~~v~~~la~a~~~~~d~~~l~G~g~~~~~~Gi~~~~~~~~~~~~ 257 (395) T protein:vir:43 182 DLTFELENAPVRTIAHLFKASRQ-ILDD--AS-ALQSYIDARARYGLMLVEECQLLYGNGTGANLHGIIPQAQAYAPPSG 257 (395) T ss_pred ccceeEEEEeeeeEEEeehhhHH-HHHh--HH-HHHHHHHHHHHHHHHHHHHHHHHhccCCCCccccccccccccccccc Confidence 99999999999999999999954 5432 22 578888888999999999999999986544 499988776443322 Q ss_pred CCccccCcccccCCHHHHHHHHHHHHHHHHhccCCceecceEEEeCHHHHHHHhcccccCCCCCcchHHHHHHHh-C--- Q lcl|NC_019522. 158 STFVALVAAIPTNGTQPIIDFFGNAYNTVYLDNTLTVHRPNTFVLPPAQFQLLARTLLSTQNASNVTLLQFLRTN-F--- 233 (311) Q Consensus 158 ~~~~~~~t~w~~~t~~ei~~di~~~~~~~~~~~~~~~~~p~~l~lpp~~~~~L~~~~~~~~~~~~~Tvl~~l~~n-~--- 233 (311) . ..+.+..+++|.+++..+... + ..+..++|+|+.+..|.+-. +..|.-+..-.... . T Consensus 258 ~----------~~~~~~~~~~i~~~~~~~~~~--~--~~~~~~vmn~~~~~~l~~lk----d~~G~~i~~~~~~~~~~~l 319 (395) T protein:vir:43 258 V----------VVTAEQRIDRIRLAILQAQLA--E--FPASGIVLNPIDWALIELNK----DAENRYIIGSPQNGTTPTL 319 (395) T ss_pred c----------ccccchhHHHHHHHHHhhccc--c--CCCcEEEEcHHHHHHHHHhh----ccCCceeccccccCCCcee Confidence 1 123445688888888887432 2 23568999999999886532 11122222111100 0 Q ss_pred CceEEEEchhcccCC-CCc--ccEEEEEEcCcceeEEeecchhhhccceeeCCceEEEeeeeeeeeEEEECCeEEEEeec Q lcl|NC_019522. 234 PDITFEDDILLKGAG-VAG--ADRMAVYKKEIRIVKGHDVMPLRFLAPATADNVNFKVPAILRTGGTEWRIPKAGHYVDG 310 (311) Q Consensus 234 ~~l~i~~~~~l~~ag-~~g--~~~~v~y~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~gGv~i~~P~ai~~~dG 310 (311) -.+.++..+.+...- ..| ++...++++. .+.+.+... .....+.+ .+.+.++.|+ ++.+++|.++++++- T Consensus 320 ~G~pVv~~~~~~~~~~~~gd~~~~~~~~~~~--~~~i~~~~~--~~~~f~~~--~~~~r~~~r~-d~~v~~~~a~~~~~~ 392 (395) T protein:vir:43 320 WRLPVVETQAITQDEFLTGAFSLGAQIFDRM--DIEVLVSTE--NDKDFENN--MVTIRAEERL-AFAVYRPEAFVTGSL 392 (395) T ss_pred cceeeEEcCCCCCCcEEEEeccceEEEEEec--ceEEEEecc--ccchhhcC--cEEEEEEEee-ccEEecccceEEEEe Confidence 112344444442210 001 0111122111 111111000 00001111 2344555666 577899999999865 Q ss_pred C Q lcl|NC_019522. 311 V 311 (311) Q Consensus 311 I 311 (311) = T Consensus 393 t 393 (395) T protein:vir:43 393 T 393 (395) T ss_pred c Confidence 5 No 51 >protein:vir:485 Length: 407 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:11 # MgeName: P27 # Cross-refs: genbank:acc:NP_543092;swissprot:trembl:q8w627;genbank:gi:18249904;uniprot:Q8W627;genbank:GeneID:929693 Probab=98.48 E-value=2.6e-08 Score=62.13 Aligned_cols=289 Identities=9% Similarity=-0.024 Sum_probs=156.8 Q ss_pred CCcccccccchhhhhhhHHHHHHHHHHHHhhhhhhhhhhhhccccCCCCcceeEEEEEEeecccceEEecCcccccceee Q lcl|NC_019522. 1 MAKSVFDVSPVSALSFLVNQAAHIESEIYRIEYPQFKYGTLLPLDNSAPDWAQAVMFRSIDARGELQLFGPNSTDVPTVD 80 (311) Q Consensus 1 ~~~~~~~~~~~~~~~fl~~~L~~id~~v~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~G~a~~~~~~a~dip~v~ 80 (311) ...-++.+.+.+.+++++. +.+.++|++........+.++.+. +-....+.+.+......+.|++.+ ..+|-.+ T Consensus 101 ~e~~a~~~~t~~~gG~~iP--~~~~~~I~~~~~~~~~l~~~~~~~---~~~~~~~~~~~~~~~~~a~~v~E~-~~~~~~~ 174 (407) T protein:vir:48 101 LERKALQVGNDEDGGYAIP--EELDRTILTLLKDEVVMRQEATVI---TLGGSDYKKLVNLGGTTSGWVGET-DARPETA 174 (407) T ss_pred HHHHhhhcccCCCCccccc--HhHHHHHHHHHHhhhhhhhhceee---ecCCCceEEEEecCCcceeeeccc-ccccccc Confidence 1122344444444566665 567888888877766666665543 223334556566666677787665 4456544 Q ss_pred -eeccceeEEEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhhheeeeeccccc-eeeeecCCcceeeccC Q lcl|NC_019522. 81 -IAMSQGFKDINTAALGYTYSIEEIGFAMLNNVNLDAERGQAVRDVVEQGLNKIYLLGDKGVG-EGLYTSPNVSVEAATS 158 (311) Q Consensus 81 -~~~~~~~~~v~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aa~~~~~~~~n~~~~~G~~~~g-~GllN~p~v~~~~~~~ 158 (311) ..++.....++.++.-+.+|.+=|. ....++...-.....+++...+++-+++|+.... .|+|+++.+....... T Consensus 175 ~~~f~~i~~~~~k~~~~~~iS~ell~---ds~~~l~~~i~~~l~~~i~~~~~~a~l~G~G~~~p~Gil~~~~~~~~~~~~ 251 (407) T protein:vir:48 175 TSKLGLIEPFMGEIYGNPQATQKMLD---DAFFNVEDWINSELALEFAEQEEIAFTSGDGSKKPKGFLAYESTDEDDKTR 251 (407) T ss_pred cccceeEEeeeeeeEeehhhHHHHHh---cchHHHHHHHHHHHHHHHHHHHHhhhhccCCCCccceeeeccccccccccc Confidence 4678888888999988888855333 3456788888899999999999999999987644 6999998865543321 Q ss_pred CccccCcccccCCHHHH-HHHHHHHHHHHHhccCCceecceEEEeCHHHHHHHhcccccCCCCCcchHHHHHHHhCC--- Q lcl|NC_019522. 159 TFVALVAAIPTNGTQPI-IDFFGNAYNTVYLDNTLTVHRPNTFVLPPAQFQLLARTLLSTQNASNVTLLQFLRTNFP--- 234 (311) Q Consensus 159 ~~~~~~t~w~~~t~~ei-~~di~~~~~~~~~~~~~~~~~p~~l~lpp~~~~~L~~~~~~~~~~~~~Tvl~~l~~n~~--- 234 (311) ... ......+.++..+ ++||.+++..|.. .+. ..-+++++++.+..|.+-. +..|.-++.-=..++. T Consensus 252 ~~~-~~~~~~~~~~~~~~~d~i~~l~~~l~~--~~~--~~a~~v~n~~~~~~L~~lk----D~~Gr~l~~~~~~~g~~~~ 322 (407) T protein:vir:48 252 AFG-KLQHIASGAASGVTADAIIKLIYTLRK--AHR--SGAKFMMNNSSLFAIRLLK----DNDGNYLWRPGIELGQPSS 322 (407) T ss_pred ccc-cccccccccccccChHHHHHHHHhhch--hhh--cCCEEEEcHHHHHHHHHhh----ccCCceeeccCcCCCCCce Confidence 111 1111112222221 5667777776632 121 1236889999999886421 1111111100000111 Q ss_pred --ceEEEEchhcccCCCCcccEEEEEEcCcceeEEeecchhhhcc-ceeeCCceEEEeeeeeeeeEEEECCeEEEEeecC Q lcl|NC_019522. 235 --DITFEDDILLKGAGVAGADRMAVYKKEIRIVKGHDVMPLRFLA-PATADNVNFKVPAILRTGGTEWRIPKAGHYVDGV 311 (311) Q Consensus 235 --~l~i~~~~~l~~ag~~g~~~~v~y~~~~~~~~~~~~~~~~~~~-p~~~~~~~~~~~~~~~~gGv~i~~P~ai~~~dGI 311 (311) ...++....+...+ .|++. ++|-+=.+.+.+..-+.+++.. +.-..+ ...+.++.|++ +.+.+|.||+.+..= T Consensus 323 l~G~PV~~~~~~p~~~-~~~~~-i~~Gd~~~~~~i~~~~~~~i~~d~~~~~~-~~~~~~~~r~d-~~v~~~~a~~~l~~~ 398 (407) T protein:vir:48 323 LAGYGIVENEQMPDIA-ADAKA-IAFGNFKRGYTIVDRIGTRILRDPYTNKP-FVGFYTTKRTG-GMLVDSQAIKLMKIG 398 (407) T ss_pred ecceeeEEecCcCCcc-CCccE-EEEEeccccEEEEEeeceEEEeeccccCC-cEEEEEEEEec-cEEecccceEEEEee Confidence 11233333343333 22333 3332222222211111122111 111122 24566778885 567789999876654 No 52 >protein:vir:100247 Length: 425 # NCBI annotation: gp76 # Family: family:all:21 # MgeID: mge:1619 # MgeName: Bcep176 # Cross-refs: genbank:acc:YP_355412;genbank:gi:77864702;genbank:GeneID:3725969 Probab=98.47 E-value=3.4e-08 Score=61.54 Aligned_cols=289 Identities=11% Similarity=0.012 Sum_probs=156.3 Q ss_pred CC----cccccccchhhhhhhHHHHHHHHHHHHhhhhhhhhhhhhccccCCCCcceeEEEEEEeecccceEEecCccccc Q lcl|NC_019522. 1 MA----KSVFDVSPVSALSFLVNQAAHIESEIYRIEYPQFKYGTLLPLDNSAPDWAQAVMFRSIDARGELQLFGPNSTDV 76 (311) Q Consensus 1 ~~----~~~~~~~~~~~~~fl~~~L~~id~~v~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~G~a~~~~~~a~di 76 (311) +. ..++...+.+.+++++. +.+.+.|++.....-..++++.+.+-.. ....+.+......+.|.+.+ ..+ T Consensus 121 l~~~e~~~al~~~t~~~gG~lvP--~~~~~~ii~~~~~~s~l~~l~~~~~~~~---~~~~~~~~~~~~~a~wv~E~-~~~ 194 (425) T protein:vir:10 121 VKRGDVQAALNKGEDSEGGYLTP--IEWDRTITNKLVLISPMRQLCRVQPVSK---AGFSKLFNMGGTTSGWVGEA-SQR 194 (425) T ss_pred hhhhhhHHHhhcCcCCCCceecc--HhHHHHHHHHHHhhhhhhhhceeeeccC---CceEEEEEcCCcceeeeccc-ccc Confidence 10 12233334455567775 5577888888887777777776543222 23344444555677787765 446 Q ss_pred ceee-eeccceeEEEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhhheeeeeccccc-eeeeecCCccee Q lcl|NC_019522. 77 PTVD-IAMSQGFKDINTAALGYTYSIEEIGFAMLNNVNLDAERGQAVRDVVEQGLNKIYLLGDKGVG-EGLYTSPNVSVE 154 (311) Q Consensus 77 p~v~-~~~~~~~~~v~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aa~~~~~~~~n~~~~~G~~~~g-~GllN~p~v~~~ 154 (311) |..+ ..+++.....+.++.-+.+|.+ +. .....+|...-.....+++.+.+|+-+++|+.... .|+||++..... T Consensus 195 ~~~~~~~f~~v~~~~~k~~~~i~iS~e-ll--~ds~~~l~~~i~~~la~ai~~~~d~~~l~G~G~~~p~Gil~~~~~~~~ 271 (425) T protein:vir:10 195 PQTNAATFQPLSFASGEIYANPAATQQ-IL--DDAEIDLESWLATEVQTEFAKQEGKAFLAGDGTNKPNGLLTYIAGGAN 271 (425) T ss_pred ccccccccceeeeeheeeEeehHhHHH-HH--hcchhHHHHHHHHHHHHHHHHHHHhhhhcccCCCCcceeeeccccccc Confidence 7655 3678888888898888888754 32 34467899999999999999999999999987544 699998875443 Q ss_pred eccCCccccCccc-ccCCHHHHHHHHHHHHHHHHhccCCceecceEEEeCHHHHHHHhcccccCCCCCcchHHHHHHHhC Q lcl|NC_019522. 155 AATSTFVALVAAI-PTNGTQPIIDFFGNAYNTVYLDNTLTVHRPNTFVLPPAQFQLLARTLLSTQNASNVTLLQFLRTNF 233 (311) Q Consensus 155 ~~~~~~~~~~t~w-~~~t~~ei~~di~~~~~~~~~~~~~~~~~p~~l~lpp~~~~~L~~~~~~~~~~~~~Tvl~~l~~n~ 233 (311) ..... .+....- ...+..--++||.+++..+.. .+ ...-+++|+|+.+..|.+-. +..|.-++.-=..++ T Consensus 272 ~~~~~-~~~~~~~~~~~~~~~~~d~l~~l~~~l~~--~~--~~~a~~vmn~~~~~~L~~lk----D~~G~~l~~~~~~~g 342 (425) T protein:vir:10 272 AAKHP-FGAIEVVNSGAAADITSDGIIDLVYDLPS--AF--TGNARFAMNRNTQRQVRKLK----DGQGNYLWQPSYVAG 342 (425) T ss_pred ccccc-ccccccccccccccccHHHHHHHHhhhhh--hh--ccCCEEEEchHHHHHHHHhh----cCCCceeeccCccCC Confidence 22211 0000000 011222235566666666632 22 12347899999999986421 111221110000011 Q ss_pred C-----ceEEEEchhcccCCCCcccEEEEEEcCcceeEEeecchhhhcc-ceeeCCceEEEeeeeeeeeEEEECCeEEEE Q lcl|NC_019522. 234 P-----DITFEDDILLKGAGVAGADRMAVYKKEIRIVKGHDVMPLRFLA-PATADNVNFKVPAILRTGGTEWRIPKAGHY 307 (311) Q Consensus 234 ~-----~l~i~~~~~l~~ag~~g~~~~v~y~~~~~~~~~~~~~~~~~~~-p~~~~~~~~~~~~~~~~gGv~i~~P~ai~~ 307 (311) . ...++.+..+...+ .+.+. ++|-+-.+.+.+.....+++.. +.-..+ ...+..+.|+ ++.+.+|.|++. T Consensus 343 ~~~~l~G~PV~~~~~~p~~~-~~~~~-i~~Gd~~~~~~i~~~~~~~v~~d~~~~~~-~~~~~~~~r~-d~~v~~~~A~~~ 418 (425) T protein:vir:10 343 QPATLAGYPVTEVPDMPDVA-ANSTP-ILFGDFQQTYLIIDRIGVRVLRDPYTAKP-YVLFYTTKRV-GGGLLNPEPMRA 418 (425) T ss_pred CCceecceeeEEecCcCCcc-CCccE-EEEEehhccEEEEEecceEEEecccccCC-cEEEEEEEEe-ccEeecccceEE Confidence 1 12233344443333 23333 3333322222221112222211 111122 2455566787 466777999977 Q ss_pred eecC Q lcl|NC_019522. 308 VDGV 311 (311) Q Consensus 308 ~dGI 311 (311) +..= T Consensus 419 l~~~ 422 (425) T protein:vir:10 419 MKVA 422 (425) T ss_pred EEee Confidence 6544 No 53 >protein:vir:78223 Length: 333 # NCBI annotation: Putative major head protein # Family: family:all:966 # MgeID: mge:1849 # MgeName: Bethlehem # Cross-refs: genbank:acc:YP_001491666;genbank:gi:157786490;genbank:GeneID:5625701 Probab=98.47 E-value=5.3e-08 Score=60.46 Aligned_cols=286 Identities=14% Similarity=0.013 Sum_probs=155.0 Q ss_pred CCc------ccccccch----hhhhhhHHHHHHHHHHHHhhhhhhhhhhhhccccCCCCcceeEEEEEEeecccceEEec Q lcl|NC_019522. 1 MAK------SVFDVSPV----SALSFLVNQAAHIESEIYRIEYPQFKYGTLLPLDNSAPDWAQAVMFRSIDARGELQLFG 70 (311) Q Consensus 1 ~~~------~~~~~~~~----~~~~fl~~~L~~id~~v~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~G~a~~~~ 70 (311) ||. ..+-+..+ +..+.+.. +.+-.+|++...+.-..+++..+.. .......+.+......+.|++ T Consensus 1 ~a~l~el~~~~~~~~~~g~~~~~~~~liP--~~~~~~ii~~l~~~s~l~~~~~~~~---~~~~~~~~p~~~~~~~a~~v~ 75 (333) T protein:vir:78 1 MATLNELLPNSAGSNHQGRLAHVPSDLLP--KEIVGPIFDKAQESSLVLRMGEQIP---ISYGETIIPTTVKRPEVGQVG 75 (333) T ss_pred CchhHHhhhhcccccccCceecCCccccc--hhHHHHHHHHHHhhchhhhhcceee---ccCCceEEEEEeCCceeEeec Confidence 322 11111111 11111332 4566778888887777777776543 222344555666555565554 Q ss_pred Cc-------ccccceeeeeccceeEEEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhhheeeeecccc-- Q lcl|NC_019522. 71 PN-------STDVPTVDIAMSQGFKDINTAALGYTYSIEEIGFAMLNNVNLDAERGQAVRDVVEQGLNKIYLLGDKGV-- 141 (311) Q Consensus 71 ~~-------a~dip~v~~~~~~~~~~v~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aa~~~~~~~~n~~~~~G~~~~-- 141 (311) .+ +..+|..+..+++.....+.++....++.+ +.. ....++..--.....+++.+.+|+-+++|+... T Consensus 76 eg~~~~~~e~~~~~~~~~~f~~i~l~~~kl~~~~~is~e-ll~--~s~~~~~~~i~~~la~ai~~~~d~~~l~G~g~~~~ 152 (333) T protein:vir:78 76 VGTSNEQREGGLKPLSGTAWDTRSVSPIKLATIVTVSEE-FAR--MNPSGLYTKLQGDLAYAIGRGIDLAVFHGKSPLTG 152 (333) T ss_pred CcccccccccccccccccceeEEEEeeEEEEEeehhhHH-HHh--cCHHHHHHHHHHHHHHHHHHHHHHHHhcccCCCCC Confidence 32 345788888889999999999999999853 322 334578888899999999999999999998753 Q ss_pred -c-eeeeecCCcceeeccCCccccCcccccCCHHHHHHHHHHHHHHHHhccCCceecceEEEeCHHHHHHHhcccccCCC Q lcl|NC_019522. 142 -G-EGLYTSPNVSVEAATSTFVALVAAIPTNGTQPIIDFFGNAYNTVYLDNTLTVHRPNTFVLPPAQFQLLARTLLSTQN 219 (311) Q Consensus 142 -g-~GllN~p~v~~~~~~~~~~~~~t~w~~~t~~ei~~di~~~~~~~~~~~~~~~~~p~~l~lpp~~~~~L~~~~~~~~~ 219 (311) + .|++|..++...+... -...+.+..+++|.+++..+... + -..+..++|.|+.+..|.+-... .+ T Consensus 153 ~~~~g~~~~~~~~~~~~~~--------~~~~~~~~~~~~i~~~~~~~~~~--~-~~~~~~~vmn~~~~~~L~~~~~~-~d 220 (333) T protein:vir:78 153 SALQGIDTDNVIANTTNVD--------YLQETGDPLLDRLLDGYDLVSAN--T-DVEFNGWAVDPRFRAHLLRAQAY-RD 220 (333) T ss_pred ccccccccccccccccccc--------ccccccchhHHHHHHHHHhhccc--c-ccCceEEEEcchHHHHHHHHhhh-cC Confidence 2 3777766654332211 11122333477788887776432 1 23456799999988777542211 11 Q ss_pred CCcchHHHHHHHhCC-----ceEEEEchhccc---CCCCcccEEEEEEcCcceeEEeecchhhhcc-----c-------- Q lcl|NC_019522. 220 ASNVTLLQFLRTNFP-----DITFEDDILLKG---AGVAGADRMAVYKKEIRIVKGHDVMPLRFLA-----P-------- 278 (311) Q Consensus 220 ~~~~Tvl~~l~~n~~-----~l~i~~~~~l~~---ag~~g~~~~v~y~~~~~~~~~~~~~~~~~~~-----p-------- 278 (311) ..+.-++........ .+-++....+.. .+.+++..+++-+. .++ -+.+..+++... + T Consensus 221 ~~G~~i~~~~~~~~~~~~l~G~Pv~~~~~i~~~~~~~~~~~~~~~~gD~-~~~-~~g~~~~~~i~~~~~~~~~~~~~~~~ 298 (333) T protein:vir:78 221 ANGNVDPSRINLAAQTGDVLGLPAQFGRAVGGDLGAAVDSKTRIIGGDF-SQL-KFGFADEIRIKMSDTATLTDSGSATV 298 (333) T ss_pred CCCceeecCccccCCCceeeceeeEEccccCCCccccCCCccEEEEEec-ccE-EEEEeeccEEEEecccccccccccee Confidence 112222222111111 122333333321 22222233322222 222 222222221110 0 Q ss_pred --eeeCCceEEEeeeeeeeeEEEECCeEEEEeecC Q lcl|NC_019522. 279 --ATADNVNFKVPAILRTGGTEWRIPKAGHYVDGV 311 (311) Q Consensus 279 --~~~~~~~~~~~~~~~~gGv~i~~P~ai~~~dGI 311 (311) .+. + ...+.++.|+ ++.+++|.|++++.+- T Consensus 299 ~~~~~-~-~v~~r~~~r~-d~~v~~~~a~~~l~~~ 330 (333) T protein:vir:78 299 SMWQT-N-QIAILIEVTF-GWLLGDKQAFVKFVDD 330 (333) T ss_pred ehhhc-C-cEEEEEEEEE-ccEEecccceEEEecc Confidence 011 1 1345666776 5778999999999999 No 54 >protein:vir:9759 Length: 303 # NCBI annotation: putative structural protein # Family: family:all:966 # MgeID: mge:175 # MgeName: 315.3 # Cross-refs: genbank:acc:NP_795521;genbank:gi:28876283;genbank:GeneID:1257824 Probab=98.44 E-value=5.5e-08 Score=60.37 Aligned_cols=275 Identities=8% Similarity=-0.064 Sum_probs=154.2 Q ss_pred ccccchhhhhhhHHHHHHHHHHHHhhhhhhhhhhhhccccCCCCcceeEEEEEEeecccceEEecCcccccceeeeeccc Q lcl|NC_019522. 6 FDVSPVSALSFLVNQAAHIESEIYRIEYPQFKYGTLLPLDNSAPDWAQAVMFRSIDARGELQLFGPNSTDVPTVDIAMSQ 85 (311) Q Consensus 6 ~~~~~~~~~~fl~~~L~~id~~v~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~G~a~~~~~~a~dip~v~~~~~~ 85 (311) |-+++ ..+++.. +.+..+|++...+.-..+++.++.. .......+.+....+.+.|++.+ ..+|..+..++. T Consensus 1 m~t~t--~gg~liP--~~~~~~ii~~l~~~s~i~~l~~~~~---~~~~~~~ip~~~~~~~a~wv~E~-~~~~~s~~~f~~ 72 (303) T protein:vir:97 1 MGTET--SKASLFD--KHLVSDLINKVKGHSSLAKLSSQKP---IPFNGSKEFTFTLDSDIDVVAEN-GKKTHGGLSLEP 72 (303) T ss_pred CcccC--CCCeEcc--hhHHHHHHHHHHhhchhhhhcceee---cCCCceEEEEEecCcceEEeecC-ccccccccceee Confidence 33333 2334554 4456778888877777777776543 22234566677777889999865 668999999999 Q ss_pred eeEEEEEEEEEEEecHHHHH-HHHHhCCChHHHHHHHHHHHHHHhhhheeeeecccc-ceee-----eecCCcceeeccC Q lcl|NC_019522. 86 GFKDINTAALGYTYSIEEIG-FAMLNNVNLDAERGQAVRDVVEQGLNKIYLLGDKGV-GEGL-----YTSPNVSVEAATS 158 (311) Q Consensus 86 ~~~~v~~~~~~~~~~~~El~-~a~~~g~~l~~~k~~aa~~~~~~~~n~~~~~G~~~~-g~Gl-----lN~p~v~~~~~~~ 158 (311) ...+.+.++..+.+|. |+. .......+|...-.....+++.+.+|+-+++|+... |.+. .+..+...... T Consensus 73 v~l~~~kl~~~~~iS~-ell~~~~d~~~~l~~~i~~~la~a~~~~ld~a~l~G~~~~~g~~~~~~~~~~~~~~~~~~~-- 149 (303) T protein:vir:97 73 VTIVPIKVEYGARLSD-EFLYATEEEKIDILKAFNEGFAKKLARGIDLMAMHGINPRTKKASDVIGTNHFDSKVTQVV-- 149 (303) T ss_pred EEeeeEEEEEeehhhH-HHhhcCccchHHHHHHHHHHHHHHHHHHHHhhhhcccccCCcccccccccccccccccccc-- Confidence 9999999999999984 444 333456778888999999999999999999996432 2222 11111111000 Q ss_pred CccccCcccccCCHHHHHHHHHHHHHHHHhccCCceecceEEEeCHHHHHHHhcccccCCCCCcchHHH-HHHHhC---- Q lcl|NC_019522. 159 TFVALVAAIPTNGTQPIIDFFGNAYNTVYLDNTLTVHRPNTFVLPPAQFQLLARTLLSTQNASNVTLLQ-FLRTNF---- 233 (311) Q Consensus 159 ~~~~~~t~w~~~t~~ei~~di~~~~~~~~~~~~~~~~~p~~l~lpp~~~~~L~~~~~~~~~~~~~Tvl~-~l~~n~---- 233 (311) ...+.+..++||.++++.+... . ..|..++|+|+.+..|.+-. ..+ +.-++. -+.... T Consensus 150 ---------~~~~~~~~~~~i~~~~~~~~~~-~---~~~~~~vmn~~~~~~L~~lk-d~~---g~~~~~~~~~~~~~~~~ 212 (303) T protein:vir:97 150 ---------KFTESEDADANIEAAVNLIQGA-E---GVVTGLAMDTEFSTALAKVT-NGE---MGPKMYPELAWGANPDS 212 (303) T ss_pred ---------ccccccchHHHHHHHHHHHhhc-C---CCccEEEEcHHHHHHHHHhh-ccC---CCeEEecCccCCCCCce Confidence 0112223478999999887432 2 34678999999999886422 111 111110 000000 Q ss_pred -CceEEEEchhcccCCCCcc--cEEEEEEcC-------cceeEEeecchhh---h-ccceeeCCceEEEeeeeeeeeEEE Q lcl|NC_019522. 234 -PDITFEDDILLKGAGVAGA--DRMAVYKKE-------IRIVKGHDVMPLR---F-LAPATADNVNFKVPAILRTGGTEW 299 (311) Q Consensus 234 -~~l~i~~~~~l~~ag~~g~--~~~v~y~~~-------~~~~~~~~~~~~~---~-~~p~~~~~~~~~~~~~~~~gGv~i 299 (311) ..+.++....+.+.+..+. +.+++-+.+ .+.+++.+..... . ..-.+.+. .-+.++.|+ +..+ T Consensus 213 l~G~Pv~~s~~v~~~~~~~~~~~~~~~Gdf~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~n~--~~~r~~~r~-~~~v 289 (303) T protein:vir:97 213 INGLKSSVNTTVGAGADEAESKDLVIIGDFESMFKWGYAKQIPMEIIKYGDPDNSGKDLKGYNQ--IYLRAEAYI-GWGI 289 (303) T ss_pred ecceeeEEecccCCccccCCCccEEEEeeccccEEEEEecCcEEEEeeccCCCCcchhhhhcCc--EEEEEEEEe-ccEe Confidence 0123333333333222222 222211111 1222332211000 0 00012111 345567776 5778 Q ss_pred ECCeEEEEeecC Q lcl|NC_019522. 300 RIPKAGHYVDGV 311 (311) Q Consensus 300 ~~P~ai~~~dGI 311 (311) ++|.||+++... T Consensus 290 ~~p~af~~l~~~ 301 (303) T protein:vir:97 290 LDAKSFARVTKG 301 (303) T ss_pred ecccceEEeeCC Confidence 999999999999 No 55 >protein:vir:81227 Length: 413 # NCBI annotation: gp6, major capsid protein # Family: family:all:585 # MgeID: mge:1893 # MgeName: BFK20 # Cross-refs: genbank:acc:YP_001456736;genbank:gi:157168379;hssp:P49861;interpro:IPR006444;uniprot:Q9MBJ9;genbank:GeneID:5580350 Probab=98.42 E-value=1.5e-07 Score=57.92 Aligned_cols=273 Identities=12% Similarity=0.055 Sum_probs=149.2 Q ss_pred CCcccccccchhhhhhhHHHHHHHHHHHHhhhhhhhhhhhhccccCCCCcceeEEEEEEee----cccceEEecCccccc Q lcl|NC_019522. 1 MAKSVFDVSPVSALSFLVNQAAHIESEIYRIEYPQFKYGTLLPLDNSAPDWAQAVMFRSID----ARGELQLFGPNSTDV 76 (311) Q Consensus 1 ~~~~~~~~~~~~~~~fl~~~L~~id~~v~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~----~~G~a~~~~~~a~di 76 (311) +.............+.+.. +.+.+.|++........++++++..-.. .+..|.+.. ..+.+.|++.+ ..+ T Consensus 113 ~~~~~~~~~~~~~~~~~vp--~~~~~~ii~~~~~~~~l~~~~~~~~~~~---~~~~~~~~~~~~~~~~~a~~v~Eg-~~~ 186 (413) T protein:vir:81 113 ASDPASTATLTDEFQGGYG--TTWNRNIIYRRREKLVVADLMDNLTMTN---TTIKYLMEKANRVVEGGFKTVAEG-GKK 186 (413) T ss_pred hhhhhhhcccccccccccc--hhhHHHHHHHHhhhhhHHhhcceeeccC---CceeEEEeccccccccccceecCc-ccc Confidence 1111111111222333343 5577889998888888888877554322 233333322 23466787755 446 Q ss_pred ceeee-eccceeEEEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhhheeeeeccccc--eeeeecCCcce Q lcl|NC_019522. 77 PTVDI-AMSQGFKDINTAALGYTYSIEEIGFAMLNNVNLDAERGQAVRDVVEQGLNKIYLLGDKGVG--EGLYTSPNVSV 153 (311) Q Consensus 77 p~v~~-~~~~~~~~v~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aa~~~~~~~~n~~~~~G~~~~g--~GllN~p~v~~ 153 (311) |..+. .++.....++.++..+.+|.+ +.... . .|..--....++++...+|+.+++|+...+ .|+++.+++.. T Consensus 187 ~~~~~~~f~~i~~~~~k~~~~~~iS~e-ll~ds--~-~l~~~i~~~la~~~~~~~d~~~l~G~G~~~~~~Gi~~~~~~~~ 262 (413) T protein:vir:81 187 PYMRFADFDIVTESLSKIAGLTKITDE-MIEDY--D-FLVSYINARLLEELAIEEERQLLLGDGTGNNLTGLLKRDGIQT 262 (413) T ss_pred cccCcccceeeEeeeeeEEEeehhhHH-HHHHH--H-HHHHHHHHHHHHHHHHHHHHHHhccCCCCCccccccccccccc Confidence 76664 578889999999999999965 54322 2 377888888899999999999999986544 49999888653 Q ss_pred eeccCCccccCcccccCCHHHHHHHHHHHHHHHHhccCCceecceEEEeCHHHHHHHhcccccCCCCCcchHHHH-HHH- Q lcl|NC_019522. 154 EAATSTFVALVAAIPTNGTQPIIDFFGNAYNTVYLDNTLTVHRPNTFVLPPAQFQLLARTLLSTQNASNVTLLQF-LRT- 231 (311) Q Consensus 154 ~~~~~~~~~~~t~w~~~t~~ei~~di~~~~~~~~~~~~~~~~~p~~l~lpp~~~~~L~~~~~~~~~~~~~Tvl~~-l~~- 231 (311) .... +.+..++++..++..+..... ..++.++|+|+.+..|.+-. +..|.-++.- +.. T Consensus 263 ~~~~-------------~~~~~~~~i~~~~~~~~~~~~---~~~~~~vmn~~~~~~l~~lk----d~~G~~l~~~~~~~~ 322 (413) T protein:vir:81 263 LAVS-------------NKDELADSIYKAMTNISLATP---FQADALVINPLDYQELRLAK----DANGQYYGGGVFQGQ 322 (413) T ss_pred cccc-------------ccchhHHHHHHHHHHhhhhcc---CCCcEEEEcHHHHHHHHHhh----ccCCceecccccccc Confidence 3222 223456777777776643222 24667999999999886432 1111111110 000 Q ss_pred h----------CCceEEEEchhcccCCCCcccEEEEEEcCcceeEEeecchhhhccceee----CCceEEEeeeeeeeeE Q lcl|NC_019522. 232 N----------FPDITFEDDILLKGAGVAGADRMAVYKKEIRIVKGHDVMPLRFLAPATA----DNVNFKVPAILRTGGT 297 (311) Q Consensus 232 n----------~~~l~i~~~~~l~~ag~~g~~~~v~y~~~~~~~~~~~~~~~~~~~p~~~----~~~~~~~~~~~~~gGv 297 (311) + .....++....+. +| . ++|-+=.+.+-+.....+++..-.+. ......+.++.|+ ++ T Consensus 323 ~~~~~~~~~~~l~G~pv~~s~~~~-~~-----~-~~~gd~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~r~~~r~-d~ 394 (413) T protein:vir:81 323 YGSGGIMLDPAPWGLRTVQSQVVP-VG-----K-PVVGAFRSAASVLRKGGVRIDSTNTNVDDFENNLITVRAEERV-GL 394 (413) T ss_pred ccccccccCceecceeeEEcCCCC-cc-----c-EEEEecccEEEEEEecceEEEEeccccchhhcCcEEEEEEEee-cc Confidence 0 0011222222221 11 1 11211111121111122211100010 0112456667787 46 Q ss_pred EEECCeEEEEeecC Q lcl|NC_019522. 298 EWRIPKAGHYVDGV 311 (311) Q Consensus 298 ~i~~P~ai~~~dGI 311 (311) .+++|.++++++.= T Consensus 395 ~~~~~~a~~~l~~~ 408 (413) T protein:vir:81 395 MVTFPEAIVQLDVA 408 (413) T ss_pred EEecccceEEEEec Confidence 77999999998866 No 56 >protein:vir:8102 Length: 543 # NCBI annotation: gp6 # Family: family:all:21 # MgeID: mge:152 # MgeName: Che9c # Cross-refs: genbank:acc:NP_817683;genbank:gi:29566114;genbank:GeneID:1259308 Probab=98.38 E-value=1.2e-07 Score=58.44 Aligned_cols=280 Identities=10% Similarity=-0.015 Sum_probs=149.1 Q ss_pred CCcccccccchhhhhhhHHHHHHHHHHHHhhhhhh-hhhhhhccccCCCCcceeEEEEEEeecccceEEecCccccccee Q lcl|NC_019522. 1 MAKSVFDVSPVSALSFLVNQAAHIESEIYRIEYPQ-FKYGTLLPLDNSAPDWAQAVMFRSIDARGELQLFGPNSTDVPTV 79 (311) Q Consensus 1 ~~~~~~~~~~~~~~~fl~~~L~~id~~v~~~~~~~-~~~~~~~~v~~~~~~~~~~~~~~~~~~~G~a~~~~~~a~dip~v 79 (311) +......+.+...+++++. +.+.++++.....+ -..+.+..+... .-.+.+.+....+.+.|++.+ ..+|.. T Consensus 245 ~~~~~~~~~t~~~gg~lip--~~~~~~ii~~~~~~~~~l~~~~~~~~~----~g~~~~~~~~~~~~a~~v~Eg-~~~~~~ 317 (543) T protein:vir:81 245 INEVRAMGLTKADGGYLVP--FQLDPTVIITSNGSLNDIRRFARQVVA----TGDVWHGVSSAAVQWSWDAEF-EEVSDD 317 (543) T ss_pred hhhhhhcccccccCcccCc--hhhhhHHHHHHHhhhchhhhhcccccC----CcceEEEEecCCcceeecccC-cccccc Confidence 1111111112223344443 23444555433332 334455443221 223445566667788888765 557888 Q ss_pred eeeccceeEEEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhhheeeeeccccc--eeeeecCCcceeecc Q lcl|NC_019522. 80 DIAMSQGFKDINTAALGYTYSIEEIGFAMLNNVNLDAERGQAVRDVVEQGLNKIYLLGDKGVG--EGLYTSPNVSVEAAT 157 (311) Q Consensus 80 ~~~~~~~~~~v~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aa~~~~~~~~n~~~~~G~~~~g--~GllN~p~v~~~~~~ 157 (311) +..++......+.++..+.+|. |+.. ...++...-.....+++...+|+.+|+|+..-+ .|+++.+........ T Consensus 318 ~~~~~~i~~~~~k~~~~~~is~-ell~---d~~~~~~~i~~~l~~~~~~~~d~ail~G~Gt~~~p~Gi~~~~~~~~~~~~ 393 (543) T protein:vir:81 318 SPEFGQPEIPVKKAQGFVPISI-EALQ---DEANVTETVALLFAEGKDELEAVTLTTGTGQGNQPTGIVTALAGTAAEIA 393 (543) T ss_pred ccccceeeeeeeeeEeeehhhH-HHHh---ccHHHHHHHHHHHHHHHHHHHHHHHhccCCCCcccccchhhccccccccc Confidence 9999999999999999999996 4543 224889999999999999999999999986543 599988664332222 Q ss_pred CCccccCcccccCCHHHHHHHHHHHHHHHHhccCCceecceEEEeCHHHHHHHhcccccCCCCCcchHHHHHHHhCC--- Q lcl|NC_019522. 158 STFVALVAAIPTNGTQPIIDFFGNAYNTVYLDNTLTVHRPNTFVLPPAQFQLLARTLLSTQNASNVTLLQFLRTNFP--- 234 (311) Q Consensus 158 ~~~~~~~t~w~~~t~~ei~~di~~~~~~~~~~~~~~~~~p~~l~lpp~~~~~L~~~~~~~~~~~~~Tvl~~l~~n~~--- 234 (311) .. + +..-.++|+.+++..+-. .+. ....++|+|..+..|.+..-+ . |.=++.-+....+ T Consensus 394 ~~-~---------~~~~~~~~~~~~~~~l~~--~~~--~~~~~v~n~~~~~~l~~lkd~-~---G~~l~~~~~~g~~~~l 455 (543) T protein:vir:81 394 PV-T---------AETFALADVYAVYEQLAA--RHR--RQGAWLANNLIYNKIRQFDTQ-G---GAGLWTTIGNGEPSQL 455 (543) T ss_pred cc-c---------cccccHHHHHHHHHhhhc--ccc--CCcEEEEcHHHHHHHHHhhcC-C---CceeccCcCCCCCccc Confidence 11 1 111236778888887732 221 234799999999998653311 1 1111110111001 Q ss_pred -ceEEEEchhcc---cCCCCcccEEEEEEcCcceeEEeecchhhhcc-ceee-----CCceEEEeeeeeeeeEEEECCeE Q lcl|NC_019522. 235 -DITFEDDILLK---GAGVAGADRMAVYKKEIRIVKGHDVMPLRFLA-PATA-----DNVNFKVPAILRTGGTEWRIPKA 304 (311) Q Consensus 235 -~l~i~~~~~l~---~ag~~g~~~~v~y~~~~~~~~~~~~~~~~~~~-p~~~-----~~~~~~~~~~~~~gGv~i~~P~a 304 (311) .+.++.+..+. ..+.+..+..++|-+- ..+.+.....+++.. |.-. ....+.+..+.++ |+.+++|.| T Consensus 456 ~G~pv~~~~~~~~~~~~~~~~~~~~i~~gd~-~~~~i~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~r~-d~~v~~~~A 533 (543) T protein:vir:81 456 LGRPVGEAEAMDANWNTSASADNFVLLYGNF-QNYVIADRIGMTVEFIPHLFGTNRRPNGSRGWFAYYRM-GADVVNPNA 533 (543) T ss_pred cceeeEEeccccccccccccCCcceEEEeec-cceeEEeecccEEEEeccccccchhhcCceEEEEEEee-ccEeecccc Confidence 12233333321 1111112223434332 233222222322210 1000 0112344556676 456788999 Q ss_pred EEEeecC Q lcl|NC_019522. 305 GHYVDGV 311 (311) Q Consensus 305 i~~~dGI 311 (311) |+.+.-- T Consensus 534 ~~~l~~~ 540 (543) T protein:vir:81 534 FRLLNVE 540 (543) T ss_pred eEEEEec Confidence 9988877 No 57 >protein:vir:10364 Length: 390 # NCBI annotation: head protein; major capsid subunit precursor # Family: family:all:585 # MgeID: mge:183 # MgeName: Xp10 # Cross-refs: genbank:acc:NP_858956;genbank:gi:32128421;genbank:GeneID:2648357 Probab=98.37 E-value=2.4e-07 Score=56.83 Aligned_cols=270 Identities=11% Similarity=-0.012 Sum_probs=151.7 Q ss_pred CCcccccccchhhhhhhHHHHHHHHHHHHhhhhhhhhhhhhccccCCCCcceeEEEEEEeec-ccceEEecCccccccee Q lcl|NC_019522. 1 MAKSVFDVSPVSALSFLVNQAAHIESEIYRIEYPQFKYGTLLPLDNSAPDWAQAVMFRSIDA-RGELQLFGPNSTDVPTV 79 (311) Q Consensus 1 ~~~~~~~~~~~~~~~fl~~~L~~id~~v~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~-~G~a~~~~~~a~dip~v 79 (311) .....+...+.++++++.. .+-+.|++........+.++.+.+. ...++.|...+. .+.+.|++.+ ..+|.. T Consensus 109 ~~~~~~~~~~~~~g~~~~~---~~~~~ii~~~~~~~~l~~~~~~~~~---~~~~~~~~~~~~~~~~a~~v~Eg-~~~~~~ 181 (390) T protein:vir:10 109 ALNTASTDAAGSAGALTTP---NRLPGFITQPDARLTVRDLIGSGRT---DSALIEYVQETGFVNNAAIVAEG-ALKPES 181 (390) T ss_pred HHHhhhcccccccccccch---hHHHHHHHHHHhhchhhhhcceeec---cCCceEEEEEecCCcceeeecCC-cccccc Confidence 1111122222333444443 2235677777777677777665432 223455555554 4677887765 458888 Q ss_pred eeeccceeEEEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhhheeeeeccccc--eeeeecCCcceeecc Q lcl|NC_019522. 80 DIAMSQGFKDINTAALGYTYSIEEIGFAMLNNVNLDAERGQAVRDVVEQGLNKIYLLGDKGVG--EGLYTSPNVSVEAAT 157 (311) Q Consensus 80 ~~~~~~~~~~v~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aa~~~~~~~~n~~~~~G~~~~g--~GllN~p~v~~~~~~ 157 (311) +..++......+.++..+.++.+ +.... .+|..--....++++...+|+.+++|+...+ .|++|.+++...+.. T Consensus 182 ~~~~~~i~~~~~k~~~~~~is~e-ll~d~---~~l~~~i~~~l~~~~~~~~~~~il~G~G~~~~p~Gi~~~~~~~~~~~~ 257 (390) T protein:vir:10 182 SLKFAKKTDTTHVIAHTMKATRQ-ILSDA---PQLASYMNNRLIRGLKVKEDAEILRGTGANDGLLGLIPQATTYAAPTT 257 (390) T ss_pred ccceeEEEEeeEEEEEeehhhHH-HHHhH---HHHHHHHHHHHHHHHHHHHHHHHhhcCCCCcccccccccccccccccc Confidence 99999999999999999999864 54422 2688888889999999999999999986543 599998876543322 Q ss_pred CCccccCcccccCCHHHHHHHHHHHHHHHHhccCCceecceEEEeCHHHHHHHhcccccCCCCCcchHHHHHHHhCC--- Q lcl|NC_019522. 158 STFVALVAAIPTNGTQPIIDFFGNAYNTVYLDNTLTVHRPNTFVLPPAQFQLLARTLLSTQNASNVTLLQFLRTNFP--- 234 (311) Q Consensus 158 ~~~~~~~t~w~~~t~~ei~~di~~~~~~~~~~~~~~~~~p~~l~lpp~~~~~L~~~~~~~~~~~~~Tvl~~l~~n~~--- 234 (311) .. ....++++.+++..+... + ..+..++|+|+.|..|.+.. +..|.-++.--....+ T Consensus 258 ~~------------~~~~~~~~~~~~~~l~~~--~--~~~~~~v~n~~~~~~L~~lk----d~~g~~l~~~~~~~~~~~l 317 (390) T protein:vir:10 258 IA------------GATRVDQLRLAMLQASLA--E--YPASGIVINPIDWAAIELAK----DANNQYLIGNARGTLTPTL 317 (390) T ss_pred cc------------ccchHHHHHHHHHhhccc--c--CCCCEEEEcHHHHHHHHHhh----cCCCceeecCCcCcCCcee Confidence 11 112356777777777432 2 24568999999999887532 1112222111001111 Q ss_pred -ceEEEEchhcccCCCCcccEEEEEEcCcceeEEeecchhhhc-----cceeeCCceEEEeeeeeeeeEEEECCeEEEEe Q lcl|NC_019522. 235 -DITFEDDILLKGAGVAGADRMAVYKKEIRIVKGHDVMPLRFL-----APATADNVNFKVPAILRTGGTEWRIPKAGHYV 308 (311) Q Consensus 235 -~l~i~~~~~l~~ag~~g~~~~v~y~~~~~~~~~~~~~~~~~~-----~p~~~~~~~~~~~~~~~~gGv~i~~P~ai~~~ 308 (311) .+.++..+.+.. +..+ |-+-.+.+.+.....++.. ...+. + ...+.+..|+ ++.+++|.||+++ T Consensus 318 ~G~pv~~~~~~p~------~~~~-~gdf~~~~~~~~~~~~~i~~~~~~~~~~~-~-~~~~r~~~r~-d~~v~~~~a~~~~ 387 (390) T protein:vir:10 318 WGLPVVATQAMAP------GEFL-VGAFDLAAQIFDQWDARVEIGYVNDDFQR-N-MVTVLAEERL-ALVVYRPEALISG 387 (390) T ss_pred cceeeEEcCCCCC------CcEE-EEeccceEEEEEecceEEEEeeccccccc-C-cEEEEEEEee-ccEEeccccEEEE Confidence 122333333221 1112 2111122222111222111 11221 2 2455567787 5689999999876 Q ss_pred ecC Q lcl|NC_019522. 309 DGV 311 (311) Q Consensus 309 dGI 311 (311) +== T Consensus 388 ~~a 390 (390) T protein:vir:10 388 SFA 390 (390) T ss_pred EeC Confidence 533 No 58 >protein:vir:4226 Length: 326 # NCBI annotation: observed 35.2Kd protein # Family: family:all:507 # MgeID: mge:89 # MgeName: L5 # Cross-refs: genbank:acc:NP_039681;swissprot:sw:q05223;genbank:gi:9625447;uniprot:Q05223;genbank:GeneID:2942929 Probab=98.37 E-value=1.9e-07 Score=57.47 Aligned_cols=284 Identities=10% Similarity=-0.084 Sum_probs=150.1 Q ss_pred CCcccccccchhhhhhhHHHHHHHHHHHHhhhhhhhhhhhhccccCCCCcceeEEEEEEeecccceEEecCcccccceee Q lcl|NC_019522. 1 MAKSVFDVSPVSALSFLVNQAAHIESEIYRIEYPQFKYGTLLPLDNSAPDWAQAVMFRSIDARGELQLFGPNSTDVPTVD 80 (311) Q Consensus 1 ~~~~~~~~~~~~~~~fl~~~L~~id~~v~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~G~a~~~~~~a~dip~v~ 80 (311) =...++.+.+.++.+.+- +.+-.+|++...+.-..+++..+. +.......+.+.+..+.+.+++. ...+|..+ T Consensus 15 ~e~~a~~~~~~~~g~~ip---~~~~~~ii~~~~~~s~i~~~~~~~---~~~~~~~~~p~~~~~~~a~~v~E-g~~~~~~~ 87 (326) T protein:vir:42 15 NDPKVAQTGDSMFEGYLE---PEQAQDYFAEAEKISIVQQFAQKI---PMGTTGQKIPHWTGDVSASWIGE-GDMKPITK 87 (326) T ss_pred chhhheeccccCCcceec---hhhHHHHHHHHHhcchhhhhccee---eccCCceEEEEEeCCcceEEecC-Cccccccc Confidence 011222222233333344 334466777777766666665543 33334566777777788888865 56789999 Q ss_pred eeccceeEEEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhhheeeeeccccc-eeeeecCCcceeeccCC Q lcl|NC_019522. 81 IAMSQGFKDINTAALGYTYSIEEIGFAMLNNVNLDAERGQAVRDVVEQGLNKIYLLGDKGVG-EGLYTSPNVSVEAATST 159 (311) Q Consensus 81 ~~~~~~~~~v~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aa~~~~~~~~n~~~~~G~~~~g-~GllN~p~v~~~~~~~~ 159 (311) ..+++.....+.++..+.++.+=++ ....++...-.....+++.+.+++-+|+|+.... .|++|.+......... T Consensus 88 ~~f~~i~~~~~k~~~~v~iS~ell~---~s~~~~~~~i~~~l~~a~~~~~d~a~l~G~gs~~p~gi~~~~~~~~~~~~~- 163 (326) T protein:vir:42 88 GNMTSQTIAPHKIATIFVASAETVR---ANPANYLGTMRTKVATAFAMAFDNAAINGTDSPFPTFLAQTTKEVSLVDPD- 163 (326) T ss_pred cceeEEEEeeEEEEEeehhhHHHHh---cCHHHHHHHHHHHHHHHHHHHHHHHhhcccCCCccccccccccccceeecc- Confidence 9999999999999999999864333 3346788888999999999999999999987644 5888776542222111 Q ss_pred ccccCcccccCCHHHHHHHHHHHHHHHHhccCCceecceEEEeCHHHHHHHhcccccCCCCCcchHHHHHHHhC-----C Q lcl|NC_019522. 160 FVALVAAIPTNGTQPIIDFFGNAYNTVYLDNTLTVHRPNTFVLPPAQFQLLARTLLSTQNASNVTLLQFLRTNF-----P 234 (311) Q Consensus 160 ~~~~~t~w~~~t~~ei~~di~~~~~~~~~~~~~~~~~p~~l~lpp~~~~~L~~~~~~~~~~~~~Tvl~~l~~n~-----~ 234 (311) .. ......+..++ ++..++..+. .. ......++|.|+.+..|.+-. +..+.-+..--..++ + T Consensus 164 ~~---~~~~~~~~~~~--~~~~~~~~~~--~~--~~~~a~~v~n~~~~~~L~~lk----d~~G~~l~~~~~~~~~~~~~~ 230 (326) T protein:vir:42 164 GT---GSNADLTVYDA--VAVNALSLLV--NA--GKKWTHTLLDDITEPILNGAK----DKSGRPLFIESTYTEENSPFR 230 (326) T ss_pred cc---cccccchhHHH--HHHHHHhhhh--hh--ccCccEEEEeHHHHHHHHHhh----ccCCceeeccccccCcccccc Confidence 11 11112222221 1222333321 11 123457899999999986532 111111111000000 1 Q ss_pred ceEEEEchhcccCC-CCcccEEE------EEEcCcceeEEeecchhhh--ccc--------eeeCCceEEEeeeeeeeeE Q lcl|NC_019522. 235 DITFEDDILLKGAG-VAGADRMA------VYKKEIRIVKGHDVMPLRF--LAP--------ATADNVNFKVPAILRTGGT 297 (311) Q Consensus 235 ~l~i~~~~~l~~ag-~~g~~~~v------~y~~~~~~~~~~~~~~~~~--~~p--------~~~~~~~~~~~~~~~~gGv 297 (311) ..++...|-..... ..++..++ +|-...+.+.+.+-.+... ..+ .+. + ...+.+..++ ++ T Consensus 231 ~~~l~G~pv~~~~~~~~~~~~~~~Gd~s~~~~~~~~~~~v~~~~e~~~~~~~~~~~~~~~~~~~-d-~~~~r~~~~~-d~ 307 (326) T protein:vir:42 231 LGRIVARPTILSDHVASGTVVGYQGDFRQLVWGQVGGLSFDVTDQATLNLGTPQAPNFVSLWQH-N-LVAVRVEAEY-AF 307 (326) T ss_pred CceeeeeeEEEcCCCCCCceEEEEeecceEEEEEecceEEEEeecceeeecccccccchhhhhc-C-cEEEEEEEEe-cc Confidence 11222222211100 01111111 0111122223222111110 000 111 1 2456677777 67 Q ss_pred EEECCeEEEEeecC Q lcl|NC_019522. 298 EWRIPKAGHYVDGV 311 (311) Q Consensus 298 ~i~~P~ai~~~dGI 311 (311) .+.+|.||+++.++ T Consensus 308 ~v~~~~a~~~l~~~ 321 (326) T protein:vir:42 308 HCNDKDAFVKLTNV 321 (326) T ss_pred EEecccceEEEeec Confidence 88999999999999 No 59 >protein:vir:2430 Length: 318 # NCBI annotation: major head subunit # Family: family:all:507 # MgeID: mge:52 # MgeName: D29 # Cross-refs: genbank:acc:NP_046832;genbank:gi:9630400;genbank:GeneID:1261582 Probab=98.30 E-value=2.3e-07 Score=57.02 Aligned_cols=279 Identities=10% Similarity=-0.068 Sum_probs=149.3 Q ss_pred CCc-cccccc--------chhhhhhhHHHHHHHHHHHHhhhhhhhhhhhhccccCCCCcceeEEEEEEeecccceEEecC Q lcl|NC_019522. 1 MAK-SVFDVS--------PVSALSFLVNQAAHIESEIYRIEYPQFKYGTLLPLDNSAPDWAQAVMFRSIDARGELQLFGP 71 (311) Q Consensus 1 ~~~-~~~~~~--------~~~~~~fl~~~L~~id~~v~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~G~a~~~~~ 71 (311) |+. ..|+.. +.++.+++- +.+..+|++...+.-..++++.+.. .......+.+....+.+.|++. T Consensus 1 ~~~~~~~~~e~~~~~~~~~~~~~~~ip---~~~~~~ii~~~~~~~~l~~~~~~~~---~~~~~~~ip~~~~~~~a~~v~E 74 (318) T protein:vir:24 1 MAAGTAFAVDHAQIAQTGDTMFKGYLE---PEQAKDYFAEAEKTSIVQQFAQKVP---MGTTGQKIPHWVGDVSAQWIGE 74 (318) T ss_pred CCCCCCCCHHHHHhhcccCcccceeec---hhHHHHHHHHHHhhchhhhhcceee---ccCCceEEEEEeCCcceEEecC Confidence 211 222211 112222232 3455677777777666666665432 2233466667777788889876 Q ss_pred cccccceeeeeccceeEEEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhhheeeeecccc-ceeeeecCC Q lcl|NC_019522. 72 NSTDVPTVDIAMSQGFKDINTAALGYTYSIEEIGFAMLNNVNLDAERGQAVRDVVEQGLNKIYLLGDKGV-GEGLYTSPN 150 (311) Q Consensus 72 ~a~dip~v~~~~~~~~~~v~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aa~~~~~~~~n~~~~~G~~~~-g~GllN~p~ 150 (311) + ..+|..+..+++.....+.++....++.+=|+ ....++...-.....+++.+.+|+-+++|+... ..|+++... T Consensus 75 g-~~~~~~~~~f~~i~~~~~k~~~~~~iS~e~l~---ds~~~~~~~i~~~l~~~~~~~~d~a~l~G~g~~~~~~~~~~~~ 150 (318) T protein:vir:24 75 G-DMKPITKGNMTSQTIAPHKIATIFVASAETVR---ANPANYLGTMRTKVATAFAMAFDGAAMHGTDSPFPTYIGQTTK 150 (318) T ss_pred C-ccccccccceeEEEEeeEEEEEeehhhHHHhh---cChHHHHHHHHHHHHHHHHHHHHHhhhcccCCCCCcccccccc Confidence 4 66898888899999999999999998864333 234678888999999999999999999998643 256665432 Q ss_pred cceeeccCCccccCcccccCCHHHHHHHHHHHHHHHHhccCCceecceEEEeCHHHHHHHhcccccCCCCCcchHHHHHH Q lcl|NC_019522. 151 VSVEAATSTFVALVAAIPTNGTQPIIDFFGNAYNTVYLDNTLTVHRPNTFVLPPAQFQLLARTLLSTQNASNVTLLQFLR 230 (311) Q Consensus 151 v~~~~~~~~~~~~~t~w~~~t~~ei~~di~~~~~~~~~~~~~~~~~p~~l~lpp~~~~~L~~~~~~~~~~~~~Tvl~~l~ 230 (311) . .+.... .+.+. .. ..++.+++..+.. .+ ..+..++|+|+.+..|.+.. +..+..++.-.. T Consensus 151 ~--~~~~~~--~~~~~----~~---~~~~~~~~~~~~~--~~--~~~~~~v~n~~~~~~L~~lk----d~~G~~l~~~~~ 211 (318) T protein:vir:24 151 A--ISIADT--TGATT----VY---DQVAVNGLSLLVN--DG--KKWTHTLLDDITEPILNGAK----DQNGRPLFIEST 211 (318) T ss_pred c--cccccc--ccccc----hH---HHHHHHHHHhhcc--cc--CCCCEEEEcHHHHHHHHHhh----ccCCceeecCcc Confidence 1 111111 00110 11 2344555555432 12 23568999999999996532 111222211100 Q ss_pred -HhCC----ceEEEEchhccc-CCCCcccEEEEEEc------CcceeEEeecchhhhcc-------c---eeeCCceEEE Q lcl|NC_019522. 231 -TNFP----DITFEDDILLKG-AGVAGADRMAVYKK------EIRIVKGHDVMPLRFLA-------P---ATADNVNFKV 288 (311) Q Consensus 231 -~n~~----~l~i~~~~~l~~-ag~~g~~~~v~y~~------~~~~~~~~~~~~~~~~~-------p---~~~~~~~~~~ 288 (311) ...+ ..++...|-... .-..|+..++.-+- ..+.+.+++.....+.. | .+. + ...+ T Consensus 212 ~~~~~~~~~~~~i~g~pv~~~~~~~~~~~~~~~gdfs~~~~~~~~~l~i~~~~~~~~~~~~~~~~~~~~~f~~-~-~~~~ 289 (318) T protein:vir:24 212 YGEAASPFRSGRIVARPTILSDHVVEGTTVGFMGDFSQLIWGQIGGLSFDVTDQATLNLGTVESPNFVSLWQH-N-LVAV 289 (318) T ss_pred ccCccccccCceEEEEeeEEeCCCCCCccEEEEeecceEEEEEecCeEEEEeeccceeccccccccchhhhhc-C-cEEE Confidence 0011 112333332211 11123332221111 11122222211111000 0 111 1 2556 Q ss_pred eeeeeeeeEEEECCeEEEEeecC Q lcl|NC_019522. 289 PAILRTGGTEWRIPKAGHYVDGV 311 (311) Q Consensus 289 ~~~~~~gGv~i~~P~ai~~~dGI 311 (311) .+..|+ ++.+.+|.||+++.++ T Consensus 290 r~~~r~-d~~v~~~~a~~~i~~~ 311 (318) T protein:vir:24 290 RVEAEY-AFHCNDAEAFVALTNV 311 (318) T ss_pred EEEEEE-ccEEecccceEEEEee Confidence 777887 5778999999999999 No 60 >protein:vir:78523 Length: 338 # NCBI annotation: Putative head structural protein # Family: family:all:507 # MgeID: mge:1853 # MgeName: U2 # Cross-refs: genbank:acc:YP_001491585;genbank:gi:157786408;genbank:GeneID:5625675 Probab=98.25 E-value=5e-07 Score=55.12 Aligned_cols=287 Identities=14% Similarity=0.027 Sum_probs=152.2 Q ss_pred CCcccccccchhh----hhhhHHHHHHHHHHHHhhhhhhhhhhhhccccCCCCcceeEEEEEEeec--------ccceEE Q lcl|NC_019522. 1 MAKSVFDVSPVSA----LSFLVNQAAHIESEIYRIEYPQFKYGTLLPLDNSAPDWAQAVMFRSIDA--------RGELQL 68 (311) Q Consensus 1 ~~~~~~~~~~~~~----~~fl~~~L~~id~~v~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~--------~G~a~~ 68 (311) +..+++..+...+ .+-+.. +.+-.+|++...+.-..++++++.. .+.+ ...+.+... .+.+.+ T Consensus 7 ~~~~~~~~~~~~~~~~~~~~liP--~~~~~~ii~~~~~~s~l~~l~~~~~-~~~~--~~~ip~~~~~~~a~~v~~~~~~~ 81 (338) T protein:vir:78 7 LAPNTAGSNHQGRLAHVPSDLLP--KEIVGPIFDKAQESSLVLRLGENIP-ISYG--ETIIPTTVKRPEVGQVGVGTSNE 81 (338) T ss_pred hhhhhcccccccceecccccccc--hHHHHHHHHHHHhhchhhhhcceee-ccCC--ceEEEEEecCccceeeccccccc Confidence 3333333222111 111222 4456778888888878888877643 2222 333433332 234445 Q ss_pred ecCcccccceeeeeccceeEEEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhhheeeeecccc---c-ee Q lcl|NC_019522. 69 FGPNSTDVPTVDIAMSQGFKDINTAALGYTYSIEEIGFAMLNNVNLDAERGQAVRDVVEQGLNKIYLLGDKGV---G-EG 144 (311) Q Consensus 69 ~~~~a~dip~v~~~~~~~~~~v~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aa~~~~~~~~n~~~~~G~~~~---g-~G 144 (311) .+.+ ..+|..+..++......+.++....++.+ +.. ....++...-.....+++.+.+|+-+++|+... + .| T Consensus 82 ~~Eg-~~~~~~~~~f~~v~l~~~k~~~~~~is~e-ll~--ds~~~~~~~i~~~la~a~~~~~d~~~l~G~g~~~~~~~~g 157 (338) T protein:vir:78 82 QREG-GTKPLSGTAWDTRSVAPIKLATIVTVSEE-FAR--MNPSGLYTKLQADLAYAIGRGIDLAVFHGKSPLTGSALQG 157 (338) T ss_pred cccc-ccccccccceeEEEEEEEEEEEeehhhHH-HHh--cCHHHHHHHHHHHHHHHHHHHHHHHhhcccCCCccccccc Confidence 4443 55788888888999999999988888854 332 234678888889999999999999999998753 2 47 Q ss_pred eeecCCcceeeccCCccccCcccccCCHHHHHHHHHHHHHHHHhccCCceecceEEEeCHHHHHHHhcccccCCCCCcch Q lcl|NC_019522. 145 LYTSPNVSVEAATSTFVALVAAIPTNGTQPIIDFFGNAYNTVYLDNTLTVHRPNTFVLPPAQFQLLARTLLSTQNASNVT 224 (311) Q Consensus 145 llN~p~v~~~~~~~~~~~~~t~w~~~t~~ei~~di~~~~~~~~~~~~~~~~~p~~l~lpp~~~~~L~~~~~~~~~~~~~T 224 (311) ++++..+...+.... ........++++.+++..+..... ..+..++|+|+.+..|.+...- .+..+.- T Consensus 158 i~~~~~~~~~~~~~~--------~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~m~~~~~~~L~~~~~l-~d~~g~~ 225 (338) T protein:vir:78 158 IDTNNVIVNTTNVDY--------LQTGTTPLLDRFLDGYDLVSANTD---VDFNGWAADPRYRARLLRSQAY-RDANGNV 225 (338) T ss_pred ccccccccccccccc--------ccccchhhHHHHHHHHHHhhhhcc---ccceEEEEchHHHHHHHHHhhh-ccCCCce Confidence 776655433222111 112234557888888877743221 2356799999988877542210 1111121 Q ss_pred HHHHHHHhCCc-----eEEEEchhcc---cCCCCcccEEEEEEcCcceeEEeecchhhhc-----------cc-eeeCCc Q lcl|NC_019522. 225 LLQFLRTNFPD-----ITFEDDILLK---GAGVAGADRMAVYKKEIRIVKGHDVMPLRFL-----------AP-ATADNV 284 (311) Q Consensus 225 vl~~l~~n~~~-----l~i~~~~~l~---~ag~~g~~~~v~y~~~~~~~~~~~~~~~~~~-----------~p-~~~~~~ 284 (311) ++.-....... +-++....+. ++..+ ++..+++.+-.+ +.+....+++.. .| .+..++ T Consensus 226 l~~~~~~~~~~~~l~G~PV~~~~~ip~~~~~~~~-~~~~~~~gdfs~-~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~ 303 (338) T protein:vir:78 226 DPTRINLAASAGDLLGLPVQFGKAVGGDLGAATD-SKVRVVGGDFSQ-LKYGFADEIRVKMSDTATLTDNTSPTPQTVSM 303 (338) T ss_pred eecccccCCCCceeeeeeEEEccccCccccccCC-cccEEEEEecce-EEEEeecccEEEEeecccccccccccccchhh Confidence 11111111111 1222222222 12222 222222222111 112111122110 00 001111 Q ss_pred ----eEEEeeeeeeeeEEEECCeEEEEeecC Q lcl|NC_019522. 285 ----NFKVPAILRTGGTEWRIPKAGHYVDGV 311 (311) Q Consensus 285 ----~~~~~~~~~~gGv~i~~P~ai~~~dGI 311 (311) ...+.++.|+ |..+.+|.|++++... T Consensus 304 ~~~~~~~~r~~~r~-d~~v~~~~a~~~l~~~ 333 (338) T protein:vir:78 304 WQTNQIAILIEVTF-GWLLGDKQAFVKFVDD 333 (338) T ss_pred hhcCcEEEEEEEEe-ccEeecccceEEEecc Confidence 1445667777 6789999999999999 No 61 >protein:vir:81070 Length: 390 # NCBI annotation: p09 # Family: family:all:585 # MgeID: mge:1889 # MgeName: Xop411 # Cross-refs: genbank:acc:YP_001285679;genbank:gi:148727187;genbank:GeneID:5247115 Probab=98.22 E-value=6.6e-07 Score=54.48 Aligned_cols=270 Identities=12% Similarity=0.010 Sum_probs=150.5 Q ss_pred CCcccccccchhhhhhhHHHHHHHHHHHHhhhhhhhhhhhhccccCCCCcceeEEEEEEeec-ccceEEecCccccccee Q lcl|NC_019522. 1 MAKSVFDVSPVSALSFLVNQAAHIESEIYRIEYPQFKYGTLLPLDNSAPDWAQAVMFRSIDA-RGELQLFGPNSTDVPTV 79 (311) Q Consensus 1 ~~~~~~~~~~~~~~~fl~~~L~~id~~v~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~-~G~a~~~~~~a~dip~v 79 (311) .....+-..+.++++++.. .+.+.|++........+.++.+... ....+.+..... .+.+.|++.+ ..+|.. T Consensus 109 ~~~~~~~~~~~~~g~~~~~---~~~~~ii~~~~~~~~l~~~~~~~~~---~~~~~~~~~~~~~~~~a~~v~Eg-~~~~~~ 181 (390) T protein:vir:81 109 ALNTASTDAAGSAGALTTP---NRLPGFITPPDARLTVRDLIGSGRT---DSALIEYVQETGFVNNAAIVAEG-ALKPES 181 (390) T ss_pred HHHhhccccccCCcceech---hhhHHHHHHHhhhhhhhhhcceeec---cCCceEEEEEecCCcceeeecCC-cccccc Confidence 1111111122333444443 2345677777777777777665432 223345555443 4678888765 558999 Q ss_pred eeeccceeEEEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhhheeeeeccccc--eeeeecCCcceeecc Q lcl|NC_019522. 80 DIAMSQGFKDINTAALGYTYSIEEIGFAMLNNVNLDAERGQAVRDVVEQGLNKIYLLGDKGVG--EGLYTSPNVSVEAAT 157 (311) Q Consensus 80 ~~~~~~~~~~v~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aa~~~~~~~~n~~~~~G~~~~g--~GllN~p~v~~~~~~ 157 (311) +..++.....++.++..+.++.+ +..-. .++..--....++++.+.+|+.+++|+...+ .|++|.+++...+.. T Consensus 182 ~~~~~~i~~~~~k~~~~~~is~e-ll~d~---~~~~~~i~~~l~~~~~~~~d~a~l~G~g~~~~~~Gi~~~~~~~~~~~~ 257 (390) T protein:vir:81 182 SLKFAKKTDTTHVIAHTMKATRQ-ILSDA---PQLASYMNNRLIRGLKVKEDAEILRGTGANDGLLGLIPQATTYAAPTT 257 (390) T ss_pred cceeeEEEEeeeEEEEeehhhHH-HHHhH---HHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCcccceeecccccccccc Confidence 99999999999999999999864 54422 2588888888999999999999999986543 599988775433222 Q ss_pred CCccccCcccccCCHHHHHHHHHHHHHHHHhccCCceecceEEEeCHHHHHHHhcccccCCCCCcchHHHHHHHhC-C-- Q lcl|NC_019522. 158 STFVALVAAIPTNGTQPIIDFFGNAYNTVYLDNTLTVHRPNTFVLPPAQFQLLARTLLSTQNASNVTLLQFLRTNF-P-- 234 (311) Q Consensus 158 ~~~~~~~t~w~~~t~~ei~~di~~~~~~~~~~~~~~~~~p~~l~lpp~~~~~L~~~~~~~~~~~~~Tvl~~l~~n~-~-- 234 (311) . +....++||.+++.++... + ..+..++|+|+.+..|.+-. +..|.-++.-..... + T Consensus 258 ~------------~~~~~~~~~~~~~~~~~~~--~--~~~~~~v~~~~~~~~l~~lk----d~~G~~l~~~~~~~~~~~l 317 (390) T protein:vir:81 258 I------------AGATRVDQLRLAMLQASLA--E--YNPSGIVINPIDWAAIELAK----DANNQYLIGNARGTLTPTL 317 (390) T ss_pred c------------ccchhHHHHHHHHHhhccc--c--CCCCEEEEcHHHHHHHHHhh----cCCCceeecCcccccCcee Confidence 1 1112256788888877432 2 34568999999999886532 111222211101111 1 Q ss_pred -ceEEEEchhcccCCCCcccEEEEEEcCcceeEEeecchhhhcc-----ceeeCCceEEEeeeeeeeeEEEECCeEEEEe Q lcl|NC_019522. 235 -DITFEDDILLKGAGVAGADRMAVYKKEIRIVKGHDVMPLRFLA-----PATADNVNFKVPAILRTGGTEWRIPKAGHYV 308 (311) Q Consensus 235 -~l~i~~~~~l~~ag~~g~~~~v~y~~~~~~~~~~~~~~~~~~~-----p~~~~~~~~~~~~~~~~gGv~i~~P~ai~~~ 308 (311) .+.++.++.+.. +.+++.+- .+.+.+.....+++.. -.+. + ...+.+..|+ +..++.|.|++.+ T Consensus 318 ~G~pv~~~~~~p~------~~~~~gd~-~~~~~~~~~~~~~v~~~~~~~~~~~-~-~v~~r~~~r~-d~~v~~~~a~v~~ 387 (390) T protein:vir:81 318 WGLPVVATQAMAP------GEFLVGAF-DLAAQIFDQWDARVEIGYVGEDFQR-N-MITVLAEERL-ALVVYRPEALISG 387 (390) T ss_pred cceeeEEcCCCCC------CcEEEEeh-hceEEEEEecceEEEEecccchhhc-C-cEEEEEEEee-ccEEecccceEEE Confidence 122333333321 11221111 1112111111221110 0121 2 2345567787 5688999999876 Q ss_pred ecC Q lcl|NC_019522. 309 DGV 311 (311) Q Consensus 309 dGI 311 (311) +== T Consensus 388 t~a 390 (390) T protein:vir:81 388 SFA 390 (390) T ss_pred EeC Confidence 522 No 62 >protein:vir:6212 Length: 434 # NCBI annotation: prohead protease # Family: family:all:21 # MgeID: mge:128 # MgeName: phBC6A52 # Cross-refs: genbank:acc:NP_852592;genbank:gi:31415852;genbank:GeneID:1489210 Probab=98.21 E-value=2.1e-07 Score=57.23 Aligned_cols=277 Identities=10% Similarity=0.044 Sum_probs=156.0 Q ss_pred CCc-------ccccccchhhhhhhHHHHHHHHHHHHhhhhhhhhhhhhccccCCCCcceeEEEEEEeecccceEEec--C Q lcl|NC_019522. 1 MAK-------SVFDVSPVSALSFLVNQAAHIESEIYRIEYPQFKYGTLLPLDNSAPDWAQAVMFRSIDARGELQLFG--P 71 (311) Q Consensus 1 ~~~-------~~~~~~~~~~~~fl~~~L~~id~~v~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~G~a~~~~--~ 71 (311) |.. .++... .+.++|++. +.+.+.|++...+....+.+..+.... ..+.|.+....+.+.+.. . T Consensus 131 l~~~~~~~e~~a~~~~-t~~GG~lvP--~~~~~~Ii~~l~~~~~i~~~~~~~~~~----~~~~~p~~~~~~~a~~~~~~~ 203 (434) T protein:vir:62 131 IVGNIDEKEARALGLV-TGNGSVTIP--DFLSKEIITYAQEENFLRRLGTGVKTK----ENIKYPVLVKKAEAQGHKNER 203 (434) T ss_pred hccccchhhhhhhccc-ccccceecc--hhhHHHHHHhhhhhhhhhhhcceeccC----CceEEEEEecCCcccceeccc Confidence 111 111111 133567776 457788888877776676666543221 235566665555555543 3 Q ss_pred cccccceeeeeccceeEEEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhhheeeeeccccc--eeeeecC Q lcl|NC_019522. 72 NSTDVPTVDIAMSQGFKDINTAALGYTYSIEEIGFAMLNNVNLDAERGQAVRDVVEQGLNKIYLLGDKGVG--EGLYTSP 149 (311) Q Consensus 72 ~a~dip~v~~~~~~~~~~v~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aa~~~~~~~~n~~~~~G~~~~g--~GllN~p 149 (311) ...++|..+..++......+.++.-+.+|.+ +. ..+..+|.+--....++++...+++.+++|+...+ -|+++.+ T Consensus 204 e~~~~~~~~~~f~~v~~~~~k~~~~~~iS~e-ll--~ds~~~l~~~i~~~la~~~~~~~d~~~l~G~G~~~~~~g~~~~~ 280 (434) T protein:vir:62 204 TNNEMPETDIEFDEIELSPTEFDALATVTKK-LL--ARTGLPIEQIVMDELKKAYVRKETQYMVNGDEANNINDGALAKK 280 (434) T ss_pred ccccccccccceeeEEeeheeeEeehhhHHH-HH--hcchHHHHHHHHHHHHHHHHHHHHHHHhccCCCCccccceeecc Confidence 3467888888899999999999998888854 32 23467888889999999999999999999997655 3788776 Q ss_pred CcceeeccCCccccCcccccCCHHHHHHHHHHHHHHHHhccCCceecceEEEeCHHHHHHHhcccccCCCCCcchHHH-H Q lcl|NC_019522. 150 NVSVEAATSTFVALVAAIPTNGTQPIIDFFGNAYNTVYLDNTLTVHRPNTFVLPPAQFQLLARTLLSTQNASNVTLLQ-F 228 (311) Q Consensus 150 ~v~~~~~~~~~~~~~t~w~~~t~~ei~~di~~~~~~~~~~~~~~~~~p~~l~lpp~~~~~L~~~~~~~~~~~~~Tvl~-~ 228 (311) +++..+. ....++||.+++.++... +. ..-.++|.|+.+..|.+-. . ..|.-++. . T Consensus 281 ~~~~~~~---------------~~~~~d~l~~l~~~l~~~--~~--~~a~~v~n~~~~~~L~~lk-d---~~G~~l~~~~ 337 (434) T protein:vir:62 281 AVEFKTD---------------EKNLYDALVKMKNTPVKE--VR--KKARWVLNTAALTKIETMK-T---DDGFPLLRPF 337 (434) T ss_pred ccccccc---------------ccchhhHHHHHHhhcchh--hh--cCCEEEEcHHHHHHHHHhh-c---cCCCEeeccC Confidence 6532211 111256677777776421 21 1225789999999886522 1 11221111 0 Q ss_pred HH-HhCC-----ceEEEEchhcccCCCCcccEEEEEEcCcceeEEe-e-cchhhhccceeeCCceEEEeeeeeeeeEEEE Q lcl|NC_019522. 229 LR-TNFP-----DITFEDDILLKGAGVAGADRMAVYKKEIRIVKGH-D-VMPLRFLAPATADNVNFKVPAILRTGGTEWR 300 (311) Q Consensus 229 l~-~n~~-----~l~i~~~~~l~~ag~~g~~~~v~y~~~~~~~~~~-~-~~~~~~~~p~~~~~~~~~~~~~~~~gGv~i~ 300 (311) .. ..+. ...++....+.. +.+|....++|-+=.+++-.. . ++.+.+..-.-......-+.++.|+.|-.|+ T Consensus 338 ~~~~~g~~~tl~G~pV~~~~~~~~-~~~~~~~~i~~Gdfs~~~i~~~~g~~~i~~~~~~~~~~~~v~~~~~~r~Dgk~i~ 416 (434) T protein:vir:62 338 NQAEGGIGYTLLGFPVEEEDAIDI-PDSPDTPVFYFGDFSKFYIQDVIGSLEVQKLVELFSRTNRVGFRIWNLLDAQLIH 416 (434) T ss_pred CCccCCCCceecceeeEEecCccC-ccCCCceEEEEeeccceEEEEeeceeEEEeehhhhcccCceEEEEEeeecceeec Confidence 00 0011 112333333332 233333444443333332221 1 1112221100011222446778999888899 Q ss_pred CCeEEEEeecC Q lcl|NC_019522. 301 IPKAGHYVDGV 311 (311) Q Consensus 301 ~P~ai~~~dGI 311 (311) .|++++.+.+. T Consensus 417 ~~~~~~~~~~~ 427 (434) T protein:vir:62 417 SPFEVPVYKYV 427 (434) T ss_pred CcccceEEEEE Confidence 99999988666 No 63 >protein:vir:102119 Length: 404 # NCBI annotation: phage major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1641 # MgeName: phiSM101 # Cross-refs: genbank:acc:YP_699941;genbank:gi:110804052;genbank:GeneID:4206662 Probab=98.18 E-value=4e-07 Score=55.63 Aligned_cols=281 Identities=11% Similarity=0.020 Sum_probs=151.7 Q ss_pred CCcccccccchhhhhhhHHHHHHHHHHHHhhhhhhhhhhhhccccCCCCcceeEEEEEEeecccceEEecCcccccce-- Q lcl|NC_019522. 1 MAKSVFDVSPVSALSFLVNQAAHIESEIYRIEYPQFKYGTLLPLDNSAPDWAQAVMFRSIDARGELQLFGPNSTDVPT-- 78 (311) Q Consensus 1 ~~~~~~~~~~~~~~~fl~~~L~~id~~v~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~G~a~~~~~~a~dip~-- 78 (311) -....|.....+.+++++. +.+.++|++........+.++++..- +...-.+.|........+.+++.+.. .|. T Consensus 105 ~e~~a~~~~~~~~gg~~vP--~~~~~~ii~~~~~~~~l~~l~~~~~~-~~~~g~~~~~~~~~~~~~~~v~e~~~-~~~~~ 180 (404) T protein:vir:10 105 KEINAISENIDEDGGYAVP--EDIQTKINTRLKDTTDLYNMVDYEPV-FTRSGSRTYEKRSKQKPMKPLSENQQ-IPTNG 180 (404) T ss_pred hHHhhhccccCCCCceeec--hhHHHHHHHHHhhhhhHhhhhceeec-cCCccceEEEEecCCcceeecccccc-ccccc Confidence 1112232333344556665 45678888887777777777765432 22222344444455556677665533 444 Q ss_pred eeeeccceeEEEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhhheeeeeccccc--eeeeecCCcceeec Q lcl|NC_019522. 79 VDIAMSQGFKDINTAALGYTYSIEEIGFAMLNNVNLDAERGQAVRDVVEQGLNKIYLLGDKGVG--EGLYTSPNVSVEAA 156 (311) Q Consensus 79 v~~~~~~~~~~v~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aa~~~~~~~~n~~~~~G~~~~g--~GllN~p~v~~~~~ 156 (311) .+..++......+.++.-+.+|. |+. ..+..+|..--.....+++...+|+.+++|+..-+ .|+++.+++...+. T Consensus 181 ~~~~f~~i~~~~~k~~~~~~iS~-ell--~ds~~~l~~~i~~~la~~~~~~~~~~il~G~g~~~~~~gi~~~~~~~~~~~ 257 (404) T protein:vir:10 181 DNGKLERFNFKLKDLADFMSIPN-DLL--KFADKSLEDWIINWFVDKVRITRNAEILYGAGGDEHATGIMTANKFKKITL 257 (404) T ss_pred cccceeeeEeeheeeEeeehhhH-HHH--hhcHHHHHHHHHHHHHHHHHHHHHHHHhhcCCCCCcccceeeccccceeec Confidence 34557778888888888888885 333 23445788888899999999999999999987643 48888877654433 Q ss_pred cCCccccCcccccCCHHHHHHHHHHHHHHHHhccCCceecceEEEeCHHHHHHHhcccccCCCCCcchHHH-HHHHhCC- Q lcl|NC_019522. 157 TSTFVALVAAIPTNGTQPIIDFFGNAYNTVYLDNTLTVHRPNTFVLPPAQFQLLARTLLSTQNASNVTLLQ-FLRTNFP- 234 (311) Q Consensus 157 ~~~~~~~~t~w~~~t~~ei~~di~~~~~~~~~~~~~~~~~p~~l~lpp~~~~~L~~~~~~~~~~~~~Tvl~-~l~~n~~- 234 (311) ... .+ ++|+..+++.... ..+ .....++|+|+.+..|.+.. +..|.-++. -+....+ T Consensus 258 ~~~----------~~----~~~~~~~~~~~l~-~~~--~~~~~~v~n~~~~~~L~~lk----d~~G~~l~~~~~~~~~~~ 316 (404) T protein:vir:10 258 PKS----------PA----LKDFKKCKNVELL-NVF--KATSSWIVNQDGFNYLDSLE----DKTGRPYLQPDPKDPTQY 316 (404) T ss_pred ccc----------cc----HHHHHHHHHhhhh-ccc--cCCCEEEEcHHHHHHHHHhh----ccCCceeeccCcCCCCCc Confidence 211 12 4455555553222 222 23346899999999886532 111221111 0011110 Q ss_pred ---ceEEEEchhcccCCCCcccEEEEEEcCcceeEEeecchhhhccceeeC----CceEEEeeeeeeeeEEEECCeEEEE Q lcl|NC_019522. 235 ---DITFEDDILLKGAGVAGADRMAVYKKEIRIVKGHDVMPLRFLAPATAD----NVNFKVPAILRTGGTEWRIPKAGHY 307 (311) Q Consensus 235 ---~l~i~~~~~l~~ag~~g~~~~v~y~~~~~~~~~~~~~~~~~~~p~~~~----~~~~~~~~~~~~gGv~i~~P~ai~~ 307 (311) ...++.++.....+..+... ++|-+-.+.+.+..-..+++....+.. .-...+.++.|+ ++.+.+|.+++. T Consensus 317 ~l~G~PV~~~~~~~~~~~~~~~~-~~~gd~s~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~r~-d~~v~~~~a~~~ 394 (404) T protein:vir:10 317 RFLGLPVIELPNDLLLSTESAIP-VLLGDTKEAYKYVSDGAYELATTNIGAGAFETNTTKARIIMRI-DGNVKDSEALLI 394 (404) T ss_pred cccceeeEEecccccCCCCCccE-EEEEeccccEEEEEecceEEEEeccccchhhcCceEEEEEEee-ccEEecccceEE Confidence 11222222222222333333 444433333333222222221101111 112456667787 578999999998 Q ss_pred eecC Q lcl|NC_019522. 308 VDGV 311 (311) Q Consensus 308 ~dGI 311 (311) ++=- T Consensus 395 ~~~~ 398 (404) T protein:vir:10 395 AEIP 398 (404) T ss_pred EEee Confidence 7766 No 64 >protein:vir:1328 Length: 392 # NCBI annotation: gp36 # Family: family:all:21 # MgeID: mge:28 # MgeName: phi-C31 # Cross-refs: genbank:acc:NP_047927;swissprot:trembl:q9zwv6;genbank:gi:9631145;uniprot:Q9ZWV6;genbank:GeneID:2715889 Probab=98.18 E-value=4.2e-07 Score=55.55 Aligned_cols=281 Identities=13% Similarity=0.042 Sum_probs=141.0 Q ss_pred CCcccccccchhhhhhhHHHHHHHHHHHHhhhhhhhhhhhhccccCCCCcceeEEEEEEeecccceEEecCcccccceee Q lcl|NC_019522. 1 MAKSVFDVSPVSALSFLVNQAAHIESEIYRIEYPQFKYGTLLPLDNSAPDWAQAVMFRSIDARGELQLFGPNSTDVPTVD 80 (311) Q Consensus 1 ~~~~~~~~~~~~~~~fl~~~L~~id~~v~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~G~a~~~~~~a~dip~v~ 80 (311) .........+..+.+++.. +.+.+.|.+........+.+..+.... ....+.+.+.+..+.+.|++.+ ..+|..+ T Consensus 106 ~~~~~~~~t~~~~g~~~~~--~~~~~~i~~~~~~~~~l~~~~~~~~~~--~~~~~~~~~~~~~~~a~~v~E~-~~~~~~~ 180 (392) T protein:vir:13 106 FAPEKRDGTKAGNPNVLSR--TLYGQLIAQAVERSAIMRGGASTFTTS--DANPMDFTVITGRATAGIVGET-AEIPESY 180 (392) T ss_pred hhhhhhcccccCCCccccc--cchHHHHHHHHhhhhhhhhcceeeecC--CCceeEEEEEcCCcceeeeccc-ccccccc Confidence 1111111112222333332 123332333222222233333322111 2234556667777888888765 4588888 Q ss_pred eeccceeEEEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhhheeeeeccccc-eeeeecCCcceeeccCC Q lcl|NC_019522. 81 IAMSQGFKDINTAALGYTYSIEEIGFAMLNNVNLDAERGQAVRDVVEQGLNKIYLLGDKGVG-EGLYTSPNVSVEAATST 159 (311) Q Consensus 81 ~~~~~~~~~v~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aa~~~~~~~~n~~~~~G~~~~g-~GllN~p~v~~~~~~~~ 159 (311) ..++......+.++.-..+|.+=|+ ....++..--....+.++.+.+|.-+++|+.... .|+|+++......... T Consensus 181 ~~f~~v~~~~~k~~~~~~iS~ell~---ds~~~l~~~i~~~l~~~i~~~~d~~~l~G~Gt~~p~Gil~~~~~~~~~~~~- 256 (392) T protein:vir:13 181 PATTQRSMGGFKYGFASVVSYEFAT---DQVLDLVGFLVSDAGPAIGDAMGRHFLTGTGTGQPRGILTDATGANAAFGE- 256 (392) T ss_pred cceeeEEeeeeeEEeeehhHHHHHh---cchHHHHHHHHHHHHHHHHHHHHHHHhcccCCccccccccccccccccccc- Confidence 8899999999999988888855333 3466788888889999999999999999986544 6999887543221111 Q ss_pred ccccCcccccCCHHHHHHHHHHHHHHHHhccCCceecceEEEeCHHHHHHHhcccccCCCCCcchHHHHHHHhCCceEEE Q lcl|NC_019522. 160 FVALVAAIPTNGTQPIIDFFGNAYNTVYLDNTLTVHRPNTFVLPPAQFQLLARTLLSTQNASNVTLLQFLRTNFPDITFE 239 (311) Q Consensus 160 ~~~~~t~w~~~t~~ei~~di~~~~~~~~~~~~~~~~~p~~l~lpp~~~~~L~~~~~~~~~~~~~Tvl~~l~~n~~~l~i~ 239 (311) ... ...+ +++|.+++..+... + ..+-.++++|+.+..|.+-. +..|.-++.-=...+.+-++. T Consensus 257 ~~~-----~~~~----~d~l~~~~~~l~~~--~--~~~a~~v~n~~~~~~l~~lk----d~~G~~l~~~~~~~g~~~~l~ 319 (392) T protein:vir:13 257 ADA-----DSKV----SDALIDLFHEVPSA--Y--RKNAKFVVNDLRAAQMRKLK----DANGQYLWQSALTVGAPDTFN 319 (392) T ss_pred ccc-----cccc----HHHHHHHHHhhhhh--h--hcCCEEEEcHHHHHHHHHhh----ccCCceeecCCcCCCCCceec Confidence 000 1122 45566666665321 1 12336899999999886522 221221111000001111222 Q ss_pred EchhcccCCCCcccEEEEEEcCcceeEEeecchhhhcccee--eCCceEEEeeeeeeeeEEEECCeEEEEeecC Q lcl|NC_019522. 240 DDILLKGAGVAGADRMAVYKKEIRIVKGHDVMPLRFLAPAT--ADNVNFKVPAILRTGGTEWRIPKAGHYVDGV 311 (311) Q Consensus 240 ~~~~l~~ag~~g~~~~v~y~~~~~~~~~~~~~~~~~~~p~~--~~~~~~~~~~~~~~gGv~i~~P~ai~~~dGI 311 (311) ..|-...... ..+. |+|-+=.+ +-+..-..+++..-.. ...-...+.++.|++ +.+.+|.|++.+..= T Consensus 320 G~Pv~~~~~~-~~~~-i~~Gdf~~-~~i~~~~~~~i~~~~~~~~~~~~~~~r~~~r~d-~~~~~~~A~~~~~~~ 389 (392) T protein:vir:13 320 GKVVETDDGM-PADK-VLFADLSK-YRVRFAGSLRVDRSVDAKFSTDQIVYRFLQRAD-GLLVDARGAKVLTVT 389 (392) T ss_pred ceeeEEcCCC-CCCc-EEEeeccc-eeEEeecceEEEeeccccccCCcEEEEEEEEec-cEEecccceEEEEee Confidence 2222211110 1122 22222112 2222222222210000 111124566778885 668999999877766 No 65 >protein:vir:98339 Length: 415 # NCBI annotation: putative capsid protein # Family: family:all:21 # MgeID: mge:1581 # MgeName: phiPVL(108) # Cross-refs: genbank:acc:YP_918931;genbank:gi:119443693;genbank:GeneID:4594501 Probab=98.17 E-value=1.3e-06 Score=52.89 Aligned_cols=277 Identities=11% Similarity=-0.044 Sum_probs=143.2 Q ss_pred CCcccccccchhhhhhhHHHHHHHHHHHHhhhhhhhhhhhhccccCCCCcceeEEEEEEeecccceEEecCcccccceee Q lcl|NC_019522. 1 MAKSVFDVSPVSALSFLVNQAAHIESEIYRIEYPQFKYGTLLPLDNSAPDWAQAVMFRSIDARGELQLFGPNSTDVPTVD 80 (311) Q Consensus 1 ~~~~~~~~~~~~~~~fl~~~L~~id~~v~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~G~a~~~~~~a~dip~v~ 80 (311) ...... ..+.+++++++. +.+.+.|++..+.....+.++.+.. .+-+.-.+.+......+.+.+++.+ .++|..+ T Consensus 117 ~~~~~~-~~~~~~gg~~iP--~~~~~~ii~~~~~~~~l~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~v~E~-~~~~~~~ 191 (415) T protein:vir:98 117 NDIQGG-SLKTDSGFVVIP--EEIVTDILKLKEVEFNLDKYVTVKR-VTNGSGKYPVVRQSEVAALEKVEEL-EENPELA 191 (415) T ss_pred hhhhhc-cccccccccccc--hHHHHHHHHHHHhhhhhhhheeeee-ccCCceeEEEEeecCCccceeeccc-cccCccc Confidence 111111 112233455565 4567788887777777666666533 2222223333333444566777655 4456443 Q ss_pred -eeccceeEEEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhhheeeeeccccc-e-eeeecCCcceeecc Q lcl|NC_019522. 81 -IAMSQGFKDINTAALGYTYSIEEIGFAMLNNVNLDAERGQAVRDVVEQGLNKIYLLGDKGVG-E-GLYTSPNVSVEAAT 157 (311) Q Consensus 81 -~~~~~~~~~v~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aa~~~~~~~~n~~~~~G~~~~g-~-GllN~p~v~~~~~~ 157 (311) ..++.....++.++.-+.+|.+ +. ..+..++..--.....+++.+.+|+.+++|+.... . ++++....... .. T Consensus 192 ~~~~~~v~~~~~k~~~~~~iS~e-ll--~ds~~~l~~~i~~~l~~~~~~~~~~~il~g~g~g~~~~~~~~~~~~~~~-~~ 267 (415) T protein:vir:98 192 VKPFFQLAYDINTHRGYFRISRE-AI--EDAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGFEKEGKK-LE 267 (415) T ss_pred ccceeeEEeeeeeeEeeehhhHH-HH--hhchHHHHHHHHHHHHHHHHHHHHHHHhhccccCccccccccccccccc-cc Confidence 5688889999999998888844 33 33456788888888999999999999999975432 2 22222111111 00 Q ss_pred CCccccCcccccCCHHHHHHHHHHHHHHHHhccCCceecceEEEeCHHHHHHHhcccccCCCCCcchHHHH-HHHhCC-- Q lcl|NC_019522. 158 STFVALVAAIPTNGTQPIIDFFGNAYNTVYLDNTLTVHRPNTFVLPPAQFQLLARTLLSTQNASNVTLLQF-LRTNFP-- 234 (311) Q Consensus 158 ~~~~~~~t~w~~~t~~ei~~di~~~~~~~~~~~~~~~~~p~~l~lpp~~~~~L~~~~~~~~~~~~~Tvl~~-l~~n~~-- 234 (311) . . ...+ ++||.+++.++... + ..+..++|+|+.+..|.+.. . ..|.-++.- +....+ T Consensus 268 ~--~------~~~~----~~~i~~~~~~~~~~--~--~~~~~~v~n~~~~~~l~~lk-d---~~G~~l~~~~~~~~~~~~ 327 (415) T protein:vir:98 268 V--K------KAKS----LDDIKDAINLNVKP--N--YEHNVAIVSQTMFAKLDKMK-D---KLGNYLIQPDVKEKTQQR 327 (415) T ss_pred c--c------cccc----hhHHHHHHHhhhhh--c--cCCCEEEEcHHHHHHHHHhh-c---cCCceeeccCcCCCCCce Confidence 0 0 0111 56777777777432 2 24568999999999986522 1 111111100 000000 Q ss_pred --ceEEEEchhcccCCCCcccEEEEEEcCcceeEEeecchhhhcc-ceeeCCceEEEeeeeeeeeEEEECCeEEEEeecC Q lcl|NC_019522. 235 --DITFEDDILLKGAGVAGADRMAVYKKEIRIVKGHDVMPLRFLA-PATADNVNFKVPAILRTGGTEWRIPKAGHYVDGV 311 (311) Q Consensus 235 --~l~i~~~~~l~~ag~~g~~~~v~y~~~~~~~~~~~~~~~~~~~-p~~~~~~~~~~~~~~~~gGv~i~~P~ai~~~dGI 311 (311) ...++.++.+. .+..|... ++|-+=.+.+.+..-..+++.. +... .......+.|+ ++.+.+|.|+++++-- T Consensus 328 l~G~pV~~~~~~~-~~~~~~~~-~~~Gd~~~~~~~~~~~~~~v~~~~~~~--~~~~~~~~~r~-d~~v~~~~a~~~~~~~ 402 (415) T protein:vir:98 328 LLGAKIEILPDEV-LGQKGNNT-LIIGNLKDAIVLFDRSQYQASWTDYMH--FGECLMIAVRQ-DCRILDYKSAIVIEYD 402 (415) T ss_pred ecceeeEEecccc-cCCCCccE-EEEEehhccEEEEeecceEEEEecccc--CceEEEEEEEe-ccEEeccccEEEEEEe Confidence 01222222222 22233333 3333212222221122222221 1111 12233455776 5778889999999766 No 66 >protein:vir:81100 Length: 415 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:1891 # MgeName: tp310-1 # Cross-refs: genbank:acc:YP_001429874;genbank:gi:156603927;genbank:GeneID:5525320 Probab=98.17 E-value=1.3e-06 Score=52.89 Aligned_cols=277 Identities=11% Similarity=-0.044 Sum_probs=143.2 Q ss_pred CCcccccccchhhhhhhHHHHHHHHHHHHhhhhhhhhhhhhccccCCCCcceeEEEEEEeecccceEEecCcccccceee Q lcl|NC_019522. 1 MAKSVFDVSPVSALSFLVNQAAHIESEIYRIEYPQFKYGTLLPLDNSAPDWAQAVMFRSIDARGELQLFGPNSTDVPTVD 80 (311) Q Consensus 1 ~~~~~~~~~~~~~~~fl~~~L~~id~~v~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~G~a~~~~~~a~dip~v~ 80 (311) ...... ..+.+++++++. +.+.+.|++..+.....+.++.+.. .+-+.-.+.+......+.+.+++.+ .++|..+ T Consensus 117 ~~~~~~-~~~~~~gg~~iP--~~~~~~ii~~~~~~~~l~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~v~E~-~~~~~~~ 191 (415) T protein:vir:81 117 NDIQGG-SLKTDSGFVVIP--EEIVTDILKLKEVEFNLDKYVTVKR-VTNGSGKYPVVRQSEVAALEKVEEL-EENPELA 191 (415) T ss_pred hhhhhc-cccccccccccc--hHHHHHHHHHHHhhhhhhhheeeee-ccCCceeEEEEeecCCccceeeccc-cccCccc Confidence 111111 112233455565 4567788887777777666666533 2222223333333444566777655 4456443 Q ss_pred -eeccceeEEEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhhheeeeeccccc-e-eeeecCCcceeecc Q lcl|NC_019522. 81 -IAMSQGFKDINTAALGYTYSIEEIGFAMLNNVNLDAERGQAVRDVVEQGLNKIYLLGDKGVG-E-GLYTSPNVSVEAAT 157 (311) Q Consensus 81 -~~~~~~~~~v~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aa~~~~~~~~n~~~~~G~~~~g-~-GllN~p~v~~~~~~ 157 (311) ..++.....++.++.-+.+|.+ +. ..+..++..--.....+++.+.+|+.+++|+.... . ++++....... .. T Consensus 192 ~~~~~~v~~~~~k~~~~~~iS~e-ll--~ds~~~l~~~i~~~l~~~~~~~~~~~il~g~g~g~~~~~~~~~~~~~~~-~~ 267 (415) T protein:vir:81 192 VKPFFQLAYDINTHRGYFRISRE-AI--EDAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGFEKEGKK-LE 267 (415) T ss_pred ccceeeEEeeeeeeEeeehhhHH-HH--hhchHHHHHHHHHHHHHHHHHHHHHHHhhccccCccccccccccccccc-cc Confidence 5688889999999998888844 33 33456788888888999999999999999975432 2 22222111111 00 Q ss_pred CCccccCcccccCCHHHHHHHHHHHHHHHHhccCCceecceEEEeCHHHHHHHhcccccCCCCCcchHHHH-HHHhCC-- Q lcl|NC_019522. 158 STFVALVAAIPTNGTQPIIDFFGNAYNTVYLDNTLTVHRPNTFVLPPAQFQLLARTLLSTQNASNVTLLQF-LRTNFP-- 234 (311) Q Consensus 158 ~~~~~~~t~w~~~t~~ei~~di~~~~~~~~~~~~~~~~~p~~l~lpp~~~~~L~~~~~~~~~~~~~Tvl~~-l~~n~~-- 234 (311) . . ...+ ++||.+++.++... + ..+..++|+|+.+..|.+.. . ..|.-++.- +....+ T Consensus 268 ~--~------~~~~----~~~i~~~~~~~~~~--~--~~~~~~v~n~~~~~~l~~lk-d---~~G~~l~~~~~~~~~~~~ 327 (415) T protein:vir:81 268 V--K------KAKS----LDDIKDAINLNVKP--N--YEHNVAIVSQTMFAKLDKMK-D---KLGNYLIQPDVKEKTQQR 327 (415) T ss_pred c--c------cccc----hhHHHHHHHhhhhh--c--cCCCEEEEcHHHHHHHHHhh-c---cCCceeeccCcCCCCCce Confidence 0 0 0111 56777777777432 2 24568999999999986522 1 111111100 000000 Q ss_pred --ceEEEEchhcccCCCCcccEEEEEEcCcceeEEeecchhhhcc-ceeeCCceEEEeeeeeeeeEEEECCeEEEEeecC Q lcl|NC_019522. 235 --DITFEDDILLKGAGVAGADRMAVYKKEIRIVKGHDVMPLRFLA-PATADNVNFKVPAILRTGGTEWRIPKAGHYVDGV 311 (311) Q Consensus 235 --~l~i~~~~~l~~ag~~g~~~~v~y~~~~~~~~~~~~~~~~~~~-p~~~~~~~~~~~~~~~~gGv~i~~P~ai~~~dGI 311 (311) ...++.++.+. .+..|... ++|-+=.+.+.+..-..+++.. +... .......+.|+ ++.+.+|.|+++++-- T Consensus 328 l~G~pV~~~~~~~-~~~~~~~~-~~~Gd~~~~~~~~~~~~~~v~~~~~~~--~~~~~~~~~r~-d~~v~~~~a~~~~~~~ 402 (415) T protein:vir:81 328 LLGAKIEILPDEV-LGQKGNNT-LIIGNLKDAIVLFDRSQYQASWTDYMH--FGECLMIAVRQ-DCRILDYKSAIVIEYD 402 (415) T ss_pred ecceeeEEecccc-cCCCCccE-EEEEehhccEEEEeecceEEEEecccc--CceEEEEEEEe-ccEEeccccEEEEEEe Confidence 01222222222 22233333 3333212222221122222221 1111 12233455776 5778889999999766 No 67 >protein:vir:79987 Length: 415 # NCBI annotation: head protein # Family: family:all:21 # MgeID: mge:1875 # MgeName: tp310-3 # Cross-refs: genbank:acc:YP_001430002;genbank:gi:156604057;genbank:GeneID:5525447 Probab=98.17 E-value=1.3e-06 Score=52.89 Aligned_cols=277 Identities=11% Similarity=-0.044 Sum_probs=143.2 Q ss_pred CCcccccccchhhhhhhHHHHHHHHHHHHhhhhhhhhhhhhccccCCCCcceeEEEEEEeecccceEEecCcccccceee Q lcl|NC_019522. 1 MAKSVFDVSPVSALSFLVNQAAHIESEIYRIEYPQFKYGTLLPLDNSAPDWAQAVMFRSIDARGELQLFGPNSTDVPTVD 80 (311) Q Consensus 1 ~~~~~~~~~~~~~~~fl~~~L~~id~~v~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~G~a~~~~~~a~dip~v~ 80 (311) ...... ..+.+++++++. +.+.+.|++..+.....+.++.+.. .+-+.-.+.+......+.+.+++.+ .++|..+ T Consensus 117 ~~~~~~-~~~~~~gg~~iP--~~~~~~ii~~~~~~~~l~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~v~E~-~~~~~~~ 191 (415) T protein:vir:79 117 NDIQGG-SLKTDSGFVVIP--EEIVTDILKLKEVEFNLDKYVTVKR-VTNGSGKYPVVRQSEVAALEKVEEL-EENPELA 191 (415) T ss_pred hhhhhc-cccccccccccc--hHHHHHHHHHHHhhhhhhhheeeee-ccCCceeEEEEeecCCccceeeccc-cccCccc Confidence 111111 112233455565 4567788887777777666666533 2222223333333444566777655 4456443 Q ss_pred -eeccceeEEEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhhheeeeeccccc-e-eeeecCCcceeecc Q lcl|NC_019522. 81 -IAMSQGFKDINTAALGYTYSIEEIGFAMLNNVNLDAERGQAVRDVVEQGLNKIYLLGDKGVG-E-GLYTSPNVSVEAAT 157 (311) Q Consensus 81 -~~~~~~~~~v~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aa~~~~~~~~n~~~~~G~~~~g-~-GllN~p~v~~~~~~ 157 (311) ..++.....++.++.-+.+|.+ +. ..+..++..--.....+++.+.+|+.+++|+.... . ++++....... .. T Consensus 192 ~~~~~~v~~~~~k~~~~~~iS~e-ll--~ds~~~l~~~i~~~l~~~~~~~~~~~il~g~g~g~~~~~~~~~~~~~~~-~~ 267 (415) T protein:vir:79 192 VKPFFQLAYDINTHRGYFRISRE-AI--EDAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGFEKEGKK-LE 267 (415) T ss_pred ccceeeEEeeeeeeEeeehhhHH-HH--hhchHHHHHHHHHHHHHHHHHHHHHHHhhccccCccccccccccccccc-cc Confidence 5688889999999998888844 33 33456788888888999999999999999975432 2 22222111111 00 Q ss_pred CCccccCcccccCCHHHHHHHHHHHHHHHHhccCCceecceEEEeCHHHHHHHhcccccCCCCCcchHHHH-HHHhCC-- Q lcl|NC_019522. 158 STFVALVAAIPTNGTQPIIDFFGNAYNTVYLDNTLTVHRPNTFVLPPAQFQLLARTLLSTQNASNVTLLQF-LRTNFP-- 234 (311) Q Consensus 158 ~~~~~~~t~w~~~t~~ei~~di~~~~~~~~~~~~~~~~~p~~l~lpp~~~~~L~~~~~~~~~~~~~Tvl~~-l~~n~~-- 234 (311) . . ...+ ++||.+++.++... + ..+..++|+|+.+..|.+.. . ..|.-++.- +....+ T Consensus 268 ~--~------~~~~----~~~i~~~~~~~~~~--~--~~~~~~v~n~~~~~~l~~lk-d---~~G~~l~~~~~~~~~~~~ 327 (415) T protein:vir:79 268 V--K------KAKS----LDDIKDAINLNVKP--N--YEHNVAIVSQTMFAKLDKMK-D---KLGNYLIQPDVKEKTQQR 327 (415) T ss_pred c--c------cccc----hhHHHHHHHhhhhh--c--cCCCEEEEcHHHHHHHHHhh-c---cCCceeeccCcCCCCCce Confidence 0 0 0111 56777777777432 2 24568999999999986522 1 111111100 000000 Q ss_pred --ceEEEEchhcccCCCCcccEEEEEEcCcceeEEeecchhhhcc-ceeeCCceEEEeeeeeeeeEEEECCeEEEEeecC Q lcl|NC_019522. 235 --DITFEDDILLKGAGVAGADRMAVYKKEIRIVKGHDVMPLRFLA-PATADNVNFKVPAILRTGGTEWRIPKAGHYVDGV 311 (311) Q Consensus 235 --~l~i~~~~~l~~ag~~g~~~~v~y~~~~~~~~~~~~~~~~~~~-p~~~~~~~~~~~~~~~~gGv~i~~P~ai~~~dGI 311 (311) ...++.++.+. .+..|... ++|-+=.+.+.+..-..+++.. +... .......+.|+ ++.+.+|.|+++++-- T Consensus 328 l~G~pV~~~~~~~-~~~~~~~~-~~~Gd~~~~~~~~~~~~~~v~~~~~~~--~~~~~~~~~r~-d~~v~~~~a~~~~~~~ 402 (415) T protein:vir:79 328 LLGAKIEILPDEV-LGQKGNNT-LIIGNLKDAIVLFDRSQYQASWTDYMH--FGECLMIAVRQ-DCRILDYKSAIVIEYD 402 (415) T ss_pred ecceeeEEecccc-cCCCCccE-EEEEehhccEEEEeecceEEEEecccc--CceEEEEEEEe-ccEEeccccEEEEEEe Confidence 01222222222 22233333 3333212222221122222221 1111 12233455776 5778889999999766 No 68 >protein:vir:6242 Length: 390 # NCBI annotation: gp36 # Family: family:all:21 # MgeID: mge:131 # MgeName: phi-BT1 # Cross-refs: genbank:acc:NP_813696;swissprot:trembl:q859c1;genbank:gi:29366756;interpro:IPR006444;uniprot:Q859C1;genbank:GeneID:1258897 Probab=98.17 E-value=4.6e-07 Score=55.30 Aligned_cols=280 Identities=12% Similarity=0.038 Sum_probs=141.7 Q ss_pred CCcccccccchhhhhhhHHHHHHHHHHHHhhhhhhhhhhhhccccCCCCcceeEEEEEEeecccceEEecCcccccceee Q lcl|NC_019522. 1 MAKSVFDVSPVSALSFLVNQAAHIESEIYRIEYPQFKYGTLLPLDNSAPDWAQAVMFRSIDARGELQLFGPNSTDVPTVD 80 (311) Q Consensus 1 ~~~~~~~~~~~~~~~fl~~~L~~id~~v~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~G~a~~~~~~a~dip~v~ 80 (311) .+.-..+....++.+++.. +.....|.+........+.+..+..-.+ ...+.+.+....+.+.|++.. ..+|..+ T Consensus 106 ~~~~~~~~t~~~~g~~~~~--~~~~~~i~~~~~~~~~l~~~~~~~~~~~--~~~~~~p~~~~~~~a~wv~E~-~~~~~~~ 180 (390) T protein:vir:62 106 FAPEKRDGTKAGNPNVLSR--TLYGQLIAQAVERSAIMRGGATTFTTSD--ANPLDFTVITGRSSASIVGET-AEIPESY 180 (390) T ss_pred hhhhhhcccccCCCccccc--cchHHHHHHHHhhhhhhhhcceeeecCC--CceeEEEEEcCCcceeeeccc-ccccccc Confidence 1111111111222233332 2233334443333333344443322111 234566777777888888754 5688888 Q ss_pred eeccceeEEEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhhheeeeeccccceeeeecCCcceeeccCCc Q lcl|NC_019522. 81 IAMSQGFKDINTAALGYTYSIEEIGFAMLNNVNLDAERGQAVRDVVEQGLNKIYLLGDKGVGEGLYTSPNVSVEAATSTF 160 (311) Q Consensus 81 ~~~~~~~~~v~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aa~~~~~~~~n~~~~~G~~~~g~GllN~p~v~~~~~~~~~ 160 (311) ..++......+.++.-+.+|.+=|+ ...+++...-....++++...+|+-+++|+.. ..|++|+++.......... T Consensus 181 ~~f~~i~~~~~k~~~~~~iS~ell~---ds~~~l~~~i~~~l~~~i~~~~d~~~l~G~G~-p~Gi~~~~~~~~~~~~~~~ 256 (390) T protein:vir:62 181 PATAQRSMGGFKYGFASVVSYEFAT---DQVLDLVGFLVSDAGPAIGDAMGRHFITGTGQ-PRGILTDASPATATFLATD 256 (390) T ss_pred cceeeeEeeeeeEEeehHHHHHHHh---hhhHHHHHHHHHHHHHHHHHHHHhhhhccCCc-cccccccccccccceeccc Confidence 9999999999999999988855443 35667888888999999999999999999852 4699998765433222111 Q ss_pred cccCcccccCCHHHHHHHHHHHHHHHHhccCCceecceEEEeCHHHHHHHhcccccCCCCCcchHHHHHHHhCCceEEEE Q lcl|NC_019522. 161 VALVAAIPTNGTQPIIDFFGNAYNTVYLDNTLTVHRPNTFVLPPAQFQLLARTLLSTQNASNVTLLQFLRTNFPDITFED 240 (311) Q Consensus 161 ~~~~t~w~~~t~~ei~~di~~~~~~~~~~~~~~~~~p~~l~lpp~~~~~L~~~~~~~~~~~~~Tvl~~l~~n~~~l~i~~ 240 (311) + ...+ +++|.+++.++.. .+. . .-..+|+++.+..|.+-. . ..+.=++.-=..++..-++.. T Consensus 257 ~------~~~~----~~~l~~~~~~l~~--~~~-~-~a~~vmn~~~~~~L~~lk-d---~~g~~l~~~~~~~g~~~~l~G 318 (390) T protein:vir:62 257 T------DSKV----SDALIDLFHEVPS--AYR-A-NAKYVVNDLRAAQMRKLK-D---ANGQYLWQSGLTVGAPSLFNG 318 (390) T ss_pred c------cccc----hHHHHHHHHhhhh--hhh-c-CCEEEEchHHHHHHHHhh-c---cCCCeeecCCcCCCccceecc Confidence 1 1223 4455566665532 121 1 125899999999986522 1 111111100000111111111 Q ss_pred chhcccCCCCcccEEEEEEcCcceeEEeecchhhhcc-c-eeeCCceEEEeeeeeeeeEEEECCeEEEEeecC Q lcl|NC_019522. 241 DILLKGAGVAGADRMAVYKKEIRIVKGHDVMPLRFLA-P-ATADNVNFKVPAILRTGGTEWRIPKAGHYVDGV 311 (311) Q Consensus 241 ~~~l~~ag~~g~~~~v~y~~~~~~~~~~~~~~~~~~~-p-~~~~~~~~~~~~~~~~gGv~i~~P~ai~~~dGI 311 (311) .|-..... ...+. ++|-+-.++ -+..-.+++... . .....-...+..+.|++ +.+.+|.|++.+..= T Consensus 319 ~Pv~~~~~-~p~~~-i~~gd~s~~-~i~~~~~~~v~~~~~~~~~~~~~~~~~~~r~d-~~~~~~~A~~~l~~~ 387 (390) T protein:vir:62 319 KVVETDDG-MPADK-ILFADLSKY-RVRFAGSLRVDRSVDAKFSTDQIVYRFLQRAD-GLLVDARGAKVLTVT 387 (390) T ss_pred cceEEecC-CCCcc-EEEeeccce-eEEeecceEEEeeccccccCCcEEEEEEEEeC-cEeechhheEEEEee Confidence 11111000 01122 222221121 111112221110 0 00111124456678875 579999999988855 No 69 >protein:vir:93616 Length: 645 # NCBI annotation: putative major head protein/prohead protease # Family: family:all:21 # MgeID: mge:157 # MgeName: phi 4795 # Cross-refs: genbank:acc:YP_001449293;genbank:gi:157166041;goa:Q6H9U8;interpro:IPR006433;uniprot:Q6H9U8;genbank:GeneID:5580438 Probab=98.15 E-value=1.2e-06 Score=52.99 Aligned_cols=275 Identities=11% Similarity=-0.032 Sum_probs=145.5 Q ss_pred CCc-ccccccchhhhhhhHHHHHHHHHHHHhhhhhhhhhhhhccccCCCCcce-eEEEEEEeecccceEEecCcccccce Q lcl|NC_019522. 1 MAK-SVFDVSPVSALSFLVNQAAHIESEIYRIEYPQFKYGTLLPLDNSAPDWA-QAVMFRSIDARGELQLFGPNSTDVPT 78 (311) Q Consensus 1 ~~~-~~~~~~~~~~~~fl~~~L~~id~~v~~~~~~~~~~~~~~~v~~~~~~~~-~~~~~~~~~~~G~a~~~~~~a~dip~ 78 (311) .+. ..+.+++.++++++.. +.+...|++..++....+++-......-... -.+........+.+.|++. +..+|. T Consensus 332 ~a~~~~~~~~~~~~Gg~~vp--~~~~~~ii~~l~~~svv~~l~~~~~~~~~~~~~~~~ip~~t~~~~a~wv~E-g~~~~~ 408 (645) T protein:vir:93 332 SAVGAGTTTDPQWAGSLSEY--QEYAQDFIDYLRPQTIIGRFGQGGIPALRQVPFNIRVHAQVSGGAAGWVGE-GKTKPL 408 (645) T ss_pred hhhhccccccccccCCccCc--hhhHHHHHHhhhhhhhHHhhccccccccccccCceeeeeeecCcceEEecc-Cccccc Confidence 111 2333444556677765 3455678887777666666644322211111 1233344445567788865 466899 Q ss_pred eeeeccceeEEEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhhheeeeeccccc-----eeeeecCCcce Q lcl|NC_019522. 79 VDIAMSQGFKDINTAALGYTYSIEEIGFAMLNNVNLDAERGQAVRDVVEQGLNKIYLLGDKGVG-----EGLYTSPNVSV 153 (311) Q Consensus 79 v~~~~~~~~~~v~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aa~~~~~~~~n~~~~~G~~~~g-----~GllN~p~v~~ 153 (311) .+..++..+...+.++.-..+| +||.. .+..++.+--.....+++.+.+|.-+|+|+..-+ .|++|.- T Consensus 409 s~~~f~~v~l~~~kla~~~~iS-~ell~--ds~~~~~~~i~~~l~~aia~~~d~a~l~g~g~~~~~~~p~gi~~~~---- 481 (645) T protein:vir:93 409 TKFDFESITFSHAKVSAIAVLT-EELIR--FSSPAADALVRNALAEAVVARLDTDFVDPKKAAVADVSPASITHDV---- 481 (645) T ss_pred cccceeEEEEeeEEEEEeehhH-HHHHh--hchHHHHHHHHHHHHHHHHHHHHHHhhcCCCcccCCccccceeccc---- Confidence 9999999999999999988888 44432 3456788888889999999999999999875421 2554321 Q ss_pred eeccCCccccCcccccCCHHHHHHHHHHHHHHHHhccCCceecc-eEEEeCHHHHHHHhcccccCCCCCcchHHHHHHHh Q lcl|NC_019522. 154 EAATSTFVALVAAIPTNGTQPIIDFFGNAYNTVYLDNTLTVHRP-NTFVLPPAQFQLLARTLLSTQNASNVTLLQFLRTN 232 (311) Q Consensus 154 ~~~~~~~~~~~t~w~~~t~~ei~~di~~~~~~~~~~~~~~~~~p-~~l~lpp~~~~~L~~~~~~~~~~~~~Tvl~~l~~n 232 (311) .+. ++......|+..++.++...+ .... -..+|+|..+..|.+..-. + |.-++--+... T Consensus 482 ---~~~----------~~~~~~~~d~~~~~~~~~~a~---~~~~~a~~vmn~~~~~~L~~lkd~-~---G~~~~~~~~~~ 541 (645) T protein:vir:93 482 ---KGT----------ASSGNPDADAEAAFGQFVAAN---LQPTGAVWLMSSTNALALSMRKNA-L---GQKEYPDMTLL 541 (645) T ss_pred ---ccc----------ccccchHHHHHHHHHHHHhcC---CCccccEEEEcHHHHHHHHhcccc-C---CceeecCCCCC Confidence 000 011112467778887775432 1122 2478899999988654321 1 11111000000 Q ss_pred CC---ceEEEEchhcccCCCCc--ccEEEEEEcCcceeEEeecchhhh------------------ccceeeCCceEEEe Q lcl|NC_019522. 233 FP---DITFEDDILLKGAGVAG--ADRMAVYKKEIRIVKGHDVMPLRF------------------LAPATADNVNFKVP 289 (311) Q Consensus 233 ~~---~l~i~~~~~l~~ag~~g--~~~~v~y~~~~~~~~~~~~~~~~~------------------~~p~~~~~~~~~~~ 289 (311) ++ .+-+.....+.+.-..| .+..++.. ..+.+.+...-++ ..-.| +++ +-+. T Consensus 542 ~~tL~G~PV~~s~~vp~~~~~gd~s~~~ig~~---~~v~i~~s~~a~~~~~~~~~~~~~~~~~~~~v~lf~-~d~-vair 616 (645) T protein:vir:93 542 GGSFQGLPVIVSQYVGDQLVLVNAPDIYLADD---GGVAVDMSREASLEMQSEPTGDSTTPSPVELVSMFQ-TGS-VAIR 616 (645) T ss_pred CceeeceeeEEeccCCcceeEeccccEEEEEe---cceEEEeecceeEEEeecccccccccccccchhHhh-cCc-eEEE Confidence 10 01111111111100000 11111111 1111111100000 00022 222 4567 Q ss_pred eeeeeeeEEEECCeEEEEeecC Q lcl|NC_019522. 290 AILRTGGTEWRIPKAGHYVDGV 311 (311) Q Consensus 290 ~~~~~gGv~i~~P~ai~~~dGI 311 (311) +..++ +..+++|.||++++|+ T Consensus 617 a~~r~-d~~~~~p~a~~~lt~~ 637 (645) T protein:vir:93 617 AERWI-NWRRRRTAAVAVITGV 637 (645) T ss_pred EEEEE-cceeeCccceEEEecc Confidence 77887 5778999999999999 No 70 >protein:vir:97053 Length: 390 # NCBI annotation: putative head protein # Family: family:all:585 # MgeID: mge:1653 # MgeName: OP1 # Cross-refs: genbank:acc:YP_453565;genbank:gi:84662600;genbank:GeneID:5142468 Probab=98.14 E-value=1e-06 Score=53.44 Aligned_cols=271 Identities=11% Similarity=-0.005 Sum_probs=149.4 Q ss_pred CCcccccccchhhhhhhHHHHHHHHHHHHhhhhhhhhhhhhccccCCCCcceeEEEEEEeec-ccceEEecCccccccee Q lcl|NC_019522. 1 MAKSVFDVSPVSALSFLVNQAAHIESEIYRIEYPQFKYGTLLPLDNSAPDWAQAVMFRSIDA-RGELQLFGPNSTDVPTV 79 (311) Q Consensus 1 ~~~~~~~~~~~~~~~fl~~~L~~id~~v~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~-~G~a~~~~~~a~dip~v 79 (311) ...-++.+.+..+.+.+.. +.+.+.|++........+.++++..- ....+.|...+. .+.+.|++.+ ..+|.. T Consensus 108 ~~~~~~~~~~~~~~g~lip--~~~~~~ii~~~~~~~~i~~~~~~~~~---~~~~~~~~~~~~~~~~a~~v~Eg-~~~~~~ 181 (390) T protein:vir:97 108 AALNTASTDAAGSAGALTT--PNRLPGFITPPDARLTVRDLIGSGRT---DSALIEYVQETGFVNNAAIVAEG-ALKPES 181 (390) T ss_pred HHHHhhhcccccccccccc--hhhhHHHHHHHhhhhhhHhhcceeec---cCCceEEEEEecCCcceeeecCC-cccccc Confidence 0001111112222223332 23445677777776666666664332 233455555544 4678888765 558888 Q ss_pred eeeccceeEEEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhhheeeeeccccc--eeeeecCCcceeecc Q lcl|NC_019522. 80 DIAMSQGFKDINTAALGYTYSIEEIGFAMLNNVNLDAERGQAVRDVVEQGLNKIYLLGDKGVG--EGLYTSPNVSVEAAT 157 (311) Q Consensus 80 ~~~~~~~~~~v~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aa~~~~~~~~n~~~~~G~~~~g--~GllN~p~v~~~~~~ 157 (311) +..++......+.++.-..++. |+..-. .++.+--....++++.+.+|+-+|+|+...+ .|++|.+++...... T Consensus 182 ~~~~~~i~~~~~k~~~~~~is~-ell~ds---~~l~~~i~~~la~a~~~~~d~a~l~G~g~~~~p~Gi~~~~~~~~~~~~ 257 (390) T protein:vir:97 182 SLKFAKKTDTTHVIAHTMKATR-QILSDA---PQLASYMNNRLIRGLKVKEDAEILRGTGANDGLLGLIPQATTYAAPTT 257 (390) T ss_pred ccceeEEEEeeeeEEEeehhhH-HHHHhH---HHHHHHHHHHHHHHHHHHHHHHHhhcCCCCccccceeecccccccccc Confidence 8899999999999999999886 454322 2588888888999999999999999986544 599998775433222 Q ss_pred CCccccCcccccCCHHHHHHHHHHHHHHHHhccCCceecceEEEeCHHHHHHHhcccccCCCCCcchHHHHHHHh----C Q lcl|NC_019522. 158 STFVALVAAIPTNGTQPIIDFFGNAYNTVYLDNTLTVHRPNTFVLPPAQFQLLARTLLSTQNASNVTLLQFLRTN----F 233 (311) Q Consensus 158 ~~~~~~~t~w~~~t~~ei~~di~~~~~~~~~~~~~~~~~p~~l~lpp~~~~~L~~~~~~~~~~~~~Tvl~~l~~n----~ 233 (311) .+.+..+++|.+++..+... + ..+..++|+|+.+..|.+-. . ..|.-++.-.... . T Consensus 258 ------------~~~~~~~d~~~~~~~~~~~~--~--~~~~~~v~n~~~~~~L~~lk-d---~~G~~l~~~~~~~~~~~l 317 (390) T protein:vir:97 258 ------------IAGATRVDQLRLAMLQASLA--E--YPASGIVINPIDWAAIELAK-D---ANNQYLIGNARGTLTPTL 317 (390) T ss_pred ------------ccccchHHHHHHHHHhhccc--c--CCCCEEEEcHHHHHHHHHhh-c---CCCceeecCccCCCCcee Confidence 12233367788888877422 2 24568999999999997532 1 1122111100000 0 Q ss_pred CceEEEEchhcccCCCCcccEEEEEEcCcceeEEeecchhhhcc-----ceeeCCceEEEeeeeeeeeEEEECCeEEEEe Q lcl|NC_019522. 234 PDITFEDDILLKGAGVAGADRMAVYKKEIRIVKGHDVMPLRFLA-----PATADNVNFKVPAILRTGGTEWRIPKAGHYV 308 (311) Q Consensus 234 ~~l~i~~~~~l~~ag~~g~~~~v~y~~~~~~~~~~~~~~~~~~~-----p~~~~~~~~~~~~~~~~gGv~i~~P~ai~~~ 308 (311) ..+.++.++.+.. +..++.+- .+.+.+.....++... ..+ ++. ....+..|+ +..+++|.|++++ T Consensus 318 ~G~pV~~~~~~~~------~~~~~gd~-~~~~~~~~~~~~~i~~~~~~~~f~-~~~-~~~r~~~r~-d~~v~~~~a~v~~ 387 (390) T protein:vir:97 318 WGLPVVATQAMAP------GEFLVGAF-DLAAQIFDQWDARVEIGYVNDDFQ-RNM-VTVLAEERL-ALVVYRPEALITG 387 (390) T ss_pred cceeeEEcCCCCC------CcEEEEec-cceEEEEEecceEEEEeecccccc-cCc-EEEEEEEee-ccEEeccccEEEE Confidence 1122333333321 11122111 1122222222222111 111 121 334555666 5778999999887 Q ss_pred ecC Q lcl|NC_019522. 309 DGV 311 (311) Q Consensus 309 dGI 311 (311) +== T Consensus 388 ~~a 390 (390) T protein:vir:97 388 SFA 390 (390) T ss_pred EeC Confidence 633 No 71 >protein:vir:4700 Length: 415 # NCBI annotation: phi PVL ORF 7 homologue # Family: family:all:21 # MgeID: mge:102 # MgeName: phiPV83 # Cross-refs: genbank:acc:NP_061632;genbank:gi:9635719;genbank:GeneID:1262976 Probab=98.11 E-value=1.2e-06 Score=53.00 Aligned_cols=279 Identities=11% Similarity=-0.053 Sum_probs=143.4 Q ss_pred CCc--ccccccchhhhhhhHHHHHHHHHHHHhhhhhhhhhhhhccccCCCCcceeEEEEEEeecccceEEecCcccccce Q lcl|NC_019522. 1 MAK--SVFDVSPVSALSFLVNQAAHIESEIYRIEYPQFKYGTLLPLDNSAPDWAQAVMFRSIDARGELQLFGPNSTDVPT 78 (311) Q Consensus 1 ~~~--~~~~~~~~~~~~fl~~~L~~id~~v~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~G~a~~~~~~a~dip~ 78 (311) ... ........+++++++. +.+.+.|++........+.++.+..- +.+...+.+......+.+.+++.++ .+|. T Consensus 114 ~~~~~~~~~~~~t~~g~~~iP--~~~~~~ii~~~~~~~~l~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~v~Eg~-~~~~ 189 (415) T protein:vir:47 114 ETRNDIQGGSLKTDSGFVVIP--EEIVTDILKLKEVEFNLDKYVTVKRV-TNGSGKYPVVRQSEVAALEKVEELE-ENPE 189 (415) T ss_pred hhhhhhhhccccccCCccccc--HHHHHHHHHHHHhhhhhhhhcceeec-cCCceeEEEEEecCCcceeeccccc-cccc Confidence 000 0001111123344554 55677788888777777777664321 1122222222233444667776653 4565 Q ss_pred e-eeeccceeEEEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhhheeeeeccccc-eeeeecCCcceeec Q lcl|NC_019522. 79 V-DIAMSQGFKDINTAALGYTYSIEEIGFAMLNNVNLDAERGQAVRDVVEQGLNKIYLLGDKGVG-EGLYTSPNVSVEAA 156 (311) Q Consensus 79 v-~~~~~~~~~~v~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aa~~~~~~~~n~~~~~G~~~~g-~GllN~p~v~~~~~ 156 (311) . ...++......+.++..+.+|.+ +. ..+..+|..--....++++.+.+|+.+++|+.... .+.+.......... T Consensus 190 ~~~~~~~~v~~~~~k~~~~~~iS~e-ll--~ds~~~l~~~i~~~l~~~i~~~~d~~il~g~g~g~~~~~~~~~~~~~~~~ 266 (415) T protein:vir:47 190 LAVKPFFQLAYDINTHRGYFRISRE-AI--EDAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGFEKEGKKL 266 (415) T ss_pred ccccceeeEEeeeeeeEeeehhhHH-HH--hhchHHHHHHHHHHHHHHHHHHHHHHHhhccccCCcccccccccccccee Confidence 4 35788889999999999988854 33 33456888889999999999999999999975432 22222211100000 Q ss_pred cCCccccCcccccCCHHHHHHHHHHHHHHHHhccCCceecceEEEeCHHHHHHHhcccccCCCCCcchHHHHHHHhCC-- Q lcl|NC_019522. 157 TSTFVALVAAIPTNGTQPIIDFFGNAYNTVYLDNTLTVHRPNTFVLPPAQFQLLARTLLSTQNASNVTLLQFLRTNFP-- 234 (311) Q Consensus 157 ~~~~~~~~t~w~~~t~~ei~~di~~~~~~~~~~~~~~~~~p~~l~lpp~~~~~L~~~~~~~~~~~~~Tvl~~l~~n~~-- 234 (311) . . +...-++||.+++.++... + ..+..++|.|+.+..|.+.. . ..|.-++.--..++. T Consensus 267 ~-----------~-~~~~~~~~i~~~~~~~~~~--~--~~~~~~v~n~~~~~~L~~lk-d---~~G~~i~~~~~~~~~~~ 326 (415) T protein:vir:47 267 E-----------V-KKAKSLDDIKDAINLNVKP--N--YEHNVAIVSQTMFAKLDKMK-D---KLGNYLIQPDVKEKTQQ 326 (415) T ss_pred c-----------c-ccccchHHHHHHHHhhhhh--c--cCCCEEEEcHHHHHHHHHhh-c---cCCCeeeccCcCCCCCc Confidence 0 0 0111156777777777432 2 23568999999999986532 1 111211100001111 Q ss_pred ---ceEEEEchhcccCCCCcccEEEEEEcCcceeEEeecchhhhcc-ceeeCCceEEEeeeeeeeeEEEECCeEEEEeec Q lcl|NC_019522. 235 ---DITFEDDILLKGAGVAGADRMAVYKKEIRIVKGHDVMPLRFLA-PATADNVNFKVPAILRTGGTEWRIPKAGHYVDG 310 (311) Q Consensus 235 ---~l~i~~~~~l~~ag~~g~~~~v~y~~~~~~~~~~~~~~~~~~~-p~~~~~~~~~~~~~~~~gGv~i~~P~ai~~~dG 310 (311) ...++..+.+. .+.+|... ++|-+=.+.+.+..-..+++.. .... .......+.|+ ++.+.+|.++++++- T Consensus 327 ~l~G~pV~~~~~~~-~~~~~~~~-~~~gd~~~~~~~~~~~~~~v~~~~~~~--~~~~~~~~~r~-d~~v~~~~a~~~~~~ 401 (415) T protein:vir:47 327 RLLGAKIEILPDEV-LGQKGNNT-LIIGNLKDAIVLFDRSQYQASWTDYMH--FGECLMIAVRQ-DCRILDYKSAIVIEY 401 (415) T ss_pred cccceeeEEecccc-ccCCCccE-EEEEehhccEEEEeecceEEEeecccc--CceEEEEEEEe-ccEEeccccEEEEEe Confidence 11222222221 22233333 3333322323222222222211 1111 12233456776 677889999999885 Q ss_pred C Q lcl|NC_019522. 311 V 311 (311) Q Consensus 311 I 311 (311) - T Consensus 402 ~ 402 (415) T protein:vir:47 402 D 402 (415) T ss_pred e Confidence 5 No 72 >protein:vir:4600 Length: 415 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:101 # MgeName: PVL # Cross-refs: genbank:acc:NP_058445;genbank:gi:9635171;genbank:GeneID:1262708 Probab=98.11 E-value=1.2e-06 Score=53.00 Aligned_cols=279 Identities=11% Similarity=-0.053 Sum_probs=143.4 Q ss_pred CCc--ccccccchhhhhhhHHHHHHHHHHHHhhhhhhhhhhhhccccCCCCcceeEEEEEEeecccceEEecCcccccce Q lcl|NC_019522. 1 MAK--SVFDVSPVSALSFLVNQAAHIESEIYRIEYPQFKYGTLLPLDNSAPDWAQAVMFRSIDARGELQLFGPNSTDVPT 78 (311) Q Consensus 1 ~~~--~~~~~~~~~~~~fl~~~L~~id~~v~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~G~a~~~~~~a~dip~ 78 (311) ... ........+++++++. +.+.+.|++........+.++.+..- +.+...+.+......+.+.+++.++ .+|. T Consensus 114 ~~~~~~~~~~~~t~~g~~~iP--~~~~~~ii~~~~~~~~l~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~v~Eg~-~~~~ 189 (415) T protein:vir:46 114 ETRNDIQGGSLKTDSGFVVIP--EEIVTDILKLKEVEFNLDKYVTVKRV-TNGSGKYPVVRQSEVAALEKVEELE-ENPE 189 (415) T ss_pred hhhhhhhhccccccCCccccc--HHHHHHHHHHHHhhhhhhhhcceeec-cCCceeEEEEEecCCcceeeccccc-cccc Confidence 000 0001111123344554 55677788888777777777664321 1122222222233444667776653 4565 Q ss_pred e-eeeccceeEEEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhhheeeeeccccc-eeeeecCCcceeec Q lcl|NC_019522. 79 V-DIAMSQGFKDINTAALGYTYSIEEIGFAMLNNVNLDAERGQAVRDVVEQGLNKIYLLGDKGVG-EGLYTSPNVSVEAA 156 (311) Q Consensus 79 v-~~~~~~~~~~v~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aa~~~~~~~~n~~~~~G~~~~g-~GllN~p~v~~~~~ 156 (311) . ...++......+.++..+.+|.+ +. ..+..+|..--....++++.+.+|+.+++|+.... .+.+.......... T Consensus 190 ~~~~~~~~v~~~~~k~~~~~~iS~e-ll--~ds~~~l~~~i~~~l~~~i~~~~d~~il~g~g~g~~~~~~~~~~~~~~~~ 266 (415) T protein:vir:46 190 LAVKPFFQLAYDINTHRGYFRISRE-AI--EDAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGFEKEGKKL 266 (415) T ss_pred ccccceeeEEeeeeeeEeeehhhHH-HH--hhchHHHHHHHHHHHHHHHHHHHHHHHhhccccCCcccccccccccccee Confidence 4 35788889999999999988854 33 33456888889999999999999999999975432 22222211100000 Q ss_pred cCCccccCcccccCCHHHHHHHHHHHHHHHHhccCCceecceEEEeCHHHHHHHhcccccCCCCCcchHHHHHHHhCC-- Q lcl|NC_019522. 157 TSTFVALVAAIPTNGTQPIIDFFGNAYNTVYLDNTLTVHRPNTFVLPPAQFQLLARTLLSTQNASNVTLLQFLRTNFP-- 234 (311) Q Consensus 157 ~~~~~~~~t~w~~~t~~ei~~di~~~~~~~~~~~~~~~~~p~~l~lpp~~~~~L~~~~~~~~~~~~~Tvl~~l~~n~~-- 234 (311) . . +...-++||.+++.++... + ..+..++|.|+.+..|.+.. . ..|.-++.--..++. T Consensus 267 ~-----------~-~~~~~~~~i~~~~~~~~~~--~--~~~~~~v~n~~~~~~L~~lk-d---~~G~~i~~~~~~~~~~~ 326 (415) T protein:vir:46 267 E-----------V-KKAKSLDDIKDAINLNVKP--N--YEHNVAIVSQTMFAKLDKMK-D---KLGNYLIQPDVKEKTQQ 326 (415) T ss_pred c-----------c-ccccchHHHHHHHHhhhhh--c--cCCCEEEEcHHHHHHHHHhh-c---cCCCeeeccCcCCCCCc Confidence 0 0 0111156777777777432 2 23568999999999986532 1 111211100001111 Q ss_pred ---ceEEEEchhcccCCCCcccEEEEEEcCcceeEEeecchhhhcc-ceeeCCceEEEeeeeeeeeEEEECCeEEEEeec Q lcl|NC_019522. 235 ---DITFEDDILLKGAGVAGADRMAVYKKEIRIVKGHDVMPLRFLA-PATADNVNFKVPAILRTGGTEWRIPKAGHYVDG 310 (311) Q Consensus 235 ---~l~i~~~~~l~~ag~~g~~~~v~y~~~~~~~~~~~~~~~~~~~-p~~~~~~~~~~~~~~~~gGv~i~~P~ai~~~dG 310 (311) ...++..+.+. .+.+|... ++|-+=.+.+.+..-..+++.. .... .......+.|+ ++.+.+|.++++++- T Consensus 327 ~l~G~pV~~~~~~~-~~~~~~~~-~~~gd~~~~~~~~~~~~~~v~~~~~~~--~~~~~~~~~r~-d~~v~~~~a~~~~~~ 401 (415) T protein:vir:46 327 RLLGAKIEILPDEV-LGQKGNNT-LIIGNLKDAIVLFDRSQYQASWTDYMH--FGECLMIAVRQ-DCRILDYKSAIVIEY 401 (415) T ss_pred cccceeeEEecccc-ccCCCccE-EEEEehhccEEEEeecceEEEeecccc--CceEEEEEEEe-ccEEeccccEEEEEe Confidence 11222222221 22233333 3333322323222222222211 1111 12233456776 677889999999885 Q ss_pred C Q lcl|NC_019522. 311 V 311 (311) Q Consensus 311 I 311 (311) - T Consensus 402 ~ 402 (415) T protein:vir:46 402 D 402 (415) T ss_pred e Confidence 5 No 73 >protein:vir:96762 Length: 632 # NCBI annotation: putative phage-related protein # Family: family:all:21 # MgeID: mge:1628 # MgeName: VP882 # Cross-refs: genbank:acc:YP_001039818;genbank:gi:126010917;genbank:GeneID:5076272 Probab=98.11 E-value=6.5e-07 Score=54.50 Aligned_cols=274 Identities=10% Similarity=0.020 Sum_probs=146.1 Q ss_pred CCcccccccchhhhhhhHHHHHHHHHHHHhhhhhhhhhhhhccccCCCCcceeEEEEEEeecccceEEecCcccccceee Q lcl|NC_019522. 1 MAKSVFDVSPVSALSFLVNQAAHIESEIYRIEYPQFKYGTLLPLDNSAPDWAQAVMFRSIDARGELQLFGPNSTDVPTVD 80 (311) Q Consensus 1 ~~~~~~~~~~~~~~~fl~~~L~~id~~v~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~G~a~~~~~~a~dip~v~ 80 (311) +...++.+.+.+.+++|+.. +.+...|++.+++....+++ +.. ..+-....+.+......+.+.|++.+ ..+|..+ T Consensus 352 l~~ra~~~~t~~~gg~lvp~-~~~~~~iie~lr~~s~i~~l-~~~-~~~~~~g~~~ip~~~~~~~a~wv~E~-~~~~~s~ 427 (632) T protein:vir:96 352 LVQRQLEKKTAGKGGELVAT-ELLSEEFIDILRNKAIIGQM-GAR-MLPGLVGDVDIPKKTSGANFYWIGED-EDVQDSD 427 (632) T ss_pred HHHhhhhccccccccccccc-ccchHHHHHHHhhcchhhhh-cce-EeecCCcceEEEEEeCCceeEeecCC-ccccccc Confidence 11223333344445555531 12345677777665555554 221 12222234566667667777887765 5578888 Q ss_pred eeccceeEEEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhhheeeeeccccc--eeeeecCCcceeeccC Q lcl|NC_019522. 81 IAMSQGFKDINTAALGYTYSIEEIGFAMLNNVNLDAERGQAVRDVVEQGLNKIYLLGDKGVG--EGLYTSPNVSVEAATS 158 (311) Q Consensus 81 ~~~~~~~~~v~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aa~~~~~~~~n~~~~~G~~~~g--~GllN~p~v~~~~~~~ 158 (311) ..++..+...+.++..+.+|.+=|. ....++...-......++...+|+-+++|+...+ .|++|..+++..+..+ T Consensus 428 ~~f~~i~l~~~k~~~~v~iS~ell~---ds~~~~~~~i~~~l~~a~~~~~d~a~l~G~G~~~~p~Gi~~~~~~~~~~~~~ 504 (632) T protein:vir:96 428 FDFTTLSFSPKTIAGAVPVTRKLRK---QSSIHVENLIREDLIEGIGVALDLAMLTGTGLANDPVGLLNMTGVPALTYPA 504 (632) T ss_pred cceeeEEeeeeEEEEehhhHHHHHh---ccchHHHHHHHHHHHHHHHHHHHHHhhcccCCCCccceeeecccccceeccc Confidence 8889999999999998888844233 3466788888889999999999999999987544 5999998876543322 Q ss_pred CccccCcccccCCHHHHHHHHHHHHHHHHhccCCceecceEEEeCHHHHHHHhcccccCCCCCcchHHHHHHHhCC--ce Q lcl|NC_019522. 159 TFVALVAAIPTNGTQPIIDFFGNAYNTVYLDNTLTVHRPNTFVLPPAQFQLLARTLLSTQNASNVTLLQFLRTNFP--DI 236 (311) Q Consensus 159 ~~~~~~t~w~~~t~~ei~~di~~~~~~~~~~~~~~~~~p~~l~lpp~~~~~L~~~~~~~~~~~~~Tvl~~l~~n~~--~l 236 (311) . . .+ +++|.++..++...... ..+...++.|.....|.+.... +..+.-++ .++. .. T Consensus 505 ~----~-----~~----~~~i~~~~~~i~~~~~~--~~~~~~~~~~~~~~~l~~~~l~--d~~G~~i~----~~~~l~G~ 563 (632) T protein:vir:96 505 G----G-----VD----WASVVDMETKISTFNAD--AGRLAYLTSVTQRGAAKKAQVF--DNTGERIW----QNNEVNGY 563 (632) T ss_pred c----c-----CC----HHHHHHHHHHHhhcccc--cCccEEEEchhHHHHHHHHhcc--CCCCceee----cCCeeccc Confidence 1 1 11 45667777776443221 1233577888877776543221 11122221 1110 01 Q ss_pred EEEEchhccc-CCCCcc-cEEEEEEcCcceeEEeecchhhhccceeeCCceEEEeeeeeeeeEEEECCeEEEEeecC Q lcl|NC_019522. 237 TFEDDILLKG-AGVAGA-DRMAVYKKEIRIVKGHDVMPLRFLAPATADNVNFKVPAILRTGGTEWRIPKAGHYVDGV 311 (311) Q Consensus 237 ~i~~~~~l~~-ag~~g~-~~~v~y~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~gGv~i~~P~ai~~~dGI 311 (311) .+.....+.. ....|. ..+++-.. ..+.+.+ -|.+. .......+.++.++ ++-+++|.+|++..== T Consensus 564 pv~~s~~ip~~~~~~gd~s~~~i~~~--~~~~i~~-~~~~~-----~~~~~v~~~~~~~~-d~~v~~~~af~~~k~~ 631 (632) T protein:vir:96 564 RAEASNQIPADTWIFGDWSQIVIAMW--GVLDLKV-DPYTK-----AASDGLVLRVFQDV-DAGVRRKEAFCIAKKG 631 (632) T ss_pred ceEeccccccCcEEEeecceEEEEEe--cceEEEE-ccccc-----cccCceEEEEEeec-Cceeechhhhhheeec Confidence 1112222211 000010 11111111 1122211 01111 11223455566776 5789999988865444 No 74 >protein:vir:2344 Length: 397 # NCBI annotation: gp14 # Family: family:all:507 # MgeID: mge:51 # MgeName: Bxb1 # Cross-refs: genbank:acc:NP_075281;genbank:gi:12657868;genbank:GeneID:920118 Probab=98.05 E-value=3e-06 Score=50.89 Aligned_cols=271 Identities=10% Similarity=-0.043 Sum_probs=146.6 Q ss_pred cccccc--------chhhhhhhHHHHHHHHHHHHhhhhhhhhhhhhccccCCCCcceeEEEEEEeecccceEEecCcccc Q lcl|NC_019522. 4 SVFDVS--------PVSALSFLVNQAAHIESEIYRIEYPQFKYGTLLPLDNSAPDWAQAVMFRSIDARGELQLFGPNSTD 75 (311) Q Consensus 4 ~~~~~~--------~~~~~~fl~~~L~~id~~v~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~G~a~~~~~~a~d 75 (311) |.+++. +..+.+++..++ -.++++....+-..++++.+.. .......+.+.+....+.|++.+ .. T Consensus 1 ~g~~~e~~~~~~~~t~~~~g~l~~~~---~~~ii~~l~~~s~i~~l~~~~~---~~~~~~~ip~~~~~~~a~wv~Eg-~~ 73 (397) T protein:vir:23 1 MGFSADHSQIAQTKDTMFTGYLDPVQ---AKDYFAEAEKTSIVQRVAQKIP---MGATGIVIPHWTGDVSAQWIGEG-DM 73 (397) T ss_pred CCcCHHHHHHhhccCCCCccccchhH---HHHHHHHHHhccchhhhcceee---ccCCceEEEEEcCCcceEEecCC-cc Confidence 333332 122233455432 2456666666666666665433 22344667777777788998764 66 Q ss_pred cceeeeeccceeEEEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhhheeeeecccc-ce-eeeecCCcce Q lcl|NC_019522. 76 VPTVDIAMSQGFKDINTAALGYTYSIEEIGFAMLNNVNLDAERGQAVRDVVEQGLNKIYLLGDKGV-GE-GLYTSPNVSV 153 (311) Q Consensus 76 ip~v~~~~~~~~~~v~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aa~~~~~~~~n~~~~~G~~~~-g~-GllN~p~v~~ 153 (311) +|..+..++......+.++..+.++.+=++ ....++...-....++++.+.+|+.+++|+..- +. |+++..+... T Consensus 74 ~~~s~~~f~~v~l~~~k~~~~v~iS~ell~---ds~~~l~~~i~~~l~~aia~~~d~a~l~G~gt~~~~~~~~~~~~~~~ 150 (397) T protein:vir:23 74 KPITKGNMTKRDVHPAKIATIFVASAETVR---ANPANYLGTMRTKVATAIAMAFDNAALHGTNAPSAFQGYLDQSNKTQ 150 (397) T ss_pred ccccccceeEEEEeeEEEEEeehhhHHHHh---cchHHHHHHHHHHHHHHHHHHHHHHHhhcccCCccccccccccccee Confidence 898899999999999999999999854333 334778899999999999999999999998653 22 4444433211 Q ss_pred eeccCCccccCcccccCCHHHHHHHHHHHHHHHHhccCCceecceEEEeCHHHHHHHhcccccCCCCCcchHHHHHHHhC Q lcl|NC_019522. 154 EAATSTFVALVAAIPTNGTQPIIDFFGNAYNTVYLDNTLTVHRPNTFVLPPAQFQLLARTLLSTQNASNVTLLQFLRTNF 233 (311) Q Consensus 154 ~~~~~~~~~~~t~w~~~t~~ei~~di~~~~~~~~~~~~~~~~~p~~l~lpp~~~~~L~~~~~~~~~~~~~Tvl~~l~~n~ 233 (311) ... .. -..+++.+++.++... + ..+..++|+++.+..|.+.. +..+.-++.=-..+. T Consensus 151 ~~~------------~~---~~~~~~~~~~~~l~~~--~--~~~a~~vmn~~~~~~L~~lk----d~~G~~i~~~~~~~~ 207 (397) T protein:vir:23 151 SIS------------PN---AYQGLGVSGLTKLVTD--G--KKWTHTLLDDTVEPVLNGSV----DANGRPLFVESTYES 207 (397) T ss_pred eec------------cc---chhHHHHHHHHhhhhc--c--cCCCEEEEcHHHHHHHHHhh----ccCCceeeccccccc Confidence 111 11 1134455555555322 2 23568999999999887532 112222211000010 Q ss_pred -C----ceEEEEchhcccCC-CCcccEEEEEEc-------CcceeEEeecchhhhcc-------c---eeeCCceEEEee Q lcl|NC_019522. 234 -P----DITFEDDILLKGAG-VAGADRMAVYKK-------EIRIVKGHDVMPLRFLA-------P---ATADNVNFKVPA 290 (311) Q Consensus 234 -~----~l~i~~~~~l~~ag-~~g~~~~v~y~~-------~~~~~~~~~~~~~~~~~-------p---~~~~~~~~~~~~ 290 (311) + .-++...|-..... ..|+.. +++.+ ..+.+.+.+.....+.. + .+. + ...+.+ T Consensus 208 ~~~~~~~~tl~G~Pv~~s~~~~~g~~~-~~~gDfs~~~i~~~~~i~i~~~~e~~~~~~~~~~~~~~~lf~~-d-~v~~ra 284 (397) T protein:vir:23 208 LTTPFREGRILGRPTILSDHVAEGDVV-GYAGDFSQIIWGQVGGLSFDVTDQATLNLGSQESPNFVSLWQH-N-LVAVRV 284 (397) T ss_pred ccccccCceeeeeeEEEeCCCCCCceE-EEEeecceEEEEEEeceEEEEeeeeeeeeccccccceeeeeec-c-ceeEEE Confidence 1 11333333221111 112211 11111 11112222211111000 0 111 1 134556 Q ss_pred eeeeeeEEEECCeEEEEeecC Q lcl|NC_019522. 291 ILRTGGTEWRIPKAGHYVDGV 311 (311) Q Consensus 291 ~~~~gGv~i~~P~ai~~~dGI 311 (311) +.|+ ++.+++|.+++++++- T Consensus 285 ~~r~-d~~v~~~~a~~~~~~~ 304 (397) T protein:vir:23 285 EAEY-GLLINDVNAFVKLTFD 304 (397) T ss_pred Eeee-ccceecccceEEEeec Confidence 6776 5789999999999986 No 75 >protein:vir:80930 Length: 278 # NCBI annotation: Cps # Family: family:all:522 # MgeID: mge:1886 # MgeName: A500 # Cross-refs: genbank:acc:YP_001468392;genbank:gi:157324966;genbank:GeneID:5601363 Probab=97.98 E-value=5.7e-06 Score=49.34 Aligned_cols=268 Identities=13% Similarity=0.019 Sum_probs=140.8 Q ss_pred CCcccccccchhhhhhhHHHHHHHHHHHHhhhhhhhhhhhhccccCC--CCcceeEEEEEEeecccceEEecCcccccce Q lcl|NC_019522. 1 MAKSVFDVSPVSALSFLVNQAAHIESEIYRIEYPQFKYGTLLPLDNS--APDWAQAVMFRSIDARGELQLFGPNSTDVPT 78 (311) Q Consensus 1 ~~~~~~~~~~~~~~~fl~~~L~~id~~v~~~~~~~~~~~~~~~v~~~--~~~~~~~~~~~~~~~~G~a~~~~~~a~dip~ 78 (311) ||.++ +..+-.| .. |.+.+.|.+...+.+....+..+... +.+| .++.+..++..|.+..+.++ ++|+. T Consensus 1 Ma~~~----T~~~~~i-iP--ev~s~~v~~~~~~~~v~~~~~~~~~~l~g~~G-~tv~ip~~~~~g~a~~~~~g-~~i~~ 71 (278) T protein:vir:80 1 MADLT----TKLANLI-DP--EVMGPMISAKLPKAIKFGKIAPIDNSLEGQPG-SEITVPKYKYIGDAQDVAEG-AAIDY 71 (278) T ss_pred CCCcc----eehhhee-cH--HHHHHHHHHHHHHhhhhcccceecccccCCCC-CEEEEeeeccCCcceeecCC-CcCcc Confidence 55421 1111112 22 22333344444444455555544432 2334 56778888888999888875 57888 Q ss_pred eeeeccceeEEEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhhheeeeeccccceeeeecCCcceeeccC Q lcl|NC_019522. 79 VDIAMSQGFKDINTAALGYTYSIEEIGFAMLNNVNLDAERGQAVRDVVEQGLNKIYLLGDKGVGEGLYTSPNVSVEAATS 158 (311) Q Consensus 79 v~~~~~~~~~~v~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aa~~~~~~~~n~~~~~G~~~~g~GllN~p~v~~~~~~~ 158 (311) -++..+.....+...+.+|++ .|+++ ...+.++-......+.+.+++..|+.++-.-. |..+. . T Consensus 72 ~~lt~~~~~~~i~~~~~a~~v--~D~~~-~~~~~d~~~~~~~~~a~~~a~~~d~~l~~~l~----~a~~~---------~ 135 (278) T protein:vir:80 72 SALETESVKHGIKKAGKGVKL--TDESV-LSGYGDPVEEAQKQIRMAIASKVDNDILEEAL----TTTLE---------V 135 (278) T ss_pred cccccceeeEeeehhhccccc--cHHHH-hhccccHHHHHHHHHHHHHHHHHHHHHHHHHh----ccccc---------c Confidence 888888888888887666554 55554 44577777888889999999999987663221 11000 0 Q ss_pred CccccCcccccCCHHHHHHHHHHHHHHHHhccCCceecceEEEeCHHHHHHHhccc----ccCCCCCcchHH-HHHHHhC Q lcl|NC_019522. 159 TFVALVAAIPTNGTQPIIDFFGNAYNTVYLDNTLTVHRPNTFVLPPAQFQLLARTL----LSTQNASNVTLL-QFLRTNF 233 (311) Q Consensus 159 ~~~~~~t~w~~~t~~ei~~di~~~~~~~~~~~~~~~~~p~~l~lpp~~~~~L~~~~----~~~~~~~~~Tvl-~~l~~n~ 233 (311) + .+....+.+..++.+.++..++... .+..+..|+++|..+..|.+-. ... ...+..++ .-....+ T Consensus 136 ~-----~~~t~~~~~~~~~~~~da~~~l~~~---~~~~~~~ivv~p~~~~~L~k~~~~~~~~~-~~~g~~~~~~G~ig~~ 206 (278) T protein:vir:80 136 K-----GAINIGLIDKIENTFTDAPDAIEDE---SITTTGVLFLNYKDTAKLREEAAGSWTKA-SQLGDDLLVKGAFGEL 206 (278) T ss_pred c-----cccccchhhhHHHHHHHHHHhhccc---CCCcccEEEECHHHHHHHHhhhhhhcccc-ccccccceeeccceee Confidence 0 0111223445566677766665332 2344557999999998885421 111 11111110 0000011 Q ss_pred CceEEEEchhcccCCCCcccEEEEEEcCcceeEEeecchhhhccceeeCCceEEEeeeeeeeeEEEECCeEEEEeecC Q lcl|NC_019522. 234 PDITFEDDILLKGAGVAGADRMAVYKKEIRIVKGHDVMPLRFLAPATADNVNFKVPAILRTGGTEWRIPKAGHYVDGV 311 (311) Q Consensus 234 ~~l~i~~~~~l~~ag~~g~~~~v~y~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~gGv~i~~P~ai~~~dGI 311 (311) -.++|+..+.+.. ...+++.+ .-+.+...++.+...-.........+..... .|+-+.+|.+++.+.-- T Consensus 207 ~G~~Vi~s~~~p~------~t~~l~~~--gAi~~~~~~~~~vE~~Rd~~~~~d~i~~~~~-yg~~v~~~~~~v~it~~ 275 (278) T protein:vir:80 207 LGWEIVRTKKLAD------GNALAVKA--GALKTFLKRNLLAESGRDMDHKLTKFNADQH-YAVALVDETKAVKVVPV 275 (278) T ss_pred cceeEEEcCCCCc------ceEEEEec--cceeeeecCCcccccccchhhccceeeeeeE-EEEEEEcCcceEEEeec Confidence 1234544444421 22333332 3344433344332211111222233333333 48999999999999877 No 76 >protein:vir:9410 Length: 415 # NCBI annotation: head protein # Family: family:all:21 # MgeID: mge:167 # MgeName: phi 13 # Cross-refs: genbank:acc:NP_803388;genbank:gi:29028700;genbank:GeneID:1258136 Probab=97.95 E-value=2.7e-06 Score=51.15 Aligned_cols=279 Identities=11% Similarity=-0.039 Sum_probs=144.6 Q ss_pred CC--cccccccchhhhhhhHHHHHHHHHHHHhhhhhhhhhhhhccccCCCCcceeEEEEEEeecccceEEecCcccccce Q lcl|NC_019522. 1 MA--KSVFDVSPVSALSFLVNQAAHIESEIYRIEYPQFKYGTLLPLDNSAPDWAQAVMFRSIDARGELQLFGPNSTDVPT 78 (311) Q Consensus 1 ~~--~~~~~~~~~~~~~fl~~~L~~id~~v~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~G~a~~~~~~a~dip~ 78 (311) .. ...-.....+++++++. +.+.+.|++........++++.+.. .+-+...+.+......+.+.+++.+ ..+|. T Consensus 114 ~~~~~~~~~~~~~~~g~~~iP--~~~~~~ii~~~~~~~~l~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~v~Eg-~~~~~ 189 (415) T protein:vir:94 114 ETRNDIQGGSLKTDSGFVVIP--EEIVTDILKLKEVEFNLDKYVTVKR-VTNGSGKYPVVRQSEVAALEKVEEL-EENPE 189 (415) T ss_pred hhhhhhhhhccccccccccCc--HHHHHHHHHHHHhhhhhhhhcceee-ccCCceeEEEEeecCCccceecccc-ccccc Confidence 00 00001111223344443 4577788888888777777776543 2223333444444455677777665 34564 Q ss_pred e-eeeccceeEEEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhhheeeeeccccc-e-eeeecCCcceee Q lcl|NC_019522. 79 V-DIAMSQGFKDINTAALGYTYSIEEIGFAMLNNVNLDAERGQAVRDVVEQGLNKIYLLGDKGVG-E-GLYTSPNVSVEA 155 (311) Q Consensus 79 v-~~~~~~~~~~v~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aa~~~~~~~~n~~~~~G~~~~g-~-GllN~p~v~~~~ 155 (311) . ...++.....++.++..+.+|.+ +. ..+..++.+.-....++++.+.+|+.+++|+.... . ++.+....... T Consensus 190 ~~~~~~~~i~~~~~k~~~~~~is~e-ll--~ds~~~~~~~i~~~l~~~~~~~~~~~il~g~g~g~~~~~~~~~~~~~~~- 265 (415) T protein:vir:94 190 LAVKPFFQLAYDINTHRGYFRISRE-AI--EDAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGFEKEGKK- 265 (415) T ss_pred cccccceeeEeeheeeeeechhhHH-HH--hhchHHHHHHHHHHHHHHHHHHHHHHHhhccccCccccccccccccccc- Confidence 4 35688899999999988888854 32 23456788888899999999999999999876532 2 22221111100 Q ss_pred ccCCccccCcccccCCHHHHHHHHHHHHHHHHhccCCceecceEEEeCHHHHHHHhcccccCCCCCcchHHH-HHHHhCC Q lcl|NC_019522. 156 ATSTFVALVAAIPTNGTQPIIDFFGNAYNTVYLDNTLTVHRPNTFVLPPAQFQLLARTLLSTQNASNVTLLQ-FLRTNFP 234 (311) Q Consensus 156 ~~~~~~~~~t~w~~~t~~ei~~di~~~~~~~~~~~~~~~~~p~~l~lpp~~~~~L~~~~~~~~~~~~~Tvl~-~l~~n~~ 234 (311) ...+ ...+ ++||.+++..+... + ..+..++|+|+.+..|.+.. +..|.-++. -+....+ T Consensus 266 ~~~~--------~~~~----~~~i~~~~~~~~~~--~--~~~~~~vmn~~~~~~l~~lk----d~~G~~l~~~~~~~~~~ 325 (415) T protein:vir:94 266 LEVK--------KAKS----LDDIKDAINLNVKP--N--YEHNVAIVSQTMFAKLDKMK----DKLGNYLIQPDVKEKTQ 325 (415) T ss_pred cccc--------cccc----hHHHHHHHHhhhhh--c--cCCCEEEEcHHHHHHHHHhh----ccCCCeeeccCcCCCCC Confidence 0000 0112 56777777776432 2 23678999999999996532 111221110 0000000 Q ss_pred ----ceEEEEchhcccCCCCcccEEEEEEcCcceeEEeecchhhhccceeeCCceEEEeeeeeeeeEEEECCeEEEEeec Q lcl|NC_019522. 235 ----DITFEDDILLKGAGVAGADRMAVYKKEIRIVKGHDVMPLRFLAPATADNVNFKVPAILRTGGTEWRIPKAGHYVDG 310 (311) Q Consensus 235 ----~l~i~~~~~l~~ag~~g~~~~v~y~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~gGv~i~~P~ai~~~dG 310 (311) ...++.++.+. .+..|... ++|-+=.+.+-+..-..+++..- ...........+.|+ ++.+.+|.|+++++- T Consensus 326 ~~l~G~pV~~~~~~~-~~~~~~~~-i~~gd~~~~~~~~~~~~~~v~~~-~~~~~~~~~r~~~r~-d~~~~~~~a~~~~~~ 401 (415) T protein:vir:94 326 QRLLGAKIEILPDEV-LGQKGNNT-LIIGNLKDAIVLFDRSQYQASWT-DYMHFGECLMIAVRQ-DCRILDYKSAIVIEY 401 (415) T ss_pred ceecceeeEEecccc-cCCCCccE-EEEEehhccEEEEeecceEEEEe-ccccCceEEEEEEEe-ccEEeccccEEEEEE Confidence 11233333322 12233333 33332222222222222322210 101111223445676 577888999999975 Q ss_pred C Q lcl|NC_019522. 311 V 311 (311) Q Consensus 311 I 311 (311) - T Consensus 402 ~ 402 (415) T protein:vir:94 402 D 402 (415) T ss_pred e Confidence 5 No 77 >protein:vir:8843 Length: 317 # NCBI annotation: major head protein # Family: family:all:3919 # MgeID: mge:158 # MgeName: PaP3 # Cross-refs: genbank:acc:NP_775251;genbank:gi:27476049;genbank:GeneID:2700597 Probab=97.90 E-value=9.5e-07 Score=53.61 Aligned_cols=286 Identities=10% Similarity=0.031 Sum_probs=144.2 Q ss_pred CCcccccccchhhhhhhHHHHHHHHHHHHhhhhhhhhhhhhccccCCCCcceeEEEEEEeecccceEEecCcccccceee Q lcl|NC_019522. 1 MAKSVFDVSPVSALSFLVNQAAHIESEIYRIEYPQFKYGTLLPLDNSAPDWAQAVMFRSIDARGELQLFGPNSTDVPTVD 80 (311) Q Consensus 1 ~~~~~~~~~~~~~~~fl~~~L~~id~~v~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~G~a~~~~~~a~dip~v~ 80 (311) ||.|++--.+-...+- . +-+...|+..--.+.-...++. +.......+.|...+.....+.....+.|.|... T Consensus 1 ma~~~~~~~t~~~~g~-~---~dl~~~I~~isp~dTPf~S~i~---~~~a~~~~~~W~~d~l~~~~~~~~~EG~da~~~~ 73 (317) T protein:vir:88 1 MATPTNAVSTVEINGK-R---EDLIDIIYNIAPYDTPFMSAIG---KGVATAITHEWQTDELRQPGKNTRVEGEDATIKA 73 (317) T ss_pred CCccccceEeeeeeee-e---echhhhheecCCccCcceeeec---CceecccEEEEEeeecCCccccccccCccccccc Confidence 8888775433222210 1 1112223222111111122332 2344555666665554433332222223333322 Q ss_pred eec---cceeEEEEEEEEEEEecHHHHHHHHHhCC-ChHHHHHHHHHHHHHHhhhheeeeeccc---------cce-eee Q lcl|NC_019522. 81 IAM---SQGFKDINTAALGYTYSIEEIGFAMLNNV-NLDAERGQAVRDVVEQGLNKIYLLGDKG---------VGE-GLY 146 (311) Q Consensus 81 ~~~---~~~~~~v~~~~~~~~~~~~El~~a~~~g~-~l~~~k~~aa~~~~~~~~n~~~~~G~~~---------~g~-Gll 146 (311) ... ..-..+|++=...+..+.+-. ...|+ ++.+.....+...+.+.++..+++|.+. +.. ||+ T Consensus 74 ~~~r~~~~N~tQIf~k~v~VSgTa~av---~~~G~~~ela~q~~kk~~EikrdmE~~li~g~~a~~~~~~t~~r~~~Gl~ 150 (317) T protein:vir:88 74 GSFTTMLNNYCQISDETLQVTGTADRV---KKAGRKNELAYQLAKKSKELKLDMEYALVGAPQAKVQRNTTTPGQMANIF 150 (317) T ss_pred ccCCEEeccEEEEEEeEEEEeehhhhh---hhcCccchhHHHHHHHHHHHHHHHHHHHhcCeeeccCCCCccchhhhhHH Confidence 211 111334565555666665543 33453 5556666677778888888888888643 111 655 Q ss_pred ec---CCcceeeccCCccccCcccccCCHHHHH-HHHHHHHHHHHhccCCceecceEEEeCHHHHHHHhcccccC----- Q lcl|NC_019522. 147 TS---PNVSVEAATSTFVALVAAIPTNGTQPII-DFFGNAYNTVYLDNTLTVHRPNTFVLPPAQFQLLARTLLST----- 217 (311) Q Consensus 147 N~---p~v~~~~~~~~~~~~~t~w~~~t~~ei~-~di~~~~~~~~~~~~~~~~~p~~l~lpp~~~~~L~~~~~~~----- 217 (311) +. -++..........+++..|...|+..+- ++|++++.++|..+ + .|+.+.++|..-..|++-+-.. T Consensus 151 ~~i~t~~~~~~~g~~~~~~~~~~~t~~t~~~lte~~l~~~l~~i~~~G-g---~~~~i~v~a~~k~~i~~~~~~~~~~i~ 226 (317) T protein:vir:88 151 AYYKTNGSLGANGVAPVGDGSNTGTAGDLRLLTEDMLLNASESIWRNG-G---QANSIQTSSSIKKAISKNMKGRATEIT 226 (317) T ss_pred HHhccCceeccCccccccCCCccccccccccccHHHHHHHHHHHHhcC-C---CCCEEEeChHHHHHHHHHhcCCceeEE Confidence 43 1221111111112223334444444333 55889999999754 3 3678999999888886542110 Q ss_pred ----CCCCcchHHHHHHHhCCceEEEEchhcccCCCCcccEEEEEEcCcceeEEeecchhhhccceeeCCceEEEeeeee Q lcl|NC_019522. 218 ----QNASNVTLLQFLRTNFPDITFEDDILLKGAGVAGADRMAVYKKEIRIVKGHDVMPLRFLAPATADNVNFKVPAILR 293 (311) Q Consensus 218 ----~~~~~~Tvl~~l~~n~~~l~i~~~~~l~~ag~~g~~~~v~y~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~ 293 (311) ....+.+|-.|. .+|=.++|+..+.|.. +.++ .-|++++++....|+.. .++-+-+...+.-.+.. T Consensus 227 ~~~~~~~~g~~v~~~~-tdfG~v~ii~~r~lp~------~~~~--~~D~~~~~l~~Lr~~~~-e~laKtGd~~k~~i~~E 296 (317) T protein:vir:88 227 LDASDNRIAQTVDVYE-SDFGKYTIRANRWFHE------NTLF--VFDPKMHSLCYLRPFFQ-HELAKTGDSEKRQLLVE 296 (317) T ss_pred EcccCeEEEEEEEEEE-eCCeEEEEEeCCCCCC------CeEE--EEcccccceeeccccee-eccCCCcccceeEEEEE Confidence 001111221211 1233467777777753 4444 44567888876666533 24444444444444455 Q ss_pred eeeEEEECCeEEEEeecC Q lcl|NC_019522. 294 TGGTEWRIPKAGHYVDGV 311 (311) Q Consensus 294 ~gGv~i~~P~ai~~~dGI 311 (311) .|++++-|.+.+.+.|| T Consensus 297 -~tLe~~N~~a~a~i~~l 313 (317) T protein:vir:88 297 -YTFRVNNEKSGALIRDV 313 (317) T ss_pred -EEEEEcCccceeEEEEe Confidence 48999999999999999 No 78 >protein:vir:3158 Length: 321 # NCBI annotation: capsid protein gpE # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:316 # MgeName: PhiCh1 # Cross-refs: genbank:acc:NP_665929;genbank:gi:22091115;genbank:GeneID:951342 Probab=97.89 E-value=8.7e-06 Score=48.34 Aligned_cols=282 Identities=9% Similarity=0.006 Sum_probs=132.7 Q ss_pred CCcccccc------------cchhhhhhhHHHHHHHHHHHHhhhhhhhhhhhhccccCCCCcceeEEEEEEeecccceEE Q lcl|NC_019522. 1 MAKSVFDV------------SPVSALSFLVNQAAHIESEIYRIEYPQFKYGTLLPLDNSAPDWAQAVMFRSIDARGELQL 68 (311) Q Consensus 1 ~~~~~~~~------------~~~~~~~fl~~~L~~id~~v~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~G~a~~ 68 (311) |+.-.|+. .+..+.+|+.. ..+-.++++.....-..++.+.+..... .... ....+..|.+.+ T Consensus 1 ~~~k~~~~~l~~~~~~~~~~~~~~~~g~~v~--~~~~~~l~~~i~e~s~~l~~i~v~~v~~-~~~~--i~~~~~~~~~~~ 75 (321) T protein:vir:31 1 MASRTINNDLSRITEKNALTVDDLDAGGTLP--DPLWDEFWTDMIEETPLLDAIRTETVGA-KKTR--IPTLNIGERHRR 75 (321) T ss_pred CchHHHHHHHHHHHHhccccccccCCcceeC--HHHHHHHHHHHHHhhhhhhhceeeeccC-ccee--eeeeccCCcccc Confidence 44444332 12233345552 1222333333333323344444332111 1111 111222222223 Q ss_pred ecCc-ccccceeeeeccceeEEEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhhheeeeeccccc----- Q lcl|NC_019522. 69 FGPN-STDVPTVDIAMSQGFKDINTAALGYTYSIEEIGFAMLNNVNLDAERGQAVRDVVEQGLNKIYLLGDKGVG----- 142 (311) Q Consensus 69 ~~~~-a~dip~v~~~~~~~~~~v~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aa~~~~~~~~n~~~~~G~~~~g----- 142 (311) .+.. ....+..+...+......+........+.+-|+. ...+.++...-....+++++..+++++|+|+.... T Consensus 76 ~~~e~~~~~~~~~~~~~~~~~~~~k~~~~~~it~e~L~d-~a~~~d~e~~i~~~ia~~~a~~~~~~~~nGd~~~~~~~~~ 154 (321) T protein:vir:31 76 PQDEGEWNENESDVSTGTIDISTEKATVAWDLPREVVQE-NPEGEALADRILNLMTDAWSADVEDLAANGDEDAEDSFEN 154 (321) T ss_pred cccccccccccccceeeeeeeeeEEEEeehhccHHHHHh-hhcchhHHHHHHHHHHHHHHHHHHhheeeccccCCCcccc Confidence 3322 2233344555666777788888887888665554 34577899999999999999999999999986532 Q ss_pred --eeeeecCCcceeeccCCccccCcccccCCHHHHHHHHHHHHHHHHhccCCceecc-eEEEeCHHHHHHHhcccccCCC Q lcl|NC_019522. 143 --EGLYTSPNVSVEAATSTFVALVAAIPTNGTQPIIDFFGNAYNTVYLDNTLTVHRP-NTFVLPPAQFQLLARTLLSTQN 219 (311) Q Consensus 143 --~GllN~p~v~~~~~~~~~~~~~t~w~~~t~~ei~~di~~~~~~~~~~~~~~~~~p-~~l~lpp~~~~~L~~~~~~~~~ 219 (311) .|+|+.+.-...+... ++ .+.+. +++.+++..|-. .+. ..+ ...+|+++.+..+.+....... T Consensus 155 ~n~G~l~~a~~~~~~~~~--~~-----~~~~~----d~l~~l~~~l~~--~yr-~~~~~v~im~~~~~~~~~~~l~~~~~ 220 (321) T protein:vir:31 155 QNDGFITVAEGDVETIDA--AD-----DILDN----DLVIRTIAGLDS--KYR-ARMNPALIVSEDQLLSYHYTLTDRDT 220 (321) T ss_pred cchhhhhhhccccccccc--cc-----cccCH----HHHHHHHHhccH--hHh-cCCCeEEEechHHHHHHHHHHhcCCC Confidence 3666543211111110 00 01122 334445554421 121 122 2567888877655443332211 Q ss_pred CCcchHHHHH-HHhCCceEEEEchhcccCCCCcccEEEEEEcCcceeEEeecchhhhcc-----ceeeCCceEEEeeeee Q lcl|NC_019522. 220 ASNVTLLQFL-RTNFPDITFEDDILLKGAGVAGADRMAVYKKEIRIVKGHDVMPLRFLA-----PATADNVNFKVPAILR 293 (311) Q Consensus 220 ~~~~Tvl~~l-~~n~~~l~i~~~~~l~~ag~~g~~~~v~y~~~~~~~~~~~~~~~~~~~-----p~~~~~~~~~~~~~~~ 293 (311) ..+...+.-- ....-.+.++.+|.+.. +. +++ -+.+++.+.+....++.. +...+..++..-++.. T Consensus 221 ~~~~~~l~~~~~~tl~G~pvv~~~~mP~------~~-il~-t~~~nl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 292 (321) T protein:vir:31 221 PLGDNVIMGEADVNPFSFPIIGSGLWPD------DK-AMF-TDPQNLIYALYRDLEIDVLTESDKVSERDLHARYFMRGD 292 (321) T ss_pred ccccchhhccccccccceeEEEcCCCCC------Cc-EEE-eccccEEEEEeeccEEEEeecCccccccceeeEeeeeee Confidence 1112221100 01122345556665543 11 222 234455443333332211 1111233444444445 Q ss_pred eeeEEEECCeEEEEeecC Q lcl|NC_019522. 294 TGGTEWRIPKAGHYVDGV 311 (311) Q Consensus 294 ~gGv~i~~P~ai~~~dGI 311 (311) + +..|..+.+++.+.|| T Consensus 293 ~-~~~ve~~~a~a~~~~i 309 (321) T protein:vir:31 293 D-DFAIENTEAVVLAEGL 309 (321) T ss_pred c-ceeEeccccEEEEecC Confidence 4 6788999999999999 No 79 >protein:vir:4159 Length: 315 # NCBI annotation: structural protein # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:87 # MgeName: psiM2 # Cross-refs: genbank:acc:NP_046968;genbank:gi:9630538;genbank:GeneID:1261712 Probab=97.89 E-value=8.5e-06 Score=48.39 Aligned_cols=287 Identities=14% Similarity=0.037 Sum_probs=140.6 Q ss_pred CCcccccccchhhhhhhHHHHHHHHHHHHhhhhhhhhhhhhccccCCCCcceeEE---EEEEeecccceEEecCcccccc Q lcl|NC_019522. 1 MAKSVFDVSPVSALSFLVNQAAHIESEIYRIEYPQFKYGTLLPLDNSAPDWAQAV---MFRSIDARGELQLFGPNSTDVP 77 (311) Q Consensus 1 ~~~~~~~~~~~~~~~fl~~~L~~id~~v~~~~~~~~~~~~~~~v~~~~~~~~~~~---~~~~~~~~G~a~~~~~~a~dip 77 (311) -...++.. ++.++++|..+- .+ ++++.....-..++++.+.+........+ .+.+--..| ..+.+. ..+.+ T Consensus 14 ~~~k~~t~-~d~~Gg~l~P~~--~~-~~i~~~~e~s~~l~~~~vi~~~~~~~~~i~~~g~~~~~~~g-~~~~~~-~~~~~ 87 (315) T protein:vir:41 14 EIVPKIDV-PDLGRGVLSVDR--FG-EFVKAVRDSAVIIPEARIDNALKSYEKDISRLSLVLDVGPG-RDETGQ-KLAPP 87 (315) T ss_pred hhhhhcCC-cCCCCceechHH--HH-HHHHHHHhhhhhhhhceeeeccccccccccccccCcccccc-cccccC-cCCCC Confidence 11233433 344566666421 22 24444433344555555432221111111 110000011 112222 23345 Q ss_pred eeeeeccceeEEEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhhheeeeecccc-------ceeeeecCC Q lcl|NC_019522. 78 TVDIAMSQGFKDINTAALGYTYSIEEIGFAMLNNVNLDAERGQAVRDVVEQGLNKIYLLGDKGV-------GEGLYTSPN 150 (311) Q Consensus 78 ~v~~~~~~~~~~v~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aa~~~~~~~~n~~~~~G~~~~-------g~GllN~p~ 150 (311) ..+..++......+.+..-...+.+-|+. ...|.++.+.-.....+++...++...|+||... ..|+|+... T Consensus 88 ~~~~~f~~~~l~~~~l~~~~~it~elL~D-~~~~~~~e~~l~~~~a~~~a~~~~~~~~nGdg~s~~p~~~~~~G~l~~a~ 166 (315) T protein:vir:41 88 ESTAEVKTNTLYMREMVTKVVIHEDAIED-NIEGKAFEQKIVTLLGEGISYVLEKYYLHGDTSSSDPLLRMSDGWLKLAS 166 (315) T ss_pred CCccccceeeeceeeeeeeccccHHHHHh-hhccccHHHHHHHHHHHHHHHHHHHHhhccCCcCcCccccccccceeccc Confidence 55566777777777777777777666664 4457899999999999999999999999998642 248888765 Q ss_pred cceeeccCCccccCcccccCCHHHHHHHHHHHHHHHHhccCCceecceEEEeCHHHHHHHhcccccCCCCCcchHHHHHH Q lcl|NC_019522. 151 VSVEAATSTFVALVAAIPTNGTQPIIDFFGNAYNTVYLDNTLTVHRPNTFVLPPAQFQLLARTLLSTQNASNVTLLQFLR 230 (311) Q Consensus 151 v~~~~~~~~~~~~~t~w~~~t~~ei~~di~~~~~~~~~~~~~~~~~p~~l~lpp~~~~~L~~~~~~~~~~~~~Tvl~~l~ 230 (311) ..+.....++.. ...+.+.+.|+...+..-++.+. ..-..+|+.+.+..+.+-.-+ . +.-+++=.. T Consensus 167 ~~~~~~~~~~~a------~~~~~d~l~~l~~sl~~~yr~~~----~~~~~imn~~t~~~~rklk~~-~---g~~lw~~~~ 232 (315) T protein:vir:41 167 EKLTESDVDPEA------EDWPMNLFDTMIESLPTPYRNNL----PNMKFYVTWDIYRAYRDALKG-R---ETGLGDQAL 232 (315) T ss_pred cccccccccccc------ccccHHHHHHHHHhcChHHhhcC----CceEEEEcHHHHHHHHHHhcc-C---CCccccchh Confidence 433322222111 11122333333333222222211 123688899888766443211 1 112222111 Q ss_pred HhCCce-----EEEEchhcccCCCCcccEEEEEEcCcceeEEeecchhhhccceeeCCceEEEeeeeeeeeEEEECCeEE Q lcl|NC_019522. 231 TNFPDI-----TFEDDILLKGAGVAGADRMAVYKKEIRIVKGHDVMPLRFLAPATADNVNFKVPAILRTGGTEWRIPKAG 305 (311) Q Consensus 231 ~n~~~l-----~i~~~~~l~~ag~~g~~~~v~y~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~gGv~i~~P~ai 305 (311) ..+-+. -++.++.+...+. +. ..+++.+ .+++-+.+...++...-...+.-.+.+-...|+++-.+-...++ T Consensus 233 ~~g~~~tl~G~PV~~~~~m~~~~~-~~-~~ilf~d-~~nl~~~~~~~i~i~~~~~a~~~~~~~~~~~r~d~~~~~~~~~a 309 (315) T protein:vir:41 233 TGANSILYDGRPVQYVPALEALND-GK-SRALFVV-PTQLVYGFWRNIKVVPDYDAEMRLTKYVASLRTDNHYEDEEGAV 309 (315) T ss_pred hcCCCceecccceEecccccccCC-CC-ccEEEec-ccceEEEeccccEEEeeecCCCCceEEEEEEEeceeEEecccee Confidence 222222 2445555543332 21 2344444 44544444444444321122223345555677776666677888 Q ss_pred EEeecC Q lcl|NC_019522. 306 HYVDGV 311 (311) Q Consensus 306 ~~~dGI 311 (311) +.+..| T Consensus 310 ~~~~~v 315 (315) T protein:vir:41 310 SATITV 315 (315) T ss_pred EeeeeC Confidence 888888 No 80 >protein:vir:102873 Length: 392 # NCBI annotation: major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1492 # MgeName: Cherry # Cross-refs: genbank:acc:YP_338137;genbank:gi:77020198;genbank:GeneID:3703782 Probab=97.88 E-value=1.1e-05 Score=47.72 Aligned_cols=269 Identities=9% Similarity=0.013 Sum_probs=141.2 Q ss_pred CCcccccccchhhhhhhHHHHHHHHHHHHhhhhhhhhhhhhccccCCCCcceeEEEEEEeecccceEEecCcccccceee Q lcl|NC_019522. 1 MAKSVFDVSPVSALSFLVNQAAHIESEIYRIEYPQFKYGTLLPLDNSAPDWAQAVMFRSIDARGELQLFGPNSTDVPTVD 80 (311) Q Consensus 1 ~~~~~~~~~~~~~~~fl~~~L~~id~~v~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~G~a~~~~~~a~dip~v~ 80 (311) ..+..+...+.+++++++. +.+.+.|++.....-..+.++.+.. .+...-.+.+......+.+.|++.++. +|..+ T Consensus 101 ~~~~~~~~~t~~~gg~~vP--~~~~~~ii~~~~~~s~l~~~~~~~~-~~~~~~~~~~~~~~~~~~a~~v~E~~~-~~~~~ 176 (392) T protein:vir:10 101 LEQRAMSGLTGEDGGLVIP--QDIQTQINELARSFDALEQYVTVEP-VRTRSGSRVLEKNSDMIPFAEITEMGE-IPETD 176 (392) T ss_pred hhhhhccccccCCCceecc--hhHHHHHHHHHHhhhhhhhhceeee-ccCCceeEEEEeecCCccceeeccccc-ccccc Confidence 2223333333345566665 4566778887777666666665432 111222233334444556778877644 55443 Q ss_pred -eeccceeEEEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhhheeeeeccccceeeeecCCcceeeccCC Q lcl|NC_019522. 81 -IAMSQGFKDINTAALGYTYSIEEIGFAMLNNVNLDAERGQAVRDVVEQGLNKIYLLGDKGVGEGLYTSPNVSVEAATST 159 (311) Q Consensus 81 -~~~~~~~~~v~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aa~~~~~~~~n~~~~~G~~~~g~GllN~p~v~~~~~~~~ 159 (311) ..++......+.++..+.+|.+=|+. +..+|.+--.....+++.+.+|..++.|+...+. + T Consensus 177 ~~~~~~v~l~~~k~~~~~~iS~ell~d---s~~~l~~~i~~~l~~~i~~~~d~~~~~g~g~~~~-----~---------- 238 (392) T protein:vir:10 177 NPKFSNVQYAVKDRAGILPLSRSLLQD---SDQNILKYVTKWLGKKSKVTRNVLILGVIEKLTK-----Q---------- 238 (392) T ss_pred cccceeEEeeeeeEEEeehhhHHHHhh---hHHHHHHHHHHHHHHHHHHHHHHHHhhccccccc-----c---------- Confidence 56788888899999999998654443 3467888889999999999999999888764210 0 Q ss_pred ccccCcccccCCHHHHHHHHHHHHHHHHhccCCceecceEEEeCHHHHHHHhcccccCCCCCcchHHHH-HHH------- Q lcl|NC_019522. 160 FVALVAAIPTNGTQPIIDFFGNAYNTVYLDNTLTVHRPNTFVLPPAQFQLLARTLLSTQNASNVTLLQF-LRT------- 231 (311) Q Consensus 160 ~~~~~t~w~~~t~~ei~~di~~~~~~~~~~~~~~~~~p~~l~lpp~~~~~L~~~~~~~~~~~~~Tvl~~-l~~------- 231 (311) + ..+. +||.++++.... ..+ ...-.++|+|+.+..|.+-. +..|.-++.- +.. T Consensus 239 ---~-----~~~~----d~i~~~~~~~l~-~~~--~~~a~~vm~~~~~~~L~~lk----d~~G~~l~~~~~~~~~~~tll 299 (392) T protein:vir:10 239 ---A-----IKSL----DDIKDVLNVKLD-PAI--SPNAILLTNQDGFNYLDKLK----DKDGKYILQSDPTQKNKKLFA 299 (392) T ss_pred ---C-----ccCH----HHHHHHHHHhhh-hhh--ccCCEEEEcHHHHHHHHHhh----ccCCCeEeecCccCCcccccc Confidence 0 0122 344444432211 111 12346899999999986521 1111111100 000 Q ss_pred hCCceEEEEchhcccCCCCcccEEEEEEcCcceeEEeecchhhhc-ccee---eCCceEEEeeeeeeeeEEEECCeEEEE Q lcl|NC_019522. 232 NFPDITFEDDILLKGAGVAGADRMAVYKKEIRIVKGHDVMPLRFL-APAT---ADNVNFKVPAILRTGGTEWRIPKAGHY 307 (311) Q Consensus 232 n~~~l~i~~~~~l~~ag~~g~~~~v~y~~~~~~~~~~~~~~~~~~-~p~~---~~~~~~~~~~~~~~gGv~i~~P~ai~~ 307 (311) .+|.+.+.....+...+.+..+..++|-+=.+.+.+..-..+++. .+.. ...-...+.++.|++ +.+++|.+|+. T Consensus 300 G~~~v~~~~~~~~~~~~~~~~~~~~~~gdfs~~~~i~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d-~~v~~~~a~~~ 378 (392) T protein:vir:10 300 GTNPVVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKREDMELASTDVGGKAFTRNTLDLRAIQRDD-VQMWDNEAAVY 378 (392) T ss_pred CcccEEEecccccCCCcccCCceEEEEEehhceEEEEeecceEEEEeccccchhhcCceEEEEEEeec-cEEecccceEE Confidence 012222222222332333323333444332232222111122111 0110 001124577788884 68899999999 Q ss_pred eecC Q lcl|NC_019522. 308 VDGV 311 (311) Q Consensus 308 ~dGI 311 (311) ++.= T Consensus 379 l~~~ 382 (392) T protein:vir:10 379 GEID 382 (392) T ss_pred EEec Confidence 8876 No 81 >protein:vir:105004 Length: 392 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:1490 # MgeName: W Beta # Cross-refs: genbank:acc:YP_459969;genbank:gi:85701384;genbank:GeneID:3882145 Probab=97.88 E-value=1.1e-05 Score=47.72 Aligned_cols=269 Identities=9% Similarity=0.013 Sum_probs=141.2 Q ss_pred CCcccccccchhhhhhhHHHHHHHHHHHHhhhhhhhhhhhhccccCCCCcceeEEEEEEeecccceEEecCcccccceee Q lcl|NC_019522. 1 MAKSVFDVSPVSALSFLVNQAAHIESEIYRIEYPQFKYGTLLPLDNSAPDWAQAVMFRSIDARGELQLFGPNSTDVPTVD 80 (311) Q Consensus 1 ~~~~~~~~~~~~~~~fl~~~L~~id~~v~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~G~a~~~~~~a~dip~v~ 80 (311) ..+..+...+.+++++++. +.+.+.|++.....-..+.++.+.. .+...-.+.+......+.+.|++.++. +|..+ T Consensus 101 ~~~~~~~~~t~~~gg~~vP--~~~~~~ii~~~~~~s~l~~~~~~~~-~~~~~~~~~~~~~~~~~~a~~v~E~~~-~~~~~ 176 (392) T protein:vir:10 101 LEQRAMSGLTGEDGGLVIP--QDIQTQINELARSFDALEQYVTVEP-VRTRSGSRVLEKNSDMIPFAEITEMGE-IPETD 176 (392) T ss_pred hhhhhccccccCCCceecc--hhHHHHHHHHHHhhhhhhhhceeee-ccCCceeEEEEeecCCccceeeccccc-ccccc Confidence 2223333333345566665 4566778887777666666665432 111222233334444556778877644 55443 Q ss_pred -eeccceeEEEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhhheeeeeccccceeeeecCCcceeeccCC Q lcl|NC_019522. 81 -IAMSQGFKDINTAALGYTYSIEEIGFAMLNNVNLDAERGQAVRDVVEQGLNKIYLLGDKGVGEGLYTSPNVSVEAATST 159 (311) Q Consensus 81 -~~~~~~~~~v~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aa~~~~~~~~n~~~~~G~~~~g~GllN~p~v~~~~~~~~ 159 (311) ..++......+.++..+.+|.+=|+. +..+|.+--.....+++.+.+|..++.|+...+. + T Consensus 177 ~~~~~~v~l~~~k~~~~~~iS~ell~d---s~~~l~~~i~~~l~~~i~~~~d~~~~~g~g~~~~-----~---------- 238 (392) T protein:vir:10 177 NPKFSNVQYAVKDRAGILPLSRSLLQD---SDQNILKYVTKWLGKKSKVTRNVLILGVIEKLTK-----Q---------- 238 (392) T ss_pred cccceeEEeeeeeEEEeehhhHHHHhh---hHHHHHHHHHHHHHHHHHHHHHHHHhhccccccc-----c---------- Confidence 56788888899999999998654443 3467888889999999999999999888764210 0 Q ss_pred ccccCcccccCCHHHHHHHHHHHHHHHHhccCCceecceEEEeCHHHHHHHhcccccCCCCCcchHHHH-HHH------- Q lcl|NC_019522. 160 FVALVAAIPTNGTQPIIDFFGNAYNTVYLDNTLTVHRPNTFVLPPAQFQLLARTLLSTQNASNVTLLQF-LRT------- 231 (311) Q Consensus 160 ~~~~~t~w~~~t~~ei~~di~~~~~~~~~~~~~~~~~p~~l~lpp~~~~~L~~~~~~~~~~~~~Tvl~~-l~~------- 231 (311) + ..+. +||.++++.... ..+ ...-.++|+|+.+..|.+-. +..|.-++.- +.. T Consensus 239 ---~-----~~~~----d~i~~~~~~~l~-~~~--~~~a~~vm~~~~~~~L~~lk----d~~G~~l~~~~~~~~~~~tll 299 (392) T protein:vir:10 239 ---A-----IKSL----DDIKDVLNVKLD-PAI--SPNAILLTNQDGFNYLDKLK----DKDGKYILQSDPTQKNKKLFA 299 (392) T ss_pred ---C-----ccCH----HHHHHHHHHhhh-hhh--ccCCEEEEcHHHHHHHHHhh----ccCCCeEeecCccCCcccccc Confidence 0 0122 344444432211 111 12346899999999986521 1111111100 000 Q ss_pred hCCceEEEEchhcccCCCCcccEEEEEEcCcceeEEeecchhhhc-ccee---eCCceEEEeeeeeeeeEEEECCeEEEE Q lcl|NC_019522. 232 NFPDITFEDDILLKGAGVAGADRMAVYKKEIRIVKGHDVMPLRFL-APAT---ADNVNFKVPAILRTGGTEWRIPKAGHY 307 (311) Q Consensus 232 n~~~l~i~~~~~l~~ag~~g~~~~v~y~~~~~~~~~~~~~~~~~~-~p~~---~~~~~~~~~~~~~~gGv~i~~P~ai~~ 307 (311) .+|.+.+.....+...+.+..+..++|-+=.+.+.+..-..+++. .+.. ...-...+.++.|++ +.+++|.+|+. T Consensus 300 G~~~v~~~~~~~~~~~~~~~~~~~~~~gdfs~~~~i~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d-~~v~~~~a~~~ 378 (392) T protein:vir:10 300 GTNPVVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKREDMELASTDVGGKAFTRNTLDLRAIQRDD-VQMWDNEAAVY 378 (392) T ss_pred CcccEEEecccccCCCcccCCceEEEEEehhceEEEEeecceEEEEeccccchhhcCceEEEEEEeec-cEEecccceEE Confidence 012222222222332333323333444332232222111122111 0110 001124577788884 68899999999 Q ss_pred eecC Q lcl|NC_019522. 308 VDGV 311 (311) Q Consensus 308 ~dGI 311 (311) ++.= T Consensus 379 l~~~ 382 (392) T protein:vir:10 379 GEID 382 (392) T ss_pred EEec Confidence 8876 No 82 >protein:vir:102082 Length: 392 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:1503 # MgeName: Fah # Cross-refs: genbank:acc:YP_512315;genbank:gi:89152484;genbank:GeneID:3953075 Probab=97.88 E-value=1.1e-05 Score=47.72 Aligned_cols=269 Identities=9% Similarity=0.013 Sum_probs=141.2 Q ss_pred CCcccccccchhhhhhhHHHHHHHHHHHHhhhhhhhhhhhhccccCCCCcceeEEEEEEeecccceEEecCcccccceee Q lcl|NC_019522. 1 MAKSVFDVSPVSALSFLVNQAAHIESEIYRIEYPQFKYGTLLPLDNSAPDWAQAVMFRSIDARGELQLFGPNSTDVPTVD 80 (311) Q Consensus 1 ~~~~~~~~~~~~~~~fl~~~L~~id~~v~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~G~a~~~~~~a~dip~v~ 80 (311) ..+..+...+.+++++++. +.+.+.|++.....-..+.++.+.. .+...-.+.+......+.+.|++.++. +|..+ T Consensus 101 ~~~~~~~~~t~~~gg~~vP--~~~~~~ii~~~~~~s~l~~~~~~~~-~~~~~~~~~~~~~~~~~~a~~v~E~~~-~~~~~ 176 (392) T protein:vir:10 101 LEQRAMSGLTGEDGGLVIP--QDIQTQINELARSFDALEQYVTVEP-VRTRSGSRVLEKNSDMIPFAEITEMGE-IPETD 176 (392) T ss_pred hhhhhccccccCCCceecc--hhHHHHHHHHHHhhhhhhhhceeee-ccCCceeEEEEeecCCccceeeccccc-ccccc Confidence 2223333333345566665 4566778887777666666665432 111222233334444556778877644 55443 Q ss_pred -eeccceeEEEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhhheeeeeccccceeeeecCCcceeeccCC Q lcl|NC_019522. 81 -IAMSQGFKDINTAALGYTYSIEEIGFAMLNNVNLDAERGQAVRDVVEQGLNKIYLLGDKGVGEGLYTSPNVSVEAATST 159 (311) Q Consensus 81 -~~~~~~~~~v~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aa~~~~~~~~n~~~~~G~~~~g~GllN~p~v~~~~~~~~ 159 (311) ..++......+.++..+.+|.+=|+. +..+|.+--.....+++.+.+|..++.|+...+. + T Consensus 177 ~~~~~~v~l~~~k~~~~~~iS~ell~d---s~~~l~~~i~~~l~~~i~~~~d~~~~~g~g~~~~-----~---------- 238 (392) T protein:vir:10 177 NPKFSNVQYAVKDRAGILPLSRSLLQD---SDQNILKYVTKWLGKKSKVTRNVLILGVIEKLTK-----Q---------- 238 (392) T ss_pred cccceeEEeeeeeEEEeehhhHHHHhh---hHHHHHHHHHHHHHHHHHHHHHHHHhhccccccc-----c---------- Confidence 56788888899999999998654443 3467888889999999999999999888764210 0 Q ss_pred ccccCcccccCCHHHHHHHHHHHHHHHHhccCCceecceEEEeCHHHHHHHhcccccCCCCCcchHHHH-HHH------- Q lcl|NC_019522. 160 FVALVAAIPTNGTQPIIDFFGNAYNTVYLDNTLTVHRPNTFVLPPAQFQLLARTLLSTQNASNVTLLQF-LRT------- 231 (311) Q Consensus 160 ~~~~~t~w~~~t~~ei~~di~~~~~~~~~~~~~~~~~p~~l~lpp~~~~~L~~~~~~~~~~~~~Tvl~~-l~~------- 231 (311) + ..+. +||.++++.... ..+ ...-.++|+|+.+..|.+-. +..|.-++.- +.. T Consensus 239 ---~-----~~~~----d~i~~~~~~~l~-~~~--~~~a~~vm~~~~~~~L~~lk----d~~G~~l~~~~~~~~~~~tll 299 (392) T protein:vir:10 239 ---A-----IKSL----DDIKDVLNVKLD-PAI--SPNAILLTNQDGFNYLDKLK----DKDGKYILQSDPTQKNKKLFA 299 (392) T ss_pred ---C-----ccCH----HHHHHHHHHhhh-hhh--ccCCEEEEcHHHHHHHHHhh----ccCCCeEeecCccCCcccccc Confidence 0 0122 344444432211 111 12346899999999986521 1111111100 000 Q ss_pred hCCceEEEEchhcccCCCCcccEEEEEEcCcceeEEeecchhhhc-ccee---eCCceEEEeeeeeeeeEEEECCeEEEE Q lcl|NC_019522. 232 NFPDITFEDDILLKGAGVAGADRMAVYKKEIRIVKGHDVMPLRFL-APAT---ADNVNFKVPAILRTGGTEWRIPKAGHY 307 (311) Q Consensus 232 n~~~l~i~~~~~l~~ag~~g~~~~v~y~~~~~~~~~~~~~~~~~~-~p~~---~~~~~~~~~~~~~~gGv~i~~P~ai~~ 307 (311) .+|.+.+.....+...+.+..+..++|-+=.+.+.+..-..+++. .+.. ...-...+.++.|++ +.+++|.+|+. T Consensus 300 G~~~v~~~~~~~~~~~~~~~~~~~~~~gdfs~~~~i~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d-~~v~~~~a~~~ 378 (392) T protein:vir:10 300 GTNPVVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKREDMELASTDVGGKAFTRNTLDLRAIQRDD-VQMWDNEAAVY 378 (392) T ss_pred CcccEEEecccccCCCcccCCceEEEEEehhceEEEEeecceEEEEeccccchhhcCceEEEEEEeec-cEEecccceEE Confidence 012222222222332333323333444332232222111122111 0110 001124577788884 68899999999 Q ss_pred eecC Q lcl|NC_019522. 308 VDGV 311 (311) Q Consensus 308 ~dGI 311 (311) ++.= T Consensus 379 l~~~ 382 (392) T protein:vir:10 379 GEID 382 (392) T ss_pred EEec Confidence 8876 No 83 >protein:vir:107593 Length: 392 # NCBI annotation: major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1491 # MgeName: Gamma # Cross-refs: genbank:acc:YP_338188;genbank:gi:77020144;genbank:GeneID:3703724 Probab=97.88 E-value=1.1e-05 Score=47.72 Aligned_cols=269 Identities=9% Similarity=0.013 Sum_probs=141.2 Q ss_pred CCcccccccchhhhhhhHHHHHHHHHHHHhhhhhhhhhhhhccccCCCCcceeEEEEEEeecccceEEecCcccccceee Q lcl|NC_019522. 1 MAKSVFDVSPVSALSFLVNQAAHIESEIYRIEYPQFKYGTLLPLDNSAPDWAQAVMFRSIDARGELQLFGPNSTDVPTVD 80 (311) Q Consensus 1 ~~~~~~~~~~~~~~~fl~~~L~~id~~v~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~G~a~~~~~~a~dip~v~ 80 (311) ..+..+...+.+++++++. +.+.+.|++.....-..+.++.+.. .+...-.+.+......+.+.|++.++. +|..+ T Consensus 101 ~~~~~~~~~t~~~gg~~vP--~~~~~~ii~~~~~~s~l~~~~~~~~-~~~~~~~~~~~~~~~~~~a~~v~E~~~-~~~~~ 176 (392) T protein:vir:10 101 LEQRAMSGLTGEDGGLVIP--QDIQTQINELARSFDALEQYVTVEP-VRTRSGSRVLEKNSDMIPFAEITEMGE-IPETD 176 (392) T ss_pred hhhhhccccccCCCceecc--hhHHHHHHHHHHhhhhhhhhceeee-ccCCceeEEEEeecCCccceeeccccc-ccccc Confidence 2223333333345566665 4566778887777666666665432 111222233334444556778877644 55443 Q ss_pred -eeccceeEEEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhhheeeeeccccceeeeecCCcceeeccCC Q lcl|NC_019522. 81 -IAMSQGFKDINTAALGYTYSIEEIGFAMLNNVNLDAERGQAVRDVVEQGLNKIYLLGDKGVGEGLYTSPNVSVEAATST 159 (311) Q Consensus 81 -~~~~~~~~~v~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aa~~~~~~~~n~~~~~G~~~~g~GllN~p~v~~~~~~~~ 159 (311) ..++......+.++..+.+|.+=|+. +..+|.+--.....+++.+.+|..++.|+...+. + T Consensus 177 ~~~~~~v~l~~~k~~~~~~iS~ell~d---s~~~l~~~i~~~l~~~i~~~~d~~~~~g~g~~~~-----~---------- 238 (392) T protein:vir:10 177 NPKFSNVQYAVKDRAGILPLSRSLLQD---SDQNILKYVTKWLGKKSKVTRNVLILGVIEKLTK-----Q---------- 238 (392) T ss_pred cccceeEEeeeeeEEEeehhhHHHHhh---hHHHHHHHHHHHHHHHHHHHHHHHHhhccccccc-----c---------- Confidence 56788888899999999998654443 3467888889999999999999999888764210 0 Q ss_pred ccccCcccccCCHHHHHHHHHHHHHHHHhccCCceecceEEEeCHHHHHHHhcccccCCCCCcchHHHH-HHH------- Q lcl|NC_019522. 160 FVALVAAIPTNGTQPIIDFFGNAYNTVYLDNTLTVHRPNTFVLPPAQFQLLARTLLSTQNASNVTLLQF-LRT------- 231 (311) Q Consensus 160 ~~~~~t~w~~~t~~ei~~di~~~~~~~~~~~~~~~~~p~~l~lpp~~~~~L~~~~~~~~~~~~~Tvl~~-l~~------- 231 (311) + ..+. +||.++++.... ..+ ...-.++|+|+.+..|.+-. +..|.-++.- +.. T Consensus 239 ---~-----~~~~----d~i~~~~~~~l~-~~~--~~~a~~vm~~~~~~~L~~lk----d~~G~~l~~~~~~~~~~~tll 299 (392) T protein:vir:10 239 ---A-----IKSL----DDIKDVLNVKLD-PAI--SPNAILLTNQDGFNYLDKLK----DKDGKYILQSDPTQKNKKLFA 299 (392) T ss_pred ---C-----ccCH----HHHHHHHHHhhh-hhh--ccCCEEEEcHHHHHHHHHhh----ccCCCeEeecCccCCcccccc Confidence 0 0122 344444432211 111 12346899999999986521 1111111100 000 Q ss_pred hCCceEEEEchhcccCCCCcccEEEEEEcCcceeEEeecchhhhc-ccee---eCCceEEEeeeeeeeeEEEECCeEEEE Q lcl|NC_019522. 232 NFPDITFEDDILLKGAGVAGADRMAVYKKEIRIVKGHDVMPLRFL-APAT---ADNVNFKVPAILRTGGTEWRIPKAGHY 307 (311) Q Consensus 232 n~~~l~i~~~~~l~~ag~~g~~~~v~y~~~~~~~~~~~~~~~~~~-~p~~---~~~~~~~~~~~~~~gGv~i~~P~ai~~ 307 (311) .+|.+.+.....+...+.+..+..++|-+=.+.+.+..-..+++. .+.. ...-...+.++.|++ +.+++|.+|+. T Consensus 300 G~~~v~~~~~~~~~~~~~~~~~~~~~~gdfs~~~~i~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d-~~v~~~~a~~~ 378 (392) T protein:vir:10 300 GTNPVVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKREDMELASTDVGGKAFTRNTLDLRAIQRDD-VQMWDNEAAVY 378 (392) T ss_pred CcccEEEecccccCCCcccCCceEEEEEehhceEEEEeecceEEEEeccccchhhcCceEEEEEEeec-cEEecccceEE Confidence 012222222222332333323333444332232222111122111 0110 001124577788884 68899999999 Q ss_pred eecC Q lcl|NC_019522. 308 VDGV 311 (311) Q Consensus 308 ~dGI 311 (311) ++.= T Consensus 379 l~~~ 382 (392) T protein:vir:10 379 GEID 382 (392) T ss_pred EEec Confidence 8876 No 84 >protein:vir:4197 Length: 314 # NCBI annotation: putative structural protein # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:88 # MgeName: psiM100 # Cross-refs: genbank:acc:NP_071822;genbank:gi:11863105;genbank:GeneID:1257607 Probab=97.88 E-value=1.3e-05 Score=47.46 Aligned_cols=283 Identities=12% Similarity=0.024 Sum_probs=147.5 Q ss_pred CCcccccccchhhhhhhHHHHHHHHHHHHhhhhhhhhhhhhccccCCCCcceeEEEEEEeec----ccceEEecCccccc Q lcl|NC_019522. 1 MAKSVFDVSPVSALSFLVNQAAHIESEIYRIEYPQFKYGTLLPLDNSAPDWAQAVMFRSIDA----RGELQLFGPNSTDV 76 (311) Q Consensus 1 ~~~~~~~~~~~~~~~fl~~~L~~id~~v~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~----~G~a~~~~~~a~di 76 (311) =...++.+ ++.++++|..+ ..+ ++++.....-..+++..+.+........+ ..... ...+.+.+ ..... T Consensus 9 ~~~k~it~-~d~~gG~L~P~--~~~-~~i~~l~e~s~i~~~a~vi~t~~s~~~~i--~~i~~g~~~~~~~~~~~-~~~~~ 81 (314) T protein:vir:41 9 QITPKIDV-PDLGKGILAVQ--RFG-EFVREVRENSAIIKDARVLNALKSYEVDI--SRISLGVELEPGRNTSG-TKVAP 81 (314) T ss_pred Hhhccccc-ccCCCceeChH--HHH-HHHHHHHhccchhhheeeecccCccceee--cccccCccccccccccc-CCccC Confidence 11122332 34455677752 233 46666665556666666543322222222 11111 01112222 23445 Q ss_pred ceeeeeccceeEEEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhhheeeeecccc-------c--eeeee Q lcl|NC_019522. 77 PTVDIAMSQGFKDINTAALGYTYSIEEIGFAMLNNVNLDAERGQAVRDVVEQGLNKIYLLGDKGV-------G--EGLYT 147 (311) Q Consensus 77 p~v~~~~~~~~~~v~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aa~~~~~~~~n~~~~~G~~~~-------g--~GllN 147 (311) |..+..++......+.+..-+..+.+-|+- ...|.+|...-....++.+...+..+.|+||... + .|+|+ T Consensus 82 ~~~~~tf~~~~l~~~kl~~~v~is~e~L~D-~a~~~~le~~i~~~~Ae~~g~~~~~~~~nGdg~~~s~~~~~~~p~G~l~ 160 (314) T protein:vir:41 82 TADEVTVSTNTLEMKELVTKVVLEDEALED-NIEQSAFEQTITSLLASGVTYDLECFFLHADSSLTTGRELYRINDGWMK 160 (314) T ss_pred CcccccccceeeeeEEEEEeecccHHHHHh-hhchhhHHHHHHHHHHHHHHHHHHHHhhccccCCcCcccchhcchhhhh Confidence 777777888888888888888888777765 4467789999999999999999999999998641 2 47777 Q ss_pred cCCcceeeccCCccccCcccccCCHHHHHHHHHHHHHHHHhccCCcee-cceEEEeCHHHHHHHhcccccCCCCCcchHH Q lcl|NC_019522. 148 SPNVSVEAATSTFVALVAAIPTNGTQPIIDFFGNAYNTVYLDNTLTVH-RPNTFVLPPAQFQLLARTLLSTQNASNVTLL 226 (311) Q Consensus 148 ~p~v~~~~~~~~~~~~~t~w~~~t~~ei~~di~~~~~~~~~~~~~~~~-~p~~l~lpp~~~~~L~~~~~~~~~~~~~Tvl 226 (311) ......+.+.+ -..+.+++. +.+++..|. +.+... .....+|+++.+..+.+..-. . +..+. T Consensus 161 ~a~~~~~~~~~--------~~~~~~~~~---~~~l~~sl~--~~yr~~~~~~~~~m~~~t~~~~r~~l~~-~---~~~l~ 223 (314) T protein:vir:41 161 LAGNQYTDAEP--------EDENWPLNL---FDGMMDELD--TRYLQLKPRMKFYVSNEIYNGYRKQLLV-R---ETGLG 223 (314) T ss_pred hcccceeecCc--------cccccHHHH---HHHHHHhcC--chhhcCCCceEEEecHHHHHHHHHHHhc-c---CCccc Confidence 65433222211 112233433 444444442 112111 123688898887765432211 1 11122 Q ss_pred HHHHHhCC-----ceEEEEchhcccCCCCcccEEEEEEcCcceeEEeecchhhhccceeeCCceEEEeeeeeeeeEEEEC Q lcl|NC_019522. 227 QFLRTNFP-----DITFEDDILLKGAGVAGADRMAVYKKEIRIVKGHDVMPLRFLAPATADNVNFKVPAILRTGGTEWRI 301 (311) Q Consensus 227 ~~l~~n~~-----~l~i~~~~~l~~ag~~g~~~~v~y~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~gGv~i~~ 301 (311) +-....+. ...++.++.+.+.+.+ ...+++.+ ++++-+.+...+++..-...+.-.+.+-...|++....-. T Consensus 224 ~~~~~~~~~~~l~G~PV~~~~~~~~~~~~--~~~i~fgd-~~nlv~~~~~~ir~~~~~~a~~~~~~~~~~~r~d~~~~~~ 300 (314) T protein:vir:41 224 DSALIGATGLQYDGIPIQYVPALDALGDD--KARALLTV-PTNLVYGFWRNIRIEPKRDAAMRRTEYIASLRADCNYEDE 300 (314) T ss_pred chhhhCCCCceecceeeEecccccccCCC--CceEEEec-hhheEEEeeceeEEeecccCcCCeEEEEEEEEeceEEEEc Confidence 22222222 2346666666554432 22344433 5676666666665542222233345555566665333345 Q ss_pred CeEEEEeecC Q lcl|NC_019522. 302 PKAGHYVDGV 311 (311) Q Consensus 302 P~ai~~~dGI 311 (311) +.++..+-+= T Consensus 301 ~aa~~~~~~~ 310 (314) T protein:vir:41 301 NAAVAAVIDM 310 (314) T ss_pred CcEEEEEeec Confidence 4555555444 No 85 >protein:vir:4092 Length: 390 # NCBI annotation: major capsid protein a # Family: family:all:635 # MgeID: mge:86 # MgeName: 2389 # Cross-refs: genbank:acc:NP_510986;swissprot:trembl:q8w604;genbank:gi:17488508;uniprot:Q8W604;genbank:GeneID:1260361 Probab=97.87 E-value=7.8e-06 Score=48.58 Aligned_cols=281 Identities=9% Similarity=-0.025 Sum_probs=141.8 Q ss_pred CCcccccccchhhhhhhHHHHHHHHHHHHhhhhhhhhhhhhccccCCCCcceeEEEEEEeecccceEEecCcccccc-ee Q lcl|NC_019522. 1 MAKSVFDVSPVSALSFLVNQAAHIESEIYRIEYPQFKYGTLLPLDNSAPDWAQAVMFRSIDARGELQLFGPNSTDVP-TV 79 (311) Q Consensus 1 ~~~~~~~~~~~~~~~fl~~~L~~id~~v~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~G~a~~~~~~a~dip-~v 79 (311) .-...+...+.+++++++. +.+...|++.....-..+.++.+.. -......+.+....+.+.|.+..+ .+| .. T Consensus 79 ~~~~~~~~~~~~~gg~lvP--~~~~~~I~~~~~~~s~i~~~~~~~~---~~~~~~~i~~~~~~~~a~~~~E~~-~~~~~~ 152 (390) T protein:vir:40 79 YYNEVIAGNGFAGVTALLP--PTVFERVFEDLTVEHPLLSKINFVN---TTATTEWIISVGDVATAWWGPLCA-EIKEVL 152 (390) T ss_pred HHHHHHhccCcccCccccc--HHHHHHHHHHHHhhhhhhhhceeee---cCCceeEEEEEcCCcceeeecccc-ccCccc Confidence 0001111122334555554 4455667776665555556555432 222344455666777888877543 344 34 Q ss_pred eeeccceeEEEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhhheeeeeccccc-eeeeecCCcceeeccC Q lcl|NC_019522. 80 DIAMSQGFKDINTAALGYTYSIEEIGFAMLNNVNLDAERGQAVRDVVEQGLNKIYLLGDKGVG-EGLYTSPNVSVEAATS 158 (311) Q Consensus 80 ~~~~~~~~~~v~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aa~~~~~~~~n~~~~~G~~~~g-~GllN~p~v~~~~~~~ 158 (311) +..++......+.++.-+.++.+=++ .+..+|.+.-....++++...+|+-+++|+.... .|+||.++....... T Consensus 153 ~~~f~~i~l~~~k~~~~i~iS~ell~---ds~~~l~~~i~~~la~~i~~~~~~a~l~G~G~~~P~Gil~~~~~~~~~~~- 228 (390) T protein:vir:40 153 DNGFDKIQTGMYKLSAYIPVCNAMLD---LGPSWLDQYVRTILGEAMALGLEAGIVNGSGKDQPIGMMRDLNNVTAGEH- 228 (390) T ss_pred cccceeeEeeeeeEEEeehhhHHHHh---cchHHHHHHHHHHHHHHHHHHHHhhhhcccCCCccceeeecccccccccc- Confidence 67788889999999988888844333 3455788889999999999999999999987543 699998764322111 Q ss_pred CccccCcccccCCHHHHHHHHHHHHHHHHhccCCceecceEEEeCHHH-HHHHhc--ccccCCCCCcchHHHHHHHhCCc Q lcl|NC_019522. 159 TFVALVAAIPTNGTQPIIDFFGNAYNTVYLDNTLTVHRPNTFVLPPAQ-FQLLAR--TLLSTQNASNVTLLQFLRTNFPD 235 (311) Q Consensus 159 ~~~~~~t~w~~~t~~ei~~di~~~~~~~~~~~~~~~~~p~~l~lpp~~-~~~L~~--~~~~~~~~~~~Tvl~~l~~n~~~ 235 (311) .... ..+-|.+.+.+.+..+...+..... .....-.++|.+.. +.+|.. ... +..|.-+...+ ... T Consensus 229 -~~~~---~~~~t~~~~~~~~~~l~~~~~~~~~-~~~~~a~~i~n~~t~~~~l~~~~~~~---d~~G~~v~~~~---~~g 297 (390) T protein:vir:40 229 -PVKT---ATPLTDLTPATLATKVMLPLTDNGK-KSVSDAILVINPADYWSKIYAATSYM---TPQGVWVTGIL---PVP 297 (390) T ss_pred -cccc---ccccchhhHHHHHHHHHHHhhcchh-hhhcCceEEEcchhHHHHHHHHhhcc---CCCCccccccC---CCc Confidence 1111 1122333333333333333321111 11122346676654 333321 111 11122221111 123 Q ss_pred eEEEEchhcccCCCCcccEEEEEEcCcceeEEeecchhhhccceee--CCceEEEeeeeeeeeEEEECCeEEEEe--ecC Q lcl|NC_019522. 236 ITFEDDILLKGAGVAGADRMAVYKKEIRIVKGHDVMPLRFLAPATA--DNVNFKVPAILRTGGTEWRIPKAGHYV--DGV 311 (311) Q Consensus 236 l~i~~~~~l~~ag~~g~~~~v~y~~~~~~~~~~~~~~~~~~~p~~~--~~~~~~~~~~~~~gGv~i~~P~ai~~~--dGI 311 (311) +.++..+.+.. + . ++|-+=.++ -+..-..+++..--+. ......+..+.|++ +.+++|.|++.+ .++ T Consensus 298 ~pvv~~~~~p~-~-----~-i~~Gd~s~~-~i~~~~~~~v~~~~~~~f~~~~~~~r~~~r~d-g~v~~~~A~~~l~~~~~ 368 (390) T protein:vir:40 298 LEIVQSVAVPV-G-----K-AVAGRAKDY-FMGIGSEQVIRTSTEYRLLDDETLYYAKQYAN-GRPKDNSSFLVFDITGL 368 (390) T ss_pred eeEEEcCCCCC-C-----c-EEEEeeceE-EEEeecceEEEecchhhhhcCcEEEEEEEEeC-CEEecccceEEEEeecc Confidence 44554444432 1 1 223222222 2222223322110011 11235566678874 567779999855 455 No 86 >protein:vir:97255 Length: 310 # NCBI annotation: hypothetical protein ORF017 # Family: family:all:1120 # MgeID: mge:1657 # MgeName: M6 # Cross-refs: genbank:acc:YP_001294525;genbank:gi:149408246;genbank:GeneID:5237120 Probab=97.86 E-value=7.5e-06 Score=48.67 Aligned_cols=276 Identities=11% Similarity=0.082 Sum_probs=136.3 Q ss_pred CCcccccccchhhhhhhHHHHHHHHHHHHhhhhhhhhhhhhccccCCCCcceeEEEEEEeecccceEE--ecCcc--ccc Q lcl|NC_019522. 1 MAKSVFDVSPVSALSFLVNQAAHIESEIYRIEYPQFKYGTLLPLDNSAPDWAQAVMFRSIDARGELQL--FGPNS--TDV 76 (311) Q Consensus 1 ~~~~~~~~~~~~~~~fl~~~L~~id~~v~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~G~a~~--~~~~a--~di 76 (311) |+.-++. ..+.+.. ..+...|+|.....-...+.+|-.... - ..+.|.....-+.+.. ++... ... T Consensus 1 mpaltLa-----ea~k~~~--d~l~~~ViE~~~~~s~lL~~LpF~~ve-g--~~~~ynR~~~~~~~~~~~v~~~~~~~g~ 70 (310) T protein:vir:97 1 MASVTLA-----ESAKLAQ--DELVAGVIENIITVNRMFDVLPFDSIE-G--NSLAYNRENVLGDVIMAGVGTTFSGAGA 70 (310) T ss_pred CcccchH-----HHhhcCc--chHHHHHHHHHhccchHHHhCCccccc-C--CcceeeEeeccCCcccccccccccCCCc Confidence 5433332 1123332 334567777665555556666643211 1 1234432222111111 11111 122 Q ss_pred ceeeeeccceeEEEEEEEEEEEecHHHHHHHHH-hCCChH--HHHHHHHHHHHHHhhhheeeeeccccc-e-eeeecCCc Q lcl|NC_019522. 77 PTVDIAMSQGFKDINTAALGYTYSIEEIGFAML-NNVNLD--AERGQAVRDVVEQGLNKIYLLGDKGVG-E-GLYTSPNV 151 (311) Q Consensus 77 p~v~~~~~~~~~~v~~~~~~~~~~~~El~~a~~-~g~~l~--~~k~~aa~~~~~~~~n~~~~~G~~~~g-~-GllN~p~v 151 (311) |......++.+..+..++..++++-+- +.. .+-+.+ ....+...+++.++.....++||...+ + ||+..-.- T Consensus 71 ~~~~~t~~~~~~~L~i~~g~~~Vd~~i---~dl~~~~~~dq~~~Ql~~~iea~~~~~e~~lINGD~a~n~F~GL~~~~~~ 147 (310) T protein:vir:97 71 GKAAATFTKVNSNLTTIMGDAEVNGLI---QATRSGDGNDQTAVQIASKAKSAGRKYQDQLINGNGAGNEFAGLIQLCAS 147 (310) T ss_pred cccccccceeeeeeeeeeehhhhhhHH---HhhhcCChHHHHHHHHHHHHHHHHHHHHHHhhccccCCCcccchhhcCCc Confidence 333344455566666666665554221 122 233333 445666778888999999999998765 4 99876322 Q ss_pred ceeeccCCccccCcccccCCHHHHHHHHHHHHHHHHhccCCceecceEEEeCHHH---HHHHhcccc-----c-CCCCCc Q lcl|NC_019522. 152 SVEAATSTFVALVAAIPTNGTQPIIDFFGNAYNTVYLDNTLTVHRPNTFVLPPAQ---FQLLARTLL-----S-TQNASN 222 (311) Q Consensus 152 ~~~~~~~~~~~~~t~w~~~t~~ei~~di~~~~~~~~~~~~~~~~~p~~l~lpp~~---~~~L~~~~~-----~-~~~~~~ 222 (311) ....... ..++.. | ++|+.++++.+|...+ .|+.|+++|+. ++.+.|.-. + ..+.+| T Consensus 148 ~q~i~~~-~~gg~~-----t----~d~LDeLl~~v~~~~g----~p~~~l~~~~~~r~i~A~~R~~~~~g~~~~~~~~~G 213 (310) T protein:vir:97 148 GQKATTG-ATGSAI-----S----FAILDELMDLVVDKDG----QVDYLTMHARTLRSYKALLRALGGASINEVVELPSG 213 (310) T ss_pred cceeecC-CCCCCC-----C----HHHHHHHHHHHhcCCC----CCCEEEecHHHHHHHHHHHHHhcCCCCCCccccCCC Confidence 1111111 112221 2 5789999999975322 47889999964 555544211 0 011222 Q ss_pred chHHHHHHHhCCceEEEEchhccc----CCCCcccEEEEEEcCcc-----eeEEeecc----hhhhcccee-eCCceEEE Q lcl|NC_019522. 223 VTLLQFLRTNFPDITFEDDILLKG----AGVAGADRMAVYKKEIR-----IVKGHDVM----PLRFLAPAT-ADNVNFKV 288 (311) Q Consensus 223 ~Tvl~~l~~n~~~l~i~~~~~l~~----ag~~g~~~~v~y~~~~~-----~~~~~~~~----~~~~~~p~~-~~~~~~~~ 288 (311) .-|+ .+..+-|.++..+.. ..++|+...++..-+.+ .+.++... ..+++.-.+ ..-..|.+ T Consensus 214 ~~v~-----~~~GiPi~~~d~ip~~~~~~~~~gtTsIya~r~Ge~~~~~Gv~Gl~~~~~~glsVr~~G~~~~~~v~~~~V 288 (310) T protein:vir:97 214 AEVP-----AYSGTPIFRNDYIPTNQTKGGTTGCTTIFAGTLDDGSRTHGIAGLTATQAAGIQVVDVGESEDSDEHIWRV 288 (310) T ss_pred CEEe-----eeCCeEEEEeCccCCCccccccCCceeEEEEeeCccccccceeccccCCccceeEEeCCcccCCcceeEEE Confidence 2222 122233444333321 12456677777666653 22322111 122322112 22234555 Q ss_pred eeeeeeeeEEEECCeEEEEeecC Q lcl|NC_019522. 289 PAILRTGGTEWRIPKAGHYVDGV 311 (311) Q Consensus 289 ~~~~~~gGv~i~~P~ai~~~dGI 311 (311) .. .-|+-++-|.|++.+.|| T Consensus 289 ~~---Y~~~av~~~~A~a~L~~V 308 (310) T protein:vir:97 289 KW---YCGLALFSEKGLACADGI 308 (310) T ss_pred EE---eeeEEEecccceeeeccc Confidence 44 347889999999999999 No 87 >protein:vir:3991 Length: 404 # NCBI annotation: major structural protein # Family: family:all:21 # MgeID: mge:319 # MgeName: BK5-T # Cross-refs: genbank:acc:NP_116499;genbank:gi:14251132;genbank:GeneID:921252 Probab=97.83 E-value=4.9e-06 Score=49.67 Aligned_cols=268 Identities=12% Similarity=-0.046 Sum_probs=141.0 Q ss_pred CCcccccccchhhhhhhHHHHHHHHHHHHhhhhhhhhhhhhccccCCCCcceeEEEEE-EeecccceEEecCcccccce- Q lcl|NC_019522. 1 MAKSVFDVSPVSALSFLVNQAAHIESEIYRIEYPQFKYGTLLPLDNSAPDWAQAVMFR-SIDARGELQLFGPNSTDVPT- 78 (311) Q Consensus 1 ~~~~~~~~~~~~~~~fl~~~L~~id~~v~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~-~~~~~G~a~~~~~~a~dip~- 78 (311) ....++...+.+.+++++. +.+.+.|++........+.++.+.. .+-....+.+. ..+..+.+.+++.++ .+|- T Consensus 111 ~e~~a~~~~t~~~gg~~iP--~~~~~~ii~~~~~~~~l~~~~~~~~-~~~~~~~~~~~~~~~~~~~a~~v~Eg~-~~~~~ 186 (404) T protein:vir:39 111 VSSKTETSGSDSAAGLTIP--QDIRTMINTLVRQYDSLQQYVRVES-VSTSNGSRVYEKWTDVTPLTVMDAEDG-KIPDL 186 (404) T ss_pred hhhhhhhcccccCCceecc--HHHHHHHHHHHHhhhhHHhhcceee-ccCCcceEEEEeecCCccceeeecCcc-ccccc Confidence 1122233333344456665 4566778888777777777766533 22222333332 334456778887653 4564 Q ss_pred eeeeccceeEEEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhhheeeeeccccceeeeecCCcceeeccC Q lcl|NC_019522. 79 VDIAMSQGFKDINTAALGYTYSIEEIGFAMLNNVNLDAERGQAVRDVVEQGLNKIYLLGDKGVGEGLYTSPNVSVEAATS 158 (311) Q Consensus 79 v~~~~~~~~~~v~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aa~~~~~~~~n~~~~~G~~~~g~GllN~p~v~~~~~~~ 158 (311) .+..++......+.++..+.+|.+ +. .....+|.+--.....+++.+.+|+-+++|+.... | T Consensus 187 ~~~~f~~i~~~~~k~~~~~~iS~e-ll--~ds~~~l~~~i~~~l~~~~~~~~d~~il~g~g~~~------~--------- 248 (404) T protein:vir:39 187 DNPRLTIIKYLIKRYAGIITATNT-LL--KDTAENILAWLSSWIAKKVVVTRNQAIIAAMGTVP------K--------- 248 (404) T ss_pred cccceeeEEeeeeeEEeeehhHHH-HH--hhchHHHHHHHHHHHHHHHHHHHHHHHHhcccccc------c--------- Confidence 456788999999999988888854 33 23456788888899999999999999999875421 0 Q ss_pred CccccCcccccCCHHHHHHHHHHHHHHHHhccCCceecceEEEeCHHHHHHHhcccccCCCCCcchHHHH-HHH------ Q lcl|NC_019522. 159 TFVALVAAIPTNGTQPIIDFFGNAYNTVYLDNTLTVHRPNTFVLPPAQFQLLARTLLSTQNASNVTLLQF-LRT------ 231 (311) Q Consensus 159 ~~~~~~t~w~~~t~~ei~~di~~~~~~~~~~~~~~~~~p~~l~lpp~~~~~L~~~~~~~~~~~~~Tvl~~-l~~------ 231 (311) .++ ..+. +|+.+++...... .+ .....++|+|+.+..|.+-. +..|.-++.- +.. T Consensus 249 --~~~-----~~~~----~~i~~~~~~~~~~-~~--~~~a~~v~n~~~~~~L~~lk----d~~G~~l~~~~~~~~~~~~l 310 (404) T protein:vir:39 249 --KPT-----IAKF----DDVITMINTSVDP-AI--IATSSLLTNQSGLNKLALVK----TAEGKYLLEPDPTKPNSYLI 310 (404) T ss_pred --ccc-----cccH----HHHHHHHHHhhhh-hh--ccCCEEEEcHHHHHHHHHhh----ccCCceeeccCcCCCCccee Confidence 000 1123 3444444322111 11 12346899999999997532 1112222110 000 Q ss_pred hCCceEEEEchhcccCCCCcccEEEEEEcCcceeEEeecchhhhcc-ceee---CCceEEEeeeeeeeeEEEECCeEEEE Q lcl|NC_019522. 232 NFPDITFEDDILLKGAGVAGADRMAVYKKEIRIVKGHDVMPLRFLA-PATA---DNVNFKVPAILRTGGTEWRIPKAGHY 307 (311) Q Consensus 232 n~~~l~i~~~~~l~~ag~~g~~~~v~y~~~~~~~~~~~~~~~~~~~-p~~~---~~~~~~~~~~~~~gGv~i~~P~ai~~ 307 (311) .+.++.+.....+. ..+.+...+++.+- .+.+.+..-..+++.. +... ..-...+.++.|+ |+.+++|.+++. T Consensus 311 ~G~pV~~~~~~~~~-~~~~~~~~~~~gd~-~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~r~~~r~-d~~~~~~~a~~~ 387 (404) T protein:vir:39 311 KGKKVIVVADRWLP-NSGSTVYPLYYGDM-SQAITLFDRENMSLLPTNIGAGAFETDTTKIRVIDRF-DVKTTDSEALVA 387 (404) T ss_pred cceeEEEecccccC-ccCCCccEEEEEec-cccEEEEeecceEEEEeccchhhhhhceeeEEEEeee-ccEEecccceEE Confidence 01112222111121 12222223332222 2333322222222111 1100 0112456677887 478999999999 Q ss_pred eecC Q lcl|NC_019522. 308 VDGV 311 (311) Q Consensus 308 ~dGI 311 (311) ++.- T Consensus 388 ~~~~ 391 (404) T protein:vir:39 388 GSFT 391 (404) T ss_pred EEee Confidence 9977 No 88 >protein:vir:80128 Length: 466 # NCBI annotation: Phage capsid protein # Family: family:all:635 # MgeID: mge:1877 # MgeName: bacteriophage bv1 # Cross-refs: genbank:acc:YP_001425603;genbank:gi:155042936;genbank:GeneID:5469556 Probab=97.81 E-value=9e-06 Score=48.24 Aligned_cols=283 Identities=11% Similarity=-0.013 Sum_probs=132.2 Q ss_pred CCcccccccchhhhhhhHHHHHHHHHHHHhhhhhhhhhhhhccccCCCCcceeEEEEEEeecccceEEecCcccccceee Q lcl|NC_019522. 1 MAKSVFDVSPVSALSFLVNQAAHIESEIYRIEYPQFKYGTLLPLDNSAPDWAQAVMFRSIDARGELQLFGPNSTDVPTVD 80 (311) Q Consensus 1 ~~~~~~~~~~~~~~~fl~~~L~~id~~v~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~G~a~~~~~~a~dip~v~ 80 (311) .....-...+.++.+.+.. +.+-..|++........+.++.+....+ ...+.+......+.|.+.. .++|..+ T Consensus 144 ~~~~~~~~~~~~g~~~~vP--~~~~~~i~~~l~~~~~l~~~~~v~~~~g----~~~~~~~~~~~~a~wv~E~-~~~~~~~ 216 (466) T protein:vir:80 144 VRTLAQQKRAVSGAELTIP--DVMLELLRDNMHRYSKLISKVRLRPLKG----TARQNIAGAIPEGVWTEAV-ANLNELS 216 (466) T ss_pred HHHHhhhhhhhcccccccc--HHHHHHHHHhhhhhhhhhhheeeeecCc----eeEeeeecCCcceeecccc-ccccccc Confidence 0000000112223333333 2233445554433333333333222111 2233344444566777654 5678888 Q ss_pred eeccceeEEEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhhheeeeeccccc-eeeeecCCcceeeccCC Q lcl|NC_019522. 81 IAMSQGFKDINTAALGYTYSIEEIGFAMLNNVNLDAERGQAVRDVVEQGLNKIYLLGDKGVG-EGLYTSPNVSVEAATST 159 (311) Q Consensus 81 ~~~~~~~~~v~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aa~~~~~~~~n~~~~~G~~~~g-~GllN~p~v~~~~~~~~ 159 (311) ..++.....++.++.-+.+|.+=|+ .+..++.+--....++++...+|+-+++|+.... .|+||+.+....... T Consensus 217 ~~f~~i~~~~~k~~~~~~iS~ell~---ds~~~l~~~i~~~la~~~~~~~~~ail~G~G~~~P~Gil~~~~~~~~~~~-- 291 (466) T protein:vir:80 217 LSFSQIEVDGYKVGGFIPIPNSTLE---DSDLNLADEILDAIGQAIGFALDKAILYGTGTKMPVGIVTRLAQTTQPPN-- 291 (466) T ss_pred ccccceeecceeeeeehhhhHHHHh---cchHHHHHHHHHHHHHHHHHHHhhheeeccCCCCcceeeecccccccccc-- Confidence 8899999999999988888855333 4556788999999999999999999999987644 699998764322211 Q ss_pred ccccCcccccCC-------------HHHHHHHHHHHHHHHHhccCCceecceEE-EeCHHHHHHHhcccccCCCCCcchH Q lcl|NC_019522. 160 FVALVAAIPTNG-------------TQPIIDFFGNAYNTVYLDNTLTVHRPNTF-VLPPAQFQLLARTLLSTQNASNVTL 225 (311) Q Consensus 160 ~~~~~t~w~~~t-------------~~ei~~di~~~~~~~~~~~~~~~~~p~~l-~lpp~~~~~L~~~~~~~~~~~~~Tv 225 (311) .......+...+ +...+.++...+..+.. ....|..+ .+.+..+..|.......+.. .. T Consensus 292 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~w~~~~~~~~~l~~~~~~~~~~---g~ 364 (466) T protein:vir:80 292 WGTKAPAWTNLSTTNLLKIDPTGKSAEEFFSELVLKLSKARA----NYSNGMKFWAMSSNTHAVLMSKAITFNSA---GA 364 (466) T ss_pred cccccccccccchhhhhhhhhhccchhhHHHHHHHHHHhhhc----cccCCceeEEecchhHHHhhcccccccCC---cc Confidence 111112222222 22223333222222211 11223333 44556665554432211110 00 Q ss_pred HHHHHHhCC---ceEEEEchhcccCC-CCc-ccEEEEEEcCcceeEEeecchhhhccceeeCCceEEEeeeeeeeeEEEE Q lcl|NC_019522. 226 LQFLRTNFP---DITFEDDILLKGAG-VAG-ADRMAVYKKEIRIVKGHDVMPLRFLAPATADNVNFKVPAILRTGGTEWR 300 (311) Q Consensus 226 l~~l~~n~~---~l~i~~~~~l~~ag-~~g-~~~~v~y~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~gGv~i~ 300 (311) +-+-..|.+ ...|+..+.+.... .+| ....+++.+ ..+.+.......| .. + ...+....|+ +..++ T Consensus 365 ~~~~~~~~~~i~G~pvv~s~~~~~~~~~~g~~~~y~i~~r--~~~~i~~~~~~~f----~~-d-~~~~r~~~r~-dg~~~ 435 (466) T protein:vir:80 365 LVASLNNTMPIVGGDIVILDFIPDNDIIGGYGSLYLLAER--ADIKLAQSEHVRF----IE-D-QTVFKGTARY-DGKPV 435 (466) T ss_pred ccccCCCcccccccceeecCccCccceeeeccccEEEEee--cceEEEechhhhh----hc-C-cEEEEEEEEE-ccEEe Confidence 111011111 12333333221100 011 011122211 1122222111111 11 2 2345667887 45668 Q ss_pred CCeEEEEeecC Q lcl|NC_019522. 301 IPKAGHYVDGV 311 (311) Q Consensus 301 ~P~ai~~~dGI 311 (311) +|.||+++++= T Consensus 436 ~~~afv~~~~~ 446 (466) T protein:vir:80 436 FGEGFVAVNIA 446 (466) T ss_pred ccCceEEEEec Confidence 99999999744 No 89 >protein:vir:93742 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1475 # MgeName: 55 # Cross-refs: genbank:acc:YP_240459;genbank:gi:66396126;genbank:GeneID:5133511 Probab=97.81 E-value=1.8e-05 Score=46.64 Aligned_cols=262 Identities=9% Similarity=-0.004 Sum_probs=134.3 Q ss_pred CCcccccccchhhhhhhHHHHHHHHHHHHhhhhhhhhhhhhccccCCCCc-ceeEEEEEEeecccceEEecCccccccee Q lcl|NC_019522. 1 MAKSVFDVSPVSALSFLVNQAAHIESEIYRIEYPQFKYGTLLPLDNSAPD-WAQAVMFRSIDARGELQLFGPNSTDVPTV 79 (311) Q Consensus 1 ~~~~~~~~~~~~~~~fl~~~L~~id~~v~~~~~~~~~~~~~~~v~~~~~~-~~~~~~~~~~~~~G~a~~~~~~a~dip~v 79 (311) ||. +.+..+-. +.. |.+.+.|.+.....+....+..+...... .-.++.+..++..|.++.+..+ ++|+.- T Consensus 1 ma~----~~T~~~~~-iiP--ev~~~~v~~~~~~~~~~~~~~~~~~~l~g~~G~tv~ip~~~~~g~~~~~~eg-~~i~~~ 72 (274) T protein:vir:93 1 MPQ----GITKTSNQ-IIP--EVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEG-EKIPTD 72 (274) T ss_pred CCc----cceehhhe-ech--HHHHHHHHHHHHhhhhhcccccccccccCCCCCEEEEEeeccCCCcccccCC-Cccccc Confidence 433 11111111 122 22333333434444455555554433221 1336788888888999988765 578988 Q ss_pred eeeccceeEEEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhhheeeeeccccceeeeecCCcceeeccCC Q lcl|NC_019522. 80 DIAMSQGFKDINTAALGYTYSIEEIGFAMLNNVNLDAERGQAVRDVVEQGLNKIYLLGDKGVGEGLYTSPNVSVEAATST 159 (311) Q Consensus 80 ~~~~~~~~~~v~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aa~~~~~~~~n~~~~~G~~~~g~GllN~p~v~~~~~~~~ 159 (311) ++..+.....+...+.+|+++ |+.+++. +.++-..-...+.++++++.|+.++..-.+ -. ...+ T Consensus 73 ~it~~~~~~~i~~~~~~~~i~--D~~~~~~-~~d~~~~~~~~~~~~~a~~~d~~~~~~~~~--------a~-----~~~~ 136 (274) T protein:vir:93 73 ILETKKREAKIRKIAKGTSIT--DEALLSG-YGDPQGEQVRQHGLAHANKVDNDVLEALMG--------AK-----LTVN 136 (274) T ss_pred ccccceeEEEeeeeccccccc--HHHHHhh-ccchHHHHHHHHHHHHHHHHHHHHHHHHhc--------cc-----cccc Confidence 888888888888877665555 4554443 455667777778888888888766521110 00 0000 Q ss_pred ccccCcccccCCHHHHHHHHHHHHHHHHhccCCceecceEEEeCHHHHHHHhccc----ccCCCCCcchHH-HHHHHhCC Q lcl|NC_019522. 160 FVALVAAIPTNGTQPIIDFFGNAYNTVYLDNTLTVHRPNTFVLPPAQFQLLARTL----LSTQNASNVTLL-QFLRTNFP 234 (311) Q Consensus 160 ~~~~~t~w~~~t~~ei~~di~~~~~~~~~~~~~~~~~p~~l~lpp~~~~~L~~~~----~~~~~~~~~Tvl-~~l~~n~~ 234 (311) .. .-+ +++|.++..++-.. ...+..|+++|..+..|.+.- +..... +..++ .-....+- T Consensus 137 ----~~---~~~----~d~i~dA~~~l~d~----~~~~~~ivv~p~~~~~L~k~~~~~f~~~s~~-g~~~~~~G~ig~~~ 200 (274) T protein:vir:93 137 ----AD---ITK----LNGLQSAIDKFNDE----DLEPMVLFINPLDAGKLRGDASTNFTRATEL-GDDIIVKGAFGEAL 200 (274) T ss_pred ----cc---ccC----HHHHHHHHHHhhhc----cCCccEEEeCHHHHHHHHhhhhhcccccccc-cccceeecccceec Confidence 00 011 45667777766422 125789999999999997521 111111 11111 00000112 Q ss_pred ceEEEEchhcccCCCCcccEEEEEEcCcceeEEeecchhhhccceeeCCceEEEeeeeeeeeEEEECCeEEEEeecC Q lcl|NC_019522. 235 DITFEDDILLKGAGVAGADRMAVYKKEIRIVKGHDVMPLRFLAPATADNVNFKVPAILRTGGTEWRIPKAGHYVDGV 311 (311) Q Consensus 235 ~l~i~~~~~l~~ag~~g~~~~v~y~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~gGv~i~~P~ai~~~dGI 311 (311) .++|+..+.+. ....+++ .+..+.+....+.+...-.......-.+..... .|+-+.+|.+++.+.-= T Consensus 201 G~~Vi~s~~~p------~~t~~l~--~~gai~~~~~~~~~vE~~Rd~~~~~d~i~~~~~-y~~~~~~~~~~v~~t~~ 268 (274) T protein:vir:93 201 GAIIVRTNKLE------AGTAILA--KKGAVKLILKRDFFLEVARDASTKTTALYSDKH-YVAYLYDESKAVKITKG 268 (274) T ss_pred CeeEEEcCCCC------cceEEEE--eCCeEEEEecCCcccccccchhhcccEEEEEEE-EEEEEEcCCceEEEeeC Confidence 34555554442 1222332 234455444343332211122222333333333 57899999999887754 No 90 >protein:vir:9820 Length: 272 # NCBI annotation: putative major capsid/head protein # Family: family:all:522 # MgeID: mge:176 # MgeName: 315.4 # Cross-refs: genbank:acc:NP_795582;genbank:gi:28876339;genbank:GeneID:1257858 Probab=97.78 E-value=2e-05 Score=46.39 Aligned_cols=260 Identities=10% Similarity=0.010 Sum_probs=139.0 Q ss_pred CCcccccccchhhhhhhHHHHHHHHHHHHhhhhhhhhhhhhccccCC--CCcceeEEEEEEeecccceEEecCcccccce Q lcl|NC_019522. 1 MAKSVFDVSPVSALSFLVNQAAHIESEIYRIEYPQFKYGTLLPLDNS--APDWAQAVMFRSIDARGELQLFGPNSTDVPT 78 (311) Q Consensus 1 ~~~~~~~~~~~~~~~fl~~~L~~id~~v~~~~~~~~~~~~~~~v~~~--~~~~~~~~~~~~~~~~G~a~~~~~~a~dip~ 78 (311) ||- .....+. .+.. +.+.+.+.+.....+....+.-+... +..|. ++.+..++..|.+.+++.+ +++|. T Consensus 1 MA~---~~T~~~~--~~iP--ev~s~~v~~~~~~~~~~~~~~~~~~~~~g~~G~-tv~iP~~~~~~~a~~v~eg-~~i~~ 71 (272) T protein:vir:98 1 MAV---GTTKMAQ--MLDP--EVLADMIDAEVGKAIRFAPLAEVDTTLEGQPGT-TLTVPKWDYIGDAEDVAEG-EAIPM 71 (272) T ss_pred CCC---ccccchh--eech--HHHHHHHHHHHHHHhhhhccccccccccCCCCC-EEEEEEecCCCCcccccCC-Ccccc Confidence 442 1111112 2232 22333344444444444444444332 22333 6677777888999999876 67999 Q ss_pred eeeeccceeEEEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhhheeeeeccccceeeeecCCcceeeccC Q lcl|NC_019522. 79 VDIAMSQGFKDINTAALGYTYSIEEIGFAMLNNVNLDAERGQAVRDVVEQGLNKIYLLGDKGVGEGLYTSPNVSVEAATS 158 (311) Q Consensus 79 v~~~~~~~~~~v~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aa~~~~~~~~n~~~~~G~~~~g~GllN~p~v~~~~~~~ 158 (311) .+...+.....+..++..+.++.++.+. .+.++...-...+.+.+++..++.++.--. |- .... T Consensus 72 ~~~~~~~~~~~~~~~~~~~~itd~~~~~---s~~d~~~~~~~~~~~~~a~~~d~~i~~~~~----~a--------~~~~- 135 (272) T protein:vir:98 72 TQLGFKKTTMTIKKAGKGVEITDEAILS---GYGDPVGQAAKQIVEAIDHKVDADVLDALS----KS--------TQTV- 135 (272) T ss_pred cccccceEEEEeeeeeeeeeecHHHHhh---ccccHHHHHHHHHHHHHHHHHHHHHHHHhc----cc--------cccc- Confidence 9999999999999999888888665443 456788888888888898888876552100 10 0000 Q ss_pred CccccCcccccCCHHHHHHHHHHHHHHHHhccCCceecceEEEeCHHHHHHHhccc-ccCCCCCcchHHHHHH----HhC Q lcl|NC_019522. 159 TFVALVAAIPTNGTQPIIDFFGNAYNTVYLDNTLTVHRPNTFVLPPAQFQLLARTL-LSTQNASNVTLLQFLR----TNF 233 (311) Q Consensus 159 ~~~~~~t~w~~~t~~ei~~di~~~~~~~~~~~~~~~~~p~~l~lpp~~~~~L~~~~-~~~~~~~~~Tvl~~l~----~n~ 233 (311) + + ..| +++|.+++..+-.. + ..+..++++|..+..|.+-- ...... +......+. .+. T Consensus 136 ~--~------~~t----~d~i~da~~~l~~~--~--~~~~~~vv~p~~~~~L~k~~~~~~~~~-~~~~~~~~~~g~ig~i 198 (272) T protein:vir:98 136 E--A------TAT----VDGVSKALDIFNDE--D--DAETVIVMNPADASTLRLDAAKEWLGA-TEVGANRVVSGVYGEV 198 (272) T ss_pred c--c------ccC----HHHHHHHHHHHhcc--C--CCccEEEEcHHHHHHHHHhcccccccc-ccccccccccccchhh Confidence 0 0 012 56677777776422 2 34678999999998885421 100000 000001110 011 Q ss_pred CceEEEEchhcccCCCCcccEEEEEEcCcceeEEeecchhhhccceeeCCceEEEeeeeeeeeEEEECCeEEEEeecC Q lcl|NC_019522. 234 PDITFEDDILLKGAGVAGADRMAVYKKEIRIVKGHDVMPLRFLAPATADNVNFKVPAILRTGGTEWRIPKAGHYVDGV 311 (311) Q Consensus 234 ~~l~i~~~~~l~~ag~~g~~~~v~y~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~gGv~i~~P~ai~~~dGI 311 (311) -.++++..+.+.. +..+++. +..+.+..-.+.+...-.+.......+....++ |+.+.+|.+++.+.-= T Consensus 199 ~G~~Vi~s~~~p~------~t~~~~~--~~a~~~~~~~~~~ve~~r~~~~~~~~i~~~~~~-~~~v~~~~~vv~~t~~ 267 (272) T protein:vir:98 199 LGVQIVRSRKCPK------GTAYMVR--KGALRIMLKRNTMVETDRDITKAINQIVANKHY-GVYLYKAEKAVKITLK 267 (272) T ss_pred cCeeEEEcCCCCc------ceEEEEc--CCeEEEEecCCceeeeccccccceeEEEEEEEE-EEEEEcCCceEEEEec Confidence 2245555554421 1223332 234444333333222111222333444444554 6889999999988655 No 91 >protein:vir:3033 Length: 272 # NCBI annotation: major capsid protein # Family: family:all:522 # MgeID: mge:61 # MgeName: PhiNIH1.1 # Cross-refs: genbank:acc:NP_438146;genbank:gi:16271809;genbank:GeneID:929235 Probab=97.78 E-value=2e-05 Score=46.39 Aligned_cols=260 Identities=10% Similarity=0.010 Sum_probs=139.0 Q ss_pred CCcccccccchhhhhhhHHHHHHHHHHHHhhhhhhhhhhhhccccCC--CCcceeEEEEEEeecccceEEecCcccccce Q lcl|NC_019522. 1 MAKSVFDVSPVSALSFLVNQAAHIESEIYRIEYPQFKYGTLLPLDNS--APDWAQAVMFRSIDARGELQLFGPNSTDVPT 78 (311) Q Consensus 1 ~~~~~~~~~~~~~~~fl~~~L~~id~~v~~~~~~~~~~~~~~~v~~~--~~~~~~~~~~~~~~~~G~a~~~~~~a~dip~ 78 (311) ||- .....+. .+.. +.+.+.+.+.....+....+.-+... +..|. ++.+..++..|.+.+++.+ +++|. T Consensus 1 MA~---~~T~~~~--~~iP--ev~s~~v~~~~~~~~~~~~~~~~~~~~~g~~G~-tv~iP~~~~~~~a~~v~eg-~~i~~ 71 (272) T protein:vir:30 1 MAV---GTTKMAQ--MLDP--EVLADMIDAEVGKAIRFAPLAEVDTTLEGQPGT-TLTVPKWDYIGDAEDVAEG-EAIPM 71 (272) T ss_pred CCC---ccccchh--eech--HHHHHHHHHHHHHHhhhhccccccccccCCCCC-EEEEEEecCCCCcccccCC-Ccccc Confidence 442 1111112 2232 22333344444444444444444332 22333 6677777888999999876 67999 Q ss_pred eeeeccceeEEEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhhheeeeeccccceeeeecCCcceeeccC Q lcl|NC_019522. 79 VDIAMSQGFKDINTAALGYTYSIEEIGFAMLNNVNLDAERGQAVRDVVEQGLNKIYLLGDKGVGEGLYTSPNVSVEAATS 158 (311) Q Consensus 79 v~~~~~~~~~~v~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aa~~~~~~~~n~~~~~G~~~~g~GllN~p~v~~~~~~~ 158 (311) .+...+.....+..++..+.++.++.+. .+.++...-...+.+.+++..++.++.--. |- .... T Consensus 72 ~~~~~~~~~~~~~~~~~~~~itd~~~~~---s~~d~~~~~~~~~~~~~a~~~d~~i~~~~~----~a--------~~~~- 135 (272) T protein:vir:30 72 TQLGFKKTTMTIKKAGKGVEITDEAILS---GYGDPVGQAAKQIVEAIDHKVDADVLDALS----KS--------TQTV- 135 (272) T ss_pred cccccceEEEEeeeeeeeeeecHHHHhh---ccccHHHHHHHHHHHHHHHHHHHHHHHHhc----cc--------cccc- Confidence 9999999999999999888888665443 456788888888888898888876552100 10 0000 Q ss_pred CccccCcccccCCHHHHHHHHHHHHHHHHhccCCceecceEEEeCHHHHHHHhccc-ccCCCCCcchHHHHHH----HhC Q lcl|NC_019522. 159 TFVALVAAIPTNGTQPIIDFFGNAYNTVYLDNTLTVHRPNTFVLPPAQFQLLARTL-LSTQNASNVTLLQFLR----TNF 233 (311) Q Consensus 159 ~~~~~~t~w~~~t~~ei~~di~~~~~~~~~~~~~~~~~p~~l~lpp~~~~~L~~~~-~~~~~~~~~Tvl~~l~----~n~ 233 (311) + + ..| +++|.+++..+-.. + ..+..++++|..+..|.+-- ...... +......+. .+. T Consensus 136 ~--~------~~t----~d~i~da~~~l~~~--~--~~~~~~vv~p~~~~~L~k~~~~~~~~~-~~~~~~~~~~g~ig~i 198 (272) T protein:vir:30 136 E--A------TAT----VDGVSKALDIFNDE--D--DAETVIVMNPADASTLRLDAAKEWLGA-TEVGANRVVSGVYGEV 198 (272) T ss_pred c--c------ccC----HHHHHHHHHHHhcc--C--CCccEEEEcHHHHHHHHHhcccccccc-ccccccccccccchhh Confidence 0 0 012 56677777776422 2 34678999999998885421 100000 000001110 011 Q ss_pred CceEEEEchhcccCCCCcccEEEEEEcCcceeEEeecchhhhccceeeCCceEEEeeeeeeeeEEEECCeEEEEeecC Q lcl|NC_019522. 234 PDITFEDDILLKGAGVAGADRMAVYKKEIRIVKGHDVMPLRFLAPATADNVNFKVPAILRTGGTEWRIPKAGHYVDGV 311 (311) Q Consensus 234 ~~l~i~~~~~l~~ag~~g~~~~v~y~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~gGv~i~~P~ai~~~dGI 311 (311) -.++++..+.+.. +..+++. +..+.+..-.+.+...-.+.......+....++ |+.+.+|.+++.+.-= T Consensus 199 ~G~~Vi~s~~~p~------~t~~~~~--~~a~~~~~~~~~~ve~~r~~~~~~~~i~~~~~~-~~~v~~~~~vv~~t~~ 267 (272) T protein:vir:30 199 LGVQIVRSRKCPK------GTAYMVR--KGALRIMLKRNTMVETDRDITKAINQIVANKHY-GVYLYKAEKAVKITLK 267 (272) T ss_pred cCeeEEEcCCCCc------ceEEEEc--CCeEEEEecCCceeeeccccccceeEEEEEEEE-EEEEEcCCceEEEEec Confidence 2245555554421 1223332 234444333333222111222333444444554 6889999999988655 No 92 >protein:vir:1268 Length: 397 # NCBI annotation: hypothetical protein # Family: family:all:21 # MgeID: mge:329 # MgeName: phi-105 # Cross-refs: genbank:acc:NP_690760;genbank:gi:22855000;genbank:GeneID:955203 Probab=97.76 E-value=5e-06 Score=49.65 Aligned_cols=266 Identities=11% Similarity=-0.038 Sum_probs=142.2 Q ss_pred CCcccccccchhhhhhhHHHHHHHHHHHHhhhhhhhhhhhhccccCCCCcceeEEEEEEeecccceEEecCccccccee- Q lcl|NC_019522. 1 MAKSVFDVSPVSALSFLVNQAAHIESEIYRIEYPQFKYGTLLPLDNSAPDWAQAVMFRSIDARGELQLFGPNSTDVPTV- 79 (311) Q Consensus 1 ~~~~~~~~~~~~~~~fl~~~L~~id~~v~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~G~a~~~~~~a~dip~v- 79 (311) ....+|.+.+.+.+++++. +.+.+.|++........+.++++..-. ...-.+.+......+.+.|++.++. +|.. T Consensus 118 ~~~~a~~~~~~~~gg~lvP--~~~~~~ii~~~~~~~~l~~~~~~~~~~-~~~~~~~~~~~~~~~~a~~v~Eg~~-~~~~~ 193 (397) T protein:vir:12 118 PEFRAMSGINDEDGGILIP--EDIGRQIHEFKRQFEPLEQYVTVEPVT-TRSGTRLLEKNADMVPFSPVEELGN-LPEID 193 (397) T ss_pred hhhhhccccccccCcccCc--hhHHHHHHHhhhhhhhHHhhcceeecc-CCceeEEEEEecCCcceeeeccccc-ccccc Confidence 2223444444445566665 556778888888777777776543211 1122333444455567788887654 4643 Q ss_pred eeeccceeEEEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhhheeeeeccccc-eeeeecCCcceeeccC Q lcl|NC_019522. 80 DIAMSQGFKDINTAALGYTYSIEEIGFAMLNNVNLDAERGQAVRDVVEQGLNKIYLLGDKGVG-EGLYTSPNVSVEAATS 158 (311) Q Consensus 80 ~~~~~~~~~~v~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aa~~~~~~~~n~~~~~G~~~~g-~GllN~p~v~~~~~~~ 158 (311) ...++......+.++..+.+|.+ +. ..+..+|..--.....+++.+.+|.-+++|+.... .|++ T Consensus 194 ~~~~~~v~~~~~k~~~~~~is~e-~l--~ds~~~l~~~i~~~l~~~~~~~~d~~il~G~g~~~~~g~~------------ 258 (397) T protein:vir:12 194 QPRFTKVSYSIIDYGGIMTLSNS-ML--NDSDQAIMTYVAKWFAKKSVVTRNNLILAAIASLKKVDID------------ 258 (397) T ss_pred cccceeEEeeheeeEeeehhhHH-HH--hhchHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccc------------ Confidence 35678888899999998888854 32 34456788888888999999999999999976421 1211 Q ss_pred CccccCcccccCCHHHHHHHHHHHHH-HHHhccCCceecceEEEeCHHHHHHHhcccccCCCCCcchHHHHHHHhCCceE Q lcl|NC_019522. 159 TFVALVAAIPTNGTQPIIDFFGNAYN-TVYLDNTLTVHRPNTFVLPPAQFQLLARTLLSTQNASNVTLLQFLRTNFPDIT 237 (311) Q Consensus 159 ~~~~~~t~w~~~t~~ei~~di~~~~~-~~~~~~~~~~~~p~~l~lpp~~~~~L~~~~~~~~~~~~~Tvl~~l~~n~~~l~ 237 (311) + ++||.+++. .+. .. ......++++|+.+..|.+-. . ..|.-++.--..++..-+ T Consensus 259 ------------~----~~~i~~~~~~~l~--~~--~~~~a~~~~n~~~~~~L~~lk-d---~~G~~l~~~~~~~g~~~~ 314 (397) T protein:vir:12 259 ------------G----LDGIKKALNVTLD--PM--VAPGSIVLTNQDGYDWLDTLK-D---GTGRYLLQPDPTNPTKKL 314 (397) T ss_pred ------------c----HHHHHHHHhhccc--hh--hhCCCEEEEcHHHHHHHHHhh-c---cCCceeecccccCCCCcc Confidence 1 334444443 221 11 123346899999999886532 1 112211110001111112 Q ss_pred EEE-----chhcccCCCCcccEEEEEEcCcceeEEeecchhhhccceeeC----CceEEEeeeeeeeeEEEECCeEEEEe Q lcl|NC_019522. 238 FED-----DILLKGAGVAGADRMAVYKKEIRIVKGHDVMPLRFLAPATAD----NVNFKVPAILRTGGTEWRIPKAGHYV 308 (311) Q Consensus 238 i~~-----~~~l~~ag~~g~~~~v~y~~~~~~~~~~~~~~~~~~~p~~~~----~~~~~~~~~~~~gGv~i~~P~ai~~~ 308 (311) +-. ++.+......| +..+++-+=.+.+.+..-..+++..-.+.. .-...+.++.|+ +..+++|.|++.+ T Consensus 315 l~G~pv~~~~~~~~~~~~~-~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~f~~~~~~~r~~~r~-d~~~~~~~a~~~~ 392 (397) T protein:vir:12 315 LDGRPVVPFTNRVLKTQKG-KAPLIIGNLKEAIVLFDREQQSIASTDTGAGAFETNSTKVRGIERE-DVRKWDEDAVVFG 392 (397) T ss_pred ccceeeEEecccccccCCC-ccEEEEEehhceEEEEeecceEEEEeccccchhhcCceEEEEEEee-ccEEecccceEEE Confidence 222 22111111122 222344333333333222222221100110 113456677887 4677999999988 Q ss_pred ecC Q lcl|NC_019522. 309 DGV 311 (311) Q Consensus 309 dGI 311 (311) +-= T Consensus 393 ~~t 395 (397) T protein:vir:12 393 QIT 395 (397) T ss_pred EEe Confidence 766 No 93 >protein:vir:4953 Length: 397 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:108 # MgeName: Sfi19 # Cross-refs: genbank:acc:NP_049929;genbank:gi:9632900;genbank:GeneID:1262076 Probab=97.71 E-value=9.3e-06 Score=48.16 Aligned_cols=265 Identities=14% Similarity=0.005 Sum_probs=143.2 Q ss_pred CCcccccccchhhhhhhHHHHHHHHHHHHhhhhhhhhhhhhccccCCCCcceeEEEEEE-eecccceEEecCcccccce- Q lcl|NC_019522. 1 MAKSVFDVSPVSALSFLVNQAAHIESEIYRIEYPQFKYGTLLPLDNSAPDWAQAVMFRS-IDARGELQLFGPNSTDVPT- 78 (311) Q Consensus 1 ~~~~~~~~~~~~~~~fl~~~L~~id~~v~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~-~~~~G~a~~~~~~a~dip~- 78 (311) -....+...+.+.+++++. +.+.+.|++........++++.+..-.+ ..-.+.|.. .+..+.+.|++.++ .+|. T Consensus 104 ~~~~~~~~~t~~~gg~~vP--~~~~~~ii~~~~~~~~l~~~~~~~~~~~-~~~~~~~~~~~~~~~~a~~v~E~~-~~~~~ 179 (397) T protein:vir:49 104 NLLDSKTDASGSDAGLTIP--QDIQTAIHTLVSQYDSLQEYVNVENVTT-LTGSRVYEKWTDITGLANIDDEAG-KIADV 179 (397) T ss_pred HHHHHhhccccccCccccc--HhHHHHHHHHHHhhhhHHhhhceeeccc-CccceEEEeeccCCcceeeecCcc-ccccc Confidence 1111222233344556665 4566778888877777777766543221 122233333 33457788887764 4563 Q ss_pred eeeeccceeEEEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhhheeeeeccccceeeeecCCcceeeccC Q lcl|NC_019522. 79 VDIAMSQGFKDINTAALGYTYSIEEIGFAMLNNVNLDAERGQAVRDVVEQGLNKIYLLGDKGVGEGLYTSPNVSVEAATS 158 (311) Q Consensus 79 v~~~~~~~~~~v~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aa~~~~~~~~n~~~~~G~~~~g~GllN~p~v~~~~~~~ 158 (311) ....++......+.++..+.+|.+ +.. .+..++..--.....+++...+|+-+++|+..... T Consensus 180 ~~~~~~~i~~~~~k~~~~~~iS~e-ll~--ds~~~l~~~i~~~l~~~~~~~~d~ai~~G~g~~~~--------------- 241 (397) T protein:vir:49 180 DDPKLSLIKYTIKRYAGISTVTNS-LLA--DSAENILAWLSGWIAKKVVVTRNKAILEAIAALPT--------------- 241 (397) T ss_pred cccceeeEEeeeeeEEeeehhHHH-HHh--hhHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccc--------------- Confidence 457788999999999998888844 432 34567888888889999999999999998764210 Q ss_pred CccccCcccccCCHHHHHHHHHHHHHHHHhccCCceecceEEEeCHHHHHHHhcccccCCCCCcchHHHH-HHHh----- Q lcl|NC_019522. 159 TFVALVAAIPTNGTQPIIDFFGNAYNTVYLDNTLTVHRPNTFVLPPAQFQLLARTLLSTQNASNVTLLQF-LRTN----- 232 (311) Q Consensus 159 ~~~~~~t~w~~~t~~ei~~di~~~~~~~~~~~~~~~~~p~~l~lpp~~~~~L~~~~~~~~~~~~~Tvl~~-l~~n----- 232 (311) .++.+ + +++|.+++.++... + .....++|+|+.+..|.+-. +..|.-++.- +... T Consensus 242 --~~~~~-----~----~d~i~~~~~~l~~~--~--~~~a~~vmn~~~~~~l~~lk----d~~G~~l~~~~~~~~~~~~l 302 (397) T protein:vir:49 242 --KPTLT-----K----WDDIIDLEAKVDPA--I--KQTSFFLTNTSGFTALKKVK----NALGDYLMERDVKSPTGYSI 302 (397) T ss_pred --ccccc-----c----HHHHHHHHHhhhhh--h--cCCCEEEEcHHHHHHHHHhh----cCCCceeeccCcCCCCCcee Confidence 00011 1 45677777777432 1 23457899999999986532 1112222110 0000 Q ss_pred -CCceEEEEchhcccCCCCcccEEEEEEcCcceeEEeecchhhhcc-c-----eeeCCceEEEeeeeeeeeEEEECCeEE Q lcl|NC_019522. 233 -FPDITFEDDILLKGAGVAGADRMAVYKKEIRIVKGHDVMPLRFLA-P-----ATADNVNFKVPAILRTGGTEWRIPKAG 305 (311) Q Consensus 233 -~~~l~i~~~~~l~~ag~~g~~~~v~y~~~~~~~~~~~~~~~~~~~-p-----~~~~~~~~~~~~~~~~gGv~i~~P~ai 305 (311) +.++.+.....+ ..+..+.. .++|-+-.+.+-+..-..+++.. + .+. + ...+.++.|+ ++.+++|.+| T Consensus 303 ~G~PV~~~~~~~~-~~~~~~~~-~i~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~-~-~~~~r~~~r~-d~~~~~~~a~ 377 (397) T protein:vir:49 303 DGFAVKEVADRWL-ANGTGGAM-PLYFGDLKQAVTLFDRQHMSLLSTNIGGGAFET-D-TTKVRVIDRF-DVVATDTEAF 377 (397) T ss_pred cceeeEEeccccc-ccccCCce-eEEEeeccceEEEEeecceEEEEeccccchhhc-C-ceeEEEEeee-CcEEecccce Confidence 111222222222 22333332 34443333333222212222110 1 111 1 2345566776 5688999999 Q ss_pred EEeecC Q lcl|NC_019522. 306 HYVDGV 311 (311) Q Consensus 306 ~~~dGI 311 (311) +.++-= T Consensus 378 ~~~~~~ 383 (397) T protein:vir:49 378 VPASFK 383 (397) T ss_pred EEEEee Confidence 887754 No 94 >protein:vir:4511 Length: 409 # NCBI annotation: capsid # Family: family:all:21 # MgeID: mge:97 # MgeName: V # Cross-refs: genbank:acc:NP_599037;genbank:gi:19548995;genbank:GeneID:935211 Probab=97.66 E-value=9e-06 Score=48.24 Aligned_cols=279 Identities=11% Similarity=0.037 Sum_probs=140.4 Q ss_pred CCcccccccchhhhhhhHHHHHHHHHHHHhhhhhhhhhhhhccccCCCCcceeEEEEEEeeccc-ceEEecCccccccee Q lcl|NC_019522. 1 MAKSVFDVSPVSALSFLVNQAAHIESEIYRIEYPQFKYGTLLPLDNSAPDWAQAVMFRSIDARG-ELQLFGPNSTDVPTV 79 (311) Q Consensus 1 ~~~~~~~~~~~~~~~fl~~~L~~id~~v~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~G-~a~~~~~~a~dip~v 79 (311) -..-++.+.....+++++. +.+...|++........+.++.+.+-.+ ...+.+...+..+ .+.+.+.. ..+|.. T Consensus 112 ~~~~a~~~~~~~~gg~liP--~~~~~~ii~~~~~~~~l~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~v~E~-~~~~~~ 186 (409) T protein:vir:45 112 RELRAQGVAQDEKGGYTVP--ETFLAKVVEKMKSYGGIASVAQILTTSD--GRTMEWATADGTSEVGVLLGEN-EEAGEE 186 (409) T ss_pred HHHhhccCccCcCCceecc--HhHHHHHHHHHHhhhhhhhhceeeecCC--CceEEEEeeccCcccccccccc-cccccc Confidence 0111222233344566665 4456778887777666666655443221 2233444444433 44566554 446777 Q ss_pred eeeccceeEEEEEEEE-EEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhhheeeeecccc---c-eeeeecCCccee Q lcl|NC_019522. 80 DIAMSQGFKDINTAAL-GYTYSIEEIGFAMLNNVNLDAERGQAVRDVVEQGLNKIYLLGDKGV---G-EGLYTSPNVSVE 154 (311) Q Consensus 80 ~~~~~~~~~~v~~~~~-~~~~~~~El~~a~~~g~~l~~~k~~aa~~~~~~~~n~~~~~G~~~~---g-~GllN~p~v~~~ 154 (311) +..+.......+.... -..+|.+=++ .+..+|...-......++...+++-+++|+... . .|+++.+..... T Consensus 187 ~~~f~~~~l~~~k~~~~~i~is~ell~---ds~~~l~~~i~~~la~a~~~~~~~a~l~G~G~~~~~~p~Gil~~~~~~~~ 263 (409) T protein:vir:45 187 DTDFGMGSLGALKMTSKIIRVSNELLQ---DSAIDMEAYLARRIAERIGRGEARYLIQGTGAGTPKQPKGLAASVTGTTQ 263 (409) T ss_pred ccccceeeeeeeeeeeeehhhhHHHHh---ccHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCccccceeeeccccccc Confidence 7777766665555433 3456644332 234678888888899999999999999998652 3 599987664322 Q ss_pred eccCCccccCcccccCCHHHHHHHHHHHHHHHHhccCCceecceEEEeCHHHHHHHhcccccCCCCCcchHHHH-HHHhC Q lcl|NC_019522. 155 AATSTFVALVAAIPTNGTQPIIDFFGNAYNTVYLDNTLTVHRPNTFVLPPAQFQLLARTLLSTQNASNVTLLQF-LRTNF 233 (311) Q Consensus 155 ~~~~~~~~~~t~w~~~t~~ei~~di~~~~~~~~~~~~~~~~~p~~l~lpp~~~~~L~~~~~~~~~~~~~Tvl~~-l~~n~ 233 (311) +... ..-+ ++||.+++..|... +....--.+++.+..+..|.+-. +..|.-++.- +.... T Consensus 264 ~~~~---------~~~~----~d~i~~l~~~l~~~--~~~~a~~~~~~n~~~~~~l~~lk----d~~G~~i~~~~~~~~~ 324 (409) T protein:vir:45 264 TAAA---------NAVK----WQEILALKHSIDPA--YRRGPKFRLAFNDNTLKLISEME----DGQGRPLWLPDIVGVA 324 (409) T ss_pred cccc---------cccc----hHHHHHHHHhhhhh--hccCCeEEEEECHHHHHHHHHhh----cCCCceeeccCcCCCC Confidence 2111 0112 35566666666321 21111124677998888875421 1112222110 00000 Q ss_pred ----CceEEEEchhcccCCCCcccEEEEEEcCcceeEEeecchhhh--c-cceeeCCceEEEeeeeeeeeEEEECCeEEE Q lcl|NC_019522. 234 ----PDITFEDDILLKGAGVAGADRMAVYKKEIRIVKGHDVMPLRF--L-APATADNVNFKVPAILRTGGTEWRIPKAGH 306 (311) Q Consensus 234 ----~~l~i~~~~~l~~ag~~g~~~~v~y~~~~~~~~~~~~~~~~~--~-~p~~~~~~~~~~~~~~~~gGv~i~~P~ai~ 306 (311) ....++....+.+.+ .|.+. ++|-+=.+++ +..-..++. . .+.- ......+.+..|+ +..+.+|.|++ T Consensus 325 ~~~l~G~PV~~~~~~p~~~-~~~~~-i~~Gd~~~~~-i~~~~~~~~~~~~d~~~-~~~~~~~~~~~r~-d~~~~~~~A~~ 399 (409) T protein:vir:45 325 PASVLNVPYVIDQEIDDIG-AGKKF-MFCGDFDRFI-IRRVRYMILKRLVERYA-EYDQTGFLAFHRF-DCILEDTSAIK 399 (409) T ss_pred CceecceeeEEecCcCCcc-CCccE-EEEeehhhhh-eeeccceEEEEeecccc-cCCcEEEEEEEEe-ccEeechhheE Confidence 112233333333322 23332 4442212222 111112211 1 1110 1122445667787 46699999999 Q ss_pred EeecC Q lcl|NC_019522. 307 YVDGV 311 (311) Q Consensus 307 ~~dGI 311 (311) .+.+= T Consensus 400 ~l~~k 404 (409) T protein:vir:45 400 ALVGK 404 (409) T ss_pred EEEec Confidence 88775 No 95 >protein:vir:4856 Length: 293 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:106 # MgeName: DT1 # Cross-refs: genbank:acc:NP_049396;genbank:gi:9632424;genbank:GeneID:1258532 Probab=97.66 E-value=2.3e-05 Score=46.00 Aligned_cols=265 Identities=14% Similarity=-0.016 Sum_probs=140.8 Q ss_pred CCcccccccchhhhhhhHHHHHHHHHHHHhhhhhhhhhhhhccccCCCCcceeEEEEEEee-cccceEEecCcccccce- Q lcl|NC_019522. 1 MAKSVFDVSPVSALSFLVNQAAHIESEIYRIEYPQFKYGTLLPLDNSAPDWAQAVMFRSID-ARGELQLFGPNSTDVPT- 78 (311) Q Consensus 1 ~~~~~~~~~~~~~~~fl~~~L~~id~~v~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~-~~G~a~~~~~~a~dip~- 78 (311) +..++..+. +++++++. +.+.+.|++........+++..+.. .+.....+.+.... ..+.+.|++.++. +|. T Consensus 2 l~~~~~~t~--~~gg~liP--~~~~~~Ii~~~~~~~~l~~~~~~~~-~~~~~g~~~~~~~~~~~~~a~~v~Eg~~-~~~~ 75 (293) T protein:vir:48 2 LDSKTDHSG--SDAGLTIP--QDIRTAINTLVRQYDSLQEYVNVEN-VTTLTGSRVYEKWTDITGLANIDDEAGK-IADI 75 (293) T ss_pred ceeeccccc--CcCceEec--hhHHHHHHHHHHhhhhhhhhceeee-ccCCcceEEEEeecCCCcceeeecCCcc-cccc Confidence 333333222 23445554 4566778888777777777665432 22222333444333 4567888876643 564 Q ss_pred eeeeccceeEEEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhhheeeeeccccceeeeecCCcceeeccC Q lcl|NC_019522. 79 VDIAMSQGFKDINTAALGYTYSIEEIGFAMLNNVNLDAERGQAVRDVVEQGLNKIYLLGDKGVGEGLYTSPNVSVEAATS 158 (311) Q Consensus 79 v~~~~~~~~~~v~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aa~~~~~~~~n~~~~~G~~~~g~GllN~p~v~~~~~~~ 158 (311) .+..+++.....+.++..+.+|.+=++ .+..+|.+.-....++++...+|+-++.|..... + T Consensus 76 ~~~~~~~i~l~~~k~~~~~~iS~ell~---ds~~~l~~~i~~~la~~~~~~~~~~i~~g~~~~~--------------~- 137 (293) T protein:vir:48 76 DDPKLSLIKYTIKRYAGISTVTNSLLA---DSAENILAWLSGWIAKKVVVTRNKAILGVVDKLP--------------T- 137 (293) T ss_pred cccceeEEEEeeeEEEEeehhhHHHHh---hhhHHHHHHHHHHHHHHHHHHHHhHHhhcccccc--------------c- Confidence 346788889999999998888854333 3456788888888999999999998887754311 0 Q ss_pred CccccCcccccCCHHHHHHHHHHHHHHHHhccCCceecceEEEeCHHHHHHHhcccccCCCCCcchHHHHHHHhC----- Q lcl|NC_019522. 159 TFVALVAAIPTNGTQPIIDFFGNAYNTVYLDNTLTVHRPNTFVLPPAQFQLLARTLLSTQNASNVTLLQFLRTNF----- 233 (311) Q Consensus 159 ~~~~~~t~w~~~t~~ei~~di~~~~~~~~~~~~~~~~~p~~l~lpp~~~~~L~~~~~~~~~~~~~Tvl~~l~~n~----- 233 (311) . -...+ ++||.+++.++... + .....++|+|+.+..|.+-. +..+.-+++=-..++ T Consensus 138 -~------~~~~~----~d~i~~~~~~l~~~--~--~~~a~~vmn~~~~~~L~~lk----d~~g~~l~~~~~~~~~~~~l 198 (293) T protein:vir:48 138 -K------PTLTK----WDDIIDLEAKVDPA--I--KQTSFFLTNTSGFTALKKVK----NALGDYLMERDVKSPTGYSI 198 (293) T ss_pred -c------ccccC----HHHHHHHHHhhhhh--h--cCCCEEEEcHHHHHHHHHhh----ccCCceEeecCcCCCCCcee Confidence 0 01112 45677777777422 2 12347899999999986532 111221111000111 Q ss_pred --CceEEEEchhcccCCCCcccEEEEEEcCcceeEEeecchhhhccceeeC----CceEEEeeeeeeeeEEEECCeEEEE Q lcl|NC_019522. 234 --PDITFEDDILLKGAGVAGADRMAVYKKEIRIVKGHDVMPLRFLAPATAD----NVNFKVPAILRTGGTEWRIPKAGHY 307 (311) Q Consensus 234 --~~l~i~~~~~l~~ag~~g~~~~v~y~~~~~~~~~~~~~~~~~~~p~~~~----~~~~~~~~~~~~gGv~i~~P~ai~~ 307 (311) .++.+.....+. ....+... ++|-+-.+.+.+..-..+++..-.+.+ .-...+.+..|++ +.+++|.|++. T Consensus 199 ~G~Pv~~~~~~~~~-~~~~~~~~-~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~r~~~r~d-~~~~~~~a~~~ 275 (293) T protein:vir:48 199 AGFAVKEISDRWLP-NASSGVMP-LYFGDLKQAVTLFDRQQMSLLSTNIGGGAFETDTTKVRVIDRFD-VVATDTEAFVP 275 (293) T ss_pred cceeeEEecccccC-CccCCceE-EEEEeccceEEEEEecceEEEEecccchhhhcCeEEEEEEEeeC-cEEecccceEE Confidence 122222222221 12223323 333332332222111222111000110 1124566678875 56789999998 Q ss_pred eecC Q lcl|NC_019522. 308 VDGV 311 (311) Q Consensus 308 ~dGI 311 (311) ++.= T Consensus 276 l~~~ 279 (293) T protein:vir:48 276 ASFK 279 (293) T ss_pred EEee Confidence 7744 No 96 >protein:vir:96833 Length: 275 # NCBI annotation: ORF015 # Family: family:all:522 # MgeID: mge:1642 # MgeName: EW # Cross-refs: genbank:acc:YP_240157;genbank:gi:66395822;genbank:GeneID:5133174 Probab=97.64 E-value=2.7e-05 Score=45.59 Aligned_cols=262 Identities=9% Similarity=0.012 Sum_probs=129.6 Q ss_pred CCcccccccchhhhhhhHHHHHHHHHHHHhhhhhhhhhhhhccccCC--CCcceeEEEEEEeecccceEEecCcccccce Q lcl|NC_019522. 1 MAKSVFDVSPVSALSFLVNQAAHIESEIYRIEYPQFKYGTLLPLDNS--APDWAQAVMFRSIDARGELQLFGPNSTDVPT 78 (311) Q Consensus 1 ~~~~~~~~~~~~~~~fl~~~L~~id~~v~~~~~~~~~~~~~~~v~~~--~~~~~~~~~~~~~~~~G~a~~~~~~a~dip~ 78 (311) ||+++.-.. +.+ +..| .+-+-|.+.....+....+..+... +.+ -.++.++.++..|.++.+..+ ++|+. T Consensus 1 ~~~~~~T~l--~d~--i~PE--v~~~~v~~~~~~~~~~~~~~~~~~~l~g~~-G~tv~iP~~~~ig~a~~~~~g-~~i~~ 72 (275) T protein:vir:96 1 MALENMTKL--ANM--VNPE--VLAPMMQAELDKKLKFAQFADIDNTLVGQP-GNTITFPAFVYSGDAKVVPEG-EEIPI 72 (275) T ss_pred CCCcccchh--hhh--hchH--HHHHHHHHHHHHhhhhcccceecccccCCC-CCEEEeeeeccCCccccccCC-CCcch Confidence 666554111 111 1221 1222233333344455555544443 222 357788888888999988765 57888 Q ss_pred eeeeccceeEEEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhhheeeeeccccceeeeecCCcceeeccC Q lcl|NC_019522. 79 VDIAMSQGFKDINTAALGYTYSIEEIGFAMLNNVNLDAERGQAVRDVVEQGLNKIYLLGDKGVGEGLYTSPNVSVEAATS 158 (311) Q Consensus 79 v~~~~~~~~~~v~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aa~~~~~~~~n~~~~~G~~~~g~GllN~p~v~~~~~~~ 158 (311) -++..+.....+...+.+|+++ |++..+. +.++-.+-.+.+...++++.|+-++- .++.+-+ T Consensus 73 ~~lt~~~~~~~i~~~~~~~~i~--D~~~~~~-~~d~~~~~~~~~a~~~a~~~d~~ll~---~l~~a~~------------ 134 (275) T protein:vir:96 73 DLIETKKRQATIRKIGKGTVLT--DEALLSG-YGDPKGEAVRQHGLAIANKVDNDVLE---ALQGATL------------ 134 (275) T ss_pred hhcccceeeEEeehhccccccc--HHHHHhh-ccchHHHHHHHHHHHHHHHHHHHHHH---HHhcccc------------ Confidence 8888888888888876666555 4554344 44455566666777788887776541 1111100 Q ss_pred CccccCcccccCCHHHHHHHHHHHHHHHHhccCCceecceEEEeCHHHHHHHhcc----cccCCCCCcchHHH-HHHHhC Q lcl|NC_019522. 159 TFVALVAAIPTNGTQPIIDFFGNAYNTVYLDNTLTVHRPNTFVLPPAQFQLLART----LLSTQNASNVTLLQ-FLRTNF 233 (311) Q Consensus 159 ~~~~~~t~w~~~t~~ei~~di~~~~~~~~~~~~~~~~~p~~l~lpp~~~~~L~~~----~~~~~~~~~~Tvl~-~l~~n~ 233 (311) ..... .-+ ++.|.+++..+-.. ...+..|+++|+.+..|.+- .... +..+..++. -....+ T Consensus 135 --~~~~~---~~~----~d~i~dA~~~lgd~----~~~~~~ivv~p~~~~~L~k~~~~~f~~~-~~~g~~~~~~G~ig~~ 200 (275) T protein:vir:96 135 --KVEAD---ITK----LAGLQTAIDKFNDE----DLEPMVLFVNPLDAGKLRASATDNFTRA-TLLGDNVIVKGAFGEA 200 (275) T ss_pred --ccccc---ccC----HHHHHHHHHHhccc----cCCccEEEeCHHHHHHHHhccccccccc-ccccccceecccccee Confidence 00000 012 45566677666322 23578999999999988542 1111 111111100 000011 Q ss_pred CceEEEEchhcccCCCCcccEEEEEEcCcceeEEeecchhhhccceeeCCceEEEeeeeeeeeEEEECCeEEEEee---- Q lcl|NC_019522. 234 PDITFEDDILLKGAGVAGADRMAVYKKEIRIVKGHDVMPLRFLAPATADNVNFKVPAILRTGGTEWRIPKAGHYVD---- 309 (311) Q Consensus 234 ~~l~i~~~~~l~~ag~~g~~~~v~y~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~gGv~i~~P~ai~~~d---- 309 (311) -.++|+....+. ....+++. +.-+.+....+.+...-.......-.+.. -...|+.+.+|..++.+. T Consensus 201 ~G~~Vi~s~~~p------~~t~~i~~--~gA~~~~~~~~~~vE~~Rd~~~~~d~i~~-~~~y~~~~~~~~~vv~~t~~~~ 271 (275) T protein:vir:96 201 LGAIIVRSNKIK------EGEAILAK--RGAVKLITKRDFFLETERHASHKSTALFS-DKHYVAYLYDESKVVKITKSAS 271 (275) T ss_pred cCeeEEEeCCCC------cceEEEEe--ccceeeeecCCcccccccchhhcCcEEEE-eEEEEEEEEcCccEEEEEeccc Confidence 224555444432 12223332 22333333233221111111111222222 233588999999998864 Q ss_pred --cC Q lcl|NC_019522. 310 --GV 311 (311) Q Consensus 310 --GI 311 (311) |+ T Consensus 272 ~~~~ 275 (275) T protein:vir:96 272 GLGV 275 (275) T ss_pred ccCC Confidence 22 No 97 >protein:vir:9509 Length: 381 # NCBI annotation: hypothetical protein # Family: family:all:635 # MgeID: mge:170 # MgeName: phiN315 # Cross-refs: genbank:acc:NP_835556;genbank:gi:30043951;genbank:GeneID:1260537 Probab=97.59 E-value=2.6e-05 Score=45.75 Aligned_cols=284 Identities=10% Similarity=0.022 Sum_probs=147.0 Q ss_pred CCc-----------ccccccchhhhhhhHHHHHHHHHHHHhhhhhhhhhhhhccccCCCCcceeEEEEEEeecccceEEe Q lcl|NC_019522. 1 MAK-----------SVFDVSPVSALSFLVNQAAHIESEIYRIEYPQFKYGTLLPLDNSAPDWAQAVMFRSIDARGELQLF 69 (311) Q Consensus 1 ~~~-----------~~~~~~~~~~~~fl~~~L~~id~~v~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~G~a~~~ 69 (311) +.. -.+.+.+.+++++++. +.+..+|++.....-..+.++.+..- + .. ..+...+..+.+.|. T Consensus 60 ~~~~~lt~~e~~~~~~~~~~~~~~gg~lvP--~~~~~~I~~~l~~~s~i~~~~~v~~~-~--~~-~~i~~~~~~~~a~w~ 133 (381) T protein:vir:95 60 KSAQSLSANQRSFFMDINKNVNYKEEKLLP--EETIDRIFEDLTTNHPLLADLGIKNA-G--LR-LKFLKSETSGVAVWG 133 (381) T ss_pred cCcccccHHHHHHHHHHhcccCCCCceecC--HHHHHHHHHHHHhhccceeheeeEec-C--cc-eEEEEecCCcceeee Confidence 111 1122334455667775 45667788777665566666655432 2 12 244556677888887 Q ss_pred cCcccccc-eeeeeccceeEEEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhhheeeeeccccc-eeeee Q lcl|NC_019522. 70 GPNSTDVP-TVDIAMSQGFKDINTAALGYTYSIEEIGFAMLNNVNLDAERGQAVRDVVEQGLNKIYLLGDKGVG-EGLYT 147 (311) Q Consensus 70 ~~~a~dip-~v~~~~~~~~~~v~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aa~~~~~~~~n~~~~~G~~~~g-~GllN 147 (311) +..+ .++ ..+..+++.....+.++.-..++.+ |. .....+|+.--.....++++..+++-+++|+.... .|+|+ T Consensus 134 ~e~~-~~~~~~~~~f~~i~l~~~kl~~~~~is~e-lL--~Ds~~~ie~~i~~~la~~~a~~~~~a~i~G~G~~qP~Gil~ 209 (381) T protein:vir:95 134 KIYG-EIKGQLDAAFSEETAIQNKLTAFVVLPKD-LN--DFGPAWIERFVRVQIEEAFAVALETAFLKGTGKDQPIGLNR 209 (381) T ss_pred cccc-cccccccccceeeeecceeEEeechhhHH-Hh--hcCHHHHHHHHHHHHHHHHHHHhhheeEeccCCCCceeeee Confidence 6543 344 3456778888899999888888743 33 23456788888999999999999999999997655 69999 Q ss_pred cCCcceeeccCCccc--cCcccccCCHHHHHHHHHHHHHHHHhccCCc--eec-ceEEEeCHHHHHHHhcccccCCCCCc Q lcl|NC_019522. 148 SPNVSVEAATSTFVA--LVAAIPTNGTQPIIDFFGNAYNTVYLDNTLT--VHR-PNTFVLPPAQFQLLARTLLSTQNASN 222 (311) Q Consensus 148 ~p~v~~~~~~~~~~~--~~t~w~~~t~~ei~~di~~~~~~~~~~~~~~--~~~-p~~l~lpp~~~~~L~~~~~~~~~~~~ 222 (311) +++.......+.... ....+...++.-+++.+..++..+-....+. -.. .-+++|.+..+..|....... +..| T Consensus 210 ~~~~~~~~~~g~~~~~~~~~t~t~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~a~~~mn~~t~~~l~~~~~~~-~~~G 288 (381) T protein:vir:95 210 QVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTHL-NANG 288 (381) T ss_pred ccCcccccccccccccccccccccccchhhHHHHHHHHHhhccccccccccccCceEEEEccccHHhhccccccC-CCCC Confidence 876422211110000 0011223334444555555554442211110 011 125677877766654322111 1111 Q ss_pred chHHHHHHHhCCceEEEEchhcccCCCCcccEEEEEEcCcceeEEeecchhhhcc----ceeeCCceEEEeeeeeeeeEE Q lcl|NC_019522. 223 VTLLQFLRTNFPDITFEDDILLKGAGVAGADRMAVYKKEIRIVKGHDVMPLRFLA----PATADNVNFKVPAILRTGGTE 298 (311) Q Consensus 223 ~Tvl~~l~~n~~~l~i~~~~~l~~ag~~g~~~~v~y~~~~~~~~~~~~~~~~~~~----p~~~~~~~~~~~~~~~~gGv~ 298 (311) . |+..-..++.|+..+.... +.++..+.+ .+.- .....+++.. ..... ...+....|.+ .. T Consensus 289 ~----~v~~l~~g~~vv~s~~~p~------~~iifgDfs-~Y~i-~~r~~~~i~~~~~~~~~~d--~~~f~a~~r~d-g~ 353 (381) T protein:vir:95 289 V----YVTALPFNLNVIESTVQEA------GKVLTYVKG-LYDG-YLAGGINVQKFKETLALDD--MDLYTAKQFAY-GK 353 (381) T ss_pred c----eeecCCCCceEEecCCCCc------CcEEEEecc-cEEE-EEecccEEEeechhHhhcC--CeEEEEEEEEc-CE Confidence 1 1101111344554443321 112222222 2221 1122222110 01111 13455567764 56 Q ss_pred EECCeEEEEeecC Q lcl|NC_019522. 299 WRIPKAGHYVDGV 311 (311) Q Consensus 299 i~~P~ai~~~dGI 311 (311) ++.|.|+++++ | T Consensus 354 ~~~~~A~~v~~-l 365 (381) T protein:vir:95 354 AKDNKVAAVWK-L 365 (381) T ss_pred EecCceEEEEE-E Confidence 78999998877 6 No 98 >protein:vir:101291 Length: 381 # NCBI annotation: hypothetical protein # Family: family:all:635 # MgeID: mge:1591 # MgeName: phiNM3 # Cross-refs: genbank:acc:YP_908831;genbank:gi:118725095;genbank:GeneID:4555862 Probab=97.59 E-value=2.6e-05 Score=45.75 Aligned_cols=284 Identities=10% Similarity=0.022 Sum_probs=147.0 Q ss_pred CCc-----------ccccccchhhhhhhHHHHHHHHHHHHhhhhhhhhhhhhccccCCCCcceeEEEEEEeecccceEEe Q lcl|NC_019522. 1 MAK-----------SVFDVSPVSALSFLVNQAAHIESEIYRIEYPQFKYGTLLPLDNSAPDWAQAVMFRSIDARGELQLF 69 (311) Q Consensus 1 ~~~-----------~~~~~~~~~~~~fl~~~L~~id~~v~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~G~a~~~ 69 (311) +.. -.+.+.+.+++++++. +.+..+|++.....-..+.++.+..- + .. ..+...+..+.+.|. T Consensus 60 ~~~~~lt~~e~~~~~~~~~~~~~~gg~lvP--~~~~~~I~~~l~~~s~i~~~~~v~~~-~--~~-~~i~~~~~~~~a~w~ 133 (381) T protein:vir:10 60 KSAQSLSANQRSFFMDINKNVNYKEEKLLP--EETIDRIFEDLTTNHPLLADLGIKNA-G--LR-LKFLKSETSGVAVWG 133 (381) T ss_pred cCcccccHHHHHHHHHHhcccCCCCceecC--HHHHHHHHHHHHhhccceeheeeEec-C--cc-eEEEEecCCcceeee Confidence 111 1122334455667775 45667788777665566666655432 2 12 244556677888887 Q ss_pred cCcccccc-eeeeeccceeEEEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhhheeeeeccccc-eeeee Q lcl|NC_019522. 70 GPNSTDVP-TVDIAMSQGFKDINTAALGYTYSIEEIGFAMLNNVNLDAERGQAVRDVVEQGLNKIYLLGDKGVG-EGLYT 147 (311) Q Consensus 70 ~~~a~dip-~v~~~~~~~~~~v~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aa~~~~~~~~n~~~~~G~~~~g-~GllN 147 (311) +..+ .++ ..+..+++.....+.++.-..++.+ |. .....+|+.--.....++++..+++-+++|+.... .|+|+ T Consensus 134 ~e~~-~~~~~~~~~f~~i~l~~~kl~~~~~is~e-lL--~Ds~~~ie~~i~~~la~~~a~~~~~a~i~G~G~~qP~Gil~ 209 (381) T protein:vir:10 134 KIYG-EIKGQLDAAFSEETAIQNKLTAFVVLPKD-LN--DFGPAWIERFVRVQIEEAFAVALETAFLKGTGKDQPIGLNR 209 (381) T ss_pred cccc-cccccccccceeeeecceeEEeechhhHH-Hh--hcCHHHHHHHHHHHHHHHHHHHhhheeEeccCCCCceeeee Confidence 6543 344 3456778888899999888888743 33 23456788888999999999999999999997655 69999 Q ss_pred cCCcceeeccCCccc--cCcccccCCHHHHHHHHHHHHHHHHhccCCc--eec-ceEEEeCHHHHHHHhcccccCCCCCc Q lcl|NC_019522. 148 SPNVSVEAATSTFVA--LVAAIPTNGTQPIIDFFGNAYNTVYLDNTLT--VHR-PNTFVLPPAQFQLLARTLLSTQNASN 222 (311) Q Consensus 148 ~p~v~~~~~~~~~~~--~~t~w~~~t~~ei~~di~~~~~~~~~~~~~~--~~~-p~~l~lpp~~~~~L~~~~~~~~~~~~ 222 (311) +++.......+.... ....+...++.-+++.+..++..+-....+. -.. .-+++|.+..+..|....... +..| T Consensus 210 ~~~~~~~~~~g~~~~~~~~~t~t~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~a~~~mn~~t~~~l~~~~~~~-~~~G 288 (381) T protein:vir:10 210 QVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTHL-NANG 288 (381) T ss_pred ccCcccccccccccccccccccccccchhhHHHHHHHHHhhccccccccccccCceEEEEccccHHhhccccccC-CCCC Confidence 876422211110000 0011223334444555555554442211110 011 125677877766654322111 1111 Q ss_pred chHHHHHHHhCCceEEEEchhcccCCCCcccEEEEEEcCcceeEEeecchhhhcc----ceeeCCceEEEeeeeeeeeEE Q lcl|NC_019522. 223 VTLLQFLRTNFPDITFEDDILLKGAGVAGADRMAVYKKEIRIVKGHDVMPLRFLA----PATADNVNFKVPAILRTGGTE 298 (311) Q Consensus 223 ~Tvl~~l~~n~~~l~i~~~~~l~~ag~~g~~~~v~y~~~~~~~~~~~~~~~~~~~----p~~~~~~~~~~~~~~~~gGv~ 298 (311) . |+..-..++.|+..+.... +.++..+.+ .+.- .....+++.. ..... ...+....|.+ .. T Consensus 289 ~----~v~~l~~g~~vv~s~~~p~------~~iifgDfs-~Y~i-~~r~~~~i~~~~~~~~~~d--~~~f~a~~r~d-g~ 353 (381) T protein:vir:10 289 V----YVTALPFNLNVIESTVQEA------GKVLTYVKG-LYDG-YLAGGINVQKFKETLALDD--MDLYTAKQFAY-GK 353 (381) T ss_pred c----eeecCCCCceEEecCCCCc------CcEEEEecc-cEEE-EEecccEEEeechhHhhcC--CeEEEEEEEEc-CE Confidence 1 1101111344554443321 112222222 2221 1122222110 01111 13455567764 56 Q ss_pred EECCeEEEEeecC Q lcl|NC_019522. 299 WRIPKAGHYVDGV 311 (311) Q Consensus 299 i~~P~ai~~~dGI 311 (311) ++.|.|+++++ | T Consensus 354 ~~~~~A~~v~~-l 365 (381) T protein:vir:10 354 AKDNKVAAVWK-L 365 (381) T ss_pred EecCceEEEEE-E Confidence 78999998877 6 No 99 >protein:vir:1025 Length: 408 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:20 # MgeName: bIL286 # Cross-refs: genbank:acc:NP_076679;genbank:gi:13095788;genbank:GeneID:920362 Probab=97.58 E-value=2.6e-05 Score=45.69 Aligned_cols=267 Identities=12% Similarity=-0.020 Sum_probs=139.6 Q ss_pred CCcccccccchhhhhhhHHHHHHHHHHHHhhhhhhhhhhhhccccCCCCcceeEEEEE-EeecccceEEecCccccccee Q lcl|NC_019522. 1 MAKSVFDVSPVSALSFLVNQAAHIESEIYRIEYPQFKYGTLLPLDNSAPDWAQAVMFR-SIDARGELQLFGPNSTDVPTV 79 (311) Q Consensus 1 ~~~~~~~~~~~~~~~fl~~~L~~id~~v~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~-~~~~~G~a~~~~~~a~dip~v 79 (311) ....++.+.+.+.+++++. +.+.+.|++........+.++.+..-.. ..-.+.+. ..+..+.+.+++.+ ..+|.. T Consensus 111 ~~~~a~~~~t~~~gg~~vP--~~~~~~Ii~~~~~~~~l~~~~~~~~~~~-~~~~~~~~~~~~~~~~a~~v~E~-~~~~~~ 186 (408) T protein:vir:10 111 VSSKTETSGSDSAAGLTIP--QDIRTMINTLVRQYDSLQQYVRVESVST-SNGSRVYEKWTDVTPLTVMDAED-GKIPDL 186 (408) T ss_pred hhhhhhhcccccCCceecc--HhHHHHHHHHHHhhchhhhhcceeeccC-CcceEEEeeccccccceeeecCc-cccccc Confidence 1122333333444566665 4567788888887777777755432211 11122222 22444667787765 345654 Q ss_pred e-eeccceeEEEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhhheeeeeccccceeeeecCCcceeeccC Q lcl|NC_019522. 80 D-IAMSQGFKDINTAALGYTYSIEEIGFAMLNNVNLDAERGQAVRDVVEQGLNKIYLLGDKGVGEGLYTSPNVSVEAATS 158 (311) Q Consensus 80 ~-~~~~~~~~~v~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aa~~~~~~~~n~~~~~G~~~~g~GllN~p~v~~~~~~~ 158 (311) + ..++......+.++..+.+|.+=++ ....+|..--....++++...+++-++.|+.... | . T Consensus 187 ~~~~~~~i~~~~~k~~~~~~iS~ell~---ds~~~l~~~i~~~l~~~~~~~~~~~il~g~g~~~------~-------~- 249 (408) T protein:vir:10 187 DNPQLTIIKYLIKRYAGIITATNTSLK---DTAENILAWLSSWIAKKVVVTRNQAIIEVMKAAP------K-------K- 249 (408) T ss_pred cCcceeeEEeeeeeEEeeehhHHHHHh---hchHHHHHHHHHHHHHHHHHHHHHHHhhcccccc------c-------c- Confidence 4 5688899999999998888854333 3466788888888999999999999988876421 0 0 Q ss_pred CccccCcccccCCHHHHHHHHHHHHH-HHHhccCCceecceEEEeCHHHHHHHhcccccCCCCCcchHHHH-HHHh---- Q lcl|NC_019522. 159 TFVALVAAIPTNGTQPIIDFFGNAYN-TVYLDNTLTVHRPNTFVLPPAQFQLLARTLLSTQNASNVTLLQF-LRTN---- 232 (311) Q Consensus 159 ~~~~~~t~w~~~t~~ei~~di~~~~~-~~~~~~~~~~~~p~~l~lpp~~~~~L~~~~~~~~~~~~~Tvl~~-l~~n---- 232 (311) .+ ..+.+ ||.+++. .+. ..+ ...-.++++|+.|..|.+-. +..|.-+++- +... T Consensus 250 ---~~-----~~~~~----~l~~~~~~~~~--~~~--~~~a~~v~n~~~~~~l~~lk----d~~G~~i~~~~~~~~~~~~ 309 (408) T protein:vir:10 250 ---PT-----IAKFD----DVITMINTAVD--PAI--IATSSLLTNQSGLNKLALVK----TAEGKYLLEPDPTKPNSYL 309 (408) T ss_pred ---cc-----cccHH----HHHHHHHHhhh--hhh--ccCCEEEEcHHHHHHHHHhh----ccCCceEeccCcCCCCCce Confidence 00 11333 4444332 221 112 22346889999999986532 1222323211 1111 Q ss_pred --CCceEEEEchhcccCCCCcccEEEEEEcCcceeEEeecchhhhccceeeC----CceEEEeeeeeeeeEEEECCeEEE Q lcl|NC_019522. 233 --FPDITFEDDILLKGAGVAGADRMAVYKKEIRIVKGHDVMPLRFLAPATAD----NVNFKVPAILRTGGTEWRIPKAGH 306 (311) Q Consensus 233 --~~~l~i~~~~~l~~ag~~g~~~~v~y~~~~~~~~~~~~~~~~~~~p~~~~----~~~~~~~~~~~~gGv~i~~P~ai~ 306 (311) +.++.+.....+...+ .+ +..++|-+-.+.+.+..-..+++..-.+.+ .-.....++.|+ ++.+.+|.+++ T Consensus 310 l~G~PV~~~~~~~~~~~~-~~-~~~i~~gd~~~~~~~~~~~~~~v~~~~~~~~~f~~~~~~~r~~~r~-d~~v~~~~a~~ 386 (408) T protein:vir:10 310 IKGKQVIVVADRWLPNTG-ST-VYPLYYGDMSQAITLFDRENMSLLPTNIGAGAFETDTTKIRVIDRF-DVKATDSEALV 386 (408) T ss_pred ecceeeEEecccccCccC-CC-ceEEEEEehhccEEEEEecceEEEEcccccchhhcCceEEEEEEee-ccEEeccccEE Confidence 1122222222222211 12 222333332333333222223221111111 112456667887 56788899999 Q ss_pred EeecC Q lcl|NC_019522. 307 YVDGV 311 (311) Q Consensus 307 ~~dGI 311 (311) .++.- T Consensus 387 ~~~~~ 391 (408) T protein:vir:10 387 AGSFS 391 (408) T ss_pred EEEee Confidence 98866 No 100 >protein:vir:4997 Length: 397 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:109 # MgeName: Sfi21 # Cross-refs: genbank:acc:NP_049971;genbank:gi:9632943;genbank:GeneID:1262106 Probab=97.58 E-value=1.7e-05 Score=46.69 Aligned_cols=267 Identities=15% Similarity=-0.003 Sum_probs=139.8 Q ss_pred CCcccccccchhhhhhhHHHHHHHHHHHHhhhhhhhhhhhhccccCCCCcceeEEEEEEe-ecccceEEecCccccccee Q lcl|NC_019522. 1 MAKSVFDVSPVSALSFLVNQAAHIESEIYRIEYPQFKYGTLLPLDNSAPDWAQAVMFRSI-DARGELQLFGPNSTDVPTV 79 (311) Q Consensus 1 ~~~~~~~~~~~~~~~fl~~~L~~id~~v~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~-~~~G~a~~~~~~a~dip~v 79 (311) .....+...+.+.+++++. +.+...|++........+++..+.. .+.....+.+... +..+.+.|++.+ ..+|.. T Consensus 104 ~~~~~~~~~t~~~gg~~iP--~~~~~~ii~~~~~~~~l~~~~~~~~-~~~~~~~~~~~~~~~~~~~a~~v~E~-~~~~~~ 179 (397) T protein:vir:49 104 NLLDSKTDGSGSDAGLTIP--QDIRTAINTLVRQFDSLQEYVNVEN-VTTLTGSRVYEKWADITGLAKLDDEG-GQIGQN 179 (397) T ss_pred hHHHhhhccCCccCcceec--HHHHHHHHHHHHhhhhHhhhcceee-ccCCcceEEEEeeccCCcceeeeccc-cccccc Confidence 1111222222233445554 3455677777777666667665532 2222333444433 334677777664 445655 Q ss_pred e-eeccceeEEEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhhheeeeeccccceeeeecCCcceeeccC Q lcl|NC_019522. 80 D-IAMSQGFKDINTAALGYTYSIEEIGFAMLNNVNLDAERGQAVRDVVEQGLNKIYLLGDKGVGEGLYTSPNVSVEAATS 158 (311) Q Consensus 80 ~-~~~~~~~~~v~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aa~~~~~~~~n~~~~~G~~~~g~GllN~p~v~~~~~~~ 158 (311) + ..++..+...+.++.-+.+|.+ +. ..+..++..--.....+++.+.+|+-+++|+...+ | . T Consensus 180 ~~~~~~~v~~~~~k~~~~~~iS~e-ll--~ds~~~l~~~i~~~l~~~~~~~~d~ail~G~g~~~------~-------~- 242 (397) T protein:vir:49 180 DDPKLSLIRYAIKRYAGISTVTNS-LL--ADSAENILAWLSGWIAKKVVVTRNKAILEAIGTLP------N-------K- 242 (397) T ss_pred cccceeeeEeeeeeeEeehhhHHH-HH--hhhhHHHHHHHHHHHHHHHHHHHHHHHHhcccccc------c-------c- Confidence 4 3578888888999988888854 33 23456788888899999999999999999975421 0 0 Q ss_pred CccccCcccccCCHHHHHHHHHHHHHHHHhccCCceecceEEEeCHHHHHHHhcccccCCCCCcchHHHH-HHHh----- Q lcl|NC_019522. 159 TFVALVAAIPTNGTQPIIDFFGNAYNTVYLDNTLTVHRPNTFVLPPAQFQLLARTLLSTQNASNVTLLQF-LRTN----- 232 (311) Q Consensus 159 ~~~~~~t~w~~~t~~ei~~di~~~~~~~~~~~~~~~~~p~~l~lpp~~~~~L~~~~~~~~~~~~~Tvl~~-l~~n----- 232 (311) . +. .+ ++||.+++.++... + ..+..++|+|+.+..|.+-. +..|.-++.- +... T Consensus 243 --~-~~-----~~----~d~i~~~~~~l~~~--~--~~~a~~v~n~~~~~~l~~lk----d~~g~~l~~~~~~~g~~~~l 302 (397) T protein:vir:49 243 --P-TL-----AK----WDDIIDLQAKVDPA--I--KQTSLFLTNTSGFTALKKVK----NAMGDYLMERDVKSPTGYSI 302 (397) T ss_pred --c-cc-----cC----HHHHHHHHHhhhhh--h--cCCCEEEEcHHHHHHHHHhh----ccCCceeecccccCCCCcee Confidence 0 00 12 45677777777432 2 24568999999999986532 1112111100 0111 Q ss_pred -CCceEEEEchhcccCCCCcccEEEEEEcCcceeEEeecchhhhccceee----CCceEEEeeeeeeeeEEEECCeEEEE Q lcl|NC_019522. 233 -FPDITFEDDILLKGAGVAGADRMAVYKKEIRIVKGHDVMPLRFLAPATA----DNVNFKVPAILRTGGTEWRIPKAGHY 307 (311) Q Consensus 233 -~~~l~i~~~~~l~~ag~~g~~~~v~y~~~~~~~~~~~~~~~~~~~p~~~----~~~~~~~~~~~~~gGv~i~~P~ai~~ 307 (311) +.++.+.....+. .+.++ +..++|-+-.+.+.+..-..+++..-... ..-.....++.|++| .+++|.||+. T Consensus 303 ~G~pV~~~~~~~~~-~~~~~-~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~r~d~-~~~~~~a~~~ 379 (397) T protein:vir:49 303 DGFVVKEISDRFLP-NGTGG-AMPLYFGDLKQAVTLFDRQHLSLLSTNIGGGAFETDTTKVRVIDRFDV-VSTDTEAFVP 379 (397) T ss_pred cceeeEEecccccc-cccCC-ceeEEEeeccceEEEEeecccEEEEeccccchhhcCeeeEEEEEeecc-EEecccceEE Confidence 1112222111111 22222 22334443333332222122222110011 111245667788854 5788999998 Q ss_pred eecC Q lcl|NC_019522. 308 VDGV 311 (311) Q Consensus 308 ~dGI 311 (311) ++.= T Consensus 380 ~~~~ 383 (397) T protein:vir:49 380 ASFK 383 (397) T ss_pred EEec Confidence 8643 No 101 >protein:vir:94933 Length: 330 # NCBI annotation: putative phage structural protein # Family: family:all:1120 # MgeID: mge:1538 # MgeName: Xp15 # Cross-refs: genbank:acc:YP_239278;genbank:gi:66392060;genbank:GeneID:5076578 Probab=97.53 E-value=2.3e-05 Score=46.01 Aligned_cols=281 Identities=9% Similarity=0.016 Sum_probs=135.2 Q ss_pred CCcccccccchhhhhhhHHHHHHHHHHHHhhhhhhhhhhhhccccCCCCcceeEEEEEEeecccceEEecCcccccceee Q lcl|NC_019522. 1 MAKSVFDVSPVSALSFLVNQAAHIESEIYRIEYPQFKYGTLLPLDNSAPDWAQAVMFRSIDARGELQLFGPNSTDVPTVD 80 (311) Q Consensus 1 ~~~~~~~~~~~~~~~fl~~~L~~id~~v~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~G~a~~~~~~a~dip~v~ 80 (311) ..+..|-+-+-...+.|.. ..+...|+|.....-...+++|...-.+ + .+.|.....-+.+.+..-+...-|.-. T Consensus 20 ~p~l~m~alTLaea~~l~~--d~~~~~VIE~l~~~s~iL~~lpf~~ve~-~--~~~~~r~~~lp~a~~r~~n~~~~~~~~ 94 (330) T protein:vir:94 20 FPELKMPTVTLAESAKLSQ--DHLVSGLIETIVEVNPLYEMMPFTEIEG-N--ALAYNRENVLGDVQFLAVGGTITAKNP 94 (330) T ss_pred ccccchhhhhhhHHhhcCc--hhhHHHHHHhhhccchHHhhcccccccC-C--cceeeeeecCCcceeeeccccccccCc Confidence 3344444333333445553 3557778888776666777777432111 1 233433333344444332211111111 Q ss_pred eeccceeEEEEEEEEEEEecHHHHHHHHHhCC--ChHHHHHHHHHHHHHHhhhheeeeecccc-ce-eeeecCCcceeec Q lcl|NC_019522. 81 IAMSQGFKDINTAALGYTYSIEEIGFAMLNNV--NLDAERGQAVRDVVEQGLNKIYLLGDKGV-GE-GLYTSPNVSVEAA 156 (311) Q Consensus 81 ~~~~~~~~~v~~~~~~~~~~~~El~~a~~~g~--~l~~~k~~aa~~~~~~~~n~~~~~G~~~~-g~-GllN~p~v~~~~~ 156 (311) ..+.+.+..+..++..++++.+ -+...|- +.-....+...+++.+++.+..+|||... .+ ||++.-.-..... T Consensus 95 ~Tf~q~t~~l~~l~~~~~Vd~~---iadl~g~~~d~~~~q~~~~ieal~~~~e~~linGDs~~~~F~GL~~~~~~~q~i~ 171 (330) T protein:vir:94 95 ATFTKVTSELTTLIGDAEVNGL---IQATRSDFMDQTSVQVASKAKSIGRQYQASMITGDGTGNSFQGMMGLVAASQTIS 171 (330) T ss_pred ceeeeeeechhhhhhhHHHHHH---HHHhcCCHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCccccchhhcCCcccEEe Confidence 1122333334444444444322 2233444 34455566677799999999999998764 44 9975432221211 Q ss_pred cCCccccCcccccCCHHHHHHHHHHHHHHHHhccCCceecceEEEeCHHHHHHHhcccccC---------CCCCcchHHH Q lcl|NC_019522. 157 TSTFVALVAAIPTNGTQPIIDFFGNAYNTVYLDNTLTVHRPNTFVLPPAQFQLLARTLLST---------QNASNVTLLQ 227 (311) Q Consensus 157 ~~~~~~~~t~w~~~t~~ei~~di~~~~~~~~~~~~~~~~~p~~l~lpp~~~~~L~~~~~~~---------~~~~~~Tvl~ 227 (311) + .+.++.. | ++|+.+++..+|...+ .|+.|+++......+..-.... .+.+|.-|+. T Consensus 172 t-g~~gg~~-----T----~d~LDeLl~~v~~~~g----~~~~~l~n~a~~r~I~a~~R~~~~~~v~~~~~~~~G~~v~~ 237 (330) T protein:vir:94 172 A-GANGGTL-----T----FELLDQLLDLVKDKDG----QVDYLMSSFAMRRKYFSLLRALGGAAIGEVMTLPSGRQIPT 237 (330) T ss_pred c-CCCCCCC-----C----HHHHHHHHHHhcCCCC----CCcEEEechhHHHHHHHHHHhccCCCCCCcccccCCCEEee Confidence 1 1112221 2 5788899999875422 4788887666555443211100 1112222211 Q ss_pred HHHHhCCceEEEEchhc---ccCC-CCcccEEEEEEcCc-----ceeEEeecc-h---hhhcccee-eCCceEEEeeeee Q lcl|NC_019522. 228 FLRTNFPDITFEDDILL---KGAG-VAGADRMAVYKKEI-----RIVKGHDVM-P---LRFLAPAT-ADNVNFKVPAILR 293 (311) Q Consensus 228 ~l~~n~~~l~i~~~~~l---~~ag-~~g~~~~v~y~~~~-----~~~~~~~~~-~---~~~~~p~~-~~~~~~~~~~~~~ 293 (311) +..+-|.++..+ ++++ .+|+...++..-.. -+..++-+. | .+++.-.+ ..-.+|.+.. T Consensus 238 -----~~GvPi~~~d~ip~~~~~~~~~~ttsIyav~~G~~~~~qgV~Gl~~~g~~glsVr~~G~~~~k~v~~~~v~~--- 309 (330) T protein:vir:94 238 -----YRGVPWFVNDFIPSNMTQGTATNATAIFAGTFDDGSNKYGIAGLTARGSAGLRVQNVGAKENADETITRVKM--- 309 (330) T ss_pred -----eCCeEEEecccccCCCCcccCCCceeEEEEeecccccccceEeecCCCCCcceeeeCCCccccceeeEEEEE--- Confidence 112333333322 2222 34566666555332 234443221 1 12222112 1223455544 Q ss_pred eeeEEEECCeEEEEeecC Q lcl|NC_019522. 294 TGGTEWRIPKAGHYVDGV 311 (311) Q Consensus 294 ~gGv~i~~P~ai~~~dGI 311 (311) .-|+-++.|.|++.+.|| T Consensus 310 y~~~av~~~~a~~~L~~V 327 (330) T protein:vir:94 310 YCGFANFSQLGLAAIKGL 327 (330) T ss_pred eeeeEEechhheeeeccc Confidence 346789999999999999 No 102 >protein:vir:3845 Length: 395 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:322 # MgeName: phi adh # Cross-refs: genbank:acc:NP_050151;swissprot:trembl:q9t1f6;genbank:gi:9633043;uniprot:Q9T1F6;genbank:GeneID:1262163 Probab=97.52 E-value=1.8e-05 Score=46.57 Aligned_cols=269 Identities=10% Similarity=-0.061 Sum_probs=136.1 Q ss_pred CCcccccccchhhhhhhHHHHHHHHHHHHhhhhhhhhhhhhccccCCCCcceeEEEEEEe-ecccceEEecCccccccee Q lcl|NC_019522. 1 MAKSVFDVSPVSALSFLVNQAAHIESEIYRIEYPQFKYGTLLPLDNSAPDWAQAVMFRSI-DARGELQLFGPNSTDVPTV 79 (311) Q Consensus 1 ~~~~~~~~~~~~~~~fl~~~L~~id~~v~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~-~~~G~a~~~~~~a~dip~v 79 (311) .......+...+.+++++. +.+.+.|++........+.++.+.. .......+.+... +..+.+.|++.+ ..+|.. T Consensus 102 ~~~~~~~~~~~~~gg~~vP--~~~~~~ii~~~~~~~~l~~~~~~~~-~~~~~~~~~~~~~~~~~~~a~~v~E~-~~~~~~ 177 (395) T protein:vir:38 102 KNLVTSGTTGTGNAGLTIP--EDIQLQIRTLTRSFTSLESLANVEN-VTTSHGSRVYEKLADITPLKDLDDES-ALIGDN 177 (395) T ss_pred HHHHhhccCccCCCceecc--hhHhhHHHHHHHhhcchhhhcceee-ccCCcceEEEEeeccCCccccccccc-cccccc Confidence 1111112222233445554 4566778888887777777765432 1112223333323 334455666654 445643 Q ss_pred -eeeccceeEEEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhhheeeeeccccceeeeecCCcceeeccC Q lcl|NC_019522. 80 -DIAMSQGFKDINTAALGYTYSIEEIGFAMLNNVNLDAERGQAVRDVVEQGLNKIYLLGDKGVGEGLYTSPNVSVEAATS 158 (311) Q Consensus 80 -~~~~~~~~~~v~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aa~~~~~~~~n~~~~~G~~~~g~GllN~p~v~~~~~~~ 158 (311) ...++......+.++..+.+|.. +. .....+|..--.....+++...+|+-+++|+.... +. . T Consensus 178 ~~~~f~~v~~~~~k~~~~~~iS~e-ll--~ds~~~l~~~i~~~la~~~~~~~~~~il~g~g~~~----~~--------~- 241 (395) T protein:vir:38 178 DDPELTVVKYLIHRYAGITTVTNT-LL--KDTVDNIIQWLVNWAAKKDVVTRNAKILEVMGKAP----KK--------P- 241 (395) T ss_pred cccceeeEEeeeeeeEeehhhHHH-HH--hhhHHHHHHHHHHHHHHHHHHHHHHHHhhcccccc----cc--------c- Confidence 46778888889999988888843 33 23556788888899999999999999999876421 00 0 Q ss_pred CccccCcccccCCHHHHHHHHHHHHHHHHhccCCceecceEEEeCHHHHHHHhcccccCCCCCcchHHHHHHHhCCceEE Q lcl|NC_019522. 159 TFVALVAAIPTNGTQPIIDFFGNAYNTVYLDNTLTVHRPNTFVLPPAQFQLLARTLLSTQNASNVTLLQFLRTNFPDITF 238 (311) Q Consensus 159 ~~~~~~t~w~~~t~~ei~~di~~~~~~~~~~~~~~~~~p~~l~lpp~~~~~L~~~~~~~~~~~~~Tvl~~l~~n~~~l~i 238 (311) + ..+.+ +|.+++...... . ......++|+|+.+..|.+-. +..|.-++.--..++..-+| T Consensus 242 ----~-----~~~~~----~i~~~~~~~l~~-~--~~~~a~~v~n~~~~~~L~~lk----d~~G~~l~~~~~~~~~~~~l 301 (395) T protein:vir:38 242 ----T-----ISQFD----NIKDLENNTLDP-A--IESTSSFITNQSGYNILSKVK----DADGRYLMQPDVTSPDKYLI 301 (395) T ss_pred ----c-----cccHH----HHHHHHHHhhhh-h--hcCCCEEEEcHHHHHHHHHhh----ccCCceeeccCcCCCCccee Confidence 0 01233 344443322111 1 112346899999999986532 11122221100011111122 Q ss_pred EEch-----hcccCCCCcccEEEEEEcCcceeEEeecchhhhccceeeC----CceEEEeeeeeeeeEEEECCeEEEEee Q lcl|NC_019522. 239 EDDI-----LLKGAGVAGADRMAVYKKEIRIVKGHDVMPLRFLAPATAD----NVNFKVPAILRTGGTEWRIPKAGHYVD 309 (311) Q Consensus 239 ~~~~-----~l~~ag~~g~~~~v~y~~~~~~~~~~~~~~~~~~~p~~~~----~~~~~~~~~~~~gGv~i~~P~ai~~~d 309 (311) -..| ....-+..+.. .++|-+-.+.+.+..-..+++..--+.. .-.+.+.++.|+ ++.+.+|.+++.++ T Consensus 302 ~G~pV~~~~~~~~~~~~~~~-~i~~gd~~~~~~i~~~~~~~i~~~~~~~~~~~~~~~~~r~~~r~-d~~~~~~~a~~~~~ 379 (395) T protein:vir:38 302 DGKPVIRIADKWLPDVSGSH-PLYFGDLKQGITLFDRQQMQIDTTNVGAGSFEHDTTKLRFIDRF-DVQLIDDGAFAAAS 379 (395) T ss_pred ccceeEEecccccCcCCCcc-eEEEEeccccEEEEEecceEEEEeccccchhhcCceEEEEEEee-ccEEecccceEEEE Confidence 1111 11111122222 2344433333333222222221100111 112456677887 46788899999999 Q ss_pred cC Q lcl|NC_019522. 310 GV 311 (311) Q Consensus 310 GI 311 (311) .- T Consensus 380 ~~ 381 (395) T protein:vir:38 380 FK 381 (395) T ss_pred ee Confidence 87 No 103 >protein:vir:9643 Length: 377 # NCBI annotation: major coat protein # Family: family:all:635 # MgeID: mge:173 # MgeName: 315.1 # Cross-refs: genbank:acc:NP_795405;genbank:gi:28876178;genbank:GeneID:1257724 Probab=97.46 E-value=6.3e-05 Score=43.62 Aligned_cols=286 Identities=8% Similarity=0.000 Sum_probs=144.8 Q ss_pred CCcccccccchhhhhhhHHHHHHHHHHHHhhhhhhhhhhhhccccCCCCcceeEEEEEEeecccceEEecCcccccc-ee Q lcl|NC_019522. 1 MAKSVFDVSPVSALSFLVNQAAHIESEIYRIEYPQFKYGTLLPLDNSAPDWAQAVMFRSIDARGELQLFGPNSTDVP-TV 79 (311) Q Consensus 1 ~~~~~~~~~~~~~~~fl~~~L~~id~~v~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~G~a~~~~~~a~dip-~v 79 (311) .=...+...+.+++++++. +.+..+|++.....-..+.++.+.+-. +. ..+...+..+.+.|.+..+ .++ .. T Consensus 74 ~~~~~~~~~~~~~gg~lvP--~~~~~~I~~~l~~~s~i~~~~~v~~~~--~~--~~i~~~~~~~~a~wv~e~~-~~~~~~ 146 (377) T protein:vir:96 74 FFNDIDKNVGGKDKFKLLP--EETMVQVFDDLVAEHPLLKVINFKNTS--LR--LKALTAETSGTAVWGDIFG-EIKGQL 146 (377) T ss_pred HHHHHHhcCCCCCCceecC--HHHHHHHHHHHHhhhhhhhhceeEecC--Cc--eEEEEecCCcceeEeeccc-cccccc Confidence 0001111233456667775 335566666555444444444443221 11 2344456677888876543 343 45 Q ss_pred eeeccceeEEEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhhheeeeeccccc-eeeeecCCcceeeccC Q lcl|NC_019522. 80 DIAMSQGFKDINTAALGYTYSIEEIGFAMLNNVNLDAERGQAVRDVVEQGLNKIYLLGDKGVG-EGLYTSPNVSVEAATS 158 (311) Q Consensus 80 ~~~~~~~~~~v~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aa~~~~~~~~n~~~~~G~~~~g-~GllN~p~v~~~~~~~ 158 (311) +..++....+.+.++.-..++.+ |. ..+.++++.--.....+++...+++-+++|+.... .|+||++......... T Consensus 147 ~~~f~~i~l~~~kl~~~~~is~~-ll--~ds~~~le~~i~~~l~~~~~~~~~~a~i~G~G~~~P~Gil~~~~~~~~~~~~ 223 (377) T protein:vir:96 147 KQAFKEQDFSQFKLTAFVVIPKD-AL--KFGPKWLKQFITEQLKEAIAVALELAIVKGNGLLQPVGLLKDLSQPTVDQST 223 (377) T ss_pred CccceeEeeeeeeEEeechhhHH-Hh--hcchhhHHHHHHHHHHHHHHHHHhhceEeccCCCcceeeeeccccccccccc Confidence 67788888999999888888744 32 34567899999999999999999999999997655 7999998765443332 Q ss_pred CccccCc--------ccccCCHHHHHHHHHHHHHHHHhccCCc---eecceEEEeCHHHHHHHhcccccC-CCCCcchHH Q lcl|NC_019522. 159 TFVALVA--------AIPTNGTQPIIDFFGNAYNTVYLDNTLT---VHRPNTFVLPPAQFQLLARTLLST-QNASNVTLL 226 (311) Q Consensus 159 ~~~~~~t--------~w~~~t~~ei~~di~~~~~~~~~~~~~~---~~~p~~l~lpp~~~~~L~~~~~~~-~~~~~~Tvl 226 (311) ....... .....+++.+.+.+..+...+.....+. ....-.++|.+..+..+...+..- .+....+++ T Consensus 224 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~a~~~mn~~t~~~~~~~~~~~~~~G~~~~~l 303 (377) T protein:vir:96 224 GRDITTYKTDKEAIADLSDLDPDTAVELLVPVMKHLSVNDKKHPLKIAGQVKLLLNPEDRWTLEAKFTSRNQFGEYVTVL 303 (377) T ss_pred cccccceeeccccccccccCChhHHHHHHHHHHHhhccccccccccccCceEEEEchhhHHhccccccccCCCCCceecc Confidence 2111111 1223456666665555555543211111 111235778887765543222211 111111221 Q ss_pred HHHHHhCCceEEEEchhcccCCCCcccEEEEEEcCcceeEEeecchhhhccceeeC--CceEEEeeeeeeeeEEEECCeE Q lcl|NC_019522. 227 QFLRTNFPDITFEDDILLKGAGVAGADRMAVYKKEIRIVKGHDVMPLRFLAPATAD--NVNFKVPAILRTGGTEWRIPKA 304 (311) Q Consensus 227 ~~l~~n~~~l~i~~~~~l~~ag~~g~~~~v~y~~~~~~~~~~~~~~~~~~~p~~~~--~~~~~~~~~~~~gGv~i~~P~a 304 (311) +.+++++....... | .++..+. .++ -+..-..+++..--+.. .-...+....|++| .++.|.| T Consensus 304 ------~~p~~v~~s~~~p~----~--~i~fgdf-~~Y-~i~~r~~~~i~~~~~~~~~~d~~~f~~~~r~dG-~~~d~~a 368 (377) T protein:vir:96 304 ------PHGITILESLAVET----G--KAIAFVA-NRY-DAFMATASTIEEYDQTFAMEDLQLYLTKNYFYG-KAKDNHT 368 (377) T ss_pred ------CCCceEEecCCCCc----c--cEEEEEc-CcE-EEEEecccEEEeehhhhhhcCCeEEEEEEEEcC-EEecCCc Confidence 22445544333221 1 1222111 112 11112222211000110 11234555677754 5678888 Q ss_pred EEEee--cC Q lcl|NC_019522. 305 GHYVD--GV 311 (311) Q Consensus 305 i~~~d--GI 311 (311) ++.++ |= T Consensus 369 ~~vl~l~~~ 377 (377) T protein:vir:96 369 AALLTLAGG 377 (377) T ss_pred EEEEEEecC Confidence 66654 11 No 104 >protein:vir:93881 Length: 387 # NCBI annotation: ORF011 # Family: family:all:658 # MgeID: mge:1485 # MgeName: 3A # Cross-refs: genbank:acc:YP_239938;genbank:gi:66395599;genbank:GeneID:5130947 Probab=97.46 E-value=1.1e-05 Score=47.83 Aligned_cols=265 Identities=9% Similarity=-0.044 Sum_probs=135.3 Q ss_pred CCcccccccchhhhhhhHHHHHHHHHHHHhhhhhhhhhhhhccccCCCCcceeEEEEEEeecccceEEecCcccccceee Q lcl|NC_019522. 1 MAKSVFDVSPVSALSFLVNQAAHIESEIYRIEYPQFKYGTLLPLDNSAPDWAQAVMFRSIDARGELQLFGPNSTDVPTVD 80 (311) Q Consensus 1 ~~~~~~~~~~~~~~~fl~~~L~~id~~v~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~G~a~~~~~~a~dip~v~ 80 (311) -...+|...+.+++++++. +.+...|++.....-..+.++.+.+..+ .++ ..+....+.+.|++.. ...|..+ T Consensus 113 ~~~~al~~~t~s~gG~~IP--~~~~~~Ii~~~~~~~~l~~~~~v~~~~~---~~~-p~~~~~~~~a~~v~E~-~~~~~~~ 185 (387) T protein:vir:93 113 RLLHALPTGNDSGGDKLLP--KTLSKEIVSEPFAKNQLREKARLTNIKG---LEI-PRVSYTLDDDDFITDV-ETAKELK 185 (387) T ss_pred HHHHhhccCcCCCCceeec--hhHHHHHHHHHHhhchhhhheeeeecCC---ceE-EEEeecCCccccccCc-ccccccc Confidence 1123344444555667775 4456778877776666677766654332 111 1223344567787765 4467778 Q ss_pred eeccceeEEEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhhheeeeeccccc--eeeeecCCcceeeccC Q lcl|NC_019522. 81 IAMSQGFKDINTAALGYTYSIEEIGFAMLNNVNLDAERGQAVRDVVEQGLNKIYLLGDKGVG--EGLYTSPNVSVEAATS 158 (311) Q Consensus 81 ~~~~~~~~~v~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aa~~~~~~~~n~~~~~G~~~~g--~GllN~p~v~~~~~~~ 158 (311) ..++......+.++.-+.+|.+ |. .-+..++.+--.....+++...++..+|.+..+.| .|+++++++...+. T Consensus 186 ~~f~~v~~~~~k~~~~~~iS~e-ll--~Ds~~~l~~~i~~~la~~~~~~e~~~~~~~g~g~g~p~g~l~~~~~~~v~~-- 260 (387) T protein:vir:93 186 LKGDTVKFTTNKFKVFAAISDT-VI--HGSDVDLVNWVENALQSGLAAKERKDALAVSPKSGLDHMSFYNGSVKEVEG-- 260 (387) T ss_pred cccceeeeeheeeeeechhhHH-HH--hhhHHHHHHHHHHHHHHHHHHHHHHhHhhcCCCccccceeeeccccccccc-- Confidence 8888899999999988888844 33 34456788878888888888888877665444444 58888766543211 Q ss_pred CccccCcccccCCHHHHHHHHHHHHHHHHhccCCceecceEEEeCHHHHHHHhcccccCCCCCcchHHHHHHHhCCceEE Q lcl|NC_019522. 159 TFVALVAAIPTNGTQPIIDFFGNAYNTVYLDNTLTVHRPNTFVLPPAQFQLLARTLLSTQNASNVTLLQFLRTNFPDITF 238 (311) Q Consensus 159 ~~~~~~t~w~~~t~~ei~~di~~~~~~~~~~~~~~~~~p~~l~lpp~~~~~L~~~~~~~~~~~~~Tvl~~l~~n~~~l~i 238 (311) ...+++|.+++..+-.. +. . .-.++|.+..+..+.+..-. . +..++ . ..+-+| T Consensus 261 --------------~~~~d~i~~~~~~l~~~--~~-~-~a~~~mn~~t~~~~~~~~~d---~-~~~~~---~--~~~~~l 313 (387) T protein:vir:93 261 --------------ADMYDAIINALADLHED--YR-D-NATIYMRYADYVKIISVLSN---G-TTNFF---D--TPAEKV 313 (387) T ss_pred --------------cchHHHHHHHHhccChh--hh-c-CCEEEEechHHHHHHHHHhc---C-CCccc---c--cCCccc Confidence 11245677777766322 21 1 12466776665444333211 1 11111 1 111122 Q ss_pred EEchhcccCCCCcccEEEEEEcCcceeEEeecchhhhccceeeCCceEEEeeeeeeeeEEEECCeEEEEeecC Q lcl|NC_019522. 239 EDDILLKGAGVAGADRMAVYKKEIRIVKGHDVMPLRFLAPATADNVNFKVPAILRTGGTEWRIPKAGHYVDGV 311 (311) Q Consensus 239 ~~~~~l~~ag~~g~~~~v~y~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~gGv~i~~P~ai~~~dGI 311 (311) ...|-....+. ..++ |-+=..... ....+.+..--+.....+.+-+..|++|. +++|.|++++.-= T Consensus 314 lG~PV~~~~~~---~~~~-~GDf~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~r~d~~-v~~~eA~~~l~~k 379 (387) T protein:vir:93 314 FGKPVVFTDAA---VKPI-VGDFNYFGI--NYDGTTYDTDKDVKKGEYLFVLTAWYDQQ-RTLDSAFRIAKAK 379 (387) T ss_pred cccceEEecCC---Ccee-eeehhhhhe--ehhhheeeecccccCCceeEEEEeeeCce-eechhheEEEEee Confidence 22222211110 0111 110000000 00111111001111223445566788655 6679999987543 No 105 >protein:vir:96123 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1602 # MgeName: 37 # Cross-refs: genbank:acc:YP_240078;genbank:gi:66395742;genbank:GeneID:5133103 Probab=97.41 E-value=7.3e-05 Score=43.28 Aligned_cols=261 Identities=11% Similarity=0.042 Sum_probs=133.8 Q ss_pred CCcccccccchhhhhhhHHHHHHHHHHHHhhhhhhhhhhhhccccCC--CCcceeEEEEEEeecccceEEecCcccccce Q lcl|NC_019522. 1 MAKSVFDVSPVSALSFLVNQAAHIESEIYRIEYPQFKYGTLLPLDNS--APDWAQAVMFRSIDARGELQLFGPNSTDVPT 78 (311) Q Consensus 1 ~~~~~~~~~~~~~~~fl~~~L~~id~~v~~~~~~~~~~~~~~~v~~~--~~~~~~~~~~~~~~~~G~a~~~~~~a~dip~ 78 (311) ||... ..-+.+ +..| -+.+.+.+.....+....+..+... +.+| .++.+..++..|.++.+..+ ++++. T Consensus 1 ma~~~---T~~~d~--i~Pe--v~s~~v~~~~~~~~~~~~~~~~~~~l~g~~G-~tv~ip~~~~~g~~~~~~~g-~~i~~ 71 (274) T protein:vir:96 1 MAQGT---TKVSNL--IVPE--VLAPMMQAELDKKLRFAQFADIDSTLVGQPG-DTLTFPAFTYSGDAQVIAEG-EKIPV 71 (274) T ss_pred CCccc---cchhhh--hhhH--HHHHHHHHHHHhhhhhcccccccccccCCCC-CEEEEEeeccCCCccccCCC-CcCch Confidence 44322 111111 2221 1223333444444555556555443 2234 46788888888999988765 57888 Q ss_pred eeeeccceeEEEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhhheeeeeccccceeeeecCCcceeeccC Q lcl|NC_019522. 79 VDIAMSQGFKDINTAALGYTYSIEEIGFAMLNNVNLDAERGQAVRDVVEQGLNKIYLLGDKGVGEGLYTSPNVSVEAATS 158 (311) Q Consensus 79 v~~~~~~~~~~v~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aa~~~~~~~~n~~~~~G~~~~g~GllN~p~v~~~~~~~ 158 (311) -++..+.....+...+.+|+++ |+++++ .+.++-......+...+++..|+.++.- ++.. +. T Consensus 72 ~~it~~~~~~~i~~~~~~~~i~--D~~~~~-~~~d~~~~~~~~~~~~~a~~~d~~i~~~--------l~~a-------~~ 133 (274) T protein:vir:96 72 DQIGTSKREAKVRKIGKGTELT--DEAVLS-GFGDPQGEAVRQHGLAIANKVDNDVLEA--------LKGA-------TL 133 (274) T ss_pred hhcccceeEEEEEeeeceeeec--HHHHHh-hcchHHHHHHHHHHHHHHHHHHHHHHHH--------HhcC-------CC Confidence 8888888888888877666665 555444 4555667777888888888888866531 1110 00 Q ss_pred CccccCcccccCCHHHHHHHHHHHHHHHHhccCCceecceEEEeCHHHHHHHhcc----cccCCCCCcchHH-HHHHHhC Q lcl|NC_019522. 159 TFVALVAAIPTNGTQPIIDFFGNAYNTVYLDNTLTVHRPNTFVLPPAQFQLLART----LLSTQNASNVTLL-QFLRTNF 233 (311) Q Consensus 159 ~~~~~~t~w~~~t~~ei~~di~~~~~~~~~~~~~~~~~p~~l~lpp~~~~~L~~~----~~~~~~~~~~Tvl-~~l~~n~ 233 (311) . ....+ -+ ++.|.++..++-.. . ..+..|+++|..+..|.+- .....+ .+..++ .-....+ T Consensus 134 ~-~~~~~----~~----~d~i~dA~~~l~d~---~-~~~~~ivv~p~~~~~L~k~~~~~f~~~~~-~g~~~~~~g~ig~~ 199 (274) T protein:vir:96 134 T-VEADI----TK----LDGLQTAIDKFNDE---D-LEPMVLFVNPLDAGGLRTSASDNFTRPTQ-LGDNIIVKGAFGEA 199 (274) T ss_pred C-cCccc----cc----HHHHHHHHHHhccc---C-CCceEEEeCHHHHHHHHhccccccccccc-ccccceeeccccee Confidence 0 00000 11 45666777766322 1 2578999999999998552 111111 111110 0000011 Q ss_pred CceEEEEchhcccCCCCcccEEEEEEcCcceeEEeecchhhhccceeeCCceEEEeeeeeeeeEEEECCeEEEEeecC Q lcl|NC_019522. 234 PDITFEDDILLKGAGVAGADRMAVYKKEIRIVKGHDVMPLRFLAPATADNVNFKVPAILRTGGTEWRIPKAGHYVDGV 311 (311) Q Consensus 234 ~~l~i~~~~~l~~ag~~g~~~~v~y~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~gGv~i~~P~ai~~~dGI 311 (311) -..+|+..+.+. ....+++. +.-+.+....+.+...-.........+.... ..|+-+.+|..++.+.-= T Consensus 200 ~G~~Vi~s~~~p------~~t~~l~~--~gA~~~~~~~~~~vE~~Rd~~~~~d~i~~~~-~yg~~~~~~~~vv~~t~~ 268 (274) T protein:vir:96 200 LGAVIVRSNKLN------KGEALLAK--KGAVKLITKRDFFLEKDRDASRKSTALYSDK-HYVAYLYDESKVVKITKG 268 (274) T ss_pred cCeeEEEcCCCC------cceEEEEe--CcceeeeecCCcccccccchhhcccEEEEee-EEEEEEEcCccEEEEEcC Confidence 123444444332 11223332 3334443333332211111112223333333 358999999988887655 No 106 >protein:vir:7409 Length: 408 # NCBI annotation: major structural protein # Family: family:all:21 # MgeID: mge:146 # MgeName: P335 # Cross-refs: genbank:acc:NP_839926;genbank:gi:30089896;genbank:GeneID:1260683 Probab=97.40 E-value=3.6e-05 Score=44.98 Aligned_cols=267 Identities=11% Similarity=-0.034 Sum_probs=139.5 Q ss_pred CCcccccccchhhhhhhHHHHHHHHHHHHhhhhhhhhhhhhccccCCCCcceeEEEEEEeeccc-ceEEecCcccccce- Q lcl|NC_019522. 1 MAKSVFDVSPVSALSFLVNQAAHIESEIYRIEYPQFKYGTLLPLDNSAPDWAQAVMFRSIDARG-ELQLFGPNSTDVPT- 78 (311) Q Consensus 1 ~~~~~~~~~~~~~~~fl~~~L~~id~~v~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~G-~a~~~~~~a~dip~- 78 (311) ....++.....+.++++.. +.+.+.|++..+.....+.++.+..- +.....+.+......+ .+.+++.+ ..+|. T Consensus 111 ~~~~a~~~~~~~~gg~~vP--~~~~~~Ii~~~~~~~~l~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~v~E~-~~~~~~ 186 (408) T protein:vir:74 111 VSSKTETSGSDSAAGLTIP--QDIRTMINTLVRQYDSLQQYVRVESV-STSSGSRVYEKWTDVTPLKAMDEED-GKIPDL 186 (408) T ss_pred hhhhhhcccccCCCceeec--hhHhhHHHHHHhhhcchhhhcceeec-cCCcceEEEEeecCCcccccccccc-cccccc Confidence 2222333333344556665 55677888888887777777765332 2233344444444444 33455444 44564 Q ss_pred eeeeccceeEEEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhhheeeeeccccceeeeecCCcceeeccC Q lcl|NC_019522. 79 VDIAMSQGFKDINTAALGYTYSIEEIGFAMLNNVNLDAERGQAVRDVVEQGLNKIYLLGDKGVGEGLYTSPNVSVEAATS 158 (311) Q Consensus 79 v~~~~~~~~~~v~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aa~~~~~~~~n~~~~~G~~~~g~GllN~p~v~~~~~~~ 158 (311) .+..++......+.++..+.+|.+ +. .....+|..--.....+++...+|+-+++|+.... | . T Consensus 187 ~~~~~~~i~~~~~k~~~~~~iS~e-ll--~ds~~~l~~~i~~~l~~~~~~~~d~~il~G~G~~~------~-------~- 249 (408) T protein:vir:74 187 DNPRLTIIKYLIKRYAGIITATNT-LL--KDTAENILAWLSSWIAKKVVVTRNQAIIAAMGTVP------K-------K- 249 (408) T ss_pred cccceeeEEeeeeeEEeeehhHHH-HH--hhchHHHHHHHHHHHHHHHHHHHHHHHhhcccccc------c-------c- Confidence 457889999999999999999854 32 34566788888889999999999999999875421 0 0 Q ss_pred CccccCcccccCCHHHHHHHHHHHHH-HHHhccCCceecceEEEeCHHHHHHHhcccccCCCCCcchHHHH-HHHh---- Q lcl|NC_019522. 159 TFVALVAAIPTNGTQPIIDFFGNAYN-TVYLDNTLTVHRPNTFVLPPAQFQLLARTLLSTQNASNVTLLQF-LRTN---- 232 (311) Q Consensus 159 ~~~~~~t~w~~~t~~ei~~di~~~~~-~~~~~~~~~~~~p~~l~lpp~~~~~L~~~~~~~~~~~~~Tvl~~-l~~n---- 232 (311) + ...+.+ ||.+++. .+. ..+ .....++++|+.+..|.+-. . ..|.-++.- +... T Consensus 250 ---~-----~~~~~~----~i~~~~~~~l~--~~~--~~~a~~v~n~~~~~~l~~lk-d---~~G~~l~~~~~~~~~~~~ 309 (408) T protein:vir:74 250 ---P-----TIANFD----DVITMINTSVD--PAI--IATSSLLTNQSGLNKLALVK-T---AEGKYLLEPDPTKPNSYL 309 (408) T ss_pred ---c-----ccccHH----HHHHHHHHhhh--hhh--cCCCEEEEcHHHHHHHHHhh-c---CCCceEeccCcCCCCCce Confidence 0 011233 4444432 331 112 12346889999999986532 1 112222110 0111 Q ss_pred --CCceEEEEchhcccCCCCcccEEEEEEcCcceeEEeecchhhhcc-ceee---CCceEEEeeeeeeeeEEEECCeEEE Q lcl|NC_019522. 233 --FPDITFEDDILLKGAGVAGADRMAVYKKEIRIVKGHDVMPLRFLA-PATA---DNVNFKVPAILRTGGTEWRIPKAGH 306 (311) Q Consensus 233 --~~~l~i~~~~~l~~ag~~g~~~~v~y~~~~~~~~~~~~~~~~~~~-p~~~---~~~~~~~~~~~~~gGv~i~~P~ai~ 306 (311) +.++.+.....+... +++... ++|-+-.+.+.+..-..+++.. +... ..-...+.++.|++ +.+++|.+++ T Consensus 310 l~G~pV~~~~~~~~~~~-~~~~~~-i~~gd~~~~~~~~~~~~~~i~~~~~~~~~f~~~~~~~r~~~r~d-~~~~~~~a~~ 386 (408) T protein:vir:74 310 IKGKQVIVVADRWLPNS-GSTVYP-LYYGDMSQAITLFDRENMSLLPTNIGAGAFETDTTKIRVIDRFD-VKATDSEALV 386 (408) T ss_pred ecceeeEEecCcccccc-cCCcce-EEEEehhccEEEEEecceEEEEeccccchhhcceeeEEEEEeeC-cEEecccceE Confidence 112222222122222 122222 3333322322222112222110 1110 01125566778885 5688899998 Q ss_pred EeecC Q lcl|NC_019522. 307 YVDGV 311 (311) Q Consensus 307 ~~dGI 311 (311) .++.- T Consensus 387 ~~~~~ 391 (408) T protein:vir:74 387 AGSFT 391 (408) T ss_pred EEEee Confidence 88854 No 107 >protein:vir:100172 Length: 394 # NCBI annotation: putative major head protein # Family: family:all:21 # MgeID: mge:1524 # MgeName: phi AT3 # Cross-refs: genbank:acc:YP_025031;genbank:gi:48697264;genbank:GeneID:2948270 Probab=97.40 E-value=7.2e-05 Score=43.30 Aligned_cols=264 Identities=14% Similarity=0.053 Sum_probs=139.3 Q ss_pred CCcccccccchhhhhhhHHHHHHHHHHHHhhhhhhhhhhhhccccCCCCcceeEEEEEEeec-ccceEEecCcccccce- Q lcl|NC_019522. 1 MAKSVFDVSPVSALSFLVNQAAHIESEIYRIEYPQFKYGTLLPLDNSAPDWAQAVMFRSIDA-RGELQLFGPNSTDVPT- 78 (311) Q Consensus 1 ~~~~~~~~~~~~~~~fl~~~L~~id~~v~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~-~G~a~~~~~~a~dip~- 78 (311) ....++.+.+.+.+++++. +.+.+.|++...+....+.++.+.. -...+..|.+... .+.+.+++..+. .|. T Consensus 106 ~~~~~~~~~t~~~gg~~vP--~~~~~~ii~~~~~~~~l~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~E~~~-~~~~ 179 (394) T protein:vir:10 106 VIDNAAGHVTSTEAGVLIP--EEIIYDPTAEVNSVVDLSTLVTKTP---VTTPKGTYPILKRATDRFSSVAELAE-NPAL 179 (394) T ss_pred hhhhhhcccccccCceecc--HHHHHHHHHHHHhhhhhhhhceeee---ccCCceEEEEEecCCCcccccccccc-cccc Confidence 2223333334444556665 4577888888887777777765432 2223445555543 467778777544 453 Q ss_pred eeeeccceeEEEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhhheeeeeccccceeeeecCCcceeeccC Q lcl|NC_019522. 79 VDIAMSQGFKDINTAALGYTYSIEEIGFAMLNNVNLDAERGQAVRDVVEQGLNKIYLLGDKGVGEGLYTSPNVSVEAATS 158 (311) Q Consensus 79 v~~~~~~~~~~v~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aa~~~~~~~~n~~~~~G~~~~g~GllN~p~v~~~~~~~ 158 (311) .+..++.....++.++.-..+|.+=|+. +..+|..--....++++...+|+-+++|......+ T Consensus 180 ~~~~~~~v~l~~~k~~~~~~iS~ell~d---s~~~l~~~i~~~la~~~~~~~~~~il~g~g~~~~~-------------- 242 (394) T protein:vir:10 180 AEPEFEQVDWSVSTYRGAIPLSEEAIAD---SAVDLTSLVGQSINEKSVNTYNAMIAPVLQSFTAK-------------- 242 (394) T ss_pred ccccceeEEeeeeeeEeeehhHHHHHhh---hhHHHHHHHHHHHHHHHHHHHHHHHhhcccccccc-------------- Confidence 4567888899999999888888654443 34578888888888999999999888876531100 Q ss_pred CccccCcccccCCHHHHHHHHHHHHHHHHhccCCceecceEEEeCHHHHHHHhcccccCCCCCcchHHHHHH----H--- Q lcl|NC_019522. 159 TFVALVAAIPTNGTQPIIDFFGNAYNTVYLDNTLTVHRPNTFVLPPAQFQLLARTLLSTQNASNVTLLQFLR----T--- 231 (311) Q Consensus 159 ~~~~~~t~w~~~t~~ei~~di~~~~~~~~~~~~~~~~~p~~l~lpp~~~~~L~~~~~~~~~~~~~Tvl~~l~----~--- 231 (311) +.. ...+. ++|.+++...... .+ .-.++|+|+.+..|.+-. . ..|.-++.--. . T Consensus 243 -~~~-----~~~~~----d~l~~~~~~~~~~-~~----~a~~vmn~~~~~~l~~lk-d---~~G~~i~~~~~~~~~~~~~ 303 (394) T protein:vir:10 243 -ATT-----TDTLV----DSLKHILNVDLDP-AY----SRALVVTQSLFNTLDTLK-D---KNGRYLLHDASDSITDGTA 303 (394) T ss_pred -ccc-----ccccH----HHHHHHHHhhhhh-hc----cCEEEecHHHHHHHHHhh-c---cCCCeeeeccccccccCCc Confidence 000 01123 3444444433221 11 136899999999987532 1 11211111000 0 Q ss_pred ----hCCceEEEEchhcccCCCCcccEEEEEEcCcceeEEeecchhhhccceeeCCceEEEeeeeeeeeEEEECCeEEEE Q lcl|NC_019522. 232 ----NFPDITFEDDILLKGAGVAGADRMAVYKKEIRIVKGHDVMPLRFLAPATADNVNFKVPAILRTGGTEWRIPKAGHY 307 (311) Q Consensus 232 ----n~~~l~i~~~~~l~~ag~~g~~~~v~y~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~gGv~i~~P~ai~~ 307 (311) .+.++.+...-.+ +.+..+..++|-+=.+.+.+.....+++.. .........+..+.|++ +.+++|.+|+. T Consensus 304 ~~~L~G~PV~~~~~~~~---~~~~~~~~i~~gd~s~~~~~~~~~~~~v~~-~~~~~~~~~~~~~~r~d-~~~~~~~ai~~ 378 (394) T protein:vir:10 304 KGTVLGVPVYVVGDALL---GSAAGDQKAFVGDLKRGVLFADRQQVTLAW-EDSKIYGRYLGAAFRFG-VKQADSNAGYF 378 (394) T ss_pred ccccccceeEEeccccc---CCCCCceEEEEeeccccEEEEeecceEEEE-ecccccceeEEEEEEec-cEEeccccEEE Confidence 0111222111112 222223344444333333332222232221 11111222345567875 56777999988 Q ss_pred eecC Q lcl|NC_019522. 308 VDGV 311 (311) Q Consensus 308 ~dGI 311 (311) +..= T Consensus 379 ~~~~ 382 (394) T protein:vir:10 379 VTNT 382 (394) T ss_pred EEee Confidence 7754 No 108 >protein:vir:3613 Length: 272 # NCBI annotation: MHP # Family: family:all:522 # MgeID: mge:74 # MgeName: TP901-1 # Cross-refs: genbank:acc:NP_112699;genbank:gi:13786567;genbank:GeneID:921035 Probab=97.40 E-value=7.7e-05 Score=43.15 Aligned_cols=266 Identities=10% Similarity=0.024 Sum_probs=129.3 Q ss_pred CCcccccccchhhhhhhHHHHHHHHHHHHhhhhhhhhhhhhccccCCCCc-ceeEEEEEEeecccceEEecCccccccee Q lcl|NC_019522. 1 MAKSVFDVSPVSALSFLVNQAAHIESEIYRIEYPQFKYGTLLPLDNSAPD-WAQAVMFRSIDARGELQLFGPNSTDVPTV 79 (311) Q Consensus 1 ~~~~~~~~~~~~~~~fl~~~L~~id~~v~~~~~~~~~~~~~~~v~~~~~~-~~~~~~~~~~~~~G~a~~~~~~a~dip~v 79 (311) ||.- ...-+.+ +..| -+.+-|.+.....+....+..+.....- .-.++.+..++..|.+..++++ ++++.- T Consensus 1 ma~~---~T~~~d~--iiPe--v~~~~v~~~~~~~~~~~~~~~~~~~l~g~~G~ti~iP~~~~~gda~~~~eg-~~i~~~ 72 (272) T protein:vir:36 1 MSKQ---KTTLADL--VNPE--VLAPIVSYELNKALRFAPLAQVDTTLQGQPGNTLKFPAFTYIGDAADVAEG-GEISLD 72 (272) T ss_pred CCCc---ceehhhh--hchH--HHHHHHHHHHHhhhhhccccccccccccCCCCEEEEeeeccCccccccCCC-CccChh Confidence 4321 0111111 1121 1222233333344455555555443221 1346778888888999888876 568888 Q ss_pred eeeccceeEEEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhhheeeeeccccceeeeecCCcceeeccCC Q lcl|NC_019522. 80 DIAMSQGFKDINTAALGYTYSIEEIGFAMLNNVNLDAERGQAVRDVVEQGLNKIYLLGDKGVGEGLYTSPNVSVEAATST 159 (311) Q Consensus 80 ~~~~~~~~~~v~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aa~~~~~~~~n~~~~~G~~~~g~GllN~p~v~~~~~~~~ 159 (311) +++.+.....+...+.+|++ .|+++++ .+-++-..-...+...+++..|+-++-. +.- . ....+ T Consensus 73 ~lt~~~~~~~i~~~~k~~~v--tD~~~~~-~~~d~~~~~~~~~a~~~a~~~d~~i~~~--------l~~--~---~~~~~ 136 (272) T protein:vir:36 73 KIGTTTKSVTIKKAAKGTEI--TDEAALS-GYGDPIGESNKQLGLSLANKVDDDLLSA--------AKT--T---SQTVS 136 (272) T ss_pred hcCCcceeEeeehhhccccc--cHHHHhh-ccchHHHHHHHHHHHHHHHHHHHHHHHH--------hcc--c---ccccc Confidence 88888888888887665555 5555544 3445555666666677777777644311 100 0 00000 Q ss_pred ccccCcccccCCHHHHHHHHHHHHHHHHhccCCceecceEEEeCHHHHHHHhccccc--CCCCCcchHHH-HHHHhCCce Q lcl|NC_019522. 160 FVALVAAIPTNGTQPIIDFFGNAYNTVYLDNTLTVHRPNTFVLPPAQFQLLARTLLS--TQNASNVTLLQ-FLRTNFPDI 236 (311) Q Consensus 160 ~~~~~t~w~~~t~~ei~~di~~~~~~~~~~~~~~~~~p~~l~lpp~~~~~L~~~~~~--~~~~~~~Tvl~-~l~~n~~~l 236 (311) ...+ +++|.+++..+-.. ...+..++++|..+..|.+-..- ..+..+..++. -....+-.+ T Consensus 137 --------~~~~----~d~i~~A~~~lgd~----~~~~~~ivv~p~~~~~L~k~~~~~~~~~~~~~~~~~~G~ig~~~G~ 200 (272) T protein:vir:36 137 --------TKAN----VDGVQAALDIFNDE----DAQAYVLIVNPKDAAKIRKDANAKNIGSEVGANALINGTYADVLGA 200 (272) T ss_pred --------cccc----HHHHHHHHHHhhhc----CCCceEEEEcHHHHHHHhcccccccccccccccceeeeccceecCe Confidence 0112 45677777776432 12467899999999998652110 00010111100 000112235 Q ss_pred EEEEchhcccCCCCcccEEEEEEcCcceeEEeecchhhhccceeeCCceEEEeeeeeeeeEEEECCeEEEEe--ecC Q lcl|NC_019522. 237 TFEDDILLKGAGVAGADRMAVYKKEIRIVKGHDVMPLRFLAPATADNVNFKVPAILRTGGTEWRIPKAGHYV--DGV 311 (311) Q Consensus 237 ~i~~~~~l~~ag~~g~~~~v~y~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~gGv~i~~P~ai~~~--dGI 311 (311) +|+....+.. ++.....|-..+.-+.+...++.+...-.......-.+... ...|+.+.+|.+++.+ .|+ T Consensus 201 ~Vv~s~~~p~----~~~~~~~~~~~~gA~~~~~~~~~~vE~~R~~~~~~d~i~~~-~~y~~~v~~~~~vv~~t~~g~ 272 (272) T protein:vir:36 201 QIVRSKKLAE----GSALMFKIVSNSPALKLVLKRGVQVETDRDIVTKTTVITAD-EHYAAYLYDLTKVVNITFTGV 272 (272) T ss_pred eEEEeCCCCC----CceeEEEEEecccceeeeecCCcccccccchhhcCcEEEEE-EEEEEEEEcCccEEEEeecCC Confidence 6666555531 11122222222222332222232211111111222233333 3358999999987765 688 No 109 >protein:vir:97433 Length: 274 # NCBI annotation: ORF014 # Family: family:all:522 # MgeID: mge:1676 # MgeName: 92 # Cross-refs: genbank:acc:YP_240749;genbank:gi:66396420;genbank:GeneID:5133789 Probab=97.38 E-value=8e-05 Score=43.04 Aligned_cols=261 Identities=10% Similarity=0.016 Sum_probs=131.3 Q ss_pred CCcccccccchhhhhhhHHHHHHHHHHHHhhhhhhhhhhhhccccCC--CCcceeEEEEEEeecccceEEecCcccccce Q lcl|NC_019522. 1 MAKSVFDVSPVSALSFLVNQAAHIESEIYRIEYPQFKYGTLLPLDNS--APDWAQAVMFRSIDARGELQLFGPNSTDVPT 78 (311) Q Consensus 1 ~~~~~~~~~~~~~~~fl~~~L~~id~~v~~~~~~~~~~~~~~~v~~~--~~~~~~~~~~~~~~~~G~a~~~~~~a~dip~ 78 (311) ||.. ...-+.+ +.. |-+.+.|.+.....+....+..+... +.+| .++.+..++..|.++.+..+ ++|+. T Consensus 1 ma~~---~T~~~d~--iiP--ev~~~~v~~~~~~~l~~~~~~~~d~~l~g~~G-~tv~iP~~~~~g~a~~~~~g-~~i~~ 71 (274) T protein:vir:97 1 MPQG---LTKTSDQ--IIP--EVLAPMMQAQLEKKLRFASFAEVDSTLQGQPG-DTLTFPAFVYSGDAQVVAEG-EKIPT 71 (274) T ss_pred CCcc---ceehhhe--ech--HHHHHHHHHhhhhhhhhcccceecccccCCCC-CEEEEeeecCCCccccccCC-Ccccc Confidence 4431 1111111 122 22333333444444555556555433 2233 57788888888999988765 57888 Q ss_pred eeeeccceeEEEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhhheeeeeccccceeeeecCCcceeeccC Q lcl|NC_019522. 79 VDIAMSQGFKDINTAALGYTYSIEEIGFAMLNNVNLDAERGQAVRDVVEQGLNKIYLLGDKGVGEGLYTSPNVSVEAATS 158 (311) Q Consensus 79 v~~~~~~~~~~v~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aa~~~~~~~~n~~~~~G~~~~g~GllN~p~v~~~~~~~ 158 (311) -++..+.....+...+.+|++ .|+++++..+ ++-......+.++++++.|+.++.- ++.-.. .. T Consensus 72 ~~lt~~~~~~~i~~~~~~~~i--~D~~~~~~~~-dp~~~~~~~~a~a~a~~vd~~~~~~--------l~~a~~-----~~ 135 (274) T protein:vir:97 72 DILETKKREAKIRKIAKGTSI--TDEALLSGYG-DPQGEQVRQHGLAHANKVDNDVLEA--------LMGAKL-----TV 135 (274) T ss_pred cccccceeEEEeeeecceecc--cHHHHHhccc-hHHHHHHHHHHHHHHHHHHHHHHHH--------HhccCc-----cc Confidence 888888888888887665555 5555555444 4445666777778888888765521 111000 00 Q ss_pred CccccCcccccCCHHHHHHHHHHHHHHHHhccCCceecceEEEeCHHHHHHHhccc----ccCCCCCcchHH-HHHHHhC Q lcl|NC_019522. 159 TFVALVAAIPTNGTQPIIDFFGNAYNTVYLDNTLTVHRPNTFVLPPAQFQLLARTL----LSTQNASNVTLL-QFLRTNF 233 (311) Q Consensus 159 ~~~~~~t~w~~~t~~ei~~di~~~~~~~~~~~~~~~~~p~~l~lpp~~~~~L~~~~----~~~~~~~~~Tvl-~~l~~n~ 233 (311) + + . .-+ ++.|.++..++-.. ...+..|+++|..+..|.+.- +..++. +..++ .-....+ T Consensus 136 ~--~-~----~~~----~d~i~dA~~~l~d~----~~~~~~ivv~p~~~~~L~k~~~~~f~~~s~~-g~~~~~~G~ig~~ 199 (274) T protein:vir:97 136 N--A-D----ITK----LNGLQSAIDKFNDE----DLEPMVLFVNPLDAGKLRGDASTNFTRATEL-GDDIIVKGAFGEA 199 (274) T ss_pred c--c-c----ccC----HHHHHHHHHHhhcc----CCCceEEEeCHHHHHHHHhhhhhhccccCcc-cccceecccccee Confidence 0 0 0 011 45667777766432 125689999999999997531 111111 11111 0000011 Q ss_pred CceEEEEchhcccCCCCcccEEEEEEcCcceeEEeecchhhhccceeeCCceEEEeeeeeeeeEEEECCeEEEEeecC Q lcl|NC_019522. 234 PDITFEDDILLKGAGVAGADRMAVYKKEIRIVKGHDVMPLRFLAPATADNVNFKVPAILRTGGTEWRIPKAGHYVDGV 311 (311) Q Consensus 234 ~~l~i~~~~~l~~ag~~g~~~~v~y~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~gGv~i~~P~ai~~~dGI 311 (311) -..+|+..+.+. ....+++. +.-+.+....+.+...-.......-.+.... ..|+-+.+|..++.+.-= T Consensus 200 ~G~~Vi~s~~~p------~~t~~l~~--~gA~~~~~~~~~~vE~~Rd~~~~~d~i~~~~-~y~~~~~~~~~vv~~t~~ 268 (274) T protein:vir:97 200 LGAIIVRTNKLE------AGTAILAK--KGAVKLILKRDFFLEVARDASTKTTALYSDK-HYVAYLYDESKAVKITKG 268 (274) T ss_pred cCeeEEEcCCCC------cceEEEEe--CcceEeeecCCceeccccchhhcccEEEEEE-EEEEEEEcCCceEEEecC Confidence 224455444332 12222222 2334443333332211111112222333333 357889999888887754 No 110 >protein:vir:94494 Length: 274 # NCBI annotation: ORF015 # Family: family:all:522 # MgeID: mge:1508 # MgeName: 88 # Cross-refs: genbank:acc:YP_240676;genbank:gi:66396348;genbank:GeneID:5133758 Probab=97.38 E-value=8e-05 Score=43.04 Aligned_cols=261 Identities=10% Similarity=0.016 Sum_probs=131.3 Q ss_pred CCcccccccchhhhhhhHHHHHHHHHHHHhhhhhhhhhhhhccccCC--CCcceeEEEEEEeecccceEEecCcccccce Q lcl|NC_019522. 1 MAKSVFDVSPVSALSFLVNQAAHIESEIYRIEYPQFKYGTLLPLDNS--APDWAQAVMFRSIDARGELQLFGPNSTDVPT 78 (311) Q Consensus 1 ~~~~~~~~~~~~~~~fl~~~L~~id~~v~~~~~~~~~~~~~~~v~~~--~~~~~~~~~~~~~~~~G~a~~~~~~a~dip~ 78 (311) ||.. ...-+.+ +.. |-+.+.|.+.....+....+..+... +.+| .++.+..++..|.++.+..+ ++|+. T Consensus 1 ma~~---~T~~~d~--iiP--ev~~~~v~~~~~~~l~~~~~~~~d~~l~g~~G-~tv~iP~~~~~g~a~~~~~g-~~i~~ 71 (274) T protein:vir:94 1 MPQG---LTKTSDQ--IIP--EVLAPMMQAQLEKKLRFASFAEVDSTLQGQPG-DTLTFPAFVYSGDAQVVAEG-EKIPT 71 (274) T ss_pred CCcc---ceehhhe--ech--HHHHHHHHHhhhhhhhhcccceecccccCCCC-CEEEEeeecCCCccccccCC-Ccccc Confidence 4431 1111111 122 22333333444444555556555433 2233 57788888888999988765 57888 Q ss_pred eeeeccceeEEEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhhheeeeeccccceeeeecCCcceeeccC Q lcl|NC_019522. 79 VDIAMSQGFKDINTAALGYTYSIEEIGFAMLNNVNLDAERGQAVRDVVEQGLNKIYLLGDKGVGEGLYTSPNVSVEAATS 158 (311) Q Consensus 79 v~~~~~~~~~~v~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aa~~~~~~~~n~~~~~G~~~~g~GllN~p~v~~~~~~~ 158 (311) -++..+.....+...+.+|++ .|+++++..+ ++-......+.++++++.|+.++.- ++.-.. .. T Consensus 72 ~~lt~~~~~~~i~~~~~~~~i--~D~~~~~~~~-dp~~~~~~~~a~a~a~~vd~~~~~~--------l~~a~~-----~~ 135 (274) T protein:vir:94 72 DILETKKREAKIRKIAKGTSI--TDEALLSGYG-DPQGEQVRQHGLAHANKVDNDVLEA--------LMGAKL-----TV 135 (274) T ss_pred cccccceeEEEeeeecceecc--cHHHHHhccc-hHHHHHHHHHHHHHHHHHHHHHHHH--------HhccCc-----cc Confidence 888888888888887665555 5555555444 4445666777778888888765521 111000 00 Q ss_pred CccccCcccccCCHHHHHHHHHHHHHHHHhccCCceecceEEEeCHHHHHHHhccc----ccCCCCCcchHH-HHHHHhC Q lcl|NC_019522. 159 TFVALVAAIPTNGTQPIIDFFGNAYNTVYLDNTLTVHRPNTFVLPPAQFQLLARTL----LSTQNASNVTLL-QFLRTNF 233 (311) Q Consensus 159 ~~~~~~t~w~~~t~~ei~~di~~~~~~~~~~~~~~~~~p~~l~lpp~~~~~L~~~~----~~~~~~~~~Tvl-~~l~~n~ 233 (311) + + . .-+ ++.|.++..++-.. ...+..|+++|..+..|.+.- +..++. +..++ .-....+ T Consensus 136 ~--~-~----~~~----~d~i~dA~~~l~d~----~~~~~~ivv~p~~~~~L~k~~~~~f~~~s~~-g~~~~~~G~ig~~ 199 (274) T protein:vir:94 136 N--A-D----ITK----LNGLQSAIDKFNDE----DLEPMVLFVNPLDAGKLRGDASTNFTRATEL-GDDIIVKGAFGEA 199 (274) T ss_pred c--c-c----ccC----HHHHHHHHHHhhcc----CCCceEEEeCHHHHHHHHhhhhhhccccCcc-cccceecccccee Confidence 0 0 0 011 45667777766432 125689999999999997531 111111 11111 0000011 Q ss_pred CceEEEEchhcccCCCCcccEEEEEEcCcceeEEeecchhhhccceeeCCceEEEeeeeeeeeEEEECCeEEEEeecC Q lcl|NC_019522. 234 PDITFEDDILLKGAGVAGADRMAVYKKEIRIVKGHDVMPLRFLAPATADNVNFKVPAILRTGGTEWRIPKAGHYVDGV 311 (311) Q Consensus 234 ~~l~i~~~~~l~~ag~~g~~~~v~y~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~gGv~i~~P~ai~~~dGI 311 (311) -..+|+..+.+. ....+++. +.-+.+....+.+...-.......-.+.... ..|+-+.+|..++.+.-= T Consensus 200 ~G~~Vi~s~~~p------~~t~~l~~--~gA~~~~~~~~~~vE~~Rd~~~~~d~i~~~~-~y~~~~~~~~~vv~~t~~ 268 (274) T protein:vir:94 200 LGAIIVRTNKLE------AGTAILAK--KGAVKLILKRDFFLEVARDASTKTTALYSDK-HYVAYLYDESKAVKITKG 268 (274) T ss_pred cCeeEEEcCCCC------cceEEEEe--CcceEeeecCCceeccccchhhcccEEEEEE-EEEEEEEcCCceEEEecC Confidence 224455444332 12222222 2334443333332211111112222333333 357889999888887754 No 111 >protein:vir:4830 Length: 397 # NCBI annotation: MPL-7201 # Family: family:all:21 # MgeID: mge:105 # MgeName: 7201 # Cross-refs: genbank:acc:NP_038327;genbank:gi:9634653;genbank:GeneID:1262632 Probab=97.36 E-value=4.6e-05 Score=44.36 Aligned_cols=268 Identities=14% Similarity=-0.008 Sum_probs=137.7 Q ss_pred CCcccccccchhhhhhhHHHHHHHHHHHHhhhhhhhhhhhhccccCCCCcceeEEEEEEeecccceEEecCccccccee- Q lcl|NC_019522. 1 MAKSVFDVSPVSALSFLVNQAAHIESEIYRIEYPQFKYGTLLPLDNSAPDWAQAVMFRSIDARGELQLFGPNSTDVPTV- 79 (311) Q Consensus 1 ~~~~~~~~~~~~~~~fl~~~L~~id~~v~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~G~a~~~~~~a~dip~v- 79 (311) ...-.+.....+.+++++. +.+.+.|++........+.++.+..-....-........+..+.+.+++.+ ..+|.. T Consensus 104 ~~~~~~~~~t~~~gg~~iP--~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~-~~~~~~~ 180 (397) T protein:vir:48 104 NLLDSKTDASGSDAGLTIP--QDIQTAIHTLVRQYDSLQEYVNVENVTTLTGSRVYEKWADITGLAKLDDEA-GSIGTND 180 (397) T ss_pred HHHHHhhccCCcccccccc--HHHHHHHHHHHHHHHHHHhhhceeeccCCcceEEEEeecCCCcceeeeccc-ccccccc Confidence 1111111112223344554 456677888877777777776654321111122222233445567777665 445654 Q ss_pred eeeccceeEEEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhhheeeeeccccceeeeecCCcceeeccCC Q lcl|NC_019522. 80 DIAMSQGFKDINTAALGYTYSIEEIGFAMLNNVNLDAERGQAVRDVVEQGLNKIYLLGDKGVGEGLYTSPNVSVEAATST 159 (311) Q Consensus 80 ~~~~~~~~~~v~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aa~~~~~~~~n~~~~~G~~~~g~GllN~p~v~~~~~~~~ 159 (311) ...++......+.++..+.+|.+=|+ ....++...-.....+++.+.+|+-+++|+...+. T Consensus 181 ~~~~~~v~~~~~k~~~~~~iS~ell~---ds~~~l~~~v~~~l~~~~~~~~d~~il~G~g~~~~---------------- 241 (397) T protein:vir:48 181 DPKLYPIRYAIKRYAGISTVTNSLLA---DSAENILAWLSGWIAKKVVVTRNKAILEAIATLPT---------------- 241 (397) T ss_pred ccceeeEEeeheeeeeehhhHHHHHh---hchHHHHHHHHHHHHHHHHHHHHHHHhhccccccc---------------- Confidence 35788888899999998888855333 34567888888889999999999999999764210 Q ss_pred ccccCcccccCCHHHHHHHHHHHHHHHHhccCCceecceEEEeCHHHHHHHhcccccCCCCCcchHHHH-HHH------h Q lcl|NC_019522. 160 FVALVAAIPTNGTQPIIDFFGNAYNTVYLDNTLTVHRPNTFVLPPAQFQLLARTLLSTQNASNVTLLQF-LRT------N 232 (311) Q Consensus 160 ~~~~~t~w~~~t~~ei~~di~~~~~~~~~~~~~~~~~p~~l~lpp~~~~~L~~~~~~~~~~~~~Tvl~~-l~~------n 232 (311) .++. .+ +++|.+++.++... + ..+..++++|+.+..|.+.. +..|.-++.- +.. + T Consensus 242 -~~~~-----~~----~d~i~~~~~~l~~~--~--~~~a~~v~n~~~~~~L~~lk----d~~G~~i~~~~~~~~~~~~l~ 303 (397) T protein:vir:48 242 -KPTL-----TK----WDDIIDLQAKVDPA--I--KQTSFFLTNTSGFTALKKVK----NAFGDYLMERDVKSPTGYSID 303 (397) T ss_pred -cccc-----cc----HHHHHHHHHHhhhh--h--cCCCEEEECHHHHHHHHHhh----cCCCceeeccCcCCCCCceec Confidence 0000 12 34566666666422 2 23468899999999986532 1112211110 000 1 Q ss_pred CCceEEEEchhcccCCCCcccEEEEEEcCcceeEEeecchhhhcc-ce---eeCCceEEEeeeeeeeeEEEECCeEEEEe Q lcl|NC_019522. 233 FPDITFEDDILLKGAGVAGADRMAVYKKEIRIVKGHDVMPLRFLA-PA---TADNVNFKVPAILRTGGTEWRIPKAGHYV 308 (311) Q Consensus 233 ~~~l~i~~~~~l~~ag~~g~~~~v~y~~~~~~~~~~~~~~~~~~~-p~---~~~~~~~~~~~~~~~gGv~i~~P~ai~~~ 308 (311) +.++.+.....+. .+..+...++ |-+=.+.+.+..-..+++.. +. ....-.....++.|+ ++.+++|.+++.+ T Consensus 304 G~PV~~~~~~~~~-~~~~~~~~~~-~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~r~~~r~-d~~~~~~~a~~~~ 380 (397) T protein:vir:48 304 GFAVKEVADRWLA-NASSGAMPLY-FGDLKQAVTLFDRQQMSLLSTNIGGGAFETDTTKIRVIDRF-DVVATDTESFVPA 380 (397) T ss_pred cceeEEecccccC-CcCCCceEEE-EEeccceEEEEeecceEEEEeccchhhhhcCceeEEEEeee-ccEEecccceEEE Confidence 2223332222222 2233333333 33222323222211221110 00 001112455667777 4677899999665 Q ss_pred ec--C Q lcl|NC_019522. 309 DG--V 311 (311) Q Consensus 309 dG--I 311 (311) +- . T Consensus 381 ~~~~~ 385 (397) T protein:vir:48 381 SFKAI 385 (397) T ss_pred Eeccc Confidence 53 3 No 112 >protein:vir:95376 Length: 425 # NCBI annotation: phage major capsid protein # Family: family:all:635 # MgeID: mge:1567 # MgeName: GBSV1 # Cross-refs: genbank:acc:YP_764476;genbank:gi:115334630;genbank:GeneID:5179263 Probab=97.35 E-value=3.3e-05 Score=45.15 Aligned_cols=274 Identities=9% Similarity=-0.011 Sum_probs=136.6 Q ss_pred CCcccccccchhhhhhhHHHHHHHHHHHHhhhhhhhhhhhhccccCCCCcceeEEEEEEeecccceEEecCcccccceee Q lcl|NC_019522. 1 MAKSVFDVSPVSALSFLVNQAAHIESEIYRIEYPQFKYGTLLPLDNSAPDWAQAVMFRSIDARGELQLFGPNSTDVPTVD 80 (311) Q Consensus 1 ~~~~~~~~~~~~~~~fl~~~L~~id~~v~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~G~a~~~~~~a~dip~v~ 80 (311) .........+.+++++++. +.+.+.|++........+.++.+.. .+ + ...+.+....+.+.|++.++ .+|..+ T Consensus 133 ~~~~~~~~~~~~~gg~~vP--~~~~~~Ii~~l~~~~~i~~~~~~~~-~~-g--~~~ip~~~~~~~a~~v~E~~-~~~~~~ 205 (425) T protein:vir:95 133 FYEKFRNLRAVAGGELTIP--EVVVNRIMDIMGDYTTLYPLVDKIR-VK-G--TTRILVDTDTSPATWIEQSG-ALPTGD 205 (425) T ss_pred HHHHHHhhcccccCceecc--HHHHHHHHHHHHhhhhHHHhhceee-cC-c--eeEEEEecCCcccccccccc-cccccc Confidence 0011111123345556665 4466778877766666666655432 22 2 23455666777888888664 467666 Q ss_pred e-eccceeEEEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhhheeeeecccc-c--eeeeecCCcceeec Q lcl|NC_019522. 81 I-AMSQGFKDINTAALGYTYSIEEIGFAMLNNVNLDAERGQAVRDVVEQGLNKIYLLGDKGV-G--EGLYTSPNVSVEAA 156 (311) Q Consensus 81 ~-~~~~~~~~v~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aa~~~~~~~~n~~~~~G~~~~-g--~GllN~p~v~~~~~ 156 (311) . .+++.....+.++.-+.+|.+=|. .+..+|.+--....+.++.+.+++-+++|+... + .|+|+.-... .. T Consensus 206 ~~~f~~i~l~~~k~~~~~~iS~ell~---ds~~~l~~~i~~~l~~~i~~~~d~~il~G~G~~~~~p~Gil~~~~~~--~~ 280 (425) T protein:vir:95 206 VGTIASIDFDGFKVGKVTFVDNYLLQ---DSIINLDDYVTKKIARAIAKALDLAIVKGTGAANKQPLGIIPSLPPE--NQ 280 (425) T ss_pred ccccceeeeeheeeeeeehhhHHHHh---ccHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCccccceeecccccc--cc Confidence 5 478888899999988888855333 334568888899999999999999999998653 2 5998763221 11 Q ss_pred cCCccccCcccccCCHHHHHHHHHHHHHHHHhccCCceecceEEEeCHHH-HHHHhcccccCCCCCcchHHHHHHHhCC- Q lcl|NC_019522. 157 TSTFVALVAAIPTNGTQPIIDFFGNAYNTVYLDNTLTVHRPNTFVLPPAQ-FQLLARTLLSTQNASNVTLLQFLRTNFP- 234 (311) Q Consensus 157 ~~~~~~~~t~w~~~t~~ei~~di~~~~~~~~~~~~~~~~~p~~l~lpp~~-~~~L~~~~~~~~~~~~~Tvl~~l~~n~~- 234 (311) ... .+.+ ..++++.+++..+... +........+|.+.. +..|.+-.. ..+..|.-++. ..+.+ T Consensus 281 ~~~-------~~~~---~~~~~~~~~~~~~~~~--~~~~~~~~~v~~~~~~~~~l~~l~~-~kd~~g~~i~~--~~~~~~ 345 (425) T protein:vir:95 281 VTV-------EADN---NLLKNLVKQIGLIDTG--DDSVGEIVAVMKRSTYYNRLVEFSI-QVDSNGNVVGK--LPNLRT 345 (425) T ss_pred ccc-------cccc---chHHHHHHHHHhhhhh--ccccCceEEEEeChHHHHHHHHHHh-hcCCCCceeec--cCCCCC Confidence 110 1111 1256677777665432 211112245566554 443422110 01111211111 01111 Q ss_pred ----ceEEEEchhcccCCCCcccEEEEEEcCcceeEEeecchhh--hccceeeCCceEEEeeeeeeeeEEEECCeEEEEe Q lcl|NC_019522. 235 ----DITFEDDILLKGAGVAGADRMAVYKKEIRIVKGHDVMPLR--FLAPATADNVNFKVPAILRTGGTEWRIPKAGHYV 308 (311) Q Consensus 235 ----~l~i~~~~~l~~ag~~g~~~~v~y~~~~~~~~~~~~~~~~--~~~p~~~~~~~~~~~~~~~~gGv~i~~P~ai~~~ 308 (311) ...++..+.+.. +. ++|-+-.++ -+..-..++ +..-.........+.++.|+ +..+++|.|++++ T Consensus 346 ~~l~G~pvv~~~~~~~------~~-i~~Gd~~~~-~~~~~~~~~i~~~~~~~f~~~~~~~~~~~r~-d~~~~~~~a~~~~ 416 (425) T protein:vir:95 346 PDLLGLRVVFNNFLDD------DT-VLFGEFEQY-TLVERENITIDSSTHVKFTEDQTAFRGKGRF-DGKPVKPEAFVLV 416 (425) T ss_pred ccccceeeEEcCcCCC------cc-EEEEecccE-EEEeecceEEEeecccccccCceEEEEEEee-CcEeecccceEEE Confidence 112222222211 11 222221111 111111111 11000001112344555676 4678999999998 Q ss_pred ecC Q lcl|NC_019522. 309 DGV 311 (311) Q Consensus 309 dGI 311 (311) + | T Consensus 417 ~-i 418 (425) T protein:vir:95 417 T-I 418 (425) T ss_pred E-e Confidence 5 4 No 113 >protein:vir:78640 Length: 352 # NCBI annotation: phage capsid # Family: family:all:658 # MgeID: mge:1855 # MgeName: tp310-2 # Cross-refs: genbank:acc:YP_001429943;genbank:gi:156603997;genbank:GeneID:5525386 Probab=97.30 E-value=2.2e-05 Score=46.10 Aligned_cols=265 Identities=10% Similarity=-0.041 Sum_probs=138.6 Q ss_pred CCcccccccchhhhhhhHHHHHHHHHHHHhhhhhhhhhhhhccccCCCCcceeEEEEEEeecccceEEecCcccccceee Q lcl|NC_019522. 1 MAKSVFDVSPVSALSFLVNQAAHIESEIYRIEYPQFKYGTLLPLDNSAPDWAQAVMFRSIDARGELQLFGPNSTDVPTVD 80 (311) Q Consensus 1 ~~~~~~~~~~~~~~~fl~~~L~~id~~v~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~G~a~~~~~~a~dip~v~ 80 (311) -...+|...+.++++|++. +.+.++|++........+.+..+.+..+. . ...+....+.+.|++.+ ..+|..+ T Consensus 78 ~~~~al~~~~~~~gG~lIP--~~~~~~Ii~~l~~~s~l~~~~~v~~~~~~---~-~p~~~~~~~~a~~v~E~-~~~~~~~ 150 (352) T protein:vir:78 78 RLLHALPTGNDSGGDKLLP--KTLSKEIVSEPFAKNQLREKARLTNIKGL---E-IPRVSYTLDDDDFITDV-ETAKELK 150 (352) T ss_pred HHHHHhccCCCCCCceecc--HhHHHHHHHHHHhhcchhhheeeEecCCc---e-EEEEecCCCcccccccc-ccccccc Confidence 0112334444556678886 45677788877777677777776543321 1 12223344677887654 5578888 Q ss_pred eeccceeEEEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhhheee-eeccccc-eeeeecCCcceeeccC Q lcl|NC_019522. 81 IAMSQGFKDINTAALGYTYSIEEIGFAMLNNVNLDAERGQAVRDVVEQGLNKIYL-LGDKGVG-EGLYTSPNVSVEAATS 158 (311) Q Consensus 81 ~~~~~~~~~v~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aa~~~~~~~~n~~~~-~G~~~~g-~GllN~p~v~~~~~~~ 158 (311) ..++......+.++.-+.++.+=|+ .+..+|..--....++++...++..+| .|+.... .|.++++++...+.. T Consensus 151 ~~f~~v~~~~~k~~~~i~is~ell~---Ds~~~l~~~i~~~la~~~~~~e~~~~~~~g~g~~~~~g~l~~~~~~~~t~~- 226 (352) T protein:vir:78 151 LKGDTVKFTTNKFKVFAAISDTVIH---GSDVDLVNWVENALQSGLAAKERKDALAVSPKSGLEHMSFYNGSVKEVEGA- 226 (352) T ss_pred ccceeeeecceeEEeechhhHHHHh---hhhHHHHHHHHHHHHHHHHHHHHHhhhhcCCCCcccccceecccccccccc- Confidence 8899999999999998888865333 345677777777778888777777665 4443322 588888776433211 Q ss_pred CccccCcccccCCHHHHHHHHHHHHHHHHhccCCceecceEEEeCHHHHHHHhcccccCCCCCcchHHHHHHHhCCceEE Q lcl|NC_019522. 159 TFVALVAAIPTNGTQPIIDFFGNAYNTVYLDNTLTVHRPNTFVLPPAQFQLLARTLLSTQNASNVTLLQFLRTNFPDITF 238 (311) Q Consensus 159 ~~~~~~t~w~~~t~~ei~~di~~~~~~~~~~~~~~~~~p~~l~lpp~~~~~L~~~~~~~~~~~~~Tvl~~l~~n~~~l~i 238 (311) ..+++|.+++..+... +. ..-+.+|.+..+..|.+..-. .+..++. ..+-++ T Consensus 227 ---------------~~~d~i~~~~~~l~~~--~~--~~a~~~mn~~t~~~l~~~~~~----~~~~~~~-----~~~~~l 278 (352) T protein:vir:78 227 ---------------NMYDAIINALADLHED--YR--DNATIYMRYADYVKIISVLSN----GTTNFFD-----TPAEKV 278 (352) T ss_pred ---------------chHHHHHHHHhccChh--hh--cCCEEEEehHHHHHHHHHHhc----cCCcccc-----cCCccc Confidence 1245666666665321 21 123577777777666443211 1222221 111122 Q ss_pred EEchhcccCCCCcccEEEEEEcCcceeEEeecchhhhccceeeCCceEEEeeeeeeeeEEEECCeEEEEeecC Q lcl|NC_019522. 239 EDDILLKGAGVAGADRMAVYKKEIRIVKGHDVMPLRFLAPATADNVNFKVPAILRTGGTEWRIPKAGHYVDGV 311 (311) Q Consensus 239 ~~~~~l~~ag~~g~~~~v~y~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~gGv~i~~P~ai~~~dGI 311 (311) ...|-....+. ++ +++-+-+.=++.. ..+.+-.--+.......+.+..|++|. +.+|.||+.+.-= T Consensus 279 lG~PV~~~~~~--~~-~~~Gdf~~~~~~~---~~~~~~~~~~~~~g~~~f~~~~r~Dg~-~~~~eA~~~l~~~ 344 (352) T protein:vir:78 279 FGKPVVFTDAA--VK-PIVGDFNYFGINY---DGTTYDTDKDVKKGEYLFVLTAWYDQQ-RTLDSAFRIAKAK 344 (352) T ss_pred cccceEEecCC--Cc-eeEeehhhhhhhh---hhheeeeeccccCCeeEEEEEeeeCce-eechhheEEEEee Confidence 22221111110 01 1110000000000 011110001111223556667888655 6779999777544 No 114 >protein:vir:81160 Length: 371 # NCBI annotation: major capsid protein # Family: family:all:21 # MgeID: mge:1892 # MgeName: Geobacillus virus E2 # Cross-refs: genbank:acc:YP_001285811;genbank:gi:148747732;genbank:GeneID:5247203 Probab=97.17 E-value=0.00012 Score=42.05 Aligned_cols=267 Identities=9% Similarity=0.006 Sum_probs=138.9 Q ss_pred CCcccccccchhhhhhhHHHHHHHHHHHHhhhhhhhhhhhhccccCCCCcceeEEEEEEeecccceEEecCcccccce-e Q lcl|NC_019522. 1 MAKSVFDVSPVSALSFLVNQAAHIESEIYRIEYPQFKYGTLLPLDNSAPDWAQAVMFRSIDARGELQLFGPNSTDVPT-V 79 (311) Q Consensus 1 ~~~~~~~~~~~~~~~fl~~~L~~id~~v~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~G~a~~~~~~a~dip~-v 79 (311) .-..+|.+.+.+.+++++. +.+.+.|++........+.++++.. .+-..-.+.+......+.+.+++.+ ..+|. . T Consensus 86 ~~~~a~~~~t~~~gg~~vP--~~~~~~ii~~~~~~s~i~~~~~~~~-~~~~~~~~~~~~~~~~~~a~~v~Eg-~~~~~~~ 161 (371) T protein:vir:81 86 RFRNAMSEGSNQDGGYTVP--QDIQTRINELRESKDALQNLITVEP-VTTLSGSRVFKKRSQQTGFVEVAEG-AAIGEKA 161 (371) T ss_pred HHHHhhccCCCccCceeec--HhHHHHHHHHHHhhhhhhhhceeee-ccCCceeEEEEeecCCcceeeeccc-ccccccc Confidence 1112233333334455554 4566788888888777777776543 2222333444444555678888776 44664 4 Q ss_pred eeeccceeEEEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhhheeeeeccccc-eeeeecCCcceeeccC Q lcl|NC_019522. 80 DIAMSQGFKDINTAALGYTYSIEEIGFAMLNNVNLDAERGQAVRDVVEQGLNKIYLLGDKGVG-EGLYTSPNVSVEAATS 158 (311) Q Consensus 80 ~~~~~~~~~~v~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aa~~~~~~~~n~~~~~G~~~~g-~GllN~p~v~~~~~~~ 158 (311) +..++......+.++..+.+|.+=++. +..+|..--.....+++.+.+|+.+++|+.... .|. T Consensus 162 ~~~f~~i~~~~~k~~~~~~iS~ell~d---s~~~l~~~i~~~l~~a~~~~~~~~i~~g~g~~~~~~~------------- 225 (371) T protein:vir:81 162 TPQFTLLQYQVKKYAGFFRVTNELLND---STEAIVNTLVRWIGDESRVTRNGLIINVLNTKAKTAI------------- 225 (371) T ss_pred ccceeeEEeeeeEEEEeehhhHHHHhh---hhHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccc------------- Confidence 578899999999999999998654433 345788888888899999999999999876421 111 Q ss_pred CccccCcccccCCHHHHHHHHHHHHHHHHhccCCceecceEEEeCHHHHHHHhcccccCCCCCcchHHHH-HHHhCC--- Q lcl|NC_019522. 159 TFVALVAAIPTNGTQPIIDFFGNAYNTVYLDNTLTVHRPNTFVLPPAQFQLLARTLLSTQNASNVTLLQF-LRTNFP--- 234 (311) Q Consensus 159 ~~~~~~t~w~~~t~~ei~~di~~~~~~~~~~~~~~~~~p~~l~lpp~~~~~L~~~~~~~~~~~~~Tvl~~-l~~n~~--- 234 (311) .+.+ ++..++..... ..+ .....++|+|+.+..|.+-. +..|.-++.- +....+ T Consensus 226 -----------~~~~----~i~~~~~~~l~-~~~--~~~a~~vmn~~~~~~L~~lk----d~~g~~l~~~~~~~~~~~~l 283 (371) T protein:vir:81 226 -----------ADLD----GLKQIINVQLD-PVF--RSTSSVIVNQDAFNWLDTLK----DQNGQYLLQPSISSPTGRQL 283 (371) T ss_pred -----------ccHH----HHHHHHHhhcc-hhh--hcCCEEEEcHHHHHHHHHhh----ccCCCeeeecccCCCCCcee Confidence 1233 33333322111 111 23347899999999886532 1111111100 000000 Q ss_pred -ceEEEEchhc-----ccCCCCcccEEEEEEcCcceeEEeecchhhhccceeeC----CceEEEeeeeeeeeEEEECCeE Q lcl|NC_019522. 235 -DITFEDDILL-----KGAGVAGADRMAVYKKEIRIVKGHDVMPLRFLAPATAD----NVNFKVPAILRTGGTEWRIPKA 304 (311) Q Consensus 235 -~l~i~~~~~l-----~~ag~~g~~~~v~y~~~~~~~~~~~~~~~~~~~p~~~~----~~~~~~~~~~~~gGv~i~~P~a 304 (311) ..-++.+..+ ...+.+.....++|-+=.+.+.+.....++...-.+.. .-...+.++.|+ +..+++|.+ T Consensus 284 ~G~pV~~~~~~~~~~~~~~~~~~~~~~i~~Gd~~~~~~~~~~~~~~i~~~~~~~~~f~~~~v~~~~~~r~-d~~~~~~~a 362 (371) T protein:vir:81 284 LGLPVVIVSNKVLANRVDGGTGAQFAPIIVGDLKEAVVMFDRQRTEIMSSNVAMDAFETDATLWRAIERM-DVKMRDDEA 362 (371) T ss_pred cceeEEEecccccCccccccccCCcceEEEEehhceEEEEeecceEEEEeccccchhhcCceEEEEEEee-ccEEecccc Confidence 0112211111 11122222222333332232322222222211100100 112456677787 467888999 Q ss_pred EEEeecC Q lcl|NC_019522. 305 GHYVDGV 311 (311) Q Consensus 305 i~~~dGI 311 (311) ++.++ + T Consensus 363 ~~~~~-~ 368 (371) T protein:vir:81 363 FVFGE-V 368 (371) T ss_pred eEEEE-E Confidence 99888 6 No 115 >protein:vir:9361 Length: 402 # NCBI annotation: SLT orf 37-like protein # Family: family:all:658 # MgeID: mge:166 # MgeName: phi 12 # Cross-refs: genbank:acc:NP_803339;genbank:gi:29028650;genbank:GeneID:1258088 Probab=97.15 E-value=3.9e-05 Score=44.73 Aligned_cols=264 Identities=9% Similarity=-0.044 Sum_probs=134.0 Q ss_pred CCcccccccchhhhhhhHHHHHHHHHHHHhhhhhhhhhhhhccccCCCCcceeEEEEEEe-ecccceEEecCccccccee Q lcl|NC_019522. 1 MAKSVFDVSPVSALSFLVNQAAHIESEIYRIEYPQFKYGTLLPLDNSAPDWAQAVMFRSI-DARGELQLFGPNSTDVPTV 79 (311) Q Consensus 1 ~~~~~~~~~~~~~~~fl~~~L~~id~~v~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~-~~~G~a~~~~~~a~dip~v 79 (311) -...++...+.+++++++. +.+...|++.....-..+.++.+.+..+ ..+... ...+.+.|++.+ ...|.. T Consensus 128 ~~~~a~~~~t~~~GG~lIP--~~~~~~Ii~~~~~~~~l~~~~~v~~~~~-----~~~p~~~~~~~~a~~v~Eg-~~~~~~ 199 (402) T protein:vir:93 128 RLLHALPTGNDSGGDKLLP--KTLSKEIVSEPFAKNQLREKARLTNIKG-----LEIPRVSYTLDDDDFITDV-ETAKEL 199 (402) T ss_pred HHHhhhccCCCcCCccccc--hhHHHHHHHhHHhhhhhhhhceeeecCC-----ceeeeeeccCCcccccccc-cccccc Confidence 1112233334445566765 4467778887776666677776654332 122222 334567787765 446777 Q ss_pred eeeccceeEEEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhhheeeeeccccc--eeeeecCCcceeecc Q lcl|NC_019522. 80 DIAMSQGFKDINTAALGYTYSIEEIGFAMLNNVNLDAERGQAVRDVVEQGLNKIYLLGDKGVG--EGLYTSPNVSVEAAT 157 (311) Q Consensus 80 ~~~~~~~~~~v~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aa~~~~~~~~n~~~~~G~~~~g--~GllN~p~v~~~~~~ 157 (311) +..++......+.++.-+.+|.+ |. .-+..++.+--....++++...+++.+|.+..+.| .|+++++++...+. T Consensus 200 ~~~f~~i~~~~~k~~~~i~iS~e-ll--~Ds~~~l~~~i~~~la~~~~~~e~~~~~~~g~g~g~p~g~~~~~~~~~~~~- 275 (402) T protein:vir:93 200 KAKGDTVKFTTNKFKVFAAISDT-VI--HGSDVDLVNWVENALQSGLAAKERKDALAVSPKSGLEHMSFYNGSVKEVEG- 275 (402) T ss_pred ccccceeeecceeeeeechhhHH-HH--hhhHHHHHHHHHHHHHHHHHHHHHHhHhhcCCCccccceeeeccccccccc- Confidence 88889999999999988888844 32 23456677777777888888887776665434434 48887766543221 Q ss_pred CCccccCcccccCCHHHHHHHHHHHHHHHHhccCCceecceEEEeCHHHHHHHhcccccCCCCCcchHHHHHHHhCCceE Q lcl|NC_019522. 158 STFVALVAAIPTNGTQPIIDFFGNAYNTVYLDNTLTVHRPNTFVLPPAQFQLLARTLLSTQNASNVTLLQFLRTNFPDIT 237 (311) Q Consensus 158 ~~~~~~~t~w~~~t~~ei~~di~~~~~~~~~~~~~~~~~p~~l~lpp~~~~~L~~~~~~~~~~~~~Tvl~~l~~n~~~l~ 237 (311) ...+++|.+++..+... +. ..-.++|.+..+..+.+..-. .+..++. ..+-+ T Consensus 276 ---------------~~~~d~l~~~~~~l~~~--y~--~na~~imn~~t~~~~~~~~~d----~~~~~~~-----~~~~~ 327 (402) T protein:vir:93 276 ---------------ADMYDAIINALADLHED--YR--DNATIYMRYADYVKIISVLSN----GTTNFFD-----TPAEK 327 (402) T ss_pred ---------------cchHHHHHHHHhccChh--hh--cCCEEEEechHHHHHHHHHhc----CCCcccc-----cCCcc Confidence 11256677777766321 21 122467776665554333211 1122211 11112 Q ss_pred EEEchhcccCCCCcccEEEEEEcCcceeEEeecchhhhccceeeCCceEEEeeeeeeeeEEEECCeEEEEeec--C Q lcl|NC_019522. 238 FEDDILLKGAGVAGADRMAVYKKEIRIVKGHDVMPLRFLAPATADNVNFKVPAILRTGGTEWRIPKAGHYVDG--V 311 (311) Q Consensus 238 i~~~~~l~~ag~~g~~~~v~y~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~gGv~i~~P~ai~~~dG--I 311 (311) |...|-....+. . + ++|-+=.......-.+-+.+. -+.......+-+..|++|. +.+|.||+++.= - T Consensus 328 llG~PV~~t~~~-~-~--i~~GDf~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~r~Dg~-v~~~~A~~~l~ik~~ 396 (402) T protein:vir:93 328 VFGKPVVFTDAA-V-K--PIVGDFNYFGINYDGTTYDTD--KDVKKGEYLFVLTAWYDQQ-RTLDSAFRIAKAKEN 396 (402) T ss_pred ccccceEEecCC-C-c--eeeechhhhhhhhhhhhhhhh--hcccCCceEEEEEEEeCcE-EechhheEEEEeecC Confidence 222222211111 0 1 111110111000000101111 1111223556677888554 567999986543 2 No 116 >protein:vir:96978 Length: 387 # NCBI annotation: ORF009 # Family: family:all:658 # MgeID: mge:1643 # MgeName: 42e # Cross-refs: genbank:acc:YP_239859;genbank:gi:66395517;genbank:GeneID:5133011 Probab=97.06 E-value=3.7e-05 Score=44.90 Aligned_cols=265 Identities=9% Similarity=-0.045 Sum_probs=134.5 Q ss_pred CCcccccccchhhhhhhHHHHHHHHHHHHhhhhhhhhhhhhccccCCCCcceeEEEEEEeecccceEEecCcccccceee Q lcl|NC_019522. 1 MAKSVFDVSPVSALSFLVNQAAHIESEIYRIEYPQFKYGTLLPLDNSAPDWAQAVMFRSIDARGELQLFGPNSTDVPTVD 80 (311) Q Consensus 1 ~~~~~~~~~~~~~~~fl~~~L~~id~~v~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~G~a~~~~~~a~dip~v~ 80 (311) -...++...+.+++++++. +.+..+|++.....-..+.++.+.+..+ .++ .++....+.+.|++.+ ...|..+ T Consensus 113 ~~~~a~~~~~~~~gG~lIP--~~~~~~Ii~~~~~~~~l~~~~~~~~~~~---~~~-p~~~~~~~~a~~v~Eg-~~~~~~~ 185 (387) T protein:vir:96 113 RLLHALPTGNDSGGDKLLP--KTLSKEIVSEPFAKNQLREKARLTNIKG---LEI-PRVSYTLDDDDFITDV-ETAKELK 185 (387) T ss_pred HHHhhhccCCCCCCceeec--hhHHHHHHHHHHhhchhhhhceeeecCC---cee-eeeeccCCcccccccc-ccccccc Confidence 1112233334445566665 4467778887777666677766544332 111 1223344567777654 4567778 Q ss_pred eeccceeEEEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhhheeeeeccccc--eeeeecCCcceeeccC Q lcl|NC_019522. 81 IAMSQGFKDINTAALGYTYSIEEIGFAMLNNVNLDAERGQAVRDVVEQGLNKIYLLGDKGVG--EGLYTSPNVSVEAATS 158 (311) Q Consensus 81 ~~~~~~~~~v~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aa~~~~~~~~n~~~~~G~~~~g--~GllN~p~v~~~~~~~ 158 (311) ..++......+.++.-+.+|.+ |. ..+..++..--.....+++...+++.+|.+..+.| .|.++.++++..+. T Consensus 186 ~~f~~v~l~~~k~~~~i~iS~e-ll--~ds~~~l~~~i~~~la~~~~~~e~~~~~~~g~g~g~~~g~~~~~~~~~~~~-- 260 (387) T protein:vir:96 186 AKGDTVKFTTNKFKVFAAISDT-VI--HGSDVDLVNWVENALQSGLAAKERKDALAVSPKSGLEHMSFYNGSVKEVEG-- 260 (387) T ss_pred cccceeeechheeeeechhhHH-HH--hhhHHHHHHHHHHHHHHHHHHHHHHhHhhcCCCccccceeeeccccccccc-- Confidence 8889999999999998888844 32 23456777777777778888887777665444434 48887766543221 Q ss_pred CccccCcccccCCHHHHHHHHHHHHHHHHhccCCceecceEEEeCHHHHHHHhcccccCCCCCcchHHHHHHHhCCceEE Q lcl|NC_019522. 159 TFVALVAAIPTNGTQPIIDFFGNAYNTVYLDNTLTVHRPNTFVLPPAQFQLLARTLLSTQNASNVTLLQFLRTNFPDITF 238 (311) Q Consensus 159 ~~~~~~t~w~~~t~~ei~~di~~~~~~~~~~~~~~~~~p~~l~lpp~~~~~L~~~~~~~~~~~~~Tvl~~l~~n~~~l~i 238 (311) +..+++|.+++..+... +. ..-..+|.+..+..+.+..-. .+..++. .+.-+| T Consensus 261 --------------~~~~d~i~~~~~~l~~~--y~--~na~~imn~~t~~~~~~~~~~----~~~~~~~-----~~~~~l 313 (387) T protein:vir:96 261 --------------ADMYDAIINALADLHED--YR--DNATIYMRYADYVKIISVLSN----GTTNFFD-----TPAEKV 313 (387) T ss_pred --------------cchHHHHHHHHhccChh--hh--cCCEEEEechHHHHHHHHHhc----CCCcccc-----cCCccc Confidence 11256677777766432 11 112466766665554333211 1111110 111112 Q ss_pred EEchhcccCCCCcccEEEEEEcCcceeEEeecchhhhccceeeCCceEEEeeeeeeeeEEEECCeEEEEeecC Q lcl|NC_019522. 239 EDDILLKGAGVAGADRMAVYKKEIRIVKGHDVMPLRFLAPATADNVNFKVPAILRTGGTEWRIPKAGHYVDGV 311 (311) Q Consensus 239 ~~~~~l~~ag~~g~~~~v~y~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~gGv~i~~P~ai~~~dGI 311 (311) ...|-....+. ..++ |-+=..... ....+.+..-.+.......+.+..|++ ..+++|.||+++.== T Consensus 314 lG~PV~~~~~~---~~~~-~GDf~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~r~D-g~v~~~~A~~~l~~k 379 (387) T protein:vir:96 314 FGKPVVFTDAA---VKPI-VGDFNYFGI--NYDGTTYDTDKDVKKGEYLFVLTAWYD-QQRTLDSAFRIAKAK 379 (387) T ss_pred cccceEEecCC---Ccee-eechhhhhh--hhhhhhheecccccCCceEEEEEEEeC-cEeechhheEEEEee Confidence 22221111110 0111 111001100 001111110011112235666678875 556789999986542 No 117 >protein:vir:2685 Length: 387 # NCBI annotation: hypothetical protein # Family: family:all:658 # MgeID: mge:57 # MgeName: phiSLT # Cross-refs: genbank:acc:NP_075504;genbank:gi:12719433;genbank:GeneID:920169 Probab=97.06 E-value=3.7e-05 Score=44.90 Aligned_cols=265 Identities=9% Similarity=-0.045 Sum_probs=134.5 Q ss_pred CCcccccccchhhhhhhHHHHHHHHHHHHhhhhhhhhhhhhccccCCCCcceeEEEEEEeecccceEEecCcccccceee Q lcl|NC_019522. 1 MAKSVFDVSPVSALSFLVNQAAHIESEIYRIEYPQFKYGTLLPLDNSAPDWAQAVMFRSIDARGELQLFGPNSTDVPTVD 80 (311) Q Consensus 1 ~~~~~~~~~~~~~~~fl~~~L~~id~~v~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~G~a~~~~~~a~dip~v~ 80 (311) -...++...+.+++++++. +.+..+|++.....-..+.++.+.+..+ .++ .++....+.+.|++.+ ...|..+ T Consensus 113 ~~~~a~~~~~~~~gG~lIP--~~~~~~Ii~~~~~~~~l~~~~~~~~~~~---~~~-p~~~~~~~~a~~v~Eg-~~~~~~~ 185 (387) T protein:vir:26 113 RLLHALPTGNDSGGDKLLP--KTLSKEIVSEPFAKNQLREKARLTNIKG---LEI-PRVSYTLDDDDFITDV-ETAKELK 185 (387) T ss_pred HHHhhhccCCCCCCceeec--hhHHHHHHHHHHhhchhhhhceeeecCC---cee-eeeeccCCcccccccc-ccccccc Confidence 1112233334445566665 4467778887777666677766544332 111 1223344567777654 4567778 Q ss_pred eeccceeEEEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhhheeeeeccccc--eeeeecCCcceeeccC Q lcl|NC_019522. 81 IAMSQGFKDINTAALGYTYSIEEIGFAMLNNVNLDAERGQAVRDVVEQGLNKIYLLGDKGVG--EGLYTSPNVSVEAATS 158 (311) Q Consensus 81 ~~~~~~~~~v~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aa~~~~~~~~n~~~~~G~~~~g--~GllN~p~v~~~~~~~ 158 (311) ..++......+.++.-+.+|.+ |. ..+..++..--.....+++...+++.+|.+..+.| .|.++.++++..+. T Consensus 186 ~~f~~v~l~~~k~~~~i~iS~e-ll--~ds~~~l~~~i~~~la~~~~~~e~~~~~~~g~g~g~~~g~~~~~~~~~~~~-- 260 (387) T protein:vir:26 186 AKGDTVKFTTNKFKVFAAISDT-VI--HGSDVDLVNWVENALQSGLAAKERKDALAVSPKSGLEHMSFYNGSVKEVEG-- 260 (387) T ss_pred cccceeeechheeeeechhhHH-HH--hhhHHHHHHHHHHHHHHHHHHHHHHhHhhcCCCccccceeeeccccccccc-- Confidence 8889999999999998888844 32 23456777777777778888887777665444434 48887766543221 Q ss_pred CccccCcccccCCHHHHHHHHHHHHHHHHhccCCceecceEEEeCHHHHHHHhcccccCCCCCcchHHHHHHHhCCceEE Q lcl|NC_019522. 159 TFVALVAAIPTNGTQPIIDFFGNAYNTVYLDNTLTVHRPNTFVLPPAQFQLLARTLLSTQNASNVTLLQFLRTNFPDITF 238 (311) Q Consensus 159 ~~~~~~t~w~~~t~~ei~~di~~~~~~~~~~~~~~~~~p~~l~lpp~~~~~L~~~~~~~~~~~~~Tvl~~l~~n~~~l~i 238 (311) +..+++|.+++..+... +. ..-..+|.+..+..+.+..-. .+..++. .+.-+| T Consensus 261 --------------~~~~d~i~~~~~~l~~~--y~--~na~~imn~~t~~~~~~~~~~----~~~~~~~-----~~~~~l 313 (387) T protein:vir:26 261 --------------ADMYDAIINALADLHED--YR--DNATIYMRYADYVKIISVLSN----GTTNFFD-----TPAEKV 313 (387) T ss_pred --------------cchHHHHHHHHhccChh--hh--cCCEEEEechHHHHHHHHHhc----CCCcccc-----cCCccc Confidence 11256677777766432 11 112466766665554333211 1111110 111112 Q ss_pred EEchhcccCCCCcccEEEEEEcCcceeEEeecchhhhccceeeCCceEEEeeeeeeeeEEEECCeEEEEeecC Q lcl|NC_019522. 239 EDDILLKGAGVAGADRMAVYKKEIRIVKGHDVMPLRFLAPATADNVNFKVPAILRTGGTEWRIPKAGHYVDGV 311 (311) Q Consensus 239 ~~~~~l~~ag~~g~~~~v~y~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~gGv~i~~P~ai~~~dGI 311 (311) ...|-....+. ..++ |-+=..... ....+.+..-.+.......+.+..|++ ..+++|.||+++.== T Consensus 314 lG~PV~~~~~~---~~~~-~GDf~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~r~D-g~v~~~~A~~~l~~k 379 (387) T protein:vir:26 314 FGKPVVFTDAA---VKPI-VGDFNYFGI--NYDGTTYDTDKDVKKGEYLFVLTAWYD-QQRTLDSAFRIAKAK 379 (387) T ss_pred cccceEEecCC---Ccee-eechhhhhh--hhhhhhheecccccCCceEEEEEEEeC-cEeechhheEEEEee Confidence 22221111110 0111 111001100 001111110011112235666678875 556789999986542 No 118 >protein:vir:94424 Length: 387 # NCBI annotation: ORF010 # Family: family:all:658 # MgeID: mge:1506 # MgeName: 47 # Cross-refs: genbank:acc:YP_240005;genbank:gi:66395666;genbank:GeneID:5133084 Probab=97.06 E-value=3.7e-05 Score=44.90 Aligned_cols=265 Identities=9% Similarity=-0.045 Sum_probs=134.5 Q ss_pred CCcccccccchhhhhhhHHHHHHHHHHHHhhhhhhhhhhhhccccCCCCcceeEEEEEEeecccceEEecCcccccceee Q lcl|NC_019522. 1 MAKSVFDVSPVSALSFLVNQAAHIESEIYRIEYPQFKYGTLLPLDNSAPDWAQAVMFRSIDARGELQLFGPNSTDVPTVD 80 (311) Q Consensus 1 ~~~~~~~~~~~~~~~fl~~~L~~id~~v~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~G~a~~~~~~a~dip~v~ 80 (311) -...++...+.+++++++. +.+..+|++.....-..+.++.+.+..+ .++ .++....+.+.|++.+ ...|..+ T Consensus 113 ~~~~a~~~~~~~~gG~lIP--~~~~~~Ii~~~~~~~~l~~~~~~~~~~~---~~~-p~~~~~~~~a~~v~Eg-~~~~~~~ 185 (387) T protein:vir:94 113 RLLHALPTGNDSGGDKLLP--KTLSKEIVSEPFAKNQLREKARLTNIKG---LEI-PRVSYTLDDDDFITDV-ETAKELK 185 (387) T ss_pred HHHhhhccCCCCCCceeec--hhHHHHHHHHHHhhchhhhhceeeecCC---cee-eeeeccCCcccccccc-ccccccc Confidence 1112233334445566665 4467778887777666677766544332 111 1223344567777654 4567778 Q ss_pred eeccceeEEEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhhheeeeeccccc--eeeeecCCcceeeccC Q lcl|NC_019522. 81 IAMSQGFKDINTAALGYTYSIEEIGFAMLNNVNLDAERGQAVRDVVEQGLNKIYLLGDKGVG--EGLYTSPNVSVEAATS 158 (311) Q Consensus 81 ~~~~~~~~~v~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aa~~~~~~~~n~~~~~G~~~~g--~GllN~p~v~~~~~~~ 158 (311) ..++......+.++.-+.+|.+ |. ..+..++..--.....+++...+++.+|.+..+.| .|.++.++++..+. T Consensus 186 ~~f~~v~l~~~k~~~~i~iS~e-ll--~ds~~~l~~~i~~~la~~~~~~e~~~~~~~g~g~g~~~g~~~~~~~~~~~~-- 260 (387) T protein:vir:94 186 AKGDTVKFTTNKFKVFAAISDT-VI--HGSDVDLVNWVENALQSGLAAKERKDALAVSPKSGLEHMSFYNGSVKEVEG-- 260 (387) T ss_pred cccceeeechheeeeechhhHH-HH--hhhHHHHHHHHHHHHHHHHHHHHHHhHhhcCCCccccceeeeccccccccc-- Confidence 8889999999999998888844 32 23456777777777778888887777665444434 48887766543221 Q ss_pred CccccCcccccCCHHHHHHHHHHHHHHHHhccCCceecceEEEeCHHHHHHHhcccccCCCCCcchHHHHHHHhCCceEE Q lcl|NC_019522. 159 TFVALVAAIPTNGTQPIIDFFGNAYNTVYLDNTLTVHRPNTFVLPPAQFQLLARTLLSTQNASNVTLLQFLRTNFPDITF 238 (311) Q Consensus 159 ~~~~~~t~w~~~t~~ei~~di~~~~~~~~~~~~~~~~~p~~l~lpp~~~~~L~~~~~~~~~~~~~Tvl~~l~~n~~~l~i 238 (311) +..+++|.+++..+... +. ..-..+|.+..+..+.+..-. .+..++. .+.-+| T Consensus 261 --------------~~~~d~i~~~~~~l~~~--y~--~na~~imn~~t~~~~~~~~~~----~~~~~~~-----~~~~~l 313 (387) T protein:vir:94 261 --------------ADMYDAIINALADLHED--YR--DNATIYMRYADYVKIISVLSN----GTTNFFD-----TPAEKV 313 (387) T ss_pred --------------cchHHHHHHHHhccChh--hh--cCCEEEEechHHHHHHHHHhc----CCCcccc-----cCCccc Confidence 11256677777766432 11 112466766665554333211 1111110 111112 Q ss_pred EEchhcccCCCCcccEEEEEEcCcceeEEeecchhhhccceeeCCceEEEeeeeeeeeEEEECCeEEEEeecC Q lcl|NC_019522. 239 EDDILLKGAGVAGADRMAVYKKEIRIVKGHDVMPLRFLAPATADNVNFKVPAILRTGGTEWRIPKAGHYVDGV 311 (311) Q Consensus 239 ~~~~~l~~ag~~g~~~~v~y~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~gGv~i~~P~ai~~~dGI 311 (311) ...|-....+. ..++ |-+=..... ....+.+..-.+.......+.+..|++ ..+++|.||+++.== T Consensus 314 lG~PV~~~~~~---~~~~-~GDf~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~r~D-g~v~~~~A~~~l~~k 379 (387) T protein:vir:94 314 FGKPVVFTDAA---VKPI-VGDFNYFGI--NYDGTTYDTDKDVKKGEYLFVLTAWYD-QQRTLDSAFRIAKAK 379 (387) T ss_pred cccceEEecCC---Ccee-eechhhhhh--hhhhhhheecccccCCceEEEEEEEeC-cEeechhheEEEEee Confidence 22221111110 0111 111001100 001111110011112235666678875 556789999986542 No 119 >protein:vir:78350 Length: 383 # NCBI annotation: Cps # Family: family:all:635 # MgeID: mge:1850 # MgeName: B025 # Cross-refs: genbank:acc:YP_001468644;genbank:gi:157325222;genbank:GeneID:5601696 Probab=97.03 E-value=0.00012 Score=42.15 Aligned_cols=283 Identities=13% Similarity=0.024 Sum_probs=133.4 Q ss_pred CCc--------------ccccccchhhhhhhHHHHHHHHHHHHhhhhhhhhhhhhccccCCCCcceeEEEEEEeecccce Q lcl|NC_019522. 1 MAK--------------SVFDVSPVSALSFLVNQAAHIESEIYRIEYPQFKYGTLLPLDNSAPDWAQAVMFRSIDARGEL 66 (311) Q Consensus 1 ~~~--------------~~~~~~~~~~~~fl~~~L~~id~~v~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~G~a 66 (311) +.. ..|...+.+++++++. +.+..+|++.....-..+.++.+.+- +.. ..+...+..+.+ T Consensus 64 ~~~~g~~~lt~~e~~~~~~~~~~~~~~gg~lvP--~~~~~~I~~~l~~~s~l~~~~~v~~~---~~~-~~i~~~~~~~~a 137 (383) T protein:vir:78 64 SASRTDKNITNEEIKFFNDINKEVGYKEETLLP--QTVVDEIFEDLTTEHPFLASIGMRTT---GLR-TKFLKSETSGVA 137 (383) T ss_pred HhcCChhhhhHHHHHHHHHHhccCCCCCccccC--HHHHHHHHHHHHhhccceeeeeeEec---CCc-eEEEEEcCCcce Confidence 000 1233445556677775 44566677665554444555444321 222 245566677788 Q ss_pred EEecCcccccc-eeeeeccceeEEEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhhheeeeeccccc-ee Q lcl|NC_019522. 67 QLFGPNSTDVP-TVDIAMSQGFKDINTAALGYTYSIEEIGFAMLNNVNLDAERGQAVRDVVEQGLNKIYLLGDKGVG-EG 144 (311) Q Consensus 67 ~~~~~~a~dip-~v~~~~~~~~~~v~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aa~~~~~~~~n~~~~~G~~~~g-~G 144 (311) .|.+..+ .++ ..+..++......+.++.-...+.+ |. .-..++|+.--.....+++...+++-+++|+.... .| T Consensus 138 ~w~~e~~-~~~~~~~~~f~~i~l~~~kl~~~i~is~e-ll--~Ds~~~ie~~i~~~l~~~~a~~~~~a~i~G~G~~qP~G 213 (383) T protein:vir:78 138 VWGKIFG-EIKGQLDATFSDEESIQNKLTAFVVVPKD-LE--KFGPAWVKRFVVTQIEEAFAVALESAYIVGDGNDKPIG 213 (383) T ss_pred EEeeccc-ccccccCcceeeEeecceeeEeeccchHH-Hh--hccHHHHHHHHHHHHHHHHHHHHhhheEeccCCCCcee Confidence 8866543 333 3466678888888998877777743 33 33456888999999999999999999999997555 69 Q ss_pred eeecCCcceeeccCCccccCcccccCCHHHHHHHHHHHHHHHHhccCCc-------eecceEEEeCHHHHHHHhcccccC Q lcl|NC_019522. 145 LYTSPNVSVEAATSTFVALVAAIPTNGTQPIIDFFGNAYNTVYLDNTLT-------VHRPNTFVLPPAQFQLLARTLLST 217 (311) Q Consensus 145 llN~p~v~~~~~~~~~~~~~t~w~~~t~~ei~~di~~~~~~~~~~~~~~-------~~~p~~l~lpp~~~~~L~~~~~~~ 217 (311) +|++.+.......+. ....+.-...+.+.+...+ ..+..+.+...+. .-...+.++.|..+..+...+... T Consensus 214 il~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~-~~l~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~~~~~~~~~ 291 (383) T protein:vir:78 214 LNRKVGKGSTVVDGV-YAEKAATGTLTFANPKTTV-NELTDVYKYHSVKENGHPLNVAGKVTLLVNPTDAWDVKKQYTSL 291 (383) T ss_pred eeeccCCcccccccc-cccccccchhhhhhhHHHH-HHHHHHHhccchhcccchhhhcCceEEEEcCcchhhhccchhcc Confidence 998755322111110 0000011111233322221 1222221111110 011123455554433222111100 Q ss_pred -CCCCcchHHHHHHHhCCceEEEEchhcccCCCCcccEEEEEEcCcceeEEeecchhhhccceee--CCceEEEeeeeee Q lcl|NC_019522. 218 -QNASNVTLLQFLRTNFPDITFEDDILLKGAGVAGADRMAVYKKEIRIVKGHDVMPLRFLAPATA--DNVNFKVPAILRT 294 (311) Q Consensus 218 -~~~~~~Tvl~~l~~n~~~l~i~~~~~l~~ag~~g~~~~v~y~~~~~~~~~~~~~~~~~~~p~~~--~~~~~~~~~~~~~ 294 (311) .+....+++ ..++.|+....... +. +++-+-.+++ +..-..+++..--+. ..-...+....|. T Consensus 292 ~~~G~~~t~l------~~~~~iv~s~~~p~------~~-iifgdfs~Y~-i~~r~~~~i~~~~~~~f~~d~~~f~~~~r~ 357 (383) T protein:vir:78 292 NANGVYVTAL------PFNLNIIESLFVPE------KK-AISYVAERYD-ALIGGPLDIGTYDQTLAIEDLNLYAAKQFA 357 (383) T ss_pred CCCCceeeec------CCCceEEecCCCCc------cc-EEEeeccceE-EEecccceEEecchhhhhcCceEEEEEEEE Confidence 010011111 12344444333321 11 2222211221 111222221100000 0112334556777 Q ss_pred eeEEEECCeEEEEeecC Q lcl|NC_019522. 295 GGTEWRIPKAGHYVDGV 311 (311) Q Consensus 295 gGv~i~~P~ai~~~dGI 311 (311) + ..++.|.|+++++ | T Consensus 358 d-G~~~~~~A~~vl~-~ 372 (383) T protein:vir:78 358 Y-GKAKDDKAAAVWT-L 372 (383) T ss_pred c-CEEecCCeEEEEE-E Confidence 5 4788999988877 6 No 120 >protein:vir:3870 Length: 400 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:82 # MgeName: A2 # Cross-refs: genbank:acc:NP_680487;swissprot:trembl:q8ltc0;genbank:gi:22296527;interpro:IPR006444;uniprot:Q8LTC0;genbank:GeneID:951713 Probab=96.67 E-value=0.00023 Score=40.51 Aligned_cols=263 Identities=12% Similarity=-0.006 Sum_probs=133.9 Q ss_pred CC-----ccccccc-chhhhhhhHHHHHHHHHHHHhhhhhhhhhhhhccccCCCCcceeEEEEEEee-cccceEEecCcc Q lcl|NC_019522. 1 MA-----KSVFDVS-PVSALSFLVNQAAHIESEIYRIEYPQFKYGTLLPLDNSAPDWAQAVMFRSID-ARGELQLFGPNS 73 (311) Q Consensus 1 ~~-----~~~~~~~-~~~~~~fl~~~L~~id~~v~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~-~~G~a~~~~~~a 73 (311) .. ...+.+. +.+.+++++. +.+.+.|++.....-..+.++++. +-...+..|.+.. ..+.+.+++.++ T Consensus 123 ~~~~~~~~~~~~~~~~~~~gg~~vP--~~~~~~ii~~~~~~~~l~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~E~~ 197 (400) T protein:vir:38 123 RAVPTDASDAVNAGVKAADAASTIP--ETISNTPQRELQTVVDLKPFTNVF---QASTQKGTYPTVANATTKMVTVAELE 197 (400) T ss_pred hhhhHHHHHHHhhcccccCCccccc--HHHHHHHHHHHHhhhhhhhcceeE---eccCcceEEEEEecCCCccccccccc Confidence 00 0111111 2233455665 456777888777666666665543 2223345566554 457778887765 Q ss_pred cccceeeeeccceeEEEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhhheeeeeccccceeeeecCCcce Q lcl|NC_019522. 74 TDVPTVDIAMSQGFKDINTAALGYTYSIEEIGFAMLNNVNLDAERGQAVRDVVEQGLNKIYLLGDKGVGEGLYTSPNVSV 153 (311) Q Consensus 74 ~dip~v~~~~~~~~~~v~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aa~~~~~~~~n~~~~~G~~~~g~GllN~p~v~~ 153 (311) ..-...+..++......+.++.-+.+|.+ |. ..+..++.+--.....+++...+|+-+++|..... T Consensus 198 ~~~~~~~~~f~~i~~~~~k~~~~~~is~e-ll--~ds~~~~~~~i~~~l~~~~~~~~~~~i~~~~~~~~----------- 263 (400) T protein:vir:38 198 KNPAMAKPEFKPVNWSVETYRQALPVSQE-SI--DDSAIDLVGLIAQNGQQIKVNTTNGAVATLLKGFT----------- 263 (400) T ss_pred cccccccccceeeEeehhheeeehhhHHH-HH--hhhHHHHHHHHHHHHHHHHHHHHHHhhhhcccccc----------- Confidence 43223456778888888899988888853 33 24455788878888888888999988888765310 Q ss_pred eeccCCccccCcccccCCHHHHHHHHHHHHHHHHhccCCceecceEEEeCHHHHHHHhcccccCCCCCcchHHHH-HHHh Q lcl|NC_019522. 154 EAATSTFVALVAAIPTNGTQPIIDFFGNAYNTVYLDNTLTVHRPNTFVLPPAQFQLLARTLLSTQNASNVTLLQF-LRTN 232 (311) Q Consensus 154 ~~~~~~~~~~~t~w~~~t~~ei~~di~~~~~~~~~~~~~~~~~p~~l~lpp~~~~~L~~~~~~~~~~~~~Tvl~~-l~~n 232 (311) + . ...+. +||.+++....... . .-.++|+|+.+..|.+-. +..|.-++.- +... T Consensus 264 ---~---~------~~~~~----~~~~~~~~~~~~~~-~----~a~~v~~~~~~~~l~~lk----d~~G~~i~~~~~~~~ 318 (400) T protein:vir:38 264 ---A---K------TISSV----DDLKHINNVDLDPA-Y----SRVIIASQSFYNFLDTVK----DGNGRYLLQDSILTP 318 (400) T ss_pred ---c---c------ccccH----HHHHHHHHhhhhhh-h----CcEEEEcHHHHHHHHHhh----ccCCCeeeecCcCCC Confidence 0 0 01123 34444444322111 1 247899999999986421 1112222110 0000 Q ss_pred CC----ceEEEEchhcccCCCCcccEEEEEEcCcceeEEeecchhhhccceeeCCceEEEeeeeeeeeEEEECCeEEEEe Q lcl|NC_019522. 233 FP----DITFEDDILLKGAGVAGADRMAVYKKEIRIVKGHDVMPLRFLAPATADNVNFKVPAILRTGGTEWRIPKAGHYV 308 (311) Q Consensus 233 ~~----~l~i~~~~~l~~ag~~g~~~~v~y~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~gGv~i~~P~ai~~~ 308 (311) .+ ...++.+.... .+..| +..++|-+-.+.+.+.....+++..... ......+.++.|++ +.+.+|.+|+.+ T Consensus 319 ~~~~l~G~pv~~~~~~~-~~~~g-~~~~~~gd~s~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~r~d-~~~~~~~a~~~l 394 (400) T protein:vir:38 319 SGKSVLGMPIAVVSDDT-LGAAG-EAHAFLGDIKRAILFANRADFMVRWVDD-QIYGQFLQAGMRFG-VSVADEKAGYFL 394 (400) T ss_pred CccccccceeEEecccc-cCCCC-ceEEEEEeccccEEEEeecceEEEEecc-cccceeEEEEEEec-cEEecccceEEE Confidence 00 11122222111 12233 3334443333322222222222211111 11123456678874 556679999998 Q ss_pred ecC Q lcl|NC_019522. 309 DGV 311 (311) Q Consensus 309 dGI 311 (311) ..= T Consensus 395 ~~~ 397 (400) T protein:vir:38 395 TYT 397 (400) T ss_pred Eee Confidence 877 No 121 >protein:vir:1383 Length: 421 # NCBI annotation: major capsid protein # Family: family:all:21 # MgeID: mge:314 # MgeName: phi3626 # Cross-refs: genbank:acc:NP_612835;genbank:gi:20065969;genbank:GeneID:935826 Probab=96.67 E-value=0.00042 Score=39.07 Aligned_cols=265 Identities=9% Similarity=-0.032 Sum_probs=127.0 Q ss_pred CCcccccccchhhhhhhHHHHHHHHHHHHhhhhhhhhhhhhccccCCCCcceeEEEEEEeecccc--eEEecCcccccce Q lcl|NC_019522. 1 MAKSVFDVSPVSALSFLVNQAAHIESEIYRIEYPQFKYGTLLPLDNSAPDWAQAVMFRSIDARGE--LQLFGPNSTDVPT 78 (311) Q Consensus 1 ~~~~~~~~~~~~~~~fl~~~L~~id~~v~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~G~--a~~~~~~a~dip~ 78 (311) +.....+..+.+.+++++. +.+.+.|++........+.++.+.. ....+..|.+...... +.+.+.+ ..+|. T Consensus 109 ~~~~~ra~~t~~~gg~liP--~~~~~~Ii~~~~~~~~l~~l~~~~~---~~~~~~~~~~~~~~~~~~~~~~~E~-~~~~~ 182 (421) T protein:vir:13 109 LSEEERDIMSSTNNGAVIP--QEFVNEFEKLKEGYPSLKEHCHVIP---VNRNAGKMPVRAGASVDKLANLAKD-TELVK 182 (421) T ss_pred hhHHHhhccccCCcceecc--hhhHHHHHHHHHhhhhhhhhceeee---ccCCceEEEEeecCCccceeecccc-ccccc Confidence 2111222222233455665 4566777777777666666665432 2222334444333322 3344443 45788 Q ss_pred eeeeccceeEEEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhhheeeeeccccceeeeecCCcceeeccC Q lcl|NC_019522. 79 VDIAMSQGFKDINTAALGYTYSIEEIGFAMLNNVNLDAERGQAVRDVVEQGLNKIYLLGDKGVGEGLYTSPNVSVEAATS 158 (311) Q Consensus 79 v~~~~~~~~~~v~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aa~~~~~~~~n~~~~~G~~~~g~GllN~p~v~~~~~~~ 158 (311) .+..++.....++.++.-+.+|.+ +.. .+..+|..--....++++...+|.-++..- .|+++.+++ T Consensus 183 s~~~f~~i~~~~~k~~~~v~iS~e-ll~--ds~~~l~~~i~~~la~~~~~~~~~~i~~~~----~g~~~~~~~------- 248 (421) T protein:vir:13 183 AMLKTQPMAYDIDDYGLLAPIDNS-LLE--DSEINFLEFVNEEFAEFAVNTENAEIVKQA----KAVLAEETI------- 248 (421) T ss_pred cccceeEEEeeeeeeEeehhhhHH-HHh--hhHHHHHHHHHHHHHHHHHHHhhhhHhhhh----hhccccccc------- Confidence 788888899999999988888844 332 234456666666666777766664332100 233322111 Q ss_pred CccccCcccccCCHHHHHHHHHHHHHHHHhccCCceecceEEEeCHHHHHHHhcccccCCCCCcchHHHHHHHhCC---- Q lcl|NC_019522. 159 TFVALVAAIPTNGTQPIIDFFGNAYNTVYLDNTLTVHRPNTFVLPPAQFQLLARTLLSTQNASNVTLLQFLRTNFP---- 234 (311) Q Consensus 159 ~~~~~~t~w~~~t~~ei~~di~~~~~~~~~~~~~~~~~p~~l~lpp~~~~~L~~~~~~~~~~~~~Tvl~~l~~n~~---- 234 (311) .+ ++||.++++++... + ..+..++|+|+.+..|.+-. +..|.=++.-+....+ T Consensus 249 -----------~~----~d~i~~~~~~l~~~--~--~~~a~~v~n~~~~~~l~~lk----d~~G~~i~~~~~~~~~~tl~ 305 (421) T protein:vir:13 249 -----------ND----YAGLVKTINSLVPN--A--RKRAIIVTNSDGRAYLDGLM----DKQGRPLLKELSDGGDLVFK 305 (421) T ss_pred -----------cc----hHHHHHHHHHhhhh--h--cCCCEEEEcHHHHHHHHHhh----cCCCceeecCcCCCCCceec Confidence 12 45677777776432 2 23458999999999986532 2222222211111111 Q ss_pred ceEEEEchhcccCCCCcccEEEEEEcCcceeEEeecchhhhccceee--CCceEEEeeeeeeeeEEEECCeEE------- Q lcl|NC_019522. 235 DITFEDDILLKGAGVAGADRMAVYKKEIRIVKGHDVMPLRFLAPATA--DNVNFKVPAILRTGGTEWRIPKAG------- 305 (311) Q Consensus 235 ~l~i~~~~~l~~ag~~g~~~~v~y~~~~~~~~~~~~~~~~~~~p~~~--~~~~~~~~~~~~~gGv~i~~P~ai------- 305 (311) ...++.++.... +.++ +..++|-+-.+.+.+.....+++..-.+. ..-...+.+..|++|. +..|.++ T Consensus 306 G~pV~~~~~~~~-~~~~-~~~~~~gd~~~~~~~~~~~~~~v~~~~~~~f~~~~~~~r~~~r~d~~-~~~~~a~~~~~~~~ 382 (421) T protein:vir:13 306 GRPVIELEESIF-DVGD-ETKFIVSDFKTLIKFMDRKQYLIDQSKEAGYTKNETIARIIERFDVN-SPLDKSSDAEKIRK 382 (421) T ss_pred ceeeEEeccccc-cCCC-ceEEEEEeccccEEEEEecceEEEeecccccccCeeEEEEEeeecce-eecchhhheeeecc Confidence 122333332221 2222 33344444333333323333332211111 1112455566776443 4445544 Q ss_pred ----EEeecC Q lcl|NC_019522. 306 ----HYVDGV 311 (311) Q Consensus 306 ----~~~dGI 311 (311) +..++. T Consensus 383 ~~a~v~~~~~ 392 (421) T protein:vir:13 383 FGVIVKLQEV 392 (421) T ss_pred cceeeccccc Confidence 444444 No 122 >protein:vir:100632 Length: 381 # NCBI annotation: 77ORF006 # Family: family:all:635 # MgeID: mge:1476 # MgeName: 77 # Cross-refs: genbank:acc:NP_958606;genbank:gi:41189521;genbank:GeneID:2743778 Probab=96.51 E-value=0.00056 Score=38.42 Aligned_cols=284 Identities=11% Similarity=0.028 Sum_probs=138.9 Q ss_pred CCc------ccccccchhhhhhhHHHHHHHHHHHHhhhhhhhhhhhhccccCCCCcceeEEEEEEeecccceEEecCccc Q lcl|NC_019522. 1 MAK------SVFDVSPVSALSFLVNQAAHIESEIYRIEYPQFKYGTLLPLDNSAPDWAQAVMFRSIDARGELQLFGPNST 74 (311) Q Consensus 1 ~~~------~~~~~~~~~~~~fl~~~L~~id~~v~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~G~a~~~~~~a~ 74 (311) +.. ..+...+.+++++++. +.+..+|++.....-..+.++.+.+-. .. ..+...+..|.+.|.+..+ T Consensus 65 l~~~e~~~~~~~~~~t~~~Gg~lvP--~~~~~~I~~~l~~~spir~~a~v~~~~---~~-~~i~~~~~~~~a~W~~e~~- 137 (381) T protein:vir:10 65 LSANQRNFFMDINKSVGYKEEKLLP--EETIDRIFEDLTTNHPLLADLGIKNAG---LR-LKFLKSETSGVAVWGKIYG- 137 (381) T ss_pred cCHHHHHHHHHHhhcCCCCCceecC--HHHHHHHHHHHHhhcceeeeeeeEecC---cc-eEEEeecCCcceEEeeccc- Confidence 000 1233444556667775 456677777665544455555443321 11 2344556677888865432 Q ss_pred ccc-eeeeeccceeEEEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhhheeeeeccccc-eeeeecCCcc Q lcl|NC_019522. 75 DVP-TVDIAMSQGFKDINTAALGYTYSIEEIGFAMLNNVNLDAERGQAVRDVVEQGLNKIYLLGDKGVG-EGLYTSPNVS 152 (311) Q Consensus 75 dip-~v~~~~~~~~~~v~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aa~~~~~~~~n~~~~~G~~~~g-~GllN~p~v~ 152 (311) .++ ..+..+++...+.+.++.-...+.+=| .-+.++|+.--....++++...+++-+++|+.... .|+|++.+-. T Consensus 138 ~~~~~~~~~f~~i~l~~~kl~a~i~is~elL---~Ds~~~le~~i~~~la~~~a~~~~~afi~GdG~~qP~Gil~~~~~~ 214 (381) T protein:vir:10 138 EIKGQLDAAFSEETAIQNKLTAFVVLPKDLN---DFGPAWIERFVRVQIEEAFAVALETAFLKGTGKDQPIGLNRQVQKG 214 (381) T ss_pred ccccccCccceeEeecceeEEeeccccHHHH---hccHHHHHHHHHHHHHHHHHHHhhceeEecccCCCceeeeecCCcc Confidence 333 446677888888999988888874433 23456788999999999999999999999997655 6999875432 Q ss_pred eeeccCCccccCcc---cccCCHHHHHHHHHHHHHHHHhccCCc---eecceEEEeCHHHHHHHhcccccCCCCCcchHH Q lcl|NC_019522. 153 VEAATSTFVALVAA---IPTNGTQPIIDFFGNAYNTVYLDNTLT---VHRPNTFVLPPAQFQLLARTLLSTQNASNVTLL 226 (311) Q Consensus 153 ~~~~~~~~~~~~t~---w~~~t~~ei~~di~~~~~~~~~~~~~~---~~~p~~l~lpp~~~~~L~~~~~~~~~~~~~Tvl 226 (311) .....+... ..+. ....+....++.+..++..+.....+. ....-+++|.+..+..|....... +..|. T Consensus 215 ~~~~~g~~~-~~~~~~~~t~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~vmn~~t~~~l~~~~~~~-~~~G~--- 289 (381) T protein:vir:10 215 VSVTDGAYP-EKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQAQYTHL-NANGV--- 289 (381) T ss_pred ccccccccc-cccccccccccchhhHHHHHHHHHHhhhhhhccccccccCceEEEEchhhHHhhccccccC-CCCCc--- Confidence 211111100 0011 112223333333333333332111110 011235678888776664322111 11111 Q ss_pred HHHHHhCCceEEEEchhcccCCCCcccEEEEEEcCcceeEEeecchhhhcc----ceeeCCceEEEeeeeeeeeEEEECC Q lcl|NC_019522. 227 QFLRTNFPDITFEDDILLKGAGVAGADRMAVYKKEIRIVKGHDVMPLRFLA----PATADNVNFKVPAILRTGGTEWRIP 302 (311) Q Consensus 227 ~~l~~n~~~l~i~~~~~l~~ag~~g~~~~v~y~~~~~~~~~~~~~~~~~~~----p~~~~~~~~~~~~~~~~gGv~i~~P 302 (311) |+-.-..++.|+..+.... | .++..+.+ .++- ...+.+++.. ..... ...+....|.+ -.++.| T Consensus 290 -~v~~lp~g~~vv~~~~~p~----~--~i~fGDfs-~Y~i-~~r~~~~i~~~~~~~~~~d--~~~f~a~~r~d-G~~~~~ 357 (381) T protein:vir:10 290 -YVTALPFNLNVIESTVQEA----G--KVLTYVKG-LYDG-YLAGGINVQKFKETLALDD--MDLYTAKQFAY-GKAKDN 357 (381) T ss_pred -eeecCCCCceeEEcCCCCc----C--cEEEEEcc-cEEE-EEecccEEEeechhhhhcC--ceEEEEEEEEc-CEEecC Confidence 1110011344444443321 1 12222222 2221 1222222110 01111 13345567764 457788 Q ss_pred eEEEEee-cC Q lcl|NC_019522. 303 KAGHYVD-GV 311 (311) Q Consensus 303 ~ai~~~d-GI 311 (311) .|+++++ -| T Consensus 358 ~A~~v~~l~~ 367 (381) T protein:vir:10 358 KVAAVWKLDL 367 (381) T ss_pred CcEEEEEEee Confidence 8877743 12 No 123 >protein:vir:96262 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1612 # MgeName: ROSA # Cross-refs: genbank:acc:YP_240311;genbank:gi:66395978;genbank:GeneID:5133339 Probab=96.42 E-value=0.00064 Score=38.11 Aligned_cols=257 Identities=9% Similarity=0.024 Sum_probs=127.7 Q ss_pred CCcccccccchhhhhhhHHHHHHHHHHHHhhhhhhhhhhhhccccCC--CCcceeEEEEEEeecccceEEecCcccccce Q lcl|NC_019522. 1 MAKSVFDVSPVSALSFLVNQAAHIESEIYRIEYPQFKYGTLLPLDNS--APDWAQAVMFRSIDARGELQLFGPNSTDVPT 78 (311) Q Consensus 1 ~~~~~~~~~~~~~~~fl~~~L~~id~~v~~~~~~~~~~~~~~~v~~~--~~~~~~~~~~~~~~~~G~a~~~~~~a~dip~ 78 (311) ||... + .-+. ++.. |-+.+.|.+.....+....+..+... +.+| .++.++.++..|.++.+..+ ++|+. T Consensus 1 m~~~~--T-~l~d--~i~P--ev~~~~v~~~~~~~l~~~~~~~~~~~l~g~~G-~tv~iP~~~~ig~a~~~~~g-~~i~~ 71 (274) T protein:vir:96 1 MAQGM--T-KLTN--QIVP--EVLAPMMQAELEKKLRFASFAEIDNTLVGQPG-DTLTFPAFIYSGDAKVVAEG-EKIPT 71 (274) T ss_pred CCcce--e-ehhh--eech--HHHHHHHHHHHHhhhhccccceecccccCCCC-CEEEeeeecCCCccccccCC-Cccch Confidence 44421 0 0011 1122 11222233333344455555444433 2234 67788888888999988765 67888 Q ss_pred eeeeccceeEEEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhhheeeeeccccceeeeecCCcceeeccC Q lcl|NC_019522. 79 VDIAMSQGFKDINTAALGYTYSIEEIGFAMLNNVNLDAERGQAVRDVVEQGLNKIYLLGDKGVGEGLYTSPNVSVEAATS 158 (311) Q Consensus 79 v~~~~~~~~~~v~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aa~~~~~~~~n~~~~~G~~~~g~GllN~p~v~~~~~~~ 158 (311) -.+..+.....+...+.+|.+ .|+++.+. +-++-......+..++++..|+.++ +.+..+-+. ... T Consensus 72 ~~lt~~~~~~~i~~~~~a~~i--~D~~~~~~-~~d~~~~~~~~~~~~~a~~vd~~i~---~~l~~a~~~--------~~~ 137 (274) T protein:vir:96 72 DILETKKREAKIRKIAKGTSI--SDEALLSG-YGDPQGEQVRQHGLAHANKVDDDVL---EALKSAKLT--------VEA 137 (274) T ss_pred hhcccceeEEEeeeeecceee--hHHHHhhc-cchHHHHHHHHHHHHHHHHHHHHHH---HHHhccccc--------ccc Confidence 888888888888876666555 46655444 4445566677777888888777554 111111100 000 Q ss_pred CccccCcccccCCHHHHHHHHHHHHHHHHhccCCceecceEEEeCHHHHHHHhccc----ccCCCCCcchHHHHHHHhC- Q lcl|NC_019522. 159 TFVALVAAIPTNGTQPIIDFFGNAYNTVYLDNTLTVHRPNTFVLPPAQFQLLARTL----LSTQNASNVTLLQFLRTNF- 233 (311) Q Consensus 159 ~~~~~~t~w~~~t~~ei~~di~~~~~~~~~~~~~~~~~p~~l~lpp~~~~~L~~~~----~~~~~~~~~Tvl~~l~~n~- 233 (311) ...+ ++.|+++...+-.. ...+..|+++|..+..|.+-. +..++ .+.. +..|+ T Consensus 138 ---------~~~~----~d~i~~A~~~lgd~----~~~~~~ivv~p~~~~~L~k~~~~~f~~~s~-~g~~----~~~~G~ 195 (274) T protein:vir:96 138 ---------DITK----LTGLQTAIDKFNDE----DLEPMVLFISPLDAGKLRGDATTNFTRATE-LGDD----VIVKGA 195 (274) T ss_pred ---------cccC----HHHHHHHHHHhccc----cccccEEEeCHHHHHHHHhhcccccccccc-cccc----ceeccc Confidence 0012 45566666666321 135679999999999997631 11111 1111 11111 Q ss_pred ----CceEEEEchhcccCCCCcccEEEEEEcCcceeEEeecchhhhccceeeCCceEEEeeeeeeeeEEEECCeEEEEee Q lcl|NC_019522. 234 ----PDITFEDDILLKGAGVAGADRMAVYKKEIRIVKGHDVMPLRFLAPATADNVNFKVPAILRTGGTEWRIPKAGHYVD 309 (311) Q Consensus 234 ----~~l~i~~~~~l~~ag~~g~~~~v~y~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~gGv~i~~P~ai~~~d 309 (311) -.++|+....+. ....+++. +.-+.+....+.+...-.......-.+.. -...|+-+.+|..++.+. T Consensus 196 ig~~~G~~Vi~s~~~~------~~t~~l~~--~gA~~~~~~~~~~vE~~Rd~~~~~d~i~~-~~~y~~~~~~~~~~v~~t 266 (274) T protein:vir:96 196 FGEALGAVIVRSNKLE------AGTAILAK--KGAVKLITKRDFFLETDRDPSTKTTALYS-DKHYVAYLYDESKAVKIT 266 (274) T ss_pred cceecCeEEEEeCCCC------CceEEEEe--ccceeeeecCCcccccccccccccCEEEE-eEEEEEEEEcCCcEEEEE Confidence 123444444331 11122222 12233322223222111111112222222 344689999999888887 Q ss_pred cC Q lcl|NC_019522. 310 GV 311 (311) Q Consensus 310 GI 311 (311) -= T Consensus 267 k~ 268 (274) T protein:vir:96 267 KG 268 (274) T ss_pred cC Confidence 44 No 124 >protein:vir:95898 Length: 274 # NCBI annotation: ORF014 # Family: family:all:522 # MgeID: mge:1588 # MgeName: 71 # Cross-refs: genbank:acc:YP_240385;genbank:gi:66396054;genbank:GeneID:5133409 Probab=96.42 E-value=0.00064 Score=38.11 Aligned_cols=257 Identities=9% Similarity=0.024 Sum_probs=127.7 Q ss_pred CCcccccccchhhhhhhHHHHHHHHHHHHhhhhhhhhhhhhccccCC--CCcceeEEEEEEeecccceEEecCcccccce Q lcl|NC_019522. 1 MAKSVFDVSPVSALSFLVNQAAHIESEIYRIEYPQFKYGTLLPLDNS--APDWAQAVMFRSIDARGELQLFGPNSTDVPT 78 (311) Q Consensus 1 ~~~~~~~~~~~~~~~fl~~~L~~id~~v~~~~~~~~~~~~~~~v~~~--~~~~~~~~~~~~~~~~G~a~~~~~~a~dip~ 78 (311) ||... + .-+. ++.. |-+.+.|.+.....+....+..+... +.+| .++.++.++..|.++.+..+ ++|+. T Consensus 1 m~~~~--T-~l~d--~i~P--ev~~~~v~~~~~~~l~~~~~~~~~~~l~g~~G-~tv~iP~~~~ig~a~~~~~g-~~i~~ 71 (274) T protein:vir:95 1 MAQGM--T-KLTN--QIVP--EVLAPMMQAELEKKLRFASFAEIDNTLVGQPG-DTLTFPAFIYSGDAKVVAEG-EKIPT 71 (274) T ss_pred CCcce--e-ehhh--eech--HHHHHHHHHHHHhhhhccccceecccccCCCC-CEEEeeeecCCCccccccCC-Cccch Confidence 44421 0 0011 1122 11222233333344455555444433 2234 67788888888999988765 67888 Q ss_pred eeeeccceeEEEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhhheeeeeccccceeeeecCCcceeeccC Q lcl|NC_019522. 79 VDIAMSQGFKDINTAALGYTYSIEEIGFAMLNNVNLDAERGQAVRDVVEQGLNKIYLLGDKGVGEGLYTSPNVSVEAATS 158 (311) Q Consensus 79 v~~~~~~~~~~v~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aa~~~~~~~~n~~~~~G~~~~g~GllN~p~v~~~~~~~ 158 (311) -.+..+.....+...+.+|.+ .|+++.+. +-++-......+..++++..|+.++ +.+..+-+. ... T Consensus 72 ~~lt~~~~~~~i~~~~~a~~i--~D~~~~~~-~~d~~~~~~~~~~~~~a~~vd~~i~---~~l~~a~~~--------~~~ 137 (274) T protein:vir:95 72 DILETKKREAKIRKIAKGTSI--SDEALLSG-YGDPQGEQVRQHGLAHANKVDDDVL---EALKSAKLT--------VEA 137 (274) T ss_pred hhcccceeEEEeeeeecceee--hHHHHhhc-cchHHHHHHHHHHHHHHHHHHHHHH---HHHhccccc--------ccc Confidence 888888888888876666555 46655444 4445566677777888888777554 111111100 000 Q ss_pred CccccCcccccCCHHHHHHHHHHHHHHHHhccCCceecceEEEeCHHHHHHHhccc----ccCCCCCcchHHHHHHHhC- Q lcl|NC_019522. 159 TFVALVAAIPTNGTQPIIDFFGNAYNTVYLDNTLTVHRPNTFVLPPAQFQLLARTL----LSTQNASNVTLLQFLRTNF- 233 (311) Q Consensus 159 ~~~~~~t~w~~~t~~ei~~di~~~~~~~~~~~~~~~~~p~~l~lpp~~~~~L~~~~----~~~~~~~~~Tvl~~l~~n~- 233 (311) ...+ ++.|+++...+-.. ...+..|+++|..+..|.+-. +..++ .+.. +..|+ T Consensus 138 ---------~~~~----~d~i~~A~~~lgd~----~~~~~~ivv~p~~~~~L~k~~~~~f~~~s~-~g~~----~~~~G~ 195 (274) T protein:vir:95 138 ---------DITK----LTGLQTAIDKFNDE----DLEPMVLFISPLDAGKLRGDATTNFTRATE-LGDD----VIVKGA 195 (274) T ss_pred ---------cccC----HHHHHHHHHHhccc----cccccEEEeCHHHHHHHHhhcccccccccc-cccc----ceeccc Confidence 0012 45566666666321 135679999999999997631 11111 1111 11111 Q ss_pred ----CceEEEEchhcccCCCCcccEEEEEEcCcceeEEeecchhhhccceeeCCceEEEeeeeeeeeEEEECCeEEEEee Q lcl|NC_019522. 234 ----PDITFEDDILLKGAGVAGADRMAVYKKEIRIVKGHDVMPLRFLAPATADNVNFKVPAILRTGGTEWRIPKAGHYVD 309 (311) Q Consensus 234 ----~~l~i~~~~~l~~ag~~g~~~~v~y~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~gGv~i~~P~ai~~~d 309 (311) -.++|+....+. ....+++. +.-+.+....+.+...-.......-.+.. -...|+-+.+|..++.+. T Consensus 196 ig~~~G~~Vi~s~~~~------~~t~~l~~--~gA~~~~~~~~~~vE~~Rd~~~~~d~i~~-~~~y~~~~~~~~~~v~~t 266 (274) T protein:vir:95 196 FGEALGAVIVRSNKLE------AGTAILAK--KGAVKLITKRDFFLETDRDPSTKTTALYS-DKHYVAYLYDESKAVKIT 266 (274) T ss_pred cceecCeEEEEeCCCC------CceEEEEe--ccceeeeecCCcccccccccccccCEEEE-eEEEEEEEEcCCcEEEEE Confidence 123444444331 11122222 12233322223222111111112222222 344689999999888887 Q ss_pred cC Q lcl|NC_019522. 310 GV 311 (311) Q Consensus 310 GI 311 (311) -= T Consensus 267 k~ 268 (274) T protein:vir:95 267 KG 268 (274) T ss_pred cC Confidence 44 No 125 >protein:vir:95963 Length: 395 # NCBI annotation: ORF009 # Family: family:all:635 # MgeID: mge:1594 # MgeName: 2638A # Cross-refs: genbank:acc:YP_239802;genbank:gi:66395459;genbank:GeneID:5132880 Probab=96.39 E-value=0.00067 Score=38.00 Aligned_cols=284 Identities=11% Similarity=-0.007 Sum_probs=140.2 Q ss_pred CCc----------ccccccchhhhhhhHHHHHHHHHHHHhhhhhhhhhhhhccccCCCCcceeEEEEEEeecccceEEec Q lcl|NC_019522. 1 MAK----------SVFDVSPVSALSFLVNQAAHIESEIYRIEYPQFKYGTLLPLDNSAPDWAQAVMFRSIDARGELQLFG 70 (311) Q Consensus 1 ~~~----------~~~~~~~~~~~~fl~~~L~~id~~v~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~G~a~~~~ 70 (311) ... -.+...+.+++++++. +.+..+|++.....-..+.++.+..-. . ...+...+..+.+.|.. T Consensus 71 ~~~~l~~ee~~~~~~~~~~t~~~gG~liP--~~~~~~Ii~~l~~~s~i~~~~~v~~~~---~-~~~i~~~~~~~~a~w~~ 144 (395) T protein:vir:95 71 SQDPLTSEERKFFNDINYDVGYTDEKILP--ETVVERVFDDLQKDHPLLSKINFQNAG---I-KTRVIKADPAGQAVWGK 144 (395) T ss_pred CccccchHHHHHHHHHhhccCCCCceecc--HHHHHHHHHHHHhhhhhhhhceeEecC---C-ceEEEEecCCcceEEee Confidence 000 1223334555667775 456777877776666666666654322 1 23455667778888865 Q ss_pred CcccccceeeeeccceeEEEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhhheeeeecccc--c-eeeee Q lcl|NC_019522. 71 PNSTDVPTVDIAMSQGFKDINTAALGYTYSIEEIGFAMLNNVNLDAERGQAVRDVVEQGLNKIYLLGDKGV--G-EGLYT 147 (311) Q Consensus 71 ~~a~dip~v~~~~~~~~~~v~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aa~~~~~~~~n~~~~~G~~~~--g-~GllN 147 (311) ....--+..+..++......+.++.-..+|. ||. ..+..+|+.--....++++...+++-+++|+... . .|+|| T Consensus 145 e~~~~~~~~~~~f~~i~l~~~kl~~~~~iS~-ell--~ds~~~ie~~i~~~la~~ia~~~~~a~i~G~G~~~~qP~Gil~ 221 (395) T protein:vir:95 145 VFGEIKGQLDAAFREENFTQYKLTCFVVLPD-DLS--TFGPAWIERFVRTQIQEAISVALESAIINGGGAAKTQPVGLMK 221 (395) T ss_pred cccccCccccccceeeeeceeeEEEeecccH-HHH--hcchhHHHHHHHHHHHHHHHHHHhhheeeccCCCCcCceeeee Confidence 4332224456778888888999988888884 443 4466789999999999999999999999998653 2 69999 Q ss_pred cCCcceeeccCCccccCcccccCCHHHHHHHHHHHHHHHHhccCC---ceecceEEEeCHHHHHHHhcccccCC-CCCcc Q lcl|NC_019522. 148 SPNVSVEAATSTFVALVAAIPTNGTQPIIDFFGNAYNTVYLDNTL---TVHRPNTFVLPPAQFQLLARTLLSTQ-NASNV 223 (311) Q Consensus 148 ~p~v~~~~~~~~~~~~~t~w~~~t~~ei~~di~~~~~~~~~~~~~---~~~~p~~l~lpp~~~~~L~~~~~~~~-~~~~~ 223 (311) +.+............+.. .....+-....+..++..+.....+ .....-+.+|.+..+..+...+.-.+ +.... T Consensus 222 ~~~~~~~~~~~~~~~~~~--t~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~mn~~t~~~~~g~~~~~~~~G~~~ 299 (395) T protein:vir:95 222 DVNTNSGAVTDKASSGTL--TFADADTTILELNDVLKNLSVDEKGKELKIDGKVALVVNPRDSWDVQARYTYLTANGGFV 299 (395) T ss_pred cccccccccccccccchh--hhhhhHhhHHHHHHHHHhhccccccchhhhcCceEEEEcchhhhhcCCcceeccCCCcce Confidence 866432221111011111 0111112223333333222110000 01112356777776655433222111 11011 Q ss_pred hHHHHHHHhCCceEEEEchhcccCCCCcccEEEEEEcCcceeEEeecchhh--hccceeeCCceEEEeeeeeeeeEEEEC Q lcl|NC_019522. 224 TLLQFLRTNFPDITFEDDILLKGAGVAGADRMAVYKKEIRIVKGHDVMPLR--FLAPATADNVNFKVPAILRTGGTEWRI 301 (311) Q Consensus 224 Tvl~~l~~n~~~l~i~~~~~l~~ag~~g~~~~v~y~~~~~~~~~~~~~~~~--~~~p~~~~~~~~~~~~~~~~gGv~i~~ 301 (311) |++ +.++.++....+.. |+ ++|-+=.++. +..-..++ +..-.........+....|++ ..++. T Consensus 300 ~~l------g~g~~v~~~~~~p~----~~---i~fgdfs~y~-i~~r~~~~i~~~~~~~~~~d~~~f~~~~r~d-g~~~~ 364 (395) T protein:vir:95 300 TVL------PYNVTIITSEFVPE----GK---LVAFVTDRYN-AVRGGGLTVKKFDQTLALEDAVLFTAKTFAY-GQPDD 364 (395) T ss_pred ecc------CCcceEEEcCCCCC----Cc---EEEEecccEE-EEEecceEEEeccchhhhCCcEEEEEEEEEC-CEEec Confidence 111 12344444333321 11 2232222221 11111111 111000001124456667874 56778 Q ss_pred CeEEEEeecC Q lcl|NC_019522. 302 PKAGHYVDGV 311 (311) Q Consensus 302 P~ai~~~dGI 311 (311) |.|+.+++ | T Consensus 365 ~~A~~~l~-i 373 (395) T protein:vir:95 365 NKASAVYD-L 373 (395) T ss_pred cccEEEEE-e Confidence 88887543 2 No 126 >protein:vir:101607 Length: 379 # NCBI annotation: major capsid protein precursor # Family: family:all:585 # MgeID: mge:1646 # MgeName: 11b # Cross-refs: genbank:acc:YP_112497;genbank:gi:53793597;uniprot:Q5ZGF6;genbank:GeneID:3101715 Probab=96.27 E-value=0.0008 Score=37.57 Aligned_cols=258 Identities=11% Similarity=-0.018 Sum_probs=135.4 Q ss_pred CC---cccccccchhhhhhhHHHHHHHHHHHHhhhhhhhhhhhhccccCCCCcceeEEEEEEeeccc--ceEEecCcccc Q lcl|NC_019522. 1 MA---KSVFDVSPVSALSFLVNQAAHIESEIYRIEYPQFKYGTLLPLDNSAPDWAQAVMFRSIDARG--ELQLFGPNSTD 75 (311) Q Consensus 1 ~~---~~~~~~~~~~~~~fl~~~L~~id~~v~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~G--~a~~~~~~a~d 75 (311) +. ...+.+.+. ++.++ .+.+...|++........+.++.+.+- ...++.|......+ .+.+++. +.. T Consensus 101 ~~~~~~~~~~~~~~--~~~~i--p~~~~~~ii~~~~~~~~i~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~v~E-g~~ 172 (379) T protein:vir:10 101 IQVKAVGDMTLPVN--LTGAQ--PKDYNFDVVLNPSQMLNVSDIVGAVSI---SGGTYTFVRENGAGEGAIGAQVE-GAT 172 (379) T ss_pred hhhhhhcccccCCC--Ccccc--chhhhhHHHHhHHhhhhHHhhceeeec---cCCceEEEEeecCCCcccccccC-Ccc Confidence 11 111111111 12222 244556777777777777777665432 23345665554443 3334444 467 Q ss_pred cceeeeeccceeEEEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhhheeeeeccccce-eeeecCCccee Q lcl|NC_019522. 76 VPTVDIAMSQGFKDINTAALGYTYSIEEIGFAMLNNVNLDAERGQAVRDVVEQGLNKIYLLGDKGVGE-GLYTSPNVSVE 154 (311) Q Consensus 76 ip~v~~~~~~~~~~v~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aa~~~~~~~~n~~~~~G~~~~g~-GllN~p~v~~~ 154 (311) +|..+..++.....++.++..+.+|.+ +-.-. . .|.+--....++++...+|.-++.|....+. +.+ T Consensus 173 ~~~~~~~f~~i~~~~~k~~~~~~iS~e-ll~D~--~-~l~~~i~~~la~~~~~~~~~~~~~g~~~~~~~~~~-------- 240 (379) T protein:vir:10 173 KGQKDYDISMIDVNTDFIAGFTRYSKK-MANNL--P-FLTSFIPNALRRDYAKAENAAFNAVLAANATASTE-------- 240 (379) T ss_pred ccccccceeeeEeeeeeEEeeehhhHH-HHhhH--H-HHHHHHHHHHHHHHHHHHHHHHhcccccccccccc-------- Confidence 899899999999999999999999854 43322 1 3667777777888888888877766543221 111 Q ss_pred eccCCccccCcccccCCHHHHHHHHHHHHHHHHhccCCceecceEEEeCHHHHHHHhcccccCCCCCcchHHH--HHHHh Q lcl|NC_019522. 155 AATSTFVALVAAIPTNGTQPIIDFFGNAYNTVYLDNTLTVHRPNTFVLPPAQFQLLARTLLSTQNASNVTLLQ--FLRTN 232 (311) Q Consensus 155 ~~~~~~~~~~t~w~~~t~~ei~~di~~~~~~~~~~~~~~~~~p~~l~lpp~~~~~L~~~~~~~~~~~~~Tvl~--~l~~n 232 (311) .. + ...-+++|.+++..+.. + + ..+..++|+|+.|..|.+-. +..|.-++. ....+ T Consensus 241 --~~--~----------~~~~~d~i~~~~~~~~~-~-~--~~~~~~vmn~~~~~~l~~lk----d~~G~~l~~~~~~~~~ 298 (379) T protein:vir:10 241 --II--T----------NKNKVEMLINEIAKQEN-L-D--FPVTAIVLRPTDYYDILVTQ----KSVGAGYGLPGVVTQD 298 (379) T ss_pred --cc--c----------CcccHHHHHHHHHhhhh-c-c--CCCCEEEEcHHHHHHHHHhh----ccCCceeccCCccCCC Confidence 00 0 00114567777776642 2 2 24567999999998886532 111221111 00011 Q ss_pred C-----CceEEEEchhcccCCCCcccEEEEEEcCcceeEEeecchhh----hc--cceeeCCceEEEeeeeeeeeEEEEC Q lcl|NC_019522. 233 F-----PDITFEDDILLKGAGVAGADRMAVYKKEIRIVKGHDVMPLR----FL--APATADNVNFKVPAILRTGGTEWRI 301 (311) Q Consensus 233 ~-----~~l~i~~~~~l~~ag~~g~~~~v~y~~~~~~~~~~~~~~~~----~~--~p~~~~~~~~~~~~~~~~gGv~i~~ 301 (311) + ..+.++..+.+. + |+ +++-+-.++.-+ +-+..+ .. ...+. + ...+.++.|+ |+.+++ T Consensus 299 ~~~~~l~G~pvv~s~~~~-a---g~---~~~gdf~~~~~~-~~~~~~i~~~~~~~~~f~~-~-~~~~r~~~R~-~~~v~~ 367 (379) T protein:vir:10 299 NGVLRINGIPLFRATWLA-A---NK---YYVGDWTRVTKV-TTEGLSLEFSEVEGTNFVK-N-NITARIEAQV-ALAVEQ 367 (379) T ss_pred CCcceecceeeEecCCCC-C---Cc---eEEeecccEEEE-EEeceEEEEeecccccccC-C-cEEEEEEEEe-ccEEec Confidence 1 113344444442 2 21 122222221111 111111 00 01221 2 2566677888 688889 Q ss_pred CeEEEE--eecC Q lcl|NC_019522. 302 PKAGHY--VDGV 311 (311) Q Consensus 302 P~ai~~--~dGI 311 (311) |.||++ +.+| T Consensus 368 p~a~v~~~~~~~ 379 (379) T protein:vir:10 368 PAALIFGDFTAV 379 (379) T ss_pred CccEEEEEecCC Confidence 999999 7788 No 127 >protein:vir:105334 Length: 276 # NCBI annotation: putative phage major capsid protein # Family: family:all:522 # MgeID: mge:1679 # MgeName: PH15 # Cross-refs: genbank:acc:YP_950669;genbank:gi:119967839;genbank:GeneID:4643213 Probab=96.20 E-value=0.00088 Score=37.33 Aligned_cols=262 Identities=10% Similarity=0.021 Sum_probs=130.2 Q ss_pred CCcccccccchhhhhhhHHHHHHHHHHHHhhhhhhhhhhhhccccCCCCc-ceeEEEEEEeecccceEEecCccccccee Q lcl|NC_019522. 1 MAKSVFDVSPVSALSFLVNQAAHIESEIYRIEYPQFKYGTLLPLDNSAPD-WAQAVMFRSIDARGELQLFGPNSTDVPTV 79 (311) Q Consensus 1 ~~~~~~~~~~~~~~~fl~~~L~~id~~v~~~~~~~~~~~~~~~v~~~~~~-~~~~~~~~~~~~~G~a~~~~~~a~dip~v 79 (311) ||.- ...-+.+ +.. |.+-+-|.+.....+....+..+.....- .-.++.+..++..|.++.++++ ++||.- T Consensus 1 Ma~~---~T~l~d~--i~P--ev~~~~v~~~~~~~~~~~~~~~~~~~l~g~~G~ti~iP~~~~igda~~~~eg-~~i~~~ 72 (276) T protein:vir:10 1 MAQG---TTTKSTQ--IVP--EVLAPMMQAELDKKLRFAQFADIDSTLVGQPGDTLTFPAFVYSGDATVVPEG-QKIPVD 72 (276) T ss_pred CCcc---eeehhhh--hch--HHHHHHHHHHHHhhhhhcccceecccccCCCCCEEEeeeecCCCccccccCC-CccCcc Confidence 5421 1111222 122 12223333333344455555555444332 3457788888888999988876 578888 Q ss_pred eeeccceeEEEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhhheeeeeccccceeeeecCCcceeeccCC Q lcl|NC_019522. 80 DIAMSQGFKDINTAALGYTYSIEEIGFAMLNNVNLDAERGQAVRDVVEQGLNKIYLLGDKGVGEGLYTSPNVSVEAATST 159 (311) Q Consensus 80 ~~~~~~~~~~v~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aa~~~~~~~~n~~~~~G~~~~g~GllN~p~v~~~~~~~~ 159 (311) .+..+.....+...+.+|.++ |+...+ .+.++-..-.+.+...+++++|+.++- . |+.- ..+ T Consensus 73 ~lt~~~~~a~i~~~~k~~~~t--D~a~~~-~~~dp~~~~~~~~~~~~a~~~d~~~~~---~-----l~~~-------~~~ 134 (276) T protein:vir:10 73 KIETNRREAKIHKIGKGTDIT--DEALLS-GYGDPQGEAVRQHGLAIANKVDNDVLE---A-----LRGT-------KLT 134 (276) T ss_pred ccccceeeEEeehcccccccc--HHHHHh-hccchHHHHHHHHHHHHHHHHHHHHHH---H-----Hhcc-------ccc Confidence 888899999998877666665 444433 345555666777777788887765441 1 1100 000 Q ss_pred ccccCcccccCCHHHHHHHHHHHHHHHHhccCCceecceEEEeCHHHHHHHhcc----cccCCCCCcchHH-HHHHHhCC Q lcl|NC_019522. 160 FVALVAAIPTNGTQPIIDFFGNAYNTVYLDNTLTVHRPNTFVLPPAQFQLLART----LLSTQNASNVTLL-QFLRTNFP 234 (311) Q Consensus 160 ~~~~~t~w~~~t~~ei~~di~~~~~~~~~~~~~~~~~p~~l~lpp~~~~~L~~~----~~~~~~~~~~Tvl-~~l~~n~~ 234 (311) ... ..-| ++.|.+++..+-.. -..+..|+++|+.+..|.+- ...... .+.-++ .=....+- T Consensus 135 ~~~-----~~~t----~d~i~~A~~~lgd~----~~~~~~ivv~p~~~~~L~k~~~~~f~~~s~-~g~~~~~~G~ig~~~ 200 (276) T protein:vir:10 135 VSA-----DIGT----LAGLEAAIDTFDDE----DLEPMVLFINPKDAGKLRSSASDNFTRATE-LGDNIIVKGAFGEAL 200 (276) T ss_pred ccc-----cccC----HHHHHHHHHHhccc----cCcccEEEEcHHHHHHHHHhcccccccccc-ccccceeccccceec Confidence 000 0112 45566666666322 12568999999999988542 111111 110000 00000112 Q ss_pred ceEEEEchhcccCCCCcccEEEEEEcCcceeEEeecchhhhccceeeCCceEEEeeeeeeeeEEEECCeEEEEeecC Q lcl|NC_019522. 235 DITFEDDILLKGAGVAGADRMAVYKKEIRIVKGHDVMPLRFLAPATADNVNFKVPAILRTGGTEWRIPKAGHYVDGV 311 (311) Q Consensus 235 ~l~i~~~~~l~~ag~~g~~~~v~y~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~gGv~i~~P~ai~~~dGI 311 (311) .++|+..+.+.. ...+++. +.-+.+....+.+...-.......-.+... ...|+-+.+|..++.+.=- T Consensus 201 G~~Vi~s~~~p~------~t~~l~~--~gAi~~~~~~~~~vE~dRd~~~~~d~i~~~-~~y~~~~~~~~~vv~~t~~ 268 (276) T protein:vir:10 201 GAVIVRSKKLDE------GEAILAK--RGAVKLITKRDFFLETDRDPSTKTTALYSD-KHYVAYLYDESKAVKVTKG 268 (276) T ss_pred ceeEEEcCCCCc------ceEEEEe--ccceeeeecCCceeecccchhhcccEEEEe-eEEEEEEEcCcceEEEecC Confidence 245555554421 2223322 233443333333221111111112233332 3358899999988888744 No 128 >protein:vir:100884 Length: 389 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:1473 # MgeName: Lc-Nu # Cross-refs: genbank:acc:YP_358764;genbank:gi:78000028;genbank:GeneID:3726155 Probab=96.14 E-value=0.00096 Score=37.14 Aligned_cols=265 Identities=14% Similarity=0.038 Sum_probs=134.1 Q ss_pred CCcccccccchhhhhhhHHHHHHHHHHHHhhhhhhhhhhhhccccCCCCcceeEEEEEEeec-ccceEEecCccccccee Q lcl|NC_019522. 1 MAKSVFDVSPVSALSFLVNQAAHIESEIYRIEYPQFKYGTLLPLDNSAPDWAQAVMFRSIDA-RGELQLFGPNSTDVPTV 79 (311) Q Consensus 1 ~~~~~~~~~~~~~~~fl~~~L~~id~~v~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~-~G~a~~~~~~a~dip~v 79 (311) -....|.....+.+++++. +.+.+.|++........+.++++.. -...+..|.+... .+.+.+++.++...+.. T Consensus 104 ~~~~~~~~~t~~~gg~~vP--~~~~~~i~~~~~~~~~l~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~E~~~~~~~~ 178 (389) T protein:vir:10 104 KVIDATSKVTSTEAGVLIP--EEIIYDPTAEVNSVVDLSTLVTKTP---VTTPKGTYPILKRATDRFSSVAELAENPKLA 178 (389) T ss_pred hhhhhhcccccCCcceeeh--HHHHHHHHHHHHhhhhHHhhcceee---ccCCeeEEEEEecCCCccccccccccccccc Confidence 1112233333344566665 4466778887777766666665432 2223445554443 44556666654432245 Q ss_pred eeeccceeEEEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhhheeeeeccccceeeeecCCcceeeccCC Q lcl|NC_019522. 80 DIAMSQGFKDINTAALGYTYSIEEIGFAMLNNVNLDAERGQAVRDVVEQGLNKIYLLGDKGVGEGLYTSPNVSVEAATST 159 (311) Q Consensus 80 ~~~~~~~~~~v~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aa~~~~~~~~n~~~~~G~~~~g~GllN~p~v~~~~~~~~ 159 (311) +..++......+.++.-+.+|.+=|+ .+..+|...-....++++...+|.-++.|..... +. T Consensus 179 ~~~~~~i~~~~~k~~~~~~iS~ell~---ds~~~l~~~i~~~la~~~~~~~~~~i~~g~~~~~--------------~~- 240 (389) T protein:vir:10 179 EPEFNKVDWSVATYRGAIPLSEEAIA---DSAVDLTALVGQSIKEKSVNTYNAMIAPVLQSFT--------------AK- 240 (389) T ss_pred cccceeeeeeheeeEeeehhhHHHHh---hhhHHHHHHHHHHHHHHHHHHHHHHHhhhhcccc--------------cc- Confidence 67788889999999999999865333 3345777888888889999999988887765311 00 Q ss_pred ccccCcccccCCHHHHHHHHHHHHHHHHhccCCceecceEEEeCHHHHHHHhcccccCCCCCcchHHH-----HHHH--- Q lcl|NC_019522. 160 FVALVAAIPTNGTQPIIDFFGNAYNTVYLDNTLTVHRPNTFVLPPAQFQLLARTLLSTQNASNVTLLQ-----FLRT--- 231 (311) Q Consensus 160 ~~~~~t~w~~~t~~ei~~di~~~~~~~~~~~~~~~~~p~~l~lpp~~~~~L~~~~~~~~~~~~~Tvl~-----~l~~--- 231 (311) +. + ...+ ++++.++++.... ..+ ...++++|+.+..|.+-. ..+ |.-++. -... T Consensus 241 ~~---~--~~~~----~d~l~~~~~~~~~-~~~----~a~~~~n~~~~~~L~~lk-d~~---G~~i~~~~~~~~~~~~~~ 302 (389) T protein:vir:10 241 KT---T--TDTL----VDSLKHILNVDLD-PAY----SRALVVTQSLFNTLDTLK-DKN---GRYLLHDASDSITDGTAK 302 (389) T ss_pred cc---c--cccc----HHHHHHHHHhhhh-hhh----CcEEEecHHHHHHHHHhh-ccC---CCeeeecCcccccccccc Confidence 00 0 0112 3445554442211 111 136899999999987532 111 111110 0000 Q ss_pred ---hCCceEEEEchhcccCCCCcccEEEEEEcCcceeEEeecchhhhccceeeCCceEEEeeeeeeeeEEEECCeEEEEe Q lcl|NC_019522. 232 ---NFPDITFEDDILLKGAGVAGADRMAVYKKEIRIVKGHDVMPLRFLAPATADNVNFKVPAILRTGGTEWRIPKAGHYV 308 (311) Q Consensus 232 ---n~~~l~i~~~~~l~~ag~~g~~~~v~y~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~gGv~i~~P~ai~~~ 308 (311) .+.++.+...-.+. ..+.+..++|-+=.+.+-+...+.+++..- +.......+....|++|. +.+|.||+.+ T Consensus 303 ~~l~G~pV~~~~~~~~~---~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~-~~~~~~~~~~~~~r~d~~-~~~~~a~~~~ 377 (389) T protein:vir:10 303 GTILGVPVYVVGDTLLG---SLAGDQKAFVGDLKRGVLFTDRQQVTLAWE-DSKIYGKYLGAAFRFGVQ-KADSKAGYFV 377 (389) T ss_pred cccccceeEEecccccC---CCCCceEEEEeeccccEEEEeecceEEEee-ccccccceEEEEEEeccE-EecccceEEE Confidence 11122222221121 122233344443333222222233332211 111122234555787654 7889998866 Q ss_pred e--cC Q lcl|NC_019522. 309 D--GV 311 (311) Q Consensus 309 d--GI 311 (311) + .. T Consensus 378 ~~~~~ 382 (389) T protein:vir:10 378 TNTDV 382 (389) T ss_pred Eeecc Confidence 5 44 No 129 >protein:vir:1239 Length: 274 # NCBI annotation: similar to phage B1 major head protein # Family: family:all:522 # MgeID: mge:25 # MgeName: phi ETA # Cross-refs: genbank:acc:NP_510938;genbank:gi:17426272;genbank:GeneID:927376 Probab=96.11 E-value=0.001 Score=37.03 Aligned_cols=262 Identities=10% Similarity=0.011 Sum_probs=125.2 Q ss_pred CCcccccccchhhhhhhHHHHHHHHHHHHhhhhhhhhhhhhccccCCCCc-ceeEEEEEEeecccceEEecCccccccee Q lcl|NC_019522. 1 MAKSVFDVSPVSALSFLVNQAAHIESEIYRIEYPQFKYGTLLPLDNSAPD-WAQAVMFRSIDARGELQLFGPNSTDVPTV 79 (311) Q Consensus 1 ~~~~~~~~~~~~~~~fl~~~L~~id~~v~~~~~~~~~~~~~~~v~~~~~~-~~~~~~~~~~~~~G~a~~~~~~a~dip~v 79 (311) ||... ..-+.+ +.. |.+.+-|.+.....+....+..+...... .-.++.++.+...|.++.+.++ ++|+.- T Consensus 1 ma~~~---T~l~d~--iiP--ev~~~~v~~~~~~~l~~~~~~~~d~~l~g~~G~tv~iP~~~~ig~a~~~~~g-~~i~~~ 72 (274) T protein:vir:12 1 MAQGL---TKTSNQ--IIP--EVLAPMMQAQLEKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEG-EKIPTD 72 (274) T ss_pred CCcce---eehhhh--hch--HHHHHHHHHHHHhhhhhcccceecccccCCCCCEEEEeeecCCCccccccCC-Cccchh Confidence 44321 011111 111 12223333333344455555555443222 3456788888888999988775 578888 Q ss_pred eeeccceeEEEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhhheeeeeccccceeeeecCCcceeeccCC Q lcl|NC_019522. 80 DIAMSQGFKDINTAALGYTYSIEEIGFAMLNNVNLDAERGQAVRDVVEQGLNKIYLLGDKGVGEGLYTSPNVSVEAATST 159 (311) Q Consensus 80 ~~~~~~~~~~v~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aa~~~~~~~~n~~~~~G~~~~g~GllN~p~v~~~~~~~~ 159 (311) ++..+.....+...+.+|++ .|++..+..+ ++-......+..+++++.|+-++.- ++. +..+ T Consensus 73 ~lt~~~~~~~i~~~~~~~~i--~D~~~~~~~~-d~~~~~~~q~~~~~a~~vd~~~l~~--------~~~-------a~~~ 134 (274) T protein:vir:12 73 ILETKKREAKIRKIAKGTSI--TDEALLSGYG-DPQGEQVRQHGLAHANKVDNDVLEA--------LMG-------AKLT 134 (274) T ss_pred hcccceeeEEeeeecceeee--cHHHHHhccc-chHHHHHHHHHHHHHHHHHHHHHHH--------Hhc-------cccc Confidence 88888888888887666555 4555544444 4445566666677777777654411 110 0000 Q ss_pred ccccCcccccCCHHHHHHHHHHHHHHHHhccCCceecceEEEeCHHHHHHHhccc----ccCCCCCcchHHH-HHHHhCC Q lcl|NC_019522. 160 FVALVAAIPTNGTQPIIDFFGNAYNTVYLDNTLTVHRPNTFVLPPAQFQLLARTL----LSTQNASNVTLLQ-FLRTNFP 234 (311) Q Consensus 160 ~~~~~t~w~~~t~~ei~~di~~~~~~~~~~~~~~~~~p~~l~lpp~~~~~L~~~~----~~~~~~~~~Tvl~-~l~~n~~ 234 (311) .... ..+ ++.|+++..++-.. ...+..|+++|..+..|.+-. +...+ .+..++. =....+- T Consensus 135 --~~~~---a~~----~d~i~dA~~~lgd~----~~~~~~ivv~p~~~~~L~k~~~~~fv~~s~-~g~~~~~~G~ig~~~ 200 (274) T protein:vir:12 135 --VNAD---ITK----LNGLQSAIDKFNDE----DLEPMVLFINPLDAGKLRGDASTNFTRATE-LGDDIIVKGAFGEAL 200 (274) T ss_pred --cccc---ccC----HHHHHHHHHHhccc----cccccEEEeCHHHHHHHHhhhhhhcccccc-ccccceecccceeec Confidence 0000 012 45566666666321 135678999999999987631 11111 1111100 0000011 Q ss_pred ceEEEEchhcccCCCCcccEEEEEEcCcceeEEeecchhhhccceeeCCceEEEeeeeeeeeEEEECCeEEEEeecC Q lcl|NC_019522. 235 DITFEDDILLKGAGVAGADRMAVYKKEIRIVKGHDVMPLRFLAPATADNVNFKVPAILRTGGTEWRIPKAGHYVDGV 311 (311) Q Consensus 235 ~l~i~~~~~l~~ag~~g~~~~v~y~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~gGv~i~~P~ai~~~dGI 311 (311) ..+|+..+.+.. ...+++. +.-+.+....+.+...-.......-.+.. -...|+-+.+|..++.+..= T Consensus 201 G~~Vi~s~~~p~------~t~~l~~--~gA~~~~~~~~~~vE~~Rd~~~~~d~i~~-~~~y~~~~~~~~~vv~~t~~ 268 (274) T protein:vir:12 201 GAIIVRSNKLEA------GTAILAK--KGAVKLILKRDFFLEVARDASTKTTALYS-DKHYVAYLYDESKAVKITKG 268 (274) T ss_pred CeeEEEeCCCCc------ceEEEEe--ccceeeeecCCceeccccchhhcccEEEe-eeEEEEEEEcCCceEEEEcC Confidence 234444433311 1112221 12222222222221111111111122222 23457888899888888866 No 130 >protein:vir:739 Length: 231 # NCBI annotation: major structural protein 4 # Family: family:all:522 # MgeID: mge:14 # MgeName: Tuc2009 # Cross-refs: genbank:acc:NP_108716;genbank:gi:13487838;genbank:GeneID:920884 Probab=96.10 E-value=0.001 Score=37.03 Aligned_cols=224 Identities=10% Similarity=-0.021 Sum_probs=114.7 Q ss_pred cCCCCcceeEEEEEEeecccceEEecCcccccceeeeeccceeEEEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHH Q lcl|NC_019522. 45 DNSAPDWAQAVMFRSIDARGELQLFGPNSTDVPTVDIAMSQGFKDINTAALGYTYSIEEIGFAMLNNVNLDAERGQAVRD 124 (311) Q Consensus 45 ~~~~~~~~~~~~~~~~~~~G~a~~~~~~a~dip~v~~~~~~~~~~v~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aa~~ 124 (311) ...++ .-.+++++.+ .|.++.++.+ +.||...+..+..+..|...+.+++++..+. .+..|=|+ .+-.....+ T Consensus 1 ~~~~~-~Gdtit~P~~--iGda~~v~eG-~~i~~~~l~~t~~~atIk~~gk~~~itD~a~--l~~~gDp~-~ea~~Q~~~ 73 (231) T protein:vir:73 1 ENGIN-LANLCEYPND--IGDAADVAEG-GEISLDKIGTTTKSVTIKKAAKGTEITDEAA--LSGYGDPI-GESNKQLGL 73 (231) T ss_pred Ccccc-CCceEEeccc--ccchhhhcCC-CcCChhhccccceeeeEeeeccceeeeHHHH--hhccCchH-HHHHHHHHH Confidence 33333 3346677655 8899888887 5588888999999999999988888875544 44566565 445555566 Q ss_pred HHHHhhhheeeeeccccceeeeecCCcceeeccCCccccCcccccCCHHHHHHHHHHHHHHHHhccCCceecceEEEeCH Q lcl|NC_019522. 125 VVEQGLNKIYLLGDKGVGEGLYTSPNVSVEAATSTFVALVAAIPTNGTQPIIDFFGNAYNTVYLDNTLTVHRPNTFVLPP 204 (311) Q Consensus 125 ~~~~~~n~~~~~G~~~~g~GllN~p~v~~~~~~~~~~~~~t~w~~~t~~ei~~di~~~~~~~~~~~~~~~~~p~~l~lpp 204 (311) +++.+.|.=++ +.+.. +.|..+++. -++.|++++..+.. ....+..++++| T Consensus 74 ~iA~kvD~di~---~~~~~---------------------a~l~~~~~~-t~d~i~~A~~~fgd----e~~~~~vivv~p 124 (231) T protein:vir:73 74 SLANKVDDDLL---KAAKT---------------------TSQTVSTKA-NVDGVQAALDIFND----EDAQAYVLIVNP 124 (231) T ss_pred HHHHhhhHHHH---Hhhcc---------------------ccccccccc-cHHHHHHHHHHhcc----ccccceEEEEcc Confidence 66666665322 01100 011111211 16667777777632 234678899999 Q ss_pred HHHHHHhcccccCCCCCcchHHHHHHHhC-----CceEEEEchhcccCCCCcccEEEEEEcCcceeEEeecchhhhccce Q lcl|NC_019522. 205 AQFQLLARTLLSTQNASNVTLLQFLRTNF-----PDITFEDDILLKGAGVAGADRMAVYKKEIRIVKGHDVMPLRFLAPA 279 (311) Q Consensus 205 ~~~~~L~~~~~~~~~~~~~Tvl~~l~~n~-----~~l~i~~~~~l~~ag~~g~~~~v~y~~~~~~~~~~~~~~~~~~~p~ 279 (311) ..+..|-+-. .. ........+=+..|+ -.++|+.++.+.. | +-...-|...+.-+.+..-.+.+.-.-+ T Consensus 125 ~~~~~Lrk~~-~~-~~~~~~~g~~i~~~G~iG~i~G~~Vi~S~~~~~-~---~~~~~~~i~~~gAl~~~~k~~~~vEtdR 198 (231) T protein:vir:73 125 KDAAKIRKDA-NA-KNIGSEVGANALINGTYADVLGAQIVRSKKLAE-G---SALMFKIVSNSPALKLVLKRGVQVETDR 198 (231) T ss_pred hHHHhhhhcc-ch-hhhhhhhccceeeecccceEcceEEEEcCCCCC-C---ceeeeeEEeeccceeeeecccceeeccc Confidence 9999884411 11 000000000011122 1245555444432 1 1222112112222333222222211111 Q ss_pred eeCCceEEEeeeeeeeeEEEECCeEEEEe--ecC Q lcl|NC_019522. 280 TADNVNFKVPAILRTGGTEWRIPKAGHYV--DGV 311 (311) Q Consensus 280 ~~~~~~~~~~~~~~~gGv~i~~P~ai~~~--dGI 311 (311) ........+- .-...+|-+++|..++.+ .|+ T Consensus 199 d~~~k~~~i~-~~~~y~v~l~~~~~vv~~t~~g~ 231 (231) T protein:vir:73 199 DIVTKTTVIT-ADEHYAAYLYDLTKVVNITFTGV 231 (231) T ss_pred cccccccEEE-EeEEEEEEEEcCccEEEEEeecC Confidence 1111112222 234468899999998876 678 No 131 >protein:vir:98635 Length: 377 # NCBI annotation: major coat protein # Family: family:all:635 # MgeID: mge:1601 # MgeName: phi3396 # Cross-refs: genbank:acc:YP_001039923;genbank:gi:126011098;genbank:GeneID:4818471 Probab=96.01 E-value=0.0011 Score=36.74 Aligned_cols=280 Identities=8% Similarity=-0.034 Sum_probs=128.1 Q ss_pred CCcccccccchhhhhhhHHHHHHHHHHHHhhhhhhhhhhhhccccCCCCcceeEEEEEEeecccceEEecCcccccceee Q lcl|NC_019522. 1 MAKSVFDVSPVSALSFLVNQAAHIESEIYRIEYPQFKYGTLLPLDNSAPDWAQAVMFRSIDARGELQLFGPNSTDVPTVD 80 (311) Q Consensus 1 ~~~~~~~~~~~~~~~fl~~~L~~id~~v~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~G~a~~~~~~a~dip~v~ 80 (311) .-+..+...+.+++++++. +.+..+|++.....-..+.++.+.+-. . ...+...+..+.+.|.+-.+.--+..+ T Consensus 74 ~~~~~~~~~~~~~gg~~vP--~~~~~~I~~~l~~~s~i~~~~~v~~~~---~-~~~~~~~~~~~~a~w~~e~~~~~~~~~ 147 (377) T protein:vir:98 74 FFNDIDKNVGGKDKFKLLP--EETMVQVFDDLVAEHPLLKVINFKNTS---L-RLKALTAETSGTAVWGDIFGEIKGQLK 147 (377) T ss_pred HHHHHHhccCCCCCccccC--HHHHHHHHHHHHHhhhhhhheeeEecC---c-ceEEEEecCCcceeEeecccccCcccC Confidence 0011222334455566665 335556666554443444444332221 1 124556677888888765433223456 Q ss_pred eeccceeEEEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhhheeeeeccccc-eeeeecCCcceeeccCC Q lcl|NC_019522. 81 IAMSQGFKDINTAALGYTYSIEEIGFAMLNNVNLDAERGQAVRDVVEQGLNKIYLLGDKGVG-EGLYTSPNVSVEAATST 159 (311) Q Consensus 81 ~~~~~~~~~v~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aa~~~~~~~~n~~~~~G~~~~g-~GllN~p~v~~~~~~~~ 159 (311) ..++....+.+.++.-..++.+ |. ..+.+++..--.....+++...+++-+++|+.... .|+||++.......... T Consensus 148 ~~f~~i~l~~~kl~a~~~is~e-lL--~ds~~~ie~~i~~~la~~~a~~~~~a~i~G~G~~qP~Gil~~~~~~~~~~~~~ 224 (377) T protein:vir:98 148 QAFKEQDFSQFKLTAFVVIPKD-AL--KFGPKWIKQFITEQLKEAIAVALELAIVKGDGLLQPVGLLKDLSQPTVDQSTG 224 (377) T ss_pred ccceeEeecceeEEeeecccHH-hh--hccHhHHHHHHHHHHHHHHHHHHhhceEeccCCCcceeeeecccccccccccc Confidence 6778888888998888888743 33 33456788889999999999999999999997655 69999875432221111 Q ss_pred ccccCcccccCCHHHHHHHHH------------HHHHHHHhc--cCC-ceecceEEEeCHHHHHHHhccc-ccCCCCCcc Q lcl|NC_019522. 160 FVALVAAIPTNGTQPIIDFFG------------NAYNTVYLD--NTL-TVHRPNTFVLPPAQFQLLARTL-LSTQNASNV 223 (311) Q Consensus 160 ~~~~~t~w~~~t~~ei~~di~------------~~~~~~~~~--~~~-~~~~p~~l~lpp~~~~~L~~~~-~~~~~~~~~ 223 (311) .. ..+.-..+ +-+.|+. -+.+..... ... .......+++.|..+..+.--+ ..+.+.... T Consensus 225 ~~-~~~~~~~~---~~~~~l~~~~~~~~~~~a~~~m~~~t~~~~~klkd~~G~~i~~~n~~~~~~~~p~~~~~~~~G~~~ 300 (377) T protein:vir:98 225 RD-ITTYKTDK---EAIADLSDLTPDNAPKKLVPVMKHLSVNDKKRPLKIAGQVKLILNPEDRWALEAQFTSRNQFGEYV 300 (377) T ss_pred cc-cccccchh---hhHhhhhhhchhHHHHHHHHHHHHHHHHHHhhhhccCCceEEEecccchhhccccccccCCCCccc Confidence 11 11100000 0011110 011111000 000 0011123344444333322111 000011111 Q ss_pred hHHHHHHHhCCceEEEEchhcccCCCCcccEEEEEEcCcceeEEeecchhhhcc----ceeeCCceEEEeeeeeeeeEEE Q lcl|NC_019522. 224 TLLQFLRTNFPDITFEDDILLKGAGVAGADRMAVYKKEIRIVKGHDVMPLRFLA----PATADNVNFKVPAILRTGGTEW 299 (311) Q Consensus 224 Tvl~~l~~n~~~l~i~~~~~l~~ag~~g~~~~v~y~~~~~~~~~~~~~~~~~~~----p~~~~~~~~~~~~~~~~gGv~i 299 (311) |+| +.+++++....... +.++ +-+-.++. +..-..+++.. ..... ...+....|.+ ..+ T Consensus 301 t~l------g~p~~vv~s~~~p~------~~i~-fgdf~~Y~-i~~r~~~~i~~~~~~~~~~d--~~~f~~~~r~d-g~~ 363 (377) T protein:vir:98 301 TVL------PHGITILESLAVET------GKAI-AFVANRYD-AFMATASTIEEYDQTFAMED--LQLYLTKNYFY-GKA 363 (377) T ss_pred ccc------CCCceEEecCCCCc------ccEE-EEEeccee-EEeecceEEEeechhhhhcC--ceEEEEEEEEc-CEE Confidence 222 22344444332221 1122 11111121 11111111110 01111 23355567765 478 Q ss_pred ECCeEEEEee---c Q lcl|NC_019522. 300 RIPKAGHYVD---G 310 (311) Q Consensus 300 ~~P~ai~~~d---G 310 (311) +.|.|++.++ | T Consensus 364 ~~~~a~~vl~i~~~ 377 (377) T protein:vir:98 364 KDNHTAALLTLAGG 377 (377) T ss_pred eccCcEEEEEEecC Confidence 8899977765 3 No 132 >protein:vir:9704 Length: 394 # NCBI annotation: hypothetical protein # Family: family:all:21 # MgeID: mge:174 # MgeName: 315.2 # Cross-refs: genbank:acc:NP_795466;genbank:gi:28876225;genbank:GeneID:1257769 Probab=95.86 E-value=0.0013 Score=36.33 Aligned_cols=261 Identities=15% Similarity=0.039 Sum_probs=126.2 Q ss_pred CCcccccccchhhhhhhHHHHHHHHHHHHhhhhhhhhhhhhccccCCCCcceeEEEEEEee-cccceEEecCcccccce- Q lcl|NC_019522. 1 MAKSVFDVSPVSALSFLVNQAAHIESEIYRIEYPQFKYGTLLPLDNSAPDWAQAVMFRSID-ARGELQLFGPNSTDVPT- 78 (311) Q Consensus 1 ~~~~~~~~~~~~~~~fl~~~L~~id~~v~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~-~~G~a~~~~~~a~dip~- 78 (311) -.....+..+.+++++++. +.+.+.|++........+.++.+.. -...+..+.+.. ..+.+.+++.++ ..|. T Consensus 123 ~~~~~~~~~t~~~gg~liP--~~~~~~ii~~~~~~~~l~~~~~~~~---~~~~~~~~~~~~~~~~~~~~v~E~~-~~~~~ 196 (394) T protein:vir:97 123 PVEPQKDGIKKENAKPVSS--EEILYTPAREVKTVVDLKPFTTVYQ---AKKASGKYPVLQRATTKMVTVAELE-KNPAL 196 (394) T ss_pred hhhhhccccccccccccCh--HHHHHHHHHHhhhhhhhhhhceeee---ccCcceEEEEEecCCCccceecccc-ccccc Confidence 0001111122333455655 4466778887776666666655432 222234444444 335667776654 3464 Q ss_pred eeeeccceeEEEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhhheeeeeccccceeeeecCCcceeeccC Q lcl|NC_019522. 79 VDIAMSQGFKDINTAALGYTYSIEEIGFAMLNNVNLDAERGQAVRDVVEQGLNKIYLLGDKGVGEGLYTSPNVSVEAATS 158 (311) Q Consensus 79 v~~~~~~~~~~v~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aa~~~~~~~~n~~~~~G~~~~g~GllN~p~v~~~~~~~ 158 (311) .+..++..+...+.++.-+.+|.+ |. ..+..++..--....++++...+|.-+++|.... .+. T Consensus 197 ~~~~~~~v~l~~~k~~~~i~is~e-ll--~ds~~~~~~~i~~~la~~~~~~~~~~i~~g~~~~--------------~~~ 259 (394) T protein:vir:97 197 AKPDFKDVAWNIDTYRGAIPLSQE-SI--DDADVDLVGIVSESISQIKVNTTNDAIAKVLKSF--------------TTK 259 (394) T ss_pred ccccceeEEeehhheeeehhhHHH-HH--hhhhHHHHHHHHHHHHHHHHHHHHHHHhhccccc--------------ccc Confidence 346778888888888888888854 33 2334567777778888888888888777764321 000 Q ss_pred CccccCcccccCCHHHHHHHHHHHHHHHHhccCCceecceEEEeCHHHHHHHhcccccCCCCCcchHHH-HHHHhCCceE Q lcl|NC_019522. 159 TFVALVAAIPTNGTQPIIDFFGNAYNTVYLDNTLTVHRPNTFVLPPAQFQLLARTLLSTQNASNVTLLQ-FLRTNFPDIT 237 (311) Q Consensus 159 ~~~~~~t~w~~~t~~ei~~di~~~~~~~~~~~~~~~~~p~~l~lpp~~~~~L~~~~~~~~~~~~~Tvl~-~l~~n~~~l~ 237 (311) ...+. +||.++++..... .+ .-.++|+|+.+..|.+-. +..|.-++. -+ .++..-+ T Consensus 260 ---------~~~~~----~~~~~~~~~~~~~-~~----~a~~v~n~~~~~~l~~lk----d~~G~~i~~~~~-~~~~~~~ 316 (394) T protein:vir:97 260 ---------TVKNL----DEIKALLNGGFDP-AY----NVSLIVSQSFYQTLDTLK----DGNGRYLLQDDI-TAVSGKV 316 (394) T ss_pred ---------ccccH----HHHHHHHHhhhhh-hh----CCEEEEcHHHHHHHHHhh----ccCCCeeeecCc-CCCCCce Confidence 01133 3444544433211 11 136899999999886532 111221110 00 0111111 Q ss_pred EEEchhc--ccCCCCcccEEEEEEcCcceeEEeecchhhhccceeeCCceEEEeeeeeeeeEEEECCeEEEEeecC Q lcl|NC_019522. 238 FEDDILL--KGAGVAGADRMAVYKKEIRIVKGHDVMPLRFLAPATADNVNFKVPAILRTGGTEWRIPKAGHYVDGV 311 (311) Q Consensus 238 i~~~~~l--~~ag~~g~~~~v~y~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~gGv~i~~P~ai~~~dGI 311 (311) |-..|-. ... ..|.+.+++-+-+ +.+-+..-..++... .........+.++.|+ |+.+.+|.+|+.++.= T Consensus 317 l~G~pv~~~~~~-~~~~~~~~~gd~~-~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~r~-d~~v~~~~a~~~~~~~ 388 (394) T protein:vir:97 317 LLGKPVFVLSDE-VLGANKAFIGDFK-RGVLFADRKDLGLRW-ADNEIYGQYLQAVLRF-GVSKVDDKAGYYVTFT 388 (394) T ss_pred eccceeEEeccc-ccCCccEEEeecc-ccEEEEEecceEEEE-ecccccceeEEEEEEE-ccEEecccceEEEEec Confidence 2111111 011 1122222211111 111111111111110 0111112234667887 4577799999988877 No 133 >protein:vir:962 Length: 397 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:19 # MgeName: bIL285 # Cross-refs: genbank:acc:NP_076616;genbank:gi:13095724;genbank:GeneID:920264 Probab=95.77 E-value=0.001 Score=36.92 Aligned_cols=262 Identities=11% Similarity=-0.026 Sum_probs=125.8 Q ss_pred CCcccccccchhhhhhhHHHHHHHHHHHHhhhhhhhhhhhhccccCCCCcceeEEEEEEeec-ccceEEecCcccccc-e Q lcl|NC_019522. 1 MAKSVFDVSPVSALSFLVNQAAHIESEIYRIEYPQFKYGTLLPLDNSAPDWAQAVMFRSIDA-RGELQLFGPNSTDVP-T 78 (311) Q Consensus 1 ~~~~~~~~~~~~~~~fl~~~L~~id~~v~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~-~G~a~~~~~~a~dip-~ 78 (311) +....++..+....+++.. +.+...+++. ......+..+.+ .+-...+..+.+... .+.+.+++..+. .| . T Consensus 127 ~~~~~~~~~~~~~~~~~vp--~~~~~~i~~~-~~~~~l~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~E~~~-~~~~ 199 (397) T protein:vir:96 127 KGAEKRDGFTSVEGGALIP--QELLQPQLEP-KDIVDLSKYVRS---VPVNSASGKFPVISKSGSKMATVQQLEK-NPQL 199 (397) T ss_pred hhhhhhhcccccccccchh--HHHHHHHHHh-hhhhhHHHhhhh---ccccccceeEEEEeccCCcccccccccc-cccc Confidence 2222222222233333332 3455556653 333333444433 223333455555443 345566666544 34 3 Q ss_pred eeeeccceeEEEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhhheeeeeccccceeeeecCCcceeeccC Q lcl|NC_019522. 79 VDIAMSQGFKDINTAALGYTYSIEEIGFAMLNNVNLDAERGQAVRDVVEQGLNKIYLLGDKGVGEGLYTSPNVSVEAATS 158 (311) Q Consensus 79 v~~~~~~~~~~v~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aa~~~~~~~~n~~~~~G~~~~g~GllN~p~v~~~~~~~ 158 (311) .+..++.....++.++.-..++.+=|.. +..++...-....++++...+|.-++.|..... + T Consensus 200 ~~~~~~~i~~~~~~~~~~~~~s~ell~d---s~~~l~~~i~~~l~~~~~~~~~~~i~~g~g~~~--------------~- 261 (397) T protein:vir:96 200 ANPKMVEIDYSVATRRGYIPISQEMIDD---ASYDVTGLIADEIQDQSLNTKNADIAAVLKTAT--------------A- 261 (397) T ss_pred ccccccceeecHhHhhcchhhHHHHHhh---hHHHHHHHHHHHHHHHHHHHHHHHHhhcccccc--------------c- Confidence 5667778788888888777777543333 344577777778888888888888887754311 0 Q ss_pred CccccCcccccCCHHHHHHHHHHHHHHHHhccCCceecceEEEeCHHHHHHHhcccccCCCCCcchHHH-HHHHhCC--- Q lcl|NC_019522. 159 TFVALVAAIPTNGTQPIIDFFGNAYNTVYLDNTLTVHRPNTFVLPPAQFQLLARTLLSTQNASNVTLLQ-FLRTNFP--- 234 (311) Q Consensus 159 ~~~~~~t~w~~~t~~ei~~di~~~~~~~~~~~~~~~~~p~~l~lpp~~~~~L~~~~~~~~~~~~~Tvl~-~l~~n~~--- 234 (311) . ...+ ++||.+++....... + .-.++|+|+.+..|.+-. +..|.-++. -+....+ T Consensus 262 ~--------~~~~----~d~~~~~~~~~~~~~-~----~a~~v~n~~~~~~l~~lk----d~~G~~~~~~~~~~~~~~~l 320 (397) T protein:vir:96 262 K--------SVVG----VDGLKDLINKEIKKV-Y----DVKLFISASMYSELDKLK----DKNGRYLLQDSITAASGKQL 320 (397) T ss_pred c--------cccc----hHHHHHHHHHhhhhh-c----CcEEEEcHHHHHHHHHhh----ccCCCeEeccCccCCCcccc Confidence 0 0112 344555554432221 1 236899999999986521 222222211 0111011 Q ss_pred -ceEEEEchhcccCCCCcccEEEEEEcCcceeEEeecchhhhccceeeCCceEEEeeeeeeeeEEEECCeEEEEeecC Q lcl|NC_019522. 235 -DITFEDDILLKGAGVAGADRMAVYKKEIRIVKGHDVMPLRFLAPATADNVNFKVPAILRTGGTEWRIPKAGHYVDGV 311 (311) Q Consensus 235 -~l~i~~~~~l~~ag~~g~~~~v~y~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~gGv~i~~P~ai~~~dGI 311 (311) ..-++.++.....+..| +..++|-+=.+.+.+..-+.+++...-+ ......+..+.|++ +.+++|.+++.+.-= T Consensus 321 ~G~pv~~~~~~~~~~~~~-~~~~~~gd~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~r~d-~~~~~~~a~~~~~~~ 395 (397) T protein:vir:96 321 LGKEVVVLDDDVIGKSVG-NVVGFIGDAKAFASFFDRKQVSVSWVDN-NIYGQLLAGIIRYD-VKATDKKAGFYVTFT 395 (397) T ss_pred cccceEEecccccCCCCC-ceEEEEeehhcceEeEeecceEEEEecc-cccceeEEEEEEEc-cEEecccceEEEEee Confidence 01122222221112223 3334444333333232223333321111 11223445667874 577899999988633 No 134 >protein:vir:102655 Length: 322 # NCBI annotation: Hypothetical protein # Family: family:all:6384 # MgeID: mge:1624 # MgeName: VP2 # Cross-refs: genbank:acc:YP_052979;genbank:gi:50282923;genbank:GeneID:2948122 Probab=95.30 E-value=0.0023 Score=35.01 Aligned_cols=281 Identities=9% Similarity=0.041 Sum_probs=126.1 Q ss_pred CCc-ccccccchhhhhhhHHHHHHHHHHHHhhhhhhhhhhhhccccCCCCcceeE---EEEEEee--cccce---EEecC Q lcl|NC_019522. 1 MAK-SVFDVSPVSALSFLVNQAAHIESEIYRIEYPQFKYGTLLPLDNSAPDWAQA---VMFRSID--ARGEL---QLFGP 71 (311) Q Consensus 1 ~~~-~~~~~~~~~~~~fl~~~L~~id~~v~~~~~~~~~~~~~~~v~~~~~~~~~~---~~~~~~~--~~G~a---~~~~~ 71 (311) |.+ +.|.++ -.-+|.......+.- ++.. +..+|-+-.+.-+....+ ..+...+ .+|+. +..+| T Consensus 7 ~~~~~~Ms~~--i~~~fv~qy~~~v~~-~~qq-----~~s~L~~tV~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d 78 (322) T protein:vir:10 7 MSMLPLIAGD--IDQAFVQTYETTLRI-LSQQ-----KSAKLKQYCQHKNESSESHNWETLASMDPDAVKRKRSRQQSAD 78 (322) T ss_pred eeeeeeeech--hhhHHHHHHHHHHHH-HHHH-----hhhhhhcccccccccccccceeecccccccccccccccccccC Confidence 444 455432 122454322222221 2222 223343332221122221 1111111 12222 23445 Q ss_pred cccccceeeeeccceeEEEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhhheeeee---ccccceeeeec Q lcl|NC_019522. 72 NSTDVPTVDIAMSQGFKDINTAALGYTYSIEEIGFAMLNNVNLDAERGQAVRDVVEQGLNKIYLLG---DKGVGEGLYTS 148 (311) Q Consensus 72 ~a~dip~v~~~~~~~~~~v~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aa~~~~~~~~n~~~~~G---~~~~g~GllN~ 148 (311) ..-|.|..+.........+..+..++.+...|.. ++..++...-.+++..+++++.|++++-| .+..+ . T Consensus 79 ~~~dtp~~~~~~~~r~~~~~d~~~~~~VDd~D~~---k~~~D~~~~~~~~~a~AL~R~~D~~I~~a~~g~a~~~-----~ 150 (322) T protein:vir:10 79 GTYPTPVNNKPFAKRRTNVDTYDTGHVVEQEDIS---QMLLDPNSALITSQAYAMARKTDDLIIAGAWKPASIK-----G 150 (322) T ss_pred cccCCCccccccceEEEeecccccceecchHHHH---HhhcCchHHHHHHHHHHhhhHHHHHHHhhhhcccccc-----c Confidence 5557787777666666777777766655544443 34566677788889999999999987753 33222 1 Q ss_pred CCcceeeccCCc-cccCcccccCCHHHHHHHHHHHHHHHHhccCCceecceEEEeCHHHHHHHhcc-cccCCCCCcchHH Q lcl|NC_019522. 149 PNVSVEAATSTF-VALVAAIPTNGTQPIIDFFGNAYNTVYLDNTLTVHRPNTFVLPPAQFQLLART-LLSTQNASNVTLL 226 (311) Q Consensus 149 p~v~~~~~~~~~-~~~~t~w~~~t~~ei~~di~~~~~~~~~~~~~~~~~p~~l~lpp~~~~~L~~~-~~~~~~~~~~Tvl 226 (311) ++.+.....+.. ..+++ .-| ++.|.++...+.+.+-- -+.+..++++|+.+..|..- ...+.++.+ - T Consensus 151 ~gt~v~~~ss~~i~~g~~---g~t----~~kl~~a~~~l~~~dvp-~d~~R~~vv~p~~~~~LL~d~~~ts~D~~~---~ 219 (322) T protein:vir:10 151 TGQPVEFLATQEIGDGTK---PIS----FDYVTEITERFLENEIE-PEVSKVIVIGPTQARKLLQITEATSADYTS---A 219 (322) T ss_pred cccccccCCCcccccCcc---chh----HHHHHHHHHHHHhcCCC-CCCCeEEEeCHHHHHHHhcchhhhhhhccc---c Confidence 111110000000 00000 111 34455665555433211 11234699999999988541 111122221 2 Q ss_pred HHHHHhCC-----ceEEEEchhcc------------cCCCCcccEEEEEEcCcceeEEeecchhhh-ccceeeCCceEEE Q lcl|NC_019522. 227 QFLRTNFP-----DITFEDDILLK------------GAGVAGADRMAVYKKEIRIVKGHDVMPLRF-LAPATADNVNFKV 288 (311) Q Consensus 227 ~~l~~n~~-----~l~i~~~~~l~------------~ag~~g~~~~v~y~~~~~~~~~~~~~~~~~-~~p~~~~~~~~~~ 288 (311) +.|..++. ..+|.....|. +...+.+...++|.++. +.+....+++. ......+...+.+ T Consensus 220 ~~l~~~G~ig~~lGf~~i~s~~lp~~~~t~~~~~~~~~~~~~~~~~~a~~k~A--v~~a~~~dv~~~i~~~~~~~~a~~I 297 (322) T protein:vir:10 220 MDLQSKGIITNWMGYTWIVSTRLDKFDPTQWGMAAEDGPQGDEIWCIAMTDMA--LGYHSCKDIWTKVAEDPSASFAWRI 297 (322) T ss_pred hhhhhcCeeeeeeeEEEEEeccCCccccccccccccCCCCccceeEEEEecCc--eeEEEeeeeeEEeeccCCcchhhhh Confidence 22333332 12333332221 11223345667777753 66655544322 1111222222345 Q ss_pred eeeeeeeeEEEECCeEEEEeecC Q lcl|NC_019522. 289 PAILRTGGTEWRIPKAGHYVDGV 311 (311) Q Consensus 289 ~~~~~~gGv~i~~P~ai~~~dGI 311 (311) .....+|. .+-+|..|+.++=- T Consensus 298 ~~~~~~Ga-~ri~~~gVv~i~~~ 319 (322) T protein:vir:10 298 YSAFTADC-VRVEDEHIFKLRLK 319 (322) T ss_pred hhhhhhCc-eEeccCcEEEEEEe Confidence 54455544 44477777666555 No 135 >protein:vir:95107 Length: 270 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1549 # MgeName: X2 # Cross-refs: genbank:acc:YP_240822;genbank:gi:66394683;genbank:GeneID:5133901 Probab=94.37 E-value=0.0046 Score=33.39 Aligned_cols=262 Identities=7% Similarity=-0.012 Sum_probs=125.1 Q ss_pred CCcccccccchhhhhhhHHHHHHHHHHHHhhhhhhhhhhhhccccCCCCc-ceeEEEEEEeecccceEEecCccccccee Q lcl|NC_019522. 1 MAKSVFDVSPVSALSFLVNQAAHIESEIYRIEYPQFKYGTLLPLDNSAPD-WAQAVMFRSIDARGELQLFGPNSTDVPTV 79 (311) Q Consensus 1 ~~~~~~~~~~~~~~~fl~~~L~~id~~v~~~~~~~~~~~~~~~v~~~~~~-~~~~~~~~~~~~~G~a~~~~~~a~dip~v 79 (311) ||+=.+ +.+ ...| -+-+-|.+.....+....+..+...+.. .-.++.+..++..|.++.+.++ ++|+.- T Consensus 1 Ma~T~~-----~d~--I~Pe--v~~~~V~e~~~~~~~~~~~~~~d~~L~g~~G~ti~~P~~~~igdae~~~eg-~~i~~~ 70 (270) T protein:vir:95 1 MTQTKK-----ANL--INPE--VLANVVSAQMQNAIRFTPYAVTDDTLVGQPGDTITRPKYAYIGAAEDLQEG-VAMDTT 70 (270) T ss_pred CCceeh-----hhh--cchH--HHHHHHHHHHHhHHhhccccccccccCCCCCCEEEeeeecCCCccccccCC-Cccchh Confidence 444221 111 0111 1112222222222333444444433322 3456788888999999998886 578888 Q ss_pred eeeccceeEEEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhhheeeeeccccceeeeecCCcceeeccCC Q lcl|NC_019522. 80 DIAMSQGFKDINTAALGYTYSIEEIGFAMLNNVNLDAERGQAVRDVVEQGLNKIYLLGDKGVGEGLYTSPNVSVEAATST 159 (311) Q Consensus 80 ~~~~~~~~~~v~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aa~~~~~~~~n~~~~~G~~~~g~GllN~p~v~~~~~~~~ 159 (311) .+..+.....+...+.+++++ |+++....|=|+ ..-.......++++.|+.++ +.+....+. . + T Consensus 71 ~lt~~~~~a~i~~~gk~~~it--D~a~~~~~~dp~-~~~~~q~a~~~a~~~d~~li---~~l~~a~~~-------~---~ 134 (270) T protein:vir:95 71 QMSMTTTKVTVKETGKAVEVT--QTAIITNVNGTL-QEASRQLAMSLADKVEIDYI---AELNKSKQT-------A---T 134 (270) T ss_pred hcccchheeeeehhhCcceec--HHHHhhhccchH-HHHHHHHHHHHHHHHHHHHH---HHhcccccc-------c---c Confidence 888899999998887766665 445444444444 44455567777777766543 111100000 0 0 Q ss_pred ccccCcccccCCHHHHHHHHHHHHHHHHhccCCceecceEEEeCHHHHHHHhcccccCCCCCcchHHHHHHHhCCceEEE Q lcl|NC_019522. 160 FVALVAAIPTNGTQPIIDFFGNAYNTVYLDNTLTVHRPNTFVLPPAQFQLLARTLLSTQNASNVTLLQFLRTNFPDITFE 239 (311) Q Consensus 160 ~~~~~t~w~~~t~~ei~~di~~~~~~~~~~~~~~~~~p~~l~lpp~~~~~L~~~~~~~~~~~~~Tvl~~l~~n~~~l~i~ 239 (311) ...+ .++|++++..+- .....++.|+++|..+..|.+-..-.....+ +-+..|+..-++. T Consensus 135 --------~~~t----~~~~~dA~~~lg----d~~~~~~~i~vhs~~~~~Lrk~~~~~~~~~~----~~~~~~G~ig~~~ 194 (270) T protein:vir:95 135 --------VSAD----ATGILDAIEVFN----SENDEDYVLYVNPKDYNKLVKSLFKVGGNVQ----DRAISKGDLVEIV 194 (270) T ss_pred --------cccC----HHHHHHHHHHhc----cccCCCcEEEEcHHHHHHHHhhhcccccccc----cchhcccccceec Confidence 0012 456677776652 2234578999999999998642211000111 1112233222222 Q ss_pred EchhcccCCCCcccEEEEEEcCcceeEEeecchhhhccceeeCCceEEEeeeeeeeeEEEECCeEEEEeecC Q lcl|NC_019522. 240 DDILLKGAGVAGADRMAVYKKEIRIVKGHDVMPLRFLAPATADNVNFKVPAILRTGGTEWRIPKAGHYVDGV 311 (311) Q Consensus 240 ~~~~l~~ag~~g~~~~v~y~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~gGv~i~~P~ai~~~dGI 311 (311) ..+-.-..+.-.+...+++. +.-+.+....+.+.-.-+........+-. -+..+|.+.+|..++.++== T Consensus 195 G~~Viv~s~~~~~~~~~l~~--~gAi~~~~~~~~~vEtdRd~~~~~d~i~~-~~~y~v~~~~~skvv~~t~~ 263 (270) T protein:vir:95 195 GVSDIVKSKRVSENTAFLQR--YGAMEIVNKKKPEAYTDFDILKRTHLLST-NYHYSVNLKDETGVVKVTFK 263 (270) T ss_pred ceeEEEeCCCCCceeEEEEe--ccceeeeecCCceeeeccchhhcccEEEe-eeEEEEEEEccceEEEEEec Confidence 22211111111112223322 33444444444322111111111222222 34468899999988876422 No 136 >protein:vir:99888 Length: 309 # NCBI annotation: capsid protein # Family: family:all:908 # MgeID: mge:1480 # MgeName: B3 # Cross-refs: genbank:acc:YP_164075;genbank:gi:56692607;genbank:GeneID:3192616 Probab=94.24 E-value=0.005 Score=33.22 Aligned_cols=272 Identities=13% Similarity=0.092 Sum_probs=122.1 Q ss_pred CCcccccccchhhhhhhHHHHHHHHHHHHhhhhhhhhhhhhccccCCCCcceeEEEEEEeecccceEEecCcccccceee Q lcl|NC_019522. 1 MAKSVFDVSPVSALSFLVNQAAHIESEIYRIEYPQFKYGTLLPLDNSAPDWAQAVMFRSIDARGELQLFGPNSTDVPTVD 80 (311) Q Consensus 1 ~~~~~~~~~~~~~~~fl~~~L~~id~~v~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~G~a~~~~~~a~dip~v~ 80 (311) |+...+-.++ .|+.+=... .-+++.+.++||.......+-....|...|..-....--....+.-.++ T Consensus 1 ~~~~~~~~dp---------~LT~~A~gy---~n~~~Ia~~l~P~vpV~~~~~~~~~f~~~e~F~~~~t~r~~~~~~~~v~ 68 (309) T protein:vir:99 1 MSNAPFPIDP---------ELTAIAIAY---RNGRMISDEVLPRVPVGKQEFKFWKYDLAQGFTVPETLVGRKSKPNEVE 68 (309) T ss_pred CCCCCcCcCH---------hHHHHHhhc---cChhhhhhhcCCccccCccccceeeechhhcccccchhhccCCCcceEe Confidence 4443333222 233333222 2344678888987644333333333332221101000001112233456 Q ss_pred eeccceeEEEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhh----hheeeeeccccceeeeecCCcceeec Q lcl|NC_019522. 81 IAMSQGFKDINTAALGYTYSIEEIGFAMLNNVNLDAERGQAVRDVVEQGL----NKIYLLGDKGVGEGLYTSPNVSVEAA 156 (311) Q Consensus 81 ~~~~~~~~~v~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aa~~~~~~~~----n~~~~~G~~~~g~GllN~p~v~~~~~ 156 (311) .........+...+.......+|...|. .++++.+...+.+...+...+ -++++.-. |.|.=...+. T Consensus 69 ~~~~~~~~~~~~~~L~~~i~~~~~~~a~-~~~d~~~~Av~~l~~~i~l~rE~~~A~lv~~~a--------~y~~~~k~~L 139 (309) T protein:vir:99 69 FSATDETGSTEDHGLDAPVPQADIDNAP-TNYNPLGHATEQTTNLILLDREARTSKLVFSPN--------SYAAGNKTTL 139 (309) T ss_pred ecccCceeeecccceeecCCchhhhhcc-CCCCHHHHHHHHHHHHHHHHHHHHHHHHhcChh--------hcCCCceEEe Confidence 6666666777777777777778877653 367766666555555444333 33333211 1111111122 Q ss_pred cCCccccCcccccCCHHHHHHHHHHHHHHHHhccCCceecceEEEeCHHHHHHHhc-----ccccCCC-CCcchHHHHHH Q lcl|NC_019522. 157 TSTFVALVAAIPTNGTQPIIDFFGNAYNTVYLDNTLTVHRPNTFVLPPAQFQLLAR-----TLLSTQN-ASNVTLLQFLR 230 (311) Q Consensus 157 ~~~~~~~~t~w~~~t~~ei~~di~~~~~~~~~~~~~~~~~p~~l~lpp~~~~~L~~-----~~~~~~~-~~~~Tvl~~l~ 230 (311) . ++..|.+.++| ++.||.+...++ + ..|++++|..+.|..|.+ ..+..+. ..+.--.+.|+ T Consensus 140 s-----gt~~wsd~~SD-Pi~~i~~~~~~~----g---~~PN~~vlg~~~~~~l~~hp~i~~~ik~~~~~~g~it~~~la 206 (309) T protein:vir:99 140 S-----GADQWSDPTSN-PLPVITDALDSV----I---LRPNIGVLGRRTATILRRHPKIVKAYNGSLGDEGMVPMAFLQ 206 (309) T ss_pred c-----CccccCCCCCC-cHHHHHHHHHhh----C---CCcceEEechHHHHHHhhCHHHHHHhcCCCccccccCHHHHH Confidence 2 23358876655 588998887765 1 379999999999988754 1111111 11122245555 Q ss_pred HhCCceEEEEchh-ccc--CCCC-------cccEEEEEEcCcc-eeEEeecchhhhcccee---eCCceEEEeeeeeeee Q lcl|NC_019522. 231 TNFPDITFEDDIL-LKG--AGVA-------GADRMAVYKKEIR-IVKGHDVMPLRFLAPAT---ADNVNFKVPAILRTGG 296 (311) Q Consensus 231 ~n~~~l~i~~~~~-l~~--ag~~-------g~~~~v~y~~~~~-~~~~~~~~~~~~~~p~~---~~~~~~~~~~~~~~gG 296 (311) +-+---+|..... ... .|.. |.+..++|....- .+. .| ++....+ .....+..+++..-|| T Consensus 207 ~l~~ve~V~vg~a~~n~a~~g~~~~~~~iwg~~~~L~y~~~~~~~~~----~p-s~G~t~~~~~r~~g~~~d~~~~~~g~ 281 (309) T protein:vir:99 207 ELLELDAIYIGEARLNIARPGQNPNLIRAWGPHASFIYRDRLADTRN----GT-TFGLTAQWGDRVSGSIADPNIGLRGG 281 (309) T ss_pred HHhCcceEEeecceeeccccccccccccccCCcEEEEEcCCCCCCcc----cc-cccceeecccccCCceeeeeeccCCc Confidence 5332112322111 111 1111 3455556654432 111 11 2222222 1222356666666665 Q ss_pred EEEEC-----CeEEEEeecC Q lcl|NC_019522. 297 TEWRI-----PKAGHYVDGV 311 (311) Q Consensus 297 v~i~~-----P~ai~~~dGI 311 (311) -.||- |.-++.--|- T Consensus 282 ~~vr~~~~~k~~i~~~d~G~ 301 (309) T protein:vir:99 282 QRVRVGESVKELVTAPDLGF 301 (309) T ss_pred eEEEEeccccchhcchhcch Confidence 43331 1111111121 No 137 >protein:vir:1084 Length: 437 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:21 # MgeName: bIL309 # Cross-refs: genbank:acc:NP_076738;genbank:gi:13095848;genbank:GeneID:920418 Probab=93.97 E-value=0.0058 Score=32.85 Aligned_cols=265 Identities=13% Similarity=0.056 Sum_probs=121.4 Q ss_pred CCc---ccccccchhhhhhhHHHHHHHHHHHHhhhhhhhhhhhhccccCCCCcceeEEEEEEeec-ccceEEecCccccc Q lcl|NC_019522. 1 MAK---SVFDVSPVSALSFLVNQAAHIESEIYRIEYPQFKYGTLLPLDNSAPDWAQAVMFRSIDA-RGELQLFGPNSTDV 76 (311) Q Consensus 1 ~~~---~~~~~~~~~~~~fl~~~L~~id~~v~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~-~G~a~~~~~~a~di 76 (311) +.. ..+...+....++++. +.+...|.+ ....-..+.++.+. +-......+.+... .+.+.+++..+ .+ T Consensus 148 ~~~~e~~~~~~~~~~~~g~lvp--~~~~~~i~~-~~~~~~l~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~e~~-~~ 220 (437) T protein:vir:10 148 LKTGEVRDVTGIALKDGKVIIP--ETILTPEKE-VHQFPRLGSLVRTE---SVTTTTGKLPIFNNSTDLLTAHTEYG-QT 220 (437) T ss_pred HHhhhhhhhhhcccccccccch--HHHHHHHHH-hhhhhhhhhcceeE---eeccCceeeEEeeccccccccccccc-cc Confidence 100 0111112223344553 223333433 22222334444332 22223344555443 34555655543 34 Q ss_pred ce-eeeeccceeEEEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhhheeeeeccccceeeeecCCcceee Q lcl|NC_019522. 77 PT-VDIAMSQGFKDINTAALGYTYSIEEIGFAMLNNVNLDAERGQAVRDVVEQGLNKIYLLGDKGVGEGLYTSPNVSVEA 155 (311) Q Consensus 77 p~-v~~~~~~~~~~v~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aa~~~~~~~~n~~~~~G~~~~g~GllN~p~v~~~~ 155 (311) |. .+..++..+...+.++.-+.+|.+=|+ ....+|..--....++++...+|.-+++|+.... |+ . T Consensus 221 ~e~~~~~~~~v~~~~~k~~~~~~is~ell~---ds~~~~~~~i~~~l~~~~~~~~~~~i~~g~g~~~------~~---~- 287 (437) T protein:vir:10 221 TKNATPVITPILWDLKTYTGGYVFSQELIS---DSSYDWQAELQSRLIELRDNTDDSLIITALTDGI------KK---T- 287 (437) T ss_pred cccccccceeeeeehhheeeehhhhHHHHh---hhHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccc------cc---c- Confidence 54 335678888888888888888854333 3345677778888889999999999998865310 00 0 Q ss_pred ccCCccccCcccccCCHHHHHHHHHHHHH-HHHhccCCceecceEEEeCHHHHHHHhcccccCCCCCcchHHHH-HHHh- Q lcl|NC_019522. 156 ATSTFVALVAAIPTNGTQPIIDFFGNAYN-TVYLDNTLTVHRPNTFVLPPAQFQLLARTLLSTQNASNVTLLQF-LRTN- 232 (311) Q Consensus 156 ~~~~~~~~~t~w~~~t~~ei~~di~~~~~-~~~~~~~~~~~~p~~l~lpp~~~~~L~~~~~~~~~~~~~Tvl~~-l~~n- 232 (311) ..+.+.+. +.+++. .+. ..+ ...-.++|+|+.+..|.+-. +..|.-++.- +... T Consensus 288 -----------~~~~~~~~----~~~~~~~~l~--~~~--~~~~~~~~~~~~~~~l~~lk----d~~g~~~~~~~~~~~~ 344 (437) T protein:vir:10 288 -----------TSTYLLGD----LKKVLNVTLK--PQD--SAAASIVMSQSAYNLFDMAT----DAMGRPLLQPNVTAAT 344 (437) T ss_pred -----------ccccchhh----HHHHHHhhhh--hhh--hcCCEEEEcHHHHHHHHHhh----ccCCCeeeccCccCCC Confidence 11122333 333333 221 111 11236899999999886532 1112222110 1111 Q ss_pred -----CCceEEEEchhcccCCCCcccEEEEEEcCcceeEEeecchhhhccceeeCCceEEEeeeeeeeeEEEECCeEEEE Q lcl|NC_019522. 233 -----FPDITFEDDILLKGAGVAGADRMAVYKKEIRIVKGHDVMPLRFLAPATADNVNFKVPAILRTGGTEWRIPKAGHY 307 (311) Q Consensus 233 -----~~~l~i~~~~~l~~ag~~g~~~~v~y~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~gGv~i~~P~ai~~ 307 (311) +.++.+...-.+. .+..|. ..++|-+=.+.+.+..-+.+++..-..............|+ ++.+..|.||+. T Consensus 345 ~~~l~G~pv~~~~~~~~~-~~~~~~-~~~~~gd~~~~~~~~~r~~~~~~~~~~~~~~~~~~~~~~r~-d~~~~~~~a~~~ 421 (437) T protein:vir:10 345 GYTLLGKTVVIVDDKLFP-SASAGD-VNIVVAPLKKAVINFKLTEITGQFQDTYDIWYKQLGIFLRQ-NVVQASKDLIVN 421 (437) T ss_pred CcccccceeEEecccccC-CcCCCc-eEEEEeeccccEEEEeeeceEEEEecccccccceeeEEEEE-ccEEecccceEE Confidence 1122222221111 122232 33344433333333222233221000011112233445676 566778999999 Q ss_pred eecC Q lcl|NC_019522. 308 VDGV 311 (311) Q Consensus 308 ~dGI 311 (311) +.|= T Consensus 422 l~~~ 425 (437) T protein:vir:10 422 LTGK 425 (437) T ss_pred EEee Confidence 9876 No 138 >protein:vir:1541 Length: 347 # NCBI annotation: major capsid protein 10A # Family: family:all:975 # MgeID: mge:31 # MgeName: phiYeO3-12 # Cross-refs: genbank:acc:NP_052109;swissprot:trembl:q9t107;genbank:gi:9634035;uniprot:Q9T107;genbank:GeneID:1262383 Probab=92.64 E-value=0.011 Score=31.42 Aligned_cols=292 Identities=15% Similarity=0.084 Sum_probs=124.1 Q ss_pred CCc----ccccccchhh------h-hhhHHHHHHHHHHHHhhhhhhhhhhhhccccCCCCcceeEEEEEEeecccceEE- Q lcl|NC_019522. 1 MAK----SVFDVSPVSA------L-SFLVNQAAHIESEIYRIEYPQFKYGTLLPLDNSAPDWAQAVMFRSIDARGELQL- 68 (311) Q Consensus 1 ~~~----~~~~~~~~~~------~-~fl~~~L~~id~~v~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~G~a~~- 68 (311) ||. +-..+...-+ . .|+....-.++..+ ...-..+.++.+++ +.. ..++.+.. .|..+. T Consensus 1 ma~~~~~~~~~t~~~~~~~~~~~~a~~ie~f~g~V~~~f----~~~s~~~~~~~~~~-~~~-G~sv~i~~---ig~~t~~ 71 (347) T protein:vir:15 1 MANIQGGQQIGTNQGKGQSAADKLALFLKVFGGEVLTAF----ARTSVTMPRHMLRS-IAS-GKSAQFPV---IGRTKAA 71 (347) T ss_pred CCccccCCccccccccCCCcchHHHHHHHHHHHHHHHHH----HHhhhhhhcccccc-ccc-cceeEeee---ccceeee Confidence 554 2122221111 1 23333333333322 22234455555543 222 33444433 343332 Q ss_pred -ecCcccccce--eeeeccceeEEEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhhheeee-----eccc Q lcl|NC_019522. 69 -FGPNSTDVPT--VDIAMSQGFKDINTAALGYTYSIEEIGFAMLNNVNLDAERGQAVRDVVEQGLNKIYLL-----GDKG 140 (311) Q Consensus 69 -~~~~a~dip~--v~~~~~~~~~~v~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aa~~~~~~~~n~~~~~-----G~~~ 140 (311) +.. ..+++. -+..-.+....|=.. .-+..-+.++..++ +..++-....+.+..++++..|+.++- .+.. T Consensus 72 ~~~~-g~~l~~~~~~~~~~e~~ltID~~-~~~~~~VddlD~~q-~~~D~~~~~~~~~g~aLA~~~D~~i~~~l~~~~~~~ 148 (347) T protein:vir:15 72 YLKP-GENLDDKRKDIKHTEKVIHIDGL-LTADVLIYDIEDAM-NHYDVRAEYTAQLGESLAMAADGAVLAELAGLVNLP 148 (347) T ss_pred eecc-CCCCCCCCCCCccceEEEEechh-hhhhHHhhhHHHHh-cCCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhcc Confidence 222 122221 112222222222111 12233446666644 566788888889999999999987762 1111 Q ss_pred c----ceeeeecCCcceeeccCCccccCcccccCCHHHHHHHHHHHHHHHHhccCCce-ecceEEEeCHHHHHHHhcccc Q lcl|NC_019522. 141 V----GEGLYTSPNVSVEAATSTFVALVAAIPTNGTQPIIDFFGNAYNTVYLDNTLTV-HRPNTFVLPPAQFQLLARTLL 215 (311) Q Consensus 141 ~----g~GllN~p~v~~~~~~~~~~~~~t~w~~~t~~ei~~di~~~~~~~~~~~~~~~-~~p~~l~lpp~~~~~L~~~~~ 215 (311) . +.+.+-++++...+. ...+...-+..+++.|++-|.++..+|.+.+ + .....++++|+.|..|.+-.. T Consensus 149 ~~~~~~~~~~g~~~~~~~~~---~~~~~~~~~~~~~~~i~d~~~~a~~~Lde~~---VP~~gR~~vv~P~~y~~LL~~~~ 222 (347) T protein:vir:15 149 DASNENIEGLGKPTVLTLVK---PTTGDLTDPVELGKAIIAQLTIARASLTKNY---VPAADRTFYTTPDNYSAILAALM 222 (347) T ss_pred ccccccccccCccccccccc---cccccchhhhhHHHHHHHHHHHHHHHHhhcC---CCccCCEEEeCHHHHHHHhcccc Confidence 1 112222222222211 1122222233457788888888887775542 3 123689999999998865321 Q ss_pred -cCCCCCcchHHH-HHHHhCCceEEEEchhcccCCC--------CcccE----------EEEE------EcCcceeEEee Q lcl|NC_019522. 216 -STQNASNVTLLQ-FLRTNFPDITFEDDILLKGAGV--------AGADR----------MAVY------KKEIRIVKGHD 269 (311) Q Consensus 216 -~~~~~~~~Tvl~-~l~~n~~~l~i~~~~~l~~ag~--------~g~~~----------~v~y------~~~~~~~~~~~ 269 (311) ...+..+...+. -.-.+--.++|..++.|...+. .|... ...| ..+++-+...- T Consensus 223 ~~~~d~~~~~~~~~G~Vg~i~G~~V~~Sn~lp~~~~t~~~~~~~~g~~~~~~~~~~~~~~~~f~~~~~l~~h~~A~g~v~ 302 (347) T protein:vir:15 223 PNAANYQALIDHERGTIRNVMGFEVVEVPHLTAGGAGDTREDAPADQKHAFPATSSTTVKVALDNVVGLFQHRSAVGTVK 302 (347) T ss_pred cccccccccccccceEEEEEeceEEEecccccccccccccccccccccccccccccceeeeccccceeeeeccceeeeeE Confidence 111111111111 0001112467777777743111 11110 1111 11122222211 Q ss_pred cchhhhccceeeCCceEEEeeeeeeeeEEEECCeEEEE--eecC Q lcl|NC_019522. 270 VMPLRFLAPATADNVNFKVPAILRTGGTEWRIPKAGHY--VDGV 311 (311) Q Consensus 270 ~~~~~~~~p~~~~~~~~~~~~~~~~gGv~i~~P~ai~~--~dGI 311 (311) .++++...-...+.....+...... |+-+.||.+++- +.+| T Consensus 303 ~~~~~~e~~~~~~~~~d~i~~~~~~-G~~vlrP~~av~~~~~~~ 345 (347) T protein:vir:15 303 LKDLALERARRANYQADQIIAKYAM-GHGGLRPEAAGAIVLPKV 345 (347) T ss_pred eeceeeeecccchhhhhhhehhhhc-CCceeccccEEEEecCCC Confidence 2222111101112222333444444 899999998764 4566 No 139 >protein:vir:3364 Length: 347 # NCBI annotation: major capsid protein 10A # Family: family:all:975 # MgeID: mge:67 # MgeName: T3 # Cross-refs: genbank:acc:NP_523335;genbank:gi:17570826;genbank:GeneID:927448 Probab=92.53 E-value=0.011 Score=31.32 Aligned_cols=290 Identities=16% Similarity=0.088 Sum_probs=126.2 Q ss_pred CCc-cccc---ccchhh------h-hhhHHHHHHHHHHHHhhhhhhhhhhhhccccCCCCcceeEEEEEEeecccceEE- Q lcl|NC_019522. 1 MAK-SVFD---VSPVSA------L-SFLVNQAAHIESEIYRIEYPQFKYGTLLPLDNSAPDWAQAVMFRSIDARGELQL- 68 (311) Q Consensus 1 ~~~-~~~~---~~~~~~------~-~fl~~~L~~id~~v~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~G~a~~- 68 (311) ||. ++-. +....+ . -|+....-.|+..+-+ .-..+.++.+++- - +-.++.+.. .|..+. T Consensus 1 ~~~~~~~~~~~t~~g~~~~~~~~~al~ie~~~g~V~~~f~~----~s~~~~~v~~r~~-~-~G~sv~i~~---iG~~t~~ 71 (347) T protein:vir:33 1 MANIQGGQQIGTNQGKGQSAADKLALFLKVFGGEVLTAFAR----TSVTMPRHMLRSI-A-SGKSAQFPV---IGRTKAA 71 (347) T ss_pred CCCCccCcccccccccCCcccchHHHHHHHHHHHHHHHHHH----HHhhhhhhccccc-c-ccceeEeee---ccceeee Confidence 662 2222 221111 1 2443333344443322 2245556655432 2 234444333 344332 Q ss_pred -ecCccccccee--eeeccceeEEEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhhheee-----eeccc Q lcl|NC_019522. 69 -FGPNSTDVPTV--DIAMSQGFKDINTAALGYTYSIEEIGFAMLNNVNLDAERGQAVRDVVEQGLNKIYL-----LGDKG 140 (311) Q Consensus 69 -~~~~a~dip~v--~~~~~~~~~~v~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aa~~~~~~~~n~~~~-----~G~~~ 140 (311) +..+ ++++.- +....+....|=.. .-+..-+.++..++ +..++-..-.+.+..++++..|+.++ .+... T Consensus 72 ~~~~g-~~l~~~~~~~~~~e~~ltiD~~-~y~~~~VddiD~~q-~~~D~~~~~~~~~g~aLA~~~D~~i~~~l~~~~~~~ 148 (347) T protein:vir:33 72 YLKPG-ENLDDKRKDIKHTEKVIHIDGL-LTADVLIYDIEDAM-NHYDVRAEYTAQLGESLAMAADGAVLAELAGLVNLP 148 (347) T ss_pred eecCC-CCCCCCCCCCccceEEEEechh-hhhhHHHhhHHHHh-cCCchhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhh Confidence 2221 222211 11112222211111 11123345666644 46677778888999999999999876 22221 Q ss_pred -c---ceeeeecCCcceeeccCCccccCcccccCCHHHHHHHHHHHHHHHHhccCCce-ecceEEEeCHHHHHHHhcc-c Q lcl|NC_019522. 141 -V---GEGLYTSPNVSVEAATSTFVALVAAIPTNGTQPIIDFFGNAYNTVYLDNTLTV-HRPNTFVLPPAQFQLLART-L 214 (311) Q Consensus 141 -~---g~GllN~p~v~~~~~~~~~~~~~t~w~~~t~~ei~~di~~~~~~~~~~~~~~~-~~p~~l~lpp~~~~~L~~~-~ 214 (311) . ..+.+..+........+ ++..+. +..+++.|++.|.++..+|.+.+ + .....++++|+.|..|.+- . T Consensus 149 ~~~~~~~~~~~~~~~~~~~~~~--tg~~~d-~~~~a~~i~~~i~~a~~~Lde~~---VP~~gR~~vv~P~~y~~Ll~~~~ 222 (347) T protein:vir:33 149 DGSNENIEGLGKPTVLTLVKPT--TGSLTD-PVELGKAIIAQLTIARASLTKNY---VPAADRTFYTTPDNYSAILAALM 222 (347) T ss_pred cccccccccccccccccccccc--cccccc-hhhhHHHHHHHHHHHHHHHhhcC---CCccCcEEEeCHHHHHHHhcccc Confidence 1 12233333322222221 222222 23578889999999988886542 3 1235799999999988542 1 Q ss_pred ccCCCCCcchHH-HHHHHhCCceEEEEchhcccCCC--------Cccc----------EEEE------EEcCcceeEEee Q lcl|NC_019522. 215 LSTQNASNVTLL-QFLRTNFPDITFEDDILLKGAGV--------AGAD----------RMAV------YKKEIRIVKGHD 269 (311) Q Consensus 215 ~~~~~~~~~Tvl-~~l~~n~~~l~i~~~~~l~~ag~--------~g~~----------~~v~------y~~~~~~~~~~~ 269 (311) ....+..+...+ .=.-.+--.++|..++.|...+. +|.. .... +...++-+...- T Consensus 223 ~~~~d~~~~~~~~~G~V~~i~G~~V~~Sn~lp~~~~~~~~~~~~ag~~~~~~~~~~~~~~~a~~~~~gl~~h~~A~g~v~ 302 (347) T protein:vir:33 223 PNAANYQALLDPERGTIRNVMGFEVVEVPHLTAGGAGDTREDAPADQKHAFPATSSTTVKVALDNVVGLFQHRSAVGTVK 302 (347) T ss_pred ccccccccccccccceeEEEeceeEEEecccccCccccccccccccccccccCCcccceeccccceeeeeecchhheeee Confidence 111111111110 00000112356777777743211 1110 0000 111111121111 Q ss_pred cchh--hhccceeeCCceEEEeeeeeeeeEEEECCeEEEE--eecC Q lcl|NC_019522. 270 VMPL--RFLAPATADNVNFKVPAILRTGGTEWRIPKAGHY--VDGV 311 (311) Q Consensus 270 ~~~~--~~~~p~~~~~~~~~~~~~~~~gGv~i~~P~ai~~--~dGI 311 (311) .+++ +... ..+.....+...... |+-+.||.+++- +.|| T Consensus 303 ~~~~~~e~~r--~~~~~~d~i~~~~~~-G~~vlrP~~av~i~~~~~ 345 (347) T protein:vir:33 303 LKDLALERAR--RANYQADQIIAKYAM-GHGGLRPEAAGAIVLPKV 345 (347) T ss_pred eeceeeeecc--chhhhhHhhhhhhhc-CCceecccceEEEecCCC Confidence 1221 1111 112222334444554 889999998764 4566 No 140 >protein:vir:107882 Length: 307 # NCBI annotation: gp34 # Family: family:all:908 # MgeID: mge:1565 # MgeName: BcepMu # Cross-refs: genbank:acc:YP_024707;genbank:gi:48696944;genbank:GeneID:2845970 Probab=92.08 E-value=0.013 Score=30.93 Aligned_cols=274 Identities=11% Similarity=0.059 Sum_probs=110.9 Q ss_pred cccccccchhhhhhhHH-HHHHHHHHHHhhhhhhhhhhhhccccCCCCcceeEEEEEEeecccceE-EecCcccccceee Q lcl|NC_019522. 3 KSVFDVSPVSALSFLVN-QAAHIESEIYRIEYPQFKYGTLLPLDNSAPDWAQAVMFRSIDARGELQ-LFGPNSTDVPTVD 80 (311) Q Consensus 3 ~~~~~~~~~~~~~fl~~-~L~~id~~v~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~G~a~-~~~~~a~dip~v~ 80 (311) +++|. -.|.+. .|+.+=-.- . -+++.+.++||.......+.....|.- +..-... ..+-++ +.-+++ T Consensus 1 m~~~~------~~~~~dp~LT~~A~gy-~--n~~~ia~~l~P~vpv~~~~~k~~~f~~-eaF~~~~t~r~~~~-~~~~v~ 69 (307) T protein:vir:10 1 MGRLS------KLRIVDPVLTNLAIGY-T--NAEFIGQSLMPVVEVEKEGGKIPKFGK-ESFRLYKTERALRA-RSNRMN 69 (307) T ss_pred CCCCC------CCcccChhHHHHHHhh-c--chhhhhhhcCCcccccccccceeeECc-ccccchhhhcccCC-Ccceee Confidence 33332 123332 233332211 1 235788888887654444444333321 1100000 000000 001111 Q ss_pred eec-cceeEEEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHH----hhhheeeeeccccceeeeecCCcceee Q lcl|NC_019522. 81 IAM-SQGFKDINTAALGYTYSIEEIGFAMLNNVNLDAERGQAVRDVVEQ----GLNKIYLLGDKGVGEGLYTSPNVSVEA 155 (311) Q Consensus 81 ~~~-~~~~~~v~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aa~~~~~~----~~n~~~~~G~~~~g~GllN~p~v~~~~ 155 (311) ... +.....+...+..+-... ++....+.++.++..+.+...+.. ..-++++... |.|.-...+ T Consensus 70 ~~~~~~~~~~~~~~~L~~~id~---r~~~~~~~~~~~~av~~l~d~I~l~~E~~~A~l~~~~~--------~y~~~~k~t 138 (307) T protein:vir:10 70 PEDLGSIDIVLDEHDLEYPIDY---REDQESAFPLEQAAVQTATEAIQLRREKMVADLAQNPN--------SYAGGNKKQ 138 (307) T ss_pred cccccccccccccccccccCCh---hhcCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHhcCcc--------ccCCCceEE Confidence 110 111111111111111211 234455666666555555444433 3344544322 112111222 Q ss_pred ccCCccccCcccccCCHHHHHHHHHHHHHHHHhccCCceecceEEEeCHHHHHHHhc-----ccccCCCCCcchHHHHHH Q lcl|NC_019522. 156 ATSTFVALVAAIPTNGTQPIIDFFGNAYNTVYLDNTLTVHRPNTFVLPPAQFQLLAR-----TLLSTQNASNVTLLQFLR 230 (311) Q Consensus 156 ~~~~~~~~~t~w~~~t~~ei~~di~~~~~~~~~~~~~~~~~p~~l~lpp~~~~~L~~-----~~~~~~~~~~~Tvl~~l~ 230 (311) +. ++..|.++++| ++.||.+.+.++...++ ..|++++|..+.+..|.+ .++.++....+|. +.|+ T Consensus 139 Ls-----Gt~~Wsd~~sD-Pi~di~~~~~ai~~~~g---~~Pn~~vlg~~a~~al~~hp~i~e~lk~~~~g~it~-~~la 208 (307) T protein:vir:10 139 LS-----ATEKFTAAGSD-PVGVIEDGKEAIRTKIG---RRPNTMVIGASAYKTLKAHPQLIEKIKYSMKGIVTV-DLLK 208 (307) T ss_pred ec-----cccccCCCCCC-cHHHHHHHHHHHHhhhC---CccceEEeCHHHHHHHhcCHHHHHHhCCccccccCH-HHHH Confidence 22 23469887654 59999999999976543 369999999999998854 1111222222222 2233 Q ss_pred HhCCceEEEEchhc--ccCC-----CCcccEEEEEEcCcce-eEEeecchhhhccceeeCCceEEEeeeeeeeeEEEEC- Q lcl|NC_019522. 231 TNFPDITFEDDILL--KGAG-----VAGADRMAVYKKEIRI-VKGHDVMPLRFLAPATADNVNFKVPAILRTGGTEWRI- 301 (311) Q Consensus 231 ~n~~~l~i~~~~~l--~~ag-----~~g~~~~v~y~~~~~~-~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~gGv~i~~- 301 (311) + ...++.+.+-+- ..+. .-|.+..++|...... -...+..| ++..-.+..+..+..++.+ .+|+++.| T Consensus 209 ~-ll~v~~i~vg~a~~~~~~~~~~~iw~~~~vl~yv~~~~~~~~~~~~ep-sfGyT~~~~g~~~~d~~~~-~~~~~~~r~ 285 (307) T protein:vir:10 209 E-IFEVENIAVGEAIYADDKDRFTDIWGANIVLAYVPLQRGGQQRTPYEP-SYGYTLRKKGNPVVDTRIE-DGKLELVRS 285 (307) T ss_pred H-HhCceeEEEeeeeeeccCCccceeCCCceEEEecccccCCCCCccccc-ccceeEEEcCCeEeeceec-CCceeEEec Confidence 2 222332222111 1111 1134555566533211 11111222 2333334445555555555 35555442 Q ss_pred -----CeEEEEeec-----C Q lcl|NC_019522. 302 -----PKAGHYVDG-----V 311 (311) Q Consensus 302 -----P~ai~~~dG-----I 311 (311) |.-++.--| + T Consensus 286 ~~~~~~~i~~~~~G~li~~~ 305 (307) T protein:vir:10 286 TDIFRPYLLGADAGYLISGI 305 (307) T ss_pred cccccceeecccccceeccC Confidence 332222223 2 No 141 >protein:vir:79928 Length: 393 # NCBI annotation: major head protein # Family: family:all:30335 # MgeID: mge:1874 # MgeName: 0305phi8-36 # Cross-refs: genbank:acc:YP_001429616;genbank:gi:156564106;genbank:GeneID:5525693 Probab=89.52 E-value=0.026 Score=29.29 Aligned_cols=286 Identities=12% Similarity=0.067 Sum_probs=148.3 Q ss_pred CCccc----ccc---cchhhhhhhHHHHHHHHHHHHhhhhhhhhhhhhccccCCCCcceeEEEEEEeecccceEEecCcc Q lcl|NC_019522. 1 MAKSV----FDV---SPVSALSFLVNQAAHIESEIYRIEYPQFKYGTLLPLDNSAPDWAQAVMFRSIDARGELQLFGPNS 73 (311) Q Consensus 1 ~~~~~----~~~---~~~~~~~fl~~~L~~id~~v~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~G~a~~~~~~a 73 (311) |++-+ .+. .+-....+++. .-|...+.|...|-.-...++...+ ...| ++..|..+. +=.+.-++++. T Consensus 59 m~G~~p~~eV~~~e~mtt~~a~IliP--~vis~v~~Eaaepl~~~~kl~qk~~-L~~G-rsm~F~~~g-~~Ra~~IgEGg 133 (393) T protein:vir:79 59 MEGETPTNEVNLREFMATPSAQILIP--RVIVGTMREAAEPLYIGTKMLQKIR-LKSG-QSMIFPSIG-IMRAYDVAEGQ 133 (393) T ss_pred hcCCCchhheehhhhhcCCCcceech--hhhhhhhhhcccchhHHHHHHHHHh-hhcC-cceeccchh-eeeeccccccc Confidence 43311 111 11112223342 3345555554444333333333211 1111 122222111 11223344432 Q ss_pred cccceeeee---ccceeEEEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhhheeeeeccccce----eee Q lcl|NC_019522. 74 TDVPTVDIA---MSQGFKDINTAALGYTYSIEEIGFAMLNNVNLDAERGQAVRDVVEQGLNKIYLLGDKGVGE----GLY 146 (311) Q Consensus 74 ~dip~v~~~---~~~~~~~v~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aa~~~~~~~~n~~~~~G~~~~g~----Gll 146 (311) -+|..+.+ .+.....+-..+..+.|+.+=+. ..|.+|-.--..+|-|+++++.+..+|+|+...+. |+. T Consensus 134 -E~~~~sld~~T~dsv~~~~gK~G~~Ia~SqEmIs---DSg~Dvin~~l~aA~RaMaRkKee~a~n~fk~~ghtvfDa~s 209 (393) T protein:vir:79 134 -EIPEDSIDWQTHESPEIRVGKSGIRLRFTDEMIS---DSQWDLMSMMIKQAGRAMGRHKEQKAYHQFRSHGHTVFDNYS 209 (393) T ss_pred -cccccchhhhcCCceeEEechhhhhhhhHHHHhh---cchHHHHHHHHHHHHHHHHhhhHHHHHhhhhcccceeeeccc Confidence 24443333 45566667778888888865443 46788888889999999999999999999988763 555 Q ss_pred ecCCcceeeccCCccccCcccccCCHHHHHHHHHHHHHHHHhccCCceecceEEEeCHHHHHHHhcc------cccCCCC Q lcl|NC_019522. 147 TSPNVSVEAATSTFVALVAAIPTNGTQPIIDFFGNAYNTVYLDNTLTVHRPNTFVLPPAQFQLLART------LLSTQNA 220 (311) Q Consensus 147 N~p~v~~~~~~~~~~~~~t~w~~~t~~ei~~di~~~~~~~~~~~~~~~~~p~~l~lpp~~~~~L~~~------~~~~~~~ 220 (311) ..|-.-.+--.-++. -...-.++||.++.-++. .+++ .|++|.|.|-.|+.+.+- +.+.-++ T Consensus 210 t~t~ahptGr~~~~~--------qNGTlSleDllDm~~av~-~~hy---t~svi~MHPLAWnv~AKna~me~~~~na~gN 277 (393) T protein:vir:79 210 TNKLAHTTGLDKNGV--------QNDTFSAEDFLDLIIAVM-ANEY---TPSDLMMHPLAWTVFAKNELMGSLQANPYGN 277 (393) T ss_pred cCccceeecCCcccc--------ccccccHHHHHHHHHHHh-cccC---CcceEEEcCchhhhhhhhhhhcceeeccccc Confidence 444332221111111 111123678888877774 3443 589999999999887541 1111011 Q ss_pred Ccc--------hHHHHHHHhCC-ceEEEEchhc--ccCCCCcccEEEEEEcCcceeEEeecch-hhhccceeeCCceEEE Q lcl|NC_019522. 221 SNV--------TLLQFLRTNFP-DITFEDDILL--KGAGVAGADRMAVYKKEIRIVKGHDVMP-LRFLAPATADNVNFKV 288 (311) Q Consensus 221 ~~~--------Tvl~~l~~n~~-~l~i~~~~~l--~~ag~~g~~~~v~y~~~~~~~~~~~~~~-~~~~~p~~~~~~~~~~ 288 (311) ++. -.-+-|+...| |++|.-.|-. ..+ ..|+-.|.-|..+++..+.-+ ++.-.--.+-..-..+ T Consensus 278 ~~~~~~~ts~algp~~i~~~~~~nlnv~~sPfvp~d~k----~~rFd~~~Vd~NnvgvlLV~D~i~tdq~ddk~rdiq~i 353 (393) T protein:vir:79 278 YPAKGAPSSMALGPDSIQGRLPFNFNVNLSPFIPLDKK----SRRFDVYAVDRNNVGVLLVRDDLKTDQWDEKARGLQNI 353 (393) T ss_pred cCccccchhhhhchhhhccccccceeEEEecccccccc----cceeeEEEeecCCceEEEEecCcceeccccccccceee Confidence 110 11122333323 4677666633 222 356666766667766654433 2211101112223678 Q ss_pred eeeeeeeeEEEECCeEEEEeecC Q lcl|NC_019522. 289 PAILRTGGTEWRIPKAGHYVDGV 311 (311) Q Consensus 289 ~~~~~~gGv~i~~P~ai~~~dGI 311 (311) +..+|.|=-++---.+|+....| T Consensus 354 Kl~ERYG~gvLn~gkaiavakNI 376 (393) T protein:vir:79 354 KMIERYGIGILNEGKAIAVAKNI 376 (393) T ss_pred eeeeeeceeeeeCCceEEEEecc Confidence 88899753378888899999888 No 142 >protein:vir:8885 Length: 347 # NCBI annotation: major capsid protein A # Family: family:all:975 # MgeID: mge:161 # MgeName: gh-1 # Cross-refs: genbank:acc:NP_813774;genbank:gi:29366729;genbank:GeneID:1258837 Probab=86.90 E-value=0.042 Score=28.11 Aligned_cols=291 Identities=12% Similarity=0.040 Sum_probs=124.0 Q ss_pred CCcccc-cc----------cchhhhh-hhHHHHHHHHHHHHhhhhhhhhhhhhccccCCCCcceeEEEEEEeecccceEE Q lcl|NC_019522. 1 MAKSVF-DV----------SPVSALS-FLVNQAAHIESEIYRIEYPQFKYGTLLPLDNSAPDWAQAVMFRSIDARGELQL 68 (311) Q Consensus 1 ~~~~~~-~~----------~~~~~~~-fl~~~L~~id~~v~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~G~a~~ 68 (311) ||...- .. +++ ..+ |+....-.++..+- ..-..+.++.+++ +- +..++.|.. .|..+. T Consensus 1 ~a~~~~~~~~~~~~g~~~~~~d-~~al~ie~~~geV~~~f~----~~s~~~~~~~~r~-i~-~G~sv~~~~---iG~~~~ 70 (347) T protein:vir:88 1 MANATGGQQIGANQGKGQSAAD-KLALFLKVFGGEVLTAFV----RRSVTMDKHMVRT-IQ-NGKSASFPV---MGRTKG 70 (347) T ss_pred CCCcccchhhhccCCCCccccc-hHHHHHHHHHHHHHHHHH----HHhhhhhcccccc-cc-CcceEEEee---ecceee Confidence 664221 10 111 122 44333334444332 2235566666654 22 244554443 444443 Q ss_pred ec-Cccccc--ceeeeeccceeEEEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhhheeeee------cc Q lcl|NC_019522. 69 FG-PNSTDV--PTVDIAMSQGFKDINTAALGYTYSIEEIGFAMLNNVNLDAERGQAVRDVVEQGLNKIYLLG------DK 139 (311) Q Consensus 69 ~~-~~a~di--p~v~~~~~~~~~~v~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aa~~~~~~~~n~~~~~G------~~ 139 (311) .. ..++++ |..++.-++....|-.. .-+..-+.++.. .++..++-.+-.+.+..++++..|+.++.- .+ T Consensus 71 ~~~~~g~~l~~~~~~~~~~~~~i~ID~~-~y~~~~Vdd~D~-~q~~~D~r~~~~~~~g~aLA~~~D~~i~~~l~~~a~~~ 148 (347) T protein:vir:88 71 YYLAPGENLDDKRKDIKHSEKVIQIDGL-LTSDVLIYDIED-AMNHYDVRAEYSAQLGEALAIAADGAVLAEMAKLCNLP 148 (347) T ss_pred eeeccccCCCCCCCCCccceEEEEEech-hhhhhhhhhHHH-HhhcCCchHHHHHHHHHHHHHHHHHHHHHHHHHhhccc Confidence 11 112222 22233333333333222 112334456665 344566777778888888888888877521 11 Q ss_pred ccceeeeecCCcceeeccCCccccCcccccCCHHHHHHHHHHHHHHHHhccCCcee-cceEEEeCHHHHHHHhcccccCC Q lcl|NC_019522. 140 GVGEGLYTSPNVSVEAATSTFVALVAAIPTNGTQPIIDFFGNAYNTVYLDNTLTVH-RPNTFVLPPAQFQLLARTLLSTQ 218 (311) Q Consensus 140 ~~g~GllN~p~v~~~~~~~~~~~~~t~w~~~t~~ei~~di~~~~~~~~~~~~~~~~-~p~~l~lpp~~~~~L~~~~~~~~ 218 (311) ....+. .+++...+....++++...-+.++++-+++.|.++...+.+.+ +- ....++|+|+.|..|.+...... T Consensus 149 ~~~~~~--~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~a~~~Lde~~---VP~~gR~~vv~P~~y~~Ll~~~~~~~ 223 (347) T protein:vir:88 149 AASNEN--IAGLGQAVVLNIGAAADLVDVEARGKAILKGLTLARARLTKNY---VPAGDRRFYCAPEDYSAILSALMPNA 223 (347) T ss_pred cccccc--cCCccccccccccccccccchhhhHHHHHHHHHHHHHHHhhcC---CCCCCCEEEeCHHHHHHHhcchhhhh Confidence 100000 0011111111122222222345567778888988888875432 31 24689999999987754221111 Q ss_pred CCCcchHHHHHH---HhCCceEEEEchhcccCCCCcccEE----------EE--------EEcCccee-EE--------- Q lcl|NC_019522. 219 NASNVTLLQFLR---TNFPDITFEDDILLKGAGVAGADRM----------AV--------YKKEIRIV-KG--------- 267 (311) Q Consensus 219 ~~~~~Tvl~~l~---~n~~~l~i~~~~~l~~ag~~g~~~~----------v~--------y~~~~~~~-~~--------- 267 (311) ..+ .+..++-. .+...++|..++.+.. +..+..++ .. |.-+..+. .+ T Consensus 224 ~~~-~~~~~~~~G~vg~i~G~~V~~s~nlp~-~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~~d~~~~~~l~~~~~a~g~ 301 (347) T protein:vir:88 224 ANY-AALIDPETGNIRNVMGFEVIEVPHLTV-GGAGDNNPADGVAPTNQKHIFPATATGDDRVAQNNVVGLFNHRSAVGT 301 (347) T ss_pred hhh-ccccchhcceeeeeccceEEEeecccc-cccccccccccccccccccccccccccccccccCcEEEEEechhhhhh Confidence 111 11111111 1112356666666631 11111110 00 11111111 11 Q ss_pred eecchhhhccceeeCCceEEEeeeeeeeeEEEECCeEEEEe-ecC Q lcl|NC_019522. 268 HDVMPLRFLAPATADNVNFKVPAILRTGGTEWRIPKAGHYV-DGV 311 (311) Q Consensus 268 ~~~~~~~~~~p~~~~~~~~~~~~~~~~gGv~i~~P~ai~~~-dGI 311 (311) .-.++++.-.-...+.....+.+.... |+-+.||++++.+ ... T Consensus 302 v~~~d~~~e~~r~~~~~~d~i~~~~~~-G~~~~rPe~a~~~~~~~ 345 (347) T protein:vir:88 302 VKLKDMALERARRPEFQADQIIGKYAM-GHGGLRPEAAGALVFTP 345 (347) T ss_pred eecccceeeeeechhhHHHHhhhhhhh-cCceeccceEEEEEeCC Confidence 111121111012223334556666665 7889999966443 233 No 143 >protein:vir:102823 Length: 470 # NCBI annotation: major structural protein # Family: family:all:2450 # MgeID: mge:1610 # MgeName: YS40 # Cross-refs: genbank:acc:YP_874086;genbank:gi:118197693;genbank:GeneID:4496015 Probab=85.48 E-value=0.053 Score=27.60 Aligned_cols=272 Identities=11% Similarity=0.012 Sum_probs=118.9 Q ss_pred CCcccc---cccchhhhhhhHHHHHHHHHHHHhhhh--hhhhhhhhccccCCCCcceeEEEEEEeecccceE--EecCcc Q lcl|NC_019522. 1 MAKSVF---DVSPVSALSFLVNQAAHIESEIYRIEY--PQFKYGTLLPLDNSAPDWAQAVMFRSIDARGELQ--LFGPNS 73 (311) Q Consensus 1 ~~~~~~---~~~~~~~~~fl~~~L~~id~~v~~~~~--~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~G~a~--~~~~~a 73 (311) +-++.. +++-..|.+ +.+ +.+|+++...-+ .+++.-.-++ +.+.......++. .++..|+.- .++ .. T Consensus 9 ~~~a~~~al~~a~~~g~A-lR~--EsLd~~l~~lt~~~~~ftf~~~i~-k~~a~STV~ey~~-~~~rhG~~g~s~~~-E~ 82 (470) T protein:vir:10 9 LDEATLKALNAAGQVAES-LER--EDLEPEVTQLNVLDTPLTDLLSKN-AVKAKAYEHEYNV-VTARHDKIGYAAFR-EG 82 (470) T ss_pred hhHHHHHHHHHhhhcchh-hhh--hhhccceeEeeecCccchhhhhcC-CchhhhHhhhhhh-hccccccccceeec-cc Confidence 222222 222222323 343 456665544322 3344444444 2233333332221 123333332 233 33 Q ss_pred cccceeeeeccceeEEEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhhheeeeeccccc----------- Q lcl|NC_019522. 74 TDVPTVDIAMSQGFKDINTAALGYTYSIEEIGFAMLNNVNLDAERGQAVRDVVEQGLNKIYLLGDKGVG----------- 142 (311) Q Consensus 74 ~dip~v~~~~~~~~~~v~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aa~~~~~~~~n~~~~~G~~~~g----------- 142 (311) ...+..|.++.+....+..++.+.+.+...+...+..=.++....-+.|.-.+++......||||+.+. T Consensus 83 ~l~~~~d~~~~Rr~v~~K~l~~~~~VT~~a~~~~~n~v~d~~~~~~~dai~~ia~tiE~a~FyGDs~l~s~~~g~~~gle 162 (470) T protein:vir:10 83 GLPRTVEVNVVRRRIRPMLVGHRITVTELATRTTQNGVMQIDELVKREKMIAVANEFEYLAFYGDNLLGDDVPGSPNNLQ 162 (470) T ss_pred ccCccCCCceEEEEEEEEEEeecchhhhhhhhhhhccccchHHHHHHHHHHHHHHHHHhhhhhhccccccccCcccCcee Confidence 444557888888888899999998888776655333333788888889999999999999999988642 Q ss_pred e-eeeecCCcceeeccCCccccCcccccCCHHHHHHHHHHHHHHHHhccCCceecceEEEeCHHHHHHHhcccccCCCCC Q lcl|NC_019522. 143 E-GLYTSPNVSVEAATSTFVALVAAIPTNGTQPIIDFFGNAYNTVYLDNTLTVHRPNTFVLPPAQFQLLARTLLSTQNAS 221 (311) Q Consensus 143 ~-GllN~p~v~~~~~~~~~~~~~t~w~~~t~~ei~~di~~~~~~~~~~~~~~~~~p~~l~lpp~~~~~L~~~~~~~~~~~ 221 (311) + ||.|--.....-..-+..|+. +. .+.|+++-..+.. +.+.-.|+-++||......|..-+.+.. T Consensus 163 FDGl~~lId~~~~~NViDarG~~--Ls-------~~~L~~aa~~I~~--~~~fGt~TD~~lp~~vka~f~~~~~~~q--- 228 (470) T protein:vir:10 163 QDGIINIIKRGAPQNVLDAGGRP--LS-------IDLLWEAESRVVS--TQAFANPTAVFISYVDKLNLQASFYQIS--- 228 (470) T ss_pred ccchhhhccCCCCccccccCCCC--cc-------HHHHHHHHhhhcc--cccccChhhhccchhHHHHHHHhhcCce--- Confidence 2 553311110000111111111 11 4667777766632 2334568999999999998875332210 Q ss_pred cchHHHHHHHhCCc-eE-EEEchhcccCCCCcccEEEEEEcCcceeEEe--ecchh-hhccceeeCCceEEEeeeeeeee Q lcl|NC_019522. 222 NVTLLQFLRTNFPD-IT-FEDDILLKGAGVAGADRMAVYKKEIRIVKGH--DVMPL-RFLAPATADNVNFKVPAILRTGG 296 (311) Q Consensus 222 ~~Tvl~~l~~n~~~-l~-i~~~~~l~~ag~~g~~~~v~y~~~~~~~~~~--~~~~~-~~~~p~~~~~~~~~~~~~~~~gG 296 (311) ..+..++++ .. =..++.+.++ .-.+.++ ..||. ...-|.+-+.-.-.++.-.-++- T Consensus 229 -----Rv~~~~N~~~~~~G~~v~~f~sa--------------~G~I~L~~s~~m~~~~k~~p~~l~~~v~~~aAP~~~~t 289 (470) T protein:vir:10 229 -----RVMTTADRRAGLLGADAQSYIGV--------------RGEHSLYPSQFLGDFHKFNPARFGAEVGDFAAPSNSWT 289 (470) T ss_pred -----EEEEecCCCceeeeeeccceeee--------------eeeeeecccccccchhhcCcccCCcccCCcccCceeEE Confidence 011111111 11 0111222211 1112221 11110 11111111000000111111222 Q ss_pred EEEECCeE-EEEeecC Q lcl|NC_019522. 297 TEWRIPKA-GHYVDGV 311 (311) Q Consensus 297 v~i~~P~a-i~~~dGI 311 (311) |.-.-|.+ ..+-+|- T Consensus 290 v~~t~~~~a~~~~sk~ 305 (470) T protein:vir:10 290 VSTTDNFVTLPYNSGL 305 (470) T ss_pred eecCCCceeecccCCC Confidence 22222222 1111222 No 144 >protein:vir:10450 Length: 344 # NCBI annotation: major capsid protein # Family: family:all:975 # MgeID: mge:184 # MgeName: phiA1122 # Cross-refs: genbank:acc:NP_848297;genbank:gi:30387487;genbank:GeneID:1733971 Probab=83.08 E-value=0.071 Score=26.87 Aligned_cols=292 Identities=13% Similarity=0.039 Sum_probs=120.9 Q ss_pred CCcc-cc---ccc--ch-----hhhh-hhHHHHHHHHHHHHhhhhhhhhhhhhccccCCCCcceeEEEEEEeecccceEE Q lcl|NC_019522. 1 MAKS-VF---DVS--PV-----SALS-FLVNQAAHIESEIYRIEYPQFKYGTLLPLDNSAPDWAQAVMFRSIDARGELQL 68 (311) Q Consensus 1 ~~~~-~~---~~~--~~-----~~~~-fl~~~L~~id~~v~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~G~a~~ 68 (311) ||+. +. +.. .+ +..+ |+....-.|+..+-+ .-..+.++.+++ +. +-.++.+.. .|..+. T Consensus 1 ma~~~~~~~~n~~~~~~~~~~~~~~al~ie~~~geV~~~f~~----~s~~~~~~~~r~-i~-~g~s~~~~~---iG~~~~ 71 (344) T protein:vir:10 1 MANMTGGQQLGTNQGKDVMAAGDKLALFLKVFGGEVLTAFAR----TSVTTSRHMVRS-IS-SGKSAQFPV---LGRTQA 71 (344) T ss_pred CccccccccCCcccCCccCCccchhHHHHHHHHHHHHHHHHH----Hhhhcccceeee-ec-ccceEEEEe---eceeEE Confidence 6642 22 211 11 1112 443333344444433 234456666543 22 244444443 355443 Q ss_pred ec-Cccccccee--eeeccceeEEEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhhheeee----eccc- Q lcl|NC_019522. 69 FG-PNSTDVPTV--DIAMSQGFKDINTAALGYTYSIEEIGFAMLNNVNLDAERGQAVRDVVEQGLNKIYLL----GDKG- 140 (311) Q Consensus 69 ~~-~~a~dip~v--~~~~~~~~~~v~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aa~~~~~~~~n~~~~~----G~~~- 140 (311) -. ..+++++-. ++.-++....|=. ..-+..-+.|+.. .++..++-..-.+.+..++++..|+.++. +... T Consensus 72 ~~~~~G~~l~~t~~~~~~~e~~l~ID~-~~y~~~~VdDiD~-~q~~~D~r~~~~~~~G~aLA~~~D~~i~~~la~~a~~~ 149 (344) T protein:vir:10 72 AYLAPGENLDDIRKDIKHTEKVITIDG-LLTADVLIYDIED-AMNHYDVRSEYTSQLGESLAMAADGAVLAEIAGLCNVE 149 (344) T ss_pred EeeecCCCCCCCCCCcccceEEEEEcc-hhhhhhhhhhHHH-HhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhhccc Confidence 21 112333321 1222222211111 1123444567776 44566777888888889999999886652 2111 Q ss_pred ----cceeeeecCCcceeeccCCccccCcccccCCHHHHHHHHHHHHHHHHhccCCcee-cceEEEeCHHHHHHHhccc- Q lcl|NC_019522. 141 ----VGEGLYTSPNVSVEAATSTFVALVAAIPTNGTQPIIDFFGNAYNTVYLDNTLTVH-RPNTFVLPPAQFQLLARTL- 214 (311) Q Consensus 141 ----~g~GllN~p~v~~~~~~~~~~~~~t~w~~~t~~ei~~di~~~~~~~~~~~~~~~~-~p~~l~lpp~~~~~L~~~~- 214 (311) ...+.+-+-.+...++. +....-+..+++.+++-|.++...+.+.+ +- ....++++|+.|..|-.-. T Consensus 150 ~~~~~~~~g~~~~~~~~~~~~----~~~~t~~~~~~~~~~~~i~~a~~~Lde~~---VP~~gR~~vv~P~~y~~Ll~~~~ 222 (344) T protein:vir:10 150 SQYNENITGLGTATVIETTQD----KTTLTDQVALGKEIIAALTKARAALTKNY---VPSSDRVFYCDPDSYSAILAALM 222 (344) T ss_pred cccccccccccccceeecccc----cccccchhhhHHHHHHHHHHHHHHHhhcC---CCccCCEEEeChHHHHHHhhccc Confidence 11111111111111111 11111234467788888888888886542 31 1257889999999885421 Q ss_pred ccCCCC-CcchHHHHHHHhCCceEEEEchhcccCC-------CCcccEE----------EEEEc------CcceeEEeec Q lcl|NC_019522. 215 LSTQNA-SNVTLLQFLRTNFPDITFEDDILLKGAG-------VAGADRM----------AVYKK------EIRIVKGHDV 270 (311) Q Consensus 215 ~~~~~~-~~~Tvl~~l~~n~~~l~i~~~~~l~~ag-------~~g~~~~----------v~y~~------~~~~~~~~~~ 270 (311) ....+. .+.....=.-.+--.++|+.++.|...+ ..|.... +.+.+ .|+-+..... T Consensus 223 ~~~~~~~~~~~~~~G~V~~v~G~~V~~Sn~lp~~~~~~~~~~~tg~~~~~~~~~~~~~~~~~s~~~~l~~h~~A~~~v~~ 302 (344) T protein:vir:10 223 PNAANYAALIDPEKGSIRNVMGFEVVEVPHLTAGGAGTSREGTTGQKHAFPATKSGNDKVAKDNVIGLFMHRSAVGTVKL 302 (344) T ss_pred ccccccccccceeeeEEEEEeceEEEeccccccccCCcccccccCccccccCCcccceeeecceeEEEeechhhhhhhhh Confidence 111111 1111110000011235667777664211 1111100 11111 1111111111 Q ss_pred chhhhccceeeCCceEEEeeeeeeeeEEEECCeEEEEeecC Q lcl|NC_019522. 271 MPLRFLAPATADNVNFKVPAILRTGGTEWRIPKAGHYVDGV 311 (311) Q Consensus 271 ~~~~~~~p~~~~~~~~~~~~~~~~gGv~i~~P~ai~~~dGI 311 (311) ++++...-...+...+.+.+.... |+.+.||++++.+.== T Consensus 303 ~~~~~e~~r~~~~~~d~i~g~~~~-G~~vlRPe~a~~v~~~ 342 (344) T protein:vir:10 303 RDLALERARRANFQADQIIAKYAM-GHGGLRPEAAGAVVFK 342 (344) T ss_pred ccceeecccchhHHHHHHHHHhhc-ccceecccceEEEEee Confidence 221110001122223444554554 7889999877543222 No 145 >protein:vir:94622 Length: 341 # NCBI annotation: PfWMP4_37 # Family: family:all:2203 # MgeID: mge:1525 # MgeName: Pf-WMP4 # Cross-refs: genbank:acc:YP_762667;genbank:gi:115304375;genbank:GeneID:5142322 Probab=82.38 E-value=0.077 Score=26.68 Aligned_cols=279 Identities=10% Similarity=-0.006 Sum_probs=113.1 Q ss_pred CCcccccccchhhh--hhhHHHH-HHHHHHHHhhhhhhhhhhhhccccCCCCcc-eeEEEEEEeecccceEEecCccccc Q lcl|NC_019522. 1 MAKSVFDVSPVSAL--SFLVNQA-AHIESEIYRIEYPQFKYGTLLPLDNSAPDW-AQAVMFRSIDARGELQLFGPNSTDV 76 (311) Q Consensus 1 ~~~~~~~~~~~~~~--~fl~~~L-~~id~~v~~~~~~~~~~~~~~~v~~~~~~~-~~~~~~~~~~~~G~a~~~~~~a~di 76 (311) |+. -++-+-+++ ++..... +-+...|.+...+.+.++.++.-. ++... -+++.+...- ...++.+.. ...+ T Consensus 1 ~~~--~~~~~~~~~~t~~v~~fipei~s~~i~~~l~~~~v~~~~~~d~-~~~~~~Gdtv~ip~~g-~~~~~d~~~-~~~i 75 (341) T protein:vir:94 1 MAL--GNTITGPSINTQRGQQFIPEQWLSEVQMFRKAKMLDTSVVKTW-GAQVKKGDTFHVPRIS-ELGVEDKAT-DVPV 75 (341) T ss_pred Ccc--hhhhccccccchhHHHHHHHHHHHHHHHHHHhhcchhhccccc-cccccCCceEEEeccC-cceeeeecC-CCcc Confidence 222 111111111 1111111 334556666666667777765422 22211 3566666542 334555532 2345 Q ss_pred ceeeeeccceeEEE-EEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhhheeeeecccc-ce---eeeecCCc Q lcl|NC_019522. 77 PTVDIAMSQGFKDI-NTAALGYTYSIEEIGFAMLNNVNLDAERGQAVRDVVEQGLNKIYLLGDKGV-GE---GLYTSPNV 151 (311) Q Consensus 77 p~v~~~~~~~~~~v-~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aa~~~~~~~~n~~~~~G~~~~-g~---GllN~p~v 151 (311) +.-+.+-.+....+ ..-..++.++ +++.. +...++-.+-.+.+.+++++..|+.++--.+.. +. +-...++. T Consensus 76 ~~~~~~~~~~~itiD~~~~~~~~i~--d~d~~-~~~~d~~~~~~~~~~~aLA~~~D~~i~~~~a~~~~~~~~~~~~~~~~ 152 (341) T protein:vir:94 76 GVQPVNDTDFVITVDTDRTTAVALD--DLLEI-QASYDLRAPYLEAMGYALAKDMTGSILGLRAAVQNTASQNVFSSSNG 152 (341) T ss_pred ccccccCceEEEEEeeeeecceeec--hHHHH-hhccchHHHHHHHHHHHHHHHHHHHHHHHhhhccccccCccccCccc Confidence 55455555555555 2224555555 44443 345677788888888888888888765321111 01 11111110 Q ss_pred ceeeccCCccccCcccccCCHHH-HHHHHHHHHHHHHhccCCcee-cceEEEeCHHHHHHHhc--ccccCCCCCcchHHH Q lcl|NC_019522. 152 SVEAATSTFVALVAAIPTNGTQP-IIDFFGNAYNTVYLDNTLTVH-RPNTFVLPPAQFQLLAR--TLLSTQNASNVTLLQ 227 (311) Q Consensus 152 ~~~~~~~~~~~~~t~w~~~t~~e-i~~di~~~~~~~~~~~~~~~~-~p~~l~lpp~~~~~L~~--~~~~~~~~~~~Tvl~ 227 (311) . .+.+++. .++.|.++...+... ++- ....++++|+.+..|.+ .... .+..+... T Consensus 153 -----~----------~t~~~~~~~~~~i~~a~~~Lde~---~VP~~gR~lvv~P~~~~~Ll~~~~~~~-~~~~g~~~-- 211 (341) T protein:vir:94 153 -----A----------ITGNGQAFSFAVFLAARRLLLEA---DVPEEKIVLLISPGQESALFTIPQFIS-KDFINNAP-- 211 (341) T ss_pred -----c----------ccCchhhhhHHHHHHHHHHHhhc---CCCccCCEEEeCHHHHHHHhhchhhhh-hhccccch-- Confidence 0 0111121 245566666666432 221 23578999999999854 1111 11111111 Q ss_pred HHHHhC-----CceEEEEchhcccCCCCcc--c-----------------------------EEEEEEcC-cceeEEeec Q lcl|NC_019522. 228 FLRTNF-----PDITFEDDILLKGAGVAGA--D-----------------------------RMAVYKKE-IRIVKGHDV 270 (311) Q Consensus 228 ~l~~n~-----~~l~i~~~~~l~~ag~~g~--~-----------------------------~~v~y~~~-~~~~~~~~~ 270 (311) ++ ++ -.++|..++.+......+. . +-+++.++ .-.+.+.-| T Consensus 212 -l~-~G~ig~i~G~~V~~Sn~lp~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~gl~~~~~av~~~k~~~~ 289 (341) T protein:vir:94 212 -IA-QGQIGSLMGVRVIRTSLIGNNSATGWRNGAPTIAPAEATPGFTGSRYLPKQDSFTSLPATFTGNSRPVHTAVMCHM 289 (341) T ss_pred -hh-eeeeeeEeceEEEEeccccccccccccccccceecccccccccccccccccccccccEEEEEEecccccceeeecc Confidence 21 22 2345555555532111100 0 00011000 001111111 Q ss_pred chhhhccce--------eeCCceEEEeeeeeeeeEEEECCeEEEEeecC Q lcl|NC_019522. 271 MPLRFLAPA--------TADNVNFKVPAILRTGGTEWRIPKAGHYVDGV 311 (311) Q Consensus 271 ~~~~~~~p~--------~~~~~~~~~~~~~~~gGv~i~~P~ai~~~dGI 311 (311) .-+....|. ..+.....+..... -|+-+.||++++.+-=- T Consensus 290 ~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~-~G~~~lrp~~~v~~~~~ 337 (341) T protein:vir:94 290 DWAAAVVSKAPRVTQSFENREQVWLMVGRQA-YGARLYRPLHAVNIHTT 337 (341) T ss_pred hhhhccccccccccccchhhhhhhhhhhhhh-hcccccCcceeEEEecC Confidence 111111110 00011111222232 25666666665443333 No 146 >protein:vir:79078 Length: 307 # NCBI annotation: gp8 # Family: family:all:908 # MgeID: mge:1862 # MgeName: phiE255 # Cross-refs: genbank:acc:YP_001111208;genbank:gi:134288798;genbank:GeneID:4960752 Probab=82.11 E-value=0.08 Score=26.61 Aligned_cols=271 Identities=11% Similarity=0.080 Sum_probs=106.9 Q ss_pred cccccccchhhhhhhHH-HHHHHHHHHHhhhhhhhhhhhhccccCCCCcceeEEEEEEeecccceEEe----cCcccccc Q lcl|NC_019522. 3 KSVFDVSPVSALSFLVN-QAAHIESEIYRIEYPQFKYGTLLPLDNSAPDWAQAVMFRSIDARGELQLF----GPNSTDVP 77 (311) Q Consensus 3 ~~~~~~~~~~~~~fl~~-~L~~id~~v~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~G~a~~~----~~~a~dip 77 (311) +++++ -.|.+. .|+.+=... . -+++.+.++||..... .+++.|..++..+ .... +-++ +-. T Consensus 1 m~~~~------~~~~~dp~LT~~A~gy-~--n~~~Iad~lfP~vpV~---~~~~k~~~f~~e~-f~~~~t~ra~~~-~~~ 66 (307) T protein:vir:79 1 MGRLS------KLRIVDPVLTNLAIGY-T--NAEFIGQTLMPVVEVE---KEGGKIPKFGKES-FRLYQTERALRA-KSN 66 (307) T ss_pred CCCCC------CCcccCHHHHHHHhhc-c--chhhhhhhcCCccccc---ccccceeeecccc-ccccccccccCC-Ccc Confidence 33332 233332 343333222 1 3567888899965433 3334444332111 0000 0000 001 Q ss_pred eeee-eccceeEEEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHH----HHHHHHhhhheeeeeccccceeeeecCCcc Q lcl|NC_019522. 78 TVDI-AMSQGFKDINTAALGYTYSIEEIGFAMLNNVNLDAERGQAV----RDVVEQGLNKIYLLGDKGVGEGLYTSPNVS 152 (311) Q Consensus 78 ~v~~-~~~~~~~~v~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aa----~~~~~~~~n~~~~~G~~~~g~GllN~p~v~ 152 (311) .++. +++.....+...+ ..+.++. +.....++++.+++.+.. .+..+...-+++|.+. |.|.-. T Consensus 67 ~v~~~~~~~~~~~~~~~~--l~~~id~-r~~~~~~~~~~~~Av~~l~d~I~l~~E~~~A~l~~~~~--------~y~~~~ 135 (307) T protein:vir:79 67 RMNPEDIDSVDVNLDEHD--LEYPIDY-REDQESAFPLEQAAVQTATDAIQLRREKMIADLSQNPS--------SYAAGN 135 (307) T ss_pred eeeeeccccccccccccc--hhhcccc-hhcCCCCCCHHHHHHHHHHHHHHhHHHHHHHHHhcccc--------ccCCCc Confidence 1111 1111111111111 1111111 122334555544433333 2333344444554432 222222 Q ss_pred eeeccCCccccCcccccCCHHHHHHHHHHHHHHHHhccCCceecceEEEeCHHHHHHHhc-----ccccCCCCCcchHHH Q lcl|NC_019522. 153 VEAATSTFVALVAAIPTNGTQPIIDFFGNAYNTVYLDNTLTVHRPNTFVLPPAQFQLLAR-----TLLSTQNASNVTLLQ 227 (311) Q Consensus 153 ~~~~~~~~~~~~t~w~~~t~~ei~~di~~~~~~~~~~~~~~~~~p~~l~lpp~~~~~L~~-----~~~~~~~~~~~Tvl~ 227 (311) ..+++ ++..|.++++| ++.||.+.+.++...++ ..|++++|.++.+..|.+ .++.++...-+| .+ T Consensus 136 k~tLs-----gt~~Wsd~~sD-Pi~di~~~~~ai~~~~g---~~Pn~~vlg~~a~~~l~~h~~i~~~lk~~~~g~it-~~ 205 (307) T protein:vir:79 136 KKQLS-----ATEKFTAANSD-PVGVIEDGKEAIRTKIG---RRPNTMVIGASAYKTLKAHPQLIEKIKYSMKGIVT-VD 205 (307) T ss_pred eEEEc-----cCcccCCCCCC-cHHHHHHHHHHHHHhhC---CccceEEeCHHHHHHHhcCHHHHHHhcCccccccC-HH Confidence 22222 23358887654 69999999999976543 369999999999998854 112222222222 23 Q ss_pred HHHHhCCceEEEEchh--cccCC-----CCcccEEEEEEcCc-ceeEEeecchhhhccceeeCCceEEEeeeeeeeeEEE Q lcl|NC_019522. 228 FLRTNFPDITFEDDIL--LKGAG-----VAGADRMAVYKKEI-RIVKGHDVMPLRFLAPATADNVNFKVPAILRTGGTEW 299 (311) Q Consensus 228 ~l~~n~~~l~i~~~~~--l~~ag-----~~g~~~~v~y~~~~-~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~gGv~i 299 (311) .|++-+ .++.+.+-+ +.++. .-|.+..++|.... .+-...+..| ++..-.+..+.-+..++.+ .+|+++ T Consensus 206 ~la~l~-~v~~V~vg~a~y~~~~~~~~~iw~~~~~l~y~~~~~~~~~~~~~~p-s~Gyt~~~~g~~~~d~~~~-~~~~~~ 282 (307) T protein:vir:79 206 LLKEIF-EVENIAVGEAIYADDKDRFTDIWGANIVLAYVPLQRGGQQRTPYEP-SYGYTLRKKGNPVVDTRIE-DGKLEL 282 (307) T ss_pred HHHHHh-CceeEEEeeeeeecccccchhcCCCceEEEecccccCCCCCccccc-ccceeEEecCceEEecccC-CCceeE Confidence 343321 222221111 11111 12335556665331 1111112222 2222223333333333333 234444 Q ss_pred E------CCeEEEEeecC Q lcl|NC_019522. 300 R------IPKAGHYVDGV 311 (311) Q Consensus 300 ~------~P~ai~~~dGI 311 (311) . .|.-++.--|- T Consensus 283 vrv~~~~~~~i~~~~~G~ 300 (307) T protein:vir:79 283 VRATDIFRPYLLGADAGY 300 (307) T ss_pred Eeecccccceeeccccch Confidence 3 33333333232 No 147 >protein:vir:80213 Length: 334 # NCBI annotation: capsid protein # Family: family:all:2806 # MgeID: mge:1879 # MgeName: LKA1 # Cross-refs: genbank:acc:YP_001522884;genbank:gi:158345177;genbank:GeneID:5687476 Probab=81.64 E-value=0.084 Score=26.48 Aligned_cols=292 Identities=9% Similarity=0.044 Sum_probs=125.1 Q ss_pred CCcccccccc-------hhhhh-hhHHHHHHHHHHHHhhhhhhhhhhhhccccCCCCcceeEEEEEEeecccceEEec-C Q lcl|NC_019522. 1 MAKSVFDVSP-------VSALS-FLVNQAAHIESEIYRIEYPQFKYGTLLPLDNSAPDWAQAVMFRSIDARGELQLFG-P 71 (311) Q Consensus 1 ~~~~~~~~~~-------~~~~~-fl~~~L~~id~~v~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~G~a~~~~-~ 71 (311) |+.+.=+..+ ++..+ |+ +...-+|.+.-...-..+.+..+++-. +-.++.+.. .|+++... . T Consensus 1 m~~~~~~~~t~~~~~~~~~~~~l~l----e~~~geV~~af~~~s~~~~~~~~r~i~--~G~s~~~~~---iG~~~~~~~~ 71 (334) T protein:vir:80 1 MTYPAANTHTRPGWGGANSDVSLHI----EEHLGLVDASFMYSSKFASWMNVRSLR--GTNQLRVDR---VGASTIAGRK 71 (334) T ss_pred CCCCcCCCccccccccccchheehh----hhhhhHHHHHHHHhhhhhccceeeecc--ccceEEEee---ecceeeeeec Confidence 6555222222 11122 33 333333433332334555666655321 234444443 35544311 1 Q ss_pred cccccceeeeeccceeEEEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhhheeeee-----ccc--cc-e Q lcl|NC_019522. 72 NSTDVPTVDIAMSQGFKDINTAALGYTYSIEEIGFAMLNNVNLDAERGQAVRDVVEQGLNKIYLLG-----DKG--VG-E 143 (311) Q Consensus 72 ~a~dip~v~~~~~~~~~~v~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aa~~~~~~~~n~~~~~G-----~~~--~g-~ 143 (311) ....+.--.+.-++....|-. ..-++.-+.++..+ ++..++-..-.+.+..++++..|+.++.- ... .. . T Consensus 72 ~g~~l~~~~~~~~~~~l~ID~-~l~~~~~VddiD~~-q~~~D~rse~~~~~G~aLA~~~D~~~~~~l~kaa~~~~~~~~~ 149 (334) T protein:vir:80 72 AGEELVVQKNVSDKLNLTVDT-VLYARHFFDKFDEW-TSNLDVRKETAREDGIALARQYDQACIIQLQKCGDFLAPAHLK 149 (334) T ss_pred CCCCCCCCCcccCceEEEEee-eeehhhhHhhHHHH-hcCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccc Confidence 112222111112222222211 12234445666664 34556778888888888888888866532 110 01 1 Q ss_pred eeeecCCcceeeccCCccccCcccccCCHHHHHHHHHHHHHHHHhccCCce-ecceEEEeCHHHHHHHhcc--cccCCCC Q lcl|NC_019522. 144 GLYTSPNVSVEAATSTFVALVAAIPTNGTQPIIDFFGNAYNTVYLDNTLTV-HRPNTFVLPPAQFQLLART--LLSTQNA 220 (311) Q Consensus 144 GllN~p~v~~~~~~~~~~~~~t~w~~~t~~ei~~di~~~~~~~~~~~~~~~-~~p~~l~lpp~~~~~L~~~--~~~~~~~ 220 (311) .-+.+.+...... ++ .+.-...+++.+.+=+.++...+.+..--.. .....++++|..|..|-.- .++. +. T Consensus 150 ~~~~~G~~~~~~~--~g---~~~~~~~~~~~l~~a~~~a~~~L~e~dvp~~~~~~R~~vv~P~~y~~Ll~~~r~~n~-d~ 223 (334) T protein:vir:80 150 PAFHDGILLPSTI--SG---LAADAAADADVLVAAHRQGVEAMVFRDLGDQLMSEGVTLLDPVIFSFLLEHDRLMNV-EF 223 (334) T ss_pred ccccCCcceeecc--cc---cccchhhhHHHHHHHHHHHHHHHHhcCCCCCcCCceEEEeChHHHHHHhcccccccc-ee Confidence 1111111111111 11 1122345688888888888888765431110 0236899999999998542 2221 10 Q ss_pred ----CcchHHHHHHHhCCceEEEEchhcccCC----C-Ccc---------cEEEEEEcCcceeEEeecchhhhccceeeC Q lcl|NC_019522. 221 ----SNVTLLQFLRTNFPDITFEDDILLKGAG----V-AGA---------DRMAVYKKEIRIVKGHDVMPLRFLAPATAD 282 (311) Q Consensus 221 ----~~~Tvl~~l~~n~~~l~i~~~~~l~~ag----~-~g~---------~~~v~y~~~~~~~~~~~~~~~~~~~p~~~~ 282 (311) .+..+...-..+.-.++|+.++.+-+.. . ++. .++.++. .++-+...-.++++.-.-.+.+ T Consensus 224 ~~s~~~~~~~~g~i~~v~G~~V~~Sn~~P~~~~t~~~~g~~~~~~agd~t~~~~~~~-~~~Al~t~~~~~~~~e~~~~~~ 302 (334) T protein:vir:80 224 GAKEGGNSFVGGRIAMLNGVRVVETPRFPQSAITANALGADFNVTDAEVRRKMITFI-PSMALISAQVHPVSAQFWEEKK 302 (334) T ss_pred ccccccccccceeEEEEeceEEEeecCCCCccccccccccccccccccccceEEEEE-eCceEEEEEEeecceeeeechh Confidence 0111111000111135677777664321 0 111 1222211 2222222222232211111222 Q ss_pred CceEEEeeeeeeeeEEEECCeE--EEEeecC Q lcl|NC_019522. 283 NVNFKVPAILRTGGTEWRIPKA--GHYVDGV 311 (311) Q Consensus 283 ~~~~~~~~~~~~gGv~i~~P~a--i~~~dGI 311 (311) ...+.+.+... .|+-+.||++ ++.++++ T Consensus 303 ~~~d~i~~~~a-~G~g~lRPeaa~vv~~~~~ 332 (334) T protein:vir:80 303 DFGHYLDTFQS-YNIGQRRPDAVAVHDITVT 332 (334) T ss_pred hHHHHHHHHHH-cCCceeccceEEEEEEeee Confidence 33334444433 4899999955 4556777 No 148 >protein:vir:97031 Length: 402 # NCBI annotation: 31 # Family: family:all:2806 # MgeID: mge:1644 # MgeName: K1-5 # Cross-refs: genbank:acc:YP_654132;genbank:gi:108862016;genbank:GeneID:5075980 Probab=79.67 E-value=0.1 Score=26.02 Aligned_cols=283 Identities=9% Similarity=0.052 Sum_probs=122.0 Q ss_pred CCc------ccccccchhhhhhhHHHHHHHHHHHHhhhhhhhhhhhhccccCCCCcceeEEEEEEeecccceEE--ecCc Q lcl|NC_019522. 1 MAK------SVFDVSPVSALSFLVNQAAHIESEIYRIEYPQFKYGTLLPLDNSAPDWAQAVMFRSIDARGELQL--FGPN 72 (311) Q Consensus 1 ~~~------~~~~~~~~~~~~fl~~~L~~id~~v~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~G~a~~--~~~~ 72 (311) |.- +.-......---||...+-.++..+-+ .=..+.+..+++ +. +..++.|.. .|..+. +--+ T Consensus 1 Ms~~n~~t~~~~~~s~~~~al~le~f~geV~taF~~----~si~~~~~~vrt-i~-~GkS~qf~~---iG~~~a~y~~~G 71 (402) T protein:vir:97 1 MSTPNTLTNVAVSASGEVDSLLIEKFNGKVNEQYLK----GENILSYFDVQT-VT-GTNTVSNKY---LGETELQVLAPG 71 (402) T ss_pred CCCcccccccccccccchhhhhhhhhhhhHHHHHHH----HHhhcCcceeee-ec-ccceEEEEE---EeeeEEeeeccc Confidence 332 222222211112444444445554433 123344555543 22 334444443 344443 1111 Q ss_pred ccccceeeeeccce--eEEEEEEEEEEEecHHHHHHHHHhCCC-hHHHHHHHHHHHHHHhhhheeee-----ecccc--- Q lcl|NC_019522. 73 STDVPTVDIAMSQG--FKDINTAALGYTYSIEEIGFAMLNNVN-LDAERGQAVRDVVEQGLNKIYLL-----GDKGV--- 141 (311) Q Consensus 73 a~dip~v~~~~~~~--~~~v~~~~~~~~~~~~El~~a~~~g~~-l~~~k~~aa~~~~~~~~n~~~~~-----G~~~~--- 141 (311) +.+-...+.-++. +..-.++.-.+-| ++.. .+..++ +..+-...+..++++..|+.++- |-... T Consensus 72 -~~ldg~~~~~~k~~ItID~lL~a~~~V~---diDe-aq~~yD~vRse~s~e~G~ALA~~~Dq~ii~~i~~aa~a~t~~~ 146 (402) T protein:vir:97 72 -QSPNATPTQADKNQLVIDTTVIARNTVA---HIHD-VQGDIDSLKPKLAMNQAKQLKRLEDQMAIQQMLLGGIANTKAE 146 (402) T ss_pred -cccCCCCcccccEEEEeCceeechhhhh---hHHH-HHhcccchhHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccc Confidence 1111111111221 1111222223333 3333 234555 56667777788888888885531 11110 Q ss_pred -c-eeeee-cCCcceeeccCCccccCcccccCCHHHHHHHHHHHHHHHHhccCCceecceEEEeCHHHHHHHhcc--ccc Q lcl|NC_019522. 142 -G-EGLYT-SPNVSVEAATSTFVALVAAIPTNGTQPIIDFFGNAYNTVYLDNTLTVHRPNTFVLPPAQFQLLART--LLS 216 (311) Q Consensus 142 -g-~GllN-~p~v~~~~~~~~~~~~~t~w~~~t~~ei~~di~~~~~~~~~~~~~~~~~p~~l~lpp~~~~~L~~~--~~~ 216 (311) . .+... ..+++.. + +..-...+++.+.+-|.++...+.+..=-. .-..++|||+.|..|.+- .++ T Consensus 147 ~~~~~~~~~g~s~~~~-----~---t~~~a~~~~~~l~~ai~~a~~~LdEkdVP~--~dRv~vv~P~~y~~Ll~~~rl~n 216 (402) T protein:vir:97 147 RNKPRVKGHGFSINVN-----V---TESEALANPQYVMAAVEYALEQQLEQEVDI--SDVAIMMPWKFFNALRDADRIVD 216 (402) T ss_pred cccCcccccccccccc-----c---ccchhhcCHHHHHHHHHHHHHHHHhcCCCc--cccEEEeChHHHHHHhhcccccc Confidence 0 11111 1111111 1 111224578888888888888886432111 114889999999988652 221 Q ss_pred CC---CCCcchHHHHHHHhCCceEEEEchhccc------------CCCC---------cccEEEEEEcCcceeEEeecch Q lcl|NC_019522. 217 TQ---NASNVTLLQFLRTNFPDITFEDDILLKG------------AGVA---------GADRMAVYKKEIRIVKGHDVMP 272 (311) Q Consensus 217 ~~---~~~~~Tvl~~l~~n~~~l~i~~~~~l~~------------ag~~---------g~~~~v~y~~~~~~~~~~~~~~ 272 (311) .. .+.+..+.-.+. .--.++|+.++.|.. +|.+ .+-++++|.+ +-+.-.-.+| T Consensus 217 ~d~~~~~~g~~~~G~v~-~v~Gv~Vv~SnnlP~~a~~it~~~ls~a~~G~~y~~t~d~t~~~~~~f~~--~Av~tvk~~~ 293 (402) T protein:vir:97 217 KTYTISQSGATINGFVL-SSYNCPVIPSNRFPTFAQDQAHHLLSNEDNGYRYDPIAEMNGAVAVLFTS--DALLVGRTIE 293 (402) T ss_pred hhhccccCCccccceeE-EEeceEEEecCccccccccccccccccCCCCccCCcCcccceeEEEEEec--ceEEEEEeec Confidence 10 011111100000 011345666665532 1111 1224555544 3333222344 Q ss_pred hhhccceeeCCceEEEeeeeeeeeEEEECCeEEEEee-----------cC Q lcl|NC_019522. 273 LRFLAPATADNVNFKVPAILRTGGTEWRIPKAGHYVD-----------GV 311 (311) Q Consensus 273 ~~~~~p~~~~~~~~~~~~~~~~gGv~i~~P~ai~~~d-----------GI 311 (311) ++.-.-.+.+...+.+.+.... |+..+||+++..+. |+ T Consensus 294 vT~~~~~d~r~~~~~id~~~a~-G~g~~RPeaa~vv~~~~~~t~~~~~~~ 342 (402) T protein:vir:97 294 VTGDIFYEKKEKTYYIDTFMAE-GAIPDRWEAVSVVTTKRDATTGDAGGP 342 (402) T ss_pred cccchhhchhHHHHHHHHHHHh-CCcccCccceEEEEEecccccccCCcc Confidence 4221112444555556666665 79999999988882 22 No 149 >protein:vir:78739 Length: 332 # NCBI annotation: major capsid protein # Family: family:all:975 # MgeID: mge:1856 # MgeName: Syn5 # Cross-refs: genbank:acc:YP_001285448;genbank:gi:148724482;genbank:GeneID:5220210 Probab=79.05 E-value=0.11 Score=25.88 Aligned_cols=286 Identities=12% Similarity=0.013 Sum_probs=127.3 Q ss_pred CCcc------------cccccchhhh-hhhHHHHHHHHHHHHhhhhhhhhhhhhccccCCCCcceeEEEEEEeecccceE Q lcl|NC_019522. 1 MAKS------------VFDVSPVSAL-SFLVNQAAHIESEIYRIEYPQFKYGTLLPLDNSAPDWAQAVMFRSIDARGELQ 67 (311) Q Consensus 1 ~~~~------------~~~~~~~~~~-~fl~~~L~~id~~v~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~G~a~ 67 (311) |... .-.++.+.-. -|+...+-.|+..+- ..-..+.++.+.+- . +-.++.+... |..+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~d~~~al~le~~~geV~~~f~----~~s~~~~~~~~r~i-~-~G~tv~i~~i---g~~~ 71 (332) T protein:vir:78 1 MTTLSNFSLPNQANGGARNADYDVRYATALKLFSGEVFTAFN----NASIFKGLVRSYDL-R-GGKSKQFMFT---GKLS 71 (332) T ss_pred CcccccccCCccccCCccccccccchhhhhhhhhhhHHHHHH----HHhhhhhccccccc-c-ccceEEEEec---ccee Confidence 2110 0011111111 244333334443332 22344555555432 2 3445554443 4443 Q ss_pred --EecCcccccceeeeeccceeEEEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhhheeee----ecccc Q lcl|NC_019522. 68 --LFGPNSTDVPTVDIAMSQGFKDINTAALGYTYSIEEIGFAMLNNVNLDAERGQAVRDVVEQGLNKIYLL----GDKGV 141 (311) Q Consensus 68 --~~~~~a~dip~v~~~~~~~~~~v~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aa~~~~~~~~n~~~~~----G~~~~ 141 (311) -+..+..-.|..+++-.+....|=. ..-+..-+.++.+++ ...++-.+..+.+..++++..|+.++- +-.. T Consensus 72 ~~~~~~g~~l~~~~~~~~~~~~l~ID~-~ky~~~~VddiD~~q-~~~dl~~~~~~~~g~aLA~~~D~~i~~~l~~aa~~- 148 (332) T protein:vir:78 72 AGYHTPGTPIVGDAGIKANEKTLVMDD-LLVSSQFVYSLDEIF-SQYSTRAEVSKQIGEALATHYDERIARVLAKASAE- 148 (332) T ss_pred EeeecCCCCCCCCCCCCCceEEEEEeh-hhhhHHHHHhHHHHh-cCcchHHHHHHHHHHHHHHHHHHHHHHHHHhhhcc- Confidence 3332222112222333333322211 133445567777754 446688888999999999999986652 1110 Q ss_pred ceeeeecCCcceeeccCCccccCcccccCCHHHHHHHHHHHHHHHHhccCCceec-ceEEEeCHHHHHHHhc---ccccC Q lcl|NC_019522. 142 GEGLYTSPNVSVEAATSTFVALVAAIPTNGTQPIIDFFGNAYNTVYLDNTLTVHR-PNTFVLPPAQFQLLAR---TLLST 217 (311) Q Consensus 142 g~GllN~p~v~~~~~~~~~~~~~t~w~~~t~~ei~~di~~~~~~~~~~~~~~~~~-p~~l~lpp~~~~~L~~---~~~~~ 217 (311) +.+.-..|+-.....+++ .+.+++.+++-|.++..+|.+.+ +-. -..++++|+.|..|.+ +.+-+ T Consensus 149 ~~~~~~~~g~~~~~~~~~--------~~~~~~~~~~~i~~a~~~Lde~~---VP~~gR~~vv~P~~y~~Ll~~~d~~~~n 217 (332) T protein:vir:78 149 ASPVTGEPGGFHVNIGAG--------NTNDAQAIVDGFFEAAAVLDERS---APQEGRVAVLSPRQYYSLISSVDTNILN 217 (332) T ss_pred cCcccccccccccccCCc--------cccCHHHHHHHHHHHHHHHhhcC---CCccCCEEEeCHHHHHHHHhhcCceeee Confidence 011111122111111111 13468889999999988886542 311 1468899999988854 11111 Q ss_pred CCCC--cchHHHH-HHHhCCceEEEEchhcccCC--------CCcc---------cEEEEEEcCcceeEEeecchhhhc- Q lcl|NC_019522. 218 QNAS--NVTLLQF-LRTNFPDITFEDDILLKGAG--------VAGA---------DRMAVYKKEIRIVKGHDVMPLRFL- 276 (311) Q Consensus 218 ~~~~--~~Tvl~~-l~~n~~~l~i~~~~~l~~ag--------~~g~---------~~~v~y~~~~~~~~~~~~~~~~~~- 276 (311) .+.. +..+..- ...+.-.++|..++.|-..+ ..|. ++ ++....++-+.+...++++.. T Consensus 218 ~~~~~~~~~~~~g~~i~~i~G~~V~~Sn~lp~~~g~~~~~~~~~~~~n~~~~~~~~~-~~~~~h~~a~~~v~~~~~~~~~ 296 (332) T protein:vir:78 218 REIGNSQGDMNSGKGLYSIAGIRILKSNNLAGLYGQDLSSAAVTGENNDYQVDASAL-AGLIFHREAAGCIQSVAPTIQT 296 (332) T ss_pred eeccccccceecceeeeEEeeeEEEecCccccCcccccccccccccccccccccccc-eEEeecccceeeeeeeccchhh Confidence 1111 1112111 01112236777777774211 0110 11 112223444544444443211 Q ss_pred -c-ceeeCCceEEEeeeeeeeeEEEECCeEEEEeecC Q lcl|NC_019522. 277 -A-PATADNVNFKVPAILRTGGTEWRIPKAGHYVDGV 311 (311) Q Consensus 277 -~-p~~~~~~~~~~~~~~~~gGv~i~~P~ai~~~dGI 311 (311) . -+..+.....+..... .|+-+.||++++.+.== T Consensus 297 t~~~~~~~~~~d~i~~~~~-~G~~v~rPe~~v~l~~a 332 (332) T protein:vir:78 297 TSGDFNVQYQGDLIVGKLA-MGCGSLRTSVAGSFQAA 332 (332) T ss_pred hhcccchhhhHhhhhhhhh-hcCceecccceEEEeeC Confidence 0 0111112233344444 46889999998876655 No 150 >protein:vir:97331 Length: 319 # NCBI annotation: ORF011 # Family: family:all:701 # MgeID: mge:1666 # MgeName: 52A # Cross-refs: genbank:acc:YP_240611;genbank:gi:66396278;genbank:GeneID:5133687 Probab=76.29 E-value=0.14 Score=25.31 Aligned_cols=277 Identities=7% Similarity=-0.065 Sum_probs=120.1 Q ss_pred CCccccccc----------chhhhhhhHHHHHHHHHHHHhhhhhhhhhhhhccccCC-CCcceeEEEEEEeecccceEEe Q lcl|NC_019522. 1 MAKSVFDVS----------PVSALSFLVNQAAHIESEIYRIEYPQFKYGTLLPLDNS-APDWAQAVMFRSIDARGELQLF 69 (311) Q Consensus 1 ~~~~~~~~~----------~~~~~~fl~~~L~~id~~v~~~~~~~~~~~~~~~v~~~-~~~~~~~~~~~~~~~~G~a~~~ 69 (311) |..-.-++. ++.+.-=.+=.|+.....+++..+....+..-+-+.++ -.-+..++....++..|-. .| T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~nt~~l~~k~~~~LD~~~~~~~~s~~~~~N~~~e~~gg~tVkIp~i~~~gl~-DY 79 (319) T protein:vir:97 1 MNKTIKNATGMLKLNLQHFANKSVEPGQTLLKNKHVGILERVTAVNAYSTPALISNDAIFMEGRSFTVMKGDTTELK-DY 79 (319) T ss_pred CCcccccccceeEeehhhhhccCCCcchHHHHHHHHHHHHHHHHHhhhhhhcccCcceEeccCcEEEEeeecccccc-cc Confidence 332222111 11111001112333344444444443333332212221 1125667777777776643 23 Q ss_pred cCcccccceeeeeccceeEEEEEEEEEEEecHHHHHHHHHhC-CChHHHHHHHHHHHHHHhhhheeeeeccccceeeeec Q lcl|NC_019522. 70 GPNSTDVPTVDIAMSQGFKDINTAALGYTYSIEEIGFAMLNN-VNLDAERGQAVRDVVEQGLNKIYLLGDKGVGEGLYTS 148 (311) Q Consensus 70 ~~~a~dip~v~~~~~~~~~~v~~~~~~~~~~~~El~~a~~~g-~~l~~~k~~aa~~~~~~~~n~~~~~G~~~~g~GllN~ 148 (311) .- ..++..=+++.++.+..+-. ..++.+.++++...+..+ +.....-.+.++..+.-.+|...|---.+.. | T Consensus 80 ~R-~~g~~~g~vt~~~~t~tidq-dR~~~F~VD~~D~~Etn~~l~a~~i~~~~~~~~v~PEiDay~~skla~~a-~---- 152 (319) T protein:vir:97 80 KR-NATNEFDHPKIEETTYFLDQ-EKYWGRFVDALDRKDTEGNIDINYVVARQGAEVVAPYLDNLRFATLARNK-A---- 152 (319) T ss_pred cC-CCCcccCCcccceeEEEeec-ccccccccchhhHhhhhchhhHHHHHHHHHHHHhhhhhhHHHHHHHHhhc-c---- Confidence 11 11222224445555554433 677888888888766533 2222333444555555555554333211100 0 Q ss_pred CCcceeeccCCccccCcccccCCHHHHHHHHHHHHHHHHhccCCceecceEEEeCHHHHHHHhcc-cccCCCCCcchHH- Q lcl|NC_019522. 149 PNVSVEAATSTFVALVAAIPTNGTQPIIDFFGNAYNTVYLDNTLTVHRPNTFVLPPAQFQLLART-LLSTQNASNVTLL- 226 (311) Q Consensus 149 p~v~~~~~~~~~~~~~t~w~~~t~~ei~~di~~~~~~~~~~~~~~~~~p~~l~lpp~~~~~L~~~-~~~~~~~~~~Tvl- 226 (311) ... =...|++-+++.|.++..++.... +-....|+++|..+..|.+- ....+...+.+++ T Consensus 153 ------------~~~---~~~~t~~n~y~~i~~a~~~Lde~~---VP~~Rvl~Vtp~~~~~L~~~~~f~~~~~~~~~~~~ 214 (319) T protein:vir:97 153 ------------KHL---TVGTGSDAQYDAVLDVSVELDEIK---APENRVLFVSPTFYKGIKKFVIALPQGDTRQQVLG 214 (319) T ss_pred ------------ccc---ccccCHHHHHHHHHHHHHHHHhcC---CCCCcEEEeCHHHHHHHHhhhhhhcccccccccee Confidence 000 012366778999999999886542 32346799999999999431 1111111111111 Q ss_pred HHHHHhCCceEEEEchhcccCCCCcccEEEEEEcCcceeEEeec-chhhhccceeeCCceEEEeeeeeeeeEEEECCeEE Q lcl|NC_019522. 227 QFLRTNFPDITFEDDILLKGAGVAGADRMAVYKKEIRIVKGHDV-MPLRFLAPATADNVNFKVPAILRTGGTEWRIPKAG 305 (311) Q Consensus 227 ~~l~~n~~~l~i~~~~~l~~ag~~g~~~~v~y~~~~~~~~~~~~-~~~~~~~p~~~~~~~~~~~~~~~~gGv~i~~P~ai 305 (311) +-.-...-...|..+|.-... +.+- ++... ..+.+..= ...+...|.+... .+.+.+ -.++|+.|.+|... T Consensus 215 ~g~Vg~idG~~Vi~vps~~~k---~in~-i~~h~--~A~~~~~k~~~~~~~~p~~~~~-a~~v~g-r~y~d~~V~~~k~~ 286 (319) T protein:vir:97 215 KGVQGELDGFVIVKVPTKLLQ---GLQA-IAVVG--EVLASPIQADLAKTNSNIPGMF-GTLAEQ-LLYTGAFVPEHLQK 286 (319) T ss_pred eeeceeecCeEEEEecccccc---cceE-EEEcC--CeeeeeeeeeeeeccCCCcccc-ceeeee-eeeeeeEEeccccc Confidence 000001112445555532111 1222 22211 11111000 0112222323222 355554 56789999999865 Q ss_pred EEeecC Q lcl|NC_019522. 306 HYVDGV 311 (311) Q Consensus 306 ~~~dGI 311 (311) +..... T Consensus 287 ~Iy~~~ 292 (319) T protein:vir:97 287 YIFTIG 292 (319) T ss_pred eEEEee Confidence 554444 No 151 >protein:vir:94800 Length: 319 # NCBI annotation: ORF012 # Family: family:all:701 # MgeID: mge:1531 # MgeName: 29 # Cross-refs: genbank:acc:YP_240536;genbank:gi:66396203;genbank:GeneID:5133580 Probab=76.29 E-value=0.14 Score=25.31 Aligned_cols=277 Identities=7% Similarity=-0.065 Sum_probs=120.1 Q ss_pred CCccccccc----------chhhhhhhHHHHHHHHHHHHhhhhhhhhhhhhccccCC-CCcceeEEEEEEeecccceEEe Q lcl|NC_019522. 1 MAKSVFDVS----------PVSALSFLVNQAAHIESEIYRIEYPQFKYGTLLPLDNS-APDWAQAVMFRSIDARGELQLF 69 (311) Q Consensus 1 ~~~~~~~~~----------~~~~~~fl~~~L~~id~~v~~~~~~~~~~~~~~~v~~~-~~~~~~~~~~~~~~~~G~a~~~ 69 (311) |..-.-++. ++.+.-=.+=.|+.....+++..+....+..-+-+.++ -.-+..++....++..|-. .| T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~nt~~l~~k~~~~LD~~~~~~~~s~~~~~N~~~e~~gg~tVkIp~i~~~gl~-DY 79 (319) T protein:vir:94 1 MNKTIKNATGMLKLNLQHFANKSVEPGQTLLKNKHVGILERVTAVNAYSTPALISNDAIFMEGRSFTVMKGDTTELK-DY 79 (319) T ss_pred CCcccccccceeEeehhhhhccCCCcchHHHHHHHHHHHHHHHHHhhhhhhcccCcceEeccCcEEEEeeecccccc-cc Confidence 332222111 11111001112333344444444443333332212221 1125667777777776643 23 Q ss_pred cCcccccceeeeeccceeEEEEEEEEEEEecHHHHHHHHHhC-CChHHHHHHHHHHHHHHhhhheeeeeccccceeeeec Q lcl|NC_019522. 70 GPNSTDVPTVDIAMSQGFKDINTAALGYTYSIEEIGFAMLNN-VNLDAERGQAVRDVVEQGLNKIYLLGDKGVGEGLYTS 148 (311) Q Consensus 70 ~~~a~dip~v~~~~~~~~~~v~~~~~~~~~~~~El~~a~~~g-~~l~~~k~~aa~~~~~~~~n~~~~~G~~~~g~GllN~ 148 (311) .- ..++..=+++.++.+..+-. ..++.+.++++...+..+ +.....-.+.++..+.-.+|...|---.+.. | T Consensus 80 ~R-~~g~~~g~vt~~~~t~tidq-dR~~~F~VD~~D~~Etn~~l~a~~i~~~~~~~~v~PEiDay~~skla~~a-~---- 152 (319) T protein:vir:94 80 KR-NATNEFDHPKIEETTYFLDQ-EKYWGRFVDALDRKDTEGNIDINYVVARQGAEVVAPYLDNLRFATLARNK-A---- 152 (319) T ss_pred cC-CCCcccCCcccceeEEEeec-ccccccccchhhHhhhhchhhHHHHHHHHHHHHhhhhhhHHHHHHHHhhc-c---- Confidence 11 11222224445555554433 677888888888766533 2222333444555555555554333211100 0 Q ss_pred CCcceeeccCCccccCcccccCCHHHHHHHHHHHHHHHHhccCCceecceEEEeCHHHHHHHhcc-cccCCCCCcchHH- Q lcl|NC_019522. 149 PNVSVEAATSTFVALVAAIPTNGTQPIIDFFGNAYNTVYLDNTLTVHRPNTFVLPPAQFQLLART-LLSTQNASNVTLL- 226 (311) Q Consensus 149 p~v~~~~~~~~~~~~~t~w~~~t~~ei~~di~~~~~~~~~~~~~~~~~p~~l~lpp~~~~~L~~~-~~~~~~~~~~Tvl- 226 (311) ... =...|++-+++.|.++..++.... +-....|+++|..+..|.+- ....+...+.+++ T Consensus 153 ------------~~~---~~~~t~~n~y~~i~~a~~~Lde~~---VP~~Rvl~Vtp~~~~~L~~~~~f~~~~~~~~~~~~ 214 (319) T protein:vir:94 153 ------------KHL---TVGTGSDAQYDAVLDVSVELDEIK---APENRVLFVSPTFYKGIKKFVIALPQGDTRQQVLG 214 (319) T ss_pred ------------ccc---ccccCHHHHHHHHHHHHHHHHhcC---CCCCcEEEeCHHHHHHHHhhhhhhcccccccccee Confidence 000 012366778999999999886542 32346799999999999431 1111111111111 Q ss_pred HHHHHhCCceEEEEchhcccCCCCcccEEEEEEcCcceeEEeec-chhhhccceeeCCceEEEeeeeeeeeEEEECCeEE Q lcl|NC_019522. 227 QFLRTNFPDITFEDDILLKGAGVAGADRMAVYKKEIRIVKGHDV-MPLRFLAPATADNVNFKVPAILRTGGTEWRIPKAG 305 (311) Q Consensus 227 ~~l~~n~~~l~i~~~~~l~~ag~~g~~~~v~y~~~~~~~~~~~~-~~~~~~~p~~~~~~~~~~~~~~~~gGv~i~~P~ai 305 (311) +-.-...-...|..+|.-... +.+- ++... ..+.+..= ...+...|.+... .+.+.+ -.++|+.|.+|... T Consensus 215 ~g~Vg~idG~~Vi~vps~~~k---~in~-i~~h~--~A~~~~~k~~~~~~~~p~~~~~-a~~v~g-r~y~d~~V~~~k~~ 286 (319) T protein:vir:94 215 KGVQGELDGFVIVKVPTKLLQ---GLQA-IAVVG--EVLASPIQADLAKTNSNIPGMF-GTLAEQ-LLYTGAFVPEHLQK 286 (319) T ss_pred eeeceeecCeEEEEecccccc---cceE-EEEcC--CeeeeeeeeeeeeccCCCcccc-ceeeee-eeeeeeEEeccccc Confidence 000001112445555532111 1222 22211 11111000 0112222323222 355554 56789999999865 Q ss_pred EEeecC Q lcl|NC_019522. 306 HYVDGV 311 (311) Q Consensus 306 ~~~dGI 311 (311) +..... T Consensus 287 ~Iy~~~ 292 (319) T protein:vir:94 287 YIFTIG 292 (319) T ss_pred eEEEee Confidence 554444 No 152 >protein:vir:105822 Length: 273 # NCBI annotation: gp6 # Family: family:all:2203 # MgeID: mge:1636 # MgeName: PMC # Cross-refs: genbank:acc:YP_655767;genbank:gi:109522090;genbank:GeneID:4157630 Probab=75.05 E-value=0.15 Score=25.08 Aligned_cols=262 Identities=11% Similarity=0.068 Sum_probs=116.2 Q ss_pred CCcccccccchhhhhhhHHHHHHHHHHHHhhhhhhhhhhhhccccC--CCCcceeEEEEEEeecccceEEecCcccccce Q lcl|NC_019522. 1 MAKSVFDVSPVSALSFLVNQAAHIESEIYRIEYPQFKYGTLLPLDN--SAPDWAQAVMFRSIDARGELQLFGPNSTDVPT 78 (311) Q Consensus 1 ~~~~~~~~~~~~~~~fl~~~L~~id~~v~~~~~~~~~~~~~~~v~~--~~~~~~~~~~~~~~~~~G~a~~~~~~a~dip~ 78 (311) ||- .++.. +.+-..+.+.....+....++.... .+..| .++.++.....+.+..... ...++. T Consensus 1 MA~-----------~~~~p--e~~~~~v~~~~~~~lv~~~l~~~~~~~~~~~G-dtv~ip~~~~~~~~d~~~~-~~~~~~ 65 (273) T protein:vir:10 1 MAF-----------NNFIP--ELWSDMLLEEWTAQTVFANLVNREYEGTASKG-NVVHIAGVVAPTVKDYKAA-GRQTSA 65 (273) T ss_pred Ccc-----------hhhhH--HHHHHHHHHHHHhhhccchhhccccccccccC-ceEEEeecccccccccccC-CCccCc Confidence 221 12222 3344555555555566666655432 23333 4777776555443322111 111222 Q ss_pred eeeeccceeEEEEE-EEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhhheeeeeccccceeeeecCCcceeecc Q lcl|NC_019522. 79 VDIAMSQGFKDINT-AALGYTYSIEEIGFAMLNNVNLDAERGQAVRDVVEQGLNKIYLLGDKGVGEGLYTSPNVSVEAAT 157 (311) Q Consensus 79 v~~~~~~~~~~v~~-~~~~~~~~~~El~~a~~~g~~l~~~k~~aa~~~~~~~~n~~~~~G~~~~g~GllN~p~v~~~~~~ 157 (311) -+.+.+.....+-. ...++.++ ++++.+.. .++.. -.+.+..+++...|+.++- ++..-+ T Consensus 66 ~~~~~~~~~~tid~~~~~~~~i~--d~d~~~~~-~~~~~-~~~~~~~alA~~vD~~i~~--------~~~~a~------- 126 (273) T protein:vir:10 66 DAISDTGVDLLIDQEKSIDFLVD--DIDRVQVA-GSLEA-YTRAGATALATDTDKFIAD--------MLVDNG------- 126 (273) T ss_pred cccccceEEEEEeeeeecceEee--cHHHhhhh-ccHHH-HHHHHHHHHHHHHHHHHHH--------HHhccc------- Confidence 23333444444422 34555555 44444443 35643 5566677888887765541 110000 Q ss_pred CCccccCcccccCCHHHHHHHHHHHHHHHHhccCCce-ecceEEEeCHHHHHHHhcc--cccCCCCC-cchHH-HHHHHh Q lcl|NC_019522. 158 STFVALVAAIPTNGTQPIIDFFGNAYNTVYLDNTLTV-HRPNTFVLPPAQFQLLART--LLSTQNAS-NVTLL-QFLRTN 232 (311) Q Consensus 158 ~~~~~~~t~w~~~t~~ei~~di~~~~~~~~~~~~~~~-~~p~~l~lpp~~~~~L~~~--~~~~~~~~-~~Tvl-~~l~~n 232 (311) +...+. ..-++..+++.|.++..++... .+ .....|+++|..+..|-+- .....+.. +...+ +-...+ T Consensus 127 -~~~~~~---~~~~~~~~~~~i~~a~~~ld~~---~vP~~~R~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~~G~ig~ 199 (273) T protein:vir:10 127 -TALTGS---APTDADDAFDLIAKALKELTKA---NVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGN 199 (273) T ss_pred -cccccc---cccchhHHHHHHHHHHHHhhhc---CCCcCCCEEEECHHHHHHHhcchhhhhhhhccccccceeeeeeeE Confidence 000000 1225667788888888887443 22 1235799999999988431 12111111 11110 000011 Q ss_pred CCceEEEEchhcccCCCCcccEEEEEEcCcceeEEeecc-hhhhccceeeCCceEEEeeeeeeeeEEEECCeEEEEeecC Q lcl|NC_019522. 233 FPDITFEDDILLKGAGVAGADRMAVYKKEIRIVKGHDVM-PLRFLAPATADNVNFKVPAILRTGGTEWRIPKAGHYVDGV 311 (311) Q Consensus 233 ~~~l~i~~~~~l~~ag~~g~~~~v~y~~~~~~~~~~~~~-~~~~~~p~~~~~~~~~~~~~~~~gGv~i~~P~ai~~~dGI 311 (311) .-.++|.....|.. + ....++++.++- +.+..-. .+..+. ..+.....+..... .|+.+.||.+++.+.-= T Consensus 200 i~G~~v~~s~~lp~-~--~~~~~~~~~~~A--~~~a~q~~~~e~~r--~~~~~~~~v~~~~~-yg~~v~~~~~~~~l~~~ 271 (273) T protein:vir:10 200 LLGARIVESNNLRD-T--DDEQFVAFHPSA--AAYVSQIDTVEALR--DQDSFSDRIRALHV-YGGKVVRPTGVVVFNKT 271 (273) T ss_pred EeceEEEEeccccc-C--CccEEEEEeccc--eeeeeeeehhhccc--CCCcceeeeeeeee-eeeeEeccceEEEEecc Confidence 12345555544432 1 112344444322 2221100 111111 11222333444344 47889999998886544 No 153 >protein:vir:102605 Length: 273 # NCBI annotation: gp6 # Family: family:all:2203 # MgeID: mge:1661 # MgeName: Llij # Cross-refs: genbank:acc:YP_655002;genbank:gi:109392192;genbank:GeneID:4157227 Probab=75.05 E-value=0.15 Score=25.08 Aligned_cols=262 Identities=11% Similarity=0.068 Sum_probs=116.2 Q ss_pred CCcccccccchhhhhhhHHHHHHHHHHHHhhhhhhhhhhhhccccC--CCCcceeEEEEEEeecccceEEecCcccccce Q lcl|NC_019522. 1 MAKSVFDVSPVSALSFLVNQAAHIESEIYRIEYPQFKYGTLLPLDN--SAPDWAQAVMFRSIDARGELQLFGPNSTDVPT 78 (311) Q Consensus 1 ~~~~~~~~~~~~~~~fl~~~L~~id~~v~~~~~~~~~~~~~~~v~~--~~~~~~~~~~~~~~~~~G~a~~~~~~a~dip~ 78 (311) ||- .++.. +.+-..+.+.....+....++.... .+..| .++.++.....+.+..... ...++. T Consensus 1 MA~-----------~~~~p--e~~~~~v~~~~~~~lv~~~l~~~~~~~~~~~G-dtv~ip~~~~~~~~d~~~~-~~~~~~ 65 (273) T protein:vir:10 1 MAF-----------NNFIP--ELWSDMLLEEWTAQTVFANLVNREYEGTASKG-NVVHIAGVVAPTVKDYKAA-GRQTSA 65 (273) T ss_pred Ccc-----------hhhhH--HHHHHHHHHHHHhhhccchhhccccccccccC-ceEEEeecccccccccccC-CCccCc Confidence 221 12222 3344555555555566666655432 23333 4777776555443322111 111222 Q ss_pred eeeeccceeEEEEE-EEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhhheeeeeccccceeeeecCCcceeecc Q lcl|NC_019522. 79 VDIAMSQGFKDINT-AALGYTYSIEEIGFAMLNNVNLDAERGQAVRDVVEQGLNKIYLLGDKGVGEGLYTSPNVSVEAAT 157 (311) Q Consensus 79 v~~~~~~~~~~v~~-~~~~~~~~~~El~~a~~~g~~l~~~k~~aa~~~~~~~~n~~~~~G~~~~g~GllN~p~v~~~~~~ 157 (311) -+.+.+.....+-. ...++.++ ++++.+.. .++.. -.+.+..+++...|+.++- ++..-+ T Consensus 66 ~~~~~~~~~~tid~~~~~~~~i~--d~d~~~~~-~~~~~-~~~~~~~alA~~vD~~i~~--------~~~~a~------- 126 (273) T protein:vir:10 66 DAISDTGVDLLIDQEKSIDFLVD--DIDRVQVA-GSLEA-YTRAGATALATDTDKFIAD--------MLVDNG------- 126 (273) T ss_pred cccccceEEEEEeeeeecceEee--cHHHhhhh-ccHHH-HHHHHHHHHHHHHHHHHHH--------HHhccc------- Confidence 23333444444422 34555555 44444443 35643 5566677888887765541 110000 Q ss_pred CCccccCcccccCCHHHHHHHHHHHHHHHHhccCCce-ecceEEEeCHHHHHHHhcc--cccCCCCC-cchHH-HHHHHh Q lcl|NC_019522. 158 STFVALVAAIPTNGTQPIIDFFGNAYNTVYLDNTLTV-HRPNTFVLPPAQFQLLART--LLSTQNAS-NVTLL-QFLRTN 232 (311) Q Consensus 158 ~~~~~~~t~w~~~t~~ei~~di~~~~~~~~~~~~~~~-~~p~~l~lpp~~~~~L~~~--~~~~~~~~-~~Tvl-~~l~~n 232 (311) +...+. ..-++..+++.|.++..++... .+ .....|+++|..+..|-+- .....+.. +...+ +-...+ T Consensus 127 -~~~~~~---~~~~~~~~~~~i~~a~~~ld~~---~vP~~~R~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~~G~ig~ 199 (273) T protein:vir:10 127 -TALTGS---APTDADDAFDLIAKALKELTKA---NVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGN 199 (273) T ss_pred -cccccc---cccchhHHHHHHHHHHHHhhhc---CCCcCCCEEEECHHHHHHHhcchhhhhhhhccccccceeeeeeeE Confidence 000000 1225667788888888887443 22 1235799999999988431 12111111 11110 000011 Q ss_pred CCceEEEEchhcccCCCCcccEEEEEEcCcceeEEeecc-hhhhccceeeCCceEEEeeeeeeeeEEEECCeEEEEeecC Q lcl|NC_019522. 233 FPDITFEDDILLKGAGVAGADRMAVYKKEIRIVKGHDVM-PLRFLAPATADNVNFKVPAILRTGGTEWRIPKAGHYVDGV 311 (311) Q Consensus 233 ~~~l~i~~~~~l~~ag~~g~~~~v~y~~~~~~~~~~~~~-~~~~~~p~~~~~~~~~~~~~~~~gGv~i~~P~ai~~~dGI 311 (311) .-.++|.....|.. + ....++++.++- +.+..-. .+..+. ..+.....+..... .|+.+.||.+++.+.-= T Consensus 200 i~G~~v~~s~~lp~-~--~~~~~~~~~~~A--~~~a~q~~~~e~~r--~~~~~~~~v~~~~~-yg~~v~~~~~~~~l~~~ 271 (273) T protein:vir:10 200 LLGARIVESNNLRD-T--DDEQFVAFHPSA--AAYVSQIDTVEALR--DQDSFSDRIRALHV-YGGKVVRPTGVVVFNKT 271 (273) T ss_pred EeceEEEEeccccc-C--CccEEEEEeccc--eeeeeeeehhhccc--CCCcceeeeeeeee-eeeeEeccceEEEEecc Confidence 12345555544432 1 112344444322 2221100 111111 11222333444344 47889999998886544 No 154 >protein:vir:2201 Length: 345 # NCBI annotation: major capsid protein # Family: family:all:975 # MgeID: mge:49 # MgeName: T7 # Cross-refs: genbank:acc:NP_041998;swissprot:sw:p19726;genbank:gi:9627469;goa:P19726;uniprot:P19726;genbank:GeneID:1261026 Probab=74.71 E-value=0.16 Score=25.02 Aligned_cols=288 Identities=15% Similarity=0.093 Sum_probs=119.4 Q ss_pred CCcc------cccccch-----hhhh-hhHHHHHHHHHHHHhhhhhhhhhhhhccccCCCCcceeEEEEEEeecccceEE Q lcl|NC_019522. 1 MAKS------VFDVSPV-----SALS-FLVNQAAHIESEIYRIEYPQFKYGTLLPLDNSAPDWAQAVMFRSIDARGELQL 68 (311) Q Consensus 1 ~~~~------~~~~~~~-----~~~~-fl~~~L~~id~~v~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~G~a~~ 68 (311) |+.. ..++.+. +..+ |+....-.++..+-+. -..+.++.+++ +. +..++.|.. .|.++. T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~al~le~f~geV~~~f~~~----s~~~~~~~~r~-i~-~gks~~~~~---iG~~~~ 71 (345) T protein:vir:22 1 MASMTGGQQMGTNQGKGVVAAGDKLALFLKVFGGEVLTAFART----SVTTSRHMVRS-IS-SGKSAQFPV---LGRTQA 71 (345) T ss_pred CcccccchhcccccccccccCCchhHHHHHHHhHHHHHHHHHH----hhhcccceeee-cc-ccceEEEee---ecceEE Confidence 4432 2222222 1122 3433333444443322 23445555542 22 344555443 354443 Q ss_pred --ecCccccccee--eeeccceeEEEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhhheeee--ec---- Q lcl|NC_019522. 69 --FGPNSTDVPTV--DIAMSQGFKDINTAALGYTYSIEEIGFAMLNNVNLDAERGQAVRDVVEQGLNKIYLL--GD---- 138 (311) Q Consensus 69 --~~~~a~dip~v--~~~~~~~~~~v~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aa~~~~~~~~n~~~~~--G~---- 138 (311) +..+ +.+... +....+....|=. ..-+..-+.++.. .++..++-..-...+..++++..|+.++. +. T Consensus 72 ~~~~~G-~~l~~~~~~~~~~e~~ltID~-~~y~~~~VddiD~-~q~~~D~r~~~s~~~G~aLA~~~D~~i~~~l~k~a~~ 148 (345) T protein:vir:22 72 AYLAPG-ENLDDKRKDIKHTEKVITIDG-LLTADVLIYDIED-AMNHYDVRSEYTSQLGESLAMAADGAVLAEIAGLCNV 148 (345) T ss_pred EeeecC-CCCCCCCCCcccceEEEEecc-hhhhhhhHhhHHH-HhcCchhHHHHHHHHHHHHHHHHHHHHHHHHHHhhcc Confidence 2222 222111 1122221111100 1122334456665 34556677778888888888888887662 10 Q ss_pred c--ccc-e-eeeecCCcceeeccCCccccCcccccCCHHHHHHHHHHHHHHHHhccCCcee-cceEEEeCHHHHHHHhcc Q lcl|NC_019522. 139 K--GVG-E-GLYTSPNVSVEAATSTFVALVAAIPTNGTQPIIDFFGNAYNTVYLDNTLTVH-RPNTFVLPPAQFQLLART 213 (311) Q Consensus 139 ~--~~g-~-GllN~p~v~~~~~~~~~~~~~t~w~~~t~~ei~~di~~~~~~~~~~~~~~~~-~p~~l~lpp~~~~~L~~~ 213 (311) + ..+ . |+-+-..+........ . .-...+++.+++-|.++..++.+.+ +- .-..+++||+.|..|-.- T Consensus 149 ~~~~~~~~~~~~~~~~~~~~~~g~~---~--t~~~~~~~~~~~ai~~a~~~Lde~~---VP~~~R~~vv~P~~y~~Ll~~ 220 (345) T protein:vir:22 149 ESKYNENIEGLGTATVIETTQNKAA---L--TDQVALGKEIIAALTKARAALTKNY---VPAADRVFYCDPDSYSAILAA 220 (345) T ss_pred ccccccccccccccccccccccccc---c--cccccCHHHHHHHHHHHHHHhhhcC---CCccCCEEEeChHHHHHHhcc Confidence 0 011 1 2211111111111111 0 1123467888888888888775432 21 125799999999998542 Q ss_pred c-ccCCCC-CcchHHHHHHHhCCceEEEEchhcccCCC---------------Cccc-----------EEEEEEcCccee Q lcl|NC_019522. 214 L-LSTQNA-SNVTLLQFLRTNFPDITFEDDILLKGAGV---------------AGAD-----------RMAVYKKEIRIV 265 (311) Q Consensus 214 ~-~~~~~~-~~~Tvl~~l~~n~~~l~i~~~~~l~~ag~---------------~g~~-----------~~v~y~~~~~~~ 265 (311) . +...+. .+....+-...+--.++|+.++.|...+. .+++ ++++|.+ +-+ T Consensus 221 ~~~~~~~~~~~~~~~~G~V~~i~G~~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~l~~h~--~A~ 298 (345) T protein:vir:22 221 LMPNAANYAALIDPEKGSIRNVMGFEVVEVPHLTAGGAGTAREGTTGQKHVFPANKGEGNVKVAKDNVIGLFMHR--SAV 298 (345) T ss_pred ccccccccccccccccceEEEEeceEEEecccccccccCccccCcccccccccccccceeeeeccCceEEEEEeh--hhe Confidence 1 111111 11111110000111345666655421100 0111 1122222 222 Q ss_pred EEeecchhhhccceeeCCceEEEeeeeeeeeEEEECCeEEEEee-cC Q lcl|NC_019522. 266 KGHDVMPLRFLAPATADNVNFKVPAILRTGGTEWRIPKAGHYVD-GV 311 (311) Q Consensus 266 ~~~~~~~~~~~~p~~~~~~~~~~~~~~~~gGv~i~~P~ai~~~d-GI 311 (311) ...-.++++.-.-...+...+.+.+.... |+.+.||++++.+. -| T Consensus 299 ~~v~~~~~~~e~~r~~~~~~d~I~~~~a~-G~~vlRPeaa~~i~~~~ 344 (345) T protein:vir:22 299 GTVKLRDLALERARRANFQADQIIAKYAM-GHGGLRPEAAGAVVFKV 344 (345) T ss_pred eeeeeecceeeeeechhHHHHHHHHHHhc-CCcccccceeEEEEEee Confidence 22112222111111222333445555554 78899999887654 23 No 155 >protein:vir:6324 Length: 335 # NCBI annotation: capsid protein # Family: family:all:2806 # MgeID: mge:132 # MgeName: phiKMV # Cross-refs: genbank:acc:NP_877471;genbank:gi:33300843;uniprot:Q7Y2D3;genbank:GeneID:1482613 Probab=72.22 E-value=0.19 Score=24.60 Aligned_cols=287 Identities=11% Similarity=0.097 Sum_probs=127.4 Q ss_pred CCccccccc-----chhhhh-hhHHHHHHHHHHHHhhhhhhhhhhhhccccCCCCcceeEEEEEEeecccceEEe----c Q lcl|NC_019522. 1 MAKSVFDVS-----PVSALS-FLVNQAAHIESEIYRIEYPQFKYGTLLPLDNSAPDWAQAVMFRSIDARGELQLF----G 70 (311) Q Consensus 1 ~~~~~~~~~-----~~~~~~-fl~~~L~~id~~v~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~G~a~~~----~ 70 (311) |.-+.=-+- +.+..+ |+. ...-+|.+.-...-..+.+..+++- -+..++.|... |+.+.. | T Consensus 1 ms~~~~~tr~~~~~s~~d~al~le----~f~geV~~af~~~s~~~~~~~~rti--~~g~s~~~~~i---G~~~~~~~~pG 71 (335) T protein:vir:63 1 MSFLNDLTRPNYAGKNADVDIHLE----EHLGIVDKHFAYTSKFAPLMNIRDL--RGSNVVRLDRL---GNVEAKGRRAG 71 (335) T ss_pred CCCcccchhhhcccccchhheehh----hhhhhHHHHHHhhhhhccccceeee--ccceeEEEeee---eeeeeecccCC Confidence 433321111 111112 433 2333333332233355566666543 22445554443 544432 1 Q ss_pred CcccccceeeeeccceeEEEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhhheee------eeccc-cc- Q lcl|NC_019522. 71 PNSTDVPTVDIAMSQGFKDINTAALGYTYSIEEIGFAMLNNVNLDAERGQAVRDVVEQGLNKIYL------LGDKG-VG- 142 (311) Q Consensus 71 ~~a~dip~v~~~~~~~~~~v~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aa~~~~~~~~n~~~~------~G~~~-~g- 142 (311) ..-++-|... +-...+..-..+.-.+ +.++.. .+...++-.+-.+..-.++++..|+.+| .+... .. T Consensus 72 ~~l~~~~~~~-~k~~itVD~ll~a~~~---I~dlDe-~~~~yDvRse~s~e~G~aLA~~~D~~~~~~i~~aa~~~a~~~~ 146 (335) T protein:vir:63 72 EELERSRVVN-DKWNLTVDTLLYLRHQ---FDHQDE-WTQSFDMRKEVAELDGQELARKFDQACLIQVIKAAAMDAPVDL 146 (335) T ss_pred cCcCCCCccc-cceEEEecceeechhh---hhhHHH-HhcCchhHHHHHHHHHHHHHHHHHHHHHHHHHhhccccCcccc Confidence 1112222211 1111111222222222 344444 2345566677777788888888888665 12111 11 Q ss_pred eeeeecCCcceeeccCCccccCcccccCCHHHHHHHHHHHHHHHHhccCCc-eecceEEEeCHHHHHHHhcc--cccCC- Q lcl|NC_019522. 143 EGLYTSPNVSVEAATSTFVALVAAIPTNGTQPIIDFFGNAYNTVYLDNTLT-VHRPNTFVLPPAQFQLLART--LLSTQ- 218 (311) Q Consensus 143 ~GllN~p~v~~~~~~~~~~~~~t~w~~~t~~ei~~di~~~~~~~~~~~~~~-~~~p~~l~lpp~~~~~L~~~--~~~~~- 218 (311) .|.++ ||+......++.+ ...+++.+.+-+.++..++.++.=-. ...+..++++|+.|..|.+- .++.. T Consensus 147 ~~~~~-~G~~~~~~~tg~~------~~~~~~~l~~a~~~a~~~L~e~dVP~~~~~dr~~vv~P~~y~~Ll~~~~l~n~~~ 219 (335) T protein:vir:63 147 EDAFS-PGVLEKLDLTGLT------AKQAADKIVRMHRRVVETFIDRDLGDAVYSEGLTPMSPRVFSLLLEHDKLMNVEY 219 (335) T ss_pred CCCcC-CCcceeeeeccCc------ccccHHHHHHHHHHHHHHHHhccCCCcccCceEEEeChHHHHHHhcccccccccc Confidence 23332 3333322222111 12358888888888888886432110 01236899999999998552 22210 Q ss_pred -C--CCcchHHHHHHHhCCceEEEEchhcccCCCC----c----------ccEEEEEEcCcceeEEeecchhhhccceee Q lcl|NC_019522. 219 -N--ASNVTLLQFLRTNFPDITFEDDILLKGAGVA----G----------ADRMAVYKKEIRIVKGHDVMPLRFLAPATA 281 (311) Q Consensus 219 -~--~~~~Tvl~~l~~n~~~l~i~~~~~l~~ag~~----g----------~~~~v~y~~~~~~~~~~~~~~~~~~~p~~~ 281 (311) + ..+..+...+.. --.++|+.++.|-+.+.. | +.++.+ -..++-+...-.++++.-.-.+. T Consensus 220 ~~s~~~~~~~~g~v~~-v~Gv~V~~sn~lP~~~~t~~~lg~a~n~~~~d~~~~~~~-~~~~~Al~t~~~~~vt~e~~~~~ 297 (335) T protein:vir:63 220 QATGATNDYVKSRVAI-LNGVKVLETPRFATKAIAAHPLGRHFNVSAEESERQIAL-FLPSKTLITAQVAPVQAKLWEDN 297 (335) T ss_pred ccccccccccCceeEE-eeceEEEeeccCCCCCcccccccccCCccccccceeEEE-EEecceEEEEEEeecccceeecc Confidence 0 000111111111 123567777777432111 1 122322 22333333333344432111234 Q ss_pred CCceEEEeeeeeeeeEEEECCeEE--EEeecC Q lcl|NC_019522. 282 DNVNFKVPAILRTGGTEWRIPKAG--HYVDGV 311 (311) Q Consensus 282 ~~~~~~~~~~~~~gGv~i~~P~ai--~~~dGI 311 (311) +...+.+.+.... |+-++||++. ....|| T Consensus 298 ~~~~~~i~~~~a~-G~g~lRPe~a~~i~~tg~ 328 (335) T protein:vir:63 298 EKFSWVLDTFQMY-NIGARRPDTAGAIELKGI 328 (335) T ss_pred chhhHHhHHHHHc-CCcccccceEEEEEEcCC Confidence 4455666666664 7999999665 456788 No 156 >protein:vir:105645 Length: 400 # NCBI annotation: putative major capsid protein # Family: family:all:2806 # MgeID: mge:1674 # MgeName: K1E # Cross-refs: genbank:acc:YP_425009;genbank:gi:83571757;uniprot:Q2WC43;genbank:GeneID:3837286 Probab=66.28 E-value=0.27 Score=23.70 Aligned_cols=284 Identities=8% Similarity=0.015 Sum_probs=126.3 Q ss_pred CCcccccccc-----hhhhh-hhHHHHHHHHHHHHhhhhhhhhhhhhccccCCCCcceeEEEEEEeecccceEEec-Ccc Q lcl|NC_019522. 1 MAKSVFDVSP-----VSALS-FLVNQAAHIESEIYRIEYPQFKYGTLLPLDNSAPDWAQAVMFRSIDARGELQLFG-PNS 73 (311) Q Consensus 1 ~~~~~~~~~~-----~~~~~-fl~~~L~~id~~v~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~G~a~~~~-~~a 73 (311) |.-+.-.+.+ .+..+ ||...+-.++..+-+. -..+.++.+++ +. +..++.|.. .|+.+.-. ..+ T Consensus 1 Ms~~n~~t~p~~~gsg~~~aL~Le~f~GeV~taF~~~----si~~~~~~vRt-I~-~gkS~qf~~---lG~s~a~y~~pG 71 (400) T protein:vir:10 1 MSTPNNLTNVAVSASGEVDSLLIEKFNGKVNEQYLKG----ENIMSYFDVQT-VT-GTNTVSNKY---LGETELQVLAPG 71 (400) T ss_pred CCCCccccccccccccchhhhHHhHhcchHHHHHHHH----hhhcccceeee-ec-ccceEEEEE---eeeeEEeeecCC Confidence 4433222222 11122 4333333444433221 23345555553 11 233444433 34444211 111 Q ss_pred cccceeeeeccce--eEEEEEEEEEEEecHHHHHHHHHhCCC-hHHHHHHHHHHHHHHhhhheee----e-eccc----c Q lcl|NC_019522. 74 TDVPTVDIAMSQG--FKDINTAALGYTYSIEEIGFAMLNNVN-LDAERGQAVRDVVEQGLNKIYL----L-GDKG----V 141 (311) Q Consensus 74 ~dip~v~~~~~~~--~~~v~~~~~~~~~~~~El~~a~~~g~~-l~~~k~~aa~~~~~~~~n~~~~----~-G~~~----~ 141 (311) +.+-...+.-++. +..-.++.-.+-|.++|.. ...+ +..+-....-.+++++.|+.++ . |-+. . T Consensus 72 ~~ldg~~~~~dk~~ItIDtLL~a~~~V~dlDd~q----~~yD~vRse~s~e~G~ALA~~~Dq~iiq~i~~a~~a~t~~~~ 147 (400) T protein:vir:10 72 QSPAATSTQADKNQLVIDATVIARNTVAHLHDVQ----GDIDSLKPKLATNQAKQLKKMEDEMLIQQMLLGGIANTQAKR 147 (400) T ss_pred CCcCCCCcccCcEEEEeCceeeecchhhhHHHHh----hccccccHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccc Confidence 1111111222222 2223344444445555443 3444 4555555666666676666433 2 2111 1 Q ss_pred c-eeeeecCCcceeeccCCccccCcccccCCHHHHHHHHHHHHHHHHhccCCceecceEEEeCHHHHHHHhcc--cccCC Q lcl|NC_019522. 142 G-EGLYTSPNVSVEAATSTFVALVAAIPTNGTQPIIDFFGNAYNTVYLDNTLTVHRPNTFVLPPAQFQLLART--LLSTQ 218 (311) Q Consensus 142 g-~GllN~p~v~~~~~~~~~~~~~t~w~~~t~~ei~~di~~~~~~~~~~~~~~~~~p~~l~lpp~~~~~L~~~--~~~~~ 218 (311) + .|..-++.....+.. +.-...+++++...|.++...+.+..= - ..-..+++||..|..|..- +++. T Consensus 148 ~~~~g~~~g~s~~v~~~-------~~~~~~~~~~l~~A~~~A~~~LdEkdV-P-~~d~vvl~pp~~Ys~Ll~~dkLvnr- 217 (400) T protein:vir:10 148 TNPRVKGHGFSVNVEVN-------EGEALVNPQYVMAAVEFALEQQLEQEV-D-ISDVAILMPWRYFNVLRDADRIVDK- 217 (400) T ss_pred ccCCccccccceeeccc-------ccccccCHHHHHHHHHHHHHHHHhcCC-C-ccceEEEcCHHHHHHHHhCCcccch- Confidence 1 222222221111111 112234789999999998888864321 1 1235788899999777542 3321 Q ss_pred CC----CcchHHHHHHHhCCceEEEEchhccc-C--------------------CCCcccEEEEEEcCcceeEEeecchh Q lcl|NC_019522. 219 NA----SNVTLLQFLRTNFPDITFEDDILLKG-A--------------------GVAGADRMAVYKKEIRIVKGHDVMPL 273 (311) Q Consensus 219 ~~----~~~Tvl~~l~~n~~~l~i~~~~~l~~-a--------------------g~~g~~~~v~y~~~~~~~~~~~~~~~ 273 (311) +. ++..+...+. +--.++|+.++.|-. + +...+-++++|.++. +.-.=.+|+ T Consensus 218 df~~s~~g~~~~g~v~-~v~Gv~Iv~Sn~lP~~a~~~~~~~lS~a~~G~~y~~t~d~s~~~av~F~~sA--v~tvk~~~l 294 (400) T protein:vir:10 218 SYTISQSGATIQGFVL-SSYNCPVIPSNRFPKYSQGQKHHLLSNEDNGYRYDPIAEMNGAIAVLFTADA--LLVGRSIDV 294 (400) T ss_pred hccccCCCccccceEE-EEeceEEEeeCcCCcccCcccccccccCCCCccCCccccccceeEEEEehhh--eEEEEeecc Confidence 11 1111222221 122466777776632 1 111234556665542 221112343 Q ss_pred hhccceeeCCceEEEeeeeeeeeEEEECCeEEEEeecC Q lcl|NC_019522. 274 RFLAPATADNVNFKVPAILRTGGTEWRIPKAGHYVDGV 311 (311) Q Consensus 274 ~~~~p~~~~~~~~~~~~~~~~gGv~i~~P~ai~~~dGI 311 (311) +.-.-.+++...+.+.+...+ |+..+||.++..+.=- T Consensus 295 t~~~~~d~r~~~~~id~~~a~-G~g~~RPeaa~vv~~~ 331 (400) T protein:vir:10 295 IGDIFYEKKEKTYYIDTFMSE-GAIPDRWEAVSVVTTK 331 (400) T ss_pred ccccccchhhHHHHHHHHHHh-CCcccchhheEEEEec Confidence 221113455566667776665 7999999999887654 No 157 >protein:vir:99311 Length: 463 # NCBI annotation: putative capsid protein # Family: family:all:2450 # MgeID: mge:1655 # MgeName: K # Cross-refs: genbank:acc:YP_024474;genbank:gi:48696433;genbank:GeneID:2948039 Probab=66.12 E-value=0.27 Score=23.68 Aligned_cols=260 Identities=10% Similarity=-0.004 Sum_probs=116.1 Q ss_pred CCcccccccchhhhhhhHHHHHHHHHHHHhhhh--hhhhhhhhccccCCCCcceeEE-EEEEeecccceEEecCcccccc Q lcl|NC_019522. 1 MAKSVFDVSPVSALSFLVNQAAHIESEIYRIEY--PQFKYGTLLPLDNSAPDWAQAV-MFRSIDARGELQLFGPNSTDVP 77 (311) Q Consensus 1 ~~~~~~~~~~~~~~~fl~~~L~~id~~v~~~~~--~~~~~~~~~~v~~~~~~~~~~~-~~~~~~~~G~a~~~~~~a~dip 77 (311) -++..-+-+++-+++-|.+ +.+|+.|..--+ .+++.-..++- .+.......+ .|......|.+...+.. ...+ T Consensus 27 ~tg~g~~p~~q~~~~AlR~--EsL~~~i~~Lt~~~~~f~~~~~i~k-~~a~STV~~y~~~~~~G~~g~~~f~~E~-g~~~ 102 (463) T protein:vir:99 27 QTGYGITPDTQIDAGALRR--EILDDQITMLTWTNEDLIFYRDISR-RPAQSTVVKYDQYLRHGNVGHSRFVKEI-GVAP 102 (463) T ss_pred hcCCccCCccccCcchhhh--hhhhhhhheeeecccchhhhhhcCC-chhhhhhhhheeeeccCccccccccccc-cccc Confidence 2223333345554555555 445555544332 34454444543 2222222222 22223334444444443 4457 Q ss_pred eeeeeccceeEEEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhhheeeeeccccce---e-eeecCCcce Q lcl|NC_019522. 78 TVDIAMSQGFKDINTAALGYTYSIEEIGFAMLNNVNLDAERGQAVRDVVEQGLNKIYLLGDKGVGE---G-LYTSPNVSV 153 (311) Q Consensus 78 ~v~~~~~~~~~~v~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aa~~~~~~~~n~~~~~G~~~~g~---G-llN~p~v~~ 153 (311) ..+.++.+++..+..++.....++..-. +....+......+.|...+++......||||+.+.- | -|+.-|+.. T Consensus 103 ~~d~~~~Rr~~~~K~l~~~~~VS~~~~l--~n~~~d~~~~~~~dai~~ia~tiE~a~FyGds~l~~~~~~~gleFDGl~~ 180 (463) T protein:vir:99 103 VSDPNIRQKTVSMKYVSDTKNMSIASGL--VNNIADPSQILTEDAIAVVAKTIEWASFYGDASLTSEVEGEGLEFDGLAK 180 (463) T ss_pred cCCCceEEEEEEeeeeehhhhhhhHHHh--hcccccHHHHHHHHHHHHHHHHHHHHHhhhhhccCCCcCccccchhhhhh Confidence 7888888999888888887777765444 333456777888888889999999999999998631 1 234444433 Q ss_pred eeccCCccccCcccccCCHHHHHHHHHHHHHHHHhccCCceecceEEEeCHHHHHHHhcccccCCCCCcchHHHHHHHhC Q lcl|NC_019522. 154 EAATSTFVALVAAIPTNGTQPIIDFFGNAYNTVYLDNTLTVHRPNTFVLPPAQFQLLARTLLSTQNASNVTLLQFLRTNF 233 (311) Q Consensus 154 ~~~~~~~~~~~t~w~~~t~~ei~~di~~~~~~~~~~~~~~~~~p~~l~lpp~~~~~L~~~~~~~~~~~~~Tvl~~l~~n~ 233 (311) ...+.+-- +..-.-.. .+.|+++-..+. .+.-.|+-++||......|..-.++.. ..+...+ T Consensus 181 lId~envi----DarG~~Ls--~~~ln~Aa~~i~----~~fGt~TD~~lp~~vka~f~~~~l~~q--------rv~~~~N 242 (463) T protein:vir:99 181 LIDKNNVI----NAKGNQLT--EKHLNEAAVRIG----KGFGTATDAYMPIGVHADFVNSILGRQ--------MQLMQDN 242 (463) T ss_pred hcCCCCee----ecCCCccc--HHHHhhhhhhhh----cccCChhheecchHHHHHHHHHhcCce--------EEEEcCC Confidence 32221100 00000111 233665544442 234468999999999988874332210 0011111 Q ss_pred Cc-eEE-EEchhcccCCCCcccEEEEEEcCcceeEEeecc----hhhhccceeeCCceEEEeeeeeeeeEEEECCeEEEE Q lcl|NC_019522. 234 PD-ITF-EDDILLKGAGVAGADRMAVYKKEIRIVKGHDVM----PLRFLAPATADNVNFKVPAILRTGGTEWRIPKAGHY 307 (311) Q Consensus 234 ~~-l~i-~~~~~l~~ag~~g~~~~v~y~~~~~~~~~~~~~----~~~~~~p~~~~~~~~~~~~~~~~gGv~i~~P~ai~~ 307 (311) ++ +.. ..++.+. ...-.++++-.. |..+-.-.+.-.-... -|...+. T Consensus 243 ~~~~~~G~~v~~f~--------------s~~G~I~L~~s~~m~~~~il~~~~~~~p~ap~-------------~~~~tat 295 (463) T protein:vir:99 243 SGNVNTGYSVNGFY--------------SSRGFIKLHGSTVMENELILDESLQPLPNAPQ-------------PAKVTAT 295 (463) T ss_pred CCceeeeeecccee--------------eeeeeeeeCCceecCCcccccchhhcCCCCcc-------------CceeEEE Confidence 11 100 0011111 111122221000 0000000000000000 0111111 Q ss_pred eecC Q lcl|NC_019522. 308 VDGV 311 (311) Q Consensus 308 ~dGI 311 (311) +..- T Consensus 296 v~~~ 299 (463) T protein:vir:99 296 VETK 299 (463) T ss_pred Eeec Confidence 2111 No 158 >protein:vir:95603 Length: 463 # NCBI annotation: ORF016 # Family: family:all:2450 # MgeID: mge:1577 # MgeName: G1 # Cross-refs: genbank:acc:YP_240903;genbank:gi:66394965;genbank:GeneID:5132544 Probab=66.12 E-value=0.27 Score=23.68 Aligned_cols=260 Identities=10% Similarity=-0.004 Sum_probs=116.1 Q ss_pred CCcccccccchhhhhhhHHHHHHHHHHHHhhhh--hhhhhhhhccccCCCCcceeEE-EEEEeecccceEEecCcccccc Q lcl|NC_019522. 1 MAKSVFDVSPVSALSFLVNQAAHIESEIYRIEY--PQFKYGTLLPLDNSAPDWAQAV-MFRSIDARGELQLFGPNSTDVP 77 (311) Q Consensus 1 ~~~~~~~~~~~~~~~fl~~~L~~id~~v~~~~~--~~~~~~~~~~v~~~~~~~~~~~-~~~~~~~~G~a~~~~~~a~dip 77 (311) -++..-+-+++-+++-|.+ +.+|+.|..--+ .+++.-..++- .+.......+ .|......|.+...+.. ...+ T Consensus 27 ~tg~g~~p~~q~~~~AlR~--EsL~~~i~~Lt~~~~~f~~~~~i~k-~~a~STV~~y~~~~~~G~~g~~~f~~E~-g~~~ 102 (463) T protein:vir:95 27 QTGYGITPDTQIDAGALRR--EILDDQITMLTWTNEDLIFYRDISR-RPAQSTVVKYDQYLRHGNVGHSRFVKEI-GVAP 102 (463) T ss_pred hcCCccCCccccCcchhhh--hhhhhhhheeeecccchhhhhhcCC-chhhhhhhhheeeeccCccccccccccc-cccc Confidence 2223333345554555555 445555544332 34454444543 2222222222 22223334444444443 4457 Q ss_pred eeeeeccceeEEEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhhheeeeeccccce---e-eeecCCcce Q lcl|NC_019522. 78 TVDIAMSQGFKDINTAALGYTYSIEEIGFAMLNNVNLDAERGQAVRDVVEQGLNKIYLLGDKGVGE---G-LYTSPNVSV 153 (311) Q Consensus 78 ~v~~~~~~~~~~v~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aa~~~~~~~~n~~~~~G~~~~g~---G-llN~p~v~~ 153 (311) ..+.++.+++..+..++.....++..-. +....+......+.|...+++......||||+.+.- | -|+.-|+.. T Consensus 103 ~~d~~~~Rr~~~~K~l~~~~~VS~~~~l--~n~~~d~~~~~~~dai~~ia~tiE~a~FyGds~l~~~~~~~gleFDGl~~ 180 (463) T protein:vir:95 103 VSDPNIRQKTVSMKYVSDTKNMSIASGL--VNNIADPSQILTEDAIAVVAKTIEWASFYGDASLTSEVEGEGLEFDGLAK 180 (463) T ss_pred cCCCceEEEEEEeeeeehhhhhhhHHHh--hcccccHHHHHHHHHHHHHHHHHHHHHhhhhhccCCCcCccccchhhhhh Confidence 7888888999888888887777765444 333456777888888889999999999999998631 1 234444433 Q ss_pred eeccCCccccCcccccCCHHHHHHHHHHHHHHHHhccCCceecceEEEeCHHHHHHHhcccccCCCCCcchHHHHHHHhC Q lcl|NC_019522. 154 EAATSTFVALVAAIPTNGTQPIIDFFGNAYNTVYLDNTLTVHRPNTFVLPPAQFQLLARTLLSTQNASNVTLLQFLRTNF 233 (311) Q Consensus 154 ~~~~~~~~~~~t~w~~~t~~ei~~di~~~~~~~~~~~~~~~~~p~~l~lpp~~~~~L~~~~~~~~~~~~~Tvl~~l~~n~ 233 (311) ...+.+-- +..-.-.. .+.|+++-..+. .+.-.|+-++||......|..-.++.. ..+...+ T Consensus 181 lId~envi----DarG~~Ls--~~~ln~Aa~~i~----~~fGt~TD~~lp~~vka~f~~~~l~~q--------rv~~~~N 242 (463) T protein:vir:95 181 LIDKNNVI----NAKGNQLT--EKHLNEAAVRIG----KGFGTATDAYMPIGVHADFVNSILGRQ--------MQLMQDN 242 (463) T ss_pred hcCCCCee----ecCCCccc--HHHHhhhhhhhh----cccCChhheecchHHHHHHHHHhcCce--------EEEEcCC Confidence 32221100 00000111 233665544442 234468999999999988874332210 0011111 Q ss_pred Cc-eEE-EEchhcccCCCCcccEEEEEEcCcceeEEeecc----hhhhccceeeCCceEEEeeeeeeeeEEEECCeEEEE Q lcl|NC_019522. 234 PD-ITF-EDDILLKGAGVAGADRMAVYKKEIRIVKGHDVM----PLRFLAPATADNVNFKVPAILRTGGTEWRIPKAGHY 307 (311) Q Consensus 234 ~~-l~i-~~~~~l~~ag~~g~~~~v~y~~~~~~~~~~~~~----~~~~~~p~~~~~~~~~~~~~~~~gGv~i~~P~ai~~ 307 (311) ++ +.. ..++.+. ...-.++++-.. |..+-.-.+.-.-... -|...+. T Consensus 243 ~~~~~~G~~v~~f~--------------s~~G~I~L~~s~~m~~~~il~~~~~~~p~ap~-------------~~~~tat 295 (463) T protein:vir:95 243 SGNVNTGYSVNGFY--------------SSRGFIKLHGSTVMENELILDESLQPLPNAPQ-------------PAKVTAT 295 (463) T ss_pred CCceeeeeecccee--------------eeeeeeeeCCceecCCcccccchhhcCCCCcc-------------CceeEEE Confidence 11 100 0011111 111122221000 0000000000000000 0111111 Q ss_pred eecC Q lcl|NC_019522. 308 VDGV 311 (311) Q Consensus 308 ~dGI 311 (311) +..- T Consensus 296 v~~~ 299 (463) T protein:vir:95 296 VETK 299 (463) T ss_pred Eeec Confidence 2111 No 159 >protein:vir:94576 Length: 347 # NCBI annotation: Major capsid protein # Family: family:all:975 # MgeID: mge:1516 # MgeName: Berlin # Cross-refs: genbank:acc:YP_919012;genbank:gi:119637776;genbank:GeneID:5179336 Probab=65.51 E-value=0.28 Score=23.60 Aligned_cols=293 Identities=14% Similarity=0.045 Sum_probs=123.9 Q ss_pred CCc----ccccccchhh------h-hhhHHHHHHHHHHHHhhhhhhhhhhhhccccCCCCcceeEEEEEEeecccceEEe Q lcl|NC_019522. 1 MAK----SVFDVSPVSA------L-SFLVNQAAHIESEIYRIEYPQFKYGTLLPLDNSAPDWAQAVMFRSIDARGELQLF 69 (311) Q Consensus 1 ~~~----~~~~~~~~~~------~-~fl~~~L~~id~~v~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~G~a~~~ 69 (311) ||. .-|.+....+ . -|+. ...-+|.+.-...-..+.++.+++ +- +..++.+. ..|..+.. T Consensus 1 ma~~~~~~~~~t~~g~~~~~~d~~al~ie----~~~geV~~~f~~~s~~~~~~~~rt-i~-~G~sv~~~---~iG~~~~~ 71 (347) T protein:vir:94 1 MANMNGGQQMGKDQGKGMSAGDKLALFLK----VFGGEVLTAFTRTSVTMNKHLVRS-IQ-SGKSAQFP---VLGRTKAA 71 (347) T ss_pred CCccccccccccccccCCcccchHHHHHH----HHhHHHHHHHHHHHhhhhhhhhee-cc-ccceEEee---eccceeEe Confidence 653 2222221111 1 1443 333334333333345566666543 22 23444444 34555542 Q ss_pred c-Cccccc--ceeeeeccceeEEEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhhheeee----eccc-- Q lcl|NC_019522. 70 G-PNSTDV--PTVDIAMSQGFKDINTAALGYTYSIEEIGFAMLNNVNLDAERGQAVRDVVEQGLNKIYLL----GDKG-- 140 (311) Q Consensus 70 ~-~~a~di--p~v~~~~~~~~~~v~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aa~~~~~~~~n~~~~~----G~~~-- 140 (311) . ..++++ |..+...++....|=.. .-+..-+.++..+ ++..++-+.-...+..++++..|+.++- +-.. T Consensus 72 ~~~~G~~l~~~~~~~~~~e~~ltID~~-~y~~~~VddiD~~-q~~~D~rs~~~~~~g~ALA~~~D~~i~~~l~~~a~~~~ 149 (347) T protein:vir:94 72 YLQPGENLDDKRKDMKHTEKTINIDGL-LTADVLIYDIEDA-MNHYDVRSEYTAQLGESLAMAADGAVLAEMAKLCNLPT 149 (347) T ss_pred eeecCcCCCCCcCCccccceEEEEcch-hhhhhhhhhHHHH-hcCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccc Confidence 1 112333 22233333333222111 1233345677764 4556677778888888999999876652 1110 Q ss_pred -cceeeeecCCcceeeccCCccccCcccccCCHHHHHHHHHHHHHHHHhccCCce-ecceEEEeCHHHHHHHhcccccCC Q lcl|NC_019522. 141 -VGEGLYTSPNVSVEAATSTFVALVAAIPTNGTQPIIDFFGNAYNTVYLDNTLTV-HRPNTFVLPPAQFQLLARTLLSTQ 218 (311) Q Consensus 141 -~g~GllN~p~v~~~~~~~~~~~~~t~w~~~t~~ei~~di~~~~~~~~~~~~~~~-~~p~~l~lpp~~~~~L~~~~~~~~ 218 (311) ......-.++-....... ....+.=..++++.+++-|.++...|.+.+ + ..+..++++|+.|..|-+..... T Consensus 150 ~~~~~~~g~~~~~~v~i~~--~~~~~~~~~~~~~~~~d~i~~a~~~Lde~d---VP~~~R~~vv~P~~y~~LLk~~~~~- 223 (347) T protein:vir:94 150 ANNENIAGLGKAHVLEVGD--QATLQGDQVKLGQAIIAQLTLARAKLTGNY---VPSSDRVFYTTPDNYSAILAALMPN- 223 (347) T ss_pred ccccccccCCcceeEeeec--cccccccccccHHHHHHHHHHHHHHhhhcC---CCCCCCEEEeChHHHHHHHHhhccc- Confidence 000000001100001110 000111224578888999998888885432 3 23568999999998876522111 Q ss_pred CCCcchHHHHHHH---hCCceEEEEchhcccCC--CCccc------------------EE-------EEEEcCcceeEEe Q lcl|NC_019522. 219 NASNVTLLQFLRT---NFPDITFEDDILLKGAG--VAGAD------------------RM-------AVYKKEIRIVKGH 268 (311) Q Consensus 219 ~~~~~Tvl~~l~~---n~~~l~i~~~~~l~~ag--~~g~~------------------~~-------v~y~~~~~~~~~~ 268 (311) .....++..+-.. +.-.++|..++.+...+ ..+.. .+ +.....++-+... T Consensus 224 ~~~~~~~~~~~~G~V~~v~G~~V~~Sn~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~y~~d~~~~~~l~~~~~A~~tv 303 (347) T protein:vir:94 224 AANYQALIDPSTGSIRNVMGFEVIEVPHLTAGGAGDNRAEEGVAPTNQKHAFPDTASGDTRVALDNVVGLFNHRSAVGTV 303 (347) T ss_pred ccccccccccccceeEEeeceEEEEcCccccccCcccccccccccccccccccccccccccccccceEEEEechhhhhhh Confidence 1111222221110 11234666666663210 00000 00 1111112211111 Q ss_pred ecchhhhccceeeCCceEEEeeeeeeeeEEEECCeEEEEee--cC Q lcl|NC_019522. 269 DVMPLRFLAPATADNVNFKVPAILRTGGTEWRIPKAGHYVD--GV 311 (311) Q Consensus 269 ~~~~~~~~~p~~~~~~~~~~~~~~~~gGv~i~~P~ai~~~d--GI 311 (311) -.++++.-.-...+...+.+.+.... |+-++||++.+.+. -= T Consensus 304 ~~~~~~~e~~~~~~~~~~~i~~~~a~-G~g~~rPe~a~~i~~~~a 347 (347) T protein:vir:94 304 KLKDMALERARRANFQADQIIAKYAM-GHGGLRPEACGALVFKKA 347 (347) T ss_pred hhcccceeeeechhhhhhhhhhhhhh-cCcccccceeEEEEecCC Confidence 12222111112233333455555555 68889998875321 11 No 160 >protein:vir:100851 Length: 514 # NCBI annotation: hypothetical protein # Family: family:all:2450 # MgeID: mge:1633 # MgeName: LP65 # Cross-refs: genbank:acc:YP_164744;genbank:gi:56693157;genbank:GeneID:3197484 Probab=65.25 E-value=0.19 Score=24.57 Aligned_cols=271 Identities=11% Similarity=-0.020 Sum_probs=114.7 Q ss_pred CCcccccccchhhhhhhHHHHHHHHHHHHhhhh--hhhhhhhhccccCCCCccee-EEEEEEeecccceEEecCcccccc Q lcl|NC_019522. 1 MAKSVFDVSPVSALSFLVNQAAHIESEIYRIEY--PQFKYGTLLPLDNSAPDWAQ-AVMFRSIDARGELQLFGPNSTDVP 77 (311) Q Consensus 1 ~~~~~~~~~~~~~~~fl~~~L~~id~~v~~~~~--~~~~~~~~~~v~~~~~~~~~-~~~~~~~~~~G~a~~~~~~a~dip 77 (311) -++..-|-+++-+++-|.+ +.+|+++...-+ .+++.-.-++- .+...... ...|......|.+...+.. ...+ T Consensus 46 t~gy~~~~~~~t~gaAlR~--EsLd~~l~~Lt~~~~~ftf~~~i~k-~~a~STV~ey~~~~~~G~~G~~~f~~E~-gi~~ 121 (514) T protein:vir:10 46 TAGHSITPDTQTDGAANRI--ESLNRDLKVTTWGERDFTLYNDIAK-QPVDNTVLKYTQYYSHGRTGHSLFQPEI-GIGD 121 (514) T ss_pred ccccccCCccccCccchhh--hhhccceeEeeecCcchhhhhhcCC-chhhHHHhhhhhhcccCccccccccccc-ccCc Confidence 3334444445555655665 556666644322 33444444442 22222222 2222233344444444443 2345 Q ss_pred eeeeeccceeEEEEEEEEEEEecHHHHHHHHHhCC-ChHHHHHHHHHHHHHHhhhheeeeeccccc-----eeeeecCCc Q lcl|NC_019522. 78 TVDIAMSQGFKDINTAALGYTYSIEEIGFAMLNNV-NLDAERGQAVRDVVEQGLNKIYLLGDKGVG-----EGLYTSPNV 151 (311) Q Consensus 78 ~v~~~~~~~~~~v~~~~~~~~~~~~El~~a~~~g~-~l~~~k~~aa~~~~~~~~n~~~~~G~~~~g-----~GllN~p~v 151 (311) ..+.++.++...+......+..++.--. ..|+ +......+.|...+++......||||+.+. .| |.--|+ T Consensus 122 ~~d~~~~rk~~~~k~l~~~~~vS~~~~l---~n~i~d~~~~~~~dai~~ia~tiE~a~FyGDs~L~s~~~~~g-leFDGl 197 (514) T protein:vir:10 122 VNNPNERQRTINIKYIVDTHVTSIALQR---ANTIVDSLKVQEYAAISTVIKTDEWAMFYGDADLTSGQKGEG-LQFDGL 197 (514) T ss_pred CCCcceEEEEEeeeeeeeeeeeeehhhh---ccchhhHHHHHHHHHHHHHHHHHHHHHhhhcccCCCccccCc-chhhhH Confidence 5777788888888888888888754111 1122 455566678888889999999999998752 12 122222 Q ss_pred ceeeccC---CccccCcccccCCHHHHHHHHHHHHHHHHhccCCceecceEEEeCHHHHHHHhcccccCCCCCcchHHHH Q lcl|NC_019522. 152 SVEAATS---TFVALVAAIPTNGTQPIIDFFGNAYNTVYLDNTLTVHRPNTFVLPPAQFQLLARTLLSTQNASNVTLLQF 228 (311) Q Consensus 152 ~~~~~~~---~~~~~~t~w~~~t~~ei~~di~~~~~~~~~~~~~~~~~p~~l~lpp~~~~~L~~~~~~~~~~~~~Tvl~~ 228 (311) .....+. +..|+. +. .+.|+.+-..+ .+ + .-.|+-++||......+..-.++.. -+ T Consensus 198 ~~lI~~~NvIDarG~~--Ls-------~~~ln~aA~~i-~~-g--fGt~TD~ylp~~vka~f~~~~~~~q-----RV--- 256 (514) T protein:vir:10 198 FKLIAPENHIDLRGGR--LS-------PAALNMAARKI-GE-G--FGTPTDAYMPIGIKADFVNQHLNGQ-----RV--- 256 (514) T ss_pred HHhhcCCCeEecCCCC--cc-------HHHHhhhhhhh-hc-c--cCChhheeCchHHHHHHhhcccCcc-----eE--- Confidence 1111110 111211 11 34455543322 22 2 4468999999999988865332210 00 Q ss_pred HHHhCCc--eEEEEchhcccCCCCcccEE---EEEEcCcceeEEeecchhhhccceeeCCceEEE-eeeeeeeeEEEECC Q lcl|NC_019522. 229 LRTNFPD--ITFEDDILLKGAGVAGADRM---AVYKKEIRIVKGHDVMPLRFLAPATADNVNFKV-PAILRTGGTEWRIP 302 (311) Q Consensus 229 l~~n~~~--l~i~~~~~l~~ag~~g~~~~---v~y~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~-~~~~~~gGv~i~~P 302 (311) +..++++ ..=..++.+.++ .|.-++ .+. +...-+.+..+ -...+|..+ .+...+ |.-.. .-+| T Consensus 257 ~~~~n~~~~~~G~~v~~f~s~--~G~I~L~gs~im-~~~n~L~~~~~--~~~~Ap~~~-~va~svT~~~~g-----~~~~ 325 (514) T protein:vir:10 257 MLPGQTGGMTTGLDIDKFLSA--HGSIRIQGSTIM-DSDNKLDFDRP--VSPTAPTAP-QLSATVTPDGGG-----LWHE 325 (514) T ss_pred EeecCccceeeeeeccceeEe--ccceeecCCeee-cccccCccCCc--cCCcCCCCC-cceEEEecCccc-----ccCc Confidence 1111111 000001111110 010000 000 01111111111 112233222 222121 11100 1123 Q ss_pred eEEEEeecC Q lcl|NC_019522. 303 KAGHYVDGV 311 (311) Q Consensus 303 ~ai~~~dGI 311 (311) .-..--.|= T Consensus 326 ad~t~~~g~ 334 (514) T protein:vir:10 326 ADKTDSKGE 334 (514) T ss_pred ccccccccc Confidence 322222222 No 161 >protein:vir:94711 Length: 347 # NCBI annotation: capsid # Family: family:all:975 # MgeID: mge:1528 # MgeName: K1F # Cross-refs: genbank:acc:YP_338120;genbank:gi:77118198;genbank:GeneID:3707734 Probab=63.82 E-value=0.31 Score=23.37 Aligned_cols=283 Identities=12% Similarity=0.069 Sum_probs=119.4 Q ss_pred CCc---ccccccchhh------hhhhHHHHHHHHHHHHhhhhhhhhhhhhccccCCCCcceeEEEEEEeecccceEE--e Q lcl|NC_019522. 1 MAK---SVFDVSPVSA------LSFLVNQAAHIESEIYRIEYPQFKYGTLLPLDNSAPDWAQAVMFRSIDARGELQL--F 69 (311) Q Consensus 1 ~~~---~~~~~~~~~~------~~fl~~~L~~id~~v~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~G~a~~--~ 69 (311) ||. ..+.+....+ .+.+ |+...++|...-...-..+.++.+++ +. +-.++.+.. .|..+. + T Consensus 1 m~~~~~~~~~t~~g~~~~~~d~~al~---ik~f~~eV~~~f~~~s~~~~~~~~r~-i~-~G~sv~i~~---iG~~tv~~~ 72 (347) T protein:vir:94 1 MANVPGQKIGTDQGKGKSSSDALALF---LKVFAGEVLTAFTRRSVTADKHIVRT-IQ-NGKSAQFPV---MGRTSGVYL 72 (347) T ss_pred CCCCCccccccccccCCccccHHHHH---HHHHhHHHHHHHHHHHhhhccccccc-cc-ccceEEEec---ccceeeeee Confidence 443 2222221111 1222 23333444443222223344444443 22 234444433 354443 2 Q ss_pred cCccccccee--eeeccceeEEEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhhheeee---------ec Q lcl|NC_019522. 70 GPNSTDVPTV--DIAMSQGFKDINTAALGYTYSIEEIGFAMLNNVNLDAERGQAVRDVVEQGLNKIYLL---------GD 138 (311) Q Consensus 70 ~~~a~dip~v--~~~~~~~~~~v~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aa~~~~~~~~n~~~~~---------G~ 138 (311) .-+ ++++-- +..-.+....|-.+- -+..-+.++.. .++..++-.+-.+.+..++++..|+.++. +. T Consensus 73 t~G-~~l~~~~~~~~~~e~~itID~~~-~~~~~VddiD~-~q~~~D~~~~~~~~~g~aLa~~~D~~i~~~~~~~aa~~~~ 149 (347) T protein:vir:94 73 APG-ERLSDKRKGIKHTEKVITIDGLL-TADVMIFDIED-AMNHYDVAGEYSNQLGEALAIAADGAVLAEMAILCNLPAA 149 (347) T ss_pred cCC-CCcCCCCCCCCcceEEEEecchh-hhhHHhhhHHH-HhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccc Confidence 211 332110 122222222221110 12233456665 44566788888999999999999987642 11 Q ss_pred cccce-eeeecCCcceeeccCCccccCcccccCCHHHHHHHHHHHHHHHHhccCCce-ecceEEEeCHHHHHHHhccccc Q lcl|NC_019522. 139 KGVGE-GLYTSPNVSVEAATSTFVALVAAIPTNGTQPIIDFFGNAYNTVYLDNTLTV-HRPNTFVLPPAQFQLLARTLLS 216 (311) Q Consensus 139 ~~~g~-GllN~p~v~~~~~~~~~~~~~t~w~~~t~~ei~~di~~~~~~~~~~~~~~~-~~p~~l~lpp~~~~~L~~~~~~ 216 (311) +.... |+ -.+++.... ..+.+.=..++++.+++-|.++...+.+.+ + .....++++|..|..|.....- T Consensus 150 ~~~~~~g~-~~~s~~~~~-----~~~~~~~~~~~~~~~~~~i~~a~~~Lde~~---VP~~~R~~vv~P~~~~~Ll~~~~~ 220 (347) T protein:vir:94 150 SNENIAGL-GTASVLEVG-----KKADLDTPAKLGEAIIGQLTIARAKLTSNY---VPAGDRYFYTTPDNYSAILAALMP 220 (347) T ss_pred cccccCCC-cccceeecc-----ccccccchhhhHHHHHHHHHHHHHHHhhcC---CCCCCcEEEeCHHHHHHHhccchh Confidence 11111 22 112221111 111111224567788888888777775432 2 1235899999999988542211 Q ss_pred CCCCCcchHHHHH----HHhC-----CceEEEEchhcccCC------------CCcccEEEE------EEc--------- Q lcl|NC_019522. 217 TQNASNVTLLQFL----RTNF-----PDITFEDDILLKGAG------------VAGADRMAV------YKK--------- 260 (311) Q Consensus 217 ~~~~~~~Tvl~~l----~~n~-----~~l~i~~~~~l~~ag------------~~g~~~~v~------y~~--------- 260 (311) +...+. ..++ -.++|+.++.|...+ .+|.+..+. |.- T Consensus 221 -------~~~~~~~~~~~~~G~Vg~i~G~~V~~Sn~lp~~~~t~~~~~~~~~~~aG~~~~~~~~~~~~~~~~~~~~~~l~ 293 (347) T protein:vir:94 221 -------NAANYAALIDPETGNIRNVMGFVVVEVPHLVQGGAGETRGDDGITIASGQKHAFPATASSDVKVTMDNVVGLF 293 (347) T ss_pred -------hhhhccccccccccceEEEeceEEEecCcccccccccccccCcceecCcccccccccchhhhcccccceeEEE Confidence 111111 1122 235677777664211 112122111 111 Q ss_pred -CcceeEEeecchhhhccceeeCCceEEEeeeeeeeeEEEECCeEEEEeecC Q lcl|NC_019522. 261 -EIRIVKGHDVMPLRFLAPATADNVNFKVPAILRTGGTEWRIPKAGHYVDGV 311 (311) Q Consensus 261 -~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~gGv~i~~P~ai~~~dGI 311 (311) +++-+...-.++++.-.-...+.....+.+.... |+-+.||++++.+.== T Consensus 294 ~h~~A~~~v~~~~~~~e~~r~~~~~~d~i~~~~~~-G~~~~rP~~a~~~~~~ 344 (347) T protein:vir:94 294 SHRSAVGTVKLRDLALERDRDVDAQGDLIVGKYAM-GHGGLRPEAAGALVFS 344 (347) T ss_pred eehhhhhhhhcccccccchhchhhHHHHhhhhhhh-cCcccccceeEEEEec Confidence 1111111111121111111222333455555554 7889999987655222 No 162 >protein:vir:99675 Length: 324 # NCBI annotation: Major capsid protein # Family: family:all:975 # MgeID: mge:1523 # MgeName: VP4 # Cross-refs: genbank:acc:YP_249589;genbank:gi:68299740;genbank:GeneID:3799990 Probab=63.60 E-value=0.31 Score=23.34 Aligned_cols=250 Identities=11% Similarity=0.045 Sum_probs=107.2 Q ss_pred ccCCCCcceeEEEEEEeecccceEEecC-cccccce--eeeeccceeEEEEEEEEEEEecHHHHHHHHHhCCChHHHHHH Q lcl|NC_019522. 44 LDNSAPDWAQAVMFRSIDARGELQLFGP-NSTDVPT--VDIAMSQGFKDINTAALGYTYSIEEIGFAMLNNVNLDAERGQ 120 (311) Q Consensus 44 v~~~~~~~~~~~~~~~~~~~G~a~~~~~-~a~dip~--v~~~~~~~~~~v~~~~~~~~~~~~El~~a~~~g~~l~~~k~~ 120 (311) ....+.- ..++.+ ...|+.+...- .+++|.. -+..-.+....|=. ..-+..-+.|+..++ +..++-.+-.+ T Consensus 1 ~vr~i~~-g~s~~~---~~iG~~~~~~~~~G~~l~~~~~~~~~~e~~itID~-~l~~~~~VdDiD~~q-a~~Dlr~e~s~ 74 (324) T protein:vir:99 1 MTRTITS-GKSAQF---PVMGRTKARYLKQGQSLDDGREDIKHTEKVITIDG-LLTTDVLIYDIEDAM-NHYDVRSEYST 74 (324) T ss_pred Ceeeeec-CceEEE---eeeeeeEeccccCCCCcCCCcCCcCcccEEEEecc-hhhhhhhhhhHHHHh-cCccchhHHHH Confidence 1111111 122222 23455553221 1222211 11122222111110 112233445666644 55778888889 Q ss_pred HHHHHHHHhhhheeee---ec----cccceeeeecCCcceeeccCCccccCcccccCCHHHHHHHHHHHHHHHHhccCCc Q lcl|NC_019522. 121 AVRDVVEQGLNKIYLL---GD----KGVGEGLYTSPNVSVEAATSTFVALVAAIPTNGTQPIIDFFGNAYNTVYLDNTLT 193 (311) Q Consensus 121 aa~~~~~~~~n~~~~~---G~----~~~g~GllN~p~v~~~~~~~~~~~~~t~w~~~t~~ei~~di~~~~~~~~~~~~~~ 193 (311) .+..++++..|+.+|. +. +....|-...++-......+++ +.-+..+++.+++-|.++..+|.+.+=- T Consensus 75 ~~G~aLA~~~Dq~i~~~~a~~~~~~a~~~~~~~~~~g~~~~~~~~~~----~~~~~~~~~~~~dai~~a~~~Lde~~VP- 149 (324) T protein:vir:99 75 QMGEALAMAADVANYAEMAKLVNSRKETTNENIEGLGAASLVKITGK----KEDPAKYGTQVIQALTYARAAFAKKYIP- 149 (324) T ss_pred HHHHHHHHHHHHHHHHHHHHhhhcccccccCCcccCCccceeccccc----ccccccCHHHHHHHHHHHHHHHhhcCCC- Confidence 9999999999987652 11 1111111111111111111111 1123467889999999988888654311 Q ss_pred eecceEEEeCHHHHHHHhcccccCCCCCcchHHHHHHH---hCCceEEEEchhcccC-CCC------------------- Q lcl|NC_019522. 194 VHRPNTFVLPPAQFQLLARTLLSTQNASNVTLLQFLRT---NFPDITFEDDILLKGA-GVA------------------- 250 (311) Q Consensus 194 ~~~p~~l~lpp~~~~~L~~~~~~~~~~~~~Tvl~~l~~---n~~~l~i~~~~~l~~a-g~~------------------- 250 (311) .....++++|+.|..|.....-....++ +.-.+... +.-.++|+.++.|... +.. T Consensus 150 -~~gR~~vv~P~~y~~Ll~~~~~~~~~~~-~~~~~~~G~V~~i~Gf~V~~Sn~lp~~~~t~~~~a~~~~~~~~~~~~~~~ 227 (324) T protein:vir:99 150 -AGDRTFYTDPDTYSAILAALMPNAANYA-ALIDPETGNIRNVMGFEVVETPHMTAQMVTNPTDAFDGTGHIFPATGDST 227 (324) T ss_pred -CCCCEEEeChHHHHHHhhcccccccccc-cccceecceEEEEeceEEEecCCccccccccccccccccccccccccccc Confidence 1235899999999988543221111111 11111110 0123566666666321 110 Q ss_pred ---------cccEEEEEEcCcceeEEeecchh--hhccceeeCCceEEEeeeeeeeeEEEECCeEEEEee-------cC Q lcl|NC_019522. 251 ---------GADRMAVYKKEIRIVKGHDVMPL--RFLAPATADNVNFKVPAILRTGGTEWRIPKAGHYVD-------GV 311 (311) Q Consensus 251 ---------g~~~~v~y~~~~~~~~~~~~~~~--~~~~p~~~~~~~~~~~~~~~~gGv~i~~P~ai~~~d-------GI 311 (311) ++-+.+++.++ -+...-.+++ +.. .+.+...+.+...... |+.+.||++++.+. |+ T Consensus 228 ~~~ky~~d~~~~~gl~~~~~--a~~tv~~~~~~~e~~--~~~~~~~d~i~~~~a~-G~~~lRPe~a~~v~l~~~~~~~~ 301 (324) T protein:vir:99 228 TTGKMTVGADNVVGLFVHRS--AVATLKLKDMALERA--RRPEYQADQIIAKYAM-GHGGLRPEAVGAIIFEDGETPAV 301 (324) T ss_pred cccccccccCceeEEEEehh--heEEEeeecceecce--echhhHHHhhhhhhhh-cCcccccceEEEEEEccCccccc Confidence 01111111111 1100001111 111 1122223344444554 78889999887665 44 No 163 >protein:vir:97397 Length: 517 # NCBI annotation: major capsid protein # Family: family:all:11745 # MgeID: mge:1675 # MgeName: Q54 # Cross-refs: genbank:acc:YP_762590;genbank:gi:115304291;genbank:GeneID:5130600 Probab=63.07 E-value=0.32 Score=23.27 Aligned_cols=265 Identities=10% Similarity=-0.012 Sum_probs=100.8 Q ss_pred CCcc-------------------------------cccccchhhhhhhHHHHHHHHHHHHhhhhhhhhhhhhccccCCCC Q lcl|NC_019522. 1 MAKS-------------------------------VFDVSPVSALSFLVNQAAHIESEIYRIEYPQFKYGTLLPLDNSAP 49 (311) Q Consensus 1 ~~~~-------------------------------~~~~~~~~~~~fl~~~L~~id~~v~~~~~~~~~~~~~~~v~~~~~ 49 (311) .+.. ..........+++.. ..+-..+...........+++++.. T Consensus 203 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p--~~~~~~i~~~~~~~~~i~~~~~~~~--- 277 (517) T protein:vir:97 203 EALKVTPEATEFLKTREAEVAYMSASLTKDPKAAWTAELKERGISGMPAP--AGILKRIQDAVNDEGSLLPFIRHEN--- 277 (517) T ss_pred ccccccchhhHHHHHHHHHHHHHHhcccccccceeeeecccccccccccc--hHHHHHHHHhhhhhccceeeeeecc--- Confidence 0000 000000011111111 0111111111111112222222211 Q ss_pred cceeEEEEEEeecccceEEecCcccccceeeeeccceeEEEEEEEEEEEecHHHHHHHHHhCC-ChHHHHHHHHHHHHHH Q lcl|NC_019522. 50 DWAQAVMFRSIDARGELQLFGPNSTDVPTVDIAMSQGFKDINTAALGYTYSIEEIGFAMLNNV-NLDAERGQAVRDVVEQ 128 (311) Q Consensus 50 ~~~~~~~~~~~~~~G~a~~~~~~a~dip~v~~~~~~~~~~v~~~~~~~~~~~~El~~a~~~g~-~l~~~k~~aa~~~~~~ 128 (311) .....+ ..-...+.+.+...+ ...|..+..+...+.+++.++.-...|.+-|..+..--. .|.+--....+..+.+ T Consensus 278 i~~~~~--~~~~~~~~a~~~~eG-~~kp~s~~tf~~~~~~~~~ia~~~~~S~qll~Ds~~dd~~~l~s~i~~~l~~~l~~ 354 (517) T protein:vir:97 278 LPTLVV--GGDNALTQGTGHTTG-TDKTESNITLQTRVLTPQYVYKYIKLPKIVMNSNATDIAGAILTYVMNRLPDMVIM 354 (517) T ss_pred ccceee--ecccccceeeeeecC-CcccccccceeeEEeeHhhhhhhhhhhHHHHHHhhhccHHHHHHHHHHHHHHHHHH Confidence 111111 111112233344433 445777777777777777777777777655554332211 1666677788999999 Q ss_pred hhhheeeeecccc--ceeeeecCCcceeeccCCccccCcccccCCHHHHHHHHHHHHHHHHhccCCceecceEEEeCHHH Q lcl|NC_019522. 129 GLNKIYLLGDKGV--GEGLYTSPNVSVEAATSTFVALVAAIPTNGTQPIIDFFGNAYNTVYLDNTLTVHRPNTFVLPPAQ 206 (311) Q Consensus 129 ~~n~~~~~G~~~~--g~GllN~p~v~~~~~~~~~~~~~t~w~~~t~~ei~~di~~~~~~~~~~~~~~~~~p~~l~lpp~~ 206 (311) .+++-+++|+... ..|+++..+... ..+ .-.+.+..+++..|..++. .. ....++|.|.. T Consensus 355 ~ee~a~l~GdGtg~~~~gi~~~a~~~~---~~~------~~~~~~~~d~i~~l~~a~~----~a-----~~a~~vmn~~t 416 (517) T protein:vir:97 355 AVNRAIIMGGVTGVSETQIYPVVGDAW---ATN------VTGTTNIQELLEKLSVATP----KA-----ADSTLVIHRND 416 (517) T ss_pred HHHHHHhcccCCCcccccccccccccc---ccc------ccccchHHHHHHHHHHHhh----hc-----cCCEEEECHHH Confidence 9999999998642 245554322100 000 0011122222222222211 11 12368999999 Q ss_pred HHHHhcccccCCCCCcchHHHHHHHhCCc------eEEEEchhcccCCCCcccEEEEEEcCcce-e----EEeecchhhh Q lcl|NC_019522. 207 FQLLARTLLSTQNASNVTLLQFLRTNFPD------ITFEDDILLKGAGVAGADRMAVYKKEIRI-V----KGHDVMPLRF 275 (311) Q Consensus 207 ~~~L~~~~~~~~~~~~~Tvl~~l~~n~~~------l~i~~~~~l~~ag~~g~~~~v~y~~~~~~-~----~~~~~~~~~~ 275 (311) +..|.+.. +..|.=++.=+..+.+. ..+.+.++. + +..+++.+ ++ + .+....++-+ T Consensus 417 ~~~I~klK----D~~G~Yl~~~~~~~~~~~~l~G~~~~~~~~~~------~-~~~~~~~~--~y~i~~~~g~~~~~~fd~ 483 (517) T protein:vir:97 417 LAAIRFLK----DKNGNYVFPVGVSNQTIATHFGFNRLVQSVAV------D-EKTAVSLS--GYVTNGSRGMEFEQGTIL 483 (517) T ss_pred HHHHHHhh----cCCCCeeccCcCCcccccccCCcccccccccc------C-ceeEeecc--ccEEEeecceeeeeeeec Confidence 99986533 11112111100001110 111222211 0 11111111 11 0 1111111111 Q ss_pred ccceeeCCceEEEeeeeeeeeEEEECCeEEEEeecC Q lcl|NC_019522. 276 LAPATADNVNFKVPAILRTGGTEWRIPKAGHYVDGV 311 (311) Q Consensus 276 ~~p~~~~~~~~~~~~~~~~gGv~i~~P~ai~~~dGI 311 (311) . .... .+-...++|| -|+.|+++++..=- T Consensus 484 ~----~n~~--~f~~~~~~~g-~i~~~~r~a~~~~~ 512 (517) T protein:vir:97 484 V----ENNK--EYLFEMPISG-SLEYKGTTAYGTYT 512 (517) T ss_pred c----cCce--eEeeeeeecc-ccccccceEEEEEc Confidence 1 1111 1222345544 56667766653222 No 164 >protein:vir:107120 Length: 329 # NCBI annotation: conserved phage protein # Family: family:all:701 # MgeID: mge:1571 # MgeName: CNPH82 # Cross-refs: genbank:acc:YP_950606;genbank:gi:119953686;genbank:GeneID:4643129 Probab=62.15 E-value=0.34 Score=23.15 Aligned_cols=273 Identities=7% Similarity=-0.036 Sum_probs=117.8 Q ss_pred CCccccccc----------chh-----hhhhhHHHHHHHHHHHHhhhhhhhhhhhhccccCCCCcceeEEEEEEeecccc Q lcl|NC_019522. 1 MAKSVFDVS----------PVS-----ALSFLVNQAAHIESEIYRIEYPQFKYGTLLPLDNSAPDWAQAVMFRSIDARGE 65 (311) Q Consensus 1 ~~~~~~~~~----------~~~-----~~~fl~~~L~~id~~v~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~G~ 65 (311) |..-.-++. ++- .++.....+..||..+.... ++..-+++-.-+. -+..++....++..|- T Consensus 12 ~~~~~~~~~~~~~~~~~~~~~~~~~~nt~~l~~k~~~~LD~~~~~~~---~s~~~~~N~~~e~-~~g~tVkIp~i~~~gl 87 (329) T protein:vir:10 12 MNKEIKNATGKLKLNLQHFANKSVEPGDTLLKNKHVGILEKVTAANS---YSAPAVISNDAIF-MQGRSFTVIKGDVTEL 87 (329) T ss_pred hhhhhhcccceeEEehhhhcCCccCCchhHHHHHHHHHHHHHHHhhc---eeeeeecccceee-ccCcEEEEeeeccccc Confidence 221111111 111 11122233444444332221 2222222211112 2566777777776664 Q ss_pred eEEecCcccccceeeeeccceeEEEEEEEEEEEecHHHHHHHHHhC-CChHHHHHHHHHHHHHHhhhheeeeecccccee Q lcl|NC_019522. 66 LQLFGPNSTDVPTVDIAMSQGFKDINTAALGYTYSIEEIGFAMLNN-VNLDAERGQAVRDVVEQGLNKIYLLGDKGVGEG 144 (311) Q Consensus 66 a~~~~~~a~dip~v~~~~~~~~~~v~~~~~~~~~~~~El~~a~~~g-~~l~~~k~~aa~~~~~~~~n~~~~~G~~~~g~G 144 (311) . .|.- ..+...=+++.++....+-. ..++.+.++++...+..+ +.....-.+.++..+.-.+|...|---.+.. | T Consensus 88 ~-DY~R-~~g~~~g~vt~~~~t~tidq-dR~~~F~VD~~D~dEtn~~l~a~~i~~~~~~~~v~pEiDay~~skla~~a-~ 163 (329) T protein:vir:10 88 K-DYKR-NATNEFDHPQIQETTYFLDQ-EKYWGRFVDALDRRDTEGNIDINYVVAKQASEVVAPYLDNLRFATLARNK-A 163 (329) T ss_pred c-cccC-CCCccccccccceeEEEeec-ccceeeecchhhHhhhhhhhhHHHHHHHHHHHHhhhHHHHHHHHHHHhhc-c Confidence 3 2310 12222224455566655544 778888888888765532 2222333344555555555544432111100 0 Q ss_pred eeecCCcceeeccCCccccCcccccCCHHHHHHHHHHHHHHHHhccCCceecceEEEeCHHHHHHHhccc--ccCCCCCc Q lcl|NC_019522. 145 LYTSPNVSVEAATSTFVALVAAIPTNGTQPIIDFFGNAYNTVYLDNTLTVHRPNTFVLPPAQFQLLARTL--LSTQNASN 222 (311) Q Consensus 145 llN~p~v~~~~~~~~~~~~~t~w~~~t~~ei~~di~~~~~~~~~~~~~~~~~p~~l~lpp~~~~~L~~~~--~~~~~~~~ 222 (311) +. .=...|++-+++.|.++..+|... ++.....|+++|..+..|.+-. ........ T Consensus 164 ----------------~~---~~~~~t~~nay~~i~~a~~~Lde~---~vp~~Rvl~VtP~~~~~Lk~~~~f~~~~~~~~ 221 (329) T protein:vir:10 164 ----------------KH---LTVGSGADAQYDAVLDVSVELDEI---GAGASRILFVTPKFYKGIKKFVIELPQGDNRQ 221 (329) T ss_pred ----------------cc---cccccCHHHHHHHHHHHHHHHHhc---CCCCCcEEEeCHHHHHHHHhhhhhhccccccc Confidence 00 011236777899999999988653 2334568999999999996511 10000000 Q ss_pred chHHHHHHHhCCceEEEEchhcccCCCCcccEEEEEEcCcceeEEeec-chhhhccceeeCCceEEEeeeeeeeeEEEEC Q lcl|NC_019522. 223 VTLLQFLRTNFPDITFEDDILLKGAGVAGADRMAVYKKEIRIVKGHDV-MPLRFLAPATADNVNFKVPAILRTGGTEWRI 301 (311) Q Consensus 223 ~Tvl~~l~~n~~~l~i~~~~~l~~ag~~g~~~~v~y~~~~~~~~~~~~-~~~~~~~p~~~~~~~~~~~~~~~~gGv~i~~ 301 (311) ....+-.-...-...|..+|.--.. +.+.+++ .. .-+.+..= ..++...|.|.. ..+.+.+ -.+.|+.|.+ T Consensus 222 ~~~~~g~Vg~idG~~Ii~vps~~~k---~in~ii~-~~--~A~~~~~K~~~~~~~~p~~~~-~a~~v~g-r~yyd~~V~~ 293 (329) T protein:vir:10 222 QVLGKGVQGELDGFTIVKVPSKMLQ---GVEAMAV-IG--EVMASPIQANEAKLNSNVPGM-FGTLAEQ-MLYTGAFVPE 293 (329) T ss_pred cceeeeeeeeecCeEEEEecCCccc---ceeEEEE-cC--CceeeeeeeeeeeeeCCCCcc-chheeee-eeeeeeEEEc Confidence 0000000000112345555532211 1122222 11 11111000 011222232333 2355554 5667999999 Q ss_pred CeEEEEeecC Q lcl|NC_019522. 302 PKAGHYVDGV 311 (311) Q Consensus 302 P~ai~~~dGI 311 (311) |.+.....-+ T Consensus 294 ~k~~~I~~~~ 303 (329) T protein:vir:10 294 HLQKYIFTIG 303 (329) T ss_pred cccCEEEEec Confidence 9865544444 No 165 >protein:vir:103323 Length: 364 # NCBI annotation: major capsid-like protein # Family: family:all:2806 # MgeID: mge:1609 # MgeName: Era103 # Cross-refs: genbank:acc:YP_001039668;genbank:gi:125999997;genbank:GeneID:4818399 Probab=59.02 E-value=0.4 Score=22.76 Aligned_cols=290 Identities=10% Similarity=0.039 Sum_probs=124.3 Q ss_pred CCc------ccccccchhhhhhhHHHHHHHHHHHHhhhhhhhhhhhhccccCCCCcceeEEEEEEeecccceEEecCc-c Q lcl|NC_019522. 1 MAK------SVFDVSPVSALSFLVNQAAHIESEIYRIEYPQFKYGTLLPLDNSAPDWAQAVMFRSIDARGELQLFGPN-S 73 (311) Q Consensus 1 ~~~------~~~~~~~~~~~~fl~~~L~~id~~v~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~G~a~~~~~~-a 73 (311) |.- +.-......---||....-.++..+-+. =..+.+..+++ +. +..++.|.. .|.++...-. + T Consensus 1 ms~~n~~t~~~~~~~~~~~al~le~f~geV~taf~~~----s~~~~~~~~rt-i~-~gkS~q~~~---iG~~~~~~~~~G 71 (364) T protein:vir:10 1 MSNPNVLTQPAVSASGEVDSLLIEKFNNRVHEQYLKG----ENLLQWFDVQE-VV-GTNSVSNKY---IGETELQVLSPG 71 (364) T ss_pred CCCcccccccccccccchhhhhhhhhhhhHHHHHHHH----HhhcCcceeee-ec-ccceEEeee---eeeeEEeeeccC Confidence 332 2222222111124444444455544331 13334555443 22 334444433 3554431111 1 Q ss_pred cccceeeeeccceeEE--EEEEEEEEEecHHHHHHHHHhCCC-hHHHHHHHHHHHHHHhhhheeee----eccccceeee Q lcl|NC_019522. 74 TDVPTVDIAMSQGFKD--INTAALGYTYSIEEIGFAMLNNVN-LDAERGQAVRDVVEQGLNKIYLL----GDKGVGEGLY 146 (311) Q Consensus 74 ~dip~v~~~~~~~~~~--v~~~~~~~~~~~~El~~a~~~g~~-l~~~k~~aa~~~~~~~~n~~~~~----G~~~~g~Gll 146 (311) +.+-...+.-++.... -..+.-.+ +.++.. .+...+ +..+-...+..++++..|+.++- +-...-.+-. T Consensus 72 ~~ld~~~~~~~k~~itID~ll~a~~~---V~diDe-~q~~~D~vR~e~s~e~G~ALA~~~Dq~i~~~v~~aa~a~~~~~~ 147 (364) T protein:vir:10 72 KSPDASPTEFDKNRLVVDTTVIARNT---VAHFHD-VQNDIDGLKSKLSVNQAKKLKKMEDSMVIQQLVLGGISNTEAIR 147 (364) T ss_pred cccCCCCcccCcEEEEecceeeechh---hhhHHH-HhcCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccc Confidence 1111111111221111 11222222 344444 234555 56666677778888877776641 1101001111 Q ss_pred ecCCcceeeccCCccccCcccccCCHHHHHHHHHHHHHHHHhccCCce-ecceEEEeCHHHHHHHhcc--cccCC---CC Q lcl|NC_019522. 147 TSPNVSVEAATSTFVALVAAIPTNGTQPIIDFFGNAYNTVYLDNTLTV-HRPNTFVLPPAQFQLLART--LLSTQ---NA 220 (311) Q Consensus 147 N~p~v~~~~~~~~~~~~~t~w~~~t~~ei~~di~~~~~~~~~~~~~~~-~~p~~l~lpp~~~~~L~~~--~~~~~---~~ 220 (311) +.|.+......... ++.+.-...+++.+.+-|.++...+.+.. + ..-..++|||..|..|.+- .++.. .+ T Consensus 148 ~~~~~~~~g~~i~~-~~~a~~~~~~~~~l~~ai~~a~~~LdEkd---VP~~~R~~vv~P~~y~~Ll~~~~lvn~d~~~~~ 223 (364) T protein:vir:10 148 KNPRVAGHGFSIHI-VGLASSFLTSPQYMMAAIEMAMEQQTEQE---VDTSELCGLMPWTAFNCLRDADRIVDKSYTIAA 223 (364) T ss_pred cCCcccCCcceeee-cccCcchhhhHHHHHHHHHHHHHHHhhcC---CCccccEEEeChHHHHHHhcCCccccccccccC Confidence 11111000000000 11122234567888888888888875432 2 1115789999999988653 33210 01 Q ss_pred CcchHHHHHHHhCCceEEEEchhcccC------------------CC-------C--cccEEEEEEcCcceeEEeecchh Q lcl|NC_019522. 221 SNVTLLQFLRTNFPDITFEDDILLKGA------------------GV-------A--GADRMAVYKKEIRIVKGHDVMPL 273 (311) Q Consensus 221 ~~~Tvl~~l~~n~~~l~i~~~~~l~~a------------------g~-------~--g~~~~v~y~~~~~~~~~~~~~~~ 273 (311) .+..+...+. .--.++|+.++.|... |. + .+-++++|.+ +-+...-.+|+ T Consensus 224 ~~~~~~G~v~-~v~Gv~Vv~Sn~lP~~~~~~~~t~~~t~h~ls~~~~g~~y~v~~d~~~~~~~~f~~--~Al~tv~~~~~ 300 (364) T protein:vir:10 224 SDNTVDGFVL-KSWNTPIVPSNRFPKLSDNTEGTGNTKHHKLSNAGNGNRYDVTAGQTSAQAVLFTQ--DALLVGRTISI 300 (364) T ss_pred CCccccceeE-EEeceEEEeccccccccccccccccccccccccccCCcccccccccceeEEEEEec--ceEEEEEEecc Confidence 1111111110 0123556666666311 00 1 1344555544 44443333444 Q ss_pred hhccceeeCCceEEEeeeeeeeeEEEECCeEEEEeecC Q lcl|NC_019522. 274 RFLAPATADNVNFKVPAILRTGGTEWRIPKAGHYVDGV 311 (311) Q Consensus 274 ~~~~p~~~~~~~~~~~~~~~~gGv~i~~P~ai~~~dGI 311 (311) +.-.-...+...|.+.+.... |+-++||++++-+.== T Consensus 301 t~e~~~~~~~~~~~ida~~a~-G~g~lRPeaa~~i~~~ 337 (364) T protein:vir:10 301 TGDIFYEKKEKTWYIDTFLAE-GAIPDRWEAVAVVTAA 337 (364) T ss_pred eeeeeeccceeeeeeeeehcc-cCcccCccceEEEEec Confidence 322113445556677776665 7999999998877433 No 166 >protein:vir:7990 Length: 273 # NCBI annotation: gp6 # Family: family:all:2203 # MgeID: mge:151 # MgeName: Che8 # Cross-refs: genbank:acc:NP_817344;genbank:gi:29565772;genbank:GeneID:1258978 Probab=58.93 E-value=0.4 Score=22.75 Aligned_cols=262 Identities=11% Similarity=0.051 Sum_probs=118.6 Q ss_pred CCcccccccchhhhhhhHHHHHHHHHHHHhhhhhhhhhhhhccccCC--CCcceeEEEEEEeecccceEEecCcccccce Q lcl|NC_019522. 1 MAKSVFDVSPVSALSFLVNQAAHIESEIYRIEYPQFKYGTLLPLDNS--APDWAQAVMFRSIDARGELQLFGPNSTDVPT 78 (311) Q Consensus 1 ~~~~~~~~~~~~~~~fl~~~L~~id~~v~~~~~~~~~~~~~~~v~~~--~~~~~~~~~~~~~~~~G~a~~~~~~a~dip~ 78 (311) ||- .++.. +.+...+.+.....+....++....+ +..|+ ++.++.....+.+..... ...++. T Consensus 1 MA~-----------~~~~p--ei~~~~v~~~~~~~lv~~~l~~~~~~~~~~~Gd-Tv~ip~~~~~~~~d~~~~-~~~~~~ 65 (273) T protein:vir:79 1 MAF-----------NNFIP--ELWSDMLLEEWTAQTVFANLVNREYEGIASKGN-VVHIAGVVAPTVKDYKAA-GRQTSA 65 (273) T ss_pred Ccc-----------hhhhH--HHHHHHHHHHHHhhccchhhhhccccccccCCc-EEEEeecCcccccccccC-CCccCc Confidence 221 12222 34556666666666666666543322 22233 777777655554332222 122343 Q ss_pred eeeeccceeEEEEE-EEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhhheeeeeccccceeeeecCCcceeecc Q lcl|NC_019522. 79 VDIAMSQGFKDINT-AALGYTYSIEEIGFAMLNNVNLDAERGQAVRDVVEQGLNKIYLLGDKGVGEGLYTSPNVSVEAAT 157 (311) Q Consensus 79 v~~~~~~~~~~v~~-~~~~~~~~~~El~~a~~~g~~l~~~k~~aa~~~~~~~~n~~~~~G~~~~g~GllN~p~v~~~~~~ 157 (311) -+.+.++....+-. -..++.++..| ..+ ...++.. -.+.+..++++..|+.++- ++..-+. . T Consensus 66 ~~~~~~~~~~tid~~~~~~~~i~d~d--~~~-~~~~~~~-~~~~~~~ala~~vD~~i~~--------~~~~a~~-----~ 128 (273) T protein:vir:79 66 DAISDTGVDLLIDQEKSIDFLVDDID--RVQ-VAGSLEA-YTRAGATALATDTDKFIAD--------MLVDNGT-----A 128 (273) T ss_pred cccccceEEEEEeeecccceeeccHH--HHh-hcccHHH-HHHHHHHHHHHHHHHHHHH--------HHhhccc-----c Confidence 34455555666644 35566665444 333 3446753 5566677888888765431 1100000 0 Q ss_pred CCccccCcccccCCHHHHHHHHHHHHHHHHhccCCce-ecceEEEeCHHHHHHHhcc--cccCCCCCc-chHH-HHHHHh Q lcl|NC_019522. 158 STFVALVAAIPTNGTQPIIDFFGNAYNTVYLDNTLTV-HRPNTFVLPPAQFQLLART--LLSTQNASN-VTLL-QFLRTN 232 (311) Q Consensus 158 ~~~~~~~t~w~~~t~~ei~~di~~~~~~~~~~~~~~~-~~p~~l~lpp~~~~~L~~~--~~~~~~~~~-~Tvl-~~l~~n 232 (311) .+ + =..-+++.+++.|.++..++... .+ .....++++|..+..|-+- .....+..+ ...+ +-...+ T Consensus 129 ~~--~----~~~~~~~~~~~~i~~a~~~ld~~---~vP~~~R~lvv~p~~~~~Ll~~~~~~~~~~~~~~~~~l~~G~ig~ 199 (273) T protein:vir:79 129 LT--G----SAPSDADDAFDLIASALKELTKA---NVPNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGN 199 (273) T ss_pred cc--c----ccccchhhHHHHHHHHHHHhhhc---cCCccCcEEEECHHHHHHHhhchhhhhhhhhcccccceeeeEeeE Confidence 00 0 01124556677788877777432 22 1224899999999877431 111111111 1110 000001 Q ss_pred CCceEEEEchhcccCCCCcccEEEEEEcCcceeEEeecc-hhhhccceeeCCceEEEeeeeeeeeEEEECCeEEEEeecC Q lcl|NC_019522. 233 FPDITFEDDILLKGAGVAGADRMAVYKKEIRIVKGHDVM-PLRFLAPATADNVNFKVPAILRTGGTEWRIPKAGHYVDGV 311 (311) Q Consensus 233 ~~~l~i~~~~~l~~ag~~g~~~~v~y~~~~~~~~~~~~~-~~~~~~p~~~~~~~~~~~~~~~~gGv~i~~P~ai~~~dGI 311 (311) .-.++|+....+.... + ...+++.++- +.+..-. .+.... ........+..... .|+.+.+|.+++.+.-= T Consensus 200 ~~G~~i~~s~~lp~~~--~-~~~~a~~~~A--~~~a~~~~~~e~~r--~~~~~~~~v~~~~~-yg~~v~~p~~vv~~~~~ 271 (273) T protein:vir:79 200 LLGARIVESNNLRDTD--D-EQFVAFHPSA--AAYVSQIDTVEALR--DQDSFSDRIRALHV-YGGKVVRPTGVVVFNKT 271 (273) T ss_pred EeceEEEecccccccC--c-eEEEEEeccc--eeeeeehhhhhccc--Ccccceeeeeeeee-eeeEEecCceEEEEecc Confidence 1234566655553211 1 1233333322 2221100 111111 11222333444344 57888999998886544 No 167 >protein:vir:78935 Length: 335 # NCBI annotation: capsid protein # Family: family:all:2806 # MgeID: mge:1860 # MgeName: LKD16 # Cross-refs: genbank:acc:YP_001522824;genbank:gi:158345059;genbank:GeneID:5687425 Probab=53.41 E-value=0.53 Score=22.10 Aligned_cols=282 Identities=12% Similarity=0.116 Sum_probs=127.8 Q ss_pred CCccc------ccccchhhhhhhHHHHHHHHHHHHhhhhhhhhhhhhccccCCCCcceeEEEEEEeecccceEEe----c Q lcl|NC_019522. 1 MAKSV------FDVSPVSALSFLVNQAAHIESEIYRIEYPQFKYGTLLPLDNSAPDWAQAVMFRSIDARGELQLF----G 70 (311) Q Consensus 1 ~~~~~------~~~~~~~~~~fl~~~L~~id~~v~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~G~a~~~----~ 70 (311) |.-|. -...+...--|+. ...-+|.+.-...-..+.+..+++- -+..++.|.. .|+.+.. | T Consensus 1 ms~~~~~t~~~~~~s~~d~al~le----~f~geV~~af~~~s~~~~~~~~rti--~~g~s~~~~~---iG~~~~~~~~pG 71 (335) T protein:vir:78 1 MSFLNDLTRPNYAGKNADVDIHLE----EHLGIVDKHFAYTSKFAPLMNIRDL--RGSNVVRLDR---LGNVEAKGRRAG 71 (335) T ss_pred CCccccccccccccccchhhhhhh----hhhhHHHHHHHHhhhhccccceeee--ccceeEEEee---eeeeeecccccC Confidence 44332 1111111112433 3333333333333455566666542 1244555443 3555432 1 Q ss_pred Ccccccceeeeecccee--EEEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhhheee------eeccc-c Q lcl|NC_019522. 71 PNSTDVPTVDIAMSQGF--KDINTAALGYTYSIEEIGFAMLNNVNLDAERGQAVRDVVEQGLNKIYL------LGDKG-V 141 (311) Q Consensus 71 ~~a~dip~v~~~~~~~~--~~v~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aa~~~~~~~~n~~~~------~G~~~-~ 141 (311) ..-+.-|.. -++.. ..-..+.-.+ +.++.. .++..++-..-.+.+..++++..|+.++ .+-.. . T Consensus 72 ~~l~~~~~~---~~k~~itID~ll~a~~~---VddlDe-~~~~yDvR~e~s~~~G~aLA~~~Dq~~~~~l~~aa~~~a~~ 144 (335) T protein:vir:78 72 EELERSRVV---NDKWNLTVDTLLYLRHQ---FDHQDE-WTQSFDMRKEVAELDGQELARKFDQACLIQVIKAAAMDAPV 144 (335) T ss_pred cccCCCCcc---cCCeEEEecceeechhh---HhhHHH-hhcCchhHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccc Confidence 111222211 12211 1112222222 444444 3456677777788888888888888765 11111 1 Q ss_pred c-eeeeecCCcceeeccCCccccCcccccCCHHHHHHHHHHHHHHHHhccCCcee-cc---eEEEeCHHHHHHHhcc--c Q lcl|NC_019522. 142 G-EGLYTSPNVSVEAATSTFVALVAAIPTNGTQPIIDFFGNAYNTVYLDNTLTVH-RP---NTFVLPPAQFQLLART--L 214 (311) Q Consensus 142 g-~GllN~p~v~~~~~~~~~~~~~t~w~~~t~~ei~~di~~~~~~~~~~~~~~~~-~p---~~l~lpp~~~~~L~~~--~ 214 (311) . .+.++ ||........+. =.+.+++.+.+-+.++...+.+.. +- .+ ..++++|+.|..|.+- . T Consensus 145 ~~~~~~~-~G~~~~~~~tg~------~~~~~~~~l~~a~~~a~~~l~ekd---vP~~~~~~rv~vv~P~~y~~Ll~~~~l 214 (335) T protein:vir:78 145 DLEDAFS-PGVLEKLDLTGL------TAKEAAEKIVRMHRRVVETFIERD---LGDAVYSEGLTPMSPRVFSLLLEHDKL 214 (335) T ss_pred ccCCCcC-CCcceeeeeccc------cccccHHHHHHHHHHHHHHHHhcc---CCCCCCCccEEEeChHHHHHHhccccc Confidence 1 12121 333222222111 123468888888888877775432 21 11 4688999999998542 2 Q ss_pred ccCC--C--CCcchHHHHHHHhCCceEEEEchhcccCCCC----c----------ccEEEEEEcCcceeEEeecchhhhc Q lcl|NC_019522. 215 LSTQ--N--ASNVTLLQFLRTNFPDITFEDDILLKGAGVA----G----------ADRMAVYKKEIRIVKGHDVMPLRFL 276 (311) Q Consensus 215 ~~~~--~--~~~~Tvl~~l~~n~~~l~i~~~~~l~~ag~~----g----------~~~~v~y~~~~~~~~~~~~~~~~~~ 276 (311) ++.. + ..+......+.+ --.++|+.++.|-+.+.. | +.++.++ ..++-+...-.+++.-- T Consensus 215 ~n~~~~~s~~~~~~~~g~v~~-v~Gv~V~~Sn~lP~~~~t~~~lg~a~n~~~~d~~~~~~~~-~~~~Al~t~~~~~~~~e 292 (335) T protein:vir:78 215 MSVEYQATGATNDYVKSRVAI-LNGVKVLETPRFATKAISAHPLGRHFNVSAEEAERQIALF-LPSKTLITAQVAPVQAK 292 (335) T ss_pred ccccccccccccccccceeEE-eeceEEEeeccCCCCCCccccccccCCcccccccceEEEE-EecceEEEEEEEecccc Confidence 2210 0 000111111111 123567777777532111 1 2233333 34443333333444211 Q ss_pred cceeeCCceEEEeeeeeeeeEEEECCeEE--EEeecC Q lcl|NC_019522. 277 APATADNVNFKVPAILRTGGTEWRIPKAG--HYVDGV 311 (311) Q Consensus 277 ~p~~~~~~~~~~~~~~~~gGv~i~~P~ai--~~~dGI 311 (311) .-.+.+...+.+.+.... |+-++||++. +...|| T Consensus 293 ~~~~~~~~~~~i~~~~a~-G~g~lRPe~a~~i~~tg~ 328 (335) T protein:vir:78 293 LWEDHDQFSWVLDTFQMY-NIGARRPDTAGAIELKGI 328 (335) T ss_pred eeeccchhhHhhhHHHHc-CCcccCcceEEEEEecCC Confidence 113344455666666664 7999999665 456788 No 168 >protein:vir:96666 Length: 462 # NCBI annotation: ORF016 # Family: family:all:2450 # MgeID: mge:1623 # MgeName: Twort # Cross-refs: genbank:acc:YP_238545;genbank:gi:66391271;genbank:GeneID:5130448 Probab=50.92 E-value=0.6 Score=21.82 Aligned_cols=265 Identities=9% Similarity=-0.013 Sum_probs=116.3 Q ss_pred CCcccccccchhhhhhhHHHHHHHHHHHHhhhh--hhhhhhhhccccCCCCcceeEE-EEEEeecccceEEecCcccccc Q lcl|NC_019522. 1 MAKSVFDVSPVSALSFLVNQAAHIESEIYRIEY--PQFKYGTLLPLDNSAPDWAQAV-MFRSIDARGELQLFGPNSTDVP 77 (311) Q Consensus 1 ~~~~~~~~~~~~~~~fl~~~L~~id~~v~~~~~--~~~~~~~~~~v~~~~~~~~~~~-~~~~~~~~G~a~~~~~~a~dip 77 (311) -++..-+-+++-+.+-|.+ |.+|+.|..--+ .+++.-+-++- .+.......+ .|......|.+...+.. ...+ T Consensus 27 ~tg~g~~p~~q~~~gAlR~--esL~~~i~~Lt~~~~~~~~~~~i~k-~~a~sTv~~y~~~~~~G~~g~~~f~~E~-g~~~ 102 (462) T protein:vir:96 27 QTGYGITPDTQVDAGALRR--EILDDQITMLTWTQDDLIFYREISR-RPAQSTVQKYDVYLRHGNVGHSRFVREV-GVAP 102 (462) T ss_pred hcCCCcCCccccccchhhh--hhhhhhhheeeecccchhhhhhcCC-chhhhhhhhheeeeccCccccccccccc-cccc Confidence 2223222244444445555 556666544333 33444444442 2222222222 22223334444444443 4467 Q ss_pred eeeeeccceeEEEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhhheeeeeccccce---e-eeecCCcce Q lcl|NC_019522. 78 TVDIAMSQGFKDINTAALGYTYSIEEIGFAMLNNVNLDAERGQAVRDVVEQGLNKIYLLGDKGVGE---G-LYTSPNVSV 153 (311) Q Consensus 78 ~v~~~~~~~~~~v~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aa~~~~~~~~n~~~~~G~~~~g~---G-llN~p~v~~ 153 (311) ..|.++.+++..+..++..-..++..-.. .-. .+..+...+.|...+++......||||+.+.- | -|+.-|+.. T Consensus 103 ~~d~~~~R~~~~~k~l~~t~~vsi~~tl~-n~~-~d~~~~~~~dai~~~a~tiE~a~Fygds~l~~~~~~~gleFDGl~~ 180 (462) T protein:vir:96 103 VSDPNIRQKTVEMKYVSDTKNLSIASTLV-NNI-QDPMQILTEDAIAVVAKTIEWASFYGDASLTADPTGQGLEFDGLAK 180 (462) T ss_pred cCCCceEEEEEEEEEEeeeeeechhhhhc-cch-hhHHHHHHHHHHHHHHHHHHHHHhhhhcccCCCccccccchhhhhh Confidence 78899999999999999999988764331 112 23447777788888999999999999998631 1 134444432 Q ss_pred eeccC---CccccCcccccCCHHHHHHHHHHHHHHHHhccCCceecceEEEeCHHHHHHHhcccccCCCCCcchHHHHHH Q lcl|NC_019522. 154 EAATS---TFVALVAAIPTNGTQPIIDFFGNAYNTVYLDNTLTVHRPNTFVLPPAQFQLLARTLLSTQNASNVTLLQFLR 230 (311) Q Consensus 154 ~~~~~---~~~~~~t~w~~~t~~ei~~di~~~~~~~~~~~~~~~~~p~~l~lpp~~~~~L~~~~~~~~~~~~~Tvl~~l~ 230 (311) ...+. +..|+. +. .+.|+.+-..+ + .+.-.|+-++||......|..-.++.. ..+. T Consensus 181 lI~~~NViDarG~~--Ls-------~~~ln~aa~~i---~-~~fGt~TD~~~p~~v~a~f~~~~l~~q--------rv~~ 239 (462) T protein:vir:96 181 LIDKDNVIDAKGES--LT-------ETLLNRSAVLI---G-KSFGTATDAYMPIGVHADFVNSVLGRQ--------MQLM 239 (462) T ss_pred hcCCCceeecCCCC--cc-------HHHHhhhhhhc---c-cccCChhheecchHHHHHHHHhhcCce--------EEEE Confidence 22221 111111 11 34555554433 1 234568999999999988864322110 0011 Q ss_pred HhCCc-eEE-EEchhcccCCCCcccEEEEEEcCcceeEEeecc----hhhhcccee---eCCceEEEeeeeeeeeEEEEC Q lcl|NC_019522. 231 TNFPD-ITF-EDDILLKGAGVAGADRMAVYKKEIRIVKGHDVM----PLRFLAPAT---ADNVNFKVPAILRTGGTEWRI 301 (311) Q Consensus 231 ~n~~~-l~i-~~~~~l~~ag~~g~~~~v~y~~~~~~~~~~~~~----~~~~~~p~~---~~~~~~~~~~~~~~gGv~i~~ 301 (311) ..++. +.. ..++.+. ...-.++++-.. |..+-.-.+ .-.-...+... +.--. T Consensus 240 ~~n~g~~~~G~~v~~f~--------------s~~G~I~L~~s~~m~~~~i~~~~~~~~p~ap~~~~vsaT-----v~t~~ 300 (462) T protein:vir:96 240 QDNSGNVNAGYNVQGFY--------------SSRGFIKLHGSTVMENELILDESLQPLPNAPQPATVKAT-----VETGK 300 (462) T ss_pred cCCCCceeeeeecccee--------------eeeeeeeeCCceecCcccccccccccCCCCCCCCceeEE-----EEeCC Confidence 11111 100 0011111 111112221000 000000000 00000111111 22222 Q ss_pred CeEEEEeecC Q lcl|NC_019522. 302 PKAGHYVDGV 311 (311) Q Consensus 302 P~ai~~~dGI 311 (311) +..+.-=.+. T Consensus 301 ~g~f~~~~d~ 310 (462) T protein:vir:96 301 KGLFTDEHDR 310 (462) T ss_pred CCCCCCccCc Confidence 2211111011 No 169 >protein:vir:4902 Length: 348 # NCBI annotation: gp348 # Family: family:all:1083 # MgeID: mge:107 # MgeName: Sfi11 # Cross-refs: genbank:acc:NP_056680;genbank:gi:9635015;genbank:GeneID:1262657 Probab=40.30 E-value=0.98 Score=20.64 Aligned_cols=284 Identities=9% Similarity=0.051 Sum_probs=105.1 Q ss_pred CCcccccccchhhhhhhHHHHHHHHHHHHhhhhhhhhhhhhccccCCCCcceeEEEEEEeecccc-eEEecCccccccee Q lcl|NC_019522. 1 MAKSVFDVSPVSALSFLVNQAAHIESEIYRIEYPQFKYGTLLPLDNSAPDWAQAVMFRSIDARGE-LQLFGPNSTDVPTV 79 (311) Q Consensus 1 ~~~~~~~~~~~~~~~fl~~~L~~id~~v~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~G~-a~~~~~~a~dip~v 79 (311) |++- +| -|-.++|+..-..+- .+...+-...+||... .. ..+...+........ |..++-.+.....- T Consensus 1 M~~l-~d-------~f~~~~l~~~v~~~~-~~~~~~l~~~~Fp~~~-~~-~~~~~~~~~~~~~~~~a~~v~~~~~~~~~~ 69 (348) T protein:vir:49 1 MGLI-YD-------KVTASNIAGYFNALQ-ENVDSTLGESIFPARK-QL-GTKLSYITGASGQSVALKAAAFDTNVTVRD 69 (348) T ss_pred Ccch-hh-------hcCHHHHHHHHHhcc-ccchhhhHhhcCCCcc-cc-CceeEEEEeecCceeeeeeecCCCCcceec Confidence 4432 22 133344432222221 1233455677888532 11 223322332222222 22333333322222 Q ss_pred eeeccceeEEEEEEEEEEEecHHHHHHHHHhCCC--hHH---------H----HHHHHHHHHHHhhhheeeeecc---cc Q lcl|NC_019522. 80 DIAMSQGFKDINTAALGYTYSIEEIGFAMLNNVN--LDA---------E----RGQAVRDVVEQGLNKIYLLGDK---GV 141 (311) Q Consensus 80 ~~~~~~~~~~v~~~~~~~~~~~~El~~a~~~g~~--l~~---------~----k~~aa~~~~~~~~n~~~~~G~~---~~ 141 (311) ...++..+..+..+.-.+..+..|++..+...-+ -.. + ...+.++..+...-+.++.|-= +. T Consensus 70 r~~~~~~~~~~p~i~~~~~i~~~d~~~l~~~~~~~~~~~~~~~~~~i~~d~~~l~~~i~~r~E~m~~qal~~Gki~i~~~ 149 (348) T protein:vir:49 70 RVSAEMHDEQMPFFKEAMLVKENDRQQLNLVKDSGNAALVNTIVAGIFNDNLTLVNGARARLEAMRMQVLATGKIAFTSD 149 (348) T ss_pred ccceeeeeeecCccccccccCHHHHHHHHHHhccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCeEEEecC Confidence 2334555666777788888887775543333211 110 1 1122344444444455555521 11 Q ss_pred c--eee-eecCCcceeeccCCccccCcccccCCHHHHHHHHHHHHHHHHhccCCceecceEEEeCHHHHHHHhc-----c Q lcl|NC_019522. 142 G--EGL-YTSPNVSVEAATSTFVALVAAIPTNGTQPIIDFFGNAYNTVYLDNTLTVHRPNTFVLPPAQFQLLAR-----T 213 (311) Q Consensus 142 g--~Gl-lN~p~v~~~~~~~~~~~~~t~w~~~t~~ei~~di~~~~~~~~~~~~~~~~~p~~l~lpp~~~~~L~~-----~ 213 (311) | +.+ +..|.-...++. ..|.+.++ +++.||.+....+.. + | . .|++++|+++.+..|.+ . T Consensus 150 g~~~~vdyg~~~~~~~t~~-------~~W~~~~a-dp~~di~~~~~~~~~-~-G-~-~~~~ii~~~~~~~~l~~~~~v~~ 217 (348) T protein:vir:49 150 GVNKDIDYGVKPDHKKQVS-------KSWAEPGA-TPLADLEDAIETARE-L-G-L-NPERAVMNAKTFGLIRKAASTVK 217 (348) T ss_pred CceEEEeecCCcccceeee-------eccCCCCC-CHHHHHHHHHHHHHh-c-C-C-cccEEEeCHHHHHHHhcCHHHHH Confidence 1 110 111111111111 24877665 478999999877753 3 3 2 58999999999999843 1 Q ss_pred cccCCC--C---CcchHHHHHHHhCCceEEEEch-hcccCCCCcc-------cEEEEEEcCcc-eeEEeecch-hhhc-- Q lcl|NC_019522. 214 LLSTQN--A---SNVTLLQFLRTNFPDITFEDDI-LLKGAGVAGA-------DRMAVYKKEIR-IVKGHDVMP-LRFL-- 276 (311) Q Consensus 214 ~~~~~~--~---~~~Tvl~~l~~n~~~l~i~~~~-~l~~ag~~g~-------~~~v~y~~~~~-~~~~~~~~~-~~~~-- 276 (311) .+.... . ....+.+++...+ .++|+.-. ++.+. .|+ +.+++...... ...+..... .... T Consensus 218 ~~~~~~~~~~~i~~~~~~~~~~~~~-g~~i~~y~~~y~d~--dG~~~~~~p~~~v~l~~~~~~G~~~yg~~~e~~~~~~~ 294 (348) T protein:vir:49 218 VIKPLAGDGSSVTKAELDNYIADNF-GVTVVLENGTYRNE--KGEVSKFFPDGHLTLIPNGPLGNTVFGTTPEESDLFAD 294 (348) T ss_pred HhhccCcccccccHHHHHHHHHhhc-CceEEEEeeEEEec--CCcEeeeecCCeEEEecCCCcceeEEecChhhhhhccc Confidence 111111 1 1112333333321 12222211 12211 122 11111111100 000000000 0000 Q ss_pred --c--cee-------------eCCceEEEeeeeeeeeEEEECCeEEEEeecC Q lcl|NC_019522. 277 --A--PAT-------------ADNVNFKVPAILRTGGTEWRIPKAGHYVDGV 311 (311) Q Consensus 277 --~--p~~-------------~~~~~~~~~~~~~~gGv~i~~P~ai~~~dGI 311 (311) . ..+ .......+...++ .=-.+.+|.++..++=+ T Consensus 295 ~~~~~~~~~~~~~~~~~~~~~~dP~~~~~~~~s~-~lPv~~~~~~~~~a~Vl 345 (348) T protein:vir:49 295 NTVNADVEIVDNGIAVTTTKTTDPVNVQTKVSMV-ALPSFERLDDVYMLTVI 345 (348) T ss_pred cccccceeecCCeEEEeeeecCCCceEEEEEeee-ccccccCCCcEEEEEEe Confidence 0 000 0000000000000 00123444444444444 No 170 >protein:vir:2736 Length: 348 # NCBI annotation: putative structural protein # Family: family:all:1083 # MgeID: mge:58 # MgeName: O1205 # Cross-refs: genbank:acc:NP_695109;genbank:gi:23455878;genbank:GeneID:955608 Probab=39.84 E-value=1 Score=20.59 Aligned_cols=281 Identities=11% Similarity=0.026 Sum_probs=104.4 Q ss_pred CCcccccccchhhhhhhHHHHHHHHHHHHhhhhhhhhhhhhccccCCCCcceeEEEEEEeecc---cc-eEEecCccccc Q lcl|NC_019522. 1 MAKSVFDVSPVSALSFLVNQAAHIESEIYRIEYPQFKYGTLLPLDNSAPDWAQAVMFRSIDAR---GE-LQLFGPNSTDV 76 (311) Q Consensus 1 ~~~~~~~~~~~~~~~fl~~~L~~id~~v~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~---G~-a~~~~~~a~di 76 (311) |++- .--|-..+|+..-.++- .+...+-...+||... .. ...+...+.. .. +...+-.+... T Consensus 1 M~~i--------~d~f~~~~l~~~v~~~~-~~~~~~l~~~~Fp~~~-~~----~~~~~~~~~~~~~~~~a~~v~~~~~~~ 66 (348) T protein:vir:27 1 MGLI--------YDKVTASNIAGYFNALQ-ENVSSTLGESIFPARK-QL----GTKLSYIKGASGQSVALKAAAFDTNVT 66 (348) T ss_pred Ccch--------hhhcCHHHHHHHHHhcc-chhhhhhHhhcCCCcc-cc----ceeEEEEeeccCceeEeeeecCCCCcc Confidence 4431 11233444433222221 2233455667888432 11 1222222221 22 22333222221 Q ss_pred ceeeeeccceeEEEEEEEEEEEecHHHHHHHHHhCC--ChHH---------H----HHHHHHHHHHHhhhheeeeecc-- Q lcl|NC_019522. 77 PTVDIAMSQGFKDINTAALGYTYSIEEIGFAMLNNV--NLDA---------E----RGQAVRDVVEQGLNKIYLLGDK-- 139 (311) Q Consensus 77 p~v~~~~~~~~~~v~~~~~~~~~~~~El~~a~~~g~--~l~~---------~----k~~aa~~~~~~~~n~~~~~G~~-- 139 (311) ..-...++..+..+..+.-.+..+..|++......- +-.. + ...+.++..+...-+.++.|-= T Consensus 67 ~~~r~~~~~~~~~~p~i~~~~~i~~~d~~~~~~~~~~~~~~~~~~~~~~i~~d~~~l~~~i~~r~E~m~~~al~~Gki~i 146 (348) T protein:vir:27 67 IRDRVSAEMHDEQMPFFKEAMLVKENDRQQLNLVKDSGNAVLVNTIVAGIFNDNLTLVNGARARLEAMRMQVLATGKIAF 146 (348) T ss_pred eecccceeeeeeecCccccccccCHHHHHHHHHhhccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCeeEE Confidence 111223445566667777788888777654322211 1111 1 1122333334444455555521 Q ss_pred -ccceee---eecCCcceeeccCCccccCcccccCCHHHHHHHHHHHHHHHHhccCCceecceEEEeCHHHHHHHhc--- Q lcl|NC_019522. 140 -GVGEGL---YTSPNVSVEAATSTFVALVAAIPTNGTQPIIDFFGNAYNTVYLDNTLTVHRPNTFVLPPAQFQLLAR--- 212 (311) Q Consensus 140 -~~g~Gl---lN~p~v~~~~~~~~~~~~~t~w~~~t~~ei~~di~~~~~~~~~~~~~~~~~p~~l~lpp~~~~~L~~--- 212 (311) +.|+.. +..|.-...++ +..|.+.+++ +++||.+....+. .+ | . .|++++|+++.+..|.+ T Consensus 147 ~~~~~~~~vdfg~~~~~~~t~-------~~~W~~~~ad-p~~di~~~~~~~~-~~-G-~-~~~~ii~~~~~~~~l~~~~~ 214 (348) T protein:vir:27 147 TSDGVNKDIDYGVKPDHKKQV-------SKSWAEPGAT-PLADLEDAIETAR-EL-G-L-NPERAVMNAKTFGLIRKAAS 214 (348) T ss_pred ecCCeeEEEeecCCcccceee-------eeccCCCCCC-HHHHHHHHHHHHH-hc-C-C-cccEEEECHHHHHHHhcCHH Confidence 111111 11121111111 1248776664 6899999988774 33 3 2 68899999999999853 Q ss_pred --ccccCC--CCC---cchHHHHHHHh-CCceEEEEchhcccCCCCcc-------cEEEEEEcCcc-eeEEeec-ch--h Q lcl|NC_019522. 213 --TLLSTQ--NAS---NVTLLQFLRTN-FPDITFEDDILLKGAGVAGA-------DRMAVYKKEIR-IVKGHDV-MP--L 273 (311) Q Consensus 213 --~~~~~~--~~~---~~Tvl~~l~~n-~~~l~i~~~~~l~~ag~~g~-------~~~v~y~~~~~-~~~~~~~-~~--~ 273 (311) ..+... +.. ..-+.+++... ++.+.+.. .++.+ ..|+ +.+++...... ...+..+ .. . T Consensus 215 v~~~~~~~~~~~~~i~~~~~~~~~~~~~g~~i~~yd-~~y~d--~~G~~~~~~p~~~vvl~~~~~~G~~~yG~~~e~~~~ 291 (348) T protein:vir:27 215 TVKVIKPLAGDGSAVTKAELENYIADNFGVSIVLEN-GTYRN--DKGEVSKFYPDGHLTLIPNGPLGNTVFGTTPEESDL 291 (348) T ss_pred HHHHhcccCccccccCHHHHHHHHHhhcCceEEEEe-eEEEc--CCCcCcccccCCeEEEEcCCcceeEEeccCcchhhh Confidence 111110 011 11233333322 22222221 12221 1121 22222221110 0111000 00 0 Q ss_pred hh----------ccc-------eeeCCceEEEeeeeeeeeEEEECCeEEEEeecC Q lcl|NC_019522. 274 RF----------LAP-------ATADNVNFKVPAILRTGGTEWRIPKAGHYVDGV 311 (311) Q Consensus 274 ~~----------~~p-------~~~~~~~~~~~~~~~~gGv~i~~P~ai~~~dGI 311 (311) .. ..+ .+.......+... ...=-.+.+|.++..++=+ T Consensus 292 ~~~~~~~~~~~~~~~~~~~~~~~~~dP~~~~~~~~-s~~lPv~~~~~~~~~a~Vl 345 (348) T protein:vir:27 292 FADNTVNAEVEIVDNGIAVTTTKTTDPVNVQTKVS-MVALPSFERLDDVYMLTVI 345 (348) T ss_pred hhccccccceeeeCCeeEEEeeecCCCceEEEEEe-eeeeccccCCCcEEEEEEe Confidence 00 000 0000001111111 1111224455444443322 No 171 >protein:vir:98480 Length: 348 # NCBI annotation: ORFp38 # Family: family:all:1083 # MgeID: mge:1589 # MgeName: VWB # Cross-refs: genbank:acc:NP_958280;genbank:gi:41057254;uniprot:Q38595;genbank:GeneID:2732864 Probab=30.84 E-value=1.5 Score=19.55 Aligned_cols=285 Identities=11% Similarity=0.047 Sum_probs=110.8 Q ss_pred ccccchhhhhhhHHHHHHHHHHHH-hhhhhhhhhhhhccccCCCCcceeEEEEEEeecc-cc---eEEecCcccccceee Q lcl|NC_019522. 6 FDVSPVSALSFLVNQAAHIESEIY-RIEYPQFKYGTLLPLDNSAPDWAQAVMFRSIDAR-GE---LQLFGPNSTDVPTVD 80 (311) Q Consensus 6 ~~~~~~~~~~fl~~~L~~id~~v~-~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~-G~---a~~~~~~a~dip~v~ 80 (311) |...-+.. -|-..+|+.+=.++. ..+...+-..++||... ...+.|...... +. +.+.+..+. -|... T Consensus 1 M~~~~~~d-~~~~~~l~~~i~~~~~~~~~~~~l~~~~fp~~~-----~~~~~~~~~~~~~~~~~~a~~~~~~~~-~~~~~ 73 (348) T protein:vir:98 1 MSWTLDTE-FIEPTQLTGLIREALRDLQVNRFRLARWLPNVD-----VDDITFEFLRGGGGLAETASYRSWDTE-SKIGR 73 (348) T ss_pred Ccchhhhh-ccCHHHHHHHHHHHhhccCcchhhHHhcCCCcc-----ccceEEEEEeccCCceeeeeeecCCCc-cceee Confidence 32211111 122244543333332 22334467789999642 223334433222 21 233332222 23222 Q ss_pred -eeccceeEEEEEEEEEEEecHHHHHHHHHhCCC-----h---HHHHHHHHHHHHHHhhhheeeeecc---ccceee-ee Q lcl|NC_019522. 81 -IAMSQGFKDINTAALGYTYSIEEIGFAMLNNVN-----L---DAERGQAVRDVVEQGLNKIYLLGDK---GVGEGL-YT 147 (311) Q Consensus 81 -~~~~~~~~~v~~~~~~~~~~~~El~~a~~~g~~-----l---~~~k~~aa~~~~~~~~n~~~~~G~~---~~g~Gl-lN 147 (311) ..++..+..+..++..+.++..|+...+..-.+ + -.+...+.++..+...-+.++.|-= +.+|.+ +. T Consensus 74 r~g~~~~~~~~~~i~~~~~i~~~d~~~~~~~~~~~~~~~i~~d~~~l~~~i~~r~E~m~~qal~~Gki~~~g~~~~vDyg 153 (348) T protein:vir:98 74 REGLAKVMGELPPISEKIPLNEYDRLRLRKLSRDEALPFIARDAQRLARNIGARFEVARGSALVNATVPVTELQQTVDFG 153 (348) T ss_pred cccceeeeeeccccccccccCHHHHHHhcCChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCeEEEecCceEEccc Confidence 234555666777888888888888764322110 0 0112333333344444455555521 112211 11 Q ss_pred cCCcceeeccCCccccCcccccCCHHHHHHHHHHHHHHHHhccCCceecceEEEeCHHHHHHHhc-----ccccCCCC-- Q lcl|NC_019522. 148 SPNVSVEAATSTFVALVAAIPTNGTQPIIDFFGNAYNTVYLDNTLTVHRPNTFVLPPAQFQLLAR-----TLLSTQNA-- 220 (311) Q Consensus 148 ~p~v~~~~~~~~~~~~~t~w~~~t~~ei~~di~~~~~~~~~~~~~~~~~p~~l~lpp~~~~~L~~-----~~~~~~~~-- 220 (311) .|.-... .+++.|.......+++||.+....+...++. .|++++|+++.+..|.+ ..+.+.+. T Consensus 154 ~~~~~~~-------t~~~~Ws~~~~adp~~di~~~~~~~~~~~G~---~p~~~vm~~~~~~~l~~~~~i~~~~~~~~~~~ 223 (348) T protein:vir:98 154 RIGSHSV-------VAAVLWSVHATATPISDLESWVATYEDTNGQ---SPGVILMPKAAVSHMRQCEEVIRQVFPLAPSG 223 (348) T ss_pred cCccccc-------ccccccCCCCCCCHHHHHHHHHHHHHHccCC---cceEEEeCHHHHHHHhcCHHHHHHHhccCccc Confidence 1211111 1234685433334789999999988654432 48999999999999843 11110000 Q ss_pred -Cc-c---hHHHHHHHhC-CceEEEEchhcccCCCCcccEE------EEEEcCccee-EEeecchhhhccc-e------- Q lcl|NC_019522. 221 -SN-V---TLLQFLRTNF-PDITFEDDILLKGAGVAGADRM------AVYKKEIRIV-KGHDVMPLRFLAP-A------- 279 (311) Q Consensus 221 -~~-~---Tvl~~l~~n~-~~l~i~~~~~l~~ag~~g~~~~------v~y~~~~~~~-~~~~~~~~~~~~p-~------- 279 (311) .. . .+-.++...+ +.+.+.. .....- +...++ +++....... ....++-.+...| . T Consensus 224 ~~~~~~~~~~~~~~~~~g~~~i~~~d-~~~~~~--g~~~~~~p~~~i~l~p~~~~~~~~~~~~~G~t~~G~~~e~~~~~~ 300 (348) T protein:vir:98 224 TAPMVSVEQLNTVLSSMGLPPIEVYD-AKVAVD--GVSTRITPANAIALLPEPGATDAAQPTELGATLLGTTAESLEDDY 300 (348) T ss_pred cccccCHHHHHHHHHhhCCeEEEEee-eEEEcC--CceeceecCCeEEEEecCCcccccccccccceecccchhhhcccc Confidence 00 0 1112222222 2222221 112221 111121 1111110000 0000000000000 0 Q ss_pred ----------------eeCCceEEEeeeeeeeeEEEECCeEEEEeecC Q lcl|NC_019522. 280 ----------------TADNVNFKVPAILRTGGTEWRIPKAGHYVDGV 311 (311) Q Consensus 280 ----------------~~~~~~~~~~~~~~~gGv~i~~P~ai~~~dGI 311 (311) +.......+...++ .=..+.+|.+++.++=| T Consensus 301 ~~~~~~~~~i~~~~~~~~dP~~~~~~~~s~-~lPv~~~~~~~~~a~Vl 347 (348) T protein:vir:98 301 ALAPGEQPGIVAATWKTKDPVRLWTHAAAV-GIPVLREPNLTFKAQVL 347 (348) T ss_pred ccceeccCceeeeeeeecCCcEEEEEEeee-eeccccCCCcEEEEEEe Confidence 00111111222222 11334566666555544 No 172 >protein:vir:103886 Length: 302 # NCBI annotation: putative major head subunit protein # Family: family:all:776 # MgeID: mge:1522 # MgeName: D3112 # Cross-refs: genbank:acc:NP_938242;genbank:gi:38229147;genbank:GeneID:2648201 Probab=29.99 E-value=1.6 Score=19.45 Aligned_cols=267 Identities=10% Similarity=0.044 Sum_probs=112.7 Q ss_pred ccccchhhhhhhHHHHHHHHHHHHhhhhhhhhhhhhccccCCCCcceeEEEEEEeecccce-EEecCcccccceeeeecc Q lcl|NC_019522. 6 FDVSPVSALSFLVNQAAHIESEIYRIEYPQFKYGTLLPLDNSAPDWAQAVMFRSIDARGEL-QLFGPNSTDVPTVDIAMS 84 (311) Q Consensus 6 ~~~~~~~~~~fl~~~L~~id~~v~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~G~a-~~~~~~a~dip~v~~~~~ 84 (311) |...++. +.-|..-+...-.+=|+. .+-+|+++.-. ++.--.+.+|..+...+.. +++|++ +.-...-. T Consensus 1 m~it~~~-l~~l~~~~~~~~~~~y~~--a~~~~~~~a~~---~~sdf~~~~~~~lg~~p~l~e~~Ge~----~~~~l~~~ 70 (302) T protein:vir:10 1 MLINKQS-LNAAFVAIKTIFNNAFAA--APTTWQKIAME---VPSNTSSNDYKWLSTFPKMRRWIGAK----VVKNLKAY 70 (302) T ss_pred CcccHHH-HHHHHHHHHHHHHHHHHh--hhhhhhceeee---cCCCcceeeceecCCCCCccccccce----eecccccc Confidence 4433211 111111111111112221 22355665432 2223345566666555444 344444 33344556 Q ss_pred ceeEEEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhhheee----eecccc---ceeeeec--CCcceee Q lcl|NC_019522. 85 QGFKDINTAALGYTYSIEEIGFAMLNNVNLDAERGQAVRDVVEQGLNKIYL----LGDKGV---GEGLYTS--PNVSVEA 155 (311) Q Consensus 85 ~~~~~v~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aa~~~~~~~~n~~~~----~G~~~~---g~GllN~--p~v~~~~ 155 (311) ..+.++..++..+.++.+.++-=. +-+-.+......++.++++++++| .|.... |.-|+.+ |..... T Consensus 71 ~~~i~~~~~g~~v~i~R~~i~nDd---lg~~~~~~~~~G~aaa~~~~~lv~~~L~~g~~~~~~DG~~fF~~dH~~g~~~- 146 (302) T protein:vir:10 71 KYVVENEDFEATVEVDRNDIEDDQ---IGIYSPQAKMAGYSAAQLPDELVYEAVNGAFTKPCFDGQYFIDTDHPVGDAS- 146 (302) T ss_pred ceeEEeecccceecccHHhhcccc---cchhHHHHHHHHHHHHhhHHHHHHHHHhccCCCcccCCcceecccccccccc- Confidence 788999999999999988776311 222244444445555555555555 343322 2234432 322111 Q ss_pred ccCCccccCcccccCC---HHHHHHHHHHHHHHHHhccCCc-eecceEEEeCHHHHHHHhcccccCCCCCcchHHHHHHH Q lcl|NC_019522. 156 ATSTFVALVAAIPTNG---TQPIIDFFGNAYNTVYLDNTLT-VHRPNTFVLPPAQFQLLARTLLSTQNASNVTLLQFLRT 231 (311) Q Consensus 156 ~~~~~~~~~t~w~~~t---~~ei~~di~~~~~~~~~~~~~~-~~~p~~l~lpp~~~~~L~~~~~~~~~~~~~Tvl~~l~~ 231 (311) .+..+...|.... ..+.+.....++.+.....+.. .-.|+.|++||+......+-........+ . T Consensus 147 ---~~N~g~~~~~~~~~~l~~~~~~aa~~am~~~k~~~G~~L~i~P~~LiVp~~le~~A~~ll~~~~~~~g--------~ 215 (302) T protein:vir:10 147 ---VSNKGTAPLSNASQAAAKAGYGAARTAMKKFKDEEGRSLNVSPNVLLVGPALEDVAKMLLTNPKLADN--------T 215 (302) T ss_pred ---cccccchhhhhcccccchHHHHHHHHHHHHHhhhcccccccCCCEEEecchhHHHHHHHhhccccCCC--------C Confidence 1111222343322 2223333333333332222222 24689999999887654332211111111 1 Q ss_pred hCC---ceEEEEchhcccCCCCcccEEEEEEcCcceeEEeecchhhhccceeeCCceEEEeeeeeeeeEE--EECCeEEE Q lcl|NC_019522. 232 NFP---DITFEDDILLKGAGVAGADRMAVYKKEIRIVKGHDVMPLRFLAPATADNVNFKVPAILRTGGTE--WRIPKAGH 306 (311) Q Consensus 232 n~~---~l~i~~~~~l~~ag~~g~~~~v~y~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~gGv~--i~~P~ai~ 306 (311) .|| .++++..|+|.+ + +..+++++ +..++..+.+. +..|. ++..--..+-|+. +++.+.+- T Consensus 216 ~Np~~g~~~~vv~p~L~s----~-~aWyL~a~-~~~i~~~~l~g--~~~P~------~~~~~~~~~dgv~~k~~~d~Gvd 281 (302) T protein:vir:10 216 PNPYVGTAELVVDGRIES----D-TAWFLLDT-TKPVKPFIFQP--RKQPE------FVSQVNLDSDDVFNLRKLKFGAE 281 (302) T ss_pred cceeccceEEEEeeccCC----C-CceEEEec-CCccceEEEcC--ccccE------EEeccCCCCCceEEEEEEEEeee Confidence 244 378999999953 2 34666654 44444332211 11111 1111111122222 12222222 Q ss_pred EeecC Q lcl|NC_019522. 307 YVDGV 311 (311) Q Consensus 307 ~~dGI 311 (311) ++.+. T Consensus 282 ~R~~~ 286 (302) T protein:vir:10 282 ARAAA 286 (302) T ss_pred eeeec Confidence 22233 No 173 >protein:vir:100057 Length: 375 # NCBI annotation: T7-like capsid protein # Family: family:all:975 # MgeID: mge:1604 # MgeName: P-SSP7 # Cross-refs: genbank:acc:YP_214206;genbank:gi:61806429;genbank:GeneID:3294737 Probab=28.30 E-value=1.8 Score=19.24 Aligned_cols=289 Identities=10% Similarity=0.065 Sum_probs=119.6 Q ss_pred CCccccccc-----chhhhh-hhHHHHHHHHHHHHhhhhhhhhhhhhccccCCCCcceeEEEEEEeecccceEE--ecCc Q lcl|NC_019522. 1 MAKSVFDVS-----PVSALS-FLVNQAAHIESEIYRIEYPQFKYGTLLPLDNSAPDWAQAVMFRSIDARGELQL--FGPN 72 (311) Q Consensus 1 ~~~~~~~~~-----~~~~~~-fl~~~L~~id~~v~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~G~a~~--~~~~ 72 (311) |.++-..+. +.+..+ |+....-.|+..+- ..-..+.++.+.+-- +-.++.|.. .|.++. +..+ T Consensus 9 ~~~~n~~t~~~~~~~~~~~al~le~f~geV~~~f~----~~si~~~~~~~rti~--~Gksv~f~~---iG~~t~~~~t~G 79 (375) T protein:vir:10 9 LGRSNLSTGTGYGGATDKYALYLKLFSGEMFKGFQ----HETIARDLVTKRTLK--NGKSLQFIY---TGRMTSSFHTPG 79 (375) T ss_pred cCccccCCccccccccchHHHHHHHHhHHHHHHHH----HHHhhhccccccccc--cCceEEEEe---eeeeEEeeecCC Confidence 221111111 112222 33333333444332 223445566654322 234444433 354443 3222 Q ss_pred c--cccceeeeeccceeEEEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhhheeee----e-ccccceee Q lcl|NC_019522. 73 S--TDVPTVDIAMSQGFKDINTAALGYTYSIEEIGFAMLNNVNLDAERGQAVRDVVEQGLNKIYLL----G-DKGVGEGL 145 (311) Q Consensus 73 a--~dip~v~~~~~~~~~~v~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aa~~~~~~~~n~~~~~----G-~~~~g~Gl 145 (311) . ++-|..+.+..+....|=. ..-+..-+.++..+ ++..++-.+-.+.+..++++..|+.++. | ......+. T Consensus 80 ~~i~~~~~~d~~~te~~l~ID~-~~y~~~~VdDiD~a-qa~~Dlr~e~s~~~G~aLA~~~D~~i~~~l~kaa~~~~p~~~ 157 (375) T protein:vir:10 80 TPILGNADKAPPVAEKTIVMDD-LLISSAFVYDLDET-LAHYELRGEISKKIGYALAEKYDRLIFRSITRGARSASPVSA 157 (375) T ss_pred cCcCCccccCCCCCceEEEecc-hhhhhhhHhhHHHH-hcCchhHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccc Confidence 1 1112223222222222211 11234445667664 4556677788888888999999987752 1 11100000 Q ss_pred e--ecCCcceeeccCCccccCcccccCCHHHHHHHHHHHHHHHHhccCCcee-cceEEEeCHHHHHHHhc-----ccccC Q lcl|NC_019522. 146 Y--TSPNVSVEAATSTFVALVAAIPTNGTQPIIDFFGNAYNTVYLDNTLTVH-RPNTFVLPPAQFQLLAR-----TLLST 217 (311) Q Consensus 146 l--N~p~v~~~~~~~~~~~~~t~w~~~t~~ei~~di~~~~~~~~~~~~~~~~-~p~~l~lpp~~~~~L~~-----~~~~~ 217 (311) - -.|+.......+ +.+.-...|++.+++-|.++..+|.+.+ +- ....++++|+.|..|-+ +.++ T Consensus 158 ~~~~~~Gg~~i~~~s----g~~~~~~~ta~~~~~ai~~a~~~Lde~~---VP~~~R~~vv~P~~y~~Ll~~~d~~~~~n- 229 (375) T protein:vir:10 158 TNFVEPGGTQIRVGS----GTNESDAFTASALVNAFYDAAAAMDEKG---VSSQGRCAVLNPRQYYALIQDIGSNGLVN- 229 (375) T ss_pred ccccccCcceeeecc----ccccccccCHHHHHHHHHHHHHHHhhcC---CCCCCCEEEeChHHHHHHHhcCCccceee- Confidence 0 011211222111 1112334579999999999998886542 21 23578999999988743 1222 Q ss_pred CCCC-cchHHHHHHHhCCceEEEEchhcccC----------------------------------CCC----------cc Q lcl|NC_019522. 218 QNAS-NVTLLQFLRTNFPDITFEDDILLKGA----------------------------------GVA----------GA 252 (311) Q Consensus 218 ~~~~-~~Tvl~~l~~n~~~l~i~~~~~l~~a----------------------------------g~~----------g~ 252 (311) .+.. +..+.....-..-.++|..+.++-.- |.. ++ T Consensus 230 ~d~~~~~~~~~g~v~~i~Gv~V~~Sn~lP~~~~~~~~~g~~~~~~a~~~~~~~~~~~~~~~~~~~g~~~~y~~d~~~~~~ 309 (375) T protein:vir:10 230 RDVQGSALQSGNGVIEIAGIHIYKSMNIPFLGKYGVKYGGTTGETSPGNLGSHIGPTPENANATGGVNNDYGTNAELGAK 309 (375) T ss_pred ecccccceeccceEEEEeceEEEEeccccccccccccccccccccchhhhhccccccCCcceeeccccccccccccccCc Confidence 1111 11111110001113444444444211 000 11 Q ss_pred cEEEEEEcCcceeEEeecchhhhcc---ceeeCCceEEEeeeeeeeeEEEECCeEEEEeecC Q lcl|NC_019522. 253 DRMAVYKKEIRIVKGHDVMPLRFLA---PATADNVNFKVPAILRTGGTEWRIPKAGHYVDGV 311 (311) Q Consensus 253 ~~~v~y~~~~~~~~~~~~~~~~~~~---p~~~~~~~~~~~~~~~~gGv~i~~P~ai~~~dGI 311 (311) -..+++ +++-+.-.-.++++... -.+.+...+.+-..... |+.+.||++++-+.== T Consensus 310 ~~~~~~--~~~A~g~v~~~~~~~~~~~~~~~~~~q~~~i~~~~a~-G~~~lrp~~av~l~~~ 368 (375) T protein:vir:10 310 SCGLIF--QKEAAGVVEAIGPQVQVTNGDVSVIYQGDVILGRMAM-GADYLNPAAAVELYIG 368 (375) T ss_pred eEEEEE--chhheeeeeeeccccccccchhhheeeeeeeeeeeee-ccCccCceeEEEEecC Confidence 112222 22222211112221110 01111222333334444 6788999987655321 No 174 >protein:vir:80835 Length: 464 # NCBI annotation: putative major capsid protein # Family: family:all:2450 # MgeID: mge:1885 # MgeName: phiEF24C # Cross-refs: genbank:acc:YP_001504125;genbank:gi:158079312;genbank:GeneID:5666484 Probab=26.67 E-value=1.9 Score=19.03 Aligned_cols=276 Identities=12% Similarity=0.059 Sum_probs=113.0 Q ss_pred CCcccccccchhhhhhhHHHHHHHHHHHHhhhh--hhhhhhhhccccCCCCcceeEEEEEEeec---ccceEEecCcccc Q lcl|NC_019522. 1 MAKSVFDVSPVSALSFLVNQAAHIESEIYRIEY--PQFKYGTLLPLDNSAPDWAQAVMFRSIDA---RGELQLFGPNSTD 75 (311) Q Consensus 1 ~~~~~~~~~~~~~~~fl~~~L~~id~~v~~~~~--~~~~~~~~~~v~~~~~~~~~~~~~~~~~~---~G~a~~~~~~a~d 75 (311) -++..-+-+++-+.+-|.+ |.+|+.|-.--+ .+++.-+.++-. +.... .-.|.++.. .|.+...+.. .. T Consensus 23 ttgy~~~p~~q~~~~AlRr--EsL~~~i~~Lt~~~~~f~f~~di~k~-~a~ST--V~~y~~~~~~G~~g~~~f~~E~-g~ 96 (464) T protein:vir:80 23 TTGYGITPESQTDAAALRR--EFLDDQITMLTWADGDLSFYRDITKR-PATST--VAKYDVYLAHGRVGHTRFTREI-GV 96 (464) T ss_pred HhCCccCcccccCcchhhh--hhhhhhhheeeecccchhhhhhcCCc-hhhhh--hhhhheeeccCccccccccccc-cc Confidence 3334444445555555665 456665544332 334444444422 22222 223333333 4444444443 44 Q ss_pred cceeeeeccceeEEEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhhheeeeeccccc------eeeeecC Q lcl|NC_019522. 76 VPTVDIAMSQGFKDINTAALGYTYSIEEIGFAMLNNVNLDAERGQAVRDVVEQGLNKIYLLGDKGVG------EGLYTSP 149 (311) Q Consensus 76 ip~v~~~~~~~~~~v~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aa~~~~~~~~n~~~~~G~~~~g------~GllN~p 149 (311) .+..+.++.+++..+..+......++.- ...+. +.+--.+..+.|...+++......||||+++. .|| ..- T Consensus 97 ~~~~d~~~~Rr~~~~Kfl~~~r~vsia~-~lvn~-~~d~~~~~~~dai~~va~tiE~a~FyGds~l~~~~~~~~gl-eFD 173 (464) T protein:vir:80 97 APISDPNLRQKTVNMKYVSDTKNMSIAT-GLVNN-IEDPMRILTDDAISVVAKTIEWASFYGDSDLSENPDAGSGL-EFD 173 (464) T ss_pred cccCCCceEEEEEEeeeeecceeeeeeh-hhhcc-hhhHHHHHHHHHHHHHHHHHHHHHhhhccccCCCCCCcccc-chh Confidence 5777888888888887776666664321 11122 22333466778888899999999999998753 111 122 Q ss_pred Ccceeecc---CCccccCcccccCCHHHHHHHHHHHHHHHHhccCCceecceEEEeCHHHHHHH-hcccccCCCCCcchH Q lcl|NC_019522. 150 NVSVEAAT---STFVALVAAIPTNGTQPIIDFFGNAYNTVYLDNTLTVHRPNTFVLPPAQFQLL-ARTLLSTQNASNVTL 225 (311) Q Consensus 150 ~v~~~~~~---~~~~~~~t~w~~~t~~ei~~di~~~~~~~~~~~~~~~~~p~~l~lpp~~~~~L-~~~~~~~~~~~~~Tv 225 (311) |+.....+ -+..|+. +. .+.|+++-..+. .+.-.|+-++||......+ ++-+.. + T Consensus 174 Gl~~lI~~~NViDarG~~--Ls-------~~~ln~Aa~~i~----~~fGt~TD~~lp~~v~a~f~n~~l~~---q----- 232 (464) T protein:vir:80 174 GLAKLIDKHNVLDAKGAS--LT-------EALLNQASVLVG----KGYGTPTDAYMPIGVQADFVNQQLDR---Q----- 232 (464) T ss_pred hhHhhcCCCceeecCCCC--cC-------HHHHhhhhhhhh----cccCChhhcccchhHHHHHHhhhcCc---e----- Confidence 22111111 1111111 11 355665554442 2344689999999998775 321110 0 Q ss_pred HHHHHHhCCceEE-EEchhcccCCCCcccEEE--EEEcCcceeEEe------ecchhhhcccee------------eCCc Q lcl|NC_019522. 226 LQFLRTNFPDITF-EDDILLKGAGVAGADRMA--VYKKEIRIVKGH------DVMPLRFLAPAT------------ADNV 284 (311) Q Consensus 226 l~~l~~n~~~l~i-~~~~~l~~ag~~g~~~~v--~y~~~~~~~~~~------~~~~~~~~~p~~------------~~~~ 284 (311) ..++..|+.+... ..++-+.++. |--+.- .+-+++..+... .|.|-+....+. .... T Consensus 233 ~~~~~~n~~~~~~G~~v~~f~sa~--G~i~L~~s~~m~~~~~ld~~~~~~~~apaapsvt~tv~~~~~g~f~~~~~~~~~ 310 (464) T protein:vir:80 233 VQVISDNGQNATMGFNVKGFNSAR--GFIRLHGSTVMELEQILDENRMQLPNAPQKATVKATLEAGTKGKFRDEDLTIDT 310 (464) T ss_pred eEEEcCCCCcceeeeecccccccc--cceeccCccccCcccccccccccCCCCcCCceeEEEecCCcccCCcccccccee Confidence 0011122222111 0111111110 000000 000011111000 111111110001 1112 Q ss_pred eEEEeeeeeeeeEEEECCeE--EEE----eecC Q lcl|NC_019522. 285 NFKVPAILRTGGTEWRIPKA--GHY----VDGV 311 (311) Q Consensus 285 ~~~~~~~~~~gGv~i~~P~a--i~~----~dGI 311 (311) +|++...+.-|. -.|.. -++ -+|| T Consensus 311 ~Ykv~~vn~~Ge---S~ps~~~~~ti~~~~~~V 340 (464) T protein:vir:80 311 EYKVVVVSDDAE---SAPSDVASVVIDDKKKQV 340 (464) T ss_pred EEEEEEECCCCc---cccceeeeeeecCcccEE Confidence 344433332221 01111 111 1122 No 175 >protein:vir:107826 Length: 331 # NCBI annotation: hypothetical protein predicted by GeneMark # Family: family:all:1903 # MgeID: mge:1673 # MgeName: BIP-1 # Cross-refs: genbank:acc:NP_996627;genbank:gi:45580761;genbank:GeneID:2767902 Probab=25.64 E-value=2 Score=18.89 Aligned_cols=228 Identities=12% Similarity=0.022 Sum_probs=100.4 Q ss_pred CCcccccccchhhhhhhHHHHHHHHHHHHhhhhhhhhhhhhccccCCCCcceeEEEEEEeecccceEEecCcccccceee Q lcl|NC_019522. 1 MAKSVFDVSPVSALSFLVNQAAHIESEIYRIEYPQFKYGTLLPLDNSAPDWAQAVMFRSIDARGELQLFGPNSTDVPTVD 80 (311) Q Consensus 1 ~~~~~~~~~~~~~~~fl~~~L~~id~~v~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~G~a~~~~~~a~dip~v~ 80 (311) |+...|.+-+-...+=...--..+++.|+|.....=.....+|... ++-.+- ..+.+....-.+.|..-+ ..+|-.. T Consensus 1 m~~~~~~~~TL~e~Ak~~~~~~~l~~~IIE~l~~tn~IL~~lpf~e-~N~~t~-~~~~vrt~LP~~~fR~lN-~g~~~s~ 77 (331) T protein:vir:10 1 MPTLSTTNPTLADVAARMTPDGKIDPQIVEMLNETNEILDDMTVIE-ANGFTE-HKTTVRSGLPTGTWRKLN-YGVQPEK 77 (331) T ss_pred CCccccCcccHHHHHHhcCcchhHHHHHHHHHhcCchHHhhceeee-ccCCcc-ceeeEEeccCCchhhccC-CccCccc Confidence 5544333222112211111113466778888655545567777653 221111 112232323333332222 2355556 Q ss_pred eeccceeEEEEEEEEEEEecHHHHHHHHHhCC--ChHHHHHHHHHHHHHHhhhheeeeeccccc---e-ee---eecCCc Q lcl|NC_019522. 81 IAMSQGFKDINTAALGYTYSIEEIGFAMLNNV--NLDAERGQAVRDVVEQGLNKIYLLGDKGVG---E-GL---YTSPNV 151 (311) Q Consensus 81 ~~~~~~~~~v~~~~~~~~~~~~El~~a~~~g~--~l~~~k~~aa~~~~~~~~n~~~~~G~~~~g---~-Gl---lN~p~v 151 (311) ....+.+..+..++..+++.. ++ |...|- .+-++...+-.++..+...+.+||||...+ + || +|+++. T Consensus 78 ~tt~q~t~~l~ilgg~~eVDk-~l--a~~~Gn~~~~ra~e~~~~ik~m~~~~~~~~iyGD~a~~p~~F~GL~kR~~~~~a 154 (331) T protein:vir:10 78 SRTVQVKDSMGMLETYAEVDK-AL--ADLNGNSAAWRLSEDRAFIEGMNQTQATTLFYGDSSIDAEKFMGLTPRFNSLSA 154 (331) T ss_pred ceeEEEEEEEEEeccceeech-HH--HhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhcCCcccChhhhccchhhcccccc Confidence 666777888888887777764 23 333332 233555666677788888889999987632 2 43 222110 Q ss_pred ceeeccCCccccCcccccCCHHHHHHHHHHHHHHHHhccCCceecceEEEeCHHHHHHHhcccccCCCCCcchHHHHHHH Q lcl|NC_019522. 152 SVEAATSTFVALVAAIPTNGTQPIIDFFGNAYNTVYLDNTLTVHRPNTFVLPPAQFQLLARTLLSTQNASNVTLLQFLRT 231 (311) Q Consensus 152 ~~~~~~~~~~~~~t~w~~~t~~ei~~di~~~~~~~~~~~~~~~~~p~~l~lpp~~~~~L~~~~~~~~~~~~~Tvl~~l~~ 231 (311) . ++++ .+++ -++++ ..+.| T Consensus 155 ~------------------~~~q-------~Ida---GgtG~--~~TSI------------------------------- 173 (331) T protein:vir:10 155 E------------------NGQN-------IIDA---GGTGS--DNASI------------------------------- 173 (331) T ss_pred c------------------cccc-------eeec---CCCCC--CceEE------------------------------- Confidence 0 0000 0000 00000 00111 Q ss_pred hCCceEEEEchhcccCCCCcccEEEEEEcCcceeEEeecchhhh--ccceeeCC-ceEEEeeeeeeeeEEEECCeEEEEe Q lcl|NC_019522. 232 NFPDITFEDDILLKGAGVAGADRMAVYKKEIRIVKGHDVMPLRF--LAPATADN-VNFKVPAILRTGGTEWRIPKAGHYV 308 (311) Q Consensus 232 n~~~l~i~~~~~l~~ag~~g~~~~v~y~~~~~~~~~~~~~~~~~--~~p~~~~~-~~~~~~~~~~~gGv~i~~P~ai~~~ 308 (311) .-+.|=.. -..-+|-+..+ ..+. ..|+.. +.-.+-+. -.|..-|.-++ |+.|+.|-+++++ T Consensus 174 -------~~v~~~~~------~~~giyPkG~~-~Gl~-~~d~g~~~~~~~~G~~y~~y~~~~~w~~-Gl~i~d~r~v~ri 237 (331) T protein:vir:10 174 -------WLTVWGPN------TLHTIYPKGSQ-AGLQ-SRDLGEDTLIDAAGGRYQGYRTHYKWDI-GLTLRDWRYVVRI 237 (331) T ss_pred -------EEEEEcCC------eeEEecccccc-cCce-EeecCceeeecCCCCeeeEEEEEEEeee-eeEEcCcccEEEE Confidence 11111000 00011111110 1110 112210 00001111 13666666676 7999999999999 Q ss_pred ecC Q lcl|NC_019522. 309 DGV 311 (311) Q Consensus 309 dGI 311 (311) -.| T Consensus 238 ~NI 240 (331) T protein:vir:10 238 ANV 240 (331) T ss_pred ecc Confidence 999 No 176 >protein:vir:107388 Length: 331 # NCBI annotation: Bbp17 # Family: family:all:1903 # MgeID: mge:1537 # MgeName: BPP-1 # Cross-refs: genbank:acc:NP_958686;genbank:gi:41179378;genbank:GeneID:2717182 Probab=25.64 E-value=2 Score=18.89 Aligned_cols=228 Identities=12% Similarity=0.022 Sum_probs=100.4 Q ss_pred CCcccccccchhhhhhhHHHHHHHHHHHHhhhhhhhhhhhhccccCCCCcceeEEEEEEeecccceEEecCcccccceee Q lcl|NC_019522. 1 MAKSVFDVSPVSALSFLVNQAAHIESEIYRIEYPQFKYGTLLPLDNSAPDWAQAVMFRSIDARGELQLFGPNSTDVPTVD 80 (311) Q Consensus 1 ~~~~~~~~~~~~~~~fl~~~L~~id~~v~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~G~a~~~~~~a~dip~v~ 80 (311) |+...|.+-+-...+=...--..+++.|+|.....=.....+|... ++-.+- ..+.+....-.+.|..-+ ..+|-.. T Consensus 1 m~~~~~~~~TL~e~Ak~~~~~~~l~~~IIE~l~~tn~IL~~lpf~e-~N~~t~-~~~~vrt~LP~~~fR~lN-~g~~~s~ 77 (331) T protein:vir:10 1 MPTLSTTNPTLADVAARMTPDGKIDPQIVEMLNETNEILDDMTVIE-ANGFTE-HKTTVRSGLPTGTWRKLN-YGVQPEK 77 (331) T ss_pred CCccccCcccHHHHHHhcCcchhHHHHHHHHHhcCchHHhhceeee-ccCCcc-ceeeEEeccCCchhhccC-CccCccc Confidence 5544333222112211111113466778888655545567777653 221111 112232323333332222 2355556 Q ss_pred eeccceeEEEEEEEEEEEecHHHHHHHHHhCC--ChHHHHHHHHHHHHHHhhhheeeeeccccc---e-ee---eecCCc Q lcl|NC_019522. 81 IAMSQGFKDINTAALGYTYSIEEIGFAMLNNV--NLDAERGQAVRDVVEQGLNKIYLLGDKGVG---E-GL---YTSPNV 151 (311) Q Consensus 81 ~~~~~~~~~v~~~~~~~~~~~~El~~a~~~g~--~l~~~k~~aa~~~~~~~~n~~~~~G~~~~g---~-Gl---lN~p~v 151 (311) ....+.+..+..++..+++.. ++ |...|- .+-++...+-.++..+...+.+||||...+ + || +|+++. T Consensus 78 ~tt~q~t~~l~ilgg~~eVDk-~l--a~~~Gn~~~~ra~e~~~~ik~m~~~~~~~~iyGD~a~~p~~F~GL~kR~~~~~a 154 (331) T protein:vir:10 78 SRTVQVKDSMGMLETYAEVDK-AL--ADLNGNSAAWRLSEDRAFIEGMNQTQATTLFYGDSSIDAEKFMGLTPRFNSLSA 154 (331) T ss_pred ceeEEEEEEEEEeccceeech-HH--HhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhcCCcccChhhhccchhhcccccc Confidence 666777888888887777764 23 333332 233555666677788888889999987632 2 43 222110 Q ss_pred ceeeccCCccccCcccccCCHHHHHHHHHHHHHHHHhccCCceecceEEEeCHHHHHHHhcccccCCCCCcchHHHHHHH Q lcl|NC_019522. 152 SVEAATSTFVALVAAIPTNGTQPIIDFFGNAYNTVYLDNTLTVHRPNTFVLPPAQFQLLARTLLSTQNASNVTLLQFLRT 231 (311) Q Consensus 152 ~~~~~~~~~~~~~t~w~~~t~~ei~~di~~~~~~~~~~~~~~~~~p~~l~lpp~~~~~L~~~~~~~~~~~~~Tvl~~l~~ 231 (311) . ++++ .+++ -++++ ..+.| T Consensus 155 ~------------------~~~q-------~Ida---GgtG~--~~TSI------------------------------- 173 (331) T protein:vir:10 155 E------------------NGQN-------IIDA---GGTGS--DNASI------------------------------- 173 (331) T ss_pred c------------------cccc-------eeec---CCCCC--CceEE------------------------------- Confidence 0 0000 0000 00000 00111 Q ss_pred hCCceEEEEchhcccCCCCcccEEEEEEcCcceeEEeecchhhh--ccceeeCC-ceEEEeeeeeeeeEEEECCeEEEEe Q lcl|NC_019522. 232 NFPDITFEDDILLKGAGVAGADRMAVYKKEIRIVKGHDVMPLRF--LAPATADN-VNFKVPAILRTGGTEWRIPKAGHYV 308 (311) Q Consensus 232 n~~~l~i~~~~~l~~ag~~g~~~~v~y~~~~~~~~~~~~~~~~~--~~p~~~~~-~~~~~~~~~~~gGv~i~~P~ai~~~ 308 (311) .-+.|=.. -..-+|-+..+ ..+. ..|+.. +.-.+-+. -.|..-|.-++ |+.|+.|-+++++ T Consensus 174 -------~~v~~~~~------~~~giyPkG~~-~Gl~-~~d~g~~~~~~~~G~~y~~y~~~~~w~~-Gl~i~d~r~v~ri 237 (331) T protein:vir:10 174 -------WLTVWGPN------TLHTIYPKGSQ-AGLQ-SRDLGEDTLIDAAGGRYQGYRTHYKWDI-GLTLRDWRYVVRI 237 (331) T ss_pred -------EEEEEcCC------eeEEecccccc-cCce-EeecCceeeecCCCCeeeEEEEEEEeee-eeEEcCcccEEEE Confidence 11111000 00011111110 1110 112210 00001111 13666666676 7999999999999 Q ss_pred ecC Q lcl|NC_019522. 309 DGV 311 (311) Q Consensus 309 dGI 311 (311) -.| T Consensus 238 ~NI 240 (331) T protein:vir:10 238 ANV 240 (331) T ss_pred ecc Confidence 999 No 177 >protein:vir:98525 Length: 331 # NCBI annotation: hypothetical protein predicted by GeneMark # Family: family:all:1903 # MgeID: mge:1592 # MgeName: BMP-1 # Cross-refs: genbank:acc:NP_996579;genbank:gi:45569510;genbank:GeneID:2767853 Probab=25.64 E-value=2 Score=18.89 Aligned_cols=228 Identities=12% Similarity=0.022 Sum_probs=100.4 Q ss_pred CCcccccccchhhhhhhHHHHHHHHHHHHhhhhhhhhhhhhccccCCCCcceeEEEEEEeecccceEEecCcccccceee Q lcl|NC_019522. 1 MAKSVFDVSPVSALSFLVNQAAHIESEIYRIEYPQFKYGTLLPLDNSAPDWAQAVMFRSIDARGELQLFGPNSTDVPTVD 80 (311) Q Consensus 1 ~~~~~~~~~~~~~~~fl~~~L~~id~~v~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~G~a~~~~~~a~dip~v~ 80 (311) |+...|.+-+-...+=...--..+++.|+|.....=.....+|... ++-.+- ..+.+....-.+.|..-+ ..+|-.. T Consensus 1 m~~~~~~~~TL~e~Ak~~~~~~~l~~~IIE~l~~tn~IL~~lpf~e-~N~~t~-~~~~vrt~LP~~~fR~lN-~g~~~s~ 77 (331) T protein:vir:98 1 MPTLSTTNPTLADVAARMTPDGKIDPQIVEMLNETNEILDDMTVIE-ANGFTE-HKTTVRSGLPTGTWRKLN-YGVQPEK 77 (331) T ss_pred CCccccCcccHHHHHHhcCcchhHHHHHHHHHhcCchHHhhceeee-ccCCcc-ceeeEEeccCCchhhccC-CccCccc Confidence 5544333222112211111113466778888655545567777653 221111 112232323333332222 2355556 Q ss_pred eeccceeEEEEEEEEEEEecHHHHHHHHHhCC--ChHHHHHHHHHHHHHHhhhheeeeeccccc---e-ee---eecCCc Q lcl|NC_019522. 81 IAMSQGFKDINTAALGYTYSIEEIGFAMLNNV--NLDAERGQAVRDVVEQGLNKIYLLGDKGVG---E-GL---YTSPNV 151 (311) Q Consensus 81 ~~~~~~~~~v~~~~~~~~~~~~El~~a~~~g~--~l~~~k~~aa~~~~~~~~n~~~~~G~~~~g---~-Gl---lN~p~v 151 (311) ....+.+..+..++..+++.. ++ |...|- .+-++...+-.++..+...+.+||||...+ + || +|+++. T Consensus 78 ~tt~q~t~~l~ilgg~~eVDk-~l--a~~~Gn~~~~ra~e~~~~ik~m~~~~~~~~iyGD~a~~p~~F~GL~kR~~~~~a 154 (331) T protein:vir:98 78 SRTVQVKDSMGMLETYAEVDK-AL--ADLNGNSAAWRLSEDRAFIEGMNQTQATTLFYGDSSIDAEKFMGLTPRFNSLSA 154 (331) T ss_pred ceeEEEEEEEEEeccceeech-HH--HhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhcCCcccChhhhccchhhcccccc Confidence 666777888888887777764 23 333332 233555666677788888889999987632 2 43 222110 Q ss_pred ceeeccCCccccCcccccCCHHHHHHHHHHHHHHHHhccCCceecceEEEeCHHHHHHHhcccccCCCCCcchHHHHHHH Q lcl|NC_019522. 152 SVEAATSTFVALVAAIPTNGTQPIIDFFGNAYNTVYLDNTLTVHRPNTFVLPPAQFQLLARTLLSTQNASNVTLLQFLRT 231 (311) Q Consensus 152 ~~~~~~~~~~~~~t~w~~~t~~ei~~di~~~~~~~~~~~~~~~~~p~~l~lpp~~~~~L~~~~~~~~~~~~~Tvl~~l~~ 231 (311) . ++++ .+++ -++++ ..+.| T Consensus 155 ~------------------~~~q-------~Ida---GgtG~--~~TSI------------------------------- 173 (331) T protein:vir:98 155 E------------------NGQN-------IIDA---GGTGS--DNASI------------------------------- 173 (331) T ss_pred c------------------cccc-------eeec---CCCCC--CceEE------------------------------- Confidence 0 0000 0000 00000 00111 Q ss_pred hCCceEEEEchhcccCCCCcccEEEEEEcCcceeEEeecchhhh--ccceeeCC-ceEEEeeeeeeeeEEEECCeEEEEe Q lcl|NC_019522. 232 NFPDITFEDDILLKGAGVAGADRMAVYKKEIRIVKGHDVMPLRF--LAPATADN-VNFKVPAILRTGGTEWRIPKAGHYV 308 (311) Q Consensus 232 n~~~l~i~~~~~l~~ag~~g~~~~v~y~~~~~~~~~~~~~~~~~--~~p~~~~~-~~~~~~~~~~~gGv~i~~P~ai~~~ 308 (311) .-+.|=.. -..-+|-+..+ ..+. ..|+.. +.-.+-+. -.|..-|.-++ |+.|+.|-+++++ T Consensus 174 -------~~v~~~~~------~~~giyPkG~~-~Gl~-~~d~g~~~~~~~~G~~y~~y~~~~~w~~-Gl~i~d~r~v~ri 237 (331) T protein:vir:98 174 -------WLTVWGPN------TLHTIYPKGSQ-AGLQ-SRDLGEDTLIDAAGGRYQGYRTHYKWDI-GLTLRDWRYVVRI 237 (331) T ss_pred -------EEEEEcCC------eeEEecccccc-cCce-EeecCceeeecCCCCeeeEEEEEEEeee-eeEEcCcccEEEE Confidence 11111000 00011111110 1110 112210 00001111 13666666676 7999999999999 Q ss_pred ecC Q lcl|NC_019522. 309 DGV 311 (311) Q Consensus 309 dGI 311 (311) -.| T Consensus 238 ~NI 240 (331) T protein:vir:98 238 ANV 240 (331) T ss_pred ecc Confidence 999 No 178 >protein:vir:7019 Length: 401 # NCBI annotation: major capsid protein # Family: family:all:2806 # MgeID: mge:141 # MgeName: SP6 # Cross-refs: genbank:acc:NP_853592;genbank:gi:31711674;genbank:GeneID:1481800 Probab=25.16 E-value=2.1 Score=18.83 Aligned_cols=287 Identities=7% Similarity=-0.007 Sum_probs=118.1 Q ss_pred CCcccccccc-----hhhhh-hhHHHHHHHHHHHHhhhhhhhhhhhhccccCCCCcceeEEEEEEeecccceEEec-Ccc Q lcl|NC_019522. 1 MAKSVFDVSP-----VSALS-FLVNQAAHIESEIYRIEYPQFKYGTLLPLDNSAPDWAQAVMFRSIDARGELQLFG-PNS 73 (311) Q Consensus 1 ~~~~~~~~~~-----~~~~~-fl~~~L~~id~~v~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~G~a~~~~-~~a 73 (311) |.-+.-.+.. .+..+ ||...+-.++..+-+. -..+.++.+++ +. +..++.|.. .|+.+.-. ..+ T Consensus 1 Ms~~n~~t~~~~~~sg~~~al~Le~f~GeV~taF~~~----si~~~~~~vRt-i~-~gkS~qf~~---~G~s~~~~~~pG 71 (401) T protein:vir:70 1 MSTPNNLTNVAVSASGEVDSLLIEKFNGKVNEQYLKG----ENIMSYFDVQT-VT-GTNTVSNKY---LGETELQVLAPG 71 (401) T ss_pred CCCCccccccccccccchhHhHHhHhcchHHHHHHHH----hhhcccceeee-ec-ccceEEEEE---eeeeEeeeecCC Confidence 4433222222 11122 4443333444444221 23345555553 11 233444333 34443211 111 Q ss_pred cccceeeeeccce--eEEEEEEEEEEEecHHHHHHHHHhCCC-hHHHHHHHHHHHHHHhhhheee-----eeccc-cc-- Q lcl|NC_019522. 74 TDVPTVDIAMSQG--FKDINTAALGYTYSIEEIGFAMLNNVN-LDAERGQAVRDVVEQGLNKIYL-----LGDKG-VG-- 142 (311) Q Consensus 74 ~dip~v~~~~~~~--~~~v~~~~~~~~~~~~El~~a~~~g~~-l~~~k~~aa~~~~~~~~n~~~~-----~G~~~-~g-- 142 (311) +.+-.....-++. +..-..+.-.+-|. +..++ ..++ +..+-....-.++++..|+.++ -|-.. .. T Consensus 72 ~~ld~~~~~~dK~~ItID~lL~a~~~V~d---lDe~q-~~yD~vRse~s~e~G~ALA~~~Dq~iiq~i~~aa~ana~~~~ 147 (401) T protein:vir:70 72 QSPAATSTQADKNQLVIDATVIARNTVAH---LHDVQ-GDIDSLKPKLATNQAKQLKRMEDEMLIQQMMLGGIANTQAKR 147 (401) T ss_pred CCcCCCCcccccEEEEeCceeehhhhhhh---HHHHH-hcccccchHHHHHHHHHHHHHHHHHHHHHHHHhccccccccc Confidence 1111111111221 11122222223333 33322 2333 3445555555566666565331 12110 00 Q ss_pred eeeeecCCcceeeccCCccccCcccccCCHHHHHHHHHHHHHHHHhccCCceecceEEEeCHHHHHHHhcc--cccCC-- Q lcl|NC_019522. 143 EGLYTSPNVSVEAATSTFVALVAAIPTNGTQPIIDFFGNAYNTVYLDNTLTVHRPNTFVLPPAQFQLLART--LLSTQ-- 218 (311) Q Consensus 143 ~GllN~p~v~~~~~~~~~~~~~t~w~~~t~~ei~~di~~~~~~~~~~~~~~~~~p~~l~lpp~~~~~L~~~--~~~~~-- 218 (311) ..-..-++-...+..+ ...-...+++++.+-|.++...+.+..=- ..-..+++||..|..|... +++.. T Consensus 148 ~~p~~~~~G~~i~v~~-----~~~~~~~~~~~l~~ai~dA~~~LdEkdVP--~~r~vvl~pp~~Ys~Ll~~d~L~nrd~~ 220 (401) T protein:vir:70 148 TNPRVKGHGFSINVEV-----AEGEALVNPQYVMAAVEFALEQQLEQEVD--ISDVAILMPWRYFNVLRDADRIVDKTYT 220 (401) T ss_pred cCCCcCCCceEEeccc-----cccccccCHHHHHHHHHHHHHHHHhcCCC--ccceEEEcCHHHHHHHHhcCcccchhhc Confidence 0000001111111111 11123457899999999999988653211 1235778899999777543 33211 Q ss_pred -CCCcchHHHHHHHhCCceEEEEchhccc------------CC---------CCcccEEEEEEcCcceeEEeecchhhhc Q lcl|NC_019522. 219 -NASNVTLLQFLRTNFPDITFEDDILLKG------------AG---------VAGADRMAVYKKEIRIVKGHDVMPLRFL 276 (311) Q Consensus 219 -~~~~~Tvl~~l~~n~~~l~i~~~~~l~~------------ag---------~~g~~~~v~y~~~~~~~~~~~~~~~~~~ 276 (311) .+.+..+...+.. --.++|+.++.|.. ++ ...+-++++|.++. +.-.=.+|++.- T Consensus 221 ~s~~g~~~~G~v~~-vaGv~Vv~SnnlP~~a~~it~~~ls~a~~G~~y~~~~d~s~~~~v~f~~~A--v~tvk~~~lt~~ 297 (401) T protein:vir:70 221 ISQSGATIQGFTLS-SYNCPVIPSNRFPKYSQGQTHHLLSNEDNGYRYDPLPAMNGAIAVLFTADA--LLVGRSIDVTGD 297 (401) T ss_pred cccCCccccceEEE-EeceEEEeeccccccccccccccccccCCCccCCCCccccceeEEEEehhh--eEEEEeeccccc Confidence 0111112111110 11245555555522 11 11234556665542 221122344311 Q ss_pred cceeeCCceEEEeeeeeeeeEEEECCeEEEEe----ecC Q lcl|NC_019522. 277 APATADNVNFKVPAILRTGGTEWRIPKAGHYV----DGV 311 (311) Q Consensus 277 ~p~~~~~~~~~~~~~~~~gGv~i~~P~ai~~~----dGI 311 (311) .-.+.+...|.+.+...+ |+-.+||++++.+ +|. T Consensus 298 ~~~d~r~~~~~id~~~a~-g~g~~RPeaa~vv~~k~~~~ 335 (401) T protein:vir:70 298 IFYEKKEKTYYIDTFMAE-GAIPDRWEAVSVVTTKRNTT 335 (401) T ss_pred hhhhhhhhHHHHHHHHHh-CCcccchhheEEEeecCccc Confidence 113556666777777776 7999999999775 222 No 179 >protein:vir:95318 Length: 328 # NCBI annotation: hypothetical protein # Family: family:all:1903 # MgeID: mge:1564 # MgeName: phiV10 # Cross-refs: genbank:acc:YP_512264;genbank:gi:89152431;genbank:GeneID:3952987 Probab=22.42 E-value=2.4 Score=18.45 Aligned_cols=225 Identities=11% Similarity=0.042 Sum_probs=106.0 Q ss_pred CCcccccccchhhhhhhHHHH--HHHHHHHHhhhhhhhhhhhhccccCCCCcceeEEEEEEeecccceEEecCcccccce Q lcl|NC_019522. 1 MAKSVFDVSPVSALSFLVNQA--AHIESEIYRIEYPQFKYGTLLPLDNSAPDWAQAVMFRSIDARGELQLFGPNSTDVPT 78 (311) Q Consensus 1 ~~~~~~~~~~~~~~~fl~~~L--~~id~~v~~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~G~a~~~~~~a~dip~ 78 (311) |+...|-+-+-...+ ..+ ....+.|+|.....-.....+|...-... ....|.+....-.+.+..-+ ..+|- T Consensus 1 m~~~~~~~~TL~e~A---kr~~~d~~~~~VIE~l~~~n~IL~~lpf~e~n~g--t~~~~~v~~~LP~~~fR~lN-~g~~~ 74 (328) T protein:vir:95 1 MAVKGLTALTLADWG---KRVDPNGKVDKIIELLGQTNPILQDMPFVEGNLP--TGHRTTIRSGLPSATWRLLN-YGVQP 74 (328) T ss_pred CCccccccccHHHHH---hhhCcchhHHHHHHHHhccchhHhhcceeecccC--CcceeeEeeccCCceeeecC-CccCc Confidence 555544322211111 112 22556788877765566777776533211 11344445444444543333 33566 Q ss_pred eeeeccceeEEEEEEEEEEEecHHHHHHHHHhC-C-ChHHHHHHHHHHHHHHhhhheeeeeccccc---e-ee---eecC Q lcl|NC_019522. 79 VDIAMSQGFKDINTAALGYTYSIEEIGFAMLNN-V-NLDAERGQAVRDVVEQGLNKIYLLGDKGVG---E-GL---YTSP 149 (311) Q Consensus 79 v~~~~~~~~~~v~~~~~~~~~~~~El~~a~~~g-~-~l~~~k~~aa~~~~~~~~n~~~~~G~~~~g---~-Gl---lN~p 149 (311) ......+.+..+..++..+++.. +++ ...| . .+-+++..+-.++..++..+.+||||.+.+ + || +|++ T Consensus 75 s~~tt~q~t~~l~ilgg~~eVDr-~la--~~~Gn~~~~ra~q~~~~~ka~~~~~~~~~iyGdsa~~p~~F~GL~~R~~~~ 151 (328) T protein:vir:95 75 SKSTTVQVTDSVGMLETYAEVDK-SLA--DLNGNTAEFRLSEDRAFIEAMNQQMAQTLFYGDSSVNPQQFMGLSSRYSSL 151 (328) T ss_pred ccceeEEEEEEEEEEecceeech-HHH--hhcCCHHHHHHHHHHHHHHHHHHHHHHHHhcCCccCChhhhcchhhhcCcc Confidence 67778888889999998888875 333 2233 2 234566667778888888888999987632 1 33 1111 Q ss_pred CcceeeccCCccccCcccccCCHHHHHHHHHHHHHHHHhccCCceecceEEEeCHHHHHHHhcccccCCCCCcchHHHHH Q lcl|NC_019522. 150 NVSVEAATSTFVALVAAIPTNGTQPIIDFFGNAYNTVYLDNTLTVHRPNTFVLPPAQFQLLARTLLSTQNASNVTLLQFL 229 (311) Q Consensus 150 ~v~~~~~~~~~~~~~t~w~~~t~~ei~~di~~~~~~~~~~~~~~~~~p~~l~lpp~~~~~L~~~~~~~~~~~~~Tvl~~l 229 (311) +... +.+ +. +.++ ++....|| |+ T Consensus 152 s~~~------------------a~q-----------ii--daGg------------------------tg~~~TSi--~~ 174 (328) T protein:vir:95 152 SAGN------------------AQN-----------II--DAGG------------------------TGTDNTSI--WL 174 (328) T ss_pred cccc------------------ccc-----------ee--eccc------------------------CCCCceEE--EE Confidence 1000 000 00 0000 00000000 00 Q ss_pred HHhCCceEEEEchhcccCCCCcccEEEEEEcCcceeEEeecchhhh-c-cceeeCCc-eEEEeeeeeeeeEEEECCeEEE Q lcl|NC_019522. 230 RTNFPDITFEDDILLKGAGVAGADRMAVYKKEIRIVKGHDVMPLRF-L-APATADNV-NFKVPAILRTGGTEWRIPKAGH 306 (311) Q Consensus 230 ~~n~~~l~i~~~~~l~~ag~~g~~~~v~y~~~~~~~~~~~~~~~~~-~-~p~~~~~~-~~~~~~~~~~gGv~i~~P~ai~ 306 (311) -..+++.. .-+|-+.. +..+. ..|+.. . .-.+-+.+ .|..-|.-++ |+.|+.|-+++ T Consensus 175 v~~g~~~~-----------------~giyPkG~-~~Gl~-~~d~g~~~~~~~~g~~y~~y~~~~~w~~-Gl~i~d~r~vv 234 (328) T protein:vir:95 175 VVWGENTV-----------------HGIFPKGK-KAGIQ-MEDKGQVTLEDANGGKYEGYRTHYKWDN-GLALRDWRYVV 234 (328) T ss_pred EEEcCCeE-----------------EEeccccc-ccCce-eeecCceeeecCCCCeeeEEEEEEEeee-eeEEcCcccEE Confidence 00011000 01111111 01110 112210 0 00011111 3666666676 79999999999 Q ss_pred EeecC Q lcl|NC_019522. 307 YVDGV 311 (311) Q Consensus 307 ~~dGI 311 (311) ++-.| T Consensus 235 rI~NI 239 (328) T protein:vir:95 235 RIANI 239 (328) T ss_pred EEecC Confidence 99999 Done!