Query lcl|NC_011269.1_cdsid_YP_002224124.1 [gene=96] [protein=gp96] [protein_id=YP_002224124.1] [location=48911..49912]
Match_columns 333
No_of_seqs 5 out of 8
Neff 2.5
Searched_HMMs 1612
Date Thu Nov 7 14:47:26 2013
Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_91 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_91_vs_rec_db.hhr

No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 protein:vir:3033 Length: 272 # 98.9 5.2E-11 3.2E-14 77.0 11.9 257 32-333 1-267 (272)
2 protein:vir:9820 Length: 272 # 98.9 5.2E-11 3.2E-14 77.0 11.9 257 32-333 1-267 (272)
3 protein:vir:485 Length: 407 # 98.9 8.6E-11 5.3E-14 75.8 12.8 320 1-333 61-398 (407)
4 protein:vir:9309 Length: 324 # 98.9 1.2E-10 7.7E-14 74.9 13.1 285 22-333 1-313 (324)
5 protein:vir:97148 Length: 324 98.8 3.3E-10 2E-13 72.6 13.4 285 1-333 1-313 (324)
6 protein:vir:96123 Length: 274 98.8 3.1E-10 1.9E-13 72.7 12.7 259 32-333 1-268 (274)
7 protein:vir:3613 Length: 272 # 98.8 5.4E-10 3.4E-13 71.4 13.4 258 32-333 1-270 (272)
8 protein:vir:191 Length: 385 # 98.7 2.8E-10 1.7E-13 72.9 11.7 311 1-333 61-382 (385)
9 protein:vir:1886 Length: 385 # 98.7 2.8E-10 1.7E-13 72.9 11.7 311 1-333 61-382 (385)
10 protein:vir:4953 Length: 397 # 98.7 1E-09 6.3E-13 69.9 13.5 300 1-333 65-383 (397)
11 protein:vir:96833 Length: 275 98.7 8.1E-10 5E-13 70.4 12.3 260 32-333 1-269 (275)
12 protein:vir:3991 Length: 404 # 98.7 1.6E-09 9.7E-13 68.9 13.8 301 1-333 73-391 (404)
13 protein:vir:78830 Length: 324 98.7 2E-09 1.2E-12 68.3 14.3 274 32-333 1-313 (324)
14 protein:vir:96392 Length: 324 98.7 2E-09 1.2E-12 68.3 14.3 274 32-333 1-313 (324)
15 protein:vir:93742 Length: 274 98.6 2.9E-09 1.8E-12 67.4 13.9 259 32-333 1-268 (274)
16 protein:vir:80930 Length: 278 98.6 1.6E-09 9.7E-13 68.8 12.4 263 32-333 1-275 (278)
17 protein:vir:97053 Length: 390 98.6 2.8E-09 1.8E-12 67.5 13.8 309 1-333 69-390 (390)
18 protein:vir:95763 Length: 297 98.6 
2E-09 1.2E-12 68.3 12.9 268 32-333 1-294 (297) 19 protein:vir:100247 Length: 425 98.6 1.6E-09 1E-12 68.8 12.4 319 1-333 64-422 (425) 20 protein:vir:41 Length: 299 # N 98.6 8.3E-10 5.1E-13 70.4 10.7 266 32-333 1-296 (299) 21 protein:vir:10364 Length: 390 98.6 3.3E-09 2E-12 67.1 13.9 309 1-333 68-390 (390) 22 protein:vir:96262 Length: 274 98.6 1.7E-09 1.1E-12 68.7 12.0 260 37-333 1-268 (274) 23 protein:vir:95898 Length: 274 98.6 1.7E-09 1.1E-12 68.7 12.0 260 37-333 1-268 (274) 24 protein:vir:4856 Length: 293 # 98.6 1.9E-09 1.2E-12 68.4 11.9 254 46-333 1-279 (293) 25 protein:vir:4997 Length: 397 # 98.6 5.8E-09 3.6E-12 65.7 14.2 296 1-333 69-383 (397) 26 protein:vir:105334 Length: 276 98.6 5E-09 3.1E-12 66.1 13.7 260 32-333 1-268 (276) 27 protein:vir:99749 Length: 324 98.6 7E-09 4.3E-12 65.3 14.5 284 14-333 1-313 (324) 28 protein:vir:81070 Length: 390 98.6 5.5E-09 3.4E-12 65.9 13.8 310 1-333 68-390 (390) 29 protein:vir:4092 Length: 390 # 98.6 3.6E-09 2.2E-12 66.9 12.7 303 1-333 38-366 (390) 30 protein:vir:9410 Length: 415 # 98.5 1.4E-08 8.6E-12 63.7 15.5 307 1-333 65-402 (415) 31 protein:vir:94142 Length: 304 98.5 2.7E-09 1.7E-12 67.6 11.5 274 32-333 1-303 (304) 32 protein:vir:105905 Length: 304 98.5 2.7E-09 1.7E-12 67.6 11.5 274 32-333 1-303 (304) 33 protein:vir:97433 Length: 274 98.5 3.5E-09 2.2E-12 67.0 12.0 260 32-333 1-268 (274) 34 protein:vir:94494 Length: 274 98.5 3.5E-09 2.2E-12 67.0 12.0 260 32-333 1-268 (274) 35 protein:vir:100135 Length: 418 98.5 1.1E-08 6.7E-12 64.3 14.5 304 1-333 93-413 (418) 36 protein:vir:96223 Length: 324 98.5 1.2E-08 7.6E-12 64.0 14.8 284 14-333 1-313 (324) 37 protein:vir:4339 Length: 395 # 98.5 8E-09 5E-12 65.0 13.8 310 1-333 69-393 (395) 38 protein:vir:79987 Length: 415 98.5 1.5E-08 9.4E-12 63.5 15.1 311 1-333 58-402 (415) 39 protein:vir:98339 Length: 415 98.5 1.5E-08 9.4E-12 63.5 15.1 311 1-333 58-402 (415) 40 protein:vir:81100 Length: 415 98.5 1.5E-08 9.4E-12 63.5 15.1 311 1-333 58-402 (415) 41 protein:vir:4226 Length: 326 # 98.5 
1.4E-09 8.7E-13 69.1 9.4 289 25-333 1-321 (326) 42 protein:vir:1239 Length: 274 # 98.5 7.5E-09 4.6E-12 65.1 12.8 260 32-333 1-268 (274) 43 protein:vir:104085 Length: 320 98.5 8.6E-09 5.4E-12 64.8 12.6 284 31-333 1-315 (320) 44 protein:vir:4600 Length: 415 # 98.5 3.2E-08 2E-11 61.7 15.5 311 1-333 65-402 (415) 45 protein:vir:4700 Length: 415 # 98.5 3.2E-08 2E-11 61.7 15.5 311 1-333 65-402 (415) 46 protein:vir:9759 Length: 303 # 98.5 7.7E-09 4.8E-12 65.1 12.1 273 46-333 1-301 (303) 47 protein:vir:104256 Length: 458 98.5 2E-08 1.2E-11 62.8 14.2 311 1-333 110-456 (458) 48 protein:vir:7771 Length: 330 # 98.4 4.8E-09 3E-12 66.2 10.5 280 37-333 1-321 (330) 49 protein:vir:4511 Length: 409 # 98.4 2.5E-08 1.6E-11 62.3 14.3 317 1-333 67-404 (409) 50 protein:vir:4456 Length: 401 # 98.4 1.2E-08 7.2E-12 64.1 12.4 312 1-333 68-399 (401) 51 protein:vir:103955 Length: 324 98.4 3.1E-08 1.9E-11 61.7 14.4 284 15-333 1-313 (324) 52 protein:vir:4830 Length: 397 # 98.4 3E-08 1.8E-11 61.9 14.2 294 1-333 64-383 (397) 53 protein:vir:100172 Length: 394 98.4 1.7E-08 1.1E-11 63.2 12.8 297 1-333 67-382 (394) 54 protein:vir:94771 Length: 298 98.4 1.6E-08 1E-11 63.3 12.5 267 50-333 1-297 (298) 55 protein:vir:2504 Length: 305 # 98.4 6.1E-09 3.8E-12 65.6 9.9 268 46-333 1-296 (305) 56 protein:vir:2344 Length: 397 # 98.4 2.5E-08 1.5E-11 62.3 13.3 279 32-333 1-304 (397) 57 protein:vir:9704 Length: 394 # 98.4 1.9E-08 1.2E-11 63.0 12.5 297 1-333 76-388 (394) 58 protein:vir:81227 Length: 413 98.4 4.8E-08 3E-11 60.7 14.7 315 1-333 67-408 (413) 59 protein:vir:1638 Length: 298 # 98.4 1.6E-08 9.8E-12 63.4 11.7 266 50-333 1-297 (298) 60 protein:vir:1025 Length: 408 # 98.4 4.3E-08 2.7E-11 60.9 13.9 301 1-333 72-391 (408) 61 protein:vir:7990 Length: 273 # 98.4 3.4E-08 2.1E-11 61.5 13.3 257 46-333 1-271 (273) 62 protein:vir:1328 Length: 392 # 98.3 2.1E-08 1.3E-11 62.7 12.0 307 1-333 65-389 (392) 63 protein:vir:101607 Length: 379 98.3 2.7E-08 1.6E-11 62.1 11.9 304 1-333 39-377 (379) 64 protein:vir:9574 Length: 300 
# 98.3 3.1E-08 1.9E-11 61.7 11.9 270 46-333 1-298 (300) 65 protein:vir:7409 Length: 408 # 98.3 2.7E-08 1.7E-11 62.1 11.5 302 1-333 72-391 (408) 66 protein:vir:2430 Length: 318 # 98.3 3E-08 1.9E-11 61.8 11.5 284 30-333 1-311 (318) 67 protein:vir:5739 Length: 366 # 98.3 7.2E-08 4.4E-11 59.8 13.4 300 1-333 5-364 (366) 68 protein:vir:1268 Length: 397 # 98.3 9.2E-08 5.7E-11 59.2 13.9 298 1-333 68-395 (397) 69 protein:vir:1383 Length: 421 # 98.3 4.4E-08 2.8E-11 60.9 12.1 295 1-333 72-381 (421) 70 protein:vir:100884 Length: 389 98.2 7.3E-08 4.5E-11 59.7 12.3 296 1-333 69-380 (389) 71 protein:vir:78223 Length: 333 98.2 7.6E-08 4.7E-11 59.6 11.9 287 39-333 1-330 (333) 72 protein:vir:9643 Length: 377 # 98.2 9.8E-08 6.1E-11 59.0 12.4 316 1-333 1-375 (377) 73 protein:vir:94673 Length: 419 98.2 1.1E-07 6.7E-11 58.8 12.7 311 1-333 70-415 (419) 74 protein:vir:8187 Length: 311 # 98.2 4.5E-08 2.8E-11 60.9 10.4 272 50-333 1-308 (311) 75 protein:vir:3870 Length: 400 # 98.2 1.2E-07 7.6E-11 58.5 12.5 300 1-333 85-397 (400) 76 protein:vir:95963 Length: 395 98.2 2.3E-07 1.4E-10 57.0 13.9 304 1-333 38-374 (395) 77 protein:vir:102119 Length: 404 98.2 3.7E-07 2.3E-10 55.9 15.0 308 1-333 64-398 (404) 78 protein:vir:99920 Length: 311 98.1 6.4E-08 4E-11 60.0 10.6 271 46-333 1-310 (311) 79 protein:vir:3845 Length: 395 # 98.1 1.2E-07 7.2E-11 58.6 11.3 299 1-333 31-381 (395) 80 protein:vir:6242 Length: 390 # 98.1 1.2E-07 7.4E-11 58.5 11.3 304 1-333 69-387 (390) 81 protein:vir:80684 Length: 315 98.1 9.8E-08 6.1E-11 59.0 10.6 273 46-333 1-304 (315) 82 protein:vir:107593 Length: 392 98.1 4.1E-07 2.6E-10 55.6 13.8 299 1-333 56-382 (392) 83 protein:vir:102082 Length: 392 98.1 4.1E-07 2.6E-10 55.6 13.8 299 1-333 56-382 (392) 84 protein:vir:105004 Length: 392 98.1 4.1E-07 2.6E-10 55.6 13.8 299 1-333 56-382 (392) 85 protein:vir:102873 Length: 392 98.1 4.1E-07 2.6E-10 55.6 13.8 299 1-333 56-382 (392) 86 protein:vir:8102 Length: 543 # 98.1 4.4E-07 2.7E-10 55.4 13.6 311 1-333 173-540 (543) 87 
protein:vir:102605 Length: 273 98.0 4.1E-07 2.6E-10 55.6 12.8 257 46-333 1-271 (273) 88 protein:vir:105822 Length: 273 98.0 4.1E-07 2.6E-10 55.6 12.8 257 46-333 1-271 (273) 89 protein:vir:96762 Length: 632 98.0 4.9E-07 3E-10 55.2 12.7 307 1-333 277-631 (632) 90 protein:vir:94622 Length: 341 98.0 2.1E-06 1.3E-09 51.8 15.8 281 26-333 1-337 (341) 91 protein:vir:96978 Length: 387 98.0 3.6E-07 2.2E-10 55.9 11.4 303 1-333 71-379 (387) 92 protein:vir:94424 Length: 387 98.0 3.6E-07 2.2E-10 55.9 11.4 303 1-333 71-379 (387) 93 protein:vir:2685 Length: 387 # 98.0 3.6E-07 2.2E-10 55.9 11.4 303 1-333 71-379 (387) 94 protein:vir:95376 Length: 425 97.9 1E-06 6.5E-10 53.4 13.8 310 1-333 86-419 (425) 95 protein:vir:105038 Length: 428 97.9 1.9E-06 1.2E-09 51.9 14.8 306 1-333 74-426 (428) 96 protein:vir:101650 Length: 497 97.9 1.7E-06 1.1E-09 52.2 13.9 322 1-333 78-491 (497) 97 protein:vir:7855 Length: 497 # 97.9 1.7E-06 1.1E-09 52.2 13.9 322 1-333 78-491 (497) 98 protein:vir:9361 Length: 402 # 97.9 1.1E-06 6.5E-10 53.4 12.4 303 1-333 82-394 (402) 99 protein:vir:80376 Length: 435 97.9 2.3E-06 1.4E-09 51.5 14.2 309 1-333 65-431 (435) 100 protein:vir:93881 Length: 387 97.8 1.4E-06 8.7E-10 52.7 12.4 302 1-333 71-379 (387) 101 protein:vir:1084 Length: 437 # 97.8 9.1E-07 5.6E-10 53.7 11.3 302 1-333 90-426 (437) 102 protein:vir:108211 Length: 318 97.8 2.3E-06 1.4E-09 51.5 13.0 283 1-333 1-315 (318) 103 protein:vir:78739 Length: 332 97.7 1.4E-06 8.4E-10 52.8 11.4 285 1-333 7-332 (332) 104 protein:vir:81160 Length: 371 97.6 2.6E-06 1.6E-09 51.2 11.3 300 1-333 1-369 (371) 105 protein:vir:1433 Length: 435 # 97.6 8E-06 5E-09 48.5 13.9 305 1-333 65-431 (435) 106 protein:vir:78523 Length: 338 97.5 6.9E-06 4.3E-09 48.9 12.5 285 22-333 1-333 (338) 107 protein:vir:962 Length: 397 # 97.4 1.3E-05 8.1E-09 47.4 12.7 297 1-333 64-396 (397) 108 protein:vir:78350 Length: 383 97.4 2.1E-05 1.3E-08 46.2 13.6 307 1-333 1-373 (383) 109 protein:vir:80180 Length: 381 97.4 1.9E-05 1.2E-08 46.5 13.2 286 22-333 
1-303 (381) 110 protein:vir:93616 Length: 645 97.4 1.4E-05 8.8E-09 47.2 12.3 313 1-333 262-637 (645) 111 protein:vir:98635 Length: 377 97.2 8.1E-06 5.1E-09 48.5 9.3 290 1-333 33-375 (377) 112 protein:vir:78640 Length: 352 97.1 4.2E-05 2.6E-08 44.6 12.9 301 1-333 36-344 (352) 113 protein:vir:100057 Length: 375 97.1 6.2E-05 3.9E-08 43.6 13.5 290 22-333 1-368 (375) 114 protein:vir:95107 Length: 270 97.1 3.9E-05 2.4E-08 44.8 12.3 250 32-333 1-263 (270) 115 protein:vir:8885 Length: 347 # 97.1 0.00013 8.2E-08 41.8 15.0 289 1-333 1-346 (347) 116 protein:vir:94576 Length: 347 97.0 0.00018 1.1E-07 41.2 15.4 278 1-333 1-347 (347) 117 protein:vir:101291 Length: 381 97.0 3.6E-05 2.2E-08 45.0 11.2 299 1-333 1-366 (381) 118 protein:vir:9509 Length: 381 # 97.0 3.6E-05 2.2E-08 45.0 11.2 299 1-333 1-366 (381) 119 protein:vir:739 Length: 231 # 97.0 3E-05 1.8E-08 45.4 10.7 211 84-333 1-229 (231) 120 protein:vir:8420 Length: 477 # 97.0 9.5E-05 5.9E-08 42.6 13.3 319 1-333 81-469 (477) 121 protein:vir:3364 Length: 347 # 97.0 0.00016 1E-07 41.3 14.5 285 1-333 1-343 (347) 122 protein:vir:80128 Length: 466 96.8 0.00013 7.9E-08 41.9 12.9 306 1-333 95-446 (466) 123 protein:vir:94711 Length: 347 96.7 0.00036 2.3E-07 39.4 15.5 283 22-333 1-344 (347) 124 protein:vir:80213 Length: 334 96.6 0.00027 1.7E-07 40.2 13.2 284 32-333 1-330 (334) 125 protein:vir:10450 Length: 344 96.5 0.00035 2.2E-07 39.5 13.6 285 28-333 1-344 (344) 126 protein:vir:100632 Length: 381 96.5 0.00034 2.1E-07 39.6 13.2 302 1-333 1-366 (381) 127 protein:vir:6324 Length: 335 # 96.3 0.00042 2.6E-07 39.1 12.5 287 32-333 1-326 (335) 128 protein:vir:94933 Length: 330 96.1 6E-05 3.8E-08 43.7 6.9 296 1-333 5-327 (330) 129 protein:vir:99075 Length: 392 96.0 0.00094 5.8E-07 37.2 13.4 256 46-333 1-273 (392) 130 protein:vir:3158 Length: 321 # 95.8 0.00093 5.8E-07 37.2 12.4 277 37-333 1-309 (321) 131 protein:vir:1541 Length: 347 # 95.7 0.0015 9.5E-07 36.0 15.9 281 28-333 1-343 (347) 132 protein:vir:6212 Length: 434 # 95.5 0.0019 1.2E-06 
35.5 13.8 313 1-333 73-430 (434) 133 protein:vir:4197 Length: 314 # 95.0 0.001 6.4E-07 37.0 10.0 284 19-333 1-311 (314) 134 protein:vir:99675 Length: 324 94.8 0.0027 1.7E-06 34.6 12.0 244 83-333 1-296 (324) 135 protein:vir:78935 Length: 335 94.1 0.0053 3.3E-06 33.0 13.2 287 32-333 1-326 (335) 136 protein:vir:7019 Length: 401 # 93.5 0.0042 2.6E-06 33.6 10.2 288 22-333 1-331 (401) 137 protein:vir:102655 Length: 322 93.4 0.0057 3.6E-06 32.9 10.8 282 1-333 1-319 (322) 138 protein:vir:97255 Length: 310 91.6 0.0081 5E-06 32.0 9.3 271 32-333 1-308 (310) 139 protein:vir:2201 Length: 345 # 91.0 0.018 1.1E-05 30.2 13.8 286 1-333 1-343 (345) 140 protein:vir:103323 Length: 364 89.7 0.025 1.5E-05 29.4 15.3 290 17-333 1-337 (364) 141 protein:vir:105645 Length: 400 88.9 0.022 1.4E-05 29.6 9.3 287 22-333 1-331 (400) 142 protein:vir:3136 Length: 322 # 87.6 0.037 2.3E-05 28.4 12.4 276 32-333 1-316 (322) 143 protein:vir:97031 Length: 402 81.3 0.087 5.4E-05 26.4 13.0 288 17-333 1-333 (402) 144 protein:vir:5974 Length: 324 # 77.4 0.12 7.8E-05 25.5 14.1 265 32-333 1-312 (324) 145 protein:vir:97397 Length: 517 56.2 0.46 0.00029 22.4 9.8 304 1-333 168-512 (517) 146 protein:vir:3525 Length: 423 # 47.2 0.71 0.00044 21.4 11.9 249 46-333 1-267 (423) 147 protein:vir:108303 Length: 418 44.7 0.8 0.0005 21.1 14.3 250 43-333 1-280 (418) 148 protein:vir:8843 Length: 317 # 37.0 0.67 0.00041 21.5 4.6 263 64-333 1-313 (317) 149 protein:vir:103841 Length: 155 31.2 1.5 0.00094 19.6 6.2 127 32-165 1-155 (155) 150 protein:vir:1781 Length: 221 # 28.1 1.8 0.0011 19.2 9.6 178 119-333 1-193 (221) 151 protein:vir:80068 Length: 301 25.6 2 0.0013 18.9 14.5 275 49-333 1-301 (301) 152 protein:vir:4159 Length: 315 # 24.7 2.1 0.0013 18.8 10.3 286 1-333 1-315 (315) 153 protein:vir:79928 Length: 393 21.0 2.7 0.0017 18.2 11.9 315 1-333 1-376 (393) 154 protein:vir:9875 Length: 296 # 20.3 2.8 0.0017 18.1 11.9 249 32-333 1-294 (296) No 1 >protein:vir:3033 Length: 272 # NCBI annotation: major capsid protein # Family: 
family:all:522 # MgeID: mge:61 # MgeName: PhiNIH1.1 # Cross-refs: genbank:acc:NP_438146;genbank:gi:16271809;genbank:GeneID:929235 Probab=98.88 E-value=5.2e-11 Score=76.97 Aligned_cols=257 Identities=17% Similarity=0.165 Sum_probs=163.1 Q ss_pred hcchhcchHHHHHHHHHHhcCchhHHHHHHHHHHHHHHHHHhhhhhhhhhhhccccC--CCcceeecCCCCccceEEEEc Q lcl|NC_011269. 32 MGGRKLSAREKQAKLAHILSDKVGGIQRLGQSMIGPIQLQLRYQGILRNVLLEDTLT--PGVPIQYDVLDDLGQAYMLHG 109 (333) Q Consensus 32 ~~~~~ls~ee~~~Lm~~Al~~~Eg~~~aLg~~mA~pI~~q~~rqGi~RklL~~~TL~--~G~~p~y~v~~~v~~a~~~~~ 109 (333) |-.-. |+ .++ +=-||- +++.+..++...+...++...- .+++ +|....+|+.+..+.+. |.+ T Consensus 1 MA~~~----T~---~~~-~~iPev----~s~~v~~~~~~~~~~~~~~~~~---~~~~g~~G~tv~iP~~~~~~~a~-~v~ 64 (272) T protein:vir:30 1 MAVGT----TK---MAQ-MLDPEV----LADMIDAEVGKAIRFAPLAEVD---TTLEGQPGTTLTVPKWDYIGDAE-DVA 64 (272) T ss_pred CCCcc----cc---chh-eechHH----HHHHHHHHHHHHhhhhcccccc---ccccCCCCCEEEEEEecCCCCcc-ccc Confidence 22100 10 111 123333 2444555555555444433321 1222 35544455433333333 566 Q ss_pred CCCcccceeecCceeeccceeeeccccccHHHhhhhcchhHHHHHHHHHHHHHHHhhhHHHHHHhhhhhhhhhhcccccc Q lcl|NC_011269. 110 NEGEIRITPFEGKRIEVQLFRIASFPQIKKEDLYYLRSNIVEYTQDMTKQAIMRQEDSRLVTLLEAAAVSYRVVDSSAQP 189 (333) Q Consensus 110 ~~G~i~~Q~i~~~ri~~P~f~Ivs~P~V~~~dl~~~~~~vle~~q~~A~qaIM~qED~~~~slle~~a~~~r~~~ssA~p 189 (333) .-+++..+.+.-+.+++...++...-.+.-++..+...|++.+..+++..++-+..|..+++.+.++. T Consensus 65 eg~~i~~~~~~~~~~~~~~~~~~~~~~itd~~~~~s~~d~~~~~~~~~~~~~a~~~d~~i~~~~~~a~------------ 132 (272) T protein:vir:30 65 EGEAIPMTQLGFKKTTMTIKKAGKGVEITDEAILSGYGDPVGQAAKQIVEAIDHKVDADVLDALSKST------------ 132 (272) T ss_pred CCCcccccccccceEEEEeeeeeeeeeecHHHHhhccccHHHHHHHHHHHHHHHHHHHHHHHHhcccc------------ Confidence 66678888887778777766766666677777788889999999999999999999999998875543 Q ss_pred cccCCCcceEEeeccccHHHHHHHHHHHHhhCCccceEEechhhhhhhhhcCCCchh--hhHHhhhhhcceeeeeec--- Q lcl|NC_011269. 
190 GVGALPNEITIAGSHLMPDDLYTAVTYTDQRQLDSSRLLANPQEYRDLYRWDINTTG--WAFKDSVVAGERIVQFGE--- 264 (333) Q Consensus 190 ~vg~~~N~i~i~~g~Lt~~~L~~a~t~v~~~~L~at~il~~~~~~~Di~gw~~N~~~--~~~~DpV~~~e~il~~G~--- 264 (333) |. +.+..+-+++..|.+...+-+.+...++|||..|.+|+-=...+|. +...+. ++..|. T Consensus 133 ------~~---~~~~~t~d~i~da~~~l~~~~~~~~~~vv~p~~~~~L~k~~~~~~~~~~~~~~~------~~~~g~ig~ 197 (272) T protein:vir:30 133 ------QT---VEATATVDGVSKALDIFNDEDDAETVIVMNPADASTLRLDAAKEWLGATEVGAN------RVVSGVYGE 197 (272) T ss_pred ------cc---cccccCHHHHHHHHHHHhccCCCccEEEEcHHHHHHHHHhcccccccccccccc------ccccccchh Confidence 11 1244578899999999999999999999999999999751111110 011111 222333 Q ss_pred ---ccccceeeecCCeEEEeeChhhhcccccccCceeccccchhhhccceehhhhhhhhhhccceEEEEecC Q lcl|NC_011269. 265 ---FQIGKSIIIPRGTVYLTPEPEFLGVFPVMYSLDVEEDNKVERFNKGWVMDELVGMAILNPRGIVILRKA 333 (333) Q Consensus 265 ---fgi~~skvlprgeiyvvadpE~~G~~pvR~~L~s~p~D~~er~~kGWvm~E~~g~~i~N~~siv~~~~~ 333 (333) +.+..+..+|-|++|++. +..+|-+- +.++.+|....+.++..-=.....+|+.+.||.+||.+..+ T Consensus 198 i~G~~Vi~s~~~p~~t~~~~~-~~a~~~~~-~~~~~ve~~r~~~~~~~~i~~~~~~~~~v~~~~~vv~~t~~ 267 (272) T protein:vir:30 198 VLGVQIVRSRKCPKGTAYMVR-KGALRIML-KRNTMVETDRDITKAINQIVANKHYGVYLYKAEKAVKITLK 267 (272) T ss_pred hcCeeEEEcCCCCcceEEEEc-CCeEEEEe-cCCceeeeccccccceeEEEEEEEEEEEEEcCCceEEEEec Confidence 346789999999999864 44566554 67777765444444333333356789999999999999877 No 2 >protein:vir:9820 Length: 272 # NCBI annotation: putative major capsid/head protein # Family: family:all:522 # MgeID: mge:176 # MgeName: 315.4 # Cross-refs: genbank:acc:NP_795582;genbank:gi:28876339;genbank:GeneID:1257858 Probab=98.88 E-value=5.2e-11 Score=76.97 Aligned_cols=257 Identities=17% Similarity=0.165 Sum_probs=163.1 Q ss_pred hcchhcchHHHHHHHHHHhcCchhHHHHHHHHHHHHHHHHHhhhhhhhhhhhccccC--CCcceeecCCCCccceEEEEc Q lcl|NC_011269. 
32 MGGRKLSAREKQAKLAHILSDKVGGIQRLGQSMIGPIQLQLRYQGILRNVLLEDTLT--PGVPIQYDVLDDLGQAYMLHG 109 (333) Q Consensus 32 ~~~~~ls~ee~~~Lm~~Al~~~Eg~~~aLg~~mA~pI~~q~~rqGi~RklL~~~TL~--~G~~p~y~v~~~v~~a~~~~~ 109 (333) |-.-. |+ .++ +=-||- +++.+..++...+...++...- .+++ +|....+|+.+..+.+. |.+ T Consensus 1 MA~~~----T~---~~~-~~iPev----~s~~v~~~~~~~~~~~~~~~~~---~~~~g~~G~tv~iP~~~~~~~a~-~v~ 64 (272) T protein:vir:98 1 MAVGT----TK---MAQ-MLDPEV----LADMIDAEVGKAIRFAPLAEVD---TTLEGQPGTTLTVPKWDYIGDAE-DVA 64 (272) T ss_pred CCCcc----cc---chh-eechHH----HHHHHHHHHHHHhhhhcccccc---ccccCCCCCEEEEEEecCCCCcc-ccc Confidence 22100 10 111 123333 2444555555555444433321 1222 35544455433333333 566 Q ss_pred CCCcccceeecCceeeccceeeeccccccHHHhhhhcchhHHHHHHHHHHHHHHHhhhHHHHHHhhhhhhhhhhcccccc Q lcl|NC_011269. 110 NEGEIRITPFEGKRIEVQLFRIASFPQIKKEDLYYLRSNIVEYTQDMTKQAIMRQEDSRLVTLLEAAAVSYRVVDSSAQP 189 (333) Q Consensus 110 ~~G~i~~Q~i~~~ri~~P~f~Ivs~P~V~~~dl~~~~~~vle~~q~~A~qaIM~qED~~~~slle~~a~~~r~~~ssA~p 189 (333) .-+++..+.+.-+.+++...++...-.+.-++..+...|++.+..+++..++-+..|..+++.+.++. T Consensus 65 eg~~i~~~~~~~~~~~~~~~~~~~~~~itd~~~~~s~~d~~~~~~~~~~~~~a~~~d~~i~~~~~~a~------------ 132 (272) T protein:vir:98 65 EGEAIPMTQLGFKKTTMTIKKAGKGVEITDEAILSGYGDPVGQAAKQIVEAIDHKVDADVLDALSKST------------ 132 (272) T ss_pred CCCcccccccccceEEEEeeeeeeeeeecHHHHhhccccHHHHHHHHHHHHHHHHHHHHHHHHhcccc------------ Confidence 66678888887778777766766666677777788889999999999999999999999998875543 Q ss_pred cccCCCcceEEeeccccHHHHHHHHHHHHhhCCccceEEechhhhhhhhhcCCCchh--hhHHhhhhhcceeeeeec--- Q lcl|NC_011269. 190 GVGALPNEITIAGSHLMPDDLYTAVTYTDQRQLDSSRLLANPQEYRDLYRWDINTTG--WAFKDSVVAGERIVQFGE--- 264 (333) Q Consensus 190 ~vg~~~N~i~i~~g~Lt~~~L~~a~t~v~~~~L~at~il~~~~~~~Di~gw~~N~~~--~~~~DpV~~~e~il~~G~--- 264 (333) |. +.+..+-+++..|.+...+-+.+...++|||..|.+|+-=...+|. +...+. ++..|. 
T Consensus 133 ------~~---~~~~~t~d~i~da~~~l~~~~~~~~~~vv~p~~~~~L~k~~~~~~~~~~~~~~~------~~~~g~ig~ 197 (272) T protein:vir:98 133 ------QT---VEATATVDGVSKALDIFNDEDDAETVIVMNPADASTLRLDAAKEWLGATEVGAN------RVVSGVYGE 197 (272) T ss_pred ------cc---cccccCHHHHHHHHHHHhccCCCccEEEEcHHHHHHHHHhcccccccccccccc------ccccccchh Confidence 11 1244578899999999999999999999999999999751111110 011111 222333 Q ss_pred ---ccccceeeecCCeEEEeeChhhhcccccccCceeccccchhhhccceehhhhhhhhhhccceEEEEecC Q lcl|NC_011269. 265 ---FQIGKSIIIPRGTVYLTPEPEFLGVFPVMYSLDVEEDNKVERFNKGWVMDELVGMAILNPRGIVILRKA 333 (333) Q Consensus 265 ---fgi~~skvlprgeiyvvadpE~~G~~pvR~~L~s~p~D~~er~~kGWvm~E~~g~~i~N~~siv~~~~~ 333 (333) +.+..+..+|-|++|++. +..+|-+- +.++.+|....+.++..-=.....+|+.+.||.+||.+..+ T Consensus 198 i~G~~Vi~s~~~p~~t~~~~~-~~a~~~~~-~~~~~ve~~r~~~~~~~~i~~~~~~~~~v~~~~~vv~~t~~ 267 (272) T protein:vir:98 198 VLGVQIVRSRKCPKGTAYMVR-KGALRIML-KRNTMVETDRDITKAINQIVANKHYGVYLYKAEKAVKITLK 267 (272) T ss_pred hcCeeEEEcCCCCcceEEEEc-CCeEEEEe-cCCceeeeccccccceeEEEEEEEEEEEEEcCCceEEEEec Confidence 346789999999999864 44566554 67777765444444333333356789999999999999877 No 3 >protein:vir:485 Length: 407 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:11 # MgeName: P27 # Cross-refs: genbank:acc:NP_543092;swissprot:trembl:q8w627;genbank:gi:18249904;uniprot:Q8W627;genbank:GeneID:929693 Probab=98.87 E-value=8.6e-11 Score=75.77 Aligned_cols=320 Identities=13% Similarity=0.074 Sum_probs=189.2 Q ss_pred Ccccchhhh-hhhhhhcccchHHHHHHHHHHhhcchhcchHHHHHHHHHHhcCchhHHHHHHHHHHHHHHHHHhhhhhhh Q lcl|NC_011269. 1 MTLPVAVGS-GLGRFAKASDDYVADIVEAKQRMGGRKLSAREKQAKLAHILSDKVGGIQRLGQSMIGPIQLQLRYQGILR 79 (333) Q Consensus 1 ~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~ls~ee~~~Lm~~Al~~~Eg~~~aLg~~mA~pI~~q~~rqGi~R 79 (333) ......... +-+.-.+..++|...+-+.-.+.-+..|+..|..++-. -.+..|+. 
.+-+.+..-|.+.++.....| T Consensus 61 ~~~~~~~~~~~~~~~~~~~~e~~~a~~~~l~~g~~~~~~~~e~~a~~~--~t~~~gG~-~iP~~~~~~I~~~~~~~~~l~ 137 (407) T protein:vir:48 61 EAELAEVKRPAGGTQNKVASEHKEAFIGFMRKGREDGLRELERKALQV--GNDEDGGY-AIPEELDRTILTLLKDEVVMR 137 (407) T ss_pred HHHHHHhhccccccccchhhHHHHHHHHHHhccchhhhhHHHHHhhhc--ccCCCCcc-cccHhHHHHHHHHHHhhhhhh Confidence 000000000 01111223333433332222222233444444433321 13344552 566778888999999999999 Q ss_pred hhhhccccCCCcceeecCCCCccceEEEEcCCCcccceee-cCceeeccceeeeccccccHHHhhhhcchhHHHHHHHHH Q lcl|NC_011269. 80 NVLLEDTLTPGVPIQYDVLDDLGQAYMLHGNEGEIRITPF-EGKRIEVQLFRIASFPQIKKEDLYYLRSNIVEYTQDMTK 158 (333) Q Consensus 80 klL~~~TL~~G~~p~y~v~~~v~~a~~~~~~~G~i~~Q~i-~~~ri~~P~f~Ivs~P~V~~~dl~~~~~~vle~~q~~A~ 158 (333) ++.+..++..|.. .|++...-.+ +-|++-.+.++.+.. .=+.|++.-.++.++..|..+-|.....|+..+..++.. T Consensus 138 ~~~~~~~~~~~~~-~~~~~~~~~~-a~~v~E~~~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~ 215 (407) T protein:vir:48 138 QEATVITLGGSDY-KKLVNLGGTT-SGWVGETDARPETATSKLGLIEPFMGEIYGNPQATQKMLDDAFFNVEDWINSELA 215 (407) T ss_pred hhceeeecCCCce-EEEEecCCcc-eeeecccccccccccccceeEEeeeeeeEeehhhHHHHHhcchHHHHHHHHHHHH Confidence 9988888776643 4444333333 346666565665543 336788888999999999999999999999999999999 Q ss_pred HHHHHHhhhHHHH---------HHhhhhhhhhhhcccccccccCCCcceEEeeccccHHHHHHHHHHHHhhCCccceEEe Q lcl|NC_011269. 159 QAIMRQEDSRLVT---------LLEAAAVSYRVVDSSAQPGVGALPNEITIAGSHLMPDDLYTAVTYTDQRQLDSSRLLA 229 (333) Q Consensus 159 qaIM~qED~~~~s---------lle~~a~~~r~~~ssA~p~vg~~~N~i~i~~g~Lt~~~L~~a~t~v~~~~L~at~il~ 229 (333) ++|...+|.-+++ +|...++. 
......+.|...+..+...+.++-++|..++..+..-.......+| T Consensus 216 ~~i~~~~~~a~l~G~G~~~p~Gil~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~d~i~~l~~~l~~~~~~~a~~v~ 291 (407) T protein:vir:48 216 LEFAEQEEIAFTSGDGSKKPKGFLAYESTD----EDDKTRAFGKLQHIASGAASGVTADAIIKLIYTLRKAHRSGAKFMM 291 (407) T ss_pred HHHHHHHHhhhhccCCCCccceeeeccccc----ccccccccccccccccccccccChHHHHHHHHhhchhhhcCCEEEE Confidence 9999999965442 11111100 0011111122222334556778889998888877766666678899 Q ss_pred chhhhhhhhhcCCCchh-hhHHhhhhhcceeeeeeccccc--ceeeecC----CeEEEeeChhhhcccccccCceecccc Q lcl|NC_011269. 230 NPQEYRDLYRWDINTTG-WAFKDSVVAGERIVQFGEFQIG--KSIIIPR----GTVYLTPEPEFLGVFPVMYSLDVEEDN 302 (333) Q Consensus 230 ~~~~~~Di~gw~~N~~~-~~~~DpV~~~e~il~~G~fgi~--~skvlpr----geiyvvadpE~~G~~pvR~~L~s~p~D 302 (333) |+.-|.-|..=- +..| |.+..++..+.- .-++|.+ .+--||- +.++++.|-...=.+..|.|+++.-++ T Consensus 292 n~~~~~~L~~lk-D~~Gr~l~~~~~~~g~~---~~l~G~PV~~~~~~p~~~~~~~~i~~Gd~~~~~~i~~~~~~~i~~d~ 367 (407) T protein:vir:48 292 NNSSLFAIRLLK-DNDGNYLWRPGIELGQP---SSLAGYGIVENEQMPDIAADAKAIAFGNFKRGYTIVDRIGTRILRDP 367 (407) T ss_pred cHHHHHHHHHhh-ccCCceeeccCcCCCCC---ceecceeeEEecCcCCccCCccEEEEEeccccEEEEEeeceEEEeec Confidence 999998876611 1111 122122221110 0134433 2222443 233444554322245678999998877 Q ss_pred chhhhccceehhhhhhhhhhccceEEEEecC Q lcl|NC_011269. 
303 KVERFNKGWVMDELVGMAILNPRGIVILRKA 333 (333) Q Consensus 303 ~~er~~kGWvm~E~~g~~i~N~~siv~~~~~ 333 (333) +.+.-..+|..++-++..+.||.+++.|..+ T Consensus 368 ~~~~~~~~~~~~~r~d~~v~~~~a~~~l~~~ 398 (407) T protein:vir:48 368 YTNKPFVGFYTTKRTGGMLVDSQAIKLMKIG 398 (407) T ss_pred cccCCcEEEEEEEEeccEEecccceEEEEee Confidence 8777778899999999999999999999887 No 4 >protein:vir:9309 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:165 # MgeName: phi 11 # Cross-refs: genbank:acc:NP_803287;genbank:gi:29028597;genbank:GeneID:1258044 Probab=98.86 E-value=1.2e-10 Score=74.89 Aligned_cols=285 Identities=13% Similarity=0.119 Sum_probs=187.3 Q ss_pred HHHHHHHHHhhcchhcchHHHHHH-HHHHhcCchhHHHHHHHHHHHHHHHHHhhhhhhhhhhhccccCCCcceeecCCCC Q lcl|NC_011269. 22 VADIVEAKQRMGGRKLSAREKQAK-LAHILSDKVGGIQRLGQSMIGPIQLQLRYQGILRNVLLEDTLTPGVPIQYDVLDD 100 (333) Q Consensus 22 ~~~~~~~~~~~~~~~ls~ee~~~L-m~~Al~~~Eg~~~aLg~~mA~pI~~q~~rqGi~RklL~~~TL~~G~~p~y~v~~~ 100 (333) .....+.|.-.+.-....++.+.+ ....+.+..+. -.+-..+++.|.+.+..+..++++.+..+++.|. ..||+... T Consensus 1 ~~~~~~~~~~~~~f~~~~~~~~~~~a~~~~~~~~~~-~liP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~-~~ip~~~~ 78 (324) T protein:vir:93 1 MEQTQKLKLNLQHFASNNVKPQVFNPDNVMMHEKKD-GTLLNDFTTPILQEVMENSKIMQLGKYEPMEGTE-KKFTFWAD 78 (324) T ss_pred CchhHHHHHHHHHHHHhhhhhhhcccccccccCCCc-ceechhHHHHHHHHHHhhchhhhhcceeeccCCc-eEEEEEec Confidence 111111111111111111111111 01112122222 1466778999999999999999999988887664 35665444 Q ss_pred ccceEEEEcCCCcccceeecCceeeccceeeeccccccHHHhhhhcchhHHHHHHHHHHHHHHHhhhHHHHHHhhhhhhh Q lcl|NC_011269. 101 LGQAYMLHGNEGEIRITPFEGKRIEVQLFRIASFPQIKKEDLYYLRSNIVEYTQDMTKQAIMRQEDSRLVTLLEAAAVSY 180 (333) Q Consensus 101 v~~a~~~~~~~G~i~~Q~i~~~ri~~P~f~Ivs~P~V~~~dl~~~~~~vle~~q~~A~qaIM~qED~~~~slle~~a~~~ 180 (333) ...+ -|++..+++++....=+.+++.-.++.....|..+-|.....|+..+.+++..++|.+.+|..+++ +.. 
T Consensus 79 ~~~a-~~v~Eg~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~aia~~~d~a~l~---G~g--- 151 (324) T protein:vir:93 79 KPGA-YWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGIL---NQG--- 151 (324) T ss_pred Ccce-eeecCCccccccccceeEEEEEeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhc---CCC--- Confidence 4444 467888888888877788999999999999999999999999999999999999999999975542 222 Q ss_pred hhhcccccccccCCCc---ceEEeeccccHHHHHHHHHHHHhhCCccceEEechhhhhhhhhcCCCchhhhHHhhhhhcc Q lcl|NC_011269. 181 RVVDSSAQPGVGALPN---EITIAGSHLMPDDLYTAVTYTDQRQLDSSRLLANPQEYRDLYRWDINTTGWAFKDSVVAGE 257 (333) Q Consensus 181 r~~~ssA~p~vg~~~N---~i~i~~g~Lt~~~L~~a~t~v~~~~L~at~il~~~~~~~Di~gw~~N~~~~~~~DpV~~~e 257 (333) .+. .+.|.+.. ..+...+.++-+++..+...++.-+.....++||++.|.-|+. + +|.- +. T Consensus 152 ----~~~-~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~l~~~~~~~~~~v~n~~~~~~L~~--l-------~d~~--G~ 215 (324) T protein:vir:93 152 ----NNP-FGKSIAQSIEKTNKVIKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRK--I-------VDPE--TK 215 (324) T ss_pred ----CCC-cCccccccccccceeccccccHHHHHHHHHhhhhccCCCCEEEEcHHHHHHHHH--h-------hCCC--CC Confidence 111 11222211 1244567889999999999999999999999999999998875 2 2221 11 Q ss_pred eeeee----ecccccc----eeeecCCeEEEeeChhhhcccccccCceeccccchh----------------hhccceeh Q lcl|NC_011269. 258 RIVQF----GEFQIGK----SIIIPRGTVYLTPEPEFLGVFPVMYSLDVEEDNKVE----------------RFNKGWVM 313 (333) Q Consensus 258 ~il~~----G~fgi~~----skvlprgeiyvvadpE~~G~~pvR~~L~s~p~D~~e----------------r~~kGWvm 313 (333) -+++. .+.|++. +.-.+.|++| +.|+... .+-.|+++.++-.+... +--..+.. T Consensus 216 ~~~~~~~~~~l~G~PVv~~~~~~~~~~~i~-~gdfs~~-~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~n~~~~r~ 293 (324) T protein:vir:93 216 ERIYDRNSDSLDGLPVVNLKSSNLKRGELI-TGDFDKL-IYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRA 293 (324) T ss_pred eeecCCCCCcccceeeEeecCCCCCcceEE-EEecceE-EEEEecCcEEEEeecccccccccccccchhhhhcCcEEEEE Confidence 12221 2355542 3346666665 4566643 57788898887666432 22456677 Q ss_pred hhhhhhhhhccceEEEEecC Q lcl|NC_011269. 
314 DELVGMAILNPRGIVILRKA 333 (333) Q Consensus 314 ~E~~g~~i~N~~siv~~~~~ 333 (333) .+-++.++.||.++|.|..| T Consensus 294 ~~r~d~~v~~~~a~~~l~~a 313 (324) T protein:vir:93 294 TMHVALHIADDKAFAKLVPA 313 (324) T ss_pred EEEeccEEecccceEEEecc Confidence 78899999999999999988 No 5 >protein:vir:97148 Length: 324 # NCBI annotation: ORF010 # Family: family:all:507 # MgeID: mge:1654 # MgeName: 85 # Cross-refs: genbank:acc:YP_239726;genbank:gi:66394880;genbank:GeneID:5130881 Probab=98.79 E-value=3.3e-10 Score=72.60 Aligned_cols=285 Identities=13% Similarity=0.117 Sum_probs=187.0 Q ss_pred CcccchhhhhhhhhhcccchHHHHHHHHHHhhcchhcchHHHHHH-HHHHhcCchhHHHHHHHHHHHHHHHHHhhhhhhh Q lcl|NC_011269. 1 MTLPVAVGSGLGRFAKASDDYVADIVEAKQRMGGRKLSAREKQAK-LAHILSDKVGGIQRLGQSMIGPIQLQLRYQGILR 79 (333) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ls~ee~~~L-m~~Al~~~Eg~~~aLg~~mA~pI~~q~~rqGi~R 79 (333) |-=+--.-.=+|+|+ ...++.+.+ ......+..|.. .+-+.+.+.|.+.++.+...+ T Consensus 1 ~~~~~~~~~~~~~f~---------------------~~~~~~~~~~a~~~~~~~~~~~-~iP~~~~~~ii~~~~~~s~l~ 58 (324) T protein:vir:97 1 MEQTQKLKLNLQHFA---------------------SNNVKPQVFNPDNVMMHEKKDG-TLMNEFTTPILQEVMENSKIM 58 (324) T ss_pred CccchhHHHHHHHHH---------------------HhhhhhhhhccccccccCCCcc-eechhHHHHHHHHHHhhcchh Confidence 100000000001110 000111110 001112222332 567888999999999999999 Q ss_pred hhhhccccCCCcceeecCCCCccceEEEEcCCCcccceeecCceeeccceeeeccccccHHHhhhhcchhHHHHHHHHHH Q lcl|NC_011269. 80 NVLLEDTLTPGVPIQYDVLDDLGQAYMLHGNEGEIRITPFEGKRIEVQLFRIASFPQIKKEDLYYLRSNIVEYTQDMTKQ 159 (333) Q Consensus 80 klL~~~TL~~G~~p~y~v~~~v~~a~~~~~~~G~i~~Q~i~~~ri~~P~f~Ivs~P~V~~~dl~~~~~~vle~~q~~A~q 159 (333) ++.+..+++.|. -.||+...... 
+-|++-.+.++.....=+.+++.-.++.....|..+-|+....++..+..+...+ T Consensus 59 ~~~~~~~~~~~~-~~ip~~~~~~~-a~~v~Eg~~~~~~~~~f~~v~~~~~k~~~~~~is~ell~ds~~~l~~~i~~~l~~ 136 (324) T protein:vir:97 59 QLGKYEPMEGTE-KKFTFWADKPG-AYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAE 136 (324) T ss_pred hhcceeeccCCc-eEEEEEecCcc-eeEeccCccccccccceeEEEEeeEEEEEeehhhHHHHhcchHHHHHHHHHHHHH Confidence 999998887664 35565444333 4578877888888877788888889999999999999999999999999999999 Q ss_pred HHHHHhhhHHHHHHhhhhhhhhhhcccccccccCC---CcceEEeeccccHHHHHHHHHHHHhhCCccceEEechhhhhh Q lcl|NC_011269. 160 AIMRQEDSRLVTLLEAAAVSYRVVDSSAQPGVGAL---PNEITIAGSHLMPDDLYTAVTYTDQRQLDSSRLLANPQEYRD 236 (333) Q Consensus 160 aIM~qED~~~~slle~~a~~~r~~~ssA~p~vg~~---~N~i~i~~g~Lt~~~L~~a~t~v~~~~L~at~il~~~~~~~D 236 (333) +|-+.+|..+++ +.. .+..| .|.. .=.-....+.++.++|.++...+..-++....++||+..|.. T Consensus 137 aia~~~d~a~l~---G~g-------~~~~~-~gi~~~~~~~~~~~~~~~~~~~i~~~~~~l~~~~~~~~~~v~n~~~~~~ 205 (324) T protein:vir:97 137 AFYKKFDEAGIL---NQG-------NNPFG-KSIAQSIEKTNKVIKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSL 205 (324) T ss_pred HHHHHHHHHhhc---cCC-------CCccC-ccccccccccceeccccCCHHHHHHHHHhhhhccCCCCEEEEcHHHHHH Confidence 999999976553 111 01011 1111 001245668899999999999999999999999999999998 Q ss_pred hhhcCCCchhhhHHhhhhhcceeeeee----ccccc----ceeeecCCeEEEeeChhhhcccccccCceeccccch---- Q lcl|NC_011269. 237 LYRWDINTTGWAFKDSVVAGERIVQFG----EFQIG----KSIIIPRGTVYLTPEPEFLGVFPVMYSLDVEEDNKV---- 304 (333) Q Consensus 237 i~gw~~N~~~~~~~DpV~~~e~il~~G----~fgi~----~skvlprgeiyvvadpE~~G~~pvR~~L~s~p~D~~---- 304 (333) |+. .+|+-.++- ++.+ ++|.+ .+.-++.|++|+ .|.... .+-.|+++.++-.|.. 
T Consensus 206 L~~---------lkd~~g~~~--~~~~~~~tl~G~PV~~~~~~~~~~~~~~~-gd~~~~-~i~~~~~~~i~~~~~~~~~~ 272 (324) T protein:vir:97 206 LRK---------IVDPETKER--IYDRNSDTLDGLPVVNLKSSNLKRGELIT-GDFDKL-IYGIPQLIEYKIDETAQLST 272 (324) T ss_pred HHH---------hhcCCCcee--ecCCCCccccceeeEeecCCCCCcceEEE-EecccE-EEEEecCcEEEEeecccccc Confidence 875 345443332 2222 24443 122355666654 566643 5778899888766542 Q ss_pred ------------hhhccceehhhhhhhhhhccceEEEEecC Q lcl|NC_011269. 305 ------------ERFNKGWVMDELVGMAILNPRGIVILRKA 333 (333) Q Consensus 305 ------------er~~kGWvm~E~~g~~i~N~~siv~~~~~ 333 (333) ++-...+...+-++.++.||.+++.+..+ T Consensus 273 ~~~~~~~~~~~f~~d~~~~r~~~r~d~~v~~~~a~~~l~~~ 313 (324) T protein:vir:97 273 VKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPA 313 (324) T ss_pred cccccccchhhhhcCcEEEEEEEEeccEEecccceEEEEec Confidence 33356777888899999999999999988 No 6 >protein:vir:96123 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1602 # MgeName: 37 # Cross-refs: genbank:acc:YP_240078;genbank:gi:66395742;genbank:GeneID:5133103 Probab=98.77 E-value=3.1e-10 Score=72.74 Aligned_cols=259 Identities=18% Similarity=0.221 Sum_probs=161.0 Q ss_pred hcchhcchHHHHHHHHHHhcCchhHHHHHHHHHHHHHHHHHhhhhhhhhhhhcc-ccC--CCcceeecCCCCccceEEEE Q lcl|NC_011269. 32 MGGRKLSAREKQAKLAHILSDKVGGIQRLGQSMIGPIQLQLRYQGILRNVLLED-TLT--PGVPIQYDVLDDLGQAYMLH 108 (333) Q Consensus 32 ~~~~~ls~ee~~~Lm~~Al~~~Eg~~~aLg~~mA~pI~~q~~rqGi~RklL~~~-TL~--~G~~p~y~v~~~v~~a~~~~ 108 (333) |. +++ + .+++.+ .||- +++.+-+++...+...++ .+.+ +++ +|....+|+.+..+.+- .. 
T Consensus 1 ma-----~~~-T-~~~d~i-~Pev----~s~~v~~~~~~~~~~~~~----~~~~~~l~g~~G~tv~ip~~~~~g~~~-~~ 63 (274) T protein:vir:96 1 MA-----QGT-T-KVSNLI-VPEV----LAPMMQAELDKKLRFAQF----ADIDSTLVGQPGDTLTFPAFTYSGDAQ-VI 63 (274) T ss_pred CC-----ccc-c-chhhhh-hhHH----HHHHHHHHHHhhhhhccc----ccccccccCCCCCEEEEEeeccCCCcc-cc Confidence 22 211 0 112222 2332 334444444444444443 3333 233 36666666643322222 23 Q ss_pred cCCCcccceeecCceeeccceeeeccccccHHHhhhhcchhHHHHHHHHHHHHHHHhhhHHHHHHhhhhhhhhhhccccc Q lcl|NC_011269. 109 GNEGEIRITPFEGKRIEVQLFRIASFPQIKKEDLYYLRSNIVEYTQDMTKQAIMRQEDSRLVTLLEAAAVSYRVVDSSAQ 188 (333) Q Consensus 109 ~~~G~i~~Q~i~~~ri~~P~f~Ivs~P~V~~~dl~~~~~~vle~~q~~A~qaIM~qED~~~~slle~~a~~~r~~~ssA~ 188 (333) .+.+++..+.+.-...++.--++-..-.+.-++..+..+|.+.++-+++..++.++.|..+++.+.++.. T Consensus 64 ~~g~~i~~~~it~~~~~~~i~~~~~~~~i~D~~~~~~~~d~~~~~~~~~~~~~a~~~d~~i~~~l~~a~~---------- 133 (274) T protein:vir:96 64 AEGEKIPVDQIGTSKREAKVRKIGKGTELTDEAVLSGFGDPQGEAVRQHGLAIANKVDNDVLEALKGATL---------- 133 (274) T ss_pred CCCCcCchhhcccceeEEEEEeeeceeeecHHHHHhhcchHHHHHHHHHHHHHHHHHHHHHHHHHhcCCC---------- Confidence 4445677766665555444333322234555555677789999999999999999999999999866432 Q ss_pred ccccCCCcceEEeeccccHHHHHHHHHHHHhhCCccceEEechhhhhhhhhcCCCchhhhHHhhhhhcceeeeeecc--- Q lcl|NC_011269. 
189 PGVGALPNEITIAGSHLMPDDLYTAVTYTDQRQLDSSRLLANPQEYRDLYRWDINTTGWAFKDSVVAGERIVQFGEF--- 265 (333) Q Consensus 189 p~vg~~~N~i~i~~g~Lt~~~L~~a~t~v~~~~L~at~il~~~~~~~Di~gw~~N~~~~~~~DpV~~~e~il~~G~f--- 265 (333) ++.++-++.+.+..|.+...+-+-.-..++|+|..|..|+-=+..+| .++=-.++-++..|.| T Consensus 134 ----------~~~~~~~~~d~i~dA~~~l~d~~~~~~~ivv~p~~~~~L~k~~~~~f----~~~~~~g~~~~~~g~ig~~ 199 (274) T protein:vir:96 134 ----------TVEADITKLDGLQTAIDKFNDEDLEPMVLFVNPLDAGGLRTSASDNF----TRPTQLGDNIIVKGAFGEA 199 (274) T ss_pred ----------CcCcccccHHHHHHHHHHhcccCCCceEEEeCHHHHHHHHhcccccc----cccccccccceeeccccee Confidence 23345567899999999999988889999999999999987222222 1111112224445554 Q ss_pred ---cccceeeecCCeEEEeeChhhhcccccccCceeccccchhhhccceehhhhhhhhhhccceEEEEecC Q lcl|NC_011269. 266 ---QIGKSIIIPRGTVYLTPEPEFLGVFPVMYSLDVEEDNKVERFNKGWVMDELVGMAILNPRGIVILRKA 333 (333) Q Consensus 266 ---gi~~skvlprgeiyvvadpE~~G~~pvR~~L~s~p~D~~er~~kGWvm~E~~g~~i~N~~siv~~~~~ 333 (333) .|..+..+|-++.|+.... -+|-+- ..++.+|......++..-=.....+|..++||.++|.++|+ T Consensus 200 ~G~~Vi~s~~~p~~t~~l~~~g-A~~~~~-~~~~~vE~~Rd~~~~~d~i~~~~~yg~~~~~~~~vv~~t~~ 268 (274) T protein:vir:96 200 LGAVIVRSNKLNKGEALLAKKG-AVKLIT-KRDFFLEKDRDASRKSTALYSDKHYVAYLYDESKVVKITKG 268 (274) T ss_pred cCeeEEEcCCCCcceEEEEeCc-ceeeee-cCCcccccccchhhcccEEEEeeEEEEEEEcCccEEEEEcC Confidence 4778889999999988643 366543 45566664445555555555567789999999999999999 No 7 >protein:vir:3613 Length: 272 # NCBI annotation: MHP # Family: family:all:522 # MgeID: mge:74 # MgeName: TP901-1 # Cross-refs: genbank:acc:NP_112699;genbank:gi:13786567;genbank:GeneID:921035 Probab=98.75 E-value=5.4e-10 Score=71.39 Aligned_cols=258 Identities=15% Similarity=0.202 Sum_probs=161.9 Q ss_pred hcchhcchHHHHHHHHHHhcCchhHHHHHHHHHHHHHHHHHhhhhhhhhhhhccc-c--CCCcceeecCCCCccceEEEE Q lcl|NC_011269. 
32 MGGRKLSAREKQAKLAHILSDKVGGIQRLGQSMIGPIQLQLRYQGILRNVLLEDT-L--TPGVPIQYDVLDDLGQAYMLH 108 (333) Q Consensus 32 ~~~~~ls~ee~~~Lm~~Al~~~Eg~~~aLg~~mA~pI~~q~~rqGi~RklL~~~T-L--~~G~~p~y~v~~~v~~a~~~~ 108 (333) |..- +|+ +++.+ .||-| ++.+.+++...+...+++ +.++ | .+|....+|+....+.+- .. T Consensus 1 ma~~----~T~---~~d~i-iPev~----~~~v~~~~~~~~~~~~~~----~~~~~l~g~~G~ti~iP~~~~~gda~-~~ 63 (272) T protein:vir:36 1 MSKQ----KTT---LADLV-NPEVL----APIVSYELNKALRFAPLA----QVDTTLQGQPGNTLKFPAFTYIGDAA-DV 63 (272) T ss_pred CCCc----cee---hhhhh-chHHH----HHHHHHHHHhhhhhcccc----ccccccccCCCCEEEEeeeccCcccc-cc Confidence 2210 111 01111 13332 333444444444444443 3322 2 246667777755554333 35 Q ss_pred cCCCcccceeecCceeeccceeeeccccccHHHhhhhcchhHHHHHHHHHHHHHHHhhhHHHHHHhhhhhhhhhhccccc Q lcl|NC_011269. 109 GNEGEIRITPFEGKRIEVQLFRIASFPQIKKEDLYYLRSNIVEYTQDMTKQAIMRQEDSRLVTLLEAAAVSYRVVDSSAQ 188 (333) Q Consensus 109 ~~~G~i~~Q~i~~~ri~~P~f~Ivs~P~V~~~dl~~~~~~vle~~q~~A~qaIM~qED~~~~slle~~a~~~r~~~ssA~ 188 (333) ++.+++..+.+.-...+..-.+.-..-.|.-.+-.+..+|.+.++-+++..++-++.|..+++.|.++. T Consensus 64 ~eg~~i~~~~lt~~~~~~~i~~~~k~~~vtD~~~~~~~~d~~~~~~~~~a~~~a~~~d~~i~~~l~~~~----------- 132 (272) T protein:vir:36 64 AEGGEISLDKIGTTTKSVTIKKAAKGTEITDEAALSGYGDPIGESNKQLGLSLANKVDDDLLSAAKTTS----------- 132 (272) T ss_pred CCCCccChhhcCCcceeEeeehhhccccccHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHhcccc----------- Confidence 666677777766555555544444333455555567778999999999999999999999988885443 Q ss_pred ccccCCCcceEEeeccccHHHHHHHHHHHHhhCCccceEEechhhhhhhhhcCCCchhhhHHhhhhhcceeeeeecc--- Q lcl|NC_011269. 189 PGVGALPNEITIAGSHLMPDDLYTAVTYTDQRQLDSSRLLANPQEYRDLYRWDINTTGWAFKDSVVAGERIVQFGEF--- 265 (333) Q Consensus 189 p~vg~~~N~i~i~~g~Lt~~~L~~a~t~v~~~~L~at~il~~~~~~~Di~gw~~N~~~~~~~DpV~~~e~il~~G~f--- 265 (333) +++ ++..+-+.+..|.+...+.+-+.+.+++||..|..|+. .+.| +...+.. 
..+ ++..|.| T Consensus 133 -------~~~---~~~~~~d~i~~A~~~lgd~~~~~~~ivv~p~~~~~L~k--~~~~-~~~~~~~-~~~-~~~~G~ig~~ 197 (272) T protein:vir:36 133 -------QTV---STKANVDGVQAALDIFNDEDAQAYVLIVNPKDAAKIRK--DANA-KNIGSEV-GAN-ALINGTYADV 197 (272) T ss_pred -------ccc---cccccHHHHHHHHHHhhhcCCCceEEEEcHHHHHHHhc--cccc-ccccccc-ccc-ceeeecccee Confidence 111 24567889999999999999999999999999999997 3322 2222221 112 3333443 Q ss_pred ---cccceeeecCCeEE---EeeChhhhcccccccCceeccccchhhhccceehhhhhhhhhhccceEEEEecC Q lcl|NC_011269. 266 ---QIGKSIIIPRGTVY---LTPEPEFLGVFPVMYSLDVEEDNKVERFNKGWVMDELVGMAILNPRGIVILRKA 333 (333) Q Consensus 266 ---gi~~skvlprgeiy---vvadpE~~G~~pvR~~L~s~p~D~~er~~kGWvm~E~~g~~i~N~~siv~~~~~ 333 (333) .|..+..+|-|+.+ ++.-+--.|-+- ..++.+|..-...++..-=.-.+.+|..+.||.++|.+..+ T Consensus 198 ~G~~Vv~s~~~p~~~~~~~~~~~~~gA~~~~~-~~~~~vE~~R~~~~~~d~i~~~~~y~~~v~~~~~vv~~t~~ 270 (272) T protein:vir:36 198 LGAQIVRSKKLAEGSALMFKIVSNSPALKLVL-KRGVQVETDRDIVTKTTVITADEHYAAYLYDLTKVVNITFT 270 (272) T ss_pred cCeeEEEeCCCCCCceeEEEEEecccceeeee-cCCcccccccchhhcCcEEEEEEEEEEEEEcCccEEEEeec Confidence 47789999999874 333344455443 45777775556666666666678899999999999999888 No 8 >protein:vir:191 Length: 385 # NCBI annotation: major head subunit precursor # Family: family:all:585 # MgeID: mge:6 # MgeName: HK97 # Cross-refs: genbank:acc:NP_037701;genbank:gi:9634158;genbank:GeneID:1262530 Probab=98.75 E-value=2.8e-10 Score=72.94 Aligned_cols=311 Identities=11% Similarity=0.084 Sum_probs=183.5 Q ss_pred CcccchhhhhhhhhhcccchHHHHHHHHHHhhcchhcchHHHHHHHHHHhcCchhHHHHHHHHHHHHHHHHHhhhhhhhh Q lcl|NC_011269. 1 MTLPVAVGSGLGRFAKASDDYVADIVEAKQRMGGRKLSAREKQAKLAHILSDKVGGIQRLGQSMIGPIQLQLRYQGILRN 80 (333) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ls~ee~~~Lm~~Al~~~Eg~~~aLg~~mA~pI~~q~~rqGi~Rk 80 (333) ..-.......-+........+.+......+... ......+..+.+.. -.+..|. 
.+-..+...|-..++.....++ T Consensus 61 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~-~~~~~g~--~i~~~~~~~ii~~~~~~~~l~~ 136 (385) T protein:vir:19 61 EQKLASGAENPGEKKSFSERAAEELIKSWDGKQ-GTFGAKTFNKSLGS-DADSAGS--LIQPMQIPGIIMPGLRRLTIRD 136 (385) T ss_pred HHHhhccccccchhhhhHHHHHHHHHHHHHHhh-ccchhhHHHhhhcc-ccccCCc--eecchhhhHHHHHhhhccchhh Confidence 000000000001111111122222222222221 12222222221211 1122232 3555667778888888888898 Q ss_pred hhhccccCCCcceeecCCCCccceEEEEcCCCcccceeecCceeeccceeeeccccccHHHhhhhcchhHHHHHHHHHHH Q lcl|NC_011269. 81 VLLEDTLTPGVPIQYDVLDDLGQAYMLHGNEGEIRITPFEGKRIEVQLFRIASFPQIKKEDLYYLRSNIVEYTQDMTKQA 160 (333) Q Consensus 81 lL~~~TL~~G~~p~y~v~~~v~~a~~~~~~~G~i~~Q~i~~~ri~~P~f~Ivs~P~V~~~dl~~~~~~vle~~q~~A~qa 160 (333) +....++..+.. .|++....+..+.|.+..+.++.....=+.+++.-.++...+.|..+-|+ .+.++..+...+..++ T Consensus 137 ~~~~~~~~~~~~-~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~~~~~~k~~~~~~is~ell~-d~~~l~~~i~~~la~a 214 (385) T protein:vir:19 137 LLAQGRTSSNAL-EYVREEVFTNNADVVAEKALKPESDITFSKQTANVKTIAHWVQASRQVMD-DAPMLQSYINNRLMYG 214 (385) T ss_pred hcceecccCcce-EEEEEecCCcceeeeccCccccccccceeEEEEeeeeEEEeehhhHHHHh-hHHHHHHHHHHHHHHH Confidence 888777655532 34443333335567777777887777767888888889999999988665 4578888889999999 Q ss_pred HHHHhhhHHHHHHhhhhhhhhhhcccccccccCCCcc----eE-EeeccccHHHHHHHHHHHHhhCCccceEEechhhhh Q lcl|NC_011269. 161 IMRQEDSRLVTLLEAAAVSYRVVDSSAQPGVGALPNE----IT-IAGSHLMPDDLYTAVTYTDQRQLDSSRLLANPQEYR 235 (333) Q Consensus 161 IM~qED~~~~slle~~a~~~r~~~ssA~p~vg~~~N~----i~-i~~g~Lt~~~L~~a~t~v~~~~L~at~il~~~~~~~ 235 (333) +-..+|.-++ .+.. +..|-.|.+..+ .+ -.++..+-++|-.+...++.-+...+.++||++-|. 
T Consensus 215 ~~~~~d~~~l---~G~g--------~~~~~~Gi~~~~~~~~~~~~~~~~~~~d~i~~~~~~l~~~~~~~~~~~~~~~~~~ 283 (385) T protein:vir:19 215 LALKEEGQLL---NGDG--------TGDNLEGLNKVATAYDTSLNATGDTRADIIAHAIYQVTESEFSASGIVLNPRDWH 283 (385) T ss_pred HHHHHHHHHH---hccC--------CCCcccccccccccccccccccccchHHHHHHHHHhhccccCCCCEEEEcHHHHH Confidence 9999996544 3322 223323322111 11 123456678888899889999999999999999999 Q ss_pred hhhhcCCCchhhhHHhhhhhcceeeeeeccccc--ceeeecCCeEEEeeChhhhcccccccCceeccccch----hhhcc Q lcl|NC_011269. 236 DLYRWDINTTGWAFKDSVVAGERIVQFGEFQIG--KSIIIPRGTVYLTPEPEFLGVFPVMYSLDVEEDNKV----ERFNK 309 (333) Q Consensus 236 Di~gw~~N~~~~~~~DpV~~~e~il~~G~fgi~--~skvlprgeiyvvadpE~~G~~pvR~~L~s~p~D~~----er~~k 309 (333) -|+..--.+--|.+-+|.....-. ++|++ .+-.+|.|++++ .|.-..-.+.+|.++.++..+.. ++-.. T Consensus 284 ~l~~lkd~~G~~l~~~~~~~~~~~----l~G~pV~~~~~~p~~~~~~-gd~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~ 358 (385) T protein:vir:19 284 NIALLKDNEGRYIFGGPQAFTSNI----MWGLPVVPTKAQAAGTFTV-GGFDMASQVWDRMDATVEVSREDRDNFVKNML 358 (385) T ss_pred HHHHhhcCCCceeccCcccCCCce----ecceeeEEcCcCCCCcEEE-eecccEEEEEEecceEEEEeccccchhhcCcE Confidence 988743111113333332221111 24533 667889999886 45543455778999887654422 22345 Q ss_pred ceehhhhhhhhhhccceEEEEecC Q lcl|NC_011269. 310 GWVMDELVGMAILNPRGIVILRKA 333 (333) Q Consensus 310 GWvm~E~~g~~i~N~~siv~~~~~ 333 (333) +|....-++..+.||.+++.+..+ T Consensus 359 ~~~~~~r~~~~v~~~~a~~~~~~~ 382 (385) T protein:vir:19 359 TILCEERLALAHYRPTAIIKGTFS 382 (385) T ss_pred EEEEEEeeccEEecccceEEEEec Confidence 677888999999999999999988 No 9 >protein:vir:1886 Length: 385 # NCBI annotation: major capsid subunit precursor # Family: family:all:585 # MgeID: mge:41 # MgeName: HK022 # Cross-refs: genbank:acc:NP_037666;genbank:gi:9634124;genbank:GeneID:1262513 Probab=98.75 E-value=2.8e-10 Score=72.94 Aligned_cols=311 Identities=11% Similarity=0.084 Sum_probs=183.5 Q ss_pred CcccchhhhhhhhhhcccchHHHHHHHHHHhhcchhcchHHHHHHHHHHhcCchhHHHHHHHHHHHHHHHHHhhhhhhhh Q lcl|NC_011269. 
1 MTLPVAVGSGLGRFAKASDDYVADIVEAKQRMGGRKLSAREKQAKLAHILSDKVGGIQRLGQSMIGPIQLQLRYQGILRN 80 (333) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ls~ee~~~Lm~~Al~~~Eg~~~aLg~~mA~pI~~q~~rqGi~Rk 80 (333) ..-.......-+........+.+......+... ......+..+.+.. -.+..|. .+-..+...|-..++.....++ T Consensus 61 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~-~~~~~g~--~i~~~~~~~ii~~~~~~~~l~~ 136 (385) T protein:vir:18 61 EQKLASGAENPGEKKSFSERAAEELIKSWDGKQ-GTFGAKTFNKSLGS-DADSAGS--LIQPMQIPGIIMPGLRRLTIRD 136 (385) T ss_pred HHHhhccccccchhhhhHHHHHHHHHHHHHHhh-ccchhhHHHhhhcc-ccccCCc--eecchhhhHHHHHhhhccchhh Confidence 000000000001111111122222222222221 12222222221211 1122232 3555667778888888888898 Q ss_pred hhhccccCCCcceeecCCCCccceEEEEcCCCcccceeecCceeeccceeeeccccccHHHhhhhcchhHHHHHHHHHHH Q lcl|NC_011269. 81 VLLEDTLTPGVPIQYDVLDDLGQAYMLHGNEGEIRITPFEGKRIEVQLFRIASFPQIKKEDLYYLRSNIVEYTQDMTKQA 160 (333) Q Consensus 81 lL~~~TL~~G~~p~y~v~~~v~~a~~~~~~~G~i~~Q~i~~~ri~~P~f~Ivs~P~V~~~dl~~~~~~vle~~q~~A~qa 160 (333) +....++..+.. .|++....+..+.|.+..+.++.....=+.+++.-.++...+.|..+-|+ .+.++..+...+..++ T Consensus 137 ~~~~~~~~~~~~-~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~~~~~~k~~~~~~is~ell~-d~~~l~~~i~~~la~a 214 (385) T protein:vir:18 137 LLAQGRTSSNAL-EYVREEVFTNNADVVAEKALKPESDITFSKQTANVKTIAHWVQASRQVMD-DAPMLQSYINNRLMYG 214 (385) T ss_pred hcceecccCcce-EEEEEecCCcceeeeccCccccccccceeEEEEeeeeEEEeehhhHHHHh-hHHHHHHHHHHHHHHH Confidence 888777655532 34443333335567777777887777767888888889999999988665 4578888889999999 Q ss_pred HHHHhhhHHHHHHhhhhhhhhhhcccccccccCCCcc----eE-EeeccccHHHHHHHHHHHHhhCCccceEEechhhhh Q lcl|NC_011269. 161 IMRQEDSRLVTLLEAAAVSYRVVDSSAQPGVGALPNE----IT-IAGSHLMPDDLYTAVTYTDQRQLDSSRLLANPQEYR 235 (333) Q Consensus 161 IM~qED~~~~slle~~a~~~r~~~ssA~p~vg~~~N~----i~-i~~g~Lt~~~L~~a~t~v~~~~L~at~il~~~~~~~ 235 (333) +-..+|.-++ .+.. +..|-.|.+..+ .+ -.++..+-++|-.+...++.-+...+.++||++-|. 
T Consensus 215 ~~~~~d~~~l---~G~g--------~~~~~~Gi~~~~~~~~~~~~~~~~~~~d~i~~~~~~l~~~~~~~~~~~~~~~~~~ 283 (385) T protein:vir:18 215 LALKEEGQLL---NGDG--------TGDNLEGLNKVATAYDTSLNATGDTRADIIAHAIYQVTESEFSASGIVLNPRDWH 283 (385) T ss_pred HHHHHHHHHH---hccC--------CCCcccccccccccccccccccccchHHHHHHHHHhhccccCCCCEEEEcHHHHH Confidence 9999996544 3322 223323322111 11 123456678888899889999999999999999999 Q ss_pred hhhhcCCCchhhhHHhhhhhcceeeeeeccccc--ceeeecCCeEEEeeChhhhcccccccCceeccccch----hhhcc Q lcl|NC_011269. 236 DLYRWDINTTGWAFKDSVVAGERIVQFGEFQIG--KSIIIPRGTVYLTPEPEFLGVFPVMYSLDVEEDNKV----ERFNK 309 (333) Q Consensus 236 Di~gw~~N~~~~~~~DpV~~~e~il~~G~fgi~--~skvlprgeiyvvadpE~~G~~pvR~~L~s~p~D~~----er~~k 309 (333) -|+..--.+--|.+-+|.....-. ++|++ .+-.+|.|++++ .|.-..-.+.+|.++.++..+.. ++-.. T Consensus 284 ~l~~lkd~~G~~l~~~~~~~~~~~----l~G~pV~~~~~~p~~~~~~-gd~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~ 358 (385) T protein:vir:18 284 NIALLKDNEGRYIFGGPQAFTSNI----MWGLPVVPTKAQAAGTFTV-GGFDMASQVWDRMDATVEVSREDRDNFVKNML 358 (385) T ss_pred HHHHhhcCCCceeccCcccCCCce----ecceeeEEcCcCCCCcEEE-eecccEEEEEEecceEEEEeccccchhhcCcE Confidence 988743111113333332221111 24533 667889999886 45543455778999887654422 22345 Q ss_pred ceehhhhhhhhhhccceEEEEecC Q lcl|NC_011269. 310 GWVMDELVGMAILNPRGIVILRKA 333 (333) Q Consensus 310 GWvm~E~~g~~i~N~~siv~~~~~ 333 (333) +|....-++..+.||.+++.+..+ T Consensus 359 ~~~~~~r~~~~v~~~~a~~~~~~~ 382 (385) T protein:vir:18 359 TILCEERLALAHYRPTAIIKGTFS 382 (385) T ss_pred EEEEEEeeccEEecccceEEEEec Confidence 677888999999999999999988 No 10 >protein:vir:4953 Length: 397 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:108 # MgeName: Sfi19 # Cross-refs: genbank:acc:NP_049929;genbank:gi:9632900;genbank:GeneID:1262076 Probab=98.70 E-value=1e-09 Score=69.88 Aligned_cols=300 Identities=12% Similarity=0.077 Sum_probs=186.0 Q ss_pred Ccccchhhhhhhhhhcccc-hHHHHHHHHHHhhcchhcchHHHHHHHHHH-hcCchhHHHHHHHHHHHHHHHHHhhhhhh Q lcl|NC_011269. 
1 MTLPVAVGSGLGRFAKASD-DYVADIVEAKQRMGGRKLSAREKQAKLAHI-LSDKVGGIQRLGQSMIGPIQLQLRYQGIL 78 (333) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~ls~ee~~~Lm~~A-l~~~Eg~~~aLg~~mA~pI~~q~~rqGi~ 78 (333) -...-.....-.+...... ...+.. .|.-...|-...+.++=... ..+..|++ .+-+-+...|...++....+ T Consensus 65 ~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~l~~~~~~~~~~~~~~t~~~gg~-~vP~~~~~~ii~~~~~~~~l 139 (397) T protein:vir:49 65 ANEVANMSEEEKKPLTKSEEEVKAGF----VKDFKNLVRGRYQNLLDSKTDASGSDAGL-TIPQDIQTAIHTLVSQYDSL 139 (397) T ss_pred HHhhhccccccccccccchhHHHHHH----HHHHHHHHhcchhHHHHHhhccccccCcc-cccHhHHHHHHHHHHhhhhH Confidence 0000000000000000000 000000 00000011111111110011 12234543 56788889999999999999 Q ss_pred hhhhhccccCCCc--ceeecCCCCccceEEEEcCCCcccce-eecCceeeccceeeeccccccHHHhhhhcchhHHHHHH Q lcl|NC_011269. 79 RNVLLEDTLTPGV--PIQYDVLDDLGQAYMLHGNEGEIRIT-PFEGKRIEVQLFRIASFPQIKKEDLYYLRSNIVEYTQD 155 (333) Q Consensus 79 RklL~~~TL~~G~--~p~y~v~~~v~~a~~~~~~~G~i~~Q-~i~~~ri~~P~f~Ivs~P~V~~~dl~~~~~~vle~~q~ 155 (333) +++....+++.+. .+ |+.-.+....+-|++-.+++.++ ...=+.|++.-.++..++.|..+=|+....|+..+..+ T Consensus 140 ~~~~~~~~~~~~~~~~~-~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~ 218 (397) T protein:vir:49 140 QEYVNVENVTTLTGSRV-YEKWTDITGLANIDDEAGKIADVDDPKLSLIKYTIKRYAGISTVTNSLLADSAENILAWLSG 218 (397) T ss_pred HhhhceeecccCccceE-EEeeccCCcceeeecCccccccccccceeeEEeeeeeEEeeehhHHHHHhhhHHHHHHHHHH Confidence 9999888876543 33 33333443445678777777764 23337899999999999999999999999999999999 Q ss_pred HHHHHHHHHhhhHHHHHHhhhhhhhhhhcccccccccCCCcceEEeeccccHHHHHHHHHHHHhhCCccceEEechhhhh Q lcl|NC_011269. 156 MTKQAIMRQEDSRLVTLLEAAAVSYRVVDSSAQPGVGALPNEITIAGSHLMPDDLYTAVTYTDQRQLDSSRLLANPQEYR 235 (333) Q Consensus 156 ~A~qaIM~qED~~~~slle~~a~~~r~~~ssA~p~vg~~~N~i~i~~g~Lt~~~L~~a~t~v~~~~L~at~il~~~~~~~ 235 (333) +..++|.+.+|.-+++-. |.- .-.++..+-+++..+...+..-......++||+.-|. 
T Consensus 219 ~l~~~~~~~~d~ai~~G~------------------g~~----~~~~~~~~~d~i~~~~~~l~~~~~~~a~~vmn~~~~~ 276 (397) T protein:vir:49 219 WIAKKVVVTRNKAILEAI------------------AAL----PTKPTLTKWDDIIDLEAKVDPAIKQTSFFLTNTSGFT 276 (397) T ss_pred HHHHHHHHHHHHHHHhhc------------------ccc----ccccccccHHHHHHHHHhhhhhhcCCCEEEEcHHHHH Confidence 999999999997665543 211 1134556788999999999988899999999999999 Q ss_pred hhhhcCCCchhhhHHhhhhhcceeee--eecccccc----eeeecCCe----EEEeeChhhhcccccccCceeccccchh Q lcl|NC_011269. 236 DLYRWDINTTGWAFKDSVVAGERIVQ--FGEFQIGK----SIIIPRGT----VYLTPEPEFLGVFPVMYSLDVEEDNKVE 305 (333) Q Consensus 236 Di~gw~~N~~~~~~~DpV~~~e~il~--~G~fgi~~----skvlprge----iyvvadpE~~G~~pvR~~L~s~p~D~~e 305 (333) .|+.=- +..+ .|+-+.+.--. .-++|.+- +..+|-++ .+++.|....-.+-+|+|+.++-.++.+ T Consensus 277 ~l~~lk-d~~G----~~l~~~~~~~~~~~~l~G~PV~~~~~~~~~~~~~~~~~i~~gd~~~~~~~~~~~~~~i~~~~~~~ 351 (397) T protein:vir:49 277 ALKKVK-NALG----DYLMERDVKSPTGYSIDGFAVKEVADRWLANGTGGAMPLYFGDLKQAVTLFDRQHMSLLSTNIGG 351 (397) T ss_pred HHHHhh-cCCC----ceeeccCcCCCCCceecceeeEEecccccccccCCceeEEEeeccceEEEEeecceEEEEecccc Confidence 998721 1111 23322221100 11355442 44566554 3667777766677889999988777554 Q ss_pred h-h---ccceehhhhhhhhhhccceEEEEecC Q lcl|NC_011269. 306 R-F---NKGWVMDELVGMAILNPRGIVILRKA 333 (333) Q Consensus 306 r-~---~kGWvm~E~~g~~i~N~~siv~~~~~ 333 (333) . | ..++..++-++..+.||.++|.+..+ T Consensus 352 ~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~ 383 (397) T protein:vir:49 352 GAFETDTTKVRVIDRFDVVATDTEAFVPASFK 383 (397) T ss_pred chhhcCceeEEEEeeeCcEEecccceEEEEee Confidence 2 3 46789999999999999999999865 No 11 >protein:vir:96833 Length: 275 # NCBI annotation: ORF015 # Family: family:all:522 # MgeID: mge:1642 # MgeName: EW # Cross-refs: genbank:acc:YP_240157;genbank:gi:66395822;genbank:GeneID:5133174 Probab=98.68 E-value=8.1e-10 Score=70.43 Aligned_cols=260 Identities=18% Similarity=0.229 Sum_probs=160.2 Q ss_pred hcchhcchHHHHHHHHHHhcCchhHHHHHHHHHHHHHHHHHhhhhhhhhhhhccc-c--CCCcceeecCCCCccceEEEE Q lcl|NC_011269. 
32 MGGRKLSAREKQAKLAHILSDKVGGIQRLGQSMIGPIQLQLRYQGILRNVLLEDT-L--TPGVPIQYDVLDDLGQAYMLH 108 (333) Q Consensus 32 ~~~~~ls~ee~~~Lm~~Al~~~Eg~~~aLg~~mA~pI~~q~~rqGi~RklL~~~T-L--~~G~~p~y~v~~~v~~a~~~~ 108 (333) |- ..++|+ +++.+ .|| .+++.+.+++...+...++ .+.++ | .+|....+|+.+..+.+- .. T Consensus 1 ~~---~~~~T~---l~d~i-~PE----v~~~~v~~~~~~~~~~~~~----~~~~~~l~g~~G~tv~iP~~~~ig~a~-~~ 64 (275) T protein:vir:96 1 MA---LENMTK---LANMV-NPE----VLAPMMQAELDKKLKFAQF----ADIDNTLVGQPGNTITFPAFVYSGDAK-VV 64 (275) T ss_pred CC---Ccccch---hhhhh-chH----HHHHHHHHHHHHhhhhccc----ceecccccCCCCCEEEeeeeccCCccc-cc Confidence 11 112222 12211 222 3344444444444444444 33222 2 246667777655443333 34 Q ss_pred cCCCcccceeecCceeeccceeeeccccccHHHhhhhcchhHHHHHHHHHHHHHHHhhhHHHHHHhhhhhhhhhhccccc Q lcl|NC_011269. 109 GNEGEIRITPFEGKRIEVQLFRIASFPQIKKEDLYYLRSNIVEYTQDMTKQAIMRQEDSRLVTLLEAAAVSYRVVDSSAQ 188 (333) Q Consensus 109 ~~~G~i~~Q~i~~~ri~~P~f~Ivs~P~V~~~dl~~~~~~vle~~q~~A~qaIM~qED~~~~slle~~a~~~r~~~ssA~ 188 (333) .+.+++..+.+.-......-.+.-.--.+.=++..+..+|.+.++...+..++-+..|..+++.+.++.. T Consensus 65 ~~g~~i~~~~lt~~~~~~~i~~~~~~~~i~D~~~~~~~~d~~~~~~~~~a~~~a~~~d~~ll~~l~~a~~---------- 134 (275) T protein:vir:96 65 PEGEEIPIDLIETKKRQATIRKIGKGTVLTDEALLSGYGDPKGEAVRQHGLAIANKVDNDVLEALQGATL---------- 134 (275) T ss_pred cCCCCcchhhcccceeeEEeehhcccccccHHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHhcccc---------- Confidence 5555677766654544444333333334444455566689999999999999999999999888865431 Q ss_pred ccccCCCcceEEeeccccHHHHHHHHHHHHhhCCccceEEechhhhhhhhhcCCCchhhhHHhhhhhcceeeeeecc--- Q lcl|NC_011269. 189 PGVGALPNEITIAGSHLMPDDLYTAVTYTDQRQLDSSRLLANPQEYRDLYRWDINTTGWAFKDSVVAGERIVQFGEF--- 265 (333) Q Consensus 189 p~vg~~~N~i~i~~g~Lt~~~L~~a~t~v~~~~L~at~il~~~~~~~Di~gw~~N~~~~~~~DpV~~~e~il~~G~f--- 265 (333) ++..+-++.+.+..|.+...+-+-.-..++++|..|..|+-=...+|- .++-. 
++-++..|.| T Consensus 135 ----------~~~~~~~~~d~i~dA~~~lgd~~~~~~~ivv~p~~~~~L~k~~~~~f~---~~~~~-g~~~~~~G~ig~~ 200 (275) T protein:vir:96 135 ----------KVEADITKLAGLQTAIDKFNDEDLEPMVLFVNPLDAGKLRASATDNFT---RATLL-GDNVIVKGAFGEA 200 (275) T ss_pred ----------cccccccCHHHHHHHHHHhccccCCccEEEeCHHHHHHHHhccccccc---ccccc-cccceecccccee Confidence 223445678999999999988777888999999999999771111221 12222 2224444544 Q ss_pred ---cccceeeecCCeEEEeeChhhhcccccccCceeccccchhhhccceehhhhhhhhhhccceEEEEecC Q lcl|NC_011269. 266 ---QIGKSIIIPRGTVYLTPEPEFLGVFPVMYSLDVEEDNKVERFNKGWVMDELVGMAILNPRGIVILRKA 333 (333) Q Consensus 266 ---gi~~skvlprgeiyvvadpE~~G~~pvR~~L~s~p~D~~er~~kGWvm~E~~g~~i~N~~siv~~~~~ 333 (333) .|..+..+|-|+.|+...+- +|-+. ..++.+|..-...++..-=...+.+|..+.||.+||.+.+. T Consensus 201 ~G~~Vi~s~~~p~~t~~i~~~gA-~~~~~-~~~~~vE~~Rd~~~~~d~i~~~~~y~~~~~~~~~vv~~t~~ 269 (275) T protein:vir:96 201 LGAIIVRSNKIKEGEAILAKRGA-VKLIT-KRDFFLETERHASHKSTALFSDKHYVAYLYDESKVVKITKS 269 (275) T ss_pred cCeeEEEeCCCCcceEEEEeccc-eeeee-cCCcccccccchhhcCcEEEEeEEEEEEEEcCccEEEEEec Confidence 37789999999999987554 66554 45677665545555655555566779999999999999887 No 12 >protein:vir:3991 Length: 404 # NCBI annotation: major structural protein # Family: family:all:21 # MgeID: mge:319 # MgeName: BK5-T # Cross-refs: genbank:acc:NP_116499;genbank:gi:14251132;genbank:GeneID:921252 Probab=98.68 E-value=1.6e-09 Score=68.86 Aligned_cols=301 Identities=12% Similarity=0.086 Sum_probs=190.9 Q ss_pred CcccchhhhhhhhhhcccchHHHHHHHHHHhhcchhcchHHHHHHHHHHhcCchhHHHHHHHHHHHHHHHHHhhhhhhhh Q lcl|NC_011269. 1 MTLPVAVGSGLGRFAKASDDYVADIVEAKQRMGGRKLSAREKQAKLAHILSDKVGGIQRLGQSMIGPIQLQLRYQGILRN 80 (333) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ls~ee~~~Lm~~Al~~~Eg~~~aLg~~mA~pI~~q~~rqGi~Rk 80 (333) -.-+-.....-++..+.++.|...+.... |-|...+...+..++ ....+..|+. .+-+-+.+.|...++....++. 
T Consensus 73 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~e~~a~--~~~t~~~gg~-~iP~~~~~~ii~~~~~~~~l~~ 148 (404) T protein:vir:39 73 NMREEEKGPLNKSEYELKDKFVKEFVNMV-RNPMAFLNTVSSKTE--TSGSDSAAGL-TIPQDIRTMINTLVRQYDSLQQ 148 (404) T ss_pred ccccccccccccchhhhHHHHHHHHHHHH-hcchhhhhhhhhhhh--hcccccCCce-eccHHHHHHHHHHHHhhhhHHh Confidence 11111122222333444455544433221 223233333333332 1233444543 5678888999999999999999 Q ss_pred hhhccccCCCccee-ecCCCCccceEEEEcCCCcccce-eecCceeeccceeeeccccccHHHhhhhcchhHHHHHHHHH Q lcl|NC_011269. 81 VLLEDTLTPGVPIQ-YDVLDDLGQAYMLHGNEGEIRIT-PFEGKRIEVQLFRIASFPQIKKEDLYYLRSNIVEYTQDMTK 158 (333) Q Consensus 81 lL~~~TL~~G~~p~-y~v~~~v~~a~~~~~~~G~i~~Q-~i~~~ri~~P~f~Ivs~P~V~~~dl~~~~~~vle~~q~~A~ 158 (333) +....+++.+.... |.+..+....+.|++-.+++++. ...=+.|++.-.++..+..|..+=|+....|+..+..++.. T Consensus 149 ~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~ 228 (404) T protein:vir:39 149 YVRVESVSTSNGSRVYEKWTDVTPLTVMDAEDGKIPDLDNPRLTIIKYLIKRYAGIITATNTLLKDTAENILAWLSSWIA 228 (404) T ss_pred hcceeeccCCcceEEEEeecCCccceeeecCccccccccccceeeEEeeeeeEEeeehhHHHHHhhchHHHHHHHHHHHH Confidence 99998888765332 33334444455677777777753 23337888888899999999999999899999999999999 Q ss_pred HHHHHHhhhHHHHHHhhhhhhhhhhcccccccccCCCcceEEeeccccHHHHHHHHH-HHHhhCCccceEEechhhhhhh Q lcl|NC_011269. 159 QAIMRQEDSRLVTLLEAAAVSYRVVDSSAQPGVGALPNEITIAGSHLMPDDLYTAVT-YTDQRQLDSSRLLANPQEYRDL 237 (333) Q Consensus 159 qaIM~qED~~~~slle~~a~~~r~~~ssA~p~vg~~~N~i~i~~g~Lt~~~L~~a~t-~v~~~~L~at~il~~~~~~~Di 237 (333) ++|-+.+|.-+++-. .+..| .++..+.+++..++. 
.+.........++||+.-|.-| T Consensus 229 ~~~~~~~d~~il~g~-----------g~~~~-----------~~~~~~~~~i~~~~~~~~~~~~~~~a~~v~n~~~~~~L 286 (404) T protein:vir:39 229 KKVVVTRNQAIIAAM-----------GTVPK-----------KPTIAKFDDVITMINTSVDPAIIATSSLLTNQSGLNKL 286 (404) T ss_pred HHHHHHHHHHHHhcc-----------ccccc-----------ccccccHHHHHHHHHHhhhhhhccCCEEEEcHHHHHHH Confidence 999999997665432 11122 245566788877764 4555555667899999999999 Q ss_pred hhcCC-CchhhhHHhhhhhcceeee--eecccccc----eeeecCCe----EEEeeChhhhcccccccCceeccccchhh Q lcl|NC_011269. 238 YRWDI-NTTGWAFKDSVVAGERIVQ--FGEFQIGK----SIIIPRGT----VYLTPEPEFLGVFPVMYSLDVEEDNKVER 306 (333) Q Consensus 238 ~gw~~-N~~~~~~~DpV~~~e~il~--~G~fgi~~----skvlprge----iyvvadpE~~G~~pvR~~L~s~p~D~~er 306 (333) +. + +..+ -|+-+.+..-. .-++|.+- +..+|-+. .+++.|.-..-.+.+|+|+.++-.++.+. T Consensus 287 ~~--lkd~~G----~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~ 360 (404) T protein:vir:39 287 AL--VKTAEG----KYLLEPDPTKPNSYLIKGKKVIVVADRWLPNSGSTVYPLYYGDMSQAITLFDRENMSLLPTNIGAG 360 (404) T ss_pred HH--hhccCC----ceeeccCcCCCCcceecceeEEEecccccCccCCCccEEEEEeccccEEEEeecceEEEEeccchh Confidence 86 2 1111 22222221100 01244332 23345333 35677877667788999999988886643 Q ss_pred -h---ccceehhhhhhhhhhccceEEEEecC Q lcl|NC_011269. 307 -F---NKGWVMDELVGMAILNPRGIVILRKA 333 (333) Q Consensus 307 -~---~kGWvm~E~~g~~i~N~~siv~~~~~ 333 (333) | ...+..++-++..+.||.+++.+.-. 
T Consensus 361 ~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~ 391 (404) T protein:vir:39 361 AFETDTTKIRVIDRFDVKTTDSEALVAGSFT 391 (404) T ss_pred hhhhceeeEEEEeeeccEEecccceEEEEee Confidence 2 46799999999999999999999844 No 13 >protein:vir:78830 Length: 324 # NCBI annotation: major head protein # Family: family:all:507 # MgeID: mge:1858 # MgeName: 80alpha # Cross-refs: genbank:acc:YP_001285361;genbank:gi:148717889;genbank:GeneID:5246961 Probab=98.67 E-value=2e-09 Score=68.30 Aligned_cols=274 Identities=13% Similarity=0.104 Sum_probs=188.7 Q ss_pred hcchhcchHHHHHHHHHH-----------hcCchhHHHHHHHHHHHHHHHHHhhhhhhhhhhhccccCCCcceeecCCCC Q lcl|NC_011269. 32 MGGRKLSAREKQAKLAHI-----------LSDKVGGIQRLGQSMIGPIQLQLRYQGILRNVLLEDTLTPGVPIQYDVLDD 100 (333) Q Consensus 32 ~~~~~ls~ee~~~Lm~~A-----------l~~~Eg~~~aLg~~mA~pI~~q~~rqGi~RklL~~~TL~~G~~p~y~v~~~ 100 (333) |--.++-+++.+...... +.+..+.. .+-+.+++.|.+.++.+...+++.+..+++.|. .+||+... T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~-~iP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~-~~~p~~~~ 78 (324) T protein:vir:78 1 MEQTQKLKLNLQHFASNNVKPQVFNPDNVMMHEKKDG-TLMNEFTTPILQEVMENSKIMQLGKYEPMEGTE-KKFTFWAD 78 (324) T ss_pred CCcchhhhHHHHHHHHHhhhhhhhccccccccCcCcc-ccchhHHHHHHHHHHhhchhhhhcceeeccCCc-eEEEEEec Confidence 333344344433333222 11222321 466788999999999999999999988887654 44666444 Q ss_pred ccceEEEEcCCCcccceeecCceeeccceeeeccccccHHHhhhhcchhHHHHHHHHHHHHHHHhhhHHHHHHhhhhhhh Q lcl|NC_011269. 101 LGQAYMLHGNEGEIRITPFEGKRIEVQLFRIASFPQIKKEDLYYLRSNIVEYTQDMTKQAIMRQEDSRLVTLLEAAAVSY 180 (333) Q Consensus 101 v~~a~~~~~~~G~i~~Q~i~~~ri~~P~f~Ivs~P~V~~~dl~~~~~~vle~~q~~A~qaIM~qED~~~~slle~~a~~~ 180 (333) ...+ -|++..++++.....=+.+++.-.++.....|..+-|+....++..+..++..++|-+.+|..+++ +.. 
T Consensus 79 ~~~a-~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~~~is~ell~ds~~~l~~~i~~~la~ai~~~~d~a~l~---G~g--- 151 (324) T protein:vir:78 79 KPGA-YWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGIL---NQG--- 151 (324) T ss_pred Ccce-eEecCCccccccccceeEEEEeeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhc---cCC--- Confidence 4444 467888888888888888999989999999999999999999999999999999999999965542 221 Q ss_pred hhhcccccccccCCCcc----eEEeeccccHHHHHHHHHHHHhhCCccceEEechhhhhhhhhcCCCchhhhHHhhhhhc Q lcl|NC_011269. 181 RVVDSSAQPGVGALPNE----ITIAGSHLMPDDLYTAVTYTDQRQLDSSRLLANPQEYRDLYRWDINTTGWAFKDSVVAG 256 (333) Q Consensus 181 r~~~ssA~p~vg~~~N~----i~i~~g~Lt~~~L~~a~t~v~~~~L~at~il~~~~~~~Di~gw~~N~~~~~~~DpV~~~ 256 (333) .+..|+ ++.|. -+..++.++-++|.++...+..-++....++||++.|..|+. + +|+- + T Consensus 152 ----~~~~~~--gi~~~~~~~~~~~~~~~t~~~i~~~~~~l~~~~~~~~~~vmn~~~~~~L~~--l-------~d~~--G 214 (324) T protein:vir:78 152 ----NNPFGK--SIAQSIEKTNKVIKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRK--I-------VDPE--T 214 (324) T ss_pred ----CCCcCc--cccccccccceeccccccHHHHHHHHHhhhhccCCCCEEEEcHHHHHHHHH--h-------hccC--C Confidence 111111 11121 134567889999999999999999999999999999998876 2 2322 2 Q ss_pred ceeeeee----cccccc----eeeecCCeEEEeeChhhhcccccccCceeccccch----------------hhhcccee Q lcl|NC_011269. 257 ERIVQFG----EFQIGK----SIIIPRGTVYLTPEPEFLGVFPVMYSLDVEEDNKV----------------ERFNKGWV 312 (333) Q Consensus 257 e~il~~G----~fgi~~----skvlprgeiyvvadpE~~G~~pvR~~L~s~p~D~~----------------er~~kGWv 312 (333) .-+++.| +.|.+. +.-++.|.+| ..|... -.+-.|+++..+-.+.. ++-...|. T Consensus 215 ~~~~~~~~~~~l~G~PV~~~~~~~~~~~~~~-~gd~~~-~~~g~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~d~~~~r 292 (324) T protein:vir:78 215 KERIYDRNSDSLDGLPVVNLKSSNLKRGELI-TGDFDK-LIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALR 292 (324) T ss_pred CeeecCCCCCcccceeeEeeCCCCCCcceEE-EEecce-EEEEEecCcEEEEeecccccccccccccchhhhhcCcEEEE Confidence 2222222 244442 2335666665 446664 35778899888766543 22346667 Q ss_pred hhhhhhhhhhccceEEEEecC Q lcl|NC_011269. 
313 MDELVGMAILNPRGIVILRKA 333 (333) Q Consensus 313 m~E~~g~~i~N~~siv~~~~~ 333 (333) ..+-++.++.||-+++.|.++ T Consensus 293 ~~~r~d~~v~~~~A~~~l~~a 313 (324) T protein:vir:78 293 ATMHVALHIADDKAFAKLVPA 313 (324) T ss_pred EEEEEccEEecccceEEEecc Confidence 777789999999999999998 No 14 >protein:vir:96392 Length: 324 # NCBI annotation: ORF011 # Family: family:all:507 # MgeID: mge:1613 # MgeName: 53 # Cross-refs: genbank:acc:YP_239648;genbank:gi:66395381;genbank:GeneID:5132868 Probab=98.67 E-value=2e-09 Score=68.30 Aligned_cols=274 Identities=13% Similarity=0.104 Sum_probs=188.7 Q ss_pred hcchhcchHHHHHHHHHH-----------hcCchhHHHHHHHHHHHHHHHHHhhhhhhhhhhhccccCCCcceeecCCCC Q lcl|NC_011269. 32 MGGRKLSAREKQAKLAHI-----------LSDKVGGIQRLGQSMIGPIQLQLRYQGILRNVLLEDTLTPGVPIQYDVLDD 100 (333) Q Consensus 32 ~~~~~ls~ee~~~Lm~~A-----------l~~~Eg~~~aLg~~mA~pI~~q~~rqGi~RklL~~~TL~~G~~p~y~v~~~ 100 (333) |--.++-+++.+...... +.+..+.. .+-+.+++.|.+.++.+...+++.+..+++.|. .+||+... T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~-~iP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~-~~~p~~~~ 78 (324) T protein:vir:96 1 MEQTQKLKLNLQHFASNNVKPQVFNPDNVMMHEKKDG-TLMNEFTTPILQEVMENSKIMQLGKYEPMEGTE-KKFTFWAD 78 (324) T ss_pred CCcchhhhHHHHHHHHHhhhhhhhccccccccCcCcc-ccchhHHHHHHHHHHhhchhhhhcceeeccCCc-eEEEEEec Confidence 333344344433333222 11222321 466788999999999999999999988887654 44666444 Q ss_pred ccceEEEEcCCCcccceeecCceeeccceeeeccccccHHHhhhhcchhHHHHHHHHHHHHHHHhhhHHHHHHhhhhhhh Q lcl|NC_011269. 101 LGQAYMLHGNEGEIRITPFEGKRIEVQLFRIASFPQIKKEDLYYLRSNIVEYTQDMTKQAIMRQEDSRLVTLLEAAAVSY 180 (333) Q Consensus 101 v~~a~~~~~~~G~i~~Q~i~~~ri~~P~f~Ivs~P~V~~~dl~~~~~~vle~~q~~A~qaIM~qED~~~~slle~~a~~~ 180 (333) ...+ -|++..++++.....=+.+++.-.++.....|..+-|+....++..+..++..++|-+.+|..+++ +.. 
T Consensus 79 ~~~a-~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~~~is~ell~ds~~~l~~~i~~~la~ai~~~~d~a~l~---G~g--- 151 (324) T protein:vir:96 79 KPGA-YWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGIL---NQG--- 151 (324) T ss_pred Ccce-eEecCCccccccccceeEEEEeeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhc---cCC--- Confidence 4444 467888888888888888999989999999999999999999999999999999999999965542 221 Q ss_pred hhhcccccccccCCCcc----eEEeeccccHHHHHHHHHHHHhhCCccceEEechhhhhhhhhcCCCchhhhHHhhhhhc Q lcl|NC_011269. 181 RVVDSSAQPGVGALPNE----ITIAGSHLMPDDLYTAVTYTDQRQLDSSRLLANPQEYRDLYRWDINTTGWAFKDSVVAG 256 (333) Q Consensus 181 r~~~ssA~p~vg~~~N~----i~i~~g~Lt~~~L~~a~t~v~~~~L~at~il~~~~~~~Di~gw~~N~~~~~~~DpV~~~ 256 (333) .+..|+ ++.|. -+..++.++-++|.++...+..-++....++||++.|..|+. + +|+- + T Consensus 152 ----~~~~~~--gi~~~~~~~~~~~~~~~t~~~i~~~~~~l~~~~~~~~~~vmn~~~~~~L~~--l-------~d~~--G 214 (324) T protein:vir:96 152 ----NNPFGK--SIAQSIEKTNKVIKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRK--I-------VDPE--T 214 (324) T ss_pred ----CCCcCc--cccccccccceeccccccHHHHHHHHHhhhhccCCCCEEEEcHHHHHHHHH--h-------hccC--C Confidence 111111 11121 134567889999999999999999999999999999998876 2 2322 2 Q ss_pred ceeeeee----cccccc----eeeecCCeEEEeeChhhhcccccccCceeccccch----------------hhhcccee Q lcl|NC_011269. 257 ERIVQFG----EFQIGK----SIIIPRGTVYLTPEPEFLGVFPVMYSLDVEEDNKV----------------ERFNKGWV 312 (333) Q Consensus 257 e~il~~G----~fgi~~----skvlprgeiyvvadpE~~G~~pvR~~L~s~p~D~~----------------er~~kGWv 312 (333) .-+++.| +.|.+. +.-++.|.+| ..|... -.+-.|+++..+-.+.. ++-...|. T Consensus 215 ~~~~~~~~~~~l~G~PV~~~~~~~~~~~~~~-~gd~~~-~~~g~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~d~~~~r 292 (324) T protein:vir:96 215 KERIYDRNSDSLDGLPVVNLKSSNLKRGELI-TGDFDK-LIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALR 292 (324) T ss_pred CeeecCCCCCcccceeeEeeCCCCCCcceEE-EEecce-EEEEEecCcEEEEeecccccccccccccchhhhhcCcEEEE Confidence 2222222 244442 2335666665 446664 35778899888766543 22346667 Q ss_pred hhhhhhhhhhccceEEEEecC Q lcl|NC_011269. 
313 MDELVGMAILNPRGIVILRKA 333 (333) Q Consensus 313 m~E~~g~~i~N~~siv~~~~~ 333 (333) ..+-++.++.||-+++.|.++ T Consensus 293 ~~~r~d~~v~~~~A~~~l~~a 313 (324) T protein:vir:96 293 ATMHVALHIADDKAFAKLVPA 313 (324) T ss_pred EEEEEccEEecccceEEEecc Confidence 777789999999999999998 No 15 >protein:vir:93742 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1475 # MgeName: 55 # Cross-refs: genbank:acc:YP_240459;genbank:gi:66396126;genbank:GeneID:5133511 Probab=98.63 E-value=2.9e-09 Score=67.36 Aligned_cols=259 Identities=17% Similarity=0.167 Sum_probs=162.8 Q ss_pred hcchhcchHHHHHHHHHHhcCchhHHHHHHHHHHHHHHHHHhhhhhhhhhhhcc-ccC--CCcceeecCCCCccceEEEE Q lcl|NC_011269. 32 MGGRKLSAREKQAKLAHILSDKVGGIQRLGQSMIGPIQLQLRYQGILRNVLLED-TLT--PGVPIQYDVLDDLGQAYMLH 108 (333) Q Consensus 32 ~~~~~ls~ee~~~Lm~~Al~~~Eg~~~aLg~~mA~pI~~q~~rqGi~RklL~~~-TL~--~G~~p~y~v~~~v~~a~~~~ 108 (333) |.- .+| .+++.+ -|| .+++.+..++...+...+++ +.+ +|+ +|....+|+....+.+- .. T Consensus 1 ma~----~~T---~~~~~i-iPe----v~~~~v~~~~~~~~~~~~~~----~~~~~l~g~~G~tv~ip~~~~~g~~~-~~ 63 (274) T protein:vir:93 1 MPQ----GIT---KTSNQI-IPE----VLAPMMQAQLEKKLRFASFA----EVDSTLQGQPGDTLTFPAFVYSGDAQ-VV 63 (274) T ss_pred CCc----cce---ehhhee-chH----HHHHHHHHHHHhhhhhcccc----cccccccCCCCCEEEEEeeccCCCcc-cc Confidence 221 111 111211 133 23444444555444444443 222 222 36656666543333222 34 Q ss_pred cCCCcccceeecCceeeccceeeeccccccHHHhhhhcchhHHHHHHHHHHHHHHHhhhHHHHHHhhhhhhhhhhccccc Q lcl|NC_011269. 109 GNEGEIRITPFEGKRIEVQLFRIASFPQIKKEDLYYLRSNIVEYTQDMTKQAIMRQEDSRLVTLLEAAAVSYRVVDSSAQ 188 (333) Q Consensus 109 ~~~G~i~~Q~i~~~ri~~P~f~Ivs~P~V~~~dl~~~~~~vle~~q~~A~qaIM~qED~~~~slle~~a~~~r~~~ssA~ 188 (333) ....++..+.+.-...++.-.+.-.--.+.-++..+..+|.+.++-+++..++-+..|..+++.+..+. 
T Consensus 64 ~eg~~i~~~~it~~~~~~~i~~~~~~~~i~D~~~~~~~~d~~~~~~~~~~~~~a~~~d~~~~~~~~~a~----------- 132 (274) T protein:vir:93 64 AEGEKIPTDILETKKREAKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAK----------- 132 (274) T ss_pred cCCCcccccccccceeEEEeeeecccccccHHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHhccc----------- Confidence 444557777666555555544444333455566677788999999999999999999999998885543 Q ss_pred ccccCCCcceEEeeccccHHHHHHHHHHHHhhCCccceEEechhhhhhhhhcCCCchhhhHHhhhhhcceeeeeec---- Q lcl|NC_011269. 189 PGVGALPNEITIAGSHLMPDDLYTAVTYTDQRQLDSSRLLANPQEYRDLYRWDINTTGWAFKDSVVAGERIVQFGE---- 264 (333) Q Consensus 189 p~vg~~~N~i~i~~g~Lt~~~L~~a~t~v~~~~L~at~il~~~~~~~Di~gw~~N~~~~~~~DpV~~~e~il~~G~---- 264 (333) .++.+.-++.+.+..|.+...+.+-.-..++++|..|..|+-=..++|- .++-. ++-++..|. T Consensus 133 ---------~~~~~~~~~~d~i~dA~~~l~d~~~~~~~ivv~p~~~~~L~k~~~~~f~---~~s~~-g~~~~~~G~ig~~ 199 (274) T protein:vir:93 133 ---------LTVNADITKLNGLQSAIDKFNDEDLEPMVLFINPLDAGKLRGDASTNFT---RATEL-GDDIIVKGAFGEA 199 (274) T ss_pred ---------ccccccccCHHHHHHHHHHhhhccCCccEEEeCHHHHHHHHhhhhhccc---ccccc-cccceeeccccee Confidence 2223344578999999999999888889999999999999861111221 11111 122333444 Q ss_pred --ccccceeeecCCeEEEeeChhhhcccccccCceeccccchhhhccceehhhhhhhhhhccceEEEEecC Q lcl|NC_011269. 
265 --FQIGKSIIIPRGTVYLTPEPEFLGVFPVMYSLDVEEDNKVERFNKGWVMDELVGMAILNPRGIVILRKA 333 (333) Q Consensus 265 --fgi~~skvlprgeiyvvadpE~~G~~pvR~~L~s~p~D~~er~~kGWvm~E~~g~~i~N~~siv~~~~~ 333 (333) |.|..+..+|-|+.|+...+ .+|.+ ...++.+|......++...=.....+|..+.||.++|.+.|+ T Consensus 200 ~G~~Vi~s~~~p~~t~~l~~~g-ai~~~-~~~~~~vE~~Rd~~~~~d~i~~~~~y~~~~~~~~~~v~~t~~ 268 (274) T protein:vir:93 200 LGAIIVRTNKLEAGTAILAKKG-AVKLI-LKRDFFLEVARDASTKTTALYSDKHYVAYLYDESKAVKITKG 268 (274) T ss_pred cCeeEEEcCCCCcceEEEEeCC-eEEEE-ecCCcccccccchhhcccEEEEEEEEEEEEEcCCceEEEeeC Confidence 34778889999999987644 46644 456677765556666666667778889999999999999999 No 16 >protein:vir:80930 Length: 278 # NCBI annotation: Cps # Family: family:all:522 # MgeID: mge:1886 # MgeName: A500 # Cross-refs: genbank:acc:YP_001468392;genbank:gi:157324966;genbank:GeneID:5601363 Probab=98.63 E-value=1.6e-09 Score=68.85 Aligned_cols=263 Identities=15% Similarity=0.185 Sum_probs=147.6 Q ss_pred hcc--hhcchHHHHHHHHHHhcCchhHHHHHHHHHHHHHHHHHhhhhhhhhhhhccc-c--CCCcceeecCCCCccceEE Q lcl|NC_011269. 32 MGG--RKLSAREKQAKLAHILSDKVGGIQRLGQSMIGPIQLQLRYQGILRNVLLEDT-L--TPGVPIQYDVLDDLGQAYM 106 (333) Q Consensus 32 ~~~--~~ls~ee~~~Lm~~Al~~~Eg~~~aLg~~mA~pI~~q~~rqGi~RklL~~~T-L--~~G~~p~y~v~~~v~~a~~ 106 (333) |.- -+| ++.+ -|| .++..+ .++++..-....+...+. + .+|....+|+.+..+.+- T Consensus 1 Ma~~~T~~---------~~~i-iPe----v~s~~v----~~~~~~~~v~~~~~~~~~~l~g~~G~tv~ip~~~~~g~a~- 61 (278) T protein:vir:80 1 MADLTTKL---------ANLI-DPE----VMGPMI----SAKLPKAIKFGKIAPIDNSLEGQPGSEITVPKYKYIGDAQ- 61 (278) T ss_pred CCCcceeh---------hhee-cHH----HHHHHH----HHHHHHhhhhcccceecccccCCCCCEEEEeeeccCCcce- Confidence 221 111 1111 122 222223 333333333333332222 2 246655566544444333 Q ss_pred EEcCCCcccceeecCceeeccceeeeccccccHHHhhhhcchhHHHHHHHHHHHHHHHhhhHHHHHHhhhhhhhhhhccc Q lcl|NC_011269. 
107 LHGNEGEIRITPFEGKRIEVQLFRIASFPQIKKEDLYYLRSNIVEYTQDMTKQAIMRQEDSRLVTLLEAAAVSYRVVDSS 186 (333) Q Consensus 107 ~~~~~G~i~~Q~i~~~ri~~P~f~Ivs~P~V~~~dl~~~~~~vle~~q~~A~qaIM~qED~~~~slle~~a~~~r~~~ss 186 (333) ..++.+++..+.+.-...++.--+.-..-.++-.+..+..+|.+.++-+++..++-++.|..+++.+.++...+ + T Consensus 62 ~~~~g~~i~~~~lt~~~~~~~i~~~~~a~~v~D~~~~~~~~d~~~~~~~~~a~~~a~~~d~~l~~~l~~a~~~~-----~ 136 (278) T protein:vir:80 62 DVAEGAAIDYSALETESVKHGIKKAGKGVKLTDESVLSGYGDPVEEAQKQIRMAIASKVDNDILEEALTTTLEV-----K 136 (278) T ss_pred eecCCCcCcccccccceeeEeeehhhccccccHHHHhhccccHHHHHHHHHHHHHHHHHHHHHHHHHhcccccc-----c Confidence 24555567777666555555433322222344455566778999999999999999999999999997765322 1 Q ss_pred ccccccCCCcceEEeeccccHHHHHHHHHHHHhhCCccc-eEEechhhhhhhhhcCCCchhhhHHhhhhhcceeeeeec- Q lcl|NC_011269. 187 AQPGVGALPNEITIAGSHLMPDDLYTAVTYTDQRQLDSS-RLLANPQEYRDLYRWDINTTGWAFKDSVVAGERIVQFGE- 264 (333) Q Consensus 187 A~p~vg~~~N~i~i~~g~Lt~~~L~~a~t~v~~~~L~at-~il~~~~~~~Di~gw~~N~~~~~~~DpV~~~e~il~~G~- 264 (333) ..+++ .+..-..+.|..+.+..++.+.+.. .++++|..|..|+-=...+| . .++-. ++-++..|. T Consensus 137 ~~~t~---------~~~~~~~~~~~da~~~l~~~~~~~~~~ivv~p~~~~~L~k~~~~~~--~-~~~~~-g~~~~~~G~i 203 (278) T protein:vir:80 137 GAINI---------GLIDKIENTFTDAPDAIEDESITTTGVLFLNYKDTAKLREEAAGSW--T-KASQL-GDDLLVKGAF 203 (278) T ss_pred ccccc---------chhhhHHHHHHHHHHhhcccCCCcccEEEECHHHHHHHHhhhhhhc--c-ccccc-cccceeeccc Confidence 11111 1111123566777777777776654 48899999999986111122 1 11111 111333444 Q ss_pred -----ccccceeeecCCeEEEeeChhhhcccccccCceeccccchhhhccceehhhhhhhhhhccceEEEEecC Q lcl|NC_011269. 265 -----FQIGKSIIIPRGTVYLTPEPEFLGVFPVMYSLDVEEDNKVERFNKGWVMDELVGMAILNPRGIVILRKA 333 (333) Q Consensus 265 -----fgi~~skvlprgeiyvvadpE~~G~~pvR~~L~s~p~D~~er~~kGWvm~E~~g~~i~N~~siv~~~~~ 333 (333) |.|..+..+|-|+.|++.. --+|.+ ...++.+|......++..-=.....+|..+.||.++|.++|. 
T Consensus 204 g~~~G~~Vi~s~~~p~~t~~l~~~-gAi~~~-~~~~~~vE~~Rd~~~~~d~i~~~~~yg~~v~~~~~~v~it~~ 275 (278) T protein:vir:80 204 GELLGWEIVRTKKLADGNALAVKA-GALKTF-LKRNLLAESGRDMDHKLTKFNADQHYAVALVDETKAVKVVPV 275 (278) T ss_pred eeecceeEEEcCCCCcceEEEEec-cceeee-ecCCcccccccchhhccceeeeeeEEEEEEEcCcceEEEeec Confidence 4477899999999999863 335544 344667654434444443333467789999999999999999 No 17 >protein:vir:97053 Length: 390 # NCBI annotation: putative head protein # Family: family:all:585 # MgeID: mge:1653 # MgeName: OP1 # Cross-refs: genbank:acc:YP_453565;genbank:gi:84662600;genbank:GeneID:5142468 Probab=98.62 E-value=2.8e-09 Score=67.45 Aligned_cols=309 Identities=11% Similarity=0.077 Sum_probs=189.9 Q ss_pred CcccchhhhhhhhhhcccchHHHHHHHHHHhhcchhcchHHHHHHHHHHhcC--chhHHHHHHHHHHHHHHHHHhhhhhh Q lcl|NC_011269. 1 MTLPVAVGSGLGRFAKASDDYVADIVEAKQRMGGRKLSAREKQAKLAHILSD--KVGGIQRLGQSMIGPIQLQLRYQGIL 78 (333) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ls~ee~~~Lm~~Al~~--~Eg~~~aLg~~mA~pI~~q~~rqGi~ 78 (333) -..+.... ..+...+.++.+-+-....+ .++.....+.++.......+ ..++ -.+-....+.|-+.++..... T Consensus 69 ~~~~~~~~-~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~g-~lip~~~~~~ii~~~~~~~~i 143 (390) T protein:vir:97 69 AGGDVQHV-SVGDMFVASEQFQASTGRWN---DRSARATMNIKAALNTASTDAAGSAG-ALTTPNRLPGFITPPDARLTV 143 (390) T ss_pred cccccccc-cchhhhhhhHHHHHHHHHhh---hhhhhhhhHHHHHHHhhhcccccccc-cccchhhhHHHHHHHhhhhhh Confidence 11111111 11222222222222111111 11122223334444444332 2233 245566777888889999999 Q ss_pred hhhhhccccCCCcceeecCCCCccceEEEEcCCCcccceeecCceeeccceeeeccccccHHHhhhhcchhHHHHHHHHH Q lcl|NC_011269. 79 RNVLLEDTLTPGVPIQYDVLDDLGQAYMLHGNEGEIRITPFEGKRIEVQLFRIASFPQIKKEDLYYLRSNIVEYTQDMTK 158 (333) Q Consensus 79 RklL~~~TL~~G~~p~y~v~~~v~~a~~~~~~~G~i~~Q~i~~~ri~~P~f~Ivs~P~V~~~dl~~~~~~vle~~q~~A~ 158 (333) +++....+++.|. -.|++.+.....+.|++.-++++.....=+.+++.-.++..+..|..+=|+.. .++..+...+.. 
T Consensus 144 ~~~~~~~~~~~~~-~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~i~~~~~k~~~~~~is~ell~ds-~~l~~~i~~~la 221 (390) T protein:vir:97 144 RDLIGSGRTDSAL-IEYVQETGFVNNAAIVAEGALKPESSLKFAKKTDTTHVIAHTMKATRQILSDA-PQLASYMNNRLI 221 (390) T ss_pred HhhcceeeccCCc-eEEEEEecCCcceeeecCCccccccccceeEEEEeeeeEEEeehhhHHHHHhH-HHHHHHHHHHHH Confidence 9998888887664 33454434333456777777777776666888888889999998998867654 688888899999 Q ss_pred HHHHHHhhhHHHHHHhhhhhhhhhhcccccccccCC-----CcceEEeeccccHHHHHHHHHHHHhhCCccceEEechhh Q lcl|NC_011269. 159 QAIMRQEDSRLVTLLEAAAVSYRVVDSSAQPGVGAL-----PNEITIAGSHLMPDDLYTAVTYTDQRQLDSSRLLANPQE 233 (333) Q Consensus 159 qaIM~qED~~~~slle~~a~~~r~~~ssA~p~vg~~-----~N~i~i~~g~Lt~~~L~~a~t~v~~~~L~at~il~~~~~ 233 (333) .+|-+.+|.-+++ +.. ++..-.|-+ .+..+..++..+-+++..+...++.-+.....++||++- T Consensus 222 ~a~~~~~d~a~l~---G~g--------~~~~p~Gi~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~v~n~~~ 290 (390) T protein:vir:97 222 RGLKVKEDAEILR---GTG--------ANDGLLGLIPQATTYAAPTTIAGATRVDQLRLAMLQASLAEYPASGIVINPID 290 (390) T ss_pred HHHHHHHHHHHhh---cCC--------CCccccceeeccccccccccccccchHHHHHHHHHhhccccCCCCEEEEcHHH Confidence 9999999965542 211 111111222 122233455666778899999999999999999999999 Q ss_pred hhhhhhcCCCchh-hhHHhhhhhcceeeeeeccccc--ceeeecCCeEEEeeChhhhcccccccCceeccccchhhh--- Q lcl|NC_011269. 234 YRDLYRWDINTTG-WAFKDSVVAGERIVQFGEFQIG--KSIIIPRGTVYLTPEPEFLGVFPVMYSLDVEEDNKVERF--- 307 (333) Q Consensus 234 ~~Di~gw~~N~~~-~~~~DpV~~~e~il~~G~fgi~--~skvlprgeiyvvadpE~~G~~pvR~~L~s~p~D~~er~--- 307 (333) |..|...- ++.| |.+-|+..... 
.=++|++ .+--+|.|++++ -|......+-+|.|+.++..+...-| T Consensus 291 ~~~L~~lk-d~~G~~l~~~~~~~~~----~~l~G~pV~~~~~~~~~~~~~-gd~~~~~~~~~~~~~~i~~~~~~~~f~~~ 364 (390) T protein:vir:97 291 WAAIELAK-DANNQYLIGNARGTLT----PTLWGLPVVATQAMAPGEFLV-GAFDLAAQIFDQWDARVEIGYVNDDFQRN 364 (390) T ss_pred HHHHHHhh-cCCCceeecCccCCCC----ceecceeeEEcCCCCCCcEEE-EeccceEEEEEecceEEEEeecccccccC Confidence 99988632 2221 22223221111 1124544 555689999865 55554566788999998875532223 Q ss_pred ccceehhhhhhhhhhccceEEEEecC Q lcl|NC_011269. 308 NKGWVMDELVGMAILNPRGIVILRKA 333 (333) Q Consensus 308 ~kGWvm~E~~g~~i~N~~siv~~~~~ 333 (333) ..+|...+-++..+.+|.++|..--| T Consensus 365 ~~~~r~~~r~d~~v~~~~a~v~~~~a 390 (390) T protein:vir:97 365 MVTVLAEERLALVVYRPEALITGSFA 390 (390) T ss_pred cEEEEEEEeeccEEeccccEEEEEeC Confidence 44688889999999999999999999 No 18 >protein:vir:95763 Length: 297 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1578 # MgeName: SMP # Cross-refs: genbank:acc:YP_950590;genbank:gi:119953785;genbank:GeneID:5076833 Probab=98.62 E-value=2e-09 Score=68.32 Aligned_cols=268 Identities=11% Similarity=0.064 Sum_probs=183.3 Q ss_pred hcchhcchHHHHHHHHHHhcCchhHHHHHHHHHHHHHHHHHhhhhhhhhhhhccccCCCcceeecCCCCccceEEEEcCC Q lcl|NC_011269. 32 MGGRKLSAREKQAKLAHILSDKVGGIQRLGQSMIGPIQLQLRYQGILRNVLLEDTLTPGVPIQYDVLDDLGQAYMLHGNE 111 (333) Q Consensus 32 ~~~~~ls~ee~~~Lm~~Al~~~Eg~~~aLg~~mA~pI~~q~~rqGi~RklL~~~TL~~G~~p~y~v~~~v~~a~~~~~~~ 111 (333) |-..-+.+.. .+.+..+.. .+-+.+++.|-+.+..+...+++.+..+++.+..-.+++...-. .+-|++-- T Consensus 1 m~~~~~~~~~-------~~~t~~~~~-lvP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~~~~~~~~~-~a~~v~Eg 71 (297) T protein:vir:95 1 MTVQTFNPEN-------VLVSQKKDG-TLHKEFTDIIMKEVAQNSLVMQLGQYQEMEGEQEKTVYVQTDGI-SAYWVNET 71 (297) T ss_pred CCcccccccc-------ccccCCCcc-eechhHHHHHHHHHHhhchhhhhcceeecCCCccEEEEEEcCCc-eeEEeecC Confidence 4333333321 122333331 56778888888889999999999988887766555555544333 44567766 Q ss_pred CcccceeecCceeeccceeeeccccccHHHhhhhcchhHHHHHHHHHHHHHHHhhhHHHHHHhhhhhhhhhhcccccccc Q lcl|NC_011269. 
112 GEIRITPFEGKRIEVQLFRIASFPQIKKEDLYYLRSNIVEYTQDMTKQAIMRQEDSRLVTLLEAAAVSYRVVDSSAQPGV 191 (333) Q Consensus 112 G~i~~Q~i~~~ri~~P~f~Ivs~P~V~~~dl~~~~~~vle~~q~~A~qaIM~qED~~~~slle~~a~~~r~~~ssA~p~v 191 (333) +++++....=+.+++...++.....|..+-|+....|+.++..+...++|-+.+|.-+++=..+ ..| . T Consensus 72 ~~~~~~~~~f~~v~l~~~k~~~~~~is~ell~ds~~~l~~~i~~~la~ai~~~~d~a~l~G~g~-----------~~~-~ 139 (297) T protein:vir:95 72 EKIKTDKPEVVPVTLKAHKLGIILVTSREALNYTWKKFFEDMKPQIVEAFYKKIDEAGLLGHDT-----------PFA-N 139 (297) T ss_pred ccccccccceeEEEEeeEEEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHHHHHHhcccCC-----------ccc-c Confidence 6777766666788888999999999999999999999999999999999999999777632111 001 1 Q ss_pred cCCCc---ceEEeeccccHHHHHHHHHHHHhhCCccceEEechhhhhhhhhcCCCchhhhHHhhhhhcceeeee---ecc Q lcl|NC_011269. 192 GALPN---EITIAGSHLMPDDLYTAVTYTDQRQLDSSRLLANPQEYRDLYRWDINTTGWAFKDSVVAGERIVQF---GEF 265 (333) Q Consensus 192 g~~~N---~i~i~~g~Lt~~~L~~a~t~v~~~~L~at~il~~~~~~~Di~gw~~N~~~~~~~DpV~~~e~il~~---G~f 265 (333) |.+.. .-+..++.++-+++-+++..+.+-++....++||++.|..++. +- |. .+..+.+. .++ T Consensus 140 gi~~~~~~~~~~~~~~~t~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~L~~--l~-------d~--~G~~i~~~~~~~l~ 208 (297) T protein:vir:95 140 SVAKAAKDANKVIGGPINYDNILKLQDALYDADVEPNAFVSKIQNRSALRE--AR-------DG--NKVSIYDKAANTID 208 (297) T ss_pred cccccccccceecccccCHHHHHHHHHHhhhccCCcCEEEEcHHHHHHHHH--hh-------cc--CCceeecCCCCccc Confidence 11111 1245567889999999999999999999999999999998875 21 11 12222221 134 Q ss_pred cccc----eeeecCCeEEEeeChhhhcccccccCceeccccchh-------------hh---ccceehhhhhhhhhhccc Q lcl|NC_011269. 266 QIGK----SIIIPRGTVYLTPEPEFLGVFPVMYSLDVEEDNKVE-------------RF---NKGWVMDELVGMAILNPR 325 (333) Q Consensus 266 gi~~----skvlprgeiyvvadpE~~G~~pvR~~L~s~p~D~~e-------------r~---~kGWvm~E~~g~~i~N~~ 325 (333) |++. +..++.|++++ .|..+ ..+-.++++..+-.+... .| ...+-+.+-++.++.||. 
T Consensus 209 G~Pv~~~~~~~~~~~~~~~-gd~s~-~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~~ 286 (297) T protein:vir:95 209 GITTVDLKSARFEKGDLLA-GDFDN-LIYGVPYNITYKISEEGQISTITNADGTPINLFEQEMIAIRATMDIAVMITKTD 286 (297) T ss_pred ceeeEeecCCCCCCceEEE-Eeccc-EEEEEecCeEEEEeeccccccccccCccchhhhhcCcEEEEEEEEeccEeeccc Confidence 5442 33467787664 56653 456778888776554321 13 334456678999999999 Q ss_pred eEEEEecC Q lcl|NC_011269. 326 GIVILRKA 333 (333) Q Consensus 326 siv~~~~~ 333 (333) +++.|.+| T Consensus 287 a~~~l~~a 294 (297) T protein:vir:95 287 AFAKLTPA 294 (297) T ss_pred ceEEEeec Confidence 99999999 No 19 >protein:vir:100247 Length: 425 # NCBI annotation: gp76 # Family: family:all:21 # MgeID: mge:1619 # MgeName: Bcep176 # Cross-refs: genbank:acc:YP_355412;genbank:gi:77864702;genbank:GeneID:3725969 Probab=98.62 E-value=1.6e-09 Score=68.78 Aligned_cols=319 Identities=16% Similarity=0.094 Sum_probs=181.6 Q ss_pred Ccccchhhhhhhhhhcc----------cchHHHHHHHHHHhhcc-hhcc-hHHHHHH--------HHHHh---cCchhHH Q lcl|NC_011269. 1 MTLPVAVGSGLGRFAKA----------SDDYVADIVEAKQRMGG-RKLS-AREKQAK--------LAHIL---SDKVGGI 57 (333) Q Consensus 1 ~~~~~~~~~~~~~~~~~----------~~~~~~~~~~~~~~~~~-~~ls-~ee~~~L--------m~~Al---~~~Eg~~ 57 (333) ...- -.-+-+..+. -+++....-..+....+ ..+. .+.+.++ ++.++ .+..|+. T Consensus 64 ~~~~---~e~~~~~~~~~~ei~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~af~~~l~~~e~~~al~~~t~~~gG~ 140 (425) T protein:vir:10 64 GLPT---SDALAKVDKVSADLEALQAAVDEANIKIAAAQMGANGVKPLRDPEYTEAFKAHVKRGDVQAALNKGEDSEGGY 140 (425) T ss_pred hhcc---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccccHHHHHHHHHHhhhhhhHHHhhcCcCCCCce Confidence 0000 0000000010 01111100000000000 0011 1111111 11222 2233332 Q ss_pred HHHHHHHHHHHHHHHhhhhhhhhhhhccccCCCcceeecCCCCccceEEEEcCCCcccceee-cCceeeccceeeecccc Q lcl|NC_011269. 58 QRLGQSMIGPIQLQLRYQGILRNVLLEDTLTPGVPIQYDVLDDLGQAYMLHGNEGEIRITPF-EGKRIEVQLFRIASFPQ 136 (333) Q Consensus 58 ~aLg~~mA~pI~~q~~rqGi~RklL~~~TL~~G~~p~y~v~~~v~~a~~~~~~~G~i~~Q~i-~~~ri~~P~f~Ivs~P~ 136 (333) .+-+.+...|.+.++.....+++....++..|.. 
.|++...-.. +-|.+-.++++.... .=+.|++.-.++...+. T Consensus 141 -lvP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~-~~~~~~~~~~-a~wv~E~~~~~~~~~~~f~~v~~~~~k~~~~i~ 217 (425) T protein:vir:10 141 -LTPIEWDRTITNKLVLISPMRQLCRVQPVSKAGF-SKLFNMGGTT-SGWVGEASQRPQTNAATFQPLSFASGEIYANPA 217 (425) T ss_pred -eccHhHHHHHHHHHHhhhhhhhhceeeeccCCce-EEEEEcCCcc-eeeeccccccccccccccceeeeeheeeEeehH Confidence 4556777888889999999999988888776654 4444334333 346676666776543 23788999999999999 Q ss_pred ccHHHhhhhcchhHHHHHHHHHHHHHHHhhhHHHH---------HHhhhhhhhhhhcccccccccCCCcceEEeeccccH Q lcl|NC_011269. 137 IKKEDLYYLRSNIVEYTQDMTKQAIMRQEDSRLVT---------LLEAAAVSYRVVDSSAQPGVGALPNEITIAGSHLMP 207 (333) Q Consensus 137 V~~~dl~~~~~~vle~~q~~A~qaIM~qED~~~~s---------lle~~a~~~r~~~ssA~p~vg~~~N~i~i~~g~Lt~ 207 (333) |..+-|.....++..+..++..++|-..+|.-+++ +|....... .++....|...-..+...+.++- T Consensus 218 iS~ell~ds~~~l~~~i~~~la~ai~~~~d~~~l~G~G~~~p~Gil~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~ 293 (425) T protein:vir:10 218 ATQQILDDAEIDLESWLATEVQTEFAKQEGKAFLAGDGTNKPNGLLTYIAGGA----NAAKHPFGAIEVVNSGAAADITS 293 (425) T ss_pred hHHHHHhcchhHHHHHHHHHHHHHHHHHHHhhhhcccCCCCcceeeecccccc----ccccccccccccccccccccccH Confidence 99999999999999999999999999999976554 222211100 11111111222223445677888 Q ss_pred HHHHHHHHHHHhhCCccceEEechhhhhhhhhcCCCchh-hhHHhhhhhcceeeeeeccccc--ceeeecC----CeEEE Q lcl|NC_011269. 
208 DDLYTAVTYTDQRQLDSSRLLANPQEYRDLYRWDINTTG-WAFKDSVVAGERIVQFGEFQIG--KSIIIPR----GTVYL 280 (333) Q Consensus 208 ~~L~~a~t~v~~~~L~at~il~~~~~~~Di~gw~~N~~~-~~~~DpV~~~e~il~~G~fgi~--~skvlpr----geiyv 280 (333) ++|-.++..+..-.......+||+..|.-|..=- |..| |.+.+++..+.- +=++|.+ .+--||- ...++ T Consensus 294 d~l~~l~~~l~~~~~~~a~~vmn~~~~~~L~~lk-D~~G~~l~~~~~~~g~~---~~l~G~PV~~~~~~p~~~~~~~~i~ 369 (425) T protein:vir:10 294 DGIIDLVYDLPSAFTGNARFAMNRNTQRQVRKLK-DGQGNYLWQPSYVAGQP---ATLAGYPVTEVPDMPDVAANSTPIL 369 (425) T ss_pred HHHHHHHhhhhhhhccCCEEEEchHHHHHHHHhh-cCCCceeeccCccCCCC---ceecceeeEEecCcCCccCCccEEE Confidence 8888887766655555667899999998876511 1111 222222222110 0135544 2223442 22334 Q ss_pred eeChhhhcccccccCceeccccchhhhccceehhhhhhhhhhccceEEEEecC Q lcl|NC_011269. 281 TPEPEFLGVFPVMYSLDVEEDNKVERFNKGWVMDELVGMAILNPRGIVILRKA 333 (333) Q Consensus 281 vadpE~~G~~pvR~~L~s~p~D~~er~~kGWvm~E~~g~~i~N~~siv~~~~~ 333 (333) +.|-...=.+-.|.|+++...++.++-..+|..++-++..+.||-+++++..+ T Consensus 370 ~Gd~~~~~~i~~~~~~~v~~d~~~~~~~~~~~~~~r~d~~v~~~~A~~~l~~~ 422 (425) T protein:vir:10 370 FGDFQQTYLIIDRIGVRVLRDPYTAKPYVLFYTTKRVGGGLLNPEPMRAMKVA 422 (425) T ss_pred EEehhccEEEEEecceEEEecccccCCcEEEEEEEEeccEeecccceEEEEee Confidence 45544333467789999987778777778899999999999999999999887 No 20 >protein:vir:41 Length: 299 # NCBI annotation: major capsid protein # Family: family:all:507 # MgeID: mge:2 # MgeName: A118 # Cross-refs: genbank:acc:NP_463467;swissprot:trembl:q9t1b7;genbank:gi:16798789;uniprot:Q9T1B7;genbank:GeneID:922353 Probab=98.62 E-value=8.3e-10 Score=70.37 Aligned_cols=266 Identities=11% Similarity=0.081 Sum_probs=185.7 Q ss_pred hcchhcchHHHHHHHHHHhcCchhHHHHHHHHHHHHHHHHHhhhhhhhhhhhccccCCCcceeecCCCCccceEEEEcCC Q lcl|NC_011269. 32 MGGRKLSAREKQAKLAHILSDKVGGIQRLGQSMIGPIQLQLRYQGILRNVLLEDTLTPGVPIQYDVLDDLGQAYMLHGNE 111 (333) Q Consensus 32 ~~~~~ls~ee~~~Lm~~Al~~~Eg~~~aLg~~mA~pI~~q~~rqGi~RklL~~~TL~~G~~p~y~v~~~v~~a~~~~~~~ 111 (333) ||-...+--+ +..|.. .+-+.+++.|.+.+..+.++|++.+..+++.|. ..+++..... +-|++.. 
T Consensus 1 ~g~~a~~~~~----------~~~~~~-~iP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~-~~~~~~~~~~--a~~v~E~ 66 (299) T protein:vir:41 1 MGFNPDTTTM----------QSAKTG-SIPINISEQIITGVKNGSAAMKLAKAVPMTKPE-EEFTFMSGVG--AFWVDEA 66 (299) T ss_pred CCcCCCcccc----------cCCCce-ecchhHHHHHHHHHHhcchhhhhceeeecCCCc-EEEEEEcCCc--eeeeecC Confidence 7765443322 112221 467778889999999999999999888876554 4556655443 3457777 Q ss_pred CcccceeecCceeeccceeeeccccccHHHhhhhcchhHHHHHHHHHHHHHHHhhhHHHH---------HHhhhhhhhhh Q lcl|NC_011269. 112 GEIRITPFEGKRIEVQLFRIASFPQIKKEDLYYLRSNIVEYTQDMTKQAIMRQEDSRLVT---------LLEAAAVSYRV 182 (333) Q Consensus 112 G~i~~Q~i~~~ri~~P~f~Ivs~P~V~~~dl~~~~~~vle~~q~~A~qaIM~qED~~~~s---------lle~~a~~~r~ 182 (333) ++++++...=+.|++...++..+..|..+-|+....|+..+..+...++|-+.+|..+++ ++.++. T Consensus 67 ~~~~~~~~~f~~v~l~~~k~~~~~~is~ell~ds~~~~~~~i~~~l~~a~~~~~d~a~l~G~g~~~~~gil~~~~----- 141 (299) T protein:vir:41 67 ERIQTSKPTFTKAKMRSKKMGVIIPTTKENLNYSVTNFFSLMQAEIVEAFYKKFDQAVFTGVESPYNWNILKSAT----- 141 (299) T ss_pred ccccccccceeEEEEeeEEEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHHHHHHhhcccCccccccccccc----- Confidence 788888777789999999999999999999999999999999999999999999976653 221111 Q ss_pred hcccccccccCCCcceEEeeccccHHHHHHHHHHHHhhCCccceEEechhhhhhhhhcCCCchhhhHHhhhhhcceeeee Q lcl|NC_011269. 183 VDSSAQPGVGALPNEITIAGSHLMPDDLYTAVTYTDQRQLDSSRLLANPQEYRDLYRWDINTTGWAFKDSVVAGERIVQF 262 (333) Q Consensus 183 ~~ssA~p~vg~~~N~i~i~~g~Lt~~~L~~a~t~v~~~~L~at~il~~~~~~~Di~gw~~N~~~~~~~DpV~~~e~il~~ 262 (333) .++ | +..++..+-++|.+++..+.+.++....++||++.|..|+.=--..--|.+.+++..+. . 
T Consensus 142 ------~~~----~--~~~~~~~~~~~l~~~~~~l~~~~~~~~~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~~~----~ 205 (299) T protein:vir:41 142 ------DAS----N--LVEETANKYDDLNEAIGLIEAEDLEPNGIATIRKQRVKYRSTKDGNGMPIFNTATSNGV----D 205 (299) T ss_pred ------ccc----e--eeccccccHHHHHHHHHhhhcccCCcCEEEEcHHHHHHHHHhhccCCceeecCCcCCCC----c Confidence 000 1 23456677899999999999999999999999999999986211111123333332221 1 Q ss_pred eccccc--ceeeecCCe---EEEeeChhhhcccccccCceeccccchhh-------------h---ccceehhhhhhhhh Q lcl|NC_011269. 263 GEFQIG--KSIIIPRGT---VYLTPEPEFLGVFPVMYSLDVEEDNKVER-------------F---NKGWVMDELVGMAI 321 (333) Q Consensus 263 G~fgi~--~skvlprge---iyvvadpE~~G~~pvR~~L~s~p~D~~er-------------~---~kGWvm~E~~g~~i 321 (333) -++|.+ .+-.+|-|. ++++-|..+. .+-+|+++.++-.+.... | ...+-+.+-+++.+ T Consensus 206 ~l~G~PV~~~~~~~~~~~~~~~~~gdfs~~-~i~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v 284 (299) T protein:vir:41 206 DVLGLPIAYTPKYTFGDKDISELVGDWNQA-YYGILRGVEYEILTEATLTTVADETGKPLNLAERDMAAIKATFEVGFMV 284 (299) T ss_pred eecceeeEEecccCCCCCceEEEEEecccE-EEEEecCcEEEEeecccccccccccccchhhhhcCcEEEEEEEEeccEE Confidence 135544 566677665 4445555543 478888888876664321 2 34455678899999 Q ss_pred hccceEEEEecC Q lcl|NC_011269. 322 LNPRGIVILRKA 333 (333) Q Consensus 322 ~N~~siv~~~~~ 333 (333) .||-+|+.+..+ T Consensus 285 ~~~~A~~~l~~~ 296 (299) T protein:vir:41 285 VKDEAFSAVQPK 296 (299) T ss_pred ecccceEEEEec Confidence 999999999877 No 21 >protein:vir:10364 Length: 390 # NCBI annotation: head protein; major capsid subunit precursor # Family: family:all:585 # MgeID: mge:183 # MgeName: Xp10 # Cross-refs: genbank:acc:NP_858956;genbank:gi:32128421;genbank:GeneID:2648357 Probab=98.61 E-value=3.3e-09 Score=67.09 Aligned_cols=309 Identities=11% Similarity=0.082 Sum_probs=188.0 Q ss_pred CcccchhhhhhhhhhcccchHHHHHHHHHHhhcchhcchHHHHHHHHHHhcC---chhHHHHHHHHHHHHHHHHHhhhhh Q lcl|NC_011269. 
1 MTLPVAVGSGLGRFAKASDDYVADIVEAKQRMGGRKLSAREKQAKLAHILSD---KVGGIQRLGQSMIGPIQLQLRYQGI 77 (333) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ls~ee~~~Lm~~Al~~---~Eg~~~aLg~~mA~pI~~q~~rqGi 77 (333) ..-+.......+.....++++-+ ......+++.....+.++....+... ..|. -+-..+...|-+.++.... T Consensus 68 ~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~--~~~~~~~~~ii~~~~~~~~ 142 (390) T protein:vir:10 68 GAGGDVQHVSVGDLFVASEQFQA---SAGRWNDRSARATMNIKAALNTASTDAAGSAGA--LTTPNRLPGFITQPDARLT 142 (390) T ss_pred cccccccccchhhhhhhhHHHHH---HHHhhhhhhhhhhhHHHHHHHhhhccccccccc--ccchhHHHHHHHHHHhhch Confidence 11111111112222222222221 12222333333344444444444322 2232 3555667788888899999 Q ss_pred hhhhhhccccCCCcceeecCCCCccceEEEEcCCCcccceeecCceeeccceeeeccccccHHHhhhhcchhHHHHHHHH Q lcl|NC_011269. 78 LRNVLLEDTLTPGVPIQYDVLDDLGQAYMLHGNEGEIRITPFEGKRIEVQLFRIASFPQIKKEDLYYLRSNIVEYTQDMT 157 (333) Q Consensus 78 ~RklL~~~TL~~G~~p~y~v~~~v~~a~~~~~~~G~i~~Q~i~~~ri~~P~f~Ivs~P~V~~~dl~~~~~~vle~~q~~A 157 (333) ++++....+...|.. .|++.......+.|.+..+.++.....-+.|++.-..+.....|..+=|+. +.++..+...+. T Consensus 143 l~~~~~~~~~~~~~~-~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~i~~~~~k~~~~~~is~ell~d-~~~l~~~i~~~l 220 (390) T protein:vir:10 143 VRDLIGSGRTDSALI-EYVQETGFVNNAAIVAEGALKPESSLKFAKKTDTTHVIAHTMKATRQILSD-APQLASYMNNRL 220 (390) T ss_pred hhhhcceeeccCCce-EEEEEecCCcceeeecCCccccccccceeEEEEeeEEEEEeehhhHHHHHh-HHHHHHHHHHHH Confidence 999988877766642 333333333345677777777777777788888888899998898875655 468999999999 Q ss_pred HHHHHHHhhhHHHHHHhhhhhhhhhhcccccccccCCC-----cceEEeeccccHHHHHHHHHHHHhhCCccceEEechh Q lcl|NC_011269. 158 KQAIMRQEDSRLVTLLEAAAVSYRVVDSSAQPGVGALP-----NEITIAGSHLMPDDLYTAVTYTDQRQLDSSRLLANPQ 232 (333) Q Consensus 158 ~qaIM~qED~~~~slle~~a~~~r~~~ssA~p~vg~~~-----N~i~i~~g~Lt~~~L~~a~t~v~~~~L~at~il~~~~ 232 (333) .++|-+-+|.-+++ +..+ +..| .|.+. 
+..+-.++..+-+.+..+...++.-..+.+.++||++ T Consensus 221 ~~~~~~~~~~~il~---G~G~-------~~~p-~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~v~n~~ 289 (390) T protein:vir:10 221 IRGLKVKEDAEILR---GTGA-------NDGL-LGLIPQATTYAAPTTIAGATRVDQLRLAMLQASLAEYPASGIVINPI 289 (390) T ss_pred HHHHHHHHHHHHhh---cCCC-------Cccc-cccccccccccccccccccchHHHHHHHHHhhccccCCCCEEEEcHH Confidence 99999999965543 2211 1112 12221 1122234455667888999999999999999999999 Q ss_pred hhhhhhhcCCCch-hhhHHhhhhhcceeeeeeccccc--ceeeecCCeEEEeeChhhhcccccccCceeccccc---hhh Q lcl|NC_011269. 233 EYRDLYRWDINTT-GWAFKDSVVAGERIVQFGEFQIG--KSIIIPRGTVYLTPEPEFLGVFPVMYSLDVEEDNK---VER 306 (333) Q Consensus 233 ~~~Di~gw~~N~~-~~~~~DpV~~~e~il~~G~fgi~--~skvlprgeiyvvadpE~~G~~pvR~~L~s~p~D~---~er 306 (333) -|..|..=- +.. .|.+.+++.... .=++|++ .+--||.|++|+ -|.-..-.+-+|.|+.++-.+. .++ T Consensus 290 ~~~~L~~lk-d~~g~~l~~~~~~~~~----~~l~G~pv~~~~~~p~~~~~~-gdf~~~~~~~~~~~~~i~~~~~~~~~~~ 363 (390) T protein:vir:10 290 DWAAIELAK-DANNQYLIGNARGTLT----PTLWGLPVVATQAMAPGEFLV-GAFDLAAQIFDQWDARVEIGYVNDDFQR 363 (390) T ss_pred HHHHHHHhh-cCCCceeecCCcCcCC----ceecceeeEEcCCCCCCcEEE-EeccceEEEEEecceEEEEeeccccccc Confidence 998887511 111 122223322110 1135544 566789999875 5654444567899998875443 222 Q ss_pred hccceehhhhhhhhhhccceEEEEecC Q lcl|NC_011269. 307 FNKGWVMDELVGMAILNPRGIVILRKA 333 (333) Q Consensus 307 ~~kGWvm~E~~g~~i~N~~siv~~~~~ 333 (333) --.++...+-++.++.||-++|.+.-| T Consensus 364 ~~~~~r~~~r~d~~v~~~~a~~~~~~a 390 (390) T protein:vir:10 364 NMVTVLAEERLALVVYRPEALISGSFA 390 (390) T ss_pred CcEEEEEEEeeccEEeccccEEEEEeC Confidence 344666778999999999999999999 No 22 >protein:vir:96262 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1612 # MgeName: ROSA # Cross-refs: genbank:acc:YP_240311;genbank:gi:66395978;genbank:GeneID:5133339 Probab=98.60 E-value=1.7e-09 Score=68.67 Aligned_cols=260 Identities=18% Similarity=0.201 Sum_probs=161.5 Q ss_pred cchHHHHHHHHHHhcCchhHHHHHHHHHHHHHHHHHhhhhhhhhhhhccccC--CCcceeecCCCCccceEEEEcCCCcc Q lcl|NC_011269. 
37 LSAREKQAKLAHILSDKVGGIQRLGQSMIGPIQLQLRYQGILRNVLLEDTLT--PGVPIQYDVLDDLGQAYMLHGNEGEI 114 (333) Q Consensus 37 ls~ee~~~Lm~~Al~~~Eg~~~aLg~~mA~pI~~q~~rqGi~RklL~~~TL~--~G~~p~y~v~~~v~~a~~~~~~~G~i 114 (333) +++++ + .+++.+ .| +.+++.+..++...+...+++-. -.+|+ +|....+|+.+..+.+- ...+..++ T Consensus 1 m~~~~-T-~l~d~i-~P----ev~~~~v~~~~~~~l~~~~~~~~---~~~l~g~~G~tv~iP~~~~ig~a~-~~~~g~~i 69 (274) T protein:vir:96 1 MAQGM-T-KLTNQI-VP----EVLAPMMQAELEKKLRFASFAEI---DNTLVGQPGDTLTFPAFIYSGDAK-VVAEGEKI 69 (274) T ss_pred CCcce-e-ehhhee-ch----HHHHHHHHHHHHhhhhcccccee---cccccCCCCCEEEeeeecCCCccc-cccCCCcc Confidence 11110 0 122222 22 33455555555555555454311 12232 47677777655444333 24454567 Q ss_pred cceeecCceeeccceeeeccccccHHHhhhhcchhHHHHHHHHHHHHHHHhhhHHHHHHhhhhhhhhhhcccccccccCC Q lcl|NC_011269. 115 RITPFEGKRIEVQLFRIASFPQIKKEDLYYLRSNIVEYTQDMTKQAIMRQEDSRLVTLLEAAAVSYRVVDSSAQPGVGAL 194 (333) Q Consensus 115 ~~Q~i~~~ri~~P~f~Ivs~P~V~~~dl~~~~~~vle~~q~~A~qaIM~qED~~~~slle~~a~~~r~~~ssA~p~vg~~ 194 (333) ..+.+.-...+..-.+.-..-.+.=.+..+..+|.+.++...+..++-...|..+++.+..+. T Consensus 70 ~~~~lt~~~~~~~i~~~~~a~~i~D~~~~~~~~d~~~~~~~~~~~~~a~~vd~~i~~~l~~a~----------------- 132 (274) T protein:vir:96 70 PTDILETKKREAKIRKIAKGTSISDEALLSGYGDPQGEQVRQHGLAHANKVDDDVLEALKSAK----------------- 132 (274) T ss_pred chhhcccceeEEEeeeeecceeehHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHHhccc----------------- Confidence 777666555555544433333333344455567999999999999999999999888775433 Q ss_pred CcceEEeeccccHHHHHHHHHHHHhhCCccceEEechhhhhhhhhcCCCchhhhHHhhhhhcceeeeeecc------ccc Q lcl|NC_011269. 195 PNEITIAGSHLMPDDLYTAVTYTDQRQLDSSRLLANPQEYRDLYRWDINTTGWAFKDSVVAGERIVQFGEF------QIG 268 (333) Q Consensus 195 ~N~i~i~~g~Lt~~~L~~a~t~v~~~~L~at~il~~~~~~~Di~gw~~N~~~~~~~DpV~~~e~il~~G~f------gi~ 268 (333) .++...-++.+.|..|.+...+-+-.-..++|+|..|..|+-=...+|- .++-. ++-++..|.| .|. 
T Consensus 133 ---~~~~~~~~~~d~i~~A~~~lgd~~~~~~~ivv~p~~~~~L~k~~~~~f~---~~s~~-g~~~~~~G~ig~~~G~~Vi 205 (274) T protein:vir:96 133 ---LTVEADITKLTGLQTAIDKFNDEDLEPMVLFISPLDAGKLRGDATTNFT---RATEL-GDDVIVKGAFGEALGAVIV 205 (274) T ss_pred ---ccccccccCHHHHHHHHHHhccccccccEEEeCHHHHHHHHhhcccccc---ccccc-cccceeccccceecCeEEE Confidence 2233455678899999999988887888999999999999872111221 11111 1224444454 477 Q ss_pred ceeeecCCeEEEeeChhhhcccccccCceeccccchhhhccceehhhhhhhhhhccceEEEEecC Q lcl|NC_011269. 269 KSIIIPRGTVYLTPEPEFLGVFPVMYSLDVEEDNKVERFNKGWVMDELVGMAILNPRGIVILRKA 333 (333) Q Consensus 269 ~skvlprgeiyvvadpE~~G~~pvR~~L~s~p~D~~er~~kGWvm~E~~g~~i~N~~siv~~~~~ 333 (333) .+..+|-++.|+...+- .|.+- ..++.+|..--..++..==.-.+.+|..+.||.++|.++|. T Consensus 206 ~s~~~~~~t~~l~~~gA-~~~~~-~~~~~vE~~Rd~~~~~d~i~~~~~y~~~~~~~~~~v~~tk~ 268 (274) T protein:vir:96 206 RSNKLEAGTAILAKKGA-VKLIT-KRDFFLETDRDPSTKTTALYSDKHYVAYLYDESKAVKITKG 268 (274) T ss_pred EeCCCCCceEEEEeccc-eeeee-cCCcccccccccccccCEEEEeEEEEEEEEcCCcEEEEEcC Confidence 99999999999998766 56554 56677665444444444444567789999999999999999 No 23 >protein:vir:95898 Length: 274 # NCBI annotation: ORF014 # Family: family:all:522 # MgeID: mge:1588 # MgeName: 71 # Cross-refs: genbank:acc:YP_240385;genbank:gi:66396054;genbank:GeneID:5133409 Probab=98.60 E-value=1.7e-09 Score=68.67 Aligned_cols=260 Identities=18% Similarity=0.201 Sum_probs=161.5 Q ss_pred cchHHHHHHHHHHhcCchhHHHHHHHHHHHHHHHHHhhhhhhhhhhhccccC--CCcceeecCCCCccceEEEEcCCCcc Q lcl|NC_011269. 37 LSAREKQAKLAHILSDKVGGIQRLGQSMIGPIQLQLRYQGILRNVLLEDTLT--PGVPIQYDVLDDLGQAYMLHGNEGEI 114 (333) Q Consensus 37 ls~ee~~~Lm~~Al~~~Eg~~~aLg~~mA~pI~~q~~rqGi~RklL~~~TL~--~G~~p~y~v~~~v~~a~~~~~~~G~i 114 (333) +++++ + .+++.+ .| +.+++.+..++...+...+++-. 
-.+|+ +|....+|+.+..+.+- ...+..++ T Consensus 1 m~~~~-T-~l~d~i-~P----ev~~~~v~~~~~~~l~~~~~~~~---~~~l~g~~G~tv~iP~~~~ig~a~-~~~~g~~i 69 (274) T protein:vir:95 1 MAQGM-T-KLTNQI-VP----EVLAPMMQAELEKKLRFASFAEI---DNTLVGQPGDTLTFPAFIYSGDAK-VVAEGEKI 69 (274) T ss_pred CCcce-e-ehhhee-ch----HHHHHHHHHHHHhhhhcccccee---cccccCCCCCEEEeeeecCCCccc-cccCCCcc Confidence 11110 0 122222 22 33455555555555555454311 12232 47677777655444333 24454567 Q ss_pred cceeecCceeeccceeeeccccccHHHhhhhcchhHHHHHHHHHHHHHHHhhhHHHHHHhhhhhhhhhhcccccccccCC Q lcl|NC_011269. 115 RITPFEGKRIEVQLFRIASFPQIKKEDLYYLRSNIVEYTQDMTKQAIMRQEDSRLVTLLEAAAVSYRVVDSSAQPGVGAL 194 (333) Q Consensus 115 ~~Q~i~~~ri~~P~f~Ivs~P~V~~~dl~~~~~~vle~~q~~A~qaIM~qED~~~~slle~~a~~~r~~~ssA~p~vg~~ 194 (333) ..+.+.-...+..-.+.-..-.+.=.+..+..+|.+.++...+..++-...|..+++.+..+. T Consensus 70 ~~~~lt~~~~~~~i~~~~~a~~i~D~~~~~~~~d~~~~~~~~~~~~~a~~vd~~i~~~l~~a~----------------- 132 (274) T protein:vir:95 70 PTDILETKKREAKIRKIAKGTSISDEALLSGYGDPQGEQVRQHGLAHANKVDDDVLEALKSAK----------------- 132 (274) T ss_pred chhhcccceeEEEeeeeecceeehHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHHhccc----------------- Confidence 777666555555544433333333344455567999999999999999999999888775433 Q ss_pred CcceEEeeccccHHHHHHHHHHHHhhCCccceEEechhhhhhhhhcCCCchhhhHHhhhhhcceeeeeecc------ccc Q lcl|NC_011269. 195 PNEITIAGSHLMPDDLYTAVTYTDQRQLDSSRLLANPQEYRDLYRWDINTTGWAFKDSVVAGERIVQFGEF------QIG 268 (333) Q Consensus 195 ~N~i~i~~g~Lt~~~L~~a~t~v~~~~L~at~il~~~~~~~Di~gw~~N~~~~~~~DpV~~~e~il~~G~f------gi~ 268 (333) .++...-++.+.|..|.+...+-+-.-..++|+|..|..|+-=...+|- .++-. ++-++..|.| .|. 
T Consensus 133 ---~~~~~~~~~~d~i~~A~~~lgd~~~~~~~ivv~p~~~~~L~k~~~~~f~---~~s~~-g~~~~~~G~ig~~~G~~Vi 205 (274) T protein:vir:95 133 ---LTVEADITKLTGLQTAIDKFNDEDLEPMVLFISPLDAGKLRGDATTNFT---RATEL-GDDVIVKGAFGEALGAVIV 205 (274) T ss_pred ---ccccccccCHHHHHHHHHHhccccccccEEEeCHHHHHHHHhhcccccc---ccccc-cccceeccccceecCeEEE Confidence 2233455678899999999988887888999999999999872111221 11111 1224444454 477 Q ss_pred ceeeecCCeEEEeeChhhhcccccccCceeccccchhhhccceehhhhhhhhhhccceEEEEecC Q lcl|NC_011269. 269 KSIIIPRGTVYLTPEPEFLGVFPVMYSLDVEEDNKVERFNKGWVMDELVGMAILNPRGIVILRKA 333 (333) Q Consensus 269 ~skvlprgeiyvvadpE~~G~~pvR~~L~s~p~D~~er~~kGWvm~E~~g~~i~N~~siv~~~~~ 333 (333) .+..+|-++.|+...+- .|.+- ..++.+|..--..++..==.-.+.+|..+.||.++|.++|. T Consensus 206 ~s~~~~~~t~~l~~~gA-~~~~~-~~~~~vE~~Rd~~~~~d~i~~~~~y~~~~~~~~~~v~~tk~ 268 (274) T protein:vir:95 206 RSNKLEAGTAILAKKGA-VKLIT-KRDFFLETDRDPSTKTTALYSDKHYVAYLYDESKAVKITKG 268 (274) T ss_pred EeCCCCCceEEEEeccc-eeeee-cCCcccccccccccccCEEEEeEEEEEEEEcCCcEEEEEcC Confidence 99999999999998766 56554 56677665444444444444567789999999999999999 No 24 >protein:vir:4856 Length: 293 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:106 # MgeName: DT1 # Cross-refs: genbank:acc:NP_049396;genbank:gi:9632424;genbank:GeneID:1258532 Probab=98.59 E-value=1.9e-09 Score=68.42 Aligned_cols=254 Identities=11% Similarity=0.082 Sum_probs=179.5 Q ss_pred HHHHhc---CchhHHHHHHHHHHHHHHHHHhhhhhhhhhhhccccCCCc-ceeecCCCCccceEEEEcCCCccccee-ec Q lcl|NC_011269. 46 LAHILS---DKVGGIQRLGQSMIGPIQLQLRYQGILRNVLLEDTLTPGV-PIQYDVLDDLGQAYMLHGNEGEIRITP-FE 120 (333) Q Consensus 46 m~~Al~---~~Eg~~~aLg~~mA~pI~~q~~rqGi~RklL~~~TL~~G~-~p~y~v~~~v~~a~~~~~~~G~i~~Q~-i~ 120 (333) |-++++ +..|+ -.+-+-+++.|.+.++....++++....+++.+. -..+++..+.+..+-|++.-+++.++- .. 
T Consensus 1 ~l~~~~~~t~~~gg-~liP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~g~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~ 79 (293) T protein:vir:48 1 MLDSKTDHSGSDAG-LTIPQDIRTAINTLVRQYDSLQEYVNVENVTTLTGSRVYEKWTDITGLANIDDEAGKIADIDDPK 79 (293) T ss_pred CceeecccccCcCc-eEechhHHHHHHHHHHhhhhhhhhceeeeccCCcceEEEEeecCCCcceeeecCCcccccccccc Confidence 333332 23344 2567888889999999999999998777776654 223333334444556788778887653 34 Q ss_pred CceeeccceeeeccccccHHHhhhhcchhHHHHHHHHHHHHHHHhhhHHHHHHhhhhhhhhhhcccccccccCCCcceEE Q lcl|NC_011269. 121 GKRIEVQLFRIASFPQIKKEDLYYLRSNIVEYTQDMTKQAIMRQEDSRLVTLLEAAAVSYRVVDSSAQPGVGALPNEITI 200 (333) Q Consensus 121 ~~ri~~P~f~Ivs~P~V~~~dl~~~~~~vle~~q~~A~qaIM~qED~~~~slle~~a~~~r~~~ssA~p~vg~~~N~i~i 200 (333) =+.|++.-.++.....|..+=|++...|+..+..++..++|.+-||.-+++.+-+.+ - T Consensus 80 ~~~i~l~~~k~~~~~~iS~ell~ds~~~l~~~i~~~la~~~~~~~~~~i~~g~~~~~----------------------~ 137 (293) T protein:vir:48 80 LSLIKYTIKRYAGISTVTNSLLADSAENILAWLSGWIAKKVVVTRNKAILGVVDKLP----------------------T 137 (293) T ss_pred eeEEEEeeeEEEEeehhhHHHHhhhhHHHHHHHHHHHHHHHHHHHHhHHhhcccccc----------------------c Confidence 478899999999999999999999999999999999999999999987776653221 1 Q ss_pred eeccccHHHHHHHHHHHHhhCCccceEEechhhhhhhhhcCCCchhhhHHhhhhhcceeeee----e----cccccc--- Q lcl|NC_011269. 201 AGSHLMPDDLYTAVTYTDQRQLDSSRLLANPQEYRDLYRWDINTTGWAFKDSVVAGERIVQF----G----EFQIGK--- 269 (333) Q Consensus 201 ~~g~Lt~~~L~~a~t~v~~~~L~at~il~~~~~~~Di~gw~~N~~~~~~~DpV~~~e~il~~----G----~fgi~~--- 269 (333) .++.++-++|.+++..+..-......++||++.|..|.. +||. .+.-+++- | ++|.+. 
T Consensus 138 ~~~~~~~d~i~~~~~~l~~~~~~~a~~vmn~~~~~~L~~---------lkd~--~g~~l~~~~~~~~~~~~l~G~Pv~~~ 206 (293) T protein:vir:48 138 KPTLTKWDDIIDLEAKVDPAIKQTSFFLTNTSGFTALKK---------VKNA--LGDYLMERDVKSPTGYSIAGFAVKEI 206 (293) T ss_pred cccccCHHHHHHHHHhhhhhhcCCCEEEEcHHHHHHHHH---------hhcc--CCceEeecCcCCCCCceecceeeEEe Confidence 235567899999999998888888899999999998876 2232 12222221 1 245442 Q ss_pred -eeeecCCe----EEEeeChhhhcccccccCceeccccch----hhhccceehhhhhhhhhhccceEEEEecC Q lcl|NC_011269. 270 -SIIIPRGT----VYLTPEPEFLGVFPVMYSLDVEEDNKV----ERFNKGWVMDELVGMAILNPRGIVILRKA 333 (333) Q Consensus 270 -skvlprge----iyvvadpE~~G~~pvR~~L~s~p~D~~----er~~kGWvm~E~~g~~i~N~~siv~~~~~ 333 (333) +..+|.++ .+++.|.-.+-.+-+|+++.++-.+.. ++-..++.+.+-++..+.||.+|+.+..+ T Consensus 207 ~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~r~~~r~d~~~~~~~a~~~l~~~ 279 (293) T protein:vir:48 207 SDRWLPNASSGVMPLYFGDLKQAVTLFDRQQMSLLSTNIGGGAFETDTTKVRVIDRFDVVATDTEAFVPASFK 279 (293) T ss_pred cccccCCccCCceEEEEEeccceEEEEEecceEEEEecccchhhhcCeEEEEEEEeeCcEEecccceEEEEee Confidence 33445433 244566554556777899888766532 23357889999999999999999999855 No 25 >protein:vir:4997 Length: 397 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:109 # MgeName: Sfi21 # Cross-refs: genbank:acc:NP_049971;genbank:gi:9632943;genbank:GeneID:1262106 Probab=98.58 E-value=5.8e-09 Score=65.73 Aligned_cols=296 Identities=13% Similarity=0.077 Sum_probs=182.9 Q ss_pred Ccccchhhhhhhh-hhcccchHHHHHHHHHHhhcchhcchHHHHHHHHH-HhcCchhHHHHHHHHHHHHHHHHHhhhhhh Q lcl|NC_011269. 1 MTLPVAVGSGLGR-FAKASDDYVADIVEAKQRMGGRKLSAREKQAKLAH-ILSDKVGGIQRLGQSMIGPIQLQLRYQGIL 78 (333) Q Consensus 1 ~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~ls~ee~~~Lm~~-Al~~~Eg~~~aLg~~mA~pI~~q~~rqGi~ 78 (333) ...+..-..+..+ ......+|...+ ...|-...+.++=+. .-.+..|+. 
.+-+-+.+.|...++....+ T Consensus 69 ~~~~~~~~~~~~~~~~~~~~~~~~~~--------~~~l~~~~~~~~~~~~~~t~~~gg~-~iP~~~~~~ii~~~~~~~~l 139 (397) T protein:vir:49 69 ANMSEEEKKPLTKNEEEVKANFVKDF--------KNLVRGRYQNLLDSKTDGSGSDAGL-TIPQDIRTAINTLVRQFDSL 139 (397) T ss_pred hcccccccccccchhhHHHHHHHHHH--------HHHhhcchhhHHHhhhccCCccCcc-eecHHHHHHHHHHHHhhhhH Confidence 1111111111111 001111111111 111111111111001 112333442 45677788999999999999 Q ss_pred hhhhhccccCCCcc-eeecCCCCccceEEEEcCCCcccceee-cCceeeccceeeeccccccHHHhhhhcchhHHHHHHH Q lcl|NC_011269. 79 RNVLLEDTLTPGVP-IQYDVLDDLGQAYMLHGNEGEIRITPF-EGKRIEVQLFRIASFPQIKKEDLYYLRSNIVEYTQDM 156 (333) Q Consensus 79 RklL~~~TL~~G~~-p~y~v~~~v~~a~~~~~~~G~i~~Q~i-~~~ri~~P~f~Ivs~P~V~~~dl~~~~~~vle~~q~~ 156 (333) +++....+++.+.. ..|++-.+....+-|.+-.+++..... .=+.|++.-.++..++.|..+=|+....|+..+..++ T Consensus 140 ~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~ 219 (397) T protein:vir:49 140 QEYVNVENVTTLTGSRVYEKWADITGLAKLDDEGGQIGQNDDPKLSLIRYAIKRYAGISTVTNSLLADSAENILAWLSGW 219 (397) T ss_pred hhhcceeeccCCcceEEEEeeccCCcceeeeccccccccccccceeeeEeeeeeeEeehhhHHHHHhhhhHHHHHHHHHH Confidence 99988888877653 234433344344556766666765432 2267888888999999999999999999999999999 Q ss_pred HHHHHHHHhhhHHHHHHhhhhhhhhhhcccccccccCCCcceEEeeccccHHHHHHHHHHHHhhCCccceEEechhhhhh Q lcl|NC_011269. 157 TKQAIMRQEDSRLVTLLEAAAVSYRVVDSSAQPGVGALPNEITIAGSHLMPDDLYTAVTYTDQRQLDSSRLLANPQEYRD 236 (333) Q Consensus 157 A~qaIM~qED~~~~slle~~a~~~r~~~ssA~p~vg~~~N~i~i~~g~Lt~~~L~~a~t~v~~~~L~at~il~~~~~~~D 236 (333) ..++|.+-+|.-+++-. -+..| .++.++-++|..++..++.-......++||++-|.. 
T Consensus 220 l~~~~~~~~d~ail~G~-----------g~~~~-----------~~~~~~~d~i~~~~~~l~~~~~~~a~~v~n~~~~~~ 277 (397) T protein:vir:49 220 IAKKVVVTRNKAILEAI-----------GTLPN-----------KPTLAKWDDIIDLQAKVDPAIKQTSLFLTNTSGFTA 277 (397) T ss_pred HHHHHHHHHHHHHHhcc-----------ccccc-----------cccccCHHHHHHHHHhhhhhhcCCCEEEEcHHHHHH Confidence 99999999997665433 11111 245567889999999999999999999999999998 Q ss_pred hhhcCCCchhhhHHhhhhhcceeeee---ecccccc----eeeecCCe----EEEeeChhhhcccccccCceeccccchh Q lcl|NC_011269. 237 LYRWDINTTGWAFKDSVVAGERIVQF---GEFQIGK----SIIIPRGT----VYLTPEPEFLGVFPVMYSLDVEEDNKVE 305 (333) Q Consensus 237 i~gw~~N~~~~~~~DpV~~~e~il~~---G~fgi~~----skvlprge----iyvvadpE~~G~~pvR~~L~s~p~D~~e 305 (333) |+.=- +..+ .|+-+.++ .+. -++|.+. +..+|-++ .+++.|...+-.+-+|+|+.++-.++.. T Consensus 278 l~~lk-d~~g----~~l~~~~~-~~g~~~~l~G~pV~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~ 351 (397) T protein:vir:49 278 LKKVK-NAMG----DYLMERDV-KSPTGYSIDGFVVKEISDRFLPNGTGGAMPLYFGDLKQAVTLFDRQHLSLLSTNIGG 351 (397) T ss_pred HHHhh-ccCC----ceeecccc-cCCCCceecceeeEEecccccccccCCceeEEEeeccceEEEEeecccEEEEecccc Confidence 87621 1111 22222221 111 1355442 33455543 3566777666777889999988776543 Q ss_pred ----hhccceehhhhhhhhhhccceEEEEecC Q lcl|NC_011269. 306 ----RFNKGWVMDELVGMAILNPRGIVILRKA 333 (333) Q Consensus 306 ----r~~kGWvm~E~~g~~i~N~~siv~~~~~ 333 (333) +-..++.+++-++..+.||.+++++.-+ T Consensus 352 ~~~~~~~~~~~~~~r~d~~~~~~~a~~~~~~~ 383 (397) T protein:vir:49 352 GAFETDTTKVRVIDRFDVVSTDTEAFVPASFK 383 (397) T ss_pred chhhcCeeeEEEEEeeccEEecccceEEEEec Confidence 2356789999999999999999999744 No 26 >protein:vir:105334 Length: 276 # NCBI annotation: putative phage major capsid protein # Family: family:all:522 # MgeID: mge:1679 # MgeName: PH15 # Cross-refs: genbank:acc:YP_950669;genbank:gi:119967839;genbank:GeneID:4643213 Probab=98.57 E-value=5e-09 Score=66.11 Aligned_cols=260 Identities=17% Similarity=0.200 Sum_probs=163.0 Q ss_pred hcchhcchHHHHHHHHHHhcCchhHHHHHHHHHHHHHHHHHhhhhhhhhhhhcccc--CCCcceeecCCCCccceEEEEc Q lcl|NC_011269. 
32 MGGRKLSAREKQAKLAHILSDKVGGIQRLGQSMIGPIQLQLRYQGILRNVLLEDTL--TPGVPIQYDVLDDLGQAYMLHG 109 (333) Q Consensus 32 ~~~~~ls~ee~~~Lm~~Al~~~Eg~~~aLg~~mA~pI~~q~~rqGi~RklL~~~TL--~~G~~p~y~v~~~v~~a~~~~~ 109 (333) |. +++ -.+++.+ .||- +++.+.+++...++..+++-. ..+| .+|....+|+-+..+.+- ..+ T Consensus 1 Ma-----~~~--T~l~d~i-~Pev----~~~~v~~~~~~~~~~~~~~~~---~~~l~g~~G~ti~iP~~~~igda~-~~~ 64 (276) T protein:vir:10 1 MA-----QGT--TTKSTQI-VPEV----LAPMMQAELDKKLRFAQFADI---DSTLVGQPGDTLTFPAFVYSGDAT-VVP 64 (276) T ss_pred CC-----cce--eehhhhh-chHH----HHHHHHHHHHhhhhhccccee---cccccCCCCCEEEeeeecCCCccc-ccc Confidence 32 110 0122222 2333 345555555555555555422 1223 257777777644443332 244 Q ss_pred CCCcccceeecCceeeccceeeeccccccHHHhhhhcchhHHHHHHHHHHHHHHHhhhHHHHHHhhhhhhhhhhcccccc Q lcl|NC_011269. 110 NEGEIRITPFEGKRIEVQLFRIASFPQIKKEDLYYLRSNIVEYTQDMTKQAIMRQEDSRLVTLLEAAAVSYRVVDSSAQP 189 (333) Q Consensus 110 ~~G~i~~Q~i~~~ri~~P~f~Ivs~P~V~~~dl~~~~~~vle~~q~~A~qaIM~qED~~~~slle~~a~~~r~~~ssA~p 189 (333) ...++..+.+.-...+..-.+.-.--.+.=++..+..+|.+.++-.....+|-+..|..+++.|.++. T Consensus 65 eg~~i~~~~lt~~~~~a~i~~~~k~~~~tD~a~~~~~~dp~~~~~~~~~~~~a~~~d~~~~~~l~~~~------------ 132 (276) T protein:vir:10 65 EGQKIPVDKIETNRREAKIHKIGKGTDITDEALLSGYGDPQGEAVRQHGLAIANKVDNDVLEALRGTK------------ 132 (276) T ss_pred CCCccCccccccceeeEEeehccccccccHHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHhccc------------ Confidence 54556666554333333333433333444555566678999999999999999999999998875433 Q ss_pred cccCCCcceEEeeccccHHHHHHHHHHHHhhCCccceEEechhhhhhhhhcCCCchhhhHHhhhhhcceeeeeeccc--- Q lcl|NC_011269. 190 GVGALPNEITIAGSHLMPDDLYTAVTYTDQRQLDSSRLLANPQEYRDLYRWDINTTGWAFKDSVVAGERIVQFGEFQ--- 266 (333) Q Consensus 190 ~vg~~~N~i~i~~g~Lt~~~L~~a~t~v~~~~L~at~il~~~~~~~Di~gw~~N~~~~~~~DpV~~~e~il~~G~fg--- 266 (333) .++.++-++.+.+..|.+...+.+-.-+.++|+|..|..|+-=...+|- .++-. 
++-++..|.|| T Consensus 133 --------~~~~~~~~t~d~i~~A~~~lgd~~~~~~~ivv~p~~~~~L~k~~~~~f~---~~s~~-g~~~~~~G~ig~~~ 200 (276) T protein:vir:10 133 --------LTVSADIGTLAGLEAAIDTFDDEDLEPMVLFINPKDAGKLRSSASDNFT---RATEL-GDNIIVKGAFGEAL 200 (276) T ss_pred --------ccccccccCHHHHHHHHHHhccccCcccEEEEcHHHHHHHHHhcccccc---ccccc-cccceeccccceec Confidence 3445667788999999999988877888999999999999851112221 11111 12234455543 Q ss_pred ---ccceeeecCCeEEEeeChhhhcccccccCceeccccchhhhccceehhhhhhhhhhccceEEEEecC Q lcl|NC_011269. 267 ---IGKSIIIPRGTVYLTPEPEFLGVFPVMYSLDVEEDNKVERFNKGWVMDELVGMAILNPRGIVILRKA 333 (333) Q Consensus 267 ---i~~skvlprgeiyvvadpE~~G~~pvR~~L~s~p~D~~er~~kGWvm~E~~g~~i~N~~siv~~~~~ 333 (333) |..+..+|-|+.|+...+- +|-+- ..++.+|..--..++..-=.-.+.+|..+.||.+||.++|+ T Consensus 201 G~~Vi~s~~~p~~t~~l~~~gA-i~~~~-~~~~~vE~dRd~~~~~d~i~~~~~y~~~~~~~~~vv~~t~~ 268 (276) T protein:vir:10 201 GAVIVRSKKLDEGEAILAKRGA-VKLIT-KRDFFLETDRDPSTKTTALYSDKHYVAYLYDESKAVKVTKG 268 (276) T ss_pred ceeEEEcCCCCcceEEEEeccc-eeeee-cCCceeecccchhhcccEEEEeeEEEEEEEcCcceEEEecC Confidence 6688899999999887554 66443 56677765555555555555567789999999999999999 No 27 >protein:vir:99749 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1497 # MgeName: phiETA2 # Cross-refs: genbank:acc:YP_001004307;genbank:gi:122891761;genbank:GeneID:4712304 Probab=98.57 E-value=7e-09 Score=65.29 Aligned_cols=284 Identities=12% Similarity=0.120 Sum_probs=186.0 Q ss_pred hhcccchHHHHHHHHHH-hhcchhcchHHHHHHHHHHhcCchhHHHHHHHHHHHHHHHHHhhhhhhhhhhhccccCCCcc Q lcl|NC_011269. 14 FAKASDDYVADIVEAKQ-RMGGRKLSAREKQAKLAHILSDKVGGIQRLGQSMIGPIQLQLRYQGILRNVLLEDTLTPGVP 92 (333) Q Consensus 14 ~~~~~~~~~~~~~~~~~-~~~~~~ls~ee~~~Lm~~Al~~~Eg~~~aLg~~mA~pI~~q~~rqGi~RklL~~~TL~~G~~ 92 (333) ..|- +.--.++-.-.. ...+..+.+++. ...+..+. .+-..+++.|.+.++.+..+|.+....+.+.|. 
T Consensus 1 ~~k~-~~~~~~~~~~~~~~~~~~~~~a~~~------~~~~~~~~--lip~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~- 70 (324) T protein:vir:99 1 MEQT-QKLKLNLQHFASNNVKPQVFNPDNV------MMHEKKDG--TLLNDFTTPILQEVMENSKIMRLGKYEPMEGTE- 70 (324) T ss_pred CCCc-hHhhHHHHHHHHHhhhhhhccccce------eccCCCcc--eechhHHHHHHHHHHhhchhhhhcceeeccCCc- Confidence 1111 100000000000 001111111110 01122232 467789999999999999999999888877654 Q ss_pred eeecCCCCccceEEEEcCCCcccceeecCceeeccceeeeccccccHHHhhhhcchhHHHHHHHHHHHHHHHhhhHHHHH Q lcl|NC_011269. 93 IQYDVLDDLGQAYMLHGNEGEIRITPFEGKRIEVQLFRIASFPQIKKEDLYYLRSNIVEYTQDMTKQAIMRQEDSRLVTL 172 (333) Q Consensus 93 p~y~v~~~v~~a~~~~~~~G~i~~Q~i~~~ri~~P~f~Ivs~P~V~~~dl~~~~~~vle~~q~~A~qaIM~qED~~~~sl 172 (333) ..||+...... +-|++..++++.....=+.+++.-.++.....|..+-|+....++..+..++..++|.+.+|..+++ T Consensus 71 ~~~p~~~~~~~-a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~ai~~~~d~~~l~- 148 (324) T protein:vir:99 71 KKFTFWADKPG-AYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGIL- 148 (324) T ss_pred eEEEEEecCcc-eeEeccCccccccccceeEEEEeeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhh- Confidence 45665444333 4577877888888777788888899999999999998999999999999999999999999976653 Q ss_pred HhhhhhhhhhhcccccccccCCCc----ceEEeeccccHHHHHHHHHHHHhhCCccceEEechhhhhhhhhcCCCchhhh Q lcl|NC_011269. 173 LEAAAVSYRVVDSSAQPGVGALPN----EITIAGSHLMPDDLYTAVTYTDQRQLDSSRLLANPQEYRDLYRWDINTTGWA 248 (333) Q Consensus 173 le~~a~~~r~~~ssA~p~vg~~~N----~i~i~~g~Lt~~~L~~a~t~v~~~~L~at~il~~~~~~~Di~gw~~N~~~~~ 248 (333) +... +..| .|.. | ..+...+.++.++|..+...+..-++....++||+..|..|+. 
T Consensus 149 --G~g~-------~~~~-~~~~-~~~~~~~~~~~~~~~~~~i~~~~~~l~~~~~~~~~~v~n~~~~~~L~~--------- 208 (324) T protein:vir:99 149 --NQGN-------NPFG-KSIA-QSIEKTNKVIKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRK--------- 208 (324) T ss_pred --cCCC-------CccC-cccc-ccccccceeccccCCHHHHHHHHHhhhhccCCCCEEEEcHHHHHHHHH--------- Confidence 2221 1011 1111 1 1245567899999999999999999999999999999998875 Q ss_pred HHhhhhhcceeeeee----cccccc--eee--ecCCeEEEeeChhhhcccccccCceeccccch---------------- Q lcl|NC_011269. 249 FKDSVVAGERIVQFG----EFQIGK--SII--IPRGTVYLTPEPEFLGVFPVMYSLDVEEDNKV---------------- 304 (333) Q Consensus 249 ~~DpV~~~e~il~~G----~fgi~~--skv--lprgeiyvvadpE~~G~~pvR~~L~s~p~D~~---------------- 304 (333) .+|+-.+. +++.+ ++|.+. +.. .+.+.++ +.|+.+. .+=+|+++.++..|.. T Consensus 209 l~d~~g~~--~~~~~~~~~l~G~PVv~~~~~~~~~~~~i-~gd~~~~-~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f 284 (324) T protein:vir:99 209 IVDPETKE--RIYDRNSDTLDGLPVVNLKSSNLKRGELI-TGDFDKL-IYGIPQLIEYKIDETAQLSTVKNEDGTPVNLF 284 (324) T ss_pred hhcCCCce--eecCCCCccccceeEEeecCCCCCcceEE-EEecccE-EEEEecCcEEEEeecccccccccccccchhhh Confidence 23433222 22222 245442 222 3445444 5777754 5778889888766542 Q ss_pred hhhccceehhhhhhhhhhccceEEEEecC Q lcl|NC_011269. 305 ERFNKGWVMDELVGMAILNPRGIVILRKA 333 (333) Q Consensus 305 er~~kGWvm~E~~g~~i~N~~siv~~~~~ 333 (333) ++--..|...+-++.++.||.+++.|..+ T Consensus 285 ~~~~~~~r~~~r~d~~v~~~~a~~~lt~a 313 (324) T protein:vir:99 285 EQDMVALRATMHVALHIADDKAFAKLVPA 313 (324) T ss_pred hcCcEEEEEEEEEccEEecccceEEEEec Confidence 23356778888899999999999999988 No 28 >protein:vir:81070 Length: 390 # NCBI annotation: p09 # Family: family:all:585 # MgeID: mge:1889 # MgeName: Xop411 # Cross-refs: genbank:acc:YP_001285679;genbank:gi:148727187;genbank:GeneID:5247115 Probab=98.57 E-value=5.5e-09 Score=65.88 Aligned_cols=310 Identities=10% Similarity=0.064 Sum_probs=187.3 Q ss_pred CcccchhhhhhhhhhcccchHHHHHHHHHHhhcchhcchHHHHHHHHHHhc--CchhHHHHHHHHHHHHHHHHHhhhhhh Q lcl|NC_011269. 
1 MTLPVAVGSGLGRFAKASDDYVADIVEAKQRMGGRKLSAREKQAKLAHILS--DKVGGIQRLGQSMIGPIQLQLRYQGIL 78 (333) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ls~ee~~~Lm~~Al~--~~Eg~~~aLg~~mA~pI~~q~~rqGi~ 78 (333) ..-+..-....+.....++. ...+ ......+..+.+. +..+....+.. +..++ -.+-......|-+.++..... T Consensus 68 ~~~~~~~~~~~~~~~~~~~~-~~~~-~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~g-~~~~~~~~~~ii~~~~~~~~l 143 (390) T protein:vir:81 68 GAGGDVQHVSVGDMFVASEQ-FQAS-AGRWNDRSARATM-NIKAALNTASTDAAGSAG-ALTTPNRLPGFITPPDARLTV 143 (390) T ss_pred ccccccccccchhhhhhhHH-HHHH-HHHHhhhhhhhhh-HHHHHHHhhccccccCCc-ceechhhhHHHHHHHhhhhhh Confidence 11111111111111111111 1111 1111112222222 22233333332 22332 245666778888889999999 Q ss_pred hhhhhccccCCCcceeecCCCCccceEEEEcCCCcccceeecCceeeccceeeeccccccHHHhhhhcchhHHHHHHHHH Q lcl|NC_011269. 79 RNVLLEDTLTPGVPIQYDVLDDLGQAYMLHGNEGEIRITPFEGKRIEVQLFRIASFPQIKKEDLYYLRSNIVEYTQDMTK 158 (333) Q Consensus 79 RklL~~~TL~~G~~p~y~v~~~v~~a~~~~~~~G~i~~Q~i~~~ri~~P~f~Ivs~P~V~~~dl~~~~~~vle~~q~~A~ 158 (333) +++....+...|. -.|++.......+.|++.-+.++.....=+.+++.-.++.....|..+=|+.. .++..+...... T Consensus 144 ~~~~~~~~~~~~~-~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~i~~~~~k~~~~~~is~ell~d~-~~~~~~i~~~l~ 221 (390) T protein:vir:81 144 RDLIGSGRTDSAL-IEYVQETGFVNNAAIVAEGALKPESSLKFAKKTDTTHVIAHTMKATRQILSDA-PQLASYMNNRLI 221 (390) T ss_pred hhhcceeeccCCc-eEEEEEecCCcceeeecCCcccccccceeeEEEEeeeEEEEeehhhHHHHHhH-HHHHHHHHHHHH Confidence 9998877766654 23444333333456777777777766666778888888999999998867654 688888999999 Q ss_pred HHHHHHhhhHHHHHHhhhhhhhhhhcccccccccCC-----CcceEEeeccccHHHHHHHHHHHHhhCCccceEEechhh Q lcl|NC_011269. 159 QAIMRQEDSRLVTLLEAAAVSYRVVDSSAQPGVGAL-----PNEITIAGSHLMPDDLYTAVTYTDQRQLDSSRLLANPQE 233 (333) Q Consensus 159 qaIM~qED~~~~slle~~a~~~r~~~ssA~p~vg~~-----~N~i~i~~g~Lt~~~L~~a~t~v~~~~L~at~il~~~~~ 233 (333) .+|-+-+|.-+++ +-. 
+..+-.|.+ .+...-.++..+-++|..+...+..-+.....++||++- T Consensus 222 ~~~~~~~d~a~l~---G~g--------~~~~~~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~ 290 (390) T protein:vir:81 222 RGLKVKEDAEILR---GTG--------ANDGLLGLIPQATTYAAPTTIAGATRVDQLRLAMLQASLAEYNPSGIVINPID 290 (390) T ss_pred HHHHHHHHHHHHh---cCC--------CCCcccceeecccccccccccccchhHHHHHHHHHhhccccCCCCEEEEcHHH Confidence 9999999965442 211 111111111 111222344556678889999999999999999999999 Q ss_pred hhhhhhcCCCch-hhhHHhhhhhcceeeeeeccccc--ceeeecCCeEEEeeChhhhcccccccCceeccccchhhh--- Q lcl|NC_011269. 234 YRDLYRWDINTT-GWAFKDSVVAGERIVQFGEFQIG--KSIIIPRGTVYLTPEPEFLGVFPVMYSLDVEEDNKVERF--- 307 (333) Q Consensus 234 ~~Di~gw~~N~~-~~~~~DpV~~~e~il~~G~fgi~--~skvlprgeiyvvadpE~~G~~pvR~~L~s~p~D~~er~--- 307 (333) |..|+..- +.. .|.+-+|..... .=++|++ .+--+|.|++|+ -|....-.+-+|+|+.++..+...-| T Consensus 291 ~~~l~~lk-d~~G~~l~~~~~~~~~----~~l~G~pv~~~~~~p~~~~~~-gd~~~~~~~~~~~~~~v~~~~~~~~~~~~ 364 (390) T protein:vir:81 291 WAAIELAK-DANNQYLIGNARGTLT----PTLWGLPVVATQAMAPGEFLV-GAFDLAAQIFDQWDARVEIGYVGEDFQRN 364 (390) T ss_pred HHHHHHhh-cCCCceeecCcccccC----ceecceeeEEcCCCCCCcEEE-EehhceEEEEEecceEEEEecccchhhcC Confidence 99988632 222 123333322211 1135654 566789999875 56555556788999999877643333 Q ss_pred ccceehhhhhhhhhhccceEEEEecC Q lcl|NC_011269. 
308 NKGWVMDELVGMAILNPRGIVILRKA 333 (333) Q Consensus 308 ~kGWvm~E~~g~~i~N~~siv~~~~~ 333 (333) ..+|.+.+-++..+.+|.++|.+.-| T Consensus 365 ~v~~r~~~r~d~~v~~~~a~v~~t~a 390 (390) T protein:vir:81 365 MITVLAEERLALVVYRPEALISGSFA 390 (390) T ss_pred cEEEEEEEeeccEEecccceEEEEeC Confidence 45789999999999999999999999 No 29 >protein:vir:4092 Length: 390 # NCBI annotation: major capsid protein a # Family: family:all:635 # MgeID: mge:86 # MgeName: 2389 # Cross-refs: genbank:acc:NP_510986;swissprot:trembl:q8w604;genbank:gi:17488508;uniprot:Q8W604;genbank:GeneID:1260361 Probab=98.56 E-value=3.6e-09 Score=66.89 Aligned_cols=303 Identities=11% Similarity=0.064 Sum_probs=183.4 Q ss_pred CcccchhhhhhhhhhcccchHHHHHHHHHHhhcchhcchHHHHHHHHHHh---cCchhHHHHHHHHHHHHHHHHHhhhhh Q lcl|NC_011269. 1 MTLPVAVGSGLGRFAKASDDYVADIVEAKQRMGGRKLSAREKQAKLAHIL---SDKVGGIQRLGQSMIGPIQLQLRYQGI 77 (333) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ls~ee~~~Lm~~Al---~~~Eg~~~aLg~~mA~pI~~q~~rqGi 77 (333) |.. ++..-...-++....-...-.+.+...|.+.|+.++|+.+ .+++ .+..|+. -+=+-+.+.|.+.++.... T Consensus 38 ~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~r~~~-~~~~~~~~~~~gg~-lvP~~~~~~I~~~~~~~s~ 113 (390) T protein:vir:40 38 MAE--QIQNNIIAQARKEVNREMNDNNVLASRGANALTSDESKYY-NEVIAGNGFAGVTA-LLPPTVFERVFEDLTVEHP 113 (390) T ss_pred HHH--HHHHHHHHHHHHHHHHHHHHHHHHHhcCchhccHHHHHHH-HHHHhccCcccCcc-cccHHHHHHHHHHHHhhhh Confidence 110 0000111111100000000112233456778898888754 3333 2445552 5668889999999999999 Q ss_pred hhhhhhccccCCCcceeecCCCCccceEEEEcCCCcccce-eecCceeeccceeeeccccccHHHhhhhcchhHHHHHHH Q lcl|NC_011269. 78 LRNVLLEDTLTPGVPIQYDVLDDLGQAYMLHGNEGEIRIT-PFEGKRIEVQLFRIASFPQIKKEDLYYLRSNIVEYTQDM 156 (333) Q Consensus 78 ~RklL~~~TL~~G~~p~y~v~~~v~~a~~~~~~~G~i~~Q-~i~~~ri~~P~f~Ivs~P~V~~~dl~~~~~~vle~~q~~ 156 (333) .+++....++..|.. .+|+..... .+.|.+..|++..+ ...=+.|++...++..++.|..+-|+....|+..+..+. 
T Consensus 114 i~~~~~~~~~~~~~~-~i~~~~~~~-~a~~~~E~~~~~~~~~~~f~~i~l~~~k~~~~i~iS~ell~ds~~~l~~~i~~~ 191 (390) T protein:vir:40 114 LLSKINFVNTTATTE-WIISVGDVA-TAWWGPLCAEIKEVLDNGFDKIQTGMYKLSAYIPVCNAMLDLGPSWLDQYVRTI 191 (390) T ss_pred hhhhceeeecCCcee-EEEEEcCCc-ceeeeccccccCccccccceeeEeeeeeEEEeehhhHHHHhcchHHHHHHHHHH Confidence 999988888776654 233333333 44566666777643 444478999999999999999999999999999999999 Q ss_pred HHHHHHHHhhhHHHHHHhhhhhhhhhhcccccccccCCCcc--------eEEeeccccHHHHHHHHHHH-------HhhC Q lcl|NC_011269. 157 TKQAIMRQEDSRLVTLLEAAAVSYRVVDSSAQPGVGALPNE--------ITIAGSHLMPDDLYTAVTYT-------DQRQ 221 (333) Q Consensus 157 A~qaIM~qED~~~~slle~~a~~~r~~~ssA~p~vg~~~N~--------i~i~~g~Lt~~~L~~a~t~v-------~~~~ 221 (333) ..++|.+-+|.-+++ +-. +.+|. |.+.+. ....++.++..+...+...+ -++. T Consensus 192 la~~i~~~~~~a~l~---G~G--------~~~P~-Gil~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~l~~~~~~~~~~~ 259 (390) T protein:vir:40 192 LGEAMALGLEAGIVN---GSG--------KDQPI-GMMRDLNNVTAGEHPVKTATPLTDLTPATLATKVMLPLTDNGKKS 259 (390) T ss_pred HHHHHHHHHHhhhhc---ccC--------CCccc-eeeeccccccccccccccccccchhhHHHHHHHHHHHhhcchhhh Confidence 999999999965554 211 11221 222110 11233445554443333322 2223 Q ss_pred CccceEEechhhhhhhhh---cCCCchhhhHHhhhhhcceeeeeeccccc--ceeeecCCeEEEeeChhhhcccccccCc Q lcl|NC_011269. 222 LDSSRLLANPQEYRDLYR---WDINTTGWAFKDSVVAGERIVQFGEFQIG--KSIIIPRGTVYLTPEPEFLGVFPVMYSL 296 (333) Q Consensus 222 L~at~il~~~~~~~Di~g---w~~N~~~~~~~DpV~~~e~il~~G~fgi~--~skvlprgeiyvvadpE~~G~~pvR~~L 296 (333) ..-...+||++-+.+... +-.+..| -++. +.-.+|++ .+--||-|+|++ .|+.. ..+-+|+++ T Consensus 260 ~~~a~~i~n~~t~~~~l~~~~~~~d~~G----~~v~------~~~~~g~pvv~~~~~p~~~i~~-Gd~s~-~~i~~~~~~ 327 (390) T protein:vir:40 260 VSDAILVINPADYWSKIYAATSYMTPQG----VWVT------GILPVPLEIVQSVAVPVGKAVA-GRAKD-YFMGIGSEQ 327 (390) T ss_pred hcCceEEEcchhHHHHHHHHhhccCCCC----cccc------ccCCCceeEEEcCCCCCCcEEE-Eeece-EEEEeecce Confidence 344456888876544332 2222222 1121 11124433 455689999765 78875 578899999 Q ss_pred eeccccch--hhhccceehhhhhhhhhhccceEEEEecC Q lcl|NC_011269. 
297 DVEEDNKV--ERFNKGWVMDELVGMAILNPRGIVILRKA 333 (333) Q Consensus 297 ~s~p~D~~--er~~kGWvm~E~~g~~i~N~~siv~~~~~ 333 (333) .+...+.. ++-..++...+-++..+.+|.++|+|.-+ T Consensus 328 ~v~~~~~~~f~~~~~~~r~~~r~dg~v~~~~A~~~l~~~ 366 (390) T protein:vir:40 328 VIRTSTEYRLLDDETLYYAKQYANGRPKDNSSFLVFDIT 366 (390) T ss_pred EEEecchhhhhcCcEEEEEEEEeCCEEecccceEEEEee Confidence 99887732 33468899999999999999999999743 No 30 >protein:vir:9410 Length: 415 # NCBI annotation: head protein # Family: family:all:21 # MgeID: mge:167 # MgeName: phi 13 # Cross-refs: genbank:acc:NP_803388;genbank:gi:29028700;genbank:GeneID:1258136 Probab=98.55 E-value=1.4e-08 Score=63.66 Aligned_cols=307 Identities=8% Similarity=0.039 Sum_probs=187.7 Q ss_pred Ccccchhhhhh---hhhhcccchHHHHHHHHHHhhcchhcchHHHHHHHHHHhcCch-----------hHHHHHHHHHHH Q lcl|NC_011269. 1 MTLPVAVGSGL---GRFAKASDDYVADIVEAKQRMGGRKLSAREKQAKLAHILSDKV-----------GGIQRLGQSMIG 66 (333) Q Consensus 1 ~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~ls~ee~~~Lm~~Al~~~E-----------g~~~aLg~~mA~ 66 (333) ...+..-..-. ......+..+.. ....+..+..++.+|+.+.. +.+.... |.. .+-+-+++ T Consensus 65 ~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~e~~~~~-~~~~~~~~~~~~~~~~~~g~~-~iP~~~~~ 139 (415) T protein:vir:94 65 DGTSENNQQSVEVNEASTYRNQANIN---DLGISIQNTKVTSQEVRDFT-EYLETRNDIQGGSLKTDSGFV-VIPEEIVT 139 (415) T ss_pred HHhhhhccccccccchhhHHHHHHHH---HHHhhhhhhhhhHHHHHHHH-HHhhhhhhhhhhccccccccc-cCcHHHHH Confidence 00000000000 000011111111 12333444556666665543 2222221 221 23345778 Q ss_pred HHHHHHhhhhhhhhhhhccccCCCc--ceeecCCCCccceEEEEcCCCcccce-eecCceeeccceeeeccccccHHHhh Q lcl|NC_011269. 67 PIQLQLRYQGILRNVLLEDTLTPGV--PIQYDVLDDLGQAYMLHGNEGEIRIT-PFEGKRIEVQLFRIASFPQIKKEDLY 143 (333) Q Consensus 67 pI~~q~~rqGi~RklL~~~TL~~G~--~p~y~v~~~v~~a~~~~~~~G~i~~Q-~i~~~ri~~P~f~Ivs~P~V~~~dl~ 143 (333) .|...++.....+++....+++.|. .|++.... +..+.|++.-++++.. 
...-+.|++....+..++.|..+=|+ T Consensus 140 ~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~v~Eg~~~~~~~~~~~~~i~~~~~k~~~~~~is~ell~ 217 (415) T protein:vir:94 140 DILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSE--VAALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREAIE 217 (415) T ss_pred HHHHHHHhhhhhhhhcceeeccCCceeEEEEeecC--CccceeccccccccccccccceeeEeeheeeeeechhhHHHHh Confidence 8889999999999999998887664 34433222 2345577777777754 33457899999999999999999899 Q ss_pred hhcchhHHHHHHHHHHHHHHHhhhHHHHHHhhhhhhhhhhcccccccccCCC-----cceEEeeccccHHHHHHHHHHHH Q lcl|NC_011269. 144 YLRSNIVEYTQDMTKQAIMRQEDSRLVTLLEAAAVSYRVVDSSAQPGVGALP-----NEITIAGSHLMPDDLYTAVTYTD 218 (333) Q Consensus 144 ~~~~~vle~~q~~A~qaIM~qED~~~~slle~~a~~~r~~~ssA~p~vg~~~-----N~i~i~~g~Lt~~~L~~a~t~v~ 218 (333) ....|+..+..++..++|.+-+|.-+++-+- +..|..+... ++.+. .+..+-++|..++..+. T Consensus 218 ds~~~~~~~i~~~l~~~~~~~~~~~il~g~g-----------~g~~~~~~~~~~~~~~~~~~-~~~~~~~~i~~~~~~~~ 285 (415) T protein:vir:94 218 DAKVNVLQELKLWMARTIAATRNKAIIDVIT-----------KGSTGSTSSGFEKEGKKLEV-KKAKSLDDIKDAINLNV 285 (415) T ss_pred hchHHHHHHHHHHHHHHHHHHHHHHHhhccc-----------cCcccccccccccccccccc-ccccchHHHHHHHHhhh Confidence 9999999999999999999999977776551 1111111111 22222 23466778999999998 Q ss_pred hhCCccceEEechhhhhhhhhcCCCchhhhHHhhhhhcceeeeee---ccccc--ceeeecCCe----EEEeeChhhhcc Q lcl|NC_011269. 219 QRQLDSSRLLANPQEYRDLYRWDINTTGWAFKDSVVAGERIVQFG---EFQIG--KSIIIPRGT----VYLTPEPEFLGV 289 (333) Q Consensus 219 ~~~L~at~il~~~~~~~Di~gw~~N~~~~~~~DpV~~~e~il~~G---~fgi~--~skvlprge----iyvvadpE~~G~ 289 (333) +-+.....++||++.|..|..=- +..| .|+-+.. +.+.+ +.|.+ .+--+|.|+ .+++.|..+.-. 
T Consensus 286 ~~~~~~~~~vmn~~~~~~l~~lk-d~~G----~~l~~~~-~~~~~~~~l~G~pV~~~~~~~~~~~~~~~i~~gd~~~~~~ 359 (415) T protein:vir:94 286 KPNYEHNVAIVSQTMFAKLDKMK-DKLG----NYLIQPD-VKEKTQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKDAIV 359 (415) T ss_pred hhccCCCEEEEcHHHHHHHHHhh-ccCC----CeeeccC-cCCCCCceecceeeEEecccccCCCCccEEEEEehhccEE Confidence 88889999999999999998721 2111 2332222 11110 23433 223345554 356677766666 Q ss_pred cccccCceeccccchhhhccceehhhhhhhhhhccceEEEEecC Q lcl|NC_011269. 290 FPVMYSLDVEEDNKVERFNKGWVMDELVGMAILNPRGIVILRKA 333 (333) Q Consensus 290 ~pvR~~L~s~p~D~~er~~kGWvm~E~~g~~i~N~~siv~~~~~ 333 (333) +..|+++.++..|... +.++-..++.++..+.||.+++++... T Consensus 360 ~~~~~~~~v~~~~~~~-~~~~~r~~~r~d~~~~~~~a~~~~~~~ 402 (415) T protein:vir:94 360 LFDRSQYQASWTDYMH-FGECLMIAVRQDCRILDYKSAIVIEYD 402 (415) T ss_pred EEeecceEEEEecccc-CceEEEEEEEeccEEeccccEEEEEEe Confidence 7899999988776432 345566778899999999999999765 No 31 >protein:vir:94142 Length: 304 # NCBI annotation: ORF013 # Family: family:all:507 # MgeID: mge:1494 # MgeName: 96 # Cross-refs: genbank:acc:YP_240234;genbank:gi:66395898;genbank:GeneID:5133311 Probab=98.54 E-value=2.7e-09 Score=67.55 Aligned_cols=274 Identities=9% Similarity=0.083 Sum_probs=184.4 Q ss_pred hcchhcchHHHHHHHHHHhcCchhHHHHHHHHHHHHHHHHHhhhhhhhhhhhccccCCCcceeecCCCCccceEEEEcCC Q lcl|NC_011269. 32 MGGRKLSAREKQAKLAHILSDKVGGIQRLGQSMIGPIQLQLRYQGILRNVLLEDTLTPGVPIQYDVLDDLGQAYMLHGNE 111 (333) Q Consensus 32 ~~~~~ls~ee~~~Lm~~Al~~~Eg~~~aLg~~mA~pI~~q~~rqGi~RklL~~~TL~~G~~p~y~v~~~v~~a~~~~~~~ 111 (333) |-+. +.++. .++.+..|. -.+-+.+...|-+.++.....+++....+++.|. ..||+...... +-|++-. 
T Consensus 1 ma~~-----~~~~~--~~~~t~~gg-~lip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~-~~ip~~~~~~~-a~~v~E~ 70 (304) T protein:vir:94 1 MATP-----TYTPG--NVILSDFKN-GVIPAEQGTLIMKDIMANSAIMKLAKNEPMTAQK-KKFTYLAKGVG-AYWVSET 70 (304) T ss_pred Cccc-----ccccc--cccccCCCc-eecchhHHHHHHHHHHhccchhhhcceeeccCCc-eEEEEEeCCcc-eEEeecC Confidence 4333 32222 145556665 3677888899999999999999998888877654 34555444333 4577777 Q ss_pred CcccceeecCceeeccceeeeccccccHHHhhhhcchhHHHHHHHHHHHHHHHhhhHHHHHHhhhhhh--hhhhcccccc Q lcl|NC_011269. 112 GEIRITPFEGKRIEVQLFRIASFPQIKKEDLYYLRSNIVEYTQDMTKQAIMRQEDSRLVTLLEAAAVS--YRVVDSSAQP 189 (333) Q Consensus 112 G~i~~Q~i~~~ri~~P~f~Ivs~P~V~~~dl~~~~~~vle~~q~~A~qaIM~qED~~~~slle~~a~~--~r~~~ssA~p 189 (333) ++++++...=+.|++...++.....|..+=|+....|+..+..++..++|-+.+|.-+++ +--.. --.......+ T Consensus 71 ~~~~~~~~~~~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~ia~~~d~~~l~---G~g~~~~~~~~~~~~~~ 147 (304) T protein:vir:94 71 ERIQTSKPEYAQAEMEAKKIGVIIPLSKEFLKWTAKDFFNEVKPLIAEAFYKAFDQAVIF---GTKSPYNTSTSGKPLVE 147 (304) T ss_pred cccccccceeeEEEEEEEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHhhhee---ccCCCcccccccccccc Confidence 788888777788889899999999999999999999999999999999999999965542 11000 0000000001 Q ss_pred cccCCCcceEEeeccccHHHHHHHHHHHHhhCCccceEEechhhhhhhhhcCCCchhhhHHhhhhhcceeeee--e-ccc Q lcl|NC_011269. 190 GVGALPNEITIAGSHLMPDDLYTAVTYTDQRQLDSSRLLANPQEYRDLYRWDINTTGWAFKDSVVAGERIVQF--G-EFQ 266 (333) Q Consensus 190 ~vg~~~N~i~i~~g~Lt~~~L~~a~t~v~~~~L~at~il~~~~~~~Di~gw~~N~~~~~~~DpV~~~e~il~~--G-~fg 266 (333) +++ ....+..++..+-++|..++..+..-+.....++||++.|..|+. + +|. .+.-+++. 
| ++| T Consensus 148 ~~~--~~~~~~~~~~~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~L~~--l-------kd~--~G~~l~~~~~~~l~G 214 (304) T protein:vir:94 148 GAE--EKGNVVTDTNNLYVDLSALMATIEDEELDPNGVLTTRSFRSKMRN--A-------LDA--NDRPLFDANGNEIMG 214 (304) T ss_pred ccc--ccccccccccchHHHHHHHHHHhhhccCCcCEEEEcHHHHHHHHH--h-------hcc--CCcEeecCCCccccc Confidence 110 122344566778899999999999999999999999999999875 1 221 11112221 2 245 Q ss_pred ccc--eeeec----CCeEEEeeChhhhcccccccCceeccccch------------------hhhccceehhhhhhhhhh Q lcl|NC_011269. 267 IGK--SIIIP----RGTVYLTPEPEFLGVFPVMYSLDVEEDNKV------------------ERFNKGWVMDELVGMAIL 322 (333) Q Consensus 267 i~~--skvlp----rgeiyvvadpE~~G~~pvR~~L~s~p~D~~------------------er~~kGWvm~E~~g~~i~ 322 (333) .+- +-.+| .++ ++..|..++ .+=.|+++..+..+.+ ++--..|...+-++.++. T Consensus 215 ~PV~~~~~~~~~~~~~~-~~~gd~~~~-~~~~~~~~~i~~~~e~~~~~~~~~~~~g~~~~~f~~~~~~~r~~~r~~~~v~ 292 (304) T protein:vir:94 215 LPLSYTGADVYDKKKSL-ALMGDWDYA-RYGILQGIEYAISEDATLTTLQASDASGQPVSLFERDMFALRATMHIAYMNV 292 (304) T ss_pred eeeEEecccccCCCCcE-EEEEehhhE-EEEEecceEEEEeecceeeeecccccCccchhhhhcCcEEEEEEEEeccEee Confidence 442 22344 334 445677654 4667788776554432 222467888899999999 Q ss_pred ccceEEEEecC Q lcl|NC_011269. 323 NPRGIVILRKA 333 (333) Q Consensus 323 N~~siv~~~~~ 333 (333) ||.+++.|.+| T Consensus 293 ~~~a~~~l~~a 303 (304) T protein:vir:94 293 KPEAFATLKPT 303 (304) T ss_pred cccceEEEEec Confidence 99999999999 No 32 >protein:vir:105905 Length: 304 # NCBI annotation: major capsid protein # Family: family:all:507 # MgeID: mge:1514 # MgeName: phiETA3 # Cross-refs: genbank:acc:YP_001004375;genbank:gi:122891830;genbank:GeneID:4712376 Probab=98.54 E-value=2.7e-09 Score=67.55 Aligned_cols=274 Identities=9% Similarity=0.083 Sum_probs=184.4 Q ss_pred hcchhcchHHHHHHHHHHhcCchhHHHHHHHHHHHHHHHHHhhhhhhhhhhhccccCCCcceeecCCCCccceEEEEcCC Q lcl|NC_011269. 
32 MGGRKLSAREKQAKLAHILSDKVGGIQRLGQSMIGPIQLQLRYQGILRNVLLEDTLTPGVPIQYDVLDDLGQAYMLHGNE 111 (333) Q Consensus 32 ~~~~~ls~ee~~~Lm~~Al~~~Eg~~~aLg~~mA~pI~~q~~rqGi~RklL~~~TL~~G~~p~y~v~~~v~~a~~~~~~~ 111 (333) |-+. +.++. .++.+..|. -.+-+.+...|-+.++.....+++....+++.|. ..||+...... +-|++-. T Consensus 1 ma~~-----~~~~~--~~~~t~~gg-~lip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~-~~ip~~~~~~~-a~~v~E~ 70 (304) T protein:vir:10 1 MATP-----TYTPG--NVILSDFKN-GVIPAEQGTLIMKDIMANSAIMKLAKNEPMTAQK-KKFTYLAKGVG-AYWVSET 70 (304) T ss_pred Cccc-----ccccc--cccccCCCc-eecchhHHHHHHHHHHhccchhhhcceeeccCCc-eEEEEEeCCcc-eEEeecC Confidence 4333 32222 145556665 3677888899999999999999998888877654 34555444333 4577777 Q ss_pred CcccceeecCceeeccceeeeccccccHHHhhhhcchhHHHHHHHHHHHHHHHhhhHHHHHHhhhhhh--hhhhcccccc Q lcl|NC_011269. 112 GEIRITPFEGKRIEVQLFRIASFPQIKKEDLYYLRSNIVEYTQDMTKQAIMRQEDSRLVTLLEAAAVS--YRVVDSSAQP 189 (333) Q Consensus 112 G~i~~Q~i~~~ri~~P~f~Ivs~P~V~~~dl~~~~~~vle~~q~~A~qaIM~qED~~~~slle~~a~~--~r~~~ssA~p 189 (333) ++++++...=+.|++...++.....|..+=|+....|+..+..++..++|-+.+|.-+++ +--.. --.......+ T Consensus 71 ~~~~~~~~~~~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~ia~~~d~~~l~---G~g~~~~~~~~~~~~~~ 147 (304) T protein:vir:10 71 ERIQTSKPEYAQAEMEAKKIGVIIPLSKEFLKWTAKDFFNEVKPLIAEAFYKAFDQAVIF---GTKSPYNTSTSGKPLVE 147 (304) T ss_pred cccccccceeeEEEEEEEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHhhhee---ccCCCcccccccccccc Confidence 788888777788889899999999999999999999999999999999999999965542 11000 0000000001 Q ss_pred cccCCCcceEEeeccccHHHHHHHHHHHHhhCCccceEEechhhhhhhhhcCCCchhhhHHhhhhhcceeeee--e-ccc Q lcl|NC_011269. 190 GVGALPNEITIAGSHLMPDDLYTAVTYTDQRQLDSSRLLANPQEYRDLYRWDINTTGWAFKDSVVAGERIVQF--G-EFQ 266 (333) Q Consensus 190 ~vg~~~N~i~i~~g~Lt~~~L~~a~t~v~~~~L~at~il~~~~~~~Di~gw~~N~~~~~~~DpV~~~e~il~~--G-~fg 266 (333) +++ ....+..++..+-++|..++..+..-+.....++||++.|..|+. + +|. .+.-+++. 
| ++| T Consensus 148 ~~~--~~~~~~~~~~~~~~~i~~~~~~l~~~~~~~~~~v~~~~~~~~L~~--l-------kd~--~G~~l~~~~~~~l~G 214 (304) T protein:vir:10 148 GAE--EKGNVVTDTNNLYVDLSALMATIEDEELDPNGVLTTRSFRSKMRN--A-------LDA--NDRPLFDANGNEIMG 214 (304) T ss_pred ccc--ccccccccccchHHHHHHHHHHhhhccCCcCEEEEcHHHHHHHHH--h-------hcc--CCcEeecCCCccccc Confidence 110 122344566778899999999999999999999999999999875 1 221 11112221 2 245 Q ss_pred ccc--eeeec----CCeEEEeeChhhhcccccccCceeccccch------------------hhhccceehhhhhhhhhh Q lcl|NC_011269. 267 IGK--SIIIP----RGTVYLTPEPEFLGVFPVMYSLDVEEDNKV------------------ERFNKGWVMDELVGMAIL 322 (333) Q Consensus 267 i~~--skvlp----rgeiyvvadpE~~G~~pvR~~L~s~p~D~~------------------er~~kGWvm~E~~g~~i~ 322 (333) .+- +-.+| .++ ++..|..++ .+=.|+++..+..+.+ ++--..|...+-++.++. T Consensus 215 ~PV~~~~~~~~~~~~~~-~~~gd~~~~-~~~~~~~~~i~~~~e~~~~~~~~~~~~g~~~~~f~~~~~~~r~~~r~~~~v~ 292 (304) T protein:vir:10 215 LPLSYTGADVYDKKKSL-ALMGDWDYA-RYGILQGIEYAISEDATLTTLQASDASGQPVSLFERDMFALRATMHIAYMNV 292 (304) T ss_pred eeeEEecccccCCCCcE-EEEEehhhE-EEEEecceEEEEeecceeeeecccccCccchhhhhcCcEEEEEEEEeccEee Confidence 442 22344 334 445677654 4667788776554432 222467888899999999 Q ss_pred ccceEEEEecC Q lcl|NC_011269. 323 NPRGIVILRKA 333 (333) Q Consensus 323 N~~siv~~~~~ 333 (333) ||.+++.|.+| T Consensus 293 ~~~a~~~l~~a 303 (304) T protein:vir:10 293 KPEAFATLKPT 303 (304) T ss_pred cccceEEEEec Confidence 99999999999 No 33 >protein:vir:97433 Length: 274 # NCBI annotation: ORF014 # Family: family:all:522 # MgeID: mge:1676 # MgeName: 92 # Cross-refs: genbank:acc:YP_240749;genbank:gi:66396420;genbank:GeneID:5133789 Probab=98.54 E-value=3.5e-09 Score=66.97 Aligned_cols=260 Identities=17% Similarity=0.152 Sum_probs=164.8 Q ss_pred hcchhcchHHHHHHHHHHhcCchhHHHHHHHHHHHHHHHHHhhhhhhhhhhhccccC--CCcceeecCCCCccceEEEEc Q lcl|NC_011269. 
32 MGGRKLSAREKQAKLAHILSDKVGGIQRLGQSMIGPIQLQLRYQGILRNVLLEDTLT--PGVPIQYDVLDDLGQAYMLHG 109 (333) Q Consensus 32 ~~~~~ls~ee~~~Lm~~Al~~~Eg~~~aLg~~mA~pI~~q~~rqGi~RklL~~~TL~--~G~~p~y~v~~~v~~a~~~~~ 109 (333) |... .| .+++.+ .|| .+++.+..++...+...+++-.. .+|+ +|....+|+-+..+.+- ... T Consensus 1 ma~~----~T---~~~d~i-iPe----v~~~~v~~~~~~~l~~~~~~~~d---~~l~g~~G~tv~iP~~~~~g~a~-~~~ 64 (274) T protein:vir:97 1 MPQG----LT---KTSDQI-IPE----VLAPMMQAQLEKKLRFASFAEVD---STLQGQPGDTLTFPAFVYSGDAQ-VVA 64 (274) T ss_pred CCcc----ce---ehhhee-chH----HHHHHHHHhhhhhhhhcccceec---ccccCCCCCEEEEeeecCCCccc-ccc Confidence 2210 11 122222 233 33444555555555555554221 2222 37666666533333222 234 Q ss_pred CCCcccceeecCceeeccceeeeccccccHHHhhhhcchhHHHHHHHHHHHHHHHhhhHHHHHHhhhhhhhhhhcccccc Q lcl|NC_011269. 110 NEGEIRITPFEGKRIEVQLFRIASFPQIKKEDLYYLRSNIVEYTQDMTKQAIMRQEDSRLVTLLEAAAVSYRVVDSSAQP 189 (333) Q Consensus 110 ~~G~i~~Q~i~~~ri~~P~f~Ivs~P~V~~~dl~~~~~~vle~~q~~A~qaIM~qED~~~~slle~~a~~~r~~~ssA~p 189 (333) +..++..+.+.-...+..-.+.-..-.|.=++..+..+|.+.++-+.+..++-+..|..+++.+.++.. T Consensus 65 ~g~~i~~~~lt~~~~~~~i~~~~~~~~i~D~~~~~~~~dp~~~~~~~~a~a~a~~vd~~~~~~l~~a~~----------- 133 (274) T protein:vir:97 65 EGEKIPTDILETKKREAKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKL----------- 133 (274) T ss_pred CCCcccccccccceeEEEeeeecceecccHHHHHhccchHHHHHHHHHHHHHHHHHHHHHHHHHhccCc----------- Confidence 445577666665555555444443344555566677789999999999999999999999988866542 Q ss_pred cccCCCcceEEeeccccHHHHHHHHHHHHhhCCccceEEechhhhhhhhhcCCCchhhhHHhhhhhcceeeeeec----- Q lcl|NC_011269. 190 GVGALPNEITIAGSHLMPDDLYTAVTYTDQRQLDSSRLLANPQEYRDLYRWDINTTGWAFKDSVVAGERIVQFGE----- 264 (333) Q Consensus 190 ~vg~~~N~i~i~~g~Lt~~~L~~a~t~v~~~~L~at~il~~~~~~~Di~gw~~N~~~~~~~DpV~~~e~il~~G~----- 264 (333) ++.+.-++.+.+..|.+...+.+-.-..++++|..|..|+-=+...|- +++- .++-++..|. 
T Consensus 134 ---------~~~~~~~~~d~i~dA~~~l~d~~~~~~~ivv~p~~~~~L~k~~~~~f~---~~s~-~g~~~~~~G~ig~~~ 200 (274) T protein:vir:97 134 ---------TVNADITKLNGLQSAIDKFNDEDLEPMVLFVNPLDAGKLRGDASTNFT---RATE-LGDDIIVKGAFGEAL 200 (274) T ss_pred ---------cccccccCHHHHHHHHHHhhccCCCceEEEeCHHHHHHHHhhhhhhcc---ccCc-ccccceeccccceec Confidence 223445678999999999999888889999999999999861111221 0111 1122333444 Q ss_pred -ccccceeeecCCeEEEeeChhhhcccccccCceeccccchhhhccceehhhhhhhhhhccceEEEEecC Q lcl|NC_011269. 265 -FQIGKSIIIPRGTVYLTPEPEFLGVFPVMYSLDVEEDNKVERFNKGWVMDELVGMAILNPRGIVILRKA 333 (333) Q Consensus 265 -fgi~~skvlprgeiyvvadpE~~G~~pvR~~L~s~p~D~~er~~kGWvm~E~~g~~i~N~~siv~~~~~ 333 (333) |.|..+..+|-|+.|+.-.+- +|.+- ..++.+|...-..++..-=...+.+|.++.||.++|.++|+ T Consensus 201 G~~Vi~s~~~p~~t~~l~~~gA-~~~~~-~~~~~vE~~Rd~~~~~d~i~~~~~y~~~~~~~~~vv~~t~~ 268 (274) T protein:vir:97 201 GAIIVRTNKLEAGTAILAKKGA-VKLIL-KRDFFLEVARDASTKTTALYSDKHYVAYLYDESKAVKITKG 268 (274) T ss_pred CeeEEEcCCCCcceEEEEeCcc-eEeee-cCCceeccccchhhcccEEEEEEEEEEEEEcCCceEEEecC Confidence 347789999999999887554 66553 45677765555555666667788899999999999999999 No 34 >protein:vir:94494 Length: 274 # NCBI annotation: ORF015 # Family: family:all:522 # MgeID: mge:1508 # MgeName: 88 # Cross-refs: genbank:acc:YP_240676;genbank:gi:66396348;genbank:GeneID:5133758 Probab=98.54 E-value=3.5e-09 Score=66.97 Aligned_cols=260 Identities=17% Similarity=0.152 Sum_probs=164.8 Q ss_pred hcchhcchHHHHHHHHHHhcCchhHHHHHHHHHHHHHHHHHhhhhhhhhhhhccccC--CCcceeecCCCCccceEEEEc Q lcl|NC_011269. 32 MGGRKLSAREKQAKLAHILSDKVGGIQRLGQSMIGPIQLQLRYQGILRNVLLEDTLT--PGVPIQYDVLDDLGQAYMLHG 109 (333) Q Consensus 32 ~~~~~ls~ee~~~Lm~~Al~~~Eg~~~aLg~~mA~pI~~q~~rqGi~RklL~~~TL~--~G~~p~y~v~~~v~~a~~~~~ 109 (333) |... .| .+++.+ .|| .+++.+..++...+...+++-.. .+|+ +|....+|+-+..+.+- ... 
T Consensus 1 ma~~----~T---~~~d~i-iPe----v~~~~v~~~~~~~l~~~~~~~~d---~~l~g~~G~tv~iP~~~~~g~a~-~~~ 64 (274) T protein:vir:94 1 MPQG----LT---KTSDQI-IPE----VLAPMMQAQLEKKLRFASFAEVD---STLQGQPGDTLTFPAFVYSGDAQ-VVA 64 (274) T ss_pred CCcc----ce---ehhhee-chH----HHHHHHHHhhhhhhhhcccceec---ccccCCCCCEEEEeeecCCCccc-ccc Confidence 2210 11 122222 233 33444555555555555554221 2222 37666666533333222 234 Q ss_pred CCCcccceeecCceeeccceeeeccccccHHHhhhhcchhHHHHHHHHHHHHHHHhhhHHHHHHhhhhhhhhhhcccccc Q lcl|NC_011269. 110 NEGEIRITPFEGKRIEVQLFRIASFPQIKKEDLYYLRSNIVEYTQDMTKQAIMRQEDSRLVTLLEAAAVSYRVVDSSAQP 189 (333) Q Consensus 110 ~~G~i~~Q~i~~~ri~~P~f~Ivs~P~V~~~dl~~~~~~vle~~q~~A~qaIM~qED~~~~slle~~a~~~r~~~ssA~p 189 (333) +..++..+.+.-...+..-.+.-..-.|.=++..+..+|.+.++-+.+..++-+..|..+++.+.++.. T Consensus 65 ~g~~i~~~~lt~~~~~~~i~~~~~~~~i~D~~~~~~~~dp~~~~~~~~a~a~a~~vd~~~~~~l~~a~~----------- 133 (274) T protein:vir:94 65 EGEKIPTDILETKKREAKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKL----------- 133 (274) T ss_pred CCCcccccccccceeEEEeeeecceecccHHHHHhccchHHHHHHHHHHHHHHHHHHHHHHHHHhccCc----------- Confidence 445577666665555555444443344555566677789999999999999999999999988866542 Q ss_pred cccCCCcceEEeeccccHHHHHHHHHHHHhhCCccceEEechhhhhhhhhcCCCchhhhHHhhhhhcceeeeeec----- Q lcl|NC_011269. 190 GVGALPNEITIAGSHLMPDDLYTAVTYTDQRQLDSSRLLANPQEYRDLYRWDINTTGWAFKDSVVAGERIVQFGE----- 264 (333) Q Consensus 190 ~vg~~~N~i~i~~g~Lt~~~L~~a~t~v~~~~L~at~il~~~~~~~Di~gw~~N~~~~~~~DpV~~~e~il~~G~----- 264 (333) ++.+.-++.+.+..|.+...+.+-.-..++++|..|..|+-=+...|- +++- .++-++..|. 
T Consensus 134 ---------~~~~~~~~~d~i~dA~~~l~d~~~~~~~ivv~p~~~~~L~k~~~~~f~---~~s~-~g~~~~~~G~ig~~~ 200 (274) T protein:vir:94 134 ---------TVNADITKLNGLQSAIDKFNDEDLEPMVLFVNPLDAGKLRGDASTNFT---RATE-LGDDIIVKGAFGEAL 200 (274) T ss_pred ---------cccccccCHHHHHHHHHHhhccCCCceEEEeCHHHHHHHHhhhhhhcc---ccCc-ccccceeccccceec Confidence 223445678999999999999888889999999999999861111221 0111 1122333444 Q ss_pred -ccccceeeecCCeEEEeeChhhhcccccccCceeccccchhhhccceehhhhhhhhhhccceEEEEecC Q lcl|NC_011269. 265 -FQIGKSIIIPRGTVYLTPEPEFLGVFPVMYSLDVEEDNKVERFNKGWVMDELVGMAILNPRGIVILRKA 333 (333) Q Consensus 265 -fgi~~skvlprgeiyvvadpE~~G~~pvR~~L~s~p~D~~er~~kGWvm~E~~g~~i~N~~siv~~~~~ 333 (333) |.|..+..+|-|+.|+.-.+- +|.+- ..++.+|...-..++..-=...+.+|.++.||.++|.++|+ T Consensus 201 G~~Vi~s~~~p~~t~~l~~~gA-~~~~~-~~~~~vE~~Rd~~~~~d~i~~~~~y~~~~~~~~~vv~~t~~ 268 (274) T protein:vir:94 201 GAIIVRTNKLEAGTAILAKKGA-VKLIL-KRDFFLEVARDASTKTTALYSDKHYVAYLYDESKAVKITKG 268 (274) T ss_pred CeeEEEcCCCCcceEEEEeCcc-eEeee-cCCceeccccchhhcccEEEEEEEEEEEEEcCCceEEEecC Confidence 347789999999999887554 66553 45677765555555666667788899999999999999999 No 35 >protein:vir:100135 Length: 418 # NCBI annotation: gp5 # Family: family:all:585 # MgeID: mge:1639 # MgeName: phi1026b # Cross-refs: genbank:acc:NP_945035;genbank:gi:38707895;genbank:GeneID:2744182 Probab=98.53 E-value=1.1e-08 Score=64.28 Aligned_cols=304 Identities=9% Similarity=0.100 Sum_probs=179.7 Q ss_pred CcccchhhhhhhhhhcccchHHHHHHHHHHhh-cchhcchHHHHHHHH--HHh--cCchhHHHHHHHHHHHHHHHHHhhh Q lcl|NC_011269. 1 MTLPVAVGSGLGRFAKASDDYVADIVEAKQRM-GGRKLSAREKQAKLA--HIL--SDKVGGIQRLGQSMIGPIQLQLRYQ 75 (333) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~ls~ee~~~Lm~--~Al--~~~Eg~~~aLg~~mA~pI~~q~~rq 75 (333) ...|-..+ ..+.+ .++... ....+ +++.... ++..... ..+ .+..++. .+-.-+...|-+.++.. 
T Consensus 93 ~~~~~~~~---~~~~~--~~~~~~---~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~g~-lvp~~~~~~ii~~~~~~ 162 (418) T protein:vir:10 93 LETPKTLG---QLVTE--SEEMKG---MDGSARKSVRVRV-DRKSIMNVPATVGSGVSGSNS-LVVADRQAGIIAPPQRK 162 (418) T ss_pred cchhhhhh---HHhhh--HHHHHH---HHHHHhhhhhhhh-HHHHHHHhhhhccCCCCCCcc-ccchhHHHHHHHHHhhh Confidence 00000000 00000 000000 00000 1111111 1111111 111 1222321 34445666788889999 Q ss_pred hhhhhhhhccccCCCcceeecCCCCccceEEEEcCCCcccceeecCceeeccceeeeccccccHHHhhhhcchhHHHHHH Q lcl|NC_011269. 76 GILRNVLLEDTLTPGVPIQYDVLDDLGQAYMLHGNEGEIRITPFEGKRIEVQLFRIASFPQIKKEDLYYLRSNIVEYTQD 155 (333) Q Consensus 76 Gi~RklL~~~TL~~G~~p~y~v~~~v~~a~~~~~~~G~i~~Q~i~~~ri~~P~f~Ivs~P~V~~~dl~~~~~~vle~~q~ 155 (333) ...+++....+...|.. .|++....+..+.|++--++++.....=+.|.+...++.....|..+-|+ .+.++..+..+ T Consensus 163 ~~l~~~~~~~~~~~~~~-~~~~~~~~~~~a~~v~E~~~~~~~~~~f~~v~~~~~k~~~~~~is~ell~-ds~~l~~~i~~ 240 (418) T protein:vir:10 163 MTIRDLLMPGQTSSSSI-EYTVETGFTNNAAAVAEGAQKPTSDLKFNLKNQPVRTIAHLFKASRQILD-DAPALQSYIDG 240 (418) T ss_pred hhHHhhcceeeccCCce-eEEEEecCCCceeeeccCccccccccceeeEEEeeeeEEEeehhhHHHHH-hHHHHHHHHHH Confidence 99999988877766543 24433344445567887777777766667888888899999999988665 45789999999 Q ss_pred HHHHHHHHHhhhHHHHHHhhhhhhhhhhcccccccccCC----CcceEE-eeccccHHHHHHHHHHHHhhCCccceEEec Q lcl|NC_011269. 156 MTKQAIMRQEDSRLVTLLEAAAVSYRVVDSSAQPGVGAL----PNEITI-AGSHLMPDDLYTAVTYTDQRQLDSSRLLAN 230 (333) Q Consensus 156 ~A~qaIM~qED~~~~slle~~a~~~r~~~ssA~p~vg~~----~N~i~i-~~g~Lt~~~L~~a~t~v~~~~L~at~il~~ 230 (333) +..++|.+-+|.-++ ++..+ +..| .|.+ .+..+. 
..+..+-++|..++..++.-+...+.++|| T Consensus 241 ~l~~a~~~~~d~a~l---~G~g~-------~~~p-~Gi~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~v~n 309 (418) T protein:vir:10 241 RARYGLQLTEEGQIL---KGDGT-------GANI-LGILPQASAFMPSITLANATPIDKIRLALLQAVLAEFPATGIVLN 309 (418) T ss_pred HHHHHHHHHHHHHHh---ccCCC-------Cccc-cccccccccccccccccccccHHHHHHHHHhhccccCCCCEEEEc Confidence 999999999996554 22221 1111 1211 011222 223445678888999999999999999999 Q ss_pred hhhhhhhhhcCCCchh-hhHHhhhhhcceeeeeeccccc--ceeeecCCeEEEeeChhhhcccccccCceeccccchh-- Q lcl|NC_011269. 231 PQEYRDLYRWDINTTG-WAFKDSVVAGERIVQFGEFQIG--KSIIIPRGTVYLTPEPEFLGVFPVMYSLDVEEDNKVE-- 305 (333) Q Consensus 231 ~~~~~Di~gw~~N~~~-~~~~DpV~~~e~il~~G~fgi~--~skvlprgeiyvvadpE~~G~~pvR~~L~s~p~D~~e-- 305 (333) ++.|.+|..=- ++.| |.+.+|... . +.-++|++ .+--||.|++++ .|....-.+-+|+++.+...++.. T Consensus 310 ~~~~~~L~~lk-d~~G~~i~~~~~~~-~---~~~l~G~pV~~~~~~p~~~~~~-gd~s~~~~~~~~~~~~i~~~~~~~~~ 383 (418) T protein:vir:10 310 PIDWASIELTK-DSQGRYIVGNPVNG-T---TPRLWNLPVVETQAMTANEFLV-GAFSMAAQIFDRMEIEVLLSTENVDD 383 (418) T ss_pred HHHHHHHHHhh-cCCCceeccccccC-C---CceecceeeEEcCCCCCCcEEE-eeccceEEEEEecceEEEEecccchh Confidence 99999998621 2221 223232211 1 11134533 667799999764 666544556778998887655432 Q ss_pred --hhccceehhhhhhhhhhccceEEEEecC Q lcl|NC_011269. 306 --RFNKGWVMDELVGMAILNPRGIVILRKA 333 (333) Q Consensus 306 --r~~kGWvm~E~~g~~i~N~~siv~~~~~ 333 (333) +-..+|..+.-++.++.+|-++|.+... 
T Consensus 384 f~~~~~~~r~~~~~d~~~~~~~a~~~~~~~ 413 (418) T protein:vir:10 384 FEKNMVSIRAEERLALAVYRPESFVTGALV 413 (418) T ss_pred hhcCceEEEEEEeeccEEecccceEEEEec Confidence 3356888999999999999999998876 No 36 >protein:vir:96223 Length: 324 # NCBI annotation: ORF011 # Family: family:all:507 # MgeID: mge:1607 # MgeName: 69 # Cross-refs: genbank:acc:YP_239571;genbank:gi:66395304;genbank:GeneID:5132771 Probab=98.53 E-value=1.2e-08 Score=63.97 Aligned_cols=284 Identities=13% Similarity=0.156 Sum_probs=181.9 Q ss_pred hhcccchHHHHHHH-HHHhhcchhcchHHHHHHHHHHhcCchhHHHHHHHHHHHHHHHHHhhhhhhhhhhhccccCCCcc Q lcl|NC_011269. 14 FAKASDDYVADIVE-AKQRMGGRKLSAREKQAKLAHILSDKVGGIQRLGQSMIGPIQLQLRYQGILRNVLLEDTLTPGVP 92 (333) Q Consensus 14 ~~~~~~~~~~~~~~-~~~~~~~~~ls~ee~~~Lm~~Al~~~Eg~~~aLg~~mA~pI~~q~~rqGi~RklL~~~TL~~G~~ 92 (333) ..|- +.--.++-. ++-...+..+.+.+. ...+..+. -+-..++..|.+.++.+...+.+.+..+++.|. T Consensus 1 ~~~~-~~~~~~~~~f~~~~~~~~~~~a~~~------~~~~~~~~--lip~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~- 70 (324) T protein:vir:96 1 MEQT-QKLKLNLQHFASNNVKPQVFNPDNV------MMHEKKDG--TLLNDFTTPILQEVMENSKIMQLGKYEPMEGTE- 70 (324) T ss_pred CCcc-hhhhHHHHHHHHhhhhhhhcccccc------cccCCCcc--eechhHHHHHHHHHHhhchhhhhcceeeccCCc- Confidence 1100 000000000 000000111111110 01122232 467789999999999999999999988887654 Q ss_pred eeecCCCCccceEEEEcCCCcccceeecCceeeccceeeeccccccHHHhhhhcchhHHHHHHHHHHHHHHHhhhHHHHH Q lcl|NC_011269. 
93 IQYDVLDDLGQAYMLHGNEGEIRITPFEGKRIEVQLFRIASFPQIKKEDLYYLRSNIVEYTQDMTKQAIMRQEDSRLVTL 172 (333) Q Consensus 93 p~y~v~~~v~~a~~~~~~~G~i~~Q~i~~~ri~~P~f~Ivs~P~V~~~dl~~~~~~vle~~q~~A~qaIM~qED~~~~sl 172 (333) ..||+......+ -|++..++++.....=+.+++.-.++.....|..+-|++...++..+.+++..++|-+.+|..+++ T Consensus 71 ~~~p~~~~~~~a-~~v~Eg~~~~~~~~~f~~v~~~~~k~~~~~~is~ell~ds~~~l~~~i~~~l~~aia~~~d~~~l~- 148 (324) T protein:vir:96 71 KKFTFWADKPGA-YWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGIL- 148 (324) T ss_pred eEEEEEecCcce-eeecCCccccccccceeEEEEEeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhh- Confidence 345554433333 467888888887777788888889999999999998999999999999999999999999975552 Q ss_pred HhhhhhhhhhhcccccccccCCCc----ceEEeeccccHHHHHHHHHHHHhhCCccceEEechhhhhhhhhcCCCchhhh Q lcl|NC_011269. 173 LEAAAVSYRVVDSSAQPGVGALPN----EITIAGSHLMPDDLYTAVTYTDQRQLDSSRLLANPQEYRDLYRWDINTTGWA 248 (333) Q Consensus 173 le~~a~~~r~~~ssA~p~vg~~~N----~i~i~~g~Lt~~~L~~a~t~v~~~~L~at~il~~~~~~~Di~gw~~N~~~~~ 248 (333) +.. .+.. ..|.. + .....++.++-++|..+...+..-+.....++||+..|..|+. + T Consensus 149 --G~g-------~~~~-~~~~~-~~~~~~~~~~~~~~~~~~i~~~~~~i~~~~~~~~~~i~n~~~~~~L~~--l------ 209 (324) T protein:vir:96 149 --NQG-------NNPF-GKSIA-QSIKKTNKVIKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRK--I------ 209 (324) T ss_pred --cCC-------CCCc-Ccccc-ccccccceecccccchHHHHHHHHhhhhccCCCCEEEEcHHHHHHHHH--h------ Confidence 221 1111 11111 1 1234567789999999999999989999999999999998875 2 Q ss_pred HHhhhhhcceeeeee----cccccc----eeeecCCeEEEeeChhhhcccccccCceeccccch---------------- Q lcl|NC_011269. 249 FKDSVVAGERIVQFG----EFQIGK----SIIIPRGTVYLTPEPEFLGVFPVMYSLDVEEDNKV---------------- 304 (333) Q Consensus 249 ~~DpV~~~e~il~~G----~fgi~~----skvlprgeiyvvadpE~~G~~pvR~~L~s~p~D~~---------------- 304 (333) +|.- +.-+++.| +.|++. +.-++.|.++ ..|.... .+-.++++.++-.+.. 
T Consensus 210 -kd~~--G~~~~~~~~~~~l~G~PV~~~~~~~~~~~~~~-~gd~s~~-~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~ 284 (324) T protein:vir:96 210 -VDPE--TKERIYDRNSDSLDGLPVVNLKSSNLKRGELI-TGDFDKL-IYGIPQLIEYKIDETAQLSTVKNEDGTPVNLF 284 (324) T ss_pred -hCCC--CCeeecCCCCCcccceeeEeecCCCCCcceEE-EEecceE-EEEEecCcEEEEeecccccccccccccchhhh Confidence 2221 22222222 355442 3335556555 4466543 4677888877665532 Q ss_pred hhhccceehhhhhhhhhhccceEEEEecC Q lcl|NC_011269. 305 ERFNKGWVMDELVGMAILNPRGIVILRKA 333 (333) Q Consensus 305 er~~kGWvm~E~~g~~i~N~~siv~~~~~ 333 (333) ++--..|...+-++.++.||-+++.|..| T Consensus 285 ~~n~v~~r~~~r~d~~v~~~~a~~~l~~a 313 (324) T protein:vir:96 285 EQDMVALRATMHVALHIADDKAFAKLVPA 313 (324) T ss_pred hcCcEEEEEEEEeccEEecccceEEEecc Confidence 22346677778889999999999999999 No 37 >protein:vir:4339 Length: 395 # NCBI annotation: major head protein # Family: family:all:585 # MgeID: mge:93 # MgeName: D3 # Cross-refs: genbank:acc:NP_061502;genbank:gi:9635591;genbank:GeneID:1262860 Probab=98.53 E-value=8e-09 Score=64.98 Aligned_cols=310 Identities=10% Similarity=0.078 Sum_probs=179.6 Q ss_pred CcccchhhhhhhhhhcccchHHHHHHHHHHhhcchhcchHHHHHHHHHHh--cCchhHHHHHHHHHHHHHHHHHhhhhhh Q lcl|NC_011269. 1 MTLPVAVGSGLGRFAKASDDYVADIVEAKQRMGGRKLSAREKQAKLAHIL--SDKVGGIQRLGQSMIGPIQLQLRYQGIL 78 (333) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ls~ee~~~Lm~~Al--~~~Eg~~~aLg~~mA~pI~~q~~rqGi~ 78 (333) ...... .+-....+...+........+-.+ ..+....+...-+.++ .+..++ -.+-..+...|-..++....+ T Consensus 69 ~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~g-~~vp~~~~~~ii~~~~~~~~l 143 (395) T protein:vir:43 69 LANEKR--DGGEEAPKTAGQMVAESLKEQGVT--SSLRGSHRVSMPRSAITSIDGSGG-ALVAPDRRPGVVAAPQRRLTI 143 (395) T ss_pred Hhhhcc--ccccchhhhHHHHHHHHHHHHHHH--HHhhhhhhhhhhhhhhcccCCCCc-cccchhhHHHHHHHHHhhhhH Confidence 000000 000000000000000000000000 0000011111111121 122232 245556677888889999999 Q ss_pred hhhhhccccCCCcceeecCCCCccceEEEEcCCCcccceeecCceeeccceeeeccccccHHHhhhhcchhHHHHHHHHH Q lcl|NC_011269. 
79 RNVLLEDTLTPGVPIQYDVLDDLGQAYMLHGNEGEIRITPFEGKRIEVQLFRIASFPQIKKEDLYYLRSNIVEYTQDMTK 158 (333) Q Consensus 79 RklL~~~TL~~G~~p~y~v~~~v~~a~~~~~~~G~i~~Q~i~~~ri~~P~f~Ivs~P~V~~~dl~~~~~~vle~~q~~A~ 158 (333) +++....++..+. ..|++..+....+.|++.-+.++.....-+.|++....+.....|..+=|. ...++..+...+.. T Consensus 144 ~~l~~~~~~~~~~-~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~i~~~~~k~~~~~~is~ell~-d~~~l~~~v~~~la 221 (395) T protein:vir:43 144 RDLVAPGTTESNS-VEYVRETGFVNNAAPVSEGTQKPYSDLTFELENAPVRTIAHLFKASRQILD-DASALQSYIDARAR 221 (395) T ss_pred HhhccceecCCCc-eEEEEEecCCCceeeecCCccccccccceeEEEEeeeeEEEeehhhHHHHH-hHHHHHHHHHHHHH Confidence 9999988887654 334443343334567787777888777778899999999999999988665 45678888899999 Q ss_pred HHHHHHhhhHHHHHHhhhhhhhhhhcccccccccCCCc----ceE---EeeccccHHHHHHHHHHHHhhCCccceEEech Q lcl|NC_011269. 159 QAIMRQEDSRLVTLLEAAAVSYRVVDSSAQPGVGALPN----EIT---IAGSHLMPDDLYTAVTYTDQRQLDSSRLLANP 231 (333) Q Consensus 159 qaIM~qED~~~~slle~~a~~~r~~~ssA~p~vg~~~N----~i~---i~~g~Lt~~~L~~a~t~v~~~~L~at~il~~~ 231 (333) +++...+|.-++ ++.. +..|-.|.+.. ... -..+..+-+++..+...+..-+.....++||+ T Consensus 222 ~a~~~~~d~~~l---~G~g--------~~~~~~Gi~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~vmn~ 290 (395) T protein:vir:43 222 YGLMLVEECQLL---YGNG--------TGANLHGIIPQAQAYAPPSGVVVTAEQRIDRIRLAILQAQLAEFPASGIVLNP 290 (395) T ss_pred HHHHHHHHHHHH---hccC--------CCCccccccccccccccccccccccchhHHHHHHHHHhhccccCCCcEEEEcH Confidence 999999996544 3322 11222222211 011 12223446677888888888888899999999 Q ss_pred hhhhhhhhcCCCchhhhHHhhhhhcceeeeeeccccc--ceeeecCCeEEEeeChhhhcccccccCceeccccchh---- Q lcl|NC_011269. 232 QEYRDLYRWDINTTGWAFKDSVVAGERIVQFGEFQIG--KSIIIPRGTVYLTPEPEFLGVFPVMYSLDVEEDNKVE---- 305 (333) Q Consensus 232 ~~~~Di~gw~~N~~~~~~~DpV~~~e~il~~G~fgi~--~skvlprgeiyvvadpE~~G~~pvR~~L~s~p~D~~e---- 305 (333) +.|..|...--..-.|.+.||..... .-++|++ .+--+|.|++++ .|.-.+-.+-+|.|+.++-.+... 
T Consensus 291 ~~~~~l~~lkd~~G~~i~~~~~~~~~----~~l~G~pVv~~~~~~~~~~~~-gd~~~~~~~~~~~~~~i~~~~~~~~~f~ 365 (395) T protein:vir:43 291 IDWALIELNKDAENRYIIGSPQNGTT----PTLWRLPVVETQAITQDEFLT-GAFSLGAQIFDRMDIEVLVSTENDKDFE 365 (395) T ss_pred HHHHHHHHhhccCCceeccccccCCC----ceecceeeEEcCCCCCCcEEE-EeccceEEEEEecceEEEEeccccchhh Confidence 99999876321111233333321111 1135644 666789999865 565545556678898777555332 Q ss_pred hhccceehhhhhhhhhhccceEEEEecC Q lcl|NC_011269. 306 RFNKGWVMDELVGMAILNPRGIVILRKA 333 (333) Q Consensus 306 r~~kGWvm~E~~g~~i~N~~siv~~~~~ 333 (333) +-..+|...+-++.++.||.++|.+.-+ T Consensus 366 ~~~~~~r~~~r~d~~v~~~~a~~~~~~t 393 (395) T protein:vir:43 366 NNMVTIRAEERLAFAVYRPEAFVTGSLT 393 (395) T ss_pred cCcEEEEEEEeeccEEecccceEEEEec Confidence 2355788889999999999999999776 No 38 >protein:vir:79987 Length: 415 # NCBI annotation: head protein # Family: family:all:21 # MgeID: mge:1875 # MgeName: tp310-3 # Cross-refs: genbank:acc:YP_001430002;genbank:gi:156604057;genbank:GeneID:5525447 Probab=98.52 E-value=1.5e-08 Score=63.46 Aligned_cols=311 Identities=9% Similarity=0.035 Sum_probs=185.2 Q ss_pred Cc-ccc--hhhhhhhhhhcc----cchHHHHHHHHHHhhcchhcchHHHHHHHHHHh----------cCchhHHHHHHHH Q lcl|NC_011269. 1 MT-LPV--AVGSGLGRFAKA----SDDYVADIVEAKQRMGGRKLSAREKQAKLAHIL----------SDKVGGIQRLGQS 63 (333) Q Consensus 1 ~~-~~~--~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~ls~ee~~~Lm~~Al----------~~~Eg~~~aLg~~ 63 (333) +. +-. ......++-.+. ...-....-.......+..++.+++........ .+..|+ -..-+- T Consensus 58 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg-~~iP~~ 136 (415) T protein:vir:79 58 LDKLKEKDGTSENNQQSVEVNEARTYRNQANINDLGISIQNTKVTSQEVRDFTEYLETRNDIQGGSLKTDSGF-VVIPEE 136 (415) T ss_pred HHHHHHHHhhhhhcccccccchhhhHHHHHHHHHHhhhhhhhhhHHHHHHHHHHHHhhhhhhhhccccccccc-cccchH Confidence 00 000 000000000000 000001111112223334445555544432221 112222 134457 Q ss_pred HHHHHHHHHhhhhhhhhhhhccccCCCc--ceeecCCCCccceEEEEcCCCccccee-ecCceeeccceeeeccccccHH Q lcl|NC_011269. 
64 MIGPIQLQLRYQGILRNVLLEDTLTPGV--PIQYDVLDDLGQAYMLHGNEGEIRITP-FEGKRIEVQLFRIASFPQIKKE 140 (333) Q Consensus 64 mA~pI~~q~~rqGi~RklL~~~TL~~G~--~p~y~v~~~v~~a~~~~~~~G~i~~Q~-i~~~ri~~P~f~Ivs~P~V~~~ 140 (333) +++.|.+.++....++++....+++.|. .|.... .-...+.|++..++++++- ..-+.|++.-.++..++.|..+ T Consensus 137 ~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~v~E~~~~~~~~~~~~~~v~~~~~k~~~~~~iS~e 214 (415) T protein:vir:79 137 IVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQ--SEVAALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISRE 214 (415) T ss_pred HHHHHHHHHHhhhhhhhheeeeeccCCceeEEEEee--cCCccceeeccccccCcccccceeeEEeeeeeeEeeehhhHH Confidence 7888888899999999999998888664 444332 2223455787778887643 3447889999999999999999 Q ss_pred HhhhhcchhHHHHHHHHHHHHHHHhhhHHHHHHhhhhhhhhhhcccccccccCC-----CcceEEeeccccHHHHHHHHH Q lcl|NC_011269. 141 DLYYLRSNIVEYTQDMTKQAIMRQEDSRLVTLLEAAAVSYRVVDSSAQPGVGAL-----PNEITIAGSHLMPDDLYTAVT 215 (333) Q Consensus 141 dl~~~~~~vle~~q~~A~qaIM~qED~~~~slle~~a~~~r~~~ssA~p~vg~~-----~N~i~i~~g~Lt~~~L~~a~t 215 (333) =|.+...|+..+..++..++|.+-+|.-+++-+-+ ..|..+.. .|..+ ..+..+-++|..++. T Consensus 215 ll~ds~~~l~~~i~~~l~~~~~~~~~~~il~g~g~-----------g~~~~~~~~~~~~~~~~~-~~~~~~~~~i~~~~~ 282 (415) T protein:vir:79 215 AIEDAKVNVLQELKLWMARTIAATRNKAIIDVITK-----------GSTGSTSSGFEKEGKKLE-VKKAKSLDDIKDAIN 282 (415) T ss_pred HHhhchHHHHHHHHHHHHHHHHHHHHHHHhhcccc-----------Cccccccccccccccccc-cccccchhHHHHHHH Confidence 99999999999999999999999999777665511 11111111 12222 234567788999998 Q ss_pred HHHhhCCccceEEechhhhhhhhhcCCCchhhhHHhhhhhcceeeee---eccccc--ceeeecCCe----EEEeeChhh Q lcl|NC_011269. 216 YTDQRQLDSSRLLANPQEYRDLYRWDINTTGWAFKDSVVAGERIVQF---GEFQIG--KSIIIPRGT----VYLTPEPEF 286 (333) Q Consensus 216 ~v~~~~L~at~il~~~~~~~Di~gw~~N~~~~~~~DpV~~~e~il~~---G~fgi~--~skvlprge----iyvvadpE~ 286 (333) .+.+-......++||++-|..|+.= -+..+ .|+-+.+ +.+. -++|.+ .+-.+|.|. .++..|.-. 
T Consensus 283 ~~~~~~~~~~~~v~n~~~~~~l~~l-kd~~G----~~l~~~~-~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~Gd~~~ 356 (415) T protein:vir:79 283 LNVKPNYEHNVAIVSQTMFAKLDKM-KDKLG----NYLIQPD-VKEKTQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKD 356 (415) T ss_pred hhhhhccCCCEEEEcHHHHHHHHHh-hccCC----ceeeccC-cCCCCCceecceeeEEecccccCCCCccEEEEEehhc Confidence 8888888899999999999998761 12111 2332222 1111 124444 222345443 356667654 Q ss_pred hcccccccCceeccccchhhhccceehhhhhhhhhhccceEEEEecC Q lcl|NC_011269. 287 LGVFPVMYSLDVEEDNKVERFNKGWVMDELVGMAILNPRGIVILRKA 333 (333) Q Consensus 287 ~G~~pvR~~L~s~p~D~~er~~kGWvm~E~~g~~i~N~~siv~~~~~ 333 (333) .-.+..|+++.++..|... +.++-..+.-++..+.||.+++++... T Consensus 357 ~~~~~~~~~~~v~~~~~~~-~~~~~~~~~r~d~~v~~~~a~~~~~~~ 402 (415) T protein:vir:79 357 AIVLFDRSQYQASWTDYMH-FGECLMIAVRQDCRILDYKSAIVIEYD 402 (415) T ss_pred cEEEEeecceEEEEecccc-CceEEEEEEEeccEEeccccEEEEEEe Confidence 4457889999988776432 345666777899999999999999866 No 39 >protein:vir:98339 Length: 415 # NCBI annotation: putative capsid protein # Family: family:all:21 # MgeID: mge:1581 # MgeName: phiPVL(108) # Cross-refs: genbank:acc:YP_918931;genbank:gi:119443693;genbank:GeneID:4594501 Probab=98.52 E-value=1.5e-08 Score=63.46 Aligned_cols=311 Identities=9% Similarity=0.035 Sum_probs=185.2 Q ss_pred Cc-ccc--hhhhhhhhhhcc----cchHHHHHHHHHHhhcchhcchHHHHHHHHHHh----------cCchhHHHHHHHH Q lcl|NC_011269. 1 MT-LPV--AVGSGLGRFAKA----SDDYVADIVEAKQRMGGRKLSAREKQAKLAHIL----------SDKVGGIQRLGQS 63 (333) Q Consensus 1 ~~-~~~--~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~ls~ee~~~Lm~~Al----------~~~Eg~~~aLg~~ 63 (333) +. +-. ......++-.+. ...-....-.......+..++.+++........ 
.+..|+ -..-+- T Consensus 58 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg-~~iP~~ 136 (415) T protein:vir:98 58 LDKLKEKDGTSENNQQSVEVNEARTYRNQANINDLGISIQNTKVTSQEVRDFTEYLETRNDIQGGSLKTDSGF-VVIPEE 136 (415) T ss_pred HHHHHHHHhhhhhcccccccchhhhHHHHHHHHHHhhhhhhhhhHHHHHHHHHHHHhhhhhhhhccccccccc-cccchH Confidence 00 000 000000000000 000001111112223334445555544432221 112222 134457 Q ss_pred HHHHHHHHHhhhhhhhhhhhccccCCCc--ceeecCCCCccceEEEEcCCCccccee-ecCceeeccceeeeccccccHH Q lcl|NC_011269. 64 MIGPIQLQLRYQGILRNVLLEDTLTPGV--PIQYDVLDDLGQAYMLHGNEGEIRITP-FEGKRIEVQLFRIASFPQIKKE 140 (333) Q Consensus 64 mA~pI~~q~~rqGi~RklL~~~TL~~G~--~p~y~v~~~v~~a~~~~~~~G~i~~Q~-i~~~ri~~P~f~Ivs~P~V~~~ 140 (333) +++.|.+.++....++++....+++.|. .|.... .-...+.|++..++++++- ..-+.|++.-.++..++.|..+ T Consensus 137 ~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~v~E~~~~~~~~~~~~~~v~~~~~k~~~~~~iS~e 214 (415) T protein:vir:98 137 IVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQ--SEVAALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISRE 214 (415) T ss_pred HHHHHHHHHHhhhhhhhheeeeeccCCceeEEEEee--cCCccceeeccccccCcccccceeeEEeeeeeeEeeehhhHH Confidence 7888888899999999999998888664 444332 2223455787778887643 3447889999999999999999 Q ss_pred HhhhhcchhHHHHHHHHHHHHHHHhhhHHHHHHhhhhhhhhhhcccccccccCC-----CcceEEeeccccHHHHHHHHH Q lcl|NC_011269. 141 DLYYLRSNIVEYTQDMTKQAIMRQEDSRLVTLLEAAAVSYRVVDSSAQPGVGAL-----PNEITIAGSHLMPDDLYTAVT 215 (333) Q Consensus 141 dl~~~~~~vle~~q~~A~qaIM~qED~~~~slle~~a~~~r~~~ssA~p~vg~~-----~N~i~i~~g~Lt~~~L~~a~t 215 (333) =|.+...|+..+..++..++|.+-+|.-+++-+-+ ..|..+.. .|..+ ..+..+-++|..++. 
T Consensus 215 ll~ds~~~l~~~i~~~l~~~~~~~~~~~il~g~g~-----------g~~~~~~~~~~~~~~~~~-~~~~~~~~~i~~~~~ 282 (415) T protein:vir:98 215 AIEDAKVNVLQELKLWMARTIAATRNKAIIDVITK-----------GSTGSTSSGFEKEGKKLE-VKKAKSLDDIKDAIN 282 (415) T ss_pred HHhhchHHHHHHHHHHHHHHHHHHHHHHHhhcccc-----------Cccccccccccccccccc-cccccchhHHHHHHH Confidence 99999999999999999999999999777665511 11111111 12222 234567788999998 Q ss_pred HHHhhCCccceEEechhhhhhhhhcCCCchhhhHHhhhhhcceeeee---eccccc--ceeeecCCe----EEEeeChhh Q lcl|NC_011269. 216 YTDQRQLDSSRLLANPQEYRDLYRWDINTTGWAFKDSVVAGERIVQF---GEFQIG--KSIIIPRGT----VYLTPEPEF 286 (333) Q Consensus 216 ~v~~~~L~at~il~~~~~~~Di~gw~~N~~~~~~~DpV~~~e~il~~---G~fgi~--~skvlprge----iyvvadpE~ 286 (333) .+.+-......++||++-|..|+.= -+..+ .|+-+.+ +.+. -++|.+ .+-.+|.|. .++..|.-. T Consensus 283 ~~~~~~~~~~~~v~n~~~~~~l~~l-kd~~G----~~l~~~~-~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~Gd~~~ 356 (415) T protein:vir:98 283 LNVKPNYEHNVAIVSQTMFAKLDKM-KDKLG----NYLIQPD-VKEKTQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKD 356 (415) T ss_pred hhhhhccCCCEEEEcHHHHHHHHHh-hccCC----ceeeccC-cCCCCCceecceeeEEecccccCCCCccEEEEEehhc Confidence 8888888899999999999998761 12111 2332222 1111 124444 222345443 356667654 Q ss_pred hcccccccCceeccccchhhhccceehhhhhhhhhhccceEEEEecC Q lcl|NC_011269. 287 LGVFPVMYSLDVEEDNKVERFNKGWVMDELVGMAILNPRGIVILRKA 333 (333) Q Consensus 287 ~G~~pvR~~L~s~p~D~~er~~kGWvm~E~~g~~i~N~~siv~~~~~ 333 (333) .-.+..|+++.++..|... +.++-..+.-++..+.||.+++++... 
T Consensus 357 ~~~~~~~~~~~v~~~~~~~-~~~~~~~~~r~d~~v~~~~a~~~~~~~ 402 (415) T protein:vir:98 357 AIVLFDRSQYQASWTDYMH-FGECLMIAVRQDCRILDYKSAIVIEYD 402 (415) T ss_pred cEEEEeecceEEEEecccc-CceEEEEEEEeccEEeccccEEEEEEe Confidence 4457889999988776432 345666777899999999999999866 No 40 >protein:vir:81100 Length: 415 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:1891 # MgeName: tp310-1 # Cross-refs: genbank:acc:YP_001429874;genbank:gi:156603927;genbank:GeneID:5525320 Probab=98.52 E-value=1.5e-08 Score=63.46 Aligned_cols=311 Identities=9% Similarity=0.035 Sum_probs=185.2 Q ss_pred Cc-ccc--hhhhhhhhhhcc----cchHHHHHHHHHHhhcchhcchHHHHHHHHHHh----------cCchhHHHHHHHH Q lcl|NC_011269. 1 MT-LPV--AVGSGLGRFAKA----SDDYVADIVEAKQRMGGRKLSAREKQAKLAHIL----------SDKVGGIQRLGQS 63 (333) Q Consensus 1 ~~-~~~--~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~ls~ee~~~Lm~~Al----------~~~Eg~~~aLg~~ 63 (333) +. +-. ......++-.+. ...-....-.......+..++.+++........ .+..|+ -..-+- T Consensus 58 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg-~~iP~~ 136 (415) T protein:vir:81 58 LDKLKEKDGTSENNQQSVEVNEARTYRNQANINDLGISIQNTKVTSQEVRDFTEYLETRNDIQGGSLKTDSGF-VVIPEE 136 (415) T ss_pred HHHHHHHHhhhhhcccccccchhhhHHHHHHHHHHhhhhhhhhhHHHHHHHHHHHHhhhhhhhhccccccccc-cccchH Confidence 00 000 000000000000 000001111112223334445555544432221 112222 134457 Q ss_pred HHHHHHHHHhhhhhhhhhhhccccCCCc--ceeecCCCCccceEEEEcCCCccccee-ecCceeeccceeeeccccccHH Q lcl|NC_011269. 64 MIGPIQLQLRYQGILRNVLLEDTLTPGV--PIQYDVLDDLGQAYMLHGNEGEIRITP-FEGKRIEVQLFRIASFPQIKKE 140 (333) Q Consensus 64 mA~pI~~q~~rqGi~RklL~~~TL~~G~--~p~y~v~~~v~~a~~~~~~~G~i~~Q~-i~~~ri~~P~f~Ivs~P~V~~~ 140 (333) +++.|.+.++....++++....+++.|. .|.... 
.-...+.|++..++++++- ..-+.|++.-.++..++.|..+ T Consensus 137 ~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~v~E~~~~~~~~~~~~~~v~~~~~k~~~~~~iS~e 214 (415) T protein:vir:81 137 IVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQ--SEVAALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISRE 214 (415) T ss_pred HHHHHHHHHHhhhhhhhheeeeeccCCceeEEEEee--cCCccceeeccccccCcccccceeeEEeeeeeeEeeehhhHH Confidence 7888888899999999999998888664 444332 2223455787778887643 3447889999999999999999 Q ss_pred HhhhhcchhHHHHHHHHHHHHHHHhhhHHHHHHhhhhhhhhhhcccccccccCC-----CcceEEeeccccHHHHHHHHH Q lcl|NC_011269. 141 DLYYLRSNIVEYTQDMTKQAIMRQEDSRLVTLLEAAAVSYRVVDSSAQPGVGAL-----PNEITIAGSHLMPDDLYTAVT 215 (333) Q Consensus 141 dl~~~~~~vle~~q~~A~qaIM~qED~~~~slle~~a~~~r~~~ssA~p~vg~~-----~N~i~i~~g~Lt~~~L~~a~t 215 (333) =|.+...|+..+..++..++|.+-+|.-+++-+-+ ..|..+.. .|..+ ..+..+-++|..++. T Consensus 215 ll~ds~~~l~~~i~~~l~~~~~~~~~~~il~g~g~-----------g~~~~~~~~~~~~~~~~~-~~~~~~~~~i~~~~~ 282 (415) T protein:vir:81 215 AIEDAKVNVLQELKLWMARTIAATRNKAIIDVITK-----------GSTGSTSSGFEKEGKKLE-VKKAKSLDDIKDAIN 282 (415) T ss_pred HHhhchHHHHHHHHHHHHHHHHHHHHHHHhhcccc-----------Cccccccccccccccccc-cccccchhHHHHHHH Confidence 99999999999999999999999999777665511 11111111 12222 234567788999998 Q ss_pred HHHhhCCccceEEechhhhhhhhhcCCCchhhhHHhhhhhcceeeee---eccccc--ceeeecCCe----EEEeeChhh Q lcl|NC_011269. 216 YTDQRQLDSSRLLANPQEYRDLYRWDINTTGWAFKDSVVAGERIVQF---GEFQIG--KSIIIPRGT----VYLTPEPEF 286 (333) Q Consensus 216 ~v~~~~L~at~il~~~~~~~Di~gw~~N~~~~~~~DpV~~~e~il~~---G~fgi~--~skvlprge----iyvvadpE~ 286 (333) .+.+-......++||++-|..|+.= -+..+ .|+-+.+ +.+. -++|.+ .+-.+|.|. .++..|.-. 
T Consensus 283 ~~~~~~~~~~~~v~n~~~~~~l~~l-kd~~G----~~l~~~~-~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~Gd~~~ 356 (415) T protein:vir:81 283 LNVKPNYEHNVAIVSQTMFAKLDKM-KDKLG----NYLIQPD-VKEKTQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKD 356 (415) T ss_pred hhhhhccCCCEEEEcHHHHHHHHHh-hccCC----ceeeccC-cCCCCCceecceeeEEecccccCCCCccEEEEEehhc Confidence 8888888899999999999998761 12111 2332222 1111 124444 222345443 356667654 Q ss_pred hcccccccCceeccccchhhhccceehhhhhhhhhhccceEEEEecC Q lcl|NC_011269. 287 LGVFPVMYSLDVEEDNKVERFNKGWVMDELVGMAILNPRGIVILRKA 333 (333) Q Consensus 287 ~G~~pvR~~L~s~p~D~~er~~kGWvm~E~~g~~i~N~~siv~~~~~ 333 (333) .-.+..|+++.++..|... +.++-..+.-++..+.||.+++++... T Consensus 357 ~~~~~~~~~~~v~~~~~~~-~~~~~~~~~r~d~~v~~~~a~~~~~~~ 402 (415) T protein:vir:81 357 AIVLFDRSQYQASWTDYMH-FGECLMIAVRQDCRILDYKSAIVIEYD 402 (415) T ss_pred cEEEEeecceEEEEecccc-CceEEEEEEEeccEEeccccEEEEEEe Confidence 4457889999988776432 345666777899999999999999866 No 41 >protein:vir:4226 Length: 326 # NCBI annotation: observed 35.2Kd protein # Family: family:all:507 # MgeID: mge:89 # MgeName: L5 # Cross-refs: genbank:acc:NP_039681;swissprot:sw:q05223;genbank:gi:9625447;uniprot:Q05223;genbank:GeneID:2942929 Probab=98.52 E-value=1.4e-09 Score=69.11 Aligned_cols=289 Identities=12% Similarity=-0.024 Sum_probs=179.8 Q ss_pred HHHHHHhhcchhcchHHHHHHHHHHhcCchhHHHHHHHHHHHHHHHHHhhhhhhhhhhhccccCCCcceeecCCCCccce Q lcl|NC_011269. 25 IVEAKQRMGGRKLSAREKQAKLAHILSDKVGGIQRLGQSMIGPIQLQLRYQGILRNVLLEDTLTPGVPIQYDVLDDLGQA 104 (333) Q Consensus 25 ~~~~~~~~~~~~ls~ee~~~Lm~~Al~~~Eg~~~aLg~~mA~pI~~q~~rqGi~RklL~~~TL~~G~~p~y~v~~~v~~a 104 (333) +. .-...+-..|..+|++++-- -.+..|. -|-.-+++.|-+.++.....+++....+.+.+. 
..||+......+ T Consensus 1 ~~-~~~~r~~~~~~~~e~~a~~~--~~~~~g~--~ip~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~-~~~p~~~~~~~a 74 (326) T protein:vir:42 1 MA-VNPDRTTPFLGVNDPKVAQT--GDSMFEG--YLEPEQAQDYFAEAEKISIVQQFAQKIPMGTTG-QKIPHWTGDVSA 74 (326) T ss_pred CC-CCccchhhhcCcchhhheec--cccCCcc--eechhhHHHHHHHHHhcchhhhhcceeeccCCc-eEEEEEeCCcce Confidence 10 00001122244444443311 1122332 467788889999999999999998888776553 455554444444 Q ss_pred EEEEcCCCcccceeecCceeeccceeeeccccccHHHhhhhcchhHHHHHHHHHHHHHHHhhhHHHHHHhhhhhhhhhhc Q lcl|NC_011269. 105 YMLHGNEGEIRITPFEGKRIEVQLFRIASFPQIKKEDLYYLRSNIVEYTQDMTKQAIMRQEDSRLVTLLEAAAVSYRVVD 184 (333) Q Consensus 105 ~~~~~~~G~i~~Q~i~~~ri~~P~f~Ivs~P~V~~~dl~~~~~~vle~~q~~A~qaIM~qED~~~~slle~~a~~~r~~~ 184 (333) + |++.-+.+++....=+.+++.-.++.....|..+=|+++..++..+..++-.++|.+.+|..+++ +.. T Consensus 75 ~-~v~Eg~~~~~~~~~f~~i~~~~~k~~~~v~iS~ell~~s~~~~~~~i~~~l~~a~~~~~d~a~l~---G~g------- 143 (326) T protein:vir:42 75 S-WIGEGDMKPITKGNMTSQTIAPHKIATIFVASAETVRANPANYLGTMRTKVATAFAMAFDNAAIN---GTD------- 143 (326) T ss_pred E-EecCCccccccccceeEEEEeeEEEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHHHHHhhc---ccC------- Confidence 4 56777788888888889999999999999999999999999999999999999999999976653 111 Q ss_pred ccccccccCC---------CcceEEeeccccHHHH--HHHHHHHHhhCCccceEEechhhhhhhhhcCCCchhhhHHhhh Q lcl|NC_011269. 185 SSAQPGVGAL---------PNEITIAGSHLMPDDL--YTAVTYTDQRQLDSSRLLANPQEYRDLYRWDINTTGWAFKDSV 253 (333) Q Consensus 185 ssA~p~vg~~---------~N~i~i~~g~Lt~~~L--~~a~t~v~~~~L~at~il~~~~~~~Di~gw~~N~~~~~~~DpV 253 (333) +..|. |.+ .-.-+-..+.++.++. 
..+.-.+.........++||+..|..|+..--..-.|.+.+.+ T Consensus 144 -s~~p~-gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~n~~~~~~L~~lkd~~G~~l~~~~~ 221 (326) T protein:vir:42 144 -SPFPT-FLAQTTKEVSLVDPDGTGSNADLTVYDAVAVNALSLLVNAGKKWTHTLLDDITEPILNGAKDKSGRPLFIEST 221 (326) T ss_pred -CCccc-cccccccccceeecccccccccchhHHHHHHHHHhhhhhhccCccEEEEeHHHHHHHHHhhccCCceeecccc Confidence 11111 111 0111112233444443 3345556666777888999999999998742211112333333 Q ss_pred hhcceee-eee-ccccc--ceeeecCCeEEEe-eChhhhcccccccCceeccccc----------------hhhhcccee Q lcl|NC_011269. 254 VAGERIV-QFG-EFQIG--KSIIIPRGTVYLT-PEPEFLGVFPVMYSLDVEEDNK----------------VERFNKGWV 312 (333) Q Consensus 254 ~~~e~il-~~G-~fgi~--~skvlprgeiyvv-adpE~~G~~pvR~~L~s~p~D~----------------~er~~kGWv 312 (333) ..++..- ..+ ++|++ .+--+|-|++.++ .|-.. ..+=.|+++.++-.+. .++-..+|. T Consensus 222 ~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~Gd~s~-~~~~~~~~~~v~~~~e~~~~~~~~~~~~~~~~~~~d~~~~r 300 (326) T protein:vir:42 222 YTEENSPFRLGRIVARPTILSDHVASGTVVGYQGDFRQ-LVWGQVGGLSFDVTDQATLNLGTPQAPNFVSLWQHNLVAVR 300 (326) T ss_pred ccCccccccCceeeeeeEEEcCCCCCCceEEEEeecce-EEEEEecceEEEEeecceeeecccccccchhhhhcCcEEEE Confidence 3333110 111 35544 3446788887653 34332 2355777776654332 222357888 Q ss_pred hhhhhhhhhhccceEEEEecC Q lcl|NC_011269. 313 MDELVGMAILNPRGIVILRKA 333 (333) Q Consensus 313 m~E~~g~~i~N~~siv~~~~~ 333 (333) +.+-+++.+.||.+++.|.+. T Consensus 301 ~~~~~d~~v~~~~a~~~l~~~ 321 (326) T protein:vir:42 301 VEAEYAFHCNDKDAFVKLTNV 321 (326) T ss_pred EEEEeccEEecccceEEEeec Confidence 999999999999999988877 No 42 >protein:vir:1239 Length: 274 # NCBI annotation: similar to phage B1 major head protein # Family: family:all:522 # MgeID: mge:25 # MgeName: phi ETA # Cross-refs: genbank:acc:NP_510938;genbank:gi:17426272;genbank:GeneID:927376 Probab=98.49 E-value=7.5e-09 Score=65.13 Aligned_cols=260 Identities=18% Similarity=0.167 Sum_probs=158.6 Q ss_pred hcchhcchHHHHHHHHHHhcCchhHHHHHHHHHHHHHHHHHhhhhhhhhhhhcccc--CCCcceeecCCCCccceEEEEc Q lcl|NC_011269. 
32 MGGRKLSAREKQAKLAHILSDKVGGIQRLGQSMIGPIQLQLRYQGILRNVLLEDTL--TPGVPIQYDVLDDLGQAYMLHG 109 (333) Q Consensus 32 ~~~~~ls~ee~~~Lm~~Al~~~Eg~~~aLg~~mA~pI~~q~~rqGi~RklL~~~TL--~~G~~p~y~v~~~v~~a~~~~~ 109 (333) |... +| .+++.+ .||- +++.+..++...+...++.-.. ++| .+|....+|+.+..+.+- ... T Consensus 1 ma~~----~T---~l~d~i-iPev----~~~~v~~~~~~~l~~~~~~~~d---~~l~g~~G~tv~iP~~~~ig~a~-~~~ 64 (274) T protein:vir:12 1 MAQG----LT---KTSNQI-IPEV----LAPMMQAQLEKKLRFASFAEVD---STLQGQPGDTLTFPAFVYSGDAQ-VVA 64 (274) T ss_pred CCcc----ee---ehhhhh-chHH----HHHHHHHHHHhhhhhcccceec---ccccCCCCCEEEEeeecCCCccc-ccc Confidence 2211 11 122222 2332 3344444444444444444332 222 247666777644433332 234 Q ss_pred CCCcccceeecCceeeccceeeeccccccHHHhhhhcchhHHHHHHHHHHHHHHHhhhHHHHHHhhhhhhhhhhcccccc Q lcl|NC_011269. 110 NEGEIRITPFEGKRIEVQLFRIASFPQIKKEDLYYLRSNIVEYTQDMTKQAIMRQEDSRLVTLLEAAAVSYRVVDSSAQP 189 (333) Q Consensus 110 ~~G~i~~Q~i~~~ri~~P~f~Ivs~P~V~~~dl~~~~~~vle~~q~~A~qaIM~qED~~~~slle~~a~~~r~~~ssA~p 189 (333) +..++..+.+.-......-.+.-.--.+.=++..+..+|.+.++-..+..++-+..|..++..+.++. T Consensus 65 ~g~~i~~~~lt~~~~~~~i~~~~~~~~i~D~~~~~~~~d~~~~~~~q~~~~~a~~vd~~~l~~~~~a~------------ 132 (274) T protein:vir:12 65 EGEKIPTDILETKKREAKIRKIAKGTSITDEALLSGYGDPQGEQVRQHGLAHANKVDNDVLEALMGAK------------ 132 (274) T ss_pred CCCccchhhcccceeeEEeeeecceeeecHHHHHhcccchHHHHHHHHHHHHHHHHHHHHHHHHhccc------------ Confidence 44456666655454444444433333333344556668999999999999999999998887775432 Q ss_pred cccCCCcceEEeeccccHHHHHHHHHHHHhhCCccceEEechhhhhhhhhcCCCchhhhHHhhhhhcceeeeeecc---- Q lcl|NC_011269. 
190 GVGALPNEITIAGSHLMPDDLYTAVTYTDQRQLDSSRLLANPQEYRDLYRWDINTTGWAFKDSVVAGERIVQFGEF---- 265 (333) Q Consensus 190 ~vg~~~N~i~i~~g~Lt~~~L~~a~t~v~~~~L~at~il~~~~~~~Di~gw~~N~~~~~~~DpV~~~e~il~~G~f---- 265 (333) .++.+.-++.+.+..|.+...+.+-.-..++++|..|..|+-=+..+|- .++- -+.-++..|.| T Consensus 133 --------~~~~~~a~~~d~i~dA~~~lgd~~~~~~~ivv~p~~~~~L~k~~~~~fv---~~s~-~g~~~~~~G~ig~~~ 200 (274) T protein:vir:12 133 --------LTVNADITKLNGLQSAIDKFNDEDLEPMVLFINPLDAGKLRGDASTNFT---RATE-LGDDIIVKGAFGEAL 200 (274) T ss_pred --------ccccccccCHHHHHHHHHHhccccccccEEEeCHHHHHHHHhhhhhhcc---cccc-ccccceecccceeec Confidence 2334556789999999999988887888999999999999871111221 0111 11224444554 Q ss_pred --cccceeeecCCeEEEeeChhhhcccccccCceeccccchhhhccceehhhhhhhhhhccceEEEEecC Q lcl|NC_011269. 266 --QIGKSIIIPRGTVYLTPEPEFLGVFPVMYSLDVEEDNKVERFNKGWVMDELVGMAILNPRGIVILRKA 333 (333) Q Consensus 266 --gi~~skvlprgeiyvvadpE~~G~~pvR~~L~s~p~D~~er~~kGWvm~E~~g~~i~N~~siv~~~~~ 333 (333) .|..+..+|-++.|+...+- .|.+- ..++.+|..--+.++..==.-.+.+|..+.||.++|.++|+ T Consensus 201 G~~Vi~s~~~p~~t~~l~~~gA-~~~~~-~~~~~vE~~Rd~~~~~d~i~~~~~y~~~~~~~~~vv~~t~~ 268 (274) T protein:vir:12 201 GAIIVRSNKLEAGTAILAKKGA-VKLIL-KRDFFLEVARDASTKTTALYSDKHYVAYLYDESKAVKITKG 268 (274) T ss_pred CeeEEEeCCCCcceEEEEeccc-eeeee-cCCceeccccchhhcccEEEeeeEEEEEEEcCCceEEEEcC Confidence 47789999999999887665 66554 56677765545545444334456789999999999999999 No 43 >protein:vir:104085 Length: 320 # NCBI annotation: gp17 # Family: family:all:507 # MgeID: mge:1656 # MgeName: Che12 # Cross-refs: genbank:acc:YP_655596;genbank:gi:109392467;genbank:GeneID:4156953 Probab=98.47 E-value=8.6e-09 Score=64.79 Aligned_cols=284 Identities=12% Similarity=0.007 Sum_probs=181.7 Q ss_pred hhcchhcchHHHHHHHHHHhcCchhHHHHHHHHHHHHHHHHHhhhhhhhhhhhccccCCCcceeecCCCCccceEEEEcC Q lcl|NC_011269. 
31 RMGGRKLSAREKQAKLAHILSDKVGGIQRLGQSMIGPIQLQLRYQGILRNVLLEDTLTPGVPIQYDVLDDLGQAYMLHGN 110 (333) Q Consensus 31 ~~~~~~ls~ee~~~Lm~~Al~~~Eg~~~aLg~~mA~pI~~q~~rqGi~RklL~~~TL~~G~~p~y~v~~~v~~a~~~~~~ 110 (333) -+-|.....++++ |...-.+.-|. .+-..+.+.|-+.+++..+.+++.+..+.+.+. ..||+......+ -|++. T Consensus 1 ~~~~~~~~~~~~~--~~~t~~~~~~~--~ip~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~-~~~p~~~~~~~a-~~v~E 74 (320) T protein:vir:10 1 MAAGTAFQVDHAQ--IAQTGDTMFKG--YLEPEQAKDYFAEAEKTSIVQQFAQKVPMGTTG-QKIPHWIGDVSA-QWIGE 74 (320) T ss_pred CCCCccCCHHHHH--hhccccccccc--cccHHHHHHHHHHHHhccchhhhcceeeccCCc-eEEEEEeCCcce-EEecC Confidence 2223334444433 33333333332 567778888889999999999999888877554 355554444444 46777 Q ss_pred CCcccceeecCceeeccceeeeccccccHHHhhhhcchhHHHHHHHHHHHHHHHhhhHHHHHHhhhhhhhhhhccccccc Q lcl|NC_011269. 111 EGEIRITPFEGKRIEVQLFRIASFPQIKKEDLYYLRSNIVEYTQDMTKQAIMRQEDSRLVTLLEAAAVSYRVVDSSAQPG 190 (333) Q Consensus 111 ~G~i~~Q~i~~~ri~~P~f~Ivs~P~V~~~dl~~~~~~vle~~q~~A~qaIM~qED~~~~slle~~a~~~r~~~ssA~p~ 190 (333) .+.+++....=+.+++.-.++.....|..+=|.+...++..+..+...++|-+.+|.-+++ +-. +..|. T Consensus 75 ~~~~~~~~~~f~~v~~~~~k~~~~~~is~ell~ds~~~l~~~i~~~l~~a~a~~~d~a~l~---G~g--------~~~~~ 143 (320) T protein:vir:10 75 GDMKPITKGNMTSQNIAPHKIATIFVASAETVRANPANYLGTMRTKVATAFAMAFDSAALN---GTD--------SPFPT 143 (320) T ss_pred CccccccccceeEEEEeeEEEEEeehhhHHHHhcChHHHHHHHHHHHHHHHHHHHHHHhhc---ccC--------CCCCc Confidence 7788888877788999999999999999999999999999999999999999999976643 111 00010 Q ss_pred --ccCCCcceE------Eeecccc--HHHHHHHHHHHHhhCCccceEEechhhhhhhhhcCCCchhhhHHhhhhhcc-ee Q lcl|NC_011269. 191 --VGALPNEIT------IAGSHLM--PDDLYTAVTYTDQRQLDSSRLLANPQEYRDLYRWDINTTGWAFKDSVVAGE-RI 259 (333) Q Consensus 191 --vg~~~N~i~------i~~g~Lt--~~~L~~a~t~v~~~~L~at~il~~~~~~~Di~gw~~N~~~~~~~DpV~~~e-~i 259 (333) .|.. |..+ .....++ .+.+..+...+..-..+....+||++.|..|+.=--..-.|.+-+.+..+. -. 
T Consensus 144 ~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~ 222 (320) T protein:vir:10 144 YLAQTT-KSVSLADPGGATASDLTAYDAVAVNGLSLLVNAKKKWTHTLLDDIVEPILNGAKDKNGRPLFIESTYTDENSP 222 (320) T ss_pred cccccc-ccccceecccccccccccHHHHHHHHHhhhhcccCCCcEEEEcHHHHHHHHHhhccCCceeeccccccCcccc Confidence 0011 1111 1122222 245777788888888999999999999999976110000111111111111 01 Q ss_pred eeee-ccccc--ceeeecCCeEE-EeeChhhhcccccccCceeccccc----------------hhhhccceehhhhhhh Q lcl|NC_011269. 260 VQFG-EFQIG--KSIIIPRGTVY-LTPEPEFLGVFPVMYSLDVEEDNK----------------VERFNKGWVMDELVGM 319 (333) Q Consensus 260 l~~G-~fgi~--~skvlprgeiy-vvadpE~~G~~pvR~~L~s~p~D~----------------~er~~kGWvm~E~~g~ 319 (333) ++.+ .+|++ .+--+|-|+.. ++.|.... .+-.|+++..+-.+. .++-...|.+.+-++. T Consensus 223 ~~~~~i~g~pv~~~~~~~~~~~~~~~gd~~~~-~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~~~~~~~r~~~~~d~ 301 (320) T protein:vir:10 223 FRAGRIVSRPTILSDHVADGTTVGYMGDFRNV-IWGQVGGLSFDVTDQATLNLGTPTEPNFVSLWQHNLVAVRVEAEYAF 301 (320) T ss_pred ccCceeeeeeeEecCCCCCCceEEEEeecceE-EEEEecCeEEEEeecceeeeccccccccchhhhcCcEEEEEEEeecc Confidence 1112 13433 33447878743 45777643 477888887764432 2333567888899999 Q ss_pred hhhccceEEEEecC Q lcl|NC_011269. 320 AILNPRGIVILRKA 333 (333) Q Consensus 320 ~i~N~~siv~~~~~ 333 (333) .+.||.+++.+.++ T Consensus 302 ~v~~~~a~~~l~~~ 315 (320) T protein:vir:10 302 HNNDKDAFVKLTNV 315 (320) T ss_pred EEecccceEEEEec Confidence 99999999999877 No 44 >protein:vir:4600 Length: 415 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:101 # MgeName: PVL # Cross-refs: genbank:acc:NP_058445;genbank:gi:9635171;genbank:GeneID:1262708 Probab=98.46 E-value=3.2e-08 Score=61.65 Aligned_cols=311 Identities=10% Similarity=0.025 Sum_probs=188.0 Q ss_pred Ccccchhh-hhhhhhhcccchHHHHHHHHHHhhcchhcchHHHHHHHHHH----------hcCchhHHHHHHHHHHHHHH Q lcl|NC_011269. 
1 MTLPVAVG-SGLGRFAKASDDYVADIVEAKQRMGGRKLSAREKQAKLAHI----------LSDKVGGIQRLGQSMIGPIQ 69 (333) Q Consensus 1 ~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~ls~ee~~~Lm~~A----------l~~~Eg~~~aLg~~mA~pI~ 69 (333) ...+..-. ..-+...+...+ ....-.......+..++.+++....... ..+..|.. .+-+-+++.|- T Consensus 65 ~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~~g~~-~iP~~~~~~ii 142 (415) T protein:vir:46 65 DRTSENNQQSVEVNEARTYRN-QANINDLGISIQNTKVTSQEVRDFTEYLETRNDIQGGSLKTDSGFV-VIPEEIVTDIL 142 (415) T ss_pred HHhhhhcccccccchhhhhHH-HHHHHHHHHhhhhhhhhHHHHHHHHHHHhhhhhhhhccccccCCcc-cccHHHHHHHH Confidence 00000000 000000000000 1111112222333444444554443322 12333432 45677888899 Q ss_pred HHHhhhhhhhhhhhccccCCCc--ceeecCCCCccceEEEEcCCCcccce-eecCceeeccceeeeccccccHHHhhhhc Q lcl|NC_011269. 70 LQLRYQGILRNVLLEDTLTPGV--PIQYDVLDDLGQAYMLHGNEGEIRIT-PFEGKRIEVQLFRIASFPQIKKEDLYYLR 146 (333) Q Consensus 70 ~q~~rqGi~RklL~~~TL~~G~--~p~y~v~~~v~~a~~~~~~~G~i~~Q-~i~~~ri~~P~f~Ivs~P~V~~~dl~~~~ 146 (333) ..++....++++....+++.|. .|+....+ ...+.|++.-+++++. ...=+.|++.-..+.....|..+-|.... T Consensus 143 ~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~v~Eg~~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~ 220 (415) T protein:vir:46 143 KLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSE--VAALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREAIEDAK 220 (415) T ss_pred HHHHhhhhhhhhcceeeccCCceeEEEEEecC--CcceeecccccccccccccceeeEEeeeeeeEeeehhhHHHHhhch Confidence 9999999999999888887665 44443222 2345577777778753 33347889999999999999999999999 Q ss_pred chhHHHHHHHHHHHHHHHhhhHHHHHHhhhhhhhhhhcccccccccCCCcc----eEEeeccccHHHHHHHHHHHHhhCC Q lcl|NC_011269. 147 SNIVEYTQDMTKQAIMRQEDSRLVTLLEAAAVSYRVVDSSAQPGVGALPNE----ITIAGSHLMPDDLYTAVTYTDQRQL 222 (333) Q Consensus 147 ~~vle~~q~~A~qaIM~qED~~~~slle~~a~~~r~~~ssA~p~vg~~~N~----i~i~~g~Lt~~~L~~a~t~v~~~~L 222 (333) .++..+.+++..++|-+-+|..+++-+-. ..|..+...+. ..-..+..+-++|-.++..+.+-.. 
T Consensus 221 ~~l~~~i~~~l~~~i~~~~d~~il~g~g~-----------g~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~ 289 (415) T protein:vir:46 221 VNVLQELKLWMARTIAATRNKAIIDVITK-----------GSTGSTSSGFEKEGKKLEVKKAKSLDDIKDAINLNVKPNY 289 (415) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhcccc-----------CCccccccccccccceeccccccchHHHHHHHHhhhhhcc Confidence 99999999999999999999877665411 11111111111 1123345677889999988888888 Q ss_pred ccceEEechhhhhhhhhcCCCchhhhHHhhhhhcceeeee---eccccc--ceeeecCCe----EEEeeChhhhcccccc Q lcl|NC_011269. 223 DSSRLLANPQEYRDLYRWDINTTGWAFKDSVVAGERIVQF---GEFQIG--KSIIIPRGT----VYLTPEPEFLGVFPVM 293 (333) Q Consensus 223 ~at~il~~~~~~~Di~gw~~N~~~~~~~DpV~~~e~il~~---G~fgi~--~skvlprge----iyvvadpE~~G~~pvR 293 (333) +...++||++-|..|..=- +..| .|+-+.+ +.+. -++|.+ .+--+|-+. .+++.|.-..-.+.+| T Consensus 290 ~~~~~v~n~~~~~~L~~lk-d~~G----~~i~~~~-~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~ 363 (415) T protein:vir:46 290 EHNVAIVSQTMFAKLDKMK-DKLG----NYLIQPD-VKEKTQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKDAIVLFDR 363 (415) T ss_pred CCCEEEEcHHHHHHHHHhh-ccCC----CeeeccC-cCCCCCccccceeeEEeccccccCCCccEEEEEehhccEEEEee Confidence 9999999999999887611 2111 2332222 1111 124544 222345443 4677787765667889 Q ss_pred cCceeccccchhhhccceehhhhhhhhhhccceEEEEecC Q lcl|NC_011269. 294 YSLDVEEDNKVERFNKGWVMDELVGMAILNPRGIVILRKA 333 (333) Q Consensus 294 ~~L~s~p~D~~er~~kGWvm~E~~g~~i~N~~siv~~~~~ 333 (333) +++.+.-.|... +.++-..++-++..+.||.+++.+... 
T Consensus 364 ~~~~v~~~~~~~-~~~~~~~~~r~d~~v~~~~a~~~~~~~ 402 (415) T protein:vir:46 364 SQYQASWTDYMH-FGECLMIAVRQDCRILDYKSAIVIEYD 402 (415) T ss_pred cceEEEeecccc-CceEEEEEEEeccEEeccccEEEEEee Confidence 999988777433 234456667899999999999999755 No 45 >protein:vir:4700 Length: 415 # NCBI annotation: phi PVL ORF 7 homologue # Family: family:all:21 # MgeID: mge:102 # MgeName: phiPV83 # Cross-refs: genbank:acc:NP_061632;genbank:gi:9635719;genbank:GeneID:1262976 Probab=98.46 E-value=3.2e-08 Score=61.65 Aligned_cols=311 Identities=10% Similarity=0.025 Sum_probs=188.0 Q ss_pred Ccccchhh-hhhhhhhcccchHHHHHHHHHHhhcchhcchHHHHHHHHHH----------hcCchhHHHHHHHHHHHHHH Q lcl|NC_011269. 1 MTLPVAVG-SGLGRFAKASDDYVADIVEAKQRMGGRKLSAREKQAKLAHI----------LSDKVGGIQRLGQSMIGPIQ 69 (333) Q Consensus 1 ~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~ls~ee~~~Lm~~A----------l~~~Eg~~~aLg~~mA~pI~ 69 (333) ...+..-. ..-+...+...+ ....-.......+..++.+++....... ..+..|.. .+-+-+++.|- T Consensus 65 ~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~~g~~-~iP~~~~~~ii 142 (415) T protein:vir:47 65 DRTSENNQQSVEVNEARTYRN-QANINDLGISIQNTKVTSQEVRDFTEYLETRNDIQGGSLKTDSGFV-VIPEEIVTDIL 142 (415) T ss_pred HHhhhhcccccccchhhhhHH-HHHHHHHHHhhhhhhhhHHHHHHHHHHHhhhhhhhhccccccCCcc-cccHHHHHHHH Confidence 00000000 000000000000 1111112222333444444554443322 12333432 45677888899 Q ss_pred HHHhhhhhhhhhhhccccCCCc--ceeecCCCCccceEEEEcCCCcccce-eecCceeeccceeeeccccccHHHhhhhc Q lcl|NC_011269. 70 LQLRYQGILRNVLLEDTLTPGV--PIQYDVLDDLGQAYMLHGNEGEIRIT-PFEGKRIEVQLFRIASFPQIKKEDLYYLR 146 (333) Q Consensus 70 ~q~~rqGi~RklL~~~TL~~G~--~p~y~v~~~v~~a~~~~~~~G~i~~Q-~i~~~ri~~P~f~Ivs~P~V~~~dl~~~~ 146 (333) ..++....++++....+++.|. .|+....+ ...+.|++.-+++++. ...=+.|++.-..+.....|..+-|.... 
T Consensus 143 ~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~v~Eg~~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~ 220 (415) T protein:vir:47 143 KLKEVEFNLDKYVTVKRVTNGSGKYPVVRQSE--VAALEKVEELEENPELAVKPFFQLAYDINTHRGYFRISREAIEDAK 220 (415) T ss_pred HHHHhhhhhhhhcceeeccCCceeEEEEEecC--CcceeecccccccccccccceeeEEeeeeeeEeeehhhHHHHhhch Confidence 9999999999999888887665 44443222 2345577777778753 33347889999999999999999999999 Q ss_pred chhHHHHHHHHHHHHHHHhhhHHHHHHhhhhhhhhhhcccccccccCCCcc----eEEeeccccHHHHHHHHHHHHhhCC Q lcl|NC_011269. 147 SNIVEYTQDMTKQAIMRQEDSRLVTLLEAAAVSYRVVDSSAQPGVGALPNE----ITIAGSHLMPDDLYTAVTYTDQRQL 222 (333) Q Consensus 147 ~~vle~~q~~A~qaIM~qED~~~~slle~~a~~~r~~~ssA~p~vg~~~N~----i~i~~g~Lt~~~L~~a~t~v~~~~L 222 (333) .++..+.+++..++|-+-+|..+++-+-. ..|..+...+. ..-..+..+-++|-.++..+.+-.. T Consensus 221 ~~l~~~i~~~l~~~i~~~~d~~il~g~g~-----------g~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~ 289 (415) T protein:vir:47 221 VNVLQELKLWMARTIAATRNKAIIDVITK-----------GSTGSTSSGFEKEGKKLEVKKAKSLDDIKDAINLNVKPNY 289 (415) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhcccc-----------CCccccccccccccceeccccccchHHHHHHHHhhhhhcc Confidence 99999999999999999999877665411 11111111111 1123345677889999988888888 Q ss_pred ccceEEechhhhhhhhhcCCCchhhhHHhhhhhcceeeee---eccccc--ceeeecCCe----EEEeeChhhhcccccc Q lcl|NC_011269. 223 DSSRLLANPQEYRDLYRWDINTTGWAFKDSVVAGERIVQF---GEFQIG--KSIIIPRGT----VYLTPEPEFLGVFPVM 293 (333) Q Consensus 223 ~at~il~~~~~~~Di~gw~~N~~~~~~~DpV~~~e~il~~---G~fgi~--~skvlprge----iyvvadpE~~G~~pvR 293 (333) +...++||++-|..|..=- +..| .|+-+.+ +.+. -++|.+ .+--+|-+. 
.+++.|.-..-.+.+| T Consensus 290 ~~~~~v~n~~~~~~L~~lk-d~~G----~~i~~~~-~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~ 363 (415) T protein:vir:47 290 EHNVAIVSQTMFAKLDKMK-DKLG----NYLIQPD-VKEKTQQRLLGAKIEILPDEVLGQKGNNTLIIGNLKDAIVLFDR 363 (415) T ss_pred CCCEEEEcHHHHHHHHHhh-ccCC----CeeeccC-cCCCCCccccceeeEEeccccccCCCccEEEEEehhccEEEEee Confidence 9999999999999887611 2111 2332222 1111 124544 222345443 4677787765667889 Q ss_pred cCceeccccchhhhccceehhhhhhhhhhccceEEEEecC Q lcl|NC_011269. 294 YSLDVEEDNKVERFNKGWVMDELVGMAILNPRGIVILRKA 333 (333) Q Consensus 294 ~~L~s~p~D~~er~~kGWvm~E~~g~~i~N~~siv~~~~~ 333 (333) +++.+.-.|... +.++-..++-++..+.||.+++.+... T Consensus 364 ~~~~v~~~~~~~-~~~~~~~~~r~d~~v~~~~a~~~~~~~ 402 (415) T protein:vir:47 364 SQYQASWTDYMH-FGECLMIAVRQDCRILDYKSAIVIEYD 402 (415) T ss_pred cceEEEeecccc-CceEEEEEEEeccEEeccccEEEEEee Confidence 999988777433 234456667899999999999999755 No 46 >protein:vir:9759 Length: 303 # NCBI annotation: putative structural protein # Family: family:all:966 # MgeID: mge:175 # MgeName: 315.3 # Cross-refs: genbank:acc:NP_795521;genbank:gi:28876283;genbank:GeneID:1257824 Probab=98.46 E-value=7.7e-09 Score=65.07 Aligned_cols=273 Identities=8% Similarity=0.016 Sum_probs=172.0 Q ss_pred HHHHhcCchhHHHHHHHHHHHHHHHHHhhhhhhhhhhhccccCCCcceeecCCCCccceEEEEcCCCcccceeecCceee Q lcl|NC_011269. 46 LAHILSDKVGGIQRLGQSMIGPIQLQLRYQGILRNVLLEDTLTPGVPIQYDVLDDLGQAYMLHGNEGEIRITPFEGKRIE 125 (333) Q Consensus 46 m~~Al~~~Eg~~~aLg~~mA~pI~~q~~rqGi~RklL~~~TL~~G~~p~y~v~~~v~~a~~~~~~~G~i~~Q~i~~~ri~ 125 (333) |. -. ..|.. .+=..+++.|-+.++.+...|++....+++.|.. .||+...-. 
.+.|++.-++++.....=+.++ T Consensus 1 m~--t~-t~gg~-liP~~~~~~ii~~l~~~s~i~~l~~~~~~~~~~~-~ip~~~~~~-~a~wv~E~~~~~~s~~~f~~v~ 74 (303) T protein:vir:97 1 MG--TE-TSKAS-LFDKHLVSDLINKVKGHSSLAKLSSQKPIPFNGS-KEFTFTLDS-DIDVVAENGKKTHGGLSLEPVT 74 (303) T ss_pred Cc--cc-CCCCe-EcchhHHHHHHHHHHhhchhhhhcceeecCCCce-EEEEEecCc-ceEEeecCccccccccceeeEE Confidence 21 12 23332 5666777888888889999999999998886542 444422222 3467876677776666557888 Q ss_pred ccceeeeccccccHHHhh---hhcchhHHHHHHHHHHHHHHHhhhHHHHHHhhhhhhhhhhcccccc--cc-cCCCcceE Q lcl|NC_011269. 126 VQLFRIASFPQIKKEDLY---YLRSNIVEYTQDMTKQAIMRQEDSRLVTLLEAAAVSYRVVDSSAQP--GV-GALPNEIT 199 (333) Q Consensus 126 ~P~f~Ivs~P~V~~~dl~---~~~~~vle~~q~~A~qaIM~qED~~~~slle~~a~~~r~~~ssA~p--~v-g~~~N~i~ 199 (333) ++-.++.....+..+-|+ ....++..+...+..++|.+.+|.-.++=.++..-.- ....+ .+ +...|..+ T Consensus 75 l~~~kl~~~~~iS~ell~~~~d~~~~l~~~i~~~la~a~~~~ld~a~l~G~~~~~g~~----~~~~~~~~~~~~~~~~~~ 150 (303) T protein:vir:97 75 IVPIKVEYGARLSDEFLYATEEEKIDILKAFNEGFAKKLARGIDLMAMHGINPRTKKA----SDVIGTNHFDSKVTQVVK 150 (303) T ss_pred eeeEEEEEeehhhHHHhhcCccchHHHHHHHHHHHHHHHHHHHHhhhhcccccCCccc----cccccccccccccccccc Confidence 898999999888888775 4567899999999999999999965554321111000 00000 00 01113334 Q ss_pred EeeccccHHHHHHHHHHHHhhCCccceEEechhhhhhhhhcCCCchhhhHHhhhhhcceee--ee-ecccccc--eeeec Q lcl|NC_011269. 200 IAGSHLMPDDLYTAVTYTDQRQLDSSRLLANPQEYRDLYRWDINTTGWAFKDSVVAGERIV--QF-GEFQIGK--SIIIP 274 (333) Q Consensus 200 i~~g~Lt~~~L~~a~t~v~~~~L~at~il~~~~~~~Di~gw~~N~~~~~~~DpV~~~e~il--~~-G~fgi~~--skvlp 274 (333) ..++.-+-+++.+++..+...+.....++||+..|..|+.= -++.+ .|+-+-++-. +. 
-++|.+- +--|| T Consensus 151 ~~~~~~~~~~i~~~~~~~~~~~~~~~~~vmn~~~~~~L~~l-kd~~g----~~~~~~~~~~~~~~~~l~G~Pv~~s~~v~ 225 (303) T protein:vir:97 151 FTESEDADANIEAAVNLIQGAEGVVTGLAMDTEFSTALAKV-TNGEM----GPKMYPELAWGANPDSINGLKSSVNTTVG 225 (303) T ss_pred cccccchHHHHHHHHHHHhhcCCCccEEEEcHHHHHHHHHh-hccCC----CeEEecCccCCCCCceecceeeEEecccC Confidence 44455566899999999988889999999999999998751 01111 1111111111 01 1456542 33355 Q ss_pred CCe-------EEEeeChhhhcccccccCceeccccch----------hhhccceehhhhhhhhhhccceEEEEecC Q lcl|NC_011269. 275 RGT-------VYLTPEPEFLGVFPVMYSLDVEEDNKV----------ERFNKGWVMDELVGMAILNPRGIVILRKA 333 (333) Q Consensus 275 rge-------iyvvadpE~~G~~pvR~~L~s~p~D~~----------er~~kGWvm~E~~g~~i~N~~siv~~~~~ 333 (333) -+. .+++-|=.....+-+|+++..+-.++. ++--.++...+-++.++.||-++|.|.++ T Consensus 226 ~~~~~~~~~~~~~~Gdf~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~n~~~~r~~~r~~~~v~~p~af~~l~~~ 301 (303) T protein:vir:97 226 AGADEAESKDLVIIGDFESMFKWGYAKQIPMEIIKYGDPDNSGKDLKGYNQIYLRAEAYIGWGILDAKSFARVTKG 301 (303) T ss_pred CccccCCCccEEEEeeccccEEEEEecCcEEEEeeccCCCCcchhhhhcCcEEEEEEEEeccEeecccceEEeeCC Confidence 322 233444222345667888887654432 22245677889999999999999999999 No 47 >protein:vir:104256 Length: 458 # NCBI annotation: major head protein precursor # Family: family:all:27070 # MgeID: mge:1504 # MgeName: T5 # Cross-refs: genbank:acc:YP_006977;genbank:gi:46401878;genbank:GeneID:2777673 Probab=98.46 E-value=2e-08 Score=62.83 Aligned_cols=311 Identities=12% Similarity=0.057 Sum_probs=176.8 Q ss_pred CcccchhhhhhhhhhcccchHHHHHHH-HH-HhhcchhcchHHHHHHHHHH----hcCchhHHHHHHHHHHHHHHHHHhh Q lcl|NC_011269. 1 MTLPVAVGSGLGRFAKASDDYVADIVE-AK-QRMGGRKLSAREKQAKLAHI----LSDKVGGIQRLGQSMIGPIQLQLRY 74 (333) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~-~~~~~~~ls~ee~~~Lm~~A----l~~~Eg~~~aLg~~mA~pI~~q~~r 74 (333) --.+.........-......+..+.-. +. +.+-.+....+++...-..+ -..+.|. -.+-..+.+.|.+.++. 
T Consensus 110 e~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~g~-~~ip~~~~~~ii~~~~~ 188 (458) T protein:vir:10 110 EGRSFVGDSVAKALYGTQENFEDEVEKLVLLSYVMEKGVFETEHGQRHLKAVNQSSSVEVSS-ESYETIFSQRIIRDLQK 188 (458) T ss_pred HhhhhhhhhhhccchhhhhhHHHHHHHHHHHHHHHhhccchhhhhhhhhhhhhhcccCcccc-ceehhhHhHHHHHHHHh Confidence 000000000000000000001111100 00 00111111111111111111 1222333 24556678889999999 Q ss_pred hhhhhhhhhccccCCCcceeecCCCCccceEEEEcCCCcccceeec------CceeeccceeeeccccccHHHhhhhcch Q lcl|NC_011269. 75 QGILRNVLLEDTLTPGVPIQYDVLDDLGQAYMLHGNEGEIRITPFE------GKRIEVQLFRIASFPQIKKEDLYYLRSN 148 (333) Q Consensus 75 qGi~RklL~~~TL~~G~~p~y~v~~~v~~a~~~~~~~G~i~~Q~i~------~~ri~~P~f~Ivs~P~V~~~dl~~~~~~ 148 (333) ...++++....+++.|. ..|++..... .+.|.+..+.+..+.+. =+.|++...++..++.|..+-|.....+ T Consensus 189 ~~~l~~~~~~~~~~~~~-~~~~~~~~~~-~a~~v~e~~~~~~~~~~~~~~~~~~~i~~~~~k~~~~v~is~ell~ds~~~ 266 (458) T protein:vir:10 189 ELVVGALFEELPMSSKI-LTMLVEPDAG-KATWVAASTYGTDTTTGEEVKGALKEIHFSTYKLAAKSFITDETEEDAIFS 266 (458) T ss_pred hhhHHhhcceeecCCcc-eEEEEecCCc-ceeecccccccccccccccccccceeeEeeeeeEEeeehhhHHHHhcchHH Confidence 99999999988887654 4555544433 34566655555544321 1567888889999999999999999999 Q ss_pred hHHHHHHHHHHHHHHHhhhHHHHHHhhhhhhhhhhcccccccccCCC-------cceE-E---eeccccHHHHHHHHHHH Q lcl|NC_011269. 149 IVEYTQDMTKQAIMRQEDSRLVTLLEAAAVSYRVVDSSAQPGVGALP-------NEIT-I---AGSHLMPDDLYTAVTYT 217 (333) Q Consensus 149 vle~~q~~A~qaIM~qED~~~~slle~~a~~~r~~~ssA~p~vg~~~-------N~i~-i---~~g~Lt~~~L~~a~t~v 217 (333) +..+..+...++|.+-+|.-+++ +-. +.+| .|.+. +.++ . 
..+.+|.++|..++..+ T Consensus 267 ~~~~i~~~l~~~i~~~~d~~~l~---G~G--------~~~p-~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~l 334 (458) T protein:vir:10 267 LLPLLRKRLIEAHAVSIEEAFMT---GDG--------SGKP-KGLLTLASEDSAKVVTEAKADGSVLVTAKTISKLRRKL 334 (458) T ss_pred HHHHHHHHHHHHHHHHHHHHhhc---CCC--------CCcc-ceeeecccccccceeecccccccccccHHHHHHHHHhh Confidence 99999999999999999965542 221 1122 12210 1111 1 12346789999999999 Q ss_pred HhhCCccceEEechhhhhhhhhcCCC-chhhhHHhhhhhcce---eeee---eccccc--ceeeecCC----eEEEeeCh Q lcl|NC_011269. 218 DQRQLDSSRLLANPQEYRDLYRWDIN-TTGWAFKDSVVAGER---IVQF---GEFQIG--KSIIIPRG----TVYLTPEP 284 (333) Q Consensus 218 ~~~~L~at~il~~~~~~~Di~gw~~N-~~~~~~~DpV~~~e~---il~~---G~fgi~--~skvlprg----eiyvvadp 284 (333) .........++||+..|..|.. +- ..| .|+-+... ..++ -++|.+ .+..||-+ .|++.... T Consensus 335 ~~~~~~~~~~v~~~~~~~~l~~--lkd~~G----~~i~~~~~~~~~~~~~~~~l~G~pv~~~~~~p~~~~~~~~~~~~f~ 408 (458) T protein:vir:10 335 GRHGLKLSKLVLIVSMDAYYDL--LEDEEW----QDVAQVGNDSVKLQGQVGRIYGLPVVVSEYFPAKANSAEFAVIVYK 408 (458) T ss_pred hhhhcCCCEEEEcHHHHHHHHh--hcccCC----ceeeccccccccccCcCceecceeeEEccccccccCCcceEEEEec Confidence 9999999999999999998865 21 111 12211110 1101 134544 34446664 45554443 Q ss_pred hhhcccccccCceeccccchhhhccceehhhhhhhhhhccceEEEEecC Q lcl|NC_011269. 
285 EFLGVFPVMYSLDVEEDNKVERFNKGWVMDELVGMAILNPRGIVILRKA 333 (333) Q Consensus 285 E~~G~~pvR~~L~s~p~D~~er~~kGWvm~E~~g~~i~N~~siv~~~~~ 333 (333) .+ -.+-+|.|+++.-+++...-..++...+-+|+++.+|-++|....| T Consensus 409 ~~-~~~~~~~~~~v~~d~~~~~~~~~~~~~~r~~~~v~~~~a~v~~~~a 456 (458) T protein:vir:10 409 DN-FVMPRQRAVTVERERQAGKQRDAYYVTQRVNLQRYFANGVVSGTYA 456 (458) T ss_pred cc-EEEEEeeceEEEeecccCCCceEEEEEEEecceEecccceEEEeec Confidence 42 3578899999987666543345667778899999999999998888 No 48 >protein:vir:7771 Length: 330 # NCBI annotation: gp17 # Family: family:all:507 # MgeID: mge:149 # MgeName: Bxz2 # Cross-refs: genbank:acc:NP_817605;genbank:gi:29566035;genbank:GeneID:1259229 Probab=98.44 E-value=4.8e-09 Score=66.20 Aligned_cols=280 Identities=11% Similarity=-0.019 Sum_probs=181.4 Q ss_pred cchHHHHHHHHHHhcCchhHHHHHHHHHHHHHHHHHhhhhhhhhhhhccccCCCcceeecCCCCccceEEEEcCCCcccc Q lcl|NC_011269. 37 LSAREKQAKLAHILSDKVGGIQRLGQSMIGPIQLQLRYQGILRNVLLEDTLTPGVPIQYDVLDDLGQAYMLHGNEGEIRI 116 (333) Q Consensus 37 ls~ee~~~Lm~~Al~~~Eg~~~aLg~~mA~pI~~q~~rqGi~RklL~~~TL~~G~~p~y~v~~~v~~a~~~~~~~G~i~~ 116 (333) |+-++.++.+. ..+..++ -.+-..+++.|-+.++.+...+++....++..|. ..||+...... +.|.+-.+++.+ T Consensus 1 m~~~~~~a~~~--~~t~~~g-~~i~~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~-~~~p~~~~~~~-a~~v~Eg~~~~~ 75 (330) T protein:vir:77 1 MAGSTVPSTQV--ALTGDFS-AFLTPEQSQDYFAEIEKTSIVQRIARKVPMGPTG-ISIPHWTGAVS-ASWTGEAERKPI 75 (330) T ss_pred Ccccccchhhc--cccCCCc-ceechhHHHHHHHHHHhccchhhhcceeeccCCc-eEEEEEcCCcc-eeEecCCCcccc Confidence 55555555543 2233333 2566777888888899999999999988877765 34666544444 457777777887 Q ss_pred eeecCceeeccceeeeccccccHHHhhhhcchhHHHHHHHHHHHHHHHhhhHHHHHHhhhhhhhhhhcccccccccCCC- Q lcl|NC_011269. 117 TPFEGKRIEVQLFRIASFPQIKKEDLYYLRSNIVEYTQDMTKQAIMRQEDSRLVTLLEAAAVSYRVVDSSAQPGVGALP- 195 (333) Q Consensus 117 Q~i~~~ri~~P~f~Ivs~P~V~~~dl~~~~~~vle~~q~~A~qaIM~qED~~~~slle~~a~~~r~~~ssA~p~vg~~~- 195 (333) ....=+.+++.-.++..+..|..+-|+....++..+..++..++|-..+|.-+++ +-- +..|..|.+. 
T Consensus 76 ~~~~f~~i~~~~~k~~~~~~is~ell~ds~~~~~~~i~~~l~~ai~~~~~~~~l~---G~g--------~~~~~~g~~~~ 144 (330) T protein:vir:77 76 TKGSFGKQELEPVKITTIFAESAEVVRLNPLNYLNTMRTKIAEAIALKFDAAAIH---GID--------KPSAFKGYLAE 144 (330) T ss_pred ccceeeEEEEeEEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhc---ccC--------CCCcccccccc Confidence 7777788999999999999999999999999999999999999999999965552 111 1111111110 Q ss_pred ----------cceEEeec-cccHHHHHHHHHHHHhhCCccceEEechhhhhhhhhcCCCchhhhHHhhhhhccee-ee-e Q lcl|NC_011269. 196 ----------NEITIAGS-HLMPDDLYTAVTYTDQRQLDSSRLLANPQEYRDLYRWDINTTGWAFKDSVVAGERI-VQ-F 262 (333) Q Consensus 196 ----------N~i~i~~g-~Lt~~~L~~a~t~v~~~~L~at~il~~~~~~~Di~gw~~N~~~~~~~DpV~~~e~i-l~-~ 262 (333) +..+..+. ...-++|..++..+.+.+.+....+||++.|..|+.---..-.|.+.+....+... .+ . T Consensus 145 ~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~vmn~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~~~~~ 224 (330) T protein:vir:77 145 TTKVVSLADTNLTTASGPQGNAYLAVNNALSLLVNSGKKWTGTLLDNVTEPILNTAVDGNGRPLFVESTYTEQVGAIREG 224 (330) T ss_pred ccccceeecccccccccccchhHHHHHHHHHhhhhcCCCccEEEEcHHHHHHHHHHhccCCceeecCccccccccccCCc Confidence 11221111 12236788888888889999999999999999998621111112222222111100 01 1 Q ss_pred eccccc--ceeeecCCe-----EEEeeChhhhcccccccCceeccccc--------------------hhhhccceehhh Q lcl|NC_011269. 263 GEFQIG--KSIIIPRGT-----VYLTPEPEFLGVFPVMYSLDVEEDNK--------------------VERFNKGWVMDE 315 (333) Q Consensus 263 G~fgi~--~skvlprge-----iyvvadpE~~G~~pvR~~L~s~p~D~--------------------~er~~kGWvm~E 315 (333) -++|.+ .+-.||-|+ +.++.|..+. .+-.|+|+.++-.+. .++-...|.... T Consensus 225 ~l~G~PV~~~~~~p~~~~~~~~~~~~gd~s~~-~i~~~~~~~i~~~~e~~~~~~~~~~~~~~~~~~~~f~~~~~~~r~~~ 303 (330) T protein:vir:77 225 RILGRPTYVADNVVNGTVGNRVVGVMGDFSQV-IWGQIGGLSFDVTDQATLDFGEEQGGVWVPKLISLWQHNMVAVRCEA 303 (330) T ss_pred eecceeeEEeccccCCCCCCccEEEEEecceE-EEEEecCcEEEEeecceeeecccccccccccccchhhcCcEEEEEEE Confidence 124544 444566654 3445566543 577888887764332 223356788888 Q ss_pred hhhhhhhccceEEEEecC Q lcl|NC_011269. 
316 LVGMAILNPRGIVILRKA 333 (333) Q Consensus 316 ~~g~~i~N~~siv~~~~~ 333 (333) -++.++.+|.+++.+..+ T Consensus 304 r~d~~v~~~~a~~~i~~~ 321 (330) T protein:vir:77 304 EFAFMVNDKDAFVKLTDQ 321 (330) T ss_pred EeccEEecccceEEEEec Confidence 999999999999998777 No 49 >protein:vir:4511 Length: 409 # NCBI annotation: capsid # Family: family:all:21 # MgeID: mge:97 # MgeName: V # Cross-refs: genbank:acc:NP_599037;genbank:gi:19548995;genbank:GeneID:935211 Probab=98.43 E-value=2.5e-08 Score=62.26 Aligned_cols=317 Identities=13% Similarity=0.114 Sum_probs=188.2 Q ss_pred CcccchhhhhhhhhhcccchHHHHHHHHHHhhcchhcchHHHHHHH-HHHh---cCchhHHHHHHHHHHHHHHHHHhhhh Q lcl|NC_011269. 1 MTLPVAVGSGLGRFAKASDDYVADIVEAKQRMGGRKLSAREKQAKL-AHIL---SDKVGGIQRLGQSMIGPIQLQLRYQG 76 (333) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ls~ee~~~Lm-~~Al---~~~Eg~~~aLg~~mA~pI~~q~~rqG 76 (333) ...+..-....+.-.+..+..-....+..-|.|...++.++++.+- .+++ .+..|+. .+-.-+.+.|...++.+. T Consensus 67 ~~~~~~~~~~~~~~~~~~~~~~~~a~~~~l~~~~~~~~~~e~~~~~~~~a~~~~~~~~gg~-liP~~~~~~ii~~~~~~~ 145 (409) T protein:vir:45 67 SNEEEQRQNLDPENNSQQDEKRAQVFDKWMRHGASELTSEERKALRELRAQGVAQDEKGGY-TVPETFLAKVVEKMKSYG 145 (409) T ss_pred hhhhhhcccCCCCCcchhhHHHHHHHHHHHHhhhhhccHHHHHHHHHHhhccCccCcCCce-eccHhHHHHHHHHHHhhh Confidence 1011000000000001111112222333345566788888887753 2233 2344442 456778888999999999 Q ss_pred hhhhhhhccccCCCcceeecCCCCccceEEEEcCCCcccceeecCceeeccceeeec-cccccHHHhhhhcchhHHHHHH Q lcl|NC_011269. 
77 ILRNVLLEDTLTPGVPIQYDVLDDLGQAYMLHGNEGEIRITPFEGKRIEVQLFRIAS-FPQIKKEDLYYLRSNIVEYTQD 155 (333) Q Consensus 77 i~RklL~~~TL~~G~~p~y~v~~~v~~a~~~~~~~G~i~~Q~i~~~ri~~P~f~Ivs-~P~V~~~dl~~~~~~vle~~q~ 155 (333) .+|++....++..|....++........+.|++--++++.+.+.-+.+++...++++ ...|..+-|.....|+..+..+ T Consensus 146 ~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~E~~~~~~~~~~f~~~~l~~~k~~~~~i~is~ell~ds~~~l~~~i~~ 225 (409) T protein:vir:45 146 GIASVAQILTTSDGRTMEWATADGTSEVGVLLGENEEAGEEDTDFGMGSLGALKMTSKIIRVSNELLQDSAIDMEAYLAR 225 (409) T ss_pred hhhhhceeeecCCCceEEEEeeccCccccccccccccccccccccceeeeeeeeeeeeehhhhHHHHhccHHHHHHHHHH Confidence 999999888888887666666555544556777777888888877888888888865 5789999999988999999999 Q ss_pred HHHHHHHHHhhhHHHHHHhhhhhhhhhhcccccccccCCCc----ceEEeeccccHHHHHHHHHHHHhhCC-ccc-eEEe Q lcl|NC_011269. 156 MTKQAIMRQEDSRLVTLLEAAAVSYRVVDSSAQPGVGALPN----EITIAGSHLMPDDLYTAVTYTDQRQL-DSS-RLLA 229 (333) Q Consensus 156 ~A~qaIM~qED~~~~slle~~a~~~r~~~ssA~p~vg~~~N----~i~i~~g~Lt~~~L~~a~t~v~~~~L-~at-~il~ 229 (333) +-.++|-+.||.-+++ +-.+. +.-.| .|-+.+ .-+...+.++.++|-.++..+..-.- .++ .++| T Consensus 226 ~la~a~~~~~~~a~l~---G~G~~-----~~~~p-~Gil~~~~~~~~~~~~~~~~~d~i~~l~~~l~~~~~~~a~~~~~~ 296 (409) T protein:vir:45 226 RIAERIGRGEARYLIQ---GTGAG-----TPKQP-KGLAASVTGTTQTAAANAVKWQEILALKHSIDPAYRRGPKFRLAF 296 (409) T ss_pred HHHHHHHHHHHHHhhc---cCCCC-----Ccccc-ceeeeccccccccccccccchHHHHHHHHhhhhhhccCCeEEEEE Confidence 9999999999966553 11100 00011 111110 12234566788888888776643322 222 3578 Q ss_pred chhhhhhhhhcCCCchhhhHHhhhhhcceeee--eeccccc--ceeeecC----CeEEEeeChhhhcccccccCceeccc Q lcl|NC_011269. 230 NPQEYRDLYRWDINTTGWAFKDSVVAGERIVQ--FGEFQIG--KSIIIPR----GTVYLTPEPEFLGVFPVMYSLDVEED 301 (333) Q Consensus 230 ~~~~~~Di~gw~~N~~~~~~~DpV~~~e~il~--~G~fgi~--~skvlpr----geiyvvadpE~~G~~pvR~~L~s~p~ 301 (333) |+.-|.-|+.=- +..+ .|+-+.+..-. .=++|.+ .+--||- +..++..|.... .+-.++++.++.. 
T Consensus 297 n~~~~~~l~~lk-d~~G----~~i~~~~~~~~~~~~l~G~PV~~~~~~p~~~~~~~~i~~Gd~~~~-~i~~~~~~~~~~~ 370 (409) T protein:vir:45 297 NDNTLKLISEME-DGQG----RPLWLPDIVGVAPASVLNVPYVIDQEIDDIGAGKKFMFCGDFDRF-IIRRVRYMILKRL 370 (409) T ss_pred CHHHHHHHHHhh-cCCC----ceeeccCcCCCCCceecceeeEEecCcCCccCCccEEEEeehhhh-heeeccceEEEEe Confidence 998888776510 2222 22222221100 0135544 3333443 233455676643 3557778766543 Q ss_pred -cchhh-hccceehhhhhhhhhhccceEEEEecC Q lcl|NC_011269. 302 -NKVER-FNKGWVMDELVGMAILNPRGIVILRKA 333 (333) Q Consensus 302 -D~~er-~~kGWvm~E~~g~~i~N~~siv~~~~~ 333 (333) |.-+. -..+..+.+-++..+.||.+++++-.+ T Consensus 371 ~d~~~~~~~~~~~~~~r~d~~~~~~~A~~~l~~k 404 (409) T protein:vir:45 371 VERYAEYDQTGFLAFHRFDCILEDTSAIKALVGK 404 (409) T ss_pred ecccccCCcEEEEEEEEeccEeechhheEEEEec Confidence 32222 356788889999999999999998775 No 50 >protein:vir:4456 Length: 401 # NCBI annotation: Major capsid protein precursor # Family: family:all:21 # MgeID: mge:96 # MgeName: ST64B # Cross-refs: genbank:acc:NP_700379;genbank:gi:23505451;genbank:GeneID:955658 Probab=98.43 E-value=1.2e-08 Score=64.07 Aligned_cols=312 Identities=14% Similarity=0.085 Sum_probs=180.6 Q ss_pred CcccchhhhhhhhhhcccchHHHHHHHHHHhhcchhcchHHHHHHHHHHhcCchhHHHHHHHHHHHHHHHHHhhhhhhhh Q lcl|NC_011269. 1 MTLPVAVGSGLGRFAKASDDYVADIVEAKQRMGGRKLSAREKQAKLAHILSDKVGGIQRLGQSMIGPIQLQLRYQGILRN 80 (333) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ls~ee~~~Lm~~Al~~~Eg~~~aLg~~mA~pI~~q~~rqGi~Rk 80 (333) ..-|. .|.-.+.+++|-.-+...-...-...++..|.+++ + .-.++.|+. 
.+-+-+.+.|.+.++....+++ T Consensus 68 ~~~~~-----~~~~~~~~~e~~~a~~~~lr~~~~~~~~~~e~~a~-~-~~~~~~GG~-~iP~~~~~~ii~~~~~~~~l~~ 139 (401) T protein:vir:44 68 LKRPA-----RGAQNKVAAEHKDAFVGFLRKGREDGLRDLERKAL-Q-VGTDEDGGY-AVPEELDRSILSLLKDEVVMRQ 139 (401) T ss_pred hhccc-----cccccchhHHHHHHHHHHHhhhhhhhhHHHHHHHh-h-cCCCCCCce-eccHhHHHHHHHHHHhhhhhhh Confidence 11111 01112222222211111110100112222222211 1 111234442 5567788889999999999999 Q ss_pred hhhccccCCCcceeecCCCCccceEEEEcCCCcccceee-cCceeeccceeeeccccccHHHhhhhcchhHHHHHHHHHH Q lcl|NC_011269. 81 VLLEDTLTPGVPIQYDVLDDLGQAYMLHGNEGEIRITPF-EGKRIEVQLFRIASFPQIKKEDLYYLRSNIVEYTQDMTKQ 159 (333) Q Consensus 81 lL~~~TL~~G~~p~y~v~~~v~~a~~~~~~~G~i~~Q~i-~~~ri~~P~f~Ivs~P~V~~~dl~~~~~~vle~~q~~A~q 159 (333) +....+...+.. .+++...-. .+-|.+-.+.++.+.. .-+.|++.-..+..++.|..+-|.....|+..+...+-.+ T Consensus 140 ~~~~~~~~~~~~-~~~~~~~~~-~a~wv~E~~~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~la~ 217 (401) T protein:vir:44 140 EATVITVGGSDY-KKLVNLGGT-ASGWVGETDTRSQTATSRLGLIEPFMGEIYGNPQATQKMLDDAFFNVEAWINSELAT 217 (401) T ss_pred hceeeecCCCce-EEEEecCCc-cceeeccccccCccccccceeeeeehhheeeehhhhHHHHhcchHHHHHHHHHHHHH Confidence 988877765543 344322322 2335554444554432 4477888888899999999999999999999999999999 Q ss_pred HHHHHhhhHHHH---------HHhhhhhhhhhhcccccccccCCCcceEEeeccccHHHHHHHHHHHHhhCCccceEEec Q lcl|NC_011269. 160 AIMRQEDSRLVT---------LLEAAAVSYRVVDSSAQPGVGALPNEITIAGSHLMPDDLYTAVTYTDQRQLDSSRLLAN 230 (333) Q Consensus 160 aIM~qED~~~~s---------lle~~a~~~r~~~ssA~p~vg~~~N~i~i~~g~Lt~~~L~~a~t~v~~~~L~at~il~~ 230 (333) +|-+.+|.-+++ +|... 
.........+.|...+..+-.++.++-+++..++..+..-.......+|| T Consensus 218 ai~~~~~~~~l~G~G~~~p~Gil~~~----~~~~~~~~~~~~~~~~~~t~~~~~~~~d~i~~~~~~l~~~~~~~a~~v~n 293 (401) T protein:vir:44 218 EFAEQEEIAFTTGDGTKKPKGFLAYE----STEESDKARAFGKLQHIVSGEATAVTADAIIKLIYTLRKAHRTGAKFMMN 293 (401) T ss_pred HHHHHHHhhhhccCCCCccceeeccc----cccccccccccccccccccccccccCHHHHHHHHHhcchhhhcCCEEEEc Confidence 999999966553 11111 11111111112222233444567788889988888776655566679999 Q ss_pred hhhhhhhhhcCCCchhhhHHhhhhhcceeeeee----cccccc--eeeecC----CeEEEeeChhhhcccccccCceecc Q lcl|NC_011269. 231 PQEYRDLYRWDINTTGWAFKDSVVAGERIVQFG----EFQIGK--SIIIPR----GTVYLTPEPEFLGVFPVMYSLDVEE 300 (333) Q Consensus 231 ~~~~~Di~gw~~N~~~~~~~DpV~~~e~il~~G----~fgi~~--skvlpr----geiyvvadpE~~G~~pvR~~L~s~p 300 (333) ++.|.-|..=- |..+ .|+-+.++ +.| ++|.+. +-.+|- +.++++.|.-..=.+-.|.|+++.- T Consensus 294 ~~~~~~L~~lk-d~~G----~~l~~~~~--~~g~~~~l~G~PVv~~~~~p~~~~~~~~i~~Gd~~~~~~i~~~~~~~~~~ 366 (401) T protein:vir:44 294 NNSLFAIRLLK-DTEG----NYLWRPGL--ELGQPSSLAGYGIAENEQMPDIAADAKAIAFGNFKRGYTIVDRIGTRILR 366 (401) T ss_pred HHHHHHHHHhh-ccCC----ceeecCCc--CCCCCceecceeeEEecCcCCccCCccEEEEeehhccEEEEEecceEEee Confidence 99998887621 2221 23322221 112 355442 222332 2233446653222356789999876 Q ss_pred ccchhhhccceehhhhhhhhhhccceEEEEecC Q lcl|NC_011269. 
301 DNKVERFNKGWVMDELVGMAILNPRGIVILRKA 333 (333) Q Consensus 301 ~D~~er~~kGWvm~E~~g~~i~N~~siv~~~~~ 333 (333) .++..+-..+|..++-++..+.||.++++|..+ T Consensus 367 ~~~~~~~~v~~~a~~r~d~~~~~~~a~~~l~~~ 399 (401) T protein:vir:44 367 DPYTNKPFVGFYTTKRTGGMLVDSQAIKLLKIA 399 (401) T ss_pred eccccCCcEEEEEEEEeccEEecccceEEEEee Confidence 667666678899999999999999999999988 No 51 >protein:vir:103955 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1662 # MgeName: phiNM # Cross-refs: genbank:acc:YP_873992;genbank:gi:118430767;genbank:GeneID:4525449 Probab=98.42 E-value=3.1e-08 Score=61.73 Aligned_cols=284 Identities=11% Similarity=0.105 Sum_probs=184.1 Q ss_pred hcccchHHHHHHHHH-HhhcchhcchHHHHHHHHHHhcCchhHHHHHHHHHHHHHHHHHhhhhhhhhhhhccccCCCcce Q lcl|NC_011269. 15 AKASDDYVADIVEAK-QRMGGRKLSAREKQAKLAHILSDKVGGIQRLGQSMIGPIQLQLRYQGILRNVLLEDTLTPGVPI 93 (333) Q Consensus 15 ~~~~~~~~~~~~~~~-~~~~~~~ls~ee~~~Lm~~Al~~~Eg~~~aLg~~mA~pI~~q~~rqGi~RklL~~~TL~~G~~p 93 (333) -|-.+.--.++-.-. -...+..+.+++ .+.+..+.. .+-..+++.|.+.++....++++....+++.|. . T Consensus 1 ~~~~~~~~~~~~~f~~~~~~~~~~~a~~-------~~~~~~~~~-liP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~-~ 71 (324) T protein:vir:10 1 MEQTQKLKLNLQHFASNNVKPQVFNPDN-------VMMHEKKDG-TLLNDFTTPILQEVMENSKIMQLGKYEPMEGTE-K 71 (324) T ss_pred CCCchHHHHHHHHHHHHhhccceecccc-------eeccCCCcc-eechhHHHHHHHHHHhhchhhhhcceeeccCCc-e Confidence 111110000000000 001111222211 121222211 567889999999999999999998888877554 4 Q ss_pred eecCCCCccceEEEEcCCCcccceeecCceeeccceeeeccccccHHHhhhhcchhHHHHHHHHHHHHHHHhhhHHHHHH Q lcl|NC_011269. 94 QYDVLDDLGQAYMLHGNEGEIRITPFEGKRIEVQLFRIASFPQIKKEDLYYLRSNIVEYTQDMTKQAIMRQEDSRLVTLL 173 (333) Q Consensus 94 ~y~v~~~v~~a~~~~~~~G~i~~Q~i~~~ri~~P~f~Ivs~P~V~~~dl~~~~~~vle~~q~~A~qaIM~qED~~~~sll 173 (333) .||+...... 
+-|++.-+++++....=+.+++.-.++.....|..+-|+....++..+..++..++|.+.+|..+++ T Consensus 72 ~~p~~~~~~~-a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~ai~~~~d~a~l~-- 148 (324) T protein:vir:10 72 KFTFWADKPG-AYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTYSQFFEEMKPMIAEAFYKKFDEAGIL-- 148 (324) T ss_pred EEEEEeCCcc-eeEeccCccccccccceeEEEEeeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHhhh-- Confidence 5666444333 4567777778877777678888889999999999999999999999999999999999999975543 Q ss_pred hhhhhhhhhhcccccccccCCCc----ceEEeeccccHHHHHHHHHHHHhhCCccceEEechhhhhhhhhcCCCchhhhH Q lcl|NC_011269. 174 EAAAVSYRVVDSSAQPGVGALPN----EITIAGSHLMPDDLYTAVTYTDQRQLDSSRLLANPQEYRDLYRWDINTTGWAF 249 (333) Q Consensus 174 e~~a~~~r~~~ssA~p~vg~~~N----~i~i~~g~Lt~~~L~~a~t~v~~~~L~at~il~~~~~~~Di~gw~~N~~~~~~ 249 (333) +... +..| .|.. | .-....+.++.++|..+...+..-+.....++||+.-|..|+- . T Consensus 149 -G~g~-------~~~~-~~i~-~~~~~~~~~~~~~~t~~~i~~~~~~l~~~~~~~~~~v~n~~~~~~L~~---------l 209 (324) T protein:vir:10 149 -NQGN-------NPFG-KSIA-QSIEKTNKVIKGDFTQDNIIDLEALLEDDELEANAFISKTQNRSLLRK---------I 209 (324) T ss_pred -cCCC-------CccC-cccc-ccccccceeccccCCHHHHHHHHHhhhhccCCCCEEEEcHHHHHHHHH---------h Confidence 2211 1111 1111 1 1134567899999999999999999999999999999998874 2 Q ss_pred Hhhhhhcceeeeee----cccccc----eeeecCCeEEEeeChhhhcccccccCceeccccch----------------h Q lcl|NC_011269. 250 KDSVVAGERIVQFG----EFQIGK----SIIIPRGTVYLTPEPEFLGVFPVMYSLDVEEDNKV----------------E 305 (333) Q Consensus 250 ~DpV~~~e~il~~G----~fgi~~----skvlprgeiyvvadpE~~G~~pvR~~L~s~p~D~~----------------e 305 (333) +|...+. +++.| ++|++. +.-.+.+.++ +.|..+. .+=+|+++.++-.|.. 
+ T Consensus 210 ~d~~g~~--~~~~~~~~~l~G~PV~~~~~~~~~~~~~~-~gd~~~~-~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~ 285 (324) T protein:vir:10 210 VDPETKE--RIYDRNSDTLDGLPVVNLKSSNLKRGELI-TGDFDKL-IYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFE 285 (324) T ss_pred hccCCce--eecCCCCccccceeEEeecCCCCCcceEE-EEecccE-EEEEecCcEEEEeecccccccccccccchhhhh Confidence 3332222 22222 355442 2234455554 4677644 4667888887665532 2 Q ss_pred hhccceehhhhhhhhhhccceEEEEecC Q lcl|NC_011269. 306 RFNKGWVMDELVGMAILNPRGIVILRKA 333 (333) Q Consensus 306 r~~kGWvm~E~~g~~i~N~~siv~~~~~ 333 (333) +--..|.+.+-++..+.||.+++.|..+ T Consensus 286 ~~~~~~r~~~r~d~~v~~~~A~~~l~~a 313 (324) T protein:vir:10 286 QDMVALRATMHVALHIADDKAFAKLVPA 313 (324) T ss_pred cCcEEEEEEEEEccEEecccceEEEEec Confidence 2356778888899999999999999988 No 52 >protein:vir:4830 Length: 397 # NCBI annotation: MPL-7201 # Family: family:all:21 # MgeID: mge:105 # MgeName: 7201 # Cross-refs: genbank:acc:NP_038327;genbank:gi:9634653;genbank:GeneID:1262632 Probab=98.41 E-value=3e-08 Score=61.85 Aligned_cols=294 Identities=12% Similarity=0.056 Sum_probs=181.5 Q ss_pred Ccccchhhh------hhhhhhcccchHHHHHHHHHHhhcchhcchHHHHHHHHHHhc-CchhHHHHHHHHHHHHHHHHHh Q lcl|NC_011269. 1 MTLPVAVGS------GLGRFAKASDDYVADIVEAKQRMGGRKLSAREKQAKLAHILS-DKVGGIQRLGQSMIGPIQLQLR 73 (333) Q Consensus 1 ~~~~~~~~~------~~~~~~~~~~~~~~~~~~~~~~~~~~~ls~ee~~~Lm~~Al~-~~Eg~~~aLg~~mA~pI~~q~~ 73 (333) ...+..-.. .-+.-.+....|+..+.+... ++ ....+=..+.. +..|+. .+-+-+...|.+.++ T Consensus 64 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~-----~~~~~~~~~~~t~~~gg~-~iP~~~~~~ii~~~~ 134 (397) T protein:vir:48 64 RANEVVNMSEEEKKPLTKSEEEVKAGFVKDFKNLVR---GR-----YQNLLDSKTDASGSDAGL-TIPQDIQTAIHTLVR 134 (397) T ss_pred HHhhhhhhhhhccccccchhhHHHHHHHHHHHHHHh---hh-----hhHHHHHhhccCCccccc-cccHHHHHHHHHHHH Confidence 000000000 000111112222222211110 11 11111011112 233442 567788888999999 Q ss_pred hhhhhhhhhhccccCCCc--ceeecCCCCccceEEEEcCCCcccce-eecCceeeccceeeeccccccHHHhhhhcchhH Q lcl|NC_011269. 
74 YQGILRNVLLEDTLTPGV--PIQYDVLDDLGQAYMLHGNEGEIRIT-PFEGKRIEVQLFRIASFPQIKKEDLYYLRSNIV 150 (333) Q Consensus 74 rqGi~RklL~~~TL~~G~--~p~y~v~~~v~~a~~~~~~~G~i~~Q-~i~~~ri~~P~f~Ivs~P~V~~~dl~~~~~~vl 150 (333) ....++++.+..+++.+. .|.++ ..+....+-|++-.+.+... ...=+.|++--.++..+..|..+-|+....++. T Consensus 135 ~~~~l~~~~~~~~~~~~~~~~~~~~-~~~~~~~a~~v~E~~~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~~~l~ 213 (397) T protein:vir:48 135 QYDSLQEYVNVENVTTLTGSRVYEK-WADITGLAKLDDEAGSIGTNDDPKLYPIRYAIKRYAGISTVTNSLLADSAENIL 213 (397) T ss_pred HHHHHHhhhceeeccCCcceEEEEe-ecCCCcceeeeccccccccccccceeeEEeeheeeeeehhhHHHHHhhchHHHH Confidence 999999998888877554 33222 22333334466666666643 334478888889999999999999999999999 Q ss_pred HHHHHHHHHHHHHHhhhHHHHHHhhhhhhhhhhcccccccccCCCcceEEeeccccHHHHHHHHHHHHhhCCccceEEec Q lcl|NC_011269. 151 EYTQDMTKQAIMRQEDSRLVTLLEAAAVSYRVVDSSAQPGVGALPNEITIAGSHLMPDDLYTAVTYTDQRQLDSSRLLAN 230 (333) Q Consensus 151 e~~q~~A~qaIM~qED~~~~slle~~a~~~r~~~ssA~p~vg~~~N~i~i~~g~Lt~~~L~~a~t~v~~~~L~at~il~~ 230 (333) .+..++..++|..-+|..+++-. |.- ...++..+-+++..+...++.-......++|| T Consensus 214 ~~v~~~l~~~~~~~~d~~il~G~------------------g~~----~~~~~~~~~d~i~~~~~~l~~~~~~~a~~v~n 271 (397) T protein:vir:48 214 AWLSGWIAKKVVVTRNKAILEAI------------------ATL----PTKPTLTKWDDIIDLQAKVDPAIKQTSFFLTN 271 (397) T ss_pred HHHHHHHHHHHHHHHHHHHhhcc------------------ccc----ccccccccHHHHHHHHHHhhhhhcCCCEEEEC Confidence 99999999999999997766533 111 12345567788999999999888889999999 Q ss_pred hhhhhhhhhcCCCchhhhHHhhhhhcceeeeee----cccccc----eeeecCCe----EEEeeChhhhcccccccCcee Q lcl|NC_011269. 231 PQEYRDLYRWDINTTGWAFKDSVVAGERIVQFG----EFQIGK----SIIIPRGT----VYLTPEPEFLGVFPVMYSLDV 298 (333) Q Consensus 231 ~~~~~Di~gw~~N~~~~~~~DpV~~~e~il~~G----~fgi~~----skvlprge----iyvvadpE~~G~~pvR~~L~s 298 (333) +.-|..|+.=- +..+ .|+-+.+. +.| ++|.+. 
+..+|-++ .++..|...+-.+-.|+|+.+ T Consensus 272 ~~~~~~L~~lk-d~~G----~~i~~~~~--~~~~~~~l~G~PV~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i 344 (397) T protein:vir:48 272 TSGFTALKKVK-NAFG----DYLMERDV--KSPTGYSIDGFAVKEVADRWLANASSGAMPLYFGDLKQAVTLFDRQQMSL 344 (397) T ss_pred HHHHHHHHHhh-cCCC----ceeeccCc--CCCCCceeccceeEEecccccCCcCCCceEEEEEeccceEEEEeecceEE Confidence 99999998721 1111 23332221 122 356543 23445433 344567665566788999988 Q ss_pred ccccchh----hhccceehhhhhhhhhhccceEEEEecC Q lcl|NC_011269. 299 EEDNKVE----RFNKGWVMDELVGMAILNPRGIVILRKA 333 (333) Q Consensus 299 ~p~D~~e----r~~kGWvm~E~~g~~i~N~~siv~~~~~ 333 (333) +-.+... .--.+|..++-++..+.||.+++++.-+ T Consensus 345 ~~~~~~~~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~ 383 (397) T protein:vir:48 345 LSTNIGGGAFETDTTKIRVIDRFDVVATDTESFVPASFK 383 (397) T ss_pred EEeccchhhhhcCceeEEEEeeeccEEecccceEEEEec Confidence 8766543 2346899999999999999999999855 No 53 >protein:vir:100172 Length: 394 # NCBI annotation: putative major head protein # Family: family:all:21 # MgeID: mge:1524 # MgeName: phi AT3 # Cross-refs: genbank:acc:YP_025031;genbank:gi:48697264;genbank:GeneID:2948270 Probab=98.41 E-value=1.7e-08 Score=63.17 Aligned_cols=297 Identities=9% Similarity=0.046 Sum_probs=174.4 Q ss_pred Ccc---cc--hhhhhhhhhhcccchHHHHHHHHHHhhcchhcchHHHHHHHHHHhcCchhHHHHHHHHHHHHHHHHHhhh Q lcl|NC_011269. 1 MTL---PV--AVGSGLGRFAKASDDYVADIVEAKQRMGGRKLSAREKQAKLAHILSDKVGGIQRLGQSMIGPIQLQLRYQ 75 (333) Q Consensus 1 ~~~---~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ls~ee~~~Lm~~Al~~~Eg~~~aLg~~mA~pI~~q~~rq 75 (333) ... ++ ....+--.+.+..+.+-..+-+.. |-+++ .++ . -.....+..|+. .+-.-+.+.|...++.. 
T Consensus 67 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l-~~~~~--~~~---~-~~~~~t~~~gg~-~vP~~~~~~ii~~~~~~ 138 (394) T protein:vir:10 67 NSDPDKPVDNAQPNGTDLKKKPIDAKKKAINDFI-HSHGK--VID---N-AAGHVTSTEAGV-LIPEEIIYDPTAEVNSV 138 (394) T ss_pred hcchhhhhhhhcccccchhhhHHHHHHHHHHHHH-hccch--hhh---h-hhcccccccCce-eccHHHHHHHHHHHHhh Confidence 110 00 000111111111111111111100 00010 000 0 011122333442 45566788899999999 Q ss_pred hhhhhhhhccccCCCcceeecCCCCccceEEEEcCCCcccc-eeecCceeeccceeeeccccccHHHhhhhcchhHHHHH Q lcl|NC_011269. 76 GILRNVLLEDTLTPGVPIQYDVLDDLGQAYMLHGNEGEIRI-TPFEGKRIEVQLFRIASFPQIKKEDLYYLRSNIVEYTQ 154 (333) Q Consensus 76 Gi~RklL~~~TL~~G~~p~y~v~~~v~~a~~~~~~~G~i~~-Q~i~~~ri~~P~f~Ivs~P~V~~~dl~~~~~~vle~~q 154 (333) ...+++.+..+++.+. -.|+.++.-...+-|++..|++.. ....=+.|++.-..+..++.|..+=|+....|+..+.. T Consensus 139 ~~l~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~E~~~~~~~~~~~~~~v~l~~~k~~~~~~iS~ell~ds~~~l~~~i~ 217 (394) T protein:vir:10 139 VDLSTLVTKTPVTTPK-GTYPILKRATDRFSSVAELAENPALAEPEFEQVDWSVSTYRGAIPLSEEAIADSAVDLTSLVG 217 (394) T ss_pred hhhhhhceeeeccCCc-eEEEEEecCCCccccccccccccccccccceeEEeeeeeeEeeehhHHHHHhhhhHHHHHHHH Confidence 9999998888877664 445554555455667888888875 44455889999999999999999999999999999999 Q ss_pred HHHHHHHHHHhhhHHHHHHhhhhhhhhhhcccccccccCCCcceEEeeccccHHHHHHHHHHHHhhCCccceEEechhhh Q lcl|NC_011269. 155 DMTKQAIMRQEDSRLVTLLEAAAVSYRVVDSSAQPGVGALPNEITIAGSHLMPDDLYTAVTYTDQRQLDSSRLLANPQEY 234 (333) Q Consensus 155 ~~A~qaIM~qED~~~~slle~~a~~~r~~~ssA~p~vg~~~N~i~i~~g~Lt~~~L~~a~t~v~~~~L~at~il~~~~~~ 234 (333) ++..++|..-+|..+++.+-. + .+..+ .+..+-++|-.+.....+-... 
..++||++.| T Consensus 218 ~~la~~~~~~~~~~il~g~g~-----------~--------~~~~~-~~~~~~d~l~~~~~~~~~~~~~-a~~vmn~~~~ 276 (394) T protein:vir:10 218 QSINEKSVNTYNAMIAPVLQS-----------F--------TAKAT-TTDTLVDSLKHILNVDLDPAYS-RALVVTQSLF 276 (394) T ss_pred HHHHHHHHHHHHHHHhhcccc-----------c--------ccccc-cccccHHHHHHHHHhhhhhhcc-CEEEecHHHH Confidence 999999999999877665521 1 11112 2345566776665433333332 5789999999 Q ss_pred hhhhhcCCCchhhhHHhhhhhcceeee--ee----cccccc----eeeecCC--e-EEEeeChhhhcccccccCceeccc Q lcl|NC_011269. 235 RDLYRWDINTTGWAFKDSVVAGERIVQ--FG----EFQIGK----SIIIPRG--T-VYLTPEPEFLGVFPVMYSLDVEED 301 (333) Q Consensus 235 ~Di~gw~~N~~~~~~~DpV~~~e~il~--~G----~fgi~~----skvlprg--e-iyvvadpE~~G~~pvR~~L~s~p~ 301 (333) .=|+.=- +..| .|+-+..+.-. .| ++|.+- +..+|-+ . .+++.|.-..-.+-.|+++.+.-. T Consensus 277 ~~l~~lk-d~~G----~~i~~~~~~~~~~~~~~~~L~G~PV~~~~~~~~~~~~~~~~i~~gd~s~~~~~~~~~~~~v~~~ 351 (394) T protein:vir:10 277 NTLDTLK-DKNG----RYLLHDASDSITDGTAKGTVLGVPVYVVGDALLGSAAGDQKAFVGDLKRGVLFADRQQVTLAWE 351 (394) T ss_pred HHHHHhh-ccCC----CeeeeccccccccCCcccccccceeEEecccccCCCCCceEEEEeeccccEEEEeecceEEEEe Confidence 8888611 1222 23322221111 11 355442 2233432 2 356666654455667888888755 Q ss_pred cchhhhccceehhhhhhhhhhccceEEEEecC Q lcl|NC_011269. 302 NKVERFNKGWVMDELVGMAILNPRGIVILRKA 333 (333) Q Consensus 302 D~~er~~kGWvm~E~~g~~i~N~~siv~~~~~ 333 (333) +.. .+.+++..++-++.++.||.+|+++.-. 
T Consensus 352 ~~~-~~~~~~~~~~r~d~~~~~~~ai~~~~~~ 382 (394) T protein:vir:10 352 DSK-IYGRYLGAAFRFGVKQADSNAGYFVTNT 382 (394) T ss_pred ccc-ccceeEEEEEEeccEEeccccEEEEEee Confidence 532 3678999999999999999999998765 No 54 >protein:vir:94771 Length: 298 # NCBI annotation: major head protein # Family: family:all:966 # MgeID: mge:1529 # MgeName: phi LC3 # Cross-refs: genbank:acc:NP_996706;genbank:gi:45597421;genbank:GeneID:2769044 Probab=98.40 E-value=1.6e-08 Score=63.27 Aligned_cols=267 Identities=9% Similarity=0.011 Sum_probs=172.5 Q ss_pred hcCchhHHHHHHHHHHHHHHHHHhhhhhhhhhhhccccCCCcceeecCCCCccceEEEEcCCCcccceeecCceeeccce Q lcl|NC_011269. 50 LSDKVGGIQRLGQSMIGPIQLQLRYQGILRNVLLEDTLTPGVPIQYDVLDDLGQAYMLHGNEGEIRITPFEGKRIEVQLF 129 (333) Q Consensus 50 l~~~Eg~~~aLg~~mA~pI~~q~~rqGi~RklL~~~TL~~G~~p~y~v~~~v~~a~~~~~~~G~i~~Q~i~~~ri~~P~f 129 (333) |..+-|- -+-.-+.+.|-+.++.+.+.+++....+++.|.. .||+......| -|++..++++.....=..+++.-. T Consensus 1 ma~~gG~--lip~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~~-~~p~~~~~~~a-~~v~Eg~~~~~~~~~f~~v~l~~~ 76 (298) T protein:vir:94 1 MVLNKGT--LFDPELVTDLISKVAGKSSIARLSAQKPIPFNGE-KVFTFTMDSEI-DVVAESGKKTHGGVTLAPQTMVPI 76 (298) T ss_pred Ceecccc--ccChhHHHHHHHHHHhhchhhhhcceeeccCCce-EEEEEecCcce-EEeeCCccccccccceeEEEEeee Confidence 6666563 4667778888899999999999999999987753 45554444344 467766667765555567888888 Q ss_pred eeeccccccHHHhhhh---cchhHHHHHHHHHHHHHHHhhhHHHHHHhhhhhhhhhhccccccccc--CCCcc----eE- Q lcl|NC_011269. 130 RIASFPQIKKEDLYYL---RSNIVEYTQDMTKQAIMRQEDSRLVTLLEAAAVSYRVVDSSAQPGVG--ALPNE----IT- 199 (333) Q Consensus 130 ~Ivs~P~V~~~dl~~~---~~~vle~~q~~A~qaIM~qED~~~~slle~~a~~~r~~~ssA~p~vg--~~~N~----i~- 199 (333) ++.....|..+-|++. ..++....+.+..++|.+.+|.-+++=.++.. ..+.++.| .+.+. .. 
T Consensus 77 k~~~~~~iS~ell~~~~~~~~~l~~~i~~~la~ai~~~~d~~~l~G~~~~~-------g~~~~~~~~~~~~~~~~~~~~~ 149 (298) T protein:vir:94 77 KVEYGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRL-------GTASAVIGTNHFDSKVTQKVEA 149 (298) T ss_pred EEEEeeehhHHHhccCCccHHHHHHHHHHHHHHHHHHHHHHHhhcccccCC-------Cccccccccccccccccccccc Confidence 9999999988878654 45788999999999999999965554321100 01111111 11111 11 Q ss_pred EeeccccHHHHHHHHHHHHhhCCccceEEechhhhhhhhhcCCCchhhhHHhhhhhcceee--eee-ccccc--ceeeec Q lcl|NC_011269. 200 IAGSHLMPDDLYTAVTYTDQRQLDSSRLLANPQEYRDLYRWDINTTGWAFKDSVVAGERIV--QFG-EFQIG--KSIIIP 274 (333) Q Consensus 200 i~~g~Lt~~~L~~a~t~v~~~~L~at~il~~~~~~~Di~gw~~N~~~~~~~DpV~~~e~il--~~G-~fgi~--~skvlp 274 (333) -.++.-.-+++..++..+.+-+++...++||++.|..|+.=- +..+ .|+-+.. .. +.| +.|.+ .+.-+| T Consensus 150 ~~~~~~~~~~i~~~~~~~~~~~~~~~~~vmn~~~~~~l~~lk-d~~G----~~l~~~~-~~~~~~~tl~G~PV~~~~~v~ 223 (298) T protein:vir:94 150 PRGIADPNGAIENAVELLTGVDADVTGIAINPSFRSALAKQK-DLQG----NALFPEL-KWGATPDTINGLPVDVNKTVS 223 (298) T ss_pred ccccccHHHHHHHHHHhhhhcCCCccEEEEcHHHHHHHHHhh-ccCC----CeeecCc-ccCCCCceecceeeEEecccc Confidence 111222246789999999999999999999999999887610 1111 2221111 00 011 23433 234445 Q ss_pred CC-----eEEEeeChhhhcccccccCceeccccchhh-------h---ccceehhhhhhhhhhccceEEEEecC Q lcl|NC_011269. 275 RG-----TVYLTPEPEFLGVFPVMYSLDVEEDNKVER-------F---NKGWVMDELVGMAILNPRGIVILRKA 333 (333) Q Consensus 275 rg-----eiyvvadpE~~G~~pvR~~L~s~p~D~~er-------~---~kGWvm~E~~g~~i~N~~siv~~~~~ 333 (333) -+ ...+..|..+...|-+|+++..+-.++.+. 
| ..+|...+-+++.+.+|.+++.+.++ T Consensus 224 ~~~~~~~~~~~~Gdfs~~~~~~~~~~~~~~~~~~~~~d~~~~~~f~~~~v~~r~~~r~~~~~~~~~a~~~l~~~ 297 (298) T protein:vir:94 224 DMSLTQRDRAIIGDFANGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLGWGILDATKFARVTEA 297 (298) T ss_pred cccCCCccEEEEeeccceEEEEEecCceEEEeecCCCcCcchhhhhcCcEEEEEEEEeccEeecccceEEEEec Confidence 32 245567776566678888988765543221 2 34577778899999999999999999 No 55 >protein:vir:2504 Length: 305 # NCBI annotation: major capsid subunit gp9 # Family: family:all:507 # MgeID: mge:53 # MgeName: TM4 # Cross-refs: genbank:acc:NP_569745;genbank:gi:18496895;genbank:GeneID:932268 Probab=98.39 E-value=6.1e-09 Score=65.63 Aligned_cols=268 Identities=10% Similarity=0.020 Sum_probs=167.2 Q ss_pred HHHHhcCchhHHHHHHHHHHHHHHHHHhhhhhhhhhhhccccCCCcceeecCCCCccceEEEEcCCCcccce-----eec Q lcl|NC_011269. 46 LAHILSDKVGGIQRLGQSMIGPIQLQLRYQGILRNVLLEDTLTPGVPIQYDVLDDLGQAYMLHGNEGEIRIT-----PFE 120 (333) Q Consensus 46 m~~Al~~~Eg~~~aLg~~mA~pI~~q~~rqGi~RklL~~~TL~~G~~p~y~v~~~v~~a~~~~~~~G~i~~Q-----~i~ 120 (333) |+. +.+..|. -.+-+-+.+.|.+.++.+..++++....+.+.|. ..||+...-. .+.|++..+++... ... T Consensus 1 ma~-~t~~~gg-~liP~~~~~~Ii~~~~~~s~l~~l~~~~~~~~~~-~~~p~~~~~~-~a~wv~E~~~~~~~~~~~s~~~ 76 (305) T protein:vir:25 1 MAD-ISRAEVA-SLIQEAYSDTLLAAAKQGSTVLSAFQNVNMGTKT-THLPVLATLP-EADWVGESATDPKGVKPTSKVT 76 (305) T ss_pred CCC-ccCCccc-eecCHHHHHHHHHHHHhhchhhhhcceeeccCCc-EEEEEEeCCc-ceEEeecccccccccccccccc Confidence 333 3344444 2678889999999999999999999888887664 3344433332 34566655544332 222 Q ss_pred CceeeccceeeeccccccHHHhhhhcchhHHHHHHHHHHHHHHHhhhHHHHHHhhhhhhhhhhcccccccccCCCcceEE Q lcl|NC_011269. 121 GKRIEVQLFRIASFPQIKKEDLYYLRSNIVEYTQDMTKQAIMRQEDSRLVTLLEAAAVSYRVVDSSAQPGVGALPNEITI 200 (333) Q Consensus 121 ~~ri~~P~f~Ivs~P~V~~~dl~~~~~~vle~~q~~A~qaIM~qED~~~~slle~~a~~~r~~~ssA~p~vg~~~N~i~i 200 (333) =+.|++.-.++...+.|..+-|+....|+..+..+...++|.+.+|.-+++=-.+ . 
-=.-...-.+......+..+- T Consensus 77 f~~i~~~~~k~~~~~~is~ell~ds~~~~~~~i~~~l~~~~a~~~d~a~~~G~g~-~--~~~~~~~~~~~~~~~~~~~~~ 153 (305) T protein:vir:25 77 WANRTLVAEEIAVIIPVHENVIDDATVAVLTEVAELGGQAIGKKLDQAVIFGTDK-P--ASWVSPALIPAAVTAGQAVEV 153 (305) T ss_pred eeeEEeeeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHhhhheeccCC-C--CCccccccccccccccccccc Confidence 2567778888999999999999999999999999999999999999666531000 0 000000001111111133333 Q ss_pred eeccccHHH----HHHHHHHHHhhCCccceEEechhhhhhhhhcCCCchhhhHHhhhhhcceeeeeec-ccccc--eeee Q lcl|NC_011269. 201 AGSHLMPDD----LYTAVTYTDQRQLDSSRLLANPQEYRDLYRWDINTTGWAFKDSVVAGERIVQFGE-FQIGK--SIII 273 (333) Q Consensus 201 ~~g~Lt~~~----L~~a~t~v~~~~L~at~il~~~~~~~Di~gw~~N~~~~~~~DpV~~~e~il~~G~-fgi~~--skvl 273 (333) ..+..+.++ +..+...+.+-+.....++||+..|..++. + || ..+..+.+.+. +|.+- +-.+ T Consensus 154 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~--l-------kd--~~G~~i~~~~~l~G~Pv~~~~~~ 222 (305) T protein:vir:25 154 VGGVANESDIVGATNRAAKAVASAGWAPDTLLSSLALRYEVAN--I-------RD--ANGNPVFRDDSFAGFRTFFNRNG 222 (305) T ss_pred cccchhhhHHHHHHHHHHHhhhhcccccceeEecHHHHHHHHH--h-------hc--cCCceeecCCcccccceEEcCcc Confidence 444444433 555566666667777789999999999876 2 22 12333333222 34331 1112 Q ss_pred ----cCCeEEEeeChhhhcccccccCceeccccch------------hhhccceehhhhhhhhhhccceEEEEecC Q lcl|NC_011269. 274 ----PRGTVYLTPEPEFLGVFPVMYSLDVEEDNKV------------ERFNKGWVMDELVGMAILNPRGIVILRKA 333 (333) Q Consensus 274 ----prgeiyvvadpE~~G~~pvR~~L~s~p~D~~------------er~~kGWvm~E~~g~~i~N~~siv~~~~~ 333 (333) +.+.+| ..|..+ -.+-+|+++.++-.|+. ++-...|...+-+|+++.||.+||.+-.. 
T Consensus 223 ~~~~~~~~~~-~gd~s~-~~i~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~R~~~r~~~~v~~p~a~v~~~~~ 296 (305) T protein:vir:25 223 AWDADAAIEV-IADSSR-VKIGVRQDITVKFLDQATLGTGENQINLAERDMVALRLKARFAYVLGVSATAQGANKT 296 (305) T ss_pred CCCCCccEEE-EEecce-EEEEEecCeEEEEeeeeeeecCCceeeeeecCcEEEEEEEeecceeeCcccEEEEccc Confidence 223444 467765 46778888877665532 22345677888899999999999999887 No 56 >protein:vir:2344 Length: 397 # NCBI annotation: gp14 # Family: family:all:507 # MgeID: mge:51 # MgeName: Bxb1 # Cross-refs: genbank:acc:NP_075281;genbank:gi:12657868;genbank:GeneID:920118 Probab=98.39 E-value=2.5e-08 Score=62.29 Aligned_cols=279 Identities=13% Similarity=0.006 Sum_probs=184.5 Q ss_pred hcchhcchHHHHHHHHHHhcCchhHHHHHHHHHHHHHHHHHhhhhhhhhhhhccccCCCcceeecCCCCccceEEEEcCC Q lcl|NC_011269. 32 MGGRKLSAREKQAKLAHILSDKVGGIQRLGQSMIGPIQLQLRYQGILRNVLLEDTLTPGVPIQYDVLDDLGQAYMLHGNE 111 (333) Q Consensus 32 ~~~~~ls~ee~~~Lm~~Al~~~Eg~~~aLg~~mA~pI~~q~~rqGi~RklL~~~TL~~G~~p~y~v~~~v~~a~~~~~~~ 111 (333) ||- .+++ ++++..-.+..|. .|-..+.+.|-+.++.+...+++....+++.|. -.||+...... +-|++.. T Consensus 1 ~g~---~~e~--~~~~~~~t~~~~g--~l~~~~~~~ii~~l~~~s~i~~l~~~~~~~~~~-~~ip~~~~~~~-a~wv~Eg 71 (397) T protein:vir:23 1 MGF---SADH--SQIAQTKDTMFTG--YLDPVQAKDYFAEAEKTSIVQRVAQKIPMGATG-IVIPHWTGDVS-AQWIGEG 71 (397) T ss_pred CCc---CHHH--HHHhhccCCCCcc--ccchhHHHHHHHHHHhccchhhhcceeeccCCc-eEEEEEcCCcc-eEEecCC Confidence 762 2332 3334444444443 577778888889999999999999888887664 24555444333 3467777 Q ss_pred CcccceeecCceeeccceeeeccccccHHHhhhhcchhHHHHHHHHHHHHHHHhhhHHHHHHhhhhhhhhhhcccccccc Q lcl|NC_011269. 112 GEIRITPFEGKRIEVQLFRIASFPQIKKEDLYYLRSNIVEYTQDMTKQAIMRQEDSRLVTLLEAAAVSYRVVDSSAQPGV 191 (333) Q Consensus 112 G~i~~Q~i~~~ri~~P~f~Ivs~P~V~~~dl~~~~~~vle~~q~~A~qaIM~qED~~~~slle~~a~~~r~~~ssA~p~v 191 (333) ++++.....=+.|++.-.++.....|..+=|++...|+..+..++..++|.+.+|.-+++=-. +..+-. 
T Consensus 72 ~~~~~s~~~f~~v~l~~~k~~~~v~iS~ell~ds~~~l~~~i~~~l~~aia~~~d~a~l~G~g-----------t~~~~~ 140 (397) T protein:vir:23 72 DMKPITKGNMTKRDVHPAKIATIFVASAETVRANPANYLGTMRTKVATAIAMAFDNAALHGTN-----------APSAFQ 140 (397) T ss_pred ccccccccceeEEEEeeEEEEEeehhhHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhhccc-----------CCcccc Confidence 778777776677888888999999999999999999999999999999999999976654211 111111 Q ss_pred cCCCc---ceEEeeccccHHHHHHHHHHHHhhCCccceEEechhhhhhhhhcCCCchh-hhHHhhhhhccee-eeee-cc Q lcl|NC_011269. 192 GALPN---EITIAGSHLMPDDLYTAVTYTDQRQLDSSRLLANPQEYRDLYRWDINTTG-WAFKDSVVAGERI-VQFG-EF 265 (333) Q Consensus 192 g~~~N---~i~i~~g~Lt~~~L~~a~t~v~~~~L~at~il~~~~~~~Di~gw~~N~~~-~~~~DpV~~~e~i-l~~G-~f 265 (333) .+.| ...-..+....+++-.+...+..-......++||+..|..|+.=- +..+ |.+.+....+... .+.| ++ T Consensus 141 -~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~a~~vmn~~~~~~L~~lk-d~~G~~i~~~~~~~~~~~~~~~~tl~ 218 (397) T protein:vir:23 141 -GYLDQSNKTQSISPNAYQGLGVSGLTKLVTDGKKWTHTLLDDTVEPVLNGSV-DANGRPLFVESTYESLTTPFREGRIL 218 (397) T ss_pred -cccccccceeeecccchhHHHHHHHHhhhhcccCCCEEEEcHHHHHHHHHhh-ccCCceeecccccccccccccCceee Confidence 1111 111234555667777777778888888889999999999988611 1111 2222222111110 0011 34 Q ss_pred ccc--ceeeecCCeEE-EeeChhhhcccccccCceeccccch----------------hhhccceehhhhhhhhhhccce Q lcl|NC_011269. 266 QIG--KSIIIPRGTVY-LTPEPEFLGVFPVMYSLDVEEDNKV----------------ERFNKGWVMDELVGMAILNPRG 326 (333) Q Consensus 266 gi~--~skvlprgeiy-vvadpE~~G~~pvR~~L~s~p~D~~----------------er~~kGWvm~E~~g~~i~N~~s 326 (333) |++ .+.-+|-|++. +..|.- ...+-.|+++.++-.|.. ++--..|...+-++..+.+|.+ T Consensus 219 G~Pv~~s~~~~~g~~~~~~gDfs-~~~i~~~~~i~i~~~~e~~~~~~~~~~~~~~~lf~~d~v~~ra~~r~d~~v~~~~a 297 (397) T protein:vir:23 219 GRPTILSDHVAEGDVVGYAGDFS-QIIWGQVGGLSFDVTDQATLNLGSQESPNFVSLWQHNLVAVRVEAEYGLLINDVNA 297 (397) T ss_pred eeeEEEeCCCCCCceEEEEeecc-eEEEEEEeceEEEEeeeeeeeeccccccceeeeeeccceeEEEEeeeccceecccc Confidence 555 45558888865 456765 345778888887755532 2224677788899999999999 Q ss_pred EEEEecC Q lcl|NC_011269. 
327 IVILRKA 333 (333) Q Consensus 327 iv~~~~~ 333 (333) ++.+.+. T Consensus 298 ~~~~~~~ 304 (397) T protein:vir:23 298 FVKLTFD 304 (397) T ss_pred eEEEeec Confidence 9999987 No 57 >protein:vir:9704 Length: 394 # NCBI annotation: hypothetical protein # Family: family:all:21 # MgeID: mge:174 # MgeName: 315.2 # Cross-refs: genbank:acc:NP_795466;genbank:gi:28876225;genbank:GeneID:1257769 Probab=98.39 E-value=1.9e-08 Score=62.97 Aligned_cols=297 Identities=7% Similarity=-0.010 Sum_probs=174.1 Q ss_pred CcccchhhhhhhhhhcccchHHHHHHHHHHhhcchhcchHHH--------HHHHHHHhcCchhHHHHHHHHHHHHHHHHH Q lcl|NC_011269. 1 MTLPVAVGSGLGRFAKASDDYVADIVEAKQRMGGRKLSAREK--------QAKLAHILSDKVGGIQRLGQSMIGPIQLQL 72 (333) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ls~ee~--------~~Lm~~Al~~~Eg~~~aLg~~mA~pI~~q~ 72 (333) -.-+. ........++..++....-+....+.......+.+ ....+....+..|+. .+-+-+.+.|...+ T Consensus 76 ~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~~~gg~-liP~~~~~~ii~~~ 152 (394) T protein:vir:97 76 GGKEV--TQEEKTYRESVNDFIRSKGKIVNDSLRFEGKDEVLMPINETTPVEPQKDGIKKENAKP-VSSEEILYTPAREV 152 (394) T ss_pred ccccc--chhhHHHHHHHHHHHHHHHHHhhhhhhhhhHHHHHHHHHhhhhhhhhccccccccccc-cChHHHHHHHHHHh Confidence 00000 00011111111222222222111111111111100 111111122333331 34466788899999 Q ss_pred hhhhhhhhhhhccccCCCcceeecCCCCccceEEEEcCCCcccc-eeecCceeeccceeeeccccccHHHhhhhcchhHH Q lcl|NC_011269. 73 RYQGILRNVLLEDTLTPGVPIQYDVLDDLGQAYMLHGNEGEIRI-TPFEGKRIEVQLFRIASFPQIKKEDLYYLRSNIVE 151 (333) Q Consensus 73 ~rqGi~RklL~~~TL~~G~~p~y~v~~~v~~a~~~~~~~G~i~~-Q~i~~~ri~~P~f~Ivs~P~V~~~dl~~~~~~vle 151 (333) +....++++.+..+++.|. ..||..+.-+..+.|++-.++++. ....=+.|++.-..+.....|..+=|.....|+.. 
T Consensus 153 ~~~~~l~~~~~~~~~~~~~-~~~~~~~~~~~~~~~v~E~~~~~~~~~~~~~~v~l~~~k~~~~i~is~ell~ds~~~~~~ 231 (394) T protein:vir:97 153 KTVVDLKPFTTVYQAKKAS-GKYPVLQRATTKMVTVAELEKNPALAKPDFKDVAWNIDTYRGAIPLSQESIDDADVDLVG 231 (394) T ss_pred hhhhhhhhhceeeeccCcc-eEEEEEecCCCccceecccccccccccccceeEEeehhheeeehhhHHHHHhhhhHHHHH Confidence 9999999999988888876 345655554455567777777764 33344788888889999999999999999999999 Q ss_pred HHHHHHHHHHHHHhhhHHHHHHhhhhhhhhhhcccccccccCCCcceEEeeccccHHHHHHHHHHHHhhCCccceEEech Q lcl|NC_011269. 152 YTQDMTKQAIMRQEDSRLVTLLEAAAVSYRVVDSSAQPGVGALPNEITIAGSHLMPDDLYTAVTYTDQRQLDSSRLLANP 231 (333) Q Consensus 152 ~~q~~A~qaIM~qED~~~~slle~~a~~~r~~~ssA~p~vg~~~N~i~i~~g~Lt~~~L~~a~t~v~~~~L~at~il~~~ 231 (333) +...+..++|..-+|..+++.+.+.+ | .+..+-++|..++...-+.... ..++||+ T Consensus 232 ~i~~~la~~~~~~~~~~i~~g~~~~~-----------~------------~~~~~~~~~~~~~~~~~~~~~~-a~~v~n~ 287 (394) T protein:vir:97 232 IVSESISQIKVNTTNDAIAKVLKSFT-----------T------------KTVKNLDEIKALLNGGFDPAYN-VSLIVSQ 287 (394) T ss_pred HHHHHHHHHHHHHHHHHHhhcccccc-----------c------------cccccHHHHHHHHHhhhhhhhC-CEEEEcH Confidence 99999999999999977666542221 1 1233456666666554444333 4589999 Q ss_pred hhhhhhhhcCCCchhhhHHhhhhhcceeeee---eccccc----ceeeecCCeEEEeeChhhhcccccccCceeccccch Q lcl|NC_011269. 232 QEYRDLYRWDINTTGWAFKDSVVAGERIVQF---GEFQIG----KSIIIPRGTVYLTPEPEFLGVFPVMYSLDVEEDNKV 304 (333) Q Consensus 232 ~~~~Di~gw~~N~~~~~~~DpV~~~e~il~~---G~fgi~----~skvlprgeiyvvadpE~~G~~pvR~~L~s~p~D~~ 304 (333) ..|.-|..= -+..| -|+-+.++ .+. =++|.+ .+.-+|.+++| +.|--.+-.+-+|+++.++..|. T Consensus 288 ~~~~~l~~l-kd~~G----~~i~~~~~-~~~~~~~l~G~pv~~~~~~~~~~~~~~-~gd~~~~~~~~~~~~~~~~~~~~- 359 (394) T protein:vir:97 288 SFYQTLDTL-KDGNG----RYLLQDDI-TAVSGKVLLGKPVFVLSDEVLGANKAF-IGDFKRGVLFADRKDLGLRWADN- 359 (394) T ss_pred HHHHHHHHh-hccCC----CeeeecCc-CCCCCceeccceeEEecccccCCccEE-EeeccccEEEEEecceEEEEecc- Confidence 999988761 11111 12222221 111 134543 25567777765 45543333455788888876553 Q ss_pred hhhccceehhhhhhhhhhccceEEEEecC Q lcl|NC_011269. 
305 ERFNKGWVMDELVGMAILNPRGIVILRKA 333 (333) Q Consensus 305 er~~kGWvm~E~~g~~i~N~~siv~~~~~ 333 (333) ..+.++...++-++..+.||.+|+.+.-. T Consensus 360 ~~~~~~~~~~~r~d~~v~~~~a~~~~~~~ 388 (394) T protein:vir:97 360 EIYGQYLQAVLRFGVSKVDDKAGYYVTFT 388 (394) T ss_pred cccceeEEEEEEEccEEecccceEEEEec Confidence 23578899999999999999999998765 No 58 >protein:vir:81227 Length: 413 # NCBI annotation: gp6, major capsid protein # Family: family:all:585 # MgeID: mge:1893 # MgeName: BFK20 # Cross-refs: genbank:acc:YP_001456736;genbank:gi:157168379;hssp:P49861;interpro:IPR006444;uniprot:Q9MBJ9;genbank:GeneID:5580350 Probab=98.38 E-value=4.8e-08 Score=60.72 Aligned_cols=315 Identities=16% Similarity=0.159 Sum_probs=176.4 Q ss_pred Ccc----cchhhhhhhhhhcccchHHHHHHHHHHhhcchhcchHHHHHHHHHHhc---CchhHHHHHHHHHHHHHHHHHh Q lcl|NC_011269. 1 MTL----PVAVGSGLGRFAKASDDYVADIVEAKQRMGGRKLSAREKQAKLAHILS---DKVGGIQRLGQSMIGPIQLQLR 73 (333) Q Consensus 1 ~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ls~ee~~~Lm~~Al~---~~Eg~~~aLg~~mA~pI~~q~~ 73 (333) +.- ....+....+......+-.+.. +..-.+.++..+.+. .++...+.. +..+. -.+.+.+.+.|-+.++ T Consensus 67 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~-~~vp~~~~~~ii~~~~ 143 (413) T protein:vir:81 67 LTRKGEGYKSIGEFFAKRAGDQIKQQAGG-AQLNYSVGEYVAPRV-KAASDPASTATLTDEFQ-GGYGTTWNRNIIYRRR 143 (413) T ss_pred HhhhhhhhhhhhhhhhhhhhhHHHHHHHH-HHhhhhhhhhhhhHH-Hhhhhhhhhcccccccc-cccchhhHHHHHHHHh Confidence 000 0011111111110000000000 011111222222222 222111111 12232 2456778888999999 Q ss_pred hhhhhhhhhhccccCCCcceeecC---CCCccceEEEEcCCCcccceee-cCceeeccceeeeccccccHHHhhhhcchh Q lcl|NC_011269. 74 YQGILRNVLLEDTLTPGVPIQYDV---LDDLGQAYMLHGNEGEIRITPF-EGKRIEVQLFRIASFPQIKKEDLYYLRSNI 149 (333) Q Consensus 74 rqGi~RklL~~~TL~~G~~p~y~v---~~~v~~a~~~~~~~G~i~~Q~i-~~~ri~~P~f~Ivs~P~V~~~dl~~~~~~v 149 (333) .....|++....+++.+.. .|++ .++....+-|++..+++++..+ .=+.|++.-..+..+..|..+=|+.. 
.++ T Consensus 144 ~~~~l~~~~~~~~~~~~~~-~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~ds-~~l 221 (413) T protein:vir:81 144 EKLVVADLMDNLTMTNTTI-KYLMEKANRVVEGGFKTVAEGGKKPYMRFADFDIVTESLSKIAGLTKITDEMIEDY-DFL 221 (413) T ss_pred hhhhHHhhcceeeccCCce-eEEEeccccccccccceecCcccccccCcccceeeEeeeeeEEEeehhhHHHHHHH-HHH Confidence 9999999999888876653 3333 2233344557777777765433 22578888888888888888866655 568 Q ss_pred HHHHHHHHHHHHHHHhhhHHHHHHhhhhhhhhhhcccccccccCCC----cceEEeeccccHHHHHHHHHHHHhh-CCcc Q lcl|NC_011269. 150 VEYTQDMTKQAIMRQEDSRLVTLLEAAAVSYRVVDSSAQPGVGALP----NEITIAGSHLMPDDLYTAVTYTDQR-QLDS 224 (333) Q Consensus 150 le~~q~~A~qaIM~qED~~~~slle~~a~~~r~~~ssA~p~vg~~~----N~i~i~~g~Lt~~~L~~a~t~v~~~-~L~a 224 (333) ..+....-.++|-+-+|.-+++ +.. ...|-.|-++ +.++...+.-.-+.+..+...+... +... T Consensus 222 ~~~i~~~la~~~~~~~d~~~l~---G~G--------~~~~~~Gi~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~ 290 (413) T protein:vir:81 222 VSYINARLLEELAIEEERQLLL---GDG--------TGNNLTGLLKRDGIQTLAVSNKDELADSIYKAMTNISLATPFQA 290 (413) T ss_pred HHHHHHHHHHHHHHHHHHHHhc---cCC--------CCCcccccccccccccccccccchhHHHHHHHHHHhhhhccCCC Confidence 8888888899999999965442 322 1122122111 2333333334456677777776543 5666 Q ss_pred ceEEechhhhhhhhhcCCCchh-hhHHhhhhhc----ceeeeeecccc--cceeeecCCeEEEeeChhhhcccccccCce Q lcl|NC_011269. 225 SRLLANPQEYRDLYRWDINTTG-WAFKDSVVAG----ERIVQFGEFQI--GKSIIIPRGTVYLTPEPEFLGVFPVMYSLD 297 (333) Q Consensus 225 t~il~~~~~~~Di~gw~~N~~~-~~~~DpV~~~----e~il~~G~fgi--~~skvlprgeiyvvadpE~~G~~pvR~~L~ 297 (333) ..++||++.|..|+.=- ++.| |.+.+++... ......=++|. ..+-.+|.|++|+ .|.-++-.+-+|+|+. 
T Consensus 291 ~~~vmn~~~~~~l~~lk-d~~G~~l~~~~~~~~~~~~~~~~~~~l~G~pv~~s~~~~~~~~~~-gd~~~~~~~~~~~~~~ 368 (413) T protein:vir:81 291 DALVINPLDYQELRLAK-DANGQYYGGGVFQGQYGSGGIMLDPAPWGLRTVQSQVVPVGKPVV-GAFRSAASVLRKGGVR 368 (413) T ss_pred cEEEEcHHHHHHHHHhh-ccCCceeccccccccccccccccCceecceeeEEcCCCCcccEEE-EecccEEEEEEecceE Confidence 77999999999887511 1111 2222222111 00000013443 3556689999865 6665566677899988 Q ss_pred eccccch----hhhccceehhhhhhhhhhccceEEEEecC Q lcl|NC_011269. 298 VEEDNKV----ERFNKGWVMDELVGMAILNPRGIVILRKA 333 (333) Q Consensus 298 s~p~D~~----er~~kGWvm~E~~g~~i~N~~siv~~~~~ 333 (333) ++-.+.. ++-..+|..++-+++.+.+|.+++.+.-+ T Consensus 369 v~~~~~~~~~~~~~~~~~r~~~r~d~~~~~~~a~~~l~~~ 408 (413) T protein:vir:81 369 IDSTNTNVDDFENNLITVRAEERVGLMVTFPEAIVQLDVA 408 (413) T ss_pred EEEeccccchhhcCcEEEEEEEeeccEEecccceEEEEec Confidence 8766643 33456999999999999999999999877 No 59 >protein:vir:1638 Length: 298 # NCBI annotation: Structural protein # Family: family:all:966 # MgeID: mge:33 # MgeName: r1t # Cross-refs: genbank:acc:NP_695059;genbank:gi:23455750;genbank:GeneID:955469 Probab=98.37 E-value=1.6e-08 Score=63.36 Aligned_cols=266 Identities=9% Similarity=-0.007 Sum_probs=169.1 Q ss_pred hcCchhHHHHHHHHHHHHHHHHHhhhhhhhhhhhccccCCCcceeecCCCCccceEEEEcCCCcccceeecCceeeccce Q lcl|NC_011269. 50 LSDKVGGIQRLGQSMIGPIQLQLRYQGILRNVLLEDTLTPGVPIQYDVLDDLGQAYMLHGNEGEIRITPFEGKRIEVQLF 129 (333) Q Consensus 50 l~~~Eg~~~aLg~~mA~pI~~q~~rqGi~RklL~~~TL~~G~~p~y~v~~~v~~a~~~~~~~G~i~~Q~i~~~ri~~P~f 129 (333) |...-|- -+-..+.+.|-+.++.+.+.|++....+++.|.. .||+...-.. +.|++..++++.....=+.+++.-. 
T Consensus 1 ma~~gG~--lvp~~~~~~ii~~~~~~s~i~~l~~~~~~~~~~~-~ip~~~~~~~-a~~v~E~~~~~~~~~~f~~v~l~~~ 76 (298) T protein:vir:16 1 MVLNKGT--LFDPTLVTDLISKVAGKSSIARLSAQKPIPFNGE-KVFTFTMDSE-IDVVAESGKKTHGGVTLAPQTMVPI 76 (298) T ss_pred CcccCcc--eechhHHHHHHHHHHhhhhhhhhcceeeccCCce-EEEEEecCcc-eEEecCCccccccccceeEEEEeee Confidence 6655553 4556778888899999999999999888887654 4555444333 4578877778877666678888888 Q ss_pred eeeccccccHHHhhhh---cchhHHHHHHHHHHHHHHHhhhHHHHHHhhhhhhhhhhccccccccc--CCC----cceEE Q lcl|NC_011269. 130 RIASFPQIKKEDLYYL---RSNIVEYTQDMTKQAIMRQEDSRLVTLLEAAAVSYRVVDSSAQPGVG--ALP----NEITI 200 (333) Q Consensus 130 ~Ivs~P~V~~~dl~~~---~~~vle~~q~~A~qaIM~qED~~~~slle~~a~~~r~~~ssA~p~vg--~~~----N~i~i 200 (333) ++.....|..+=|++. ..|+..+.+.+.+++|.+.+|.-+++=.+... ..+.+..| ... |..+. T Consensus 77 k~a~~~~iS~ell~~s~d~~~~l~~~i~~~la~ai~~~~d~~~l~G~~~~~-------g~~~~~~~~~~~~~~~~~~~~~ 149 (298) T protein:vir:16 77 KVEYGARISDEFMYASDEEKINILQEFNDGFAKKVARGIDLMAFHGVNPRL-------GTASAVIGTNHFDSKVTQKVEA 149 (298) T ss_pred eEEEeehhhHHHhhcCcccHHHHHHHHHHHHHHHHHHHHHHHhhccccCCC-------Cccccccccccccccccccccc Confidence 8888888888878644 46899999999999999999966554211000 01111111 000 11111 Q ss_pred eecccc-HHHHHHHHHHHHhhCCccceEEechhhhhhhhhcCCCchhhhHHhhhhhcceeeeee----ccccc--ceeee Q lcl|NC_011269. 201 AGSHLM-PDDLYTAVTYTDQRQLDSSRLLANPQEYRDLYRWDINTTGWAFKDSVVAGERIVQFG----EFQIG--KSIII 273 (333) Q Consensus 201 ~~g~Lt-~~~L~~a~t~v~~~~L~at~il~~~~~~~Di~gw~~N~~~~~~~DpV~~~e~il~~G----~fgi~--~skvl 273 (333) .+..-. -+++..++..+...+.+...++||+..|..|+. +.+. .--|+-+.. 
.+.| ++|.+ .+.-+ T Consensus 150 ~~~~~~~~~~i~~~~~~~~~~~~~~~~~vmn~~~~~~l~~--lkd~---~G~~i~~~~--~~~~~~~~l~G~PV~~~~~v 222 (298) T protein:vir:16 150 PRGIADPNGAIENAVELLTGVDADVTGIAINPSFRSALAK--QKDL---QDNALFPEL--KWGATPDTINGLPVDVNKTV 222 (298) T ss_pred ccccccHHHHHHHHHHHhhhcCCCccEEEEcHHHHHHHHH--hhcc---CCCeeecCc--ccCCCCceecceeeEEeccc Confidence 111111 247889999999999999999999999999876 2211 012222211 0111 34433 23334 Q ss_pred cCC-----eEEEeeChhhhcccccccCceeccccc----------hhhhccceehhhhhhhhhhccceEEEEecC Q lcl|NC_011269. 274 PRG-----TVYLTPEPEFLGVFPVMYSLDVEEDNK----------VERFNKGWVMDELVGMAILNPRGIVILRKA 333 (333) Q Consensus 274 prg-----eiyvvadpE~~G~~pvR~~L~s~p~D~----------~er~~kGWvm~E~~g~~i~N~~siv~~~~~ 333 (333) |-+ ...++.|=.+.-.+-+|+++..+-.+. .++-..+|...+-++..+.||.+++.|.+| T Consensus 223 ~~~~~~~~~~~~~GDfs~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~v~~ra~~r~d~~v~~~~a~~~l~~a 297 (298) T protein:vir:16 223 SDMSLTQRDRAIIGDFANGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLGWGILDATKFARVTEA 297 (298) T ss_pred ccccCCCccEEEEeeccceEEEEEecCceEEEeeccCCcCcchhhhhcCcEEEEEEEEEccEeecccceEEEeec Confidence 432 133344443333455677776644332 222357788999999999999999999999 No 60 >protein:vir:1025 Length: 408 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:20 # MgeName: bIL286 # Cross-refs: genbank:acc:NP_076679;genbank:gi:13095788;genbank:GeneID:920362 Probab=98.36 E-value=4.3e-08 Score=60.95 Aligned_cols=301 Identities=12% Similarity=0.093 Sum_probs=186.0 Q ss_pred Ccccchhhhh-hhhhhcccchHHHHHHHHHHhhcchhcchHHHHHHHHHHhcCchhHHHHHHHHHHHHHHHHHhhhhhhh Q lcl|NC_011269. 1 MTLPVAVGSG-LGRFAKASDDYVADIVEAKQRMGGRKLSAREKQAKLAHILSDKVGGIQRLGQSMIGPIQLQLRYQGILR 79 (333) Q Consensus 1 ~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~ls~ee~~~Lm~~Al~~~Eg~~~aLg~~mA~pI~~q~~rqGi~R 79 (333) ...+-..... -+.-......|.....+. .|.+-..++.++.+++- ...+..|+. 
.+-+-+.+.|-..++.....+ T Consensus 72 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~a~~--~~t~~~gg~-~vP~~~~~~Ii~~~~~~~~l~ 147 (408) T protein:vir:10 72 VNMREEEKGPLNKSENELKDKFVKDFVNM-VRNPMAFMNTVSSKTET--SGSDSAAGL-TIPQDIRTMINTLVRQYDSLQ 147 (408) T ss_pred hccccccccccccchhhhHHHHHHHHHHH-hhcchhhhhhhhhhhhh--cccccCCce-eccHhHHHHHHHHHHhhchhh Confidence 2221111111 111112223333332221 22333334444444332 223344542 566778888999999999999 Q ss_pred hhhhccccCCCccee-ecCCCCccceEEEEcCCCccccee-ecCceeeccceeeeccccccHHHhhhhcchhHHHHHHHH Q lcl|NC_011269. 80 NVLLEDTLTPGVPIQ-YDVLDDLGQAYMLHGNEGEIRITP-FEGKRIEVQLFRIASFPQIKKEDLYYLRSNIVEYTQDMT 157 (333) Q Consensus 80 klL~~~TL~~G~~p~-y~v~~~v~~a~~~~~~~G~i~~Q~-i~~~ri~~P~f~Ivs~P~V~~~dl~~~~~~vle~~q~~A 157 (333) ++.+..+++.+.... ++.-.+...-+.|++-.++++++- ..-+.|++...++...+.|..+=|+....|+..+..++. T Consensus 148 ~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l 227 (408) T protein:vir:10 148 QYVRVESVSTSNGSRVYEKWTDVTPLTVMDAEDGKIPDLDNPQLTIIKYLIKRYAGIITATNTSLKDTAENILAWLSSWI 227 (408) T ss_pred hhcceeeccCCcceEEEeeccccccceeeecCccccccccCcceeeEEeeeeeEEeeehhHHHHHhhchHHHHHHHHHHH Confidence 998888876554321 222233334456777777787643 334789999999999999999999999999999999999 Q ss_pred HHHHHHHhhhHHHHHHhhhhhhhhhhcccccccccCCCcceEEeeccccHHHHHHHH-HHHHhhCCccceEEechhhhhh Q lcl|NC_011269. 158 KQAIMRQEDSRLVTLLEAAAVSYRVVDSSAQPGVGALPNEITIAGSHLMPDDLYTAV-TYTDQRQLDSSRLLANPQEYRD 236 (333) Q Consensus 158 ~qaIM~qED~~~~slle~~a~~~r~~~ssA~p~vg~~~N~i~i~~g~Lt~~~L~~a~-t~v~~~~L~at~il~~~~~~~D 236 (333) .++|-.-+|.-+++-.- ++.|. 
++..+-++|..++ ..++..-.....++||+..|.- T Consensus 228 ~~~~~~~~~~~il~g~g-----------~~~~~-----------~~~~~~~~l~~~~~~~~~~~~~~~a~~v~n~~~~~~ 285 (408) T protein:vir:10 228 AKKVVVTRNQAIIEVMK-----------AAPKK-----------PTIAKFDDVITMINTAVDPAIIATSSLLTNQSGLNK 285 (408) T ss_pred HHHHHHHHHHHHhhccc-----------ccccc-----------cccccHHHHHHHHHHhhhhhhccCCEEEEcHHHHHH Confidence 99999999976665441 12222 2334667777766 3455544555678999999998 Q ss_pred hhhcCCC-chhhhHHhhhhhcceeee--eecccccc----eeeecCCe----EEEeeChhhhcccccccCceeccccchh Q lcl|NC_011269. 237 LYRWDIN-TTGWAFKDSVVAGERIVQ--FGEFQIGK----SIIIPRGT----VYLTPEPEFLGVFPVMYSLDVEEDNKVE 305 (333) Q Consensus 237 i~gw~~N-~~~~~~~DpV~~~e~il~--~G~fgi~~----skvlprge----iyvvadpE~~G~~pvR~~L~s~p~D~~e 305 (333) |+. +- ..| .|+-+.+..-. .=++|.+. +..+|-.. .+++.|....-.+-+|+|+.++-.+... T Consensus 286 l~~--lkd~~G----~~i~~~~~~~~~~~~l~G~PV~~~~~~~~~~~~~~~~~i~~gd~~~~~~~~~~~~~~v~~~~~~~ 359 (408) T protein:vir:10 286 LAL--VKTAEG----KYLLEPDPTKPNSYLIKGKQVIVVADRWLPNTGSTVYPLYYGDMSQAITLFDRENMSLLPTNIGA 359 (408) T ss_pred HHH--hhccCC----ceEeccCcCCCCCceecceeeEEecccccCccCCCceEEEEEehhccEEEEEecceEEEEccccc Confidence 876 21 111 12222221000 01255442 34556543 3677888766678889999998777542 Q ss_pred ----hhccceehhhhhhhhhhccceEEEEecC Q lcl|NC_011269. 
306 ----RFNKGWVMDELVGMAILNPRGIVILRKA 333 (333) Q Consensus 306 ----r~~kGWvm~E~~g~~i~N~~siv~~~~~ 333 (333) +-..++..++-++..+.||.+++.+.-+ T Consensus 360 ~~f~~~~~~~r~~~r~d~~v~~~~a~~~~~~~ 391 (408) T protein:vir:10 360 GAFETDTTKIRVIDRFDVKATDSEALVAGSFS 391 (408) T ss_pred chhhcCceEEEEEEeeccEEeccccEEEEEee Confidence 2356899999999999999999999866 No 61 >protein:vir:7990 Length: 273 # NCBI annotation: gp6 # Family: family:all:2203 # MgeID: mge:151 # MgeName: Che8 # Cross-refs: genbank:acc:NP_817344;genbank:gi:29565772;genbank:GeneID:1258978 Probab=98.35 E-value=3.4e-08 Score=61.51 Aligned_cols=257 Identities=11% Similarity=0.063 Sum_probs=144.1 Q ss_pred HHHHhcCchhHHHHHHHHHHHHHHHHHhhhhhh-hhhhhccccCCCcceeecCCCCccce-EEEEcCCCcccceeecCce Q lcl|NC_011269. 46 LAHILSDKVGGIQRLGQSMIGPIQLQLRYQGIL-RNVLLEDTLTPGVPIQYDVLDDLGQA-YMLHGNEGEIRITPFEGKR 123 (333) Q Consensus 46 m~~Al~~~Eg~~~aLg~~mA~pI~~q~~rqGi~-RklL~~~TL~~G~~p~y~v~~~v~~a-~~~~~~~G~i~~Q~i~~~r 123 (333) |+...--+|=| ++.+...++.++....+- |.+ +.+--+|....+|+++..+.+ | ....+.+..+.+...+ T Consensus 1 MA~~~~~pei~----~~~v~~~~~~~lv~~~l~~~~~--~~~~~~GdTv~ip~~~~~~~~d~--~~~~~~~~~~~~~~~~ 72 (273) T protein:vir:79 1 MAFNNFIPELW----SDMLLEEWTAQTVFANLVNREY--EGIASKGNVVHIAGVVAPTVKDY--KAAGRQTSADAISDTG 72 (273) T ss_pred CcchhhhHHHH----HHHHHHHHHhhccchhhhhccc--cccccCCcEEEEeecCccccccc--ccCCCccCccccccce Confidence 33322112222 222222333332211111 111 113346888999988877655 4 3334446666666666 Q ss_pred eecccee-eeccccccHHHhhhhcchhHHHHHHHHHHHHHHHhhhHHHHHHhhhhhhhhhhcccccccccCCCcceEEe- Q lcl|NC_011269. 124 IEVQLFR-IASFPQIKKEDLYYLRSNIVEYTQDMTKQAIMRQEDSRLVTLLEAAAVSYRVVDSSAQPGVGALPNEITIA- 201 (333) Q Consensus 124 i~~P~f~-Ivs~P~V~~~dl~~~~~~vle~~q~~A~qaIM~qED~~~~slle~~a~~~r~~~ssA~p~vg~~~N~i~i~- 201 (333) +++---+ ......|+-.|..+...|+ +.....+..++-..-|..+++++.+++..+ +.+.. 
T Consensus 73 ~~~tid~~~~~~~~i~d~d~~~~~~~~-~~~~~~~~~ala~~vD~~i~~~~~~a~~~~----------------~~~~~~ 135 (273) T protein:vir:79 73 VDLLIDQEKSIDFLVDDIDRVQVAGSL-EAYTRAGATALATDTDKFIADMLVDNGTAL----------------TGSAPS 135 (273) T ss_pred EEEEEeeecccceeeccHHHHhhcccH-HHHHHHHHHHHHHHHHHHHHHHHhhccccc----------------cccccc Confidence 6665323 3444567777888888886 556677778898999999988886554322 11111 Q ss_pred eccccHHHHHHHHHHHHhhCCcc--ceEEechhhhhhhhhcCCCchhhhHHhhhhhcceeeeeec------ccccceeee Q lcl|NC_011269. 202 GSHLMPDDLYTAVTYTDQRQLDS--SRLLANPQEYRDLYRWDINTTGWAFKDSVVAGERIVQFGE------FQIGKSIII 273 (333) Q Consensus 202 ~g~Lt~~~L~~a~t~v~~~~L~a--t~il~~~~~~~Di~gw~~N~~~~~~~DpV~~~e~il~~G~------fgi~~skvl 273 (333) .+...-+.|-.|.+..++.+.|. -+++++|..|.+|+. ...++..+ |-... .-.+..|. |.|..|..+ T Consensus 136 ~~~~~~~~i~~a~~~ld~~~vP~~~R~lvv~p~~~~~Ll~--~~~~~~~~-~~~~~-~~~l~~G~ig~~~G~~i~~s~~l 211 (273) T protein:vir:79 136 DADDAFDLIASALKELTKANVPNVGRVVVVNAEMAFWLRS--SGSKLTSA-DTSGD-AAGLRAGTIGNLLGARIVESNNL 211 (273) T ss_pred chhhHHHHHHHHHHHhhhccCCccCcEEEECHHHHHHHhh--chhhhhhh-hhccc-ccceeeeEeeEEeceEEEecccc Confidence 11223467888999999999854 489999999999987 22232221 21111 11333444 447788888 Q ss_pred cCCe--EEEeeChhhhcccccccCceeccccchhhhccceehhhhhhhhhhccceEEEEecC Q lcl|NC_011269. 274 PRGT--VYLTPEPEFLGVFPVMYSLDVEEDNKVERFNKGWVMDELVGMAILNPRGIVILRKA 333 (333) Q Consensus 274 prge--iyvvadpE~~G~~pvR~~L~s~p~D~~er~~kGWvm~E~~g~~i~N~~siv~~~~~ 333 (333) |-++ -++---+...| +..+ -.++|..--..+|+.-=.-.+..|-.+.+|.+||++++. 
T Consensus 212 p~~~~~~~~a~~~~A~~-~a~~-~~~~e~~r~~~~~~~~v~~~~~yg~~v~~p~~vv~~~~~ 271 (273) T protein:vir:79 212 RDTDDEQFVAFHPSAAA-YVSQ-IDTVEALRDQDSFSDRIRALHVYGGKVVRPTGVVVFNKT 271 (273) T ss_pred cccCceEEEEEecccee-eeee-hhhhhcccCcccceeeeeeeeeeeeEEecCceEEEEecc Confidence 8654 22322333333 3333 223333222222333333345689999999999999999 No 62 >protein:vir:1328 Length: 392 # NCBI annotation: gp36 # Family: family:all:21 # MgeID: mge:28 # MgeName: phi-C31 # Cross-refs: genbank:acc:NP_047927;swissprot:trembl:q9zwv6;genbank:gi:9631145;uniprot:Q9ZWV6;genbank:GeneID:2715889 Probab=98.35 E-value=2.1e-08 Score=62.71 Aligned_cols=307 Identities=14% Similarity=0.101 Sum_probs=174.3 Q ss_pred Ccccch----hhhhhhhhhcccchHHHHHHHHHHhhcch-hcchHHHHHHHHHHhcCchhHHHHHHHHHHHHHH-HHHhh Q lcl|NC_011269. 1 MTLPVA----VGSGLGRFAKASDDYVADIVEAKQRMGGR-KLSAREKQAKLAHILSDKVGGIQRLGQSMIGPIQ-LQLRY 74 (333) Q Consensus 1 ~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~ls~ee~~~Lm~~Al~~~Eg~~~aLg~~mA~pI~-~q~~r 74 (333) ...... -+.+...=.+.+.++ ++.-|.|.. .+.+.+...-......+..|. .+-.-+...+- +-++. T Consensus 65 ~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~r~g~~~~~~~~~~~~~~~~~t~~~~g~--~~~~~~~~~~i~~~~~~ 137 (392) T protein:vir:13 65 DAVTSLLSGLQGSGSGAQRSADHDD-----DAVLRAGNLGEARSFEFAPEKRDGTKAGNPN--VLSRTLYGQLIAQAVER 137 (392) T ss_pred HHHHHHhcccCCcccchhhhhhHHH-----HHHHhccchhhhHHHHhhhhhhcccccCCCc--cccccchHHHHHHHHhh Confidence 000000 000000001111111 111122210 000001000011111122221 34454555544 44455 Q ss_pred hhhhhhhhhccccCCCcceeecCCCCccceEEEEcCCCcccceeecCceeeccceeeeccccccHHHhhhhcchhHHHHH Q lcl|NC_011269. 75 QGILRNVLLEDTLTPGVPIQYDVLDDLGQAYMLHGNEGEIRITPFEGKRIEVQLFRIASFPQIKKEDLYYLRSNIVEYTQ 154 (333) Q Consensus 75 qGi~RklL~~~TL~~G~~p~y~v~~~v~~a~~~~~~~G~i~~Q~i~~~ri~~P~f~Ivs~P~V~~~dl~~~~~~vle~~q 154 (333) .-++|.+-...+...|....+|+......+ .|.+--+.+++....-+.|++.-.++...+-|..+=|.....|+..+.. 
T Consensus 138 ~~~l~~~~~~~~~~~~~~~~~~~~~~~~~a-~~v~E~~~~~~~~~~f~~v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~ 216 (392) T protein:vir:13 138 SAIMRGGASTFTTSDANPMDFTVITGRATA-GIVGETAEIPESYPATTQRSMGGFKYGFASVVSYEFATDQVLDLVGFLV 216 (392) T ss_pred hhhhhhcceeeecCCCceeEEEEEcCCcce-eeecccccccccccceeeEEeeeeeEEeeehhHHHHHhcchHHHHHHHH Confidence 567777655555444544555554444334 4678888888887777889999999999999999999999999999999 Q ss_pred HHHHHHHHHHhhhHHHHHHhhhhhhhhhhcccccccccCCC-------cceEEeeccccHHHHHHHHHHHHhhCCccceE Q lcl|NC_011269. 155 DMTKQAIMRQEDSRLVTLLEAAAVSYRVVDSSAQPGVGALP-------NEITIAGSHLMPDDLYTAVTYTDQRQLDSSRL 227 (333) Q Consensus 155 ~~A~qaIM~qED~~~~slle~~a~~~r~~~ssA~p~vg~~~-------N~i~i~~g~Lt~~~L~~a~t~v~~~~L~at~i 227 (333) ++-.++|...+|.-+++ +-.+ ..| .|-+. ..-+..++.++-++|-.++..+..--...... T Consensus 217 ~~l~~~i~~~~d~~~l~---G~Gt--------~~p-~Gil~~~~~~~~~~~~~~~~~~~~d~l~~~~~~l~~~~~~~a~~ 284 (392) T protein:vir:13 217 SDAGPAIGDAMGRHFLT---GTGT--------GQP-RGILTDATGANAAFGEADADSKVSDALIDLFHEVPSAYRKNAKF 284 (392) T ss_pred HHHHHHHHHHHHHHHhc---ccCC--------ccc-cccccccccccccccccccccccHHHHHHHHHhhhhhhhcCCEE Confidence 99999999999976553 2110 111 11110 11123456677888888777665544445578 Q ss_pred EechhhhhhhhhcCCCchh-hhHHhhhhhcceeeeeeccccc--ceeeecCCeEEEeeChhhhcccccccCceecc-ccc Q lcl|NC_011269. 228 LANPQEYRDLYRWDINTTG-WAFKDSVVAGERIVQFGEFQIG--KSIIIPRGTVYLTPEPEFLGVFPVMYSLDVEE-DNK 303 (333) Q Consensus 228 l~~~~~~~Di~gw~~N~~~-~~~~DpV~~~e~il~~G~fgi~--~skvlprgeiyvvadpE~~G~~pvR~~L~s~p-~D~ 303 (333) +||++-|.-|..= -|..| |.+.+++..+.- .-++|.+ .+-.+|-|+|++ .|... -.+-+|+++.+.. .|. 
T Consensus 285 v~n~~~~~~l~~l-kd~~G~~l~~~~~~~g~~---~~l~G~Pv~~~~~~~~~~i~~-Gdf~~-~~i~~~~~~~i~~~~~~ 358 (392) T protein:vir:13 285 VVNDLRAAQMRKL-KDANGQYLWQSALTVGAP---DTFNGKVVETDDGMPADKVLF-ADLSK-YRVRFAGSLRVDRSVDA 358 (392) T ss_pred EEcHHHHHHHHHh-hccCCceeecCCcCCCCC---ceecceeeEEcCCCCCCcEEE-eeccc-eeEEeecceEEEeeccc Confidence 9999998887751 11111 222222222210 0134533 667789999875 67653 4677899998874 333 Q ss_pred hhh-hccceehhhhhhhhhhccceEEEEecC Q lcl|NC_011269. 304 VER-FNKGWVMDELVGMAILNPRGIVILRKA 333 (333) Q Consensus 304 ~er-~~kGWvm~E~~g~~i~N~~siv~~~~~ 333 (333) .++ -..+...++-++..+.||.+++++... T Consensus 359 ~~~~~~~~~r~~~r~d~~~~~~~A~~~~~~~ 389 (392) T protein:vir:13 359 KFSTDQIVYRFLQRADGLLVDARGAKVLTVT 389 (392) T ss_pred cccCCcEEEEEEEEeccEEecccceEEEEee Confidence 333 367899999999999999999999876 No 63 >protein:vir:101607 Length: 379 # NCBI annotation: major capsid protein precursor # Family: family:all:585 # MgeID: mge:1646 # MgeName: 11b # Cross-refs: genbank:acc:YP_112497;genbank:gi:53793597;uniprot:Q5ZGF6;genbank:GeneID:3101715 Probab=98.32 E-value=2.7e-08 Score=62.12 Aligned_cols=304 Identities=13% Similarity=0.076 Sum_probs=174.3 Q ss_pred Ccc-----------------------cchh-hhhhhhhhcccchHHHHHHHHHHhhcchhcchHHHHHHHHHHhcCchhH Q lcl|NC_011269. 1 MTL-----------------------PVAV-GSGLGRFAKASDDYVADIVEAKQRMGGRKLSAREKQAKLAHILSDKVGG 56 (333) Q Consensus 1 ~~~-----------------------~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ls~ee~~~Lm~~Al~~~Eg~ 56 (333) |+- .... ..+. -....+++.....+.+ +..++..+..+....-+..+.++... T Consensus 39 ~~~~~~~~~~e~~~~~~~l~~~~~~~e~~~~~~~~--~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~ 115 (379) T protein:vir:10 39 MTSEKDLAVNELKSDMAALQAHADKLDVKLKEKAK--SEDKSDSLVKSITENF-NDIKEVRNGKSIQVKAVGDMTLPVNL 115 (379) T ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccc--ccccchhHHHHHHHHH-HhHHHHHhhhhhhhhhhcccccCCCC Confidence 100 0000 0000 0001111111111110 00111111100000001111111110 Q ss_pred HHHHHHHHHHHHHHHHhhhhhhhhhhhccccCCCcceeecCCCCc-cceEEEEcCCCcccceeecCceeeccceeeeccc Q lcl|NC_011269. 
57 IQRLGQSMIGPIQLQLRYQGILRNVLLEDTLTPGVPIQYDVLDDL-GQAYMLHGNEGEIRITPFEGKRIEVQLFRIASFP 135 (333) Q Consensus 57 ~~aLg~~mA~pI~~q~~rqGi~RklL~~~TL~~G~~p~y~v~~~v-~~a~~~~~~~G~i~~Q~i~~~ri~~P~f~Ivs~P 135 (333) --..-..+...|-+.++++...+++++.-|...|.. .|++.... ..+..|.+..++++.....=+.|++....+..++ T Consensus 116 ~~~ip~~~~~~ii~~~~~~~~i~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~v~Eg~~~~~~~~~f~~i~~~~~k~~~~~ 194 (379) T protein:vir:10 116 TGAQPKDYNFDVVLNPSQMLNVSDIVGAVSISGGTY-TFVRENGAGEGAIGAQVEGATKGQKDYDISMIDVNTDFIAGFT 194 (379) T ss_pred ccccchhhhhHHHHhHHhhhhHHhhceeeeccCCce-EEEEeecCCCcccccccCCccccccccceeeeEeeeeeEEeee Confidence 001122345567777888889999998888877653 34443322 3355567776777766555578888888899999 Q ss_pred cccHHHhhhhcchhHHHHHHHHHHHHHHHhhhHHHHHHhhhhhhhhhhcccccccccCCCcceEEeeccccHHHHHHHHH Q lcl|NC_011269. 136 QIKKEDLYYLRSNIVEYTQDMTKQAIMRQEDSRLVTLLEAAAVSYRVVDSSAQPGVGALPNEITIAGSHLMPDDLYTAVT 215 (333) Q Consensus 136 ~V~~~dl~~~~~~vle~~q~~A~qaIM~qED~~~~slle~~a~~~r~~~ssA~p~vg~~~N~i~i~~g~Lt~~~L~~a~t 215 (333) .|..+=|... .++..+..++...+|.+.+|..+++.+.+..+ +...-..+..+-+++..++. T Consensus 195 ~iS~ell~D~-~~l~~~i~~~la~~~~~~~~~~~~~g~~~~~~-----------------~~~~~~~~~~~~d~i~~~~~ 256 (379) T protein:vir:10 195 RYSKKMANNL-PFLTSFIPNALRRDYAKAENAAFNAVLAANAT-----------------ASTEIITNKNKVEMLINEIA 256 (379) T ss_pred hhhHHHHhhH-HHHHHHHHHHHHHHHHHHHHHHHhcccccccc-----------------cccccccCcccHHHHHHHHH Confidence 9998876654 57888899999999999999887776644321 11112233445678999999 Q ss_pred HHHhhCCccceEEechhhhhhhhhcCCCchhhhHHhhhhhcceeeeee----ccccc--ceeeecCCeEEEeeChhhhcc Q lcl|NC_011269. 216 YTDQRQLDSSRLLANPQEYRDLYRWDINTTGWAFKDSVVAGERIVQFG----EFQIG--KSIIIPRGTVYLTPEPEFLGV 289 (333) Q Consensus 216 ~v~~~~L~at~il~~~~~~~Di~gw~~N~~~~~~~DpV~~~e~il~~G----~fgi~--~skvlprgeiyvvadpE~~G~ 289 (333) .+..-++....++||++-|.-|+.= -++.| .|+-+.....+.| ++|++ .+--||-|++| +.|... +. 
T Consensus 257 ~~~~~~~~~~~~vmn~~~~~~l~~l-kd~~G----~~l~~~~~~~~~~~~~~l~G~pvv~s~~~~ag~~~-~gdf~~-~~ 329 (379) T protein:vir:10 257 KQENLDFPVTAIVLRPTDYYDILVT-QKSVG----AGYGLPGVVTQDNGVLRINGIPLFRATWLAANKYY-VGDWTR-VT 329 (379) T ss_pred hhhhccCCCCEEEEcHHHHHHHHHh-hccCC----ceeccCCccCCCCCcceecceeeEecCCCCCCceE-Eeeccc-EE Confidence 8999999999999999999888751 01222 2333333222333 34533 66778999976 455552 34 Q ss_pred cccccCceecccc----chhhhccceehhhhhhhhhhccceEEEEecC Q lcl|NC_011269. 290 FPVMYSLDVEEDN----KVERFNKGWVMDELVGMAILNPRGIVILRKA 333 (333) Q Consensus 290 ~pvR~~L~s~p~D----~~er~~kGWvm~E~~g~~i~N~~siv~~~~~ 333 (333) .=+|.|+.++-.+ +.+.--.+|...+-+++++.+|.++|.+.-+ T Consensus 330 ~~~~~~~~i~~~~~~~~~f~~~~~~~r~~~R~~~~v~~p~a~v~~~~~ 377 (379) T protein:vir:10 330 KVTTEGLSLEFSEVEGTNFVKNNITARIEAQVALAVEQPAALIFGDFT 377 (379) T ss_pred EEEEeceEEEEeecccccccCCcEEEEEEEEeccEEecCccEEEEEec Confidence 4467776654332 3334467888899999999999999998887 No 64 >protein:vir:9574 Length: 300 # NCBI annotation: gp40 # Family: family:all:966 # MgeID: mge:171 # MgeName: SM1 # Cross-refs: genbank:acc:NP_862879;genbank:gi:32469471;genbank:GeneID:1461316 Probab=98.30 E-value=3.1e-08 Score=61.73 Aligned_cols=270 Identities=10% Similarity=0.046 Sum_probs=174.3 Q ss_pred HHHHhcCchhHHHHHHHHHHHHHHHHHhhhhhhhhhhhccccCCCcceeecCCCCccceEEEEcCCCcccceeecCceee Q lcl|NC_011269. 46 LAHILSDKVGGIQRLGQSMIGPIQLQLRYQGILRNVLLEDTLTPGVPIQYDVLDDLGQAYMLHGNEGEIRITPFEGKRIE 125 (333) Q Consensus 46 m~~Al~~~Eg~~~aLg~~mA~pI~~q~~rqGi~RklL~~~TL~~G~~p~y~v~~~v~~a~~~~~~~G~i~~Q~i~~~ri~ 125 (333) |+..-.+. |- -+=..+...|-+.++.+...|++....+++.|.. .||+...... 
+-|++-.++++.....=+.++ T Consensus 1 ma~~t~~~-G~--lip~~~~~~ii~~l~~~s~i~~l~~~~~~~~~~~-~~p~~~~~~~-a~wv~Eg~~~~~s~~~f~~v~ 75 (300) T protein:vir:95 1 MSEAQLSK-GN--LFNPELVTKVINKVKGHSSIAKLSPQKPIPFNGQ-REFVFDFDSD-IDIVAENGKKTHGGVSLDPVT 75 (300) T ss_pred CcccccCC-cc--eechhhHHHHHHHHHhhhhhhhhcceeeccCCce-EEEEEecCcc-eEEeeCCcccccccccceeeE Confidence 55554443 21 4566788899999999999999999988887643 3444333223 347777777777766667888 Q ss_pred ccceeeeccccccHHHhhhh---cchhHHHHHHHHHHHHHHHhhhHHHHHHhhhhhhhhhhcccccccccCC--C--cce Q lcl|NC_011269. 126 VQLFRIASFPQIKKEDLYYL---RSNIVEYTQDMTKQAIMRQEDSRLVTLLEAAAVSYRVVDSSAQPGVGAL--P--NEI 198 (333) Q Consensus 126 ~P~f~Ivs~P~V~~~dl~~~---~~~vle~~q~~A~qaIM~qED~~~~slle~~a~~~r~~~ssA~p~vg~~--~--N~i 198 (333) +.-.++.....|..+-|.+. ..++..+..++..++|-+.+|.-+++=.++.- ..+....|.. + ... T Consensus 76 l~~~k~~~~~~iS~ell~~~~d~~~~l~~~i~~~l~~aia~~~d~~~l~G~~~~~-------g~~~~~~~~~~~~~~~~~ 148 (300) T protein:vir:95 76 IVPLKVEYGARVSDEFLHASEEAKVDMLTDFVEGFSKKLARGLDIMSIHGINPRT-------KQASTIIGDNCFDKKVTQ 148 (300) T ss_pred eeeEEEEEeehhhHHHhccCCCCHHHHHHHHHHHHHHHHHHHHHHhhhhcccCCC-------CCCcccccccccccccce Confidence 88889999999988877644 47899999999999999999966653321110 0000001110 1 112 Q ss_pred EEe-eccccHHHHHHHHHHHHhhCCccceEEechhhhhhhhhcCCCchhhhHHhhhhhcceee--e-eeccccc--ceee Q lcl|NC_011269. 199 TIA-GSHLMPDDLYTAVTYTDQRQLDSSRLLANPQEYRDLYRWDINTTGWAFKDSVVAGERIV--Q-FGEFQIG--KSII 272 (333) Q Consensus 199 ~i~-~g~Lt~~~L~~a~t~v~~~~L~at~il~~~~~~~Di~gw~~N~~~~~~~DpV~~~e~il--~-~G~fgi~--~skv 272 (333) +.. .+..+-++|.++...++.-+.+....+||+..|..|+.=- +..+ .|+-... .. + .=+.|.+ .+.. 
T Consensus 149 ~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~vmn~~~~~~L~~lk-d~~G----~~i~~~~-~~~~~~~~l~G~Pv~~s~~ 222 (300) T protein:vir:95 149 TVPFKDTNPDESMEDAVGMIDGSERDITGAILDPIFTTALSKMK-NAEG----GKLYPEL-AWGGVPDAINGLAVDKNRT 222 (300) T ss_pred eecccccchHHHHHHHHHHhhhcCCCccEEEECHHHHHHHHHhh-ccCC----CeeccCc-cccCCCceecceeeEEecC Confidence 222 2344557899999999999999999999999999887611 1111 1111100 00 0 1124443 2344 Q ss_pred ecCCe-----EEEeeChhhhcccccccCceeccccchhh----------hccceehhhhhhhhhhccceEEEEecC Q lcl|NC_011269. 273 IPRGT-----VYLTPEPEFLGVFPVMYSLDVEEDNKVER----------FNKGWVMDELVGMAILNPRGIVILRKA 333 (333) Q Consensus 273 lprge-----iyvvadpE~~G~~pvR~~L~s~p~D~~er----------~~kGWvm~E~~g~~i~N~~siv~~~~~ 333 (333) +|-+. +.++.|-..+-.+-+|+++..+-.++.+. -..+|...+.++.+|.||.++|.|.++ T Consensus 223 v~~~~~~~~~~~~~GDf~~~~~~~~~~~~~~~v~~~~~~d~~~~~~f~~~~v~~r~~~r~d~~v~~~~a~~~l~~~ 298 (300) T protein:vir:95 223 VSYSQTDPKNTAIVGDFETMFKWGYAKEVPMEIIKYGDPDNSGRDLKGYNQIYIRCEAYIGWGIMDAASFARIVKT 298 (300) T ss_pred CCCCCCCCccEEEEeeccceEEEEEecccEEEEeeccCCCCcchhhhhcCcEEEEEEEeecceeecccceEEEecC Confidence 55443 34456765444467788877765544322 246777888899999999999999998 No 65 >protein:vir:7409 Length: 408 # NCBI annotation: major structural protein # Family: family:all:21 # MgeID: mge:146 # MgeName: P335 # Cross-refs: genbank:acc:NP_839926;genbank:gi:30089896;genbank:GeneID:1260683 Probab=98.30 E-value=2.7e-08 Score=62.11 Aligned_cols=302 Identities=13% Similarity=0.070 Sum_probs=184.3 Q ss_pred Ccccchh-hhhhhhhhcccchHHHHHHHHHHhhcchhcchHHHHHHHHHHhcCchhHHHHHHHHHHHHHHHHHhhhhhhh Q lcl|NC_011269. 1 MTLPVAV-GSGLGRFAKASDDYVADIVEAKQRMGGRKLSAREKQAKLAHILSDKVGGIQRLGQSMIGPIQLQLRYQGILR 79 (333) Q Consensus 1 ~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ls~ee~~~Lm~~Al~~~Eg~~~aLg~~mA~pI~~q~~rqGi~R 79 (333) ...+-.- ...-.+..+..+.|+...... .|.+-..+...+..++.. 
-.+..|+ -.+-+-+...|-..++.+..++ T Consensus 72 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~a~~~--~~~~~gg-~~vP~~~~~~Ii~~~~~~~~l~ 147 (408) T protein:vir:74 72 VNMREEEKGPLNKSENELKDKFVKDFVNM-VRNPMAFLNTVSSKTETS--GSDSAAG-LTIPQDIRTMINTLVRQYDSLQ 147 (408) T ss_pred hhccccccccccchhhhhHHHHHHHHHHH-Hhcchhhhhhhhhhhhcc--cccCCCc-eeechhHhhHHHHHHhhhcchh Confidence 1110000 000111222333333332221 122323334444433321 2334454 3677788889999999999999 Q ss_pred hhhhccccCCCcce-eecCCCCccceEEEEcCCCccccee-ecCceeeccceeeeccccccHHHhhhhcchhHHHHHHHH Q lcl|NC_011269. 80 NVLLEDTLTPGVPI-QYDVLDDLGQAYMLHGNEGEIRITP-FEGKRIEVQLFRIASFPQIKKEDLYYLRSNIVEYTQDMT 157 (333) Q Consensus 80 klL~~~TL~~G~~p-~y~v~~~v~~a~~~~~~~G~i~~Q~-i~~~ri~~P~f~Ivs~P~V~~~dl~~~~~~vle~~q~~A 157 (333) ++....+++.+... .|++-.+.+..+-|.+..+++.++- ..=+.|++.-.++..+..|..+=|+....|+..+..++- T Consensus 148 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~E~~~~~~~~~~~~~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l 227 (408) T protein:vir:74 148 QYVRVESVSTSSGSRVYEKWTDVTPLKAMDEEDGKIPDLDNPRLTIIKYLIKRYAGIITATNTLLKDTAENILAWLSSWI 227 (408) T ss_pred hhcceeeccCCcceEEEEeecCCcccccccccccccccccccceeeEEeeeeeEEeeehhHHHHHhhchHHHHHHHHHHH Confidence 99988888766532 2333334444455677767777532 333789999999999999999999999999999999999 Q ss_pred HHHHHHHhhhHHHHHHhhhhhhhhhhcccccccccCCCcceEEeeccccHHHHHHHH-HHHHhhCCccceEEechhhhhh Q lcl|NC_011269. 158 KQAIMRQEDSRLVTLLEAAAVSYRVVDSSAQPGVGALPNEITIAGSHLMPDDLYTAV-TYTDQRQLDSSRLLANPQEYRD 236 (333) Q Consensus 158 ~qaIM~qED~~~~slle~~a~~~r~~~ssA~p~vg~~~N~i~i~~g~Lt~~~L~~a~-t~v~~~~L~at~il~~~~~~~D 236 (333) .++|.+-+|.-+++-. 
-+..| .++.++.+++..++ ..+.........++||+..|.- T Consensus 228 ~~~~~~~~d~~il~G~-----------G~~~~-----------~~~~~~~~~i~~~~~~~l~~~~~~~a~~v~n~~~~~~ 285 (408) T protein:vir:74 228 AKKVVVTRNQAIIAAM-----------GTVPK-----------KPTIANFDDVITMINTSVDPAIIATSSLLTNQSGLNK 285 (408) T ss_pred HHHHHHHHHHHHhhcc-----------ccccc-----------ccccccHHHHHHHHHHhhhhhhcCCCEEEEcHHHHHH Confidence 9999999997555432 11122 24556778888876 4666666666778999999998 Q ss_pred hhhcCCCchhhhHHhhhhhcceeeee--ecccccc----eeeecCCe----EEEeeChhhhcccccccCceeccccch-- Q lcl|NC_011269. 237 LYRWDINTTGWAFKDSVVAGERIVQF--GEFQIGK----SIIIPRGT----VYLTPEPEFLGVFPVMYSLDVEEDNKV-- 304 (333) Q Consensus 237 i~gw~~N~~~~~~~DpV~~~e~il~~--G~fgi~~----skvlprge----iyvvadpE~~G~~pvR~~L~s~p~D~~-- 304 (333) |+.=- +..+ -|+-+.+..-.+ =++|.+- +..+|-.. .+++.|....-.+-+|+|+.++-.++. T Consensus 286 l~~lk-d~~G----~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~i~~gd~~~~~~~~~~~~~~i~~~~~~~~ 360 (408) T protein:vir:74 286 LALVK-TAEG----KYLLEPDPTKPNSYLIKGKQVIVVADRWLPNSGSTVYPLYYGDMSQAITLFDRENMSLLPTNIGAG 360 (408) T ss_pred HHHhh-cCCC----ceEeccCcCCCCCceecceeeEEecCcccccccCCcceEEEEehhccEEEEEecceEEEEeccccc Confidence 87611 1111 122222211100 1244432 23355432 245566654446778999998877643 Q ss_pred --hhhccceehhhhhhhhhhccceEEEEecC Q lcl|NC_011269. 305 --ERFNKGWVMDELVGMAILNPRGIVILRKA 333 (333) Q Consensus 305 --er~~kGWvm~E~~g~~i~N~~siv~~~~~ 333 (333) ++....+..++-++..+.||.+++++.-. 
T Consensus 361 ~f~~~~~~~r~~~r~d~~~~~~~a~~~~~~~ 391 (408) T protein:vir:74 361 AFETDTTKIRVIDRFDVKATDSEALVAGSFT 391 (408) T ss_pred hhhcceeeEEEEEeeCcEEecccceEEEEee Confidence 23457788999999999999999999865 No 66 >protein:vir:2430 Length: 318 # NCBI annotation: major head subunit # Family: family:all:507 # MgeID: mge:52 # MgeName: D29 # Cross-refs: genbank:acc:NP_046832;genbank:gi:9630400;genbank:GeneID:1261582 Probab=98.28 E-value=3e-08 Score=61.81 Aligned_cols=284 Identities=11% Similarity=-0.011 Sum_probs=177.8 Q ss_pred HhhcchhcchHHHHHHHHHHhcCchhHHHHHHHHHHHHHHHHHhhhhhhhhhhhccccCCCcceeecCCCCccceEEEEc Q lcl|NC_011269. 30 QRMGGRKLSAREKQAKLAHILSDKVGGIQRLGQSMIGPIQLQLRYQGILRNVLLEDTLTPGVPIQYDVLDDLGQAYMLHG 109 (333) Q Consensus 30 ~~~~~~~ls~ee~~~Lm~~Al~~~Eg~~~aLg~~mA~pI~~q~~rqGi~RklL~~~TL~~G~~p~y~v~~~v~~a~~~~~ 109 (333) -| -|.++.+|++.. ...-.+..|. .+=..+.+.|-+.++.+.+++++....+++.+. -+||+......+ -|++ T Consensus 1 ~~-~~~~~~~e~~~~--~~~~~~~~~~--~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~-~~ip~~~~~~~a-~~v~ 73 (318) T protein:vir:24 1 MA-AGTAFAVDHAQI--AQTGDTMFKG--YLEPEQAKDYFAEAEKTSIVQQFAQKVPMGTTG-QKIPHWVGDVSA-QWIG 73 (318) T ss_pred CC-CCCCCCHHHHHh--hcccCcccce--eechhHHHHHHHHHHhhchhhhhcceeeccCCc-eEEEEEeCCcce-EEec Confidence 11 235566666543 2222222232 456667788888889999999999888877554 445553443333 4777 Q ss_pred CCCcccceeecCceeeccceeeeccccccHHHhhhhcchhHHHHHHHHHHHHHHHhhhHHHHHHhhhhhhhhhhcccccc Q lcl|NC_011269. 
110 NEGEIRITPFEGKRIEVQLFRIASFPQIKKEDLYYLRSNIVEYTQDMTKQAIMRQEDSRLVTLLEAAAVSYRVVDSSAQP 189 (333) Q Consensus 110 ~~G~i~~Q~i~~~ri~~P~f~Ivs~P~V~~~dl~~~~~~vle~~q~~A~qaIM~qED~~~~slle~~a~~~r~~~ssA~p 189 (333) .-+.++.+...=+.|++.--++.....+..+=|+++..++..+..++..++|.+.+|.-+++ +.- +..| T Consensus 74 Eg~~~~~~~~~f~~i~~~~~k~~~~~~iS~e~l~ds~~~~~~~i~~~l~~~~~~~~d~a~l~---G~g--------~~~~ 142 (318) T protein:vir:24 74 EGDMKPITKGNMTSQTIAPHKIATIFVASAETVRANPANYLGTMRTKVATAFAMAFDGAAMH---GTD--------SPFP 142 (318) T ss_pred CCccccccccceeEEEEeeEEEEEeehhhHHHhhcChHHHHHHHHHHHHHHHHHHHHHhhhc---ccC--------CCCC Confidence 77777777666677888888888999999998999999999999999999999999966653 111 0000 Q ss_pred -----cccCCCcceEEeeccccHHHHHHHHHHHHhhCCccceEEechhhhhhhhhcCCCchh-hhHHhhhhhccee--ee Q lcl|NC_011269. 190 -----GVGALPNEITIAGSHLMPDDLYTAVTYTDQRQLDSSRLLANPQEYRDLYRWDINTTG-WAFKDSVVAGERI--VQ 261 (333) Q Consensus 190 -----~vg~~~N~i~i~~g~Lt~~~L~~a~t~v~~~~L~at~il~~~~~~~Di~gw~~N~~~-~~~~DpV~~~e~i--l~ 261 (333) ++........-.......+.+..+...+..-......++||++.|..|..=- +..+ |.+.+.+..+... .. T Consensus 143 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~n~~~~~~L~~lk-d~~G~~l~~~~~~~~~~~~~~~ 221 (318) T protein:vir:24 143 TYIGQTTKAISIADTTGATTVYDQVAVNGLSLLVNDGKKWTHTLLDDITEPILNGAK-DQNGRPLFIESTYGEAASPFRS 221 (318) T ss_pred cccccccccccccccccccchHHHHHHHHHHhhccccCCCCEEEEcHHHHHHHHHhh-ccCCceeecCccccCccccccC Confidence 0111111111122233445677788888888999999999999999987511 1111 1111111111100 00 Q ss_pred eeccccc--ceeeecCCeE-EEeeChhhhcccccccCceeccccch----------------hhhccceehhhhhhhhhh Q lcl|NC_011269. 262 FGEFQIG--KSIIIPRGTV-YLTPEPEFLGVFPVMYSLDVEEDNKV----------------ERFNKGWVMDELVGMAIL 322 (333) Q Consensus 262 ~G~fgi~--~skvlprgei-yvvadpE~~G~~pvR~~L~s~p~D~~----------------er~~kGWvm~E~~g~~i~ 322 (333) .-.+|++ .+--+|-|+. .+..|... ..+=.|+++.++..+.. ++-...|.+.+-++..+. 
T Consensus 222 ~~i~g~pv~~~~~~~~~~~~~~~gdfs~-~~~~~~~~l~i~~~~~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~ 300 (318) T protein:vir:24 222 GRIVARPTILSDHVVEGTTVGFMGDFSQ-LIWGQIGGLSFDVTDQATLNLGTVESPNFVSLWQHNLVAVRVEAEYAFHCN 300 (318) T ss_pred ceEEEEeeEEeCCCCCCccEEEEeecce-EEEEEecCeEEEEeeccceeccccccccchhhhhcCcEEEEEEEEEccEEe Confidence 1123432 3445676664 35667764 35667888877655532 222455677788999999 Q ss_pred ccceEEEEecC Q lcl|NC_011269. 323 NPRGIVILRKA 333 (333) Q Consensus 323 N~~siv~~~~~ 333 (333) +|.+++.|.++ T Consensus 301 ~~~a~~~i~~~ 311 (318) T protein:vir:24 301 DAEAFVALTNV 311 (318) T ss_pred cccceEEEEee Confidence 99999999987 No 67 >protein:vir:5739 Length: 366 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:122 # MgeName: PY54 # Cross-refs: genbank:acc:NP_892050;genbank:gi:33770513;interpro:IPR006444;uniprot:Q7Y410;genbank:GeneID:1732928 Probab=98.28 E-value=7.2e-08 Score=59.75 Aligned_cols=300 Identities=14% Similarity=0.131 Sum_probs=172.1 Q ss_pred Ccccch-----------------hhhhhhhhhcccc----hHHHHHHHHHHhhcchhcchHHHHHHHHHHhc--CchhHH Q lcl|NC_011269. 1 MTLPVA-----------------VGSGLGRFAKASD----DYVADIVEAKQRMGGRKLSAREKQAKLAHILS--DKVGGI 57 (333) Q Consensus 1 ~~~~~~-----------------~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~ls~ee~~~Lm~~Al~--~~Eg~~ 57 (333) -.-||. =|.|.+|++++-. +...-.-.++++.+... ++.+++ +..|+. T Consensus 5 ~a~~~~~~~~~~~~~~~~~~~~~kg~~~~~~~~a~a~~~g~~~~a~~~a~~~~~~~~---------~~~a~~~~~~~Gg~ 75 (366) T protein:vir:57 5 VAVPVKAHSVAPGIIIKEELQQYKGAGMTRMVMSIAAGKGNLADAAKFAATELGDTG---------LSMAISTAAGSGGA 75 (366) T ss_pred ccccccccccccccccccccccccchhHHHHHHHHHhcccchhHHHHHHHHhhcchh---------hhhhccccccCCcc Confidence 112221 1233334433211 11110111111211111 122332 223542 Q ss_pred HHHHHHHHHHHHHHHhhhhhhhhh-hhccccCCCcceeecCCCCccceEEEEcCCCcccceeecCceeeccceeeecccc Q lcl|NC_011269. 
58 QRLGQSMIGPIQLQLRYQGILRNV-LLEDTLTPGVPIQYDVLDDLGQAYMLHGNEGEIRITPFEGKRIEVQLFRIASFPQ 136 (333) Q Consensus 58 ~aLg~~mA~pI~~q~~rqGi~Rkl-L~~~TL~~G~~p~y~v~~~v~~a~~~~~~~G~i~~Q~i~~~ri~~P~f~Ivs~P~ 136 (333) .+-+.+++.|-+.++-..+.|++ ....+...|. ..||+...-. .+-|.+..+.++.....=+.|+++..++...+. T Consensus 76 -lvP~~~~~~ii~~l~~~s~l~~lg~~~v~~~~g~-~~~p~~t~~~-~a~wv~E~~~~~~s~~~f~~i~~~~~k~~~~~~ 152 (366) T protein:vir:57 76 -LIPQNMQNEVIELLRDRTVVRILGARSIPLPNGN-LSMPRLSGGA-TAGYVGEGKDVVATGATFDDVKLSAKTMIALVP 152 (366) T ss_pred -ccchhHHHHHHHHHhhhcchhhhceeeeecCCCc-eEEEEEeCCc-ceeeeccCccccccccceeEEEEeeEEEEEeeh Confidence 45777888888888888999998 5666666674 4455543333 344677777788777666889999999999999 Q ss_pred ccHHHhhhhcchhHHHHHHHHHHHHHHHhhhHHHHHHhhhhhhhhhhcccccccccCC------CcceEEeeccccHHHH Q lcl|NC_011269. 137 IKKEDLYYLRSNIVEYTQDMTKQAIMRQEDSRLVTLLEAAAVSYRVVDSSAQPGVGAL------PNEITIAGSHLMPDDL 210 (333) Q Consensus 137 V~~~dl~~~~~~vle~~q~~A~qaIM~qED~~~~slle~~a~~~r~~~ssA~p~vg~~------~N~i~i~~g~Lt~~~L 210 (333) |..+=|++...++-.+..++..++|.+.||.-++ .+..++ -+| .|-+ ...+..++...+-.++ T Consensus 153 iS~ell~ds~~~~~~~i~~~l~~a~~~~~d~a~l---~G~G~~-------~~p-~Gi~~~~~~~~~~~~~~~t~~~~~~~ 221 (366) T protein:vir:57 153 VSNQLIGRAGFNVEQLLLGDILSAIATREDKAFL---RDDGTG-------DTP-KGMKAVATAANRLVAWTGTAINLTTI 221 (366) T ss_pred hhHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHhh---ccCCCC-------ccc-cceeeccccccceeeccccccchhhH Confidence 9999999999999999999999999999995433 332111 111 1111 1223334444554444 Q ss_pred HHHHHHHHhh------CCccceEEechhhhhhhhhcCCCchh-hhHHhhhhhcceeeeeeccccc--ceeeecC------ Q lcl|NC_011269. 211 YTAVTYTDQR------QLDSSRLLANPQEYRDLYRWDINTTG-WAFKDSVVAGERIVQFGEFQIG--KSIIIPR------ 275 (333) Q Consensus 211 ~~a~t~v~~~------~L~at~il~~~~~~~Di~gw~~N~~~-~~~~DpV~~~e~il~~G~fgi~--~skvlpr------ 275 (333) ......+... .......+||+..|..|+.=- +..| |.+. 
+..+ .-++|++ .+--||- T Consensus 222 ~~~~~~~~~~~~~~~~~~~~a~~vmn~~~~~~L~~lk-d~~G~~l~~-~~~~------g~l~G~Pvv~s~~ip~~~~~~~ 293 (366) T protein:vir:57 222 DEYLDSLILKHMDSNSNMIRCGWGLSNRTYMTLFGLR-DGNGNKVYP-EMSQ------GILKGYPIQRTSAIPANLGDDG 293 (366) T ss_pred HHHHHHHHHhhhccccccccCEEEecHHHHHHHHhhh-ccCCceecc-CCCC------CeecceeeEEccccccccccCC Confidence 4443333222 234566799999999887621 1111 1111 1111 1245644 3333443 Q ss_pred --CeEEEeeChhhhcccccccCceeccccchh-------------hhccceehhhhhhhhhhccceEEEEecC Q lcl|NC_011269. 276 --GTVYLTPEPEFLGVFPVMYSLDVEEDNKVE-------------RFNKGWVMDELVGMAILNPRGIVILRKA 333 (333) Q Consensus 276 --geiyvvadpE~~G~~pvR~~L~s~p~D~~e-------------r~~kGWvm~E~~g~~i~N~~siv~~~~~ 333 (333) +.|| ..|.... .+-.|+++.+...+..+ +-...+...+-+++++.+|.+++++..+ T Consensus 294 ~~~~i~-~gdfs~~-~i~~~~~i~i~~~~ea~~~~~~g~~~~~f~~~~~~iR~~~~~d~~v~~~~a~~~lt~~ 364 (366) T protein:vir:57 294 NESEIY-FCDFNDV-VIGEDGMMKVDFSTEATYKDADGQLVSAFARNQSLIRVVTEHDIGFRHPEGLVLGTGV 364 (366) T ss_pred CccEEE-EEecceE-EEEEecceEEEEeeccccccccccchhhhhcCceeEEeeeeeCcEeeccccEEEEecc Confidence 2344 3666533 35678888876554321 1235677888899999999999999999 No 68 >protein:vir:1268 Length: 397 # NCBI annotation: hypothetical protein # Family: family:all:21 # MgeID: mge:329 # MgeName: phi-105 # Cross-refs: genbank:acc:NP_690760;genbank:gi:22855000;genbank:GeneID:955203 Probab=98.27 E-value=9.2e-08 Score=59.16 Aligned_cols=298 Identities=11% Similarity=0.065 Sum_probs=182.9 Q ss_pred Ccccchhhhhhhh------hhcccchHHHHHHHHHHhhcchhcchHHHHHHH---HHHh---cCchhHHHHHHHHHHHHH Q lcl|NC_011269. 1 MTLPVAVGSGLGR------FAKASDDYVADIVEAKQRMGGRKLSAREKQAKL---AHIL---SDKVGGIQRLGQSMIGPI 68 (333) Q Consensus 1 ~~~~~~~~~~~~~------~~~~~~~~~~~~~~~~~~~~~~~ls~ee~~~Lm---~~Al---~~~Eg~~~aLg~~mA~pI 68 (333) -..+..--.+.+. ....+..|...+.+. +-|+.+..+++..+- ..++ .+..|+. 
.+-+-+.+.| T Consensus 68 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~---~~~~~~~~~~~~~~~~~~~~a~~~~~~~~gg~-lvP~~~~~~i 143 (397) T protein:vir:12 68 NFVPEQERNPEGQRSQGQGNEERQQQYSKAFLKG---LRGKRLTDEERDLLDSPEFRAMSGINDEDGGI-LIPEDIGRQI 143 (397) T ss_pred hhhhhhhhhhcccccccchhhHHHHHHHHHHHHH---HhccCCcHHHHHHHhhhhhhhccccccccCcc-cCchhHHHHH Confidence 1111111111111 112222343333322 224556666654331 1222 2234442 4457788889 Q ss_pred HHHHhhhhhhhhhhhccccCCC--cceeecCCCCccceEEEEcCCCccccee-ecCceeeccceeeeccccccHHHhhhh Q lcl|NC_011269. 69 QLQLRYQGILRNVLLEDTLTPG--VPIQYDVLDDLGQAYMLHGNEGEIRITP-FEGKRIEVQLFRIASFPQIKKEDLYYL 145 (333) Q Consensus 69 ~~q~~rqGi~RklL~~~TL~~G--~~p~y~v~~~v~~a~~~~~~~G~i~~Q~-i~~~ri~~P~f~Ivs~P~V~~~dl~~~ 145 (333) .+.++....++++....+++.+ ..+. ++-.+. ..+-|++..++++.+- ..=+.|++.-..+.....|..+-|+.. T Consensus 144 i~~~~~~~~l~~~~~~~~~~~~~~~~~~-~~~~~~-~~a~~v~Eg~~~~~~~~~~~~~v~~~~~k~~~~~~is~e~l~ds 221 (397) T protein:vir:12 144 HEFKRQFEPLEQYVTVEPVTTRSGTRLL-EKNADM-VPFSPVEELGNLPEIDQPRFTKVSYSIIDYGGIMTLSNSMLNDS 221 (397) T ss_pred HHhhhhhhhHHhhcceeeccCCceeEEE-EEecCC-cceeeecccccccccccccceeEEeeheeeEeeehhhHHHHhhc Confidence 9999999999999888888754 3333 222222 2345677777777543 233788899899999999999999999 Q ss_pred cchhHHHHHHHHHHHHHHHhhhHHHHHHhhhhhhhhhhcccccccccCCCcceEEeeccccHHHHHHHHH-HHHhhCCcc Q lcl|NC_011269. 146 RSNIVEYTQDMTKQAIMRQEDSRLVTLLEAAAVSYRVVDSSAQPGVGALPNEITIAGSHLMPDDLYTAVT-YTDQRQLDS 224 (333) Q Consensus 146 ~~~vle~~q~~A~qaIM~qED~~~~slle~~a~~~r~~~ssA~p~vg~~~N~i~i~~g~Lt~~~L~~a~t-~v~~~~L~a 224 (333) ..++..+...+..++|-+-+|..+++-. |.- +|. |-++.+++.+++. .++...... 
T Consensus 222 ~~~l~~~i~~~l~~~~~~~~d~~il~G~------------------g~~-~~~----g~~~~~~i~~~~~~~l~~~~~~~ 278 (397) T protein:vir:12 222 DQAIMTYVAKWFAKKSVVTRNNLILAAI------------------ASL-KKV----DIDGLDGIKKALNVTLDPMVAPG 278 (397) T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHhcc------------------ccc-ccc----ccccHHHHHHHHhhccchhhhCC Confidence 9999999999999999999996655433 111 232 3356677877763 566666677 Q ss_pred ceEEechhhhhhhhhcCCCchhhhHHhhhhhcceeeee---eccccccee---eecCCe----EEEeeChhhhccccccc Q lcl|NC_011269. 225 SRLLANPQEYRDLYRWDINTTGWAFKDSVVAGERIVQF---GEFQIGKSI---IIPRGT----VYLTPEPEFLGVFPVMY 294 (333) Q Consensus 225 t~il~~~~~~~Di~gw~~N~~~~~~~DpV~~~e~il~~---G~fgi~~sk---vlprge----iyvvadpE~~G~~pvR~ 294 (333) ..++||++.|.-|+.= -+..| .|+-+.++ .+. -++|.+-.. .+|-.. .+++.|....-.+.+|. T Consensus 279 a~~~~n~~~~~~L~~l-kd~~G----~~l~~~~~-~~g~~~~l~G~pv~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~ 352 (397) T protein:vir:12 279 SIVLTNQDGYDWLDTL-KDGTG----RYLLQPDP-TNPTKKLLDGRPVVPFTNRVLKTQKGKAPLIIGNLKEAIVLFDRE 352 (397) T ss_pred CEEEEcHHHHHHHHHh-hccCC----ceeecccc-cCCCCccccceeeEEecccccccCCCccEEEEEehhceEEEEeec Confidence 7899999999988761 01111 22222221 111 135555321 122211 26677877555678899 Q ss_pred Cceeccccch----hhhccceehhhhhhhhhhccceEEEEecC Q lcl|NC_011269. 295 SLDVEEDNKV----ERFNKGWVMDELVGMAILNPRGIVILRKA 333 (333) Q Consensus 295 ~L~s~p~D~~----er~~kGWvm~E~~g~~i~N~~siv~~~~~ 333 (333) ++.++-.+.. 
++-..++..++-++..+.||.+++++.-+ T Consensus 353 ~~~i~~~~~~~~~f~~~~~~~r~~~r~d~~~~~~~a~~~~~~t 395 (397) T protein:vir:12 353 QQSIASTDTGAGAFETNSTKVRGIEREDVRKWDEDAVVFGQIT 395 (397) T ss_pred ceEEEEeccccchhhcCceEEEEEEeeccEEecccceEEEEEe Confidence 9888655433 23356899999999999999999999877 No 69 >protein:vir:1383 Length: 421 # NCBI annotation: major capsid protein # Family: family:all:21 # MgeID: mge:314 # MgeName: phi3626 # Cross-refs: genbank:acc:NP_612835;genbank:gi:20065969;genbank:GeneID:935826 Probab=98.27 E-value=4.4e-08 Score=60.89 Aligned_cols=295 Identities=9% Similarity=-0.019 Sum_probs=179.6 Q ss_pred Ccccchhhhhhhhhhc------ccchHHHHHHHHHHhhcchhcchHHHHHHHHHHhcCchhHHHHHHHHHHHHHHHHHhh Q lcl|NC_011269. 1 MTLPVAVGSGLGRFAK------ASDDYVADIVEAKQRMGGRKLSAREKQAKLAHILSDKVGGIQRLGQSMIGPIQLQLRY 74 (333) Q Consensus 1 ~~~~~~~~~~~~~~~~------~~~~~~~~~~~~~~~~~~~~ls~ee~~~Lm~~Al~~~Eg~~~aLg~~mA~pI~~q~~r 74 (333) ---|. +.+..+... .+..+...+.. .+=|..++.+++ .++.+..|+. .+-+-+...|...++. T Consensus 72 ~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~r-----a~~t~~~gg~-liP~~~~~~Ii~~~~~ 140 (421) T protein:vir:13 72 RKNTN--FTGGRVIINGDSKEEKRSLQLSAMSK---TIRGIQLSEEER-----DIMSSTNNGA-VIPQEFVNEFEKLKEG 140 (421) T ss_pred Hhhhc--ccccccccccchhHHHHHHHHHHHHH---hhhccchhHHHh-----hccccCCcce-ecchhhHHHHHHHHHh Confidence 00000 000000000 00111111111 112344444444 3455666652 4557777888889999 Q ss_pred hhhhhhhhhccccCCCcceeecCCCCccce-EEEEcCCCcccceeecCceeeccceeeeccccccHHHhhhhcchhHHHH Q lcl|NC_011269. 75 QGILRNVLLEDTLTPGVPIQYDVLDDLGQA-YMLHGNEGEIRITPFEGKRIEVQLFRIASFPQIKKEDLYYLRSNIVEYT 153 (333) Q Consensus 75 qGi~RklL~~~TL~~G~~p~y~v~~~v~~a-~~~~~~~G~i~~Q~i~~~ri~~P~f~Ivs~P~V~~~dl~~~~~~vle~~ 153 (333) ...++++....++..|.. .|++++.-..+ +-|.+..++++.....=+.|++.-.++.....|..+-|.....|+..+. 
T Consensus 141 ~~~l~~l~~~~~~~~~~~-~~~~~~~~~~~~~~~~~E~~~~~~s~~~f~~i~~~~~k~~~~v~iS~ell~ds~~~l~~~i 219 (421) T protein:vir:13 141 YPSLKEHCHVIPVNRNAG-KMPVRAGASVDKLANLAKDTELVKAMLKTQPMAYDIDDYGLLAPIDNSLLEDSEINFLEFV 219 (421) T ss_pred hhhhhhhceeeeccCCce-EEEEeecCCccceeeccccccccccccceeEEEeeeeeeEeehhhhHHHHhhhHHHHHHHH Confidence 999999999888887643 55554443322 3356666667665555567888888899999999998998889999999 Q ss_pred HHHHHHHHHHHhhhHHHHHHhhhhhhhhhhcccccccccCCCcceEEeeccccHHHHHHHHHHHHhhCCccceEEechhh Q lcl|NC_011269. 154 QDMTKQAIMRQEDSRLVTLLEAAAVSYRVVDSSAQPGVGALPNEITIAGSHLMPDDLYTAVTYTDQRQLDSSRLLANPQE 233 (333) Q Consensus 154 q~~A~qaIM~qED~~~~slle~~a~~~r~~~ssA~p~vg~~~N~i~i~~g~Lt~~~L~~a~t~v~~~~L~at~il~~~~~ 233 (333) .++..+++..-+|..+++.+.+.. + ..+..+-+++..++.-+..-..+...++||+.. T Consensus 220 ~~~la~~~~~~~~~~i~~~~~g~~------------------~----~~~~~~~d~i~~~~~~l~~~~~~~a~~v~n~~~ 277 (421) T protein:vir:13 220 NEEFAEFAVNTENAEIVKQAKAVL------------------A----EETINDYAGLVKTINSLVPNARKRAIIVTNSDG 277 (421) T ss_pred HHHHHHHHHHHhhhhHhhhhhhcc------------------c----cccccchHHHHHHHHHhhhhhcCCCEEEEcHHH Confidence 999999999999877776554432 1 122346788999999999888889999999999 Q ss_pred hhhhhhcCCCchhhhHHhhhhhcceeeeeecccccc--eeeecCCe----EEEeeChhhhcccccccCceeccccchh-- Q lcl|NC_011269. 234 YRDLYRWDINTTGWAFKDSVVAGERIVQFGEFQIGK--SIIIPRGT----VYLTPEPEFLGVFPVMYSLDVEEDNKVE-- 305 (333) Q Consensus 234 ~~Di~gw~~N~~~~~~~DpV~~~e~il~~G~fgi~~--skvlprge----iyvvadpE~~G~~pvR~~L~s~p~D~~e-- 305 (333) |..|+.=--..-.|.+.+| ..+. ..=++|.+- +--+|.+. .+++.|......+-+|+++.++-.+... T Consensus 278 ~~~l~~lkd~~G~~i~~~~-~~~~---~~tl~G~pV~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~v~~~~~~~f~ 353 (421) T protein:vir:13 278 RAYLDGLMDKQGRPLLKEL-SDGG---DLVFKGRPVIELEESIFDVGDETKFIVSDFKTLIKFMDRKQYLIDQSKEAGYT 353 (421) T ss_pred HHHHHHhhcCCCceeecCc-CCCC---CceecceeeEEeccccccCCCceEEEEEeccccEEEEEecceEEEeecccccc Confidence 9999862111111333232 1111 011345442 12234332 4667887766677889999988877432 Q ss_pred hhccceehhhhhhhhhhccceEEEEecC Q lcl|NC_011269. 
306 RFNKGWVMDELVGMAILNPRGIVILRKA 333 (333) Q Consensus 306 r~~kGWvm~E~~g~~i~N~~siv~~~~~ 333 (333) +--.+....+-++..+.+|.++.-+... T Consensus 354 ~~~~~~r~~~r~d~~~~~~~a~~~~~~~ 381 (421) T protein:vir:13 354 KNETIARIIERFDVNSPLDKSSDAEKIR 381 (421) T ss_pred cCeeEEEEEeeecceeecchhhheeeec Confidence 2234566677777777777775444333 No 70 >protein:vir:100884 Length: 389 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:1473 # MgeName: Lc-Nu # Cross-refs: genbank:acc:YP_358764;genbank:gi:78000028;genbank:GeneID:3726155 Probab=98.22 E-value=7.3e-08 Score=59.72 Aligned_cols=296 Identities=9% Similarity=0.055 Sum_probs=173.8 Q ss_pred Ccccchhh--hhhhhhhcccchHHHHHHHHHHhhcchhcchHHHHHHHHHHhcCchhHHHHHHHHHHHHHHHHHhhhhhh Q lcl|NC_011269. 1 MTLPVAVG--SGLGRFAKASDDYVADIVEAKQRMGGRKLSAREKQAKLAHILSDKVGGIQRLGQSMIGPIQLQLRYQGIL 78 (333) Q Consensus 1 ~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~ls~ee~~~Lm~~Al~~~Eg~~~aLg~~mA~pI~~q~~rqGi~ 78 (333) ...|..-+ ...-...+..+.+ ...+....|-++ +..+.++ +..+..|+. .+-+-+...|...++..... T Consensus 69 ~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~lr~~~------~~~~~~~-~~t~~~gg~-~vP~~~~~~i~~~~~~~~~l 139 (389) T protein:vir:10 69 KTEPKDDGSKKGTDLSKKPIDAK-KKAINDFIHSHG------KVIDATS-KVTSTEAGV-LIPEEIIYDPTAEVNSVVDL 139 (389) T ss_pred hccccccccccccccchhHHHHH-HHHHHHHhhcch------hhhhhhc-ccccCCcce-eehHHHHHHHHHHHHhhhhH Confidence 11111000 0000000000000 011111111111 1111122 233455553 56677888899999999999 Q ss_pred hhhhhccccCCCcceeecCCCCccceEEEEcCCCcccce-eecCceeeccceeeeccccccHHHhhhhcchhHHHHHHHH Q lcl|NC_011269. 79 RNVLLEDTLTPGVPIQYDVLDDLGQAYMLHGNEGEIRIT-PFEGKRIEVQLFRIASFPQIKKEDLYYLRSNIVEYTQDMT 157 (333) Q Consensus 79 RklL~~~TL~~G~~p~y~v~~~v~~a~~~~~~~G~i~~Q-~i~~~ri~~P~f~Ivs~P~V~~~dl~~~~~~vle~~q~~A 157 (333) |++.+..+++.|. ..|++.++-+.++-|++-.|++..+ ...-+.|++....+...+.|..+-|..+..|+..+..+.. 
T Consensus 140 ~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~E~~~~~~~~~~~~~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l 218 (389) T protein:vir:10 140 STLVTKTPVTTPK-GTYPILKRATDRFSSVAELAENPKLAEPEFNKVDWSVATYRGAIPLSEEAIADSAVDLTALVGQSI 218 (389) T ss_pred HhhcceeeccCCe-eEEEEEecCCCccccccccccccccccccceeeeeeheeeEeeehhhHHHHhhhhHHHHHHHHHHH Confidence 9998888877654 3455544444444466766777653 3344788888899999999999999999999999999999 Q ss_pred HHHHHHHhhhHHHHHHhhhhhhhhhhcccccccccCCCcceEEeeccccHHHHHHHHHHHHhhCCccceEEechhhhhhh Q lcl|NC_011269. 158 KQAIMRQEDSRLVTLLEAAAVSYRVVDSSAQPGVGALPNEITIAGSHLMPDDLYTAVTYTDQRQLDSSRLLANPQEYRDL 237 (333) Q Consensus 158 ~qaIM~qED~~~~slle~~a~~~r~~~ssA~p~vg~~~N~i~i~~g~Lt~~~L~~a~t~v~~~~L~at~il~~~~~~~Di 237 (333) .+++-.-+|..+++.+-.+. | .. +.+..+-++|-.++....+-.. ...++||+.-|.-| T Consensus 219 a~~~~~~~~~~i~~g~~~~~-----------~--------~~-~~~~~~~d~l~~~~~~~~~~~~-~a~~~~n~~~~~~L 277 (389) T protein:vir:10 219 KEKSVNTYNAMIAPVLQSFT-----------A--------KK-TTTDTLVDSLKHILNVDLDPAY-SRALVVTQSLFNTL 277 (389) T ss_pred HHHHHHHHHHHHhhhhcccc-----------c--------cc-ccccccHHHHHHHHHhhhhhhh-CcEEEecHHHHHHH Confidence 99999999988776653221 1 11 1233455666665543322222 25689999999999 Q ss_pred hhcCCCchhhhHHhhhhhcceeeeee------cccccc----eeeecC--C-eEEEeeChhhhcccccccCceeccccch Q lcl|NC_011269. 238 YRWDINTTGWAFKDSVVAGERIVQFG------EFQIGK----SIIIPR--G-TVYLTPEPEFLGVFPVMYSLDVEEDNKV 304 (333) Q Consensus 238 ~gw~~N~~~~~~~DpV~~~e~il~~G------~fgi~~----skvlpr--g-eiyvvadpE~~G~~pvR~~L~s~p~D~~ 304 (333) +..-- +.| .|+-+.+..-.++ ++|.+. +..+|- | ..+++.|....-.+-+|+++.+.-.|.. T Consensus 278 ~~lkd-~~G----~~i~~~~~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~ 352 (389) T protein:vir:10 278 DTLKD-KNG----RYLLHDASDSITDGTAKGTILGVPVYVVGDTLLGSLAGDQKAFVGDLKRGVLFTDRQQVTLAWEDSK 352 (389) T ss_pred HHhhc-cCC----CeeeecCcccccccccccccccceeEEecccccCCCCCceEEEEeeccccEEEEeecceEEEeeccc Confidence 87421 111 2332222111111 355542 223332 1 2467777765446778999888766542 Q ss_pred hhhccceehhhhhhhhhhccceEEEEecC Q lcl|NC_011269. 
305 ERFNKGWVMDELVGMAILNPRGIVILRKA 333 (333) Q Consensus 305 er~~kGWvm~E~~g~~i~N~~siv~~~~~ 333 (333) . |.++...++-++.++.||.+++.+.-. T Consensus 353 ~-~~~~~~~~~r~d~~~~~~~a~~~~~~~ 380 (389) T protein:vir:10 353 I-YGKYLGAAFRFGVQKADSKAGYFVTNT 380 (389) T ss_pred c-ccceEEEEEEeccEEecccceEEEEee Confidence 2 566788888999999999999999854 No 71 >protein:vir:78223 Length: 333 # NCBI annotation: Putative major head protein # Family: family:all:966 # MgeID: mge:1849 # MgeName: Bethlehem # Cross-refs: genbank:acc:YP_001491666;genbank:gi:157786490;genbank:GeneID:5625701 Probab=98.20 E-value=7.6e-08 Score=59.61 Aligned_cols=287 Identities=11% Similarity=0.037 Sum_probs=168.8 Q ss_pred hHHHHHHHHHHh-cCchhHHH-----HHHHHHHHHHHHHHhhhhhhhhhhhccccCCCcceeecCCCCccceEEEEcC-- Q lcl|NC_011269. 39 AREKQAKLAHIL-SDKVGGIQ-----RLGQSMIGPIQLQLRYQGILRNVLLEDTLTPGVPIQYDVLDDLGQAYMLHGN-- 110 (333) Q Consensus 39 ~ee~~~Lm~~Al-~~~Eg~~~-----aLg~~mA~pI~~q~~rqGi~RklL~~~TL~~G~~p~y~v~~~v~~a~~~~~~-- 110 (333) =....+|..... .+++|++- -+=..+.+.|.+.++.....+++.+..+++.|.. .+|+..... .+.|++. T Consensus 1 ~a~l~el~~~~~~~~~~g~~~~~~~~liP~~~~~~ii~~l~~~s~l~~~~~~~~~~~~~~-~~p~~~~~~-~a~~v~eg~ 78 (333) T protein:vir:78 1 MATLNELLPNSAGSNHQGRLAHVPSDLLPKEIVGPIFDKAQESSLVLRMGEQIPISYGET-IIPTTVKRP-EVGQVGVGT 78 (333) T ss_pred CchhHHhhhhcccccccCceecCCccccchhHHHHHHHHHHhhchhhhhcceeeccCCce-EEEEEeCCc-eeEeecCcc Confidence 112233332222 23333211 2557788999999999999999999988886644 455543332 3334431 Q ss_pred ------CCcccceeecCceeeccceeeeccccccHHHhhhhcchhHHHHHHHHHHHHHHHhhhHHHHHHhhhh-hhhhhh Q lcl|NC_011269. 111 ------EGEIRITPFEGKRIEVQLFRIASFPQIKKEDLYYLRSNIVEYTQDMTKQAIMRQEDSRLVTLLEAAA-VSYRVV 183 (333) Q Consensus 111 ------~G~i~~Q~i~~~ri~~P~f~Ivs~P~V~~~dl~~~~~~vle~~q~~A~qaIM~qED~~~~slle~~a-~~~r~~ 183 (333) .+.++.+...=+.|++...++...+.|..+=|++...++..+..++-.++|.+-+|.-+++=-.+.. ....-. 
T Consensus 79 ~~~~~e~~~~~~~~~~f~~i~l~~~kl~~~~~is~ell~~s~~~~~~~i~~~la~ai~~~~d~~~l~G~g~~~~~~~~g~ 158 (333) T protein:vir:78 79 SNEQREGGLKPLSGTAWDTRSVSPIKLATIVTVSEEFARMNPSGLYTKLQGDLAYAIGRGIDLAVFHGKSPLTGSALQGI 158 (333) T ss_pred cccccccccccccccceeEEEEeeEEEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHHHHHHhcccCCCCCcccccc Confidence 1223333333356777778899999999999999999999999999999999999966653221100 000000 Q ss_pred cccccccccCCCcce-EEeeccccHHHHHHHHHHHH-hhCCccceEEechhhhhhhhhcCC--Cchh-hhHHhhhhhcce Q lcl|NC_011269. 184 DSSAQPGVGALPNEI-TIAGSHLMPDDLYTAVTYTD-QRQLDSSRLLANPQEYRDLYRWDI--NTTG-WAFKDSVVAGER 258 (333) Q Consensus 184 ~ssA~p~vg~~~N~i-~i~~g~Lt~~~L~~a~t~v~-~~~L~at~il~~~~~~~Di~gw~~--N~~~-~~~~DpV~~~e~ 258 (333) ....++....+.. .-.++..+-++|..++..+. +....++.++||++.|..++.... |..+ |.+...+..+. T Consensus 159 --~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~vmn~~~~~~L~~~~~~~d~~G~~i~~~~~~~~~- 235 (333) T protein:vir:78 159 --DTDNVIANTTNVDYLQETGDPLLDRLLDGYDLVSANTDVEFNGWAVDPRFRAHLLRAQAYRDANGNVDPSRINLAAQ- 235 (333) T ss_pred --cccccccccccccccccccchhHHHHHHHHHhhccccccCceEEEEcchHHHHHHHHhhhcCCCCceeecCccccCC- Confidence 0000000000111 12345566778888877764 456677789999999988765321 1111 11111111111 Q ss_pred eeeeeccccc--ceeeecCC--------eEEEeeChhhhcccccccCceeccccchhhh-------------ccceehhh Q lcl|NC_011269. 259 IVQFGEFQIG--KSIIIPRG--------TVYLTPEPEFLGVFPVMYSLDVEEDNKVERF-------------NKGWVMDE 315 (333) Q Consensus 259 il~~G~fgi~--~skvlprg--------eiyvvadpE~~G~~pvR~~L~s~p~D~~er~-------------~kGWvm~E 315 (333) ..-++|.+ .+--||-+ ..+++.|... -.+-+|+++..+-.++.... -..|...+ T Consensus 236 --~~~l~G~Pv~~~~~i~~~~~~~~~~~~~~~~gD~~~-~~~g~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~v~~r~~~ 312 (333) T protein:vir:78 236 --TGDVLGLPAQFGRAVGGDLGAAVDSKTRIIGGDFSQ-LKFGFADEIRIKMSDTATLTDSGSATVSMWQTNQIAILIEV 312 (333) T ss_pred --CceeeceeeEEccccCCCccccCCCccEEEEEeccc-EEEEEeeccEEEEeccccccccccceeehhhcCcEEEEEEE Confidence 01234544 22234543 2466677775 44678899888776654322 23456778 Q ss_pred hhhhhhhccceEEEEecC Q lcl|NC_011269. 
316 LVGMAILNPRGIVILRKA 333 (333) Q Consensus 316 ~~g~~i~N~~siv~~~~~ 333 (333) -++..+.+|.+++.|.++ T Consensus 313 r~d~~v~~~~a~~~l~~~ 330 (333) T protein:vir:78 313 TFGWLLGDKQAFVKFVDD 330 (333) T ss_pred EEccEEecccceEEEecc Confidence 899999999999999999 No 72 >protein:vir:9643 Length: 377 # NCBI annotation: major coat protein # Family: family:all:635 # MgeID: mge:173 # MgeName: 315.1 # Cross-refs: genbank:acc:NP_795405;genbank:gi:28876178;genbank:GeneID:1257724 Probab=98.19 E-value=9.8e-08 Score=59.01 Aligned_cols=316 Identities=16% Similarity=0.094 Sum_probs=184.2 Q ss_pred Ccccc---------------hhhhh------hhhhhcccchHHHHHHH--------H-HHhhcchhcchHHHHHHHHHHh Q lcl|NC_011269. 1 MTLPV---------------AVGSG------LGRFAKASDDYVADIVE--------A-KQRMGGRKLSAREKQAKLAHIL 50 (333) Q Consensus 1 ~~~~~---------------~~~~~------~~~~~~~~~~~~~~~~~--------~-~~~~~~~~ls~ee~~~Lm~~Al 50 (333) ||+.. .+-.+ --.|.+...+.-.+|.+ + .-+.+.+.|++|||+.+- +++ T Consensus 1 M~i~~~~~~~~~e~~~~l~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~lt~ee~~~~~-~~~ 79 (377) T protein:vir:96 1 MAINLKELPKYREAVAELSAKISAGATPEEQEKLFEAAFTTMGDEILAKNEEEMERMFDLRDKNRELTAEEIKFFN-DID 79 (377) T ss_pred CCccHHHHHHHHHHHHHHHHHHhhcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccCHHHHHHHH-HHH Confidence 22111 00000 00000000011111110 0 113467889999988653 233 Q ss_pred ---cCchhHHHHHHHHHHHHHHHHHhhhhhhhhhhhccccCCCcceeecCCCCccceEEEEcCCCcccce-eecCceeec Q lcl|NC_011269. 51 ---SDKVGGIQRLGQSMIGPIQLQLRYQGILRNVLLEDTLTPGVPIQYDVLDDLGQAYMLHGNEGEIRIT-PFEGKRIEV 126 (333) Q Consensus 51 ---~~~Eg~~~aLg~~mA~pI~~q~~rqGi~RklL~~~TL~~G~~p~y~v~~~v~~a~~~~~~~G~i~~Q-~i~~~ri~~ 126 (333) .+..|+ --+=+.+++.|.+.+..++..|++.+..+.. | .-.++...+. 
..+.|.+-.+++..+ .-.=..|++ T Consensus 80 ~~~~~~~gg-~lvP~~~~~~I~~~l~~~s~i~~~~~v~~~~-~-~~~i~~~~~~-~~a~wv~e~~~~~~~~~~~f~~i~l 155 (377) T protein:vir:96 80 KNVGGKDKF-KLLPEETMVQVFDDLVAEHPLLKVINFKNTS-L-RLKALTAETS-GTAVWGDIFGEIKGQLKQAFKEQDF 155 (377) T ss_pred hcCCCCCCc-eecCHHHHHHHHHHHHhhhhhhhhceeEecC-C-ceEEEEecCC-cceeEeecccccccccCccceeEee Confidence 355555 2677889999999999999999998776653 3 3345544443 455677766666644 333378999 Q ss_pred cceeeeccccccHHHhhhhcchhHHHHHHHHHHHHHHHhhhHHHH---------HHhhhhhhhhhhcccccccccCCCcc Q lcl|NC_011269. 127 QLFRIASFPQIKKEDLYYLRSNIVEYTQDMTKQAIMRQEDSRLVT---------LLEAAAVSYRVVDSSAQPGVGALPNE 197 (333) Q Consensus 127 P~f~Ivs~P~V~~~dl~~~~~~vle~~q~~A~qaIM~qED~~~~s---------lle~~a~~~r~~~ssA~p~vg~~~N~ 197 (333) +..++.++|.|..+=|..+.+|+-.+..+...++|.+.||.-+++ +|...+... +..++..+++.. .+ T Consensus 156 ~~~kl~~~~~is~~ll~ds~~~le~~i~~~l~~~~~~~~~~a~i~G~G~~~P~Gil~~~~~~~--~~~~~~~~~~~~-~~ 232 (377) T protein:vir:96 156 SQFKLTAFVVIPKDALKFGPKWLKQFITEQLKEAIAVALELAIVKGNGLLQPVGLLKDLSQPT--VDQSTGRDITTY-KT 232 (377) T ss_pred eeeeEEeechhhHHHhhcchhhHHHHHHHHHHHHHHHHHhhceEeccCCCcceeeeecccccc--ccccccccccce-ee Confidence 999999999999999999999999999999999999999977665 443222111 000000000000 11 Q ss_pred eEEeecc---ccHHHHHHHHHHH----Hhh-------CCccceEEechhhhhhhhhcCCCchhhhHHhhhhhcceeeeee Q lcl|NC_011269. 198 ITIAGSH---LMPDDLYTAVTYT----DQR-------QLDSSRLLANPQEYRDLYRWDINTTGWAFKDSVVAGERIVQFG 263 (333) Q Consensus 198 i~i~~g~---Lt~~~L~~a~t~v----~~~-------~L~at~il~~~~~~~Di~gw~~N~~~~~~~DpV~~~e~il~~G 263 (333) .....|. .+++.+...+..+ ..- -+..-..+||+.-|.|+.|.. 
..+++--+...++.+| T Consensus 233 ~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~a~~~mn~~t~~~~~~~~------~~~~~~G~~~~~l~~p 306 (377) T protein:vir:96 233 DKEAIADLSDLDPDTAVELLVPVMKHLSVNDKKHPLKIAGQVKLLLNPEDRWTLEAKF------TSRNQFGEYVTVLPHG 306 (377) T ss_pred ccccccccccCChhHHHHHHHHHHHhhccccccccccccCceEEEEchhhHHhccccc------cccCCCCCceeccCCC Confidence 1112222 3344444433332 111 122234889999998987621 1122111111122111 Q ss_pred cccccceeeecCCeEEEeeChhhhcccccccCceeccccch--hhhccceehhhhhhhhhhccceEEEEecC Q lcl|NC_011269. 264 EFQIGKSIIIPRGTVYLTPEPEFLGVFPVMYSLDVEEDNKV--ERFNKGWVMDELVGMAILNPRGIVILRKA 333 (333) Q Consensus 264 ~fgi~~skvlprgeiyvvadpE~~G~~pvR~~L~s~p~D~~--er~~kGWvm~E~~g~~i~N~~siv~~~~~ 333 (333) . .+..+--+|-|.|+ ..|... =.+-+|+|+.+...|.. .+-..|+...+=++-.+.+|.++|+|-=+ T Consensus 307 ~-~v~~s~~~p~~~i~-fgdf~~-Y~i~~r~~~~i~~~~~~~~~~d~~~f~~~~r~dG~~~d~~a~~vl~l~ 375 (377) T protein:vir:96 307 I-TILESLAVETGKAI-AFVANR-YDAFMATASTIEEYDQTFAMEDLQLYLTKNYFYGKAKDNHTAALLTLA 375 (377) T ss_pred c-eEEecCCCCcccEE-EEEcCc-EEEEEecccEEEeehhhhhhcCCeEEEEEEEEcCEEecCCcEEEEEEe Confidence 1 12345557888864 577765 46778999999888833 23478899999999999999999999888 No 73 >protein:vir:94673 Length: 419 # NCBI annotation: major capsid protein # Family: family:all:585 # MgeID: mge:1527 # MgeName: mu1/6 # Cross-refs: genbank:acc:YP_579208;genbank:gi:93007444;genbank:GeneID:5076792 Probab=98.19 E-value=1.1e-07 Score=58.77 Aligned_cols=311 Identities=14% Similarity=0.085 Sum_probs=172.8 Q ss_pred Cc--ccch--hhhhhhhhhcccchHHHHHHHHHHhhcchhcchHHHHHHHHHHhc---C-----chhHHHHHHHHHHHHH Q lcl|NC_011269. 1 MT--LPVA--VGSGLGRFAKASDDYVADIVEAKQRMGGRKLSAREKQAKLAHILS---D-----KVGGIQRLGQSMIGPI 68 (333) Q Consensus 1 ~~--~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ls~ee~~~Lm~~Al~---~-----~Eg~~~aLg~~mA~pI 68 (333) .+ .+-. ....++.+...++.+-+. ..+.+.|..+ . +...+-..+.. . 
..+.--...+.+..-| T Consensus 70 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~--~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~i 144 (419) T protein:vir:94 70 GTPLTPAEAGTFRSLAQRFADSDGLREY--RARDKRGQFQ--V-EMRDIDPNRLLSRDAPAGTITNPNVPHLPQLVPGIV 144 (419) T ss_pred hccccccccccccchhhhhhhHHHHHHH--HHhhhhhhhh--H-HHHHHHHHHhhccccccccccCCcccccchhhhHHH Confidence 10 0000 011112221111111110 0111111111 1 11111111111 1 1111012344555556 Q ss_pred HHHHhhhhhhhhhhhccccCCCcceeecCC--------CCccceEEEEcCCCcccceeecCceeeccceeeeccccccHH Q lcl|NC_011269. 69 QLQLRYQGILRNVLLEDTLTPGVPIQYDVL--------DDLGQAYMLHGNEGEIRITPFEGKRIEVQLFRIASFPQIKKE 140 (333) Q Consensus 69 ~~q~~rqGi~RklL~~~TL~~G~~p~y~v~--------~~v~~a~~~~~~~G~i~~Q~i~~~ri~~P~f~Ivs~P~V~~~ 140 (333) ....+.....|+++..-+...|.. .|++- .... .+-|++..+.+......=+.|++...++.....|..+ T Consensus 145 ~~~~~~~~~i~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~-~a~~v~Eg~~~~~~~~~~~~i~~~~~k~~~~~~is~e 222 (419) T protein:vir:94 145 PTTPDLPLLVADLLDQQNADYNVL-EYIRDTSGTAGAGSTWN-KAAVVPEGTAKPQSTLSFDTITTTLKTVAHWLPITRQ 222 (419) T ss_pred HHHHhhhhhhhhcceeeeccCCce-eeeeeccccccccccCc-ccceecCCccccccccceeeEEeeeeeEEEeehhhHH Confidence 666677778899888777665543 33321 1111 2346666666666666667899999999999999988 Q ss_pred HhhhhcchhHHHHHHHHHHHHHHHhhhHHHHHHhhhhhhhhhhccccccc---------ccCCCcceEEeeccccHHHHH Q lcl|NC_011269. 141 DLYYLRSNIVEYTQDMTKQAIMRQEDSRLVTLLEAAAVSYRVVDSSAQPG---------VGALPNEITIAGSHLMPDDLY 211 (333) Q Consensus 141 dl~~~~~~vle~~q~~A~qaIM~qED~~~~slle~~a~~~r~~~ssA~p~---------vg~~~N~i~i~~g~Lt~~~L~ 211 (333) -|.. ..++..+...+..++|...+|.-+++ +-. +.+|. +.....+..-..+...-++|. 
T Consensus 223 ll~d-~~~l~~~i~~~la~a~~~~~d~aii~---G~G--------~~~p~Gi~~~~~~~~~~~~~~~~~~t~~~~~~~l~ 290 (419) T protein:vir:94 223 AADD-NSQLMGYIQGRLTYGLRFLRDRQLLN---GNG--------STEMQGILTTPGIGTYQQPKPTAPATDEPPLVDIR 290 (419) T ss_pred HHHh-HHHHHHHHHHHHHHHHHHHHHHHHHh---ccC--------cccccceecccccccccccccccccccchhHHHHH Confidence 7765 46788889999999999999976653 111 11111 001111122223344567899 Q ss_pred HHHHHHHhhCCccceEEechhhhhhhhhcCCCchhhhHHhhhhhcceeeeeeccccc--ceeeecCCeEEEeeChhhhcc Q lcl|NC_011269. 212 TAVTYTDQRQLDSSRLLANPQEYRDLYRWDINTTGWAFKDSVVAGERIVQFGEFQIG--KSIIIPRGTVYLTPEPEFLGV 289 (333) Q Consensus 212 ~a~t~v~~~~L~at~il~~~~~~~Di~gw~~N~~~~~~~DpV~~~e~il~~G~fgi~--~skvlprgeiyvvadpE~~G~ 289 (333) +++..+..-+.....++||++-|..|..=--+..++...-|-...+ ...-++|.+ .+--+|-|++|+ .|....-. T Consensus 291 ~~~~~~~~~~~~~~~~v~n~~~~~~l~~~k~~~~~~~~~~~~~~~~--~~~~l~G~pV~~~~~~~~~~~~~-gd~~~~~~ 367 (419) T protein:vir:94 291 RAKTVAEIAGFPPDGVVVHPQDWESIELDQAPGSGVFRVIANVQGE--ATPRIWGLNVVSTVAIAQGTALV-GGFRQGAT 367 (419) T ss_pred HHHHhhhhccCCCCEEEEcHHHHHHHHHHhhcCCCceeecCCcccC--CCccccceeeEEcCCCCCccEEE-eeccceEE Confidence 9999999888888999999999999875111111111111100000 001124433 566689999765 66654555 Q ss_pred cccccCceeccccchh----hhccceehhhhhhhhhhccceEEEEecC Q lcl|NC_011269. 290 FPVMYSLDVEEDNKVE----RFNKGWVMDELVGMAILNPRGIVILRKA 333 (333) Q Consensus 290 ~pvR~~L~s~p~D~~e----r~~kGWvm~E~~g~~i~N~~siv~~~~~ 333 (333) +-+|+++.+...+... 
+-..+|.++.-++.++.+|.++|.+..+ T Consensus 368 ~~~~~~~~v~~~~~~~~~~~~~~~~~r~~~r~d~~v~~~~a~~~~~~~ 415 (419) T protein:vir:94 368 LWSRQGITVLMTDSHADFFTANTLVILAEFRANLAVYQPKAFVRVTFA 415 (419) T ss_pred EEEecceEEEEeccccchhhcCcEEEEEEEeeccEEeccccEEEEEec Confidence 6778999888766543 3357899999999999999999999888 No 74 >protein:vir:8187 Length: 311 # NCBI annotation: gp7 # Family: family:all:966 # MgeID: mge:153 # MgeName: Che9d # Cross-refs: genbank:acc:NP_817980;genbank:gi:29566414;genbank:GeneID:2700968 Probab=98.18 E-value=4.5e-08 Score=60.88 Aligned_cols=272 Identities=12% Similarity=0.080 Sum_probs=160.5 Q ss_pred hcCc-hhHHHHHHHHHHHHHHHHHhhhhhhhhhhhccccCCCcceeecCCCCccceEEEEcCCCcccceeecCceeeccc Q lcl|NC_011269. 50 LSDK-VGGIQRLGQSMIGPIQLQLRYQGILRNVLLEDTLTPGVPIQYDVLDDLGQAYMLHGNEGEIRITPFEGKRIEVQL 128 (333) Q Consensus 50 l~~~-Eg~~~aLg~~mA~pI~~q~~rqGi~RklL~~~TL~~G~~p~y~v~~~v~~a~~~~~~~G~i~~Q~i~~~ri~~P~ 128 (333) |.+. .|++ .+-+.+++.|-+.++.+...|++....+++.|. ..||+......| -|++..++++.....=+.+++.- T Consensus 1 mat~~~gg~-lvP~~~~~~ii~~~~~~s~i~~~~~~i~~~~~~-~~~p~~~~~~~a-~wv~Eg~~~~~~~~~f~~v~l~~ 77 (311) T protein:vir:81 1 MVALATGTF-QLPKHLVPGVWQKAQGQSVLARLSMAEPQEFGE-QQYMTLTAPPRG-EVVGEGAQKSESTATFAPVTAIP 77 (311) T ss_pred CceecCCce-EcchhHHHHHHHHHHhcchhhhhcceeecCCCc-eEEEEEeCCcee-EEeecCcccccccceeeEEEEee Confidence 3332 3442 456778889999999999999999999988775 456664444444 46776666666655557788888 Q ss_pred eeeeccccccHHHhhhh---cchhHHHHHHHHHHHHHHHhhhHHHHHHhhhh-hhhhhhcccccccccCCCcceEEeecc Q lcl|NC_011269. 129 FRIASFPQIKKEDLYYL---RSNIVEYTQDMTKQAIMRQEDSRLVTLLEAAA-VSYRVVDSSAQPGVGALPNEITIAGSH 204 (333) Q Consensus 129 f~Ivs~P~V~~~dl~~~---~~~vle~~q~~A~qaIM~qED~~~~slle~~a-~~~r~~~ssA~p~vg~~~N~i~i~~g~ 204 (333) .++.....|..+-|.+. ..++......+..++|.+.+|.-.++=..+-. ......-..+-+++ |.++..+.. 
T Consensus 78 ~kl~~~~~iS~ell~~~~d~~~~l~~~i~~~la~ai~~~~d~a~l~G~~~~~~~~~~gi~~~~~~~~----~~~~~~~~~ 153 (311) T protein:vir:81 78 RKVQVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSPAKILDTT----NIVELTTGT 153 (311) T ss_pred EEEEEeehhhHHHhhcCcccHHHHHHHHHHHHHHHHHHHHHHhhhccccCCCCcccccccccccccc----eeeeecccc Confidence 88888888888867543 45789999999999999999966654321000 00000000000111 333333332 Q ss_pred cc--HHHHHHHHHHHHhhCCccceEEechhhhhhhhhcCCCchhhhHHhhhhhccee-ee-eeccccc--ceeeecCCeE Q lcl|NC_011269. 205 LM--PDDLYTAVTYTDQRQLDSSRLLANPQEYRDLYRWDINTTGWAFKDSVVAGERI-VQ-FGEFQIG--KSIIIPRGTV 278 (333) Q Consensus 205 Lt--~~~L~~a~t~v~~~~L~at~il~~~~~~~Di~gw~~N~~~~~~~DpV~~~e~i-l~-~G~fgi~--~skvlprgei 278 (333) .. ..++..+...+...+..+...+||++.|.-|+.=- +..+ .|+-+.... -+ .-++|.+ .+-.||-+.. T Consensus 154 ~~~~~~~i~~~~~~~~~~~~~~~~~vmn~~~~~~l~~lk-d~~G----~~l~~~~~~~~~~~tl~G~Pv~~~~~i~~~~~ 228 (311) T protein:vir:81 154 SATPDLAVEAAVGLVLGDNLSPDGVALDNTFSFMLATQR-DSQG----RKLYPELGFGTDVASFAGLNAAVSDTVRGGPE 228 (311) T ss_pred cchHHHHHHHHHHHhhhcCCCceEEEEcHHHHHHHHhhh-ccCC----CeeecCccccCCCceecceeEEeccccccccc Confidence 22 23566788888888899988999999999987610 1111 111110000 00 1123433 2333554443 Q ss_pred EEeeChh-----------hhcccc-----cccCceeccccc---------hhhhccceehhhhhhhhhhccceEEEEecC Q lcl|NC_011269. 279 YLTPEPE-----------FLGVFP-----VMYSLDVEEDNK---------VERFNKGWVMDELVGMAILNPRGIVILRKA 333 (333) Q Consensus 279 yvvadpE-----------~~G~~p-----vR~~L~s~p~D~---------~er~~kGWvm~E~~g~~i~N~~siv~~~~~ 333 (333) +..+.-. ..|-|+ .|+++..+-.+. 
.++-..+|...+-++..+.||.+++.|.+| T Consensus 229 ~~~~~~~~~~~~~~~~~~~~gDfs~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~r~~~r~d~~v~~~~a~~~l~~a 308 (311) T protein:vir:81 229 AVTASTGVYRTTNPNVKAIAGDFSAFRWGVQVSIPLELIEFGDPDGLGDLKRQNQIAIRAEVVYGIGIMSTDAFAVVRDA 308 (311) T ss_pred ccccccchhcccCCccEEEEEecccEEEEEeccceEEEeccCCCCcchhhhhcCcEEEEEEEEeccEeecccceEEEEee Confidence 3222111 123333 466654443222 122234555668999999999999999999 No 75 >protein:vir:3870 Length: 400 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:82 # MgeName: A2 # Cross-refs: genbank:acc:NP_680487;swissprot:trembl:q8ltc0;genbank:gi:22296527;interpro:IPR006444;uniprot:Q8LTC0;genbank:GeneID:951713 Probab=98.17 E-value=1.2e-07 Score=58.48 Aligned_cols=300 Identities=9% Similarity=0.006 Sum_probs=176.1 Q ss_pred CcccchhhhhhhhhhcccchHHHHHHHHH--Hh-hcchhcchHHHHHHHHHHhcCchhHHHHHHHHHHHHHHHHHhhhhh Q lcl|NC_011269. 1 MTLPVAVGSGLGRFAKASDDYVADIVEAK--QR-MGGRKLSAREKQAKLAHILSDKVGGIQRLGQSMIGPIQLQLRYQGI 77 (333) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~-~~~~~ls~ee~~~Lm~~Al~~~Eg~~~aLg~~mA~pI~~q~~rqGi 77 (333) -.-.......++++.+.........-..+ .+ ...+.-..++..+.+.....+..|+. ..-+-+...|...++.+.. T Consensus 85 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~-~vP~~~~~~ii~~~~~~~~ 163 (400) T protein:vir:38 85 HPEEHSYRDALNAYLHTRGRNTDGVNFEKTDVGTFAVLRAVPTDASDAVNAGVKAADAAS-TIPETISNTPQRELQTVVD 163 (400) T ss_pred chhhhhHHHHHHHHHhhHHHHHHHHHHHHHHHHHHhhhhhhhHHHHHHHhhcccccCCcc-cccHHHHHHHHHHHHhhhh Confidence 01111222223333222211111111111 11 11111222333444555555666653 5556788889999999999 Q ss_pred hhhhhhccccCCCcceeecCCCCccceEEEEcCCCcccc-eeecCceeeccceeeeccccccHHHhhhhcchhHHHHHHH Q lcl|NC_011269. 78 LRNVLLEDTLTPGVPIQYDVLDDLGQAYMLHGNEGEIRI-TPFEGKRIEVQLFRIASFPQIKKEDLYYLRSNIVEYTQDM 156 (333) Q Consensus 78 ~RklL~~~TL~~G~~p~y~v~~~v~~a~~~~~~~G~i~~-Q~i~~~ri~~P~f~Ivs~P~V~~~dl~~~~~~vle~~q~~ 156 (333) .+++....+++.|.. .||+++..+.++.|++..|++.. +...=+.|++.-..+...+-|..+=|.....|+..+..+. 
T Consensus 164 l~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~E~~~~~~~~~~~f~~i~~~~~k~~~~~~is~ell~ds~~~~~~~i~~~ 242 (400) T protein:vir:38 164 LKPFTNVFQASTQKG-TYPTVANATTKMVTVAELEKNPAMAKPEFKPVNWSVETYRQALPVSQESIDDSAIDLVGLIAQN 242 (400) T ss_pred hhhcceeEeccCcce-EEEEEecCCCccccccccccccccccccceeeEeehhheeeehhhHHHHHhhhHHHHHHHHHHH Confidence 999999888876642 45554444445567777787775 3444468888888999999999998999999999999999 Q ss_pred HHHHHHHHhhhHHHHHHhhhhhhhhhhcccccccccCCCcceEEeeccccHHHHHHHHHHHHhhCCccceEEechhhhhh Q lcl|NC_011269. 157 TKQAIMRQEDSRLVTLLEAAAVSYRVVDSSAQPGVGALPNEITIAGSHLMPDDLYTAVTYTDQRQLDSSRLLANPQEYRD 236 (333) Q Consensus 157 A~qaIM~qED~~~~slle~~a~~~r~~~ssA~p~vg~~~N~i~i~~g~Lt~~~L~~a~t~v~~~~L~at~il~~~~~~~D 236 (333) ..++|-.-+|.-+++...+.. | .+-.+-+++..+....-+-.. ...++||+..|.- T Consensus 243 l~~~~~~~~~~~i~~~~~~~~-----------~------------~~~~~~~~~~~~~~~~~~~~~-~a~~v~~~~~~~~ 298 (400) T protein:vir:38 243 GQQIKVNTTNGAVATLLKGFT-----------A------------KTISSVDDLKHINNVDLDPAY-SRVIIASQSFYNF 298 (400) T ss_pred HHHHHHHHHHHhhhhcccccc-----------c------------cccccHHHHHHHHHhhhhhhh-CcEEEEcHHHHHH Confidence 999999999977766553211 0 122344555555543333333 3578999999998 Q ss_pred hhhcCCC-chhhhHHhhhhhcceeeeee--cccccc--eeeecCC---e-EEEeeChhhhcccccccCceeccccchhhh Q lcl|NC_011269. 237 LYRWDIN-TTGWAFKDSVVAGERIVQFG--EFQIGK--SIIIPRG---T-VYLTPEPEFLGVFPVMYSLDVEEDNKVERF 307 (333) Q Consensus 237 i~gw~~N-~~~~~~~DpV~~~e~il~~G--~fgi~~--skvlprg---e-iyvvadpE~~G~~pvR~~L~s~p~D~~er~ 307 (333) |.. +- ..| .|+-+.++.-..+ +.|.+- +-.+|-+ . ++++.|.-..-.+-+|+++.+.-.|... + T Consensus 299 l~~--lkd~~G----~~i~~~~~~~~~~~~l~G~pv~~~~~~~~~~~g~~~~~~gd~s~~~~~~~~~~~~~~~~~~~~-~ 371 (400) T protein:vir:38 299 LDT--VKDGNG----RYLLQDSILTPSGKSVLGMPIAVVSDDTLGAAGEAHAFLGDIKRAILFANRADFMVRWVDDQI-Y 371 (400) T ss_pred HHH--hhccCC----CeeeecCcCCCCccccccceeEEecccccCCCCceEEEEEeccccEEEEeecceEEEEecccc-c Confidence 876 21 111 2333332111111 244432 2223432 2 3555565433344557787776655322 4 Q ss_pred ccceehhhhhhhhhhccceEEEEecC Q lcl|NC_011269. 
308 NKGWVMDELVGMAILNPRGIVILRKA 333 (333) Q Consensus 308 ~kGWvm~E~~g~~i~N~~siv~~~~~ 333 (333) .++-..++-++..+.||.+++.+.-+ T Consensus 372 ~~~~~~~~r~d~~~~~~~a~~~l~~~ 397 (400) T protein:vir:38 372 GQFLQAGMRFGVSVADEKAGYFLTYT 397 (400) T ss_pred ceeEEEEEEeccEEecccceEEEEee Confidence 66778888999999999999999876 No 76 >protein:vir:95963 Length: 395 # NCBI annotation: ORF009 # Family: family:all:635 # MgeID: mge:1594 # MgeName: 2638A # Cross-refs: genbank:acc:YP_239802;genbank:gi:66395459;genbank:GeneID:5132880 Probab=98.17 E-value=2.3e-07 Score=57.02 Aligned_cols=304 Identities=11% Similarity=0.067 Sum_probs=173.6 Q ss_pred Cc-ccchhhhhhhhhhcccchHHHHHHH--HHHhhcchhcchHHHHHH--HHHHhcCchhHHHHHHHHHHHHHHHHHhhh Q lcl|NC_011269. 1 MT-LPVAVGSGLGRFAKASDDYVADIVE--AKQRMGGRKLSAREKQAK--LAHILSDKVGGIQRLGQSMIGPIQLQLRYQ 75 (333) Q Consensus 1 ~~-~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~ls~ee~~~L--m~~Al~~~Eg~~~aLg~~mA~pI~~q~~rq 75 (333) +. .--++...+...++ +.+-..... .....|.+.|+++|++.. ++.. ....|+ -.+-+.+++.|.+.++.. T Consensus 38 ~~~~~~~~~~~~~~~~~--~e~~~~~~~~~~~~~r~~~~l~~ee~~~~~~~~~~-t~~~gG-~liP~~~~~~Ii~~l~~~ 113 (395) T protein:vir:95 38 FGAMFDALSNDLQEEIT--AEINNRVVDNGILAKRSQDPLTSEERKFFNDINYD-VGYTDE-KILPETVVERVFDDLQKD 113 (395) T ss_pred HHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHhhcCccccchHHHHHHHHHhhc-cCCCCc-eeccHHHHHHHHHHHHhh Confidence 00 00001111111110 011011100 011236778888887742 1111 123333 256678899999999999 Q ss_pred hhhhhhhhccccCCCcceeecCCCCccceEEEEcCCCcccce-eecCceeeccceeeeccccccHHHhhhhcchhHHHHH Q lcl|NC_011269. 76 GILRNVLLEDTLTPGVPIQYDVLDDLGQAYMLHGNEGEIRIT-PFEGKRIEVQLFRIASFPQIKKEDLYYLRSNIVEYTQ 154 (333) Q Consensus 76 Gi~RklL~~~TL~~G~~p~y~v~~~v~~a~~~~~~~G~i~~Q-~i~~~ri~~P~f~Ivs~P~V~~~dl~~~~~~vle~~q 154 (333) ...|++-+.-+. .|. -.+++.... 
..+.|..-.|++..+ ...=..|++...++..++.|.-+=|.....|+..+.- T Consensus 114 s~i~~~~~v~~~-~~~-~~i~~~~~~-~~a~w~~e~~~~~~~~~~~f~~i~l~~~kl~~~~~iS~ell~ds~~~ie~~i~ 190 (395) T protein:vir:95 114 HPLLSKINFQNA-GIK-TRVIKADPA-GQAVWGKVFGEIKGQLDAAFREENFTQYKLTCFVVLPDDLSTFGPAWIERFVR 190 (395) T ss_pred hhhhhhceeEec-CCc-eEEEEecCC-cceEEeecccccCccccccceeeeeceeeEEEeecccHHHHhcchhHHHHHHH Confidence 999999776655 343 345544443 445566655666543 2222689999999999999999999999999999999 Q ss_pred HHHHHHHHHHhhhHHHHHHhhhhhhhhhhcccccccccCCCcceE---------EeeccccHHHHHHHHHHHHh------ Q lcl|NC_011269. 155 DMTKQAIMRQEDSRLVTLLEAAAVSYRVVDSSAQPGVGALPNEIT---------IAGSHLMPDDLYTAVTYTDQ------ 219 (333) Q Consensus 155 ~~A~qaIM~qED~~~~slle~~a~~~r~~~ssA~p~vg~~~N~i~---------i~~g~Lt~~~L~~a~t~v~~------ 219 (333) +...++|.+.+|.-++ ++--+.- .+|. |.+ |.+. ...+.+|.++...+...+.+ T Consensus 191 ~~la~~ia~~~~~a~i---~G~G~~~------~qP~-Gil-~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~l~~~~~~~~ 259 (395) T protein:vir:95 191 TQIQEAISVALESAII---NGGGAAK------TQPV-GLM-KDVNTNSGAVTDKASSGTLTFADADTTILELNDVLKNLS 259 (395) T ss_pred HHHHHHHHHHHhhhee---eccCCCC------cCce-eee-ecccccccccccccccchhhhhhhHhhHHHHHHHHHhhc Confidence 9999999999994333 2211000 0111 111 1110 11222333333322222211 Q ss_pred --------hCCccceEEechhhhhhhhhcCCCchhhhHHhhhhhcceeeeeeccc--ccceeeecCCeEEEeeChhhhcc Q lcl|NC_011269. 220 --------RQLDSSRLLANPQEYRDLYRWDINTTGWAFKDSVVAGERIVQFGEFQ--IGKSIIIPRGTVYLTPEPEFLGV 289 (333) Q Consensus 220 --------~~L~at~il~~~~~~~Di~gw~~N~~~~~~~DpV~~~e~il~~G~fg--i~~skvlprgeiyvvadpE~~G~ 289 (333) +.+.--..+||+.-|.|+.| + |...+..-+...++- +| +..+--||.|+|+ ..|... -. 
T Consensus 260 ~~~~~~~~~~~~~~~~~mn~~t~~~~~g---~---~~~~~~~G~~~~~lg---~g~~v~~~~~~p~~~i~-fgdfs~-y~ 328 (395) T protein:vir:95 260 VDEKGKELKIDGKVALVVNPRDSWDVQA---R---YTYLTANGGFVTVLP---YNVTIITSEFVPEGKLV-AFVTDR-YN 328 (395) T ss_pred cccccchhhhcCceEEEEcchhhhhcCC---c---ceeccCCCcceeccC---CcceEEEcCCCCCCcEE-EEeccc-EE Confidence 12223356899999999887 3 122221111111221 23 2356778999965 477765 46 Q ss_pred cccccCceeccccch--hhhccceehhhhhhhhhhccceEEEEecC Q lcl|NC_011269. 290 FPVMYSLDVEEDNKV--ERFNKGWVMDELVGMAILNPRGIVILRKA 333 (333) Q Consensus 290 ~pvR~~L~s~p~D~~--er~~kGWvm~E~~g~~i~N~~siv~~~~~ 333 (333) +-+|+|+.+...+.+ .+-..++...+=++-.+.||.++++|.=. T Consensus 329 i~~r~~~~i~~~~~~~~~~d~~~f~~~~r~dg~~~~~~A~~~l~i~ 374 (395) T protein:vir:95 329 AVRGGGLTVKKFDQTLALEDAVLFTAKTFAYGQPDDNKASAVYDLK 374 (395) T ss_pred EEEecceEEEeccchhhhCCcEEEEEEEEECCEEeccccEEEEEee Confidence 678999999877743 33467899999999999999999997644 No 77 >protein:vir:102119 Length: 404 # NCBI annotation: phage major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1641 # MgeName: phiSM101 # Cross-refs: genbank:acc:YP_699941;genbank:gi:110804052;genbank:GeneID:4206662 Probab=98.16 E-value=3.7e-07 Score=55.86 Aligned_cols=308 Identities=12% Similarity=0.083 Sum_probs=174.0 Q ss_pred Ccccchhhhhhhh---hhcccchHHHHHHHHHHhhcchhcchHHHHHHHHHHhcCchhHHHHHHHHHHHHHHHHHhhhhh Q lcl|NC_011269. 1 MTLPVAVGSGLGR---FAKASDDYVADIVEAKQRMGGRKLSAREKQAKLAHILSDKVGGIQRLGQSMIGPIQLQLRYQGI 77 (333) Q Consensus 1 ~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~ls~ee~~~Lm~~Al~~~Eg~~~aLg~~mA~pI~~q~~rqGi 77 (333) ..-+..-+....+ ++.....+.......+.+.+.+. ..+|+.++ ... .+..|+. .+-..+.+.|...++.... 
T Consensus 64 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~e~~a~-~~~-~~~~gg~-~vP~~~~~~ii~~~~~~~~ 139 (404) T protein:vir:10 64 NVKSLNTGKEENVIYNGALFVRAIADNLLKQKNQRGLNL-SEKEINAI-SEN-IDEDGGY-AVPEDIQTKINTRLKDTTD 139 (404) T ss_pred hccccccccchhhHHHHHHHHHHHHHHHHHHHHhhhhcc-hhhHHhhh-ccc-cCCCCce-eechhHHHHHHHHHhhhhh Confidence 1112221111111 11111112222222222222222 22222221 111 1233332 4456778888888999999 Q ss_pred hhhhhhccccCCCcc-eeecCCCCccceEEEEcCCCccccee--ecCceeeccceeeeccccccHHHhhhhcchhHHHHH Q lcl|NC_011269. 78 LRNVLLEDTLTPGVP-IQYDVLDDLGQAYMLHGNEGEIRITP--FEGKRIEVQLFRIASFPQIKKEDLYYLRSNIVEYTQ 154 (333) Q Consensus 78 ~RklL~~~TL~~G~~-p~y~v~~~v~~a~~~~~~~G~i~~Q~--i~~~ri~~P~f~Ivs~P~V~~~dl~~~~~~vle~~q 154 (333) ++++....+++.+.- ..|++..+.. .+.|++..+.++.+. ..=+.|++.-..+..++.|..+=|.+...++..+.. T Consensus 140 l~~l~~~~~~~~~~g~~~~~~~~~~~-~~~~v~e~~~~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~ 218 (404) T protein:vir:10 140 LYNMVDYEPVFTRSGSRTYEKRSKQK-PMKPLSENQQIPTNGDNGKLERFNFKLKDLADFMSIPNDLLKFADKSLEDWII 218 (404) T ss_pred HhhhhceeeccCCccceEEEEecCCc-ceeeccccccccccccccceeeeEeeheeeEeeehhhHHHHhhcHHHHHHHHH Confidence 999999998886542 3354433433 344566666666542 112677788888999999999999999999999999 Q ss_pred HHHHHHHHHHhhhHHHHHHhhhhhhhhhhcccccccccCCC----cceEEeeccccHHHHHHHHHHHHhhCCc-cceEEe Q lcl|NC_011269. 155 DMTKQAIMRQEDSRLVTLLEAAAVSYRVVDSSAQPGVGALP----NEITIAGSHLMPDDLYTAVTYTDQRQLD-SSRLLA 229 (333) Q Consensus 155 ~~A~qaIM~qED~~~~slle~~a~~~r~~~ssA~p~vg~~~----N~i~i~~g~Lt~~~L~~a~t~v~~~~L~-at~il~ 229 (333) ++..++|-+.+|.-+++ +.. +..|-.|.+. +.++. ++..+-+++..++...-.-+.. 
...++| T Consensus 219 ~~la~~~~~~~~~~il~---G~g--------~~~~~~gi~~~~~~~~~~~-~~~~~~~~~~~~~~~~l~~~~~~~~~~v~ 286 (404) T protein:vir:10 219 NWFVDKVRITRNAEILY---GAG--------GDEHATGIMTANKFKKITL-PKSPALKDFKKCKNVELLNVFKATSSWIV 286 (404) T ss_pred HHHHHHHHHHHHHHHhh---cCC--------CCCcccceeeccccceeec-cccccHHHHHHHHHhhhhccccCCCEEEE Confidence 99999999999975542 221 1122222211 23333 3445677887777644333333 346899 Q ss_pred chhhhhhhhhcCCC-chhhhHHhhhhhcceeeeee----cccccc---eeeecCCe----EEEeeChhhhcccccccCce Q lcl|NC_011269. 230 NPQEYRDLYRWDIN-TTGWAFKDSVVAGERIVQFG----EFQIGK---SIIIPRGT----VYLTPEPEFLGVFPVMYSLD 297 (333) Q Consensus 230 ~~~~~~Di~gw~~N-~~~~~~~DpV~~~e~il~~G----~fgi~~---skvlprge----iyvvadpE~~G~~pvR~~L~ 297 (333) |++-|.-|+. +- +.+ .|+-+.++ +.| +.|.+. +-.+|.++ .+++.|.-.+-.+-.|+++. T Consensus 287 n~~~~~~L~~--lkd~~G----~~l~~~~~--~~~~~~~l~G~PV~~~~~~~~~~~~~~~~~~~gd~s~~~~~~~~~~~~ 358 (404) T protein:vir:10 287 NQDGFNYLDS--LEDKTG----RPYLQPDP--KDPTQYRFLGLPVIELPNDLLLSTESAIPVLLGDTKEAYKYVSDGAYE 358 (404) T ss_pred cHHHHHHHHH--hhccCC----ceeeccCc--CCCCCccccceeeEEecccccCCCCCccEEEEEeccccEEEEEecceE Confidence 9999998876 21 111 22221110 111 234331 12233322 24456665455667789999 Q ss_pred eccccchh----hhccceehhhhhhhhhhccceEEEEecC Q lcl|NC_011269. 298 VEEDNKVE----RFNKGWVMDELVGMAILNPRGIVILRKA 333 (333) Q Consensus 298 s~p~D~~e----r~~kGWvm~E~~g~~i~N~~siv~~~~~ 333 (333) ++..+... 
+-..++.++.-++..+.+|.+++.+..+ T Consensus 359 i~~~~~~~~~~~~~~~~~~~~~r~d~~v~~~~a~~~~~~~ 398 (404) T protein:vir:10 359 LATTNIGAGAFETNTTKARIIMRIDGNVKDSEALLIAEIP 398 (404) T ss_pred EEEeccccchhhcCceEEEEEEeeccEEecccceEEEEee Confidence 87665432 2356799999999999999999999988 No 78 >protein:vir:99920 Length: 311 # NCBI annotation: gp7 # Family: family:all:966 # MgeID: mge:1611 # MgeName: Halo # Cross-refs: genbank:acc:YP_655524;genbank:gi:109392294;genbank:GeneID:4157089 Probab=98.15 E-value=6.4e-08 Score=60.03 Aligned_cols=271 Identities=12% Similarity=0.029 Sum_probs=165.1 Q ss_pred HHHHhcCchhHHHHHHHHHHHHHHHHHhhhhhhhhhhhccccCCCcceeecCCCCccceEEEEcCCCcccceeecCceee Q lcl|NC_011269. 46 LAHILSDKVGGIQRLGQSMIGPIQLQLRYQGILRNVLLEDTLTPGVPIQYDVLDDLGQAYMLHGNEGEIRITPFEGKRIE 125 (333) Q Consensus 46 m~~Al~~~Eg~~~aLg~~mA~pI~~q~~rqGi~RklL~~~TL~~G~~p~y~v~~~v~~a~~~~~~~G~i~~Q~i~~~ri~ 125 (333) |+ .+. ..|. -.+-+-+++.|-+.++.+.+.|++..+.+++.|.. +||+...... +-|++..+++++....=+.++ T Consensus 1 Ma-t~t-t~~g-~~vP~~~~~~ii~~~~~~s~l~~~~~~i~~~~~~~-~~p~~~~~~~-a~wv~Eg~~~~~~~~~f~~v~ 75 (311) T protein:vir:99 1 MA-TFG-TGNL-KNLPRNIADGMVKDVVQGSTVAVLSARKPQRFGNE-DIITFNGRPK-AEFVGEGQQKSSTTGEFDFVT 75 (311) T ss_pred Cc-eec-CCCc-eeccHHHHHHHHHHHHhhchhhhhcceeeccCCce-EEEEEeCCce-eEEeecCcccccccceeeEEE Confidence 33 222 3444 26788889999999999999999999998887765 7777655444 457787778887666667888 Q ss_pred ccceeeeccccccHHHhh---hhcchhHHHHHHHHHHHHHHHhhhHHHHHHhhhhhhhhhhcccccccc----cCCCcce Q lcl|NC_011269. 126 VQLFRIASFPQIKKEDLY---YLRSNIVEYTQDMTKQAIMRQEDSRLVTLLEAAAVSYRVVDSSAQPGV----GALPNEI 198 (333) Q Consensus 126 ~P~f~Ivs~P~V~~~dl~---~~~~~vle~~q~~A~qaIM~qED~~~~slle~~a~~~r~~~ssA~p~v----g~~~N~i 198 (333) +.-.++.....|..+-|+ +...++..+...+.+++|...+|.-+++--.+. +....++. 
+..-|.+ T Consensus 76 l~~~k~~~~~~iS~ell~~~~d~~~~l~~~i~~~la~ai~~~~d~~~l~G~g~~-------~g~~~~g~~~~~~~~~~~~ 148 (311) T protein:vir:99 76 STPKKAQVTMRFNEEVQWADEDYQLGVLQTLSEAGAEALARALDLGLYHRINPL-------TGTVIPGWSNYLGAASKRV 148 (311) T ss_pred EeeEEEEEeehhhHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHhhcccCcc-------cCcccccccccccccccee Confidence 888899999988888774 556889999999999999999996666432100 00111111 0112555 Q ss_pred EEeeccccH--HHHHHHHHHHHhhC--CccceEEechhhhhhhhhcCCCchhhhHHhhhhhcceee--eeeccccc--ce Q lcl|NC_011269. 199 TIAGSHLMP--DDLYTAVTYTDQRQ--LDSSRLLANPQEYRDLYRWDINTTGWAFKDSVVAGERIV--QFGEFQIG--KS 270 (333) Q Consensus 199 ~i~~g~Lt~--~~L~~a~t~v~~~~--L~at~il~~~~~~~Di~gw~~N~~~~~~~DpV~~~e~il--~~G~fgi~--~s 270 (333) +..++..+. .++..+...+...+ .+..-.+||+..|..|+.-- +..+ .|+-+....- ..-++|.+ .+ T Consensus 149 ~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~vmn~~~~~~L~~lk-d~~G----~~l~~~~~~~~~~~~l~G~Pv~~s 223 (311) T protein:vir:99 149 ELTADTIANPDLAIEAAVGLLVANGHPTPVNGLALHPSIAWGLSTAR-YTDG----RKKFPELGLGIGVSSFEGIDASVS 223 (311) T ss_pred eccccccchhHHHHHHHHHHHhhhccCCCccEEEEcHHHHHHHHhhh-ccCC----CeeecCcccCCCCceecceeeEee Confidence 555555442 45566666665543 44555999999999997621 1111 2222111000 01134544 22 Q ss_pred eeecCCeE---------------EEeeChhhhcccccccCceeccccc---------hhhhccceehhhhhhhhhhccce Q lcl|NC_011269. 271 IIIPRGTV---------------YLTPEPEFLGVFPVMYSLDVEEDNK---------VERFNKGWVMDELVGMAILNPRG 326 (333) Q Consensus 271 kvlprgei---------------yvvadpE~~G~~pvR~~L~s~p~D~---------~er~~kGWvm~E~~g~~i~N~~s 326 (333) -.+|-+.+ +++-|-...-.+-+|.++...-.++ .++--.++-.-+.+++++.||.. T Consensus 224 ~~i~~~~~~~~~~~~~~~~~~~~~~~Gdf~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~r~~~r~d~~v~~~~~ 303 (311) T protein:vir:99 224 DTVNGGDEADPDDEDLDAARAVRGIVGDFANGIHWGVQRDIPVELIKYGDPDGQGDLKRHNQIALRLEIVYGWYVFTDRF 303 (311) T ss_pred cccccccccccccchhhccCcceEEEeeccccEEEEEecCceEEEeecCCCCcchhhhhcCcEEEEEEEeecceecChhH Confidence 23333222 2223322222344566654433222 22234566667888999999988 Q ss_pred EEEEecC Q lcl|NC_011269. 
327 IVILRKA 333 (333) Q Consensus 327 iv~~~~~ 333 (333) |+++.++ T Consensus 304 v~~~~~~ 310 (311) T protein:vir:99 304 VVIENAV 310 (311) T ss_pred eeeeccc Confidence 8877777 No 79 >protein:vir:3845 Length: 395 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:322 # MgeName: phi adh # Cross-refs: genbank:acc:NP_050151;swissprot:trembl:q9t1f6;genbank:gi:9633043;uniprot:Q9T1F6;genbank:GeneID:1262163 Probab=98.11 E-value=1.2e-07 Score=58.61 Aligned_cols=299 Identities=10% Similarity=0.049 Sum_probs=173.8 Q ss_pred Cc--ccc----------------------------hhhhhh-----hhhhcccchHHHHHHHHHHhhcchhcchHHHHHH Q lcl|NC_011269. 1 MT--LPV----------------------------AVGSGL-----GRFAKASDDYVADIVEAKQRMGGRKLSAREKQAK 45 (333) Q Consensus 1 ~~--~~~----------------------------~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~ls~ee~~~L 45 (333) +. ... ...... ....+...+...+.-+. ...+-...++.. T Consensus 31 ~~~~~~~~~~ee~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~ 105 (395) T protein:vir:38 31 LGNDASSHSVDDINKLNASLKNAKMAQELAKSAYEDARANLNAEPVNKKPLPVKDGKPDAQAM-----KNQFVKDFKNLV 105 (395) T ss_pred HhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccccccchhhhhHHHHHH-----HHHHHHHHHHHH Confidence 00 000 000000 00000000000000000 000001111111 Q ss_pred HHHHhcCchhHHHHHHHHHHHHHHHHHhhhhhhhhhhhccccCCCcce-eecCCCCccceEEEEcCCCccccee-ecCce Q lcl|NC_011269. 46 LAHILSDKVGGIQRLGQSMIGPIQLQLRYQGILRNVLLEDTLTPGVPI-QYDVLDDLGQAYMLHGNEGEIRITP-FEGKR 123 (333) Q Consensus 46 m~~Al~~~Eg~~~aLg~~mA~pI~~q~~rqGi~RklL~~~TL~~G~~p-~y~v~~~v~~a~~~~~~~G~i~~Q~-i~~~r 123 (333) -.....+..|+. .+-+-+...|...++....+|++....+++.+... .|++..+....+-|++..+.+.+.. ..=+. 
T Consensus 106 ~~~~~~~~~gg~-~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~f~~ 184 (395) T protein:vir:38 106 TSGTTGTGNAGL-TIPEDIQLQIRTLTRSFTSLESLANVENVTTSHGSRVYEKLADITPLKDLDDESALIGDNDDPELTV 184 (395) T ss_pred hhccCccCCCce-ecchhHhhHHHHHHHhhcchhhhcceeeccCCcceEEEEeeccCCccccccccccccccccccceee Confidence 122233344542 45567788899999999999999888877765433 3444445555555777777776442 33378 Q ss_pred eeccceeeeccccccHHHhhhhcchhHHHHHHHHHHHHHHHhhhHHHHHHhhhhhhhhhhcccccccccCCCcceEEeec Q lcl|NC_011269. 124 IEVQLFRIASFPQIKKEDLYYLRSNIVEYTQDMTKQAIMRQEDSRLVTLLEAAAVSYRVVDSSAQPGVGALPNEITIAGS 203 (333) Q Consensus 124 i~~P~f~Ivs~P~V~~~dl~~~~~~vle~~q~~A~qaIM~qED~~~~slle~~a~~~r~~~ssA~p~vg~~~N~i~i~~g 203 (333) |++....+.....|..+=|+....|+..+..++..++|-+.+|..+++-. . +.. |. ++ T Consensus 185 v~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~la~~~~~~~~~~il~g~---g--------~~~-------~~----~~ 242 (395) T protein:vir:38 185 VKYLIHRYAGITTVTNTLLKDTVDNIIQWLVNWAAKKDVVTRNAKILEVM---G--------KAP-------KK----PT 242 (395) T ss_pred EEeeeeeeEeehhhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhcc---c--------ccc-------cc----cc Confidence 89999999999999999999888999999999999999999997666533 1 111 11 12 Q ss_pred cccHHHHHHHHH-HHHhhCCccceEEechhhhhhhhhcCCCchhhhHHhhhhhcceeeee---ecccccc----eeeecC Q lcl|NC_011269. 204 HLMPDDLYTAVT-YTDQRQLDSSRLLANPQEYRDLYRWDINTTGWAFKDSVVAGERIVQF---GEFQIGK----SIIIPR 275 (333) Q Consensus 204 ~Lt~~~L~~a~t-~v~~~~L~at~il~~~~~~~Di~gw~~N~~~~~~~DpV~~~e~il~~---G~fgi~~----skvlpr 275 (333) ..+-+++-.+.. .+...-.....++||++.|.-|+.= .+..| .|+-+.. +-+. -++|.+. 
+..+|- T Consensus 243 ~~~~~~i~~~~~~~l~~~~~~~a~~v~n~~~~~~L~~l-kd~~G----~~l~~~~-~~~~~~~~l~G~pV~~~~~~~~~~ 316 (395) T protein:vir:38 243 ISQFDNIKDLENNTLDPAIESTSSFITNQSGYNILSKV-KDADG----RYLMQPD-VTSPDKYLIDGKPVIRIADKWLPD 316 (395) T ss_pred cccHHHHHHHHHHhhhhhhcCCCEEEEcHHHHHHHHHh-hccCC----ceeeccC-cCCCCcceeccceeEEecccccCc Confidence 335567776664 4555555667799999999998761 11111 1221111 0011 1345442 223332 Q ss_pred ---CeEEEeeChhhhcccccccCceeccccchh----hhccceehhhhhhhhhhccceEEEEecC Q lcl|NC_011269. 276 ---GTVYLTPEPEFLGVFPVMYSLDVEEDNKVE----RFNKGWVMDELVGMAILNPRGIVILRKA 333 (333) Q Consensus 276 ---geiyvvadpE~~G~~pvR~~L~s~p~D~~e----r~~kGWvm~E~~g~~i~N~~siv~~~~~ 333 (333) ...+++.|.-..-.+.+|+|+.++-.+..+ +-..+|.+++.++..+.||.+++.+... T Consensus 317 ~~~~~~i~~gd~~~~~~i~~~~~~~i~~~~~~~~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~ 381 (395) T protein:vir:38 317 VSGSHPLYFGDLKQGITLFDRQQMQIDTTNVGAGSFEHDTTKLRFIDRFDVQLIDDGAFAAASFK 381 (395) T ss_pred CCCcceEEEEeccccEEEEEecceEEEEeccccchhhcCceEEEEEEeeccEEecccceEEEEee Confidence 123456676655667889998887776443 3357899999999999999999999876 No 80 >protein:vir:6242 Length: 390 # NCBI annotation: gp36 # Family: family:all:21 # MgeID: mge:131 # MgeName: phi-BT1 # Cross-refs: genbank:acc:NP_813696;swissprot:trembl:q859c1;genbank:gi:29366756;interpro:IPR006444;uniprot:Q859C1;genbank:GeneID:1258897 Probab=98.10 E-value=1.2e-07 Score=58.55 Aligned_cols=304 Identities=14% Similarity=0.118 Sum_probs=174.6 Q ss_pred CcccchhhhhhhhhhcccchHHHHHHHHHHhhcchh-cchHHHHHHHHHHhcCchhHHHHHHHHHHHH-HHHHHhhhhhh Q lcl|NC_011269. 1 MTLPVAVGSGLGRFAKASDDYVADIVEAKQRMGGRK-LSAREKQAKLAHILSDKVGGIQRLGQSMIGP-IQLQLRYQGIL 78 (333) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-ls~ee~~~Lm~~Al~~~Eg~~~aLg~~mA~p-I~~q~~rqGi~ 78 (333) -.+.-.-+.+.+.-...+.+.. +.-|-|.-. .-+.+...-.........|. .+-..+.+. 
|.+-++...++ T Consensus 69 ~~~~~~~~~~~~~~~~~~~~~~-----~~~r~~~~~~~r~~~~~~~~~~~t~~~~g~--~~~~~~~~~~i~~~~~~~~~l 141 (390) T protein:vir:62 69 SLLSGLQGSGSGAQRSADVDDD-----ATLRAGNLGEARSFEFAPEKRDGTKAGNPN--VLSRTLYGQLIAQAVERSAIM 141 (390) T ss_pred HHHhhcccccccchhhcchHHH-----HHHhhhhhhhhHHHHhhhhhhcccccCCCc--cccccchHHHHHHHHhhhhhh Confidence 0000001111111111111110 011111100 00001111111112222232 344444443 44556677788 Q ss_pred hhhhhccccCCCcceeecCCCCccceEEEEcCCCcccceeecCceeeccceeeeccccccHHHhhhhcchhHHHHHHHHH Q lcl|NC_011269. 79 RNVLLEDTLTPGVPIQYDVLDDLGQAYMLHGNEGEIRITPFEGKRIEVQLFRIASFPQIKKEDLYYLRSNIVEYTQDMTK 158 (333) Q Consensus 79 RklL~~~TL~~G~~p~y~v~~~v~~a~~~~~~~G~i~~Q~i~~~ri~~P~f~Ivs~P~V~~~dl~~~~~~vle~~q~~A~ 158 (333) |++-+..+...|....+|+..... .+.|++-.++++++...=+.+++.-.++...+-|.-+=|.....|+..+..+... T Consensus 142 ~~~~~~~~~~~~~~~~~p~~~~~~-~a~wv~E~~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~ 220 (390) T protein:vir:62 142 RGGATTFTTSDANPLDFTVITGRS-SASIVGETAEIPESYPATAQRSMGGFKYGFASVVSYEFATDQVLDLVGFLVSDAG 220 (390) T ss_pred hhcceeeecCCCceeEEEEEcCCc-ceeeecccccccccccceeeeEeeeeeEEeehHHHHHHHhhhhHHHHHHHHHHHH Confidence 888777666555555566544433 3457888888888888778899999999999999999999999999999999999 Q ss_pred HHHHHHhhhHHHH-------HHhhhhhhhhhhcccccccccCCCcce-EEeeccccHHHHHHHHHHHHhhCCccceEEec Q lcl|NC_011269. 159 QAIMRQEDSRLVT-------LLEAAAVSYRVVDSSAQPGVGALPNEI-TIAGSHLMPDDLYTAVTYTDQRQLDSSRLLAN 230 (333) Q Consensus 159 qaIM~qED~~~~s-------lle~~a~~~r~~~ssA~p~vg~~~N~i-~i~~g~Lt~~~L~~a~t~v~~~~L~at~il~~ 230 (333) ++|..-+|.-+++ ++..++ ++. 
+.+ +...+.++.++|-.++..+..--......+|| T Consensus 221 ~~i~~~~d~~~l~G~G~p~Gi~~~~~-----------~~~----~~~~~~~~~~~~~~~l~~~~~~l~~~~~~~a~~vmn 285 (390) T protein:vir:62 221 PAIGDAMGRHFITGTGQPRGILTDAS-----------PAT----ATFLATDTDSKVSDALIDLFHEVPSAYRANAKYVVN 285 (390) T ss_pred HHHHHHHHhhhhccCCcccccccccc-----------ccc----cceecccccccchHHHHHHHHhhhhhhhcCCEEEEc Confidence 9999999976554 332111 111 111 22345677888877776654433333467999 Q ss_pred hhhhhhhhhcCCCchh-hhHHhhhhhcceeeeeeccc--ccceeeecCCeEEEeeChhhhcccccccCceecccc--chh Q lcl|NC_011269. 231 PQEYRDLYRWDINTTG-WAFKDSVVAGERIVQFGEFQ--IGKSIIIPRGTVYLTPEPEFLGVFPVMYSLDVEEDN--KVE 305 (333) Q Consensus 231 ~~~~~Di~gw~~N~~~-~~~~DpV~~~e~il~~G~fg--i~~skvlprgeiyvvadpE~~G~~pvR~~L~s~p~D--~~e 305 (333) ++.|.-|..=- +..+ |.+.+++..+.- .=++| +..+-.+|-++|++ .|.. ...+-.|+++.+...+ +.+ T Consensus 286 ~~~~~~L~~lk-d~~g~~l~~~~~~~g~~---~~l~G~Pv~~~~~~p~~~i~~-gd~s-~~~i~~~~~~~v~~~~~~~~~ 359 (390) T protein:vir:62 286 DLRAAQMRKLK-DANGQYLWQSGLTVGAP---SLFNGKVVETDDGMPADKILF-ADLS-KYRVRFAGSLRVDRSVDAKFS 359 (390) T ss_pred hHHHHHHHHhh-ccCCCeeecCCcCCCcc---ceecccceEEecCCCCccEEE-eecc-ceeEEeecceEEEeecccccc Confidence 99998876510 1111 222222222210 00344 33556689998865 6765 3467788999887433 333 Q ss_pred hhccceehhhhhhhhhhccceEEEEecC Q lcl|NC_011269. 306 RFNKGWVMDELVGMAILNPRGIVILRKA 333 (333) Q Consensus 306 r~~kGWvm~E~~g~~i~N~~siv~~~~~ 333 (333) +-..++..++-++..+.||.+|++|.-. T Consensus 360 ~~~~~~~~~~r~d~~~~~~~A~~~l~~~ 387 (390) T protein:vir:62 360 TDQIVYRFLQRADGLLVDARGAKVLTVT 387 (390) T ss_pred CCcEEEEEEEEeCcEeechhheEEEEee Confidence 3467788889999999999999998865 No 81 >protein:vir:80684 Length: 315 # NCBI annotation: gp6 # Family: family:all:966 # MgeID: mge:1884 # MgeName: PA6 # Cross-refs: genbank:acc:YP_001285582;genbank:gi:148727088;genbank:GeneID:5247055 Probab=98.09 E-value=9.8e-08 Score=59.00 Aligned_cols=273 Identities=12% Similarity=0.052 Sum_probs=165.2 Q ss_pred HHHHhcCchhHHHHHHHHHHHHHHHHHhhhhhhhhhhhccccCCCcceeecCCCCccceEEEEcCCCcccceeecCceee Q lcl|NC_011269. 
46 LAHILSDKVGGIQRLGQSMIGPIQLQLRYQGILRNVLLEDTLTPGVPIQYDVLDDLGQAYMLHGNEGEIRITPFEGKRIE 125 (333) Q Consensus 46 m~~Al~~~Eg~~~aLg~~mA~pI~~q~~rqGi~RklL~~~TL~~G~~p~y~v~~~v~~a~~~~~~~G~i~~Q~i~~~ri~ 125 (333) |+..- ++.|+. .+=+.+++.|-+.++.+...|++.+..+.+.|.. .||+......| -|++..++++.....=+.++ T Consensus 1 Ma~~~-~~~gg~-~vP~~~~~~ii~~l~~~s~i~~l~~~i~~~~~~~-~ip~~~~~~~a-~wv~Eg~~~~~s~~~f~~v~ 76 (315) T protein:vir:80 1 MADDF-LSAGKL-ELPGSMIGAVRDRAIDSGVLAKLSPEQPTIFGPV-KGAVFSGVPRA-KIVGEGEVKPSASVDVSAFT 76 (315) T ss_pred CCCCc-CCcCce-EcchHHHHHHHHHHHhhchhhhhcceeecCCCce-EEEEEeCCcce-EEeeCCccccccccceeeeE Confidence 55443 334553 6778889999999999999999999888876653 45654444444 47888888888888778889 Q ss_pred ccceeeeccccccHHHhhhhcchh----HHHHHHHHHHHHHHHhhhHHHHHHhhhhh-hhhhhcccccccccCCCcceEE Q lcl|NC_011269. 126 VQLFRIASFPQIKKEDLYYLRSNI----VEYTQDMTKQAIMRQEDSRLVTLLEAAAV-SYRVVDSSAQPGVGALPNEITI 200 (333) Q Consensus 126 ~P~f~Ivs~P~V~~~dl~~~~~~v----le~~q~~A~qaIM~qED~~~~slle~~a~-~~r~~~ssA~p~vg~~~N~i~i 200 (333) +.-.++.....|..+=|++...+. -.+...+...+|.+.+|.-+++=..+..- .. +.....+...-|.++. T Consensus 77 l~~~kl~~~~~iS~ell~~s~~~~~~~l~~~i~~~la~ai~~~~d~a~~~G~~~~~~~~~----~~~~~~~~~~~~~~~~ 152 (315) T protein:vir:80 77 AQPIKVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATGKAA----SAVHTSLNKTKNIVDA 152 (315) T ss_pred eeeeeEEeeehhhHHHhhcCchhHHHHHHHHHHHHHHHHHHHHHhhheeeccCCCCCccc----cccccccccccceeec Confidence 889999999999988787777663 36678888999999999655532110000 00 0000001111122222 Q ss_pred eeccccHHHHHHHHHHHHhhCCcc-ceEEechhhhhhhhhcCCCchhhhHHhhh----hhcceeeeee-ccccc--ceee Q lcl|NC_011269. 201 AGSHLMPDDLYTAVTYTDQRQLDS-SRLLANPQEYRDLYRWDINTTGWAFKDSV----VAGERIVQFG-EFQIG--KSII 272 (333) Q Consensus 201 ~~g~Lt~~~L~~a~t~v~~~~L~a-t~il~~~~~~~Di~gw~~N~~~~~~~DpV----~~~e~il~~G-~fgi~--~skv 272 (333) +..+..+|-.+...+...+... ...+||+..+..|+..-...--+....++ ..++ .+ ++|.+ .+.. 
T Consensus 153 --~~~~~~d~~~~~~~~~~~~~~~~~~~imn~~~~~~L~~l~~~~g~~~~g~~~~~~~~~g~----~~tl~G~PV~~~~~ 226 (315) T protein:vir:80 153 --TDSATADLVKAVGLIAGAGLQVPNGVALDPAFSFALSTEVYPKGSPLAGQPMYPAAGFAG----LDNWRGLNVGASST 226 (315) T ss_pred --cccchHHHHHHHHHHhhccCccceEEEEcHHHHHHHHHHhhccCCcccccccccccccCC----CceecceeeEecCc Confidence 3334567778887777665544 45899999999998742111111111222 1111 11 34433 3334 Q ss_pred ecCCe--------EEEeeChhhhcccccccCceeccccchh----------hhccceehhhhhhhhhhccceEEEEecC Q lcl|NC_011269. 273 IPRGT--------VYLTPEPEFLGVFPVMYSLDVEEDNKVE----------RFNKGWVMDELVGMAILNPRGIVILRKA 333 (333) Q Consensus 273 lprge--------iyvvadpE~~G~~pvR~~L~s~p~D~~e----------r~~kGWvm~E~~g~~i~N~~siv~~~~~ 333 (333) ||-+. +.+..|=.. ..+-.|+++..+-.++.+ +-...|...+-+++.|.||.++|.|.++ T Consensus 227 ~~~~~~~~~~~~~~~~~GDfs~-~~~g~~~~~~i~i~~~~~~~~~~~~~~~~~~v~~r~~~r~~~~v~~~~a~~~l~~~ 304 (315) T protein:vir:80 227 VSGAPEMSPASGVKAIVGDFSR-VHWGFQRNFPIELIEYGDPDQTGRDLKGHNEVMVRAEAVLYVAIESLDSFAVVKEK 304 (315) T ss_pred CCcccccccccccEEEEeeccc-EEEEEecCeeEEEeccccccCcccchhhcCcEEEEEEEEecceeecccceEEEeec Confidence 55432 122222221 123356676665333322 2235666677899999999999999977 No 82 >protein:vir:107593 Length: 392 # NCBI annotation: major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1491 # MgeName: Gamma # Cross-refs: genbank:acc:YP_338188;genbank:gi:77020144;genbank:GeneID:3703724 Probab=98.08 E-value=4.1e-07 Score=55.58 Aligned_cols=299 Identities=12% Similarity=0.058 Sum_probs=180.6 Q ss_pred CcccchhhhhhhhhhcccchHHHHHHHHHHhhcchhcchHHHHHHHH----HHhc---CchhHHHHHHHHHHHHHHHHHh Q lcl|NC_011269. 1 MTLPVAVGSGLGRFAKASDDYVADIVEAKQRMGGRKLSAREKQAKLA----HILS---DKVGGIQRLGQSMIGPIQLQLR 73 (333) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ls~ee~~~Lm~----~Al~---~~Eg~~~aLg~~mA~pI~~q~~ 73 (333) ....-...++--+-.+++..|......+ +.+..++.+++..+-. .+++ +..|+. 
.+-..+...|...++ T Consensus 56 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~---l~~~~~~~~~~~~~~~~~~~~~~~~~t~~~gg~-~vP~~~~~~ii~~~~ 131 (392) T protein:vir:10 56 TEERNNGREVETRNVDGEMEYRDVFMKA---LRNKPLNAEEREFLEDDLEQRAMSGLTGEDGGL-VIPQDIQTQINELAR 131 (392) T ss_pred HHHhhccccccccCccchHHHHHHHHHH---HhcccccHHHHHHHhhhhhhhhccccccCCCce-ecchhHHHHHHHHHH Confidence 0000011111122233444554443332 3455666666544321 1221 234442 456678888888999 Q ss_pred hhhhhhhhhhccccCCCcc-eeecCCCCccceEEEEcCCCccccee-ecCceeeccceeeeccccccHHHhhhhcchhHH Q lcl|NC_011269. 74 YQGILRNVLLEDTLTPGVP-IQYDVLDDLGQAYMLHGNEGEIRITP-FEGKRIEVQLFRIASFPQIKKEDLYYLRSNIVE 151 (333) Q Consensus 74 rqGi~RklL~~~TL~~G~~-p~y~v~~~v~~a~~~~~~~G~i~~Q~-i~~~ri~~P~f~Ivs~P~V~~~dl~~~~~~vle 151 (333) ....++++....+++.+.. ..+++..+. ..+-|++-.+++.+.- ..-+.|++.-..+...+.|..+-|.+...|+.. T Consensus 132 ~~s~l~~~~~~~~~~~~~~~~~~~~~~~~-~~a~~v~E~~~~~~~~~~~~~~v~l~~~k~~~~~~iS~ell~ds~~~l~~ 210 (392) T protein:vir:10 132 SFDALEQYVTVEPVRTRSGSRVLEKNSDM-IPFAEITEMGEIPETDNPKFSNVQYAVKDRAGILPLSRSLLQDSDQNILK 210 (392) T ss_pred hhhhhhhhceeeeccCCceeEEEEeecCC-ccceeecccccccccccccceeEEeeeeeEEEeehhhHHHHhhhHHHHHH Confidence 9999999988888875542 123322222 2445777777776542 334788888888999999999989988899999 Q ss_pred HHHHHHHHHHHHHhhhHHHHHHhhhhhhhhhhcccccccccCCCcceEEeeccccHHHHHHHHH-HHHhhCCccceEEec Q lcl|NC_011269. 152 YTQDMTKQAIMRQEDSRLVTLLEAAAVSYRVVDSSAQPGVGALPNEITIAGSHLMPDDLYTAVT-YTDQRQLDSSRLLAN 230 (333) Q Consensus 152 ~~q~~A~qaIM~qED~~~~slle~~a~~~r~~~ssA~p~vg~~~N~i~i~~g~Lt~~~L~~a~t-~v~~~~L~at~il~~ 230 (333) +...+..++|..-+|..+++...+.. ..+..+-+++-.++. 
.+..........+|| T Consensus 211 ~i~~~l~~~i~~~~d~~~~~g~g~~~-----------------------~~~~~~~d~i~~~~~~~l~~~~~~~a~~vm~ 267 (392) T protein:vir:10 211 YVTKWLGKKSKVTRNVLILGVIEKLT-----------------------KQAIKSLDDIKDVLNVKLDPAISPNAILLTN 267 (392) T ss_pred HHHHHHHHHHHHHHHHHHhhcccccc-----------------------ccCccCHHHHHHHHHHhhhhhhccCCEEEEc Confidence 99999999999999977766542211 123356677777764 455555556779999 Q ss_pred hhhhhhhhhcCCC-chhhhHHhhhhhcceeeee--eccccccee----------eecCCeE-EEeeChhhhcccccccCc Q lcl|NC_011269. 231 PQEYRDLYRWDIN-TTGWAFKDSVVAGERIVQF--GEFQIGKSI----------IIPRGTV-YLTPEPEFLGVFPVMYSL 296 (333) Q Consensus 231 ~~~~~Di~gw~~N-~~~~~~~DpV~~~e~il~~--G~fgi~~sk----------vlprgei-yvvadpE~~G~~pvR~~L 296 (333) ++.|..|+. +- +.+ .|+-+.+.--.+ -++|.+.-. ...-|+. ++..|....-...+|+++ T Consensus 268 ~~~~~~L~~--lkd~~G----~~l~~~~~~~~~~~tllG~~~v~~~~~~~~~~~~~~~~~~~~~~gdfs~~~~i~~~~~~ 341 (392) T protein:vir:10 268 QDGFNYLDK--LKDKDG----KYILQSDPTQKNKKLFAGTNPVVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKREDM 341 (392) T ss_pred HHHHHHHHH--hhccCC----CeEeecCccCCccccccCcccEEEecccccCCCcccCCceEEEEEehhceEEEEeecce Confidence 999999976 21 111 222222210000 123432111 1111222 455666644455678898 Q ss_pred eeccccch----hhhccceehhhhhhhhhhccceEEEEecC Q lcl|NC_011269. 297 DVEEDNKV----ERFNKGWVMDELVGMAILNPRGIVILRKA 333 (333) Q Consensus 297 ~s~p~D~~----er~~kGWvm~E~~g~~i~N~~siv~~~~~ 333 (333) .++-.++. ++...++.++.-++.++.||-+++.+... 
T Consensus 342 ~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~~ 382 (392) T protein:vir:10 342 ELASTDVGGKAFTRNTLDLRAIQRDDVQMWDNEAAVYGEID 382 (392) T ss_pred EEEEeccccchhhcCceEEEEEEeeccEEecccceEEEEec Confidence 88766533 33456799999999999999999999876 No 83 >protein:vir:102082 Length: 392 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:1503 # MgeName: Fah # Cross-refs: genbank:acc:YP_512315;genbank:gi:89152484;genbank:GeneID:3953075 Probab=98.08 E-value=4.1e-07 Score=55.58 Aligned_cols=299 Identities=12% Similarity=0.058 Sum_probs=180.6 Q ss_pred CcccchhhhhhhhhhcccchHHHHHHHHHHhhcchhcchHHHHHHHH----HHhc---CchhHHHHHHHHHHHHHHHHHh Q lcl|NC_011269. 1 MTLPVAVGSGLGRFAKASDDYVADIVEAKQRMGGRKLSAREKQAKLA----HILS---DKVGGIQRLGQSMIGPIQLQLR 73 (333) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ls~ee~~~Lm~----~Al~---~~Eg~~~aLg~~mA~pI~~q~~ 73 (333) ....-...++--+-.+++..|......+ +.+..++.+++..+-. .+++ +..|+. .+-..+...|...++ T Consensus 56 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~---l~~~~~~~~~~~~~~~~~~~~~~~~~t~~~gg~-~vP~~~~~~ii~~~~ 131 (392) T protein:vir:10 56 TEERNNGREVETRNVDGEMEYRDVFMKA---LRNKPLNAEEREFLEDDLEQRAMSGLTGEDGGL-VIPQDIQTQINELAR 131 (392) T ss_pred HHHhhccccccccCccchHHHHHHHHHH---HhcccccHHHHHHHhhhhhhhhccccccCCCce-ecchhHHHHHHHHHH Confidence 0000011111122233444554443332 3455666666544321 1221 234442 456678888888999 Q ss_pred hhhhhhhhhhccccCCCcc-eeecCCCCccceEEEEcCCCccccee-ecCceeeccceeeeccccccHHHhhhhcchhHH Q lcl|NC_011269. 74 YQGILRNVLLEDTLTPGVP-IQYDVLDDLGQAYMLHGNEGEIRITP-FEGKRIEVQLFRIASFPQIKKEDLYYLRSNIVE 151 (333) Q Consensus 74 rqGi~RklL~~~TL~~G~~-p~y~v~~~v~~a~~~~~~~G~i~~Q~-i~~~ri~~P~f~Ivs~P~V~~~dl~~~~~~vle 151 (333) ....++++....+++.+.. ..+++..+. ..+-|++-.+++.+.- ..-+.|++.-..+...+.|..+-|.+...|+.. 
T Consensus 132 ~~s~l~~~~~~~~~~~~~~~~~~~~~~~~-~~a~~v~E~~~~~~~~~~~~~~v~l~~~k~~~~~~iS~ell~ds~~~l~~ 210 (392) T protein:vir:10 132 SFDALEQYVTVEPVRTRSGSRVLEKNSDM-IPFAEITEMGEIPETDNPKFSNVQYAVKDRAGILPLSRSLLQDSDQNILK 210 (392) T ss_pred hhhhhhhhceeeeccCCceeEEEEeecCC-ccceeecccccccccccccceeEEeeeeeEEEeehhhHHHHhhhHHHHHH Confidence 9999999988888875542 123322222 2445777777776542 334788888888999999999989988899999 Q ss_pred HHHHHHHHHHHHHhhhHHHHHHhhhhhhhhhhcccccccccCCCcceEEeeccccHHHHHHHHH-HHHhhCCccceEEec Q lcl|NC_011269. 152 YTQDMTKQAIMRQEDSRLVTLLEAAAVSYRVVDSSAQPGVGALPNEITIAGSHLMPDDLYTAVT-YTDQRQLDSSRLLAN 230 (333) Q Consensus 152 ~~q~~A~qaIM~qED~~~~slle~~a~~~r~~~ssA~p~vg~~~N~i~i~~g~Lt~~~L~~a~t-~v~~~~L~at~il~~ 230 (333) +...+..++|..-+|..+++...+.. ..+..+-+++-.++. .+..........+|| T Consensus 211 ~i~~~l~~~i~~~~d~~~~~g~g~~~-----------------------~~~~~~~d~i~~~~~~~l~~~~~~~a~~vm~ 267 (392) T protein:vir:10 211 YVTKWLGKKSKVTRNVLILGVIEKLT-----------------------KQAIKSLDDIKDVLNVKLDPAISPNAILLTN 267 (392) T ss_pred HHHHHHHHHHHHHHHHHHhhcccccc-----------------------ccCccCHHHHHHHHHHhhhhhhccCCEEEEc Confidence 99999999999999977766542211 123356677777764 455555556779999 Q ss_pred hhhhhhhhhcCCC-chhhhHHhhhhhcceeeee--eccccccee----------eecCCeE-EEeeChhhhcccccccCc Q lcl|NC_011269. 231 PQEYRDLYRWDIN-TTGWAFKDSVVAGERIVQF--GEFQIGKSI----------IIPRGTV-YLTPEPEFLGVFPVMYSL 296 (333) Q Consensus 231 ~~~~~Di~gw~~N-~~~~~~~DpV~~~e~il~~--G~fgi~~sk----------vlprgei-yvvadpE~~G~~pvR~~L 296 (333) ++.|..|+. +- +.+ .|+-+.+.--.+ -++|.+.-. ...-|+. ++..|....-...+|+++ T Consensus 268 ~~~~~~L~~--lkd~~G----~~l~~~~~~~~~~~tllG~~~v~~~~~~~~~~~~~~~~~~~~~~gdfs~~~~i~~~~~~ 341 (392) T protein:vir:10 268 QDGFNYLDK--LKDKDG----KYILQSDPTQKNKKLFAGTNPVVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKREDM 341 (392) T ss_pred HHHHHHHHH--hhccCC----CeEeecCccCCccccccCcccEEEecccccCCCcccCCceEEEEEehhceEEEEeecce Confidence 999999976 21 111 222222210000 123432111 1111222 455666644455678898 Q ss_pred eeccccch----hhhccceehhhhhhhhhhccceEEEEecC Q lcl|NC_011269. 
297 DVEEDNKV----ERFNKGWVMDELVGMAILNPRGIVILRKA 333 (333) Q Consensus 297 ~s~p~D~~----er~~kGWvm~E~~g~~i~N~~siv~~~~~ 333 (333) .++-.++. ++...++.++.-++.++.||-+++.+... T Consensus 342 ~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~~ 382 (392) T protein:vir:10 342 ELASTDVGGKAFTRNTLDLRAIQRDDVQMWDNEAAVYGEID 382 (392) T ss_pred EEEEeccccchhhcCceEEEEEEeeccEEecccceEEEEec Confidence 88766533 33456799999999999999999999876 No 84 >protein:vir:105004 Length: 392 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:1490 # MgeName: W Beta # Cross-refs: genbank:acc:YP_459969;genbank:gi:85701384;genbank:GeneID:3882145 Probab=98.08 E-value=4.1e-07 Score=55.58 Aligned_cols=299 Identities=12% Similarity=0.058 Sum_probs=180.6 Q ss_pred CcccchhhhhhhhhhcccchHHHHHHHHHHhhcchhcchHHHHHHHH----HHhc---CchhHHHHHHHHHHHHHHHHHh Q lcl|NC_011269. 1 MTLPVAVGSGLGRFAKASDDYVADIVEAKQRMGGRKLSAREKQAKLA----HILS---DKVGGIQRLGQSMIGPIQLQLR 73 (333) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ls~ee~~~Lm~----~Al~---~~Eg~~~aLg~~mA~pI~~q~~ 73 (333) ....-...++--+-.+++..|......+ +.+..++.+++..+-. .+++ +..|+. .+-..+...|...++ T Consensus 56 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~---l~~~~~~~~~~~~~~~~~~~~~~~~~t~~~gg~-~vP~~~~~~ii~~~~ 131 (392) T protein:vir:10 56 TEERNNGREVETRNVDGEMEYRDVFMKA---LRNKPLNAEEREFLEDDLEQRAMSGLTGEDGGL-VIPQDIQTQINELAR 131 (392) T ss_pred HHHhhccccccccCccchHHHHHHHHHH---HhcccccHHHHHHHhhhhhhhhccccccCCCce-ecchhHHHHHHHHHH Confidence 0000011111122233444554443332 3455666666544321 1221 234442 456678888888999 Q ss_pred hhhhhhhhhhccccCCCcc-eeecCCCCccceEEEEcCCCccccee-ecCceeeccceeeeccccccHHHhhhhcchhHH Q lcl|NC_011269. 74 YQGILRNVLLEDTLTPGVP-IQYDVLDDLGQAYMLHGNEGEIRITP-FEGKRIEVQLFRIASFPQIKKEDLYYLRSNIVE 151 (333) Q Consensus 74 rqGi~RklL~~~TL~~G~~-p~y~v~~~v~~a~~~~~~~G~i~~Q~-i~~~ri~~P~f~Ivs~P~V~~~dl~~~~~~vle 151 (333) ....++++....+++.+.. ..+++..+. ..+-|++-.+++.+.- ..-+.|++.-..+...+.|..+-|.+...|+.. 
T Consensus 132 ~~s~l~~~~~~~~~~~~~~~~~~~~~~~~-~~a~~v~E~~~~~~~~~~~~~~v~l~~~k~~~~~~iS~ell~ds~~~l~~ 210 (392) T protein:vir:10 132 SFDALEQYVTVEPVRTRSGSRVLEKNSDM-IPFAEITEMGEIPETDNPKFSNVQYAVKDRAGILPLSRSLLQDSDQNILK 210 (392) T ss_pred hhhhhhhhceeeeccCCceeEEEEeecCC-ccceeecccccccccccccceeEEeeeeeEEEeehhhHHHHhhhHHHHHH Confidence 9999999988888875542 123322222 2445777777776542 334788888888999999999989988899999 Q ss_pred HHHHHHHHHHHHHhhhHHHHHHhhhhhhhhhhcccccccccCCCcceEEeeccccHHHHHHHHH-HHHhhCCccceEEec Q lcl|NC_011269. 152 YTQDMTKQAIMRQEDSRLVTLLEAAAVSYRVVDSSAQPGVGALPNEITIAGSHLMPDDLYTAVT-YTDQRQLDSSRLLAN 230 (333) Q Consensus 152 ~~q~~A~qaIM~qED~~~~slle~~a~~~r~~~ssA~p~vg~~~N~i~i~~g~Lt~~~L~~a~t-~v~~~~L~at~il~~ 230 (333) +...+..++|..-+|..+++...+.. ..+..+-+++-.++. .+..........+|| T Consensus 211 ~i~~~l~~~i~~~~d~~~~~g~g~~~-----------------------~~~~~~~d~i~~~~~~~l~~~~~~~a~~vm~ 267 (392) T protein:vir:10 211 YVTKWLGKKSKVTRNVLILGVIEKLT-----------------------KQAIKSLDDIKDVLNVKLDPAISPNAILLTN 267 (392) T ss_pred HHHHHHHHHHHHHHHHHHhhcccccc-----------------------ccCccCHHHHHHHHHHhhhhhhccCCEEEEc Confidence 99999999999999977766542211 123356677777764 455555556779999 Q ss_pred hhhhhhhhhcCCC-chhhhHHhhhhhcceeeee--eccccccee----------eecCCeE-EEeeChhhhcccccccCc Q lcl|NC_011269. 231 PQEYRDLYRWDIN-TTGWAFKDSVVAGERIVQF--GEFQIGKSI----------IIPRGTV-YLTPEPEFLGVFPVMYSL 296 (333) Q Consensus 231 ~~~~~Di~gw~~N-~~~~~~~DpV~~~e~il~~--G~fgi~~sk----------vlprgei-yvvadpE~~G~~pvR~~L 296 (333) ++.|..|+. +- +.+ .|+-+.+.--.+ -++|.+.-. ...-|+. ++..|....-...+|+++ T Consensus 268 ~~~~~~L~~--lkd~~G----~~l~~~~~~~~~~~tllG~~~v~~~~~~~~~~~~~~~~~~~~~~gdfs~~~~i~~~~~~ 341 (392) T protein:vir:10 268 QDGFNYLDK--LKDKDG----KYILQSDPTQKNKKLFAGTNPVVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKREDM 341 (392) T ss_pred HHHHHHHHH--hhccCC----CeEeecCccCCccccccCcccEEEecccccCCCcccCCceEEEEEehhceEEEEeecce Confidence 999999976 21 111 222222210000 123432111 1111222 455666644455678898 Q ss_pred eeccccch----hhhccceehhhhhhhhhhccceEEEEecC Q lcl|NC_011269. 
297 DVEEDNKV----ERFNKGWVMDELVGMAILNPRGIVILRKA 333 (333) Q Consensus 297 ~s~p~D~~----er~~kGWvm~E~~g~~i~N~~siv~~~~~ 333 (333) .++-.++. ++...++.++.-++.++.||-+++.+... T Consensus 342 ~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~~ 382 (392) T protein:vir:10 342 ELASTDVGGKAFTRNTLDLRAIQRDDVQMWDNEAAVYGEID 382 (392) T ss_pred EEEEeccccchhhcCceEEEEEEeeccEEecccceEEEEec Confidence 88766533 33456799999999999999999999876 No 85 >protein:vir:102873 Length: 392 # NCBI annotation: major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1492 # MgeName: Cherry # Cross-refs: genbank:acc:YP_338137;genbank:gi:77020198;genbank:GeneID:3703782 Probab=98.08 E-value=4.1e-07 Score=55.58 Aligned_cols=299 Identities=12% Similarity=0.058 Sum_probs=180.6 Q ss_pred CcccchhhhhhhhhhcccchHHHHHHHHHHhhcchhcchHHHHHHHH----HHhc---CchhHHHHHHHHHHHHHHHHHh Q lcl|NC_011269. 1 MTLPVAVGSGLGRFAKASDDYVADIVEAKQRMGGRKLSAREKQAKLA----HILS---DKVGGIQRLGQSMIGPIQLQLR 73 (333) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ls~ee~~~Lm~----~Al~---~~Eg~~~aLg~~mA~pI~~q~~ 73 (333) ....-...++--+-.+++..|......+ +.+..++.+++..+-. .+++ +..|+. .+-..+...|...++ T Consensus 56 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~---l~~~~~~~~~~~~~~~~~~~~~~~~~t~~~gg~-~vP~~~~~~ii~~~~ 131 (392) T protein:vir:10 56 TEERNNGREVETRNVDGEMEYRDVFMKA---LRNKPLNAEEREFLEDDLEQRAMSGLTGEDGGL-VIPQDIQTQINELAR 131 (392) T ss_pred HHHhhccccccccCccchHHHHHHHHHH---HhcccccHHHHHHHhhhhhhhhccccccCCCce-ecchhHHHHHHHHHH Confidence 0000011111122233444554443332 3455666666544321 1221 234442 456678888888999 Q ss_pred hhhhhhhhhhccccCCCcc-eeecCCCCccceEEEEcCCCccccee-ecCceeeccceeeeccccccHHHhhhhcchhHH Q lcl|NC_011269. 74 YQGILRNVLLEDTLTPGVP-IQYDVLDDLGQAYMLHGNEGEIRITP-FEGKRIEVQLFRIASFPQIKKEDLYYLRSNIVE 151 (333) Q Consensus 74 rqGi~RklL~~~TL~~G~~-p~y~v~~~v~~a~~~~~~~G~i~~Q~-i~~~ri~~P~f~Ivs~P~V~~~dl~~~~~~vle 151 (333) ....++++....+++.+.. ..+++..+. ..+-|++-.+++.+.- ..-+.|++.-..+...+.|..+-|.+...|+.. 
T Consensus 132 ~~s~l~~~~~~~~~~~~~~~~~~~~~~~~-~~a~~v~E~~~~~~~~~~~~~~v~l~~~k~~~~~~iS~ell~ds~~~l~~ 210 (392) T protein:vir:10 132 SFDALEQYVTVEPVRTRSGSRVLEKNSDM-IPFAEITEMGEIPETDNPKFSNVQYAVKDRAGILPLSRSLLQDSDQNILK 210 (392) T ss_pred hhhhhhhhceeeeccCCceeEEEEeecCC-ccceeecccccccccccccceeEEeeeeeEEEeehhhHHHHhhhHHHHHH Confidence 9999999988888875542 123322222 2445777777776542 334788888888999999999989988899999 Q ss_pred HHHHHHHHHHHHHhhhHHHHHHhhhhhhhhhhcccccccccCCCcceEEeeccccHHHHHHHHH-HHHhhCCccceEEec Q lcl|NC_011269. 152 YTQDMTKQAIMRQEDSRLVTLLEAAAVSYRVVDSSAQPGVGALPNEITIAGSHLMPDDLYTAVT-YTDQRQLDSSRLLAN 230 (333) Q Consensus 152 ~~q~~A~qaIM~qED~~~~slle~~a~~~r~~~ssA~p~vg~~~N~i~i~~g~Lt~~~L~~a~t-~v~~~~L~at~il~~ 230 (333) +...+..++|..-+|..+++...+.. ..+..+-+++-.++. .+..........+|| T Consensus 211 ~i~~~l~~~i~~~~d~~~~~g~g~~~-----------------------~~~~~~~d~i~~~~~~~l~~~~~~~a~~vm~ 267 (392) T protein:vir:10 211 YVTKWLGKKSKVTRNVLILGVIEKLT-----------------------KQAIKSLDDIKDVLNVKLDPAISPNAILLTN 267 (392) T ss_pred HHHHHHHHHHHHHHHHHHhhcccccc-----------------------ccCccCHHHHHHHHHHhhhhhhccCCEEEEc Confidence 99999999999999977766542211 123356677777764 455555556779999 Q ss_pred hhhhhhhhhcCCC-chhhhHHhhhhhcceeeee--eccccccee----------eecCCeE-EEeeChhhhcccccccCc Q lcl|NC_011269. 231 PQEYRDLYRWDIN-TTGWAFKDSVVAGERIVQF--GEFQIGKSI----------IIPRGTV-YLTPEPEFLGVFPVMYSL 296 (333) Q Consensus 231 ~~~~~Di~gw~~N-~~~~~~~DpV~~~e~il~~--G~fgi~~sk----------vlprgei-yvvadpE~~G~~pvR~~L 296 (333) ++.|..|+. +- +.+ .|+-+.+.--.+ -++|.+.-. ...-|+. ++..|....-...+|+++ T Consensus 268 ~~~~~~L~~--lkd~~G----~~l~~~~~~~~~~~tllG~~~v~~~~~~~~~~~~~~~~~~~~~~gdfs~~~~i~~~~~~ 341 (392) T protein:vir:10 268 QDGFNYLDK--LKDKDG----KYILQSDPTQKNKKLFAGTNPVVVVSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKREDM 341 (392) T ss_pred HHHHHHHHH--hhccCC----CeEeecCccCCccccccCcccEEEecccccCCCcccCCceEEEEEehhceEEEEeecce Confidence 999999976 21 111 222222210000 123432111 1111222 455666644455678898 Q ss_pred eeccccch----hhhccceehhhhhhhhhhccceEEEEecC Q lcl|NC_011269. 
297 DVEEDNKV----ERFNKGWVMDELVGMAILNPRGIVILRKA 333 (333) Q Consensus 297 ~s~p~D~~----er~~kGWvm~E~~g~~i~N~~siv~~~~~ 333 (333) .++-.++. ++...++.++.-++.++.||-+++.+... T Consensus 342 ~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~~ 382 (392) T protein:vir:10 342 ELASTDVGGKAFTRNTLDLRAIQRDDVQMWDNEAAVYGEID 382 (392) T ss_pred EEEEeccccchhhcCceEEEEEEeeccEEecccceEEEEec Confidence 88766533 33456799999999999999999999876 No 86 >protein:vir:8102 Length: 543 # NCBI annotation: gp6 # Family: family:all:21 # MgeID: mge:152 # MgeName: Che9c # Cross-refs: genbank:acc:NP_817683;genbank:gi:29566114;genbank:GeneID:1259308 Probab=98.06 E-value=4.4e-07 Score=55.42 Aligned_cols=311 Identities=18% Similarity=0.161 Sum_probs=172.9 Q ss_pred Cc----------------ccchh----------hhhhhhh--hcccchHHHHHHHHHHhhcchhcchHHHHHHH---HHH Q lcl|NC_011269. 1 MT----------------LPVAV----------GSGLGRF--AKASDDYVADIVEAKQRMGGRKLSAREKQAKL---AHI 49 (333) Q Consensus 1 ~~----------------~~~~~----------~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~ls~ee~~~Lm---~~A 49 (333) ++ +--.. -.-+.+. +.....+.....+......+..+..+++..+- +.. T Consensus 173 ~~~~~~~~~e~l~~~~e~~~~~~~~~~~~~d~~e~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~l~~~e~~~~~~~~~~~ 252 (543) T protein:vir:81 173 LRARALSAIEKMQGASDNVRAAATKIIERFDDEDSTLARQCLATSSPAYLRAWSKMARNPHAAILTEEEKRAINEVRAMG 252 (543) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhhhHHHHHHHhhHHHHhhhhhhhhhhhhhhcc Confidence 00 00000 0000000 00111111111111111122223333322221 112 Q ss_pred hcCchhHHHHHHHHHHHH-HHHHHhhhhhhhhhhhccccCCCcceeecCCCCccceEEEEcCCCcccceeecCceeeccc Q lcl|NC_011269. 50 LSDKVGGIQRLGQSMIGP-IQLQLRYQGILRNVLLEDTLTPGVPIQYDVLDDLGQAYMLHGNEGEIRITPFEGKRIEVQL 128 (333) Q Consensus 50 l~~~Eg~~~aLg~~mA~p-I~~q~~rqGi~RklL~~~TL~~G~~p~y~v~~~v~~a~~~~~~~G~i~~Q~i~~~ri~~P~ 128 (333) ..+..|+. .+-..+... |...++.....+++.++.+. .|.. .+++... 
+..+.|.+--+.++.....=+.|++.- T Consensus 253 ~t~~~gg~-lip~~~~~~ii~~~~~~~~~l~~~~~~~~~-~g~~-~~~~~~~-~~~a~~v~Eg~~~~~~~~~~~~i~~~~ 328 (543) T protein:vir:81 253 LTKADGGY-LVPFQLDPTVIITSNGSLNDIRRFARQVVA-TGDV-WHGVSSA-AVQWSWDAEFEEVSDDSPEFGQPEIPV 328 (543) T ss_pred cccccCcc-cCchhhhhHHHHHHHhhhchhhhhcccccC-Ccce-EEEEecC-CcceeecccCccccccccccceeeeee Confidence 33444432 222333333 33445555777777766554 4543 3444333 334556777777777777778899999 Q ss_pred eeeeccccccHHHhhhhcchhHHHHHHHHHHHHHHHhhhHHHHHHhhhhhhhhhhcccccccccCC-------CcceEEe Q lcl|NC_011269. 129 FRIASFPQIKKEDLYYLRSNIVEYTQDMTKQAIMRQEDSRLVTLLEAAAVSYRVVDSSAQPGVGAL-------PNEITIA 201 (333) Q Consensus 129 f~Ivs~P~V~~~dl~~~~~~vle~~q~~A~qaIM~qED~~~~slle~~a~~~r~~~ssA~p~vg~~-------~N~i~i~ 201 (333) .++..+..|..+=|+. +.++..+..+...++|.+-+|.-++ .+..++ ..| .|.+ ....+.. T Consensus 329 ~k~~~~~~is~ell~d-~~~~~~~i~~~l~~~~~~~~d~ail---~G~Gt~-------~~p-~Gi~~~~~~~~~~~~~~~ 396 (543) T protein:vir:81 329 KKAQGFVPISIEALQD-EANVTETVALLFAEGKDELEAVTLT---TGTGQG-------NQP-TGIVTALAGTAAEIAPVT 396 (543) T ss_pred eeeEeeehhhHHHHhc-cHHHHHHHHHHHHHHHHHHHHHHHh---ccCCCC-------ccc-ccchhhcccccccccccc Confidence 9999999999987765 4799999999999999999996554 222110 000 1111 0123456 Q ss_pred eccccHHHHHHHHHHHHhhCCccceEEechhhhhhhhhcCCCchhhhHHhhhhhcceeeeee-ccccc--ceeeecCCe- Q lcl|NC_011269. 202 GSHLMPDDLYTAVTYTDQRQLDSSRLLANPQEYRDLYRWDINTTGWAFKDSVVAGERIVQFG-EFQIG--KSIIIPRGT- 277 (333) Q Consensus 202 ~g~Lt~~~L~~a~t~v~~~~L~at~il~~~~~~~Di~gw~~N~~~~~~~DpV~~~e~il~~G-~fgi~--~skvlprge- 277 (333) ++.++-+++..++..+..-......++||+..|..|..=--..-.|.+- |+..+. .+ ++|.+ .+--||-+. 
T Consensus 397 ~~~~~~~~~~~~~~~l~~~~~~~~~~v~n~~~~~~l~~lkd~~G~~l~~-~~~~g~----~~~l~G~pv~~~~~~~~~~~ 471 (543) T protein:vir:81 397 AETFALADVYAVYEQLAARHRRQGAWLANNLIYNKIRQFDTQGGAGLWT-TIGNGE----PSQLLGRPVGEAEAMDANWN 471 (543) T ss_pred cccccHHHHHHHHHhhhccccCCcEEEEcHHHHHHHHHhhcCCCceecc-CcCCCC----CccccceeeEEecccccccc Confidence 6778899999999999888888888999999999998621111112222 222211 11 34433 333455443 Q ss_pred --------EEEeeChhhhcccccccCceeccccch---hhh---ccceehhhhhhhhhhccceEEEEecC Q lcl|NC_011269. 278 --------VYLTPEPEFLGVFPVMYSLDVEEDNKV---ERF---NKGWVMDELVGMAILNPRGIVILRKA 333 (333) Q Consensus 278 --------iyvvadpE~~G~~pvR~~L~s~p~D~~---er~---~kGWvm~E~~g~~i~N~~siv~~~~~ 333 (333) .++..|..++ .+-+|+|+.++-+++. +.+ ..+|.++.-+|+.+.||-+++++.-+ T Consensus 472 ~~~~~~~~~i~~gd~~~~-~i~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~r~d~~v~~~~A~~~l~~~ 540 (543) T protein:vir:81 472 TSASADNFVLLYGNFQNY-VIADRIGMTVEFIPHLFGTNRRPNGSRGWFAYYRMGADVVNPNAFRLLNVE 540 (543) T ss_pred ccccCCcceEEEeeccce-eEEeecccEEEEeccccccchhhcCceEEEEEEeeccEeecccceEEEEec Confidence 2445676543 4567888777644322 222 46899999999999999999999887 No 87 >protein:vir:102605 Length: 273 # NCBI annotation: gp6 # Family: family:all:2203 # MgeID: mge:1661 # MgeName: Llij # Cross-refs: genbank:acc:YP_655002;genbank:gi:109392192;genbank:GeneID:4157227 Probab=98.02 E-value=4.1e-07 Score=55.59 Aligned_cols=257 Identities=11% Similarity=0.053 Sum_probs=142.0 Q ss_pred HHHHhcCchhHHHHHHHHHHHHHHHHHhhhhhhhhhhhc---cccCCCcceeecCCCCccceEEEEcCCCcccceeecCc Q lcl|NC_011269. 46 LAHILSDKVGGIQRLGQSMIGPIQLQLRYQGILRNVLLE---DTLTPGVPIQYDVLDDLGQAYMLHGNEGEIRITPFEGK 122 (333) Q Consensus 46 m~~Al~~~Eg~~~aLg~~mA~pI~~q~~rqGi~RklL~~---~TL~~G~~p~y~v~~~v~~a~~~~~~~G~i~~Q~i~~~ 122 (333) |+...--+|=| ++.+...++.++. ..++... .+..+|....+|+++.++.+-+ ....|.+..+++... 
T Consensus 1 MA~~~~~pe~~----~~~v~~~~~~~lv----~~~l~~~~~~~~~~~Gdtv~ip~~~~~~~~d~-~~~~~~~~~~~~~~~ 71 (273) T protein:vir:10 1 MAFNNFIPELW----SDMLLEEWTAQTV----FANLVNREYEGTASKGNVVHIAGVVAPTVKDY-KAAGRQTSADAISDT 71 (273) T ss_pred CcchhhhHHHH----HHHHHHHHHhhhc----cchhhccccccccccCceEEEeeccccccccc-ccCCCccCccccccc Confidence 33322223322 2233333333332 2222211 1345788899999888765532 233444666666666 Q ss_pred eeecccee-eeccccccHHHhhhhcchhHHHHHHHHHHHHHHHhhhHHHHHHhhhhhhhhhhcccccccccCCCcceEEe Q lcl|NC_011269. 123 RIEVQLFR-IASFPQIKKEDLYYLRSNIVEYTQDMTKQAIMRQEDSRLVTLLEAAAVSYRVVDSSAQPGVGALPNEITIA 201 (333) Q Consensus 123 ri~~P~f~-Ivs~P~V~~~dl~~~~~~vle~~q~~A~qaIM~qED~~~~slle~~a~~~r~~~ssA~p~vg~~~N~i~i~ 201 (333) .+++---+ ....-.|+-.|-.+...++ +.....+..++-..-|..+++++-+++..+ .+ ..++ T Consensus 72 ~~~~tid~~~~~~~~i~d~d~~~~~~~~-~~~~~~~~~alA~~vD~~i~~~~~~a~~~~--------~~------~~~~- 135 (273) T protein:vir:10 72 GVDLLIDQEKSIDFLVDDIDRVQVAGSL-EAYTRAGATALATDTDKFIADMLVDNGTAL--------TG------SAPT- 135 (273) T ss_pred eEEEEEeeeeecceEeecHHHhhhhccH-HHHHHHHHHHHHHHHHHHHHHHHhcccccc--------cc------cccc- Confidence 66665222 2333356666667777775 667777888999999999998885554322 00 0001 Q ss_pred eccccHHHHHHHHHHHHhhCCcc--ceEEechhhhhhhhhcCCCchhhhHHhhhhhcceeeeeec------ccccceeee Q lcl|NC_011269. 202 GSHLMPDDLYTAVTYTDQRQLDS--SRLLANPQEYRDLYRWDINTTGWAFKDSVVAGERIVQFGE------FQIGKSIII 273 (333) Q Consensus 202 ~g~Lt~~~L~~a~t~v~~~~L~a--t~il~~~~~~~Di~gw~~N~~~~~~~DpV~~~e~il~~G~------fgi~~skvl 273 (333) ...-.-+.|-.|.+..++.+.|. -+++++|..|.+|.. .+.|...+ |-. ...-++..|. 
|.|..|..| T Consensus 136 ~~~~~~~~i~~a~~~ld~~~vP~~~R~lvv~p~~~~~L~~--~~~~~~~~-~~~-~~~~~l~~G~ig~i~G~~v~~s~~l 211 (273) T protein:vir:10 136 DADDAFDLIAKALKELTKANVPNVGRVVVVNAEMAFWLRS--SGSKLTSA-DTS-GDAAGLRAGTIGNLLGARIVESNNL 211 (273) T ss_pred chhHHHHHHHHHHHHhhhcCCCcCCCEEEECHHHHHHHhc--chhhhhhh-hcc-ccccceeeeeeeEEeceEEEEeccc Confidence 11223467888888899999865 479999999999988 33332111 111 1112333343 447788888 Q ss_pred cCCe--EEEeeChhhhcccccccCceeccccchhhhccceehhhhhhhhhhccceEEEEecC Q lcl|NC_011269. 274 PRGT--VYLTPEPEFLGVFPVMYSLDVEEDNKVERFNKGWVMDELVGMAILNPRGIVILRKA 333 (333) Q Consensus 274 prge--iyvvadpE~~G~~pvR~~L~s~p~D~~er~~kGWvm~E~~g~~i~N~~siv~~~~~ 333 (333) |-++ -++.--+... .+..+ -.++|..--..+++.-=.-....|-.+.+|.++|.+++. T Consensus 212 p~~~~~~~~~~~~~A~-~~a~q-~~~~e~~r~~~~~~~~v~~~~~yg~~v~~~~~~~~l~~~ 271 (273) T protein:vir:10 212 RDTDDEQFVAFHPSAA-AYVSQ-IDTVEALRDQDSFSDRIRALHVYGGKVVRPTGVVVFNKT 271 (273) T ss_pred ccCCccEEEEEeccce-eeeee-eehhhcccCCCcceeeeeeeeeeeeeEeccceEEEEecc Confidence 8653 1222223323 34432 223332222222332222234578999999999999999 No 88 >protein:vir:105822 Length: 273 # NCBI annotation: gp6 # Family: family:all:2203 # MgeID: mge:1636 # MgeName: PMC # Cross-refs: genbank:acc:YP_655767;genbank:gi:109522090;genbank:GeneID:4157630 Probab=98.02 E-value=4.1e-07 Score=55.59 Aligned_cols=257 Identities=11% Similarity=0.053 Sum_probs=142.0 Q ss_pred HHHHhcCchhHHHHHHHHHHHHHHHHHhhhhhhhhhhhc---cccCCCcceeecCCCCccceEEEEcCCCcccceeecCc Q lcl|NC_011269. 46 LAHILSDKVGGIQRLGQSMIGPIQLQLRYQGILRNVLLE---DTLTPGVPIQYDVLDDLGQAYMLHGNEGEIRITPFEGK 122 (333) Q Consensus 46 m~~Al~~~Eg~~~aLg~~mA~pI~~q~~rqGi~RklL~~---~TL~~G~~p~y~v~~~v~~a~~~~~~~G~i~~Q~i~~~ 122 (333) |+...--+|=| ++.+...++.++. ..++... .+..+|....+|+++.++.+-+ ....|.+..+++... 
T Consensus 1 MA~~~~~pe~~----~~~v~~~~~~~lv----~~~l~~~~~~~~~~~Gdtv~ip~~~~~~~~d~-~~~~~~~~~~~~~~~ 71 (273) T protein:vir:10 1 MAFNNFIPELW----SDMLLEEWTAQTV----FANLVNREYEGTASKGNVVHIAGVVAPTVKDY-KAAGRQTSADAISDT 71 (273) T ss_pred CcchhhhHHHH----HHHHHHHHHhhhc----cchhhccccccccccCceEEEeeccccccccc-ccCCCccCccccccc Confidence 33322223322 2233333333332 2222211 1345788899999888765532 233444666666666 Q ss_pred eeecccee-eeccccccHHHhhhhcchhHHHHHHHHHHHHHHHhhhHHHHHHhhhhhhhhhhcccccccccCCCcceEEe Q lcl|NC_011269. 123 RIEVQLFR-IASFPQIKKEDLYYLRSNIVEYTQDMTKQAIMRQEDSRLVTLLEAAAVSYRVVDSSAQPGVGALPNEITIA 201 (333) Q Consensus 123 ri~~P~f~-Ivs~P~V~~~dl~~~~~~vle~~q~~A~qaIM~qED~~~~slle~~a~~~r~~~ssA~p~vg~~~N~i~i~ 201 (333) .+++---+ ....-.|+-.|-.+...++ +.....+..++-..-|..+++++-+++..+ .+ ..++ T Consensus 72 ~~~~tid~~~~~~~~i~d~d~~~~~~~~-~~~~~~~~~alA~~vD~~i~~~~~~a~~~~--------~~------~~~~- 135 (273) T protein:vir:10 72 GVDLLIDQEKSIDFLVDDIDRVQVAGSL-EAYTRAGATALATDTDKFIADMLVDNGTAL--------TG------SAPT- 135 (273) T ss_pred eEEEEEeeeeecceEeecHHHhhhhccH-HHHHHHHHHHHHHHHHHHHHHHHhcccccc--------cc------cccc- Confidence 66665222 2333356666667777775 667777888999999999998885554322 00 0001 Q ss_pred eccccHHHHHHHHHHHHhhCCcc--ceEEechhhhhhhhhcCCCchhhhHHhhhhhcceeeeeec------ccccceeee Q lcl|NC_011269. 202 GSHLMPDDLYTAVTYTDQRQLDS--SRLLANPQEYRDLYRWDINTTGWAFKDSVVAGERIVQFGE------FQIGKSIII 273 (333) Q Consensus 202 ~g~Lt~~~L~~a~t~v~~~~L~a--t~il~~~~~~~Di~gw~~N~~~~~~~DpV~~~e~il~~G~------fgi~~skvl 273 (333) ...-.-+.|-.|.+..++.+.|. -+++++|..|.+|.. .+.|...+ |-. ...-++..|. 
|.|..|..| T Consensus 136 ~~~~~~~~i~~a~~~ld~~~vP~~~R~lvv~p~~~~~L~~--~~~~~~~~-~~~-~~~~~l~~G~ig~i~G~~v~~s~~l 211 (273) T protein:vir:10 136 DADDAFDLIAKALKELTKANVPNVGRVVVVNAEMAFWLRS--SGSKLTSA-DTS-GDAAGLRAGTIGNLLGARIVESNNL 211 (273) T ss_pred chhHHHHHHHHHHHHhhhcCCCcCCCEEEECHHHHHHHhc--chhhhhhh-hcc-ccccceeeeeeeEEeceEEEEeccc Confidence 11223467888888899999865 479999999999988 33332111 111 1112333343 447788888 Q ss_pred cCCe--EEEeeChhhhcccccccCceeccccchhhhccceehhhhhhhhhhccceEEEEecC Q lcl|NC_011269. 274 PRGT--VYLTPEPEFLGVFPVMYSLDVEEDNKVERFNKGWVMDELVGMAILNPRGIVILRKA 333 (333) Q Consensus 274 prge--iyvvadpE~~G~~pvR~~L~s~p~D~~er~~kGWvm~E~~g~~i~N~~siv~~~~~ 333 (333) |-++ -++.--+... .+..+ -.++|..--..+++.-=.-....|-.+.+|.++|.+++. T Consensus 212 p~~~~~~~~~~~~~A~-~~a~q-~~~~e~~r~~~~~~~~v~~~~~yg~~v~~~~~~~~l~~~ 271 (273) T protein:vir:10 212 RDTDDEQFVAFHPSAA-AYVSQ-IDTVEALRDQDSFSDRIRALHVYGGKVVRPTGVVVFNKT 271 (273) T ss_pred ccCCccEEEEEeccce-eeeee-eehhhcccCCCcceeeeeeeeeeeeeEeccceEEEEecc Confidence 8653 1222223323 34432 223332222222332222234578999999999999999 No 89 >protein:vir:96762 Length: 632 # NCBI annotation: putative phage-related protein # Family: family:all:21 # MgeID: mge:1628 # MgeName: VP882 # Cross-refs: genbank:acc:YP_001039818;genbank:gi:126010917;genbank:GeneID:5076272 Probab=97.99 E-value=4.9e-07 Score=55.18 Aligned_cols=307 Identities=15% Similarity=0.088 Sum_probs=177.1 Q ss_pred Ccccc----------------------hhhhhhhhhhcc--cchHHH------HHHHHHHhhcchhcc--hHHHHHHHHH Q lcl|NC_011269. 1 MTLPV----------------------AVGSGLGRFAKA--SDDYVA------DIVEAKQRMGGRKLS--AREKQAKLAH 48 (333) Q Consensus 1 ~~~~~----------------------~~~~~~~~~~~~--~~~~~~------~~~~~~~~~~~~~ls--~ee~~~Lm~~ 48 (333) ..-+. +-+..+.|..++ ..++.. .-.+...+.| +.-. .-...++... 
T Consensus 277 ~~~~~a~~~~~~~~~~~~~~i~~~~re~~~~~l~rai~a~a~~~~~~a~~~~e~a~~~a~~~G-~~arg~~~~~~~l~~r 355 (632) T protein:vir:96 277 FEKPGAGDLPGKPAIHSARDLGIQHKELQQYSLMRAINAAATGDWSKAGFEREVSLAIADASG-KEARGFYMPHEVLVQR 355 (632) T ss_pred hhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHHHHhhhccchhhhhhhhHHHHHHHHhhh-hhhhhhhhhHHHHHHh Confidence 00000 000011111100 000000 0001111111 1000 0012234444 Q ss_pred HhcCc---hhHHHHHH-HHHHHHHHHHHhhhhhhhhh-hhccccCCCcceeecCCCCccceEEEEcCCCcccceeecCce Q lcl|NC_011269. 49 ILSDK---VGGIQRLG-QSMIGPIQLQLRYQGILRNV-LLEDTLTPGVPIQYDVLDDLGQAYMLHGNEGEIRITPFEGKR 123 (333) Q Consensus 49 Al~~~---Eg~~~aLg-~~mA~pI~~q~~rqGi~Rkl-L~~~TL~~G~~p~y~v~~~v~~a~~~~~~~G~i~~Q~i~~~r 123 (333) +++.. .|+. ... +-+.++|-+.|+.+-+.+++ ...-+...|. ..||+...- ..+.|.+-.+.+......=+. T Consensus 356 a~~~~t~~~gg~-lvp~~~~~~~iie~lr~~s~i~~l~~~~~~~~~g~-~~ip~~~~~-~~a~wv~E~~~~~~s~~~f~~ 432 (632) T protein:vir:96 356 QLEKKTAGKGGE-LVATELLSEEFIDILRNKAIIGQMGARMLPGLVGD-VDIPKKTSG-ANFYWIGEDEDVQDSDFDFTT 432 (632) T ss_pred hhhccccccccc-ccccccchHHHHHHHhhcchhhhhcceEeecCCcc-eEEEEEeCC-ceeEeecCCccccccccceee Confidence 44432 2221 112 22456666666667777776 2334444554 345554333 344578888888888777788 Q ss_pred eeccceeeeccccccHHHhhhhcchhHHHHHHHHHHHHHHHhhhHHHHHHhhhhhhhhhhcccccccccCC----CcceE Q lcl|NC_011269. 124 IEVQLFRIASFPQIKKEDLYYLRSNIVEYTQDMTKQAIMRQEDSRLVTLLEAAAVSYRVVDSSAQPGVGAL----PNEIT 199 (333) Q Consensus 124 i~~P~f~Ivs~P~V~~~dl~~~~~~vle~~q~~A~qaIM~qED~~~~slle~~a~~~r~~~ssA~p~vg~~----~N~i~ 199 (333) +++.-.++...+-|..+=|.+...++-....++-.++|-+.+|.-++ .+..+ +-+|. 
|.+ .|.++ T Consensus 433 i~l~~~k~~~~v~iS~ell~ds~~~~~~~i~~~l~~a~~~~~d~a~l---~G~G~-------~~~p~-Gi~~~~~~~~~~ 501 (632) T protein:vir:96 433 LSFSPKTIAGAVPVTRKLRKQSSIHVENLIREDLIEGIGVALDLAML---TGTGL-------ANDPV-GLLNMTGVPALT 501 (632) T ss_pred EEeeeeEEEEehhhHHHHHhccchHHHHHHHHHHHHHHHHHHHHHhh---cccCC-------CCccc-eeeeccccccee Confidence 89888899999888888899999999999999999999999996554 23221 11221 111 25677 Q ss_pred EeeccccHHHHHHHHHHHHhhCCccc--eEEechhhhhhhhhcCCCchhhhHHhhhhhcceeeeeec-cccc--ceeeec Q lcl|NC_011269. 200 IAGSHLMPDDLYTAVTYTDQRQLDSS--RLLANPQEYRDLYRWDINTTGWAFKDSVVAGERIVQFGE-FQIG--KSIIIP 274 (333) Q Consensus 200 i~~g~Lt~~~L~~a~t~v~~~~L~at--~il~~~~~~~Di~gw~~N~~~~~~~DpV~~~e~il~~G~-fgi~--~skvlp 274 (333) ..++..+.+++-.+...+...+.+.. ..+||+..+..+....+- | .++..+.+.|. .|.+ .+--|| T Consensus 502 ~~~~~~~~~~i~~~~~~i~~~~~~~~~~~~~~~~~~~~~l~~~~l~-------d--~~G~~i~~~~~l~G~pv~~s~~ip 572 (632) T protein:vir:96 502 YPAGGVDWASVVDMETKISTFNADAGRLAYLTSVTQRGAAKKAQVF-------D--NTGERIWQNNEVNGYRAEASNQIP 572 (632) T ss_pred cccccCCHHHHHHHHHHHhhcccccCccEEEEchhHHHHHHHHhcc-------C--CCCceeecCCeecccceEeccccc Confidence 88888999999999988888775544 568888766655532111 1 12333443333 3433 556688 Q ss_pred CCeEEEeeChhhhcccccccCceeccccchhh--hccceehhhhhhhhhhccceEEEEecC Q lcl|NC_011269. 275 RGTVYLTPEPEFLGVFPVMYSLDVEEDNKVER--FNKGWVMDELVGMAILNPRGIVILRKA 333 (333) Q Consensus 275 rgeiyvvadpE~~G~~pvR~~L~s~p~D~~er--~~kGWvm~E~~g~~i~N~~siv~~~~~ 333 (333) -|.+++. |... ..+=.++|+.+.-.++... 
-.....+++-+++++.+|.++|+++|+ T Consensus 573 ~~~~~~g-d~s~-~~i~~~~~~~i~~~~~~~~~~~~v~~~~~~~~d~~v~~~~af~~~k~~ 631 (632) T protein:vir:96 573 ADTWIFG-DWSQ-IVIAMWGVLDLKVDPYTKAASDGLVLRVFQDVDAGVRRKEAFCIAKKG 631 (632) T ss_pred cCcEEEe-ecce-EEEEEecceEEEEccccccccCceEEEEEeecCceeechhhhhheeec Confidence 8887643 3331 2244678888887664432 355678899999999999999999999 No 90 >protein:vir:94622 Length: 341 # NCBI annotation: PfWMP4_37 # Family: family:all:2203 # MgeID: mge:1525 # MgeName: Pf-WMP4 # Cross-refs: genbank:acc:YP_762667;genbank:gi:115304375;genbank:GeneID:5142322 Probab=97.97 E-value=2.1e-06 Score=51.77 Aligned_cols=281 Identities=12% Similarity=0.114 Sum_probs=160.6 Q ss_pred HHHHHhhcchhcchHHHHHHHHHHhcCchhHHHHHHHHHHHHHHHHHhhhhhhhhhhhccccCCCcceeecCCCCccceE Q lcl|NC_011269. 26 VEAKQRMGGRKLSAREKQAKLAHILSDKVGGIQRLGQSMIGPIQLQLRYQGILRNVLLEDTLTPGVPIQYDVLDDLGQAY 105 (333) Q Consensus 26 ~~~~~~~~~~~ls~ee~~~Lm~~Al~~~Eg~~~aLg~~mA~pI~~q~~rqGi~RklL~~~TL~~G~~p~y~v~~~v~~a~ 105 (333) ..--.-.+|--++-..-+..+.++++. .+-..++..+-...+.|.+- ....+|...++|.+.+.+.+. T Consensus 1 ~~~~~~~~~~~~~t~~v~~fipei~s~----------~i~~~l~~~~v~~~~~~d~~--~~~~~Gdtv~ip~~g~~~~~d 68 (341) T protein:vir:94 1 MALGNTITGPSINTQRGQQFIPEQWLS----------EVQMFRKAKMLDTSVVKTWG--AQVKKGDTFHVPRISELGVED 68 (341) T ss_pred CcchhhhccccccchhHHHHHHHHHHH----------HHHHHHHhhcchhhcccccc--ccccCCceEEEeccCcceeee Confidence 000111233333333333333333222 22222333333333333331 123568889999988886666 Q ss_pred EEEcCCCcccceeecCceeecccee-eeccccccHHHhhhhcchhHHHHHHHHHHHHHHHhhhHHHHHHhhhhhhhhhhc Q lcl|NC_011269. 
106 MLHGNEGEIRITPFEGKRIEVQLFR-IASFPQIKKEDLYYLRSNIVEYTQDMTKQAIMRQEDSRLVTLLEAAAVSYRVVD 184 (333) Q Consensus 106 ~~~~~~G~i~~Q~i~~~ri~~P~f~-Ivs~P~V~~~dl~~~~~~vle~~q~~A~qaIM~qED~~~~slle~~a~~~r~~~ 184 (333) ..+.+.+..+++....+++..-+ ..+.-.|+-.|..+...|+.++.-..+.+++-.+-|..++.++-+++ T Consensus 69 --~~~~~~i~~~~~~~~~~~itiD~~~~~~~~i~d~d~~~~~~d~~~~~~~~~~~aLA~~~D~~i~~~~a~~~------- 139 (341) T protein:vir:94 69 --KATDVPVGVQPVNDTDFVITVDTDRTTAVALDDLLEIQASYDLRAPYLEAMGYALAKDMTGSILGLRAAVQ------- 139 (341) T ss_pred --ecCCCccccccccCceEEEEEeeeeecceeechHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHhhhcc------- Confidence 45566677787777777766322 34556688888889999999999999999999999998888773332 Q ss_pred ccccccccCCCc-ceEEeeccccHHHHHHHHHHHHhhCCccc--eEEechhhhhhhhhcCCCchhhhHHhhhhhcceeee Q lcl|NC_011269. 185 SSAQPGVGALPN-EITIAGSHLMPDDLYTAVTYTDQRQLDSS--RLLANPQEYRDLYRWDINTTGWAFKDSVVAGERIVQ 261 (333) Q Consensus 185 ssA~p~vg~~~N-~i~i~~g~Lt~~~L~~a~t~v~~~~L~at--~il~~~~~~~Di~gw~~N~~~~~~~DpV~~~e~il~ 261 (333) ..+-+.++.-+| +++-.+..++-+.+-.|.+..++.+.|.. +++++|+.|.+|.- ++- +.-.|-.... .+. T Consensus 140 ~~~~~~~~~~~~~~~t~~~~~~~~~~i~~a~~~Lde~~VP~~gR~lvv~P~~~~~Ll~---~~~-~~~~~~~g~~--~l~ 213 (341) T protein:vir:94 140 NTASQNVFSSSNGAITGNGQAFSFAVFLAARRLLLEADVPEEKIVLLISPGQESALFT---IPQ-FISKDFINNA--PIA 213 (341) T ss_pred ccccCccccCccccccCchhhhhHHHHHHHHHHHhhcCCCccCCEEEeCHHHHHHHhh---chh-hhhhhccccc--hhh Confidence 222233222211 12222345677888899999999999864 68889999999986 222 1223333222 233 Q ss_pred eec------ccccceeeecCCeEEEe-----------eChhhhccccc---------ccCc----------e-------- Q lcl|NC_011269. 262 FGE------FQIGKSIIIPRGTVYLT-----------PEPEFLGVFPV---------MYSL----------D-------- 297 (333) Q Consensus 262 ~G~------fgi~~skvlprgeiyvv-----------adpE~~G~~pv---------R~~L----------~-------- 297 (333) .|. |.|..|..+|-++.+-. +.|-.-|+.+. 
.-|| + T Consensus 214 ~G~ig~i~G~~V~~Sn~lp~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~gl~~~~~av~~~k~~~~~~~~ 293 (341) T protein:vir:94 214 QGQIGSLMGVRVIRTSLIGNNSATGWRNGAPTIAPAEATPGFTGSRYLPKQDSFTSLPATFTGNSRPVHTAVMCHMDWAA 293 (341) T ss_pred eeeeeeEeceEEEEeccccccccccccccccceecccccccccccccccccccccccEEEEEEecccccceeeecchhhh Confidence 444 44778888888775532 11111111111 1122 0 Q ss_pred -eccccc---hhh--hccceehhhh--hhhhhhccceEEEEecC Q lcl|NC_011269. 298 -VEEDNK---VER--FNKGWVMDEL--VGMAILNPRGIVILRKA 333 (333) Q Consensus 298 -s~p~D~---~er--~~kGWvm~E~--~g~~i~N~~siv~~~~~ 333 (333) ..+-.. .++ -..||.|--. +|-.+.||.++|.|+.+ T Consensus 294 ~~~~~~~~~~~~~~~~~~~~~i~~~~~~G~~~lrp~~~v~~~~~ 337 (341) T protein:vir:94 294 AVVSKAPRVTQSFENREQVWLMVGRQAYGARLYRPLHAVNIHTT 337 (341) T ss_pred ccccccccccccchhhhhhhhhhhhhhhcccccCcceeEEEecC Confidence 001000 011 1467766544 57779999999999998 No 91 >protein:vir:96978 Length: 387 # NCBI annotation: ORF009 # Family: family:all:658 # MgeID: mge:1643 # MgeName: 42e # Cross-refs: genbank:acc:YP_239859;genbank:gi:66395517;genbank:GeneID:5133011 Probab=97.96 E-value=3.6e-07 Score=55.90 Aligned_cols=303 Identities=10% Similarity=0.025 Sum_probs=167.4 Q ss_pred Ccccchhhhhhhhhhc---ccchHHHHHHHHHHhhcchhcchHHHHHHHHHHhc---CchhHHHHHHHHHHHHHHHHHhh Q lcl|NC_011269. 1 MTLPVAVGSGLGRFAK---ASDDYVADIVEAKQRMGGRKLSAREKQAKLAHILS---DKVGGIQRLGQSMIGPIQLQLRY 74 (333) Q Consensus 1 ~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~ls~ee~~~Lm~~Al~---~~Eg~~~aLg~~mA~pI~~q~~r 74 (333) +.-+-..+.....-.+ +..+|+..... +...-..........+++. +..|+. .+-+-+...|.+.++. 
T Consensus 71 ~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~-----~~~~~~~~~~~~~~~~a~~~~~~~~gG~-lIP~~~~~~Ii~~~~~ 144 (387) T protein:vir:96 71 VKDKGEAYQSLSDNEKMVKAKAEFYRHAIL-----PNEFEKPSMEAQRLLHALPTGNDSGGDK-LLPKTLSKEIVSEPFA 144 (387) T ss_pred hhhccccCCCCchhHHHHHHHHHHHHHHHh-----hhhHHHHHHHHHHHHhhhccCCCCCCce-eechhHHHHHHHHHHh Confidence 1111111111110000 00011111100 1010111111112233332 234442 5677788899999999 Q ss_pred hhhhhhhhhccccCCCcceeecCCCCccceEEEEcCCCcccceeecCceeeccceeeeccccccHHHhhhhcchhHHHHH Q lcl|NC_011269. 75 QGILRNVLLEDTLTPGVPIQYDVLDDLGQAYMLHGNEGEIRITPFEGKRIEVQLFRIASFPQIKKEDLYYLRSNIVEYTQ 154 (333) Q Consensus 75 qGi~RklL~~~TL~~G~~p~y~v~~~v~~a~~~~~~~G~i~~Q~i~~~ri~~P~f~Ivs~P~V~~~dl~~~~~~vle~~q 154 (333) +...|++....+......|+.... ...+.|++-.+++..+...-+.|++.-.++..++.|.-+=|..+..|+..+.. T Consensus 145 ~~~l~~~~~~~~~~~~~~p~~~~~---~~~a~~v~Eg~~~~~~~~~f~~v~l~~~k~~~~i~iS~ell~ds~~~l~~~i~ 221 (387) T protein:vir:96 145 KNQLREKARLTNIKGLEIPRVSYT---LDDDDFITDVETAKELKAKGDTVKFTTNKFKVFAAISDTVIHGSDVDLVNWVE 221 (387) T ss_pred hchhhhhceeeecCCceeeeeecc---CCccccccccccccccccccceeeechheeeeechhhHHHHhhhHHHHHHHHH Confidence 999999998888877777765532 22345788777788877777899999999999999999989999999999999 Q ss_pred HHHHHHHHHHhhhHHHHHHhhhhhhhhhhcccccccccCCCcceEEeeccccHHHHHHHHHHHHhhCCccceEEechhhh Q lcl|NC_011269. 155 DMTKQAIMRQEDSRLVTLLEAAAVSYRVVDSSAQPGVGALPNEITIAGSHLMPDDLYTAVTYTDQRQLDSSRLLANPQEY 234 (333) Q Consensus 155 ~~A~qaIM~qED~~~~slle~~a~~~r~~~ssA~p~vg~~~N~i~i~~g~Lt~~~L~~a~t~v~~~~L~at~il~~~~~~ 234 (333) +.-.++|.+-|+..++.-- . 
.+.+|......+.++-+.+.-+-++|..++.-+..--......+||+.-| T Consensus 222 ~~la~~~~~~e~~~~~~~g--~--------g~g~~~g~~~~~~~~~~~~~~~~d~i~~~~~~l~~~y~~na~~imn~~t~ 291 (387) T protein:vir:96 222 NALQSGLAAKERKDALAVS--P--------KSGLEHMSFYNGSVKEVEGADMYDAIINALADLHEDYRDNATIYMRYADY 291 (387) T ss_pred HHHHHHHHHHHHHhHhhcC--C--------CccccceeeeccccccccccchHHHHHHHHhccChhhhcCCEEEEechHH Confidence 9999999998776554222 1 22333222222334444555566777777765544333445678888877 Q ss_pred hhhhhcCCCchhhhHHhhhhhcceeeeeecccccceeeecCCeEEEeeChhhhcccccccCceeccccchhhhccceehh Q lcl|NC_011269. 235 RDLYRWDINTTGWAFKDSVVAGERIVQFGEFQIGKSIIIPRGTVYLTPEPEFLGVFPVMYSLDVEEDNKVERFNKGWVMD 314 (333) Q Consensus 235 ~Di~gw~~N~~~~~~~DpV~~~e~il~~G~fgi~~skvlprgeiyvvadpE~~G~~pvR~~L~s~p~D~~er~~kGWvm~ 314 (333) ..+..=--+..+ |+..+. -.-++|.+....=.-.. .++.|.... |=.+.++.......+..--.|++.. T Consensus 292 ~~~~~~~~~~~~-----~~~~~~---~~~llG~PV~~~~~~~~-~~~GDf~~~--~~~~~~~~~~~~~~~~~~~~~~~~~ 360 (387) T protein:vir:96 292 VKIISVLSNGTT-----NFFDTP---AEKVFGKPVVFTDAAVK-PIVGDFNYF--GINYDGTTYDTDKDVKKGEYLFVLT 360 (387) T ss_pred HHHHHHHhcCCC-----cccccC---CccccccceEEecCCCc-eeeechhhh--hhhhhhhhheecccccCCceEEEEE Confidence 766541111111 111111 01124433211100011 123333210 1122333333333333346789999 Q ss_pred hhhhhhhhccceEEEEecC Q lcl|NC_011269. 315 ELVGMAILNPRGIVILRKA 333 (333) Q Consensus 315 E~~g~~i~N~~siv~~~~~ 333 (333) +-++..+.||-++++|... T Consensus 361 ~r~Dg~v~~~~A~~~l~~k 379 (387) T protein:vir:96 361 AWYDQQRTLDSAFRIAKAK 379 (387) T ss_pred EEeCcEeechhheEEEEee Confidence 9999999999999999875 No 92 >protein:vir:94424 Length: 387 # NCBI annotation: ORF010 # Family: family:all:658 # MgeID: mge:1506 # MgeName: 47 # Cross-refs: genbank:acc:YP_240005;genbank:gi:66395666;genbank:GeneID:5133084 Probab=97.96 E-value=3.6e-07 Score=55.90 Aligned_cols=303 Identities=10% Similarity=0.025 Sum_probs=167.4 Q ss_pred Ccccchhhhhhhhhhc---ccchHHHHHHHHHHhhcchhcchHHHHHHHHHHhc---CchhHHHHHHHHHHHHHHHHHhh Q lcl|NC_011269. 
1 MTLPVAVGSGLGRFAK---ASDDYVADIVEAKQRMGGRKLSAREKQAKLAHILS---DKVGGIQRLGQSMIGPIQLQLRY 74 (333) Q Consensus 1 ~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~ls~ee~~~Lm~~Al~---~~Eg~~~aLg~~mA~pI~~q~~r 74 (333) +.-+-..+.....-.+ +..+|+..... +...-..........+++. +..|+. .+-+-+...|.+.++. T Consensus 71 ~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~-----~~~~~~~~~~~~~~~~a~~~~~~~~gG~-lIP~~~~~~Ii~~~~~ 144 (387) T protein:vir:94 71 VKDKGEAYQSLSDNEKMVKAKAEFYRHAIL-----PNEFEKPSMEAQRLLHALPTGNDSGGDK-LLPKTLSKEIVSEPFA 144 (387) T ss_pred hhhccccCCCCchhHHHHHHHHHHHHHHHh-----hhhHHHHHHHHHHHHhhhccCCCCCCce-eechhHHHHHHHHHHh Confidence 1111111111110000 00011111100 1010111111112233332 234442 5677788899999999 Q ss_pred hhhhhhhhhccccCCCcceeecCCCCccceEEEEcCCCcccceeecCceeeccceeeeccccccHHHhhhhcchhHHHHH Q lcl|NC_011269. 75 QGILRNVLLEDTLTPGVPIQYDVLDDLGQAYMLHGNEGEIRITPFEGKRIEVQLFRIASFPQIKKEDLYYLRSNIVEYTQ 154 (333) Q Consensus 75 qGi~RklL~~~TL~~G~~p~y~v~~~v~~a~~~~~~~G~i~~Q~i~~~ri~~P~f~Ivs~P~V~~~dl~~~~~~vle~~q 154 (333) +...|++....+......|+.... ...+.|++-.+++..+...-+.|++.-.++..++.|.-+=|..+..|+..+.. T Consensus 145 ~~~l~~~~~~~~~~~~~~p~~~~~---~~~a~~v~Eg~~~~~~~~~f~~v~l~~~k~~~~i~iS~ell~ds~~~l~~~i~ 221 (387) T protein:vir:94 145 KNQLREKARLTNIKGLEIPRVSYT---LDDDDFITDVETAKELKAKGDTVKFTTNKFKVFAAISDTVIHGSDVDLVNWVE 221 (387) T ss_pred hchhhhhceeeecCCceeeeeecc---CCccccccccccccccccccceeeechheeeeechhhHHHHhhhHHHHHHHHH Confidence 999999998888877777765532 22345788777788877777899999999999999999989999999999999 Q ss_pred HHHHHHHHHHhhhHHHHHHhhhhhhhhhhcccccccccCCCcceEEeeccccHHHHHHHHHHHHhhCCccceEEechhhh Q lcl|NC_011269. 155 DMTKQAIMRQEDSRLVTLLEAAAVSYRVVDSSAQPGVGALPNEITIAGSHLMPDDLYTAVTYTDQRQLDSSRLLANPQEY 234 (333) Q Consensus 155 ~~A~qaIM~qED~~~~slle~~a~~~r~~~ssA~p~vg~~~N~i~i~~g~Lt~~~L~~a~t~v~~~~L~at~il~~~~~~ 234 (333) +.-.++|.+-|+..++.-- . 
.+.+|......+.++-+.+.-+-++|..++.-+..--......+||+.-| T Consensus 222 ~~la~~~~~~e~~~~~~~g--~--------g~g~~~g~~~~~~~~~~~~~~~~d~i~~~~~~l~~~y~~na~~imn~~t~ 291 (387) T protein:vir:94 222 NALQSGLAAKERKDALAVS--P--------KSGLEHMSFYNGSVKEVEGADMYDAIINALADLHEDYRDNATIYMRYADY 291 (387) T ss_pred HHHHHHHHHHHHHhHhhcC--C--------CccccceeeeccccccccccchHHHHHHHHhccChhhhcCCEEEEechHH Confidence 9999999998776554222 1 22333222222334444555566777777765544333445678888877 Q ss_pred hhhhhcCCCchhhhHHhhhhhcceeeeeecccccceeeecCCeEEEeeChhhhcccccccCceeccccchhhhccceehh Q lcl|NC_011269. 235 RDLYRWDINTTGWAFKDSVVAGERIVQFGEFQIGKSIIIPRGTVYLTPEPEFLGVFPVMYSLDVEEDNKVERFNKGWVMD 314 (333) Q Consensus 235 ~Di~gw~~N~~~~~~~DpV~~~e~il~~G~fgi~~skvlprgeiyvvadpE~~G~~pvR~~L~s~p~D~~er~~kGWvm~ 314 (333) ..+..=--+..+ |+..+. -.-++|.+....=.-.. .++.|.... |=.+.++.......+..--.|++.. T Consensus 292 ~~~~~~~~~~~~-----~~~~~~---~~~llG~PV~~~~~~~~-~~~GDf~~~--~~~~~~~~~~~~~~~~~~~~~~~~~ 360 (387) T protein:vir:94 292 VKIISVLSNGTT-----NFFDTP---AEKVFGKPVVFTDAAVK-PIVGDFNYF--GINYDGTTYDTDKDVKKGEYLFVLT 360 (387) T ss_pred HHHHHHHhcCCC-----cccccC---CccccccceEEecCCCc-eeeechhhh--hhhhhhhhheecccccCCceEEEEE Confidence 766541111111 111111 01124433211100011 123333210 1122333333333333346789999 Q ss_pred hhhhhhhhccceEEEEecC Q lcl|NC_011269. 315 ELVGMAILNPRGIVILRKA 333 (333) Q Consensus 315 E~~g~~i~N~~siv~~~~~ 333 (333) +-++..+.||-++++|... T Consensus 361 ~r~Dg~v~~~~A~~~l~~k 379 (387) T protein:vir:94 361 AWYDQQRTLDSAFRIAKAK 379 (387) T ss_pred EEeCcEeechhheEEEEee Confidence 9999999999999999875 No 93 >protein:vir:2685 Length: 387 # NCBI annotation: hypothetical protein # Family: family:all:658 # MgeID: mge:57 # MgeName: phiSLT # Cross-refs: genbank:acc:NP_075504;genbank:gi:12719433;genbank:GeneID:920169 Probab=97.96 E-value=3.6e-07 Score=55.90 Aligned_cols=303 Identities=10% Similarity=0.025 Sum_probs=167.4 Q ss_pred Ccccchhhhhhhhhhc---ccchHHHHHHHHHHhhcchhcchHHHHHHHHHHhc---CchhHHHHHHHHHHHHHHHHHhh Q lcl|NC_011269. 
1 MTLPVAVGSGLGRFAK---ASDDYVADIVEAKQRMGGRKLSAREKQAKLAHILS---DKVGGIQRLGQSMIGPIQLQLRY 74 (333) Q Consensus 1 ~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~ls~ee~~~Lm~~Al~---~~Eg~~~aLg~~mA~pI~~q~~r 74 (333) +.-+-..+.....-.+ +..+|+..... +...-..........+++. +..|+. .+-+-+...|.+.++. T Consensus 71 ~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~-----~~~~~~~~~~~~~~~~a~~~~~~~~gG~-lIP~~~~~~Ii~~~~~ 144 (387) T protein:vir:26 71 VKDKGEAYQSLSDNEKMVKAKAEFYRHAIL-----PNEFEKPSMEAQRLLHALPTGNDSGGDK-LLPKTLSKEIVSEPFA 144 (387) T ss_pred hhhccccCCCCchhHHHHHHHHHHHHHHHh-----hhhHHHHHHHHHHHHhhhccCCCCCCce-eechhHHHHHHHHHHh Confidence 1111111111110000 00011111100 1010111111112233332 234442 5677788899999999 Q ss_pred hhhhhhhhhccccCCCcceeecCCCCccceEEEEcCCCcccceeecCceeeccceeeeccccccHHHhhhhcchhHHHHH Q lcl|NC_011269. 75 QGILRNVLLEDTLTPGVPIQYDVLDDLGQAYMLHGNEGEIRITPFEGKRIEVQLFRIASFPQIKKEDLYYLRSNIVEYTQ 154 (333) Q Consensus 75 qGi~RklL~~~TL~~G~~p~y~v~~~v~~a~~~~~~~G~i~~Q~i~~~ri~~P~f~Ivs~P~V~~~dl~~~~~~vle~~q 154 (333) +...|++....+......|+.... ...+.|++-.+++..+...-+.|++.-.++..++.|.-+=|..+..|+..+.. T Consensus 145 ~~~l~~~~~~~~~~~~~~p~~~~~---~~~a~~v~Eg~~~~~~~~~f~~v~l~~~k~~~~i~iS~ell~ds~~~l~~~i~ 221 (387) T protein:vir:26 145 KNQLREKARLTNIKGLEIPRVSYT---LDDDDFITDVETAKELKAKGDTVKFTTNKFKVFAAISDTVIHGSDVDLVNWVE 221 (387) T ss_pred hchhhhhceeeecCCceeeeeecc---CCccccccccccccccccccceeeechheeeeechhhHHHHhhhHHHHHHHHH Confidence 999999998888877777765532 22345788777788877777899999999999999999989999999999999 Q ss_pred HHHHHHHHHHhhhHHHHHHhhhhhhhhhhcccccccccCCCcceEEeeccccHHHHHHHHHHHHhhCCccceEEechhhh Q lcl|NC_011269. 155 DMTKQAIMRQEDSRLVTLLEAAAVSYRVVDSSAQPGVGALPNEITIAGSHLMPDDLYTAVTYTDQRQLDSSRLLANPQEY 234 (333) Q Consensus 155 ~~A~qaIM~qED~~~~slle~~a~~~r~~~ssA~p~vg~~~N~i~i~~g~Lt~~~L~~a~t~v~~~~L~at~il~~~~~~ 234 (333) +.-.++|.+-|+..++.-- . 
.+.+|......+.++-+.+.-+-++|..++.-+..--......+||+.-| T Consensus 222 ~~la~~~~~~e~~~~~~~g--~--------g~g~~~g~~~~~~~~~~~~~~~~d~i~~~~~~l~~~y~~na~~imn~~t~ 291 (387) T protein:vir:26 222 NALQSGLAAKERKDALAVS--P--------KSGLEHMSFYNGSVKEVEGADMYDAIINALADLHEDYRDNATIYMRYADY 291 (387) T ss_pred HHHHHHHHHHHHHhHhhcC--C--------CccccceeeeccccccccccchHHHHHHHHhccChhhhcCCEEEEechHH Confidence 9999999998776554222 1 22333222222334444555566777777765544333445678888877 Q ss_pred hhhhhcCCCchhhhHHhhhhhcceeeeeecccccceeeecCCeEEEeeChhhhcccccccCceeccccchhhhccceehh Q lcl|NC_011269. 235 RDLYRWDINTTGWAFKDSVVAGERIVQFGEFQIGKSIIIPRGTVYLTPEPEFLGVFPVMYSLDVEEDNKVERFNKGWVMD 314 (333) Q Consensus 235 ~Di~gw~~N~~~~~~~DpV~~~e~il~~G~fgi~~skvlprgeiyvvadpE~~G~~pvR~~L~s~p~D~~er~~kGWvm~ 314 (333) ..+..=--+..+ |+..+. -.-++|.+....=.-.. .++.|.... |=.+.++.......+..--.|++.. T Consensus 292 ~~~~~~~~~~~~-----~~~~~~---~~~llG~PV~~~~~~~~-~~~GDf~~~--~~~~~~~~~~~~~~~~~~~~~~~~~ 360 (387) T protein:vir:26 292 VKIISVLSNGTT-----NFFDTP---AEKVFGKPVVFTDAAVK-PIVGDFNYF--GINYDGTTYDTDKDVKKGEYLFVLT 360 (387) T ss_pred HHHHHHHhcCCC-----cccccC---CccccccceEEecCCCc-eeeechhhh--hhhhhhhhheecccccCCceEEEEE Confidence 766541111111 111111 01124433211100011 123333210 1122333333333333346789999 Q ss_pred hhhhhhhhccceEEEEecC Q lcl|NC_011269. 315 ELVGMAILNPRGIVILRKA 333 (333) Q Consensus 315 E~~g~~i~N~~siv~~~~~ 333 (333) +-++..+.||-++++|... T Consensus 361 ~r~Dg~v~~~~A~~~l~~k 379 (387) T protein:vir:26 361 AWYDQQRTLDSAFRIAKAK 379 (387) T ss_pred EEeCcEeechhheEEEEee Confidence 9999999999999999875 No 94 >protein:vir:95376 Length: 425 # NCBI annotation: phage major capsid protein # Family: family:all:635 # MgeID: mge:1567 # MgeName: GBSV1 # Cross-refs: genbank:acc:YP_764476;genbank:gi:115334630;genbank:GeneID:5179263 Probab=97.95 E-value=1e-06 Score=53.38 Aligned_cols=310 Identities=12% Similarity=0.100 Sum_probs=175.6 Q ss_pred Ccccchhhhhhhhhh-----cccchHHHHHHHHHHhhcchhcchHHH---HHHHHHHhcCchhHHHHHHHHHHHHHHHHH Q lcl|NC_011269. 
1 MTLPVAVGSGLGRFA-----KASDDYVADIVEAKQRMGGRKLSAREK---QAKLAHILSDKVGGIQRLGQSMIGPIQLQL 72 (333) Q Consensus 1 ~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~ls~ee~---~~Lm~~Al~~~Eg~~~aLg~~mA~pI~~q~ 72 (333) -.-+....+-.-++. ...+.-.....+...+ +...+..+.+ .+. ...-.+..|.. .+-+-+.+.|.+.+ T Consensus 86 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~-~~~~~~~~gg~-~vP~~~~~~Ii~~l 162 (425) T protein:vir:95 86 QINSKQPSNQSRQKMQGSKGDVVEMNRLQVREMLKT-GEYYKRSEVVEFYEKF-RNLRAVAGGEL-TIPEVVVNRIMDIM 162 (425) T ss_pred HhhhhccchhhhhhhhhhhhhHHHHHHHHHHHHHhh-hhhhhhhHHHHHHHHH-HhhcccccCce-eccHHHHHHHHHHH Confidence 000000000000000 0000000000001000 1111111111 111 11112233432 44567888899999 Q ss_pred hhhhhhhhhhhccccCCCcceeecCCCCccceEEEEcCCCcccceee-cCceeeccceeeeccccccHHHhhhhcchhHH Q lcl|NC_011269. 73 RYQGILRNVLLEDTLTPGVPIQYDVLDDLGQAYMLHGNEGEIRITPF-EGKRIEVQLFRIASFPQIKKEDLYYLRSNIVE 151 (333) Q Consensus 73 ~rqGi~RklL~~~TL~~G~~p~y~v~~~v~~a~~~~~~~G~i~~Q~i-~~~ri~~P~f~Ivs~P~V~~~dl~~~~~~vle 151 (333) +.....+++.+..++. |.. .+|+...... +.|+.-.++++.+.. .=+.|++.-.++..+..|..+=|+....|+.. T Consensus 163 ~~~~~i~~~~~~~~~~-g~~-~ip~~~~~~~-a~~v~E~~~~~~~~~~~f~~i~l~~~k~~~~~~iS~ell~ds~~~l~~ 239 (425) T protein:vir:95 163 GDYTTLYPLVDKIRVK-GTT-RILVDTDTSP-ATWIEQSGALPTGDVGTIASIDFDGFKVGKVTFVDNYLLQDSIINLDD 239 (425) T ss_pred HhhhhHHHhhceeecC-cee-EEEEecCCcc-ccccccccccccccccccceeeeeheeeeeeehhhHHHHhccHHHHHH Confidence 9999999998888764 544 5676555544 447787788877654 34789999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHhhhHHHHHHhhhhhhhhhhcccccccccCC-----CcceEEeeccccHHHHHHHHHHHHhhCC--cc Q lcl|NC_011269. 152 YTQDMTKQAIMRQEDSRLVTLLEAAAVSYRVVDSSAQPGVGAL-----PNEITIAGSHLMPDDLYTAVTYTDQRQL--DS 224 (333) Q Consensus 152 ~~q~~A~qaIM~qED~~~~slle~~a~~~r~~~ssA~p~vg~~-----~N~i~i~~g~Lt~~~L~~a~t~v~~~~L--~a 224 (333) +..++-.++|.+-+|.-+++ +--+. ..+| .|.+ .+.++..++..+-+++..++..+..-.. .. 
T Consensus 240 ~i~~~l~~~i~~~~d~~il~---G~G~~------~~~p-~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 309 (425) T protein:vir:95 240 YVTKKIARAIAKALDLAIVK---GTGAA------NKQP-LGIIPSLPPENQVTVEADNNLLKNLVKQIGLIDTGDDSVGE 309 (425) T ss_pred HHHHHHHHHHHHHHHHHhhc---cCCCC------cccc-ceeecccccccccccccccchHHHHHHHHHhhhhhccccCc Confidence 99999999999999964443 21100 0011 1112 1344556778888899988876655333 33 Q ss_pred ceEEechhh-hhhhhhcCC--CchhhhHHhhhhhcceeeeee-cccc--cceeeecCCeEEEeeChhhhcccccccCcee Q lcl|NC_011269. 225 SRLLANPQE-YRDLYRWDI--NTTGWAFKDSVVAGERIVQFG-EFQI--GKSIIIPRGTVYLTPEPEFLGVFPVMYSLDV 298 (333) Q Consensus 225 t~il~~~~~-~~Di~gw~~--N~~~~~~~DpV~~~e~il~~G-~fgi--~~skvlprgeiyvvadpE~~G~~pvR~~L~s 298 (333) ...+||+.- |+.+..=.. +..+ .|+.+.. ....+ ++|. ..+-.+|-++|++ .|... -.+-+|+++.+ T Consensus 310 ~~~v~~~~~~~~~l~~l~~~kd~~g----~~i~~~~-~~~~~~l~G~pvv~~~~~~~~~i~~-Gd~~~-~~~~~~~~~~i 382 (425) T protein:vir:95 310 IVAVMKRSTYYNRLVEFSIQVDSNG----NVVGKLP-NLRTPDLLGLRVVFNNFLDDDTVLF-GEFEQ-YTLVERENITI 382 (425) T ss_pred eEEEEeChHHHHHHHHHHhhcCCCC----ceeeccC-CCCCccccceeeEEcCcCCCccEEE-Eeccc-EEEEeecceEE Confidence 345778764 444432000 1111 1222211 01111 3443 3666789998765 67664 45567888877 Q ss_pred ccccc--hhhhccceehhhhhhhhhhccceEEEEecC Q lcl|NC_011269. 299 EEDNK--VERFNKGWVMDELVGMAILNPRGIVILRKA 333 (333) Q Consensus 299 ~p~D~--~er~~kGWvm~E~~g~~i~N~~siv~~~~~ 333 (333) .-.+. ..+-..++..++-++..+.+|.+++++.=. 
T Consensus 383 ~~~~~~~f~~~~~~~~~~~r~d~~~~~~~a~~~~~i~ 419 (425) T protein:vir:95 383 DSSTHVKFTEDQTAFRGKGRFDGKPVKPEAFVLVTIT 419 (425) T ss_pred EeecccccccCceEEEEEEeeCcEeecccceEEEEec Confidence 75442 223366789999999999999999999655 No 95 >protein:vir:105038 Length: 428 # NCBI annotation: major capsid head protein precursor # Family: family:all:21 # MgeID: mge:1465 # MgeName: phiKO2 # Cross-refs: genbank:acc:YP_006586;genbank:gi:46402092;genbank:GeneID:2777903 Probab=97.92 E-value=1.9e-06 Score=51.94 Aligned_cols=306 Identities=13% Similarity=0.109 Sum_probs=168.2 Q ss_pred Cc---------ccchhhhhhhhhhcccchHHHHHHHHHHhhcchhcchHHHHHHHHHHhcCchhHHHHHHHHHHHHHHHH Q lcl|NC_011269. 1 MT---------LPVAVGSGLGRFAKASDDYVADIVEAKQRMGGRKLSAREKQAKLAHILSDKVGGIQRLGQSMIGPIQLQ 71 (333) Q Consensus 1 ~~---------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ls~ee~~~Lm~~Al~~~Eg~~~aLg~~mA~pI~~q 71 (333) .. -...-|.+..|++++...+-.....+. ++..+....+.....+. -.+..|+. .+.+-+.+.|.+. T Consensus 74 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~--~~~~~gg~-liP~~~~~~ii~~ 149 (428) T protein:vir:10 74 QHGPAVIVKAEPKQYTGAGMTRMVMSIAAAQGNLQDAA-KFASDELNDQSVSMAIS--TAAGSGGV-LIPQNIHSEVIEL 149 (428) T ss_pred hhccccccccccchhhhHHHHHHHHHHHHhhhhHHHHH-HHhhhhhhhhhHhhhhc--ccccCCcc-ccchhHHHHHHHH Confidence 00 011122333333332211111111111 11111111111111111 11223442 4567778888888 Q ss_pred Hhhhhhhhhhh-hccccCCCcceeecCCCCccceEEEEcCCCcccceeecCceeeccceeeeccccccHHHhhhhcchhH Q lcl|NC_011269. 72 LRYQGILRNVL-LEDTLTPGVPIQYDVLDDLGQAYMLHGNEGEIRITPFEGKRIEVQLFRIASFPQIKKEDLYYLRSNIV 150 (333) Q Consensus 72 ~~rqGi~RklL-~~~TL~~G~~p~y~v~~~v~~a~~~~~~~G~i~~Q~i~~~ri~~P~f~Ivs~P~V~~~dl~~~~~~vl 150 (333) ++.+.++|++. ..-|.+.|. ..||+..... .+-|.+--+.+++....=+.|++.-.++.....|..+-|++...|+. 
T Consensus 150 l~~~~~l~~~~~~~~~~~~g~-~~~p~~~~~~-~a~~v~Eg~~~~~~~~~f~~i~~~~~k~~~~v~is~ell~ds~~~l~ 227 (428) T protein:vir:10 150 LRDRTIVRKLGARSIPLPNGN-MSLPRLAGGA-TASYTGENQDAKVSEARFDDVKLTAKTMIAMVPISNALIGRAGFNVE 227 (428) T ss_pred HhhhchhhhhcceeeecCCcc-eEEEEEeCCc-ceeeeccCccccccccceeeEEeeeEEEEEeehhhHHHHhhhhHHHH Confidence 88899999983 333444453 3455543332 34467777778877776678888899999999999999999999999 Q ss_pred HHHHHHHHHHHHHHhhhHHHHHHhhhhhhhhhhcccccccccCCCcceEEe--------eccccHHHHHHHHHHHHh--- Q lcl|NC_011269. 151 EYTQDMTKQAIMRQEDSRLVTLLEAAAVSYRVVDSSAQPGVGALPNEITIA--------GSHLMPDDLYTAVTYTDQ--- 219 (333) Q Consensus 151 e~~q~~A~qaIM~qED~~~~slle~~a~~~r~~~ssA~p~vg~~~N~i~i~--------~g~Lt~~~L~~a~t~v~~--- 219 (333) .+.+++..++|-+.+|.-++ .+-- ++-.|. |.+ |..+.. ....+-+.+......+.. T Consensus 228 ~~i~~~l~~ai~~~~d~~~l---~G~G-------~~~~p~-Gi~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 295 (428) T protein:vir:10 228 QLVLQDILTAISVREDKAFM---RDDG-------TGDTPI-GMK-ARATQWNRLLPWAADAAVNLDTIDTYLDSIILMSM 295 (428) T ss_pred HHHHHHHHHHHHHHHHHHHh---ccCC-------CCcccc-ccc-cccccccccccccccccccHHHHHHHHHHHHHhhh Confidence 99999999999999996553 3321 111232 333 222211 122333333333333221 Q ss_pred ---hCCccceEEechhhhhhhhhcCCCchh-hhHHhhhhhcceeeeeeccccc--ceeeecCCe-------EEEeeChhh Q lcl|NC_011269. 220 ---RQLDSSRLLANPQEYRDLYRWDINTTG-WAFKDSVVAGERIVQFGEFQIG--KSIIIPRGT-------VYLTPEPEF 286 (333) Q Consensus 220 ---~~L~at~il~~~~~~~Di~gw~~N~~~-~~~~DpV~~~e~il~~G~fgi~--~skvlprge-------iyvvadpE~ 286 (333) -.+.....+||+..|.-|..=- +..| |.+. |..+ .-++|.+ .+-.+|-+. 
.+++.|.- T Consensus 296 ~~~~~~~~~~~v~n~~~~~~L~~lk-d~~G~~i~~-~~~~------g~l~G~pv~~~~~~p~~~~~~~~~~~i~~gd~s- 366 (428) T protein:vir:10 296 DGNSNMISSGWGMSNRTYMKLFGLR-DGNGNKVYP-EMAQ------GMLKGYPIQRTSAIPANLGEGGKESEIYFADFN- 366 (428) T ss_pred ccccccccCEEEEcHHHHHHHHHhh-ccCCceecc-CCCC------CeeeceeeEEeccccccccCCCccceEEEEecc- Confidence 2233456689999988776511 1111 1111 1111 1245644 333455432 33445654 Q ss_pred hcccccccCceeccccchh-------------hhccceehhhhhhhhhhccceEEEEecC Q lcl|NC_011269. 287 LGVFPVMYSLDVEEDNKVE-------------RFNKGWVMDELVGMAILNPRGIVILRKA 333 (333) Q Consensus 287 ~G~~pvR~~L~s~p~D~~e-------------r~~kGWvm~E~~g~~i~N~~siv~~~~~ 333 (333) .-.+-.|+++.++..++.. +-...|-..+-+++++.+|-++|++... T Consensus 367 ~~~i~~~~~i~i~~~~~~~~~~~~~~~~~~f~~~~~~~R~~~r~d~~v~~p~a~~~~t~~ 426 (428) T protein:vir:10 367 DVVIGEDGNMKVDFSKEASYIDTDGKLVSAFSRNQSLIRVVTEHDIGFRHPEGLVLGTGV 426 (428) T ss_pred eEEEEEecceEEEeecccccccccccccchhhcchhheeeeeeeCceeeccceEEEEecc Confidence 3445678899888776532 2246677888899999999999999999 No 96 >protein:vir:101650 Length: 497 # NCBI annotation: gp13 # Family: family:all:585 # MgeID: mge:1515 # MgeName: 244 # Cross-refs: genbank:acc:YP_654768;genbank:gi:109302766;genbank:GeneID:4156084 Probab=97.88 E-value=1.7e-06 Score=52.20 Aligned_cols=322 Identities=16% Similarity=0.168 Sum_probs=161.8 Q ss_pred Ccccc----hhhhhhhhhhcccchHHH-HHHHHHHhhcchh-------------------cchHHHH--HHHHH-HhcCc Q lcl|NC_011269. 1 MTLPV----AVGSGLGRFAKASDDYVA-DIVEAKQRMGGRK-------------------LSAREKQ--AKLAH-ILSDK 53 (333) Q Consensus 1 ~~~~~----~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~-------------------ls~ee~~--~Lm~~-Al~~~ 53 (333) .-+.. ..-...++.....++... .-.+...+++..+ .+...+. +.-+. .-.+. 
T Consensus 78 ~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 157 (497) T protein:vir:10 78 PEVEVRNLKQIRKHLARAVIMNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTG 157 (497) T ss_pred HHHHhhhhhhHHHHHHHHHhhhHHHHhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHhhhhhhHHHHHhhhcccCc Confidence 00000 000000111100000000 0000011110000 0000000 00010 11233 Q ss_pred hhHHHHHHHHHHHHHHHHHhhhhhhhhhhhccccCCCcceeecCCCCccceEEEEcCCCcccceeecCceeeccceeeec Q lcl|NC_011269. 54 VGGIQRLGQSMIGPIQLQLRYQGILRNVLLEDTLTPGVPIQYDVLDDLGQAYMLHGNEGEIRITPFEGKRIEVQLFRIAS 133 (333) Q Consensus 54 Eg~~~aLg~~mA~pI~~q~~rqGi~RklL~~~TL~~G~~p~y~v~~~v~~a~~~~~~~G~i~~Q~i~~~ri~~P~f~Ivs 133 (333) .|+ -.+-..+...|-+.++.+...|+|....+...|.. .||+.......+-|.+-.+.++.....=+.|++.-..+.+ T Consensus 158 ~gg-~~vp~~~~~~ii~~~~~~~~i~~l~~~~~~~~~~~-~~~~~~~~~~~a~wv~E~~~~~~s~~~f~~i~~~~~k~a~ 235 (497) T protein:vir:10 158 TFA-PGILPTFLPGIVEQLFYELSLADLISSRPVTSPNL-SYLTESAAHNNAAAVAEAGTYPFSSEEFARVYEQVGKVAN 235 (497) T ss_pred ccc-cccchhhhHHHHHHHHhhhhHHhhccccccCCCce-EEEEEcCCCCcceeeccCcccccccccceeeEeeeeeeEe Confidence 444 25667778888889999999999998888887764 5665433333455788777777766666899999999999 Q ss_pred cccccHHHhhhhcchhHHHHHHHHHHHHHHHhhhHHHH---------HHhhhhhhh------------------hh-hcc Q lcl|NC_011269. 134 FPQIKKEDLYYLRSNIVEYTQDMTKQAIMRQEDSRLVT---------LLEAAAVSY------------------RV-VDS 185 (333) Q Consensus 134 ~P~V~~~dl~~~~~~vle~~q~~A~qaIM~qED~~~~s---------lle~~a~~~------------------r~-~~s 185 (333) .+.|..+=|+.. .++-.+..++..++|..-+|.-+++ ++..+...- .+ .+. 
T Consensus 236 ~~~iS~ell~d~-~~l~~~i~~~l~~~i~~~~d~~~l~G~G~~~p~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 314 (497) T protein:vir:10 236 ALTITDEGLRDA-PELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSATVSNVKFPADG 314 (497) T ss_pred ecHhHHHHHHhH-HHHHHHHHHHHHHHHHHHHHHHhhcCCCcccccccccccccccccccccchhhhhhhhhhhhhhccc Confidence 999988877654 5788999999999999999966543 111100000 00 000 Q ss_pred ccccccc----------CCCcc-----eEEeeccccH----HHHHHHHHHH-HhhCCccceEEechhhhhhhhhcCCCch Q lcl|NC_011269. 186 SAQPGVG----------ALPNE-----ITIAGSHLMP----DDLYTAVTYT-DQRQLDSSRLLANPQEYRDLYRWDINTT 245 (333) Q Consensus 186 sA~p~vg----------~~~N~-----i~i~~g~Lt~----~~L~~a~t~v-~~~~L~at~il~~~~~~~Di~gw~~N~~ 245 (333) .....++ +..|- -.+.++..+. .++..+...+ .......+.++||+.-|.-|+-.- +.. T Consensus 315 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vmn~~~~~~l~~lk-d~~ 393 (497) T protein:vir:10 315 TNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLRLTK-DAN 393 (497) T ss_pred ccchhhhhhHHHHHHHHHhhhhhhhhccchhccccchhhhhhHHHHHHhhhhhhcccCCCeEEEchHHHHHHHHhh-cCC Confidence 0000000 00000 0000000010 0111111111 222334456888888887765421 111 Q ss_pred h-hh--------HHhhhhhcceeeeeeccc--ccceeeecCCeEEEeeChhhhcc--cccccCceeccccc----hhhhc Q lcl|NC_011269. 246 G-WA--------FKDSVVAGERIVQFGEFQ--IGKSIIIPRGTVYLTPEPEFLGV--FPVMYSLDVEEDNK----VERFN 308 (333) Q Consensus 246 ~-~~--------~~DpV~~~e~il~~G~fg--i~~skvlprgeiyvvadpE~~G~--~pvR~~L~s~p~D~----~er~~ 308 (333) | |. .-+|+..... ++| +..+-.||.|++++ -|-- .+. +-+|+++.+.-.++ .++.. T Consensus 394 G~~i~~~~~~~~~~~~~~~~~~-----l~G~pV~~t~~~~~~~~~~-Gd~~-~~~~~i~~r~~~~v~~~~~~~~~f~~n~ 466 (497) T protein:vir:10 394 GQYMGGNFFGNAYGNPVNGGKN-----IWGVPVVTTPLIPLGTILV-GHFA-PSVIQTARREGVTMQMTNSNGTDFVDGK 466 (497) T ss_pred CceeccCcccccccccccCCce-----eeceeeEecCCCCCCceEE-eecc-cceEEEEEecccEEEeecccchhhhcCc Confidence 1 11 1112211111 234 33667789999765 4432 233 34689988776543 34446 Q ss_pred cceehhhhhhhhhhccceEEEEecC Q lcl|NC_011269. 
309 KGWVMDELVGMAILNPRGIVILRKA 333 (333) Q Consensus 309 kGWvm~E~~g~~i~N~~siv~~~~~ 333 (333) .+....+.++..|.+|-++|.+.-. T Consensus 467 v~~r~~~r~~~~v~~p~A~~~l~~~ 491 (497) T protein:vir:10 467 VTVRAEERLGLLVYRPSAFQLIQLK 491 (497) T ss_pred EEEEEEEeecceeeccccEEEEEec Confidence 7888889999999999999999765 No 97 >protein:vir:7855 Length: 497 # NCBI annotation: gp12 # Family: family:all:585 # MgeID: mge:150 # MgeName: CJW1 # Cross-refs: genbank:acc:NP_817462;genbank:gi:29565891;genbank:GeneID:1259081 Probab=97.88 E-value=1.7e-06 Score=52.20 Aligned_cols=322 Identities=16% Similarity=0.168 Sum_probs=161.8 Q ss_pred Ccccc----hhhhhhhhhhcccchHHH-HHHHHHHhhcchh-------------------cchHHHH--HHHHH-HhcCc Q lcl|NC_011269. 1 MTLPV----AVGSGLGRFAKASDDYVA-DIVEAKQRMGGRK-------------------LSAREKQ--AKLAH-ILSDK 53 (333) Q Consensus 1 ~~~~~----~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~-------------------ls~ee~~--~Lm~~-Al~~~ 53 (333) .-+.. ..-...++.....++... .-.+...+++..+ .+...+. +.-+. .-.+. T Consensus 78 ~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 157 (497) T protein:vir:78 78 PEVEVRNLKQIRKHLARAVIMNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQNPFGSTG 157 (497) T ss_pred HHHHhhhhhhHHHHHHHHHhhhHHHHhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHhhhhhhHHHHHhhhcccCc Confidence 00000 000000111100000000 0000011110000 0000000 00010 11233 Q ss_pred hhHHHHHHHHHHHHHHHHHhhhhhhhhhhhccccCCCcceeecCCCCccceEEEEcCCCcccceeecCceeeccceeeec Q lcl|NC_011269. 54 VGGIQRLGQSMIGPIQLQLRYQGILRNVLLEDTLTPGVPIQYDVLDDLGQAYMLHGNEGEIRITPFEGKRIEVQLFRIAS 133 (333) Q Consensus 54 Eg~~~aLg~~mA~pI~~q~~rqGi~RklL~~~TL~~G~~p~y~v~~~v~~a~~~~~~~G~i~~Q~i~~~ri~~P~f~Ivs 133 (333) .|+ -.+-..+...|-+.++.+...|+|....+...|.. 
.||+.......+-|.+-.+.++.....=+.|++.-..+.+ T Consensus 158 ~gg-~~vp~~~~~~ii~~~~~~~~i~~l~~~~~~~~~~~-~~~~~~~~~~~a~wv~E~~~~~~s~~~f~~i~~~~~k~a~ 235 (497) T protein:vir:78 158 TFA-PGILPTFLPGIVEQLFYELSLADLISSRPVTSPNL-SYLTESAAHNNAAAVAEAGTYPFSSEEFARVYEQVGKVAN 235 (497) T ss_pred ccc-cccchhhhHHHHHHHHhhhhHHhhccccccCCCce-EEEEEcCCCCcceeeccCcccccccccceeeEeeeeeeEe Confidence 444 25667778888889999999999998888887764 5665433333455788777777766666899999999999 Q ss_pred cccccHHHhhhhcchhHHHHHHHHHHHHHHHhhhHHHH---------HHhhhhhhh------------------hh-hcc Q lcl|NC_011269. 134 FPQIKKEDLYYLRSNIVEYTQDMTKQAIMRQEDSRLVT---------LLEAAAVSY------------------RV-VDS 185 (333) Q Consensus 134 ~P~V~~~dl~~~~~~vle~~q~~A~qaIM~qED~~~~s---------lle~~a~~~------------------r~-~~s 185 (333) .+.|..+=|+.. .++-.+..++..++|..-+|.-+++ ++..+...- .+ .+. T Consensus 236 ~~~iS~ell~d~-~~l~~~i~~~l~~~i~~~~d~~~l~G~G~~~p~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 314 (497) T protein:vir:78 236 ALTITDEGLRDA-PELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSATVSNVKFPADG 314 (497) T ss_pred ecHhHHHHHHhH-HHHHHHHHHHHHHHHHHHHHHHhhcCCCcccccccccccccccccccccchhhhhhhhhhhhhhccc Confidence 999988877654 5788999999999999999966543 111100000 00 000 Q ss_pred ccccccc----------CCCcc-----eEEeeccccH----HHHHHHHHHH-HhhCCccceEEechhhhhhhhhcCCCch Q lcl|NC_011269. 186 SAQPGVG----------ALPNE-----ITIAGSHLMP----DDLYTAVTYT-DQRQLDSSRLLANPQEYRDLYRWDINTT 245 (333) Q Consensus 186 sA~p~vg----------~~~N~-----i~i~~g~Lt~----~~L~~a~t~v-~~~~L~at~il~~~~~~~Di~gw~~N~~ 245 (333) .....++ +..|- -.+.++..+. .++..+...+ .......+.++||+.-|.-|+-.- +.. 
T Consensus 315 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vmn~~~~~~l~~lk-d~~ 393 (497) T protein:vir:78 315 TNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMNPRDWELLRLTK-DAN 393 (497) T ss_pred ccchhhhhhHHHHHHHHHhhhhhhhhccchhccccchhhhhhHHHHHHhhhhhhcccCCCeEEEchHHHHHHHHhh-cCC Confidence 0000000 00000 0000000010 0111111111 222334456888888887765421 111 Q ss_pred h-hh--------HHhhhhhcceeeeeeccc--ccceeeecCCeEEEeeChhhhcc--cccccCceeccccc----hhhhc Q lcl|NC_011269. 246 G-WA--------FKDSVVAGERIVQFGEFQ--IGKSIIIPRGTVYLTPEPEFLGV--FPVMYSLDVEEDNK----VERFN 308 (333) Q Consensus 246 ~-~~--------~~DpV~~~e~il~~G~fg--i~~skvlprgeiyvvadpE~~G~--~pvR~~L~s~p~D~----~er~~ 308 (333) | |. .-+|+..... ++| +..+-.||.|++++ -|-- .+. +-+|+++.+.-.++ .++.. T Consensus 394 G~~i~~~~~~~~~~~~~~~~~~-----l~G~pV~~t~~~~~~~~~~-Gd~~-~~~~~i~~r~~~~v~~~~~~~~~f~~n~ 466 (497) T protein:vir:78 394 GQYMGGNFFGNAYGNPVNGGKN-----IWGVPVVTTPLIPLGTILV-GHFA-PSVIQTARREGVTMQMTNSNGTDFVDGK 466 (497) T ss_pred CceeccCcccccccccccCCce-----eeceeeEecCCCCCCceEE-eecc-cceEEEEEecccEEEeecccchhhhcCc Confidence 1 11 1112211111 234 33667789999765 4432 233 34689988776543 34446 Q ss_pred cceehhhhhhhhhhccceEEEEecC Q lcl|NC_011269. 309 KGWVMDELVGMAILNPRGIVILRKA 333 (333) Q Consensus 309 kGWvm~E~~g~~i~N~~siv~~~~~ 333 (333) .+....+.++..|.+|-++|.+.-. T Consensus 467 v~~r~~~r~~~~v~~p~A~~~l~~~ 491 (497) T protein:vir:78 467 VTVRAEERLGLLVYRPSAFQLIQLK 491 (497) T ss_pred EEEEEEEeecceeeccccEEEEEec Confidence 7888889999999999999999765 No 98 >protein:vir:9361 Length: 402 # NCBI annotation: SLT orf 37-like protein # Family: family:all:658 # MgeID: mge:166 # MgeName: phi 12 # Cross-refs: genbank:acc:NP_803339;genbank:gi:29028650;genbank:GeneID:1258088 Probab=97.86 E-value=1.1e-06 Score=53.35 Aligned_cols=303 Identities=11% Similarity=0.052 Sum_probs=163.2 Q ss_pred Ccccch-hhhhhhhhhcccchHHHHHHH-HHH-hhcchhcchHHHHHHHHHHhc---CchhHHHHHHHHHHHHHHHHHhh Q lcl|NC_011269. 
1 MTLPVA-VGSGLGRFAKASDDYVADIVE-AKQ-RMGGRKLSAREKQAKLAHILS---DKVGGIQRLGQSMIGPIQLQLRY 74 (333) Q Consensus 1 ~~~~~~-~~~~~~~~~~~~~~~~~~~~~-~~~-~~~~~~ls~ee~~~Lm~~Al~---~~Eg~~~aLg~~mA~pI~~q~~r 74 (333) .-.++. .+.....-.+.... +....+ .+. ..+...-.......-...+++ +..|+. .+-+-+...|-+.++. T Consensus 82 ~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~r~~~~~~~~~~~~~~~~~~~~a~~~~t~~~GG~-lIP~~~~~~Ii~~~~~ 159 (402) T protein:vir:93 82 EKAKVKDKGEAYQSLSDNEKM-VKAKAEFYRHAILPNEFEKPSMEAQRLLHALPTGNDSGGDK-LLPKTLSKEIVSEPFA 159 (402) T ss_pred HHhhhhhccccCCCCchhHHH-HHHHHHHHHHHHhhhhHHHHHHhHHHHHhhhccCCCcCCcc-ccchhHHHHHHHhHHh Confidence 100000 00000110111111 000000 000 011111111111122233443 344553 6788889999999999 Q ss_pred hhhhhhhhhccccCCCcceeecCCCCccceEEEEcCCCcccceeecCceeeccceeeeccccccHHHhhhhcchhHHHHH Q lcl|NC_011269. 75 QGILRNVLLEDTLTPGVPIQYDVLDDLGQAYMLHGNEGEIRITPFEGKRIEVQLFRIASFPQIKKEDLYYLRSNIVEYTQ 154 (333) Q Consensus 75 qGi~RklL~~~TL~~G~~p~y~v~~~v~~a~~~~~~~G~i~~Q~i~~~ri~~P~f~Ivs~P~V~~~dl~~~~~~vle~~q 154 (333) +...|++.+..+......|+.... ...+-|++-.+++..+...-+.|++.-.++..++.|..+=|..+.+|+..+.. T Consensus 160 ~~~l~~~~~v~~~~~~~~p~~~~~---~~~a~~v~Eg~~~~~~~~~f~~i~~~~~k~~~~i~iS~ell~Ds~~~l~~~i~ 236 (402) T protein:vir:93 160 KNQLREKARLTNIKGLEIPRVSYT---LDDDDFITDVETAKELKAKGDTVKFTTNKFKVFAAISDTVIHGSDVDLVNWVE 236 (402) T ss_pred hhhhhhhceeeecCCceeeeeecc---CCccccccccccccccccccceeeecceeeeeechhhHHHHhhhHHHHHHHHH Confidence 999999998888777677765432 22345777777777776777889999999999999999999999999999999 Q ss_pred HHHHHHHHHHhhhHHHHHHhhhhhhhhhhcccccccccCCCcceEEeeccccHHHHHHHHHHHHhhCCccceEEechhhh Q lcl|NC_011269. 155 DMTKQAIMRQEDSRLVTLLEAAAVSYRVVDSSAQPGVGALPNEITIAGSHLMPDDLYTAVTYTDQRQLDSSRLLANPQEY 234 (333) Q Consensus 155 ~~A~qaIM~qED~~~~slle~~a~~~r~~~ssA~p~vg~~~N~i~i~~g~Lt~~~L~~a~t~v~~~~L~at~il~~~~~~ 234 (333) +.-.++|.+-|+..+|.--. 
.+.+|.-....+.++-+.+.-+-++|-.++.-+..--......+||+.-| T Consensus 237 ~~la~~~~~~e~~~~~~~g~----------g~g~p~g~~~~~~~~~~~~~~~~d~l~~~~~~l~~~y~~na~~imn~~t~ 306 (402) T protein:vir:93 237 NALQSGLAAKERKDALAVSP----------KSGLEHMSFYNGSVKEVEGADMYDAIINALADLHEDYRDNATIYMRYADY 306 (402) T ss_pred HHHHHHHHHHHHHhHhhcCC----------CccccceeeeccccccccccchHHHHHHHHhccChhhhcCCEEEEechHH Confidence 99999999987765543221 12233221222334444444455677776654443323344578888877 Q ss_pred hhhhhcCCCchhhhHHhhhhhcceeeeeecccccceeeecCCeEEEeeChhhhccccc----ccCceeccccchhhhccc Q lcl|NC_011269. 235 RDLYRWDINTTGWAFKDSVVAGERIVQFGEFQIGKSIIIPRGTVYLTPEPEFLGVFPV----MYSLDVEEDNKVERFNKG 310 (333) Q Consensus 235 ~Di~gw~~N~~~~~~~DpV~~~e~il~~G~fgi~~skvlprgeiyvvadpE~~G~~pv----R~~L~s~p~D~~er~~kG 310 (333) ..++.=--+..+ |+.++. ..-++|.+....=.-..+ + .|-|+- +.++.......+..-..+ T Consensus 307 ~~~~~~~~d~~~-----~~~~~~---~~~llG~PV~~t~~~~~i-~------~GDf~~~~~~~~~~~~~~~~~~~~~~~~ 371 (402) T protein:vir:93 307 VKIISVLSNGTT-----NFFDTP---AEKVFGKPVVFTDAAVKP-I------VGDFNYFGINYDGTTYDTDKDVKKGEYL 371 (402) T ss_pred HHHHHHHhcCCC-----cccccC---CccccccceEEecCCCce-e------eechhhhhhhhhhhhhhhhhcccCCceE Confidence 666541001111 111110 011344432111001111 2 233332 222222222222223678 Q ss_pred eehhhhhhhhhhccceEEEEecC Q lcl|NC_011269. 311 WVMDELVGMAILNPRGIVILRKA 333 (333) Q Consensus 311 Wvm~E~~g~~i~N~~siv~~~~~ 333 (333) ++..+-++..+.||-+|++|... T Consensus 372 ~~~~~r~Dg~v~~~~A~~~l~ik 394 (402) T protein:vir:93 372 FVLTAWYDQQRTLDSAFRIAKAK 394 (402) T ss_pred EEEEEEeCcEEechhheEEEEee Confidence 99999999999999999999875 No 99 >protein:vir:80376 Length: 435 # NCBI annotation: gp6, major capsid head protein # Family: family:all:21 # MgeID: mge:1881 # MgeName: phi644-2 # Cross-refs: genbank:acc:YP_001111085;genbank:gi:134288639;genbank:GeneID:4960624 Probab=97.85 E-value=2.3e-06 Score=51.48 Aligned_cols=309 Identities=17% Similarity=0.155 Sum_probs=166.1 Q ss_pred Ccccchhhhhhhhhhcccch---------H---HHHHHHHHHhhcch----------hcchHHHHHHHHHHhcCchhHHH Q lcl|NC_011269. 
1 MTLPVAVGSGLGRFAKASDD---------Y---VADIVEAKQRMGGR----------KLSAREKQAKLAHILSDKVGGIQ 58 (333) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~---------~---~~~~~~~~~~~~~~----------~ls~ee~~~Lm~~Al~~~Eg~~~ 58 (333) +..++.-..+.+........ . .+..+.+..+-.|. .-..+++...+. ...+..|+. T Consensus 65 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~gg~- 142 (435) T protein:vir:80 65 AAVPVDPNPAAVTASAAAPVYAQPKAPEVKGAKMARMVRALAAARGDAQLASKLAIERGFGEEVAMSLN-TLSPGAGGV- 142 (435) T ss_pred hcccccchhhhhccccccccccccchhhhhHHHHHHHHHHHHhccchhHHHHHHHHhhhhhhhhhhhhc-ccCCCCCcc- Confidence 11121111111111100000 0 00111111111110 001111111111 122333432 Q ss_pred HHHHHHHHHHHHHHhhhhhhhhhh-hccccCCCcceeecCCCCccceEEEEcCCCcccceeecCceeeccceeeeccccc Q lcl|NC_011269. 59 RLGQSMIGPIQLQLRYQGILRNVL-LEDTLTPGVPIQYDVLDDLGQAYMLHGNEGEIRITPFEGKRIEVQLFRIASFPQI 137 (333) Q Consensus 59 aLg~~mA~pI~~q~~rqGi~RklL-~~~TL~~G~~p~y~v~~~v~~a~~~~~~~G~i~~Q~i~~~ri~~P~f~Ivs~P~V 137 (333) .+-.-+.+.|-+.++.+.+.+++. ...|...|. ..||+......+ -|.+..++++.....=+.|++.-.++...+.| T Consensus 143 lvP~~~~~~ii~~l~~~~~i~~~~~~~v~~~~~~-~~~p~~~~~~~a-~~v~E~~~~~~~~~~f~~i~~~~~k~~~~~~i 220 (435) T protein:vir:80 143 LVPENLSSEVIELLRPKSVVRKLGARTLPLSNGN-ITIPRLKGGAIV-GYIGADTDIPTTQQQFDDLKLTAKKMAALVPI 220 (435) T ss_pred ccchhHHHHHHHHHhhhchhhhccceeeecCCCc-eEEEEEeCCcce-eeeccCccccccccceeeEEEeeEEEEEeehh Confidence 345666777888888888999873 344444453 456665444444 46777777877776667888888999999999 Q ss_pred cHHHhhhhc--chhHHHHHHHHHHHHHHHhhhHHHHHHhhhhhhhhhhcccccccccCC-----CcceEEeeccc---cH Q lcl|NC_011269. 138 KKEDLYYLR--SNIVEYTQDMTKQAIMRQEDSRLVTLLEAAAVSYRVVDSSAQPGVGAL-----PNEITIAGSHL---MP 207 (333) Q Consensus 138 ~~~dl~~~~--~~vle~~q~~A~qaIM~qED~~~~slle~~a~~~r~~~ssA~p~vg~~-----~N~i~i~~g~L---t~ 207 (333) ..+-|.... .++-.+..++..++|-+.+|.-++ ++.- ++-+|. |-+ .|..+.+++.. +. 
T Consensus 221 s~ell~ds~~~~~l~~~i~~~l~~a~~~~~d~a~l---~G~G-------~~~~p~-Gi~~~~~~~~~~~~~~~~~~~~~~ 289 (435) T protein:vir:80 221 ANDLIKYAGVNPNVDQIVVGDLTAAIGAREDKAFI---RDDG-------TANTPK-GLRFWALPGNVITASDGSTLQKIE 289 (435) T ss_pred hHHHHHhhcccHHHHHHHHHHHHHHHHHHHHHHhh---ccCC-------CCCccc-ceeecccccceeecccccchhhHH Confidence 988888874 468889999999999999995443 3321 111121 111 12222222221 23 Q ss_pred HHHHHHHHHHHhh--CCccceEEechhhhhhhhhcCCCchhhhHHhhhhhcceeeeeeccccc--ceeeecC-------- Q lcl|NC_011269. 208 DDLYTAVTYTDQR--QLDSSRLLANPQEYRDLYRWDINTTGWAFKDSVVAGERIVQFGEFQIG--KSIIIPR-------- 275 (333) Q Consensus 208 ~~L~~a~t~v~~~--~L~at~il~~~~~~~Di~gw~~N~~~~~~~DpV~~~e~il~~G~fgi~--~skvlpr-------- 275 (333) .++.+++.....- .......+||+..|..|..=--+.-.|.+ |-.+... ++|.+ .+-.+|- T Consensus 290 ~d~~~~~~~~~~~~~~~~~~~~vmn~~~~~~L~~lkd~~G~~l~--~~~~~~~-----l~G~pv~~~~~~p~~~~~~~~~ 362 (435) T protein:vir:80 290 TDLGKAILALENADANLTQPGWIMAPRTFRFLEGLRDGNGNKVY--PELANGM-----LKGYPVGKTTQVPINLGEAGKE 362 (435) T ss_pred HHHHHHHHHhhccccccccCEEEEcHHHHHHHHhhhccCCceec--cCCCCCe-----EeeeeeEEeccccccccCCCCc Confidence 4566666655443 33455689999999888762111111221 1111111 24433 2333443 Q ss_pred CeEEEeeChhhhcccccccCceeccccchh-------------hhccceehhhhhhhhhhccceEEEEecC Q lcl|NC_011269. 276 GTVYLTPEPEFLGVFPVMYSLDVEEDNKVE-------------RFNKGWVMDELVGMAILNPRGIVILRKA 333 (333) Q Consensus 276 geiyvvadpE~~G~~pvR~~L~s~p~D~~e-------------r~~kGWvm~E~~g~~i~N~~siv~~~~~ 333 (333) +.|| +.|... -.+-+|+++.++..++.. 
+-..+|...+-+++++.+|.++|++..+ T Consensus 363 ~~i~-~gd~s~-~~i~~~~~~~i~~~~~~~~~~~~~~~~~~f~~n~~~~r~~~r~d~~~~~~~a~~~l~~~ 431 (435) T protein:vir:80 363 SEIY-FTDFGD-VFIGEEETLEIDYSKEATYKDADGHMVSAFQRDQTLIRVIAKNDFGPRHVESIAVLSGV 431 (435) T ss_pred ceEE-EEEccc-EEEEeecceEEEEeccccccccccchhhhhhcCcceeeeeeeeCcEeecccceEEEecc Confidence 2343 456654 445689999998877542 2256777899999999999999999998 No 100 >protein:vir:93881 Length: 387 # NCBI annotation: ORF011 # Family: family:all:658 # MgeID: mge:1485 # MgeName: 3A # Cross-refs: genbank:acc:YP_239938;genbank:gi:66395599;genbank:GeneID:5130947 Probab=97.81 E-value=1.4e-06 Score=52.67 Aligned_cols=302 Identities=10% Similarity=0.035 Sum_probs=160.9 Q ss_pred Ccccchhhhhhh---hhhcccchHHHHHHHHHHhhcchhcchHHHHHHHHHHhcC---chhHHHHHHHHHHHHHHHHHhh Q lcl|NC_011269. 1 MTLPVAVGSGLG---RFAKASDDYVADIVEAKQRMGGRKLSAREKQAKLAHILSD---KVGGIQRLGQSMIGPIQLQLRY 74 (333) Q Consensus 1 ~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~ls~ee~~~Lm~~Al~~---~Eg~~~aLg~~mA~pI~~q~~r 74 (333) +.-..-...+-. +-.++...|+......++ .-+..........+|.. ..|+. .+-+.+...|-+.++. T Consensus 71 ~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~-----~~~~~~~~~~~~~al~~~t~s~gG~-~IP~~~~~~Ii~~~~~ 144 (387) T protein:vir:93 71 VKDTGEAYQSLNDHEKMVKAKAEFYRHAILPNE-----FEKPSMEAQRLLHALPTGNDSGGDK-LLPKTLSKEIVSEPFA 144 (387) T ss_pred hhhccccCCCcchhhHHHHHHHHHHHHHhhhhh-----hhhhhhhhHHHHHhhccCcCCCCce-eechhHHHHHHHHHHh Confidence 000000000000 111111122222211111 11111111222233332 33442 5678888999999999 Q ss_pred hhhhhhhhhccccCCCcceeecCCCCccceEEEEcCCCcccceeecCceeeccceeeeccccccHHHhhhhcchhHHHHH Q lcl|NC_011269. 75 QGILRNVLLEDTLTPGVPIQYDVLDDLGQAYMLHGNEGEIRITPFEGKRIEVQLFRIASFPQIKKEDLYYLRSNIVEYTQ 154 (333) Q Consensus 75 qGi~RklL~~~TL~~G~~p~y~v~~~v~~a~~~~~~~G~i~~Q~i~~~ri~~P~f~Ivs~P~V~~~dl~~~~~~vle~~q 154 (333) +...|++.+..+......|+.... ...+-|++..+++......-+.|++.-.++..++.|..+-|..+..|+..+.. 
T Consensus 145 ~~~l~~~~~v~~~~~~~~p~~~~~---~~~a~~v~E~~~~~~~~~~f~~v~~~~~k~~~~~~iS~ell~Ds~~~l~~~i~ 221 (387) T protein:vir:93 145 KNQLREKARLTNIKGLEIPRVSYT---LDDDDFITDVETAKELKLKGDTVKFTTNKFKVFAAISDTVIHGSDVDLVNWVE 221 (387) T ss_pred hchhhhheeeeecCCceEEEEeec---CCccccccCcccccccccccceeeeeheeeeeechhhHHHHhhhHHHHHHHHH Confidence 999999988887776666664422 22345788777788887777889999999999999999989989999999999 Q ss_pred HHHHHHHHHHhhhHHHHHHhhhhhhhhhhcccccccccCCCc-ceEEeeccccHHHHHHHHHHHHhhCCccceEEechhh Q lcl|NC_011269. 155 DMTKQAIMRQEDSRLVTLLEAAAVSYRVVDSSAQPGVGALPN-EITIAGSHLMPDDLYTAVTYTDQRQLDSSRLLANPQE 233 (333) Q Consensus 155 ~~A~qaIM~qED~~~~slle~~a~~~r~~~ssA~p~vg~~~N-~i~i~~g~Lt~~~L~~a~t~v~~~~L~at~il~~~~~ 233 (333) +...++|.+-|+..+|.--. .+.+|. |.+.| .++-+.+.-+-++|-.++.-+..--......+||+.- T Consensus 222 ~~la~~~~~~e~~~~~~~g~----------g~g~p~-g~l~~~~~~~v~~~~~~d~i~~~~~~l~~~~~~~a~~~mn~~t 290 (387) T protein:vir:93 222 NALQSGLAAKERKDALAVSP----------KSGLDH-MSFYNGSVKEVEGADMYDAIINALADLHEDYRDNATIYMRYAD 290 (387) T ss_pred HHHHHHHHHHHHHhHhhcCC----------Cccccc-eeeeccccccccccchHHHHHHHHhccChhhhcCCEEEEechH Confidence 99999999988765553221 112221 11111 2222334444566666655444333344567888776 Q ss_pred hhhhhhcCCCchhhhHHhhhhhcceeeeeecccccceeeecCCeEEEeeChhhhcccccccCceeccccchhhhccceeh Q lcl|NC_011269. 234 YRDLYRWDINTTGWAFKDSVVAGERIVQFGEFQIGKSIIIPRGTVYLTPEPEFLGVFPVMYSLDVEEDNKVERFNKGWVM 313 (333) Q Consensus 234 ~~Di~gw~~N~~~~~~~DpV~~~e~il~~G~fgi~~skvlprgeiyvvadpE~~G~~pvR~~L~s~p~D~~er~~kGWvm 313 (333) |..++.=--|..++ +-.+. ..-++|.+--..=.-..+ ++.|.... |=.+.++...+....+.-..|++. 
T Consensus 291 ~~~~~~~~~d~~~~-----~~~~~---~~~llG~PV~~~~~~~~~-~~GDf~~~--~~~~~~~~~~~~~~~~~~~~~~~~ 359 (387) T protein:vir:93 291 YVKIISVLSNGTTN-----FFDTP---AEKVFGKPVVFTDAAVKP-IVGDFNYF--GINYDGTTYDTDKDVKKGEYLFVL 359 (387) T ss_pred HHHHHHHHhcCCCc-----ccccC---CccccccceEEecCCCce-eeeehhhh--heehhhheeeecccccCCceeEEE Confidence 65544300011111 00000 001233331110000111 22333211 112333333333344444668888 Q ss_pred hhhhhhhhhccceEEEEecC Q lcl|NC_011269. 314 DELVGMAILNPRGIVILRKA 333 (333) Q Consensus 314 ~E~~g~~i~N~~siv~~~~~ 333 (333) .+-++..+.||-+++++... T Consensus 360 ~~r~d~~v~~~eA~~~l~~k 379 (387) T protein:vir:93 360 TAWYDQQRTLDSAFRIAKAK 379 (387) T ss_pred EeeeCceeechhheEEEEee Confidence 99999999999999999765 No 101 >protein:vir:1084 Length: 437 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:21 # MgeName: bIL309 # Cross-refs: genbank:acc:NP_076738;genbank:gi:13095848;genbank:GeneID:920418 Probab=97.81 E-value=9.1e-07 Score=53.71 Aligned_cols=302 Identities=11% Similarity=0.036 Sum_probs=160.9 Q ss_pred Ccccchhhhhhhhh-hcccchH---HHH--------HHHHHHhhcchhcchHHHHHHHH----------HHhcCchhHHH Q lcl|NC_011269. 1 MTLPVAVGSGLGRF-AKASDDY---VAD--------IVEAKQRMGGRKLSAREKQAKLA----------HILSDKVGGIQ 58 (333) Q Consensus 1 ~~~~~~~~~~~~~~-~~~~~~~---~~~--------~~~~~~~~~~~~ls~ee~~~Lm~----------~Al~~~Eg~~~ 58 (333) ....--. ..+.+- .+..+.. ..+ ..+.+... ......+++.++.. ....+..|+. T Consensus 90 ~~e~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~g~- 166 (437) T protein:vir:10 90 DNEEDDP-EKLKTETKSEAEKDKKTVKDEEKRDAGGLQDMKLKV-GGEIADKKVTAFADYLKTGEVRDVTGIALKDGKV- 166 (437) T ss_pred HHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHhHHHHhHHHHHH-HHHHHHhhhhhhHHHHHhhhhhhhhhcccccccc- Confidence 0000000 000000 0000000 000 00000000 01111111111111 1122333331 Q ss_pred HHHHHHHHHHHHHHhhhhhhhhhhhccccCCCcceeecCCCCccceEEEEcCCCccccee-ecCceeeccceeeeccccc Q lcl|NC_011269. 
59 RLGQSMIGPIQLQLRYQGILRNVLLEDTLTPGVPIQYDVLDDLGQAYMLHGNEGEIRITP-FEGKRIEVQLFRIASFPQI 137 (333) Q Consensus 59 aLg~~mA~pI~~q~~rqGi~RklL~~~TL~~G~~p~y~v~~~v~~a~~~~~~~G~i~~Q~-i~~~ri~~P~f~Ivs~P~V 137 (333) .+-.-+...|.+ ++.....|.+....+.+.|. ..||+.++.+..+-|.+-.|.++..- ..=+.|++.-..+...+.| T Consensus 167 lvp~~~~~~i~~-~~~~~~l~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~e~~~~~e~~~~~~~~v~~~~~k~~~~~~i 244 (437) T protein:vir:10 167 IIPETILTPEKE-VHQFPRLGSLVRTESVTTTT-GKLPIFNNSTDLLTAHTEYGQTTKNATPVITPILWDLKTYTGGYVF 244 (437) T ss_pred cchHHHHHHHHH-hhhhhhhhhcceeEeeccCc-eeeEEeeccccccccccccccccccccccceeeeeehhheeeehhh Confidence 333455566654 46666777777777777665 34666666555555666666665422 2225788888889999999 Q ss_pred cHHHhhhhcchhHHHHHHHHHHHHHHHhhhHHHHHHhhhhhhhhhhcccccccccCCCcceEEeeccccHHHHHHHHHH- Q lcl|NC_011269. 138 KKEDLYYLRSNIVEYTQDMTKQAIMRQEDSRLVTLLEAAAVSYRVVDSSAQPGVGALPNEITIAGSHLMPDDLYTAVTY- 216 (333) Q Consensus 138 ~~~dl~~~~~~vle~~q~~A~qaIM~qED~~~~slle~~a~~~r~~~ssA~p~vg~~~N~i~i~~g~Lt~~~L~~a~t~- 216 (333) ..+-|.....|+..+..+...++|..-+|.-+++-+-+ +.|++ .+..+-++|..++.. T Consensus 245 s~ell~ds~~~~~~~i~~~l~~~~~~~~~~~i~~g~g~-----------~~~~~----------~~~~~~~~~~~~~~~~ 303 (437) T protein:vir:10 245 SQELISDSSYDWQAELQSRLIELRDNTDDSLIITALTD-----------GIKKT----------TSTYLLGDLKKVLNVT 303 (437) T ss_pred hHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhhhcc-----------ccccc----------ccccchhhHHHHHHhh Confidence 99999998899999999999999999999877765522 22222 122233455555432 Q ss_pred HHhhCCccceEEechhhhhhhhhcCCCchhhhHHhhhhhcceeeee--ecccccc----eeeecCCe----EEEeeChhh Q lcl|NC_011269. 217 TDQRQLDSSRLLANPQEYRDLYRWDINTTGWAFKDSVVAGERIVQF--GEFQIGK----SIIIPRGT----VYLTPEPEF 286 (333) Q Consensus 217 v~~~~L~at~il~~~~~~~Di~gw~~N~~~~~~~DpV~~~e~il~~--G~fgi~~----skvlprge----iyvvadpE~ 286 (333) +..---.....+||+..|.-|+.=- +..| .|+-+.++--.. =++|.+. +..+|-++ ..++.|--. 
T Consensus 304 l~~~~~~~~~~~~~~~~~~~l~~lk-d~~g----~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~gd~~~ 378 (437) T protein:vir:10 304 LKPQDSAAASIVMSQSAYNLFDMAT-DAMG----RPLLQPNVTAATGYTLLGKTVVIVDDKLFPSASAGDVNIVVAPLKK 378 (437) T ss_pred hhhhhhcCCEEEEcHHHHHHHHHhh-ccCC----CeeeccCccCCCCcccccceeEEecccccCCcCCCceEEEEeeccc Confidence 2222223446899999998887621 1222 233222211001 1466553 33456543 244666543 Q ss_pred hcccccccCceeccccchhhhccceehhhhhhhhhhccceEEEEe-cC Q lcl|NC_011269. 287 LGVFPVMYSLDVEEDNKVERFNKGWVMDELVGMAILNPRGIVILR-KA 333 (333) Q Consensus 287 ~G~~pvR~~L~s~p~D~~er~~kGWvm~E~~g~~i~N~~siv~~~-~~ 333 (333) .-.+-+|.++.++..+..+-+.++..+.+-++.++.+|.++|+|. |. T Consensus 379 ~~~~~~r~~~~~~~~~~~~~~~~~~~~~~r~d~~~~~~~a~~~l~~~~ 426 (437) T protein:vir:10 379 AVINFKLTEITGQFQDTYDIWYKQLGIFLRQNVVQASKDLIVNLTGKL 426 (437) T ss_pred cEEEEeeeceEEEEecccccccceeeEEEEEccEEecccceEEEEeec Confidence 334566889988776655556677777788999999999999986 22 No 102 >protein:vir:108211 Length: 318 # NCBI annotation: gp9 # Family: family:all:6420 # MgeID: mge:2004 # MgeName: Giles # Cross-refs: genbank:acc:YP_001552338;genbank:gi:160700658;genbank:GeneID:5758931 Probab=97.77 E-value=2.3e-06 Score=51.53 Aligned_cols=283 Identities=15% Similarity=0.105 Sum_probs=164.4 Q ss_pred CcccchhhhhhhhhhcccchHHHHHHHHHHhhcchhcchHHHHHHHHHHhcCchhHHHHHHHHHHHHHHHHHhhhhhhhh Q lcl|NC_011269. 
1 MTLPVAVGSGLGRFAKASDDYVADIVEAKQRMGGRKLSAREKQAKLAHILSDKVGGIQRLGQSMIGPIQLQLRYQGILRN 80 (333) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ls~ee~~~Lm~~Al~~~Eg~~~aLg~~mA~pI~~q~~rqGi~Rk 80 (333) ||-|-.+-| +.| |++|+ +++.|.+|+= | -..|.+-++-+=|+-+ T Consensus 1 ~~~~~~i~s-------~~~--------------~~~it-------v~~ll~~P~~-I-------~~~i~e~~~~~~iad~ 44 (318) T protein:vir:10 1 MTAPTGIVS-------VSD--------------GPAIT-------VRELVGNPLW-I-------PTALKKMMVNQFISES 44 (318) T ss_pred CCCCCccee-------eec--------------CCcee-------hHHhhCCchh-H-------HHHHHHHHhccchhhh Confidence 998843332 111 23555 5566777772 2 2233444455557777 Q ss_pred hhhccccCCCcceeecC-----CCCccceEEEEcCCCcccceeecCceeeccceeee----ccccccHHHhhhhcchhHH Q lcl|NC_011269. 81 VLLEDTLTPGVPIQYDV-----LDDLGQAYMLHGNEGEIRITPFEGKRIEVQLFRIA----SFPQIKKEDLYYLRSNIVE 151 (333) Q Consensus 81 lL~~~TL~~G~~p~y~v-----~~~v~~a~~~~~~~G~i~~Q~i~~~ri~~P~f~Iv----s~P~V~~~dl~~~~~~vle 151 (333) |+.+-..+.+....|-. +.+..+.. ..-|+++ +.+.....|....+ ---.|..|..+....+.++ T Consensus 45 lf~~~~a~~~~~v~f~~~~p~~~~~d~e~V---aEggEiP---~~~~~~G~~~ia~~~K~G~~~~vS~Em~~~n~~~~v~ 118 (318) T protein:vir:10 45 LFRNGGANPNGVVAYNEGNPSFLEDDVADV---AEFGEIP---VSAGARGLPRTAFAVKKALGVRVSKEMIDENRVGAVN 118 (318) T ss_pred hhhcccccccceeEEEecccccccCcHhhc---cCccccc---ccCCCCCchhhhhhehhccceeccHHHHhhcChhHHH Confidence 88877776666666632 11111111 1222222 11112222222111 1234667788889999999 Q ss_pred HHHHHHHHHHHHHhhhHHHHHHhhhhhhhhh----hcccccccccCCCcceEEeeccccHHHHHHHHHHHHhhCCccceE Q lcl|NC_011269. 
152 YTQDMTKQAIMRQEDSRLVTLLEAAAVSYRV----VDSSAQPGVGALPNEITIAGSHLMPDDLYTAVTYTDQRQLDSSRL 227 (333) Q Consensus 152 ~~q~~A~qaIM~qED~~~~slle~~a~~~r~----~~ssA~p~vg~~~N~i~i~~g~Lt~~~L~~a~t~v~~~~L~at~i 227 (333) +.-+.+.-+|-+..|+..+..|+++.+...- |+.+..+..+-+ ++++.+++..-..+.-+.-..=++.+...+.| T Consensus 119 r~~~~l~Nti~r~~d~~a~dal~sa~t~~~~~s~~w~~~~~~~~d~~-~A~e~v~~a~~~~~~a~~~~~~~~~GY~pdtI 197 (318) T protein:vir:10 119 DQMLQLRNTFIRANDRSAKALLQSPIVPTLAVPTAWDNGGKVRTDIA-IAIEQISTAAPTAYPAGVGSSDEYFGFIPDTI 197 (318) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhccccccccCCcCCCCcccccccch-hhhhhhhhhhhhhhhhhhhhhhhccCccceee Confidence 9999999999999999999999888754431 222222222222 33333333222111111112224778889999 Q ss_pred EechhhhhhhhhcCCCchhhhH-----Hhhhhhcceeeeeecc-------cccceeeecCCeEEEeeChhhhcccccccC Q lcl|NC_011269. 228 LANPQEYRDLYRWDINTTGWAF-----KDSVVAGERIVQFGEF-------QIGKSIIIPRGTVYLTPEPEFLGVFPVMYS 295 (333) Q Consensus 228 l~~~~~~~Di~gw~~N~~~~~~-----~DpV~~~e~il~~G~f-------gi~~skvlprgeiyvvadpE~~G~~pvR~~ 295 (333) +|++..|.=+.+ |... .- -+|+-.+ +-.+|.| .+.-+.-+|+|++||+- --++|-|.+-.. T Consensus 198 VlhP~~~~~l~~---n~~~-~~~y~~~a~~~~~~--~~~tg~~~g~~lGl~vi~s~~~p~~~alvlq-~g~vG~~~d~~p 270 (318) T protein:vir:10 198 VMHYALLPILMD---NENF-MKVYERNANYVSTA--PDWTGNFPGSVMGLNVIRSRTFPIDRVLIME-RGTVGFYSDTRP 270 (318) T ss_pred EECHHHHHHHhc---chhh-hhhhhccchhhhhc--ccccccccceeeceEEeecCccCCCeeEEEe-cCCcceeecccc Confidence 999999999987 5331 11 1222211 1124554 35578899999999986 467999988888 Q ss_pred ceeccccc----h-hhhcccee--hhhhhhhhhhccceEEEEecC Q lcl|NC_011269. 296 LDVEEDNK----V-ERFNKGWV--MDELVGMAILNPRGIVILRKA 333 (333) Q Consensus 296 L~s~p~D~----~-er~~kGWv--m~E~~g~~i~N~~siv~~~~~ 333 (333) |.+++.-. . ..-+.-|. 
+.+--.++|.+|.+|+-|.-- T Consensus 271 l~~t~~~~egg~~~g~~~~s~~~~~~~~~~~~V~~PkA~~~itgi 315 (318) T protein:vir:10 271 LQFTALYPEGNGPNGGPTESYRADASHKRALAVDQPKAALWLTGI 315 (318) T ss_pred ceeeecccCCCCCCCCcchhhheehheeeeeeeeCcceeEEEeec Confidence 87766541 0 00123366 556778999999999988777 No 103 >protein:vir:78739 Length: 332 # NCBI annotation: major capsid protein # Family: family:all:975 # MgeID: mge:1856 # MgeName: Syn5 # Cross-refs: genbank:acc:YP_001285448;genbank:gi:148724482;genbank:GeneID:5220210 Probab=97.75 E-value=1.4e-06 Score=52.75 Aligned_cols=285 Identities=15% Similarity=0.158 Sum_probs=159.6 Q ss_pred CcccchhhhhhhhhhcccchHHHHHHHHHHhhcchhcchHHHHHHHHHHhcCchhHHHHHHHHHHHHHHHHHhhhhhhhh Q lcl|NC_011269. 1 MTLPVAVGSGLGRFAKASDDYVADIVEAKQRMGGRKLSAREKQAKLAHILSDKVGGIQRLGQSMIGPIQLQLRYQGILRN 80 (333) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ls~ee~~~Lm~~Al~~~Eg~~~aLg~~mA~pI~~q~~rqGi~Rk 80 (333) |+.|=-. |-|+..-|...+.+|.=+.++. -+-.+..|+-+.+. T Consensus 7 ~~~~~~~-----------------------~~~~~~~~~d~~~al~le~~~g--------------eV~~~f~~~s~~~~ 49 (332) T protein:vir:78 7 FSLPNQA-----------------------NGGARNADYDVRYATALKLFSG--------------EVFTAFNNASIFKG 49 (332) T ss_pred ccCCccc-----------------------cCCccccccccchhhhhhhhhh--------------hHHHHHHHHhhhhh Confidence 3333211 2223333333332222222222 12334456666778 Q ss_pred hhhccccCCCcceeecCCCCccceEEEEcCCCccccee-ecC--ceeeccceeeeccccccHHHhhhhcchhHHHHHHHH Q lcl|NC_011269. 81 VLLEDTLTPGVPIQYDVLDDLGQAYMLHGNEGEIRITP-FEG--KRIEVQLFRIASFPQIKKEDLYYLRSNIVEYTQDMT 157 (333) Q Consensus 81 lL~~~TL~~G~~p~y~v~~~v~~a~~~~~~~G~i~~Q~-i~~--~ri~~P~f~Ivs~P~V~~~dl~~~~~~vle~~q~~A 157 (333) +....|+..|.--+++...++..++ +++...+..+. +.. 
+.|.+-+....++ .|+--|=-|...|+..+.-+++ T Consensus 50 ~~~~r~i~~G~tv~i~~ig~~~~~~--~~~g~~l~~~~~~~~~~~~l~ID~~ky~~~-~VddiD~~q~~~dl~~~~~~~~ 126 (332) T protein:vir:78 50 LVRSYDLRGGKSKQFMFTGKLSAGY--HTPGTPIVGDAGIKANEKTLVMDDLLVSSQ-FVYSLDEIFSQYSTRAEVSKQI 126 (332) T ss_pred ccccccccccceEEEEeccceeEee--ecCCCCCCCCCCCCCceEEEEEehhhhhHH-HHHhHHHHhcCcchHHHHHHHH Confidence 8888889999999999999998888 44444444432 332 3344444444443 5566677788899999999999 Q ss_pred HHHHHHHhhhHHHHHHhhhhhhhhhhcccccccccCC-CcceEEeec-cccHHHH----HHHHHHHHhhCCccc--eEEe Q lcl|NC_011269. 158 KQAIMRQEDSRLVTLLEAAAVSYRVVDSSAQPGVGAL-PNEITIAGS-HLMPDDL----YTAVTYTDQRQLDSS--RLLA 229 (333) Q Consensus 158 ~qaIM~qED~~~~slle~~a~~~r~~~ssA~p~vg~~-~N~i~i~~g-~Lt~~~L----~~a~t~v~~~~L~at--~il~ 229 (333) -.|+-++.|..++.++-.++ +++-|..|.- -..+.+.++ ..++.++ ..|.+..++.+.|.. ++|+ T Consensus 127 g~aLA~~~D~~i~~~l~~aa-------~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~i~~a~~~Lde~~VP~~gR~~vv 199 (332) T protein:vir:78 127 GEALATHYDERIARVLAKAS-------AEASPVTGEPGGFHVNIGAGNTNDAQAIVDGFFEAAAVLDERSAPQEGRVAVL 199 (332) T ss_pred HHHHHHHHHHHHHHHHHhhh-------cccCcccccccccccccCCccccCHHHHHHHHHHHHHHHhhcCCCccCCEEEe Confidence 99999999999999886666 2222222110 133444544 4445444 456688889999866 4888 Q ss_pred chhhhhhhhhc-C---CCchhhhHHhhhhhcceeeeeecccccceeeecCCeEE--Eee-Ch----hhhccccccc---- Q lcl|NC_011269. 230 NPQEYRDLYRW-D---INTTGWAFKDSVVAGERIVQFGEFQIGKSIIIPRGTVY--LTP-EP----EFLGVFPVMY---- 294 (333) Q Consensus 230 ~~~~~~Di~gw-~---~N~~~~~~~DpV~~~e~il~~G~fgi~~skvlprgeiy--vva-dp----E~~G~~pvR~---- 294 (333) +|+.|..|.-= + .|.++-..-+.+.++..+.+.-=|.|..|--+|.+..- ... 
.+ .+.|.|+.+- T Consensus 200 ~P~~y~~Ll~~~d~~~~n~~~~~~~~~~~~g~~i~~i~G~~V~~Sn~lp~~~g~~~~~~~~~~~~n~~~~~~~~~~~~~~ 279 (332) T protein:vir:78 200 SPRQYYSLISSVDTNILNREIGNSQGDMNSGKGLYSIAGIRILKSNNLAGLYGQDLSSAAVTGENNDYQVDASALAGLIF 279 (332) T ss_pred CHHHHHHHHhhcCceeeeeeccccccceecceeeeEEeeeEEEecCccccCcccccccccccccccccccccccceEEee Confidence 99999999750 0 01111111233444443444444667788888865431 110 00 0111111111 Q ss_pred -----------Ccee--ccccchhhhccceehhhh--hhhhhhccceEEEEecC Q lcl|NC_011269. 295 -----------SLDV--EEDNKVERFNKGWVMDEL--VGMAILNPRGIVILRKA 333 (333) Q Consensus 295 -----------~L~s--~p~D~~er~~kGWvm~E~--~g~~i~N~~siv~~~~~ 333 (333) ++++ ++.+..++ ..+|.|.-. .|-.+.||.++|.++.| T Consensus 280 h~~a~~~v~~~~~~~~~t~~~~~~~-~~~d~i~~~~~~G~~v~rPe~~v~l~~a 332 (332) T protein:vir:78 280 HREAAGCIQSVAPTIQTTSGDFNVQ-YQGDLIVGKLAMGCGSLRTSVAGSFQAA 332 (332) T ss_pred cccceeeeeeeccchhhhhcccchh-hhHhhhhhhhhhcCceecccceEEEeeC Confidence 1122 12221111 335666543 56678999999999999 No 104 >protein:vir:81160 Length: 371 # NCBI annotation: major capsid protein # Family: family:all:21 # MgeID: mge:1892 # MgeName: Geobacillus virus E2 # Cross-refs: genbank:acc:YP_001285811;genbank:gi:148747732;genbank:GeneID:5247203 Probab=97.62 E-value=2.6e-06 Score=51.17 Aligned_cols=300 Identities=12% Similarity=0.075 Sum_probs=163.8 Q ss_pred Cccc------------chhhhhh--hhhhcccchHHHHHHHHHHhhc---------------------chhcchHHHHHH Q lcl|NC_011269. 1 MTLP------------VAVGSGL--GRFAKASDDYVADIVEAKQRMG---------------------GRKLSAREKQAK 45 (333) Q Consensus 1 ~~~~------------~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~---------------------~~~ls~ee~~~L 45 (333) |+== =-.-+-+ ..- +.-++..++|-..+.++- +.+-..+++.+. 
T Consensus 1 M~k~l~~l~e~~~~~~~e~~~~~~~~~~-e~~~~~~~ei~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 79 (371) T protein:vir:81 1 MPKELRELLEQINNKKEEARKLLAENKI-EEAKKLKEEIVALQEKFDVAKELYEEQKQTIEDKEPLKPTVQVKENEVEAF 79 (371) T ss_pred CcHHHHHHHHHHHHHHHHHHHHhhHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccchhhHHHHHHHH Confidence 0000 0000000 000 000000000000000000 000001111111 Q ss_pred H-------HHHhc---CchhHHHHHHHHHHHHHHHHHhhhhhhhhhhhccccCCCcc-eeecCCCCccceEEEEcCCCcc Q lcl|NC_011269. 46 L-------AHILS---DKVGGIQRLGQSMIGPIQLQLRYQGILRNVLLEDTLTPGVP-IQYDVLDDLGQAYMLHGNEGEI 114 (333) Q Consensus 46 m-------~~Al~---~~Eg~~~aLg~~mA~pI~~q~~rqGi~RklL~~~TL~~G~~-p~y~v~~~v~~a~~~~~~~G~i 114 (333) + .++++ +..|+. .+-.-+...|...++.....+++.+..+++.+.. ..+++.... ..+.|++.-+++ T Consensus 80 ~~~l~~~~~~a~~~~t~~~gg~-~vP~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~~~~~~~~~~~~-~~a~~v~Eg~~~ 157 (371) T protein:vir:81 80 VNHIRTRFRNAMSEGSNQDGGY-TVPQDIQTRINELRESKDALQNLITVEPVTTLSGSRVFKKRSQQ-TGFVEVAEGAAI 157 (371) T ss_pred HHHHHHHHHHhhccCCCccCce-eecHhHHHHHHHHHHhhhhhhhhceeeeccCCceeEEEEeecCC-cceeeecccccc Confidence 1 12222 222332 3555677888899999999999998888876541 122222222 234466666666 Q ss_pred ccee-ecCceeeccceeeeccccccHHHhhhhcchhHHHHHHHHHHHHHHHhhhHHHHHHhhhhhhhhhhcccccccccC Q lcl|NC_011269. 115 RITP-FEGKRIEVQLFRIASFPQIKKEDLYYLRSNIVEYTQDMTKQAIMRQEDSRLVTLLEAAAVSYRVVDSSAQPGVGA 193 (333) Q Consensus 115 ~~Q~-i~~~ri~~P~f~Ivs~P~V~~~dl~~~~~~vle~~q~~A~qaIM~qED~~~~slle~~a~~~r~~~ssA~p~vg~ 193 (333) +++- ..=..|++....+.....|..+=|.....++..+..++..++|-+-+|..+++...+. 
.| T Consensus 158 ~~~~~~~f~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~a~~~~~~~~i~~g~g~~-----------~~---- 222 (371) T protein:vir:81 158 GEKATPQFTLLQYQVKKYAGFFRVTNELLNDSTEAIVNTLVRWIGDESRVTRNGLIINVLNTK-----------AK---- 222 (371) T ss_pred ccccccceeeEEeeeeEEEEeehhhHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHhhcccc-----------cc---- Confidence 6542 3337889999999999999999999999999999999999999999997766654211 11 Q ss_pred CCcceEEeeccccHHHHHHHHH-HHHhhCCccceEEechhhhhhhhhcCCCchhhhHHhhhhhcceeeeee----ccccc Q lcl|NC_011269. 194 LPNEITIAGSHLMPDDLYTAVT-YTDQRQLDSSRLLANPQEYRDLYRWDINTTGWAFKDSVVAGERIVQFG----EFQIG 268 (333) Q Consensus 194 ~~N~i~i~~g~Lt~~~L~~a~t-~v~~~~L~at~il~~~~~~~Di~gw~~N~~~~~~~DpV~~~e~il~~G----~fgi~ 268 (333) .|-.+-+++..+.. .+...-.....++||+..|..|+.=- +..+ .|+-+... +.| ++|.+ T Consensus 223 --------~~~~~~~~i~~~~~~~l~~~~~~~a~~vmn~~~~~~L~~lk-d~~g----~~l~~~~~--~~~~~~~l~G~p 287 (371) T protein:vir:81 223 --------TAIADLDGLKQIINVQLDPVFRSTSSVIVNQDAFNWLDTLK-DQNG----QYLLQPSI--SSPTGRQLLGLP 287 (371) T ss_pred --------cccccHHHHHHHHHhhcchhhhcCCEEEEcHHHHHHHHHhh-ccCC----Ceeeeccc--CCCCCceeccee Confidence 12345556655543 23333345567899999999887511 1111 12211111 111 23432 Q ss_pred --ceeeecCCeE-----------EEeeChhhhcccccccCceeccccchh----hhccceehhhhhhhhhhccceEEEEe Q lcl|NC_011269. 269 --KSIIIPRGTV-----------YLTPEPEFLGVFPVMYSLDVEEDNKVE----RFNKGWVMDELVGMAILNPRGIVILR 331 (333) Q Consensus 269 --~skvlprgei-----------yvvadpE~~G~~pvR~~L~s~p~D~~e----r~~kGWvm~E~~g~~i~N~~siv~~~ 331 (333) .+-.+|.|.. +++.|.-.+-..-+|.++.++-.++.. +-..+|.+++-++..+.||.+++.+. T Consensus 288 V~~~~~~~~~~~~~~~~~~~~~~i~~Gd~~~~~~~~~~~~~~i~~~~~~~~~f~~~~v~~~~~~r~d~~~~~~~a~~~~~ 367 (371) T protein:vir:81 288 VVIVSNKVLANRVDGGTGAQFAPIIVGDLKEAVVMFDRQRTEIMSSNVAMDAFETDATLWRAIERMDVKMRDDEAFVFGE 367 (371) T ss_pred EEEecccccCccccccccCCcceEEEEehhceEEEEeecceEEEEeccccchhhcCceEEEEEEeeccEEecccceEEEE Confidence 3334554432 334444333345568898887766543 23568999999999999999999999 Q ss_pred cC Q lcl|NC_011269. 
332 KA 333 (333) Q Consensus 332 ~~ 333 (333) .+ T Consensus 368 ~~ 369 (371) T protein:vir:81 368 VQ 369 (371) T ss_pred Ee Confidence 88 No 105 >protein:vir:1433 Length: 435 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:30 # MgeName: phiE125 # Cross-refs: genbank:acc:NP_536362;genbank:gi:17975167;genbank:GeneID:929171 Probab=97.61 E-value=8e-06 Score=48.52 Aligned_cols=305 Identities=16% Similarity=0.170 Sum_probs=161.7 Q ss_pred Ccccchh-----------------------hhhhhhhhcccchHHHHHHHHHHhhcchhcchHHHHHHHHHHh---cCch Q lcl|NC_011269. 1 MTLPVAV-----------------------GSGLGRFAKASDDYVADIVEAKQRMGGRKLSAREKQAKLAHIL---SDKV 54 (333) Q Consensus 1 ~~~~~~~-----------------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ls~ee~~~Lm~~Al---~~~E 54 (333) +.-++.- +...++|.++--.--.+...+. +...+.-.. ...+.++ .+.. T Consensus 65 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~----~~~~~~~~~~t~~~ 139 (435) T protein:vir:14 65 AAVPVDPNPTAVAAPAAAPVHAQPKALEVKGAKMARMVRALAAARGDAQLAS-KLAIERGFG----EEVAMSLNTLSPGA 139 (435) T ss_pred hcccccchhhhhhhccccccccccchhhhhHHHHHHHHHHHHhhcchhhHHH-HHHHhhhhh----hhhhhhcccCCcCC Confidence 1111110 1111222211000000000000 000000001 1122222 3344 Q ss_pred hHHHHHHHHHHHHHHHHHhhhhhhhhhhh-ccccCCCcceeecCCCCccceEEEEcCCCcccceeecCceeeccceeeec Q lcl|NC_011269. 55 GGIQRLGQSMIGPIQLQLRYQGILRNVLL-EDTLTPGVPIQYDVLDDLGQAYMLHGNEGEIRITPFEGKRIEVQLFRIAS 133 (333) Q Consensus 55 g~~~aLg~~mA~pI~~q~~rqGi~RklL~-~~TL~~G~~p~y~v~~~v~~a~~~~~~~G~i~~Q~i~~~ri~~P~f~Ivs 133 (333) |+. .+-+.+.+.|-+.++.+.+.+++.. ..|...|. ..||+......+ .|.+..|.++.....=..|++.-.++.. 
T Consensus 140 gg~-~vP~~~~~~ii~~l~~~~~i~~~~~~~~~~~~~~-~~~p~~~~~~~a-~~v~E~~~~~~~~~~f~~i~~~~~k~~~ 216 (435) T protein:vir:14 140 GGV-LVPENLSSEVIELLRPKSVVRKLGARTLPLSNGN-ITIPRLKGGAIV-GYIGADTDIPTTQQQFDDLKLTAKKMAA 216 (435) T ss_pred Ccc-ccchhHHHHHHHHHhhhchhhhhcceeeecCCCc-eEEEEEeCCcce-eeeccCccccccccceeEEEeeeEEEEE Confidence 442 4566777888888888888888733 44444453 456665444444 4677777777665555678888889999 Q ss_pred cccccHHHhhhhcc--hhHHHHHHHHHHHHHHHhhhHHHHHHhhhhhhhhhhcccccccccCC----CcceEEeecc--- Q lcl|NC_011269. 134 FPQIKKEDLYYLRS--NIVEYTQDMTKQAIMRQEDSRLVTLLEAAAVSYRVVDSSAQPGVGAL----PNEITIAGSH--- 204 (333) Q Consensus 134 ~P~V~~~dl~~~~~--~vle~~q~~A~qaIM~qED~~~~slle~~a~~~r~~~ssA~p~vg~~----~N~i~i~~g~--- 204 (333) .+.|..+-|..... ++-.+..++..++|-+.+|.-++ .+.- .+-+| .|.+ ++.+.-..+. T Consensus 217 ~~~iS~ell~ds~~~~~l~~~i~~~l~~ai~~~~d~a~l---~G~G-------~~~~p-~Gi~~~~~~~~~~~~~~~~~~ 285 (435) T protein:vir:14 217 LVPIANDLIKYAGVNPNVDQIVVGDLTAAIGAREDKAFI---RDDG-------TANTP-KGLRFWALPSNVITASDASTL 285 (435) T ss_pred eehhhHHHHHhhccCHHHHHHHHHHHHHHHHHHHHHHhh---ccCC-------CCccc-cceeecccccceeccccccch Confidence 98888888887754 47778899999999999996554 2211 00111 1111 1111111111 Q ss_pred -ccHHHHHHHHHHHHhh--CCccceEEechhhhhhhhhcCCCchhhhHHhhhhhcceeeeeeccccc--ceeeecC---- Q lcl|NC_011269. 205 -LMPDDLYTAVTYTDQR--QLDSSRLLANPQEYRDLYRWDINTTGWAFKDSVVAGERIVQFGEFQIG--KSIIIPR---- 275 (333) Q Consensus 205 -Lt~~~L~~a~t~v~~~--~L~at~il~~~~~~~Di~gw~~N~~~~~~~DpV~~~e~il~~G~fgi~--~skvlpr---- 275 (333) ....++.+++..+... ++.....+||+..|..|+.=--..-.|.+ |-.+.. 
-++|.+ .+-.+|- T Consensus 286 ~~~~~~~~~l~~~~~~~~~~~~~~~~v~n~~~~~~L~~lkd~~G~~l~--~~~~~g-----~l~G~Pv~~~~~~p~~~~~ 358 (435) T protein:vir:14 286 QKIETDLGKVILALENADANLTQPGWIMAPRTFRFLEGLRDGNGNKVY--PELANG-----MLKGYPVGKTTQVPINLGE 358 (435) T ss_pred hhHHHHHHHHHHHhhhccccccCCEEEEcHHHHHHHHHhhccCCceec--cCCCCC-----eeecceeEeeccccccccC Confidence 2234566666666654 33455689999999888761111111111 111111 124433 2333433 Q ss_pred ----CeEEEeeChhhhcccccccCceeccccch-------------hhhccceehhhhhhhhhhccceEEEEecC Q lcl|NC_011269. 276 ----GTVYLTPEPEFLGVFPVMYSLDVEEDNKV-------------ERFNKGWVMDELVGMAILNPRGIVILRKA 333 (333) Q Consensus 276 ----geiyvvadpE~~G~~pvR~~L~s~p~D~~-------------er~~kGWvm~E~~g~~i~N~~siv~~~~~ 333 (333) +.|| +.|... ..+-+|+++.++-.++. .+--..+-..+-+++++.+|-+++.|..+ T Consensus 359 ~~~~~~i~-~gd~s~-~~i~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~~~~~~a~~~l~~~ 431 (435) T protein:vir:14 359 TGKESEIY-FTDFGD-VFIGEEETLEIDYSKEATYKDADGHMVSAFQRDQTLIRVIAKNDFGPRHVESIAVLAGV 431 (435) T ss_pred CCccceEE-Eeeccc-EEEEEecccEEEEeccccccccccchhhhhhcChhheeeeeeeCceeecccceEEEecC Confidence 3444 355542 34668899988766643 22246777889999999999999999998 No 106 >protein:vir:78523 Length: 338 # NCBI annotation: Putative head structural protein # Family: family:all:507 # MgeID: mge:1853 # MgeName: U2 # Cross-refs: genbank:acc:YP_001491585;genbank:gi:157786408;genbank:GeneID:5625675 Probab=97.53 E-value=6.9e-06 Score=48.88 Aligned_cols=285 Identities=12% Similarity=0.055 Sum_probs=161.0 Q ss_pred HHHHHHHHHhhcchhcchHHHHHHHHHHhcCchhHH-----HHHHHHHHHHHHHHHhhhhhhhhhhhccccCCCcceeec Q lcl|NC_011269. 22 VADIVEAKQRMGGRKLSAREKQAKLAHILSDKVGGI-----QRLGQSMIGPIQLQLRYQGILRNVLLEDTLTPGVPIQYD 96 (333) Q Consensus 22 ~~~~~~~~~~~~~~~ls~ee~~~Lm~~Al~~~Eg~~-----~aLg~~mA~pI~~q~~rqGi~RklL~~~TL~~G~~p~y~ 96 (333) +|-+-|.+-..- -++.+|+. --+-+.+++.|-+.++....++++....+++.|. 
..|| T Consensus 1 ~~~~~e~~~~~~----------------~~~~~~~~~~~~~~liP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~-~~ip 63 (338) T protein:vir:78 1 MATLNELAPNTA----------------GSNHQGRLAHVPSDLLPKEIVGPIFDKAQESSLVLRLGENIPISYGE-TIIP 63 (338) T ss_pred CcchHHhhhhhc----------------ccccccceecccccccchHHHHHHHHHHHhhchhhhhcceeeccCCc-eEEE Confidence 222222221111 12223210 1467788999999999999999999998887653 2333 Q ss_pred CCCCccceEE-------EEcCCCcccceeecCceeeccceeeeccccccHHHhhhhcchhHHHHHHHHHHHHHHHhhhHH Q lcl|NC_011269. 97 VLDDLGQAYM-------LHGNEGEIRITPFEGKRIEVQLFRIASFPQIKKEDLYYLRSNIVEYTQDMTKQAIMRQEDSRL 169 (333) Q Consensus 97 v~~~v~~a~~-------~~~~~G~i~~Q~i~~~ri~~P~f~Ivs~P~V~~~dl~~~~~~vle~~q~~A~qaIM~qED~~~ 169 (333) +......++. |.+.-+++......=+.|++-..++..++.|..+-|+....++..+..++..++|-+.+|.-+ T Consensus 64 ~~~~~~~a~~v~~~~~~~~~Eg~~~~~~~~~f~~v~l~~~k~~~~~~is~ell~ds~~~~~~~i~~~la~a~~~~~d~~~ 143 (338) T protein:vir:78 64 TTVKRPEVGQVGVGTSNEQREGGTKPLSGTAWDTRSVAPIKLATIVTVSEEFARMNPSGLYTKLQADLAYAIGRGIDLAV 143 (338) T ss_pred EEecCccceeecccccccccccccccccccceeEEEEEEEEEEEeehhhHHHHhcCHHHHHHHHHHHHHHHHHHHHHHHh Confidence 3222222221 222334455554444677787888999999999999999999999999999999999999665 Q ss_pred HHHHhhh-hhhhhhhcccccccccCCCcceEE----eeccccHHHHHHHHHHH-HhhCCccceEEechhhhhhhhh---- Q lcl|NC_011269. 170 VTLLEAA-AVSYRVVDSSAQPGVGALPNEITI----AGSHLMPDDLYTAVTYT-DQRQLDSSRLLANPQEYRDLYR---- 239 (333) Q Consensus 170 ~slle~~-a~~~r~~~ssA~p~vg~~~N~i~i----~~g~Lt~~~L~~a~t~v-~~~~L~at~il~~~~~~~Di~g---- 239 (333) ++=-.+. ... ...+.+.....+..+. ++...+-++|..+...+ ....+..+-++||++.|..+.. 
T Consensus 144 l~G~g~~~~~~-----~~gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~m~~~~~~~L~~~~~l 218 (338) T protein:vir:78 144 FHGKSPLTGSA-----LQGIDTNNVIVNTTNVDYLQTGTTPLLDRFLDGYDLVSANTDVDFNGWAADPRYRARLLRSQAY 218 (338) T ss_pred hcccCCCcccc-----ccccccccccccccccccccccchhhHHHHHHHHHHhhhhccccceEEEEchHHHHHHHHHhhh Confidence 5311000 000 0000000011111111 12233345666665544 3466778889999999887743 Q ss_pred cCCCchhhhHHhhhhhcceeeeeecccccc--eeeecC--------CeEEEeeChhhhcccccccCceeccccch----- Q lcl|NC_011269. 240 WDINTTGWAFKDSVVAGERIVQFGEFQIGK--SIIIPR--------GTVYLTPEPEFLGVFPVMYSLDVEEDNKV----- 304 (333) Q Consensus 240 w~~N~~~~~~~DpV~~~e~il~~G~fgi~~--skvlpr--------geiyvvadpE~~G~~pvR~~L~s~p~D~~----- 304 (333) .|-|-. |.+-+....+. ..-++|++- +.-||- ..++++.|... -.+=.|+++.++-.++. T Consensus 219 ~d~~g~-~l~~~~~~~~~---~~~l~G~PV~~~~~ip~~~~~~~~~~~~~~~gdfs~-~~~~~~~~~~i~~~~~~~~~~~ 293 (338) T protein:vir:78 219 RDANGN-VDPTRINLAAS---AGDLLGLPVQFGKAVGGDLGAATDSKVRVVGGDFSQ-LKYGFADEIRVKMSDTATLTDN 293 (338) T ss_pred ccCCCc-eeecccccCCC---CceeeeeeEEEccccCccccccCCcccEEEEEecce-EEEEeecccEEEEeeccccccc Confidence 111111 11111111111 011245442 222342 13344556543 33556777776555432 Q ss_pred -----------hhhccceehhhhhhhhhhccceEEEEecC Q lcl|NC_011269. 
305 -----------ERFNKGWVMDELVGMAILNPRGIVILRKA 333 (333) Q Consensus 305 -----------er~~kGWvm~E~~g~~i~N~~siv~~~~~ 333 (333) ++--.+|.+.+.++.++.||.+++.|.++ T Consensus 294 ~~~~~~~~~~~~~~~~~~r~~~r~d~~v~~~~a~~~l~~~ 333 (338) T protein:vir:78 294 TSPTPQTVSMWQTNQIAILIEVTFGWLLGDKQAFVKFVDD 333 (338) T ss_pred ccccccchhhhhcCcEEEEEEEEeccEeecccceEEEecc Confidence 22346888999999999999999999999 No 107 >protein:vir:962 Length: 397 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:19 # MgeName: bIL285 # Cross-refs: genbank:acc:NP_076616;genbank:gi:13095724;genbank:GeneID:920264 Probab=97.41 E-value=1.3e-05 Score=47.36 Aligned_cols=297 Identities=10% Similarity=0.015 Sum_probs=158.5 Q ss_pred Ccccc----------hhhhhhhhhhcc---cchHHHHHHHHHHhhcchhcchHHHHHHHH----------HHhcCchhHH Q lcl|NC_011269. 1 MTLPV----------AVGSGLGRFAKA---SDDYVADIVEAKQRMGGRKLSAREKQAKLA----------HILSDKVGGI 57 (333) Q Consensus 1 ~~~~~----------~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~ls~ee~~~Lm~----------~Al~~~Eg~~ 57 (333) -.|-- .+-.-+.+..+. ...-...-..++. ..+.+..++.+.++.. ....+..|. T Consensus 64 ~~l~~~i~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~- 141 (397) T protein:vir:96 64 KDLDEKIAELQKEKQDLEDELAKAADPTDQKPKDGEKRKMKKF-KVTEEELAEKRSAINAFVKSKGAEKRDGFTSVEGG- 141 (397) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhHHHHHHHHHHH-hhhhHHHHHHHHHHHHHHHhhhhhhhhcccccccc- Confidence 00000 000000000000 0000000000000 1111111222222211 112233333 Q ss_pred HHHHHHHHHHHHHHHhhhhhhhhhhhccccC--CCcceeecCCCCccceEEEEcCCCcccce-eecCceeeccceeeecc Q lcl|NC_011269. 58 QRLGQSMIGPIQLQLRYQGILRNVLLEDTLT--PGVPIQYDVLDDLGQAYMLHGNEGEIRIT-PFEGKRIEVQLFRIASF 134 (333) Q Consensus 58 ~aLg~~mA~pI~~q~~rqGi~RklL~~~TL~--~G~~p~y~v~~~v~~a~~~~~~~G~i~~Q-~i~~~ri~~P~f~Ivs~ 134 (333) ..+-.-+.+.|.+ ++...-.+++....+++ .|..|++. ..+.+.-|.+-.|+.+.. ...-+.|++.-..+... 
T Consensus 142 ~~vp~~~~~~i~~-~~~~~~l~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~E~~~~~~~~~~~~~~i~~~~~~~~~~ 217 (397) T protein:vir:96 142 ALIPQELLQPQLE-PKDIVDLSKYVRSVPVNSASGKFPVIS---KSGSKMATVQQLEKNPQLANPKMVEIDYSVATRRGY 217 (397) T ss_pred cchhHHHHHHHHH-hhhhhhHHHhhhhccccccceeEEEEe---ccCCccccccccccccccccccccceeecHhHhhcc Confidence 2444666777765 45566666666665554 34455443 223334456666666642 33446788887888888 Q ss_pred ccccHHHhhhhcchhHHHHHHHHHHHHHHHhhhHHHHHHhhhhhhhhhhcccccccccCCCcceEEeeccccHHHHHHHH Q lcl|NC_011269. 135 PQIKKEDLYYLRSNIVEYTQDMTKQAIMRQEDSRLVTLLEAAAVSYRVVDSSAQPGVGALPNEITIAGSHLMPDDLYTAV 214 (333) Q Consensus 135 P~V~~~dl~~~~~~vle~~q~~A~qaIM~qED~~~~slle~~a~~~r~~~ssA~p~vg~~~N~i~i~~g~Lt~~~L~~a~ 214 (333) +.+..+-|.....|+..+..++..+++..-+|..+++-.... +| .|..+-++|..++ T Consensus 218 ~~~s~ell~ds~~~l~~~i~~~l~~~~~~~~~~~i~~g~g~~-------------------~~----~~~~~~d~~~~~~ 274 (397) T protein:vir:96 218 IPISQEMIDDASYDVTGLIADEIQDQSLNTKNADIAAVLKTA-------------------TA----KSVVGVDGLKDLI 274 (397) T ss_pred hhhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhccccc-------------------cc----ccccchHHHHHHH Confidence 889888889888999999999999999999997776554211 22 1334667777776 Q ss_pred HHHHhhCCccceEEechhhhhhhhhcCCCchhhhHHhhhhhcceeeee--ecccccce---eeecCCe----EEEeeChh Q lcl|NC_011269. 215 TYTDQRQLDSSRLLANPQEYRDLYRWDINTTGWAFKDSVVAGERIVQF--GEFQIGKS---IIIPRGT----VYLTPEPE 285 (333) Q Consensus 215 t~v~~~~L~at~il~~~~~~~Di~gw~~N~~~~~~~DpV~~~e~il~~--G~fgi~~s---kvlprge----iyvvadpE 285 (333) ....+-.- ....|||++.|.-|+.=- +..| .|+-+.++.-.. =++|.+-. 
..+|-++ .+++.|.- T Consensus 275 ~~~~~~~~-~a~~v~n~~~~~~l~~lk-d~~G----~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~gd~~ 348 (397) T protein:vir:96 275 NKEIKKVY-DVKLFISASMYSELDKLK-DKNG----RYLLQDSITAASGKQLLGKEVVVLDDDVIGKSVGNVVGFIGDAK 348 (397) T ss_pred HHhhhhhc-CcEEEEcHHHHHHHHHhh-ccCC----CeEeccCccCCCcccccccceEEecccccCCCCCceEEEEeehh Confidence 54333322 356899999999998721 2222 222222211101 13454421 1122222 34456665 Q ss_pred hhcccccccCceeccccchhhhccceehhhhhhhhhhccceEEEEe-cC Q lcl|NC_011269. 286 FLGVFPVMYSLDVEEDNKVERFNKGWVMDELVGMAILNPRGIVILR-KA 333 (333) Q Consensus 286 ~~G~~pvR~~L~s~p~D~~er~~kGWvm~E~~g~~i~N~~siv~~~-~~ 333 (333) .+-.+-+|+++.+...+.. .+.++...++-++..+.+|.++|.+. ++ T Consensus 349 ~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~r~d~~~~~~~a~~~~~~~~ 396 (397) T protein:vir:96 349 AFASFFDRKQVSVSWVDNN-IYGQLLAGIIRYDVKATDKKAGFYVTFTI 396 (397) T ss_pred cceEeEeecceEEEEeccc-ccceeEEEEEEEccEEecccceEEEEeec Confidence 4434667899988876642 25788889999999999999999997 44 No 108 >protein:vir:78350 Length: 383 # NCBI annotation: Cps # Family: family:all:635 # MgeID: mge:1850 # MgeName: B025 # Cross-refs: genbank:acc:YP_001468644;genbank:gi:157325222;genbank:GeneID:5601696 Probab=97.39 E-value=2.1e-05 Score=46.23 Aligned_cols=307 Identities=12% Similarity=0.034 Sum_probs=172.0 Q ss_pred Ccc-------------------------cchhhhhhhhhhcccchHH-HHH---HH------HHHhhcchhcchHHHHHH Q lcl|NC_011269. 1 MTL-------------------------PVAVGSGLGRFAKASDDYV-ADI---VE------AKQRMGGRKLSAREKQAK 45 (333) Q Consensus 1 ~~~-------------------------~~~~~~~~~~~~~~~~~~~-~~~---~~------~~~~~~~~~ls~ee~~~L 45 (333) ||. --.-...+....++...-+ ... 
.+ ...+.|.+.|+++|++.+ T Consensus 1 M~~kl~~~~~~~~e~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~lt~~e~~~~ 80 (383) T protein:vir:78 1 MTIKLKNNLANYEEKRTAFVNAVKNEDTQEIQNKAYVEMVDAMAADIMEQAKKEARQEADAYISASRTDKNITNEEIKFF 80 (383) T ss_pred CchhHHHHHHHHHHHHHHHHHHHhccChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCChhhhhHHHHHHH Confidence 000 0000111111111111000 000 00 124467788999998754 Q ss_pred H--HHHhcCchhHHHHHHHHHHHHHHHHHhhhhhhhhhhhccccCCCcceeecCCCCccceEEEEcCCCcccce-eecCc Q lcl|NC_011269. 46 L--AHILSDKVGGIQRLGQSMIGPIQLQLRYQGILRNVLLEDTLTPGVPIQYDVLDDLGQAYMLHGNEGEIRIT-PFEGK 122 (333) Q Consensus 46 m--~~Al~~~Eg~~~aLg~~mA~pI~~q~~rqGi~RklL~~~TL~~G~~p~y~v~~~v~~a~~~~~~~G~i~~Q-~i~~~ 122 (333) - +.. .+..|+ -.+-.-+++.|.+.+...+..|++-+..+.. | .-++++..+...|+ |.+-.|.+..+ ...=+ T Consensus 81 ~~~~~~-~~~~gg-~lvP~~~~~~I~~~l~~~s~l~~~~~v~~~~-~-~~~i~~~~~~~~a~-w~~e~~~~~~~~~~~f~ 155 (383) T protein:vir:78 81 NDINKE-VGYKEE-TLLPQTVVDEIFEDLTTEHPFLASIGMRTTG-L-RTKFLKSETSGVAV-WGKIFGEIKGQLDATFS 155 (383) T ss_pred HHHhcc-CCCCCc-cccCHHHHHHHHHHHHhhccceeeeeeEecC-C-ceEEEEEcCCcceE-EeecccccccccCccee Confidence 2 222 233444 3667889999999999999999997765543 3 33566555544444 76666666443 22336 Q ss_pred eeeccceeeeccccccHHHhhhhcchhHHHHHHHHHHHHHHHhhhHHHH---------HHhhhhhhhhhhcccccccccC Q lcl|NC_011269. 
123 RIEVQLFRIASFPQIKKEDLYYLRSNIVEYTQDMTKQAIMRQEDSRLVT---------LLEAAAVSYRVVDSSAQPGVGA 193 (333) Q Consensus 123 ri~~P~f~Ivs~P~V~~~dl~~~~~~vle~~q~~A~qaIM~qED~~~~s---------lle~~a~~~r~~~ssA~p~vg~ 193 (333) .|+++..++.++|.|..+=|+...+|+-.+..+...++|-+.+|.-+++ +|.+.+ ....++ ++ T Consensus 156 ~i~l~~~kl~~~i~is~ell~Ds~~~ie~~i~~~l~~~~a~~~~~a~i~G~G~~qP~Gil~~~~-------~~~~~~-~~ 227 (383) T protein:vir:78 156 DEESIQNKLTAFVVVPKDLEKFGPAWVKRFVVTQIEEAFAVALESAYIVGDGNDKPIGLNRKVG-------KGSTVV-DG 227 (383) T ss_pred eEeecceeeEeeccchHHHhhccHHHHHHHHHHHHHHHHHHHHhhheEeccCCCCceeeeeccC-------Cccccc-cc Confidence 7899999999999999999999999999999999999999999944432 110000 000000 00 Q ss_pred CCcceEEeeccccHHHHHHHHHHHHhh--------------CCccceEEechhhhhhhhhcCCCchhhhHHhhhhhccee Q lcl|NC_011269. 194 LPNEITIAGSHLMPDDLYTAVTYTDQR--------------QLDSSRLLANPQEYRDLYRWDINTTGWAFKDSVVAGERI 259 (333) Q Consensus 194 ~~N~i~i~~g~Lt~~~L~~a~t~v~~~--------------~L~at~il~~~~~~~Di~gw~~N~~~~~~~DpV~~~e~i 259 (333) . .+-....+.++-.+.......+... -+.--+.+||+.-|.+...+- ..+++ .+. . T Consensus 228 ~-~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~~~~~~------~~~~~--~G~-~ 297 (383) T protein:vir:78 228 V-YAEKAATGTLTFANPKTTVNELTDVYKYHSVKENGHPLNVAGKVTLLVNPTDAWDVKKQY------TSLNA--NGV-Y 297 (383) T ss_pred c-cccccccchhhhhhhHHHHHHHHHHHhccchhcccchhhhcCceEEEEcCcchhhhccch------hccCC--CCc-e Confidence 0 1111223344444433333333211 011113455655444443311 01111 111 1 Q ss_pred eeeec-cc--ccceeeecCCeEEEeeChhhhcccccccCceeccccch--hhhccceehhhhhhhhhhccceEEEEecC Q lcl|NC_011269. 260 VQFGE-FQ--IGKSIIIPRGTVYLTPEPEFLGVFPVMYSLDVEEDNKV--ERFNKGWVMDELVGMAILNPRGIVILRKA 333 (333) Q Consensus 260 l~~G~-fg--i~~skvlprgeiyvvadpE~~G~~pvR~~L~s~p~D~~--er~~kGWvm~E~~g~~i~N~~siv~~~~~ 333 (333) + +.+ |+ +..+--+|-|+++ ..|.... .+=+|+|+++...|.. 
.+-..++...+=++-.+.||-++|+|--+ T Consensus 298 ~-t~l~~~~~iv~s~~~p~~~ii-fgdfs~Y-~i~~r~~~~i~~~~~~~f~~d~~~f~~~~r~dG~~~~~~A~~vl~~~ 373 (383) T protein:vir:78 298 V-TALPFNLNIIESLFVPEKKAI-SYVAERY-DALIGGPLDIGTYDQTLAIEDLNLYAAKQFAYGKAKDDKAAAVWTLN 373 (383) T ss_pred e-eecCCCceEEecCCCCcccEE-Eeeccce-EEEecccceEEecchhhhhcCceEEEEEEEEcCEEecCCeEEEEEEE Confidence 1 111 23 2245668888864 5677763 6678999999887733 22367899999999999999999998755 No 109 >protein:vir:80180 Length: 381 # NCBI annotation: capsid protein # Family: family:all:2203 # MgeID: mge:1878 # MgeName: Pf-WMP3 # Cross-refs: genbank:acc:YP_001285797;genbank:gi:148747831;genbank:GeneID:5220456 Probab=97.38 E-value=1.9e-05 Score=46.51 Aligned_cols=286 Identities=12% Similarity=-0.012 Sum_probs=153.8 Q ss_pred HHHHHHHHHhhcchhcchHHHHHHHHHHhcCchhHHHHHHHHHHHHHHHHHhhhhhhhhhhhcccc--CCCcceeecCCC Q lcl|NC_011269. 22 VADIVEAKQRMGGRKLSAREKQAKLAHILSDKVGGIQRLGQSMIGPIQLQLRYQGILRNVLLEDTL--TPGVPIQYDVLD 99 (333) Q Consensus 22 ~~~~~~~~~~~~~~~ls~ee~~~Lm~~Al~~~Eg~~~aLg~~mA~pI~~q~~rqGi~RklL~~~TL--~~G~~p~y~v~~ 99 (333) .|.| |--||- |+.+++.+.-. --+-+-.++-|.+.+++..+..++-....+ ..|.-.++|++. T Consensus 1 ~~~~----~~~~~~----------~~~~~~~t~~~-~fiPev~s~~v~~~l~~~lv~~~l~~~~~~~~~~GdTV~ip~~g 65 (381) T protein:vir:80 1 MATI----QGTGGY----------KGSAVDLSNVQ-VFIPEVWSSEVRMFRDQKFAALEATKKIPFEGKKGDLIHIPNIS 65 (381) T ss_pred Ccee----cccccc----------cCcccchhhHH-hhhhHHHHHHHHHHHHHhhhhhhccccccceeecCceEEeeccC Confidence 1111 211221 22222222211 011123344455556666666665444344 467788898887 Q ss_pred CccceEEEEcCCCcccceeecCceeecc--ceeeeccccccHHHhhhhcchhHHHHHHHHHHHHHHHhhhHHHHHHhhhh Q lcl|NC_011269. 100 DLGQAYMLHGNEGEIRITPFEGKRIEVQ--LFRIASFPQIKKEDLYYLRSNIVEYTQDMTKQAIMRQEDSRLVTLLEAAA 177 (333) Q Consensus 100 ~v~~a~~~~~~~G~i~~Q~i~~~ri~~P--~f~Ivs~P~V~~~dl~~~~~~vle~~q~~A~qaIM~qED~~~~slle~~a 177 (333) +.+... ..+.+.+..|++....+++. .++ .+.-.|+..|..+...|...+.-+.+..++-++.|..++.++.... 
T Consensus 66 ~~~a~d--~~~g~~i~~~~~~~~~~~itID~~~-~~~~~Idd~D~~~~~~D~~~~~~~~~~~aLA~~~D~~i~~~~~~~~ 142 (381) T protein:vir:80 66 RAAVYD--KQPQTPVNLQARTDSEFTFTVTKYK-ESSFMIEDIVNTQASYTLRQYYTKEAGYALARDMDNFALAHRAVIN 142 (381) T ss_pred cceeee--ecCCCcccccccCCceEEEEEeeee-ecceeechHHHHhhccChHHHHHHHHHHHHHHHHHHHHHHHHhhcc Confidence 776666 55667777887776666655 344 4445889999999999999999999999999999999888875555 Q ss_pred hhhhhhcccccccccCC-Cc-ceEEeeccccHHHHHHHHHHHHhhCCccc--eEEechhhhhhhhhcCCCchhhhHHhhh Q lcl|NC_011269. 178 VSYRVVDSSAQPGVGAL-PN-EITIAGSHLMPDDLYTAVTYTDQRQLDSS--RLLANPQEYRDLYRWDINTTGWAFKDSV 253 (333) Q Consensus 178 ~~~r~~~ssA~p~vg~~-~N-~i~i~~g~Lt~~~L~~a~t~v~~~~L~at--~il~~~~~~~Di~gw~~N~~~~~~~DpV 253 (333) .+.+-..-+..++.+.. .+ ..+-.+..++-+.|-.|.+..++.+.|.. +++++|+.|.+|.. ++. +.-.|-. T Consensus 143 ~~~~~~~~t~~~~i~~~~~~~~~t~~~~~~t~~~i~~a~~~Lde~~VP~egR~lvv~P~~~~~Ll~---~~~-~~~ad~~ 218 (381) T protein:vir:80 143 AFPSQRIYSYDTTLGDGTVNAHLTGTPAPLTYAALLLAKQKLDEADVPQEGRIVMVSPAQYIDLLS---INQ-FISVDFS 218 (381) T ss_pred cccccccccccccccccccccccccchhhHHHHHHHHHHHHHhhcCCCcCCcEEEeCHHHHHHHhh---chh-hhhhhhc Confidence 33322222222222111 01 12233456677889999999999998764 79999999999986 221 2223332 Q ss_pred hhcceeeeeec------ccccceeeecCCeEEEeeChh--hhcccccccCceeccccchhhhccceehhhhhhhhhhccc Q lcl|NC_011269. 254 VAGERIVQFGE------FQIGKSIIIPRGTVYLTPEPE--FLGVFPVMYSLDVEEDNKVERFNKGWVMDELVGMAILNPR 325 (333) Q Consensus 254 ~~~e~il~~G~------fgi~~skvlprgeiyvvadpE--~~G~~pvR~~L~s~p~D~~er~~kGWvm~E~~g~~i~N~~ 325 (333) ... .++.|. |.|..|-.||.+...-..-.- -.+..|.-.|-...+......+..+|+-. ..+.+.-.+ T Consensus 219 ~~~--~l~~G~Ig~i~G~~Vv~Sn~lp~~~~t~~~~~agap~~~~~~~~~~~~~g~~s~~a~av~~~k~--yd~~~~~~~ 294 (381) T protein:vir:80 219 QVK--PVTSGVVGTILGMEVIVTTQIGINSLTGYVNGQGAPTQPTPGVLGSPYLPDQAGTANVVNTGSA--SDLAVSLSY 294 (381) T ss_pred cch--hhhceeeeEEcceEEEeecccccccccceeeeccccccccccccccccccccccceeeeeeeee--eceeeeeee Confidence 222 233444 447788888876543211111 12222322222222322223345555421 112221111 Q ss_pred eEE-EEecC Q lcl|NC_011269. 
326 GIV-ILRKA 333 (333) Q Consensus 326 siv-~~~~~ 333 (333) +.| ..-++ T Consensus 295 ~~~~~~~g~ 303 (381) T protein:vir:80 295 FGLPVFSGA 303 (381) T ss_pred ccceeeecc Confidence 111 11111 No 110 >protein:vir:93616 Length: 645 # NCBI annotation: putative major head protein/prohead protease # Family: family:all:21 # MgeID: mge:157 # MgeName: phi 4795 # Cross-refs: genbank:acc:YP_001449293;genbank:gi:157166041;goa:Q6H9U8;interpro:IPR006433;uniprot:Q6H9U8;genbank:GeneID:5580438 Probab=97.36 E-value=1.4e-05 Score=47.17 Aligned_cols=313 Identities=16% Similarity=0.110 Sum_probs=159.8 Q ss_pred Ccccc-----------------------hhhhhhhhhhccc----chHHHHHHHHHHhhcc-hhcchHHHHHHHHHHhcC Q lcl|NC_011269. 1 MTLPV-----------------------AVGSGLGRFAKAS----DDYVADIVEAKQRMGG-RKLSAREKQAKLAHILSD 52 (333) Q Consensus 1 ~~~~~-----------------------~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~-~~ls~ee~~~Lm~~Al~~ 52 (333) -..|+ .-|.+.+|++++- .+.....-.++.+.+. .++....++++-+-.-.+ T Consensus 262 ~a~pv~~~~~~~~~~~~~~~~~~~~~~~~kg~~f~~~~~al~~~~g~~~~a~e~a~~~~~~~~~~~~~~~~a~~~~~~~~ 341 (645) T protein:vir:93 262 TAQPVKQAGNGNVAAVASAPVIRVEQKLDKGIGFARFAKSLAAAKGVRSEALEVARRQYPDDSRLHHVLKSAVGAGTTTD 341 (645) T ss_pred cccccccccccccccccccccccchhhhhhhhhHHHHHHHHHhcccchhHHHHHHHhhcccchhhhhhhhhhhhcccccc Confidence 00111 1123344443321 1111111112222211 111111222211100001 Q ss_pred c--hhHHHHHHHHHHHHHHHHHhhhhhhhhhhhcc-ccCCCcce--eecCCCCccceEEEEcCCCcccceeecCceeecc Q lcl|NC_011269. 53 K--VGGIQRLGQSMIGPIQLQLRYQGILRNVLLED-TLTPGVPI--QYDVLDDLGQAYMLHGNEGEIRITPFEGKRIEVQ 127 (333) Q Consensus 53 ~--Eg~~~aLg~~mA~pI~~q~~rqGi~RklL~~~-TL~~G~~p--~y~v~~~v~~a~~~~~~~G~i~~Q~i~~~ri~~P 127 (333) . -|.+ ...+.+++.|-+.++.+.+.+++-... +.-.+.+. .+|+...- ..+-|.+-.+.++.....=+.|++. 
T Consensus 342 ~~~~Gg~-~vp~~~~~~ii~~l~~~svv~~l~~~~~~~~~~~~~~~~ip~~t~~-~~a~wv~Eg~~~~~s~~~f~~v~l~ 419 (645) T protein:vir:93 342 PQWAGSL-SEYQEYAQDFIDYLRPQTIIGRFGQGGIPALRQVPFNIRVHAQVSG-GAAGWVGEGKTKPLTKFDFESITFS 419 (645) T ss_pred ccccCCc-cCchhhHHHHHHhhhhhhhHHhhccccccccccccCceeeeeeecC-cceEEeccCccccccccceeEEEEe Confidence 1 1332 344556667777777778888774321 11111111 12221222 2344777777787776666788888 Q ss_pred ceeeeccccccHHHhhhhcchhHHHHHHHHHHHHHHHhhhHHHHHHhhhhhhhhhhcccccccccCCCcceEEeeccccH Q lcl|NC_011269. 128 LFRIASFPQIKKEDLYYLRSNIVEYTQDMTKQAIMRQEDSRLVTLLEAAAVSYRVVDSSAQPGVGALPNEITIAGSHLMP 207 (333) Q Consensus 128 ~f~Ivs~P~V~~~dl~~~~~~vle~~q~~A~qaIM~qED~~~~slle~~a~~~r~~~ssA~p~vg~~~N~i~i~~g~Lt~ 207 (333) -.++...+.|.-+=|++...++-.+...+...+|...+|.-+++=-.+.. ....| .|-.....++.++..+. T Consensus 420 ~~kla~~~~iS~ell~ds~~~~~~~i~~~l~~aia~~~d~a~l~g~g~~~-------~~~~p-~gi~~~~~~~~~~~~~~ 491 (645) T protein:vir:93 420 HAKVSAIAVLTEELIRFSSPAADALVRNALAEAVVARLDTDFVDPKKAAV-------ADVSP-ASITHDVKGTASSGNPD 491 (645) T ss_pred eEEEEEeehhHHHHHhhchHHHHHHHHHHHHHHHHHHHHHHhhcCCCccc-------CCccc-cceeccccccccccchH Confidence 99999999999888999999999999999999999999965553211110 01112 12121122344455566 Q ss_pred HHHHHHHHHHHhhCCccc--eEEechhhhhhhhhcCCCchh-hhHHhhhhhcceeeeee-ccccc--ceeeecCCeEEEe Q lcl|NC_011269. 208 DDLYTAVTYTDQRQLDSS--RLLANPQEYRDLYRWDINTTG-WAFKDSVVAGERIVQFG-EFQIG--KSIIIPRGTVYLT 281 (333) Q Consensus 208 ~~L~~a~t~v~~~~L~at--~il~~~~~~~Di~gw~~N~~~-~~~~DpV~~~e~il~~G-~fgi~--~skvlprgeiyvv 281 (333) .++-.++..+..-+.... -.+||++.+..|+.=- +..+ +.+.+ +. 
.+.| ++|.+ .+--+|-+.++.- T Consensus 492 ~d~~~~~~~~~~a~~~~~~a~~vmn~~~~~~L~~lk-d~~G~~~~~~-~~-----~~~~tL~G~PV~~s~~vp~~~~~gd 564 (645) T protein:vir:93 492 ADAEAAFGQFVAANLQPTGAVWLMSSTNALALSMRK-NALGQKEYPD-MT-----LLGGSFQGLPVIVSQYVGDQLVLVN 564 (645) T ss_pred HHHHHHHHHHHhcCCCccccEEEEcHHHHHHHHhcc-ccCCceeecC-CC-----CCCceeeceeeEEeccCCcceeEec Confidence 778777777766555543 4689999999887611 1111 11111 11 1111 24433 3444565544443 Q ss_pred eChhhhcccccccCceeccc------------------------cchhhhccceehhhhhhhhhhccceEEEEecC Q lcl|NC_011269. 282 PEPEFLGVFPVMYSLDVEED------------------------NKVERFNKGWVMDELVGMAILNPRGIVILRKA 333 (333) Q Consensus 282 adpE~~G~~pvR~~L~s~p~------------------------D~~er~~kGWvm~E~~g~~i~N~~siv~~~~~ 333 (333) ++.-.+| +++++.+.-. +--++--.++..-+-+++.+.+|-+|+.|.-+ T Consensus 565 ~s~~~ig---~~~~v~i~~s~~a~~~~~~~~~~~~~~~~~~~~v~lf~~d~vaira~~r~d~~~~~p~a~~~lt~~ 637 (645) T protein:vir:93 565 APDIYLA---DDGGVAVDMSREASLEMQSEPTGDSTTPSPVELVSMFQTGSVAIRAERWINWRRRRTAAVAVITGV 637 (645) T ss_pred cccEEEE---EecceEEEeecceeEEEeecccccccccccccchhHhhcCceEEEEEEEEcceeeCccceEEEecc Confidence 3322222 3344432211 11122245677778899999999999999987 No 111 >protein:vir:98635 Length: 377 # NCBI annotation: major coat protein # Family: family:all:635 # MgeID: mge:1601 # MgeName: phi3396 # Cross-refs: genbank:acc:YP_001039923;genbank:gi:126011098;genbank:GeneID:4818471 Probab=97.18 E-value=8.1e-06 Score=48.48 Aligned_cols=290 Identities=16% Similarity=0.092 Sum_probs=162.2 Q ss_pred Ccccchhhhhhhh--hhcccchHHHHHHHHHHhhcchhcchHHHHHHHHHHhc---CchhHHHHHHHHHHHHHHHHHhhh Q lcl|NC_011269. 1 MTLPVAVGSGLGR--FAKASDDYVADIVEAKQRMGGRKLSAREKQAKLAHILS---DKVGGIQRLGQSMIGPIQLQLRYQ 75 (333) Q Consensus 1 ~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~ls~ee~~~Lm~~Al~---~~Eg~~~aLg~~mA~pI~~q~~rq 75 (333) -.+- ..-..+.. -.++...+-..+ .-+.+.++|+++|++.+- ++++ ++.|+ -.+=+.+++.|.+.+... 
T Consensus 33 ~~~~-~~~~~~~~~~~~~~~~e~~~~~---~~~~~~~~lt~ee~~~~~-~~~~~~~~~~gg-~~vP~~~~~~I~~~l~~~ 106 (377) T protein:vir:98 33 KLFE-AAFTTMGDEILAKNEEEMERMF---DLRDKNRELTAEEIKFFN-DIDKNVGGKDKF-KLLPEETMVQVFDDLVAE 106 (377) T ss_pred HHHH-HHHHhHHHHHHHHHHHHHHHHH---HhccCCcccCHHHHHHHH-HHHhccCCCCCc-cccCHHHHHHHHHHHHHh Confidence 0000 00000000 000111111111 124578899999998653 3333 44555 367788999999999999 Q ss_pred hhhhhhhhccccCCCcceeecCCCCccceEEEEcCCCcccceee-cCceeeccceeeeccccccHHHhhhhcchhHHHHH Q lcl|NC_011269. 76 GILRNVLLEDTLTPGVPIQYDVLDDLGQAYMLHGNEGEIRITPF-EGKRIEVQLFRIASFPQIKKEDLYYLRSNIVEYTQ 154 (333) Q Consensus 76 Gi~RklL~~~TL~~G~~p~y~v~~~v~~a~~~~~~~G~i~~Q~i-~~~ri~~P~f~Ivs~P~V~~~dl~~~~~~vle~~q 154 (333) +-.|++.+..+. .|. -.++...+. ..+.|.+-.+++..+.. .=..|+++..++-++|.|..+=|..+.+|+..+.. T Consensus 107 s~i~~~~~v~~~-~~~-~~~~~~~~~-~~a~w~~e~~~~~~~~~~~f~~i~l~~~kl~a~~~is~elL~ds~~~ie~~i~ 183 (377) T protein:vir:98 107 HPLLKVINFKNT-SLR-LKALTAETS-GTAVWGDIFGEIKGQLKQAFKEQDFSQFKLTAFVVIPKDALKFGPKWIKQFIT 183 (377) T ss_pred hhhhhheeeEec-Ccc-eEEEEecCC-cceeEeecccccCcccCccceeEeecceeEEeeecccHHhhhccHhHHHHHHH Confidence 999999877665 343 356654443 44557776666654422 22689999999999999999999999999999999 Q ss_pred HHHHHHHHHHhhhHHHHHHhhhhhhhhhhcccccccccCCCcceEEee-------------ccccH----HHHHHHHHHH Q lcl|NC_011269. 155 DMTKQAIMRQEDSRLVTLLEAAAVSYRVVDSSAQPGVGALPNEITIAG-------------SHLMP----DDLYTAVTYT 217 (333) Q Consensus 155 ~~A~qaIM~qED~~~~slle~~a~~~r~~~ssA~p~vg~~~N~i~i~~-------------g~Lt~----~~L~~a~t~v 217 (333) +...++|-+.+|.-+++ |.|.- -|.=|.. 
+.-+- ++|..+.- T Consensus 184 ~~la~~~a~~~~~a~i~------------------G~G~~-qP~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~-- 242 (377) T protein:vir:98 184 EQLKEAIAVALELAIVK------------------GDGLL-QPVGLLKDLSQPTVDQSTGRDITTYKTDKEAIADLSD-- 242 (377) T ss_pred HHHHHHHHHHHhhceEe------------------ccCCC-cceeeeecccccccccccccccccccchhhhHhhhhh-- Confidence 99999999999954443 22210 1111110 00000 11111111 Q ss_pred HhhCCccceEEechhhhhhhhhcCCCchhhhHHhhh--hhcceee------------------eeec----cccc----c Q lcl|NC_011269. 218 DQRQLDSSRLLANPQEYRDLYRWDINTTGWAFKDSV--VAGERIV------------------QFGE----FQIG----K 269 (333) Q Consensus 218 ~~~~L~at~il~~~~~~~Di~gw~~N~~~~~~~DpV--~~~e~il------------------~~G~----fgi~----~ 269 (333) ..+..|+.-..|=.|...+..+.-+ ..+..++ +.|. +|++ . T Consensus 243 -----------~~~~~~~~~a~~~m~~~t~~~~~klkd~~G~~i~~~n~~~~~~~~p~~~~~~~~G~~~t~lg~p~~vv~ 311 (377) T protein:vir:98 243 -----------LTPDNAPKKLVPVMKHLSVNDKKRPLKIAGQVKLILNPEDRWALEAQFTSRNQFGEYVTVLPHGITILE 311 (377) T ss_pred -----------hchhHHHHHHHHHHHHHHHHHHhhhhccCCceEEEecccchhhccccccccCCCCccccccCCCceEEe Confidence 1111122222222221110000000 0010000 1111 2222 3 Q ss_pred eeeecCCeEEEeeChhhhcccccccCceeccccchh--hhccceehhhhhhhhhhccceEEEEecC Q lcl|NC_011269. 270 SIIIPRGTVYLTPEPEFLGVFPVMYSLDVEEDNKVE--RFNKGWVMDELVGMAILNPRGIVILRKA 333 (333) Q Consensus 270 skvlprgeiyvvadpE~~G~~pvR~~L~s~p~D~~e--r~~kGWvm~E~~g~~i~N~~siv~~~~~ 333 (333) +--+|.|.++ ..|... =.+-+|+++.++..|... 
+-..++....-++-.+.||-++++|.=+ T Consensus 312 s~~~p~~~i~-fgdf~~-Y~i~~r~~~~i~~~~~~~~~~d~~~f~~~~r~dg~~~~~~a~~vl~i~ 375 (377) T protein:vir:98 312 SLAVETGKAI-AFVANR-YDAFMATASTIEEYDQTFAMEDLQLYLTKNYFYGKAKDNHTAALLTLA 375 (377) T ss_pred cCCCCcccEE-EEEecc-eeEEeecceEEEeechhhhhcCceEEEEEEEEcCEEeccCcEEEEEEe Confidence 4447777764 566654 345689999998877332 3368899999999999999999999888 No 112 >protein:vir:78640 Length: 352 # NCBI annotation: phage capsid # Family: family:all:658 # MgeID: mge:1855 # MgeName: tp310-2 # Cross-refs: genbank:acc:YP_001429943;genbank:gi:156603997;genbank:GeneID:5525386 Probab=97.14 E-value=4.2e-05 Score=44.56 Aligned_cols=301 Identities=11% Similarity=0.044 Sum_probs=155.5 Q ss_pred CcccchhhhhhhhhhcccchHHHHHHHHHHhhcchhcchHHHHHHHHHHhc---CchhHHHHHHHHHHHHHHHHHhhhhh Q lcl|NC_011269. 1 MTLPVAVGSGLGRFAKASDDYVADIVEAKQRMGGRKLSAREKQAKLAHILS---DKVGGIQRLGQSMIGPIQLQLRYQGI 77 (333) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ls~ee~~~Lm~~Al~---~~Eg~~~aLg~~mA~pI~~q~~rqGi 77 (333) ..-....+.....-.+... ..++.+.+ ...+...............+|+ +..|+. -+-+-+...|-+.++.+.. T Consensus 36 ~~~~~~~~~~~~~~~~~~~-~~~~~~r~-~~~~~~~~~~~~~~~~~~~al~~~~~~~gG~-lIP~~~~~~Ii~~l~~~s~ 112 (352) T protein:vir:78 36 VKDKGEAYQSLNDNEKLVK-AKAEFYRH-AILPNEFEKPSMEAQRLLHALPTGNDSGGDK-LLPKTLSKEIVSEPFAKNQ 112 (352) T ss_pred hhhccccccccchhhhHHH-HHHHHHHH-HhhhhHHHHHHhhHHHHHHHhccCCCCCCce-eccHhHHHHHHHHHHhhcc Confidence 0000000000000000000 00111100 0011111111112222334443 344442 5567788889999999999 Q ss_pred hhhhhhccccCCCcceeecCCCCccceEEEEcCCCcccceeecCceeeccceeeeccccccHHHhhhhcchhHHHHHHHH Q lcl|NC_011269. 78 LRNVLLEDTLTPGVPIQYDVLDDLGQAYMLHGNEGEIRITPFEGKRIEVQLFRIASFPQIKKEDLYYLRSNIVEYTQDMT 157 (333) Q Consensus 78 ~RklL~~~TL~~G~~p~y~v~~~v~~a~~~~~~~G~i~~Q~i~~~ri~~P~f~Ivs~P~V~~~dl~~~~~~vle~~q~~A 157 (333) +|++.+..+......|+.... 
...+-|.+--+++......-+.|++.-.++..+..|..+=|.....|+..+..+.- T Consensus 113 l~~~~~v~~~~~~~~p~~~~~---~~~a~~v~E~~~~~~~~~~f~~v~~~~~k~~~~i~is~ell~Ds~~~l~~~i~~~l 189 (352) T protein:vir:78 113 LREKARLTNIKGLEIPRVSYT---LDDDDFITDVETAKELKLKGDTVKFTTNKFKVFAAISDTVIHGSDVDLVNWVENAL 189 (352) T ss_pred hhhheeeEecCCceEEEEecC---CCcccccccccccccccccceeeeecceeEEeechhhHHHHhhhhHHHHHHHHHHH Confidence 999988877665566654421 12345777777777777777889999999999999999999999999999999999 Q ss_pred HHHHHHHhhhHHHHHHhhhhhhhhhhcccccccccCCCc-ceEEeeccccHHHHHHHHHHHHhhCCccceEEechhhhhh Q lcl|NC_011269. 158 KQAIMRQEDSRLVTLLEAAAVSYRVVDSSAQPGVGALPN-EITIAGSHLMPDDLYTAVTYTDQRQLDSSRLLANPQEYRD 236 (333) Q Consensus 158 ~qaIM~qED~~~~slle~~a~~~r~~~ssA~p~vg~~~N-~i~i~~g~Lt~~~L~~a~t~v~~~~L~at~il~~~~~~~D 236 (333) .++|.+-|+..+|. .+ +.+.+|.- .+-| .+.-+.+.-+-++|-.++.-+..---.-...+||+.-|.. T Consensus 190 a~~~~~~e~~~~~~--~g--------~g~~~~~g-~l~~~~~~~~t~~~~~d~i~~~~~~l~~~~~~~a~~~mn~~t~~~ 258 (352) T protein:vir:78 190 QSGLAAKERKDALA--VS--------PKSGLEHM-SFYNGSVKEVEGANMYDAIINALADLHEDYRDNATIYMRYADYVK 258 (352) T ss_pred HHHHHHHHHHhhhh--cC--------CCCccccc-ceeccccccccccchHHHHHHHHhccChhhhcCCEEEEehHHHHH Confidence 99998877644441 11 11112211 1111 1111122222345555554333222223457888887776 Q ss_pred hhhcCCCchhhhHHhhhhhcceeeeeecccccceeeecCCeEEEeeChhhhcccc----cccCceeccccchhhhcccee Q lcl|NC_011269. 237 LYRWDINTTGWAFKDSVVAGERIVQFGEFQIGKSIIIPRGTVYLTPEPEFLGVFP----VMYSLDVEEDNKVERFNKGWV 312 (333) Q Consensus 237 i~gw~~N~~~~~~~DpV~~~e~il~~G~fgi~~skvlprgeiyvvadpE~~G~~p----vR~~L~s~p~D~~er~~kGWv 312 (333) ++.=--|..+ |+-++. 
...++|.+....=.-..+ + .|-|+ .+.++...+....+.-..|++ T Consensus 259 l~~~~~~~~~-----~~~~~~---~~~llG~PV~~~~~~~~~-~------~Gdf~~~~~~~~~~~~~~~~~~~~g~~~f~ 323 (352) T protein:vir:78 259 IISVLSNGTT-----NFFDTP---AEKVFGKPVVFTDAAVKP-I------VGDFNYFGINYDGTTYDTDKDVKKGEYLFV 323 (352) T ss_pred HHHHHhccCC-----cccccC---CccccccceEEecCCCce-e------EeehhhhhhhhhhheeeeeccccCCeeEEE Confidence 6551111111 111111 012344332111000111 1 24443 233343333333333457899 Q ss_pred hhhhhhhhhhccceEEEEecC Q lcl|NC_011269. 313 MDELVGMAILNPRGIVILRKA 333 (333) Q Consensus 313 m~E~~g~~i~N~~siv~~~~~ 333 (333) .++-++..+.||-+++++..+ T Consensus 324 ~~~r~Dg~~~~~eA~~~l~~~ 344 (352) T protein:vir:78 324 LTAWYDQQRTLDSAFRIAKAK 344 (352) T ss_pred EEeeeCceeechhheEEEEee Confidence 999999999999999999766 No 113 >protein:vir:100057 Length: 375 # NCBI annotation: T7-like capsid protein # Family: family:all:975 # MgeID: mge:1604 # MgeName: P-SSP7 # Cross-refs: genbank:acc:YP_214206;genbank:gi:61806429;genbank:GeneID:3294737 Probab=97.11 E-value=6.2e-05 Score=43.64 Aligned_cols=290 Identities=20% Similarity=0.199 Sum_probs=167.3 Q ss_pred HHHHHHHHHhhcchhcchH-------HHHHHHHHHhcCchhHHHHHHHHHHHHHHHHHhhhhhhhhhhhccccCCCccee Q lcl|NC_011269. 22 VADIVEAKQRMGGRKLSAR-------EKQAKLAHILSDKVGGIQRLGQSMIGPIQLQLRYQGILRNVLLEDTLTPGVPIQ 94 (333) Q Consensus 22 ~~~~~~~~~~~~~~~ls~e-------e~~~Lm~~Al~~~Eg~~~aLg~~mA~pI~~q~~rqGi~RklL~~~TL~~G~~p~ 94 (333) ++|.-. -|||+-+++.+ ++-+|.-+.++ +-+-....|+-+.|.+....|++-|.--+ T Consensus 1 ~~~~~~--~~~~~~n~~t~~~~~~~~~~~al~le~f~--------------geV~~~f~~~si~~~~~~~rti~~Gksv~ 64 (375) T protein:vir:10 1 MANANQ--VALGRSNLSTGTGYGGATDKYALYLKLFS--------------GEMFKGFQHETIARDLVTKRTLKNGKSLQ 64 (375) T ss_pred Cccccc--cccCccccCCccccccccchHHHHHHHHh--------------HHHHHHHHHHHhhhccccccccccCceEE Confidence 222211 24555444332 33344334333 33455677888888999999999999999 Q ss_pred ecCCCCccceEEEEcCCCcccceeecC-----ceeeccceeeeccccccHHHhhhhcchhHHHHHHHHHHHHHHHhhhHH Q lcl|NC_011269. 
95 YDVLDDLGQAYMLHGNEGEIRITPFEG-----KRIEVQLFRIASFPQIKKEDLYYLRSNIVEYTQDMTKQAIMRQEDSRL 169 (333) Q Consensus 95 y~v~~~v~~a~~~~~~~G~i~~Q~i~~-----~ri~~P~f~Ivs~P~V~~~dl~~~~~~vle~~q~~A~qaIM~qED~~~ 169 (333) |+..-++..++ +++-.++-.++..+ +.|.+-+..+. +-+|+--|=-|...|+..+.-+++-.++-++.|..+ T Consensus 65 f~~iG~~t~~~--~t~G~~i~~~~~~d~~~te~~l~ID~~~y~-~~~VdDiD~aqa~~Dlr~e~s~~~G~aLA~~~D~~i 141 (375) T protein:vir:10 65 FIYTGRMTSSF--HTPGTPILGNADKAPPVAEKTIVMDDLLIS-SAFVYDLDETLAHYELRGEISKKIGYALAEKYDRLI 141 (375) T ss_pred EEeeeeeEEee--ecCCcCcCCccccCCCCCceEEEecchhhh-hhhHhhHHHHhcCchhHHHHHHHHHHHHHHHHHHHH Confidence 99998887788 55555555555443 23444444443 446777777788999999999999999999999999 Q ss_pred HHHHhhhhhhhhhhcccccccccCCCcceEEeec-----cccHHHH----HHHHHHHHhhCCcc--ceEEechhhhhhhh Q lcl|NC_011269. 170 VTLLEAAAVSYRVVDSSAQPGVGALPNEITIAGS-----HLMPDDL----YTAVTYTDQRQLDS--SRLLANPQEYRDLY 238 (333) Q Consensus 170 ~slle~~a~~~r~~~ssA~p~vg~~~N~i~i~~g-----~Lt~~~L----~~a~t~v~~~~L~a--t~il~~~~~~~Di~ 238 (333) +.++=.+|-+-.-+. ..|+++.-.-.+...+| ..|+.++ ..+.+..++.+.|. -.+|++|+.|.-|. T Consensus 142 ~~~l~kaa~~~~p~~--~~~~~~~Gg~~i~~~sg~~~~~~~ta~~~~~ai~~a~~~Lde~~VP~~~R~~vv~P~~y~~Ll 219 (375) T protein:vir:10 142 FRSITRGARSASPVS--ATNFVEPGGTQIRVGSGTNESDAFTASALVNAFYDAAAAMDEKGVSSQGRCAVLNPRQYYALI 219 (375) T ss_pred HHHHHHhhhhccccc--cccccccCcceeeeccccccccccCHHHHHHHHHHHHHHHhhcCCCCCCCEEEeChHHHHHHH Confidence 888754441111111 11111111123333322 3456554 45667788888884 46889999998775 Q ss_pred hc-CCC---chhhhHHhhhhhcceeeeeecccccceeeecCCeEE--------EeeChhh------------------hc Q lcl|NC_011269. 239 RW-DIN---TTGWAFKDSVVAGERIVQFGEFQIGKSIIIPRGTVY--------LTPEPEF------------------LG 288 (333) Q Consensus 239 gw-~~N---~~~~~~~DpV~~~e~il~~G~fgi~~skvlprgeiy--------vvadpE~------------------~G 288 (333) -= +.| ...|+. +-+.....+...--|.|.+|-.+|..+++ =+..|++ +. 
T Consensus 220 ~~~d~~~~~n~d~~~-~~~~~~g~v~~i~Gv~V~~Sn~lP~~~~~~~~~g~~~~~~a~~~~~~~~~~~~~~~~~~~g~~~ 298 (375) T protein:vir:10 220 QDIGSNGLVNRDVQG-SALQSGNGVIEIAGIHIYKSMNIPFLGKYGVKYGGTTGETSPGNLGSHIGPTPENANATGGVNN 298 (375) T ss_pred hcCCccceeeecccc-cceeccceEEEEeceEEEEeccccccccccccccccccccchhhhhccccccCCcceeeccccc Confidence 30 011 111111 11222222222333667788888976653 1122221 11 Q ss_pred cccccc-------Cc--------eecccc-chhh-------hccceehhhhhhh--hhhccceEEEEecC Q lcl|NC_011269. 289 VFPVMY-------SL--------DVEEDN-KVER-------FNKGWVMDELVGM--AILNPRGIVILRKA 333 (333) Q Consensus 289 ~~pvR~-------~L--------~s~p~D-~~er-------~~kGWvm~E~~g~--~i~N~~siv~~~~~ 333 (333) .|=... |+ -++-.| +.|+ -.+||+|.--.+| .+.||.+.|-|... T Consensus 299 ~y~~d~~~~~~~~~~~~~~~A~g~v~~~~~~~~~~~~~~~~~~q~~~i~~~~a~G~~~lrp~~av~l~~~ 368 (375) T protein:vir:10 299 DYGTNAELGAKSCGLIFQKEAAGVVEAIGPQVQVTNGDVSVIYQGDVILGRMAMGADYLNPAAAVELYIG 368 (375) T ss_pred cccccccccCceEEEEEchhheeeeeeeccccccccchhhheeeeeeeeeeeeeccCccCceeEEEEecC Confidence 111111 11 112233 3332 2689999877666 57999999999877 No 114 >protein:vir:95107 Length: 270 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1549 # MgeName: X2 # Cross-refs: genbank:acc:YP_240822;genbank:gi:66394683;genbank:GeneID:5133901 Probab=97.11 E-value=3.9e-05 Score=44.76 Aligned_cols=250 Identities=12% Similarity=0.137 Sum_probs=146.1 Q ss_pred hcchhcchHHHHHHHHHHhcCchhHHHHHHHHHHHHHHHHHhhhhhhhhhhhccc-c--CCCcceeecCCCCccceEEEE Q lcl|NC_011269. 32 MGGRKLSAREKQAKLAHILSDKVGGIQRLGQSMIGPIQLQLRYQGILRNVLLEDT-L--TPGVPIQYDVLDDLGQAYMLH 108 (333) Q Consensus 32 ~~~~~ls~ee~~~Lm~~Al~~~Eg~~~aLg~~mA~pI~~q~~rqGi~RklL~~~T-L--~~G~~p~y~v~~~v~~a~~~~ 108 (333) |---+|| +.+ .|| .+++-+.+.+...++-.+++ +.++ | .+|....+|+-+-.+-+- .. 
T Consensus 1 Ma~T~~~---------d~I-~Pe----v~~~~V~e~~~~~~~~~~~~----~~d~~L~g~~G~ti~~P~~~~igdae-~~ 61 (270) T protein:vir:95 1 MTQTKKA---------NLI-NPE----VLANVVSAQMQNAIRFTPYA----VTDDTLVGQPGDTITRPKYAYIGAAE-DL 61 (270) T ss_pred CCceehh---------hhc-chH----HHHHHHHHHHHhHHhhcccc----ccccccCCCCCCEEEeeeecCCCccc-cc Confidence 3332222 111 222 23444444443333333333 2222 2 357777777532221111 12 Q ss_pred cCCCcccceeecCceeeccceeeec-cccccHHHhhhhc--chhHHHHHHHHHHHHHHHhhhHHHHHHhhhhhhhhhhcc Q lcl|NC_011269. 109 GNEGEIRITPFEGKRIEVQLFRIAS-FPQIKKEDLYYLR--SNIVEYTQDMTKQAIMRQEDSRLVTLLEAAAVSYRVVDS 185 (333) Q Consensus 109 ~~~G~i~~Q~i~~~ri~~P~f~Ivs-~P~V~~~dl~~~~--~~vle~~q~~A~qaIM~qED~~~~slle~~a~~~r~~~s 185 (333) ....++..+.+.=..- +.+|.- ...+...|+-... +|.+.++-.....++-+..|..+++.|.++..++ T Consensus 62 ~eg~~i~~~~lt~~~~---~a~i~~~gk~~~itD~a~~~~~~dp~~~~~~q~a~~~a~~~d~~li~~l~~a~~~~----- 133 (270) T protein:vir:95 62 QEGVAMDTTQMSMTTT---KVTVKETGKAVEVTQTAIITNVNGTLQEASRQLAMSLADKVEIDYIAELNKSKQTA----- 133 (270) T ss_pred cCCCccchhhcccchh---eeeeehhhCcceecHHHHhhhccchHHHHHHHHHHHHHHHHHHHHHHHhccccccc----- Confidence 2222233222211111 122221 3466777776655 6999999999999999999999988886654221 Q ss_pred cccccccCCCcceEEeeccccHHHHHHHHHHHHhhCCccceEEechhhhhhhhhcCCCchhhhHHhhhhhcceeeeeecc Q lcl|NC_011269. 
186 SAQPGVGALPNEITIAGSHLMPDDLYTAVTYTDQRQLDSSRLLANPQEYRDLYRWDINTTGWAFKDSVVAGERIVQFGEF 265 (333) Q Consensus 186 sA~p~vg~~~N~i~i~~g~Lt~~~L~~a~t~v~~~~L~at~il~~~~~~~Di~gw~~N~~~~~~~DpV~~~e~il~~G~f 265 (333) ....+-++|..|.+...+-+=..+-|+|||.-|..|+- |-+ ++...-.+-++..|.| T Consensus 134 ----------------~~~~t~~~~~dA~~~lgd~~~~~~~i~vhs~~~~~Lrk---~~~----~~~~~~~~~~~~~G~i 190 (270) T protein:vir:95 134 ----------------TVSADATGILDAIEVFNSENDEDYVLYVNPKDYNKLVK---SLF----KVGGNVQDRAISKGDL 190 (270) T ss_pred ----------------ccccCHHHHHHHHHHhccccCCCcEEEEcHHHHHHHHh---hhc----ccccccccchhccccc Confidence 12346789999999998888888899999999999985 211 1221222223434554 Q ss_pred c-------ccceeeecCCeEEEeeChhhhcccccccCceeccccchhhhccceehhhhhhhhhhccceEEEEecC Q lcl|NC_011269. 266 Q-------IGKSIIIPRGTVYLTPEPEFLGVFPVMYSLDVEEDNKVERFNKGWVMDELVGMAILNPRGIVILRKA 333 (333) Q Consensus 266 g-------i~~skvlprgeiyvvadpE~~G~~pvR~~L~s~p~D~~er~~kGWvm~E~~g~~i~N~~siv~~~~~ 333 (333) | |.-++..|-|+.|+..-.- +|-+.-| ++.+|....+.+...==.-.+.++..+.|+..||.+..+ T Consensus 191 g~~~G~~Viv~s~~~~~~~~~l~~~gA-i~~~~~~-~~~vEtdRd~~~~~d~i~~~~~y~v~~~~~skvv~~t~~ 263 (270) T protein:vir:95 191 VEIVGVSDIVKSKRVSENTAFLQRYGA-MEIVNKK-KPEAYTDFDILKRTHLLSTNYHYSVNLKDETGVVKVTFK 263 (270) T ss_pred ceecceeEEEeCCCCCceeEEEEeccc-eeeeecC-CceeeeccchhhcccEEEeeeEEEEEEEccceEEEEEec Confidence 4 3567788999999887544 6655544 467665555555444444557789999999999987654 No 115 >protein:vir:8885 Length: 347 # NCBI annotation: major capsid protein A # Family: family:all:975 # MgeID: mge:161 # MgeName: gh-1 # Cross-refs: genbank:acc:NP_813774;genbank:gi:29366729;genbank:GeneID:1258837 Probab=97.08 E-value=0.00013 Score=41.85 Aligned_cols=289 Identities=13% Similarity=0.109 Sum_probs=150.5 Q ss_pred Cc-ccchhhhhhhhhhcccchHHHHHHHHHHhhcchhcchHHHHHHHHHHhcCchhHHHHHHHHHHHHHHHHHhhhhhhh Q lcl|NC_011269. 
1 MT-LPVAVGSGLGRFAKASDDYVADIVEAKQRMGGRKLSAREKQAKLAHILSDKVGGIQRLGQSMIGPIQLQLRYQGILR 79 (333) Q Consensus 1 ~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ls~ee~~~Lm~~Al~~~Eg~~~aLg~~mA~pI~~q~~rqGi~R 79 (333) |. ++.+ ++.. -|-|-..-+.. +-+|--+.++.+ +.....|+-+.+ T Consensus 1 ~a~~~~~-----~~~~--------------~~~g~~~~~~d-~~al~ie~~~ge--------------V~~~f~~~s~~~ 46 (347) T protein:vir:88 1 MANATGG-----QQIG--------------ANQGKGQSAAD-KLALFLKVFGGE--------------VLTAFVRRSVTM 46 (347) T ss_pred CCCcccc-----hhhh--------------ccCCCCccccc-hHHHHHHHHHHH--------------HHHHHHHHhhhh Confidence 21 1100 0000 01111111111 233322333222 222344555666 Q ss_pred hhhhccccCCCcceeecCCCCccceEEEEcCCCcccceeecCceeecc--ceeeeccccccHHHhhhhcchhHHHHHHHH Q lcl|NC_011269. 80 NVLLEDTLTPGVPIQYDVLDDLGQAYMLHGNEGEIRITPFEGKRIEVQ--LFRIASFPQIKKEDLYYLRSNIVEYTQDMT 157 (333) Q Consensus 80 klL~~~TL~~G~~p~y~v~~~v~~a~~~~~~~G~i~~Q~i~~~ri~~P--~f~Ivs~P~V~~~dl~~~~~~vle~~q~~A 157 (333) .+-...|+..|..-+||..-.+..+|+--+.+-.....++...++++. +.. .++-.|+-.|-.|...|+..+.-+++ T Consensus 47 ~~~~~r~i~~G~sv~~~~iG~~~~~~~~~g~~l~~~~~~~~~~~~~i~ID~~~-y~~~~Vdd~D~~q~~~D~r~~~~~~~ 125 (347) T protein:vir:88 47 DKHMVRTIQNGKSASFPVMGRTKGYYLAPGENLDDKRKDIKHSEKVIQIDGLL-TSDVLIYDIEDAMNHYDVRAEYSAQL 125 (347) T ss_pred hccccccccCcceEEEeeecceeeeeeccccCCCCCCCCCccceEEEEEechh-hhhhhhhhHHHHhhcCCchHHHHHHH Confidence 777777899999999999999988886655554333344544444444 433 34557888899999999999999999 Q ss_pred HHHHHHHhhhHHHHHHhhhhhhhhhhc--ccccccccCC-CcceEEeeccccH--------HHHHHHHHHHHhhCCcc-- Q lcl|NC_011269. 158 KQAIMRQEDSRLVTLLEAAAVSYRVVD--SSAQPGVGAL-PNEITIAGSHLMP--------DDLYTAVTYTDQRQLDS-- 224 (333) Q Consensus 158 ~qaIM~qED~~~~slle~~a~~~r~~~--ssA~p~vg~~-~N~i~i~~g~Lt~--------~~L~~a~t~v~~~~L~a-- 224 (333) ..|+-+..|..++..+-.++ |.-. ....++.|.- ..++...+...++ ++|..|.+..++.+.|. 
T Consensus 126 g~aLA~~~D~~i~~~l~~~a---~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~a~~~Lde~~VP~~g 202 (347) T protein:vir:88 126 GEALAIAADGAVLAEMAKLC---NLPAASNENIAGLGQAVVLNIGAAADLVDVEARGKAILKGLTLARARLTKNYVPAGD 202 (347) T ss_pred HHHHHHHHHHHHHHHHHHhh---ccccccccccCCccccccccccccccccchhhhHHHHHHHHHHHHHHHhhcCCCCCC Confidence 99999999998887663333 2111 2223333222 1222322333333 33556777788888764 Q ss_pred ceEEechhhhhhhhhcCCCch---hhhHHhhhhhcceeeeeecccccceeeecCCeE--------E-------------- Q lcl|NC_011269. 225 SRLLANPQEYRDLYRWDINTT---GWAFKDSVVAGERIVQFGEFQIGKSIIIPRGTV--------Y-------------- 279 (333) Q Consensus 225 t~il~~~~~~~Di~gw~~N~~---~~~~~DpV~~~e~il~~G~fgi~~skvlprgei--------y-------------- 279 (333) -++|++|+.|.+|.- ...+ .+.....+.++. +-+.==|.|.++--+|.+-. | T Consensus 203 R~~vv~P~~y~~Ll~--~~~~~~~~~~~~~~~~~G~-vg~i~G~~V~~s~nlp~~~~~~~~~~~~~~~t~~~~~~~~~~~ 279 (347) T protein:vir:88 203 RRFYCAPEDYSAILS--ALMPNAANYAALIDPETGN-IRNVMGFEVIEVPHLTVGGAGDNNPADGVAPTNQKHIFPATAT 279 (347) T ss_pred CEEEeCHHHHHHHhc--chhhhhhhhccccchhcce-eeeeccceEEEeecccccccccccccccccccccccccccccc Confidence 578999999999974 2111 111112222221 22222255777777764321 1 Q ss_pred ------------EeeChhhhcccccccCceeccccchhhhccceehh--hhhhhhhhccceEEEEec--C Q lcl|NC_011269. 280 ------------LTPEPEFLGVFPVMYSLDVEEDNKVERFNKGWVMD--ELVGMAILNPRGIVILRK--A 333 (333) Q Consensus 280 ------------vvadpE~~G~~pvR~~L~s~p~D~~er~~kGWvm~--E~~g~~i~N~~siv~~~~--~ 333 (333) ++--+.-+|.-- =.++++|...-+ -..+|+|. 
-..|-.+.||.+.|.+.- | T Consensus 280 ~~~~~d~~~~~~l~~~~~a~g~v~-~~d~~~e~~r~~--~~~~d~i~~~~~~G~~~~rPe~a~~~~~~~a 346 (347) T protein:vir:88 280 GDDRVAQNNVVGLFNHRSAVGTVK-LKDMALERARRP--EFQADQIIGKYAMGHGGLRPEAAGALVFTPA 346 (347) T ss_pred cccccccCcEEEEEechhhhhhee-cccceeeeeech--hhHHHHhhhhhhhcCceeccceEEEEEeCCC Confidence 111111111100 011222222122 25677775 456778999987766543 3 No 116 >protein:vir:94576 Length: 347 # NCBI annotation: Major capsid protein # Family: family:all:975 # MgeID: mge:1516 # MgeName: Berlin # Cross-refs: genbank:acc:YP_919012;genbank:gi:119637776;genbank:GeneID:5179336 Probab=97.04 E-value=0.00018 Score=41.15 Aligned_cols=278 Identities=15% Similarity=0.172 Sum_probs=149.6 Q ss_pred CcccchhhhhhhhhhcccchHHHHHHHHHHhhcchhc--------chHHHHHHHHHHhcCchhHHHHHHHHHHHHHHHHH Q lcl|NC_011269. 1 MTLPVAVGSGLGRFAKASDDYVADIVEAKQRMGGRKL--------SAREKQAKLAHILSDKVGGIQRLGQSMIGPIQLQL 72 (333) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l--------s~ee~~~Lm~~Al~~~Eg~~~aLg~~mA~pI~~q~ 72 (333) |. +-++|.++ ++-...+|.-+.++ +-+.... T Consensus 1 ma---------------------------~~~~~~~~~t~~g~~~~~~d~~al~ie~~~--------------geV~~~f 39 (347) T protein:vir:94 1 MA---------------------------NMNGGQQMGKDQGKGMSAGDKLALFLKVFG--------------GEVLTAF 39 (347) T ss_pred CC---------------------------ccccccccccccccCCcccchHHHHHHHHh--------------HHHHHHH Confidence 11 11111111 11111122112222 2233445 Q ss_pred hhhhhhhhhhhccccCCCcceeecCCCCccceEEEEcCCCcccceee--cCceeeccceeeeccccccHHHhhhhcchhH Q lcl|NC_011269. 73 RYQGILRNVLLEDTLTPGVPIQYDVLDDLGQAYMLHGNEGEIRITPF--EGKRIEVQLFRIASFPQIKKEDLYYLRSNIV 150 (333) Q Consensus 73 ~rqGi~RklL~~~TL~~G~~p~y~v~~~v~~a~~~~~~~G~i~~Q~i--~~~ri~~P~f~Ivs~P~V~~~dl~~~~~~vl 150 (333) .|+-+.+.+....|++.|..-+||....+..+|+.-+.+-.-..+++ -.+.|.+-+..+ ++-+|+--|=-|...|+. 
T Consensus 40 ~~~s~~~~~~~~rti~~G~sv~~~~iG~~~~~~~~~G~~l~~~~~~~~~~e~~ltID~~~y-~~~~VddiD~~q~~~D~r 118 (347) T protein:vir:94 40 TRTSVTMNKHLVRSIQSGKSAQFPVLGRTKAAYLQPGENLDDKRKDMKHTEKTINIDGLLT-ADVLIYDIEDAMNHYDVR 118 (347) T ss_pred HHHHhhhhhhhheeccccceEEeeeccceeEeeeecCcCCCCCcCCccccceEEEEcchhh-hhhhhhhHHHHhcCcchH Confidence 56666677777778999999999999999999976666543333433 334454544443 444677777778889999 Q ss_pred HHHHHHHHHHHHHHhhhHHHHHHhhhhhhhhhhccccccccc-CCCcceEEeecc-------ccH----HHHHHHHHHHH Q lcl|NC_011269. 151 EYTQDMTKQAIMRQEDSRLVTLLEAAAVSYRVVDSSAQPGVG-ALPNEITIAGSH-------LMP----DDLYTAVTYTD 218 (333) Q Consensus 151 e~~q~~A~qaIM~qED~~~~slle~~a~~~r~~~ssA~p~vg-~~~N~i~i~~g~-------Lt~----~~L~~a~t~v~ 218 (333) .+.-+++-.|+-+..|..++..|--++ +.-..+.-|..| .-.=.+.+..+. .++ +.|.+|.+..+ T Consensus 119 s~~~~~~g~ALA~~~D~~i~~~l~~~a---~~~~~~~~~~~g~~~~~~v~i~~~~~~~~~~~~~~~~~~d~i~~a~~~Ld 195 (347) T protein:vir:94 119 SEYTAQLGESLAMAADGAVLAEMAKLC---NLPTANNENIAGLGKAHVLEVGDQATLQGDQVKLGQAIIAQLTLARAKLT 195 (347) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhh---ccccccccccccCCcceeEeeeccccccccccccHHHHHHHHHHHHHHhh Confidence 999999999999999988875542222 111111111111 111122232211 123 34667778888 Q ss_pred hhCCcc--ceEEechhhhhhhhh--cCCCchhhhHHhhhhhcceeeeeecccccceeeecCCeE---------------- Q lcl|NC_011269. 219 QRQLDS--SRLLANPQEYRDLYR--WDINTTGWAFKDSVVAGERIVQFGEFQIGKSIIIPRGTV---------------- 278 (333) Q Consensus 219 ~~~L~a--t~il~~~~~~~Di~g--w~~N~~~~~~~DpV~~~e~il~~G~fgi~~skvlprgei---------------- 278 (333) +.+.|. .+++.+|+.|.+|.- +. +.-.+.+.+.+..+. 
|.+.--|.|.+|--+|.+.+ T Consensus 196 e~dVP~~~R~~vv~P~~y~~LLk~~~~-~~~~~~~~~~~~~G~-V~~v~G~~V~~Sn~~p~~~~~~~~~~~~~~~~~~~~ 273 (347) T protein:vir:94 196 GNYVPSSDRVFYTTPDNYSAILAALMP-NAANYQALIDPSTGS-IRNVMGFEVIEVPHLTAGGAGDNRAEEGVAPTNQKH 273 (347) T ss_pred hcCCCCCCCEEEeChHHHHHHHHhhcc-cccccccccccccce-eEEeeceEEEEcCccccccCcccccccccccccccc Confidence 888874 456778999999984 11 111122222233332 33333355666766765432 Q ss_pred ------------------EEeeChhhhcccccccCceecccc-chhh----hccceehhhhh--hhhhhccceEE--EEe Q lcl|NC_011269. 279 ------------------YLTPEPEFLGVFPVMYSLDVEEDN-KVER----FNKGWVMDELV--GMAILNPRGIV--ILR 331 (333) Q Consensus 279 ------------------yvvadpE~~G~~pvR~~L~s~p~D-~~er----~~kGWvm~E~~--g~~i~N~~siv--~~~ 331 (333) .|+-.|+-+|. ++-.| +.|. -..||+|.--. |-.+.||.+.| .+. T Consensus 274 ~~~~~~~~~y~~d~~~~~~l~~~~~A~~t--------v~~~~~~~e~~~~~~~~~~~i~~~~a~G~g~~rPe~a~~i~~~ 345 (347) T protein:vir:94 274 AFPDTASGDTRVALDNVVGLFNHRSAVGT--------VKLKDMALERARRANFQADQIIAKYAMGHGGLRPEACGALVFK 345 (347) T ss_pred cccccccccccccccceEEEEechhhhhh--------hhhcccceeeeechhhhhhhhhhhhhhcCcccccceeEEEEec Confidence 12222222221 11222 2222 37899997655 55689996554 666 Q ss_pred cC Q lcl|NC_011269. 332 KA 333 (333) Q Consensus 332 ~~ 333 (333) +| T Consensus 346 ~a 347 (347) T protein:vir:94 346 KA 347 (347) T ss_pred CC Confidence 77 No 117 >protein:vir:101291 Length: 381 # NCBI annotation: hypothetical protein # Family: family:all:635 # MgeID: mge:1591 # MgeName: phiNM3 # Cross-refs: genbank:acc:YP_908831;genbank:gi:118725095;genbank:GeneID:4555862 Probab=96.99 E-value=3.6e-05 Score=44.97 Aligned_cols=299 Identities=12% Similarity=0.052 Sum_probs=172.5 Q ss_pred Cccc-----------------------c----------hhhhhhhhhhcccchHHHHHHHHHHhhcchhcchHHHHHHHH Q lcl|NC_011269. 1 MTLP-----------------------V----------AVGSGLGRFAKASDDYVADIVEAKQRMGGRKLSAREKQAKLA 47 (333) Q Consensus 1 ~~~~-----------------------~----------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ls~ee~~~Lm~ 47 (333) |++= - +.+.-..+ ++.+.|-. 
.+ ..+++++.|+++|++.+-+ T Consensus 1 m~ik~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~e~~~-~~--~~~~~~~~lt~~e~~~~~~ 75 (381) T protein:vir:10 1 MTINLSETFANAKNEFINAVNNGEPQERQNELYGDMINQLFEETKL--QAKAEAER-VS--SLPKSAQSLSANQRSFFMD 75 (381) T ss_pred CchhhHHHHHHHHHHHHHHHhhhhhhHHHHHHHHHHHHhhhhhHHH--HHHHHHHH-HH--HhccCcccccHHHHHHHHH Confidence 0000 0 00011111 11112211 11 2356788999999886532 Q ss_pred --HHhcCchhHHHHHHHHHHHHHHHHHhhhhhhhhhhhccccCCCcceeecCCCCccceEEEEcCCCcccce-eecCcee Q lcl|NC_011269. 48 --HILSDKVGGIQRLGQSMIGPIQLQLRYQGILRNVLLEDTLTPGVPIQYDVLDDLGQAYMLHGNEGEIRIT-PFEGKRI 124 (333) Q Consensus 48 --~Al~~~Eg~~~aLg~~mA~pI~~q~~rqGi~RklL~~~TL~~G~~p~y~v~~~v~~a~~~~~~~G~i~~Q-~i~~~ri 124 (333) ..- +..|+ -.+=.-+++.|.+.+..+...|++.+..+.. | .-.+++..+.. .+-|.+-.+.+..+ ...=..| T Consensus 76 ~~~~~-~~~gg-~lvP~~~~~~I~~~l~~~s~i~~~~~v~~~~-~-~~~i~~~~~~~-~a~w~~e~~~~~~~~~~~f~~i 150 (381) T protein:vir:10 76 INKNV-NYKEE-KLLPEETIDRIFEDLTTNHPLLADLGIKNAG-L-RLKFLKSETSG-VAVWGKIYGEIKGQLDAAFSEE 150 (381) T ss_pred Hhccc-CCCCc-eecCHHHHHHHHHHHHhhccceeheeeEecC-c-ceEEEEecCCc-ceeeecccccccccccccceee Confidence 222 23444 2567889999999999999999998776654 3 33455544443 44477765666654 2333689 Q ss_pred eccceeeeccccccHHHhhhhcchhHHHHHHHHHHHHHHHhhhHHHHHHhhhhhhhhhhcccccccccCCCcc---eE-- Q lcl|NC_011269. 125 EVQLFRIASFPQIKKEDLYYLRSNIVEYTQDMTKQAIMRQEDSRLVTLLEAAAVSYRVVDSSAQPGVGALPNE---IT-- 199 (333) Q Consensus 125 ~~P~f~Ivs~P~V~~~dl~~~~~~vle~~q~~A~qaIM~qED~~~~slle~~a~~~r~~~ssA~p~vg~~~N~---i~-- 199 (333) +++..++.+++.|.-+=|....+|+-.+...+..++|.+.||.-+++ +- -+.+|. |.+.++ .. 
T Consensus 151 ~l~~~kl~~~~~is~elL~Ds~~~ie~~i~~~la~~~a~~~~~a~i~---G~--------G~~qP~-Gil~~~~~~~~~~ 218 (381) T protein:vir:10 151 TAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLK---GT--------GKDQPI-GLNRQVQKGVSVT 218 (381) T ss_pred eecceeEEeechhhHHHhhcCHHHHHHHHHHHHHHHHHHHhhheeEe---cc--------CCCCce-eeeeccCcccccc Confidence 99999999999999999999999999999999999999999944332 11 011121 111000 00 Q ss_pred -------EeeccccH-------HHHHHHHHHHHhhCCcc-------ceEEechhhhhhhhhcCCCchhhhHHhhhhhcce Q lcl|NC_011269. 200 -------IAGSHLMP-------DDLYTAVTYTDQRQLDS-------SRLLANPQEYRDLYRWDINTTGWAFKDSVVAGER 258 (333) Q Consensus 200 -------i~~g~Lt~-------~~L~~a~t~v~~~~L~a-------t~il~~~~~~~Di~gw~~N~~~~~~~DpV~~~e~ 258 (333) .+.+-++- +.|.........|.-.. -..+||+.-+.+++. ..+. .| +.+. T Consensus 219 ~g~~~~~~~~~t~t~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~a~~~mn~~t~~~l~~--~~~~----~~---~~G~ 289 (381) T protein:vir:10 219 EGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQA--QYTH----LN---ANGV 289 (381) T ss_pred cccccccccccccccccchhhHHHHHHHHHhhccccccccccccCceEEEEccccHHhhcc--cccc----CC---CCCc Confidence 11122222 23333333333332211 124788887777764 1111 11 1111 Q ss_pred eeeeec-cc--ccceeeecCCeEEEeeChhhhcccccccCceeccccch--hhhccceehhhhhhhhhhccceEEEEecC Q lcl|NC_011269. 259 IVQFGE-FQ--IGKSIIIPRGTVYLTPEPEFLGVFPVMYSLDVEEDNKV--ERFNKGWVMDELVGMAILNPRGIVILRKA 333 (333) Q Consensus 259 il~~G~-fg--i~~skvlprgeiyvvadpE~~G~~pvR~~L~s~p~D~~--er~~kGWvm~E~~g~~i~N~~siv~~~~~ 333 (333) .+ +.+ || |..+--||-|.|+ ..|.-. =.+-+|+++.+...|.. .+-..+....+=++-.+.+|.++|++.-. 
T Consensus 290 ~v-~~l~~g~~vv~s~~~p~~~ii-fgDfs~-Y~i~~r~~~~i~~~~~~~~~~d~~~f~a~~r~dg~~~~~~A~~v~~l~ 366 (381) T protein:vir:10 290 YV-TALPFNLNVIESTVQEAGKVL-TYVKGL-YDGYLAGGINVQKFKETLALDDMDLYTAKQFAYGKAKDNKVAAVWKLD 366 (381) T ss_pred ee-ecCCCCceEEecCCCCcCcEE-EEeccc-EEEEEecccEEEeechhHhhcCCeEEEEEEEEcCEEecCceEEEEEEE Confidence 11 111 23 3346668888864 466554 35678999998887732 22356888888899999999999986533 No 118 >protein:vir:9509 Length: 381 # NCBI annotation: hypothetical protein # Family: family:all:635 # MgeID: mge:170 # MgeName: phiN315 # Cross-refs: genbank:acc:NP_835556;genbank:gi:30043951;genbank:GeneID:1260537 Probab=96.99 E-value=3.6e-05 Score=44.97 Aligned_cols=299 Identities=12% Similarity=0.052 Sum_probs=172.5 Q ss_pred Cccc-----------------------c----------hhhhhhhhhhcccchHHHHHHHHHHhhcchhcchHHHHHHHH Q lcl|NC_011269. 1 MTLP-----------------------V----------AVGSGLGRFAKASDDYVADIVEAKQRMGGRKLSAREKQAKLA 47 (333) Q Consensus 1 ~~~~-----------------------~----------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ls~ee~~~Lm~ 47 (333) |++= - +.+.-..+ ++.+.|-. .+ ..+++++.|+++|++.+-+ T Consensus 1 m~ik~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~e~~~-~~--~~~~~~~~lt~~e~~~~~~ 75 (381) T protein:vir:95 1 MTINLSETFANAKNEFINAVNNGEPQERQNELYGDMINQLFEETKL--QAKAEAER-VS--SLPKSAQSLSANQRSFFMD 75 (381) T ss_pred CchhhHHHHHHHHHHHHHHHhhhhhhHHHHHHHHHHHHhhhhhHHH--HHHHHHHH-HH--HhccCcccccHHHHHHHHH Confidence 0000 0 00011111 11112211 11 2356788999999886532 Q ss_pred --HHhcCchhHHHHHHHHHHHHHHHHHhhhhhhhhhhhccccCCCcceeecCCCCccceEEEEcCCCcccce-eecCcee Q lcl|NC_011269. 48 --HILSDKVGGIQRLGQSMIGPIQLQLRYQGILRNVLLEDTLTPGVPIQYDVLDDLGQAYMLHGNEGEIRIT-PFEGKRI 124 (333) Q Consensus 48 --~Al~~~Eg~~~aLg~~mA~pI~~q~~rqGi~RklL~~~TL~~G~~p~y~v~~~v~~a~~~~~~~G~i~~Q-~i~~~ri 124 (333) ..- +..|+ -.+=.-+++.|.+.+..+...|++.+..+.. | .-.+++..+.. 
.+-|.+-.+.+..+ ...=..| T Consensus 76 ~~~~~-~~~gg-~lvP~~~~~~I~~~l~~~s~i~~~~~v~~~~-~-~~~i~~~~~~~-~a~w~~e~~~~~~~~~~~f~~i 150 (381) T protein:vir:95 76 INKNV-NYKEE-KLLPEETIDRIFEDLTTNHPLLADLGIKNAG-L-RLKFLKSETSG-VAVWGKIYGEIKGQLDAAFSEE 150 (381) T ss_pred Hhccc-CCCCc-eecCHHHHHHHHHHHHhhccceeheeeEecC-c-ceEEEEecCCc-ceeeecccccccccccccceee Confidence 222 23444 2567889999999999999999998776654 3 33455544443 44477765666654 2333689 Q ss_pred eccceeeeccccccHHHhhhhcchhHHHHHHHHHHHHHHHhhhHHHHHHhhhhhhhhhhcccccccccCCCcc---eE-- Q lcl|NC_011269. 125 EVQLFRIASFPQIKKEDLYYLRSNIVEYTQDMTKQAIMRQEDSRLVTLLEAAAVSYRVVDSSAQPGVGALPNE---IT-- 199 (333) Q Consensus 125 ~~P~f~Ivs~P~V~~~dl~~~~~~vle~~q~~A~qaIM~qED~~~~slle~~a~~~r~~~ssA~p~vg~~~N~---i~-- 199 (333) +++..++.+++.|.-+=|....+|+-.+...+..++|.+.||.-+++ +- -+.+|. |.+.++ .. T Consensus 151 ~l~~~kl~~~~~is~elL~Ds~~~ie~~i~~~la~~~a~~~~~a~i~---G~--------G~~qP~-Gil~~~~~~~~~~ 218 (381) T protein:vir:95 151 TAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFLK---GT--------GKDQPI-GLNRQVQKGVSVT 218 (381) T ss_pred eecceeEEeechhhHHHhhcCHHHHHHHHHHHHHHHHHHHhhheeEe---cc--------CCCCce-eeeeccCcccccc Confidence 99999999999999999999999999999999999999999944332 11 011121 111000 00 Q ss_pred -------EeeccccH-------HHHHHHHHHHHhhCCcc-------ceEEechhhhhhhhhcCCCchhhhHHhhhhhcce Q lcl|NC_011269. 200 -------IAGSHLMP-------DDLYTAVTYTDQRQLDS-------SRLLANPQEYRDLYRWDINTTGWAFKDSVVAGER 258 (333) Q Consensus 200 -------i~~g~Lt~-------~~L~~a~t~v~~~~L~a-------t~il~~~~~~~Di~gw~~N~~~~~~~DpV~~~e~ 258 (333) .+.+-++- +.|.........|.-.. -..+||+.-+.+++. ..+. .| +.+. 
T Consensus 219 ~g~~~~~~~~~t~t~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~a~~~mn~~t~~~l~~--~~~~----~~---~~G~ 289 (381) T protein:vir:95 219 EGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQA--QYTH----LN---ANGV 289 (381) T ss_pred cccccccccccccccccchhhHHHHHHHHHhhccccccccccccCceEEEEccccHHhhcc--cccc----CC---CCCc Confidence 11122222 23333333333332211 124788887777764 1111 11 1111 Q ss_pred eeeeec-cc--ccceeeecCCeEEEeeChhhhcccccccCceeccccch--hhhccceehhhhhhhhhhccceEEEEecC Q lcl|NC_011269. 259 IVQFGE-FQ--IGKSIIIPRGTVYLTPEPEFLGVFPVMYSLDVEEDNKV--ERFNKGWVMDELVGMAILNPRGIVILRKA 333 (333) Q Consensus 259 il~~G~-fg--i~~skvlprgeiyvvadpE~~G~~pvR~~L~s~p~D~~--er~~kGWvm~E~~g~~i~N~~siv~~~~~ 333 (333) .+ +.+ || |..+--||-|.|+ ..|.-. =.+-+|+++.+...|.. .+-..+....+=++-.+.+|.++|++.-. T Consensus 290 ~v-~~l~~g~~vv~s~~~p~~~ii-fgDfs~-Y~i~~r~~~~i~~~~~~~~~~d~~~f~a~~r~dg~~~~~~A~~v~~l~ 366 (381) T protein:vir:95 290 YV-TALPFNLNVIESTVQEAGKVL-TYVKGL-YDGYLAGGINVQKFKETLALDDMDLYTAKQFAYGKAKDNKVAAVWKLD 366 (381) T ss_pred ee-ecCCCCceEEecCCCCcCcEE-EEeccc-EEEEEecccEEEeechhHhhcCCeEEEEEEEEcCEEecCceEEEEEEE Confidence 11 111 23 3346668888864 466554 35678999998887732 22356888888899999999999986533 No 119 >protein:vir:739 Length: 231 # NCBI annotation: major structural protein 4 # Family: family:all:522 # MgeID: mge:14 # MgeName: Tuc2009 # Cross-refs: genbank:acc:NP_108716;genbank:gi:13487838;genbank:GeneID:920884 Probab=96.99 E-value=3e-05 Score=45.40 Aligned_cols=211 Identities=12% Similarity=0.150 Sum_probs=135.5 Q ss_pred ccccCCCcceeecCCCCccceEEEEcCCC------cccceeecCceeeccceeeec-cccccHHHhh--hhcchhHHHHH Q lcl|NC_011269. 84 EDTLTPGVPIQYDVLDDLGQAYMLHGNEG------EIRITPFEGKRIEVQLFRIAS-FPQIKKEDLY--YLRSNIVEYTQ 154 (333) Q Consensus 84 ~~TL~~G~~p~y~v~~~v~~a~~~~~~~G------~i~~Q~i~~~ri~~P~f~Ivs-~P~V~~~dl~--~~~~~vle~~q 154 (333) ++-+.-|....||+ | +|..- ++..+.+. 
-+-.+..|.- -..+...|+- ..-+|.+.++- T Consensus 1 ~~~~~~Gdtit~P~-------~--iGda~~v~eG~~i~~~~l~---~t~~~atIk~~gk~~~itD~a~l~~~gDp~~ea~ 68 (231) T protein:vir:73 1 ENGINLANLCEYPN-------D--IGDAADVAEGGEISLDKIG---TTTKSVTIKKAAKGTEITDEAALSGYGDPIGESN 68 (231) T ss_pred CccccCCceEEecc-------c--ccchhhhcCCCcCChhhcc---ccceeeeEeeeccceeeeHHHHhhccCchHHHHH Confidence 78888888888883 2 33322 22222111 1111223321 2344444544 44589999999 Q ss_pred HHHHHHHHHHhhhHHHHHHhhhhhhhhhhcccccccccCCCcceEEeeccccHHHHHHHHHHHHhhCCccceEEechhhh Q lcl|NC_011269. 155 DMTKQAIMRQEDSRLVTLLEAAAVSYRVVDSSAQPGVGALPNEITIAGSHLMPDDLYTAVTYTDQRQLDSSRLLANPQEY 234 (333) Q Consensus 155 ~~A~qaIM~qED~~~~slle~~a~~~r~~~ssA~p~vg~~~N~i~i~~g~Lt~~~L~~a~t~v~~~~L~at~il~~~~~~ 234 (333) .....+|-...|..++.-+-.+..+ + ...+|.+.+.+|.+...+-+-...-+++||..| T Consensus 69 ~Q~~~~iA~kvD~di~~~~~~a~l~--------------------~-~~~~t~d~i~~A~~~fgde~~~~~vivv~p~~~ 127 (231) T protein:vir:73 69 KQLGLSLANKVDDDLLKAAKTTSQT--------------------V-STKANVDGVQAALDIFNDEDAQAYVLIVNPKDA 127 (231) T ss_pred HHHHHHHHHhhhHHHHHhhcccccc--------------------c-cccccHHHHHHHHHHhccccccceEEEEcchHH Confidence 9999999999998877655332211 1 134689999999999999888889999999999 Q ss_pred hhhhhcCCCchhhhHHhhhhhcceeeeeeccc------ccceeeecCCeEEE---eeChhhhcccccccCceeccccchh Q lcl|NC_011269. 235 RDLYRWDINTTGWAFKDSVVAGERIVQFGEFQ------IGKSIIIPRGTVYL---TPEPEFLGVFPVMYSLDVEEDNKVE 305 (333) Q Consensus 235 ~Di~gw~~N~~~~~~~DpV~~~e~il~~G~fg------i~~skvlprgeiyv---vadpE~~G~~pvR~~L~s~p~D~~e 305 (333) .+||. +-.++..++... -+ ++-.|+|| |..|+.+|-|+.|. +.-|--+|-|.-| ++.+|..-... T Consensus 128 ~~Lrk---~~~~~~~~~~~g-~~-i~~~G~iG~i~G~~Vi~S~~~~~~~~~~~~~i~~~gAl~~~~k~-~~~vEtdRd~~ 201 (231) T protein:vir:73 128 AKIRK---DANAKNIGSEVG-AN-ALINGTYADVLGAQIVRSKKLAEGSALMFKIVSNSPALKLVLKR-GVQVETDRDIV 201 (231) T ss_pred Hhhhh---ccchhhhhhhhc-cc-eeeecccceEcceEEEEcCCCCCCceeeeeEEeeccceeeeecc-cceeecccccc Confidence 99998 333344433222 12 33466655 77899999998863 3333334444433 45566443444 Q ss_pred hhccceehhhhhhhhhhccceEEEEecC Q lcl|NC_011269. 
306 RFNKGWVMDELVGMAILNPRGIVILRKA 333 (333) Q Consensus 306 r~~kGWvm~E~~g~~i~N~~siv~~~~~ 333 (333) +...=-.-.+.++..+.||+++|.+--+ T Consensus 202 ~k~~~i~~~~~y~v~l~~~~~vv~~t~~ 229 (231) T protein:vir:73 202 TKTTVITADEHYAAYLYDLTKVVNITFT 229 (231) T ss_pred ccccEEEEeEEEEEEEEcCccEEEEEee Confidence 4555556678889999999999999777 No 120 >protein:vir:8420 Length: 477 # NCBI annotation: gp15 # Family: family:all:21 # MgeID: mge:155 # MgeName: Omega # Cross-refs: genbank:acc:NP_818316;genbank:gi:29566752;genbank:GeneID:1260033 Probab=96.96 E-value=9.5e-05 Score=42.62 Aligned_cols=319 Identities=11% Similarity=0.054 Sum_probs=153.2 Q ss_pred Cccc--------chhhhhhhhhh--cccchHHHHHHHHHHhhcchhcchHHHHHHH------------------HHHh-- Q lcl|NC_011269. 1 MTLP--------VAVGSGLGRFA--KASDDYVADIVEAKQRMGGRKLSAREKQAKL------------------AHIL-- 50 (333) Q Consensus 1 ~~~~--------~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~ls~ee~~~Lm------------------~~Al-- 50 (333) .... ..+..---... .....+..++.... +..+..-..++....+ +.++ T Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 159 (477) T protein:vir:84 81 GKLEAETKTVRKATVEVNEALTYEKGNGQSYFRDLAMQT-VGMADEPAKERLRRHMVDVESDKEIRKIAKVGEEYRDLDR 159 (477) T ss_pred hcchhhhhhhcccccccccchhhhhhHHHHHHHHHHHHH-hhhhhhHHHHHHHHHHhhhhhhhhHHHHHHhhhhhccccc Confidence 0000 00000000000 00011222222111 1111111111111000 1111 Q ss_pred cCchhHHHHHHHHHHHHHHHHHhhhhhhhhhhhccccCCCcce-eecCCCCccceEEEEcCCCcc-----cceeecCcee Q lcl|NC_011269. 
51 SDKVGGIQRLGQSMIGPIQLQLRYQGILRNVLLEDTLTPGVPI-QYDVLDDLGQAYMLHGNEGEI-----RITPFEGKRI 124 (333) Q Consensus 51 ~~~Eg~~~aLg~~mA~pI~~q~~rqGi~RklL~~~TL~~G~~p-~y~v~~~v~~a~~~~~~~G~i-----~~Q~i~~~ri 124 (333) .+..|+-=..-+-+++.|.+.++.....+++....+++.+.-+ .||+...-...+.|.+.-+.+ +.....=+.| T Consensus 160 ~~~~gg~lv~~~~~~~~ii~~l~~~~~i~~~~~~~~~~~~~~~~~ip~~~~~~~~a~~~~Eg~~~~~~~~~~s~~~f~~i 239 (477) T protein:vir:84 160 NGGTGGYAVPPLWMMNRFIELARAGRTYANLCPTEPLPGGTSSINIPKILTGTSTAIQAADNAALTAPSAHEVDLTDGFV 239 (477) T ss_pred cCCCcceeeccchhHHHHHHHhhhcchHHHhhceeeecCCcceeEEEEEecCcceeeeeccCcccccccccccccceeeE Confidence 1222221012234678888888888888888888888776533 455433333334455443322 2222222568 Q ss_pred eccceeeeccccccHHHhhhhcchhHHHHHHHHHHHHHHHhhhHHHHHHhhhhhhhhhhcccccccccCC----CcceEE Q lcl|NC_011269. 125 EVQLFRIASFPQIKKEDLYYLRSNIVEYTQDMTKQAIMRQEDSRLVTLLEAAAVSYRVVDSSAQPGVGAL----PNEITI 200 (333) Q Consensus 125 ~~P~f~Ivs~P~V~~~dl~~~~~~vle~~q~~A~qaIM~qED~~~~slle~~a~~~r~~~ssA~p~vg~~----~N~i~i 200 (333) +++-..+.+++.|..+=|.....++..+...+..++|...+|.- +|.+.-+ +.+| .|.+ -|.++. T Consensus 240 ~~~~~k~~~~~~iS~ell~ds~~~l~~~i~~~l~~~~~~~~d~~---~l~G~Gt-------~~~p-~Gi~~~~~~~~~~~ 308 (477) T protein:vir:84 240 QANVKTIAGQQGIAIQLLDQAAVSVDEFVFRDLAADYANKLNVQ---VISGTGS-------NNQV-VGVRATAGITQVTA 308 (477) T ss_pred EEeeeeEEeeeHHHHHHHhccchhHHHHHHHHHHHHHHHHHHHH---HhccCCC-------CCcc-ceeeeccccccccc Confidence 88888999999999999999999999999999999999999953 4444321 1121 1111 044455 Q ss_pred eeccccHHH-------HHHHHHHHH-hhCCccceEEechhhhhhhhhcCCCchhhhHHhhh-hhcceeeeee-------- Q lcl|NC_011269. 201 AGSHLMPDD-------LYTAVTYTD-QRQLDSSRLLANPQEYRDLYRWDINTTGWAFKDSV-VAGERIVQFG-------- 263 (333) Q Consensus 201 ~~g~Lt~~~-------L~~a~t~v~-~~~L~at~il~~~~~~~Di~gw~~N~~~~~~~DpV-~~~e~il~~G-------- 263 (333) .+...|-.+ +..+...+. -..+.+.-++||++.|..|+..--.+-.|.+.... 
......++.| T Consensus 309 ~~~~~t~~~~~~~~~~i~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~~~~~~~~~~~~~ 388 (477) T protein:vir:84 309 TSAGSALEKHQIIYQKIADAIQRVHTSRFLEPEVIVMHPRRWASFHAIFAGDDRPLIVPSGPGFNNLGVLTEVASQRVVG 388 (477) T ss_pred cccccchhhHHHHHHHHHHHHhhccccccCCccEEEEcHHHHHHHHHhhccCCCeeeecCcccccccccccccccccccc Confidence 444444333 233333222 22345667899999998887632111111110000 0000111111 Q ss_pred -ccccc--ceeeecCCe-------EEEeeChhhhcccccccCceec--cccchhhhccceehhhhhhhhhhc-cceEEEE Q lcl|NC_011269. 264 -EFQIG--KSIIIPRGT-------VYLTPEPEFLGVFPVMYSLDVE--EDNKVERFNKGWVMDELVGMAILN-PRGIVIL 330 (333) Q Consensus 264 -~fgi~--~skvlprge-------iyvvadpE~~G~~pvR~~L~s~--p~D~~er~~kGWvm~E~~g~~i~N-~~siv~~ 330 (333) ++|.+ .+-.||-+. .+++.|-..+ +=.+.|+.++ +.++.......+-++.++.+..++ |.|+|++ T Consensus 389 ~l~G~pVv~s~~~p~~~~~~~d~~~i~~gd~~~~--~i~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~r~~~afv~~ 466 (477) T protein:vir:84 389 QMHGLPVVTDPTLPTTLGTGTDQDVIHVLRASDL--ALFESSVRMRALQETRAENLSVLLQVYGYLAFTAARFPQSVVEI 466 (477) T ss_pred hhcccceEecCcccccccccCCcceEEEEEeceE--EEEeeceeEEeccccccccceeeeeehhhhhhhhhccccceEEe Confidence 13333 445566431 2333433211 1123455444 333444456677788888886665 9999998 Q ss_pred ecC Q lcl|NC_011269. 331 RKA 333 (333) Q Consensus 331 ~~~ 333 (333) --+ T Consensus 467 t~~ 469 (477) T protein:vir:84 467 GGT 469 (477) T ss_pred ecc Confidence 766 No 121 >protein:vir:3364 Length: 347 # NCBI annotation: major capsid protein 10A # Family: family:all:975 # MgeID: mge:67 # MgeName: T3 # Cross-refs: genbank:acc:NP_523335;genbank:gi:17570826;genbank:GeneID:927448 Probab=96.96 E-value=0.00016 Score=41.34 Aligned_cols=285 Identities=12% Similarity=0.091 Sum_probs=150.9 Q ss_pred CcccchhhhhhhhhhcccchHHHHHHHHHHhhcchhcc--------hHHHHHHHHHHhcCchhHHHHHHHHHHHHHHHHH Q lcl|NC_011269. 1 MTLPVAVGSGLGRFAKASDDYVADIVEAKQRMGGRKLS--------AREKQAKLAHILSDKVGGIQRLGQSMIGPIQLQL 72 (333) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ls--------~ee~~~Lm~~Al~~~Eg~~~aLg~~mA~pI~~q~ 72 (333) |. +-++|.++. 
.-+.-+|--+.+ .+-+.... T Consensus 1 ~~---------------------------~~~~~~~~~t~~g~~~~~~~~~al~ie~~--------------~g~V~~~f 39 (347) T protein:vir:33 1 MA---------------------------NIQGGQQIGTNQGKGQSAADKLALFLKVF--------------GGEVLTAF 39 (347) T ss_pred CC---------------------------CCccCcccccccccCCcccchHHHHHHHH--------------HHHHHHHH Confidence 11 122233321 111111211222 23344556 Q ss_pred hhhhhhhhhhhccccCCCcceeecCCCCccceEEEEcCCCcccc----eeecCceeeccceeeeccccccHHHhhhhcch Q lcl|NC_011269. 73 RYQGILRNVLLEDTLTPGVPIQYDVLDDLGQAYMLHGNEGEIRI----TPFEGKRIEVQLFRIASFPQIKKEDLYYLRSN 148 (333) Q Consensus 73 ~rqGi~RklL~~~TL~~G~~p~y~v~~~v~~a~~~~~~~G~i~~----Q~i~~~ri~~P~f~Ivs~P~V~~~dl~~~~~~ 148 (333) .|+-+.+.+....|+..|..-+|+..-.+..++ +++...+.. .+..+..|.+=+... ++-+|+-.|=-|...| T Consensus 40 ~~~s~~~~~v~~r~~~~G~sv~i~~iG~~t~~~--~~~g~~l~~~~~~~~~~e~~ltiD~~~y-~~~~VddiD~~q~~~D 116 (347) T protein:vir:33 40 ARTSVTMPRHMLRSIASGKSAQFPVIGRTKAAY--LKPGENLDDKRKDIKHTEKVIHIDGLLT-ADVLIYDIEDAMNHYD 116 (347) T ss_pred HHHHhhhhhhccccccccceeEeeeccceeeee--ecCCCCCCCCCCCCccceEEEEechhhh-hhHHHhhHHHHhcCCc Confidence 677788888888899999999999988887787 444333322 222223333333333 3345666677778889 Q ss_pred hHHHHHHHHHHHHHHHhhhHHHHHHhhhh-hhhhhhcccccccccCCCcceEEeeccc------cHHH----HHHHHHHH Q lcl|NC_011269. 149 IVEYTQDMTKQAIMRQEDSRLVTLLEAAA-VSYRVVDSSAQPGVGALPNEITIAGSHL------MPDD----LYTAVTYT 217 (333) Q Consensus 149 vle~~q~~A~qaIM~qED~~~~slle~~a-~~~r~~~ssA~p~vg~~~N~i~i~~g~L------t~~~----L~~a~t~v 217 (333) +....-.++..|+.++.|..++..+-.++ .+-+-..+.+.++- ...+++...+-.. +.++ |..|.+.. 
T Consensus 117 ~~~~~~~~~g~aLA~~~D~~i~~~l~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~tg~~~d~~~~a~~i~~~i~~a~~~L 195 (347) T protein:vir:33 117 VRAEYTAQLGESLAMAADGAVLAELAGLVNLPDGSNENIEGLGK-PTVLTLVKPTTGSLTDPVELGKAIIAQLTIARASL 195 (347) T ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccccccccc-cccccccccccccccchhhhHHHHHHHHHHHHHHH Confidence 99998899999999999988876663222 11111122222221 2223333332222 2233 44566788 Q ss_pred HhhCCcc--ceEEechhhhhhhhhcCCCchh---hhHHhhhhhcceeeeeecccccceeeecCCeEEEe-----eChhh- Q lcl|NC_011269. 218 DQRQLDS--SRLLANPQEYRDLYRWDINTTG---WAFKDSVVAGERIVQFGEFQIGKSIIIPRGTVYLT-----PEPEF- 286 (333) Q Consensus 218 ~~~~L~a--t~il~~~~~~~Di~gw~~N~~~---~~~~DpV~~~e~il~~G~fgi~~skvlprgeiyvv-----adpE~- 286 (333) ++.+.|. -.+|++|+.|.+|.- ...|. |...+.+.++. |.+.==|.|.+|--||.+.+.-. +.+-| T Consensus 196 de~~VP~~gR~~vv~P~~y~~Ll~--~~~~~~~d~~~~~~~~~G~-V~~i~G~~V~~Sn~lp~~~~~~~~~~~~ag~~~~ 272 (347) T protein:vir:33 196 TKNYVPAADRTFYTTPDNYSAILA--ALMPNAANYQALLDPERGT-IRNVMGFEVVEVPHLTAGGAGDTREDAPADQKHA 272 (347) T ss_pred hhcCCCccCcEEEeCHHHHHHHhc--cccccccccccccccccce-eEEEeceeEEEecccccCcccccccccccccccc Confidence 8888863 578999999999986 22221 22222233322 22222255667777887643211 11110 Q ss_pred ---------hcccccccCc--------eecccc-chhh----hccceehhhhh--hhhhhccceEEEEecC Q lcl|NC_011269. 287 ---------LGVFPVMYSL--------DVEEDN-KVER----FNKGWVMDELV--GMAILNPRGIVILRKA 333 (333) Q Consensus 287 ---------~G~~pvR~~L--------~s~p~D-~~er----~~kGWvm~E~~--g~~i~N~~siv~~~~~ 333 (333) -+.+..+-|| .++-.| +.|+ ...||.|.-+. 
|-.+.||.++|-+..- T Consensus 273 ~~~~~~~~~~~a~~~~~gl~~h~~A~g~v~~~~~~~e~~r~~~~~~d~i~~~~~~G~~vlrP~~av~i~~~ 343 (347) T protein:vir:33 273 FPATSSTTVKVALDNVVGLFQHRSAVGTVKLKDLALERARRANYQADQIIAKYAMGHGGLRPEAAGAIVLP 343 (347) T ss_pred ccCCcccceeccccceeeeeecchhheeeeeeceeeeeccchhhhhHhhhhhhhcCCceecccceEEEecC Confidence 0111222222 222223 3333 35667765544 5568999999887443 No 122 >protein:vir:80128 Length: 466 # NCBI annotation: Phage capsid protein # Family: family:all:635 # MgeID: mge:1877 # MgeName: bacteriophage bv1 # Cross-refs: genbank:acc:YP_001425603;genbank:gi:155042936;genbank:GeneID:5469556 Probab=96.81 E-value=0.00013 Score=41.94 Aligned_cols=306 Identities=12% Similarity=0.061 Sum_probs=158.0 Q ss_pred CcccchhhhhhhhhhcccchHHHHHH---HHHHhhcchhcchHHHHHH--HHHHhcCch---hHHHHHHHHHHHHHHHHH Q lcl|NC_011269. 1 MTLPVAVGSGLGRFAKASDDYVADIV---EAKQRMGGRKLSAREKQAK--LAHILSDKV---GGIQRLGQSMIGPIQLQL 72 (333) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~ls~ee~~~L--m~~Al~~~E---g~~~aLg~~mA~pI~~q~ 72 (333) -.-|-...+-..++.+...++..... +.++|-. -.+..+++..+ +++++..+. |+--.+=+-+.+.|.+.+ T Consensus 95 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~vP~~~~~~i~~~l 173 (466) T protein:vir:80 95 NSEPAQVSGARTQQFVGGETRMKGFFRNMPYEQRAA-LIARSEVKEFLAQVRTLAQQKRAVSGAELTIPDVMLELLRDNM 173 (466) T ss_pred CchhHHHHhhhhhHHhhHHHHHHHHHHhhhhhhHHH-HHHHHHHHHHHHHHHHHhhhhhhhccccccccHHHHHHHHHhh Confidence 11122222212222222222222221 1111111 01111111110 011111111 110012345677888888 Q ss_pred hhhhhhhhhhhccccCCCcceeecCCCCccceEEEEcCCCcccceeecCceeeccceeeeccccccHHHhhhhcchhHHH Q lcl|NC_011269. 73 RYQGILRNVLLEDTLTPGVPIQYDVLDDLGQAYMLHGNEGEIRITPFEGKRIEVQLFRIASFPQIKKEDLYYLRSNIVEY 152 (333) Q Consensus 73 ~rqGi~RklL~~~TL~~G~~p~y~v~~~v~~a~~~~~~~G~i~~Q~i~~~ri~~P~f~Ivs~P~V~~~dl~~~~~~vle~ 152 (333) +..+-++++.+..++.... .+++..+. 
..+.|.+--++++.....=+.|++...++.+++.|..+=|.....|+..+ T Consensus 174 ~~~~~l~~~~~v~~~~g~~--~~~~~~~~-~~a~wv~E~~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~ds~~~l~~~ 250 (466) T protein:vir:80 174 HRYSKLISKVRLRPLKGTA--RQNIAGAI-PEGVWTEAVANLNELSLSFSQIEVDGYKVGGFIPIPNSTLEDSDLNLADE 250 (466) T ss_pred hhhhhhhhheeeeecCcee--EeeeecCC-cceeecccccccccccccccceeecceeeeeehhhhHHHHhcchHHHHHH Confidence 8888888888777765332 23332333 24457776666665554456789999999999999999999999999999 Q ss_pred HHHHHHHHHHHHhhhHHHHHHhhhhhhhhhhcccccccccCCCcceEE------------eeccccHHHHHHHH------ Q lcl|NC_011269. 153 TQDMTKQAIMRQEDSRLVTLLEAAAVSYRVVDSSAQPGVGALPNEITI------------AGSHLMPDDLYTAV------ 214 (333) Q Consensus 153 ~q~~A~qaIM~qED~~~~slle~~a~~~r~~~ssA~p~vg~~~N~i~i------------~~g~Lt~~~L~~a~------ 214 (333) ......++|..-+|.-+++ +-. ..+|. |.| |.+.. ..-.+++.++..+. T Consensus 251 i~~~la~~~~~~~~~ail~---G~G--------~~~P~-Gil-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 317 (466) T protein:vir:80 251 ILDAIGQAIGFALDKAILY---GTG--------TKMPV-GIV-TRLAQTTQPPNWGTKAPAWTNLSTTNLLKIDPTGKSA 317 (466) T ss_pred HHHHHHHHHHHHHhhheee---ccC--------CCCcc-eee-ecccccccccccccccccccccchhhhhhhhhhccch Confidence 9999999999999965554 211 11221 222 11100 00112222222111 Q ss_pred --------HH----HHhhCCccceEEechhhhhhhhhcCCCchhhhHHhhhhhcceee---eee-ccccc--ceeeecCC Q lcl|NC_011269. 215 --------TY----TDQRQLDSSRLLANPQEYRDLYRWDINTTGWAFKDSVVAGERIV---QFG-EFQIG--KSIIIPRG 276 (333) Q Consensus 215 --------t~----v~~~~L~at~il~~~~~~~Di~gw~~N~~~~~~~DpV~~~e~il---~~G-~fgi~--~skvlprg 276 (333) -. ...........+||+..|.-+++=... ...++.++. |+. 
++|.+ .+--+|-| T Consensus 318 ~~~~~~~~~~~~~~~~~~~~~~~~w~~~~~~~~~l~~~~~~--------~~~~g~~~~~~~~~~~i~G~pvv~s~~~~~~ 389 (466) T protein:vir:80 318 EEFFSELVLKLSKARANYSNGMKFWAMSSNTHAVLMSKAIT--------FNSAGALVASLNNTMPIVGGDIVILDFIPDN 389 (466) T ss_pred hhHHHHHHHHHHhhhccccCCceeEEecchhHHHhhccccc--------ccCCccccccCCCcccccccceeecCccCcc Confidence 00 111122333356677666666551100 011111111 111 23422 44456777 Q ss_pred eEEEeeChhhhcccccccCceeccccchh--hhccceehhhhhhhhhhccceEEEEecC Q lcl|NC_011269. 277 TVYLTPEPEFLGVFPVMYSLDVEEDNKVE--RFNKGWVMDELVGMAILNPRGIVILRKA 333 (333) Q Consensus 277 eiyvvadpE~~G~~pvR~~L~s~p~D~~e--r~~kGWvm~E~~g~~i~N~~siv~~~~~ 333 (333) ++ +..+... ..+-+|+++.+.-.+... +-..++...+=++..+.+|-++|.+.-+ T Consensus 390 ~~-~~g~~~~-y~i~~r~~~~i~~~~~~~f~~d~~~~r~~~r~dg~~~~~~afv~~~~~ 446 (466) T protein:vir:80 390 DI-IGGYGSL-YLLAERADIKLAQSEHVRFIEDQTVFKGTARYDGKPVFGEGFVAVNIA 446 (466) T ss_pred ce-eeecccc-EEEEeecceEEEechhhhhhcCcEEEEEEEEEccEEeccCceEEEEec Confidence 75 4455553 356789999887766322 2356788888899999999999999766 No 123 >protein:vir:94711 Length: 347 # NCBI annotation: capsid # Family: family:all:975 # MgeID: mge:1528 # MgeName: K1F # Cross-refs: genbank:acc:YP_338120;genbank:gi:77118198;genbank:GeneID:3707734 Probab=96.75 E-value=0.00036 Score=39.43 Aligned_cols=283 Identities=11% Similarity=0.118 Sum_probs=151.9 Q ss_pred HHHHHHHHHhhcchh---cchHHHHHHHHHHhcCchhHHHHHHHHHHHHHHHHHhhhhhhhhhhhccccCCCcceeecCC Q lcl|NC_011269. 22 VADIVEAKQRMGGRK---LSAREKQAKLAHILSDKVGGIQRLGQSMIGPIQLQLRYQGILRNVLLEDTLTPGVPIQYDVL 98 (333) Q Consensus 22 ~~~~~~~~~~~~~~~---ls~ee~~~Lm~~Al~~~Eg~~~aLg~~mA~pI~~q~~rqGi~RklL~~~TL~~G~~p~y~v~ 98 (333) .||+- -+++|-++ =++-++-+|-=+-+. +-+..+.-|+-+.+.+....++..|..-+||.. 
T Consensus 1 m~~~~--~~~~~t~~g~~~~~~d~~al~ik~f~--------------~eV~~~f~~~s~~~~~~~~r~i~~G~sv~i~~i 64 (347) T protein:vir:94 1 MANVP--GQKIGTDQGKGKSSSDALALFLKVFA--------------GEVLTAFTRRSVTADKHIVRTIQNGKSAQFPVM 64 (347) T ss_pred CCCCC--ccccccccccCCccccHHHHHHHHHh--------------HHHHHHHHHHHhhhcccccccccccceEEEecc Confidence 33332 23442111 012223333323332 222223345667778888889999999999999 Q ss_pred CCccceEEEEcCCCccc--ceeecCceeecc--ceeeeccccccHHHhhhhcchhHHHHHHHHHHHHHHHhhhHHHHHHh Q lcl|NC_011269. 99 DDLGQAYMLHGNEGEIR--ITPFEGKRIEVQ--LFRIASFPQIKKEDLYYLRSNIVEYTQDMTKQAIMRQEDSRLVTLLE 174 (333) Q Consensus 99 ~~v~~a~~~~~~~G~i~--~Q~i~~~ri~~P--~f~Ivs~P~V~~~dl~~~~~~vle~~q~~A~qaIM~qED~~~~slle 174 (333) -++..++ +++-+.+. .+.+...+..+. +.. .++-+|+--|=-|...|+..+.-+++..|+-++.|..++.++- T Consensus 65 G~~tv~~--~t~G~~l~~~~~~~~~~e~~itID~~~-~~~~~VddiD~~q~~~D~~~~~~~~~g~aLa~~~D~~i~~~~~ 141 (347) T protein:vir:94 65 GRTSGVY--LAPGERLSDKRKGIKHTEKVITIDGLL-TADVMIFDIEDAMNHYDVAGEYSNQLGEALAIAADGAVLAEMA 141 (347) T ss_pred cceeeee--ecCCCCcCCCCCCCCcceEEEEecchh-hhhHHhhhHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 9998888 44444442 234455553333 443 4455788778888999999999999999999999999887773 Q ss_pred hhhhhhhhhcccccccccCC--CcceEE--eecccc----HHH----HHHHHHHHHhhCCcc--ceEEechhhhhhhhhc Q lcl|NC_011269. 175 AAAVSYRVVDSSAQPGVGAL--PNEITI--AGSHLM----PDD----LYTAVTYTDQRQLDS--SRLLANPQEYRDLYRW 240 (333) Q Consensus 175 ~~a~~~r~~~ssA~p~vg~~--~N~i~i--~~g~Lt----~~~----L~~a~t~v~~~~L~a--t~il~~~~~~~Di~gw 240 (333) .++ -+..++....++. .+.+++ .++..+ +.+ |..|.+..++.+.|. -.+|++|+.|.+|.. 
T Consensus 142 ~~a----a~~~~~~~~~~g~~~~s~~~~~~~~~~~~~~~~~~~~~~~i~~a~~~Lde~~VP~~~R~~vv~P~~~~~Ll~- 216 (347) T protein:vir:94 142 ILC----NLPAASNENIAGLGTASVLEVGKKADLDTPAKLGEAIIGQLTIARAKLTSNYVPAGDRYFYTTPDNYSAILA- 216 (347) T ss_pred HHh----ccccccccccCCCcccceeeccccccccchhhhHHHHHHHHHHHHHHHhhcCCCCCCcEEEeCHHHHHHHhc- Confidence 222 1222222222222 233333 223322 233 445567788888764 378999999999975 Q ss_pred CCCchhhhHHhhhhhcceeeeeec------ccccceeeecCC--------eEEEeeChhhhc-------cccc----ccC Q lcl|NC_011269. 241 DINTTGWAFKDSVVAGERIVQFGE------FQIGKSIIIPRG--------TVYLTPEPEFLG-------VFPV----MYS 295 (333) Q Consensus 241 ~~N~~~~~~~DpV~~~e~il~~G~------fgi~~skvlprg--------eiyvvadpE~~G-------~~pv----R~~ 295 (333) +.++ .-.+-...+ .++.|. |.|.+|--+|.+ .=|-+..-++++ .|.. --+ T Consensus 217 --~~~~-~~~~~~~~~--~~~~G~Vg~i~G~~V~~Sn~lp~~~~t~~~~~~~~~~~aG~~~~~~~~~~~~~~~~~~~~~~ 291 (347) T protein:vir:94 217 --ALMP-NAANYAALI--DPETGNIRNVMGFVVVEVPHLVQGGAGETRGDDGITIASGQKHAFPATASSDVKVTMDNVVG 291 (347) T ss_pred --cchh-hhhhccccc--cccccceEEEeceEEEecCcccccccccccccCcceecCcccccccccchhhhcccccceeE Confidence 2221 111111111 223333 445666666642 222222222221 1110 000 Q ss_pred c--------eecccc-chhh----hccceehhhh--hhhhhhccceEEEEecC Q lcl|NC_011269. 296 L--------DVEEDN-KVER----FNKGWVMDEL--VGMAILNPRGIVILRKA 333 (333) Q Consensus 296 L--------~s~p~D-~~er----~~kGWvm~E~--~g~~i~N~~siv~~~~~ 333 (333) + -++-.| +.|. -..+|+|.-. .|-.+.||.+.|.+... 
T Consensus 292 l~~h~~A~~~v~~~~~~~e~~r~~~~~~d~i~~~~~~G~~~~rP~~a~~~~~~ 344 (347) T protein:vir:94 292 LFSHRSAVGTVKLRDLALERDRDVDAQGDLIVGKYAMGHGGLRPEAAGALVFS 344 (347) T ss_pred EEeehhhhhhhhcccccccchhchhhHHHHhhhhhhhcCcccccceeEEEEec Confidence 0 111222 2222 2557887654 56678999888776544 No 124 >protein:vir:80213 Length: 334 # NCBI annotation: capsid protein # Family: family:all:2806 # MgeID: mge:1879 # MgeName: LKA1 # Cross-refs: genbank:acc:YP_001522884;genbank:gi:158345177;genbank:GeneID:5687476 Probab=96.58 E-value=0.00027 Score=40.18 Aligned_cols=284 Identities=13% Similarity=0.127 Sum_probs=157.7 Q ss_pred hcchhcchHHHHHHHHHHhcCchhHHHHHH-HHHHHHHHHHHhhhhhhhhhhhccccCCCcceeecCCCCccceEEEEcC Q lcl|NC_011269. 32 MGGRKLSAREKQAKLAHILSDKVGGIQRLG-QSMIGPIQLQLRYQGILRNVLLEDTLTPGVPIQYDVLDDLGQAYMLHGN 110 (333) Q Consensus 32 ~~~~~ls~ee~~~Lm~~Al~~~Eg~~~aLg-~~mA~pI~~q~~rqGi~RklL~~~TL~~G~~p~y~v~~~v~~a~~~~~~ 110 (333) |---+=+..++. +-+-..+- .+|= +---+-|..+..|+-+.+.+....|++.|.--++|..-+...+| +++ T Consensus 1 m~~~~~~~~t~~-----~~~~~~~~-~~l~le~~~geV~~af~~~s~~~~~~~~r~i~~G~s~~~~~iG~~~~~~--~~~ 72 (334) T protein:vir:80 1 MTYPAANTHTRP-----GWGGANSD-VSLHIEEHLGLVDASFMYSSKFASWMNVRSLRGTNQLRVDRVGASTIAG--RKA 72 (334) T ss_pred CCCCcCCCcccc-----ccccccch-heehhhhhhhHHHHHHHHhhhhhccceeeeccccceEEEeeecceeeee--ecC Confidence 211100111100 01111110 0110 11223344567778888888888999999999999988888888 777 Q ss_pred CCcccceeecCceeecc--ceeeeccccccHHHhhhhcchhHHHHHHHHHHHHHHHhhhHHHHHHhhhh-hhhhhhcccc Q lcl|NC_011269. 111 EGEIRITPFEGKRIEVQ--LFRIASFPQIKKEDLYYLRSNIVEYTQDMTKQAIMRQEDSRLVTLLEAAA-VSYRVVDSSA 187 (333) Q Consensus 111 ~G~i~~Q~i~~~ri~~P--~f~Ivs~P~V~~~dl~~~~~~vle~~q~~A~qaIM~qED~~~~slle~~a-~~~r~~~ssA 187 (333) ...+..|++..++.++- +. ..++-+|+--|=-+...|+-.+.-+.+-.|+-+..|...+-+|--++ .+- .++. 
T Consensus 73 g~~l~~~~~~~~~~~l~ID~~-l~~~~~VddiD~~q~~~D~rse~~~~~G~aLA~~~D~~~~~~l~kaa~~~~---~~~~ 148 (334) T protein:vir:80 73 GEELVVQKNVSDKLNLTVDTV-LYARHFFDKFDEWTSNLDVRKETAREDGIALARQYDQACIIQLQKCGDFLA---PAHL 148 (334) T ss_pred CCCCCCCCcccCceEEEEeee-eehhhhHhhHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcc---cccc Confidence 77777777665544443 44 44556777777788899999999999999999999987775553333 111 0111 Q ss_pred cccc-cCCCcceEEee----ccccH----HHHHHHHHHHHhhCCcc-----ceEEechhhhhhhhhcCCCchh---hhH- Q lcl|NC_011269. 188 QPGV-GALPNEITIAG----SHLMP----DDLYTAVTYTDQRQLDS-----SRLLANPQEYRDLYRWDINTTG---WAF- 249 (333) Q Consensus 188 ~p~v-g~~~N~i~i~~----g~Lt~----~~L~~a~t~v~~~~L~a-----t~il~~~~~~~Di~gw~~N~~~---~~~- 249 (333) .|+- .+....+.+.| ..-++ +++..|++..++.+.+. -.++++|..|..|.- .+.+. |.. T Consensus 149 ~~~~~~G~~~~~~~~g~~~~~~~~~~~l~~a~~~a~~~L~e~dvp~~~~~~R~~vv~P~~y~~Ll~--~~r~~n~d~~~s 226 (334) T protein:vir:80 149 KPAFHDGILLPSTISGLAADAAADADVLVAAHRQGVEAMVFRDLGDQLMSEGVTLLDPVIFSFLLE--HDRLMNVEFGAK 226 (334) T ss_pred cccccCCcceeecccccccchhhhHHHHHHHHHHHHHHHHhcCCCCCcCCceEEEeChHHHHHHhc--ccccccceeccc Confidence 1111 11101111111 11223 34446777788888883 478999999999988 21110 010 Q ss_pred --HhhhhhcceeeeeecccccceeeecCCeEEEeeChhhhcccccccC--------------cee-cccc---ch--hhh Q lcl|NC_011269. 250 --KDSVVAGERIVQFGEFQIGKSIIIPRGTVYLTPEPEFLGVFPVMYS--------------LDV-EEDN---KV--ERF 307 (333) Q Consensus 250 --~DpV~~~e~il~~G~fgi~~skvlprgeiyvvadpE~~G~~pvR~~--------------L~s-~p~D---~~--er~ 307 (333) -.++.+++ +.+.==|.|.+|--+|.+.+.- ....|.|.+--| |.+ +-.| .. +.- T Consensus 227 ~~~~~~~~g~-i~~v~G~~V~~Sn~~P~~~~t~---~~~g~~~~~~agd~t~~~~~~~~~~Al~t~~~~~~~~e~~~~~~ 302 (334) T protein:vir:80 227 EGGNSFVGGR-IAMLNGVRVVETPRFPQSAITA---NALGADFNVTDAEVRRKMITFIPSMALISAQVHPVSAQFWEEKK 302 (334) T ss_pred ccccccccee-EEEEeceEEEeecCCCCccccc---cccccccccccccccceEEEEEeCceEEEEEEeecceeeeechh Confidence 12233222 2222226678888888775332 221333332222 111 1111 11 113 Q ss_pred ccceehhhhh--hhhhhccceEEEEecC Q lcl|NC_011269. 
308 NKGWVMDELV--GMAILNPRGIVILRKA 333 (333) Q Consensus 308 ~kGWvm~E~~--g~~i~N~~siv~~~~~ 333 (333) ..+|+|.-.. |-.+.||.+++++.-- T Consensus 303 ~~~d~i~~~~a~G~g~lRPeaa~vv~~~ 330 (334) T protein:vir:80 303 DFGHYLDTFQSYNIGQRRPDAVAVHDIT 330 (334) T ss_pred hHHHHHHHHHHcCCceeccceEEEEEEe Confidence 6899998765 5568999888877655 No 125 >protein:vir:10450 Length: 344 # NCBI annotation: major capsid protein # Family: family:all:975 # MgeID: mge:184 # MgeName: phiA1122 # Cross-refs: genbank:acc:NP_848297;genbank:gi:30387487;genbank:GeneID:1733971 Probab=96.55 E-value=0.00035 Score=39.50 Aligned_cols=285 Identities=13% Similarity=0.111 Sum_probs=150.8 Q ss_pred HHHhhcchhcch---------HHHHHHHHHHhcCchhHHHHHHHHHHHHHHHHHhhhhhhhhhhhccccCCCcceeecCC Q lcl|NC_011269. 28 AKQRMGGRKLSA---------REKQAKLAHILSDKVGGIQRLGQSMIGPIQLQLRYQGILRNVLLEDTLTPGVPIQYDVL 98 (333) Q Consensus 28 ~~~~~~~~~ls~---------ee~~~Lm~~Al~~~Eg~~~aLg~~mA~pI~~q~~rqGi~RklL~~~TL~~G~~p~y~v~ 98 (333) --+-|+|..... -+.-+|.-+.+ -+-+-.+..|+-+.|.+....|++-|.--+||.. T Consensus 1 ma~~~~~~~~n~~~~~~~~~~~~~~al~ie~~--------------~geV~~~f~~~s~~~~~~~~r~i~~g~s~~~~~i 66 (344) T protein:vir:10 1 MANMTGGQQLGTNQGKDVMAAGDKLALFLKVF--------------GGEVLTAFARTSVTTSRHMVRSISSGKSAQFPVL 66 (344) T ss_pred CccccccccCCcccCCccCCccchhHHHHHHH--------------HHHHHHHHHHHhhhcccceeeeecccceEEEEee Confidence 222233322111 12222222222 2334456777788888888889999999999999 Q ss_pred CCccceEEEEcCCCcccceeecCce--eeccceeeeccccccHHHhhhhcchhHHHHHHHHHHHHHHHhhhHHHHHHhhh Q lcl|NC_011269. 
99 DDLGQAYMLHGNEGEIRITPFEGKR--IEVQLFRIASFPQIKKEDLYYLRSNIVEYTQDMTKQAIMRQEDSRLVTLLEAA 176 (333) Q Consensus 99 ~~v~~a~~~~~~~G~i~~Q~i~~~r--i~~P~f~Ivs~P~V~~~dl~~~~~~vle~~q~~A~qaIM~qED~~~~slle~~ 176 (333) .++..+|+.-|.+=+-..|++...+ |.+-+..+ ++-+|+--|=-|...|+..+.-+++-.|+-++.|..++..+-.+ T Consensus 67 G~~~~~~~~~G~~l~~t~~~~~~~e~~l~ID~~~y-~~~~VdDiD~~q~~~D~r~~~~~~~G~aLA~~~D~~i~~~la~~ 145 (344) T protein:vir:10 67 GRTQAAYLAPGENLDDIRKDIKHTEKVITIDGLLT-ADVLIYDIEDAMNHYDVRSEYTSQLGESLAMAADGAVLAEIAGL 145 (344) T ss_pred ceeEEEeeecCCCCCCCCCCcccceEEEEEcchhh-hhhhhhhHHHHhcCcchHHHHHHHHHHHHHHHHHHHHHHHHHhh Confidence 9998888554443222224443333 44444444 44467777778889999999999999999999999887766322 Q ss_pred hhhhhhhcccccccccCCCcceE--EeeccccH-----HH----HHHHHHHHHhhCCccc--eEEechhhhhhhhhcC-C Q lcl|NC_011269. 177 AVSYRVVDSSAQPGVGALPNEIT--IAGSHLMP-----DD----LYTAVTYTDQRQLDSS--RLLANPQEYRDLYRWD-I 242 (333) Q Consensus 177 a~~~r~~~ssA~p~vg~~~N~i~--i~~g~Lt~-----~~----L~~a~t~v~~~~L~at--~il~~~~~~~Di~gw~-~ 242 (333) +- +.-.+.+.|+.+.-.+.+. ..+..+++ ++ |..|.+..++.+.|.. .+|++|+.|..|.-=. + T Consensus 146 a~--~~~~~~~~~~g~~~~~~~~~~~~~~~~t~~~~~~~~~~~~i~~a~~~Lde~~VP~~gR~~vv~P~~y~~Ll~~~~~ 223 (344) T protein:vir:10 146 CN--VESQYNENITGLGTATVIETTQDKTTLTDQVALGKEIIAALTKARAALTKNYVPSSDRVFYCDPDSYSAILAALMP 223 (344) T ss_pred hc--cccccccccccccccceeecccccccccchhhhHHHHHHHHHHHHHHHhhcCCCccCCEEEeChHHHHHHhhcccc Confidence 20 0011223333211111222 12223333 33 5556788888898865 6778999999987510 1 Q ss_pred CchhhhHHhhhhhcceeeeeecccccceeeecCCeE-----------EEe-------------------eChhhhccccc Q lcl|NC_011269. 243 NTTGWAFKDSVVAGERIVQFGEFQIGKSIIIPRGTV-----------YLT-------------------PEPEFLGVFPV 292 (333) Q Consensus 243 N~~~~~~~DpV~~~e~il~~G~fgi~~skvlprgei-----------yvv-------------------adpE~~G~~pv 292 (333) |...|...+...+| .+.+.-=|.|.+|--+|-|-+ |.. --|+-+|..-. 
T Consensus 224 ~~~~~~~~~~~~~G-~V~~v~G~~V~~Sn~lp~~~~~~~~~~~tg~~~~~~~~~~~~~~~~~s~~~~l~~h~~A~~~v~~ 302 (344) T protein:vir:10 224 NAANYAALIDPEKG-SIRNVMGFEVVEVPHLTAGGAGTSREGTTGQKHAFPATKSGNDKVAKDNVIGLFMHRSAVGTVKL 302 (344) T ss_pred cccccccccceeee-EEEEEeceEEEeccccccccCCcccccccCccccccCCcccceeeecceeEEEeechhhhhhhhh Confidence 11112222223322 232222245566666664311 100 01111111000 Q ss_pred ccCceeccccchhhhccceehhhhh--hhhhhccceE--EEEecC Q lcl|NC_011269. 293 MYSLDVEEDNKVERFNKGWVMDELV--GMAILNPRGI--VILRKA 333 (333) Q Consensus 293 R~~L~s~p~D~~er~~kGWvm~E~~--g~~i~N~~si--v~~~~~ 333 (333) .++++|... .+-..||+|.-.. |-.+.||.+. |.|+.. T Consensus 303 -~~~~~e~~r--~~~~~~d~i~g~~~~G~~vlRPe~a~~v~~~~~ 344 (344) T protein:vir:10 303 -RDLALERAR--RANFQADQIIAKYAMGHGGLRPEAAGAVVFKTK 344 (344) T ss_pred -ccceeeccc--chhHHHHHHHHHhhcccceecccceEEEEeecC Confidence 011222211 1235678887655 4458999754 666555 No 126 >protein:vir:100632 Length: 381 # NCBI annotation: 77ORF006 # Family: family:all:635 # MgeID: mge:1476 # MgeName: 77 # Cross-refs: genbank:acc:NP_958606;genbank:gi:41189521;genbank:GeneID:2743778 Probab=96.49 E-value=0.00034 Score=39.63 Aligned_cols=302 Identities=11% Similarity=0.015 Sum_probs=171.3 Q ss_pred Ccccc---------------------------------hhhhhhhhhhcccchHHHHHHHHHHhhcchhcchHHHHHH-- Q lcl|NC_011269. 1 MTLPV---------------------------------AVGSGLGRFAKASDDYVADIVEAKQRMGGRKLSAREKQAK-- 45 (333) Q Consensus 1 ~~~~~---------------------------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ls~ee~~~L-- 45 (333) |++=. 
+...-..+ ++...|-.- .-.+.|++.|+++|++.+ T Consensus 1 m~~kl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~e~~~~---~~~~~~~~~l~~~e~~~~~~ 75 (381) T protein:vir:10 1 MTINLSETFANAKNEFINAVNNGEPQERQNELYGDMINQLFEETKL--QAKAEAERV---SSLPKSAQTLSANQRNFFMD 75 (381) T ss_pred CchhHHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHhhhhhHHH--HHHHHHHHH---HHhcccccccCHHHHHHHHH Confidence 11000 00000000 011111110 123468899999998853 Q ss_pred HHHHhcCchhHHHHHHHHHHHHHHHHHhhhhhhhhhhhccccCCCcceeecCCCCccceEEEEcCCCccccee-ecCcee Q lcl|NC_011269. 46 LAHILSDKVGGIQRLGQSMIGPIQLQLRYQGILRNVLLEDTLTPGVPIQYDVLDDLGQAYMLHGNEGEIRITP-FEGKRI 124 (333) Q Consensus 46 m~~Al~~~Eg~~~aLg~~mA~pI~~q~~rqGi~RklL~~~TL~~G~~p~y~v~~~v~~a~~~~~~~G~i~~Q~-i~~~ri 124 (333) +... .+..|+ -.+=..+.+.|.+.+..+...|++.+..+.. | .-.+++..+.. .+.|.+-.|.+..+. ..=+.| T Consensus 76 ~~~~-t~~~Gg-~lvP~~~~~~I~~~l~~~spir~~a~v~~~~-~-~~~i~~~~~~~-~a~W~~e~~~~~~~~~~~f~~i 150 (381) T protein:vir:10 76 INKS-VGYKEE-KLLPEETIDRIFEDLTTNHPLLADLGIKNAG-L-RLKFLKSETSG-VAVWGKIYGEIKGQLDAAFSEE 150 (381) T ss_pred Hhhc-CCCCCc-eecCHHHHHHHHHHHHhhcceeeeeeeEecC-c-ceEEEeecCCc-ceEEeecccccccccCccceeE Confidence 2222 234454 2566889999999999999999998776653 3 33455544443 444666555554432 222689 Q ss_pred eccceeeeccccccHHHhhhhcchhHHHHHHHHHHHHHHHhhhHHHHHHhhhhhhhhhhcccccccccCCC--------- Q lcl|NC_011269. 125 EVQLFRIASFPQIKKEDLYYLRSNIVEYTQDMTKQAIMRQEDSRLVTLLEAAAVSYRVVDSSAQPGVGALP--------- 195 (333) Q Consensus 125 ~~P~f~Ivs~P~V~~~dl~~~~~~vle~~q~~A~qaIM~qED~~~~slle~~a~~~r~~~ssA~p~vg~~~--------- 195 (333) +++..++.++|.|..+=|....+|+-.+.-....++|-+.||.-++ .+- -+.+|. |.+. 
T Consensus 151 ~l~~~kl~a~i~is~elL~Ds~~~le~~i~~~la~~~a~~~~~afi---~Gd--------G~~qP~-Gil~~~~~~~~~~ 218 (381) T protein:vir:10 151 TAIQNKLTAFVVLPKDLNDFGPAWIERFVRVQIEEAFAVALETAFL---KGT--------GKDQPI-GLNRQVQKGVSVT 218 (381) T ss_pred eecceeEEeeccccHHHHhccHHHHHHHHHHHHHHHHHHHhhceeE---ecc--------cCCCce-eeeecCCcccccc Confidence 9999999999999999999999999999999999999999994332 111 111221 1110 Q ss_pred ---cceEEeeccccHHHHH-------HHHHHHHhhCC----c---cceEEechhhhhhhhhcCCCchhhhHHhhhhhcce Q lcl|NC_011269. 196 ---NEITIAGSHLMPDDLY-------TAVTYTDQRQL----D---SSRLLANPQEYRDLYRWDINTTGWAFKDSVVAGER 258 (333) Q Consensus 196 ---N~i~i~~g~Lt~~~L~-------~a~t~v~~~~L----~---at~il~~~~~~~Di~gw~~N~~~~~~~DpV~~~e~ 258 (333) ++...+.+-+|-.+.. ........|.. + --..+||+.-|.++++ .- ..+|+--++-- T Consensus 219 ~g~~~~~~~~~~~t~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~vmn~~t~~~l~~--~~----~~~~~~G~~v~ 292 (381) T protein:vir:10 219 DGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGNVTMVVNPSDAFEVQA--QY----THLNANGVYVT 292 (381) T ss_pred ccccccccccccccccchhhHHHHHHHHHHhhhhhhccccccccCceEEEEchhhHHhhcc--cc----ccCCCCCceee Confidence 1111222333333322 22222222221 1 1136788888888875 11 11222111110 Q ss_pred eeeeecccccceeeecCCeEEEeeChhhhcccccccCceeccccch--hhhccceehhhhhhhhhhccceEEEEecC Q lcl|NC_011269. 259 IVQFGEFQIGKSIIIPRGTVYLTPEPEFLGVFPVMYSLDVEEDNKV--ERFNKGWVMDELVGMAILNPRGIVILRKA 333 (333) Q Consensus 259 il~~G~fgi~~skvlprgeiyvvadpE~~G~~pvR~~L~s~p~D~~--er~~kGWvm~E~~g~~i~N~~siv~~~~~ 333 (333) .+.+|. .|..+--||-|.|+ ..|.-. =.+-+|.|+.+...|.. .+-..+....+=++-.+.+|.+++++--. 
T Consensus 293 ~lp~g~-~vv~~~~~p~~~i~-fGDfs~-Y~i~~r~~~~i~~~~~~~~~~d~~~f~a~~r~dG~~~~~~A~~v~~l~ 366 (381) T protein:vir:10 293 ALPFNL-NVIESTVQEAGKVL-TYVKGL-YDGYLAGGINVQKFKETLALDDMDLYTAKQFAYGKAKDNKVAAVWKLD 366 (381) T ss_pred cCCCCc-eeEEcCCCCcCcEE-EEEccc-EEEEEecccEEEeechhhhhcCceEEEEEEEEcCEEecCCcEEEEEEe Confidence 111111 13346678888864 566654 36678999998887732 22356888888888899999998885433 No 127 >protein:vir:6324 Length: 335 # NCBI annotation: capsid protein # Family: family:all:2806 # MgeID: mge:132 # MgeName: phiKMV # Cross-refs: genbank:acc:NP_877471;genbank:gi:33300843;uniprot:Q7Y2D3;genbank:GeneID:1482613 Probab=96.26 E-value=0.00042 Score=39.09 Aligned_cols=287 Identities=14% Similarity=0.136 Sum_probs=163.2 Q ss_pred hcchhcchHHHHHHHHHHhcCchhHHHHHHHHHHHHHHHHHhhhhhhhhhhhccccCCCcceeecCCCCccceEEEEcCC Q lcl|NC_011269. 32 MGGRKLSAREKQAKLAHILSDKVGGIQRLGQSMIGPIQLQLRYQGILRNVLLEDTLTPGVPIQYDVLDDLGQAYMLHGNE 111 (333) Q Consensus 32 ~~~~~ls~ee~~~Lm~~Al~~~Eg~~~aLg~~mA~pI~~q~~rqGi~RklL~~~TL~~G~~p~y~v~~~v~~a~~~~~~~ 111 (333) |- -|++-++..- ..-++... + -| +---+-+..+.-|+-+.+.+....|+.-|.--+||..-+...+| +++- T Consensus 1 ms--~~~~~tr~~~--~~s~~d~a-l-~l-e~f~geV~~af~~~s~~~~~~~~rti~~g~s~~~~~iG~~~~~~--~~pG 71 (335) T protein:vir:63 1 MS--FLNDLTRPNY--AGKNADVD-I-HL-EEHLGIVDKHFAYTSKFAPLMNIRDLRGSNVVRLDRLGNVEAKG--RRAG 71 (335) T ss_pred CC--Ccccchhhhc--ccccchhh-e-eh-hhhhhhHHHHHHhhhhhccccceeeeccceeEEEeeeeeeeeec--ccCC Confidence 11 1122221111 00000000 0 00 11223455567778888888889999999999999999998888 5555 Q ss_pred CcccceeecCc--eeeccceeeeccccccHHHhhhhcchhHHHHHHHHHHHHHHHhhhHHHHHHhhhhhhhhhhcccc-c Q lcl|NC_011269. 112 GEIRITPFEGK--RIEVQLFRIASFPQIKKEDLYYLRSNIVEYTQDMTKQAIMRQEDSRLVTLLEAAAVSYRVVDSSA-Q 188 (333) Q Consensus 112 G~i~~Q~i~~~--ri~~P~f~Ivs~P~V~~~dl~~~~~~vle~~q~~A~qaIM~qED~~~~slle~~a~~~r~~~ssA-~ 188 (333) ..+..|++..+ .|.+=++- +|+-+|+-.|=-+...|+..+.-.+.-.|+-+..|...+-.+=.+| |....+- . 
T Consensus 72 ~~l~~~~~~~~k~~itVD~ll-~a~~~I~dlDe~~~~yDvRse~s~e~G~aLA~~~D~~~~~~i~~aa---~~~a~~~~~ 147 (335) T protein:vir:63 72 EELERSRVVNDKWNLTVDTLL-YLRHQFDHQDEWTQSFDMRKEVAELDGQELARKFDQACLIQVIKAA---AMDAPVDLE 147 (335) T ss_pred cCcCCCCccccceEEEeccee-echhhhhhHHHHhcCchhHHHHHHHHHHHHHHHHHHHHHHHHHhhc---cccCccccC Confidence 55555555444 34555554 5666688888888899999999999999999999988775554444 2222111 1 Q ss_pred cc--ccCCCcceEEeecccc--HHH----HHHHHHHHHhhCCcc-----ceEEechhhhhhhhhcC--CCch-h-hhHHh Q lcl|NC_011269. 189 PG--VGALPNEITIAGSHLM--PDD----LYTAVTYTDQRQLDS-----SRLLANPQEYRDLYRWD--INTT-G-WAFKD 251 (333) Q Consensus 189 p~--vg~~~N~i~i~~g~Lt--~~~----L~~a~t~v~~~~L~a-----t~il~~~~~~~Di~gw~--~N~~-~-~~~~D 251 (333) ++ -|.. ..+.++|+... +.. +..|.+..++.++|- -.++++|+.|.-|.-=+ .|.+ + .+..+ T Consensus 148 ~~~~~G~~-~~~~~tg~~~~~~~~~l~~a~~~a~~~L~e~dVP~~~~~dr~~vv~P~~y~~Ll~~~~l~n~~~~~s~~~~ 226 (335) T protein:vir:63 148 DAFSPGVL-EKLDLTGLTAKQAADKIVRMHRRVVETFIDRDLGDAVYSEGLTPMSPRVFSLLLEHDKLMNVEYQATGATN 226 (335) T ss_pred CCcCCCcc-eeeeeccCcccccHHHHHHHHHHHHHHHHhccCCCcccCceEEEeChHHHHHHhccccccccccccccccc Confidence 22 1333 44556665542 344 447778888888873 35899999999987610 1111 1 11123 Q ss_pred hhhhcceeeeeecccccceeeecCCeEEEeeC-hh---hhcccccccCc----------eecccc-chhh--hccceehh Q lcl|NC_011269. 252 SVVAGERIVQFGEFQIGKSIIIPRGTVYLTPE-PE---FLGVFPVMYSL----------DVEEDN-KVER--FNKGWVMD 314 (333) Q Consensus 252 pV~~~e~il~~G~fgi~~skvlprgeiyvvad-pE---~~G~~pvR~~L----------~s~p~D-~~er--~~kGWvm~ 314 (333) ....++ +.+.-=|.|.+|--+|.+.+.--+. .+ +-|.+..+-++ +.+++- +.++ -..+|+|. T Consensus 227 ~~~~g~-v~~v~Gv~V~~sn~lP~~~~t~~~lg~a~n~~~~d~~~~~~~~~~~~Al~t~~~~~vt~e~~~~~~~~~~~i~ 305 (335) T protein:vir:63 227 DYVKSR-VAILNGVKVLETPRFATKAIAAHPLGRHFNVSAEESERQIALFLPSKTLITAQVAPVQAKLWEDNEKFSWVLD 305 (335) T ss_pred cccCce-eEEeeceEEEeeccCCCCCcccccccccCCccccccceeEEEEEecceEEEEEEeecccceeeccchhhHHhH Confidence 333343 3333336688888888876442221 11 11111111111 111111 1111 35789998 Q ss_pred hhhhh--hhhccceEEEEecC Q lcl|NC_011269. 
315 ELVGM--AILNPRGIVILRKA 333 (333) Q Consensus 315 E~~g~--~i~N~~siv~~~~~ 333 (333) -...| .+.||.+.+.+.-- T Consensus 306 ~~~a~G~g~lRPe~a~~i~~t 326 (335) T protein:vir:63 306 TFQMYNIGARRPDTAGAIELK 326 (335) T ss_pred HHHHcCCcccccceEEEEEEc Confidence 76655 57899998888754 No 128 >protein:vir:94933 Length: 330 # NCBI annotation: putative phage structural protein # Family: family:all:1120 # MgeID: mge:1538 # MgeName: Xp15 # Cross-refs: genbank:acc:YP_239278;genbank:gi:66392060;genbank:GeneID:5076578 Probab=96.06 E-value=6e-05 Score=43.71 Aligned_cols=296 Identities=13% Similarity=0.125 Sum_probs=145.6 Q ss_pred CcccchhhhhhhhhhcccchHHHHHHHHHHhhcchhcchHHHHHHHHHHhcCchhHHHHHHH-HHHHHHHHHH-hhhhhh Q lcl|NC_011269. 1 MTLPVAVGSGLGRFAKASDDYVADIVEAKQRMGGRKLSAREKQAKLAHILSDKVGGIQRLGQ-SMIGPIQLQL-RYQGIL 78 (333) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ls~ee~~~Lm~~Al~~~Eg~~~aLg~-~mA~pI~~q~-~rqGi~ 78 (333) .|-| --|||......| -+|| |. ++...|+. .|++ ....+|-+.+ ++..++ T Consensus 5 ~~~~-----~~~~~~~~~~~~-------------p~l~-------m~-alTLaea~--~l~~d~~~~~VIE~l~~~s~iL 56 (330) T protein:vir:94 5 CTPP-----LRGRWRTLTHQF-------------PELK-------MP-TVTLAESA--KLSQDHLVSGLIETIVEVNPLY 56 (330) T ss_pred cCCc-----cccceeehhccc-------------cccc-------hh-hhhhhHHh--hcCchhhHHHHHHhhhccchHH Confidence 1111 123443222111 0111 11 11111111 1111 1122333333 344555 Q ss_pred hhhhhccccCCCcceeecCCCCccceEEEEcCCCcccceee--cCceeeccceeeeccccccHHHhhhhcchhHHHHHHH Q lcl|NC_011269. 79 RNVLLEDTLTPGVPIQYDVLDDLGQAYMLHGNEGEIRITPF--EGKRIEVQLFRIASFPQIKKEDLYYLRSNIVEYTQDM 156 (333) Q Consensus 79 RklL~~~TL~~G~~p~y~v~~~v~~a~~~~~~~G~i~~Q~i--~~~ri~~P~f~Ivs~P~V~~~dl~~~~~~vle~~q~~ 156 (333) +.+-- +.+. |-..+|....+...+.+.-=+.|.-++.+. ......+=.++=.+..--.+.||+....|...+--.. 
T Consensus 57 ~~lpf-~~ve-~~~~~~~r~~~lp~a~~r~~n~~~~~~~~~Tf~q~t~~l~~l~~~~~Vd~~iadl~g~~~d~~~~q~~~ 134 (330) T protein:vir:94 57 EMMPF-TEIE-GNALAYNRENVLGDVQFLAVGGTITAKNPATFTKVTSELTTLIGDAEVNGLIQATRSDFMDQTSVQVAS 134 (330) T ss_pred hhccc-cccc-CCcceeeeeecCCcceeeeccccccccCcceeeeeeechhhhhhhHHHHHHHHHhcCCHHHHHHHHHHH Confidence 44432 2233 334667664554444544445552223322 2122222233333344445667776666666555555 Q ss_pred HHHHHHHHhhhHHHHHHhhhhhhh-hhhcccccccccCCCcceEE--eeccccHHHHHHHHHHHHhhCCccceEEechhh Q lcl|NC_011269. 157 TKQAIMRQEDSRLVTLLEAAAVSY-RVVDSSAQPGVGALPNEITI--AGSHLMPDDLYTAVTYTDQRQLDSSRLLANPQE 233 (333) Q Consensus 157 A~qaIM~qED~~~~slle~~a~~~-r~~~ssA~p~vg~~~N~i~i--~~g~Lt~~~L~~a~t~v~~~~L~at~il~~~~~ 233 (333) ..+++...+...+|+= .++.-.+ =|-++ .. -.|.|.- .||-+|+++|-++...|-+.+-..+.++||..- T Consensus 135 ~ieal~~~~e~~linG-Ds~~~~F~GL~~~-~~-----~~q~i~tg~~gg~~T~d~LDeLl~~v~~~~g~~~~~l~n~a~ 207 (330) T protein:vir:94 135 KAKSIGRQYQASMITG-DGTGNSFQGMMGL-VA-----ASQTISAGANGGTLTFELLDQLLDLVKDKDGQVDYLMSSFAM 207 (330) T ss_pred HHHHHHHHHHHHhhcc-CCCCccccchhhc-CC-----cccEEecCCCCCCCCHHHHHHHHHHhcCCCCCCcEEEechhH Confidence 5556655444333330 0010000 00000 00 1155554 579999999999999995555568899999998 Q ss_pred hhhhhhcCCCchhhhHH-hhhh-hcceeeeeecccccceeeecCC----------eEEEee--Ch----hhhccccc-cc Q lcl|NC_011269. 234 YRDLYRWDINTTGWAFK-DSVV-AGERIVQFGEFQIGKSIIIPRG----------TVYLTP--EP----EFLGVFPV-MY 294 (333) Q Consensus 234 ~~Di~gw~~N~~~~~~~-DpV~-~~e~il~~G~fgi~~skvlprg----------eiyvva--dp----E~~G~~pv-R~ 294 (333) ++=|..+.-....+..- +++. -|-.+..++-.-|...-++|-+ .||.+- +- -.+|.++. 
.+ T Consensus 208 ~r~I~a~~R~~~~~~v~~~~~~~~G~~v~~~~GvPi~~~d~ip~~~~~~~~~~ttsIyav~~G~~~~~qgV~Gl~~~g~~ 287 (330) T protein:vir:94 208 RRKYFSLLRALGGAAIGEVMTLPSGRQIPTYRGVPWFVNDFIPSNMTQGTATNATAIFAGTFDDGSNKYGIAGLTARGSA 287 (330) T ss_pred HHHHHHHHHhccCCCCCCcccccCCCEEeeeCCeEEEecccccCCCCcccCCCceeEEEEeecccccccceEeecCCCCC Confidence 88888753322211110 0111 1222333433334455566664 467776 43 34777766 46 Q ss_pred Cceeccccchhh-hccceehhhhhhhhhhccceEEEEecC Q lcl|NC_011269. 295 SLDVEEDNKVER-FNKGWVMDELVGMAILNPRGIVILRKA 333 (333) Q Consensus 295 ~L~s~p~D~~er-~~kGWvm~E~~g~~i~N~~siv~~~~~ 333 (333) ||.++....... -.+=|.+.=-.|+++.||.++-.|++- T Consensus 288 glsVr~~G~~~~k~v~~~~v~~y~~~av~~~~a~~~L~~V 327 (330) T protein:vir:94 288 GLRVQNVGAKENADETITRVKMYCGFANFSQLGLAAIKGL 327 (330) T ss_pred cceeeeCCCccccceeeEEEEEeeeeEEechhheeeeccc Confidence 999988774443 345566666799999999999999998 No 129 >protein:vir:99075 Length: 392 # NCBI annotation: gp30 # Family: family:all:10837 # MgeID: mge:1671 # MgeName: Wildcat # Cross-refs: genbank:acc:YP_655895;genbank:gi:109521467;genbank:GeneID:4158040 Probab=96.04 E-value=0.00094 Score=37.18 Aligned_cols=256 Identities=13% Similarity=0.067 Sum_probs=129.0 Q ss_pred HHHHhcCchhHHHHHHHHHHHHHHHHHhhhhh-hhhhhhccccCCCcceeecCCCCccceEEE---EcCCCcccceeecC Q lcl|NC_011269. 46 LAHILSDKVGGIQRLGQSMIGPIQLQLRYQGI-LRNVLLEDTLTPGVPIQYDVLDDLGQAYML---HGNEGEIRITPFEG 121 (333) Q Consensus 46 m~~Al~~~Eg~~~aLg~~mA~pI~~q~~rqGi-~RklL~~~TL~~G~~p~y~v~~~v~~a~~~---~~~~G~i~~Q~i~~ 121 (333) |++.+=.||=| ++.+...++.+|-+-.+ -|.|.-+..=.+|.-...++|+.+..+..- -...+.+..|.+.. 
T Consensus 1 Ma~~~~~p~~~----a~~~l~~l~~~lv~~~lv~~~~~~~~~~~~GdtV~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 76 (392) T protein:vir:99 1 MANAFSKPTAV----VDTAIQMLQNELILTNLVWLNGIGDFAHKFNDTITVRVPAPSRGHTRKLRGAGAERNLTVSDFTE 76 (392) T ss_pred CccccccHHHH----HHHHHHHHHhhccchhhhccccccccccCCCCeEEEeecccccceeeeccccccCCccccccccc Confidence 77777667765 33344445555443221 122222111134655666666665443321 12344577777776 Q ss_pred ceeeccc-eeeeccccccHHHhhhhcchhHHHHHHHHHHHHHHHhhhHHHHHHhhhhhhhhhhcccccccccCCCcceEE Q lcl|NC_011269. 122 KRIEVQL-FRIASFPQIKKEDLYYLRSNIVEYTQDMTKQAIMRQEDSRLVTLLEAAAVSYRVVDSSAQPGVGALPNEITI 200 (333) Q Consensus 122 ~ri~~P~-f~Ivs~P~V~~~dl~~~~~~vle~~q~~A~qaIM~qED~~~~slle~~a~~~r~~~ssA~p~vg~~~N~i~i 200 (333) ..+++.. -.....-.|+-.|..+...|+.++....+.+++-..-|..+++++.++. +.... T Consensus 77 ~~~~~~id~~k~~~~~i~d~e~~~~~~~~~~~~~~~a~~ala~~vd~~i~~~~~~a~------------------~~~~~ 138 (392) T protein:vir:99 77 DSFPVTLTDVAYHLGVLTDEELTFDLESFATQILPRQVRGVADILEEGVRDMIVGAP------------------YEAAG 138 (392) T ss_pred ceEEEEEeeeeecceeechHHHhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHhccc------------------ccccc Confidence 7777763 3334444577788888999999999999999999999988888875543 22222 Q ss_pred eeccccH----HHHHHHHHHHHhhCCccc-eEEechhhhhhhhhcCCCchh-hhHHh--hhhhcceeeeeecccccceee Q lcl|NC_011269. 201 AGSHLMP----DDLYTAVTYTDQRQLDSS-RLLANPQEYRDLYRWDINTTG-WAFKD--SVVAGERIVQFGEFQIGKSII 272 (333) Q Consensus 201 ~~g~Lt~----~~L~~a~t~v~~~~L~at-~il~~~~~~~Di~gw~~N~~~-~~~~D--pV~~~e~il~~G~fgi~~skv 272 (333) ....+++ +++-.|.+..++.+.|.. ++++.|+.|..|...+.-... +...+ ...+.+.+-+..-|.+..+-. 
T Consensus 139 ~~~~~~~~~~~~~i~~a~~~L~~~~vP~~R~~vv~p~~~~~l~~~~~~~~~~~~g~~~~~~l~~G~vg~i~G~~v~~s~~ 218 (392) T protein:vir:99 139 AVHEVAPDEFFKGVNGARRALNELYIPQGRVLVVGTAVTEQILNDDRFIKYESQGQSAVSALQEARLGRIYGYEIVESTL 218 (392) T ss_pred cccccChhhhHHHHHHHHHHHhhcCCCCCCEEEEcHHHHHHHhcccceeecccccchhhhhhhcceeeeeeeeEEEeecc Confidence 3333444 456677788888888743 689999999999873211000 01111 111122232333356777878 Q ss_pred ecCCeEEEeeChhhhcccccccCceeccccchhhhccceehhhhhhhhhhc----cceEEEEecC Q lcl|NC_011269. 273 IPRGTVYLTPEPEFLGVFPVMYSLDVEEDNKVERFNKGWVMDELVGMAILN----PRGIVILRKA 333 (333) Q Consensus 273 lprgeiyvvadpE~~G~~pvR~~L~s~p~D~~er~~kGWvm~E~~g~~i~N----~~siv~~~~~ 333 (333) +|.++.+....-. . .+-. +.-+.+.+... +....-.+- +++ .......... T Consensus 219 ~~~~t~~a~~~~a-~-~~at--~a~v~~~~~~~-----~~s~s~~~~-v~~~~~~~~~~t~~s~~ 273 (392) T protein:vir:99 219 IPHGDAYLYHPTA-F-IMAT--RAPAPPMGAVR-----STAISGDQR-IAMRWLVDYDSTITSNR 273 (392) T ss_pred cccccceeeeccc-c-cccc--ccccccccccc-----eeEEecccc-eecceeecccceeeccc Confidence 8877764321111 0 0111 11112222111 101100000 000 0000111111 No 130 >protein:vir:3158 Length: 321 # NCBI annotation: capsid protein gpE # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:316 # MgeName: PhiCh1 # Cross-refs: genbank:acc:NP_665929;genbank:gi:22091115;genbank:GeneID:951342 Probab=95.80 E-value=0.00093 Score=37.20 Aligned_cols=277 Identities=14% Similarity=0.104 Sum_probs=150.3 Q ss_pred cchHHHHHHHHHHh-------cCchhHHHHHHHHHHHHHHHHHhhhhhhhhhhhccccC--CCcceeecCCCCccceEEE Q lcl|NC_011269. 37 LSAREKQAKLAHIL-------SDKVGGIQRLGQSMIGPIQLQLRYQGILRNVLLEDTLT--PGVPIQYDVLDDLGQAYML 107 (333) Q Consensus 37 ls~ee~~~Lm~~Al-------~~~Eg~~~aLg~~mA~pI~~q~~rqGi~RklL~~~TL~--~G~~p~y~v~~~v~~a~~~ 107 (333) +|++....-++.+. +|..++. .+-..+++.|.+.+..+.-.++.....++. +|++|.....+.. 
-+ T Consensus 1 ~~~k~~~~~l~~~~~~~~~~~~~~~~g~-~v~~~~~~~l~~~i~e~s~~l~~i~v~~v~~~~~~i~~~~~~~~~----~~ 75 (321) T protein:vir:31 1 MASRTINNDLSRITEKNALTVDDLDAGG-TLPDPLWDEFWTDMIEETPLLDAIRTETVGAKKTRIPTLNIGERH----RR 75 (321) T ss_pred CchHHHHHHHHHHHHhccccccccCCcc-eeCHHHHHHHHHHHHHhhhhhhhceeeeccCcceeeeeeccCCcc----cc Confidence 34444333333332 2333331 455566666666666554444555555554 4555555443332 23 Q ss_pred EcCCCcccc--eeecCceeeccceeeeccccccHHHhhhhc--chhHHHHHHHHHHHHHHHhhhHHHHHHhhhhhhhhhh Q lcl|NC_011269. 108 HGNEGEIRI--TPFEGKRIEVQLFRIASFPQIKKEDLYYLR--SNIVEYTQDMTKQAIMRQEDSRLVTLLEAAAVSYRVV 183 (333) Q Consensus 108 ~~~~G~i~~--Q~i~~~ri~~P~f~Ivs~P~V~~~dl~~~~--~~vle~~q~~A~qaIM~qED~~~~slle~~a~~~r~~ 183 (333) .+.+|.-+. ....=+.+++...++.+.+.|.-+-|.... .|+-.+.-+...++|=..+++-.++=-- T Consensus 76 ~~~e~~~~~~~~~~~~~~~~~~~~k~~~~~~it~e~L~d~a~~~d~e~~i~~~ia~~~a~~~~~~~~nGd~--------- 146 (321) T protein:vir:31 76 PQDEGEWNENESDVSTGTIDISTEKATVAWDLPREVVQENPEGEALADRILNLMTDAWSADVEDLAANGDE--------- 146 (321) T ss_pred cccccccccccccceeeeeeeeeEEEEeehhccHHHHHhhhcchhHHHHHHHHHHHHHHHHHHhheeeccc--------- Confidence 333332222 212224577788889999999988887764 4777777777778887777755443211 Q ss_pred ccccccc----ccCC------CcceEEeeccccHHHHHHHHHHHHhhCCc--cceEEechhhhhhhhhcCCCchhhhHHh Q lcl|NC_011269. 184 DSSAQPG----VGAL------PNEITIAGSHLMPDDLYTAVTYTDQRQLD--SSRLLANPQEYRDLYRWDINTTGWAFKD 251 (333) Q Consensus 184 ~ssA~p~----vg~~------~N~i~i~~g~Lt~~~L~~a~t~v~~~~L~--at~il~~~~~~~Di~gw~~N~~~~~~~D 251 (333) +++.|+ -|-+ -+.+...++.++-+.|..+...+..+--. --..+||.+.+.+++-=-.+...+.. 
+ T Consensus 147 -~~~~~~~~~n~G~l~~a~~~~~~~~~~~~~~~~d~l~~l~~~l~~~yr~~~~~v~im~~~~~~~~~~~l~~~~~~~~-~ 224 (321) T protein:vir:31 147 -DAEDSFENQNDGFITVAEGDVETIDAADDILDNDLVIRTIAGLDSKYRARMNPALIVSEDQLLSYHYTLTDRDTPLG-D 224 (321) T ss_pred -cCCCcccccchhhhhhhccccccccccccccCHHHHHHHHHhccHhHhcCCCeEEEechHHHHHHHHHHhcCCCccc-c Confidence 122221 1211 13345567778889999999888765332 23578999987766531112111222 3 Q ss_pred hhhhcceeeeeeccccc--ceeeecCCeEEEeeChhhhcccccccCceeccccc--hhh-hccce--ehhhhhhhhhhcc Q lcl|NC_011269. 252 SVVAGERIVQFGEFQIG--KSIIIPRGTVYLTPEPEFLGVFPVMYSLDVEEDNK--VER-FNKGW--VMDELVGMAILNP 324 (333) Q Consensus 252 pV~~~e~il~~G~fgi~--~skvlprgeiyvvadpE~~G~~pvR~~L~s~p~D~--~er-~~kGW--vm~E~~g~~i~N~ 324 (333) |...++--.. .+|.+ ..--||.+.+.+ .++.|+ +|-..++++.+.... .+. -.... .++.-++.+|-|+ T Consensus 225 ~~l~~~~~~t--l~G~pvv~~~~mP~~~il~-t~~~nl-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ve~~ 300 (321) T protein:vir:31 225 NVIMGEADVN--PFSFPIIGSGLWPDDKAMF-TDPQNL-IYALYRDLEIDVLTESDKVSERDLHARYFMRGDDDFAIENT 300 (321) T ss_pred chhhcccccc--ccceeEEEcCCCCCCcEEE-eccccE-EEEEeeccEEEEeecCccccccceeeEeeeeeecceeEecc Confidence 3333321111 34544 455689887655 678777 455666666544221 111 11222 2344578888899 Q ss_pred ceEEEEecC Q lcl|NC_011269. 325 RGIVILRKA 333 (333) Q Consensus 325 ~siv~~~~~ 333 (333) -+++++-.- T Consensus 301 ~a~a~~~~i 309 (321) T protein:vir:31 301 EAVVLAEGL 309 (321) T ss_pred ccEEEEecC Confidence 999888754 No 131 >protein:vir:1541 Length: 347 # NCBI annotation: major capsid protein 10A # Family: family:all:975 # MgeID: mge:31 # MgeName: phiYeO3-12 # Cross-refs: genbank:acc:NP_052109;swissprot:trembl:q9t107;genbank:gi:9634035;uniprot:Q9T107;genbank:GeneID:1262383 Probab=95.75 E-value=0.0015 Score=36.03 Aligned_cols=281 Identities=13% Similarity=0.136 Sum_probs=149.1 Q ss_pred HHHhhcchhc-chH-------HHHHHHHHHhcCchhHHHHHHHHHHHHHHHHHhhhhhhhhhhhccccCCCcceeecCCC Q lcl|NC_011269. 
28 AKQRMGGRKL-SAR-------EKQAKLAHILSDKVGGIQRLGQSMIGPIQLQLRYQGILRNVLLEDTLTPGVPIQYDVLD 99 (333) Q Consensus 28 ~~~~~~~~~l-s~e-------e~~~Lm~~Al~~~Eg~~~aLg~~mA~pI~~q~~rqGi~RklL~~~TL~~G~~p~y~v~~ 99 (333) --+-++|.++ ++. +.-+|--+.+ .+-+.....|+-+.+.+....|+..|..-+++... T Consensus 1 ma~~~~~~~~~t~~~~~~~~~~~~a~~ie~f--------------~g~V~~~f~~~s~~~~~~~~~~~~~G~sv~i~~ig 66 (347) T protein:vir:15 1 MANIQGGQQIGTNQGKGQSAADKLALFLKVF--------------GGEVLTAFARTSVTMPRHMLRSIASGKSAQFPVIG 66 (347) T ss_pred CCccccCCccccccccCCCcchHHHHHHHHH--------------HHHHHHHHHHhhhhhhccccccccccceeEeeecc Confidence 1111222222 221 1112222222 23344455677788888888899999999999988 Q ss_pred CccceEEEEcCCCcccc--eeecC--ceeeccceeeeccccccHHHhhhhcchhHHHHHHHHHHHHHHHhhhHHHHHHhh Q lcl|NC_011269. 100 DLGQAYMLHGNEGEIRI--TPFEG--KRIEVQLFRIASFPQIKKEDLYYLRSNIVEYTQDMTKQAIMRQEDSRLVTLLEA 175 (333) Q Consensus 100 ~v~~a~~~~~~~G~i~~--Q~i~~--~ri~~P~f~Ivs~P~V~~~dl~~~~~~vle~~q~~A~qaIM~qED~~~~slle~ 175 (333) .+..++ +++...+.. +.+.. ..|.+=+....++ +|+--|=-|...|+..+.-+++..|+.++.|..++..|-. T Consensus 67 ~~t~~~--~~~g~~l~~~~~~~~~~e~~ltID~~~~~~~-~VddlD~~q~~~D~~~~~~~~~g~aLA~~~D~~i~~~l~~ 143 (347) T protein:vir:15 67 RTKAAY--LKPGENLDDKRKDIKHTEKVIHIDGLLTADV-LIYDIEDAMNHYDVRAEYTAQLGESLAMAADGAVLAELAG 143 (347) T ss_pred ceeeee--eccCCCCCCCCCCCccceEEEEechhhhhhH-HhhhHHHHhcCCcchHHHHHHHHHHHHHHHHHHHHHHHHH Confidence 887777 444333322 22333 3344444444433 5677777788999999999999999999999999988854 Q ss_pred hhhhhhhhcccccccccCCCcc----eEEeecccc-H--------HHHHHHHHHHHhhCCcc--ceEEechhhhhhhhhc Q lcl|NC_011269. 176 AAVSYRVVDSSAQPGVGALPNE----ITIAGSHLM-P--------DDLYTAVTYTDQRQLDS--SRLLANPQEYRDLYRW 240 (333) Q Consensus 176 ~a~~~r~~~ssA~p~vg~~~N~----i~i~~g~Lt-~--------~~L~~a~t~v~~~~L~a--t~il~~~~~~~Di~gw 240 (333) ++-.- ..++.+..+...+. ...++|..+ + +.|..|.+..++.+.|. 
-++|++|+.|.+|.- T Consensus 144 ~~~~~---~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~i~d~~~~a~~~Lde~~VP~~gR~~vv~P~~y~~LL~- 219 (347) T protein:vir:15 144 LVNLP---DASNENIEGLGKPTVLTLVKPTTGDLTDPVELGKAIIAQLTIARASLTKNYVPAADRTFYTTPDNYSAILA- 219 (347) T ss_pred Hhhcc---ccccccccccCccccccccccccccchhhhhHHHHHHHHHHHHHHHHhhcCCCccCCEEEeCHHHHHHHhc- Confidence 43111 11222221111111 122333333 2 23445566778888863 367889999999987 Q ss_pred CCCchhhhHHhh-----hhhcceeeeeecccccceeeecCCeEE-----EeeChhh-----------------hcccccc Q lcl|NC_011269. 241 DINTTGWAFKDS-----VVAGERIVQFGEFQIGKSIIIPRGTVY-----LTPEPEF-----------------LGVFPVM 293 (333) Q Consensus 241 ~~N~~~~~~~Dp-----V~~~e~il~~G~fgi~~skvlprgeiy-----vvadpE~-----------------~G~~pvR 293 (333) +..+ ...|. +.++. |.+--=|.|.+|--+|-+.+- .++.+-+ .|.+.-+ T Consensus 220 --~~~~-~~~d~~~~~~~~~G~-Vg~i~G~~V~~Sn~lp~~~~t~~~~~~~~g~~~~~~~~~~~~~~~~f~~~~~l~~h~ 295 (347) T protein:vir:15 220 --ALMP-NAANYQALIDHERGT-IRNVMGFEVVEVPHLTAGGAGDTREDAPADQKHAFPATSSTTVKVALDNVVGLFQHR 295 (347) T ss_pred --cccc-ccccccccccccceE-EEEEeceEEEecccccccccccccccccccccccccccccceeeeccccceeeeecc Confidence 3221 22222 22222 222222557777777754321 0011110 1111111 Q ss_pred cCc-eecccc-chhh----hccceehhhhh--hhhhhccceEEEEecC Q lcl|NC_011269. 294 YSL-DVEEDN-KVER----FNKGWVMDELV--GMAILNPRGIVILRKA 333 (333) Q Consensus 294 ~~L-~s~p~D-~~er----~~kGWvm~E~~--g~~i~N~~siv~~~~~ 333 (333) .-+ .++-.| +.|+ ...+|.|.-.. 
|-.+.||.++|-+..- T Consensus 296 ~A~g~v~~~~~~~e~~~~~~~~~d~i~~~~~~G~~vlrP~~av~~~~~ 343 (347) T protein:vir:15 296 SAVGTVKLKDLALERARRANYQADQIIAKYAMGHGGLRPEAAGAIVLP 343 (347) T ss_pred ceeeeeEeeceeeeecccchhhhhhhehhhhcCCceeccccEEEEecC Confidence 111 223233 2222 35566665443 6678999998887433 No 132 >protein:vir:6212 Length: 434 # NCBI annotation: prohead protease # Family: family:all:21 # MgeID: mge:128 # MgeName: phBC6A52 # Cross-refs: genbank:acc:NP_852592;genbank:gi:31415852;genbank:GeneID:1489210 Probab=95.55 E-value=0.0019 Score=35.55 Aligned_cols=313 Identities=15% Similarity=0.115 Sum_probs=156.4 Q ss_pred Ccccchhh-----hhhhhhhcccchHHHHH---HHHHHhhcchh--cchHHHHHHHH-----------HHh--cCchhHH Q lcl|NC_011269. 1 MTLPVAVG-----SGLGRFAKASDDYVADI---VEAKQRMGGRK--LSAREKQAKLA-----------HIL--SDKVGGI 57 (333) Q Consensus 1 ~~~~~~~~-----~~~~~~~~~~~~~~~~~---~~~~~~~~~~~--ls~ee~~~Lm~-----------~Al--~~~Eg~~ 57 (333) ...|.... .+..+...+++.+-... +.....+.+.+ ...+.|.++.. .++ .+..|+. T Consensus 73 ~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~e~r~a~~~~l~~~~~~~e~~a~~~~t~~GG~ 152 (434) T protein:vir:62 73 DDDPEKKEDPTAKENPNEKTELSEEQRSAISASIAAALSTKGHRTNKETEIRSVFANYIVGNIDEKEARALGLVTGNGSV 152 (434) T ss_pred cchhhhhcchhhhcchhhhHHHHHHHHHHHHHHHHhhhhhccccchHHHHHHHHHHHHhccccchhhhhhhcccccccce Confidence 11111110 11111111222221111 11111111111 11222222111 111 1122321 Q ss_pred HHHHHHHHHHHHHHHhhhhhhhhhhhccccCCCcceeecCCCCcc-ceEEEE-cCCCcccceeecCceeeccceeeeccc Q lcl|NC_011269. 58 QRLGQSMIGPIQLQLRYQGILRNVLLEDTLTPGVPIQYDVLDDLG-QAYMLH-GNEGEIRITPFEGKRIEVQLFRIASFP 135 (333) Q Consensus 58 ~aLg~~mA~pI~~q~~rqGi~RklL~~~TL~~G~~p~y~v~~~v~-~a~~~~-~~~G~i~~Q~i~~~ri~~P~f~Ivs~P 135 (333) ..-+-+.+.|.+.++...+.|++-++.+.. .-..||+..... 
+...+- +..++++.....=+.|++.-..+.+.+ T Consensus 153 -lvP~~~~~~Ii~~l~~~~~i~~~~~~~~~~--~~~~~p~~~~~~~a~~~~~~~e~~~~~~~~~~f~~v~~~~~k~~~~~ 229 (434) T protein:vir:62 153 -TIPDFLSKEIITYAQEENFLRRLGTGVKTK--ENIKYPVLVKKAEAQGHKNERTNNEMPETDIEFDEIELSPTEFDALA 229 (434) T ss_pred -ecchhhHHHHHHhhhhhhhhhhhcceeccC--CceEEEEEecCCcccceecccccccccccccceeeEEeeheeeEeeh Confidence 234667788999999999999998775433 234455532222 222222 223334443333367888999999999 Q ss_pred cccHHHhhhhcchhHHHHHHHHHHHHHHHhhhHHHHHHhhhhhhhhhhcccccccccCCC-cceEE-eeccccHHHHHHH Q lcl|NC_011269. 136 QIKKEDLYYLRSNIVEYTQDMTKQAIMRQEDSRLVTLLEAAAVSYRVVDSSAQPGVGALP-NEITI-AGSHLMPDDLYTA 213 (333) Q Consensus 136 ~V~~~dl~~~~~~vle~~q~~A~qaIM~qED~~~~slle~~a~~~r~~~ssA~p~vg~~~-N~i~i-~~g~Lt~~~L~~a 213 (333) .|..+=|.....|+..+..++..++|.+.+|.-+++ +-- +..|.-|.+. +.++. ..+..+-++|-.+ T Consensus 230 ~iS~ell~ds~~~l~~~i~~~la~~~~~~~d~~~l~---G~G--------~~~~~~g~~~~~~~~~~~~~~~~~d~l~~l 298 (434) T protein:vir:62 230 TVTKKLLARTGLPIEQIVMDELKKAYVRKETQYMVN---GDE--------ANNINDGALAKKAVEFKTDEKNLYDALVKM 298 (434) T ss_pred hhHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHhc---cCC--------CCccccceeecccccccccccchhhHHHHH Confidence 999999999999999999999999999999966552 221 1222222221 11222 2334556777777 Q ss_pred HHHHHhhCCccceEEechhhhhhhhhcCCCchhhhHHhhhhhcceeeeee----ccccc--ceeeecCCe-----EEEee Q lcl|NC_011269. 214 VTYTDQRQLDSSRLLANPQEYRDLYRWDINTTGWAFKDSVVAGERIVQFG----EFQIG--KSIIIPRGT-----VYLTP 282 (333) Q Consensus 214 ~t~v~~~~L~at~il~~~~~~~Di~gw~~N~~~~~~~DpV~~~e~il~~G----~fgi~--~skvlprge-----iyvva 282 (333) ...+..-.......+||++-|.-|+.=- +..| -|+-+-..-.+.| ++|.+ .+..+|.+. +++.. 
T Consensus 299 ~~~l~~~~~~~a~~v~n~~~~~~L~~lk-d~~G----~~l~~~~~~~~~g~~~tl~G~pV~~~~~~~~~~~~~~~~i~~G 373 (434) T protein:vir:62 299 KNTPVKEVRKKARWVLNTAALTKIETMK-TDDG----FPLLRPFNQAEGGIGYTLLGFPVEEEDAIDIPDSPDTPVFYFG 373 (434) T ss_pred HhhcchhhhcCCEEEEcHHHHHHHHHhh-ccCC----CEeeccCCCccCCCCceecceeeEEecCccCccCCCceEEEEe Confidence 7777666666667899999998887611 1111 1211100011122 34433 344455443 12234 Q ss_pred Chhhhcccccc-cCceeccccch--hhhccceehhhhh-hhhhhccceEEEEe---cC Q lcl|NC_011269. 283 EPEFLGVFPVM-YSLDVEEDNKV--ERFNKGWVMDELV-GMAILNPRGIVILR---KA 333 (333) Q Consensus 283 dpE~~G~~pvR-~~L~s~p~D~~--er~~kGWvm~E~~-g~~i~N~~siv~~~---~~ 333 (333) |... +..=.| +++.++..+.. .+-..|-..++=+ |+.|-+|-.+.++. |+ T Consensus 374 dfs~-~~i~~~~g~~~i~~~~~~~~~~~~v~~~~~~r~Dgk~i~~~~~~~~~~~~~~~ 430 (434) T protein:vir:62 374 DFSK-FYIQDVIGSLEVQKLVELFSRTNRVGFRIWNLLDAQLIHSPFEVPVYKYVLKA 430 (434) T ss_pred eccc-eEEEEeeceeEEEeehhhhcccCceEEEEEeeecceeecCcccceEEEEEecc Confidence 4442 222224 34444443322 2224566677766 77777788887774 22 No 133 >protein:vir:4197 Length: 314 # NCBI annotation: putative structural protein # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:88 # MgeName: psiM100 # Cross-refs: genbank:acc:NP_071822;genbank:gi:11863105;genbank:GeneID:1257607 Probab=94.99 E-value=0.001 Score=36.95 Aligned_cols=284 Identities=10% Similarity=0.028 Sum_probs=150.0 Q ss_pred chHHHHHHHHHHhhcchhcchHHHHHHHHHHhcCchhHHHHHHHHHHHHHHHHHhhhhhhhhhhhcc-ccC--CCcceee Q lcl|NC_011269. 19 DDYVADIVEAKQRMGGRKLSAREKQAKLAHILSDKVGGIQRLGQSMIGPIQLQLRYQGILRNVLLED-TLT--PGVPIQY 95 (333) Q Consensus 19 ~~~~~~~~~~~~~~~~~~ls~ee~~~Lm~~Al~~~Eg~~~aLg~~mA~pI~~q~~rqGi~RklL~~~-TL~--~G~~p~y 95 (333) -|.. |++++-+|+ |. ..|..|+ .|.-..++.+.+.+.-....|++-+.. |+- ++.+|.. 
T Consensus 1 ~~~~------------~~~~~~~k~--it--~~d~~gG--~L~P~~~~~~i~~l~e~s~i~~~a~vi~t~~s~~~~i~~i 62 (314) T protein:vir:41 1 MDFL------------NKPFQITPK--ID--VPDLGKG--ILAVQRFGEFVREVRENSAIIKDARVLNALKSYEVDISRI 62 (314) T ss_pred Cchh------------hhHHHhhcc--cc--cccCCCc--eeChHHHHHHHHHHHhccchhhheeeecccCccceeeccc Confidence 0111 111111111 00 1344454 466566666666777777777766654 432 3555554 Q ss_pred cCCCCccceEEEEcCCCcccceeecCceeeccceeeeccccccHHHhhhhcc--hhHHHHHHHHHHHHHHHhhhHHHHHH Q lcl|NC_011269. 96 DVLDDLGQAYMLHGNEGEIRITPFEGKRIEVQLFRIASFPQIKKEDLYYLRS--NIVEYTQDMTKQAIMRQEDSRLVTLL 173 (333) Q Consensus 96 ~v~~~v~~a~~~~~~~G~i~~Q~i~~~ri~~P~f~Ivs~P~V~~~dl~~~~~--~vle~~q~~A~qaIM~qED~~~~sll 173 (333) ...-.+...+-|.+..+....+...=..+++..-++.+...|.-+-|+.... |+=.+.-++-+++|=+.|....+ T Consensus 63 ~~g~~~~~~~~~~~~~~~~~~~~~tf~~~~l~~~kl~~~v~is~e~L~D~a~~~~le~~i~~~~Ae~~g~~~~~~~~--- 139 (314) T protein:vir:41 63 SLGVELEPGRNTSGTKVAPTADEVTVSTNTLEMKELVTKVVLEDEALEDNIEQSAFEQTITSLLASGVTYDLECFFL--- 139 (314) T ss_pred ccCcccccccccccCCccCCcccccccceeeeeEEEEEeecccHHHHHhhhchhhHHHHHHHHHHHHHHHHHHHHhh--- Confidence 4333344455566666655554444456677777888888888888888765 77777777777777776654332 Q ss_pred hhhhhhhhhhccccccc----ccCC---CcceEE---eeccccHHHHHHHHHHHHhhCCc---cceEEechhhhhhhhhc Q lcl|NC_011269. 174 EAAAVSYRVVDSSAQPG----VGAL---PNEITI---AGSHLMPDDLYTAVTYTDQRQLD---SSRLLANPQEYRDLYRW 240 (333) Q Consensus 174 e~~a~~~r~~~ssA~p~----vg~~---~N~i~i---~~g~Lt~~~L~~a~t~v~~~~L~---at~il~~~~~~~Di~gw 240 (333) ++=-.. .+++|- .|-+ .+.++- ..+..+.+.|..++..+..+-+. --..+||.+.+..++-. 
T Consensus 140 nGdg~~-----~s~~~~~~~p~G~l~~a~~~~~~~~~~~~~~~~~~~~~l~~sl~~~yr~~~~~~~~~m~~~t~~~~r~~ 214 (314) T protein:vir:41 140 HADSSL-----TTGRELYRINDGWMKLAGNQYTDAEPEDENWPLNLFDGMMDELDTRYLQLKPRMKFYVSNEIYNGYRKQ 214 (314) T ss_pred ccccCC-----cCcccchhcchhhhhhcccceeecCccccccHHHHHHHHHHhcCchhhcCCCceEEEecHHHHHHHHHH Confidence 221000 011221 1111 011111 12335566678888877775443 44589998877665531 Q ss_pred CCCchhhhHHhhhhhcceeeeeeccccccee-------eecCCeEEEeeChhhhcccccccCceeccccchhhhccceeh Q lcl|NC_011269. 241 DINTTGWAFKDSVVAGERIVQFGEFQIGKSI-------IIPRGTVYLTPEPEFLGVFPVMYSLDVEEDNKVERFNKGWVM 313 (333) Q Consensus 241 ~~N~~~~~~~DpV~~~e~il~~G~fgi~~sk-------vlprgeiyvvadpE~~G~~pvR~~L~s~p~D~~er~~kGWvm 313 (333) --+...+ .-||.-++.--. =.+|++.-. -.|.+ .++..|+.|. +|..+.+++.++.=+.++-..+++. T Consensus 215 l~~~~~~-l~~~~~~~~~~~--~l~G~PV~~~~~~~~~~~~~~-~i~fgd~~nl-v~~~~~~ir~~~~~~a~~~~~~~~~ 289 (314) T protein:vir:41 215 LLVRETG-LGDSALIGATGL--QYDGIPIQYVPALDALGDDKA-RALLTVPTNL-VYGFWRNIRIEPKRDAAMRRTEYIA 289 (314) T ss_pred HhccCCc-ccchhhhCCCCc--eecceeeEecccccccCCCCc-eEEEechhhe-EEEeeceeEEeecccCcCCeEEEEE Confidence 1011101 113332222111 023544221 13444 4555679988 8889999999877666665666666 Q ss_pred hhhhhhhhhcc--ceEEEEecC Q lcl|NC_011269. 314 DELVGMAILNP--RGIVILRKA 333 (333) Q Consensus 314 ~E~~g~~i~N~--~siv~~~~~ 333 (333) .-=++..+.-+ -++.++.|+ T Consensus 290 ~~r~d~~~~~~~aa~~~~~~~~ 311 (314) T protein:vir:41 290 SLRADCNYEDENAAVAAVIDMS 311 (314) T ss_pred EEEeceEEEEcCcEEEEEeecc Confidence 55555555433 345566777 No 134 >protein:vir:99675 Length: 324 # NCBI annotation: Major capsid protein # Family: family:all:975 # MgeID: mge:1523 # MgeName: VP4 # Cross-refs: genbank:acc:YP_249589;genbank:gi:68299740;genbank:GeneID:3799990 Probab=94.84 E-value=0.0027 Score=34.62 Aligned_cols=244 Identities=14% Similarity=0.153 Sum_probs=133.4 Q ss_pred hccccCCCcceeecCCCCccceEEEEcCCCccc--ceeecCce--eeccceeeeccccccHHHhhhhcchhHHHHHHHHH Q lcl|NC_011269. 
83 LEDTLTPGVPIQYDVLDDLGQAYMLHGNEGEIR--ITPFEGKR--IEVQLFRIASFPQIKKEDLYYLRSNIVEYTQDMTK 158 (333) Q Consensus 83 ~~~TL~~G~~p~y~v~~~v~~a~~~~~~~G~i~--~Q~i~~~r--i~~P~f~Ivs~P~V~~~dl~~~~~~vle~~q~~A~ 158 (333) ...||+-|.-=++|..-++..+| +++-.++- .|.+...+ |.+=+..+.+ -+|+--|=-|...|+..+.-+++- T Consensus 1 ~vr~i~~g~s~~~~~iG~~~~~~--~~~G~~l~~~~~~~~~~e~~itID~~l~~~-~~VdDiD~~qa~~Dlr~e~s~~~G 77 (324) T protein:vir:99 1 MTRTITSGKSAQFPVMGRTKARY--LKQGQSLDDGREDIKHTEKVITIDGLLTTD-VLIYDIEDAMNHYDVRSEYSTQMG 77 (324) T ss_pred CeeeeecCceEEEeeeeeeEecc--ccCCCCcCCCcCCcCcccEEEEecchhhhh-hhhhhHHHHhcCccchhHHHHHHH Confidence 77889999999999988887788 44444332 23333333 4445554444 456666667788999999999999 Q ss_pred HHHHHHhhhHHHHHHhhhhhhhhhhcccccc--cccCCCcceEEeecccc----H----HHHHHHHHHHHhhCCccc--e Q lcl|NC_011269. 159 QAIMRQEDSRLVTLLEAAAVSYRVVDSSAQP--GVGALPNEITIAGSHLM----P----DDLYTAVTYTDQRQLDSS--R 226 (333) Q Consensus 159 qaIM~qED~~~~slle~~a~~~r~~~ssA~p--~vg~~~N~i~i~~g~Lt----~----~~L~~a~t~v~~~~L~at--~ 226 (333) .++.++.|.-++..+-..+-+ .-..++-| +.|+. ..+.+.++.-. + +.|..|.+..++.++|.. . T Consensus 78 ~aLA~~~Dq~i~~~~a~~~~~--~a~~~~~~~~~~g~~-~~~~~~~~~~~~~~~~~~~~dai~~a~~~Lde~~VP~~gR~ 154 (324) T protein:vir:99 78 EALAMAADVANYAEMAKLVNS--RKETTNENIEGLGAA-SLVKITGKKEDPAKYGTQVIQALTYARAAFAKKYIPAGDRT 154 (324) T ss_pred HHHHHHHHHHHHHHHHHhhhc--ccccccCCcccCCcc-ceecccccccccccCHHHHHHHHHHHHHHHhhcCCCCCCCE Confidence 999999998887665222200 00111111 11211 11223333322 2 334556688888888743 6 Q ss_pred EEechhhhhhhhhc-CCCchhhhHHhhhhhcceeeeeecccccceeeecCCeEEEeeChh-hh-cccccc---------- Q lcl|NC_011269. 227 LLANPQEYRDLYRW-DINTTGWAFKDSVVAGERIVQFGEFQIGKSIIIPRGTVYLTPEPE-FL-GVFPVM---------- 293 (333) Q Consensus 227 il~~~~~~~Di~gw-~~N~~~~~~~DpV~~~e~il~~G~fgi~~skvlprgeiyvvadpE-~~-G~~pvR---------- 293 (333) +|++|+.|..|.-= ..+...|...+.+.++ .|.+---|.|.+|--+|-+...=..+.- +. 
+.+|.- T Consensus 155 ~vv~P~~y~~Ll~~~~~~~~~~~~~~~~~~G-~V~~i~Gf~V~~Sn~lp~~~~t~~~~a~~~~~~~~~~~~~~~~~~ky~ 233 (324) T protein:vir:99 155 FYTDPDTYSAILAALMPNAANYAALIDPETG-NIRNVMGFEVVETPHMTAQMVTNPTDAFDGTGHIFPATGDSTTTGKMT 233 (324) T ss_pred EEeChHHHHHHhhcccccccccccccceecc-eEEEEeceEEEecCCccccccccccccccccccccccccccccccccc Confidence 89999999988741 0111122333333333 2433444667777777754322111000 00 001110 Q ss_pred c------Cc--------eecccc-chhh----hccceehhhhhhh--hhhccceE--EEEecC Q lcl|NC_011269. 294 Y------SL--------DVEEDN-KVER----FNKGWVMDELVGM--AILNPRGI--VILRKA 333 (333) Q Consensus 294 ~------~L--------~s~p~D-~~er----~~kGWvm~E~~g~--~i~N~~si--v~~~~~ 333 (333) + +| .++-.+ +.|. -..||.|.-...| .+.||.++ |.|.+- T Consensus 234 ~d~~~~~gl~~~~~a~~tv~~~~~~~e~~~~~~~~~d~i~~~~a~G~~~lRPe~a~~v~l~~~ 296 (324) T protein:vir:99 234 VGADNVVGLFVHRSAVATLKLKDMALERARRPEYQADQIIAKYAMGHGGLRPEAVGAIIFEDG 296 (324) T ss_pred cccCceeEEEEehhheEEEeeecceecceechhhHHHhhhhhhhhcCcccccceEEEEEEccC Confidence 0 11 122222 2232 2678998776555 57899755 555544 No 135 >protein:vir:78935 Length: 335 # NCBI annotation: capsid protein # Family: family:all:2806 # MgeID: mge:1860 # MgeName: LKD16 # Cross-refs: genbank:acc:YP_001522824;genbank:gi:158345059;genbank:GeneID:5687425 Probab=94.12 E-value=0.0053 Score=33.04 Aligned_cols=287 Identities=13% Similarity=0.150 Sum_probs=154.6 Q ss_pred hcchhcchHHHHHHHHHHhcCchhHHHHHHHHHHHHHHHHHhhhhhhhhhhhccccCCCcceeecCCCCccceEEEEcCC Q lcl|NC_011269. 32 MGGRKLSAREKQAKLAHILSDKVGGIQRLGQSMIGPIQLQLRYQGILRNVLLEDTLTPGVPIQYDVLDDLGQAYMLHGNE 111 (333) Q Consensus 32 ~~~~~ls~ee~~~Lm~~Al~~~Eg~~~aLg~~mA~pI~~q~~rqGi~RklL~~~TL~~G~~p~y~v~~~v~~a~~~~~~~ 111 (333) |- -+++-++... .+-++... + -| +---+-+..+.-|+-+.+.+....|+..|.--+||..-+...+|+--|. 
T Consensus 1 ms--~~~~~t~~~~--~~s~~d~a-l-~l-e~f~geV~~af~~~s~~~~~~~~rti~~g~s~~~~~iG~~~~~~~~pG~- 72 (335) T protein:vir:78 1 MS--FLNDLTRPNY--AGKNADVD-I-HL-EEHLGIVDKHFAYTSKFAPLMNIRDLRGSNVVRLDRLGNVEAKGRRAGE- 72 (335) T ss_pred CC--cccccccccc--ccccchhh-h-hh-hhhhhHHHHHHHHhhhhccccceeeeccceeEEEeeeeeeeecccccCc- Confidence 21 1111111100 00000000 0 00 1112334456677778888888889999999999999888888854443 Q ss_pred CcccceeecC--ceeeccceeeeccccccHHHhhhhcchhHHHHHHHHHHHHHHHhhhHHHHHHhhhhhhhhhhcc-ccc Q lcl|NC_011269. 112 GEIRITPFEG--KRIEVQLFRIASFPQIKKEDLYYLRSNIVEYTQDMTKQAIMRQEDSRLVTLLEAAAVSYRVVDS-SAQ 188 (333) Q Consensus 112 G~i~~Q~i~~--~ri~~P~f~Ivs~P~V~~~dl~~~~~~vle~~q~~A~qaIM~qED~~~~slle~~a~~~r~~~s-sA~ 188 (333) .+-.|++.. ..|.+=++- +|+-+|+-.|=-+.+.|+..+.-.+.-.|+-+..|...+-.+=-++ |.-.. +.. T Consensus 73 -~l~~~~~~~~k~~itID~ll-~a~~~VddlDe~~~~yDvR~e~s~~~G~aLA~~~Dq~~~~~l~~aa---~~~a~~~~~ 147 (335) T protein:vir:78 73 -ELERSRVVNDKWNLTVDTLL-YLRHQFDHQDEWTQSFDMRKEVAELDGQELARKFDQACLIQVIKAA---AMDAPVDLE 147 (335) T ss_pred -ccCCCCcccCCeEEEeccee-echhhHhhHHHhhcCchhHHHHHHHHHHHHHHHHHHHHHHHHHhhc---ccccccccC Confidence 344444433 345555554 5666688888889999999999999999999999987775543333 21111 112 Q ss_pred ccc--cCCCcceEEeeccc--cHHHHHH----HHHHHHhhCCccc-----eEEechhhhhhhhhcC--CCch-h-hhHHh Q lcl|NC_011269. 189 PGV--GALPNEITIAGSHL--MPDDLYT----AVTYTDQRQLDSS-----RLLANPQEYRDLYRWD--INTT-G-WAFKD 251 (333) Q Consensus 189 p~v--g~~~N~i~i~~g~L--t~~~L~~----a~t~v~~~~L~at-----~il~~~~~~~Di~gw~--~N~~-~-~~~~D 251 (333) |+. |.. =...++|+.. .+..|.. |.+..++.++|.. .++++|+.|.-|+-=+ .|.. 
+ ....+ T Consensus 148 ~~~~~G~~-~~~~~tg~~~~~~~~~l~~a~~~a~~~l~ekdvP~~~~~~rv~vv~P~~y~~Ll~~~~l~n~~~~~s~~~~ 226 (335) T protein:vir:78 148 DAFSPGVL-EKLDLTGLTAKEAAEKIVRMHRRVVETFIERDLGDAVYSEGLTPMSPRVFSLLLEHDKLMSVEYQATGATN 226 (335) T ss_pred CCcCCCcc-eeeeeccccccccHHHHHHHHHHHHHHHHhccCCCCCCCccEEEeChHHHHHHhccccccccccccccccc Confidence 221 222 1223344443 3334444 4455777888643 4889999999988610 1111 0 01122 Q ss_pred hhhhcceeeeeecccccceeeecCCeEEEeeChhhh----cccccccCc----------eecccc-chhh--hccceehh Q lcl|NC_011269. 252 SVVAGERIVQFGEFQIGKSIIIPRGTVYLTPEPEFL----GVFPVMYSL----------DVEEDN-KVER--FNKGWVMD 314 (333) Q Consensus 252 pV~~~e~il~~G~fgi~~skvlprgeiyvvadpE~~----G~~pvR~~L----------~s~p~D-~~er--~~kGWvm~ 314 (333) ....++ +.+.-=|.|.+|--+|.+.+.--+..... +.+..+-++ +.++.- ..++ -..+|+|. T Consensus 227 ~~~~g~-v~~v~Gv~V~~Sn~lP~~~~t~~~lg~a~n~~~~d~~~~~~~~~~~~Al~t~~~~~~~~e~~~~~~~~~~~i~ 305 (335) T protein:vir:78 227 DYVKSR-VAILNGVKVLETPRFATKAISAHPLGRHFNVSAEEAERQIALFLPSKTLITAQVAPVQAKLWEDHDQFSWVLD 305 (335) T ss_pred ccccce-eEEeeceEEEeeccCCCCCCccccccccCCcccccccceEEEEEecceEEEEEEEecccceeeccchhhHhhh Confidence 333333 33333366888888888764322211100 111111111 111111 2222 35789998 Q ss_pred hhhhh--hhhccceEEEEecC Q lcl|NC_011269. 315 ELVGM--AILNPRGIVILRKA 333 (333) Q Consensus 315 E~~g~--~i~N~~siv~~~~~ 333 (333) -...| .+.||.+.|.+.-- T Consensus 306 ~~~a~G~g~lRPe~a~~i~~t 326 (335) T protein:vir:78 306 TFQMYNIGARRPDTAGAIELK 326 (335) T ss_pred HHHHcCCcccCcceEEEEEec Confidence 76655 57899998888754 No 136 >protein:vir:7019 Length: 401 # NCBI annotation: major capsid protein # Family: family:all:2806 # MgeID: mge:141 # MgeName: SP6 # Cross-refs: genbank:acc:NP_853592;genbank:gi:31711674;genbank:GeneID:1481800 Probab=93.47 E-value=0.0042 Score=33.60 Aligned_cols=288 Identities=11% Similarity=0.023 Sum_probs=157.5 Q ss_pred HHHHHHHHHhhcchhcchHHHHHHHHHHhcCchhHHHHHHHHHHHHHHHHHhhhhhhhhhhhccccCCCcceeecCCCCc Q lcl|NC_011269. 
22 VADIVEAKQRMGGRKLSAREKQAKLAHILSDKVGGIQRLGQSMIGPIQLQLRYQGILRNVLLEDTLTPGVPIQYDVLDDL 101 (333) Q Consensus 22 ~~~~~~~~~~~~~~~ls~ee~~~Lm~~Al~~~Eg~~~aLg~~mA~pI~~q~~rqGi~RklL~~~TL~~G~~p~y~v~~~v 101 (333) ..++ .--+.+|-.-|+ +..+|.=+.+.- -+..+.-|+-+-+.+....||+-|.-=+||..-+. T Consensus 1 Ms~~--n~~t~~~~~~sg-~~~al~Le~f~G--------------eV~taF~~~si~~~~~~vRti~~gkS~qf~~~G~s 63 (401) T protein:vir:70 1 MSTP--NNLTNVAVSASG-EVDSLLIEKFNG--------------KVNEQYLKGENIMSYFDVQTVTGTNTVSNKYLGET 63 (401) T ss_pred CCCC--cccccccccccc-chhHhHHhHhcc--------------hHHHHHHHHhhhcccceeeeecccceEEEEEeeee Confidence 0000 000011111111 222333222222 23345667778888888999999999999999888 Q ss_pred cceEEEEcCCCcccceeec-Cc-eeeccceeeeccccccHHHhhhhcchhHH-HHHHHHHHHHHHHhhhHHHHHHhhhhh Q lcl|NC_011269. 102 GQAYMLHGNEGEIRITPFE-GK-RIEVQLFRIASFPQIKKEDLYYLRSNIVE-YTQDMTKQAIMRQEDSRLVTLLEAAAV 178 (333) Q Consensus 102 ~~a~~~~~~~G~i~~Q~i~-~~-ri~~P~f~Ivs~P~V~~~dl~~~~~~vle-~~q~~A~qaIM~qED~~~~slle~~a~ 178 (333) ..+|+--|.+ +-.|++. +| .|.+=+.-+ ++-+|.-.|=.+...|.+. +.-+..-+|+-+..|..++.++.+++ T Consensus 64 ~~~~~~pG~~--ld~~~~~~dK~~ItID~lL~-a~~~V~dlDe~q~~yD~vRse~s~e~G~ALA~~~Dq~iiq~i~~aa- 139 (401) T protein:vir:70 64 ELQVLAPGQS--PAATSTQADKNQLVIDATVI-ARNTVAHLHDVQGDIDSLKPKLATNQAKQLKRMEDEMLIQQMMLGG- 139 (401) T ss_pred EeeeecCCCC--cCCCCcccccEEEEeCceee-hhhhhhhHHHHHhcccccchHHHHHHHHHHHHHHHHHHHHHHHHhc- Confidence 8888655443 3333332 23 356666554 4555666666777777444 33344567888899999999998777 Q ss_pred hhhhhcc--cccccccCCCcceEEeecc----ccHHH----HHHHHHHHHhhCCccceE--EechhhhhhhhhcC--C-C Q lcl|NC_011269. 179 SYRVVDS--SAQPGVGALPNEITIAGSH----LMPDD----LYTAVTYTDQRQLDSSRL--LANPQEYRDLYRWD--I-N 243 (333) Q Consensus 179 ~~r~~~s--sA~p~vg~~~N~i~i~~g~----Lt~~~----L~~a~t~v~~~~L~at~i--l~~~~~~~Di~gw~--~-N 243 (333) |..-+ ++-|.++.-...|++.+.. .++.+ +..|....+..++|..+. ++.++.|+=|.-=+ . 
+ T Consensus 140 --~ana~~~~~~p~~~~~G~~i~v~~~~~~~~~~~~~l~~ai~dA~~~LdEkdVP~~r~vvl~pp~~Ys~Ll~~d~L~nr 217 (401) T protein:vir:70 140 --IANTQAKRTNPRVKGHGFSINVEVAEGEALVNPQYVMAAVEFALEQQLEQEVDISDVAILMPWRYFNVLRDADRIVDK 217 (401) T ss_pred --cccccccccCCCcCCCceEEeccccccccccCHHHHHHHHHHHHHHHHhcCCCccceEEEcCHHHHHHHHhcCcccch Confidence 32222 2334444444455554432 12333 457778888899987754 44588886555411 0 2 Q ss_pred chhhhHHhhhhhcceeeeeecccccceeeecCCeEEEeeChh----hhcccccccC------ce--------ecccc--- Q lcl|NC_011269. 244 TTGWAFKDSVVAGERIVQFGEFQIGKSIIIPRGTVYLTPEPE----FLGVFPVMYS------LD--------VEEDN--- 302 (333) Q Consensus 244 ~~~~~~~DpV~~~e~il~~G~fgi~~skvlprgeiyvvadpE----~~G~~pvR~~------L~--------s~p~D--- 302 (333) +|+++.-+...++. ++.-==|.|.+|--+|-+.--++..+. +.-.|.++++ +- ++-.| T Consensus 218 d~~~s~~g~~~~G~-v~~vaGv~Vv~SnnlP~~a~~it~~~ls~a~~G~~y~~~~d~s~~~~v~f~~~Av~tvk~~~lt~ 296 (401) T protein:vir:70 218 TYTISQSGATIQGF-TLSSYNCPVIPSNRFPKYSQGQTHHLLSNEDNGYRYDPLPAMNGAIAVLFTADALLVGRSIDVTG 296 (401) T ss_pred hhccccCCccccce-EEEEeceEEEeeccccccccccccccccccCCCccCCCCccccceeEEEEehhheEEEEeecccc Confidence 33333334444444 443333667788888875544443322 1122333332 22 22222 Q ss_pred chhh--hccceehhh--hhhhhhhccceEEEEecC Q lcl|NC_011269. 
303 KVER--FNKGWVMDE--LVGMAILNPRGIVILRKA 333 (333) Q Consensus 303 ~~er--~~kGWvm~E--~~g~~i~N~~siv~~~~~ 333 (333) ..+| -.++|+|+- +.|-...||-++.++.-+ T Consensus 297 ~~~~d~r~~~~~id~~~a~g~g~~RPeaa~vv~~k 331 (401) T protein:vir:70 297 DIFYEKKEKTYYIDTFMAEGAIPDRWEAVSVVTTK 331 (401) T ss_pred chhhhhhhhHHHHHHHHHhCCcccchhheEEEeec Confidence 1122 378999975 456667899998887444 No 137 >protein:vir:102655 Length: 322 # NCBI annotation: Hypothetical protein # Family: family:all:6384 # MgeID: mge:1624 # MgeName: VP2 # Cross-refs: genbank:acc:YP_052979;genbank:gi:50282923;genbank:GeneID:2948122 Probab=93.37 E-value=0.0057 Score=32.87 Aligned_cols=282 Identities=14% Similarity=0.088 Sum_probs=136.9 Q ss_pred CcccchhhhhhhhhhcccchHHHHHHHHHHhhcchhcchHHHHHHHHHHhcCchhHHHHHHHHHHHHHHHHHhhhhhhhh Q lcl|NC_011269. 1 MTLPVAVGSGLGRFAKASDDYVADIVEAKQRMGGRKLSAREKQAKLAHILSDKVGGIQRLGQSMIGPIQLQLRYQGILRN 80 (333) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ls~ee~~~Lm~~Al~~~Eg~~~aLg~~mA~pI~~q~~rqGi~Rk 80 (333) |.|- +.=||+=-. |-+- +-..-+|--..-.+..|++..+|+.... -++=..+ T Consensus 1 ~~~~-~~~~~~~~M---s~~i--~~~fv~qy~~~v~~~~qq~~s~L~~tV~----------------------~~~~~~~ 52 (322) T protein:vir:10 1 MKLN-AIMSMLPLI---AGDI--DQAFVQTYETTLRILSQQKSAKLKQYCQ----------------------HKNESSE 52 (322) T ss_pred Cccc-ceeeeeeee---echh--hhHHHHHHHHHHHHHHHHhhhhhhcccc----------------------ccccccc Confidence 2221 111121110 1110 0000022111122222222222222211 1111111 Q ss_pred hhhccccCCCcceeecCCCCccceEEEEcCCCcccc--eeecCceeeccceeeeccccccHHHhhhhcchhHHHHHHHHH Q lcl|NC_011269. 81 VLLEDTLTPGVPIQYDVLDDLGQAYMLHGNEGEIRI--TPFEGKRIEVQLFRIASFPQIKKEDLYYLRSNIVEYTQDMTK 158 (333) Q Consensus 81 lL~~~TL~~G~~p~y~v~~~v~~a~~~~~~~G~i~~--Q~i~~~ri~~P~f~Ivs~P~V~~~dl~~~~~~vle~~q~~A~ 158 (333) --..|++.-...+.|. ++-....+..+.. +++. |+...-++..+.+. .+-+|+-.|..+...|.....-+.+. 
T Consensus 53 ~~~~~~~~~~~~~~~~--~~~~~~~~~d~~~-dtp~~~~~~~~r~~~~~d~~--~~~~VDd~D~~k~~~D~~~~~~~~~a 127 (322) T protein:vir:10 53 SHNWETLASMDPDAVK--RKRSRQQSADGTY-PTPVNNKPFAKRRTNVDTYD--TGHVVEQEDISQMLLDPNSALITSQA 127 (322) T ss_pred ccceeecccccccccc--cccccccccCccc-CCCccccccceEEEeecccc--cceecchHHHHHhhcCchHHHHHHHH Confidence 1111222222222222 2211111111110 1222 23333344455554 24589999999999999999999999 Q ss_pred HHHHHHhhhHHHHHHhhhhhhhhhhcccccccccCCC---cc-eEEeeccccHHHHHHHHHHHHhhCCcc---ceEEech Q lcl|NC_011269. 159 QAIMRQEDSRLVTLLEAAAVSYRVVDSSAQPGVGALP---NE-ITIAGSHLMPDDLYTAVTYTDQRQLDS---SRLLANP 231 (333) Q Consensus 159 qaIM~qED~~~~slle~~a~~~r~~~ssA~p~vg~~~---N~-i~i~~g~Lt~~~L~~a~t~v~~~~L~a---t~il~~~ 231 (333) -|+=+..|..+++.+=+.|. -.-+++ .+. ++ +.-.+..+|-+.|-.|.+...+.+.|- .+++..| T Consensus 128 ~AL~R~~D~~I~~a~~g~a~-------~~~~gt-~v~~~ss~~i~~g~~g~t~~kl~~a~~~l~~~dvp~d~~R~~vv~p 199 (322) T protein:vir:10 128 YAMARKTDDLIIAGAWKPAS-------IKGTGQ-PVEFLATQEIGDGTKPISFDYVTEITERFLENEIEPEVSKVIVIGP 199 (322) T ss_pred HHhhhHHHHHHHhhhhcccc-------cccccc-ccccCCCcccccCccchhHHHHHHHHHHHHhcCCCCCCCeEEEeCH Confidence 99999999888876644441 111221 111 11 333455899999999999999999984 3699999 Q ss_pred hhhhhhhhcCCCchhhhHHhhhhhcceeeeeec------ccccceeeecCCeEEEeeChhhhccccc------------- Q lcl|NC_011269. 232 QEYRDLYRWDINTTGWAFKDSVVAGERIVQFGE------FQIGKSIIIPRGTVYLTPEPEFLGVFPV------------- 292 (333) Q Consensus 232 ~~~~Di~gw~~N~~~~~~~DpV~~~e~il~~G~------fgi~~skvlprgeiyvvadpE~~G~~pv------------- 292 (333) ..|.||.. ..+| +-.|=..-.+ +.+.|. |-++.|-.||..- +...+ .|..++ T Consensus 200 ~~~~~LL~--d~~~--ts~D~~~~~~-l~~~G~ig~~lGf~~i~s~~lp~~~---~t~~~-~~~~~~~~~~~~~~~a~~k 270 (322) T protein:vir:10 200 TQARKLLQ--ITEA--TSADYTSAMD-LQSKGIITNWMGYTWIVSTRLDKFD---PTQWG-MAAEDGPQGDEIWCIAMTD 270 (322) T ss_pred HHHHHHhc--chhh--hhhhcccchh-hhhcCeeeeeeeEEEEEeccCCccc---ccccc-ccccCCCCccceeEEEEec Confidence 99999997 3322 3333332222 112233 3456676676321 00000 111111 Q ss_pred -------ccCceeccccchhhhccceehh--hhhhhhhhccceEEEEecC Q lcl|NC_011269. 
293 -------MYSLDVEEDNKVERFNKGWVMD--ELVGMAILNPRGIVILRKA 333 (333) Q Consensus 293 -------R~~L~s~p~D~~er~~kGWvm~--E~~g~~i~N~~siv~~~~~ 333 (333) ...+.++ +++...-...|-+| .++|=.+.+|.+||-++-. T Consensus 271 ~Av~~a~~~dv~~~-i~~~~~~~~a~~I~~~~~~Ga~ri~~~gVv~i~~~ 319 (322) T protein:vir:10 271 MALGYHSCKDIWTK-VAEDPSASFAWRIYSAFTADCVRVEDEHIFKLRLK 319 (322) T ss_pred CceeEEEeeeeeEE-eeccCCcchhhhhhhhhhhCceEeccCcEEEEEEe Confidence 1122222 12322234468888 7788889999999998876 No 138 >protein:vir:97255 Length: 310 # NCBI annotation: hypothetical protein ORF017 # Family: family:all:1120 # MgeID: mge:1657 # MgeName: M6 # Cross-refs: genbank:acc:YP_001294525;genbank:gi:149408246;genbank:GeneID:5237120 Probab=91.64 E-value=0.0081 Score=32.04 Aligned_cols=271 Identities=11% Similarity=0.120 Sum_probs=132.9 Q ss_pred hcchhcchHHHHHHHHHHhcCchhHHHHHHHHHHHHHHHHHhhhhhhhhhhhccccCCCcceeecC---CCCccceEEE- Q lcl|NC_011269. 32 MGGRKLSAREKQAKLAHILSDKVGGIQRLGQSMIGPIQLQLRYQGILRNVLLEDTLTPGVPIQYDV---LDDLGQAYML- 107 (333) Q Consensus 32 ~~~~~ls~ee~~~Lm~~Al~~~Eg~~~aLg~~mA~pI~~q~~rqGi~RklL~~~TL~~G~~p~y~v---~~~v~~a~~~- 107 (333) |-+ |+--|.+.+.. ..|.+.+ |+...++..+++.|--.+ +. |-.++|.. ++++..+.+- T Consensus 1 mpa--ltLaea~k~~~----------d~l~~~V---iE~~~~~s~lL~~LpF~~-ve-g~~~~ynR~~~~~~~~~~~v~~ 63 (310) T protein:vir:97 1 MAS--VTLAESAKLAQ----------DELVAGV---IENIITVNRMFDVLPFDS-IE-GNSLAYNRENVLGDVIMAGVGT 63 (310) T ss_pred Ccc--cchHHHhhcCc----------chHHHHH---HHHHhccchHHHhCCccc-cc-CCcceeeEeeccCCcccccccc Confidence 211 11111111111 1222222 555556777776665444 33 44566664 4555444322 Q ss_pred -EcCCCcccceeecCceeeccceeee-cccccc--HHHhhhh-cchhHHH-HHHHHHHHHHHHhhhHHHHHHhhhh--hh Q lcl|NC_011269. 108 -HGNEGEIRITPFEGKRIEVQLFRIA-SFPQIK--KEDLYYL-RSNIVEY-TQDMTKQAIMRQEDSRLVTLLEAAA--VS 179 (333) Q Consensus 108 -~~~~G~i~~Q~i~~~ri~~P~f~Iv-s~P~V~--~~dl~~~-~~~vle~-~q~~A~qaIM~qED~~~~slle~~a--~~ 179 (333) .++.|.-+.-.+- +..+- ++.|+ .--+|+ +.|++.. -.|-++. .+..++..-.++|+..+. +=. -. 
T Consensus 64 ~~~~~g~~~~~~t~-~~~~~-~L~i~~g~~~Vd~~i~dl~~~~~~dq~~~Ql~~~iea~~~~~e~~lIN----GD~a~n~ 137 (310) T protein:vir:97 64 TFSGAGAGKAAATF-TKVNS-NLTTIMGDAEVNGLIQATRSGDGNDQTAVQIASKAKSAGRKYQDQLIN----GNGAGNE 137 (310) T ss_pred cccCCCcccccccc-ceeee-eeeeeeehhhhhhHHHhhhcCChHHHHHHHHHHHHHHHHHHHHHHhhc----cccCCCc Confidence 2344422211111 00000 11111 122333 4677644 2333333 444455555666663332 100 00 Q ss_pred h-hhhcccccccccCCCcceEE--eeccccHHHHHHHHHHHHhhCCccceEEechhhhhhhhhcCCCc----hhhhHHhh Q lcl|NC_011269. 180 Y-RVVDSSAQPGVGALPNEITI--AGSHLMPDDLYTAVTYTDQRQLDSSRLLANPQEYRDLYRWDINT----TGWAFKDS 252 (333) Q Consensus 180 ~-r~~~ssA~p~vg~~~N~i~i--~~g~Lt~~~L~~a~t~v~~~~L~at~il~~~~~~~Di~gw~~N~----~~~~~~Dp 252 (333) + =|-++ ..+ .+.|.. .||.+|+++|-++...|-+.+-.++.++||++-++=|.+-.-.. -.+.+.|+ T Consensus 138 F~GL~~~-~~~-----~q~i~~~~~gg~~t~d~LDeLl~~v~~~~g~p~~~l~~~~~~r~i~A~~R~~~~~g~~~~~~~~ 211 (310) T protein:vir:97 138 FAGLIQL-CAS-----GQKATTGATGSAISFAILDELMDLVVDKDGQVDYLTMHARTLRSYKALLRALGGASINEVVELP 211 (310) T ss_pred ccchhhc-CCc-----cceeecCCCCCCCCHHHHHHHHHHHhcCCCCCCEEEecHHHHHHHHHHHHHhcCCCCCCccccC Confidence 0 00011 111 134544 57999999999999999666667889999997666555311111 00111221 Q ss_pred hhhcceeeeeecccccceeeecCCe----------EEEeeChh---hhcccc----cccCceeccccchhhh-ccceehh Q lcl|NC_011269. 253 VVAGERIVQFGEFQIGKSIIIPRGT----------VYLTPEPE---FLGVFP----VMYSLDVEEDNKVERF-NKGWVMD 314 (333) Q Consensus 253 V~~~e~il~~G~fgi~~skvlprge----------iyvvadpE---~~G~~p----vR~~L~s~p~D~~er~-~kGWvm~ 314 (333) . |-.+..++-.-|...-++|-++ ||.+-==| .+|+-. -.+||.+.......+. ..=|.+. T Consensus 212 ~--G~~v~~~~GiPi~~~d~ip~~~~~~~~~gtTsIya~r~Ge~~~~~Gv~Gl~~~~~~glsVr~~G~~~~~~v~~~~V~ 289 (310) T protein:vir:97 212 S--GAEVPAYSGTPIFRNDYIPTNQTKGGTTGCTTIFAGTLDDGSRTHGIAGLTATQAAGIQVVDVGESEDSDEHIWRVK 289 (310) T ss_pred C--CCEEeeeCCeEEEEeCccCCCccccccCCceeEEEEeeCccccccceeccccCCccceeEEeCCcccCCcceeEEEE Confidence 1 1223444444455666677654 67665322 344432 2467888887655442 2335555 Q ss_pred hhhhhhhhccceEEEEecC Q lcl|NC_011269. 
315 ELVGMAILNPRGIVILRKA 333 (333) Q Consensus 315 E~~g~~i~N~~siv~~~~~ 333 (333) =-.|+++.||.++-.|++- T Consensus 290 ~Y~~~av~~~~A~a~L~~V 308 (310) T protein:vir:97 290 WYCGLALFSEKGLACADGI 308 (310) T ss_pred EeeeEEEecccceeeeccc Confidence 5699999999999999999 No 139 >protein:vir:2201 Length: 345 # NCBI annotation: major capsid protein # Family: family:all:975 # MgeID: mge:49 # MgeName: T7 # Cross-refs: genbank:acc:NP_041998;swissprot:sw:p19726;genbank:gi:9627469;goa:P19726;uniprot:P19726;genbank:GeneID:1261026 Probab=91.01 E-value=0.018 Score=30.16 Aligned_cols=286 Identities=13% Similarity=0.097 Sum_probs=150.4 Q ss_pred Cccc-----chhhhhhhhhhcccchHHHHHHHHHHhhcchhcchHHHHHHHHHHhcCchhHHHHHHHHHHHHHHHHHhhh Q lcl|NC_011269. 1 MTLP-----VAVGSGLGRFAKASDDYVADIVEAKQRMGGRKLSAREKQAKLAHILSDKVGGIQRLGQSMIGPIQLQLRYQ 75 (333) Q Consensus 1 ~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ls~ee~~~Lm~~Al~~~Eg~~~aLg~~mA~pI~~q~~rq 75 (333) |+-- ..+-.|-|+.+ ++ ++-+|.-+.++. -+-....|+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~--~~---------------------~~~al~le~f~g--------------eV~~~f~~~ 43 (345) T protein:vir:22 1 MASMTGGQQMGTNQGKGVVA--AG---------------------DKLALFLKVFGG--------------EVLTAFART 43 (345) T ss_pred Ccccccchhccccccccccc--CC---------------------chhHHHHHHHhH--------------HHHHHHHHH Confidence 2211 11122223322 11 222222222221 223345566 Q ss_pred hhhhhhhhccccCCCcceeecCCCCccceEEEEcCCCccccee--e--cCceeeccceeeeccccccHHHhhhhcchhHH Q lcl|NC_011269. 76 GILRNVLLEDTLTPGVPIQYDVLDDLGQAYMLHGNEGEIRITP--F--EGKRIEVQLFRIASFPQIKKEDLYYLRSNIVE 151 (333) Q Consensus 76 Gi~RklL~~~TL~~G~~p~y~v~~~v~~a~~~~~~~G~i~~Q~--i--~~~ri~~P~f~Ivs~P~V~~~dl~~~~~~vle 151 (333) -+.|.+....|++-|.--+||...++..+|+.-| .++..+. + -.+.|.+=+..+ ++-+|+--|=-|...|+.. 
T Consensus 44 s~~~~~~~~r~i~~gks~~~~~iG~~~~~~~~~G--~~l~~~~~~~~~~e~~ltID~~~y-~~~~VddiD~~q~~~D~r~ 120 (345) T protein:vir:22 44 SVTTSRHMVRSISSGKSAQFPVLGRTQAAYLAPG--ENLDDKRKDIKHTEKVITIDGLLT-ADVLIYDIEDAMNHYDVRS 120 (345) T ss_pred hhhcccceeeeccccceEEEeeecceEEEeeecC--CCCCCCCCCcccceEEEEecchhh-hhhhHhhHHHHhcCchhHH Confidence 6777777778999999999999999999995544 3343321 2 224455555544 4446777777788999999 Q ss_pred HHHHHHHHHHHHHhhhHHHHHHhhhhhhhhh-hcccccccccCCC--cceEEeecccc---------HHHHHHHHHHHHh Q lcl|NC_011269. 152 YTQDMTKQAIMRQEDSRLVTLLEAAAVSYRV-VDSSAQPGVGALP--NEITIAGSHLM---------PDDLYTAVTYTDQ 219 (333) Q Consensus 152 ~~q~~A~qaIM~qED~~~~slle~~a~~~r~-~~ssA~p~vg~~~--N~i~i~~g~Lt---------~~~L~~a~t~v~~ 219 (333) +.-+++-.|+-++.|..++--|-.++ |. -..+..|+.+.-- -.++..|..++ -++|..|.+..++ T Consensus 121 ~~s~~~G~aLA~~~D~~i~~~l~k~a---~~~~~~~~~~~~~~~~~~~~~~~~g~~~t~~~~~~~~~~~ai~~a~~~Lde 197 (345) T protein:vir:22 121 EYTSQLGESLAMAADGAVLAEIAGLC---NVESKYNENIEGLGTATVIETTQNKAALTDQVALGKEIIAALTKARAALTK 197 (345) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhh---cccccccccccccccccccccccccccccccccCHHHHHHHHHHHHHHhhh Confidence 99999999999999988876542222 11 0112233322111 11333344443 2345566678888 Q ss_pred hCCcc--ceEEechhhhhhhhhcC-CCchhhhHHhhhhhcceeeeeecccccceeeecCC-------------------- Q lcl|NC_011269. 220 RQLDS--SRLLANPQEYRDLYRWD-INTTGWAFKDSVVAGERIVQFGEFQIGKSIIIPRG-------------------- 276 (333) Q Consensus 220 ~~L~a--t~il~~~~~~~Di~gw~-~N~~~~~~~DpV~~~e~il~~G~fgi~~skvlprg-------------------- 276 (333) .+.|. -.+|++|+.|..|.-=. 
+|...|..-+...++ .+.+.-=|.|.+|--+|-+ T Consensus 198 ~~VP~~~R~~vv~P~~y~~Ll~~~~~~~~~~~~~~~~~~G-~V~~i~G~~V~~sn~lp~~~~~~~~~~~~~~~~~~~~~~ 276 (345) T protein:vir:22 198 NYVPAADRVFYCDPDSYSAILAALMPNAANYAALIDPEKG-SIRNVMGFEVVEVPHLTAGGAGTAREGTTGQKHVFPANK 276 (345) T ss_pred cCCCccCCEEEeChHHHHHHhccccccccccccccccccc-eEEEEeceEEEecccccccccCccccCcccccccccccc Confidence 88887 47899999999987510 011112333333333 2333333445555545422 Q ss_pred -eEE----------EeeChhhhcccccccCceeccccchhhhccceehhhhh--hhhhhccceEEEEecC Q lcl|NC_011269. 277 -TVY----------LTPEPEFLGVFPVMYSLDVEEDNKVERFNKGWVMDELV--GMAILNPRGIVILRKA 333 (333) Q Consensus 277 -eiy----------vvadpE~~G~~pvR~~L~s~p~D~~er~~kGWvm~E~~--g~~i~N~~siv~~~~~ 333 (333) +-| ++..|+-+|.--. .++++|-.. .+-..+|+|.-.. |-.+.||.+.|.+.-- T Consensus 277 g~~~~~~~~~~~~~l~~h~~A~~~v~~-~~~~~e~~r--~~~~~~d~I~~~~a~G~~vlRPeaa~~i~~~ 343 (345) T protein:vir:22 277 GEGNVKVAKDNVIGLFMHRSAVGTVKL-RDLALERAR--RANFQADQIIAKYAMGHGGLRPEAAGAVVFK 343 (345) T ss_pred cceeeeeccCceEEEEEehhheeeeee-ecceeeeee--chhHHHHHHHHHHhcCCcccccceeEEEEEe Confidence 111 1112222211100 011222211 1136678887654 5568999988766533 No 140 >protein:vir:103323 Length: 364 # NCBI annotation: major capsid-like protein # Family: family:all:2806 # MgeID: mge:1609 # MgeName: Era103 # Cross-refs: genbank:acc:YP_001039668;genbank:gi:125999997;genbank:GeneID:4818399 Probab=89.71 E-value=0.025 Score=29.39 Aligned_cols=290 Identities=14% Similarity=0.058 Sum_probs=152.6 Q ss_pred ccchHHHHHHHHHHhhcchhcchHHHHHHHHHHhcCchhHHHHHHHHHHHHHHHHHhhhhhhhhhhhccccCCCcceeec Q lcl|NC_011269. 17 ASDDYVADIVEAKQRMGGRKLSAREKQAKLAHILSDKVGGIQRLGQSMIGPIQLQLRYQGILRNVLLEDTLTPGVPIQYD 96 (333) Q Consensus 17 ~~~~~~~~~~~~~~~~~~~~ls~ee~~~Lm~~Al~~~Eg~~~aLg~~mA~pI~~q~~rqGi~RklL~~~TL~~G~~p~y~ 96 (333) -|+ . .--+-++..-|+ +..+|-=+.+. 
+-+..+..|.-+.+.+....|+.-|.-=+|| T Consensus 1 ms~-----~--n~~t~~~~~~~~-~~~al~le~f~--------------geV~taf~~~s~~~~~~~~rti~~gkS~q~~ 58 (364) T protein:vir:10 1 MSN-----P--NVLTQPAVSASG-EVDSLLIEKFN--------------NRVHEQYLKGENLLQWFDVQEVVGTNSVSNK 58 (364) T ss_pred CCC-----c--cccccccccccc-chhhhhhhhhh--------------hhHHHHHHHHHhhcCcceeeeecccceEEee Confidence 000 0 000111222111 22222222222 2334456677788888888999999999999 Q ss_pred CCCCccceEEEEcCCCcccceeecCc--eeeccceeeeccccccHHHhhhhcchhHHHHH-HHHHHHHHHHhhhHHHHHH Q lcl|NC_011269. 97 VLDDLGQAYMLHGNEGEIRITPFEGK--RIEVQLFRIASFPQIKKEDLYYLRSNIVEYTQ-DMTKQAIMRQEDSRLVTLL 173 (333) Q Consensus 97 v~~~v~~a~~~~~~~G~i~~Q~i~~~--ri~~P~f~Ivs~P~V~~~dl~~~~~~vle~~q-~~A~qaIM~qED~~~~sll 173 (333) ..-+...+|+--|. .+-.|++..+ .|.+=++- .++-+|+--|=.|...|.+.-.+ .++-+|+-+..|..++.++ T Consensus 59 ~iG~~~~~~~~~G~--~ld~~~~~~~k~~itID~ll-~a~~~V~diDe~q~~~D~vR~e~s~e~G~ALA~~~Dq~i~~~v 135 (364) T protein:vir:10 59 YIGETELQVLSPGK--SPDASPTEFDKNRLVVDTTV-IARNTVAHFHDVQNDIDGLKSKLSVNQAKKLKKMEDSMVIQQL 135 (364) T ss_pred eeeeeEEeeeccCc--ccCCCCcccCcEEEEeccee-eechhhhhHHHHhcCccchhHHHHHHHHHHHHHHHHHHHHHHH Confidence 98888888854433 2333333333 44444554 45556777777788888555444 6778899999999998877 Q ss_pred hhhhhhhhhhcccccccccCCCcceEEe---ec-cccHHHH----HHHHHHHHhhCCccc--eEEechhhhhhhhhcC-- Q lcl|NC_011269. 174 EAAAVSYRVVDSSAQPGVGALPNEITIA---GS-HLMPDDL----YTAVTYTDQRQLDSS--RLLANPQEYRDLYRWD-- 241 (333) Q Consensus 174 e~~a~~~r~~~ssA~p~vg~~~N~i~i~---~g-~Lt~~~L----~~a~t~v~~~~L~at--~il~~~~~~~Di~gw~-- 241 (333) -+++.+= +.--+..|.+..-..-|++. ++ .-....| ..|.+..++-+.|.. 
.++++|..|.-|.-=+ T Consensus 136 ~~aa~a~-~~~~~~~~~~~~~g~~i~~~~~a~~~~~~~~~l~~ai~~a~~~LdEkdVP~~~R~~vv~P~~y~~Ll~~~~l 214 (364) T protein:vir:10 136 VLGGISN-TEAIRKNPRVAGHGFSIHIVGLASSFLTSPQYMMAAIEMAMEQQTEQEVDTSELCGLMPWTAFNCLRDADRI 214 (364) T ss_pred Hhhhhhc-ccccccCCcccCCcceeeecccCcchhhhHHHHHHHHHHHHHHHhhcCCCccccEEEeChHHHHHHhcCCcc Confidence 6554211 11111222221111122221 11 1112333 357777888888665 5789999998887610 Q ss_pred CC-chhhhHHhhhhhcceeeeeecccccceeeecCCeEE------EeeChh---hhc-cccccc------Cceecc---- Q lcl|NC_011269. 242 IN-TTGWAFKDSVVAGERIVQFGEFQIGKSIIIPRGTVY------LTPEPE---FLG-VFPVMY------SLDVEE---- 300 (333) Q Consensus 242 ~N-~~~~~~~DpV~~~e~il~~G~fgi~~skvlprgeiy------vvadpE---~~G-~~pvR~------~L~s~p---- 300 (333) .| +|+.+.-+...++. +++.-=|.|.+|--+|..--. +.+-|. ..| .|.+.+ ++-..| T Consensus 215 vn~d~~~~~~~~~~~G~-v~~v~Gv~Vv~Sn~lP~~~~~~~~t~~~t~h~ls~~~~g~~y~v~~d~~~~~~~~f~~~Al~ 293 (364) T protein:vir:10 215 VDKSYTIAASDNTVDGF-VLKSWNTPIVPSNRFPKLSDNTEGTGNTKHHKLSNAGNGNRYDVTAGQTSAQAVLFTQDALL 293 (364) T ss_pred ccccccccCCCccccce-eEEEeceEEEeccccccccccccccccccccccccccCCcccccccccceeEEEEEecceEE Confidence 11 12222223333333 443333557777777753211 111111 001 222222 232222 Q ss_pred ----cc-ch--hh--hccceehhhhhh--hhhhccceEEEEecC Q lcl|NC_011269. 301 ----DN-KV--ER--FNKGWVMDELVG--MAILNPRGIVILRKA 333 (333) Q Consensus 301 ----~D-~~--er--~~kGWvm~E~~g--~~i~N~~siv~~~~~ 333 (333) .| +. 
++ ...+|+|.-..+ -.+.||.++|.+..+ T Consensus 294 tv~~~~~t~e~~~~~~~~~~~ida~~a~G~g~lRPeaa~~i~~~ 337 (364) T protein:vir:10 294 VGRTISITGDIFYEKKEKTWYIDTFLAEGAIPDRWEAVAVVTAA 337 (364) T ss_pred EEEEecceeeeeeccceeeeeeeeehcccCcccCccceEEEEec Confidence 22 22 12 478999986654 467999999999887 No 141 >protein:vir:105645 Length: 400 # NCBI annotation: putative major capsid protein # Family: family:all:2806 # MgeID: mge:1674 # MgeName: K1E # Cross-refs: genbank:acc:YP_425009;genbank:gi:83571757;uniprot:Q2WC43;genbank:GeneID:3837286 Probab=88.92 E-value=0.022 Score=29.63 Aligned_cols=287 Identities=9% Similarity=-0.008 Sum_probs=153.7 Q ss_pred HHHHHHHHHhhcchhcchHHHHHHHHHHhcCchhHHHHHHHHHHHHHHHHHhhhhhhhhhhhccccCCCcceeecCCCCc Q lcl|NC_011269. 22 VADIVEAKQRMGGRKLSAREKQAKLAHILSDKVGGIQRLGQSMIGPIQLQLRYQGILRNVLLEDTLTPGVPIQYDVLDDL 101 (333) Q Consensus 22 ~~~~~~~~~~~~~~~ls~ee~~~Lm~~Al~~~Eg~~~aLg~~mA~pI~~q~~rqGi~RklL~~~TL~~G~~p~y~v~~~v 101 (333) ..++ .--+.++-.-|+ +..+|.=+.+.- -+..+.-|+-+-+.+....||+-|.-=+||..-+. T Consensus 1 Ms~~--n~~t~p~~~gsg-~~~aL~Le~f~G--------------eV~taF~~~si~~~~~~vRtI~~gkS~qf~~lG~s 63 (400) T protein:vir:10 1 MSTP--NNLTNVAVSASG-EVDSLLIEKFNG--------------KVNEQYLKGENIMSYFDVQTVTGTNTVSNKYLGET 63 (400) T ss_pred CCCC--cccccccccccc-chhhhHHhHhcc--------------hHHHHHHHHhhhcccceeeeecccceEEEEEeeee Confidence 0000 000111111111 223333222222 23335667778888888999999999999999888 Q ss_pred cceEEEEcCCCcccceeecCce-eeccceeeeccccccHHHhhhhcchhHHHHH-HHHHHHHHHHhhhHHHHHHhhhhhh Q lcl|NC_011269. 102 GQAYMLHGNEGEIRITPFEGKR-IEVQLFRIASFPQIKKEDLYYLRSNIVEYTQ-DMTKQAIMRQEDSRLVTLLEAAAVS 179 (333) Q Consensus 102 ~~a~~~~~~~G~i~~Q~i~~~r-i~~P~f~Ivs~P~V~~~dl~~~~~~vle~~q-~~A~qaIM~qED~~~~slle~~a~~ 179 (333) ..+|+--|.+=+ +..+.-+|. 
|.+=+.-+ |+-+|...|=.+...|.+.-.+ ..--.|+-+..|..++.++.+++.+ T Consensus 64 ~a~y~~pG~~ld-g~~~~~dk~~ItIDtLL~-a~~~V~dlDd~q~~yD~vRse~s~e~G~ALA~~~Dq~iiq~i~~a~~a 141 (400) T protein:vir:10 64 ELQVLAPGQSPA-ATSTQADKNQLVIDATVI-ARNTVAHLHDVQGDIDSLKPKLATNQAKQLKKMEDEMLIQQMLLGGIA 141 (400) T ss_pred EEeeecCCCCcC-CCCcccCcEEEEeCceee-ecchhhhHHHHhhccccccHHHHHHHHHHHHHHHHHHHHHHHHHhccc Confidence 889965554422 222222333 66666654 5556666666677777333222 4445688899999999988777621 Q ss_pred hhhhcccccc--cccCCCcceEEeeccccH------HH----HHHHHHHHHhhCCccce--EEechhhhhhhhhcC--C- Q lcl|NC_011269. 180 YRVVDSSAQP--GVGALPNEITIAGSHLMP------DD----LYTAVTYTDQRQLDSSR--LLANPQEYRDLYRWD--I- 242 (333) Q Consensus 180 ~r~~~ssA~p--~vg~~~N~i~i~~g~Lt~------~~----L~~a~t~v~~~~L~at~--il~~~~~~~Di~gw~--~- 242 (333) +++.| ..|+..++.++.....+. .. +..|....+..++|..+ +++.++.|+=|..=+ . T Consensus 142 -----~t~~~~~~~~g~~~g~s~~v~~~~~~~~~~~~~l~~A~~~A~~~LdEkdVP~~d~vvl~pp~~Ys~Ll~~dkLvn 216 (400) T protein:vir:10 142 -----NTQAKRTNPRVKGHGFSVNVEVNEGEALVNPQYVMAAVEFALEQQLEQEVDISDVAILMPWRYFNVLRDADRIVD 216 (400) T ss_pred -----ccccccccCCccccccceeecccccccccCHHHHHHHHHHHHHHHHhcCCCccceEEEcCHHHHHHHHhCCcccc Confidence 12222 345555655554322222 23 44677778888888765 456688887665521 0 Q ss_pred CchhhhHHhhhhhcceeeeeecccccceeeecCCeEEEeeChh---h-hccccccc------Cce--------ecccc-- Q lcl|NC_011269. 243 NTTGWAFKDSVVAGERIVQFGEFQIGKSIIIPRGTVYLTPEPE---F-LGVFPVMY------SLD--------VEEDN-- 302 (333) Q Consensus 243 N~~~~~~~DpV~~~e~il~~G~fgi~~skvlprgeiyvvadpE---~-~G~~pvR~------~L~--------s~p~D-- 302 (333) ++|+.+.-+...+++ +++-==+.|.+|--+|.+---+.+.+. . 
.-.|.+++ ++- ++-.| T Consensus 217 rdf~~s~~g~~~~g~-v~~v~Gv~Iv~Sn~lP~~a~~~~~~~lS~a~~G~~y~~t~d~s~~~av~F~~sAv~tvk~~~lt 295 (400) T protein:vir:10 217 KSYTISQSGATIQGF-VLSSYNCPVIPSNRFPKYSQGQKHHLLSNEDNGYRYDPIAEMNGAIAVLFTADALLVGRSIDVI 295 (400) T ss_pred hhccccCCCccccce-EEEEeceEEEeeCcCCcccCcccccccccCCCCccCCccccccceeEEEEehhheEEEEeeccc Confidence 133333334444444 333333556677777753211111110 0 11233222 221 22222 Q ss_pred -chhh--hccceehhh--hhhhhhhccceEEEEecC Q lcl|NC_011269. 303 -KVER--FNKGWVMDE--LVGMAILNPRGIVILRKA 333 (333) Q Consensus 303 -~~er--~~kGWvm~E--~~g~~i~N~~siv~~~~~ 333 (333) ..++ -.++|+|.- +.|-...||-++.++.-+ T Consensus 296 ~~~~~d~r~~~~~id~~~a~G~g~~RPeaa~vv~~~ 331 (400) T protein:vir:10 296 GDIFYEKKEKTYYIDTFMSEGAIPDRWEAVSVVTTK 331 (400) T ss_pred cccccchhhHHHHHHHHHHhCCcccchhheEEEEec Confidence 1112 378999975 556677899998888766 No 142 >protein:vir:3136 Length: 322 # NCBI annotation: hypothetical protein # Family: family:all:11728 # MgeID: mge:64 # MgeName: VpV262 # Cross-refs: genbank:acc:NP_640318;genbank:gi:21234405;genbank:GeneID:956058 Probab=87.63 E-value=0.037 Score=28.41 Aligned_cols=276 Identities=13% Similarity=0.085 Sum_probs=141.7 Q ss_pred hcchhcchHHHHHHHHHHhcCchhHHHHHHHHHHHHHHHHHhhhhhhhhhhhccccCCCcceeecCCCCccceEEEEcCC Q lcl|NC_011269. 32 MGGRKLSAREKQAKLAHILSDKVGGIQRLGQSMIGPIQLQLRYQGILRNVLLEDTLTPGVPIQYDVLDDLGQAYMLHGNE 111 (333) Q Consensus 32 ~~~~~ls~ee~~~Lm~~Al~~~Eg~~~aLg~~mA~pI~~q~~rqGi~RklL~~~TL~~G~~p~y~v~~~v~~a~~~~~~~ 111 (333) |.--.=+-.+ ++ |=-+|=| .+.+=.-+++.|=..-+.|+ .-...|.....+-.-+++-+= ..+. 
T Consensus 1 ~~~~n~ts~~-qa-----fi~~EiW----sa~il~~l~~~Lv~~~~~~~----~d~g~GDtV~InsIg~~tV~d--Y~~~ 64 (322) T protein:vir:31 1 MSTGNNTSNT-QA-----LIVSEIW----ADEIEDILHEKLLDVNIARV----VDFPDGDKLTIPSVGTPVVRS--RPEQ 64 (322) T ss_pred CCCCCCcccc-eE-----Eeehhhh----HHHHHHHhhhhhhhhhhhcc----cccCCCCeEEecccccccccc--ccCC Confidence 2100000000 00 0012222 11111111222222222221 222346555555543332121 3456 Q ss_pred CcccceeecCceeeccce-eeeccccccHHHhhhhcchhHHHHHHHHHHHHHHHhhhHHHHHHhhhhhhhhhhccccccc Q lcl|NC_011269. 112 GEIRITPFEGKRIEVQLF-RIASFPQIKKEDLYYLRSNIVEYTQDMTKQAIMRQEDSRLVTLLEAAAVSYRVVDSSAQPG 190 (333) Q Consensus 112 G~i~~Q~i~~~ri~~P~f-~Ivs~P~V~~~dl~~~~~~vle~~q~~A~qaIM~qED~~~~slle~~a~~~r~~~ssA~p~ 190 (333) +.+-+|+...-.+++.-- +.-..-.|+- |.-|.+.++.....++|.+++....|..+-+||-..|-. -+++++ T Consensus 65 ~~i~~d~ltt~~~~l~IDq~KYfaf~VdD-D~~Qa~~dl~~~~~~~aa~ala~~~D~fva~lL~~gA~~-----~~~~~~ 138 (322) T protein:vir:31 65 GDFTFDNLDTGEISIILRDEVYAGNAISK-KLRQDSRWISNVGAMLPAEQARAIMERYQTDLLALGNAQ-----FAGQND 138 (322) T ss_pred CCcccccCCCceEEEEEehhhhhccccch-hHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhh-----hhccCC Confidence 778888877766666511 1222234776 888999999999999999999999999999977544311 112221 Q ss_pred ---ccCCCcceEEeecc--ccHHHHHHHHHHHHhhCCcc--ceEEechhhhhhhhhcCCCchhhhHHhhhhhcceeeeee Q lcl|NC_011269. 191 ---VGALPNEITIAGSH--LMPDDLYTAVTYTDQRQLDS--SRLLANPQEYRDLYRWDINTTGWAFKDSVVAGERIVQFG 263 (333) Q Consensus 191 ---vg~~~N~i~i~~g~--Lt~~~L~~a~t~v~~~~L~a--t~il~~~~~~~Di~gw~~N~~~~~~~DpV~~~e~il~~G 263 (333) +.++|-.+--+|.. ..=+.|-.+.+..++.+.|. 
-.+|++|+.+..+.+ ++.+.-..+||=--+ +.+.| T Consensus 139 p~vin~~~~~iv~~gt~~~~ay~~lv~l~~kLdkanVP~~gR~vVV~P~~~~~L~~--i~~~~~l~~D~rf~~--i~~sG 214 (322) T protein:vir:31 139 PNVINGVPHRFVGTGTDQTMDVTDFSRVNYVMTQSKMPMGGMIGIIDPSVAHHLET--ITNISNISNNPRWEG--IVESG 214 (322) T ss_pred cceecCCccceeccCCCchhhHHHHHHHHHHhccccCCCCCeEEEeCchhhhhhhh--hhhhhhhhccccccc--ccccc Confidence 12222222223321 22246777778888888885 357788999999977 555543444432100 23333 Q ss_pred c------------ccccceeeecCCeEEEeeChh----------------hhcccccccCceeccccchhh----hccce Q lcl|NC_011269. 264 E------------FQIGKSIIIPRGTVYLTPEPE----------------FLGVFPVMYSLDVEEDNKVER----FNKGW 311 (333) Q Consensus 264 ~------------fgi~~skvlprgeiyvvadpE----------------~~G~~pvR~~L~s~p~D~~er----~~kGW 311 (333) . |.|..|-.||-+..=++|-.. --|.-|.++..|--|.-..|| |+.+- T Consensus 215 ~a~g~~~Vg~~~GF~V~~SN~l~~~~~~i~aG~d~~~t~ag~~n~f~~~~~~~~~~~~~~~~~l~~~e~~r~~~~~~d~~ 294 (322) T protein:vir:31 215 IAPDMQFVRSVYGIDLFVSNLLADANETINAGGDARSTTAGKCNMFMNVSDMGLLPFVVAWKEMPTTKSFIDDYNDDLNT 294 (322) T ss_pred chhhHHHHHHHhceeeeeeccccccccccccCcccccccceeecccccccchhhhhhhhHhhhhhhhhcccCccccccce Confidence 2 445556555422211111111 013445555555555444554 55555 Q ss_pred ehhhhhhhhhhccceEEEEecC Q lcl|NC_011269. 312 VMDELVGMAILNPRGIVILRKA 333 (333) Q Consensus 312 vm~E~~g~~i~N~~siv~~~~~ 333 (333) .----.|.+|++|-+++.|--. T Consensus 295 ~~~~~~g~g~~r~e~l~~~~a~ 316 (322) T protein:vir:31 295 ATTARWGNGLVRDENLVCVLAN 316 (322) T ss_pred eeeeeecceeecccceEEEEec Confidence 5555569999999999887644 No 143 >protein:vir:97031 Length: 402 # NCBI annotation: 31 # Family: family:all:2806 # MgeID: mge:1644 # MgeName: K1-5 # Cross-refs: genbank:acc:YP_654132;genbank:gi:108862016;genbank:GeneID:5075980 Probab=81.31 E-value=0.087 Score=26.40 Aligned_cols=288 Identities=12% Similarity=0.069 Sum_probs=151.2 Q ss_pred ccchHHHHHHHHHHhhcchhcchHHHHHHHHHHhcCchhHHHHHHHHHHHHHHHHHhhhhhhhhhhhccccCCCcceeec Q lcl|NC_011269. 
17 ASDDYVADIVEAKQRMGGRKLSAREKQAKLAHILSDKVGGIQRLGQSMIGPIQLQLRYQGILRNVLLEDTLTPGVPIQYD 96 (333) Q Consensus 17 ~~~~~~~~~~~~~~~~~~~~ls~ee~~~Lm~~Al~~~Eg~~~aLg~~mA~pI~~q~~rqGi~RklL~~~TL~~G~~p~y~ 96 (333) -|+ . .--+-++..-|+ ...+|.=+.+. +-+..+..|.-+.+.+....|+.-|.-=+|| T Consensus 1 Ms~-----~--n~~t~~~~~~s~-~~~al~le~f~--------------geV~taF~~~si~~~~~~vrti~~GkS~qf~ 58 (402) T protein:vir:97 1 MST-----P--NTLTNVAVSASG-EVDSLLIEKFN--------------GKVNEQYLKGENILSYFDVQTVTGTNTVSNK 58 (402) T ss_pred CCC-----c--cccccccccccc-chhhhhhhhhh--------------hhHHHHHHHHHhhcCcceeeeecccceEEEE Confidence 000 0 000111222121 22222222222 2334456677788888888999999999999 Q ss_pred CCCCccceEEEEcCCCcccceeec-Cc-eeeccceeeeccccccHHHhhhhcchhHHHHH-HHHHHHHHHHhhhHHHHHH Q lcl|NC_011269. 97 VLDDLGQAYMLHGNEGEIRITPFE-GK-RIEVQLFRIASFPQIKKEDLYYLRSNIVEYTQ-DMTKQAIMRQEDSRLVTLL 173 (333) Q Consensus 97 v~~~v~~a~~~~~~~G~i~~Q~i~-~~-ri~~P~f~Ivs~P~V~~~dl~~~~~~vle~~q-~~A~qaIM~qED~~~~sll 173 (333) ..-+...+|+--|.+ +-.+++. +| .|.+=++-+ ++-+|+--|=.|...|.+.-.+ .+.-+|+-+..|..++.++ T Consensus 59 ~iG~~~a~y~~~G~~--ldg~~~~~~k~~ItID~lL~-a~~~V~diDeaq~~yD~vRse~s~e~G~ALA~~~Dq~ii~~i 135 (402) T protein:vir:97 59 YLGETELQVLAPGQS--PNATPTQADKNQLVIDTTVI-ARNTVAHIHDVQGDIDSLKPKLAMNQAKQLKRLEDQMAIQQM 135 (402) T ss_pred EEeeeEEeeeccccc--cCCCCcccccEEEEeCceee-chhhhhhHHHHHhcccchhHHHHHHHHHHHHHHHHHHHHHHH Confidence 988888888654433 2223322 22 366666654 4455776777788888544334 6677899999999999888 Q ss_pred hhhhhhhhhhccc-cc-cccc--CCCcceEEeec--cccHHHHH----HHHHHHHhhCCccc--eEEechhhhhhhhhcC Q lcl|NC_011269. 174 EAAAVSYRVVDSS-AQ-PGVG--ALPNEITIAGS--HLMPDDLY----TAVTYTDQRQLDSS--RLLANPQEYRDLYRWD 241 (333) Q Consensus 174 e~~a~~~r~~~ss-A~-p~vg--~~~N~i~i~~g--~Lt~~~L~----~a~t~v~~~~L~at--~il~~~~~~~Di~gw~ 241 (333) -+++.+= +... +- +++| ... +++.++. .-++.+|. .|.+..+..+.|.. 
.++++|+.|.-|.-=+ T Consensus 136 ~~aa~a~--t~~~~~~~~~~~~g~s~-~~~~t~~~a~~~~~~l~~ai~~a~~~LdEkdVP~~dRv~vv~P~~y~~Ll~~~ 212 (402) T protein:vir:97 136 LLGGIAN--TKAERNKPRVKGHGFSI-NVNVTESEALANPQYVMAAVEYALEQQLEQEVDISDVAIMMPWKFFNALRDAD 212 (402) T ss_pred HHhhccc--cccccccCccccccccc-ccccccchhhcCHHHHHHHHHHHHHHHHhcCCCccccEEEeChHHHHHHhhcc Confidence 6655210 0000 00 1111 110 1112211 34555555 56677777887765 5788999999888500 Q ss_pred --C-CchhhhHHhhhhhcceeeeeecccccceeeecCCeEEEeeCh---hhhc-ccccccCce--------------ecc Q lcl|NC_011269. 242 --I-NTTGWAFKDSVVAGERIVQFGEFQIGKSIIIPRGTVYLTPEP---EFLG-VFPVMYSLD--------------VEE 300 (333) Q Consensus 242 --~-N~~~~~~~DpV~~~e~il~~G~fgi~~skvlprgeiyvvadp---E~~G-~~pvR~~L~--------------s~p 300 (333) . ++|+...-+....+. +.+-==|.|.+|--+|.+--.+...+ ..-| .|.++++-. ++- T Consensus 213 rl~n~d~~~~~~g~~~~G~-v~~v~Gv~Vv~SnnlP~~a~~it~~~ls~a~~G~~y~~t~d~t~~~~~~f~~~Av~tvk~ 291 (402) T protein:vir:97 213 RIVDKTYTISQSGATINGF-VLSSYNCPVIPSNRFPTFAQDQAHHLLSNEDNGYRYDPIAEMNGAVAVLFTSDALLVGRT 291 (402) T ss_pred cccchhhccccCCccccce-eEEEeceEEEecCccccccccccccccccCCCCccCCcCcccceeEEEEEecceEEEEEe Confidence 0 112222234444444 33222256777777776432232211 1112 244444322 111 Q ss_pred cc---ch--hhhccceehhhhhhhh--hhccc--eEEEEecC Q lcl|NC_011269. 301 DN---KV--ERFNKGWVMDELVGMA--ILNPR--GIVILRKA 333 (333) Q Consensus 301 ~D---~~--er~~kGWvm~E~~g~~--i~N~~--siv~~~~~ 333 (333) .| .. 
+.-.++|+|.-..+|+ ..||- +||++++- T Consensus 292 ~~vT~~~~~d~r~~~~~id~~~a~G~g~~RPeaa~vv~~~~~ 333 (402) T protein:vir:97 292 IEVTGDIFYEKKEKTYYIDTFMAEGAIPDRWEAVSVVTTKRD 333 (402) T ss_pred eccccchhhchhHHHHHHHHHHHhCCcccCccceEEEEEecc Confidence 11 11 2248899998766554 55674 56666663 No 144 >protein:vir:5974 Length: 324 # NCBI annotation: hypothetical protein # Family: family:all:1522 # MgeID: mge:125 # MgeName: SPP1 # Cross-refs: genbank:acc:NP_690674;genbank:geneid:6329212;genbank:gi:22855068;goa:Q38582;uniprot:Q38582;genbank:GeneID:955303 Probab=77.41 E-value=0.12 Score=25.53 Aligned_cols=265 Identities=14% Similarity=0.105 Sum_probs=122.6 Q ss_pred hcchhcchHHHHHHHHHHhcCchhHHHHHHHHHHHHHHHHHh--hhhhhhhhh-hcccc---CCCcceeecCCCCc-cce Q lcl|NC_011269. 32 MGGRKLSAREKQAKLAHILSDKVGGIQRLGQSMIGPIQLQLR--YQGILRNVL-LEDTL---TPGVPIQYDVLDDL-GQA 104 (333) Q Consensus 32 ~~~~~ls~ee~~~Lm~~Al~~~Eg~~~aLg~~mA~pI~~q~~--rqGi~RklL-~~~TL---~~G~~p~y~v~~~v-~~a 104 (333) |-.-+|| |--.- +++++-+..++-+..+ .-|+....- ...+| .+|....+|.-.++ +.+ T Consensus 1 MA~T~ls-------------d~i~p-eVf~~yv~~~~~~~~~l~qSg~i~~~a~i~~~l~~~~~G~~i~~P~~~~l~Gd~ 66 (324) T protein:vir:59 1 MAYTKIS-------------DVIVP-ELFNPYVINTTTQLSAFFQSGIAATDDELNALAKKAGGGSTLNMPYWNDLDGDS 66 (324) T ss_pred CCceeee-------------ceech-hHHHHHHHhhhHHHHHHhhcccccccHHHHHHhhccCCCCEEEecccccCCCcc Confidence 3332222 11111 3344444333322211 112211111 11122 24666666665554 222 Q ss_pred EEEEcCCCcccceeecCceeeccceeeeccccccHHHhhh--hcchhHHHHHHHHHHHHHHHhhhHHHHHHhhhhhhhhh Q lcl|NC_011269. 105 YMLHGNEGEIRITPFEGKRIEVQLFRIASFPQIKKEDLYY--LRSNIVEYTQDMTKQAIMRQEDSRLVTLLEAAAVSYRV 182 (333) Q Consensus 105 ~~~~~~~G~i~~Q~i~~~ri~~P~f~Ivs~P~V~~~dl~~--~~~~vle~~q~~A~qaIM~qED~~~~slle~~a~~~r~ 182 (333) - .....++|....+.-..-.-.-.. 
..--+...|+-+ .-+|-++++-+.-..++++..|..+++.|.++.-+- T Consensus 67 ~-~v~~~~~i~~~~l~t~~~~a~i~~--~~k~~~~tD~a~~~sg~dp~~~i~~q~a~~~~~~~~~~lia~l~g~~~~~-- 141 (324) T protein:vir:59 67 Q-VLNDTDDLVPQKINAGQDKAVLIL--RGNAWSSHDLAATLSGSDPMQAIGSRVAAYWAREMQKIVFAELAGVFSND-- 141 (324) T ss_pred c-ccCCCcccchhhcccceeeEEEEe--ecCceeehhhhhhhccchHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcc-- Confidence 2 223333333322221111111111 111123334322 356888999999999999999999999998765211 Q ss_pred hcccccccccCCCcceEEeecc---ccHHHHHHHHHHHHhhCCccceEEechhhhhhhhhcCCCchhhhHHhhhhhccee Q lcl|NC_011269. 183 VDSSAQPGVGALPNEITIAGSH---LMPDDLYTAVTYTDQRQLDSSRLLANPQEYRDLYRWDINTTGWAFKDSVVAGERI 259 (333) Q Consensus 183 ~~ssA~p~vg~~~N~i~i~~g~---Lt~~~L~~a~t~v~~~~L~at~il~~~~~~~Di~gw~~N~~~~~~~DpV~~~e~i 259 (333) ++-.|.+.+++.. ++.+.|..|.+..-|..=..+-++|++.-|.+|+.=++ ++-+..-+-- T Consensus 142 ---------~~~~~~~dvsa~~~~~~s~~~l~~A~~~~GD~~~~~~~ivmhS~v~~~L~~~~l-------i~~~~~s~~~ 205 (324) T protein:vir:59 142 ---------DMKDNKLDISGTADGIYSAETFVDASYKLGDHESLLTAIGMHSATMASAVKQDL-------IEFVKDSQSG 205 (324) T ss_pred ---------ccccceeeeeccccceecHHHHHHHHHHhCCcccCcEEEEEchHHHHHHHHhhh-------hhhccccccC Confidence 1223666776654 78899999999999988889999999999999998222 2333333322 Q ss_pred eeeeccc---ccceeeecC---------CeEEEeeChhhhcccccccCceeccccchhh----h--ccceehhhhhhhhh Q lcl|NC_011269. 260 VQFGEFQ---IGKSIIIPR---------GTVYLTPEPEFLGVFPVMYSLDVEEDNKVER----F--NKGWVMDELVGMAI 321 (333) Q Consensus 260 l~~G~fg---i~~skvlpr---------geiyvvadpE~~G~~pvR~~L~s~p~D~~er----~--~kGWvm~E~~g~~i 321 (333) ...|.+. +..+..+|- .+.|++..-- .|...-+.++.+|.+..+.+ + ++=|++ -..|++. 
T Consensus 206 ~~i~~~~G~~VivdD~~p~~~~~~~~~~y~s~l~~~GA-i~~~~~~~~v~vE~dRd~~~g~~~l~~r~~~~~-~p~G~s~ 283 (324) T protein:vir:59 206 IRFPTYMNKRVIVDDSMPVETLEDGTKVFTSYLFGAGA-LGYAEGQPEVPTETARNALGSQDILINRKHFVL-HPRGVKF 283 (324) T ss_pred ceeeeecccEEEEeCCCCccccCCCCceEEEEEEecCe-EEEeecCCCcceecccCccccceEEEEeeEEEe-EeeeEEe Confidence 2334431 444444552 2456655332 23333333444444322211 0 111111 1111111 Q ss_pred -------hccceE----------EEEecC Q lcl|NC_011269. 322 -------LNPRGI----------VILRKA 333 (333) Q Consensus 322 -------~N~~si----------v~~~~~ 333 (333) .||+-- |.=+|+ T Consensus 284 ~~~~~~~~sPt~~~L~~~~NW~~v~~~k~ 312 (324) T protein:vir:59 284 TENAMAGTTPTDEELANGANWQRVYDPKK 312 (324) T ss_pred cccccCCCCCChhhhcCCcccccccCccc Confidence 122110 111111 No 145 >protein:vir:97397 Length: 517 # NCBI annotation: major capsid protein # Family: family:all:11745 # MgeID: mge:1675 # MgeName: Q54 # Cross-refs: genbank:acc:YP_762590;genbank:gi:115304291;genbank:GeneID:5130600 Probab=56.18 E-value=0.46 Score=22.42 Aligned_cols=304 Identities=11% Similarity=0.072 Sum_probs=120.0 Q ss_pred Ccccchhhhhhhh-hhcccchHHHHHHHHHH----hhcchhcc----------hHHHHHHHHHHh--------------c Q lcl|NC_011269. 1 MTLPVAVGSGLGR-FAKASDDYVADIVEAKQ----RMGGRKLS----------AREKQAKLAHIL--------------S 51 (333) Q Consensus 1 ~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~----~~~~~~ls----------~ee~~~Lm~~Al--------------~ 51 (333) ++ .....+-. -.+.-+...+.+ +.+| .+++.+.. .......-.... + T Consensus 168 l~---~~~~~~~~~~~e~~~~l~a~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 243 (517) T protein:vir:97 168 LK---ERENGGDNAALKTVSELAANL-MKQRESEKILGVEALKVTPEATEFLKTREAEVAYMSASLTKDPKAAWTAELKE 243 (517) T ss_pred HH---HHHHHHHHHHHhhhhhhhhhH-HHHHHhhhhcccccccccchhhHHHHHHHHHHHHHHhcccccccceeeeeccc Confidence 00 00000000 000000000000 0000 01111100 000000000000 0 Q ss_pred CchhHHHHHHHHHHHHHHHHHhhhhhhhhhhhccccCCCcceeecCCCCccceEEEEcCCCcccceeecCceeeccceee Q lcl|NC_011269. 
52 DKVGGIQRLGQSMIGPIQLQLRYQGILRNVLLEDTLTPGVPIQYDVLDDLGQAYMLHGNEGEIRITPFEGKRIEVQLFRI 131 (333) Q Consensus 52 ~~Eg~~~aLg~~mA~pI~~q~~rqGi~RklL~~~TL~~G~~p~y~v~~~v~~a~~~~~~~G~i~~Q~i~~~ri~~P~f~I 131 (333) ...+.. ..-..+...|...+......+++.+..+++....|... +- ..+-+++.-++.+...+.=..++++-.+| T Consensus 244 ~~~~~~-~~p~~~~~~i~~~~~~~~~i~~~~~~~~i~~~~~~~~~---~~-~~a~~~~eG~~kp~s~~tf~~~~~~~~~i 318 (517) T protein:vir:97 244 RGISGM-PAPAGILKRIQDAVNDEGSLLPFIRHENLPTLVVGGDN---AL-TQGTGHTTGTDKTESNITLQTRVLTPQYV 318 (517) T ss_pred cccccc-ccchHHHHHHHHhhhhhccceeeeeeccccceeeeccc---cc-ceeeeeecCCcccccccceeeEEeeHhhh Confidence 111111 11123344454444444445555555555443333221 11 12223444444555554445677777788 Q ss_pred eccccccHHHhhhhcch----hHHHHHHHHHHHHHHHhhhHHHHHHhhhhhhhhhhcccccccccCCCcceEEee----c Q lcl|NC_011269. 132 ASFPQIKKEDLYYLRSN----IVEYTQDMTKQAIMRQEDSRLVTLLEAAAVSYRVVDSSAQPGVGALPNEITIAG----S 203 (333) Q Consensus 132 vs~P~V~~~dl~~~~~~----vle~~q~~A~qaIM~qED~~~~slle~~a~~~r~~~ssA~p~vg~~~N~i~i~~----g 203 (333) ...+.+....|.....| +-.+...+-..++...||.-++ .+-- ......|.+++.-+... + T Consensus 319 a~~~~~S~qll~Ds~~dd~~~l~s~i~~~l~~~l~~~ee~a~l---~GdG--------tg~~~~gi~~~a~~~~~~~~~~ 387 (517) T protein:vir:97 319 YKYIKLPKIVMNSNATDIAGAILTYVMNRLPDMVIMAVNRAII---MGGV--------TGVSETQIYPVVGDAWATNVTG 387 (517) T ss_pred hhhhhhhHHHHHHhhhccHHHHHHHHHHHHHHHHHHHHHHHHh---cccC--------CCcccccccccccccccccccc Confidence 88888888888777776 6678888899999999995543 2211 11222333332211111 1 Q ss_pred cccHHHHHHHH-HHHHhhCCccceEEechhhhhhhhhcCCCchhhhHHhhhhhcceeeeeecccccc-eeeecCCeEEEe Q lcl|NC_011269. 204 HLMPDDLYTAV-TYTDQRQLDSSRLLANPQEYRDLYRWDINTTGWAFKDSVVAGERIVQFGEFQIGK-SIIIPRGTVYLT 281 (333) Q Consensus 204 ~Lt~~~L~~a~-t~v~~~~L~at~il~~~~~~~Di~gw~~N~~~~~~~DpV~~~e~il~~G~fgi~~-skvlprgeiyvv 281 (333) ..+-.++...+ .... +.....++||+.-|.-|+---=+.--|.+-+.+..... +++||+-. 
-..++-|+..++ T Consensus 388 ~~~~~d~i~~l~~a~~--~a~~a~~vmn~~t~~~I~klKD~~G~Yl~~~~~~~~~~---~~l~G~~~~~~~~~~~~~~~~ 462 (517) T protein:vir:97 388 TTNIQELLEKLSVATP--KAADSTLVIHRNDLAAIRFLKDKNGNYVFPVGVSNQTI---ATHFGFNRLVQSVAVDEKTAV 462 (517) T ss_pred cchHHHHHHHHHHHhh--hccCCEEEECHHHHHHHHHhhcCCCCeeccCcCCcccc---cccCCccccccccccCceeEe Confidence 12222322222 2222 23456689999999988852111111333222222221 23344210 112333443332 Q ss_pred eChhhhcccccccCceecccc-ch-hhhccceehhhhhhhhhhccceEEEEecC Q lcl|NC_011269. 282 PEPEFLGVFPVMYSLDVEEDN-KV-ERFNKGWVMDELVGMAILNPRGIVILRKA 333 (333) Q Consensus 282 adpE~~G~~pvR~~L~s~p~D-~~-er~~kGWvm~E~~g~~i~N~~siv~~~~~ 333 (333) |+.-|-+.+..-.+-.+ .. .....-|+.-..+|.+|.+|......-.- T Consensus 463 ----~~~~y~i~~~~g~~~~~~fd~~~n~~~f~~~~~~~g~i~~~~r~a~~~~~ 512 (517) T protein:vir:97 463 ----SLSGYVTNGSRGMEFEQGTILVENNKEYLFEMPISGSLEYKGTTAYGTYT 512 (517) T ss_pred ----eccccEEEeecceeeeeeeecccCceeEeeeeeeccccccccceEEEEEc Confidence 22223222211111111 10 01133455555667777777665543332 No 146 >protein:vir:3525 Length: 423 # NCBI annotation: major head protein # Family: family:all:1412 # MgeID: mge:72 # MgeName: APSE-1 # Cross-refs: genbank:acc:NP_050985;genbank:gi:9633571;genbank:GeneID:1262318 Probab=47.18 E-value=0.71 Score=21.40 Aligned_cols=249 Identities=9% Similarity=0.020 Sum_probs=117.9 Q ss_pred HHHHhcC--chhHHHHHHHHHHHHHHHHHhhhh-hhhhhhhcccc-CCCcceeecCCCCccceEEEEcCCCcccceeecC Q lcl|NC_011269. 46 LAHILSD--KVGGIQRLGQSMIGPIQLQLRYQG-ILRNVLLEDTL-TPGVPIQYDVLDDLGQAYMLHGNEGEIRITPFEG 121 (333) Q Consensus 46 m~~Al~~--~Eg~~~aLg~~mA~pI~~q~~rqG-i~RklL~~~TL-~~G~~p~y~v~~~v~~a~~~~~~~G~i~~Q~i~~ 121 (333) |+.-|.+ ++- .++.....++.+|=.-. +-|.+.-+... ..|.-...++|+++..+=+-.+.-+.+..|.+-. 
T Consensus 1 MAN~llT~iP~i----ia~~al~~l~~~lV~~~lV~r~y~ge~~~a~~GDTV~I~~p~~~~v~d~~~~~~~~~~~~~~~e 76 (423) T protein:vir:35 1 MANNLESNISQI----VLKKFLPGFMSDIVLCKTVDRQLLSGEINSNTGDSVSFKRPHQFKSERTETGDITGKDKNGLFS 76 (423) T ss_pred CccchhhhhHHH----HHHHHHHHHHhhcccchhcccCCCcccccccCCCEEEEeeCCcceeecccCcCCCCcccccccc Confidence 5544444 233 23344444444443211 11222222211 3466777777776644332222222233333332 Q ss_pred ceeecc-ceeeeccccccHHHhhhhcchhHHHHHHHHHHHHHHHhhhHHHHHHhhhhhhhhhhcccccccccCCCcceEE Q lcl|NC_011269. 122 KRIEVQ-LFRIASFPQIKKEDLYYLRSNIVEYTQDMTKQAIMRQEDSRLVTLLEAAAVSYRVVDSSAQPGVGALPNEITI 200 (333) Q Consensus 122 ~ri~~P-~f~Ivs~P~V~~~dl~~~~~~vle~~q~~A~qaIM~qED~~~~slle~~a~~~r~~~ssA~p~vg~~~N~i~i 200 (333) ..+++. .-.......++-.|+-+...+. ++....+..++..+-|..+.+++...+ . ..+|.. -+. T Consensus 77 ~~v~l~id~~k~~a~~v~d~e~~l~i~~~-~~~l~~a~~ala~~vd~~l~~~l~~~a--~--------~~vgt~---~t~ 142 (423) T protein:vir:35 77 AKATGKVGKYITVAVEWTQIEEALKLNQL-DQILSPIHERMVTDLETELAHFMMNNG--A--------LSLGSP---NTA 142 (423) T ss_pred ceeeEEeccceeccceeCHHHHHhhHHHH-HHHHHHHHHHHHHHHHHHHHHHHhhcc--c--------cccccc---cCC Confidence 335555 4445555566777766655555 666677888888888888888664433 1 111111 010 Q ss_pred eeccccHHHHHHHHHHHHhhCCcc--ceEEechhhhhhhhhcCCCchhhh----HHhhhhhcceeeeeecccccceeeec Q lcl|NC_011269. 201 AGSHLMPDDLYTAVTYTDQRQLDS--SRLLANPQEYRDLYRWDINTTGWA----FKDSVVAGERIVQFGEFQIGKSIIIP 274 (333) Q Consensus 201 ~~g~Lt~~~L~~a~t~v~~~~L~a--t~il~~~~~~~Di~gw~~N~~~~~----~~DpV~~~e~il~~G~fgi~~skvlp 274 (333) . -.-+++-.|.+..++.+.|- -++|++|+.|..|.+ ...++.. 
..+-++.++++-+..=|-|..|--+| T Consensus 143 ~---~~~~~i~~a~~~Ld~~~vP~~~R~~Vv~p~~~a~Ll~--~~~~~~~~~~~~~~alr~g~i~G~i~GFdv~~Snnvp 217 (423) T protein:vir:35 143 I---KKWADVAQTASFIKDIGIKTGENYAIMDPWSAQRLAD--AQSGLHAADQLVRTAWENAQISGNFGGIRALMSNGLA 217 (423) T ss_pred c---chHHHHHHHHHHHHHhcCCcCCCEEEeCHHHHHHHhc--cccceeccccchhHHHhhccceeeecceEEEEcCCCc Confidence 0 12478999999999999995 578999999999987 2222111 12345555433233335577888888 Q ss_pred CCeEEEeeChhhhcccccccCceeccccchhh------hccc-eehhhhhhhhhhccceEEEEecC Q lcl|NC_011269. 275 RGTVYLTPEPEFLGVFPVMYSLDVEEDNKVER------FNKG-WVMDELVGMAILNPRGIVILRKA 333 (333) Q Consensus 275 rgeiyvvadpE~~G~~pvR~~L~s~p~D~~er------~~kG-Wvm~E~~g~~i~N~~siv~~~~~ 333 (333) ..+-+- ++|. ++=.+=..-+...+.- .-.| |+ +..+.+..-.. T Consensus 218 ~~T~gt-----~~~~-~~v~~a~~v~~~a~~~~~~~~~~~~~~~~----------~~~g~l~~GD~ 267 (423) T protein:vir:35 218 SRKQGD-----FDGA-ITVKTAPNVDYLSVKDSYQFTVALTGATP----------SKTGFLKAGDQ 267 (423) T ss_pred cccccc-----cccc-eeeccccccccccccccccceeeeeeeee----------ccCCcEEecce Confidence 654442 1222 1100000000000000 0111 11 11111110001 No 147 >protein:vir:108303 Length: 418 # NCBI annotation: hypothetical protein # Family: family:all:1412 # MgeID: mge:2007 # MgeName: BA3 # Cross-refs: genbank:acc:YP_001552282;genbank:gi:160700607;genbank:GeneID:5758819 Probab=44.65 E-value=0.8 Score=21.12 Aligned_cols=250 Identities=14% Similarity=0.067 Sum_probs=120.6 Q ss_pred HHHHHHHhcCchhHHHHHHHHHHHHHHHHHhhhh-hhhhhhhccccCCCcceeecCCCCccceEEEEcCCCcccceeecC Q lcl|NC_011269. 43 QAKLAHILSDKVGGIQRLGQSMIGPIQLQLRYQG-ILRNVLLEDTLTPGVPIQYDVLDDLGQAYMLHGNEGEIRITPFEG 121 (333) Q Consensus 43 ~~Lm~~Al~~~Eg~~~aLg~~mA~pI~~q~~rqG-i~RklL~~~TL~~G~~p~y~v~~~v~~a~~~~~~~G~i~~Q~i~~ 121 (333) -++.+..|-.++- .++.+...++.++-.-. +-|++.-+. -..|.....++|.++..+= + ..+..|.+-. 
T Consensus 1 m~~~~N~~ltp~i----ia~~~l~~l~~~lV~~~lv~r~y~~e~-~~~GDTV~I~vp~~~~v~d---g--~~~~~~~~te 70 (418) T protein:vir:10 1 MAVQDNNLLTDDV----IAKEALRLLKNNLVMAKCVYRNYEKTF-GKVGDTIRLKLPYRVKSAS---G--RTLVKQPMVD 70 (418) T ss_pred CCccccccccHHH----HHHHHHHHHHHhccchhhhcCCCchHH-hhCCCEEEEeeCCceeecc---c--CCcccccccc Confidence 1222223323332 33444444554442110 111111111 1234444555555552221 1 1255566655 Q ss_pred ceeecc-ceeeeccccccHHHhhhhcchhHHHHHHHHHHHHHHHhhhHHHHHHhhhhhhhhhhcccccccccCCCcceEE Q lcl|NC_011269. 122 KRIEVQ-LFRIASFPQIKKEDLYYLRSNIVEYTQDMTKQAIMRQEDSRLVTLLEAAAVSYRVVDSSAQPGVGALPNEITI 200 (333) Q Consensus 122 ~ri~~P-~f~Ivs~P~V~~~dl~~~~~~vle~~q~~A~qaIM~qED~~~~slle~~a~~~r~~~ssA~p~vg~~~N~i~i 200 (333) .-+++. .-.....-.|+-.|..+...++.++.-..|..++-.+-|.-+.+++.++.. +..+.|.-+| T Consensus 71 ~~v~l~id~~k~~~~~itD~e~a~~~~d~~~~~l~~A~~aLA~~vD~~ia~l~~~a~~--------~~gt~gt~~~---- 138 (418) T protein:vir:10 71 QTIPFKIAYQEHVGLEYTVKDKTLDIMQFSERYLKSGMVQIANQIDRSLALTLKKAFH--------SSGTPGVRPG---- 138 (418) T ss_pred ceEEEEEecccccceeechHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHhhccc--------ccccCCcCcc---- Confidence 666555 233344455666677799999999999999999999999999998876652 1222222222 Q ss_pred eeccccHHHHHHHHHHHHhhCCcc---ceEEechhhhhhhhhcCCCchhhhHHhhhhhcceeeeeec------cccccee Q lcl|NC_011269. 201 AGSHLMPDDLYTAVTYTDQRQLDS---SRLLANPQEYRDLYRWDINTTGWAFKDSVVAGERIVQFGE------FQIGKSI 271 (333) Q Consensus 201 ~~g~Lt~~~L~~a~t~v~~~~L~a---t~il~~~~~~~Di~gw~~N~~~~~~~DpV~~~e~il~~G~------fgi~~sk 271 (333) .-+++-.|.+..++.+.|. -++|++|+.|.+|.+ ...|.+ ++-...+ .+..|. 
|.|.+|- T Consensus 139 -----~~~~i~~a~~~Ld~~~VP~~G~R~lVv~P~~~~~L~~--~~~~~~---~~~~~~~-~lr~G~IG~i~GF~V~~S~ 207 (418) T protein:vir:10 139 -----AFIDFANAGAKQTTYAVPQDGMRHAVLDPFTCASLSD--EVTKLF---KESMVEQ-AYKMGYRGNVAAYEVYESQ 207 (418) T ss_pred -----hHHHHHHHHHHHHhcCCCCCCceEEEeCHHHHHHHhh--hccccc---cccccch-hhheeeeeeeeceEEEEec Confidence 2467889999999999985 368999999999987 222321 2211112 233344 4477888 Q ss_pred eecCCeE-------EEeeChhhhcccccccCceeccccchhhhccceehhhhhhhhhh------------ccceEEEEec Q lcl|NC_011269. 272 IIPRGTV-------YLTPEPEFLGVFPVMYSLDVEEDNKVERFNKGWVMDELVGMAIL------------NPRGIVILRK 332 (333) Q Consensus 272 vlprgei-------yvvadpE~~G~~pvR~~L~s~p~D~~er~~kGWvm~E~~g~~i~------------N~~siv~~~~ 332 (333) .+|..+- .|-.--.......+.++-.+... .+++| +-|.++=. .++-.++..- T Consensus 208 nip~~tag~~~~t~~v~ga~~~~~~~~~~~~t~s~~g----~l~~G----d~~ti~gv~~v~~~t~~~~~~~~~f~V~~~ 279 (418) T protein:vir:10 208 NLPKHTVGDHGGTPLVNGTVVNGDTVGFDGGTASTTG----FLKAG----DVITFGGVFGVNPQNYETTGLLQEFVVLED 279 (418) T ss_pred CCCcccccccccceeeecccccceeEEEeecceeecc----ceeec----cEEEECceeecccccccccccceEEEEEee Confidence 8886432 22111111111111111000000 01111 11112111 1222333221 Q ss_pred C Q lcl|NC_011269. 333 A 333 (333) Q Consensus 333 ~ 333 (333) + T Consensus 280 ~ 280 (418) T protein:vir:10 280 V 280 (418) T ss_pred c Confidence 1 No 148 >protein:vir:8843 Length: 317 # NCBI annotation: major head protein # Family: family:all:3919 # MgeID: mge:158 # MgeName: PaP3 # Cross-refs: genbank:acc:NP_775251;genbank:gi:27476049;genbank:GeneID:2700597 Probab=36.97 E-value=0.67 Score=21.54 Aligned_cols=263 Identities=12% Similarity=0.027 Sum_probs=125.1 Q ss_pred HHHHHHHHHhhhhhhhhhhhccccCCCcc---eeecCCC--Cccce-EEEEc----CCCcccceeecC---------cee Q lcl|NC_011269. 64 MIGPIQLQLRYQGILRNVLLEDTLTPGVP---IQYDVLD--DLGQA-YMLHG----NEGEIRITPFEG---------KRI 124 (333) Q Consensus 64 mA~pI~~q~~rqGi~RklL~~~TL~~G~~---p~y~v~~--~v~~a-~~~~~----~~G~i~~Q~i~~---------~ri 124 (333) ||.|=-----|+..+-+.-+.+-+-.=.+ |-+...- +.++. +.|.. .+. ...|. || .|. 
T Consensus 1 ma~~~~~~~t~~~~g~~~dl~~~I~~isp~dTPf~S~i~~~~a~~~~~~W~~d~l~~~~-~~~~~-EG~da~~~~~~~r~ 78 (317) T protein:vir:88 1 MATPTNAVSTVEINGKREDLIDIIYNIAPYDTPFMSAIGKGVATAITHEWQTDELRQPG-KNTRV-EGEDATIKAGSFTT 78 (317) T ss_pred CCccccceEeeeeeeeeechhhhheecCCccCcceeeecCceecccEEEEEeeecCCcc-ccccc-cCcccccccccCCE Confidence 66664322224444444333322211111 1111100 00000 00110 000 01111 11 122 Q ss_pred ecc-ceeeeccccccHHHh-----hhhcchhHHHHHHHHHHHHHHHhhhHHHHHHhhhh----hhhhh-------hcccc Q lcl|NC_011269. 125 EVQ-LFRIASFPQIKKEDL-----YYLRSNIVEYTQDMTKQAIMRQEDSRLVTLLEAAA----VSYRV-------VDSSA 187 (333) Q Consensus 125 ~~P-~f~Ivs~P~V~~~dl-----~~~~~~vle~~q~~A~qaIM~qED~~~~slle~~a----~~~r~-------~~ssA 187 (333) .+. --||+. --+.+|.- .++++|.+.|--..+.+.|-+.+..-+++--.+.+ +.-|. +++.. T Consensus 79 ~~~N~tQIf~-k~v~VSgTa~av~~~G~~~ela~q~~kk~~EikrdmE~~li~g~~a~~~~~~t~~r~~~Gl~~~i~t~~ 157 (317) T protein:vir:88 79 MLNNYCQISD-ETLQVTGTADRVKKAGRKNELAYQLAKKSKELKLDMEYALVGAPQAKVQRNTTTPGQMANIFAYYKTNG 157 (317) T ss_pred EeccEEEEEE-eEEEEeehhhhhhhcCccchhHHHHHHHHHHHHHHHHHHHhcCeeeccCCCCccchhhhhHHHHhccCc Confidence 222 112211 11111111 23567877777777777777777765555443321 11111 11110 Q ss_pred c---ccc-cCCCcceE---EeeccccHHHHHHHHHHHHhhCCccceEEechhh---hhhhhhcCCCchhhhHHh-h---h Q lcl|NC_011269. 188 Q---PGV-GALPNEIT---IAGSHLMPDDLYTAVTYTDQRQLDSSRLLANPQE---YRDLYRWDINTTGWAFKD-S---V 253 (333) Q Consensus 188 ~---p~v-g~~~N~i~---i~~g~Lt~~~L~~a~t~v~~~~L~at~il~~~~~---~~Di~gw~~N~~~~~~~D-p---V 253 (333) . +|. .+.+++-. -++..||+++|.++.+-+=+-|.....+++++.. +++++. +.++.-..+-+ - . 
T Consensus 158 ~~~~~g~~~~~~~~~~~t~~t~~~lte~~l~~~l~~i~~~Gg~~~~i~v~a~~k~~i~~~~~-~~~~~i~~~~~~~~~g~ 236 (317) T protein:vir:88 158 SLGANGVAPVGDGSNTGTAGDLRLLTEDMLLNASESIWRNGGQANSIQTSSSIKKAISKNMK-GRATEITLDASDNRIAQ 236 (317) T ss_pred eeccCccccccCCCccccccccccccHHHHHHHHHHHHhcCCCCCEEEeChHHHHHHHHHhc-CCceeEEEcccCeEEEE Confidence 0 000 01111111 2333599999999999999999999999998864 444432 10100000000 0 0 Q ss_pred hhcceeeeeecccccceeeecCCeEEEeeChhhhcccccccCceeccccchhhhccceehhhhhhhhhhccceEEEEecC Q lcl|NC_011269. 254 VAGERIVQFGEFQIGKSIIIPRGTVYLTPEPEFLGVFPVMYSLDVEEDNKVERFNKGWVMDELVGMAILNPRGIVILRKA 333 (333) Q Consensus 254 ~~~e~il~~G~fgi~~skvlprgeiyvvadpE~~G~~pvR~~L~s~p~D~~er~~kGWvm~E~~g~~i~N~~siv~~~~~ 333 (333) .--..+=+||...|..+-.||-|++|++ ||+.+..-+-| .++.|+.-| .--...|+++-++++-+-||-+.-++.-. T Consensus 237 ~v~~~~tdfG~v~ii~~r~lp~~~~~~~-D~~~~~l~~Lr-~~~~e~laK-tGd~~k~~i~~E~tLe~~N~~a~a~i~~l 313 (317) T protein:vir:88 237 TVDVYESDFGKYTIRANRWFHENTLFVF-DPKMHSLCYLR-PFFQHELAK-TGDSEKRQLLVEYTFRVNNEKSGALIRDV 313 (317) T ss_pred EEEEEEeCCeEEEEEeCCCCCCCeEEEE-cccccceeecc-cceeeccCC-CcccceeEEEEEEEEEEcCccceeEEEEe Confidence 0001223466666777778888888876 66656655553 444444332 22466799999999999999998888766 No 149 >protein:vir:103841 Length: 155 # NCBI annotation: virion morphogenesis protein # Family: family:all:274 # MgeID: mge:1522 # MgeName: D3112 # Cross-refs: genbank:acc:NP_938236;genbank:gi:38229141;genbank:GeneID:2648156 Probab=31.19 E-value=1.5 Score=19.59 Aligned_cols=127 Identities=13% Similarity=0.121 Sum_probs=65.2 Q ss_pred hcch-hcc---hHHHHH--HHHHHhcCchhHHHHHHHHHHHHHHHHHhhhhhh-------------------hhhhhccc Q lcl|NC_011269. 32 MGGR-KLS---AREKQA--KLAHILSDKVGGIQRLGQSMIGPIQLQLRYQGIL-------------------RNVLLEDT 86 (333) Q Consensus 32 ~~~~-~ls---~ee~~~--Lm~~Al~~~Eg~~~aLg~~mA~pI~~q~~rqGi~-------------------RklL~~~T 86 (333) |+.+ .++ .+-.++ -+...+.+...-|+.+|+.|...+++-.+-+|-. .+.|.. 
T Consensus 1 Ms~~i~i~~~~~~~~~~L~~l~~~~~~~~~l~~~ig~~l~~~~~~rF~p~G~~W~plsp~t~~~r~k~g~~~~~~L~~-- 78 (155) T protein:vir:10 1 MANRIELELVDREVQERLAALYAAVTDTLPLMRGIAAELLAETEFAFMDEGPGWPQLSPVTVAARAAKGRGAHPILQV-- 78 (155) T ss_pred CCceEEEEechHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHhhcCCCCCCCCccchHHHHhccCCCCCcccc-- Confidence 3311 000 001111 2444455555556667777777777666655421 112211 Q ss_pred cCCCcc---eeecCCCCccceEEEEcCCCcccceeecCceeeccceeeeccccccHHHhhhhcchhHHHHHHHHHHHHHH Q lcl|NC_011269. 87 LTPGVP---IQYDVLDDLGQAYMLHGNEGEIRITPFEGKRIEVQLFRIASFPQIKKEDLYYLRSNIVEYTQDMTKQAIMR 163 (333) Q Consensus 87 L~~G~~---p~y~v~~~v~~a~~~~~~~G~i~~Q~i~~~ri~~P~f~Ivs~P~V~~~dl~~~~~~vle~~q~~A~qaIM~ 163 (333) .|.. ..|..-++ .+.+-|+.--..++.++++--..+...|-++|++-+++-.+....+.+.+.+-..+.+-+ T Consensus 79 --tG~L~~Si~~~~~~~---~v~vGtn~~YA~iHqfGg~~~~~~~~~iPARPfLG~s~~~e~~~ei~~~I~~~i~~~l~~ 153 (155) T protein:vir:10 79 --TNALARSITTRADRD---QAQIGSNLSYAAIQQLGGQAGRGRKVTIPARPYLPVLRNGQLKPSARDAVLDVLLAALSQ 153 (155) T ss_pred --chhhhhhhhceecCC---EEEEecCcchhhhhhcccccCCCCccccCCccccCCCccccchHHHHHHHHHHHHHHHhh Confidence 2222 23443222 345567777788899998766667788889999987654443334444443333333322 Q ss_pred Hh Q lcl|NC_011269. 164 QE 165 (333) Q Consensus 164 qE 165 (333) .- T Consensus 154 ~r 155 (155) T protein:vir:10 154 GR 155 (155) T ss_pred cC Confidence 22 No 150 >protein:vir:1781 Length: 221 # NCBI annotation: minor capsid protein # Family: family:all:975 # MgeID: mge:38 # MgeName: P60 # Cross-refs: genbank:acc:NP_570347;genbank:gi:18640506;genbank:GeneID:932719 Probab=28.10 E-value=1.8 Score=19.21 Aligned_cols=178 Identities=19% Similarity=0.205 Sum_probs=94.8 Q ss_pred ecCceeeccceeeeccccccHHHhhhhcchhHHHHHHHHHHHHHHHhhhHHHHHHhhhhhhhhhhcccccccccCCCc-c Q lcl|NC_011269. 119 FEGKRIEVQLFRIASFPQIKKEDLYYLRSNIVEYTQDMTKQAIMRQEDSRLVTLLEAAAVSYRVVDSSAQPGVGALPN-E 197 (333) Q Consensus 119 i~~~ri~~P~f~Ivs~P~V~~~dl~~~~~~vle~~q~~A~qaIM~qED~~~~slle~~a~~~r~~~ssA~p~vg~~~N-~ 197 (333) |. 
+ -++|.-+|+--|=.|.+.|+..+.-+++-+|+-.+-|..+..++-.+| +++.|..+...- . T Consensus 1 iD-------~-lL~a~~~VdDiD~aqa~~dvr~e~t~e~G~ALA~~~D~~i~~~~~~aA-------~~~~p~~~~~~g~~ 65 (221) T protein:vir:17 1 MD-------D-LLVASQFVYDLDEILAQWNTRSEISKQIGEALAIHYDERIARVLASAS-------IAAAPVTGQDGGFS 65 (221) T ss_pred CC-------c-chhHHHHHHhHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhh-------hhcCcccccccCcc Confidence 11 1 256888899999999999999999999999999999999888775444 222222211000 0 Q ss_pred e-EEeeccccHHHH----HHHHHHHHhhCCccc-e-EEechhhhhhhhhcCCCchh----hhHHhh-hhhcceeeeeecc Q lcl|NC_011269. 198 I-TIAGSHLMPDDL----YTAVTYTDQRQLDSS-R-LLANPQEYRDLYRWDINTTG----WAFKDS-VVAGERIVQFGEF 265 (333) Q Consensus 198 i-~i~~g~Lt~~~L----~~a~t~v~~~~L~at-~-il~~~~~~~Di~gw~~N~~~----~~~~Dp-V~~~e~il~~G~f 265 (333) . ...+...++.+| ..|.+..++.+.|.. + ++.+|+.|..|.-= .+.+. +..-+- +..+..+.+.-=| T Consensus 66 ~~~~a~~t~~~~~l~dai~~a~~~LdekdVP~~gR~~vv~P~~y~~LL~~-~d~~~~n~d~~~s~g~~~~g~~i~~v~G~ 144 (221) T protein:vir:17 66 VNIGAGNTNNAQAIVDGFFEAAAVLDERSAPMDGRVAVLSPRQYYSLISS-VDTNILNREIGNTQGDMNTGKGLYVNAGI 144 (221) T ss_pred eeccccccCCHHHHHHHHHHHHHHHhhcCCCCCCCEEEeCcHHHHHHHHh-cCcceeeeecccccccccccceeeeecCc Confidence 0 011233444444 557778889998843 3 67799999999841 11110 000011 1111112222225 Q ss_pred cccceeeecC--CeEEEeeChhhhcccccccCceeccccchhhhccceehhhhhhhhhhccceEEEEecC Q lcl|NC_011269. 266 QIGKSIIIPR--GTVYLTPEPEFLGVFPVMYSLDVEEDNKVERFNKGWVMDELVGMAILNPRGIVILRKA 333 (333) Q Consensus 266 gi~~skvlpr--geiyvvadpE~~G~~pvR~~L~s~p~D~~er~~kGWvm~E~~g~~i~N~~siv~~~~~ 333 (333) .|.+|-.+|. |+=++ .+.|.+.+ +.+... +.|. 
...|.-++|.-|.| T Consensus 145 ~V~~SnnlP~~~gt~~~----~~ag~~~~----~~~~~~-~yr~------------~fs~~~glv~~~~A 193 (221) T protein:vir:17 145 RIYKSNVLASLYGTNLV----TDPGDATT----SGENNG-SYRP------------AITDRAGLVFHKEA 193 (221) T ss_pred EEEEeccCCcccccccc----cCCccccc----cccccc-cccc------------cccceEEEEEcchh Confidence 6778888886 32121 23333321 111111 1110 02233455555555 No 151 >protein:vir:80068 Length: 301 # NCBI annotation: gp8 # Family: family:all:463 # MgeID: mge:1876 # MgeName: B054 # Cross-refs: genbank:acc:YP_001468712;genbank:gi:157325292;genbank:GeneID:5601759 Probab=25.64 E-value=2 Score=18.89 Aligned_cols=275 Identities=13% Similarity=0.099 Sum_probs=143.8 Q ss_pred HhcCchhH-HHHHHHHHHHHHHHHHhhhhhhhhhh-hccccCCCc-ceeecCCCCccceEEEEcCCCcccceeecCceee Q lcl|NC_011269. 49 ILSDKVGG-IQRLGQSMIGPIQLQLRYQGILRNVL-LEDTLTPGV-PIQYDVLDDLGQAYMLHGNEGEIRITPFEGKRIE 125 (333) Q Consensus 49 Al~~~Eg~-~~aLg~~mA~pI~~q~~rqGi~RklL-~~~TL~~G~-~p~y~v~~~v~~a~~~~~~~G~i~~Q~i~~~ri~ 125 (333) .-+|..|- -...-+.|=..|++.+...=.+|+++ +...++.|. .-.|++-..++++-.......+++-.....+|-. T Consensus 1 ~~~~~~g~f~~~~l~~id~~v~e~~~~~l~~r~l~~v~~~~~~~~~~~~~~~~~~~G~~~~~~~~~~dip~~~~~~~~~~ 80 (301) T protein:vir:80 1 MQGKITATIEARDLQAIDNVIYEPKQEELTARSVFPQKFDVNEGAESYSFDVMTRSGAAKIIANGADDLPLVDVDMVRKS 80 (301) T ss_pred CCccccchhhHHHHHHHHHHHHHhhhhhhhhhhhcccccCCCCceEEEEEeeeccceeEEEecCcccccccccccceeEE Confidence 33444442 11222556666777777777788876 555677764 3567777777766533333444666566667888 Q ss_pred ccceeeeccccccHHHhhhh---cchhHHHHHHHHHHHHHHHhhhHHHHHHhhhhhhhhhhcccccccccCCCcceEEee Q lcl|NC_011269. 126 VQLFRIASFPQIKKEDLYYL---RSNIVEYTQDMTKQAIMRQEDSRLVTLLEAAAVSYRVVDSSAQPGVGALPNEITIAG 202 (333) Q Consensus 126 ~P~f~Ivs~P~V~~~dl~~~---~~~vle~~q~~A~qaIM~qED~~~~slle~~a~~~r~~~ssA~p~vg~~~N~i~i~~ 202 (333) .|.+++-.-=.+...||+.. +.++-.+++..|..++-+.||+-.|.=. 
+ ...+-----.|++....++.+-.+ T Consensus 81 ~~i~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aa~~~~~~~~n~~~f~G~---~-~~g~~GLlN~p~~~~~~~~~~~~~ 156 (301) T protein:vir:80 81 VPIYSIGIGLSYTIQDLRAARMQGTTVDAAKATTVRRAIAEKENSIAFRGE---K-KYAIKGAFEATGIQIDVSPTTGVG 156 (301) T ss_pred EEEEEEEeeeeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEeeec---c-cccceeeecCCCcccccccCcccc Confidence 88888888778888888765 7889999999999999999996554211 1 000000001122111111111111 Q ss_pred cc-----cc----HHHHHHHHHHHHh---hCCccceEEechhhhhhhhhc-CCCchhhhHHhhhhhcceeeeeeccccc- Q lcl|NC_011269. 203 SH-----LM----PDDLYTAVTYTDQ---RQLDSSRLLANPQEYRDLYRW-DINTTGWAFKDSVVAGERIVQFGEFQIG- 268 (333) Q Consensus 203 g~-----Lt----~~~L~~a~t~v~~---~~L~at~il~~~~~~~Di~gw-~~N~~~~~~~DpV~~~e~il~~G~fgi~- 268 (333) +. -| -++++++++.+.. ..-...+++++++.|.+|-.= .++.++-+-.+=+.+.-.-++ ..+++ T Consensus 157 ~~~~w~~~t~~ei~~di~~~~~~l~~~s~g~~~p~~L~L~p~~~~~L~~~~~~~~~~~tvl~~l~~~~~~~~--I~~~p~ 234 (301) T protein:vir:80 157 NVSKWEKKTAEQIIDEIGEAHTKITVLPGYGTASLKLCLPPKQFELINKKRYSNEDSRSVLKVLQDNAWFSA--IVRVPD 234 (301) T ss_pred cccccccCCHHHHHHHHHHHHHHHHHhcCceecccEEEecHHHHHhhhhccccCCCCeeHHHHHHHHcCcce--EEEcce Confidence 11 12 3577888877654 233568999999999999421 112222222222222111110 01111 Q ss_pred -ceeee-cCCe-EEEeeChhhhcccccccCceeccccchhhhccceeh---hhhhhhhhhccceEEEEecC Q lcl|NC_011269. 269 -KSIII-PRGT-VYLTPEPEFLGVFPVMYSLDVEEDNKVERFNKGWVM---DELVGMAILNPRGIVILRKA 333 (333) Q Consensus 269 -~skvl-prge-iyvvadpE~~G~~pvR~~L~s~p~D~~er~~kGWvm---~E~~g~~i~N~~siv~~~~~ 333 (333) +.+-. --+- +.+..+|+ ...+.+=..++..|+ +.-+..|.. 
....|..|-.|.+|+.+.-- T Consensus 235 L~~~g~~g~~~~v~~~~~~d-~~~~~v~~~~~~~~~---e~~~~~~~~~~~~r~~Gv~i~~P~ai~~~~GI 301 (301) T protein:vir:80 235 LAGMGTAGSDSFAVIHDSNE-TAELIIPMDITRHPE---EYSFPRTKVPFEERTAGVVVRFPAAIVRVDGI 301 (301) T ss_pred eccCCCCcccEEEEEecCCc-EEEEEecCceeeecc---eecCceeEeeeeeeeEEEEEEccceEEEEecC Confidence 00000 0011 33333565 333333333444332 233344443 23346667888888877666 No 152 >protein:vir:4159 Length: 315 # NCBI annotation: structural protein # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:87 # MgeName: psiM2 # Cross-refs: genbank:acc:NP_046968;genbank:gi:9630538;genbank:GeneID:1261712 Probab=24.69 E-value=2.1 Score=18.77 Aligned_cols=286 Identities=11% Similarity=-0.030 Sum_probs=124.8 Q ss_pred CcccchhhhhhhhhhcccchHHHHHHHHHHhhcchhcchHHHHHHHHHHhcCchhHHHHHHHHHHHHHHHHHhhhhhhhh Q lcl|NC_011269. 1 MTLPVAVGSGLGRFAKASDDYVADIVEAKQRMGGRKLSAREKQAKLAHILSDKVGGIQRLGQSMIGPIQLQLRYQGILRN 80 (333) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ls~ee~~~Lm~~Al~~~Eg~~~aLg~~mA~pI~~q~~rqGi~Rk 80 (333) |..+=-|= +|++.... ..+-- .-.||--|-.++-+++++.+.+++. .|+ T Consensus 1 ~~~~~~~~--~~~~~~~~----k~~t~--~d~~Gg~l~P~~~~~~i~~~~e~s~-----------------------~l~ 49 (315) T protein:vir:41 1 MLTIEDIR--GGKPFEIV----PKIDV--PDLGRGVLSVDRFGEFVKAVRDSAV-----------------------IIP 49 (315) T ss_pred Ccccchhh--cCChhhhh----hhcCC--cCCCCceechHHHHHHHHHHHhhhh-----------------------hhh Confidence 21111111 11111000 00000 0113333444554555544444322 222 Q ss_pred hhhcccc---CCCcceeecCCCCccceEEEEcCCCcccceeecCceeeccceeeeccccccHHHhhhhcc--hhHHHHHH Q lcl|NC_011269. 81 VLLEDTL---TPGVPIQYDVLDDLGQAYMLHGNEGEIRITPFEGKRIEVQLFRIASFPQIKKEDLYYLRS--NIVEYTQD 155 (333) Q Consensus 81 lL~~~TL---~~G~~p~y~v~~~v~~a~~~~~~~G~i~~Q~i~~~ri~~P~f~Ivs~P~V~~~dl~~~~~--~vle~~q~ 155 (333) +-...+. .+++++.-.....+...+-|.+..+....+...=+.++++..++.+++.|.-+-|..... 
|+-.+.-+ T Consensus 50 ~~~vi~~~~~~~~~i~~~g~~~~~~~g~~~~~~~~~~~~~~~~f~~~~l~~~~l~~~~~it~elL~D~~~~~~~e~~l~~ 129 (315) T protein:vir:41 50 EARIDNALKSYEKDISRLSLVLDVGPGRDETGQKLAPPESTAEVKTNTLYMREMVTKVVIHEDAIEDNIEGKAFEQKIVT 129 (315) T ss_pred hceeeeccccccccccccccCcccccccccccCcCCCCCCccccceeeeceeeeeeeccccHHHHHhhhccccHHHHHHH Confidence 2221111 111111110001111222234443333333223356788888899999998888877654 77777777 Q ss_pred HHHHHHHHHhhhHHHHHHhhhhhhhhhhcccccccc----cC--------CCcceEEeeccccHHHHHHHHHHHHhhCC- Q lcl|NC_011269. 156 MTKQAIMRQEDSRLVTLLEAAAVSYRVVDSSAQPGV----GA--------LPNEITIAGSHLMPDDLYTAVTYTDQRQL- 222 (333) Q Consensus 156 ~A~qaIM~qED~~~~slle~~a~~~r~~~ssA~p~v----g~--------~~N~i~i~~g~Lt~~~L~~a~t~v~~~~L- 222 (333) +.++++=+.|+.-.++== - +|++|.+ |- ..+.++-.++.++.+.|..++..+...=. T Consensus 130 ~~a~~~a~~~~~~~~nGd---g-------~s~~p~~~~~~G~l~~a~~~~~~~~~~~~a~~~~~d~l~~l~~sl~~~yr~ 199 (315) T protein:vir:41 130 LLGEGISYVLEKYYLHGD---T-------SSSDPLLRMSDGWLKLASEKLTESDVDPEAEDWPMNLFDTMIESLPTPYRN 199 (315) T ss_pred HHHHHHHHHHHHHhhccC---C-------cCcCccccccccceecccccccccccccccccccHHHHHHHHHhcChHHhh Confidence 777777777764333221 0 1233321 11 11223333445556666666665544222 Q ss_pred --ccceEEechhhhhhhhhcCCCchhhhHHhhhhhcceeeeeecccccc--eeee-----cCCeEEEeeChhhhcccccc Q lcl|NC_011269. 223 --DSSRLLANPQEYRDLYRWDINTTGWAFKDSVVAGERIVQFGEFQIGK--SIII-----PRGTVYLTPEPEFLGVFPVM 293 (333) Q Consensus 223 --~at~il~~~~~~~Di~gw~~N~~~~~~~DpV~~~e~il~~G~fgi~~--skvl-----prgeiyvvadpE~~G~~pvR 293 (333) +--..+||...+..++-.-.+.-.|+--..+..+.-. -++|.+. .--| |.+. ++..|+.|. 
++-+| T Consensus 200 ~~~~~~~imn~~t~~~~rklk~~~g~~lw~~~~~~g~~~---tl~G~PV~~~~~m~~~~~~~~~-ilf~d~~nl-~~~~~ 274 (315) T protein:vir:41 200 NLPNMKFYVTWDIYRAYRDALKGRETGLGDQALTGANSI---LYDGRPVQYVPALEALNDGKSR-ALFVVPTQL-VYGFW 274 (315) T ss_pred cCCceEEEEcHHHHHHHHHHhccCCCccccchhhcCCCc---eecccceEecccccccCCCCcc-EEEecccce-EEEec Confidence 2335899999988776543332222221222222211 1244442 1112 4554 556678875 77888 Q ss_pred cCceeccccchhhhccceehhhhhhh--hhhccceEEEEecC Q lcl|NC_011269. 294 YSLDVEEDNKVERFNKGWVMDELVGM--AILNPRGIVILRKA 333 (333) Q Consensus 294 ~~L~s~p~D~~er~~kGWvm~E~~g~--~i~N~~siv~~~~~ 333 (333) .+++.++.-..+.-...++..--++- ++.| -.+|-+-|- T Consensus 275 ~~i~i~~~~~a~~~~~~~~~~~r~d~~~~~~~-~~a~~~~~v 315 (315) T protein:vir:41 275 RNIKVVPDYDAEMRLTKYVASLRTDNHYEDEE-GAVSATITV 315 (315) T ss_pred cccEEEeeecCCCCceEEEEEEEeceeEEecc-ceeEeeeeC Confidence 99988876554432223332222222 2223 222222233 No 153 >protein:vir:79928 Length: 393 # NCBI annotation: major head protein # Family: family:all:30335 # MgeID: mge:1874 # MgeName: 0305phi8-36 # Cross-refs: genbank:acc:YP_001429616;genbank:gi:156564106;genbank:GeneID:5525693 Probab=20.99 E-value=2.7 Score=18.24 Aligned_cols=315 Identities=16% Similarity=0.215 Sum_probs=169.6 Q ss_pred Cc--ccchhhhhhhh-----------------h-hc------ccchHHHHHHHHHHh-hcchhcchHHHHHHHHHHhcCc Q lcl|NC_011269. 1 MT--LPVAVGSGLGR-----------------F-AK------ASDDYVADIVEAKQR-MGGRKLSAREKQAKLAHILSDK 53 (333) Q Consensus 1 ~~--~~~~~~~~~~~-----------------~-~~------~~~~~~~~~~~~~~~-~~~~~ls~ee~~~Lm~~Al~~~ 53 (333) |- |--.--||.-. - +. +-|.---.|+|.--. 
|-|.--.+| --++++|.++ T Consensus 1 ~~~~~~~~~~~~~~~~~~~e~k~lr~~me~~et~~e~~~~~~~~~~~e~el~E~f~Kmm~G~~p~~e---V~~~e~mtt~ 77 (393) T protein:vir:79 1 MENWLKQLKESGFTETQVQEQKSLRTRMERGETLAEADANKLALNEEETQILESFAKMMEGETPTNE---VNLREFMATP 77 (393) T ss_pred CchHHHHHHhccCchhHHHHHHHHHHHhhhhhhhhhhhhhhhhcchhHHHHHHHHHHHhcCCCchhh---eehhhhhcCC Confidence 10 00001111100 0 00 011111223333222 223322222 3356777777 Q ss_pred hhHHHHHHHHHHHHHHHHHhhhhhhhhhhhccccCCCcceeecCCCCccceEEEEcCCCcccceeecCceeeccceeee- Q lcl|NC_011269. 54 VGGIQRLGQSMIGPIQLQLRYQGILRNVLLEDTLTPGVPIQYDVLDDLGQAYMLHGNEGEIRITPFEGKRIEVQLFRIA- 132 (333) Q Consensus 54 Eg~~~aLg~~mA~pI~~q~~rqGi~RklL~~~TL~~G~~p~y~v~~~v~~a~~~~~~~G~i~~Q~i~~~ri~~P~f~Iv- 132 (333) .+-| -.-..|.+-|.+.+.---++.|++-|.+|.-|.---|+----. -++=++..|+++.+.++...-..|+.+.= T Consensus 78 ~a~I-liP~vis~v~~Eaaepl~~~~kl~qk~~L~~Grsm~F~~~g~~--Ra~~IgEGgE~~~~sld~~T~dsv~~~~gK 154 (393) T protein:vir:79 78 SAQI-LIPRVIVGTMREAAEPLYIGTKMLQKIRLKSGQSMIFPSIGIM--RAYDVAEGQEIPEDSIDWQTHESPEIRVGK 154 (393) T ss_pred Ccce-echhhhhhhhhhcccchhHHHHHHHHHhhhcCcceeccchhee--eeccccccccccccchhhhcCCceeEEech Confidence 7754 4455566667776666778999999999999987777632211 22235555666666665333333333221 Q ss_pred ccccccHHH--hhhhcchhHHHHHHHHHHHHHHHhhhHHHHHHhhhhh-hhhhhcccccc-ccc-CCCcceEEeeccccH Q lcl|NC_011269. 133 SFPQIKKED--LYYLRSNIVEYTQDMTKQAIMRQEDSRLVTLLEAAAV-SYRVVDSSAQP-GVG-ALPNEITIAGSHLMP 207 (333) Q Consensus 133 s~P~V~~~d--l~~~~~~vle~~q~~A~qaIM~qED~~~~slle~~a~-~~r~~~ssA~p-~vg-~~~N~i~i~~g~Lt~ 207 (333) +--+|.++| ++.+-.|++-..---|..+.-+--|-.-|+.+++-+. +|--.-++... 
.+| ..+| .-.|-|.- T Consensus 155 ~G~~Ia~SqEmIsDSg~Dvin~~l~aA~RaMaRkKee~a~n~fk~~ghtvfDa~st~t~ahptGr~~~~---~qNGTlSl 231 (393) T protein:vir:79 155 SGIRLRFTDEMISDSQWDLMSMMIKQAGRAMGRHKEQKAYHQFRSHGHTVFDNYSTNKLAHTTGLDKNG---VQNDTFSA 231 (393) T ss_pred hhhhhhhHHHHhhcchHHHHHHHHHHHHHHHHhhhHHHHHhhhhcccceeeeccccCccceeecCCccc---cccccccH Confidence 112333333 3444556666666666666666677667777766553 11111111111 112 1212 33477888 Q ss_pred HHHHHHHHHHHhhCCccceEEechhhhhhhhhcCCCchhhhHHhhhhhcceeeeeec---------------------cc Q lcl|NC_011269. 208 DDLYTAVTYTDQRQLDSSRLLANPQEYRDLYRWDINTTGWAFKDSVVAGERIVQFGE---------------------FQ 266 (333) Q Consensus 208 ~~L~~a~t~v~~~~L~at~il~~~~~~~Di~gw~~N~~~~~~~DpV~~~e~il~~G~---------------------fg 266 (333) ++|-...=.+---++-.+.++|.|=.|+=++- |. ..-.+-+.- .-|+|- |- T Consensus 232 eDllDm~~av~~~hyt~svi~MHPLAWnv~AK---na----~me~~~~na-~gN~~~~~~~ts~algp~~i~~~~~~nln 303 (393) T protein:vir:79 232 EDFLDLIIAVMANEYTPSDLMMHPLAWTVFAK---NE----LMGSLQANP-YGNYPAKGAPSSMALGPDSIQGRLPFNFN 303 (393) T ss_pred HHHHHHHHHHhcccCCcceEEEcCchhhhhhh---hh----hhcceeecc-ccccCccccchhhhhchhhhcccccccee Confidence 99988888888889999999999998886654 21 000000000 002222 11 Q ss_pred ccceeeecCC------eEEEeeChhhhcccccccCceeccccchhhhccceehhhhhhhhhhcc-ceEEEEecC Q lcl|NC_011269. 
267 IGKSIIIPRG------TVYLTPEPEFLGVFPVMYSLDVEEDNKVERFNKGWVMDELVGMAILNP-RGIVILRKA 333 (333) Q Consensus 267 i~~skvlprg------eiyvvadpE~~G~~pvR~~L~s~p~D~~er~~kGWvm~E~~g~~i~N~-~siv~~~~~ 333 (333) |-.|--+|-+ .+|.| |..|+|++=+|-+|.++-.|.+.|--.---|.|=.|++|+|- .+|-..|.- T Consensus 304 v~~sPfvp~d~k~~rFd~~~V-d~NnvgvlLV~D~i~tdq~ddk~rdiq~iKl~ERYG~gvLn~gkaiavakNI 376 (393) T protein:vir:79 304 VNLSPFIPLDKKSRRFDVYAV-DRNNVGVLLVRDDLKTDQWDEKARGLQNIKMIERYGIGILNEGKAIAVAKNI 376 (393) T ss_pred EEEecccccccccceeeEEEe-ecCCceEEEEecCcceeccccccccceeeeeeeeeceeeeeCCceEEEEecc Confidence 2222222322 34444 677899999999999999998888655667888999988885 455555443 No 154 >protein:vir:9875 Length: 296 # NCBI annotation: hypothetical protein # Family: family:all:1178 # MgeID: mge:177 # MgeName: 315.5 # Cross-refs: genbank:acc:NP_795637;genbank:gi:28876404;genbank:GeneID:1257935 Probab=20.35 E-value=2.8 Score=18.14 Aligned_cols=249 Identities=15% Similarity=0.159 Sum_probs=129.3 Q ss_pred hcchhcchHHHHHHHHHHhcC--chhHHHHHHHHHHHHHHHHHhhhhhhhhhhhccccCCCcce-eecCCCCccceEEEE Q lcl|NC_011269. 32 MGGRKLSAREKQAKLAHILSD--KVGGIQRLGQSMIGPIQLQLRYQGILRNVLLEDTLTPGVPI-QYDVLDDLGQAYMLH 108 (333) Q Consensus 32 ~~~~~ls~ee~~~Lm~~Al~~--~Eg~~~aLg~~mA~pI~~q~~rqGi~RklL~~~TL~~G~~p-~y~v~~~v~~a~~~~ 108 (333) |--..--.||.-- .+..|.. +---+++++++ |..-++--|+= -..+|.+|.-+ -|| ++-.+ T Consensus 1 ~~~~~~~~e~nlt-~~~dl~~~~siDf~~~f~~~----i~~L~~~LGv~----r~~pla~GstIkt~k-------~~~y~ 64 (296) T protein:vir:98 1 MVTSRTYPEENLI-KSTDLKYPITIDVTNKFQEN----ISKLLEMLGVT----RKISVSEGMTLKTYA-------GYDVT 64 (296) T ss_pred CCCccccCcCCCc-chhhhhhhhhhhhHHHHhhh----HHHHHHHhhhc----ccccccCCCEEeecc-------ceeee Confidence 1110000111000 0000100 00011122222 22222222222 56899999988 665 34447 Q ss_pred cCCCcccceeecCceeecc----------ceeeec-cccccHHHh-hhhcchhHHHHHHHHHHHHHHHhhhHHHHHHhhh Q lcl|NC_011269. 
109 GNEGEIRITPFEGKRIEVQ----------LFRIAS-FPQIKKEDL-YYLRSNIVEYTQDMTKQAIMRQEDSRLVTLLEAA 176 (333) Q Consensus 109 ~~~G~i~~Q~i~~~ri~~P----------~f~Ivs-~P~V~~~dl-~~~~~~vle~~q~~A~qaIM~qED~~~~slle~~ 176 (333) +.+|+|. ||+.|.+- +..|-- +-.+..+.+ ..+.++-+-++...-..+|-..=|..+|+.|-.+ T Consensus 65 gda~dVa----EGe~Iplskvt~~~~~t~t~~ikK~rK~tTdEAIqlsGyg~aVgetd~qL~~~iq~kId~d~~t~Lkta 140 (296) T protein:vir:98 65 LAEGNVP----EGEVIPLSKVERKIHSEKKIELKKYRKATTGEDIQMYGSNEAVTNTDNALVRQLQKKIRTDFVTALKTG 140 (296) T ss_pred ecccccc----CCcccchhhheeeecceEEEEeeccccccCHHHHHhhcCCchhHHHHHHHHHHHHHhhhHHHHHHHhcc Confidence 7777665 45544332 222222 334677887 6788888888888888899999999999888333 Q ss_pred hhhhhhhcccccccccCCCcceEEeecccc---HHHHHHHHHHHHhhCCccceEEechhhhhhhhhcCCCchhhhHHhhh Q lcl|NC_011269. 177 AVSYRVVDSSAQPGVGALPNEITIAGSHLM---PDDLYTAVTYTDQRQLDSSRLLANPQEYRDLYRWDINTTGWAFKDSV 253 (333) Q Consensus 177 a~~~r~~~ssA~p~vg~~~N~i~i~~g~Lt---~~~L~~a~t~v~~~~L~at~il~~~~~~~Di~gw~~N~~~~~~~DpV 253 (333) . .+.+.++..|+ ...+.++....++-+-..+-+.+||.-..+.+| |... - T Consensus 141 T------------------~t~~~t~~~lQ~Ala~~~~~l~~~feded~~~~V~FVnP~D~a~ylg---~a~i------t 193 (296) T protein:vir:98 141 T------------------GTQDALGAGLQGALASAWGKLQVLFEDYGSERAIVFANSLDVAEYIA---KAGI------T 193 (296) T ss_pred c------------------ceeeechhhHHHHHHHHhhhhhhhccccCCCceEEEEehHHHHHHhc---CCcc------c Confidence 2 34444444444 233445556677766667778899998888887 4321 1 Q ss_pred hhcceeeeeec----ccccceeeecCCeEEEeeChhhhccc-ccccC-ceeccccchhh--------------------h Q lcl|NC_011269. 
254 VAGERIVQFGE----FQIGKSIIIPRGTVYLTPEPEFLGVF-PVMYS-LDVEEDNKVER--------------------F 307 (333) Q Consensus 254 ~~~e~il~~G~----fgi~~skvlprgeiyvvadpE~~G~~-pvR~~-L~s~p~D~~er--------------------~ 307 (333) ++.+.=++|++ -+|..|+++|.|++|.+|..--...| |.++| |-..=..+..+ + T Consensus 194 ~qt~fG~tyl~nfLG~~II~S~kV~~G~~~~T~~~Ni~~ay~~~~~~~l~~~f~~~~d~tglIGv~h~~~~~~~t~eT~~ 273 (296) T protein:vir:98 194 TQTAFGLTYLVDFTGTVIISTNDVTKGEIWATVPENIIFAYINPNNSELAKEFNLYGDPTGYIGMNHFQENTTLTIQTLL 273 (296) T ss_pred hhheechhhhhhccccEEEEcCcCCCceEEEeeecceEEEeecccccchhhhhccccccccceEEEeccccceeeehhHh Confidence 33333333332 24779999999999999876655555 44434 22111111111 1 Q ss_pred ccceehhhhhhhhhhccceEEEEe-cC Q lcl|NC_011269. 308 NKGWVMDELVGMAILNPRGIVILR-KA 333 (333) Q Consensus 308 ~kGWvm~E~~g~~i~N~~siv~~~-~~ 333 (333) -.||.|+-| ++-+||..- ++ T Consensus 274 ~~~~~lfpE------~~dgiv~~tI~~ 294 (296) T protein:vir:98 274 VSGMLMYPE------RIDGIVKVTLTP 294 (296) T ss_pred HhHHHhccc------ccceEEEEEecC Confidence 334444433 344544322 22 Done!