Query lcl|Aclame:protein:vir:101557|NCBI_annot:gp12|genbank:acc:NP_958117;genbank:gi:41057663;genbank:GeneID:2716814 Match_columns 336 No_of_seqs 106 out of 110 Neff 6.5 Searched_HMMs 1612 Date Sat Nov 30 19:26:07 2013 Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_40 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_40_vs_rec_db.hhr No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 protein:vir:78558 Length: 336 100.0 5E-135 3E-138 756.8 34.3 336 1-336 1-336 (336) 2 protein:vir:106734 Length: 336 100.0 9E-135 5E-138 755.5 34.4 336 1-336 1-336 (336) 3 protein:vir:101557 Length: 336 100.0 2E-134 1E-137 754.0 34.2 336 1-336 1-336 (336) 4 protein:vir:3643 Length: 336 # 100.0 2E-134 1E-137 753.7 34.4 336 1-336 1-336 (336) 5 protein:vir:94070 Length: 339 100.0 3E-122 2E-125 686.5 32.4 335 1-336 4-339 (339) 6 protein:vir:107732 Length: 379 100.0 3E-117 2E-120 659.3 30.7 334 1-336 23-379 (379) 7 protein:vir:99576 Length: 388 100.0 5E-115 3E-118 647.1 31.1 336 1-336 21-388 (388) 8 protein:vir:96079 Length: 382 100.0 7E-111 4E-114 624.5 27.5 334 1-336 21-382 (382) 9 protein:vir:79642 Length: 329 100.0 8.6E-92 5.4E-95 519.8 29.4 318 12-336 1-326 (329) 10 protein:vir:104342 Length: 314 100.0 2.8E-90 1.7E-93 511.6 27.8 302 13-336 1-311 (314) 11 protein:vir:107687 Length: 319 100.0 1.1E-87 7E-91 497.3 28.9 315 1-336 1-319 (319) 12 protein:vir:80068 Length: 301 100.0 2.3E-87 1.4E-90 495.6 27.7 291 42-336 1-301 (301) 13 protein:vir:103285 Length: 296 100.0 4.2E-85 2.6E-88 483.2 26.4 290 31-336 1-293 (296) 14 protein:vir:5255 Length: 304 # 100.0 1.7E-82 1.1E-85 468.9 27.0 284 33-335 1-304 (304) 15 protein:vir:105778 Length: 358 98.8 2.3E-10 1.4E-13 73.4 12.6 315 1-336 12-357 (358) 16 protein:vir:7771 Length: 330 # 98.7 1.3E-09 7.8E-13 69.4 13.4 288 31-336 1-321 (330) 17 protein:vir:80376 Length: 435 98.5 1.6E-08 1E-11 63.3 14.1 317 1-336 55-431 (435) 18 protein:vir:105905 Length: 304 98.4 2.4E-08 1.5E-11 62.4 14.4 282 31-336 1-303 (304) 19 protein:vir:94142 Length: 304 98.4 2.4E-08 1.5E-11 62.4 14.4 282 31-336 1-303 (304) 20 protein:vir:1433 Length: 435 # 98.4 3.3E-08 2E-11 61.6 14.4 316 1-336 52-431 (435) 21 protein:vir:94771 Length: 298 98.3 4.2E-08 2.6E-11 61.0 13.3 274 42-336 1-297 (298) 22 protein:vir:5739 Length: 366 # 98.3 1.6E-07 1E-10 57.8 16.2 309 1-336 27-364 (366) 23 protein:vir:104085 Length: 320 98.3 7.1E-08 4.4E-11 59.8 13.9 289 10-336 1-315 (320) 24 protein:vir:9574 Length: 300 # 98.3 5.7E-08 3.5E-11 60.3 13.0 277 31-336 1-298 (300) 25 protein:vir:99920 Length: 311 98.3 8.3E-08 5.1E-11 59.4 13.5 280 31-336 1-310 (311) 26 protein:vir:2504 Length: 305 # 98.2 1.7E-07 1.1E-10 57.6 14.2 274 34-336 1-296 (305) 27 protein:vir:1638 Length: 298 # 98.2 1.8E-07 1.1E-10 57.6 13.7 274 31-336 1-297 (298) 28 protein:vir:8187 Length: 311 # 98.2 1.6E-07 1E-10 57.8 13.4 278 33-336 1-308 (311) 29 protein:vir:108211 Length: 318 98.2 2.4E-08 1.5E-11 62.4 8.7 277 1-336 1-315 (318) 30 protein:vir:95763 Length: 297 98.1 3E-07 1.9E-10 56.3 13.6 275 31-336 1-294 (297) 31 protein:vir:8420 Length: 477 # 98.1 3.5E-07 2.2E-10 55.9 13.8 317 1-336 103-469 (477) 32 protein:vir:94673 Length: 419 98.1 1.1E-06 6.8E-10 53.3 15.9 307 1-336 56-415 (419) 33 protein:vir:4226 Length: 326 # 98.0 6E-07 3.7E-10 54.7 13.4 291 22-336 1-321 (326) 34 protein:vir:96392 Length: 324 97.9 1.4E-06 8.5E-10 52.7 14.0 294 1-336 1-313 (324) 35 protein:vir:78830 Length: 324 97.9 1.4E-06 8.5E-10 52.7 14.0 294 1-336 1-313 (324) 36 protein:vir:41 Length: 299 # N 97.9 9.8E-07 6.1E-10 53.5 13.1 276 31-336 1-296 (299) 37 protein:vir:9759 Length: 303 # 97.9 1.2E-06 7.3E-10 53.1 13.0 282 31-336 1-301 (303) 38 protein:vir:97148 Length: 324 97.8 2E-06 1.2E-09 51.8 13.2 293 1-336 1-313 (324) 39 protein:vir:78523 Length: 338 97.8 2.2E-06 1.4E-09 51.6 13.2 295 22-336 1-333 (338) 40 protein:vir:105038 Length: 428 97.8 6E-06 3.7E-09 49.2 15.3 316 1-336 53-426 (428) 41 protein:vir:2430 Length: 318 # 97.8 3.1E-06 1.9E-09 50.8 13.6 286 10-336 1-311 (318) 42 protein:vir:78223 Length: 333 97.8 2.5E-06 1.6E-09 51.3 13.0 292 7-336 1-330 (333) 43 protein:vir:103955 Length: 324 97.7 3.9E-06 2.4E-09 50.3 13.7 292 1-336 1-313 (324) 44 protein:vir:9309 Length: 324 # 97.7 1E-05 6.3E-09 48.0 15.6 293 1-336 1-313 (324) 45 protein:vir:96223 Length: 324 97.7 6.2E-06 3.8E-09 49.1 14.2 294 1-336 1-313 (324) 46 protein:vir:100135 Length: 418 97.7 1.2E-05 7.5E-09 47.5 15.6 302 1-336 67-413 (418) 47 protein:vir:99749 Length: 324 97.7 6E-06 3.7E-09 49.2 13.9 292 1-336 1-313 (324) 48 protein:vir:104256 Length: 458 97.7 1E-05 6.4E-09 47.9 15.2 309 1-336 99-456 (458) 49 protein:vir:101650 Length: 497 97.5 5.4E-06 3.3E-09 49.5 12.1 311 1-336 86-491 (497) 50 protein:vir:7855 Length: 497 # 97.5 5.4E-06 3.3E-09 49.5 12.1 311 1-336 86-491 (497) 51 protein:vir:191 Length: 385 # 97.5 8.7E-06 5.4E-09 48.3 12.8 304 1-336 50-382 (385) 52 protein:vir:1886 Length: 385 # 97.5 8.7E-06 5.4E-09 48.3 12.8 304 1-336 50-382 (385) 53 protein:vir:81227 Length: 413 97.4 3.7E-05 2.3E-08 44.9 15.0 301 1-336 58-408 (413) 54 protein:vir:4339 Length: 395 # 97.4 2.1E-05 1.3E-08 46.2 13.4 305 1-336 54-393 (395) 55 protein:vir:80684 Length: 315 97.3 1.3E-05 7.9E-09 47.4 11.7 278 31-336 1-304 (315) 56 protein:vir:10364 Length: 390 97.2 5.2E-05 3.2E-08 44.1 14.0 300 1-336 54-390 (390) 57 protein:vir:3613 Length: 272 # 97.0 5.2E-05 3.2E-08 44.1 11.9 257 34-336 1-272 (272) 58 protein:vir:8102 Length: 543 # 96.8 0.00015 9.2E-08 41.6 12.9 305 1-336 191-540 (543) 59 protein:vir:97053 Length: 390 96.7 0.00015 9.5E-08 41.5 12.7 297 1-336 54-390 (390) 60 protein:vir:96123 Length: 274 96.6 0.00026 1.6E-07 40.2 13.3 256 34-336 1-268 (274) 61 protein:vir:2344 Length: 397 # 96.6 0.00028 1.8E-07 40.0 13.4 278 13-336 1-304 (397) 62 protein:vir:96833 Length: 275 96.6 9.4E-05 5.9E-08 42.7 10.5 257 31-336 1-275 (275) 63 protein:vir:9410 Length: 415 # 96.5 0.00037 2.3E-07 39.4 13.7 304 1-336 54-402 (415) 64 protein:vir:1328 Length: 392 # 96.5 0.00049 3E-07 38.8 14.2 303 1-336 58-389 (392) 65 protein:vir:93616 Length: 645 96.5 0.00043 2.7E-07 39.0 13.6 300 1-336 290-637 (645) 66 protein:vir:96762 Length: 632 96.4 0.00046 2.8E-07 38.9 13.7 304 1-336 304-631 (632) 67 protein:vir:81070 Length: 390 96.4 0.00046 2.8E-07 38.9 13.5 302 1-336 54-390 (390) 68 protein:vir:4159 Length: 315 # 96.4 0.00067 4.2E-07 38.0 14.3 299 15-336 1-315 (315) 69 protein:vir:4600 Length: 415 # 96.4 0.0007 4.3E-07 37.9 15.0 303 1-336 51-402 (415) 70 protein:vir:4700 Length: 415 # 96.4 0.0007 4.3E-07 37.9 15.0 303 1-336 51-402 (415) 71 protein:vir:93742 Length: 274 96.2 0.00066 4.1E-07 38.0 13.1 255 34-336 1-268 (274) 72 protein:vir:98339 Length: 415 96.1 0.001 6.2E-07 37.0 14.1 304 1-336 51-402 (415) 73 protein:vir:81100 Length: 415 96.1 0.001 6.2E-07 37.0 14.1 304 1-336 51-402 (415) 74 protein:vir:79987 Length: 415 96.1 0.001 6.2E-07 37.0 14.1 304 1-336 51-402 (415) 75 protein:vir:80930 Length: 278 96.0 0.0006 3.7E-07 38.3 12.3 262 31-336 1-275 (278) 76 protein:vir:4456 Length: 401 # 96.0 0.00094 5.8E-07 37.2 13.3 309 1-336 51-399 (401) 77 protein:vir:97433 Length: 274 96.0 0.00087 5.4E-07 37.4 12.9 255 34-336 1-268 (274) 78 protein:vir:94494 Length: 274 96.0 0.00087 5.4E-07 37.4 12.9 255 34-336 1-268 (274) 79 protein:vir:6212 Length: 434 # 95.9 0.00059 3.7E-07 38.3 11.7 308 1-336 85-427 (434) 80 protein:vir:6242 Length: 390 # 95.7 0.0014 8.5E-07 36.3 13.1 301 1-336 54-387 (390) 81 protein:vir:100247 Length: 425 95.7 0.0016 9.6E-07 36.0 13.9 313 1-336 81-422 (425) 82 protein:vir:96262 Length: 274 95.3 0.0023 1.5E-06 35.0 13.2 256 34-336 1-268 (274) 83 protein:vir:95898 Length: 274 95.3 0.0023 1.5E-06 35.0 13.2 256 34-336 1-268 (274) 84 protein:vir:4092 Length: 390 # 95.3 0.0024 1.5E-06 34.9 15.0 301 1-336 55-368 (390) 85 protein:vir:105334 Length: 276 95.3 0.0019 1.2E-06 35.5 12.2 252 34-336 1-268 (276) 86 protein:vir:1239 Length: 274 # 95.2 0.0025 1.6E-06 34.8 13.2 255 34-336 1-268 (274) 87 protein:vir:3033 Length: 272 # 95.2 0.0026 1.6E-06 34.8 14.7 251 31-336 1-267 (272) 88 protein:vir:9820 Length: 272 # 95.2 0.0026 1.6E-06 34.8 14.7 251 31-336 1-267 (272) 89 protein:vir:4856 Length: 293 # 95.1 0.0027 1.7E-06 34.7 12.5 258 32-336 1-279 (293) 90 protein:vir:485 Length: 407 # 95.0 0.0031 1.9E-06 34.4 15.3 313 1-336 53-398 (407) 91 protein:vir:4197 Length: 314 # 94.9 0.0032 2E-06 34.2 13.6 290 1-336 1-310 (314) 92 protein:vir:4830 Length: 397 # 94.4 0.0046 2.8E-06 33.4 13.5 296 1-336 50-385 (397) 93 protein:vir:3870 Length: 400 # 94.2 0.0046 2.9E-06 33.4 11.7 287 1-336 65-397 (400) 94 protein:vir:3991 Length: 404 # 93.0 0.009 5.6E-06 31.8 13.5 298 1-336 63-391 (404) 95 protein:vir:93881 Length: 387 92.0 0.012 7.1E-06 31.2 10.6 290 1-336 56-379 (387) 96 protein:vir:1025 Length: 408 # 92.0 0.013 8.3E-06 30.9 11.5 297 1-336 59-391 (408) 97 protein:vir:95107 Length: 270 91.9 0.0079 4.9E-06 32.1 9.5 250 31-336 1-263 (270) 98 protein:vir:102119 Length: 404 91.6 0.015 9.4E-06 30.6 13.6 306 1-336 44-398 (404) 99 protein:vir:4511 Length: 409 # 91.0 0.018 1.1E-05 30.2 13.0 298 1-336 57-404 (409) 100 protein:vir:7409 Length: 408 # 91.0 0.018 1.1E-05 30.2 12.9 298 1-336 59-391 (408) 101 protein:vir:81160 Length: 371 89.8 0.024 1.5E-05 29.4 11.0 291 1-336 45-368 (371) 102 protein:vir:97255 Length: 310 89.7 0.025 1.5E-05 29.4 11.4 268 17-336 1-308 (310) 103 protein:vir:4953 Length: 397 # 88.7 0.031 1.9E-05 28.9 12.5 296 1-336 53-383 (397) 104 protein:vir:101607 Length: 379 88.5 0.032 2E-05 28.8 13.0 291 1-336 54-379 (379) 105 protein:vir:8843 Length: 317 # 87.6 0.038 2.3E-05 28.4 10.5 280 37-336 1-313 (317) 106 protein:vir:94424 Length: 387 86.9 0.043 2.6E-05 28.1 10.1 295 1-336 50-379 (387) 107 protein:vir:2685 Length: 387 # 86.9 0.043 2.6E-05 28.1 10.1 295 1-336 50-379 (387) 108 protein:vir:96978 Length: 387 86.9 0.043 2.6E-05 28.1 10.1 295 1-336 50-379 (387) 109 protein:vir:79078 Length: 307 86.9 0.043 2.6E-05 28.1 11.7 263 33-336 1-300 (307) 110 protein:vir:3845 Length: 395 # 86.3 0.046 2.9E-05 27.9 11.1 295 1-336 40-381 (395) 111 protein:vir:107882 Length: 307 86.0 0.048 3E-05 27.8 13.6 261 33-336 1-300 (307) 112 protein:vir:98480 Length: 348 85.8 0.051 3.1E-05 27.7 11.2 275 33-336 1-341 (348) 113 protein:vir:105004 Length: 392 85.7 0.051 3.2E-05 27.7 12.9 295 1-336 35-382 (392) 114 protein:vir:107593 Length: 392 85.7 0.051 3.2E-05 27.7 12.9 295 1-336 35-382 (392) 115 protein:vir:102873 Length: 392 85.7 0.051 3.2E-05 27.7 12.9 295 1-336 35-382 (392) 116 protein:vir:102082 Length: 392 85.7 0.051 3.2E-05 27.7 12.9 295 1-336 35-382 (392) 117 protein:vir:1383 Length: 421 # 85.6 0.052 3.2E-05 27.6 14.1 295 1-336 54-392 (421) 118 protein:vir:1268 Length: 397 # 85.3 0.054 3.3E-05 27.6 12.7 286 1-336 89-395 (397) 119 protein:vir:4997 Length: 397 # 84.3 0.062 3.8E-05 27.2 13.7 294 1-336 53-383 (397) 120 protein:vir:94933 Length: 330 84.1 0.036 2.2E-05 28.5 7.8 288 1-336 1-327 (330) 121 protein:vir:100172 Length: 394 82.1 0.079 4.9E-05 26.6 13.7 290 1-336 57-382 (394) 122 protein:vir:80128 Length: 466 82.1 0.08 4.9E-05 26.6 15.1 306 1-336 73-446 (466) 123 protein:vir:78640 Length: 352 80.9 0.09 5.6E-05 26.3 11.2 291 1-336 21-344 (352) 124 protein:vir:102655 Length: 322 78.5 0.11 7.1E-05 25.7 9.7 282 31-336 1-319 (322) 125 protein:vir:9361 Length: 402 # 76.5 0.13 8.3E-05 25.4 11.6 295 1-336 65-396 (402) 126 protein:vir:739 Length: 231 # 74.7 0.16 9.7E-05 25.0 13.2 217 78-336 1-231 (231) 127 protein:vir:78350 Length: 383 70.2 0.21 0.00013 24.3 10.6 304 1-336 49-372 (383) 128 protein:vir:6324 Length: 335 # 68.2 0.24 0.00015 24.0 8.9 274 42-336 1-328 (335) 129 protein:vir:95376 Length: 425 65.6 0.28 0.00017 23.6 13.1 302 1-336 73-418 (425) 130 protein:vir:9704 Length: 394 # 62.8 0.33 0.0002 23.2 11.2 281 1-336 85-388 (394) 131 protein:vir:94622 Length: 341 46.4 0.74 0.00046 21.3 12.1 279 31-336 1-337 (341) 132 protein:vir:9509 Length: 381 # 44.3 0.81 0.0005 21.1 14.9 307 1-336 41-368 (381) 133 protein:vir:101291 Length: 381 44.3 0.81 0.0005 21.1 14.9 307 1-336 41-368 (381) 134 protein:vir:3158 Length: 321 # 43.8 0.83 0.00052 21.0 12.5 283 1-336 1-309 (321) 135 protein:vir:99675 Length: 324 43.3 0.85 0.00053 21.0 8.3 246 75-336 1-301 (324) 136 protein:vir:99888 Length: 309 43.2 0.86 0.00053 21.0 15.3 266 34-336 1-301 (309) 137 protein:vir:100884 Length: 389 39.4 1 0.00063 20.5 13.6 289 1-336 47-382 (389) 138 protein:vir:80213 Length: 334 34.0 1.3 0.00082 19.9 7.1 292 17-336 1-332 (334) 139 protein:vir:79928 Length: 393 33.7 1.3 0.00083 19.9 12.6 311 1-336 28-376 (393) 140 protein:vir:105645 Length: 400 31.9 1.5 0.00091 19.7 8.0 287 31-336 1-331 (400) 141 protein:vir:7990 Length: 273 # 31.9 1.5 0.00091 19.7 13.8 259 42-336 1-271 (273) 142 protein:vir:105822 Length: 273 25.6 2 0.0013 18.9 14.2 256 42-336 1-273 (273) 143 protein:vir:102605 Length: 273 25.6 2 0.0013 18.9 14.2 256 42-336 1-273 (273) 144 protein:vir:78935 Length: 335 20.1 2.8 0.0018 18.1 8.9 272 42-336 1-328 (335) No 1 >protein:vir:78558 Length: 336 # NCBI annotation: major capsid protein # Family: family:all:1653 # MgeID: mge:1854 # MgeName: BcepNY3 # Cross-refs: genbank:acc:YP_001294848;genbank:gi:149882911;genbank:GeneID:5291029 Probab=100.00 E-value=5.2e-135 Score=756.78 Aligned_cols=336 Identities=97% Similarity=1.420 Sum_probs=333.8 Q ss_pred CchHHHHHHhhhcceeccchhhhccchhHHHHHhhhhhcccccccCcchHHHHHHHhhCceeeeeeccccchhhhccccc Q lcl|Aclame:pro 1 MRDAQRIQNLARAGVILPRSVQNVSTPLTEYAMDAADLSPHLSSTGSSGIPNYLTTYVDPAVIDILVAPMKAAELVGESK 80 (336) Q Consensus 1 ~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~a~da~d~~~~l~t~~~~~i~~~l~~~idp~v~~~~~~~~~~~~l~~v~t 80 (336) |||++++++|+|+||+||+++.+|+++.+.|||||+|++|+|+|++|+|||||||+||||++||++|+||++++|||++| T Consensus 1 ~~~~~~~~~l~~~gi~~~~~~~~~~~~~~~~a~da~d~~~~~~t~~~~g~~~~l~~~i~p~~~~~~~~~~~~~~l~~v~t 80 (336) T protein:vir:78 1 MRDAQRIQNLARAGVILPRSVKNVSTPLAEYAMDAADLSPHLSSTGSSGIPNYLTTYVDPSVIDILVAPMKAAELVGESK 80 (336) T ss_pred CchHHHHHHHhccCeecchhhhhhhHHHHHHHHhhhhhccccccCCCcchHHHHHHhcccceeeehhhhhhhhhhccccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CCCcceeeEEEeeeecceeeEEeecccCCceeeeeeeeeeeeEEEEEEEEeeCHHHHHHHHhhCCCHHHHHHHHHHHHHH Q lcl|Aclame:pro 81 KGDWTTLVAAFITAEPTTKVATYGDYSSDGDSGANINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASELNYSSALGLA 160 (336) Q Consensus 81 ~g~w~~~t~~~~~~e~~G~a~~ygd~~diP~~~~~~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~k~~aAr~a~e 160 (336) +|+|++++++|+++|.+|+|++|||++|+|++|+++++++|+++++++||+||++|+++|+++|++|+++|+.+||+++| T Consensus 81 ~g~W~~~~~~~~~~e~~G~a~~ygd~~D~P~vd~~~~~~~~~v~~~~~g~~yg~~El~~A~~~g~~l~~~Ka~aA~~ale 160 (336) T protein:vir:78 81 KGDWTTLVAAFITAEPTTTVATYGDYSSDGDSGTNINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASELNYSSALGLA 160 (336) T ss_pred CCCccccEEEEeeeecceeeEEeecccCCCeeecceeeEEEEEEEEEeeeeecHHHHHHHHHhCCCcHHHHHHHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HhhcceEEeeccccceEEEEecCCCCcccccccccccccCHHHHHHHHHHHHHHHHHHhCCceecccccEEEecHHHHHh Q lcl|Aclame:pro 161 KFLNGSYLFGVAGLENYGLINDPSLSAPITATTPWSGSPAVEAVVNEVVALFQVLQTQSQGIITQEDVLRMGLPPTAMSD 240 (336) Q Consensus 161 ~~~n~~~~~Gd~~~g~~GllN~Pnl~~~~~~~t~w~~~~t~~eI~~Di~~l~~~l~~~s~g~v~~~~p~tL~Lp~~~~~~ 240 (336) +++|++|||||+++++||||||||||+.++++++||+++|+|||++||++++++|++||+|.+++|+|+||+||++++.+ T Consensus 161 ~~~N~~~~~Gd~~~~~~GllN~P~l~a~~t~~~~~w~~~T~~~I~~Di~~~~~~l~~qt~g~~~~~~~~tL~Lp~~~~~~ 240 (336) T protein:vir:78 161 KFLNGSYLFGVAGLENYGLINDPSLSAPITATTPWSGSPAVEAVVNEVVTLFQVLQTQSQGIITQEAVLHMGLPPTAMSD 240 (336) T ss_pred HhhCeEEEEeccccceEEEEeCCCCCcccccCcCcccccCHHHHHHHHHHHHHHHHHhcCCeeeeccceEEEechHHHHh Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccCCCCCccHHHHHHHhCCccEEEEcccccCCCCceEEEEEEeecCCceEEEEcChhhhcccceecCCceEEcccccee Q lcl|Aclame:pro 241 LSKTNQYGLAAAAKLKDIFPKLEFVTIPEYDTASGRLVQLWAPRVEGKDTATCGFTEKMRAHSIERYSSYFRQKKSAGTW 320 (336) Q Consensus 241 L~~~~~~~~Tvl~~l~~n~pnl~i~~~pel~~a~G~~~~~~~~~~~~~~~~~~~~p~~~r~l~~~~~~~~~~vp~~~~t~ 320 (336) |+++|++|+||++|||+|||||+|+++|||++|+|++++||++++++++++++++||+||+||+|+++++|+|||++||| T Consensus 241 L~~~n~~g~tv~~~lk~n~Pnl~i~t~pel~~Agg~~~~~~~~~~~~~~t~~~~~p~~f~~lpvq~~~~~~~v~~~~rt~ 320 (336) T protein:vir:78 241 LSKTNQYGLSAAAKLKEIFPKLEFVTIPEYDTASGRLVQLWAPRVEGKDTATCGFTEKMRAHSIERYSSYFRQKKSAGTW 320 (336) T ss_pred ccCCCccCccHHHHHHHhcCccEEEEcccccccCcceEEEEEeeccCCcceeeecchhhhccceeecCceeEecccccee Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eeeeecccceeeeccC Q lcl|Aclame:pro 321 GAVIFRPFAVAQMIGV 336 (336) Q Consensus 321 Gv~ir~P~av~~~~GI 336 (336) ||+||||+||++++|| T Consensus 321 Gv~i~~P~ai~~~~GI 336 (336) T protein:vir:78 321 GAVIFRPFAVAQMIGV 336 (336) T ss_pred eeeeeccchheeeccC Confidence 9999999999999999 No 2 >protein:vir:106734 Length: 336 # NCBI annotation: gp13 # Family: family:all:1653 # MgeID: mge:1599 # MgeName: Bcep1 # Cross-refs: genbank:acc:NP_944321;genbank:gi:38638620;genbank:GeneID:2657363 Probab=100.00 E-value=8.9e-135 Score=755.51 Aligned_cols=336 Identities=97% Similarity=1.422 Sum_probs=333.9 Q ss_pred CchHHHHHHhhhcceeccchhhhccchhHHHHHhhhhhcccccccCcchHHHHHHHhhCceeeeeeccccchhhhccccc Q lcl|Aclame:pro 1 MRDAQRIQNLARAGVILPRSVQNVSTPLTEYAMDAADLSPHLSSTGSSGIPNYLTTYVDPAVIDILVAPMKAAELVGESK 80 (336) Q Consensus 1 ~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~a~da~d~~~~l~t~~~~~i~~~l~~~idp~v~~~~~~~~~~~~l~~v~t 80 (336) |||++++++|+|+||+||+++.+|+++.+.|||||+|++|+|+|++|+|||||||+||||++||++|+||++++||||+| T Consensus 1 ~~~~~~~~~l~~~gi~~~~~~~~~~~~~~~~a~da~d~~~~~~t~~~~g~~~~l~~~i~p~~~~~~~~~~~~~~l~~v~t 80 (336) T protein:vir:10 1 MRDAQRIQNLARAGVILPRSVKNVSTPLAEYAMDAADLSPHLSSTGSSGIPNYLTTYVDPSVIDILVAPMKAAELVGESK 80 (336) T ss_pred CchHHHHHHHhccCeecchhhhhhhHHHHHHHHhhhhhccccccCCCcchHHHHHhhcCcceeeeeechhchhhhccccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CCCcceeeEEEeeeecceeeEEeecccCCceeeeeeeeeeeeEEEEEEEEeeCHHHHHHHHhhCCCHHHHHHHHHHHHHH Q lcl|Aclame:pro 81 KGDWTTLVAAFITAEPTTKVATYGDYSSDGDSGANINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASELNYSSALGLA 160 (336) Q Consensus 81 ~g~w~~~t~~~~~~e~~G~a~~ygd~~diP~~~~~~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~k~~aAr~a~e 160 (336) +|+|++++++|+++|.+|++++|||++|+|++|+++++++|++|++++||+||++|+++|+++|++|+++|+.+||+++| T Consensus 81 ~g~w~~~~~~~~~~e~~G~a~~ygd~~d~P~~d~~~~~~~~~v~~~~~g~~yg~~El~~A~~~g~~l~~~Ka~aA~~ale 160 (336) T protein:vir:10 81 KGDWTTLVAAFITAEPTTKVATYGDYSSDGDSGTNINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASELNYSSALGLA 160 (336) T ss_pred CCCcceeeEEEEeeeeeeeEEEccccCCCcceeeeeeeeeeeEEEEEEEEeeCHHHHHHHHHhCCCcHHHHHHHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HhhcceEEeeccccceEEEEecCCCCcccccccccccccCHHHHHHHHHHHHHHHHHHhCCceecccccEEEecHHHHHh Q lcl|Aclame:pro 161 KFLNGSYLFGVAGLENYGLINDPSLSAPITATTPWSGSPAVEAVVNEVVALFQVLQTQSQGIITQEDVLRMGLPPTAMSD 240 (336) Q Consensus 161 ~~~n~~~~~Gd~~~g~~GllN~Pnl~~~~~~~t~w~~~~t~~eI~~Di~~l~~~l~~~s~g~v~~~~p~tL~Lp~~~~~~ 240 (336) +++|++|||||+++++||||||||||++++++++||++||+|||++||++++++|++||+|.+++|+|+||+|||+++.+ T Consensus 161 ~~~N~~~~~Gd~~~~~~GllN~P~l~a~~t~~~~~w~~~T~~eI~~Di~~~~~~l~~qt~g~i~~~~~~tL~Lp~~~~~~ 240 (336) T protein:vir:10 161 KFLNGSYLFGVAGLENYGLINDPSLSAPITATTPWSGSPAVEAVVNEVVTLFQVLQTQSQGIITQEAVLHMGLPPTAMSD 240 (336) T ss_pred HhhCeEEEEeecccceEEEeecCCCCcccccCcCcccccCHHHHHHHHHHHHHHHHHhcCCeeeeccceEEEechHHHHh Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccCCCCCccHHHHHHHhCCccEEEEcccccCCCCceEEEEEEeecCCceEEEEcChhhhcccceecCCceEEcccccee Q lcl|Aclame:pro 241 LSKTNQYGLAAAAKLKDIFPKLEFVTIPEYDTASGRLVQLWAPRVEGKDTATCGFTEKMRAHSIERYSSYFRQKKSAGTW 320 (336) Q Consensus 241 L~~~~~~~~Tvl~~l~~n~pnl~i~~~pel~~a~G~~~~~~~~~~~~~~~~~~~~p~~~r~l~~~~~~~~~~vp~~~~t~ 320 (336) |+++|++|+|+++|||+|||||+|+++|||++|+|++++||++++++++++++++||+||+||+|+++++|+|||++||| T Consensus 241 L~~~n~~g~tv~~~lk~n~Pnl~i~t~pel~~Agg~~~~~~~~~~~~~~t~~~~~P~~f~~lpvq~~~~~~~v~~~~rt~ 320 (336) T protein:vir:10 241 LSKTNQYGLSAAAKLKEIFPKLEFVTIPEYDTASGRLVQLWAPRVEGKDTATCGFTEKMRAHSIERYSSYFRQKKSAGTW 320 (336) T ss_pred ccCCCccCccHHHHHHHhCCccEEEEcccccccCCceEEEEEecccCCcceeeecChhhhccceeecCceeEecccccee Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eeeeecccceeeeccC Q lcl|Aclame:pro 321 GAVIFRPFAVAQMIGV 336 (336) Q Consensus 321 Gv~ir~P~av~~~~GI 336 (336) ||+||||+||++++|| T Consensus 321 Gv~i~rP~ai~~~~GI 336 (336) T protein:vir:10 321 GAVIFRPFAVAQMLGV 336 (336) T ss_pred eeeeeccchheeeccC Confidence 9999999999999999 No 3 >protein:vir:101557 Length: 336 # NCBI annotation: gp12 # Family: family:all:1653 # MgeID: mge:1477 # MgeName: Bcep43 # Cross-refs: genbank:acc:NP_958117;genbank:gi:41057663;genbank:GeneID:2716814 Probab=100.00 E-value=1.7e-134 Score=754.01 Aligned_cols=336 Identities=100% Similarity=1.443 Sum_probs=333.8 Q ss_pred CchHHHHHHhhhcceeccchhhhccchhHHHHHhhhhhcccccccCcchHHHHHHHhhCceeeeeeccccchhhhccccc Q lcl|Aclame:pro 1 MRDAQRIQNLARAGVILPRSVQNVSTPLTEYAMDAADLSPHLSSTGSSGIPNYLTTYVDPAVIDILVAPMKAAELVGESK 80 (336) Q Consensus 1 ~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~a~da~d~~~~l~t~~~~~i~~~l~~~idp~v~~~~~~~~~~~~l~~v~t 80 (336) |||++++++|+|+||+||+++.+|+.+...|||||||++|+|+|++|+|||||||+||||++||++|+||++++||||+| T Consensus 1 ~~~~~~~~~l~~~gi~~~~~~~~~~~~~~~~~~da~d~~~~~~~~~~~~i~~~l~~~i~p~~~~~~~~p~~a~~l~pv~t 80 (336) T protein:vir:10 1 MRDAQRIQNLARAGVILPRSVQNVSTPLTEYAMDAADLSPHLSSTGSSGIPNYLTTYVDPAVIDILVAPMKAAELVGESK 80 (336) T ss_pred CchHHHHHHHhhcCeeecchhhhhhhhHHHhhhhhhhccCccccCCCchhHHHHHhhcccceeeehhhhhhhhhhccccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CCCcceeeEEEeeeecceeeEEeecccCCceeeeeeeeeeeeEEEEEEEEeeCHHHHHHHHhhCCCHHHHHHHHHHHHHH Q lcl|Aclame:pro 81 KGDWTTLVAAFITAEPTTKVATYGDYSSDGDSGANINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASELNYSSALGLA 160 (336) Q Consensus 81 ~g~w~~~t~~~~~~e~~G~a~~ygd~~diP~~~~~~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~k~~aAr~a~e 160 (336) +|+|++++++|+++|.+|+|++||||+|+|++|++++|++|+++++++||+||++|+++|+++|++|+++|+.+||+++| T Consensus 81 ~g~W~~~~~~~~~~e~~G~a~~ygd~~D~P~~d~~~~~~~~~v~~~~~g~~yg~~El~~A~~~g~~l~~~Ka~aA~~ale 160 (336) T protein:vir:10 81 KGDWTTLVAAFITAEPTTKVATYGDYSSDGDSGANINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASELNYSSALGLA 160 (336) T ss_pred cCCccceeEEEeeeeceeeEEEeeccCCCceeecccceeeeeEEEEEeeeeeCHHHHHHHHHhCCCcHHHHHHHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HhhcceEEeeccccceEEEEecCCCCcccccccccccccCHHHHHHHHHHHHHHHHHHhCCceecccccEEEecHHHHHh Q lcl|Aclame:pro 161 KFLNGSYLFGVAGLENYGLINDPSLSAPITATTPWSGSPAVEAVVNEVVALFQVLQTQSQGIITQEDVLRMGLPPTAMSD 240 (336) Q Consensus 161 ~~~n~~~~~Gd~~~g~~GllN~Pnl~~~~~~~t~w~~~~t~~eI~~Di~~l~~~l~~~s~g~v~~~~p~tL~Lp~~~~~~ 240 (336) +++|++|||||+++++|||||||||+++++++++||+++|+|||++||++++++|+.||+|.++.|.|+||+||++++.+ T Consensus 161 ~~~N~i~~~Gd~~~~~yGllN~P~l~a~~t~~t~~~~~~t~eei~~Di~~~~~~l~~qs~G~i~~~~~~tL~LP~~~~~~ 240 (336) T protein:vir:10 161 KFLNGSYLFGVAGLENYGLINDPSLSAPITATTPWSGSPAVEAVVNEVVALFQVLQTQSQGIITQEDVLRMGLPPTAMSD 240 (336) T ss_pred HhhCcEEEEeccccceEEEEeCCCCccccccCCCcccccCHHHHHHHHHHHHHHHHHhcCCeecccCcceEEecHHHHHh Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccCCCCCccHHHHHHHhCCccEEEEcccccCCCCceEEEEEEeecCCceEEEEcChhhhcccceecCCceEEcccccee Q lcl|Aclame:pro 241 LSKTNQYGLAAAAKLKDIFPKLEFVTIPEYDTASGRLVQLWAPRVEGKDTATCGFTEKMRAHSIERYSSYFRQKKSAGTW 320 (336) Q Consensus 241 L~~~~~~~~Tvl~~l~~n~pnl~i~~~pel~~a~G~~~~~~~~~~~~~~~~~~~~p~~~r~l~~~~~~~~~~vp~~~~t~ 320 (336) |+++|++|+||++|||+|||||+|+++|||++++|++++||++++++++++++++||+||+||+|+++++|+|||++||| T Consensus 241 Ls~~n~~g~Tvl~~lk~n~Pnl~i~t~pEl~~a~G~~~~l~~~~~~~~~t~~~~~p~~~~~l~vq~~~~~~~v~~~~rt~ 320 (336) T protein:vir:10 241 LSKTNQYGLAAAAKLKDIFPKLEFVTIPEYDTASGRLVQLWAPRVEGKDTATCGFTEKMRAHSIERYSSYFRQKKSAGTW 320 (336) T ss_pred ccCCCccCccHHHHHHHhcCccEEEEccccccCCCceEEEEEEecCCCcceeeecchhhhccceeecCceeEecccccee Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eeeeecccceeeeccC Q lcl|Aclame:pro 321 GAVIFRPFAVAQMIGV 336 (336) Q Consensus 321 Gv~ir~P~av~~~~GI 336 (336) ||+||||+||++++|| T Consensus 321 Gv~i~~P~ai~~~~GI 336 (336) T protein:vir:10 321 GAVIFRPFAVAQMIGV 336 (336) T ss_pred eeeeeccchheeeecC Confidence 9999999999999999 No 4 >protein:vir:3643 Length: 336 # NCBI annotation: gp12 # Family: family:all:1653 # MgeID: mge:75 # MgeName: Bcep781 # Cross-refs: genbank:acc:NP_705638;genbank:gi:23752323;genbank:GeneID:955719 Probab=100.00 E-value=1.9e-134 Score=753.68 Aligned_cols=336 Identities=100% Similarity=1.442 Sum_probs=333.8 Q ss_pred CchHHHHHHhhhcceeccchhhhccchhHHHHHhhhhhcccccccCcchHHHHHHHhhCceeeeeeccccchhhhccccc Q lcl|Aclame:pro 1 MRDAQRIQNLARAGVILPRSVQNVSTPLTEYAMDAADLSPHLSSTGSSGIPNYLTTYVDPAVIDILVAPMKAAELVGESK 80 (336) Q Consensus 1 ~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~a~da~d~~~~l~t~~~~~i~~~l~~~idp~v~~~~~~~~~~~~l~~v~t 80 (336) |||++++++|+|+||+||+++.+++.+...|+|||||++|+|+|++|+|||||||+||||++||++|+||++++||||+| T Consensus 1 ~~~~~~~~~l~~~gi~~~~~~~~~~~~~~~~~~da~d~~~~~~~~~~~~~~~~l~~~i~p~~~~~~~~~~~~~~l~pv~t 80 (336) T protein:vir:36 1 MRDAQRIQNLARAGVILPRSVQNVSTPLTEYAMDAADLSPHLSSTGSSGIPNYLTTYVDPSVIDILVAPMKAAELVGESK 80 (336) T ss_pred CchHHHHHHHhhcCeeecchhhhhhhHHHHhhhhhhhccCccccCCCcchHHHHHHhhccceEeeecchhhhhhhccccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CCCcceeeEEEeeeecceeeEEeecccCCceeeeeeeeeeeeEEEEEEEEeeCHHHHHHHHhhCCCHHHHHHHHHHHHHH Q lcl|Aclame:pro 81 KGDWTTLVAAFITAEPTTKVATYGDYSSDGDSGANINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASELNYSSALGLA 160 (336) Q Consensus 81 ~g~w~~~t~~~~~~e~~G~a~~ygd~~diP~~~~~~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~k~~aAr~a~e 160 (336) +|+|++++++|+++|.+|+|++||||+|+|++|++++|++|+++++++||+||++|+++|+++|++|.++|+.+||+++| T Consensus 81 ~g~W~~~~~~~~~~e~~G~a~~ygd~~D~P~~d~~~~~~~~~v~~~~~g~~yg~~E~~~Aa~~~~~l~~~Ka~aA~~ale 160 (336) T protein:vir:36 81 KGDWTTLVAAFITAEPTTKVATYGDYSSDGDSGANINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASELNYSSALGLA 160 (336) T ss_pred cCCccceeEEEeeeeceeeEEEeeccCCCceeecccceeeeeEEEEEeeeeeCHHHHHHHHHhCCCcHHHHHHHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HhhcceEEeeccccceEEEEecCCCCcccccccccccccCHHHHHHHHHHHHHHHHHHhCCceecccccEEEecHHHHHh Q lcl|Aclame:pro 161 KFLNGSYLFGVAGLENYGLINDPSLSAPITATTPWSGSPAVEAVVNEVVALFQVLQTQSQGIITQEDVLRMGLPPTAMSD 240 (336) Q Consensus 161 ~~~n~~~~~Gd~~~g~~GllN~Pnl~~~~~~~t~w~~~~t~~eI~~Di~~l~~~l~~~s~g~v~~~~p~tL~Lp~~~~~~ 240 (336) +++|++|||||+++++|||||||||++.++++++||+++|+|||++||++++++|++||+|.++.|+|+||+||++++.+ T Consensus 161 ~~~N~i~~~Gd~~~~~yGllNdP~l~a~~t~~t~~~~~~t~~ei~~Di~~~~~~l~~qt~G~i~~~~~~tL~LP~~~~~~ 240 (336) T protein:vir:36 161 KFLNGSYLFGVAGLENYGLINDPSLSAPITATTPWSGSPAVEAVVNEVVALFQVLQTQSQGIITQEDVLRMGLPPTAMSD 240 (336) T ss_pred HhhCcEEEEeccccceEEEEecCCCccccccCCCcccccCHHHHHHHHHHHHHHHHHhcCCeeeeccccEEEechHHHHh Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccCCCCCccHHHHHHHhCCccEEEEcccccCCCCceEEEEEEeecCCceEEEEcChhhhcccceecCCceEEcccccee Q lcl|Aclame:pro 241 LSKTNQYGLAAAAKLKDIFPKLEFVTIPEYDTASGRLVQLWAPRVEGKDTATCGFTEKMRAHSIERYSSYFRQKKSAGTW 320 (336) Q Consensus 241 L~~~~~~~~Tvl~~l~~n~pnl~i~~~pel~~a~G~~~~~~~~~~~~~~~~~~~~p~~~r~l~~~~~~~~~~vp~~~~t~ 320 (336) |+++|++|+||++|||+|||||+|+++|||++|+|++++||++++++++++++++||+||+||+|+++++|+|||++||| T Consensus 241 Ls~~n~~g~Tvl~~lk~n~Pnl~i~t~pEl~~a~g~~~~l~~~~~~~~~t~~~~~p~~~~~l~vq~~~~~~~v~~~~rt~ 320 (336) T protein:vir:36 241 LSKTNQYGLAAAAKLKDIFPKLEFVTIPEYDTASGRLVQLWAPRVEGKDTATCGFTEKMRAHSIERYSSYFRQKKSAGTW 320 (336) T ss_pred ccCCCccCccHHHHHHHhcCccEEEEccccccCCCceEEEEEEecCCCcceeeecchhhhccceeecCceeEecccccee Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eeeeecccceeeeccC Q lcl|Aclame:pro 321 GAVIFRPFAVAQMIGV 336 (336) Q Consensus 321 Gv~ir~P~av~~~~GI 336 (336) ||+||||+||++++|| T Consensus 321 Gv~i~~P~ai~~~~GI 336 (336) T protein:vir:36 321 GAVIFRPFAVAQMIGV 336 (336) T ss_pred eeeeeccchheeeecC Confidence 9999999999999999 No 5 >protein:vir:94070 Length: 339 # NCBI annotation: putative structural protein # Family: family:all:1653 # MgeID: mge:1493 # MgeName: OP2 # Cross-refs: genbank:acc:YP_453625;genbank:gi:84662661;genbank:GeneID:5142580 Probab=100.00 E-value=3.4e-122 Score=686.53 Aligned_cols=335 Identities=48% Similarity=0.850 Sum_probs=325.1 Q ss_pred CchHHHHHHhhhcceeccch-hhhccchhHHHHHhhhhhcccccccCcchHHHHHHHhhCceeeeeeccccchhhhcccc Q lcl|Aclame:pro 1 MRDAQRIQNLARAGVILPRS-VQNVSTPLTEYAMDAADLSPHLSSTGSSGIPNYLTTYVDPAVIDILVAPMKAAELVGES 79 (336) Q Consensus 1 ~~~~~~~~~l~~~g~~~~~~-~~~~~~~~~~~a~da~d~~~~l~t~~~~~i~~~l~~~idp~v~~~~~~~~~~~~l~~v~ 79 (336) =.|.+++++|+++||+||+. .+.++.+...|||||+|++|.++|..|+|||+++++||||++||++|+++++++|||++ T Consensus 4 ~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~a~d~~~~~~~~~~~~~~~i~a~~~~~i~~~vy~~~~~~~~~~~l~pv~ 83 (339) T protein:vir:94 4 NNDRTDIKQLEKVGIIFDGYSPKSISSEVSAYAMDAVNLTPTLQTTANAGIPAWMTTFVDRRVIDIQLAPMAAAKIFPEV 83 (339) T ss_pred echHHHHHHHHhhceeeccchhhhcchhhHhhhccccccccccccccccchhhhhhhhhchhheeecccccchhhhcccc Confidence 56899999999999999965 45578899999999999999999999999999999999999999999999999999999 Q ss_pred cCCCcceeeEEEeeeecceeeEEeecccCCceeeeeeeeeeeeEEEEEEEEeeCHHHHHHHHhhCCCHHHHHHHHHHHHH Q lcl|Aclame:pro 80 KKGDWTTLVAAFITAEPTTKVATYGDYSSDGDSGANINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASELNYSSALGL 159 (336) Q Consensus 80 t~g~w~~~t~~~~~~e~~G~a~~ygd~~diP~~~~~~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~k~~aAr~a~ 159 (336) |+|+|++++++|+++|.+|+|++|||++|+|++|+++++++|+++++++||+|+++|+++|+++|++|+++|+.+||+++ T Consensus 84 t~g~w~~~t~~y~~~e~~G~a~~ygd~ad~Pl~~~~v~~~~~~v~~~~~g~~y~~~E~~~A~~~g~~l~~~Ka~aA~~al 163 (339) T protein:vir:94 84 KKGDWTTTYGVFIIAEPVGQVATYSDWSANGMSKANVNFESRQNYRYQTWTEYGDLEMATYGEAGIDYVARQEISASLVM 163 (339) T ss_pred cCCCCcccEEEEeeeecccceEEcccccCCCcccccceeeEEeEEEEEEEEeecHHHHHHHHhhCCChHHHHHHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHhhcceEEeeccccceEEEEecCCCCcccccccccccccCHHHHHHHHHHHHHHHHHHhCCceecccccEEEecHHHHH Q lcl|Aclame:pro 160 AKFLNGSYLFGVAGLENYGLINDPSLSAPITATTPWSGSPAVEAVVNEVVALFQVLQTQSQGIITQEDVLRMGLPPTAMS 239 (336) Q Consensus 160 e~~~n~~~~~Gd~~~g~~GllN~Pnl~~~~~~~t~w~~~~t~~eI~~Di~~l~~~l~~~s~g~v~~~~p~tL~Lp~~~~~ 239 (336) |+++|+++|||++++++||||||||+++.++++++| +++|+|||++||++++++|+.||+|.+++++|+||+|||+++. T Consensus 164 ~~~~N~i~~~Gd~~~~~~GLlN~P~l~~~v~~s~~W-a~kT~~eI~~Di~~~~~~l~~~s~g~~~~~~~~~L~LP~~~~~ 242 (339) T protein:vir:94 164 AKFANSSYLLGVAGIANYGLMNDPSLPAPVAATVNW-ATAAPEDIANDVVAMVGRLISQSGGLITGQERMVMALAPSALN 242 (339) T ss_pred HHhhceEEeeeecccceEEEEeCCCccccccCCCCc-ccCCHHHHHHHHHHHHHHHHHhcCCeeeeccCcEEEecHHHHH Confidence 999999999999999999999999999877776665 5678999999999999999999999999999999999999999 Q ss_pred hcccCCCCCccHHHHHHHhCCccEEEEcccccCCCCceEEEEEEeecCCceEEEEcChhhhcccceecCCceEEccccce Q lcl|Aclame:pro 240 DLSKTNQYGLAAAAKLKDIFPKLEFVTIPEYDTASGRLVQLWAPRVEGKDTATCGFTEKMRAHSIERYSSYFRQKKSAGT 319 (336) Q Consensus 240 ~L~~~~~~~~Tvl~~l~~n~pnl~i~~~pel~~a~G~~~~~~~~~~~~~~~~~~~~p~~~r~l~~~~~~~~~~vp~~~~t 319 (336) +|+++|++|+|+++|||+|||||+|+++|||++++|++.+||+.+++++++++++||||||+||+|+++++|+|||++|| T Consensus 243 ~L~~~n~~~~Tvl~~lk~n~pnl~i~~~~el~~a~g~~~~~~~~~~~~~~~~~~~~p~~~~~lpvq~~~~~~~v~~~~rt 322 (339) T protein:vir:94 243 NVNRTNNFGLSAGAKIAQTYPNIQFVAVPEFDTASGRLVQLWVPEVNGQPTGEVAFAEKLRSHSIERYSTTTRQKHSGAT 322 (339) T ss_pred hcccCCcCCccHHHHHHHhcCCcEEEEccccccCCCceEEEEEEeccCCcceEEEcchhhhccccEEcCceEEecceeee Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eeeeeecccceeeeccC Q lcl|Aclame:pro 320 WGAVIFRPFAVAQMIGV 336 (336) Q Consensus 320 ~Gv~ir~P~av~~~~GI 336 (336) |||+||||+||+|++|| T Consensus 323 ~Gv~i~~P~ai~~~~GI 339 (339) T protein:vir:94 323 FGAVIYQPWAVTQELGV 339 (339) T ss_pred eeEEEEccceeeeeecC Confidence 99999999999999999 No 6 >protein:vir:107732 Length: 379 # NCBI annotation: gp23 # Family: family:all:1653 # MgeID: mge:1520 # MgeName: BcepB1A # Cross-refs: genbank:acc:YP_024871;genbank:gi:48697513;genbank:GeneID:2948349 Probab=100.00 E-value=3.2e-117 Score=659.28 Aligned_cols=334 Identities=22% Similarity=0.329 Sum_probs=309.3 Q ss_pred CchHH--HHHHhhhcceeccchhhhccchhHHHHHhhhhhcc------cccccCcchHHHHHHHhhCceeeeeeccccch Q lcl|Aclame:pro 1 MRDAQ--RIQNLARAGVILPRSVQNVSTPLTEYAMDAADLSP------HLSSTGSSGIPNYLTTYVDPAVIDILVAPMKA 72 (336) Q Consensus 1 ~~~~~--~~~~l~~~g~~~~~~~~~~~~~~~~~a~da~d~~~------~l~t~~~~~i~~~l~~~idp~v~~~~~~~~~~ 72 (336) -.|+. ++++|+|+||+||+...+ ..+...+||||.|.+| +|++++|+|||||||+|+ |+++|++++||++ T Consensus 23 ~~~~~~~~~~~l~~~gi~~~~~~~~-~~~~~~~amd~~~~~~~~~~~~~l~~~~~~g~~~~l~~~~-p~~i~~~tap~~a 100 (379) T protein:vir:10 23 SADVTLDNLKHLESYGIHLNGRKNK-LFELMQFAMDSNDIGPIPTPLSPLSPVSIPGLIQFLQNWL-PGHVRILTAVREA 100 (379) T ss_pred cccccHHHHHHHHhcCccccchhhh-hhhhhhhhhccccccccccccCccccccccchHHHHHhhc-chHHHHHhhhhhh Confidence 23433 789999999999966433 3456677999999985 888999999999999999 9999999999999 Q ss_pred hhhcccccCCCcceeeEEEeeeecceeeEEeecccCCceeeeeeeeeeeeEEEEEEEEeeCHHHHHHHHhhCCCHHHHHH Q lcl|Aclame:pro 73 AELVGESKKGDWTTLVAAFITAEPTTKVATYGDYSSDGDSGANINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASELN 152 (336) Q Consensus 73 ~~l~~v~t~g~w~~~t~~~~~~e~~G~a~~ygd~~diP~~~~~~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~k~ 152 (336) +|||||+|+|+|++++++|+++|.+|+|++|||++|+|++|+++++++|++|++++||+||++|+++|+++|++|+++|+ T Consensus 101 ~~l~pv~t~g~W~~~~~~~~v~e~~G~A~~ygd~~d~pl~d~~~~~~~r~v~~~~~g~~yg~~El~~Aa~~g~~l~~~Ka 180 (379) T protein:vir:10 101 DEFLGLSTVGQWDDEQIVQRVLEGLGTAQPYTDGGNMALMSWTPTFETRTVVRFEAGLQVAPLEEARSSRVQVSSADEKR 180 (379) T ss_pred hhhcccccCCCceeeeEEEeeeeeeeeeEEeccccCCCeeeeeeeeeeeeeEEEEEEEeecHHHHHHHHHhCCChHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHhhcceEEee--ccccceEEEEecCCCCccccccc-----ccccccCHHHHHHHHHHHHHHHHHHhCCceec Q lcl|Aclame:pro 153 YSSALGLAKFLNGSYLFG--VAGLENYGLINDPSLSAPITATT-----PWSGSPAVEAVVNEVVALFQVLQTQSQGIITQ 225 (336) Q Consensus 153 ~aAr~a~e~~~n~~~~~G--d~~~g~~GllN~Pnl~~~~~~~t-----~w~~~~t~~eI~~Di~~l~~~l~~~s~g~v~~ 225 (336) .+||+++|+++|+++||| |+++++|||||||||++.+++++ +.|+++|+|||++||++++++++.||+|.+++ T Consensus 181 ~aA~~ale~~~N~i~f~G~~d~~~~~yGllNdP~l~a~~t~atg~~~~t~Wa~kT~~eI~~Di~~~~~~l~~qs~g~~~~ 260 (379) T protein:vir:10 181 AMVGEALEVQRNRVAFYGYNDGSGRTFGFLNDPNLPAYVAVPNGAGGSPLWAQKTTLEIIADLRNGLTALQVQSMGRIKS 260 (379) T ss_pred HHHHHHHHHhhceEEEEeecCCCcceEEEEeCCCCcccccccCCcccccccccCCHHHHHHHHHHHHHHHHHhhCCeecc Confidence 999999999999999999 67999999999999987766543 44778899999999999999999999998866 Q ss_pred -ccccEEEecHHHHHhcccCCCCCccHHHHHHHhCCccEEEEcccccCCCCc--eEEEEEEeecCCc-----eEEEEcCh Q lcl|Aclame:pro 226 -EDVLRMGLPPTAMSDLSKTNQYGLAAAAKLKDIFPKLEFVTIPEYDTASGR--LVQLWAPRVEGKD-----TATCGFTE 297 (336) Q Consensus 226 -~~p~tL~Lp~~~~~~L~~~~~~~~Tvl~~l~~n~pnl~i~~~pel~~a~G~--~~~~~~~~~~~~~-----~~~~~~p~ 297 (336) +.|++|+|||+++.+|+++|++|+||++|||+|||||+|+++|||++++|+ .++||++++++++ ++.|+||| T Consensus 261 ~~~~~tL~LP~~~~~~L~~~n~~g~Tvl~~lk~n~Pnl~i~t~pEL~~aggg~~~~~~~~~~~~~~~t~~~~~~~~~~p~ 340 (379) T protein:vir:10 261 NKTPITIGIPNAYENYITTPTELGYSVAQYMRESYPNVTFVSAPELNDANGGSSAIYYYADAVENNGTDDGRTWLQVVPT 340 (379) T ss_pred cccceeEEecHHHHHhhccccccCccHHHHHHHhcCCcEEEEcccccccCCCccEEEEEeeccCCCccCCcceEEEecch Confidence 579999999999999999999999999999999999999999999998764 6789988887654 46799999 Q ss_pred hhhcccceecCCceEEccccceeeeeeecccceeeeccC Q lcl|Aclame:pro 298 KMRAHSIERYSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) Q Consensus 298 ~~r~l~~~~~~~~~~vp~~~~t~Gv~ir~P~av~~~~GI 336 (336) |||+||+|++.++|+|||++|||||+||||+||+|++|- T Consensus 341 k~~~l~ve~~~~~~~~~~~~rt~Gv~ir~P~Ai~~~~G~ 379 (379) T protein:vir:10 341 KMFTLGVEKKIKGYAEGYTNATAGAMLKRPFATYRQTGA 379 (379) T ss_pred hhhhccceecCceeEeccccceeeeeeecchhhheecCC Confidence 999999999999999999999999999999999999999 No 7 >protein:vir:99576 Length: 388 # NCBI annotation: hypothetical protein # Family: family:all:1653 # MgeID: mge:1544 # MgeName: BcepF1 # Cross-refs: genbank:acc:YP_001039801;genbank:gi:126011051;genbank:GeneID:4818271 Probab=100.00 E-value=5.2e-115 Score=647.14 Aligned_cols=336 Identities=23% Similarity=0.324 Sum_probs=303.4 Q ss_pred Cc--------hHHHHHHhhhcceeccchhhhccchh---HHHHHhhhhhcc-cccccCcchHHHHHHHhhCceeeeeecc Q lcl|Aclame:pro 1 MR--------DAQRIQNLARAGVILPRSVQNVSTPL---TEYAMDAADLSP-HLSSTGSSGIPNYLTTYVDPAVIDILVA 68 (336) Q Consensus 1 ~~--------~~~~~~~l~~~g~~~~~~~~~~~~~~---~~~a~da~d~~~-~l~t~~~~~i~~~l~~~idp~v~~~~~~ 68 (336) |. |....++|+|+||+||++..++..+. ..++++|||+++ +..|++|.|||++|++|+||++||++++ T Consensus 21 ~~~~~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~a~da~~~~~~t~~~~gip~~~~~~~~p~~~~~~~~ 100 (388) T protein:vir:99 21 MANGKADYRLTDMAVRELKKFGLVFDHATVKRQIELLHEGGVATQAFDSAYVAPTTQASIPTPIQFLQQWLPGFVKVLTS 100 (388) T ss_pred hhcCCcceeeechhhHhhhhcceeccCccchhhhhhhhhhhhhhcccCcccccccccCcccHHHHHhhhhccceeeeeec Confidence 21 24557789999999999866654332 333444555443 2358899999999999999999999999 Q ss_pred ccchhhhcccccCCCcceeeEEEeeeecceeeEEeecccCCceeeeeeeeeeeeEEEEEEEEeeCHHHHHHHHhhCCCHH Q lcl|Aclame:pro 69 PMKAAELVGESKKGDWTTLVAAFITAEPTTKVATYGDYSSDGDSGANINYPQRQSYFFQTWTRWGERELEMAGAGRVDLA 148 (336) Q Consensus 69 ~~~~~~l~~v~t~g~w~~~t~~~~~~e~~G~a~~ygd~~diP~~~~~~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~ 148 (336) ||++++||||+|+|+|++++++|+++|.+|+|++|||++|+|++|+++++++|++|++++||+|+++|+++|+++|++|+ T Consensus 101 p~~~~~l~pv~t~g~W~~~~~~f~v~e~~G~A~~ygd~~D~Pl~d~~~~~~~r~v~~~~~g~~yg~~El~~A~~~g~~l~ 180 (388) T protein:vir:99 101 ARKIDEILGVKTVGSWEDQEIVQGIVEPAGTAMEYGDLTNIPLSSWNVNFERRTIVRGEMGIQVGLLEEGRASAMRINSA 180 (388) T ss_pred hhhhhhhccccccCCccceeEEEeeeecceeEEEeecccCCCceeccceeeeeeEEEEEeeeeecHHHHHHHHhhCCCcH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHhhcceEEeeccc---cceEEEEecCCCCccccccc----ccccccCHHHHHHHHHHHHHHHHHHhCC Q lcl|Aclame:pro 149 SELNYSSALGLAKFLNGSYLFGVAG---LENYGLINDPSLSAPITATT----PWSGSPAVEAVVNEVVALFQVLQTQSQG 221 (336) Q Consensus 149 ~~k~~aAr~a~e~~~n~~~~~Gd~~---~g~~GllN~Pnl~~~~~~~t----~w~~~~t~~eI~~Di~~l~~~l~~~s~g 221 (336) ++|+.+||+++|+++|+++|||+++ +++|||||||||++.+++++ +.|+++|+|||++||++++++|++||+| T Consensus 181 ~~Ka~AA~~ale~~~N~i~f~G~~g~~~~~~yGllNdP~l~a~v~at~~~~~~~Wa~kT~~eI~~Di~~~~~~i~~qs~g 260 (388) T protein:vir:99 181 EVKRQGAAVQLEIMRNAIGFYGWEGKNGNRTFGFLNDPSLLPAIASTTPGGWVSGGANAFQGIVGDLRLMLITLRVQSED 260 (388) T ss_pred HHHHHHHHHHHHhhhceEEEEeecCCCccceEEEeeCCCcccccccccCCcCcccccCCHHHHHHHHHHHHHHHHHhcCC Confidence 9999999999999999999999875 58999999999998776543 2377889999999999999999999999 Q ss_pred ceec-ccccEEEecHHHHHhcccCCCCCccHHHHHHHhCCccEEEEcccccCCC----CceEEEEEEeec--------CC Q lcl|Aclame:pro 222 IITQ-EDVLRMGLPPTAMSDLSKTNQYGLAAAAKLKDIFPKLEFVTIPEYDTAS----GRLVQLWAPRVE--------GK 288 (336) Q Consensus 222 ~v~~-~~p~tL~Lp~~~~~~L~~~~~~~~Tvl~~l~~n~pnl~i~~~pel~~a~----G~~~~~~~~~~~--------~~ 288 (336) .++. +.|+||+|||+++.+|+++|++|+||++|||+|||||+|+++|||++++ |.+++|++++++ ++ T Consensus 261 ~~~~~~~~~tL~LP~~~~~~Ls~~n~~g~Tvl~~lk~n~Pnl~i~t~pEl~~a~~tgg~~~~~~~~~~~~~~~~~~~~~~ 340 (388) T protein:vir:99 261 NIDPEDVDITLVLPMNKVDMLSVVTDLGISVRDWLKQTYPRVRVMSAPELQGGNPDDGKDIAYMFLDSVDTAVDGSTDGG 340 (388) T ss_pred eeeecccceEEEechHHHHhccccCcCCccHHHHHHHhcCCcEEEEecccccccccCCceeEEEEecccccccccCccCc Confidence 9886 4799999999999999999999999999999999999999999999763 467788888764 67 Q ss_pred ceEEEEcChhhhcccceecCCceEEccccceeeeeeecccceeeeccC Q lcl|Aclame:pro 289 DTATCGFTEKMRAHSIERYSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) Q Consensus 289 ~~~~~~~p~~~r~l~~~~~~~~~~vp~~~~t~Gv~ir~P~av~~~~GI 336 (336) +++.+++||||++||+|+++++|+|||++|||||+||||+||+|++|| T Consensus 341 ~t~~~~~p~~~~~l~vq~~~~~~~~~~~~rt~Gv~ir~P~Ai~~~~GI 388 (388) T protein:vir:99 341 DTWAQLVQSKFVTLGVEKRVKNYVEAYSNATAGVMLKRPWAVVRLIGL 388 (388) T ss_pred ceeEEecccccccccceecCceeEeccccceeeeEEeccchhheeccC Confidence 899999999999999999999999999999999999999999999999 No 8 >protein:vir:96079 Length: 382 # NCBI annotation: hypothetical protein ORF023 # Family: family:all:1653 # MgeID: mge:1597 # MgeName: F8 # Cross-refs: genbank:acc:YP_001294440;genbank:gi:149408337;genbank:GeneID:5237198 Probab=100.00 E-value=6.8e-111 Score=624.55 Aligned_cols=334 Identities=24% Similarity=0.354 Sum_probs=298.4 Q ss_pred CchH--HHHHHhhhcceeccchhhh--cc------chhHHHHHhhhhhcccccccCcchHHHHHHHhhCceeeeeecccc Q lcl|Aclame:pro 1 MRDA--QRIQNLARAGVILPRSVQN--VS------TPLTEYAMDAADLSPHLSSTGSSGIPNYLTTYVDPAVIDILVAPM 70 (336) Q Consensus 1 ~~~~--~~~~~l~~~g~~~~~~~~~--~~------~~~~~~a~da~d~~~~l~t~~~~~i~~~l~~~idp~v~~~~~~~~ 70 (336) +.++ .++++|+|+||+||++..+ +. .++..+||||++.+| +|.+|.|||++|++|+||++||++|+|| T Consensus 21 ~~~~~~~~~~~l~~~gi~~~~~~~~~~~~~~~~~~~~~~~~amDa~~~~~--~t~~~~g~p~~~l~~~~p~~~~~~~~p~ 98 (382) T protein:vir:96 21 LKNVTHEAVAALGRIGLVFDHAVVQDQIKALAKAGAFRSGSAMDSNFTAP--VTTPSIPTPIQFLQTWLPGFVKVMTAAR 98 (382) T ss_pred hhcccHHHHHHHhccccccCcccchhHhhhhhhhhhhhhhcccccccCCc--cccCCccHHHHHHhhhhhhhhhhhhhhh Confidence 3333 5689999999999997422 22 223346787765554 4889999999999999999999999999 Q ss_pred chhhhcccccCCCcceeeEEEeeeecceeeEEeecccCCceeeeeeeeeeeeEEEEEEEEeeCHHHHHHHHhhCCCHHHH Q lcl|Aclame:pro 71 KAAELVGESKKGDWTTLVAAFITAEPTTKVATYGDYSSDGDSGANINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASE 150 (336) Q Consensus 71 ~~~~l~~v~t~g~w~~~t~~~~~~e~~G~a~~ygd~~diP~~~~~~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~ 150 (336) ++++||||+|+|+|++++++|+++|.+|+|++|||++|+|++|+++++++|++|++++||+|+.+|+++|+++|++|.++ T Consensus 99 ~~~~l~pv~t~g~W~~~t~ty~~~e~~G~A~~ygd~~D~Pl~d~~~~~~~r~v~~~~~g~~yg~lE~~rAa~~~~~l~~~ 178 (382) T protein:vir:96 99 KIDEIIGIDTVGSWEDQEIVQGIVEPAGTAVEYGDHTNIPLTSWNANFERRTIVRGELGLLVGTLEEGRASAIRLNSAET 178 (382) T ss_pred hhhhhccccccCCccceEEEEeeeecccceEEeecccCCCccccccceeEEEEEEEEEeeeecHHHHHHHHhhCCCcHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHhhcceEEeec---cccceEEEEecCCCCcccccccccccccCHHHHHHHHHHHHHHHHHHhCCceecc- Q lcl|Aclame:pro 151 LNYSSALGLAKFLNGSYLFGV---AGLENYGLINDPSLSAPITATTPWSGSPAVEAVVNEVVALFQVLQTQSQGIITQE- 226 (336) Q Consensus 151 k~~aAr~a~e~~~n~~~~~Gd---~~~g~~GllN~Pnl~~~~~~~t~w~~~~t~~eI~~Di~~l~~~l~~~s~g~v~~~- 226 (336) |+.+||+++|+++|+++|||+ .++++||||||||||+.+++++++|+++|+|||++||++++++|++||+|.++.+ T Consensus 179 Ka~aA~~ale~~~N~i~f~G~~~g~~~~~yGllNdP~l~a~~t~a~~~Wa~kT~~eI~~Di~~l~~~i~~qt~G~~~~~~ 258 (382) T protein:vir:96 179 KRQQAAIGLEIFRNAIGFYGWQSGLGNRTYGFLNDPNLPPFQTPPSQGWATADWAGIIGDIREAVRQLRIQSQDQIDPKA 258 (382) T ss_pred HHHHHHHHHHHhhceEEEEeeecCcCcceEEEEeCCCcccccccCCCCcccccHHHHHHHHHHHHHHHHhccCCeeeecc Confidence 999999999999999999998 4589999999999999888888889999999999999999999999999998864 Q ss_pred cccEEEecHHHHHhcccCCCCCccHHHHHHHhCCccEEEEcccccCCC--C----ceEEEEEEeec----C----CceEE Q lcl|Aclame:pro 227 DVLRMGLPPTAMSDLSKTNQYGLAAAAKLKDIFPKLEFVTIPEYDTAS--G----RLVQLWAPRVE----G----KDTAT 292 (336) Q Consensus 227 ~p~tL~Lp~~~~~~L~~~~~~~~Tvl~~l~~n~pnl~i~~~pel~~a~--G----~~~~~~~~~~~----~----~~~~~ 292 (336) .|++|+|||+++.+|+++|++|+||++|||+|||||+|+++|||++++ | ...+++.++++ + +.++. T Consensus 259 ~~~~L~LP~~~~~~Ls~~n~~g~Tvl~~lk~n~Pnl~i~t~peL~~a~~~g~g~~~~~~~~~~e~~~~~~~s~~~p~~f~ 338 (382) T protein:vir:96 259 EKITMALATSKVDYLSVTTPYGISVSDWIEQTYPKMRIVSAPELSGVQMQGKTPEDALVLFVEEVDASVDGSTDGGSVFS 338 (382) T ss_pred cceEEeechHHHhhccccCccCccHHHHHHHhcCCcEEEEccccccccCCCccceeEEEEecchhhhhcccccccCccee Confidence 699999999999999999999999999999999999999999998762 2 23345555542 2 33444 Q ss_pred EEcChhhhcccceecCCceEEccccceeeeeeecccceeeeccC Q lcl|Aclame:pro 293 CGFTEKMRAHSIERYSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) Q Consensus 293 ~~~p~~~r~l~~~~~~~~~~vp~~~~t~Gv~ir~P~av~~~~GI 336 (336) +.+|.+++.||+|++.++|++||++|||||+||||+||+|++|| T Consensus 339 q~~p~~~~~l~ve~~~~~~~~~~s~~t~Gv~i~~P~ai~~~~GI 382 (382) T protein:vir:96 339 QLVQSKFITLGVEKRAKSYVEDFSNGTAGALCKRPWAVVRYLGI 382 (382) T ss_pred ccccceeeeccceeecceeEeccccceeeeEEEcchhhhhccCC Confidence 56677888899999999999999999999999999999999999 No 9 >protein:vir:79642 Length: 329 # NCBI annotation: HsbB # Family: family:all:463 # MgeID: mge:1872 # MgeName: TLS # Cross-refs: genbank:acc:YP_001285525;genbank:gi:148734508;genbank:GeneID:5220000 Probab=100.00 E-value=8.6e-92 Score=519.83 Aligned_cols=318 Identities=11% Similarity=0.093 Sum_probs=282.2 Q ss_pred hcceeccchhhhccchhHHHHHhhhhhcccccccCcchHHHHHH---HhhCceeeeeeccccchhhhcccccCCCcceee Q lcl|Aclame:pro 12 RAGVILPRSVQNVSTPLTEYAMDAADLSPHLSSTGSSGIPNYLT---TYVDPAVIDILVAPMKAAELVGESKKGDWTTLV 88 (336) Q Consensus 12 ~~g~~~~~~~~~~~~~~~~~a~da~d~~~~l~t~~~~~i~~~l~---~~idp~v~~~~~~~~~~~~l~~v~t~g~w~~~t 88 (336) -.|..+.. ++...-...+.-|+++++.+.+..+.++++|++ ++|||++||+++++++++++||++++++|++++ T Consensus 1 ~~~~~~~~---~~~~d~~~~~~~a~~~~~~~~~~~~~~~~~f~~~ql~~id~~v~e~~~~~l~~~~~i~i~~~~~~~~~~ 77 (329) T protein:vir:79 1 MRGNIMSK---EMKYDEFEANVIANHMQLRGAKNDASDMGIWTSQELHKIKAQAYEKEYPAGSALRVFPVTSELSDTDKT 77 (329) T ss_pred Cccchhhh---hhccchhhhhhHhhhcccccceeccchhhHHHHHHHHHHHHHHHhhhhcccchhhhcccccCCCCceeE Confidence 22333222 222111222344557788888888888999998 899999999999999999999999999999999 Q ss_pred EEEeeeecceeeEEeecc-cCCceeeeeeeeeeeeEEEEEEEEeeCHHHHHHHHhhCCCHHHHHHHHHHHHHHHhhcceE Q lcl|Aclame:pro 89 AAFITAEPTTKVATYGDY-SSDGDSGANINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASELNYSSALGLAKFLNGSY 167 (336) Q Consensus 89 ~~~~~~e~~G~a~~ygd~-~diP~~~~~~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~k~~aAr~a~e~~~n~~~ 167 (336) ++|+++|.+|++++|||+ +|+|++|++++++++++++++.+|+|+++|+++|+++|++|+++|+.+|++++++++|+++ T Consensus 78 ~t~~~~~~~G~a~~~~d~~~dip~vd~~~~~~~~~i~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aA~~~~~~~~n~i~ 157 (329) T protein:vir:79 78 FEYQTFDKVGHAKIIADYTDDLSTVDALMTSEFGKVFRLGNAFLISIDEIKAGQRTGKSLSTRKANAAQNAHDQLVNHLV 157 (329) T ss_pred EEeeeeecceeeeeecCcccccceeecccceeEEEEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhccEE Confidence 999999999999999997 5779999999999999999999999999999999999999999999999999999999999 Q ss_pred EeeccccceEEEEecCCCCcccccc--cccccccCHHHHHHHHHHHHHHHHHHhCCceecccccEEEecHHHHHhcccC- Q lcl|Aclame:pro 168 LFGVAGLENYGLINDPSLSAPITAT--TPWSGSPAVEAVVNEVVALFQVLQTQSQGIITQEDVLRMGLPPTAMSDLSKT- 244 (336) Q Consensus 168 ~~Gd~~~g~~GllN~Pnl~~~~~~~--t~w~~~~t~~eI~~Di~~l~~~l~~~s~g~v~~~~p~tL~Lp~~~~~~L~~~- 244 (336) |+|++++++|||||||++++..+++ ++.|+++|++||++||++++++++.+|+|. +.|++|+|||+++.+|+++ T Consensus 158 f~G~~~~g~~GLlN~p~v~~~~~~~~~~~~w~~kt~~ei~~di~~~~~~l~~~s~g~---~~p~~L~Lpp~~~~~L~~~~ 234 (329) T protein:vir:79 158 FKGSKPHKIISVFEHPNLTTINSAGWNNAAGTGKKPETAQDELEQAIEKIETLTNGQ---HRANMILIPPSMRKVLMVRM 234 (329) T ss_pred EeecccccceeeecCCCccccccCCCCCccccccCHHHHHHHHHHHHHHHHHhcCce---ecccEEEecHHHHHHhhccc Confidence 9999999999999999998655443 234777899999999999999999999985 6899999999999999764 Q ss_pred CCCCccHHHHHHHhCCccEEEEcccccCCCCc-eEEEEEEeecCCceEEEEcChhhhcccceecCCceEEccccceeeee Q lcl|Aclame:pro 245 NQYGLAAAAKLKDIFPKLEFVTIPEYDTASGR-LVQLWAPRVEGKDTATCGFTEKMRAHSIERYSSYFRQKKSAGTWGAV 323 (336) Q Consensus 245 ~~~~~Tvl~~l~~n~pnl~i~~~pel~~a~G~-~~~~~~~~~~~~~~~~~~~p~~~r~l~~~~~~~~~~vp~~~~t~Gv~ 323 (336) +.+|+|+++||++|||+|+|+++|||++++++ +..++++ .++++++++++||||++||+|+++++|+|||++|||||+ T Consensus 235 ~~~~~tvl~~lk~~~~~l~I~~~~el~~ag~~g~~~~v~y-~~~~~~~~~~vp~~~~~l~~q~~~~~~~v~~~~r~~Gv~ 313 (329) T protein:vir:79 235 PETTMSYLDYFKQQNGGITIESISELEDIDGAGTKAALVY-EKDPMNMSIEIPEAFNMLTAQPKDLHFKVPCTSKCTGLT 313 (329) T ss_pred CCCCccHHHHHHHhCCCcEEEEcccccccCCCCceEEEEE-ecCCceEEEecCcceeeeeceecCceEEEceeeeEEEEE Confidence 57899999999999999999999999999754 4445544 578999999999999999999999999999999999999 Q ss_pred eecccceeeeccC Q lcl|Aclame:pro 324 IFRPFAVAQMIGV 336 (336) Q Consensus 324 ir~P~av~~~~GI 336 (336) ||||+||+|++|| T Consensus 314 i~~P~ai~~~dGI 326 (329) T protein:vir:79 314 IYRPLTLVLIKGL 326 (329) T ss_pred EECcceeeeeeee Confidence 9999999999999 No 10 >protein:vir:104342 Length: 314 # NCBI annotation: hypothetical protein # Family: family:all:463 # MgeID: mge:1593 # MgeName: RTP # Cross-refs: genbank:acc:YP_398971;genbank:gi:81343955;genbank:GeneID:3778874 Probab=100.00 E-value=2.8e-90 Score=511.57 Aligned_cols=302 Identities=14% Similarity=0.158 Sum_probs=262.7 Q ss_pred cceeccchhhhccchhHHHHHhhhhhcccccccC---cchHHHHHH---HhhCceeeeeeccccchhhhcccccCCCcce Q lcl|Aclame:pro 13 AGVILPRSVQNVSTPLTEYAMDAADLSPHLSSTG---SSGIPNYLT---TYVDPAVIDILVAPMKAAELVGESKKGDWTT 86 (336) Q Consensus 13 ~g~~~~~~~~~~~~~~~~~a~da~d~~~~l~t~~---~~~i~~~l~---~~idp~v~~~~~~~~~~~~l~~v~t~g~w~~ 86 (336) .-++| ++|+++.+..+.+.. +-...+|++ ++|||+|||+++++++++++||++++++|++ T Consensus 1 ~~~~~--------------~~~~~~~~~~~~~~~~~~~d~~~~fl~~ql~~id~~v~e~~~~~~~~~~~i~v~~~~~~~~ 66 (314) T protein:vir:10 1 MAIKF--------------DAEQAKITTHLEQMGVEKADAAGIWAVSQLTAALNRAYEKEYAENSVVNIFPVTNEIPGHA 66 (314) T ss_pred Cccch--------------HHHHHHHHHHHHhhcccchhhhHHHHHHHHHHHHHHHhhhhccccccceeeccccCCCCce Confidence 11122 222222222221111 011113555 6999999999999999999999999999999 Q ss_pred eeEEEeeeecceeeEEeecc-cCCceeeeeeeeeeeeEEEEEEEEeeCHHHHHHHHhhCCCHHHHHHHHHHHHHHHhhcc Q lcl|Aclame:pro 87 LVAAFITAEPTTKVATYGDY-SSDGDSGANINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASELNYSSALGLAKFLNG 165 (336) Q Consensus 87 ~t~~~~~~e~~G~a~~ygd~-~diP~~~~~~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~k~~aAr~a~e~~~n~ 165 (336) ++++|+++|.+|++++|||+ +|+|++|++++++++++++++.+|+|+++|+++|+++|++|+++|+.+|++++++++|+ T Consensus 67 et~~~~~~e~~G~a~~~~d~~~dip~vd~~~~~~~~~i~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aA~~~~~~~~n~ 146 (314) T protein:vir:10 67 KYFEYPEFDGVGIAQIIADYSDDLPLVDAFMTEKQGKVFRFGNAFLISTDEIKAGAATGQSLSARKQALAFEAHDNLLDK 146 (314) T ss_pred eEEEeeeeccccceeeeCCcccccceeecccceeEEEEEEEEeeEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhce Confidence 99999999999999999997 46799999999999999999999999999999999999999999999999999999999 Q ss_pred eEEeeccccceEEEEecCCCCcccccccccccccCHHHHHHHHHHHHHHHHHHhCCceecccccEEEecHHHHHhcccCC Q lcl|Aclame:pro 166 SYLFGVAGLENYGLINDPSLSAPITATTPWSGSPAVEAVVNEVVALFQVLQTQSQGIITQEDVLRMGLPPTAMSDLSKTN 245 (336) Q Consensus 166 ~~~~Gd~~~g~~GllN~Pnl~~~~~~~t~w~~~~t~~eI~~Di~~l~~~l~~~s~g~v~~~~p~tL~Lp~~~~~~L~~~~ 245 (336) ++|+|+++++++|||||||++. .+++++| +|++||++||+++++++++||+|. |.|++|+|||+++.+|++++ T Consensus 147 i~f~G~~~~g~~GLlN~p~v~~-~~~~~~W---aT~~ei~~Di~~~~~~l~~~s~g~---~~p~~l~Lpp~~~~~L~~~~ 219 (314) T protein:vir:10 147 LVWSGSAPHGIVSVFDQPNINN-VVATPNW---SVPQNAIDDVTAMIDAVESSTQGL---HHVTDILLPASARRVMQGLV 219 (314) T ss_pred EEEeecccccceeEeecCCCcc-ccCCCCc---ccHHHHHHHHHHHHHHHHHhcCcc---ccceeEEecHHHHHhhcccc Confidence 9999999999999999999975 3445566 479999999999999999999985 68999999999999998775 Q ss_pred C-CCccHHHHHHHhCCccEEEEcccccCCCCce-EEEEEEeecCCceEEEEcChhhhcccceecCCceEEccccceeeee Q lcl|Aclame:pro 246 Q-YGLAAAAKLKDIFPKLEFVTIPEYDTASGRL-VQLWAPRVEGKDTATCGFTEKMRAHSIERYSSYFRQKKSAGTWGAV 323 (336) Q Consensus 246 ~-~~~Tvl~~l~~n~pnl~i~~~pel~~a~G~~-~~~~~~~~~~~~~~~~~~p~~~r~l~~~~~~~~~~vp~~~~t~Gv~ 323 (336) . +|+|+++||++|||||+|+++|||++++|+. .+|++ ..++++++++++||||++||+|+++++|++||++|||||+ T Consensus 220 ~~~~~tvl~~l~~n~~~l~I~~~~el~~ag~~g~~~~v~-y~~~~~~~~~~vp~~~~~l~~e~~~~~~~~~~~~r~~Gv~ 298 (314) T protein:vir:10 220 PQTNLSYGELFTRNNPGLTIRFLQFLDNYDGAGGKAALA-FEKSPLNMSIEIPEVTNVLPAQPKDLHFRYPVTSKATGLI 298 (314) T ss_pred cCCCccHHHHHHHhCCCcEEEEcccccccCCCcceEEEE-EecCCcEEEEecCccceeecceecCceEEEcceeeeEEEE Confidence 4 6999999999999999999999999997644 44444 4578999999999999999999999999999999999999 Q ss_pred eecccceeeeccC Q lcl|Aclame:pro 324 IFRPFAVAQMIGV 336 (336) Q Consensus 324 ir~P~av~~~~GI 336 (336) ||||+||+|++|| T Consensus 299 i~~P~ai~~~dGI 311 (314) T protein:vir:10 299 VYRPLTMAVIKGI 311 (314) T ss_pred EECcceeEeeeee Confidence 9999999999999 No 11 >protein:vir:107687 Length: 319 # NCBI annotation: hypothetical protein # Family: family:all:463 # MgeID: mge:1518 # MgeName: T1 # Cross-refs: genbank:acc:YP_003898;genbank:gi:45686314;genbank:GeneID:2773027 Probab=100.00 E-value=1.1e-87 Score=497.27 Aligned_cols=315 Identities=12% Similarity=0.085 Sum_probs=268.9 Q ss_pred CchHHH-HHHhhhcceeccchhhhccchhHHHHHhhhhhcccccccCcchHHHHHHHhhCceeeeeeccccchhhhcccc Q lcl|Aclame:pro 1 MRDAQR-IQNLARAGVILPRSVQNVSTPLTEYAMDAADLSPHLSSTGSSGIPNYLTTYVDPAVIDILVAPMKAAELVGES 79 (336) Q Consensus 1 ~~~~~~-~~~l~~~g~~~~~~~~~~~~~~~~~a~da~d~~~~l~t~~~~~i~~~l~~~idp~v~~~~~~~~~~~~l~~v~ 79 (336) |.+-+. .++++ .| ......|.++. |++ .+-+.+.+...++|||++||+++++++++++||+. T Consensus 1 ~~~~~~~~~~~~--~~---------~~~~~~~~~~~-da~-----~~~g~~~~~ql~~id~~v~e~~~~~l~~~~~i~v~ 63 (319) T protein:vir:10 1 MTTKKFDEADKS--NV---------EMYLIQAGVKQ-DAA-----ATMGIWTAQELHRIKSQSYEEDYPVGSALRVFPVT 63 (319) T ss_pred CCCcchhHHhhH--HH---------HHHHhhccchh-hhh-----hhhhhHHHHHHHHHHHHHHhhhhcceechhhcccc Confidence 443111 01111 00 00111111221 111 11122445566799999999999999999999999 Q ss_pred cCCCcceeeEEEeeeecceeeEEeeccc-CCceeeeeeeeeeeeEEEEEEEEeeCHHHHHHHHhhCCCHHHHHHHHHHHH Q lcl|Aclame:pro 80 KKGDWTTLVAAFITAEPTTKVATYGDYS-SDGDSGANINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASELNYSSALG 158 (336) Q Consensus 80 t~g~w~~~t~~~~~~e~~G~a~~ygd~~-diP~~~~~~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~k~~aAr~a 158 (336) ++++|++++++|.++|.+|++++|||++ |+|++|++.+++++++++++.+|+|+++|+++|+++|++|+++|+.+|+++ T Consensus 64 ~~~~~~~~~~~~~~~~~~G~a~~~~d~~~dip~v~~~~~~~~~~i~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aA~~~ 143 (319) T protein:vir:10 64 TELSPTDKTFEYMTFDKVGTAQIIADYTDDLPLVDALGTSEFGKVFRLGNAYLISIDEIKAGQATGRPLSTRKASACQLA 143 (319) T ss_pred cCCCCceEEEEeeeeccccceeeecCccccccceeccceeeEEEEEEEEeeeeecHHHHHHHHHhCCChHHHHHHHHHHH Confidence 9999999999999999999999999975 579999999999999999999999999999999999999999999999999 Q ss_pred HHHhhcceEEeeccccceEEEEecCCCCcccccccccccccCHHHHHHHHHHHHHHHHHHhCCceecccccEEEecHHHH Q lcl|Aclame:pro 159 LAKFLNGSYLFGVAGLENYGLINDPSLSAPITATTPWSGSPAVEAVVNEVVALFQVLQTQSQGIITQEDVLRMGLPPTAM 238 (336) Q Consensus 159 ~e~~~n~~~~~Gd~~~g~~GllN~Pnl~~~~~~~t~w~~~~t~~eI~~Di~~l~~~l~~~s~g~v~~~~p~tL~Lp~~~~ 238 (336) +++++|+++|+|++++|++||||||+++....+.+.+|+++|+|||++||++++++++++|+|. +.|++|+|||+++ T Consensus 144 ~~~~~n~i~f~G~~~~g~~GLlN~p~~~~~~~~~~~~~~t~t~~~i~~di~~~~~~l~~~s~g~---~~p~~L~L~p~~~ 220 (319) T protein:vir:10 144 HDQLVNRLVFKGSAPHKIVSVFNHPNITKITSGKWIDVSTMKPETAEAELTQAIETIETITRGQ---HRATNILIPPSMR 220 (319) T ss_pred HHHhhceEEEeecccccceeEEeCCCceeeecCCCCCccccCHHHHHHHHHHHHHHHHHhcCce---eeceEEEecHHHH Confidence 9999999999999999999999999998765555555778899999999999999999999986 5799999999999 Q ss_pred HhcccC-CCCCccHHHHHHHhCCccEEEEcccccCCCCc-eEEEEEEeecCCceEEEEcChhhhcccceecCCceEEccc Q lcl|Aclame:pro 239 SDLSKT-NQYGLAAAAKLKDIFPKLEFVTIPEYDTASGR-LVQLWAPRVEGKDTATCGFTEKMRAHSIERYSSYFRQKKS 316 (336) Q Consensus 239 ~~L~~~-~~~~~Tvl~~l~~n~pnl~i~~~pel~~a~G~-~~~~~~~~~~~~~~~~~~~p~~~r~l~~~~~~~~~~vp~~ 316 (336) .+|+++ +.+|+|+++|||+||||++|+++|||++++|+ +..|+++ .++++++++++||||++||+|+++++|++||+ T Consensus 221 ~~L~~~~~~~~~t~l~~lk~~~~~l~I~~~pel~~ag~~g~~~~v~y-~~~~~~~~~~v~~~~~~~~~e~~~l~~~~~~~ 299 (319) T protein:vir:10 221 KVLAIRMPETTMSYLDYFKSQNSGIEIDSIAELEDIDGAGTKGVLVY-EKNPMNMSIEIPEAFNMLPAQPKDLHFKVPCT 299 (319) T ss_pred HhhhcccCCCCeeHHHHHHHhcCCceEEEeeeecccCCCcceEEEEE-ecCCceEEEecCcceeeeeeeecCceEEEeee Confidence 999854 67899999999999999999999999998764 4455554 56799999999999999999999999999999 Q ss_pred cceeeeeeecccceeeeccC Q lcl|Aclame:pro 317 AGTWGAVIFRPFAVAQMIGV 336 (336) Q Consensus 317 ~~t~Gv~ir~P~av~~~~GI 336 (336) +|||||+||||+||+|++|| T Consensus 300 ~r~~Gv~i~~P~ai~~~dGI 319 (319) T protein:vir:10 300 SKCTGLTIYRPMTIVLITGV 319 (319) T ss_pred eeeEEEEEEccceeEeeecC Confidence 99999999999999999999 No 12 >protein:vir:80068 Length: 301 # NCBI annotation: gp8 # Family: family:all:463 # MgeID: mge:1876 # MgeName: B054 # Cross-refs: genbank:acc:YP_001468712;genbank:gi:157325292;genbank:GeneID:5601759 Probab=100.00 E-value=2.3e-87 Score=495.59 Aligned_cols=291 Identities=12% Similarity=0.056 Sum_probs=272.3 Q ss_pred ccccCcchHHHHHHHhhCceeeeeeccccchhhhcccccCCCcceeeEEEeeeecceeeEEeecc-cCCceeeeeeeeee Q lcl|Aclame:pro 42 LSSTGSSGIPNYLTTYVDPAVIDILVAPMKAAELVGESKKGDWTTLVAAFITAEPTTKVATYGDY-SSDGDSGANINYPQ 120 (336) Q Consensus 42 l~t~~~~~i~~~l~~~idp~v~~~~~~~~~~~~l~~v~t~g~w~~~t~~~~~~e~~G~a~~ygd~-~diP~~~~~~~~~~ 120 (336) |++.+++.+++.+.++|||++||++++++++++|||++++++|++++++|+++|.+|++++|||+ +|+|++++++++++ T Consensus 1 ~~~~~~g~f~~~~l~~id~~v~e~~~~~l~~r~l~~v~~~~~~~~~~~~~~~~~~~G~~~~~~~~~~dip~~~~~~~~~~ 80 (301) T protein:vir:80 1 MQGKITATIEARDLQAIDNVIYEPKQEELTARSVFPQKFDVNEGAESYSFDVMTRSGAAKIIANGADDLPLVDVDMVRKS 80 (301) T ss_pred CCccccchhhHHHHHHHHHHHHHhhhhhhhhhhhcccccCCCCceEEEEEeeeccceeEEEecCcccccccccccceeEE Confidence 88999999999999999999999999999999999999999999999999999999999999997 46799999999999 Q ss_pred eeEEEEEEEEeeCHHHHHHHHhhCCCHHHHHHHHHHHHHHHhhcceEEeeccccceEEEEecCCCCccccccc-----cc Q lcl|Aclame:pro 121 RQSYFFQTWTRWGERELEMAGAGRVDLASELNYSSALGLAKFLNGSYLFGVAGLENYGLINDPSLSAPITATT-----PW 195 (336) Q Consensus 121 ~~v~~~~~~~~y~~~El~~A~~~g~~l~~~k~~aAr~a~e~~~n~~~~~Gd~~~g~~GllN~Pnl~~~~~~~t-----~w 195 (336) +++++++.+|+|+++||++|+++|++|+++|+.+|++++++++|+++|+|++++++|||||||++++..++.+ +. T Consensus 81 ~~i~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aa~~~~~~~~n~~~f~G~~~~g~~GLlN~p~~~~~~~~~~~~~~~~~ 160 (301) T protein:vir:80 81 VPIYSIGIGLSYTIQDLRAARMQGTTVDAAKATTVRRAIAEKENSIAFRGEKKYAIKGAFEATGIQIDVSPTTGVGNVSK 160 (301) T ss_pred EEEEEEEeeeeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEeeecccccceeeecCCCcccccccCcccccccc Confidence 9999999999999999999999999999999999999999999999999999999999999999987665432 34 Q ss_pred ccccCHHHHHHHHHHHHHHHHHHhCCceecccccEEEecHHHHHhcccC---CCCCccHHHHHHHhCCccEEEEcccccC Q lcl|Aclame:pro 196 SGSPAVEAVVNEVVALFQVLQTQSQGIITQEDVLRMGLPPTAMSDLSKT---NQYGLAAAAKLKDIFPKLEFVTIPEYDT 272 (336) Q Consensus 196 ~~~~t~~eI~~Di~~l~~~l~~~s~g~v~~~~p~tL~Lp~~~~~~L~~~---~~~~~Tvl~~l~~n~pnl~i~~~pel~~ 272 (336) |+++|+|||++||++++++++.+|+|. +.|++|+|||+++.+|+++ +.+|+|+++||++|||+++|+++|||++ T Consensus 161 w~~~t~~ei~~di~~~~~~l~~~s~g~---~~p~~L~L~p~~~~~L~~~~~~~~~~~tvl~~l~~~~~~~~I~~~p~L~~ 237 (301) T protein:vir:80 161 WEKKTAEQIIDEIGEAHTKITVLPGYG---TASLKLCLPPKQFELINKKRYSNEDSRSVLKVLQDNAWFSAIVRVPDLAG 237 (301) T ss_pred cccCCHHHHHHHHHHHHHHHHHhcCce---ecccEEEecHHHHHhhhhccccCCCCeeHHHHHHHHcCcceEEEcceecc Confidence 677899999999999999999999986 5799999999999999865 5679999999999999999999999999 Q ss_pred CCC-ceEEEEEEeecCCceEEEEcChhhhcccceecCCceEEccccceeeeeeecccceeeeccC Q lcl|Aclame:pro 273 ASG-RLVQLWAPRVEGKDTATCGFTEKMRAHSIERYSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) Q Consensus 273 a~G-~~~~~~~~~~~~~~~~~~~~p~~~r~l~~~~~~~~~~vp~~~~t~Gv~ir~P~av~~~~GI 336 (336) +++ ++..++++ .++++++++++||||++||+|+++++|+|||++|||||+||||.||+|++|| T Consensus 238 ~g~~g~~~~v~~-~~~~d~~~~~v~~~~~~~~~e~~~~~~~~~~~~r~~Gv~i~~P~ai~~~~GI 301 (301) T protein:vir:80 238 MGTAGSDSFAVI-HDSNETAELIIPMDITRHPEEYSFPRTKVPFEERTAGVVVRFPAAIVRVDGI 301 (301) T ss_pred CCCCcccEEEEE-ecCCcEEEEEecCceeeecceecCceeEeeeeeeeEEEEEEccceEEEEecC Confidence 864 44444444 5689999999999999999999999999999999999999999999999999 No 13 >protein:vir:103285 Length: 296 # NCBI annotation: hypothetical protein # Family: family:all:463 # MgeID: mge:1605 # MgeName: JK06 # Cross-refs: genbank:acc:YP_277465;genbank:gi:71834107;genbank:GeneID:3562396 Probab=100.00 E-value=4.2e-85 Score=483.18 Aligned_cols=290 Identities=13% Similarity=0.113 Sum_probs=264.2 Q ss_pred HHHhhhhhcccccccCcchHHHHHHHhhCceeeeeeccccchhhhcccccCCCcceeeEEEeeeecceeeEEeeccc-CC Q lcl|Aclame:pro 31 YAMDAADLSPHLSSTGSSGIPNYLTTYVDPAVIDILVAPMKAAELVGESKKGDWTTLVAAFITAEPTTKVATYGDYS-SD 109 (336) Q Consensus 31 ~a~da~d~~~~l~t~~~~~i~~~l~~~idp~v~~~~~~~~~~~~l~~v~t~g~w~~~t~~~~~~e~~G~a~~ygd~~-di 109 (336) |-+|.+|.++.+ -+.-.++|||++||+++++++++++||++++++|++++++|+++|.+|++++|||++ |+ T Consensus 1 ~~~~~a~~~~~f--------~~~ql~~id~~v~e~~~~~l~~~~~i~v~~~~~~~~~~~~~~~~~~~G~a~~~~~~~~di 72 (296) T protein:vir:10 1 MGVDKADAAGIW--------TVKQLTASLNKAYETEYDQNSVVNLFPVSNEIPGYAKYFEYPVFDGVGIAQIVADYTDDL 72 (296) T ss_pred CcccchhhhHHH--------HHHHHHHHHHHHHhhhhcccccceecccccCCCCceeEEEeeeeeccCceeEeCCCcccc Confidence 788866654332 233447999999999999999999999999999999999999999999999999985 67 Q ss_pred ceeeeeeeeeeeeEEEEEEEEeeCHHHHHHHHhhCCCHHHHHHHHHHHHHHHhhcceEEeeccccceEEEEecCCCCccc Q lcl|Aclame:pro 110 GDSGANINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASELNYSSALGLAKFLNGSYLFGVAGLENYGLINDPSLSAPI 189 (336) Q Consensus 110 P~~~~~~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~k~~aAr~a~e~~~n~~~~~Gd~~~g~~GllN~Pnl~~~~ 189 (336) |++|++.+++++++++++.+|+|+++||++|++.|++|+++|+.+|++++++++|+++|+|++++|++||||||+++.. T Consensus 73 p~v~~~~~~~~~~i~~~~~~~~~~~~El~~a~~~g~~l~~~ka~aA~~~~~~~~n~~~f~G~~~~g~~GLlN~p~v~~~- 151 (296) T protein:vir:10 73 PLVDALATERQGKVFRFGNAFLISIDEIKVGQATGQSLSTRKQSLAFEAHDKLLDKLVWSGSTAHGIPSVFDYPNINNV- 151 (296) T ss_pred ceeeccceeEEEEEEEEEeeeeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEeecccccceeEeecCCCccc- Confidence 9999999999999999999999999999999999999999999999999999999999999999999999999999753 Q ss_pred ccccccccccCHHHHHHHHHHHHHHHHHHhCCceecccccEEEecHHHHHhcccC-CCCCccHHHHHHHhCCccEEEEcc Q lcl|Aclame:pro 190 TATTPWSGSPAVEAVVNEVVALFQVLQTQSQGIITQEDVLRMGLPPTAMSDLSKT-NQYGLAAAAKLKDIFPKLEFVTIP 268 (336) Q Consensus 190 ~~~t~w~~~~t~~eI~~Di~~l~~~l~~~s~g~v~~~~p~tL~Lp~~~~~~L~~~-~~~~~Tvl~~l~~n~pnl~i~~~p 268 (336) +++++|. ++.+|++||++++++++.+|+|. +.|++|+|||+++.+|+++ +.+|+|+++||++||||++|+++| T Consensus 152 ~~~~~W~---~~t~i~~Di~~~~~~l~~~s~g~---~~p~~l~L~p~~~~~L~~~~~~~~~t~l~~ik~~~~~l~i~~~~ 225 (296) T protein:vir:10 152 VSGGSWS---QPTTAVSDITSLLDIIETSTNGQ---HRATHLLLPTTARRIMQNLVPGTSVSYGEFFRQNNSGVTVEFVQ 225 (296) T ss_pred cccCCcc---CHHHHHHHHHHHHHHHHHhhCce---ecceeEEeCHHHHHHHhhccCCCCccHHHHHHHhcCCceEEEee Confidence 3345562 35699999999999999999986 6799999999999999865 678999999999999999999999 Q ss_pred cccCCCC-ceEEEEEEeecCCceEEEEcChhhhcccceecCCceEEccccceeeeeeecccceeeeccC Q lcl|Aclame:pro 269 EYDTASG-RLVQLWAPRVEGKDTATCGFTEKMRAHSIERYSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) Q Consensus 269 el~~a~G-~~~~~~~~~~~~~~~~~~~~p~~~r~l~~~~~~~~~~vp~~~~t~Gv~ir~P~av~~~~GI 336 (336) ||++++| ++..|++++ ++++++++++||||++||+|+++++|++||++|||||+||||.||++++|| T Consensus 226 ~l~~a~~~g~~~~v~~~-~~~~~~~~~v~~~~~~~~~e~~~l~~~~~~~~~~~Gv~i~~P~ai~~~dGI 293 (296) T protein:vir:10 226 YLNDYNGTGTSAAIAYE-KDPNNMAIEIPEATNALPAQPKDLHFKIPVTSKATGLIVYRPLTMAVMKGI 293 (296) T ss_pred eeccCCCCcceEEEEEE-cCCceEEEEcCcceeeecccccCceEEEeeEeeEEEEEEECCceeEEEeee Confidence 9999876 455666654 689999999999999999999999999999999999999999999999999 No 14 >protein:vir:5255 Length: 304 # NCBI annotation: hypothetical protein # Family: family:all:463 # MgeID: mge:117 # MgeName: Aaphi23 # Cross-refs: genbank:acc:NP_852760;genbank:gi:31544035;uniprot:Q7Y5U0;genbank:GeneID:2753552 Probab=100.00 E-value=1.7e-82 Score=468.86 Aligned_cols=284 Identities=12% Similarity=0.061 Sum_probs=255.0 Q ss_pred HhhhhhcccccccCcchHHHHHH---HhhCceeeeeeccccchhhhcccccCCCcceeeEEEeeeecceeeE--Eeecc- Q lcl|Aclame:pro 33 MDAADLSPHLSSTGSSGIPNYLT---TYVDPAVIDILVAPMKAAELVGESKKGDWTTLVAAFITAEPTTKVA--TYGDY- 106 (336) Q Consensus 33 ~da~d~~~~l~t~~~~~i~~~l~---~~idp~v~~~~~~~~~~~~l~~v~t~g~w~~~t~~~~~~e~~G~a~--~ygd~- 106 (336) |.+ -+||. ++||++|||.+|+++++.+|||++++++|++++++|+++|.+|+++ +++++ T Consensus 1 ~~~---------------lafl~~qL~~id~~vye~~~~~~~~~~lipv~t~~~~~~~~~~~~~~d~~G~a~~~~i~~~a 65 (304) T protein:vir:52 1 MSL---------------LAYVKNGLTAVSKDIAETKYPEIVFPQFVYVDQQTAVGITEKLHYGADEHGSLDDGLITVGT 65 (304) T ss_pred Cch---------------HHHHHHHHHHHhhhhhccccccchhhhhccccCCCCcccceEEEeeeeccCcccccccCCcC Confidence 211 13444 6899999999999999999999999999999999999999999999 55765 Q ss_pred cCCceeeeeeeeeeeeEEEEEEEEeeCHHHHHHHHhhCCCHHHHHHHHHHHHHHHhhcceEEeeccc-cceEEEEecCCC Q lcl|Aclame:pro 107 SSDGDSGANINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASELNYSSALGLAKFLNGSYLFGVAG-LENYGLINDPSL 185 (336) Q Consensus 107 ~diP~~~~~~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~k~~aAr~a~e~~~n~~~~~Gd~~-~g~~GllN~Pnl 185 (336) +|+|++|++++++++++++++.+|+|+++||++|+++|++|+++|+++||+++++++|+++|+|+++ +|++|||||||+ T Consensus 66 ~dip~vd~~~~~~~~~i~~~~~~~~y~~~El~~a~~~g~~l~~~ka~aa~~a~~~~~n~v~~~Gd~~~~g~~GllN~p~v 145 (304) T protein:vir:52 66 STLDQVEVGFTPTRSYIVPWAKSVTWTKPELEQGKLLGLALNTAKIMALNKNAQQTLQKVAFLGHAKDSRLTGLLNNKSV 145 (304) T ss_pred CccceeecccceeEEEEEEEeeeeeecHHHHHHHHHhCCCcHHHHHHHHHHHHHhhhceEEEEeeccccceEEEEeCCCc Confidence 7889999999999999999999999999999999999999999999999999999999999999985 789999999999 Q ss_pred Ccccccc---cccccccCHHHHHHHHHHHHHHHHHHhCCceecccccEEEecHHHHHhcccC--CCCCccHHHHHHHhCC Q lcl|Aclame:pro 186 SAPITAT---TPWSGSPAVEAVVNEVVALFQVLQTQSQGIITQEDVLRMGLPPTAMSDLSKT--NQYGLAAAAKLKDIFP 260 (336) Q Consensus 186 ~~~~~~~---t~w~~~~t~~eI~~Di~~l~~~l~~~s~g~v~~~~p~tL~Lp~~~~~~L~~~--~~~~~Tvl~~l~~n~p 260 (336) +...+++ ++.|.++|+|||++||++++++++.+|+|. |.|+||+|||+++.+|+.+ +++|+|+++||++||| T Consensus 146 ~~~~~~~~~a~~~w~~~T~~eI~~di~~~~~~i~~~s~~~---~~p~tl~Lpp~~~~~l~~~~~~~~~~Tvl~~l~~n~~ 222 (304) T protein:vir:52 146 EVYAIKGAAQNTKVQAMDFDKAVAFFKEIFLKGMEKTKRI---EAPNTFAIDSLDLAHLALVQRANTDTTALEFLTKHLS 222 (304) T ss_pred ceeeecCCccCCccccCCHHHHHHHHHHHHHHHHhccCce---ecCceEEeCHHHHHHHhhccCCCCCchHHHHHHHhcc Confidence 8654432 233678899999999999999999999975 7899999999999999753 5689999999999988 Q ss_pred -----ccEEEEccc-ccCC-CCceEEEEEEeecCCceEEEEcChhhhcccceecCC-ceEEccccceeeeeeecccceee Q lcl|Aclame:pro 261 -----KLEFVTIPE-YDTA-SGRLVQLWAPRVEGKDTATCGFTEKMRAHSIERYSS-YFRQKKSAGTWGAVIFRPFAVAQ 332 (336) Q Consensus 261 -----nl~i~~~pe-l~~a-~G~~~~~~~~~~~~~~~~~~~~p~~~r~l~~~~~~~-~~~vp~~~~t~Gv~ir~P~av~~ 332 (336) +|+|+.+|+ +.++ +|++.+|++++ +++++.++++||||++||+|++++ .|++||++|||||+||||++++| T Consensus 223 ~~~g~~l~I~~v~~~~~~~g~~g~~r~vvY~-~d~~~~~~~vP~p~~~l~~q~~~~~~~~vp~~~r~gGv~v~~P~a~~y 301 (304) T protein:vir:52 223 AAAGRQVAIKALPSNYGTRVTDGKTRAMVYV-NSKEHVIFDVPMSPTVLDAQPKGLLAFESGLRMAFGGVTFMEPDSALY 301 (304) T ss_pred cccCCcceEEEecccccccCCCCceEEEEEe-cChhheEEecCccccccchhhcCCceEEecceeeeeeEEEEccceeee Confidence 678999984 5544 46778888875 568999999999999999999987 79999999999999999999999 Q ss_pred ecc Q lcl|Aclame:pro 333 MIG 335 (336) Q Consensus 333 ~~G 335 (336) .|= T Consensus 302 ~D~ 304 (304) T protein:vir:52 302 VDY 304 (304) T ss_pred ecC Confidence 999 No 15 >protein:vir:105778 Length: 358 # NCBI annotation: gp9 # Family: family:all:10995 # MgeID: mge:1501 # MgeName: ES18 # Cross-refs: genbank:acc:YP_224147;genbank:gi:62362222;genbank:GeneID:3342531 Probab=98.80 E-value=2.3e-10 Score=73.43 Aligned_cols=315 Identities=11% Similarity=0.017 Sum_probs=181.0 Q ss_pred CchHHHHHHhhhcceeccchhhhccchhHHHHHhhhhhccccccc-CcchHHHHHHHhhCceeeeeeccc---cchhhhc Q lcl|Aclame:pro 1 MRDAQRIQNLARAGVILPRSVQNVSTPLTEYAMDAADLSPHLSST-GSSGIPNYLTTYVDPAVIDILVAP---MKAAELV 76 (336) Q Consensus 1 ~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~a~da~d~~~~l~t~-~~~~i~~~l~~~idp~v~~~~~~~---~~~~~l~ 76 (336) |+.....++|.---..|.. ...+.||...+...|-+-+. +-+++|+-+-.-+|.++.++.-+. --..+|. T Consensus 12 ~~~~~qw~~L~~~Rna~n~------~~~a~maan~a~~~~~~~~~NAv~~v~~D~wr~~D~~~~q~fr~e~~~~l~NDLm 85 (358) T protein:vir:10 12 SRLGGHWNELWANRNMWNA------QHDAMIAANRSNMTPEWLAVNAVGGFTRDFWAEIDRQVLQLRDQEVGMEIVNDLI 85 (358) T ss_pred HHHHHHHHHHHHHHHHhhh------hhhhHHhhhHHHhhhhhheecccccCCHHHHHHHhhhhhhhcccchhHHHHhhhh Confidence 5555555554310001100 11123444444444443322 235667766667888887776664 3356778 Q ss_pred ccccCCCcceeeEEEeeeec-ceeeEEe--ecc-cCCceeeeeeeeeeeeEEEEEEEEeeCHHHHHHHHhhCCCHHHHHH Q lcl|Aclame:pro 77 GESKKGDWTTLVAAFITAEP-TTKVATY--GDY-SSDGDSGANINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASELN 152 (336) Q Consensus 77 ~v~t~g~w~~~t~~~~~~e~-~G~a~~y--gd~-~diP~~~~~~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~k~ 152 (336) |+++.-.=......|++.-. .|++... |.. ...--+..+..=...+| +-.||+.+.+|+.-.+-.|+++..+-+ T Consensus 86 ~ls~sv~Igktv~~y~~~gd~~~~v~~SmsGQ~~~~lD~~~y~~dGtpiPI--fdsg~~f~WR~~~~~~~~g~d~~~daQ 163 (358) T protein:vir:10 86 GVQTVLPVGKTAKLYNVIGDIADDVSVSIDGQAPFSFDHTEYASDGDPIPV--FTAGYGVNWRHAAGLNSLGIDLVLDSQ 163 (358) T ss_pred hccccccHHHHHHHHhhhcCCCceEEEEecccCcccccceeeeccCCEeee--eccCccccccchhhcCccccchhHHHH Confidence 88776554444445555443 6655432 432 12222332233333334 445666666888888899999999999 Q ss_pred HHHHHHHHHhhcceEEeecc-----ccceEEEEecCCCCccccccc-c----cccccCHHHHHHHH-HHHHHHHHHHhCC Q lcl|Aclame:pro 153 YSSALGLAKFLNGSYLFGVA-----GLENYGLINDPSLSAPITATT-P----WSGSPAVEAVVNEV-VALFQVLQTQSQG 221 (336) Q Consensus 153 ~aAr~a~e~~~n~~~~~Gd~-----~~g~~GllN~Pnl~~~~~~~t-~----w~~~~t~~eI~~Di-~~l~~~l~~~s~g 221 (336) .+..+.+.++.-..+|.|+. ++..|||-||||+-...-.++ + -..+.|+++++.-. .+++.++....+ T Consensus 164 ~~~~~kv~~~~vdy~lNG~~~I~v~g~t~~Glrn~~n~~qv~l~~~s~g~NiDlttat~~a~~~~f~~~l~~~~~~~N~- 242 (358) T protein:vir:10 164 MAKMRKFNQKRVNYYLNGDPNIQVQSYPAQGIKNHRNTKKINLGSGSGGANIDLTTADMTALFAFFGKGAFGTLARANK- 242 (358) T ss_pred HHHHHHHHHHHHhhhhccCCceeecCcccccccCCcceeEEEeccCCCcceeeeccCCHHHHHHHHHHHHHHHHHhhcc- Confidence 99999999999999999986 677899999999853222211 1 23466788888877 667777765443 Q ss_pred ceecccccEEEecHHHHHhcccC-CCCC---ccHHHHHHHhCCcc-EEEEcccccCCCCceEEEEEEeecCCceEEEEcC Q lcl|Aclame:pro 222 IITQEDVLRMGLPPTAMSDLSKT-NQYG---LAAAAKLKDIFPKL-EFVTIPEYDTASGRLVQLWAPRVEGKDTATCGFT 296 (336) Q Consensus 222 ~v~~~~p~tL~Lp~~~~~~L~~~-~~~~---~Tvl~~l~~n~pnl-~i~~~pel~~a~G~~~~~~~~~~~~~~~~~~~~p 296 (336) ...-.++..+|+.+..|.++ ...| -||++++++- +++ .|++.+.|+ |+-...++. ..++..-.+- T Consensus 243 ---~~~~~~~~vs~ei~~n~~r~Y~~~~~~~gTIl~~vl~~-~~va~I~~~~~Ls---gNeii~~~~---~~~vi~plvG 312 (358) T protein:vir:10 243 ---VAQYDVMWVSPEIWANLAQPYVVNGVVSGNVLNAVLPF-APVREIRQTFALS---GNEFIAYVR---RQDIISPLVG 312 (358) T ss_pred ---cceeeEEEEcHHHHhhhhcccccccccchhhHHHhhcc-cCcccccccccCC---CccEEEEEe---CCceeeeeec Confidence 12457899999999999874 2223 4999999754 444 577777776 655444432 2344444444 Q ss_pred hhhhcccceecCCceEEcccccee---eeeeeccc----ceeeeccC Q lcl|Aclame:pro 297 EKMRAHSIERYSSYFRQKKSAGTW---GAVIFRPF----AVAQMIGV 336 (336) Q Consensus 297 ~~~r~l~~~~~~~~~~vp~~~~t~---Gv~ir~P~----av~~~~GI 336 (336) |++-..|.-..+ +.-.+..++| |++||.=. .|.+..-+ T Consensus 313 ~~~gt~~~pR~~--p~ddY~f~vwsA~glqik~D~~Gks~Vv~~~~~ 357 (358) T protein:vir:10 313 MAVGVVPLPRPL--PNVNYNFQIMSAEGLQITADDQGLSGVVYGANL 357 (358) T ss_pred ceeeeecCCCCC--CCcchhhhhhhhhceeeeeccccceeeEeeccc Confidence 444333321111 1222333333 34444432 23333333 No 16 >protein:vir:7771 Length: 330 # NCBI annotation: gp17 # Family: family:all:507 # MgeID: mge:149 # MgeName: Bxz2 # Cross-refs: genbank:acc:NP_817605;genbank:gi:29566035;genbank:GeneID:1259229 Probab=98.68 E-value=1.3e-09 Score=69.37 Aligned_cols=288 Identities=8% Similarity=-0.028 Sum_probs=164.5 Q ss_pred HHHhhhhhcccccccCcch-H-HHHHHHhhCceeeeeeccccchhhhcccccCCCcceeeEEEeeeecceeeEEeecccC Q lcl|Aclame:pro 31 YAMDAADLSPHLSSTGSSG-I-PNYLTTYVDPAVIDILVAPMKAAELVGESKKGDWTTLVAAFITAEPTTKVATYGDYSS 108 (336) Q Consensus 31 ~a~da~d~~~~l~t~~~~~-i-~~~l~~~idp~v~~~~~~~~~~~~l~~v~t~g~w~~~t~~~~~~e~~G~a~~ygd~~d 108 (336) |+.+...+.-..+|.+.++ + |.+.. ++++.+.+.....+++++.+... ....|++.+..+.+..++.... T Consensus 1 m~~~~~~a~~~~~t~~~g~~i~~~~~~-----~ii~~~~~~s~l~~~~~~~~~~~---~~~~~p~~~~~~~a~~v~Eg~~ 72 (330) T protein:vir:77 1 MAGSTVPSTQVALTGDFSAFLTPEQSQ-----DYFAEIEKTSIVQRIARKVPMGP---TGISIPHWTGAVSASWTGEAER 72 (330) T ss_pred CcccccchhhccccCCCcceechhHHH-----HHHHHHHhccchhhhcceeeccC---CceEEEEEcCCcceeEecCCCc Confidence 4444433332233333333 3 33333 34444455555666666544322 3367888888888999999999 Q ss_pred CceeeeeeeeeeeeEEEEEEEEeeCHHHHHHHHhhCCCHHHHHHHHHHHHHHHhhcceEEeeccc-cceEEEEecCCCCc Q lcl|Aclame:pro 109 DGDSGANINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASELNYSSALGLAKFLNGSYLFGVAG-LENYGLINDPSLSA 187 (336) Q Consensus 109 iP~~~~~~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~k~~aAr~a~e~~~n~~~~~Gd~~-~g~~GllN~Pnl~~ 187 (336) +|..+........+.+.++..+.++.+=++. ...++.+.-.....+++.+.+++-.++|+.. ....|++|++.... T Consensus 73 ~~~~~~~f~~i~~~~~k~~~~~~is~ell~d---s~~~~~~~i~~~l~~ai~~~~~~~~l~G~g~~~~~~g~~~~~~~~~ 149 (330) T protein:vir:77 73 KPITKGSFGKQELEPVKITTIFAESAEVVRL---NPLNYLNTMRTKIAEAIALKFDAAAIHGIDKPSAFKGYLAETTKVV 149 (330) T ss_pred cccccceeeEEEEeEEEEEEeehhhHHHHhc---chHHHHHHHHHHHHHHHHHHHHHHhhcccCCCCccccccccccccc Confidence 9999988888888899999988888765543 3567899999999999999999999999974 55679999764332 Q ss_pred ccccccccccccCHHHHHHHHHHHHHHHHHHhCCceecccccEEEecHHHHHhccc-CCCCCccHHHH-HHHh----CCc Q lcl|Aclame:pro 188 PITATTPWSGSPAVEAVVNEVVALFQVLQTQSQGIITQEDVLRMGLPPTAMSDLSK-TNQYGLAAAAK-LKDI----FPK 261 (336) Q Consensus 188 ~~~~~t~w~~~~t~~eI~~Di~~l~~~l~~~s~g~v~~~~p~tL~Lp~~~~~~L~~-~~~~~~Tvl~~-l~~n----~pn 261 (336) .......-....+...+++||.+++..+...- ..+..++|.++.+..|.+ .+..|.-++.- +... ..+ T Consensus 150 ~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~------~~~~~~vmn~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~~~~ 223 (330) T protein:vir:77 150 SLADTNLTTASGPQGNAYLAVNNALSLLVNSG------KKWTGTLLDNVTEPILNTAVDGNGRPLFVESTYTEQVGAIRE 223 (330) T ss_pred eeecccccccccccchhHHHHHHHHHhhhhcC------CCccEEEEcHHHHHHHHHHhccCCceeecCccccccccccCC Confidence 22211111112334567899999888875542 135679999999888854 23333322210 0000 111 Q ss_pred cE-----EEEccccc-CCCCceEEEEEEeecC-----CceEEEEcChhhh-c-----------ccce-ecCCceEEcccc Q lcl|Aclame:pro 262 LE-----FVTIPEYD-TASGRLVQLWAPRVEG-----KDTATCGFTEKMR-A-----------HSIE-RYSSYFRQKKSA 317 (336) Q Consensus 262 l~-----i~~~pel~-~a~G~~~~~~~~~~~~-----~~~~~~~~p~~~r-~-----------l~~~-~~~~~~~vp~~~ 317 (336) .+ ++...... +.++++..+++-.... ..-..+.+..... . .++- ...-...+.|+. T Consensus 224 ~~l~G~PV~~~~~~p~~~~~~~~~~~~gd~s~~~i~~~~~~~i~~~~e~~~~~~~~~~~~~~~~~~~~f~~~~~~~r~~~ 303 (330) T protein:vir:77 224 GRILGRPTYVADNVVNGTVGNRVVGVMGDFSQVIWGQIGGLSFDVTDQATLDFGEEQGGVWVPKLISLWQHNMVAVRCEA 303 (330) T ss_pred ceecceeeEEeccccCCCCCCccEEEEEecceEEEEEecCcEEEEeecceeeecccccccccccccchhhcCcEEEEEEE Confidence 22 22222222 2233444333322110 0001111111100 0 0000 011236667788 Q ss_pred ceeeeeeecccceeeeccC Q lcl|Aclame:pro 318 GTWGAVIFRPFAVAQMIGV 336 (336) Q Consensus 318 ~t~Gv~ir~P~av~~~~GI 336 (336) |.++.+ ++|.||+++.+. T Consensus 304 r~d~~v-~~~~a~~~i~~~ 321 (330) T protein:vir:77 304 EFAFMV-NDKDAFVKLTDQ 321 (330) T ss_pred EeccEE-ecccceEEEEec Confidence 887766 669999999999 No 17 >protein:vir:80376 Length: 435 # NCBI annotation: gp6, major capsid head protein # Family: family:all:21 # MgeID: mge:1881 # MgeName: phi644-2 # Cross-refs: genbank:acc:YP_001111085;genbank:gi:134288639;genbank:GeneID:4960624 Probab=98.47 E-value=1.6e-08 Score=63.28 Aligned_cols=317 Identities=10% Similarity=0.041 Sum_probs=155.4 Q ss_pred CchHHHHHHhhh------cce----------eccchhhhccchhH-----------------------HHHHhhhhhccc Q lcl|Aclame:pro 1 MRDAQRIQNLAR------AGV----------ILPRSVQNVSTPLT-----------------------EYAMDAADLSPH 41 (336) Q Consensus 1 ~~~~~~~~~l~~------~g~----------~~~~~~~~~~~~~~-----------------------~~a~da~d~~~~ 41 (336) +++.+....... -+. .-+........... ....+.+.+... T Consensus 55 ~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 134 (435) T protein:vir:80 55 AEAAERMAAAAAVPVDPNPAAVTASAAAPVYAQPKAPEVKGAKMARMVRALAAARGDAQLASKLAIERGFGEEVAMSLNT 134 (435) T ss_pred HHHHHHHHHhhcccccchhhhhccccccccccccchhhhhHHHHHHHHHHHHhccchhHHHHHHHHhhhhhhhhhhhhcc Confidence 222111110000 000 00000000000000 000011110111 Q ss_pred ccccCcc--hHHHHHHHhhCceeeeeeccccchhhhcccccCCCcceeeEEEeeeecceeeEEeecccCCceeeeeeeee Q lcl|Aclame:pro 42 LSSTGSS--GIPNYLTTYVDPAVIDILVAPMKAAELVGESKKGDWTTLVAAFITAEPTTKVATYGDYSSDGDSGANINYP 119 (336) Q Consensus 42 l~t~~~~--~i~~~l~~~idp~v~~~~~~~~~~~~l~~v~t~g~w~~~t~~~~~~e~~G~a~~ygd~~diP~~~~~~~~~ 119 (336) .+...+ .+|..+.+ +|++.+.+......+-. +.-......+.|++.+..+.+...+.....|..+...... T Consensus 135 -~~~~~gg~lvP~~~~~----~ii~~l~~~~~i~~~~~--~~v~~~~~~~~~p~~~~~~~a~~v~E~~~~~~~~~~f~~i 207 (435) T protein:vir:80 135 -LSPGAGGVLVPENLSS----EVIELLRPKSVVRKLGA--RTLPLSNGNITIPRLKGGAIVGYIGADTDIPTTQQQFDDL 207 (435) T ss_pred -cCCCCCccccchhHHH----HHHHHHhhhchhhhccc--eeeecCCCceEEEEEeCCcceeeeccCccccccccceeeE Confidence 122222 25655543 33443332222222211 1111112345777888788888888888899999888888 Q ss_pred eeeEEEEEEEEeeCHHHHHHHHhhCCCHHHHHHHHHHHHHHHhhcceEEeeccc-cceEEEEecCCCCcccccccccccc Q lcl|Aclame:pro 120 QRQSYFFQTWTRWGERELEMAGAGRVDLASELNYSSALGLAKFLNGSYLFGVAG-LENYGLINDPSLSAPITATTPWSGS 198 (336) Q Consensus 120 ~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~k~~aAr~a~e~~~n~~~~~Gd~~-~g~~GllN~Pnl~~~~~~~t~w~~~ 198 (336) .-.++.++..+.+|.+=|.. ...+-++.+.-......++.+.+++-+++|++. ....|++++........ .+ +. T Consensus 208 ~~~~~k~~~~~~is~ell~d-s~~~~~l~~~i~~~l~~a~~~~~d~a~l~G~G~~~~p~Gi~~~~~~~~~~~-~~---~~ 282 (435) T protein:vir:80 208 KLTAKKMAALVPIANDLIKY-AGVNPNVDQIVVGDLTAAIGAREDKAFIRDDGTANTPKGLRFWALPGNVIT-AS---DG 282 (435) T ss_pred EEeeEEEEEeehhhHHHHHh-hcccHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCcccceeecccccceee-cc---cc Confidence 88888998888887553333 333456777788888888888889888999863 56789999764432222 21 12 Q ss_pred cCHHHHHHHHHHHHHHHHHHhCCceecccccEEEecHHHHHhccc-CCCCCccHHHHHHHh-CCccEEE---EcccccCC Q lcl|Aclame:pro 199 PAVEAVVNEVVALFQVLQTQSQGIITQEDVLRMGLPPTAMSDLSK-TNQYGLAAAAKLKDI-FPKLEFV---TIPEYDTA 273 (336) Q Consensus 199 ~t~~eI~~Di~~l~~~l~~~s~g~v~~~~p~tL~Lp~~~~~~L~~-~~~~~~Tvl~~l~~n-~pnl~i~---~~pel~~a 273 (336) .+.+.+..|+.+++..+.....+ -.+..++|.+..+..|.+ .+..|.-++.-+..+ +-++.++ .+|...+. T Consensus 283 ~~~~~~~~d~~~~~~~~~~~~~~----~~~~~~vmn~~~~~~L~~lkd~~G~~l~~~~~~~~l~G~pv~~~~~~p~~~~~ 358 (435) T protein:vir:80 283 STLQKIETDLGKAILALENADAN----LTQPGWIMAPRTFRFLEGLRDGNGNKVYPELANGMLKGYPVGKTTQVPINLGE 358 (435) T ss_pred cchhhHHHHHHHHHHHhhccccc----cccCEEEEcHHHHHHHHhhhccCCceeccCCCCCeEeeeeeEEeccccccccC Confidence 45677888999988888654322 135678999999998864 333344333111111 1112222 33333333 Q ss_pred CCceEEEEEEeecCCce------EEEEc-Chh-hh-----cccceecCCceEEccccceeeeeeecccceeeeccC Q lcl|Aclame:pro 274 SGRLVQLWAPRVEGKDT------ATCGF-TEK-MR-----AHSIERYSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) Q Consensus 274 ~G~~~~~~~~~~~~~~~------~~~~~-p~~-~r-----~l~~~~~~~~~~vp~~~~t~Gv~ir~P~av~~~~GI 336 (336) ++....+++-... +-. ..+.+ ++. +. .+....++ ...+-+..| .++.+++|.||+++.|+ T Consensus 359 ~~~~~~i~~gd~s-~~~i~~~~~~~i~~~~~~~~~~~~~~~~~~f~~n-~~~~r~~~r-~d~~~~~~~a~~~l~~~ 431 (435) T protein:vir:80 359 AGKESEIYFTDFG-DVFIGEEETLEIDYSKEATYKDADGHMVSAFQRD-QTLIRVIAK-NDFGPRHVESIAVLSGV 431 (435) T ss_pred CCCcceEEEEEcc-cEEEEeecceEEEEeccccccccccchhhhhhcC-cceeeeeee-eCcEeecccceEEEecc Confidence 4333333322211 100 00100 000 00 00001111 123344444 45678899999999999 No 18 >protein:vir:105905 Length: 304 # NCBI annotation: major capsid protein # Family: family:all:507 # MgeID: mge:1514 # MgeName: phiETA3 # Cross-refs: genbank:acc:YP_001004375;genbank:gi:122891830;genbank:GeneID:4712376 Probab=98.45 E-value=2.4e-08 Score=62.39 Aligned_cols=282 Identities=10% Similarity=0.015 Sum_probs=157.4 Q ss_pred HHHhhhhhcccccccCcch--HHHHHHHhhCceeeeeeccccchhhhcccccCCCcceeeEEEeeeecceeeEEeecccC Q lcl|Aclame:pro 31 YAMDAADLSPHLSSTGSSG--IPNYLTTYVDPAVIDILVAPMKAAELVGESKKGDWTTLVAAFITAEPTTKVATYGDYSS 108 (336) Q Consensus 31 ~a~da~d~~~~l~t~~~~~--i~~~l~~~idp~v~~~~~~~~~~~~l~~v~t~g~w~~~t~~~~~~e~~G~a~~ygd~~d 108 (336) ||.....+.- ..++.++| ||..+. .++++.+........++.+...+. ....+++.+..+.+..++.... T Consensus 1 ma~~~~~~~~-~~~t~~gg~lip~~~~----~~ii~~~~~~~~l~~~~~~~~~~~---~~~~ip~~~~~~~a~~v~E~~~ 72 (304) T protein:vir:10 1 MATPTYTPGN-VILSDFKNGVIPAEQG----TLIMKDIMANSAIMKLAKNEPMTA---QKKKFTYLAKGVGAYWVSETER 72 (304) T ss_pred Cccccccccc-ccccCCCceecchhHH----HHHHHHHHhccchhhhcceeeccC---CceEEEEEeCCcceEEeecCcc Confidence 4443322222 22233333 666554 344555555555566665544332 3356778888888889998889 Q ss_pred CceeeeeeeeeeeeEEEEEEEEeeCHHHHHHHHhhCCCHHHHHHHHHHHHHHHhhcceEEeeccccceEEEEecCCCCcc Q lcl|Aclame:pro 109 DGDSGANINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASELNYSSALGLAKFLNGSYLFGVAGLENYGLINDPSLSAP 188 (336) Q Consensus 109 iP~~~~~~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~k~~aAr~a~e~~~n~~~~~Gd~~~g~~GllN~Pnl~~~ 188 (336) +|..+........+.+.++..+.+|.+=++.+ ..++.+.-.....+++.+.+++-+++|++..+-.|.+.+..+... T Consensus 73 ~~~~~~~~~~i~~~~~k~~~~~~iS~ell~ds---~~~l~~~i~~~l~~~ia~~~d~~~l~G~g~~~~~~~~~~~~~~~~ 149 (304) T protein:vir:10 73 IQTSKPEYAQAEMEAKKIGVIIPLSKEFLKWT---AKDFFNEVKPLIAEAFYKAFDQAVIFGTKSPYNTSTSGKPLVEGA 149 (304) T ss_pred cccccceeeEEEEEEEEEEEeehhhHHHHhcc---hHHHHHHHHHHHHHHHHHHHHhhheeccCCCcccccccccccccc Confidence 99999888888889999999988887544433 477888888999999999999999999987766666555444322 Q ss_pred cccccccccccCHHHHHHHHHHHHHHHHHHhCCceecccccEEEecHHHHHhccc-CCCCCccHHHHHHHhCCccEEEEc Q lcl|Aclame:pro 189 ITATTPWSGSPAVEAVVNEVVALFQVLQTQSQGIITQEDVLRMGLPPTAMSDLSK-TNQYGLAAAAKLKDIFPKLEFVTI 267 (336) Q Consensus 189 ~~~~t~w~~~~t~~eI~~Di~~l~~~l~~~s~g~v~~~~p~tL~Lp~~~~~~L~~-~~~~~~Tvl~~l~~n~pnl~i~~~ 267 (336) .+.... .++....++||.+++..+...- ..+..++|.++.+..|.+ .+..|.-+++-=-.++-++.++.. T Consensus 150 ~~~~~~---~~~~~~~~~~i~~~~~~l~~~~------~~~~~~v~~~~~~~~L~~lkd~~G~~l~~~~~~~l~G~PV~~~ 220 (304) T protein:vir:10 150 EEKGNV---VTDTNNLYVDLSALMATIEDEE------LDPNGVLTTRSFRSKMRNALDANDRPLFDANGNEIMGLPLSYT 220 (304) T ss_pred cccccc---cccccchHHHHHHHHHHhhhcc------CCcCEEEEcHHHHHHHHHhhccCCcEeecCCCccccceeeEEe Confidence 222211 1122346889999988875421 235679999999998864 333333221100000112233332 Q ss_pred ccccCCCCceEEEE-------EEeecCCceEEEEcChhh--------hcccce---ecCCceEEccccceeeeeeecccc Q lcl|Aclame:pro 268 PEYDTASGRLVQLW-------APRVEGKDTATCGFTEKM--------RAHSIE---RYSSYFRQKKSAGTWGAVIFRPFA 329 (336) Q Consensus 268 pel~~a~G~~~~~~-------~~~~~~~~~~~~~~p~~~--------r~l~~~---~~~~~~~vp~~~~t~Gv~ir~P~a 329 (336) +.+...++....++ +-.+.+ .++.+-..- ...+.. ...-....-++.|+++.+ ++|.| T Consensus 221 ~~~~~~~~~~~~~~gd~~~~~~~~~~~---~~i~~~~e~~~~~~~~~~~~g~~~~~f~~~~~~~r~~~r~~~~v-~~~~a 296 (304) T protein:vir:10 221 GADVYDKKKSLALMGDWDYARYGILQG---IEYAISEDATLTTLQASDASGQPVSLFERDMFALRATMHIAYMN-VKPEA 296 (304) T ss_pred cccccCCCCcEEEEEehhhEEEEEecc---eEEEEeecceeeeecccccCccchhhhhcCcEEEEEEEEeccEe-ecccc Confidence 33322222212121 111111 011110000 000000 011124445666666655 45999 Q ss_pred eeeeccC Q lcl|Aclame:pro 330 VAQMIGV 336 (336) Q Consensus 330 v~~~~GI 336 (336) |+.+..- T Consensus 297 ~~~l~~a 303 (304) T protein:vir:10 297 FATLKPT 303 (304) T ss_pred eEEEEec Confidence 9999888 No 19 >protein:vir:94142 Length: 304 # NCBI annotation: ORF013 # Family: family:all:507 # MgeID: mge:1494 # MgeName: 96 # Cross-refs: genbank:acc:YP_240234;genbank:gi:66395898;genbank:GeneID:5133311 Probab=98.45 E-value=2.4e-08 Score=62.39 Aligned_cols=282 Identities=10% Similarity=0.015 Sum_probs=157.4 Q ss_pred HHHhhhhhcccccccCcch--HHHHHHHhhCceeeeeeccccchhhhcccccCCCcceeeEEEeeeecceeeEEeecccC Q lcl|Aclame:pro 31 YAMDAADLSPHLSSTGSSG--IPNYLTTYVDPAVIDILVAPMKAAELVGESKKGDWTTLVAAFITAEPTTKVATYGDYSS 108 (336) Q Consensus 31 ~a~da~d~~~~l~t~~~~~--i~~~l~~~idp~v~~~~~~~~~~~~l~~v~t~g~w~~~t~~~~~~e~~G~a~~ygd~~d 108 (336) ||.....+.- ..++.++| ||..+. .++++.+........++.+...+. ....+++.+..+.+..++.... T Consensus 1 ma~~~~~~~~-~~~t~~gg~lip~~~~----~~ii~~~~~~~~l~~~~~~~~~~~---~~~~ip~~~~~~~a~~v~E~~~ 72 (304) T protein:vir:94 1 MATPTYTPGN-VILSDFKNGVIPAEQG----TLIMKDIMANSAIMKLAKNEPMTA---QKKKFTYLAKGVGAYWVSETER 72 (304) T ss_pred Cccccccccc-ccccCCCceecchhHH----HHHHHHHHhccchhhhcceeeccC---CceEEEEEeCCcceEEeecCcc Confidence 4443322222 22233333 666554 344555555555566665544332 3356778888888889998889 Q ss_pred CceeeeeeeeeeeeEEEEEEEEeeCHHHHHHHHhhCCCHHHHHHHHHHHHHHHhhcceEEeeccccceEEEEecCCCCcc Q lcl|Aclame:pro 109 DGDSGANINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASELNYSSALGLAKFLNGSYLFGVAGLENYGLINDPSLSAP 188 (336) Q Consensus 109 iP~~~~~~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~k~~aAr~a~e~~~n~~~~~Gd~~~g~~GllN~Pnl~~~ 188 (336) +|..+........+.+.++..+.+|.+=++.+ ..++.+.-.....+++.+.+++-+++|++..+-.|.+.+..+... T Consensus 73 ~~~~~~~~~~i~~~~~k~~~~~~iS~ell~ds---~~~l~~~i~~~l~~~ia~~~d~~~l~G~g~~~~~~~~~~~~~~~~ 149 (304) T protein:vir:94 73 IQTSKPEYAQAEMEAKKIGVIIPLSKEFLKWT---AKDFFNEVKPLIAEAFYKAFDQAVIFGTKSPYNTSTSGKPLVEGA 149 (304) T ss_pred cccccceeeEEEEEEEEEEEeehhhHHHHhcc---hHHHHHHHHHHHHHHHHHHHHhhheeccCCCcccccccccccccc Confidence 99999888888889999999988887544433 477888888999999999999999999987766666555444322 Q ss_pred cccccccccccCHHHHHHHHHHHHHHHHHHhCCceecccccEEEecHHHHHhccc-CCCCCccHHHHHHHhCCccEEEEc Q lcl|Aclame:pro 189 ITATTPWSGSPAVEAVVNEVVALFQVLQTQSQGIITQEDVLRMGLPPTAMSDLSK-TNQYGLAAAAKLKDIFPKLEFVTI 267 (336) Q Consensus 189 ~~~~t~w~~~~t~~eI~~Di~~l~~~l~~~s~g~v~~~~p~tL~Lp~~~~~~L~~-~~~~~~Tvl~~l~~n~pnl~i~~~ 267 (336) .+.... .++....++||.+++..+...- ..+..++|.++.+..|.+ .+..|.-+++-=-.++-++.++.. T Consensus 150 ~~~~~~---~~~~~~~~~~i~~~~~~l~~~~------~~~~~~v~~~~~~~~L~~lkd~~G~~l~~~~~~~l~G~PV~~~ 220 (304) T protein:vir:94 150 EEKGNV---VTDTNNLYVDLSALMATIEDEE------LDPNGVLTTRSFRSKMRNALDANDRPLFDANGNEIMGLPLSYT 220 (304) T ss_pred cccccc---cccccchHHHHHHHHHHhhhcc------CCcCEEEEcHHHHHHHHHhhccCCcEeecCCCccccceeeEEe Confidence 222211 1122346889999988875421 235679999999998864 333333221100000112233332 Q ss_pred ccccCCCCceEEEE-------EEeecCCceEEEEcChhh--------hcccce---ecCCceEEccccceeeeeeecccc Q lcl|Aclame:pro 268 PEYDTASGRLVQLW-------APRVEGKDTATCGFTEKM--------RAHSIE---RYSSYFRQKKSAGTWGAVIFRPFA 329 (336) Q Consensus 268 pel~~a~G~~~~~~-------~~~~~~~~~~~~~~p~~~--------r~l~~~---~~~~~~~vp~~~~t~Gv~ir~P~a 329 (336) +.+...++....++ +-.+.+ .++.+-..- ...+.. ...-....-++.|+++.+ ++|.| T Consensus 221 ~~~~~~~~~~~~~~gd~~~~~~~~~~~---~~i~~~~e~~~~~~~~~~~~g~~~~~f~~~~~~~r~~~r~~~~v-~~~~a 296 (304) T protein:vir:94 221 GADVYDKKKSLALMGDWDYARYGILQG---IEYAISEDATLTTLQASDASGQPVSLFERDMFALRATMHIAYMN-VKPEA 296 (304) T ss_pred cccccCCCCcEEEEEehhhEEEEEecc---eEEEEeecceeeeecccccCccchhhhhcCcEEEEEEEEeccEe-ecccc Confidence 33322222212121 111111 011110000 000000 011124445666666655 45999 Q ss_pred eeeeccC Q lcl|Aclame:pro 330 VAQMIGV 336 (336) Q Consensus 330 v~~~~GI 336 (336) |+.+..- T Consensus 297 ~~~l~~a 303 (304) T protein:vir:94 297 FATLKPT 303 (304) T ss_pred eEEEEec Confidence 9999888 No 20 >protein:vir:1433 Length: 435 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:30 # MgeName: phiE125 # Cross-refs: genbank:acc:NP_536362;genbank:gi:17975167;genbank:GeneID:929171 Probab=98.41 E-value=3.3e-08 Score=61.62 Aligned_cols=316 Identities=10% Similarity=0.044 Sum_probs=153.8 Q ss_pred CchHHHHHHhh---------------hcceeccch----hhhccc--------------------hhHHHHHhhhhhccc Q lcl|Aclame:pro 1 MRDAQRIQNLA---------------RAGVILPRS----VQNVST--------------------PLTEYAMDAADLSPH 41 (336) Q Consensus 1 ~~~~~~~~~l~---------------~~g~~~~~~----~~~~~~--------------------~~~~~a~da~d~~~~ 41 (336) +...++..++. ..+..-.+. ...... ............... T Consensus 52 I~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 131 (435) T protein:vir:14 52 IERAEAAERMAAAAAVPVDPNPTAVAAPAAAPVHAQPKALEVKGAKMARMVRALAAARGDAQLASKLAIERGFGEEVAMS 131 (435) T ss_pred HHHHHHHHHHHHhhcccccchhhhhhhccccccccccchhhhhHHHHHHHHHHHHhhcchhhHHHHHHHhhhhhhhhhhh Confidence 11111110000 000000000 000000 000000000001111 Q ss_pred cc--ccCcc--hHHHHHHHhhCceeeeeeccccchhhhcccccCCCcceeeEEEeeeecceeeEEeecccCCceeeeeee Q lcl|Aclame:pro 42 LS--STGSS--GIPNYLTTYVDPAVIDILVAPMKAAELVGESKKGDWTTLVAAFITAEPTTKVATYGDYSSDGDSGANIN 117 (336) Q Consensus 42 l~--t~~~~--~i~~~l~~~idp~v~~~~~~~~~~~~l~~v~t~g~w~~~t~~~~~~e~~G~a~~ygd~~diP~~~~~~~ 117 (336) ++ +...+ .+|..+.+ +|++.+.+......+.. +.-......+.|++.+..+.+...+....+|..+.... T Consensus 132 ~~~~t~~~gg~~vP~~~~~----~ii~~l~~~~~i~~~~~--~~~~~~~~~~~~p~~~~~~~a~~v~E~~~~~~~~~~f~ 205 (435) T protein:vir:14 132 LNTLSPGAGGVLVPENLSS----EVIELLRPKSVVRKLGA--RTLPLSNGNITIPRLKGGAIVGYIGADTDIPTTQQQFD 205 (435) T ss_pred cccCCcCCCccccchhHHH----HHHHHHhhhchhhhhcc--eeeecCCCceEEEEEeCCcceeeeccCcccccccccee Confidence 11 12222 25654432 34444433333333311 11111123467888888888888888888998888777 Q ss_pred eeeeeEEEEEEEEeeCHHHHHHHHhhCCCHHHHHHHHHHHHHHHhhcceEEeeccc-cceEEEEecCCCCcccccccccc Q lcl|Aclame:pro 118 YPQRQSYFFQTWTRWGERELEMAGAGRVDLASELNYSSALGLAKFLNGSYLFGVAG-LENYGLINDPSLSAPITATTPWS 196 (336) Q Consensus 118 ~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~k~~aAr~a~e~~~n~~~~~Gd~~-~g~~GllN~Pnl~~~~~~~t~w~ 196 (336) ..+-.++.++..+.+|.+=|.-+ ..+.+|.+.-......++.+.+++-.++|++. ....|+++........+ .+. T Consensus 206 ~i~~~~~k~~~~~~iS~ell~ds-~~~~~l~~~i~~~l~~ai~~~~d~a~l~G~G~~~~p~Gi~~~~~~~~~~~-~~~-- 281 (435) T protein:vir:14 206 DLKLTAKKMAALVPIANDLIKYA-GVNPNVDQIVVGDLTAAIGAREDKAFIRDDGTANTPKGLRFWALPSNVIT-ASD-- 281 (435) T ss_pred EEEeeeEEEEEeehhhHHHHHhh-ccCHHHHHHHHHHHHHHHHHHHHHHhhccCCCCccccceeecccccceec-ccc-- Confidence 77888888888888875433332 22345778888888888888999988999874 45789998654432222 221 Q ss_pred cccCHHHHHHHHHHHHHHHHHHhCCceecccccEEEecHHHHHhccc-CCCCCccHHHHHHHh-CCccEEEE---ccccc Q lcl|Aclame:pro 197 GSPAVEAVVNEVVALFQVLQTQSQGIITQEDVLRMGLPPTAMSDLSK-TNQYGLAAAAKLKDI-FPKLEFVT---IPEYD 271 (336) Q Consensus 197 ~~~t~~eI~~Di~~l~~~l~~~s~g~v~~~~p~tL~Lp~~~~~~L~~-~~~~~~Tvl~~l~~n-~pnl~i~~---~pel~ 271 (336) .++.+.+.+|+.+++..+.....+. .+..++|.+..+..|.. .+..|.-++.-+... .-++.++. +|.-. T Consensus 282 -~~~~~~~~~~~~~l~~~~~~~~~~~----~~~~~v~n~~~~~~L~~lkd~~G~~l~~~~~~g~l~G~Pv~~~~~~p~~~ 356 (435) T protein:vir:14 282 -ASTLQKIETDLGKVILALENADANL----TQPGWIMAPRTFRFLEGLRDGNGNKVYPELANGMLKGYPVGKTTQVPINL 356 (435) T ss_pred -ccchhhHHHHHHHHHHHhhhccccc----cCCEEEEcHHHHHHHHHhhccCCceeccCCCCCeeecceeEeeccccccc Confidence 2456778899999988887653321 34568999999988864 333343332111000 11122222 23222 Q ss_pred CCCCceEEEEEEeecCCceEEEEcChhhhc--cc-------------ceecCCceEEccccceeeeeeecccceeeeccC Q lcl|Aclame:pro 272 TASGRLVQLWAPRVEGKDTATCGFTEKMRA--HS-------------IERYSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) Q Consensus 272 ~a~G~~~~~~~~~~~~~~~~~~~~p~~~r~--l~-------------~~~~~~~~~vp~~~~t~Gv~ir~P~av~~~~GI 336 (336) +.++....+++-... + . .+..-+.++. .+ -..++ ...+-+..|+++ .+++|.||+.+.|+ T Consensus 357 ~~~~~~~~i~~gd~s-~-~-~i~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~-~~~~r~~~r~d~-~~~~~~a~~~l~~~ 431 (435) T protein:vir:14 357 GETGKESEIYFTDFG-D-V-FIGEEETLEIDYSKEATYKDADGHMVSAFQRD-QTLIRVIAKNDF-GPRHVESIAVLAGV 431 (435) T ss_pred cCCCccceEEEeecc-c-E-EEEEecccEEEEeccccccccccchhhhhhcC-hhheeeeeeeCc-eeecccceEEEecC Confidence 233332222222111 1 0 0110011100 00 00111 233445666665 88999999999999 No 21 >protein:vir:94771 Length: 298 # NCBI annotation: major head protein # Family: family:all:966 # MgeID: mge:1529 # MgeName: phi LC3 # Cross-refs: genbank:acc:NP_996706;genbank:gi:45597421;genbank:GeneID:2769044 Probab=98.33 E-value=4.2e-08 Score=61.02 Aligned_cols=274 Identities=11% Similarity=0.030 Sum_probs=148.7 Q ss_pred ccccCcchHHHHHHHhhCceeeeeeccccchhhhcccccCCCcceeeEEEeeeecceeeEEeecccCCceeeeeeeeeee Q lcl|Aclame:pro 42 LSSTGSSGIPNYLTTYVDPAVIDILVAPMKAAELVGESKKGDWTTLVAAFITAEPTTKVATYGDYSSDGDSGANINYPQR 121 (336) Q Consensus 42 l~t~~~~~i~~~l~~~idp~v~~~~~~~~~~~~l~~v~t~g~w~~~t~~~~~~e~~G~a~~ygd~~diP~~~~~~~~~~~ 121 (336) |.+.....+|..+. .++++.+.+.-....+.++.+.+. ....+++....+.|..++.+.++|..+.......- T Consensus 1 ma~~gG~lip~~~~----~~ii~~~~~~s~i~~~~~~~~~~~---~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~f~~v~l 73 (298) T protein:vir:94 1 MVLNKGTLFDPELV----TDLISKVAGKSSIARLSAQKPIPF---NGEKVFTFTMDSEIDVVAESGKKTHGGVTLAPQTM 73 (298) T ss_pred CeeccccccChhHH----HHHHHHHHhhchhhhhcceeeccC---CceEEEEEecCcceEEeeCCccccccccceeEEEE Confidence 33333334555444 244555555555666666544333 23577888888889999999999999988888888 Q ss_pred eEEEEEEEEeeCHHHHHHHHhhCCCHHHHHHHHHHHHHHHhhcceEEeecccc-----ceEEEEecCCCCcccccccccc Q lcl|Aclame:pro 122 QSYFFQTWTRWGERELEMAGAGRVDLASELNYSSALGLAKFLNGSYLFGVAGL-----ENYGLINDPSLSAPITATTPWS 196 (336) Q Consensus 122 ~v~~~~~~~~y~~~El~~A~~~g~~l~~~k~~aAr~a~e~~~n~~~~~Gd~~~-----g~~GllN~Pnl~~~~~~~t~w~ 196 (336) ..+.++....+|.+=++...-...+|.+.-+...++++.+.++.-.++|.... ...|..+..+...... . T Consensus 74 ~~~k~~~~~~iS~ell~~~~~~~~~l~~~i~~~la~ai~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~~~~-----~ 148 (298) T protein:vir:94 74 VPIKVEYGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVTQKV-----E 148 (298) T ss_pred eeeEEEEeeehhHHHhccCCccHHHHHHHHHHHHHHHHHHHHHHHhhcccccCCCccccccccccccccccccc-----c Confidence 88889888888866454444556678888888999999999999999885321 1112111111110000 1 Q ss_pred cccCHHHHHHHHHHHHHHHHHHhCCceecccccEEEecHHHHHhccc-CCCCCccHHHHHHHh-----CCccEEEEcccc Q lcl|Aclame:pro 197 GSPAVEAVVNEVVALFQVLQTQSQGIITQEDVLRMGLPPTAMSDLSK-TNQYGLAAAAKLKDI-----FPKLEFVTIPEY 270 (336) Q Consensus 197 ~~~t~~eI~~Di~~l~~~l~~~s~g~v~~~~p~tL~Lp~~~~~~L~~-~~~~~~Tvl~~l~~n-----~pnl~i~~~pel 270 (336) ..+....+++||.+++..+...- ..+..++|.++.+..|.+ .+..|.-++.=...+ .-++.++....+ T Consensus 149 ~~~~~~~~~~~i~~~~~~~~~~~------~~~~~~vmn~~~~~~l~~lkd~~G~~l~~~~~~~~~~~tl~G~PV~~~~~v 222 (298) T protein:vir:94 149 APRGIADPNGAIENAVELLTGVD------ADVTGIAINPSFRSALAKQKDLQGNALFPELKWGATPDTINGLPVDVNKTV 222 (298) T ss_pred cccccccHHHHHHHHHHhhhhcC------CCccEEEEcHHHHHHHHHhhccCCCeeecCcccCCCCceecceeeEEeccc Confidence 11223457889999988875531 245679999999988854 233343222111000 111223222222 Q ss_pred cCC-CCceEEEEEEeecCCceEEEEc--Chhhhcccc-ee--------cCCceEEccccceeeeeeecccceeeeccC Q lcl|Aclame:pro 271 DTA-SGRLVQLWAPRVEGKDTATCGF--TEKMRAHSI-ER--------YSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) Q Consensus 271 ~~a-~G~~~~~~~~~~~~~~~~~~~~--p~~~r~l~~-~~--------~~~~~~vp~~~~t~Gv~ir~P~av~~~~GI 336 (336) .+. ++.+..+++-.. .+...+.+ ...+...+- .. +.-...+-++.|. |+.+++|.||+++.|. T Consensus 223 ~~~~~~~~~~~~~Gdf--s~~~~~~~~~~~~~~~~~~~~~d~~~~~~f~~~~v~~r~~~r~-~~~~~~~~a~~~l~~~ 297 (298) T protein:vir:94 223 SDMSLTQRDRAIIGDF--ANGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFL-GWGILDATKFARVTEA 297 (298) T ss_pred ccccCCCccEEEEeec--cceEEEEEecCceEEEeecCCCcCcchhhhhcCcEEEEEEEEe-ccEeecccceEEEEec Confidence 211 222222222111 00000101 011111110 00 0112334455555 5667779999999999 No 22 >protein:vir:5739 Length: 366 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:122 # MgeName: PY54 # Cross-refs: genbank:acc:NP_892050;genbank:gi:33770513;interpro:IPR006444;uniprot:Q7Y410;genbank:GeneID:1732928 Probab=98.32 E-value=1.6e-07 Score=57.78 Aligned_cols=309 Identities=14% Similarity=0.088 Sum_probs=151.2 Q ss_pred CchH---HHHHHhhh-cceeccchh----hhccchhHHHHHhhhhhccccccc-Ccch--HHHHHHHhhCceeeeeeccc Q lcl|Aclame:pro 1 MRDA---QRIQNLAR-AGVILPRSV----QNVSTPLTEYAMDAADLSPHLSST-GSSG--IPNYLTTYVDPAVIDILVAP 69 (336) Q Consensus 1 ~~~~---~~~~~l~~-~g~~~~~~~----~~~~~~~~~~a~da~d~~~~l~t~-~~~~--i~~~l~~~idp~v~~~~~~~ 69 (336) .+.. .....|++ .|- +..+. ..+..... +..++++ .++| ||..+.+ +|++.+.+. T Consensus 27 ~kg~~~~~~~~a~a~~~g~-~~~a~~~a~~~~~~~~~---------~~a~~~~~~~Gg~lvP~~~~~----~ii~~l~~~ 92 (366) T protein:vir:57 27 YKGAGMTRMVMSIAAGKGN-LADAAKFAATELGDTGL---------SMAISTAAGSGGALIPQNMQN----EVIELLRDR 92 (366) T ss_pred ccchhHHHHHHHHHhcccc-hhHHHHHHHHhhcchhh---------hhhccccccCCccccchhHHH----HHHHHHhhh Confidence 1111 11222222 121 11110 01111111 1122222 2233 5766553 333333322 Q ss_pred cchhhhcccccCCCcceeeEEEeeeecceeeEEeecccCCceeeeeeeeeeeeEEEEEEEEeeCHHHHHHHHhhCCCHHH Q lcl|Aclame:pro 70 MKAAELVGESKKGDWTTLVAAFITAEPTTKVATYGDYSSDGDSGANINYPQRQSYFFQTWTRWGERELEMAGAGRVDLAS 149 (336) Q Consensus 70 ~~~~~l~~v~t~g~w~~~t~~~~~~e~~G~a~~ygd~~diP~~~~~~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~ 149 (336) .-.+.+ +....+- ....+.+++.+..+.+...+...++|..+.......-+.+.++....+|.+=|+.+ ..++.+ T Consensus 93 s~l~~l-g~~~v~~-~~g~~~~p~~t~~~~a~wv~E~~~~~~s~~~f~~i~~~~~k~~~~~~iS~ell~ds---~~~~~~ 167 (366) T protein:vir:57 93 TVVRIL-GARSIPL-PNGNLSMPRLSGGATAGYVGEGKDVVATGATFDDVKLSAKTMIALVPVSNQLIGRA---GFNVEQ 167 (366) T ss_pred cchhhh-ceeeeec-CCCceEEEEEeCCcceeeeccCccccccccceeEEEEeeEEEEEeehhhHHHHhhh---hHHHHH Confidence 222222 1111110 11346778877777888889999999999888888888999998888885544433 356888 Q ss_pred HHHHHHHHHHHHhhcceEEeecc-ccceEEEEecCCCCcccccccccccccCHHHHHHHHHHHHHHHHHHhCCceecccc Q lcl|Aclame:pro 150 ELNYSSALGLAKFLNGSYLFGVA-GLENYGLINDPSLSAPITATTPWSGSPAVEAVVNEVVALFQVLQTQSQGIITQEDV 228 (336) Q Consensus 150 ~k~~aAr~a~e~~~n~~~~~Gd~-~~g~~GllN~Pnl~~~~~~~t~w~~~~t~~eI~~Di~~l~~~l~~~s~g~v~~~~p 228 (336) .-......++.+.+++-.++|+. +..-.|++|.+.........+. ...+...+..++..+.........+ -.. T Consensus 168 ~i~~~l~~a~~~~~d~a~l~G~G~~~~p~Gi~~~~~~~~~~~~~~~--t~~~~~~~~~~~~~~~~~~~~~~~~----~~~ 241 (366) T protein:vir:57 168 LLLGDILSAIATREDKAFLRDDGTGDTPKGMKAVATAANRLVAWTG--TAINLTTIDEYLDSLILKHMDSNSN----MIR 241 (366) T ss_pred HHHHHHHHHHHHHHHHHhhccCCCCccccceeeccccccceeeccc--cccchhhHHHHHHHHHHhhhccccc----ccc Confidence 88888888899999999999997 4577899997654322211111 1223344444444433322211111 124 Q ss_pred cEEEecHHHHHhccc-CCCCCccHHHHHHH----hCCccEEEEcccccCCCCceEEEEEEeecC-----CceEEEEc-Ch Q lcl|Aclame:pro 229 LRMGLPPTAMSDLSK-TNQYGLAAAAKLKD----IFPKLEFVTIPEYDTASGRLVQLWAPRVEG-----KDTATCGF-TE 297 (336) Q Consensus 229 ~tL~Lp~~~~~~L~~-~~~~~~Tvl~~l~~----n~pnl~i~~~pel~~a~G~~~~~~~~~~~~-----~~~~~~~~-p~ 297 (336) ...+|.+..+..|.+ ++..|..++.-+.. .||=+.-..+|.-.++.++...+++-.... ..-.++.+ ++ T Consensus 242 a~~vmn~~~~~~L~~lkd~~G~~l~~~~~~g~l~G~Pvv~s~~ip~~~~~~~~~~~i~~gdfs~~~i~~~~~i~i~~~~e 321 (366) T protein:vir:57 242 CGWGLSNRTYMTLFGLRDGNGNKVYPEMSQGILKGYPIQRTSAIPANLGDDGNESEIYFCDFNDVVIGEDGMMKVDFSTE 321 (366) T ss_pred CEEEecHHHHHHHHhhhccCCceeccCCCCCeecceeeEEccccccccccCCCccEEEEEecceEEEEEecceEEEEeec Confidence 568899999888864 34445544421111 133222223344333333333333222110 00001111 11 Q ss_pred h-hh----c-ccceecCCceEEccccceeeeeeecccceeeeccC Q lcl|Aclame:pro 298 K-MR----A-HSIERYSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) Q Consensus 298 ~-~r----~-l~~~~~~~~~~vp~~~~t~Gv~ir~P~av~~~~GI 336 (336) . +. . +.. ...-...+-+..++. +.+++|.||+++.|| T Consensus 322 a~~~~~~g~~~~~-f~~~~~~iR~~~~~d-~~v~~~~a~~~lt~~ 364 (366) T protein:vir:57 322 ATYKDADGQLVSA-FARNQSLIRVVTEHD-IGFRHPEGLVLGTGV 364 (366) T ss_pred cccccccccchhh-hhcCceeEEeeeeeC-cEeeccccEEEEecc Confidence 1 00 0 000 011123455555554 455999999999999 No 23 >protein:vir:104085 Length: 320 # NCBI annotation: gp17 # Family: family:all:507 # MgeID: mge:1656 # MgeName: Che12 # Cross-refs: genbank:acc:YP_655596;genbank:gi:109392467;genbank:GeneID:4156953 Probab=98.30 E-value=7.1e-08 Score=59.79 Aligned_cols=289 Identities=10% Similarity=-0.027 Sum_probs=152.3 Q ss_pred hhhcceeccchhhhccchhHHHHHhhhhhcccccccCcchHHHHHHHhhCceeeeeeccccchhhhcccccCCCcceeeE Q lcl|Aclame:pro 10 LARAGVILPRSVQNVSTPLTEYAMDAADLSPHLSSTGSSGIPNYLTTYVDPAVIDILVAPMKAAELVGESKKGDWTTLVA 89 (336) Q Consensus 10 l~~~g~~~~~~~~~~~~~~~~~a~da~d~~~~l~t~~~~~i~~~l~~~idp~v~~~~~~~~~~~~l~~v~t~g~w~~~t~ 89 (336) +++ |-.|+ .+.+.++. .-++.....||..+. .++++.+.......++.++...+. .+. T Consensus 1 ~~~-~~~~~-------~~~~~~~~-------t~~~~~~~~ip~~~~----~~ii~~~~~~s~l~~~~~~~~~~~---~~~ 58 (320) T protein:vir:10 1 MAA-GTAFQ-------VDHAQIAQ-------TGDTMFKGYLEPEQA----KDYFAEAEKTSIVQQFAQKVPMGT---TGQ 58 (320) T ss_pred CCC-CccCC-------HHHHHhhc-------cccccccccccHHHH----HHHHHHHHhccchhhhcceeeccC---Cce Confidence 000 11111 11111111 011112223565544 345555555556667766654332 346 Q ss_pred EEeeeecceeeEEeecccCCceeeeeeeeeeeeEEEEEEEEeeCHHHHHHHHhhCCCHHHHHHHHHHHHHHHhhcceEEe Q lcl|Aclame:pro 90 AFITAEPTTKVATYGDYSSDGDSGANINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASELNYSSALGLAKFLNGSYLF 169 (336) Q Consensus 90 ~~~~~e~~G~a~~ygd~~diP~~~~~~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~k~~aAr~a~e~~~n~~~~~ 169 (336) .|++.+..+.+...+...++|..+........+++.++..+.+|.+=++.+. .++.+.-....++++.+.+++-.+. T Consensus 59 ~~p~~~~~~~a~~v~E~~~~~~~~~~f~~v~~~~~k~~~~~~is~ell~ds~---~~l~~~i~~~l~~a~a~~~d~a~l~ 135 (320) T protein:vir:10 59 KIPHWIGDVSAQWIGEGDMKPITKGNMTSQNIAPHKIATIFVASAETVRANP---ANYLGTMRTKVATAFAMAFDSAALN 135 (320) T ss_pred EEEEEeCCcceEEecCCccccccccceeEEEEeeEEEEEeehhhHHHHhcCh---HHHHHHHHHHHHHHHHHHHHHHhhc Confidence 7888888888999999999999999999999999999999999977666433 6788888888999999999999999 Q ss_pred eccccceEEEEecCCCCccccc-ccccccccCHHHHHHHHHHHHHHHHHHhCCceecccccEEEecHHHHHhccc-CCCC Q lcl|Aclame:pro 170 GVAGLENYGLINDPSLSAPITA-TTPWSGSPAVEAVVNEVVALFQVLQTQSQGIITQEDVLRMGLPPTAMSDLSK-TNQY 247 (336) Q Consensus 170 Gd~~~g~~GllN~Pnl~~~~~~-~t~w~~~~t~~eI~~Di~~l~~~l~~~s~g~v~~~~p~tL~Lp~~~~~~L~~-~~~~ 247 (336) |+....-.|++...+....... ...+.+... .-+++.+++..+... . ..+..++|.|+.+..|.+ .+.. T Consensus 136 G~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~---~---~~~~~~v~n~~~~~~L~~lkd~~ 206 (320) T protein:vir:10 136 GTDSPFPTYLAQTTKSVSLADPGGATASDLTA---YDAVAVNGLSLLVNA---K---KKWTHTLLDDIVEPILNGAKDKN 206 (320) T ss_pred ccCCCCCcccccccccccceeccccccccccc---HHHHHHHHHhhhhcc---c---CCCcEEEEcHHHHHHHHHhhccC Confidence 9985444444443221111111 111111111 112233333333221 1 246789999999998864 3333 Q ss_pred CccHHHH-H----HHhCCccEEEEcccccCC---CCceEEEE-------EEeecCCceEEEEcChh-hhcccce------ Q lcl|Aclame:pro 248 GLAAAAK-L----KDIFPKLEFVTIPEYDTA---SGRLVQLW-------APRVEGKDTATCGFTEK-MRAHSIE------ 305 (336) Q Consensus 248 ~~Tvl~~-l----~~n~pnl~i~~~pel~~a---~G~~~~~~-------~~~~~~~~~~~~~~p~~-~r~l~~~------ 305 (336) |..++.- + ..+++..++...|-.... .|....++ +-.+.+ ..+.+-.. .-..... T Consensus 207 G~~l~~~~~~~~~~~~~~~~~i~g~pv~~~~~~~~~~~~~~~gd~~~~~~~~~~~---~~i~~~~~~~~~~~~~~~~~~~ 283 (320) T protein:vir:10 207 GRPLFIESTYTDENSPFRAGRIVSRPTILSDHVADGTTVGYMGDFRNVIWGQVGG---LSFDVTDQATLNLGTPTEPNFV 283 (320) T ss_pred CceeeccccccCccccccCceeeeeeeEecCCCCCCceEEEEeecceEEEEEecC---eEEEEeecceeeeccccccccc Confidence 3322211 1 112334455555554322 22222222 111111 11111100 0000000 Q ss_pred --ecCCceEEccccceeeeeeecccceeeeccC Q lcl|Aclame:pro 306 --RYSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) Q Consensus 306 --~~~~~~~vp~~~~t~Gv~ir~P~av~~~~GI 336 (336) .+.-...+-+..|+ |+.+.+|.||+++.|+ T Consensus 284 ~~f~~~~~~~r~~~~~-d~~v~~~~a~~~l~~~ 315 (320) T protein:vir:10 284 SLWQHNLVAVRVEAEY-AFHNNDKDAFVKLTNV 315 (320) T ss_pred hhhhcCcEEEEEEEee-ccEEecccceEEEEec Confidence 01112334455555 6677999999999999 No 24 >protein:vir:9574 Length: 300 # NCBI annotation: gp40 # Family: family:all:966 # MgeID: mge:171 # MgeName: SM1 # Cross-refs: genbank:acc:NP_862879;genbank:gi:32469471;genbank:GeneID:1461316 Probab=98.28 E-value=5.7e-08 Score=60.31 Aligned_cols=277 Identities=11% Similarity=-0.014 Sum_probs=152.5 Q ss_pred HHHhhhhhcccccccCcchHHHHHHHhhCceeeeeeccccchhhhcccccCCCcceeeEEEeeeecceeeEEeecccCCc Q lcl|Aclame:pro 31 YAMDAADLSPHLSSTGSSGIPNYLTTYVDPAVIDILVAPMKAAELVGESKKGDWTTLVAAFITAEPTTKVATYGDYSSDG 110 (336) Q Consensus 31 ~a~da~d~~~~l~t~~~~~i~~~l~~~idp~v~~~~~~~~~~~~l~~v~t~g~w~~~t~~~~~~e~~G~a~~ygd~~diP 110 (336) ||-.+ +.++. -+|..+. +++++.+...-....+.++.+.+. ....|++.+..+.|..+|...+.| T Consensus 1 ma~~t-~~~G~-------lip~~~~----~~ii~~l~~~s~i~~l~~~~~~~~---~~~~~p~~~~~~~a~wv~Eg~~~~ 65 (300) T protein:vir:95 1 MSEAQ-LSKGN-------LFNPELV----TKVINKVKGHSSIAKLSPQKPIPF---NGQREFVFDFDSDIDIVAENGKKT 65 (300) T ss_pred Ccccc-cCCcc-------eechhhH----HHHHHHHHhhhhhhhhcceeeccC---CceEEEEEecCcceEEeeCCcccc Confidence 33321 12221 2343333 345555555555555655544332 235788888888999999999999 Q ss_pred eeeeeeeeeeeeEEEEEEEEeeCHHHHHHHHhhCCCHHHHHHHHHHHHHHHhhcceEEeecc-----ccceEEEEecCCC Q lcl|Aclame:pro 111 DSGANINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASELNYSSALGLAKFLNGSYLFGVA-----GLENYGLINDPSL 185 (336) Q Consensus 111 ~~~~~~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~k~~aAr~a~e~~~n~~~~~Gd~-----~~g~~GllN~Pnl 185 (336) ..+.......-+.+.++....+|.+=+++......++.+.-....++++.+.+++-.++|+. +....|..+.+.. T Consensus 66 ~s~~~f~~v~l~~~k~~~~~~iS~ell~~~~d~~~~l~~~i~~~l~~aia~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~ 145 (300) T protein:vir:95 66 HGGVSLDPVTIVPLKVEYGARVSDEFLHASEEAKVDMLTDFVEGFSKKLARGLDIMSIHGINPRTKQASTIIGDNCFDKK 145 (300) T ss_pred cccccceeeEeeeEEEEEeehhhHHHhccCCCCHHHHHHHHHHHHHHHHHHHHHHhhhhcccCCCCCCcccccccccccc Confidence 99998888888889999988888664444445668888888899999999999999999952 3444555554433 Q ss_pred CcccccccccccccCHHHHHHHHHHHHHHHHHHhCCceecccccEEEecHHHHHhccc-CCCCCccHHHHHHHh-----C Q lcl|Aclame:pro 186 SAPITATTPWSGSPAVEAVVNEVVALFQVLQTQSQGIITQEDVLRMGLPPTAMSDLSK-TNQYGLAAAAKLKDI-----F 259 (336) Q Consensus 186 ~~~~~~~t~w~~~~t~~eI~~Di~~l~~~l~~~s~g~v~~~~p~tL~Lp~~~~~~L~~-~~~~~~Tvl~~l~~n-----~ 259 (336) ...+. .+ +....++||.+++..+... ++ .+..++|.|..+..|.+ .+..|..++.-.... . T Consensus 146 ~~~~~-~~------~~~~~~~~i~~~~~~~~~~-~~-----~~~~~vmn~~~~~~L~~lkd~~G~~i~~~~~~~~~~~~l 212 (300) T protein:vir:95 146 VTQTV-PF------KDTNPDESMEDAVGMIDGS-ER-----DITGAILDPIFTTALSKMKNAEGGKLYPELAWGGVPDAI 212 (300) T ss_pred cceee-cc------cccchHHHHHHHHHHhhhc-CC-----CccEEEECHHHHHHHHHhhccCCCeeccCccccCCCcee Confidence 22111 11 1112357788888776442 21 35679999999888864 344454443211111 1 Q ss_pred CccEEEEcccccCC-CCceEEEEEEeec------CCceEEEEcChhhhc--ccce-ecCCceEEccccceeeeeeecccc Q lcl|Aclame:pro 260 PKLEFVTIPEYDTA-SGRLVQLWAPRVE------GKDTATCGFTEKMRA--HSIE-RYSSYFRQKKSAGTWGAVIFRPFA 329 (336) Q Consensus 260 pnl~i~~~pel~~a-~G~~~~~~~~~~~------~~~~~~~~~p~~~r~--l~~~-~~~~~~~vp~~~~t~Gv~ir~P~a 329 (336) -++.++........ .+.+..+++-... -..-.++.+-..-.. -++. ...-.+-+-++.|+ |+.|++|.| T Consensus 213 ~G~Pv~~s~~v~~~~~~~~~~~~~GDf~~~~~~~~~~~~~~~v~~~~~~d~~~~~~f~~~~v~~r~~~r~-d~~v~~~~a 291 (300) T protein:vir:95 213 NGLAVDKNRTVSYSQTDPKNTAIVGDFETMFKWGYAKEVPMEIIKYGDPDNSGRDLKGYNQIYIRCEAYI-GWGIMDAAS 291 (300) T ss_pred cceeeEEecCCCCCCCCCccEEEEeeccceEEEEEecccEEEEeeccCCCCcchhhhhcCcEEEEEEEee-cceeecccc Confidence 12233222222222 2222222221110 000011111100000 0000 01112444556666 556777999 Q ss_pred eeeeccC Q lcl|Aclame:pro 330 VAQMIGV 336 (336) Q Consensus 330 v~~~~GI 336 (336) |+++.|. T Consensus 292 ~~~l~~~ 298 (300) T protein:vir:95 292 FARIVKT 298 (300) T ss_pred eEEEecC Confidence 9999999 No 25 >protein:vir:99920 Length: 311 # NCBI annotation: gp7 # Family: family:all:966 # MgeID: mge:1611 # MgeName: Halo # Cross-refs: genbank:acc:YP_655524;genbank:gi:109392294;genbank:GeneID:4157089 Probab=98.27 E-value=8.3e-08 Score=59.41 Aligned_cols=280 Identities=14% Similarity=0.046 Sum_probs=156.4 Q ss_pred HHHhhhhhcccccccCcchHHHHHHHhhCceeeeeeccccchhhhcccccCCCcceeeEEEeeeecceeeEEeecccCCc Q lcl|Aclame:pro 31 YAMDAADLSPHLSSTGSSGIPNYLTTYVDPAVIDILVAPMKAAELVGESKKGDWTTLVAAFITAEPTTKVATYGDYSSDG 110 (336) Q Consensus 31 ~a~da~d~~~~l~t~~~~~i~~~l~~~idp~v~~~~~~~~~~~~l~~v~t~g~w~~~t~~~~~~e~~G~a~~ygd~~diP 110 (336) || .+++.....+|..+.+ +|++.+.+......+..+...+. ....|++....+.+..+|....+| T Consensus 1 Ma--------t~tt~~g~~vP~~~~~----~ii~~~~~~s~l~~~~~~i~~~~---~~~~~p~~~~~~~a~wv~Eg~~~~ 65 (311) T protein:vir:99 1 MA--------TFGTGNLKNLPRNIAD----GMVKDVVQGSTVAVLSARKPQRF---GNEDIITFNGRPKAEFVGEGQQKS 65 (311) T ss_pred Cc--------eecCCCceeccHHHHH----HHHHHHHhhchhhhhcceeeccC---CceEEEEEeCCceeEEeecCcccc Confidence 21 2333334446655543 34444444444555554433222 235788888888999999999999 Q ss_pred eeeeeeeeeeeeEEEEEEEEeeCHHHHHHHHhhCCCHHHHHHHHHHHHHHHhhcceEEeecc---ccceEEEEecCCCCc Q lcl|Aclame:pro 111 DSGANINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASELNYSSALGLAKFLNGSYLFGVA---GLENYGLINDPSLSA 187 (336) Q Consensus 111 ~~~~~~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~k~~aAr~a~e~~~n~~~~~Gd~---~~g~~GllN~Pnl~~ 187 (336) ..+....+..-..+.++..+..|.+=++.+-....+|...-.....+++.+.+++-.++|+. +.+..|+.|-..... T Consensus 66 ~~~~~f~~v~l~~~k~~~~~~iS~ell~~~~d~~~~l~~~i~~~la~ai~~~~d~~~l~G~g~~~g~~~~g~~~~~~~~~ 145 (311) T protein:vir:99 66 STTGEFDFVTSTPKKAQVTMRFNEEVQWADEDYQLGVLQTLSEAGAEALARALDLGLYHRINPLTGTVIPGWSNYLGAAS 145 (311) T ss_pred cccceeeEEEEeeEEEEEeehhhHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHhhcccCcccCcccccccccccccc Confidence 99988888888888998888888664444456678899999999999999999999999986 344455554322211 Q ss_pred ccccccccccccCHHHHHHHHHHHHHHHHHHhCCceecccccEEEecHHHHHhccc-CCCCCccHHHHHHH--------h Q lcl|Aclame:pro 188 PITATTPWSGSPAVEAVVNEVVALFQVLQTQSQGIITQEDVLRMGLPPTAMSDLSK-TNQYGLAAAAKLKD--------I 258 (336) Q Consensus 188 ~~~~~t~w~~~~t~~eI~~Di~~l~~~l~~~s~g~v~~~~p~tL~Lp~~~~~~L~~-~~~~~~Tvl~~l~~--------n 258 (336) ... + ....+......||.+++..+...... ..++.++|.+..+..|.+ .+..|.-+++-... - T Consensus 146 ~~~--~--~~~~~~~~~~~~i~~~~~~~~~~~~~----~~~~~~vmn~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~l~G 217 (311) T protein:vir:99 146 KRV--E--LTADTIANPDLAIEAAVGLLVANGHP----TPVNGLALHPSIAWGLSTARYTDGRKKFPELGLGIGVSSFEG 217 (311) T ss_pred cee--e--ccccccchhHHHHHHHHHHHhhhccC----CCccEEEEcHHHHHHHHhhhccCCCeeecCcccCCCCceecc Confidence 111 1 11122344567788887776655432 245669999999888854 33334333221100 0 Q ss_pred CCccEEEEcccccCC--------CCceEEEEEEeecCCceEEEEcChhh--hccc----ce----ecCCceEEcccccee Q lcl|Aclame:pro 259 FPKLEFVTIPEYDTA--------SGRLVQLWAPRVEGKDTATCGFTEKM--RAHS----IE----RYSSYFRQKKSAGTW 320 (336) Q Consensus 259 ~pnl~i~~~pel~~a--------~G~~~~~~~~~~~~~~~~~~~~p~~~--r~l~----~~----~~~~~~~vp~~~~t~ 320 (336) +|-+.-..+|.-... .+....+++-+.. +-..+.+.... .... -. ...-..-+-|+.|++ T Consensus 218 ~Pv~~s~~i~~~~~~~~~~~~~~~~~~~~~~~Gdf~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~r~~~r~d 295 (311) T protein:vir:99 218 IDASVSDTVNGGDEADPDDEDLDAARAVRGIVGDFA--NGIHWGVQRDIPVELIKYGDPDGQGDLKRHNQIALRLEIVYG 295 (311) T ss_pred eeeEeecccccccccccccchhhccCcceEEEeecc--ccEEEEEecCceEEEeecCCCCcchhhhhcCcEEEEEEEeec Confidence 121111112221111 1222333332211 11111111111 1110 00 111235677899999 Q ss_pred eeeeecccceeeeccC Q lcl|Aclame:pro 321 GAVIFRPFAVAQMIGV 336 (336) Q Consensus 321 Gv~ir~P~av~~~~GI 336 (336) |. |++|.+++..++. T Consensus 296 ~~-v~~~~~v~~~~~~ 310 (311) T protein:vir:99 296 WY-VFTDRFVVIENAV 310 (311) T ss_pred ce-ecChhHeeeeccc Confidence 97 5679888888888 No 26 >protein:vir:2504 Length: 305 # NCBI annotation: major capsid subunit gp9 # Family: family:all:507 # MgeID: mge:53 # MgeName: TM4 # Cross-refs: genbank:acc:NP_569745;genbank:gi:18496895;genbank:GeneID:932268 Probab=98.21 E-value=1.7e-07 Score=57.64 Aligned_cols=274 Identities=10% Similarity=0.027 Sum_probs=145.6 Q ss_pred hhhhhccccccc-CcchHHHHHHHhhCceeeeeeccccchhhhcccccCCCcceeeEEEeeeecceeeEEeeccc----- Q lcl|Aclame:pro 34 DAADLSPHLSST-GSSGIPNYLTTYVDPAVIDILVAPMKAAELVGESKKGDWTTLVAAFITAEPTTKVATYGDYS----- 107 (336) Q Consensus 34 da~d~~~~l~t~-~~~~i~~~l~~~idp~v~~~~~~~~~~~~l~~v~t~g~w~~~t~~~~~~e~~G~a~~ygd~~----- 107 (336) .| ..+++ ....+|..+. ++|++.+...-....+..+.+.+. .+..+++.+..+.+..+|... T Consensus 1 ma-----~~t~~~gg~liP~~~~----~~Ii~~~~~~s~l~~l~~~~~~~~---~~~~~p~~~~~~~a~wv~E~~~~~~~ 68 (305) T protein:vir:25 1 MA-----DISRAEVASLIQEAYS----DTLLAAAKQGSTVLSAFQNVNMGT---KTTHLPVLATLPEADWVGESATDPKG 68 (305) T ss_pred CC-----CccCCccceecCHHHH----HHHHHHHHhhchhhhhcceeeccC---CcEEEEEEeCCcceEEeecccccccc Confidence 01 11111 1222666555 344555555555666666544332 235777777777888886653 Q ss_pred CCceeeeeeeeeeeeEEEEEEEEeeCHHHHHHHHhhCCCHHHHHHHHHHHHHHHhhcceEEeeccccceEEEEecCCCCc Q lcl|Aclame:pro 108 SDGDSGANINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASELNYSSALGLAKFLNGSYLFGVAGLENYGLINDPSLSA 187 (336) Q Consensus 108 diP~~~~~~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~k~~aAr~a~e~~~n~~~~~Gd~~~g~~GllN~Pnl~~ 187 (336) ++|..+.......-..+.++..+.+|.+=++ ....++.+.-.....+++.+.+++-.++|+.... |+.+...++. T Consensus 69 ~~~~s~~~f~~i~~~~~k~~~~~~is~ell~---ds~~~~~~~i~~~l~~~~a~~~d~a~~~G~g~~~--~~~~~~~~~~ 143 (305) T protein:vir:25 69 VKPTSKVTWANRTLVAEEIAVIIPVHENVID---DATVAVLTEVAELGGQAIGKKLDQAVIFGTDKPA--SWVSPALIPA 143 (305) T ss_pred cccccccceeeEEeeeEEEEEeehhhHHHHh---cchHHHHHHHHHHHHHHHHHHHhhhheeccCCCC--Cccccccccc Confidence 3577777777778888888888888875443 3346789999999999999999999999997532 3333222222 Q ss_pred ccc---cccccccccCHHHHHHHHHHHHHHHHHHhCCceecccccEEEecHHHHHhccc-CCCCCccHHHHHHHhCCccE Q lcl|Aclame:pro 188 PIT---ATTPWSGSPAVEAVVNEVVALFQVLQTQSQGIITQEDVLRMGLPPTAMSDLSK-TNQYGLAAAAKLKDIFPKLE 263 (336) Q Consensus 188 ~~~---~~t~w~~~~t~~eI~~Di~~l~~~l~~~s~g~v~~~~p~tL~Lp~~~~~~L~~-~~~~~~Tvl~~l~~n~pnl~ 263 (336) ... ....+....+..++++++..+...+.... ..++.++|.+..+..|.+ .+..|.-++. -...-++. T Consensus 144 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------~~~~~~v~~~~~~~~l~~lkd~~G~~i~~--~~~l~G~P 215 (305) T protein:vir:25 144 AVTAGQAVEVVGGVANESDIVGATNRAAKAVASAG------WAPDTLLSSLALRYEVANIRDANGNPVFR--DDSFAGFR 215 (305) T ss_pred cccccccccccccchhhhHHHHHHHHHHHhhhhcc------cccceeEecHHHHHHHHHhhccCCceeec--CCcccccc Confidence 111 11222223334567777776666553321 245679999999888854 3444443321 00111122 Q ss_pred EEEcccccCCCCceEEEEEE-------eecCCceEEEEcChhh--hcc--cce-ecCCceEEccccceeeeeeeccccee Q lcl|Aclame:pro 264 FVTIPEYDTASGRLVQLWAP-------RVEGKDTATCGFTEKM--RAH--SIE-RYSSYFRQKKSAGTWGAVIFRPFAVA 331 (336) Q Consensus 264 i~~~pel~~a~G~~~~~~~~-------~~~~~~~~~~~~p~~~--r~l--~~~-~~~~~~~vp~~~~t~Gv~ir~P~av~ 331 (336) +.-.......++....++.+ ...+ ..+.+-... ... +.. ...-.+.+-++.|++ ..|.+|.||+ T Consensus 216 v~~~~~~~~~~~~~~~~~gd~s~~~i~~~~~---~~i~~~~~~~~~~~~~~~~~~~~~~~~~R~~~r~~-~~v~~p~a~v 291 (305) T protein:vir:25 216 TFFNRNGAWDADAAIEVIADSSRVKIGVRQD---ITVKFLDQATLGTGENQINLAERDMVALRLKARFA-YVLGVSATAQ 291 (305) T ss_pred eEEcCccCCCCCccEEEEEecceEEEEEecC---eEEEEeeeeeeecCCceeeeeecCcEEEEEEEeec-ceeeCcccEE Confidence 22111111112221212211 1111 011111000 000 000 111234455667775 4577899999 Q ss_pred eeccC Q lcl|Aclame:pro 332 QMIGV 336 (336) Q Consensus 332 ~~~GI 336 (336) .++|+ T Consensus 292 ~~~~~ 296 (305) T protein:vir:25 292 GANKT 296 (305) T ss_pred EEccc Confidence 99999 No 27 >protein:vir:1638 Length: 298 # NCBI annotation: Structural protein # Family: family:all:966 # MgeID: mge:33 # MgeName: r1t # Cross-refs: genbank:acc:NP_695059;genbank:gi:23455750;genbank:GeneID:955469 Probab=98.18 E-value=1.8e-07 Score=57.60 Aligned_cols=274 Identities=11% Similarity=0.047 Sum_probs=149.5 Q ss_pred HHHhhhhhcccccccCcchHHHHHHHhhCceeeeeeccccchhhhcccccCCCcceeeEEEeeeecceeeEEeecccCCc Q lcl|Aclame:pro 31 YAMDAADLSPHLSSTGSSGIPNYLTTYVDPAVIDILVAPMKAAELVGESKKGDWTTLVAAFITAEPTTKVATYGDYSSDG 110 (336) Q Consensus 31 ~a~da~d~~~~l~t~~~~~i~~~l~~~idp~v~~~~~~~~~~~~l~~v~t~g~w~~~t~~~~~~e~~G~a~~ygd~~diP 110 (336) || +.....+|..+. .++++.+.+......+.++.+.+. ....+++.+..+.|..+|...++| T Consensus 1 ma-----------~~gG~lvp~~~~----~~ii~~~~~~s~i~~l~~~~~~~~---~~~~ip~~~~~~~a~~v~E~~~~~ 62 (298) T protein:vir:16 1 MV-----------LNKGTLFDPTLV----TDLISKVAGKSSIARLSAQKPIPF---NGEKVFTFTMDSEIDVVAESGKKT 62 (298) T ss_pred Cc-----------ccCcceechhHH----HHHHHHHHhhhhhhhhcceeeccC---CceEEEEEecCcceEEecCCcccc Confidence 22 222222333332 233444444444555555443332 335678888889999999999999 Q ss_pred eeeeeeeeeeeeEEEEEEEEeeCHHHHHHHHhhCCCHHHHHHHHHHHHHHHhhcceEEeecc-----ccceEEEEecCCC Q lcl|Aclame:pro 111 DSGANINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASELNYSSALGLAKFLNGSYLFGVA-----GLENYGLINDPSL 185 (336) Q Consensus 111 ~~~~~~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~k~~aAr~a~e~~~n~~~~~Gd~-----~~g~~GllN~Pnl 185 (336) ..+.......-..+.++....+|.+=++.+.....++.+.-+...++++.+.++.-.++|.. ..+..|+....+. T Consensus 63 ~~~~~f~~v~l~~~k~a~~~~iS~ell~~s~d~~~~l~~~i~~~la~ai~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~ 142 (298) T protein:vir:16 63 HGGVTLAPQTMVPIKVEYGARISDEFMYASDEEKINILQEFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSK 142 (298) T ss_pred ccccceeEEEEeeeeEEEeehhhHHHhhcCcccHHHHHHHHHHHHHHHHHHHHHHHhhccccCCCCcccccccccccccc Confidence 99988888888899999888888776666666678888888889999999999999999953 1223333322211 Q ss_pred CcccccccccccccCHHHHHHHHHHHHHHHHHHhCCceecccccEEEecHHHHHhccc-CCCCCccHHHHHHH-----hC Q lcl|Aclame:pro 186 SAPITATTPWSGSPAVEAVVNEVVALFQVLQTQSQGIITQEDVLRMGLPPTAMSDLSK-TNQYGLAAAAKLKD-----IF 259 (336) Q Consensus 186 ~~~~~~~t~w~~~~t~~eI~~Di~~l~~~l~~~s~g~v~~~~p~tL~Lp~~~~~~L~~-~~~~~~Tvl~~l~~-----n~ 259 (336) ...... . .......++||.+++..+...- ..+..++|.++.+..|.+ .+..|.-++.-.-. .. T Consensus 143 ~~~~~~----~-~~~~~~~~~~i~~~~~~~~~~~------~~~~~~vmn~~~~~~l~~lkd~~G~~i~~~~~~~~~~~~l 211 (298) T protein:vir:16 143 VTQKVE----A-PRGIADPNGAIENAVELLTGVD------ADVTGIAINPSFRSALAKQKDLQDNALFPELKWGATPDTI 211 (298) T ss_pred cccccc----c-ccccccHHHHHHHHHHHhhhcC------CCccEEEEcHHHHHHHHHhhccCCCeeecCcccCCCCcee Confidence 111110 1 1112446788999988775421 135579999999888864 33334433321100 01 Q ss_pred CccEEEEcccccC-CCCceEEEEEEeecCCceEEEEcChhh--hcccc-eecC--------CceEEccccceeeeeeecc Q lcl|Aclame:pro 260 PKLEFVTIPEYDT-ASGRLVQLWAPRVEGKDTATCGFTEKM--RAHSI-ERYS--------SYFRQKKSAGTWGAVIFRP 327 (336) Q Consensus 260 pnl~i~~~pel~~-a~G~~~~~~~~~~~~~~~~~~~~p~~~--r~l~~-~~~~--------~~~~vp~~~~t~Gv~ir~P 327 (336) -++.++......+ +.+.+..+++-... +...+.+.+.+ ...+. ...+ -....-|+.| .|..+++| T Consensus 212 ~G~PV~~~~~v~~~~~~~~~~~~~GDfs--~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~v~~ra~~r-~d~~v~~~ 288 (298) T protein:vir:16 212 NGLPVDVNKTVSDMSLTQRDRAIIGDFA--NGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELF-LGWGILDA 288 (298) T ss_pred cceeeEEecccccccCCCccEEEEeecc--ceEEEEEecCceEEEeeccCCcCcchhhhhcCcEEEEEEEE-EccEeecc Confidence 1222332222222 22233333332211 00111111110 01110 0000 0122233333 56678899 Q ss_pred cceeeeccC Q lcl|Aclame:pro 328 FAVAQMIGV 336 (336) Q Consensus 328 ~av~~~~GI 336 (336) .||+++.|. T Consensus 289 ~a~~~l~~a 297 (298) T protein:vir:16 289 TKFARVTEA 297 (298) T ss_pred cceEEEeec Confidence 999999999 No 28 >protein:vir:8187 Length: 311 # NCBI annotation: gp7 # Family: family:all:966 # MgeID: mge:153 # MgeName: Che9d # Cross-refs: genbank:acc:NP_817980;genbank:gi:29566414;genbank:GeneID:2700968 Probab=98.18 E-value=1.6e-07 Score=57.79 Aligned_cols=278 Identities=12% Similarity=-0.021 Sum_probs=146.9 Q ss_pred HhhhhhcccccccCcchHHHHHHHhhCceeeeeeccccchhhhcccccCCCcceeeEEEeeeecceeeEEeecccCCcee Q lcl|Aclame:pro 33 MDAADLSPHLSSTGSSGIPNYLTTYVDPAVIDILVAPMKAAELVGESKKGDWTTLVAAFITAEPTTKVATYGDYSSDGDS 112 (336) Q Consensus 33 ~da~d~~~~l~t~~~~~i~~~l~~~idp~v~~~~~~~~~~~~l~~v~t~g~w~~~t~~~~~~e~~G~a~~ygd~~diP~~ 112 (336) |-+. ++ ...-+|..+. .+|++.+.+.-....+.++.+.+. ....+++.+..+.+..++.+..+|.. T Consensus 1 mat~------~~-gg~lvP~~~~----~~ii~~~~~~s~i~~~~~~i~~~~---~~~~~p~~~~~~~a~wv~Eg~~~~~~ 66 (311) T protein:vir:81 1 MVAL------AT-GTFQLPKHLV----PGVWQKAQGQSVLARLSMAEPQEF---GEQQYMTLTAPPRGEVVGEGAQKSES 66 (311) T ss_pred Ccee------cC-CceEcchhHH----HHHHHHHHhcchhhhhcceeecCC---CceEEEEEeCCceeEEeecCcccccc Confidence 2111 11 1123455444 344454444445555555543222 24678888888999999999999999 Q ss_pred eeeeeeeeeeEEEEEEEEeeCHHHHHHHHhhCCCHHHHHHHHHHHHHHHhhcceEEeecc---ccceEEEEecCCCCccc Q lcl|Aclame:pro 113 GANINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASELNYSSALGLAKFLNGSYLFGVA---GLENYGLINDPSLSAPI 189 (336) Q Consensus 113 ~~~~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~k~~aAr~a~e~~~n~~~~~Gd~---~~g~~GllN~Pnl~~~~ 189 (336) +.......-+.+.++....+|.+=++...-...+|.+.-+...++++.+.++.-.++|+. +....|+++.. .... T Consensus 67 ~~~f~~v~l~~~kl~~~~~iS~ell~~~~d~~~~l~~~i~~~la~ai~~~~d~a~l~G~~~~~~~~~~gi~~~~--~~~~ 144 (311) T protein:vir:81 67 TATFAPVTAIPRKVQVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSPAKI--LDTT 144 (311) T ss_pred cceeeEEEEeeEEEEEeehhhHHHhhcCcccHHHHHHHHHHHHHHHHHHHHHHhhhccccCCCCcccccccccc--cccc Confidence 988888888888888877777654444445667788989999999999999999999974 33445666631 1111 Q ss_pred ccccccccccCHHHHHHHHHHHHHHHHHHhCCceecccccEEEecHHHHHhccc-CCCCCccHHHHH-HHh-------CC Q lcl|Aclame:pro 190 TATTPWSGSPAVEAVVNEVVALFQVLQTQSQGIITQEDVLRMGLPPTAMSDLSK-TNQYGLAAAAKL-KDI-------FP 260 (336) Q Consensus 190 ~~~t~w~~~~t~~eI~~Di~~l~~~l~~~s~g~v~~~~p~tL~Lp~~~~~~L~~-~~~~~~Tvl~~l-~~n-------~p 260 (336) ... -.++.+...+..+|.+++..+... + ..+..++|.+..+..|.+ .+..|.-++.-. ... +| T Consensus 145 ~~~--~~~~~~~~~~~~~i~~~~~~~~~~-~-----~~~~~~vmn~~~~~~l~~lkd~~G~~l~~~~~~~~~~~tl~G~P 216 (311) T protein:vir:81 145 NIV--ELTTGTSATPDLAVEAAVGLVLGD-N-----LSPDGVALDNTFSFMLATQRDSQGRKLYPELGFGTDVASFAGLN 216 (311) T ss_pred eee--eecccccchHHHHHHHHHHHhhhc-C-----CCceEEEEcHHHHHHHHhhhccCCCeeecCccccCCCceeccee Confidence 110 011222233456677777666432 2 246679999999888854 233333232111 000 12 Q ss_pred ccEEEEcccccC----------CCCceEEEEEEeec-------CCceEEEEcChhhhcccc-eecCCceEEccccceeee Q lcl|Aclame:pro 261 KLEFVTIPEYDT----------ASGRLVQLWAPRVE-------GKDTATCGFTEKMRAHSI-ERYSSYFRQKKSAGTWGA 322 (336) Q Consensus 261 nl~i~~~pel~~----------a~G~~~~~~~~~~~-------~~~~~~~~~p~~~r~l~~-~~~~~~~~vp~~~~t~Gv 322 (336) =+.-..+|.-.. .++....+++-+.. .+-+.++ .++....-.. -...-...+-|..|+++ T Consensus 217 v~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~gDfs~~~i~~~~~~~~~~-~~~~~~~~~~~~~~~~~v~~r~~~r~d~- 294 (311) T protein:vir:81 217 AAVSDTVRGGPEAVTASTGVYRTTNPNVKAIAGDFSAFRWGVQVSIPLEL-IEFGDPDGLGDLKRQNQIAIRAEVVYGI- 294 (311) T ss_pred EEecccccccccccccccchhcccCCccEEEEEecccEEEEEeccceEEE-eccCCCCcchhhhhcCcEEEEEEEEecc- Confidence 110011211100 01111222221110 0111111 1111000000 01112245556666655 Q ss_pred eeecccceeeeccC Q lcl|Aclame:pro 323 VIFRPFAVAQMIGV 336 (336) Q Consensus 323 ~ir~P~av~~~~GI 336 (336) .+.+|.||+++.|. T Consensus 295 ~v~~~~a~~~l~~a 308 (311) T protein:vir:81 295 GIMSTDAFAVVRDA 308 (311) T ss_pred EeecccceEEEEee Confidence 55679999999999 No 29 >protein:vir:108211 Length: 318 # NCBI annotation: gp9 # Family: family:all:6420 # MgeID: mge:2004 # MgeName: Giles # Cross-refs: genbank:acc:YP_001552338;genbank:gi:160700658;genbank:GeneID:5758931 Probab=98.17 E-value=2.4e-08 Score=62.39 Aligned_cols=277 Identities=13% Similarity=0.163 Sum_probs=145.9 Q ss_pred CchHHHHHHhhhcceeccchhhhccchhHHHHHhhhhhcccccccCcchHHHHHH--HhhCceeeeeeccccchhhhccc Q lcl|Aclame:pro 1 MRDAQRIQNLARAGVILPRSVQNVSTPLTEYAMDAADLSPHLSSTGSSGIPNYLT--TYVDPAVIDILVAPMKAAELVGE 78 (336) Q Consensus 1 ~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~a~da~d~~~~l~t~~~~~i~~~l~--~~idp~v~~~~~~~~~~~~l~~v 78 (336) |-.- -||. +....+.+ ++..+|. ++|..++.+..=+.+-++.||-. T Consensus 1 ~~~~--------~~i~------------------s~~~~~~i------tv~~ll~~P~~I~~~i~e~~~~~~iad~lf~~ 48 (318) T protein:vir:10 1 MTAP--------TGIV------------------SVSDGPAI------TVRELVGNPLWIPTALKKMMVNQFISESLFRN 48 (318) T ss_pred CCCC--------Ccce------------------eeecCCce------ehHHhhCCchhHHHHHHHHHhccchhhhhhhc Confidence 0000 0110 00001111 1112222 23333333333333344445432 Q ss_pred ccCCCcceeeEEEeeeecc---eeeEEeecccCCceeeeeeeeeee-eEEEEEEEEeeCHHHHHHHHhhCCCHHHHHHHH Q lcl|Aclame:pro 79 SKKGDWTTLVAAFITAEPT---TKVATYGDYSSDGDSGANINYPQR-QSYFFQTWTRWGERELEMAGAGRVDLASELNYS 154 (336) Q Consensus 79 ~t~g~w~~~t~~~~~~e~~---G~a~~ygd~~diP~~~~~~~~~~~-~v~~~~~~~~y~~~El~~A~~~g~~l~~~k~~a 154 (336) .+.-....+.|.-.++. |.+...+-+..+|+++....+++. .+..++.++++|.+.+.+ .+++...+...+ T Consensus 49 --~~a~~~~~v~f~~~~p~~~~~d~e~VaEggEiP~~~~~~G~~~ia~~~K~G~~~~vS~Em~~~---n~~~~v~r~~~~ 123 (318) T protein:vir:10 49 --GGANPNGVVAYNEGNPSFLEDDVADVAEFGEIPVSAGARGLPRTAFAVKKALGVRVSKEMIDE---NRVGAVNDQMLQ 123 (318) T ss_pred --ccccccceeEEEecccccccCcHhhccCcccccccCCCCCchhhhhhehhccceeccHHHHhh---cChhHHHHHHHH Confidence 12212234455433333 666666777889999976655555 457899999999876554 457778888888 Q ss_pred HHHHHHHhhcceEEeeccccceEEEEecCCCCcccccccccccc--cCHH-----HHH----HHHHHHHHHHHHHhCCce Q lcl|Aclame:pro 155 SALGLAKFLNGSYLFGVAGLENYGLINDPSLSAPITATTPWSGS--PAVE-----AVV----NEVVALFQVLQTQSQGII 223 (336) Q Consensus 155 Ar~a~e~~~n~~~~~Gd~~~g~~GllN~Pnl~~~~~~~t~w~~~--~t~~-----eI~----~Di~~l~~~l~~~s~g~v 223 (336) +++++-++.|+.+ +..|.+++++. ..+++.|... ...+ |.+ .|++.+...=.....|. T Consensus 124 l~Nti~r~~d~~a---------~dal~sa~t~~-~~~s~~w~~~~~~~~d~~~A~e~v~~a~~~~~~a~~~~~~~~~GY- 192 (318) T protein:vir:10 124 LRNTFIRANDRSA---------KALLQSPIVPT-LAVPTAWDNGGKVRTDIAIAIEQISTAAPTAYPAGVGSSDEYFGF- 192 (318) T ss_pred HHHHHHHHHHHHH---------HHHHhcccccc-ccCCcCCCCcccccccchhhhhhhhhhhhhhhhhhhhhhhhccCc- Confidence 8888888877763 34566666543 4445555421 1111 111 12221111111111122 Q ss_pred ecccccEEEecHHHHHhcccCCC----C---CccHHHHH--HHhCC----ccEEEEcccccCCCCceEEEEEEeecCCce Q lcl|Aclame:pro 224 TQEDVLRMGLPPTAMSDLSKTNQ----Y---GLAAAAKL--KDIFP----KLEFVTIPEYDTASGRLVQLWAPRVEGKDT 290 (336) Q Consensus 224 ~~~~p~tL~Lp~~~~~~L~~~~~----~---~~Tvl~~l--~~n~p----nl~i~~~pel~~a~G~~~~~~~~~~~~~~~ 290 (336) .|+||+|-|..+..|.+-.. | +..+...+ ..+|| +|+++..|-+.. +.+ ++++ .+.+ T Consensus 193 ---~pdtIVlhP~~~~~l~~n~~~~~~y~~~a~~~~~~~~~tg~~~g~~lGl~vi~s~~~p~---~~a-lvlq---~g~v 262 (318) T protein:vir:10 193 ---IPDTIVMHYALLPILMDNENFMKVYERNANYVSTAPDWTGNFPGSVMGLNVIRSRTFPI---DRV-LIME---RGTV 262 (318) T ss_pred ---cceeeEECHHHHHHHhcchhhhhhhhccchhhhhcccccccccceeeceEEeecCccCC---Cee-EEEe---cCCc Confidence 59999999999999964211 1 11111111 12243 367777676662 222 3333 3667 Q ss_pred EEEEcChhhhccccee--------cCCceEEccccceeeeeeecccceeeeccC Q lcl|Aclame:pro 291 ATCGFTEKMRAHSIER--------YSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) Q Consensus 291 ~~~~~p~~~r~l~~~~--------~~~~~~vp~~~~t~Gv~ir~P~av~~~~GI 336 (336) +.+..++++.+.+--. .+.+|....+..+ ...|..|+|+..++|| T Consensus 263 G~~~d~~pl~~t~~~~egg~~~g~~~~s~~~~~~~~~-~~~V~~PkA~~~itgi 315 (318) T protein:vir:10 263 GFYSDTRPLQFTALYPEGNGPNGGPTESYRADASHKR-ALAVDQPKAALWLTGI 315 (318) T ss_pred ceeeccccceeeecccCCCCCCCCcchhhheehheee-eeeeeCcceeEEEeec Confidence 7777788777666443 3345666655443 5778999999999999 No 30 >protein:vir:95763 Length: 297 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1578 # MgeName: SMP # Cross-refs: genbank:acc:YP_950590;genbank:gi:119953785;genbank:GeneID:5076833 Probab=98.11 E-value=3e-07 Score=56.33 Aligned_cols=275 Identities=8% Similarity=-0.029 Sum_probs=151.1 Q ss_pred HHHhhhhhcccccc-cCcchHHHHHHHhhCceeeeeeccccchhhhcccccCCCcceeeEEEeeeecceeeEEeecccCC Q lcl|Aclame:pro 31 YAMDAADLSPHLSS-TGSSGIPNYLTTYVDPAVIDILVAPMKAAELVGESKKGDWTTLVAAFITAEPTTKVATYGDYSSD 109 (336) Q Consensus 31 ~a~da~d~~~~l~t-~~~~~i~~~l~~~idp~v~~~~~~~~~~~~l~~v~t~g~w~~~t~~~~~~e~~G~a~~ygd~~di 109 (336) |..-.+|+...+++ ...+.+|..+.+ ++++.+...-....+.++...+. .....+++......+..++.+.++ T Consensus 1 m~~~~~~~~~~~~t~~~~~lvP~~~~~----~ii~~~~~~s~l~~~~~~~~~~~--~~~~~~~~~~~~~~a~~v~Eg~~~ 74 (297) T protein:vir:95 1 MTVQTFNPENVLVSQKKDGTLHKEFTD----IIMKEVAQNSLVMQLGQYQEMEG--EQEKTVYVQTDGISAYWVNETEKI 74 (297) T ss_pred CCccccccccccccCCCcceechhHHH----HHHHHHHhhchhhhhcceeecCC--CccEEEEEEcCCceeEEeecCccc Confidence 21111122222222 223346666653 44555555555555555543221 123456666777788899999999 Q ss_pred ceeeeeeeeeeeeEEEEEEEEeeCHHHHHHHHhhCCCHHHHHHHHHHHHHHHhhcceEEeeccccceEEEEecCCCCccc Q lcl|Aclame:pro 110 GDSGANINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASELNYSSALGLAKFLNGSYLFGVAGLENYGLINDPSLSAPI 189 (336) Q Consensus 110 P~~~~~~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~k~~aAr~a~e~~~n~~~~~Gd~~~g~~GllN~Pnl~~~~ 189 (336) |..+........+.+.++..+.++.+-++.+. .++.+.-....++++.+.+++-.++|+...+-.|+++...... T Consensus 75 ~~~~~~f~~v~l~~~k~~~~~~is~ell~ds~---~~l~~~i~~~la~ai~~~~d~a~l~G~g~~~~~gi~~~~~~~~-- 149 (297) T protein:vir:95 75 KTDKPEVVPVTLKAHKLGIILVTSREALNYTW---KKFFEDMKPQIVEAFYKKIDEAGLLGHDTPFANSVAKAAKDAN-- 149 (297) T ss_pred cccccceeEEEEeeEEEEEeehhhHHHHhcCH---HHHHHHHHHHHHHHHHHHHHHHHhcccCCcccccccccccccc-- Confidence 99998888889999999999999886666443 5788888899999999999999999999888888887433211 Q ss_pred ccccccccccCHHHHHHHHHHHHHHHHHHhCCceecccccEEEecHHHHHhccc-CCCCCccHHHHHHHhCCc---cEEE Q lcl|Aclame:pro 190 TATTPWSGSPAVEAVVNEVVALFQVLQTQSQGIITQEDVLRMGLPPTAMSDLSK-TNQYGLAAAAKLKDIFPK---LEFV 265 (336) Q Consensus 190 ~~~t~w~~~~t~~eI~~Di~~l~~~l~~~s~g~v~~~~p~tL~Lp~~~~~~L~~-~~~~~~Tvl~~l~~n~pn---l~i~ 265 (336) +...++.| ++||.+++.++...- ..+..++|.++.+..|.+ .+..|.-++ ...... +.++ T Consensus 150 ---~~~~~~~t----~~~i~~~~~~l~~~~------~~~~~~v~~~~~~~~L~~l~d~~G~~i~---~~~~~~l~G~Pv~ 213 (297) T protein:vir:95 150 ---KVIGGPIN----YDNILKLQDALYDAD------VEPNAFVSKIQNRSALREARDGNKVSIY---DKAANTIDGITTV 213 (297) T ss_pred ---eecccccC----HHHHHHHHHHhhhcc------CCcCEEEEcHHHHHHHHHhhccCCceee---cCCCCcccceeeE Confidence 11111223 566777777775431 135689999999998864 233333222 111111 1222 Q ss_pred EcccccCCC-----CceEEEEEEeecCCceEEEEcChhhhc-ccce--------ecCCceEEccccceeeeeeeccccee Q lcl|Aclame:pro 266 TIPEYDTAS-----GRLVQLWAPRVEGKDTATCGFTEKMRA-HSIE--------RYSSYFRQKKSAGTWGAVIFRPFAVA 331 (336) Q Consensus 266 ~~pel~~a~-----G~~~~~~~~~~~~~~~~~~~~p~~~r~-l~~~--------~~~~~~~vp~~~~t~Gv~ir~P~av~ 331 (336) ..+.-.... |+..++++-...+ .++.+-..-.. .... ...-.....++.|.++ .+++|.||+ T Consensus 214 ~~~~~~~~~~~~~~gd~s~~~~~~~~~---~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~-~v~~~~a~~ 289 (297) T protein:vir:95 214 DLKSARFEKGDLLAGDFDNLIYGVPYN---ITYKISEEGQISTITNADGTPINLFEQEMIAIRATMDIAV-MITKTDAFA 289 (297) T ss_pred eecCCCCCCceEEEEecccEEEEEecC---eEEEEeeccccccccccCccchhhhhcCcEEEEEEEEecc-EeecccceE Confidence 111111111 2222222211111 11111111100 0000 0112344455555555 556699999 Q ss_pred eeccC Q lcl|Aclame:pro 332 QMIGV 336 (336) Q Consensus 332 ~~~GI 336 (336) .+..- T Consensus 290 ~l~~a 294 (297) T protein:vir:95 290 KLTPA 294 (297) T ss_pred EEeec Confidence 98877 No 31 >protein:vir:8420 Length: 477 # NCBI annotation: gp15 # Family: family:all:21 # MgeID: mge:155 # MgeName: Omega # Cross-refs: genbank:acc:NP_818316;genbank:gi:29566752;genbank:GeneID:1260033 Probab=98.10 E-value=3.5e-07 Score=55.94 Aligned_cols=317 Identities=14% Similarity=0.074 Sum_probs=144.2 Q ss_pred CchH---HHHHHhhh--cceeccchhhhc--------cchhHHHHHhhhhhcccccccCcch---H-HHHHHHhhCceee Q lcl|Aclame:pro 1 MRDA---QRIQNLAR--AGVILPRSVQNV--------STPLTEYAMDAADLSPHLSSTGSSG---I-PNYLTTYVDPAVI 63 (336) Q Consensus 1 ~~~~---~~~~~l~~--~g~~~~~~~~~~--------~~~~~~~a~da~d~~~~l~t~~~~~---i-~~~l~~~idp~v~ 63 (336) .+.. ..+..+.. .+-........+ .................+++.+..| + |.++. .+|+ T Consensus 103 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~lv~~~~~~----~~ii 178 (477) T protein:vir:84 103 YEKGNGQSYFRDLAMQTVGMADEPAKERLRRHMVDVESDKEIRKIAKVGEEYRDLDRNGGTGGYAVPPLWMM----NRFI 178 (477) T ss_pred hhhhHHHHHHHHHHHHHhhhhhhHHHHHHHHHHhhhhhhhhHHHHHHhhhhhccccccCCCcceeeccchhH----HHHH Confidence 0000 00000000 000000000000 0000000000111122233222222 2 33332 3455 Q ss_pred eeeccccchhhhcccccCCCcceeeEEEeeeecceee-EEeeccc-----CCceeeeeeeeeeeeEEEEEEEEeeCHHHH Q lcl|Aclame:pro 64 DILVAPMKAAELVGESKKGDWTTLVAAFITAEPTTKV-ATYGDYS-----SDGDSGANINYPQRQSYFFQTWTRWGEREL 137 (336) Q Consensus 64 ~~~~~~~~~~~l~~v~t~g~w~~~t~~~~~~e~~G~a-~~ygd~~-----diP~~~~~~~~~~~~v~~~~~~~~y~~~El 137 (336) +.+-+......+++..+... ....+.++..+..+.. ...+.+. +.|..+.......-+.+.++..+.+|.+=| T Consensus 179 ~~l~~~~~i~~~~~~~~~~~-~~~~~~ip~~~~~~~~a~~~~Eg~~~~~~~~~~s~~~f~~i~~~~~k~~~~~~iS~ell 257 (477) T protein:vir:84 179 ELARAGRTYANLCPTEPLPG-GTSSINIPKILTGTSTAIQAADNAALTAPSAHEVDLTDGFVQANVKTIAGQQGIAIQLL 257 (477) T ss_pred HHhhhcchHHHhhceeeecC-CcceeEEEEEecCcceeeeeccCcccccccccccccceeeEEEeeeeEEeeeHHHHHHH Confidence 55555555566666543322 2234566665544433 3445542 447777777777778888888877775544 Q ss_pred HHHHhhCCCHHHHHHHHHHHHHHHhhcceEEeecc-ccceEEEEecCCCCcccccc-cccccccCHHHHHHHHHHHHHHH Q lcl|Aclame:pro 138 EMAGAGRVDLASELNYSSALGLAKFLNGSYLFGVA-GLENYGLINDPSLSAPITAT-TPWSGSPAVEAVVNEVVALFQVL 215 (336) Q Consensus 138 ~~A~~~g~~l~~~k~~aAr~a~e~~~n~~~~~Gd~-~~g~~GllN~Pnl~~~~~~~-t~w~~~~t~~eI~~Di~~l~~~l 215 (336) +.+ ..++.+--....+.++...++.-.++|++ +....|++|.+++.....+. +.-| +..+..+++|..++..+ T Consensus 258 ~ds---~~~l~~~i~~~l~~~~~~~~d~~~l~G~Gt~~~p~Gi~~~~~~~~~~~~~~~~t~--~~~~~~~~~i~~~~~~~ 332 (477) T protein:vir:84 258 DQA---AVSVDEFVFRDLAADYANKLNVQVISGTGSNNQVVGVRATAGITQVTATSAGSAL--EKHQIIYQKIADAIQRV 332 (477) T ss_pred hcc---chhHHHHHHHHHHHHHHHHHHHHHhccCCCCCccceeeeccccccccccccccch--hhHHHHHHHHHHHHhhc Confidence 433 45788888888999999999999999997 45689999988775432221 1111 12344555555555544 Q ss_pred HHHhCCceecccccEEEecHHHHHhccc-CCCCCccHH----------HHHHHh--------CCccEEEEcccc---cCC Q lcl|Aclame:pro 216 QTQSQGIITQEDVLRMGLPPTAMSDLSK-TNQYGLAAA----------AKLKDI--------FPKLEFVTIPEY---DTA 273 (336) Q Consensus 216 ~~~s~g~v~~~~p~tL~Lp~~~~~~L~~-~~~~~~Tvl----------~~l~~n--------~pnl~i~~~pel---~~a 273 (336) .... ...+...+|-|..+..|.+ .+..|.-++ .++..+ .-++.++..+.+ .++ T Consensus 333 ~~~~-----~~~~~~~v~~~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~~~~~~~~~~~~~~l~G~pVv~s~~~p~~~~~ 407 (477) T protein:vir:84 333 HTSR-----FLEPEVIVMHPRRWASFHAIFAGDDRPLIVPSGPGFNNLGVLTEVASQRVVGQMHGLPVVTDPTLPTTLGT 407 (477) T ss_pred cccc-----cCCccEEEEcHHHHHHHHHhhccCCCeeeecCcccccccccccccccccccchhcccceEecCcccccccc Confidence 3221 1234568888887777643 222222111 111111 112233333333 223 Q ss_pred CCceEEEEEEeecCCceEEEEcChhhhcccceecC-C--ceEEccccceeeeeeecccceeeeccC Q lcl|Aclame:pro 274 SGRLVQLWAPRVEGKDTATCGFTEKMRAHSIERYS-S--YFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) Q Consensus 274 ~G~~~~~~~~~~~~~~~~~~~~p~~~r~l~~~~~~-~--~~~vp~~~~t~Gv~ir~P~av~~~~GI 336 (336) ++....+++-.. .+..-..--+.++..+--... . .|++ .+......+|+|.||+.++|. T Consensus 408 ~~d~~~i~~gd~--~~~~i~~~~~~~~~~~~~~~~~~~~~~~v--~~~~~~~~~r~~~afv~~t~~ 469 (477) T protein:vir:84 408 GTDQDVIHVLRA--SDLALFESSVRMRALQETRAENLSVLLQV--YGYLAFTAARFPQSVVEIGGT 469 (477) T ss_pred cCCcceEEEEEe--ceEEEEeeceeEEeccccccccceeeeee--hhhhhhhhhccccceEEeecc Confidence 344333333222 111111111122222211111 1 1222 112233577899999999999 No 32 >protein:vir:94673 Length: 419 # NCBI annotation: major capsid protein # Family: family:all:585 # MgeID: mge:1527 # MgeName: mu1/6 # Cross-refs: genbank:acc:YP_579208;genbank:gi:93007444;genbank:GeneID:5076792 Probab=98.07 E-value=1.1e-06 Score=53.28 Aligned_cols=307 Identities=11% Similarity=0.085 Sum_probs=151.8 Q ss_pred CchHHH--------------------------------HHHhh---hcceeccchhhhccchhHHHHHhhhhhccccccc Q lcl|Aclame:pro 1 MRDAQR--------------------------------IQNLA---RAGVILPRSVQNVSTPLTEYAMDAADLSPHLSST 45 (336) Q Consensus 1 ~~~~~~--------------------------------~~~l~---~~g~~~~~~~~~~~~~~~~~a~da~d~~~~l~t~ 45 (336) ++...+ ++.+. +.|..- .....+. ......+ ...++..+++ T Consensus 56 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~-~~~~~~~--~~~~~~~~~~ 131 (419) T protein:vir:94 56 LRTAPPAPKGPADGGTPLTPAEAGTFRSLAQRFADSDGLREYRARDKRGQFQ-VEMRDID-PNRLLSR--DAPAGTITNP 131 (419) T ss_pred HHHHHHHHHHHhhhhccccccccccccchhhhhhhHHHHHHHHHhhhhhhhh-HHHHHHH-HHHhhcc--ccccccccCC Confidence 111111 00000 000000 0000000 0000000 0122223333 Q ss_pred CcchHHHHHHHhhCceeeeeeccccchhhhcccccCCC----cceee-EEEeeeecceeeEEeecccCCceeeeeeeeee Q lcl|Aclame:pro 46 GSSGIPNYLTTYVDPAVIDILVAPMKAAELVGESKKGD----WTTLV-AAFITAEPTTKVATYGDYSSDGDSGANINYPQ 120 (336) Q Consensus 46 ~~~~i~~~l~~~idp~v~~~~~~~~~~~~l~~v~t~g~----w~~~t-~~~~~~e~~G~a~~ygd~~diP~~~~~~~~~~ 120 (336) ...-+|..+...| ......+....+++.+.+... +..++ ....+....+.+.+.+.+...|..+......+ T Consensus 132 ~~~~~p~~~~~~i----~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~i~ 207 (419) T protein:vir:94 132 NVPHLPQLVPGIV----PTTPDLPLLVADLLDQQNADYNVLEYIRDTSGTAGAGSTWNKAAVVPEGTAKPQSTLSFDTIT 207 (419) T ss_pred cccccchhhhHHH----HHHHhhhhhhhhcceeeeccCCceeeeeeccccccccccCcccceecCCccccccccceeeEE Confidence 3344565555444 222233333444444432221 11111 11222333456778888888999998888889 Q ss_pred eeEEEEEEEEeeCHHHHHHHHhhCCCHHHHHHHHHHHHHHHhhcceEEeeccccceEEEEecCCCCcccccccccccccC Q lcl|Aclame:pro 121 RQSYFFQTWTRWGERELEMAGAGRVDLASELNYSSALGLAKFLNGSYLFGVAGLENYGLINDPSLSAPITATTPWSGSPA 200 (336) Q Consensus 121 ~~v~~~~~~~~y~~~El~~A~~~g~~l~~~k~~aAr~a~e~~~n~~~~~Gd~~~g~~GllN~Pnl~~~~~~~t~w~~~~t 200 (336) ...+.++..+.+|.+=++-+. ++.+.-....++++.+.+|+-.++|++.....|++|++.+....+... +...| T Consensus 208 ~~~~k~~~~~~is~ell~d~~----~l~~~i~~~la~a~~~~~d~aii~G~G~~~p~Gi~~~~~~~~~~~~~~--~~~~t 281 (419) T protein:vir:94 208 TTLKTVAHWLPITRQAADDNS----QLMGYIQGRLTYGLRFLRDRQLLNGNGSTEMQGILTTPGIGTYQQPKP--TAPAT 281 (419) T ss_pred eeeeeEEEeehhhHHHHHhHH----HHHHHHHHHHHHHHHHHHHHHHHhccCcccccceeccccccccccccc--ccccc Confidence 999999999999976555432 477877888888888899998999999999999999988765433322 22344 Q ss_pred HHHHHHHHHHHHHHHHHHhCCceecccccEEEecHHHHHhcccC-CCCCccH-HH-HHHH----hCCccEEEEcccccCC Q lcl|Aclame:pro 201 VEAVVNEVVALFQVLQTQSQGIITQEDVLRMGLPPTAMSDLSKT-NQYGLAA-AA-KLKD----IFPKLEFVTIPEYDTA 273 (336) Q Consensus 201 ~~eI~~Di~~l~~~l~~~s~g~v~~~~p~tL~Lp~~~~~~L~~~-~~~~~Tv-l~-~l~~----n~pnl~i~~~pel~~a 273 (336) ....++||.+++..+...- -.+..++|.++.+..|... +..|-.+ +. .+.. .+-++.++....+... T Consensus 282 ~~~~~~~l~~~~~~~~~~~------~~~~~~v~n~~~~~~l~~~k~~~~~~~~~~~~~~~~~~~~l~G~pV~~~~~~~~~ 355 (419) T protein:vir:94 282 DEPPLVDIRRAKTVAEIAG------FPPDGVVVHPQDWESIELDQAPGSGVFRVIANVQGEATPRIWGLNVVSTVAIAQG 355 (419) T ss_pred cchhHHHHHHHHHhhhhcc------CCCCEEEEcHHHHHHHHHHhhcCCCceeecCCcccCCCccccceeeEEcCCCCCc Confidence 5678999999998885421 1366899999998877532 2212111 10 0110 0011222222222110 Q ss_pred ---CCceE--EEEEEeecCCceEEEEcChhhhcccc-eecCCceEEccccceeeeeeecccceeeeccC Q lcl|Aclame:pro 274 ---SGRLV--QLWAPRVEGKDTATCGFTEKMRAHSI-ERYSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) Q Consensus 274 ---~G~~~--~~~~~~~~~~~~~~~~~p~~~r~l~~-~~~~~~~~vp~~~~t~Gv~ir~P~av~~~~GI 336 (336) -|... +.++++ .+ ..+.+ -.+.. ....-....-+..|++|. ++.|-||+++..- T Consensus 356 ~~~~gd~~~~~~~~~~-~~---~~v~~----~~~~~~~~~~~~~~~r~~~r~d~~-v~~~~a~~~~~~~ 415 (419) T protein:vir:94 356 TALVGGFRQGATLWSR-QG---ITVLM----TDSHADFFTANTLVILAEFRANLA-VYQPKAFVRVTFA 415 (419) T ss_pred cEEEeeccceEEEEEe-cc---eEEEE----eccccchhhcCcEEEEEEEeeccE-EeccccEEEEEec Confidence 02211 122211 11 11110 00000 000122344555666554 5779999998877 No 33 >protein:vir:4226 Length: 326 # NCBI annotation: observed 35.2Kd protein # Family: family:all:507 # MgeID: mge:89 # MgeName: L5 # Cross-refs: genbank:acc:NP_039681;swissprot:sw:q05223;genbank:gi:9625447;uniprot:Q05223;genbank:GeneID:2942929 Probab=98.00 E-value=6e-07 Score=54.68 Aligned_cols=291 Identities=10% Similarity=-0.022 Sum_probs=150.2 Q ss_pred hhccchhHHHHHhhhhhcccc--cccCcc-hHHHHHHHhhCceeeeeeccccchhhhcccccCCCcceeeEEEeeeecce Q lcl|Aclame:pro 22 QNVSTPLTEYAMDAADLSPHL--SSTGSS-GIPNYLTTYVDPAVIDILVAPMKAAELVGESKKGDWTTLVAAFITAEPTT 98 (336) Q Consensus 22 ~~~~~~~~~~a~da~d~~~~l--~t~~~~-~i~~~l~~~idp~v~~~~~~~~~~~~l~~v~t~g~w~~~t~~~~~~e~~G 98 (336) ++.+..+. +-...++....+ .+.+.+ -+|..+. .++++.+........+..+...+. .+..|++.+..+ T Consensus 1 ~~~~~~r~-~~~~~~~e~~a~~~~~~~~g~~ip~~~~----~~ii~~~~~~s~i~~~~~~~~~~~---~~~~~p~~~~~~ 72 (326) T protein:vir:42 1 MAVNPDRT-TPFLGVNDPKVAQTGDSMFEGYLEPEQA----QDYFAEAEKISIVQQFAQKIPMGT---TGQKIPHWTGDV 72 (326) T ss_pred CCCCccch-hhhcCcchhhheeccccCCcceechhhH----HHHHHHHHhcchhhhhcceeeccC---CceEEEEEeCCc Confidence 11111111 100011111112 112222 2454433 344555555555555555543332 335778888888 Q ss_pred eeEEeecccCCceeeeeeeeeeeeEEEEEEEEeeCHHHHHHHHhhCCCHHHHHHHHHHHHHHHhhcceEEeeccccceEE Q lcl|Aclame:pro 99 KVATYGDYSSDGDSGANINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASELNYSSALGLAKFLNGSYLFGVAGLENYG 178 (336) Q Consensus 99 ~a~~ygd~~diP~~~~~~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~k~~aAr~a~e~~~n~~~~~Gd~~~g~~G 178 (336) .+..++.+..+|..+...+...-..+.++..+.+|.+=++.+ ..++.+.-....++++.+.+++-.++|+...+-.| T Consensus 73 ~a~~v~Eg~~~~~~~~~f~~i~~~~~k~~~~v~iS~ell~~s---~~~~~~~i~~~l~~a~~~~~d~a~l~G~gs~~p~g 149 (326) T protein:vir:42 73 SASWIGEGDMKPITKGNMTSQTIAPHKIATIFVASAETVRAN---PANYLGTMRTKVATAFAMAFDNAAINGTDSPFPTF 149 (326) T ss_pred ceEEecCCccccccccceeEEEEeeEEEEEeehhhHHHHhcC---HHHHHHHHHHHHHHHHHHHHHHHhhcccCCCcccc Confidence 888899999999999888888888999999888887555433 36788888888899999999999999998777788 Q ss_pred EEecCCCCcccccccccccccCHHHHHHHHH--HHHHHHHHHhCCceecccccEEEecHHHHHhccc-CCCCCccHHHHH Q lcl|Aclame:pro 179 LINDPSLSAPITATTPWSGSPAVEAVVNEVV--ALFQVLQTQSQGIITQEDVLRMGLPPTAMSDLSK-TNQYGLAAAAKL 255 (336) Q Consensus 179 llN~Pnl~~~~~~~t~w~~~~t~~eI~~Di~--~l~~~l~~~s~g~v~~~~p~tL~Lp~~~~~~L~~-~~~~~~Tvl~~l 255 (336) ++|.+.....+...+.. .+.+...+|+. .++..+.... .....++|.+..+..|.+ .+..|.-++.-- T Consensus 150 i~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~------~~~a~~v~n~~~~~~L~~lkd~~G~~l~~~~ 220 (326) T protein:vir:42 150 LAQTTKEVSLVDPDGTG---SNADLTVYDAVAVNALSLLVNAG------KKWTHTLLDDITEPILNGAKDKSGRPLFIES 220 (326) T ss_pred ccccccccceeeccccc---ccccchhHHHHHHHHHhhhhhhc------cCccEEEEeHHHHHHHHHhhccCCceeeccc Confidence 88865432222222211 11122233332 2222221111 134568999999888854 333333222110 Q ss_pred HHh-----CCccEEEEcccccC---CC-------CceEEEEEEeecCCceEEEEcC-hhhhcccc----e----ecCCce Q lcl|Aclame:pro 256 KDI-----FPKLEFVTIPEYDT---AS-------GRLVQLWAPRVEGKDTATCGFT-EKMRAHSI----E----RYSSYF 311 (336) Q Consensus 256 ~~n-----~pnl~i~~~pel~~---a~-------G~~~~~~~~~~~~~~~~~~~~p-~~~r~l~~----~----~~~~~~ 311 (336) ..+ ++.-++...|-.-. .. |+..++++-.+.+. .+.+- +....... . ...-.. T Consensus 221 ~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~Gd~s~~~~~~~~~~---~v~~~~e~~~~~~~~~~~~~~~~~~~d~~ 297 (326) T protein:vir:42 221 TYTEENSPFRLGRIVARPTILSDHVASGTVVGYQGDFRQLVWGQVGGL---SFDVTDQATLNLGTPQAPNFVSLWQHNLV 297 (326) T ss_pred cccCccccccCceeeeeeEEEcCCCCCCceEEEEeecceEEEEEecce---EEEEeecceeeecccccccchhhhhcCcE Confidence 000 11122332332211 11 22222232222111 11111 11100000 0 011235 Q ss_pred EEccccceeeeeeecccceeeeccC Q lcl|Aclame:pro 312 RQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) Q Consensus 312 ~vp~~~~t~Gv~ir~P~av~~~~GI 336 (336) .+.+..|. ++.+.+|.||+++.++ T Consensus 298 ~~r~~~~~-d~~v~~~~a~~~l~~~ 321 (326) T protein:vir:42 298 AVRVEAEY-AFHCNDKDAFVKLTNV 321 (326) T ss_pred EEEEEEEe-ccEEecccceEEEeec Confidence 55666776 5567999999999999 No 34 >protein:vir:96392 Length: 324 # NCBI annotation: ORF011 # Family: family:all:507 # MgeID: mge:1613 # MgeName: 53 # Cross-refs: genbank:acc:YP_239648;genbank:gi:66395381;genbank:GeneID:5132868 Probab=97.92 E-value=1.4e-06 Score=52.73 Aligned_cols=294 Identities=10% Similarity=0.016 Sum_probs=153.6 Q ss_pred CchHHHHHHhhhcceeccchhhhccchhHHHHHhhhhhcccccccCcchHHHHHHHhhCceeeeeeccccchhhhccccc Q lcl|Aclame:pro 1 MRDAQRIQNLARAGVILPRSVQNVSTPLTEYAMDAADLSPHLSSTGSSGIPNYLTTYVDPAVIDILVAPMKAAELVGESK 80 (336) Q Consensus 1 ~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~a~da~d~~~~l~t~~~~~i~~~l~~~idp~v~~~~~~~~~~~~l~~v~t 80 (336) |+..+..+.-.+ . |-...... -..+|. .. .........||..+. .+|++.....-....++++.+ T Consensus 1 ~~~~~~~~~~~~--~-~~~~~~~~------~~~~a~-~~-~~~~~~~~~iP~~~~----~~ii~~~~~~s~l~~l~~~~~ 65 (324) T protein:vir:96 1 MEQTQKLKLNLQ--H-FASNNVKP------QVFNPD-NV-MMHEKKDGTLMNEFT----TPILQEVMENSKIMQLGKYEP 65 (324) T ss_pred CCcchhhhHHHH--H-HHHHhhhh------hhhccc-cc-cccCcCccccchhHH----HHHHHHHHhhchhhhhcceee Confidence 777666554222 1 11111010 011110 00 011223344665554 344555555555666666544 Q ss_pred CCCcceeeEEEeeeecceeeEEeecccCCceeeeeeeeeeeeEEEEEEEEeeCHHHHHHHHhhCCCHHHHHHHHHHHHHH Q lcl|Aclame:pro 81 KGDWTTLVAAFITAEPTTKVATYGDYSSDGDSGANINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASELNYSSALGLA 160 (336) Q Consensus 81 ~g~w~~~t~~~~~~e~~G~a~~ygd~~diP~~~~~~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~k~~aAr~a~e 160 (336) ... .++.+++.+..+.+..++.+..+|..+..........+.++....+|.+=++.+ ..++.+.-.....+++. T Consensus 66 ~~~---~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~~~is~ell~ds---~~~l~~~i~~~la~ai~ 139 (324) T protein:vir:96 66 MEG---TEKKFTFWADKPGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYT---YSQFFEEMKPMIAEAFY 139 (324) T ss_pred ccC---CceEEEEEecCcceeEecCCccccccccceeEEEEeeEEEEEeehhhHHHHhcc---hHHHHHHHHHHHHHHHH Confidence 332 346788888888999999999999999988888999999998888887655544 35788888888888888 Q ss_pred HhhcceEEeeccccc-eEEEEecCCCCcccccccccccccCHHHHHHHHHHHHHHHHHHhCCceecccccEEEecHHHHH Q lcl|Aclame:pro 161 KFLNGSYLFGVAGLE-NYGLINDPSLSAPITATTPWSGSPAVEAVVNEVVALFQVLQTQSQGIITQEDVLRMGLPPTAMS 239 (336) Q Consensus 161 ~~~n~~~~~Gd~~~g-~~GllN~Pnl~~~~~~~t~w~~~~t~~eI~~Di~~l~~~l~~~s~g~v~~~~p~tL~Lp~~~~~ 239 (336) +.+++-+++|+.... ..|+++........ . .++. -++||.+++..+... + ..+..++|.++.+. T Consensus 140 ~~~d~a~l~G~g~~~~~~gi~~~~~~~~~~--~---~~~~----t~~~i~~~~~~l~~~--~----~~~~~~vmn~~~~~ 204 (324) T protein:vir:96 140 KKFDEAGILNQGNNPFGKSIAQSIEKTNKV--I---KGDF----TQDNIIDLEALLEDD--E----LEANAFISKTQNRS 204 (324) T ss_pred HHHHHHHhccCCCCCcCcccccccccccee--c---cccc----cHHHHHHHHHhhhhc--c----CCCCEEEEcHHHHH Confidence 888888888876432 24455432221111 1 1112 366777777766432 1 24668999999998 Q ss_pred hcccC-CCCCccHHHHHHHhCC---ccEEEEcccccCCC-----CceEEEEEEeecCCceEEEEcChhhhcc-------- Q lcl|Aclame:pro 240 DLSKT-NQYGLAAAAKLKDIFP---KLEFVTIPEYDTAS-----GRLVQLWAPRVEGKDTATCGFTEKMRAH-------- 302 (336) Q Consensus 240 ~L~~~-~~~~~Tvl~~l~~n~p---nl~i~~~pel~~a~-----G~~~~~~~~~~~~~~~~~~~~p~~~r~l-------- 302 (336) .|.+. +..|..++. ...-+ ++-++..+-..... |...++++-.+.+ ..+.+-..-... T Consensus 205 ~L~~l~d~~G~~~~~--~~~~~~l~G~PV~~~~~~~~~~~~~~~gd~~~~~~g~~~~---~~i~~~~~~~~~~~~~~~~~ 279 (324) T protein:vir:96 205 LLRKIVDPETKERIY--DRNSDSLDGLPVVNLKSSNLKRGELITGDFDKLIYGIPQL---IEYKIDETAQLSTVKNEDGT 279 (324) T ss_pred HHHHhhccCCCeeec--CCCCCcccceeeEeeCCCCCCcceEEEEecceEEEEEecC---cEEEEeeccccccccccccc Confidence 88643 333332221 01111 11222222111111 1111111111111 111111100000 Q ss_pred cce-ecCCceEEccccceeeeeeecccceeeeccC Q lcl|Aclame:pro 303 SIE-RYSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) Q Consensus 303 ~~~-~~~~~~~vp~~~~t~Gv~ir~P~av~~~~GI 336 (336) +.. ...-....-++.|. |+.+++|.||+++.|. T Consensus 280 ~~~~f~~d~~~~r~~~r~-d~~v~~~~A~~~l~~a 313 (324) T protein:vir:96 280 PVNLFEQDMVALRATMHV-ALHIADDKAFAKLVPA 313 (324) T ss_pred chhhhhcCcEEEEEEEEE-ccEEecccceEEEecc Confidence 000 00112344555566 5556669999999999 No 35 >protein:vir:78830 Length: 324 # NCBI annotation: major head protein # Family: family:all:507 # MgeID: mge:1858 # MgeName: 80alpha # Cross-refs: genbank:acc:YP_001285361;genbank:gi:148717889;genbank:GeneID:5246961 Probab=97.92 E-value=1.4e-06 Score=52.73 Aligned_cols=294 Identities=10% Similarity=0.016 Sum_probs=153.6 Q ss_pred CchHHHHHHhhhcceeccchhhhccchhHHHHHhhhhhcccccccCcchHHHHHHHhhCceeeeeeccccchhhhccccc Q lcl|Aclame:pro 1 MRDAQRIQNLARAGVILPRSVQNVSTPLTEYAMDAADLSPHLSSTGSSGIPNYLTTYVDPAVIDILVAPMKAAELVGESK 80 (336) Q Consensus 1 ~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~a~da~d~~~~l~t~~~~~i~~~l~~~idp~v~~~~~~~~~~~~l~~v~t 80 (336) |+..+..+.-.+ . |-...... -..+|. .. .........||..+. .+|++.....-....++++.+ T Consensus 1 ~~~~~~~~~~~~--~-~~~~~~~~------~~~~a~-~~-~~~~~~~~~iP~~~~----~~ii~~~~~~s~l~~l~~~~~ 65 (324) T protein:vir:78 1 MEQTQKLKLNLQ--H-FASNNVKP------QVFNPD-NV-MMHEKKDGTLMNEFT----TPILQEVMENSKIMQLGKYEP 65 (324) T ss_pred CCcchhhhHHHH--H-HHHHhhhh------hhhccc-cc-cccCcCccccchhHH----HHHHHHHHhhchhhhhcceee Confidence 777666554222 1 11111010 011110 00 011223344665554 344555555555666666544 Q ss_pred CCCcceeeEEEeeeecceeeEEeecccCCceeeeeeeeeeeeEEEEEEEEeeCHHHHHHHHhhCCCHHHHHHHHHHHHHH Q lcl|Aclame:pro 81 KGDWTTLVAAFITAEPTTKVATYGDYSSDGDSGANINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASELNYSSALGLA 160 (336) Q Consensus 81 ~g~w~~~t~~~~~~e~~G~a~~ygd~~diP~~~~~~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~k~~aAr~a~e 160 (336) ... .++.+++.+..+.+..++.+..+|..+..........+.++....+|.+=++.+ ..++.+.-.....+++. T Consensus 66 ~~~---~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~~~is~ell~ds---~~~l~~~i~~~la~ai~ 139 (324) T protein:vir:78 66 MEG---TEKKFTFWADKPGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYT---YSQFFEEMKPMIAEAFY 139 (324) T ss_pred ccC---CceEEEEEecCcceeEecCCccccccccceeEEEEeeEEEEEeehhhHHHHhcc---hHHHHHHHHHHHHHHHH Confidence 332 346788888888999999999999999988888999999998888887655544 35788888888888888 Q ss_pred HhhcceEEeeccccc-eEEEEecCCCCcccccccccccccCHHHHHHHHHHHHHHHHHHhCCceecccccEEEecHHHHH Q lcl|Aclame:pro 161 KFLNGSYLFGVAGLE-NYGLINDPSLSAPITATTPWSGSPAVEAVVNEVVALFQVLQTQSQGIITQEDVLRMGLPPTAMS 239 (336) Q Consensus 161 ~~~n~~~~~Gd~~~g-~~GllN~Pnl~~~~~~~t~w~~~~t~~eI~~Di~~l~~~l~~~s~g~v~~~~p~tL~Lp~~~~~ 239 (336) +.+++-+++|+.... ..|+++........ . .++. -++||.+++..+... + ..+..++|.++.+. T Consensus 140 ~~~d~a~l~G~g~~~~~~gi~~~~~~~~~~--~---~~~~----t~~~i~~~~~~l~~~--~----~~~~~~vmn~~~~~ 204 (324) T protein:vir:78 140 KKFDEAGILNQGNNPFGKSIAQSIEKTNKV--I---KGDF----TQDNIIDLEALLEDD--E----LEANAFISKTQNRS 204 (324) T ss_pred HHHHHHHhccCCCCCcCcccccccccccee--c---cccc----cHHHHHHHHHhhhhc--c----CCCCEEEEcHHHHH Confidence 888888888876432 24455432221111 1 1112 366777777766432 1 24668999999998 Q ss_pred hcccC-CCCCccHHHHHHHhCC---ccEEEEcccccCCC-----CceEEEEEEeecCCceEEEEcChhhhcc-------- Q lcl|Aclame:pro 240 DLSKT-NQYGLAAAAKLKDIFP---KLEFVTIPEYDTAS-----GRLVQLWAPRVEGKDTATCGFTEKMRAH-------- 302 (336) Q Consensus 240 ~L~~~-~~~~~Tvl~~l~~n~p---nl~i~~~pel~~a~-----G~~~~~~~~~~~~~~~~~~~~p~~~r~l-------- 302 (336) .|.+. +..|..++. ...-+ ++-++..+-..... |...++++-.+.+ ..+.+-..-... T Consensus 205 ~L~~l~d~~G~~~~~--~~~~~~l~G~PV~~~~~~~~~~~~~~~gd~~~~~~g~~~~---~~i~~~~~~~~~~~~~~~~~ 279 (324) T protein:vir:78 205 LLRKIVDPETKERIY--DRNSDSLDGLPVVNLKSSNLKRGELITGDFDKLIYGIPQL---IEYKIDETAQLSTVKNEDGT 279 (324) T ss_pred HHHHhhccCCCeeec--CCCCCcccceeeEeeCCCCCCcceEEEEecceEEEEEecC---cEEEEeeccccccccccccc Confidence 88643 333332221 01111 11222222111111 1111111111111 111111100000 Q ss_pred cce-ecCCceEEccccceeeeeeecccceeeeccC Q lcl|Aclame:pro 303 SIE-RYSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) Q Consensus 303 ~~~-~~~~~~~vp~~~~t~Gv~ir~P~av~~~~GI 336 (336) +.. ...-....-++.|. |+.+++|.||+++.|. T Consensus 280 ~~~~f~~d~~~~r~~~r~-d~~v~~~~A~~~l~~a 313 (324) T protein:vir:78 280 PVNLFEQDMVALRATMHV-ALHIADDKAFAKLVPA 313 (324) T ss_pred chhhhhcCcEEEEEEEEE-ccEEecccceEEEecc Confidence 000 00112344555566 5556669999999999 No 36 >protein:vir:41 Length: 299 # NCBI annotation: major capsid protein # Family: family:all:507 # MgeID: mge:2 # MgeName: A118 # Cross-refs: genbank:acc:NP_463467;swissprot:trembl:q9t1b7;genbank:gi:16798789;uniprot:Q9T1B7;genbank:GeneID:922353 Probab=97.91 E-value=9.8e-07 Score=53.52 Aligned_cols=276 Identities=8% Similarity=0.043 Sum_probs=149.4 Q ss_pred HHHhhhhhcccccccCcchHHHHHHHhhCceeeeeeccccchhhhcccccCCCcceeeEEEeeeecceeeEEeecccCCc Q lcl|Aclame:pro 31 YAMDAADLSPHLSSTGSSGIPNYLTTYVDPAVIDILVAPMKAAELVGESKKGDWTTLVAAFITAEPTTKVATYGDYSSDG 110 (336) Q Consensus 31 ~a~da~d~~~~l~t~~~~~i~~~l~~~idp~v~~~~~~~~~~~~l~~v~t~g~w~~~t~~~~~~e~~G~a~~ygd~~diP 110 (336) |.-+|+ ..-..+ ...+.||..+. .+|++.+...-....+..+.+.+. .+..+++.+. ..+..++.+.++| T Consensus 1 ~g~~a~-~~~~~~-~~~~~iP~~~~----~~ii~~~~~~s~l~~~~~~~~~~~---~~~~~~~~~~-~~a~~v~E~~~~~ 70 (299) T protein:vir:41 1 MGFNPD-TTTMQS-AKTGSIPINIS----EQIITGVKNGSAAMKLAKAVPMTK---PEEEFTFMSG-VGAFWVDEAERIQ 70 (299) T ss_pred CCcCCC-cccccC-CCceecchhHH----HHHHHHHHhcchhhhhceeeecCC---CcEEEEEEcC-CceeeeecCcccc Confidence 222221 110001 11223555444 234444444444555554433322 2234555443 5577888889999 Q ss_pred eeeeeeeeeeeeEEEEEEEEeeCHHHHHHHHhhCCCHHHHHHHHHHHHHHHhhcceEEeeccccceEEEEecCCCCcccc Q lcl|Aclame:pro 111 DSGANINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASELNYSSALGLAKFLNGSYLFGVAGLENYGLINDPSLSAPIT 190 (336) Q Consensus 111 ~~~~~~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~k~~aAr~a~e~~~n~~~~~Gd~~~g~~GllN~Pnl~~~~~ 190 (336) ..+............++..+.++.+=+.. ...++.+.-.....+++.+.+++-.++|+....-.|+++......... T Consensus 71 ~~~~~f~~v~l~~~k~~~~~~is~ell~d---s~~~~~~~i~~~l~~a~~~~~d~a~l~G~g~~~~~gil~~~~~~~~~~ 147 (299) T protein:vir:41 71 TSKPTFTKAKMRSKKMGVIIPTTKENLNY---SVTNFFSLMQAEIVEAFYKKFDQAVFTGVESPYNWNILKSATDASNLV 147 (299) T ss_pred ccccceeEEEEeeEEEEEeehhhHHHHhc---CHHHHHHHHHHHHHHHHHHHHHHHHhhcccCcccccccccccccceee Confidence 99999999999999999999998755543 336788889999999999999999999998887789988543321111 Q ss_pred cccccccccCHHHHHHHHHHHHHHHHHHhCCceecccccEEEecHHHHHhccc-CCCCCccHHHH-HHH---hCCccEEE Q lcl|Aclame:pro 191 ATTPWSGSPAVEAVVNEVVALFQVLQTQSQGIITQEDVLRMGLPPTAMSDLSK-TNQYGLAAAAK-LKD---IFPKLEFV 265 (336) Q Consensus 191 ~~t~w~~~~t~~eI~~Di~~l~~~l~~~s~g~v~~~~p~tL~Lp~~~~~~L~~-~~~~~~Tvl~~-l~~---n~pnl~i~ 265 (336) . .++. -++||.+++.++... + -.+..++|.++.+..|.+ .+..|.-++.= +.. .+-++.+. T Consensus 148 ~----~~~~----~~~~l~~~~~~l~~~--~----~~~~~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~~~~~l~G~PV~ 213 (299) T protein:vir:41 148 E----ETAN----KYDDLNEAIGLIEAE--D----LEPNGIATIRKQRVKYRSTKDGNGMPIFNTATSNGVDDVLGLPIA 213 (299) T ss_pred c----cccc----cHHHHHHHHHhhhcc--c----CCcCEEEEcHHHHHHHHHhhccCCceeecCCcCCCCceecceeeE Confidence 1 1112 367888888877532 1 136679999999988864 33333322210 000 00112233 Q ss_pred EcccccCCCCceEEEEEEeec------CCceEEEEcChh-hhccccee--------cCCceEEccccceeeeeeecccce Q lcl|Aclame:pro 266 TIPEYDTASGRLVQLWAPRVE------GKDTATCGFTEK-MRAHSIER--------YSSYFRQKKSAGTWGAVIFRPFAV 330 (336) Q Consensus 266 ~~pel~~a~G~~~~~~~~~~~------~~~~~~~~~p~~-~r~l~~~~--------~~~~~~vp~~~~t~Gv~ir~P~av 330 (336) ..+.+.. +++...+++-... .++ .++.+-.. ........ ..-...+.+..|+ |..+++|.|| T Consensus 214 ~~~~~~~-~~~~~~~~~gdfs~~~i~~~~~-~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~-d~~v~~~~A~ 290 (299) T protein:vir:41 214 YTPKYTF-GDKDISELVGDWNQAYYGILRG-VEYEILTEATLTTVADETGKPLNLAERDMAAIKATFEV-GFMVVKDEAF 290 (299) T ss_pred EecccCC-CCCceEEEEEecccEEEEEecC-cEEEEeecccccccccccccchhhhhcCcEEEEEEEEe-ccEEecccce Confidence 3333322 2222222221110 000 11111111 10000000 0113445666676 5566779999 Q ss_pred eeeccC Q lcl|Aclame:pro 331 AQMIGV 336 (336) Q Consensus 331 ~~~~GI 336 (336) +.+.+- T Consensus 291 ~~l~~~ 296 (299) T protein:vir:41 291 SAVQPK 296 (299) T ss_pred EEEEec Confidence 999999 No 37 >protein:vir:9759 Length: 303 # NCBI annotation: putative structural protein # Family: family:all:966 # MgeID: mge:175 # MgeName: 315.3 # Cross-refs: genbank:acc:NP_795521;genbank:gi:28876283;genbank:GeneID:1257824 Probab=97.88 E-value=1.2e-06 Score=53.08 Aligned_cols=282 Identities=11% Similarity=-0.021 Sum_probs=146.4 Q ss_pred HHHhhhhhcccccccCcchHHHHHHHhhCceeeeeeccccchhhhcccccCCCcceeeEEEeeeecceeeEEeecccCCc Q lcl|Aclame:pro 31 YAMDAADLSPHLSSTGSSGIPNYLTTYVDPAVIDILVAPMKAAELVGESKKGDWTTLVAAFITAEPTTKVATYGDYSSDG 110 (336) Q Consensus 31 ~a~da~d~~~~l~t~~~~~i~~~l~~~idp~v~~~~~~~~~~~~l~~v~t~g~w~~~t~~~~~~e~~G~a~~ygd~~diP 110 (336) ||-+ +....-||..+.+ +|++.+.+......+.++...+. .+..+++....+.+..++.+..+| T Consensus 1 m~t~---------t~gg~liP~~~~~----~ii~~l~~~s~i~~l~~~~~~~~---~~~~ip~~~~~~~a~wv~E~~~~~ 64 (303) T protein:vir:97 1 MGTE---------TSKASLFDKHLVS----DLINKVKGHSSLAKLSSQKPIPF---NGSKEFTFTLDSDIDVVAENGKKT 64 (303) T ss_pred Cccc---------CCCCeEcchhHHH----HHHHHHHhhchhhhhcceeecCC---CceEEEEEecCcceEEeecCcccc Confidence 2221 1222234544432 34444444455555555543322 345778888888999999999999 Q ss_pred eeeeeeeeeeeeEEEEEEEEeeCHHHHHHHHhhCCCHHHHHHHHHHHHHHHhhcceEEeeccccceEEEEecCCCCcccc Q lcl|Aclame:pro 111 DSGANINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASELNYSSALGLAKFLNGSYLFGVAGLENYGLINDPSLSAPIT 190 (336) Q Consensus 111 ~~~~~~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~k~~aAr~a~e~~~n~~~~~Gd~~~g~~GllN~Pnl~~~~~ 190 (336) ..+.......-+.+.++..+.+|.+=++.......++.+.-.....+++.+.++.-.++|+....-.+...-+... ... T Consensus 65 ~s~~~f~~v~l~~~kl~~~~~iS~ell~~~~d~~~~l~~~i~~~la~a~~~~ld~a~l~G~~~~~g~~~~~~~~~~-~~~ 143 (303) T protein:vir:97 65 HGGLSLEPVTIVPIKVEYGARLSDEFLYATEEEKIDILKAFNEGFAKKLARGIDLMAMHGINPRTKKASDVIGTNH-FDS 143 (303) T ss_pred ccccceeeEEeeeEEEEEeehhhHHHhhcCccchHHHHHHHHHHHHHHHHHHHHhhhhcccccCCccccccccccc-ccc Confidence 9998888888889999988888866444444567788889999999999999999999996432222211111000 000 Q ss_pred cccccccccCHHHHHHHHHHHHHHHHHHhCCceecccccEEEecHHHHHhccc-CCCCCccHH-HHHHHh-----CCccE Q lcl|Aclame:pro 191 ATTPWSGSPAVEAVVNEVVALFQVLQTQSQGIITQEDVLRMGLPPTAMSDLSK-TNQYGLAAA-AKLKDI-----FPKLE 263 (336) Q Consensus 191 ~~t~w~~~~t~~eI~~Di~~l~~~l~~~s~g~v~~~~p~tL~Lp~~~~~~L~~-~~~~~~Tvl-~~l~~n-----~pnl~ 263 (336) ..+.--..++.+..++||.+++..+... + ..+..++|.|+.+..|.+ .+..|.-++ .=+... .-++. T Consensus 144 ~~~~~~~~~~~~~~~~~i~~~~~~~~~~-~-----~~~~~~vmn~~~~~~L~~lkd~~g~~~~~~~~~~~~~~~~l~G~P 217 (303) T protein:vir:97 144 KVTQVVKFTESEDADANIEAAVNLIQGA-E-----GVVTGLAMDTEFSTALAKVTNGEMGPKMYPELAWGANPDSINGLK 217 (303) T ss_pred ccccccccccccchHHHHHHHHHHHhhc-C-----CCccEEEEcHHHHHHHHHhhccCCCeEEecCccCCCCCceeccee Confidence 0010001112344678999998877542 2 246679999998887753 333222111 000000 01122 Q ss_pred EE---EcccccCCCCceEEEEEEeecC------CceEEEEcChhhhcccce---ecCCceEEccccceeeeeeeccccee Q lcl|Aclame:pro 264 FV---TIPEYDTASGRLVQLWAPRVEG------KDTATCGFTEKMRAHSIE---RYSSYFRQKKSAGTWGAVIFRPFAVA 331 (336) Q Consensus 264 i~---~~pel~~a~G~~~~~~~~~~~~------~~~~~~~~p~~~r~l~~~---~~~~~~~vp~~~~t~Gv~ir~P~av~ 331 (336) ++ .+|.-.+.+.....+++-.... .+-.++.+.......... ...-..-+-++.|++ ..+++|.||+ T Consensus 218 v~~s~~v~~~~~~~~~~~~~~~Gdf~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~n~~~~r~~~r~~-~~v~~p~af~ 296 (303) T protein:vir:97 218 SSVNTTVGAGADEAESKDLVIIGDFESMFKWGYAKQIPMEIIKYGDPDNSGKDLKGYNQIYLRAEAYIG-WGILDAKSFA 296 (303) T ss_pred eEEecccCCccccCCCccEEEEeeccccEEEEEecCcEEEEeeccCCCCcchhhhhcCcEEEEEEEEec-cEeecccceE Confidence 22 2332222221111122111100 000111111100000000 001123344555554 5567799999 Q ss_pred eeccC Q lcl|Aclame:pro 332 QMIGV 336 (336) Q Consensus 332 ~~~GI 336 (336) ++... T Consensus 297 ~l~~~ 301 (303) T protein:vir:97 297 RVTKG 301 (303) T ss_pred EeeCC Confidence 99998 No 38 >protein:vir:97148 Length: 324 # NCBI annotation: ORF010 # Family: family:all:507 # MgeID: mge:1654 # MgeName: 85 # Cross-refs: genbank:acc:YP_239726;genbank:gi:66394880;genbank:GeneID:5130881 Probab=97.81 E-value=2e-06 Score=51.85 Aligned_cols=293 Identities=11% Similarity=0.027 Sum_probs=153.9 Q ss_pred CchHHHHHH-hhhcceeccchhhhccchhHHHHHhhhhhcccccccCcchHHHHHHHhhCceeeeeeccccchhhhcccc Q lcl|Aclame:pro 1 MRDAQRIQN-LARAGVILPRSVQNVSTPLTEYAMDAADLSPHLSSTGSSGIPNYLTTYVDPAVIDILVAPMKAAELVGES 79 (336) Q Consensus 1 ~~~~~~~~~-l~~~g~~~~~~~~~~~~~~~~~a~da~d~~~~l~t~~~~~i~~~l~~~idp~v~~~~~~~~~~~~l~~v~ 79 (336) |+..+..+. ++++-. .... .... +| +.. ...+.....+|..+.+ ++++.+........++.+. T Consensus 1 ~~~~~~~~~~~~~f~~----~~~~-~~~~-----~a-~~~-~~~~~~~~~iP~~~~~----~ii~~~~~~s~l~~~~~~~ 64 (324) T protein:vir:97 1 MEQTQKLKLNLQHFAS----NNVK-PQVF-----NP-DNV-MMHEKKDGTLMNEFTT----PILQEVMENSKIMQLGKYE 64 (324) T ss_pred CccchhHHHHHHHHHH----hhhh-hhhh-----cc-ccc-cccCCCcceechhHHH----HHHHHHHhhcchhhhccee Confidence 888877653 222211 1000 0011 11 111 0112233345665543 3344444444455555444 Q ss_pred cCCCcceeeEEEeeeecceeeEEeecccCCceeeeeeeeeeeeEEEEEEEEeeCHHHHHHHHhhCCCHHHHHHHHHHHHH Q lcl|Aclame:pro 80 KKGDWTTLVAAFITAEPTTKVATYGDYSSDGDSGANINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASELNYSSALGL 159 (336) Q Consensus 80 t~g~w~~~t~~~~~~e~~G~a~~ygd~~diP~~~~~~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~k~~aAr~a~ 159 (336) +.+. .+..+++....+.+...+.+..+|..+..........+.++..+.+|.+-++.+ ..++.+.-.....+++ T Consensus 65 ~~~~---~~~~ip~~~~~~~a~~v~Eg~~~~~~~~~f~~v~~~~~k~~~~~~is~ell~ds---~~~l~~~i~~~l~~ai 138 (324) T protein:vir:97 65 PMEG---TEKKFTFWADKPGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYT---YSQFFEEMKPMIAEAF 138 (324) T ss_pred eccC---CceEEEEEecCcceeEeccCccccccccceeEEEEeeEEEEEeehhhHHHHhcc---hHHHHHHHHHHHHHHH Confidence 3332 346788888888999999999999999988888889999999999998555544 3678888888889999 Q ss_pred HHhhcceEEeeccccc-eEEEEecCCCCcccccccccccccCHHHHHHHHHHHHHHHHHHhCCceecccccEEEecHHHH Q lcl|Aclame:pro 160 AKFLNGSYLFGVAGLE-NYGLINDPSLSAPITATTPWSGSPAVEAVVNEVVALFQVLQTQSQGIITQEDVLRMGLPPTAM 238 (336) Q Consensus 160 e~~~n~~~~~Gd~~~g-~~GllN~Pnl~~~~~~~t~w~~~~t~~eI~~Di~~l~~~l~~~s~g~v~~~~p~tL~Lp~~~~ 238 (336) .+.+++-.+.|++... ..|+++........+ .++. -++||.+++..+... + ..+.+++|.+..+ T Consensus 139 a~~~d~a~l~G~g~~~~~~gi~~~~~~~~~~~-----~~~~----~~~~i~~~~~~l~~~--~----~~~~~~v~n~~~~ 203 (324) T protein:vir:97 139 YKKFDEAGILNQGNNPFGKSIAQSIEKTNKVI-----KGDF----TQDNIIDLEALLEDD--E----LEANAFISKTQNR 203 (324) T ss_pred HHHHHHHhhccCCCCccCccccccccccceec-----cccC----CHHHHHHHHHhhhhc--c----CCCCEEEEcHHHH Confidence 9999999999987442 345555322211111 1112 256777777766432 1 2466899999999 Q ss_pred Hhccc-CCCCCccHHHHHHHhCC---ccEEEEcccccCCC-----CceEEEEEEeecCCceEEEEcChhhh-ccccee-- Q lcl|Aclame:pro 239 SDLSK-TNQYGLAAAAKLKDIFP---KLEFVTIPEYDTAS-----GRLVQLWAPRVEGKDTATCGFTEKMR-AHSIER-- 306 (336) Q Consensus 239 ~~L~~-~~~~~~Tvl~~l~~n~p---nl~i~~~pel~~a~-----G~~~~~~~~~~~~~~~~~~~~p~~~r-~l~~~~-- 306 (336) ..|.+ .+..|..++. -.... ++.++..+-..... |...++++-.+.+ .++.+-..-. ...... T Consensus 204 ~~L~~lkd~~g~~~~~--~~~~~tl~G~PV~~~~~~~~~~~~~~~gd~~~~~i~~~~~---~~i~~~~~~~~~~~~~~~~ 278 (324) T protein:vir:97 204 SLLRKIVDPETKERIY--DRNSDTLDGLPVVNLKSSNLKRGELITGDFDKLIYGIPQL---IEYKIDETAQLSTVKNEDG 278 (324) T ss_pred HHHHHhhcCCCceeec--CCCCccccceeeEeecCCCCCcceEEEEecccEEEEEecC---cEEEEeecccccccccccc Confidence 88864 2333332221 01111 12222222111111 1111222211111 1111111000 000000 Q ss_pred ------cCCceEEccccceeeeeeecccceeeeccC Q lcl|Aclame:pro 307 ------YSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) Q Consensus 307 ------~~~~~~vp~~~~t~Gv~ir~P~av~~~~GI 336 (336) ..-....-+..|.+ +.+++|.||+.+.+. T Consensus 279 ~~~~~f~~d~~~~r~~~r~d-~~v~~~~a~~~l~~~ 313 (324) T protein:vir:97 279 TPVNLFEQDMVALRATMHVA-LHIADDKAFAKLVPA 313 (324) T ss_pred cchhhhhcCcEEEEEEEEec-cEEecccceEEEEec Confidence 00123334455554 455579999999999 No 39 >protein:vir:78523 Length: 338 # NCBI annotation: Putative head structural protein # Family: family:all:507 # MgeID: mge:1853 # MgeName: U2 # Cross-refs: genbank:acc:YP_001491585;genbank:gi:157786408;genbank:GeneID:5625675 Probab=97.79 E-value=2.2e-06 Score=51.56 Aligned_cols=295 Identities=9% Similarity=0.035 Sum_probs=159.7 Q ss_pred hhccchhHHHHHhhhhhcccccccCcchHHHHHHHhhCceeeeeeccccchhhhcccccCCCcceeeEEEeeeec----- Q lcl|Aclame:pro 22 QNVSTPLTEYAMDAADLSPHLSSTGSSGIPNYLTTYVDPAVIDILVAPMKAAELVGESKKGDWTTLVAAFITAEP----- 96 (336) Q Consensus 22 ~~~~~~~~~~a~da~d~~~~l~t~~~~~i~~~l~~~idp~v~~~~~~~~~~~~l~~v~t~g~w~~~t~~~~~~e~----- 96 (336) +-.-.+++.+.+-. +.++.+++.+...+|..+.+ +|++.+........+.++...+. ....+++.+. T Consensus 1 ~~~~~e~~~~~~~~-~~~~~~~~~~~~liP~~~~~----~ii~~~~~~s~l~~l~~~~~~~~---~~~~ip~~~~~~~a~ 72 (338) T protein:vir:78 1 MATLNELAPNTAGS-NHQGRLAHVPSDLLPKEIVG----PIFDKAQESSLVLRLGENIPISY---GETIIPTTVKRPEVG 72 (338) T ss_pred CcchHHhhhhhccc-ccccceecccccccchHHHH----HHHHHHHhhchhhhhcceeeccC---CceEEEEEecCccce Confidence 11123344444433 34444555555667876664 44666666666666666544332 2344444432 Q ss_pred ---ceeeEEeecccCCceeeeeeeeeeeeEEEEEEEEeeCHHHHHHHHhhCCCHHHHHHHHHHHHHHHhhcceEEeeccc Q lcl|Aclame:pro 97 ---TTKVATYGDYSSDGDSGANINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASELNYSSALGLAKFLNGSYLFGVAG 173 (336) Q Consensus 97 ---~G~a~~ygd~~diP~~~~~~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~k~~aAr~a~e~~~n~~~~~Gd~~ 173 (336) .+.+...+.+..+|..+...+....+.+.++....+|.+=++. ...++.+.-....++++.+.+++-.+.|+.. T Consensus 73 ~v~~~~~~~~~Eg~~~~~~~~~f~~v~l~~~k~~~~~~is~ell~d---s~~~~~~~i~~~la~a~~~~~d~~~l~G~g~ 149 (338) T protein:vir:78 73 QVGVGTSNEQREGGTKPLSGTAWDTRSVAPIKLATIVTVSEEFARM---NPSGLYTKLQADLAYAIGRGIDLAVFHGKSP 149 (338) T ss_pred eecccccccccccccccccccceeEEEEEEEEEEEeehhhHHHHhc---CHHHHHHHHHHHHHHHHHHHHHHHhhcccCC Confidence 2445566777888999888888888889998888888743333 3367888888889999999999999999974 Q ss_pred ---cceEEEEecCCCCcccccccccccccCHHHHHHHHHHHHHHHHHHhCCceecccccEEEecHHHHHhccc----CCC Q lcl|Aclame:pro 174 ---LENYGLINDPSLSAPITATTPWSGSPAVEAVVNEVVALFQVLQTQSQGIITQEDVLRMGLPPTAMSDLSK----TNQ 246 (336) Q Consensus 174 ---~g~~GllN~Pnl~~~~~~~t~w~~~~t~~eI~~Di~~l~~~l~~~s~g~v~~~~p~tL~Lp~~~~~~L~~----~~~ 246 (336) .+..|++++..+....+.... ++.....++++.+++..+.....+ .+..++|.+..+..|.+ .+. T Consensus 150 ~~~~~~~gi~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~m~~~~~~~L~~~~~l~d~ 221 (338) T protein:vir:78 150 LTGSALQGIDTNNVIVNTTNVDYL---QTGTTPLLDRFLDGYDLVSANTDV-----DFNGWAADPRYRARLLRSQAYRDA 221 (338) T ss_pred Cccccccccccccccccccccccc---cccchhhHHHHHHHHHHhhhhccc-----cceEEEEchHHHHHHHHHhhhccC Confidence 456677776554332222221 223456788888888877554321 35679999988776632 233 Q ss_pred CCccHHHHHH-Hh----CCccEEEE---cccccCC-CCceEEEEEEeecC-----CceEEEEc-Chhhhccccee----c Q lcl|Aclame:pro 247 YGLAAAAKLK-DI----FPKLEFVT---IPEYDTA-SGRLVQLWAPRVEG-----KDTATCGF-TEKMRAHSIER----Y 307 (336) Q Consensus 247 ~~~Tvl~~l~-~n----~pnl~i~~---~pel~~a-~G~~~~~~~~~~~~-----~~~~~~~~-p~~~r~l~~~~----~ 307 (336) .|.-++.-.. .. +-++-++. +|.-.++ .+.+..+++-.... ..-..+.+ ++.-......+ . T Consensus 222 ~g~~l~~~~~~~~~~~~l~G~PV~~~~~ip~~~~~~~~~~~~~~~gdfs~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~ 301 (338) T protein:vir:78 222 NGNVDPTRINLAASAGDLLGLPVQFGKAVGGDLGAATDSKVRVVGGDFSQLKYGFADEIRVKMSDTATLTDNTSPTPQTV 301 (338) T ss_pred CCceeecccccCCCCceeeeeeEEEccccCccccccCCcccEEEEEecceEEEEeecccEEEEeecccccccccccccch Confidence 3332221111 10 11222222 3332222 23333333322100 00011111 11000000000 0 Q ss_pred ----CCceEEccccceeeeeeecccceeeeccC Q lcl|Aclame:pro 308 ----SSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) Q Consensus 308 ----~~~~~vp~~~~t~Gv~ir~P~av~~~~GI 336 (336) .-....-|+.|. |+.+.+|.||+++... T Consensus 302 ~~~~~~~~~~r~~~r~-d~~v~~~~a~~~l~~~ 333 (338) T protein:vir:78 302 SMWQTNQIAILIEVTF-GWLLGDKQAFVKFVDD 333 (338) T ss_pred hhhhcCcEEEEEEEEe-ccEeecccceEEEecc Confidence 011334455555 5567889999999999 No 40 >protein:vir:105038 Length: 428 # NCBI annotation: major capsid head protein precursor # Family: family:all:21 # MgeID: mge:1465 # MgeName: phiKO2 # Cross-refs: genbank:acc:YP_006586;genbank:gi:46402092;genbank:GeneID:2777903 Probab=97.77 E-value=6e-06 Score=49.22 Aligned_cols=316 Identities=10% Similarity=0.082 Sum_probs=144.0 Q ss_pred CchHHHHHHhh----h--------------------cceec-------cchhhhccchhHHHHHhhh---hhccccccc- Q lcl|Aclame:pro 1 MRDAQRIQNLA----R--------------------AGVIL-------PRSVQNVSTPLTEYAMDAA---DLSPHLSST- 45 (336) Q Consensus 1 ~~~~~~~~~l~----~--------------------~g~~~-------~~~~~~~~~~~~~~a~da~---d~~~~l~t~- 45 (336) +...+....+. + .|..+ ......+. ....++.+.. .....+.+. T Consensus 53 i~~~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~ 131 (428) T protein:vir:10 53 MDRMEATERAAALVAKPVKATQHGPAVIVKAEPKQYTGAGMTRMVMSIAAAQGNLQ-DAAKFASDELNDQSVSMAISTAA 131 (428) T ss_pred HHHHHHHHHHHHHHhhhhhchhhccccccccccchhhhHHHHHHHHHHHHhhhhHH-HHHHHhhhhhhhhhHhhhhcccc Confidence 11101000000 0 00000 00000000 0000110000 000011222 Q ss_pred Ccc--hHHHHHHHhhCceeeeeeccccchhhhcccccCCCcceeeEEEeeeecceeeEEeecccCCceeeeeeeeeeeeE Q lcl|Aclame:pro 46 GSS--GIPNYLTTYVDPAVIDILVAPMKAAELVGESKKGDWTTLVAAFITAEPTTKVATYGDYSSDGDSGANINYPQRQS 123 (336) Q Consensus 46 ~~~--~i~~~l~~~idp~v~~~~~~~~~~~~l~~v~t~g~w~~~t~~~~~~e~~G~a~~ygd~~diP~~~~~~~~~~~~v 123 (336) .++ .||..+.+ +|++.+........+ +... .......+.+++....+.+...+.+...|..+.......-.. T Consensus 132 ~~gg~liP~~~~~----~ii~~l~~~~~l~~~-~~~~-~~~~~g~~~~p~~~~~~~a~~v~Eg~~~~~~~~~f~~i~~~~ 205 (428) T protein:vir:10 132 GSGGVLIPQNIHS----EVIELLRDRTIVRKL-GARS-IPLPNGNMSLPRLAGGATASYTGENQDAKVSEARFDDVKLTA 205 (428) T ss_pred cCCccccchhHHH----HHHHHHhhhchhhhh-ccee-eecCCcceEEEEEeCCcceeeeccCccccccccceeeEEeee Confidence 222 25765543 344443333333333 1111 111122357777777778888898899999998888888888 Q ss_pred EEEEEEEeeCHHHHHHHHhhCCCHHHHHHHHHHHHHHHhhcceEEeeccc-cceEEEEecCCCCcccccccccccccCHH Q lcl|Aclame:pro 124 YFFQTWTRWGERELEMAGAGRVDLASELNYSSALGLAKFLNGSYLFGVAG-LENYGLINDPSLSAPITATTPWSGSPAVE 202 (336) Q Consensus 124 ~~~~~~~~y~~~El~~A~~~g~~l~~~k~~aAr~a~e~~~n~~~~~Gd~~-~g~~GllN~Pnl~~~~~~~t~w~~~~t~~ 202 (336) +.++..+.+|.+=+..+ ..++.+--......++.+.+++.+++|++. ....|++|.......+.. +......+.+ T Consensus 206 ~k~~~~v~is~ell~ds---~~~l~~~i~~~l~~ai~~~~d~~~l~G~G~~~~p~Gi~~~~~~~~~~~~-~~~~~~~~~~ 281 (428) T protein:vir:10 206 KTMIAMVPISNALIGRA---GFNVEQLVLQDILTAISVREDKAFMRDDGTGDTPIGMKARATQWNRLLP-WAADAAVNLD 281 (428) T ss_pred EEEEEeehhhHHHHhhh---hHHHHHHHHHHHHHHHHHHHHHHHhccCCCCcccccccccccccccccc-ccccccccHH Confidence 89998888887755543 346788888888888889999989999874 466799996543221111 1111233333 Q ss_pred HHHHHHHHHHHHHHHHhCCceecccccEEEecHHHHHhccc-CCCCCccHHHHHHH-hCCccEEEE---cccccCCCCce Q lcl|Aclame:pro 203 AVVNEVVALFQVLQTQSQGIITQEDVLRMGLPPTAMSDLSK-TNQYGLAAAAKLKD-IFPKLEFVT---IPEYDTASGRL 277 (336) Q Consensus 203 eI~~Di~~l~~~l~~~s~g~v~~~~p~tL~Lp~~~~~~L~~-~~~~~~Tvl~~l~~-n~pnl~i~~---~pel~~a~G~~ 277 (336) .+-..++.+...... .. ........+|.+..+..|.. .+..|.-++.=... .+-++.++. +|.-.+.+++. T Consensus 282 ~~~~~~~~~~~~~~~-~~---~~~~~~~~v~n~~~~~~L~~lkd~~G~~i~~~~~~g~l~G~pv~~~~~~p~~~~~~~~~ 357 (428) T protein:vir:10 282 TIDTYLDSIILMSMD-GN---SNMISSGWGMSNRTYMKLFGLRDGNGNKVYPEMAQGMLKGYPIQRTSAIPANLGEGGKE 357 (428) T ss_pred HHHHHHHHHHHhhhc-cc---cccccCEEEEcHHHHHHHHHhhccCCceeccCCCCCeeeceeeEEeccccccccCCCcc Confidence 333333322221111 11 01134568899998888854 33334433321110 111222322 23222223333 Q ss_pred EEEEEEeecCCceEEEEcChhhhcc--c-------------ceecCCceEEccccceeeeeeecccceeeeccC Q lcl|Aclame:pro 278 VQLWAPRVEGKDTATCGFTEKMRAH--S-------------IERYSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) Q Consensus 278 ~~~~~~~~~~~~~~~~~~p~~~r~l--~-------------~~~~~~~~~vp~~~~t~Gv~ir~P~av~~~~GI 336 (336) ..+++-.. ++ ..+..-..++.. + -..++ ...+-+..| -|+.+++|.||+.++|| T Consensus 358 ~~i~~gd~-s~--~~i~~~~~i~i~~~~~~~~~~~~~~~~~~f~~~-~~~~R~~~r-~d~~v~~p~a~~~~t~~ 426 (428) T protein:vir:10 358 SEIYFADF-ND--VVIGEDGNMKVDFSKEASYIDTDGKLVSAFSRN-QSLIRVVTE-HDIGFRHPEGLVLGTGV 426 (428) T ss_pred ceEEEEec-ce--EEEEEecceEEEeecccccccccccccchhhcc-hhheeeeee-eCceeeccceEEEEecc Confidence 22222111 11 111111111110 0 00011 122234444 46778999999999999 No 41 >protein:vir:2430 Length: 318 # NCBI annotation: major head subunit # Family: family:all:507 # MgeID: mge:52 # MgeName: D29 # Cross-refs: genbank:acc:NP_046832;genbank:gi:9630400;genbank:GeneID:1261582 Probab=97.76 E-value=3.1e-06 Score=50.77 Aligned_cols=286 Identities=11% Similarity=0.017 Sum_probs=148.2 Q ss_pred hhhcceeccchhhhccchhHHHHHhhhhhcccccccCcchHHHHHHHhhCceeeeeeccccchhhhcccccCCCcceeeE Q lcl|Aclame:pro 10 LARAGVILPRSVQNVSTPLTEYAMDAADLSPHLSSTGSSGIPNYLTTYVDPAVIDILVAPMKAAELVGESKKGDWTTLVA 89 (336) Q Consensus 10 l~~~g~~~~~~~~~~~~~~~~~a~da~d~~~~l~t~~~~~i~~~l~~~idp~v~~~~~~~~~~~~l~~v~t~g~w~~~t~ 89 (336) |+ .|-.|.. +.+.++. .. ++...+.+|..+.+ ++++.+.+......+..+...+. .+. T Consensus 1 ~~-~~~~~~~-------e~~~~~~-~~------~~~~~~~ip~~~~~----~ii~~~~~~~~l~~~~~~~~~~~---~~~ 58 (318) T protein:vir:24 1 MA-AGTAFAV-------DHAQIAQ-TG------DTMFKGYLEPEQAK----DYFAEAEKTSIVQQFAQKVPMGT---TGQ 58 (318) T ss_pred CC-CCCCCCH-------HHHHhhc-cc------CcccceeechhHHH----HHHHHHHhhchhhhhcceeeccC---Cce Confidence 22 1222211 1111111 10 12222335655553 33444444444555554433221 346 Q ss_pred EEeeeecceeeEEeecccCCceeeeeeeeeeeeEEEEEEEEeeCHHHHHHHHhhCCCHHHHHHHHHHHHHHHhhcceEEe Q lcl|Aclame:pro 90 AFITAEPTTKVATYGDYSSDGDSGANINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASELNYSSALGLAKFLNGSYLF 169 (336) Q Consensus 90 ~~~~~e~~G~a~~ygd~~diP~~~~~~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~k~~aAr~a~e~~~n~~~~~ 169 (336) .+++....+.+...+....+|..+...+...-..+.++....+|.+-++.+ ..++.+.-.....+++.+.+++-++. T Consensus 59 ~ip~~~~~~~a~~v~Eg~~~~~~~~~f~~i~~~~~k~~~~~~iS~e~l~ds---~~~~~~~i~~~l~~~~~~~~d~a~l~ 135 (318) T protein:vir:24 59 KIPHWVGDVSAQWIGEGDMKPITKGNMTSQTIAPHKIATIFVASAETVRAN---PANYLGTMRTKVATAFAMAFDGAAMH 135 (318) T ss_pred EEEEEeCCcceEEecCCccccccccceeEEEEeeEEEEEeehhhHHHhhcC---hHHHHHHHHHHHHHHHHHHHHHhhhc Confidence 777778888899999999999999888888888899999888887655533 35788888899999999999999999 Q ss_pred eccccceEEEEecCCCCcccccccccccccCHHHHHHHHHHHHHHHHHHhCCceecccccEEEecHHHHHhccc-CCCCC Q lcl|Aclame:pro 170 GVAGLENYGLINDPSLSAPITATTPWSGSPAVEAVVNEVVALFQVLQTQSQGIITQEDVLRMGLPPTAMSDLSK-TNQYG 248 (336) Q Consensus 170 Gd~~~g~~GllN~Pnl~~~~~~~t~w~~~~t~~eI~~Di~~l~~~l~~~s~g~v~~~~p~tL~Lp~~~~~~L~~-~~~~~ 248 (336) |+....-.|+++..... ..+..+ ...+.. .+++.+++..+...- ..+..++|.++.+..|.+ .+..| T Consensus 136 G~g~~~~~~~~~~~~~~-~~~~~~--~~~~~~---~~~~~~~~~~~~~~~------~~~~~~v~n~~~~~~L~~lkd~~G 203 (318) T protein:vir:24 136 GTDSPFPTYIGQTTKAI-SIADTT--GATTVY---DQVAVNGLSLLVNDG------KKWTHTLLDDITEPILNGAKDQNG 203 (318) T ss_pred ccCCCCCcccccccccc-cccccc--cccchH---HHHHHHHHHhhcccc------CCCCEEEEcHHHHHHHHHhhccCC Confidence 99765555666532211 011111 011111 223344444332211 245689999999998864 33334 Q ss_pred ccHHHHHHHh-----CCccEEEEcccccC--C-CCceEEEEEE-------eecCCceEEEEcChhhhccc-c----e--- Q lcl|Aclame:pro 249 LAAAAKLKDI-----FPKLEFVTIPEYDT--A-SGRLVQLWAP-------RVEGKDTATCGFTEKMRAHS-I----E--- 305 (336) Q Consensus 249 ~Tvl~~l~~n-----~pnl~i~~~pel~~--a-~G~~~~~~~~-------~~~~~~~~~~~~p~~~r~l~-~----~--- 305 (336) ..++.-...+ +...++...|-.-. . .|....++.+ .+.+ ..+.+.......- . . T Consensus 204 ~~l~~~~~~~~~~~~~~~~~i~g~pv~~~~~~~~~~~~~~~gdfs~~~~~~~~~---l~i~~~~~~~~~~~~~~~~~~~~ 280 (318) T protein:vir:24 204 RPLFIESTYGEAASPFRSGRIVARPTILSDHVVEGTTVGFMGDFSQLIWGQIGG---LSFDVTDQATLNLGTVESPNFVS 280 (318) T ss_pred ceeecCccccCccccccCceEEEEeeEEeCCCCCCccEEEEeecceEEEEEecC---eEEEEeeccceeccccccccchh Confidence 4332111111 11123333333321 1 1222222211 1111 1121211111000 0 0 Q ss_pred -ecCCceEEccccceeeeeeecccceeeeccC Q lcl|Aclame:pro 306 -RYSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) Q Consensus 306 -~~~~~~~vp~~~~t~Gv~ir~P~av~~~~GI 336 (336) ...-....-+..|. |+.+++|.||+.+.++ T Consensus 281 ~f~~~~~~~r~~~r~-d~~v~~~~a~~~i~~~ 311 (318) T protein:vir:24 281 LWQHNLVAVRVEAEY-AFHCNDAEAFVALTNV 311 (318) T ss_pred hhhcCcEEEEEEEEE-ccEEecccceEEEEee Confidence 11123445566666 5556889999999999 No 42 >protein:vir:78223 Length: 333 # NCBI annotation: Putative major head protein # Family: family:all:966 # MgeID: mge:1849 # MgeName: Bethlehem # Cross-refs: genbank:acc:YP_001491666;genbank:gi:157786490;genbank:GeneID:5625701 Probab=97.75 E-value=2.5e-06 Score=51.27 Aligned_cols=292 Identities=9% Similarity=0.038 Sum_probs=148.7 Q ss_pred HHHhhhcceeccchhhhccchhHHHHHhhhhhcccccccCcchHHHHHHHhhCceeeeeeccccchhhhcccccCCCcce Q lcl|Aclame:pro 7 IQNLARAGVILPRSVQNVSTPLTEYAMDAADLSPHLSSTGSSGIPNYLTTYVDPAVIDILVAPMKAAELVGESKKGDWTT 86 (336) Q Consensus 7 ~~~l~~~g~~~~~~~~~~~~~~~~~a~da~d~~~~l~t~~~~~i~~~l~~~idp~v~~~~~~~~~~~~l~~v~t~g~w~~ 86 (336) ++. ..+++.+.+= .+.++.+.......+|..+.+ +|++.+.+.-...++..+.+.+. T Consensus 1 ~a~---------------l~el~~~~~~-~~~~g~~~~~~~~liP~~~~~----~ii~~l~~~s~l~~~~~~~~~~~--- 57 (333) T protein:vir:78 1 MAT---------------LNELLPNSAG-SNHQGRLAHVPSDLLPKEIVG----PIFDKAQESSLVLRMGEQIPISY--- 57 (333) T ss_pred Cch---------------hHHhhhhccc-ccccCceecCCccccchhHHH----HHHHHHHhhchhhhhcceeeccC--- Confidence 010 1122222221 123333333334456655543 44454444444455554433221 Q ss_pred eeEEEeeeecceee--------EEeecccCCceeeeeeeeeeeeEEEEEEEEeeCHHHHHHHHhhCCCHHHHHHHHHHHH Q lcl|Aclame:pro 87 LVAAFITAEPTTKV--------ATYGDYSSDGDSGANINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASELNYSSALG 158 (336) Q Consensus 87 ~t~~~~~~e~~G~a--------~~ygd~~diP~~~~~~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~k~~aAr~a 158 (336) ....+++......+ ...++...+|..+..........+.++....+|.+=++. ...++.+.-+....++ T Consensus 58 ~~~~~p~~~~~~~a~~v~eg~~~~~~e~~~~~~~~~~f~~i~l~~~kl~~~~~is~ell~~---s~~~~~~~i~~~la~a 134 (333) T protein:vir:78 58 GETIIPTTVKRPEVGQVGVGTSNEQREGGLKPLSGTAWDTRSVSPIKLATIVTVSEEFARM---NPSGLYTKLQGDLAYA 134 (333) T ss_pred CceEEEEEeCCceeEeecCcccccccccccccccccceeEEEEeeEEEEEeehhhHHHHhc---CHHHHHHHHHHHHHHH Confidence 22345555444333 344555677888888888888889999988888744443 3356888888888999 Q ss_pred HHHhhcceEEeeccc---cceEEEEecCCCCcccccccccccccCHHHHHHHHHHHHHHHHHHhCCceecccccEEEecH Q lcl|Aclame:pro 159 LAKFLNGSYLFGVAG---LENYGLINDPSLSAPITATTPWSGSPAVEAVVNEVVALFQVLQTQSQGIITQEDVLRMGLPP 235 (336) Q Consensus 159 ~e~~~n~~~~~Gd~~---~g~~GllN~Pnl~~~~~~~t~w~~~~t~~eI~~Di~~l~~~l~~~s~g~v~~~~p~tL~Lp~ 235 (336) +.+.++.-.+.|+.. .+..|+++...+..... ... ...+.+..++||.+++..+..... ..+..++|.| T Consensus 135 i~~~~d~~~l~G~g~~~~~~~~g~~~~~~~~~~~~-~~~--~~~~~~~~~~~i~~~~~~~~~~~~-----~~~~~~vmn~ 206 (333) T protein:vir:78 135 IGRGIDLAVFHGKSPLTGSALQGIDTDNVIANTTN-VDY--LQETGDPLLDRLLDGYDLVSANTD-----VEFNGWAVDP 206 (333) T ss_pred HHHHHHHHHhcccCCCCCccccccccccccccccc-ccc--cccccchhHHHHHHHHHhhccccc-----cCceEEEEcc Confidence 999999999999874 55677777655432211 111 112234467888888877644321 2466799998 Q ss_pred HHHHhccc----CCCCCccHHHHHHHh-----CCccEEEEccccc---C-CCCceEEEEEEeecCCceEEEEcChh--hh Q lcl|Aclame:pro 236 TAMSDLSK----TNQYGLAAAAKLKDI-----FPKLEFVTIPEYD---T-ASGRLVQLWAPRVEGKDTATCGFTEK--MR 300 (336) Q Consensus 236 ~~~~~L~~----~~~~~~Tvl~~l~~n-----~pnl~i~~~pel~---~-a~G~~~~~~~~~~~~~~~~~~~~p~~--~r 300 (336) ..+..|.+ .+..|.-++...... .-++.++....+. + +.+++..+++-+.. + ..+.+... +. T Consensus 207 ~~~~~L~~~~~~~d~~G~~i~~~~~~~~~~~~l~G~Pv~~~~~i~~~~~~~~~~~~~~~~gD~~--~-~~~g~~~~~~i~ 283 (333) T protein:vir:78 207 RFRAHLLRAQAYRDANGNVDPSRINLAAQTGDVLGLPAQFGRAVGGDLGAAVDSKTRIIGGDFS--Q-LKFGFADEIRIK 283 (333) T ss_pred hHHHHHHHHhhhcCCCCceeecCccccCCCceeeceeeEEccccCCCccccCCCccEEEEEecc--c-EEEEEeeccEEE Confidence 87766632 233344333222111 1122333322222 1 22233333332221 1 11111111 11 Q ss_pred cccc--e----------ecCCceEEccccceeeeeeecccceeeeccC Q lcl|Aclame:pro 301 AHSI--E----------RYSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) Q Consensus 301 ~l~~--~----------~~~~~~~vp~~~~t~Gv~ir~P~av~~~~GI 336 (336) ..+. . ...-...+-++.|.+ +.|+.|.||+++.+- T Consensus 284 ~~~~~~~~~~~~~~~~~~~~~~v~~r~~~r~d-~~v~~~~a~~~l~~~ 330 (333) T protein:vir:78 284 MSDTATLTDSGSATVSMWQTNQIAILIEVTFG-WLLGDKQAFVKFVDD 330 (333) T ss_pred EeccccccccccceeehhhcCcEEEEEEEEEc-cEEecccceEEEecc Confidence 1110 0 001112234455554 556999999999998 No 43 >protein:vir:103955 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1662 # MgeName: phiNM # Cross-refs: genbank:acc:YP_873992;genbank:gi:118430767;genbank:GeneID:4525449 Probab=97.73 E-value=3.9e-06 Score=50.25 Aligned_cols=292 Identities=10% Similarity=0.017 Sum_probs=151.1 Q ss_pred CchHHHHH-HhhhcceeccchhhhccchhHHHHHhhhhhcccccc-cCcchHHHHHHHhhCceeeeeeccccchhhhccc Q lcl|Aclame:pro 1 MRDAQRIQ-NLARAGVILPRSVQNVSTPLTEYAMDAADLSPHLSS-TGSSGIPNYLTTYVDPAVIDILVAPMKAAELVGE 78 (336) Q Consensus 1 ~~~~~~~~-~l~~~g~~~~~~~~~~~~~~~~~a~da~d~~~~l~t-~~~~~i~~~l~~~idp~v~~~~~~~~~~~~l~~v 78 (336) |+..+..+ +++++--.+.... .+. +...+++ ...+.+|..+.+ +|++.+...-...+++++ T Consensus 1 ~~~~~~~~~~~~~f~~~~~~~~-----~~~--------a~~~~~~~~~~~liP~~~~~----~ii~~~~~~s~l~~~~~~ 63 (324) T protein:vir:10 1 MEQTQKLKLNLQHFASNNVKPQ-----VFN--------PDNVMMHEKKDGTLLNDFTT----PILQEVMENSKIMQLGKY 63 (324) T ss_pred CCCchHHHHHHHHHHHHhhccc-----eec--------ccceeccCCCcceechhHHH----HHHHHHHhhchhhhhcce Confidence 77666655 3333211110000 000 0001111 112235543332 334444444444555554 Q ss_pred ccCCCcceeeEEEeeeecceeeEEeecccCCceeeeeeeeeeeeEEEEEEEEeeCHHHHHHHHhhCCCHHHHHHHHHHHH Q lcl|Aclame:pro 79 SKKGDWTTLVAAFITAEPTTKVATYGDYSSDGDSGANINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASELNYSSALG 158 (336) Q Consensus 79 ~t~g~w~~~t~~~~~~e~~G~a~~ygd~~diP~~~~~~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~k~~aAr~a 158 (336) .+.+. .++.|++.+..+.+..++.+..+|..+.......-..+.++..+.+|.+-++.+ ..++.+.-.....++ T Consensus 64 ~~~~~---~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds---~~~l~~~i~~~l~~a 137 (324) T protein:vir:10 64 EPMEG---TEKKFTFWADKPGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYT---YSQFFEEMKPMIAEA 137 (324) T ss_pred eeccC---CceEEEEEeCCcceeEeccCccccccccceeEEEEeeEEEEEeehhhHHHHhcc---hHHHHHHHHHHHHHH Confidence 43332 346788888888999999999999999888888888999998889888666544 357888888888888 Q ss_pred HHHhhcceEEeeccccc-eEEEEecCCCCcccccccccccccCHHHHHHHHHHHHHHHHHHhCCceecccccEEEecHHH Q lcl|Aclame:pro 159 LAKFLNGSYLFGVAGLE-NYGLINDPSLSAPITATTPWSGSPAVEAVVNEVVALFQVLQTQSQGIITQEDVLRMGLPPTA 237 (336) Q Consensus 159 ~e~~~n~~~~~Gd~~~g-~~GllN~Pnl~~~~~~~t~w~~~~t~~eI~~Di~~l~~~l~~~s~g~v~~~~p~tL~Lp~~~ 237 (336) +.+.+++-.++|+.... ..|+++..... .+...++. -++||.+++..+... + ..+..++|.++. T Consensus 138 i~~~~d~a~l~G~g~~~~~~~i~~~~~~~-----~~~~~~~~----t~~~i~~~~~~l~~~--~----~~~~~~v~n~~~ 202 (324) T protein:vir:10 138 FYKKFDEAGILNQGNNPFGKSIAQSIEKT-----NKVIKGDF----TQDNIIDLEALLEDD--E----LEANAFISKTQN 202 (324) T ss_pred HHHHHHHHhhhcCCCCccCcccccccccc-----ceeccccC----CHHHHHHHHHhhhhc--c----CCCCEEEEcHHH Confidence 88888888888876542 23444421111 11111112 367777787777432 1 246789999999 Q ss_pred HHhcccC-CCCCccHHHHHHHhCC---ccEEEEcccccCCCC-----ceEEEEEEeecCCceEEEEcChhhhcccc---- Q lcl|Aclame:pro 238 MSDLSKT-NQYGLAAAAKLKDIFP---KLEFVTIPEYDTASG-----RLVQLWAPRVEGKDTATCGFTEKMRAHSI---- 304 (336) Q Consensus 238 ~~~L~~~-~~~~~Tvl~~l~~n~p---nl~i~~~pel~~a~G-----~~~~~~~~~~~~~~~~~~~~p~~~r~l~~---- 304 (336) +..|.+- +..|..++ .-.+.. ++.++..+-.....| ....+++-.+.+ ..+.+-..-..... T Consensus 203 ~~~L~~l~d~~g~~~~--~~~~~~~l~G~PV~~~~~~~~~~~~~~~gd~~~~~~~~~~~---~~i~~~~~~~~~~~~~~~ 277 (324) T protein:vir:10 203 RSLLRKIVDPETKERI--YDRNSDTLDGLPVVNLKSSNLKRGELITGDFDKLIYGIPQL---IEYKIDETAQLSTVKNED 277 (324) T ss_pred HHHHHHhhccCCceee--cCCCCccccceeEEeecCCCCCcceEEEEecccEEEEEecC---cEEEEeeccccccccccc Confidence 9988642 33332221 111111 122222221111111 111111111111 11111111000000 Q ss_pred -e----ecCCceEEccccceeeeeeecccceeeeccC Q lcl|Aclame:pro 305 -E----RYSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) Q Consensus 305 -~----~~~~~~~vp~~~~t~Gv~ir~P~av~~~~GI 336 (336) . ...-....-++.|+++.++ +|.||+++.|. T Consensus 278 ~~~~~~~~~~~~~~r~~~r~d~~v~-~~~A~~~l~~a 313 (324) T protein:vir:10 278 GTPVNLFEQDMVALRATMHVALHIA-DDKAFAKLVPA 313 (324) T ss_pred ccchhhhhcCcEEEEEEEEEccEEe-cccceEEEEec Confidence 0 1112355566677755554 69999999999 No 44 >protein:vir:9309 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:165 # MgeName: phi 11 # Cross-refs: genbank:acc:NP_803287;genbank:gi:29028597;genbank:GeneID:1258044 Probab=97.70 E-value=1e-05 Score=47.98 Aligned_cols=293 Identities=10% Similarity=0.004 Sum_probs=149.9 Q ss_pred CchHHHHHHhhhcceeccchhhhccchhHHHHHhhhhhcccccc-cCcchHHHHHHHhhCceeeeeeccccchhhhcccc Q lcl|Aclame:pro 1 MRDAQRIQNLARAGVILPRSVQNVSTPLTEYAMDAADLSPHLSS-TGSSGIPNYLTTYVDPAVIDILVAPMKAAELVGES 79 (336) Q Consensus 1 ~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~a~da~d~~~~l~t-~~~~~i~~~l~~~idp~v~~~~~~~~~~~~l~~v~ 79 (336) |+..+..+.-.| .|-.... .++...+...+++ ...+.||..+.+ ++++.+........+.++. T Consensus 1 ~~~~~~~~~~~~---~f~~~~~---------~~~~~~a~~~~~~~~~~~liP~~~~~----~ii~~~~~~s~l~~l~~~~ 64 (324) T protein:vir:93 1 MEQTQKLKLNLQ---HFASNNV---------KPQVFNPDNVMMHEKKDGTLLNDFTT----PILQEVMENSKIMQLGKYE 64 (324) T ss_pred CchhHHHHHHHH---HHHHhhh---------hhhhcccccccccCCCcceechhHHH----HHHHHHHhhchhhhhccee Confidence 766665553222 1111100 0001011111111 222345655443 3344444444455555444 Q ss_pred cCCCcceeeEEEeeeecceeeEEeecccCCceeeeeeeeeeeeEEEEEEEEeeCHHHHHHHHhhCCCHHHHHHHHHHHHH Q lcl|Aclame:pro 80 KKGDWTTLVAAFITAEPTTKVATYGDYSSDGDSGANINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASELNYSSALGL 159 (336) Q Consensus 80 t~g~w~~~t~~~~~~e~~G~a~~ygd~~diP~~~~~~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~k~~aAr~a~ 159 (336) +.+. .++.|++.+..+.+...+.+.++|..+..........+.++..+.+|.+=++.+ ..++.+.-......++ T Consensus 65 ~~~~---~~~~ip~~~~~~~a~~v~Eg~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~ds---~~~l~~~i~~~l~~ai 138 (324) T protein:vir:93 65 PMEG---TEKKFTFWADKPGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYT---YSQFFEEMKPMIAEAF 138 (324) T ss_pred eccC---CceEEEEEecCcceeeecCCccccccccceeEEEEEeEEEEEeehhhHHHHhcc---hHHHHHHHHHHHHHHH Confidence 3322 346788888888899999999999999888888888888888888887555543 2578888888888888 Q ss_pred HHhhcceEEeecccc-ceEEEEecCCCCcccccccccccccCHHHHHHHHHHHHHHHHHHhCCceecccccEEEecHHHH Q lcl|Aclame:pro 160 AKFLNGSYLFGVAGL-ENYGLINDPSLSAPITATTPWSGSPAVEAVVNEVVALFQVLQTQSQGIITQEDVLRMGLPPTAM 238 (336) Q Consensus 160 e~~~n~~~~~Gd~~~-g~~GllN~Pnl~~~~~~~t~w~~~~t~~eI~~Di~~l~~~l~~~s~g~v~~~~p~tL~Lp~~~~ 238 (336) .+.+++-++.|+... ...|+++........+. ++. -++||.+++..|... + ..+.+++|.++.+ T Consensus 139 a~~~d~a~l~G~g~~~~~~~~~~~~~~~~~~~~-----~~~----~~~~i~~~~~~l~~~-~-----~~~~~~v~n~~~~ 203 (324) T protein:vir:93 139 YKKFDEAGILNQGNNPFGKSIAQSIEKTNKVIK-----GDF----TQDNIIDLEALLEDD-E-----LEANAFISKTQNR 203 (324) T ss_pred HHHHHHHHhcCCCCCCcCccccccccccceecc-----ccc----cHHHHHHHHHhhhhc-c-----CCCCEEEEcHHHH Confidence 888888888887643 22445542221111110 112 367788888777443 1 2466899999999 Q ss_pred Hhccc-CCCCCccHHHHHHHhCC---ccEEEEcccccCCC-----CceEEEEEEeecCCceEEEEcChhhhcc----cc- Q lcl|Aclame:pro 239 SDLSK-TNQYGLAAAAKLKDIFP---KLEFVTIPEYDTAS-----GRLVQLWAPRVEGKDTATCGFTEKMRAH----SI- 304 (336) Q Consensus 239 ~~L~~-~~~~~~Tvl~~l~~n~p---nl~i~~~pel~~a~-----G~~~~~~~~~~~~~~~~~~~~p~~~r~l----~~- 304 (336) ..|.+ .+..|.-++. ...-+ ++-++..+-..... |.-.++++-.+.+ .++.+-...... +- T Consensus 204 ~~L~~l~d~~G~~~~~--~~~~~~l~G~PVv~~~~~~~~~~~i~~gdfs~~~~~~~~~---~~i~~~~~~~~~~~~~~~~ 278 (324) T protein:vir:93 204 SLLRKIVDPETKERIY--DRNSDSLDGLPVVNLKSSNLKRGELITGDFDKLIYGIPQL---IEYKIDETAQLSTVKNEDG 278 (324) T ss_pred HHHHHhhCCCCCeeec--CCCCCcccceeeEeecCCCCCcceEEEEecceEEEEEecC---cEEEEeecccccccccccc Confidence 98864 3333332210 01111 12222211111111 1111121111111 111110000000 00 Q ss_pred e----ecCCceEEccccceeeeeeecccceeeeccC Q lcl|Aclame:pro 305 E----RYSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) Q Consensus 305 ~----~~~~~~~vp~~~~t~Gv~ir~P~av~~~~GI 336 (336) . ...-...+-+..|+ |+.+.+|.||+++.+. T Consensus 279 ~~~~~f~~n~~~~r~~~r~-d~~v~~~~a~~~l~~a 313 (324) T protein:vir:93 279 TPVNLFEQDMVALRATMHV-ALHIADDKAFAKLVPA 313 (324) T ss_pred cchhhhhcCcEEEEEEEEe-ccEEecccceEEEecc Confidence 0 01112444555666 5557779999999998 No 45 >protein:vir:96223 Length: 324 # NCBI annotation: ORF011 # Family: family:all:507 # MgeID: mge:1607 # MgeName: 69 # Cross-refs: genbank:acc:YP_239571;genbank:gi:66395304;genbank:GeneID:5132771 Probab=97.69 E-value=6.2e-06 Score=49.14 Aligned_cols=294 Identities=11% Similarity=0.041 Sum_probs=150.8 Q ss_pred CchHHHHH-HhhhcceeccchhhhccchhHHHHHhhhhhcccc-cccCcchHHHHHHHhhCceeeeeeccccchhhhccc Q lcl|Aclame:pro 1 MRDAQRIQ-NLARAGVILPRSVQNVSTPLTEYAMDAADLSPHL-SSTGSSGIPNYLTTYVDPAVIDILVAPMKAAELVGE 78 (336) Q Consensus 1 ~~~~~~~~-~l~~~g~~~~~~~~~~~~~~~~~a~da~d~~~~l-~t~~~~~i~~~l~~~idp~v~~~~~~~~~~~~l~~v 78 (336) |+.-+.++ +++++- ..... ....++ ...+ .+...+.+|..+.+ +|++.+-.......++++ T Consensus 1 ~~~~~~~~~~~~~f~----~~~~~------~~~~~a---~~~~~~~~~~~lip~~~~~----~ii~~~~~~s~l~~l~~~ 63 (324) T protein:vir:96 1 MEQTQKLKLNLQHFA----SNNVK------PQVFNP---DNVMMHEKKDGTLLNDFTT----PILQEVMENSKIMQLGKY 63 (324) T ss_pred CCcchhhhHHHHHHH----Hhhhh------hhhccc---ccccccCCCcceechhHHH----HHHHHHHhhchhhhhcce Confidence 77776665 333211 11000 000011 1111 11223345655543 334444444445555555 Q ss_pred ccCCCcceeeEEEeeeecceeeEEeecccCCceeeeeeeeeeeeEEEEEEEEeeCHHHHHHHHhhCCCHHHHHHHHHHHH Q lcl|Aclame:pro 79 SKKGDWTTLVAAFITAEPTTKVATYGDYSSDGDSGANINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASELNYSSALG 158 (336) Q Consensus 79 ~t~g~w~~~t~~~~~~e~~G~a~~ygd~~diP~~~~~~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~k~~aAr~a 158 (336) .+.+. .++.|++.+..+.+..+|....+|..+..........+.++..+.+|.+=++.+ ..++.+.-......+ T Consensus 64 ~~~~~---~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~f~~v~~~~~k~~~~~~is~ell~ds---~~~l~~~i~~~l~~a 137 (324) T protein:vir:96 64 EPMEG---TEKKFTFWADKPGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYT---YSQFFEEMKPMIAEA 137 (324) T ss_pred eeccC---CceEEEEEecCcceeeecCCccccccccceeEEEEEeEEEEEeehhhHHHHhcc---hHHHHHHHHHHHHHH Confidence 44332 346788888888899999999999999888888888899998888887555543 367888888888889 Q ss_pred HHHhhcceEEeeccccce-EEEEecCCCCcccccccccccccCHHHHHHHHHHHHHHHHHHhCCceecccccEEEecHHH Q lcl|Aclame:pro 159 LAKFLNGSYLFGVAGLEN-YGLINDPSLSAPITATTPWSGSPAVEAVVNEVVALFQVLQTQSQGIITQEDVLRMGLPPTA 237 (336) Q Consensus 159 ~e~~~n~~~~~Gd~~~g~-~GllN~Pnl~~~~~~~t~w~~~~t~~eI~~Di~~l~~~l~~~s~g~v~~~~p~tL~Lp~~~ 237 (336) +.+.+++.+++|+..... .|+++. +. .... .... ..-++||.+++..+... + ..+..++|.++. T Consensus 138 ia~~~d~~~l~G~g~~~~~~~~~~~--~~----~~~~-~~~~--~~~~~~i~~~~~~i~~~--~----~~~~~~i~n~~~ 202 (324) T protein:vir:96 138 FYKKFDEAGILNQGNNPFGKSIAQS--IK----KTNK-VIKG--DFTQDNIIDLEALLEDD--E----LEANAFISKTQN 202 (324) T ss_pred HHHHHHHHhhhcCCCCCcCcccccc--cc----ccce-eccc--ccchHHHHHHHHhhhhc--c----CCCCEEEEcHHH Confidence 999999999999864322 233331 11 1111 1111 11256677777766432 1 246789999999 Q ss_pred HHhccc-CCCCCccHHHHH-HHhCCccEEEEcccccCCC-----CceEEEEEEeecCCceEEEEcChhhhcccc-e---- Q lcl|Aclame:pro 238 MSDLSK-TNQYGLAAAAKL-KDIFPKLEFVTIPEYDTAS-----GRLVQLWAPRVEGKDTATCGFTEKMRAHSI-E---- 305 (336) Q Consensus 238 ~~~L~~-~~~~~~Tvl~~l-~~n~pnl~i~~~pel~~a~-----G~~~~~~~~~~~~~~~~~~~~p~~~r~l~~-~---- 305 (336) +..|.+ .+..|..++.-- ..++-++.++..+...... |....+++-.+.+ .++.+-..-..... . T Consensus 203 ~~~L~~lkd~~G~~~~~~~~~~~l~G~PV~~~~~~~~~~~~~~~gd~s~~~~~~~~~---~~i~~~~~~~~~~~~~~~~~ 279 (324) T protein:vir:96 203 RSLLRKIVDPETKERIYDRNSDSLDGLPVVNLKSSNLKRGELITGDFDKLIYGIPQL---IEYKIDETAQLSTVKNEDGT 279 (324) T ss_pred HHHHHHhhCCCCCeeecCCCCCcccceeeEeecCCCCCcceEEEEecceEEEEEecC---cEEEEeeccccccccccccc Confidence 988864 333343222100 0001122232222221111 1111122211111 11111110000000 0 Q ss_pred ----ecCCceEEccccceeeeeeecccceeeeccC Q lcl|Aclame:pro 306 ----RYSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) Q Consensus 306 ----~~~~~~~vp~~~~t~Gv~ir~P~av~~~~GI 336 (336) ...-....-+..|+ |+.+++|.||+++.+- T Consensus 280 ~~~~~~~n~v~~r~~~r~-d~~v~~~~a~~~l~~a 313 (324) T protein:vir:96 280 PVNLFEQDMVALRATMHV-ALHIADDKAFAKLVPA 313 (324) T ss_pred chhhhhcCcEEEEEEEEe-ccEEecccceEEEecc Confidence 00112344555666 5557779999999988 No 46 >protein:vir:100135 Length: 418 # NCBI annotation: gp5 # Family: family:all:585 # MgeID: mge:1639 # MgeName: phi1026b # Cross-refs: genbank:acc:NP_945035;genbank:gi:38707895;genbank:GeneID:2744182 Probab=97.67 E-value=1.2e-05 Score=47.55 Aligned_cols=302 Identities=12% Similarity=0.056 Sum_probs=147.9 Q ss_pred CchHHHHH----Hhh----hcceeccch------------------------hhhccchhHHHHHhhhhhcccccccCcc Q lcl|Aclame:pro 1 MRDAQRIQ----NLA----RAGVILPRS------------------------VQNVSTPLTEYAMDAADLSPHLSSTGSS 48 (336) Q Consensus 1 ~~~~~~~~----~l~----~~g~~~~~~------------------------~~~~~~~~~~~a~da~d~~~~l~t~~~~ 48 (336) +.+..++. +++ +.+-.-... ........ ............=.+.... T Consensus 67 ~~~~~~l~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~g~ 145 (418) T protein:vir:10 67 LIKQGELQARLLEAEQKLARGGGSAELETPKTLGQLVTESEEMKGMDGSARKSVRVRVDR-KSIMNVPATVGSGVSGSNS 145 (418) T ss_pred HHHHHHHHHHHHHHHHHHhhcccccccchhhhhhHHhhhHHHHHHHHHHHhhhhhhhhHH-HHHHHhhhhccCCCCCCcc Confidence 11110000 000 000000000 00000000 0000000011100111222 Q ss_pred hHHHHHHHhhCceeeeeeccccchhhhcccccCCCcceeeEEEeeeec-ceeeEEeecccCCceeeeeeeeeeeeEEEEE Q lcl|Aclame:pro 49 GIPNYLTTYVDPAVIDILVAPMKAAELVGESKKGDWTTLVAAFITAEP-TTKVATYGDYSSDGDSGANINYPQRQSYFFQ 127 (336) Q Consensus 49 ~i~~~l~~~idp~v~~~~~~~~~~~~l~~v~t~g~w~~~t~~~~~~e~-~G~a~~ygd~~diP~~~~~~~~~~~~v~~~~ 127 (336) -+|..+. ++|++.+.......+++++...+. .++.+..... .+.+...+.+..+|..+.......-..+.++ T Consensus 146 lvp~~~~----~~ii~~~~~~~~l~~~~~~~~~~~---~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~f~~v~~~~~k~~ 218 (418) T protein:vir:10 146 LVVADRQ----AGIIAPPQRKMTIRDLLMPGQTSS---SSIEYTVETGFTNNAAAVAEGAQKPTSDLKFNLKNQPVRTIA 218 (418) T ss_pred ccchhHH----HHHHHHHhhhhhHHhhcceeeccC---CceeEEEEecCCCceeeeccCccccccccceeeEEEeeeeEE Confidence 3554433 455666677777777776654432 2345555444 3566677888889999988888888899999 Q ss_pred EEEeeCHHHHHHHHhhCCCHHHHHHHHHHHHHHHhhcceEEeecccc-ceEEEEecCCCCcccccccccccccCHHHHHH Q lcl|Aclame:pro 128 TWTRWGERELEMAGAGRVDLASELNYSSALGLAKFLNGSYLFGVAGL-ENYGLINDPSLSAPITATTPWSGSPAVEAVVN 206 (336) Q Consensus 128 ~~~~y~~~El~~A~~~g~~l~~~k~~aAr~a~e~~~n~~~~~Gd~~~-g~~GllN~Pnl~~~~~~~t~w~~~~t~~eI~~ 206 (336) ..+.+|.+=+..+ .++.+.-......++.+.+++-.++|++.. ...|++|...........+ ...-++ T Consensus 219 ~~~~is~ell~ds----~~l~~~i~~~l~~a~~~~~d~a~l~G~g~~~~p~Gi~~~~~~~~~~~~~~-------~~~~~~ 287 (418) T protein:vir:10 219 HLFKASRQILDDA----PALQSYIDGRARYGLQLTEEGQILKGDGTGANILGILPQASAFMPSITLA-------NATPID 287 (418) T ss_pred EeehhhHHHHHhH----HHHHHHHHHHHHHHHHHHHHHHHhccCCCCcccccccccccccccccccc-------ccccHH Confidence 9888887544433 267888888888889999999999998643 4789999765432222111 112356 Q ss_pred HHHHHHHHHHHHhCCceecccccEEEecHHHHHhccc-CCCCCccHHHHHHHh----CCccEEEEcccccCC---CCce- Q lcl|Aclame:pro 207 EVVALFQVLQTQSQGIITQEDVLRMGLPPTAMSDLSK-TNQYGLAAAAKLKDI----FPKLEFVTIPEYDTA---SGRL- 277 (336) Q Consensus 207 Di~~l~~~l~~~s~g~v~~~~p~tL~Lp~~~~~~L~~-~~~~~~Tvl~~l~~n----~pnl~i~~~pel~~a---~G~~- 277 (336) ||..++..+... + -.+..++|.+..+..|.+ .+..|.-++.=.... +-++.++..+.+... -|.- T Consensus 288 ~i~~~~~~~~~~--~----~~~~~~v~n~~~~~~L~~lkd~~G~~i~~~~~~~~~~~l~G~pV~~~~~~p~~~~~~gd~s 361 (418) T protein:vir:10 288 KIRLALLQAVLA--E----FPATGIVLNPIDWASIELTKDSQGRYIVGNPVNGTTPRLWNLPVVETQAMTANEFLVGAFS 361 (418) T ss_pred HHHHHHHhhccc--c----CCCCEEEEcHHHHHHHHHhhcCCCceeccccccCCCceecceeeEEcCCCCCCcEEEeecc Confidence 677676665321 1 135679999999988854 333343333211100 111223322222110 1221 Q ss_pred -EEEEEEeecCCceEEEEcChhhhcccc-eecCCceEEccccceeeeeeecccceeeeccC Q lcl|Aclame:pro 278 -VQLWAPRVEGKDTATCGFTEKMRAHSI-ERYSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) Q Consensus 278 -~~~~~~~~~~~~~~~~~~p~~~r~l~~-~~~~~~~~vp~~~~t~Gv~ir~P~av~~~~GI 336 (336) .+.+++. .+ ..+.+ -.+.. ....-....-+..+.+| .++.|.||++.+.. T Consensus 362 ~~~~~~~~-~~---~~i~~----~~~~~~~f~~~~~~~r~~~~~d~-~~~~~~a~~~~~~~ 413 (418) T protein:vir:10 362 MAAQIFDR-ME---IEVLL----STENVDDFEKNMVSIRAEERLAL-AVYRPESFVTGALV 413 (418) T ss_pred ceEEEEEe-cc---eEEEE----ecccchhhhcCceEEEEEEeecc-EEecccceEEEEec Confidence 1222211 11 11111 00000 00011233445556655 58899999999988 No 47 >protein:vir:99749 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1497 # MgeName: phiETA2 # Cross-refs: genbank:acc:YP_001004307;genbank:gi:122891761;genbank:GeneID:4712304 Probab=97.67 E-value=6e-06 Score=49.22 Aligned_cols=292 Identities=10% Similarity=0.013 Sum_probs=151.7 Q ss_pred CchHHHHH-HhhhcceeccchhhhccchhHHHHHhhhhhcccccc-cCcchHHHHHHHhhCceeeeeeccccchhhhccc Q lcl|Aclame:pro 1 MRDAQRIQ-NLARAGVILPRSVQNVSTPLTEYAMDAADLSPHLSS-TGSSGIPNYLTTYVDPAVIDILVAPMKAAELVGE 78 (336) Q Consensus 1 ~~~~~~~~-~l~~~g~~~~~~~~~~~~~~~~~a~da~d~~~~l~t-~~~~~i~~~l~~~idp~v~~~~~~~~~~~~l~~v 78 (336) |+..+..+ +++++- ...... ..+ + +...+++ ...+.+|..+. .++++.+...-....++.+ T Consensus 1 ~~k~~~~~~~~~~~~----~~~~~~-~~~-----~---a~~~~~~~~~~~lip~~~~----~~ii~~~~~~s~l~~~~~~ 63 (324) T protein:vir:99 1 MEQTQKLKLNLQHFA----SNNVKP-QVF-----N---PDNVMMHEKKDGTLLNDFT----TPILQEVMENSKIMRLGKY 63 (324) T ss_pred CCCchHhhHHHHHHH----HHhhhh-hhc-----c---ccceeccCCCcceechhHH----HHHHHHHHhhchhhhhcce Confidence 77766655 333311 110000 000 0 1111111 12234554333 3444444444445555554 Q ss_pred ccCCCcceeeEEEeeeecceeeEEeecccCCceeeeeeeeeeeeEEEEEEEEeeCHHHHHHHHhhCCCHHHHHHHHHHHH Q lcl|Aclame:pro 79 SKKGDWTTLVAAFITAEPTTKVATYGDYSSDGDSGANINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASELNYSSALG 158 (336) Q Consensus 79 ~t~g~w~~~t~~~~~~e~~G~a~~ygd~~diP~~~~~~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~k~~aAr~a 158 (336) .+.+. .+..|++.+..+.+...+.+..+|..+.......-..+.++..+.+|.+-++.+. .++.+.-.....++ T Consensus 64 ~~~~~---~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~---~~l~~~i~~~l~~a 137 (324) T protein:vir:99 64 EPMEG---TEKKFTFWADKPGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTY---SQFFEEMKPMIAEA 137 (324) T ss_pred eeccC---CceEEEEEecCcceeEeccCccccccccceeEEEEeeEEEEEeehhhHHHHhcch---HHHHHHHHHHHHHH Confidence 43332 3467888888888999999999999999888889999999999999986666543 56888888888888 Q ss_pred HHHhhcceEEeeccccc-eEEEEecCCCCcccccccccccccCHHHHHHHHHHHHHHHHHHhCCceecccccEEEecHHH Q lcl|Aclame:pro 159 LAKFLNGSYLFGVAGLE-NYGLINDPSLSAPITATTPWSGSPAVEAVVNEVVALFQVLQTQSQGIITQEDVLRMGLPPTA 237 (336) Q Consensus 159 ~e~~~n~~~~~Gd~~~g-~~GllN~Pnl~~~~~~~t~w~~~~t~~eI~~Di~~l~~~l~~~s~g~v~~~~p~tL~Lp~~~ 237 (336) +.+.+++-.++|+.... ..|+++..... . +...++. -++||.+++..|... + ..+..++|.++. T Consensus 138 i~~~~d~~~l~G~g~~~~~~~~~~~~~~~--~---~~~~~~~----~~~~i~~~~~~l~~~--~----~~~~~~v~n~~~ 202 (324) T protein:vir:99 138 FYKKFDEAGILNQGNNPFGKSIAQSIEKT--N---KVIKGDF----TQDNIIDLEALLEDD--E----LEANAFISKTQN 202 (324) T ss_pred HHHHHHHHhhhcCCCCccCcccccccccc--c---eeccccC----CHHHHHHHHHhhhhc--c----CCCCEEEEcHHH Confidence 88888888888886542 23444422111 1 1111122 367777787777432 1 246689999999 Q ss_pred HHhcccC-CCCCccHHHHHHHhCC---ccEEEEcccccCCCC-----ceEEEEEEeecCCceEEEEcChhhhcccc-e-- Q lcl|Aclame:pro 238 MSDLSKT-NQYGLAAAAKLKDIFP---KLEFVTIPEYDTASG-----RLVQLWAPRVEGKDTATCGFTEKMRAHSI-E-- 305 (336) Q Consensus 238 ~~~L~~~-~~~~~Tvl~~l~~n~p---nl~i~~~pel~~a~G-----~~~~~~~~~~~~~~~~~~~~p~~~r~l~~-~-- 305 (336) +..|.+- +..|..++ .-.... ++.++..+-.....| ....+++-.+.+ .++.+-..-..... . T Consensus 203 ~~~L~~l~d~~g~~~~--~~~~~~~l~G~PVv~~~~~~~~~~~~i~gd~~~~~~~~~~~---~~i~~~~~~~~~~~~~~~ 277 (324) T protein:vir:99 203 RSLLRKIVDPETKERI--YDRNSDTLDGLPVVNLKSSNLKRGELITGDFDKLIYGIPQL---IEYKIDETAQLSTVKNED 277 (324) T ss_pred HHHHHHhhcCCCceee--cCCCCccccceeEEeecCCCCCcceEEEEecccEEEEEecC---cEEEEeeccccccccccc Confidence 9888642 33232221 111111 122222222211111 111111111111 11111111000000 0 Q ss_pred ------ecCCceEEccccceeeeeeecccceeeeccC Q lcl|Aclame:pro 306 ------RYSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) Q Consensus 306 ------~~~~~~~vp~~~~t~Gv~ir~P~av~~~~GI 336 (336) ...-...+-+..|+++. +.+|.||+.+.|. T Consensus 278 ~~~~~~f~~~~~~~r~~~r~d~~-v~~~~a~~~lt~a 313 (324) T protein:vir:99 278 GTPVNLFEQDMVALRATMHVALH-IADDKAFAKLVPA 313 (324) T ss_pred ccchhhhhcCcEEEEEEEEEccE-EecccceEEEEec Confidence 11123555566677555 5579999999999 No 48 >protein:vir:104256 Length: 458 # NCBI annotation: major head protein precursor # Family: family:all:27070 # MgeID: mge:1504 # MgeName: T5 # Cross-refs: genbank:acc:YP_006977;genbank:gi:46401878;genbank:GeneID:2777673 Probab=97.67 E-value=1e-05 Score=47.91 Aligned_cols=309 Identities=10% Similarity=0.001 Sum_probs=138.0 Q ss_pred CchHHH-HHHhhhcceec-------cchh-------------------hhccchhHHHHHhhhhhcccccccCcchHHHH Q lcl|Aclame:pro 1 MRDAQR-IQNLARAGVIL-------PRSV-------------------QNVSTPLTEYAMDAADLSPHLSSTGSSGIPNY 53 (336) Q Consensus 1 ~~~~~~-~~~l~~~g~~~-------~~~~-------------------~~~~~~~~~~a~da~d~~~~l~t~~~~~i~~~ 53 (336) ++.... .....+..... .... ............-+...+ ...+.....+|.. T Consensus 99 ~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~a~~~~-~~~~~g~~~ip~~ 177 (458) T protein:vir:10 99 QDEIKSLLTAREGRSFVGDSVAKALYGTQENFEDEVEKLVLLSYVMEKGVFETEHGQRHLKAVNQS-SSVEVSSESYETI 177 (458) T ss_pred HHHHHHHHHHHHhhhhhhhhhhccchhhhhhHHHHHHHHHHHHHHHhhccchhhhhhhhhhhhhhc-ccCccccceehhh Confidence 000000 00000000000 0000 000000011111111111 1111223345543 Q ss_pred HHHhhCceeeeeeccccchhhhcccccCCCcceeeEEEeeeecceeeEEeecccCCce------eeeeeeeeeeeEEEEE Q lcl|Aclame:pro 54 LTTYVDPAVIDILVAPMKAAELVGESKKGDWTTLVAAFITAEPTTKVATYGDYSSDGD------SGANINYPQRQSYFFQ 127 (336) Q Consensus 54 l~~~idp~v~~~~~~~~~~~~l~~v~t~g~w~~~t~~~~~~e~~G~a~~ygd~~diP~------~~~~~~~~~~~v~~~~ 127 (336) +. +.|++.+.+......++.+...+. ....|.+....+.+...+.+...|- .+..........+.++ T Consensus 178 ~~----~~ii~~~~~~~~l~~~~~~~~~~~---~~~~~~~~~~~~~a~~v~e~~~~~~~~~~~~~~~~~~~i~~~~~k~~ 250 (458) T protein:vir:10 178 FS----QRIIRDLQKELVVGALFEELPMSS---KILTMLVEPDAGKATWVAASTYGTDTTTGEEVKGALKEIHFSTYKLA 250 (458) T ss_pred Hh----HHHHHHHHhhhhHHhhcceeecCC---cceEEEEecCCcceeecccccccccccccccccccceeeEeeeeeEE Confidence 33 445555555555555555433221 2235555555566666666554442 3334555566667788 Q ss_pred EEEeeCHHHHHHHHhhCCCHHHHHHHHHHHHHHHhhcceEEeeccccceEEEEecCCCCcccccccccccccCHHHHHHH Q lcl|Aclame:pro 128 TWTRWGERELEMAGAGRVDLASELNYSSALGLAKFLNGSYLFGVAGLENYGLINDPSLSAPITATTPWSGSPAVEAVVNE 207 (336) Q Consensus 128 ~~~~y~~~El~~A~~~g~~l~~~k~~aAr~a~e~~~n~~~~~Gd~~~g~~GllN~Pnl~~~~~~~t~w~~~~t~~eI~~D 207 (336) ..+.+|.+=+.-+ ..++.+.-......++.+.++.-+++|++.....|++|++......++...-.+.. ..--++| T Consensus 251 ~~v~is~ell~ds---~~~~~~~i~~~l~~~i~~~~d~~~l~G~G~~~p~Gi~~~~~~~~~~~~~~~~~~~~-~~~~~~~ 326 (458) T protein:vir:10 251 AKSFITDETEEDA---IFSLLPLLRKRLIEAHAVSIEEAFMTGDGSGKPKGLLTLASEDSAKVVTEAKADGS-VLVTAKT 326 (458) T ss_pred eeehhhHHHHhcc---hHHHHHHHHHHHHHHHHHHHHHHhhcCCCCCccceeeecccccccceeeccccccc-ccccHHH Confidence 8788886644333 35688888888888999999999999998888899999987653322221111111 1112566 Q ss_pred HHHHHHHHHHHhCCceecccccEEEecHHHHHhccc-CCCCCccHHHH-HHHh--------CCccEEEEcccccCCCCce Q lcl|Aclame:pro 208 VVALFQVLQTQSQGIITQEDVLRMGLPPTAMSDLSK-TNQYGLAAAAK-LKDI--------FPKLEFVTIPEYDTASGRL 277 (336) Q Consensus 208 i~~l~~~l~~~s~g~v~~~~p~tL~Lp~~~~~~L~~-~~~~~~Tvl~~-l~~n--------~pnl~i~~~pel~~a~G~~ 277 (336) |.+++..+... + ..+..++|.+..+..|.. .+..|.-++.. +... +-.+.|+....+-..++.. T Consensus 327 i~~~~~~l~~~--~----~~~~~~v~~~~~~~~l~~lkd~~G~~i~~~~~~~~~~~~~~~~l~G~pv~~~~~~p~~~~~~ 400 (458) T protein:vir:10 327 ISKLRRKLGRH--G----LKLSKLVLIVSMDAYYDLLEDEEWQDVAQVGNDSVKLQGQVGRIYGLPVVVSEYFPAKANSA 400 (458) T ss_pred HHHHHHhhhhh--h----cCCCEEEEcHHHHHHHHhhcccCCceeeccccccccccCcCceecceeeEEccccccccCCc Confidence 66677666332 1 135679999999988854 23323222211 1100 1112233222221111211 Q ss_pred EEEEEEeecC------CceEEEEcChhhhcccceecCCceEEccccceeeeeeecccceeeeccC Q lcl|Aclame:pro 278 VQLWAPRVEG------KDTATCGFTEKMRAHSIERYSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) Q Consensus 278 ~~~~~~~~~~------~~~~~~~~p~~~r~l~~~~~~~~~~vp~~~~t~Gv~ir~P~av~~~~GI 336 (336) ...+....+. .. .++ ...+|. ..-...+=...| -|..+++|.+|+..+== T Consensus 401 ~~~~~~f~~~~~~~~~~~-~~v-~~d~~~------~~~~~~~~~~~r-~~~~v~~~~a~v~~~~a 456 (458) T protein:vir:10 401 EFAVIVYKDNFVMPRQRA-VTV-ERERQA------GKQRDAYYVTQR-VNLQRYFANGVVSGTYA 456 (458) T ss_pred ceEEEEecccEEEEEeec-eEE-Eeeccc------CCCceEEEEEEE-ecceEecccceEEEeec Confidence 1112111000 00 011 011111 111233444555 46788899988772111 No 49 >protein:vir:101650 Length: 497 # NCBI annotation: gp13 # Family: family:all:585 # MgeID: mge:1515 # MgeName: 244 # Cross-refs: genbank:acc:YP_654768;genbank:gi:109302766;genbank:GeneID:4156084 Probab=97.55 E-value=5.4e-06 Score=49.48 Aligned_cols=311 Identities=15% Similarity=0.114 Sum_probs=155.4 Q ss_pred CchHHH----------HHHhh--hcceeccch---------hhhccchhHH------HHHhhhhhcccccccCcch--HH Q lcl|Aclame:pro 1 MRDAQR----------IQNLA--RAGVILPRS---------VQNVSTPLTE------YAMDAADLSPHLSSTGSSG--IP 51 (336) Q Consensus 1 ~~~~~~----------~~~l~--~~g~~~~~~---------~~~~~~~~~~------~a~da~d~~~~l~t~~~~~--i~ 51 (336) ....+. +.... +.+-.+... .......... .+.... ..-...+++.+| +| T Consensus 86 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~gg~~vp 164 (497) T protein:vir:10 86 KQIRKHLARAVIMNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAI-GQNPFGSTGTFAPGIL 164 (497) T ss_pred hhHHHHHHHHHhhhHHHHhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHhhhhhhHHHH-HhhhcccCcccccccc Confidence 000000 00000 000000000 0000000000 000000 000112222222 44 Q ss_pred HHHHHhhCceeeeeeccccchhhhcccccCCCcceeeEEEeeeec-ceeeEEeecccCCceeeeeeeeeeeeEEEEEEEE Q lcl|Aclame:pro 52 NYLTTYVDPAVIDILVAPMKAAELVGESKKGDWTTLVAAFITAEP-TTKVATYGDYSSDGDSGANINYPQRQSYFFQTWT 130 (336) Q Consensus 52 ~~l~~~idp~v~~~~~~~~~~~~l~~v~t~g~w~~~t~~~~~~e~-~G~a~~ygd~~diP~~~~~~~~~~~~v~~~~~~~ 130 (336) ..+. ++|++.+.+......++++.+.+. ..+.|+.... .+.+.+++.+..+|..+..........+.++..+ T Consensus 165 ~~~~----~~ii~~~~~~~~i~~l~~~~~~~~---~~~~~~~~~~~~~~a~wv~E~~~~~~s~~~f~~i~~~~~k~a~~~ 237 (497) T protein:vir:10 165 PTFL----PGIVEQLFYELSLADLISSRPVTS---PNLSYLTESAAHNNAAAVAEAGTYPFSSEEFARVYEQVGKVANAL 237 (497) T ss_pred hhhh----HHHHHHHHhhhhHHhhccccccCC---CceEEEEEcCCCCcceeeccCcccccccccceeeEeeeeeeEeec Confidence 3322 566777777777788877754433 2356665433 4577888999999999988888888899999988 Q ss_pred eeCHHHHHHHHhhCCCHHHHHHHHHHHHHHHhhcceEEeeccccceEEEEecCCCCccccccccccc------------- Q lcl|Aclame:pro 131 RWGERELEMAGAGRVDLASELNYSSALGLAKFLNGSYLFGVAGLENYGLINDPSLSAPITATTPWSG------------- 197 (336) Q Consensus 131 ~y~~~El~~A~~~g~~l~~~k~~aAr~a~e~~~n~~~~~Gd~~~g~~GllN~Pnl~~~~~~~t~w~~------------- 197 (336) .+|.+=|+-+ . .|.+--....++++.+.+|.-.++|++..+..|++|++.........+.+.+ T Consensus 238 ~iS~ell~d~--~--~l~~~i~~~l~~~i~~~~d~~~l~G~G~~~p~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 313 (497) T protein:vir:10 238 TITDEGLRDA--P--ELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSATVSNVKFPAD 313 (497) T ss_pred HhHHHHHHhH--H--HHHHHHHHHHHHHHHHHHHHHhhcCCCcccccccccccccccccccccchhhhhhhhhhhhhhcc Confidence 8886534332 2 4788888888999999999999999998889999998865432211110000 Q ss_pred ----------------------------------ccCHHHHHHHHHHHHHHHHHHhCCceecccccEEEecHHHHHhccc Q lcl|Aclame:pro 198 ----------------------------------SPAVEAVVNEVVALFQVLQTQSQGIITQEDVLRMGLPPTAMSDLSK 243 (336) Q Consensus 198 ----------------------------------~~t~~eI~~Di~~l~~~l~~~s~g~v~~~~p~tL~Lp~~~~~~L~~ 243 (336) ..+...++.++..++..+..... ..|+.++|.+..+..|.+ T Consensus 314 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~vmn~~~~~~l~~ 388 (497) T protein:vir:10 314 GTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLF-----QTPNAVVMNPRDWELLRL 388 (497) T ss_pred cccchhhhhhHHHHHHHHHhhhhhhhhccchhccccchhhhhhHHHHHHhhhhhhcc-----cCCCeEEEchHHHHHHHH Confidence 00122344555555555544321 246778888888877753 Q ss_pred -CCCCCccHHHH---------HHHh--CCccEEEEcccccCCC---Cce---EEEEEEeecCCceEEEEcChhhhcccce Q lcl|Aclame:pro 244 -TNQYGLAAAAK---------LKDI--FPKLEFVTIPEYDTAS---GRL---VQLWAPRVEGKDTATCGFTEKMRAHSIE 305 (336) Q Consensus 244 -~~~~~~Tvl~~---------l~~n--~pnl~i~~~pel~~a~---G~~---~~~~~~~~~~~~~~~~~~p~~~r~l~~~ 305 (336) .+..|.-++.- .... .-+..++..+...... |.- .+.++++ .+ .++.+... .... T Consensus 389 lkd~~G~~i~~~~~~~~~~~~~~~~~~l~G~pV~~t~~~~~~~~~~Gd~~~~~~~i~~r-~~---~~v~~~~~---~~~~ 461 (497) T protein:vir:10 389 TKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGTILVGHFAPSVIQTARR-EG---VTMQMTNS---NGTD 461 (497) T ss_pred hhcCCCceeccCcccccccccccCCceeeceeeEecCCCCCCceEEeecccceEEEEEe-cc---cEEEeecc---cchh Confidence 23334322210 0000 0012222222221100 111 1122222 11 11211110 0111 Q ss_pred ecCCceEEccccceeeeeeecccceeeeccC Q lcl|Aclame:pro 306 RYSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) Q Consensus 306 ~~~~~~~vp~~~~t~Gv~ir~P~av~~~~GI 336 (336) ...-.+.+-++.|++| .|++|.||++++-. T Consensus 462 f~~n~v~~r~~~r~~~-~v~~p~A~~~l~~~ 491 (497) T protein:vir:10 462 FVDGKVTVRAEERLGL-LVYRPSAFQLIQLK 491 (497) T ss_pred hhcCcEEEEEEEeecc-eeeccccEEEEEec Confidence 1123456677778777 67799999999888 No 50 >protein:vir:7855 Length: 497 # NCBI annotation: gp12 # Family: family:all:585 # MgeID: mge:150 # MgeName: CJW1 # Cross-refs: genbank:acc:NP_817462;genbank:gi:29565891;genbank:GeneID:1259081 Probab=97.55 E-value=5.4e-06 Score=49.48 Aligned_cols=311 Identities=15% Similarity=0.114 Sum_probs=155.4 Q ss_pred CchHHH----------HHHhh--hcceeccch---------hhhccchhHH------HHHhhhhhcccccccCcch--HH Q lcl|Aclame:pro 1 MRDAQR----------IQNLA--RAGVILPRS---------VQNVSTPLTE------YAMDAADLSPHLSSTGSSG--IP 51 (336) Q Consensus 1 ~~~~~~----------~~~l~--~~g~~~~~~---------~~~~~~~~~~------~a~da~d~~~~l~t~~~~~--i~ 51 (336) ....+. +.... +.+-.+... .......... .+.... ..-...+++.+| +| T Consensus 86 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~gg~~vp 164 (497) T protein:vir:78 86 KQIRKHLARAVIMNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAI-GQNPFGSTGTFAPGIL 164 (497) T ss_pred hhHHHHHHHHHhhhHHHHhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHhhhhhhHHHH-HhhhcccCcccccccc Confidence 000000 00000 000000000 0000000000 000000 000112222222 44 Q ss_pred HHHHHhhCceeeeeeccccchhhhcccccCCCcceeeEEEeeeec-ceeeEEeecccCCceeeeeeeeeeeeEEEEEEEE Q lcl|Aclame:pro 52 NYLTTYVDPAVIDILVAPMKAAELVGESKKGDWTTLVAAFITAEP-TTKVATYGDYSSDGDSGANINYPQRQSYFFQTWT 130 (336) Q Consensus 52 ~~l~~~idp~v~~~~~~~~~~~~l~~v~t~g~w~~~t~~~~~~e~-~G~a~~ygd~~diP~~~~~~~~~~~~v~~~~~~~ 130 (336) ..+. ++|++.+.+......++++.+.+. ..+.|+.... .+.+.+++.+..+|..+..........+.++..+ T Consensus 165 ~~~~----~~ii~~~~~~~~i~~l~~~~~~~~---~~~~~~~~~~~~~~a~wv~E~~~~~~s~~~f~~i~~~~~k~a~~~ 237 (497) T protein:vir:78 165 PTFL----PGIVEQLFYELSLADLISSRPVTS---PNLSYLTESAAHNNAAAVAEAGTYPFSSEEFARVYEQVGKVANAL 237 (497) T ss_pred hhhh----HHHHHHHHhhhhHHhhccccccCC---CceEEEEEcCCCCcceeeccCcccccccccceeeEeeeeeeEeec Confidence 3322 566777777777788877754433 2356665433 4577888999999999988888888899999988 Q ss_pred eeCHHHHHHHHhhCCCHHHHHHHHHHHHHHHhhcceEEeeccccceEEEEecCCCCccccccccccc------------- Q lcl|Aclame:pro 131 RWGERELEMAGAGRVDLASELNYSSALGLAKFLNGSYLFGVAGLENYGLINDPSLSAPITATTPWSG------------- 197 (336) Q Consensus 131 ~y~~~El~~A~~~g~~l~~~k~~aAr~a~e~~~n~~~~~Gd~~~g~~GllN~Pnl~~~~~~~t~w~~------------- 197 (336) .+|.+=|+-+ . .|.+--....++++.+.+|.-.++|++..+..|++|++.........+.+.+ T Consensus 238 ~iS~ell~d~--~--~l~~~i~~~l~~~i~~~~d~~~l~G~G~~~p~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 313 (497) T protein:vir:78 238 TITDEGLRDA--P--ELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSATVSNVKFPAD 313 (497) T ss_pred HhHHHHHHhH--H--HHHHHHHHHHHHHHHHHHHHHhhcCCCcccccccccccccccccccccchhhhhhhhhhhhhhcc Confidence 8886534332 2 4788888888999999999999999998889999998865432211110000 Q ss_pred ----------------------------------ccCHHHHHHHHHHHHHHHHHHhCCceecccccEEEecHHHHHhccc Q lcl|Aclame:pro 198 ----------------------------------SPAVEAVVNEVVALFQVLQTQSQGIITQEDVLRMGLPPTAMSDLSK 243 (336) Q Consensus 198 ----------------------------------~~t~~eI~~Di~~l~~~l~~~s~g~v~~~~p~tL~Lp~~~~~~L~~ 243 (336) ..+...++.++..++..+..... ..|+.++|.+..+..|.+ T Consensus 314 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~vmn~~~~~~l~~ 388 (497) T protein:vir:78 314 GTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLF-----QTPNAVVMNPRDWELLRL 388 (497) T ss_pred cccchhhhhhHHHHHHHHHhhhhhhhhccchhccccchhhhhhHHHHHHhhhhhhcc-----cCCCeEEEchHHHHHHHH Confidence 00122344555555555544321 246778888888877753 Q ss_pred -CCCCCccHHHH---------HHHh--CCccEEEEcccccCCC---Cce---EEEEEEeecCCceEEEEcChhhhcccce Q lcl|Aclame:pro 244 -TNQYGLAAAAK---------LKDI--FPKLEFVTIPEYDTAS---GRL---VQLWAPRVEGKDTATCGFTEKMRAHSIE 305 (336) Q Consensus 244 -~~~~~~Tvl~~---------l~~n--~pnl~i~~~pel~~a~---G~~---~~~~~~~~~~~~~~~~~~p~~~r~l~~~ 305 (336) .+..|.-++.- .... .-+..++..+...... |.- .+.++++ .+ .++.+... .... T Consensus 389 lkd~~G~~i~~~~~~~~~~~~~~~~~~l~G~pV~~t~~~~~~~~~~Gd~~~~~~~i~~r-~~---~~v~~~~~---~~~~ 461 (497) T protein:vir:78 389 TKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIPLGTILVGHFAPSVIQTARR-EG---VTMQMTNS---NGTD 461 (497) T ss_pred hhcCCCceeccCcccccccccccCCceeeceeeEecCCCCCCceEEeecccceEEEEEe-cc---cEEEeecc---cchh Confidence 23334322210 0000 0012222222221100 111 1122222 11 11211110 0111 Q ss_pred ecCCceEEccccceeeeeeecccceeeeccC Q lcl|Aclame:pro 306 RYSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) Q Consensus 306 ~~~~~~~vp~~~~t~Gv~ir~P~av~~~~GI 336 (336) ...-.+.+-++.|++| .|++|.||++++-. T Consensus 462 f~~n~v~~r~~~r~~~-~v~~p~A~~~l~~~ 491 (497) T protein:vir:78 462 FVDGKVTVRAEERLGL-LVYRPSAFQLIQLK 491 (497) T ss_pred hhcCcEEEEEEEeecc-eeeccccEEEEEec Confidence 1123456677778777 67799999999888 No 51 >protein:vir:191 Length: 385 # NCBI annotation: major head subunit precursor # Family: family:all:585 # MgeID: mge:6 # MgeName: HK97 # Cross-refs: genbank:acc:NP_037701;genbank:gi:9634158;genbank:GeneID:1262530 Probab=97.51 E-value=8.7e-06 Score=48.32 Aligned_cols=304 Identities=14% Similarity=0.095 Sum_probs=151.7 Q ss_pred CchHHH-HHHhhh---cceeccchhh----hccchhHHHHHh------hhhhccccccc---CcchHHHHHHHhhCceee Q lcl|Aclame:pro 1 MRDAQR-IQNLAR---AGVILPRSVQ----NVSTPLTEYAMD------AADLSPHLSST---GSSGIPNYLTTYVDPAVI 63 (336) Q Consensus 1 ~~~~~~-~~~l~~---~g~~~~~~~~----~~~~~~~~~a~d------a~d~~~~l~t~---~~~~i~~~l~~~idp~v~ 63 (336) ++..++ +.++++ .+..-+.... ............ .......+.+. +..-+|.. + .++++ T Consensus 50 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~i~~~---~-~~~ii 125 (385) T protein:vir:19 50 LTKSGTRLFDLEQKLASGAENPGEKKSFSERAAEELIKSWDGKQGTFGAKTFNKSLGSDADSAGSLIQPM---Q-IPGII 125 (385) T ss_pred HHHHHHHHHHHHHHhhccccccchhhhhHHHHHHHHHHHHHHhhccchhhHHHhhhccccccCCceecch---h-hhHHH Confidence 111111 111111 1111111000 000000000000 00000011111 11113322 2 24566 Q ss_pred eeeccccchhhhcccccCCCcceeeEEEeeeec-ceeeEEeecccCCceeeeeeeeeeeeEEEEEEEEeeCHHHHHHHHh Q lcl|Aclame:pro 64 DILVAPMKAAELVGESKKGDWTTLVAAFITAEP-TTKVATYGDYSSDGDSGANINYPQRQSYFFQTWTRWGERELEMAGA 142 (336) Q Consensus 64 ~~~~~~~~~~~l~~v~t~g~w~~~t~~~~~~e~-~G~a~~ygd~~diP~~~~~~~~~~~~v~~~~~~~~y~~~El~~A~~ 142 (336) +..........++++...+. ..+.|++.+. .+.+...+.+..+|..+............++..+.+|.+ +..-. T Consensus 126 ~~~~~~~~l~~~~~~~~~~~---~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~~~~~~k~~~~~~is~e-ll~d~- 200 (385) T protein:vir:19 126 MPGLRRLTIRDLLAQGRTSS---NALEYVREEVFTNNADVVAEKALKPESDITFSKQTANVKTIAHWVQASRQ-VMDDA- 200 (385) T ss_pred HHhhhccchhhhcceecccC---cceEEEEEecCCcceeeeccCccccccccceeEEEEeeeeEEEeehhhHH-HHhhH- Confidence 66666677777777754432 2356676655 356677788889999999999999999999999999964 43322 Q ss_pred hCCCHHHHHHHHHHHHHHHhhcceEEeeccc-cceEEEEecCCCCcccccccccccccCHHHHHHHHHHHHHHHHHHhCC Q lcl|Aclame:pro 143 GRVDLASELNYSSALGLAKFLNGSYLFGVAG-LENYGLINDPSLSAPITATTPWSGSPAVEAVVNEVVALFQVLQTQSQG 221 (336) Q Consensus 143 ~g~~l~~~k~~aAr~a~e~~~n~~~~~Gd~~-~g~~GllN~Pnl~~~~~~~t~w~~~~t~~eI~~Di~~l~~~l~~~s~g 221 (336) .++.+.-....+.++.+.++.-.+.|++. ....|+++.+....... ..+.+..++||.+++.++...- T Consensus 201 --~~l~~~i~~~la~a~~~~~d~~~l~G~g~~~~~~Gi~~~~~~~~~~~-------~~~~~~~~d~i~~~~~~l~~~~-- 269 (385) T protein:vir:19 201 --PMLQSYINNRLMYGLALKEEGQLLNGDGTGDNLEGLNKVATAYDTSL-------NATGDTRADIIAHAIYQVTESE-- 269 (385) T ss_pred --HHHHHHHHHHHHHHHHHHHHHHHHhccCCCCcccccccccccccccc-------cccccchHHHHHHHHHhhcccc-- Confidence 24777777788888888888888899854 34578888765432211 1112335777888877774321 Q ss_pred ceecccccEEEecHHHHHhccc-CCCCCccHHHHHHHh----CCccEEEEcccccCC---CCc--eEEEEEEeecCCceE Q lcl|Aclame:pro 222 IITQEDVLRMGLPPTAMSDLSK-TNQYGLAAAAKLKDI----FPKLEFVTIPEYDTA---SGR--LVQLWAPRVEGKDTA 291 (336) Q Consensus 222 ~v~~~~p~tL~Lp~~~~~~L~~-~~~~~~Tvl~~l~~n----~pnl~i~~~pel~~a---~G~--~~~~~~~~~~~~~~~ 291 (336) ..+..++|+|+.+..|.. .+..|.-++.-.... +-++.++..+.+... -|+ ..+.+++. .+ . T Consensus 270 ----~~~~~~~~~~~~~~~l~~lkd~~G~~l~~~~~~~~~~~l~G~pV~~~~~~p~~~~~~gd~~~~~~~~~~---~~-~ 341 (385) T protein:vir:19 270 ----FSASGIVLNPRDWHNIALLKDNEGRYIFGGPQAFTSNIMWGLPVVPTKAQAAGTFTVGGFDMASQVWDR---MD-A 341 (385) T ss_pred ----CCCCEEEEcHHHHHHHHHhhcCCCceeccCcccCCCceecceeeEEcCcCCCCcEEEeecccEEEEEEe---cc-e Confidence 246789999999988854 333343332211111 112223322222110 121 11222221 11 1 Q ss_pred EEEcChhhhcccceecCCceEEccccceeeeeeecccceeeeccC Q lcl|Aclame:pro 292 TCGFTEKMRAHSIERYSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) Q Consensus 292 ~~~~p~~~r~l~~~~~~~~~~vp~~~~t~Gv~ir~P~av~~~~GI 336 (336) .+.+...-. -....-.+.+-+..|++|. +++|.||++++.- T Consensus 342 ~v~~~~~~~---~~~~~~~~~~~~~~r~~~~-v~~~~a~~~~~~~ 382 (385) T protein:vir:19 342 TVEVSREDR---DNFVKNMLTILCEERLALA-HYRPTAIIKGTFS 382 (385) T ss_pred EEEEecccc---chhhcCcEEEEEEEeeccE-EecccceEEEEec Confidence 111111000 0011123455667777754 5789999999988 No 52 >protein:vir:1886 Length: 385 # NCBI annotation: major capsid subunit precursor # Family: family:all:585 # MgeID: mge:41 # MgeName: HK022 # Cross-refs: genbank:acc:NP_037666;genbank:gi:9634124;genbank:GeneID:1262513 Probab=97.51 E-value=8.7e-06 Score=48.32 Aligned_cols=304 Identities=14% Similarity=0.095 Sum_probs=151.7 Q ss_pred CchHHH-HHHhhh---cceeccchhh----hccchhHHHHHh------hhhhccccccc---CcchHHHHHHHhhCceee Q lcl|Aclame:pro 1 MRDAQR-IQNLAR---AGVILPRSVQ----NVSTPLTEYAMD------AADLSPHLSST---GSSGIPNYLTTYVDPAVI 63 (336) Q Consensus 1 ~~~~~~-~~~l~~---~g~~~~~~~~----~~~~~~~~~a~d------a~d~~~~l~t~---~~~~i~~~l~~~idp~v~ 63 (336) ++..++ +.++++ .+..-+.... ............ .......+.+. +..-+|.. + .++++ T Consensus 50 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~i~~~---~-~~~ii 125 (385) T protein:vir:18 50 LTKSGTRLFDLEQKLASGAENPGEKKSFSERAAEELIKSWDGKQGTFGAKTFNKSLGSDADSAGSLIQPM---Q-IPGII 125 (385) T ss_pred HHHHHHHHHHHHHHhhccccccchhhhhHHHHHHHHHHHHHHhhccchhhHHHhhhccccccCCceecch---h-hhHHH Confidence 111111 111111 1111111000 000000000000 00000011111 11113322 2 24566 Q ss_pred eeeccccchhhhcccccCCCcceeeEEEeeeec-ceeeEEeecccCCceeeeeeeeeeeeEEEEEEEEeeCHHHHHHHHh Q lcl|Aclame:pro 64 DILVAPMKAAELVGESKKGDWTTLVAAFITAEP-TTKVATYGDYSSDGDSGANINYPQRQSYFFQTWTRWGERELEMAGA 142 (336) Q Consensus 64 ~~~~~~~~~~~l~~v~t~g~w~~~t~~~~~~e~-~G~a~~ygd~~diP~~~~~~~~~~~~v~~~~~~~~y~~~El~~A~~ 142 (336) +..........++++...+. ..+.|++.+. .+.+...+.+..+|..+............++..+.+|.+ +..-. T Consensus 126 ~~~~~~~~l~~~~~~~~~~~---~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~~~~~~k~~~~~~is~e-ll~d~- 200 (385) T protein:vir:18 126 MPGLRRLTIRDLLAQGRTSS---NALEYVREEVFTNNADVVAEKALKPESDITFSKQTANVKTIAHWVQASRQ-VMDDA- 200 (385) T ss_pred HHhhhccchhhhcceecccC---cceEEEEEecCCcceeeeccCccccccccceeEEEEeeeeEEEeehhhHH-HHhhH- Confidence 66666677777777754432 2356676655 356677788889999999999999999999999999964 43322 Q ss_pred hCCCHHHHHHHHHHHHHHHhhcceEEeeccc-cceEEEEecCCCCcccccccccccccCHHHHHHHHHHHHHHHHHHhCC Q lcl|Aclame:pro 143 GRVDLASELNYSSALGLAKFLNGSYLFGVAG-LENYGLINDPSLSAPITATTPWSGSPAVEAVVNEVVALFQVLQTQSQG 221 (336) Q Consensus 143 ~g~~l~~~k~~aAr~a~e~~~n~~~~~Gd~~-~g~~GllN~Pnl~~~~~~~t~w~~~~t~~eI~~Di~~l~~~l~~~s~g 221 (336) .++.+.-....+.++.+.++.-.+.|++. ....|+++.+....... ..+.+..++||.+++.++...- T Consensus 201 --~~l~~~i~~~la~a~~~~~d~~~l~G~g~~~~~~Gi~~~~~~~~~~~-------~~~~~~~~d~i~~~~~~l~~~~-- 269 (385) T protein:vir:18 201 --PMLQSYINNRLMYGLALKEEGQLLNGDGTGDNLEGLNKVATAYDTSL-------NATGDTRADIIAHAIYQVTESE-- 269 (385) T ss_pred --HHHHHHHHHHHHHHHHHHHHHHHHhccCCCCcccccccccccccccc-------cccccchHHHHHHHHHhhcccc-- Confidence 24777777788888888888888899854 34578888765432211 1112335777888877774321 Q ss_pred ceecccccEEEecHHHHHhccc-CCCCCccHHHHHHHh----CCccEEEEcccccCC---CCc--eEEEEEEeecCCceE Q lcl|Aclame:pro 222 IITQEDVLRMGLPPTAMSDLSK-TNQYGLAAAAKLKDI----FPKLEFVTIPEYDTA---SGR--LVQLWAPRVEGKDTA 291 (336) Q Consensus 222 ~v~~~~p~tL~Lp~~~~~~L~~-~~~~~~Tvl~~l~~n----~pnl~i~~~pel~~a---~G~--~~~~~~~~~~~~~~~ 291 (336) ..+..++|+|+.+..|.. .+..|.-++.-.... +-++.++..+.+... -|+ ..+.+++. .+ . T Consensus 270 ----~~~~~~~~~~~~~~~l~~lkd~~G~~l~~~~~~~~~~~l~G~pV~~~~~~p~~~~~~gd~~~~~~~~~~---~~-~ 341 (385) T protein:vir:18 270 ----FSASGIVLNPRDWHNIALLKDNEGRYIFGGPQAFTSNIMWGLPVVPTKAQAAGTFTVGGFDMASQVWDR---MD-A 341 (385) T ss_pred ----CCCCEEEEcHHHHHHHHHhhcCCCceeccCcccCCCceecceeeEEcCcCCCCcEEEeecccEEEEEEe---cc-e Confidence 246789999999988854 333343332211111 112223322222110 121 11222221 11 1 Q ss_pred EEEcChhhhcccceecCCceEEccccceeeeeeecccceeeeccC Q lcl|Aclame:pro 292 TCGFTEKMRAHSIERYSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) Q Consensus 292 ~~~~p~~~r~l~~~~~~~~~~vp~~~~t~Gv~ir~P~av~~~~GI 336 (336) .+.+...-. -....-.+.+-+..|++|. +++|.||++++.- T Consensus 342 ~v~~~~~~~---~~~~~~~~~~~~~~r~~~~-v~~~~a~~~~~~~ 382 (385) T protein:vir:18 342 TVEVSREDR---DNFVKNMLTILCEERLALA-HYRPTAIIKGTFS 382 (385) T ss_pred EEEEecccc---chhhcCcEEEEEEEeeccE-EecccceEEEEec Confidence 111111000 0011123455667777754 5789999999988 No 53 >protein:vir:81227 Length: 413 # NCBI annotation: gp6, major capsid protein # Family: family:all:585 # MgeID: mge:1893 # MgeName: BFK20 # Cross-refs: genbank:acc:YP_001456736;genbank:gi:157168379;hssp:P49861;interpro:IPR006444;uniprot:Q9MBJ9;genbank:GeneID:5580350 Probab=97.40 E-value=3.7e-05 Score=44.86 Aligned_cols=301 Identities=12% Similarity=0.068 Sum_probs=147.3 Q ss_pred CchHHHHHHhhhcceeccc------------------------hhhhccchhHHHHHhhhhhcccccccCcchHHHHHHH Q lcl|Aclame:pro 1 MRDAQRIQNLARAGVILPR------------------------SVQNVSTPLTEYAMDAADLSPHLSSTGSSGIPNYLTT 56 (336) Q Consensus 1 ~~~~~~~~~l~~~g~~~~~------------------------~~~~~~~~~~~~a~da~d~~~~l~t~~~~~i~~~l~~ 56 (336) +.+.+.-..+.+.+..... ..............+. ....+.++.....+|..+. T Consensus 58 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~vp~~~~- 135 (413) T protein:vir:81 58 SVDSEKSGELTRKGEGYKSIGEFFAKRAGDQIKQQAGGAQLNYSVGEYVAPRVKAASDP-ASTATLTDEFQGGYGTTWN- 135 (413) T ss_pred HHhHHHhhhHhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHhhhhhhhhhhhHHHhhhhh-hhhcccccccccccchhhH- Confidence 1111111111111110000 0000000000001111 1112223344444664443 Q ss_pred hhCceeeeeeccccchhhhcccccCCCcceeeEEEeeeec----ceeeEEeecccCCceeee-eeeeeeeeEEEEEEEEe Q lcl|Aclame:pro 57 YVDPAVIDILVAPMKAAELVGESKKGDWTTLVAAFITAEP----TTKVATYGDYSSDGDSGA-NINYPQRQSYFFQTWTR 131 (336) Q Consensus 57 ~idp~v~~~~~~~~~~~~l~~v~t~g~w~~~t~~~~~~e~----~G~a~~ygd~~diP~~~~-~~~~~~~~v~~~~~~~~ 131 (336) +++++.+.......+++++.+... .+..|++... .+.+...+.+..+|-.+. .....+..++.++..+. T Consensus 136 ---~~ii~~~~~~~~l~~~~~~~~~~~---~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~f~~i~~~~~k~~~~~~ 209 (413) T protein:vir:81 136 ---RNIIYRRREKLVVADLMDNLTMTN---TTIKYLMEKANRVVEGGFKTVAEGGKKPYMRFADFDIVTESLSKIAGLTK 209 (413) T ss_pred ---HHHHHHHhhhhhHHhhcceeeccC---CceeEEEeccccccccccceecCcccccccCcccceeeEeeeeeEEEeeh Confidence 567777777777888877654332 2233333222 245667787888887774 57777888888888888 Q ss_pred eCHHHHHHHHhhCCCHHHHHHHHHHHHHHHhhcceEEeeccc-cceEEEEecCCCCcccccccccccccCHHHHHHHHHH Q lcl|Aclame:pro 132 WGERELEMAGAGRVDLASELNYSSALGLAKFLNGSYLFGVAG-LENYGLINDPSLSAPITATTPWSGSPAVEAVVNEVVA 210 (336) Q Consensus 132 y~~~El~~A~~~g~~l~~~k~~aAr~a~e~~~n~~~~~Gd~~-~g~~GllN~Pnl~~~~~~~t~w~~~~t~~eI~~Di~~ 210 (336) +|.+=|..+. .|.+--....+.++.+.+++-.++|++. ....|++|.+++.+... .+.+.++++|.. T Consensus 210 iS~ell~ds~----~l~~~i~~~la~~~~~~~d~~~l~G~G~~~~~~Gi~~~~~~~~~~~--------~~~~~~~~~i~~ 277 (413) T protein:vir:81 210 ITDEMIEDYD----FLVSYINARLLEELAIEEERQLLLGDGTGNNLTGLLKRDGIQTLAV--------SNKDELADSIYK 277 (413) T ss_pred hhHHHHHHHH----HHHHHHHHHHHHHHHHHHHHHHhccCCCCCcccccccccccccccc--------cccchhHHHHHH Confidence 8876444332 2777777777788888888888899853 33579999876643211 123446777777 Q ss_pred HHHHHHHHhCCceecccccEEEecHHHHHhccc-CCCCCccHHH-HHHHh--CC---------ccEEEEcccccCCCCc- Q lcl|Aclame:pro 211 LFQVLQTQSQGIITQEDVLRMGLPPTAMSDLSK-TNQYGLAAAA-KLKDI--FP---------KLEFVTIPEYDTASGR- 276 (336) Q Consensus 211 l~~~l~~~s~g~v~~~~p~tL~Lp~~~~~~L~~-~~~~~~Tvl~-~l~~n--~p---------nl~i~~~pel~~a~G~- 276 (336) ++..+....+ ..++.++|.++.+..|.+ .+..|.-++. -+... .+ ++.++...... .|. T Consensus 278 ~~~~~~~~~~-----~~~~~~vmn~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~~~~~~~~l~G~pv~~s~~~~--~~~~ 350 (413) T protein:vir:81 278 AMTNISLATP-----FQADALVINPLDYQELRLAKDANGQYYGGGVFQGQYGSGGIMLDPAPWGLRTVQSQVVP--VGKP 350 (413) T ss_pred HHHHhhhhcc-----CCCcEEEEcHHHHHHHHHhhccCCceeccccccccccccccccCceecceeeEEcCCCC--cccE Confidence 7766544332 136679999999888753 3333433321 11110 00 11222211111 121 Q ss_pred ----e--EEEEEEeecCCceEEEEcChhhhcccceecCCceEEccccceeeeeeecccceeeeccC Q lcl|Aclame:pro 277 ----L--VQLWAPRVEGKDTATCGFTEKMRAHSIERYSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) Q Consensus 277 ----~--~~~~~~~~~~~~~~~~~~p~~~r~l~~~~~~~~~~vp~~~~t~Gv~ir~P~av~~~~GI 336 (336) . .+.++++ .+ ..+.+.. ........-....-+..|++| .+++|.||+.++.= T Consensus 351 ~~gd~~~~~~~~~~-~~---~~v~~~~---~~~~~~~~~~~~~r~~~r~d~-~~~~~~a~~~l~~~ 408 (413) T protein:vir:81 351 VVGAFRSAASVLRK-GG---VRIDSTN---TNVDDFENNLITVRAEERVGL-MVTFPEAIVQLDVA 408 (413) T ss_pred EEEecccEEEEEEe-cc---eEEEEec---cccchhhcCcEEEEEEEeecc-EEecccceEEEEec Confidence 1 1122111 11 1111100 000001112344555666654 55789999988766 No 54 >protein:vir:4339 Length: 395 # NCBI annotation: major head protein # Family: family:all:585 # MgeID: mge:93 # MgeName: D3 # Cross-refs: genbank:acc:NP_061502;genbank:gi:9635591;genbank:GeneID:1262860 Probab=97.37 E-value=2.1e-05 Score=46.21 Aligned_cols=305 Identities=12% Similarity=0.063 Sum_probs=149.0 Q ss_pred CchHHH-HHHh-------hhcceeccchhhhc----cc---------hhHHHHHhhhhhcccccccCcch--HHHHHHHh Q lcl|Aclame:pro 1 MRDAQR-IQNL-------ARAGVILPRSVQNV----ST---------PLTEYAMDAADLSPHLSSTGSSG--IPNYLTTY 57 (336) Q Consensus 1 ~~~~~~-~~~l-------~~~g~~~~~~~~~~----~~---------~~~~~a~da~d~~~~l~t~~~~~--i~~~l~~~ 57 (336) +...+. +.+. ++.+.. +...... .. ..+...+-.+......++..++| +|-. + T Consensus 54 ~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~vp~~---~ 129 (395) T protein:vir:43 54 QGELQARLSAAEQAMLANEKRDGG-EEAPKTAGQMVAESLKEQGVTSSLRGSHRVSMPRSAITSIDGSGGALVAPD---R 129 (395) T ss_pred HHHHHHHHHHHHHHHHhhhccccc-cchhhhHHHHHHHHHHHHHHHHHhhhhhhhhhhhhhhcccCCCCccccchh---h Confidence 000000 0000 000000 0000000 00 00000000000000011222221 3322 2 Q ss_pred hCceeeeeeccccchhhhcccccCCCcceeeEEEeeeec-ceeeEEeecccCCceeeeeeeeeeeeEEEEEEEEeeCHHH Q lcl|Aclame:pro 58 VDPAVIDILVAPMKAAELVGESKKGDWTTLVAAFITAEP-TTKVATYGDYSSDGDSGANINYPQRQSYFFQTWTRWGERE 136 (336) Q Consensus 58 idp~v~~~~~~~~~~~~l~~v~t~g~w~~~t~~~~~~e~-~G~a~~ygd~~diP~~~~~~~~~~~~v~~~~~~~~y~~~E 136 (336) . ++|++.+........++++.+.+. .++.|++... .+.+...|.+...|..+........+.+.++..+.+|.+= T Consensus 130 ~-~~ii~~~~~~~~l~~l~~~~~~~~---~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~i~~~~~k~~~~~~is~el 205 (395) T protein:vir:43 130 R-PGVVAAPQRRLTIRDLVAPGTTES---NSVEYVRETGFVNNAAPVSEGTQKPYSDLTFELENAPVRTIAHLFKASRQI 205 (395) T ss_pred H-HHHHHHHHhhhhHHhhccceecCC---CceEEEEEecCCCceeeecCCccccccccceeEEEEeeeeEEEeehhhHHH Confidence 2 456666677777777777765443 2356666533 4677888988899999999999999999999999999654 Q ss_pred HHHHHhhCCCHHHHHHHHHHHHHHHhhcceEEeeccccc-eEEEEecCCCCcccccccccccccCHHHHHHHHHHHHHHH Q lcl|Aclame:pro 137 LEMAGAGRVDLASELNYSSALGLAKFLNGSYLFGVAGLE-NYGLINDPSLSAPITATTPWSGSPAVEAVVNEVVALFQVL 215 (336) Q Consensus 137 l~~A~~~g~~l~~~k~~aAr~a~e~~~n~~~~~Gd~~~g-~~GllN~Pnl~~~~~~~t~w~~~~t~~eI~~Di~~l~~~l 215 (336) ++.+ . .+.+.-....++++...++.-.++|++..+ ..|+++.+.+..... + ...+.+..++||.+++..+ T Consensus 206 l~d~---~-~l~~~v~~~la~a~~~~~d~~~l~G~g~~~~~~Gi~~~~~~~~~~~--~---~~~~~~~~~~~i~~~~~~~ 276 (395) T protein:vir:43 206 LDDA---S-ALQSYIDARARYGLMLVEECQLLYGNGTGANLHGIIPQAQAYAPPS--G---VVVTAEQRIDRIRLAILQA 276 (395) T ss_pred HHhH---H-HHHHHHHHHHHHHHHHHHHHHHHhccCCCCcccccccccccccccc--c---cccccchhHHHHHHHHHhh Confidence 4322 2 577777777788888888888888986433 469998765532211 1 1233456788888888777 Q ss_pred HHHhCCceecccccEEEecHHHHHhccc-CCCCCccHHHHHHHh----CCccEEEEcccccCC---CCceE--EEEEEee Q lcl|Aclame:pro 216 QTQSQGIITQEDVLRMGLPPTAMSDLSK-TNQYGLAAAAKLKDI----FPKLEFVTIPEYDTA---SGRLV--QLWAPRV 285 (336) Q Consensus 216 ~~~s~g~v~~~~p~tL~Lp~~~~~~L~~-~~~~~~Tvl~~l~~n----~pnl~i~~~pel~~a---~G~~~--~~~~~~~ 285 (336) ...- -.+..++|.|..+..|.+ .+..|.-++.-.... +-++.++..+.+... -|... +.++++ T Consensus 277 ~~~~------~~~~~~vmn~~~~~~l~~lkd~~G~~i~~~~~~~~~~~l~G~pVv~~~~~~~~~~~~gd~~~~~~~~~~- 349 (395) T protein:vir:43 277 QLAE------FPASGIVLNPIDWALIELNKDAENRYIIGSPQNGTTPTLWRLPVVETQAITQDEFLTGAFSLGAQIFDR- 349 (395) T ss_pred cccc------CCCcEEEEcHHHHHHHHHhhccCCceeccccccCCCceecceeeEEcCCCCCCcEEEEeccceEEEEEe- Confidence 4321 135679999999888753 333343333211111 112333333333211 12211 222211 Q ss_pred cCCceEEEEcChhhhcccceecCCceEEccccceeeeeeecccceeeeccC Q lcl|Aclame:pro 286 EGKDTATCGFTEKMRAHSIERYSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) Q Consensus 286 ~~~~~~~~~~p~~~r~l~~~~~~~~~~vp~~~~t~Gv~ir~P~av~~~~GI 336 (336) .+ ..+.+... .+.-.. .-.+..-+..| .|+.+++|-||++++-= T Consensus 350 ~~---~~i~~~~~--~~~~f~-~~~~~~r~~~r-~d~~v~~~~a~~~~~~t 393 (395) T protein:vir:43 350 MD---IEVLVSTE--NDKDFE-NNMVTIRAEER-LAFAVYRPEAFVTGSLT 393 (395) T ss_pred cc---eEEEEecc--ccchhh-cCcEEEEEEEe-eccEEecccceEEEEec Confidence 11 11111100 000000 01122223333 45566889999888544 No 55 >protein:vir:80684 Length: 315 # NCBI annotation: gp6 # Family: family:all:966 # MgeID: mge:1884 # MgeName: PA6 # Cross-refs: genbank:acc:YP_001285582;genbank:gi:148727088;genbank:GeneID:5247055 Probab=97.33 E-value=1.3e-05 Score=47.43 Aligned_cols=278 Identities=12% Similarity=-0.018 Sum_probs=141.7 Q ss_pred HHHhhhhhcccccccCcchHHHHHHHhhCceeeeeeccccchhhhcccccCCCcceeeEEEeeeecceeeEEeecccCCc Q lcl|Aclame:pro 31 YAMDAADLSPHLSSTGSSGIPNYLTTYVDPAVIDILVAPMKAAELVGESKKGDWTTLVAAFITAEPTTKVATYGDYSSDG 110 (336) Q Consensus 31 ~a~da~d~~~~l~t~~~~~i~~~l~~~idp~v~~~~~~~~~~~~l~~v~t~g~w~~~t~~~~~~e~~G~a~~ygd~~diP 110 (336) ||..+.. .....+|..+.+ +|++.+...-..+.+..+...+. ....+++....+.|.++|.+..+| T Consensus 1 Ma~~~~~-------~gg~~vP~~~~~----~ii~~l~~~s~i~~l~~~i~~~~---~~~~ip~~~~~~~a~wv~Eg~~~~ 66 (315) T protein:vir:80 1 MADDFLS-------AGKLELPGSMIG----AVRDRAIDSGVLAKLSPEQPTIF---GPVKGAVFSGVPRAKIVGEGEVKP 66 (315) T ss_pred CCCCcCC-------cCceEcchHHHH----HHHHHHHhhchhhhhcceeecCC---CceEEEEEeCCcceEEeeCCcccc Confidence 4433221 222335665553 33444444444444444332221 346788888888999999999999 Q ss_pred eeeeeeeeeeeeEEEEEEEEeeCHHHHHHHHhhC-CCHHHHHHHHHHHHHHHhhcceEEeecccc---ceEEEEecCCCC Q lcl|Aclame:pro 111 DSGANINYPQRQSYFFQTWTRWGERELEMAGAGR-VDLASELNYSSALGLAKFLNGSYLFGVAGL---ENYGLINDPSLS 186 (336) Q Consensus 111 ~~~~~~~~~~~~v~~~~~~~~y~~~El~~A~~~g-~~l~~~k~~aAr~a~e~~~n~~~~~Gd~~~---g~~GllN~Pnl~ 186 (336) ..+...++..-..+.++....+|.+=++.....- -.|.+.-....++++.+.++.-.++|+... +..|+.+. + T Consensus 67 ~s~~~f~~v~l~~~kl~~~~~iS~ell~~s~~~~~~~l~~~i~~~la~ai~~~~d~a~~~G~~~~~~~~~~~~~~~--~- 143 (315) T protein:vir:80 67 SASVDVSAFTAQPIKVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATGKAASAVHTS--L- 143 (315) T ss_pred ccccceeeeEeeeeeEEeeehhhHHHhhcCchhHHHHHHHHHHHHHHHHHHHHHhhheeeccCCCCCccccccccc--c- Confidence 9998888888888888887777765443322222 226677778888899999999999997532 22233321 1 Q ss_pred cccccccccccccCHHHHHHHHHHHHHHHHHHhCCceecccccEEEecHHHHHhcccC-CC-----CCccHHHHHHHhCC Q lcl|Aclame:pro 187 APITATTPWSGSPAVEAVVNEVVALFQVLQTQSQGIITQEDVLRMGLPPTAMSDLSKT-NQ-----YGLAAAAKLKDIFP 260 (336) Q Consensus 187 ~~~~~~t~w~~~~t~~eI~~Di~~l~~~l~~~s~g~v~~~~p~tL~Lp~~~~~~L~~~-~~-----~~~Tvl~~l~~n~p 260 (336) ...+. ........++||.+++..+..... ..++..+|-|..+..|.+- +. .+..++.=+...-| T Consensus 144 ---~~~~~--~~~~~~~~~~d~~~~~~~~~~~~~-----~~~~~~imn~~~~~~L~~l~~~~g~~~~g~~~~~~~~~g~~ 213 (315) T protein:vir:80 144 ---NKTKN--IVDATDSATADLVKAVGLIAGAGL-----QVPNGVALDPAFSFALSTEVYPKGSPLAGQPMYPAAGFAGL 213 (315) T ss_pred ---ccccc--eeeccccchHHHHHHHHHHhhccC-----ccceEEEEcHHHHHHHHHHhhccCCcccccccccccccCCC Confidence 11111 111223467888888877643321 2355699999988877532 11 11111110111101 Q ss_pred ----ccEEE---EcccccCCC-CceEEEEEEeecC-----CceEEEEcChhhhcccc---eecCCceEEccccceeeeee Q lcl|Aclame:pro 261 ----KLEFV---TIPEYDTAS-GRLVQLWAPRVEG-----KDTATCGFTEKMRAHSI---ERYSSYFRQKKSAGTWGAVI 324 (336) Q Consensus 261 ----nl~i~---~~pel~~a~-G~~~~~~~~~~~~-----~~~~~~~~p~~~r~l~~---~~~~~~~~vp~~~~t~Gv~i 324 (336) ++.++ .+|.....+ +.+..+++-.... ..-..+.+...-..... -.+.-...+-|+.|+ |..| T Consensus 214 ~tl~G~PV~~~~~~~~~~~~~~~~~~~~~~GDfs~~~~g~~~~~~i~i~~~~~~~~~~~~~~~~~~v~~r~~~r~-~~~v 292 (315) T protein:vir:80 214 DNWRGLNVGASSTVSGAPEMSPASGVKAIVGDFSRVHWGFQRNFPIELIEYGDPDQTGRDLKGHNEVMVRAEAVL-YVAI 292 (315) T ss_pred ceecceeeEecCcCCcccccccccccEEEEeecccEEEEEecCeeEEEeccccccCcccchhhcCcEEEEEEEEe-ccee Confidence 11222 233322221 2222233211100 00011111100000000 001112455566665 5567 Q ss_pred ecccceeeeccC Q lcl|Aclame:pro 325 FRPFAVAQMIGV 336 (336) Q Consensus 325 r~P~av~~~~GI 336 (336) ++|.||+++.+. T Consensus 293 ~~~~a~~~l~~~ 304 (315) T protein:vir:80 293 ESLDSFAVVKEK 304 (315) T ss_pred ecccceEEEeec Confidence 899999999998 No 56 >protein:vir:10364 Length: 390 # NCBI annotation: head protein; major capsid subunit precursor # Family: family:all:585 # MgeID: mge:183 # MgeName: Xp10 # Cross-refs: genbank:acc:NP_858956;genbank:gi:32128421;genbank:GeneID:2648357 Probab=97.22 E-value=5.2e-05 Score=44.09 Aligned_cols=300 Identities=13% Similarity=0.070 Sum_probs=146.3 Q ss_pred CchH-HHHHHhhhcceeccchhhh----------------------ccchhHHHHHhhhhhcccccccCcc-hH-HHHHH Q lcl|Aclame:pro 1 MRDA-QRIQNLARAGVILPRSVQN----------------------VSTPLTEYAMDAADLSPHLSSTGSS-GI-PNYLT 55 (336) Q Consensus 1 ~~~~-~~~~~l~~~g~~~~~~~~~----------------------~~~~~~~~a~da~d~~~~l~t~~~~-~i-~~~l~ 55 (336) ++.. ++++++++.+...+..... ........+.. ..+....+...+ .+ |.++ T Consensus 54 i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~g~~~~~~~~- 130 (390) T protein:vir:10 54 VQAARQRVAELEGNGAGGDVQHVSVGDLFVASEQFQASAGRWNDRSARATMNIKAAL--NTASTDAAGSAGALTTPNRL- 130 (390) T ss_pred HHHHHHHHHHHHhhcccccccccchhhhhhhhHHHHHHHHhhhhhhhhhhhHHHHHH--HhhhcccccccccccchhHH- Confidence 1110 0111111111111110000 00000001110 111111222222 22 3333 Q ss_pred HhhCceeeeeeccccchhhhcccccCCCcceeeEEEeeeec-ceeeEEeecccCCceeeeeeeeeeeeEEEEEEEEeeCH Q lcl|Aclame:pro 56 TYVDPAVIDILVAPMKAAELVGESKKGDWTTLVAAFITAEP-TTKVATYGDYSSDGDSGANINYPQRQSYFFQTWTRWGE 134 (336) Q Consensus 56 ~~idp~v~~~~~~~~~~~~l~~v~t~g~w~~~t~~~~~~e~-~G~a~~ygd~~diP~~~~~~~~~~~~v~~~~~~~~y~~ 134 (336) +++++.+........++.+.+.+. .++.|+..+. .+.+...+.+..+|-.+..........+.++..+.+|. T Consensus 131 ----~~ii~~~~~~~~l~~~~~~~~~~~---~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~i~~~~~k~~~~~~is~ 203 (390) T protein:vir:10 131 ----PGFITQPDARLTVRDLIGSGRTDS---ALIEYVQETGFVNNAAIVAEGALKPESSLKFAKKTDTTHVIAHTMKATR 203 (390) T ss_pred ----HHHHHHHHhhchhhhhcceeeccC---CceEEEEEecCCcceeeecCCccccccccceeEEEEeeEEEEEeehhhH Confidence 345555566666677777655433 2346666554 46777888888899999888888999999999989887 Q ss_pred HHHHHHHhhCCCHHHHHHHHHHHHHHHhhcceEEeeccc-cceEEEEecCCCCcccccccccccccCHHHHHHHHHHHHH Q lcl|Aclame:pro 135 RELEMAGAGRVDLASELNYSSALGLAKFLNGSYLFGVAG-LENYGLINDPSLSAPITATTPWSGSPAVEAVVNEVVALFQ 213 (336) Q Consensus 135 ~El~~A~~~g~~l~~~k~~aAr~a~e~~~n~~~~~Gd~~-~g~~GllN~Pnl~~~~~~~t~w~~~~t~~eI~~Di~~l~~ 213 (336) + +..-. .++.+.-....++++.+.+++-.++|++. ....|++|.+......+. .+....++++..++. T Consensus 204 e-ll~d~---~~l~~~i~~~l~~~~~~~~~~~il~G~G~~~~p~Gi~~~~~~~~~~~~-------~~~~~~~~~~~~~~~ 272 (390) T protein:vir:10 204 Q-ILSDA---PQLASYMNNRLIRGLKVKEDAEILRGTGANDGLLGLIPQATTYAAPTT-------IAGATRVDQLRLAML 272 (390) T ss_pred H-HHHhH---HHHHHHHHHHHHHHHHHHHHHHHhhcCCCCcccccccccccccccccc-------ccccchHHHHHHHHH Confidence 5 43322 26788888888889999999989999863 447899997655322211 112224667777777 Q ss_pred HHHHHhCCceecccccEEEecHHHHHhccc-CCCCCccHHHHHHHh----CCccEEEEcccccCC---CCceE--EEEEE Q lcl|Aclame:pro 214 VLQTQSQGIITQEDVLRMGLPPTAMSDLSK-TNQYGLAAAAKLKDI----FPKLEFVTIPEYDTA---SGRLV--QLWAP 283 (336) Q Consensus 214 ~l~~~s~g~v~~~~p~tL~Lp~~~~~~L~~-~~~~~~Tvl~~l~~n----~pnl~i~~~pel~~a---~G~~~--~~~~~ 283 (336) .+...- ..+..++|.|+.+..|.+ .+..|.-++.--... .-++.++..+.+... -|.-. +.+++ T Consensus 273 ~l~~~~------~~~~~~v~n~~~~~~L~~lkd~~g~~l~~~~~~~~~~~l~G~pv~~~~~~p~~~~~~gdf~~~~~~~~ 346 (390) T protein:vir:10 273 QASLAE------YPASGIVINPIDWAAIELAKDANNQYLIGNARGTLTPTLWGLPVVATQAMAPGEFLVGAFDLAAQIFD 346 (390) T ss_pred hhcccc------CCCCEEEEcHHHHHHHHHhhcCCCceeecCCcCcCCceecceeeEEcCCCCCCcEEEEeccceEEEEE Confidence 664321 135679999999888864 333343232111111 011223222222100 02111 11111 Q ss_pred eecCCceEEEEcChhhhcccceecCCceEEccccceeeeeeecccceeeeccC Q lcl|Aclame:pro 284 RVEGKDTATCGFTEKMRAHSIERYSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) Q Consensus 284 ~~~~~~~~~~~~p~~~r~l~~~~~~~~~~vp~~~~t~Gv~ir~P~av~~~~GI 336 (336) + .+ ..+.+.. .+.- ...-...+-+..|++| .+++|.||+..+== T Consensus 347 ~-~~---~~i~~~~---~~~~-~~~~~~~~r~~~r~d~-~v~~~~a~~~~~~a 390 (390) T protein:vir:10 347 Q-WD---ARVEIGY---VNDD-FQRNMVTVLAEERLAL-VVYRPEALISGSFA 390 (390) T ss_pred e-cc---eEEEEee---cccc-cccCcEEEEEEEeecc-EEeccccEEEEEeC Confidence 1 11 1111110 0100 1112233345555554 67888887654311 No 57 >protein:vir:3613 Length: 272 # NCBI annotation: MHP # Family: family:all:522 # MgeID: mge:74 # MgeName: TP901-1 # Cross-refs: genbank:acc:NP_112699;genbank:gi:13786567;genbank:GeneID:921035 Probab=96.97 E-value=5.2e-05 Score=44.09 Aligned_cols=257 Identities=12% Similarity=0.048 Sum_probs=133.7 Q ss_pred hhhhhcccccccCcchHHHHHHHhhCceeeeeeccccchhhhcccccC--CCcceeeEEEeeeecceeeEEeecccCCce Q lcl|Aclame:pro 34 DAADLSPHLSSTGSSGIPNYLTTYVDPAVIDILVAPMKAAELVGESKK--GDWTTLVAAFITAEPTTKVATYGDYSSDGD 111 (336) Q Consensus 34 da~d~~~~l~t~~~~~i~~~l~~~idp~v~~~~~~~~~~~~l~~v~t~--g~w~~~t~~~~~~e~~G~a~~ygd~~diP~ 111 (336) .|+ ..++-++.-+|..+..||.-++ -..+....+..+... |.- -.++.++.+...|.+..++++++++. T Consensus 1 ma~----~~T~~~d~iiPev~~~~v~~~~----~~~~~~~~~~~~~~~l~g~~-G~ti~iP~~~~~gda~~~~eg~~i~~ 71 (272) T protein:vir:36 1 MSK----QKTTLADLVNPEVLAPIVSYEL----NKALRFAPLAQVDTTLQGQP-GNTLKFPAFTYIGDAADVAEGGEISL 71 (272) T ss_pred CCC----cceehhhhhchHHHHHHHHHHH----HhhhhhccccccccccccCC-CCEEEEeeeccCccccccCCCCccCh Confidence 111 1355566777899988884443 334444555555432 322 26799999999999999999999999 Q ss_pred eeeeeeeeeeeEEEEEEEEeeCHHHHHHHHhhCCCHHHHHHHHHHHHHHHhhcceEEeeccccceEEEEecCCCCccccc Q lcl|Aclame:pro 112 SGANINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASELNYSSALGLAKFLNGSYLFGVAGLENYGLINDPSLSAPITA 191 (336) Q Consensus 112 ~~~~~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~k~~aAr~a~e~~~n~~~~~Gd~~~g~~GllN~Pnl~~~~~~ 191 (336) .+.+.......+.+.+-++++ .++.+++. +-++..+-...+..++.+.+++..+ ..++ .+.... T Consensus 72 ~~lt~~~~~~~i~~~~k~~~v--tD~~~~~~-~~d~~~~~~~~~a~~~a~~~d~~i~---------~~l~----~~~~~~ 135 (272) T protein:vir:36 72 DKIGTTTKSVTIKKAAKGTEI--TDEAALSG-YGDPIGESNKQLGLSLANKVDDDLL---------SAAK----TTSQTV 135 (272) T ss_pred hhcCCcceeEeeehhhccccc--cHHHHhhc-cchHHHHHHHHHHHHHHHHHHHHHH---------HHhc----cccccc Confidence 999988888888877655554 55555554 4455555555556666666664321 1111 000000 Q ss_pred ccccccccCHHHHHHHHHHHHHHHHHHhCCceecccccEEEecHHHHHhcccCCCC---CccHHHHHH-----HhCCccE Q lcl|Aclame:pro 192 TTPWSGSPAVEAVVNEVVALFQVLQTQSQGIITQEDVLRMGLPPTAMSDLSKTNQY---GLAAAAKLK-----DIFPKLE 263 (336) Q Consensus 192 ~t~w~~~~t~~eI~~Di~~l~~~l~~~s~g~v~~~~p~tL~Lp~~~~~~L~~~~~~---~~Tvl~~l~-----~n~pnl~ 263 (336) + .+.+ +++|.+++..+-.. ...+..+++.|..+..|.+-... +.+..+-+. -.|-+++ T Consensus 136 ~----~~~~----~d~i~~A~~~lgd~------~~~~~~ivv~p~~~~~L~k~~~~~~~~~~~~~~~~~~G~ig~~~G~~ 201 (272) T protein:vir:36 136 S----TKAN----VDGVQAALDIFNDE------DAQAYVLIVNPKDAAKIRKDANAKNIGSEVGANALINGTYADVLGAQ 201 (272) T ss_pred c----cccc----HHHHHHHHHHhhhc------CCCceEEEEcHHHHHHHhcccccccccccccccceeeeccceecCee Confidence 1 1223 34555555544221 12467899999999888642211 001001000 1233566 Q ss_pred EEEcccccCCCCc-eEEEEEEeecCCceEEEEcChhhh--cccceecCCceEEccccceeeeeeecccceeee--ccC Q lcl|Aclame:pro 264 FVTIPEYDTASGR-LVQLWAPRVEGKDTATCGFTEKMR--AHSIERYSSYFRQKKSAGTWGAVIFRPFAVAQM--IGV 336 (336) Q Consensus 264 i~~~pel~~a~G~-~~~~~~~~~~~~~~~~~~~p~~~r--~l~~~~~~~~~~vp~~~~t~Gv~ir~P~av~~~--~GI 336 (336) |+....+-...|. ..+++. +.-+........+ ..-.+.+... .+- .-...|+-+.+|-+++.+ .|+ T Consensus 202 Vv~s~~~p~~~~~~~~~~~~-----~gA~~~~~~~~~~vE~~R~~~~~~d-~i~-~~~~y~~~v~~~~~vv~~t~~g~ 272 (272) T protein:vir:36 202 IVRSKKLAEGSALMFKIVSN-----SPALKLVLKRGVQVETDRDIVTKTT-VIT-ADEHYAAYLYDLTKVVNITFTGV 272 (272) T ss_pred EEEeCCCCCCceeEEEEEec-----ccceeeeecCCcccccccchhhcCc-EEE-EEEEEEEEEEcCccEEEEeecCC Confidence 6654444322221 122221 1111111111111 1111111111 111 224589999999987765 688 No 58 >protein:vir:8102 Length: 543 # NCBI annotation: gp6 # Family: family:all:21 # MgeID: mge:152 # MgeName: Che9c # Cross-refs: genbank:acc:NP_817683;genbank:gi:29566114;genbank:GeneID:1259308 Probab=96.76 E-value=0.00015 Score=41.57 Aligned_cols=305 Identities=10% Similarity=0.019 Sum_probs=145.0 Q ss_pred Cch-----HHHHHHhh----hcceecc--------------chhhhccchhHHHHHhhhhhcccccccCcch--HHHHHH Q lcl|Aclame:pro 1 MRD-----AQRIQNLA----RAGVILP--------------RSVQNVSTPLTEYAMDAADLSPHLSSTGSSG--IPNYLT 55 (336) Q Consensus 1 ~~~-----~~~~~~l~----~~g~~~~--------------~~~~~~~~~~~~~a~da~d~~~~l~t~~~~~--i~~~l~ 55 (336) ++. .+.+..++ +....-. .....+.. ....++...... + .|.+++| ||..+. T Consensus 191 ~~~~~~~~~~~~d~~e~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~l~~-~e~~~~~~~~~~-~-~t~~~gg~lip~~~~ 267 (543) T protein:vir:81 191 VRAAATKIIERFDDEDSTLARQCLATSSPAYLRAWSKMARNPHAAILTE-EEKRAINEVRAM-G-LTKADGGYLVPFQLD 267 (543) T ss_pred HHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhhhHHHHHHHhhHHHHhhh-hhhhhhhhhhhc-c-cccccCcccCchhhh Confidence 000 00000000 0000000 00000110 111122211111 1 2233333 443322 Q ss_pred HhhCceeeeeeccc-cchhhhcccccCCCcceeeEEEeeeecceeeEEeecccCCceeeeeeeeeeeeEEEEEEEEeeCH Q lcl|Aclame:pro 56 TYVDPAVIDILVAP-MKAAELVGESKKGDWTTLVAAFITAEPTTKVATYGDYSSDGDSGANINYPQRQSYFFQTWTRWGE 134 (336) Q Consensus 56 ~~idp~v~~~~~~~-~~~~~l~~v~t~g~w~~~t~~~~~~e~~G~a~~ygd~~diP~~~~~~~~~~~~v~~~~~~~~y~~ 134 (336) ++++.....+ -....+..+.+. ...+.+++....+.+...|.+..+|..+.........++.++..+.+|. T Consensus 268 ----~~ii~~~~~~~~~l~~~~~~~~~----~g~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~i~~~~~k~~~~~~is~ 339 (543) T protein:vir:81 268 ----PTVIITSNGSLNDIRRFARQVVA----TGDVWHGVSSAAVQWSWDAEFEEVSDDSPEFGQPEIPVKKAQGFVPISI 339 (543) T ss_pred ----hHHHHHHHhhhchhhhhcccccC----CcceEEEEecCCcceeecccCccccccccccceeeeeeeeeEeeehhhH Confidence 2322221211 122333333221 1234666777777888889888999999888888888999999999988 Q ss_pred HHHHHHHhhCCCHHHHHHHHHHHHHHHhhcceEEeeccc-cceEEEEecCCCCcccccccccccccCHHHHHHHHHHHHH Q lcl|Aclame:pro 135 RELEMAGAGRVDLASELNYSSALGLAKFLNGSYLFGVAG-LENYGLINDPSLSAPITATTPWSGSPAVEAVVNEVVALFQ 213 (336) Q Consensus 135 ~El~~A~~~g~~l~~~k~~aAr~a~e~~~n~~~~~Gd~~-~g~~GllN~Pnl~~~~~~~t~w~~~~t~~eI~~Di~~l~~ 213 (336) + +..-. .++.+.-......++.+.++.-+++|++. ....|+++++..... +.... ++..-.++|+.+++. T Consensus 340 e-ll~d~---~~~~~~i~~~l~~~~~~~~d~ail~G~Gt~~~p~Gi~~~~~~~~~----~~~~~-~~~~~~~~~~~~~~~ 410 (543) T protein:vir:81 340 E-ALQDE---ANVTETVALLFAEGKDELEAVTLTTGTGQGNQPTGIVTALAGTAA----EIAPV-TAETFALADVYAVYE 410 (543) T ss_pred H-HHhcc---HHHHHHHHHHHHHHHHHHHHHHHhccCCCCcccccchhhcccccc----ccccc-ccccccHHHHHHHHH Confidence 4 43322 48999999999999999999999999963 467899986543211 11111 112335778888877 Q ss_pred HHHHHhCCceecccccEEEecHHHHHhccc-CCCCCccHHHHHHHhCC----ccEEE---EcccccC---CCCceEEEEE Q lcl|Aclame:pro 214 VLQTQSQGIITQEDVLRMGLPPTAMSDLSK-TNQYGLAAAAKLKDIFP----KLEFV---TIPEYDT---ASGRLVQLWA 282 (336) Q Consensus 214 ~l~~~s~g~v~~~~p~tL~Lp~~~~~~L~~-~~~~~~Tvl~~l~~n~p----nl~i~---~~pel~~---a~G~~~~~~~ 282 (336) .+...- .....++|.+..+..|.+ .+..|.=++.-+...-| ++.++ .+|.... +.|....+|. T Consensus 411 ~l~~~~------~~~~~~v~n~~~~~~l~~lkd~~G~~l~~~~~~g~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~i~~g 484 (543) T protein:vir:81 411 QLAARH------RRQGAWLANNLIYNKIRQFDTQGGAGLWTTIGNGEPSQLLGRPVGEAEAMDANWNTSASADNFVLLYG 484 (543) T ss_pred hhhccc------cCCcEEEEcHHHHHHHHHhhcCCCceeccCcCCCCCccccceeeEEeccccccccccccCCcceEEEe Confidence 764321 123479999999988854 23333222221111111 12222 2333321 1233222222 Q ss_pred EeecCCceEEEEc--Chhhhcccc-----eecCCceEEccccceeeeeeecccceeeeccC Q lcl|Aclame:pro 283 PRVEGKDTATCGF--TEKMRAHSI-----ERYSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) Q Consensus 283 ~~~~~~~~~~~~~--p~~~r~l~~-----~~~~~~~~vp~~~~t~Gv~ir~P~av~~~~GI 336 (336) .+. . ..+.. .+.+...+- ....-.+.+-...|++| .+++|-||+.+.-- T Consensus 485 -d~~-~--~~i~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~r~d~-~v~~~~A~~~l~~~ 540 (543) T protein:vir:81 485 -NFQ-N--YVIADRIGMTVEFIPHLFGTNRRPNGSRGWFAYYRMGA-DVVNPNAFRLLNVE 540 (543) T ss_pred -ecc-c--eeEEeecccEEEEeccccccchhhcCceEEEEEEeecc-EeecccceEEEEec Confidence 221 1 11111 111111110 00111233344455555 55779999887766 No 59 >protein:vir:97053 Length: 390 # NCBI annotation: putative head protein # Family: family:all:585 # MgeID: mge:1653 # MgeName: OP1 # Cross-refs: genbank:acc:YP_453565;genbank:gi:84662600;genbank:GeneID:5142468 Probab=96.71 E-value=0.00015 Score=41.49 Aligned_cols=297 Identities=13% Similarity=0.094 Sum_probs=143.2 Q ss_pred CchHH-HHHHhhhcceeccchhhhc-----------------------cchhHHHHHhhhhhcccccccCcc-hHHHHHH Q lcl|Aclame:pro 1 MRDAQ-RIQNLARAGVILPRSVQNV-----------------------STPLTEYAMDAADLSPHLSSTGSS-GIPNYLT 55 (336) Q Consensus 1 ~~~~~-~~~~l~~~g~~~~~~~~~~-----------------------~~~~~~~a~da~d~~~~l~t~~~~-~i~~~l~ 55 (336) +...+ +++++++.+-..+...... ..........+ ....++...+ -+|..+ T Consensus 54 i~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~g~lip~~~- 129 (390) T protein:vir:97 54 VQAARQRVAELEGNGAGGDVQHVSVGDMFVASEQFQASTGRWNDRSARATMNIKAALNT---ASTDAAGSAGALTTPNR- 129 (390) T ss_pred HHHHHHHHHHHHhcccccccccccchhhhhhhHHHHHHHHHhhhhhhhhhhHHHHHHHh---hhcccccccccccchhh- Confidence 11111 0111111111000000000 00000001111 1111111111 133222 Q ss_pred HhhCceeeeeeccccchhhhcccccCCCcceeeEEEeeeec-ceeeEEeecccCCceeeeeeeeeeeeEEEEEEEEeeCH Q lcl|Aclame:pro 56 TYVDPAVIDILVAPMKAAELVGESKKGDWTTLVAAFITAEP-TTKVATYGDYSSDGDSGANINYPQRQSYFFQTWTRWGE 134 (336) Q Consensus 56 ~~idp~v~~~~~~~~~~~~l~~v~t~g~w~~~t~~~~~~e~-~G~a~~ygd~~diP~~~~~~~~~~~~v~~~~~~~~y~~ 134 (336) + +++++.+........++++...+. .+..|+..+. .+.+...+.+..+|-.+......+.....++....++. T Consensus 130 --~-~~ii~~~~~~~~i~~~~~~~~~~~---~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~i~~~~~k~~~~~~is~ 203 (390) T protein:vir:97 130 --L-PGFITPPDARLTVRDLIGSGRTDS---ALIEYVQETGFVNNAAIVAEGALKPESSLKFAKKTDTTHVIAHTMKATR 203 (390) T ss_pred --h-HHHHHHHhhhhhhHhhcceeeccC---CceEEEEEecCCcceeeecCCccccccccceeEEEEeeeeEEEeehhhH Confidence 2 355555556666666666654432 2345666554 46778888888999999888888888999998888887 Q ss_pred HHHHHHHhhCCCHHHHHHHHHHHHHHHhhcceEEeecccc-ceEEEEecCCCCcccccccccccccCHHHHHHHHHHHHH Q lcl|Aclame:pro 135 RELEMAGAGRVDLASELNYSSALGLAKFLNGSYLFGVAGL-ENYGLINDPSLSAPITATTPWSGSPAVEAVVNEVVALFQ 213 (336) Q Consensus 135 ~El~~A~~~g~~l~~~k~~aAr~a~e~~~n~~~~~Gd~~~-g~~GllN~Pnl~~~~~~~t~w~~~~t~~eI~~Di~~l~~ 213 (336) + +-.-. .++.+.-....++++.+.+++-.++|+... ...|++|.+......+ ..+.+..++||..++. T Consensus 204 e-ll~ds---~~l~~~i~~~la~a~~~~~d~a~l~G~g~~~~p~Gi~~~~~~~~~~~-------~~~~~~~~d~~~~~~~ 272 (390) T protein:vir:97 204 Q-ILSDA---PQLASYMNNRLIRGLKVKEDAEILRGTGANDGLLGLIPQATTYAAPT-------TIAGATRVDQLRLAML 272 (390) T ss_pred H-HHHhH---HHHHHHHHHHHHHHHHHHHHHHHhhcCCCCccccceeeccccccccc-------cccccchHHHHHHHHH Confidence 5 43322 257888888888899999999889998643 3789999765432211 1123345677777777 Q ss_pred HHHHHhCCceecccccEEEecHHHHHhccc-CCCCCccHHHHHHH----hCCccEEEEcccccCC---CCce--EEEEEE Q lcl|Aclame:pro 214 VLQTQSQGIITQEDVLRMGLPPTAMSDLSK-TNQYGLAAAAKLKD----IFPKLEFVTIPEYDTA---SGRL--VQLWAP 283 (336) Q Consensus 214 ~l~~~s~g~v~~~~p~tL~Lp~~~~~~L~~-~~~~~~Tvl~~l~~----n~pnl~i~~~pel~~a---~G~~--~~~~~~ 283 (336) .+...- ..+..++|.|+.+..|.+ .+..|.-++.-... .+-++.++..+.+... -|.. .+.++. T Consensus 273 ~~~~~~------~~~~~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~gd~~~~~~~~~ 346 (390) T protein:vir:97 273 QASLAE------YPASGIVINPIDWAAIELAKDANNQYLIGNARGTLTPTLWGLPVVATQAMAPGEFLVGAFDLAAQIFD 346 (390) T ss_pred hhcccc------CCCCEEEEcHHHHHHHHHhhcCCCceeecCccCCCCceecceeeEEcCCCCCCcEEEEeccceEEEEE Confidence 664321 246689999999988864 33334322211000 0112233322222110 0211 122221 Q ss_pred eecCCceEEEEc---ChhhhcccceecCCceEEccccceeeeeeecccceeeeccC Q lcl|Aclame:pro 284 RVEGKDTATCGF---TEKMRAHSIERYSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) Q Consensus 284 ~~~~~~~~~~~~---p~~~r~l~~~~~~~~~~vp~~~~t~Gv~ir~P~av~~~~GI 336 (336) + .+ ..+.+ ...|+ + -....-+..| .|..+++|.||++.+== T Consensus 347 ~-~~---~~i~~~~~~~~f~------~-~~~~~r~~~r-~d~~v~~~~a~v~~~~a 390 (390) T protein:vir:97 347 Q-WD---ARVEIGYVNDDFQ------R-NMVTVLAEER-LALVVYRPEALITGSFA 390 (390) T ss_pred e-cc---eEEEEeecccccc------c-CcEEEEEEEe-eccEEeccccEEEEEeC Confidence 1 11 11111 11111 1 1122233333 45567778887664422 No 60 >protein:vir:96123 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1602 # MgeName: 37 # Cross-refs: genbank:acc:YP_240078;genbank:gi:66395742;genbank:GeneID:5133103 Probab=96.61 E-value=0.00026 Score=40.23 Aligned_cols=256 Identities=10% Similarity=0.024 Sum_probs=132.6 Q ss_pred hhhhhcccccccCcchHHHHHHHhhCceeeeeeccccchhhhcccccC--CCcceeeEEEeeeecceeeEEeecccCCce Q lcl|Aclame:pro 34 DAADLSPHLSSTGSSGIPNYLTTYVDPAVIDILVAPMKAAELVGESKK--GDWTTLVAAFITAEPTTKVATYGDYSSDGD 111 (336) Q Consensus 34 da~d~~~~l~t~~~~~i~~~l~~~idp~v~~~~~~~~~~~~l~~v~t~--g~w~~~t~~~~~~e~~G~a~~ygd~~diP~ 111 (336) .|+ .-++-++--+|..+..|+..++ ...+....+..+.+. |..+ .+++++.+...|.+..|.+++++|. T Consensus 1 ma~----~~T~~~d~i~Pev~s~~v~~~~----~~~~~~~~~~~~~~~l~g~~G-~tv~ip~~~~~g~~~~~~~g~~i~~ 71 (274) T protein:vir:96 1 MAQ----GTTKVSNLIVPEVLAPMMQAEL----DKKLRFAQFADIDSTLVGQPG-DTLTFPAFTYSGDAQVIAEGEKIPV 71 (274) T ss_pred CCc----cccchhhhhhhHHHHHHHHHHH----HhhhhhcccccccccccCCCC-CEEEEEeeccCCCccccCCCCcCch Confidence 111 1133466778888888875443 444455566655442 3223 5799999999999999999999999 Q ss_pred eeeeeeeeeeeEEEEEEEEeeCHHHHHHHHhhCCCHHHHHHHHHHHHHHHhhcceEEeeccccceEEEEecCCCCccccc Q lcl|Aclame:pro 112 SGANINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASELNYSSALGLAKFLNGSYLFGVAGLENYGLINDPSLSAPITA 191 (336) Q Consensus 112 ~~~~~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~k~~aAr~a~e~~~n~~~~~Gd~~~g~~GllN~Pnl~~~~~~ 191 (336) .+.+.......+...+-+++++. +.+++ .+.++..+....+..++.+.+++..+- .++. ++.+. T Consensus 72 ~~it~~~~~~~i~~~~~~~~i~D--~~~~~-~~~d~~~~~~~~~~~~~a~~~d~~i~~---------~l~~----a~~~~ 135 (274) T protein:vir:96 72 DQIGTSKREAKVRKIGKGTELTD--EAVLS-GFGDPQGEAVRQHGLAIANKVDNDVLE---------ALKG----ATLTV 135 (274) T ss_pred hhcccceeEEEEEeeeceeeecH--HHHHh-hcchHHHHHHHHHHHHHHHHHHHHHHH---------HHhc----CCCCc Confidence 99998888888887766666654 44444 455666777777777787777764331 1111 00001 Q ss_pred ccccccccCHHHHHHHHHHHHHHHHHHhCCceecccccEEEecHHHHHhcccCC--------CCCccHH-HHHHHhCCcc Q lcl|Aclame:pro 192 TTPWSGSPAVEAVVNEVVALFQVLQTQSQGIITQEDVLRMGLPPTAMSDLSKTN--------QYGLAAA-AKLKDIFPKL 262 (336) Q Consensus 192 ~t~w~~~~t~~eI~~Di~~l~~~l~~~s~g~v~~~~p~tL~Lp~~~~~~L~~~~--------~~~~Tvl-~~l~~n~pnl 262 (336) + .++.+ ++.|.++...+-.. ...+..|+++|..+..|.+-+ ..|..++ .-.-.+|-++ T Consensus 136 ~---~~~~~----~d~i~dA~~~l~d~------~~~~~~ivv~p~~~~~L~k~~~~~f~~~~~~g~~~~~~g~ig~~~G~ 202 (274) T protein:vir:96 136 E---ADITK----LDGLQTAIDKFNDE------DLEPMVLFVNPLDAGGLRTSASDNFTRPTQLGDNIIVKGAFGEALGA 202 (274) T ss_pred C---ccccc----HHHHHHHHHHhccc------CCCceEEEeCHHHHHHHHhcccccccccccccccceeecccceecCe Confidence 1 11223 34444455544221 125778999999999885421 1111100 0000112234 Q ss_pred EEEEcccccCCCCceEEEEEEeecCCceEEEEcChhhhcccceecCCceEEcccc-ceeeeeeecccceeeeccC Q lcl|Aclame:pro 263 EFVTIPEYDTASGRLVQLWAPRVEGKDTATCGFTEKMRAHSIERYSSYFRQKKSA-GTWGAVIFRPFAVAQMIGV 336 (336) Q Consensus 263 ~i~~~pel~~a~G~~~~~~~~~~~~~~~~~~~~p~~~r~l~~~~~~~~~~vp~~~-~t~Gv~ir~P~av~~~~GI 336 (336) +|.....+- -.+.+++- +.-+......+... ..+...+...--... ...|+-+.+|-+++.+.== T Consensus 203 ~Vi~s~~~p---~~t~~l~~-----~gA~~~~~~~~~~v-E~~Rd~~~~~d~i~~~~~yg~~~~~~~~vv~~t~~ 268 (274) T protein:vir:96 203 VIVRSNKLN---KGEALLAK-----KGAVKLITKRDFFL-EKDRDASRKSTALYSDKHYVAYLYDESKVVKITKG 268 (274) T ss_pred eEEEcCCCC---cceEEEEe-----CcceeeeecCCccc-ccccchhhcccEEEEeeEEEEEEEcCccEEEEEcC Confidence 444333221 11222221 11111111111110 011111111111111 2578999999877766544 No 61 >protein:vir:2344 Length: 397 # NCBI annotation: gp14 # Family: family:all:507 # MgeID: mge:51 # MgeName: Bxb1 # Cross-refs: genbank:acc:NP_075281;genbank:gi:12657868;genbank:GeneID:920118 Probab=96.60 E-value=0.00028 Score=40.03 Aligned_cols=278 Identities=12% Similarity=0.041 Sum_probs=140.3 Q ss_pred cceeccchhhhccchhHHHHHhhhhhcccccccCcchHHHHHHHhhCceeeeeeccccchhhhcccccCCCcceeeEEEe Q lcl|Aclame:pro 13 AGVILPRSVQNVSTPLTEYAMDAADLSPHLSSTGSSGIPNYLTTYVDPAVIDILVAPMKAAELVGESKKGDWTTLVAAFI 92 (336) Q Consensus 13 ~g~~~~~~~~~~~~~~~~~a~da~d~~~~l~t~~~~~i~~~l~~~idp~v~~~~~~~~~~~~l~~v~t~g~w~~~t~~~~ 92 (336) .|+ +.+.+.++.-.. +.....-+|.+....| +.+.......+++.+.+.+. .+..|+ T Consensus 1 ~g~---------~~e~~~~~~~~t------~~~~g~l~~~~~~~ii-----~~l~~~s~i~~l~~~~~~~~---~~~~ip 57 (397) T protein:vir:23 1 MGF---------SADHSQIAQTKD------TMFTGYLDPVQAKDYF-----AEAEKTSIVQRVAQKIPMGA---TGIVIP 57 (397) T ss_pred CCc---------CHHHHHHhhccC------CCCccccchhHHHHHH-----HHHHhccchhhhcceeeccC---CceEEE Confidence 222 112222221111 0111112344444333 33333444455555443332 236788 Q ss_pred eeecceeeEEeecccCCceeeeeeeeeeeeEEEEEEEEeeCHHHHHHHHhhCCCHHHHHHHHHHHHHHHhhcceEEeecc Q lcl|Aclame:pro 93 TAEPTTKVATYGDYSSDGDSGANINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASELNYSSALGLAKFLNGSYLFGVA 172 (336) Q Consensus 93 ~~e~~G~a~~ygd~~diP~~~~~~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~k~~aAr~a~e~~~n~~~~~Gd~ 172 (336) +.+....+..++.+..+|..+.........++.++..+.++.+=++.+ ..++.+.-+...++++.+.+++-+++|+. T Consensus 58 ~~~~~~~a~wv~Eg~~~~~s~~~f~~v~l~~~k~~~~v~iS~ell~ds---~~~l~~~i~~~l~~aia~~~d~a~l~G~g 134 (397) T protein:vir:23 58 HWTGDVSAQWIGEGDMKPITKGNMTKRDVHPAKIATIFVASAETVRAN---PANYLGTMRTKVATAIAMAFDNAALHGTN 134 (397) T ss_pred EEcCCcceEEecCCccccccccceeEEEEeeEEEEEeehhhHHHHhcc---hHHHHHHHHHHHHHHHHHHHHHHHhhccc Confidence 888888889999999999999888888888999999988887655543 37789999999999999999999999986 Q ss_pred c-cceEEEEecCCCCcccccccccccccCHHHHHHHHHHHHHHHHHHhCCceecccccEEEecHHHHHhcccC-CCCCcc Q lcl|Aclame:pro 173 G-LENYGLINDPSLSAPITATTPWSGSPAVEAVVNEVVALFQVLQTQSQGIITQEDVLRMGLPPTAMSDLSKT-NQYGLA 250 (336) Q Consensus 173 ~-~g~~GllN~Pnl~~~~~~~t~w~~~~t~~eI~~Di~~l~~~l~~~s~g~v~~~~p~tL~Lp~~~~~~L~~~-~~~~~T 250 (336) . .++-|+.+..+.. ..+.... ..+|+..++..+...- ..+..++|.++.+..|.+- +..|.- T Consensus 135 t~~~~~~~~~~~~~~------~~~~~~~----~~~~~~~~~~~l~~~~------~~~a~~vmn~~~~~~L~~lkd~~G~~ 198 (397) T protein:vir:23 135 APSAFQGYLDQSNKT------QSISPNA----YQGLGVSGLTKLVTDG------KKWTHTLLDDTVEPVLNGSVDANGRP 198 (397) T ss_pred CCcccccccccccce------eeecccc----hhHHHHHHHHhhhhcc------cCCCEEEEcHHHHHHHHHhhccCCce Confidence 4 3334444422221 1111122 2334444444443221 1356799999998888642 333433 Q ss_pred HHHH-HHHhCC----ccEEEEcccccC---CCCceE-------EEEEEeecCCceEEEEcChhhh-cccceec------- Q lcl|Aclame:pro 251 AAAK-LKDIFP----KLEFVTIPEYDT---ASGRLV-------QLWAPRVEGKDTATCGFTEKMR-AHSIERY------- 307 (336) Q Consensus 251 vl~~-l~~n~p----nl~i~~~pel~~---a~G~~~-------~~~~~~~~~~~~~~~~~p~~~r-~l~~~~~------- 307 (336) ++.= .....| .-++...|-.-. ..|... .+++..+.+ ..+.+..... ....... T Consensus 199 i~~~~~~~~~~~~~~~~tl~G~Pv~~s~~~~~g~~~~~~gDfs~~~i~~~~~---i~i~~~~e~~~~~~~~~~~~~~~lf 275 (397) T protein:vir:23 199 LFVESTYESLTTPFREGRILGRPTILSDHVAEGDVVGYAGDFSQIIWGQVGG---LSFDVTDQATLNLGSQESPNFVSLW 275 (397) T ss_pred eecccccccccccccCceeeeeeEEEeCCCCCCceEEEEeecceEEEEEEec---eEEEEeeeeeeeeccccccceeeee Confidence 3211 011111 112333332211 122221 122211111 1111111000 0000000 Q ss_pred -CCceEEccccceeeeeeecccceeeeccC Q lcl|Aclame:pro 308 -SSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) Q Consensus 308 -~~~~~vp~~~~t~Gv~ir~P~av~~~~GI 336 (336) .-....-+..|+ |+.+++|.+|+++.+- T Consensus 276 ~~d~v~~ra~~r~-d~~v~~~~a~~~~~~~ 304 (397) T protein:vir:23 276 QHNLVAVRVEAEY-GLLINDVNAFVKLTFD 304 (397) T ss_pred eccceeEEEEeee-ccceecccceEEEeec Confidence 011223334444 4588899999999886 No 62 >protein:vir:96833 Length: 275 # NCBI annotation: ORF015 # Family: family:all:522 # MgeID: mge:1642 # MgeName: EW # Cross-refs: genbank:acc:YP_240157;genbank:gi:66395822;genbank:GeneID:5133174 Probab=96.55 E-value=9.4e-05 Score=42.65 Aligned_cols=257 Identities=11% Similarity=0.029 Sum_probs=131.5 Q ss_pred HHHhhhhhcccccccCcchHHHHHHHhhCceeeeeeccccchhhhcccccC--CCcceeeEEEeeeecceeeEEeecccC Q lcl|Aclame:pro 31 YAMDAADLSPHLSSTGSSGIPNYLTTYVDPAVIDILVAPMKAAELVGESKK--GDWTTLVAAFITAEPTTKVATYGDYSS 108 (336) Q Consensus 31 ~a~da~d~~~~l~t~~~~~i~~~l~~~idp~v~~~~~~~~~~~~l~~v~t~--g~w~~~t~~~~~~e~~G~a~~ygd~~d 108 (336) |||-+ .+.-++--+|+.+..||..++ ...+....|..+.+. |.-+ .++.++.++..|.+..|.++++ T Consensus 1 ~~~~~------~T~l~d~i~PEv~~~~v~~~~----~~~~~~~~~~~~~~~l~g~~G-~tv~iP~~~~ig~a~~~~~g~~ 69 (275) T protein:vir:96 1 MALEN------MTKLANMVNPEVLAPMMQAEL----DKKLKFAQFADIDNTLVGQPG-NTITFPAFVYSGDAKVVPEGEE 69 (275) T ss_pred CCCcc------cchhhhhhchHHHHHHHHHHH----HHhhhhcccceecccccCCCC-CEEEeeeeccCCccccccCCCC Confidence 55422 355567777999998885444 334444555544332 3222 6799999999999999999999 Q ss_pred CceeeeeeeeeeeeEEEEEEEEeeCHHHHHHHHhhCCCHHHHHHHHHHHHHHHhhcceEEeeccccceEEEEecCCCCcc Q lcl|Aclame:pro 109 DGDSGANINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASELNYSSALGLAKFLNGSYLFGVAGLENYGLINDPSLSAP 188 (336) Q Consensus 109 iP~~~~~~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~k~~aAr~a~e~~~n~~~~~Gd~~~g~~GllN~Pnl~~~ 188 (336) ++..+.........+...+-+++++. +.+.+.. -++..+-...+..++.+.+++..+ ..++.- . T Consensus 70 i~~~~lt~~~~~~~i~~~~~~~~i~D--~~~~~~~-~d~~~~~~~~~a~~~a~~~d~~ll---------~~l~~a----~ 133 (275) T protein:vir:96 70 IPIDLIETKKRQATIRKIGKGTVLTD--EALLSGY-GDPKGEAVRQHGLAIANKVDNDVL---------EALQGA----T 133 (275) T ss_pred cchhhcccceeeEEeehhcccccccH--HHHHhhc-cchHHHHHHHHHHHHHHHHHHHHH---------HHHhcc----c Confidence 99999988888888887766655554 4444443 345555555566666666665432 111110 0 Q ss_pred cccccccccccCHHHHHHHHHHHHHHHHHHhCCceecccccEEEecHHHHHhcccCC--------CCCccHHH-HHHHhC Q lcl|Aclame:pro 189 ITATTPWSGSPAVEAVVNEVVALFQVLQTQSQGIITQEDVLRMGLPPTAMSDLSKTN--------QYGLAAAA-KLKDIF 259 (336) Q Consensus 189 ~~~~t~w~~~~t~~eI~~Di~~l~~~l~~~s~g~v~~~~p~tL~Lp~~~~~~L~~~~--------~~~~Tvl~-~l~~n~ 259 (336) .+.++ +..+.+ .|.+++..+-.. ...+..|+++|..+..|.+-. ..+..++. =.-..| T Consensus 134 ~~~~~---~~~~~d----~i~dA~~~lgd~------~~~~~~ivv~p~~~~~L~k~~~~~f~~~~~~g~~~~~~G~ig~~ 200 (275) T protein:vir:96 134 LKVEA---DITKLA----GLQTAIDKFNDE------DLEPMVLFVNPLDAGKLRASATDNFTRATLLGDNVIVKGAFGEA 200 (275) T ss_pred ccccc---cccCHH----HHHHHHHHhccc------cCCccEEEeCHHHHHHHHhcccccccccccccccceecccccee Confidence 11111 122333 444454444221 135788999999998884421 11111100 000012 Q ss_pred CccEEEEcccccCCCCceEEEEEEeecCCceEEEEcChhhhcccceecCCceEEc-cccceeeeeeecccceeee----- Q lcl|Aclame:pro 260 PKLEFVTIPEYDTASGRLVQLWAPRVEGKDTATCGFTEKMRAHSIERYSSYFRQK-KSAGTWGAVIFRPFAVAQM----- 333 (336) Q Consensus 260 pnl~i~~~pel~~a~G~~~~~~~~~~~~~~~~~~~~p~~~r~l~~~~~~~~~~vp-~~~~t~Gv~ir~P~av~~~----- 333 (336) -+++|+....+. -++.+++- +.-+......+... ......++..-- ..-..+|+-+.+|-+++.+ T Consensus 201 ~G~~Vi~s~~~p---~~t~~i~~-----~gA~~~~~~~~~~v-E~~Rd~~~~~d~i~~~~~y~~~~~~~~~vv~~t~~~~ 271 (275) T protein:vir:96 201 LGAIIVRSNKIK---EGEAILAK-----RGAVKLITKRDFFL-ETERHASHKSTALFSDKHYVAYLYDESKVVKITKSAS 271 (275) T ss_pred cCeeEEEeCCCC---cceEEEEe-----ccceeeeecCCccc-ccccchhhcCcEEEEeEEEEEEEEcCccEEEEEeccc Confidence 245554433322 12223321 11111111111110 111111111111 1223568889999888775 Q ss_pred -ccC Q lcl|Aclame:pro 334 -IGV 336 (336) Q Consensus 334 -~GI 336 (336) +|+ T Consensus 272 ~~~~ 275 (275) T protein:vir:96 272 GLGV 275 (275) T ss_pred ccCC Confidence 344 No 63 >protein:vir:9410 Length: 415 # NCBI annotation: head protein # Family: family:all:21 # MgeID: mge:167 # MgeName: phi 13 # Cross-refs: genbank:acc:NP_803388;genbank:gi:29028700;genbank:GeneID:1258136 Probab=96.54 E-value=0.00037 Score=39.39 Aligned_cols=304 Identities=8% Similarity=0.041 Sum_probs=137.1 Q ss_pred Cch-HHHHHH-------------------------hhhcceeccchhhhccchhHHH---HHhhhhhccccc--ccCcch Q lcl|Aclame:pro 1 MRD-AQRIQN-------------------------LARAGVILPRSVQNVSTPLTEY---AMDAADLSPHLS--STGSSG 49 (336) Q Consensus 1 ~~~-~~~~~~-------------------------l~~~g~~~~~~~~~~~~~~~~~---a~da~d~~~~l~--t~~~~~ 49 (336) +++ ..++.+ ....+..+... .....+++.+ ............ ...... T Consensus 54 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~e~~~~~~~~~~~~~~~~~~~~~~~g~~~ 132 (415) T protein:vir:94 54 KQEELDKLKEKDGTSENNQQSVEVNEASTYRNQANINDLGISIQNT-KVTSQEVRDFTEYLETRNDIQGGSLKTDSGFVV 132 (415) T ss_pred HHHHHHHHHHHHHhhhhccccccccchhhHHHHHHHHHHHhhhhhh-hhhHHHHHHHHHHhhhhhhhhhhcccccccccc Confidence 000 000000 00000000000 0000011110 000001111111 122233 Q ss_pred HHHHHHHhhCceeeeeeccccchhhhcccccCCCcceeeEEEeeeecceeeEEeecccCCcee-eeeeeeeeeeEEEEEE Q lcl|Aclame:pro 50 IPNYLTTYVDPAVIDILVAPMKAAELVGESKKGDWTTLVAAFITAEPTTKVATYGDYSSDGDS-GANINYPQRQSYFFQT 128 (336) Q Consensus 50 i~~~l~~~idp~v~~~~~~~~~~~~l~~v~t~g~w~~~t~~~~~~e~~G~a~~ygd~~diP~~-~~~~~~~~~~v~~~~~ 128 (336) ||..+ .+++++.+........++++..... ....+.+......+.+...+.+.++|-. ..........++.++. T Consensus 133 iP~~~----~~~ii~~~~~~~~l~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~v~Eg~~~~~~~~~~~~~i~~~~~k~~~ 207 (415) T protein:vir:94 133 IPEEI----VTDILKLKEVEFNLDKYVTVKRVTN-GSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQLAYDINTHRG 207 (415) T ss_pred CcHHH----HHHHHHHHHhhhhhhhhcceeeccC-CceeEEEEeecCCccceeccccccccccccccceeeEeeheeeee Confidence 55333 3566777677777777766644321 1122333333445566777888888854 4567888888888888 Q ss_pred EEeeCHHHHHHHHhhCCCHHHHHHHHHHHHHHHhhcceEEeeccccceEEEEec-CCCCcccccccccccccCHHHHHHH Q lcl|Aclame:pro 129 WTRWGERELEMAGAGRVDLASELNYSSALGLAKFLNGSYLFGVAGLENYGLIND-PSLSAPITATTPWSGSPAVEAVVNE 207 (336) Q Consensus 129 ~~~y~~~El~~A~~~g~~l~~~k~~aAr~a~e~~~n~~~~~Gd~~~g~~GllN~-Pnl~~~~~~~t~w~~~~t~~eI~~D 207 (336) .+.+|.+=+. ....++.+.-....++++.+.+|+-++.|+......+.... .... .+.. .+ ...-++| T Consensus 208 ~~~is~ell~---ds~~~~~~~i~~~l~~~~~~~~~~~il~g~g~g~~~~~~~~~~~~~--~~~~----~~--~~~~~~~ 276 (415) T protein:vir:94 208 YFRISREAIE---DAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGFEKEG--KKLE----VK--KAKSLDD 276 (415) T ss_pred echhhHHHHh---hchHHHHHHHHHHHHHHHHHHHHHHHhhccccCccccccccccccc--cccc----cc--cccchHH Confidence 8888865333 33467888888888888888888888887754322222221 1111 1111 01 1123667 Q ss_pred HHHHHHHHHHHhCCceecccccEEEecHHHHHhccc-CCCCCccHHH-HHHHh----CCccEEEEccccc-CCCCceEEE Q lcl|Aclame:pro 208 VVALFQVLQTQSQGIITQEDVLRMGLPPTAMSDLSK-TNQYGLAAAA-KLKDI----FPKLEFVTIPEYD-TASGRLVQL 280 (336) Q Consensus 208 i~~l~~~l~~~s~g~v~~~~p~tL~Lp~~~~~~L~~-~~~~~~Tvl~-~l~~n----~pnl~i~~~pel~-~a~G~~~~~ 280 (336) |.+++..+... + -.+..++|.++.+..|.+ .+..|.-++. -+... +-++.++..+.+- ++.|....+ T Consensus 277 i~~~~~~~~~~--~----~~~~~~vmn~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~i~ 350 (415) T protein:vir:94 277 IKDAINLNVKP--N----YEHNVAIVSQTMFAKLDKMKDKLGNYLIQPDVKEKTQQRLLGAKIEILPDEVLGQKGNNTLI 350 (415) T ss_pred HHHHHHhhhhh--c----cCCCEEEEcHHHHHHHHHhhccCCCeeeccCcCCCCCceecceeeEEecccccCCCCccEEE Confidence 77777766432 1 136789999999998864 3433433321 01110 1112333333332 223333333 Q ss_pred EEEeec-----CCceEEEEcChhhhcccceecCCceEEccccceeeeeeecccceeeeccC Q lcl|Aclame:pro 281 WAPRVE-----GKDTATCGFTEKMRAHSIERYSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) Q Consensus 281 ~~~~~~-----~~~~~~~~~p~~~r~l~~~~~~~~~~vp~~~~t~Gv~ir~P~av~~~~GI 336 (336) +.+-.+ ...-..+.. .+|.. .....-...|. |+.+.+|.||+++.-- T Consensus 351 ~gd~~~~~~~~~~~~~~v~~-~~~~~-------~~~~~r~~~r~-d~~~~~~~a~~~~~~~ 402 (415) T protein:vir:94 351 IGNLKDAIVLFDRSQYQASW-TDYMH-------FGECLMIAVRQ-DCRILDYKSAIVIEYD 402 (415) T ss_pred EEehhccEEEEeecceEEEE-ecccc-------CceEEEEEEEe-ccEEeccccEEEEEEe Confidence 322100 000011111 11111 11111234444 5666779999988644 No 64 >protein:vir:1328 Length: 392 # NCBI annotation: gp36 # Family: family:all:21 # MgeID: mge:28 # MgeName: phi-C31 # Cross-refs: genbank:acc:NP_047927;swissprot:trembl:q9zwv6;genbank:gi:9631145;uniprot:Q9ZWV6;genbank:GeneID:2715889 Probab=96.52 E-value=0.00049 Score=38.75 Aligned_cols=303 Identities=10% Similarity=0.028 Sum_probs=140.3 Q ss_pred CchHHHHHHhhh-----cceeccchhh-------hc----cchhHHHHHhhhhhcccccccCcc-hHH-HHHHHhhCcee Q lcl|Aclame:pro 1 MRDAQRIQNLAR-----AGVILPRSVQ-------NV----STPLTEYAMDAADLSPHLSSTGSS-GIP-NYLTTYVDPAV 62 (336) Q Consensus 1 ~~~~~~~~~l~~-----~g~~~~~~~~-------~~----~~~~~~~a~da~d~~~~l~t~~~~-~i~-~~l~~~idp~v 62 (336) ....+....+.+ -+-.-+.... ++ ..+.+.. ..+.....+ ++..++ .+| ......|+ ++ T Consensus 58 ~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~g~~~~~~~~-~~~~~~~~~-t~~~~g~~~~~~~~~~~i~-~~ 134 (392) T protein:vir:13 58 IDAIKATDAVTSLLSGLQGSGSGAQRSADHDDDAVLRAGNLGEARSF-EFAPEKRDG-TKAGNPNVLSRTLYGQLIA-QA 134 (392) T ss_pred HHHHHHHHHHHHHhcccCCcccchhhhhhHHHHHHHhccchhhhHHH-Hhhhhhhcc-cccCCCccccccchHHHHH-HH Confidence 000000001000 0000000000 00 0000111 011111111 122222 122 22222221 11 Q ss_pred eeeeccccc-hhhhcccccCCCcceeeEEEeeeecceeeEEeecccCCceeeeeeeeeeeeEEEEEEEEeeCHHHHHHHH Q lcl|Aclame:pro 63 IDILVAPMK-AAELVGESKKGDWTTLVAAFITAEPTTKVATYGDYSSDGDSGANINYPQRQSYFFQTWTRWGERELEMAG 141 (336) Q Consensus 63 ~~~~~~~~~-~~~l~~v~t~g~w~~~t~~~~~~e~~G~a~~ygd~~diP~~~~~~~~~~~~v~~~~~~~~y~~~El~~A~ 141 (336) ...+..++ ....++... ...+.+++.+..+.+..++.+..+|..+.......-.++.++..+.+|.+=|+. T Consensus 135 -~~~~~~l~~~~~~~~~~~-----~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~f~~v~~~~~k~~~~~~iS~ell~d-- 206 (392) T protein:vir:13 135 -VERSAIMRGGASTFTTSD-----ANPMDFTVITGRATAGIVGETAEIPESYPATTQRSMGGFKYGFASVVSYEFATD-- 206 (392) T ss_pred -HhhhhhhhhcceeeecCC-----CceeEEEEEcCCcceeeecccccccccccceeeEEeeeeeEEeeehhHHHHHhc-- Confidence 01111111 122222211 134567777888888888999999999988888888888888888887665553 Q ss_pred hhCCCHHHHHHHHHHHHHHHhhcceEEeeccccceEEEEecCCCCcccccccccccccCHHHHHHHHHHHHHHHHHHhCC Q lcl|Aclame:pro 142 AGRVDLASELNYSSALGLAKFLNGSYLFGVAGLENYGLINDPSLSAPITATTPWSGSPAVEAVVNEVVALFQVLQTQSQG 221 (336) Q Consensus 142 ~~g~~l~~~k~~aAr~a~e~~~n~~~~~Gd~~~g~~GllN~Pnl~~~~~~~t~w~~~~t~~eI~~Di~~l~~~l~~~s~g 221 (336) ...++.+.-....+.++.+.++.-+++|++...-.|+++++.... ....+...+ .-.++||.+++..|...-. T Consensus 207 -s~~~l~~~i~~~l~~~i~~~~d~~~l~G~Gt~~p~Gil~~~~~~~---~~~~~~~~~--~~~~d~l~~~~~~l~~~~~- 279 (392) T protein:vir:13 207 -QVLDLVGFLVSDAGPAIGDAMGRHFLTGTGTGQPRGILTDATGAN---AAFGEADAD--SKVSDALIDLFHEVPSAYR- 279 (392) T ss_pred -chHHHHHHHHHHHHHHHHHHHHHHHhcccCCcccccccccccccc---ccccccccc--cccHHHHHHHHHhhhhhhh- Confidence 355788888888888899999999999998878889999754321 111121111 1235666677776643321 Q ss_pred ceecccccEEEecHHHHHhccc-CCCCCccHHH-HHHHhCCccEEEEcccccCC---C-----CceEEEEEEeecCCceE Q lcl|Aclame:pro 222 IITQEDVLRMGLPPTAMSDLSK-TNQYGLAAAA-KLKDIFPKLEFVTIPEYDTA---S-----GRLVQLWAPRVEGKDTA 291 (336) Q Consensus 222 ~v~~~~p~tL~Lp~~~~~~L~~-~~~~~~Tvl~-~l~~n~pnl~i~~~pel~~a---~-----G~~~~~~~~~~~~~~~~ 291 (336) ..-.++|.++.+..|.. .+..|.-++. -+...-| -++-..|-.... . |.-.+.++..+ +.-+. T Consensus 280 -----~~a~~v~n~~~~~~l~~lkd~~G~~l~~~~~~~g~~-~~l~G~Pv~~~~~~~~~~i~~Gdf~~~~i~~~-~~~~i 352 (392) T protein:vir:13 280 -----KNAKFVVNDLRAAQMRKLKDANGQYLWQSALTVGAP-DTFNGKVVETDDGMPADKVLFADLSKYRVRFA-GSLRV 352 (392) T ss_pred -----cCCEEEEcHHHHHHHHHhhccCCceeecCCcCCCCC-ceecceeeEEcCCCCCCcEEEeeccceeEEee-cceEE Confidence 23468999998888753 3333432221 0000001 012222222111 1 21111111111 11111 Q ss_pred EEEcChhhhcccceecCCceEEccccceeeeeeecccceeeeccC Q lcl|Aclame:pro 292 TCGFTEKMRAHSIERYSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) Q Consensus 292 ~~~~p~~~r~l~~~~~~~~~~vp~~~~t~Gv~ir~P~av~~~~GI 336 (336) .. .... ....-...+-+..|.+|. +++|-||+.+..- T Consensus 353 ~~-~~~~------~~~~~~~~~r~~~r~d~~-~~~~~A~~~~~~~ 389 (392) T protein:vir:13 353 DR-SVDA------KFSTDQIVYRFLQRADGL-LVDARGAKVLTVT 389 (392) T ss_pred Ee-eccc------cccCCcEEEEEEEEeccE-EecccceEEEEee Confidence 00 1111 111223455566777655 7789998866655 No 65 >protein:vir:93616 Length: 645 # NCBI annotation: putative major head protein/prohead protease # Family: family:all:21 # MgeID: mge:157 # MgeName: phi 4795 # Cross-refs: genbank:acc:YP_001449293;genbank:gi:157166041;goa:Q6H9U8;interpro:IPR006433;uniprot:Q6H9U8;genbank:GeneID:5580438 Probab=96.46 E-value=0.00043 Score=39.04 Aligned_cols=300 Identities=10% Similarity=0.015 Sum_probs=137.6 Q ss_pred CchHH---HHHHhhhcceeccchhh-----hccchhHHHHHhhhhhcccccccCc--c--hHHHHHHHhhCceeeeeecc Q lcl|Aclame:pro 1 MRDAQ---RIQNLARAGVILPRSVQ-----NVSTPLTEYAMDAADLSPHLSSTGS--S--GIPNYLTTYVDPAVIDILVA 68 (336) Q Consensus 1 ~~~~~---~~~~l~~~g~~~~~~~~-----~~~~~~~~~a~da~d~~~~l~t~~~--~--~i~~~l~~~idp~v~~~~~~ 68 (336) ++... .++.|...+-..-.+.+ +...........++-. .++++.+. + .+|+.+. .+|++.+.+ T Consensus 290 ~kg~~f~~~~~al~~~~g~~~~a~e~a~~~~~~~~~~~~~~~~a~~-~~~~~~~~~~Gg~~vp~~~~----~~ii~~l~~ 364 (645) T protein:vir:93 290 DKGIGFARFAKSLAAAKGVRSEALEVARRQYPDDSRLHHVLKSAVG-AGTTTDPQWAGSLSEYQEYA----QDFIDYLRP 364 (645) T ss_pred hhhhhHHHHHHHHHhcccchhHHHHHHHhhcccchhhhhhhhhhhh-ccccccccccCCccCchhhH----HHHHHhhhh Confidence 11111 12222221111101000 0000011111111111 12222211 2 2344433 234444444 Q ss_pred ccchhhhcccccCCCcc-eeeEEEeeeecceeeEEeecccCCceeeeeeeeeeeeEEEEEEEEeeCHHHHHHHHhhCCCH Q lcl|Aclame:pro 69 PMKAAELVGESKKGDWT-TLVAAFITAEPTTKVATYGDYSSDGDSGANINYPQRQSYFFQTWTRWGERELEMAGAGRVDL 147 (336) Q Consensus 69 ~~~~~~l~~v~t~g~w~-~~t~~~~~~e~~G~a~~ygd~~diP~~~~~~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l 147 (336) ......+-.....+... ...+..+.....+.+.+.|...+.|..+......+-+.+.++....+|.+=|+.+ ..++ T Consensus 365 ~svv~~l~~~~~~~~~~~~~~~~ip~~t~~~~a~wv~Eg~~~~~s~~~f~~v~l~~~kla~~~~iS~ell~ds---~~~~ 441 (645) T protein:vir:93 365 QTIIGRFGQGGIPALRQVPFNIRVHAQVSGGAAGWVGEGKTKPLTKFDFESITFSHAKVSAIAVLTEELIRFS---SPAA 441 (645) T ss_pred hhhHHhhccccccccccccCceeeeeeecCcceEEeccCccccccccceeEEEEeeEEEEEeehhHHHHHhhc---hHHH Confidence 43333433221111111 1234555656667778888888999999888888888888888888775544433 4567 Q ss_pred HHHHHHHHHHHHHHhhcceEEeecccc----ceEEEEecCCCCcccccccccccccCHHHHHHHHHHHHHHHHHHhCCce Q lcl|Aclame:pro 148 ASELNYSSALGLAKFLNGSYLFGVAGL----ENYGLINDPSLSAPITATTPWSGSPAVEAVVNEVVALFQVLQTQSQGII 223 (336) Q Consensus 148 ~~~k~~aAr~a~e~~~n~~~~~Gd~~~----g~~GllN~Pnl~~~~~~~t~w~~~~t~~eI~~Di~~l~~~l~~~s~g~v 223 (336) .+--......++.+.++.-++.|+..- .-.|++|. +. .+. +......|+..++..+..... T Consensus 442 ~~~i~~~l~~aia~~~d~a~l~g~g~~~~~~~p~gi~~~--~~-----~~~-----~~~~~~~d~~~~~~~~~~a~~--- 506 (645) T protein:vir:93 442 DALVRNALAEAVVARLDTDFVDPKKAAVADVSPASITHD--VK-----GTA-----SSGNPDADAEAAFGQFVAANL--- 506 (645) T ss_pred HHHHHHHHHHHHHHHHHHHhhcCCCcccCCccccceecc--cc-----ccc-----cccchHHHHHHHHHHHHhcCC--- Confidence 777777788888888888888776432 12344441 10 110 111234678778777754431 Q ss_pred ecccccEEEecHHHHHhcccC-CCCCccHHHHHHHhCCcc-----EEEEcccccCC-------CCceEEEEEEeecCCce Q lcl|Aclame:pro 224 TQEDVLRMGLPPTAMSDLSKT-NQYGLAAAAKLKDIFPKL-----EFVTIPEYDTA-------SGRLVQLWAPRVEGKDT 290 (336) Q Consensus 224 ~~~~p~tL~Lp~~~~~~L~~~-~~~~~Tvl~~l~~n~pnl-----~i~~~pel~~a-------~G~~~~~~~~~~~~~~~ 290 (336) .. .--..+|.|..+..|.+- +..|.-+ ||++ ++-..|-.... -|.-..+++-.. ++ T Consensus 507 ~~-~~a~~vmn~~~~~~L~~lkd~~G~~~-------~~~~~~~~~tL~G~PV~~s~~vp~~~~~gd~s~~~ig~~-~~-- 575 (645) T protein:vir:93 507 QP-TGAVWLMSSTNALALSMRKNALGQKE-------YPDMTLLGGSFQGLPVIVSQYVGDQLVLVNAPDIYLADD-GG-- 575 (645) T ss_pred Cc-cccEEEEcHHHHHHHHhccccCCcee-------ecCCCCCCceeeceeeEEeccCCcceeEeccccEEEEEe-cc-- Confidence 11 112478999988888643 3323222 1211 12222211110 011111111111 10 Q ss_pred EEEEcChhh------------------hcccceecCCceEEccccceeeeeeecccceeeeccC Q lcl|Aclame:pro 291 ATCGFTEKM------------------RAHSIERYSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) Q Consensus 291 ~~~~~p~~~------------------r~l~~~~~~~~~~vp~~~~t~Gv~ir~P~av~~~~GI 336 (336) ..+.+...- ..+... ..-.+-+.|+.|+++.+ ++|-||++++|+ T Consensus 576 v~i~~s~~a~~~~~~~~~~~~~~~~~~~~v~lf-~~d~vaira~~r~d~~~-~~p~a~~~lt~~ 637 (645) T protein:vir:93 576 VAVDMSREASLEMQSEPTGDSTTPSPVELVSMF-QTGSVAIRAERWINWRR-RRTAAVAVITGV 637 (645) T ss_pred eEEEeecceeEEEeecccccccccccccchhHh-hcCceEEEEEEEEccee-eCccceEEEecc Confidence 111111100 000001 12235567777776654 999999999999 No 66 >protein:vir:96762 Length: 632 # NCBI annotation: putative phage-related protein # Family: family:all:21 # MgeID: mge:1628 # MgeName: VP882 # Cross-refs: genbank:acc:YP_001039818;genbank:gi:126010917;genbank:GeneID:5076272 Probab=96.45 E-value=0.00046 Score=38.90 Aligned_cols=304 Identities=13% Similarity=0.037 Sum_probs=136.9 Q ss_pred CchHHHHH---Hhhh-----cceeccch---hhhccchhH--HHHHhhhhhcccc--cccCcch--HH-HHHHHhhCcee Q lcl|Aclame:pro 1 MRDAQRIQ---NLAR-----AGVILPRS---VQNVSTPLT--EYAMDAADLSPHL--SSTGSSG--IP-NYLTTYVDPAV 62 (336) Q Consensus 1 ~~~~~~~~---~l~~-----~g~~~~~~---~~~~~~~~~--~~a~da~d~~~~l--~t~~~~~--i~-~~l~~~idp~v 62 (336) |+.....+ .++. .++..-.+ ....+...+ .+.+++. ....+ .|..++| +| .++. .++ T Consensus 304 ~~~~~l~rai~a~a~~~~~~a~~~~e~a~~~a~~~G~~arg~~~~~~~l-~~ra~~~~t~~~gg~lvp~~~~~----~~i 378 (632) T protein:vir:96 304 LQQYSLMRAINAAATGDWSKAGFEREVSLAIADASGKEARGFYMPHEVL-VQRQLEKKTAGKGGELVATELLS----EEF 378 (632) T ss_pred HHHHHHHHHHHhhhccchhhhhhhhHHHHHHHHhhhhhhhhhhhhHHHH-HHhhhhcccccccccccccccch----HHH Confidence 11111111 1110 00000000 000000000 0111110 00111 1222222 23 3332 223 Q ss_pred eeeeccccchhhhcccccCCCcceeeEEEeeeecceeeEEeecccCCceeeeeeeeeeeeEEEEEEEEeeCHHHHHHHHh Q lcl|Aclame:pro 63 IDILVAPMKAAELVGESKKGDWTTLVAAFITAEPTTKVATYGDYSSDGDSGANINYPQRQSYFFQTWTRWGERELEMAGA 142 (336) Q Consensus 63 ~~~~~~~~~~~~l~~v~t~g~w~~~t~~~~~~e~~G~a~~ygd~~diP~~~~~~~~~~~~v~~~~~~~~y~~~El~~A~~ 142 (336) ++.+.+..-+..+ +...... ....+.++.....+.+...|-...+|..+...+..+-..+.++..+.+|.+=|..+ T Consensus 379 ie~lr~~s~i~~l-~~~~~~~-~~g~~~ip~~~~~~~a~wv~E~~~~~~s~~~f~~i~l~~~k~~~~v~iS~ell~ds-- 454 (632) T protein:vir:96 379 IDILRNKAIIGQM-GARMLPG-LVGDVDIPKKTSGANFYWIGEDEDVQDSDFDFTTLSFSPKTIAGAVPVTRKLRKQS-- 454 (632) T ss_pred HHHHhhcchhhhh-cceEeec-CCcceEEEEEeCCceeEeecCCccccccccceeeEEeeeeEEEEehhhHHHHHhcc-- Confidence 3333332223332 2221110 11246788888778888888888899988888888888888888888876544433 Q ss_pred hCCCHHHHHHHHHHHHHHHhhcceEEeecc-ccceEEEEecCCCCcccccccccccccCHHHHHHHHHHHHHHHHHHhCC Q lcl|Aclame:pro 143 GRVDLASELNYSSALGLAKFLNGSYLFGVA-GLENYGLINDPSLSAPITATTPWSGSPAVEAVVNEVVALFQVLQTQSQG 221 (336) Q Consensus 143 ~g~~l~~~k~~aAr~a~e~~~n~~~~~Gd~-~~g~~GllN~Pnl~~~~~~~t~w~~~~t~~eI~~Di~~l~~~l~~~s~g 221 (336) ..++.+.-......++.+.++.-+++|++ +....|++|...++....++ +..+ ++||.++...+..... T Consensus 455 -~~~~~~~i~~~l~~a~~~~~d~a~l~G~G~~~~p~Gi~~~~~~~~~~~~~----~~~~----~~~i~~~~~~i~~~~~- 524 (632) T protein:vir:96 455 -SIHVENLIREDLIEGIGVALDLAMLTGTGLANDPVGLLNMTGVPALTYPA----GGVD----WASVVDMETKISTFNA- 524 (632) T ss_pred -chHHHHHHHHHHHHHHHHHHHHHhhcccCCCCccceeeecccccceeccc----ccCC----HHHHHHHHHHHhhccc- Confidence 56778887788888888888988899987 45678999977665322111 1222 3455556655544321 Q ss_pred ceecccccEEEecHHHHHhccc---CCCCCccHHH--HHHHhCCccEEEEcccccCCCCceEEEEEEeecCCceEEEEcC Q lcl|Aclame:pro 222 IITQEDVLRMGLPPTAMSDLSK---TNQYGLAAAA--KLKDIFPKLEFVTIPEYDTASGRLVQLWAPRVEGKDTATCGFT 296 (336) Q Consensus 222 ~v~~~~p~tL~Lp~~~~~~L~~---~~~~~~Tvl~--~l~~n~pnl~i~~~pel~~a~G~~~~~~~~~~~~~~~~~~~~p 296 (336) + -.....+|.+.....|.. .+..|.-+++ .| .-||-+.-..+|.-...-|+-...++-.+.+-... .-| T Consensus 525 --~-~~~~~~~~~~~~~~~l~~~~l~d~~G~~i~~~~~l-~G~pv~~s~~ip~~~~~~gd~s~~~i~~~~~~~i~--~~~ 598 (632) T protein:vir:96 525 --D-AGRLAYLTSVTQRGAAKKAQVFDNTGERIWQNNEV-NGYRAEASNQIPADTWIFGDWSQIVIAMWGVLDLK--VDP 598 (632) T ss_pred --c-cCccEEEEchhHHHHHHHHhccCCCCceeecCCee-cccceEeccccccCcEEEeecceEEEEEecceEEE--Ecc Confidence 1 123457888877666642 2333332321 00 01332222223322222243333333332221111 011 Q ss_pred hhhhcccceecCCceEEccccceeeeeeecccceeeeccC Q lcl|Aclame:pro 297 EKMRAHSIERYSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) Q Consensus 297 ~~~r~l~~~~~~~~~~vp~~~~t~Gv~ir~P~av~~~~GI 336 (336) ... .......+-+..+ .++-+++|-+|+...== T Consensus 599 ~~~------~~~~~v~~~~~~~-~d~~v~~~~af~~~k~~ 631 (632) T protein:vir:96 599 YTK------AASDGLVLRVFQD-VDAGVRRKEAFCIAKKG 631 (632) T ss_pred ccc------cccCceEEEEEee-cCceeechhhhhheeec Confidence 110 0111222333333 34567778777643222 No 67 >protein:vir:81070 Length: 390 # NCBI annotation: p09 # Family: family:all:585 # MgeID: mge:1889 # MgeName: Xop411 # Cross-refs: genbank:acc:YP_001285679;genbank:gi:148727187;genbank:GeneID:5247115 Probab=96.43 E-value=0.00046 Score=38.90 Aligned_cols=302 Identities=14% Similarity=0.094 Sum_probs=144.3 Q ss_pred CchHHH-HHHhhhcceeccchhhhccc-------------------hhHHHHHhhh-hhcccccccCcc--hHHHHHHHh Q lcl|Aclame:pro 1 MRDAQR-IQNLARAGVILPRSVQNVST-------------------PLTEYAMDAA-DLSPHLSSTGSS--GIPNYLTTY 57 (336) Q Consensus 1 ~~~~~~-~~~l~~~g~~~~~~~~~~~~-------------------~~~~~a~da~-d~~~~l~t~~~~--~i~~~l~~~ 57 (336) +...++ +..+++..-........... .....-..++ ..+....+.+.+ -+|.+. T Consensus 54 i~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~--- 130 (390) T protein:vir:81 54 VQAARQRVAELEGNGAGGDVQHVSVGDMFVASEQFQASAGRWNDRSARATMNIKAALNTASTDAAGSAGALTTPNRL--- 130 (390) T ss_pred HHHHHHHHHHHHhcccccccccccchhhhhhhHHHHHHHHHHhhhhhhhhhHHHHHHHhhccccccCCcceechhhh--- Confidence 111110 11111111100000000000 0000000010 111111111111 122332 Q ss_pred hCceeeeeeccccchhhhcccccCCCcceeeEEEeeeec-ceeeEEeecccCCceeeeeeeeeeeeEEEEEEEEeeCHHH Q lcl|Aclame:pro 58 VDPAVIDILVAPMKAAELVGESKKGDWTTLVAAFITAEP-TTKVATYGDYSSDGDSGANINYPQRQSYFFQTWTRWGERE 136 (336) Q Consensus 58 idp~v~~~~~~~~~~~~l~~v~t~g~w~~~t~~~~~~e~-~G~a~~ygd~~diP~~~~~~~~~~~~v~~~~~~~~y~~~E 136 (336) +++++.+........++++.+.+. .+..++.... .+.+...+.+..+|-.+.........+..++..+.+|.+= T Consensus 131 --~~ii~~~~~~~~l~~~~~~~~~~~---~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~i~~~~~k~~~~~~is~el 205 (390) T protein:vir:81 131 --PGFITPPDARLTVRDLIGSGRTDS---ALIEYVQETGFVNNAAIVAEGALKPESSLKFAKKTDTTHVIAHTMKATRQI 205 (390) T ss_pred --HHHHHHHhhhhhhhhhcceeeccC---CceEEEEEecCCcceeeecCCcccccccceeeEEEEeeeEEEEeehhhHHH Confidence 345555566666667766644332 2345555544 4677788888899999998888899999999998888753 Q ss_pred HHHHHhhCCCHHHHHHHHHHHHHHHhhcceEEeeccc-cceEEEEecCCCCcccccccccccccCHHHHHHHHHHHHHHH Q lcl|Aclame:pro 137 LEMAGAGRVDLASELNYSSALGLAKFLNGSYLFGVAG-LENYGLINDPSLSAPITATTPWSGSPAVEAVVNEVVALFQVL 215 (336) Q Consensus 137 l~~A~~~g~~l~~~k~~aAr~a~e~~~n~~~~~Gd~~-~g~~GllN~Pnl~~~~~~~t~w~~~~t~~eI~~Di~~l~~~l 215 (336) ++.+ .++.+.-....++++.+.+|+-.++|+.. ....|++|.+........ .+....++||..++..+ T Consensus 206 l~d~----~~~~~~i~~~l~~~~~~~~d~a~l~G~g~~~~~~Gi~~~~~~~~~~~~-------~~~~~~~~~~~~~~~~~ 274 (390) T protein:vir:81 206 LSDA----PQLASYMNNRLIRGLKVKEDAEILRGTGANDGLLGLIPQATTYAAPTT-------IAGATRVDQLRLAMLQA 274 (390) T ss_pred HHhH----HHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCcccceeecccccccccc-------cccchhHHHHHHHHHhh Confidence 3322 25888888888888888889888999864 347899997655322111 11222456777777766 Q ss_pred HHHhCCceecccccEEEecHHHHHhccc-CCCCCccHHHHHHHh----CCccEEEEcccccCC---CCce--EEEEEEee Q lcl|Aclame:pro 216 QTQSQGIITQEDVLRMGLPPTAMSDLSK-TNQYGLAAAAKLKDI----FPKLEFVTIPEYDTA---SGRL--VQLWAPRV 285 (336) Q Consensus 216 ~~~s~g~v~~~~p~tL~Lp~~~~~~L~~-~~~~~~Tvl~~l~~n----~pnl~i~~~pel~~a---~G~~--~~~~~~~~ 285 (336) ... + ..+..++|.|+.+..|.+ .+..|.-++.-.... .-++.++..+.+... -|.. .+.++++ T Consensus 275 ~~~--~----~~~~~~v~~~~~~~~l~~lkd~~G~~l~~~~~~~~~~~l~G~pv~~~~~~p~~~~~~gd~~~~~~~~~~- 347 (390) T protein:vir:81 275 SLA--E----YNPSGIVINPIDWAAIELAKDANNQYLIGNARGTLTPTLWGLPVVATQAMAPGEFLVGAFDLAAQIFDQ- 347 (390) T ss_pred ccc--c----CCCCEEEEcHHHHHHHHHhhcCCCceeecCcccccCceecceeeEEcCCCCCCcEEEEehhceEEEEEe- Confidence 432 1 246689999999888864 344343332211111 012223322222110 1211 1222211 Q ss_pred cCCceEEEEcChhhhcccceecCCceEEccccceeeeeeecccceeeeccC Q lcl|Aclame:pro 286 EGKDTATCGFTEKMRAHSIERYSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) Q Consensus 286 ~~~~~~~~~~p~~~r~l~~~~~~~~~~vp~~~~t~Gv~ir~P~av~~~~GI 336 (336) .+ ..+ .+...+.....-...+-+..|++| .++.|.||++.+== T Consensus 348 ~~---~~v----~~~~~~~~~~~~~v~~r~~~r~d~-~v~~~~a~v~~t~a 390 (390) T protein:vir:81 348 WD---ARV----EIGYVGEDFQRNMITVLAEERLAL-VVYRPEALISGSFA 390 (390) T ss_pred cc---eEE----EEecccchhhcCcEEEEEEEeecc-EEecccceEEEEeC Confidence 11 111 111111111111233445566655 66777777654311 No 68 >protein:vir:4159 Length: 315 # NCBI annotation: structural protein # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:87 # MgeName: psiM2 # Cross-refs: genbank:acc:NP_046968;genbank:gi:9630538;genbank:GeneID:1261712 Probab=96.39 E-value=0.00067 Score=37.98 Aligned_cols=299 Identities=11% Similarity=0.038 Sum_probs=135.5 Q ss_pred eeccchhhhccchhHHHHHhhhhhcccccccCcchHHHHHHHhhCceeeeeeccccchhhhcccccCCCccee--eEEEe Q lcl|Aclame:pro 15 VILPRSVQNVSTPLTEYAMDAADLSPHLSSTGSSGIPNYLTTYVDPAVIDILVAPMKAAELVGESKKGDWTTL--VAAFI 92 (336) Q Consensus 15 ~~~~~~~~~~~~~~~~~a~da~d~~~~l~t~~~~~i~~~l~~~idp~v~~~~~~~~~~~~l~~v~t~g~w~~~--t~~~~ 92 (336) +--+++.-.-......-++...|..++.. .|.+++.+|+ .+.+ .-+-++....+... +++.-+ ..-+. T Consensus 1 ~~~~~~~~~~~~~~~~k~~t~~d~~Gg~l------~P~~~~~~i~-~~~e-~s~~l~~~~vi~~~--~~~~~~i~~~g~~ 70 (315) T protein:vir:41 1 MLTIEDIRGGKPFEIVPKIDVPDLGRGVL------SVDRFGEFVK-AVRD-SAVIIPEARIDNAL--KSYEKDISRLSLV 70 (315) T ss_pred CcccchhhcCChhhhhhhcCCcCCCCcee------chHHHHHHHH-HHHh-hhhhhhhceeeecc--ccccccccccccC Confidence 11111110000111111222223333322 3566666553 3333 12222222222111 111100 00000 Q ss_pred eeecceeeEEeecccCCceeeeeeeeeeeeEEEEEEEEeeCHHHHHHHHhhCCCHHHHHHHHHHHHHHHhhcceEEeecc Q lcl|Aclame:pro 93 TAEPTTKVATYGDYSSDGDSGANINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASELNYSSALGLAKFLNGSYLFGVA 172 (336) Q Consensus 93 ~~e~~G~a~~ygd~~diP~~~~~~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~k~~aAr~a~e~~~n~~~~~Gd~ 172 (336) ..-..| +...|.....|-.+..........+.+..-...+.+.|. -.+-+.++.+.-......++.+.+....+.||+ T Consensus 71 ~~~~~g-~~~~~~~~~~~~~~~~f~~~~l~~~~l~~~~~it~elL~-D~~~~~~~e~~l~~~~a~~~a~~~~~~~~nGdg 148 (315) T protein:vir:41 71 LDVGPG-RDETGQKLAPPESTAEVKTNTLYMREMVTKVVIHEDAIE-DNIEGKAFEQKIVTLLGEGISYVLEKYYLHGDT 148 (315) T ss_pred cccccc-cccccCcCCCCCCccccceeeeceeeeeeeccccHHHHH-hhhccccHHHHHHHHHHHHHHHHHHHHhhccCC Confidence 000001 112233334444444333333334444444455555554 344578999999999999999999999999987 Q ss_pred c------cceEEEEecCCCCcccccccccccccCHHHHHHHHHHHHHHHHHHhCCceecccccEEEecHHHHHhccc-CC Q lcl|Aclame:pro 173 G------LENYGLINDPSLSAPITATTPWSGSPAVEAVVNEVVALFQVLQTQSQGIITQEDVLRMGLPPTAMSDLSK-TN 245 (336) Q Consensus 173 ~------~g~~GllN~Pnl~~~~~~~t~w~~~~t~~eI~~Di~~l~~~l~~~s~g~v~~~~p~tL~Lp~~~~~~L~~-~~ 245 (336) . ....|+|+.....+.. ....+.+...+.+.+.|+...+..-..+.. .....+|+.+.+..+.+ .+ T Consensus 149 ~s~~p~~~~~~G~l~~a~~~~~~-~~~~~~a~~~~~d~l~~l~~sl~~~yr~~~------~~~~~imn~~t~~~~rklk~ 221 (315) T protein:vir:41 149 SSSDPLLRMSDGWLKLASEKLTE-SDVDPEAEDWPMNLFDTMIESLPTPYRNNL------PNMKFYVTWDIYRAYRDALK 221 (315) T ss_pred cCcCccccccccceecccccccc-cccccccccccHHHHHHHHHhcChHHhhcC------CceEEEEcHHHHHHHHHHhc Confidence 4 4567999865443221 122222222233444444433333222211 12468888887765432 12 Q ss_pred CCCccHHHHH-HH----hCCccEEEEcccccCC-CCceEEEEEEeecCCceEEEEcChhhhcccc-eecCCceEEccccc Q lcl|Aclame:pro 246 QYGLAAAAKL-KD----IFPKLEFVTIPEYDTA-SGRLVQLWAPRVEGKDTATCGFTEKMRAHSI-ERYSSYFRQKKSAG 318 (336) Q Consensus 246 ~~~~Tvl~~l-~~----n~pnl~i~~~pel~~a-~G~~~~~~~~~~~~~~~~~~~~p~~~r~l~~-~~~~~~~~vp~~~~ 318 (336) .-|.-+++=. .. .+-+..++.+|.+... .+....++.+. +.....+-..+|.++- ..+...+..-.+.| T Consensus 222 ~~g~~lw~~~~~~g~~~tl~G~PV~~~~~m~~~~~~~~~ilf~d~----~nl~~~~~~~i~i~~~~~a~~~~~~~~~~~r 297 (315) T protein:vir:41 222 GRETGLGDQALTGANSILYDGRPVQYVPALEALNDGKSRALFVVP----TQLVYGFWRNIKVVPDYDAEMRLTKYVASLR 297 (315) T ss_pred cCCCccccchhhcCCCceecccceEecccccccCCCCccEEEecc----cceEEEeccccEEEeeecCCCCceEEEEEEE Confidence 1122222211 11 1223456777777543 34445555432 1122333344444432 12223355555677 Q ss_pred eeeeeeecccceeeeccC Q lcl|Aclame:pro 319 TWGAVIFRPFAVAQMIGV 336 (336) Q Consensus 319 t~Gv~ir~P~av~~~~GI 336 (336) .+|-.+-...+++....| T Consensus 298 ~d~~~~~~~~~a~~~~~v 315 (315) T protein:vir:41 298 TDNHYEDEEGAVSATITV 315 (315) T ss_pred eceeEEeccceeEeeeeC Confidence 877666688999999999 No 69 >protein:vir:4600 Length: 415 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:101 # MgeName: PVL # Cross-refs: genbank:acc:NP_058445;genbank:gi:9635171;genbank:GeneID:1262708 Probab=96.37 E-value=0.0007 Score=37.89 Aligned_cols=303 Identities=8% Similarity=0.053 Sum_probs=138.7 Q ss_pred Cch---H-HHHHHhhh-------------------------cceeccchhhhccchhHHH---HHhhhhhcccccccCcc Q lcl|Aclame:pro 1 MRD---A-QRIQNLAR-------------------------AGVILPRSVQNVSTPLTEY---AMDAADLSPHLSSTGSS 48 (336) Q Consensus 1 ~~~---~-~~~~~l~~-------------------------~g~~~~~~~~~~~~~~~~~---a~da~d~~~~l~t~~~~ 48 (336) ++. . +++....+ .+..+... .....+.+.+ .....+......++..+ T Consensus 51 i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~t~~g 129 (415) T protein:vir:46 51 IQEKQEELDKLKEKDRTSENNQQSVEVNEARTYRNQANINDLGISIQNT-KVTSQEVRDFTEYLETRNDIQGGSLKTDSG 129 (415) T ss_pred HHHHHHHHHHHHHHHHhhhhcccccccchhhhhHHHHHHHHHHHhhhhh-hhhHHHHHHHHHHHhhhhhhhhccccccCC Confidence 000 0 00000000 00000000 0000011110 00000111111122222 Q ss_pred --hHHHHHHHhhCceeeeeeccccchhhhcccccCCCcceeeEEEeee--ecceeeEEeecccCCcee-eeeeeeeeeeE Q lcl|Aclame:pro 49 --GIPNYLTTYVDPAVIDILVAPMKAAELVGESKKGDWTTLVAAFITA--EPTTKVATYGDYSSDGDS-GANINYPQRQS 123 (336) Q Consensus 49 --~i~~~l~~~idp~v~~~~~~~~~~~~l~~v~t~g~w~~~t~~~~~~--e~~G~a~~ygd~~diP~~-~~~~~~~~~~v 123 (336) -+|..+. ++|++.+........++.+..... .+..+++. ...+.+...+.+..+|-. ........... T Consensus 130 ~~~iP~~~~----~~ii~~~~~~~~l~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~v~Eg~~~~~~~~~~~~~v~~~~ 202 (415) T protein:vir:46 130 FVVIPEEIV----TDILKLKEVEFNLDKYVTVKRVTN---GSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQLAYDI 202 (415) T ss_pred cccccHHHH----HHHHHHHHhhhhhhhhcceeeccC---CceeEEEEEecCCcceeecccccccccccccceeeEEeee Confidence 3665544 455666666666777766533221 11233333 334456677888888854 45677788888 Q ss_pred EEEEEEEeeCHHHHHHHHhhCCCHHHHHHHHHHHHHHHhhcceEEeeccccceEEEEecCCCCcccccccccccccCHHH Q lcl|Aclame:pro 124 YFFQTWTRWGERELEMAGAGRVDLASELNYSSALGLAKFLNGSYLFGVAGLENYGLINDPSLSAPITATTPWSGSPAVEA 203 (336) Q Consensus 124 ~~~~~~~~y~~~El~~A~~~g~~l~~~k~~aAr~a~e~~~n~~~~~Gd~~~g~~GllN~Pnl~~~~~~~t~w~~~~t~~e 203 (336) +.++..+.+|.+=+. ....+|.+.-....+.++.+.+++-++.|+......+........ ... + ..+ ... T Consensus 203 ~k~~~~~~iS~ell~---ds~~~l~~~i~~~l~~~i~~~~d~~il~g~g~g~~~~~~~~~~~~-~~~----~-~~~-~~~ 272 (415) T protein:vir:46 203 NTHRGYFRISREAIE---DAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGFEKE-GKK----L-EVK-KAK 272 (415) T ss_pred eeeEeeehhhHHHHh---hchHHHHHHHHHHHHHHHHHHHHHHHhhccccCCccccccccccc-cce----e-ccc-ccc Confidence 888888888875443 344688888888888888999999888887654333333321111 111 1 111 111 Q ss_pred HHHHHHHHHHHHHHHhCCceecccccEEEecHHHHHhccc-CCCCCccHHH-HHHHh----CCccEEEEccccc-CCCCc Q lcl|Aclame:pro 204 VVNEVVALFQVLQTQSQGIITQEDVLRMGLPPTAMSDLSK-TNQYGLAAAA-KLKDI----FPKLEFVTIPEYD-TASGR 276 (336) Q Consensus 204 I~~Di~~l~~~l~~~s~g~v~~~~p~tL~Lp~~~~~~L~~-~~~~~~Tvl~-~l~~n----~pnl~i~~~pel~-~a~G~ 276 (336) -++||.+++..+...- -.+..++|.++.+..|.+ .+..|.-++. -+... .-++.++..+..- +++|. T Consensus 273 ~~~~i~~~~~~~~~~~------~~~~~~v~n~~~~~~L~~lkd~~G~~i~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~ 346 (415) T protein:vir:46 273 SLDDIKDAINLNVKPN------YEHNVAIVSQTMFAKLDKMKDKLGNYLIQPDVKEKTQQRLLGAKIEILPDEVLGQKGN 346 (415) T ss_pred chHHHHHHHHhhhhhc------cCCCEEEEcHHHHHHHHHhhccCCCeeeccCcCCCCCccccceeeEEeccccccCCCc Confidence 3567777777765431 135689999999998854 3333332221 01111 1112333333222 23344 Q ss_pred eEEEEEEeec-----CCceEEEEcChhhhcccceecCCceEEccccceeeeeeecccceeeeccC Q lcl|Aclame:pro 277 LVQLWAPRVE-----GKDTATCGFTEKMRAHSIERYSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) Q Consensus 277 ~~~~~~~~~~-----~~~~~~~~~p~~~r~l~~~~~~~~~~vp~~~~t~Gv~ir~P~av~~~~GI 336 (336) ...++.+-.+ ...-..+.. .+|... ....-+..|+ |+.+.+|-||++++-- T Consensus 347 ~~~~~gd~~~~~~~~~~~~~~v~~-~~~~~~-------~~~~~~~~r~-d~~v~~~~a~~~~~~~ 402 (415) T protein:vir:46 347 NTLIIGNLKDAIVLFDRSQYQASW-TDYMHF-------GECLMIAVRQ-DCRILDYKSAIVIEYD 402 (415) T ss_pred cEEEEEehhccEEEEeecceEEEe-eccccC-------ceEEEEEEEe-ccEEeccccEEEEEee Confidence 3333332110 000011111 111111 1122345554 6667789999888644 No 70 >protein:vir:4700 Length: 415 # NCBI annotation: phi PVL ORF 7 homologue # Family: family:all:21 # MgeID: mge:102 # MgeName: phiPV83 # Cross-refs: genbank:acc:NP_061632;genbank:gi:9635719;genbank:GeneID:1262976 Probab=96.37 E-value=0.0007 Score=37.89 Aligned_cols=303 Identities=8% Similarity=0.053 Sum_probs=138.7 Q ss_pred Cch---H-HHHHHhhh-------------------------cceeccchhhhccchhHHH---HHhhhhhcccccccCcc Q lcl|Aclame:pro 1 MRD---A-QRIQNLAR-------------------------AGVILPRSVQNVSTPLTEY---AMDAADLSPHLSSTGSS 48 (336) Q Consensus 1 ~~~---~-~~~~~l~~-------------------------~g~~~~~~~~~~~~~~~~~---a~da~d~~~~l~t~~~~ 48 (336) ++. . +++....+ .+..+... .....+.+.+ .....+......++..+ T Consensus 51 i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~t~~g 129 (415) T protein:vir:47 51 IQEKQEELDKLKEKDRTSENNQQSVEVNEARTYRNQANINDLGISIQNT-KVTSQEVRDFTEYLETRNDIQGGSLKTDSG 129 (415) T ss_pred HHHHHHHHHHHHHHHHhhhhcccccccchhhhhHHHHHHHHHHHhhhhh-hhhHHHHHHHHHHHhhhhhhhhccccccCC Confidence 000 0 00000000 00000000 0000011110 00000111111122222 Q ss_pred --hHHHHHHHhhCceeeeeeccccchhhhcccccCCCcceeeEEEeee--ecceeeEEeecccCCcee-eeeeeeeeeeE Q lcl|Aclame:pro 49 --GIPNYLTTYVDPAVIDILVAPMKAAELVGESKKGDWTTLVAAFITA--EPTTKVATYGDYSSDGDS-GANINYPQRQS 123 (336) Q Consensus 49 --~i~~~l~~~idp~v~~~~~~~~~~~~l~~v~t~g~w~~~t~~~~~~--e~~G~a~~ygd~~diP~~-~~~~~~~~~~v 123 (336) -+|..+. ++|++.+........++.+..... .+..+++. ...+.+...+.+..+|-. ........... T Consensus 130 ~~~iP~~~~----~~ii~~~~~~~~l~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~v~Eg~~~~~~~~~~~~~v~~~~ 202 (415) T protein:vir:47 130 FVVIPEEIV----TDILKLKEVEFNLDKYVTVKRVTN---GSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQLAYDI 202 (415) T ss_pred cccccHHHH----HHHHHHHHhhhhhhhhcceeeccC---CceeEEEEEecCCcceeecccccccccccccceeeEEeee Confidence 3665544 455666666666777766533221 11233333 334456677888888854 45677788888 Q ss_pred EEEEEEEeeCHHHHHHHHhhCCCHHHHHHHHHHHHHHHhhcceEEeeccccceEEEEecCCCCcccccccccccccCHHH Q lcl|Aclame:pro 124 YFFQTWTRWGERELEMAGAGRVDLASELNYSSALGLAKFLNGSYLFGVAGLENYGLINDPSLSAPITATTPWSGSPAVEA 203 (336) Q Consensus 124 ~~~~~~~~y~~~El~~A~~~g~~l~~~k~~aAr~a~e~~~n~~~~~Gd~~~g~~GllN~Pnl~~~~~~~t~w~~~~t~~e 203 (336) +.++..+.+|.+=+. ....+|.+.-....+.++.+.+++-++.|+......+........ ... + ..+ ... T Consensus 203 ~k~~~~~~iS~ell~---ds~~~l~~~i~~~l~~~i~~~~d~~il~g~g~g~~~~~~~~~~~~-~~~----~-~~~-~~~ 272 (415) T protein:vir:47 203 NTHRGYFRISREAIE---DAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGFEKE-GKK----L-EVK-KAK 272 (415) T ss_pred eeeEeeehhhHHHHh---hchHHHHHHHHHHHHHHHHHHHHHHHhhccccCCccccccccccc-cce----e-ccc-ccc Confidence 888888888875443 344688888888888888999999888887654333333321111 111 1 111 111 Q ss_pred HHHHHHHHHHHHHHHhCCceecccccEEEecHHHHHhccc-CCCCCccHHH-HHHHh----CCccEEEEccccc-CCCCc Q lcl|Aclame:pro 204 VVNEVVALFQVLQTQSQGIITQEDVLRMGLPPTAMSDLSK-TNQYGLAAAA-KLKDI----FPKLEFVTIPEYD-TASGR 276 (336) Q Consensus 204 I~~Di~~l~~~l~~~s~g~v~~~~p~tL~Lp~~~~~~L~~-~~~~~~Tvl~-~l~~n----~pnl~i~~~pel~-~a~G~ 276 (336) -++||.+++..+...- -.+..++|.++.+..|.+ .+..|.-++. -+... .-++.++..+..- +++|. T Consensus 273 ~~~~i~~~~~~~~~~~------~~~~~~v~n~~~~~~L~~lkd~~G~~i~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~ 346 (415) T protein:vir:47 273 SLDDIKDAINLNVKPN------YEHNVAIVSQTMFAKLDKMKDKLGNYLIQPDVKEKTQQRLLGAKIEILPDEVLGQKGN 346 (415) T ss_pred chHHHHHHHHhhhhhc------cCCCEEEEcHHHHHHHHHhhccCCCeeeccCcCCCCCccccceeeEEeccccccCCCc Confidence 3567777777765431 135689999999998854 3333332221 01111 1112333333222 23344 Q ss_pred eEEEEEEeec-----CCceEEEEcChhhhcccceecCCceEEccccceeeeeeecccceeeeccC Q lcl|Aclame:pro 277 LVQLWAPRVE-----GKDTATCGFTEKMRAHSIERYSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) Q Consensus 277 ~~~~~~~~~~-----~~~~~~~~~p~~~r~l~~~~~~~~~~vp~~~~t~Gv~ir~P~av~~~~GI 336 (336) ...++.+-.+ ...-..+.. .+|... ....-+..|+ |+.+.+|-||++++-- T Consensus 347 ~~~~~gd~~~~~~~~~~~~~~v~~-~~~~~~-------~~~~~~~~r~-d~~v~~~~a~~~~~~~ 402 (415) T protein:vir:47 347 NTLIIGNLKDAIVLFDRSQYQASW-TDYMHF-------GECLMIAVRQ-DCRILDYKSAIVIEYD 402 (415) T ss_pred cEEEEEehhccEEEEeecceEEEe-eccccC-------ceEEEEEEEe-ccEEeccccEEEEEee Confidence 3333332110 000011111 111111 1122345554 6667789999888644 No 71 >protein:vir:93742 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1475 # MgeName: 55 # Cross-refs: genbank:acc:YP_240459;genbank:gi:66396126;genbank:GeneID:5133511 Probab=96.16 E-value=0.00066 Score=38.01 Aligned_cols=255 Identities=9% Similarity=0.013 Sum_probs=131.9 Q ss_pred hhhhhcccccccCcchHHHHHHHhhCceeeeeeccccchhhhcccccC--CCcceeeEEEeeeecceeeEEeecccCCce Q lcl|Aclame:pro 34 DAADLSPHLSSTGSSGIPNYLTTYVDPAVIDILVAPMKAAELVGESKK--GDWTTLVAAFITAEPTTKVATYGDYSSDGD 111 (336) Q Consensus 34 da~d~~~~l~t~~~~~i~~~l~~~idp~v~~~~~~~~~~~~l~~v~t~--g~w~~~t~~~~~~e~~G~a~~ygd~~diP~ 111 (336) .|+ ..+.-++--+|..+..||.-++ ...+....+..+... |.- -.++.++.+...|.+..|.++++++. T Consensus 1 ma~----~~T~~~~~iiPev~~~~v~~~~----~~~~~~~~~~~~~~~l~g~~-G~tv~ip~~~~~g~~~~~~eg~~i~~ 71 (274) T protein:vir:93 1 MPQ----GITKTSNQIIPEVLAPMMQAQL----EKKLRFASFAEVDSTLQGQP-GDTLTFPAFVYSGDAQVVAEGEKIPT 71 (274) T ss_pred CCc----cceehhheechHHHHHHHHHHH----HhhhhhcccccccccccCCC-CCEEEEEeeccCCCcccccCCCcccc Confidence 121 2344556678888888885443 334445555555432 222 25799999999999999999999999 Q ss_pred eeeeeeeeeeeEEEEEEEEeeCHHHHHHHHhhCCCHHHHHHHHHHHHHHHhhcceEEeeccccceEEEEecCCCCccccc Q lcl|Aclame:pro 112 SGANINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASELNYSSALGLAKFLNGSYLFGVAGLENYGLINDPSLSAPITA 191 (336) Q Consensus 112 ~~~~~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~k~~aAr~a~e~~~n~~~~~Gd~~~g~~GllN~Pnl~~~~~~ 191 (336) .+.........+.+.+-+++++ ++.+++. +-++..+-...+.+++.+++++..+- .++. +..+. T Consensus 72 ~~it~~~~~~~i~~~~~~~~i~--D~~~~~~-~~d~~~~~~~~~~~~~a~~~d~~~~~---------~~~~----a~~~~ 135 (274) T protein:vir:93 72 DILETKKREAKIRKIAKGTSIT--DEALLSG-YGDPQGEQVRQHGLAHANKVDNDVLE---------ALMG----AKLTV 135 (274) T ss_pred cccccceeEEEeeeeccccccc--HHHHHhh-ccchHHHHHHHHHHHHHHHHHHHHHH---------HHhc----ccccc Confidence 9999998888887776555554 4444444 44566667777777787777764331 1111 00001 Q ss_pred ccccccccCHHHHHHHHHHHHHHHHHHhCCceecccccEEEecHHHHHhcccCC--------CCCccHH-HHHHHhCCcc Q lcl|Aclame:pro 192 TTPWSGSPAVEAVVNEVVALFQVLQTQSQGIITQEDVLRMGLPPTAMSDLSKTN--------QYGLAAA-AKLKDIFPKL 262 (336) Q Consensus 192 ~t~w~~~~t~~eI~~Di~~l~~~l~~~s~g~v~~~~p~tL~Lp~~~~~~L~~~~--------~~~~Tvl-~~l~~n~pnl 262 (336) ++ ++.+ +++|.+++..+-.. + ..+..|+++|..+..|.+.. ..|..++ +=.-..|-++ T Consensus 136 ~~---~~~~----~d~i~dA~~~l~d~--~----~~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~~~~G~ig~~~G~ 202 (274) T protein:vir:93 136 NA---DITK----LNGLQSAIDKFNDE--D----LEPMVLFINPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEALGA 202 (274) T ss_pred cc---cccC----HHHHHHHHHHhhhc--c----CCccEEEeCHHHHHHHHhhhhhcccccccccccceeecccceecCe Confidence 11 1223 34445555544321 1 25778999999998886421 1111110 0000012345 Q ss_pred EEEEcccccCCCCceEEEEEEeecCCceEEEEcChhh--hcccceecCCceEEccccceeeeeeecccceeeeccC Q lcl|Aclame:pro 263 EFVTIPEYDTASGRLVQLWAPRVEGKDTATCGFTEKM--RAHSIERYSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) Q Consensus 263 ~i~~~pel~~a~G~~~~~~~~~~~~~~~~~~~~p~~~--r~l~~~~~~~~~~vp~~~~t~Gv~ir~P~av~~~~GI 336 (336) +|...+.+- -++.+++- +.-+....-.+. ...-.+.+.. -.+- .-...|+-+.+|-+++.+.-= T Consensus 203 ~Vi~s~~~p---~~t~~l~~-----~gai~~~~~~~~~vE~~Rd~~~~~-d~i~-~~~~y~~~~~~~~~~v~~t~~ 268 (274) T protein:vir:93 203 IIVRTNKLE---AGTAILAK-----KGAVKLILKRDFFLEVARDASTKT-TALY-SDKHYVAYLYDESKAVKITKG 268 (274) T ss_pred eEEEcCCCC---cceEEEEe-----CCeEEEEecCCcccccccchhhcc-cEEE-EEEEEEEEEEcCCceEEEeeC Confidence 555433332 12222221 111111111111 1111111111 1111 123467888888777776643 No 72 >protein:vir:98339 Length: 415 # NCBI annotation: putative capsid protein # Family: family:all:21 # MgeID: mge:1581 # MgeName: phiPVL(108) # Cross-refs: genbank:acc:YP_918931;genbank:gi:119443693;genbank:GeneID:4594501 Probab=96.10 E-value=0.001 Score=37.02 Aligned_cols=304 Identities=8% Similarity=0.047 Sum_probs=133.6 Q ss_pred CchHH----HHHHh-------------------------hhcceeccchhhhccchhHHH---HHhhhhhcccccccCcc Q lcl|Aclame:pro 1 MRDAQ----RIQNL-------------------------ARAGVILPRSVQNVSTPLTEY---AMDAADLSPHLSSTGSS 48 (336) Q Consensus 1 ~~~~~----~~~~l-------------------------~~~g~~~~~~~~~~~~~~~~~---a~da~d~~~~l~t~~~~ 48 (336) ++..+ ++... ...+-.+... .....+.+.+ .....+......+...+ T Consensus 51 i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g 129 (415) T protein:vir:98 51 IQEKQEELDKLKEKDGTSENNQQSVEVNEARTYRNQANINDLGISIQNT-KVTSQEVRDFTEYLETRNDIQGGSLKTDSG 129 (415) T ss_pred HHHHHHHHHHHHHHHhhhhhcccccccchhhhHHHHHHHHHHhhhhhhh-hhHHHHHHHHHHHHhhhhhhhhcccccccc Confidence 00000 00000 0000000000 0000111111 00000111111222223 Q ss_pred --hHHHHHHHhhCceeeeeeccccchhhhcccccCCCcceeeEEEeeeecceeeEEeecccCCceee-eeeeeeeeeEEE Q lcl|Aclame:pro 49 --GIPNYLTTYVDPAVIDILVAPMKAAELVGESKKGDWTTLVAAFITAEPTTKVATYGDYSSDGDSG-ANINYPQRQSYF 125 (336) Q Consensus 49 --~i~~~l~~~idp~v~~~~~~~~~~~~l~~v~t~g~w~~~t~~~~~~e~~G~a~~ygd~~diP~~~-~~~~~~~~~v~~ 125 (336) -+|..+. ++|++..........++.+.....-. ..+.+......+.+...+.+.++|-.+ .........++. T Consensus 130 g~~iP~~~~----~~ii~~~~~~~~l~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~v~E~~~~~~~~~~~~~~v~~~~~k 204 (415) T protein:vir:98 130 FVVIPEEIV----TDILKLKEVEFNLDKYVTVKRVTNGS-GKYPVVRQSEVAALEKVEELEENPELAVKPFFQLAYDINT 204 (415) T ss_pred ccccchHHH----HHHHHHHHhhhhhhhheeeeeccCCc-eeEEEEeecCCccceeeccccccCcccccceeeEEeeeee Confidence 3665443 45566666666666666554322101 122222333344556667778887544 567778888888 Q ss_pred EEEEEeeCHHHHHHHHhhCCCHHHHHHHHHHHHHHHhhcceEEeeccccce-EEEEecCCCCcccccccccccccCHHHH Q lcl|Aclame:pro 126 FQTWTRWGERELEMAGAGRVDLASELNYSSALGLAKFLNGSYLFGVAGLEN-YGLINDPSLSAPITATTPWSGSPAVEAV 204 (336) Q Consensus 126 ~~~~~~y~~~El~~A~~~g~~l~~~k~~aAr~a~e~~~n~~~~~Gd~~~g~-~GllN~Pnl~~~~~~~t~w~~~~t~~eI 204 (336) ++..+.+|.+=++ ....++.+.-......++.+.+|+-++.|+..... .++++....+ .+.+. .+.. - T Consensus 205 ~~~~~~iS~ell~---ds~~~l~~~i~~~l~~~~~~~~~~~il~g~g~g~~~~~~~~~~~~~--~~~~~--~~~~----~ 273 (415) T protein:vir:98 205 HRGYFRISREAIE---DAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGFEKEG--KKLEV--KKAK----S 273 (415) T ss_pred eEeeehhhHHHHh---hchHHHHHHHHHHHHHHHHHHHHHHHhhccccCccccccccccccc--ccccc--cccc----c Confidence 8888888765333 34557888888888888888888888887754322 2222211111 11111 1112 2 Q ss_pred HHHHHHHHHHHHHHhCCceecccccEEEecHHHHHhccc-CCCCCccHHH-HHHHhCC----ccEEEEcccccC-CCCce Q lcl|Aclame:pro 205 VNEVVALFQVLQTQSQGIITQEDVLRMGLPPTAMSDLSK-TNQYGLAAAA-KLKDIFP----KLEFVTIPEYDT-ASGRL 277 (336) Q Consensus 205 ~~Di~~l~~~l~~~s~g~v~~~~p~tL~Lp~~~~~~L~~-~~~~~~Tvl~-~l~~n~p----nl~i~~~pel~~-a~G~~ 277 (336) ++||.+++..+... + ..+..++|.++.+..|.+ .+..|.-++. =+....+ +..++..+..-. ++|.. T Consensus 274 ~~~i~~~~~~~~~~--~----~~~~~~v~n~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~ 347 (415) T protein:vir:98 274 LDDIKDAINLNVKP--N----YEHNVAIVSQTMFAKLDKMKDKLGNYLIQPDVKEKTQQRLLGAKIEILPDEVLGQKGNN 347 (415) T ss_pred hhHHHHHHHhhhhh--c----cCCCEEEEcHHHHHHHHHhhccCCceeeccCcCCCCCceecceeeEEecccccCCCCcc Confidence 56677777766432 1 136679999999888854 3333332221 0011111 122333333222 23333 Q ss_pred EEEEEEeec-----CCceEEEEcChhhhcccceecCCceEEccccceeeeeeecccceeeeccC Q lcl|Aclame:pro 278 VQLWAPRVE-----GKDTATCGFTEKMRAHSIERYSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) Q Consensus 278 ~~~~~~~~~-----~~~~~~~~~p~~~r~l~~~~~~~~~~vp~~~~t~Gv~ir~P~av~~~~GI 336 (336) ..+|.+-.+ ...-..+.. .+|... ....-+..|+ |+.+++|-||+.++-- T Consensus 348 ~~~~Gd~~~~~~~~~~~~~~v~~-~~~~~~-------~~~~~~~~r~-d~~v~~~~a~~~~~~~ 402 (415) T protein:vir:98 348 TLIIGNLKDAIVLFDRSQYQASW-TDYMHF-------GECLMIAVRQ-DCRILDYKSAIVIEYD 402 (415) T ss_pred EEEEEehhccEEEEeecceEEEE-eccccC-------ceEEEEEEEe-ccEEeccccEEEEEEe Confidence 333322100 000011111 111111 1112234555 5666779999988655 No 73 >protein:vir:81100 Length: 415 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:1891 # MgeName: tp310-1 # Cross-refs: genbank:acc:YP_001429874;genbank:gi:156603927;genbank:GeneID:5525320 Probab=96.10 E-value=0.001 Score=37.02 Aligned_cols=304 Identities=8% Similarity=0.047 Sum_probs=133.6 Q ss_pred CchHH----HHHHh-------------------------hhcceeccchhhhccchhHHH---HHhhhhhcccccccCcc Q lcl|Aclame:pro 1 MRDAQ----RIQNL-------------------------ARAGVILPRSVQNVSTPLTEY---AMDAADLSPHLSSTGSS 48 (336) Q Consensus 1 ~~~~~----~~~~l-------------------------~~~g~~~~~~~~~~~~~~~~~---a~da~d~~~~l~t~~~~ 48 (336) ++..+ ++... ...+-.+... .....+.+.+ .....+......+...+ T Consensus 51 i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g 129 (415) T protein:vir:81 51 IQEKQEELDKLKEKDGTSENNQQSVEVNEARTYRNQANINDLGISIQNT-KVTSQEVRDFTEYLETRNDIQGGSLKTDSG 129 (415) T ss_pred HHHHHHHHHHHHHHHhhhhhcccccccchhhhHHHHHHHHHHhhhhhhh-hhHHHHHHHHHHHHhhhhhhhhcccccccc Confidence 00000 00000 0000000000 0000111111 00000111111222223 Q ss_pred --hHHHHHHHhhCceeeeeeccccchhhhcccccCCCcceeeEEEeeeecceeeEEeecccCCceee-eeeeeeeeeEEE Q lcl|Aclame:pro 49 --GIPNYLTTYVDPAVIDILVAPMKAAELVGESKKGDWTTLVAAFITAEPTTKVATYGDYSSDGDSG-ANINYPQRQSYF 125 (336) Q Consensus 49 --~i~~~l~~~idp~v~~~~~~~~~~~~l~~v~t~g~w~~~t~~~~~~e~~G~a~~ygd~~diP~~~-~~~~~~~~~v~~ 125 (336) -+|..+. ++|++..........++.+.....-. ..+.+......+.+...+.+.++|-.+ .........++. T Consensus 130 g~~iP~~~~----~~ii~~~~~~~~l~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~v~E~~~~~~~~~~~~~~v~~~~~k 204 (415) T protein:vir:81 130 FVVIPEEIV----TDILKLKEVEFNLDKYVTVKRVTNGS-GKYPVVRQSEVAALEKVEELEENPELAVKPFFQLAYDINT 204 (415) T ss_pred ccccchHHH----HHHHHHHHhhhhhhhheeeeeccCCc-eeEEEEeecCCccceeeccccccCcccccceeeEEeeeee Confidence 3665443 45566666666666666554322101 122222333344556667778887544 567778888888 Q ss_pred EEEEEeeCHHHHHHHHhhCCCHHHHHHHHHHHHHHHhhcceEEeeccccce-EEEEecCCCCcccccccccccccCHHHH Q lcl|Aclame:pro 126 FQTWTRWGERELEMAGAGRVDLASELNYSSALGLAKFLNGSYLFGVAGLEN-YGLINDPSLSAPITATTPWSGSPAVEAV 204 (336) Q Consensus 126 ~~~~~~y~~~El~~A~~~g~~l~~~k~~aAr~a~e~~~n~~~~~Gd~~~g~-~GllN~Pnl~~~~~~~t~w~~~~t~~eI 204 (336) ++..+.+|.+=++ ....++.+.-......++.+.+|+-++.|+..... .++++....+ .+.+. .+.. - T Consensus 205 ~~~~~~iS~ell~---ds~~~l~~~i~~~l~~~~~~~~~~~il~g~g~g~~~~~~~~~~~~~--~~~~~--~~~~----~ 273 (415) T protein:vir:81 205 HRGYFRISREAIE---DAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGFEKEG--KKLEV--KKAK----S 273 (415) T ss_pred eEeeehhhHHHHh---hchHHHHHHHHHHHHHHHHHHHHHHHhhccccCccccccccccccc--ccccc--cccc----c Confidence 8888888765333 34557888888888888888888888887754322 2222211111 11111 1112 2 Q ss_pred HHHHHHHHHHHHHHhCCceecccccEEEecHHHHHhccc-CCCCCccHHH-HHHHhCC----ccEEEEcccccC-CCCce Q lcl|Aclame:pro 205 VNEVVALFQVLQTQSQGIITQEDVLRMGLPPTAMSDLSK-TNQYGLAAAA-KLKDIFP----KLEFVTIPEYDT-ASGRL 277 (336) Q Consensus 205 ~~Di~~l~~~l~~~s~g~v~~~~p~tL~Lp~~~~~~L~~-~~~~~~Tvl~-~l~~n~p----nl~i~~~pel~~-a~G~~ 277 (336) ++||.+++..+... + ..+..++|.++.+..|.+ .+..|.-++. =+....+ +..++..+..-. ++|.. T Consensus 274 ~~~i~~~~~~~~~~--~----~~~~~~v~n~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~ 347 (415) T protein:vir:81 274 LDDIKDAINLNVKP--N----YEHNVAIVSQTMFAKLDKMKDKLGNYLIQPDVKEKTQQRLLGAKIEILPDEVLGQKGNN 347 (415) T ss_pred hhHHHHHHHhhhhh--c----cCCCEEEEcHHHHHHHHHhhccCCceeeccCcCCCCCceecceeeEEecccccCCCCcc Confidence 56677777766432 1 136679999999888854 3333332221 0011111 122333333222 23333 Q ss_pred EEEEEEeec-----CCceEEEEcChhhhcccceecCCceEEccccceeeeeeecccceeeeccC Q lcl|Aclame:pro 278 VQLWAPRVE-----GKDTATCGFTEKMRAHSIERYSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) Q Consensus 278 ~~~~~~~~~-----~~~~~~~~~p~~~r~l~~~~~~~~~~vp~~~~t~Gv~ir~P~av~~~~GI 336 (336) ..+|.+-.+ ...-..+.. .+|... ....-+..|+ |+.+++|-||+.++-- T Consensus 348 ~~~~Gd~~~~~~~~~~~~~~v~~-~~~~~~-------~~~~~~~~r~-d~~v~~~~a~~~~~~~ 402 (415) T protein:vir:81 348 TLIIGNLKDAIVLFDRSQYQASW-TDYMHF-------GECLMIAVRQ-DCRILDYKSAIVIEYD 402 (415) T ss_pred EEEEEehhccEEEEeecceEEEE-eccccC-------ceEEEEEEEe-ccEEeccccEEEEEEe Confidence 333322100 000011111 111111 1112234555 5666779999988655 No 74 >protein:vir:79987 Length: 415 # NCBI annotation: head protein # Family: family:all:21 # MgeID: mge:1875 # MgeName: tp310-3 # Cross-refs: genbank:acc:YP_001430002;genbank:gi:156604057;genbank:GeneID:5525447 Probab=96.10 E-value=0.001 Score=37.02 Aligned_cols=304 Identities=8% Similarity=0.047 Sum_probs=133.6 Q ss_pred CchHH----HHHHh-------------------------hhcceeccchhhhccchhHHH---HHhhhhhcccccccCcc Q lcl|Aclame:pro 1 MRDAQ----RIQNL-------------------------ARAGVILPRSVQNVSTPLTEY---AMDAADLSPHLSSTGSS 48 (336) Q Consensus 1 ~~~~~----~~~~l-------------------------~~~g~~~~~~~~~~~~~~~~~---a~da~d~~~~l~t~~~~ 48 (336) ++..+ ++... ...+-.+... .....+.+.+ .....+......+...+ T Consensus 51 i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g 129 (415) T protein:vir:79 51 IQEKQEELDKLKEKDGTSENNQQSVEVNEARTYRNQANINDLGISIQNT-KVTSQEVRDFTEYLETRNDIQGGSLKTDSG 129 (415) T ss_pred HHHHHHHHHHHHHHHhhhhhcccccccchhhhHHHHHHHHHHhhhhhhh-hhHHHHHHHHHHHHhhhhhhhhcccccccc Confidence 00000 00000 0000000000 0000111111 00000111111222223 Q ss_pred --hHHHHHHHhhCceeeeeeccccchhhhcccccCCCcceeeEEEeeeecceeeEEeecccCCceee-eeeeeeeeeEEE Q lcl|Aclame:pro 49 --GIPNYLTTYVDPAVIDILVAPMKAAELVGESKKGDWTTLVAAFITAEPTTKVATYGDYSSDGDSG-ANINYPQRQSYF 125 (336) Q Consensus 49 --~i~~~l~~~idp~v~~~~~~~~~~~~l~~v~t~g~w~~~t~~~~~~e~~G~a~~ygd~~diP~~~-~~~~~~~~~v~~ 125 (336) -+|..+. ++|++..........++.+.....-. ..+.+......+.+...+.+.++|-.+ .........++. T Consensus 130 g~~iP~~~~----~~ii~~~~~~~~l~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~v~E~~~~~~~~~~~~~~v~~~~~k 204 (415) T protein:vir:79 130 FVVIPEEIV----TDILKLKEVEFNLDKYVTVKRVTNGS-GKYPVVRQSEVAALEKVEELEENPELAVKPFFQLAYDINT 204 (415) T ss_pred ccccchHHH----HHHHHHHHhhhhhhhheeeeeccCCc-eeEEEEeecCCccceeeccccccCcccccceeeEEeeeee Confidence 3665443 45566666666666666554322101 122222333344556667778887544 567778888888 Q ss_pred EEEEEeeCHHHHHHHHhhCCCHHHHHHHHHHHHHHHhhcceEEeeccccce-EEEEecCCCCcccccccccccccCHHHH Q lcl|Aclame:pro 126 FQTWTRWGERELEMAGAGRVDLASELNYSSALGLAKFLNGSYLFGVAGLEN-YGLINDPSLSAPITATTPWSGSPAVEAV 204 (336) Q Consensus 126 ~~~~~~y~~~El~~A~~~g~~l~~~k~~aAr~a~e~~~n~~~~~Gd~~~g~-~GllN~Pnl~~~~~~~t~w~~~~t~~eI 204 (336) ++..+.+|.+=++ ....++.+.-......++.+.+|+-++.|+..... .++++....+ .+.+. .+.. - T Consensus 205 ~~~~~~iS~ell~---ds~~~l~~~i~~~l~~~~~~~~~~~il~g~g~g~~~~~~~~~~~~~--~~~~~--~~~~----~ 273 (415) T protein:vir:79 205 HRGYFRISREAIE---DAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGFEKEG--KKLEV--KKAK----S 273 (415) T ss_pred eEeeehhhHHHHh---hchHHHHHHHHHHHHHHHHHHHHHHHhhccccCccccccccccccc--ccccc--cccc----c Confidence 8888888765333 34557888888888888888888888887754322 2222211111 11111 1112 2 Q ss_pred HHHHHHHHHHHHHHhCCceecccccEEEecHHHHHhccc-CCCCCccHHH-HHHHhCC----ccEEEEcccccC-CCCce Q lcl|Aclame:pro 205 VNEVVALFQVLQTQSQGIITQEDVLRMGLPPTAMSDLSK-TNQYGLAAAA-KLKDIFP----KLEFVTIPEYDT-ASGRL 277 (336) Q Consensus 205 ~~Di~~l~~~l~~~s~g~v~~~~p~tL~Lp~~~~~~L~~-~~~~~~Tvl~-~l~~n~p----nl~i~~~pel~~-a~G~~ 277 (336) ++||.+++..+... + ..+..++|.++.+..|.+ .+..|.-++. =+....+ +..++..+..-. ++|.. T Consensus 274 ~~~i~~~~~~~~~~--~----~~~~~~v~n~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~ 347 (415) T protein:vir:79 274 LDDIKDAINLNVKP--N----YEHNVAIVSQTMFAKLDKMKDKLGNYLIQPDVKEKTQQRLLGAKIEILPDEVLGQKGNN 347 (415) T ss_pred hhHHHHHHHhhhhh--c----cCCCEEEEcHHHHHHHHHhhccCCceeeccCcCCCCCceecceeeEEecccccCCCCcc Confidence 56677777766432 1 136679999999888854 3333332221 0011111 122333333222 23333 Q ss_pred EEEEEEeec-----CCceEEEEcChhhhcccceecCCceEEccccceeeeeeecccceeeeccC Q lcl|Aclame:pro 278 VQLWAPRVE-----GKDTATCGFTEKMRAHSIERYSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) Q Consensus 278 ~~~~~~~~~-----~~~~~~~~~p~~~r~l~~~~~~~~~~vp~~~~t~Gv~ir~P~av~~~~GI 336 (336) ..+|.+-.+ ...-..+.. .+|... ....-+..|+ |+.+++|-||+.++-- T Consensus 348 ~~~~Gd~~~~~~~~~~~~~~v~~-~~~~~~-------~~~~~~~~r~-d~~v~~~~a~~~~~~~ 402 (415) T protein:vir:79 348 TLIIGNLKDAIVLFDRSQYQASW-TDYMHF-------GECLMIAVRQ-DCRILDYKSAIVIEYD 402 (415) T ss_pred EEEEEehhccEEEEeecceEEEE-eccccC-------ceEEEEEEEe-ccEEeccccEEEEEEe Confidence 333322100 000011111 111111 1112234555 5666779999988655 No 75 >protein:vir:80930 Length: 278 # NCBI annotation: Cps # Family: family:all:522 # MgeID: mge:1886 # MgeName: A500 # Cross-refs: genbank:acc:YP_001468392;genbank:gi:157324966;genbank:GeneID:5601363 Probab=96.04 E-value=0.0006 Score=38.25 Aligned_cols=262 Identities=7% Similarity=-0.025 Sum_probs=136.3 Q ss_pred HHHhhhhhcccccccCcchHHHHHHHhhCceeeeeeccccchhhhccccc--CCCcceeeEEEeeeecceeeEEeecccC Q lcl|Aclame:pro 31 YAMDAADLSPHLSSTGSSGIPNYLTTYVDPAVIDILVAPMKAAELVGESK--KGDWTTLVAAFITAEPTTKVATYGDYSS 108 (336) Q Consensus 31 ~a~da~d~~~~l~t~~~~~i~~~l~~~idp~v~~~~~~~~~~~~l~~v~t--~g~w~~~t~~~~~~e~~G~a~~ygd~~d 108 (336) ||- ..+.-++.-+|..+..||.-++ ...+....+..+.. +|.-+ .++.++.+...|.+..|.++++ T Consensus 1 Ma~-------~~T~~~~~iiPev~s~~v~~~~----~~~~v~~~~~~~~~~l~g~~G-~tv~ip~~~~~g~a~~~~~g~~ 68 (278) T protein:vir:80 1 MAD-------LTTKLANLIDPEVMGPMISAKL----PKAIKFGKIAPIDNSLEGQPG-SEITVPKYKYIGDAQDVAEGAA 68 (278) T ss_pred CCC-------cceehhheecHHHHHHHHHHHH----HHhhhhcccceecccccCCCC-CEEEEeeeccCCcceeecCCCc Confidence 221 1244556678888888885443 22333334443332 22223 6789999999999999999999 Q ss_pred CceeeeeeeeeeeeEEEEEEEEeeCHHHHHHHHhhCCCHHHHHHHHHHHHHHHhhcceEEeeccccceEEEEecCCCCcc Q lcl|Aclame:pro 109 DGDSGANINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASELNYSSALGLAKFLNGSYLFGVAGLENYGLINDPSLSAP 188 (336) Q Consensus 109 iP~~~~~~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~k~~aAr~a~e~~~n~~~~~Gd~~~g~~GllN~Pnl~~~ 188 (336) ++..+.+.......+.+.+-+++++ ++.+.+ .+.++..+-...+..++.+.+++..+-.-. |..+. + T Consensus 69 i~~~~lt~~~~~~~i~~~~~a~~v~--D~~~~~-~~~d~~~~~~~~~a~~~a~~~d~~l~~~l~-----~a~~~--~--- 135 (278) T protein:vir:80 69 IDYSALETESVKHGIKKAGKGVKLT--DESVLS-GYGDPVEEAQKQIRMAIASKVDNDILEEAL-----TTTLE--V--- 135 (278) T ss_pred CcccccccceeeEeeehhhcccccc--HHHHhh-ccccHHHHHHHHHHHHHHHHHHHHHHHHHh-----ccccc--c--- Confidence 9999998888888887776665554 444333 477788888888888888888875542111 21111 1 Q ss_pred cccccccccccCHHHHHHHHHHHHHHHHHHhCCceecccccEEEecHHHHHhcccCC--------CCCccHH-HHHHHhC Q lcl|Aclame:pro 189 ITATTPWSGSPAVEAVVNEVVALFQVLQTQSQGIITQEDVLRMGLPPTAMSDLSKTN--------QYGLAAA-AKLKDIF 259 (336) Q Consensus 189 ~~~~t~w~~~~t~~eI~~Di~~l~~~l~~~s~g~v~~~~p~tL~Lp~~~~~~L~~~~--------~~~~Tvl-~~l~~n~ 259 (336) +.+ ....+.+..++.+.++...+-.. + + ..+..|+++|..+..|.+-. .++..++ +-.--.| T Consensus 136 -~~~---~t~~~~~~~~~~~~da~~~l~~~--~-~--~~~~~ivv~p~~~~~L~k~~~~~~~~~~~~g~~~~~~G~ig~~ 206 (278) T protein:vir:80 136 -KGA---INIGLIDKIENTFTDAPDAIEDE--S-I--TTTGVLFLNYKDTAKLREEAAGSWTKASQLGDDLLVKGAFGEL 206 (278) T ss_pred -ccc---cccchhhhHHHHHHHHHHhhccc--C-C--CcccEEEECHHHHHHHHhhhhhhccccccccccceeeccceee Confidence 000 01122344455555554444221 1 1 23446999999988885321 1111110 0000112 Q ss_pred CccEEEEcccccCCCCceEEEEEEeecCCceEEEEcChh--hhcccceecCCceEEccccceeeeeeecccceeeeccC Q lcl|Aclame:pro 260 PKLEFVTIPEYDTASGRLVQLWAPRVEGKDTATCGFTEK--MRAHSIERYSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) Q Consensus 260 pnl~i~~~pel~~a~G~~~~~~~~~~~~~~~~~~~~p~~--~r~l~~~~~~~~~~vp~~~~t~Gv~ir~P~av~~~~GI 336 (336) -+++|.....+. -++.+++-.. -+......+ ....-.+.+.. -.+.. -...|+-+.+|-+++.+.-- T Consensus 207 ~G~~Vi~s~~~p---~~t~~l~~~g-----Ai~~~~~~~~~vE~~Rd~~~~~-d~i~~-~~~yg~~v~~~~~~v~it~~ 275 (278) T protein:vir:80 207 LGWEIVRTKKLA---DGNALAVKAG-----ALKTFLKRNLLAESGRDMDHKL-TKFNA-DQHYAVALVDETKAVKVVPV 275 (278) T ss_pred cceeEEEcCCCC---cceEEEEecc-----ceeeeecCCcccccccchhhcc-ceeee-eeEEEEEEEcCcceEEEeec Confidence 234554433332 1233333211 111111111 11111111111 11111 23468999999998888766 No 76 >protein:vir:4456 Length: 401 # NCBI annotation: Major capsid protein precursor # Family: family:all:21 # MgeID: mge:96 # MgeName: ST64B # Cross-refs: genbank:acc:NP_700379;genbank:gi:23505451;genbank:GeneID:955658 Probab=96.02 E-value=0.00094 Score=37.19 Aligned_cols=309 Identities=10% Similarity=0.053 Sum_probs=141.5 Q ss_pred CchHHH-H-------HHhhhcceeccch--hh-------hcc----chhHHHHHhhhhhcccccccCcc--hHHHHHHHh Q lcl|Aclame:pro 1 MRDAQR-I-------QNLARAGVILPRS--VQ-------NVS----TPLTEYAMDAADLSPHLSSTGSS--GIPNYLTTY 57 (336) Q Consensus 1 ~~~~~~-~-------~~l~~~g~~~~~~--~~-------~~~----~~~~~~a~da~d~~~~l~t~~~~--~i~~~l~~~ 57 (336) +...+. . ...++........ .. ++- .+.+.....+ . ...+.+.+ .||..+.+ T Consensus 51 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~a~~~~lr~~~~~~~~~~e~~a--~--~~~~~~~GG~~iP~~~~~- 125 (401) T protein:vir:44 51 LSELENLKSDLEKELLELKRPARGAQNKVAAEHKDAFVGFLRKGREDGLRDLERKA--L--QVGTDEDGGYAVPEELDR- 125 (401) T ss_pred HHHHHHHHHHHHHHHHHhhccccccccchhHHHHHHHHHHHhhhhhhhhHHHHHHH--h--hcCCCCCCceeccHhHHH- Confidence 100000 0 0011111100000 00 000 0000000000 0 01112222 35655443 Q ss_pred hCceeeeeeccccchhhhcccccCCCcceeeEEEeeeecceeeEEeecccCCceee-eeeeeeeeeEEEEEEEEeeCHHH Q lcl|Aclame:pro 58 VDPAVIDILVAPMKAAELVGESKKGDWTTLVAAFITAEPTTKVATYGDYSSDGDSG-ANINYPQRQSYFFQTWTRWGERE 136 (336) Q Consensus 58 idp~v~~~~~~~~~~~~l~~v~t~g~w~~~t~~~~~~e~~G~a~~ygd~~diP~~~-~~~~~~~~~v~~~~~~~~y~~~E 136 (336) +|++.+-..-....+..+.+.+. ....+++......+.+.+.....|-.+ .......-.++.++..+.+|.+= T Consensus 126 ---~ii~~~~~~~~l~~~~~~~~~~~---~~~~~~~~~~~~~a~wv~E~~~~~~~~~~~~~~v~~~~~k~~~~~~iS~el 199 (401) T protein:vir:44 126 ---SILSLLKDEVVMRQEATVITVGG---SDYKKLVNLGGTASGWVGETDTRSQTATSRLGLIEPFMGEIYGNPQATQKM 199 (401) T ss_pred ---HHHHHHHhhhhhhhhceeeecCC---CceEEEEecCCccceeeccccccCccccccceeeeeehhheeeehhhhHHH Confidence 44444433333444444332222 123455555555566667666676554 35666677777788878888765 Q ss_pred HHHHHhhCCCHHHHHHHHHHHHHHHhhcceEEeeccccceEEEEecCCCCccccccccccc------ccCHHHHHHHHHH Q lcl|Aclame:pro 137 LEMAGAGRVDLASELNYSSALGLAKFLNGSYLFGVAGLENYGLINDPSLSAPITATTPWSG------SPAVEAVVNEVVA 210 (336) Q Consensus 137 l~~A~~~g~~l~~~k~~aAr~a~e~~~n~~~~~Gd~~~g~~GllN~Pnl~~~~~~~t~w~~------~~t~~eI~~Di~~ 210 (336) +.. ...+|.+.-....+.++.+.++.-.++|++.....|+|+++.......+. .+.. .++..--++||.+ T Consensus 200 l~d---s~~~l~~~i~~~la~ai~~~~~~~~l~G~G~~~p~Gil~~~~~~~~~~~~-~~~~~~~~~t~~~~~~~~d~i~~ 275 (401) T protein:vir:44 200 LDD---AFFNVEAWINSELATEFAEQEEIAFTTGDGTKKPKGFLAYESTEESDKAR-AFGKLQHIVSGEATAVTADAIIK 275 (401) T ss_pred Hhc---chHHHHHHHHHHHHHHHHHHHHhhhhccCCCCccceeecccccccccccc-ccccccccccccccccCHHHHHH Confidence 543 35688888888999999999999999999988899999987765432211 1100 0111112677777 Q ss_pred HHHHHHHHhCCceecccccEEEecHHHHHhccc-CCCCCccHHH-HHHHhCC----ccEEE---EcccccCCCCceEEEE Q lcl|Aclame:pro 211 LFQVLQTQSQGIITQEDVLRMGLPPTAMSDLSK-TNQYGLAAAA-KLKDIFP----KLEFV---TIPEYDTASGRLVQLW 281 (336) Q Consensus 211 l~~~l~~~s~g~v~~~~p~tL~Lp~~~~~~L~~-~~~~~~Tvl~-~l~~n~p----nl~i~---~~pel~~a~G~~~~~~ 281 (336) ++..|...- ...-+++|.++.+..|.. .+..|.-++. =+...-| ++-++ .+|.. ++|....+| T Consensus 276 ~~~~l~~~~------~~~a~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~g~~~~l~G~PVv~~~~~p~~--~~~~~~i~~ 347 (401) T protein:vir:44 276 LIYTLRKAH------RTGAKFMMNNNSLFAIRLLKDTEGNYLWRPGLELGQPSSLAGYGIAENEQMPDI--AADAKAIAF 347 (401) T ss_pred HHHhcchhh------hcCCEEEEcHHHHHHHHHhhccCCceeecCCcCCCCCceecceeeEEecCcCCc--cCCccEEEE Confidence 777664321 123468999999888853 3333433321 0111111 11222 22322 223333333 Q ss_pred EEeecCCceEEEEcChhhhccc-ceecCCceEEccccceeeeeeecccceeeeccC Q lcl|Aclame:pro 282 APRVEGKDTATCGFTEKMRAHS-IERYSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) Q Consensus 282 ~~~~~~~~~~~~~~p~~~r~l~-~~~~~~~~~vp~~~~t~Gv~ir~P~av~~~~GI 336 (336) .+-. +. ..+.--+.++.+- .....-....-+..|.+|.++. |.||+.+..= T Consensus 348 Gd~~--~~-~~i~~~~~~~~~~~~~~~~~~v~~~a~~r~d~~~~~-~~a~~~l~~~ 399 (401) T protein:vir:44 348 GNFK--RG-YTIVDRIGTRILRDPYTNKPFVGFYTTKRTGGMLVD-SQAIKLLKIA 399 (401) T ss_pred eehh--cc-EEEEEecceEEeeeccccCCcEEEEEEEEeccEEec-ccceEEEEee Confidence 2110 00 0000001111110 0111223445566666665554 8888765544 No 77 >protein:vir:97433 Length: 274 # NCBI annotation: ORF014 # Family: family:all:522 # MgeID: mge:1676 # MgeName: 92 # Cross-refs: genbank:acc:YP_240749;genbank:gi:66396420;genbank:GeneID:5133789 Probab=95.96 E-value=0.00087 Score=37.35 Aligned_cols=255 Identities=9% Similarity=0.002 Sum_probs=131.4 Q ss_pred hhhhhcccccccCcchHHHHHHHhhCceeeeeeccccchhhhcccccC--CCcceeeEEEeeeecceeeEEeecccCCce Q lcl|Aclame:pro 34 DAADLSPHLSSTGSSGIPNYLTTYVDPAVIDILVAPMKAAELVGESKK--GDWTTLVAAFITAEPTTKVATYGDYSSDGD 111 (336) Q Consensus 34 da~d~~~~l~t~~~~~i~~~l~~~idp~v~~~~~~~~~~~~l~~v~t~--g~w~~~t~~~~~~e~~G~a~~ygd~~diP~ 111 (336) .|. ..+.-++--+|..+..||.-++ ...+....+..+... |..+ .+++++.+...|.+..|.++++++. T Consensus 1 ma~----~~T~~~d~iiPev~~~~v~~~~----~~~l~~~~~~~~d~~l~g~~G-~tv~iP~~~~~g~a~~~~~g~~i~~ 71 (274) T protein:vir:97 1 MPQ----GLTKTSDQIIPEVLAPMMQAQL----EKKLRFASFAEVDSTLQGQPG-DTLTFPAFVYSGDAQVVAEGEKIPT 71 (274) T ss_pred CCc----cceehhheechHHHHHHHHHhh----hhhhhhcccceecccccCCCC-CEEEEeeecCCCccccccCCCcccc Confidence 111 2345566678888888885443 344555566555432 3223 6899999999999999999999999 Q ss_pred eeeeeeeeeeeEEEEEEEEeeCHHHHHHHHhhCCCHHHHHHHHHHHHHHHhhcceEEeeccccceEEEEecCCCCccccc Q lcl|Aclame:pro 112 SGANINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASELNYSSALGLAKFLNGSYLFGVAGLENYGLINDPSLSAPITA 191 (336) Q Consensus 112 ~~~~~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~k~~aAr~a~e~~~n~~~~~Gd~~~g~~GllN~Pnl~~~~~~ 191 (336) -+.........+...+-+ |.+.++.+++..+ ++..+-...+.+++.+++++..+- .++.-.+ .. T Consensus 72 ~~lt~~~~~~~i~~~~~~--~~i~D~~~~~~~~-dp~~~~~~~~a~a~a~~vd~~~~~---------~l~~a~~----~~ 135 (274) T protein:vir:97 72 DILETKKREAKIRKIAKG--TSITDEALLSGYG-DPQGEQVRQHGLAHANKVDNDVLE---------ALMGAKL----TV 135 (274) T ss_pred cccccceeEEEeeeecce--ecccHHHHHhccc-hHHHHHHHHHHHHHHHHHHHHHHH---------HHhccCc----cc Confidence 999988888888776655 4555555555444 455566666667777777764331 1111000 01 Q ss_pred ccccccccCHHHHHHHHHHHHHHHHHHhCCceecccccEEEecHHHHHhcccCC--------CCCccHH-HHHHHhCCcc Q lcl|Aclame:pro 192 TTPWSGSPAVEAVVNEVVALFQVLQTQSQGIITQEDVLRMGLPPTAMSDLSKTN--------QYGLAAA-AKLKDIFPKL 262 (336) Q Consensus 192 ~t~w~~~~t~~eI~~Di~~l~~~l~~~s~g~v~~~~p~tL~Lp~~~~~~L~~~~--------~~~~Tvl-~~l~~n~pnl 262 (336) + .++.+ +++|.++...+-.. ...+..|+++|..+..|.+.+ .+|..++ .=.--.|-++ T Consensus 136 ~---~~~~~----~d~i~dA~~~l~d~------~~~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~~~~G~ig~~~G~ 202 (274) T protein:vir:97 136 N---ADITK----LNGLQSAIDKFNDE------DLEPMVLFVNPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEALGA 202 (274) T ss_pred c---ccccC----HHHHHHHHHHhhcc------CCCceEEEeCHHHHHHHHhhhhhhccccCcccccceeccccceecCe Confidence 1 11223 34455555554322 124678999999998886421 1121111 0000012244 Q ss_pred EEEEcccccCCCCceEEEEEEeecCCceEEEEcChhhh--cccceecCCceEEccccceeeeeeecccceeeeccC Q lcl|Aclame:pro 263 EFVTIPEYDTASGRLVQLWAPRVEGKDTATCGFTEKMR--AHSIERYSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) Q Consensus 263 ~i~~~pel~~a~G~~~~~~~~~~~~~~~~~~~~p~~~r--~l~~~~~~~~~~vp~~~~t~Gv~ir~P~av~~~~GI 336 (336) +|...+.+- -.+.+++- +.-+......+.+ ..-.+.+.. -.+- .-..+|+-+.+|-.++.+.-= T Consensus 203 ~Vi~s~~~p---~~t~~l~~-----~gA~~~~~~~~~~vE~~Rd~~~~~-d~i~-~~~~y~~~~~~~~~vv~~t~~ 268 (274) T protein:vir:97 203 IIVRTNKLE---AGTAILAK-----KGAVKLILKRDFFLEVARDASTKT-TALY-SDKHYVAYLYDESKAVKITKG 268 (274) T ss_pred eEEEcCCCC---cceEEEEe-----CcceEeeecCCceeccccchhhcc-cEEE-EEEEEEEEEEcCCceEEEecC Confidence 554333221 11222221 1111111111111 000011111 1111 112567788888777765543 No 78 >protein:vir:94494 Length: 274 # NCBI annotation: ORF015 # Family: family:all:522 # MgeID: mge:1508 # MgeName: 88 # Cross-refs: genbank:acc:YP_240676;genbank:gi:66396348;genbank:GeneID:5133758 Probab=95.96 E-value=0.00087 Score=37.35 Aligned_cols=255 Identities=9% Similarity=0.002 Sum_probs=131.4 Q ss_pred hhhhhcccccccCcchHHHHHHHhhCceeeeeeccccchhhhcccccC--CCcceeeEEEeeeecceeeEEeecccCCce Q lcl|Aclame:pro 34 DAADLSPHLSSTGSSGIPNYLTTYVDPAVIDILVAPMKAAELVGESKK--GDWTTLVAAFITAEPTTKVATYGDYSSDGD 111 (336) Q Consensus 34 da~d~~~~l~t~~~~~i~~~l~~~idp~v~~~~~~~~~~~~l~~v~t~--g~w~~~t~~~~~~e~~G~a~~ygd~~diP~ 111 (336) .|. ..+.-++--+|..+..||.-++ ...+....+..+... |..+ .+++++.+...|.+..|.++++++. T Consensus 1 ma~----~~T~~~d~iiPev~~~~v~~~~----~~~l~~~~~~~~d~~l~g~~G-~tv~iP~~~~~g~a~~~~~g~~i~~ 71 (274) T protein:vir:94 1 MPQ----GLTKTSDQIIPEVLAPMMQAQL----EKKLRFASFAEVDSTLQGQPG-DTLTFPAFVYSGDAQVVAEGEKIPT 71 (274) T ss_pred CCc----cceehhheechHHHHHHHHHhh----hhhhhhcccceecccccCCCC-CEEEEeeecCCCccccccCCCcccc Confidence 111 2345566678888888885443 344555566555432 3223 6899999999999999999999999 Q ss_pred eeeeeeeeeeeEEEEEEEEeeCHHHHHHHHhhCCCHHHHHHHHHHHHHHHhhcceEEeeccccceEEEEecCCCCccccc Q lcl|Aclame:pro 112 SGANINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASELNYSSALGLAKFLNGSYLFGVAGLENYGLINDPSLSAPITA 191 (336) Q Consensus 112 ~~~~~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~k~~aAr~a~e~~~n~~~~~Gd~~~g~~GllN~Pnl~~~~~~ 191 (336) -+.........+...+-+ |.+.++.+++..+ ++..+-...+.+++.+++++..+- .++.-.+ .. T Consensus 72 ~~lt~~~~~~~i~~~~~~--~~i~D~~~~~~~~-dp~~~~~~~~a~a~a~~vd~~~~~---------~l~~a~~----~~ 135 (274) T protein:vir:94 72 DILETKKREAKIRKIAKG--TSITDEALLSGYG-DPQGEQVRQHGLAHANKVDNDVLE---------ALMGAKL----TV 135 (274) T ss_pred cccccceeEEEeeeecce--ecccHHHHHhccc-hHHHHHHHHHHHHHHHHHHHHHHH---------HHhccCc----cc Confidence 999988888888776655 4555555555444 455566666667777777764331 1111000 01 Q ss_pred ccccccccCHHHHHHHHHHHHHHHHHHhCCceecccccEEEecHHHHHhcccCC--------CCCccHH-HHHHHhCCcc Q lcl|Aclame:pro 192 TTPWSGSPAVEAVVNEVVALFQVLQTQSQGIITQEDVLRMGLPPTAMSDLSKTN--------QYGLAAA-AKLKDIFPKL 262 (336) Q Consensus 192 ~t~w~~~~t~~eI~~Di~~l~~~l~~~s~g~v~~~~p~tL~Lp~~~~~~L~~~~--------~~~~Tvl-~~l~~n~pnl 262 (336) + .++.+ +++|.++...+-.. ...+..|+++|..+..|.+.+ .+|..++ .=.--.|-++ T Consensus 136 ~---~~~~~----~d~i~dA~~~l~d~------~~~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~~~~G~ig~~~G~ 202 (274) T protein:vir:94 136 N---ADITK----LNGLQSAIDKFNDE------DLEPMVLFVNPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEALGA 202 (274) T ss_pred c---ccccC----HHHHHHHHHHhhcc------CCCceEEEeCHHHHHHHHhhhhhhccccCcccccceeccccceecCe Confidence 1 11223 34455555554322 124678999999998886421 1121111 0000012244 Q ss_pred EEEEcccccCCCCceEEEEEEeecCCceEEEEcChhhh--cccceecCCceEEccccceeeeeeecccceeeeccC Q lcl|Aclame:pro 263 EFVTIPEYDTASGRLVQLWAPRVEGKDTATCGFTEKMR--AHSIERYSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) Q Consensus 263 ~i~~~pel~~a~G~~~~~~~~~~~~~~~~~~~~p~~~r--~l~~~~~~~~~~vp~~~~t~Gv~ir~P~av~~~~GI 336 (336) +|...+.+- -.+.+++- +.-+......+.+ ..-.+.+.. -.+- .-..+|+-+.+|-.++.+.-= T Consensus 203 ~Vi~s~~~p---~~t~~l~~-----~gA~~~~~~~~~~vE~~Rd~~~~~-d~i~-~~~~y~~~~~~~~~vv~~t~~ 268 (274) T protein:vir:94 203 IIVRTNKLE---AGTAILAK-----KGAVKLILKRDFFLEVARDASTKT-TALY-SDKHYVAYLYDESKAVKITKG 268 (274) T ss_pred eEEEcCCCC---cceEEEEe-----CcceEeeecCCceeccccchhhcc-cEEE-EEEEEEEEEEcCCceEEEecC Confidence 554333221 11222221 1111111111111 000011111 1111 112567788888777765543 No 79 >protein:vir:6212 Length: 434 # NCBI annotation: prohead protease # Family: family:all:21 # MgeID: mge:128 # MgeName: phBC6A52 # Cross-refs: genbank:acc:NP_852592;genbank:gi:31415852;genbank:GeneID:1489210 Probab=95.89 E-value=0.00059 Score=38.28 Aligned_cols=308 Identities=10% Similarity=0.024 Sum_probs=143.9 Q ss_pred Cch----------HH-HHHHhhhcceeccchhhhccchhHHH-----H--Hhhhhhccccc-ccCcch--HHHHHHHhhC Q lcl|Aclame:pro 1 MRD----------AQ-RIQNLARAGVILPRSVQNVSTPLTEY-----A--MDAADLSPHLS-STGSSG--IPNYLTTYVD 59 (336) Q Consensus 1 ~~~----------~~-~~~~l~~~g~~~~~~~~~~~~~~~~~-----a--~da~d~~~~l~-t~~~~~--i~~~l~~~id 59 (336) .+. .. ........+............+.+.. . .+.. .+..++ +++++| ||..+.+ T Consensus 85 ~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~e~r~a~~~~l~~~~~~~-e~~a~~~~t~~GG~lvP~~~~~--- 160 (434) T protein:vir:62 85 KENPNEKTELSEEQRSAISASIAAALSTKGHRTNKETEIRSVFANYIVGNIDEK-EARALGLVTGNGSVTIPDFLSK--- 160 (434) T ss_pred hcchhhhHHHHHHHHHHHHHHHHhhhhhccccchHHHHHHHHHHHHhccccchh-hhhhhcccccccceecchhhHH--- Confidence 000 00 00000000111000000000000000 0 0000 001111 122333 6766554 Q ss_pred ceeeeeeccccchhhhcccccCCCcceeeEEEeeeecceeeEEe---ecccCCceeeeeeeeeeeeEEEEEEEEeeCHHH Q lcl|Aclame:pro 60 PAVIDILVAPMKAAELVGESKKGDWTTLVAAFITAEPTTKVATY---GDYSSDGDSGANINYPQRQSYFFQTWTRWGERE 136 (336) Q Consensus 60 p~v~~~~~~~~~~~~l~~v~t~g~w~~~t~~~~~~e~~G~a~~y---gd~~diP~~~~~~~~~~~~v~~~~~~~~y~~~E 136 (336) +|++.+.+......+..+...+ ..+.|++....+.+... +..++.|..+.......-.++.++..+.+|.+= T Consensus 161 -~Ii~~l~~~~~i~~~~~~~~~~----~~~~~p~~~~~~~a~~~~~~~e~~~~~~~~~~f~~v~~~~~k~~~~~~iS~el 235 (434) T protein:vir:62 161 -EIITYAQEENFLRRLGTGVKTK----ENIKYPVLVKKAEAQGHKNERTNNEMPETDIEFDEIELSPTEFDALATVTKKL 235 (434) T ss_pred -HHHHhhhhhhhhhhhcceeccC----CceEEEEEecCCcccceecccccccccccccceeeEEeeheeeEeehhhHHHH Confidence 3344333333333443332111 23567776666666544 235677888888888888888888888888664 Q ss_pred HHHHHhhCCCHHHHHHHHHHHHHHHhhcceEEeeccccc-eEEEEecCCCCcccccccccccccCHHHHHHHHHHHHHHH Q lcl|Aclame:pro 137 LEMAGAGRVDLASELNYSSALGLAKFLNGSYLFGVAGLE-NYGLINDPSLSAPITATTPWSGSPAVEAVVNEVVALFQVL 215 (336) Q Consensus 137 l~~A~~~g~~l~~~k~~aAr~a~e~~~n~~~~~Gd~~~g-~~GllN~Pnl~~~~~~~t~w~~~~t~~eI~~Di~~l~~~l 215 (336) |.- ..++|.+.-....+.++.+.+++-.+.|++..+ .-|+++.+.+.. ++ +....++||.+++..+ T Consensus 236 l~d---s~~~l~~~i~~~la~~~~~~~d~~~l~G~G~~~~~~g~~~~~~~~~----~~------~~~~~~d~l~~l~~~l 302 (434) T protein:vir:62 236 LAR---TGLPIEQIVMDELKKAYVRKETQYMVNGDEANNINDGALAKKAVEF----KT------DEKNLYDALVKMKNTP 302 (434) T ss_pred Hhc---chHHHHHHHHHHHHHHHHHHHHHHHhccCCCCccccceeecccccc----cc------cccchhhHHHHHHhhc Confidence 443 356788888888899999999999999997544 447776544421 11 1112456777777766 Q ss_pred HHHhCCceecccccEEEecHHHHHhccc-CCCCCccHHH-HHHHh------CCccEEEEcccccC-CCCceEEEEEEeec Q lcl|Aclame:pro 216 QTQSQGIITQEDVLRMGLPPTAMSDLSK-TNQYGLAAAA-KLKDI------FPKLEFVTIPEYDT-ASGRLVQLWAPRVE 286 (336) Q Consensus 216 ~~~s~g~v~~~~p~tL~Lp~~~~~~L~~-~~~~~~Tvl~-~l~~n------~pnl~i~~~pel~~-a~G~~~~~~~~~~~ 286 (336) ...-. ..-.++|.+..+..|.+ .+..|.-++. ...-+ .-+..++....+.. ++|+...+++-... T Consensus 303 ~~~~~------~~a~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~~~g~~~tl~G~pV~~~~~~~~~~~~~~~~i~~Gdfs 376 (434) T protein:vir:62 303 VKEVR------KKARWVLNTAALTKIETMKTDDGFPLLRPFNQAEGGIGYTLLGFPVEEEDAIDIPDSPDTPVFYFGDFS 376 (434) T ss_pred chhhh------cCCEEEEcHHHHHHHHHhhccCCCEeeccCCCccCCCCceecceeeEEecCccCccCCCceEEEEeecc Confidence 43211 12257888988888854 3333332321 01000 11223333333322 23333322221110 Q ss_pred CCceEEEEcChhhhcccc-eecCCceEEccccceeeeeeecccceeeeccC Q lcl|Aclame:pro 287 GKDTATCGFTEKMRAHSI-ERYSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) Q Consensus 287 ~~~~~~~~~p~~~r~l~~-~~~~~~~~vp~~~~t~Gv~ir~P~av~~~~GI 336 (336) .--.+...=++.+..+.. .......-.-+..|..|-.||.|.+++-..+. T Consensus 377 ~~~i~~~~g~~~i~~~~~~~~~~~~v~~~~~~r~Dgk~i~~~~~~~~~~~~ 427 (434) T protein:vir:62 377 KFYIQDVIGSLEVQKLVELFSRTNRVGFRIWNLLDAQLIHSPFEVPVYKYV 427 (434) T ss_pred ceEEEEeeceeEEEeehhhhcccCceEEEEEeeecceeecCcccceEEEEE Confidence 000000000111222111 11233445667788889999999998877555 No 80 >protein:vir:6242 Length: 390 # NCBI annotation: gp36 # Family: family:all:21 # MgeID: mge:131 # MgeName: phi-BT1 # Cross-refs: genbank:acc:NP_813696;swissprot:trembl:q859c1;genbank:gi:29366756;interpro:IPR006444;uniprot:Q859C1;genbank:GeneID:1258897 Probab=95.74 E-value=0.0014 Score=36.28 Aligned_cols=301 Identities=10% Similarity=0.053 Sum_probs=138.5 Q ss_pred Cc----hHHHHH----Hhh-hcceeccchhhh-------c----cchhHHHHHhhhhhcccccccCcch--HHHHHHHhh Q lcl|Aclame:pro 1 MR----DAQRIQ----NLA-RAGVILPRSVQN-------V----STPLTEYAMDAADLSPHLSSTGSSG--IPNYLTTYV 58 (336) Q Consensus 1 ~~----~~~~~~----~l~-~~g~~~~~~~~~-------~----~~~~~~~a~da~d~~~~l~t~~~~~--i~~~l~~~i 58 (336) ++ ..+... .+. ..+......... + ....+.... +.... ..+++.+++ +|.+....| T Consensus 54 i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~r~~~~-~~~~~-~~t~~~~g~~~~~~~~~~~i 131 (390) T protein:vir:62 54 IKRGIEAIKAIDPVTSLLSGLQGSGSGAQRSADVDDDATLRAGNLGEARSFEF-APEKR-DGTKAGNPNVLSRTLYGQLI 131 (390) T ss_pred HHHHHHHHHHHHHHHHHHhhcccccccchhhcchHHHHHHhhhhhhhhHHHHh-hhhhh-cccccCCCccccccchHHHH Confidence 00 000000 000 011111111000 0 000011101 11111 122222322 222222222 Q ss_pred Cceeeeeeccccc-hhhhcccccCCCcceeeEEEeeeecceeeEEeecccCCceeeeeeeeeeeeEEEEEEEEeeCHHHH Q lcl|Aclame:pro 59 DPAVIDILVAPMK-AAELVGESKKGDWTTLVAAFITAEPTTKVATYGDYSSDGDSGANINYPQRQSYFFQTWTRWGEREL 137 (336) Q Consensus 59 dp~v~~~~~~~~~-~~~l~~v~t~g~w~~~t~~~~~~e~~G~a~~ygd~~diP~~~~~~~~~~~~v~~~~~~~~y~~~El 137 (336) . ++.+ .+..++ ....++..+. ..+.+++....+.+...+-...+|-.+......+-.+..++..+.+|.+=| T Consensus 132 ~-~~~~-~~~~l~~~~~~~~~~~~-----~~~~~p~~~~~~~a~wv~E~~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell 204 (390) T protein:vir:62 132 A-QAVE-RSAIMRGGATTFTTSDA-----NPLDFTVITGRSSASIVGETAEIPESYPATAQRSMGGFKYGFASVVSYEFA 204 (390) T ss_pred H-HHHh-hhhhhhhcceeeecCCC-----ceeEEEEEcCCcceeeecccccccccccceeeeEeeeeeEEeehHHHHHHH Confidence 1 1111 122221 2233332221 235677888888888888888999999888888888999998888886655 Q ss_pred HHHHhhCCCHHHHHHHHHHHHHHHhhcceEEeeccccceEEEEecCCCCcccccccccccccCHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 138 EMAGAGRVDLASELNYSSALGLAKFLNGSYLFGVAGLENYGLINDPSLSAPITATTPWSGSPAVEAVVNEVVALFQVLQT 217 (336) Q Consensus 138 ~~A~~~g~~l~~~k~~aAr~a~e~~~n~~~~~Gd~~~g~~GllN~Pnl~~~~~~~t~w~~~~t~~eI~~Di~~l~~~l~~ 217 (336) +. ..+++.+.-....+.++.+.+++-.++|++. -.|++|+++...... .....+..+ ++||.+++..|.. T Consensus 205 ~d---s~~~l~~~i~~~l~~~i~~~~d~~~l~G~G~--p~Gi~~~~~~~~~~~-~~~~~~~~~----~~~l~~~~~~l~~ 274 (390) T protein:vir:62 205 TD---QVLDLVGFLVSDAGPAIGDAMGRHFITGTGQ--PRGILTDASPATATF-LATDTDSKV----SDALIDLFHEVPS 274 (390) T ss_pred hh---hhHHHHHHHHHHHHHHHHHHHHhhhhccCCc--cccccccccccccce-ecccccccc----hHHHHHHHHhhhh Confidence 54 4557888888888899999999989999874 369999876542221 111222333 4556666665533 Q ss_pred HhCCceecccccEEEecHHHHHhccc-CCCCCccHH-HHHHHhCCccEEEEcccccC----CC----CceEEEEEEeecC Q lcl|Aclame:pro 218 QSQGIITQEDVLRMGLPPTAMSDLSK-TNQYGLAAA-AKLKDIFPKLEFVTIPEYDT----AS----GRLVQLWAPRVEG 287 (336) Q Consensus 218 ~s~g~v~~~~p~tL~Lp~~~~~~L~~-~~~~~~Tvl-~~l~~n~pnl~i~~~pel~~----a~----G~~~~~~~~~~~~ 287 (336) ... .--..+|.++.+..|.+ .+..|.=++ .=+...-|. ++-..|-... ++ |.-.+.++....+ T Consensus 275 ~~~------~~a~~vmn~~~~~~L~~lkd~~g~~l~~~~~~~g~~~-~l~G~Pv~~~~~~p~~~i~~gd~s~~~i~~~~~ 347 (390) T protein:vir:62 275 AYR------ANAKYVVNDLRAAQMRKLKDANGQYLWQSGLTVGAPS-LFNGKVVETDDGMPADKILFADLSKYRVRFAGS 347 (390) T ss_pred hhh------cCCEEEEchHHHHHHHHhhccCCCeeecCCcCCCccc-eecccceEEecCCCCccEEEeeccceeEEeecc Confidence 211 11258899998888853 222222111 001101110 1111111111 11 2211111111111 Q ss_pred CceEEEEcChhhhcccceecCCceEEccccceeeeeeecccceeeeccC Q lcl|Aclame:pro 288 KDTATCGFTEKMRAHSIERYSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) Q Consensus 288 ~~~~~~~~p~~~r~l~~~~~~~~~~vp~~~~t~Gv~ir~P~av~~~~GI 336 (336) ..+.. ... +....-...+-+..|.+| .+..|.||+.+..= T Consensus 348 ~~v~~--~~~------~~~~~~~~~~~~~~r~d~-~~~~~~A~~~l~~~ 387 (390) T protein:vir:62 348 LRVDR--SVD------AKFSTDQIVYRFLQRADG-LLVDARGAKVLTVT 387 (390) T ss_pred eEEEe--ecc------ccccCCcEEEEEEEEeCc-EeechhheEEEEee Confidence 11000 011 111122344556666665 68899998887755 No 81 >protein:vir:100247 Length: 425 # NCBI annotation: gp76 # Family: family:all:21 # MgeID: mge:1619 # MgeName: Bcep176 # Cross-refs: genbank:acc:YP_355412;genbank:gi:77864702;genbank:GeneID:3725969 Probab=95.73 E-value=0.0016 Score=35.98 Aligned_cols=313 Identities=14% Similarity=0.069 Sum_probs=146.2 Q ss_pred Cc-------hHHHHHHhhhcceeccchhhhccchhHHHHHhh----hhhcccc--cccCcc--hHHHHHHHhhCceeeee Q lcl|Aclame:pro 1 MR-------DAQRIQNLARAGVILPRSVQNVSTPLTEYAMDA----ADLSPHL--SSTGSS--GIPNYLTTYVDPAVIDI 65 (336) Q Consensus 1 ~~-------~~~~~~~l~~~g~~~~~~~~~~~~~~~~~a~da----~d~~~~l--~t~~~~--~i~~~l~~~idp~v~~~ 65 (336) +. .......-.+.+-. ....+...-...+.+. .+....+ .+.+++ -+|..+. ++|++. T Consensus 81 i~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~af~~~l~~~e~~~al~~~t~~~gG~lvP~~~~----~~ii~~ 153 (425) T protein:vir:10 81 LEALQAAVDEANIKIAAAQMGAN---GVKPLRDPEYTEAFKAHVKRGDVQAALNKGEDSEGGYLTPIEWD----RTITNK 153 (425) T ss_pred HHHHHHHHHHHHHHHHhhhcccc---cccccccHHHHHHHHHHhhhhhhHHHhhcCcCCCCceeccHhHH----HHHHHH Confidence 00 00000000000000 0000000000001100 0111111 223333 3565444 344555 Q ss_pred eccccchhhhcccccCCCcceeeEEEeeeecceeeEEeecccCCceeee-eeeeeeeeEEEEEEEEeeCHHHHHHHHhhC Q lcl|Aclame:pro 66 LVAPMKAAELVGESKKGDWTTLVAAFITAEPTTKVATYGDYSSDGDSGA-NINYPQRQSYFFQTWTRWGERELEMAGAGR 144 (336) Q Consensus 66 ~~~~~~~~~l~~v~t~g~w~~~t~~~~~~e~~G~a~~ygd~~diP~~~~-~~~~~~~~v~~~~~~~~y~~~El~~A~~~g 144 (336) +...-....+..+.+... ....+++....+.+...|.+..+|-.+. ......-..+.++..+.+|.+=++ ... T Consensus 154 ~~~~s~l~~l~~~~~~~~---~~~~~~~~~~~~~a~wv~E~~~~~~~~~~~f~~v~~~~~k~~~~i~iS~ell~---ds~ 227 (425) T protein:vir:10 154 LVLISPMRQLCRVQPVSK---AGFSKLFNMGGTTSGWVGEASQRPQTNAATFQPLSFASGEIYANPAATQQILD---DAE 227 (425) T ss_pred HHhhhhhhhhceeeeccC---CceEEEEEcCCcceeeeccccccccccccccceeeeeheeeEeehHhHHHHHh---cch Confidence 544444555555433222 1235555555567777788888887664 567777788888888888765443 345 Q ss_pred CCHHHHHHHHHHHHHHHhhcceEEeeccccceEEEEecCCCCcccccc--cccc---cccCHHHHHHHHHHHHHHHHHHh Q lcl|Aclame:pro 145 VDLASELNYSSALGLAKFLNGSYLFGVAGLENYGLINDPSLSAPITAT--TPWS---GSPAVEAVVNEVVALFQVLQTQS 219 (336) Q Consensus 145 ~~l~~~k~~aAr~a~e~~~n~~~~~Gd~~~g~~GllN~Pnl~~~~~~~--t~w~---~~~t~~eI~~Di~~l~~~l~~~s 219 (336) .++.+.-......++.+.+|.-+++|++.....|++|++...+..... +.+. ...+...-++||.+++..|...- T Consensus 228 ~~l~~~i~~~la~ai~~~~d~~~l~G~G~~~p~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~l~~l~~~l~~~~ 307 (425) T protein:vir:10 228 IDLESWLATEVQTEFAKQEGKAFLAGDGTNKPNGLLTYIAGGANAAKHPFGAIEVVNSGAAADITSDGIIDLVYDLPSAF 307 (425) T ss_pred hHHHHHHHHHHHHHHHHHHHhhhhcccCCCCcceeeeccccccccccccccccccccccccccccHHHHHHHHhhhhhhh Confidence 789999999999999999999999999988899999987654322111 1110 01122334677777777664321 Q ss_pred CCceecccccEEEecHHHHHhccc-CCCCCccHHH-HHHHhCC----ccEEEEcccccC-CCCceEEEEEEeecCCceEE Q lcl|Aclame:pro 220 QGIITQEDVLRMGLPPTAMSDLSK-TNQYGLAAAA-KLKDIFP----KLEFVTIPEYDT-ASGRLVQLWAPRVEGKDTAT 292 (336) Q Consensus 220 ~g~v~~~~p~tL~Lp~~~~~~L~~-~~~~~~Tvl~-~l~~n~p----nl~i~~~pel~~-a~G~~~~~~~~~~~~~~~~~ 292 (336) ...-+++|.++.+..|.+ .+..|.-++. =+..-.| +..++....+.. +.|....+|.+-. +-. . T Consensus 308 ------~~~a~~vmn~~~~~~L~~lkD~~G~~l~~~~~~~g~~~~l~G~PV~~~~~~p~~~~~~~~i~~Gd~~--~~~-~ 378 (425) T protein:vir:10 308 ------TGNARFAMNRNTQRQVRKLKDGQGNYLWQPSYVAGQPATLAGYPVTEVPDMPDVAANSTPILFGDFQ--QTY-L 378 (425) T ss_pred ------ccCCEEEEchHHHHHHHHhhcCCCceeeccCccCCCCceecceeeEEecCcCCccCCccEEEEEehh--ccE-E Confidence 123468999999888853 3333332221 0010011 122333222222 2233333332210 000 0 Q ss_pred EEcChhhhccc-ceecCCceEEccccceeeeeeecccceeeeccC Q lcl|Aclame:pro 293 CGFTEKMRAHS-IERYSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) Q Consensus 293 ~~~p~~~r~l~-~~~~~~~~~vp~~~~t~Gv~ir~P~av~~~~GI 336 (336) +.--.-++.+- .....-...+-...|.+|.+ +.|.||+.+..= T Consensus 379 i~~~~~~~v~~d~~~~~~~~~~~~~~r~d~~v-~~~~A~~~l~~~ 422 (425) T protein:vir:10 379 IIDRIGVRVLRDPYTAKPYVLFYTTKRVGGGL-LNPEPMRAMKVA 422 (425) T ss_pred EEEecceEEEecccccCCcEEEEEEEEeccEe-ecccceEEEEee Confidence 00001111110 00111223444556655554 459988665433 No 82 >protein:vir:96262 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1612 # MgeName: ROSA # Cross-refs: genbank:acc:YP_240311;genbank:gi:66395978;genbank:GeneID:5133339 Probab=95.30 E-value=0.0023 Score=35.00 Aligned_cols=256 Identities=9% Similarity=0.015 Sum_probs=129.8 Q ss_pred hhhhhcccccccCcchHHHHHHHhhCceeeeeeccccchhhhccccc--CCCcceeeEEEeeeecceeeEEeecccCCce Q lcl|Aclame:pro 34 DAADLSPHLSSTGSSGIPNYLTTYVDPAVIDILVAPMKAAELVGESK--KGDWTTLVAAFITAEPTTKVATYGDYSSDGD 111 (336) Q Consensus 34 da~d~~~~l~t~~~~~i~~~l~~~idp~v~~~~~~~~~~~~l~~v~t--~g~w~~~t~~~~~~e~~G~a~~ygd~~diP~ 111 (336) .|+ +.+.-++--+|..+..||.-++ ...+....|..+.. +|.-+ .++.++.+...|.+..|.++++++. T Consensus 1 m~~----~~T~l~d~i~Pev~~~~v~~~~----~~~l~~~~~~~~~~~l~g~~G-~tv~iP~~~~ig~a~~~~~g~~i~~ 71 (274) T protein:vir:96 1 MAQ----GMTKLTNQIVPEVLAPMMQAEL----EKKLRFASFAEIDNTLVGQPG-DTLTFPAFIYSGDAKVVAEGEKIPT 71 (274) T ss_pred CCc----ceeehhheechHHHHHHHHHHH----HhhhhccccceecccccCCCC-CEEEeeeecCCCccccccCCCccch Confidence 111 2355566667888888875443 44444455544432 23333 7899999999999999999999999 Q ss_pred eeeeeeeeeeeEEEEEEEEeeCHHHHHHHHhhCCCHHHHHHHHHHHHHHHhhcceEEeeccccceEEEEecCCCCccccc Q lcl|Aclame:pro 112 SGANINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASELNYSSALGLAKFLNGSYLFGVAGLENYGLINDPSLSAPITA 191 (336) Q Consensus 112 ~~~~~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~k~~aAr~a~e~~~n~~~~~Gd~~~g~~GllN~Pnl~~~~~~ 191 (336) -..........+...+-++.++ ++.+.+. +-++..+-...+..++.+.+++..+ ..++.-... + T Consensus 72 ~~lt~~~~~~~i~~~~~a~~i~--D~~~~~~-~~d~~~~~~~~~~~~~a~~vd~~i~---------~~l~~a~~~--~-- 135 (274) T protein:vir:96 72 DILETKKREAKIRKIAKGTSIS--DEALLSG-YGDPQGEQVRQHGLAHANKVDDDVL---------EALKSAKLT--V-- 135 (274) T ss_pred hhcccceeEEEeeeeecceeeh--HHHHhhc-cchHHHHHHHHHHHHHHHHHHHHHH---------HHHhccccc--c-- Confidence 8998888888888766665555 5555544 3455556666666777766665322 111100000 0 Q ss_pred ccccccccCHHHHHHHHHHHHHHHHHHhCCceecccccEEEecHHHHHhcccCC--------CCCccHHH-HHHHhCCcc Q lcl|Aclame:pro 192 TTPWSGSPAVEAVVNEVVALFQVLQTQSQGIITQEDVLRMGLPPTAMSDLSKTN--------QYGLAAAA-KLKDIFPKL 262 (336) Q Consensus 192 ~t~w~~~~t~~eI~~Di~~l~~~l~~~s~g~v~~~~p~tL~Lp~~~~~~L~~~~--------~~~~Tvl~-~l~~n~pnl 262 (336) + .++.+ ++.|.++...+-.. ...+..|+++|..+..|.+-. +.+..++- =.--.|-++ T Consensus 136 ~---~~~~~----~d~i~~A~~~lgd~------~~~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~~~~G~ig~~~G~ 202 (274) T protein:vir:96 136 E---ADITK----LTGLQTAIDKFNDE------DLEPMVLFISPLDAGKLRGDATTNFTRATELGDDVIVKGAFGEALGA 202 (274) T ss_pred c---ccccC----HHHHHHHHHHhccc------cccccEEEeCHHHHHHHHhhccccccccccccccceeccccceecCe Confidence 0 01223 44455555544221 125778999999999886421 11111100 000012234 Q ss_pred EEEEcccccCCCCceEEEEEEeecCCceEEEEcChhhhcccceecCCce-EEccccceeeeeeecccceeeeccC Q lcl|Aclame:pro 263 EFVTIPEYDTASGRLVQLWAPRVEGKDTATCGFTEKMRAHSIERYSSYF-RQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) Q Consensus 263 ~i~~~pel~~a~G~~~~~~~~~~~~~~~~~~~~p~~~r~l~~~~~~~~~-~vp~~~~t~Gv~ir~P~av~~~~GI 336 (336) +|+....+. -.+.+++- +.-+......+.+. ......++. ..=..-..+|+-+.+|-.++.+.-= T Consensus 203 ~Vi~s~~~~---~~t~~l~~-----~gA~~~~~~~~~~v-E~~Rd~~~~~d~i~~~~~y~~~~~~~~~~v~~tk~ 268 (274) T protein:vir:96 203 VIVRSNKLE---AGTAILAK-----KGAVKLITKRDFFL-ETDRDPSTKTTALYSDKHYVAYLYDESKAVKITKG 268 (274) T ss_pred EEEEeCCCC---CceEEEEe-----ccceeeeecCCccc-ccccccccccCEEEEeEEEEEEEEcCCcEEEEEcC Confidence 544333221 11222221 11111111111110 011111110 1111224678888899877776633 No 83 >protein:vir:95898 Length: 274 # NCBI annotation: ORF014 # Family: family:all:522 # MgeID: mge:1588 # MgeName: 71 # Cross-refs: genbank:acc:YP_240385;genbank:gi:66396054;genbank:GeneID:5133409 Probab=95.30 E-value=0.0023 Score=35.00 Aligned_cols=256 Identities=9% Similarity=0.015 Sum_probs=129.8 Q ss_pred hhhhhcccccccCcchHHHHHHHhhCceeeeeeccccchhhhccccc--CCCcceeeEEEeeeecceeeEEeecccCCce Q lcl|Aclame:pro 34 DAADLSPHLSSTGSSGIPNYLTTYVDPAVIDILVAPMKAAELVGESK--KGDWTTLVAAFITAEPTTKVATYGDYSSDGD 111 (336) Q Consensus 34 da~d~~~~l~t~~~~~i~~~l~~~idp~v~~~~~~~~~~~~l~~v~t--~g~w~~~t~~~~~~e~~G~a~~ygd~~diP~ 111 (336) .|+ +.+.-++--+|..+..||.-++ ...+....|..+.. +|.-+ .++.++.+...|.+..|.++++++. T Consensus 1 m~~----~~T~l~d~i~Pev~~~~v~~~~----~~~l~~~~~~~~~~~l~g~~G-~tv~iP~~~~ig~a~~~~~g~~i~~ 71 (274) T protein:vir:95 1 MAQ----GMTKLTNQIVPEVLAPMMQAEL----EKKLRFASFAEIDNTLVGQPG-DTLTFPAFIYSGDAKVVAEGEKIPT 71 (274) T ss_pred CCc----ceeehhheechHHHHHHHHHHH----HhhhhccccceecccccCCCC-CEEEeeeecCCCccccccCCCccch Confidence 111 2355566667888888875443 44444455544432 23333 7899999999999999999999999 Q ss_pred eeeeeeeeeeeEEEEEEEEeeCHHHHHHHHhhCCCHHHHHHHHHHHHHHHhhcceEEeeccccceEEEEecCCCCccccc Q lcl|Aclame:pro 112 SGANINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASELNYSSALGLAKFLNGSYLFGVAGLENYGLINDPSLSAPITA 191 (336) Q Consensus 112 ~~~~~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~k~~aAr~a~e~~~n~~~~~Gd~~~g~~GllN~Pnl~~~~~~ 191 (336) -..........+...+-++.++ ++.+.+. +-++..+-...+..++.+.+++..+ ..++.-... + T Consensus 72 ~~lt~~~~~~~i~~~~~a~~i~--D~~~~~~-~~d~~~~~~~~~~~~~a~~vd~~i~---------~~l~~a~~~--~-- 135 (274) T protein:vir:95 72 DILETKKREAKIRKIAKGTSIS--DEALLSG-YGDPQGEQVRQHGLAHANKVDDDVL---------EALKSAKLT--V-- 135 (274) T ss_pred hhcccceeEEEeeeeecceeeh--HHHHhhc-cchHHHHHHHHHHHHHHHHHHHHHH---------HHHhccccc--c-- Confidence 8998888888888766665555 5555544 3455556666666777766665322 111100000 0 Q ss_pred ccccccccCHHHHHHHHHHHHHHHHHHhCCceecccccEEEecHHHHHhcccCC--------CCCccHHH-HHHHhCCcc Q lcl|Aclame:pro 192 TTPWSGSPAVEAVVNEVVALFQVLQTQSQGIITQEDVLRMGLPPTAMSDLSKTN--------QYGLAAAA-KLKDIFPKL 262 (336) Q Consensus 192 ~t~w~~~~t~~eI~~Di~~l~~~l~~~s~g~v~~~~p~tL~Lp~~~~~~L~~~~--------~~~~Tvl~-~l~~n~pnl 262 (336) + .++.+ ++.|.++...+-.. ...+..|+++|..+..|.+-. +.+..++- =.--.|-++ T Consensus 136 ~---~~~~~----~d~i~~A~~~lgd~------~~~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~~~~G~ig~~~G~ 202 (274) T protein:vir:95 136 E---ADITK----LTGLQTAIDKFNDE------DLEPMVLFISPLDAGKLRGDATTNFTRATELGDDVIVKGAFGEALGA 202 (274) T ss_pred c---ccccC----HHHHHHHHHHhccc------cccccEEEeCHHHHHHHHhhccccccccccccccceeccccceecCe Confidence 0 01223 44455555544221 125778999999999886421 11111100 000012234 Q ss_pred EEEEcccccCCCCceEEEEEEeecCCceEEEEcChhhhcccceecCCce-EEccccceeeeeeecccceeeeccC Q lcl|Aclame:pro 263 EFVTIPEYDTASGRLVQLWAPRVEGKDTATCGFTEKMRAHSIERYSSYF-RQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) Q Consensus 263 ~i~~~pel~~a~G~~~~~~~~~~~~~~~~~~~~p~~~r~l~~~~~~~~~-~vp~~~~t~Gv~ir~P~av~~~~GI 336 (336) +|+....+. -.+.+++- +.-+......+.+. ......++. ..=..-..+|+-+.+|-.++.+.-= T Consensus 203 ~Vi~s~~~~---~~t~~l~~-----~gA~~~~~~~~~~v-E~~Rd~~~~~d~i~~~~~y~~~~~~~~~~v~~tk~ 268 (274) T protein:vir:95 203 VIVRSNKLE---AGTAILAK-----KGAVKLITKRDFFL-ETDRDPSTKTTALYSDKHYVAYLYDESKAVKITKG 268 (274) T ss_pred EEEEeCCCC---CceEEEEe-----ccceeeeecCCccc-ccccccccccCEEEEeEEEEEEEEcCCcEEEEEcC Confidence 544333221 11222221 11111111111110 011111110 1111224678888899877776633 No 84 >protein:vir:4092 Length: 390 # NCBI annotation: major capsid protein a # Family: family:all:635 # MgeID: mge:86 # MgeName: 2389 # Cross-refs: genbank:acc:NP_510986;swissprot:trembl:q8w604;genbank:gi:17488508;uniprot:Q8W604;genbank:GeneID:1260361 Probab=95.27 E-value=0.0024 Score=34.95 Aligned_cols=301 Identities=11% Similarity=0.001 Sum_probs=135.3 Q ss_pred CchHHHHHHhhhcceeccchhhhccchhHHHHHhhhhhcccccccCcchHHHHHHHhhCceeeeeeccccchhhhccccc Q lcl|Aclame:pro 1 MRDAQRIQNLARAGVILPRSVQNVSTPLTEYAMDAADLSPHLSSTGSSGIPNYLTTYVDPAVIDILVAPMKAAELVGESK 80 (336) Q Consensus 1 ~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~a~da~d~~~~l~t~~~~~i~~~l~~~idp~v~~~~~~~~~~~~l~~v~t 80 (336) .+...+...+.+.|. ..++.+.++... ++-.. +-.+.....+|..+.+- |++.+...-....++.+.+ T Consensus 55 ~~~~~~~~~~~~~~~------~~l~~~~r~~~~-~~~~~-~~~~~gg~lvP~~~~~~----I~~~~~~~s~i~~~~~~~~ 122 (390) T protein:vir:40 55 NREMNDNNVLASRGA------NALTSDESKYYN-EVIAG-NGFAGVTALLPPTVFER----VFEDLTVEHPLLSKINFVN 122 (390) T ss_pred HHHHHHHHHHHhcCc------hhccHHHHHHHH-HHHhc-cCcccCcccccHHHHHH----HHHHHHhhhhhhhhceeee Confidence 000000111111111 122222222111 11111 11222333467666543 3333333323334444433 Q ss_pred CCCcceeeEEEeeeecceeeEEeecccCCc-eeeeeeeeeeeeEEEEEEEEeeCHHHHHHHHhhCCCHHHHHHHHHHHHH Q lcl|Aclame:pro 81 KGDWTTLVAAFITAEPTTKVATYGDYSSDG-DSGANINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASELNYSSALGL 159 (336) Q Consensus 81 ~g~w~~~t~~~~~~e~~G~a~~ygd~~diP-~~~~~~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~k~~aAr~a~ 159 (336) .+. ....++.....+.+...+....+| ..+.......-..+.+...+.+|.+=++.+ ..++.+.-....++++ T Consensus 123 ~~~---~~~~i~~~~~~~~a~~~~E~~~~~~~~~~~f~~i~l~~~k~~~~i~iS~ell~ds---~~~l~~~i~~~la~~i 196 (390) T protein:vir:40 123 TTA---TTEWIISVGDVATAWWGPLCAEIKEVLDNGFDKIQTGMYKLSAYIPVCNAMLDLG---PSWLDQYVRTILGEAM 196 (390) T ss_pred cCC---ceeEEEEEcCCcceeeeccccccCccccccceeeEeeeeeEEEeehhhHHHHhcc---hHHHHHHHHHHHHHHH Confidence 222 223456666677777777666664 456667777777888888888885555533 4578888889999999 Q ss_pred HHhhcceEEeeccccceEEEEecCCCCcccccccccccccCHHHHHHHHHHHHHHHHHHhCCceecccccEEEecHH-HH Q lcl|Aclame:pro 160 AKFLNGSYLFGVAGLENYGLINDPSLSAPITATTPWSGSPAVEAVVNEVVALFQVLQTQSQGIITQEDVLRMGLPPT-AM 238 (336) Q Consensus 160 e~~~n~~~~~Gd~~~g~~GllN~Pnl~~~~~~~t~w~~~~t~~eI~~Di~~l~~~l~~~s~g~v~~~~p~tL~Lp~~-~~ 238 (336) ...+|+-+++|++...-.|++|++.........+....+.|.+.+.+.+..+...+.....- ...--.++|-++ .+ T Consensus 197 ~~~~~~a~l~G~G~~~P~Gil~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~l~~~~~~~~~~---~~~~a~~i~n~~t~~ 273 (390) T protein:vir:40 197 ALGLEAGIVNGSGKDQPIGMMRDLNNVTAGEHPVKTATPLTDLTPATLATKVMLPLTDNGKK---SVSDAILVINPADYW 273 (390) T ss_pred HHHHHhhhhcccCCCccceeeeccccccccccccccccccchhhHHHHHHHHHHHhhcchhh---hhcCceEEEcchhHH Confidence 99999999999988788899997653211111111122223333444333343333222210 011223455543 34 Q ss_pred Hhcc---c-CCCCCccHHHHHHHhCCccEEEEcccccCC---CCceEEEEEEeecCCceEEEEcChhhhccccee--cCC Q lcl|Aclame:pro 239 SDLS---K-TNQYGLAAAAKLKDIFPKLEFVTIPEYDTA---SGRLVQLWAPRVEGKDTATCGFTEKMRAHSIER--YSS 309 (336) Q Consensus 239 ~~L~---~-~~~~~~Tvl~~l~~n~pnl~i~~~pel~~a---~G~~~~~~~~~~~~~~~~~~~~p~~~r~l~~~~--~~~ 309 (336) .+|. . .+..|.-++..+ .-++.++..+..... -|.-.+.++..+.+ .++ ...+ +. ..- T Consensus 274 ~~l~~~~~~~d~~G~~v~~~~---~~g~pvv~~~~~p~~~i~~Gd~s~~~i~~~~~---~~v------~~~~-~~~f~~~ 340 (390) T protein:vir:40 274 SKIYAATSYMTPQGVWVTGIL---PVPLEIVQSVAVPVGKAVAGRAKDYFMGIGSE---QVI------RTST-EYRLLDD 340 (390) T ss_pred HHHHHHhhccCCCCccccccC---CCceeEEEcCCCCCCcEEEEeeceEEEEeecc---eEE------Eecc-hhhhhcC Confidence 4442 1 122232222111 012344433222210 12222222211111 111 1111 11 112 Q ss_pred ceEEccccceeeeeeeccccee--eeccC Q lcl|Aclame:pro 310 YFRQKKSAGTWGAVIFRPFAVA--QMIGV 336 (336) Q Consensus 310 ~~~vp~~~~t~Gv~ir~P~av~--~~~GI 336 (336) ....-...|.+|.. +.|.||+ .+.++ T Consensus 341 ~~~~r~~~r~dg~v-~~~~A~~~l~~~~~ 368 (390) T protein:vir:40 341 ETLYYAKQYANGRP-KDNSSFLVFDITGL 368 (390) T ss_pred cEEEEEEEEeCCEE-ecccceEEEEeecc Confidence 34455566666654 4588888 44555 No 85 >protein:vir:105334 Length: 276 # NCBI annotation: putative phage major capsid protein # Family: family:all:522 # MgeID: mge:1679 # MgeName: PH15 # Cross-refs: genbank:acc:YP_950669;genbank:gi:119967839;genbank:GeneID:4643213 Probab=95.27 E-value=0.0019 Score=35.53 Aligned_cols=252 Identities=9% Similarity=0.031 Sum_probs=129.4 Q ss_pred hhhhhcccccccCcchHHHHHHHhhCceeeeeeccccchhhhcccccC--CCcceeeEEEeeeecceeeEEeecccCCce Q lcl|Aclame:pro 34 DAADLSPHLSSTGSSGIPNYLTTYVDPAVIDILVAPMKAAELVGESKK--GDWTTLVAAFITAEPTTKVATYGDYSSDGD 111 (336) Q Consensus 34 da~d~~~~l~t~~~~~i~~~l~~~idp~v~~~~~~~~~~~~l~~v~t~--g~w~~~t~~~~~~e~~G~a~~ygd~~diP~ 111 (336) .|. ..++-++--+|..+..||.-++ -..+....|..+.+. |.. -.+++++.++..|.+..+++++++|. T Consensus 1 Ma~----~~T~l~d~i~Pev~~~~v~~~~----~~~~~~~~~~~~~~~l~g~~-G~ti~iP~~~~igda~~~~eg~~i~~ 71 (276) T protein:vir:10 1 MAQ----GTTTKSTQIVPEVLAPMMQAEL----DKKLRFAQFADIDSTLVGQP-GDTLTFPAFVYSGDATVVPEGQKIPV 71 (276) T ss_pred CCc----ceeehhhhhchHHHHHHHHHHH----HhhhhhcccceecccccCCC-CCEEEeeeecCCCccccccCCCccCc Confidence 111 1244556667888888884443 333444555554432 222 36799999999999999999999999 Q ss_pred eeeeeeeeeeeEEEEEEEEeeCHHHHHHHHhhCCCHHHHHHHHHHHHHHHhhcceEEeeccccceEEEEecCCCCccccc Q lcl|Aclame:pro 112 SGANINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASELNYSSALGLAKFLNGSYLFGVAGLENYGLINDPSLSAPITA 191 (336) Q Consensus 112 ~~~~~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~k~~aAr~a~e~~~n~~~~~Gd~~~g~~GllN~Pnl~~~~~~ 191 (336) ...........+.+.+-++.++.++ ..+. +.+.-.+-......++.+.+++..+ ..++. +..+. T Consensus 72 ~~lt~~~~~a~i~~~~k~~~~tD~a--~~~~-~~dp~~~~~~~~~~~~a~~~d~~~~---------~~l~~----~~~~~ 135 (276) T protein:vir:10 72 DKIETNRREAKIHKIGKGTDITDEA--LLSG-YGDPQGEAVRQHGLAIANKVDNDVL---------EALRG----TKLTV 135 (276) T ss_pred cccccceeeEEeehccccccccHHH--HHhh-ccchHHHHHHHHHHHHHHHHHHHHH---------HHHhc----ccccc Confidence 9999999999998876666665444 3333 4455555566666666666664322 11110 00000 Q ss_pred ccccccccCHHHHHHHHHHHHHHHHHHhCCceecccccEEEecHHHHHhcccC--------CCCCccHHHHHH----HhC Q lcl|Aclame:pro 192 TTPWSGSPAVEAVVNEVVALFQVLQTQSQGIITQEDVLRMGLPPTAMSDLSKT--------NQYGLAAAAKLK----DIF 259 (336) Q Consensus 192 ~t~w~~~~t~~eI~~Di~~l~~~l~~~s~g~v~~~~p~tL~Lp~~~~~~L~~~--------~~~~~Tvl~~l~----~n~ 259 (336) + .++.+. +.|.+++..+-.. ...+..|++.|..+..|.+- +..|.. .+. ..| T Consensus 136 ~---~~~~t~----d~i~~A~~~lgd~------~~~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~---~~~~G~ig~~ 199 (276) T protein:vir:10 136 S---ADIGTL----AGLEAAIDTFDDE------DLEPMVLFINPKDAGKLRSSASDNFTRATELGDN---IIVKGAFGEA 199 (276) T ss_pred c---ccccCH----HHHHHHHHHhccc------cCcccEEEEcHHHHHHHHHhcccccccccccccc---ceecccccee Confidence 1 112333 4444455444222 12467899999998888532 111111 111 012 Q ss_pred CccEEEEcccccCCCCceEEEEEEeecCCceEEEEcChhhh--cccceecCCceEEccccceeeeeeecccceeeeccC Q lcl|Aclame:pro 260 PKLEFVTIPEYDTASGRLVQLWAPRVEGKDTATCGFTEKMR--AHSIERYSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) Q Consensus 260 pnl~i~~~pel~~a~G~~~~~~~~~~~~~~~~~~~~p~~~r--~l~~~~~~~~~~vp~~~~t~Gv~ir~P~av~~~~GI 336 (336) -+++|+....+. -.+.+++- +.-+.+....+.. ....+.+.. -.+ ..-..+|+-+.+|-.++.+.=- T Consensus 200 ~G~~Vi~s~~~p---~~t~~l~~-----~gAi~~~~~~~~~vE~dRd~~~~~-d~i-~~~~~y~~~~~~~~~vv~~t~~ 268 (276) T protein:vir:10 200 LGAVIVRSKKLD---EGEAILAK-----RGAVKLITKRDFFLETDRDPSTKT-TAL-YSDKHYVAYLYDESKAVKVTKG 268 (276) T ss_pred cceeEEEcCCCC---cceEEEEe-----ccceeeeecCCceeecccchhhcc-cEE-EEeeEEEEEEEcCcceEEEecC Confidence 245555444332 12222221 1111111111111 000011111 111 1223568888888877776633 No 86 >protein:vir:1239 Length: 274 # NCBI annotation: similar to phage B1 major head protein # Family: family:all:522 # MgeID: mge:25 # MgeName: phi ETA # Cross-refs: genbank:acc:NP_510938;genbank:gi:17426272;genbank:GeneID:927376 Probab=95.20 E-value=0.0025 Score=34.81 Aligned_cols=255 Identities=10% Similarity=0.017 Sum_probs=126.2 Q ss_pred hhhhhcccccccCcchHHHHHHHhhCceeeeeeccccchhhhcccccC--CCcceeeEEEeeeecceeeEEeecccCCce Q lcl|Aclame:pro 34 DAADLSPHLSSTGSSGIPNYLTTYVDPAVIDILVAPMKAAELVGESKK--GDWTTLVAAFITAEPTTKVATYGDYSSDGD 111 (336) Q Consensus 34 da~d~~~~l~t~~~~~i~~~l~~~idp~v~~~~~~~~~~~~l~~v~t~--g~w~~~t~~~~~~e~~G~a~~ygd~~diP~ 111 (336) .|+ +.+.-++--+|..+..||..++ ...+....|..+... |.. -.+++++.+...|.+..|.++++++. T Consensus 1 ma~----~~T~l~d~iiPev~~~~v~~~~----~~~l~~~~~~~~d~~l~g~~-G~tv~iP~~~~ig~a~~~~~g~~i~~ 71 (274) T protein:vir:12 1 MAQ----GLTKTSNQIIPEVLAPMMQAQL----EKKLRFASFAEVDSTLQGQP-GDTLTFPAFVYSGDAQVVAEGEKIPT 71 (274) T ss_pred CCc----ceeehhhhhchHHHHHHHHHHH----HhhhhhcccceecccccCCC-CCEEEEeeecCCCccccccCCCccch Confidence 111 1345566778899998885553 444555555555433 222 36899999999999999999999999 Q ss_pred eeeeeeeeeeeEEEEEEEEeeCHHHHHHHHhhCCCHHHHHHHHHHHHHHHhhcceEEeeccccceEEEEecCCCCccccc Q lcl|Aclame:pro 112 SGANINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASELNYSSALGLAKFLNGSYLFGVAGLENYGLINDPSLSAPITA 191 (336) Q Consensus 112 ~~~~~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~k~~aAr~a~e~~~n~~~~~Gd~~~g~~GllN~Pnl~~~~~~ 191 (336) .+.........+...+-+++++ ++.+++..+ ++..+-...+..++.+.+++-.+ . .++. +..+. T Consensus 72 ~~lt~~~~~~~i~~~~~~~~i~--D~~~~~~~~-d~~~~~~~q~~~~~a~~vd~~~l-~--------~~~~----a~~~~ 135 (274) T protein:vir:12 72 DILETKKREAKIRKIAKGTSIT--DEALLSGYG-DPQGEQVRQHGLAHANKVDNDVL-E--------ALMG----AKLTV 135 (274) T ss_pred hhcccceeeEEeeeecceeeec--HHHHHhccc-chHHHHHHHHHHHHHHHHHHHHH-H--------HHhc----ccccc Confidence 9999999888888876555554 455555443 44555556666666666665322 0 0110 00000 Q ss_pred ccccccccCHHHHHHHHHHHHHHHHHHhCCceecccccEEEecHHHHHhcccCC--------CCCccHHH-HHHHhCCcc Q lcl|Aclame:pro 192 TTPWSGSPAVEAVVNEVVALFQVLQTQSQGIITQEDVLRMGLPPTAMSDLSKTN--------QYGLAAAA-KLKDIFPKL 262 (336) Q Consensus 192 ~t~w~~~~t~~eI~~Di~~l~~~l~~~s~g~v~~~~p~tL~Lp~~~~~~L~~~~--------~~~~Tvl~-~l~~n~pnl 262 (336) + .++.+ ++.|.+++..+-.. ...+..|+++|..+..|.+-. +++..++- =.-..|-++ T Consensus 136 ~---~~a~~----~d~i~dA~~~lgd~------~~~~~~ivv~p~~~~~L~k~~~~~fv~~s~~g~~~~~~G~ig~~~G~ 202 (274) T protein:vir:12 136 N---ADITK----LNGLQSAIDKFNDE------DLEPMVLFINPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEALGA 202 (274) T ss_pred c---ccccC----HHHHHHHHHHhccc------cccccEEEeCHHHHHHHHhhhhhhccccccccccceecccceeecCe Confidence 1 11223 33444454444221 125678999999998886421 11211100 000012344 Q ss_pred EEEEcccccCCCCceEEEEEEeecCCceEEEEcChhhh--cccceecCCceEEccccceeeeeeecccceeeeccC Q lcl|Aclame:pro 263 EFVTIPEYDTASGRLVQLWAPRVEGKDTATCGFTEKMR--AHSIERYSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) Q Consensus 263 ~i~~~pel~~a~G~~~~~~~~~~~~~~~~~~~~p~~~r--~l~~~~~~~~~~vp~~~~t~Gv~ir~P~av~~~~GI 336 (336) +|+....+. -.+.+++- +.-+......+.+ ..-.+.+... .=..-..+||-+.+|-.++.+..= T Consensus 203 ~Vi~s~~~p---~~t~~l~~-----~gA~~~~~~~~~~vE~~Rd~~~~~d--~i~~~~~y~~~~~~~~~vv~~t~~ 268 (274) T protein:vir:12 203 IIVRSNKLE---AGTAILAK-----KGAVKLILKRDFFLEVARDASTKTT--ALYSDKHYVAYLYDESKAVKITKG 268 (274) T ss_pred eEEEeCCCC---cceEEEEe-----ccceeeeecCCceeccccchhhccc--EEEeeeEEEEEEEcCCceEEEEcC Confidence 544322221 11122221 1111111111100 0000011111 111123456666666666665544 No 87 >protein:vir:3033 Length: 272 # NCBI annotation: major capsid protein # Family: family:all:522 # MgeID: mge:61 # MgeName: PhiNIH1.1 # Cross-refs: genbank:acc:NP_438146;genbank:gi:16271809;genbank:GeneID:929235 Probab=95.19 E-value=0.0026 Score=34.78 Aligned_cols=251 Identities=8% Similarity=0.045 Sum_probs=132.6 Q ss_pred HHHhhhhhcccccccCcchHHHHHHHhhCceeeeeeccccchhhhccccc--CCCcceeeEEEeeeecceeeEEeecccC Q lcl|Aclame:pro 31 YAMDAADLSPHLSSTGSSGIPNYLTTYVDPAVIDILVAPMKAAELVGESK--KGDWTTLVAAFITAEPTTKVATYGDYSS 108 (336) Q Consensus 31 ~a~da~d~~~~l~t~~~~~i~~~l~~~idp~v~~~~~~~~~~~~l~~v~t--~g~w~~~t~~~~~~e~~G~a~~ygd~~d 108 (336) ||-- -++.++.-+|..+..+|.-++ ...+....+.-+.. .|.-+ .++.++.++..|.+..|+.+++ T Consensus 1 MA~~-------~T~~~~~~iPev~s~~v~~~~----~~~~~~~~~~~~~~~~~g~~G-~tv~iP~~~~~~~a~~v~eg~~ 68 (272) T protein:vir:30 1 MAVG-------TTKMAQMLDPEVLADMIDAEV----GKAIRFAPLAEVDTTLEGQPG-TTLTVPKWDYIGDAEDVAEGEA 68 (272) T ss_pred CCCc-------cccchheechHHHHHHHHHHH----HHHhhhhccccccccccCCCC-CEEEEEEecCCCCcccccCCCc Confidence 2211 134455667888887774332 22222233333222 12222 4788999999999999999999 Q ss_pred CceeeeeeeeeeeeEEEEEEEEeeCHHHHHHHHhhCCCHHHHHHHHHHHHHHHhhcceEEeeccccceEEEEecCCCCcc Q lcl|Aclame:pro 109 DGDSGANINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASELNYSSALGLAKFLNGSYLFGVAGLENYGLINDPSLSAP 188 (336) Q Consensus 109 iP~~~~~~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~k~~aAr~a~e~~~n~~~~~Gd~~~g~~GllN~Pnl~~~ 188 (336) +|..+.........+.+++..+.++.++...+ ..++.+.-...+.+++.+.+++..+ + .++ .+. T Consensus 69 i~~~~~~~~~~~~~~~~~~~~~~itd~~~~~s---~~d~~~~~~~~~~~~~a~~~d~~i~-~--------~~~----~a~ 132 (272) T protein:vir:30 69 IPMTQLGFKKTTMTIKKAGKGVEITDEAILSG---YGDPVGQAAKQIVEAIDHKVDADVL-D--------ALS----KST 132 (272) T ss_pred ccccccccceEEEEeeeeeeeeeecHHHHhhc---cccHHHHHHHHHHHHHHHHHHHHHH-H--------Hhc----ccc Confidence 99999999999999999988888887765443 4567777777777777777775432 1 111 001 Q ss_pred cccccccccccCHHHHHHHHHHHHHHHHHHhCCceecccccEEEecHHHHHhcccC--------CCCCccHHHHHH---- Q lcl|Aclame:pro 189 ITATTPWSGSPAVEAVVNEVVALFQVLQTQSQGIITQEDVLRMGLPPTAMSDLSKT--------NQYGLAAAAKLK---- 256 (336) Q Consensus 189 ~~~~t~w~~~~t~~eI~~Di~~l~~~l~~~s~g~v~~~~p~tL~Lp~~~~~~L~~~--------~~~~~Tvl~~l~---- 256 (336) .... ++.| +++|.+++..+-.. + ..+..+++.|..+..|.+. ++++. ..+. T Consensus 133 ~~~~----~~~t----~d~i~da~~~l~~~--~----~~~~~~vv~p~~~~~L~k~~~~~~~~~~~~~~---~~~~~g~i 195 (272) T protein:vir:30 133 QTVE----ATAT----VDGVSKALDIFNDE--D----DAETVIVMNPADASTLRLDAAKEWLGATEVGA---NRVVSGVY 195 (272) T ss_pred cccc----cccC----HHHHHHHHHHHhcc--C----CCccEEEEcHHHHHHHHHhccccccccccccc---cccccccc Confidence 1111 1123 45566666555322 1 2467899999998877432 11111 1111 Q ss_pred HhCCccEEEEcccccCCCCceEEEEEEeecCCceEEEEcChhhhcccceecCC--ceEEccccceeeeeeecccceeeec Q lcl|Aclame:pro 257 DIFPKLEFVTIPEYDTASGRLVQLWAPRVEGKDTATCGFTEKMRAHSIERYSS--YFRQKKSAGTWGAVIFRPFAVAQMI 334 (336) Q Consensus 257 ~n~pnl~i~~~pel~~a~G~~~~~~~~~~~~~~~~~~~~p~~~r~l~~~~~~~--~~~vp~~~~t~Gv~ir~P~av~~~~ 334 (336) .++-+++++..+-+. -++.+++- +.-+.+..-...+ ........ ...+-.. +..|+-+.+|.+++... T Consensus 196 g~i~G~~Vi~s~~~p---~~t~~~~~-----~~a~~~~~~~~~~-ve~~r~~~~~~~~i~~~-~~~~~~v~~~~~vv~~t 265 (272) T protein:vir:30 196 GEVLGVQIVRSRKCP---KGTAYMVR-----KGALRIMLKRNTM-VETDRDITKAINQIVAN-KHYGVYLYKAEKAVKIT 265 (272) T ss_pred hhhcCeeEEEcCCCC---cceEEEEc-----CCeEEEEecCCce-eeeccccccceeEEEEE-EEEEEEEEcCCceEEEE Confidence 122345554444332 12222221 1111111111111 01111111 1222222 34567888998888775 Q ss_pred cC Q lcl|Aclame:pro 335 GV 336 (336) Q Consensus 335 GI 336 (336) -= T Consensus 266 ~~ 267 (272) T protein:vir:30 266 LK 267 (272) T ss_pred ec Confidence 44 No 88 >protein:vir:9820 Length: 272 # NCBI annotation: putative major capsid/head protein # Family: family:all:522 # MgeID: mge:176 # MgeName: 315.4 # Cross-refs: genbank:acc:NP_795582;genbank:gi:28876339;genbank:GeneID:1257858 Probab=95.19 E-value=0.0026 Score=34.78 Aligned_cols=251 Identities=8% Similarity=0.045 Sum_probs=132.6 Q ss_pred HHHhhhhhcccccccCcchHHHHHHHhhCceeeeeeccccchhhhccccc--CCCcceeeEEEeeeecceeeEEeecccC Q lcl|Aclame:pro 31 YAMDAADLSPHLSSTGSSGIPNYLTTYVDPAVIDILVAPMKAAELVGESK--KGDWTTLVAAFITAEPTTKVATYGDYSS 108 (336) Q Consensus 31 ~a~da~d~~~~l~t~~~~~i~~~l~~~idp~v~~~~~~~~~~~~l~~v~t--~g~w~~~t~~~~~~e~~G~a~~ygd~~d 108 (336) ||-- -++.++.-+|..+..+|.-++ ...+....+.-+.. .|.-+ .++.++.++..|.+..|+.+++ T Consensus 1 MA~~-------~T~~~~~~iPev~s~~v~~~~----~~~~~~~~~~~~~~~~~g~~G-~tv~iP~~~~~~~a~~v~eg~~ 68 (272) T protein:vir:98 1 MAVG-------TTKMAQMLDPEVLADMIDAEV----GKAIRFAPLAEVDTTLEGQPG-TTLTVPKWDYIGDAEDVAEGEA 68 (272) T ss_pred CCCc-------cccchheechHHHHHHHHHHH----HHHhhhhccccccccccCCCC-CEEEEEEecCCCCcccccCCCc Confidence 2211 134455667888887774332 22222233333222 12222 4788999999999999999999 Q ss_pred CceeeeeeeeeeeeEEEEEEEEeeCHHHHHHHHhhCCCHHHHHHHHHHHHHHHhhcceEEeeccccceEEEEecCCCCcc Q lcl|Aclame:pro 109 DGDSGANINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASELNYSSALGLAKFLNGSYLFGVAGLENYGLINDPSLSAP 188 (336) Q Consensus 109 iP~~~~~~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~k~~aAr~a~e~~~n~~~~~Gd~~~g~~GllN~Pnl~~~ 188 (336) +|..+.........+.+++..+.++.++...+ ..++.+.-...+.+++.+.+++..+ + .++ .+. T Consensus 69 i~~~~~~~~~~~~~~~~~~~~~~itd~~~~~s---~~d~~~~~~~~~~~~~a~~~d~~i~-~--------~~~----~a~ 132 (272) T protein:vir:98 69 IPMTQLGFKKTTMTIKKAGKGVEITDEAILSG---YGDPVGQAAKQIVEAIDHKVDADVL-D--------ALS----KST 132 (272) T ss_pred ccccccccceEEEEeeeeeeeeeecHHHHhhc---cccHHHHHHHHHHHHHHHHHHHHHH-H--------Hhc----ccc Confidence 99999999999999999988888887765443 4567777777777777777775432 1 111 001 Q ss_pred cccccccccccCHHHHHHHHHHHHHHHHHHhCCceecccccEEEecHHHHHhcccC--------CCCCccHHHHHH---- Q lcl|Aclame:pro 189 ITATTPWSGSPAVEAVVNEVVALFQVLQTQSQGIITQEDVLRMGLPPTAMSDLSKT--------NQYGLAAAAKLK---- 256 (336) Q Consensus 189 ~~~~t~w~~~~t~~eI~~Di~~l~~~l~~~s~g~v~~~~p~tL~Lp~~~~~~L~~~--------~~~~~Tvl~~l~---- 256 (336) .... ++.| +++|.+++..+-.. + ..+..+++.|..+..|.+. ++++. ..+. T Consensus 133 ~~~~----~~~t----~d~i~da~~~l~~~--~----~~~~~~vv~p~~~~~L~k~~~~~~~~~~~~~~---~~~~~g~i 195 (272) T protein:vir:98 133 QTVE----ATAT----VDGVSKALDIFNDE--D----DAETVIVMNPADASTLRLDAAKEWLGATEVGA---NRVVSGVY 195 (272) T ss_pred cccc----cccC----HHHHHHHHHHHhcc--C----CCccEEEEcHHHHHHHHHhccccccccccccc---cccccccc Confidence 1111 1123 45566666555322 1 2467899999998877432 11111 1111 Q ss_pred HhCCccEEEEcccccCCCCceEEEEEEeecCCceEEEEcChhhhcccceecCC--ceEEccccceeeeeeecccceeeec Q lcl|Aclame:pro 257 DIFPKLEFVTIPEYDTASGRLVQLWAPRVEGKDTATCGFTEKMRAHSIERYSS--YFRQKKSAGTWGAVIFRPFAVAQMI 334 (336) Q Consensus 257 ~n~pnl~i~~~pel~~a~G~~~~~~~~~~~~~~~~~~~~p~~~r~l~~~~~~~--~~~vp~~~~t~Gv~ir~P~av~~~~ 334 (336) .++-+++++..+-+. -++.+++- +.-+.+..-...+ ........ ...+-.. +..|+-+.+|.+++... T Consensus 196 g~i~G~~Vi~s~~~p---~~t~~~~~-----~~a~~~~~~~~~~-ve~~r~~~~~~~~i~~~-~~~~~~v~~~~~vv~~t 265 (272) T protein:vir:98 196 GEVLGVQIVRSRKCP---KGTAYMVR-----KGALRIMLKRNTM-VETDRDITKAINQIVAN-KHYGVYLYKAEKAVKIT 265 (272) T ss_pred hhhcCeeEEEcCCCC---cceEEEEc-----CCeEEEEecCCce-eeeccccccceeEEEEE-EEEEEEEEcCCceEEEE Confidence 122345554444332 12222221 1111111111111 01111111 1222222 34567888998888775 Q ss_pred cC Q lcl|Aclame:pro 335 GV 336 (336) Q Consensus 335 GI 336 (336) -= T Consensus 266 ~~ 267 (272) T protein:vir:98 266 LK 267 (272) T ss_pred ec Confidence 44 No 89 >protein:vir:4856 Length: 293 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:106 # MgeName: DT1 # Cross-refs: genbank:acc:NP_049396;genbank:gi:9632424;genbank:GeneID:1258532 Probab=95.07 E-value=0.0027 Score=34.70 Aligned_cols=258 Identities=10% Similarity=-0.044 Sum_probs=126.5 Q ss_pred HHhhhhhcccccccCcc--hHHHHHHHhhCceeeeeeccccchhhh---cccccCCCcceeeEEEeeee-cceeeEEeec Q lcl|Aclame:pro 32 AMDAADLSPHLSSTGSS--GIPNYLTTYVDPAVIDILVAPMKAAEL---VGESKKGDWTTLVAAFITAE-PTTKVATYGD 105 (336) Q Consensus 32 a~da~d~~~~l~t~~~~--~i~~~l~~~idp~v~~~~~~~~~~~~l---~~v~t~g~w~~~t~~~~~~e-~~G~a~~ygd 105 (336) .+.++ ..++++++ -+|..+.+ +|++.+........+ +|+++.. ....+...+ ..+.+...+. T Consensus 1 ~l~~~----~~~t~~~gg~liP~~~~~----~Ii~~~~~~~~l~~~~~~~~~~~~~----g~~~~~~~~~~~~~a~~v~E 68 (293) T protein:vir:48 1 MLDSK----TDHSGSDAGLTIPQDIRT----AINTLVRQYDSLQEYVNVENVTTLT----GSRVYEKWTDITGLANIDDE 68 (293) T ss_pred Cceee----cccccCcCceEechhHHH----HHHHHHHhhhhhhhhceeeeccCCc----ceEEEEeecCCCcceeeecC Confidence 11111 11223333 35655543 344444444444444 4443321 122343333 4567788888 Q ss_pred ccCCce-eeeeeeeeeeeEEEEEEEEeeCHHHHHHHHhhCCCHHHHHHHHHHHHHHHhhcceEEeeccccceEEEEecCC Q lcl|Aclame:pro 106 YSSDGD-SGANINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASELNYSSALGLAKFLNGSYLFGVAGLENYGLINDPS 184 (336) Q Consensus 106 ~~diP~-~~~~~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~k~~aAr~a~e~~~n~~~~~Gd~~~g~~GllN~Pn 184 (336) ...+|- .+.......-+.+.++..+.+|.+=++-+ ..+|.+.-....++++.+.+|+-.+.|..... T Consensus 69 g~~~~~~~~~~~~~i~l~~~k~~~~~~iS~ell~ds---~~~l~~~i~~~la~~~~~~~~~~i~~g~~~~~--------- 136 (293) T protein:vir:48 69 AGKIADIDDPKLSLIKYTIKRYAGISTVTNSLLADS---AENILAWLSGWIAKKVVVTRNKAILGVVDKLP--------- 136 (293) T ss_pred CcccccccccceeEEEEeeeEEEEeehhhHHHHhhh---hHHHHHHHHHHHHHHHHHHHHhHHhhcccccc--------- Confidence 888875 45677888888888998888886655543 46788888888888888888876665543211 Q ss_pred CCcccccccccccccCHHHHHHHHHHHHHHHHHHhCCceecccccEEEecHHHHHhccc-CCCCCccHHH-HHHHhCC-- Q lcl|Aclame:pro 185 LSAPITATTPWSGSPAVEAVVNEVVALFQVLQTQSQGIITQEDVLRMGLPPTAMSDLSK-TNQYGLAAAA-KLKDIFP-- 260 (336) Q Consensus 185 l~~~~~~~t~w~~~~t~~eI~~Di~~l~~~l~~~s~g~v~~~~p~tL~Lp~~~~~~L~~-~~~~~~Tvl~-~l~~n~p-- 260 (336) ...+..+ ++||.+++..+...- .....++|.++.+..|.+ .+..|.-+++ -+....+ T Consensus 137 ---------~~~~~~~----~d~i~~~~~~l~~~~------~~~a~~vmn~~~~~~L~~lkd~~g~~l~~~~~~~~~~~~ 197 (293) T protein:vir:48 137 ---------TKPTLTK----WDDIIDLEAKVDPAI------KQTSFFLTNTSGFTALKKVKNALGDYLMERDVKSPTGYS 197 (293) T ss_pred ---------ccccccC----HHHHHHHHHhhhhhh------cCCCEEEEcHHHHHHHHHhhccCCceEeecCcCCCCCce Confidence 1112233 456666777664321 123478999999998854 3333332221 0111111 Q ss_pred --c--cEEEEccccc-CCCCceEEEEEEeecC-----CceEEEEcChhhhcccceecCCceEEccccceeeeeeecccce Q lcl|Aclame:pro 261 --K--LEFVTIPEYD-TASGRLVQLWAPRVEG-----KDTATCGFTEKMRAHSIERYSSYFRQKKSAGTWGAVIFRPFAV 330 (336) Q Consensus 261 --n--l~i~~~pel~-~a~G~~~~~~~~~~~~-----~~~~~~~~p~~~r~l~~~~~~~~~~vp~~~~t~Gv~ir~P~av 330 (336) + +.+....-+. .++|....++.+-.+. ..-.++.+.. ........-...+-+..|.+| .+++|.|| T Consensus 198 l~G~Pv~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~---~~~~~~~~~~~~~r~~~r~d~-~~~~~~a~ 273 (293) T protein:vir:48 198 IAGFAVKEISDRWLPNASSGVMPLYFGDLKQAVTLFDRQQMSLLSTN---IGGGAFETDTTKVRVIDRFDV-VATDTEAF 273 (293) T ss_pred ecceeeEEecccccCCccCCceEEEEEeccceEEEEEecceEEEEec---ccchhhhcCeEEEEEEEeeCc-EEecccce Confidence 1 1111111111 1233333332211000 0001111110 000001112244455666666 46789999 Q ss_pred eeeccC Q lcl|Aclame:pro 331 AQMIGV 336 (336) Q Consensus 331 ~~~~GI 336 (336) +.+..= T Consensus 274 ~~l~~~ 279 (293) T protein:vir:48 274 VPASFK 279 (293) T ss_pred EEEEee Confidence 977743 No 90 >protein:vir:485 Length: 407 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:11 # MgeName: P27 # Cross-refs: genbank:acc:NP_543092;swissprot:trembl:q8w627;genbank:gi:18249904;uniprot:Q8W627;genbank:GeneID:929693 Probab=94.96 E-value=0.0031 Score=34.35 Aligned_cols=313 Identities=11% Similarity=0.040 Sum_probs=143.5 Q ss_pred CchHHHH--HHhhhcceeccchhhhccchhHHHHHhhhh----------hcccc--cccCcc--hHHHHHHHhhCceeee Q lcl|Aclame:pro 1 MRDAQRI--QNLARAGVILPRSVQNVSTPLTEYAMDAAD----------LSPHL--SSTGSS--GIPNYLTTYVDPAVID 64 (336) Q Consensus 1 ~~~~~~~--~~l~~~g~~~~~~~~~~~~~~~~~a~da~d----------~~~~l--~t~~~~--~i~~~l~~~idp~v~~ 64 (336) +++..+- +.+....-...........+.+...+..+. -...+ .|.+++ .||..+.+ +|++ T Consensus 53 ~e~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~a~~~~l~~g~~~~~~~~e~~a~~~~t~~~gG~~iP~~~~~----~I~~ 128 (407) T protein:vir:48 53 LENLKSDLEAELAEVKRPAGGTQNKVASEHKEAFIGFMRKGREDGLRELERKALQVGNDEDGGYAIPEELDR----TILT 128 (407) T ss_pred HHHHHHHHHHHHHHhhccccccccchhhHHHHHHHHHHhccchhhhhHHHHHhhhcccCCCCcccccHhHHH----HHHH Confidence 1111110 000000000000000000000000000000 00011 222333 35665543 3333 Q ss_pred eeccccchhhhcccccCCCcceeeEEEeeeecceeeEEeecccCCceee-eeeeeeeeeEEEEEEEEeeCHHHHHHHHhh Q lcl|Aclame:pro 65 ILVAPMKAAELVGESKKGDWTTLVAAFITAEPTTKVATYGDYSSDGDSG-ANINYPQRQSYFFQTWTRWGERELEMAGAG 143 (336) Q Consensus 65 ~~~~~~~~~~l~~v~t~g~w~~~t~~~~~~e~~G~a~~ygd~~diP~~~-~~~~~~~~~v~~~~~~~~y~~~El~~A~~~ 143 (336) .+-..-....+..+.+.+. ....+++......+...+.+...|-.+ .......-.++.++..+.+|.+=++. . T Consensus 129 ~~~~~~~l~~~~~~~~~~~---~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~d---s 202 (407) T protein:vir:48 129 LLKDEVVMRQEATVITLGG---SDYKKLVNLGGTTSGWVGETDARPETATSKLGLIEPFMGEIYGNPQATQKMLDD---A 202 (407) T ss_pred HHHhhhhhhhhceeeecCC---CceEEEEecCCcceeeecccccccccccccceeEEeeeeeeEeehhhHHHHHhc---c Confidence 3333333334443322221 235666666666777778777777554 45677777788888888888765543 3 Q ss_pred CCCHHHHHHHHHHHHHHHhhcceEEeeccccceEEEEecCCCCccccccccc------ccccCHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 144 RVDLASELNYSSALGLAKFLNGSYLFGVAGLENYGLINDPSLSAPITATTPW------SGSPAVEAVVNEVVALFQVLQT 217 (336) Q Consensus 144 g~~l~~~k~~aAr~a~e~~~n~~~~~Gd~~~g~~GllN~Pnl~~~~~~~t~w------~~~~t~~eI~~Di~~l~~~l~~ 217 (336) ..++.+.-......++.+.++.-.++|++.....|+|+++.+....... .| ...++..--++||.+++..|.. T Consensus 203 ~~~l~~~i~~~l~~~i~~~~~~a~l~G~G~~~p~Gil~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~d~i~~l~~~l~~ 281 (407) T protein:vir:48 203 FFNVEDWINSELALEFAEQEEIAFTSGDGSKKPKGFLAYESTDEDDKTR-AFGKLQHIASGAASGVTADAIIKLIYTLRK 281 (407) T ss_pred hHHHHHHHHHHHHHHHHHHHHhhhhccCCCCccceeeeccccccccccc-ccccccccccccccccChHHHHHHHHhhch Confidence 4578888888888888999999999999988899999988764322211 11 0011111125677777776643 Q ss_pred HhCCceecccccEEEecHHHHHhccc-CCCCCccHHH-HHHH-------hCCccEEEEcccccCCCCceEEEEEEeecCC Q lcl|Aclame:pro 218 QSQGIITQEDVLRMGLPPTAMSDLSK-TNQYGLAAAA-KLKD-------IFPKLEFVTIPEYDTASGRLVQLWAPRVEGK 288 (336) Q Consensus 218 ~s~g~v~~~~p~tL~Lp~~~~~~L~~-~~~~~~Tvl~-~l~~-------n~pnl~i~~~pel~~a~G~~~~~~~~~~~~~ 288 (336) .-. ..-+++|.+..+..|.+ .+..|.-++. =+.. -+|=+....+|.. ++|....+|.+-. . T Consensus 282 ~~~------~~a~~v~n~~~~~~L~~lkD~~Gr~l~~~~~~~g~~~~l~G~PV~~~~~~p~~--~~~~~~i~~Gd~~--~ 351 (407) T protein:vir:48 282 AHR------SGAKFMMNNSSLFAIRLLKDNDGNYLWRPGIELGQPSSLAGYGIVENEQMPDI--AADAKAIAFGNFK--R 351 (407) T ss_pred hhh------cCCEEEEcHHHHHHHHHhhccCCceeeccCcCCCCCceecceeeEEecCcCCc--cCCccEEEEEecc--c Confidence 211 12358899998888753 2333332211 0011 1221122223322 2333333332210 0 Q ss_pred ceEEEEcChhhhccc-ceecCCceEEccccceeeeeeecccceeeeccC Q lcl|Aclame:pro 289 DTATCGFTEKMRAHS-IERYSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) Q Consensus 289 ~~~~~~~p~~~r~l~-~~~~~~~~~vp~~~~t~Gv~ir~P~av~~~~GI 336 (336) .. .+.--+.++.+- .....-....-+..|.+|. +..|.||+.+..= T Consensus 352 ~~-~i~~~~~~~i~~d~~~~~~~~~~~~~~r~d~~-v~~~~a~~~l~~~ 398 (407) T protein:vir:48 352 GY-TIVDRIGTRILRDPYTNKPFVGFYTTKRTGGM-LVDSQAIKLMKIG 398 (407) T ss_pred cE-EEEEeeceEEEeeccccCCcEEEEEEEEeccE-EecccceEEEEee Confidence 00 000000011100 0011223444566777664 4559999765543 No 91 >protein:vir:4197 Length: 314 # NCBI annotation: putative structural protein # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:88 # MgeName: psiM100 # Cross-refs: genbank:acc:NP_071822;genbank:gi:11863105;genbank:GeneID:1257607 Probab=94.89 E-value=0.0032 Score=34.23 Aligned_cols=290 Identities=12% Similarity=0.054 Sum_probs=136.0 Q ss_pred CchHHHHHHhhhcceeccchhhhccchhHHHHHhhhhhcccccccCcchHHHHHHHhhCceeeeeeccccchhhhccccc Q lcl|Aclame:pro 1 MRDAQRIQNLARAGVILPRSVQNVSTPLTEYAMDAADLSPHLSSTGSSGIPNYLTTYVDPAVIDILVAPMKAAELVGESK 80 (336) Q Consensus 1 ~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~a~da~d~~~~l~t~~~~~i~~~l~~~idp~v~~~~~~~~~~~~l~~v~t 80 (336) |.-.+++-+.-| ++...|++++.. .|+.++.+|+ .+ +..-+-++....+. + T Consensus 1 ~~~~~~~~~~~k-------------------~it~~d~~gG~L------~P~~~~~~i~-~l-~e~s~i~~~a~vi~--t 51 (314) T protein:vir:41 1 MDFLNKPFQITP-------------------KIDVPDLGKGIL------AVQRFGEFVR-EV-RENSAIIKDARVLN--A 51 (314) T ss_pred CchhhhHHHhhc-------------------ccccccCCCcee------ChHHHHHHHH-HH-Hhccchhhheeeec--c Confidence 221111111111 111223333321 2455555552 22 22222222222221 2 Q ss_pred CCCcceeeEEEeeeec----ceeeEEeecccCCceeeeeeeeeeeeEEEEEEEEeeCHHHHHHHHhhCCCHHHHHHHHHH Q lcl|Aclame:pro 81 KGDWTTLVAAFITAEP----TTKVATYGDYSSDGDSGANINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASELNYSSA 156 (336) Q Consensus 81 ~g~w~~~t~~~~~~e~----~G~a~~ygd~~diP~~~~~~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~k~~aAr 156 (336) .++.. ..+..+.. ...+...|+.++.|-.+..........+.+..-+..+.+.|+-++ -|.++...-....+ T Consensus 52 ~~s~~---~~i~~i~~g~~~~~~~~~~~~~~~~~~~~~tf~~~~l~~~kl~~~v~is~e~L~D~a-~~~~le~~i~~~~A 127 (314) T protein:vir:41 52 LKSYE---VDISRISLGVELEPGRNTSGTKVAPTADEVTVSTNTLEMKELVTKVVLEDEALEDNI-EQSAFEQTITSLLA 127 (314) T ss_pred cCccc---eeecccccCcccccccccccCCccCCcccccccceeeeeEEEEEeecccHHHHHhhh-chhhHHHHHHHHHH Confidence 22111 11111111 111223344555565665555555555666666777766666443 56789999999999 Q ss_pred HHHHHhhcceEEeeccc--------cceEEEEecCCCCcccccccccccccCHHHHHHHHHHHHHHHHHHhCCceecccc Q lcl|Aclame:pro 157 LGLAKFLNGSYLFGVAG--------LENYGLINDPSLSAPITATTPWSGSPAVEAVVNEVVALFQVLQTQSQGIITQEDV 228 (336) Q Consensus 157 ~a~e~~~n~~~~~Gd~~--------~g~~GllN~Pnl~~~~~~~t~w~~~~t~~eI~~Di~~l~~~l~~~s~g~v~~~~p 228 (336) ..+.+.+....+.||+. ....|+|+..... ++..+. .+.+.+++.+.|+-..+..-..+.. .. T Consensus 128 e~~g~~~~~~~~nGdg~~~s~~~~~~~p~G~l~~a~~~--~~~~~~-~~~~~~~~~~~~l~~sl~~~yr~~~------~~ 198 (314) T protein:vir:41 128 SGVTYDLECFFLHADSSLTTGRELYRINDGWMKLAGNQ--YTDAEP-EDENWPLNLFDGMMDELDTRYLQLK------PR 198 (314) T ss_pred HHHHHHHHHHhhccccCCcCcccchhcchhhhhhcccc--eeecCc-cccccHHHHHHHHHHhcCchhhcCC------Cc Confidence 99999999999999974 2456777643221 111111 1112233333333333322222221 23 Q ss_pred cEEEecHHHHHhcccC-CCCCccHHHHH--HHh---CCccEEEEcccccCCC-CceEEEEEEeecCCceEEEEcChhhhc Q lcl|Aclame:pro 229 LRMGLPPTAMSDLSKT-NQYGLAAAAKL--KDI---FPKLEFVTIPEYDTAS-GRLVQLWAPRVEGKDTATCGFTEKMRA 301 (336) Q Consensus 229 ~tL~Lp~~~~~~L~~~-~~~~~Tvl~~l--~~n---~pnl~i~~~pel~~a~-G~~~~~~~~~~~~~~~~~~~~p~~~r~ 301 (336) ...+|++..+..+.+- ..-+..+++.. ... +-+..++.+|.+.+.+ +....|+.+- +.+...+...+|. T Consensus 199 ~~~~m~~~t~~~~r~~l~~~~~~l~~~~~~~~~~~~l~G~PV~~~~~~~~~~~~~~~i~fgd~----~nlv~~~~~~ir~ 274 (314) T protein:vir:41 199 MKFYVSNEIYNGYRKQLLVRETGLGDSALIGATGLQYDGIPIQYVPALDALGDDKARALLTVP----TNLVYGFWRNIRI 274 (314) T ss_pred eEEEecHHHHHHHHHHHhccCCcccchhhhCCCCceecceeeEecccccccCCCCceEEEech----hheEEEeeceeEE Confidence 4688888766544321 11111122211 111 3345677888887654 5566666542 2233455566665 Q ss_pred ccc-eecCCceEEccccceeeeeeecccceeeeccC Q lcl|Aclame:pro 302 HSI-ERYSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) Q Consensus 302 l~~-~~~~~~~~vp~~~~t~Gv~ir~P~av~~~~GI 336 (336) ++- ..+...+..-.+.|+.....-.+.++....+= T Consensus 275 ~~~~~a~~~~~~~~~~~r~d~~~~~~~aa~~~~~~~ 310 (314) T protein:vir:41 275 EPKRDAAMRRTEYIASLRADCNYEDENAAVAAVIDM 310 (314) T ss_pred eecccCcCCeEEEEEEEEeceEEEEcCcEEEEEeec Confidence 542 22223455555556655555566777666655 No 92 >protein:vir:4830 Length: 397 # NCBI annotation: MPL-7201 # Family: family:all:21 # MgeID: mge:105 # MgeName: 7201 # Cross-refs: genbank:acc:NP_038327;genbank:gi:9634653;genbank:GeneID:1262632 Probab=94.38 E-value=0.0046 Score=33.41 Aligned_cols=296 Identities=11% Similarity=-0.007 Sum_probs=132.3 Q ss_pred CchHHHHHHh---------------hhcceeccchhhhccchhHHH------HHhhhhhcccccccCcch--HHHHHHHh Q lcl|Aclame:pro 1 MRDAQRIQNL---------------ARAGVILPRSVQNVSTPLTEY------AMDAADLSPHLSSTGSSG--IPNYLTTY 57 (336) Q Consensus 1 ~~~~~~~~~l---------------~~~g~~~~~~~~~~~~~~~~~------a~da~d~~~~l~t~~~~~--i~~~l~~~ 57 (336) ....+.+... .+....-... .......+.+ .-.....+...++++++| ||..+. T Consensus 50 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~~~gg~~iP~~~~-- 126 (397) T protein:vir:48 50 KMKRDMFKEQYTEARANEVVNMSEEEKKPLTKSEE-EVKAGFVKDFKNLVRGRYQNLLDSKTDASGSDAGLTIPQDIQ-- 126 (397) T ss_pred HHHHHHHHHHHHHHHHhhhhhhhhhccccccchhh-HHHHHHHHHHHHHHhhhhhHHHHHhhccCCccccccccHHHH-- Confidence 0000000000 0000000000 0000000000 000011122233444444 565443 Q ss_pred hCceeeeeeccccchhhhcccccCCCcceeeEEEeeeecceeeEEeecccCCceee-eeeeeeeeeEEEEEEEEeeCHHH Q lcl|Aclame:pro 58 VDPAVIDILVAPMKAAELVGESKKGDWTTLVAAFITAEPTTKVATYGDYSSDGDSG-ANINYPQRQSYFFQTWTRWGERE 136 (336) Q Consensus 58 idp~v~~~~~~~~~~~~l~~v~t~g~w~~~t~~~~~~e~~G~a~~ygd~~diP~~~-~~~~~~~~~v~~~~~~~~y~~~E 136 (336) ++|++.+........++++.......-....+...+..+.+...+....+|-.+ ......+-..+.++..+.+|.+= T Consensus 127 --~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~v~~~~~k~~~~~~iS~el 204 (397) T protein:vir:48 127 --TAIHTLVRQYDSLQEYVNVENVTTLTGSRVYEKWADITGLAKLDDEAGSIGTNDDPKLYPIRYAIKRYAGISTVTNSL 204 (397) T ss_pred --HHHHHHHHHHHHHHhhhceeeccCCcceEEEEeecCCCcceeeeccccccccccccceeeEEeeheeeeeehhhHHHH Confidence 466666666666666666543332222223333445556677777777887654 56777788888888888888665 Q ss_pred HHHHHhhCCCHHHHHHHHHHHHHHHhhcceEEeeccccceEEEEecCCCCcccccccccccccCHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 137 LEMAGAGRVDLASELNYSSALGLAKFLNGSYLFGVAGLENYGLINDPSLSAPITATTPWSGSPAVEAVVNEVVALFQVLQ 216 (336) Q Consensus 137 l~~A~~~g~~l~~~k~~aAr~a~e~~~n~~~~~Gd~~~g~~GllN~Pnl~~~~~~~t~w~~~~t~~eI~~Di~~l~~~l~ 216 (336) ++.+ ..++.+.-......++.+.+++-.+.|++.... .+ ...+. +||.+++..|. T Consensus 205 l~ds---~~~l~~~v~~~l~~~~~~~~d~~il~G~g~~~~------------~~------~~~~~----d~i~~~~~~l~ 259 (397) T protein:vir:48 205 LADS---AENILAWLSGWIAKKVVVTRNKAILEAIATLPT------------KP------TLTKW----DDIIDLQAKVD 259 (397) T ss_pred Hhhc---hHHHHHHHHHHHHHHHHHHHHHHHhhccccccc------------cc------ccccH----HHHHHHHHHhh Confidence 5443 357777777778888888888888888754321 00 11233 34555555553 Q ss_pred HHhCCceecccccEEEecHHHHHhccc-CCCCCccHHHH-HHHhCC----c--cEEEEcccccC-CCCceEEEEEEeecC Q lcl|Aclame:pro 217 TQSQGIITQEDVLRMGLPPTAMSDLSK-TNQYGLAAAAK-LKDIFP----K--LEFVTIPEYDT-ASGRLVQLWAPRVEG 287 (336) Q Consensus 217 ~~s~g~v~~~~p~tL~Lp~~~~~~L~~-~~~~~~Tvl~~-l~~n~p----n--l~i~~~pel~~-a~G~~~~~~~~~~~~ 287 (336) ..- .....++|.+..+..|.+ .+..|.-++.- +...-+ + +.+....-+.. +.+....+|.+ . T Consensus 260 ~~~------~~~a~~v~n~~~~~~L~~lkd~~G~~i~~~~~~~~~~~~l~G~PV~~~~~~~~~~~~~~~~~~~~gd-~-- 330 (397) T protein:vir:48 260 PAI------KQTSFFLTNTSGFTALKKVKNAFGDYLMERDVKSPTGYSIDGFAVKEVADRWLANASSGAMPLYFGD-L-- 330 (397) T ss_pred hhh------cCCCEEEECHHHHHHHHHhhcCCCceeeccCcCCCCCceeccceeEEecccccCCcCCCceEEEEEe-c-- Confidence 321 134678999999988854 23333322210 111111 1 11111111222 22333333221 1 Q ss_pred CceEEE----EcChhhhcccc-eecCCceEEccccceeeeeeecccceeeec--cC Q lcl|Aclame:pro 288 KDTATC----GFTEKMRAHSI-ERYSSYFRQKKSAGTWGAVIFRPFAVAQMI--GV 336 (336) Q Consensus 288 ~~~~~~----~~p~~~r~l~~-~~~~~~~~vp~~~~t~Gv~ir~P~av~~~~--GI 336 (336) .+...+ .+....-.+.- ....-....-+..|.+| .+++|.+|+..+ +. T Consensus 331 ~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~r~~~r~d~-~~~~~~a~~~~~~~~~ 385 (397) T protein:vir:48 331 KQAVTLFDRQQMSLLSTNIGGGAFETDTTKIRVIDRFDV-VATDTESFVPASFKAI 385 (397) T ss_pred cceEEEEeecceEEEEeccchhhhhcCceeEEEEeeecc-EEecccceEEEEeccc Confidence 100000 01111111110 01112234455666655 557788886554 33 No 93 >protein:vir:3870 Length: 400 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:82 # MgeName: A2 # Cross-refs: genbank:acc:NP_680487;swissprot:trembl:q8ltc0;genbank:gi:22296527;interpro:IPR006444;uniprot:Q8LTC0;genbank:GeneID:951713 Probab=94.19 E-value=0.0046 Score=33.38 Aligned_cols=287 Identities=10% Similarity=0.009 Sum_probs=124.9 Q ss_pred CchHHHHHHhhhcc------------------------------eeccchhhhccchhHHHHHhhhhhcccccccCcc-- Q lcl|Aclame:pro 1 MRDAQRIQNLARAG------------------------------VILPRSVQNVSTPLTEYAMDAADLSPHLSSTGSS-- 48 (336) Q Consensus 1 ~~~~~~~~~l~~~g------------------------------~~~~~~~~~~~~~~~~~a~da~d~~~~l~t~~~~-- 48 (336) ....+......+.. ..+-.........................+.+++ T Consensus 65 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~ 144 (400) T protein:vir:38 65 RDLYEAALKGNEQSSGKKPDHPEEHSYRDALNAYLHTRGRNTDGVNFEKTDVGTFAVLRAVPTDASDAVNAGVKAADAAS 144 (400) T ss_pred HHHHHHHHHHHhhcccccccchhhhhHHHHHHHHHhhHHHHHHHHHHHHHHHHHHhhhhhhhHHHHHHHhhcccccCCcc Confidence 00000000000000 0000000000000001111111111112223333 Q ss_pred hHHHHHHHhhCceeeeeeccccchhhhcccccCCCcceeeEEEeeee-cceeeEEeecccCCce-eeeeeeeeeeeEEEE Q lcl|Aclame:pro 49 GIPNYLTTYVDPAVIDILVAPMKAAELVGESKKGDWTTLVAAFITAE-PTTKVATYGDYSSDGD-SGANINYPQRQSYFF 126 (336) Q Consensus 49 ~i~~~l~~~idp~v~~~~~~~~~~~~l~~v~t~g~w~~~t~~~~~~e-~~G~a~~ygd~~diP~-~~~~~~~~~~~v~~~ 126 (336) -||..+. ++|++.+........++++.+.+. .+..|++.. ..|.+..++....+|- .+...+...-.++.+ T Consensus 145 ~vP~~~~----~~ii~~~~~~~~l~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~f~~i~~~~~k~ 217 (400) T protein:vir:38 145 TIPETIS----NTPQRELQTVVDLKPFTNVFQAST---QKGTYPTVANATTKMVTVAELEKNPAMAKPEFKPVNWSVETY 217 (400) T ss_pred cccHHHH----HHHHHHHHhhhhhhhcceeEeccC---cceEEEEEecCCCccccccccccccccccccceeeEeehhhe Confidence 3565444 445555555555666666543321 234666655 4567778887777764 456667777777788 Q ss_pred EEEEeeCHHHHHHHHhhCCCHHHHHHHHHHHHHHHhhcceEEeeccccceEEEEecCCCCcccccccccccccCHHHHHH Q lcl|Aclame:pro 127 QTWTRWGERELEMAGAGRVDLASELNYSSALGLAKFLNGSYLFGVAGLENYGLINDPSLSAPITATTPWSGSPAVEAVVN 206 (336) Q Consensus 127 ~~~~~y~~~El~~A~~~g~~l~~~k~~aAr~a~e~~~n~~~~~Gd~~~g~~GllN~Pnl~~~~~~~t~w~~~~t~~eI~~ 206 (336) +..+.+|.+=|+ ....++.+.-......++...+|.-.++|..... +. ...+.+ T Consensus 218 ~~~~~is~ell~---ds~~~~~~~i~~~l~~~~~~~~~~~i~~~~~~~~---------------~~----~~~~~~---- 271 (400) T protein:vir:38 218 RQALPVSQESID---DSAIDLVGLIAQNGQQIKVNTTNGAVATLLKGFT---------------AK----TISSVD---- 271 (400) T ss_pred eeehhhHHHHHh---hhHHHHHHHHHHHHHHHHHHHHHHhhhhcccccc---------------cc----ccccHH---- Confidence 877777764333 3345677777777777787888876666654211 00 112333 Q ss_pred HHHHHHHHHHHHhCCceecccccEEEecHHHHHhccc-CCCCCccHHH-HHHHh----CCccEEEEccccc-CCCCceEE Q lcl|Aclame:pro 207 EVVALFQVLQTQSQGIITQEDVLRMGLPPTAMSDLSK-TNQYGLAAAA-KLKDI----FPKLEFVTIPEYD-TASGRLVQ 279 (336) Q Consensus 207 Di~~l~~~l~~~s~g~v~~~~p~tL~Lp~~~~~~L~~-~~~~~~Tvl~-~l~~n----~pnl~i~~~pel~-~a~G~~~~ 279 (336) ||..++...... ...-.++|.|+.+..|.+ .+..|.-++. -+... .-+..++..+..- ...|.... T Consensus 272 ~~~~~~~~~~~~-------~~~a~~v~~~~~~~~l~~lkd~~G~~i~~~~~~~~~~~~l~G~pv~~~~~~~~~~~g~~~~ 344 (400) T protein:vir:38 272 DLKHINNVDLDP-------AYSRVIIASQSFYNFLDTVKDGNGRYLLQDSILTPSGKSVLGMPIAVVSDDTLGAAGEAHA 344 (400) T ss_pred HHHHHHHhhhhh-------hhCcEEEEcHHHHHHHHHhhccCCCeeeecCcCCCCccccccceeEEecccccCCCCceEE Confidence 344333322111 113468999999888864 3333443321 01111 1112232222111 22344433 Q ss_pred EEEEeecC-----CceEEEEcChhhhcccceecCCceEEccccceeeeeeecccceeeeccC Q lcl|Aclame:pro 280 LWAPRVEG-----KDTATCGFTEKMRAHSIERYSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) Q Consensus 280 ~~~~~~~~-----~~~~~~~~p~~~r~l~~~~~~~~~~vp~~~~t~Gv~ir~P~av~~~~GI 336 (336) +|.+-.+. ..-..+.+. .+.. -....-+..|.+|.+ ..|-+|+.+..- T Consensus 345 ~~gd~s~~~~~~~~~~~~~~~~----~~~~----~~~~~~~~~r~d~~~-~~~~a~~~l~~~ 397 (400) T protein:vir:38 345 FLGDIKRAILFANRADFMVRWV----DDQI----YGQFLQAGMRFGVSV-ADEKAGYFLTYT 397 (400) T ss_pred EEEeccccEEEEeecceEEEEe----cccc----cceeEEEEEEeccEE-ecccceEEEEee Confidence 33321100 001111111 1111 112234556655544 468888887766 No 94 >protein:vir:3991 Length: 404 # NCBI annotation: major structural protein # Family: family:all:21 # MgeID: mge:319 # MgeName: BK5-T # Cross-refs: genbank:acc:NP_116499;genbank:gi:14251132;genbank:GeneID:921252 Probab=93.05 E-value=0.009 Score=31.80 Aligned_cols=298 Identities=9% Similarity=-0.042 Sum_probs=129.5 Q ss_pred CchHHHHHHhhhc-ceeccch--hhhcc-chhHHH---------HHhhhhhcccc--cccCcc--hHHHHHHHhhCceee Q lcl|Aclame:pro 1 MRDAQRIQNLARA-GVILPRS--VQNVS-TPLTEY---------AMDAADLSPHL--SSTGSS--GIPNYLTTYVDPAVI 63 (336) Q Consensus 1 ~~~~~~~~~l~~~-g~~~~~~--~~~~~-~~~~~~---------a~da~d~~~~l--~t~~~~--~i~~~l~~~idp~v~ 63 (336) +++.+........ ...-+.. ..... ..++.+ .+. +.....+ .+.+++ .||..+. ++|+ T Consensus 63 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~e~~a~~~~t~~~gg~~iP~~~~----~~ii 137 (404) T protein:vir:39 63 LVEAQAEQVVNMREEEKGPLNKSEYELKDKFVKEFVNMVRNPMAFLN-TVSSKTETSGSDSAAGLTIPQDIR----TMIN 137 (404) T ss_pred HHHHHHHHHhccccccccccccchhhhHHHHHHHHHHHHhcchhhhh-hhhhhhhhcccccCCceeccHHHH----HHHH Confidence 1111111111000 0000000 00000 000000 000 0011111 222222 2665554 3455 Q ss_pred eeeccccchhhhcccccCCCcceeeEEEeeeecceeeEEeecccCCce-eeeeeeeeeeeEEEEEEEEeeCHHHHHHHHh Q lcl|Aclame:pro 64 DILVAPMKAAELVGESKKGDWTTLVAAFITAEPTTKVATYGDYSSDGD-SGANINYPQRQSYFFQTWTRWGERELEMAGA 142 (336) Q Consensus 64 ~~~~~~~~~~~l~~v~t~g~w~~~t~~~~~~e~~G~a~~ygd~~diP~-~~~~~~~~~~~v~~~~~~~~y~~~El~~A~~ 142 (336) +..........++.+.....-.-........+..+.+...+....+|- .+.........++.++..+.+|.+=+.. T Consensus 138 ~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~d--- 214 (404) T protein:vir:39 138 TLVRQYDSLQQYVRVESVSTSNGSRVYEKWTDVTPLTVMDAEDGKIPDLDNPRLTIIKYLIKRYAGIITATNTLLKD--- 214 (404) T ss_pred HHHHhhhhHHhhcceeeccCCcceEEEEeecCCccceeeecCccccccccccceeeEEeeeeeEEeeehhHHHHHhh--- Confidence 555555556666555332211111122333455567788888888885 5567788888888888888888654443 Q ss_pred hCCCHHHHHHHHHHHHHHHhhcceEEeeccccceEEEEecCCCCcccccccccccccCHHHHHHHHHHHHHHHHHHhCCc Q lcl|Aclame:pro 143 GRVDLASELNYSSALGLAKFLNGSYLFGVAGLENYGLINDPSLSAPITATTPWSGSPAVEAVVNEVVALFQVLQTQSQGI 222 (336) Q Consensus 143 ~g~~l~~~k~~aAr~a~e~~~n~~~~~Gd~~~g~~GllN~Pnl~~~~~~~t~w~~~~t~~eI~~Di~~l~~~l~~~s~g~ 222 (336) ...+|.+.-......++.+.+++-++.|++... +. + ..++.+ ||..++....... T Consensus 215 s~~~l~~~i~~~l~~~~~~~~d~~il~g~g~~~----------~~-----~---~~~~~~----~i~~~~~~~~~~~--- 269 (404) T protein:vir:39 215 TAENILAWLSSWIAKKVVVTRNQAIIAAMGTVP----------KK-----P---TIAKFD----DVITMINTSVDPA--- 269 (404) T ss_pred chHHHHHHHHHHHHHHHHHHHHHHHHhcccccc----------cc-----c---ccccHH----HHHHHHHHhhhhh--- Confidence 346778888888888888888888888875421 10 0 112334 4443433221111 Q ss_pred eecccccEEEecHHHHHhccc-CCCCCccHHHH-HHHhCC----ccEEEE--cccccCCCCceEEEEEEeecCCceEEEE Q lcl|Aclame:pro 223 ITQEDVLRMGLPPTAMSDLSK-TNQYGLAAAAK-LKDIFP----KLEFVT--IPEYDTASGRLVQLWAPRVEGKDTATCG 294 (336) Q Consensus 223 v~~~~p~tL~Lp~~~~~~L~~-~~~~~~Tvl~~-l~~n~p----nl~i~~--~pel~~a~G~~~~~~~~~~~~~~~~~~~ 294 (336) + .....++|.++.+..|.+ .+..|.-++.- +...-+ +..++. ...+...+.....+++-.. .+...+. T Consensus 270 ~--~~~a~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~~~gd~--~~~~~~~ 345 (404) T protein:vir:39 270 I--IATSSLLTNQSGLNKLALVKTAEGKYLLEPDPTKPNSYLIKGKKVIVVADRWLPNSGSTVYPLYYGDM--SQAITLF 345 (404) T ss_pred h--ccCCEEEEcHHHHHHHHHhhccCCceeeccCcCCCCcceecceeEEEecccccCccCCCccEEEEEec--cccEEEE Confidence 1 123468999999988864 23334333210 000111 111111 1111111112222222111 0101110 Q ss_pred cChhhh--cccce---ecCCceEEccccceeeeeeecccceeeeccC Q lcl|Aclame:pro 295 FTEKMR--AHSIE---RYSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) Q Consensus 295 ~p~~~r--~l~~~---~~~~~~~vp~~~~t~Gv~ir~P~av~~~~GI 336 (336) .-+.++ ..+.. ...-....-+..|.+ +.+++|.||+.+..- T Consensus 346 ~~~~~~i~~~~~~~~~~~~~~~~~r~~~r~d-~~~~~~~a~~~~~~~ 391 (404) T protein:vir:39 346 DRENMSLLPTNIGAGAFETDTTKIRVIDRFD-VKTTDSEALVAGSFT 391 (404) T ss_pred eecceEEEEeccchhhhhhceeeEEEEeeec-cEEecccceEEEEee Confidence 001111 00100 001123445566665 578889999998877 No 95 >protein:vir:93881 Length: 387 # NCBI annotation: ORF011 # Family: family:all:658 # MgeID: mge:1485 # MgeName: 3A # Cross-refs: genbank:acc:YP_239938;genbank:gi:66395599;genbank:GeneID:5130947 Probab=92.02 E-value=0.012 Score=31.21 Aligned_cols=290 Identities=10% Similarity=0.019 Sum_probs=126.5 Q ss_pred CchH-HHHHHh-----hhcceecc--chhh-------------hccchhHHHHHhhhhhcccc--cccCcc--hHHHHHH Q lcl|Aclame:pro 1 MRDA-QRIQNL-----ARAGVILP--RSVQ-------------NVSTPLTEYAMDAADLSPHL--SSTGSS--GIPNYLT 55 (336) Q Consensus 1 ~~~~-~~~~~l-----~~~g~~~~--~~~~-------------~~~~~~~~~a~da~d~~~~l--~t~~~~--~i~~~l~ 55 (336) ++.. +++... ++.+-... ...+ ..........+.+-.....+ .+.+++ .||..+. T Consensus 56 l~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~~~~~al~~~t~s~gG~~IP~~~~ 135 (387) T protein:vir:93 56 VERQVKDIEEKEKAKVKDTGEAYQSLNDHEKMVKAKAEFYRHAILPNEFEKPSMEAQRLLHALPTGNDSGGDKLLPKTLS 135 (387) T ss_pred HHHHHHHHHHHHHHhhhhccccCCCcchhhHHHHHHHHHHHHHhhhhhhhhhhhhhHHHHHhhccCcCCCCceeechhHH Confidence 1000 000000 00000000 0000 00000000011110011111 233333 3777665 Q ss_pred HhhCceeeeeeccccchhhhcccccCCCcceeeEEEee-eecceeeEEeecccCCceeeeeeeeeeeeEEEEEEEEeeCH Q lcl|Aclame:pro 56 TYVDPAVIDILVAPMKAAELVGESKKGDWTTLVAAFIT-AEPTTKVATYGDYSSDGDSGANINYPQRQSYFFQTWTRWGE 134 (336) Q Consensus 56 ~~idp~v~~~~~~~~~~~~l~~v~t~g~w~~~t~~~~~-~e~~G~a~~ygd~~diP~~~~~~~~~~~~v~~~~~~~~y~~ 134 (336) + +|++.+...-....+..+.+.+.. .++. ....+.+...+.....|-.+.......-..+.++..+.+|. T Consensus 136 ~----~Ii~~~~~~~~l~~~~~v~~~~~~-----~~p~~~~~~~~a~~v~E~~~~~~~~~~f~~v~~~~~k~~~~~~iS~ 206 (387) T protein:vir:93 136 K----EIVSEPFAKNQLREKARLTNIKGL-----EIPRVSYTLDDDDFITDVETAKELKLKGDTVKFTTNKFKVFAAISD 206 (387) T ss_pred H----HHHHHHHhhchhhhheeeeecCCc-----eEEEEeecCCccccccCcccccccccccceeeeeheeeeeechhhH Confidence 4 334444443344556655544432 2222 23445677778887888888777777888888888888885 Q ss_pred HHHHHHHhhCCCHHHHHHHHHHHHHHHhhcceEE-eeccccceEEEEecCCCCcccccccccccccCHHHHHHHHHHHHH Q lcl|Aclame:pro 135 RELEMAGAGRVDLASELNYSSALGLAKFLNGSYL-FGVAGLENYGLINDPSLSAPITATTPWSGSPAVEAVVNEVVALFQ 213 (336) Q Consensus 135 ~El~~A~~~g~~l~~~k~~aAr~a~e~~~n~~~~-~Gd~~~g~~GllN~Pnl~~~~~~~t~w~~~~t~~eI~~Di~~l~~ 213 (336) +=|+ ....++.+--....+.++.+.++..+| .|++...-.|.++++.+.. + +....++||.+++. T Consensus 207 ell~---Ds~~~l~~~i~~~la~~~~~~e~~~~~~~g~g~g~p~g~l~~~~~~~-v----------~~~~~~d~i~~~~~ 272 (387) T protein:vir:93 207 TVIH---GSDVDLVNWVENALQSGLAAKERKDALAVSPKSGLDHMSFYNGSVKE-V----------EGADMYDAIINALA 272 (387) T ss_pred HHHh---hhHHHHHHHHHHHHHHHHHHHHHHhHhhcCCCccccceeeecccccc-c----------cccchHHHHHHHHh Confidence 4333 344567777777777777777666444 4555555578887655432 1 11223567777777 Q ss_pred HHHHHhCCceecccccEEEecHHH-HHhcccCCCCCccHHHHHHHhCCccEEEEcccccCCC------CceEEEEEEeec Q lcl|Aclame:pro 214 VLQTQSQGIITQEDVLRMGLPPTA-MSDLSKTNQYGLAAAAKLKDIFPKLEFVTIPEYDTAS------GRLVQLWAPRVE 286 (336) Q Consensus 214 ~l~~~s~g~v~~~~p~tL~Lp~~~-~~~L~~~~~~~~Tvl~~l~~n~pnl~i~~~pel~~a~------G~~~~~~~~~~~ 286 (336) +|...-. . .-..+|-+.. ...+....+.|-.++ .. .|+ +|-..|-..+++ |.-.+.+.. ++ T Consensus 273 ~l~~~~~-----~-~a~~~mn~~t~~~~~~~~~d~~~~~~---~~-~~~-~llG~PV~~~~~~~~~~~GDf~~~~~~-~~ 340 (387) T protein:vir:93 273 DLHEDYR-----D-NATIYMRYADYVKIISVLSNGTTNFF---DT-PAE-KVFGKPVVFTDAAVKPIVGDFNYFGIN-YD 340 (387) T ss_pred ccChhhh-----c-CCEEEEechHHHHHHHHHhcCCCccc---cc-CCc-cccccceEEecCCCceeeeehhhhhee-hh Confidence 6644321 1 1135565443 444443333232222 11 121 222222222211 211111111 11 Q ss_pred CCceEEEEcChhhhcccceecCCceEEccccceeeeeeecccceeeeccC Q lcl|Aclame:pro 287 GKDTATCGFTEKMRAHSIERYSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) Q Consensus 287 ~~~~~~~~~p~~~r~l~~~~~~~~~~vp~~~~t~Gv~ir~P~av~~~~GI 336 (336) + +-+.++. +.....+.+-+..|.+|.+ ++|-||+.+.-= T Consensus 341 ~---------~~~~~~~-~~~~~~~~~~~~~r~d~~v-~~~eA~~~l~~k 379 (387) T protein:vir:93 341 G---------TTYDTDK-DVKKGEYLFVLTAWYDQQR-TLDSAFRIAKAK 379 (387) T ss_pred h---------heeeecc-cccCCceeEEEEeeeCcee-echhheEEEEee Confidence 0 1111111 1112233344455666664 569999865432 No 96 >protein:vir:1025 Length: 408 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:20 # MgeName: bIL286 # Cross-refs: genbank:acc:NP_076679;genbank:gi:13095788;genbank:GeneID:920362 Probab=91.99 E-value=0.013 Score=30.87 Aligned_cols=297 Identities=8% Similarity=-0.041 Sum_probs=128.9 Q ss_pred CchH-HHHHHhhhcceec----cch--hhhccc-hhHHH--------HHhhhhhcccc--cccCcch--HHHHHHHhhCc Q lcl|Aclame:pro 1 MRDA-QRIQNLARAGVIL----PRS--VQNVST-PLTEY--------AMDAADLSPHL--SSTGSSG--IPNYLTTYVDP 60 (336) Q Consensus 1 ~~~~-~~~~~l~~~g~~~----~~~--~~~~~~-~~~~~--------a~da~d~~~~l--~t~~~~~--i~~~l~~~idp 60 (336) .+.. .+.....+.+..- +.. ...... ..+.+ ..........+ .+.+++| +|..+. + T Consensus 59 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~t~~~gg~~vP~~~~----~ 134 (408) T protein:vir:10 59 LREQLVEAQAEQVVNMREEEKGPLNKSENELKDKFVKDFVNMVRNPMAFMNTVSSKTETSGSDSAAGLTIPQDIR----T 134 (408) T ss_pred HHHHHHHHHHHHHhccccccccccccchhhhHHHHHHHHHHHhhcchhhhhhhhhhhhhcccccCCceeccHhHH----H Confidence 0000 0000000111100 000 000000 00000 00001111122 2233332 676555 4 Q ss_pred eeeeeeccccchhhhcccccC--CCcceeeEEEeeeecceeeEEeecccCCceee-eeeeeeeeeEEEEEEEEeeCHHHH Q lcl|Aclame:pro 61 AVIDILVAPMKAAELVGESKK--GDWTTLVAAFITAEPTTKVATYGDYSSDGDSG-ANINYPQRQSYFFQTWTRWGEREL 137 (336) Q Consensus 61 ~v~~~~~~~~~~~~l~~v~t~--g~w~~~t~~~~~~e~~G~a~~ygd~~diP~~~-~~~~~~~~~v~~~~~~~~y~~~El 137 (336) +|++.+........+..+... +.+. .......+..+.+...|....+|-.+ ......+.+.+.++..+.+|.+=+ T Consensus 135 ~Ii~~~~~~~~l~~~~~~~~~~~~~~~--~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~i~~~~~k~~~~~~iS~ell 212 (408) T protein:vir:10 135 MINTLVRQYDSLQQYVRVESVSTSNGS--RVYEKWTDVTPLTVMDAEDGKIPDLDNPQLTIIKYLIKRYAGIITATNTSL 212 (408) T ss_pred HHHHHHHhhchhhhhcceeeccCCcce--EEEeeccccccceeeecCccccccccCcceeeEEeeeeeEEeeehhHHHHH Confidence 556666655556666544322 1221 11112224456777888888888544 567788888888888888886644 Q ss_pred HHHHhhCCCHHHHHHHHHHHHHHHhhcceEEeeccccceEEEEecCCCCcccccccccccccCHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 138 EMAGAGRVDLASELNYSSALGLAKFLNGSYLFGVAGLENYGLINDPSLSAPITATTPWSGSPAVEAVVNEVVALFQVLQT 217 (336) Q Consensus 138 ~~A~~~g~~l~~~k~~aAr~a~e~~~n~~~~~Gd~~~g~~GllN~Pnl~~~~~~~t~w~~~~t~~eI~~Di~~l~~~l~~ 217 (336) +- ...+|.+.-....++++...+++-.+.|++... + . ...++.+.|++.++..+.. T Consensus 213 ~d---s~~~l~~~i~~~l~~~~~~~~~~~il~g~g~~~----------~--~------~~~~~~~~l~~~~~~~~~~--- 268 (408) T protein:vir:10 213 KD---TAENILAWLSSWIAKKVVVTRNQAIIEVMKAAP----------K--K------PTIAKFDDVITMINTAVDP--- 268 (408) T ss_pred hh---chHHHHHHHHHHHHHHHHHHHHHHHhhcccccc----------c--c------cccccHHHHHHHHHHhhhh--- Confidence 43 355777777888888888888887777776421 0 0 0122344444433322211 Q ss_pred HhCCceecccccEEEecHHHHHhccc-CCCCCccHHHH-HHHhCC----ccEEEEcc--cccCCCCceEEEEEEeecCCc Q lcl|Aclame:pro 218 QSQGIITQEDVLRMGLPPTAMSDLSK-TNQYGLAAAAK-LKDIFP----KLEFVTIP--EYDTASGRLVQLWAPRVEGKD 289 (336) Q Consensus 218 ~s~g~v~~~~p~tL~Lp~~~~~~L~~-~~~~~~Tvl~~-l~~n~p----nl~i~~~p--el~~a~G~~~~~~~~~~~~~~ 289 (336) . + ...-.++|.+..+..|.+ .+..|.-+++- +....| +..++... .+...+.+...+++-.. .+ T Consensus 269 --~--~--~~~a~~v~n~~~~~~l~~lkd~~G~~i~~~~~~~~~~~~l~G~PV~~~~~~~~~~~~~~~~~i~~gd~--~~ 340 (408) T protein:vir:10 269 --A--I--IATSSLLTNQSGLNKLALVKTAEGKYLLEPDPTKPNSYLIKGKQVIVVADRWLPNTGSTVYPLYYGDM--SQ 340 (408) T ss_pred --h--h--ccCCEEEEcHHHHHHHHHhhccCCceEeccCcCCCCCceecceeeEEecccccCccCCCceEEEEEeh--hc Confidence 1 1 123468999999988854 33334434321 111111 11222111 11111122222222111 00 Q ss_pred eEEEEc--Chhhhcccc-ee--cCCceEEccccceeeeeeecccceeeeccC Q lcl|Aclame:pro 290 TATCGF--TEKMRAHSI-ER--YSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) Q Consensus 290 ~~~~~~--p~~~r~l~~-~~--~~~~~~vp~~~~t~Gv~ir~P~av~~~~GI 336 (336) ...+.. .+.+...+. .. ..-....-+..|.+| .++.|-||+.++.- T Consensus 341 ~~~~~~~~~~~v~~~~~~~~~f~~~~~~~r~~~r~d~-~v~~~~a~~~~~~~ 391 (408) T protein:vir:10 341 AITLFDRENMSLLPTNIGAGAFETDTTKIRVIDRFDV-KATDSEALVAGSFS 391 (408) T ss_pred cEEEEEecceEEEEcccccchhhcCceEEEEEEeecc-EEeccccEEEEEee Confidence 000000 001111110 00 112345556666666 55669999988755 No 97 >protein:vir:95107 Length: 270 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1549 # MgeName: X2 # Cross-refs: genbank:acc:YP_240822;genbank:gi:66394683;genbank:GeneID:5133901 Probab=91.88 E-value=0.0079 Score=32.11 Aligned_cols=250 Identities=10% Similarity=-0.031 Sum_probs=128.0 Q ss_pred HHHhhhhhcccccccCcchHHHHHHHhhCceeeeeeccccchhhhcccccC--CCcceeeEEEeeeecceeeEEeecccC Q lcl|Aclame:pro 31 YAMDAADLSPHLSSTGSSGIPNYLTTYVDPAVIDILVAPMKAAELVGESKK--GDWTTLVAAFITAEPTTKVATYGDYSS 108 (336) Q Consensus 31 ~a~da~d~~~~l~t~~~~~i~~~l~~~idp~v~~~~~~~~~~~~l~~v~t~--g~w~~~t~~~~~~e~~G~a~~ygd~~d 108 (336) || .+.-++--+|..|..||--++ -...+...+..+++. |.. =.+++++.++..|.+..+.++++ T Consensus 1 Ma---------~T~~~d~I~Pev~~~~V~e~~----~~~~~~~~~~~~d~~L~g~~-G~ti~~P~~~~igdae~~~eg~~ 66 (270) T protein:vir:95 1 MT---------QTKKANLINPEVLANVVSAQM----QNAIRFTPYAVTDDTLVGQP-GDTITRPKYAYIGAAEDLQEGVA 66 (270) T ss_pred CC---------ceehhhhcchHHHHHHHHHHH----HhHHhhccccccccccCCCC-CCEEEeeeecCCCccccccCCCc Confidence 11 244556668999999884443 223344455555433 222 27799999999999999999999 Q ss_pred CceeeeeeeeeeeeEEEEEEEEeeCHHHHHHHHhhCCCHHHHHHHHHHHHHHHhhcceEEeeccccceEEEEecCCCCcc Q lcl|Aclame:pro 109 DGDSGANINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASELNYSSALGLAKFLNGSYLFGVAGLENYGLINDPSLSAP 188 (336) Q Consensus 109 iP~~~~~~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~k~~aAr~a~e~~~n~~~~~Gd~~~g~~GllN~Pnl~~~ 188 (336) ++..+........++.+.+-+++++. +.+....+=+ ..+-.....+.+.+++++..+ +.+ .|... T Consensus 67 i~~~~lt~~~~~a~i~~~gk~~~itD--~a~~~~~~dp-~~~~~~q~a~~~a~~~d~~li---~~l--~~a~~------- 131 (270) T protein:vir:95 67 MDTTQMSMTTTKVTVKETGKAVEVTQ--TAIITNVNGT-LQEASRQLAMSLADKVEIDYI---AEL--NKSKQ------- 131 (270) T ss_pred cchhhcccchheeeeehhhCcceecH--HHHhhhccch-HHHHHHHHHHHHHHHHHHHHH---HHh--ccccc------- Confidence 99999999999999988877666654 4444444444 344444455666665554322 000 01100 Q ss_pred cccccccccccCHHHHHHHHHHHHHHHHHHhCCceecccccEEEecHHHHHhcccCC-----CCCccHHHHHH-Hh---C Q lcl|Aclame:pro 189 ITATTPWSGSPAVEAVVNEVVALFQVLQTQSQGIITQEDVLRMGLPPTAMSDLSKTN-----QYGLAAAAKLK-DI---F 259 (336) Q Consensus 189 ~~~~t~w~~~~t~~eI~~Di~~l~~~l~~~s~g~v~~~~p~tL~Lp~~~~~~L~~~~-----~~~~Tvl~~l~-~n---~ 259 (336) +.+ .+.+. ++|++++..+ |. +...+..|++.|..+..|.+-. ..+.. .+. -. | T Consensus 132 -~~~----~~~t~----~~~~dA~~~l-----gd-~~~~~~~i~vhs~~~~~Lrk~~~~~~~~~~~~---~~~~G~ig~~ 193 (270) T protein:vir:95 132 -TAT----VSADA----TGILDAIEVF-----NS-ENDEDYVLYVNPKDYNKLVKSLFKVGGNVQDR---AISKGDLVEI 193 (270) T ss_pred -ccc----cccCH----HHHHHHHHHh-----cc-ccCCCcEEEEcHHHHHHHHhhhcccccccccc---hhccccccee Confidence 000 01233 3444444333 11 2345789999999998886421 11111 111 11 2 Q ss_pred CccEE-EEcccccCCCCceEEEEEEeecCCceEEEEcChhhhcccceecCCc-eEEccccceeeeeeecccceeeeccC Q lcl|Aclame:pro 260 PKLEF-VTIPEYDTASGRLVQLWAPRVEGKDTATCGFTEKMRAHSIERYSSY-FRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) Q Consensus 260 pnl~i-~~~pel~~a~G~~~~~~~~~~~~~~~~~~~~p~~~r~l~~~~~~~~-~~vp~~~~t~Gv~ir~P~av~~~~GI 336 (336) -++++ ++-.-. .-.+.+++- +..+.+....+.+. ......+. -..=..-..+||-+..|..++.++== T Consensus 194 ~G~~Viv~s~~~---~~~~~~l~~-----~gAi~~~~~~~~~v-EtdRd~~~~~d~i~~~~~y~v~~~~~skvv~~t~~ 263 (270) T protein:vir:95 194 VGVSDIVKSKRV---SENTAFLQR-----YGAMEIVNKKKPEA-YTDFDILKRTHLLSTNYHYSVNLKDETGVVKVTFK 263 (270) T ss_pred cceeEEEeCCCC---CceeEEEEe-----ccceeeeecCCcee-eeccchhhcccEEEeeeEEEEEEEccceEEEEEec Confidence 23442 321110 112333331 22222222222211 11111111 01111234688888888888866322 No 98 >protein:vir:102119 Length: 404 # NCBI annotation: phage major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1641 # MgeName: phiSM101 # Cross-refs: genbank:acc:YP_699941;genbank:gi:110804052;genbank:GeneID:4206662 Probab=91.60 E-value=0.015 Score=30.57 Aligned_cols=306 Identities=10% Similarity=0.027 Sum_probs=136.0 Q ss_pred CchHHH---HH----Hhhhccee-ccchhh-----------------------hccchhHHHHHhhhhhcccccccCcc- Q lcl|Aclame:pro 1 MRDAQR---IQ----NLARAGVI-LPRSVQ-----------------------NVSTPLTEYAMDAADLSPHLSSTGSS- 48 (336) Q Consensus 1 ~~~~~~---~~----~l~~~g~~-~~~~~~-----------------------~~~~~~~~~a~da~d~~~~l~t~~~~- 48 (336) +...+. .. .+++.... ...... ............++ . .++.+++ T Consensus 44 ~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~a~--~--~~~~~~gg 119 (404) T protein:vir:10 44 QAKIEAQKRKENIENNFNEDNVKSLNTGKEENVIYNGALFVRAIADNLLKQKNQRGLNLSEKEINAI--S--ENIDEDGG 119 (404) T ss_pred HHHHHHHHHHHHHHHHHhhhhccccccccchhhHHHHHHHHHHHHHHHHHHHHhhhhcchhhHHhhh--c--cccCCCCc Confidence 000000 00 00000000 000000 00000000011110 0 1122222 Q ss_pred -hHHHHHHHhhCceeeeeeccccchhhhcccccCCCcceeeEEEeeeecceeeEEeecccCCceee--eeeeeeeeeEEE Q lcl|Aclame:pro 49 -GIPNYLTTYVDPAVIDILVAPMKAAELVGESKKGDWTTLVAAFITAEPTTKVATYGDYSSDGDSG--ANINYPQRQSYF 125 (336) Q Consensus 49 -~i~~~l~~~idp~v~~~~~~~~~~~~l~~v~t~g~w~~~t~~~~~~e~~G~a~~ygd~~diP~~~--~~~~~~~~~v~~ 125 (336) .+|..+. +++++..........++++.....-. ..+.+......+.+...+.+..+|... .......-..+. T Consensus 120 ~~vP~~~~----~~ii~~~~~~~~l~~l~~~~~~~~~~-g~~~~~~~~~~~~~~~v~e~~~~~~~~~~~~f~~i~~~~~k 194 (404) T protein:vir:10 120 YAVPEDIQ----TKINTRLKDTTDLYNMVDYEPVFTRS-GSRTYEKRSKQKPMKPLSENQQIPTNGDNGKLERFNFKLKD 194 (404) T ss_pred eeechhHH----HHHHHHHhhhhhHhhhhceeeccCCc-cceEEEEecCCcceeeccccccccccccccceeeeEeehee Confidence 2454333 45566666656666666665432111 224455555555666777777777643 445556666777 Q ss_pred EEEEEeeCHHHHHHHHhhCCCHHHHHHHHHHHHHHHhhcceEEeeccc-cceEEEEecCCCCcccccccccccccCHHHH Q lcl|Aclame:pro 126 FQTWTRWGERELEMAGAGRVDLASELNYSSALGLAKFLNGSYLFGVAG-LENYGLINDPSLSAPITATTPWSGSPAVEAV 204 (336) Q Consensus 126 ~~~~~~y~~~El~~A~~~g~~l~~~k~~aAr~a~e~~~n~~~~~Gd~~-~g~~GllN~Pnl~~~~~~~t~w~~~~t~~eI 204 (336) ++..+.+|.+=+. ....+|.+.-....++++.+.+++-+++|++. ....|+++.+...+.... ...+ T Consensus 195 ~~~~~~iS~ell~---ds~~~l~~~i~~~la~~~~~~~~~~il~G~g~~~~~~gi~~~~~~~~~~~~-----~~~~---- 262 (404) T protein:vir:10 195 LADFMSIPNDLLK---FADKSLEDWIINWFVDKVRITRNAEILYGAGGDEHATGIMTANKFKKITLP-----KSPA---- 262 (404) T ss_pred eEeeehhhHHHHh---hcHHHHHHHHHHHHHHHHHHHHHHHHhhcCCCCCcccceeeccccceeecc-----cccc---- Confidence 8888888874333 33457888888888888888999989999874 456788886555322111 1223 Q ss_pred HHHHHHHHHHHHHHhCCceecccccEEEecHHHHHhccc-CCCCCccHHH-HHHHhCC----ccEEEEccc--ccCCCCc Q lcl|Aclame:pro 205 VNEVVALFQVLQTQSQGIITQEDVLRMGLPPTAMSDLSK-TNQYGLAAAA-KLKDIFP----KLEFVTIPE--YDTASGR 276 (336) Q Consensus 205 ~~Di~~l~~~l~~~s~g~v~~~~p~tL~Lp~~~~~~L~~-~~~~~~Tvl~-~l~~n~p----nl~i~~~pe--l~~a~G~ 276 (336) ++|+..+++.... ..+ .....++|.+..+..|.+ .+..|.-++. -+....+ +.-++..+. +..+++. T Consensus 263 ~~~~~~~~~~~l~-~~~----~~~~~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~l~G~PV~~~~~~~~~~~~~~ 337 (404) T protein:vir:10 263 LKDFKKCKNVELL-NVF----KATSSWIVNQDGFNYLDSLEDKTGRPYLQPDPKDPTQYRFLGLPVIELPNDLLLSTESA 337 (404) T ss_pred HHHHHHHHHhhhh-ccc----cCCCEEEEcHHHHHHHHHhhccCCceeeccCcCCCCCccccceeeEEecccccCCCCCc Confidence 3455544442211 111 123468999998888854 2333332221 0111111 122332332 2223344 Q ss_pred eEEEEEEeecCCceEEEEc--Chhhhcccc---eecCCceEEccccceeeeeeecccceeeeccC Q lcl|Aclame:pro 277 LVQLWAPRVEGKDTATCGF--TEKMRAHSI---ERYSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) Q Consensus 277 ~~~~~~~~~~~~~~~~~~~--p~~~r~l~~---~~~~~~~~vp~~~~t~Gv~ir~P~av~~~~GI 336 (336) ...++.+-.+ ...+.. ...+..... ....-....-+..|.+ +.+++|.||+.++=- T Consensus 338 ~~~~~gd~s~---~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~r~d-~~v~~~~a~~~~~~~ 398 (404) T protein:vir:10 338 IPVLLGDTKE---AYKYVSDGAYELATTNIGAGAFETNTTKARIIMRID-GNVKDSEALLIAEIP 398 (404) T ss_pred cEEEEEeccc---cEEEEEecceEEEEeccccchhhcCceEEEEEEeec-cEEecccceEEEEee Confidence 4443332111 011100 111111000 0001123344555554 477788888766644 No 99 >protein:vir:4511 Length: 409 # NCBI annotation: capsid # Family: family:all:21 # MgeID: mge:97 # MgeName: V # Cross-refs: genbank:acc:NP_599037;genbank:gi:19548995;genbank:GeneID:935211 Probab=91.05 E-value=0.018 Score=30.18 Aligned_cols=298 Identities=12% Similarity=0.129 Sum_probs=128.9 Q ss_pred Cc--------------------------hHHH---HHHhhhcceeccchhhhcc-chhHHHHHhhhhhcccccccCcch- Q lcl|Aclame:pro 1 MR--------------------------DAQR---IQNLARAGVILPRSVQNVS-TPLTEYAMDAADLSPHLSSTGSSG- 49 (336) Q Consensus 1 ~~--------------------------~~~~---~~~l~~~g~~~~~~~~~~~-~~~~~~a~da~d~~~~l~t~~~~~- 49 (336) ++ +... +.+.-+.|.. .+. .+++. +....+. +.++.+.+| T Consensus 57 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~l~~~~~------~~~~~e~~~--~~~~~a~-~~~~~~~gg~ 127 (409) T protein:vir:45 57 LRRQDQAYIESNEEEQRQNLDPENNSQQDEKRAQVFDKWMRHGAS------ELTSEERKA--LRELRAQ-GVAQDEKGGY 127 (409) T ss_pred HHHHHHHHHhhhhhhhcccCCCCCcchhhHHHHHHHHHHHHhhhh------hccHHHHHH--HHHHhhc-cCccCcCCce Confidence 00 0000 0000000100 000 01111 1111111 112333333 Q ss_pred -HHHHHHHhhCceeeeeeccccchhhhcccccCCCcceeeEEEeeeecce-eeEEeecccCCceeeeeeeeeeeeEEEEE Q lcl|Aclame:pro 50 -IPNYLTTYVDPAVIDILVAPMKAAELVGESKKGDWTTLVAAFITAEPTT-KVATYGDYSSDGDSGANINYPQRQSYFFQ 127 (336) Q Consensus 50 -i~~~l~~~idp~v~~~~~~~~~~~~l~~v~t~g~w~~~t~~~~~~e~~G-~a~~ygd~~diP~~~~~~~~~~~~v~~~~ 127 (336) ||..+.+ +|++.+........+..+.+... .....+...+..+ .+...+.....|-.+.......-..+.+. T Consensus 128 liP~~~~~----~ii~~~~~~~~l~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~v~E~~~~~~~~~~f~~~~l~~~k~~ 201 (409) T protein:vir:45 128 TVPETFLA----KVVEKMKSYGGIASVAQILTTSD--GRTMEWATADGTSEVGVLLGENEEAGEEDTDFGMGSLGALKMT 201 (409) T ss_pred eccHhHHH----HHHHHHHhhhhhhhhceeeecCC--CceEEEEeeccCccccccccccccccccccccceeeeeeeeee Confidence 5655443 34444444333444433322221 1233444444443 34566777777877765554444444443 Q ss_pred E-EEeeCHHHHHHHHhhCCCHHHHHHHHHHHHHHHhhcceEEeeccc---cceEEEEecCCCCcccccccccccccCHHH Q lcl|Aclame:pro 128 T-WTRWGERELEMAGAGRVDLASELNYSSALGLAKFLNGSYLFGVAG---LENYGLINDPSLSAPITATTPWSGSPAVEA 203 (336) Q Consensus 128 ~-~~~y~~~El~~A~~~g~~l~~~k~~aAr~a~e~~~n~~~~~Gd~~---~g~~GllN~Pnl~~~~~~~t~w~~~~t~~e 203 (336) . .+.+|.+=+.-+ ..++.+.-......++...+++-.++|+.. .+..|+++.+....... ..+..+ T Consensus 202 ~~~i~is~ell~ds---~~~l~~~i~~~la~a~~~~~~~a~l~G~G~~~~~~p~Gil~~~~~~~~~~----~~~~~~--- 271 (409) T protein:vir:45 202 SKIIRVSNELLQDS---AIDMEAYLARRIAERIGRGEARYLIQGTGAGTPKQPKGLAASVTGTTQTA----AANAVK--- 271 (409) T ss_pred eeehhhhHHHHhcc---HHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCccccceeeeccccccccc----cccccc--- Confidence 3 345655444333 457888888888888889999999999864 46789999654321111 111233 Q ss_pred HHHHHHHHHHHHHHHhCCceecccccEEEecHHHHHhccc-CCCCCccHHH-HHHHh----CCccEEEEcccccC-CCCc Q lcl|Aclame:pro 204 VVNEVVALFQVLQTQSQGIITQEDVLRMGLPPTAMSDLSK-TNQYGLAAAA-KLKDI----FPKLEFVTIPEYDT-ASGR 276 (336) Q Consensus 204 I~~Di~~l~~~l~~~s~g~v~~~~p~tL~Lp~~~~~~L~~-~~~~~~Tvl~-~l~~n----~pnl~i~~~pel~~-a~G~ 276 (336) ++||.+++..|...-. ....-.+++.+..+..|.+ .+..|.-+++ -+... .-+..++....+.+ ++|. T Consensus 272 -~d~i~~l~~~l~~~~~----~~a~~~~~~n~~~~~~l~~lkd~~G~~i~~~~~~~~~~~~l~G~PV~~~~~~p~~~~~~ 346 (409) T protein:vir:45 272 -WQEILALKHSIDPAYR----RGPKFRLAFNDNTLKLISEMEDGQGRPLWLPDIVGVAPASVLNVPYVIDQEIDDIGAGK 346 (409) T ss_pred -hHHHHHHHHhhhhhhc----cCCeEEEEECHHHHHHHHHhhcCCCceeeccCcCCCCCceecceeeEEecCcCCccCCc Confidence 4556666666643211 1112246778877777643 2333332221 00001 11222322222221 1232 Q ss_pred eEEEEEEee------cCCceEEEEcChhhhcccceecCCceEEccccceeeeeeecccceeeeccC Q lcl|Aclame:pro 277 LVQLWAPRV------EGKDTATCGFTEKMRAHSIERYSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) Q Consensus 277 ~~~~~~~~~------~~~~~~~~~~p~~~r~l~~~~~~~~~~vp~~~~t~Gv~ir~P~av~~~~GI 336 (336) ...+|.+-. .+.-+.+. ...+| .+.....+-+..|.+| .+..|-||+.+.+= T Consensus 347 ~~i~~Gd~~~~~i~~~~~~~~~~-~~d~~------~~~~~~~~~~~~r~d~-~~~~~~A~~~l~~k 404 (409) T protein:vir:45 347 KFMFCGDFDRFIIRRVRYMILKR-LVERY------AEYDQTGFLAFHRFDC-ILEDTSAIKALVGK 404 (409) T ss_pred cEEEEeehhhhheeeccceEEEE-eeccc------ccCCcEEEEEEEEecc-EeechhheEEEEec Confidence 222222110 01111100 11111 1122344556666655 47889998887765 No 100 >protein:vir:7409 Length: 408 # NCBI annotation: major structural protein # Family: family:all:21 # MgeID: mge:146 # MgeName: P335 # Cross-refs: genbank:acc:NP_839926;genbank:gi:30089896;genbank:GeneID:1260683 Probab=91.01 E-value=0.018 Score=30.16 Aligned_cols=298 Identities=7% Similarity=-0.050 Sum_probs=129.9 Q ss_pred CchHH-HHHHhhhcceec------cchhhhccchhHHHHHh---------hhhhcccc--cccCcc--hHHHHHHHhhCc Q lcl|Aclame:pro 1 MRDAQ-RIQNLARAGVIL------PRSVQNVSTPLTEYAMD---------AADLSPHL--SSTGSS--GIPNYLTTYVDP 60 (336) Q Consensus 1 ~~~~~-~~~~l~~~g~~~------~~~~~~~~~~~~~~a~d---------a~d~~~~l--~t~~~~--~i~~~l~~~idp 60 (336) +++.- +.......+..- ................. .......+ .+.+++ .+|..+. + T Consensus 59 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~gg~~vP~~~~----~ 134 (408) T protein:vir:74 59 LREQLVEAQAEQVVNMREEEKGPLNKSENELKDKFVKDFVNMVRNPMAFLNTVSSKTETSGSDSAAGLTIPQDIR----T 134 (408) T ss_pred HHHHHHHHHHHHHhhccccccccccchhhhhHHHHHHHHHHHHhcchhhhhhhhhhhhcccccCCCceeechhHh----h Confidence 11100 000001111100 00000000000000000 00111111 122222 2565554 4 Q ss_pred eeeeeeccccchhhhcccccCCCcceeeEEEeeeecce-eeEEeecccCCce-eeeeeeeeeeeEEEEEEEEeeCHHHHH Q lcl|Aclame:pro 61 AVIDILVAPMKAAELVGESKKGDWTTLVAAFITAEPTT-KVATYGDYSSDGD-SGANINYPQRQSYFFQTWTRWGERELE 138 (336) Q Consensus 61 ~v~~~~~~~~~~~~l~~v~t~g~w~~~t~~~~~~e~~G-~a~~ygd~~diP~-~~~~~~~~~~~v~~~~~~~~y~~~El~ 138 (336) .|++.+........++++..... ....+.+......+ .+...+...++|- .+........+.+.++..+.+|.+=+. T Consensus 135 ~Ii~~~~~~~~l~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~v~E~~~~~~~~~~~~~~i~~~~~k~~~~~~iS~ell~ 213 (408) T protein:vir:74 135 MINTLVRQYDSLQQYVRVESVST-SSGSRVYEKWTDVTPLKAMDEEDGKIPDLDNPRLTIIKYLIKRYAGIITATNTLLK 213 (408) T ss_pred HHHHHHhhhcchhhhcceeeccC-CcceEEEEeecCCcccccccccccccccccccceeeEEeeeeeEEeeehhHHHHHh Confidence 55666666666667665543221 11223343333333 3345566778874 557788888888899988888876443 Q ss_pred HHHhhCCCHHHHHHHHHHHHHHHhhcceEEeeccccceEEEEecCCCCcccccccccccccCHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 139 MAGAGRVDLASELNYSSALGLAKFLNGSYLFGVAGLENYGLINDPSLSAPITATTPWSGSPAVEAVVNEVVALFQVLQTQ 218 (336) Q Consensus 139 ~A~~~g~~l~~~k~~aAr~a~e~~~n~~~~~Gd~~~g~~GllN~Pnl~~~~~~~t~w~~~~t~~eI~~Di~~l~~~l~~~ 218 (336) ....+|.+.-.....+++.+.+|+-.+.|++... + .....+.+.|++.++..+..-. T Consensus 214 ---ds~~~l~~~i~~~l~~~~~~~~d~~il~G~G~~~----------~--------~~~~~~~~~i~~~~~~~l~~~~-- 270 (408) T protein:vir:74 214 ---DTAENILAWLSSWIAKKVVVTRNQAIIAAMGTVP----------K--------KPTIANFDDVITMINTSVDPAI-- 270 (408) T ss_pred ---hchHHHHHHHHHHHHHHHHHHHHHHHhhcccccc----------c--------ccccccHHHHHHHHHHhhhhhh-- Confidence 3455778888888888888888888888875321 1 1112344544444332222111 Q ss_pred hCCceecccccEEEecHHHHHhccc-CCCCCccHHHH-HHHhCC----ccEEEEcc--cccCCCCceEEEEEEeecCCce Q lcl|Aclame:pro 219 SQGIITQEDVLRMGLPPTAMSDLSK-TNQYGLAAAAK-LKDIFP----KLEFVTIP--EYDTASGRLVQLWAPRVEGKDT 290 (336) Q Consensus 219 s~g~v~~~~p~tL~Lp~~~~~~L~~-~~~~~~Tvl~~-l~~n~p----nl~i~~~p--el~~a~G~~~~~~~~~~~~~~~ 290 (336) .....++|.+..+..|.+ .+..|.-++.- +....| +..++..+ .+...+.+...+++-... +.. T Consensus 271 -------~~~a~~v~n~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~i~~gd~~-~~~ 342 (408) T protein:vir:74 271 -------IATSSLLTNQSGLNKLALVKTAEGKYLLEPDPTKPNSYLIKGKQVIVVADRWLPNSGSTVYPLYYGDMS-QAI 342 (408) T ss_pred -------cCCCEEEEcHHHHHHHHHhhcCCCceEeccCcCCCCCceecceeeEEecCcccccccCCcceEEEEehh-ccE Confidence 123468999999988854 23334333210 111111 11122111 111122222222221110 000 Q ss_pred EEEE--cChhhhcccce---ecCCceEEccccceeeeeeecccceeeeccC Q lcl|Aclame:pro 291 ATCG--FTEKMRAHSIE---RYSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) Q Consensus 291 ~~~~--~p~~~r~l~~~---~~~~~~~vp~~~~t~Gv~ir~P~av~~~~GI 336 (336) .+. -...+...+.. ...-...+-+..|.+|. ++.|-||+..+.- T Consensus 343 -~~~~~~~~~i~~~~~~~~~f~~~~~~~r~~~r~d~~-~~~~~a~~~~~~~ 391 (408) T protein:vir:74 343 -TLFDRENMSLLPTNIGAGAFETDTTKIRVIDRFDVK-ATDSEALVAGSFT 391 (408) T ss_pred -EEEEecceEEEEeccccchhhcceeeEEEEEeeCcE-EecccceEEEEee Confidence 000 00111111110 01123445567777765 6679998887754 No 101 >protein:vir:81160 Length: 371 # NCBI annotation: major capsid protein # Family: family:all:21 # MgeID: mge:1892 # MgeName: Geobacillus virus E2 # Cross-refs: genbank:acc:YP_001285811;genbank:gi:148747732;genbank:GeneID:5247203 Probab=89.78 E-value=0.024 Score=29.43 Aligned_cols=291 Identities=9% Similarity=-0.010 Sum_probs=132.8 Q ss_pred CchHHHHHHhhhcceeccc----hhhhc-------cchhHHHHHhhhhhcccccccCcch--HHHHHHHhhCceeeeeec Q lcl|Aclame:pro 1 MRDAQRIQNLARAGVILPR----SVQNV-------STPLTEYAMDAADLSPHLSSTGSSG--IPNYLTTYVDPAVIDILV 67 (336) Q Consensus 1 ~~~~~~~~~l~~~g~~~~~----~~~~~-------~~~~~~~a~da~d~~~~l~t~~~~~--i~~~l~~~idp~v~~~~~ 67 (336) +...++..+.++....... ..... ....+..-..++ ...+.+++| +|..+. +++++.+- T Consensus 45 i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~a~----~~~t~~~gg~~vP~~~~----~~ii~~~~ 116 (371) T protein:vir:81 45 FDVAKELYEEQKQTIEDKEPLKPTVQVKENEVEAFVNHIRTRFRNAM----SEGSNQDGGYTVPQDIQ----TRINELRE 116 (371) T ss_pred HHHHHHHHHHHHHhhccccccccchhhHHHHHHHHHHHHHHHHHHhh----ccCCCccCceeecHhHH----HHHHHHHH Confidence 1111111111111110000 00000 000001011111 112233333 665444 45566666 Q ss_pred cccchhhhcccccCCCcceeeEEEeeeecceeeEEeecccCCc-eeeeeeeeeeeeEEEEEEEEeeCHHHHHHHHhhCCC Q lcl|Aclame:pro 68 APMKAAELVGESKKGDWTTLVAAFITAEPTTKVATYGDYSSDG-DSGANINYPQRQSYFFQTWTRWGERELEMAGAGRVD 146 (336) Q Consensus 68 ~~~~~~~l~~v~t~g~w~~~t~~~~~~e~~G~a~~ygd~~diP-~~~~~~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~ 146 (336) ..-....++++...+. ...++.+......+.+...+.++++| ..+......+.+.+.++..+.+|.+=++.+. .+ T Consensus 117 ~~s~i~~~~~~~~~~~-~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~ds~---~~ 192 (371) T protein:vir:81 117 SKDALQNLITVEPVTT-LSGSRVFKKRSQQTGFVEVAEGAAIGEKATPQFTLLQYQVKKYAGFFRVTNELLNDST---EA 192 (371) T ss_pred hhhhhhhhceeeeccC-CceeEEEEeecCCcceeeeccccccccccccceeeEEeeeeEEEEeehhhHHHHhhhh---HH Confidence 6556666666544322 12334455555566788888888888 4667888889999999999999877665443 46 Q ss_pred HHHHHHHHHHHHHHHhhcceEEeeccccceEEEEecCCCCcccccccccccccCHHHHHHHHHHHHHHHHHHhCCceecc Q lcl|Aclame:pro 147 LASELNYSSALGLAKFLNGSYLFGVAGLENYGLINDPSLSAPITATTPWSGSPAVEAVVNEVVALFQVLQTQSQGIITQE 226 (336) Q Consensus 147 l~~~k~~aAr~a~e~~~n~~~~~Gd~~~g~~GllN~Pnl~~~~~~~t~w~~~~t~~eI~~Di~~l~~~l~~~s~g~v~~~ 226 (336) |.+--......++.+.+|+..+.|++...- . ...+.+.|...++..+... + . T Consensus 193 l~~~i~~~l~~a~~~~~~~~i~~g~g~~~~---------------~----~~~~~~~i~~~~~~~l~~~-------~--~ 244 (371) T protein:vir:81 193 IVNTLVRWIGDESRVTRNGLIINVLNTKAK---------------T----AIADLDGLKQIINVQLDPV-------F--R 244 (371) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhhcccccc---------------c----ccccHHHHHHHHHhhcchh-------h--h Confidence 777777777888888888877777653221 0 0123344444333222111 0 1 Q ss_pred cccEEEecHHHHHhcccC-CCCCccHHH-HHHHh-------CCccEEEEccccc----CCC-CceEEEEEEeecCCceEE Q lcl|Aclame:pro 227 DVLRMGLPPTAMSDLSKT-NQYGLAAAA-KLKDI-------FPKLEFVTIPEYD----TAS-GRLVQLWAPRVEGKDTAT 292 (336) Q Consensus 227 ~p~tL~Lp~~~~~~L~~~-~~~~~Tvl~-~l~~n-------~pnl~i~~~pel~----~a~-G~~~~~~~~~~~~~~~~~ 292 (336) ....++|.+..+..|.+- +..|.-++. =+... +|=+.....|... +.+ +....++.+ . .+-.. T Consensus 245 ~~a~~vmn~~~~~~L~~lkd~~g~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~~~~~i~~Gd-~--~~~~~ 321 (371) T protein:vir:81 245 STSSVIVNQDAFNWLDTLKDQNGQYLLQPSISSPTGRQLLGLPVVIVSNKVLANRVDGGTGAQFAPIIVGD-L--KEAVV 321 (371) T ss_pred cCCEEEEcHHHHHHHHHhhccCCCeeeecccCCCCCceecceeEEEecccccCccccccccCCcceEEEEe-h--hceEE Confidence 234689999988888542 333322210 00111 1211112223211 111 122222221 1 00001 Q ss_pred EEcChhhh--cccce---ecCCceEEccccceeeeeeecccceeeeccC Q lcl|Aclame:pro 293 CGFTEKMR--AHSIE---RYSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) Q Consensus 293 ~~~p~~~r--~l~~~---~~~~~~~vp~~~~t~Gv~ir~P~av~~~~GI 336 (336) +...+.++ ..... ...-...+-+..|.+| .+++|.||+.++ + T Consensus 322 ~~~~~~~~i~~~~~~~~~f~~~~v~~~~~~r~d~-~~~~~~a~~~~~-~ 368 (371) T protein:vir:81 322 MFDRQRTEIMSSNVAMDAFETDATLWRAIERMDV-KMRDDEAFVFGE-V 368 (371) T ss_pred EEeecceEEEEeccccchhhcCceEEEEEEeecc-EEecccceEEEE-E Confidence 10111111 11100 0112344556666655 566788887776 5 No 102 >protein:vir:97255 Length: 310 # NCBI annotation: hypothetical protein ORF017 # Family: family:all:1120 # MgeID: mge:1657 # MgeName: M6 # Cross-refs: genbank:acc:YP_001294525;genbank:gi:149408246;genbank:GeneID:5237120 Probab=89.69 E-value=0.025 Score=29.37 Aligned_cols=268 Identities=14% Similarity=0.125 Sum_probs=118.8 Q ss_pred ccchhhhccchhHHHHHhhhhhcccccccCcchHHHHHHHhhCceeeeeeccccchhhhccccc-CCCcceeeEEEeee- Q lcl|Aclame:pro 17 LPRSVQNVSTPLTEYAMDAADLSPHLSSTGSSGIPNYLTTYVDPAVIDILVAPMKAAELVGESK-KGDWTTLVAAFITA- 94 (336) Q Consensus 17 ~~~~~~~~~~~~~~~a~da~d~~~~l~t~~~~~i~~~l~~~idp~v~~~~~~~~~~~~l~~v~t-~g~w~~~t~~~~~~- 94 (336) +|. ++ +. +|+ - ++ . .-+..+|+|..-..-...+.+|=+. +| .++.|.-. T Consensus 1 mpa----lt--La----ea~---k-~~--~---------d~l~~~ViE~~~~~s~lL~~LpF~~veg----~~~~ynR~~ 51 (310) T protein:vir:97 1 MAS----VT--LA----ESA---K-LA--Q---------DELVAGVIENIITVNRMFDVLPFDSIEG----NSLAYNREN 51 (310) T ss_pred Ccc----cc--hH----HHh---h-cC--c---------chHHHHHHHHHhccchHHHhCCcccccC----CcceeeEee Confidence 110 00 00 010 0 00 0 1112233444433344445555321 22 12333322 Q ss_pred ecceee--EEeecccCCceeeeeeeeeeeeEEEEEEEEeeCHHHHHHH--Hhh-CCCHHH--HHHHHHHHHHHHhhcceE Q lcl|Aclame:pro 95 EPTTKV--ATYGDYSSDGDSGANINYPQRQSYFFQTWTRWGERELEMA--GAG-RVDLAS--ELNYSSALGLAKFLNGSY 167 (336) Q Consensus 95 e~~G~a--~~ygd~~diP~~~~~~~~~~~~v~~~~~~~~y~~~El~~A--~~~-g~~l~~--~k~~aAr~a~e~~~n~~~ 167 (336) +..+.. .+.-.+++.|.......+ ..+.++...--+..|+-+. ... +-+.+. .+-....+++.+...... T Consensus 52 ~~~~~~~~~v~~~~~~~g~~~~~~t~---~~~~~~L~i~~g~~~Vd~~i~dl~~~~~~dq~~~Ql~~~iea~~~~~e~~l 128 (310) T protein:vir:97 52 VLGDVIMAGVGTTFSGAGAGKAAATF---TKVNSNLTTIMGDAEVNGLIQATRSGDGNDQTAVQIASKAKSAGRKYQDQL 128 (310) T ss_pred ccCCcccccccccccCCCcccccccc---ceeeeeeeeeeehhhhhhHHHhhhcCChHHHHHHHHHHHHHHHHHHHHHHh Confidence 222222 222223333333332332 2334445555555665542 322 323222 333455667778888888 Q ss_pred Eeecc-ccceEEEEecCCCCcccccccccccccCHHHHHHHHHHHHHHHHHHhCCceecccccEEEecHHH---HHhccc Q lcl|Aclame:pro 168 LFGVA-GLENYGLINDPSLSAPITATTPWSGSPAVEAVVNEVVALFQVLQTQSQGIITQEDVLRMGLPPTA---MSDLSK 243 (336) Q Consensus 168 ~~Gd~-~~g~~GllN~Pnl~~~~~~~t~w~~~~t~~eI~~Di~~l~~~l~~~s~g~v~~~~p~tL~Lp~~~---~~~L~~ 243 (336) ++||. ...++||+..-.....+.+.+. .+..| .+|+.+|+..+|..-+ .|..|++.|.. +.-+.+ T Consensus 129 INGD~a~n~F~GL~~~~~~~q~i~~~~~-gg~~t----~d~LDeLl~~v~~~~g------~p~~~l~~~~~~r~i~A~~R 197 (310) T protein:vir:97 129 INGNGAGNEFAGLIQLCASGQKATTGAT-GSAIS----FAILDELMDLVVDKDG------QVDYLTMHARTLRSYKALLR 197 (310) T ss_pred hccccCCCcccchhhcCCccceeecCCC-CCCCC----HHHHHHHHHHHhcCCC------CCCEEEecHHHHHHHHHHHH Confidence 89987 5677899984211111211111 12344 4688888888876432 47789999975 333332 Q ss_pred C-----------CCCCccHHHHHHHhCCccEEEEcccccC-----CCCceEEEEEEeecCCce---EEEEcC------hh Q lcl|Aclame:pro 244 T-----------NQYGLAAAAKLKDIFPKLEFVTIPEYDT-----ASGRLVQLWAPRVEGKDT---ATCGFT------EK 298 (336) Q Consensus 244 ~-----------~~~~~Tvl~~l~~n~pnl~i~~~pel~~-----a~G~~~~~~~~~~~~~~~---~~~~~p------~~ 298 (336) . +.+|.-|+ .|-++-|...-.... .++++.-+|+-+.. .+. +-+.++ .. T Consensus 198 ~~~~~g~~~~~~~~~G~~v~-----~~~GiPi~~~d~ip~~~~~~~~~gtTsIya~r~G-e~~~~~Gv~Gl~~~~~~gls 271 (310) T protein:vir:97 198 ALGGASINEVVELPSGAEVP-----AYSGTPIFRNDYIPTNQTKGGTTGCTTIFAGTLD-DGSRTHGIAGLTATQAAGIQ 271 (310) T ss_pred HhcCCCCCCccccCCCCEEe-----eeCCeEEEEeCccCCCccccccCCceeEEEEeeC-ccccccceeccccCCcccee Confidence 1 11222221 233444333222211 12233334443443 221 111211 22 Q ss_pred hhccc-ceecC-CceEEccccceeeeeeecccceeeeccC Q lcl|Aclame:pro 299 MRAHS-IERYS-SYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) Q Consensus 299 ~r~l~-~~~~~-~~~~vp~~~~t~Gv~ir~P~av~~~~GI 336 (336) .|.++ .+.+. .+|.| ..+-|+.+.-|.|++.+.|| T Consensus 272 Vr~~G~~~~~~v~~~~V---~~Y~~~av~~~~A~a~L~~V 308 (310) T protein:vir:97 272 VVDVGESEDSDEHIWRV---KWYCGLALFSEKGLACADGI 308 (310) T ss_pred EEeCCcccCCcceeEEE---EEeeeEEEecccceeeeccc Confidence 34444 22222 24555 34689999999999999999 No 103 >protein:vir:4953 Length: 397 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:108 # MgeName: Sfi19 # Cross-refs: genbank:acc:NP_049929;genbank:gi:9632900;genbank:GeneID:1262076 Probab=88.69 E-value=0.031 Score=28.87 Aligned_cols=296 Identities=11% Similarity=-0.025 Sum_probs=132.3 Q ss_pred CchHHHH-H---Hhhhcce------eccchhhhccchhHHHHHhh-------hhhcccccccCcc--hHHHHHHHhhCce Q lcl|Aclame:pro 1 MRDAQRI-Q---NLARAGV------ILPRSVQNVSTPLTEYAMDA-------ADLSPHLSSTGSS--GIPNYLTTYVDPA 61 (336) Q Consensus 1 ~~~~~~~-~---~l~~~g~------~~~~~~~~~~~~~~~~a~da-------~d~~~~l~t~~~~--~i~~~l~~~idp~ 61 (336) ++...+. . .....+. .............+....+. ...+....+.+++ .+|..+. ++ T Consensus 53 ~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~t~~~gg~~vP~~~~----~~ 128 (397) T protein:vir:49 53 RDMFKEQYTEARANEVANMSEEEKKPLTKSEEEVKAGFVKDFKNLVRGRYQNLLDSKTDASGSDAGLTIPQDIQ----TA 128 (397) T ss_pred HHHHHHHHHHHHHHhhhccccccccccccchhHHHHHHHHHHHHHHhcchhHHHHHhhccccccCcccccHhHH----HH Confidence 1100000 0 0000000 00000000000000000000 0001111223333 3565554 45 Q ss_pred eeeeeccccchhhhcccccCCCcceeeEEEee-eecceeeEEeecccCCce-eeeeeeeeeeeEEEEEEEEeeCHHHHHH Q lcl|Aclame:pro 62 VIDILVAPMKAAELVGESKKGDWTTLVAAFIT-AEPTTKVATYGDYSSDGD-SGANINYPQRQSYFFQTWTRWGERELEM 139 (336) Q Consensus 62 v~~~~~~~~~~~~l~~v~t~g~w~~~t~~~~~-~e~~G~a~~ygd~~diP~-~~~~~~~~~~~v~~~~~~~~y~~~El~~ 139 (336) |++.+........++.+.....-. ..+.|.. .+..|.+...+.+..+|- .+.......-.++.++..+.+|..=++. T Consensus 129 ii~~~~~~~~l~~~~~~~~~~~~~-~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~i~~~~~k~~~~~~iS~ell~d 207 (397) T protein:vir:49 129 IHTLVSQYDSLQEYVNVENVTTLT-GSRVYEKWTDITGLANIDDEAGKIADVDDPKLSLIKYTIKRYAGISTVTNSLLAD 207 (397) T ss_pred HHHHHHhhhhHHhhhceeecccCc-cceEEEeeccCCcceeeecCccccccccccceeeEEeeeeeEEeeehhHHHHHhh Confidence 566666666666665553322111 1123333 344577888888888874 5677888888899999888888654443 Q ss_pred HHhhCCCHHHHHHHHHHHHHHHhhcceEEeeccccceEEEEecCCCCcccccccccccccCHHHHHHHHHHHHHHHHHHh Q lcl|Aclame:pro 140 AGAGRVDLASELNYSSALGLAKFLNGSYLFGVAGLENYGLINDPSLSAPITATTPWSGSPAVEAVVNEVVALFQVLQTQS 219 (336) Q Consensus 140 A~~~g~~l~~~k~~aAr~a~e~~~n~~~~~Gd~~~g~~GllN~Pnl~~~~~~~t~w~~~~t~~eI~~Di~~l~~~l~~~s 219 (336) + ..++.+.-....++++.+.++.-.+.|++..... ...++ ++||.+++..+...- T Consensus 208 s---~~~l~~~i~~~l~~~~~~~~d~ai~~G~g~~~~~------------------~~~~~----~d~i~~~~~~l~~~~ 262 (397) T protein:vir:49 208 S---AENILAWLSGWIAKKVVVTRNKAILEAIAALPTK------------------PTLTK----WDDIIDLEAKVDPAI 262 (397) T ss_pred h---HHHHHHHHHHHHHHHHHHHHHHHHHhhccccccc------------------ccccc----HHHHHHHHHhhhhhh Confidence 3 3567777777788888888888788886543210 01122 355666666664432 Q ss_pred CCceecccccEEEecHHHHHhccc-CCCCCccHHHH-HHHhCC----ccEEEE--ccccc-CCCCceEEEEEEeecCCce Q lcl|Aclame:pro 220 QGIITQEDVLRMGLPPTAMSDLSK-TNQYGLAAAAK-LKDIFP----KLEFVT--IPEYD-TASGRLVQLWAPRVEGKDT 290 (336) Q Consensus 220 ~g~v~~~~p~tL~Lp~~~~~~L~~-~~~~~~Tvl~~-l~~n~p----nl~i~~--~pel~-~a~G~~~~~~~~~~~~~~~ 290 (336) .....++|.++.+..|.+ .+..|.-++.= +....+ ++-++. ...+. ++.|....++.+-. +- T Consensus 263 ------~~~a~~vmn~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~l~G~PV~~~~~~~~~~~~~~~~~i~~gd~~---~~ 333 (397) T protein:vir:49 263 ------KQTSFFLTNTSGFTALKKVKNALGDYLMERDVKSPTGYSIDGFAVKEVADRWLANGTGGAMPLYFGDLK---QA 333 (397) T ss_pred ------cCCCEEEEcHHHHHHHHHhhcCCCceeeccCcCCCCCceecceeeEEecccccccccCCceeEEEeecc---ce Confidence 134578999999988854 23334333210 111111 111211 11122 22334334433211 00 Q ss_pred EEEEc--Chhhhcccc---eecCCceEEccccceeeeeeecccceeeeccC Q lcl|Aclame:pro 291 ATCGF--TEKMRAHSI---ERYSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) Q Consensus 291 ~~~~~--p~~~r~l~~---~~~~~~~~vp~~~~t~Gv~ir~P~av~~~~GI 336 (336) ..+.. ...+...+. ....-....-+..|.+ +.+++|.||+.+..= T Consensus 334 ~~~~~~~~~~i~~~~~~~~~~~~~~~~~r~~~r~d-~~~~~~~a~~~~~~~ 383 (397) T protein:vir:49 334 VTLFDRQHMSLLSTNIGGGAFETDTTKVRVIDRFD-VVATDTEAFVPASFK 383 (397) T ss_pred EEEEeecceEEEEeccccchhhcCceeEEEEeeeC-cEEecccceEEEEee Confidence 00000 111111110 0001122333444544 467788888876644 No 104 >protein:vir:101607 Length: 379 # NCBI annotation: major capsid protein precursor # Family: family:all:585 # MgeID: mge:1646 # MgeName: 11b # Cross-refs: genbank:acc:YP_112497;genbank:gi:53793597;uniprot:Q5ZGF6;genbank:GeneID:3101715 Probab=88.53 E-value=0.032 Score=28.80 Aligned_cols=291 Identities=12% Similarity=0.077 Sum_probs=131.5 Q ss_pred CchHH-HHHH----hhhcceeccchh---hhcc------chhHHHHHhhhhhcccccccCcc--hHHHHHHHhhCceeee Q lcl|Aclame:pro 1 MRDAQ-RIQN----LARAGVILPRSV---QNVS------TPLTEYAMDAADLSPHLSSTGSS--GIPNYLTTYVDPAVID 64 (336) Q Consensus 1 ~~~~~-~~~~----l~~~g~~~~~~~---~~~~------~~~~~~a~da~d~~~~l~t~~~~--~i~~~l~~~idp~v~~ 64 (336) .++.+ .+.+ +++.+-.-.... .... ...+....-.+..++.++++.+. .||.... +.|++ T Consensus 54 ~~~l~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ip~~~~----~~ii~ 129 (379) T protein:vir:10 54 MAALQAHADKLDVKLKEKAKSEDKSDSLVKSITENFNDIKEVRNGKSIQVKAVGDMTLPVNLTGAQPKDYN----FDVVL 129 (379) T ss_pred HHHHHHHHHHHHHHHHhcccccccchhHHHHHHHHHHhHHHHHhhhhhhhhhhcccccCCCCccccchhhh----hHHHH Confidence 11110 0111 111111100010 0000 00011100112333444443333 4565444 34555 Q ss_pred eeccccchhhhcccccCCCcceeeEEEeeeecceee--EEeecccCCceeeeeeeeeeeeEEEEEEEEeeCHHHHHHHHh Q lcl|Aclame:pro 65 ILVAPMKAAELVGESKKGDWTTLVAAFITAEPTTKV--ATYGDYSSDGDSGANINYPQRQSYFFQTWTRWGERELEMAGA 142 (336) Q Consensus 65 ~~~~~~~~~~l~~v~t~g~w~~~t~~~~~~e~~G~a--~~ygd~~diP~~~~~~~~~~~~v~~~~~~~~y~~~El~~A~~ 142 (336) ..-.......++.+.+... .++.|+.....+.+ ...+.+...|..+.........++.++..+.+|.+=|+-+. T Consensus 130 ~~~~~~~i~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~v~Eg~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~D~~- 205 (379) T protein:vir:10 130 NPSQMLNVSDIVGAVSISG---GTYTFVRENGAGEGAIGAQVEGATKGQKDYDISMIDVNTDFIAGFTRYSKKMANNLP- 205 (379) T ss_pred hHHhhhhHHhhceeeeccC---CceEEEEeecCCCcccccccCCccccccccceeeeEeeeeeEEeeehhhHHHHhhHH- Confidence 5555555666665543322 34566665544333 34577788999998888889999999998888865444432 Q ss_pred hCCCHHHHHHHHHHHHHHHhhcceEEeeccccceEEEEecCCCCcccccccccccccCHHHHHHHHHHHHHHHHHHhCCc Q lcl|Aclame:pro 143 GRVDLASELNYSSALGLAKFLNGSYLFGVAGLENYGLINDPSLSAPITATTPWSGSPAVEAVVNEVVALFQVLQTQSQGI 222 (336) Q Consensus 143 ~g~~l~~~k~~aAr~a~e~~~n~~~~~Gd~~~g~~GllN~Pnl~~~~~~~t~w~~~~t~~eI~~Di~~l~~~l~~~s~g~ 222 (336) .|.+--....++++.+.+|.-.+.|+...+..+ ....+ +.++ ++||.+++..+... + T Consensus 206 ---~l~~~i~~~la~~~~~~~~~~~~~g~~~~~~~~----------~~~~~---~~~~----~d~i~~~~~~~~~~--~- 262 (379) T protein:vir:10 206 ---FLTSFIPNALRRDYAKAENAAFNAVLAANATAS----------TEIIT---NKNK----VEMLINEIAKQENL--D- 262 (379) T ss_pred ---HHHHHHHHHHHHHHHHHHHHHHhcccccccccc----------ccccc---Cccc----HHHHHHHHHhhhhc--c- Confidence 366666666666666666654444433222111 01111 1122 45666666655322 1 Q ss_pred eecccccEEEecHHHHHhccc-CCCCCccHHH--HHHHh-----CCccEEEEcccccCCCCc-----eEE-EEEEeecCC Q lcl|Aclame:pro 223 ITQEDVLRMGLPPTAMSDLSK-TNQYGLAAAA--KLKDI-----FPKLEFVTIPEYDTASGR-----LVQ-LWAPRVEGK 288 (336) Q Consensus 223 v~~~~p~tL~Lp~~~~~~L~~-~~~~~~Tvl~--~l~~n-----~pnl~i~~~pel~~a~G~-----~~~-~~~~~~~~~ 288 (336) ..+..++|.|..+..|.+ .+..|.-++. ...++ .-++.++..+.+. +|. -.. .+..+ .+ T Consensus 263 ---~~~~~~vmn~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~~~l~G~pvv~s~~~~--ag~~~~gdf~~~~~~~~-~~- 335 (379) T protein:vir:10 263 ---FPVTAIVLRPTDYYDILVTQKSVGAGYGLPGVVTQDNGVLRINGIPLFRATWLA--ANKYYVGDWTRVTKVTT-EG- 335 (379) T ss_pred ---CCCCEEEEcHHHHHHHHHhhccCCceeccCCccCCCCCcceecceeeEecCCCC--CCceEEeecccEEEEEE-ec- Confidence 245679999998887753 2333332221 00011 1123333333332 221 111 11111 01 Q ss_pred ceEEEEcChhhhcccc-eecCCceEEccccceeeeeeecccceee--eccC Q lcl|Aclame:pro 289 DTATCGFTEKMRAHSI-ERYSSYFRQKKSAGTWGAVIFRPFAVAQ--MIGV 336 (336) Q Consensus 289 ~~~~~~~p~~~r~l~~-~~~~~~~~vp~~~~t~Gv~ir~P~av~~--~~GI 336 (336) ..+.+ ...+. ....-.+.+-++.|. |+.|++|-||++ +.+| T Consensus 336 --~~i~~----~~~~~~~f~~~~~~~r~~~R~-~~~v~~p~a~v~~~~~~~ 379 (379) T protein:vir:10 336 --LSLEF----SEVEGTNFVKNNITARIEAQV-ALAVEQPAALIFGDFTAV 379 (379) T ss_pred --eEEEE----eecccccccCCcEEEEEEEEe-ccEEecCccEEEEEecCC Confidence 11111 11110 011123445556666 556678999998 7788 No 105 >protein:vir:8843 Length: 317 # NCBI annotation: major head protein # Family: family:all:3919 # MgeID: mge:158 # MgeName: PaP3 # Cross-refs: genbank:acc:NP_775251;genbank:gi:27476049;genbank:GeneID:2700597 Probab=87.57 E-value=0.038 Score=28.38 Aligned_cols=280 Identities=9% Similarity=-0.032 Sum_probs=114.6 Q ss_pred hhccc--ccccCcchHHHHHHHhhCceeeeeeccccchhhhcccccCCCcceeeEEEeeeecceeeEEe-ecccCCceee Q lcl|Aclame:pro 37 DLSPH--LSSTGSSGIPNYLTTYVDPAVIDILVAPMKAAELVGESKKGDWTTLVAAFITAEPTTKVATY-GDYSSDGDSG 113 (336) Q Consensus 37 d~~~~--l~t~~~~~i~~~l~~~idp~v~~~~~~~~~~~~l~~v~t~g~w~~~t~~~~~~e~~G~a~~y-gd~~diP~~~ 113 (336) .+.|. .+|.-+.|.--=|...| +.+--.+.-...+++-.+. +..++.|+.-+....+... ..++|-|... T Consensus 1 ma~~~~~~~t~~~~g~~~dl~~~I----~~isp~dTPf~S~i~~~~a---~~~~~~W~~d~l~~~~~~~~~EG~da~~~~ 73 (317) T protein:vir:88 1 MATPTNAVSTVEINGKREDLIDII----YNIAPYDTPFMSAIGKGVA---TAITHEWQTDELRQPGKNTRVEGEDATIKA 73 (317) T ss_pred CCccccceEeeeeeeeeechhhhh----eecCCccCcceeeecCcee---cccEEEEEeeecCCccccccccCccccccc Confidence 12222 22223333322233333 2222222222334443221 2233444443433333211 1223333222 Q ss_pred eeeee---eeeeEEEEEEEEeeCHHHHHHHHhhCCCHHHHHHHHHHHHHHHhhcceEEeecc---------ccceEEEEe Q lcl|Aclame:pro 114 ANINY---PQRQSYFFQTWTRWGERELEMAGAGRVDLASELNYSSALGLAKFLNGSYLFGVA---------GLENYGLIN 181 (336) Q Consensus 114 ~~~~~---~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~k~~aAr~a~e~~~n~~~~~Gd~---------~~g~~GllN 181 (336) ..... -.-||++=...+.++.+-...++. -+..+....-+...+.+.++...+.|.. ...+-|+++ T Consensus 74 ~~~r~~~~N~tQIf~k~v~VSgTa~av~~~G~--~~ela~q~~kk~~EikrdmE~~li~g~~a~~~~~~t~~r~~~Gl~~ 151 (317) T protein:vir:88 74 GSFTTMLNNYCQISDETLQVTGTADRVKKAGR--KNELAYQLAKKSKELKLDMEYALVGAPQAKVQRNTTTPGQMANIFA 151 (317) T ss_pred ccCCEEeccEEEEEEeEEEEeehhhhhhhcCc--cchhHHHHHHHHHHHHHHHHHHHhcCeeeccCCCCccchhhhhHHH Confidence 21111 122355555555566655544432 2322222222222333333333334332 245667665 Q ss_pred c--CC-C-Ccc----c-ccccccccccCHHHH-HHHHHHHHHHHHHHhCCceecccccEEEecHHHHHhcccCCCCCccH Q lcl|Aclame:pro 182 D--PS-L-SAP----I-TATTPWSGSPAVEAV-VNEVVALFQVLQTQSQGIITQEDVLRMGLPPTAMSDLSKTNQYGLAA 251 (336) Q Consensus 182 ~--Pn-l-~~~----~-~~~t~w~~~~t~~eI-~~Di~~l~~~l~~~s~g~v~~~~p~tL~Lp~~~~~~L~~~~~~~~Tv 251 (336) - +| + .+. + ..+..|-+ -|+..+ -+||++++.++|..-+ .|+++.+++..-..|+.-...+.+. T Consensus 152 ~i~t~~~~~~~g~~~~~~~~~~~t~-~t~~~lte~~l~~~l~~i~~~Gg------~~~~i~v~a~~k~~i~~~~~~~~~~ 224 (317) T protein:vir:88 152 YYKTNGSLGANGVAPVGDGSNTGTA-GDLRLLTEDMLLNASESIWRNGG------QANSIQTSSSIKKAISKNMKGRATE 224 (317) T ss_pred HhccCceeccCccccccCCCccccc-cccccccHHHHHHHHHHHHhcCC------CCCEEEeChHHHHHHHHHhcCCcee Confidence 2 11 1 000 0 01112211 122223 4568889999998543 4678999999888776321111110 Q ss_pred HHHHHHhCCccEEEEcccccCCCCceEEEEEEeec--------CCceEEEEcChhhhcccceecCCceEEccccceeeee Q lcl|Aclame:pro 252 AAKLKDIFPKLEFVTIPEYDTASGRLVQLWAPRVE--------GKDTATCGFTEKMRAHSIERYSSYFRQKKSAGTWGAV 323 (336) Q Consensus 252 l~~l~~n~pnl~i~~~pel~~a~G~~~~~~~~~~~--------~~~~~~~~~p~~~r~l~~~~~~~~~~vp~~~~t~Gv~ 323 (336) .... .-.+.-...+-.+.+.-|. ..++..+.- +++.+++++=.+|..- ...+....+--....=+|+. T Consensus 225 i~~~--~~~~~~g~~v~~~~tdfG~-v~ii~~r~lp~~~~~~~D~~~~~l~~Lr~~~~e-~laKtGd~~k~~i~~E~tLe 300 (317) T protein:vir:88 225 ITLD--ASDNRIAQTVDVYESDFGK-YTIRANRWFHENTLFVFDPKMHSLCYLRPFFQH-ELAKTGDSEKRQLLVEYTFR 300 (317) T ss_pred EEEc--ccCeEEEEEEEEEEeCCeE-EEEEeCCCCCCCeEEEEcccccceeecccceee-ccCCCcccceeEEEEEEEEE Confidence 0000 0011122223333333332 222222211 2334444333333222 22333344445566678999 Q ss_pred eecccceeeeccC Q lcl|Aclame:pro 324 IFRPFAVAQMIGV 336 (336) Q Consensus 324 ir~P~av~~~~GI 336 (336) ++-|.|.+...|| T Consensus 301 ~~N~~a~a~i~~l 313 (317) T protein:vir:88 301 VNNEKSGALIRDV 313 (317) T ss_pred EcCccceeEEEEe Confidence 9999999999999 No 106 >protein:vir:94424 Length: 387 # NCBI annotation: ORF010 # Family: family:all:658 # MgeID: mge:1506 # MgeName: 47 # Cross-refs: genbank:acc:YP_240005;genbank:gi:66395666;genbank:GeneID:5133084 Probab=86.87 E-value=0.043 Score=28.10 Aligned_cols=295 Identities=8% Similarity=-0.030 Sum_probs=124.7 Q ss_pred CchHHHHH----Hh----h----hcce--eccchhhhc-------------cchhHHHHHhhhhhcccc--cccCcc--h Q lcl|Aclame:pro 1 MRDAQRIQ----NL----A----RAGV--ILPRSVQNV-------------STPLTEYAMDAADLSPHL--SSTGSS--G 49 (336) Q Consensus 1 ~~~~~~~~----~l----~----~~g~--~~~~~~~~~-------------~~~~~~~a~da~d~~~~l--~t~~~~--~ 49 (336) ...++.++ .+ + ..+- .-+...... ..........+......+ .+.+++ . T Consensus 50 ~~~~~~l~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~~~~~a~~~~~~~~gG~l 129 (387) T protein:vir:94 50 QQRFNIVERQVQDIEEKEKAKVKDKGEAYQSLSDNEKMVKAKAEFYRHAILPNEFEKPSMEAQRLLHALPTGNDSGGDKL 129 (387) T ss_pred HHHHHHHHHHHHHHHHHHHhhhhhccccCCCCchhHHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHhhhccCCCCCCcee Confidence 11111100 00 0 0000 000000000 000000011111111112 222222 3 Q ss_pred HHHHHHHhhCceeeeeeccccchhhhcccccCCCcceeeEEEee-eecceeeEEeecccCCceeeeeeeeeeeeEEEEEE Q lcl|Aclame:pro 50 IPNYLTTYVDPAVIDILVAPMKAAELVGESKKGDWTTLVAAFIT-AEPTTKVATYGDYSSDGDSGANINYPQRQSYFFQT 128 (336) Q Consensus 50 i~~~l~~~idp~v~~~~~~~~~~~~l~~v~t~g~w~~~t~~~~~-~e~~G~a~~ygd~~diP~~~~~~~~~~~~v~~~~~ 128 (336) ||..+.+ +|++.+...-....+..+.+.+.. .++. ....+.+...+.+...|-.+.......-..+.++. T Consensus 130 IP~~~~~----~Ii~~~~~~~~l~~~~~~~~~~~~-----~~p~~~~~~~~a~~v~Eg~~~~~~~~~f~~v~l~~~k~~~ 200 (387) T protein:vir:94 130 LPKTLSK----EIVSEPFAKNQLREKARLTNIKGL-----EIPRVSYTLDDDDFITDVETAKELKAKGDTVKFTTNKFKV 200 (387) T ss_pred echhHHH----HHHHHHHhhchhhhhceeeecCCc-----eeeeeeccCCccccccccccccccccccceeeechheeee Confidence 6765553 445554444445566655554432 2222 23345667778888888888887888888888888 Q ss_pred EEeeCHHHHHHHHhhCCCHHHHHHHHHHHHHHHhhcceE-EeeccccceEEEEecCCCCcccccccccccccCHHHHHHH Q lcl|Aclame:pro 129 WTRWGERELEMAGAGRVDLASELNYSSALGLAKFLNGSY-LFGVAGLENYGLINDPSLSAPITATTPWSGSPAVEAVVNE 207 (336) Q Consensus 129 ~~~y~~~El~~A~~~g~~l~~~k~~aAr~a~e~~~n~~~-~~Gd~~~g~~GllN~Pnl~~~~~~~t~w~~~~t~~eI~~D 207 (336) .+.+|.+=|.- ...++.+--....++++...+++.+ .-|++...-.|.++++.+... +.+..++| T Consensus 201 ~i~iS~ell~d---s~~~l~~~i~~~la~~~~~~e~~~~~~~g~g~g~~~g~~~~~~~~~~-----------~~~~~~d~ 266 (387) T protein:vir:94 201 FAAISDTVIHG---SDVDLVNWVENALQSGLAAKERKDALAVSPKSGLEHMSFYNGSVKEV-----------EGADMYDA 266 (387) T ss_pred echhhHHHHhh---hHHHHHHHHHHHHHHHHHHHHHHhHhhcCCCccccceeeeccccccc-----------cccchHHH Confidence 88888553432 3455666666666666666655543 345544445677776544321 11223667 Q ss_pred HHHHHHHHHHHhCCceecccccEEEecH-HHHHhcccCCCCCccHHHHHHHhCCccEEEEcccccCCCCceEEEEEEeec Q lcl|Aclame:pro 208 VVALFQVLQTQSQGIITQEDVLRMGLPP-TAMSDLSKTNQYGLAAAAKLKDIFPKLEFVTIPEYDTASGRLVQLWAPRVE 286 (336) Q Consensus 208 i~~l~~~l~~~s~g~v~~~~p~tL~Lp~-~~~~~L~~~~~~~~Tvl~~l~~n~pnl~i~~~pel~~a~G~~~~~~~~~~~ 286 (336) |.+++.+|...-. + .-..+|-+ .....+....+.|..++. --|+ ++-..|-.-+.+ .....|-+ .. T Consensus 267 i~~~~~~l~~~y~-----~-na~~imn~~t~~~~~~~~~~~~~~~~~----~~~~-~llG~PV~~~~~-~~~~~~GD-f~ 333 (387) T protein:vir:94 267 IINALADLHEDYR-----D-NATIYMRYADYVKIISVLSNGTTNFFD----TPAE-KVFGKPVVFTDA-AVKPIVGD-FN 333 (387) T ss_pred HHHHHhccChhhh-----c-CCEEEEechHHHHHHHHHhcCCCcccc----cCCc-cccccceEEecC-CCceeeec-hh Confidence 7777776644321 1 12355544 444444433333332221 1121 111112111111 01111110 00 Q ss_pred CCceEEEEcChhhhccc-ceecCCceEEccccceeeeeeecccceeeeccC Q lcl|Aclame:pro 287 GKDTATCGFTEKMRAHS-IERYSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) Q Consensus 287 ~~~~~~~~~p~~~r~l~-~~~~~~~~~vp~~~~t~Gv~ir~P~av~~~~GI 336 (336) . ..+ .-..+...+ -+........-+..|..|.+ ++|-||+.+.-= T Consensus 334 -~--~~~-~~~~~~~~~~~~~~~~~~~~~~~~r~Dg~v-~~~~A~~~l~~k 379 (387) T protein:vir:94 334 -Y--FGI-NYDGTTYDTDKDVKKGEYLFVLTAWYDQQR-TLDSAFRIAKAK 379 (387) T ss_pred -h--hhh-hhhhhhheecccccCCceEEEEEEEeCcEe-echhheEEEEee Confidence 0 000 000011111 11122345556677776665 469999875432 No 107 >protein:vir:2685 Length: 387 # NCBI annotation: hypothetical protein # Family: family:all:658 # MgeID: mge:57 # MgeName: phiSLT # Cross-refs: genbank:acc:NP_075504;genbank:gi:12719433;genbank:GeneID:920169 Probab=86.87 E-value=0.043 Score=28.10 Aligned_cols=295 Identities=8% Similarity=-0.030 Sum_probs=124.7 Q ss_pred CchHHHHH----Hh----h----hcce--eccchhhhc-------------cchhHHHHHhhhhhcccc--cccCcc--h Q lcl|Aclame:pro 1 MRDAQRIQ----NL----A----RAGV--ILPRSVQNV-------------STPLTEYAMDAADLSPHL--SSTGSS--G 49 (336) Q Consensus 1 ~~~~~~~~----~l----~----~~g~--~~~~~~~~~-------------~~~~~~~a~da~d~~~~l--~t~~~~--~ 49 (336) ...++.++ .+ + ..+- .-+...... ..........+......+ .+.+++ . T Consensus 50 ~~~~~~l~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~~~~~a~~~~~~~~gG~l 129 (387) T protein:vir:26 50 QQRFNIVERQVQDIEEKEKAKVKDKGEAYQSLSDNEKMVKAKAEFYRHAILPNEFEKPSMEAQRLLHALPTGNDSGGDKL 129 (387) T ss_pred HHHHHHHHHHHHHHHHHHHhhhhhccccCCCCchhHHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHhhhccCCCCCCcee Confidence 11111100 00 0 0000 000000000 000000011111111112 222222 3 Q ss_pred HHHHHHHhhCceeeeeeccccchhhhcccccCCCcceeeEEEee-eecceeeEEeecccCCceeeeeeeeeeeeEEEEEE Q lcl|Aclame:pro 50 IPNYLTTYVDPAVIDILVAPMKAAELVGESKKGDWTTLVAAFIT-AEPTTKVATYGDYSSDGDSGANINYPQRQSYFFQT 128 (336) Q Consensus 50 i~~~l~~~idp~v~~~~~~~~~~~~l~~v~t~g~w~~~t~~~~~-~e~~G~a~~ygd~~diP~~~~~~~~~~~~v~~~~~ 128 (336) ||..+.+ +|++.+...-....+..+.+.+.. .++. ....+.+...+.+...|-.+.......-..+.++. T Consensus 130 IP~~~~~----~Ii~~~~~~~~l~~~~~~~~~~~~-----~~p~~~~~~~~a~~v~Eg~~~~~~~~~f~~v~l~~~k~~~ 200 (387) T protein:vir:26 130 LPKTLSK----EIVSEPFAKNQLREKARLTNIKGL-----EIPRVSYTLDDDDFITDVETAKELKAKGDTVKFTTNKFKV 200 (387) T ss_pred echhHHH----HHHHHHHhhchhhhhceeeecCCc-----eeeeeeccCCccccccccccccccccccceeeechheeee Confidence 6765553 445554444445566655554432 2222 23345667778888888888887888888888888 Q ss_pred EEeeCHHHHHHHHhhCCCHHHHHHHHHHHHHHHhhcceE-EeeccccceEEEEecCCCCcccccccccccccCHHHHHHH Q lcl|Aclame:pro 129 WTRWGERELEMAGAGRVDLASELNYSSALGLAKFLNGSY-LFGVAGLENYGLINDPSLSAPITATTPWSGSPAVEAVVNE 207 (336) Q Consensus 129 ~~~y~~~El~~A~~~g~~l~~~k~~aAr~a~e~~~n~~~-~~Gd~~~g~~GllN~Pnl~~~~~~~t~w~~~~t~~eI~~D 207 (336) .+.+|.+=|.- ...++.+--....++++...+++.+ .-|++...-.|.++++.+... +.+..++| T Consensus 201 ~i~iS~ell~d---s~~~l~~~i~~~la~~~~~~e~~~~~~~g~g~g~~~g~~~~~~~~~~-----------~~~~~~d~ 266 (387) T protein:vir:26 201 FAAISDTVIHG---SDVDLVNWVENALQSGLAAKERKDALAVSPKSGLEHMSFYNGSVKEV-----------EGADMYDA 266 (387) T ss_pred echhhHHHHhh---hHHHHHHHHHHHHHHHHHHHHHHhHhhcCCCccccceeeeccccccc-----------cccchHHH Confidence 88888553432 3455666666666666666655543 345544445677776544321 11223667 Q ss_pred HHHHHHHHHHHhCCceecccccEEEecH-HHHHhcccCCCCCccHHHHHHHhCCccEEEEcccccCCCCceEEEEEEeec Q lcl|Aclame:pro 208 VVALFQVLQTQSQGIITQEDVLRMGLPP-TAMSDLSKTNQYGLAAAAKLKDIFPKLEFVTIPEYDTASGRLVQLWAPRVE 286 (336) Q Consensus 208 i~~l~~~l~~~s~g~v~~~~p~tL~Lp~-~~~~~L~~~~~~~~Tvl~~l~~n~pnl~i~~~pel~~a~G~~~~~~~~~~~ 286 (336) |.+++.+|...-. + .-..+|-+ .....+....+.|..++. --|+ ++-..|-.-+.+ .....|-+ .. T Consensus 267 i~~~~~~l~~~y~-----~-na~~imn~~t~~~~~~~~~~~~~~~~~----~~~~-~llG~PV~~~~~-~~~~~~GD-f~ 333 (387) T protein:vir:26 267 IINALADLHEDYR-----D-NATIYMRYADYVKIISVLSNGTTNFFD----TPAE-KVFGKPVVFTDA-AVKPIVGD-FN 333 (387) T ss_pred HHHHHhccChhhh-----c-CCEEEEechHHHHHHHHHhcCCCcccc----cCCc-cccccceEEecC-CCceeeec-hh Confidence 7777776644321 1 12355544 444444433333332221 1121 111112111111 01111110 00 Q ss_pred CCceEEEEcChhhhccc-ceecCCceEEccccceeeeeeecccceeeeccC Q lcl|Aclame:pro 287 GKDTATCGFTEKMRAHS-IERYSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) Q Consensus 287 ~~~~~~~~~p~~~r~l~-~~~~~~~~~vp~~~~t~Gv~ir~P~av~~~~GI 336 (336) . ..+ .-..+...+ -+........-+..|..|.+ ++|-||+.+.-= T Consensus 334 -~--~~~-~~~~~~~~~~~~~~~~~~~~~~~~r~Dg~v-~~~~A~~~l~~k 379 (387) T protein:vir:26 334 -Y--FGI-NYDGTTYDTDKDVKKGEYLFVLTAWYDQQR-TLDSAFRIAKAK 379 (387) T ss_pred -h--hhh-hhhhhhheecccccCCceEEEEEEEeCcEe-echhheEEEEee Confidence 0 000 000011111 11122345556677776665 469999875432 No 108 >protein:vir:96978 Length: 387 # NCBI annotation: ORF009 # Family: family:all:658 # MgeID: mge:1643 # MgeName: 42e # Cross-refs: genbank:acc:YP_239859;genbank:gi:66395517;genbank:GeneID:5133011 Probab=86.87 E-value=0.043 Score=28.10 Aligned_cols=295 Identities=8% Similarity=-0.030 Sum_probs=124.7 Q ss_pred CchHHHHH----Hh----h----hcce--eccchhhhc-------------cchhHHHHHhhhhhcccc--cccCcc--h Q lcl|Aclame:pro 1 MRDAQRIQ----NL----A----RAGV--ILPRSVQNV-------------STPLTEYAMDAADLSPHL--SSTGSS--G 49 (336) Q Consensus 1 ~~~~~~~~----~l----~----~~g~--~~~~~~~~~-------------~~~~~~~a~da~d~~~~l--~t~~~~--~ 49 (336) ...++.++ .+ + ..+- .-+...... ..........+......+ .+.+++ . T Consensus 50 ~~~~~~l~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~~~~~a~~~~~~~~gG~l 129 (387) T protein:vir:96 50 QQRFNIVERQVQDIEEKEKAKVKDKGEAYQSLSDNEKMVKAKAEFYRHAILPNEFEKPSMEAQRLLHALPTGNDSGGDKL 129 (387) T ss_pred HHHHHHHHHHHHHHHHHHHhhhhhccccCCCCchhHHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHhhhccCCCCCCcee Confidence 11111100 00 0 0000 000000000 000000011111111112 222222 3 Q ss_pred HHHHHHHhhCceeeeeeccccchhhhcccccCCCcceeeEEEee-eecceeeEEeecccCCceeeeeeeeeeeeEEEEEE Q lcl|Aclame:pro 50 IPNYLTTYVDPAVIDILVAPMKAAELVGESKKGDWTTLVAAFIT-AEPTTKVATYGDYSSDGDSGANINYPQRQSYFFQT 128 (336) Q Consensus 50 i~~~l~~~idp~v~~~~~~~~~~~~l~~v~t~g~w~~~t~~~~~-~e~~G~a~~ygd~~diP~~~~~~~~~~~~v~~~~~ 128 (336) ||..+.+ +|++.+...-....+..+.+.+.. .++. ....+.+...+.+...|-.+.......-..+.++. T Consensus 130 IP~~~~~----~Ii~~~~~~~~l~~~~~~~~~~~~-----~~p~~~~~~~~a~~v~Eg~~~~~~~~~f~~v~l~~~k~~~ 200 (387) T protein:vir:96 130 LPKTLSK----EIVSEPFAKNQLREKARLTNIKGL-----EIPRVSYTLDDDDFITDVETAKELKAKGDTVKFTTNKFKV 200 (387) T ss_pred echhHHH----HHHHHHHhhchhhhhceeeecCCc-----eeeeeeccCCccccccccccccccccccceeeechheeee Confidence 6765553 445554444445566655554432 2222 23345667778888888888887888888888888 Q ss_pred EEeeCHHHHHHHHhhCCCHHHHHHHHHHHHHHHhhcceE-EeeccccceEEEEecCCCCcccccccccccccCHHHHHHH Q lcl|Aclame:pro 129 WTRWGERELEMAGAGRVDLASELNYSSALGLAKFLNGSY-LFGVAGLENYGLINDPSLSAPITATTPWSGSPAVEAVVNE 207 (336) Q Consensus 129 ~~~y~~~El~~A~~~g~~l~~~k~~aAr~a~e~~~n~~~-~~Gd~~~g~~GllN~Pnl~~~~~~~t~w~~~~t~~eI~~D 207 (336) .+.+|.+=|.- ...++.+--....++++...+++.+ .-|++...-.|.++++.+... +.+..++| T Consensus 201 ~i~iS~ell~d---s~~~l~~~i~~~la~~~~~~e~~~~~~~g~g~g~~~g~~~~~~~~~~-----------~~~~~~d~ 266 (387) T protein:vir:96 201 FAAISDTVIHG---SDVDLVNWVENALQSGLAAKERKDALAVSPKSGLEHMSFYNGSVKEV-----------EGADMYDA 266 (387) T ss_pred echhhHHHHhh---hHHHHHHHHHHHHHHHHHHHHHHhHhhcCCCccccceeeeccccccc-----------cccchHHH Confidence 88888553432 3455666666666666666655543 345544445677776544321 11223667 Q ss_pred HHHHHHHHHHHhCCceecccccEEEecH-HHHHhcccCCCCCccHHHHHHHhCCccEEEEcccccCCCCceEEEEEEeec Q lcl|Aclame:pro 208 VVALFQVLQTQSQGIITQEDVLRMGLPP-TAMSDLSKTNQYGLAAAAKLKDIFPKLEFVTIPEYDTASGRLVQLWAPRVE 286 (336) Q Consensus 208 i~~l~~~l~~~s~g~v~~~~p~tL~Lp~-~~~~~L~~~~~~~~Tvl~~l~~n~pnl~i~~~pel~~a~G~~~~~~~~~~~ 286 (336) |.+++.+|...-. + .-..+|-+ .....+....+.|..++. --|+ ++-..|-.-+.+ .....|-+ .. T Consensus 267 i~~~~~~l~~~y~-----~-na~~imn~~t~~~~~~~~~~~~~~~~~----~~~~-~llG~PV~~~~~-~~~~~~GD-f~ 333 (387) T protein:vir:96 267 IINALADLHEDYR-----D-NATIYMRYADYVKIISVLSNGTTNFFD----TPAE-KVFGKPVVFTDA-AVKPIVGD-FN 333 (387) T ss_pred HHHHHhccChhhh-----c-CCEEEEechHHHHHHHHHhcCCCcccc----cCCc-cccccceEEecC-CCceeeec-hh Confidence 7777776644321 1 12355544 444444433333332221 1121 111112111111 01111110 00 Q ss_pred CCceEEEEcChhhhccc-ceecCCceEEccccceeeeeeecccceeeeccC Q lcl|Aclame:pro 287 GKDTATCGFTEKMRAHS-IERYSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) Q Consensus 287 ~~~~~~~~~p~~~r~l~-~~~~~~~~~vp~~~~t~Gv~ir~P~av~~~~GI 336 (336) . ..+ .-..+...+ -+........-+..|..|.+ ++|-||+.+.-= T Consensus 334 -~--~~~-~~~~~~~~~~~~~~~~~~~~~~~~r~Dg~v-~~~~A~~~l~~k 379 (387) T protein:vir:96 334 -Y--FGI-NYDGTTYDTDKDVKKGEYLFVLTAWYDQQR-TLDSAFRIAKAK 379 (387) T ss_pred -h--hhh-hhhhhhheecccccCCceEEEEEEEeCcEe-echhheEEEEee Confidence 0 000 000011111 11122345556677776665 469999875432 No 109 >protein:vir:79078 Length: 307 # NCBI annotation: gp8 # Family: family:all:908 # MgeID: mge:1862 # MgeName: phiE255 # Cross-refs: genbank:acc:YP_001111208;genbank:gi:134288798;genbank:GeneID:4960752 Probab=86.86 E-value=0.043 Score=28.09 Aligned_cols=263 Identities=12% Similarity=0.066 Sum_probs=104.7 Q ss_pred HhhhhhcccccccCcchHHHHHHHhhCceeeeeeccccchhhhcccccCCCcceeeEEEeeeecceeeE------Eeecc Q lcl|Aclame:pro 33 MDAADLSPHLSSTGSSGIPNYLTTYVDPAVIDILVAPMKAAELVGESKKGDWTTLVAAFITAEPTTKVA------TYGDY 106 (336) Q Consensus 33 ~da~d~~~~l~t~~~~~i~~~l~~~idp~v~~~~~~~~~~~~l~~v~t~g~w~~~t~~~~~~e~~G~a~------~ygd~ 106 (336) |... .++- ..++-+.++-..|= -+++-+++|||....+.-. +.|..++.-+... .-|+. T Consensus 1 m~~~-~~~~---~~dp~LT~~A~gy~--------n~~~Iad~lfP~vpV~~~~---~k~~~f~~e~f~~~~t~ra~~~~~ 65 (307) T protein:vir:79 1 MGRL-SKLR---IVDPVLTNLAIGYT--------NAEFIGQTLMPVVEVEKEG---GKIPKFGKESFRLYQTERALRAKS 65 (307) T ss_pred CCCC-CCCc---ccCHHHHHHHhhcc--------chhhhhhhcCCcccccccc---cceeeeccccccccccccccCCCc Confidence 2111 1111 11222223333332 3557788888876554322 2232222111100 01111 Q ss_pred cCCceeeeeeeeeeeeEEEEEEEEeeCHHHHHHHHhhCCCHHHHHHHH----HHHHHHHhhcceEEeeccccceEEEEec Q lcl|Aclame:pro 107 SSDGDSGANINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASELNYS----SALGLAKFLNGSYLFGVAGLENYGLIND 182 (336) Q Consensus 107 ~diP~~~~~~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~k~~a----Ar~a~e~~~n~~~~~Gd~~~g~~GllN~ 182 (336) +.+...+ ++.....+. +.+..+.++ .+..+..++++.+++..- ..+..|...-++++-.. |. T Consensus 66 ~~v~~~~--~~~~~~~~~--~~~l~~~id-~r~~~~~~~~~~~~Av~~l~d~I~l~~E~~~A~l~~~~~---------~y 131 (307) T protein:vir:79 66 NRMNPED--IDSVDVNLD--EHDLEYPID-YREDQESAFPLEQAAVQTATDAIQLRREKMIADLSQNPS---------SY 131 (307) T ss_pred ceeeeec--ccccccccc--ccchhhccc-chhcCCCCCCHHHHHHHHHHHHHHhHHHHHHHHHhcccc---------cc Confidence 1111000 000000001 111111111 123334455555544333 23444444445554322 12 Q ss_pred CCCC-cccccccccccccCHHHHHHHHHHHHHHHHHHhCCceecccccEEEecHHHHHhccc---------CCCCC-ccH Q lcl|Aclame:pro 183 PSLS-APITATTPWSGSPAVEAVVNEVVALFQVLQTQSQGIITQEDVLRMGLPPTAMSDLSK---------TNQYG-LAA 251 (336) Q Consensus 183 Pnl~-~~~~~~t~w~~~~t~~eI~~Di~~l~~~l~~~s~g~v~~~~p~tL~Lp~~~~~~L~~---------~~~~~-~Tv 251 (336) |+-. ...+.++.|.+. +.| ++.||.+....+...++ -.|++++|....+..|.+ .+..+ +| T Consensus 132 ~~~~k~tLsgt~~Wsd~-~sD-Pi~di~~~~~ai~~~~g-----~~Pn~~vlg~~a~~~l~~h~~i~~~lk~~~~g~it- 203 (307) T protein:vir:79 132 AAGNKKQLSATEKFTAA-NSD-PVGVIEDGKEAIRTKIG-----RRPNTMVIGASAYKTLKAHPQLIEKIKYSMKGIVT- 203 (307) T ss_pred CCCceEEEccCcccCCC-CCC-cHHHHHHHHHHHHHhhC-----CccceEEeCHHHHHHHhcCHHHHHHhcCccccccC- Confidence 2211 122344566553 444 89999999999988775 369999999999998853 12223 34 Q ss_pred HHHHHHhCCccEEEEccc--ccCCCC-------ceEE-EEEEeecCCceEEEEcC-hhhhcccceecCCceEEcccc--- Q lcl|Aclame:pro 252 AAKLKDIFPKLEFVTIPE--YDTASG-------RLVQ-LWAPRVEGKDTATCGFT-EKMRAHSIERYSSYFRQKKSA--- 317 (336) Q Consensus 252 l~~l~~n~pnl~i~~~pe--l~~a~G-------~~~~-~~~~~~~~~~~~~~~~p-~~~r~l~~~~~~~~~~vp~~~--- 317 (336) .++|++-+ .++.+.+-+ +.++++ +... +++...-+.....+.-| .-+. .+.++..+..+..+ T Consensus 204 ~~~la~l~-~v~~V~vg~a~y~~~~~~~~~iw~~~~~l~y~~~~~~~~~~~~~~ps~Gyt---~~~~g~~~~d~~~~~~~ 279 (307) T protein:vir:79 204 VDLLKEIF-EVENIAVGEAIYADDKDRFTDIWGANIVLAYVPLQRGGQQRTPYEPSYGYT---LRKKGNPVVDTRIEDGK 279 (307) T ss_pred HHHHHHHh-CceeEEEeeeeeecccccchhcCCCceEEEecccccCCCCCccccccccee---EEecCceEEecccCCCc Confidence 34565543 344222222 223332 2222 22221111010000001 1111 12222223333332 Q ss_pred --ceeeeeeecccceeeeccC Q lcl|Aclame:pro 318 --GTWGAVIFRPFAVAQMIGV 336 (336) Q Consensus 318 --~t~Gv~ir~P~av~~~~GI 336 (336) .+....+..|.-++.-.|. T Consensus 280 ~~~vrv~~~~~~~i~~~~~G~ 300 (307) T protein:vir:79 280 LELVRATDIFRPYLLGADAGY 300 (307) T ss_pred eeEEeecccccceeeccccch Confidence 2333444567666666665 No 110 >protein:vir:3845 Length: 395 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:322 # MgeName: phi adh # Cross-refs: genbank:acc:NP_050151;swissprot:trembl:q9t1f6;genbank:gi:9633043;uniprot:Q9T1F6;genbank:GeneID:1262163 Probab=86.33 E-value=0.046 Score=27.90 Aligned_cols=295 Identities=11% Similarity=-0.030 Sum_probs=125.8 Q ss_pred CchHHHH--------------HHh---hhcceec----cchhhhc---------cchhHHHHHhhhhhcccccccCcch- Q lcl|Aclame:pro 1 MRDAQRI--------------QNL---ARAGVIL----PRSVQNV---------STPLTEYAMDAADLSPHLSSTGSSG- 49 (336) Q Consensus 1 ~~~~~~~--------------~~l---~~~g~~~----~~~~~~~---------~~~~~~~a~da~d~~~~l~t~~~~~- 49 (336) +.+..++ +.. ++..+.- +...... ........... .... .++.+++| T Consensus 40 ~ee~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~-~~~~~~gg~ 117 (395) T protein:vir:38 40 VDDINKLNASLKNAKMAQELAKSAYEDARANLNAEPVNKKPLPVKDGKPDAQAMKNQFVKDFKNL-VTSG-TTGTGNAGL 117 (395) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccccccchhhhhHHHHHHHHHHHHHHHHH-Hhhc-cCccCCCce Confidence 0000000 000 0000000 0000000 00001111111 1111 22233333 Q ss_pred -HHHHHHHhhCceeeeeeccccchhhhcccccCCCcceeeEEEeee-ecceeeEEeecccCCcee-eeeeeeeeeeEEEE Q lcl|Aclame:pro 50 -IPNYLTTYVDPAVIDILVAPMKAAELVGESKKGDWTTLVAAFITA-EPTTKVATYGDYSSDGDS-GANINYPQRQSYFF 126 (336) Q Consensus 50 -i~~~l~~~idp~v~~~~~~~~~~~~l~~v~t~g~w~~~t~~~~~~-e~~G~a~~ygd~~diP~~-~~~~~~~~~~v~~~ 126 (336) +|..+. ++|++.....-....+..+.....-. ..+.+... +..+.+...+....+|-. .........+.+.+ T Consensus 118 ~vP~~~~----~~ii~~~~~~~~l~~~~~~~~~~~~~-~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~f~~v~~~~~k~ 192 (395) T protein:vir:38 118 TIPEDIQ----LQIRTLTRSFTSLESLANVENVTTSH-GSRVYEKLADITPLKDLDDESALIGDNDDPELTVVKYLIHRY 192 (395) T ss_pred ecchhHh----hHHHHHHHhhcchhhhcceeeccCCc-ceEEEEeeccCCccccccccccccccccccceeeEEeeeeee Confidence 565554 35555555555555554432221101 12233322 333445566777777744 45677778888888 Q ss_pred EEEEeeCHHHHHHHHhhCCCHHHHHHHHHHHHHHHhhcceEEeeccccceEEEEecCCCCcccccccccccccCHHHHHH Q lcl|Aclame:pro 127 QTWTRWGERELEMAGAGRVDLASELNYSSALGLAKFLNGSYLFGVAGLENYGLINDPSLSAPITATTPWSGSPAVEAVVN 206 (336) Q Consensus 127 ~~~~~y~~~El~~A~~~g~~l~~~k~~aAr~a~e~~~n~~~~~Gd~~~g~~GllN~Pnl~~~~~~~t~w~~~~t~~eI~~ 206 (336) +..+.+|..=++. ...+|.+--......++.+.+++-+++|++.... .+ ..++.+ T Consensus 193 ~~~~~iS~ell~d---s~~~l~~~i~~~la~~~~~~~~~~il~g~g~~~~--------~~----------~~~~~~---- 247 (395) T protein:vir:38 193 AGITTVTNTLLKD---TVDNIIQWLVNWAAKKDVVTRNAKILEVMGKAPK--------KP----------TISQFD---- 247 (395) T ss_pred EeehhhHHHHHhh---hHHHHHHHHHHHHHHHHHHHHHHHHhhccccccc--------cc----------ccccHH---- Confidence 8888887653332 3456777778888888888888888888764321 00 012233 Q ss_pred HHHHHHHHHHHHhCCceecccccEEEecHHHHHhccc-CCCCCccHHHH-HHHhCCc----cEEEEcc--cccCCCCceE Q lcl|Aclame:pro 207 EVVALFQVLQTQSQGIITQEDVLRMGLPPTAMSDLSK-TNQYGLAAAAK-LKDIFPK----LEFVTIP--EYDTASGRLV 278 (336) Q Consensus 207 Di~~l~~~l~~~s~g~v~~~~p~tL~Lp~~~~~~L~~-~~~~~~Tvl~~-l~~n~pn----l~i~~~p--el~~a~G~~~ 278 (336) ||.++++...... + .....++|.+..+..|.+ .+..|.-++.- +....|+ ..+.... .+..+++... T Consensus 248 ~i~~~~~~~l~~~---~--~~~a~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~ 322 (395) T protein:vir:38 248 NIKDLENNTLDPA---I--ESTSSFITNQSGYNILSKVKDADGRYLMQPDVTSPDKYLIDGKPVIRIADKWLPDVSGSHP 322 (395) T ss_pred HHHHHHHHhhhhh---h--cCCCEEEEcHHHHHHHHHhhccCCceeeccCcCCCCcceeccceeEEecccccCcCCCcce Confidence 4444443221111 0 123468999999888854 33334333210 1111111 1111111 1222233333 Q ss_pred EEEEEeecCCceEEEEcChh--hhccc---ceecCCceEEccccceeeeeeecccceeeeccC Q lcl|Aclame:pro 279 QLWAPRVEGKDTATCGFTEK--MRAHS---IERYSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) Q Consensus 279 ~~~~~~~~~~~~~~~~~p~~--~r~l~---~~~~~~~~~vp~~~~t~Gv~ir~P~av~~~~GI 336 (336) .++.+-. +...+..-+. +.... .....-.+..-+..|.+| .+.+|.||+.++.- T Consensus 323 i~~gd~~---~~~~i~~~~~~~i~~~~~~~~~~~~~~~~~r~~~r~d~-~~~~~~a~~~~~~~ 381 (395) T protein:vir:38 323 LYFGDLK---QGITLFDRQQMQIDTTNVGAGSFEHDTTKLRFIDRFDV-QLIDDGAFAAASFK 381 (395) T ss_pred EEEEecc---ccEEEEEecceEEEEeccccchhhcCceEEEEEEeecc-EEecccceEEEEee Confidence 3333211 0000000000 11100 001112344555666655 56669999999977 No 111 >protein:vir:107882 Length: 307 # NCBI annotation: gp34 # Family: family:all:908 # MgeID: mge:1565 # MgeName: BcepMu # Cross-refs: genbank:acc:YP_024707;genbank:gi:48696944;genbank:GeneID:2845970 Probab=86.05 E-value=0.048 Score=27.80 Aligned_cols=261 Identities=11% Similarity=0.077 Sum_probs=112.1 Q ss_pred HhhhhhcccccccCcchHHHHHHHhhCceeeeeeccccchhhhcccccCCCcceeeEEEeeeecceeeE--------Eee Q lcl|Aclame:pro 33 MDAADLSPHLSSTGSSGIPNYLTTYVDPAVIDILVAPMKAAELVGESKKGDWTTLVAAFITAEPTTKVA--------TYG 104 (336) Q Consensus 33 ~da~d~~~~l~t~~~~~i~~~l~~~idp~v~~~~~~~~~~~~l~~v~t~g~w~~~t~~~~~~e~~G~a~--------~yg 104 (336) |..+ .++ -..++-+.++-+.|=.+ .|-+++|||...++--.-...+|+ . ++. .-| T Consensus 1 m~~~-~~~---~~~dp~LT~~A~gy~n~--------~~ia~~l~P~vpv~~~~~k~~~f~---~--eaF~~~~t~r~~~~ 63 (307) T protein:vir:10 1 MGRL-SKL---RIVDPVLTNLAIGYTNA--------EFIGQSLMPVVEVEKEGGKIPKFG---K--ESFRLYKTERALRA 63 (307) T ss_pred CCCC-CCC---cccChhHHHHHHhhcch--------hhhhhhcCCcccccccccceeeEC---c--ccccchhhhcccCC Confidence 2111 111 11233333444555443 467788888876554332333342 2 121 111 Q ss_pred cccCCceeeeeeeeeeeeEEEEEEEEeeCHHHHHHHHhhCCCHHHHHHHHHHHHH----HHhhcceEEeeccccceEEEE Q lcl|Aclame:pro 105 DYSSDGDSGANINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASELNYSSALGL----AKFLNGSYLFGVAGLENYGLI 180 (336) Q Consensus 105 d~~diP~~~~~~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~k~~aAr~a~----e~~~n~~~~~Gd~~~g~~Gll 180 (336) +.+-+-.... .......-+.+..+-+. .+.++.+..++.+++...++..+ |...-++++-.. T Consensus 64 ~~~~v~~~~~----~~~~~~~~~~~L~~~id-~r~~~~~~~~~~~~av~~l~d~I~l~~E~~~A~l~~~~~--------- 129 (307) T protein:vir:10 64 RSNRMNPEDL----GSIDIVLDEHDLEYPID-YREDQESAFPLEQAAVQTATEAIQLRREKMVADLAQNPN--------- 129 (307) T ss_pred Ccceeecccc----cccccccccccccccCC-hhhcCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHhcCcc--------- Confidence 1111100000 00111111112222222 13445556666665554443333 333344433211 Q ss_pred ecCCCCc-ccccccccccccCHHHHHHHHHHHHHHHHHHhCCceecccccEEEecHHHHHhccc---------CCCCC-c Q lcl|Aclame:pro 181 NDPSLSA-PITATTPWSGSPAVEAVVNEVVALFQVLQTQSQGIITQEDVLRMGLPPTAMSDLSK---------TNQYG-L 249 (336) Q Consensus 181 N~Pnl~~-~~~~~t~w~~~~t~~eI~~Di~~l~~~l~~~s~g~v~~~~p~tL~Lp~~~~~~L~~---------~~~~~-~ 249 (336) |.|+-.. ..+.+++|.+ .+.| ++.||.+....+...++ -.|++++|..+.+..|.+ .+..| + T Consensus 130 ~y~~~~k~tLsGt~~Wsd-~~sD-Pi~di~~~~~ai~~~~g-----~~Pn~~vlg~~a~~al~~hp~i~e~lk~~~~g~i 202 (307) T protein:vir:10 130 SYAGGNKKQLSATEKFTA-AGSD-PVGVIEDGKEAIRTKIG-----RRPNTMVIGASAYKTLKAHPQLIEKIKYSMKGIV 202 (307) T ss_pred ccCCCceEEeccccccCC-CCCC-cHHHHHHHHHHHHhhhC-----CccceEEeCHHHHHHHhcCHHHHHHhCCcccccc Confidence 1121111 1233456654 4444 89999999999988765 369999999999998853 11223 4 Q ss_pred cHHHHHHHhCCccEEEEccc--ccCCCC-------ceE-EEEEEeecCCceEEEEcChhhhcccceecCCceEEccccce Q lcl|Aclame:pro 250 AAAAKLKDIFPKLEFVTIPE--YDTASG-------RLV-QLWAPRVEGKDTATCGFTEKMRAHSIERYSSYFRQKKSAGT 319 (336) Q Consensus 250 Tvl~~l~~n~pnl~i~~~pe--l~~a~G-------~~~-~~~~~~~~~~~~~~~~~p~~~r~l~~~~~~~~~~vp~~~~t 319 (336) |. +.|++-+ .++.+.+-+ +.++.+ +.. .++++...+.+...+..|. | -+-.+.++..+..+..+. T Consensus 203 t~-~~la~ll-~v~~i~vg~a~~~~~~~~~~~iw~~~~vl~yv~~~~~~~~~~~~eps-f-GyT~~~~g~~~~d~~~~~- 277 (307) T protein:vir:10 203 TV-DLLKEIF-EVENIAVGEAIYADDKDRFTDIWGANIVLAYVPLQRGGQQRTPYEPS-Y-GYTLRKKGNPVVDTRIED- 277 (307) T ss_pred CH-HHHHHHh-CceeEEEeeeeeeccCCccceeCCCceEEEecccccCCCCCcccccc-c-ceeEEEcCCeEeeceecC- Confidence 43 4555443 344333322 223332 222 2333222111211111121 1 111233445555555543 Q ss_pred eeee------eecccceeeeccC Q lcl|Aclame:pro 320 WGAV------IFRPFAVAQMIGV 336 (336) Q Consensus 320 ~Gv~------ir~P~av~~~~GI 336 (336) +|+. +.+|.-+....|. T Consensus 278 ~~~~~~r~~~~~~~~i~~~~~G~ 300 (307) T protein:vir:10 278 GKLELVRSTDIFRPYLLGADAGY 300 (307) T ss_pred CceeEEeccccccceeecccccc Confidence 3433 3456666666665 No 112 >protein:vir:98480 Length: 348 # NCBI annotation: ORFp38 # Family: family:all:1083 # MgeID: mge:1589 # MgeName: VWB # Cross-refs: genbank:acc:NP_958280;genbank:gi:41057254;uniprot:Q38595;genbank:GeneID:2732864 Probab=85.75 E-value=0.051 Score=27.69 Aligned_cols=275 Identities=10% Similarity=0.015 Sum_probs=100.2 Q ss_pred HhhhhhcccccccCcchHHHHHHHhhCceeeeeeccccchhhhcccccCCCcceeeEEEeeeecceeeEEeecc----cC Q lcl|Aclame:pro 33 MDAADLSPHLSSTGSSGIPNYLTTYVDPAVIDILVAPMKAAELVGESKKGDWTTLVAAFITAEPTTKVATYGDY----SS 108 (336) Q Consensus 33 ~da~d~~~~l~t~~~~~i~~~l~~~idp~v~~~~~~~~~~~~l~~v~t~g~w~~~t~~~~~~e~~G~a~~ygd~----~d 108 (336) |.- +...+--=|..|+.+|.........+.+-.+.+||.... .++.|..+.........+.+ .. T Consensus 1 M~~-------~~~~d~~~~~~l~~~i~~~~~~~~~~~~l~~~~fp~~~~-----~~~~~~~~~~~~~~~~~a~~~~~~~~ 68 (348) T protein:vir:98 1 MSW-------TLDTEFIEPTQLTGLIREALRDLQVNRFRLARWLPNVDV-----DDITFEFLRGGGGLAETASYRSWDTE 68 (348) T ss_pred Ccc-------hhhhhccCHHHHHHHHHHHhhccCcchhhHHhcCCCccc-----cceEEEEEeccCCceeeeeeecCCCc Confidence 110 000010113556666621111222333556788886431 22334333222221111221 12 Q ss_pred Cceeee-eeeeeeeeEEEEEEEEeeCHHHHHHHHhhCC--------CHHHHHHHHHHHHHHHhhc------ceEEeeccc Q lcl|Aclame:pro 109 DGDSGA-NINYPQRQSYFFQTWTRWGERELEMAGAGRV--------DLASELNYSSALGLAKFLN------GSYLFGVAG 173 (336) Q Consensus 109 iP~~~~-~~~~~~~~v~~~~~~~~y~~~El~~A~~~g~--------~l~~~k~~aAr~a~e~~~n------~~~~~Gd~~ 173 (336) -|+.+- ..+..+.++-.++..+..+..|+...+.... +...+...++++..|.... ++.+-|.. T Consensus 69 ~~~~~r~g~~~~~~~~~~i~~~~~i~~~d~~~~~~~~~~~~~~~i~~d~~~l~~~i~~r~E~m~~qal~~Gki~~~g~~- 147 (348) T protein:vir:98 69 SKIGRREGLAKVMGELPPISEKIPLNEYDRLRLRKLSRDEALPFIARDAQRLARNIGARFEVARGSALVNATVPVTELQ- 147 (348) T ss_pred cceeecccceeeeeeccccccccccCHHHHHHhcCChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCeEEEecCc- Confidence 233321 1222333344455556677766655432211 0111223334444443332 34444433 Q ss_pred cce-EEEEecCCCCcccccccccccccCHHHHHHHHHHHHHHHHHHhCCceecccccEEEecHHHHHhcccC-------- Q lcl|Aclame:pro 174 LEN-YGLINDPSLSAPITATTPWSGSPAVEAVVNEVVALFQVLQTQSQGIITQEDVLRMGLPPTAMSDLSKT-------- 244 (336) Q Consensus 174 ~g~-~GllN~Pnl~~~~~~~t~w~~~~t~~eI~~Di~~l~~~l~~~s~g~v~~~~p~tL~Lp~~~~~~L~~~-------- 244 (336) +.+ ||. |+-. ..++++.|++..+++ +++||.+....+...++ . .|++++|.+..+..|.+- T Consensus 148 ~~vDyg~---~~~~-~~t~~~~Ws~~~~ad-p~~di~~~~~~~~~~~G-~----~p~~~vm~~~~~~~l~~~~~i~~~~~ 217 (348) T protein:vir:98 148 QTVDFGR---IGSH-SVVAAVLWSVHATAT-PISDLESWVATYEDTNG-Q----SPGVILMPKAAVSHMRQCEEVIRQVF 217 (348) T ss_pred eEEcccc---Cccc-ccccccccCCCCCCC-HHHHHHHHHHHHHHccC-C----cceEEEeCHHHHHHHhcCHHHHHHHh Confidence 221 333 3222 245677887555544 88999999888876554 2 589999999999988431 Q ss_pred --C-CC--C-cc--HHHHHHHhCCccEEEEcccccCCCCceEEEEEEeecCCceEEEEcCh------------hhhcccc Q lcl|Aclame:pro 245 --N-QY--G-LA--AAAKLKDIFPKLEFVTIPEYDTASGRLVQLWAPRVEGKDTATCGFTE------------KMRAHSI 304 (336) Q Consensus 245 --~-~~--~-~T--vl~~l~~n~pnl~i~~~pel~~a~G~~~~~~~~~~~~~~~~~~~~p~------------~~r~l~~ 304 (336) + +. . .+ .+..+...+--..|+.--+.-...|....++-+ +.. +.+|. -.+.+++ T Consensus 218 ~~~~~~~~~~~~~~~~~~~~~~~g~~~i~~~d~~~~~~g~~~~~~p~-----~~i-~l~p~~~~~~~~~~~~~G~t~~G~ 291 (348) T protein:vir:98 218 PLAPSGTAPMVSVEQLNTVLSSMGLPPIEVYDAKVAVDGVSTRITPA-----NAI-ALLPEPGATDAAQPTELGATLLGT 291 (348) T ss_pred ccCccccccccCHHHHHHHHHhhCCeEEEEeeeEEEcCCceeceecC-----CeE-EEEecCCcccccccccccceeccc Confidence 0 00 0 11 112222222212222211111112322222210 011 01111 0011110 Q ss_pred --eecCC---------------ceEEccccceeeeeeecccce-eeeccC Q lcl|Aclame:pro 305 --ERYSS---------------YFRQKKSAGTWGAVIFRPFAV-AQMIGV 336 (336) Q Consensus 305 --~~~~~---------------~~~vp~~~~t~Gv~ir~P~av-~~~~GI 336 (336) +...+ .|+..--.+.+=..--+|+-+ .+.+++ T Consensus 292 ~~e~~~~~~~~~~~~~~~i~~~~~~~~dP~~~~~~~~s~~lPv~~~~~~~ 341 (348) T protein:vir:98 292 TAESLEDDYALAPGEQPGIVAATWKTKDPVRLWTHAAAVGIPVLREPNLT 341 (348) T ss_pred chhhhccccccceeccCceeeeeeeecCCcEEEEEEeeeeeccccCCCcE Confidence 00111 111110001110011112211 112222 No 113 >protein:vir:105004 Length: 392 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:1490 # MgeName: W Beta # Cross-refs: genbank:acc:YP_459969;genbank:gi:85701384;genbank:GeneID:3882145 Probab=85.70 E-value=0.051 Score=27.67 Aligned_cols=295 Identities=6% Similarity=-0.069 Sum_probs=130.9 Q ss_pred CchH----------HHHHHhhh----cceeccc-----hhh------------hccchhHHHHHhhhhhcc--cccccCc Q lcl|Aclame:pro 1 MRDA----------QRIQNLAR----AGVILPR-----SVQ------------NVSTPLTEYAMDAADLSP--HLSSTGS 47 (336) Q Consensus 1 ~~~~----------~~~~~l~~----~g~~~~~-----~~~------------~~~~~~~~~a~da~d~~~--~l~t~~~ 47 (336) +.+. +++.+.+. .+-.... ..+ .+....+. .......+. ...|+++ T Consensus 35 ~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~-~~~~~~~~~~~~~~t~~~ 113 (392) T protein:vir:10 35 MEEVRSLQKKIDLQRSLDEAETEERNNGREVETRNVDGEMEYRDVFMKALRNKPLNAEERE-FLEDDLEQRAMSGLTGED 113 (392) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhccccccccCccchHHHHHHHHHHHhcccccHHHHH-HHhhhhhhhhccccccCC Confidence 0000 00000000 0000000 000 00000000 111111111 1123334 Q ss_pred c--hHHHHHHHhhCceeeeeeccccchhhhcccccCCCcceeeEEEeeeecceeeEEeecccCCceee-eeeeeeeeeEE Q lcl|Aclame:pro 48 S--GIPNYLTTYVDPAVIDILVAPMKAAELVGESKKGDWTTLVAAFITAEPTTKVATYGDYSSDGDSG-ANINYPQRQSY 124 (336) Q Consensus 48 ~--~i~~~l~~~idp~v~~~~~~~~~~~~l~~v~t~g~w~~~t~~~~~~e~~G~a~~ygd~~diP~~~-~~~~~~~~~v~ 124 (336) + .+|..+. ++|++.+...-....+.++..... ......+......+.+...+.+...|-.+ ...+...-..+ T Consensus 114 gg~~vP~~~~----~~ii~~~~~~s~l~~~~~~~~~~~-~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~v~l~~~ 188 (392) T protein:vir:10 114 GGLVIPQDIQ----TQINELARSFDALEQYVTVEPVRT-RSGSRVLEKNSDMIPFAEITEMGEIPETDNPKFSNVQYAVK 188 (392) T ss_pred CceecchhHH----HHHHHHHHhhhhhhhhceeeeccC-CceeEEEEeecCCccceeecccccccccccccceeEEeeee Confidence 4 3665444 455666555555556655533221 11123344444455677778777777544 56777788888 Q ss_pred EEEEEEeeCHHHHHHHHhhCCCHHHHHHHHHHHHHHHhhcceEEeeccccceEEEEecCCCCcccccccccccccCHHHH Q lcl|Aclame:pro 125 FFQTWTRWGERELEMAGAGRVDLASELNYSSALGLAKFLNGSYLFGVAGLENYGLINDPSLSAPITATTPWSGSPAVEAV 204 (336) Q Consensus 125 ~~~~~~~y~~~El~~A~~~g~~l~~~k~~aAr~a~e~~~n~~~~~Gd~~~g~~GllN~Pnl~~~~~~~t~w~~~~t~~eI 204 (336) .++..+.+|.+=|+.+ ..+|.+.-....+.++.+.++.-.+.|++.... . +.++.+.| T Consensus 189 k~~~~~~iS~ell~ds---~~~l~~~i~~~l~~~i~~~~d~~~~~g~g~~~~---------------~----~~~~~d~i 246 (392) T protein:vir:10 189 DRAGILPLSRSLLQDS---DQNILKYVTKWLGKKSKVTRNVLILGVIEKLTK---------------Q----AIKSLDDI 246 (392) T ss_pred eEEEeehhhHHHHhhh---HHHHHHHHHHHHHHHHHHHHHHHHhhccccccc---------------c----CccCHHHH Confidence 8888888987766543 456888888888888888888777766653211 0 11233444 Q ss_pred HHHHHHHHHHHHHHhCCceecccccEEEecHHHHHhccc-CCCCCccHHHH-HHHhCC----ccEEE---Ec--cc-ccC Q lcl|Aclame:pro 205 VNEVVALFQVLQTQSQGIITQEDVLRMGLPPTAMSDLSK-TNQYGLAAAAK-LKDIFP----KLEFV---TI--PE-YDT 272 (336) Q Consensus 205 ~~Di~~l~~~l~~~s~g~v~~~~p~tL~Lp~~~~~~L~~-~~~~~~Tvl~~-l~~n~p----nl~i~---~~--pe-l~~ 272 (336) .+-++..+... . . ..-+++|.++.+..|.+ .+..|.-++.- +....+ ++.++ .. |. ... T Consensus 247 ~~~~~~~l~~~------~-~--~~a~~vm~~~~~~~L~~lkd~~G~~l~~~~~~~~~~~tllG~~~v~~~~~~~~~~~~~ 317 (392) T protein:vir:10 247 KDVLNVKLDPA------I-S--PNAILLTNQDGFNYLDKLKDKDGKYILQSDPTQKNKKLFAGTNPVVVVSNRFLKSKGT 317 (392) T ss_pred HHHHHHhhhhh------h-c--cCCEEEEcHHHHHHHHHhhccCCCeEeecCccCCccccccCcccEEEecccccCCCcc Confidence 44333222111 0 1 23468999999888854 23333222210 111111 12211 11 11 122 Q ss_pred CCCceEEEEEEeecCCceEEEE--cChhhhcccc---eecCCceEEccccceeeeeeecccceeeeccC Q lcl|Aclame:pro 273 ASGRLVQLWAPRVEGKDTATCG--FTEKMRAHSI---ERYSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) Q Consensus 273 a~G~~~~~~~~~~~~~~~~~~~--~p~~~r~l~~---~~~~~~~~vp~~~~t~Gv~ir~P~av~~~~GI 336 (336) +.+....++.+-. +...+. -...+...+. ....-...+-|..|.+| .+++|.||+.+..- T Consensus 318 ~~~~~~~~~gdfs---~~~~i~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~-~v~~~~a~~~l~~~ 382 (392) T protein:vir:10 318 TAKKAPLIIGDLK---EAIVLFKREDMELASTDVGGKAFTRNTLDLRAIQRDDV-QMWDNEAAVYGEID 382 (392) T ss_pred cCCceEEEEEehh---ceEEEEeecceEEEEeccccchhhcCceEEEEEEeecc-EEecccceEEEEec Confidence 2344444443210 000000 0111111111 01112344667777775 67779999998765 No 114 >protein:vir:107593 Length: 392 # NCBI annotation: major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1491 # MgeName: Gamma # Cross-refs: genbank:acc:YP_338188;genbank:gi:77020144;genbank:GeneID:3703724 Probab=85.70 E-value=0.051 Score=27.67 Aligned_cols=295 Identities=6% Similarity=-0.069 Sum_probs=130.9 Q ss_pred CchH----------HHHHHhhh----cceeccc-----hhh------------hccchhHHHHHhhhhhcc--cccccCc Q lcl|Aclame:pro 1 MRDA----------QRIQNLAR----AGVILPR-----SVQ------------NVSTPLTEYAMDAADLSP--HLSSTGS 47 (336) Q Consensus 1 ~~~~----------~~~~~l~~----~g~~~~~-----~~~------------~~~~~~~~~a~da~d~~~--~l~t~~~ 47 (336) +.+. +++.+.+. .+-.... ..+ .+....+. .......+. ...|+++ T Consensus 35 ~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~-~~~~~~~~~~~~~~t~~~ 113 (392) T protein:vir:10 35 MEEVRSLQKKIDLQRSLDEAETEERNNGREVETRNVDGEMEYRDVFMKALRNKPLNAEERE-FLEDDLEQRAMSGLTGED 113 (392) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhccccccccCccchHHHHHHHHHHHhcccccHHHHH-HHhhhhhhhhccccccCC Confidence 0000 00000000 0000000 000 00000000 111111111 1123334 Q ss_pred c--hHHHHHHHhhCceeeeeeccccchhhhcccccCCCcceeeEEEeeeecceeeEEeecccCCceee-eeeeeeeeeEE Q lcl|Aclame:pro 48 S--GIPNYLTTYVDPAVIDILVAPMKAAELVGESKKGDWTTLVAAFITAEPTTKVATYGDYSSDGDSG-ANINYPQRQSY 124 (336) Q Consensus 48 ~--~i~~~l~~~idp~v~~~~~~~~~~~~l~~v~t~g~w~~~t~~~~~~e~~G~a~~ygd~~diP~~~-~~~~~~~~~v~ 124 (336) + .+|..+. ++|++.+...-....+.++..... ......+......+.+...+.+...|-.+ ...+...-..+ T Consensus 114 gg~~vP~~~~----~~ii~~~~~~s~l~~~~~~~~~~~-~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~v~l~~~ 188 (392) T protein:vir:10 114 GGLVIPQDIQ----TQINELARSFDALEQYVTVEPVRT-RSGSRVLEKNSDMIPFAEITEMGEIPETDNPKFSNVQYAVK 188 (392) T ss_pred CceecchhHH----HHHHHHHHhhhhhhhhceeeeccC-CceeEEEEeecCCccceeecccccccccccccceeEEeeee Confidence 4 3665444 455666555555556655533221 11123344444455677778777777544 56777788888 Q ss_pred EEEEEEeeCHHHHHHHHhhCCCHHHHHHHHHHHHHHHhhcceEEeeccccceEEEEecCCCCcccccccccccccCHHHH Q lcl|Aclame:pro 125 FFQTWTRWGERELEMAGAGRVDLASELNYSSALGLAKFLNGSYLFGVAGLENYGLINDPSLSAPITATTPWSGSPAVEAV 204 (336) Q Consensus 125 ~~~~~~~y~~~El~~A~~~g~~l~~~k~~aAr~a~e~~~n~~~~~Gd~~~g~~GllN~Pnl~~~~~~~t~w~~~~t~~eI 204 (336) .++..+.+|.+=|+.+ ..+|.+.-....+.++.+.++.-.+.|++.... . +.++.+.| T Consensus 189 k~~~~~~iS~ell~ds---~~~l~~~i~~~l~~~i~~~~d~~~~~g~g~~~~---------------~----~~~~~d~i 246 (392) T protein:vir:10 189 DRAGILPLSRSLLQDS---DQNILKYVTKWLGKKSKVTRNVLILGVIEKLTK---------------Q----AIKSLDDI 246 (392) T ss_pred eEEEeehhhHHHHhhh---HHHHHHHHHHHHHHHHHHHHHHHHhhccccccc---------------c----CccCHHHH Confidence 8888888987766543 456888888888888888888777766653211 0 11233444 Q ss_pred HHHHHHHHHHHHHHhCCceecccccEEEecHHHHHhccc-CCCCCccHHHH-HHHhCC----ccEEE---Ec--cc-ccC Q lcl|Aclame:pro 205 VNEVVALFQVLQTQSQGIITQEDVLRMGLPPTAMSDLSK-TNQYGLAAAAK-LKDIFP----KLEFV---TI--PE-YDT 272 (336) Q Consensus 205 ~~Di~~l~~~l~~~s~g~v~~~~p~tL~Lp~~~~~~L~~-~~~~~~Tvl~~-l~~n~p----nl~i~---~~--pe-l~~ 272 (336) .+-++..+... . . ..-+++|.++.+..|.+ .+..|.-++.- +....+ ++.++ .. |. ... T Consensus 247 ~~~~~~~l~~~------~-~--~~a~~vm~~~~~~~L~~lkd~~G~~l~~~~~~~~~~~tllG~~~v~~~~~~~~~~~~~ 317 (392) T protein:vir:10 247 KDVLNVKLDPA------I-S--PNAILLTNQDGFNYLDKLKDKDGKYILQSDPTQKNKKLFAGTNPVVVVSNRFLKSKGT 317 (392) T ss_pred HHHHHHhhhhh------h-c--cCCEEEEcHHHHHHHHHhhccCCCeEeecCccCCccccccCcccEEEecccccCCCcc Confidence 44333222111 0 1 23468999999888854 23333222210 111111 12211 11 11 122 Q ss_pred CCCceEEEEEEeecCCceEEEE--cChhhhcccc---eecCCceEEccccceeeeeeecccceeeeccC Q lcl|Aclame:pro 273 ASGRLVQLWAPRVEGKDTATCG--FTEKMRAHSI---ERYSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) Q Consensus 273 a~G~~~~~~~~~~~~~~~~~~~--~p~~~r~l~~---~~~~~~~~vp~~~~t~Gv~ir~P~av~~~~GI 336 (336) +.+....++.+-. +...+. -...+...+. ....-...+-|..|.+| .+++|.||+.+..- T Consensus 318 ~~~~~~~~~gdfs---~~~~i~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~-~v~~~~a~~~l~~~ 382 (392) T protein:vir:10 318 TAKKAPLIIGDLK---EAIVLFKREDMELASTDVGGKAFTRNTLDLRAIQRDDV-QMWDNEAAVYGEID 382 (392) T ss_pred cCCceEEEEEehh---ceEEEEeecceEEEEeccccchhhcCceEEEEEEeecc-EEecccceEEEEec Confidence 2344444443210 000000 0111111111 01112344667777775 67779999998765 No 115 >protein:vir:102873 Length: 392 # NCBI annotation: major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1492 # MgeName: Cherry # Cross-refs: genbank:acc:YP_338137;genbank:gi:77020198;genbank:GeneID:3703782 Probab=85.70 E-value=0.051 Score=27.67 Aligned_cols=295 Identities=6% Similarity=-0.069 Sum_probs=130.9 Q ss_pred CchH----------HHHHHhhh----cceeccc-----hhh------------hccchhHHHHHhhhhhcc--cccccCc Q lcl|Aclame:pro 1 MRDA----------QRIQNLAR----AGVILPR-----SVQ------------NVSTPLTEYAMDAADLSP--HLSSTGS 47 (336) Q Consensus 1 ~~~~----------~~~~~l~~----~g~~~~~-----~~~------------~~~~~~~~~a~da~d~~~--~l~t~~~ 47 (336) +.+. +++.+.+. .+-.... ..+ .+....+. .......+. ...|+++ T Consensus 35 ~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~-~~~~~~~~~~~~~~t~~~ 113 (392) T protein:vir:10 35 MEEVRSLQKKIDLQRSLDEAETEERNNGREVETRNVDGEMEYRDVFMKALRNKPLNAEERE-FLEDDLEQRAMSGLTGED 113 (392) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhccccccccCccchHHHHHHHHHHHhcccccHHHHH-HHhhhhhhhhccccccCC Confidence 0000 00000000 0000000 000 00000000 111111111 1123334 Q ss_pred c--hHHHHHHHhhCceeeeeeccccchhhhcccccCCCcceeeEEEeeeecceeeEEeecccCCceee-eeeeeeeeeEE Q lcl|Aclame:pro 48 S--GIPNYLTTYVDPAVIDILVAPMKAAELVGESKKGDWTTLVAAFITAEPTTKVATYGDYSSDGDSG-ANINYPQRQSY 124 (336) Q Consensus 48 ~--~i~~~l~~~idp~v~~~~~~~~~~~~l~~v~t~g~w~~~t~~~~~~e~~G~a~~ygd~~diP~~~-~~~~~~~~~v~ 124 (336) + .+|..+. ++|++.+...-....+.++..... ......+......+.+...+.+...|-.+ ...+...-..+ T Consensus 114 gg~~vP~~~~----~~ii~~~~~~s~l~~~~~~~~~~~-~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~v~l~~~ 188 (392) T protein:vir:10 114 GGLVIPQDIQ----TQINELARSFDALEQYVTVEPVRT-RSGSRVLEKNSDMIPFAEITEMGEIPETDNPKFSNVQYAVK 188 (392) T ss_pred CceecchhHH----HHHHHHHHhhhhhhhhceeeeccC-CceeEEEEeecCCccceeecccccccccccccceeEEeeee Confidence 4 3665444 455666555555556655533221 11123344444455677778777777544 56777788888 Q ss_pred EEEEEEeeCHHHHHHHHhhCCCHHHHHHHHHHHHHHHhhcceEEeeccccceEEEEecCCCCcccccccccccccCHHHH Q lcl|Aclame:pro 125 FFQTWTRWGERELEMAGAGRVDLASELNYSSALGLAKFLNGSYLFGVAGLENYGLINDPSLSAPITATTPWSGSPAVEAV 204 (336) Q Consensus 125 ~~~~~~~y~~~El~~A~~~g~~l~~~k~~aAr~a~e~~~n~~~~~Gd~~~g~~GllN~Pnl~~~~~~~t~w~~~~t~~eI 204 (336) .++..+.+|.+=|+.+ ..+|.+.-....+.++.+.++.-.+.|++.... . +.++.+.| T Consensus 189 k~~~~~~iS~ell~ds---~~~l~~~i~~~l~~~i~~~~d~~~~~g~g~~~~---------------~----~~~~~d~i 246 (392) T protein:vir:10 189 DRAGILPLSRSLLQDS---DQNILKYVTKWLGKKSKVTRNVLILGVIEKLTK---------------Q----AIKSLDDI 246 (392) T ss_pred eEEEeehhhHHHHhhh---HHHHHHHHHHHHHHHHHHHHHHHHhhccccccc---------------c----CccCHHHH Confidence 8888888987766543 456888888888888888888777766653211 0 11233444 Q ss_pred HHHHHHHHHHHHHHhCCceecccccEEEecHHHHHhccc-CCCCCccHHHH-HHHhCC----ccEEE---Ec--cc-ccC Q lcl|Aclame:pro 205 VNEVVALFQVLQTQSQGIITQEDVLRMGLPPTAMSDLSK-TNQYGLAAAAK-LKDIFP----KLEFV---TI--PE-YDT 272 (336) Q Consensus 205 ~~Di~~l~~~l~~~s~g~v~~~~p~tL~Lp~~~~~~L~~-~~~~~~Tvl~~-l~~n~p----nl~i~---~~--pe-l~~ 272 (336) .+-++..+... . . ..-+++|.++.+..|.+ .+..|.-++.- +....+ ++.++ .. |. ... T Consensus 247 ~~~~~~~l~~~------~-~--~~a~~vm~~~~~~~L~~lkd~~G~~l~~~~~~~~~~~tllG~~~v~~~~~~~~~~~~~ 317 (392) T protein:vir:10 247 KDVLNVKLDPA------I-S--PNAILLTNQDGFNYLDKLKDKDGKYILQSDPTQKNKKLFAGTNPVVVVSNRFLKSKGT 317 (392) T ss_pred HHHHHHhhhhh------h-c--cCCEEEEcHHHHHHHHHhhccCCCeEeecCccCCccccccCcccEEEecccccCCCcc Confidence 44333222111 0 1 23468999999888854 23333222210 111111 12211 11 11 122 Q ss_pred CCCceEEEEEEeecCCceEEEE--cChhhhcccc---eecCCceEEccccceeeeeeecccceeeeccC Q lcl|Aclame:pro 273 ASGRLVQLWAPRVEGKDTATCG--FTEKMRAHSI---ERYSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) Q Consensus 273 a~G~~~~~~~~~~~~~~~~~~~--~p~~~r~l~~---~~~~~~~~vp~~~~t~Gv~ir~P~av~~~~GI 336 (336) +.+....++.+-. +...+. -...+...+. ....-...+-|..|.+| .+++|.||+.+..- T Consensus 318 ~~~~~~~~~gdfs---~~~~i~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~-~v~~~~a~~~l~~~ 382 (392) T protein:vir:10 318 TAKKAPLIIGDLK---EAIVLFKREDMELASTDVGGKAFTRNTLDLRAIQRDDV-QMWDNEAAVYGEID 382 (392) T ss_pred cCCceEEEEEehh---ceEEEEeecceEEEEeccccchhhcCceEEEEEEeecc-EEecccceEEEEec Confidence 2344444443210 000000 0111111111 01112344667777775 67779999998765 No 116 >protein:vir:102082 Length: 392 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:1503 # MgeName: Fah # Cross-refs: genbank:acc:YP_512315;genbank:gi:89152484;genbank:GeneID:3953075 Probab=85.70 E-value=0.051 Score=27.67 Aligned_cols=295 Identities=6% Similarity=-0.069 Sum_probs=130.9 Q ss_pred CchH----------HHHHHhhh----cceeccc-----hhh------------hccchhHHHHHhhhhhcc--cccccCc Q lcl|Aclame:pro 1 MRDA----------QRIQNLAR----AGVILPR-----SVQ------------NVSTPLTEYAMDAADLSP--HLSSTGS 47 (336) Q Consensus 1 ~~~~----------~~~~~l~~----~g~~~~~-----~~~------------~~~~~~~~~a~da~d~~~--~l~t~~~ 47 (336) +.+. +++.+.+. .+-.... ..+ .+....+. .......+. ...|+++ T Consensus 35 ~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~-~~~~~~~~~~~~~~t~~~ 113 (392) T protein:vir:10 35 MEEVRSLQKKIDLQRSLDEAETEERNNGREVETRNVDGEMEYRDVFMKALRNKPLNAEERE-FLEDDLEQRAMSGLTGED 113 (392) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhccccccccCccchHHHHHHHHHHHhcccccHHHHH-HHhhhhhhhhccccccCC Confidence 0000 00000000 0000000 000 00000000 111111111 1123334 Q ss_pred c--hHHHHHHHhhCceeeeeeccccchhhhcccccCCCcceeeEEEeeeecceeeEEeecccCCceee-eeeeeeeeeEE Q lcl|Aclame:pro 48 S--GIPNYLTTYVDPAVIDILVAPMKAAELVGESKKGDWTTLVAAFITAEPTTKVATYGDYSSDGDSG-ANINYPQRQSY 124 (336) Q Consensus 48 ~--~i~~~l~~~idp~v~~~~~~~~~~~~l~~v~t~g~w~~~t~~~~~~e~~G~a~~ygd~~diP~~~-~~~~~~~~~v~ 124 (336) + .+|..+. ++|++.+...-....+.++..... ......+......+.+...+.+...|-.+ ...+...-..+ T Consensus 114 gg~~vP~~~~----~~ii~~~~~~s~l~~~~~~~~~~~-~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~v~l~~~ 188 (392) T protein:vir:10 114 GGLVIPQDIQ----TQINELARSFDALEQYVTVEPVRT-RSGSRVLEKNSDMIPFAEITEMGEIPETDNPKFSNVQYAVK 188 (392) T ss_pred CceecchhHH----HHHHHHHHhhhhhhhhceeeeccC-CceeEEEEeecCCccceeecccccccccccccceeEEeeee Confidence 4 3665444 455666555555556655533221 11123344444455677778777777544 56777788888 Q ss_pred EEEEEEeeCHHHHHHHHhhCCCHHHHHHHHHHHHHHHhhcceEEeeccccceEEEEecCCCCcccccccccccccCHHHH Q lcl|Aclame:pro 125 FFQTWTRWGERELEMAGAGRVDLASELNYSSALGLAKFLNGSYLFGVAGLENYGLINDPSLSAPITATTPWSGSPAVEAV 204 (336) Q Consensus 125 ~~~~~~~y~~~El~~A~~~g~~l~~~k~~aAr~a~e~~~n~~~~~Gd~~~g~~GllN~Pnl~~~~~~~t~w~~~~t~~eI 204 (336) .++..+.+|.+=|+.+ ..+|.+.-....+.++.+.++.-.+.|++.... . +.++.+.| T Consensus 189 k~~~~~~iS~ell~ds---~~~l~~~i~~~l~~~i~~~~d~~~~~g~g~~~~---------------~----~~~~~d~i 246 (392) T protein:vir:10 189 DRAGILPLSRSLLQDS---DQNILKYVTKWLGKKSKVTRNVLILGVIEKLTK---------------Q----AIKSLDDI 246 (392) T ss_pred eEEEeehhhHHHHhhh---HHHHHHHHHHHHHHHHHHHHHHHHhhccccccc---------------c----CccCHHHH Confidence 8888888987766543 456888888888888888888777766653211 0 11233444 Q ss_pred HHHHHHHHHHHHHHhCCceecccccEEEecHHHHHhccc-CCCCCccHHHH-HHHhCC----ccEEE---Ec--cc-ccC Q lcl|Aclame:pro 205 VNEVVALFQVLQTQSQGIITQEDVLRMGLPPTAMSDLSK-TNQYGLAAAAK-LKDIFP----KLEFV---TI--PE-YDT 272 (336) Q Consensus 205 ~~Di~~l~~~l~~~s~g~v~~~~p~tL~Lp~~~~~~L~~-~~~~~~Tvl~~-l~~n~p----nl~i~---~~--pe-l~~ 272 (336) .+-++..+... . . ..-+++|.++.+..|.+ .+..|.-++.- +....+ ++.++ .. |. ... T Consensus 247 ~~~~~~~l~~~------~-~--~~a~~vm~~~~~~~L~~lkd~~G~~l~~~~~~~~~~~tllG~~~v~~~~~~~~~~~~~ 317 (392) T protein:vir:10 247 KDVLNVKLDPA------I-S--PNAILLTNQDGFNYLDKLKDKDGKYILQSDPTQKNKKLFAGTNPVVVVSNRFLKSKGT 317 (392) T ss_pred HHHHHHhhhhh------h-c--cCCEEEEcHHHHHHHHHhhccCCCeEeecCccCCccccccCcccEEEecccccCCCcc Confidence 44333222111 0 1 23468999999888854 23333222210 111111 12211 11 11 122 Q ss_pred CCCceEEEEEEeecCCceEEEE--cChhhhcccc---eecCCceEEccccceeeeeeecccceeeeccC Q lcl|Aclame:pro 273 ASGRLVQLWAPRVEGKDTATCG--FTEKMRAHSI---ERYSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) Q Consensus 273 a~G~~~~~~~~~~~~~~~~~~~--~p~~~r~l~~---~~~~~~~~vp~~~~t~Gv~ir~P~av~~~~GI 336 (336) +.+....++.+-. +...+. -...+...+. ....-...+-|..|.+| .+++|.||+.+..- T Consensus 318 ~~~~~~~~~gdfs---~~~~i~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~-~v~~~~a~~~l~~~ 382 (392) T protein:vir:10 318 TAKKAPLIIGDLK---EAIVLFKREDMELASTDVGGKAFTRNTLDLRAIQRDDV-QMWDNEAAVYGEID 382 (392) T ss_pred cCCceEEEEEehh---ceEEEEeecceEEEEeccccchhhcCceEEEEEEeecc-EEecccceEEEEec Confidence 2344444443210 000000 0111111111 01112344667777775 67779999998765 No 117 >protein:vir:1383 Length: 421 # NCBI annotation: major capsid protein # Family: family:all:21 # MgeID: mge:314 # MgeName: phi3626 # Cross-refs: genbank:acc:NP_612835;genbank:gi:20065969;genbank:GeneID:935826 Probab=85.60 E-value=0.052 Score=27.64 Aligned_cols=295 Identities=8% Similarity=-0.064 Sum_probs=117.6 Q ss_pred CchHH-------HHHHhhhccee-ccc-------hh-hhccchhHHH--HHhh---hhhcccccccCcc--hHHHHHHHh Q lcl|Aclame:pro 1 MRDAQ-------RIQNLARAGVI-LPR-------SV-QNVSTPLTEY--AMDA---ADLSPHLSSTGSS--GIPNYLTTY 57 (336) Q Consensus 1 ~~~~~-------~~~~l~~~g~~-~~~-------~~-~~~~~~~~~~--a~da---~d~~~~l~t~~~~--~i~~~l~~~ 57 (336) |...+ ......+.... ... .. .......+.. .+.. ......+.+.+++ -||..+. T Consensus 54 i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ra~~t~~~gg~liP~~~~-- 131 (421) T protein:vir:13 54 MEIIEEEIESVMTAIDEERKNTNFTGGRVIINGDSKEEKRSLQLSAMSKTIRGIQLSEEERDIMSSTNNGAVIPQEFV-- 131 (421) T ss_pred HHHHHHHHHHHHHHHHHHHhhhcccccccccccchhHHHHHHHHHHHHHhhhccchhHHHhhccccCCcceecchhhH-- Confidence 11111 00000000000 000 00 0000000000 0000 0001112333333 3665444 Q ss_pred hCceeeeeeccccchhhhcccccCCCcceeeEEEeeeecce--eeEEeecccCCceeeeeeeeeeeeEEEEEEEEeeCHH Q lcl|Aclame:pro 58 VDPAVIDILVAPMKAAELVGESKKGDWTTLVAAFITAEPTT--KVATYGDYSSDGDSGANINYPQRQSYFFQTWTRWGER 135 (336) Q Consensus 58 idp~v~~~~~~~~~~~~l~~v~t~g~w~~~t~~~~~~e~~G--~a~~ygd~~diP~~~~~~~~~~~~v~~~~~~~~y~~~ 135 (336) ++|++.+........++.+.+... .+..|++..... .+...+...++|..+.......-.++.++..+.+|.+ T Consensus 132 --~~Ii~~~~~~~~l~~l~~~~~~~~---~~~~~~~~~~~~~~~~~~~~E~~~~~~s~~~f~~i~~~~~k~~~~v~iS~e 206 (421) T protein:vir:13 132 --NEFEKLKEGYPSLKEHCHVIPVNR---NAGKMPVRAGASVDKLANLAKDTELVKAMLKTQPMAYDIDDYGLLAPIDNS 206 (421) T ss_pred --HHHHHHHHhhhhhhhhceeeeccC---CceEEEEeecCCccceeeccccccccccccceeEEEeeeeeeEeehhhhHH Confidence 344444444444445544432221 123344333332 2344566778888887777777788888888888765 Q ss_pred HHHHHHhhCCCHHHHHHHHHHHHHHHhhcceEEeeccccceEEEEecCCCCcccccccccccccCHHHHHHHHHHHHHHH Q lcl|Aclame:pro 136 ELEMAGAGRVDLASELNYSSALGLAKFLNGSYLFGVAGLENYGLINDPSLSAPITATTPWSGSPAVEAVVNEVVALFQVL 215 (336) Q Consensus 136 El~~A~~~g~~l~~~k~~aAr~a~e~~~n~~~~~Gd~~~g~~GllN~Pnl~~~~~~~t~w~~~~t~~eI~~Di~~l~~~l 215 (336) =|+-+ ..+|.+.-....++++...+|.-.. ....|+++.+ +..+ ++||.+++..+ T Consensus 207 ll~ds---~~~l~~~i~~~la~~~~~~~~~~i~-----~~~~g~~~~~-------------~~~~----~d~i~~~~~~l 261 (421) T protein:vir:13 207 LLEDS---EINFLEFVNEEFAEFAVNTENAEIV-----KQAKAVLAEE-------------TIND----YAGLVKTINSL 261 (421) T ss_pred HHhhh---HHHHHHHHHHHHHHHHHHHhhhhHh-----hhhhhccccc-------------cccc----hHHHHHHHHHh Confidence 44433 3355655555556666665553211 1122322210 1122 45666677766 Q ss_pred HHHhCCceecccccEEEecHHHHHhccc-CCCCCccHHHHHHHhCC----ccEEEEcccccCC-CCceEEEEEEeecCCc Q lcl|Aclame:pro 216 QTQSQGIITQEDVLRMGLPPTAMSDLSK-TNQYGLAAAAKLKDIFP----KLEFVTIPEYDTA-SGRLVQLWAPRVEGKD 289 (336) Q Consensus 216 ~~~s~g~v~~~~p~tL~Lp~~~~~~L~~-~~~~~~Tvl~~l~~n~p----nl~i~~~pel~~a-~G~~~~~~~~~~~~~~ 289 (336) ...- .....++|.+..+..|.+ .+..|.=++.-+...-| ++.++..+..-.. +|....++.+-. + T Consensus 262 ~~~~------~~~a~~v~n~~~~~~l~~lkd~~G~~i~~~~~~~~~~tl~G~pV~~~~~~~~~~~~~~~~~~gd~~---~ 332 (421) T protein:vir:13 262 VPNA------RKRAIIVTNSDGRAYLDGLMDKQGRPLLKELSDGGDLVFKGRPVIELEESIFDVGDETKFIVSDFK---T 332 (421) T ss_pred hhhh------cCCCEEEEcHHHHHHHHHhhcCCCceeecCcCCCCCceecceeeEEeccccccCCCceEEEEEecc---c Confidence 4321 134679999999888864 33334333322211111 2233333322222 222222222110 0 Q ss_pred eEEEEcChhhhcc--c-ceecCCceEEccccceee----------eeeecccceeeeccC Q lcl|Aclame:pro 290 TATCGFTEKMRAH--S-IERYSSYFRQKKSAGTWG----------AVIFRPFAVAQMIGV 336 (336) Q Consensus 290 ~~~~~~p~~~r~l--~-~~~~~~~~~vp~~~~t~G----------v~ir~P~av~~~~GI 336 (336) ...+...+.++.. . .....-.+.+-+..|.+| +.+.+|.+++...++ T Consensus 333 ~~~~~~~~~~~v~~~~~~~f~~~~~~~r~~~r~d~~~~~~~a~~~~~~~~~~a~v~~~~~ 392 (421) T protein:vir:13 333 LIKFMDRKQYLIDQSKEAGYTKNETIARIIERFDVNSPLDKSSDAEKIRKFGVIVKLQEV 392 (421) T ss_pred cEEEEEecceEEEeecccccccCeeEEEEEeeecceeecchhhheeeecccceeeccccc Confidence 0000000111100 0 000111233334445444 444555666666666 No 118 >protein:vir:1268 Length: 397 # NCBI annotation: hypothetical protein # Family: family:all:21 # MgeID: mge:329 # MgeName: phi-105 # Cross-refs: genbank:acc:NP_690760;genbank:gi:22855000;genbank:GeneID:955203 Probab=85.34 E-value=0.054 Score=27.55 Aligned_cols=286 Identities=8% Similarity=-0.039 Sum_probs=131.9 Q ss_pred CchHHHHHHhhh--cceeccchhhhccchhHHHHHhhhhhcccc--cccCcch--HHHHHHHhhCceeeeeeccccchhh Q lcl|Aclame:pro 1 MRDAQRIQNLAR--AGVILPRSVQNVSTPLTEYAMDAADLSPHL--SSTGSSG--IPNYLTTYVDPAVIDILVAPMKAAE 74 (336) Q Consensus 1 ~~~~~~~~~l~~--~g~~~~~~~~~~~~~~~~~a~da~d~~~~l--~t~~~~~--i~~~l~~~idp~v~~~~~~~~~~~~ 74 (336) -+.....+.+.+ .|-.. ....+... .......+ ++.+++| ||..+. ++|++.+...-.... T Consensus 89 ~~~~~~~~a~~~~~~~~~~-------~~~~~~~~--~~~~~~a~~~~~~~~gg~lvP~~~~----~~ii~~~~~~~~l~~ 155 (397) T protein:vir:12 89 ERQQQYSKAFLKGLRGKRL-------TDEERDLL--DSPEFRAMSGINDEDGGILIPEDIG----RQIHEFKRQFEPLEQ 155 (397) T ss_pred HHHHHHHHHHHHHHhccCC-------cHHHHHHH--hhhhhhhccccccccCcccCchhHH----HHHHHhhhhhhhHHh Confidence 000000111111 01111 00111000 00001111 2223333 454443 456666666665666 Q ss_pred hcccccCCCcceeeEEEeeeecceeeEEeecccCCcee-eeeeeeeeeeEEEEEEEEeeCHHHHHHHHhhCCCHHHHHHH Q lcl|Aclame:pro 75 LVGESKKGDWTTLVAAFITAEPTTKVATYGDYSSDGDS-GANINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASELNY 153 (336) Q Consensus 75 l~~v~t~g~w~~~t~~~~~~e~~G~a~~ygd~~diP~~-~~~~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~k~~ 153 (336) ++++..... ....+.+......+.+...+.+..+|-. ...........+.++..+.+|.+=+. ....+|.+--.. T Consensus 156 ~~~~~~~~~-~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~~v~~~~~k~~~~~~is~e~l~---ds~~~l~~~i~~ 231 (397) T protein:vir:12 156 YVTVEPVTT-RSGTRLLEKNADMVPFSPVEELGNLPEIDQPRFTKVSYSIIDYGGIMTLSNSMLN---DSDQAIMTYVAK 231 (397) T ss_pred hcceeeccC-CceeEEEEEecCCcceeeecccccccccccccceeEEeeheeeEeeehhhHHHHh---hchHHHHHHHHH Confidence 655533221 1123445555566677888888888754 45678888888888888888866443 334577777777 Q ss_pred HHHHHHHHhhcceEEeeccccceEEEEecCCCCcccccccccccccCHHHHHHHHHHHHH-HHHHHhCCceecccccEEE Q lcl|Aclame:pro 154 SSALGLAKFLNGSYLFGVAGLENYGLINDPSLSAPITATTPWSGSPAVEAVVNEVVALFQ-VLQTQSQGIITQEDVLRMG 232 (336) Q Consensus 154 aAr~a~e~~~n~~~~~Gd~~~g~~GllN~Pnl~~~~~~~t~w~~~~t~~eI~~Di~~l~~-~l~~~s~g~v~~~~p~tL~ 232 (336) ...+++.+.++.-.+.|++...-.| ..+ ++||..++. .+... + .....++ T Consensus 232 ~l~~~~~~~~d~~il~G~g~~~~~g-------------------~~~----~~~i~~~~~~~l~~~----~--~~~a~~~ 282 (397) T protein:vir:12 232 WFAKKSVVTRNNLILAAIASLKKVD-------------------IDG----LDGIKKALNVTLDPM----V--APGSIVL 282 (397) T ss_pred HHHHHHHHHHHHHHHhccccccccc-------------------ccc----HHHHHHHHhhccchh----h--hCCCEEE Confidence 8888888888888888876432111 112 334444443 22111 1 1235689 Q ss_pred ecHHHHHhccc-CCCCCccHHH-HHHHhCC----ccEEEEcccc--cCCCCceEEEEEEeecCCceEEEEcChh--hhc- Q lcl|Aclame:pro 233 LPPTAMSDLSK-TNQYGLAAAA-KLKDIFP----KLEFVTIPEY--DTASGRLVQLWAPRVEGKDTATCGFTEK--MRA- 301 (336) Q Consensus 233 Lp~~~~~~L~~-~~~~~~Tvl~-~l~~n~p----nl~i~~~pel--~~a~G~~~~~~~~~~~~~~~~~~~~p~~--~r~- 301 (336) |.|..+..|.+ .+..|.-++. -+....| ++.+...+.. ..+.|....++.+- .+...+..-+. +.. T Consensus 283 ~n~~~~~~L~~lkd~~G~~l~~~~~~~g~~~~l~G~pv~~~~~~~~~~~~~~~~~~~gd~---~~~~~~~~~~~~~i~~~ 359 (397) T protein:vir:12 283 TNQDGYDWLDTLKDGTGRYLLQPDPTNPTKKLLDGRPVVPFTNRVLKTQKGKAPLIIGNL---KEAIVLFDREQQSIAST 359 (397) T ss_pred EcHHHHHHHHHhhccCCceeecccccCCCCccccceeeEEecccccccCCCccEEEEEeh---hceEEEEeecceEEEEe Confidence 99998888854 3333432221 0111111 1222222221 12233333333221 11011111011 111 Q ss_pred -cc-ceecCCceEEccccceeeeeeecccceeeeccC Q lcl|Aclame:pro 302 -HS-IERYSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) Q Consensus 302 -l~-~~~~~~~~~vp~~~~t~Gv~ir~P~av~~~~GI 336 (336) ++ .....-...+-+..|.+| .++.|-||+.++-= T Consensus 360 ~~~~~~f~~~~~~~r~~~r~d~-~~~~~~a~~~~~~t 395 (397) T protein:vir:12 360 DTGAGAFETNSTKVRGIEREDV-RKWDEDAVVFGQIT 395 (397) T ss_pred ccccchhhcCceEEEEEEeecc-EEecccceEEEEEe Confidence 01 011112345667777766 55889998877655 No 119 >protein:vir:4997 Length: 397 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:109 # MgeName: Sfi21 # Cross-refs: genbank:acc:NP_049971;genbank:gi:9632943;genbank:GeneID:1262106 Probab=84.28 E-value=0.062 Score=27.22 Aligned_cols=294 Identities=10% Similarity=-0.045 Sum_probs=131.3 Q ss_pred CchHHH-HHHhh-----------hcceeccchhhhccchh---HHH----HHhhhhhcccccccCc--chHHHHHHHhhC Q lcl|Aclame:pro 1 MRDAQR-IQNLA-----------RAGVILPRSVQNVSTPL---TEY----AMDAADLSPHLSSTGS--SGIPNYLTTYVD 59 (336) Q Consensus 1 ~~~~~~-~~~l~-----------~~g~~~~~~~~~~~~~~---~~~----a~da~d~~~~l~t~~~--~~i~~~l~~~id 59 (336) ++..++ +...+ +.+..-... ......+ ..+ -.++ ..+....+.+. ..||..+.+ T Consensus 53 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~l~~~~~~~-~~~~~~~t~~~gg~~iP~~~~~--- 127 (397) T protein:vir:49 53 RDLFKEQYTEARANEVANMSEEEKKPLTKNEE-EVKANFVKDFKNLVRGRYQNL-LDSKTDGSGSDAGLTIPQDIRT--- 127 (397) T ss_pred HHHHHHHHHHHHHhhhhcccccccccccchhh-HHHHHHHHHHHHHhhcchhhH-HHhhhccCCccCcceecHHHHH--- Confidence 100000 00000 000000000 0000000 000 0011 11111222222 335665543 Q ss_pred ceeeeeeccccchhhhcccccCCCcceeeEEEeee-ecceeeEEeecccCCceee-eeeeeeeeeEEEEEEEEeeCHHHH Q lcl|Aclame:pro 60 PAVIDILVAPMKAAELVGESKKGDWTTLVAAFITA-EPTTKVATYGDYSSDGDSG-ANINYPQRQSYFFQTWTRWGEREL 137 (336) Q Consensus 60 p~v~~~~~~~~~~~~l~~v~t~g~w~~~t~~~~~~-e~~G~a~~ygd~~diP~~~-~~~~~~~~~v~~~~~~~~y~~~El 137 (336) .|++.+.......++..+..... ....+.+... +..+.+...+....+|-.+ ......+...+.++..+.+|.+=+ T Consensus 128 -~ii~~~~~~~~l~~~~~~~~~~~-~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell 205 (397) T protein:vir:49 128 -AINTLVRQFDSLQEYVNVENVTT-LTGSRVYEKWADITGLAKLDDEGGQIGQNDDPKLSLIRYAIKRYAGISTVTNSLL 205 (397) T ss_pred -HHHHHHHhhhhHhhhcceeeccC-CcceEEEEeeccCCcceeeeccccccccccccceeeeEeeeeeeEeehhhHHHHH Confidence 44555555555555555432211 0122334333 3446677777777787665 356777778888888888876544 Q ss_pred HHHHhhCCCHHHHHHHHHHHHHHHhhcceEEeeccccceEEEEecCCCCcccccccccccccCHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 138 EMAGAGRVDLASELNYSSALGLAKFLNGSYLFGVAGLENYGLINDPSLSAPITATTPWSGSPAVEAVVNEVVALFQVLQT 217 (336) Q Consensus 138 ~~A~~~g~~l~~~k~~aAr~a~e~~~n~~~~~Gd~~~g~~GllN~Pnl~~~~~~~t~w~~~~t~~eI~~Di~~l~~~l~~ 217 (336) +. ..+++.+.-.....+++.+.+|+-.++|++... +. + ...+ ++||.+++..+.. T Consensus 206 ~d---s~~~l~~~i~~~l~~~~~~~~d~ail~G~g~~~----------~~--~------~~~~----~d~i~~~~~~l~~ 260 (397) T protein:vir:49 206 AD---SAENILAWLSGWIAKKVVVTRNKAILEAIGTLP----------NK--P------TLAK----WDDIIDLQAKVDP 260 (397) T ss_pred hh---hhHHHHHHHHHHHHHHHHHHHHHHHHhcccccc----------cc--c------cccC----HHHHHHHHHhhhh Confidence 33 346788888888888888888888888876421 10 0 1123 3566667666643 Q ss_pred HhCCceecccccEEEecHHHHHhccc-CCCCCccHHH-HHHHhCC----ccEEEEcc--cc-cCCCCceEEEEEEeecCC Q lcl|Aclame:pro 218 QSQGIITQEDVLRMGLPPTAMSDLSK-TNQYGLAAAA-KLKDIFP----KLEFVTIP--EY-DTASGRLVQLWAPRVEGK 288 (336) Q Consensus 218 ~s~g~v~~~~p~tL~Lp~~~~~~L~~-~~~~~~Tvl~-~l~~n~p----nl~i~~~p--el-~~a~G~~~~~~~~~~~~~ 288 (336) .- ..+..++|.+..+..|.+ .+..|.-++. =+....+ ++.++... .+ .+.++....+|.+-. T Consensus 261 ~~------~~~a~~v~n~~~~~~l~~lkd~~g~~l~~~~~~~g~~~~l~G~pV~~~~~~~~~~~~~~~~~~~~gd~~--- 331 (397) T protein:vir:49 261 AI------KQTSLFLTNTSGFTALKKVKNAMGDYLMERDVKSPTGYSIDGFVVKEISDRFLPNGTGGAMPLYFGDLK--- 331 (397) T ss_pred hh------cCCCEEEEcHHHHHHHHHhhccCCceeecccccCCCCceecceeeEEecccccccccCCceeEEEeecc--- Confidence 21 235689999999998854 3333332221 0111111 11121111 11 222333333333210 Q ss_pred ceEEEE--cChhhhcccc---eecCCceEEccccceeeeeeecccceeeeccC Q lcl|Aclame:pro 289 DTATCG--FTEKMRAHSI---ERYSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) Q Consensus 289 ~~~~~~--~p~~~r~l~~---~~~~~~~~vp~~~~t~Gv~ir~P~av~~~~GI 336 (336) +...+. -...+...+. ....-....-+..|.+|. +++|.||+....= T Consensus 332 ~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~r~d~~-~~~~~a~~~~~~~ 383 (397) T protein:vir:49 332 QAVTLFDRQHLSLLSTNIGGGAFETDTTKVRVIDRFDVV-STDTEAFVPASFK 383 (397) T ss_pred ceEEEEeecccEEEEeccccchhhcCeeeEEEEEeeccE-EecccceEEEEec Confidence 000000 0011111110 111223445667777775 6779999877633 No 120 >protein:vir:94933 Length: 330 # NCBI annotation: putative phage structural protein # Family: family:all:1120 # MgeID: mge:1538 # MgeName: Xp15 # Cross-refs: genbank:acc:YP_239278;genbank:gi:66392060;genbank:GeneID:5076578 Probab=84.13 E-value=0.036 Score=28.49 Aligned_cols=288 Identities=14% Similarity=0.135 Sum_probs=119.4 Q ss_pred CchHHHHHHhhhcceeccchhh---hccchhHHHHHhh---hhhcccccccCcchHHHHHHHhhCceeeeeeccccchhh Q lcl|Aclame:pro 1 MRDAQRIQNLARAGVILPRSVQ---NVSTPLTEYAMDA---ADLSPHLSSTGSSGIPNYLTTYVDPAVIDILVAPMKAAE 74 (336) Q Consensus 1 ~~~~~~~~~l~~~g~~~~~~~~---~~~~~~~~~a~da---~d~~~~l~t~~~~~i~~~l~~~idp~v~~~~~~~~~~~~ 74 (336) |- | |--|.-.. .|+-.+-..+|=+ ++++ -++ +..+..+++|.+-..-...+ T Consensus 1 ~~---------~--~~~~~~~~~~~~~~~~~p~l~m~alTLaea~-~l~-----------~d~~~~~VIE~l~~~s~iL~ 57 (330) T protein:vir:94 1 MV---------R--ICTPPLRGRWRTLTHQFPELKMPTVTLAESA-KLS-----------QDHLVSGLIETIVEVNPLYE 57 (330) T ss_pred Cc---------e--ecCCccccceeehhccccccchhhhhhhHHh-hcC-----------chhhHHHHHHhhhccchHHh Confidence 10 0 00000000 0000000111111 1111 011 12233456666655556666 Q ss_pred hccccc-CCCcceeeEEEeeeecceeeEEe---ecccC-CceeeeeeeeeeeeEEEEEEEEeeCHHHHHHHHhhCC--CH Q lcl|Aclame:pro 75 LVGESK-KGDWTTLVAAFITAEPTTKVATY---GDYSS-DGDSGANINYPQRQSYFFQTWTRWGERELEMAGAGRV--DL 147 (336) Q Consensus 75 l~~v~t-~g~w~~~t~~~~~~e~~G~a~~y---gd~~d-iP~~~~~~~~~~~~v~~~~~~~~y~~~El~~A~~~g~--~l 147 (336) .+|-+. ++. ...|.....-+.+... +-+.. .|. .....+.....++..+ .-+-+-|...|- ++ T Consensus 58 ~lpf~~ve~~----~~~~~r~~~lp~a~~r~~n~~~~~~~~~---Tf~q~t~~l~~l~~~~---~Vd~~iadl~g~~~d~ 127 (330) T protein:vir:94 58 MMPFTEIEGN----ALAYNRENVLGDVQFLAVGGTITAKNPA---TFTKVTSELTTLIGDA---EVNGLIQATRSDFMDQ 127 (330) T ss_pred hcccccccCC----cceeeeeecCCcceeeeccccccccCcc---eeeeeeechhhhhhhH---HHHHHHHHhcCCHHHH Confidence 666432 221 2334332222332221 11111 121 1111122222233322 223333444453 33 Q ss_pred HHHHHHHHHHHHHHhhcceEEeeccc-cceEEEEecCCCCcccccccc-cccccCHHHHHHHHHHHHHHHHHHhCCceec Q lcl|Aclame:pro 148 ASELNYSSALGLAKFLNGSYLFGVAG-LENYGLINDPSLSAPITATTP-WSGSPAVEAVVNEVVALFQVLQTQSQGIITQ 225 (336) Q Consensus 148 ~~~k~~aAr~a~e~~~n~~~~~Gd~~-~g~~GllN~Pnl~~~~~~~t~-w~~~~t~~eI~~Di~~l~~~l~~~s~g~v~~ 225 (336) ......+-.+++.+.+..-.++||.. .++.||++ ++.......+. -.+.-| ++|+.+|+..++..-+ T Consensus 128 ~~~q~~~~ieal~~~~e~~linGDs~~~~F~GL~~--~~~~~q~i~tg~~gg~~T----~d~LDeLl~~v~~~~g----- 196 (330) T protein:vir:94 128 TSVQVASKAKSIGRQYQASMITGDGTGNSFQGMMG--LVAASQTISAGANGGTLT----FELLDQLLDLVKDKDG----- 196 (330) T ss_pred HHHHHHHHHHHHHHHHHHHhhccCCCCccccchhh--cCCcccEEecCCCCCCCC----HHHHHHHHHHhcCCCC----- Confidence 33444455667888888888999865 67779987 34322221110 112334 5678888887765322 Q ss_pred ccccEEEecHHHHHhccc---C-CCCCc---cHHHHHH--HhCCccEEEEc---ccccC--CCCceEEEEEEeecCC--c Q lcl|Aclame:pro 226 EDVLRMGLPPTAMSDLSK---T-NQYGL---AAAAKLK--DIFPKLEFVTI---PEYDT--ASGRLVQLWAPRVEGK--D 289 (336) Q Consensus 226 ~~p~tL~Lp~~~~~~L~~---~-~~~~~---Tvl~~l~--~n~pnl~i~~~---pel~~--a~G~~~~~~~~~~~~~--~ 289 (336) .|..|+|+......+.. . +.+++ ++..+=+ ..|-.+-|... |.=.+ .+++..-+|+-++... + T Consensus 197 -~~~~~l~n~a~~r~I~a~~R~~~~~~v~~~~~~~~G~~v~~~~GvPi~~~d~ip~~~~~~~~~~ttsIyav~~G~~~~~ 275 (330) T protein:vir:94 197 -QVDYLMSSFAMRRKYFSLLRALGGAAIGEVMTLPSGRQIPTYRGVPWFVNDFIPSNMTQGTATNATAIFAGTFDDGSNK 275 (330) T ss_pred -CCcEEEechhHHHHHHHHHHhccCCCCCCcccccCCCEEeeeCCeEEEecccccCCCCcccCCCceeEEEEeecccccc Confidence 47889988876665532 1 11121 1111000 11223333322 22111 2233333343343211 1 Q ss_pred eEEEEcChh------hhccc-ceecC-CceEEccccceeeeeeecccceeeeccC Q lcl|Aclame:pro 290 TATCGFTEK------MRAHS-IERYS-SYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) Q Consensus 290 ~~~~~~p~~------~r~l~-~~~~~-~~~~vp~~~~t~Gv~ir~P~av~~~~GI 336 (336) -+.+.+.++ .|..+ .+.+. .+|.+ ..+.|+.+.-|.|++.+.|| T Consensus 276 qgV~Gl~~~g~~glsVr~~G~~~~k~v~~~~v---~~y~~~av~~~~a~~~L~~V 327 (330) T protein:vir:94 276 YGIAGLTARGSAGLRVQNVGAKENADETITRV---KMYCGFANFSQLGLAAIKGL 327 (330) T ss_pred cceEeecCCCCCcceeeeCCCccccceeeEEE---EEeeeeEEechhheeeeccc Confidence 122333211 12322 11111 23444 34789999999999999999 No 121 >protein:vir:100172 Length: 394 # NCBI annotation: putative major head protein # Family: family:all:21 # MgeID: mge:1524 # MgeName: phi AT3 # Cross-refs: genbank:acc:YP_025031;genbank:gi:48697264;genbank:GeneID:2948270 Probab=82.14 E-value=0.079 Score=26.61 Aligned_cols=290 Identities=9% Similarity=0.030 Sum_probs=126.8 Q ss_pred CchHHH---HHHhhhccee--ccch---hhh-ccc---hhHHHHH---hhhhhcccccccCcc--hHHHHHHHhhCceee Q lcl|Aclame:pro 1 MRDAQR---IQNLARAGVI--LPRS---VQN-VST---PLTEYAM---DAADLSPHLSSTGSS--GIPNYLTTYVDPAVI 63 (336) Q Consensus 1 ~~~~~~---~~~l~~~g~~--~~~~---~~~-~~~---~~~~~a~---da~d~~~~l~t~~~~--~i~~~l~~~idp~v~ 63 (336) .++.+. .......++. .... ... ... .+..+.. -..+.+....+++++ .+|..+. .+|+ T Consensus 57 i~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~t~~~gg~~vP~~~~----~~ii 132 (394) T protein:vir:10 57 IKDLEAENKANSDPDKPVDNAQPNGTDLKKKPIDAKKKAINDFIHSHGKVIDNAAGHVTSTEAGVLIPEEII----YDPT 132 (394) T ss_pred HHHHHHHHHhhcchhhhhhhhcccccchhhhHHHHHHHHHHHHHhccchhhhhhhcccccccCceeccHHHH----HHHH Confidence 111100 0000000000 0000 000 000 0000000 000111112233332 3554443 4556 Q ss_pred eeeccccchhhhcccccCCCcceeeEEEeeeec-ceeeEEeecccCCce-eeeeeeeeeeeEEEEEEEEeeCHHHHHHHH Q lcl|Aclame:pro 64 DILVAPMKAAELVGESKKGDWTTLVAAFITAEP-TTKVATYGDYSSDGD-SGANINYPQRQSYFFQTWTRWGERELEMAG 141 (336) Q Consensus 64 ~~~~~~~~~~~l~~v~t~g~w~~~t~~~~~~e~-~G~a~~ygd~~diP~-~~~~~~~~~~~v~~~~~~~~y~~~El~~A~ 141 (336) +.+.+......++.+.+.+. .+..|++... .+.+...+...++|- .+.........++.++..+.+|.+=|+.+ T Consensus 133 ~~~~~~~~l~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~~~~v~l~~~k~~~~~~iS~ell~ds- 208 (394) T protein:vir:10 133 AEVNSVVDLSTLVTKTPVTT---PKGTYPILKRATDRFSSVAELAENPALAEPEFEQVDWSVSTYRGAIPLSEEAIADS- 208 (394) T ss_pred HHHHhhhhhhhhceeeeccC---CceEEEEEecCCCccccccccccccccccccceeEEeeeeeeEeeehhHHHHHhhh- Confidence 66666666666665543321 2345555554 466777888888884 55677777888888888888887766654 Q ss_pred hhCCCHHHHHHHHHHHHHHHhhcceEEeeccccceEEEEecCCCCcccccccccccccCHHHHHHHHHHHHHHHHHHhCC Q lcl|Aclame:pro 142 AGRVDLASELNYSSALGLAKFLNGSYLFGVAGLENYGLINDPSLSAPITATTPWSGSPAVEAVVNEVVALFQVLQTQSQG 221 (336) Q Consensus 142 ~~g~~l~~~k~~aAr~a~e~~~n~~~~~Gd~~~g~~GllN~Pnl~~~~~~~t~w~~~~t~~eI~~Di~~l~~~l~~~s~g 221 (336) ..+|.+.-....+.++...+|+-.+.|.+.. .+..+ .+..+.+ ||..++...... .+ T Consensus 209 --~~~l~~~i~~~la~~~~~~~~~~il~g~g~~----------~~~~~------~~~~~~d----~l~~~~~~~~~~-~~ 265 (394) T protein:vir:10 209 --AVDLTSLVGQSINEKSVNTYNAMIAPVLQSF----------TAKAT------TTDTLVD----SLKHILNVDLDP-AY 265 (394) T ss_pred --hHHHHHHHHHHHHHHHHHHHHHHHhhccccc----------ccccc------cccccHH----HHHHHHHhhhhh-hc Confidence 3467777777777777777777666655421 11101 1122333 444443322111 11 Q ss_pred ceecccccEEEecHHHHHhccc-CCCCCccHHHHH-----HHhC----CccEEEEcc--cccCCCCceEEEEEEeec--- Q lcl|Aclame:pro 222 IITQEDVLRMGLPPTAMSDLSK-TNQYGLAAAAKL-----KDIF----PKLEFVTIP--EYDTASGRLVQLWAPRVE--- 286 (336) Q Consensus 222 ~v~~~~p~tL~Lp~~~~~~L~~-~~~~~~Tvl~~l-----~~n~----pnl~i~~~p--el~~a~G~~~~~~~~~~~--- 286 (336) ...++|.++.+..|.+ .+..|.-++.-- .... -++.++... .+.++.|....++.+-.+ T Consensus 266 ------~a~~vmn~~~~~~l~~lkd~~G~~i~~~~~~~~~~~~~~~~L~G~PV~~~~~~~~~~~~~~~~i~~gd~s~~~~ 339 (394) T protein:vir:10 266 ------SRALVVTQSLFNTLDTLKDKNGRYLLHDASDSITDGTAKGTVLGVPVYVVGDALLGSAAGDQKAFVGDLKRGVL 339 (394) T ss_pred ------cCEEEecHHHHHHHHHhhccCCCeeeeccccccccCCcccccccceeEEecccccCCCCCceEEEEeeccccEE Confidence 1368999998888864 333343222100 0011 122333222 223334444434332110 Q ss_pred --CCceEEEEcChhhhcccceecCCceEEccccceeeeeeecccceeeeccC Q lcl|Aclame:pro 287 --GKDTATCGFTEKMRAHSIERYSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) Q Consensus 287 --~~~~~~~~~p~~~r~l~~~~~~~~~~vp~~~~t~Gv~ir~P~av~~~~GI 336 (336) ...-.++.+ .++.. . ...+-...|.+| .+++|.||+.+..= T Consensus 340 ~~~~~~~~v~~-~~~~~------~-~~~~~~~~r~d~-~~~~~~ai~~~~~~ 382 (394) T protein:vir:10 340 FADRQQVTLAW-EDSKI------Y-GRYLGAAFRFGV-KQADSNAGYFVTNT 382 (394) T ss_pred EEeecceEEEE-ecccc------c-ceeEEEEEEecc-EEeccccEEEEEee Confidence 001111211 11100 0 111223456554 56669999887644 No 122 >protein:vir:80128 Length: 466 # NCBI annotation: Phage capsid protein # Family: family:all:635 # MgeID: mge:1877 # MgeName: bacteriophage bv1 # Cross-refs: genbank:acc:YP_001425603;genbank:gi:155042936;genbank:GeneID:5469556 Probab=82.13 E-value=0.08 Score=26.61 Aligned_cols=306 Identities=13% Similarity=0.100 Sum_probs=127.1 Q ss_pred Cc----hH-HHHHHhhh---------------cceeccchhh-----------------hccchhHHH---HHhhhhhcc Q lcl|Aclame:pro 1 MR----DA-QRIQNLAR---------------AGVILPRSVQ-----------------NVSTPLTEY---AMDAADLSP 40 (336) Q Consensus 1 ~~----~~-~~~~~l~~---------------~g~~~~~~~~-----------------~~~~~~~~~---a~da~d~~~ 40 (336) ++ +. +.++++.+ ..-.+..... ......+.. ..+.+-... T Consensus 73 l~~ei~~le~el~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 152 (466) T protein:vir:80 73 LEGEIKELENELEQLNNKEPKNNSEPAQVSGARTQQFVGGETRMKGFFRNMPYEQRAALIARSEVKEFLAQVRTLAQQKR 152 (466) T ss_pred HHHHHHHHHHHHHHHHHhhhccCchhHHHHhhhhhHHhhHHHHHHHHHHhhhhhhHHHHHHHHHHHHHHHHHHHHhhhhh Confidence 00 00 00000000 0000000000 000000000 000000000 Q ss_pred cccccCcchHHHHHHHhhCceeeeeeccccchhhhcccccCCCcceeeEEEeeeecceeeEEeecccCCceeeeeeeeee Q lcl|Aclame:pro 41 HLSSTGSSGIPNYLTTYVDPAVIDILVAPMKAAELVGESKKGDWTTLVAAFITAEPTTKVATYGDYSSDGDSGANINYPQ 120 (336) Q Consensus 41 ~l~t~~~~~i~~~l~~~idp~v~~~~~~~~~~~~l~~v~t~g~w~~~t~~~~~~e~~G~a~~ygd~~diP~~~~~~~~~~ 120 (336) + .+.....+|.++.+-|-..+ +...+-++.....|++. +..+.+......+.+.+...++|..+....... T Consensus 153 ~-~~g~~~~vP~~~~~~i~~~l-~~~~~l~~~~~v~~~~g-------~~~~~~~~~~~~a~wv~E~~~~~~~~~~f~~i~ 223 (466) T protein:vir:80 153 A-VSGAELTIPDVMLELLRDNM-HRYSKLISKVRLRPLKG-------TARQNIAGAIPEGVWTEAVANLNELSLSFSQIE 223 (466) T ss_pred h-hccccccccHHHHHHHHHhh-hhhhhhhhheeeeecCc-------eeEeeeecCCcceeeccccccccccccccccee Confidence 1 11112346766655442222 11112222233333221 234444445556677777888888887777777 Q ss_pred eeEEEEEEEEeeCHHHHHHHHhhCCCHHHHHHHHHHHHHHHhhcceEEeeccccceEEEEecCCCCccccc---cccccc Q lcl|Aclame:pro 121 RQSYFFQTWTRWGERELEMAGAGRVDLASELNYSSALGLAKFLNGSYLFGVAGLENYGLINDPSLSAPITA---TTPWSG 197 (336) Q Consensus 121 ~~v~~~~~~~~y~~~El~~A~~~g~~l~~~k~~aAr~a~e~~~n~~~~~Gd~~~g~~GllN~Pnl~~~~~~---~t~w~~ 197 (336) ..++.+...+.+|.+=|. ....++.+--....+.++...+|.-++.|++...-.|+||+......... ..+.+. T Consensus 224 ~~~~k~~~~~~iS~ell~---ds~~~l~~~i~~~la~~~~~~~~~ail~G~G~~~P~Gil~~~~~~~~~~~~~~~~~~~~ 300 (466) T protein:vir:80 224 VDGYKVGGFIPIPNSTLE---DSDLNLADEILDAIGQAIGFALDKAILYGTGTKMPVGIVTRLAQTTQPPNWGTKAPAWT 300 (466) T ss_pred ecceeeeeehhhhHHHHh---cchHHHHHHHHHHHHHHHHHHHhhheeeccCCCCcceeeeccccccccccccccccccc Confidence 788888877777765554 34457888888888999999999999999988888899997543211111 111111 Q ss_pred ccCHHH-------------HHHHHHHHHHHHHHHhCCceecccccEE-EecHHHHHhcc-cC---CCCCccHHHHHHHh- Q lcl|Aclame:pro 198 SPAVEA-------------VVNEVVALFQVLQTQSQGIITQEDVLRM-GLPPTAMSDLS-KT---NQYGLAAAAKLKDI- 258 (336) Q Consensus 198 ~~t~~e-------------I~~Di~~l~~~l~~~s~g~v~~~~p~tL-~Lp~~~~~~L~-~~---~~~~~Tvl~~l~~n- 258 (336) ..+... .+.|+..++..+... ...+..+ ++.+..+..|. .. +..|. +-+--.| T Consensus 301 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------~~~~~~~w~~~~~~~~~l~~~~~~~~~~g~--~~~~~~~~ 372 (466) T protein:vir:80 301 NLSTTNLLKIDPTGKSAEEFFSELVLKLSKARAN------YSNGMKFWAMSSNTHAVLMSKAITFNSAGA--LVASLNNT 372 (466) T ss_pred ccchhhhhhhhhhccchhhHHHHHHHHHHhhhcc------ccCCceeEEecchhHHHhhcccccccCCcc--ccccCCCc Confidence 111111 122221111111110 0123333 33344444432 11 11111 0000001 Q ss_pred CC--ccEEEE---cccccCCCCce-EEEEEEeecCCceEEEEcChhhhcccceecCCceEEccccceeeeeeecccceee Q lcl|Aclame:pro 259 FP--KLEFVT---IPEYDTASGRL-VQLWAPRVEGKDTATCGFTEKMRAHSIERYSSYFRQKKSAGTWGAVIFRPFAVAQ 332 (336) Q Consensus 259 ~p--nl~i~~---~pel~~a~G~~-~~~~~~~~~~~~~~~~~~p~~~r~l~~~~~~~~~~vp~~~~t~Gv~ir~P~av~~ 332 (336) .| +..|+. +|+-.--.|.. .+.+.++ .+ +++.......+ . .-...+-+..|.+|- ++.|-||+. T Consensus 373 ~~i~G~pvv~s~~~~~~~~~~g~~~~y~i~~r-~~---~~i~~~~~~~f----~-~d~~~~r~~~r~dg~-~~~~~afv~ 442 (466) T protein:vir:80 373 MPIVGGDIVILDFIPDNDIIGGYGSLYLLAER-AD---IKLAQSEHVRF----I-EDQTVFKGTARYDGK-PVFGEGFVA 442 (466) T ss_pred ccccccceeecCccCccceeeeccccEEEEee-cc---eEEEechhhhh----h-cCcEEEEEEEEEccE-EeccCceEE Confidence 01 122222 22211112322 2333322 11 22222221111 1 123455667777554 478999999 Q ss_pred eccC Q lcl|Aclame:pro 333 MIGV 336 (336) Q Consensus 333 ~~GI 336 (336) +++= T Consensus 443 ~~~~ 446 (466) T protein:vir:80 443 VNIA 446 (466) T ss_pred EEec Confidence 8744 No 123 >protein:vir:78640 Length: 352 # NCBI annotation: phage capsid # Family: family:all:658 # MgeID: mge:1855 # MgeName: tp310-2 # Cross-refs: genbank:acc:YP_001429943;genbank:gi:156603997;genbank:GeneID:5525386 Probab=80.93 E-value=0.09 Score=26.31 Aligned_cols=291 Identities=10% Similarity=0.000 Sum_probs=130.6 Q ss_pred CchH-HH--HH---Hhhhcc---eeccchhh------------hccchhHHHHHhhhhhccccc--ccCcc--hHHHHHH Q lcl|Aclame:pro 1 MRDA-QR--IQ---NLARAG---VILPRSVQ------------NVSTPLTEYAMDAADLSPHLS--STGSS--GIPNYLT 55 (336) Q Consensus 1 ~~~~-~~--~~---~l~~~g---~~~~~~~~------------~~~~~~~~~a~da~d~~~~l~--t~~~~--~i~~~l~ 55 (336) +++. .+ .. .....+ -....... ..........+........++ +.+++ .||..+. T Consensus 21 l~~~~d~~e~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~~~~~al~~~~~~~gG~lIP~~~~ 100 (352) T protein:vir:78 21 VERQVQDIEEKEKAKVKDKGEAYQSLNDNEKLVKAKAEFYRHAILPNEFEKPSMEAQRLLHALPTGNDSGGDKLLPKTLS 100 (352) T ss_pred HHHHHHHHHHHHHHHhhhccccccccchhhhHHHHHHHHHHHHhhhhHHHHHHhhHHHHHHHhccCCCCCCceeccHhHH Confidence 1000 00 00 000000 00000000 000111111111111111122 22222 4776554 Q ss_pred HhhCceeeeeeccccchhhhcccccCCCcceeeEEEeeeecceeeEEeecccCCceeeeeeeeeeeeEEEEEEEEeeCHH Q lcl|Aclame:pro 56 TYVDPAVIDILVAPMKAAELVGESKKGDWTTLVAAFITAEPTTKVATYGDYSSDGDSGANINYPQRQSYFFQTWTRWGER 135 (336) Q Consensus 56 ~~idp~v~~~~~~~~~~~~l~~v~t~g~w~~~t~~~~~~e~~G~a~~ygd~~diP~~~~~~~~~~~~v~~~~~~~~y~~~ 135 (336) + +|++.+......+.+..+.+.+... ...+....+.+.+.+....+|-.+.......-.++.++..+.+|.+ T Consensus 101 ~----~Ii~~l~~~s~l~~~~~v~~~~~~~----~p~~~~~~~~a~~v~E~~~~~~~~~~f~~v~~~~~k~~~~i~is~e 172 (352) T protein:vir:78 101 K----EIVSEPFAKNQLREKARLTNIKGLE----IPRVSYTLDDDDFITDVETAKELKLKGDTVKFTTNKFKVFAAISDT 172 (352) T ss_pred H----HHHHHHHhhcchhhheeeEecCCce----EEEEecCCCcccccccccccccccccceeeeecceeEEeechhhHH Confidence 3 3344444444445566565555432 2233334467778888888888888888888888889888888876 Q ss_pred HHHHHHhhCCCHHHHHHHHHHHHHHHhhcce-EEeeccccceEEEEecCCCCcccccccccccccCHHHHHHHHHHHHHH Q lcl|Aclame:pro 136 ELEMAGAGRVDLASELNYSSALGLAKFLNGS-YLFGVAGLENYGLINDPSLSAPITATTPWSGSPAVEAVVNEVVALFQV 214 (336) Q Consensus 136 El~~A~~~g~~l~~~k~~aAr~a~e~~~n~~-~~~Gd~~~g~~GllN~Pnl~~~~~~~t~w~~~~t~~eI~~Di~~l~~~ 214 (336) =|..+ ..+|.+--....++++...++.. +..|++.....|.++++.+... + ....++||.+++.. T Consensus 173 ll~Ds---~~~l~~~i~~~la~~~~~~e~~~~~~~g~g~~~~~g~l~~~~~~~~-t----------~~~~~d~i~~~~~~ 238 (352) T protein:vir:78 173 VIHGS---DVDLVNWVENALQSGLAAKERKDALAVSPKSGLEHMSFYNGSVKEV-E----------GANMYDAIINALAD 238 (352) T ss_pred HHhhh---hHHHHHHHHHHHHHHHHHHHHHhhhhcCCCCcccccceeccccccc-c----------ccchHHHHHHHHhc Confidence 44433 35677777776666666665554 4456655556777777665421 1 11125666667666 Q ss_pred HHHHhCCceecccccEEEecHHHHHh-cccCCCCCccHHHHHHHhCCccEEEEcccccCCC------CceEEEEEEeecC Q lcl|Aclame:pro 215 LQTQSQGIITQEDVLRMGLPPTAMSD-LSKTNQYGLAAAAKLKDIFPKLEFVTIPEYDTAS------GRLVQLWAPRVEG 287 (336) Q Consensus 215 l~~~s~g~v~~~~p~tL~Lp~~~~~~-L~~~~~~~~Tvl~~l~~n~pnl~i~~~pel~~a~------G~~~~~~~~~~~~ 287 (336) |...-. ..-+.+|-+..+.. +......|..++. .-|+ ++-..|-..+++ |.-.+.+.. +++ T Consensus 239 l~~~~~------~~a~~~mn~~t~~~l~~~~~~~~~~~~~----~~~~-~llG~PV~~~~~~~~~~~Gdf~~~~~~-~~~ 306 (352) T protein:vir:78 239 LHEDYR------DNATIYMRYADYVKIISVLSNGTTNFFD----TPAE-KVFGKPVVFTDAAVKPIVGDFNYFGIN-YDG 306 (352) T ss_pred cChhhh------cCCEEEEehHHHHHHHHHHhccCCcccc----cCCc-cccccceEEecCCCceeEeehhhhhhh-hhh Confidence 533211 11246665554433 3433333433331 1121 111222222211 111111111 010 Q ss_pred CceEEEEcChhhhcccceecCCceEEccccceeeeeeecccceeeeccC Q lcl|Aclame:pro 288 KDTATCGFTEKMRAHSIERYSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) Q Consensus 288 ~~~~~~~~p~~~r~l~~~~~~~~~~vp~~~~t~Gv~ir~P~av~~~~GI 336 (336) +-+..+- +........-+..|..|.+ .+|-||+.+.-= T Consensus 307 ---------~~~~~~~-~~~~g~~~f~~~~r~Dg~~-~~~eA~~~l~~~ 344 (352) T protein:vir:78 307 ---------TTYDTDK-DVKKGEYLFVLTAWYDQQR-TLDSAFRIAKAK 344 (352) T ss_pred ---------heeeeec-cccCCeeEEEEEeeeCcee-echhheEEEEee Confidence 1111111 1122345566777888775 559998666433 No 124 >protein:vir:102655 Length: 322 # NCBI annotation: Hypothetical protein # Family: family:all:6384 # MgeID: mge:1624 # MgeName: VP2 # Cross-refs: genbank:acc:YP_052979;genbank:gi:50282923;genbank:GeneID:2948122 Probab=78.45 E-value=0.11 Score=25.75 Aligned_cols=282 Identities=12% Similarity=0.083 Sum_probs=112.4 Q ss_pred HHHhhhhhc-ccccccCcchHH-HHHHHhhCceeeeeeccccchhhhcccccCCCccee--e-EEEeeee--cceeeE-- Q lcl|Aclame:pro 31 YAMDAADLS-PHLSSTGSSGIP-NYLTTYVDPAVIDILVAPMKAAELVGESKKGDWTTL--V-AAFITAE--PTTKVA-- 101 (336) Q Consensus 31 ~a~da~d~~-~~l~t~~~~~i~-~~l~~~idp~v~~~~~~~~~~~~l~~v~t~g~w~~~--t-~~~~~~e--~~G~a~-- 101 (336) |+.-+-.++ |.|++ .|+ +|.++|-+ -++..+.... ..|-|..+...-... + ..|.+.+ .+|+.. T Consensus 1 ~~~~~~~~~~~~Ms~----~i~~~fv~qy~~--~v~~~~qq~~-s~L~~tV~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 73 (322) T protein:vir:10 1 MKLNAIMSMLPLIAG----DIDQAFVQTYET--TLRILSQQKS-AKLKQYCQHKNESSESHNWETLASMDPDAVKRKRSR 73 (322) T ss_pred Ccccceeeeeeeeec----hhhhHHHHHHHH--HHHHHHHHhh-hhhhcccccccccccccceeeccccccccccccccc Confidence 444333333 44444 234 46666652 2333333322 333333221100111 0 1122111 233332 Q ss_pred -Eeeccc-CCceeeeeeeeeeeeEEEEEEEEeeCHHHHHHHHhhCCCHHHHHHHHHHHHHHHhhcceEEe---eccccce Q lcl|Aclame:pro 102 -TYGDYS-SDGDSGANINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASELNYSSALGLAKFLNGSYLF---GVAGLEN 176 (336) Q Consensus 102 -~ygd~~-diP~~~~~~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~k~~aAr~a~e~~~n~~~~~---Gd~~~g~ 176 (336) ..+|.. |.|...............+..++.+...++.+ +..+..+.-.+++..|++++.+++.+- |.+..+. T Consensus 74 ~~~~d~~~dtp~~~~~~~~r~~~~~d~~~~~~VDd~D~~k---~~~D~~~~~~~~~a~AL~R~~D~~I~~a~~g~a~~~~ 150 (322) T protein:vir:10 74 QQSADGTYPTPVNNKPFAKRRTNVDTYDTGHVVEQEDISQ---MLLDPNSALITSQAYAMARKTDDLIIAGAWKPASIKG 150 (322) T ss_pred ccccCcccCCCccccccceEEEeecccccceecchHHHHH---hhcCchHHHHHHHHHHhhhHHHHHHHhhhhccccccc Confidence 334543 66766654333334444455555555554433 345566666667777777777774433 4333221 Q ss_pred EEEEecCCCCcccccccccccccCHHHHHHHHHHHHHHHHHHhCCceecccccEEEecHHHHHhcccCC---CCCccHHH Q lcl|Aclame:pro 177 YGLINDPSLSAPITATTPWSGSPAVEAVVNEVVALFQVLQTQSQGIITQEDVLRMGLPPTAMSDLSKTN---QYGLAAAA 253 (336) Q Consensus 177 ~GllN~Pnl~~~~~~~t~w~~~~t~~eI~~Di~~l~~~l~~~s~g~v~~~~p~tL~Lp~~~~~~L~~~~---~~~~Tvl~ 253 (336) .| .+....++..-.+..+ .--++.|.++...+.... +..+.+-.++++|.++..|-.-. +.+..--+ T Consensus 151 ~g------t~v~~~ss~~i~~g~~-g~t~~kl~~a~~~l~~~d---vp~d~~R~~vv~p~~~~~LL~d~~~ts~D~~~~~ 220 (322) T protein:vir:10 151 TG------QPVEFLATQEIGDGTK-PISFDYVTEITERFLENE---IEPEVSKVIVIGPTQARKLLQITEATSADYTSAM 220 (322) T ss_pred cc------cccccCCCcccccCcc-chhHHHHHHHHHHHHhcC---CCCCCCeEEEeCHHHHHHHhcchhhhhhhcccch Confidence 11 1110000100000111 112333444444443322 32233457999999988774321 11111123 Q ss_pred HHHHh--------CCccEEEEccccc----------CCCCceEEEEEEeecCCceEE-EEcChhhhcccceecCC-ceEE Q lcl|Aclame:pro 254 KLKDI--------FPKLEFVTIPEYD----------TASGRLVQLWAPRVEGKDTAT-CGFTEKMRAHSIERYSS-YFRQ 313 (336) Q Consensus 254 ~l~~n--------~pnl~i~~~pel~----------~a~G~~~~~~~~~~~~~~~~~-~~~p~~~r~l~~~~~~~-~~~v 313 (336) .|..+ |..|....+|.-+ .+++.+..+++++...=.... ..+..++--+| ... .+.+ T Consensus 221 ~l~~~G~ig~~lGf~~i~s~~lp~~~~t~~~~~~~~~~~~~~~~~~a~~k~Av~~a~~~dv~~~i~~~~---~~~~a~~I 297 (322) T protein:vir:10 221 DLQSKGIITNWMGYTWIVSTRLDKFDPTQWGMAAEDGPQGDEIWCIAMTDMALGYHSCKDIWTKVAEDP---SASFAWRI 297 (322) T ss_pred hhhhcCeeeeeeeEEEEEeccCCccccccccccccCCCCccceeEEEEecCceeEEEeeeeeEEeeccC---Ccchhhhh Confidence 33222 2223333344221 123455556665532111111 11222221111 111 2333 Q ss_pred ccccceeeeeeecccceeeeccC Q lcl|Aclame:pro 314 KKSAGTWGAVIFRPFAVAQMIGV 336 (336) Q Consensus 314 p~~~~t~Gv~ir~P~av~~~~GI 336 (336) -.....|.+.| .|..|+.++=- T Consensus 298 ~~~~~~Ga~ri-~~~gVv~i~~~ 319 (322) T protein:vir:10 298 YSAFTADCVRV-EDEHIFKLRLK 319 (322) T ss_pred hhhhhhCceEe-ccCcEEEEEEe Confidence 44444444444 77766665544 No 125 >protein:vir:9361 Length: 402 # NCBI annotation: SLT orf 37-like protein # Family: family:all:658 # MgeID: mge:166 # MgeName: phi 12 # Cross-refs: genbank:acc:NP_803339;genbank:gi:29028650;genbank:GeneID:1258088 Probab=76.53 E-value=0.13 Score=25.36 Aligned_cols=295 Identities=8% Similarity=-0.029 Sum_probs=127.0 Q ss_pred CchHHHH----HHhh------hcceeccch----hh-------------hccchhHHHHHhhhhhccccc--ccCcc--h Q lcl|Aclame:pro 1 MRDAQRI----QNLA------RAGVILPRS----VQ-------------NVSTPLTEYAMDAADLSPHLS--STGSS--G 49 (336) Q Consensus 1 ~~~~~~~----~~l~------~~g~~~~~~----~~-------------~~~~~~~~~a~da~d~~~~l~--t~~~~--~ 49 (336) ...++.+ ..++ ..+..-+.. .. ..........+.+......++ +.+++ . T Consensus 65 ~~~~~~l~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~~~~~a~~~~t~~~GG~l 144 (402) T protein:vir:93 65 QQRFNIVERQVQDIEEKEKAKVKDKGEAYQSLSDNEKMVKAKAEFYRHAILPNEFEKPSMEAQRLLHALPTGNDSGGDKL 144 (402) T ss_pred HHHHHHHHHHHHHHHHHHHhhhhhccccCCCCchhHHHHHHHHHHHHHHHhhhhHHHHHHhHHHHHhhhccCCCcCCccc Confidence 0000000 0000 000000000 00 000011111121111111222 22333 3 Q ss_pred HHHHHHHhhCceeeeeeccccchhhhcccccCCCcceeeEEEeeee-cceeeEEeecccCCceeeeeeeeeeeeEEEEEE Q lcl|Aclame:pro 50 IPNYLTTYVDPAVIDILVAPMKAAELVGESKKGDWTTLVAAFITAE-PTTKVATYGDYSSDGDSGANINYPQRQSYFFQT 128 (336) Q Consensus 50 i~~~l~~~idp~v~~~~~~~~~~~~l~~v~t~g~w~~~t~~~~~~e-~~G~a~~ygd~~diP~~~~~~~~~~~~v~~~~~ 128 (336) ||..+.+ +|++.+...-..+.+..+.+.+.. .++.++ ..+.+...+.....|-.+.......-.++.++. T Consensus 145 IP~~~~~----~Ii~~~~~~~~l~~~~~v~~~~~~-----~~p~~~~~~~~a~~v~Eg~~~~~~~~~f~~i~~~~~k~~~ 215 (402) T protein:vir:93 145 LPKTLSK----EIVSEPFAKNQLREKARLTNIKGL-----EIPRVSYTLDDDDFITDVETAKELKAKGDTVKFTTNKFKV 215 (402) T ss_pred cchhHHH----HHHHhHHhhhhhhhhceeeecCCc-----eeeeeeccCCccccccccccccccccccceeeecceeeee Confidence 7766654 344444444444556555544432 223222 345567778877888888877888888888888 Q ss_pred EEeeCHHHHHHHHhhCCCHHHHHHHHHHHHHHHhhcce-EEeeccccceEEEEecCCCCcccccccccccccCHHHHHHH Q lcl|Aclame:pro 129 WTRWGERELEMAGAGRVDLASELNYSSALGLAKFLNGS-YLFGVAGLENYGLINDPSLSAPITATTPWSGSPAVEAVVNE 207 (336) Q Consensus 129 ~~~y~~~El~~A~~~g~~l~~~k~~aAr~a~e~~~n~~-~~~Gd~~~g~~GllN~Pnl~~~~~~~t~w~~~~t~~eI~~D 207 (336) .+.+|.+=|.- ...++.+.-....+.++...+++. +..|++...-.|.++++.+... +....++| T Consensus 216 ~i~iS~ell~D---s~~~l~~~i~~~la~~~~~~e~~~~~~~g~g~g~p~g~~~~~~~~~~-----------~~~~~~d~ 281 (402) T protein:vir:93 216 FAAISDTVIHG---SDVDLVNWVENALQSGLAAKERKDALAVSPKSGLEHMSFYNGSVKEV-----------EGADMYDA 281 (402) T ss_pred echhhHHHHhh---hHHHHHHHHHHHHHHHHHHHHHHhHhhcCCCccccceeeeccccccc-----------cccchHHH Confidence 88888553432 345566666666666666665543 3445555555677776544321 11223677 Q ss_pred HHHHHHHHHHHhCCceecccccEEEecHH-HHHhcccCCCCCccHHHHHHHhCCccEEEEcccccCCCCceEEEEEEeec Q lcl|Aclame:pro 208 VVALFQVLQTQSQGIITQEDVLRMGLPPT-AMSDLSKTNQYGLAAAAKLKDIFPKLEFVTIPEYDTASGRLVQLWAPRVE 286 (336) Q Consensus 208 i~~l~~~l~~~s~g~v~~~~p~tL~Lp~~-~~~~L~~~~~~~~Tvl~~l~~n~pnl~i~~~pel~~a~G~~~~~~~~~~~ 286 (336) |.+++.+|...-. + .-..+|-+. ....+....+.|-.++. .-|+ ++-..|-..+++ .....|-+ . T Consensus 282 l~~~~~~l~~~y~-----~-na~~imn~~t~~~~~~~~~d~~~~~~~----~~~~-~llG~PV~~t~~-~~~i~~GD-f- 347 (402) T protein:vir:93 282 IINALADLHEDYR-----D-NATIYMRYADYVKIISVLSNGTTNFFD----TPAE-KVFGKPVVFTDA-AVKPIVGD-F- 347 (402) T ss_pred HHHHHhccChhhh-----c-CCEEEEechHHHHHHHHHhcCCCcccc----cCCc-cccccceEEecC-CCceeeec-h- Confidence 7777776643211 1 123556444 44444433333333321 1122 122222222211 01111111 0 Q ss_pred CCceEEEEc-ChhhhcccceecCCceEEccccceeeeeeecccceeeec--cC Q lcl|Aclame:pro 287 GKDTATCGF-TEKMRAHSIERYSSYFRQKKSAGTWGAVIFRPFAVAQMI--GV 336 (336) Q Consensus 287 ~~~~~~~~~-p~~~r~l~~~~~~~~~~vp~~~~t~Gv~ir~P~av~~~~--GI 336 (336) .. ..+.+ .+-++.. -+........-+..|.+|.++ +|-||+.+. +- T Consensus 348 -~~-~~~~~~~~~~~~~-~~~~~~~~~~~~~~r~Dg~v~-~~~A~~~l~ik~~ 396 (402) T protein:vir:93 348 -NY-FGINYDGTTYDTD-KDVKKGEYLFVLTAWYDQQRT-LDSAFRIAKAKEN 396 (402) T ss_pred -hh-hhhhhhhhhhhhh-hcccCCceEEEEEEEeCcEEe-chhheEEEEeecC Confidence 00 00000 0111111 111223455667778877665 599887543 22 No 126 >protein:vir:739 Length: 231 # NCBI annotation: major structural protein 4 # Family: family:all:522 # MgeID: mge:14 # MgeName: Tuc2009 # Cross-refs: genbank:acc:NP_108716;genbank:gi:13487838;genbank:GeneID:920884 Probab=74.66 E-value=0.16 Score=25.01 Aligned_cols=217 Identities=11% Similarity=0.063 Sum_probs=109.2 Q ss_pred c--ccCCCcceeeEEEeeeecceeeEEeecccCCceeeeeeeeeeeeEEEEEEEEeeCHHHHHHHHhhCCCHHHHHHHHH Q lcl|Aclame:pro 78 E--SKKGDWTTLVAAFITAEPTTKVATYGDYSSDGDSGANINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASELNYSS 155 (336) Q Consensus 78 v--~t~g~w~~~t~~~~~~e~~G~a~~ygd~~diP~~~~~~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~k~~aA 155 (336) + ...| +|++|+.+ .|.+..+++++.+|.........+.+|.+.+-+++++..+... ..|=++ .+-.... T Consensus 1 ~~~~~~G----dtit~P~~--iGda~~v~eG~~i~~~~l~~t~~~atIk~~gk~~~itD~a~l~--~~gDp~-~ea~~Q~ 71 (231) T protein:vir:73 1 ENGINLA----NLCEYPND--IGDAADVAEGGEISLDKIGTTTKSVTIKKAAKGTEITDEAALS--GYGDPI-GESNKQL 71 (231) T ss_pred CccccCC----ceEEeccc--ccchhhhcCCCcCChhhccccceeeeEeeeccceeeeHHHHhh--ccCchH-HHHHHHH Confidence 1 1223 67899865 8999999999999999999999999999988888777665544 455443 4444455 Q ss_pred HHHHHHhhcceEEeeccccceEEEEecCCCCcccccccccccccCHHHHHHHHHHHHHHHHHHhCCceecccccEEEecH Q lcl|Aclame:pro 156 ALGLAKFLNGSYLFGVAGLENYGLINDPSLSAPITATTPWSGSPAVEAVVNEVVALFQVLQTQSQGIITQEDVLRMGLPP 235 (336) Q Consensus 156 r~a~e~~~n~~~~~Gd~~~g~~GllN~Pnl~~~~~~~t~w~~~~t~~eI~~Di~~l~~~l~~~s~g~v~~~~p~tL~Lp~ 235 (336) .+++..++|+-++ +-+. .+..+.++ ..| ++.|++++..+-.. ...+..+++.| T Consensus 72 ~~~iA~kvD~di~---------~~~~----~a~l~~~~----~~t----~d~i~~A~~~fgde------~~~~~vivv~p 124 (231) T protein:vir:73 72 GLSLANKVDDDLL---------KAAK----TTSQTVST----KAN----VDGVQAALDIFNDE------DAQAYVLIVNP 124 (231) T ss_pred HHHHHHhhhHHHH---------Hhhc----cccccccc----ccc----HHHHHHHHHHhccc------cccceEEEEcc Confidence 5566555554211 0000 00001111 123 45555555554221 23578899999 Q ss_pred HHHHhcccCCCCCccHHHHHHH----h-----CCccEEEEcccccCCCCceEEEEEEeecCCceEEEEcChhhhccccee Q lcl|Aclame:pro 236 TAMSDLSKTNQYGLAAAAKLKD----I-----FPKLEFVTIPEYDTASGRLVQLWAPRVEGKDTATCGFTEKMRAHSIER 306 (336) Q Consensus 236 ~~~~~L~~~~~~~~Tvl~~l~~----n-----~pnl~i~~~pel~~a~G~~~~~~~~~~~~~~~~~~~~p~~~r~l~~~~ 306 (336) ..+..|-+--....+ -..... | +-+++|+....+....|.... |+.. +.-+.+..-...+. ..+. T Consensus 125 ~~~~~Lrk~~~~~~~-~~~~g~~i~~~G~iG~i~G~~Vi~S~~~~~~~~~~~~-~i~~---~gAl~~~~k~~~~v-EtdR 198 (231) T protein:vir:73 125 KDAAKIRKDANAKNI-GSEVGANALINGTYADVLGAQIVRSKKLAEGSALMFK-IVSN---SPALKLVLKRGVQV-ETDR 198 (231) T ss_pred hHHHhhhhccchhhh-hhhhccceeeecccceEcceEEEEcCCCCCCceeeee-EEee---ccceeeeeccccee-eccc Confidence 988887441111100 000000 0 223555554444332222111 1111 11111111111110 1111 Q ss_pred cCCc-eEEccccceeeeeeecccceeee--ccC Q lcl|Aclame:pro 307 YSSY-FRQKKSAGTWGAVIFRPFAVAQM--IGV 336 (336) Q Consensus 307 ~~~~-~~vp~~~~t~Gv~ir~P~av~~~--~GI 336 (336) ..+. -..=.....+||-++.|-.++.+ .|+ T Consensus 199 d~~~k~~~i~~~~~y~v~l~~~~~vv~~t~~g~ 231 (231) T protein:vir:73 199 DIVTKTTVITADEHYAAYLYDLTKVVNITFTGV 231 (231) T ss_pred cccccccEEEEeEEEEEEEEcCccEEEEEeecC Confidence 1111 11111224579999999988876 688 No 127 >protein:vir:78350 Length: 383 # NCBI annotation: Cps # Family: family:all:635 # MgeID: mge:1850 # MgeName: B025 # Cross-refs: genbank:acc:YP_001468644;genbank:gi:157325222;genbank:GeneID:5601696 Probab=70.21 E-value=0.21 Score=24.28 Aligned_cols=304 Identities=11% Similarity=0.007 Sum_probs=120.8 Q ss_pred CchHHHHHHhhhcceeccchhhhccchhHHHHHhhhhhcccccccCcchHHHHHHHhhCceeeeeeccccchhhhccccc Q lcl|Aclame:pro 1 MRDAQRIQNLARAGVILPRSVQNVSTPLTEYAMDAADLSPHLSSTGSSGIPNYLTTYVDPAVIDILVAPMKAAELVGESK 80 (336) Q Consensus 1 ~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~a~da~d~~~~l~t~~~~~i~~~l~~~idp~v~~~~~~~~~~~~l~~v~t 80 (336) .+.+...++..+.-+...+....++.+-+.... +.-. +-.+.....+|..+.+-| ++.+...-....+..+.+ T Consensus 49 ~~~~~~~~~~~~~~~~~~~g~~~lt~~e~~~~~-~~~~--~~~~~gg~lvP~~~~~~I----~~~l~~~s~l~~~~~v~~ 121 (383) T protein:vir:78 49 EQAKKEARQEADAYISASRTDKNITNEEIKFFN-DINK--EVGYKEETLLPQTVVDEI----FEDLTTEHPFLASIGMRT 121 (383) T ss_pred HHHHHHHHHHHHHHHHhcCChhhhhHHHHHHHH-HHhc--cCCCCCccccCHHHHHHH----HHHHHhhccceeeeeeEe Confidence 011111111111001111111122222222111 1100 011112233565555433 333322222222333333 Q ss_pred CCCcceeeEEEeeeecceeeEEeecccCCc-eeeeeeeeeeeeEEEEEEEEeeCHHHHHHHHhhCCCHHHHHHHHHHHHH Q lcl|Aclame:pro 81 KGDWTTLVAAFITAEPTTKVATYGDYSSDG-DSGANINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASELNYSSALGL 159 (336) Q Consensus 81 ~g~w~~~t~~~~~~e~~G~a~~ygd~~diP-~~~~~~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~k~~aAr~a~ 159 (336) .+. ...++..+..+.+...+....++ ..+.......-..+.+..-...+.+=|. -..+++.+--.....+++ T Consensus 122 ~~~----~~~i~~~~~~~~a~w~~e~~~~~~~~~~~f~~i~l~~~kl~~~i~is~ell~---Ds~~~ie~~i~~~l~~~~ 194 (383) T protein:vir:78 122 TGL----RTKFLKSETSGVAVWGKIFGEIKGQLDATFSDEESIQNKLTAFVVVPKDLEK---FGPAWVKRFVVTQIEEAF 194 (383) T ss_pred cCC----ceEEEEEcCCcceEEeecccccccccCcceeeEeecceeeEeeccchHHHhh---ccHHHHHHHHHHHHHHHH Confidence 222 13566777777777766555553 4455555566667777666666644333 334578888999999999 Q ss_pred HHhhcceEEeeccccceEEEEecCCCCcccccc-cccccccCHHHHHHHHHHHHHHH---HHHhCCceec-----ccccE Q lcl|Aclame:pro 160 AKFLNGSYLFGVAGLENYGLINDPSLSAPITAT-TPWSGSPAVEAVVNEVVALFQVL---QTQSQGIITQ-----EDVLR 230 (336) Q Consensus 160 e~~~n~~~~~Gd~~~g~~GllN~Pnl~~~~~~~-t~w~~~~t~~eI~~Di~~l~~~l---~~~s~g~v~~-----~~p~t 230 (336) .+.+++-.+.|++..+-.|++++.+.....+.. .+.+. ++..--.+|+..++..+ ...-...... -...+ T Consensus 195 a~~~~~a~i~G~G~~qP~Gil~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~ 273 (383) T protein:vir:78 195 AVALESAYIVGDGNDKPIGLNRKVGKGSTVVDGVYAEKA-ATGTLTFANPKTTVNELTDVYKYHSVKENGHPLNVAGKVT 273 (383) T ss_pred HHHHhhheEeccCCCCceeeeeccCCccccccccccccc-ccchhhhhhhHHHHHHHHHHHhccchhcccchhhhcCceE Confidence 999999999999988999999975432222111 11111 11111123333333222 2111100000 01122 Q ss_pred EEecHHH-HHhcc---cCCCCCc--cHHHHHHHhCCccEEEEcccccC---CCCc-eEEEEEEeecCCceEEEEcChhhh Q lcl|Aclame:pro 231 MGLPPTA-MSDLS---KTNQYGL--AAAAKLKDIFPKLEFVTIPEYDT---ASGR-LVQLWAPRVEGKDTATCGFTEKMR 300 (336) Q Consensus 231 L~Lp~~~-~~~L~---~~~~~~~--Tvl~~l~~n~pnl~i~~~pel~~---a~G~-~~~~~~~~~~~~~~~~~~~p~~~r 300 (336) .++-+.- +..+. .-+..|. +++- | .++|+..+.... .-|. +.+++.++ .+ .++....... T Consensus 274 ~~~n~~~~~~~~~~~~~~~~~G~~~t~l~-----~-~~~iv~s~~~p~~~iifgdfs~Y~i~~r-~~---~~i~~~~~~~ 343 (383) T protein:vir:78 274 LLVNPTDAWDVKKQYTSLNANGVYVTALP-----F-NLNIIESLFVPEKKAISYVAERYDALIG-GP---LDIGTYDQTL 343 (383) T ss_pred EEEcCcchhhhccchhccCCCCceeeecC-----C-CceEEecCCCCcccEEEeeccceEEEec-cc---ceEEecchhh Confidence 3333321 11111 1111121 1110 1 234433222211 1122 12333322 11 1111111111 Q ss_pred cccceecCCceEEccccceeeeeeecccceeeeccC Q lcl|Aclame:pro 301 AHSIERYSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) Q Consensus 301 ~l~~~~~~~~~~vp~~~~t~Gv~ir~P~av~~~~GI 336 (336) + ..-....-...|..| .++.|.|++.++ | T Consensus 344 f-----~~d~~~f~~~~r~dG-~~~~~~A~~vl~-~ 372 (383) T protein:vir:78 344 A-----IEDLNLYAAKQFAYG-KAKDDKAAAVWT-L 372 (383) T ss_pred h-----hcCceEEEEEEEEcC-EEecCCeEEEEE-E Confidence 1 011223344455555 556677766544 4 No 128 >protein:vir:6324 Length: 335 # NCBI annotation: capsid protein # Family: family:all:2806 # MgeID: mge:132 # MgeName: phiKMV # Cross-refs: genbank:acc:NP_877471;genbank:gi:33300843;uniprot:Q7Y2D3;genbank:GeneID:1482613 Probab=68.16 E-value=0.24 Score=23.97 Aligned_cols=274 Identities=14% Similarity=0.085 Sum_probs=110.0 Q ss_pred ccccCcchHHH----------HHHHhhCceeeeeeccccchhhhcccccCCCcceeeEEEeeeecceeeEEee----cc- Q lcl|Aclame:pro 42 LSSTGSSGIPN----------YLTTYVDPAVIDILVAPMKAAELVGESKKGDWTTLVAAFITAEPTTKVATYG----DY- 106 (336) Q Consensus 42 l~t~~~~~i~~----------~l~~~idp~v~~~~~~~~~~~~l~~v~t~g~w~~~t~~~~~~e~~G~a~~yg----d~- 106 (336) |++.++.+.|. |+-.|- -+|.+.....-+.+.++.+.+.- ...++.|+. +|+.++.+ .. T Consensus 1 ms~~~~~tr~~~~~s~~d~al~le~f~-geV~~af~~~s~~~~~~~~rti~--~g~s~~~~~---iG~~~~~~~~pG~~l 74 (335) T protein:vir:63 1 MSFLNDLTRPNYAGKNADVDIHLEEHL-GIVDKHFAYTSKFAPLMNIRDLR--GSNVVRLDR---LGNVEAKGRRAGEEL 74 (335) T ss_pred CCCcccchhhhcccccchhheehhhhh-hhHHHHHHhhhhhccccceeeec--cceeEEEee---eeeeeeecccCCcCc Confidence 22222222222 111111 11111111122233444443321 125566655 58887763 21 Q ss_pred cCCceeeeeeeeeeee--EEEEEEEEeeCHHHHHHHHhhCCCHHHHHHHHHHHHHHHhhcceEE------eec-cccceE Q lcl|Aclame:pro 107 SSDGDSGANINYPQRQ--SYFFQTWTRWGERELEMAGAGRVDLASELNYSSALGLAKFLNGSYL------FGV-AGLENY 177 (336) Q Consensus 107 ~diP~~~~~~~~~~~~--v~~~~~~~~y~~~El~~A~~~g~~l~~~k~~aAr~a~e~~~n~~~~------~Gd-~~~g~~ 177 (336) ...|... ++.... -..+.-.+=|.++|.+. ..++-++-....-.++.++.|+..+ .+. +..... T Consensus 75 ~~~~~~~---~k~~itVD~ll~a~~~I~dlDe~~~----~yDvRse~s~e~G~aLA~~~D~~~~~~i~~aa~~~a~~~~~ 147 (335) T protein:vir:63 75 ERSRVVN---DKWNLTVDTLLYLRHQFDHQDEWTQ----SFDMRKEVAELDGQELARKFDQACLIQVIKAAAMDAPVDLE 147 (335) T ss_pred CCCCccc---cceEEEecceeechhhhhhHHHHhc----CchhHHHHHHHHHHHHHHHHHHHHHHHHHhhccccCccccC Confidence 2223211 221211 12223333344444332 3334344444444444455444332 111 122223 Q ss_pred EEEecCCCCcccccccccccccCHHHHHHHHHHHHHHHHHHhCCceec-ccccEEEecHHHHHhcccC-----CCCCc-- Q lcl|Aclame:pro 178 GLINDPSLSAPITATTPWSGSPAVEAVVNEVVALFQVLQTQSQGIITQ-EDVLRMGLPPTAMSDLSKT-----NQYGL-- 249 (336) Q Consensus 178 GllN~Pnl~~~~~~~t~w~~~~t~~eI~~Di~~l~~~l~~~s~g~v~~-~~p~tL~Lp~~~~~~L~~~-----~~~~~-- 249 (336) |.++ |.....+..++. .+.+.++.+.+=+..+..++..+- .-+. .+.-.++++|.+|..|-.- ++++. T Consensus 148 ~~~~-~G~~~~~~~tg~-~~~~~~~~l~~a~~~a~~~L~e~d--VP~~~~~dr~~vv~P~~y~~Ll~~~~l~n~~~~~s~ 223 (335) T protein:vir:63 148 DAFS-PGVLEKLDLTGL-TAKQAADKIVRMHRRVVETFIDRD--LGDAVYSEGLTPMSPRVFSLLLEHDKLMNVEYQATG 223 (335) T ss_pred CCcC-CCcceeeeeccC-cccccHHHHHHHHHHHHHHHHhcc--CCCcccCceEEEeChHHHHHHhcccccccccccccc Confidence 3333 233222222232 223458888887777777775442 1010 1336799999999988542 12221 Q ss_pred cHHHHHHHh---CCccEEEEcccccCCCC--------------c-eEEEEEEeecCCc--eEEEEcChhhhcccceecCC Q lcl|Aclame:pro 250 AAAAKLKDI---FPKLEFVTIPEYDTASG--------------R-LVQLWAPRVEGKD--TATCGFTEKMRAHSIERYSS 309 (336) Q Consensus 250 Tvl~~l~~n---~pnl~i~~~pel~~a~G--------------~-~~~~~~~~~~~~~--~~~~~~p~~~r~l~~~~~~~ 309 (336) +.-.+.+.. .-+++|+..+.|-+.++ + ...+.+- ....- ++++ .+...+.. -+.+.- T Consensus 224 ~~~~~~~g~v~~v~Gv~V~~sn~lP~~~~t~~~lg~a~n~~~~d~~~~~~~~-~~~~Al~t~~~-~~vt~e~~-~~~~~~ 300 (335) T protein:vir:63 224 ATNDYVKSRVAILNGVKVLETPRFATKAIAAHPLGRHFNVSAEESERQIALF-LPSKTLITAQV-APVQAKLW-EDNEKF 300 (335) T ss_pred ccccccCceeEEeeceEEEeeccCCCCCcccccccccCCccccccceeEEEE-EecceEEEEEE-eeccccee-eccchh Confidence 111122111 12356666666632211 1 0111110 01111 1111 11111110 111223 Q ss_pred ceEEccccceeeeeeecc--cceeeeccC Q lcl|Aclame:pro 310 YFRQKKSAGTWGAVIFRP--FAVAQMIGV 336 (336) Q Consensus 310 ~~~vp~~~~t~Gv~ir~P--~av~~~~GI 336 (336) .|.+.+... .|+-++|| .++....|| T Consensus 301 ~~~i~~~~a-~G~g~lRPe~a~~i~~tg~ 328 (335) T protein:vir:63 301 SWVLDTFQM-YNIGARRPDTAGAIELKGI 328 (335) T ss_pred hHHhHHHHH-cCCcccccceEEEEEEcCC Confidence 455555544 79999999 566678899 No 129 >protein:vir:95376 Length: 425 # NCBI annotation: phage major capsid protein # Family: family:all:635 # MgeID: mge:1567 # MgeName: GBSV1 # Cross-refs: genbank:acc:YP_764476;genbank:gi:115334630;genbank:GeneID:5179263 Probab=65.57 E-value=0.28 Score=23.60 Aligned_cols=302 Identities=12% Similarity=0.082 Sum_probs=127.6 Q ss_pred CchHH-H----HHHhhh-----------cc---eeccchh----hhcc---chhHHHHHhhhhhcccccccCcc--hHHH Q lcl|Aclame:pro 1 MRDAQ-R----IQNLAR-----------AG---VILPRSV----QNVS---TPLTEYAMDAADLSPHLSSTGSS--GIPN 52 (336) Q Consensus 1 ~~~~~-~----~~~l~~-----------~g---~~~~~~~----~~~~---~~~~~~a~da~d~~~~l~t~~~~--~i~~ 52 (336) +++.. + ++.+.+ .| -.+.... ..+. ...+.............++++++ .+|. T Consensus 73 le~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~~vP~ 152 (425) T protein:vir:95 73 LEGEIAQLEDELEQINSKQPSNQSRQKMQGSKGDVVEMNRLQVREMLKTGEYYKRSEVVEFYEKFRNLRAVAGGELTIPE 152 (425) T ss_pred HHHHHHHHHHHHHHhhhhccchhhhhhhhhhhhhHHHHHHHHHHHHHhhhhhhhhhHHHHHHHHHHhhcccccCceeccH Confidence 11100 0 000000 00 0000000 0000 00000000000111112222222 3566 Q ss_pred HHHHhhCceeeeeeccccchhhhcccccCCCcceeeEEEeeeecceeeEEeecccCCceeee-eeeeeeeeEEEEEEEEe Q lcl|Aclame:pro 53 YLTTYVDPAVIDILVAPMKAAELVGESKKGDWTTLVAAFITAEPTTKVATYGDYSSDGDSGA-NINYPQRQSYFFQTWTR 131 (336) Q Consensus 53 ~l~~~idp~v~~~~~~~~~~~~l~~v~t~g~w~~~t~~~~~~e~~G~a~~ygd~~diP~~~~-~~~~~~~~v~~~~~~~~ 131 (336) .+.+ +|++.+-.......++.+.+.. ....+++....+.+...+.+..+|..+. .....+-..+.++..+. T Consensus 153 ~~~~----~Ii~~l~~~~~i~~~~~~~~~~----g~~~ip~~~~~~~a~~v~E~~~~~~~~~~~f~~i~l~~~k~~~~~~ 224 (425) T protein:vir:95 153 VVVN----RIMDIMGDYTTLYPLVDKIRVK----GTTRILVDTDTSPATWIEQSGALPTGDVGTIASIDFDGFKVGKVTF 224 (425) T ss_pred HHHH----HHHHHHHhhhhHHHhhceeecC----ceeEEEEecCCccccccccccccccccccccceeeeeheeeeeeeh Confidence 5554 3333332222233333332211 1346677777788888888888888775 46777778888888888 Q ss_pred eCHHHHHHHHhhCCCHHHHHHHHHHHHHHHhhcceEEeeccc--cceEEEEecCCCCcccccccccccccCHHHHHHHHH Q lcl|Aclame:pro 132 WGERELEMAGAGRVDLASELNYSSALGLAKFLNGSYLFGVAG--LENYGLINDPSLSAPITATTPWSGSPAVEAVVNEVV 209 (336) Q Consensus 132 y~~~El~~A~~~g~~l~~~k~~aAr~a~e~~~n~~~~~Gd~~--~g~~GllN~Pnl~~~~~~~t~w~~~~t~~eI~~Di~ 209 (336) +|.+=|..+. .++.+--....+.++.+.+++-.++|++. ..-.|++++ ++.... .+...++. .++|+. T Consensus 225 iS~ell~ds~---~~l~~~i~~~l~~~i~~~~d~~il~G~G~~~~~p~Gil~~--~~~~~~-~~~~~~~~----~~~~~~ 294 (425) T protein:vir:95 225 VDNYLLQDSI---INLDDYVTKKIARAIAKALDLAIVKGTGAANKQPLGIIPS--LPPENQ-VTVEADNN----LLKNLV 294 (425) T ss_pred hhHHHHhccH---HHHHHHHHHHHHHHHHHHHHHHhhccCCCCccccceeecc--cccccc-cccccccc----hHHHHH Confidence 8876554433 36888888888899999999999999864 345799985 322111 11122223 345666 Q ss_pred HHHHHHHHHhCCceecccccEEEecHH-HHHhcc----cCCCCCccHHHHHHHhC--C---ccEEEEcccccCC---CCc Q lcl|Aclame:pro 210 ALFQVLQTQSQGIITQEDVLRMGLPPT-AMSDLS----KTNQYGLAAAAKLKDIF--P---KLEFVTIPEYDTA---SGR 276 (336) Q Consensus 210 ~l~~~l~~~s~g~v~~~~p~tL~Lp~~-~~~~L~----~~~~~~~Tvl~~l~~n~--p---nl~i~~~pel~~a---~G~ 276 (336) +++..+...... . .....+|.+. .+..|. ..+..|.-++. ..+. | +..++..+.+... -|. T Consensus 295 ~~~~~~~~~~~~---~-~~~~~v~~~~~~~~~l~~l~~~kd~~g~~i~~--~~~~~~~~l~G~pvv~~~~~~~~~i~~Gd 368 (425) T protein:vir:95 295 KQIGLIDTGDDS---V-GEIVAVMKRSTYYNRLVEFSIQVDSNGNVVGK--LPNLRTPDLLGLRVVFNNFLDDDTVLFGE 368 (425) T ss_pred HHHHhhhhhccc---c-CceEEEEeChHHHHHHHHHHhhcCCCCceeec--cCCCCCccccceeeEEcCcCCCccEEEEe Confidence 666555332210 1 1123444444 344332 12333332211 0111 1 1122221111110 022 Q ss_pred eEEEEEEeecCCceEEEEcChhhhcccceecCCceEEccccceeeeeeecccceeeeccC Q lcl|Aclame:pro 277 LVQLWAPRVEGKDTATCGFTEKMRAHSIERYSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) Q Consensus 277 ~~~~~~~~~~~~~~~~~~~p~~~r~l~~~~~~~~~~vp~~~~t~Gv~ir~P~av~~~~GI 336 (336) -.+.++-.+. + ..+.+...- ....-...+-+..|.. +.+++|-||++++ | T Consensus 369 ~~~~~~~~~~-~--~~i~~~~~~-----~f~~~~~~~~~~~r~d-~~~~~~~a~~~~~-i 418 (425) T protein:vir:95 369 FEQYTLVERE-N--ITIDSSTHV-----KFTEDQTAFRGKGRFD-GKPVKPEAFVLVT-I 418 (425) T ss_pred cccEEEEeec-c--eEEEeeccc-----ccccCceEEEEEEeeC-cEeecccceEEEE-e Confidence 1122211111 1 111111100 0011123333444444 4667788888773 3 No 130 >protein:vir:9704 Length: 394 # NCBI annotation: hypothetical protein # Family: family:all:21 # MgeID: mge:174 # MgeName: 315.2 # Cross-refs: genbank:acc:NP_795466;genbank:gi:28876225;genbank:GeneID:1257769 Probab=62.76 E-value=0.33 Score=23.23 Aligned_cols=281 Identities=10% Similarity=0.022 Sum_probs=116.4 Q ss_pred CchHHHHHHhhh-cceeccchhhhcc-ch-hHHH-HHhhhhhcccccccCcch--HHHHHHHhhCceeeeeeccccchhh Q lcl|Aclame:pro 1 MRDAQRIQNLAR-AGVILPRSVQNVS-TP-LTEY-AMDAADLSPHLSSTGSSG--IPNYLTTYVDPAVIDILVAPMKAAE 74 (336) Q Consensus 1 ~~~~~~~~~l~~-~g~~~~~~~~~~~-~~-~~~~-a~da~d~~~~l~t~~~~~--i~~~l~~~idp~v~~~~~~~~~~~~ 74 (336) +.-+..+....+ .+-.......... .. .... ..........-.+..++| +|..+. ..|++.+........ T Consensus 85 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~~~gg~liP~~~~----~~ii~~~~~~~~l~~ 160 (394) T protein:vir:97 85 KTYRESVNDFIRSKGKIVNDSLRFEGKDEVLMPINETTPVEPQKDGIKKENAKPVSSEEIL----YTPAREVKTVVDLKP 160 (394) T ss_pred HHHHHHHHHHHHHHHHHhhhhhhhhhHHHHHHHHHhhhhhhhhccccccccccccChHHHH----HHHHHHhhhhhhhhh Confidence 110111110000 0000000000000 00 0000 000001111112333333 675544 355666555555555 Q ss_pred hcccccCCCcceeeEEEeeeec-ceeeEEeecccCCce-eeeeeeeeeeeEEEEEEEEeeCHHHHHHHHhhCCCHHHHHH Q lcl|Aclame:pro 75 LVGESKKGDWTTLVAAFITAEP-TTKVATYGDYSSDGD-SGANINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASELN 152 (336) Q Consensus 75 l~~v~t~g~w~~~t~~~~~~e~-~G~a~~ygd~~diP~-~~~~~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~k~ 152 (336) +..+.+... .+..+++... .+.+...+.+...|- .+...+..+-....++..+.+|.+=++-+ ..++.+.-. T Consensus 161 ~~~~~~~~~---~~~~~~~~~~~~~~~~~v~E~~~~~~~~~~~~~~v~l~~~k~~~~i~is~ell~ds---~~~~~~~i~ 234 (394) T protein:vir:97 161 FTTVYQAKK---ASGKYPVLQRATTKMVTVAELEKNPALAKPDFKDVAWNIDTYRGAIPLSQESIDDA---DVDLVGIVS 234 (394) T ss_pred hceeeeccC---cceEEEEEecCCCccceecccccccccccccceeEEeehhheeeehhhHHHHHhhh---hHHHHHHHH Confidence 555432211 1234555543 345677788777874 44566666777777877777776534333 345666666 Q ss_pred HHHHHHHHHhhcceEEeeccccceEEEEecCCCCcccccccccccccCHHHHHHHHHHHHHHHHHHhCCceecccccEEE Q lcl|Aclame:pro 153 YSSALGLAKFLNGSYLFGVAGLENYGLINDPSLSAPITATTPWSGSPAVEAVVNEVVALFQVLQTQSQGIITQEDVLRMG 232 (336) Q Consensus 153 ~aAr~a~e~~~n~~~~~Gd~~~g~~GllN~Pnl~~~~~~~t~w~~~~t~~eI~~Di~~l~~~l~~~s~g~v~~~~p~tL~ 232 (336) ...+.++...+|.-.+.|.... ++. +.++.++| ..+++...... ..-.++ T Consensus 235 ~~la~~~~~~~~~~i~~g~~~~---------------~~~----~~~~~~~~----~~~~~~~~~~~-------~~a~~v 284 (394) T protein:vir:97 235 ESISQIKVNTTNDAIAKVLKSF---------------TTK----TVKNLDEI----KALLNGGFDPA-------YNVSLI 284 (394) T ss_pred HHHHHHHHHHHHHHHhhccccc---------------ccc----ccccHHHH----HHHHHhhhhhh-------hCCEEE Confidence 6666666666665444443210 011 12234444 44443322211 123589 Q ss_pred ecHHHHHhccc-CCCCCccHHH-HHHHhCC----ccEEEEcccccCCCCceEE---------EEEEeecCCceEEEEcCh Q lcl|Aclame:pro 233 LPPTAMSDLSK-TNQYGLAAAA-KLKDIFP----KLEFVTIPEYDTASGRLVQ---------LWAPRVEGKDTATCGFTE 297 (336) Q Consensus 233 Lp~~~~~~L~~-~~~~~~Tvl~-~l~~n~p----nl~i~~~pel~~a~G~~~~---------~~~~~~~~~~~~~~~~p~ 297 (336) |.|..+..|.. .+..|.-++. -+...-+ +..++..+ +.+.|.... .++++ . . ..+.. T Consensus 285 ~n~~~~~~l~~lkd~~G~~i~~~~~~~~~~~~l~G~pv~~~~--~~~~~~~~~~~gd~~~~~~~~~~-~-~--~~~~~-- 356 (394) T protein:vir:97 285 VSQSFYQTLDTLKDGNGRYLLQDDITAVSGKVLLGKPVFVLS--DEVLGANKAFIGDFKRGVLFADR-K-D--LGLRW-- 356 (394) T ss_pred EcHHHHHHHHHhhccCCCeeeecCcCCCCCceeccceeEEec--ccccCCccEEEeeccccEEEEEe-c-c--eEEEE-- Confidence 99999888753 3333432321 0111111 12222222 122232222 22211 0 1 11111 Q ss_pred hhhcccceecCCceEEccccceeeeeeecccceeeeccC Q lcl|Aclame:pro 298 KMRAHSIERYSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) Q Consensus 298 ~~r~l~~~~~~~~~~vp~~~~t~Gv~ir~P~av~~~~GI 336 (336) -.+. .... ..-+..|.+| .+.+|.||+.++.= T Consensus 357 --~~~~--~~~~--~~~~~~r~d~-~v~~~~a~~~~~~~ 388 (394) T protein:vir:97 357 --ADNE--IYGQ--YLQAVLRFGV-SKVDDKAGYYVTFT 388 (394) T ss_pred --eccc--ccce--eEEEEEEEcc-EEecccceEEEEec Confidence 0000 0011 1245566544 56689999987776 No 131 >protein:vir:94622 Length: 341 # NCBI annotation: PfWMP4_37 # Family: family:all:2203 # MgeID: mge:1525 # MgeName: Pf-WMP4 # Cross-refs: genbank:acc:YP_762667;genbank:gi:115304375;genbank:GeneID:5142322 Probab=46.35 E-value=0.74 Score=21.31 Aligned_cols=279 Identities=14% Similarity=0.036 Sum_probs=111.5 Q ss_pred HHHhhhhhcccccccCcchHHHHHHHhhCceee-----eeeccccchhhhcccccCCCcc-eeeEEEeeeecceeeEEee Q lcl|Aclame:pro 31 YAMDAADLSPHLSSTGSSGIPNYLTTYVDPAVI-----DILVAPMKAAELVGESKKGDWT-TLVAAFITAEPTTKVATYG 104 (336) Q Consensus 31 ~a~da~d~~~~l~t~~~~~i~~~l~~~idp~v~-----~~~~~~~~~~~l~~v~t~g~w~-~~t~~~~~~e~~G~a~~yg 104 (336) |++-=....+.++| ++-++|| |+++ +.+........++. +..++.. -+++.++..- ...+.-|. T Consensus 1 ~~~~~~~~~~~~~t-------~~v~~fi-pei~s~~i~~~l~~~~v~~~~~~-d~~~~~~~Gdtv~ip~~g-~~~~~d~~ 70 (341) T protein:vir:94 1 MALGNTITGPSINT-------QRGQQFI-PEQWLSEVQMFRKAKMLDTSVVK-TWGAQVKKGDTFHVPRIS-ELGVEDKA 70 (341) T ss_pred Ccchhhhccccccc-------hhHHHHH-HHHHHHHHHHHHHhhcchhhccc-cccccccCCceEEEeccC-cceeeeec Confidence 44422222333332 2344454 4443 33344445555553 2233332 2678888642 33455554 Q ss_pred cccCCceeeeeeeeeeeeEEEE-EEEEeeCHHHHHHHHhhCCCHHHHHHHHHHHHHHHhhcceEEeeccccceEEEEecC Q lcl|Aclame:pro 105 DYSSDGDSGANINYPQRQSYFF-QTWTRWGERELEMAGAGRVDLASELNYSSALGLAKFLNGSYLFGVAGLENYGLINDP 183 (336) Q Consensus 105 d~~diP~~~~~~~~~~~~v~~~-~~~~~y~~~El~~A~~~g~~l~~~k~~aAr~a~e~~~n~~~~~Gd~~~g~~GllN~P 183 (336) -...++.-+.+..+...++-.. ..++.++..|... ...++-.+-.+.+..++.+..++..+---+...... -+ T Consensus 71 ~~~~i~~~~~~~~~~~itiD~~~~~~~~i~d~d~~~---~~~d~~~~~~~~~~~aLA~~~D~~i~~~~a~~~~~~---~~ 144 (341) T protein:vir:94 71 TDVPVGVQPVNDTDFVITVDTDRTTAVALDDLLEIQ---ASYDLRAPYLEAMGYALAKDMTGSILGLRAAVQNTA---SQ 144 (341) T ss_pred CCCccccccccCceEEEEEeeeeecceeechHHHHh---hccchHHHHHHHHHHHHHHHHHHHHHHHhhhccccc---cC Confidence 3445555555444444444232 3456666555432 345676766666666776666654221001000000 01 Q ss_pred CCCcccccccccccccCHHH-HHHHHHHHHHHHHHHhCCceecccccEEEecHHHHHhcccCCCC------CccHHHHHH Q lcl|Aclame:pro 184 SLSAPITATTPWSGSPAVEA-VVNEVVALFQVLQTQSQGIITQEDVLRMGLPPTAMSDLSKTNQY------GLAAAAKLK 256 (336) Q Consensus 184 nl~~~~~~~t~w~~~~t~~e-I~~Di~~l~~~l~~~s~g~v~~~~p~tL~Lp~~~~~~L~~~~~~------~~Tvl~~l~ 256 (336) +.. . ......+.+++. .++.|.++...+-.. + + +...-.++++|..+..|.+-+.+ |.. -++ T Consensus 145 ~~~--~--~~~~~~t~~~~~~~~~~i~~a~~~Lde~--~-V-P~~gR~lvv~P~~~~~Ll~~~~~~~~~~~g~~---~l~ 213 (341) T protein:vir:94 145 NVF--S--SSNGAITGNGQAFSFAVFLAARRLLLEA--D-V-PEEKIVLLISPGQESALFTIPQFISKDFINNA---PIA 213 (341) T ss_pred ccc--c--CccccccCchhhhhHHHHHHHHHHHhhc--C-C-CccCCEEEeCHHHHHHHhhchhhhhhhccccc---hhh Confidence 110 0 001111111222 234444554444222 1 2 23346799999999988642211 111 122 Q ss_pred H----hCCccEEEEcccccCCCCce----EEEEE----------------------------EeecCCceEEEEcChhhh Q lcl|Aclame:pro 257 D----IFPKLEFVTIPEYDTASGRL----VQLWA----------------------------PRVEGKDTATCGFTEKMR 300 (336) Q Consensus 257 ~----n~pnl~i~~~pel~~a~G~~----~~~~~----------------------------~~~~~~~~~~~~~p~~~r 300 (336) + +.-.++|...+.+-..++.. ....+ -+.+.--..++.=|+.++ T Consensus 214 ~G~ig~i~G~~V~~Sn~lp~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~gl~~~~~av~~~k~~~~~~~~ 293 (341) T protein:vir:94 214 QGQIGSLMGVRVIRTSLIGNNSATGWRNGAPTIAPAEATPGFTGSRYLPKQDSFTSLPATFTGNSRPVHTAVMCHMDWAA 293 (341) T ss_pred eeeeeeEeceEEEEeccccccccccccccccceecccccccccccccccccccccccEEEEEEecccccceeeecchhhh Confidence 1 12234555544443211110 00000 000000011111234344 Q ss_pred ccccee-cCCceEEccccc-------eeeeeeecccceeeeccC Q lcl|Aclame:pro 301 AHSIER-YSSYFRQKKSAG-------TWGAVIFRPFAVAQMIGV 336 (336) Q Consensus 301 ~l~~~~-~~~~~~vp~~~~-------t~Gv~ir~P~av~~~~GI 336 (336) ...+|. +......+.... ..|+-+.||-+++.+.=- T Consensus 294 ~~~~~~~~~~~~~~~~~~~~~i~~~~~~G~~~lrp~~~v~~~~~ 337 (341) T protein:vir:94 294 AVVSKAPRVTQSFENREQVWLMVGRQAYGARLYRPLHAVNIHTT 337 (341) T ss_pred ccccccccccccchhhhhhhhhhhhhhhcccccCcceeEEEecC Confidence 433321 111111111111 346777777775443333 No 132 >protein:vir:9509 Length: 381 # NCBI annotation: hypothetical protein # Family: family:all:635 # MgeID: mge:170 # MgeName: phiN315 # Cross-refs: genbank:acc:NP_835556;genbank:gi:30043951;genbank:GeneID:1260537 Probab=44.30 E-value=0.81 Score=21.08 Aligned_cols=307 Identities=14% Similarity=0.081 Sum_probs=119.0 Q ss_pred CchHHHHH-HhhhcceeccchhhhccchhHHHHHhhhhhcccccccCcch--HHHHHHHhhCceeeeeeccccchhhhcc Q lcl|Aclame:pro 1 MRDAQRIQ-NLARAGVILPRSVQNVSTPLTEYAMDAADLSPHLSSTGSSG--IPNYLTTYVDPAVIDILVAPMKAAELVG 77 (336) Q Consensus 1 ~~~~~~~~-~l~~~g~~~~~~~~~~~~~~~~~a~da~d~~~~l~t~~~~~--i~~~l~~~idp~v~~~~~~~~~~~~l~~ 77 (336) .++...-. .-.+.-+..-+....++.+-+.... +. .. .+.+++| +|..+.+ +|++.+...-..+.+.. T Consensus 41 ~~~~~~~~~~e~~~~~~~~~~~~~lt~~e~~~~~-~~--~~--~~~~~gg~lvP~~~~~----~I~~~l~~~s~i~~~~~ 111 (381) T protein:vir:95 41 FEETKLQAKAEAERVSSLPKSAQSLSANQRSFFM-DI--NK--NVNYKEEKLLPEETID----RIFEDLTTNHPLLADLG 111 (381) T ss_pred hhhHHHHHHHHHHHHHHhccCcccccHHHHHHHH-HH--hc--ccCCCCceecCHHHHH----HHHHHHHhhccceehee Confidence 00000000 0000000000000111111111110 10 00 1222333 6655554 33444333222333343 Q ss_pred cccCCCcceeeEEEeeeecceeeEEeecccCCc-eeeeeeeeeeeeEEEEEEEEeeCHHHHHHHHhhCCCHHHHHHHHHH Q lcl|Aclame:pro 78 ESKKGDWTTLVAAFITAEPTTKVATYGDYSSDG-DSGANINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASELNYSSA 156 (336) Q Consensus 78 v~t~g~w~~~t~~~~~~e~~G~a~~ygd~~diP-~~~~~~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~k~~aAr 156 (336) +.+.+.. ..+...+..+.+.+.+....++ -.+.......-..+.+.....+|.+=|. ...+++.+--..... T Consensus 112 v~~~~~~----~~i~~~~~~~~a~w~~e~~~~~~~~~~~f~~i~l~~~kl~~~~~is~elL~---Ds~~~ie~~i~~~la 184 (381) T protein:vir:95 112 IKNAGLR----LKFLKSETSGVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLND---FGPAWIERFVRVQIE 184 (381) T ss_pred eEecCcc----eEEEEecCCcceeeecccccccccccccceeeeecceeEEeechhhHHHhh---cCHHHHHHHHHHHHH Confidence 4333321 3456667777777766655553 4455555566667777666666644333 244578888888888 Q ss_pred HHHHHhhcceEEeeccccceEEEEecCCCCccccccc-ccc------cccCHHHHHHHHHHHHHHHHHHhCCceec-ccc Q lcl|Aclame:pro 157 LGLAKFLNGSYLFGVAGLENYGLINDPSLSAPITATT-PWS------GSPAVEAVVNEVVALFQVLQTQSQGIITQ-EDV 228 (336) Q Consensus 157 ~a~e~~~n~~~~~Gd~~~g~~GllN~Pnl~~~~~~~t-~w~------~~~t~~eI~~Di~~l~~~l~~~s~g~v~~-~~p 228 (336) .++.+.+++-++.|++..+-.|+++++......+... ++. ...++...++.+..++..+-..-.+.... ..- T Consensus 185 ~~~a~~~~~a~i~G~G~~qP~Gil~~~~~~~~~~~g~~~~~~~~~t~t~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~ 264 (381) T protein:vir:95 185 EAFAVALETAFLKGTGKDQPIGLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGN 264 (381) T ss_pred HHHHHHhhheeEeccCCCCceeeeeccCcccccccccccccccccccccccchhhHHHHHHHHHhhccccccccccccCc Confidence 9999999999999999999999999865432222111 110 01122333455555555553322221100 011 Q ss_pred cEEEecHHHHHhcc----cCCCCCccHHHHHHHhCCccEEEEcccccC---CCCce-EEEEEEeecCCceEEEEcChhhh Q lcl|Aclame:pro 229 LRMGLPPTAMSDLS----KTNQYGLAAAAKLKDIFPKLEFVTIPEYDT---ASGRL-VQLWAPRVEGKDTATCGFTEKMR 300 (336) Q Consensus 229 ~tL~Lp~~~~~~L~----~~~~~~~Tvl~~l~~n~pnl~i~~~pel~~---a~G~~-~~~~~~~~~~~~~~~~~~p~~~r 300 (336) -+++|-+.-+..|- ..++.|.-+... -| +++|+..+.... .-|.- .+.+.++ .+- ++....... T Consensus 265 a~~~mn~~t~~~l~~~~~~~~~~G~~v~~l---~~-g~~vv~s~~~p~~~iifgDfs~Y~i~~r-~~~---~i~~~~~~~ 336 (381) T protein:vir:95 265 VTMVVNPSDAFEVQAQYTHLNANGVYVTAL---PF-NLNVIESTVQEAGKVLTYVKGLYDGYLA-GGI---NVQKFKETL 336 (381) T ss_pred eEEEEccccHHhhccccccCCCCCceeecC---CC-CceEEecCCCCcCcEEEEecccEEEEEe-ccc---EEEeechhH Confidence 23455555433331 112223111000 01 233332211110 01211 1222221 111 111000000 Q ss_pred cccceecCCceEEccccceeeeee-ecccceeeeccC Q lcl|Aclame:pro 301 AHSIERYSSYFRQKKSAGTWGAVI-FRPFAVAQMIGV 336 (336) Q Consensus 301 ~l~~~~~~~~~~vp~~~~t~Gv~i-r~P~av~~~~GI 336 (336) + ..-....-...|..|..+ -.=+.+..+.-. T Consensus 337 ~-----~~d~~~f~a~~r~dg~~~~~~A~~v~~l~~~ 368 (381) T protein:vir:95 337 A-----LDDMDLYTAKQFAYGKAKDNKVAAVWKLDLK 368 (381) T ss_pred h-----hcCCeEEEEEEEEcCEEecCceEEEEEEEec Confidence 0 001122223333333211 111222222222 No 133 >protein:vir:101291 Length: 381 # NCBI annotation: hypothetical protein # Family: family:all:635 # MgeID: mge:1591 # MgeName: phiNM3 # Cross-refs: genbank:acc:YP_908831;genbank:gi:118725095;genbank:GeneID:4555862 Probab=44.30 E-value=0.81 Score=21.08 Aligned_cols=307 Identities=14% Similarity=0.081 Sum_probs=119.0 Q ss_pred CchHHHHH-HhhhcceeccchhhhccchhHHHHHhhhhhcccccccCcch--HHHHHHHhhCceeeeeeccccchhhhcc Q lcl|Aclame:pro 1 MRDAQRIQ-NLARAGVILPRSVQNVSTPLTEYAMDAADLSPHLSSTGSSG--IPNYLTTYVDPAVIDILVAPMKAAELVG 77 (336) Q Consensus 1 ~~~~~~~~-~l~~~g~~~~~~~~~~~~~~~~~a~da~d~~~~l~t~~~~~--i~~~l~~~idp~v~~~~~~~~~~~~l~~ 77 (336) .++...-. .-.+.-+..-+....++.+-+.... +. .. .+.+++| +|..+.+ +|++.+...-..+.+.. T Consensus 41 ~~~~~~~~~~e~~~~~~~~~~~~~lt~~e~~~~~-~~--~~--~~~~~gg~lvP~~~~~----~I~~~l~~~s~i~~~~~ 111 (381) T protein:vir:10 41 FEETKLQAKAEAERVSSLPKSAQSLSANQRSFFM-DI--NK--NVNYKEEKLLPEETID----RIFEDLTTNHPLLADLG 111 (381) T ss_pred hhhHHHHHHHHHHHHHHhccCcccccHHHHHHHH-HH--hc--ccCCCCceecCHHHHH----HHHHHHHhhccceehee Confidence 00000000 0000000000000111111111110 10 00 1222333 6655554 33444333222333343 Q ss_pred cccCCCcceeeEEEeeeecceeeEEeecccCCc-eeeeeeeeeeeeEEEEEEEEeeCHHHHHHHHhhCCCHHHHHHHHHH Q lcl|Aclame:pro 78 ESKKGDWTTLVAAFITAEPTTKVATYGDYSSDG-DSGANINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASELNYSSA 156 (336) Q Consensus 78 v~t~g~w~~~t~~~~~~e~~G~a~~ygd~~diP-~~~~~~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~k~~aAr 156 (336) +.+.+.. ..+...+..+.+.+.+....++ -.+.......-..+.+.....+|.+=|. ...+++.+--..... T Consensus 112 v~~~~~~----~~i~~~~~~~~a~w~~e~~~~~~~~~~~f~~i~l~~~kl~~~~~is~elL~---Ds~~~ie~~i~~~la 184 (381) T protein:vir:10 112 IKNAGLR----LKFLKSETSGVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLND---FGPAWIERFVRVQIE 184 (381) T ss_pred eEecCcc----eEEEEecCCcceeeecccccccccccccceeeeecceeEEeechhhHHHhh---cCHHHHHHHHHHHHH Confidence 4333321 3456667777777766655553 4455555566667777666666644333 244578888888888 Q ss_pred HHHHHhhcceEEeeccccceEEEEecCCCCccccccc-ccc------cccCHHHHHHHHHHHHHHHHHHhCCceec-ccc Q lcl|Aclame:pro 157 LGLAKFLNGSYLFGVAGLENYGLINDPSLSAPITATT-PWS------GSPAVEAVVNEVVALFQVLQTQSQGIITQ-EDV 228 (336) Q Consensus 157 ~a~e~~~n~~~~~Gd~~~g~~GllN~Pnl~~~~~~~t-~w~------~~~t~~eI~~Di~~l~~~l~~~s~g~v~~-~~p 228 (336) .++.+.+++-++.|++..+-.|+++++......+... ++. ...++...++.+..++..+-..-.+.... ..- T Consensus 185 ~~~a~~~~~a~i~G~G~~qP~Gil~~~~~~~~~~~g~~~~~~~~~t~t~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~ 264 (381) T protein:vir:10 185 EAFAVALETAFLKGTGKDQPIGLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKSVAVKGN 264 (381) T ss_pred HHHHHHhhheeEeccCCCCceeeeeccCcccccccccccccccccccccccchhhHHHHHHHHHhhccccccccccccCc Confidence 9999999999999999999999999865432222111 110 01122333455555555553322221100 011 Q ss_pred cEEEecHHHHHhcc----cCCCCCccHHHHHHHhCCccEEEEcccccC---CCCce-EEEEEEeecCCceEEEEcChhhh Q lcl|Aclame:pro 229 LRMGLPPTAMSDLS----KTNQYGLAAAAKLKDIFPKLEFVTIPEYDT---ASGRL-VQLWAPRVEGKDTATCGFTEKMR 300 (336) Q Consensus 229 ~tL~Lp~~~~~~L~----~~~~~~~Tvl~~l~~n~pnl~i~~~pel~~---a~G~~-~~~~~~~~~~~~~~~~~~p~~~r 300 (336) -+++|-+.-+..|- ..++.|.-+... -| +++|+..+.... .-|.- .+.+.++ .+- ++....... T Consensus 265 a~~~mn~~t~~~l~~~~~~~~~~G~~v~~l---~~-g~~vv~s~~~p~~~iifgDfs~Y~i~~r-~~~---~i~~~~~~~ 336 (381) T protein:vir:10 265 VTMVVNPSDAFEVQAQYTHLNANGVYVTAL---PF-NLNVIESTVQEAGKVLTYVKGLYDGYLA-GGI---NVQKFKETL 336 (381) T ss_pred eEEEEccccHHhhccccccCCCCCceeecC---CC-CceEEecCCCCcCcEEEEecccEEEEEe-ccc---EEEeechhH Confidence 23455555433331 112223111000 01 233332211110 01211 1222221 111 111000000 Q ss_pred cccceecCCceEEccccceeeeee-ecccceeeeccC Q lcl|Aclame:pro 301 AHSIERYSSYFRQKKSAGTWGAVI-FRPFAVAQMIGV 336 (336) Q Consensus 301 ~l~~~~~~~~~~vp~~~~t~Gv~i-r~P~av~~~~GI 336 (336) + ..-....-...|..|..+ -.=+.+..+.-. T Consensus 337 ~-----~~d~~~f~a~~r~dg~~~~~~A~~v~~l~~~ 368 (381) T protein:vir:10 337 A-----LDDMDLYTAKQFAYGKAKDNKVAAVWKLDLK 368 (381) T ss_pred h-----hcCCeEEEEEEEEcCEEecCceEEEEEEEec Confidence 0 001122223333333211 111222222222 No 134 >protein:vir:3158 Length: 321 # NCBI annotation: capsid protein gpE # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:316 # MgeName: PhiCh1 # Cross-refs: genbank:acc:NP_665929;genbank:gi:22091115;genbank:GeneID:951342 Probab=43.76 E-value=0.83 Score=21.02 Aligned_cols=283 Identities=9% Similarity=-0.017 Sum_probs=112.0 Q ss_pred CchHHHHHHh---hh-cceeccchhhhccchhHHHHHhhhhhcccccccCcchH-HHHHHHhhCceeeeeeccccchhhh Q lcl|Aclame:pro 1 MRDAQRIQNL---AR-AGVILPRSVQNVSTPLTEYAMDAADLSPHLSSTGSSGI-PNYLTTYVDPAVIDILVAPMKAAEL 75 (336) Q Consensus 1 ~~~~~~~~~l---~~-~g~~~~~~~~~~~~~~~~~a~da~d~~~~l~t~~~~~i-~~~l~~~idp~v~~~~~~~~~~~~l 75 (336) |-+...-+.| ++ .++..+ |+..+ ..+ |.+.+.+++ .+.+- -+-++...+ T Consensus 1 ~~~k~~~~~l~~~~~~~~~~~~------------------~~~~g------~~v~~~~~~~l~~-~i~e~-s~~l~~i~v 54 (321) T protein:vir:31 1 MASRTINNDLSRITEKNALTVD------------------DLDAG------GTLPDPLWDEFWT-DMIEE-TPLLDAIRT 54 (321) T ss_pred CchHHHHHHHHHHHHhcccccc------------------ccCCc------ceeCHHHHHHHHH-HHHHh-hhhhhhcee Confidence 3332221111 11 111111 11111 112 233333332 22221 222333333 Q ss_pred cccccCCCcceeeEEEeeeecceeeEEeec-c-cCCceeeeeeeeeeeeEEEEEEEEeeCHHHHHHHHhhCCCHHHHHHH Q lcl|Aclame:pro 76 VGESKKGDWTTLVAAFITAEPTTKVATYGD-Y-SSDGDSGANINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASELNY 153 (336) Q Consensus 76 ~~v~t~g~w~~~t~~~~~~e~~G~a~~ygd-~-~diP~~~~~~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~k~~ 153 (336) +|+++ ..- ....+...|.+...++ . ...+..+.......-..+.+..-...+.+-|. ..+.+-++.+.-.. T Consensus 55 ~~v~~---~~~---~i~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~k~~~~~~it~e~L~-d~a~~~d~e~~i~~ 127 (321) T protein:vir:31 55 ETVGA---KKT---RIPTLNIGERHRRPQDEGEWNENESDVSTGTIDISTEKATVAWDLPREVVQ-ENPEGEALADRILN 127 (321) T ss_pred eeccC---cce---eeeeeccCCcccccccccccccccccceeeeeeeeeEEEEeehhccHHHHH-hhhcchhHHHHHHH Confidence 33321 110 1111111122211121 1 11122222222223333444444444444343 33446789999999 Q ss_pred HHHHHHHHhhcceEEeeccccce------EEEEecCCCCcccccccccccccCHHHHHHHHHHHHHHHHHHhCCceeccc Q lcl|Aclame:pro 154 SSALGLAKFLNGSYLFGVAGLEN------YGLINDPSLSAPITATTPWSGSPAVEAVVNEVVALFQVLQTQSQGIITQED 227 (336) Q Consensus 154 aAr~a~e~~~n~~~~~Gd~~~g~------~GllN~Pnl~~~~~~~t~w~~~~t~~eI~~Di~~l~~~l~~~s~g~v~~~~ 227 (336) ..++++...+..+.|+|++.... .|+++.+.-..... +...+..+. +++.+++..|-..- .+. T Consensus 128 ~ia~~~a~~~~~~~~nGd~~~~~~~~~~n~G~l~~a~~~~~~~--~~~~~~~~~----d~l~~l~~~l~~~y-----r~~ 196 (321) T protein:vir:31 128 LMTDAWSADVEDLAANGDEDAEDSFENQNDGFITVAEGDVETI--DAADDILDN----DLVIRTIAGLDSKY-----RAR 196 (321) T ss_pred HHHHHHHHHHHhheeeccccCCCcccccchhhhhhhccccccc--cccccccCH----HHHHHHHHhccHhH-----hcC Confidence 99999999999999999975332 46666432211111 101112222 33344444442211 112 Q ss_pred c-cEEEecHHHHHhc----ccCCC-CCccHHHH-HHHhCCccEEEEcccccCCCCceEEEEEEeecCCceEEEEc--Chh Q lcl|Aclame:pro 228 V-LRMGLPPTAMSDL----SKTNQ-YGLAAAAK-LKDIFPKLEFVTIPEYDTASGRLVQLWAPRVEGKDTATCGF--TEK 298 (336) Q Consensus 228 p-~tL~Lp~~~~~~L----~~~~~-~~~Tvl~~-l~~n~pnl~i~~~pel~~a~G~~~~~~~~~~~~~~~~~~~~--p~~ 298 (336) + ...+|....+..+ ...+. .+...+.- -..++=++.++.+|.+-... .++.+ . +.+.+.+ ... T Consensus 197 ~~~v~im~~~~~~~~~~~l~~~~~~~~~~~l~~~~~~tl~G~pvv~~~~mP~~~----il~t~-~---~nl~~~~~~~~~ 268 (321) T protein:vir:31 197 MNPALIVSEDQLLSYHYTLTDRDTPLGDNVIMGEADVNPFSFPIIGSGLWPDDK----AMFTD-P---QNLIYALYRDLE 268 (321) T ss_pred CCeEEEechHHHHHHHHHHhcCCCccccchhhccccccccceeEEEcCCCCCCc----EEEec-c---ccEEEEEeeccE Confidence 2 3567887765432 22111 11122111 11123456677777664321 12211 1 1111111 112 Q ss_pred hhccc--cee--cCCceEEccccceeeeeeecccceeeeccC Q lcl|Aclame:pro 299 MRAHS--IER--YSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) Q Consensus 299 ~r~l~--~~~--~~~~~~vp~~~~t~Gv~ir~P~av~~~~GI 336 (336) ++... .+. +...+.--+ .+--|.+|..+-+++.+.|| T Consensus 269 ~~~~~~~~~~~~~~~~~~~~~-~~~~~~~ve~~~a~a~~~~i 309 (321) T protein:vir:31 269 IDVLTESDKVSERDLHARYFM-RGDDDFAIENTEAVVLAEGL 309 (321) T ss_pred EEEeecCccccccceeeEeee-eeecceeEeccccEEEEecC Confidence 22211 111 112222222 23366888999999999999 No 135 >protein:vir:99675 Length: 324 # NCBI annotation: Major capsid protein # Family: family:all:975 # MgeID: mge:1523 # MgeName: VP4 # Cross-refs: genbank:acc:YP_249589;genbank:gi:68299740;genbank:GeneID:3799990 Probab=43.31 E-value=0.85 Score=20.97 Aligned_cols=246 Identities=14% Similarity=-0.024 Sum_probs=100.8 Q ss_pred hcccccCCCcceeeEEEeeeecceeeEEeec--ccCC--ceeeeeeeeeeeeEEEEEEEEeeCHHHHHHHHhhCCCHHHH Q lcl|Aclame:pro 75 LVGESKKGDWTTLVAAFITAEPTTKVATYGD--YSSD--GDSGANINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASE 150 (336) Q Consensus 75 l~~v~t~g~w~~~t~~~~~~e~~G~a~~ygd--~~di--P~~~~~~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~ 150 (336) ++=--+.| .++.|+. .|++++..- +.++ +.-+...++....|=..- -++.-+.++-.+| +..++-++ T Consensus 1 ~vr~i~~g----~s~~~~~---iG~~~~~~~~~G~~l~~~~~~~~~~e~~itID~~l-~~~~~VdDiD~~q-a~~Dlr~e 71 (324) T protein:vir:99 1 MTRTITSG----KSAQFPV---MGRTKARYLKQGQSLDDGREDIKHTEKVITIDGLL-TTDVLIYDIEDAM-NHYDVRSE 71 (324) T ss_pred CeeeeecC----ceEEEee---eeeeEeccccCCCCcCCCcCCcCcccEEEEecchh-hhhhhhhhHHHHh-cCccchhH Confidence 11111222 4455543 577765531 2332 112222222111111100 1111223344443 44667777 Q ss_pred HHHHHHHHHHHhhcceEEe-----e--ccccceEEEEecCCCCcccccccccccccCHHHHHHHHHHHHHHHHHHhCCce Q lcl|Aclame:pro 151 LNYSSALGLAKFLNGSYLF-----G--VAGLENYGLINDPSLSAPITATTPWSGSPAVEAVVNEVVALFQVLQTQSQGII 223 (336) Q Consensus 151 k~~aAr~a~e~~~n~~~~~-----G--d~~~g~~GllN~Pnl~~~~~~~t~w~~~~t~~eI~~Di~~l~~~l~~~s~g~v 223 (336) -.+.+..++.+..|+..+. . .+.....+.............++.-....+++.+++-|.++-..|-.+. + T Consensus 72 ~s~~~G~aLA~~~Dq~i~~~~a~~~~~~a~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~dai~~a~~~Lde~~---V 148 (324) T protein:vir:99 72 YSTQMGEALAMAADVANYAEMAKLVNSRKETTNENIEGLGAASLVKITGKKEDPAKYGTQVIQALTYARAAFAKKY---I 148 (324) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhhcccccccCCcccCCccceecccccccccccCHHHHHHHHHHHHHHHhhcC---C Confidence 7777777777777754321 1 0111111111111111111122222234568889998888877775543 2 Q ss_pred ecccccEEEecHHHHHhcccCC-----CCCccHHHHHHHh---CCccEEEEcccccCCC---------CceEEEE----- Q lcl|Aclame:pro 224 TQEDVLRMGLPPTAMSDLSKTN-----QYGLAAAAKLKDI---FPKLEFVTIPEYDTAS---------GRLVQLW----- 281 (336) Q Consensus 224 ~~~~p~tL~Lp~~~~~~L~~~~-----~~~~Tvl~~l~~n---~pnl~i~~~pel~~a~---------G~~~~~~----- 281 (336) +...-.+++||..|..|.... .++ +...+-+.. .-+++|...+.|-..+ +..+.+= T Consensus 149 -P~~gR~~vv~P~~y~~Ll~~~~~~~~~~~-~~~~~~~G~V~~i~Gf~V~~Sn~lp~~~~t~~~~a~~~~~~~~~~~~~~ 226 (324) T protein:vir:99 149 -PAGDRTFYTDPDTYSAILAALMPNAANYA-ALIDPETGNIRNVMGFEVVETPHMTAQMVTNPTDAFDGTGHIFPATGDS 226 (324) T ss_pred -CCCCCEEEeChHHHHHHhhcccccccccc-cccceecceEEEEeceEEEecCCcccccccccccccccccccccccccc Confidence 345578999999999885321 111 111111111 1235666555553211 1111000 Q ss_pred ----EEeecCCceEEEE-----------cChhhhcccceecCCceEEccccceeeeeeecccceeeec-------cC Q lcl|Aclame:pro 282 ----APRVEGKDTATCG-----------FTEKMRAHSIERYSSYFRQKKSAGTWGAVIFRPFAVAQMI-------GV 336 (336) Q Consensus 282 ----~~~~~~~~~~~~~-----------~p~~~r~l~~~~~~~~~~vp~~~~t~Gv~ir~P~av~~~~-------GI 336 (336) -+..+...+.-+. ++.+......+.+ -.+-+...+.. |+.+.||-+++... |+ T Consensus 227 ~~~~ky~~d~~~~~gl~~~~~a~~tv~~~~~~~e~~~~~~~-~~d~i~~~~a~-G~~~lRPe~a~~v~l~~~~~~~~ 301 (324) T protein:vir:99 227 TTTGKMTVGADNVVGLFVHRSAVATLKLKDMALERARRPEY-QADQIIAKYAM-GHGGLRPEAVGAIIFEDGETPAV 301 (324) T ss_pred ccccccccccCceeEEEEehhheEEEeeecceecceechhh-HHHhhhhhhhh-cCcccccceEEEEEEccCccccc Confidence 0000001111111 1112222221121 23444444444 88888998775443 33 No 136 >protein:vir:99888 Length: 309 # NCBI annotation: capsid protein # Family: family:all:908 # MgeID: mge:1480 # MgeName: B3 # Cross-refs: genbank:acc:YP_164075;genbank:gi:56692607;genbank:GeneID:3192616 Probab=43.16 E-value=0.86 Score=20.96 Aligned_cols=266 Identities=10% Similarity=-0.005 Sum_probs=118.9 Q ss_pred hhhhhcccccccCcchHHHHHHHhhCceeeeeeccccchhhhcccccCCCcceeeEEEeeeecceeeE-EeecccCCcee Q lcl|Aclame:pro 34 DAADLSPHLSSTGSSGIPNYLTTYVDPAVIDILVAPMKAAELVGESKKGDWTTLVAAFITAEPTTKVA-TYGDYSSDGDS 112 (336) Q Consensus 34 da~d~~~~l~t~~~~~i~~~l~~~idp~v~~~~~~~~~~~~l~~v~t~g~w~~~t~~~~~~e~~G~a~-~ygd~~diP~~ 112 (336) .+ ..| -..++-+.++-+.|=+ +++-+++|||...++--.-...+|+-.|.-=... ..+-..+.-.+ T Consensus 1 ~~--~~~---~~~dp~LT~~A~gy~n--------~~~Ia~~l~P~vpV~~~~~~~~~f~~~e~F~~~~t~r~~~~~~~~v 67 (309) T protein:vir:99 1 MS--NAP---FPIDPELTAIAIAYRN--------GRMISDEVLPRVPVGKQEFKFWKYDLAQGFTVPETLVGRKSKPNEV 67 (309) T ss_pred CC--CCC---cCcCHhHHHHHhhccC--------hhhhhhhcCCccccCccccceeeechhhcccccchhhccCCCcceE Confidence 00 000 1123333344444433 3356788999876654333444443322110000 00112233345 Q ss_pred eeeeeeeeeeEEEEEEEEeeCHHHHHHHHhhCCCHHHHHHHHHHHHH----HHhhcceEEeeccccceEEEEecCCC-Cc Q lcl|Aclame:pro 113 GANINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASELNYSSALGL----AKFLNGSYLFGVAGLENYGLINDPSL-SA 187 (336) Q Consensus 113 ~~~~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~k~~aAr~a~----e~~~n~~~~~Gd~~~g~~GllN~Pnl-~~ 187 (336) +.........+...+.-..+...|...|. .++++.++....++..+ |...-++++--. |.|+= -. T Consensus 68 ~~~~~~~~~~~~~~~L~~~i~~~~~~~a~-~~~d~~~~Av~~l~~~i~l~rE~~~A~lv~~~a---------~y~~~~k~ 137 (309) T protein:vir:99 68 EFSATDETGSTEDHGLDAPVPQADIDNAP-TNYNPLGHATEQTTNLILLDREARTSKLVFSPN---------SYAAGNKT 137 (309) T ss_pred eecccCceeeecccceeecCCchhhhhcc-CCCCHHHHHHHHHHHHHHHHHHHHHHHHhcChh---------hcCCCceE Confidence 55555555555555555556666665553 35676666655444433 333334322111 21211 11 Q ss_pred ccccccccccccCHHHHHHHHHHHHHHHHHHhCCceecccccEEEecHHHHHhccc---------CCC--CCccHHHHHH Q lcl|Aclame:pro 188 PITATTPWSGSPAVEAVVNEVVALFQVLQTQSQGIITQEDVLRMGLPPTAMSDLSK---------TNQ--YGLAAAAKLK 256 (336) Q Consensus 188 ~~~~~t~w~~~~t~~eI~~Di~~l~~~l~~~s~g~v~~~~p~tL~Lp~~~~~~L~~---------~~~--~~~Tvl~~l~ 256 (336) ..+.+.+|.+. +.| ++.||.+....+ | -.|++++|..+.+.+|.+ .+. .+.=-.++|+ T Consensus 138 ~Lsgt~~wsd~-~SD-Pi~~i~~~~~~~-----g----~~PN~~vlg~~~~~~l~~hp~i~~~ik~~~~~~g~it~~~la 206 (309) T protein:vir:99 138 TLSGADQWSDP-TSN-PLPVITDALDSV-----I----LRPNIGVLGRRTATILRRHPKIVKAYNGSLGDEGMVPMAFLQ 206 (309) T ss_pred EecCccccCCC-CCC-cHHHHHHHHHhh-----C----CCcceEEechHHHHHHhhCHHHHHHhcCCCccccccCHHHHH Confidence 23445567653 344 889998887664 3 269999999999998853 111 1222256777 Q ss_pred HhCCccEEEE-cccccCC----C-------CceEEEEEEeecCCceEEEEcChhhhccc-ceecCCceEEccccceeeee Q lcl|Aclame:pro 257 DIFPKLEFVT-IPEYDTA----S-------GRLVQLWAPRVEGKDTATCGFTEKMRAHS-IERYSSYFRQKKSAGTWGAV 323 (336) Q Consensus 257 ~n~pnl~i~~-~pel~~a----~-------G~~~~~~~~~~~~~~~~~~~~p~~~r~l~-~~~~~~~~~vp~~~~t~Gv~ 323 (336) +-|-=-+|.- -.-+.++ + |+.+.|.+.... .++.+ -|.-=.... -.....+|..|....-||-. T Consensus 207 ~l~~ve~V~vg~a~~n~a~~g~~~~~~~iwg~~~~L~y~~~~-~~~~~--~ps~G~t~~~~~r~~g~~~d~~~~~~g~~~ 283 (309) T protein:vir:99 207 ELLELDAIYIGEARLNIARPGQNPNLIRAWGPHASFIYRDRL-ADTRN--GTTFGLTAQWGDRVSGSIADPNIGLRGGQR 283 (309) T ss_pred HHhCcceEEeecceeeccccccccccccccCCcEEEEEcCCC-CCCcc--cccccceeecccccCCceeeeeeccCCceE Confidence 6653112221 1111111 1 333333332211 11111 110000000 01222345666655555544 Q ss_pred ee-----cccceeeeccC Q lcl|Aclame:pro 324 IF-----RPFAVAQMIGV 336 (336) Q Consensus 324 ir-----~P~av~~~~GI 336 (336) || .|.-++.-.|. T Consensus 284 vr~~~~~k~~i~~~d~G~ 301 (309) T protein:vir:99 284 VRVGESVKELVTAPDLGF 301 (309) T ss_pred EEEeccccchhcchhcch Confidence 43 55556666665 No 137 >protein:vir:100884 Length: 389 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:1473 # MgeName: Lc-Nu # Cross-refs: genbank:acc:YP_358764;genbank:gi:78000028;genbank:GeneID:3726155 Probab=39.36 E-value=1 Score=20.53 Aligned_cols=289 Identities=9% Similarity=0.030 Sum_probs=122.0 Q ss_pred CchHHHH----HHh---hh-cceeccch---hhh--cc-----chhHHH--HH----hhhhhcccccccCcch--HHHHH Q lcl|Aclame:pro 1 MRDAQRI----QNL---AR-AGVILPRS---VQN--VS-----TPLTEY--AM----DAADLSPHLSSTGSSG--IPNYL 54 (336) Q Consensus 1 ~~~~~~~----~~l---~~-~g~~~~~~---~~~--~~-----~~~~~~--a~----da~d~~~~l~t~~~~~--i~~~l 54 (336) ....+.+ ..+ +. .....+.. ... .. ...+.. .+ ....+.. ..+.+++| +|..+ T Consensus 47 ~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lr~~~~~~~~~~-~~t~~~gg~~vP~~~ 125 (389) T protein:vir:10 47 KARRDAINDQIKALEAEKPAEPKTEPKDDGSKKGTDLSKKPIDAKKKAINDFIHSHGKVIDATS-KVTSTEAGVLIPEEI 125 (389) T ss_pred HHHHHHHHHHHHHHHHHHHhhhhccccccccccccccchhHHHHHHHHHHHHhhcchhhhhhhc-ccccCCcceeehHHH Confidence 0000000 000 00 00000000 000 00 000000 00 0001111 13334443 67665 Q ss_pred HHhhCceeeeeeccccchhhhcccccCCCcceeeEEEeeeec-ceeeEEeecccCCc-eeeeeeeeeeeeEEEEEEEEee Q lcl|Aclame:pro 55 TTYVDPAVIDILVAPMKAAELVGESKKGDWTTLVAAFITAEP-TTKVATYGDYSSDG-DSGANINYPQRQSYFFQTWTRW 132 (336) Q Consensus 55 ~~~idp~v~~~~~~~~~~~~l~~v~t~g~w~~~t~~~~~~e~-~G~a~~ygd~~diP-~~~~~~~~~~~~v~~~~~~~~y 132 (336) .+ .|++.+........++++..... .+..|++... .+.+...+....+| ..+.......-.++.++..+.+ T Consensus 126 ~~----~i~~~~~~~~~l~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~~~~i~~~~~k~~~~~~i 198 (389) T protein:vir:10 126 IY----DPTAEVNSVVDLSTLVTKTPVTT---PKGTYPILKRATDRFSSVAELAENPKLAEPEFNKVDWSVATYRGAIPL 198 (389) T ss_pred HH----HHHHHHHhhhhHHhhcceeeccC---CeeEEEEEecCCCccccccccccccccccccceeeeeeheeeEeeehh Confidence 54 44555555555555555433221 2234554443 44445667777776 4566777778888888888888 Q ss_pred CHHHHHHHHhhCCCHHHHHHHHHHHHHHHhhcceEEeeccccceEEEEecCCCCcccccccccccccCHHHHHHHHHHHH Q lcl|Aclame:pro 133 GERELEMAGAGRVDLASELNYSSALGLAKFLNGSYLFGVAGLENYGLINDPSLSAPITATTPWSGSPAVEAVVNEVVALF 212 (336) Q Consensus 133 ~~~El~~A~~~g~~l~~~k~~aAr~a~e~~~n~~~~~Gd~~~g~~GllN~Pnl~~~~~~~t~w~~~~t~~eI~~Di~~l~ 212 (336) |.+=++.+ ..++.+.-....++++...+|.-++.|.+... +..+ ....+ ++|+..++ T Consensus 199 S~ell~ds---~~~l~~~i~~~la~~~~~~~~~~i~~g~~~~~----------~~~~------~~~~~----~d~l~~~~ 255 (389) T protein:vir:10 199 SEEAIADS---AVDLTALVGQSIKEKSVNTYNAMIAPVLQSFT----------AKKT------TTDTL----VDSLKHIL 255 (389) T ss_pred hHHHHhhh---hHHHHHHHHHHHHHHHHHHHHHHHhhhhcccc----------cccc------ccccc----HHHHHHHH Confidence 87655543 34677777777788888888776665554311 1000 11123 34444444 Q ss_pred HHHHHHhCCceecccccEEEecHHHHHhccc-CCCCCccHHH-----HHHHhC----CccEEEEcc--cccCCCCceEEE Q lcl|Aclame:pro 213 QVLQTQSQGIITQEDVLRMGLPPTAMSDLSK-TNQYGLAAAA-----KLKDIF----PKLEFVTIP--EYDTASGRLVQL 280 (336) Q Consensus 213 ~~l~~~s~g~v~~~~p~tL~Lp~~~~~~L~~-~~~~~~Tvl~-----~l~~n~----pnl~i~~~p--el~~a~G~~~~~ 280 (336) +..... . ....++|.++.+..|.+ .+..|.-++. -..... -++.++..+ .+.+++|....+ T Consensus 256 ~~~~~~-~------~~a~~~~n~~~~~~L~~lkd~~G~~i~~~~~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~ 328 (389) T protein:vir:10 256 NVDLDP-A------YSRALVVTQSLFNTLDTLKDKNGRYLLHDASDSITDGTAKGTILGVPVYVVGDTLLGSLAGDQKAF 328 (389) T ss_pred Hhhhhh-h------hCcEEEecHHHHHHHHHhhccCCCeeeecCcccccccccccccccceeEEecccccCCCCCceEEE Confidence 321111 1 12368999999888864 3333332211 000011 122232221 222333443333 Q ss_pred EEEeec-----CCceEEEEcChhhhcccceecCCceEEccccceeeeeeecccceeeec--cC Q lcl|Aclame:pro 281 WAPRVE-----GKDTATCGFTEKMRAHSIERYSSYFRQKKSAGTWGAVIFRPFAVAQMI--GV 336 (336) Q Consensus 281 ~~~~~~-----~~~~~~~~~p~~~r~l~~~~~~~~~~vp~~~~t~Gv~ir~P~av~~~~--GI 336 (336) |.+-.+ ...-..+.+.. +.. -....-...|.+|. +..|-||+.+. .. T Consensus 329 ~gd~~~~~~~~~~~~~~i~~~~----~~~----~~~~~~~~~r~d~~-~~~~~a~~~~~~~~~ 382 (389) T protein:vir:10 329 VGDLKRGVLFTDRQQVTLAWED----SKI----YGKYLGAAFRFGVQ-KADSKAGYFVTNTDV 382 (389) T ss_pred EeeccccEEEEeecceEEEeec----ccc----ccceEEEEEEeccE-EecccceEEEEeecc Confidence 332100 00111222111 110 01122234566655 56788877554 44 No 138 >protein:vir:80213 Length: 334 # NCBI annotation: capsid protein # Family: family:all:2806 # MgeID: mge:1879 # MgeName: LKA1 # Cross-refs: genbank:acc:YP_001522884;genbank:gi:158345177;genbank:GeneID:5687476 Probab=34.03 E-value=1.3 Score=19.93 Aligned_cols=292 Identities=11% Similarity=0.038 Sum_probs=105.8 Q ss_pred ccchhhhccchhHHHHHhhhhhcccccccCcchHHHHHHHhhCceeeeeeccccchhhhcccccCCCcceeeEEEeeeec Q lcl|Aclame:pro 17 LPRSVQNVSTPLTEYAMDAADLSPHLSSTGSSGIPNYLTTYVDPAVIDILVAPMKAAELVGESKKGDWTTLVAAFITAEP 96 (336) Q Consensus 17 ~~~~~~~~~~~~~~~a~da~d~~~~l~t~~~~~i~~~l~~~idp~v~~~~~~~~~~~~l~~v~t~g~w~~~t~~~~~~e~ 96 (336) +. ++.. .....|+.. ++++-.-=|+-.|- -+|.......-..+.++.+.+.-. -.++.|+. T Consensus 1 m~----~~~~--------~~~t~~~~~-~~~~~~~l~le~~~-geV~~af~~~s~~~~~~~~r~i~~--G~s~~~~~--- 61 (334) T protein:vir:80 1 MT----YPAA--------NTHTRPGWG-GANSDVSLHIEEHL-GLVDASFMYSSKFASWMNVRSLRG--TNQLRVDR--- 61 (334) T ss_pred CC----CCcC--------CCccccccc-cccchheehhhhhh-hHHHHHHHHhhhhhccceeeeccc--cceEEEee--- Confidence 10 0000 011122221 11110000111111 111001111122334444433211 25666664 Q ss_pred ceeeEEee--cccCCceeeeeeeeeeeeEEEEEEEEeeCHHHHHHHHhhCCCHHHHHHHHHHHHHHHhhcceEEe----e Q lcl|Aclame:pro 97 TTKVATYG--DYSSDGDSGANINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASELNYSSALGLAKFLNGSYLF----G 170 (336) Q Consensus 97 ~G~a~~yg--d~~diP~~~~~~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~k~~aAr~a~e~~~n~~~~~----G 170 (336) .|++++.. -+..+.--....++....|-.. .-++.-+.++-.++. ..++-++-.+.+..++.++.|+..+. | T Consensus 62 iG~~~~~~~~~g~~l~~~~~~~~~~~l~ID~~-l~~~~~VddiD~~q~-~~D~rse~~~~~G~aLA~~~D~~~~~~l~ka 139 (334) T protein:vir:80 62 VGASTIAGRKAGEELVVQKNVSDKLNLTVDTV-LYARHFFDKFDEWTS-NLDVRKETAREDGIALARQYDQACIIQLQKC 139 (334) T ss_pred ecceeeeeecCCCCCCCCCcccCceEEEEeee-eehhhhHhhHHHHhc-CcchHHHHHHHHHHHHHHHHHHHHHHHHHHh Confidence 57776543 2333311111111111111110 112222344444432 33455555555555555555543221 1 Q ss_pred c---cccceEEEEecCCCCcccccccccccccCHHHHHHHHHHHHHHHHHHhCCceecccccEEEecHHHHHhcccC--- Q lcl|Aclame:pro 171 V---AGLENYGLINDPSLSAPITATTPWSGSPAVEAVVNEVVALFQVLQTQSQGIITQEDVLRMGLPPTAMSDLSKT--- 244 (336) Q Consensus 171 d---~~~g~~GllN~Pnl~~~~~~~t~w~~~~t~~eI~~Di~~l~~~l~~~s~g~v~~~~p~tL~Lp~~~~~~L~~~--- 244 (336) - +......-+++..........++-...++++.+++=+..+...+..+.--. .+...-.++++|.+|..|-.- T Consensus 140 a~~~~~~~~~~~~~~G~~~~~~~~g~~~~~~~~~~~l~~a~~~a~~~L~e~dvp~-~~~~~R~~vv~P~~y~~Ll~~~r~ 218 (334) T protein:vir:80 140 GDFLAPAHLKPAFHDGILLPSTISGLAADAAADADVLVAAHRQGVEAMVFRDLGD-QLMSEGVTLLDPVIFSFLLEHDRL 218 (334) T ss_pred hhhcccccccccccCCcceeecccccccchhhhHHHHHHHHHHHHHHHHhcCCCC-CcCCceEEEeChHHHHHHhccccc Confidence 1 011111111111110000011111224568888888877777776653210 001346899999999988532 Q ss_pred --CCCCc--cHHHHHHH---hCCccEEEEcccccCCC------CceEEEEEEeecCCceEEEEc---ChhhhcccceecC Q lcl|Aclame:pro 245 --NQYGL--AAAAKLKD---IFPKLEFVTIPEYDTAS------GRLVQLWAPRVEGKDTATCGF---TEKMRAHSIERYS 308 (336) Q Consensus 245 --~~~~~--Tvl~~l~~---n~pnl~i~~~pel~~a~------G~~~~~~~~~~~~~~~~~~~~---p~~~r~l~~~~~~ 308 (336) .+++. +...+-+. +.-+++|+..+.|-+.+ |+..-.+. ++.+..+.+ |+..- -++... T Consensus 219 ~n~d~~~s~~~~~~~~g~i~~v~G~~V~~Sn~~P~~~~t~~~~g~~~~~~a----gd~t~~~~~~~~~~Al~--t~~~~~ 292 (334) T protein:vir:80 219 MNVEFGAKEGGNSFVGGRIAMLNGVRVVETPRFPQSAITANALGADFNVTD----AEVRRKMITFIPSMALI--SAQVHP 292 (334) T ss_pred ccceeccccccccccceeEEEEeceEEEeecCCCCcccccccccccccccc----ccccceEEEEEeCceEE--EEEEee Confidence 12211 11112211 12235666655553211 21111110 110000000 01000 011111 Q ss_pred ---CceEEcccc-------ceeeeeeecc--cceeeeccC Q lcl|Aclame:pro 309 ---SYFRQKKSA-------GTWGAVIFRP--FAVAQMIGV 336 (336) Q Consensus 309 ---~~~~vp~~~-------~t~Gv~ir~P--~av~~~~GI 336 (336) ..|..+.+. -..|+-++|| .++..++++ T Consensus 293 ~~~e~~~~~~~~~d~i~~~~a~G~g~lRPeaa~vv~~~~~ 332 (334) T protein:vir:80 293 VSAQFWEEKKDFGHYLDTFQSYNIGQRRPDAVAVHDITVT 332 (334) T ss_pred cceeeeechhhHHHHHHHHHHcCCceeccceEEEEEEeee Confidence 112222222 2679999999 677888898 No 139 >protein:vir:79928 Length: 393 # NCBI annotation: major head protein # Family: family:all:30335 # MgeID: mge:1874 # MgeName: 0305phi8-36 # Cross-refs: genbank:acc:YP_001429616;genbank:gi:156564106;genbank:GeneID:5525693 Probab=33.69 E-value=1.3 Score=19.89 Aligned_cols=311 Identities=10% Similarity=0.097 Sum_probs=146.6 Q ss_pred CchHHHHHHhhhcceeccchhhhccchhH-HHHHhhhhhc----ccc-cccCcchHHHHHHHhhCceeeeeeccccchhh Q lcl|Aclame:pro 1 MRDAQRIQNLARAGVILPRSVQNVSTPLT-EYAMDAADLS----PHL-SSTGSSGIPNYLTTYVDPAVIDILVAPMKAAE 74 (336) Q Consensus 1 ~~~~~~~~~l~~~g~~~~~~~~~~~~~~~-~~a~da~d~~----~~l-~t~~~~~i~~~l~~~idp~v~~~~~~~~~~~~ 74 (336) |...+.+++..-.-+.++.....|---.. .|+.|---.+ -.| +..++-.||..+.+-+ -+.-|.+|-..+ T Consensus 28 me~~et~~e~~~~~~~~~~~e~el~E~f~Kmm~G~~p~~eV~~~e~mtt~~a~IliP~vis~v~-~Eaaepl~~~~k--- 103 (393) T protein:vir:79 28 MERGETLAEADANKLALNEEETQILESFAKMMEGETPTNEVNLREFMATPSAQILIPRVIVGTM-REAAEPLYIGTK--- 103 (393) T ss_pred hhhhhhhhhhhhhhhhcchhHHHHHHHHHHHhcCCCchhheehhhhhcCCCcceechhhhhhhh-hhcccchhHHHH--- Confidence 33322222222111222111111100011 1111110000 001 1223444555555433 122223332222 Q ss_pred hccccc--CCCcceeeEEEeeeecceeeEEeecccCCceeeee---eeeeeeeEEEEEEEEeeCHHHHHHHHhhCCCHHH Q lcl|Aclame:pro 75 LVGESK--KGDWTTLVAAFITAEPTTKVATYGDYSSDGDSGAN---INYPQRQSYFFQTWTRWGERELEMAGAGRVDLAS 149 (336) Q Consensus 75 l~~v~t--~g~w~~~t~~~~~~e~~G~a~~ygd~~diP~~~~~---~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~ 149 (336) +|-..+ .| ++..|.-+- +=.+.-+|++..+|-.+.. -++...++-++++.++||.+=+.. .|+++.. T Consensus 104 l~qk~~L~~G----rsm~F~~~g-~~Ra~~IgEGgE~~~~sld~~T~dsv~~~~gK~G~~Ia~SqEmIsD---Sg~Dvin 175 (393) T protein:vir:79 104 MLQKIRLKSG----QSMIFPSIG-IMRAYDVAEGQEIPEDSIDWQTHESPEIRVGKSGIRLRFTDEMISD---SQWDLMS 175 (393) T ss_pred HHHHHhhhcC----cceeccchh-eeeeccccccccccccchhhhcCCceeEEechhhhhhhhHHHHhhc---chHHHHH Confidence 222211 12 122222111 1233445666666555443 445556677788888898766655 4678888 Q ss_pred HHHHHHHHHHHHhhcceEEeeccccceEEEEecCCCCcccccc----cccccccCHHHHHHHHHHHHHHHHHHhCCceec Q lcl|Aclame:pro 150 ELNYSSALGLAKFLNGSYLFGVAGLENYGLINDPSLSAPITAT----TPWSGSPAVEAVVNEVVALFQVLQTQSQGIITQ 225 (336) Q Consensus 150 ~k~~aAr~a~e~~~n~~~~~Gd~~~g~~GllN~Pnl~~~~~~~----t~w~~~~t~~eI~~Di~~l~~~l~~~s~g~v~~ 225 (336) ---.+|-|++.++....++.++..++-.-|=+-|.-+..-... +.-.++- -++||.++.-++ ++.+. T Consensus 176 ~~l~aA~RaMaRkKee~a~n~fk~~ghtvfDa~st~t~ahptGr~~~~~qNGTl----SleDllDm~~av--~~~hy--- 246 (393) T protein:vir:79 176 MMIKQAGRAMGRHKEQKAYHQFRSHGHTVFDNYSTNKLAHTTGLDKNGVQNDTF----SAEDFLDLIIAV--MANEY--- 246 (393) T ss_pred HHHHHHHHHHHhhhHHHHHhhhhcccceeeeccccCccceeecCCccccccccc----cHHHHHHHHHHH--hcccC--- Confidence 8889999999999999999999988865554432222111111 1111122 356666665555 34322 Q ss_pred ccccEEEecHHHHHhccc--------CCCCC-----------ccHHHHHHHhCC-ccEEEEccc--ccCCCCceEEEEEE Q lcl|Aclame:pro 226 EDVLRMGLPPTAMSDLSK--------TNQYG-----------LAAAAKLKDIFP-KLEFVTIPE--YDTASGRLVQLWAP 283 (336) Q Consensus 226 ~~p~tL~Lp~~~~~~L~~--------~~~~~-----------~Tvl~~l~~n~p-nl~i~~~pe--l~~a~G~~~~~~~~ 283 (336) .|.+|.|-|-+++.+.+ .|.+| .=+-+.|+...| |+.|.-.|- |... ....+.+. T Consensus 247 -t~svi~MHPLAWnv~AKna~me~~~~na~gN~~~~~~~ts~algp~~i~~~~~~nlnv~~sPfvp~d~k-~~rFd~~~- 323 (393) T protein:vir:79 247 -TPSDLMMHPLAWTVFAKNELMGSLQANPYGNYPAKGAPSSMALGPDSIQGRLPFNFNVNLSPFIPLDKK-SRRFDVYA- 323 (393) T ss_pred -CcceEEEcCchhhhhhhhhhhcceeeccccccCccccchhhhhchhhhccccccceeEEEecccccccc-cceeeEEE- Confidence 58899999988887643 12222 223344544444 455554443 3333 22333333 Q ss_pred eecCCceEEEEcChhhhcccceecCC-ceEEccccceeeeeeecccceeeeccC Q lcl|Aclame:pro 284 RVEGKDTATCGFTEKMRAHSIERYSS-YFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) Q Consensus 284 ~~~~~~~~~~~~p~~~r~l~~~~~~~-~~~vp~~~~t~Gv~ir~P~av~~~~GI 336 (336) ++.+.++-+.+-..+..-.-+-+.- --+++..+|+|=-++--=.+|+-...| T Consensus 324 -Vd~NnvgvlLV~D~i~tdq~ddk~rdiq~iKl~ERYG~gvLn~gkaiavakNI 376 (393) T protein:vir:79 324 -VDRNNVGVLLVRDDLKTDQWDEKARGLQNIKMIERYGIGILNEGKAIAVAKNI 376 (393) T ss_pred -eecCCceEEEEecCcceeccccccccceeeeeeeeeceeeeeCCceEEEEecc Confidence 3345555554433222221111111 245667777765577777777777777 No 140 >protein:vir:105645 Length: 400 # NCBI annotation: putative major capsid protein # Family: family:all:2806 # MgeID: mge:1674 # MgeName: K1E # Cross-refs: genbank:acc:YP_425009;genbank:gi:83571757;uniprot:Q2WC43;genbank:GeneID:3837286 Probab=31.93 E-value=1.5 Score=19.68 Aligned_cols=287 Identities=9% Similarity=-0.079 Sum_probs=114.3 Q ss_pred HHHhhhhhcccccccCcchHHHHHHHhhCceeeeeec-cccchhhhcccccCCCcceeeEEEeeeecceeeEEeec--cc Q lcl|Aclame:pro 31 YAMDAADLSPHLSSTGSSGIPNYLTTYVDPAVIDILV-APMKAAELVGESKKGDWTTLVAAFITAEPTTKVATYGD--YS 107 (336) Q Consensus 31 ~a~da~d~~~~l~t~~~~~i~~~l~~~idp~v~~~~~-~~~~~~~l~~v~t~g~w~~~t~~~~~~e~~G~a~~ygd--~~ 107 (336) |.--..-..|+.+ +++.-.-=||--|- +.+..-| ..-....++.+.|.- ...++.|+. .|+.+..+- +. T Consensus 1 Ms~~n~~t~p~~~-gsg~~~aL~Le~f~--GeV~taF~~~si~~~~~~vRtI~--~gkS~qf~~---lG~s~a~y~~pG~ 72 (400) T protein:vir:10 1 MSTPNNLTNVAVS-ASGEVDSLLIEKFN--GKVNEQYLKGENIMSYFDVQTVT--GTNTVSNKY---LGETELQVLAPGQ 72 (400) T ss_pred CCCCccccccccc-cccchhhhHHhHhc--chHHHHHHHHhhhcccceeeeec--ccceEEEEE---eeeeEEeeecCCC Confidence 1000011122221 11111111333222 1111111 112234555554422 125566654 587776542 22 Q ss_pred CCceeeeeeeeee---eeEEEEEEEEeeCHHHHHHHHh-hCCCHHHHHHHHHHHHHHHhhcceE-Eeecc----ccceEE Q lcl|Aclame:pro 108 SDGDSGANINYPQ---RQSYFFQTWTRWGERELEMAGA-GRVDLASELNYSSALGLAKFLNGSY-LFGVA----GLENYG 178 (336) Q Consensus 108 diP~~~~~~~~~~---~~v~~~~~~~~y~~~El~~A~~-~g~~l~~~k~~aAr~a~e~~~n~~~-~~Gd~----~~g~~G 178 (336) .+ ...-..+.+. ..--.+.-.+=|.++|.+.-=- .+-.+..+-..+-++.+++++=+.+ .-|.+ ..++.| T Consensus 73 ~l-dg~~~~~dk~~ItIDtLL~a~~~V~dlDd~q~~yD~vRse~s~e~G~ALA~~~Dq~iiq~i~~a~~a~t~~~~~~~~ 151 (400) T protein:vir:10 73 SP-AATSTQADKNQLVIDATVIARNTVAHLHDVQGDIDSLKPKLATNQAKQLKKMEDEMLIQQMLLGGIANTQAKRTNPR 151 (400) T ss_pred Cc-CCCCcccCcEEEEeCceeeecchhhhHHHHhhccccccHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccCC Confidence 22 2221122211 1112233344455555433221 3434444444444555554332212 22211 122223 Q ss_pred EEecCCCCcccccccccccccCHHHHHHHHHHHHHHHHHHhCCceecccccEEEecHHHHHhcccC-----CCCCccH-H Q lcl|Aclame:pro 179 LINDPSLSAPITATTPWSGSPAVEAVVNEVVALFQVLQTQSQGIITQEDVLRMGLPPTAMSDLSKT-----NQYGLAA-A 252 (336) Q Consensus 179 llN~Pnl~~~~~~~t~w~~~~t~~eI~~Di~~l~~~l~~~s~g~v~~~~p~tL~Lp~~~~~~L~~~-----~~~~~Tv-l 252 (336) ..-++... .++.+ ...+.++++++.+-+..+..++.. ..+. .....+++||..|..|... ++++.|- . T Consensus 152 g~~~g~s~-~v~~~-~~~~~~~~~~l~~A~~~A~~~LdE---kdVP-~~d~vvl~pp~~Ys~Ll~~dkLvnrdf~~s~~g 225 (400) T protein:vir:10 152 VKGHGFSV-NVEVN-EGEALVNPQYVMAAVEFALEQQLE---QEVD-ISDVAILMPWRYFNVLRDADRIVDKSYTISQSG 225 (400) T ss_pred ccccccce-eeccc-ccccccCHHHHHHHHHHHHHHHHh---cCCC-ccceEEEcCHHHHHHHHhCCcccchhccccCCC Confidence 33222211 11112 222335789999988888887743 3343 3356889999999877432 2333221 2 Q ss_pred HHHHHh---CCccEEEEcccccCCC-------------C---------ceEEEEEEeecCCceEEEE-cChhhhccccee Q lcl|Aclame:pro 253 AKLKDI---FPKLEFVTIPEYDTAS-------------G---------RLVQLWAPRVEGKDTATCG-FTEKMRAHSIER 306 (336) Q Consensus 253 ~~l~~n---~pnl~i~~~pel~~a~-------------G---------~~~~~~~~~~~~~~~~~~~-~p~~~r~l~~~~ 306 (336) ++.+.. .-+++|+..+.|-+.+ | .+..+++-.. ..++... +|.-.+. --+. T Consensus 226 ~~~~g~v~~v~Gv~Iv~Sn~lP~~a~~~~~~~lS~a~~G~~y~~t~d~s~~~av~F~~--sAv~tvk~~~lt~~~-~~d~ 302 (400) T protein:vir:10 226 ATIQGFVLSSYNCPVIPSNRFPKYSQGQKHHLLSNEDNGYRYDPIAEMNGAIAVLFTA--DALLVGRSIDVIGDI-FYEK 302 (400) T ss_pred ccccceEEEEeceEEEeeCcCCcccCcccccccccCCCCccCCccccccceeEEEEeh--hheEEEEeecccccc-ccch Confidence 233222 3356777666663211 1 1122222111 1111100 0100000 0122 Q ss_pred cCCceEEccccceeeeeeecccceeeeccC Q lcl|Aclame:pro 307 YSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) Q Consensus 307 ~~~~~~vp~~~~t~Gv~ir~P~av~~~~GI 336 (336) +...|.+.|.+ ..|+..+||-|+.-+.=- T Consensus 303 r~~~~~id~~~-a~G~g~~RPeaa~vv~~~ 331 (400) T protein:vir:10 303 KEKTYYIDTFM-SEGAIPDRWEAVSVVTTK 331 (400) T ss_pred hhHHHHHHHHH-HhCCcccchhheEEEEec Confidence 23344444433 468999999888765433 No 141 >protein:vir:7990 Length: 273 # NCBI annotation: gp6 # Family: family:all:2203 # MgeID: mge:151 # MgeName: Che8 # Cross-refs: genbank:acc:NP_817344;genbank:gi:29565772;genbank:GeneID:1258978 Probab=31.85 E-value=1.5 Score=19.67 Aligned_cols=259 Identities=10% Similarity=0.018 Sum_probs=108.0 Q ss_pred ccccCcchHHHHHHHhhCceeeeeeccccchhhhcccccC--CCcceeeEEEeeeecceeeEEeecccCCceeeeeeeee Q lcl|Aclame:pro 42 LSSTGSSGIPNYLTTYVDPAVIDILVAPMKAAELVGESKK--GDWTTLVAAFITAEPTTKVATYGDYSSDGDSGANINYP 119 (336) Q Consensus 42 l~t~~~~~i~~~l~~~idp~v~~~~~~~~~~~~l~~v~t~--g~w~~~t~~~~~~e~~G~a~~ygd~~diP~~~~~~~~~ 119 (336) |.. +.-+|+.+...|. +.+...+....|+....+ +.-+ .|++++..-..+.+.--+....++.-+.+..+. T Consensus 1 MA~--~~~~pei~~~~v~----~~~~~~lv~~~l~~~~~~~~~~~G-dTv~ip~~~~~~~~d~~~~~~~~~~~~~~~~~~ 73 (273) T protein:vir:79 1 MAF--NNFIPELWSDMLL----EEWTAQTVFANLVNREYEGIASKG-NVVHIAGVVAPTVKDYKAAGRQTSADAISDTGV 73 (273) T ss_pred Ccc--hhhhHHHHHHHHH----HHHHhhccchhhhhccccccccCC-cEEEEeecCcccccccccCCCccCccccccceE Confidence 111 1224555554332 222333334444322211 1111 478888865544333223333344445555556 Q ss_pred eeeEEE-EEEEEeeCHHHHHHHHhhCCCHHHHHHHHHHHHHHHhhcceEEeeccccceEEEEecCCCCcccccccccccc Q lcl|Aclame:pro 120 QRQSYF-FQTWTRWGERELEMAGAGRVDLASELNYSSALGLAKFLNGSYLFGVAGLENYGLINDPSLSAPITATTPWSGS 198 (336) Q Consensus 120 ~~~v~~-~~~~~~y~~~El~~A~~~g~~l~~~k~~aAr~a~e~~~n~~~~~Gd~~~g~~GllN~Pnl~~~~~~~t~w~~~ 198 (336) ..++-. -..++.++..|...+ ..++.. -...+..++.+.+++..+ + ++-.-... .+.. .. T Consensus 74 ~~tid~~~~~~~~i~d~d~~~~---~~~~~~-~~~~~~~ala~~vD~~i~-~--------~~~~a~~~--~~~~----~~ 134 (273) T protein:vir:79 74 DLLIDQEKSIDFLVDDIDRVQV---AGSLEA-YTRAGATALATDTDKFIA-D--------MLVDNGTA--LTGS----AP 134 (273) T ss_pred EEEEeeecccceeeccHHHHhh---cccHHH-HHHHHHHHHHHHHHHHHH-H--------HHhhcccc--cccc----cc Confidence 666644 344666665554332 335643 333445566666554211 0 11000000 0001 12 Q ss_pred cCHHHHHHHHHHHHHHHHHHhCCceecccccEEEecHHHHHhcccCCC-C---CccH-HHHHHH----hCCccEEEEccc Q lcl|Aclame:pro 199 PAVEAVVNEVVALFQVLQTQSQGIITQEDVLRMGLPPTAMSDLSKTNQ-Y---GLAA-AAKLKD----IFPKLEFVTIPE 269 (336) Q Consensus 199 ~t~~eI~~Di~~l~~~l~~~s~g~v~~~~p~tL~Lp~~~~~~L~~~~~-~---~~Tv-l~~l~~----n~pnl~i~~~pe 269 (336) .+++.+++.|.++...+-.. + + +...-.|+++|..+..|.+..+ . ...- ..-|++ +.-+++|..... T Consensus 135 ~~~~~~~~~i~~a~~~ld~~--~-v-P~~~R~lvv~p~~~~~Ll~~~~~~~~~~~~~~~~~l~~G~ig~~~G~~i~~s~~ 210 (273) T protein:vir:79 135 SDADDAFDLIASALKELTKA--N-V-PNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNN 210 (273) T ss_pred cchhhHHHHHHHHHHHhhhc--c-C-CccCcEEEECHHHHHHHhhchhhhhhhhhcccccceeeeEeeEEeceEEEeccc Confidence 34566777777776665332 1 1 2233479999999887643211 0 0000 001111 122455665555 Q ss_pred ccCCCCceEEEEEEeecCCceEEEEcChhhhcccceecCCceEEccccceeeeeeecccceeeeccC Q lcl|Aclame:pro 270 YDTASGRLVQLWAPRVEGKDTATCGFTEKMRAHSIERYSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) Q Consensus 270 l~~a~G~~~~~~~~~~~~~~~~~~~~p~~~r~l~~~~~~~~~~vp~~~~t~Gv~ir~P~av~~~~GI 336 (336) +...++.....+... .--...+ + ..+..+..+.+. .-.+... -..|+-+.+|-+++.+.== T Consensus 211 lp~~~~~~~~a~~~~--A~~~a~~-~-~~~e~~r~~~~~-~~~v~~~-~~yg~~v~~p~~vv~~~~~ 271 (273) T protein:vir:79 211 LRDTDDEQFVAFHPS--AAAYVSQ-I-DTVEALRDQDSF-SDRIRAL-HVYGGKVVRPTGVVVFNKT 271 (273) T ss_pred ccccCceEEEEEecc--ceeeeee-h-hhhhcccCcccc-eeeeeee-eeeeeEEecCceEEEEecc Confidence 543333333222211 1000110 1 012222222222 2222222 2467777788887775433 No 142 >protein:vir:105822 Length: 273 # NCBI annotation: gp6 # Family: family:all:2203 # MgeID: mge:1636 # MgeName: PMC # Cross-refs: genbank:acc:YP_655767;genbank:gi:109522090;genbank:GeneID:4157630 Probab=25.59 E-value=2 Score=18.89 Aligned_cols=256 Identities=11% Similarity=-0.009 Sum_probs=104.2 Q ss_pred ccccCcchHHHHHHHhhCceeeeeeccccchhhhccccc--CCCcceeeEEEeeeecceeeEEeecccCCceeeeeeeee Q lcl|Aclame:pro 42 LSSTGSSGIPNYLTTYVDPAVIDILVAPMKAAELVGESK--KGDWTTLVAAFITAEPTTKVATYGDYSSDGDSGANINYP 119 (336) Q Consensus 42 l~t~~~~~i~~~l~~~idp~v~~~~~~~~~~~~l~~v~t--~g~w~~~t~~~~~~e~~G~a~~ygd~~diP~~~~~~~~~ 119 (336) |. -+.-+|..+..-|... .........|+.... ++..+ .++.++..-..+.+.--+....++.-+.+..+. T Consensus 1 MA--~~~~~pe~~~~~v~~~----~~~~lv~~~l~~~~~~~~~~~G-dtv~ip~~~~~~~~d~~~~~~~~~~~~~~~~~~ 73 (273) T protein:vir:10 1 MA--FNNFIPELWSDMLLEE----WTAQTVFANLVNREYEGTASKG-NVVHIAGVVAPTVKDYKAAGRQTSADAISDTGV 73 (273) T ss_pred Cc--chhhhHHHHHHHHHHH----HHhhhccchhhccccccccccC-ceEEEeecccccccccccCCCccCccccccceE Confidence 11 1222455444433222 233333444544322 23232 578888755444322111112222333333444 Q ss_pred eeeEEE-EEEEEeeCHHHHHHHHhhCCCHHHHHHHHHHHHHHHhhcceEEeeccccceEEEEecCCCCcccccccccccc Q lcl|Aclame:pro 120 QRQSYF-FQTWTRWGERELEMAGAGRVDLASELNYSSALGLAKFLNGSYLFGVAGLENYGLINDPSLSAPITATTPWSGS 198 (336) Q Consensus 120 ~~~v~~-~~~~~~y~~~El~~A~~~g~~l~~~k~~aAr~a~e~~~n~~~~~Gd~~~g~~GllN~Pnl~~~~~~~t~w~~~ 198 (336) ..++-. -..++.++..|...+ . .++.+ -...+..++.+.+++..+ + . +.+- ... .+ . ... T Consensus 74 ~~tid~~~~~~~~i~d~d~~~~--~-~~~~~-~~~~~~~alA~~vD~~i~-~--~--~~~a----~~~--~~-~---~~~ 134 (273) T protein:vir:10 74 DLLIDQEKSIDFLVDDIDRVQV--A-GSLEA-YTRAGATALATDTDKFIA-D--M--LVDN----GTA--LT-G---SAP 134 (273) T ss_pred EEEEeeeeecceEeecHHHhhh--h-ccHHH-HHHHHHHHHHHHHHHHHH-H--H--Hhcc----ccc--cc-c---ccc Confidence 444422 244555554443322 2 24433 233344455555553221 0 0 0000 000 00 0 112 Q ss_pred cCHHHHHHHHHHHHHHHHHHhCCceecccccEEEecHHHHHhcccCCCC----Ccc-HHHHHHH----hCCccEEEEccc Q lcl|Aclame:pro 199 PAVEAVVNEVVALFQVLQTQSQGIITQEDVLRMGLPPTAMSDLSKTNQY----GLA-AAAKLKD----IFPKLEFVTIPE 269 (336) Q Consensus 199 ~t~~eI~~Di~~l~~~l~~~s~g~v~~~~p~tL~Lp~~~~~~L~~~~~~----~~T-vl~~l~~----n~pnl~i~~~pe 269 (336) .+++.+++.|.++...+-..- + +...-.|+++|..+..|.+..++ ... -..-|++ +.-+++|..... T Consensus 135 ~~~~~~~~~i~~a~~~ld~~~---v-P~~~R~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~~G~ig~i~G~~v~~s~~ 210 (273) T protein:vir:10 135 TDADDAFDLIAKALKELTKAN---V-PNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNN 210 (273) T ss_pred cchhHHHHHHHHHHHHhhhcC---C-CcCCCEEEECHHHHHHHhcchhhhhhhhccccccceeeeeeeEEeceEEEEecc Confidence 456778888888877773321 1 22345799999999987543210 000 0011111 122355555444 Q ss_pred ccCCCCceEEEEEEeecCCceEEEEcCh---hhhcccceecCCceEEccccceeeeeeecccceeeecc--C Q lcl|Aclame:pro 270 YDTASGRLVQLWAPRVEGKDTATCGFTE---KMRAHSIERYSSYFRQKKSAGTWGAVIFRPFAVAQMIG--V 336 (336) Q Consensus 270 l~~a~G~~~~~~~~~~~~~~~~~~~~p~---~~r~l~~~~~~~~~~vp~~~~t~Gv~ir~P~av~~~~G--I 336 (336) +-..++.....+... . +.++. .+..+..+.+. .-.+... -..|+-+.||-+++.+.= = T Consensus 211 lp~~~~~~~~~~~~~--A-----~~~a~q~~~~e~~r~~~~~-~~~v~~~-~~yg~~v~~~~~~~~l~~~g~ 273 (273) T protein:vir:10 211 LRDTDDEQFVAFHPS--A-----AAYVSQIDTVEALRDQDSF-SDRIRAL-HVYGGKVVRPTGVVVFNKTGS 273 (273) T ss_pred cccCCccEEEEEecc--c-----eeeeeeeehhhcccCCCcc-eeeeeee-eeeeeeEeccceEEEEeccCC Confidence 433333333222211 1 11122 12222222222 2222222 236777778888776543 3 No 143 >protein:vir:102605 Length: 273 # NCBI annotation: gp6 # Family: family:all:2203 # MgeID: mge:1661 # MgeName: Llij # Cross-refs: genbank:acc:YP_655002;genbank:gi:109392192;genbank:GeneID:4157227 Probab=25.59 E-value=2 Score=18.89 Aligned_cols=256 Identities=11% Similarity=-0.009 Sum_probs=104.2 Q ss_pred ccccCcchHHHHHHHhhCceeeeeeccccchhhhccccc--CCCcceeeEEEeeeecceeeEEeecccCCceeeeeeeee Q lcl|Aclame:pro 42 LSSTGSSGIPNYLTTYVDPAVIDILVAPMKAAELVGESK--KGDWTTLVAAFITAEPTTKVATYGDYSSDGDSGANINYP 119 (336) Q Consensus 42 l~t~~~~~i~~~l~~~idp~v~~~~~~~~~~~~l~~v~t--~g~w~~~t~~~~~~e~~G~a~~ygd~~diP~~~~~~~~~ 119 (336) |. -+.-+|..+..-|... .........|+.... ++..+ .++.++..-..+.+.--+....++.-+.+..+. T Consensus 1 MA--~~~~~pe~~~~~v~~~----~~~~lv~~~l~~~~~~~~~~~G-dtv~ip~~~~~~~~d~~~~~~~~~~~~~~~~~~ 73 (273) T protein:vir:10 1 MA--FNNFIPELWSDMLLEE----WTAQTVFANLVNREYEGTASKG-NVVHIAGVVAPTVKDYKAAGRQTSADAISDTGV 73 (273) T ss_pred Cc--chhhhHHHHHHHHHHH----HHhhhccchhhccccccccccC-ceEEEeecccccccccccCCCccCccccccceE Confidence 11 1222455444433222 233333444544322 23232 578888755444322111112222333333444 Q ss_pred eeeEEE-EEEEEeeCHHHHHHHHhhCCCHHHHHHHHHHHHHHHhhcceEEeeccccceEEEEecCCCCcccccccccccc Q lcl|Aclame:pro 120 QRQSYF-FQTWTRWGERELEMAGAGRVDLASELNYSSALGLAKFLNGSYLFGVAGLENYGLINDPSLSAPITATTPWSGS 198 (336) Q Consensus 120 ~~~v~~-~~~~~~y~~~El~~A~~~g~~l~~~k~~aAr~a~e~~~n~~~~~Gd~~~g~~GllN~Pnl~~~~~~~t~w~~~ 198 (336) ..++-. -..++.++..|...+ . .++.+ -...+..++.+.+++..+ + . +.+- ... .+ . ... T Consensus 74 ~~tid~~~~~~~~i~d~d~~~~--~-~~~~~-~~~~~~~alA~~vD~~i~-~--~--~~~a----~~~--~~-~---~~~ 134 (273) T protein:vir:10 74 DLLIDQEKSIDFLVDDIDRVQV--A-GSLEA-YTRAGATALATDTDKFIA-D--M--LVDN----GTA--LT-G---SAP 134 (273) T ss_pred EEEEeeeeecceEeecHHHhhh--h-ccHHH-HHHHHHHHHHHHHHHHHH-H--H--Hhcc----ccc--cc-c---ccc Confidence 444422 244555554443322 2 24433 233344455555553221 0 0 0000 000 00 0 112 Q ss_pred cCHHHHHHHHHHHHHHHHHHhCCceecccccEEEecHHHHHhcccCCCC----Ccc-HHHHHHH----hCCccEEEEccc Q lcl|Aclame:pro 199 PAVEAVVNEVVALFQVLQTQSQGIITQEDVLRMGLPPTAMSDLSKTNQY----GLA-AAAKLKD----IFPKLEFVTIPE 269 (336) Q Consensus 199 ~t~~eI~~Di~~l~~~l~~~s~g~v~~~~p~tL~Lp~~~~~~L~~~~~~----~~T-vl~~l~~----n~pnl~i~~~pe 269 (336) .+++.+++.|.++...+-..- + +...-.|+++|..+..|.+..++ ... -..-|++ +.-+++|..... T Consensus 135 ~~~~~~~~~i~~a~~~ld~~~---v-P~~~R~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~~G~ig~i~G~~v~~s~~ 210 (273) T protein:vir:10 135 TDADDAFDLIAKALKELTKAN---V-PNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGARIVESNN 210 (273) T ss_pred cchhHHHHHHHHHHHHhhhcC---C-CcCCCEEEECHHHHHHHhcchhhhhhhhccccccceeeeeeeEEeceEEEEecc Confidence 456778888888877773321 1 22345799999999987543210 000 0011111 122355555444 Q ss_pred ccCCCCceEEEEEEeecCCceEEEEcCh---hhhcccceecCCceEEccccceeeeeeecccceeeecc--C Q lcl|Aclame:pro 270 YDTASGRLVQLWAPRVEGKDTATCGFTE---KMRAHSIERYSSYFRQKKSAGTWGAVIFRPFAVAQMIG--V 336 (336) Q Consensus 270 l~~a~G~~~~~~~~~~~~~~~~~~~~p~---~~r~l~~~~~~~~~~vp~~~~t~Gv~ir~P~av~~~~G--I 336 (336) +-..++.....+... . +.++. .+..+..+.+. .-.+... -..|+-+.||-+++.+.= = T Consensus 211 lp~~~~~~~~~~~~~--A-----~~~a~q~~~~e~~r~~~~~-~~~v~~~-~~yg~~v~~~~~~~~l~~~g~ 273 (273) T protein:vir:10 211 LRDTDDEQFVAFHPS--A-----AAYVSQIDTVEALRDQDSF-SDRIRAL-HVYGGKVVRPTGVVVFNKTGS 273 (273) T ss_pred cccCCccEEEEEecc--c-----eeeeeeeehhhcccCCCcc-eeeeeee-eeeeeeEeccceEEEEeccCC Confidence 433333333222211 1 11122 12222222222 2222222 236777778888776543 3 No 144 >protein:vir:78935 Length: 335 # NCBI annotation: capsid protein # Family: family:all:2806 # MgeID: mge:1860 # MgeName: LKD16 # Cross-refs: genbank:acc:YP_001522824;genbank:gi:158345059;genbank:GeneID:5687425 Probab=20.08 E-value=2.8 Score=18.10 Aligned_cols=272 Identities=14% Similarity=0.092 Sum_probs=107.7 Q ss_pred ccccCcchHHH----------HHHHhhCceeeeeeccccchhhhcccccCCCcceeeEEEeeeecceeeEEe----ecc- Q lcl|Aclame:pro 42 LSSTGSSGIPN----------YLTTYVDPAVIDILVAPMKAAELVGESKKGDWTTLVAAFITAEPTTKVATY----GDY- 106 (336) Q Consensus 42 l~t~~~~~i~~----------~l~~~idp~v~~~~~~~~~~~~l~~v~t~g~w~~~t~~~~~~e~~G~a~~y----gd~- 106 (336) |++.++.+.|. |+-.|- -+|.+.....-+.+.++.+.+.- ...++.|+. +|+++.. |.. T Consensus 1 ms~~~~~t~~~~~~s~~d~al~le~f~-geV~~af~~~s~~~~~~~~rti~--~g~s~~~~~---iG~~~~~~~~pG~~l 74 (335) T protein:vir:78 1 MSFLNDLTRPNYAGKNADVDIHLEEHL-GIVDKHFAYTSKFAPLMNIRDLR--GSNVVRLDR---LGNVEAKGRRAGEEL 74 (335) T ss_pred CCccccccccccccccchhhhhhhhhh-hHHHHHHHHhhhhccccceeeec--cceeEEEee---eeeeeecccccCccc Confidence 22222222211 111111 11111111222334444443321 125566664 6888765 221 Q ss_pred cCCceeeeeeeeeeee--EEEEEEEEeeCHHHHHHHHhhCCCHHHHHHHHHHHHHHHhhcceEE------eec-cccceE Q lcl|Aclame:pro 107 SSDGDSGANINYPQRQ--SYFFQTWTRWGERELEMAGAGRVDLASELNYSSALGLAKFLNGSYL------FGV-AGLENY 177 (336) Q Consensus 107 ~diP~~~~~~~~~~~~--v~~~~~~~~y~~~El~~A~~~g~~l~~~k~~aAr~a~e~~~n~~~~------~Gd-~~~g~~ 177 (336) +..|... ++.... -..+.-.+=|.++|.+ +..++-++-....-.++.++.|+..+ .+. +..... T Consensus 75 ~~~~~~~---~k~~itID~ll~a~~~VddlDe~~----~~yDvR~e~s~~~G~aLA~~~Dq~~~~~l~~aa~~~a~~~~~ 147 (335) T protein:vir:78 75 ERSRVVN---DKWNLTVDTLLYLRHQFDHQDEWT----QSFDMRKEVAELDGQELARKFDQACLIQVIKAAAMDAPVDLE 147 (335) T ss_pred CCCCccc---CCeEEEecceeechhhHhhHHHhh----cCchhHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccC Confidence 1223211 111111 1122223333344432 33444444444555555555554332 111 111111 Q ss_pred EEEecCCCCcccccccccccccCHHHHHHHHHHHHHHHHHHhCCceecccc---cEEEecHHHHHhcccC-----CCCCc Q lcl|Aclame:pro 178 GLINDPSLSAPITATTPWSGSPAVEAVVNEVVALFQVLQTQSQGIITQEDV---LRMGLPPTAMSDLSKT-----NQYGL 249 (336) Q Consensus 178 GllN~Pnl~~~~~~~t~w~~~~t~~eI~~Di~~l~~~l~~~s~g~v~~~~p---~tL~Lp~~~~~~L~~~-----~~~~~ 249 (336) +-++ |........++. .+.++++.+.+=+.++...+....- ++.+ ..++++|.+|..|-.- ++|+. T Consensus 148 ~~~~-~G~~~~~~~tg~-~~~~~~~~l~~a~~~a~~~l~ekdv----P~~~~~~rv~vv~P~~y~~Ll~~~~l~n~~~~~ 221 (335) T protein:vir:78 148 DAFS-PGVLEKLDLTGL-TAKEAAEKIVRMHRRVVETFIERDL----GDAVYSEGLTPMSPRVFSLLLEHDKLMSVEYQA 221 (335) T ss_pred CCcC-CCcceeeeeccc-cccccHHHHHHHHHHHHHHHHhccC----CCCCCCccEEEeChHHHHHHhcccccccccccc Confidence 1222 122111111111 2234578888877777777754431 1221 4689999999988542 12221 Q ss_pred c--HHHHHHHh---CCccEEEEcccccCCCC---------c------eEEEEEEeecCCc--eEEEEcChhhhcccceec Q lcl|Aclame:pro 250 A--AAAKLKDI---FPKLEFVTIPEYDTASG---------R------LVQLWAPRVEGKD--TATCGFTEKMRAHSIERY 307 (336) Q Consensus 250 T--vl~~l~~n---~pnl~i~~~pel~~a~G---------~------~~~~~~~~~~~~~--~~~~~~p~~~r~l~~~~~ 307 (336) | .-.+.+.. .-+++|+..+.|-+.++ + .+.+.+- ....- +++ .++...+.. -+.+ T Consensus 222 s~~~~~~~~g~v~~v~Gv~V~~Sn~lP~~~~t~~~lg~a~n~~~~d~~~~~~~~-~~~~Al~t~~-~~~~~~e~~-~~~~ 298 (335) T protein:vir:78 222 TGATNDYVKSRVAILNGVKVLETPRFATKAISAHPLGRHFNVSAEEAERQIALF-LPSKTLITAQ-VAPVQAKLW-EDHD 298 (335) T ss_pred cccccccccceeEEeeceEEEeeccCCCCCCccccccccCCcccccccceEEEE-EecceEEEEE-EEeccccee-eccc Confidence 1 11111111 12466666666642211 1 1111110 11111 111 111111111 1112 Q ss_pred CCceEEccccceeeeeeecc--cceeeeccC Q lcl|Aclame:pro 308 SSYFRQKKSAGTWGAVIFRP--FAVAQMIGV 336 (336) Q Consensus 308 ~~~~~vp~~~~t~Gv~ir~P--~av~~~~GI 336 (336) .-.|.+.+... .|+-++|| .++...+|| T Consensus 299 ~~~~~i~~~~a-~G~g~lRPe~a~~i~~tg~ 328 (335) T protein:vir:78 299 QFSWVLDTFQM-YNIGARRPDTAGAIELKGI 328 (335) T ss_pred hhhHhhhHHHH-cCCcccCcceEEEEEecCC Confidence 23455555544 79999999 566778899 Done!