Query lcl|Aclame:protein:vir:78558|NCBI_annot:major capsid protein|genbank:acc:YP_001294848;genbank:gi:149882911;genbank:GeneID:5291029 Match_columns 336 No_of_seqs 105 out of 109 Neff 6.5 Searched_HMMs 1612 Date Mon Dec 2 11:33:57 2013 Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_13 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_13_vs_rec_db.hhr No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 protein:vir:78558 Length: 336 100.0 8E-136 5E-139 761.3 34.5 336 1-336 1-336 (336) 2 protein:vir:106734 Length: 336 100.0 1E-135 7E-139 760.6 34.5 336 1-336 1-336 (336) 3 protein:vir:3643 Length: 336 # 100.0 1E-134 6E-138 755.2 34.5 336 1-336 1-336 (336) 4 protein:vir:101557 Length: 336 100.0 1E-134 6E-138 755.3 34.3 336 1-336 1-336 (336) 5 protein:vir:94070 Length: 339 100.0 1E-122 8E-126 688.9 32.8 335 1-336 4-339 (339) 6 protein:vir:107732 Length: 379 100.0 4E-118 3E-121 664.0 31.0 334 1-336 23-379 (379) 7 protein:vir:99576 Length: 388 100.0 4E-115 2E-118 647.8 31.4 333 1-336 30-388 (388) 8 protein:vir:96079 Length: 382 100.0 3E-112 2E-115 631.9 27.7 334 1-336 21-382 (382) 9 protein:vir:79642 Length: 329 100.0 5.4E-92 3.3E-95 521.0 29.4 319 12-336 1-326 (329) 10 protein:vir:104342 Length: 314 100.0 2.6E-90 1.6E-93 511.8 27.8 305 13-336 1-311 (314) 11 protein:vir:107687 Length: 319 100.0 2.6E-88 1.6E-91 500.8 28.9 316 1-336 1-319 (319) 12 protein:vir:80068 Length: 301 100.0 8.9E-88 5.5E-91 497.8 28.0 292 42-336 1-301 (301) 13 protein:vir:103285 Length: 296 100.0 1.3E-85 7.9E-89 486.0 27.0 290 31-336 1-293 (296) 14 protein:vir:5255 Length: 304 # 100.0 7.6E-83 4.7E-86 470.8 27.1 284 33-335 1-304 (304) 15 protein:vir:105778 Length: 358 98.8 2.1E-10 1.3E-13 73.7 13.4 315 1-336 12-357 (358) 16 protein:vir:7771 Length: 330 # 98.7 9.7E-10 6E-13 70.0 13.7 288 31-336 1-321 (330) 17 protein:vir:104085 Length: 320 98.5 1.5E-08 9.2E-12 63.5 14.4 285 27-336 1-315 (320) 18 protein:vir:80376 Length: 435 98.5 2.6E-08 1.6E-11 62.1 14.9 315 1-336 55-431 (435) 19 protein:vir:105905 Length: 304 98.5 3.2E-08 2E-11 61.7 15.3 282 31-336 1-303 (304) 20 protein:vir:94142 Length: 304 98.5 3.2E-08 2E-11 61.7 15.3 282 31-336 1-303 (304) 21 protein:vir:1433 Length: 435 # 98.4 3.6E-08 2.2E-11 61.4 15.0 316 1-336 52-431 (435) 22 protein:vir:5739 Length: 366 # 98.4 1.6E-07 1E-10 57.8 17.8 303 1-336 30-364 (366) 23 protein:vir:94771 Length: 298 98.3 5.6E-08 3.5E-11 60.3 13.2 274 31-336 1-297 (298) 24 protein:vir:99920 Length: 311 98.2 1.5E-07 9.6E-11 57.9 14.0 280 31-336 1-310 (311) 25 protein:vir:9574 Length: 300 # 98.2 1.3E-07 8.3E-11 58.3 13.5 275 31-336 1-298 (300) 26 protein:vir:94673 Length: 419 98.2 6.6E-07 4.1E-10 54.5 16.8 307 1-336 56-415 (419) 27 protein:vir:8420 Length: 477 # 98.2 2E-07 1.2E-10 57.3 13.9 318 1-336 103-469 (477) 28 protein:vir:1638 Length: 298 # 98.2 2.2E-07 1.4E-10 57.1 14.2 274 31-336 1-297 (298) 29 protein:vir:8187 Length: 311 # 98.2 2E-07 1.3E-10 57.3 13.8 275 42-336 1-308 (311) 30 protein:vir:2504 Length: 305 # 98.2 3.4E-07 2.1E-10 56.1 14.9 274 31-336 1-296 (305) 31 protein:vir:108211 Length: 318 98.2 1.6E-08 9.9E-12 63.3 7.6 274 24-336 1-315 (318) 32 protein:vir:95763 Length: 297 98.1 4E-07 2.5E-10 55.6 14.1 275 28-336 1-294 (297) 33 protein:vir:4226 Length: 326 # 98.0 7E-07 4.4E-10 54.3 13.8 290 1-336 3-321 (326) 34 protein:vir:41 Length: 299 # N 98.0 1.2E-06 7.2E-10 53.1 14.2 275 31-336 1-296 (299) 35 protein:vir:105038 Length: 428 97.9 2.1E-06 1.3E-09 51.7 15.0 317 1-336 53-426 (428) 36 protein:vir:97148 Length: 324 97.9 2.4E-06 1.5E-09 51.4 14.9 293 1-336 1-313 (324) 37 protein:vir:96392 Length: 324 97.9 4.3E-06 2.7E-09 50.0 16.2 294 1-336 1-313 (324) 38 protein:vir:78830 Length: 324 97.9 4.3E-06 2.7E-09 50.0 16.2 294 1-336 1-313 (324) 39 protein:vir:2430 Length: 318 # 97.9 2.5E-06 1.6E-09 51.2 14.7 289 10-336 1-311 (318) 40 protein:vir:103955 Length: 324 97.8 4.8E-06 3E-09 49.8 15.9 289 1-336 1-313 (324) 41 protein:vir:78223 Length: 333 97.8 2E-06 1.2E-09 51.8 13.7 292 1-336 1-330 (333) 42 protein:vir:78523 Length: 338 97.8 4.6E-06 2.8E-09 49.9 15.3 295 1-336 1-333 (338) 43 protein:vir:9759 Length: 303 # 97.8 2.5E-06 1.6E-09 51.3 13.7 281 31-336 1-301 (303) 44 protein:vir:9309 Length: 324 # 97.8 7.9E-06 4.9E-09 48.5 16.4 289 1-336 1-313 (324) 45 protein:vir:99749 Length: 324 97.8 9E-06 5.6E-09 48.3 16.4 289 1-336 1-313 (324) 46 protein:vir:96223 Length: 324 97.7 9.3E-06 5.7E-09 48.2 15.7 294 1-336 1-313 (324) 47 protein:vir:100135 Length: 418 97.7 1E-05 6.2E-09 48.0 15.9 303 1-336 67-413 (418) 48 protein:vir:104256 Length: 458 97.6 2E-05 1.2E-08 46.3 16.3 314 1-336 99-456 (458) 49 protein:vir:80684 Length: 315 97.5 8.4E-06 5.2E-09 48.4 12.9 275 31-336 1-304 (315) 50 protein:vir:101650 Length: 497 97.5 8.1E-06 5E-09 48.5 12.7 310 1-336 98-491 (497) 51 protein:vir:7855 Length: 497 # 97.5 8.1E-06 5E-09 48.5 12.7 310 1-336 98-491 (497) 52 protein:vir:81227 Length: 413 97.4 4.3E-05 2.7E-08 44.5 15.7 302 1-336 58-408 (413) 53 protein:vir:4339 Length: 395 # 97.4 2.3E-05 1.4E-08 46.0 14.2 305 1-336 54-393 (395) 54 protein:vir:191 Length: 385 # 97.3 2.9E-05 1.8E-08 45.4 13.3 304 1-336 50-382 (385) 55 protein:vir:1886 Length: 385 # 97.3 2.9E-05 1.8E-08 45.4 13.3 304 1-336 50-382 (385) 56 protein:vir:10364 Length: 390 97.2 7.3E-05 4.5E-08 43.3 15.0 300 1-336 54-390 (390) 57 protein:vir:8102 Length: 543 # 97.0 0.00014 8.8E-08 41.7 14.3 306 1-336 188-540 (543) 58 protein:vir:3613 Length: 272 # 96.8 9.5E-05 5.9E-08 42.6 12.3 257 31-336 1-272 (272) 59 protein:vir:93616 Length: 645 96.8 0.00031 1.9E-07 39.8 14.8 305 1-336 290-637 (645) 60 protein:vir:97053 Length: 390 96.8 0.00023 1.4E-07 40.6 14.0 297 1-336 54-390 (390) 61 protein:vir:96833 Length: 275 96.7 7.4E-05 4.6E-08 43.2 11.0 257 31-336 1-275 (275) 62 protein:vir:2344 Length: 397 # 96.7 0.00029 1.8E-07 40.0 14.1 278 13-336 1-304 (397) 63 protein:vir:9410 Length: 415 # 96.6 0.00046 2.8E-07 38.9 14.8 309 1-336 58-402 (415) 64 protein:vir:96123 Length: 274 96.5 0.00041 2.5E-07 39.2 13.5 256 31-336 1-268 (274) 65 protein:vir:96762 Length: 632 96.4 0.00053 3.3E-07 38.5 14.0 304 1-336 304-631 (632) 66 protein:vir:4159 Length: 315 # 96.4 0.00068 4.2E-07 37.9 14.7 299 15-336 1-315 (315) 67 protein:vir:6212 Length: 434 # 96.3 0.00049 3.1E-07 38.7 13.0 308 1-336 95-427 (434) 68 protein:vir:4700 Length: 415 # 96.2 0.00086 5.3E-07 37.4 16.2 303 1-336 51-402 (415) 69 protein:vir:4600 Length: 415 # 96.2 0.00086 5.3E-07 37.4 16.2 303 1-336 51-402 (415) 70 protein:vir:1328 Length: 392 # 96.1 0.0011 6.6E-07 36.9 14.5 307 1-336 58-389 (392) 71 protein:vir:81070 Length: 390 96.0 0.0011 7E-07 36.7 15.9 302 1-336 54-390 (390) 72 protein:vir:6242 Length: 390 # 96.0 0.0011 7E-07 36.7 13.6 302 1-336 58-387 (390) 73 protein:vir:4197 Length: 314 # 95.9 0.0013 8.1E-07 36.4 14.1 290 1-336 1-310 (314) 74 protein:vir:4456 Length: 401 # 95.8 0.00094 5.8E-07 37.2 12.5 314 1-336 51-399 (401) 75 protein:vir:97433 Length: 274 95.8 0.0012 7.7E-07 36.5 13.1 255 31-336 1-268 (274) 76 protein:vir:94494 Length: 274 95.8 0.0012 7.7E-07 36.5 13.1 255 31-336 1-268 (274) 77 protein:vir:98339 Length: 415 95.8 0.0015 9.4E-07 36.0 15.3 304 1-336 51-402 (415) 78 protein:vir:79987 Length: 415 95.8 0.0015 9.4E-07 36.0 15.3 304 1-336 51-402 (415) 79 protein:vir:81100 Length: 415 95.8 0.0015 9.4E-07 36.0 15.3 304 1-336 51-402 (415) 80 protein:vir:93742 Length: 274 95.7 0.0014 8.7E-07 36.2 12.8 255 31-336 1-268 (274) 81 protein:vir:80930 Length: 278 95.6 0.0011 6.6E-07 36.9 12.0 262 31-336 1-275 (278) 82 protein:vir:4830 Length: 397 # 95.6 0.0018 1.1E-06 35.6 13.5 294 1-336 50-385 (397) 83 protein:vir:100247 Length: 425 95.0 0.003 1.9E-06 34.4 15.3 314 1-336 88-422 (425) 84 protein:vir:3033 Length: 272 # 94.9 0.0032 2E-06 34.2 15.3 251 31-336 1-267 (272) 85 protein:vir:9820 Length: 272 # 94.9 0.0032 2E-06 34.2 15.3 251 31-336 1-267 (272) 86 protein:vir:485 Length: 407 # 94.9 0.0033 2.1E-06 34.2 16.4 314 1-336 53-398 (407) 87 protein:vir:4092 Length: 390 # 94.8 0.0035 2.2E-06 34.1 16.0 301 1-336 55-368 (390) 88 protein:vir:107882 Length: 307 94.7 0.0019 1.2E-06 35.5 10.8 262 33-336 1-300 (307) 89 protein:vir:105334 Length: 276 94.1 0.0054 3.4E-06 33.0 12.8 253 31-336 1-268 (276) 90 protein:vir:4856 Length: 293 # 94.0 0.0051 3.2E-06 33.2 11.7 258 27-336 1-279 (293) 91 protein:vir:1239 Length: 274 # 93.9 0.006 3.7E-06 32.7 14.7 253 34-336 1-268 (274) 92 protein:vir:96262 Length: 274 93.3 0.0079 4.9E-06 32.1 13.5 255 34-336 1-268 (274) 93 protein:vir:95898 Length: 274 93.3 0.0079 4.9E-06 32.1 13.5 255 34-336 1-268 (274) 94 protein:vir:7409 Length: 408 # 93.0 0.0091 5.7E-06 31.8 13.9 298 1-336 59-391 (408) 95 protein:vir:3870 Length: 400 # 92.6 0.011 6.7E-06 31.4 12.4 287 1-336 65-397 (400) 96 protein:vir:3991 Length: 404 # 92.2 0.012 7.7E-06 31.0 14.1 299 1-336 63-391 (404) 97 protein:vir:102119 Length: 404 92.1 0.013 8E-06 30.9 14.0 309 1-336 44-398 (404) 98 protein:vir:4953 Length: 397 # 92.0 0.013 8.1E-06 30.9 13.4 295 1-336 53-383 (397) 99 protein:vir:93881 Length: 387 91.2 0.017 1E-05 30.3 10.9 282 1-336 56-379 (387) 100 protein:vir:1025 Length: 408 # 91.1 0.017 1.1E-05 30.2 15.6 298 1-336 56-391 (408) 101 protein:vir:79078 Length: 307 90.8 0.019 1.2E-05 30.0 12.1 263 33-336 1-300 (307) 102 protein:vir:95107 Length: 270 90.3 0.017 1.1E-05 30.2 9.8 253 31-336 1-263 (270) 103 protein:vir:98480 Length: 348 90.3 0.022 1.3E-05 29.7 10.7 275 33-336 1-341 (348) 104 protein:vir:78640 Length: 352 90.1 0.023 1.4E-05 29.6 12.0 283 1-336 21-344 (352) 105 protein:vir:4511 Length: 409 # 89.5 0.026 1.6E-05 29.3 12.5 307 1-336 41-404 (409) 106 protein:vir:102873 Length: 392 88.4 0.033 2E-05 28.7 13.9 295 1-336 35-382 (392) 107 protein:vir:107593 Length: 392 88.4 0.033 2E-05 28.7 13.9 295 1-336 35-382 (392) 108 protein:vir:105004 Length: 392 88.4 0.033 2E-05 28.7 13.9 295 1-336 35-382 (392) 109 protein:vir:102082 Length: 392 88.4 0.033 2E-05 28.7 13.9 295 1-336 35-382 (392) 110 protein:vir:101607 Length: 379 87.6 0.037 2.3E-05 28.4 14.4 293 1-336 39-379 (379) 111 protein:vir:81160 Length: 371 87.5 0.038 2.4E-05 28.4 14.5 292 1-336 45-368 (371) 112 protein:vir:102655 Length: 322 86.8 0.043 2.7E-05 28.1 9.9 283 31-336 1-319 (322) 113 protein:vir:4997 Length: 397 # 86.0 0.049 3E-05 27.8 14.1 295 1-336 43-383 (397) 114 protein:vir:8843 Length: 317 # 85.2 0.055 3.4E-05 27.5 11.3 283 31-336 1-313 (317) 115 protein:vir:1268 Length: 397 # 84.5 0.06 3.7E-05 27.3 13.9 287 1-336 92-395 (397) 116 protein:vir:1383 Length: 421 # 83.2 0.07 4.4E-05 26.9 13.9 295 1-336 54-392 (421) 117 protein:vir:9361 Length: 402 # 80.7 0.092 5.7E-05 26.3 9.3 295 1-336 65-396 (402) 118 protein:vir:80128 Length: 466 79.8 0.1 6.2E-05 26.1 16.1 300 1-336 112-446 (466) 119 protein:vir:100172 Length: 394 76.5 0.14 8.4E-05 25.3 14.4 287 1-336 57-382 (394) 120 protein:vir:94933 Length: 330 76.3 0.14 8.5E-05 25.3 8.7 289 1-336 1-327 (330) 121 protein:vir:96978 Length: 387 76.0 0.14 8.7E-05 25.3 9.2 295 1-336 50-379 (387) 122 protein:vir:2685 Length: 387 # 76.0 0.14 8.7E-05 25.3 9.2 295 1-336 50-379 (387) 123 protein:vir:94424 Length: 387 76.0 0.14 8.7E-05 25.3 9.2 295 1-336 50-379 (387) 124 protein:vir:78350 Length: 383 74.5 0.16 9.8E-05 25.0 11.2 304 1-336 44-372 (383) 125 protein:vir:3845 Length: 395 # 72.5 0.18 0.00011 24.6 14.5 295 1-336 50-381 (395) 126 protein:vir:97255 Length: 310 71.8 0.19 0.00012 24.5 11.8 267 17-336 1-308 (310) 127 protein:vir:739 Length: 231 # 67.8 0.25 0.00015 23.9 13.5 219 79-336 1-231 (231) 128 protein:vir:99888 Length: 309 60.0 0.38 0.00024 22.9 15.3 265 36-336 1-301 (309) 129 protein:vir:6324 Length: 335 # 59.7 0.39 0.00024 22.8 8.7 274 42-336 1-328 (335) 130 protein:vir:95376 Length: 425 55.5 0.48 0.0003 22.3 12.9 302 1-336 73-418 (425) 131 protein:vir:101291 Length: 381 55.3 0.48 0.0003 22.3 15.7 298 1-336 41-365 (381) 132 protein:vir:9509 Length: 381 # 55.3 0.48 0.0003 22.3 15.7 298 1-336 41-365 (381) 133 protein:vir:9704 Length: 394 # 53.1 0.54 0.00033 22.1 12.3 284 1-336 85-388 (394) 134 protein:vir:3158 Length: 321 # 46.2 0.74 0.00046 21.3 13.5 284 1-336 1-309 (321) 135 protein:vir:94622 Length: 341 41.5 0.92 0.00057 20.8 12.6 279 31-336 1-337 (341) 136 protein:vir:99675 Length: 324 36.7 1.2 0.00072 20.2 8.4 244 75-336 1-301 (324) 137 protein:vir:7990 Length: 273 # 31.8 1.5 0.00091 19.7 14.0 257 31-336 1-271 (273) 138 protein:vir:2736 Length: 348 # 25.5 2 0.0013 18.9 14.0 262 31-336 1-319 (348) 139 protein:vir:102605 Length: 273 23.4 2.3 0.0014 18.6 14.8 256 31-336 1-271 (273) 140 protein:vir:105822 Length: 273 23.4 2.3 0.0014 18.6 14.8 256 31-336 1-271 (273) 141 protein:vir:962 Length: 397 # 20.6 2.7 0.0017 18.2 11.9 289 1-336 93-396 (397) No 1 >protein:vir:78558 Length: 336 # NCBI annotation: major capsid protein # Family: family:all:1653 # MgeID: mge:1854 # MgeName: BcepNY3 # Cross-refs: genbank:acc:YP_001294848;genbank:gi:149882911;genbank:GeneID:5291029 Probab=100.00 E-value=7.9e-136 Score=761.26 Aligned_cols=336 Identities=100% Similarity=1.441 Sum_probs=333.9 Q ss_pred CchHHHHHHHhhcceeccchhhhhhhhhhhhhhhhhhhcCccccCCcchHHHHHHHhhCceeeeeeccccchhhhccccc Q lcl|Aclame:pro 1 MRDAQRIQNLARAGVILPRSVKNVSTPLAEYAMDAADLSPHLSSTGSSGIPNYLTTYVDPSVIDILVAPMKAAELVGESK 80 (336) Q Consensus 1 m~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~da~d~~~~l~t~~~~~i~~~l~~~idp~v~~~~~~~~~~~~l~~v~t 80 (336) |||++++++|+|+||+||+++++|+++++.|+|||+|++|+|+|++|+|||||||+||||++||++|+||++++|||++| T Consensus 1 ~~~~~~~~~l~~~gi~~~~~~~~~~~~~~~~a~da~d~~~~~~t~~~~g~~~~l~~~i~p~~~~~~~~~~~~~~l~~v~t 80 (336) T protein:vir:78 1 MRDAQRIQNLARAGVILPRSVKNVSTPLAEYAMDAADLSPHLSSTGSSGIPNYLTTYVDPSVIDILVAPMKAAELVGESK 80 (336) T ss_pred CchHHHHHHHhccCeecchhhhhhhHHHHHHHHhhhhhccccccCCCcchHHHHHHhcccceeeehhhhhhhhhhccccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CCCcceeeEEEeeeecccceEEeecccCCceeeeeeeeeeeeEEEEEEEEEeCHHHHHHHHHhCCCHHHHHHHHHHHHHH Q lcl|Aclame:pro 81 KGDWTTLVAAFITAEPTTTVATYGDYSSDGDSGTNINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASELNYSSALGLA 160 (336) Q Consensus 81 ~g~w~~~t~~~~v~e~~G~a~~ygd~~DiP~vd~~~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~K~~aAr~a~e 160 (336) +|+|++++++|+++|.+|+|++|||++|+|++|+++++++|+++++++||+||++|+++|+++|++|+++|+.+||+++| T Consensus 81 ~g~W~~~~~~~~~~e~~G~a~~ygd~~D~P~vd~~~~~~~~~v~~~~~g~~yg~~El~~A~~~g~~l~~~Ka~aA~~ale 160 (336) T protein:vir:78 81 KGDWTTLVAAFITAEPTTTVATYGDYSSDGDSGTNINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASELNYSSALGLA 160 (336) T ss_pred CCCccccEEEEeeeecceeeEEeecccCCCeeecceeeEEEEEEEEEeeeeecHHHHHHHHHhCCCcHHHHHHHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HhhccEEEeeccccceEEEEecCCCCcccccccccccccCHHHHHHHHHHHHHHHHHHhCCceeccCCcEEEecHHHHHh Q lcl|Aclame:pro 161 KFLNGSYLFGVAGLENYGLINDPSLSAPITATTPWSGSPAVEAVVNEVVTLFQVLQTQSQGIITQEAVLHMGLPPTAMSD 240 (336) Q Consensus 161 ~~~n~i~~~Gd~~~g~~GllN~Pnl~~~~~~~t~w~~~~T~~eI~~Di~~l~~~l~~~t~g~v~~~~p~tL~Lp~~~~~~ 240 (336) +++|++|||||+++++|||||||||+++++++++||+++|+|||++||++++++|++||+|.+++|+|+||+||++++.+ T Consensus 161 ~~~N~~~~~Gd~~~~~~GllN~P~l~a~~t~~~~~w~~~T~~~I~~Di~~~~~~l~~qt~g~~~~~~~~tL~Lp~~~~~~ 240 (336) T protein:vir:78 161 KFLNGSYLFGVAGLENYGLINDPSLSAPITATTPWSGSPAVEAVVNEVVTLFQVLQTQSQGIITQEAVLHMGLPPTAMSD 240 (336) T ss_pred HhhCeEEEEeccccceEEEEeCCCCCcccccCcCcccccCHHHHHHHHHHHHHHHHHhcCCeeeeccceEEEechHHHHh Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccCCCCCccHHHHHHHhCCccEEEEcccccCCCCceEEEEEEeeCCCceEEEEeCchhhcccceecCCceEEeeeccee Q lcl|Aclame:pro 241 LSKTNQYGLSAAAKLKEIFPKLEFVTIPEYDTASGRLVQLWAPRVEGKDTATCGFTEKMRAHSIERYSSYFRQKKSAGTW 320 (336) Q Consensus 241 Ls~~~~~~~Tvl~~l~~n~pnl~i~~~pel~~a~G~~~~~~~~~~~~~~~~~~~~p~~~~~l~~~~~~~~~~v~~~~rt~ 320 (336) |+++|++|+||++|||+|||||+|+++|||++|||++++||++++++++++++++||+||+||+|+++++|+|||++||| T Consensus 241 L~~~n~~g~tv~~~lk~n~Pnl~i~t~pel~~Agg~~~~~~~~~~~~~~t~~~~~p~~f~~lpvq~~~~~~~v~~~~rt~ 320 (336) T protein:vir:78 241 LSKTNQYGLSAAAKLKEIFPKLEFVTIPEYDTASGRLVQLWAPRVEGKDTATCGFTEKMRAHSIERYSSYFRQKKSAGTW 320 (336) T ss_pred ccCCCccCccHHHHHHHhcCccEEEEcccccccCcceEEEEEeeccCCcceeeecchhhhccceeecCceeEecccccee Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eeEEecccceeeeccC Q lcl|Aclame:pro 321 GAVIFRPFAVAQMIGV 336 (336) Q Consensus 321 Gv~ir~P~ai~~~~GI 336 (336) ||+||||+||+|++|| T Consensus 321 Gv~i~~P~ai~~~~GI 336 (336) T protein:vir:78 321 GAVIFRPFAVAQMIGV 336 (336) T ss_pred eeeeeccchheeeccC Confidence 9999999999999999 No 2 >protein:vir:106734 Length: 336 # NCBI annotation: gp13 # Family: family:all:1653 # MgeID: mge:1599 # MgeName: Bcep1 # Cross-refs: genbank:acc:NP_944321;genbank:gi:38638620;genbank:GeneID:2657363 Probab=100.00 E-value=1.1e-135 Score=760.57 Aligned_cols=336 Identities=99% Similarity=1.437 Sum_probs=334.0 Q ss_pred CchHHHHHHHhhcceeccchhhhhhhhhhhhhhhhhhhcCccccCCcchHHHHHHHhhCceeeeeeccccchhhhccccc Q lcl|Aclame:pro 1 MRDAQRIQNLARAGVILPRSVKNVSTPLAEYAMDAADLSPHLSSTGSSGIPNYLTTYVDPSVIDILVAPMKAAELVGESK 80 (336) Q Consensus 1 m~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~da~d~~~~l~t~~~~~i~~~l~~~idp~v~~~~~~~~~~~~l~~v~t 80 (336) |||++++++|+|+||+||+++++|+++++.|+|||+|++|+|+|++|+|||||||+||||++||++|+||++++||||+| T Consensus 1 ~~~~~~~~~l~~~gi~~~~~~~~~~~~~~~~a~da~d~~~~~~t~~~~g~~~~l~~~i~p~~~~~~~~~~~~~~l~~v~t 80 (336) T protein:vir:10 1 MRDAQRIQNLARAGVILPRSVKNVSTPLAEYAMDAADLSPHLSSTGSSGIPNYLTTYVDPSVIDILVAPMKAAELVGESK 80 (336) T ss_pred CchHHHHHHHhccCeecchhhhhhhHHHHHHHHhhhhhccccccCCCcchHHHHHhhcCcceeeeeechhchhhhccccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CCCcceeeEEEeeeecccceEEeecccCCceeeeeeeeeeeeEEEEEEEEEeCHHHHHHHHHhCCCHHHHHHHHHHHHHH Q lcl|Aclame:pro 81 KGDWTTLVAAFITAEPTTTVATYGDYSSDGDSGTNINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASELNYSSALGLA 160 (336) Q Consensus 81 ~g~w~~~t~~~~v~e~~G~a~~ygd~~DiP~vd~~~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~K~~aAr~a~e 160 (336) +|+|++++++|+++|.+|++++|||++|+|++|+++++++|++|++++||+||++|+++|+++|++|+++|+.+||+++| T Consensus 81 ~g~w~~~~~~~~~~e~~G~a~~ygd~~d~P~~d~~~~~~~~~v~~~~~g~~yg~~El~~A~~~g~~l~~~Ka~aA~~ale 160 (336) T protein:vir:10 81 KGDWTTLVAAFITAEPTTKVATYGDYSSDGDSGTNINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASELNYSSALGLA 160 (336) T ss_pred CCCcceeeEEEEeeeeeeeEEEccccCCCcceeeeeeeeeeeEEEEEEEEeeCHHHHHHHHHhCCCcHHHHHHHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HhhccEEEeeccccceEEEEecCCCCcccccccccccccCHHHHHHHHHHHHHHHHHHhCCceeccCCcEEEecHHHHHh Q lcl|Aclame:pro 161 KFLNGSYLFGVAGLENYGLINDPSLSAPITATTPWSGSPAVEAVVNEVVTLFQVLQTQSQGIITQEAVLHMGLPPTAMSD 240 (336) Q Consensus 161 ~~~n~i~~~Gd~~~g~~GllN~Pnl~~~~~~~t~w~~~~T~~eI~~Di~~l~~~l~~~t~g~v~~~~p~tL~Lp~~~~~~ 240 (336) +++|++|||||+++++||||||||||++++++++||++||+|||++||++++++|++||+|.+++|+|+||+||++++.+ T Consensus 161 ~~~N~~~~~Gd~~~~~~GllN~P~l~a~~t~~~~~w~~~T~~eI~~Di~~~~~~l~~qt~g~i~~~~~~tL~Lp~~~~~~ 240 (336) T protein:vir:10 161 KFLNGSYLFGVAGLENYGLINDPSLSAPITATTPWSGSPAVEAVVNEVVTLFQVLQTQSQGIITQEAVLHMGLPPTAMSD 240 (336) T ss_pred HhhCeEEEEeecccceEEEeecCCCCcccccCcCcccccCHHHHHHHHHHHHHHHHHhcCCeeeeccceEEEechHHHHh Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccCCCCCccHHHHHHHhCCccEEEEcccccCCCCceEEEEEEeeCCCceEEEEeCchhhcccceecCCceEEeeeccee Q lcl|Aclame:pro 241 LSKTNQYGLSAAAKLKEIFPKLEFVTIPEYDTASGRLVQLWAPRVEGKDTATCGFTEKMRAHSIERYSSYFRQKKSAGTW 320 (336) Q Consensus 241 Ls~~~~~~~Tvl~~l~~n~pnl~i~~~pel~~a~G~~~~~~~~~~~~~~~~~~~~p~~~~~l~~~~~~~~~~v~~~~rt~ 320 (336) |+++|++|+|+++|||+|||||+|+++|||++|||++++||++++++++++++++||+||+||+|+++++|+|||++||| T Consensus 241 L~~~n~~g~tv~~~lk~n~Pnl~i~t~pel~~Agg~~~~~~~~~~~~~~t~~~~~P~~f~~lpvq~~~~~~~v~~~~rt~ 320 (336) T protein:vir:10 241 LSKTNQYGLSAAAKLKEIFPKLEFVTIPEYDTASGRLVQLWAPRVEGKDTATCGFTEKMRAHSIERYSSYFRQKKSAGTW 320 (336) T ss_pred ccCCCccCccHHHHHHHhCCccEEEEcccccccCCceEEEEEecccCCcceeeecChhhhccceeecCceeEecccccee Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eeEEecccceeeeccC Q lcl|Aclame:pro 321 GAVIFRPFAVAQMIGV 336 (336) Q Consensus 321 Gv~ir~P~ai~~~~GI 336 (336) ||+||||+||+|++|| T Consensus 321 Gv~i~rP~ai~~~~GI 336 (336) T protein:vir:10 321 GAVIFRPFAVAQMLGV 336 (336) T ss_pred eeeeeccchheeeccC Confidence 9999999999999999 No 3 >protein:vir:3643 Length: 336 # NCBI annotation: gp12 # Family: family:all:1653 # MgeID: mge:75 # MgeName: Bcep781 # Cross-refs: genbank:acc:NP_705638;genbank:gi:23752323;genbank:GeneID:955719 Probab=100.00 E-value=1e-134 Score=755.22 Aligned_cols=336 Identities=97% Similarity=1.421 Sum_probs=333.8 Q ss_pred CchHHHHHHHhhcceeccchhhhhhhhhhhhhhhhhhhcCccccCCcchHHHHHHHhhCceeeeeeccccchhhhccccc Q lcl|Aclame:pro 1 MRDAQRIQNLARAGVILPRSVKNVSTPLAEYAMDAADLSPHLSSTGSSGIPNYLTTYVDPSVIDILVAPMKAAELVGESK 80 (336) Q Consensus 1 m~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~da~d~~~~l~t~~~~~i~~~l~~~idp~v~~~~~~~~~~~~l~~v~t 80 (336) |||++++++|+|+||+||+++.+++.+...|+|||+|++|+|+|++|+|||||||+||||++||++|+||++++||||+| T Consensus 1 ~~~~~~~~~l~~~gi~~~~~~~~~~~~~~~~~~da~d~~~~~~~~~~~~~~~~l~~~i~p~~~~~~~~~~~~~~l~pv~t 80 (336) T protein:vir:36 1 MRDAQRIQNLARAGVILPRSVQNVSTPLTEYAMDAADLSPHLSSTGSSGIPNYLTTYVDPSVIDILVAPMKAAELVGESK 80 (336) T ss_pred CchHHHHHHHhhcCeeecchhhhhhhHHHHhhhhhhhccCccccCCCcchHHHHHHhhccceEeeecchhhhhhhccccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CCCcceeeEEEeeeecccceEEeecccCCceeeeeeeeeeeeEEEEEEEEEeCHHHHHHHHHhCCCHHHHHHHHHHHHHH Q lcl|Aclame:pro 81 KGDWTTLVAAFITAEPTTTVATYGDYSSDGDSGTNINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASELNYSSALGLA 160 (336) Q Consensus 81 ~g~w~~~t~~~~v~e~~G~a~~ygd~~DiP~vd~~~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~K~~aAr~a~e 160 (336) +|+|++++++|+++|.+|+|++|||++|+|++|+++++++|+++++++||+||++|+++|+++|++|.++|+.+||+++| T Consensus 81 ~g~W~~~~~~~~~~e~~G~a~~ygd~~D~P~~d~~~~~~~~~v~~~~~g~~yg~~E~~~Aa~~~~~l~~~Ka~aA~~ale 160 (336) T protein:vir:36 81 KGDWTTLVAAFITAEPTTKVATYGDYSSDGDSGANINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASELNYSSALGLA 160 (336) T ss_pred cCCccceeEEEeeeeceeeEEEeeccCCCceeecccceeeeeEEEEEeeeeeCHHHHHHHHHhCCCcHHHHHHHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HhhccEEEeeccccceEEEEecCCCCcccccccccccccCHHHHHHHHHHHHHHHHHHhCCceeccCCcEEEecHHHHHh Q lcl|Aclame:pro 161 KFLNGSYLFGVAGLENYGLINDPSLSAPITATTPWSGSPAVEAVVNEVVTLFQVLQTQSQGIITQEAVLHMGLPPTAMSD 240 (336) Q Consensus 161 ~~~n~i~~~Gd~~~g~~GllN~Pnl~~~~~~~t~w~~~~T~~eI~~Di~~l~~~l~~~t~g~v~~~~p~tL~Lp~~~~~~ 240 (336) +++|++|||||+++++|||||||||++.++++++||+++|+|||++||++++++|++||+|.++.|+|+||+||++++.+ T Consensus 161 ~~~N~i~~~Gd~~~~~yGllNdP~l~a~~t~~t~~~~~~t~~ei~~Di~~~~~~l~~qt~G~i~~~~~~tL~LP~~~~~~ 240 (336) T protein:vir:36 161 KFLNGSYLFGVAGLENYGLINDPSLSAPITATTPWSGSPAVEAVVNEVVALFQVLQTQSQGIITQEDVLRMGLPPTAMSD 240 (336) T ss_pred HhhCcEEEEeccccceEEEEecCCCccccccCCCcccccCHHHHHHHHHHHHHHHHHhcCCeeeeccccEEEechHHHHh Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccCCCCCccHHHHHHHhCCccEEEEcccccCCCCceEEEEEEeeCCCceEEEEeCchhhcccceecCCceEEeeeccee Q lcl|Aclame:pro 241 LSKTNQYGLSAAAKLKEIFPKLEFVTIPEYDTASGRLVQLWAPRVEGKDTATCGFTEKMRAHSIERYSSYFRQKKSAGTW 320 (336) Q Consensus 241 Ls~~~~~~~Tvl~~l~~n~pnl~i~~~pel~~a~G~~~~~~~~~~~~~~~~~~~~p~~~~~l~~~~~~~~~~v~~~~rt~ 320 (336) |+++|++|+||++|||+|||||+|+++|||++|+|++++||++++++++++++++||+||+||+|+++++|+|||++||| T Consensus 241 Ls~~n~~g~Tvl~~lk~n~Pnl~i~t~pEl~~a~g~~~~l~~~~~~~~~t~~~~~p~~~~~l~vq~~~~~~~v~~~~rt~ 320 (336) T protein:vir:36 241 LSKTNQYGLAAAAKLKDIFPKLEFVTIPEYDTASGRLVQLWAPRVEGKDTATCGFTEKMRAHSIERYSSYFRQKKSAGTW 320 (336) T ss_pred ccCCCccCccHHHHHHHhcCccEEEEccccccCCCceEEEEEEecCCCcceeeecchhhhccceeecCceeEecccccee Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eeEEecccceeeeccC Q lcl|Aclame:pro 321 GAVIFRPFAVAQMIGV 336 (336) Q Consensus 321 Gv~ir~P~ai~~~~GI 336 (336) ||+||||+||++++|| T Consensus 321 Gv~i~~P~ai~~~~GI 336 (336) T protein:vir:36 321 GAVIFRPFAVAQMIGV 336 (336) T ss_pred eeeeeccchheeeecC Confidence 9999999999999999 No 4 >protein:vir:101557 Length: 336 # NCBI annotation: gp12 # Family: family:all:1653 # MgeID: mge:1477 # MgeName: Bcep43 # Cross-refs: genbank:acc:NP_958117;genbank:gi:41057663;genbank:GeneID:2716814 Probab=100.00 E-value=9.8e-135 Score=755.28 Aligned_cols=336 Identities=97% Similarity=1.420 Sum_probs=333.8 Q ss_pred CchHHHHHHHhhcceeccchhhhhhhhhhhhhhhhhhhcCccccCCcchHHHHHHHhhCceeeeeeccccchhhhccccc Q lcl|Aclame:pro 1 MRDAQRIQNLARAGVILPRSVKNVSTPLAEYAMDAADLSPHLSSTGSSGIPNYLTTYVDPSVIDILVAPMKAAELVGESK 80 (336) Q Consensus 1 m~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~da~d~~~~l~t~~~~~i~~~l~~~idp~v~~~~~~~~~~~~l~~v~t 80 (336) |||++++++|+|+||+||+++.+|+.+...|+|||+|++|+|+|++|+|||||||+||||++||++|+||++++||||+| T Consensus 1 ~~~~~~~~~l~~~gi~~~~~~~~~~~~~~~~~~da~d~~~~~~~~~~~~i~~~l~~~i~p~~~~~~~~p~~a~~l~pv~t 80 (336) T protein:vir:10 1 MRDAQRIQNLARAGVILPRSVQNVSTPLTEYAMDAADLSPHLSSTGSSGIPNYLTTYVDPAVIDILVAPMKAAELVGESK 80 (336) T ss_pred CchHHHHHHHhhcCeeecchhhhhhhhHHHhhhhhhhccCccccCCCchhHHHHHhhcccceeeehhhhhhhhhhccccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred CCCcceeeEEEeeeecccceEEeecccCCceeeeeeeeeeeeEEEEEEEEEeCHHHHHHHHHhCCCHHHHHHHHHHHHHH Q lcl|Aclame:pro 81 KGDWTTLVAAFITAEPTTTVATYGDYSSDGDSGTNINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASELNYSSALGLA 160 (336) Q Consensus 81 ~g~w~~~t~~~~v~e~~G~a~~ygd~~DiP~vd~~~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~K~~aAr~a~e 160 (336) +|+|++++++|+++|.+|+|++|||++|+|++|+++++++|+++++++||+||++|+++|+++|++|+++|+.+||+++| T Consensus 81 ~g~W~~~~~~~~~~e~~G~a~~ygd~~D~P~~d~~~~~~~~~v~~~~~g~~yg~~El~~A~~~g~~l~~~Ka~aA~~ale 160 (336) T protein:vir:10 81 KGDWTTLVAAFITAEPTTKVATYGDYSSDGDSGANINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASELNYSSALGLA 160 (336) T ss_pred cCCccceeEEEeeeeceeeEEEeeccCCCceeecccceeeeeEEEEEeeeeeCHHHHHHHHHhCCCcHHHHHHHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HhhccEEEeeccccceEEEEecCCCCcccccccccccccCHHHHHHHHHHHHHHHHHHhCCceeccCCcEEEecHHHHHh Q lcl|Aclame:pro 161 KFLNGSYLFGVAGLENYGLINDPSLSAPITATTPWSGSPAVEAVVNEVVTLFQVLQTQSQGIITQEAVLHMGLPPTAMSD 240 (336) Q Consensus 161 ~~~n~i~~~Gd~~~g~~GllN~Pnl~~~~~~~t~w~~~~T~~eI~~Di~~l~~~l~~~t~g~v~~~~p~tL~Lp~~~~~~ 240 (336) +++|++|||||+++++|||||||||+++++++++||+++|+|||++||++++++|+.||+|.++.|+|+||+||++++.+ T Consensus 161 ~~~N~i~~~Gd~~~~~yGllN~P~l~a~~t~~t~~~~~~t~eei~~Di~~~~~~l~~qs~G~i~~~~~~tL~LP~~~~~~ 240 (336) T protein:vir:10 161 KFLNGSYLFGVAGLENYGLINDPSLSAPITATTPWSGSPAVEAVVNEVVALFQVLQTQSQGIITQEDVLRMGLPPTAMSD 240 (336) T ss_pred HhhCcEEEEeccccceEEEEeCCCCccccccCCCcccccCHHHHHHHHHHHHHHHHHhcCCeecccCcceEEecHHHHHh Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccCCCCCccHHHHHHHhCCccEEEEcccccCCCCceEEEEEEeeCCCceEEEEeCchhhcccceecCCceEEeeeccee Q lcl|Aclame:pro 241 LSKTNQYGLSAAAKLKEIFPKLEFVTIPEYDTASGRLVQLWAPRVEGKDTATCGFTEKMRAHSIERYSSYFRQKKSAGTW 320 (336) Q Consensus 241 Ls~~~~~~~Tvl~~l~~n~pnl~i~~~pel~~a~G~~~~~~~~~~~~~~~~~~~~p~~~~~l~~~~~~~~~~v~~~~rt~ 320 (336) |+++|++|+||++|||+|||||+|+++|||++++|++++||++++++++++++++||+||+||+|+++++|+|||++||| T Consensus 241 Ls~~n~~g~Tvl~~lk~n~Pnl~i~t~pEl~~a~G~~~~l~~~~~~~~~t~~~~~p~~~~~l~vq~~~~~~~v~~~~rt~ 320 (336) T protein:vir:10 241 LSKTNQYGLAAAAKLKDIFPKLEFVTIPEYDTASGRLVQLWAPRVEGKDTATCGFTEKMRAHSIERYSSYFRQKKSAGTW 320 (336) T ss_pred ccCCCccCccHHHHHHHhcCccEEEEccccccCCCceEEEEEEecCCCcceeeecchhhhccceeecCceeEecccccee Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eeEEecccceeeeccC Q lcl|Aclame:pro 321 GAVIFRPFAVAQMIGV 336 (336) Q Consensus 321 Gv~ir~P~ai~~~~GI 336 (336) ||+||||+||+|++|| T Consensus 321 Gv~i~~P~ai~~~~GI 336 (336) T protein:vir:10 321 GAVIFRPFAVAQMIGV 336 (336) T ss_pred eeeeeccchheeeecC Confidence 9999999999999999 No 5 >protein:vir:94070 Length: 339 # NCBI annotation: putative structural protein # Family: family:all:1653 # MgeID: mge:1493 # MgeName: OP2 # Cross-refs: genbank:acc:YP_453625;genbank:gi:84662661;genbank:GeneID:5142580 Probab=100.00 E-value=1.3e-122 Score=688.90 Aligned_cols=335 Identities=48% Similarity=0.833 Sum_probs=324.9 Q ss_pred CchHHHHHHHhhcceeccc-hhhhhhhhhhhhhhhhhhhcCccccCCcchHHHHHHHhhCceeeeeeccccchhhhcccc Q lcl|Aclame:pro 1 MRDAQRIQNLARAGVILPR-SVKNVSTPLAEYAMDAADLSPHLSSTGSSGIPNYLTTYVDPSVIDILVAPMKAAELVGES 79 (336) Q Consensus 1 m~~~~~~~~l~~~g~~~~~-~~~~~~~~~~~~~~da~d~~~~l~t~~~~~i~~~l~~~idp~v~~~~~~~~~~~~l~~v~ 79 (336) =.|.+++++|+|+||+||+ ..+.++.+...|||||+|++|.++|..|++||+++++||||+|||++|+++++++|||++ T Consensus 4 ~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~a~d~~~~~~~~~~~~~~~i~a~~~~~i~~~vy~~~~~~~~~~~l~pv~ 83 (339) T protein:vir:94 4 NNDRTDIKQLEKVGIIFDGYSPKSISSEVSAYAMDAVNLTPTLQTTANAGIPAWMTTFVDRRVIDIQLAPMAAAKIFPEV 83 (339) T ss_pred echHHHHHHHHhhceeeccchhhhcchhhHhhhccccccccccccccccchhhhhhhhhchhheeecccccchhhhcccc Confidence 4588999999999999995 455578899999999999999999999999999999999999999999999999999999 Q ss_pred cCCCcceeeEEEeeeecccceEEeecccCCceeeeeeeeeeeeEEEEEEEEEeCHHHHHHHHHhCCCHHHHHHHHHHHHH Q lcl|Aclame:pro 80 KKGDWTTLVAAFITAEPTTTVATYGDYSSDGDSGTNINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASELNYSSALGL 159 (336) Q Consensus 80 t~g~w~~~t~~~~v~e~~G~a~~ygd~~DiP~vd~~~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~K~~aAr~a~ 159 (336) |+|+|++++++|+++|.+|+|++|||++|+|++|+++++++|+++++++||+|+++|+++|+++|++|+++|+.+||+++ T Consensus 84 t~g~w~~~t~~y~~~e~~G~a~~ygd~ad~Pl~~~~v~~~~~~v~~~~~g~~y~~~E~~~A~~~g~~l~~~Ka~aA~~al 163 (339) T protein:vir:94 84 KKGDWTTTYGVFIIAEPVGQVATYSDWSANGMSKANVNFESRQNYRYQTWTEYGDLEMATYGEAGIDYVARQEISASLVM 163 (339) T ss_pred cCCCCcccEEEEeeeecccceEEcccccCCCcccccceeeEEeEEEEEEEEeecHHHHHHHHhhCCChHHHHHHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHhhccEEEeeccccceEEEEecCCCCcccccccccccccCHHHHHHHHHHHHHHHHHHhCCceeccCCcEEEecHHHHH Q lcl|Aclame:pro 160 AKFLNGSYLFGVAGLENYGLINDPSLSAPITATTPWSGSPAVEAVVNEVVTLFQVLQTQSQGIITQEAVLHMGLPPTAMS 239 (336) Q Consensus 160 e~~~n~i~~~Gd~~~g~~GllN~Pnl~~~~~~~t~w~~~~T~~eI~~Di~~l~~~l~~~t~g~v~~~~p~tL~Lp~~~~~ 239 (336) |+++|+++|||++++++||||||||+++.++++++ |+++|+|||++||++++++|+.||+|.+++++|+||+|||+++. T Consensus 164 ~~~~N~i~~~Gd~~~~~~GLlN~P~l~~~v~~s~~-Wa~kT~~eI~~Di~~~~~~l~~~s~g~~~~~~~~~L~LP~~~~~ 242 (339) T protein:vir:94 164 AKFANSSYLLGVAGIANYGLMNDPSLPAPVAATVN-WATAAPEDIANDVVAMVGRLISQSGGLITGQERMVMALAPSALN 242 (339) T ss_pred HHhhceEEeeeecccceEEEEeCCCccccccCCCC-cccCCHHHHHHHHHHHHHHHHHhcCCeeeeccCcEEEecHHHHH Confidence 99999999999999999999999999987776665 56779999999999999999999999999999999999999999 Q ss_pred hcccCCCCCccHHHHHHHhCCccEEEEcccccCCCCceEEEEEEeeCCCceEEEEeCchhhcccceecCCceEEeeecce Q lcl|Aclame:pro 240 DLSKTNQYGLSAAAKLKEIFPKLEFVTIPEYDTASGRLVQLWAPRVEGKDTATCGFTEKMRAHSIERYSSYFRQKKSAGT 319 (336) Q Consensus 240 ~Ls~~~~~~~Tvl~~l~~n~pnl~i~~~pel~~a~G~~~~~~~~~~~~~~~~~~~~p~~~~~l~~~~~~~~~~v~~~~rt 319 (336) +|+++|++|+|+++|||+|||||+|+++|||+++||++.+||+++++++++++++||||||+||+|+++++|+|||++|| T Consensus 243 ~L~~~n~~~~Tvl~~lk~n~pnl~i~~~~el~~a~g~~~~~~~~~~~~~~~~~~~~p~~~~~lpvq~~~~~~~v~~~~rt 322 (339) T protein:vir:94 243 NVNRTNNFGLSAGAKIAQTYPNIQFVAVPEFDTASGRLVQLWVPEVNGQPTGEVAFAEKLRSHSIERYSTTTRQKHSGAT 322 (339) T ss_pred hcccCCcCCccHHHHHHHhcCCcEEEEccccccCCCceEEEEEEeccCCcceEEEcchhhhccccEEcCceEEecceeee Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eeeEEecccceeeeccC Q lcl|Aclame:pro 320 WGAVIFRPFAVAQMIGV 336 (336) Q Consensus 320 ~Gv~ir~P~ai~~~~GI 336 (336) |||+||||+||+|++|| T Consensus 323 ~Gv~i~~P~ai~~~~GI 339 (339) T protein:vir:94 323 FGAVIYQPWAVTQELGV 339 (339) T ss_pred eeEEEEccceeeeeecC Confidence 99999999999999999 No 6 >protein:vir:107732 Length: 379 # NCBI annotation: gp23 # Family: family:all:1653 # MgeID: mge:1520 # MgeName: BcepB1A # Cross-refs: genbank:acc:YP_024871;genbank:gi:48697513;genbank:GeneID:2948349 Probab=100.00 E-value=4.3e-118 Score=664.03 Aligned_cols=334 Identities=23% Similarity=0.338 Sum_probs=310.3 Q ss_pred CchHH--HHHHHhhcceeccchhhhhhhhhhhhhhhhhhhcC------ccccCCcchHHHHHHHhhCceeeeeeccccch Q lcl|Aclame:pro 1 MRDAQ--RIQNLARAGVILPRSVKNVSTPLAEYAMDAADLSP------HLSSTGSSGIPNYLTTYVDPSVIDILVAPMKA 72 (336) Q Consensus 1 m~~~~--~~~~l~~~g~~~~~~~~~~~~~~~~~~~da~d~~~------~l~t~~~~~i~~~l~~~idp~v~~~~~~~~~~ 72 (336) -.|++ ++++|+|+||+||+....+. +...+||||+|.+| +|+|++|+|||||||+|+ |+++|++++||++ T Consensus 23 ~~~~~~~~~~~l~~~gi~~~~~~~~~~-~~~~~amd~~~~~~~~~~~~~l~~~~~~g~~~~l~~~~-p~~i~~~tap~~a 100 (379) T protein:vir:10 23 SADVTLDNLKHLESYGIHLNGRKNKLF-ELMQFAMDSNDIGPIPTPLSPLSPVSIPGLIQFLQNWL-PGHVRILTAVREA 100 (379) T ss_pred cccccHHHHHHHHhcCccccchhhhhh-hhhhhhhccccccccccccCccccccccchHHHHHhhc-chHHHHHhhhhhh Confidence 23444 88999999999997654433 45667999999995 888999999999999999 9999999999999 Q ss_pred hhhcccccCCCcceeeEEEeeeecccceEEeecccCCceeeeeeeeeeeeEEEEEEEEEeCHHHHHHHHHhCCCHHHHHH Q lcl|Aclame:pro 73 AELVGESKKGDWTTLVAAFITAEPTTTVATYGDYSSDGDSGTNINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASELN 152 (336) Q Consensus 73 ~~l~~v~t~g~w~~~t~~~~v~e~~G~a~~ygd~~DiP~vd~~~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~K~ 152 (336) +|||||+|+|+|++++++|+++|.+|+|++|||++|+|++|+++++++|++|++++||+||++|+++|+++|++|+++|+ T Consensus 101 ~~l~pv~t~g~W~~~~~~~~v~e~~G~A~~ygd~~d~pl~d~~~~~~~r~v~~~~~g~~yg~~El~~Aa~~g~~l~~~Ka 180 (379) T protein:vir:10 101 DEFLGLSTVGQWDDEQIVQRVLEGLGTAQPYTDGGNMALMSWTPTFETRTVVRFEAGLQVAPLEEARSSRVQVSSADEKR 180 (379) T ss_pred hhhcccccCCCceeeeEEEeeeeeeeeeEEeccccCCCeeeeeeeeeeeeeEEEEEEEeecHHHHHHHHHhCCChHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHhhccEEEee--ccccceEEEEecCCCCccccccc-----ccccccCHHHHHHHHHHHHHHHHHHhCCceec Q lcl|Aclame:pro 153 YSSALGLAKFLNGSYLFG--VAGLENYGLINDPSLSAPITATT-----PWSGSPAVEAVVNEVVTLFQVLQTQSQGIITQ 225 (336) Q Consensus 153 ~aAr~a~e~~~n~i~~~G--d~~~g~~GllN~Pnl~~~~~~~t-----~w~~~~T~~eI~~Di~~l~~~l~~~t~g~v~~ 225 (336) .+||+++|+++|+++||| |+++++|||||||||++.+++++ +.|++||+|||++||++++++++.||+|.+++ T Consensus 181 ~aA~~ale~~~N~i~f~G~~d~~~~~yGllNdP~l~a~~t~atg~~~~t~Wa~kT~~eI~~Di~~~~~~l~~qs~g~~~~ 260 (379) T protein:vir:10 181 AMVGEALEVQRNRVAFYGYNDGSGRTFGFLNDPNLPAYVAVPNGAGGSPLWAQKTTLEIIADLRNGLTALQVQSMGRIKS 260 (379) T ss_pred HHHHHHHHHhhceEEEEeecCCCcceEEEEeCCCCcccccccCCcccccccccCCHHHHHHHHHHHHHHHHHhhCCeecc Confidence 999999999999999999 67999999999999998776643 45788899999999999999999999999877 Q ss_pred c-CCcEEEecHHHHHhcccCCCCCccHHHHHHHhCCccEEEEcccccCCCCc--eEEEEEEeeCCCc-----eEEEEeCc Q lcl|Aclame:pro 226 E-AVLHMGLPPTAMSDLSKTNQYGLSAAAKLKEIFPKLEFVTIPEYDTASGR--LVQLWAPRVEGKD-----TATCGFTE 297 (336) Q Consensus 226 ~-~p~tL~Lp~~~~~~Ls~~~~~~~Tvl~~l~~n~pnl~i~~~pel~~a~G~--~~~~~~~~~~~~~-----~~~~~~p~ 297 (336) + .|++|+|||+++.+|+++|++|+||++|||+|||||+|+++|||++++|+ .++||++++++++ ++.++||| T Consensus 261 ~~~~~tL~LP~~~~~~L~~~n~~g~Tvl~~lk~n~Pnl~i~t~pEL~~aggg~~~~~~~~~~~~~~~t~~~~~~~~~~p~ 340 (379) T protein:vir:10 261 NKTPITIGIPNAYENYITTPTELGYSVAQYMRESYPNVTFVSAPELNDANGGSSAIYYYADAVENNGTDDGRTWLQVVPT 340 (379) T ss_pred cccceeEEecHHHHHhhccccccCccHHHHHHHhcCCcEEEEcccccccCCCccEEEEEeeccCCCccCCcceEEEecch Confidence 4 79999999999999999999999999999999999999999999998664 6789998877654 46799999 Q ss_pred hhhcccceecCCceEEeeecceeeeEEecccceeeeccC Q lcl|Aclame:pro 298 KMRAHSIERYSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) Q Consensus 298 ~~~~l~~~~~~~~~~v~~~~rt~Gv~ir~P~ai~~~~GI 336 (336) |||+||+|++.++|+|||++|||||+||||+||+|++|- T Consensus 341 k~~~l~ve~~~~~~~~~~~~rt~Gv~ir~P~Ai~~~~G~ 379 (379) T protein:vir:10 341 KMFTLGVEKKIKGYAEGYTNATAGAMLKRPFATYRQTGA 379 (379) T ss_pred hhhhccceecCceeEeccccceeeeeeecchhhheecCC Confidence 999999999999999999999999999999999999999 No 7 >protein:vir:99576 Length: 388 # NCBI annotation: hypothetical protein # Family: family:all:1653 # MgeID: mge:1544 # MgeName: BcepF1 # Cross-refs: genbank:acc:YP_001039801;genbank:gi:126011051;genbank:GeneID:4818271 Probab=100.00 E-value=4e-115 Score=647.76 Aligned_cols=333 Identities=24% Similarity=0.353 Sum_probs=304.4 Q ss_pred CchHHHHHHHhhcceeccchhhhhhh------hhhhhhhhhhhhcCccccCCcchHHHHHHHhhCceeeeeeccccchhh Q lcl|Aclame:pro 1 MRDAQRIQNLARAGVILPRSVKNVST------PLAEYAMDAADLSPHLSSTGSSGIPNYLTTYVDPSVIDILVAPMKAAE 74 (336) Q Consensus 1 m~~~~~~~~l~~~g~~~~~~~~~~~~------~~~~~~~da~d~~~~l~t~~~~~i~~~l~~~idp~v~~~~~~~~~~~~ 74 (336) |. -..+++|+|+||+||++..++.. .++.+||||++.+| .|++|.|||++|++|+||++||++++||++++ T Consensus 30 ~~-~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~a~da~~~~~--~t~~~~gip~~~~~~~~p~~~~~~~~p~~~~~ 106 (388) T protein:vir:99 30 LT-DMAVRELKKFGLVFDHATVKRQIELLHEGGVATQAFDSAYVAP--TTQASIPTPIQFLQQWLPGFVKVLTSARKIDE 106 (388) T ss_pred ee-chhhHhhhhcceeccCccchhhhhhhhhhhhhhcccCcccccc--cccCcccHHHHHhhhhccceeeeeechhhhhh Confidence 32 35678899999999997655433 23455677655555 58899999999999999999999999999999 Q ss_pred hcccccCCCcceeeEEEeeeecccceEEeecccCCceeeeeeeeeeeeEEEEEEEEEeCHHHHHHHHHhCCCHHHHHHHH Q lcl|Aclame:pro 75 LVGESKKGDWTTLVAAFITAEPTTTVATYGDYSSDGDSGTNINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASELNYS 154 (336) Q Consensus 75 l~~v~t~g~w~~~t~~~~v~e~~G~a~~ygd~~DiP~vd~~~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~K~~a 154 (336) ||||+|+|+|++++++|+++|.+|+|++|||++|+|++|+++++++|++|++++||+||++|+++|+++|++|+++|+.+ T Consensus 107 l~pv~t~g~W~~~~~~f~v~e~~G~A~~ygd~~D~Pl~d~~~~~~~r~v~~~~~g~~yg~~El~~A~~~g~~l~~~Ka~A 186 (388) T protein:vir:99 107 ILGVKTVGSWEDQEIVQGIVEPAGTAMEYGDLTNIPLSSWNVNFERRTIVRGEMGIQVGLLEEGRASAMRINSAEVKRQG 186 (388) T ss_pred hccccccCCccceeEEEeeeecceeEEEeecccCCCceeccceeeeeeEEEEEeeeeecHHHHHHHHhhCCCcHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHhhccEEEeeccc---cceEEEEecCCCCccccccc----ccccccCHHHHHHHHHHHHHHHHHHhCCceecc- Q lcl|Aclame:pro 155 SALGLAKFLNGSYLFGVAG---LENYGLINDPSLSAPITATT----PWSGSPAVEAVVNEVVTLFQVLQTQSQGIITQE- 226 (336) Q Consensus 155 Ar~a~e~~~n~i~~~Gd~~---~g~~GllN~Pnl~~~~~~~t----~w~~~~T~~eI~~Di~~l~~~l~~~t~g~v~~~- 226 (336) ||+++|+++|+++|||+++ .++|||||||||++.+++++ +.|++||+|||++||++++++|++||+|.++++ T Consensus 187 A~~ale~~~N~i~f~G~~g~~~~~~yGllNdP~l~a~v~at~~~~~~~Wa~kT~~eI~~Di~~~~~~i~~qs~g~~~~~~ 266 (388) T protein:vir:99 187 AAVQLEIMRNAIGFYGWEGKNGNRTFGFLNDPSLLPAIASTTPGGWVSGGANAFQGIVGDLRLMLITLRVQSEDNIDPED 266 (388) T ss_pred HHHHHHhhhceEEEEeecCCCccceEEEeeCCCcccccccccCCcCcccccCCHHHHHHHHHHHHHHHHHhcCCeeeecc Confidence 9999999999999999875 58999999999998776543 238888999999999999999999999999875 Q ss_pred CCcEEEecHHHHHhcccCCCCCccHHHHHHHhCCccEEEEcccccCCC----CceEEEEEEeeC--------CCceEEEE Q lcl|Aclame:pro 227 AVLHMGLPPTAMSDLSKTNQYGLSAAAKLKEIFPKLEFVTIPEYDTAS----GRLVQLWAPRVE--------GKDTATCG 294 (336) Q Consensus 227 ~p~tL~Lp~~~~~~Ls~~~~~~~Tvl~~l~~n~pnl~i~~~pel~~a~----G~~~~~~~~~~~--------~~~~~~~~ 294 (336) .|++|+||++++.+|+++|++|+||++|||+|||||+|+++|||++++ |..++|+.++++ +++++.++ T Consensus 267 ~~~tL~LP~~~~~~Ls~~n~~g~Tvl~~lk~n~Pnl~i~t~pEl~~a~~tgg~~~~~~~~~~~~~~~~~~~~~~~t~~~~ 346 (388) T protein:vir:99 267 VDITLVLPMNKVDMLSVVTDLGISVRDWLKQTYPRVRVMSAPELQGGNPDDGKDIAYMFLDSVDTAVDGSTDGGDTWAQL 346 (388) T ss_pred cceEEEechHHHHhccccCcCCccHHHHHHHhcCCcEEEEecccccccccCCceeEEEEecccccccccCccCcceeEEe Confidence 699999999999999999999999999999999999999999998763 456788887763 68899999 Q ss_pred eCchhhcccceecCCceEEeeecceeeeEEecccceeeeccC Q lcl|Aclame:pro 295 FTEKMRAHSIERYSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) Q Consensus 295 ~p~~~~~l~~~~~~~~~~v~~~~rt~Gv~ir~P~ai~~~~GI 336 (336) +||||++||+|+++++|+|||++|||||+||||+||+|++|| T Consensus 347 ~p~~~~~l~vq~~~~~~~~~~~~rt~Gv~ir~P~Ai~~~~GI 388 (388) T protein:vir:99 347 VQSKFVTLGVEKRVKNYVEAYSNATAGVMLKRPWAVVRLIGL 388 (388) T ss_pred cccccccccceecCceeEeccccceeeeEEeccchhheeccC Confidence 999999999999999999999999999999999999999999 No 8 >protein:vir:96079 Length: 382 # NCBI annotation: hypothetical protein ORF023 # Family: family:all:1653 # MgeID: mge:1597 # MgeName: F8 # Cross-refs: genbank:acc:YP_001294440;genbank:gi:149408337;genbank:GeneID:5237198 Probab=100.00 E-value=3.1e-112 Score=631.90 Aligned_cols=334 Identities=24% Similarity=0.355 Sum_probs=299.5 Q ss_pred CchH--HHHHHHhhcceeccchhhh--hhh------hhhhhhhhhhhhcCccccCCcchHHHHHHHhhCceeeeeecccc Q lcl|Aclame:pro 1 MRDA--QRIQNLARAGVILPRSVKN--VST------PLAEYAMDAADLSPHLSSTGSSGIPNYLTTYVDPSVIDILVAPM 70 (336) Q Consensus 1 m~~~--~~~~~l~~~g~~~~~~~~~--~~~------~~~~~~~da~d~~~~l~t~~~~~i~~~l~~~idp~v~~~~~~~~ 70 (336) +.++ +++++|+|+||+||++..+ +.. .+..+||||++.+| .|.+|.|||++|++|+||++||++|+|| T Consensus 21 ~~~~~~~~~~~l~~~gi~~~~~~~~~~~~~~~~~~~~~~~~amDa~~~~~--~t~~~~g~p~~~l~~~~p~~~~~~~~p~ 98 (382) T protein:vir:96 21 LKNVTHEAVAALGRIGLVFDHAVVQDQIKALAKAGAFRSGSAMDSNFTAP--VTTPSIPTPIQFLQTWLPGFVKVMTAAR 98 (382) T ss_pred hhcccHHHHHHHhccccccCcccchhHhhhhhhhhhhhhhcccccccCCc--cccCCccHHHHHHhhhhhhhhhhhhhhh Confidence 3444 6789999999999987421 122 22346888877776 4888999999999999999999999999 Q ss_pred chhhhcccccCCCcceeeEEEeeeecccceEEeecccCCceeeeeeeeeeeeEEEEEEEEEeCHHHHHHHHHhCCCHHHH Q lcl|Aclame:pro 71 KAAELVGESKKGDWTTLVAAFITAEPTTTVATYGDYSSDGDSGTNINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASE 150 (336) Q Consensus 71 ~~~~l~~v~t~g~w~~~t~~~~v~e~~G~a~~ygd~~DiP~vd~~~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~ 150 (336) ++++||||+|+|+|++++++|+++|.+|+|++|||++|+|++|+++++++|++|++++||+|+.+|+++|+++|++|.++ T Consensus 99 ~~~~l~pv~t~g~W~~~t~ty~~~e~~G~A~~ygd~~D~Pl~d~~~~~~~r~v~~~~~g~~yg~lE~~rAa~~~~~l~~~ 178 (382) T protein:vir:96 99 KIDEIIGIDTVGSWEDQEIVQGIVEPAGTAVEYGDHTNIPLTSWNANFERRTIVRGELGLLVGTLEEGRASAIRLNSAET 178 (382) T ss_pred hhhhhccccccCCccceEEEEeeeecccceEEeecccCCCccccccceeEEEEEEEEEeeeecHHHHHHHHhhCCCcHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHhhccEEEeec---cccceEEEEecCCCCcccccccccccccCHHHHHHHHHHHHHHHHHHhCCceecc- Q lcl|Aclame:pro 151 LNYSSALGLAKFLNGSYLFGV---AGLENYGLINDPSLSAPITATTPWSGSPAVEAVVNEVVTLFQVLQTQSQGIITQE- 226 (336) Q Consensus 151 K~~aAr~a~e~~~n~i~~~Gd---~~~g~~GllN~Pnl~~~~~~~t~w~~~~T~~eI~~Di~~l~~~l~~~t~g~v~~~- 226 (336) |+.+||+++|+++|+++|||+ .++++||||||||||+.+++++++|+++|+|||++||++++++|++||+|.++++ T Consensus 179 Ka~aA~~ale~~~N~i~f~G~~~g~~~~~yGllNdP~l~a~~t~a~~~Wa~kT~~eI~~Di~~l~~~i~~qt~G~~~~~~ 258 (382) T protein:vir:96 179 KRQQAAIGLEIFRNAIGFYGWQSGLGNRTYGFLNDPNLPPFQTPPSQGWATADWAGIIGDIREAVRQLRIQSQDQIDPKA 258 (382) T ss_pred HHHHHHHHHHHhhceEEEEeeecCcCcceEEEEeCCCcccccccCCCCcccccHHHHHHHHHHHHHHHHhccCCeeeecc Confidence 999999999999999999998 4589999999999999988889999999999999999999999999999999875 Q ss_pred CCcEEEecHHHHHhcccCCCCCccHHHHHHHhCCccEEEEcccccCCC--C----ceEEEEEEeeC----C----CceEE Q lcl|Aclame:pro 227 AVLHMGLPPTAMSDLSKTNQYGLSAAAKLKEIFPKLEFVTIPEYDTAS--G----RLVQLWAPRVE----G----KDTAT 292 (336) Q Consensus 227 ~p~tL~Lp~~~~~~Ls~~~~~~~Tvl~~l~~n~pnl~i~~~pel~~a~--G----~~~~~~~~~~~----~----~~~~~ 292 (336) .|++|+||++++.+|+++|++|+||++|||+|||||+|+++|||++++ | ..++++.++++ + +..+. T Consensus 259 ~~~~L~LP~~~~~~Ls~~n~~g~Tvl~~lk~n~Pnl~i~t~peL~~a~~~g~g~~~~~~~~~~e~~~~~~~s~~~p~~f~ 338 (382) T protein:vir:96 259 EKITMALATSKVDYLSVTTPYGISVSDWIEQTYPKMRIVSAPELSGVQMQGKTPEDALVLFVEEVDASVDGSTDGGSVFS 338 (382) T ss_pred cceEEeechHHHhhccccCccCccHHHHHHHhcCCcEEEEccccccccCCCccceeEEEEecchhhhhcccccccCccee Confidence 599999999999999999999999999999999999999999998762 2 22345555542 2 33444 Q ss_pred EEeCchhhcccceecCCceEEeeecceeeeEEecccceeeeccC Q lcl|Aclame:pro 293 CGFTEKMRAHSIERYSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) Q Consensus 293 ~~~p~~~~~l~~~~~~~~~~v~~~~rt~Gv~ir~P~ai~~~~GI 336 (336) +.+|++++.+|+|++.++|++||++|||||+||||+||+|++|| T Consensus 339 q~~p~~~~~l~ve~~~~~~~~~~s~~t~Gv~i~~P~ai~~~~GI 382 (382) T protein:vir:96 339 QLVQSKFITLGVEKRAKSYVEDFSNGTAGALCKRPWAVVRYLGI 382 (382) T ss_pred ccccceeeeccceeecceeEeccccceeeeEEEcchhhhhccCC Confidence 55677888899999999999999999999999999999999999 No 9 >protein:vir:79642 Length: 329 # NCBI annotation: HsbB # Family: family:all:463 # MgeID: mge:1872 # MgeName: TLS # Cross-refs: genbank:acc:YP_001285525;genbank:gi:148734508;genbank:GeneID:5220000 Probab=100.00 E-value=5.4e-92 Score=520.96 Aligned_cols=319 Identities=11% Similarity=0.088 Sum_probs=283.9 Q ss_pred hcceeccchhhhhhhhhhhhhhhhhhhcCccccCCcchHHHHHH---HhhCceeeeeeccccchhhhcccccCCCcceee Q lcl|Aclame:pro 12 RAGVILPRSVKNVSTPLAEYAMDAADLSPHLSSTGSSGIPNYLT---TYVDPSVIDILVAPMKAAELVGESKKGDWTTLV 88 (336) Q Consensus 12 ~~g~~~~~~~~~~~~~~~~~~~da~d~~~~l~t~~~~~i~~~l~---~~idp~v~~~~~~~~~~~~l~~v~t~g~w~~~t 88 (336) -.|..+.. ++...-+-.+.-|+++++.+.+..+.++++|++ ++|||++||+++++++++++||++++++|++++ T Consensus 1 ~~~~~~~~---~~~~d~~~~~~~a~~~~~~~~~~~~~~~~~f~~~ql~~id~~v~e~~~~~l~~~~~i~i~~~~~~~~~~ 77 (329) T protein:vir:79 1 MRGNIMSK---EMKYDEFEANVIANHMQLRGAKNDASDMGIWTSQELHKIKAQAYEKEYPAGSALRVFPVTSELSDTDKT 77 (329) T ss_pred Cccchhhh---hhccchhhhhhHhhhcccccceeccchhhHHHHHHHHHHHHHHHhhhhcccchhhhcccccCCCCceeE Confidence 22443322 222221222334556777788888898999998 899999999999999999999999999999999 Q ss_pred EEEeeeecccceEEeecc-cCCceeeeeeeeeeeeEEEEEEEEEeCHHHHHHHHHhCCCHHHHHHHHHHHHHHHhhccEE Q lcl|Aclame:pro 89 AAFITAEPTTTVATYGDY-SSDGDSGTNINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASELNYSSALGLAKFLNGSY 167 (336) Q Consensus 89 ~~~~v~e~~G~a~~ygd~-~DiP~vd~~~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~K~~aAr~a~e~~~n~i~ 167 (336) ++|+++|.+|++++|||+ +|+|++|++++++++++++++.+|+|+++|+++|+++|++|+++|+.+|++++++++|+++ T Consensus 78 ~t~~~~~~~G~a~~~~d~~~dip~vd~~~~~~~~~i~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aA~~~~~~~~n~i~ 157 (329) T protein:vir:79 78 FEYQTFDKVGHAKIIADYTDDLSTVDALMTSEFGKVFRLGNAFLISIDEIKAGQRTGKSLSTRKANAAQNAHDQLVNHLV 157 (329) T ss_pred EEeeeeecceeeeeecCcccccceeecccceeEEEEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhccEE Confidence 999999999999999997 5779999999999999999999999999999999999999999999999999999999999 Q ss_pred EeeccccceEEEEecCCCCcccccc--cccccccCHHHHHHHHHHHHHHHHHHhCCceeccCCcEEEecHHHHHhcccC- Q lcl|Aclame:pro 168 LFGVAGLENYGLINDPSLSAPITAT--TPWSGSPAVEAVVNEVVTLFQVLQTQSQGIITQEAVLHMGLPPTAMSDLSKT- 244 (336) Q Consensus 168 ~~Gd~~~g~~GllN~Pnl~~~~~~~--t~w~~~~T~~eI~~Di~~l~~~l~~~t~g~v~~~~p~tL~Lp~~~~~~Ls~~- 244 (336) |+|++++++|||||||++++..+++ ++.|+++|++||++||++++++++.+|+|. +.|++|+|||+++.+|+++ T Consensus 158 f~G~~~~g~~GLlN~p~v~~~~~~~~~~~~w~~kt~~ei~~di~~~~~~l~~~s~g~---~~p~~L~Lpp~~~~~L~~~~ 234 (329) T protein:vir:79 158 FKGSKPHKIISVFEHPNLTTINSAGWNNAAGTGKKPETAQDELEQAIEKIETLTNGQ---HRANMILIPPSMRKVLMVRM 234 (329) T ss_pred EeecccccceeeecCCCccccccCCCCCccccccCHHHHHHHHHHHHHHHHHhcCce---ecccEEEecHHHHHHhhccc Confidence 9999999999999999998655443 334788899999999999999999999986 5799999999999999764 Q ss_pred CCCCccHHHHHHHhCCccEEEEcccccCCCCceEEEEEEeeCCCceEEEEeCchhhcccceecCCceEEeeecceeeeEE Q lcl|Aclame:pro 245 NQYGLSAAAKLKEIFPKLEFVTIPEYDTASGRLVQLWAPRVEGKDTATCGFTEKMRAHSIERYSSYFRQKKSAGTWGAVI 324 (336) Q Consensus 245 ~~~~~Tvl~~l~~n~pnl~i~~~pel~~a~G~~~~~~~~~~~~~~~~~~~~p~~~~~l~~~~~~~~~~v~~~~rt~Gv~i 324 (336) +++|+|+++||++|||+|+|+++|||++++++...+++-..++++++++++||||++||+|+++++|+|||++|||||+| T Consensus 235 ~~~~~tvl~~lk~~~~~l~I~~~~el~~ag~~g~~~~v~y~~~~~~~~~~vp~~~~~l~~q~~~~~~~v~~~~r~~Gv~i 314 (329) T protein:vir:79 235 PETTMSYLDYFKQQNGGITIESISELEDIDGAGTKAALVYEKDPMNMSIEIPEAFNMLTAQPKDLHFKVPCTSKCTGLTI 314 (329) T ss_pred CCCCccHHHHHHHhCCCcEEEEcccccccCCCCceEEEEEecCCceEEEecCcceeeeeceecCceEEEceeeeEEEEEE Confidence 57899999999999999999999999999876555555556789999999999999999999999999999999999999 Q ss_pred ecccceeeeccC Q lcl|Aclame:pro 325 FRPFAVAQMIGV 336 (336) Q Consensus 325 r~P~ai~~~~GI 336 (336) |||+||+|++|| T Consensus 315 ~~P~ai~~~dGI 326 (329) T protein:vir:79 315 YRPLTLVLIKGL 326 (329) T ss_pred ECcceeeeeeee Confidence 999999999999 No 10 >protein:vir:104342 Length: 314 # NCBI annotation: hypothetical protein # Family: family:all:463 # MgeID: mge:1593 # MgeName: RTP # Cross-refs: genbank:acc:YP_398971;genbank:gi:81343955;genbank:GeneID:3778874 Probab=100.00 E-value=2.6e-90 Score=511.77 Aligned_cols=305 Identities=14% Similarity=0.133 Sum_probs=263.9 Q ss_pred cceeccchhhhhhhhhhhhh-hhhhhhcCccccCCcchHHHHHH---HhhCceeeeeeccccchhhhcccccCCCcceee Q lcl|Aclame:pro 13 AGVILPRSVKNVSTPLAEYA-MDAADLSPHLSSTGSSGIPNYLT---TYVDPSVIDILVAPMKAAELVGESKKGDWTTLV 88 (336) Q Consensus 13 ~g~~~~~~~~~~~~~~~~~~-~da~d~~~~l~t~~~~~i~~~l~---~~idp~v~~~~~~~~~~~~l~~v~t~g~w~~~t 88 (336) .-++|+...-.+......|. +|+ |.+ -+|++ ++|||+|||+++++++++++||++++++|++++ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~-d~~-----------~~fl~~ql~~id~~v~e~~~~~~~~~~~i~v~~~~~~~~et 68 (314) T protein:vir:10 1 MAIKFDAEQAKITTHLEQMGVEKA-DAA-----------GIWAVSQLTAALNRAYEKEYAENSVVNIFPVTNEIPGHAKY 68 (314) T ss_pred CccchHHHHHHHHHHHHhhcccch-hhh-----------HHHHHHHHHHHHHHHhhhhccccccceeeccccCCCCceeE Confidence 11222211111111111111 111 111 13555 699999999999999999999999999999999 Q ss_pred EEEeeeecccceEEeeccc-CCceeeeeeeeeeeeEEEEEEEEEeCHHHHHHHHHhCCCHHHHHHHHHHHHHHHhhccEE Q lcl|Aclame:pro 89 AAFITAEPTTTVATYGDYS-SDGDSGTNINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASELNYSSALGLAKFLNGSY 167 (336) Q Consensus 89 ~~~~v~e~~G~a~~ygd~~-DiP~vd~~~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~K~~aAr~a~e~~~n~i~ 167 (336) ++|+++|.+|++++|||++ |+|++|++++++++++++++.+|+||++|+++|+++|++|+++|+.+|++++++++|+++ T Consensus 69 ~~~~~~e~~G~a~~~~d~~~dip~vd~~~~~~~~~i~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aA~~~~~~~~n~i~ 148 (314) T protein:vir:10 69 FEYPEFDGVGIAQIIADYSDDLPLVDAFMTEKQGKVFRFGNAFLISTDEIKAGAATGQSLSARKQALAFEAHDNLLDKLV 148 (314) T ss_pred EEeeeeccccceeeeCCcccccceeecccceeEEEEEEEEeeEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEE Confidence 9999999999999999975 679999999999999999999999999999999999999999999999999999999999 Q ss_pred EeeccccceEEEEecCCCCcccccccccccccCHHHHHHHHHHHHHHHHHHhCCceeccCCcEEEecHHHHHhcccCCC- Q lcl|Aclame:pro 168 LFGVAGLENYGLINDPSLSAPITATTPWSGSPAVEAVVNEVVTLFQVLQTQSQGIITQEAVLHMGLPPTAMSDLSKTNQ- 246 (336) Q Consensus 168 ~~Gd~~~g~~GllN~Pnl~~~~~~~t~w~~~~T~~eI~~Di~~l~~~l~~~t~g~v~~~~p~tL~Lp~~~~~~Ls~~~~- 246 (336) |+|++++|++|||||||++.. +++++| +|++||++||+++++++++||+|. +.|++|+|||+++.+|+++++ T Consensus 149 f~G~~~~g~~GLlN~p~v~~~-~~~~~W---aT~~ei~~Di~~~~~~l~~~s~g~---~~p~~l~Lpp~~~~~L~~~~~~ 221 (314) T protein:vir:10 149 WSGSAPHGIVSVFDQPNINNV-VATPNW---SVPQNAIDDVTAMIDAVESSTQGL---HHVTDILLPASARRVMQGLVPQ 221 (314) T ss_pred EeecccccceeEeecCCCccc-cCCCCc---ccHHHHHHHHHHHHHHHHHhcCcc---ccceeEEecHHHHHhhcccccC Confidence 999999999999999999753 445565 489999999999999999999986 579999999999999987754 Q ss_pred CCccHHHHHHHhCCccEEEEcccccCCCCceEEEEEEeeCCCceEEEEeCchhhcccceecCCceEEeeecceeeeEEec Q lcl|Aclame:pro 247 YGLSAAAKLKEIFPKLEFVTIPEYDTASGRLVQLWAPRVEGKDTATCGFTEKMRAHSIERYSSYFRQKKSAGTWGAVIFR 326 (336) Q Consensus 247 ~~~Tvl~~l~~n~pnl~i~~~pel~~a~G~~~~~~~~~~~~~~~~~~~~p~~~~~l~~~~~~~~~~v~~~~rt~Gv~ir~ 326 (336) +|+|+++||++|||||+|+++|||++++|+...+++-..++++++++++||||++||+|+++++|++||++|||||+||| T Consensus 222 ~~~tvl~~l~~n~~~l~I~~~~el~~ag~~g~~~~v~y~~~~~~~~~~vp~~~~~l~~e~~~~~~~~~~~~r~~Gv~i~~ 301 (314) T protein:vir:10 222 TNLSYGELFTRNNPGLTIRFLQFLDNYDGAGGKAALAFEKSPLNMSIEIPEVTNVLPAQPKDLHFRYPVTSKATGLIVYR 301 (314) T ss_pred CCccHHHHHHHhCCCcEEEEcccccccCCCcceEEEEEecCCcEEEEecCccceeecceecCceEEEcceeeeEEEEEEC Confidence 69999999999999999999999999987655555545678999999999999999999999999999999999999999 Q ss_pred ccceeeeccC Q lcl|Aclame:pro 327 PFAVAQMIGV 336 (336) Q Consensus 327 P~ai~~~~GI 336 (336) |+||+|++|| T Consensus 302 P~ai~~~dGI 311 (314) T protein:vir:10 302 PLTMAVIKGI 311 (314) T ss_pred cceeEeeeee Confidence 9999999999 No 11 >protein:vir:107687 Length: 319 # NCBI annotation: hypothetical protein # Family: family:all:463 # MgeID: mge:1518 # MgeName: T1 # Cross-refs: genbank:acc:YP_003898;genbank:gi:45686314;genbank:GeneID:2773027 Probab=100.00 E-value=2.6e-88 Score=500.76 Aligned_cols=316 Identities=12% Similarity=0.075 Sum_probs=269.8 Q ss_pred CchHH-HHHHHhhcceeccchhhhhhhhhhhhhhhhhhhcCccccCCcchHHHHHHHhhCceeeeeeccccchhhhcccc Q lcl|Aclame:pro 1 MRDAQ-RIQNLARAGVILPRSVKNVSTPLAEYAMDAADLSPHLSSTGSSGIPNYLTTYVDPSVIDILVAPMKAAELVGES 79 (336) Q Consensus 1 m~~~~-~~~~l~~~g~~~~~~~~~~~~~~~~~~~da~d~~~~l~t~~~~~i~~~l~~~idp~v~~~~~~~~~~~~l~~v~ 79 (336) |.+-+ +.++++. | ......|.+|.+ ++ .+-+.+.+...++|||++||+++++++++++||+. T Consensus 1 ~~~~~~~~~~~~~--~---------~~~~~~~~~~~d-a~-----~~~g~~~~~ql~~id~~v~e~~~~~l~~~~~i~v~ 63 (319) T protein:vir:10 1 MTTKKFDEADKSN--V---------EMYLIQAGVKQD-AA-----ATMGIWTAQELHRIKSQSYEEDYPVGSALRVFPVT 63 (319) T ss_pred CCCcchhHHhhHH--H---------HHHHhhccchhh-hh-----hhhhhHHHHHHHHHHHHHHhhhhcceechhhcccc Confidence 43211 1111111 1 011111222211 11 01122445566799999999999999999999999 Q ss_pred cCCCcceeeEEEeeeecccceEEeeccc-CCceeeeeeeeeeeeEEEEEEEEEeCHHHHHHHHHhCCCHHHHHHHHHHHH Q lcl|Aclame:pro 80 KKGDWTTLVAAFITAEPTTTVATYGDYS-SDGDSGTNINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASELNYSSALG 158 (336) Q Consensus 80 t~g~w~~~t~~~~v~e~~G~a~~ygd~~-DiP~vd~~~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~K~~aAr~a 158 (336) ++++|++++++|.++|.+|++++|||++ |+|++|++.+++++++++++.+|+|+++|+++|+++|++|+++|+.+|+++ T Consensus 64 ~~~~~~~~~~~~~~~~~~G~a~~~~d~~~dip~v~~~~~~~~~~i~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aA~~~ 143 (319) T protein:vir:10 64 TELSPTDKTFEYMTFDKVGTAQIIADYTDDLPLVDALGTSEFGKVFRLGNAYLISIDEIKAGQATGRPLSTRKASACQLA 143 (319) T ss_pred cCCCCceEEEEeeeeccccceeeecCccccccceeccceeeEEEEEEEEeeeeecHHHHHHHHHhCCChHHHHHHHHHHH Confidence 9999999999999999999999999975 579999999999999999999999999999999999999999999999999 Q ss_pred HHHhhccEEEeeccccceEEEEecCCCCcccccccccccccCHHHHHHHHHHHHHHHHHHhCCceeccCCcEEEecHHHH Q lcl|Aclame:pro 159 LAKFLNGSYLFGVAGLENYGLINDPSLSAPITATTPWSGSPAVEAVVNEVVTLFQVLQTQSQGIITQEAVLHMGLPPTAM 238 (336) Q Consensus 159 ~e~~~n~i~~~Gd~~~g~~GllN~Pnl~~~~~~~t~w~~~~T~~eI~~Di~~l~~~l~~~t~g~v~~~~p~tL~Lp~~~~ 238 (336) +++++|+++|+|++++|++||||||+++....+...+|+++|+|||++||++++++++++|+|. +.|++|+|||+++ T Consensus 144 ~~~~~n~i~f~G~~~~g~~GLlN~p~~~~~~~~~~~~~~t~t~~~i~~di~~~~~~l~~~s~g~---~~p~~L~L~p~~~ 220 (319) T protein:vir:10 144 HDQLVNRLVFKGSAPHKIVSVFNHPNITKITSGKWIDVSTMKPETAEAELTQAIETIETITRGQ---HRATNILIPPSMR 220 (319) T ss_pred HHHhhceEEEeecccccceeEEeCCCceeeecCCCCCccccCHHHHHHHHHHHHHHHHHhcCce---eeceEEEecHHHH Confidence 9999999999999999999999999998765544455778899999999999999999999986 4799999999999 Q ss_pred HhcccC-CCCCccHHHHHHHhCCccEEEEcccccCCCCceEEEEEEeeCCCceEEEEeCchhhcccceecCCceEEeeec Q lcl|Aclame:pro 239 SDLSKT-NQYGLSAAAKLKEIFPKLEFVTIPEYDTASGRLVQLWAPRVEGKDTATCGFTEKMRAHSIERYSSYFRQKKSA 317 (336) Q Consensus 239 ~~Ls~~-~~~~~Tvl~~l~~n~pnl~i~~~pel~~a~G~~~~~~~~~~~~~~~~~~~~p~~~~~l~~~~~~~~~~v~~~~ 317 (336) .+|+++ +.+|+|+++|||+||||++|+++|||++++|+...+++-..++++++++++||||++||+|+++++|++||++ T Consensus 221 ~~L~~~~~~~~~t~l~~lk~~~~~l~I~~~pel~~ag~~g~~~~v~y~~~~~~~~~~v~~~~~~~~~e~~~l~~~~~~~~ 300 (319) T protein:vir:10 221 KVLAIRMPETTMSYLDYFKSQNSGIEIDSIAELEDIDGAGTKGVLVYEKNPMNMSIEIPEAFNMLPAQPKDLHFKVPCTS 300 (319) T ss_pred HhhhcccCCCCeeHHHHHHHhcCCceEEEeeeecccCCCcceEEEEEecCCceEEEecCcceeeeeeeecCceEEEeeee Confidence 999864 6789999999999999999999999999977655444444578999999999999999999999999999999 Q ss_pred ceeeeEEecccceeeeccC Q lcl|Aclame:pro 318 GTWGAVIFRPFAVAQMIGV 336 (336) Q Consensus 318 rt~Gv~ir~P~ai~~~~GI 336 (336) |||||+||||+||+|++|| T Consensus 301 r~~Gv~i~~P~ai~~~dGI 319 (319) T protein:vir:10 301 KCTGLTIYRPMTIVLITGV 319 (319) T ss_pred eeEEEEEEccceeEeeecC Confidence 9999999999999999999 No 12 >protein:vir:80068 Length: 301 # NCBI annotation: gp8 # Family: family:all:463 # MgeID: mge:1876 # MgeName: B054 # Cross-refs: genbank:acc:YP_001468712;genbank:gi:157325292;genbank:GeneID:5601759 Probab=100.00 E-value=8.9e-88 Score=497.82 Aligned_cols=292 Identities=12% Similarity=0.059 Sum_probs=273.5 Q ss_pred cccCCcchHHHHHHHhhCceeeeeeccccchhhhcccccCCCcceeeEEEeeeecccceEEeeccc-CCceeeeeeeeee Q lcl|Aclame:pro 42 LSSTGSSGIPNYLTTYVDPSVIDILVAPMKAAELVGESKKGDWTTLVAAFITAEPTTTVATYGDYS-SDGDSGTNINYPQ 120 (336) Q Consensus 42 l~t~~~~~i~~~l~~~idp~v~~~~~~~~~~~~l~~v~t~g~w~~~t~~~~v~e~~G~a~~ygd~~-DiP~vd~~~~~~~ 120 (336) |++.+++.+++.+.++|||++||++++++++++|||++++++|++++++|+++|.+|++++|||++ |+|++|+++++++ T Consensus 1 ~~~~~~g~f~~~~l~~id~~v~e~~~~~l~~r~l~~v~~~~~~~~~~~~~~~~~~~G~~~~~~~~~~dip~~~~~~~~~~ 80 (301) T protein:vir:80 1 MQGKITATIEARDLQAIDNVIYEPKQEELTARSVFPQKFDVNEGAESYSFDVMTRSGAAKIIANGADDLPLVDVDMVRKS 80 (301) T ss_pred CCccccchhhHHHHHHHHHHHHHhhhhhhhhhhhcccccCCCCceEEEEEeeeccceeEEEecCcccccccccccceeEE Confidence 889999999999999999999999999999999999999999999999999999999999999975 6799999999999 Q ss_pred eeEEEEEEEEEeCHHHHHHHHHhCCCHHHHHHHHHHHHHHHhhccEEEeeccccceEEEEecCCCCcccccc-----ccc Q lcl|Aclame:pro 121 RQSYFFQTWTRWGERELEMAGAGRVDLASELNYSSALGLAKFLNGSYLFGVAGLENYGLINDPSLSAPITAT-----TPW 195 (336) Q Consensus 121 ~~v~~~~~~~~y~~~El~~A~~~g~~l~~~K~~aAr~a~e~~~n~i~~~Gd~~~g~~GllN~Pnl~~~~~~~-----t~w 195 (336) +++++++.+|+|+++||++|+++|++|+++|+.+|++++++++|+++|+|+++++++||||+|++++..++. .+. T Consensus 81 ~~i~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aa~~~~~~~~n~~~f~G~~~~g~~GLlN~p~~~~~~~~~~~~~~~~~ 160 (301) T protein:vir:80 81 VPIYSIGIGLSYTIQDLRAARMQGTTVDAAKATTVRRAIAEKENSIAFRGEKKYAIKGAFEATGIQIDVSPTTGVGNVSK 160 (301) T ss_pred EEEEEEEeeeeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEeeecccccceeeecCCCcccccccCcccccccc Confidence 999999999999999999999999999999999999999999999999999999999999999998766543 245 Q ss_pred ccccCHHHHHHHHHHHHHHHHHHhCCceeccCCcEEEecHHHHHhcccC---CCCCccHHHHHHHhCCccEEEEcccccC Q lcl|Aclame:pro 196 SGSPAVEAVVNEVVTLFQVLQTQSQGIITQEAVLHMGLPPTAMSDLSKT---NQYGLSAAAKLKEIFPKLEFVTIPEYDT 272 (336) Q Consensus 196 ~~~~T~~eI~~Di~~l~~~l~~~t~g~v~~~~p~tL~Lp~~~~~~Ls~~---~~~~~Tvl~~l~~n~pnl~i~~~pel~~ 272 (336) |.++|+|||++||++++++++.+|+|. +.|++|+|||+++.+|+++ +.+|+|+++||++|||+++|+++|||++ T Consensus 161 w~~~t~~ei~~di~~~~~~l~~~s~g~---~~p~~L~L~p~~~~~L~~~~~~~~~~~tvl~~l~~~~~~~~I~~~p~L~~ 237 (301) T protein:vir:80 161 WEKKTAEQIIDEIGEAHTKITVLPGYG---TASLKLCLPPKQFELINKKRYSNEDSRSVLKVLQDNAWFSAIVRVPDLAG 237 (301) T ss_pred cccCCHHHHHHHHHHHHHHHHHhcCce---ecccEEEecHHHHHhhhhccccCCCCeeHHHHHHHHcCcceEEEcceecc Confidence 788899999999999999999999986 4799999999999999865 5679999999999999999999999999 Q ss_pred CCCceEEEEEEeeCCCceEEEEeCchhhcccceecCCceEEeeecceeeeEEecccceeeeccC Q lcl|Aclame:pro 273 ASGRLVQLWAPRVEGKDTATCGFTEKMRAHSIERYSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) Q Consensus 273 a~G~~~~~~~~~~~~~~~~~~~~p~~~~~l~~~~~~~~~~v~~~~rt~Gv~ir~P~ai~~~~GI 336 (336) ++++...+++-..++++++++++||||++||+|+++++|+|||++|||||+||||.||+|++|| T Consensus 238 ~g~~g~~~~v~~~~~~d~~~~~v~~~~~~~~~e~~~~~~~~~~~~r~~Gv~i~~P~ai~~~~GI 301 (301) T protein:vir:80 238 MGTAGSDSFAVIHDSNETAELIIPMDITRHPEEYSFPRTKVPFEERTAGVVVRFPAAIVRVDGI 301 (301) T ss_pred CCCCcccEEEEEecCCcEEEEEecCceeeecceecCceeEeeeeeeeEEEEEEccceEEEEecC Confidence 8755444444445689999999999999999999999999999999999999999999999999 No 13 >protein:vir:103285 Length: 296 # NCBI annotation: hypothetical protein # Family: family:all:463 # MgeID: mge:1605 # MgeName: JK06 # Cross-refs: genbank:acc:YP_277465;genbank:gi:71834107;genbank:GeneID:3562396 Probab=100.00 E-value=1.3e-85 Score=486.01 Aligned_cols=290 Identities=13% Similarity=0.116 Sum_probs=264.7 Q ss_pred hhhhhhhhcCccccCCcchHHHHHHHhhCceeeeeeccccchhhhcccccCCCcceeeEEEeeeecccceEEeeccc-CC Q lcl|Aclame:pro 31 YAMDAADLSPHLSSTGSSGIPNYLTTYVDPSVIDILVAPMKAAELVGESKKGDWTTLVAAFITAEPTTTVATYGDYS-SD 109 (336) Q Consensus 31 ~~~da~d~~~~l~t~~~~~i~~~l~~~idp~v~~~~~~~~~~~~l~~v~t~g~w~~~t~~~~v~e~~G~a~~ygd~~-Di 109 (336) |.+|.+|.++.+ -+.-.++|||++||+++++++++++||++++++|++++++|+++|.+|++++|||++ |+ T Consensus 1 ~~~~~a~~~~~f--------~~~ql~~id~~v~e~~~~~l~~~~~i~v~~~~~~~~~~~~~~~~~~~G~a~~~~~~~~di 72 (296) T protein:vir:10 1 MGVDKADAAGIW--------TVKQLTASLNKAYETEYDQNSVVNLFPVSNEIPGYAKYFEYPVFDGVGIAQIVADYTDDL 72 (296) T ss_pred CcccchhhhHHH--------HHHHHHHHHHHHHhhhhcccccceecccccCCCCceeEEEeeeeeccCceeEeCCCcccc Confidence 888877765543 234447999999999999999999999999999999999999999999999999975 67 Q ss_pred ceeeeeeeeeeeeEEEEEEEEEeCHHHHHHHHHhCCCHHHHHHHHHHHHHHHhhccEEEeeccccceEEEEecCCCCccc Q lcl|Aclame:pro 110 GDSGTNINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASELNYSSALGLAKFLNGSYLFGVAGLENYGLINDPSLSAPI 189 (336) Q Consensus 110 P~vd~~~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~K~~aAr~a~e~~~n~i~~~Gd~~~g~~GllN~Pnl~~~~ 189 (336) |++|++.+++++++++++.+|+|+++||++|+++|++|+++|+.+|++++++++|+++|+|++++|++||||||+++.. T Consensus 73 p~v~~~~~~~~~~i~~~~~~~~~~~~El~~a~~~g~~l~~~ka~aA~~~~~~~~n~~~f~G~~~~g~~GLlN~p~v~~~- 151 (296) T protein:vir:10 73 PLVDALATERQGKVFRFGNAFLISIDEIKVGQATGQSLSTRKQSLAFEAHDKLLDKLVWSGSTAHGIPSVFDYPNINNV- 151 (296) T ss_pred ceeeccceeEEEEEEEEEeeeeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEeecccccceeEeecCCCccc- Confidence 9999999999999999999999999999999999999999999999999999999999999999999999999999864 Q ss_pred ccccccccccCHHHHHHHHHHHHHHHHHHhCCceeccCCcEEEecHHHHHhcccC-CCCCccHHHHHHHhCCccEEEEcc Q lcl|Aclame:pro 190 TATTPWSGSPAVEAVVNEVVTLFQVLQTQSQGIITQEAVLHMGLPPTAMSDLSKT-NQYGLSAAAKLKEIFPKLEFVTIP 268 (336) Q Consensus 190 ~~~t~w~~~~T~~eI~~Di~~l~~~l~~~t~g~v~~~~p~tL~Lp~~~~~~Ls~~-~~~~~Tvl~~l~~n~pnl~i~~~p 268 (336) +++++| + ++.+|++||++++++++.+|+|. +.|++|+|||+++.+|+++ +.+|+|+++||++||||++|+++| T Consensus 152 ~~~~~W-~--~~t~i~~Di~~~~~~l~~~s~g~---~~p~~l~L~p~~~~~L~~~~~~~~~t~l~~ik~~~~~l~i~~~~ 225 (296) T protein:vir:10 152 VSGGSW-S--QPTTAVSDITSLLDIIETSTNGQ---HRATHLLLPTTARRIMQNLVPGTSVSYGEFFRQNNSGVTVEFVQ 225 (296) T ss_pred cccCCc-c--CHHHHHHHHHHHHHHHHHhhCce---ecceeEEeCHHHHHHHhhccCCCCccHHHHHHHhcCCceEEEee Confidence 334455 3 45589999999999999999986 5689999999999999865 678999999999999999999999 Q ss_pred cccCCCCceE-EEEEEeeCCCceEEEEeCchhhcccceecCCceEEeeecceeeeEEecccceeeeccC Q lcl|Aclame:pro 269 EYDTASGRLV-QLWAPRVEGKDTATCGFTEKMRAHSIERYSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) Q Consensus 269 el~~a~G~~~-~~~~~~~~~~~~~~~~~p~~~~~l~~~~~~~~~~v~~~~rt~Gv~ir~P~ai~~~~GI 336 (336) ||++++|+.. .|+++ .++++++++++||||++||+|+++++|++||++|||||+||||.||++++|| T Consensus 226 ~l~~a~~~g~~~~v~~-~~~~~~~~~~v~~~~~~~~~e~~~l~~~~~~~~~~~Gv~i~~P~ai~~~dGI 293 (296) T protein:vir:10 226 YLNDYNGTGTSAAIAY-EKDPNNMAIEIPEATNALPAQPKDLHFKIPVTSKATGLIVYRPLTMAVMKGI 293 (296) T ss_pred eeccCCCCcceEEEEE-EcCCceEEEEcCcceeeecccccCceEEEeeEeeEEEEEEECCceeEEEeee Confidence 9999877544 44444 4789999999999999999999999999999999999999999999999999 No 14 >protein:vir:5255 Length: 304 # NCBI annotation: hypothetical protein # Family: family:all:463 # MgeID: mge:117 # MgeName: Aaphi23 # Cross-refs: genbank:acc:NP_852760;genbank:gi:31544035;uniprot:Q7Y5U0;genbank:GeneID:2753552 Probab=100.00 E-value=7.6e-83 Score=470.78 Aligned_cols=284 Identities=12% Similarity=0.068 Sum_probs=254.4 Q ss_pred hhhhhhcCccccCCcchHHHHHH---HhhCceeeeeeccccchhhhcccccCCCcceeeEEEeeeecccceE--Eeecc- Q lcl|Aclame:pro 33 MDAADLSPHLSSTGSSGIPNYLT---TYVDPSVIDILVAPMKAAELVGESKKGDWTTLVAAFITAEPTTTVA--TYGDY- 106 (336) Q Consensus 33 ~da~d~~~~l~t~~~~~i~~~l~---~~idp~v~~~~~~~~~~~~l~~v~t~g~w~~~t~~~~v~e~~G~a~--~ygd~- 106 (336) |++ -+||. ++||++|||.+|+++++.+|||++++++|++++++|+++|.+|+++ +++++ T Consensus 1 ~~~---------------lafl~~qL~~id~~vye~~~~~~~~~~lipv~t~~~~~~~~~~~~~~d~~G~a~~~~i~~~a 65 (304) T protein:vir:52 1 MSL---------------LAYVKNGLTAVSKDIAETKYPEIVFPQFVYVDQQTAVGITEKLHYGADEHGSLDDGLITVGT 65 (304) T ss_pred Cch---------------HHHHHHHHHHHhhhhhccccccchhhhhccccCCCCcccceEEEeeeeccCcccccccCCcC Confidence 221 14554 6899999999999999999999999999999999999999999999 66775 Q ss_pred cCCceeeeeeeeeeeeEEEEEEEEEeCHHHHHHHHHhCCCHHHHHHHHHHHHHHHhhccEEEeeccc-cceEEEEecCCC Q lcl|Aclame:pro 107 SSDGDSGTNINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASELNYSSALGLAKFLNGSYLFGVAG-LENYGLINDPSL 185 (336) Q Consensus 107 ~DiP~vd~~~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~K~~aAr~a~e~~~n~i~~~Gd~~-~g~~GllN~Pnl 185 (336) +|+|++|++++++++++++++.||+|+++||++|+++|++|+++|+++||+++++++|+++|+||++ +|++|||||||+ T Consensus 66 ~dip~vd~~~~~~~~~i~~~~~~~~y~~~El~~a~~~g~~l~~~ka~aa~~a~~~~~n~v~~~Gd~~~~g~~GllN~p~v 145 (304) T protein:vir:52 66 STLDQVEVGFTPTRSYIVPWAKSVTWTKPELEQGKLLGLALNTAKIMALNKNAQQTLQKVAFLGHAKDSRLTGLLNNKSV 145 (304) T ss_pred CccceeecccceeEEEEEEEeeeeeecHHHHHHHHHhCCCcHHHHHHHHHHHHHhhhceEEEEeeccccceEEEEeCCCc Confidence 6889999999999999999999999999999999999999999999999999999999999999985 789999999999 Q ss_pred Ccccccc---cccccccCHHHHHHHHHHHHHHHHHHhCCceeccCCcEEEecHHHHHhcccC--CCCCccHHHHHHHhCC Q lcl|Aclame:pro 186 SAPITAT---TPWSGSPAVEAVVNEVVTLFQVLQTQSQGIITQEAVLHMGLPPTAMSDLSKT--NQYGLSAAAKLKEIFP 260 (336) Q Consensus 186 ~~~~~~~---t~w~~~~T~~eI~~Di~~l~~~l~~~t~g~v~~~~p~tL~Lp~~~~~~Ls~~--~~~~~Tvl~~l~~n~p 260 (336) +...++. ++-|.++|+|||++||++++++++.+|+|. +.|+||+|||+++.+|+.+ +++|+|+|+||++||| T Consensus 146 ~~~~~~~~~a~~~w~~~T~~eI~~di~~~~~~i~~~s~~~---~~p~tl~Lpp~~~~~l~~~~~~~~~~Tvl~~l~~n~~ 222 (304) T protein:vir:52 146 EVYAIKGAAQNTKVQAMDFDKAVAFFKEIFLKGMEKTKRI---EAPNTFAIDSLDLAHLALVQRANTDTTALEFLTKHLS 222 (304) T ss_pred ceeeecCCccCCccccCCHHHHHHHHHHHHHHHHhccCce---ecCceEEeCHHHHHHHhhccCCCCCchHHHHHHHhcc Confidence 8755432 234788899999999999999999999975 6899999999999999754 5689999999999988 Q ss_pred -----ccEEEEccc-ccCC-CCceEEEEEEeeCCCceEEEEeCchhhcccceecCC-ceEEeeecceeeeEEecccceee Q lcl|Aclame:pro 261 -----KLEFVTIPE-YDTA-SGRLVQLWAPRVEGKDTATCGFTEKMRAHSIERYSS-YFRQKKSAGTWGAVIFRPFAVAQ 332 (336) Q Consensus 261 -----nl~i~~~pe-l~~a-~G~~~~~~~~~~~~~~~~~~~~p~~~~~l~~~~~~~-~~~v~~~~rt~Gv~ir~P~ai~~ 332 (336) +|+|+.+|+ +.++ +|++.+|++++ +++++.++++||||++||+|++++ .|++||++|||||+||||++++| T Consensus 223 ~~~g~~l~I~~v~~~~~~~g~~g~~r~vvY~-~d~~~~~~~vP~p~~~l~~q~~~~~~~~vp~~~r~gGv~v~~P~a~~y 301 (304) T protein:vir:52 223 AAAGRQVAIKALPSNYGTRVTDGKTRAMVYV-NSKEHVIFDVPMSPTVLDAQPKGLLAFESGLRMAFGGVTFMEPDSALY 301 (304) T ss_pred cccCCcceEEEecccccccCCCCceEEEEEe-cChhheEEecCccccccchhhcCCceEEecceeeeeeEEEEccceeee Confidence 678999984 5544 35566666664 579999999999999999999987 79999999999999999999999 Q ss_pred ecc Q lcl|Aclame:pro 333 MIG 335 (336) Q Consensus 333 ~~G 335 (336) .|= T Consensus 302 ~D~ 304 (304) T protein:vir:52 302 VDY 304 (304) T ss_pred ecC Confidence 999 No 15 >protein:vir:105778 Length: 358 # NCBI annotation: gp9 # Family: family:all:10995 # MgeID: mge:1501 # MgeName: ES18 # Cross-refs: genbank:acc:YP_224147;genbank:gi:62362222;genbank:GeneID:3342531 Probab=98.83 E-value=2.1e-10 Score=73.67 Aligned_cols=315 Identities=11% Similarity=0.013 Sum_probs=185.6 Q ss_pred CchHHHHHHHhhcceeccchhhhhhhhhhhhhhhhhhhcCccccCC-cchHHHHHHHhhCceeeeeeccc---cchhhhc Q lcl|Aclame:pro 1 MRDAQRIQNLARAGVILPRSVKNVSTPLAEYAMDAADLSPHLSSTG-SSGIPNYLTTYVDPSVIDILVAP---MKAAELV 76 (336) Q Consensus 1 m~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~da~d~~~~l~t~~-~~~i~~~l~~~idp~v~~~~~~~---~~~~~l~ 76 (336) |......++|.---..|.. +..+.||...+...|.+.+.. -+++|+-+-.-+|.++.++.-++ --..+|. T Consensus 12 ~~~~~qw~~L~~~Rna~n~------~~~a~maan~a~~~~~~~~~NAv~~v~~D~wr~~D~~~~q~fr~e~~~~l~NDLm 85 (358) T protein:vir:10 12 SRLGGHWNELWANRNMWNA------QHDAMIAANRSNMTPEWLAVNAVGGFTRDFWAEIDRQVLQLRDQEVGMEIVNDLI 85 (358) T ss_pred HHHHHHHHHHHHHHHHhhh------hhhhHHhhhHHHhhhhhheecccccCCHHHHHHHhhhhhhhcccchhHHHHhhhh Confidence 4444555554420000100 122445555555555444332 25667867677888887776664 3356788 Q ss_pred ccccCCCcceeeEEEeeeec-ccceEEe--ecc-cCCceeeeeeeeeeeeEEEEEEEEEeCHHHHHHHHHhCCCHHHHHH Q lcl|Aclame:pro 77 GESKKGDWTTLVAAFITAEP-TTTVATY--GDY-SSDGDSGTNINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASELN 152 (336) Q Consensus 77 ~v~t~g~w~~~t~~~~v~e~-~G~a~~y--gd~-~DiP~vd~~~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~K~ 152 (336) |+++.-+=......|++.-- .|++..- |.. ...--+..++.=...+|+. .||..+.+|..-.+--|+++..+-+ T Consensus 86 ~ls~sv~Igktv~~y~~~gd~~~~v~~SmsGQ~~~~lD~~~y~~dGtpiPIfd--sg~~f~WR~~~~~~~~g~d~~~daQ 163 (358) T protein:vir:10 86 GVQTVLPVGKTAKLYNVIGDIADDVSVSIDGQAPFSFDHTEYASDGDPIPVFT--AGYGVNWRHAAGLNSLGIDLVLDSQ 163 (358) T ss_pred hccccccHHHHHHHHhhhcCCCceEEEEecccCcccccceeeeccCCEeeeec--cCccccccchhhcCccccchhHHHH Confidence 88877665544455655433 6666433 432 2223333444434445554 4555555888888899999999999 Q ss_pred HHHHHHHHHhhccEEEeecc-----ccceEEEEecCCCCccccccc-c----cccccCHHHHHHHH-HHHHHHHHHHhCC Q lcl|Aclame:pro 153 YSSALGLAKFLNGSYLFGVA-----GLENYGLINDPSLSAPITATT-P----WSGSPAVEAVVNEV-VTLFQVLQTQSQG 221 (336) Q Consensus 153 ~aAr~a~e~~~n~i~~~Gd~-----~~g~~GllN~Pnl~~~~~~~t-~----w~~~~T~~eI~~Di-~~l~~~l~~~t~g 221 (336) .+.-+.+.++.-..+|.|+. ++..|||-||||+....-+++ + -..++|+++++.-+ .+++..+-...+ T Consensus 164 ~~~~~kv~~~~vdy~lNG~~~I~v~g~t~~Glrn~~n~~qv~l~~~s~g~NiDlttat~~a~~~~f~~~l~~~~~~~N~- 242 (358) T protein:vir:10 164 MAKMRKFNQKRVNYYLNGDPNIQVQSYPAQGIKNHRNTKKINLGSGSGGANIDLTTADMTALFAFFGKGAFGTLARANK- 242 (358) T ss_pred HHHHHHHHHHHHhhhhccCCceeecCcccccccCCcceeEEEeccCCCcceeeeccCCHHHHHHHHHHHHHHHHHhhcc- Confidence 99999999999999999996 677899999999864332221 1 24467888888888 567777765543 Q ss_pred ceeccCCcEEEecHHHHHhcccC-CCC---CccHHHHHHHhCCcc-EEEEcccccCCCCceEEEEEEeeCCCceEEEEeC Q lcl|Aclame:pro 222 IITQEAVLHMGLPPTAMSDLSKT-NQY---GLSAAAKLKEIFPKL-EFVTIPEYDTASGRLVQLWAPRVEGKDTATCGFT 296 (336) Q Consensus 222 ~v~~~~p~tL~Lp~~~~~~Ls~~-~~~---~~Tvl~~l~~n~pnl-~i~~~pel~~a~G~~~~~~~~~~~~~~~~~~~~p 296 (336) .-.-.++..+|+.+..|.++ ... +-|||+++++ |+++ .|++.+.|+ |+-...++. ..++..-.+- T Consensus 243 ---~~~~~~~~vs~ei~~n~~r~Y~~~~~~~gTIl~~vl~-~~~va~I~~~~~Ls---gNeii~~~~---~~~vi~plvG 312 (358) T protein:vir:10 243 ---VAQYDVMWVSPEIWANLAQPYVVNGVVSGNVLNAVLP-FAPVREIRQTFALS---GNEFIAYVR---RQDIISPLVG 312 (358) T ss_pred ---cceeeEEEEcHHHHhhhhcccccccccchhhHHHhhc-ccCcccccccccCC---CccEEEEEe---CCceeeeeec Confidence 12356899999999999874 222 3499999976 4554 466666676 655544432 2444444445 Q ss_pred chhhcccceecCCceEEeeeccee---eeEEeccc----ceeeeccC Q lcl|Aclame:pro 297 EKMRAHSIERYSSYFRQKKSAGTW---GAVIFRPF----AVAQMIGV 336 (336) Q Consensus 297 ~~~~~l~~~~~~~~~~v~~~~rt~---Gv~ir~P~----ai~~~~GI 336 (336) |++-..|.-..+ +.-.+..++| |++||.=. .|.+..-+ T Consensus 313 ~~~gt~~~pR~~--p~ddY~f~vwsA~glqik~D~~Gks~Vv~~~~~ 357 (358) T protein:vir:10 313 MAVGVVPLPRPL--PNVNYNFQIMSAEGLQITADDQGLSGVVYGANL 357 (358) T ss_pred ceeeeecCCCCC--CCcchhhhhhhhhceeeeeccccceeeEeeccc Confidence 554333321111 1122222332 34444432 23333333 No 16 >protein:vir:7771 Length: 330 # NCBI annotation: gp17 # Family: family:all:507 # MgeID: mge:149 # MgeName: Bxz2 # Cross-refs: genbank:acc:NP_817605;genbank:gi:29566035;genbank:GeneID:1259229 Probab=98.71 E-value=9.7e-10 Score=69.99 Aligned_cols=288 Identities=8% Similarity=-0.030 Sum_probs=163.6 Q ss_pred hhhhhhhhcCccccCCcch-H-HHHHHHhhCceeeeeeccccchhhhcccccCCCcceeeEEEeeeecccceEEeecccC Q lcl|Aclame:pro 31 YAMDAADLSPHLSSTGSSG-I-PNYLTTYVDPSVIDILVAPMKAAELVGESKKGDWTTLVAAFITAEPTTTVATYGDYSS 108 (336) Q Consensus 31 ~~~da~d~~~~l~t~~~~~-i-~~~l~~~idp~v~~~~~~~~~~~~l~~v~t~g~w~~~t~~~~v~e~~G~a~~ygd~~D 108 (336) |+.+...+.-..+|.+.++ + |.+.. ++++.+.+.....++.++.+... ....|++.+..+.+..++.... T Consensus 1 m~~~~~~a~~~~~t~~~g~~i~~~~~~-----~ii~~~~~~s~l~~~~~~~~~~~---~~~~~p~~~~~~~a~~v~Eg~~ 72 (330) T protein:vir:77 1 MAGSTVPSTQVALTGDFSAFLTPEQSQ-----DYFAEIEKTSIVQRIARKVPMGP---TGISIPHWTGAVSASWTGEAER 72 (330) T ss_pred CcccccchhhccccCCCcceechhHHH-----HHHHHHHhccchhhhcceeeccC---CceEEEEEcCCcceeEecCCCc Confidence 4433333222223333333 3 33333 33444445555666665543322 3367888888888999999999 Q ss_pred CceeeeeeeeeeeeEEEEEEEEEeCHHHHHHHHHhCCCHHHHHHHHHHHHHHHhhccEEEeecc-ccceEEEEecCCCCc Q lcl|Aclame:pro 109 DGDSGTNINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASELNYSSALGLAKFLNGSYLFGVA-GLENYGLINDPSLSA 187 (336) Q Consensus 109 iP~vd~~~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~K~~aAr~a~e~~~n~i~~~Gd~-~~g~~GllN~Pnl~~ 187 (336) +|..+..........+.++..+.+|.+=++. ...++.+.-.....+++.+.+++-.++|+. +....|++|++.... T Consensus 73 ~~~~~~~f~~i~~~~~k~~~~~~is~ell~d---s~~~~~~~i~~~l~~ai~~~~~~~~l~G~g~~~~~~g~~~~~~~~~ 149 (330) T protein:vir:77 73 KPITKGSFGKQELEPVKITTIFAESAEVVRL---NPLNYLNTMRTKIAEAIALKFDAAAIHGIDKPSAFKGYLAETTKVV 149 (330) T ss_pred cccccceeeEEEEeEEEEEEeehhhHHHHhc---chHHHHHHHHHHHHHHHHHHHHHHhhcccCCCCccccccccccccc Confidence 9999998888999999999999988765543 356789999999999999999999999997 455679999764332 Q ss_pred ccccccccccccCHHHHHHHHHHHHHHHHHHhCCceeccCCcEEEecHHHHHhccc-CCCCCccHHHH-HHHh----CCc Q lcl|Aclame:pro 188 PITATTPWSGSPAVEAVVNEVVTLFQVLQTQSQGIITQEAVLHMGLPPTAMSDLSK-TNQYGLSAAAK-LKEI----FPK 261 (336) Q Consensus 188 ~~~~~t~w~~~~T~~eI~~Di~~l~~~l~~~t~g~v~~~~p~tL~Lp~~~~~~Ls~-~~~~~~Tvl~~-l~~n----~pn 261 (336) .......--...+...+++|+.+++..+...- -.+..++|.++.+..|.+ .+..|.-++.- +... ..+ T Consensus 150 ~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~------~~~~~~vmn~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~~~~ 223 (330) T protein:vir:77 150 SLADTNLTTASGPQGNAYLAVNNALSLLVNSG------KKWTGTLLDNVTEPILNTAVDGNGRPLFVESTYTEQVGAIRE 223 (330) T ss_pred eeecccccccccccchhHHHHHHHHHhhhhcC------CCccEEEEcHHHHHHHHHHhccCCceeecCccccccccccCC Confidence 22211111112234567899999888776542 134579999999988854 23333322211 0000 111 Q ss_pred cEE-----EEccccc-CCCCceEEEEEEeeCC-----CceEEEEeCch-hhc-----------ccc-eecCCceEEeeec Q lcl|Aclame:pro 262 LEF-----VTIPEYD-TASGRLVQLWAPRVEG-----KDTATCGFTEK-MRA-----------HSI-ERYSSYFRQKKSA 317 (336) Q Consensus 262 l~i-----~~~pel~-~a~G~~~~~~~~~~~~-----~~~~~~~~p~~-~~~-----------l~~-~~~~~~~~v~~~~ 317 (336) .++ +...... +.++++..+++-.... ..-..+.+... .-. .++ -...-...+-|+. T Consensus 224 ~~l~G~PV~~~~~~p~~~~~~~~~~~~gd~s~~~i~~~~~~~i~~~~e~~~~~~~~~~~~~~~~~~~~f~~~~~~~r~~~ 303 (330) T protein:vir:77 224 GRILGRPTYVADNVVNGTVGNRVVGVMGDFSQVIWGQIGGLSFDVTDQATLDFGEEQGGVWVPKLISLWQHNMVAVRCEA 303 (330) T ss_pred ceecceeeEEeccccCCCCCCccEEEEEecceEEEEEecCcEEEEeecceeeecccccccccccccchhhcCcEEEEEEE Confidence 222 2222221 2233444333322110 00011111111 000 000 0111236667888 Q ss_pred ceeeeEEecccceeeeccC Q lcl|Aclame:pro 318 GTWGAVIFRPFAVAQMIGV 336 (336) Q Consensus 318 rt~Gv~ir~P~ai~~~~GI 336 (336) |.++.. ++|.||+++.+. T Consensus 304 r~d~~v-~~~~a~~~i~~~ 321 (330) T protein:vir:77 304 EFAFMV-NDKDAFVKLTDQ 321 (330) T ss_pred EeccEE-ecccceEEEEec Confidence 887766 669999999999 No 17 >protein:vir:104085 Length: 320 # NCBI annotation: gp17 # Family: family:all:507 # MgeID: mge:1656 # MgeName: Che12 # Cross-refs: genbank:acc:YP_655596;genbank:gi:109392467;genbank:GeneID:4156953 Probab=98.49 E-value=1.5e-08 Score=63.50 Aligned_cols=285 Identities=10% Similarity=-0.032 Sum_probs=153.4 Q ss_pred hhhhhhhhhhhhcCccc--cCCcchHHHHHHHhhCceeeeeeccccchhhhcccccCCCcceeeEEEeeeecccceEEee Q lcl|Aclame:pro 27 PLAEYAMDAADLSPHLS--STGSSGIPNYLTTYVDPSVIDILVAPMKAAELVGESKKGDWTTLVAAFITAEPTTTVATYG 104 (336) Q Consensus 27 ~~~~~~~da~d~~~~l~--t~~~~~i~~~l~~~idp~v~~~~~~~~~~~~l~~v~t~g~w~~~t~~~~v~e~~G~a~~yg 104 (336) ..+..++|+.-.+-... +...+.||..+. .++++.+...-...++.++...+. .+..|++.+..+.+...+ T Consensus 1 ~~~~~~~~~~~~~~~~t~~~~~~~~ip~~~~----~~ii~~~~~~s~l~~~~~~~~~~~---~~~~~p~~~~~~~a~~v~ 73 (320) T protein:vir:10 1 MAAGTAFQVDHAQIAQTGDTMFKGYLEPEQA----KDYFAEAEKTSIVQQFAQKVPMGT---TGQKIPHWIGDVSAQWIG 73 (320) T ss_pred CCCCccCCHHHHHhhccccccccccccHHHH----HHHHHHHHhccchhhhcceeeccC---CceEEEEEeCCcceEEec Confidence 11112223221111111 112223565554 345555555556666666654332 346788888888899999 Q ss_pred cccCCceeeeeeeeeeeeEEEEEEEEEeCHHHHHHHHHhCCCHHHHHHHHHHHHHHHhhccEEEeeccccceEEEEecCC Q lcl|Aclame:pro 105 DYSSDGDSGTNINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASELNYSSALGLAKFLNGSYLFGVAGLENYGLINDPS 184 (336) Q Consensus 105 d~~DiP~vd~~~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~K~~aAr~a~e~~~n~i~~~Gd~~~g~~GllN~Pn 184 (336) ...++|..+........+.+.++..+.+|.+=++.+. .++.+.-....++++.+.+++-.+.|+....-.|++...+ T Consensus 74 E~~~~~~~~~~f~~v~~~~~k~~~~~~is~ell~ds~---~~l~~~i~~~l~~a~a~~~d~a~l~G~g~~~~~~~~~~~~ 150 (320) T protein:vir:10 74 EGDMKPITKGNMTSQNIAPHKIATIFVASAETVRANP---ANYLGTMRTKVATAFAMAFDSAALNGTDSPFPTYLAQTTK 150 (320) T ss_pred CCccccccccceeEEEEeeEEEEEeehhhHHHHhcCh---HHHHHHHHHHHHHHHHHHHHHHhhcccCCCCCcccccccc Confidence 9999999999999999999999999999987666443 6788888888999999999999999998544344444322 Q ss_pred CCcccccccccccccCHHHHH---HHHHHHHHHHHHHhCCceeccCCcEEEecHHHHHhccc-CCCCCccHHHH-H---- Q lcl|Aclame:pro 185 LSAPITATTPWSGSPAVEAVV---NEVVTLFQVLQTQSQGIITQEAVLHMGLPPTAMSDLSK-TNQYGLSAAAK-L---- 255 (336) Q Consensus 185 l~~~~~~~t~w~~~~T~~eI~---~Di~~l~~~l~~~t~g~v~~~~p~tL~Lp~~~~~~Ls~-~~~~~~Tvl~~-l---- 255 (336) ........+ .+.+.+. +++.+++..+... . ..+..++|.++.+..|.+ .+..|..++.- + T Consensus 151 ~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~---~---~~~~~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~~~ 219 (320) T protein:vir:10 151 SVSLADPGG-----ATASDLTAYDAVAVNGLSLLVNA---K---KKWTHTLLDDIVEPILNGAKDKNGRPLFIESTYTDE 219 (320) T ss_pred cccceeccc-----ccccccccHHHHHHHHHhhhhcc---c---CCCcEEEEcHHHHHHHHHhhccCCceeeccccccCc Confidence 111111111 1111121 2233333333221 1 246689999999999864 33333333211 1 Q ss_pred HHhCCccEEEEcccccCC---CCceEEEE-------EEeeCCCceEEEEeC-chhhcc---cce-----ecCCceEEeee Q lcl|Aclame:pro 256 KEIFPKLEFVTIPEYDTA---SGRLVQLW-------APRVEGKDTATCGFT-EKMRAH---SIE-----RYSSYFRQKKS 316 (336) Q Consensus 256 ~~n~pnl~i~~~pel~~a---~G~~~~~~-------~~~~~~~~~~~~~~p-~~~~~l---~~~-----~~~~~~~v~~~ 316 (336) ..+++..++...|-.... .|....++ +-.+.+ ..+.+- +..-.. +.. .+.-...+-+. T Consensus 220 ~~~~~~~~i~g~pv~~~~~~~~~~~~~~~gd~~~~~~~~~~~---~~i~~~~~~~~~~~~~~~~~~~~~f~~~~~~~r~~ 296 (320) T protein:vir:10 220 NSPFRAGRIVSRPTILSDHVADGTTVGYMGDFRNVIWGQVGG---LSFDVTDQATLNLGTPTEPNFVSLWQHNLVAVRVE 296 (320) T ss_pred cccccCceeeeeeeEecCCCCCCceEEEEeecceEEEEEecC---eEEEEeecceeeeccccccccchhhhcCcEEEEEE Confidence 112334455555554322 22222121 111111 111111 000000 000 01112344555 Q ss_pred cceeeeEEecccceeeeccC Q lcl|Aclame:pro 317 AGTWGAVIFRPFAVAQMIGV 336 (336) Q Consensus 317 ~rt~Gv~ir~P~ai~~~~GI 336 (336) .|+ |+.+.+|.||+++.|+ T Consensus 297 ~~~-d~~v~~~~a~~~l~~~ 315 (320) T protein:vir:10 297 AEY-AFHNNDKDAFVKLTNV 315 (320) T ss_pred Eee-ccEEecccceEEEEec Confidence 565 6677999999999999 No 18 >protein:vir:80376 Length: 435 # NCBI annotation: gp6, major capsid head protein # Family: family:all:21 # MgeID: mge:1881 # MgeName: phi644-2 # Cross-refs: genbank:acc:YP_001111085;genbank:gi:134288639;genbank:GeneID:4960624 Probab=98.45 E-value=2.6e-08 Score=62.14 Aligned_cols=315 Identities=12% Similarity=0.062 Sum_probs=156.4 Q ss_pred CchHHHHHHH-hh--------------------------cceeccc-------hhhhhhh--hh---hhhhhhhhhhcCc Q lcl|Aclame:pro 1 MRDAQRIQNL-AR--------------------------AGVILPR-------SVKNVST--PL---AEYAMDAADLSPH 41 (336) Q Consensus 1 m~~~~~~~~l-~~--------------------------~g~~~~~-------~~~~~~~--~~---~~~~~da~d~~~~ 41 (336) +++.+..... ++ .|..|.. +...... .. .....+.+.+.- T Consensus 55 ~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~- 133 (435) T protein:vir:80 55 AEAAERMAAAAAVPVDPNPAAVTASAAAPVYAQPKAPEVKGAKMARMVRALAAARGDAQLASKLAIERGFGEEVAMSLN- 133 (435) T ss_pred HHHHHHHHHhhcccccchhhhhccccccccccccchhhhhHHHHHHHHHHHHhccchhHHHHHHHHhhhhhhhhhhhhc- Confidence 1111111100 00 0000000 0000000 00 000001110000 Q ss_pred cccCCcc--hHHHHHHHhhCceeeeeeccccchhhhcccccCCCcceeeEEEeeeecccceEEeecccCCceeeeeeeee Q lcl|Aclame:pro 42 LSSTGSS--GIPNYLTTYVDPSVIDILVAPMKAAELVGESKKGDWTTLVAAFITAEPTTTVATYGDYSSDGDSGTNINYP 119 (336) Q Consensus 42 l~t~~~~--~i~~~l~~~idp~v~~~~~~~~~~~~l~~v~t~g~w~~~t~~~~v~e~~G~a~~ygd~~DiP~vd~~~~~~ 119 (336) ..+...+ .+|..+.+ +|++.+.+......+-. +.-......+.|++.+..+.+...+.....|..+...... T Consensus 134 ~~~~~~gg~lvP~~~~~----~ii~~l~~~~~i~~~~~--~~v~~~~~~~~~p~~~~~~~a~~v~E~~~~~~~~~~f~~i 207 (435) T protein:vir:80 134 TLSPGAGGVLVPENLSS----EVIELLRPKSVVRKLGA--RTLPLSNGNITIPRLKGGAIVGYIGADTDIPTTQQQFDDL 207 (435) T ss_pred ccCCCCCccccchhHHH----HHHHHHhhhchhhhccc--eeeecCCCceEEEEEeCCcceeeeccCccccccccceeeE Confidence 1122222 35666554 23333322222222211 1111112346778888788888888888899999888888 Q ss_pred eeeEEEEEEEEEeCHHHHHHHHHhCCCHHHHHHHHHHHHHHHhhccEEEeeccc-cceEEEEecCCCCcccccccccccc Q lcl|Aclame:pro 120 QRQSYFFQTWTRWGERELEMAGAGRVDLASELNYSSALGLAKFLNGSYLFGVAG-LENYGLINDPSLSAPITATTPWSGS 198 (336) Q Consensus 120 ~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~K~~aAr~a~e~~~n~i~~~Gd~~-~g~~GllN~Pnl~~~~~~~t~w~~~ 198 (336) .-..+.++..+.+|.+=|.- ...+-++.+.-......++.+.+++-+++|++. ....|++++........++ +. T Consensus 208 ~~~~~k~~~~~~is~ell~d-s~~~~~l~~~i~~~l~~a~~~~~d~a~l~G~G~~~~p~Gi~~~~~~~~~~~~~----~~ 282 (435) T protein:vir:80 208 KLTAKKMAALVPIANDLIKY-AGVNPNVDQIVVGDLTAAIGAREDKAFIRDDGTANTPKGLRFWALPGNVITAS----DG 282 (435) T ss_pred EEeeEEEEEeehhhHHHHHh-hcccHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCcccceeecccccceeecc----cc Confidence 88889999888887553333 333456777788888888888999988999863 5678999976543322222 12 Q ss_pred cCHHHHHHHHHHHHHHHHHHhCCceeccCCcEEEecHHHHHhccc-CCCCCccHHHHHHHh----CCccEEEEcccccCC Q lcl|Aclame:pro 199 PAVEAVVNEVVTLFQVLQTQSQGIITQEAVLHMGLPPTAMSDLSK-TNQYGLSAAAKLKEI----FPKLEFVTIPEYDTA 273 (336) Q Consensus 199 ~T~~eI~~Di~~l~~~l~~~t~g~v~~~~p~tL~Lp~~~~~~Ls~-~~~~~~Tvl~~l~~n----~pnl~i~~~pel~~a 273 (336) .|.+.+..|+.+++..+.....+ -.+..++|.+..+..|.+ .+..|.-++.-+..+ +|=+....+|...+. T Consensus 283 ~~~~~~~~d~~~~~~~~~~~~~~----~~~~~~vmn~~~~~~L~~lkd~~G~~l~~~~~~~~l~G~pv~~~~~~p~~~~~ 358 (435) T protein:vir:80 283 STLQKIETDLGKAILALENADAN----LTQPGWIMAPRTFRFLEGLRDGNGNKVYPELANGMLKGYPVGKTTQVPINLGE 358 (435) T ss_pred cchhhHHHHHHHHHHHhhccccc----cccCEEEEcHHHHHHHHhhhccCCceeccCCCCCeEeeeeeEEeccccccccC Confidence 45677888999988887654321 134578999999998864 343344333211111 121222233333334 Q ss_pred CCceEEEEEEeeCCCceEEEEeCch--hhcc--c-----------ceecCCceEEeeecceeeeEEecccceeeeccC Q lcl|Aclame:pro 274 SGRLVQLWAPRVEGKDTATCGFTEK--MRAH--S-----------IERYSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) Q Consensus 274 ~G~~~~~~~~~~~~~~~~~~~~p~~--~~~l--~-----------~~~~~~~~~v~~~~rt~Gv~ir~P~ai~~~~GI 336 (336) ++....+++-+.. + ..+..-.. +... . .-.++ ...+-+..| .++.+++|.||+++.|+ T Consensus 359 ~~~~~~i~~gd~s--~-~~i~~~~~~~i~~~~~~~~~~~~~~~~~~f~~n-~~~~r~~~r-~d~~~~~~~a~~~l~~~ 431 (435) T protein:vir:80 359 AGKESEIYFTDFG--D-VFIGEEETLEIDYSKEATYKDADGHMVSAFQRD-QTLIRVIAK-NDFGPRHVESIAVLSGV 431 (435) T ss_pred CCCcceEEEEEcc--c-EEEEeecceEEEEeccccccccccchhhhhhcC-cceeeeeee-eCcEeecccceEEEecc Confidence 4443333332211 1 10100000 0000 0 00111 133344444 45678899999999999 No 19 >protein:vir:105905 Length: 304 # NCBI annotation: major capsid protein # Family: family:all:507 # MgeID: mge:1514 # MgeName: phiETA3 # Cross-refs: genbank:acc:YP_001004375;genbank:gi:122891830;genbank:GeneID:4712376 Probab=98.45 E-value=3.2e-08 Score=61.66 Aligned_cols=282 Identities=10% Similarity=0.011 Sum_probs=157.2 Q ss_pred hhhhhhhhcCccccCCcch--HHHHHHHhhCceeeeeeccccchhhhcccccCCCcceeeEEEeeeecccceEEeecccC Q lcl|Aclame:pro 31 YAMDAADLSPHLSSTGSSG--IPNYLTTYVDPSVIDILVAPMKAAELVGESKKGDWTTLVAAFITAEPTTTVATYGDYSS 108 (336) Q Consensus 31 ~~~da~d~~~~l~t~~~~~--i~~~l~~~idp~v~~~~~~~~~~~~l~~v~t~g~w~~~t~~~~v~e~~G~a~~ygd~~D 108 (336) ||.-...+.- ..++++++ ||..+. .++++.+........++.+...+. ....+++.+..+.+..++.... T Consensus 1 ma~~~~~~~~-~~~t~~gg~lip~~~~----~~ii~~~~~~~~l~~~~~~~~~~~---~~~~ip~~~~~~~a~~v~E~~~ 72 (304) T protein:vir:10 1 MATPTYTPGN-VILSDFKNGVIPAEQG----TLIMKDIMANSAIMKLAKNEPMTA---QKKKFTYLAKGVGAYWVSETER 72 (304) T ss_pred Cccccccccc-ccccCCCceecchhHH----HHHHHHHHhccchhhhcceeeccC---CceEEEEEeCCcceEEeecCcc Confidence 4333222222 22233333 666665 344555555555555655544332 3356788887888889998889 Q ss_pred CceeeeeeeeeeeeEEEEEEEEEeCHHHHHHHHHhCCCHHHHHHHHHHHHHHHhhccEEEeeccccceEEEEecCCCCcc Q lcl|Aclame:pro 109 DGDSGTNINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASELNYSSALGLAKFLNGSYLFGVAGLENYGLINDPSLSAP 188 (336) Q Consensus 109 iP~vd~~~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~K~~aAr~a~e~~~n~i~~~Gd~~~g~~GllN~Pnl~~~ 188 (336) +|..+..........+.++..+.+|.+=++.+ ..++.+.-.....+++.+.+++-+++|++..+-.|.+.+..+... T Consensus 73 ~~~~~~~~~~i~~~~~k~~~~~~iS~ell~ds---~~~l~~~i~~~l~~~ia~~~d~~~l~G~g~~~~~~~~~~~~~~~~ 149 (304) T protein:vir:10 73 IQTSKPEYAQAEMEAKKIGVIIPLSKEFLKWT---AKDFFNEVKPLIAEAFYKAFDQAVIFGTKSPYNTSTSGKPLVEGA 149 (304) T ss_pred cccccceeeEEEEEEEEEEEeehhhHHHHhcc---hHHHHHHHHHHHHHHHHHHHHhhheeccCCCcccccccccccccc Confidence 99999999999999999999999887544433 477888888999999999999999999987766665554444332 Q ss_pred cccccccccccCHHHHHHHHHHHHHHHHHHhCCceeccCCcEEEecHHHHHhccc-CCCCCccHHHHHHHhCCccEEEEc Q lcl|Aclame:pro 189 ITATTPWSGSPAVEAVVNEVVTLFQVLQTQSQGIITQEAVLHMGLPPTAMSDLSK-TNQYGLSAAAKLKEIFPKLEFVTI 267 (336) Q Consensus 189 ~~~~t~w~~~~T~~eI~~Di~~l~~~l~~~t~g~v~~~~p~tL~Lp~~~~~~Ls~-~~~~~~Tvl~~l~~n~pnl~i~~~ 267 (336) .+.... .++....++||.+++..+...- ..+..++|.++.+..|.+ .+..|.-+++-=-.+.-++.++.. T Consensus 150 ~~~~~~---~~~~~~~~~~i~~~~~~l~~~~------~~~~~~v~~~~~~~~L~~lkd~~G~~l~~~~~~~l~G~PV~~~ 220 (304) T protein:vir:10 150 EEKGNV---VTDTNNLYVDLSALMATIEDEE------LDPNGVLTTRSFRSKMRNALDANDRPLFDANGNEIMGLPLSYT 220 (304) T ss_pred cccccc---cccccchHHHHHHHHHHhhhcc------CCcCEEEEcHHHHHHHHHhhccCCcEeecCCCccccceeeEEe Confidence 222211 1122346889999888775431 135579999999998864 333333222100000111233322 Q ss_pred ccccCCCCceEEEEEE-------eeCCCceEEEEeCchh--------hcccce---ecCCceEEeeecceeeeEEecccc Q lcl|Aclame:pro 268 PEYDTASGRLVQLWAP-------RVEGKDTATCGFTEKM--------RAHSIE---RYSSYFRQKKSAGTWGAVIFRPFA 329 (336) Q Consensus 268 pel~~a~G~~~~~~~~-------~~~~~~~~~~~~p~~~--------~~l~~~---~~~~~~~v~~~~rt~Gv~ir~P~a 329 (336) +.+...++....++.+ .+.+ .++.+-..- ...+.. ...-....-++.|+++.++ +|.| T Consensus 221 ~~~~~~~~~~~~~~gd~~~~~~~~~~~---~~i~~~~e~~~~~~~~~~~~g~~~~~f~~~~~~~r~~~r~~~~v~-~~~a 296 (304) T protein:vir:10 221 GADVYDKKKSLALMGDWDYARYGILQG---IEYAISEDATLTTLQASDASGQPVSLFERDMFALRATMHIAYMNV-KPEA 296 (304) T ss_pred cccccCCCCcEEEEEehhhEEEEEecc---eEEEEeecceeeeecccccCccchhhhhcCcEEEEEEEEeccEee-cccc Confidence 3332222222222211 1111 011110000 000000 0111244556667666554 5999 Q ss_pred eeeeccC Q lcl|Aclame:pro 330 VAQMIGV 336 (336) Q Consensus 330 i~~~~GI 336 (336) |+.+..- T Consensus 297 ~~~l~~a 303 (304) T protein:vir:10 297 FATLKPT 303 (304) T ss_pred eEEEEec Confidence 9999888 No 20 >protein:vir:94142 Length: 304 # NCBI annotation: ORF013 # Family: family:all:507 # MgeID: mge:1494 # MgeName: 96 # Cross-refs: genbank:acc:YP_240234;genbank:gi:66395898;genbank:GeneID:5133311 Probab=98.45 E-value=3.2e-08 Score=61.66 Aligned_cols=282 Identities=10% Similarity=0.011 Sum_probs=157.2 Q ss_pred hhhhhhhhcCccccCCcch--HHHHHHHhhCceeeeeeccccchhhhcccccCCCcceeeEEEeeeecccceEEeecccC Q lcl|Aclame:pro 31 YAMDAADLSPHLSSTGSSG--IPNYLTTYVDPSVIDILVAPMKAAELVGESKKGDWTTLVAAFITAEPTTTVATYGDYSS 108 (336) Q Consensus 31 ~~~da~d~~~~l~t~~~~~--i~~~l~~~idp~v~~~~~~~~~~~~l~~v~t~g~w~~~t~~~~v~e~~G~a~~ygd~~D 108 (336) ||.-...+.- ..++++++ ||..+. .++++.+........++.+...+. ....+++.+..+.+..++.... T Consensus 1 ma~~~~~~~~-~~~t~~gg~lip~~~~----~~ii~~~~~~~~l~~~~~~~~~~~---~~~~ip~~~~~~~a~~v~E~~~ 72 (304) T protein:vir:94 1 MATPTYTPGN-VILSDFKNGVIPAEQG----TLIMKDIMANSAIMKLAKNEPMTA---QKKKFTYLAKGVGAYWVSETER 72 (304) T ss_pred Cccccccccc-ccccCCCceecchhHH----HHHHHHHHhccchhhhcceeeccC---CceEEEEEeCCcceEEeecCcc Confidence 4333222222 22233333 666665 344555555555555655544332 3356788887888889998889 Q ss_pred CceeeeeeeeeeeeEEEEEEEEEeCHHHHHHHHHhCCCHHHHHHHHHHHHHHHhhccEEEeeccccceEEEEecCCCCcc Q lcl|Aclame:pro 109 DGDSGTNINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASELNYSSALGLAKFLNGSYLFGVAGLENYGLINDPSLSAP 188 (336) Q Consensus 109 iP~vd~~~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~K~~aAr~a~e~~~n~i~~~Gd~~~g~~GllN~Pnl~~~ 188 (336) +|..+..........+.++..+.+|.+=++.+ ..++.+.-.....+++.+.+++-+++|++..+-.|.+.+..+... T Consensus 73 ~~~~~~~~~~i~~~~~k~~~~~~iS~ell~ds---~~~l~~~i~~~l~~~ia~~~d~~~l~G~g~~~~~~~~~~~~~~~~ 149 (304) T protein:vir:94 73 IQTSKPEYAQAEMEAKKIGVIIPLSKEFLKWT---AKDFFNEVKPLIAEAFYKAFDQAVIFGTKSPYNTSTSGKPLVEGA 149 (304) T ss_pred cccccceeeEEEEEEEEEEEeehhhHHHHhcc---hHHHHHHHHHHHHHHHHHHHHhhheeccCCCcccccccccccccc Confidence 99999999999999999999999887544433 477888888999999999999999999987766665554444332 Q ss_pred cccccccccccCHHHHHHHHHHHHHHHHHHhCCceeccCCcEEEecHHHHHhccc-CCCCCccHHHHHHHhCCccEEEEc Q lcl|Aclame:pro 189 ITATTPWSGSPAVEAVVNEVVTLFQVLQTQSQGIITQEAVLHMGLPPTAMSDLSK-TNQYGLSAAAKLKEIFPKLEFVTI 267 (336) Q Consensus 189 ~~~~t~w~~~~T~~eI~~Di~~l~~~l~~~t~g~v~~~~p~tL~Lp~~~~~~Ls~-~~~~~~Tvl~~l~~n~pnl~i~~~ 267 (336) .+.... .++....++||.+++..+...- ..+..++|.++.+..|.+ .+..|.-+++-=-.+.-++.++.. T Consensus 150 ~~~~~~---~~~~~~~~~~i~~~~~~l~~~~------~~~~~~v~~~~~~~~L~~lkd~~G~~l~~~~~~~l~G~PV~~~ 220 (304) T protein:vir:94 150 EEKGNV---VTDTNNLYVDLSALMATIEDEE------LDPNGVLTTRSFRSKMRNALDANDRPLFDANGNEIMGLPLSYT 220 (304) T ss_pred cccccc---cccccchHHHHHHHHHHhhhcc------CCcCEEEEcHHHHHHHHHhhccCCcEeecCCCccccceeeEEe Confidence 222211 1122346889999888775431 135579999999998864 333333222100000111233322 Q ss_pred ccccCCCCceEEEEEE-------eeCCCceEEEEeCchh--------hcccce---ecCCceEEeeecceeeeEEecccc Q lcl|Aclame:pro 268 PEYDTASGRLVQLWAP-------RVEGKDTATCGFTEKM--------RAHSIE---RYSSYFRQKKSAGTWGAVIFRPFA 329 (336) Q Consensus 268 pel~~a~G~~~~~~~~-------~~~~~~~~~~~~p~~~--------~~l~~~---~~~~~~~v~~~~rt~Gv~ir~P~a 329 (336) +.+...++....++.+ .+.+ .++.+-..- ...+.. ...-....-++.|+++.++ +|.| T Consensus 221 ~~~~~~~~~~~~~~gd~~~~~~~~~~~---~~i~~~~e~~~~~~~~~~~~g~~~~~f~~~~~~~r~~~r~~~~v~-~~~a 296 (304) T protein:vir:94 221 GADVYDKKKSLALMGDWDYARYGILQG---IEYAISEDATLTTLQASDASGQPVSLFERDMFALRATMHIAYMNV-KPEA 296 (304) T ss_pred cccccCCCCcEEEEEehhhEEEEEecc---eEEEEeecceeeeecccccCccchhhhhcCcEEEEEEEEeccEee-cccc Confidence 3332222222222211 1111 011110000 000000 0111244556667666554 5999 Q ss_pred eeeeccC Q lcl|Aclame:pro 330 VAQMIGV 336 (336) Q Consensus 330 i~~~~GI 336 (336) |+.+..- T Consensus 297 ~~~l~~a 303 (304) T protein:vir:94 297 FATLKPT 303 (304) T ss_pred eEEEEec Confidence 9999888 No 21 >protein:vir:1433 Length: 435 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:30 # MgeName: phiE125 # Cross-refs: genbank:acc:NP_536362;genbank:gi:17975167;genbank:GeneID:929171 Probab=98.43 E-value=3.6e-08 Score=61.39 Aligned_cols=316 Identities=12% Similarity=0.071 Sum_probs=155.0 Q ss_pred CchHHHHHHHhh------------------------------cceecc-------chhhhhhh--hhhhhhhhhhhhcCc Q lcl|Aclame:pro 1 MRDAQRIQNLAR------------------------------AGVILP-------RSVKNVST--PLAEYAMDAADLSPH 41 (336) Q Consensus 1 m~~~~~~~~l~~------------------------------~g~~~~-------~~~~~~~~--~~~~~~~da~d~~~~ 41 (336) +...++..++.. .|..|. .+...... ............... T Consensus 52 I~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 131 (435) T protein:vir:14 52 IERAEAAERMAAAAAVPVDPNPTAVAAPAAAPVHAQPKALEVKGAKMARMVRALAAARGDAQLASKLAIERGFGEEVAMS 131 (435) T ss_pred HHHHHHHHHHHHhhcccccchhhhhhhccccccccccchhhhhHHHHHHHHHHHHhhcchhhHHHHHHHhhhhhhhhhhh Confidence 111111110000 000000 00000000 000000000001111 Q ss_pred cc--cCCcc--hHHHHHHHhhCceeeeeeccccchhhhcccccCCCcceeeEEEeeeecccceEEeecccCCceeeeeee Q lcl|Aclame:pro 42 LS--STGSS--GIPNYLTTYVDPSVIDILVAPMKAAELVGESKKGDWTTLVAAFITAEPTTTVATYGDYSSDGDSGTNIN 117 (336) Q Consensus 42 l~--t~~~~--~i~~~l~~~idp~v~~~~~~~~~~~~l~~v~t~g~w~~~t~~~~v~e~~G~a~~ygd~~DiP~vd~~~~ 117 (336) ++ +...+ .+|..+.+ +|++.+.+......+.. +.-......+.|++.+..+.+...+....+|..+.... T Consensus 132 ~~~~t~~~gg~~vP~~~~~----~ii~~l~~~~~i~~~~~--~~~~~~~~~~~~p~~~~~~~a~~v~E~~~~~~~~~~f~ 205 (435) T protein:vir:14 132 LNTLSPGAGGVLVPENLSS----EVIELLRPKSVVRKLGA--RTLPLSNGNITIPRLKGGAIVGYIGADTDIPTTQQQFD 205 (435) T ss_pred cccCCcCCCccccchhHHH----HHHHHHhhhchhhhhcc--eeeecCCCceEEEEEeCCcceeeeccCcccccccccee Confidence 11 12222 25655443 33443333333333311 11111223467888888888888888888998888888 Q ss_pred eeeeeEEEEEEEEEeCHHHHHHHHHhCCCHHHHHHHHHHHHHHHhhccEEEeeccc-cceEEEEecCCCCcccccccccc Q lcl|Aclame:pro 118 YPQRQSYFFQTWTRWGERELEMAGAGRVDLASELNYSSALGLAKFLNGSYLFGVAG-LENYGLINDPSLSAPITATTPWS 196 (336) Q Consensus 118 ~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~K~~aAr~a~e~~~n~i~~~Gd~~-~g~~GllN~Pnl~~~~~~~t~w~ 196 (336) ..+-..+.++..+.+|.+=+.-+ ..+.+|.+.-......++.+.+++-.++|++. ....|+++........+. +. T Consensus 206 ~i~~~~~k~~~~~~iS~ell~ds-~~~~~l~~~i~~~l~~ai~~~~d~a~l~G~G~~~~p~Gi~~~~~~~~~~~~-~~-- 281 (435) T protein:vir:14 206 DLKLTAKKMAALVPIANDLIKYA-GVNPNVDQIVVGDLTAAIGAREDKAFIRDDGTANTPKGLRFWALPSNVITA-SD-- 281 (435) T ss_pred EEEeeeEEEEEeehhhHHHHHhh-ccCHHHHHHHHHHHHHHHHHHHHHHhhccCCCCccccceeecccccceecc-cc-- Confidence 88888888888888875433332 22345778888888888889999989999874 457899986544332222 21 Q ss_pred cccCHHHHHHHHHHHHHHHHHHhCCceeccCCcEEEecHHHHHhccc-CCCCCccHHHHHHH----hCCccEEEEccccc Q lcl|Aclame:pro 197 GSPAVEAVVNEVVTLFQVLQTQSQGIITQEAVLHMGLPPTAMSDLSK-TNQYGLSAAAKLKE----IFPKLEFVTIPEYD 271 (336) Q Consensus 197 ~~~T~~eI~~Di~~l~~~l~~~t~g~v~~~~p~tL~Lp~~~~~~Ls~-~~~~~~Tvl~~l~~----n~pnl~i~~~pel~ 271 (336) .+|.+.+.+|+.+++..+.....+. .+..++|.+..+..|.. .+..|.-++.-+.. -+|=+....+|.-. T Consensus 282 -~~~~~~~~~~~~~l~~~~~~~~~~~----~~~~~v~n~~~~~~L~~lkd~~G~~l~~~~~~g~l~G~Pv~~~~~~p~~~ 356 (435) T protein:vir:14 282 -ASTLQKIETDLGKVILALENADANL----TQPGWIMAPRTFRFLEGLRDGNGNKVYPELANGMLKGYPVGKTTQVPINL 356 (435) T ss_pred -ccchhhHHHHHHHHHHHhhhccccc----cCCEEEEcHHHHHHHHHhhccCCceeccCCCCCeeecceeEeeccccccc Confidence 2466778899999988887653321 24468999999988864 33334333211110 01211112223322 Q ss_pred CCCCceEEEEEEeeCCCceEEEEeCchhhc--cc--c-----------eecCCceEEeeecceeeeEEecccceeeeccC Q lcl|Aclame:pro 272 TASGRLVQLWAPRVEGKDTATCGFTEKMRA--HS--I-----------ERYSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) Q Consensus 272 ~a~G~~~~~~~~~~~~~~~~~~~~p~~~~~--l~--~-----------~~~~~~~~v~~~~rt~Gv~ir~P~ai~~~~GI 336 (336) +.++....+++-+.. +. .+..-+.++. .+ . -.+ -...+-+..|+++ .+++|.||+++.|+ T Consensus 357 ~~~~~~~~i~~gd~s--~~-~i~~~~~~~~~~~~~~~~~~~~~~~~~~f~~-~~~~~r~~~r~d~-~~~~~~a~~~l~~~ 431 (435) T protein:vir:14 357 GETGKESEIYFTDFG--DV-FIGEEETLEIDYSKEATYKDADGHMVSAFQR-DQTLIRVIAKNDF-GPRHVESIAVLAGV 431 (435) T ss_pred cCCCccceEEEeecc--cE-EEEEecccEEEEeccccccccccchhhhhhc-ChhheeeeeeeCc-eeecccceEEEecC Confidence 233333233322211 11 1111111110 00 0 001 1233445666665 88999999999999 No 22 >protein:vir:5739 Length: 366 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:122 # MgeName: PY54 # Cross-refs: genbank:acc:NP_892050;genbank:gi:33770513;interpro:IPR006444;uniprot:Q7Y410;genbank:GeneID:1732928 Probab=98.39 E-value=1.6e-07 Score=57.79 Aligned_cols=303 Identities=13% Similarity=0.071 Sum_probs=153.1 Q ss_pred CchHHHHHHHhh-cceeccchhh----hhhhhhhhhhhhhhhhcCccccCC-cch--HHHHHHHhhCceeeeeeccc--- Q lcl|Aclame:pro 1 MRDAQRIQNLAR-AGVILPRSVK----NVSTPLAEYAMDAADLSPHLSSTG-SSG--IPNYLTTYVDPSVIDILVAP--- 69 (336) Q Consensus 1 m~~~~~~~~l~~-~g~~~~~~~~----~~~~~~~~~~~da~d~~~~l~t~~-~~~--i~~~l~~~idp~v~~~~~~~--- 69 (336) +.=.+....|++ .|- +..+.. .+......+ .+++++ ++| ||..+.+ +|++.+.+. T Consensus 30 ~~~~~~~~a~a~~~g~-~~~a~~~a~~~~~~~~~~~---------a~~~~~~~Gg~lvP~~~~~----~ii~~l~~~s~l 95 (366) T protein:vir:57 30 AGMTRMVMSIAAGKGN-LADAAKFAATELGDTGLSM---------AISTAAGSGGALIPQNMQN----EVIELLRDRTVV 95 (366) T ss_pred hhHHHHHHHHHhcccc-hhHHHHHHHHhhcchhhhh---------hccccccCCccccchhHHH----HHHHHHhhhcch Confidence 111122222332 221 111111 111111111 122222 222 5766654 233333222 Q ss_pred cch-hhhcccccCCCcceeeEEEeeeecccceEEeecccCCceeeeeeeeeeeeEEEEEEEEEeCHHHHHHHHHhCCCHH Q lcl|Aclame:pro 70 MKA-AELVGESKKGDWTTLVAAFITAEPTTTVATYGDYSSDGDSGTNINYPQRQSYFFQTWTRWGERELEMAGAGRVDLA 148 (336) Q Consensus 70 ~~~-~~l~~v~t~g~w~~~t~~~~v~e~~G~a~~ygd~~DiP~vd~~~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~ 148 (336) ++. .+.+|.. ...+.+++.+..+.+...+...++|..+.......-+.+.++....+|.+=|+.+ ..++. T Consensus 96 ~~lg~~~v~~~------~g~~~~p~~t~~~~a~wv~E~~~~~~s~~~f~~i~~~~~k~~~~~~iS~ell~ds---~~~~~ 166 (366) T protein:vir:57 96 RILGARSIPLP------NGNLSMPRLSGGATAGYVGEGKDVVATGATFDDVKLSAKTMIALVPVSNQLIGRA---GFNVE 166 (366) T ss_pred hhhceeeeecC------CCceEEEEEeCCcceeeeccCccccccccceeEEEEeeEEEEEeehhhHHHHhhh---hHHHH Confidence 121 2222221 2346778877777888889999999999888888999999999888885544433 45688 Q ss_pred HHHHHHHHHHHHHhhccEEEeecc-ccceEEEEecCCCCcccccccccccccCHHHHHHHHHHHHHHHHHHhCCceeccC Q lcl|Aclame:pro 149 SELNYSSALGLAKFLNGSYLFGVA-GLENYGLINDPSLSAPITATTPWSGSPAVEAVVNEVVTLFQVLQTQSQGIITQEA 227 (336) Q Consensus 149 ~~K~~aAr~a~e~~~n~i~~~Gd~-~~g~~GllN~Pnl~~~~~~~t~w~~~~T~~eI~~Di~~l~~~l~~~t~g~v~~~~ 227 (336) +.-+.....++.+.+++-.++|+. +..-.|++|.+.........+. ...+...+..++..+.........+ -. T Consensus 167 ~~i~~~l~~a~~~~~d~a~l~G~G~~~~p~Gi~~~~~~~~~~~~~~~--t~~~~~~~~~~~~~~~~~~~~~~~~----~~ 240 (366) T protein:vir:57 167 QLLLGDILSAIATREDKAFLRDDGTGDTPKGMKAVATAANRLVAWTG--TAINLTTIDEYLDSLILKHMDSNSN----MI 240 (366) T ss_pred HHHHHHHHHHHHHHHHHHhhccCCCCccccceeeccccccceeeccc--cccchhhHHHHHHHHHHhhhccccc----cc Confidence 888888888999999999999997 4577899997654322221111 1223344444444333222211111 12 Q ss_pred CcEEEecHHHHHhccc-CCCCCccHHHHHHH----hCCccEEEEcccccCCCCceEEEEEEeeCCCceEEEEeCchhhc- Q lcl|Aclame:pro 228 VLHMGLPPTAMSDLSK-TNQYGLSAAAKLKE----IFPKLEFVTIPEYDTASGRLVQLWAPRVEGKDTATCGFTEKMRA- 301 (336) Q Consensus 228 p~tL~Lp~~~~~~Ls~-~~~~~~Tvl~~l~~----n~pnl~i~~~pel~~a~G~~~~~~~~~~~~~~~~~~~~p~~~~~- 301 (336) ....+|.+..+..|.+ ++..|..++.-+.. .||=+.-..+|.-.+++++...+++-+. .+. .+.....++. T Consensus 241 ~a~~vmn~~~~~~L~~lkd~~G~~l~~~~~~g~l~G~Pvv~s~~ip~~~~~~~~~~~i~~gdf--s~~-~i~~~~~i~i~ 317 (366) T protein:vir:57 241 RCGWGLSNRTYMTLFGLRDGNGNKVYPEMSQGILKGYPIQRTSAIPANLGDDGNESEIYFCDF--NDV-VIGEDGMMKVD 317 (366) T ss_pred cCEEEecHHHHHHHHhhhccCCceeccCCCCCeecceeeEEccccccccccCCCccEEEEEec--ceE-EEEEecceEEE Confidence 4468899999888864 34445544422211 1332222234443333444333333221 111 1111111100 Q ss_pred -c-------cc-e----ecCCceEEeeecceeeeEEecccceeeeccC Q lcl|Aclame:pro 302 -H-------SI-E----RYSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) Q Consensus 302 -l-------~~-~----~~~~~~~v~~~~rt~Gv~ir~P~ai~~~~GI 336 (336) . +. + .+.-...+-+..++.+ .+++|.||+++.|| T Consensus 318 ~~~ea~~~~~~g~~~~~f~~~~~~iR~~~~~d~-~v~~~~a~~~lt~~ 364 (366) T protein:vir:57 318 FSTEATYKDADGQLVSAFARNQSLIRVVTEHDI-GFRHPEGLVLGTGV 364 (366) T ss_pred EeeccccccccccchhhhhcCceeEEeeeeeCc-EeeccccEEEEecc Confidence 0 00 0 0111244555555554 55999999999999 No 23 >protein:vir:94771 Length: 298 # NCBI annotation: major head protein # Family: family:all:966 # MgeID: mge:1529 # MgeName: phi LC3 # Cross-refs: genbank:acc:NP_996706;genbank:gi:45597421;genbank:GeneID:2769044 Probab=98.30 E-value=5.6e-08 Score=60.35 Aligned_cols=274 Identities=11% Similarity=0.038 Sum_probs=148.0 Q ss_pred hhhhhhhhcCccccCCcchHHHHHHHhhCceeeeeeccccchhhhcccccCCCcceeeEEEeeeecccceEEeecccCCc Q lcl|Aclame:pro 31 YAMDAADLSPHLSSTGSSGIPNYLTTYVDPSVIDILVAPMKAAELVGESKKGDWTTLVAAFITAEPTTTVATYGDYSSDG 110 (336) Q Consensus 31 ~~~da~d~~~~l~t~~~~~i~~~l~~~idp~v~~~~~~~~~~~~l~~v~t~g~w~~~t~~~~v~e~~G~a~~ygd~~DiP 110 (336) |+. .....+|..+.+ ++++.+.+.-....+.++.+.+. ....+++....++|..++...++| T Consensus 1 ma~-----------~gG~lip~~~~~----~ii~~~~~~s~i~~~~~~~~~~~---~~~~~p~~~~~~~a~~v~Eg~~~~ 62 (298) T protein:vir:94 1 MVL-----------NKGTLFDPELVT----DLISKVAGKSSIARLSAQKPIPF---NGEKVFTFTMDSEIDVVAESGKKT 62 (298) T ss_pred Cee-----------ccccccChhHHH----HHHHHHHhhchhhhhcceeeccC---CceEEEEEecCcceEEeeCCcccc Confidence 222 222334554442 34444454445566666544333 245778888888899999999999 Q ss_pred eeeeeeeeeeeeEEEEEEEEEeCHHHHHHHHHhCCCHHHHHHHHHHHHHHHhhccEEEeecccc-----ceEEEEecCCC Q lcl|Aclame:pro 111 DSGTNINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASELNYSSALGLAKFLNGSYLFGVAGL-----ENYGLINDPSL 185 (336) Q Consensus 111 ~vd~~~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~K~~aAr~a~e~~~n~i~~~Gd~~~-----g~~GllN~Pnl 185 (336) ..+.......-..+.++....+|.+=++...-...+|.+.-+...++++.+.++.-.++|.... ...|..+..+. T Consensus 63 ~~~~~f~~v~l~~~k~~~~~~iS~ell~~~~~~~~~l~~~i~~~la~ai~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~ 142 (298) T protein:vir:94 63 HGGVTLAPQTMVPIKVEYGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSK 142 (298) T ss_pred ccccceeEEEEeeeEEEEeeehhHHHhccCCccHHHHHHHHHHHHHHHHHHHHHHHhhcccccCCCcccccccccccccc Confidence 9999888888888899988888866454444556678888888899999999999999885321 11222111111 Q ss_pred CcccccccccccccCHHHHHHHHHHHHHHHHHHhCCceeccCCcEEEecHHHHHhccc-CCCCCccHHHHHHHh-CC--- Q lcl|Aclame:pro 186 SAPITATTPWSGSPAVEAVVNEVVTLFQVLQTQSQGIITQEAVLHMGLPPTAMSDLSK-TNQYGLSAAAKLKEI-FP--- 260 (336) Q Consensus 186 ~~~~~~~t~w~~~~T~~eI~~Di~~l~~~l~~~t~g~v~~~~p~tL~Lp~~~~~~Ls~-~~~~~~Tvl~~l~~n-~p--- 260 (336) ...... . .+....+++||.+++..+...- ..+..++|.++.+..|.+ .+..|.-++.=...+ -| T Consensus 143 ~~~~~~----~-~~~~~~~~~~i~~~~~~~~~~~------~~~~~~vmn~~~~~~l~~lkd~~G~~l~~~~~~~~~~~tl 211 (298) T protein:vir:94 143 VTQKVE----A-PRGIADPNGAIENAVELLTGVD------ADVTGIAINPSFRSALAKQKDLQGNALFPELKWGATPDTI 211 (298) T ss_pred cccccc----c-ccccccHHHHHHHHHHhhhhcC------CCccEEEEcHHHHHHHHHhhccCCCeeecCcccCCCCcee Confidence 100000 1 1223457889999988775531 235579999999988854 333343232111111 01 Q ss_pred -ccEEEEcccccCC-CCceEEEEEEeeCCCceEEEEeC--chhhcccc-ee--------cCCceEEeeecceeeeEEecc Q lcl|Aclame:pro 261 -KLEFVTIPEYDTA-SGRLVQLWAPRVEGKDTATCGFT--EKMRAHSI-ER--------YSSYFRQKKSAGTWGAVIFRP 327 (336) Q Consensus 261 -nl~i~~~pel~~a-~G~~~~~~~~~~~~~~~~~~~~p--~~~~~l~~-~~--------~~~~~~v~~~~rt~Gv~ir~P 327 (336) ++.++....+.+. ++....+++-+. .+...+.+- +.+...+- .. +.-...+-++.|. |+.+++| T Consensus 212 ~G~PV~~~~~v~~~~~~~~~~~~~Gdf--s~~~~~~~~~~~~~~~~~~~~~d~~~~~~f~~~~v~~r~~~r~-~~~~~~~ 288 (298) T protein:vir:94 212 NGLPVDVNKTVSDMSLTQRDRAIIGDF--ANGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFL-GWGILDA 288 (298) T ss_pred cceeeEEecccccccCCCccEEEEeec--cceEEEEEecCceEEEeecCCCcCcchhhhhcCcEEEEEEEEe-ccEeecc Confidence 1222222222211 222222222211 111111110 11111110 00 0112334455555 5667779 Q ss_pred cceeeeccC Q lcl|Aclame:pro 328 FAVAQMIGV 336 (336) Q Consensus 328 ~ai~~~~GI 336 (336) .||+++.|. T Consensus 289 ~a~~~l~~~ 297 (298) T protein:vir:94 289 TKFARVTEA 297 (298) T ss_pred cceEEEEec Confidence 999999999 No 24 >protein:vir:99920 Length: 311 # NCBI annotation: gp7 # Family: family:all:966 # MgeID: mge:1611 # MgeName: Halo # Cross-refs: genbank:acc:YP_655524;genbank:gi:109392294;genbank:GeneID:4157089 Probab=98.22 E-value=1.5e-07 Score=57.93 Aligned_cols=280 Identities=13% Similarity=0.046 Sum_probs=156.4 Q ss_pred hhhhhhhhcCccccCCcchHHHHHHHhhCceeeeeeccccchhhhcccccCCCcceeeEEEeeeecccceEEeecccCCc Q lcl|Aclame:pro 31 YAMDAADLSPHLSSTGSSGIPNYLTTYVDPSVIDILVAPMKAAELVGESKKGDWTTLVAAFITAEPTTTVATYGDYSSDG 110 (336) Q Consensus 31 ~~~da~d~~~~l~t~~~~~i~~~l~~~idp~v~~~~~~~~~~~~l~~v~t~g~w~~~t~~~~v~e~~G~a~~ygd~~DiP 110 (336) || .+++.....+|..+.+ +|++.+.+......+..+...+. ....|++....+.|..+|....+| T Consensus 1 Ma--------t~tt~~g~~vP~~~~~----~ii~~~~~~s~l~~~~~~i~~~~---~~~~~p~~~~~~~a~wv~Eg~~~~ 65 (311) T protein:vir:99 1 MA--------TFGTGNLKNLPRNIAD----GMVKDVVQGSTVAVLSARKPQRF---GNEDIITFNGRPKAEFVGEGQQKS 65 (311) T ss_pred Cc--------eecCCCceeccHHHHH----HHHHHHHhhchhhhhcceeeccC---CceEEEEEeCCceeEEeecCcccc Confidence 22 1233333345665543 33444444444455544433221 335788888888999999999999 Q ss_pred eeeeeeeeeeeeEEEEEEEEEeCHHHHHHHHHhCCCHHHHHHHHHHHHHHHhhccEEEeecc---ccceEEEEecCCCCc Q lcl|Aclame:pro 111 DSGTNINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASELNYSSALGLAKFLNGSYLFGVA---GLENYGLINDPSLSA 187 (336) Q Consensus 111 ~vd~~~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~K~~aAr~a~e~~~n~i~~~Gd~---~~g~~GllN~Pnl~~ 187 (336) ..+....+..-..+.++..+..|.+=++..-....+|...-....++++.+.+++-.++|+. +.+..|+.|-..... T Consensus 66 ~~~~~f~~v~l~~~k~~~~~~iS~ell~~~~d~~~~l~~~i~~~la~ai~~~~d~~~l~G~g~~~g~~~~g~~~~~~~~~ 145 (311) T protein:vir:99 66 STTGEFDFVTSTPKKAQVTMRFNEEVQWADEDYQLGVLQTLSEAGAEALARALDLGLYHRINPLTGTVIPGWSNYLGAAS 145 (311) T ss_pred cccceeeEEEEeeEEEEEeehhhHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHhhcccCcccCcccccccccccccc Confidence 99998888888889999988888664444446678899999999999999999999999986 344455555322221 Q ss_pred ccccccccccccCHHHHHHHHHHHHHHHHHHhCCceeccCCcEEEecHHHHHhccc-CCCCCccHHHHHHH--------h Q lcl|Aclame:pro 188 PITATTPWSGSPAVEAVVNEVVTLFQVLQTQSQGIITQEAVLHMGLPPTAMSDLSK-TNQYGLSAAAKLKE--------I 258 (336) Q Consensus 188 ~~~~~t~w~~~~T~~eI~~Di~~l~~~l~~~t~g~v~~~~p~tL~Lp~~~~~~Ls~-~~~~~~Tvl~~l~~--------n 258 (336) ..... ...+......||..++..+...... -.++.++|.+..+..|.+ .+..|.-+++-... - T Consensus 146 ~~~~~----~~~~~~~~~~~i~~~~~~~~~~~~~----~~~~~~vmn~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~l~G 217 (311) T protein:vir:99 146 KRVEL----TADTIANPDLAIEAAVGLLVANGHP----TPVNGLALHPSIAWGLSTARYTDGRKKFPELGLGIGVSSFEG 217 (311) T ss_pred ceeec----cccccchhHHHHHHHHHHHhhhccC----CCccEEEEcHHHHHHHHhhhccCCCeeecCcccCCCCceecc Confidence 11101 1122334567788887766655432 235569999998888854 33334333221100 0 Q ss_pred CCccEEEEcccccCC--------CCceEEEEEEeeCCCceEEEEeCchh--hcccc----e----ecCCceEEeeeccee Q lcl|Aclame:pro 259 FPKLEFVTIPEYDTA--------SGRLVQLWAPRVEGKDTATCGFTEKM--RAHSI----E----RYSSYFRQKKSAGTW 320 (336) Q Consensus 259 ~pnl~i~~~pel~~a--------~G~~~~~~~~~~~~~~~~~~~~p~~~--~~l~~----~----~~~~~~~v~~~~rt~ 320 (336) +|-+.-..+|.-... .+....+++-+. .+-..+.+.... ..... . ...-..-+-|+.|++ T Consensus 218 ~Pv~~s~~i~~~~~~~~~~~~~~~~~~~~~~~Gdf--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~r~~~r~d 295 (311) T protein:vir:99 218 IDASVSDTVNGGDEADPDDEDLDAARAVRGIVGDF--ANGIHWGVQRDIPVELIKYGDPDGQGDLKRHNQIALRLEIVYG 295 (311) T ss_pred eeeEeecccccccccccccchhhccCcceEEEeec--cccEEEEEecCceEEEeecCCCCcchhhhhcCcEEEEEEEeec Confidence 121111122221111 122223333221 111111111111 11110 0 111235677899999 Q ss_pred eeEEecccceeeeccC Q lcl|Aclame:pro 321 GAVIFRPFAVAQMIGV 336 (336) Q Consensus 321 Gv~ir~P~ai~~~~GI 336 (336) +. |++|.+++..++. T Consensus 296 ~~-v~~~~~v~~~~~~ 310 (311) T protein:vir:99 296 WY-VFTDRFVVIENAV 310 (311) T ss_pred ce-ecChhHeeeeccc Confidence 97 5679888888888 No 25 >protein:vir:9574 Length: 300 # NCBI annotation: gp40 # Family: family:all:966 # MgeID: mge:171 # MgeName: SM1 # Cross-refs: genbank:acc:NP_862879;genbank:gi:32469471;genbank:GeneID:1461316 Probab=98.21 E-value=1.3e-07 Score=58.28 Aligned_cols=275 Identities=11% Similarity=0.009 Sum_probs=152.9 Q ss_pred hhhhhhhhcCccccCCcchHHHHHHHhhCceeeeeeccccchhhhcccccCCCcceeeEEEeeeecccceEEeecccCCc Q lcl|Aclame:pro 31 YAMDAADLSPHLSSTGSSGIPNYLTTYVDPSVIDILVAPMKAAELVGESKKGDWTTLVAAFITAEPTTTVATYGDYSSDG 110 (336) Q Consensus 31 ~~~da~d~~~~l~t~~~~~i~~~l~~~idp~v~~~~~~~~~~~~l~~v~t~g~w~~~t~~~~v~e~~G~a~~ygd~~DiP 110 (336) ||-. .+.++.+ +|..+. +++++.+...-....+.++.+.+. ....|++.+..+.|..++...+.| T Consensus 1 ma~~-t~~~G~l-------ip~~~~----~~ii~~l~~~s~i~~l~~~~~~~~---~~~~~p~~~~~~~a~wv~Eg~~~~ 65 (300) T protein:vir:95 1 MSEA-QLSKGNL-------FNPELV----TKVINKVKGHSSIAKLSPQKPIPF---NGQREFVFDFDSDIDIVAENGKKT 65 (300) T ss_pred Cccc-ccCCcce-------echhhH----HHHHHHHHhhhhhhhhcceeeccC---CceEEEEEecCcceEEeeCCcccc Confidence 4432 2233322 344333 344554444444555555543322 245788888888999999999999 Q ss_pred eeeeeeeeeeeeEEEEEEEEEeCHHHHHHHHHhCCCHHHHHHHHHHHHHHHhhccEEEeecc-----ccceEEEEecCCC Q lcl|Aclame:pro 111 DSGTNINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASELNYSSALGLAKFLNGSYLFGVA-----GLENYGLINDPSL 185 (336) Q Consensus 111 ~vd~~~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~K~~aAr~a~e~~~n~i~~~Gd~-----~~g~~GllN~Pnl 185 (336) ..+...+...-+.+.++....+|.+=+++......++.+.-....++++.+.+++-.++|+. +....|..+.+.. T Consensus 66 ~s~~~f~~v~l~~~k~~~~~~iS~ell~~~~d~~~~l~~~i~~~l~~aia~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~ 145 (300) T protein:vir:95 66 HGGVSLDPVTIVPLKVEYGARVSDEFLHASEEAKVDMLTDFVEGFSKKLARGLDIMSIHGINPRTKQASTIIGDNCFDKK 145 (300) T ss_pred cccccceeeEeeeEEEEEeehhhHHHhccCCCCHHHHHHHHHHHHHHHHHHHHHHhhhhcccCCCCCCcccccccccccc Confidence 99999998898999999988888664444445667888888889999999999999999962 3444555554443 Q ss_pred CcccccccccccccCHHHHHHHHHHHHHHHHHHhCCceeccCCcEEEecHHHHHhccc-CCCCCccHHHHHHHh--CC-- Q lcl|Aclame:pro 186 SAPITATTPWSGSPAVEAVVNEVVTLFQVLQTQSQGIITQEAVLHMGLPPTAMSDLSK-TNQYGLSAAAKLKEI--FP-- 260 (336) Q Consensus 186 ~~~~~~~t~w~~~~T~~eI~~Di~~l~~~l~~~t~g~v~~~~p~tL~Lp~~~~~~Ls~-~~~~~~Tvl~~l~~n--~p-- 260 (336) ...+... +....++||.+++..+... ++ .+..++|.++.+..|.+ .+..|..++.-.... .. T Consensus 146 ~~~~~~~-------~~~~~~~~i~~~~~~~~~~-~~-----~~~~~vmn~~~~~~L~~lkd~~G~~i~~~~~~~~~~~~l 212 (300) T protein:vir:95 146 VTQTVPF-------KDTNPDESMEDAVGMIDGS-ER-----DITGAILDPIFTTALSKMKNAEGGKLYPELAWGGVPDAI 212 (300) T ss_pred cceeecc-------cccchHHHHHHHHHHhhhc-CC-----CccEEEECHHHHHHHHHhhccCCCeeccCccccCCCcee Confidence 2211111 1112356788887766442 21 35579999999988864 344454443211111 11 Q ss_pred -ccEEEEcccccCC-CCceEEEEEEeeCCCceEEEEeCc--hhhcccc-e--------ecCCceEEeeecceeeeEEecc Q lcl|Aclame:pro 261 -KLEFVTIPEYDTA-SGRLVQLWAPRVEGKDTATCGFTE--KMRAHSI-E--------RYSSYFRQKKSAGTWGAVIFRP 327 (336) Q Consensus 261 -nl~i~~~pel~~a-~G~~~~~~~~~~~~~~~~~~~~p~--~~~~l~~-~--------~~~~~~~v~~~~rt~Gv~ir~P 327 (336) ++.++........ .+....+++-+. .+...+.+-+ .+...+. . ...-.+-+-++.|+ |+.|++| T Consensus 213 ~G~Pv~~s~~v~~~~~~~~~~~~~GDf--~~~~~~~~~~~~~~~v~~~~~~d~~~~~~f~~~~v~~r~~~r~-d~~v~~~ 289 (300) T protein:vir:95 213 NGLAVDKNRTVSYSQTDPKNTAIVGDF--ETMFKWGYAKEVPMEIIKYGDPDNSGRDLKGYNQIYIRCEAYI-GWGIMDA 289 (300) T ss_pred cceeeEEecCCCCCCCCCccEEEEeec--cceEEEEEecccEEEEeeccCCCCcchhhhhcCcEEEEEEEee-cceeecc Confidence 1223222222221 222222332111 0101011100 0111110 0 01112444556666 5567779 Q ss_pred cceeeeccC Q lcl|Aclame:pro 328 FAVAQMIGV 336 (336) Q Consensus 328 ~ai~~~~GI 336 (336) .||+++.|. T Consensus 290 ~a~~~l~~~ 298 (300) T protein:vir:95 290 ASFARIVKT 298 (300) T ss_pred cceEEEecC Confidence 999999999 No 26 >protein:vir:94673 Length: 419 # NCBI annotation: major capsid protein # Family: family:all:585 # MgeID: mge:1527 # MgeName: mu1/6 # Cross-refs: genbank:acc:YP_579208;genbank:gi:93007444;genbank:GeneID:5076792 Probab=98.18 E-value=6.6e-07 Score=54.47 Aligned_cols=307 Identities=11% Similarity=0.089 Sum_probs=152.9 Q ss_pred CchHH--------------------------------HHHHHh---hcceeccchhhhhhhhhhhhhhhhhhhcCccccC Q lcl|Aclame:pro 1 MRDAQ--------------------------------RIQNLA---RAGVILPRSVKNVSTPLAEYAMDAADLSPHLSST 45 (336) Q Consensus 1 m~~~~--------------------------------~~~~l~---~~g~~~~~~~~~~~~~~~~~~~da~d~~~~l~t~ 45 (336) .+... .++.+. +.|..- .....+. ......++ ..++...++ T Consensus 56 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~-~~~~~~~~--~~~~~~~~~ 131 (419) T protein:vir:94 56 LRTAPPAPKGPADGGTPLTPAEAGTFRSLAQRFADSDGLREYRARDKRGQFQ-VEMRDID-PNRLLSRD--APAGTITNP 131 (419) T ss_pred HHHHHHHHHHHhhhhccccccccccccchhhhhhhHHHHHHHHHhhhhhhhh-HHHHHHH-HHHhhccc--cccccccCC Confidence 01000 000000 001000 0000000 00000111 122233334 Q ss_pred CcchHHHHHHHhhCceeeeeeccccchhhhcccccCCC----cceee-EEEeeeecccceEEeecccCCceeeeeeeeee Q lcl|Aclame:pro 46 GSSGIPNYLTTYVDPSVIDILVAPMKAAELVGESKKGD----WTTLV-AAFITAEPTTTVATYGDYSSDGDSGTNINYPQ 120 (336) Q Consensus 46 ~~~~i~~~l~~~idp~v~~~~~~~~~~~~l~~v~t~g~----w~~~t-~~~~v~e~~G~a~~ygd~~DiP~vd~~~~~~~ 120 (336) ...-+|..+...| ......+....+++.+.+... +..++ ....+....+.+.+.+.+...|..+....... T Consensus 132 ~~~~~p~~~~~~i----~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~i~ 207 (419) T protein:vir:94 132 NVPHLPQLVPGIV----PTTPDLPLLVADLLDQQNADYNVLEYIRDTSGTAGAGSTWNKAAVVPEGTAKPQSTLSFDTIT 207 (419) T ss_pred cccccchhhhHHH----HHHHhhhhhhhhcceeeeccCCceeeeeeccccccccccCcccceecCCccccccccceeeEE Confidence 4444566666544 222233333444444432221 11111 11222333456778888888999999999999 Q ss_pred eeEEEEEEEEEeCHHHHHHHHHhCCCHHHHHHHHHHHHHHHhhccEEEeeccccceEEEEecCCCCcccccccccccccC Q lcl|Aclame:pro 121 RQSYFFQTWTRWGERELEMAGAGRVDLASELNYSSALGLAKFLNGSYLFGVAGLENYGLINDPSLSAPITATTPWSGSPA 200 (336) Q Consensus 121 ~~v~~~~~~~~y~~~El~~A~~~g~~l~~~K~~aAr~a~e~~~n~i~~~Gd~~~g~~GllN~Pnl~~~~~~~t~w~~~~T 200 (336) ...+.++....+|.+=++-+. ++.+.-....++++.+.+|+-.++|++.....|++|++.+....+... +...| T Consensus 208 ~~~~k~~~~~~is~ell~d~~----~l~~~i~~~la~a~~~~~d~aii~G~G~~~p~Gi~~~~~~~~~~~~~~--~~~~t 281 (419) T protein:vir:94 208 TTLKTVAHWLPITRQAADDNS----QLMGYIQGRLTYGLRFLRDRQLLNGNGSTEMQGILTTPGIGTYQQPKP--TAPAT 281 (419) T ss_pred eeeeeEEEeehhhHHHHHhHH----HHHHHHHHHHHHHHHHHHHHHHHhccCcccccceeccccccccccccc--ccccc Confidence 999999999999976555432 477877788888888999999999999999999999988765433322 23345 Q ss_pred HHHHHHHHHHHHHHHHHHhCCceeccCCcEEEecHHHHHhcccC-CCCCccH-HH-HHHHhCC----ccEEEEcccccCC Q lcl|Aclame:pro 201 VEAVVNEVVTLFQVLQTQSQGIITQEAVLHMGLPPTAMSDLSKT-NQYGLSA-AA-KLKEIFP----KLEFVTIPEYDTA 273 (336) Q Consensus 201 ~~eI~~Di~~l~~~l~~~t~g~v~~~~p~tL~Lp~~~~~~Ls~~-~~~~~Tv-l~-~l~~n~p----nl~i~~~pel~~a 273 (336) ....++||.+++..+...- -.+..++|.++.+..|... +..|-.+ +. .+..--+ ++.++........ T Consensus 282 ~~~~~~~l~~~~~~~~~~~------~~~~~~v~n~~~~~~l~~~k~~~~~~~~~~~~~~~~~~~~l~G~pV~~~~~~~~~ 355 (419) T protein:vir:94 282 DEPPLVDIRRAKTVAEIAG------FPPDGVVVHPQDWESIELDQAPGSGVFRVIANVQGEATPRIWGLNVVSTVAIAQG 355 (419) T ss_pred cchhHHHHHHHHHhhhhcc------CCCCEEEEcHHHHHHHHHHhhcCCCceeecCCcccCCCccccceeeEEcCCCCCc Confidence 5668999999988875421 1355799999988887532 2222111 10 0110001 1222222222110 Q ss_pred ---CCceE--EEEEEeeCCCceEEEEeCchhhcccc-eecCCceEEeeecceeeeEEecccceeeeccC Q lcl|Aclame:pro 274 ---SGRLV--QLWAPRVEGKDTATCGFTEKMRAHSI-ERYSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) Q Consensus 274 ---~G~~~--~~~~~~~~~~~~~~~~~p~~~~~l~~-~~~~~~~~v~~~~rt~Gv~ir~P~ai~~~~GI 336 (336) -|.-. +.++++ .+ ..+.+ -.+.. -...-....-+..|++|. ++.|-||+++..- T Consensus 356 ~~~~gd~~~~~~~~~~-~~---~~v~~----~~~~~~~~~~~~~~~r~~~r~d~~-v~~~~a~~~~~~~ 415 (419) T protein:vir:94 356 TALVGGFRQGATLWSR-QG---ITVLM----TDSHADFFTANTLVILAEFRANLA-VYQPKAFVRVTFA 415 (419) T ss_pred cEEEeeccceEEEEEe-cc---eEEEE----eccccchhhcCcEEEEEEEeeccE-EeccccEEEEEec Confidence 02111 112111 01 11110 00000 001122344556666655 5779999998877 No 27 >protein:vir:8420 Length: 477 # NCBI annotation: gp15 # Family: family:all:21 # MgeID: mge:155 # MgeName: Omega # Cross-refs: genbank:acc:NP_818316;genbank:gi:29566752;genbank:GeneID:1260033 Probab=98.18 E-value=2e-07 Score=57.31 Aligned_cols=318 Identities=14% Similarity=0.059 Sum_probs=145.4 Q ss_pred Cch---HHHHHHHhh--cceeccchhhhhhh---------hhhhhhhhhhhhcCccccCCcch----HHHHHHHhhCcee Q lcl|Aclame:pro 1 MRD---AQRIQNLAR--AGVILPRSVKNVST---------PLAEYAMDAADLSPHLSSTGSSG----IPNYLTTYVDPSV 62 (336) Q Consensus 1 m~~---~~~~~~l~~--~g~~~~~~~~~~~~---------~~~~~~~da~d~~~~l~t~~~~~----i~~~l~~~idp~v 62 (336) ... ...+..+.. .+-....+...+.. ..+. ..........+++.+.+| +|.++. .+| T Consensus 103 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~gg~lv~~~~~~----~~i 177 (477) T protein:vir:84 103 YEKGNGQSYFRDLAMQTVGMADEPAKERLRRHMVDVESDKEIRK-IAKVGEEYRDLDRNGGTGGYAVPPLWMM----NRF 177 (477) T ss_pred hhhhHHHHHHHHHHHHHhhhhhhHHHHHHHHHHhhhhhhhhHHH-HHHhhhhhccccccCCCcceeeccchhH----HHH Confidence 000 000000000 00000000000000 0000 000011112222222221 233333 244 Q ss_pred eeeeccccchhhhcccccCCCcceeeEEEeeeecccce-EEeeccc-----CCceeeeeeeeeeeeEEEEEEEEEeCHHH Q lcl|Aclame:pro 63 IDILVAPMKAAELVGESKKGDWTTLVAAFITAEPTTTV-ATYGDYS-----SDGDSGTNINYPQRQSYFFQTWTRWGERE 136 (336) Q Consensus 63 ~~~~~~~~~~~~l~~v~t~g~w~~~t~~~~v~e~~G~a-~~ygd~~-----DiP~vd~~~~~~~~~v~~~~~~~~y~~~E 136 (336) ++.+-+......++++.+... ....+.++..+..+.. ...+.++ +.|..+.......-+.+.++..+.+|.+= T Consensus 178 i~~l~~~~~i~~~~~~~~~~~-~~~~~~ip~~~~~~~~a~~~~Eg~~~~~~~~~~s~~~f~~i~~~~~k~~~~~~iS~el 256 (477) T protein:vir:84 178 IELARAGRTYANLCPTEPLPG-GTSSINIPKILTGTSTAIQAADNAALTAPSAHEVDLTDGFVQANVKTIAGQQGIAIQL 256 (477) T ss_pred HHHhhhcchHHHhhceeeecC-CcceeEEEEEecCcceeeeeccCcccccccccccccceeeEEEeeeeEEeeeHHHHHH Confidence 555555555556665543322 2234567665544433 3455542 44777777777788888888888887554 Q ss_pred HHHHHHhCCCHHHHHHHHHHHHHHHhhccEEEeecc-ccceEEEEecCCCCcccccc-cccccccCHHHHHHHHHHHHHH Q lcl|Aclame:pro 137 LEMAGAGRVDLASELNYSSALGLAKFLNGSYLFGVA-GLENYGLINDPSLSAPITAT-TPWSGSPAVEAVVNEVVTLFQV 214 (336) Q Consensus 137 l~~A~~~g~~l~~~K~~aAr~a~e~~~n~i~~~Gd~-~~g~~GllN~Pnl~~~~~~~-t~w~~~~T~~eI~~Di~~l~~~ 214 (336) |.. ...++.+--....+.++...++.-.++|++ +....|++|.+++.....+. +.-|. ..+..+++|.+++.. T Consensus 257 l~d---s~~~l~~~i~~~l~~~~~~~~d~~~l~G~Gt~~~p~Gi~~~~~~~~~~~~~~~~t~~--~~~~~~~~i~~~~~~ 331 (477) T protein:vir:84 257 LDQ---AAVSVDEFVFRDLAADYANKLNVQVISGTGSNNQVVGVRATAGITQVTATSAGSALE--KHQIIYQKIADAIQR 331 (477) T ss_pred Hhc---cchhHHHHHHHHHHHHHHHHHHHHHhccCCCCCccceeeeccccccccccccccchh--hHHHHHHHHHHHHhh Confidence 443 345788888888999999999999999997 45689999998775433221 11111 234455555555554 Q ss_pred HHHHhCCceeccCCcEEEecHHHHHhccc-CCCCCccHH----------HHHHHhC--------CccEEEEcccc---cC Q lcl|Aclame:pro 215 LQTQSQGIITQEAVLHMGLPPTAMSDLSK-TNQYGLSAA----------AKLKEIF--------PKLEFVTIPEY---DT 272 (336) Q Consensus 215 l~~~t~g~v~~~~p~tL~Lp~~~~~~Ls~-~~~~~~Tvl----------~~l~~n~--------pnl~i~~~pel---~~ 272 (336) +.... ...+...+|-|..+..|.+ .+..|.-++ .++..+. -++.++..+.+ .+ T Consensus 332 ~~~~~-----~~~~~~~v~~~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~~~~~~~~~~~~~~l~G~pVv~s~~~p~~~~ 406 (477) T protein:vir:84 332 VHTSR-----FLEPEVIVMHPRRWASFHAIFAGDDRPLIVPSGPGFNNLGVLTEVASQRVVGQMHGLPVVTDPTLPTTLG 406 (477) T ss_pred ccccc-----cCCccEEEEcHHHHHHHHHhhccCCCeeeecCcccccccccccccccccccchhcccceEecCccccccc Confidence 43221 1124467888887777643 222222111 1111110 11223332322 23 Q ss_pred CCCceEEEEEEeeCCCceEEEEeCchhhcccceecC-CceEEeeecceeeeEEecccceeeeccC Q lcl|Aclame:pro 273 ASGRLVQLWAPRVEGKDTATCGFTEKMRAHSIERYS-SYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) Q Consensus 273 a~G~~~~~~~~~~~~~~~~~~~~p~~~~~l~~~~~~-~~~~v~~~~rt~Gv~ir~P~ai~~~~GI 336 (336) ++++...+++-.. .+..-..--+.++..+--... ....+-..+......+|+|.||+.++|. T Consensus 407 ~~~d~~~i~~gd~--~~~~i~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~~~~r~~~afv~~t~~ 469 (477) T protein:vir:84 407 TGTDQDVIHVLRA--SDLALFESSVRMRALQETRAENLSVLLQVYGYLAFTAARFPQSVVEIGGT 469 (477) T ss_pred ccCCcceEEEEEe--ceEEEEeeceeEEeccccccccceeeeeehhhhhhhhhccccceEEeecc Confidence 3444444443322 122111111122222211111 1111111222333677899999999999 No 28 >protein:vir:1638 Length: 298 # NCBI annotation: Structural protein # Family: family:all:966 # MgeID: mge:33 # MgeName: r1t # Cross-refs: genbank:acc:NP_695059;genbank:gi:23455750;genbank:GeneID:955469 Probab=98.18 E-value=2.2e-07 Score=57.09 Aligned_cols=274 Identities=11% Similarity=0.045 Sum_probs=149.2 Q ss_pred hhhhhhhhcCccccCCcchHHHHHHHhhCceeeeeeccccchhhhcccccCCCcceeeEEEeeeecccceEEeecccCCc Q lcl|Aclame:pro 31 YAMDAADLSPHLSSTGSSGIPNYLTTYVDPSVIDILVAPMKAAELVGESKKGDWTTLVAAFITAEPTTTVATYGDYSSDG 110 (336) Q Consensus 31 ~~~da~d~~~~l~t~~~~~i~~~l~~~idp~v~~~~~~~~~~~~l~~v~t~g~w~~~t~~~~v~e~~G~a~~ygd~~DiP 110 (336) ||-+ ....+|..+.+ ++++.+.+......+.++.+... ....+++.+..++|..+|...++| T Consensus 1 ma~~-----------gG~lvp~~~~~----~ii~~~~~~s~i~~l~~~~~~~~---~~~~ip~~~~~~~a~~v~E~~~~~ 62 (298) T protein:vir:16 1 MVLN-----------KGTLFDPTLVT----DLISKVAGKSSIARLSAQKPIPF---NGEKVFTFTMDSEIDVVAESGKKT 62 (298) T ss_pred Cccc-----------CcceechhHHH----HHHHHHHhhhhhhhhcceeeccC---CceEEEEEecCcceEEecCCcccc Confidence 2222 22223333332 33444444444555555443322 335778888889999999999999 Q ss_pred eeeeeeeeeeeeEEEEEEEEEeCHHHHHHHHHhCCCHHHHHHHHHHHHHHHhhccEEEeecc-----ccceEEEEecCCC Q lcl|Aclame:pro 111 DSGTNINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASELNYSSALGLAKFLNGSYLFGVA-----GLENYGLINDPSL 185 (336) Q Consensus 111 ~vd~~~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~K~~aAr~a~e~~~n~i~~~Gd~-----~~g~~GllN~Pnl 185 (336) ..+.......-..+.++....+|.+=++.+.....++.+.-+...++++.+.++.-.++|.. ..+..|+....+. T Consensus 63 ~~~~~f~~v~l~~~k~a~~~~iS~ell~~s~d~~~~l~~~i~~~la~ai~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~ 142 (298) T protein:vir:16 63 HGGVTLAPQTMVPIKVEYGARISDEFMYASDEEKINILQEFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSK 142 (298) T ss_pred ccccceeEEEEeeeeEEEeehhhHHHhhcCcccHHHHHHHHHHHHHHHHHHHHHHHhhccccCCCCcccccccccccccc Confidence 99998888888999999988888776666666678888888889999999999999999953 2223333332221 Q ss_pred CcccccccccccccCHHHHHHHHHHHHHHHHHHhCCceeccCCcEEEecHHHHHhccc-CCCCCccHHHHHHHh-----C Q lcl|Aclame:pro 186 SAPITATTPWSGSPAVEAVVNEVVTLFQVLQTQSQGIITQEAVLHMGLPPTAMSDLSK-TNQYGLSAAAKLKEI-----F 259 (336) Q Consensus 186 ~~~~~~~t~w~~~~T~~eI~~Di~~l~~~l~~~t~g~v~~~~p~tL~Lp~~~~~~Ls~-~~~~~~Tvl~~l~~n-----~ 259 (336) ....+.. .. .....++||.+++..+...- ..+..++|.++.+..|.+ .+..|.-++.-.-.+ . T Consensus 143 ~~~~~~~----~~-~~~~~~~~i~~~~~~~~~~~------~~~~~~vmn~~~~~~l~~lkd~~G~~i~~~~~~~~~~~~l 211 (298) T protein:vir:16 143 VTQKVEA----PR-GIADPNGAIENAVELLTGVD------ADVTGIAINPSFRSALAKQKDLQDNALFPELKWGATPDTI 211 (298) T ss_pred ccccccc----cc-ccccHHHHHHHHHHHhhhcC------CCccEEEEcHHHHHHHHHhhccCCCeeecCcccCCCCcee Confidence 1111111 11 12346788999988775421 134579999999988864 344444343211111 1 Q ss_pred CccEEEEcccccC-CCCceEEEEEEeeCCCceEEEEeCch--hhcccc-eecC--------CceEEeeecceeeeEEecc Q lcl|Aclame:pro 260 PKLEFVTIPEYDT-ASGRLVQLWAPRVEGKDTATCGFTEK--MRAHSI-ERYS--------SYFRQKKSAGTWGAVIFRP 327 (336) Q Consensus 260 pnl~i~~~pel~~-a~G~~~~~~~~~~~~~~~~~~~~p~~--~~~l~~-~~~~--------~~~~v~~~~rt~Gv~ir~P 327 (336) -++.++......+ +++.+..+++-+. .+...+.+... +...+. ...+ -...+-|+.| .|..+++| T Consensus 212 ~G~PV~~~~~v~~~~~~~~~~~~~GDf--s~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~v~~ra~~r-~d~~v~~~ 288 (298) T protein:vir:16 212 NGLPVDVNKTVSDMSLTQRDRAIIGDF--ANGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELF-LGWGILDA 288 (298) T ss_pred cceeeEEecccccccCCCccEEEEeec--cceEEEEEecCceEEEeeccCCcCcchhhhhcCcEEEEEEEE-EccEeecc Confidence 1122222222222 2223333333221 01011111111 111110 0000 1122233334 56678899 Q ss_pred cceeeeccC Q lcl|Aclame:pro 328 FAVAQMIGV 336 (336) Q Consensus 328 ~ai~~~~GI 336 (336) .||+++.|. T Consensus 289 ~a~~~l~~a 297 (298) T protein:vir:16 289 TKFARVTEA 297 (298) T ss_pred cceEEEeec Confidence 999999999 No 29 >protein:vir:8187 Length: 311 # NCBI annotation: gp7 # Family: family:all:966 # MgeID: mge:153 # MgeName: Che9d # Cross-refs: genbank:acc:NP_817980;genbank:gi:29566414;genbank:GeneID:2700968 Probab=98.17 E-value=2e-07 Score=57.26 Aligned_cols=275 Identities=10% Similarity=-0.041 Sum_probs=146.9 Q ss_pred cccCCcc--hHHHHHHHhhCceeeeeeccccchhhhcccccCCCcceeeEEEeeeecccceEEeecccCCceeeeeeeee Q lcl|Aclame:pro 42 LSSTGSS--GIPNYLTTYVDPSVIDILVAPMKAAELVGESKKGDWTTLVAAFITAEPTTTVATYGDYSSDGDSGTNINYP 119 (336) Q Consensus 42 l~t~~~~--~i~~~l~~~idp~v~~~~~~~~~~~~l~~v~t~g~w~~~t~~~~v~e~~G~a~~ygd~~DiP~vd~~~~~~ 119 (336) |.|.+.+ .+|..+.+ +|++.+.+.-....+.++.+.+. ....+++.+..++|..++....+|..+...... T Consensus 1 mat~~~gg~lvP~~~~~----~ii~~~~~~s~i~~~~~~i~~~~---~~~~~p~~~~~~~a~wv~Eg~~~~~~~~~f~~v 73 (311) T protein:vir:81 1 MVALATGTFQLPKHLVP----GVWQKAQGQSVLARLSMAEPQEF---GEQQYMTLTAPPRGEVVGEGAQKSESTATFAPV 73 (311) T ss_pred CceecCCceEcchhHHH----HHHHHHHhcchhhhhcceeecCC---CceEEEEEeCCceeEEeecCcccccccceeeEE Confidence 1222222 24555543 34444444444555555433221 246788888888999999999999999888888 Q ss_pred eeeEEEEEEEEEeCHHHHHHHHHhCCCHHHHHHHHHHHHHHHhhccEEEeecc---ccceEEEEecCCCCcccccccccc Q lcl|Aclame:pro 120 QRQSYFFQTWTRWGERELEMAGAGRVDLASELNYSSALGLAKFLNGSYLFGVA---GLENYGLINDPSLSAPITATTPWS 196 (336) Q Consensus 120 ~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~K~~aAr~a~e~~~n~i~~~Gd~---~~g~~GllN~Pnl~~~~~~~t~w~ 196 (336) .-+.+.++....+|.+=++...-...+|.+.-+...++++.+.+++-.++|+. +.+..|+++.. ....... . . T Consensus 74 ~l~~~kl~~~~~iS~ell~~~~d~~~~l~~~i~~~la~ai~~~~d~a~l~G~~~~~~~~~~gi~~~~--~~~~~~~-~-~ 149 (311) T protein:vir:81 74 TAIPRKVQVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSPAKI--LDTTNIV-E-L 149 (311) T ss_pred EEeeEEEEEeehhhHHHhhcCcccHHHHHHHHHHHHHHHHHHHHHHhhhccccCCCCcccccccccc--cccceee-e-e Confidence 88888888888777654444445667788888999999999999999999974 33445666631 1111100 0 1 Q ss_pred cccCHHHHHHHHHHHHHHHHHHhCCceeccCCcEEEecHHHHHhccc-CCCCCccHHHHHH-Hh-------CCccEEEEc Q lcl|Aclame:pro 197 GSPAVEAVVNEVVTLFQVLQTQSQGIITQEAVLHMGLPPTAMSDLSK-TNQYGLSAAAKLK-EI-------FPKLEFVTI 267 (336) Q Consensus 197 ~~~T~~eI~~Di~~l~~~l~~~t~g~v~~~~p~tL~Lp~~~~~~Ls~-~~~~~~Tvl~~l~-~n-------~pnl~i~~~ 267 (336) ...+...+..+|.+++..+... + ..+..++|.+..+..|.+ .+..|.-++.-.. .. +|=+.-..+ T Consensus 150 ~~~~~~~~~~~i~~~~~~~~~~-~-----~~~~~~vmn~~~~~~l~~lkd~~G~~l~~~~~~~~~~~tl~G~Pv~~~~~i 223 (311) T protein:vir:81 150 TTGTSATPDLAVEAAVGLVLGD-N-----LSPDGVALDNTFSFMLATQRDSQGRKLYPELGFGTDVASFAGLNAAVSDTV 223 (311) T ss_pred cccccchHHHHHHHHHHHhhhc-C-----CCceEEEEcHHHHHHHHhhhccCCCeeecCccccCCCceecceeEEecccc Confidence 1222233456677777666432 2 235579999999988854 2333332321110 00 121110112 Q ss_pred ccc-----------cCCCCceEEEEEE-------eeCCCceEEEEeCchhhcccc-eecCCceEEeeecceeeeEEeccc Q lcl|Aclame:pro 268 PEY-----------DTASGRLVQLWAP-------RVEGKDTATCGFTEKMRAHSI-ERYSSYFRQKKSAGTWGAVIFRPF 328 (336) Q Consensus 268 pel-----------~~a~G~~~~~~~~-------~~~~~~~~~~~~p~~~~~l~~-~~~~~~~~v~~~~rt~Gv~ir~P~ 328 (336) |.- ..+.+....++.+ .+.+ -+.++ .++....-.. -...-...+-|..|+++ .+++|. T Consensus 224 ~~~~~~~~~~~~~~~~~~~~~~~~~gDfs~~~i~~~~~-~~~~~-~~~~~~~~~~~~~~~~~v~~r~~~r~d~-~v~~~~ 300 (311) T protein:vir:81 224 RGGPEAVTASTGVYRTTNPNVKAIAGDFSAFRWGVQVS-IPLEL-IEFGDPDGLGDLKRQNQIAIRAEVVYGI-GIMSTD 300 (311) T ss_pred cccccccccccchhcccCCccEEEEEecccEEEEEecc-ceEEE-eccCCCCcchhhhhcCcEEEEEEEEecc-Eeeccc Confidence 211 0111111222221 1111 11110 1111000000 01112345566666655 556699 Q ss_pred ceeeeccC Q lcl|Aclame:pro 329 AVAQMIGV 336 (336) Q Consensus 329 ai~~~~GI 336 (336) ||+++.|. T Consensus 301 a~~~l~~a 308 (311) T protein:vir:81 301 AFAVVRDA 308 (311) T ss_pred ceEEEEee Confidence 99999999 No 30 >protein:vir:2504 Length: 305 # NCBI annotation: major capsid subunit gp9 # Family: family:all:507 # MgeID: mge:53 # MgeName: TM4 # Cross-refs: genbank:acc:NP_569745;genbank:gi:18496895;genbank:GeneID:932268 Probab=98.17 E-value=3.4e-07 Score=56.07 Aligned_cols=274 Identities=10% Similarity=0.032 Sum_probs=145.2 Q ss_pred hhhhhhhhcCccccC-CcchHHHHHHHhhCceeeeeeccccchhhhcccccCCCcceeeEEEeeeecccceEEeeccc-- Q lcl|Aclame:pro 31 YAMDAADLSPHLSST-GSSGIPNYLTTYVDPSVIDILVAPMKAAELVGESKKGDWTTLVAAFITAEPTTTVATYGDYS-- 107 (336) Q Consensus 31 ~~~da~d~~~~l~t~-~~~~i~~~l~~~idp~v~~~~~~~~~~~~l~~v~t~g~w~~~t~~~~v~e~~G~a~~ygd~~-- 107 (336) |+- .+++ ....+|..+.+ +|++.+...-....+..+.+.+. .+..+++.+..+.+..+|... T Consensus 1 ma~--------~t~~~gg~liP~~~~~----~Ii~~~~~~s~l~~l~~~~~~~~---~~~~~p~~~~~~~a~wv~E~~~~ 65 (305) T protein:vir:25 1 MAD--------ISRAEVASLIQEAYSD----TLLAAAKQGSTVLSAFQNVNMGT---KTTHLPVLATLPEADWVGESATD 65 (305) T ss_pred CCC--------ccCCccceecCHHHHH----HHHHHHHhhchhhhhcceeeccC---CcEEEEEEeCCcceEEeeccccc Confidence 111 1111 12226666653 44555555555566665544332 246777777777888886643 Q ss_pred ---CCceeeeeeeeeeeeEEEEEEEEEeCHHHHHHHHHhCCCHHHHHHHHHHHHHHHhhccEEEeeccccceEEEEecCC Q lcl|Aclame:pro 108 ---SDGDSGTNINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASELNYSSALGLAKFLNGSYLFGVAGLENYGLINDPS 184 (336) Q Consensus 108 ---DiP~vd~~~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~K~~aAr~a~e~~~n~i~~~Gd~~~g~~GllN~Pn 184 (336) ++|..+.......-..+.++..+.+|.+=++ ....++.+.-.....+++.+.+++-.++|+.... |+.+... T Consensus 66 ~~~~~~~s~~~f~~i~~~~~k~~~~~~is~ell~---ds~~~~~~~i~~~l~~~~a~~~d~a~~~G~g~~~--~~~~~~~ 140 (305) T protein:vir:25 66 PKGVKPTSKVTWANRTLVAEEIAVIIPVHENVID---DATVAVLTEVAELGGQAIGKKLDQAVIFGTDKPA--SWVSPAL 140 (305) T ss_pred ccccccccccceeeEEeeeEEEEEeehhhHHHHh---cchHHHHHHHHHHHHHHHHHHHhhhheeccCCCC--Ccccccc Confidence 4577777778888888889988998875443 3346789999999999999999999999997432 3333222 Q ss_pred CCccccc---ccccccccCHHHHHHHHHHHHHHHHHHhCCceeccCCcEEEecHHHHHhccc-CCCCCccHHHHHHHhCC Q lcl|Aclame:pro 185 LSAPITA---TTPWSGSPAVEAVVNEVVTLFQVLQTQSQGIITQEAVLHMGLPPTAMSDLSK-TNQYGLSAAAKLKEIFP 260 (336) Q Consensus 185 l~~~~~~---~t~w~~~~T~~eI~~Di~~l~~~l~~~t~g~v~~~~p~tL~Lp~~~~~~Ls~-~~~~~~Tvl~~l~~n~p 260 (336) ++....+ ...+-...+..++++++..+...+.... ..+..++|.+..+..|.+ .+..|.-++. -...- T Consensus 141 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------~~~~~~v~~~~~~~~l~~lkd~~G~~i~~--~~~l~ 212 (305) T protein:vir:25 141 IPAAVTAGQAVEVVGGVANESDIVGATNRAAKAVASAG------WAPDTLLSSLALRYEVANIRDANGNPVFR--DDSFA 212 (305) T ss_pred ccccccccccccccccchhhhHHHHHHHHHHHhhhhcc------cccceeEecHHHHHHHHHhhccCCceeec--CCccc Confidence 2221111 1122222334556777776665553321 234569999998888854 3444443321 00111 Q ss_pred ccEEEEcccccCCCCceEEEEEE-------eeCCCceEEEEeCch--hhcc--cce-ecCCceEEeeecceeeeEEeccc Q lcl|Aclame:pro 261 KLEFVTIPEYDTASGRLVQLWAP-------RVEGKDTATCGFTEK--MRAH--SIE-RYSSYFRQKKSAGTWGAVIFRPF 328 (336) Q Consensus 261 nl~i~~~pel~~a~G~~~~~~~~-------~~~~~~~~~~~~p~~--~~~l--~~~-~~~~~~~v~~~~rt~Gv~ir~P~ 328 (336) ++.+.-.......++....++.+ ...+ ..+.+-.. +... +.. ...-.+.+-+..|++ ..+.+|. T Consensus 213 G~Pv~~~~~~~~~~~~~~~~~gd~s~~~i~~~~~---~~i~~~~~~~~~~~~~~~~~~~~~~~~~R~~~r~~-~~v~~p~ 288 (305) T protein:vir:25 213 GFRTFFNRNGAWDADAAIEVIADSSRVKIGVRQD---ITVKFLDQATLGTGENQINLAERDMVALRLKARFA-YVLGVSA 288 (305) T ss_pred ccceEEcCccCCCCCccEEEEEecceEEEEEecC---eEEEEeeeeeeecCCceeeeeecCcEEEEEEEeec-ceeeCcc Confidence 12222111111222222222211 1111 01111000 0000 000 111234455667775 4577899 Q ss_pred ceeeeccC Q lcl|Aclame:pro 329 AVAQMIGV 336 (336) Q Consensus 329 ai~~~~GI 336 (336) +|+.++|+ T Consensus 289 a~v~~~~~ 296 (305) T protein:vir:25 289 TAQGANKT 296 (305) T ss_pred cEEEEccc Confidence 99999999 No 31 >protein:vir:108211 Length: 318 # NCBI annotation: gp9 # Family: family:all:6420 # MgeID: mge:2004 # MgeName: Giles # Cross-refs: genbank:acc:YP_001552338;genbank:gi:160700658;genbank:GeneID:5758931 Probab=98.16 E-value=1.6e-08 Score=63.32 Aligned_cols=274 Identities=13% Similarity=0.180 Sum_probs=145.5 Q ss_pred hhhhhhhhhhhhhhhcCccccCCcchHHHHHH--HhhCceeeeeeccccchhhhcccccCCCcceeeEEEeeeecc---c Q lcl|Aclame:pro 24 VSTPLAEYAMDAADLSPHLSSTGSSGIPNYLT--TYVDPSVIDILVAPMKAAELVGESKKGDWTTLVAAFITAEPT---T 98 (336) Q Consensus 24 ~~~~~~~~~~da~d~~~~l~t~~~~~i~~~l~--~~idp~v~~~~~~~~~~~~l~~v~t~g~w~~~t~~~~v~e~~---G 98 (336) ++.+ ...-+...++.+ ++..+|. ++|..++.+..=+.+-++.||-. .+.-....+.|.-.++. | T Consensus 1 ~~~~---~~i~s~~~~~~i------tv~~ll~~P~~I~~~i~e~~~~~~iad~lf~~--~~a~~~~~v~f~~~~p~~~~~ 69 (318) T protein:vir:10 1 MTAP---TGIVSVSDGPAI------TVRELVGNPLWIPTALKKMMVNQFISESLFRN--GGANPNGVVAYNEGNPSFLED 69 (318) T ss_pred CCCC---CcceeeecCCce------ehHHhhCCchhHHHHHHHHHhccchhhhhhhc--ccccccceeEEEecccccccC Confidence 1100 000000111111 1112222 23333333333333334444432 22212344555443333 6 Q ss_pred ceEEeecccCCceeeeeeeeeee-eEEEEEEEEEeCHHHHHHHHHhCCCHHHHHHHHHHHHHHHhhccEEEeeccccceE Q lcl|Aclame:pro 99 TVATYGDYSSDGDSGTNINYPQR-QSYFFQTWTRWGERELEMAGAGRVDLASELNYSSALGLAKFLNGSYLFGVAGLENY 177 (336) Q Consensus 99 ~a~~ygd~~DiP~vd~~~~~~~~-~v~~~~~~~~y~~~El~~A~~~g~~l~~~K~~aAr~a~e~~~n~i~~~Gd~~~g~~ 177 (336) .+...+-+..+|+++..-.+.+. .+..++.++++|.+.+.+ .+++...+...++++++-++.|+.+ + T Consensus 70 d~e~VaEggEiP~~~~~~G~~~ia~~~K~G~~~~vS~Em~~~---n~~~~v~r~~~~l~Nti~r~~d~~a---------~ 137 (318) T protein:vir:10 70 DVADVAEFGEIPVSAGARGLPRTAFAVKKALGVRVSKEMIDE---NRVGAVNDQMLQLRNTFIRANDRSA---------K 137 (318) T ss_pred cHhhccCcccccccCCCCCchhhhhhehhccceeccHHHHhh---cChhHHHHHHHHHHHHHHHHHHHHH---------H Confidence 77777778899999987756555 557899999999876654 4577788888888888888877763 3 Q ss_pred EEEecCCCCcccccccccccccCHHHHHHHHHHHHHHH--------------HHHhCCceeccCCcEEEecHHHHHhccc Q lcl|Aclame:pro 178 GLINDPSLSAPITATTPWSGSPAVEAVVNEVVTLFQVL--------------QTQSQGIITQEAVLHMGLPPTAMSDLSK 243 (336) Q Consensus 178 GllN~Pnl~~~~~~~t~w~~~~T~~eI~~Di~~l~~~l--------------~~~t~g~v~~~~p~tL~Lp~~~~~~Ls~ 243 (336) ..|.+++++. ...++.|-.. .....|+-.+...+ .....|+ .|+||+|-|..+..|.+ T Consensus 138 dal~sa~t~~-~~~s~~w~~~---~~~~~d~~~A~e~v~~a~~~~~~a~~~~~~~~~GY----~pdtIVlhP~~~~~l~~ 209 (318) T protein:vir:10 138 ALLQSPIVPT-LAVPTAWDNG---GKVRTDIAIAIEQISTAAPTAYPAGVGSSDEYFGF----IPDTIVMHYALLPILMD 209 (318) T ss_pred HHHhcccccc-ccCCcCCCCc---ccccccchhhhhhhhhhhhhhhhhhhhhhhhccCc----cceeeEECHHHHHHHhc Confidence 4566665543 3344444321 00112222222111 1112222 58999999999999964 Q ss_pred CCC----C---CccHHHHH--HHhCCc----cEEEEcccccCCCCceEEEEEEeeCCCceEEEEeCchhhccccee---- Q lcl|Aclame:pro 244 TNQ----Y---GLSAAAKL--KEIFPK----LEFVTIPEYDTASGRLVQLWAPRVEGKDTATCGFTEKMRAHSIER---- 306 (336) Q Consensus 244 ~~~----~---~~Tvl~~l--~~n~pn----l~i~~~pel~~a~G~~~~~~~~~~~~~~~~~~~~p~~~~~l~~~~---- 306 (336) -.. | +-.+...+ ..+||. |+++..|-+.. +.+ ++++ .+.++.+..++++.+.+--. T Consensus 210 n~~~~~~y~~~a~~~~~~~~~tg~~~g~~lGl~vi~s~~~p~---~~a-lvlq---~g~vG~~~d~~pl~~t~~~~egg~ 282 (318) T protein:vir:10 210 NENFMKVYERNANYVSTAPDWTGNFPGSVMGLNVIRSRTFPI---DRV-LIME---RGTVGFYSDTRPLQFTALYPEGNG 282 (318) T ss_pred chhhhhhhhccchhhhhcccccccccceeeceEEeecCccCC---Cee-EEEe---cCCcceeeccccceeeecccCCCC Confidence 211 1 11111111 122432 66766666652 222 3333 36677777788777666433 Q ss_pred ----cCCceEEeeecceeeeEEecccceeeeccC Q lcl|Aclame:pro 307 ----YSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) Q Consensus 307 ----~~~~~~v~~~~rt~Gv~ir~P~ai~~~~GI 336 (336) .+.+|....+..+ ...|..|+|+..++|| T Consensus 283 ~~g~~~~s~~~~~~~~~-~~~V~~PkA~~~itgi 315 (318) T protein:vir:10 283 PNGGPTESYRADASHKR-ALAVDQPKAALWLTGI 315 (318) T ss_pred CCCCcchhhheehheee-eeeeeCcceeEEEeec Confidence 3345666655443 5778899999999999 No 32 >protein:vir:95763 Length: 297 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1578 # MgeName: SMP # Cross-refs: genbank:acc:YP_950590;genbank:gi:119953785;genbank:GeneID:5076833 Probab=98.10 E-value=4e-07 Score=55.64 Aligned_cols=275 Identities=8% Similarity=-0.019 Sum_probs=151.3 Q ss_pred hhhhhhhhhhhcCcccc-CCcchHHHHHHHhhCceeeeeeccccchhhhcccccCCCcceeeEEEeeeecccceEEeecc Q lcl|Aclame:pro 28 LAEYAMDAADLSPHLSS-TGSSGIPNYLTTYVDPSVIDILVAPMKAAELVGESKKGDWTTLVAAFITAEPTTTVATYGDY 106 (336) Q Consensus 28 ~~~~~~da~d~~~~l~t-~~~~~i~~~l~~~idp~v~~~~~~~~~~~~l~~v~t~g~w~~~t~~~~v~e~~G~a~~ygd~ 106 (336) +....+|+. -.+++ ...+.+|..+.+ ++++.+...-....+.++...+. .....+++....+.+..++.+ T Consensus 1 m~~~~~~~~---~~~~t~~~~~lvP~~~~~----~ii~~~~~~s~l~~~~~~~~~~~--~~~~~~~~~~~~~~a~~v~Eg 71 (297) T protein:vir:95 1 MTVQTFNPE---NVLVSQKKDGTLHKEFTD----IIMKEVAQNSLVMQLGQYQEMEG--EQEKTVYVQTDGISAYWVNET 71 (297) T ss_pred CCccccccc---cccccCCCcceechhHHH----HHHHHHHhhchhhhhcceeecCC--CccEEEEEEcCCceeEEeecC Confidence 111122322 11222 223346666653 44555555555555555543221 123456666777788899999 Q ss_pred cCCceeeeeeeeeeeeEEEEEEEEEeCHHHHHHHHHhCCCHHHHHHHHHHHHHHHhhccEEEeeccccceEEEEecCCCC Q lcl|Aclame:pro 107 SSDGDSGTNINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASELNYSSALGLAKFLNGSYLFGVAGLENYGLINDPSLS 186 (336) Q Consensus 107 ~DiP~vd~~~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~K~~aAr~a~e~~~n~i~~~Gd~~~g~~GllN~Pnl~ 186 (336) .++|..+........+.+.++..+.+|.+-++.+. .++.+.-....++++.+.+++-.++|+...+-.|+++..... T Consensus 72 ~~~~~~~~~f~~v~l~~~k~~~~~~is~ell~ds~---~~l~~~i~~~la~ai~~~~d~a~l~G~g~~~~~gi~~~~~~~ 148 (297) T protein:vir:95 72 EKIKTDKPEVVPVTLKAHKLGIILVTSREALNYTW---KKFFEDMKPQIVEAFYKKIDEAGLLGHDTPFANSVAKAAKDA 148 (297) T ss_pred ccccccccceeEEEEeeEEEEEeehhhHHHHhcCH---HHHHHHHHHHHHHHHHHHHHHHHhcccCCccccccccccccc Confidence 99999999999999999999999999986666443 578888889999999999999999999988888888743321 Q ss_pred cccccccccccccCHHHHHHHHHHHHHHHHHHhCCceeccCCcEEEecHHHHHhccc-CCCCCccHHHHHHHhCCcc--- Q lcl|Aclame:pro 187 APITATTPWSGSPAVEAVVNEVVTLFQVLQTQSQGIITQEAVLHMGLPPTAMSDLSK-TNQYGLSAAAKLKEIFPKL--- 262 (336) Q Consensus 187 ~~~~~~t~w~~~~T~~eI~~Di~~l~~~l~~~t~g~v~~~~p~tL~Lp~~~~~~Ls~-~~~~~~Tvl~~l~~n~pnl--- 262 (336) . +.. -++.| ++||.+++.++...- -.+..++|.+..+..|.+ .+..|.-++ ......+ T Consensus 149 ~--~~~---~~~~t----~~~i~~~~~~l~~~~------~~~~~~v~~~~~~~~L~~l~d~~G~~i~---~~~~~~l~G~ 210 (297) T protein:vir:95 149 N--KVI---GGPIN----YDNILKLQDALYDAD------VEPNAFVSKIQNRSALREARDGNKVSIY---DKAANTIDGI 210 (297) T ss_pred c--eec---ccccC----HHHHHHHHHHhhhcc------CCcCEEEEcHHHHHHHHHhhccCCceee---cCCCCcccce Confidence 1 111 11223 566777777765431 135689999999998864 333333222 1111111 Q ss_pred EEEEcccccCCC-----CceEEEEEEeeCCCceEEEEeCchhh-cccce--------ecCCceEEeeecceeeeEEeccc Q lcl|Aclame:pro 263 EFVTIPEYDTAS-----GRLVQLWAPRVEGKDTATCGFTEKMR-AHSIE--------RYSSYFRQKKSAGTWGAVIFRPF 328 (336) Q Consensus 263 ~i~~~pel~~a~-----G~~~~~~~~~~~~~~~~~~~~p~~~~-~l~~~--------~~~~~~~v~~~~rt~Gv~ir~P~ 328 (336) .++..+.-.... |+-.++++-...+ .++.+-.... ....+ ...-...+.++.|.++ .+++|. T Consensus 211 Pv~~~~~~~~~~~~~~~gd~s~~~~~~~~~---~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~-~v~~~~ 286 (297) T protein:vir:95 211 TTVDLKSARFEKGDLLAGDFDNLIYGVPYN---ITYKISEEGQISTITNADGTPINLFEQEMIAIRATMDIAV-MITKTD 286 (297) T ss_pred eeEeecCCCCCCceEEEEecccEEEEEecC---eEEEEeeccccccccccCccchhhhhcCcEEEEEEEEecc-Eeeccc Confidence 121111111111 1111222211111 1111111100 00000 1112344455556555 556699 Q ss_pred ceeeeccC Q lcl|Aclame:pro 329 AVAQMIGV 336 (336) Q Consensus 329 ai~~~~GI 336 (336) ||+.+..- T Consensus 287 a~~~l~~a 294 (297) T protein:vir:95 287 AFAKLTPA 294 (297) T ss_pred ceEEEeec Confidence 99998877 No 33 >protein:vir:4226 Length: 326 # NCBI annotation: observed 35.2Kd protein # Family: family:all:507 # MgeID: mge:89 # MgeName: L5 # Cross-refs: genbank:acc:NP_039681;swissprot:sw:q05223;genbank:gi:9625447;uniprot:Q05223;genbank:GeneID:2942929 Probab=98.00 E-value=7e-07 Score=54.31 Aligned_cols=290 Identities=10% Similarity=-0.014 Sum_probs=150.4 Q ss_pred CchHHHHHHHhhcceeccchhhhhhh-hhhhhhhhhhhhcCccccCCcc-hHHHHHHHhhCceeeeeeccccchhhhccc Q lcl|Aclame:pro 1 MRDAQRIQNLARAGVILPRSVKNVST-PLAEYAMDAADLSPHLSSTGSS-GIPNYLTTYVDPSVIDILVAPMKAAELVGE 78 (336) Q Consensus 1 m~~~~~~~~l~~~g~~~~~~~~~~~~-~~~~~~~da~d~~~~l~t~~~~-~i~~~l~~~idp~v~~~~~~~~~~~~l~~v 78 (336) ||+-+ +...+.. +.+.+. ..+.+.+ .+|..+. .++++.+........+..+ T Consensus 3 ~~~~r--------------~~~~~~~~e~~a~~---------~~~~~~g~~ip~~~~----~~ii~~~~~~s~i~~~~~~ 55 (326) T protein:vir:42 3 VNPDR--------------TTPFLGVNDPKVAQ---------TGDSMFEGYLEPEQA----QDYFAEAEKISIVQQFAQK 55 (326) T ss_pred CCccc--------------hhhhcCcchhhhee---------ccccCCcceechhhH----HHHHHHHHhcchhhhhcce Confidence 22211 1111110 111110 1111122 2444444 2345554555445555554 Q ss_pred ccCCCcceeeEEEeeeecccceEEeecccCCceeeeeeeeeeeeEEEEEEEEEeCHHHHHHHHHhCCCHHHHHHHHHHHH Q lcl|Aclame:pro 79 SKKGDWTTLVAAFITAEPTTTVATYGDYSSDGDSGTNINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASELNYSSALG 158 (336) Q Consensus 79 ~t~g~w~~~t~~~~v~e~~G~a~~ygd~~DiP~vd~~~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~K~~aAr~a 158 (336) ...+ ..+..|++.+..+.+..++....+|..+...+...-..+.++..+.+|.+=++.+ ..++.+.-....+++ T Consensus 56 ~~~~---~~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~f~~i~~~~~k~~~~v~iS~ell~~s---~~~~~~~i~~~l~~a 129 (326) T protein:vir:42 56 IPMG---TTGQKIPHWTGDVSASWIGEGDMKPITKGNMTSQTIAPHKIATIFVASAETVRAN---PANYLGTMRTKVATA 129 (326) T ss_pred eecc---CCceEEEEEeCCcceEEecCCccccccccceeEEEEeeEEEEEeehhhHHHHhcC---HHHHHHHHHHHHHHH Confidence 4332 2345778888888888899999999999999999999999999999987555543 367888888888999 Q ss_pred HHHhhccEEEeeccccceEEEEecCCCCcccccccccccccCHHHHHHHHH--HHHHHHHHHhCCceeccCCcEEEecHH Q lcl|Aclame:pro 159 LAKFLNGSYLFGVAGLENYGLINDPSLSAPITATTPWSGSPAVEAVVNEVV--TLFQVLQTQSQGIITQEAVLHMGLPPT 236 (336) Q Consensus 159 ~e~~~n~i~~~Gd~~~g~~GllN~Pnl~~~~~~~t~w~~~~T~~eI~~Di~--~l~~~l~~~t~g~v~~~~p~tL~Lp~~ 236 (336) +.+.+++-.++|+...+-.|++|.+.....+...+.. .+.+...+|+. .++..+.... .....++|.+. T Consensus 130 ~~~~~d~a~l~G~gs~~p~gi~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~------~~~a~~v~n~~ 200 (326) T protein:vir:42 130 FAMAFDNAAINGTDSPFPTFLAQTTKEVSLVDPDGTG---SNADLTVYDAVAVNALSLLVNAG------KKWTHTLLDDI 200 (326) T ss_pred HHHHHHHHhhcccCCCccccccccccccceeeccccc---ccccchhHHHHHHHHHhhhhhhc------cCccEEEEeHH Confidence 9999999999999877778888865432222222111 11122233332 2222221111 12446899999 Q ss_pred HHHhccc-CCCCCccHHHHHHHh-----CCccEEEEccccc-C--CC-------CceEEEEEEeeCCCceEEEEeCc-hh Q lcl|Aclame:pro 237 AMSDLSK-TNQYGLSAAAKLKEI-----FPKLEFVTIPEYD-T--AS-------GRLVQLWAPRVEGKDTATCGFTE-KM 299 (336) Q Consensus 237 ~~~~Ls~-~~~~~~Tvl~~l~~n-----~pnl~i~~~pel~-~--a~-------G~~~~~~~~~~~~~~~~~~~~p~-~~ 299 (336) .+..|.+ .+..|.-++.--..+ ++.-++...|-.- . .. |+-.++++-.+.+. .+.+-. .. T Consensus 201 ~~~~L~~lkd~~G~~l~~~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~Gd~s~~~~~~~~~~---~v~~~~e~~ 277 (326) T protein:vir:42 201 TEPILNGAKDKSGRPLFIESTYTEENSPFRLGRIVARPTILSDHVASGTVVGYQGDFRQLVWGQVGGL---SFDVTDQAT 277 (326) T ss_pred HHHHHHHhhccCCceeeccccccCccccccCceeeeeeEEEcCCCCCCceEEEEeecceEEEEEecce---EEEEeecce Confidence 8888854 333333222110001 1112233333221 1 11 22222222222111 111111 11 Q ss_pred hcccc----e----ecCCceEEeeecceeeeEEecccceeeeccC Q lcl|Aclame:pro 300 RAHSI----E----RYSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) Q Consensus 300 ~~l~~----~----~~~~~~~v~~~~rt~Gv~ir~P~ai~~~~GI 336 (336) ..... . ...-...+.+..|. ++.+.+|.||+++.++ T Consensus 278 ~~~~~~~~~~~~~~~~~d~~~~r~~~~~-d~~v~~~~a~~~l~~~ 321 (326) T protein:vir:42 278 LNLGTPQAPNFVSLWQHNLVAVRVEAEY-AFHCNDKDAFVKLTNV 321 (326) T ss_pred eeecccccccchhhhhcCcEEEEEEEEe-ccEEecccceEEEeec Confidence 00000 0 11123555667776 5567999999999999 No 34 >protein:vir:41 Length: 299 # NCBI annotation: major capsid protein # Family: family:all:507 # MgeID: mge:2 # MgeName: A118 # Cross-refs: genbank:acc:NP_463467;swissprot:trembl:q9t1b7;genbank:gi:16798789;uniprot:Q9T1B7;genbank:GeneID:922353 Probab=97.96 E-value=1.2e-06 Score=53.12 Aligned_cols=275 Identities=8% Similarity=0.049 Sum_probs=150.2 Q ss_pred hhhhhhhhcCccccCCcchHHHHHHHhhCceeeeeeccccchhhhcccccCCCcceeeEEEeeeecccceEEeecccCCc Q lcl|Aclame:pro 31 YAMDAADLSPHLSSTGSSGIPNYLTTYVDPSVIDILVAPMKAAELVGESKKGDWTTLVAAFITAEPTTTVATYGDYSSDG 110 (336) Q Consensus 31 ~~~da~d~~~~l~t~~~~~i~~~l~~~idp~v~~~~~~~~~~~~l~~v~t~g~w~~~t~~~~v~e~~G~a~~ygd~~DiP 110 (336) |..+|+ ..- .+....+.||..+.+ +|++.+...-....+..+.+.+. .+..+++.+. ..+..++...++| T Consensus 1 ~g~~a~-~~~-~~~~~~~~iP~~~~~----~ii~~~~~~s~l~~~~~~~~~~~---~~~~~~~~~~-~~a~~v~E~~~~~ 70 (299) T protein:vir:41 1 MGFNPD-TTT-MQSAKTGSIPINISE----QIITGVKNGSAAMKLAKAVPMTK---PEEEFTFMSG-VGAFWVDEAERIQ 70 (299) T ss_pred CCcCCC-ccc-ccCCCceecchhHHH----HHHHHHHhcchhhhhceeeecCC---CcEEEEEEcC-CceeeeecCcccc Confidence 333332 110 011112235555543 33444444444445544433322 2234555443 5577888888999 Q ss_pred eeeeeeeeeeeeEEEEEEEEEeCHHHHHHHHHhCCCHHHHHHHHHHHHHHHhhccEEEeeccccceEEEEecCCCCcccc Q lcl|Aclame:pro 111 DSGTNINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASELNYSSALGLAKFLNGSYLFGVAGLENYGLINDPSLSAPIT 190 (336) Q Consensus 111 ~vd~~~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~K~~aAr~a~e~~~n~i~~~Gd~~~g~~GllN~Pnl~~~~~ 190 (336) ..+..........+.++..+.++.+=+.. ...++.+.-.....+++.+.+++-.++|+....-.|+++......... T Consensus 71 ~~~~~f~~v~l~~~k~~~~~~is~ell~d---s~~~~~~~i~~~l~~a~~~~~d~a~l~G~g~~~~~gil~~~~~~~~~~ 147 (299) T protein:vir:41 71 TSKPTFTKAKMRSKKMGVIIPTTKENLNY---SVTNFFSLMQAEIVEAFYKKFDQAVFTGVESPYNWNILKSATDASNLV 147 (299) T ss_pred ccccceeEEEEeeEEEEEeehhhHHHHhc---CHHHHHHHHHHHHHHHHHHHHHHHHhhcccCcccccccccccccceee Confidence 99999999999999999999999865543 336788889999999999999999999998887789988543322111 Q ss_pred cccccccccCHHHHHHHHHHHHHHHHHHhCCceeccCCcEEEecHHHHHhccc-CCCCCccHHHH-HHHhCC---ccEEE Q lcl|Aclame:pro 191 ATTPWSGSPAVEAVVNEVVTLFQVLQTQSQGIITQEAVLHMGLPPTAMSDLSK-TNQYGLSAAAK-LKEIFP---KLEFV 265 (336) Q Consensus 191 ~~t~w~~~~T~~eI~~Di~~l~~~l~~~t~g~v~~~~p~tL~Lp~~~~~~Ls~-~~~~~~Tvl~~-l~~n~p---nl~i~ 265 (336) . ..+. -++||.+++.++...- -.+..++|.+..+..|.+ .+..|.-++.= +...-+ ++.+. T Consensus 148 ~----~~~~----~~~~l~~~~~~l~~~~------~~~~~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~~~~~l~G~PV~ 213 (299) T protein:vir:41 148 E----ETAN----KYDDLNEAIGLIEAED------LEPNGIATIRKQRVKYRSTKDGNGMPIFNTATSNGVDDVLGLPIA 213 (299) T ss_pred c----cccc----cHHHHHHHHHhhhccc------CCcCEEEEcHHHHHHHHHhhccCCceeecCCcCCCCceecceeeE Confidence 1 1112 3678888888775321 135579999999998864 33333322210 000001 12222 Q ss_pred EcccccCCCCceEEEEEEeeCCCce-------EEEEeCc-hhhccccee--------cCCceEEeeecceeeeEEecccc Q lcl|Aclame:pro 266 TIPEYDTASGRLVQLWAPRVEGKDT-------ATCGFTE-KMRAHSIER--------YSSYFRQKKSAGTWGAVIFRPFA 329 (336) Q Consensus 266 ~~pel~~a~G~~~~~~~~~~~~~~~-------~~~~~p~-~~~~l~~~~--------~~~~~~v~~~~rt~Gv~ir~P~a 329 (336) ..+.+. ++++...+++-+. ... .++.+-. ......... ..-...+.+..|+ |..+++|.| T Consensus 214 ~~~~~~-~~~~~~~~~~gdf--s~~~i~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~-d~~v~~~~A 289 (299) T protein:vir:41 214 YTPKYT-FGDKDISELVGDW--NQAYYGILRGVEYEILTEATLTTVADETGKPLNLAERDMAAIKATFEV-GFMVVKDEA 289 (299) T ss_pred EecccC-CCCCceEEEEEec--ccEEEEEecCcEEEEeecccccccccccccchhhhhcCcEEEEEEEEe-ccEEecccc Confidence 223332 2222222222111 111 1111111 110000000 1113455666777 556677999 Q ss_pred eeeeccC Q lcl|Aclame:pro 330 VAQMIGV 336 (336) Q Consensus 330 i~~~~GI 336 (336) |+.+.+- T Consensus 290 ~~~l~~~ 296 (299) T protein:vir:41 290 FSAVQPK 296 (299) T ss_pred eEEEEec Confidence 9999999 No 35 >protein:vir:105038 Length: 428 # NCBI annotation: major capsid head protein precursor # Family: family:all:21 # MgeID: mge:1465 # MgeName: phiKO2 # Cross-refs: genbank:acc:YP_006586;genbank:gi:46402092;genbank:GeneID:2777903 Probab=97.92 E-value=2.1e-06 Score=51.74 Aligned_cols=317 Identities=12% Similarity=0.081 Sum_probs=145.2 Q ss_pred CchHHHHHHHhh--------------------------cce-----eccchhhhhhhhhhhhhhhhh---hhcCccccCC Q lcl|Aclame:pro 1 MRDAQRIQNLAR--------------------------AGV-----ILPRSVKNVSTPLAEYAMDAA---DLSPHLSSTG 46 (336) Q Consensus 1 m~~~~~~~~l~~--------------------------~g~-----~~~~~~~~~~~~~~~~~~da~---d~~~~l~t~~ 46 (336) ++..+....+++ .++ .+..+...+. ....++.+.. .....+.+.+ T Consensus 53 i~~~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~ 131 (428) T protein:vir:10 53 MDRMEATERAAALVAKPVKATQHGPAVIVKAEPKQYTGAGMTRMVMSIAAAQGNLQ-DAAKFASDELNDQSVSMAISTAA 131 (428) T ss_pred HHHHHHHHHHHHHHhhhhhchhhccccccccccchhhhHHHHHHHHHHHHhhhhHH-HHHHHhhhhhhhhhHhhhhcccc Confidence 111111110000 000 0000000000 0000100000 0000112222 Q ss_pred -cc--hHHHHHHHhhCceeeeeeccccchhhhcccccCCCcceeeEEEeeeecccceEEeecccCCceeeeeeeeeeeeE Q lcl|Aclame:pro 47 -SS--GIPNYLTTYVDPSVIDILVAPMKAAELVGESKKGDWTTLVAAFITAEPTTTVATYGDYSSDGDSGTNINYPQRQS 123 (336) Q Consensus 47 -~~--~i~~~l~~~idp~v~~~~~~~~~~~~l~~v~t~g~w~~~t~~~~v~e~~G~a~~ygd~~DiP~vd~~~~~~~~~v 123 (336) ++ .||..+.+ +|++.+........+ +... .......+.+++....+.+...+.+...|..+.......-.. T Consensus 132 ~~gg~liP~~~~~----~ii~~l~~~~~l~~~-~~~~-~~~~~g~~~~p~~~~~~~a~~v~Eg~~~~~~~~~f~~i~~~~ 205 (428) T protein:vir:10 132 GSGGVLIPQNIHS----EVIELLRDRTIVRKL-GARS-IPLPNGNMSLPRLAGGATASYTGENQDAKVSEARFDDVKLTA 205 (428) T ss_pred cCCccccchhHHH----HHHHHHhhhchhhhh-ccee-eecCCcceEEEEEeCCcceeeeccCccccccccceeeEEeee Confidence 22 25766553 334433333333333 1111 111123367777777778888898899999998888888888 Q ss_pred EEEEEEEEeCHHHHHHHHHhCCCHHHHHHHHHHHHHHHhhccEEEeeccc-cceEEEEecCCCCcccccccccccccCHH Q lcl|Aclame:pro 124 YFFQTWTRWGERELEMAGAGRVDLASELNYSSALGLAKFLNGSYLFGVAG-LENYGLINDPSLSAPITATTPWSGSPAVE 202 (336) Q Consensus 124 ~~~~~~~~y~~~El~~A~~~g~~l~~~K~~aAr~a~e~~~n~i~~~Gd~~-~g~~GllN~Pnl~~~~~~~t~w~~~~T~~ 202 (336) +.++..+.+|.+=+.-+ ..++.+--......++.+.+++.+++|++. ....|++|.......+...+ .-...+.+ T Consensus 206 ~k~~~~v~is~ell~ds---~~~l~~~i~~~l~~ai~~~~d~~~l~G~G~~~~p~Gi~~~~~~~~~~~~~~-~~~~~~~~ 281 (428) T protein:vir:10 206 KTMIAMVPISNALIGRA---GFNVEQLVLQDILTAISVREDKAFMRDDGTGDTPIGMKARATQWNRLLPWA-ADAAVNLD 281 (428) T ss_pred EEEEEeehhhHHHHhhh---hHHHHHHHHHHHHHHHHHHHHHHHhccCCCCcccccccccccccccccccc-ccccccHH Confidence 99999998887755543 356788888888888889999999999974 46679999654332212111 11223333 Q ss_pred HHHHHHHHHHHHHHHHhCCceeccCCcEEEecHHHHHhccc-CCCCCccHHHHHHH-hCCccEEEE---cccccCCCCce Q lcl|Aclame:pro 203 AVVNEVVTLFQVLQTQSQGIITQEAVLHMGLPPTAMSDLSK-TNQYGLSAAAKLKE-IFPKLEFVT---IPEYDTASGRL 277 (336) Q Consensus 203 eI~~Di~~l~~~l~~~t~g~v~~~~p~tL~Lp~~~~~~Ls~-~~~~~~Tvl~~l~~-n~pnl~i~~---~pel~~a~G~~ 277 (336) .+-..++ .+................+|.+..+..|.. .+..|.-++.-... .+-++.++. +|.-.+.+++. T Consensus 282 ~~~~~~~----~~~~~~~~~~~~~~~~~~v~n~~~~~~L~~lkd~~G~~i~~~~~~g~l~G~pv~~~~~~p~~~~~~~~~ 357 (428) T protein:vir:10 282 TIDTYLD----SIILMSMDGNSNMISSGWGMSNRTYMKLFGLRDGNGNKVYPEMAQGMLKGYPIQRTSAIPANLGEGGKE 357 (428) T ss_pred HHHHHHH----HHHHhhhccccccccCEEEEcHHHHHHHHHhhccCCceeccCCCCCeeeceeeEEeccccccccCCCcc Confidence 3322222 222211110001124578899998888854 34444434321111 011222222 23322233333 Q ss_pred EEEEEEeeCCCceEEEEeCchhhcccc-ee-------------cCCceEEeeecceeeeEEecccceeeeccC Q lcl|Aclame:pro 278 VQLWAPRVEGKDTATCGFTEKMRAHSI-ER-------------YSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) Q Consensus 278 ~~~~~~~~~~~~~~~~~~p~~~~~l~~-~~-------------~~~~~~v~~~~rt~Gv~ir~P~ai~~~~GI 336 (336) ..+++-+. .+ ..+..-..++...- +. ..-...+-+..| -|+.+++|.||+.++|| T Consensus 358 ~~i~~gd~--s~-~~i~~~~~i~i~~~~~~~~~~~~~~~~~~f~~~~~~~R~~~r-~d~~v~~p~a~~~~t~~ 426 (428) T protein:vir:10 358 SEIYFADF--ND-VVIGEDGNMKVDFSKEASYIDTDGKLVSAFSRNQSLIRVVTE-HDIGFRHPEGLVLGTGV 426 (428) T ss_pred ceEEEEec--ce-EEEEEecceEEEeecccccccccccccchhhcchhheeeeee-eCceeeccceEEEEecc Confidence 33333221 11 11111111111000 00 001122234444 46678999999999999 No 36 >protein:vir:97148 Length: 324 # NCBI annotation: ORF010 # Family: family:all:507 # MgeID: mge:1654 # MgeName: 85 # Cross-refs: genbank:acc:YP_239726;genbank:gi:66394880;genbank:GeneID:5130881 Probab=97.89 E-value=2.4e-06 Score=51.41 Aligned_cols=293 Identities=11% Similarity=0.017 Sum_probs=153.5 Q ss_pred CchHHHHH-HHhhcceeccchhhhhhhhhhhhhhhhhhhcCccccCCcchHHHHHHHhhCceeeeeeccccchhhhcccc Q lcl|Aclame:pro 1 MRDAQRIQ-NLARAGVILPRSVKNVSTPLAEYAMDAADLSPHLSSTGSSGIPNYLTTYVDPSVIDILVAPMKAAELVGES 79 (336) Q Consensus 1 m~~~~~~~-~l~~~g~~~~~~~~~~~~~~~~~~~da~d~~~~l~t~~~~~i~~~l~~~idp~v~~~~~~~~~~~~l~~v~ 79 (336) |+..+.++ ++.++-... ........ +.. ...+.....+|..+.+ ++++.+........++.+. T Consensus 1 ~~~~~~~~~~~~~f~~~~-----~~~~~~~a---~~~----~~~~~~~~~iP~~~~~----~ii~~~~~~s~l~~~~~~~ 64 (324) T protein:vir:97 1 MEQTQKLKLNLQHFASNN-----VKPQVFNP---DNV----MMHEKKDGTLMNEFTT----PILQEVMENSKIMQLGKYE 64 (324) T ss_pred CccchhHHHHHHHHHHhh-----hhhhhhcc---ccc----cccCCCcceechhHHH----HHHHHHHhhcchhhhccee Confidence 87666655 333321100 00011111 110 0112233345666554 3344444444445555443 Q ss_pred cCCCcceeeEEEeeeecccceEEeecccCCceeeeeeeeeeeeEEEEEEEEEeCHHHHHHHHHhCCCHHHHHHHHHHHHH Q lcl|Aclame:pro 80 KKGDWTTLVAAFITAEPTTTVATYGDYSSDGDSGTNINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASELNYSSALGL 159 (336) Q Consensus 80 t~g~w~~~t~~~~v~e~~G~a~~ygd~~DiP~vd~~~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~K~~aAr~a~ 159 (336) +.+ ..+..+++....+.+..++.+..+|..+..........+.++..+.+|.+-++.+ ..++.+.-.....+++ T Consensus 65 ~~~---~~~~~ip~~~~~~~a~~v~Eg~~~~~~~~~f~~v~~~~~k~~~~~~is~ell~ds---~~~l~~~i~~~l~~ai 138 (324) T protein:vir:97 65 PME---GTEKKFTFWADKPGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYT---YSQFFEEMKPMIAEAF 138 (324) T ss_pred ecc---CCceEEEEEecCcceeEeccCccccccccceeEEEEeeEEEEEeehhhHHHHhcc---hHHHHHHHHHHHHHHH Confidence 332 2346788888888999999999999999999999999999999999998555544 3678888888889999 Q ss_pred HHhhccEEEeeccccc-eEEEEecCCCCcccccccccccccCHHHHHHHHHHHHHHHHHHhCCceeccCCcEEEecHHHH Q lcl|Aclame:pro 160 AKFLNGSYLFGVAGLE-NYGLINDPSLSAPITATTPWSGSPAVEAVVNEVVTLFQVLQTQSQGIITQEAVLHMGLPPTAM 238 (336) Q Consensus 160 e~~~n~i~~~Gd~~~g-~~GllN~Pnl~~~~~~~t~w~~~~T~~eI~~Di~~l~~~l~~~t~g~v~~~~p~tL~Lp~~~~ 238 (336) .+.+++-.+.|++... ..|+++........+. ++. -++||.+++..+... + ..+.+++|.+..+ T Consensus 139 a~~~d~a~l~G~g~~~~~~gi~~~~~~~~~~~~-----~~~----~~~~i~~~~~~l~~~--~----~~~~~~v~n~~~~ 203 (324) T protein:vir:97 139 YKKFDEAGILNQGNNPFGKSIAQSIEKTNKVIK-----GDF----TQDNIIDLEALLEDD--E----LEANAFISKTQNR 203 (324) T ss_pred HHHHHHHhhccCCCCccCccccccccccceecc-----ccC----CHHHHHHHHHhhhhc--c----CCCCEEEEcHHHH Confidence 9999999999987442 3455553322111110 112 256777777766432 1 1356899999999 Q ss_pred Hhccc-CCCCCccHHHHHHHhCC---ccEEEEcccccCCCC-----ceEEEEEEeeCCCceEEEEeCchhh-ccccee-- Q lcl|Aclame:pro 239 SDLSK-TNQYGLSAAAKLKEIFP---KLEFVTIPEYDTASG-----RLVQLWAPRVEGKDTATCGFTEKMR-AHSIER-- 306 (336) Q Consensus 239 ~~Ls~-~~~~~~Tvl~~l~~n~p---nl~i~~~pel~~a~G-----~~~~~~~~~~~~~~~~~~~~p~~~~-~l~~~~-- 306 (336) ..|.+ .+..|..++. -.... ++.++..+-.....| .-.++++-.+.+ .++.+-..-. ...... T Consensus 204 ~~L~~lkd~~g~~~~~--~~~~~tl~G~PV~~~~~~~~~~~~~~~gd~~~~~i~~~~~---~~i~~~~~~~~~~~~~~~~ 278 (324) T protein:vir:97 204 SLLRKIVDPETKERIY--DRNSDTLDGLPVVNLKSSNLKRGELITGDFDKLIYGIPQL---IEYKIDETAQLSTVKNEDG 278 (324) T ss_pred HHHHHhhcCCCceeec--CCCCccccceeeEeecCCCCCcceEEEEecccEEEEEecC---cEEEEeecccccccccccc Confidence 88864 3333332221 11111 122222221111111 111111111111 1111111000 000000 Q ss_pred ------cCCceEEeeecceeeeEEecccceeeeccC Q lcl|Aclame:pro 307 ------YSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) Q Consensus 307 ------~~~~~~v~~~~rt~Gv~ir~P~ai~~~~GI 336 (336) ..-...+-+..|.++ .+++|.||+.+.+. T Consensus 279 ~~~~~f~~d~~~~r~~~r~d~-~v~~~~a~~~l~~~ 313 (324) T protein:vir:97 279 TPVNLFEQDMVALRATMHVAL-HIADDKAFAKLVPA 313 (324) T ss_pred cchhhhhcCcEEEEEEEEecc-EEecccceEEEEec Confidence 001233344556644 55569999999999 No 37 >protein:vir:96392 Length: 324 # NCBI annotation: ORF011 # Family: family:all:507 # MgeID: mge:1613 # MgeName: 53 # Cross-refs: genbank:acc:YP_239648;genbank:gi:66395381;genbank:GeneID:5132868 Probab=97.88 E-value=4.3e-06 Score=50.00 Aligned_cols=294 Identities=10% Similarity=0.022 Sum_probs=151.6 Q ss_pred CchHHHHHHHhhcceeccchhhhhhhhhhhhhhhhhhhcCccccCCcchHHHHHHHhhCceeeeeeccccchhhhccccc Q lcl|Aclame:pro 1 MRDAQRIQNLARAGVILPRSVKNVSTPLAEYAMDAADLSPHLSSTGSSGIPNYLTTYVDPSVIDILVAPMKAAELVGESK 80 (336) Q Consensus 1 m~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~da~d~~~~l~t~~~~~i~~~l~~~idp~v~~~~~~~~~~~~l~~v~t 80 (336) |+.-+..+.-. +.+.... ...-..+|. .. .........||..+.+ +|++.....-....++++.+ T Consensus 1 ~~~~~~~~~~~--~~~~~~~-------~~~~~~~a~-~~-~~~~~~~~~iP~~~~~----~ii~~~~~~s~l~~l~~~~~ 65 (324) T protein:vir:96 1 MEQTQKLKLNL--QHFASNN-------VKPQVFNPD-NV-MMHEKKDGTLMNEFTT----PILQEVMENSKIMQLGKYEP 65 (324) T ss_pred CCcchhhhHHH--HHHHHHh-------hhhhhhccc-cc-cccCcCccccchhHHH----HHHHHHHhhchhhhhcceee Confidence 66444333211 2111011 000011111 00 0112233456665553 44555555555666665544 Q ss_pred CCCcceeeEEEeeeecccceEEeecccCCceeeeeeeeeeeeEEEEEEEEEeCHHHHHHHHHhCCCHHHHHHHHHHHHHH Q lcl|Aclame:pro 81 KGDWTTLVAAFITAEPTTTVATYGDYSSDGDSGTNINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASELNYSSALGLA 160 (336) Q Consensus 81 ~g~w~~~t~~~~v~e~~G~a~~ygd~~DiP~vd~~~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~K~~aAr~a~e 160 (336) ... .++.+++.+..+.+..++....+|..+..........+.++....+|.+=++.+ ..++.+.-.....+++. T Consensus 66 ~~~---~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~~~is~ell~ds---~~~l~~~i~~~la~ai~ 139 (324) T protein:vir:96 66 MEG---TEKKFTFWADKPGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYT---YSQFFEEMKPMIAEAFY 139 (324) T ss_pred ccC---CceEEEEEecCcceeEecCCccccccccceeEEEEeeEEEEEeehhhHHHHhcc---hHHHHHHHHHHHHHHHH Confidence 322 346788888888999999999999999999999999999999999987655544 35788888888888888 Q ss_pred HhhccEEEeeccccc-eEEEEecCCCCcccccccccccccCHHHHHHHHHHHHHHHHHHhCCceeccCCcEEEecHHHHH Q lcl|Aclame:pro 161 KFLNGSYLFGVAGLE-NYGLINDPSLSAPITATTPWSGSPAVEAVVNEVVTLFQVLQTQSQGIITQEAVLHMGLPPTAMS 239 (336) Q Consensus 161 ~~~n~i~~~Gd~~~g-~~GllN~Pnl~~~~~~~t~w~~~~T~~eI~~Di~~l~~~l~~~t~g~v~~~~p~tL~Lp~~~~~ 239 (336) +.+++-+++|+.... ..|+++........+ -++. -++||.+++..+...- ..+..++|.++.+. T Consensus 140 ~~~d~a~l~G~g~~~~~~gi~~~~~~~~~~~-----~~~~----t~~~i~~~~~~l~~~~------~~~~~~vmn~~~~~ 204 (324) T protein:vir:96 140 KKFDEAGILNQGNNPFGKSIAQSIEKTNKVI-----KGDF----TQDNIIDLEALLEDDE------LEANAFISKTQNRS 204 (324) T ss_pred HHHHHHHhccCCCCCcCccccccccccceec-----cccc----cHHHHHHHHHhhhhcc------CCCCEEEEcHHHHH Confidence 889988888886432 345554322211111 0112 3667777777664421 24568999999999 Q ss_pred hcccC-CCCCccHHHHHHHhCCc---cEEEEcccccCCC-----CceEEEEEEeeCCCceEEEEeCchhh-c----c--- Q lcl|Aclame:pro 240 DLSKT-NQYGLSAAAKLKEIFPK---LEFVTIPEYDTAS-----GRLVQLWAPRVEGKDTATCGFTEKMR-A----H--- 302 (336) Q Consensus 240 ~Ls~~-~~~~~Tvl~~l~~n~pn---l~i~~~pel~~a~-----G~~~~~~~~~~~~~~~~~~~~p~~~~-~----l--- 302 (336) .|.+. +..|..++. ...-+. +-++..+-..... |.-.++++-.+.+ ..+.+-..-. . . T Consensus 205 ~L~~l~d~~G~~~~~--~~~~~~l~G~PV~~~~~~~~~~~~~~~gd~~~~~~g~~~~---~~i~~~~~~~~~~~~~~~~~ 279 (324) T protein:vir:96 205 LLRKIVDPETKERIY--DRNSDSLDGLPVVNLKSSNLKRGELITGDFDKLIYGIPQL---IEYKIDETAQLSTVKNEDGT 279 (324) T ss_pred HHHHhhccCCCeeec--CCCCCcccceeeEeeCCCCCCcceEEEEecceEEEEEecC---cEEEEeeccccccccccccc Confidence 88643 333332221 011111 1222211111111 1111111111111 1111111000 0 0 Q ss_pred cce-ecCCceEEeeecceeeeEEecccceeeeccC Q lcl|Aclame:pro 303 SIE-RYSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) Q Consensus 303 ~~~-~~~~~~~v~~~~rt~Gv~ir~P~ai~~~~GI 336 (336) +.. ...-...+-+..|. |+.+++|.||+++.|. T Consensus 280 ~~~~f~~d~~~~r~~~r~-d~~v~~~~A~~~l~~a 313 (324) T protein:vir:96 280 PVNLFEQDMVALRATMHV-ALHIADDKAFAKLVPA 313 (324) T ss_pred chhhhhcCcEEEEEEEEE-ccEEecccceEEEecc Confidence 000 00112444555666 5556669999999999 No 38 >protein:vir:78830 Length: 324 # NCBI annotation: major head protein # Family: family:all:507 # MgeID: mge:1858 # MgeName: 80alpha # Cross-refs: genbank:acc:YP_001285361;genbank:gi:148717889;genbank:GeneID:5246961 Probab=97.88 E-value=4.3e-06 Score=50.00 Aligned_cols=294 Identities=10% Similarity=0.022 Sum_probs=151.6 Q ss_pred CchHHHHHHHhhcceeccchhhhhhhhhhhhhhhhhhhcCccccCCcchHHHHHHHhhCceeeeeeccccchhhhccccc Q lcl|Aclame:pro 1 MRDAQRIQNLARAGVILPRSVKNVSTPLAEYAMDAADLSPHLSSTGSSGIPNYLTTYVDPSVIDILVAPMKAAELVGESK 80 (336) Q Consensus 1 m~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~da~d~~~~l~t~~~~~i~~~l~~~idp~v~~~~~~~~~~~~l~~v~t 80 (336) |+.-+..+.-. +.+.... ...-..+|. .. .........||..+.+ +|++.....-....++++.+ T Consensus 1 ~~~~~~~~~~~--~~~~~~~-------~~~~~~~a~-~~-~~~~~~~~~iP~~~~~----~ii~~~~~~s~l~~l~~~~~ 65 (324) T protein:vir:78 1 MEQTQKLKLNL--QHFASNN-------VKPQVFNPD-NV-MMHEKKDGTLMNEFTT----PILQEVMENSKIMQLGKYEP 65 (324) T ss_pred CCcchhhhHHH--HHHHHHh-------hhhhhhccc-cc-cccCcCccccchhHHH----HHHHHHHhhchhhhhcceee Confidence 66444333211 2111011 000011111 00 0112233456665553 44555555555666665544 Q ss_pred CCCcceeeEEEeeeecccceEEeecccCCceeeeeeeeeeeeEEEEEEEEEeCHHHHHHHHHhCCCHHHHHHHHHHHHHH Q lcl|Aclame:pro 81 KGDWTTLVAAFITAEPTTTVATYGDYSSDGDSGTNINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASELNYSSALGLA 160 (336) Q Consensus 81 ~g~w~~~t~~~~v~e~~G~a~~ygd~~DiP~vd~~~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~K~~aAr~a~e 160 (336) ... .++.+++.+..+.+..++....+|..+..........+.++....+|.+=++.+ ..++.+.-.....+++. T Consensus 66 ~~~---~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~~~is~ell~ds---~~~l~~~i~~~la~ai~ 139 (324) T protein:vir:78 66 MEG---TEKKFTFWADKPGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYT---YSQFFEEMKPMIAEAFY 139 (324) T ss_pred ccC---CceEEEEEecCcceeEecCCccccccccceeEEEEeeEEEEEeehhhHHHHhcc---hHHHHHHHHHHHHHHHH Confidence 322 346788888888999999999999999999999999999999999987655544 35788888888888888 Q ss_pred HhhccEEEeeccccc-eEEEEecCCCCcccccccccccccCHHHHHHHHHHHHHHHHHHhCCceeccCCcEEEecHHHHH Q lcl|Aclame:pro 161 KFLNGSYLFGVAGLE-NYGLINDPSLSAPITATTPWSGSPAVEAVVNEVVTLFQVLQTQSQGIITQEAVLHMGLPPTAMS 239 (336) Q Consensus 161 ~~~n~i~~~Gd~~~g-~~GllN~Pnl~~~~~~~t~w~~~~T~~eI~~Di~~l~~~l~~~t~g~v~~~~p~tL~Lp~~~~~ 239 (336) +.+++-+++|+.... ..|+++........+ -++. -++||.+++..+...- ..+..++|.++.+. T Consensus 140 ~~~d~a~l~G~g~~~~~~gi~~~~~~~~~~~-----~~~~----t~~~i~~~~~~l~~~~------~~~~~~vmn~~~~~ 204 (324) T protein:vir:78 140 KKFDEAGILNQGNNPFGKSIAQSIEKTNKVI-----KGDF----TQDNIIDLEALLEDDE------LEANAFISKTQNRS 204 (324) T ss_pred HHHHHHHhccCCCCCcCccccccccccceec-----cccc----cHHHHHHHHHhhhhcc------CCCCEEEEcHHHHH Confidence 889988888886432 345554322211111 0112 3667777777664421 24568999999999 Q ss_pred hcccC-CCCCccHHHHHHHhCCc---cEEEEcccccCCC-----CceEEEEEEeeCCCceEEEEeCchhh-c----c--- Q lcl|Aclame:pro 240 DLSKT-NQYGLSAAAKLKEIFPK---LEFVTIPEYDTAS-----GRLVQLWAPRVEGKDTATCGFTEKMR-A----H--- 302 (336) Q Consensus 240 ~Ls~~-~~~~~Tvl~~l~~n~pn---l~i~~~pel~~a~-----G~~~~~~~~~~~~~~~~~~~~p~~~~-~----l--- 302 (336) .|.+. +..|..++. ...-+. +-++..+-..... |.-.++++-.+.+ ..+.+-..-. . . T Consensus 205 ~L~~l~d~~G~~~~~--~~~~~~l~G~PV~~~~~~~~~~~~~~~gd~~~~~~g~~~~---~~i~~~~~~~~~~~~~~~~~ 279 (324) T protein:vir:78 205 LLRKIVDPETKERIY--DRNSDSLDGLPVVNLKSSNLKRGELITGDFDKLIYGIPQL---IEYKIDETAQLSTVKNEDGT 279 (324) T ss_pred HHHHhhccCCCeeec--CCCCCcccceeeEeeCCCCCCcceEEEEecceEEEEEecC---cEEEEeeccccccccccccc Confidence 88643 333332221 011111 1222211111111 1111111111111 1111111000 0 0 Q ss_pred cce-ecCCceEEeeecceeeeEEecccceeeeccC Q lcl|Aclame:pro 303 SIE-RYSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) Q Consensus 303 ~~~-~~~~~~~v~~~~rt~Gv~ir~P~ai~~~~GI 336 (336) +.. ...-...+-+..|. |+.+++|.||+++.|. T Consensus 280 ~~~~f~~d~~~~r~~~r~-d~~v~~~~A~~~l~~a 313 (324) T protein:vir:78 280 PVNLFEQDMVALRATMHV-ALHIADDKAFAKLVPA 313 (324) T ss_pred chhhhhcCcEEEEEEEEE-ccEEecccceEEEecc Confidence 000 00112444555666 5556669999999999 No 39 >protein:vir:2430 Length: 318 # NCBI annotation: major head subunit # Family: family:all:507 # MgeID: mge:52 # MgeName: D29 # Cross-refs: genbank:acc:NP_046832;genbank:gi:9630400;genbank:GeneID:1261582 Probab=97.87 E-value=2.5e-06 Score=51.25 Aligned_cols=289 Identities=11% Similarity=-0.001 Sum_probs=147.7 Q ss_pred HhhcceeccchhhhhhhhhhhhhhhhhhhcCccccCCcchHHHHHHHhhCceeeeeeccccchhhhcccccCCCcceeeE Q lcl|Aclame:pro 10 LARAGVILPRSVKNVSTPLAEYAMDAADLSPHLSSTGSSGIPNYLTTYVDPSVIDILVAPMKAAELVGESKKGDWTTLVA 89 (336) Q Consensus 10 l~~~g~~~~~~~~~~~~~~~~~~~da~d~~~~l~t~~~~~i~~~l~~~idp~v~~~~~~~~~~~~l~~v~t~g~w~~~t~ 89 (336) |++ |-.| ..+.+.++.- -++...+.+|..+.+ ++++.+.+......+..+...+ ..+. T Consensus 1 ~~~-~~~~-------~~e~~~~~~~-------~~~~~~~~ip~~~~~----~ii~~~~~~~~l~~~~~~~~~~---~~~~ 58 (318) T protein:vir:24 1 MAA-GTAF-------AVDHAQIAQT-------GDTMFKGYLEPEQAK----DYFAEAEKTSIVQQFAQKVPMG---TTGQ 58 (318) T ss_pred CCC-CCCC-------CHHHHHhhcc-------cCcccceeechhHHH----HHHHHHHhhchhhhhcceeecc---CCce Confidence 111 2111 1111211110 012222335665554 3344444444445555443322 2346 Q ss_pred EEeeeecccceEEeecccCCceeeeeeeeeeeeEEEEEEEEEeCHHHHHHHHHhCCCHHHHHHHHHHHHHHHhhccEEEe Q lcl|Aclame:pro 90 AFITAEPTTTVATYGDYSSDGDSGTNINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASELNYSSALGLAKFLNGSYLF 169 (336) Q Consensus 90 ~~~v~e~~G~a~~ygd~~DiP~vd~~~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~K~~aAr~a~e~~~n~i~~~ 169 (336) .+++....+.+..++....+|..+...+...-..+.++....+|.+-++.+ ..++.+.-.....+++.+.+++-+++ T Consensus 59 ~ip~~~~~~~a~~v~Eg~~~~~~~~~f~~i~~~~~k~~~~~~iS~e~l~ds---~~~~~~~i~~~l~~~~~~~~d~a~l~ 135 (318) T protein:vir:24 59 KIPHWVGDVSAQWIGEGDMKPITKGNMTSQTIAPHKIATIFVASAETVRAN---PANYLGTMRTKVATAFAMAFDGAAMH 135 (318) T ss_pred EEEEEeCCcceEEecCCccccccccceeEEEEeeEEEEEeehhhHHHhhcC---hHHHHHHHHHHHHHHHHHHHHHhhhc Confidence 777878888899999999999999888888888899999999887655543 35788888899999999999999999 Q ss_pred eccccceEEEEecCCCCcccccccccccccCHHHHHHHHHHHHHHHHHHhCCceeccCCcEEEecHHHHHhccc-CCCCC Q lcl|Aclame:pro 170 GVAGLENYGLINDPSLSAPITATTPWSGSPAVEAVVNEVVTLFQVLQTQSQGIITQEAVLHMGLPPTAMSDLSK-TNQYG 248 (336) Q Consensus 170 Gd~~~g~~GllN~Pnl~~~~~~~t~w~~~~T~~eI~~Di~~l~~~l~~~t~g~v~~~~p~tL~Lp~~~~~~Ls~-~~~~~ 248 (336) |+....-.|+++...... .+..+. ..+. ..+++.+++..+...- ..+..++|.++.+..|.+ .+..| T Consensus 136 G~g~~~~~~~~~~~~~~~-~~~~~~--~~~~---~~~~~~~~~~~~~~~~------~~~~~~v~n~~~~~~L~~lkd~~G 203 (318) T protein:vir:24 136 GTDSPFPTYIGQTTKAIS-IADTTG--ATTV---YDQVAVNGLSLLVNDG------KKWTHTLLDDITEPILNGAKDQNG 203 (318) T ss_pred ccCCCCCccccccccccc-cccccc--ccch---HHHHHHHHHHhhcccc------CCCCEEEEcHHHHHHHHHhhccCC Confidence 997655556665322110 111110 1111 1123334443332211 235689999999998864 34444 Q ss_pred ccHHHHHHHh-----CCccEEEEccccc--CC-CCceEEEEEEee----CCCceEEEEeCchhhcc-cc----e----ec Q lcl|Aclame:pro 249 LSAAAKLKEI-----FPKLEFVTIPEYD--TA-SGRLVQLWAPRV----EGKDTATCGFTEKMRAH-SI----E----RY 307 (336) Q Consensus 249 ~Tvl~~l~~n-----~pnl~i~~~pel~--~a-~G~~~~~~~~~~----~~~~~~~~~~p~~~~~l-~~----~----~~ 307 (336) ..++.-...+ +...++...|-.- .. .|....++.+-. ....-..+.+....... .. . .. T Consensus 204 ~~l~~~~~~~~~~~~~~~~~i~g~pv~~~~~~~~~~~~~~~gdfs~~~~~~~~~l~i~~~~~~~~~~~~~~~~~~~~~f~ 283 (318) T protein:vir:24 204 RPLFIESTYGEAASPFRSGRIVARPTILSDHVVEGTTVGFMGDFSQLIWGQIGGLSFDVTDQATLNLGTVESPNFVSLWQ 283 (318) T ss_pred ceeecCccccCccccccCceEEEEeeEEeCCCCCCccEEEEeecceEEEEEecCeEEEEeeccceeccccccccchhhhh Confidence 4332211111 1112333333331 11 222222221110 00000111111111100 00 0 11 Q ss_pred CCceEEeeecceeeeEEecccceeeeccC Q lcl|Aclame:pro 308 SSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) Q Consensus 308 ~~~~~v~~~~rt~Gv~ir~P~ai~~~~GI 336 (336) .-...+-+..|. |+.+++|.||+.+.++ T Consensus 284 ~~~~~~r~~~r~-d~~v~~~~a~~~i~~~ 311 (318) T protein:vir:24 284 HNLVAVRVEAEY-AFHCNDAEAFVALTNV 311 (318) T ss_pred cCcEEEEEEEEE-ccEEecccceEEEEee Confidence 123455666676 4556889999999999 No 40 >protein:vir:103955 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1662 # MgeName: phiNM # Cross-refs: genbank:acc:YP_873992;genbank:gi:118430767;genbank:GeneID:4525449 Probab=97.85 E-value=4.8e-06 Score=49.76 Aligned_cols=289 Identities=9% Similarity=-0.007 Sum_probs=150.3 Q ss_pred CchHHHHH-HHhhcceeccchhhhhhhhhhhhhhhhhhhcCcccc-CCcchHHHHHHHhhCceeeeeeccccchhhhccc Q lcl|Aclame:pro 1 MRDAQRIQ-NLARAGVILPRSVKNVSTPLAEYAMDAADLSPHLSS-TGSSGIPNYLTTYVDPSVIDILVAPMKAAELVGE 78 (336) Q Consensus 1 m~~~~~~~-~l~~~g~~~~~~~~~~~~~~~~~~~da~d~~~~l~t-~~~~~i~~~l~~~idp~v~~~~~~~~~~~~l~~v 78 (336) |+.-+..+ ++.++--.+.... .+. | ...+++ .+.+.+|..+.+ +|++.+...-...+++++ T Consensus 1 ~~~~~~~~~~~~~f~~~~~~~~-----~~~-----a---~~~~~~~~~~~liP~~~~~----~ii~~~~~~s~l~~~~~~ 63 (324) T protein:vir:10 1 MEQTQKLKLNLQHFASNNVKPQ-----VFN-----P---DNVMMHEKKDGTLLNDFTT----PILQEVMENSKIMQLGKY 63 (324) T ss_pred CCCchHHHHHHHHHHHHhhccc-----eec-----c---cceeccCCCcceechhHHH----HHHHHHHhhchhhhhcce Confidence 66544444 3333111100000 000 0 111111 112234554443 333333333344555554 Q ss_pred ccCCCcceeeEEEeeeecccceEEeecccCCceeeeeeeeeeeeEEEEEEEEEeCHHHHHHHHHhCCCHHHHHHHHHHHH Q lcl|Aclame:pro 79 SKKGDWTTLVAAFITAEPTTTVATYGDYSSDGDSGTNINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASELNYSSALG 158 (336) Q Consensus 79 ~t~g~w~~~t~~~~v~e~~G~a~~ygd~~DiP~vd~~~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~K~~aAr~a 158 (336) .+.+. .++.|++.+..+.+..++.+..+|..+.......-..+.++..+.+|.+-++.+ ..++.+.-.....++ T Consensus 64 ~~~~~---~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds---~~~l~~~i~~~l~~a 137 (324) T protein:vir:10 64 EPMEG---TEKKFTFWADKPGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYT---YSQFFEEMKPMIAEA 137 (324) T ss_pred eeccC---CceEEEEEeCCcceeEeccCccccccccceeEEEEeeEEEEEeehhhHHHHhcc---hHHHHHHHHHHHHHH Confidence 43332 346788888888999999999999999999999999999999999988666544 357888888888888 Q ss_pred HHHhhccEEEeeccccc-eEEEEecCCCCcccccccccccccCHHHHHHHHHHHHHHHHHHhCCceeccCCcEEEecHHH Q lcl|Aclame:pro 159 LAKFLNGSYLFGVAGLE-NYGLINDPSLSAPITATTPWSGSPAVEAVVNEVVTLFQVLQTQSQGIITQEAVLHMGLPPTA 237 (336) Q Consensus 159 ~e~~~n~i~~~Gd~~~g-~~GllN~Pnl~~~~~~~t~w~~~~T~~eI~~Di~~l~~~l~~~t~g~v~~~~p~tL~Lp~~~ 237 (336) +.+.+++-.++|+.... ..|+++..... + .... ...-++||.+++..+... + ..+.+++|.++. T Consensus 138 i~~~~d~a~l~G~g~~~~~~~i~~~~~~~------~-~~~~--~~~t~~~i~~~~~~l~~~--~----~~~~~~v~n~~~ 202 (324) T protein:vir:10 138 FYKKFDEAGILNQGNNPFGKSIAQSIEKT------N-KVIK--GDFTQDNIIDLEALLEDD--E----LEANAFISKTQN 202 (324) T ss_pred HHHHHHHHhhhcCCCCccCcccccccccc------c-eecc--ccCCHHHHHHHHHhhhhc--c----CCCCEEEEcHHH Confidence 88888988888886542 23444422111 1 1111 112367777787777442 1 246689999999 Q ss_pred HHhcccC-CCCCccHHHHHHHhCCc---cEEEEcccccCCCCceEEEEEE--------eeCCCceEEEEeCchhhcccc- Q lcl|Aclame:pro 238 MSDLSKT-NQYGLSAAAKLKEIFPK---LEFVTIPEYDTASGRLVQLWAP--------RVEGKDTATCGFTEKMRAHSI- 304 (336) Q Consensus 238 ~~~Ls~~-~~~~~Tvl~~l~~n~pn---l~i~~~pel~~a~G~~~~~~~~--------~~~~~~~~~~~~p~~~~~l~~- 304 (336) +..|.+- +..|..++ .-.+... +.++..+-. ..+...+++- .+.+ ..+.+-..-..... T Consensus 203 ~~~L~~l~d~~g~~~~--~~~~~~~l~G~PV~~~~~~---~~~~~~~~~gd~~~~~~~~~~~---~~i~~~~~~~~~~~~ 274 (324) T protein:vir:10 203 RSLLRKIVDPETKERI--YDRNSDTLDGLPVVNLKSS---NLKRGELITGDFDKLIYGIPQL---IEYKIDETAQLSTVK 274 (324) T ss_pred HHHHHHhhccCCceee--cCCCCccccceeEEeecCC---CCCcceEEEEecccEEEEEecC---cEEEEeecccccccc Confidence 9988642 33333221 1111111 222222211 1111112211 1111 11111111000000 Q ss_pred ----e----ecCCceEEeeecceeeeEEecccceeeeccC Q lcl|Aclame:pro 305 ----E----RYSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) Q Consensus 305 ----~----~~~~~~~v~~~~rt~Gv~ir~P~ai~~~~GI 336 (336) . ...-...+-+..|+++.++ +|.||+++.|. T Consensus 275 ~~~~~~~~~~~~~~~~~r~~~r~d~~v~-~~~A~~~l~~a 313 (324) T protein:vir:10 275 NEDGTPVNLFEQDMVALRATMHVALHIA-DDKAFAKLVPA 313 (324) T ss_pred cccccchhhhhcCcEEEEEEEEEccEEe-cccceEEEEec Confidence 0 1122355566677765555 69999999999 No 41 >protein:vir:78223 Length: 333 # NCBI annotation: Putative major head protein # Family: family:all:966 # MgeID: mge:1849 # MgeName: Bethlehem # Cross-refs: genbank:acc:YP_001491666;genbank:gi:157786490;genbank:GeneID:5625701 Probab=97.84 E-value=2e-06 Score=51.85 Aligned_cols=292 Identities=9% Similarity=0.037 Sum_probs=149.9 Q ss_pred CchHHHHHHHhhcceeccchhhhhhhhhhhhhhhhhhhcCccccCCcchHHHHHHHhhCceeeeeeccccchhhhccccc Q lcl|Aclame:pro 1 MRDAQRIQNLARAGVILPRSVKNVSTPLAEYAMDAADLSPHLSSTGSSGIPNYLTTYVDPSVIDILVAPMKAAELVGESK 80 (336) Q Consensus 1 m~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~da~d~~~~l~t~~~~~i~~~l~~~idp~v~~~~~~~~~~~~l~~v~t 80 (336) |- . -++.+.+.+= ...++.+.......+|..+.+ +|++.+.+.-...++..+.+ T Consensus 1 ~a---~------------------l~el~~~~~~-~~~~g~~~~~~~~liP~~~~~----~ii~~l~~~s~l~~~~~~~~ 54 (333) T protein:vir:78 1 MA---T------------------LNELLPNSAG-SNHQGRLAHVPSDLLPKEIVG----PIFDKAQESSLVLRMGEQIP 54 (333) T ss_pred Cc---h------------------hHHhhhhccc-ccccCceecCCccccchhHHH----HHHHHHHhhchhhhhcceee Confidence 11 1 1223333221 122333333334456665553 34444444444444444433 Q ss_pred CCCcceeeEEEeeeecccc--------eEEeecccCCceeeeeeeeeeeeEEEEEEEEEeCHHHHHHHHHhCCCHHHHHH Q lcl|Aclame:pro 81 KGDWTTLVAAFITAEPTTT--------VATYGDYSSDGDSGTNINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASELN 152 (336) Q Consensus 81 ~g~w~~~t~~~~v~e~~G~--------a~~ygd~~DiP~vd~~~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~K~ 152 (336) .+. ....+++...... +...++...+|..+..........+.++....+|.+=++. ...++.+.-+ T Consensus 55 ~~~---~~~~~p~~~~~~~a~~v~eg~~~~~~e~~~~~~~~~~f~~i~l~~~kl~~~~~is~ell~~---s~~~~~~~i~ 128 (333) T protein:vir:78 55 ISY---GETIIPTTVKRPEVGQVGVGTSNEQREGGLKPLSGTAWDTRSVSPIKLATIVTVSEEFARM---NPSGLYTKLQ 128 (333) T ss_pred ccC---CceEEEEEeCCceeEeecCcccccccccccccccccceeEEEEeeEEEEEeehhhHHHHhc---CHHHHHHHHH Confidence 221 2234555444433 3444556677888888888888999999999998744443 3356888888 Q ss_pred HHHHHHHHHhhccEEEeeccc---cceEEEEecCCCCcccccccccccccCHHHHHHHHHHHHHHHHHHhCCceeccCCc Q lcl|Aclame:pro 153 YSSALGLAKFLNGSYLFGVAG---LENYGLINDPSLSAPITATTPWSGSPAVEAVVNEVVTLFQVLQTQSQGIITQEAVL 229 (336) Q Consensus 153 ~aAr~a~e~~~n~i~~~Gd~~---~g~~GllN~Pnl~~~~~~~t~w~~~~T~~eI~~Di~~l~~~l~~~t~g~v~~~~p~ 229 (336) ....+++.+.+++-.+.|+.. .+..|+++...+...... .. ...+.+..++||.+++..+..... ..+. T Consensus 129 ~~la~ai~~~~d~~~l~G~g~~~~~~~~g~~~~~~~~~~~~~-~~--~~~~~~~~~~~i~~~~~~~~~~~~-----~~~~ 200 (333) T protein:vir:78 129 GDLAYAIGRGIDLAVFHGKSPLTGSALQGIDTDNVIANTTNV-DY--LQETGDPLLDRLLDGYDLVSANTD-----VEFN 200 (333) T ss_pred HHHHHHHHHHHHHHHhcccCCCCCcccccccccccccccccc-cc--cccccchhHHHHHHHHHhhccccc-----cCce Confidence 888999999999999999974 566777776554332111 11 112234467888888776644321 2355 Q ss_pred EEEecHHHHHhccc----CCCCCccHHHHHHHh-----CCccEEEEccccc----CCCCceEEEEEEeeCCCceEEEEeC Q lcl|Aclame:pro 230 HMGLPPTAMSDLSK----TNQYGLSAAAKLKEI-----FPKLEFVTIPEYD----TASGRLVQLWAPRVEGKDTATCGFT 296 (336) Q Consensus 230 tL~Lp~~~~~~Ls~----~~~~~~Tvl~~l~~n-----~pnl~i~~~pel~----~a~G~~~~~~~~~~~~~~~~~~~~p 296 (336) .++|.|..+..|.+ .+..|.-++...... .-++.++....+. .+.++...+++-+.. + ..+.+. T Consensus 201 ~~vmn~~~~~~L~~~~~~~d~~G~~i~~~~~~~~~~~~l~G~Pv~~~~~i~~~~~~~~~~~~~~~~gD~~--~-~~~g~~ 277 (333) T protein:vir:78 201 GWAVDPRFRAHLLRAQAYRDANGNVDPSRINLAAQTGDVLGLPAQFGRAVGGDLGAAVDSKTRIIGGDFS--Q-LKFGFA 277 (333) T ss_pred EEEEcchHHHHHHHHhhhcCCCCceeecCccccCCCceeeceeeEEccccCCCccccCCCccEEEEEecc--c-EEEEEe Confidence 78888887766632 233344343322111 1122233222222 122233333332221 1 111111 Q ss_pred ch--hhcccc---e---------ecCCceEEeeecceeeeEEecccceeeeccC Q lcl|Aclame:pro 297 EK--MRAHSI---E---------RYSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) Q Consensus 297 ~~--~~~l~~---~---------~~~~~~~v~~~~rt~Gv~ir~P~ai~~~~GI 336 (336) .. +...+. . ...-...+-++.|.+ +.|+.|.||+++.+- T Consensus 278 ~~~~i~~~~~~~~~~~~~~~~~~~~~~~v~~r~~~r~d-~~v~~~~a~~~l~~~ 330 (333) T protein:vir:78 278 DEIRIKMSDTATLTDSGSATVSMWQTNQIAILIEVTFG-WLLGDKQAFVKFVDD 330 (333) T ss_pred eccEEEEeccccccccccceeehhhcCcEEEEEEEEEc-cEEecccceEEEecc Confidence 11 111110 0 001112334555554 456999999999998 No 42 >protein:vir:78523 Length: 338 # NCBI annotation: Putative head structural protein # Family: family:all:507 # MgeID: mge:1853 # MgeName: U2 # Cross-refs: genbank:acc:YP_001491585;genbank:gi:157786408;genbank:GeneID:5625675 Probab=97.82 E-value=4.6e-06 Score=49.86 Aligned_cols=295 Identities=10% Similarity=0.038 Sum_probs=160.2 Q ss_pred CchHHHHHHHhhcceeccchhhhhhhhhhhhhhhhhhhcCccccCCcchHHHHHHHhhCceeeeeeccccchhhhccccc Q lcl|Aclame:pro 1 MRDAQRIQNLARAGVILPRSVKNVSTPLAEYAMDAADLSPHLSSTGSSGIPNYLTTYVDPSVIDILVAPMKAAELVGESK 80 (336) Q Consensus 1 m~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~da~d~~~~l~t~~~~~i~~~l~~~idp~v~~~~~~~~~~~~l~~v~t 80 (336) |- .-++++.+.+-. +.++.+++.+...+|..+.+ +|++.+........+.++.. T Consensus 1 ~~---------------------~~~e~~~~~~~~-~~~~~~~~~~~~liP~~~~~----~ii~~~~~~s~l~~l~~~~~ 54 (338) T protein:vir:78 1 MA---------------------TLNELAPNTAGS-NHQGRLAHVPSDLLPKEIVG----PIFDKAQESSLVLRLGENIP 54 (338) T ss_pred Cc---------------------chHHhhhhhccc-ccccceecccccccchHHHH----HHHHHHHhhchhhhhcceee Confidence 11 113445555443 33344455555667777764 44555555555666666544 Q ss_pred CCCcceeeEEEeeeec--------ccceEEeecccCCceeeeeeeeeeeeEEEEEEEEEeCHHHHHHHHHhCCCHHHHHH Q lcl|Aclame:pro 81 KGDWTTLVAAFITAEP--------TTTVATYGDYSSDGDSGTNINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASELN 152 (336) Q Consensus 81 ~g~w~~~t~~~~v~e~--------~G~a~~ygd~~DiP~vd~~~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~K~ 152 (336) .+. ....+++.+. .+.+...+....+|..+...+......+.++....+|.+=++. ...++.+.-. T Consensus 55 ~~~---~~~~ip~~~~~~~a~~v~~~~~~~~~Eg~~~~~~~~~f~~v~l~~~k~~~~~~is~ell~d---s~~~~~~~i~ 128 (338) T protein:vir:78 55 ISY---GETIIPTTVKRPEVGQVGVGTSNEQREGGTKPLSGTAWDTRSVAPIKLATIVTVSEEFARM---NPSGLYTKLQ 128 (338) T ss_pred ccC---CceEEEEEecCccceeecccccccccccccccccccceeEEEEEEEEEEEeehhhHHHHhc---CHHHHHHHHH Confidence 332 2344554432 2445566777888999988888888889999888888754443 3367888888 Q ss_pred HHHHHHHHHhhccEEEeeccc---cceEEEEecCCCCcccccccccccccCHHHHHHHHHHHHHHHHHHhCCceeccCCc Q lcl|Aclame:pro 153 YSSALGLAKFLNGSYLFGVAG---LENYGLINDPSLSAPITATTPWSGSPAVEAVVNEVVTLFQVLQTQSQGIITQEAVL 229 (336) Q Consensus 153 ~aAr~a~e~~~n~i~~~Gd~~---~g~~GllN~Pnl~~~~~~~t~w~~~~T~~eI~~Di~~l~~~l~~~t~g~v~~~~p~ 229 (336) ...++++.+.+++-.+.|+.. .+..|++++..+...++....+ +..+..++++.+++..+...... .+. T Consensus 129 ~~la~a~~~~~d~~~l~G~g~~~~~~~~gi~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~-----~~~ 200 (338) T protein:vir:78 129 ADLAYAIGRGIDLAVFHGKSPLTGSALQGIDTNNVIVNTTNVDYLQ---TGTTPLLDRFLDGYDLVSANTDV-----DFN 200 (338) T ss_pred HHHHHHHHHHHHHHhhcccCCCcccccccccccccccccccccccc---ccchhhHHHHHHHHHHhhhhccc-----cce Confidence 889999999999999999974 4567777765554322222211 22355788888888877544321 355 Q ss_pred EEEecHHHHHhccc----CCCCCccHHHHHHHh-CC----ccEEE---EcccccC-CCCceEEEEEEeeCC-----CceE Q lcl|Aclame:pro 230 HMGLPPTAMSDLSK----TNQYGLSAAAKLKEI-FP----KLEFV---TIPEYDT-ASGRLVQLWAPRVEG-----KDTA 291 (336) Q Consensus 230 tL~Lp~~~~~~Ls~----~~~~~~Tvl~~l~~n-~p----nl~i~---~~pel~~-a~G~~~~~~~~~~~~-----~~~~ 291 (336) .++|.+..+..|.+ .+..|.-++.-.... -| ++-++ .+|.-.+ +.++...+++-+... ..-. T Consensus 201 ~~~m~~~~~~~L~~~~~l~d~~g~~l~~~~~~~~~~~~l~G~PV~~~~~ip~~~~~~~~~~~~~~~gdfs~~~~~~~~~~ 280 (338) T protein:vir:78 201 GWAADPRYRARLLRSQAYRDANGNVDPTRINLAASAGDLLGLPVQFGKAVGGDLGAATDSKVRVVGGDFSQLKYGFADEI 280 (338) T ss_pred EEEEchHHHHHHHHHhhhccCCCceeecccccCCCCceeeeeeEEEccccCccccccCCcccEEEEEecceEEEEeeccc Confidence 79999888776632 233333222111111 11 12222 2333222 233333333322100 0001 Q ss_pred EEEe-Cchhhcccc----eec----CCceEEeeecceeeeEEecccceeeeccC Q lcl|Aclame:pro 292 TCGF-TEKMRAHSI----ERY----SSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) Q Consensus 292 ~~~~-p~~~~~l~~----~~~----~~~~~v~~~~rt~Gv~ir~P~ai~~~~GI 336 (336) .+.+ ++.-..... +.. .-...+-|+.|. |+.+.+|.||+++... T Consensus 281 ~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~r~-d~~v~~~~a~~~l~~~ 333 (338) T protein:vir:78 281 RVKMSDTATLTDNTSPTPQTVSMWQTNQIAILIEVTF-GWLLGDKQAFVKFVDD 333 (338) T ss_pred EEEEeecccccccccccccchhhhhcCcEEEEEEEEe-ccEeecccceEEEecc Confidence 1111 110000000 000 012334555555 4567789999999999 No 43 >protein:vir:9759 Length: 303 # NCBI annotation: putative structural protein # Family: family:all:966 # MgeID: mge:175 # MgeName: 315.3 # Cross-refs: genbank:acc:NP_795521;genbank:gi:28876283;genbank:GeneID:1257824 Probab=97.80 E-value=2.5e-06 Score=51.26 Aligned_cols=281 Identities=10% Similarity=-0.020 Sum_probs=146.1 Q ss_pred hhhhhhhhcCccccCCcchHHHHHHHhhCceeeeeeccccchhhhcccccCCCcceeeEEEeeeecccceEEeecccCCc Q lcl|Aclame:pro 31 YAMDAADLSPHLSSTGSSGIPNYLTTYVDPSVIDILVAPMKAAELVGESKKGDWTTLVAAFITAEPTTTVATYGDYSSDG 110 (336) Q Consensus 31 ~~~da~d~~~~l~t~~~~~i~~~l~~~idp~v~~~~~~~~~~~~l~~v~t~g~w~~~t~~~~v~e~~G~a~~ygd~~DiP 110 (336) |+-+ +.....||..+.+ +|++.+.+......+.++...+. .+..+++....+.|..++....+| T Consensus 1 m~t~---------t~gg~liP~~~~~----~ii~~l~~~s~i~~l~~~~~~~~---~~~~ip~~~~~~~a~wv~E~~~~~ 64 (303) T protein:vir:97 1 MGTE---------TSKASLFDKHLVS----DLINKVKGHSSLAKLSSQKPIPF---NGSKEFTFTLDSDIDVVAENGKKT 64 (303) T ss_pred Cccc---------CCCCeEcchhHHH----HHHHHHHhhchhhhhcceeecCC---CceEEEEEecCcceEEeecCcccc Confidence 3322 1122234554442 33444444444555555443322 346778888888999999999999 Q ss_pred eeeeeeeeeeeeEEEEEEEEEeCHHHHHHHHHhCCCHHHHHHHHHHHHHHHhhccEEEeeccccceEEEEecCC-CCccc Q lcl|Aclame:pro 111 DSGTNINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASELNYSSALGLAKFLNGSYLFGVAGLENYGLINDPS-LSAPI 189 (336) Q Consensus 111 ~vd~~~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~K~~aAr~a~e~~~n~i~~~Gd~~~g~~GllN~Pn-l~~~~ 189 (336) ..+........+.+.++..+.+|.+=++.......++.+.-.....+++.+.++.-.++|+....-.+...-+. ..... T Consensus 65 ~s~~~f~~v~l~~~kl~~~~~iS~ell~~~~d~~~~l~~~i~~~la~a~~~~ld~a~l~G~~~~~g~~~~~~~~~~~~~~ 144 (303) T protein:vir:97 65 HGGLSLEPVTIVPIKVEYGARLSDEFLYATEEEKIDILKAFNEGFAKKLARGIDLMAMHGINPRTKKASDVIGTNHFDSK 144 (303) T ss_pred ccccceeeEEeeeEEEEEeehhhHHHhhcCccchHHHHHHHHHHHHHHHHHHHHhhhhcccccCCccccccccccccccc Confidence 99999888898999999999888664444445677888889999999999999999999964322222111110 00000 Q ss_pred ccccccccccCHHHHHHHHHHHHHHHHHHhCCceeccCCcEEEecHHHHHhccc-CCCCCccHH-HHHHHh-----CCcc Q lcl|Aclame:pro 190 TATTPWSGSPAVEAVVNEVVTLFQVLQTQSQGIITQEAVLHMGLPPTAMSDLSK-TNQYGLSAA-AKLKEI-----FPKL 262 (336) Q Consensus 190 ~~~t~w~~~~T~~eI~~Di~~l~~~l~~~t~g~v~~~~p~tL~Lp~~~~~~Ls~-~~~~~~Tvl-~~l~~n-----~pnl 262 (336) +.... ..++.+..++||.+++..+... + ..+..++|.++.+..|.+ .+..|.-++ .-+... .-++ T Consensus 145 ~~~~~--~~~~~~~~~~~i~~~~~~~~~~-~-----~~~~~~vmn~~~~~~L~~lkd~~g~~~~~~~~~~~~~~~~l~G~ 216 (303) T protein:vir:97 145 VTQVV--KFTESEDADANIEAAVNLIQGA-E-----GVVTGLAMDTEFSTALAKVTNGEMGPKMYPELAWGANPDSINGL 216 (303) T ss_pred ccccc--ccccccchHHHHHHHHHHHhhc-C-----CCccEEEEcHHHHHHHHHhhccCCCeEEecCccCCCCCceecce Confidence 00100 1112234678999998877542 1 235679999988887753 333232111 000000 0012 Q ss_pred EEE---EcccccCCCCceEEEEEEeeC------CCceEEEEeCchhhcccce---ecCCceEEeeecceeeeEEecccce Q lcl|Aclame:pro 263 EFV---TIPEYDTASGRLVQLWAPRVE------GKDTATCGFTEKMRAHSIE---RYSSYFRQKKSAGTWGAVIFRPFAV 330 (336) Q Consensus 263 ~i~---~~pel~~a~G~~~~~~~~~~~------~~~~~~~~~p~~~~~l~~~---~~~~~~~v~~~~rt~Gv~ir~P~ai 330 (336) .++ .+|.-.+.+.....+++-+.. ..+-.++.+.......... ...-..-+-++.|++ ..+++|.|| T Consensus 217 Pv~~s~~v~~~~~~~~~~~~~~~Gdf~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~n~~~~r~~~r~~-~~v~~p~af 295 (303) T protein:vir:97 217 KSSVNTTVGAGADEAESKDLVIIGDFESMFKWGYAKQIPMEIIKYGDPDNSGKDLKGYNQIYLRAEAYIG-WGILDAKSF 295 (303) T ss_pred eeEEecccCCccccCCCccEEEEeeccccEEEEEecCcEEEEeeccCCCCcchhhhhcCcEEEEEEEEec-cEeecccce Confidence 222 233222222111112211100 0001111111100000000 001123344555554 456779999 Q ss_pred eeeccC Q lcl|Aclame:pro 331 AQMIGV 336 (336) Q Consensus 331 ~~~~GI 336 (336) +++... T Consensus 296 ~~l~~~ 301 (303) T protein:vir:97 296 ARVTKG 301 (303) T ss_pred EEeeCC Confidence 999999 No 44 >protein:vir:9309 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:165 # MgeName: phi 11 # Cross-refs: genbank:acc:NP_803287;genbank:gi:29028597;genbank:GeneID:1258044 Probab=97.80 E-value=7.9e-06 Score=48.54 Aligned_cols=289 Identities=10% Similarity=0.028 Sum_probs=147.9 Q ss_pred CchHHHHH-HHhhcceeccchhhhhhhhhhhhhhhhhhhcCcccc-CCcchHHHHHHHhhCceeeeeeccccchhhhccc Q lcl|Aclame:pro 1 MRDAQRIQ-NLARAGVILPRSVKNVSTPLAEYAMDAADLSPHLSS-TGSSGIPNYLTTYVDPSVIDILVAPMKAAELVGE 78 (336) Q Consensus 1 m~~~~~~~-~l~~~g~~~~~~~~~~~~~~~~~~~da~d~~~~l~t-~~~~~i~~~l~~~idp~v~~~~~~~~~~~~l~~v 78 (336) |+.-+..+ ++.+ |-... ....... +...+++ .+.+.||..+.+ ++++.+........+.++ T Consensus 1 ~~~~~~~~~~~~~----f~~~~------~~~~~~~---a~~~~~~~~~~~liP~~~~~----~ii~~~~~~s~l~~l~~~ 63 (324) T protein:vir:93 1 MEQTQKLKLNLQH----FASNN------VKPQVFN---PDNVMMHEKKDGTLLNDFTT----PILQEVMENSKIMQLGKY 63 (324) T ss_pred CchhHHHHHHHHH----HHHhh------hhhhhcc---cccccccCCCcceechhHHH----HHHHHHHhhchhhhhcce Confidence 65433332 1111 10110 0000001 1111111 223345665554 334444444444555544 Q ss_pred ccCCCcceeeEEEeeeecccceEEeecccCCceeeeeeeeeeeeEEEEEEEEEeCHHHHHHHHHhCCCHHHHHHHHHHHH Q lcl|Aclame:pro 79 SKKGDWTTLVAAFITAEPTTTVATYGDYSSDGDSGTNINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASELNYSSALG 158 (336) Q Consensus 79 ~t~g~w~~~t~~~~v~e~~G~a~~ygd~~DiP~vd~~~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~K~~aAr~a 158 (336) .+.+. .++.|++.+..+.+..++.+.++|..+..........+.++..+.+|.+=++.+ ..++.+.-......+ T Consensus 64 ~~~~~---~~~~ip~~~~~~~a~~v~Eg~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~ds---~~~l~~~i~~~l~~a 137 (324) T protein:vir:93 64 EPMEG---TEKKFTFWADKPGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYT---YSQFFEEMKPMIAEA 137 (324) T ss_pred eeccC---CceEEEEEecCcceeeecCCccccccccceeEEEEEeEEEEEeehhhHHHHhcc---hHHHHHHHHHHHHHH Confidence 33222 346788888888899999999999999888888888888998888887555543 357888888888888 Q ss_pred HHHhhccEEEeecccc-ceEEEEecCCCCcccccccccccccCHHHHHHHHHHHHHHHHHHhCCceeccCCcEEEecHHH Q lcl|Aclame:pro 159 LAKFLNGSYLFGVAGL-ENYGLINDPSLSAPITATTPWSGSPAVEAVVNEVVTLFQVLQTQSQGIITQEAVLHMGLPPTA 237 (336) Q Consensus 159 ~e~~~n~i~~~Gd~~~-g~~GllN~Pnl~~~~~~~t~w~~~~T~~eI~~Di~~l~~~l~~~t~g~v~~~~p~tL~Lp~~~ 237 (336) +.+.+++-++.|+... ...|+++........+ .. ..-++||.+++..+...- ..+.+++|.++. T Consensus 138 ia~~~d~a~l~G~g~~~~~~~~~~~~~~~~~~~-------~~--~~~~~~i~~~~~~l~~~~------~~~~~~v~n~~~ 202 (324) T protein:vir:93 138 FYKKFDEAGILNQGNNPFGKSIAQSIEKTNKVI-------KG--DFTQDNIIDLEALLEDDE------LEANAFISKTQN 202 (324) T ss_pred HHHHHHHHHhcCCCCCCcCccccccccccceec-------cc--cccHHHHHHHHHhhhhcc------CCCCEEEEcHHH Confidence 8888888888888643 2244554322211111 11 113677888887775431 135689999999 Q ss_pred HHhccc-CCCCCccHHHHHHHhCCc---cEEEEcccccCCCCceEEEEEEeeCCCceEEEEeCchhhc----cc------ Q lcl|Aclame:pro 238 MSDLSK-TNQYGLSAAAKLKEIFPK---LEFVTIPEYDTASGRLVQLWAPRVEGKDTATCGFTEKMRA----HS------ 303 (336) Q Consensus 238 ~~~Ls~-~~~~~~Tvl~~l~~n~pn---l~i~~~pel~~a~G~~~~~~~~~~~~~~~~~~~~p~~~~~----l~------ 303 (336) +..|.+ .+..|.-++. ....+. +-++..+- ..++...+++-+. .. ..+.+.+.++. ++ T Consensus 203 ~~~L~~l~d~~G~~~~~--~~~~~~l~G~PVv~~~~---~~~~~~~i~~gdf--s~-~~~~~~~~~~i~~~~~~~~~~~~ 274 (324) T protein:vir:93 203 RSLLRKIVDPETKERIY--DRNSDSLDGLPVVNLKS---SNLKRGELITGDF--DK-LIYGIPQLIEYKIDETAQLSTVK 274 (324) T ss_pred HHHHHHhhCCCCCeeec--CCCCCcccceeeEeecC---CCCCcceEEEEec--ce-EEEEEecCcEEEEeecccccccc Confidence 999864 3333332210 011111 22222111 1122111221110 00 11111111100 00 Q ss_pred --c-e----ecCCceEEeeecceeeeEEecccceeeeccC Q lcl|Aclame:pro 304 --I-E----RYSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) Q Consensus 304 --~-~----~~~~~~~v~~~~rt~Gv~ir~P~ai~~~~GI 336 (336) - . ...-...+-+..|. |+.+.+|.||+++.+. T Consensus 275 ~~~~~~~~~f~~n~~~~r~~~r~-d~~v~~~~a~~~l~~a 313 (324) T protein:vir:93 275 NEDGTPVNLFEQDMVALRATMHV-ALHIADDKAFAKLVPA 313 (324) T ss_pred cccccchhhhhcCcEEEEEEEEe-ccEEecccceEEEecc Confidence 0 0 01123445556666 5557779999999998 No 45 >protein:vir:99749 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1497 # MgeName: phiETA2 # Cross-refs: genbank:acc:YP_001004307;genbank:gi:122891761;genbank:GeneID:4712304 Probab=97.78 E-value=9e-06 Score=48.26 Aligned_cols=289 Identities=10% Similarity=0.014 Sum_probs=151.0 Q ss_pred CchHHHHH-HHhhcceeccchhhhhhhhhhhhhhhhhhhcCcccc-CCcchHHHHHHHhhCceeeeeeccccchhhhccc Q lcl|Aclame:pro 1 MRDAQRIQ-NLARAGVILPRSVKNVSTPLAEYAMDAADLSPHLSS-TGSSGIPNYLTTYVDPSVIDILVAPMKAAELVGE 78 (336) Q Consensus 1 m~~~~~~~-~l~~~g~~~~~~~~~~~~~~~~~~~da~d~~~~l~t-~~~~~i~~~l~~~idp~v~~~~~~~~~~~~l~~v 78 (336) |..-+.++ ++.++-- .. .+.-..++ ...+++ ...+.+|..+.+ ++++.+...-....++.+ T Consensus 1 ~~k~~~~~~~~~~~~~----~~------~~~~~~~a---~~~~~~~~~~~lip~~~~~----~ii~~~~~~s~l~~~~~~ 63 (324) T protein:vir:99 1 MEQTQKLKLNLQHFAS----NN------VKPQVFNP---DNVMMHEKKDGTLLNDFTT----PILQEVMENSKIMRLGKY 63 (324) T ss_pred CCCchHhhHHHHHHHH----Hh------hhhhhccc---cceeccCCCcceechhHHH----HHHHHHHhhchhhhhcce Confidence 66554444 3333111 00 00000111 111111 122345554433 334444444445555554 Q ss_pred ccCCCcceeeEEEeeeecccceEEeecccCCceeeeeeeeeeeeEEEEEEEEEeCHHHHHHHHHhCCCHHHHHHHHHHHH Q lcl|Aclame:pro 79 SKKGDWTTLVAAFITAEPTTTVATYGDYSSDGDSGTNINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASELNYSSALG 158 (336) Q Consensus 79 ~t~g~w~~~t~~~~v~e~~G~a~~ygd~~DiP~vd~~~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~K~~aAr~a 158 (336) .+.+. .+..|++.+..+.+...+....+|..+.......-..+.++..+.+|.+-++.+. .++.+.-.....++ T Consensus 64 ~~~~~---~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~---~~l~~~i~~~l~~a 137 (324) T protein:vir:99 64 EPMEG---TEKKFTFWADKPGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTY---SQFFEEMKPMIAEA 137 (324) T ss_pred eeccC---CceEEEEEecCcceeEeccCccccccccceeEEEEeeEEEEEeehhhHHHHhcch---HHHHHHHHHHHHHH Confidence 43322 3467888888888999999999999999999999999999999999986666543 56888888888888 Q ss_pred HHHhhccEEEeeccccc-eEEEEecCCCCcccccccccccccCHHHHHHHHHHHHHHHHHHhCCceeccCCcEEEecHHH Q lcl|Aclame:pro 159 LAKFLNGSYLFGVAGLE-NYGLINDPSLSAPITATTPWSGSPAVEAVVNEVVTLFQVLQTQSQGIITQEAVLHMGLPPTA 237 (336) Q Consensus 159 ~e~~~n~i~~~Gd~~~g-~~GllN~Pnl~~~~~~~t~w~~~~T~~eI~~Di~~l~~~l~~~t~g~v~~~~p~tL~Lp~~~ 237 (336) +.+.+++-.++|+.... ..|+++..... + ..... ..-++||.+++..+... + ..+..++|.++. T Consensus 138 i~~~~d~~~l~G~g~~~~~~~~~~~~~~~------~-~~~~~--~~~~~~i~~~~~~l~~~--~----~~~~~~v~n~~~ 202 (324) T protein:vir:99 138 FYKKFDEAGILNQGNNPFGKSIAQSIEKT------N-KVIKG--DFTQDNIIDLEALLEDD--E----LEANAFISKTQN 202 (324) T ss_pred HHHHHHHHhhhcCCCCccCcccccccccc------c-eeccc--cCCHHHHHHHHHhhhhc--c----CCCCEEEEcHHH Confidence 88888888888886542 23444422211 1 11111 11367777787776432 1 245689999999 Q ss_pred HHhcccC-CCCCccHHHHHHHhCCc---cEEEEcccccCCCCceEEEEEEeeCCCceEEEEeCchhhc--------ccc- Q lcl|Aclame:pro 238 MSDLSKT-NQYGLSAAAKLKEIFPK---LEFVTIPEYDTASGRLVQLWAPRVEGKDTATCGFTEKMRA--------HSI- 304 (336) Q Consensus 238 ~~~Ls~~-~~~~~Tvl~~l~~n~pn---l~i~~~pel~~a~G~~~~~~~~~~~~~~~~~~~~p~~~~~--------l~~- 304 (336) +..|.+- +..|..++ .-..... +.++..+-. ..+...+++-+. ... .+.+.+.++. ... T Consensus 203 ~~~L~~l~d~~g~~~~--~~~~~~~l~G~PVv~~~~~---~~~~~~~i~gd~--~~~-~~~~~~~~~i~~~~~~~~~~~~ 274 (324) T protein:vir:99 203 RSLLRKIVDPETKERI--YDRNSDTLDGLPVVNLKSS---NLKRGELITGDF--DKL-IYGIPQLIEYKIDETAQLSTVK 274 (324) T ss_pred HHHHHHhhcCCCceee--cCCCCccccceeEEeecCC---CCCcceEEEEec--ccE-EEEEecCcEEEEeecccccccc Confidence 9988642 33332221 1111111 222222211 122212222111 011 1111111100 000 Q ss_pred e--------ecCCceEEeeecceeeeEEecccceeeeccC Q lcl|Aclame:pro 305 E--------RYSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) Q Consensus 305 ~--------~~~~~~~v~~~~rt~Gv~ir~P~ai~~~~GI 336 (336) . ...-...+-+..|.++. +.+|.||+.+.|. T Consensus 275 ~~~~~~~~~f~~~~~~~r~~~r~d~~-v~~~~a~~~lt~a 313 (324) T protein:vir:99 275 NEDGTPVNLFEQDMVALRATMHVALH-IADDKAFAKLVPA 313 (324) T ss_pred cccccchhhhhcCcEEEEEEEEEccE-EecccceEEEEec Confidence 0 11123555666677555 4569999999999 No 46 >protein:vir:96223 Length: 324 # NCBI annotation: ORF011 # Family: family:all:507 # MgeID: mge:1607 # MgeName: 69 # Cross-refs: genbank:acc:YP_239571;genbank:gi:66395304;genbank:GeneID:5132771 Probab=97.73 E-value=9.3e-06 Score=48.18 Aligned_cols=294 Identities=10% Similarity=0.025 Sum_probs=150.0 Q ss_pred CchHHHHH-HHhhcceeccchhhhhhhhhhhhhhhhhhhcCcc-ccCCcchHHHHHHHhhCceeeeeeccccchhhhccc Q lcl|Aclame:pro 1 MRDAQRIQ-NLARAGVILPRSVKNVSTPLAEYAMDAADLSPHL-SSTGSSGIPNYLTTYVDPSVIDILVAPMKAAELVGE 78 (336) Q Consensus 1 m~~~~~~~-~l~~~g~~~~~~~~~~~~~~~~~~~da~d~~~~l-~t~~~~~i~~~l~~~idp~v~~~~~~~~~~~~l~~v 78 (336) |+.-+.++ ++.++- .. .......++ ...+ .+...+.+|..+.+ +|++..-.......++++ T Consensus 1 ~~~~~~~~~~~~~f~----~~------~~~~~~~~a---~~~~~~~~~~~lip~~~~~----~ii~~~~~~s~l~~l~~~ 63 (324) T protein:vir:96 1 MEQTQKLKLNLQHFA----SN------NVKPQVFNP---DNVMMHEKKDGTLLNDFTT----PILQEVMENSKIMQLGKY 63 (324) T ss_pred CCcchhhhHHHHHHH----Hh------hhhhhhccc---ccccccCCCcceechhHHH----HHHHHHHhhchhhhhcce Confidence 66555444 233210 00 000000111 1111 12233345665554 334444444445555555 Q ss_pred ccCCCcceeeEEEeeeecccceEEeecccCCceeeeeeeeeeeeEEEEEEEEEeCHHHHHHHHHhCCCHHHHHHHHHHHH Q lcl|Aclame:pro 79 SKKGDWTTLVAAFITAEPTTTVATYGDYSSDGDSGTNINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASELNYSSALG 158 (336) Q Consensus 79 ~t~g~w~~~t~~~~v~e~~G~a~~ygd~~DiP~vd~~~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~K~~aAr~a 158 (336) .+.+. .++.|++.+..+.+..++....+|..+..........+.++..+.+|.+=++.+ ..++.+.-......+ T Consensus 64 ~~~~~---~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~f~~v~~~~~k~~~~~~is~ell~ds---~~~l~~~i~~~l~~a 137 (324) T protein:vir:96 64 EPMEG---TEKKFTFWADKPGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYT---YSQFFEEMKPMIAEA 137 (324) T ss_pred eeccC---CceEEEEEecCcceeeecCCccccccccceeEEEEEeEEEEEeehhhHHHHhcc---hHHHHHHHHHHHHHH Confidence 44332 346788888888899999999999999998888999999998888887655543 367888888888899 Q ss_pred HHHhhccEEEeeccccce-EEEEecCCCCcccccccccccccCHHHHHHHHHHHHHHHHHHhCCceeccCCcEEEecHHH Q lcl|Aclame:pro 159 LAKFLNGSYLFGVAGLEN-YGLINDPSLSAPITATTPWSGSPAVEAVVNEVVTLFQVLQTQSQGIITQEAVLHMGLPPTA 237 (336) Q Consensus 159 ~e~~~n~i~~~Gd~~~g~-~GllN~Pnl~~~~~~~t~w~~~~T~~eI~~Di~~l~~~l~~~t~g~v~~~~p~tL~Lp~~~ 237 (336) +.+.+++.+++|+..... .|+++. +. .+..+...+ .-++||.+++..+... + ..+..++|.++. T Consensus 138 ia~~~d~~~l~G~g~~~~~~~~~~~--~~-----~~~~~~~~~--~~~~~i~~~~~~i~~~--~----~~~~~~i~n~~~ 202 (324) T protein:vir:96 138 FYKKFDEAGILNQGNNPFGKSIAQS--IK-----KTNKVIKGD--FTQDNIIDLEALLEDD--E----LEANAFISKTQN 202 (324) T ss_pred HHHHHHHHhhhcCCCCCcCcccccc--cc-----ccceecccc--cchHHHHHHHHhhhhc--c----CCCCEEEEcHHH Confidence 999999999999864322 233331 11 111111111 1256677777766432 1 246689999999 Q ss_pred HHhccc-CCCCCccHHHHHH-HhCCccEEEEcccccCCC-----CceEEEEEEeeCCCceEEEEeCchhhccc-ce---- Q lcl|Aclame:pro 238 MSDLSK-TNQYGLSAAAKLK-EIFPKLEFVTIPEYDTAS-----GRLVQLWAPRVEGKDTATCGFTEKMRAHS-IE---- 305 (336) Q Consensus 238 ~~~Ls~-~~~~~~Tvl~~l~-~n~pnl~i~~~pel~~a~-----G~~~~~~~~~~~~~~~~~~~~p~~~~~l~-~~---- 305 (336) +..|.+ .+..|..++.--. .++-++.++..+...... |....+++-.+.+ .++.+-..-.... .. T Consensus 203 ~~~L~~lkd~~G~~~~~~~~~~~l~G~PV~~~~~~~~~~~~~~~gd~s~~~~~~~~~---~~i~~~~~~~~~~~~~~~~~ 279 (324) T protein:vir:96 203 RSLLRKIVDPETKERIYDRNSDSLDGLPVVNLKSSNLKRGELITGDFDKLIYGIPQL---IEYKIDETAQLSTVKNEDGT 279 (324) T ss_pred HHHHHHhhCCCCCeeecCCCCCcccceeeEeecCCCCCcceEEEEecceEEEEEecC---cEEEEeeccccccccccccc Confidence 998864 3333432221000 001112222222111111 1111111111111 1111111000000 00 Q ss_pred ----ecCCceEEeeecceeeeEEecccceeeeccC Q lcl|Aclame:pro 306 ----RYSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) Q Consensus 306 ----~~~~~~~v~~~~rt~Gv~ir~P~ai~~~~GI 336 (336) ...-....-+..|+ |+.+++|.||+++.+- T Consensus 280 ~~~~~~~n~v~~r~~~r~-d~~v~~~~a~~~l~~a 313 (324) T protein:vir:96 280 PVNLFEQDMVALRATMHV-ALHIADDKAFAKLVPA 313 (324) T ss_pred chhhhhcCcEEEEEEEEe-ccEEecccceEEEecc Confidence 00112344556666 4556779999999988 No 47 >protein:vir:100135 Length: 418 # NCBI annotation: gp5 # Family: family:all:585 # MgeID: mge:1639 # MgeName: phi1026b # Cross-refs: genbank:acc:NP_945035;genbank:gi:38707895;genbank:GeneID:2744182 Probab=97.73 E-value=1e-05 Score=48.01 Aligned_cols=303 Identities=12% Similarity=0.057 Sum_probs=148.5 Q ss_pred CchHHHHH----HHh----hcceeccchh-h-----------------hhhh-----hhhhhhhhhhhhcCccccCCcch Q lcl|Aclame:pro 1 MRDAQRIQ----NLA----RAGVILPRSV-K-----------------NVST-----PLAEYAMDAADLSPHLSSTGSSG 49 (336) Q Consensus 1 m~~~~~~~----~l~----~~g~~~~~~~-~-----------------~~~~-----~~~~~~~da~d~~~~l~t~~~~~ 49 (336) +.+..++. +++ +.+-.-.... . ++.. ...............-.+..... T Consensus 67 ~~~~~~l~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~l 146 (418) T protein:vir:10 67 LIKQGELQARLLEAEQKLARGGGSAELETPKTLGQLVTESEEMKGMDGSARKSVRVRVDRKSIMNVPATVGSGVSGSNSL 146 (418) T ss_pred HHHHHHHHHHHHHHHHHHhhcccccccchhhhhhHHhhhHHHHHHHHHHHhhhhhhhhHHHHHHHhhhhccCCCCCCccc Confidence 11111111 000 1000000000 0 0000 00000000001111101112223 Q ss_pred HHHHHHHhhCceeeeeeccccchhhhcccccCCCcceeeEEEeeeec-ccceEEeecccCCceeeeeeeeeeeeEEEEEE Q lcl|Aclame:pro 50 IPNYLTTYVDPSVIDILVAPMKAAELVGESKKGDWTTLVAAFITAEP-TTTVATYGDYSSDGDSGTNINYPQRQSYFFQT 128 (336) Q Consensus 50 i~~~l~~~idp~v~~~~~~~~~~~~l~~v~t~g~w~~~t~~~~v~e~-~G~a~~ygd~~DiP~vd~~~~~~~~~v~~~~~ 128 (336) +|..+. ++|++.+.......+++++...+. .++.+..... .+.+...+.+..+|..+..........+.++. T Consensus 147 vp~~~~----~~ii~~~~~~~~l~~~~~~~~~~~---~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~f~~v~~~~~k~~~ 219 (418) T protein:vir:10 147 VVADRQ----AGIIAPPQRKMTIRDLLMPGQTSS---SSIEYTVETGFTNNAAAVAEGAQKPTSDLKFNLKNQPVRTIAH 219 (418) T ss_pred cchhHH----HHHHHHHhhhhhHHhhcceeeccC---CceeEEEEecCCCceeeeccCccccccccceeeEEEeeeeEEE Confidence 455444 355666666667777776654432 2345555444 34566778888899999888888999999999 Q ss_pred EEEeCHHHHHHHHHhCCCHHHHHHHHHHHHHHHhhccEEEeecccc-ceEEEEecCCCCcccccccccccccCHHHHHHH Q lcl|Aclame:pro 129 WTRWGERELEMAGAGRVDLASELNYSSALGLAKFLNGSYLFGVAGL-ENYGLINDPSLSAPITATTPWSGSPAVEAVVNE 207 (336) Q Consensus 129 ~~~y~~~El~~A~~~g~~l~~~K~~aAr~a~e~~~n~i~~~Gd~~~-g~~GllN~Pnl~~~~~~~t~w~~~~T~~eI~~D 207 (336) .+.+|.+=+..+ .++.+--......++.+.+++-.++|++.. ...|++|...........+ ...-++| T Consensus 220 ~~~is~ell~ds----~~l~~~i~~~l~~a~~~~~d~a~l~G~g~~~~p~Gi~~~~~~~~~~~~~~-------~~~~~~~ 288 (418) T protein:vir:10 220 LFKASRQILDDA----PALQSYIDGRARYGLQLTEEGQILKGDGTGANILGILPQASAFMPSITLA-------NATPIDK 288 (418) T ss_pred eehhhHHHHHhH----HHHHHHHHHHHHHHHHHHHHHHHhccCCCCcccccccccccccccccccc-------ccccHHH Confidence 999987644332 267888888888889999999999998643 4789999765443222111 1123566 Q ss_pred HHHHHHHHHHHhCCceeccCCcEEEecHHHHHhccc-CCCCCccHHHHHHHh----CCccEEEEcccccCC---CCce-- Q lcl|Aclame:pro 208 VVTLFQVLQTQSQGIITQEAVLHMGLPPTAMSDLSK-TNQYGLSAAAKLKEI----FPKLEFVTIPEYDTA---SGRL-- 277 (336) Q Consensus 208 i~~l~~~l~~~t~g~v~~~~p~tL~Lp~~~~~~Ls~-~~~~~~Tvl~~l~~n----~pnl~i~~~pel~~a---~G~~-- 277 (336) |..++..+... + -.+..++|.+..+..|.+ .+..|.-++.=.... +-++.++..+.+... -|.- T Consensus 289 i~~~~~~~~~~-~-----~~~~~~v~n~~~~~~L~~lkd~~G~~i~~~~~~~~~~~l~G~pV~~~~~~p~~~~~~gd~s~ 362 (418) T protein:vir:10 289 IRLALLQAVLA-E-----FPATGIVLNPIDWASIELTKDSQGRYIVGNPVNGTTPRLWNLPVVETQAMTANEFLVGAFSM 362 (418) T ss_pred HHHHHHhhccc-c-----CCCCEEEEcHHHHHHHHHhhcCCCceeccccccCCCceecceeeEEcCCCCCCcEEEeeccc Confidence 77666655321 1 135579999999988854 343344333211110 111222222222110 1211 Q ss_pred EEEEEEeeCCCceEEEEeCchhhcccc-eecCCceEEeeecceeeeEEecccceeeeccC Q lcl|Aclame:pro 278 VQLWAPRVEGKDTATCGFTEKMRAHSI-ERYSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) Q Consensus 278 ~~~~~~~~~~~~~~~~~~p~~~~~l~~-~~~~~~~~v~~~~rt~Gv~ir~P~ai~~~~GI 336 (336) .+.+++. .+ ..+.+ -.+.. ....-....-+..|.+| .++.|.||++++.. T Consensus 363 ~~~~~~~-~~---~~i~~----~~~~~~~f~~~~~~~r~~~~~d~-~~~~~~a~~~~~~~ 413 (418) T protein:vir:10 363 AAQIFDR-ME---IEVLL----STENVDDFEKNMVSIRAEERLAL-AVYRPESFVTGALV 413 (418) T ss_pred eEEEEEe-cc---eEEEE----ecccchhhhcCceEEEEEEeecc-EEecccceEEEEec Confidence 1112111 11 11111 00000 00112234445566665 58899999999988 No 48 >protein:vir:104256 Length: 458 # NCBI annotation: major head protein precursor # Family: family:all:27070 # MgeID: mge:1504 # MgeName: T5 # Cross-refs: genbank:acc:YP_006977;genbank:gi:46401878;genbank:GeneID:2777673 Probab=97.63 E-value=2e-05 Score=46.33 Aligned_cols=314 Identities=9% Similarity=-0.018 Sum_probs=138.4 Q ss_pred CchHHHHH-HHhhcceec-------cchhhh-------------------hhhhhhhhhhhhhhhcCccccCCcchHHHH Q lcl|Aclame:pro 1 MRDAQRIQ-NLARAGVIL-------PRSVKN-------------------VSTPLAEYAMDAADLSPHLSSTGSSGIPNY 53 (336) Q Consensus 1 m~~~~~~~-~l~~~g~~~-------~~~~~~-------------------~~~~~~~~~~da~d~~~~l~t~~~~~i~~~ 53 (336) ++...... .+++..... ...... ..........-+...+ ...+.....+|.. T Consensus 99 ~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~a~~~~-~~~~~g~~~ip~~ 177 (458) T protein:vir:10 99 QDEIKSLLTAREGRSFVGDSVAKALYGTQENFEDEVEKLVLLSYVMEKGVFETEHGQRHLKAVNQS-SSVEVSSESYETI 177 (458) T ss_pred HHHHHHHHHHHHhhhhhhhhhhccchhhhhhHHHHHHHHHHHHHHHhhccchhhhhhhhhhhhhhc-ccCccccceehhh Confidence 11100000 000000000 000000 0000011111111111 1111223345544 Q ss_pred HHHhhCceeeeeeccccchhhhcccccCCCcceeeEEEeeeecccceEEeecccCCce------eeeeeeeeeeeEEEEE Q lcl|Aclame:pro 54 LTTYVDPSVIDILVAPMKAAELVGESKKGDWTTLVAAFITAEPTTTVATYGDYSSDGD------SGTNINYPQRQSYFFQ 127 (336) Q Consensus 54 l~~~idp~v~~~~~~~~~~~~l~~v~t~g~w~~~t~~~~v~e~~G~a~~ygd~~DiP~------vd~~~~~~~~~v~~~~ 127 (336) +. +.|++.+.+......++.+...+. ....|.+....+.+...+.+...|- .+..........+.++ T Consensus 178 ~~----~~ii~~~~~~~~l~~~~~~~~~~~---~~~~~~~~~~~~~a~~v~e~~~~~~~~~~~~~~~~~~~i~~~~~k~~ 250 (458) T protein:vir:10 178 FS----QRIIRDLQKELVVGALFEELPMSS---KILTMLVEPDAGKATWVAASTYGTDTTTGEEVKGALKEIHFSTYKLA 250 (458) T ss_pred Hh----HHHHHHHHhhhhHHhhcceeecCC---cceEEEEecCCcceeecccccccccccccccccccceeeEeeeeeEE Confidence 44 344555544445555555433221 2245555455566666666544442 3334555566677788 Q ss_pred EEEEeCHHHHHHHHHhCCCHHHHHHHHHHHHHHHhhccEEEeeccccceEEEEecCCCCcccccccccccccCHHHHHHH Q lcl|Aclame:pro 128 TWTRWGERELEMAGAGRVDLASELNYSSALGLAKFLNGSYLFGVAGLENYGLINDPSLSAPITATTPWSGSPAVEAVVNE 207 (336) Q Consensus 128 ~~~~y~~~El~~A~~~g~~l~~~K~~aAr~a~e~~~n~i~~~Gd~~~g~~GllN~Pnl~~~~~~~t~w~~~~T~~eI~~D 207 (336) ..+.+|.+=+.-+ ..++.+.-......++.+.++.-+++|++.....|++|++......++...-.+..+ .--++| T Consensus 251 ~~v~is~ell~ds---~~~~~~~i~~~l~~~i~~~~d~~~l~G~G~~~p~Gi~~~~~~~~~~~~~~~~~~~~~-~~~~~~ 326 (458) T protein:vir:10 251 AKSFITDETEEDA---IFSLLPLLRKRLIEAHAVSIEEAFMTGDGSGKPKGLLTLASEDSAKVVTEAKADGSV-LVTAKT 326 (458) T ss_pred eeehhhHHHHhcc---hHHHHHHHHHHHHHHHHHHHHHHhhcCCCCCccceeeecccccccceeecccccccc-cccHHH Confidence 8888886644333 356888888888889999999999999988888999999876543222211111111 112566 Q ss_pred HHHHHHHHHHHhCCceeccCCcEEEecHHHHHhccc-CCCCCccHHHH-HHHh--------CCccEEEEcccccCCCCce Q lcl|Aclame:pro 208 VVTLFQVLQTQSQGIITQEAVLHMGLPPTAMSDLSK-TNQYGLSAAAK-LKEI--------FPKLEFVTIPEYDTASGRL 277 (336) Q Consensus 208 i~~l~~~l~~~t~g~v~~~~p~tL~Lp~~~~~~Ls~-~~~~~~Tvl~~-l~~n--------~pnl~i~~~pel~~a~G~~ 277 (336) |.+++..+...- -.+..++|.+..+..|.. .+..|.-++.. +... +-.+.|+....+...++.. T Consensus 327 i~~~~~~l~~~~------~~~~~~v~~~~~~~~l~~lkd~~G~~i~~~~~~~~~~~~~~~~l~G~pv~~~~~~p~~~~~~ 400 (458) T protein:vir:10 327 ISKLRRKLGRHG------LKLSKLVLIVSMDAYYDLLEDEEWQDVAQVGNDSVKLQGQVGRIYGLPVVVSEYFPAKANSA 400 (458) T ss_pred HHHHHHhhhhhh------cCCCEEEEcHHHHHHHHhhcccCCceeeccccccccccCcCceecceeeEEccccccccCCc Confidence 777776663321 134579999999988854 23323222211 1100 0112232222221111111 Q ss_pred EEEEEEeeCCCceEEEEeCchhhccc-ceecCCceEEeeecceeeeEEecccceeeeccC Q lcl|Aclame:pro 278 VQLWAPRVEGKDTATCGFTEKMRAHS-IERYSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) Q Consensus 278 ~~~~~~~~~~~~~~~~~~p~~~~~l~-~~~~~~~~~v~~~~rt~Gv~ir~P~ai~~~~GI 336 (336) ...+....+ -..+.--..++..- .....-...+=...|. |..+++|.+|+..+== T Consensus 401 ~~~~~~f~~---~~~~~~~~~~~v~~d~~~~~~~~~~~~~~r~-~~~v~~~~a~v~~~~a 456 (458) T protein:vir:10 401 EFAVIVYKD---NFVMPRQRAVTVERERQAGKQRDAYYVTQRV-NLQRYFANGVVSGTYA 456 (458) T ss_pred ceEEEEecc---cEEEEEeeceEEEeecccCCCceEEEEEEEe-cceEecccceEEEeec Confidence 111211100 00000000011000 0001112334445564 6788899988772111 No 49 >protein:vir:80684 Length: 315 # NCBI annotation: gp6 # Family: family:all:966 # MgeID: mge:1884 # MgeName: PA6 # Cross-refs: genbank:acc:YP_001285582;genbank:gi:148727088;genbank:GeneID:5247055 Probab=97.52 E-value=8.4e-06 Score=48.41 Aligned_cols=275 Identities=12% Similarity=-0.007 Sum_probs=143.3 Q ss_pred hhhhhhhhcCccccCCcchHHHHHHHhhCceeeeeeccccchhhhcccccCCCcceeeEEEeeeecccceEEeecccCCc Q lcl|Aclame:pro 31 YAMDAADLSPHLSSTGSSGIPNYLTTYVDPSVIDILVAPMKAAELVGESKKGDWTTLVAAFITAEPTTTVATYGDYSSDG 110 (336) Q Consensus 31 ~~~da~d~~~~l~t~~~~~i~~~l~~~idp~v~~~~~~~~~~~~l~~v~t~g~w~~~t~~~~v~e~~G~a~~ygd~~DiP 110 (336) |+..+...++ ..+|..+.+ +|++.+...-..+.+..+...+ .....+++....+.|.++|....+| T Consensus 1 Ma~~~~~~gg-------~~vP~~~~~----~ii~~l~~~s~i~~l~~~i~~~---~~~~~ip~~~~~~~a~wv~Eg~~~~ 66 (315) T protein:vir:80 1 MADDFLSAGK-------LELPGSMIG----AVRDRAIDSGVLAKLSPEQPTI---FGPVKGAVFSGVPRAKIVGEGEVKP 66 (315) T ss_pred CCCCcCCcCc-------eEcchHHHH----HHHHHHHhhchhhhhcceeecC---CCceEEEEEeCCcceEEeeCCcccc Confidence 5544432222 235666653 3344444443344444333222 2346788888888999999999999 Q ss_pred eeeeeeeeeeeeEEEEEEEEEeCHHHHHHHHHhC-CCHHHHHHHHHHHHHHHhhccEEEeecccc---ceEEEEecCCCC Q lcl|Aclame:pro 111 DSGTNINYPQRQSYFFQTWTRWGERELEMAGAGR-VDLASELNYSSALGLAKFLNGSYLFGVAGL---ENYGLINDPSLS 186 (336) Q Consensus 111 ~vd~~~~~~~~~v~~~~~~~~y~~~El~~A~~~g-~~l~~~K~~aAr~a~e~~~n~i~~~Gd~~~---g~~GllN~Pnl~ 186 (336) ..+...+...-..+.++....+|.+=++.....- -.|.+.-....++++.+.++.-.++|+... +..|+.+. + T Consensus 67 ~s~~~f~~v~l~~~kl~~~~~iS~ell~~s~~~~~~~l~~~i~~~la~ai~~~~d~a~~~G~~~~~~~~~~~~~~~--~- 143 (315) T protein:vir:80 67 SASVDVSAFTAQPIKVVTQQRVSDEFMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATGKAASAVHTS--L- 143 (315) T ss_pred ccccceeeeEeeeeeEEeeehhhHHHhhcCchhHHHHHHHHHHHHHHHHHHHHHhhheeeccCCCCCccccccccc--c- Confidence 9998888888888888888887765443322222 226677777888899999999999997532 23333331 1 Q ss_pred cccccccccccccCHHHHHHHHHHHHHHHHHHhCCceeccCCcEEEecHHHHHhcccC-CC-----CCccHHHHHHHhCC Q lcl|Aclame:pro 187 APITATTPWSGSPAVEAVVNEVVTLFQVLQTQSQGIITQEAVLHMGLPPTAMSDLSKT-NQ-----YGLSAAAKLKEIFP 260 (336) Q Consensus 187 ~~~~~~t~w~~~~T~~eI~~Di~~l~~~l~~~t~g~v~~~~p~tL~Lp~~~~~~Ls~~-~~-----~~~Tvl~~l~~n~p 260 (336) ...+. ........++||.+++..+..... ..+...+|-+..+..|.+- +. .+..++.=+...-| T Consensus 144 ---~~~~~--~~~~~~~~~~d~~~~~~~~~~~~~-----~~~~~~imn~~~~~~L~~l~~~~g~~~~g~~~~~~~~~g~~ 213 (315) T protein:vir:80 144 ---NKTKN--IVDATDSATADLVKAVGLIAGAGL-----QVPNGVALDPAFSFALSTEVYPKGSPLAGQPMYPAAGFAGL 213 (315) T ss_pred ---ccccc--eeeccccchHHHHHHHHHHhhccC-----ccceEEEEcHHHHHHHHHHhhccCCcccccccccccccCCC Confidence 11111 111224467888888877643321 2345699999888887532 11 11111110111101 Q ss_pred ----ccEEE---EcccccCC-CCceEEEEEEeeCCCceEEEEeCchh--hcccc-----e----ecCCceEEeeecceee Q lcl|Aclame:pro 261 ----KLEFV---TIPEYDTA-SGRLVQLWAPRVEGKDTATCGFTEKM--RAHSI-----E----RYSSYFRQKKSAGTWG 321 (336) Q Consensus 261 ----nl~i~---~~pel~~a-~G~~~~~~~~~~~~~~~~~~~~p~~~--~~l~~-----~----~~~~~~~v~~~~rt~G 321 (336) ++.++ .+|..... .++...+++-+. .+ ..+.+.+.+ ..++- . .+.-...+-|+.|+ | T Consensus 214 ~tl~G~PV~~~~~~~~~~~~~~~~~~~~~~GDf--s~-~~~g~~~~~~i~i~~~~~~~~~~~~~~~~~~v~~r~~~r~-~ 289 (315) T protein:vir:80 214 DNWRGLNVGASSTVSGAPEMSPASGVKAIVGDF--SR-VHWGFQRNFPIELIEYGDPDQTGRDLKGHNEVMVRAEAVL-Y 289 (315) T ss_pred ceecceeeEecCcCCcccccccccccEEEEeec--cc-EEEEEecCeeEEEeccccccCcccchhhcCcEEEEEEEEe-c Confidence 11122 23322222 122223332111 11 111111111 11100 0 11112455666665 5 Q ss_pred eEEecccceeeeccC Q lcl|Aclame:pro 322 AVIFRPFAVAQMIGV 336 (336) Q Consensus 322 v~ir~P~ai~~~~GI 336 (336) ..|++|.||+++.+. T Consensus 290 ~~v~~~~a~~~l~~~ 304 (315) T protein:vir:80 290 VAIESLDSFAVVKEK 304 (315) T ss_pred ceeecccceEEEeec Confidence 567899999999998 No 50 >protein:vir:101650 Length: 497 # NCBI annotation: gp13 # Family: family:all:585 # MgeID: mge:1515 # MgeName: 244 # Cross-refs: genbank:acc:YP_654768;genbank:gi:109302766;genbank:GeneID:4156084 Probab=97.51 E-value=8.1e-06 Score=48.50 Aligned_cols=310 Identities=15% Similarity=0.104 Sum_probs=155.2 Q ss_pred Cch----HHHHHHHhhcceec-----cchhhhhhhhhhh------hhhhhhhhcCccccCCcch--HHHHHHHhhCceee Q lcl|Aclame:pro 1 MRD----AQRIQNLARAGVIL-----PRSVKNVSTPLAE------YAMDAADLSPHLSSTGSSG--IPNYLTTYVDPSVI 63 (336) Q Consensus 1 m~~----~~~~~~l~~~g~~~-----~~~~~~~~~~~~~------~~~da~d~~~~l~t~~~~~--i~~~l~~~idp~v~ 63 (336) +++ ........+.+... .........+... .+....-.. ...+++.+| +|..+. ++|+ T Consensus 98 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~gg~~vp~~~~----~~ii 172 (497) T protein:vir:10 98 MNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQN-PFGSTGTFAPGILPTFL----PGIV 172 (497) T ss_pred hhHHHHhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHhhhhhhHHHHHhh-hcccCcccccccchhhh----HHHH Confidence 000 00000000000000 0000000000000 000000000 112222222 343333 4667 Q ss_pred eeeccccchhhhcccccCCCcceeeEEEeeeec-ccceEEeecccCCceeeeeeeeeeeeEEEEEEEEEeCHHHHHHHHH Q lcl|Aclame:pro 64 DILVAPMKAAELVGESKKGDWTTLVAAFITAEP-TTTVATYGDYSSDGDSGTNINYPQRQSYFFQTWTRWGERELEMAGA 142 (336) Q Consensus 64 ~~~~~~~~~~~l~~v~t~g~w~~~t~~~~v~e~-~G~a~~ygd~~DiP~vd~~~~~~~~~v~~~~~~~~y~~~El~~A~~ 142 (336) +.+.+......++++.+.+. ..+.|+.... .+.+.+++.+..+|..+..........+.++..+.+|.+=|+-+ T Consensus 173 ~~~~~~~~i~~l~~~~~~~~---~~~~~~~~~~~~~~a~wv~E~~~~~~s~~~f~~i~~~~~k~a~~~~iS~ell~d~-- 247 (497) T protein:vir:10 173 EQLFYELSLADLISSRPVTS---PNLSYLTESAAHNNAAAVAEAGTYPFSSEEFARVYEQVGKVANALTITDEGLRDA-- 247 (497) T ss_pred HHHHhhhhHHhhccccccCC---CceEEEEEcCCCCcceeeccCcccccccccceeeEeeeeeeEeecHhHHHHHHhH-- Confidence 77777777788877654433 2356665433 45788889999999999988999999999999888886533332 Q ss_pred hCCCHHHHHHHHHHHHHHHhhccEEEeeccccceEEEEecCCCCccccccccc--------------------cc----- Q lcl|Aclame:pro 143 GRVDLASELNYSSALGLAKFLNGSYLFGVAGLENYGLINDPSLSAPITATTPW--------------------SG----- 197 (336) Q Consensus 143 ~g~~l~~~K~~aAr~a~e~~~n~i~~~Gd~~~g~~GllN~Pnl~~~~~~~t~w--------------------~~----- 197 (336) . .|.+--....++++.+.++.-.++|++..+..|++|++.........+.+ +. T Consensus 248 ~--~l~~~i~~~l~~~i~~~~d~~~l~G~G~~~p~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 325 (497) T protein:vir:10 248 P--ELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTNGAFVGQDTV 325 (497) T ss_pred H--HHHHHHHHHHHHHHHHHHHHHhhcCCCcccccccccccccccccccccchhhhhhhhhhhhhhcccccchhhhhhHH Confidence 2 47888888889999999999999999988899999988654322111100 00 Q ss_pred ----------------------ccCHHHHHHHHHHHHHHHHHHhCCceeccCCcEEEecHHHHHhccc-CCCCCccHHHH Q lcl|Aclame:pro 198 ----------------------SPAVEAVVNEVVTLFQVLQTQSQGIITQEAVLHMGLPPTAMSDLSK-TNQYGLSAAAK 254 (336) Q Consensus 198 ----------------------~~T~~eI~~Di~~l~~~l~~~t~g~v~~~~p~tL~Lp~~~~~~Ls~-~~~~~~Tvl~~ 254 (336) ..+...++.++..++..+..... ..|+.++|.+..+..|.+ .+..|.-++.- T Consensus 326 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~vmn~~~~~~l~~lkd~~G~~i~~~ 400 (497) T protein:vir:10 326 ASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLF-----QTPNAVVMNPRDWELLRLTKDANGQYMGGN 400 (497) T ss_pred HHHHHHHhhhhhhhhccchhccccchhhhhhHHHHHHhhhhhhcc-----cCCCeEEEchHHHHHHHHhhcCCCceeccC Confidence 00223344555555555544321 246678888888877753 23334322210 Q ss_pred ---------HHHh--CCccEEEEcccccCCC----Cc---eEEEEEEeeCCCceEEEEeCchhhcccceecCCceEEeee Q lcl|Aclame:pro 255 ---------LKEI--FPKLEFVTIPEYDTAS----GR---LVQLWAPRVEGKDTATCGFTEKMRAHSIERYSSYFRQKKS 316 (336) Q Consensus 255 ---------l~~n--~pnl~i~~~pel~~a~----G~---~~~~~~~~~~~~~~~~~~~p~~~~~l~~~~~~~~~~v~~~ 316 (336) .... .-+..++..+... ++ |. ..+.++++ .+ .++.+... .......-.+.+-++ T Consensus 401 ~~~~~~~~~~~~~~~l~G~pV~~t~~~~-~~~~~~Gd~~~~~~~i~~r-~~---~~v~~~~~---~~~~f~~n~v~~r~~ 472 (497) T protein:vir:10 401 FFGNAYGNPVNGGKNIWGVPVVTTPLIP-LGTILVGHFAPSVIQTARR-EG---VTMQMTNS---NGTDFVDGKVTVRAE 472 (497) T ss_pred cccccccccccCCceeeceeeEecCCCC-CCceEEeecccceEEEEEe-cc---cEEEeecc---cchhhhcCcEEEEEE Confidence 0000 0011222222221 11 11 11122222 11 11111110 011111234566777 Q ss_pred cceeeeEEecccceeeeccC Q lcl|Aclame:pro 317 AGTWGAVIFRPFAVAQMIGV 336 (336) Q Consensus 317 ~rt~Gv~ir~P~ai~~~~GI 336 (336) .|++| .|++|.||++++-. T Consensus 473 ~r~~~-~v~~p~A~~~l~~~ 491 (497) T protein:vir:10 473 ERLGL-LVYRPSAFQLIQLK 491 (497) T ss_pred Eeecc-eeeccccEEEEEec Confidence 78877 67899999999888 No 51 >protein:vir:7855 Length: 497 # NCBI annotation: gp12 # Family: family:all:585 # MgeID: mge:150 # MgeName: CJW1 # Cross-refs: genbank:acc:NP_817462;genbank:gi:29565891;genbank:GeneID:1259081 Probab=97.51 E-value=8.1e-06 Score=48.50 Aligned_cols=310 Identities=15% Similarity=0.104 Sum_probs=155.2 Q ss_pred Cch----HHHHHHHhhcceec-----cchhhhhhhhhhh------hhhhhhhhcCccccCCcch--HHHHHHHhhCceee Q lcl|Aclame:pro 1 MRD----AQRIQNLARAGVIL-----PRSVKNVSTPLAE------YAMDAADLSPHLSSTGSSG--IPNYLTTYVDPSVI 63 (336) Q Consensus 1 m~~----~~~~~~l~~~g~~~-----~~~~~~~~~~~~~------~~~da~d~~~~l~t~~~~~--i~~~l~~~idp~v~ 63 (336) +++ ........+.+... .........+... .+....-.. ...+++.+| +|..+. ++|+ T Consensus 98 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~gg~~vp~~~~----~~ii 172 (497) T protein:vir:78 98 MNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAAELMGAFADGETAPAAIGQN-PFGSTGTFAPGILPTFL----PGIV 172 (497) T ss_pred hhHHHHhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHhhhhhhHHHHHhh-hcccCcccccccchhhh----HHHH Confidence 000 00000000000000 0000000000000 000000000 112222222 343333 4667 Q ss_pred eeeccccchhhhcccccCCCcceeeEEEeeeec-ccceEEeecccCCceeeeeeeeeeeeEEEEEEEEEeCHHHHHHHHH Q lcl|Aclame:pro 64 DILVAPMKAAELVGESKKGDWTTLVAAFITAEP-TTTVATYGDYSSDGDSGTNINYPQRQSYFFQTWTRWGERELEMAGA 142 (336) Q Consensus 64 ~~~~~~~~~~~l~~v~t~g~w~~~t~~~~v~e~-~G~a~~ygd~~DiP~vd~~~~~~~~~v~~~~~~~~y~~~El~~A~~ 142 (336) +.+.+......++++.+.+. ..+.|+.... .+.+.+++.+..+|..+..........+.++..+.+|.+=|+-+ T Consensus 173 ~~~~~~~~i~~l~~~~~~~~---~~~~~~~~~~~~~~a~wv~E~~~~~~s~~~f~~i~~~~~k~a~~~~iS~ell~d~-- 247 (497) T protein:vir:78 173 EQLFYELSLADLISSRPVTS---PNLSYLTESAAHNNAAAVAEAGTYPFSSEEFARVYEQVGKVANALTITDEGLRDA-- 247 (497) T ss_pred HHHHhhhhHHhhccccccCC---CceEEEEEcCCCCcceeeccCcccccccccceeeEeeeeeeEeecHhHHHHHHhH-- Confidence 77777777788877654433 2356665433 45788889999999999988999999999999888886533332 Q ss_pred hCCCHHHHHHHHHHHHHHHhhccEEEeeccccceEEEEecCCCCccccccccc--------------------cc----- Q lcl|Aclame:pro 143 GRVDLASELNYSSALGLAKFLNGSYLFGVAGLENYGLINDPSLSAPITATTPW--------------------SG----- 197 (336) Q Consensus 143 ~g~~l~~~K~~aAr~a~e~~~n~i~~~Gd~~~g~~GllN~Pnl~~~~~~~t~w--------------------~~----- 197 (336) . .|.+--....++++.+.++.-.++|++..+..|++|++.........+.+ +. T Consensus 248 ~--~l~~~i~~~l~~~i~~~~d~~~l~G~G~~~p~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 325 (497) T protein:vir:78 248 P--ELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFGATSATVSNVKFPADGTNGAFVGQDTV 325 (497) T ss_pred H--HHHHHHHHHHHHHHHHHHHHHhhcCCCcccccccccccccccccccccchhhhhhhhhhhhhhcccccchhhhhhHH Confidence 2 47888888889999999999999999988899999988654322111100 00 Q ss_pred ----------------------ccCHHHHHHHHHHHHHHHHHHhCCceeccCCcEEEecHHHHHhccc-CCCCCccHHHH Q lcl|Aclame:pro 198 ----------------------SPAVEAVVNEVVTLFQVLQTQSQGIITQEAVLHMGLPPTAMSDLSK-TNQYGLSAAAK 254 (336) Q Consensus 198 ----------------------~~T~~eI~~Di~~l~~~l~~~t~g~v~~~~p~tL~Lp~~~~~~Ls~-~~~~~~Tvl~~ 254 (336) ..+...++.++..++..+..... ..|+.++|.+..+..|.+ .+..|.-++.- T Consensus 326 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~vmn~~~~~~l~~lkd~~G~~i~~~ 400 (497) T protein:vir:78 326 ASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLF-----QTPNAVVMNPRDWELLRLTKDANGQYMGGN 400 (497) T ss_pred HHHHHHHhhhhhhhhccchhccccchhhhhhHHHHHHhhhhhhcc-----cCCCeEEEchHHHHHHHHhhcCCCceeccC Confidence 00223344555555555544321 246678888888877753 23334322210 Q ss_pred ---------HHHh--CCccEEEEcccccCCC----Cc---eEEEEEEeeCCCceEEEEeCchhhcccceecCCceEEeee Q lcl|Aclame:pro 255 ---------LKEI--FPKLEFVTIPEYDTAS----GR---LVQLWAPRVEGKDTATCGFTEKMRAHSIERYSSYFRQKKS 316 (336) Q Consensus 255 ---------l~~n--~pnl~i~~~pel~~a~----G~---~~~~~~~~~~~~~~~~~~~p~~~~~l~~~~~~~~~~v~~~ 316 (336) .... .-+..++..+... ++ |. ..+.++++ .+ .++.+... .......-.+.+-++ T Consensus 401 ~~~~~~~~~~~~~~~l~G~pV~~t~~~~-~~~~~~Gd~~~~~~~i~~r-~~---~~v~~~~~---~~~~f~~n~v~~r~~ 472 (497) T protein:vir:78 401 FFGNAYGNPVNGGKNIWGVPVVTTPLIP-LGTILVGHFAPSVIQTARR-EG---VTMQMTNS---NGTDFVDGKVTVRAE 472 (497) T ss_pred cccccccccccCCceeeceeeEecCCCC-CCceEEeecccceEEEEEe-cc---cEEEeecc---cchhhhcCcEEEEEE Confidence 0000 0011222222221 11 11 11122222 11 11111110 011111234566777 Q ss_pred cceeeeEEecccceeeeccC Q lcl|Aclame:pro 317 AGTWGAVIFRPFAVAQMIGV 336 (336) Q Consensus 317 ~rt~Gv~ir~P~ai~~~~GI 336 (336) .|++| .|++|.||++++-. T Consensus 473 ~r~~~-~v~~p~A~~~l~~~ 491 (497) T protein:vir:78 473 ERLGL-LVYRPSAFQLIQLK 491 (497) T ss_pred Eeecc-eeeccccEEEEEec Confidence 78877 67899999999888 No 52 >protein:vir:81227 Length: 413 # NCBI annotation: gp6, major capsid protein # Family: family:all:585 # MgeID: mge:1893 # MgeName: BFK20 # Cross-refs: genbank:acc:YP_001456736;genbank:gi:157168379;hssp:P49861;interpro:IPR006444;uniprot:Q9MBJ9;genbank:GeneID:5580350 Probab=97.43 E-value=4.3e-05 Score=44.51 Aligned_cols=302 Identities=12% Similarity=0.054 Sum_probs=147.9 Q ss_pred CchHHHHHHHhhcceeccc------------------------hhhhh-hhhhhhhhhhhhhhcCccccCCcchHHHHHH Q lcl|Aclame:pro 1 MRDAQRIQNLARAGVILPR------------------------SVKNV-STPLAEYAMDAADLSPHLSSTGSSGIPNYLT 55 (336) Q Consensus 1 m~~~~~~~~l~~~g~~~~~------------------------~~~~~-~~~~~~~~~da~d~~~~l~t~~~~~i~~~l~ 55 (336) +.+.+.-..+.+.+..... ..... ..+... ..+.+ ...+.++.....+|..+. T Consensus 58 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~-~~~~~~~~~~~~vp~~~~ 135 (413) T protein:vir:81 58 SVDSEKSGELTRKGEGYKSIGEFFAKRAGDQIKQQAGGAQLNYSVGEYVAPRVKA-ASDPA-STATLTDEFQGGYGTTWN 135 (413) T ss_pred HHhHHHhhhHhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHhhhhhhhhhhhHHHh-hhhhh-hhcccccccccccchhhH Confidence 1111111111111110000 00000 000010 11111 111223344444665554 Q ss_pred HhhCceeeeeeccccchhhhcccccCCCcceeeEEEeeee----cccceEEeecccCCceeee-eeeeeeeeEEEEEEEE Q lcl|Aclame:pro 56 TYVDPSVIDILVAPMKAAELVGESKKGDWTTLVAAFITAE----PTTTVATYGDYSSDGDSGT-NINYPQRQSYFFQTWT 130 (336) Q Consensus 56 ~~idp~v~~~~~~~~~~~~l~~v~t~g~w~~~t~~~~v~e----~~G~a~~ygd~~DiP~vd~-~~~~~~~~v~~~~~~~ 130 (336) +++++.+.......+++++.+... .+..|++.. ..+.+..++.+..+|-.+. ........++.++..+ T Consensus 136 ----~~ii~~~~~~~~l~~~~~~~~~~~---~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~f~~i~~~~~k~~~~~ 208 (413) T protein:vir:81 136 ----RNIIYRRREKLVVADLMDNLTMTN---TTIKYLMEKANRVVEGGFKTVAEGGKKPYMRFADFDIVTESLSKIAGLT 208 (413) T ss_pred ----HHHHHHHhhhhhHHhhcceeeccC---CceeEEEeccccccccccceecCcccccccCcccceeeEeeeeeEEEee Confidence 567777777777788877654332 223444322 2245677787888887774 6777888888898889 Q ss_pred EeCHHHHHHHHHhCCCHHHHHHHHHHHHHHHhhccEEEeeccc-cceEEEEecCCCCcccccccccccccCHHHHHHHHH Q lcl|Aclame:pro 131 RWGERELEMAGAGRVDLASELNYSSALGLAKFLNGSYLFGVAG-LENYGLINDPSLSAPITATTPWSGSPAVEAVVNEVV 209 (336) Q Consensus 131 ~y~~~El~~A~~~g~~l~~~K~~aAr~a~e~~~n~i~~~Gd~~-~g~~GllN~Pnl~~~~~~~t~w~~~~T~~eI~~Di~ 209 (336) .+|.+=|..+. .|.+--....+.++.+.+++-.++|++. ....|++|.+++..... .+.+.++++|. T Consensus 209 ~iS~ell~ds~----~l~~~i~~~la~~~~~~~d~~~l~G~G~~~~~~Gi~~~~~~~~~~~--------~~~~~~~~~i~ 276 (413) T protein:vir:81 209 KITDEMIEDYD----FLVSYINARLLEELAIEEERQLLLGDGTGNNLTGLLKRDGIQTLAV--------SNKDELADSIY 276 (413) T ss_pred hhhHHHHHHHH----HHHHHHHHHHHHHHHHHHHHHHhccCCCCCcccccccccccccccc--------cccchhHHHHH Confidence 99976444332 2777777777888888888888999853 34579999877653221 12344677777 Q ss_pred HHHHHHHHHhCCceeccCCcEEEecHHHHHhccc-CCCCCccHHHH-HHHh--CCc----cEEEEcccccCC---CCce- Q lcl|Aclame:pro 210 TLFQVLQTQSQGIITQEAVLHMGLPPTAMSDLSK-TNQYGLSAAAK-LKEI--FPK----LEFVTIPEYDTA---SGRL- 277 (336) Q Consensus 210 ~l~~~l~~~t~g~v~~~~p~tL~Lp~~~~~~Ls~-~~~~~~Tvl~~-l~~n--~pn----l~i~~~pel~~a---~G~~- 277 (336) .++..+....+ -.+..++|.++.+..|.+ .+..|.-++.- +... .+. -++-..|-.-.. .|.. T Consensus 277 ~~~~~~~~~~~-----~~~~~~vmn~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~~~~~~~~l~G~pv~~s~~~~~~~~~ 351 (413) T protein:vir:81 277 KAMTNISLATP-----FQADALVINPLDYQELRLAKDANGQYYGGGVFQGQYGSGGIMLDPAPWGLRTVQSQVVPVGKPV 351 (413) T ss_pred HHHHHhhhhcc-----CCCcEEEEcHHHHHHHHHhhccCCceeccccccccccccccccCceecceeeEEcCCCCcccEE Confidence 77766543322 135579999998888753 34334333211 1110 000 012122221111 1211 Q ss_pred ------EEEEEEeeCCCceEEEEeCchhhcccceecCCceEEeeecceeeeEEecccceeeeccC Q lcl|Aclame:pro 278 ------VQLWAPRVEGKDTATCGFTEKMRAHSIERYSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) Q Consensus 278 ------~~~~~~~~~~~~~~~~~~p~~~~~l~~~~~~~~~~v~~~~rt~Gv~ir~P~ai~~~~GI 336 (336) .+.++++ .+ ..+.+.. ........-...+-+..|++| .+++|.||+.++.= T Consensus 352 ~gd~~~~~~~~~~-~~---~~v~~~~---~~~~~~~~~~~~~r~~~r~d~-~~~~~~a~~~l~~~ 408 (413) T protein:vir:81 352 VGAFRSAASVLRK-GG---VRIDSTN---TNVDDFENNLITVRAEERVGL-MVTFPEAIVQLDVA 408 (413) T ss_pred EEecccEEEEEEe-cc---eEEEEec---cccchhhcCcEEEEEEEeecc-EEecccceEEEEec Confidence 1111111 01 1111100 000001112345556666664 55789999988766 No 53 >protein:vir:4339 Length: 395 # NCBI annotation: major head protein # Family: family:all:585 # MgeID: mge:93 # MgeName: D3 # Cross-refs: genbank:acc:NP_061502;genbank:gi:9635591;genbank:GeneID:1262860 Probab=97.43 E-value=2.3e-05 Score=46.00 Aligned_cols=305 Identities=12% Similarity=0.065 Sum_probs=148.7 Q ss_pred CchHH-HHHHH-------hhcceeccchhhh----hhh---------hhhhhhhhhhhhcCccccCCcc--hHHHHHHHh Q lcl|Aclame:pro 1 MRDAQ-RIQNL-------ARAGVILPRSVKN----VST---------PLAEYAMDAADLSPHLSSTGSS--GIPNYLTTY 57 (336) Q Consensus 1 m~~~~-~~~~l-------~~~g~~~~~~~~~----~~~---------~~~~~~~da~d~~~~l~t~~~~--~i~~~l~~~ 57 (336) +...+ .+.+. ++.+.. +..... ... ..+...+-........++..++ .+|..+. T Consensus 54 ~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~vp~~~~-- 130 (395) T protein:vir:43 54 QGELQARLSAAEQAMLANEKRDGG-EEAPKTAGQMVAESLKEQGVTSSLRGSHRVSMPRSAITSIDGSGGALVAPDRR-- 130 (395) T ss_pred HHHHHHHHHHHHHHHHhhhccccc-cchhhhHHHHHHHHHHHHHHHHHhhhhhhhhhhhhhhcccCCCCccccchhhH-- Confidence 00000 00000 000000 000000 000 0000000000000001112222 1233222 Q ss_pred hCceeeeeeccccchhhhcccccCCCcceeeEEEeeeec-ccceEEeecccCCceeeeeeeeeeeeEEEEEEEEEeCHHH Q lcl|Aclame:pro 58 VDPSVIDILVAPMKAAELVGESKKGDWTTLVAAFITAEP-TTTVATYGDYSSDGDSGTNINYPQRQSYFFQTWTRWGERE 136 (336) Q Consensus 58 idp~v~~~~~~~~~~~~l~~v~t~g~w~~~t~~~~v~e~-~G~a~~ygd~~DiP~vd~~~~~~~~~v~~~~~~~~y~~~E 136 (336) ++|++.+........++++.+.+. .++.|++... .+.+..++.+...|..+..........+.++..+.+|.+= T Consensus 131 --~~ii~~~~~~~~l~~l~~~~~~~~---~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~i~~~~~k~~~~~~is~el 205 (395) T protein:vir:43 131 --PGVVAAPQRRLTIRDLVAPGTTES---NSVEYVRETGFVNNAAPVSEGTQKPYSDLTFELENAPVRTIAHLFKASRQI 205 (395) T ss_pred --HHHHHHHHhhhhHHhhccceecCC---CceEEEEEecCCCceeeecCCccccccccceeEEEEeeeeEEEeehhhHHH Confidence 456666666667777777765443 2456666433 4677888988899999999999999999999999999654 Q ss_pred HHHHHHhCCCHHHHHHHHHHHHHHHhhccEEEeeccccc-eEEEEecCCCCcccccccccccccCHHHHHHHHHHHHHHH Q lcl|Aclame:pro 137 LEMAGAGRVDLASELNYSSALGLAKFLNGSYLFGVAGLE-NYGLINDPSLSAPITATTPWSGSPAVEAVVNEVVTLFQVL 215 (336) Q Consensus 137 l~~A~~~g~~l~~~K~~aAr~a~e~~~n~i~~~Gd~~~g-~~GllN~Pnl~~~~~~~t~w~~~~T~~eI~~Di~~l~~~l 215 (336) ++.+ . .+.+--....++++.+.++.-.++|++..+ ..|+++.+.+....... ..+.+..++||.+++..+ T Consensus 206 l~d~---~-~l~~~v~~~la~a~~~~~d~~~l~G~g~~~~~~Gi~~~~~~~~~~~~~-----~~~~~~~~~~i~~~~~~~ 276 (395) T protein:vir:43 206 LDDA---S-ALQSYIDARARYGLMLVEECQLLYGNGTGANLHGIIPQAQAYAPPSGV-----VVTAEQRIDRIRLAILQA 276 (395) T ss_pred HHhH---H-HHHHHHHHHHHHHHHHHHHHHHHhccCCCCcccccccccccccccccc-----ccccchhHHHHHHHHHhh Confidence 4322 2 577777777788888888888888986433 46999876553321111 123456788888888777 Q ss_pred HHHhCCceeccCCcEEEecHHHHHhccc-CCCCCccHHHHHHHh----CCccEEEEcccccCC---CCceE--EEEEEee Q lcl|Aclame:pro 216 QTQSQGIITQEAVLHMGLPPTAMSDLSK-TNQYGLSAAAKLKEI----FPKLEFVTIPEYDTA---SGRLV--QLWAPRV 285 (336) Q Consensus 216 ~~~t~g~v~~~~p~tL~Lp~~~~~~Ls~-~~~~~~Tvl~~l~~n----~pnl~i~~~pel~~a---~G~~~--~~~~~~~ 285 (336) ...- -.+..++|.|..+..|.+ .+..|.-++.-.... +-++.++..+.+... -|.-. +.++++ T Consensus 277 ~~~~------~~~~~~vmn~~~~~~l~~lkd~~G~~i~~~~~~~~~~~l~G~pVv~~~~~~~~~~~~gd~~~~~~~~~~- 349 (395) T protein:vir:43 277 QLAE------FPASGIVLNPIDWALIELNKDAENRYIIGSPQNGTTPTLWRLPVVETQAITQDEFLTGAFSLGAQIFDR- 349 (395) T ss_pred cccc------CCCcEEEEcHHHHHHHHHhhccCCceeccccccCCCceecceeeEEcCCCCCCcEEEEeccceEEEEEe- Confidence 4321 135589999999988853 344343333211111 112233333332211 12211 112211 Q ss_pred CCCceEEEEeCchhhcccceecCCceEEeeecceeeeEEecccceeeeccC Q lcl|Aclame:pro 286 EGKDTATCGFTEKMRAHSIERYSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) Q Consensus 286 ~~~~~~~~~~p~~~~~l~~~~~~~~~~v~~~~rt~Gv~ir~P~ai~~~~GI 336 (336) .+ ..+.+... .+.- ...-.+..-+..| .|+.+++|-||++++-= T Consensus 350 ~~---~~i~~~~~--~~~~-f~~~~~~~r~~~r-~d~~v~~~~a~~~~~~t 393 (395) T protein:vir:43 350 MD---IEVLVSTE--NDKD-FENNMVTIRAEER-LAFAVYRPEAFVTGSLT 393 (395) T ss_pred cc---eEEEEecc--ccch-hhcCcEEEEEEEe-eccEEecccceEEEEec Confidence 11 11111100 0000 0001122233334 45566889999888544 No 54 >protein:vir:191 Length: 385 # NCBI annotation: major head subunit precursor # Family: family:all:585 # MgeID: mge:6 # MgeName: HK97 # Cross-refs: genbank:acc:NP_037701;genbank:gi:9634158;genbank:GeneID:1262530 Probab=97.28 E-value=2.9e-05 Score=45.42 Aligned_cols=304 Identities=13% Similarity=0.079 Sum_probs=152.0 Q ss_pred CchHHH-HHHHhh---cceeccchhhh----hhhhhhhhhh--------hh-hhhcCccccCCcchHHHHHHHhhCceee Q lcl|Aclame:pro 1 MRDAQR-IQNLAR---AGVILPRSVKN----VSTPLAEYAM--------DA-ADLSPHLSSTGSSGIPNYLTTYVDPSVI 63 (336) Q Consensus 1 m~~~~~-~~~l~~---~g~~~~~~~~~----~~~~~~~~~~--------da-~d~~~~l~t~~~~~i~~~l~~~idp~v~ 63 (336) ++..++ +.++++ .+..-+..... ...++..... .. ..+...-++.+...+|..+ .++++ T Consensus 50 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~i~~~~----~~~ii 125 (385) T protein:vir:19 50 LTKSGTRLFDLEQKLASGAENPGEKKSFSERAAEELIKSWDGKQGTFGAKTFNKSLGSDADSAGSLIQPMQ----IPGII 125 (385) T ss_pred HHHHHHHHHHHHHHhhccccccchhhhhHHHHHHHHHHHHHHhhccchhhHHHhhhccccccCCceecchh----hhHHH Confidence 111111 111111 11111111000 0011100000 00 0000000111111133222 24566 Q ss_pred eeeccccchhhhcccccCCCcceeeEEEeeeec-ccceEEeecccCCceeeeeeeeeeeeEEEEEEEEEeCHHHHHHHHH Q lcl|Aclame:pro 64 DILVAPMKAAELVGESKKGDWTTLVAAFITAEP-TTTVATYGDYSSDGDSGTNINYPQRQSYFFQTWTRWGERELEMAGA 142 (336) Q Consensus 64 ~~~~~~~~~~~l~~v~t~g~w~~~t~~~~v~e~-~G~a~~ygd~~DiP~vd~~~~~~~~~v~~~~~~~~y~~~El~~A~~ 142 (336) +..........++++...+. ..+.|++.+. .+.+...+.+..+|..+..........+.++..+.+|.+ +..-. T Consensus 126 ~~~~~~~~l~~~~~~~~~~~---~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~~~~~~k~~~~~~is~e-ll~d~- 200 (385) T protein:vir:19 126 MPGLRRLTIRDLLAQGRTSS---NALEYVREEVFTNNADVVAEKALKPESDITFSKQTANVKTIAHWVQASRQ-VMDDA- 200 (385) T ss_pred HHhhhccchhhhcceecccC---cceEEEEEecCCcceeeeccCccccccccceeEEEEeeeeEEEeehhhHH-HHhhH- Confidence 66666666777777754432 2356777665 356677788889999999999999999999999999964 43322 Q ss_pred hCCCHHHHHHHHHHHHHHHhhccEEEeeccc-cceEEEEecCCCCcccccccccccccCHHHHHHHHHHHHHHHHHHhCC Q lcl|Aclame:pro 143 GRVDLASELNYSSALGLAKFLNGSYLFGVAG-LENYGLINDPSLSAPITATTPWSGSPAVEAVVNEVVTLFQVLQTQSQG 221 (336) Q Consensus 143 ~g~~l~~~K~~aAr~a~e~~~n~i~~~Gd~~-~g~~GllN~Pnl~~~~~~~t~w~~~~T~~eI~~Di~~l~~~l~~~t~g 221 (336) .++.+.-....+.++.+.++.-.+.|++. ....|+++.+........ .+.+..++||.+++..+...- T Consensus 201 --~~l~~~i~~~la~a~~~~~d~~~l~G~g~~~~~~Gi~~~~~~~~~~~~-------~~~~~~~d~i~~~~~~l~~~~-- 269 (385) T protein:vir:19 201 --PMLQSYINNRLMYGLALKEEGQLLNGDGTGDNLEGLNKVATAYDTSLN-------ATGDTRADIIAHAIYQVTESE-- 269 (385) T ss_pred --HHHHHHHHHHHHHHHHHHHHHHHHhccCCCCccccccccccccccccc-------ccccchHHHHHHHHHhhcccc-- Confidence 24777777778888888888888999853 445789987654322111 122335677888877774321 Q ss_pred ceeccCCcEEEecHHHHHhccc-CCCCCccHHHHHHHhCC----ccEEEEcccccC---CCCc--eEEEEEEeeCCCceE Q lcl|Aclame:pro 222 IITQEAVLHMGLPPTAMSDLSK-TNQYGLSAAAKLKEIFP----KLEFVTIPEYDT---ASGR--LVQLWAPRVEGKDTA 291 (336) Q Consensus 222 ~v~~~~p~tL~Lp~~~~~~Ls~-~~~~~~Tvl~~l~~n~p----nl~i~~~pel~~---a~G~--~~~~~~~~~~~~~~~ 291 (336) ..+..++|+++.+..|.. .+..|.-++.-....-+ ++.++..+.+.. .-|+ ..+.+++. .+ . T Consensus 270 ----~~~~~~~~~~~~~~~l~~lkd~~G~~l~~~~~~~~~~~l~G~pV~~~~~~p~~~~~~gd~~~~~~~~~~---~~-~ 341 (385) T protein:vir:19 270 ----FSASGIVLNPRDWHNIALLKDNEGRYIFGGPQAFTSNIMWGLPVVPTKAQAAGTFTVGGFDMASQVWDR---MD-A 341 (385) T ss_pred ----CCCCEEEEcHHHHHHHHHhhcCCCceeccCcccCCCceecceeeEEcCcCCCCcEEEeecccEEEEEEe---cc-e Confidence 235689999999988854 34434433221111111 122332222211 0111 11222211 11 1 Q ss_pred EEEeCchhhcccceecCCceEEeeecceeeeEEecccceeeeccC Q lcl|Aclame:pro 292 TCGFTEKMRAHSIERYSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) Q Consensus 292 ~~~~p~~~~~l~~~~~~~~~~v~~~~rt~Gv~ir~P~ai~~~~GI 336 (336) .+.+... ..-....-.+.+-+..|++|. +++|.||++++.- T Consensus 342 ~v~~~~~---~~~~~~~~~~~~~~~~r~~~~-v~~~~a~~~~~~~ 382 (385) T protein:vir:19 342 TVEVSRE---DRDNFVKNMLTILCEERLALA-HYRPTAIIKGTFS 382 (385) T ss_pred EEEEecc---ccchhhcCcEEEEEEEeeccE-EecccceEEEEec Confidence 1111100 000011123455677777755 5789999999988 No 55 >protein:vir:1886 Length: 385 # NCBI annotation: major capsid subunit precursor # Family: family:all:585 # MgeID: mge:41 # MgeName: HK022 # Cross-refs: genbank:acc:NP_037666;genbank:gi:9634124;genbank:GeneID:1262513 Probab=97.28 E-value=2.9e-05 Score=45.42 Aligned_cols=304 Identities=13% Similarity=0.079 Sum_probs=152.0 Q ss_pred CchHHH-HHHHhh---cceeccchhhh----hhhhhhhhhh--------hh-hhhcCccccCCcchHHHHHHHhhCceee Q lcl|Aclame:pro 1 MRDAQR-IQNLAR---AGVILPRSVKN----VSTPLAEYAM--------DA-ADLSPHLSSTGSSGIPNYLTTYVDPSVI 63 (336) Q Consensus 1 m~~~~~-~~~l~~---~g~~~~~~~~~----~~~~~~~~~~--------da-~d~~~~l~t~~~~~i~~~l~~~idp~v~ 63 (336) ++..++ +.++++ .+..-+..... ...++..... .. ..+...-++.+...+|..+ .++++ T Consensus 50 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~i~~~~----~~~ii 125 (385) T protein:vir:18 50 LTKSGTRLFDLEQKLASGAENPGEKKSFSERAAEELIKSWDGKQGTFGAKTFNKSLGSDADSAGSLIQPMQ----IPGII 125 (385) T ss_pred HHHHHHHHHHHHHHhhccccccchhhhhHHHHHHHHHHHHHHhhccchhhHHHhhhccccccCCceecchh----hhHHH Confidence 111111 111111 11111111000 0011100000 00 0000000111111133222 24566 Q ss_pred eeeccccchhhhcccccCCCcceeeEEEeeeec-ccceEEeecccCCceeeeeeeeeeeeEEEEEEEEEeCHHHHHHHHH Q lcl|Aclame:pro 64 DILVAPMKAAELVGESKKGDWTTLVAAFITAEP-TTTVATYGDYSSDGDSGTNINYPQRQSYFFQTWTRWGERELEMAGA 142 (336) Q Consensus 64 ~~~~~~~~~~~l~~v~t~g~w~~~t~~~~v~e~-~G~a~~ygd~~DiP~vd~~~~~~~~~v~~~~~~~~y~~~El~~A~~ 142 (336) +..........++++...+. ..+.|++.+. .+.+...+.+..+|..+..........+.++..+.+|.+ +..-. T Consensus 126 ~~~~~~~~l~~~~~~~~~~~---~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~~~~~~k~~~~~~is~e-ll~d~- 200 (385) T protein:vir:18 126 MPGLRRLTIRDLLAQGRTSS---NALEYVREEVFTNNADVVAEKALKPESDITFSKQTANVKTIAHWVQASRQ-VMDDA- 200 (385) T ss_pred HHhhhccchhhhcceecccC---cceEEEEEecCCcceeeeccCccccccccceeEEEEeeeeEEEeehhhHH-HHhhH- Confidence 66666666777777754432 2356777665 356677788889999999999999999999999999964 43322 Q ss_pred hCCCHHHHHHHHHHHHHHHhhccEEEeeccc-cceEEEEecCCCCcccccccccccccCHHHHHHHHHHHHHHHHHHhCC Q lcl|Aclame:pro 143 GRVDLASELNYSSALGLAKFLNGSYLFGVAG-LENYGLINDPSLSAPITATTPWSGSPAVEAVVNEVVTLFQVLQTQSQG 221 (336) Q Consensus 143 ~g~~l~~~K~~aAr~a~e~~~n~i~~~Gd~~-~g~~GllN~Pnl~~~~~~~t~w~~~~T~~eI~~Di~~l~~~l~~~t~g 221 (336) .++.+.-....+.++.+.++.-.+.|++. ....|+++.+........ .+.+..++||.+++..+...- T Consensus 201 --~~l~~~i~~~la~a~~~~~d~~~l~G~g~~~~~~Gi~~~~~~~~~~~~-------~~~~~~~d~i~~~~~~l~~~~-- 269 (385) T protein:vir:18 201 --PMLQSYINNRLMYGLALKEEGQLLNGDGTGDNLEGLNKVATAYDTSLN-------ATGDTRADIIAHAIYQVTESE-- 269 (385) T ss_pred --HHHHHHHHHHHHHHHHHHHHHHHHhccCCCCccccccccccccccccc-------ccccchHHHHHHHHHhhcccc-- Confidence 24777777778888888888888999853 445789987654322111 122335677888877774321 Q ss_pred ceeccCCcEEEecHHHHHhccc-CCCCCccHHHHHHHhCC----ccEEEEcccccC---CCCc--eEEEEEEeeCCCceE Q lcl|Aclame:pro 222 IITQEAVLHMGLPPTAMSDLSK-TNQYGLSAAAKLKEIFP----KLEFVTIPEYDT---ASGR--LVQLWAPRVEGKDTA 291 (336) Q Consensus 222 ~v~~~~p~tL~Lp~~~~~~Ls~-~~~~~~Tvl~~l~~n~p----nl~i~~~pel~~---a~G~--~~~~~~~~~~~~~~~ 291 (336) ..+..++|+++.+..|.. .+..|.-++.-....-+ ++.++..+.+.. .-|+ ..+.+++. .+ . T Consensus 270 ----~~~~~~~~~~~~~~~l~~lkd~~G~~l~~~~~~~~~~~l~G~pV~~~~~~p~~~~~~gd~~~~~~~~~~---~~-~ 341 (385) T protein:vir:18 270 ----FSASGIVLNPRDWHNIALLKDNEGRYIFGGPQAFTSNIMWGLPVVPTKAQAAGTFTVGGFDMASQVWDR---MD-A 341 (385) T ss_pred ----CCCCEEEEcHHHHHHHHHhhcCCCceeccCcccCCCceecceeeEEcCcCCCCcEEEeecccEEEEEEe---cc-e Confidence 235689999999988854 34434433221111111 122332222211 0111 11222211 11 1 Q ss_pred EEEeCchhhcccceecCCceEEeeecceeeeEEecccceeeeccC Q lcl|Aclame:pro 292 TCGFTEKMRAHSIERYSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) Q Consensus 292 ~~~~p~~~~~l~~~~~~~~~~v~~~~rt~Gv~ir~P~ai~~~~GI 336 (336) .+.+... ..-....-.+.+-+..|++|. +++|.||++++.- T Consensus 342 ~v~~~~~---~~~~~~~~~~~~~~~~r~~~~-v~~~~a~~~~~~~ 382 (385) T protein:vir:18 342 TVEVSRE---DRDNFVKNMLTILCEERLALA-HYRPTAIIKGTFS 382 (385) T ss_pred EEEEecc---ccchhhcCcEEEEEEEeeccE-EecccceEEEEec Confidence 1111100 000011123455677777755 5789999999988 No 56 >protein:vir:10364 Length: 390 # NCBI annotation: head protein; major capsid subunit precursor # Family: family:all:585 # MgeID: mge:183 # MgeName: Xp10 # Cross-refs: genbank:acc:NP_858956;genbank:gi:32128421;genbank:GeneID:2648357 Probab=97.23 E-value=7.3e-05 Score=43.26 Aligned_cols=300 Identities=13% Similarity=0.073 Sum_probs=146.1 Q ss_pred CchH-HHHHHHhhcceeccchhhhhh----------------------hhhhhhhhhhhhhcCccccCCcc-h-HHHHHH Q lcl|Aclame:pro 1 MRDA-QRIQNLARAGVILPRSVKNVS----------------------TPLAEYAMDAADLSPHLSSTGSS-G-IPNYLT 55 (336) Q Consensus 1 m~~~-~~~~~l~~~g~~~~~~~~~~~----------------------~~~~~~~~da~d~~~~l~t~~~~-~-i~~~l~ 55 (336) ++.. +.+.++++.+...+....... ......+.. ..+....+.+.+ . +|.++. T Consensus 54 i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~g~~~~~~~~~ 131 (390) T protein:vir:10 54 VQAARQRVAELEGNGAGGDVQHVSVGDLFVASEQFQASAGRWNDRSARATMNIKAAL--NTASTDAAGSAGALTTPNRLP 131 (390) T ss_pred HHHHHHHHHHHHhhcccccccccchhhhhhhhHHHHHHHHhhhhhhhhhhhHHHHHH--HhhhcccccccccccchhHHH Confidence 1100 111111211111110000000 000000100 111111222222 2 333333 Q ss_pred HhhCceeeeeeccccchhhhcccccCCCcceeeEEEeeeec-ccceEEeecccCCceeeeeeeeeeeeEEEEEEEEEeCH Q lcl|Aclame:pro 56 TYVDPSVIDILVAPMKAAELVGESKKGDWTTLVAAFITAEP-TTTVATYGDYSSDGDSGTNINYPQRQSYFFQTWTRWGE 134 (336) Q Consensus 56 ~~idp~v~~~~~~~~~~~~l~~v~t~g~w~~~t~~~~v~e~-~G~a~~ygd~~DiP~vd~~~~~~~~~v~~~~~~~~y~~ 134 (336) ++++.+........++.+.+.+. .++.|+..+. .+.+...+....+|-.+..........+.++..+.+|. T Consensus 132 -----~ii~~~~~~~~l~~~~~~~~~~~---~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~i~~~~~k~~~~~~is~ 203 (390) T protein:vir:10 132 -----GFITQPDARLTVRDLIGSGRTDS---ALIEYVQETGFVNNAAIVAEGALKPESSLKFAKKTDTTHVIAHTMKATR 203 (390) T ss_pred -----HHHHHHHhhchhhhhcceeeccC---CceEEEEEecCCcceeeecCCccccccccceeEEEEeeEEEEEeehhhH Confidence 45555555556667776655433 2356666554 46777888888899999998999999999999999987 Q ss_pred HHHHHHHHhCCCHHHHHHHHHHHHHHHhhccEEEeeccc-cceEEEEecCCCCcccccccccccccCHHHHHHHHHHHHH Q lcl|Aclame:pro 135 RELEMAGAGRVDLASELNYSSALGLAKFLNGSYLFGVAG-LENYGLINDPSLSAPITATTPWSGSPAVEAVVNEVVTLFQ 213 (336) Q Consensus 135 ~El~~A~~~g~~l~~~K~~aAr~a~e~~~n~i~~~Gd~~-~g~~GllN~Pnl~~~~~~~t~w~~~~T~~eI~~Di~~l~~ 213 (336) + +..-. .++.+.-....++++.+.+++-.++|++. ....|++|.+......+. .+....++|+..++. T Consensus 204 e-ll~d~---~~l~~~i~~~l~~~~~~~~~~~il~G~G~~~~p~Gi~~~~~~~~~~~~-------~~~~~~~~~~~~~~~ 272 (390) T protein:vir:10 204 Q-ILSDA---PQLASYMNNRLIRGLKVKEDAEILRGTGANDGLLGLIPQATTYAAPTT-------IAGATRVDQLRLAML 272 (390) T ss_pred H-HHHhH---HHHHHHHHHHHHHHHHHHHHHHHhhcCCCCcccccccccccccccccc-------ccccchHHHHHHHHH Confidence 5 43322 26788888888889999999989999863 447899997665432211 112224567777777 Q ss_pred HHHHHhCCceeccCCcEEEecHHHHHhccc-CCCCCccHHHHHHHhCC----ccEEEEcccccCC---CCceE--EEEEE Q lcl|Aclame:pro 214 VLQTQSQGIITQEAVLHMGLPPTAMSDLSK-TNQYGLSAAAKLKEIFP----KLEFVTIPEYDTA---SGRLV--QLWAP 283 (336) Q Consensus 214 ~l~~~t~g~v~~~~p~tL~Lp~~~~~~Ls~-~~~~~~Tvl~~l~~n~p----nl~i~~~pel~~a---~G~~~--~~~~~ 283 (336) .+...- ..+..++|.|+.+..|.+ .+..|.-++.--...-+ ++.++..+.+... -|+-. +.+++ T Consensus 273 ~l~~~~------~~~~~~v~n~~~~~~L~~lkd~~g~~l~~~~~~~~~~~l~G~pv~~~~~~p~~~~~~gdf~~~~~~~~ 346 (390) T protein:vir:10 273 QASLAE------YPASGIVINPIDWAAIELAKDANNQYLIGNARGTLTPTLWGLPVVATQAMAPGEFLVGAFDLAAQIFD 346 (390) T ss_pred hhcccc------CCCCEEEEcHHHHHHHHHhhcCCCceeecCCcCcCCceecceeeEEcCCCCCCcEEEEeccceEEEEE Confidence 664321 135579999999888864 34334333211111101 1222222222100 01111 11111 Q ss_pred eeCCCceEEEEeCchhhcccceecCCceEEeeecceeeeEEecccceeeeccC Q lcl|Aclame:pro 284 RVEGKDTATCGFTEKMRAHSIERYSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) Q Consensus 284 ~~~~~~~~~~~~p~~~~~l~~~~~~~~~~v~~~~rt~Gv~ir~P~ai~~~~GI 336 (336) + .+ ..+.+.. .+ .-...-...+-+..|++| .+++|.||+..+== T Consensus 347 ~-~~---~~i~~~~---~~-~~~~~~~~~~r~~~r~d~-~v~~~~a~~~~~~a 390 (390) T protein:vir:10 347 Q-WD---ARVEIGY---VN-DDFQRNMVTVLAEERLAL-VVYRPEALISGSFA 390 (390) T ss_pred e-cc---eEEEEee---cc-cccccCcEEEEEEEeecc-EEeccccEEEEEeC Confidence 1 11 1111110 01 001112234445556555 67888888654322 No 57 >protein:vir:8102 Length: 543 # NCBI annotation: gp6 # Family: family:all:21 # MgeID: mge:152 # MgeName: Che9c # Cross-refs: genbank:acc:NP_817683;genbank:gi:29566114;genbank:GeneID:1259308 Probab=96.97 E-value=0.00014 Score=41.69 Aligned_cols=306 Identities=10% Similarity=-0.003 Sum_probs=146.1 Q ss_pred CchH--------HHHHHHh----hcceecc--------------chhhhhhhhhhhhhhhhhhhcCccccCCcch--HHH Q lcl|Aclame:pro 1 MRDA--------QRIQNLA----RAGVILP--------------RSVKNVSTPLAEYAMDAADLSPHLSSTGSSG--IPN 52 (336) Q Consensus 1 m~~~--------~~~~~l~----~~g~~~~--------------~~~~~~~~~~~~~~~da~d~~~~l~t~~~~~--i~~ 52 (336) +.+. +.+.+++ +....-. .....+. .....++....+.. .|.+++| ||. T Consensus 188 ~e~~~~~~~~~~~~~d~~e~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~l~-~~e~~~~~~~~~~~--~t~~~gg~lip~ 264 (543) T protein:vir:81 188 SDNVRAAATKIIERFDDEDSTLARQCLATSSPAYLRAWSKMARNPHAAILT-EEEKRAINEVRAMG--LTKADGGYLVPF 264 (543) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhhhHHHHHHHhhHHHHhh-hhhhhhhhhhhhcc--cccccCcccCch Confidence 0000 0000000 0000000 0000010 01111222111111 2233332 343 Q ss_pred HHHHhhCceeeeeeccc-cchhhhcccccCCCcceeeEEEeeeecccceEEeecccCCceeeeeeeeeeeeEEEEEEEEE Q lcl|Aclame:pro 53 YLTTYVDPSVIDILVAP-MKAAELVGESKKGDWTTLVAAFITAEPTTTVATYGDYSSDGDSGTNINYPQRQSYFFQTWTR 131 (336) Q Consensus 53 ~l~~~idp~v~~~~~~~-~~~~~l~~v~t~g~w~~~t~~~~v~e~~G~a~~ygd~~DiP~vd~~~~~~~~~v~~~~~~~~ 131 (336) .+. ++++.....+ -....+..+.+. ...+.+++....+.+...|.+..+|..+.........++.++..+. T Consensus 265 ~~~----~~ii~~~~~~~~~l~~~~~~~~~----~g~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~i~~~~~k~~~~~~ 336 (543) T protein:vir:81 265 QLD----PTVIITSNGSLNDIRRFARQVVA----TGDVWHGVSSAAVQWSWDAEFEEVSDDSPEFGQPEIPVKKAQGFVP 336 (543) T ss_pred hhh----hHHHHHHHhhhchhhhhcccccC----CcceEEEEecCCcceeecccCccccccccccceeeeeeeeeEeeeh Confidence 332 2222111111 122333333221 2334666767777888889888999999888888999999999999 Q ss_pred eCHHHHHHHHHhCCCHHHHHHHHHHHHHHHhhccEEEeeccc-cceEEEEecCCCCcccccccccccccCHHHHHHHHHH Q lcl|Aclame:pro 132 WGERELEMAGAGRVDLASELNYSSALGLAKFLNGSYLFGVAG-LENYGLINDPSLSAPITATTPWSGSPAVEAVVNEVVT 210 (336) Q Consensus 132 y~~~El~~A~~~g~~l~~~K~~aAr~a~e~~~n~i~~~Gd~~-~g~~GllN~Pnl~~~~~~~t~w~~~~T~~eI~~Di~~ 210 (336) +|.+ +..- ..++.+.-......++.+.++.-+++|++. ....|+++++..... +..+.+ +..-.++|+.+ T Consensus 337 is~e-ll~d---~~~~~~~i~~~l~~~~~~~~d~ail~G~Gt~~~p~Gi~~~~~~~~~----~~~~~~-~~~~~~~~~~~ 407 (543) T protein:vir:81 337 ISIE-ALQD---EANVTETVALLFAEGKDELEAVTLTTGTGQGNQPTGIVTALAGTAA----EIAPVT-AETFALADVYA 407 (543) T ss_pred hhHH-HHhc---cHHHHHHHHHHHHHHHHHHHHHHHhccCCCCcccccchhhcccccc----cccccc-cccccHHHHHH Confidence 9984 4332 248999999999999999999999999963 467899986543211 111111 22335778888 Q ss_pred HHHHHHHHhCCceeccCCcEEEecHHHHHhccc-CCCCCccHHHHHHHh-------CCccEEEEcccccC--CCCceEEE Q lcl|Aclame:pro 211 LFQVLQTQSQGIITQEAVLHMGLPPTAMSDLSK-TNQYGLSAAAKLKEI-------FPKLEFVTIPEYDT--ASGRLVQL 280 (336) Q Consensus 211 l~~~l~~~t~g~v~~~~p~tL~Lp~~~~~~Ls~-~~~~~~Tvl~~l~~n-------~pnl~i~~~pel~~--a~G~~~~~ 280 (336) ++..+...- .....++|.+..+..|.+ .+..|.=++.-+... +|=+....+|.... .+.+...+ T Consensus 408 ~~~~l~~~~------~~~~~~v~n~~~~~~l~~lkd~~G~~l~~~~~~g~~~~l~G~pv~~~~~~~~~~~~~~~~~~~~i 481 (543) T protein:vir:81 408 VYEQLAARH------RRQGAWLANNLIYNKIRQFDTQGGAGLWTTIGNGEPSQLLGRPVGEAEAMDANWNTSASADNFVL 481 (543) T ss_pred HHHhhhccc------cCCcEEEEcHHHHHHHHHhhcCCCceeccCcCCCCCccccceeeEEeccccccccccccCCcceE Confidence 877664321 123479999999998854 333333222211111 12112222333321 11122223 Q ss_pred EEEeeCCCceEEEEe--Cchhhcccc-----eecCCceEEeeecceeeeEEecccceeeeccC Q lcl|Aclame:pro 281 WAPRVEGKDTATCGF--TEKMRAHSI-----ERYSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) Q Consensus 281 ~~~~~~~~~~~~~~~--p~~~~~l~~-----~~~~~~~~v~~~~rt~Gv~ir~P~ai~~~~GI 336 (336) ++-+.. . ..+.. .+.+...+- ......+.+-...|++| .+++|-||+.+.-- T Consensus 482 ~~gd~~--~-~~i~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~r~d~-~v~~~~A~~~l~~~ 540 (543) T protein:vir:81 482 LYGNFQ--N-YVIADRIGMTVEFIPHLFGTNRRPNGSRGWFAYYRMGA-DVVNPNAFRLLNVE 540 (543) T ss_pred EEeecc--c-eeEEeecccEEEEeccccccchhhcCceEEEEEEeecc-EeecccceEEEEec Confidence 332221 1 11111 111111110 01111233444555555 55679999887766 No 58 >protein:vir:3613 Length: 272 # NCBI annotation: MHP # Family: family:all:522 # MgeID: mge:74 # MgeName: TP901-1 # Cross-refs: genbank:acc:NP_112699;genbank:gi:13786567;genbank:GeneID:921035 Probab=96.83 E-value=9.5e-05 Score=42.65 Aligned_cols=257 Identities=12% Similarity=0.050 Sum_probs=133.2 Q ss_pred hhhhhhhhcCccccCCcchHHHHHHHhhCceeeeeeccccchhhhcccccC--CCcceeeEEEeeeecccceEEeecccC Q lcl|Aclame:pro 31 YAMDAADLSPHLSSTGSSGIPNYLTTYVDPSVIDILVAPMKAAELVGESKK--GDWTTLVAAFITAEPTTTVATYGDYSS 108 (336) Q Consensus 31 ~~~da~d~~~~l~t~~~~~i~~~l~~~idp~v~~~~~~~~~~~~l~~v~t~--g~w~~~t~~~~v~e~~G~a~~ygd~~D 108 (336) || + ..++-++--+|..+..||..++ -..+....+..+... |.. -.++.++.+...|++..++++++ T Consensus 1 ma---~----~~T~~~d~iiPev~~~~v~~~~----~~~~~~~~~~~~~~~l~g~~-G~ti~iP~~~~~gda~~~~eg~~ 68 (272) T protein:vir:36 1 MS---K----QKTTLADLVNPEVLAPIVSYEL----NKALRFAPLAQVDTTLQGQP-GNTLKFPAFTYIGDAADVAEGGE 68 (272) T ss_pred CC---C----cceehhhhhchHHHHHHHHHHH----HhhhhhccccccccccccCC-CCEEEEeeeccCccccccCCCCc Confidence 11 1 1345566677999998884443 334444555555432 222 36799999999999999999999 Q ss_pred CceeeeeeeeeeeeEEEEEEEEEeCHHHHHHHHHhCCCHHHHHHHHHHHHHHHhhccEEEeeccccceEEEEecCCCCcc Q lcl|Aclame:pro 109 DGDSGTNINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASELNYSSALGLAKFLNGSYLFGVAGLENYGLINDPSLSAP 188 (336) Q Consensus 109 iP~vd~~~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~K~~aAr~a~e~~~n~i~~~Gd~~~g~~GllN~Pnl~~~ 188 (336) ++.-+.+.......+...+-++++ .++.+++. +-++..+-...+.+++.+.+++..+ ..++- ... T Consensus 69 i~~~~lt~~~~~~~i~~~~k~~~v--tD~~~~~~-~~d~~~~~~~~~a~~~a~~~d~~i~---------~~l~~--~~~- 133 (272) T protein:vir:36 69 ISLDKIGTTTKSVTIKKAAKGTEI--TDEAALSG-YGDPIGESNKQLGLSLANKVDDDLL---------SAAKT--TSQ- 133 (272) T ss_pred cChhhcCCcceeEeeehhhccccc--cHHHHhhc-cchHHHHHHHHHHHHHHHHHHHHHH---------HHhcc--ccc- Confidence 999999999888888877655555 55555553 4455555555566666666664321 11110 000 Q ss_pred cccccccccccCHHHHHHHHHHHHHHHHHHhCCceeccCCcEEEecHHHHHhcccCCCC---CccHHHHHH-----HhCC Q lcl|Aclame:pro 189 ITATTPWSGSPAVEAVVNEVVTLFQVLQTQSQGIITQEAVLHMGLPPTAMSDLSKTNQY---GLSAAAKLK-----EIFP 260 (336) Q Consensus 189 ~~~~t~w~~~~T~~eI~~Di~~l~~~l~~~t~g~v~~~~p~tL~Lp~~~~~~Ls~~~~~---~~Tvl~~l~-----~n~p 260 (336) +.+ .+.+ +++|.+++..+-.. ...+..+++.|..+..|.+-... +.+..+-+. -.|- T Consensus 134 -~~~----~~~~----~d~i~~A~~~lgd~------~~~~~~ivv~p~~~~~L~k~~~~~~~~~~~~~~~~~~G~ig~~~ 198 (272) T protein:vir:36 134 -TVS----TKAN----VDGVQAALDIFNDE------DAQAYVLIVNPKDAAKIRKDANAKNIGSEVGANALINGTYADVL 198 (272) T ss_pred -ccc----cccc----HHHHHHHHHHhhhc------CCCceEEEEcHHHHHHHhcccccccccccccccceeeeccceec Confidence 001 1223 34555555544221 12367899999999988642211 001111010 1133 Q ss_pred ccEEEEcccccCCCC-ceEEEEEEeeCCCceEEEEeCchhh--cccceecCCceEEeeecceeeeEEecccceeee--cc Q lcl|Aclame:pro 261 KLEFVTIPEYDTASG-RLVQLWAPRVEGKDTATCGFTEKMR--AHSIERYSSYFRQKKSAGTWGAVIFRPFAVAQM--IG 335 (336) Q Consensus 261 nl~i~~~pel~~a~G-~~~~~~~~~~~~~~~~~~~~p~~~~--~l~~~~~~~~~~v~~~~rt~Gv~ir~P~ai~~~--~G 335 (336) +++|+.-..+....| ...+++.. .-+........+ ..-.+.+... .+- .-...|+-+.+|-+++.+ .| T Consensus 199 G~~Vv~s~~~p~~~~~~~~~~~~~-----gA~~~~~~~~~~vE~~R~~~~~~d-~i~-~~~~y~~~v~~~~~vv~~t~~g 271 (272) T protein:vir:36 199 GAQIVRSKKLAEGSALMFKIVSNS-----PALKLVLKRGVQVETDRDIVTKTT-VIT-ADEHYAAYLYDLTKVVNITFTG 271 (272) T ss_pred CeeEEEeCCCCCCceeEEEEEecc-----cceeeeecCCcccccccchhhcCc-EEE-EEEEEEEEEEcCccEEEEeecC Confidence 456655444432222 11222211 111111111111 1111111111 111 125589999999987765 68 Q ss_pred C Q lcl|Aclame:pro 336 V 336 (336) Q Consensus 336 I 336 (336) + T Consensus 272 ~ 272 (272) T protein:vir:36 272 V 272 (272) T ss_pred C Confidence 8 No 59 >protein:vir:93616 Length: 645 # NCBI annotation: putative major head protein/prohead protease # Family: family:all:21 # MgeID: mge:157 # MgeName: phi 4795 # Cross-refs: genbank:acc:YP_001449293;genbank:gi:157166041;goa:Q6H9U8;interpro:IPR006433;uniprot:Q6H9U8;genbank:GeneID:5580438 Probab=96.79 E-value=0.00031 Score=39.83 Aligned_cols=305 Identities=10% Similarity=0.002 Sum_probs=138.5 Q ss_pred Cch---HHHHHHHhhcceeccchhhh----h-hhhhhhhhhhhhhhcCccccCCc--c--hHHHHHHHhhCceeeeeecc Q lcl|Aclame:pro 1 MRD---AQRIQNLARAGVILPRSVKN----V-STPLAEYAMDAADLSPHLSSTGS--S--GIPNYLTTYVDPSVIDILVA 68 (336) Q Consensus 1 m~~---~~~~~~l~~~g~~~~~~~~~----~-~~~~~~~~~da~d~~~~l~t~~~--~--~i~~~l~~~idp~v~~~~~~ 68 (336) ++. .+.++.|.+.+-.+-.+.+. . ..........++.+.+ +++.+. + .+|+.+.. +|++.+.+ T Consensus 290 ~kg~~f~~~~~al~~~~g~~~~a~e~a~~~~~~~~~~~~~~~~a~~~~-~~~~~~~~Gg~~vp~~~~~----~ii~~l~~ 364 (645) T protein:vir:93 290 DKGIGFARFAKSLAAAKGVRSEALEVARRQYPDDSRLHHVLKSAVGAG-TTTDPQWAGSLSEYQEYAQ----DFIDYLRP 364 (645) T ss_pred hhhhhHHHHHHHHHhcccchhHHHHHHHhhcccchhhhhhhhhhhhcc-ccccccccCCccCchhhHH----HHHHhhhh Confidence 111 01122222211111111000 0 0001111111111111 222211 1 23444432 33444444 Q ss_pred ccchhhhcccccCCCcc-eeeEEEeeeecccceEEeecccCCceeeeeeeeeeeeEEEEEEEEEeCHHHHHHHHHhCCCH Q lcl|Aclame:pro 69 PMKAAELVGESKKGDWT-TLVAAFITAEPTTTVATYGDYSSDGDSGTNINYPQRQSYFFQTWTRWGERELEMAGAGRVDL 147 (336) Q Consensus 69 ~~~~~~l~~v~t~g~w~-~~t~~~~v~e~~G~a~~ygd~~DiP~vd~~~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l 147 (336) ......+-.....+... ...+..+.....+.+.+.|...+.|..+.......-+.+.++.-..+|.+=|+.+ ..++ T Consensus 365 ~svv~~l~~~~~~~~~~~~~~~~ip~~t~~~~a~wv~Eg~~~~~s~~~f~~v~l~~~kla~~~~iS~ell~ds---~~~~ 441 (645) T protein:vir:93 365 QTIIGRFGQGGIPALRQVPFNIRVHAQVSGGAAGWVGEGKTKPLTKFDFESITFSHAKVSAIAVLTEELIRFS---SPAA 441 (645) T ss_pred hhhHHhhccccccccccccCceeeeeeecCcceEEeccCccccccccceeEEEEeeEEEEEeehhHHHHHhhc---hHHH Confidence 33333332221111111 1234556656667788888888999999988888888888888888875544433 4567 Q ss_pred HHHHHHHHHHHHHHhhccEEEeecccc----ceEEEEecCCCCcccccccccccccCHHHHHHHHHHHHHHHHHHhCCce Q lcl|Aclame:pro 148 ASELNYSSALGLAKFLNGSYLFGVAGL----ENYGLINDPSLSAPITATTPWSGSPAVEAVVNEVVTLFQVLQTQSQGII 223 (336) Q Consensus 148 ~~~K~~aAr~a~e~~~n~i~~~Gd~~~----g~~GllN~Pnl~~~~~~~t~w~~~~T~~eI~~Di~~l~~~l~~~t~g~v 223 (336) .+--....+.++.+.+++-++.|+..- .-.|++|. +. .+ ++......|+..++..+..... T Consensus 442 ~~~i~~~l~~aia~~~d~a~l~g~g~~~~~~~p~gi~~~--~~-----~~-----~~~~~~~~d~~~~~~~~~~a~~--- 506 (645) T protein:vir:93 442 DALVRNALAEAVVARLDTDFVDPKKAAVADVSPASITHD--VK-----GT-----ASSGNPDADAEAAFGQFVAANL--- 506 (645) T ss_pred HHHHHHHHHHHHHHHHHHHhhcCCCcccCCccccceecc--cc-----cc-----ccccchHHHHHHHHHHHHhcCC--- Confidence 777777788888888888888776432 12344441 10 11 0111244678778777754431 Q ss_pred eccCCcEEEecHHHHHhcccC-CCCCccHHHHHHHhCCccEEEEcccccCC-------CCceEEEEEEeeCCCceEEEEe Q lcl|Aclame:pro 224 TQEAVLHMGLPPTAMSDLSKT-NQYGLSAAAKLKEIFPKLEFVTIPEYDTA-------SGRLVQLWAPRVEGKDTATCGF 295 (336) Q Consensus 224 ~~~~p~tL~Lp~~~~~~Ls~~-~~~~~Tvl~~l~~n~pnl~i~~~pel~~a-------~G~~~~~~~~~~~~~~~~~~~~ 295 (336) .+ .--..+|.|..+..|.+- +..|.-++--+-.. +=++-..|-.... -|+-..+++-...+ ..+.+ T Consensus 507 ~~-~~a~~vmn~~~~~~L~~lkd~~G~~~~~~~~~~--~~tL~G~PV~~s~~vp~~~~~gd~s~~~ig~~~~---v~i~~ 580 (645) T protein:vir:93 507 QP-TGAVWLMSSTNALALSMRKNALGQKEYPDMTLL--GGSFQGLPVIVSQYVGDQLVLVNAPDIYLADDGG---VAVDM 580 (645) T ss_pred Cc-cccEEEEcHHHHHHHHhccccCCceeecCCCCC--CceeeceeeEEeccCCcceeEeccccEEEEEecc---eEEEe Confidence 11 112578999988888643 33332221000000 0012222221110 11111111111100 11111 Q ss_pred Cchhh------------------cccceecCCceEEeeecceeeeEEecccceeeeccC Q lcl|Aclame:pro 296 TEKMR------------------AHSIERYSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) Q Consensus 296 p~~~~------------------~l~~~~~~~~~~v~~~~rt~Gv~ir~P~ai~~~~GI 336 (336) ...-. .+-. ...-.+-+.|+.|+++.. ++|.||++++|+ T Consensus 581 s~~a~~~~~~~~~~~~~~~~~~~~v~l-f~~d~vaira~~r~d~~~-~~p~a~~~lt~~ 637 (645) T protein:vir:93 581 SREASLEMQSEPTGDSTTPSPVELVSM-FQTGSVAIRAERWINWRR-RRTAAVAVITGV 637 (645) T ss_pred ecceeEEEeecccccccccccccchhH-hhcCceEEEEEEEEccee-eCccceEEEecc Confidence 11000 0000 112235567777776654 999999999999 No 60 >protein:vir:97053 Length: 390 # NCBI annotation: putative head protein # Family: family:all:585 # MgeID: mge:1653 # MgeName: OP1 # Cross-refs: genbank:acc:YP_453565;genbank:gi:84662600;genbank:GeneID:5142468 Probab=96.77 E-value=0.00023 Score=40.56 Aligned_cols=297 Identities=12% Similarity=0.086 Sum_probs=142.5 Q ss_pred CchHH-HHHHHhhcceeccchhhhhh------h-----------------hhhhhhhhhhhhcCccccCCcc-hHHHHHH Q lcl|Aclame:pro 1 MRDAQ-RIQNLARAGVILPRSVKNVS------T-----------------PLAEYAMDAADLSPHLSSTGSS-GIPNYLT 55 (336) Q Consensus 1 m~~~~-~~~~l~~~g~~~~~~~~~~~------~-----------------~~~~~~~da~d~~~~l~t~~~~-~i~~~l~ 55 (336) +...+ .++++++.+-..+....... . ........+ ....++.+.+ -+|..+. T Consensus 54 i~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~g~lip~~~~ 130 (390) T protein:vir:97 54 VQAARQRVAELEGNGAGGDVQHVSVGDMFVASEQFQASTGRWNDRSARATMNIKAALNT---ASTDAAGSAGALTTPNRL 130 (390) T ss_pred HHHHHHHHHHHHhcccccccccccchhhhhhhHHHHHHHHHhhhhhhhhhhHHHHHHHh---hhcccccccccccchhhh Confidence 11100 01111111110000000000 0 000001110 0011111111 1233222 Q ss_pred HhhCceeeeeeccccchhhhcccccCCCcceeeEEEeeeec-ccceEEeecccCCceeeeeeeeeeeeEEEEEEEEEeCH Q lcl|Aclame:pro 56 TYVDPSVIDILVAPMKAAELVGESKKGDWTTLVAAFITAEP-TTTVATYGDYSSDGDSGTNINYPQRQSYFFQTWTRWGE 134 (336) Q Consensus 56 ~~idp~v~~~~~~~~~~~~l~~v~t~g~w~~~t~~~~v~e~-~G~a~~ygd~~DiP~vd~~~~~~~~~v~~~~~~~~y~~ 134 (336) +++++.+........++++...+. .+..|++.+. .+.+...+....+|-.+..........+.++....++. T Consensus 131 ----~~ii~~~~~~~~i~~~~~~~~~~~---~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~i~~~~~k~~~~~~is~ 203 (390) T protein:vir:97 131 ----PGFITPPDARLTVRDLIGSGRTDS---ALIEYVQETGFVNNAAIVAEGALKPESSLKFAKKTDTTHVIAHTMKATR 203 (390) T ss_pred ----HHHHHHHhhhhhhHhhcceeeccC---CceEEEEEecCCcceeeecCCccccccccceeEEEEeeeeEEEeehhhH Confidence 345555555555666666544432 2356666554 46778888888999999888888999999999888887 Q ss_pred HHHHHHHHhCCCHHHHHHHHHHHHHHHhhccEEEeecccc-ceEEEEecCCCCcccccccccccccCHHHHHHHHHHHHH Q lcl|Aclame:pro 135 RELEMAGAGRVDLASELNYSSALGLAKFLNGSYLFGVAGL-ENYGLINDPSLSAPITATTPWSGSPAVEAVVNEVVTLFQ 213 (336) Q Consensus 135 ~El~~A~~~g~~l~~~K~~aAr~a~e~~~n~i~~~Gd~~~-g~~GllN~Pnl~~~~~~~t~w~~~~T~~eI~~Di~~l~~ 213 (336) + +-.-. .++.+.-....++++.+.+++-.++|+... ...|++|.+......+ ..+.+..++||..++. T Consensus 204 e-ll~ds---~~l~~~i~~~la~a~~~~~d~a~l~G~g~~~~p~Gi~~~~~~~~~~~-------~~~~~~~~d~~~~~~~ 272 (390) T protein:vir:97 204 Q-ILSDA---PQLASYMNNRLIRGLKVKEDAEILRGTGANDGLLGLIPQATTYAAPT-------TIAGATRVDQLRLAML 272 (390) T ss_pred H-HHHhH---HHHHHHHHHHHHHHHHHHHHHHHhhcCCCCccccceeeccccccccc-------cccccchHHHHHHHHH Confidence 5 43322 257888888888999999999899998643 4789999765432211 1223345677777776 Q ss_pred HHHHHhCCceeccCCcEEEecHHHHHhccc-CCCCCccHHHHHHH----hCCccEEEEcccccCC---CCce--EEEEEE Q lcl|Aclame:pro 214 VLQTQSQGIITQEAVLHMGLPPTAMSDLSK-TNQYGLSAAAKLKE----IFPKLEFVTIPEYDTA---SGRL--VQLWAP 283 (336) Q Consensus 214 ~l~~~t~g~v~~~~p~tL~Lp~~~~~~Ls~-~~~~~~Tvl~~l~~----n~pnl~i~~~pel~~a---~G~~--~~~~~~ 283 (336) .+...- ..+..++|.|+.+..|.+ .+..|.-++.-... .+-++.++..+.+... -|.- .+.++. T Consensus 273 ~~~~~~------~~~~~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~gd~~~~~~~~~ 346 (390) T protein:vir:97 273 QASLAE------YPASGIVINPIDWAAIELAKDANNQYLIGNARGTLTPTLWGLPVVATQAMAPGEFLVGAFDLAAQIFD 346 (390) T ss_pred hhcccc------CCCCEEEEcHHHHHHHHHhhcCCCceeecCccCCCCceecceeeEEcCCCCCCcEEEEeccceEEEEE Confidence 664322 135679999999988864 34434322211000 0112222222222110 0111 122221 Q ss_pred eeCCCceEEEEe---CchhhcccceecCCceEEeeecceeeeEEecccceeeeccC Q lcl|Aclame:pro 284 RVEGKDTATCGF---TEKMRAHSIERYSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) Q Consensus 284 ~~~~~~~~~~~~---p~~~~~l~~~~~~~~~~v~~~~rt~Gv~ir~P~ai~~~~GI 336 (336) + .+ ..+.+ ...|+ + -....-+..| .|..+++|.||++.+== T Consensus 347 ~-~~---~~i~~~~~~~~f~------~-~~~~~r~~~r-~d~~v~~~~a~v~~~~a 390 (390) T protein:vir:97 347 Q-WD---ARVEIGYVNDDFQ------R-NMVTVLAEER-LALVVYRPEALITGSFA 390 (390) T ss_pred e-cc---eEEEEeecccccc------c-CcEEEEEEEe-eccEEeccccEEEEEeC Confidence 1 11 11111 11111 1 1122233334 45567788887665422 No 61 >protein:vir:96833 Length: 275 # NCBI annotation: ORF015 # Family: family:all:522 # MgeID: mge:1642 # MgeName: EW # Cross-refs: genbank:acc:YP_240157;genbank:gi:66395822;genbank:GeneID:5133174 Probab=96.73 E-value=7.4e-05 Score=43.24 Aligned_cols=257 Identities=11% Similarity=0.034 Sum_probs=131.7 Q ss_pred hhhhhhhhcCccccCCcchHHHHHHHhhCceeeeeeccccchhhhcccccC--CCcceeeEEEeeeecccceEEeecccC Q lcl|Aclame:pro 31 YAMDAADLSPHLSSTGSSGIPNYLTTYVDPSVIDILVAPMKAAELVGESKK--GDWTTLVAAFITAEPTTTVATYGDYSS 108 (336) Q Consensus 31 ~~~da~d~~~~l~t~~~~~i~~~l~~~idp~v~~~~~~~~~~~~l~~v~t~--g~w~~~t~~~~v~e~~G~a~~ygd~~D 108 (336) |||= + .+.-++--+|+.+..||..++ ...+....|..+.+. |.- -.++.++.++..|.+..|.++++ T Consensus 1 ~~~~-~-----~T~l~d~i~PEv~~~~v~~~~----~~~~~~~~~~~~~~~l~g~~-G~tv~iP~~~~ig~a~~~~~g~~ 69 (275) T protein:vir:96 1 MALE-N-----MTKLANMVNPEVLAPMMQAEL----DKKLKFAQFADIDNTLVGQP-GNTITFPAFVYSGDAKVVPEGEE 69 (275) T ss_pred CCCc-c-----cchhhhhhchHHHHHHHHHHH----HHhhhhcccceecccccCCC-CCEEEeeeeccCCccccccCCCC Confidence 6552 2 355667777999998885554 334444555544332 322 36799999999999999999999 Q ss_pred CceeeeeeeeeeeeEEEEEEEEEeCHHHHHHHHHhCCCHHHHHHHHHHHHHHHhhccEEEeeccccceEEEEecCCCCcc Q lcl|Aclame:pro 109 DGDSGTNINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASELNYSSALGLAKFLNGSYLFGVAGLENYGLINDPSLSAP 188 (336) Q Consensus 109 iP~vd~~~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~K~~aAr~a~e~~~n~i~~~Gd~~~g~~GllN~Pnl~~~ 188 (336) ++..+.........+...+-+++++. +.+.+. +-++..+-...+..++.+.+++..+ ..++.-.+ T Consensus 70 i~~~~lt~~~~~~~i~~~~~~~~i~D--~~~~~~-~~d~~~~~~~~~a~~~a~~~d~~ll---------~~l~~a~~--- 134 (275) T protein:vir:96 70 IPIDLIETKKRQATIRKIGKGTVLTD--EALLSG-YGDPKGEAVRQHGLAIANKVDNDVL---------EALQGATL--- 134 (275) T ss_pred cchhhcccceeeEEeehhcccccccH--HHHHhh-ccchHHHHHHHHHHHHHHHHHHHHH---------HHHhcccc--- Confidence 99999998888888888766665555 444444 3355555555666667666665432 11111001 Q ss_pred cccccccccccCHHHHHHHHHHHHHHHHHHhCCceeccCCcEEEecHHHHHhcccCC--------CCCccHHH-HHHHhC Q lcl|Aclame:pro 189 ITATTPWSGSPAVEAVVNEVVTLFQVLQTQSQGIITQEAVLHMGLPPTAMSDLSKTN--------QYGLSAAA-KLKEIF 259 (336) Q Consensus 189 ~~~~t~w~~~~T~~eI~~Di~~l~~~l~~~t~g~v~~~~p~tL~Lp~~~~~~Ls~~~--------~~~~Tvl~-~l~~n~ 259 (336) +.++ +..+.+ .|.+++..+-.. ...+..|+++|..+..|.+-. ..+..++. =.--.| T Consensus 135 -~~~~---~~~~~d----~i~dA~~~lgd~------~~~~~~ivv~p~~~~~L~k~~~~~f~~~~~~g~~~~~~G~ig~~ 200 (275) T protein:vir:96 135 -KVEA---DITKLA----GLQTAIDKFNDE------DLEPMVLFVNPLDAGKLRASATDNFTRATLLGDNVIVKGAFGEA 200 (275) T ss_pred -cccc---cccCHH----HHHHHHHHhccc------cCCccEEEeCHHHHHHHHhcccccccccccccccceecccccee Confidence 1111 122333 444444444211 135779999999999884421 11111100 000012 Q ss_pred CccEEEEcccccCCCCceEEEEEEeeCCCceEEEEeCchhhcccceecCCceEEe-eecceeeeEEecccceeee----- Q lcl|Aclame:pro 260 PKLEFVTIPEYDTASGRLVQLWAPRVEGKDTATCGFTEKMRAHSIERYSSYFRQK-KSAGTWGAVIFRPFAVAQM----- 333 (336) Q Consensus 260 pnl~i~~~pel~~a~G~~~~~~~~~~~~~~~~~~~~p~~~~~l~~~~~~~~~~v~-~~~rt~Gv~ir~P~ai~~~----- 333 (336) =+++|+....+. -..++++- +.-+......+... ......+...-- ..-..+|+-+.+|-+++.+ T Consensus 201 ~G~~Vi~s~~~p---~~t~~i~~-----~gA~~~~~~~~~~v-E~~Rd~~~~~d~i~~~~~y~~~~~~~~~vv~~t~~~~ 271 (275) T protein:vir:96 201 LGAIIVRSNKIK---EGEAILAK-----RGAVKLITKRDFFL-ETERHASHKSTALFSDKHYVAYLYDESKVVKITKSAS 271 (275) T ss_pred cCeeEEEeCCCC---cceEEEEe-----ccceeeeecCCccc-ccccchhhcCcEEEEeEEEEEEEEcCccEEEEEeccc Confidence 234544333321 11223221 11111111111110 111111111111 1124568889999888775 Q ss_pred -ccC Q lcl|Aclame:pro 334 -IGV 336 (336) Q Consensus 334 -~GI 336 (336) +|+ T Consensus 272 ~~~~ 275 (275) T protein:vir:96 272 GLGV 275 (275) T ss_pred ccCC Confidence 344 No 62 >protein:vir:2344 Length: 397 # NCBI annotation: gp14 # Family: family:all:507 # MgeID: mge:51 # MgeName: Bxb1 # Cross-refs: genbank:acc:NP_075281;genbank:gi:12657868;genbank:GeneID:920118 Probab=96.70 E-value=0.00029 Score=39.99 Aligned_cols=278 Identities=12% Similarity=0.035 Sum_probs=141.1 Q ss_pred cceeccchhhhhhhhhhhhhhhhhhhcCccccCCcchHHHHHHHhhCceeeeeeccccchhhhcccccCCCcceeeEEEe Q lcl|Aclame:pro 13 AGVILPRSVKNVSTPLAEYAMDAADLSPHLSSTGSSGIPNYLTTYVDPSVIDILVAPMKAAELVGESKKGDWTTLVAAFI 92 (336) Q Consensus 13 ~g~~~~~~~~~~~~~~~~~~~da~d~~~~l~t~~~~~i~~~l~~~idp~v~~~~~~~~~~~~l~~v~t~g~w~~~t~~~~ 92 (336) .|+. .+.+.++.-.... .+.--+|.+....| +.+.......+++.+.+.+. .+..|+ T Consensus 1 ~g~~---------~e~~~~~~~~t~~------~~g~l~~~~~~~ii-----~~l~~~s~i~~l~~~~~~~~---~~~~ip 57 (397) T protein:vir:23 1 MGFS---------ADHSQIAQTKDTM------FTGYLDPVQAKDYF-----AEAEKTSIVQRVAQKIPMGA---TGIVIP 57 (397) T ss_pred CCcC---------HHHHHHhhccCCC------CccccchhHHHHHH-----HHHHhccchhhhcceeeccC---CceEEE Confidence 2221 1122222111101 11112344444333 33333344455555443322 346788 Q ss_pred eeecccceEEeecccCCceeeeeeeeeeeeEEEEEEEEEeCHHHHHHHHHhCCCHHHHHHHHHHHHHHHhhccEEEeecc Q lcl|Aclame:pro 93 TAEPTTTVATYGDYSSDGDSGTNINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASELNYSSALGLAKFLNGSYLFGVA 172 (336) Q Consensus 93 v~e~~G~a~~ygd~~DiP~vd~~~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~K~~aAr~a~e~~~n~i~~~Gd~ 172 (336) +.+....+..++....+|..+..........+.++..+.+|.+=++.+ ..++.+.-+...++++.+.+++-+++|+. T Consensus 58 ~~~~~~~a~wv~Eg~~~~~s~~~f~~v~l~~~k~~~~v~iS~ell~ds---~~~l~~~i~~~l~~aia~~~d~a~l~G~g 134 (397) T protein:vir:23 58 HWTGDVSAQWIGEGDMKPITKGNMTKRDVHPAKIATIFVASAETVRAN---PANYLGTMRTKVATAIAMAFDNAALHGTN 134 (397) T ss_pred EEcCCcceEEecCCccccccccceeEEEEeeEEEEEeehhhHHHHhcc---hHHHHHHHHHHHHHHHHHHHHHHHhhccc Confidence 888888889999999999999988888999999999999987655544 37789999999999999999999999986 Q ss_pred c-cceEEEEecCCCCcccccccccccccCHHHHHHHHHHHHHHHHHHhCCceeccCCcEEEecHHHHHhccc-CCCCCcc Q lcl|Aclame:pro 173 G-LENYGLINDPSLSAPITATTPWSGSPAVEAVVNEVVTLFQVLQTQSQGIITQEAVLHMGLPPTAMSDLSK-TNQYGLS 250 (336) Q Consensus 173 ~-~g~~GllN~Pnl~~~~~~~t~w~~~~T~~eI~~Di~~l~~~l~~~t~g~v~~~~p~tL~Lp~~~~~~Ls~-~~~~~~T 250 (336) . .++-|+.+..+... .+.... ..+|+..++..+...- ..+..++|.+..+..|.+ .+..|.. T Consensus 135 t~~~~~~~~~~~~~~~------~~~~~~----~~~~~~~~~~~l~~~~------~~~a~~vmn~~~~~~L~~lkd~~G~~ 198 (397) T protein:vir:23 135 APSAFQGYLDQSNKTQ------SISPNA----YQGLGVSGLTKLVTDG------KKWTHTLLDDTVEPVLNGSVDANGRP 198 (397) T ss_pred CCccccccccccccee------eecccc----hhHHHHHHHHhhhhcc------cCCCEEEEcHHHHHHHHHhhccCCce Confidence 4 33444444322211 111112 2334444444443221 134679999999888864 3333433 Q ss_pred HHHHHHHh-CC----ccEEEEcccccC---CCCceE-------EEEEEeeCCCceEEEEeCchh-hcccceec------- Q lcl|Aclame:pro 251 AAAKLKEI-FP----KLEFVTIPEYDT---ASGRLV-------QLWAPRVEGKDTATCGFTEKM-RAHSIERY------- 307 (336) Q Consensus 251 vl~~l~~n-~p----nl~i~~~pel~~---a~G~~~-------~~~~~~~~~~~~~~~~~p~~~-~~l~~~~~------- 307 (336) ++.=-..+ .| .-++...|-.-. ..|... .+++..+.+ ..+.+.... ........ T Consensus 199 i~~~~~~~~~~~~~~~~tl~G~Pv~~s~~~~~g~~~~~~gDfs~~~i~~~~~---i~i~~~~e~~~~~~~~~~~~~~~lf 275 (397) T protein:vir:23 199 LFVESTYESLTTPFREGRILGRPTILSDHVAEGDVVGYAGDFSQIIWGQVGG---LSFDVTDQATLNLGSQESPNFVSLW 275 (397) T ss_pred eecccccccccccccCceeeeeeEEEeCCCCCCceEEEEeecceEEEEEEec---eEEEEeeeeeeeeccccccceeeee Confidence 32110011 11 113333332211 122221 111111111 111111100 00000000 Q ss_pred -CCceEEeeecceeeeEEecccceeeeccC Q lcl|Aclame:pro 308 -SSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) Q Consensus 308 -~~~~~v~~~~rt~Gv~ir~P~ai~~~~GI 336 (336) .-....-+..|+ ++.+++|.+|+++.+- T Consensus 276 ~~d~v~~ra~~r~-d~~v~~~~a~~~~~~~ 304 (397) T protein:vir:23 276 QHNLVAVRVEAEY-GLLINDVNAFVKLTFD 304 (397) T ss_pred eccceeEEEEeee-ccceecccceEEEeec Confidence 011233334444 4588899999999986 No 63 >protein:vir:9410 Length: 415 # NCBI annotation: head protein # Family: family:all:21 # MgeID: mge:167 # MgeName: phi 13 # Cross-refs: genbank:acc:NP_803388;genbank:gi:29028700;genbank:GeneID:1258136 Probab=96.63 E-value=0.00046 Score=38.90 Aligned_cols=309 Identities=9% Similarity=0.033 Sum_probs=137.5 Q ss_pred CchHHHHH----------------------HHhhcceeccchhhhhhhhhhhh-----hhhhhhhcCccccCCcchHHHH Q lcl|Aclame:pro 1 MRDAQRIQ----------------------NLARAGVILPRSVKNVSTPLAEY-----AMDAADLSPHLSSTGSSGIPNY 53 (336) Q Consensus 1 m~~~~~~~----------------------~l~~~g~~~~~~~~~~~~~~~~~-----~~da~d~~~~l~t~~~~~i~~~ 53 (336) ++...+.. .....+..+... .....+.+.+ ......+...-++.....||.. T Consensus 58 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~e~~~~~~~~~~~~~~~~~~~~~~~g~~~iP~~ 136 (415) T protein:vir:94 58 LDKLKEKDGTSENNQQSVEVNEASTYRNQANINDLGISIQNT-KVTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEE 136 (415) T ss_pred HHHHHHHHHhhhhccccccccchhhHHHHHHHHHHHhhhhhh-hhhHHHHHHHHHHhhhhhhhhhhccccccccccCcHH Confidence 00000000 000000000000 0000011110 0000011111111222335543 Q ss_pred HHHhhCceeeeeeccccchhhhcccccCCCcceeeEEEeeeecccceEEeecccCCcee-eeeeeeeeeeEEEEEEEEEe Q lcl|Aclame:pro 54 LTTYVDPSVIDILVAPMKAAELVGESKKGDWTTLVAAFITAEPTTTVATYGDYSSDGDS-GTNINYPQRQSYFFQTWTRW 132 (336) Q Consensus 54 l~~~idp~v~~~~~~~~~~~~l~~v~t~g~w~~~t~~~~v~e~~G~a~~ygd~~DiP~v-d~~~~~~~~~v~~~~~~~~y 132 (336) +. +++++.+........++++..... ....+.+......+.+...+.+.++|-. ..........++.++..+.+ T Consensus 137 ~~----~~ii~~~~~~~~l~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~v~Eg~~~~~~~~~~~~~i~~~~~k~~~~~~i 211 (415) T protein:vir:94 137 IV----TDILKLKEVEFNLDKYVTVKRVTN-GSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQLAYDINTHRGYFRI 211 (415) T ss_pred HH----HHHHHHHHhhhhhhhhcceeeccC-CceeEEEEeecCCccceeccccccccccccccceeeEeeheeeeeechh Confidence 33 456666666667777766644321 1122333333445567777888888854 45788888889999988888 Q ss_pred CHHHHHHHHHhCCCHHHHHHHHHHHHHHHhhccEEEeeccccceEEEEec-CCCCcccccccccccccCHHHHHHHHHHH Q lcl|Aclame:pro 133 GERELEMAGAGRVDLASELNYSSALGLAKFLNGSYLFGVAGLENYGLIND-PSLSAPITATTPWSGSPAVEAVVNEVVTL 211 (336) Q Consensus 133 ~~~El~~A~~~g~~l~~~K~~aAr~a~e~~~n~i~~~Gd~~~g~~GllN~-Pnl~~~~~~~t~w~~~~T~~eI~~Di~~l 211 (336) |.+=+. ....++.+.-....++++.+.+|+-++.|+......+.... ..... +.. .++ ..-++||.++ T Consensus 212 s~ell~---ds~~~~~~~i~~~l~~~~~~~~~~~il~g~g~g~~~~~~~~~~~~~~--~~~-----~~~-~~~~~~i~~~ 280 (415) T protein:vir:94 212 SREAIE---DAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGFEKEGK--KLE-----VKK-AKSLDDIKDA 280 (415) T ss_pred hHHHHh---hchHHHHHHHHHHHHHHHHHHHHHHHhhccccCcccccccccccccc--ccc-----ccc-ccchHHHHHH Confidence 865333 33467888888888888888888888887754322222221 11111 111 111 1236677777 Q ss_pred HHHHHHHhCCceeccCCcEEEecHHHHHhccc-CCCCCccHHH-HHHHh----CCccEEEEccccc-CCCCceEEEEEEe Q lcl|Aclame:pro 212 FQVLQTQSQGIITQEAVLHMGLPPTAMSDLSK-TNQYGLSAAA-KLKEI----FPKLEFVTIPEYD-TASGRLVQLWAPR 284 (336) Q Consensus 212 ~~~l~~~t~g~v~~~~p~tL~Lp~~~~~~Ls~-~~~~~~Tvl~-~l~~n----~pnl~i~~~pel~-~a~G~~~~~~~~~ 284 (336) +..+... + -.+..++|.++.+..|.+ .+..|.-++. -+... +-++.++..+.+- ++.|....++.+- T Consensus 281 ~~~~~~~--~----~~~~~~vmn~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~i~~gd~ 354 (415) T protein:vir:94 281 INLNVKP--N----YEHNVAIVSQTMFAKLDKMKDKLGNYLIQPDVKEKTQQRLLGAKIEILPDEVLGQKGNNTLIIGNL 354 (415) T ss_pred HHhhhhh--c----cCCCEEEEcHHHHHHHHHhhccCCCeeeccCcCCCCCceecceeeEEecccccCCCCccEEEEEeh Confidence 7766432 1 136689999999998864 3443433321 01110 1112333333332 2233333333221 Q ss_pred eCCCceEEEEeCchhhcccceecCCceEEeeecceeeeEEecccceeeeccC Q lcl|Aclame:pro 285 VEGKDTATCGFTEKMRAHSIERYSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) Q Consensus 285 ~~~~~~~~~~~p~~~~~l~~~~~~~~~~v~~~~rt~Gv~ir~P~ai~~~~GI 336 (336) . +...+..-+.++..-..........-...|. |+.+.+|.||+++.-- T Consensus 355 ~---~~~~~~~~~~~~v~~~~~~~~~~~~r~~~r~-d~~~~~~~a~~~~~~~ 402 (415) T protein:vir:94 355 K---DAIVLFDRSQYQASWTDYMHFGECLMIAVRQ-DCRILDYKSAIVIEYD 402 (415) T ss_pred h---ccEEEEeecceEEEEeccccCceEEEEEEEe-ccEEeccccEEEEEEe Confidence 0 0000000011111000000111112234454 5666789999988644 No 64 >protein:vir:96123 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1602 # MgeName: 37 # Cross-refs: genbank:acc:YP_240078;genbank:gi:66395742;genbank:GeneID:5133103 Probab=96.46 E-value=0.00041 Score=39.18 Aligned_cols=256 Identities=10% Similarity=0.025 Sum_probs=132.1 Q ss_pred hhhhhhhhcCccccCCcchHHHHHHHhhCceeeeeeccccchhhhcccccC--CCcceeeEEEeeeecccceEEeecccC Q lcl|Aclame:pro 31 YAMDAADLSPHLSSTGSSGIPNYLTTYVDPSVIDILVAPMKAAELVGESKK--GDWTTLVAAFITAEPTTTVATYGDYSS 108 (336) Q Consensus 31 ~~~da~d~~~~l~t~~~~~i~~~l~~~idp~v~~~~~~~~~~~~l~~v~t~--g~w~~~t~~~~v~e~~G~a~~ygd~~D 108 (336) || + .-++-++--+|..+..|+..++ ...+....+..+.+. |..+ .++.++.+...|.+..|.++++ T Consensus 1 ma---~----~~T~~~d~i~Pev~s~~v~~~~----~~~~~~~~~~~~~~~l~g~~G-~tv~ip~~~~~g~~~~~~~g~~ 68 (274) T protein:vir:96 1 MA---Q----GTTKVSNLIVPEVLAPMMQAEL----DKKLRFAQFADIDSTLVGQPG-DTLTFPAFTYSGDAQVIAEGEK 68 (274) T ss_pred CC---c----cccchhhhhhhHHHHHHHHHHH----HhhhhhcccccccccccCCCC-CEEEEEeeccCCCccccCCCCc Confidence 11 1 0133356678888888885443 444455566655442 2222 5799999999999999999999 Q ss_pred CceeeeeeeeeeeeEEEEEEEEEeCHHHHHHHHHhCCCHHHHHHHHHHHHHHHhhccEEEeeccccceEEEEecCCCCcc Q lcl|Aclame:pro 109 DGDSGTNINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASELNYSSALGLAKFLNGSYLFGVAGLENYGLINDPSLSAP 188 (336) Q Consensus 109 iP~vd~~~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~K~~aAr~a~e~~~n~i~~~Gd~~~g~~GllN~Pnl~~~ 188 (336) +|..++........+...+-+++++. +.+++ .+.++..+....+..++.+.+++..+- .++.- + T Consensus 69 i~~~~it~~~~~~~i~~~~~~~~i~D--~~~~~-~~~d~~~~~~~~~~~~~a~~~d~~i~~---------~l~~a--~-- 132 (274) T protein:vir:96 69 IPVDQIGTSKREAKVRKIGKGTELTD--EAVLS-GFGDPQGEAVRQHGLAIANKVDNDVLE---------ALKGA--T-- 132 (274) T ss_pred CchhhcccceeEEEEEeeeceeeecH--HHHHh-hcchHHHHHHHHHHHHHHHHHHHHHHH---------HHhcC--C-- Confidence 99999999988888888766666654 44444 455666777777777787777764331 11110 0 Q ss_pred cccccccccccCHHHHHHHHHHHHHHHHHHhCCceeccCCcEEEecHHHHHhcccCC--------CCCccHH-HHHHHhC Q lcl|Aclame:pro 189 ITATTPWSGSPAVEAVVNEVVTLFQVLQTQSQGIITQEAVLHMGLPPTAMSDLSKTN--------QYGLSAA-AKLKEIF 259 (336) Q Consensus 189 ~~~~t~w~~~~T~~eI~~Di~~l~~~l~~~t~g~v~~~~p~tL~Lp~~~~~~Ls~~~--------~~~~Tvl-~~l~~n~ 259 (336) .+.++ ++.+ ++.|.++...+-.. ...+..|+++|..+..|.+-+ ..|..++ .-.--+| T Consensus 133 ~~~~~---~~~~----~d~i~dA~~~l~d~------~~~~~~ivv~p~~~~~L~k~~~~~f~~~~~~g~~~~~~g~ig~~ 199 (274) T protein:vir:96 133 LTVEA---DITK----LDGLQTAIDKFNDE------DLEPMVLFVNPLDAGGLRTSASDNFTRPTQLGDNIIVKGAFGEA 199 (274) T ss_pred CCcCc---cccc----HHHHHHHHHHhccc------CCCceEEEeCHHHHHHHHhcccccccccccccccceeeccccee Confidence 01111 1123 34445554444221 124678999999999885421 1111100 0000012 Q ss_pred CccEEEEcccccCCCCceEEEEEEeeCCCceEEEEeCchhhcccceecCCceEEeeec-ceeeeEEecccceeeeccC Q lcl|Aclame:pro 260 PKLEFVTIPEYDTASGRLVQLWAPRVEGKDTATCGFTEKMRAHSIERYSSYFRQKKSA-GTWGAVIFRPFAVAQMIGV 336 (336) Q Consensus 260 pnl~i~~~pel~~a~G~~~~~~~~~~~~~~~~~~~~p~~~~~l~~~~~~~~~~v~~~~-rt~Gv~ir~P~ai~~~~GI 336 (336) -+++|.....+- -..++++- +.-+......+... ..+...+...-.... ...|+-+.+|-+++.+.== T Consensus 200 ~G~~Vi~s~~~p---~~t~~l~~-----~gA~~~~~~~~~~v-E~~Rd~~~~~d~i~~~~~yg~~~~~~~~vv~~t~~ 268 (274) T protein:vir:96 200 LGAVIVRSNKLN---KGEALLAK-----KGAVKLITKRDFFL-EKDRDASRKSTALYSDKHYVAYLYDESKVVKITKG 268 (274) T ss_pred cCeeEEEcCCCC---cceEEEEe-----CcceeeeecCCccc-ccccchhhcccEEEEeeEEEEEEEcCccEEEEEcC Confidence 234444322221 11222221 11111111111110 011111111111111 3578999999877766544 No 65 >protein:vir:96762 Length: 632 # NCBI annotation: putative phage-related protein # Family: family:all:21 # MgeID: mge:1628 # MgeName: VP882 # Cross-refs: genbank:acc:YP_001039818;genbank:gi:126010917;genbank:GeneID:5076272 Probab=96.44 E-value=0.00053 Score=38.55 Aligned_cols=304 Identities=14% Similarity=0.040 Sum_probs=138.5 Q ss_pred CchHH---HHHHHhh-----cceeccchh---hhhhhhhh--hhhhhhhhhcCcc--ccCCcch--HH-HHHHHhhCcee Q lcl|Aclame:pro 1 MRDAQ---RIQNLAR-----AGVILPRSV---KNVSTPLA--EYAMDAADLSPHL--SSTGSSG--IP-NYLTTYVDPSV 62 (336) Q Consensus 1 m~~~~---~~~~l~~-----~g~~~~~~~---~~~~~~~~--~~~~da~d~~~~l--~t~~~~~--i~-~~l~~~idp~v 62 (336) |.... .++.++. .++..-.+. .......+ .+.+++. ....+ .|++++| +| .++.. ++ T Consensus 304 ~~~~~l~rai~a~a~~~~~~a~~~~e~a~~~a~~~G~~arg~~~~~~~l-~~ra~~~~t~~~gg~lvp~~~~~~----~i 378 (632) T protein:vir:96 304 LQQYSLMRAINAAATGDWSKAGFEREVSLAIADASGKEARGFYMPHEVL-VQRQLEKKTAGKGGELVATELLSE----EF 378 (632) T ss_pred HHHHHHHHHHHhhhccchhhhhhhhHHHHHHHHhhhhhhhhhhhhHHHH-HHhhhhcccccccccccccccchH----HH Confidence 11111 1111110 000000000 00000000 0111110 00111 1112221 23 33321 22 Q ss_pred eeeeccccchhhhcccccCCCcceeeEEEeeeecccceEEeecccCCceeeeeeeeeeeeEEEEEEEEEeCHHHHHHHHH Q lcl|Aclame:pro 63 IDILVAPMKAAELVGESKKGDWTTLVAAFITAEPTTTVATYGDYSSDGDSGTNINYPQRQSYFFQTWTRWGERELEMAGA 142 (336) Q Consensus 63 ~~~~~~~~~~~~l~~v~t~g~w~~~t~~~~v~e~~G~a~~ygd~~DiP~vd~~~~~~~~~v~~~~~~~~y~~~El~~A~~ 142 (336) ++.+.+..-+..+ +...... ....+.++.....+.+...|-...+|..+...+..+-..+.++..+.+|.+=|..+ T Consensus 379 ie~lr~~s~i~~l-~~~~~~~-~~g~~~ip~~~~~~~a~wv~E~~~~~~s~~~f~~i~l~~~k~~~~v~iS~ell~ds-- 454 (632) T protein:vir:96 379 IDILRNKAIIGQM-GARMLPG-LVGDVDIPKKTSGANFYWIGEDEDVQDSDFDFTTLSFSPKTIAGAVPVTRKLRKQS-- 454 (632) T ss_pred HHHHhhcchhhhh-cceEeec-CCcceEEEEEeCCceeEeecCCccccccccceeeEEeeeeEEEEehhhHHHHHhcc-- Confidence 3333332223332 2211110 12346788888777888888888899988888888888888988888876544433 Q ss_pred hCCCHHHHHHHHHHHHHHHhhccEEEeecc-ccceEEEEecCCCCcccccccccccccCHHHHHHHHHHHHHHHHHHhCC Q lcl|Aclame:pro 143 GRVDLASELNYSSALGLAKFLNGSYLFGVA-GLENYGLINDPSLSAPITATTPWSGSPAVEAVVNEVVTLFQVLQTQSQG 221 (336) Q Consensus 143 ~g~~l~~~K~~aAr~a~e~~~n~i~~~Gd~-~~g~~GllN~Pnl~~~~~~~t~w~~~~T~~eI~~Di~~l~~~l~~~t~g 221 (336) ..++.+.-......++.+.+++-+++|++ +....|++|...++....++ +..+ ++||.++...+..... T Consensus 455 -~~~~~~~i~~~l~~a~~~~~d~a~l~G~G~~~~p~Gi~~~~~~~~~~~~~----~~~~----~~~i~~~~~~i~~~~~- 524 (632) T protein:vir:96 455 -SIHVENLIREDLIEGIGVALDLAMLTGTGLANDPVGLLNMTGVPALTYPA----GGVD----WASVVDMETKISTFNA- 524 (632) T ss_pred -chHHHHHHHHHHHHHHHHHHHHHhhcccCCCCccceeeecccccceeccc----ccCC----HHHHHHHHHHHhhccc- Confidence 56777877788888888889988899987 45678999987665422211 1122 3456666655544321 Q ss_pred ceeccCCcEEEecHHHHHhccc---CCCCCccHHH--HHHHhCCccEEEEcccccCCCCceEEEEEEeeCCCceEEEEeC Q lcl|Aclame:pro 222 IITQEAVLHMGLPPTAMSDLSK---TNQYGLSAAA--KLKEIFPKLEFVTIPEYDTASGRLVQLWAPRVEGKDTATCGFT 296 (336) Q Consensus 222 ~v~~~~p~tL~Lp~~~~~~Ls~---~~~~~~Tvl~--~l~~n~pnl~i~~~pel~~a~G~~~~~~~~~~~~~~~~~~~~p 296 (336) +. .....+|.+.....|.. .+..|.-+++ .| .-||-+.-..+|.-...-|+-...++..+.+-... .-| T Consensus 525 --~~-~~~~~~~~~~~~~~l~~~~l~d~~G~~i~~~~~l-~G~pv~~s~~ip~~~~~~gd~s~~~i~~~~~~~i~--~~~ 598 (632) T protein:vir:96 525 --DA-GRLAYLTSVTQRGAAKKAQVFDNTGERIWQNNEV-NGYRAEASNQIPADTWIFGDWSQIVIAMWGVLDLK--VDP 598 (632) T ss_pred --cc-CccEEEEchhHHHHHHHHhccCCCCceeecCCee-cccceEeccccccCcEEEeecceEEEEEecceEEE--Ecc Confidence 11 23457888777666642 2333332321 00 01332222233332222243333333332221111 111 Q ss_pred chhhcccceecCCceEEeeecceeeeEEecccceeeeccC Q lcl|Aclame:pro 297 EKMRAHSIERYSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) Q Consensus 297 ~~~~~l~~~~~~~~~~v~~~~rt~Gv~ir~P~ai~~~~GI 336 (336) ... .......+-+..+ .++-+++|.+|+...== T Consensus 599 ~~~------~~~~~v~~~~~~~-~d~~v~~~~af~~~k~~ 631 (632) T protein:vir:96 599 YTK------AASDGLVLRVFQD-VDAGVRRKEAFCIAKKG 631 (632) T ss_pred ccc------cccCceEEEEEee-cCceeechhhhhheeec Confidence 110 1112223333444 34577888877643322 No 66 >protein:vir:4159 Length: 315 # NCBI annotation: structural protein # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:87 # MgeName: psiM2 # Cross-refs: genbank:acc:NP_046968;genbank:gi:9630538;genbank:GeneID:1261712 Probab=96.38 E-value=0.00068 Score=37.95 Aligned_cols=299 Identities=10% Similarity=0.034 Sum_probs=137.1 Q ss_pred eeccchhhhhhhhhhhhhhhhhhhcCccccCCcchHHHHHHHhhCceeeeeeccccchhhhcccccCCCccee--eEEEe Q lcl|Aclame:pro 15 VILPRSVKNVSTPLAEYAMDAADLSPHLSSTGSSGIPNYLTTYVDPSVIDILVAPMKAAELVGESKKGDWTTL--VAAFI 92 (336) Q Consensus 15 ~~~~~~~~~~~~~~~~~~~da~d~~~~l~t~~~~~i~~~l~~~idp~v~~~~~~~~~~~~l~~v~t~g~w~~~--t~~~~ 92 (336) +--+.+...-......-++...|.+++.. .|.+++.+|+ .+.+. -+-++....+.. .+++.-+ .+-+. T Consensus 1 ~~~~~~~~~~~~~~~~k~~t~~d~~Gg~l------~P~~~~~~i~-~~~e~-s~~l~~~~vi~~--~~~~~~~i~~~g~~ 70 (315) T protein:vir:41 1 MLTIEDIRGGKPFEIVPKIDVPDLGRGVL------SVDRFGEFVK-AVRDS-AVIIPEARIDNA--LKSYEKDISRLSLV 70 (315) T ss_pred CcccchhhcCChhhhhhhcCCcCCCCcee------chHHHHHHHH-HHHhh-hhhhhhceeeec--cccccccccccccC Confidence 11111110000000111222333333322 3566666663 34331 222333332211 1111100 00010 Q ss_pred eeecccceEEeecccCCceeeeeeeeeeeeEEEEEEEEEeCHHHHHHHHHhCCCHHHHHHHHHHHHHHHhhccEEEeecc Q lcl|Aclame:pro 93 TAEPTTTVATYGDYSSDGDSGTNINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASELNYSSALGLAKFLNGSYLFGVA 172 (336) Q Consensus 93 v~e~~G~a~~ygd~~DiP~vd~~~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~K~~aAr~a~e~~~n~i~~~Gd~ 172 (336) ..-..| +...|...+.|..+..........+.+..-...+.+.|. -.+-+.++.+.-......++.+.+....+.||+ T Consensus 71 ~~~~~g-~~~~~~~~~~~~~~~~f~~~~l~~~~l~~~~~it~elL~-D~~~~~~~e~~l~~~~a~~~a~~~~~~~~nGdg 148 (315) T protein:vir:41 71 LDVGPG-RDETGQKLAPPESTAEVKTNTLYMREMVTKVVIHEDAIE-DNIEGKAFEQKIVTLLGEGISYVLEKYYLHGDT 148 (315) T ss_pred cccccc-cccccCcCCCCCCccccceeeeceeeeeeeccccHHHHH-hhhccccHHHHHHHHHHHHHHHHHHHHhhccCC Confidence 000001 112233344444444444444444444444455555555 344578999999999999999999999999997 Q ss_pred c------cceEEEEecCCCCcccccccccccccCHHHHHHHHHHHHHHHHHHhCCceeccCCcEEEecHHHHHhccc-CC Q lcl|Aclame:pro 173 G------LENYGLINDPSLSAPITATTPWSGSPAVEAVVNEVVTLFQVLQTQSQGIITQEAVLHMGLPPTAMSDLSK-TN 245 (336) Q Consensus 173 ~------~g~~GllN~Pnl~~~~~~~t~w~~~~T~~eI~~Di~~l~~~l~~~t~g~v~~~~p~tL~Lp~~~~~~Ls~-~~ 245 (336) . ....|+|+.....+..+ ...+.+...+.+.+.|+...+..-..+.. .....+|+.+.+..+.+ .+ T Consensus 149 ~s~~p~~~~~~G~l~~a~~~~~~~-~~~~~a~~~~~d~l~~l~~sl~~~yr~~~------~~~~~imn~~t~~~~rklk~ 221 (315) T protein:vir:41 149 SSSDPLLRMSDGWLKLASEKLTES-DVDPEAEDWPMNLFDTMIESLPTPYRNNL------PNMKFYVTWDIYRAYRDALK 221 (315) T ss_pred cCcCccccccccceeccccccccc-ccccccccccHHHHHHHHHhcChHHhhcC------CceEEEEcHHHHHHHHHHhc Confidence 4 45679998664432211 22222222233344444433332222211 12368888877765532 12 Q ss_pred CCCccHHHHH--HH---hCCccEEEEcccccCC-CCceEEEEEEeeCCCceEEEEeCchhhcccc-eecCCceEEeeecc Q lcl|Aclame:pro 246 QYGLSAAAKL--KE---IFPKLEFVTIPEYDTA-SGRLVQLWAPRVEGKDTATCGFTEKMRAHSI-ERYSSYFRQKKSAG 318 (336) Q Consensus 246 ~~~~Tvl~~l--~~---n~pnl~i~~~pel~~a-~G~~~~~~~~~~~~~~~~~~~~p~~~~~l~~-~~~~~~~~v~~~~r 318 (336) .-|.-+++=. .. .+-+..++.+|.+... .+....++.+. +...+.+-..++.++- ..+...+..-.+.| T Consensus 222 ~~g~~lw~~~~~~g~~~tl~G~PV~~~~~m~~~~~~~~~ilf~d~----~nl~~~~~~~i~i~~~~~a~~~~~~~~~~~r 297 (315) T protein:vir:41 222 GRETGLGDQALTGANSILYDGRPVQYVPALEALNDGKSRALFVVP----TQLVYGFWRNIKVVPDYDAEMRLTKYVASLR 297 (315) T ss_pred cCCCccccchhhcCCCceecccceEecccccccCCCCccEEEecc----cceEEEeccccEEEeeecCCCCceEEEEEEE Confidence 2222222211 11 1223456677777554 34445555432 1122333334444432 22223355555678 Q ss_pred eeeeEEecccceeeeccC Q lcl|Aclame:pro 319 TWGAVIFRPFAVAQMIGV 336 (336) Q Consensus 319 t~Gv~ir~P~ai~~~~GI 336 (336) .+|-.+-...+++....| T Consensus 298 ~d~~~~~~~~~a~~~~~v 315 (315) T protein:vir:41 298 TDNHYEDEEGAVSATITV 315 (315) T ss_pred eceeEEeccceeEeeeeC Confidence 877666688999999999 No 67 >protein:vir:6212 Length: 434 # NCBI annotation: prohead protease # Family: family:all:21 # MgeID: mge:128 # MgeName: phBC6A52 # Cross-refs: genbank:acc:NP_852592;genbank:gi:31415852;genbank:GeneID:1489210 Probab=96.29 E-value=0.00049 Score=38.72 Aligned_cols=308 Identities=11% Similarity=0.037 Sum_probs=145.5 Q ss_pred CchHHH-HHHHhhcceeccchhhhhhhh----hhhh---hhhhhhhcCccc-cCCcch--HHHHHHHhhCceeeeeeccc Q lcl|Aclame:pro 1 MRDAQR-IQNLARAGVILPRSVKNVSTP----LAEY---AMDAADLSPHLS-STGSSG--IPNYLTTYVDPSVIDILVAP 69 (336) Q Consensus 1 m~~~~~-~~~l~~~g~~~~~~~~~~~~~----~~~~---~~da~d~~~~l~-t~~~~~--i~~~l~~~idp~v~~~~~~~ 69 (336) +..... +......+....+.......+ +..+ ..+... +..++ +++++| ||..+.+ +|++.+.+. T Consensus 95 ~~e~~~~~~~~~~~~~~~~~~~~~~~~e~r~a~~~~l~~~~~~~e-~~a~~~~t~~GG~lvP~~~~~----~Ii~~l~~~ 169 (434) T protein:vir:62 95 SEEQRSAISASIAAALSTKGHRTNKETEIRSVFANYIVGNIDEKE-ARALGLVTGNGSVTIPDFLSK----EIITYAQEE 169 (434) T ss_pred HHHHHHHHHHHHHhhhhhccccchHHHHHHHHHHHHhccccchhh-hhhhcccccccceecchhhHH----HHHHhhhhh Confidence 000000 000000111110000000000 0000 001000 00111 122333 6776654 333333333 Q ss_pred cchhhhcccccCCCcceeeEEEeeeecccceEEe---ecccCCceeeeeeeeeeeeEEEEEEEEEeCHHHHHHHHHhCCC Q lcl|Aclame:pro 70 MKAAELVGESKKGDWTTLVAAFITAEPTTTVATY---GDYSSDGDSGTNINYPQRQSYFFQTWTRWGERELEMAGAGRVD 146 (336) Q Consensus 70 ~~~~~l~~v~t~g~w~~~t~~~~v~e~~G~a~~y---gd~~DiP~vd~~~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~ 146 (336) .....+..+...+ ..+.|++....+.+... +..++.|..+.......-..+.++..+.+|.+=|.- ..++ T Consensus 170 ~~i~~~~~~~~~~----~~~~~p~~~~~~~a~~~~~~~e~~~~~~~~~~f~~v~~~~~k~~~~~~iS~ell~d---s~~~ 242 (434) T protein:vir:62 170 NFLRRLGTGVKTK----ENIKYPVLVKKAEAQGHKNERTNNEMPETDIEFDEIELSPTEFDALATVTKKLLAR---TGLP 242 (434) T ss_pred hhhhhhcceeccC----CceEEEEEecCCcccceecccccccccccccceeeEEeeheeeEeehhhHHHHHhc---chHH Confidence 3333333332111 23567776666666544 235677888888888888888898888888664443 3567 Q ss_pred HHHHHHHHHHHHHHHhhccEEEeeccccc-eEEEEecCCCCcccccccccccccCHHHHHHHHHHHHHHHHHHhCCceec Q lcl|Aclame:pro 147 LASELNYSSALGLAKFLNGSYLFGVAGLE-NYGLINDPSLSAPITATTPWSGSPAVEAVVNEVVTLFQVLQTQSQGIITQ 225 (336) Q Consensus 147 l~~~K~~aAr~a~e~~~n~i~~~Gd~~~g-~~GllN~Pnl~~~~~~~t~w~~~~T~~eI~~Di~~l~~~l~~~t~g~v~~ 225 (336) |.+.-....+.++.+.+++-.+.|++..+ .-|+++.+.+.. ++ +....++||.+++..+...- .. T Consensus 243 l~~~i~~~la~~~~~~~d~~~l~G~G~~~~~~g~~~~~~~~~----~~------~~~~~~d~l~~l~~~l~~~~----~~ 308 (434) T protein:vir:62 243 IEQIVMDELKKAYVRKETQYMVNGDEANNINDGALAKKAVEF----KT------DEKNLYDALVKMKNTPVKEV----RK 308 (434) T ss_pred HHHHHHHHHHHHHHHHHHHHHhccCCCCccccceeecccccc----cc------cccchhhHHHHHHhhcchhh----hc Confidence 88888888899999999999999997544 447777544422 11 11124567777777664321 11 Q ss_pred cCCcEEEecHHHHHhccc-CCCCCccHHH-HHHHh--CC----ccEEEEcccccC-CCCceEEEEEEeeCCCceEEEEeC Q lcl|Aclame:pro 226 EAVLHMGLPPTAMSDLSK-TNQYGLSAAA-KLKEI--FP----KLEFVTIPEYDT-ASGRLVQLWAPRVEGKDTATCGFT 296 (336) Q Consensus 226 ~~p~tL~Lp~~~~~~Ls~-~~~~~~Tvl~-~l~~n--~p----nl~i~~~pel~~-a~G~~~~~~~~~~~~~~~~~~~~p 296 (336) .-.++|.+..+..|.. .+..|.-++. ...-+ .| +..++....+.. ++|+...+++-+...--.+...=+ T Consensus 309 --~a~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~~~g~~~tl~G~pV~~~~~~~~~~~~~~~~i~~Gdfs~~~i~~~~g~ 386 (434) T protein:vir:62 309 --KARWVLNTAALTKIETMKTDDGFPLLRPFNQAEGGIGYTLLGFPVEEEDAIDIPDSPDTPVFYFGDFSKFYIQDVIGS 386 (434) T ss_pred --CCEEEEcHHHHHHHHHhhccCCCEeeccCCCccCCCCceecceeeEEecCccCccCCCceEEEEeeccceEEEEeece Confidence 1257888888888854 3333432321 11000 01 122333333322 233333232211100000000001 Q ss_pred chhhcccc-eecCCceEEeeecceeeeEEecccceeeeccC Q lcl|Aclame:pro 297 EKMRAHSI-ERYSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) Q Consensus 297 ~~~~~l~~-~~~~~~~~v~~~~rt~Gv~ir~P~ai~~~~GI 336 (336) +.++.+.. .......-.-+..|..|-.||.|.+++-..+. T Consensus 387 ~~i~~~~~~~~~~~~v~~~~~~r~Dgk~i~~~~~~~~~~~~ 427 (434) T protein:vir:62 387 LEVQKLVELFSRTNRVGFRIWNLLDAQLIHSPFEVPVYKYV 427 (434) T ss_pred eEEEeehhhhcccCceEEEEEeeecceeecCcccceEEEEE Confidence 12222211 11233445677888999999999999877555 No 68 >protein:vir:4700 Length: 415 # NCBI annotation: phi PVL ORF 7 homologue # Family: family:all:21 # MgeID: mge:102 # MgeName: phiPV83 # Cross-refs: genbank:acc:NP_061632;genbank:gi:9635719;genbank:GeneID:1262976 Probab=96.22 E-value=0.00086 Score=37.40 Aligned_cols=303 Identities=9% Similarity=0.049 Sum_probs=139.3 Q ss_pred Cch---H-HHHHHHh-------------------------hcceeccchhhhhhhhhhhh---hhhhhhhcCccccCCcc Q lcl|Aclame:pro 1 MRD---A-QRIQNLA-------------------------RAGVILPRSVKNVSTPLAEY---AMDAADLSPHLSSTGSS 48 (336) Q Consensus 1 m~~---~-~~~~~l~-------------------------~~g~~~~~~~~~~~~~~~~~---~~da~d~~~~l~t~~~~ 48 (336) ++. . +++.+.. ..+..+... .....+.+.+ ............+++.+ T Consensus 51 i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~t~~g 129 (415) T protein:vir:47 51 IQEKQEELDKLKEKDRTSENNQQSVEVNEARTYRNQANINDLGISIQNT-KVTSQEVRDFTEYLETRNDIQGGSLKTDSG 129 (415) T ss_pred HHHHHHHHHHHHHHHHhhhhcccccccchhhhhHHHHHHHHHHHhhhhh-hhhHHHHHHHHHHHhhhhhhhhccccccCC Confidence 000 0 0000000 000000000 0000000000 00000001111122222 Q ss_pred --hHHHHHHHhhCceeeeeeccccchhhhcccccCCCcceeeEEEeee--ecccceEEeecccCCceee-eeeeeeeeeE Q lcl|Aclame:pro 49 --GIPNYLTTYVDPSVIDILVAPMKAAELVGESKKGDWTTLVAAFITA--EPTTTVATYGDYSSDGDSG-TNINYPQRQS 123 (336) Q Consensus 49 --~i~~~l~~~idp~v~~~~~~~~~~~~l~~v~t~g~w~~~t~~~~v~--e~~G~a~~ygd~~DiP~vd-~~~~~~~~~v 123 (336) .+|..+. ++|++.+........++.+..... .+..+++. ...+.+...+.+..+|-.+ .......... T Consensus 130 ~~~iP~~~~----~~ii~~~~~~~~l~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~v~Eg~~~~~~~~~~~~~v~~~~ 202 (415) T protein:vir:47 130 FVVIPEEIV----TDILKLKEVEFNLDKYVTVKRVTN---GSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQLAYDI 202 (415) T ss_pred cccccHHHH----HHHHHHHHhhhhhhhhcceeeccC---CceeEEEEEecCCcceeecccccccccccccceeeEEeee Confidence 3665555 455666666666667666533221 11233333 3344566778888888554 5778888888 Q ss_pred EEEEEEEEeCHHHHHHHHHhCCCHHHHHHHHHHHHHHHhhccEEEeeccccceEEEEecCCCCcccccccccccccCHHH Q lcl|Aclame:pro 124 YFFQTWTRWGERELEMAGAGRVDLASELNYSSALGLAKFLNGSYLFGVAGLENYGLINDPSLSAPITATTPWSGSPAVEA 203 (336) Q Consensus 124 ~~~~~~~~y~~~El~~A~~~g~~l~~~K~~aAr~a~e~~~n~i~~~Gd~~~g~~GllN~Pnl~~~~~~~t~w~~~~T~~e 203 (336) +.++..+.+|.+=+. ....+|.+.-....+.++.+.+++-++.|+......+........ ..+ +...+ .. T Consensus 203 ~k~~~~~~iS~ell~---ds~~~l~~~i~~~l~~~i~~~~d~~il~g~g~g~~~~~~~~~~~~-~~~-----~~~~~-~~ 272 (415) T protein:vir:47 203 NTHRGYFRISREAIE---DAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGFEKE-GKK-----LEVKK-AK 272 (415) T ss_pred eeeEeeehhhHHHHh---hchHHHHHHHHHHHHHHHHHHHHHHHhhccccCCccccccccccc-cce-----ecccc-cc Confidence 889988888875443 344688888888888888999999888888654333333321111 011 11111 11 Q ss_pred HHHHHHHHHHHHHHHhCCceeccCCcEEEecHHHHHhccc-CCCCCccHHH-HHHHhCC----ccEEEEccccc-CCCCc Q lcl|Aclame:pro 204 VVNEVVTLFQVLQTQSQGIITQEAVLHMGLPPTAMSDLSK-TNQYGLSAAA-KLKEIFP----KLEFVTIPEYD-TASGR 276 (336) Q Consensus 204 I~~Di~~l~~~l~~~t~g~v~~~~p~tL~Lp~~~~~~Ls~-~~~~~~Tvl~-~l~~n~p----nl~i~~~pel~-~a~G~ 276 (336) -++||.+++..+...- -.+..++|.++.+..|.+ .+..|.-++. -+...-| ++.++..+..- +++|. T Consensus 273 ~~~~i~~~~~~~~~~~------~~~~~~v~n~~~~~~L~~lkd~~G~~i~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~ 346 (415) T protein:vir:47 273 SLDDIKDAINLNVKPN------YEHNVAIVSQTMFAKLDKMKDKLGNYLIQPDVKEKTQQRLLGAKIEILPDEVLGQKGN 346 (415) T ss_pred chHHHHHHHHhhhhhc------cCCCEEEEcHHHHHHHHHhhccCCCeeeccCcCCCCCccccceeeEEeccccccCCCc Confidence 3567777777665432 135689999999998864 3333332221 0111111 12333333222 23344 Q ss_pred eEEEEEEeeCC-----CceEEEEeCchhhcccceecCCceEEeeecceeeeEEecccceeeeccC Q lcl|Aclame:pro 277 LVQLWAPRVEG-----KDTATCGFTEKMRAHSIERYSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) Q Consensus 277 ~~~~~~~~~~~-----~~~~~~~~p~~~~~l~~~~~~~~~~v~~~~rt~Gv~ir~P~ai~~~~GI 336 (336) ...+|.+-.+. ..-..+.. .+|.. .....-+..|+ |+.+.+|.||++++-- T Consensus 347 ~~~~~gd~~~~~~~~~~~~~~v~~-~~~~~-------~~~~~~~~~r~-d~~v~~~~a~~~~~~~ 402 (415) T protein:vir:47 347 NTLIIGNLKDAIVLFDRSQYQASW-TDYMH-------FGECLMIAVRQ-DCRILDYKSAIVIEYD 402 (415) T ss_pred cEEEEEehhccEEEEeecceEEEe-ecccc-------CceEEEEEEEe-ccEEeccccEEEEEee Confidence 33333321100 00011111 11111 11122345565 6667789999988744 No 69 >protein:vir:4600 Length: 415 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:101 # MgeName: PVL # Cross-refs: genbank:acc:NP_058445;genbank:gi:9635171;genbank:GeneID:1262708 Probab=96.22 E-value=0.00086 Score=37.40 Aligned_cols=303 Identities=9% Similarity=0.049 Sum_probs=139.3 Q ss_pred Cch---H-HHHHHHh-------------------------hcceeccchhhhhhhhhhhh---hhhhhhhcCccccCCcc Q lcl|Aclame:pro 1 MRD---A-QRIQNLA-------------------------RAGVILPRSVKNVSTPLAEY---AMDAADLSPHLSSTGSS 48 (336) Q Consensus 1 m~~---~-~~~~~l~-------------------------~~g~~~~~~~~~~~~~~~~~---~~da~d~~~~l~t~~~~ 48 (336) ++. . +++.+.. ..+..+... .....+.+.+ ............+++.+ T Consensus 51 i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~t~~g 129 (415) T protein:vir:46 51 IQEKQEELDKLKEKDRTSENNQQSVEVNEARTYRNQANINDLGISIQNT-KVTSQEVRDFTEYLETRNDIQGGSLKTDSG 129 (415) T ss_pred HHHHHHHHHHHHHHHHhhhhcccccccchhhhhHHHHHHHHHHHhhhhh-hhhHHHHHHHHHHHhhhhhhhhccccccCC Confidence 000 0 0000000 000000000 0000000000 00000001111122222 Q ss_pred --hHHHHHHHhhCceeeeeeccccchhhhcccccCCCcceeeEEEeee--ecccceEEeecccCCceee-eeeeeeeeeE Q lcl|Aclame:pro 49 --GIPNYLTTYVDPSVIDILVAPMKAAELVGESKKGDWTTLVAAFITA--EPTTTVATYGDYSSDGDSG-TNINYPQRQS 123 (336) Q Consensus 49 --~i~~~l~~~idp~v~~~~~~~~~~~~l~~v~t~g~w~~~t~~~~v~--e~~G~a~~ygd~~DiP~vd-~~~~~~~~~v 123 (336) .+|..+. ++|++.+........++.+..... .+..+++. ...+.+...+.+..+|-.+ .......... T Consensus 130 ~~~iP~~~~----~~ii~~~~~~~~l~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~v~Eg~~~~~~~~~~~~~v~~~~ 202 (415) T protein:vir:46 130 FVVIPEEIV----TDILKLKEVEFNLDKYVTVKRVTN---GSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQLAYDI 202 (415) T ss_pred cccccHHHH----HHHHHHHHhhhhhhhhcceeeccC---CceeEEEEEecCCcceeecccccccccccccceeeEEeee Confidence 3665555 455666666666667666533221 11233333 3344566778888888554 5778888888 Q ss_pred EEEEEEEEeCHHHHHHHHHhCCCHHHHHHHHHHHHHHHhhccEEEeeccccceEEEEecCCCCcccccccccccccCHHH Q lcl|Aclame:pro 124 YFFQTWTRWGERELEMAGAGRVDLASELNYSSALGLAKFLNGSYLFGVAGLENYGLINDPSLSAPITATTPWSGSPAVEA 203 (336) Q Consensus 124 ~~~~~~~~y~~~El~~A~~~g~~l~~~K~~aAr~a~e~~~n~i~~~Gd~~~g~~GllN~Pnl~~~~~~~t~w~~~~T~~e 203 (336) +.++..+.+|.+=+. ....+|.+.-....+.++.+.+++-++.|+......+........ ..+ +...+ .. T Consensus 203 ~k~~~~~~iS~ell~---ds~~~l~~~i~~~l~~~i~~~~d~~il~g~g~g~~~~~~~~~~~~-~~~-----~~~~~-~~ 272 (415) T protein:vir:46 203 NTHRGYFRISREAIE---DAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGFEKE-GKK-----LEVKK-AK 272 (415) T ss_pred eeeEeeehhhHHHHh---hchHHHHHHHHHHHHHHHHHHHHHHHhhccccCCccccccccccc-cce-----ecccc-cc Confidence 889988888875443 344688888888888888999999888888654333333321111 011 11111 11 Q ss_pred HHHHHHHHHHHHHHHhCCceeccCCcEEEecHHHHHhccc-CCCCCccHHH-HHHHhCC----ccEEEEccccc-CCCCc Q lcl|Aclame:pro 204 VVNEVVTLFQVLQTQSQGIITQEAVLHMGLPPTAMSDLSK-TNQYGLSAAA-KLKEIFP----KLEFVTIPEYD-TASGR 276 (336) Q Consensus 204 I~~Di~~l~~~l~~~t~g~v~~~~p~tL~Lp~~~~~~Ls~-~~~~~~Tvl~-~l~~n~p----nl~i~~~pel~-~a~G~ 276 (336) -++||.+++..+...- -.+..++|.++.+..|.+ .+..|.-++. -+...-| ++.++..+..- +++|. T Consensus 273 ~~~~i~~~~~~~~~~~------~~~~~~v~n~~~~~~L~~lkd~~G~~i~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~ 346 (415) T protein:vir:46 273 SLDDIKDAINLNVKPN------YEHNVAIVSQTMFAKLDKMKDKLGNYLIQPDVKEKTQQRLLGAKIEILPDEVLGQKGN 346 (415) T ss_pred chHHHHHHHHhhhhhc------cCCCEEEEcHHHHHHHHHhhccCCCeeeccCcCCCCCccccceeeEEeccccccCCCc Confidence 3567777777665432 135689999999998864 3333332221 0111111 12333333222 23344 Q ss_pred eEEEEEEeeCC-----CceEEEEeCchhhcccceecCCceEEeeecceeeeEEecccceeeeccC Q lcl|Aclame:pro 277 LVQLWAPRVEG-----KDTATCGFTEKMRAHSIERYSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) Q Consensus 277 ~~~~~~~~~~~-----~~~~~~~~p~~~~~l~~~~~~~~~~v~~~~rt~Gv~ir~P~ai~~~~GI 336 (336) ...+|.+-.+. ..-..+.. .+|.. .....-+..|+ |+.+.+|.||++++-- T Consensus 347 ~~~~~gd~~~~~~~~~~~~~~v~~-~~~~~-------~~~~~~~~~r~-d~~v~~~~a~~~~~~~ 402 (415) T protein:vir:46 347 NTLIIGNLKDAIVLFDRSQYQASW-TDYMH-------FGECLMIAVRQ-DCRILDYKSAIVIEYD 402 (415) T ss_pred cEEEEEehhccEEEEeecceEEEe-ecccc-------CceEEEEEEEe-ccEEeccccEEEEEee Confidence 33333321100 00011111 11111 11122345565 6667789999988744 No 70 >protein:vir:1328 Length: 392 # NCBI annotation: gp36 # Family: family:all:21 # MgeID: mge:28 # MgeName: phi-C31 # Cross-refs: genbank:acc:NP_047927;swissprot:trembl:q9zwv6;genbank:gi:9631145;uniprot:Q9ZWV6;genbank:GeneID:2715889 Probab=96.06 E-value=0.0011 Score=36.88 Aligned_cols=307 Identities=10% Similarity=0.042 Sum_probs=141.1 Q ss_pred CchHHHHHHHhh-----cceeccchhhhhhh-----------hhhhhhhhhhhhcCccccCCcc-hH-HHHHHHhhCcee Q lcl|Aclame:pro 1 MRDAQRIQNLAR-----AGVILPRSVKNVST-----------PLAEYAMDAADLSPHLSSTGSS-GI-PNYLTTYVDPSV 62 (336) Q Consensus 1 m~~~~~~~~l~~-----~g~~~~~~~~~~~~-----------~~~~~~~da~d~~~~l~t~~~~-~i-~~~l~~~idp~v 62 (336) ....+...++++ -+-.-+........ +.+.. ..+..... .++..++ .+ |......|+ ++ T Consensus 58 ~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~g~~~~~~~~-~~~~~~~~-~t~~~~g~~~~~~~~~~~i~-~~ 134 (392) T protein:vir:13 58 IDAIKATDAVTSLLSGLQGSGSGAQRSADHDDDAVLRAGNLGEARSF-EFAPEKRD-GTKAGNPNVLSRTLYGQLIA-QA 134 (392) T ss_pred HHHHHHHHHHHHHhcccCCcccchhhhhhHHHHHHHhccchhhhHHH-Hhhhhhhc-ccccCCCccccccchHHHHH-HH Confidence 000011111110 00000000000000 00000 00111111 1112222 12 222222221 11 Q ss_pred eeeeccccc-hhhhcccccCCCcceeeEEEeeeecccceEEeecccCCceeeeeeeeeeeeEEEEEEEEEeCHHHHHHHH Q lcl|Aclame:pro 63 IDILVAPMK-AAELVGESKKGDWTTLVAAFITAEPTTTVATYGDYSSDGDSGTNINYPQRQSYFFQTWTRWGERELEMAG 141 (336) Q Consensus 63 ~~~~~~~~~-~~~l~~v~t~g~w~~~t~~~~v~e~~G~a~~ygd~~DiP~vd~~~~~~~~~v~~~~~~~~y~~~El~~A~ 141 (336) . ..+..++ ....++... ...+.+++.+..+.+..++.+..+|..+.......-..+.++..+.+|.+=|+. T Consensus 135 ~-~~~~~l~~~~~~~~~~~-----~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~f~~v~~~~~k~~~~~~iS~ell~d-- 206 (392) T protein:vir:13 135 V-ERSAIMRGGASTFTTSD-----ANPMDFTVITGRATAGIVGETAEIPESYPATTQRSMGGFKYGFASVVSYEFATD-- 206 (392) T ss_pred H-hhhhhhhhcceeeecCC-----CceeEEEEEcCCcceeeecccccccccccceeeEEeeeeeEEeeehhHHHHHhc-- Confidence 0 1111111 122222211 234567777888888888999999999988888888889999888888665553 Q ss_pred HhCCCHHHHHHHHHHHHHHHhhccEEEeeccccceEEEEecCCCCcccccccccccccCHHHHHHHHHHHHHHHHHHhCC Q lcl|Aclame:pro 142 AGRVDLASELNYSSALGLAKFLNGSYLFGVAGLENYGLINDPSLSAPITATTPWSGSPAVEAVVNEVVTLFQVLQTQSQG 221 (336) Q Consensus 142 ~~g~~l~~~K~~aAr~a~e~~~n~i~~~Gd~~~g~~GllN~Pnl~~~~~~~t~w~~~~T~~eI~~Di~~l~~~l~~~t~g 221 (336) ..+++.+.-....+.++.+.++.-+++|++...-.|+++++.... ....+ ..++ .-.++||.+++..|...-. T Consensus 207 -s~~~l~~~i~~~l~~~i~~~~d~~~l~G~Gt~~p~Gil~~~~~~~---~~~~~-~~~~-~~~~d~l~~~~~~l~~~~~- 279 (392) T protein:vir:13 207 -QVLDLVGFLVSDAGPAIGDAMGRHFLTGTGTGQPRGILTDATGAN---AAFGE-ADAD-SKVSDALIDLFHEVPSAYR- 279 (392) T ss_pred -chHHHHHHHHHHHHHHHHHHHHHHHhcccCCcccccccccccccc---ccccc-cccc-cccHHHHHHHHHhhhhhhh- Confidence 355788888888888899999999999998777889999754321 11111 1111 1235667777766643321 Q ss_pred ceeccCCcEEEecHHHHHhccc-CCCCCccHHH-HHHHhCCccEEEEcccccCCC-CceEEEEEEeeCCCceEEEEeCch Q lcl|Aclame:pro 222 IITQEAVLHMGLPPTAMSDLSK-TNQYGLSAAA-KLKEIFPKLEFVTIPEYDTAS-GRLVQLWAPRVEGKDTATCGFTEK 298 (336) Q Consensus 222 ~v~~~~p~tL~Lp~~~~~~Ls~-~~~~~~Tvl~-~l~~n~pnl~i~~~pel~~a~-G~~~~~~~~~~~~~~~~~~~~p~~ 298 (336) ..-.++|.+..+..|.. .+..|.-++. -+...-| -++-..|-..... .....++.+ . .. ..+..-.. T Consensus 280 -----~~a~~v~n~~~~~~l~~lkd~~G~~l~~~~~~~g~~-~~l~G~Pv~~~~~~~~~~i~~Gd-f--~~-~~i~~~~~ 349 (392) T protein:vir:13 280 -----KNAKFVVNDLRAAQMRKLKDANGQYLWQSALTVGAP-DTFNGKVVETDDGMPADKVLFAD-L--SK-YRVRFAGS 349 (392) T ss_pred -----cCCEEEEcHHHHHHHHHhhccCCceeecCCcCCCCC-ceecceeeEEcCCCCCCcEEEee-c--cc-eeEEeecc Confidence 13368899988888753 3433432221 0000001 0122222221111 001111111 0 00 00111111 Q ss_pred hhc--cc-ceecCCceEEeeecceeeeEEecccceeeeccC Q lcl|Aclame:pro 299 MRA--HS-IERYSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) Q Consensus 299 ~~~--l~-~~~~~~~~~v~~~~rt~Gv~ir~P~ai~~~~GI 336 (336) ++. .. .........+-+..|.+|. +++|.||+.+..= T Consensus 350 ~~i~~~~~~~~~~~~~~~r~~~r~d~~-~~~~~A~~~~~~~ 389 (392) T protein:vir:13 350 LRVDRSVDAKFSTDQIVYRFLQRADGL-LVDARGAKVLTVT 389 (392) T ss_pred eEEEeeccccccCCcEEEEEEEEeccE-EecccceEEEEee Confidence 111 00 1112223455667777655 7789998866665 No 71 >protein:vir:81070 Length: 390 # NCBI annotation: p09 # Family: family:all:585 # MgeID: mge:1889 # MgeName: Xop411 # Cross-refs: genbank:acc:YP_001285679;genbank:gi:148727187;genbank:GeneID:5247115 Probab=96.01 E-value=0.0011 Score=36.73 Aligned_cols=302 Identities=15% Similarity=0.101 Sum_probs=144.3 Q ss_pred CchHH-HHHHHhhcceeccchhhhhhhh-------------------hhhhhhhh-hhhcCccccCCcc--hHHHHHHHh Q lcl|Aclame:pro 1 MRDAQ-RIQNLARAGVILPRSVKNVSTP-------------------LAEYAMDA-ADLSPHLSSTGSS--GIPNYLTTY 57 (336) Q Consensus 1 m~~~~-~~~~l~~~g~~~~~~~~~~~~~-------------------~~~~~~da-~d~~~~l~t~~~~--~i~~~l~~~ 57 (336) ++..+ .+.++++..-.-.......... .......+ ...+....+.+.+ .+|.+. T Consensus 54 i~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~--- 130 (390) T protein:vir:81 54 VQAARQRVAELEGNGAGGDVQHVSVGDMFVASEQFQASAGRWNDRSARATMNIKAALNTASTDAAGSAGALTTPNRL--- 130 (390) T ss_pred HHHHHHHHHHHHhcccccccccccchhhhhhhHHHHHHHHHHhhhhhhhhhHHHHHHHhhccccccCCcceechhhh--- Confidence 11111 1111221111000000000000 00000001 0111111111111 123333 Q ss_pred hCceeeeeeccccchhhhcccccCCCcceeeEEEeeeec-ccceEEeecccCCceeeeeeeeeeeeEEEEEEEEEeCHHH Q lcl|Aclame:pro 58 VDPSVIDILVAPMKAAELVGESKKGDWTTLVAAFITAEP-TTTVATYGDYSSDGDSGTNINYPQRQSYFFQTWTRWGERE 136 (336) Q Consensus 58 idp~v~~~~~~~~~~~~l~~v~t~g~w~~~t~~~~v~e~-~G~a~~ygd~~DiP~vd~~~~~~~~~v~~~~~~~~y~~~E 136 (336) +++++.+........++++.+.+. .++.++.... .+.+...+.+..+|..+.........++.++..+.+|.+= T Consensus 131 --~~ii~~~~~~~~l~~~~~~~~~~~---~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~i~~~~~k~~~~~~is~el 205 (390) T protein:vir:81 131 --PGFITPPDARLTVRDLIGSGRTDS---ALIEYVQETGFVNNAAIVAEGALKPESSLKFAKKTDTTHVIAHTMKATRQI 205 (390) T ss_pred --HHHHHHHhhhhhhhhhcceeeccC---CceEEEEEecCCcceeeecCCcccccccceeeEEEEeeeEEEEeehhhHHH Confidence 345555555566666666544332 2345555544 4677788888899999999999999999999999998753 Q ss_pred HHHHHHhCCCHHHHHHHHHHHHHHHhhccEEEeeccc-cceEEEEecCCCCcccccccccccccCHHHHHHHHHHHHHHH Q lcl|Aclame:pro 137 LEMAGAGRVDLASELNYSSALGLAKFLNGSYLFGVAG-LENYGLINDPSLSAPITATTPWSGSPAVEAVVNEVVTLFQVL 215 (336) Q Consensus 137 l~~A~~~g~~l~~~K~~aAr~a~e~~~n~i~~~Gd~~-~g~~GllN~Pnl~~~~~~~t~w~~~~T~~eI~~Di~~l~~~l 215 (336) ++.+ .++.+.-....++++.+.+|+-.++|+.. ....|++|.+........ .+...-++||..++..+ T Consensus 206 l~d~----~~~~~~i~~~l~~~~~~~~d~a~l~G~g~~~~~~Gi~~~~~~~~~~~~-------~~~~~~~~~~~~~~~~~ 274 (390) T protein:vir:81 206 LSDA----PQLASYMNNRLIRGLKVKEDAEILRGTGANDGLLGLIPQATTYAAPTT-------IAGATRVDQLRLAMLQA 274 (390) T ss_pred HHhH----HHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCcccceeecccccccccc-------cccchhHHHHHHHHHhh Confidence 3322 25888888888888888889888999864 347899997665432211 11122456777777666 Q ss_pred HHHhCCceeccCCcEEEecHHHHHhccc-CCCCCccHHHHHHHhC-C---ccEEEEcccccCC---CCce--EEEEEEee Q lcl|Aclame:pro 216 QTQSQGIITQEAVLHMGLPPTAMSDLSK-TNQYGLSAAAKLKEIF-P---KLEFVTIPEYDTA---SGRL--VQLWAPRV 285 (336) Q Consensus 216 ~~~t~g~v~~~~p~tL~Lp~~~~~~Ls~-~~~~~~Tvl~~l~~n~-p---nl~i~~~pel~~a---~G~~--~~~~~~~~ 285 (336) ... + ..+..++|.|+.+..|.+ .+..|.-++.-....- + ++.++..+.+... -|.- .+.++++ T Consensus 275 ~~~--~----~~~~~~v~~~~~~~~l~~lkd~~G~~l~~~~~~~~~~~l~G~pv~~~~~~p~~~~~~gd~~~~~~~~~~- 347 (390) T protein:vir:81 275 SLA--E----YNPSGIVINPIDWAAIELAKDANNQYLIGNARGTLTPTLWGLPVVATQAMAPGEFLVGAFDLAAQIFDQ- 347 (390) T ss_pred ccc--c----CCCCEEEEcHHHHHHHHHhhcCCCceeecCcccccCceecceeeEEcCCCCCCcEEEEehhceEEEEEe- Confidence 432 1 235579999999888864 3444433322111110 1 1222222222110 1111 1122111 Q ss_pred CCCceEEEEeCchhhcccceecCCceEEeeecceeeeEEecccceeeeccC Q lcl|Aclame:pro 286 EGKDTATCGFTEKMRAHSIERYSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) Q Consensus 286 ~~~~~~~~~~p~~~~~l~~~~~~~~~~v~~~~rt~Gv~ir~P~ai~~~~GI 336 (336) .+ ..+ .+...+.-...-...+-+..|++| .++.|.||++.+== T Consensus 348 ~~---~~v----~~~~~~~~~~~~~v~~r~~~r~d~-~v~~~~a~v~~t~a 390 (390) T protein:vir:81 348 WD---ARV----EIGYVGEDFQRNMITVLAEERLAL-VVYRPEALISGSFA 390 (390) T ss_pred cc---eEE----EEecccchhhcCcEEEEEEEeecc-EEecccceEEEEeC Confidence 01 111 111111111111233445666655 66778877654311 No 72 >protein:vir:6242 Length: 390 # NCBI annotation: gp36 # Family: family:all:21 # MgeID: mge:131 # MgeName: phi-BT1 # Cross-refs: genbank:acc:NP_813696;swissprot:trembl:q859c1;genbank:gi:29366756;interpro:IPR006444;uniprot:Q859C1;genbank:GeneID:1258897 Probab=95.99 E-value=0.0011 Score=36.73 Aligned_cols=302 Identities=9% Similarity=0.035 Sum_probs=139.2 Q ss_pred CchHHHHH----HHhh-cceeccchh-h------hh-h---hhhhhhhhhhhhhcCccccCCcch--HHHHHHHhhCcee Q lcl|Aclame:pro 1 MRDAQRIQ----NLAR-AGVILPRSV-K------NV-S---TPLAEYAMDAADLSPHLSSTGSSG--IPNYLTTYVDPSV 62 (336) Q Consensus 1 m~~~~~~~----~l~~-~g~~~~~~~-~------~~-~---~~~~~~~~da~d~~~~l~t~~~~~--i~~~l~~~idp~v 62 (336) ....+... .+.+ .+....... . ++ . ...+..... .... ..++.++++ +|.+....|. ++ T Consensus 58 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~r~~~~~-~~~~-~~t~~~~g~~~~~~~~~~~i~-~~ 134 (390) T protein:vir:62 58 IEAIKAIDPVTSLLSGLQGSGSGAQRSADVDDDATLRAGNLGEARSFEFA-PEKR-DGTKAGNPNVLSRTLYGQLIA-QA 134 (390) T ss_pred HHHHHHHHHHHHHHhhcccccccchhhcchHHHHHHhhhhhhhhHHHHhh-hhhh-cccccCCCccccccchHHHHH-HH Confidence 00000000 0000 111111000 0 00 0 001111111 1111 112222322 2223322221 11 Q ss_pred eeeeccccc-hhhhcccccCCCcceeeEEEeeeecccceEEeecccCCceeeeeeeeeeeeEEEEEEEEEeCHHHHHHHH Q lcl|Aclame:pro 63 IDILVAPMK-AAELVGESKKGDWTTLVAAFITAEPTTTVATYGDYSSDGDSGTNINYPQRQSYFFQTWTRWGERELEMAG 141 (336) Q Consensus 63 ~~~~~~~~~-~~~l~~v~t~g~w~~~t~~~~v~e~~G~a~~ygd~~DiP~vd~~~~~~~~~v~~~~~~~~y~~~El~~A~ 141 (336) .+ .+..++ ....++..+. ..+.+++....+.+...+-...+|-.+.......-..+.++..+.+|.+=|+- T Consensus 135 ~~-~~~~l~~~~~~~~~~~~-----~~~~~p~~~~~~~a~wv~E~~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~d-- 206 (390) T protein:vir:62 135 VE-RSAIMRGGATTFTTSDA-----NPLDFTVITGRSSASIVGETAEIPESYPATAQRSMGGFKYGFASVVSYEFATD-- 206 (390) T ss_pred Hh-hhhhhhhcceeeecCCC-----ceeEEEEEcCCcceeeecccccccccccceeeeEeeeeeEEeehHHHHHHHhh-- Confidence 11 122221 2233332221 23567788888888888888899999998888899999999988888665554 Q ss_pred HhCCCHHHHHHHHHHHHHHHhhccEEEeeccccceEEEEecCCCCcccccccccccccCHHHHHHHHHHHHHHHHHHhCC Q lcl|Aclame:pro 142 AGRVDLASELNYSSALGLAKFLNGSYLFGVAGLENYGLINDPSLSAPITATTPWSGSPAVEAVVNEVVTLFQVLQTQSQG 221 (336) Q Consensus 142 ~~g~~l~~~K~~aAr~a~e~~~n~i~~~Gd~~~g~~GllN~Pnl~~~~~~~t~w~~~~T~~eI~~Di~~l~~~l~~~t~g 221 (336) ..+++.+.-....+.++.+.+++-.++|++. -.|++|+++...... .....+..| ++||.+++..|..... T Consensus 207 -s~~~l~~~i~~~l~~~i~~~~d~~~l~G~G~--p~Gi~~~~~~~~~~~-~~~~~~~~~----~~~l~~~~~~l~~~~~- 277 (390) T protein:vir:62 207 -QVLDLVGFLVSDAGPAIGDAMGRHFITGTGQ--PRGILTDASPATATF-LATDTDSKV----SDALIDLFHEVPSAYR- 277 (390) T ss_pred -hhHHHHHHHHHHHHHHHHHHHHhhhhccCCc--cccccccccccccce-ecccccccc----hHHHHHHHHhhhhhhh- Confidence 4567888888888899999999989999874 369999876543221 111112233 4556666665533211 Q ss_pred ceeccCCcEEEecHHHHHhccc-CCCCCccHH-HHHHH-------hCCccEEEEcccccCCCCceEEEEEEeeCCCceEE Q lcl|Aclame:pro 222 IITQEAVLHMGLPPTAMSDLSK-TNQYGLSAA-AKLKE-------IFPKLEFVTIPEYDTASGRLVQLWAPRVEGKDTAT 292 (336) Q Consensus 222 ~v~~~~p~tL~Lp~~~~~~Ls~-~~~~~~Tvl-~~l~~-------n~pnl~i~~~pel~~a~G~~~~~~~~~~~~~~~~~ 292 (336) . --..+|.++.+..|.+ .+..|.=++ .-+.. -+|=+.-..+|...-.-|.-.+.++....+..+. T Consensus 278 ---~--~a~~vmn~~~~~~L~~lkd~~g~~l~~~~~~~g~~~~l~G~Pv~~~~~~p~~~i~~gd~s~~~i~~~~~~~v~- 351 (390) T protein:vir:62 278 ---A--NAKYVVNDLRAAQMRKLKDANGQYLWQSGLTVGAPSLFNGKVVETDDGMPADKILFADLSKYRVRFAGSLRVD- 351 (390) T ss_pred ---c--CCEEEEchHHHHHHHHhhccCCCeeecCCcCCCccceecccceEEecCCCCccEEEeeccceeEEeecceEEE- Confidence 1 1258889988888853 232222111 00110 1221111111110000011111111111110000 Q ss_pred EEeCchhhcccceecCCceEEeeecceeeeEEecccceeeeccC Q lcl|Aclame:pro 293 CGFTEKMRAHSIERYSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) Q Consensus 293 ~~~p~~~~~l~~~~~~~~~~v~~~~rt~Gv~ir~P~ai~~~~GI 336 (336) .... +....-...+-+..|.+| .+..|.||+.+..= T Consensus 352 -~~~~------~~~~~~~~~~~~~~r~d~-~~~~~~A~~~l~~~ 387 (390) T protein:vir:62 352 -RSVD------AKFSTDQIVYRFLQRADG-LLVDARGAKVLTVT 387 (390) T ss_pred -eecc------ccccCCcEEEEEEEEeCc-EeechhheEEEEee Confidence 0011 111122344556667665 68899998887755 No 73 >protein:vir:4197 Length: 314 # NCBI annotation: putative structural protein # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:88 # MgeName: psiM100 # Cross-refs: genbank:acc:NP_071822;genbank:gi:11863105;genbank:GeneID:1257607 Probab=95.88 E-value=0.0013 Score=36.38 Aligned_cols=290 Identities=12% Similarity=0.041 Sum_probs=138.0 Q ss_pred CchHHHHHHHhhcceeccchhhhhhhhhhhhhhhhhhhcCccccCCcchHHHHHHHhhCceeeeeeccccchhhhccccc Q lcl|Aclame:pro 1 MRDAQRIQNLARAGVILPRSVKNVSTPLAEYAMDAADLSPHLSSTGSSGIPNYLTTYVDPSVIDILVAPMKAAELVGESK 80 (336) Q Consensus 1 m~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~da~d~~~~l~t~~~~~i~~~l~~~idp~v~~~~~~~~~~~~l~~v~t 80 (336) |+-.+..-+.-| ++...|++++.. .|+.++.+|+ .+ +..-+-++....+. + T Consensus 1 ~~~~~~~~~~~k-------------------~it~~d~~gG~L------~P~~~~~~i~-~l-~e~s~i~~~a~vi~--t 51 (314) T protein:vir:41 1 MDFLNKPFQITP-------------------KIDVPDLGKGIL------AVQRFGEFVR-EV-RENSAIIKDARVLN--A 51 (314) T ss_pred CchhhhHHHhhc-------------------ccccccCCCcee------ChHHHHHHHH-HH-Hhccchhhheeeec--c Confidence 331111111111 112223333322 2555666552 22 22222222222221 2 Q ss_pred CCCcceeeEEEeeeec----ccceEEeecccCCceeeeeeeeeeeeEEEEEEEEEeCHHHHHHHHHhCCCHHHHHHHHHH Q lcl|Aclame:pro 81 KGDWTTLVAAFITAEP----TTTVATYGDYSSDGDSGTNINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASELNYSSA 156 (336) Q Consensus 81 ~g~w~~~t~~~~v~e~----~G~a~~ygd~~DiP~vd~~~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~K~~aAr 156 (336) .++.. ..+..+.. ...+...|+.++.|-.+..........+.+..-+..+.+.|+-.+ -|.++...-....+ T Consensus 52 ~~s~~---~~i~~i~~g~~~~~~~~~~~~~~~~~~~~~tf~~~~l~~~kl~~~v~is~e~L~D~a-~~~~le~~i~~~~A 127 (314) T protein:vir:41 52 LKSYE---VDISRISLGVELEPGRNTSGTKVAPTADEVTVSTNTLEMKELVTKVVLEDEALEDNI-EQSAFEQTITSLLA 127 (314) T ss_pred cCccc---eeecccccCcccccccccccCCccCCcccccccceeeeeEEEEEeecccHHHHHhhh-chhhHHHHHHHHHH Confidence 22211 11111111 111223345555566666555556666667777777766666444 56789999999999 Q ss_pred HHHHHhhccEEEeeccc--------cceEEEEecCCCCcccccccccccccCHHHHHHHHHHHHHHHHHHhCCceeccCC Q lcl|Aclame:pro 157 LGLAKFLNGSYLFGVAG--------LENYGLINDPSLSAPITATTPWSGSPAVEAVVNEVVTLFQVLQTQSQGIITQEAV 228 (336) Q Consensus 157 ~a~e~~~n~i~~~Gd~~--------~g~~GllN~Pnl~~~~~~~t~w~~~~T~~eI~~Di~~l~~~l~~~t~g~v~~~~p 228 (336) ..+.+.+....+.||+. ....|+|+...... +..+. .+.+.+++.+.|+-..+..-..+.. .. T Consensus 128 e~~g~~~~~~~~nGdg~~~s~~~~~~~p~G~l~~a~~~~--~~~~~-~~~~~~~~~~~~l~~sl~~~yr~~~------~~ 198 (314) T protein:vir:41 128 SGVTYDLECFFLHADSSLTTGRELYRINDGWMKLAGNQY--TDAEP-EDENWPLNLFDGMMDELDTRYLQLK------PR 198 (314) T ss_pred HHHHHHHHHHhhccccCCcCcccchhcchhhhhhcccce--eecCc-cccccHHHHHHHHHHhcCchhhcCC------Cc Confidence 99999999999999974 24567777532221 11111 0112233333333333322222221 23 Q ss_pred cEEEecHHHHHhccc-CCCCCccHHHHHHHh-----CCccEEEEcccccCCC-CceEEEEEEeeCCCceEEEEeCchhhc Q lcl|Aclame:pro 229 LHMGLPPTAMSDLSK-TNQYGLSAAAKLKEI-----FPKLEFVTIPEYDTAS-GRLVQLWAPRVEGKDTATCGFTEKMRA 301 (336) Q Consensus 229 ~tL~Lp~~~~~~Ls~-~~~~~~Tvl~~l~~n-----~pnl~i~~~pel~~a~-G~~~~~~~~~~~~~~~~~~~~p~~~~~ 301 (336) ...+|++..+..+.+ -..-+..+++..... +-+..++.+|.+.+.+ +....|+.+- +.+-..+.+.+|. T Consensus 199 ~~~~m~~~t~~~~r~~l~~~~~~l~~~~~~~~~~~~l~G~PV~~~~~~~~~~~~~~~i~fgd~----~nlv~~~~~~ir~ 274 (314) T protein:vir:41 199 MKFYVSNEIYNGYRKQLLVRETGLGDSALIGATGLQYDGIPIQYVPALDALGDDKARALLTVP----TNLVYGFWRNIRI 274 (314) T ss_pred eEEEecHHHHHHHHHHHhccCCcccchhhhCCCCceecceeeEecccccccCCCCceEEEech----hheEEEeeceeEE Confidence 468888876655432 111111122222111 2245677788887654 5556666542 2233455555665 Q ss_pred ccc-eecCCceEEeeecceeeeEEecccceeeeccC Q lcl|Aclame:pro 302 HSI-ERYSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) Q Consensus 302 l~~-~~~~~~~~v~~~~rt~Gv~ir~P~ai~~~~GI 336 (336) ++- ..+...+..-.+.|+.....-.+.++..+.+= T Consensus 275 ~~~~~a~~~~~~~~~~~r~d~~~~~~~aa~~~~~~~ 310 (314) T protein:vir:41 275 EPKRDAAMRRTEYIASLRADCNYEDENAAVAAVIDM 310 (314) T ss_pred eecccCcCCeEEEEEEEEeceEEEEcCcEEEEEeec Confidence 542 22233455555556655555566777766665 No 74 >protein:vir:4456 Length: 401 # NCBI annotation: Major capsid protein precursor # Family: family:all:21 # MgeID: mge:96 # MgeName: ST64B # Cross-refs: genbank:acc:NP_700379;genbank:gi:23505451;genbank:GeneID:955658 Probab=95.82 E-value=0.00094 Score=37.18 Aligned_cols=314 Identities=11% Similarity=0.047 Sum_probs=142.5 Q ss_pred CchHHHH-HH-------Hhhcceeccchh--hhhhhhhhhhh----hhhh--hhcCccc--cCCcc--hHHHHHHHhhCc Q lcl|Aclame:pro 1 MRDAQRI-QN-------LARAGVILPRSV--KNVSTPLAEYA----MDAA--DLSPHLS--STGSS--GIPNYLTTYVDP 60 (336) Q Consensus 1 m~~~~~~-~~-------l~~~g~~~~~~~--~~~~~~~~~~~----~da~--d~~~~l~--t~~~~--~i~~~l~~~idp 60 (336) +...++. .+ .++......... .+ ...+..+. .+.. --...++ +.+.+ .||..+.+ T Consensus 51 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~-~~a~~~~lr~~~~~~~~~~e~~a~~~~~~~~GG~~iP~~~~~---- 125 (401) T protein:vir:44 51 LSELENLKSDLEKELLELKRPARGAQNKVAAEH-KDAFVGFLRKGREDGLRDLERKALQVGTDEDGGYAVPEELDR---- 125 (401) T ss_pred HHHHHHHHHHHHHHHHHhhccccccccchhHHH-HHHHHHHHhhhhhhhhHHHHHHHhhcCCCCCCceeccHhHHH---- Confidence 1110000 00 111111000000 00 00000000 0000 0000111 12222 35665543 Q ss_pred eeeeeeccccchhhhcccccCCCcceeeEEEeeeecccceEEeecccCCceeee-eeeeeeeeEEEEEEEEEeCHHHHHH Q lcl|Aclame:pro 61 SVIDILVAPMKAAELVGESKKGDWTTLVAAFITAEPTTTVATYGDYSSDGDSGT-NINYPQRQSYFFQTWTRWGERELEM 139 (336) Q Consensus 61 ~v~~~~~~~~~~~~l~~v~t~g~w~~~t~~~~v~e~~G~a~~ygd~~DiP~vd~-~~~~~~~~v~~~~~~~~y~~~El~~ 139 (336) +|++.+-..-....+..+.+.+. ....+++......+.+.+.....|-.+. ......-.++.++..+.+|.+=+.. T Consensus 126 ~ii~~~~~~~~l~~~~~~~~~~~---~~~~~~~~~~~~~a~wv~E~~~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~d 202 (401) T protein:vir:44 126 SILSLLKDEVVMRQEATVITVGG---SDYKKLVNLGGTASGWVGETDTRSQTATSRLGLIEPFMGEIYGNPQATQKMLDD 202 (401) T ss_pred HHHHHHHhhhhhhhhceeeecCC---CceEEEEecCCccceeeccccccCccccccceeeeeehhheeeehhhhHHHHhc Confidence 34444433333344444332221 1234555444455666666666765543 5666677777788888888765543 Q ss_pred HHHhCCCHHHHHHHHHHHHHHHhhccEEEeeccccceEEEEecCCCCccccccccccc------ccCHHHHHHHHHHHHH Q lcl|Aclame:pro 140 AGAGRVDLASELNYSSALGLAKFLNGSYLFGVAGLENYGLINDPSLSAPITATTPWSG------SPAVEAVVNEVVTLFQ 213 (336) Q Consensus 140 A~~~g~~l~~~K~~aAr~a~e~~~n~i~~~Gd~~~g~~GllN~Pnl~~~~~~~t~w~~------~~T~~eI~~Di~~l~~ 213 (336) ...+|.+.-....+.++.+.++.-.++|++.....|+|+.+......... .+.. .++..--++||.+++. T Consensus 203 ---s~~~l~~~i~~~la~ai~~~~~~~~l~G~G~~~p~Gil~~~~~~~~~~~~-~~~~~~~~~t~~~~~~~~d~i~~~~~ 278 (401) T protein:vir:44 203 ---AFFNVEAWINSELATEFAEQEEIAFTTGDGTKKPKGFLAYESTEESDKAR-AFGKLQHIVSGEATAVTADAIIKLIY 278 (401) T ss_pred ---chHHHHHHHHHHHHHHHHHHHHhhhhccCCCCccceeecccccccccccc-ccccccccccccccccCHHHHHHHHH Confidence 35688888888899999999999999999988899999987765432211 1100 0111112677777777 Q ss_pred HHHHHhCCceeccCCcEEEecHHHHHhccc-CCCCCccHHHH-HHHhCC----ccEEEEcccccC-CCCceEEEEEEeeC Q lcl|Aclame:pro 214 VLQTQSQGIITQEAVLHMGLPPTAMSDLSK-TNQYGLSAAAK-LKEIFP----KLEFVTIPEYDT-ASGRLVQLWAPRVE 286 (336) Q Consensus 214 ~l~~~t~g~v~~~~p~tL~Lp~~~~~~Ls~-~~~~~~Tvl~~-l~~n~p----nl~i~~~pel~~-a~G~~~~~~~~~~~ 286 (336) .|...-. ..-+++|.++.+..|.. .+..|.-++.- +...-| ++-++....+.. ++|....+|.+- T Consensus 279 ~l~~~~~------~~a~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~g~~~~l~G~PVv~~~~~p~~~~~~~~i~~Gd~-- 350 (401) T protein:vir:44 279 TLRKAHR------TGAKFMMNNNSLFAIRLLKDTEGNYLWRPGLELGQPSSLAGYGIAENEQMPDIAADAKAIAFGNF-- 350 (401) T ss_pred hcchhhh------cCCEEEEcHHHHHHHHHhhccCCceeecCCcCCCCCceecceeeEEecCcCCccCCccEEEEeeh-- Confidence 6643211 12368999999988853 34334333210 111111 112222211111 222332333211 Q ss_pred CCceEEEEeCchhhccc-ceecCCceEEeeecceeeeEEecccceeeeccC Q lcl|Aclame:pro 287 GKDTATCGFTEKMRAHS-IERYSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) Q Consensus 287 ~~~~~~~~~p~~~~~l~-~~~~~~~~~v~~~~rt~Gv~ir~P~ai~~~~GI 336 (336) ..-..+.--+.++.+- .....-....-+..|.+|..+. |.||+.+..= T Consensus 351 -~~~~~i~~~~~~~~~~~~~~~~~~v~~~a~~r~d~~~~~-~~a~~~l~~~ 399 (401) T protein:vir:44 351 -KRGYTIVDRIGTRILRDPYTNKPFVGFYTTKRTGGMLVD-SQAIKLLKIA 399 (401) T ss_pred -hccEEEEEecceEEeeeccccCCcEEEEEEEEeccEEec-ccceEEEEee Confidence 0000010001111110 0111233445666677666555 8888775544 No 75 >protein:vir:97433 Length: 274 # NCBI annotation: ORF014 # Family: family:all:522 # MgeID: mge:1676 # MgeName: 92 # Cross-refs: genbank:acc:YP_240749;genbank:gi:66396420;genbank:GeneID:5133789 Probab=95.82 E-value=0.0012 Score=36.51 Aligned_cols=255 Identities=9% Similarity=0.006 Sum_probs=130.7 Q ss_pred hhhhhhhhcCccccCCcchHHHHHHHhhCceeeeeeccccchhhhcccccC--CCcceeeEEEeeeecccceEEeecccC Q lcl|Aclame:pro 31 YAMDAADLSPHLSSTGSSGIPNYLTTYVDPSVIDILVAPMKAAELVGESKK--GDWTTLVAAFITAEPTTTVATYGDYSS 108 (336) Q Consensus 31 ~~~da~d~~~~l~t~~~~~i~~~l~~~idp~v~~~~~~~~~~~~l~~v~t~--g~w~~~t~~~~v~e~~G~a~~ygd~~D 108 (336) || + ..+.-++--+|..+..||..++ ...+....+..+... |.. -.+++++.+...|.+..|.++++ T Consensus 1 ma---~----~~T~~~d~iiPev~~~~v~~~~----~~~l~~~~~~~~d~~l~g~~-G~tv~iP~~~~~g~a~~~~~g~~ 68 (274) T protein:vir:97 1 MP---Q----GLTKTSDQIIPEVLAPMMQAQL----EKKLRFASFAEVDSTLQGQP-GDTLTFPAFVYSGDAQVVAEGEK 68 (274) T ss_pred CC---c----cceehhheechHHHHHHHHHhh----hhhhhhcccceecccccCCC-CCEEEEeeecCCCccccccCCCc Confidence 11 1 1344556668888998885443 344555566555432 322 36899999999999999999999 Q ss_pred CceeeeeeeeeeeeEEEEEEEEEeCHHHHHHHHHhCCCHHHHHHHHHHHHHHHhhccEEEeeccccceEEEEecCCCCcc Q lcl|Aclame:pro 109 DGDSGTNINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASELNYSSALGLAKFLNGSYLFGVAGLENYGLINDPSLSAP 188 (336) Q Consensus 109 iP~vd~~~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~K~~aAr~a~e~~~n~i~~~Gd~~~g~~GllN~Pnl~~~ 188 (336) ++.-+.........+...+-+ |.+.++.+++..+ ++..+-...+.+++.+++++..+ ..++.-.+ T Consensus 69 i~~~~lt~~~~~~~i~~~~~~--~~i~D~~~~~~~~-dp~~~~~~~~a~a~a~~vd~~~~---------~~l~~a~~--- 133 (274) T protein:vir:97 69 IPTDILETKKREAKIRKIAKG--TSITDEALLSGYG-DPQGEQVRQHGLAHANKVDNDVL---------EALMGAKL--- 133 (274) T ss_pred ccccccccceeEEEeeeecce--ecccHHHHHhccc-hHHHHHHHHHHHHHHHHHHHHHH---------HHHhccCc--- Confidence 999999998888888776655 4445555555444 45556666667777777776433 11111001 Q ss_pred cccccccccccCHHHHHHHHHHHHHHHHHHhCCceeccCCcEEEecHHHHHhcccCC--------CCCccHHH-HHHHhC Q lcl|Aclame:pro 189 ITATTPWSGSPAVEAVVNEVVTLFQVLQTQSQGIITQEAVLHMGLPPTAMSDLSKTN--------QYGLSAAA-KLKEIF 259 (336) Q Consensus 189 ~~~~t~w~~~~T~~eI~~Di~~l~~~l~~~t~g~v~~~~p~tL~Lp~~~~~~Ls~~~--------~~~~Tvl~-~l~~n~ 259 (336) +.++ ++.+ +++|.++...+-.. ...+..|+++|..+..|.+-+ .++..++. =.--+| T Consensus 134 -~~~~---~~~~----~d~i~dA~~~l~d~------~~~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~~~~G~ig~~ 199 (274) T protein:vir:97 134 -TVNA---DITK----LNGLQSAIDKFNDE------DLEPMVLFVNPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEA 199 (274) T ss_pred -cccc---cccC----HHHHHHHHHHhhcc------CCCceEEEeCHHHHHHHHhhhhhhccccCcccccceecccccee Confidence 0111 1223 34455555544322 124678999999999886421 11111110 000012 Q ss_pred CccEEEEcccccCCCCceEEEEEEeeCCCceEEEEeCchhh--cccceecCCceEEeeecceeeeEEecccceeeeccC Q lcl|Aclame:pro 260 PKLEFVTIPEYDTASGRLVQLWAPRVEGKDTATCGFTEKMR--AHSIERYSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) Q Consensus 260 pnl~i~~~pel~~a~G~~~~~~~~~~~~~~~~~~~~p~~~~--~l~~~~~~~~~~v~~~~rt~Gv~ir~P~ai~~~~GI 336 (336) -+++|...+.+- -...+++- +.-+......+.+ ..-.+.+.. -.+- .-...|+-+.+|..++.+.-= T Consensus 200 ~G~~Vi~s~~~p---~~t~~l~~-----~gA~~~~~~~~~~vE~~Rd~~~~~-d~i~-~~~~y~~~~~~~~~vv~~t~~ 268 (274) T protein:vir:97 200 LGAIIVRTNKLE---AGTAILAK-----KGAVKLILKRDFFLEVARDASTKT-TALY-SDKHYVAYLYDESKAVKITKG 268 (274) T ss_pred cCeeEEEcCCCC---cceEEEEe-----CcceEeeecCCceeccccchhhcc-cEEE-EEEEEEEEEEcCCceEEEecC Confidence 234444333221 11222221 1111111111111 000011111 1111 113567788888777765543 No 76 >protein:vir:94494 Length: 274 # NCBI annotation: ORF015 # Family: family:all:522 # MgeID: mge:1508 # MgeName: 88 # Cross-refs: genbank:acc:YP_240676;genbank:gi:66396348;genbank:GeneID:5133758 Probab=95.82 E-value=0.0012 Score=36.51 Aligned_cols=255 Identities=9% Similarity=0.006 Sum_probs=130.7 Q ss_pred hhhhhhhhcCccccCCcchHHHHHHHhhCceeeeeeccccchhhhcccccC--CCcceeeEEEeeeecccceEEeecccC Q lcl|Aclame:pro 31 YAMDAADLSPHLSSTGSSGIPNYLTTYVDPSVIDILVAPMKAAELVGESKK--GDWTTLVAAFITAEPTTTVATYGDYSS 108 (336) Q Consensus 31 ~~~da~d~~~~l~t~~~~~i~~~l~~~idp~v~~~~~~~~~~~~l~~v~t~--g~w~~~t~~~~v~e~~G~a~~ygd~~D 108 (336) || + ..+.-++--+|..+..||..++ ...+....+..+... |.. -.+++++.+...|.+..|.++++ T Consensus 1 ma---~----~~T~~~d~iiPev~~~~v~~~~----~~~l~~~~~~~~d~~l~g~~-G~tv~iP~~~~~g~a~~~~~g~~ 68 (274) T protein:vir:94 1 MP---Q----GLTKTSDQIIPEVLAPMMQAQL----EKKLRFASFAEVDSTLQGQP-GDTLTFPAFVYSGDAQVVAEGEK 68 (274) T ss_pred CC---c----cceehhheechHHHHHHHHHhh----hhhhhhcccceecccccCCC-CCEEEEeeecCCCccccccCCCc Confidence 11 1 1344556668888998885443 344555566555432 322 36899999999999999999999 Q ss_pred CceeeeeeeeeeeeEEEEEEEEEeCHHHHHHHHHhCCCHHHHHHHHHHHHHHHhhccEEEeeccccceEEEEecCCCCcc Q lcl|Aclame:pro 109 DGDSGTNINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASELNYSSALGLAKFLNGSYLFGVAGLENYGLINDPSLSAP 188 (336) Q Consensus 109 iP~vd~~~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~K~~aAr~a~e~~~n~i~~~Gd~~~g~~GllN~Pnl~~~ 188 (336) ++.-+.........+...+-+ |.+.++.+++..+ ++..+-...+.+++.+++++..+ ..++.-.+ T Consensus 69 i~~~~lt~~~~~~~i~~~~~~--~~i~D~~~~~~~~-dp~~~~~~~~a~a~a~~vd~~~~---------~~l~~a~~--- 133 (274) T protein:vir:94 69 IPTDILETKKREAKIRKIAKG--TSITDEALLSGYG-DPQGEQVRQHGLAHANKVDNDVL---------EALMGAKL--- 133 (274) T ss_pred ccccccccceeEEEeeeecce--ecccHHHHHhccc-hHHHHHHHHHHHHHHHHHHHHHH---------HHHhccCc--- Confidence 999999998888888776655 4445555555444 45556666667777777776433 11111001 Q ss_pred cccccccccccCHHHHHHHHHHHHHHHHHHhCCceeccCCcEEEecHHHHHhcccCC--------CCCccHHH-HHHHhC Q lcl|Aclame:pro 189 ITATTPWSGSPAVEAVVNEVVTLFQVLQTQSQGIITQEAVLHMGLPPTAMSDLSKTN--------QYGLSAAA-KLKEIF 259 (336) Q Consensus 189 ~~~~t~w~~~~T~~eI~~Di~~l~~~l~~~t~g~v~~~~p~tL~Lp~~~~~~Ls~~~--------~~~~Tvl~-~l~~n~ 259 (336) +.++ ++.+ +++|.++...+-.. ...+..|+++|..+..|.+-+ .++..++. =.--+| T Consensus 134 -~~~~---~~~~----~d~i~dA~~~l~d~------~~~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~~~~G~ig~~ 199 (274) T protein:vir:94 134 -TVNA---DITK----LNGLQSAIDKFNDE------DLEPMVLFVNPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEA 199 (274) T ss_pred -cccc---cccC----HHHHHHHHHHhhcc------CCCceEEEeCHHHHHHHHhhhhhhccccCcccccceecccccee Confidence 0111 1223 34455555544322 124678999999999886421 11111110 000012 Q ss_pred CccEEEEcccccCCCCceEEEEEEeeCCCceEEEEeCchhh--cccceecCCceEEeeecceeeeEEecccceeeeccC Q lcl|Aclame:pro 260 PKLEFVTIPEYDTASGRLVQLWAPRVEGKDTATCGFTEKMR--AHSIERYSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) Q Consensus 260 pnl~i~~~pel~~a~G~~~~~~~~~~~~~~~~~~~~p~~~~--~l~~~~~~~~~~v~~~~rt~Gv~ir~P~ai~~~~GI 336 (336) -+++|...+.+- -...+++- +.-+......+.+ ..-.+.+.. -.+- .-...|+-+.+|..++.+.-= T Consensus 200 ~G~~Vi~s~~~p---~~t~~l~~-----~gA~~~~~~~~~~vE~~Rd~~~~~-d~i~-~~~~y~~~~~~~~~vv~~t~~ 268 (274) T protein:vir:94 200 LGAIIVRTNKLE---AGTAILAK-----KGAVKLILKRDFFLEVARDASTKT-TALY-SDKHYVAYLYDESKAVKITKG 268 (274) T ss_pred cCeeEEEcCCCC---cceEEEEe-----CcceEeeecCCceeccccchhhcc-cEEE-EEEEEEEEEEcCCceEEEecC Confidence 234444333221 11222221 1111111111111 000011111 1111 113567788888777765543 No 77 >protein:vir:98339 Length: 415 # NCBI annotation: putative capsid protein # Family: family:all:21 # MgeID: mge:1581 # MgeName: phiPVL(108) # Cross-refs: genbank:acc:YP_918931;genbank:gi:119443693;genbank:GeneID:4594501 Probab=95.75 E-value=0.0015 Score=36.05 Aligned_cols=304 Identities=9% Similarity=0.054 Sum_probs=134.0 Q ss_pred CchH----HHHH-------------------------HHhhcceeccchhhhhhhhhhhh---hhhhhhhcCccccCCcc Q lcl|Aclame:pro 1 MRDA----QRIQ-------------------------NLARAGVILPRSVKNVSTPLAEY---AMDAADLSPHLSSTGSS 48 (336) Q Consensus 1 m~~~----~~~~-------------------------~l~~~g~~~~~~~~~~~~~~~~~---~~da~d~~~~l~t~~~~ 48 (336) ++.. +++. +....+-.+.. ......+.+.+ ............+++.+ T Consensus 51 i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g 129 (415) T protein:vir:98 51 IQEKQEELDKLKEKDGTSENNQQSVEVNEARTYRNQANINDLGISIQN-TKVTSQEVRDFTEYLETRNDIQGGSLKTDSG 129 (415) T ss_pred HHHHHHHHHHHHHHHhhhhhcccccccchhhhHHHHHHHHHHhhhhhh-hhhHHHHHHHHHHHHhhhhhhhhcccccccc Confidence 0000 0000 00000000000 00000111110 00000110111222222 Q ss_pred --hHHHHHHHhhCceeeeeeccccchhhhcccccCCCcceeeEEEeeeecccceEEeecccCCceee-eeeeeeeeeEEE Q lcl|Aclame:pro 49 --GIPNYLTTYVDPSVIDILVAPMKAAELVGESKKGDWTTLVAAFITAEPTTTVATYGDYSSDGDSG-TNINYPQRQSYF 125 (336) Q Consensus 49 --~i~~~l~~~idp~v~~~~~~~~~~~~l~~v~t~g~w~~~t~~~~v~e~~G~a~~ygd~~DiP~vd-~~~~~~~~~v~~ 125 (336) .+|..+. ++|++..........++.+.....- ...+.+......+.+...+...++|-.+ .........++. T Consensus 130 g~~iP~~~~----~~ii~~~~~~~~l~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~v~E~~~~~~~~~~~~~~v~~~~~k 204 (415) T protein:vir:98 130 FVVIPEEIV----TDILKLKEVEFNLDKYVTVKRVTNG-SGKYPVVRQSEVAALEKVEELEENPELAVKPFFQLAYDINT 204 (415) T ss_pred ccccchHHH----HHHHHHHHhhhhhhhheeeeeccCC-ceeEEEEeecCCccceeeccccccCcccccceeeEEeeeee Confidence 3665544 4555655555556666655332210 1122233333344556777788887554 577888888888 Q ss_pred EEEEEEeCHHHHHHHHHhCCCHHHHHHHHHHHHHHHhhccEEEeeccccce-EEEEecCCCCcccccccccccccCHHHH Q lcl|Aclame:pro 126 FQTWTRWGERELEMAGAGRVDLASELNYSSALGLAKFLNGSYLFGVAGLEN-YGLINDPSLSAPITATTPWSGSPAVEAV 204 (336) Q Consensus 126 ~~~~~~y~~~El~~A~~~g~~l~~~K~~aAr~a~e~~~n~i~~~Gd~~~g~-~GllN~Pnl~~~~~~~t~w~~~~T~~eI 204 (336) ++..+.+|.+=++ ....++.+.-......++.+.+|+-++.|+..... .++++....+. +.+.. +.. - T Consensus 205 ~~~~~~iS~ell~---ds~~~l~~~i~~~l~~~~~~~~~~~il~g~g~g~~~~~~~~~~~~~~--~~~~~--~~~----~ 273 (415) T protein:vir:98 205 HRGYFRISREAIE---DAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGFEKEGK--KLEVK--KAK----S 273 (415) T ss_pred eEeeehhhHHHHh---hchHHHHHHHHHHHHHHHHHHHHHHHhhccccCcccccccccccccc--ccccc--ccc----c Confidence 9888888865333 34567888888888888888888888887754322 23332211111 11110 112 2 Q ss_pred HHHHHHHHHHHHHHhCCceeccCCcEEEecHHHHHhccc-CCCCCccHHH-HHHHhCC----ccEEEEccccc-CCCCce Q lcl|Aclame:pro 205 VNEVVTLFQVLQTQSQGIITQEAVLHMGLPPTAMSDLSK-TNQYGLSAAA-KLKEIFP----KLEFVTIPEYD-TASGRL 277 (336) Q Consensus 205 ~~Di~~l~~~l~~~t~g~v~~~~p~tL~Lp~~~~~~Ls~-~~~~~~Tvl~-~l~~n~p----nl~i~~~pel~-~a~G~~ 277 (336) ++||.+++..+...- -.+..++|.++.+..|.+ .+..|.-++. =+....+ +..++..+..- +++|.. T Consensus 274 ~~~i~~~~~~~~~~~------~~~~~~v~n~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~ 347 (415) T protein:vir:98 274 LDDIKDAINLNVKPN------YEHNVAIVSQTMFAKLDKMKDKLGNYLIQPDVKEKTQQRLLGAKIEILPDEVLGQKGNN 347 (415) T ss_pred hhHHHHHHHhhhhhc------cCCCEEEEcHHHHHHHHHhhccCCceeeccCcCCCCCceecceeeEEecccccCCCCcc Confidence 566777776664321 135679999999888864 3333332221 0011111 12233333322 223333 Q ss_pred EEEEEEeeC-----CCceEEEEeCchhhcccceecCCceEEeeecceeeeEEecccceeeeccC Q lcl|Aclame:pro 278 VQLWAPRVE-----GKDTATCGFTEKMRAHSIERYSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) Q Consensus 278 ~~~~~~~~~-----~~~~~~~~~p~~~~~l~~~~~~~~~~v~~~~rt~Gv~ir~P~ai~~~~GI 336 (336) ..+|.+-.+ ...-..+.+ .+|.. .....-+..|. |+.+++|-||+.++-- T Consensus 348 ~~~~Gd~~~~~~~~~~~~~~v~~-~~~~~-------~~~~~~~~~r~-d~~v~~~~a~~~~~~~ 402 (415) T protein:vir:98 348 TLIIGNLKDAIVLFDRSQYQASW-TDYMH-------FGECLMIAVRQ-DCRILDYKSAIVIEYD 402 (415) T ss_pred EEEEEehhccEEEEeecceEEEE-ecccc-------CceEEEEEEEe-ccEEeccccEEEEEEe Confidence 333322100 000011111 11111 11122244565 5666789999988655 No 78 >protein:vir:79987 Length: 415 # NCBI annotation: head protein # Family: family:all:21 # MgeID: mge:1875 # MgeName: tp310-3 # Cross-refs: genbank:acc:YP_001430002;genbank:gi:156604057;genbank:GeneID:5525447 Probab=95.75 E-value=0.0015 Score=36.05 Aligned_cols=304 Identities=9% Similarity=0.054 Sum_probs=134.0 Q ss_pred CchH----HHHH-------------------------HHhhcceeccchhhhhhhhhhhh---hhhhhhhcCccccCCcc Q lcl|Aclame:pro 1 MRDA----QRIQ-------------------------NLARAGVILPRSVKNVSTPLAEY---AMDAADLSPHLSSTGSS 48 (336) Q Consensus 1 m~~~----~~~~-------------------------~l~~~g~~~~~~~~~~~~~~~~~---~~da~d~~~~l~t~~~~ 48 (336) ++.. +++. +....+-.+.. ......+.+.+ ............+++.+ T Consensus 51 i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g 129 (415) T protein:vir:79 51 IQEKQEELDKLKEKDGTSENNQQSVEVNEARTYRNQANINDLGISIQN-TKVTSQEVRDFTEYLETRNDIQGGSLKTDSG 129 (415) T ss_pred HHHHHHHHHHHHHHHhhhhhcccccccchhhhHHHHHHHHHHhhhhhh-hhhHHHHHHHHHHHHhhhhhhhhcccccccc Confidence 0000 0000 00000000000 00000111110 00000110111222222 Q ss_pred --hHHHHHHHhhCceeeeeeccccchhhhcccccCCCcceeeEEEeeeecccceEEeecccCCceee-eeeeeeeeeEEE Q lcl|Aclame:pro 49 --GIPNYLTTYVDPSVIDILVAPMKAAELVGESKKGDWTTLVAAFITAEPTTTVATYGDYSSDGDSG-TNINYPQRQSYF 125 (336) Q Consensus 49 --~i~~~l~~~idp~v~~~~~~~~~~~~l~~v~t~g~w~~~t~~~~v~e~~G~a~~ygd~~DiP~vd-~~~~~~~~~v~~ 125 (336) .+|..+. ++|++..........++.+.....- ...+.+......+.+...+...++|-.+ .........++. T Consensus 130 g~~iP~~~~----~~ii~~~~~~~~l~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~v~E~~~~~~~~~~~~~~v~~~~~k 204 (415) T protein:vir:79 130 FVVIPEEIV----TDILKLKEVEFNLDKYVTVKRVTNG-SGKYPVVRQSEVAALEKVEELEENPELAVKPFFQLAYDINT 204 (415) T ss_pred ccccchHHH----HHHHHHHHhhhhhhhheeeeeccCC-ceeEEEEeecCCccceeeccccccCcccccceeeEEeeeee Confidence 3665544 4555655555556666655332210 1122233333344556777788887554 577888888888 Q ss_pred EEEEEEeCHHHHHHHHHhCCCHHHHHHHHHHHHHHHhhccEEEeeccccce-EEEEecCCCCcccccccccccccCHHHH Q lcl|Aclame:pro 126 FQTWTRWGERELEMAGAGRVDLASELNYSSALGLAKFLNGSYLFGVAGLEN-YGLINDPSLSAPITATTPWSGSPAVEAV 204 (336) Q Consensus 126 ~~~~~~y~~~El~~A~~~g~~l~~~K~~aAr~a~e~~~n~i~~~Gd~~~g~-~GllN~Pnl~~~~~~~t~w~~~~T~~eI 204 (336) ++..+.+|.+=++ ....++.+.-......++.+.+|+-++.|+..... .++++....+. +.+.. +.. - T Consensus 205 ~~~~~~iS~ell~---ds~~~l~~~i~~~l~~~~~~~~~~~il~g~g~g~~~~~~~~~~~~~~--~~~~~--~~~----~ 273 (415) T protein:vir:79 205 HRGYFRISREAIE---DAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGFEKEGK--KLEVK--KAK----S 273 (415) T ss_pred eEeeehhhHHHHh---hchHHHHHHHHHHHHHHHHHHHHHHHhhccccCcccccccccccccc--ccccc--ccc----c Confidence 9888888865333 34567888888888888888888888887754322 23332211111 11110 112 2 Q ss_pred HHHHHHHHHHHHHHhCCceeccCCcEEEecHHHHHhccc-CCCCCccHHH-HHHHhCC----ccEEEEccccc-CCCCce Q lcl|Aclame:pro 205 VNEVVTLFQVLQTQSQGIITQEAVLHMGLPPTAMSDLSK-TNQYGLSAAA-KLKEIFP----KLEFVTIPEYD-TASGRL 277 (336) Q Consensus 205 ~~Di~~l~~~l~~~t~g~v~~~~p~tL~Lp~~~~~~Ls~-~~~~~~Tvl~-~l~~n~p----nl~i~~~pel~-~a~G~~ 277 (336) ++||.+++..+...- -.+..++|.++.+..|.+ .+..|.-++. =+....+ +..++..+..- +++|.. T Consensus 274 ~~~i~~~~~~~~~~~------~~~~~~v~n~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~ 347 (415) T protein:vir:79 274 LDDIKDAINLNVKPN------YEHNVAIVSQTMFAKLDKMKDKLGNYLIQPDVKEKTQQRLLGAKIEILPDEVLGQKGNN 347 (415) T ss_pred hhHHHHHHHhhhhhc------cCCCEEEEcHHHHHHHHHhhccCCceeeccCcCCCCCceecceeeEEecccccCCCCcc Confidence 566777776664321 135679999999888864 3333332221 0011111 12233333322 223333 Q ss_pred EEEEEEeeC-----CCceEEEEeCchhhcccceecCCceEEeeecceeeeEEecccceeeeccC Q lcl|Aclame:pro 278 VQLWAPRVE-----GKDTATCGFTEKMRAHSIERYSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) Q Consensus 278 ~~~~~~~~~-----~~~~~~~~~p~~~~~l~~~~~~~~~~v~~~~rt~Gv~ir~P~ai~~~~GI 336 (336) ..+|.+-.+ ...-..+.+ .+|.. .....-+..|. |+.+++|-||+.++-- T Consensus 348 ~~~~Gd~~~~~~~~~~~~~~v~~-~~~~~-------~~~~~~~~~r~-d~~v~~~~a~~~~~~~ 402 (415) T protein:vir:79 348 TLIIGNLKDAIVLFDRSQYQASW-TDYMH-------FGECLMIAVRQ-DCRILDYKSAIVIEYD 402 (415) T ss_pred EEEEEehhccEEEEeecceEEEE-ecccc-------CceEEEEEEEe-ccEEeccccEEEEEEe Confidence 333322100 000011111 11111 11122244565 5666789999988655 No 79 >protein:vir:81100 Length: 415 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:1891 # MgeName: tp310-1 # Cross-refs: genbank:acc:YP_001429874;genbank:gi:156603927;genbank:GeneID:5525320 Probab=95.75 E-value=0.0015 Score=36.05 Aligned_cols=304 Identities=9% Similarity=0.054 Sum_probs=134.0 Q ss_pred CchH----HHHH-------------------------HHhhcceeccchhhhhhhhhhhh---hhhhhhhcCccccCCcc Q lcl|Aclame:pro 1 MRDA----QRIQ-------------------------NLARAGVILPRSVKNVSTPLAEY---AMDAADLSPHLSSTGSS 48 (336) Q Consensus 1 m~~~----~~~~-------------------------~l~~~g~~~~~~~~~~~~~~~~~---~~da~d~~~~l~t~~~~ 48 (336) ++.. +++. +....+-.+.. ......+.+.+ ............+++.+ T Consensus 51 i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g 129 (415) T protein:vir:81 51 IQEKQEELDKLKEKDGTSENNQQSVEVNEARTYRNQANINDLGISIQN-TKVTSQEVRDFTEYLETRNDIQGGSLKTDSG 129 (415) T ss_pred HHHHHHHHHHHHHHHhhhhhcccccccchhhhHHHHHHHHHHhhhhhh-hhhHHHHHHHHHHHHhhhhhhhhcccccccc Confidence 0000 0000 00000000000 00000111110 00000110111222222 Q ss_pred --hHHHHHHHhhCceeeeeeccccchhhhcccccCCCcceeeEEEeeeecccceEEeecccCCceee-eeeeeeeeeEEE Q lcl|Aclame:pro 49 --GIPNYLTTYVDPSVIDILVAPMKAAELVGESKKGDWTTLVAAFITAEPTTTVATYGDYSSDGDSG-TNINYPQRQSYF 125 (336) Q Consensus 49 --~i~~~l~~~idp~v~~~~~~~~~~~~l~~v~t~g~w~~~t~~~~v~e~~G~a~~ygd~~DiP~vd-~~~~~~~~~v~~ 125 (336) .+|..+. ++|++..........++.+.....- ...+.+......+.+...+...++|-.+ .........++. T Consensus 130 g~~iP~~~~----~~ii~~~~~~~~l~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~v~E~~~~~~~~~~~~~~v~~~~~k 204 (415) T protein:vir:81 130 FVVIPEEIV----TDILKLKEVEFNLDKYVTVKRVTNG-SGKYPVVRQSEVAALEKVEELEENPELAVKPFFQLAYDINT 204 (415) T ss_pred ccccchHHH----HHHHHHHHhhhhhhhheeeeeccCC-ceeEEEEeecCCccceeeccccccCcccccceeeEEeeeee Confidence 3665544 4555655555556666655332210 1122233333344556777788887554 577888888888 Q ss_pred EEEEEEeCHHHHHHHHHhCCCHHHHHHHHHHHHHHHhhccEEEeeccccce-EEEEecCCCCcccccccccccccCHHHH Q lcl|Aclame:pro 126 FQTWTRWGERELEMAGAGRVDLASELNYSSALGLAKFLNGSYLFGVAGLEN-YGLINDPSLSAPITATTPWSGSPAVEAV 204 (336) Q Consensus 126 ~~~~~~y~~~El~~A~~~g~~l~~~K~~aAr~a~e~~~n~i~~~Gd~~~g~-~GllN~Pnl~~~~~~~t~w~~~~T~~eI 204 (336) ++..+.+|.+=++ ....++.+.-......++.+.+|+-++.|+..... .++++....+. +.+.. +.. - T Consensus 205 ~~~~~~iS~ell~---ds~~~l~~~i~~~l~~~~~~~~~~~il~g~g~g~~~~~~~~~~~~~~--~~~~~--~~~----~ 273 (415) T protein:vir:81 205 HRGYFRISREAIE---DAKVNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGFEKEGK--KLEVK--KAK----S 273 (415) T ss_pred eEeeehhhHHHHh---hchHHHHHHHHHHHHHHHHHHHHHHHhhccccCcccccccccccccc--ccccc--ccc----c Confidence 9888888865333 34567888888888888888888888887754322 23332211111 11110 112 2 Q ss_pred HHHHHHHHHHHHHHhCCceeccCCcEEEecHHHHHhccc-CCCCCccHHH-HHHHhCC----ccEEEEccccc-CCCCce Q lcl|Aclame:pro 205 VNEVVTLFQVLQTQSQGIITQEAVLHMGLPPTAMSDLSK-TNQYGLSAAA-KLKEIFP----KLEFVTIPEYD-TASGRL 277 (336) Q Consensus 205 ~~Di~~l~~~l~~~t~g~v~~~~p~tL~Lp~~~~~~Ls~-~~~~~~Tvl~-~l~~n~p----nl~i~~~pel~-~a~G~~ 277 (336) ++||.+++..+...- -.+..++|.++.+..|.+ .+..|.-++. =+....+ +..++..+..- +++|.. T Consensus 274 ~~~i~~~~~~~~~~~------~~~~~~v~n~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~ 347 (415) T protein:vir:81 274 LDDIKDAINLNVKPN------YEHNVAIVSQTMFAKLDKMKDKLGNYLIQPDVKEKTQQRLLGAKIEILPDEVLGQKGNN 347 (415) T ss_pred hhHHHHHHHhhhhhc------cCCCEEEEcHHHHHHHHHhhccCCceeeccCcCCCCCceecceeeEEecccccCCCCcc Confidence 566777776664321 135679999999888864 3333332221 0011111 12233333322 223333 Q ss_pred EEEEEEeeC-----CCceEEEEeCchhhcccceecCCceEEeeecceeeeEEecccceeeeccC Q lcl|Aclame:pro 278 VQLWAPRVE-----GKDTATCGFTEKMRAHSIERYSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) Q Consensus 278 ~~~~~~~~~-----~~~~~~~~~p~~~~~l~~~~~~~~~~v~~~~rt~Gv~ir~P~ai~~~~GI 336 (336) ..+|.+-.+ ...-..+.+ .+|.. .....-+..|. |+.+++|-||+.++-- T Consensus 348 ~~~~Gd~~~~~~~~~~~~~~v~~-~~~~~-------~~~~~~~~~r~-d~~v~~~~a~~~~~~~ 402 (415) T protein:vir:81 348 TLIIGNLKDAIVLFDRSQYQASW-TDYMH-------FGECLMIAVRQ-DCRILDYKSAIVIEYD 402 (415) T ss_pred EEEEEehhccEEEEeecceEEEE-ecccc-------CceEEEEEEEe-ccEEeccccEEEEEEe Confidence 333322100 000011111 11111 11122244565 5666789999988655 No 80 >protein:vir:93742 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1475 # MgeName: 55 # Cross-refs: genbank:acc:YP_240459;genbank:gi:66396126;genbank:GeneID:5133511 Probab=95.65 E-value=0.0014 Score=36.22 Aligned_cols=255 Identities=9% Similarity=0.002 Sum_probs=131.7 Q ss_pred hhhhhhhhcCccccCCcchHHHHHHHhhCceeeeeeccccchhhhcccccC--CCcceeeEEEeeeecccceEEeecccC Q lcl|Aclame:pro 31 YAMDAADLSPHLSSTGSSGIPNYLTTYVDPSVIDILVAPMKAAELVGESKK--GDWTTLVAAFITAEPTTTVATYGDYSS 108 (336) Q Consensus 31 ~~~da~d~~~~l~t~~~~~i~~~l~~~idp~v~~~~~~~~~~~~l~~v~t~--g~w~~~t~~~~v~e~~G~a~~ygd~~D 108 (336) ||- ..+.-++--+|..+..||..++ ...+....+..+... |.- -.++.++.++..|.++.|.++++ T Consensus 1 ma~-------~~T~~~~~iiPev~~~~v~~~~----~~~~~~~~~~~~~~~l~g~~-G~tv~ip~~~~~g~~~~~~eg~~ 68 (274) T protein:vir:93 1 MPQ-------GITKTSNQIIPEVLAPMMQAQL----EKKLRFASFAEVDSTLQGQP-GDTLTFPAFVYSGDAQVVAEGEK 68 (274) T ss_pred CCc-------cceehhheechHHHHHHHHHHH----HhhhhhcccccccccccCCC-CCEEEEEeeccCCCcccccCCCc Confidence 211 1344556678888888885443 334445555555432 222 34799999999999999999999 Q ss_pred CceeeeeeeeeeeeEEEEEEEEEeCHHHHHHHHHhCCCHHHHHHHHHHHHHHHhhccEEEeeccccceEEEEecCCCCcc Q lcl|Aclame:pro 109 DGDSGTNINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASELNYSSALGLAKFLNGSYLFGVAGLENYGLINDPSLSAP 188 (336) Q Consensus 109 iP~vd~~~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~K~~aAr~a~e~~~n~i~~~Gd~~~g~~GllN~Pnl~~~ 188 (336) ++..+.........+...+-+++++. +.+++. +-++..+-...+.+++.+++++..+- .++.-.+ T Consensus 69 i~~~~it~~~~~~~i~~~~~~~~i~D--~~~~~~-~~d~~~~~~~~~~~~~a~~~d~~~~~---------~~~~a~~--- 133 (274) T protein:vir:93 69 IPTDILETKKREAKIRKIAKGTSITD--EALLSG-YGDPQGEQVRQHGLAHANKVDNDVLE---------ALMGAKL--- 133 (274) T ss_pred ccccccccceeEEEeeeecccccccH--HHHHhh-ccchHHHHHHHHHHHHHHHHHHHHHH---------HHhcccc--- Confidence 99999999998888888765555554 444444 45566677777777787777764431 1111000 Q ss_pred cccccccccccCHHHHHHHHHHHHHHHHHHhCCceeccCCcEEEecHHHHHhcccCC--------CCCccHH-HHHHHhC Q lcl|Aclame:pro 189 ITATTPWSGSPAVEAVVNEVVTLFQVLQTQSQGIITQEAVLHMGLPPTAMSDLSKTN--------QYGLSAA-AKLKEIF 259 (336) Q Consensus 189 ~~~~t~w~~~~T~~eI~~Di~~l~~~l~~~t~g~v~~~~p~tL~Lp~~~~~~Ls~~~--------~~~~Tvl-~~l~~n~ 259 (336) +.++ ++.+ +++|.+++..+-.. + ..+..|+++|..+..|.+-. ..|-.++ +=.--.| T Consensus 134 -~~~~---~~~~----~d~i~dA~~~l~d~--~----~~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~~~~G~ig~~ 199 (274) T protein:vir:93 134 -TVNA---DITK----LNGLQSAIDKFNDE--D----LEPMVLFINPLDAGKLRGDASTNFTRATELGDDIIVKGAFGEA 199 (274) T ss_pred -cccc---cccC----HHHHHHHHHHhhhc--c----CCccEEEeCHHHHHHHHhhhhhcccccccccccceeeccccee Confidence 1111 1223 34455555444321 1 25678999999999886421 1111110 0000012 Q ss_pred CccEEEEcccccCCCCceEEEEEEeeCCCceEEEEeCchhh--cccceecCCceEEeeecceeeeEEecccceeeeccC Q lcl|Aclame:pro 260 PKLEFVTIPEYDTASGRLVQLWAPRVEGKDTATCGFTEKMR--AHSIERYSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) Q Consensus 260 pnl~i~~~pel~~a~G~~~~~~~~~~~~~~~~~~~~p~~~~--~l~~~~~~~~~~v~~~~rt~Gv~ir~P~ai~~~~GI 336 (336) =+++|...+.+. -.+++++- +.-+....-.+.+ ..-.+.+. .-.+- .-...|+-+.+|-+++.+.-= T Consensus 200 ~G~~Vi~s~~~p---~~t~~l~~-----~gai~~~~~~~~~vE~~Rd~~~~-~d~i~-~~~~y~~~~~~~~~~v~~t~~ 268 (274) T protein:vir:93 200 LGAIIVRTNKLE---AGTAILAK-----KGAVKLILKRDFFLEVARDASTK-TTALY-SDKHYVAYLYDESKAVKITKG 268 (274) T ss_pred cCeeEEEcCCCC---cceEEEEe-----CCeEEEEecCCcccccccchhhc-ccEEE-EEEEEEEEEEcCCceEEEeeC Confidence 234544433321 12222221 1111111111111 11111111 11111 124467888888877776644 No 81 >protein:vir:80930 Length: 278 # NCBI annotation: Cps # Family: family:all:522 # MgeID: mge:1886 # MgeName: A500 # Cross-refs: genbank:acc:YP_001468392;genbank:gi:157324966;genbank:GeneID:5601363 Probab=95.61 E-value=0.0011 Score=36.87 Aligned_cols=262 Identities=7% Similarity=-0.027 Sum_probs=136.1 Q ss_pred hhhhhhhhcCccccCCcchHHHHHHHhhCceeeeeeccccchhhhccccc--CCCcceeeEEEeeeecccceEEeecccC Q lcl|Aclame:pro 31 YAMDAADLSPHLSSTGSSGIPNYLTTYVDPSVIDILVAPMKAAELVGESK--KGDWTTLVAAFITAEPTTTVATYGDYSS 108 (336) Q Consensus 31 ~~~da~d~~~~l~t~~~~~i~~~l~~~idp~v~~~~~~~~~~~~l~~v~t--~g~w~~~t~~~~v~e~~G~a~~ygd~~D 108 (336) ||- ..++-++.-+|..+..||.-++ ...+....+..+.. +|.- -.++.++.+...|.+..|.++++ T Consensus 1 Ma~-------~~T~~~~~iiPev~s~~v~~~~----~~~~v~~~~~~~~~~l~g~~-G~tv~ip~~~~~g~a~~~~~g~~ 68 (278) T protein:vir:80 1 MAD-------LTTKLANLIDPEVMGPMISAKL----PKAIKFGKIAPIDNSLEGQP-GSEITVPKYKYIGDAQDVAEGAA 68 (278) T ss_pred CCC-------cceehhheecHHHHHHHHHHHH----HHhhhhcccceecccccCCC-CCEEEEeeeccCCcceeecCCCc Confidence 221 1244456678888888885443 22333334443332 2222 26789999999999999999999 Q ss_pred CceeeeeeeeeeeeEEEEEEEEEeCHHHHHHHHHhCCCHHHHHHHHHHHHHHHhhccEEEeeccccceEEEEecCCCCcc Q lcl|Aclame:pro 109 DGDSGTNINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASELNYSSALGLAKFLNGSYLFGVAGLENYGLINDPSLSAP 188 (336) Q Consensus 109 iP~vd~~~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~K~~aAr~a~e~~~n~i~~~Gd~~~g~~GllN~Pnl~~~ 188 (336) ++..+.........+...+-+++++ ++.+.+ .+.++..+-...+..++.+.+++..+-.-. |..+. + T Consensus 69 i~~~~lt~~~~~~~i~~~~~a~~v~--D~~~~~-~~~d~~~~~~~~~a~~~a~~~d~~l~~~l~-----~a~~~--~--- 135 (278) T protein:vir:80 69 IDYSALETESVKHGIKKAGKGVKLT--DESVLS-GYGDPVEEAQKQIRMAIASKVDNDILEEAL-----TTTLE--V--- 135 (278) T ss_pred CcccccccceeeEeeehhhcccccc--HHHHhh-ccccHHHHHHHHHHHHHHHHHHHHHHHHHh-----ccccc--c--- Confidence 9999999888888888776655554 444333 477788888888888888888875542111 21111 0 Q ss_pred cccccccccccCHHHHHHHHHHHHHHHHHHhCCceeccCCcEEEecHHHHHhcccCC--------CCCccHH-HHHHHhC Q lcl|Aclame:pro 189 ITATTPWSGSPAVEAVVNEVVTLFQVLQTQSQGIITQEAVLHMGLPPTAMSDLSKTN--------QYGLSAA-AKLKEIF 259 (336) Q Consensus 189 ~~~~t~w~~~~T~~eI~~Di~~l~~~l~~~t~g~v~~~~p~tL~Lp~~~~~~Ls~~~--------~~~~Tvl-~~l~~n~ 259 (336) +.+ ....+.+..++.+.++...+... .+ ..+..|+++|..+..|.+-. .++..++ +-.--.| T Consensus 136 -~~~---~t~~~~~~~~~~~~da~~~l~~~---~~--~~~~~ivv~p~~~~~L~k~~~~~~~~~~~~g~~~~~~G~ig~~ 206 (278) T protein:vir:80 136 -KGA---INIGLIDKIENTFTDAPDAIEDE---SI--TTTGVLFLNYKDTAKLREEAAGSWTKASQLGDDLLVKGAFGEL 206 (278) T ss_pred -ccc---cccchhhhHHHHHHHHHHhhccc---CC--CcccEEEECHHHHHHHHhhhhhhccccccccccceeeccceee Confidence 101 11123344455555554433221 11 13446999999988885321 1111110 0000012 Q ss_pred CccEEEEcccccCCCCceEEEEEEeeCCCceEEEEeCch--hhcccceecCCceEEeeecceeeeEEecccceeeeccC Q lcl|Aclame:pro 260 PKLEFVTIPEYDTASGRLVQLWAPRVEGKDTATCGFTEK--MRAHSIERYSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) Q Consensus 260 pnl~i~~~pel~~a~G~~~~~~~~~~~~~~~~~~~~p~~--~~~l~~~~~~~~~~v~~~~rt~Gv~ir~P~ai~~~~GI 336 (336) -+++|..-..+. -+..+++-.. -+......+ ....-.+.+.. -.+.. -...|+-+.+|-+++.+.-- T Consensus 207 ~G~~Vi~s~~~p---~~t~~l~~~g-----Ai~~~~~~~~~vE~~Rd~~~~~-d~i~~-~~~yg~~v~~~~~~v~it~~ 275 (278) T protein:vir:80 207 LGWEIVRTKKLA---DGNALAVKAG-----ALKTFLKRNLLAESGRDMDHKL-TKFNA-DQHYAVALVDETKAVKVVPV 275 (278) T ss_pred cceeEEEcCCCC---cceEEEEecc-----ceeeeecCCcccccccchhhcc-ceeee-eeEEEEEEEcCcceEEEeec Confidence 234444333332 1223333211 111111111 11111111111 11111 23468999999999888766 No 82 >protein:vir:4830 Length: 397 # NCBI annotation: MPL-7201 # Family: family:all:21 # MgeID: mge:105 # MgeName: 7201 # Cross-refs: genbank:acc:NP_038327;genbank:gi:9634653;genbank:GeneID:1262632 Probab=95.59 E-value=0.0018 Score=35.64 Aligned_cols=294 Identities=11% Similarity=-0.014 Sum_probs=134.1 Q ss_pred CchHHHHHHH---------------hhcceeccchhhhhhhh----hh----hhhhhhhhhcCccccCCcch--HHHHHH Q lcl|Aclame:pro 1 MRDAQRIQNL---------------ARAGVILPRSVKNVSTP----LA----EYAMDAADLSPHLSSTGSSG--IPNYLT 55 (336) Q Consensus 1 m~~~~~~~~l---------------~~~g~~~~~~~~~~~~~----~~----~~~~da~d~~~~l~t~~~~~--i~~~l~ 55 (336) ..+++.+... .+....-. ......+ ++ ....+ ...+...++.+++| ||..+. T Consensus 50 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~t~~~gg~~iP~~~~ 126 (397) T protein:vir:48 50 KMKRDMFKEQYTEARANEVVNMSEEEKKPLTKS--EEEVKAGFVKDFKNLVRGRYQN-LLDSKTDASGSDAGLTIPQDIQ 126 (397) T ss_pred HHHHHHHHHHHHHHHHhhhhhhhhhccccccch--hhHHHHHHHHHHHHHHhhhhhH-HHHHhhccCCccccccccHHHH Confidence 1111111100 00011000 0000000 00 00001 11122234444443 566554 Q ss_pred HhhCceeeeeeccccchhhhcccccCCCcceeeEEEeeeecccceEEeecccCCceee-eeeeeeeeeEEEEEEEEEeCH Q lcl|Aclame:pro 56 TYVDPSVIDILVAPMKAAELVGESKKGDWTTLVAAFITAEPTTTVATYGDYSSDGDSG-TNINYPQRQSYFFQTWTRWGE 134 (336) Q Consensus 56 ~~idp~v~~~~~~~~~~~~l~~v~t~g~w~~~t~~~~v~e~~G~a~~ygd~~DiP~vd-~~~~~~~~~v~~~~~~~~y~~ 134 (336) ++|++.+........++++.......-....++..+..+.+...+....+|-.+ .......-..+.++..+.+|. T Consensus 127 ----~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~v~~~~~k~~~~~~iS~ 202 (397) T protein:vir:48 127 ----TAIHTLVRQYDSLQEYVNVENVTTLTGSRVYEKWADITGLAKLDDEAGSIGTNDDPKLYPIRYAIKRYAGISTVTN 202 (397) T ss_pred ----HHHHHHHHHHHHHHhhhceeeccCCcceEEEEeecCCCcceeeeccccccccccccceeeEEeeheeeeeehhhHH Confidence 456666666666666665543333222223333344556677777778887654 577888888888888888887 Q ss_pred HHHHHHHHhCCCHHHHHHHHHHHHHHHhhccEEEeeccccceEEEEecCCCCcccccccccccccCHHHHHHHHHHHHHH Q lcl|Aclame:pro 135 RELEMAGAGRVDLASELNYSSALGLAKFLNGSYLFGVAGLENYGLINDPSLSAPITATTPWSGSPAVEAVVNEVVTLFQV 214 (336) Q Consensus 135 ~El~~A~~~g~~l~~~K~~aAr~a~e~~~n~i~~~Gd~~~g~~GllN~Pnl~~~~~~~t~w~~~~T~~eI~~Di~~l~~~ 214 (336) +=++.+ ..++.+.-......++.+.+++-.+.|++..... + ...+. +||.+++.. T Consensus 203 ell~ds---~~~l~~~v~~~l~~~~~~~~d~~il~G~g~~~~~------------~------~~~~~----d~i~~~~~~ 257 (397) T protein:vir:48 203 SLLADS---AENILAWLSGWIAKKVVVTRNKAILEAIATLPTK------------P------TLTKW----DDIIDLQAK 257 (397) T ss_pred HHHhhc---hHHHHHHHHHHHHHHHHHHHHHHHhhcccccccc------------c------ccccH----HHHHHHHHH Confidence 655443 3577777777788888888888888887543210 1 11233 345555555 Q ss_pred HHHHhCCceeccCCcEEEecHHHHHhccc-CCCCCccHHHH-HHHhCC----c--cEEEEcccccC-CCCceEEEEEEee Q lcl|Aclame:pro 215 LQTQSQGIITQEAVLHMGLPPTAMSDLSK-TNQYGLSAAAK-LKEIFP----K--LEFVTIPEYDT-ASGRLVQLWAPRV 285 (336) Q Consensus 215 l~~~t~g~v~~~~p~tL~Lp~~~~~~Ls~-~~~~~~Tvl~~-l~~n~p----n--l~i~~~pel~~-a~G~~~~~~~~~~ 285 (336) |...- .....++|.+..+..|.+ .+..|.-++.- +...-+ + +.+....-+.. +.+....+|.+ . T Consensus 258 l~~~~------~~~a~~v~n~~~~~~L~~lkd~~G~~i~~~~~~~~~~~~l~G~PV~~~~~~~~~~~~~~~~~~~~gd-~ 330 (397) T protein:vir:48 258 VDPAI------KQTSFFLTNTSGFTALKKVKNAFGDYLMERDVKSPTGYSIDGFAVKEVADRWLANASSGAMPLYFGD-L 330 (397) T ss_pred hhhhh------cCCCEEEECHHHHHHHHHhhcCCCceeeccCcCCCCCceeccceeEEecccccCCcCCCceEEEEEe-c Confidence 53321 124588999999998854 23333322210 111111 1 11111111222 22333333322 1 Q ss_pred CCCceEEEE----eCchhhcccc-eecCCceEEeeecceeeeEEecccceeeec--cC Q lcl|Aclame:pro 286 EGKDTATCG----FTEKMRAHSI-ERYSSYFRQKKSAGTWGAVIFRPFAVAQMI--GV 336 (336) Q Consensus 286 ~~~~~~~~~----~p~~~~~l~~-~~~~~~~~v~~~~rt~Gv~ir~P~ai~~~~--GI 336 (336) .+...+. +....-.+.- ....-....-+..|.+| .++.|.+|+.++ +. T Consensus 331 --~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~r~~~r~d~-~~~~~~a~~~~~~~~~ 385 (397) T protein:vir:48 331 --KQAVTLFDRQQMSLLSTNIGGGAFETDTTKIRVIDRFDV-VATDTESFVPASFKAI 385 (397) T ss_pred --cceEEEEeecceEEEEeccchhhhhcCceeEEEEeeecc-EEecccceEEEEeccc Confidence 1101010 1111111110 01112234455666666 457788886554 33 No 83 >protein:vir:100247 Length: 425 # NCBI annotation: gp76 # Family: family:all:21 # MgeID: mge:1619 # MgeName: Bcep176 # Cross-refs: genbank:acc:YP_355412;genbank:gi:77864702;genbank:GeneID:3725969 Probab=94.98 E-value=0.003 Score=34.39 Aligned_cols=314 Identities=12% Similarity=0.027 Sum_probs=146.2 Q ss_pred CchHHHHHHHhhcceeccc---hhhhhhhhhhhh--hhhhhhhcCccccCCcc--hHHHHHHHhhCceeeeeeccccchh Q lcl|Aclame:pro 1 MRDAQRIQNLARAGVILPR---SVKNVSTPLAEY--AMDAADLSPHLSSTGSS--GIPNYLTTYVDPSVIDILVAPMKAA 73 (336) Q Consensus 1 m~~~~~~~~l~~~g~~~~~---~~~~~~~~~~~~--~~da~d~~~~l~t~~~~--~i~~~l~~~idp~v~~~~~~~~~~~ 73 (336) +++......-++.+-.-.. ...+ ...+..+ ..+...+. ...+.+++ .+|..+.+ +|++.+...-... T Consensus 88 ~~~~~~~~~~~~~~~~~~~~~~~~~~-~~af~~~l~~~e~~~al-~~~t~~~gG~lvP~~~~~----~ii~~~~~~s~l~ 161 (425) T protein:vir:10 88 VDEANIKIAAAQMGANGVKPLRDPEY-TEAFKAHVKRGDVQAAL-NKGEDSEGGYLTPIEWDR----TITNKLVLISPMR 161 (425) T ss_pred HHHHHHHHHhhhcccccccccccHHH-HHHHHHHhhhhhhHHHh-hcCcCCCCceeccHhHHH----HHHHHHHhhhhhh Confidence 0000000000011100000 0000 0000000 00110000 01223333 35655543 4455444444455 Q ss_pred hhcccccCCCcceeeEEEeeeecccceEEeecccCCceeee-eeeeeeeeEEEEEEEEEeCHHHHHHHHHhCCCHHHHHH Q lcl|Aclame:pro 74 ELVGESKKGDWTTLVAAFITAEPTTTVATYGDYSSDGDSGT-NINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASELN 152 (336) Q Consensus 74 ~l~~v~t~g~w~~~t~~~~v~e~~G~a~~ygd~~DiP~vd~-~~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~K~ 152 (336) .+..+.+... ....+++....+.+...|.+..+|-.+. ......-..+.++..+.+|.+=++ ....++.+.-. T Consensus 162 ~l~~~~~~~~---~~~~~~~~~~~~~a~wv~E~~~~~~~~~~~f~~v~~~~~k~~~~i~iS~ell~---ds~~~l~~~i~ 235 (425) T protein:vir:10 162 QLCRVQPVSK---AGFSKLFNMGGTTSGWVGEASQRPQTNAATFQPLSFASGEIYANPAATQQILD---DAEIDLESWLA 235 (425) T ss_pred hhceeeeccC---CceEEEEEcCCcceeeeccccccccccccccceeeeeheeeEeehHhHHHHHh---cchhHHHHHHH Confidence 5555433222 1245555555556777788888887664 577777788888888888765443 34578999999 Q ss_pred HHHHHHHHHhhccEEEeeccccceEEEEecCCCCccccccc-cccc----ccCHHHHHHHHHHHHHHHHHHhCCceeccC Q lcl|Aclame:pro 153 YSSALGLAKFLNGSYLFGVAGLENYGLINDPSLSAPITATT-PWSG----SPAVEAVVNEVVTLFQVLQTQSQGIITQEA 227 (336) Q Consensus 153 ~aAr~a~e~~~n~i~~~Gd~~~g~~GllN~Pnl~~~~~~~t-~w~~----~~T~~eI~~Di~~l~~~l~~~t~g~v~~~~ 227 (336) .....++.+.+|.-+++|++.....|++|++.......... ..+. ..+..--++||.+++..|...-. . T Consensus 236 ~~la~ai~~~~d~~~l~G~G~~~p~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~l~~l~~~l~~~~~------~ 309 (425) T protein:vir:10 236 TEVQTEFAKQEGKAFLAGDGTNKPNGLLTYIAGGANAAKHPFGAIEVVNSGAAADITSDGIIDLVYDLPSAFT------G 309 (425) T ss_pred HHHHHHHHHHHHhhhhcccCCCCcceeeeccccccccccccccccccccccccccccHHHHHHHHhhhhhhhc------c Confidence 99999999999999999999888999999876543221110 0111 11222345677777766543211 2 Q ss_pred CcEEEecHHHHHhccc-CCCCCccHHH-HHHHhCC----ccEEEEcccccC-CCCceEEEEEEeeCCCceEEEEeCchhh Q lcl|Aclame:pro 228 VLHMGLPPTAMSDLSK-TNQYGLSAAA-KLKEIFP----KLEFVTIPEYDT-ASGRLVQLWAPRVEGKDTATCGFTEKMR 300 (336) Q Consensus 228 p~tL~Lp~~~~~~Ls~-~~~~~~Tvl~-~l~~n~p----nl~i~~~pel~~-a~G~~~~~~~~~~~~~~~~~~~~p~~~~ 300 (336) .-+++|.+..+..|.+ .+..|.-++. =+..-.| +..++....+.. +.|....+|.+-. .-. .+.--.-++ T Consensus 310 ~a~~vmn~~~~~~L~~lkD~~G~~l~~~~~~~g~~~~l~G~PV~~~~~~p~~~~~~~~i~~Gd~~--~~~-~i~~~~~~~ 386 (425) T protein:vir:10 310 NARFAMNRNTQRQVRKLKDGQGNYLWQPSYVAGQPATLAGYPVTEVPDMPDVAANSTPILFGDFQ--QTY-LIIDRIGVR 386 (425) T ss_pred CCEEEEchHHHHHHHHhhcCCCceeeccCccCCCCceecceeeEEecCcCCccCCccEEEEEehh--ccE-EEEEecceE Confidence 3378999999988853 3333332221 0011011 122222222222 2233333332210 000 000001111 Q ss_pred ccc-ceecCCceEEeeecceeeeEEecccceeeeccC Q lcl|Aclame:pro 301 AHS-IERYSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) Q Consensus 301 ~l~-~~~~~~~~~v~~~~rt~Gv~ir~P~ai~~~~GI 336 (336) .+- .....-...+-...|.+|.+ +.|.||+.+..= T Consensus 387 v~~d~~~~~~~~~~~~~~r~d~~v-~~~~A~~~l~~~ 422 (425) T protein:vir:10 387 VLRDPYTAKPYVLFYTTKRVGGGL-LNPEPMRAMKVA 422 (425) T ss_pred EEecccccCCcEEEEEEEEeccEe-ecccceEEEEee Confidence 110 00111223444556665554 459998665443 No 84 >protein:vir:3033 Length: 272 # NCBI annotation: major capsid protein # Family: family:all:522 # MgeID: mge:61 # MgeName: PhiNIH1.1 # Cross-refs: genbank:acc:NP_438146;genbank:gi:16271809;genbank:GeneID:929235 Probab=94.90 E-value=0.0032 Score=34.24 Aligned_cols=251 Identities=9% Similarity=0.056 Sum_probs=132.5 Q ss_pred hhhhhhhhcCccccCCcchHHHHHHHhhCceeeeeeccccchhhhccccc--CCCcceeeEEEeeeecccceEEeecccC Q lcl|Aclame:pro 31 YAMDAADLSPHLSSTGSSGIPNYLTTYVDPSVIDILVAPMKAAELVGESK--KGDWTTLVAAFITAEPTTTVATYGDYSS 108 (336) Q Consensus 31 ~~~da~d~~~~l~t~~~~~i~~~l~~~idp~v~~~~~~~~~~~~l~~v~t--~g~w~~~t~~~~v~e~~G~a~~ygd~~D 108 (336) ||-- -++.++--+|..+..+|.-++ ...+....+.-+.. .|.-+ .++.++.++..|.+..++.+++ T Consensus 1 MA~~-------~T~~~~~~iPev~s~~v~~~~----~~~~~~~~~~~~~~~~~g~~G-~tv~iP~~~~~~~a~~v~eg~~ 68 (272) T protein:vir:30 1 MAVG-------TTKMAQMLDPEVLADMIDAEV----GKAIRFAPLAEVDTTLEGQPG-TTLTVPKWDYIGDAEDVAEGEA 68 (272) T ss_pred CCCc-------cccchheechHHHHHHHHHHH----HHHhhhhccccccccccCCCC-CEEEEEEecCCCCcccccCCCc Confidence 2211 133445667888888774332 22222233333222 12212 3788999998999999999999 Q ss_pred CceeeeeeeeeeeeEEEEEEEEEeCHHHHHHHHHhCCCHHHHHHHHHHHHHHHhhccEEEeeccccceEEEEecCCCCcc Q lcl|Aclame:pro 109 DGDSGTNINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASELNYSSALGLAKFLNGSYLFGVAGLENYGLINDPSLSAP 188 (336) Q Consensus 109 iP~vd~~~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~K~~aAr~a~e~~~n~i~~~Gd~~~g~~GllN~Pnl~~~ 188 (336) +|..+.........+..++..+.++.++... ...++.+.-...+.+++.+.+++..+ +.++- +. T Consensus 69 i~~~~~~~~~~~~~~~~~~~~~~itd~~~~~---s~~d~~~~~~~~~~~~~a~~~d~~i~---------~~~~~----a~ 132 (272) T protein:vir:30 69 IPMTQLGFKKTTMTIKKAGKGVEITDEAILS---GYGDPVGQAAKQIVEAIDHKVDADVL---------DALSK----ST 132 (272) T ss_pred ccccccccceEEEEeeeeeeeeeecHHHHhh---ccccHHHHHHHHHHHHHHHHHHHHHH---------HHhcc----cc Confidence 9999999999999999999888888776544 34577777777777777777775433 11110 00 Q ss_pred cccccccccccCHHHHHHHHHHHHHHHHHHhCCceeccCCcEEEecHHHHHhcccC--------CCCCccHHHHHH---- Q lcl|Aclame:pro 189 ITATTPWSGSPAVEAVVNEVVTLFQVLQTQSQGIITQEAVLHMGLPPTAMSDLSKT--------NQYGLSAAAKLK---- 256 (336) Q Consensus 189 ~~~~t~w~~~~T~~eI~~Di~~l~~~l~~~t~g~v~~~~p~tL~Lp~~~~~~Ls~~--------~~~~~Tvl~~l~---- 256 (336) .+.. .+.| +++|.+++..+-.. + ..+..++++|..+..|.+. ++++. ..+. T Consensus 133 ~~~~----~~~t----~d~i~da~~~l~~~--~----~~~~~~vv~p~~~~~L~k~~~~~~~~~~~~~~---~~~~~g~i 195 (272) T protein:vir:30 133 QTVE----ATAT----VDGVSKALDIFNDE--D----DAETVIVMNPADASTLRLDAAKEWLGATEVGA---NRVVSGVY 195 (272) T ss_pred cccc----cccC----HHHHHHHHHHHhcc--C----CCccEEEEcHHHHHHHHHhccccccccccccc---cccccccc Confidence 0111 1123 45566665555322 1 2466899999998887431 11111 1111 Q ss_pred HhCCccEEEEcccccCCCCceEEEEEEeeCCCceEEEEeCchhhcccceecCC--ceEEeeecceeeeEEecccceeeec Q lcl|Aclame:pro 257 EIFPKLEFVTIPEYDTASGRLVQLWAPRVEGKDTATCGFTEKMRAHSIERYSS--YFRQKKSAGTWGAVIFRPFAVAQMI 334 (336) Q Consensus 257 ~n~pnl~i~~~pel~~a~G~~~~~~~~~~~~~~~~~~~~p~~~~~l~~~~~~~--~~~v~~~~rt~Gv~ir~P~ai~~~~ 334 (336) .++-+++++..+-+. .++++++- +.-+.+..-...+ ........ ...+-.. +..|+-+.+|.+++.+. T Consensus 196 g~i~G~~Vi~s~~~p---~~t~~~~~-----~~a~~~~~~~~~~-ve~~r~~~~~~~~i~~~-~~~~~~v~~~~~vv~~t 265 (272) T protein:vir:30 196 GEVLGVQIVRSRKCP---KGTAYMVR-----KGALRIMLKRNTM-VETDRDITKAINQIVAN-KHYGVYLYKAEKAVKIT 265 (272) T ss_pred hhhcCeeEEEcCCCC---cceEEEEc-----CCeEEEEecCCce-eeeccccccceeEEEEE-EEEEEEEEcCCceEEEE Confidence 112234544433332 11222221 1111111111111 01111111 1222222 34568888999888876 Q ss_pred cC Q lcl|Aclame:pro 335 GV 336 (336) Q Consensus 335 GI 336 (336) -= T Consensus 266 ~~ 267 (272) T protein:vir:30 266 LK 267 (272) T ss_pred ec Confidence 44 No 85 >protein:vir:9820 Length: 272 # NCBI annotation: putative major capsid/head protein # Family: family:all:522 # MgeID: mge:176 # MgeName: 315.4 # Cross-refs: genbank:acc:NP_795582;genbank:gi:28876339;genbank:GeneID:1257858 Probab=94.90 E-value=0.0032 Score=34.24 Aligned_cols=251 Identities=9% Similarity=0.056 Sum_probs=132.5 Q ss_pred hhhhhhhhcCccccCCcchHHHHHHHhhCceeeeeeccccchhhhccccc--CCCcceeeEEEeeeecccceEEeecccC Q lcl|Aclame:pro 31 YAMDAADLSPHLSSTGSSGIPNYLTTYVDPSVIDILVAPMKAAELVGESK--KGDWTTLVAAFITAEPTTTVATYGDYSS 108 (336) Q Consensus 31 ~~~da~d~~~~l~t~~~~~i~~~l~~~idp~v~~~~~~~~~~~~l~~v~t--~g~w~~~t~~~~v~e~~G~a~~ygd~~D 108 (336) ||-- -++.++--+|..+..+|.-++ ...+....+.-+.. .|.-+ .++.++.++..|.+..++.+++ T Consensus 1 MA~~-------~T~~~~~~iPev~s~~v~~~~----~~~~~~~~~~~~~~~~~g~~G-~tv~iP~~~~~~~a~~v~eg~~ 68 (272) T protein:vir:98 1 MAVG-------TTKMAQMLDPEVLADMIDAEV----GKAIRFAPLAEVDTTLEGQPG-TTLTVPKWDYIGDAEDVAEGEA 68 (272) T ss_pred CCCc-------cccchheechHHHHHHHHHHH----HHHhhhhccccccccccCCCC-CEEEEEEecCCCCcccccCCCc Confidence 2211 133445667888888774332 22222233333222 12212 3788999998999999999999 Q ss_pred CceeeeeeeeeeeeEEEEEEEEEeCHHHHHHHHHhCCCHHHHHHHHHHHHHHHhhccEEEeeccccceEEEEecCCCCcc Q lcl|Aclame:pro 109 DGDSGTNINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASELNYSSALGLAKFLNGSYLFGVAGLENYGLINDPSLSAP 188 (336) Q Consensus 109 iP~vd~~~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~K~~aAr~a~e~~~n~i~~~Gd~~~g~~GllN~Pnl~~~ 188 (336) +|..+.........+..++..+.++.++... ...++.+.-...+.+++.+.+++..+ +.++- +. T Consensus 69 i~~~~~~~~~~~~~~~~~~~~~~itd~~~~~---s~~d~~~~~~~~~~~~~a~~~d~~i~---------~~~~~----a~ 132 (272) T protein:vir:98 69 IPMTQLGFKKTTMTIKKAGKGVEITDEAILS---GYGDPVGQAAKQIVEAIDHKVDADVL---------DALSK----ST 132 (272) T ss_pred ccccccccceEEEEeeeeeeeeeecHHHHhh---ccccHHHHHHHHHHHHHHHHHHHHHH---------HHhcc----cc Confidence 9999999999999999999888888776544 34577777777777777777775433 11110 00 Q ss_pred cccccccccccCHHHHHHHHHHHHHHHHHHhCCceeccCCcEEEecHHHHHhcccC--------CCCCccHHHHHH---- Q lcl|Aclame:pro 189 ITATTPWSGSPAVEAVVNEVVTLFQVLQTQSQGIITQEAVLHMGLPPTAMSDLSKT--------NQYGLSAAAKLK---- 256 (336) Q Consensus 189 ~~~~t~w~~~~T~~eI~~Di~~l~~~l~~~t~g~v~~~~p~tL~Lp~~~~~~Ls~~--------~~~~~Tvl~~l~---- 256 (336) .+.. .+.| +++|.+++..+-.. + ..+..++++|..+..|.+. ++++. ..+. T Consensus 133 ~~~~----~~~t----~d~i~da~~~l~~~--~----~~~~~~vv~p~~~~~L~k~~~~~~~~~~~~~~---~~~~~g~i 195 (272) T protein:vir:98 133 QTVE----ATAT----VDGVSKALDIFNDE--D----DAETVIVMNPADASTLRLDAAKEWLGATEVGA---NRVVSGVY 195 (272) T ss_pred cccc----cccC----HHHHHHHHHHHhcc--C----CCccEEEEcHHHHHHHHHhccccccccccccc---cccccccc Confidence 0111 1123 45566665555322 1 2466899999998887431 11111 1111 Q ss_pred HhCCccEEEEcccccCCCCceEEEEEEeeCCCceEEEEeCchhhcccceecCC--ceEEeeecceeeeEEecccceeeec Q lcl|Aclame:pro 257 EIFPKLEFVTIPEYDTASGRLVQLWAPRVEGKDTATCGFTEKMRAHSIERYSS--YFRQKKSAGTWGAVIFRPFAVAQMI 334 (336) Q Consensus 257 ~n~pnl~i~~~pel~~a~G~~~~~~~~~~~~~~~~~~~~p~~~~~l~~~~~~~--~~~v~~~~rt~Gv~ir~P~ai~~~~ 334 (336) .++-+++++..+-+. .++++++- +.-+.+..-...+ ........ ...+-.. +..|+-+.+|.+++.+. T Consensus 196 g~i~G~~Vi~s~~~p---~~t~~~~~-----~~a~~~~~~~~~~-ve~~r~~~~~~~~i~~~-~~~~~~v~~~~~vv~~t 265 (272) T protein:vir:98 196 GEVLGVQIVRSRKCP---KGTAYMVR-----KGALRIMLKRNTM-VETDRDITKAINQIVAN-KHYGVYLYKAEKAVKIT 265 (272) T ss_pred hhhcCeeEEEcCCCC---cceEEEEc-----CCeEEEEecCCce-eeeccccccceeEEEEE-EEEEEEEEcCCceEEEE Confidence 112234544433332 11222221 1111111111111 01111111 1222222 34568888999888876 Q ss_pred cC Q lcl|Aclame:pro 335 GV 336 (336) Q Consensus 335 GI 336 (336) -= T Consensus 266 ~~ 267 (272) T protein:vir:98 266 LK 267 (272) T ss_pred ec Confidence 44 No 86 >protein:vir:485 Length: 407 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:11 # MgeName: P27 # Cross-refs: genbank:acc:NP_543092;swissprot:trembl:q8w627;genbank:gi:18249904;uniprot:Q8W627;genbank:GeneID:929693 Probab=94.85 E-value=0.0033 Score=34.16 Aligned_cols=314 Identities=11% Similarity=0.037 Sum_probs=143.7 Q ss_pred CchHHHHH--HHhhcceeccchhhhhhhhh----hhh----hhhhh----hhcCccccCCcc--hHHHHHHHhhCceeee Q lcl|Aclame:pro 1 MRDAQRIQ--NLARAGVILPRSVKNVSTPL----AEY----AMDAA----DLSPHLSSTGSS--GIPNYLTTYVDPSVID 64 (336) Q Consensus 1 m~~~~~~~--~l~~~g~~~~~~~~~~~~~~----~~~----~~da~----d~~~~l~t~~~~--~i~~~l~~~idp~v~~ 64 (336) +++..+-. ++....-...........+. ..+ .-+.. ..+-...|.+++ .||..+.+- |++ T Consensus 53 ~e~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~a~~~~l~~g~~~~~~~~e~~a~~~~t~~~gG~~iP~~~~~~----I~~ 128 (407) T protein:vir:48 53 LENLKSDLEAELAEVKRPAGGTQNKVASEHKEAFIGFMRKGREDGLRELERKALQVGNDEDGGYAIPEELDRT----ILT 128 (407) T ss_pred HHHHHHHHHHHHHHhhccccccccchhhHHHHHHHHHHhccchhhhhHHHHHhhhcccCCCCcccccHhHHHH----HHH Confidence 11111100 00000000000000000000 000 00000 000011222333 356665543 333 Q ss_pred eeccccchhhhcccccCCCcceeeEEEeeeecccceEEeecccCCceee-eeeeeeeeeEEEEEEEEEeCHHHHHHHHHh Q lcl|Aclame:pro 65 ILVAPMKAAELVGESKKGDWTTLVAAFITAEPTTTVATYGDYSSDGDSG-TNINYPQRQSYFFQTWTRWGERELEMAGAG 143 (336) Q Consensus 65 ~~~~~~~~~~l~~v~t~g~w~~~t~~~~v~e~~G~a~~ygd~~DiP~vd-~~~~~~~~~v~~~~~~~~y~~~El~~A~~~ 143 (336) .+-..-....+..+.+.+ .....+++......+...+.+...|-.+ .......-.++.++..+.+|.+=++. . T Consensus 129 ~~~~~~~l~~~~~~~~~~---~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~d---s 202 (407) T protein:vir:48 129 LLKDEVVMRQEATVITLG---GSDYKKLVNLGGTTSGWVGETDARPETATSKLGLIEPFMGEIYGNPQATQKMLDD---A 202 (407) T ss_pred HHHhhhhhhhhceeeecC---CCceEEEEecCCcceeeecccccccccccccceeEEeeeeeeEeehhhHHHHHhc---c Confidence 333222233344332222 1235666666666777777777777554 46777777888888888888765543 3 Q ss_pred CCCHHHHHHHHHHHHHHHhhccEEEeeccccceEEEEecCCCCccccccc--c--ccc-ccCHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 144 RVDLASELNYSSALGLAKFLNGSYLFGVAGLENYGLINDPSLSAPITATT--P--WSG-SPAVEAVVNEVVTLFQVLQTQ 218 (336) Q Consensus 144 g~~l~~~K~~aAr~a~e~~~n~i~~~Gd~~~g~~GllN~Pnl~~~~~~~t--~--w~~-~~T~~eI~~Di~~l~~~l~~~ 218 (336) ..++.+.-......++.+.++.-.++|++.....|+|+++.+........ . --. .++..--++||.+++..|... T Consensus 203 ~~~l~~~i~~~l~~~i~~~~~~a~l~G~G~~~p~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~i~~l~~~l~~~ 282 (407) T protein:vir:48 203 FFNVEDWINSELALEFAEQEEIAFTSGDGSKKPKGFLAYESTDEDDKTRAFGKLQHIASGAASGVTADAIIKLIYTLRKA 282 (407) T ss_pred hHHHHHHHHHHHHHHHHHHHHhhhhccCCCCccceeeecccccccccccccccccccccccccccChHHHHHHHHhhchh Confidence 46788888888888889999999999999888999999887643222110 0 000 111111256777777766432 Q ss_pred hCCceeccCCcEEEecHHHHHhccc-CCCCCccHHHH-HHH-------hCCccEEEEcccccCCCCceEEEEEEeeCCCc Q lcl|Aclame:pro 219 SQGIITQEAVLHMGLPPTAMSDLSK-TNQYGLSAAAK-LKE-------IFPKLEFVTIPEYDTASGRLVQLWAPRVEGKD 289 (336) Q Consensus 219 t~g~v~~~~p~tL~Lp~~~~~~Ls~-~~~~~~Tvl~~-l~~-------n~pnl~i~~~pel~~a~G~~~~~~~~~~~~~~ 289 (336) -. . .-+++|.+..+..|.+ .+..|.-++.- +.. -+|=+....+|.. ++|....+|.+-. .. T Consensus 283 ~~----~--~a~~v~n~~~~~~L~~lkD~~Gr~l~~~~~~~g~~~~l~G~PV~~~~~~p~~--~~~~~~i~~Gd~~--~~ 352 (407) T protein:vir:48 283 HR----S--GAKFMMNNSSLFAIRLLKDNDGNYLWRPGIELGQPSSLAGYGIVENEQMPDI--AADAKAIAFGNFK--RG 352 (407) T ss_pred hh----c--CCEEEEcHHHHHHHHHhhccCCceeeccCcCCCCCceecceeeEEecCcCCc--cCCccEEEEEecc--cc Confidence 11 1 2258899998888853 33333322110 011 1221222223322 2333333332210 00 Q ss_pred eEEEEeCchhhccc-ceecCCceEEeeecceeeeEEecccceeeeccC Q lcl|Aclame:pro 290 TATCGFTEKMRAHS-IERYSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) Q Consensus 290 ~~~~~~p~~~~~l~-~~~~~~~~~v~~~~rt~Gv~ir~P~ai~~~~GI 336 (336) . .+.--+.++.+- .....-....-+..|.+|. ++.|.||+.+..= T Consensus 353 ~-~i~~~~~~~i~~d~~~~~~~~~~~~~~r~d~~-v~~~~a~~~l~~~ 398 (407) T protein:vir:48 353 Y-TIVDRIGTRILRDPYTNKPFVGFYTTKRTGGM-LVDSQAIKLMKIG 398 (407) T ss_pred E-EEEEeeceEEEeeccccCCcEEEEEEEEeccE-EecccceEEEEee Confidence 0 000000111110 0011223445566777664 4559999765544 No 87 >protein:vir:4092 Length: 390 # NCBI annotation: major capsid protein a # Family: family:all:635 # MgeID: mge:86 # MgeName: 2389 # Cross-refs: genbank:acc:NP_510986;swissprot:trembl:q8w604;genbank:gi:17488508;uniprot:Q8W604;genbank:GeneID:1260361 Probab=94.79 E-value=0.0035 Score=34.06 Aligned_cols=301 Identities=11% Similarity=0.016 Sum_probs=135.6 Q ss_pred CchHHHHHHHhhcceeccchhhhhhhhhhhhhhhhhhhcCccccCCcchHHHHHHHhhCceeeeeeccccchhhhccccc Q lcl|Aclame:pro 1 MRDAQRIQNLARAGVILPRSVKNVSTPLAEYAMDAADLSPHLSSTGSSGIPNYLTTYVDPSVIDILVAPMKAAELVGESK 80 (336) Q Consensus 1 m~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~da~d~~~~l~t~~~~~i~~~l~~~idp~v~~~~~~~~~~~~l~~v~t 80 (336) .+...+...+.+.|. ..++.+.++.. .++-...+ .+.....+|..+.+-| ++.+...-....++.+.+ T Consensus 55 ~~~~~~~~~~~~~~~------~~l~~~~r~~~-~~~~~~~~-~~~gg~lvP~~~~~~I----~~~~~~~s~i~~~~~~~~ 122 (390) T protein:vir:40 55 NREMNDNNVLASRGA------NALTSDESKYY-NEVIAGNG-FAGVTALLPPTVFERV----FEDLTVEHPLLSKINFVN 122 (390) T ss_pred HHHHHHHHHHHhcCc------hhccHHHHHHH-HHHHhccC-cccCcccccHHHHHHH----HHHHHhhhhhhhhceeee Confidence 000011111111121 11222222211 11111111 2223334676666433 333333323334444432 Q ss_pred CCCcceeeEEEeeeecccceEEeecccCCc-eeeeeeeeeeeeEEEEEEEEEeCHHHHHHHHHhCCCHHHHHHHHHHHHH Q lcl|Aclame:pro 81 KGDWTTLVAAFITAEPTTTVATYGDYSSDG-DSGTNINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASELNYSSALGL 159 (336) Q Consensus 81 ~g~w~~~t~~~~v~e~~G~a~~ygd~~DiP-~vd~~~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~K~~aAr~a~ 159 (336) .+. ....++.....+.+...+....+| ..+.......-..+.+...+.+|.+=++.+ ..++.+.-....++++ T Consensus 123 ~~~---~~~~i~~~~~~~~a~~~~E~~~~~~~~~~~f~~i~l~~~k~~~~i~iS~ell~ds---~~~l~~~i~~~la~~i 196 (390) T protein:vir:40 123 TTA---TTEWIISVGDVATAWWGPLCAEIKEVLDNGFDKIQTGMYKLSAYIPVCNAMLDLG---PSWLDQYVRTILGEAM 196 (390) T ss_pred cCC---ceeEEEEEcCCcceeeeccccccCccccccceeeEeeeeeEEEeehhhHHHHhcc---hHHHHHHHHHHHHHHH Confidence 222 234456666677777777666665 456677777788888888888886555533 4578888889999999 Q ss_pred HHhhccEEEeeccccceEEEEecCCCCcccccccccccccCHHHHHHHHHHHHHHHHHHhCCceeccCCcEEEecHH-HH Q lcl|Aclame:pro 160 AKFLNGSYLFGVAGLENYGLINDPSLSAPITATTPWSGSPAVEAVVNEVVTLFQVLQTQSQGIITQEAVLHMGLPPT-AM 238 (336) Q Consensus 160 e~~~n~i~~~Gd~~~g~~GllN~Pnl~~~~~~~t~w~~~~T~~eI~~Di~~l~~~l~~~t~g~v~~~~p~tL~Lp~~-~~ 238 (336) ...+|+-+++|++...-.|++|++.........+....+.|.+.+.+.+..+...+.....- ...--.++|-++ .+ T Consensus 197 ~~~~~~a~l~G~G~~~P~Gil~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~l~~~~~~~~~~---~~~~a~~i~n~~t~~ 273 (390) T protein:vir:40 197 ALGLEAGIVNGSGKDQPIGMMRDLNNVTAGEHPVKTATPLTDLTPATLATKVMLPLTDNGKK---SVSDAILVINPADYW 273 (390) T ss_pred HHHHHhhhhcccCCCccceeeeccccccccccccccccccchhhHHHHHHHHHHHhhcchhh---hhcCceEEEcchhHH Confidence 99999999999988788899997643221111111112223333333333333333222210 001123555543 33 Q ss_pred Hhcc----cCCCCCccHHHHHHHhCCccEEEEcccccCC---CCceEEEEEEeeCCCceEEEEeCchhhccccee--cCC Q lcl|Aclame:pro 239 SDLS----KTNQYGLSAAAKLKEIFPKLEFVTIPEYDTA---SGRLVQLWAPRVEGKDTATCGFTEKMRAHSIER--YSS 309 (336) Q Consensus 239 ~~Ls----~~~~~~~Tvl~~l~~n~pnl~i~~~pel~~a---~G~~~~~~~~~~~~~~~~~~~~p~~~~~l~~~~--~~~ 309 (336) .+|. ..+..|.-++..+- + ++.++..+..... -|.-.+.++..+.+ +.+...+ +. ..- T Consensus 274 ~~l~~~~~~~d~~G~~v~~~~~--~-g~pvv~~~~~p~~~i~~Gd~s~~~i~~~~~---------~~v~~~~-~~~f~~~ 340 (390) T protein:vir:40 274 SKIYAATSYMTPQGVWVTGILP--V-PLEIVQSVAVPVGKAVAGRAKDYFMGIGSE---------QVIRTST-EYRLLDD 340 (390) T ss_pred HHHHHHhhccCCCCccccccCC--C-ceeEEEcCCCCCCcEEEEeeceEEEEeecc---------eEEEecc-hhhhhcC Confidence 3332 11222322221110 1 2344332222110 12111211111111 1111111 11 122 Q ss_pred ceEEeeecceeeeEEecccceee--eccC Q lcl|Aclame:pro 310 YFRQKKSAGTWGAVIFRPFAVAQ--MIGV 336 (336) Q Consensus 310 ~~~v~~~~rt~Gv~ir~P~ai~~--~~GI 336 (336) ....-...|.+|.. +.|.||+. +.++ T Consensus 341 ~~~~r~~~r~dg~v-~~~~A~~~l~~~~~ 368 (390) T protein:vir:40 341 ETLYYAKQYANGRP-KDNSSFLVFDITGL 368 (390) T ss_pred cEEEEEEEEeCCEE-ecccceEEEEeecc Confidence 34455666766654 45988884 4555 No 88 >protein:vir:107882 Length: 307 # NCBI annotation: gp34 # Family: family:all:908 # MgeID: mge:1565 # MgeName: BcepMu # Cross-refs: genbank:acc:YP_024707;genbank:gi:48696944;genbank:GeneID:2845970 Probab=94.75 E-value=0.0019 Score=35.53 Aligned_cols=262 Identities=12% Similarity=0.057 Sum_probs=113.9 Q ss_pred hhhhhhcCccccCCcchHHHHHHHhhCceeeeeeccccchhhhcccccCCCcceeeEEEeeeecccceEE--------ee Q lcl|Aclame:pro 33 MDAADLSPHLSSTGSSGIPNYLTTYVDPSVIDILVAPMKAAELVGESKKGDWTTLVAAFITAEPTTTVAT--------YG 104 (336) Q Consensus 33 ~da~d~~~~l~t~~~~~i~~~l~~~idp~v~~~~~~~~~~~~l~~v~t~g~w~~~t~~~~v~e~~G~a~~--------yg 104 (336) |-.. .+| -..++.+.++-+.|=.+ .+-+++|||...++--.-...+|+ . ++.. -| T Consensus 1 m~~~-~~~---~~~dp~LT~~A~gy~n~--------~~ia~~l~P~vpv~~~~~k~~~f~---~--eaF~~~~t~r~~~~ 63 (307) T protein:vir:10 1 MGRL-SKL---RIVDPVLTNLAIGYTNA--------EFIGQSLMPVVEVEKEGGKIPKFG---K--ESFRLYKTERALRA 63 (307) T ss_pred CCCC-CCC---cccChhHHHHHHhhcch--------hhhhhhcCCcccccccccceeeEC---c--ccccchhhhcccCC Confidence 1111 111 11233344555555544 467788888876554333333442 2 1110 01 Q ss_pred cccCCceeeeeeeeeeeeEEEEEEEEEeCHHHHHHHHHhCCCHHHHHHHHHHHHH----HHhhccEEEeeccccceEEEE Q lcl|Aclame:pro 105 DYSSDGDSGTNINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASELNYSSALGL----AKFLNGSYLFGVAGLENYGLI 180 (336) Q Consensus 105 d~~DiP~vd~~~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~K~~aAr~a~----e~~~n~i~~~Gd~~~g~~Gll 180 (336) +.+-+ +... ........-+-+..+-+. .+.++.+..++.+++...++..+ |...-++++-.. T Consensus 64 ~~~~v---~~~~-~~~~~~~~~~~~L~~~id-~r~~~~~~~~~~~~av~~l~d~I~l~~E~~~A~l~~~~~--------- 129 (307) T protein:vir:10 64 RSNRM---NPED-LGSIDIVLDEHDLEYPID-YREDQESAFPLEQAAVQTATEAIQLRREKMVADLAQNPN--------- 129 (307) T ss_pred Cccee---eccc-ccccccccccccccccCC-hhhcCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHhcCcc--------- Confidence 11111 1000 000011111111222211 13445556666665555443333 333344433211 Q ss_pred ecCCCCcccccccccccccCHHHHHHHHHHHHHHHHHHhCCceeccCCcEEEecHHHHHhccc---------CCCCC-cc Q lcl|Aclame:pro 181 NDPSLSAPITATTPWSGSPAVEAVVNEVVTLFQVLQTQSQGIITQEAVLHMGLPPTAMSDLSK---------TNQYG-LS 250 (336) Q Consensus 181 N~Pnl~~~~~~~t~w~~~~T~~eI~~Di~~l~~~l~~~t~g~v~~~~p~tL~Lp~~~~~~Ls~---------~~~~~-~T 250 (336) |.|+-...+-+.+.-|+..+.| ++.||.+....+...++ -.|++++|....+..|.. .+..+ +| T Consensus 130 ~y~~~~k~tLsGt~~Wsd~~sD-Pi~di~~~~~ai~~~~g-----~~Pn~~vlg~~a~~al~~hp~i~e~lk~~~~g~it 203 (307) T protein:vir:10 130 SYAGGNKKQLSATEKFTAAGSD-PVGVIEDGKEAIRTKIG-----RRPNTMVIGASAYKTLKAHPQLIEKIKYSMKGIVT 203 (307) T ss_pred ccCCCceEEeccccccCCCCCC-cHHHHHHHHHHHHhhhC-----CccceEEeCHHHHHHHhcCHHHHHHhCCccccccC Confidence 1222111122233345555655 89999999999988775 369999999999998863 11223 44 Q ss_pred HHHHHHHhCCccEEEEccc--ccCCCC-------c-eEEEEEEeeCCCceEEEEeCchhhcccceecCCceEEeeeccee Q lcl|Aclame:pro 251 AAAKLKEIFPKLEFVTIPE--YDTASG-------R-LVQLWAPRVEGKDTATCGFTEKMRAHSIERYSSYFRQKKSAGTW 320 (336) Q Consensus 251 vl~~l~~n~pnl~i~~~pe--l~~a~G-------~-~~~~~~~~~~~~~~~~~~~p~~~~~l~~~~~~~~~~v~~~~rt~ 320 (336) . +.|++-+ .++.+.+-+ +.++.+ + ...+++....+.+...+..|. |- +-.+.++..+..+..+. + T Consensus 204 ~-~~la~ll-~v~~i~vg~a~~~~~~~~~~~iw~~~~vl~yv~~~~~~~~~~~~eps-fG-yT~~~~g~~~~d~~~~~-~ 278 (307) T protein:vir:10 204 V-DLLKEIF-EVENIAVGEAIYADDKDRFTDIWGANIVLAYVPLQRGGQQRTPYEPS-YG-YTLRKKGNPVVDTRIED-G 278 (307) T ss_pred H-HHHHHHh-CceeEEEeeeeeeccCCccceeCCCceEEEecccccCCCCCcccccc-cc-eeEEEcCCeEeeceecC-C Confidence 3 4554433 344333332 223332 2 223333332222222222221 11 11234455555555553 4 Q ss_pred eeE------EecccceeeeccC Q lcl|Aclame:pro 321 GAV------IFRPFAVAQMIGV 336 (336) Q Consensus 321 Gv~------ir~P~ai~~~~GI 336 (336) |+. +++|.-+..-.|. T Consensus 279 ~~~~~r~~~~~~~~i~~~~~G~ 300 (307) T protein:vir:10 279 KLELVRSTDIFRPYLLGADAGY 300 (307) T ss_pred ceeEEeccccccceeecccccc Confidence 433 3456666666665 No 89 >protein:vir:105334 Length: 276 # NCBI annotation: putative phage major capsid protein # Family: family:all:522 # MgeID: mge:1679 # MgeName: PH15 # Cross-refs: genbank:acc:YP_950669;genbank:gi:119967839;genbank:GeneID:4643213 Probab=94.08 E-value=0.0054 Score=33.00 Aligned_cols=253 Identities=9% Similarity=0.016 Sum_probs=128.8 Q ss_pred hhhhhhhhcCccccCCcchHHHHHHHhhCceeeeeeccccchhhhcccccCCCc-ceeeEEEeeeecccceEEeecccCC Q lcl|Aclame:pro 31 YAMDAADLSPHLSSTGSSGIPNYLTTYVDPSVIDILVAPMKAAELVGESKKGDW-TTLVAAFITAEPTTTVATYGDYSSD 109 (336) Q Consensus 31 ~~~da~d~~~~l~t~~~~~i~~~l~~~idp~v~~~~~~~~~~~~l~~v~t~g~w-~~~t~~~~v~e~~G~a~~ygd~~Di 109 (336) || . ..++-++--+|..+..||.-++ -..+....|..+.+..+- .-.+++++.++..|.+..+++++++ T Consensus 1 Ma---~----~~T~l~d~i~Pev~~~~v~~~~----~~~~~~~~~~~~~~~l~g~~G~ti~iP~~~~igda~~~~eg~~i 69 (276) T protein:vir:10 1 MA---Q----GTTTKSTQIVPEVLAPMMQAEL----DKKLRFAQFADIDSTLVGQPGDTLTFPAFVYSGDATVVPEGQKI 69 (276) T ss_pred CC---c----ceeehhhhhchHHHHHHHHHHH----HhhhhhcccceecccccCCCCCEEEeeeecCCCccccccCCCcc Confidence 11 1 1244455567888888884443 333444555554432211 2367999999999999999999999 Q ss_pred ceeeeeeeeeeeeEEEEEEEEEeCHHHHHHHHHhCCCHHHHHHHHHHHHHHHhhccEEEeeccccceEEEEecCCCCccc Q lcl|Aclame:pro 110 GDSGTNINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASELNYSSALGLAKFLNGSYLFGVAGLENYGLINDPSLSAPI 189 (336) Q Consensus 110 P~vd~~~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~K~~aAr~a~e~~~n~i~~~Gd~~~g~~GllN~Pnl~~~~ 189 (336) |............+...+-++.++.++ ..+. +.+.-.+-......++.+.+++..+ ..++.-.. T Consensus 70 ~~~~lt~~~~~a~i~~~~k~~~~tD~a--~~~~-~~dp~~~~~~~~~~~~a~~~d~~~~---------~~l~~~~~---- 133 (276) T protein:vir:10 70 PVDKIETNRREAKIHKIGKGTDITDEA--LLSG-YGDPQGEAVRQHGLAIANKVDNDVL---------EALRGTKL---- 133 (276) T ss_pred CccccccceeeEEeehccccccccHHH--HHhh-ccchHHHHHHHHHHHHHHHHHHHHH---------HHHhcccc---- Confidence 999999999998898876666665544 3333 4455555566666666666665322 11110000 Q ss_pred ccccccccccCHHHHHHHHHHHHHHHHHHhCCceeccCCcEEEecHHHHHhcccC--------CCCCccHHHHHHH---- Q lcl|Aclame:pro 190 TATTPWSGSPAVEAVVNEVVTLFQVLQTQSQGIITQEAVLHMGLPPTAMSDLSKT--------NQYGLSAAAKLKE---- 257 (336) Q Consensus 190 ~~~t~w~~~~T~~eI~~Di~~l~~~l~~~t~g~v~~~~p~tL~Lp~~~~~~Ls~~--------~~~~~Tvl~~l~~---- 257 (336) +.+ .++.|. +.|.+++..+-.+ ...+..|++.|..+..|.+- +..+.. .+.. T Consensus 134 ~~~---~~~~t~----d~i~~A~~~lgd~------~~~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~---~~~~G~ig 197 (276) T protein:vir:10 134 TVS---ADIGTL----AGLEAAIDTFDDE------DLEPMVLFINPKDAGKLRSSASDNFTRATELGDN---IIVKGAFG 197 (276) T ss_pred ccc---ccccCH----HHHHHHHHHhccc------cCcccEEEEcHHHHHHHHHhcccccccccccccc---ceeccccc Confidence 001 112343 3444454444222 12467899999998888532 111111 1100 Q ss_pred hCCccEEEEcccccCCCCceEEEEEEeeCCCceEEEEeCchhh--cccceecCCceEEeeecceeeeEEecccceeeecc Q lcl|Aclame:pro 258 IFPKLEFVTIPEYDTASGRLVQLWAPRVEGKDTATCGFTEKMR--AHSIERYSSYFRQKKSAGTWGAVIFRPFAVAQMIG 335 (336) Q Consensus 258 n~pnl~i~~~pel~~a~G~~~~~~~~~~~~~~~~~~~~p~~~~--~l~~~~~~~~~~v~~~~rt~Gv~ir~P~ai~~~~G 335 (336) .|-+++|+....+. -...+++- +.-+.+....+.+ ...-+.+.. -.+ ..-..+|+-+.+|..++.+.= T Consensus 198 ~~~G~~Vi~s~~~p---~~t~~l~~-----~gAi~~~~~~~~~vE~dRd~~~~~-d~i-~~~~~y~~~~~~~~~vv~~t~ 267 (276) T protein:vir:10 198 EALGAVIVRSKKLD---EGEAILAK-----RGAVKLITKRDFFLETDRDPSTKT-TAL-YSDKHYVAYLYDESKAVKVTK 267 (276) T ss_pred eecceeEEEcCCCC---cceEEEEe-----ccceeeeecCCceeecccchhhcc-cEE-EEeeEEEEEEEcCcceEEEec Confidence 12245555444332 12222221 1111111111111 000011111 111 112456888889987777663 Q ss_pred C Q lcl|Aclame:pro 336 V 336 (336) Q Consensus 336 I 336 (336) - T Consensus 268 ~ 268 (276) T protein:vir:10 268 G 268 (276) T ss_pred C Confidence 3 No 90 >protein:vir:4856 Length: 293 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:106 # MgeName: DT1 # Cross-refs: genbank:acc:NP_049396;genbank:gi:9632424;genbank:GeneID:1258532 Probab=94.05 E-value=0.0051 Score=33.15 Aligned_cols=258 Identities=10% Similarity=-0.055 Sum_probs=127.2 Q ss_pred hhhhhhhhhhhhcCccccCCcc--hHHHHHHHhhCceeeeeeccccchhhh---cccccCCCcceeeEEEeeee-cccce Q lcl|Aclame:pro 27 PLAEYAMDAADLSPHLSSTGSS--GIPNYLTTYVDPSVIDILVAPMKAAEL---VGESKKGDWTTLVAAFITAE-PTTTV 100 (336) Q Consensus 27 ~~~~~~~da~d~~~~l~t~~~~--~i~~~l~~~idp~v~~~~~~~~~~~~l---~~v~t~g~w~~~t~~~~v~e-~~G~a 100 (336) -..+|+ .+|.+++ .+|..+.+ +|++.+........+ +|+++.. ....+...+ ..+.+ T Consensus 1 ~l~~~~---------~~t~~~gg~liP~~~~~----~Ii~~~~~~~~l~~~~~~~~~~~~~----g~~~~~~~~~~~~~a 63 (293) T protein:vir:48 1 MLDSKT---------DHSGSDAGLTIPQDIRT----AINTLVRQYDSLQEYVNVENVTTLT----GSRVYEKWTDITGLA 63 (293) T ss_pred Cceeec---------ccccCcCceEechhHHH----HHHHHHHhhhhhhhhceeeeccCCc----ceEEEEeecCCCcce Confidence 111111 1222232 34666553 334444333334444 4443321 223444333 45677 Q ss_pred EEeecccCCce-eeeeeeeeeeeEEEEEEEEEeCHHHHHHHHHhCCCHHHHHHHHHHHHHHHhhccEEEeeccccceEEE Q lcl|Aclame:pro 101 ATYGDYSSDGD-SGTNINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASELNYSSALGLAKFLNGSYLFGVAGLENYGL 179 (336) Q Consensus 101 ~~ygd~~DiP~-vd~~~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~K~~aAr~a~e~~~n~i~~~Gd~~~g~~Gl 179 (336) ...+....+|- .+.......-..+.++..+.+|.+=++-+ ..+|.+.-....++++.+.+|+-.+.|..... T Consensus 64 ~~v~Eg~~~~~~~~~~~~~i~l~~~k~~~~~~iS~ell~ds---~~~l~~~i~~~la~~~~~~~~~~i~~g~~~~~---- 136 (293) T protein:vir:48 64 NIDDEAGKIADIDDPKLSLIKYTIKRYAGISTVTNSLLADS---AENILAWLSGWIAKKVVVTRNKAILGVVDKLP---- 136 (293) T ss_pred eeecCCcccccccccceeEEEEeeeEEEEeehhhHHHHhhh---hHHHHHHHHHHHHHHHHHHHHhHHhhcccccc---- Confidence 88888888875 45677888888899998888886655543 46788888888888888888876665543211 Q ss_pred EecCCCCcccccccccccccCHHHHHHHHHHHHHHHHHHhCCceeccCCcEEEecHHHHHhccc-CCCCCccHHH-HHHH Q lcl|Aclame:pro 180 INDPSLSAPITATTPWSGSPAVEAVVNEVVTLFQVLQTQSQGIITQEAVLHMGLPPTAMSDLSK-TNQYGLSAAA-KLKE 257 (336) Q Consensus 180 lN~Pnl~~~~~~~t~w~~~~T~~eI~~Di~~l~~~l~~~t~g~v~~~~p~tL~Lp~~~~~~Ls~-~~~~~~Tvl~-~l~~ 257 (336) ......+ ++||.+++..+...- .....++|.++.+..|.+ .+..|.-+++ -+.. T Consensus 137 --------------~~~~~~~----~d~i~~~~~~l~~~~------~~~a~~vmn~~~~~~L~~lkd~~g~~l~~~~~~~ 192 (293) T protein:vir:48 137 --------------TKPTLTK----WDDIIDLEAKVDPAI------KQTSFFLTNTSGFTALKKVKNALGDYLMERDVKS 192 (293) T ss_pred --------------ccccccC----HHHHHHHHHhhhhhh------cCCCEEEEcHHHHHHHHHhhccCCceEeecCcCC Confidence 1112233 456677777664321 123479999999998854 3333332221 0111 Q ss_pred hCC----c--cEEEEccccc-CCCCceEEEEEEeeCCCceEEEE--eCchhhccc---ceecCCceEEeeecceeeeEEe Q lcl|Aclame:pro 258 IFP----K--LEFVTIPEYD-TASGRLVQLWAPRVEGKDTATCG--FTEKMRAHS---IERYSSYFRQKKSAGTWGAVIF 325 (336) Q Consensus 258 n~p----n--l~i~~~pel~-~a~G~~~~~~~~~~~~~~~~~~~--~p~~~~~l~---~~~~~~~~~v~~~~rt~Gv~ir 325 (336) ..+ + +.+....-+. .++|....++.+- .+-..+. -...+.... .....-...+-+..|.+|. ++ T Consensus 193 ~~~~~l~G~Pv~~~~~~~~~~~~~~~~~~~~gd~---~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~r~~~r~d~~-~~ 268 (293) T protein:vir:48 193 PTGYSIAGFAVKEISDRWLPNASSGVMPLYFGDL---KQAVTLFDRQQMSLLSTNIGGGAFETDTTKVRVIDRFDVV-AT 268 (293) T ss_pred CCCceecceeeEEecccccCCccCCceEEEEEec---cceEEEEEecceEEEEecccchhhhcCeEEEEEEEeeCcE-Ee Confidence 111 1 1111111111 2233333333221 1100000 001111111 0011123445566666664 67 Q ss_pred cccceeeeccC Q lcl|Aclame:pro 326 RPFAVAQMIGV 336 (336) Q Consensus 326 ~P~ai~~~~GI 336 (336) +|.||+.+..= T Consensus 269 ~~~a~~~l~~~ 279 (293) T protein:vir:48 269 DTEAFVPASFK 279 (293) T ss_pred cccceEEEEee Confidence 89999977744 No 91 >protein:vir:1239 Length: 274 # NCBI annotation: similar to phage B1 major head protein # Family: family:all:522 # MgeID: mge:25 # MgeName: phi ETA # Cross-refs: genbank:acc:NP_510938;genbank:gi:17426272;genbank:GeneID:927376 Probab=93.89 E-value=0.006 Score=32.75 Aligned_cols=253 Identities=10% Similarity=0.010 Sum_probs=125.4 Q ss_pred hhhhhcCccccCCcchHHHHHHHhhCceeeeeeccccchhhhcccccCCCc-ceeeEEEeeeecccceEEeecccCCcee Q lcl|Aclame:pro 34 DAADLSPHLSSTGSSGIPNYLTTYVDPSVIDILVAPMKAAELVGESKKGDW-TTLVAAFITAEPTTTVATYGDYSSDGDS 112 (336) Q Consensus 34 da~d~~~~l~t~~~~~i~~~l~~~idp~v~~~~~~~~~~~~l~~v~t~g~w-~~~t~~~~v~e~~G~a~~ygd~~DiP~v 112 (336) -|+ ..+.-++--+|..+..||..++ ...+....|..+....+- .-.+++++.+...|.+..|.++++++.- T Consensus 1 ma~----~~T~l~d~iiPev~~~~v~~~~----~~~l~~~~~~~~d~~l~g~~G~tv~iP~~~~ig~a~~~~~g~~i~~~ 72 (274) T protein:vir:12 1 MAQ----GLTKTSNQIIPEVLAPMMQAQL----EKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEKIPTD 72 (274) T ss_pred CCc----ceeehhhhhchHHHHHHHHHHH----HhhhhhcccceecccccCCCCCEEEEeeecCCCccccccCCCccchh Confidence 011 1345566678899998885554 344555556555433211 2468999999999999999999999999 Q ss_pred eeeeeeeeeeEEEEEEEEEeCHHHHHHHHHhCCCHHHHHHHHHHHHHHHhhccEEEeeccccceEEEEecCCCCcccccc Q lcl|Aclame:pro 113 GTNINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASELNYSSALGLAKFLNGSYLFGVAGLENYGLINDPSLSAPITAT 192 (336) Q Consensus 113 d~~~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~K~~aAr~a~e~~~n~i~~~Gd~~~g~~GllN~Pnl~~~~~~~ 192 (336) +.........+...+-++.++ ++.+++.. -++..+-...+..++.+.+++-.+ . .++...+ +.+ T Consensus 73 ~lt~~~~~~~i~~~~~~~~i~--D~~~~~~~-~d~~~~~~~q~~~~~a~~vd~~~l-~--------~~~~a~~----~~~ 136 (274) T protein:vir:12 73 ILETKKREAKIRKIAKGTSIT--DEALLSGY-GDPQGEQVRQHGLAHANKVDNDVL-E--------ALMGAKL----TVN 136 (274) T ss_pred hcccceeeEEeeeecceeeec--HHHHHhcc-cchHHHHHHHHHHHHHHHHHHHHH-H--------HHhcccc----ccc Confidence 999999888888876555554 45555544 345555556666666666665322 0 1110000 001 Q ss_pred cccccccCHHHHHHHHHHHHHHHHHHhCCceeccCCcEEEecHHHHHhcccCC--------CCCccHHHHHHH----hCC Q lcl|Aclame:pro 193 TPWSGSPAVEAVVNEVVTLFQVLQTQSQGIITQEAVLHMGLPPTAMSDLSKTN--------QYGLSAAAKLKE----IFP 260 (336) Q Consensus 193 t~w~~~~T~~eI~~Di~~l~~~l~~~t~g~v~~~~p~tL~Lp~~~~~~Ls~~~--------~~~~Tvl~~l~~----n~p 260 (336) + ++.+. +.|.+++..+-.. ...+..|+++|..+..|.+-. +++..+ ++. .|- T Consensus 137 ~---~a~~~----d~i~dA~~~lgd~------~~~~~~ivv~p~~~~~L~k~~~~~fv~~s~~g~~~---~~~G~ig~~~ 200 (274) T protein:vir:12 137 A---DITKL----NGLQSAIDKFNDE------DLEPMVLFINPLDAGKLRGDASTNFTRATELGDDI---IVKGAFGEAL 200 (274) T ss_pred c---cccCH----HHHHHHHHHhccc------cccccEEEeCHHHHHHHHhhhhhhccccccccccc---eecccceeec Confidence 1 12233 3444444444221 124678999999998886421 112111 110 122 Q ss_pred ccEEEEcccccCCCCceEEEEEEeeCCCceEEEEeCchhh--cccceecCCceEEeeecceeeeEEecccceeeeccC Q lcl|Aclame:pro 261 KLEFVTIPEYDTASGRLVQLWAPRVEGKDTATCGFTEKMR--AHSIERYSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) Q Consensus 261 nl~i~~~pel~~a~G~~~~~~~~~~~~~~~~~~~~p~~~~--~l~~~~~~~~~~v~~~~rt~Gv~ir~P~ai~~~~GI 336 (336) +++|+....+. -...+++- +.-+......+.+ ..-.+.+... .=..-..+||-+.+|-.++.+..= T Consensus 201 G~~Vi~s~~~p---~~t~~l~~-----~gA~~~~~~~~~~vE~~Rd~~~~~d--~i~~~~~y~~~~~~~~~vv~~t~~ 268 (274) T protein:vir:12 201 GAIIVRSNKLE---AGTAILAK-----KGAVKLILKRDFFLEVARDASTKTT--ALYSDKHYVAYLYDESKAVKITKG 268 (274) T ss_pred CeeEEEeCCCC---cceEEEEe-----ccceeeeecCCceeccccchhhccc--EEEeeeEEEEEEEcCCceEEEEcC Confidence 34444322221 01112111 1111111111100 0000011111 111123456666666666665544 No 92 >protein:vir:96262 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1612 # MgeName: ROSA # Cross-refs: genbank:acc:YP_240311;genbank:gi:66395978;genbank:GeneID:5133339 Probab=93.34 E-value=0.0079 Score=32.11 Aligned_cols=255 Identities=9% Similarity=0.000 Sum_probs=128.8 Q ss_pred hhhhhcCccccCCcchHHHHHHHhhCceeeeeeccccchhhhccccc--CCCcceeeEEEeeeecccceEEeecccCCce Q lcl|Aclame:pro 34 DAADLSPHLSSTGSSGIPNYLTTYVDPSVIDILVAPMKAAELVGESK--KGDWTTLVAAFITAEPTTTVATYGDYSSDGD 111 (336) Q Consensus 34 da~d~~~~l~t~~~~~i~~~l~~~idp~v~~~~~~~~~~~~l~~v~t--~g~w~~~t~~~~v~e~~G~a~~ygd~~DiP~ 111 (336) -|+ ..++-++--+|..+..||.-++ ...+....|..+.. +|.- -.++.++.+...|.+..|.++++++. T Consensus 1 m~~----~~T~l~d~i~Pev~~~~v~~~~----~~~l~~~~~~~~~~~l~g~~-G~tv~iP~~~~ig~a~~~~~g~~i~~ 71 (274) T protein:vir:96 1 MAQ----GMTKLTNQIVPEVLAPMMQAEL----EKKLRFASFAEIDNTLVGQP-GDTLTFPAFIYSGDAKVVAEGEKIPT 71 (274) T ss_pred CCc----ceeehhheechHHHHHHHHHHH----HhhhhccccceecccccCCC-CCEEEeeeecCCCccccccCCCccch Confidence 011 1345556667888888875443 34444455544432 2332 37899999999999999999999998 Q ss_pred eeeeeeeeeeeEEEEEEEEEeCHHHHHHHHHhCCCHHHHHHHHHHHHHHHhhccEEEeeccccceEEEEecCCCCccccc Q lcl|Aclame:pro 112 SGTNINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASELNYSSALGLAKFLNGSYLFGVAGLENYGLINDPSLSAPITA 191 (336) Q Consensus 112 vd~~~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~K~~aAr~a~e~~~n~i~~~Gd~~~g~~GllN~Pnl~~~~~~ 191 (336) -..........+...+-++.++ ++.+.+. +-++..+-...+..++.+.+++..+ ..++..... . T Consensus 72 ~~lt~~~~~~~i~~~~~a~~i~--D~~~~~~-~~d~~~~~~~~~~~~~a~~vd~~i~---------~~l~~a~~~----~ 135 (274) T protein:vir:96 72 DILETKKREAKIRKIAKGTSIS--DEALLSG-YGDPQGEQVRQHGLAHANKVDDDVL---------EALKSAKLT----V 135 (274) T ss_pred hhcccceeEEEeeeeecceeeh--HHHHhhc-cchHHHHHHHHHHHHHHHHHHHHHH---------HHHhccccc----c Confidence 8999888888888766665555 5555444 3455556666666777766665322 111100000 0 Q ss_pred ccccccccCHHHHHHHHHHHHHHHHHHhCCceeccCCcEEEecHHHHHhcccCC--------CCCccHHH-HHHHhCCcc Q lcl|Aclame:pro 192 TTPWSGSPAVEAVVNEVVTLFQVLQTQSQGIITQEAVLHMGLPPTAMSDLSKTN--------QYGLSAAA-KLKEIFPKL 262 (336) Q Consensus 192 ~t~w~~~~T~~eI~~Di~~l~~~l~~~t~g~v~~~~p~tL~Lp~~~~~~Ls~~~--------~~~~Tvl~-~l~~n~pnl 262 (336) ++ ++.+ ++.|.+++..+-.. ...+..|+++|..+..|.+-. +.+..++- =.--.|-++ T Consensus 136 ~~---~~~~----~d~i~~A~~~lgd~------~~~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~~~~G~ig~~~G~ 202 (274) T protein:vir:96 136 EA---DITK----LTGLQTAIDKFNDE------DLEPMVLFISPLDAGKLRGDATTNFTRATELGDDVIVKGAFGEALGA 202 (274) T ss_pred cc---cccC----HHHHHHHHHHhccc------cccccEEEeCHHHHHHHHhhccccccccccccccceeccccceecCe Confidence 10 1123 34445554444221 125678999999999886421 11111100 000012234 Q ss_pred EEEEcccccCCCCceEEEEEEeeCCCceEEEEeCchhh--cccceecCCceEEeeecceeeeEEecccceeeeccC Q lcl|Aclame:pro 263 EFVTIPEYDTASGRLVQLWAPRVEGKDTATCGFTEKMR--AHSIERYSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) Q Consensus 263 ~i~~~pel~~a~G~~~~~~~~~~~~~~~~~~~~p~~~~--~l~~~~~~~~~~v~~~~rt~Gv~ir~P~ai~~~~GI 336 (336) +|+....+. -...+++- +.-+......+.+ ..-.+.+... .=..-..+|+-+.+|-.++.+.-= T Consensus 203 ~Vi~s~~~~---~~t~~l~~-----~gA~~~~~~~~~~vE~~Rd~~~~~d--~i~~~~~y~~~~~~~~~~v~~tk~ 268 (274) T protein:vir:96 203 VIVRSNKLE---AGTAILAK-----KGAVKLITKRDFFLETDRDPSTKTT--ALYSDKHYVAYLYDESKAVKITKG 268 (274) T ss_pred EEEEeCCCC---CceEEEEe-----ccceeeeecCCcccccccccccccC--EEEEeEEEEEEEEcCCcEEEEEcC Confidence 444332221 11222221 1111111111111 1001111111 111225678888899877776633 No 93 >protein:vir:95898 Length: 274 # NCBI annotation: ORF014 # Family: family:all:522 # MgeID: mge:1588 # MgeName: 71 # Cross-refs: genbank:acc:YP_240385;genbank:gi:66396054;genbank:GeneID:5133409 Probab=93.34 E-value=0.0079 Score=32.11 Aligned_cols=255 Identities=9% Similarity=0.000 Sum_probs=128.8 Q ss_pred hhhhhcCccccCCcchHHHHHHHhhCceeeeeeccccchhhhccccc--CCCcceeeEEEeeeecccceEEeecccCCce Q lcl|Aclame:pro 34 DAADLSPHLSSTGSSGIPNYLTTYVDPSVIDILVAPMKAAELVGESK--KGDWTTLVAAFITAEPTTTVATYGDYSSDGD 111 (336) Q Consensus 34 da~d~~~~l~t~~~~~i~~~l~~~idp~v~~~~~~~~~~~~l~~v~t--~g~w~~~t~~~~v~e~~G~a~~ygd~~DiP~ 111 (336) -|+ ..++-++--+|..+..||.-++ ...+....|..+.. +|.- -.++.++.+...|.+..|.++++++. T Consensus 1 m~~----~~T~l~d~i~Pev~~~~v~~~~----~~~l~~~~~~~~~~~l~g~~-G~tv~iP~~~~ig~a~~~~~g~~i~~ 71 (274) T protein:vir:95 1 MAQ----GMTKLTNQIVPEVLAPMMQAEL----EKKLRFASFAEIDNTLVGQP-GDTLTFPAFIYSGDAKVVAEGEKIPT 71 (274) T ss_pred CCc----ceeehhheechHHHHHHHHHHH----HhhhhccccceecccccCCC-CCEEEeeeecCCCccccccCCCccch Confidence 011 1345556667888888875443 34444455544432 2332 37899999999999999999999998 Q ss_pred eeeeeeeeeeeEEEEEEEEEeCHHHHHHHHHhCCCHHHHHHHHHHHHHHHhhccEEEeeccccceEEEEecCCCCccccc Q lcl|Aclame:pro 112 SGTNINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASELNYSSALGLAKFLNGSYLFGVAGLENYGLINDPSLSAPITA 191 (336) Q Consensus 112 vd~~~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~K~~aAr~a~e~~~n~i~~~Gd~~~g~~GllN~Pnl~~~~~~ 191 (336) -..........+...+-++.++ ++.+.+. +-++..+-...+..++.+.+++..+ ..++..... . T Consensus 72 ~~lt~~~~~~~i~~~~~a~~i~--D~~~~~~-~~d~~~~~~~~~~~~~a~~vd~~i~---------~~l~~a~~~----~ 135 (274) T protein:vir:95 72 DILETKKREAKIRKIAKGTSIS--DEALLSG-YGDPQGEQVRQHGLAHANKVDDDVL---------EALKSAKLT----V 135 (274) T ss_pred hhcccceeEEEeeeeecceeeh--HHHHhhc-cchHHHHHHHHHHHHHHHHHHHHHH---------HHHhccccc----c Confidence 8999888888888766665555 5555444 3455556666666777766665322 111100000 0 Q ss_pred ccccccccCHHHHHHHHHHHHHHHHHHhCCceeccCCcEEEecHHHHHhcccCC--------CCCccHHH-HHHHhCCcc Q lcl|Aclame:pro 192 TTPWSGSPAVEAVVNEVVTLFQVLQTQSQGIITQEAVLHMGLPPTAMSDLSKTN--------QYGLSAAA-KLKEIFPKL 262 (336) Q Consensus 192 ~t~w~~~~T~~eI~~Di~~l~~~l~~~t~g~v~~~~p~tL~Lp~~~~~~Ls~~~--------~~~~Tvl~-~l~~n~pnl 262 (336) ++ ++.+ ++.|.+++..+-.. ...+..|+++|..+..|.+-. +.+..++- =.--.|-++ T Consensus 136 ~~---~~~~----~d~i~~A~~~lgd~------~~~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~~~~G~ig~~~G~ 202 (274) T protein:vir:95 136 EA---DITK----LTGLQTAIDKFNDE------DLEPMVLFISPLDAGKLRGDATTNFTRATELGDDVIVKGAFGEALGA 202 (274) T ss_pred cc---cccC----HHHHHHHHHHhccc------cccccEEEeCHHHHHHHHhhccccccccccccccceeccccceecCe Confidence 10 1123 34445554444221 125678999999999886421 11111100 000012234 Q ss_pred EEEEcccccCCCCceEEEEEEeeCCCceEEEEeCchhh--cccceecCCceEEeeecceeeeEEecccceeeeccC Q lcl|Aclame:pro 263 EFVTIPEYDTASGRLVQLWAPRVEGKDTATCGFTEKMR--AHSIERYSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) Q Consensus 263 ~i~~~pel~~a~G~~~~~~~~~~~~~~~~~~~~p~~~~--~l~~~~~~~~~~v~~~~rt~Gv~ir~P~ai~~~~GI 336 (336) +|+....+. -...+++- +.-+......+.+ ..-.+.+... .=..-..+|+-+.+|-.++.+.-= T Consensus 203 ~Vi~s~~~~---~~t~~l~~-----~gA~~~~~~~~~~vE~~Rd~~~~~d--~i~~~~~y~~~~~~~~~~v~~tk~ 268 (274) T protein:vir:95 203 VIVRSNKLE---AGTAILAK-----KGAVKLITKRDFFLETDRDPSTKTT--ALYSDKHYVAYLYDESKAVKITKG 268 (274) T ss_pred EEEEeCCCC---CceEEEEe-----ccceeeeecCCcccccccccccccC--EEEEeEEEEEEEEcCCcEEEEEcC Confidence 444332221 11222221 1111111111111 1001111111 111225678888899877776633 No 94 >protein:vir:7409 Length: 408 # NCBI annotation: major structural protein # Family: family:all:21 # MgeID: mge:146 # MgeName: P335 # Cross-refs: genbank:acc:NP_839926;genbank:gi:30089896;genbank:GeneID:1260683 Probab=93.01 E-value=0.0091 Score=31.76 Aligned_cols=298 Identities=8% Similarity=-0.037 Sum_probs=131.6 Q ss_pred CchH-HHHHHHhhccee------ccchhhhhhhhhhhhhhh---------hhhhcCcc--ccCCcc--hHHHHHHHhhCc Q lcl|Aclame:pro 1 MRDA-QRIQNLARAGVI------LPRSVKNVSTPLAEYAMD---------AADLSPHL--SSTGSS--GIPNYLTTYVDP 60 (336) Q Consensus 1 m~~~-~~~~~l~~~g~~------~~~~~~~~~~~~~~~~~d---------a~d~~~~l--~t~~~~--~i~~~l~~~idp 60 (336) +++. .+....+..+.. ................+. .......+ .+.+++ .+|..+. + T Consensus 59 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~gg~~vP~~~~----~ 134 (408) T protein:vir:74 59 LREQLVEAQAEQVVNMREEEKGPLNKSENELKDKFVKDFVNMVRNPMAFLNTVSSKTETSGSDSAAGLTIPQDIR----T 134 (408) T ss_pred HHHHHHHHHHHHHhhccccccccccchhhhhHHHHHHHHHHHHhcchhhhhhhhhhhhcccccCCCceeechhHh----h Confidence 1110 000011111110 000000000000000000 00111111 122222 2566555 3 Q ss_pred eeeeeeccccchhhhcccccCCCcceeeEEEeeeeccc-ceEEeecccCCce-eeeeeeeeeeeEEEEEEEEEeCHHHHH Q lcl|Aclame:pro 61 SVIDILVAPMKAAELVGESKKGDWTTLVAAFITAEPTT-TVATYGDYSSDGD-SGTNINYPQRQSYFFQTWTRWGERELE 138 (336) Q Consensus 61 ~v~~~~~~~~~~~~l~~v~t~g~w~~~t~~~~v~e~~G-~a~~ygd~~DiP~-vd~~~~~~~~~v~~~~~~~~y~~~El~ 138 (336) .|++.+........++++..... ....+.+......+ .+...+...++|- .+..........+.++..+.+|.+=+. T Consensus 135 ~Ii~~~~~~~~l~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~v~E~~~~~~~~~~~~~~i~~~~~k~~~~~~iS~ell~ 213 (408) T protein:vir:74 135 MINTLVRQYDSLQQYVRVESVST-SSGSRVYEKWTDVTPLKAMDEEDGKIPDLDNPRLTIIKYLIKRYAGIITATNTLLK 213 (408) T ss_pred HHHHHHhhhcchhhhcceeeccC-CcceEEEEeecCCcccccccccccccccccccceeeEEeeeeeEEeeehhHHHHHh Confidence 55666666666666665543221 12233444433333 3445566778874 557888888899999998888876443 Q ss_pred HHHHhCCCHHHHHHHHHHHHHHHhhccEEEeeccccceEEEEecCCCCcccccccccccccCHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 139 MAGAGRVDLASELNYSSALGLAKFLNGSYLFGVAGLENYGLINDPSLSAPITATTPWSGSPAVEAVVNEVVTLFQVLQTQ 218 (336) Q Consensus 139 ~A~~~g~~l~~~K~~aAr~a~e~~~n~i~~~Gd~~~g~~GllN~Pnl~~~~~~~t~w~~~~T~~eI~~Di~~l~~~l~~~ 218 (336) ....+|.+.-.....+++.+.+|+-.+.|++... + .-...+.+.|++.++..+..-. T Consensus 214 ---ds~~~l~~~i~~~l~~~~~~~~d~~il~G~G~~~----------~--------~~~~~~~~~i~~~~~~~l~~~~-- 270 (408) T protein:vir:74 214 ---DTAENILAWLSSWIAKKVVVTRNQAIIAAMGTVP----------K--------KPTIANFDDVITMINTSVDPAI-- 270 (408) T ss_pred ---hchHHHHHHHHHHHHHHHHHHHHHHHhhcccccc----------c--------ccccccHHHHHHHHHHhhhhhh-- Confidence 3455778888888888888888888888875421 1 0012244444443332222111 Q ss_pred hCCceeccCCcEEEecHHHHHhccc-CCCCCccHHHH-HHHhCCc----cEEEEcc--cccCCCCceEEEEEEeeCCCce Q lcl|Aclame:pro 219 SQGIITQEAVLHMGLPPTAMSDLSK-TNQYGLSAAAK-LKEIFPK----LEFVTIP--EYDTASGRLVQLWAPRVEGKDT 290 (336) Q Consensus 219 t~g~v~~~~p~tL~Lp~~~~~~Ls~-~~~~~~Tvl~~-l~~n~pn----l~i~~~p--el~~a~G~~~~~~~~~~~~~~~ 290 (336) .....++|.+..+..|.+ .+..|.-++.- +....|. ..++..+ .+...+.+...+++-... +- T Consensus 271 -------~~~a~~v~n~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~i~~gd~~--~~ 341 (408) T protein:vir:74 271 -------IATSSLLTNQSGLNKLALVKTAEGKYLLEPDPTKPNSYLIKGKQVIVVADRWLPNSGSTVYPLYYGDMS--QA 341 (408) T ss_pred -------cCCCEEEEcHHHHHHHHHhhcCCCceEeccCcCCCCCceecceeeEEecCcccccccCCcceEEEEehh--cc Confidence 112368999999988864 23334333210 1111111 1121111 122222222222221110 00 Q ss_pred EEEE--eCchhhcccce---ecCCceEEeeecceeeeEEecccceeeeccC Q lcl|Aclame:pro 291 ATCG--FTEKMRAHSIE---RYSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) Q Consensus 291 ~~~~--~p~~~~~l~~~---~~~~~~~v~~~~rt~Gv~ir~P~ai~~~~GI 336 (336) ..+. -...+...+.. ...-...+-+..|.+|. ++.|.||+..+.- T Consensus 342 ~~~~~~~~~~i~~~~~~~~~f~~~~~~~r~~~r~d~~-~~~~~a~~~~~~~ 391 (408) T protein:vir:74 342 ITLFDRENMSLLPTNIGAGAFETDTTKIRVIDRFDVK-ATDSEALVAGSFT 391 (408) T ss_pred EEEEEecceEEEEeccccchhhcceeeEEEEEeeCcE-EecccceEEEEee Confidence 0000 01111111110 01123445667777775 6679998888754 No 95 >protein:vir:3870 Length: 400 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:82 # MgeName: A2 # Cross-refs: genbank:acc:NP_680487;swissprot:trembl:q8ltc0;genbank:gi:22296527;interpro:IPR006444;uniprot:Q8LTC0;genbank:GeneID:951713 Probab=92.58 E-value=0.011 Score=31.36 Aligned_cols=287 Identities=9% Similarity=-0.004 Sum_probs=125.3 Q ss_pred CchHHHHHHHhhcce------------------------------eccchhhhhhhhhhhhhhhhhhhcCccccCCcc-- Q lcl|Aclame:pro 1 MRDAQRIQNLARAGV------------------------------ILPRSVKNVSTPLAEYAMDAADLSPHLSSTGSS-- 48 (336) Q Consensus 1 m~~~~~~~~l~~~g~------------------------------~~~~~~~~~~~~~~~~~~da~d~~~~l~t~~~~-- 48 (336) .+..+...+..+... .+-...................+.....+.+++ T Consensus 65 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~ 144 (400) T protein:vir:38 65 RDLYEAALKGNEQSSGKKPDHPEEHSYRDALNAYLHTRGRNTDGVNFEKTDVGTFAVLRAVPTDASDAVNAGVKAADAAS 144 (400) T ss_pred HHHHHHHHHHHhhcccccccchhhhhHHHHHHHHHhhHHHHHHHHHHHHHHHHHHhhhhhhhHHHHHHHhhcccccCCcc Confidence 000000000000000 000000000000000001111111112223333 Q ss_pred hHHHHHHHhhCceeeeeeccccchhhhcccccCCCcceeeEEEeeee-cccceEEeecccCCce-eeeeeeeeeeeEEEE Q lcl|Aclame:pro 49 GIPNYLTTYVDPSVIDILVAPMKAAELVGESKKGDWTTLVAAFITAE-PTTTVATYGDYSSDGD-SGTNINYPQRQSYFF 126 (336) Q Consensus 49 ~i~~~l~~~idp~v~~~~~~~~~~~~l~~v~t~g~w~~~t~~~~v~e-~~G~a~~ygd~~DiP~-vd~~~~~~~~~v~~~ 126 (336) .+|..+. ++|++.+........++++.+.+ ..+..|++.. ..|.+..++....+|- .+...+...-..+.+ T Consensus 145 ~vP~~~~----~~ii~~~~~~~~l~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~f~~i~~~~~k~ 217 (400) T protein:vir:38 145 TIPETIS----NTPQRELQTVVDLKPFTNVFQAS---TQKGTYPTVANATTKMVTVAELEKNPAMAKPEFKPVNWSVETY 217 (400) T ss_pred cccHHHH----HHHHHHHHhhhhhhhcceeEecc---CcceEEEEEecCCCccccccccccccccccccceeeEeehhhe Confidence 3565554 34455555555566666554332 1245667655 4567778887777764 456777777777888 Q ss_pred EEEEEeCHHHHHHHHHhCCCHHHHHHHHHHHHHHHhhccEEEeeccccceEEEEecCCCCcccccccccccccCHHHHHH Q lcl|Aclame:pro 127 QTWTRWGERELEMAGAGRVDLASELNYSSALGLAKFLNGSYLFGVAGLENYGLINDPSLSAPITATTPWSGSPAVEAVVN 206 (336) Q Consensus 127 ~~~~~y~~~El~~A~~~g~~l~~~K~~aAr~a~e~~~n~i~~~Gd~~~g~~GllN~Pnl~~~~~~~t~w~~~~T~~eI~~ 206 (336) +..+.+|.+=|. ....++.+.-....+.++...+|.-.++|..... +. ...+.+ T Consensus 218 ~~~~~is~ell~---ds~~~~~~~i~~~l~~~~~~~~~~~i~~~~~~~~---------------~~----~~~~~~---- 271 (400) T protein:vir:38 218 RQALPVSQESID---DSAIDLVGLIAQNGQQIKVNTTNGAVATLLKGFT---------------AK----TISSVD---- 271 (400) T ss_pred eeehhhHHHHHh---hhHHHHHHHHHHHHHHHHHHHHHHhhhhcccccc---------------cc----ccccHH---- Confidence 888888764333 3345677777777777788888876666654211 00 112333 Q ss_pred HHHHHHHHHHHHhCCceeccCCcEEEecHHHHHhccc-CCCCCccHHH-HHHHhCC----ccEEEEccccc-CCCCceEE Q lcl|Aclame:pro 207 EVVTLFQVLQTQSQGIITQEAVLHMGLPPTAMSDLSK-TNQYGLSAAA-KLKEIFP----KLEFVTIPEYD-TASGRLVQ 279 (336) Q Consensus 207 Di~~l~~~l~~~t~g~v~~~~p~tL~Lp~~~~~~Ls~-~~~~~~Tvl~-~l~~n~p----nl~i~~~pel~-~a~G~~~~ 279 (336) ||..++...... ...-.++|.|+.+..|.. .+..|.-++. -+...-| +..++.....- +..|.... T Consensus 272 ~~~~~~~~~~~~-------~~~a~~v~~~~~~~~l~~lkd~~G~~i~~~~~~~~~~~~l~G~pv~~~~~~~~~~~g~~~~ 344 (400) T protein:vir:38 272 DLKHINNVDLDP-------AYSRVIIASQSFYNFLDTVKDGNGRYLLQDSILTPSGKSVLGMPIAVVSDDTLGAAGEAHA 344 (400) T ss_pred HHHHHHHhhhhh-------hhCcEEEEcHHHHHHHHHhhccCCCeeeecCcCCCCccccccceeEEecccccCCCCceEE Confidence 344333322111 112378999999988864 3333443321 0111111 12222222111 22344333 Q ss_pred EEEEeeCC-----CceEEEEeCchhhcccceecCCceEEeeecceeeeEEecccceeeeccC Q lcl|Aclame:pro 280 LWAPRVEG-----KDTATCGFTEKMRAHSIERYSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) Q Consensus 280 ~~~~~~~~-----~~~~~~~~p~~~~~l~~~~~~~~~~v~~~~rt~Gv~ir~P~ai~~~~GI 336 (336) +|.+-.+. .....+.+. .+. .-....-+..|.+|.+ ..|.+|+.+..- T Consensus 345 ~~gd~s~~~~~~~~~~~~~~~~----~~~----~~~~~~~~~~r~d~~~-~~~~a~~~l~~~ 397 (400) T protein:vir:38 345 FLGDIKRAILFANRADFMVRWV----DDQ----IYGQFLQAGMRFGVSV-ADEKAGYFLTYT 397 (400) T ss_pred EEEeccccEEEEeecceEEEEe----ccc----ccceeEEEEEEeccEE-ecccceEEEEee Confidence 33321100 001111111 111 1112334556665554 469998887766 No 96 >protein:vir:3991 Length: 404 # NCBI annotation: major structural protein # Family: family:all:21 # MgeID: mge:319 # MgeName: BK5-T # Cross-refs: genbank:acc:NP_116499;genbank:gi:14251132;genbank:GeneID:921252 Probab=92.18 E-value=0.012 Score=31.02 Aligned_cols=299 Identities=10% Similarity=-0.037 Sum_probs=130.6 Q ss_pred CchHHHHHHHhhc-ceeccc--hhhhhh----hhhhhhhhh-----hhhhcCc--cccCCcc--hHHHHHHHhhCceeee Q lcl|Aclame:pro 1 MRDAQRIQNLARA-GVILPR--SVKNVS----TPLAEYAMD-----AADLSPH--LSSTGSS--GIPNYLTTYVDPSVID 64 (336) Q Consensus 1 m~~~~~~~~l~~~-g~~~~~--~~~~~~----~~~~~~~~d-----a~d~~~~--l~t~~~~--~i~~~l~~~idp~v~~ 64 (336) +++.+........ ...-+. ...... ..+..+... .+..... ..+.+++ .||..+.+ +|++ T Consensus 63 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~a~~~~t~~~gg~~iP~~~~~----~ii~ 138 (404) T protein:vir:39 63 LVEAQAEQVVNMREEEKGPLNKSEYELKDKFVKEFVNMVRNPMAFLNTVSSKTETSGSDSAAGLTIPQDIRT----MINT 138 (404) T ss_pred HHHHHHHHHhccccccccccccchhhhHHHHHHHHHHHHhcchhhhhhhhhhhhhcccccCCceeccHHHHH----HHHH Confidence 1111111100000 000000 000000 001000000 0001111 1222322 26666553 4455 Q ss_pred eeccccchhhhcccccCCCcceeeEEEeeeecccceEEeecccCCce-eeeeeeeeeeeEEEEEEEEEeCHHHHHHHHHh Q lcl|Aclame:pro 65 ILVAPMKAAELVGESKKGDWTTLVAAFITAEPTTTVATYGDYSSDGD-SGTNINYPQRQSYFFQTWTRWGERELEMAGAG 143 (336) Q Consensus 65 ~~~~~~~~~~l~~v~t~g~w~~~t~~~~v~e~~G~a~~ygd~~DiP~-vd~~~~~~~~~v~~~~~~~~y~~~El~~A~~~ 143 (336) ..........++.+.....-.-........+..+.+...+....+|- .+..........+.++..+.+|.+=+.- . T Consensus 139 ~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~d---s 215 (404) T protein:vir:39 139 LVRQYDSLQQYVRVESVSTSNGSRVYEKWTDVTPLTVMDAEDGKIPDLDNPRLTIIKYLIKRYAGIITATNTLLKD---T 215 (404) T ss_pred HHHhhhhHHhhcceeeccCCcceEEEEeecCCccceeeecCccccccccccceeeEEeeeeeEEeeehhHHHHHhh---c Confidence 55555556666555332211111122333455567788888888885 5578888888899999888888754443 3 Q ss_pred CCCHHHHHHHHHHHHHHHhhccEEEeeccccceEEEEecCCCCcccccccccccccCHHHHHHHHHHHHHHHHHHhCCce Q lcl|Aclame:pro 144 RVDLASELNYSSALGLAKFLNGSYLFGVAGLENYGLINDPSLSAPITATTPWSGSPAVEAVVNEVVTLFQVLQTQSQGII 223 (336) Q Consensus 144 g~~l~~~K~~aAr~a~e~~~n~i~~~Gd~~~g~~GllN~Pnl~~~~~~~t~w~~~~T~~eI~~Di~~l~~~l~~~t~g~v 223 (336) ..+|.+.-......++.+.+++-++.|++... +. + ...+.+ ||..++....... . T Consensus 216 ~~~l~~~i~~~l~~~~~~~~d~~il~g~g~~~----------~~-----~---~~~~~~----~i~~~~~~~~~~~---~ 270 (404) T protein:vir:39 216 AENILAWLSSWIAKKVVVTRNQAIIAAMGTVP----------KK-----P---TIAKFD----DVITMINTSVDPA---I 270 (404) T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHhcccccc----------cc-----c---ccccHH----HHHHHHHHhhhhh---h Confidence 46778888888888888888888888875421 11 0 112333 3444433221111 1 Q ss_pred eccCCcEEEecHHHHHhccc-CCCCCccHHHH-HHHhCC----ccEEEE--cccccCCCCceEEEEEEeeCCCceEEEEe Q lcl|Aclame:pro 224 TQEAVLHMGLPPTAMSDLSK-TNQYGLSAAAK-LKEIFP----KLEFVT--IPEYDTASGRLVQLWAPRVEGKDTATCGF 295 (336) Q Consensus 224 ~~~~p~tL~Lp~~~~~~Ls~-~~~~~~Tvl~~-l~~n~p----nl~i~~--~pel~~a~G~~~~~~~~~~~~~~~~~~~~ 295 (336) . ....++|.++.+..|.. .+..|.-++.- +...-+ +..++. -..+...+.....+++-.. .+...+.. T Consensus 271 ~--~~a~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~~~gd~--~~~~~~~~ 346 (404) T protein:vir:39 271 I--ATSSLLTNQSGLNKLALVKTAEGKYLLEPDPTKPNSYLIKGKKVIVVADRWLPNSGSTVYPLYYGDM--SQAITLFD 346 (404) T ss_pred c--cCCEEEEcHHHHHHHHHhhccCCceeeccCcCCCCcceecceeEEEecccccCccCCCccEEEEEec--cccEEEEe Confidence 1 12368999999988864 33334333210 001111 111111 1111111222222222211 11011100 Q ss_pred Cchhh--cccce---ecCCceEEeeecceeeeEEecccceeeeccC Q lcl|Aclame:pro 296 TEKMR--AHSIE---RYSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) Q Consensus 296 p~~~~--~l~~~---~~~~~~~v~~~~rt~Gv~ir~P~ai~~~~GI 336 (336) -+.++ ..+.. ...-....-+..|.+ +.+++|.||+.+..- T Consensus 347 ~~~~~i~~~~~~~~~~~~~~~~~r~~~r~d-~~~~~~~a~~~~~~~ 391 (404) T protein:vir:39 347 RENMSLLPTNIGAGAFETDTTKIRVIDRFD-VKTTDSEALVAGSFT 391 (404) T ss_pred ecceEEEEeccchhhhhhceeeEEEEeeec-cEEecccceEEEEee Confidence 01111 10100 011223445666666 577889999998877 No 97 >protein:vir:102119 Length: 404 # NCBI annotation: phage major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1641 # MgeName: phiSM101 # Cross-refs: genbank:acc:YP_699941;genbank:gi:110804052;genbank:GeneID:4206662 Probab=92.09 E-value=0.013 Score=30.94 Aligned_cols=309 Identities=9% Similarity=0.015 Sum_probs=137.0 Q ss_pred CchHHHHH---H----Hhhccee-ccchh--hhh--hhh------------hhhhhh----hhhhhcCccccCCcc--hH Q lcl|Aclame:pro 1 MRDAQRIQ---N----LARAGVI-LPRSV--KNV--STP------------LAEYAM----DAADLSPHLSSTGSS--GI 50 (336) Q Consensus 1 m~~~~~~~---~----l~~~g~~-~~~~~--~~~--~~~------------~~~~~~----da~d~~~~l~t~~~~--~i 50 (336) +....... + +++.... ...+. ... ... ...... ....+. ...+.+++ .+ T Consensus 44 ~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~a~-~~~~~~~gg~~v 122 (404) T protein:vir:10 44 QAKIEAQKRKENIENNFNEDNVKSLNTGKEENVIYNGALFVRAIADNLLKQKNQRGLNLSEKEINAI-SENIDEDGGYAV 122 (404) T ss_pred HHHHHHHHHHHHHHHHHhhhhccccccccchhhHHHHHHHHHHHHHHHHHHHHhhhhcchhhHHhhh-ccccCCCCceee Confidence 00000000 0 0000000 00000 000 000 000000 000000 01122222 24 Q ss_pred HHHHHHhhCceeeeeeccccchhhhcccccCCCcceeeEEEeeeecccceEEeecccCCcee--eeeeeeeeeeEEEEEE Q lcl|Aclame:pro 51 PNYLTTYVDPSVIDILVAPMKAAELVGESKKGDWTTLVAAFITAEPTTTVATYGDYSSDGDS--GTNINYPQRQSYFFQT 128 (336) Q Consensus 51 ~~~l~~~idp~v~~~~~~~~~~~~l~~v~t~g~w~~~t~~~~v~e~~G~a~~ygd~~DiP~v--d~~~~~~~~~v~~~~~ 128 (336) |..+. ++|++..........++++.....- ...+.+......+.+...+.....|.. +..........+.++. T Consensus 123 P~~~~----~~ii~~~~~~~~l~~l~~~~~~~~~-~g~~~~~~~~~~~~~~~v~e~~~~~~~~~~~~f~~i~~~~~k~~~ 197 (404) T protein:vir:10 123 PEDIQ----TKINTRLKDTTDLYNMVDYEPVFTR-SGSRTYEKRSKQKPMKPLSENQQIPTNGDNGKLERFNFKLKDLAD 197 (404) T ss_pred chhHH----HHHHHHHhhhhhHhhhhceeeccCC-ccceEEEEecCCcceeeccccccccccccccceeeeEeeheeeEe Confidence 54443 4556655555556666666443211 123445555555566677777677664 3445566667777888 Q ss_pred EEEeCHHHHHHHHHhCCCHHHHHHHHHHHHHHHhhccEEEeeccc-cceEEEEecCCCCcccccccccccccCHHHHHHH Q lcl|Aclame:pro 129 WTRWGERELEMAGAGRVDLASELNYSSALGLAKFLNGSYLFGVAG-LENYGLINDPSLSAPITATTPWSGSPAVEAVVNE 207 (336) Q Consensus 129 ~~~y~~~El~~A~~~g~~l~~~K~~aAr~a~e~~~n~i~~~Gd~~-~g~~GllN~Pnl~~~~~~~t~w~~~~T~~eI~~D 207 (336) .+.+|.+=+. ....+|.+.-....++++.+.+++-+++|++. ....|+++.+...+.... ...+ ++| T Consensus 198 ~~~iS~ell~---ds~~~l~~~i~~~la~~~~~~~~~~il~G~g~~~~~~gi~~~~~~~~~~~~-----~~~~----~~~ 265 (404) T protein:vir:10 198 FMSIPNDLLK---FADKSLEDWIINWFVDKVRITRNAEILYGAGGDEHATGIMTANKFKKITLP-----KSPA----LKD 265 (404) T ss_pred eehhhHHHHh---hcHHHHHHHHHHHHHHHHHHHHHHHHhhcCCCCCcccceeeccccceeecc-----cccc----HHH Confidence 8888874333 33457888888888888888999999999874 456788886655422111 1123 345 Q ss_pred HHHHHHHHHHHhCCceeccCCcEEEecHHHHHhccc-CCCCCccHHH-HHHHhCC----ccEEEEccc--ccCCCCceEE Q lcl|Aclame:pro 208 VVTLFQVLQTQSQGIITQEAVLHMGLPPTAMSDLSK-TNQYGLSAAA-KLKEIFP----KLEFVTIPE--YDTASGRLVQ 279 (336) Q Consensus 208 i~~l~~~l~~~t~g~v~~~~p~tL~Lp~~~~~~Ls~-~~~~~~Tvl~-~l~~n~p----nl~i~~~pe--l~~a~G~~~~ 279 (336) +..+++.... . +. .....++|.+..+..|.+ .+..|.-++. -+....+ +.-++.++. +..+++.... T Consensus 266 ~~~~~~~~l~-~-~~---~~~~~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~l~G~PV~~~~~~~~~~~~~~~~~ 340 (404) T protein:vir:10 266 FKKCKNVELL-N-VF---KATSSWIVNQDGFNYLDSLEDKTGRPYLQPDPKDPTQYRFLGLPVIELPNDLLLSTESAIPV 340 (404) T ss_pred HHHHHHhhhh-c-cc---cCCCEEEEcHHHHHHHHHhhccCCceeeccCcCCCCCccccceeeEEecccccCCCCCccEE Confidence 5554442211 1 11 123468999998888854 2333332221 1111111 122332332 2233344444 Q ss_pred EEEEeeCCCceEEEEe--Cchhhcccc---eecCCceEEeeecceeeeEEecccceeeeccC Q lcl|Aclame:pro 280 LWAPRVEGKDTATCGF--TEKMRAHSI---ERYSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) Q Consensus 280 ~~~~~~~~~~~~~~~~--p~~~~~l~~---~~~~~~~~v~~~~rt~Gv~ir~P~ai~~~~GI 336 (336) ++.+- .+...+.. ...+..... ....-....-+..|.+ +.+++|.||+.++=- T Consensus 341 ~~gd~---s~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~r~d-~~v~~~~a~~~~~~~ 398 (404) T protein:vir:10 341 LLGDT---KEAYKYVSDGAYELATTNIGAGAFETNTTKARIIMRID-GNVKDSEALLIAEIP 398 (404) T ss_pred EEEec---cccEEEEEecceEEEEeccccchhhcCceEEEEEEeec-cEEecccceEEEEee Confidence 44321 11111100 111111110 0011223344555554 477888888766654 No 98 >protein:vir:4953 Length: 397 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:108 # MgeName: Sfi19 # Cross-refs: genbank:acc:NP_049929;genbank:gi:9632900;genbank:GeneID:1262076 Probab=92.04 E-value=0.013 Score=30.90 Aligned_cols=295 Identities=11% Similarity=-0.021 Sum_probs=133.7 Q ss_pred CchHHHHH----HHhhcce------eccchhhhhhhh----hhhh----hhhhhhhcCccccCCcc--hHHHHHHHhhCc Q lcl|Aclame:pro 1 MRDAQRIQ----NLARAGV------ILPRSVKNVSTP----LAEY----AMDAADLSPHLSSTGSS--GIPNYLTTYVDP 60 (336) Q Consensus 1 m~~~~~~~----~l~~~g~------~~~~~~~~~~~~----~~~~----~~da~d~~~~l~t~~~~--~i~~~l~~~idp 60 (336) ++...... .....+. ............ +..+ ..++ ..+....+.+++ .+|..+.. T Consensus 53 ~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~-~~~~~~~t~~~gg~~vP~~~~~---- 127 (397) T protein:vir:49 53 RDMFKEQYTEARANEVANMSEEEKKPLTKSEEEVKAGFVKDFKNLVRGRYQNL-LDSKTDASGSDAGLTIPQDIQT---- 127 (397) T ss_pred HHHHHHHHHHHHHHhhhccccccccccccchhHHHHHHHHHHHHHHhcchhHH-HHHhhccccccCcccccHhHHH---- Confidence 11111100 0000000 000000000000 0000 0000 001111223333 35666553 Q ss_pred eeeeeeccccchhhhcccccCCCcceeeEEEee-eecccceEEeecccCCce-eeeeeeeeeeeEEEEEEEEEeCHHHHH Q lcl|Aclame:pro 61 SVIDILVAPMKAAELVGESKKGDWTTLVAAFIT-AEPTTTVATYGDYSSDGD-SGTNINYPQRQSYFFQTWTRWGERELE 138 (336) Q Consensus 61 ~v~~~~~~~~~~~~l~~v~t~g~w~~~t~~~~v-~e~~G~a~~ygd~~DiP~-vd~~~~~~~~~v~~~~~~~~y~~~El~ 138 (336) +|++.+........++.+.....-. ..+.|.. .+..|.+...+.+..+|- .+.......-..+.++..+.+|..=++ T Consensus 128 ~ii~~~~~~~~l~~~~~~~~~~~~~-~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~i~~~~~k~~~~~~iS~ell~ 206 (397) T protein:vir:49 128 AIHTLVSQYDSLQEYVNVENVTTLT-GSRVYEKWTDITGLANIDDEAGKIADVDDPKLSLIKYTIKRYAGISTVTNSLLA 206 (397) T ss_pred HHHHHHHhhhhHHhhhceeecccCc-cceEEEeeccCCcceeeecCccccccccccceeeEEeeeeeEEeeehhHHHHHh Confidence 4556555555565665553322111 1223333 344577888888888874 567888888899999998888865444 Q ss_pred HHHHhCCCHHHHHHHHHHHHHHHhhccEEEeeccccceEEEEecCCCCcccccccccccccCHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 139 MAGAGRVDLASELNYSSALGLAKFLNGSYLFGVAGLENYGLINDPSLSAPITATTPWSGSPAVEAVVNEVVTLFQVLQTQ 218 (336) Q Consensus 139 ~A~~~g~~l~~~K~~aAr~a~e~~~n~i~~~Gd~~~g~~GllN~Pnl~~~~~~~t~w~~~~T~~eI~~Di~~l~~~l~~~ 218 (336) .+ ..++.+.-....++++.+.++.-.+.|++..... . ...+ ++||.+++..+... T Consensus 207 ds---~~~l~~~i~~~l~~~~~~~~d~ai~~G~g~~~~~------------~------~~~~----~d~i~~~~~~l~~~ 261 (397) T protein:vir:49 207 DS---AENILAWLSGWIAKKVVVTRNKAILEAIAALPTK------------P------TLTK----WDDIIDLEAKVDPA 261 (397) T ss_pred hh---HHHHHHHHHHHHHHHHHHHHHHHHHhhccccccc------------c------cccc----HHHHHHHHHhhhhh Confidence 33 3567777777788888888888888886543210 0 1122 34566666666443 Q ss_pred hCCceeccCCcEEEecHHHHHhccc-CCCCCccHHHH-HHHhCC----ccEEEE--ccccc-CCCCceEEEEEEeeCCCc Q lcl|Aclame:pro 219 SQGIITQEAVLHMGLPPTAMSDLSK-TNQYGLSAAAK-LKEIFP----KLEFVT--IPEYD-TASGRLVQLWAPRVEGKD 289 (336) Q Consensus 219 t~g~v~~~~p~tL~Lp~~~~~~Ls~-~~~~~~Tvl~~-l~~n~p----nl~i~~--~pel~-~a~G~~~~~~~~~~~~~~ 289 (336) -. ....++|.++.+..|.+ .+..|.-++.= +....+ ++-++. ...+. ++.+....++.+-. + T Consensus 262 ~~------~~a~~vmn~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~l~G~PV~~~~~~~~~~~~~~~~~i~~gd~~---~ 332 (397) T protein:vir:49 262 IK------QTSFFLTNTSGFTALKKVKNALGDYLMERDVKSPTGYSIDGFAVKEVADRWLANGTGGAMPLYFGDLK---Q 332 (397) T ss_pred hc------CCCEEEEcHHHHHHHHHhhcCCCceeeccCcCCCCCceecceeeEEecccccccccCCceeEEEeecc---c Confidence 21 23478999999988854 33334333210 111111 111211 11121 22334334433211 1 Q ss_pred eEEEE--eCchhhcccc---eecCCceEEeeecceeeeEEecccceeeeccC Q lcl|Aclame:pro 290 TATCG--FTEKMRAHSI---ERYSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) Q Consensus 290 ~~~~~--~p~~~~~l~~---~~~~~~~~v~~~~rt~Gv~ir~P~ai~~~~GI 336 (336) -..+. -...+...+. ....-...+-+..|.+| .+++|.+|+.+..= T Consensus 333 ~~~~~~~~~~~i~~~~~~~~~~~~~~~~~r~~~r~d~-~~~~~~a~~~~~~~ 383 (397) T protein:vir:49 333 AVTLFDRQHMSLLSTNIGGGAFETDTTKVRVIDRFDV-VATDTEAFVPASFK 383 (397) T ss_pred eEEEEeecceEEEEeccccchhhcCceeEEEEeeeCc-EEecccceEEEEee Confidence 01010 0111111110 00111233344455544 67788888876644 No 99 >protein:vir:93881 Length: 387 # NCBI annotation: ORF011 # Family: family:all:658 # MgeID: mge:1485 # MgeName: 3A # Cross-refs: genbank:acc:YP_239938;genbank:gi:66395599;genbank:GeneID:5130947 Probab=91.23 E-value=0.017 Score=30.31 Aligned_cols=282 Identities=9% Similarity=-0.004 Sum_probs=127.7 Q ss_pred Cc-----------------------------hHHHHHHHhhcceeccchhhhhhhhhhhhhhhhhhhcCc--cccCCcc- Q lcl|Aclame:pro 1 MR-----------------------------DAQRIQNLARAGVILPRSVKNVSTPLAEYAMDAADLSPH--LSSTGSS- 48 (336) Q Consensus 1 m~-----------------------------~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~da~d~~~~--l~t~~~~- 48 (336) ++ +.+.+.+..|.++ .....+...+.+...... ..+.+++ T Consensus 56 l~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~--------~~~~~~~~~~~~~~~~~al~~~t~s~gG 127 (387) T protein:vir:93 56 VERQVKDIEEKEKAKVKDTGEAYQSLNDHEKMVKAKAEFYRHAI--------LPNEFEKPSMEAQRLLHALPTGNDSGGD 127 (387) T ss_pred HHHHHHHHHHHHHHhhhhccccCCCcchhhHHHHHHHHHHHHHh--------hhhhhhhhhhhhHHHHHhhccCcCCCCc Confidence 00 0011111111000 000000000111000011 1233333 Q ss_pred -hHHHHHHHhhCceeeeeeccccchhhhcccccCCCcceeeEEEee-eecccceEEeecccCCceeeeeeeeeeeeEEEE Q lcl|Aclame:pro 49 -GIPNYLTTYVDPSVIDILVAPMKAAELVGESKKGDWTTLVAAFIT-AEPTTTVATYGDYSSDGDSGTNINYPQRQSYFF 126 (336) Q Consensus 49 -~i~~~l~~~idp~v~~~~~~~~~~~~l~~v~t~g~w~~~t~~~~v-~e~~G~a~~ygd~~DiP~vd~~~~~~~~~v~~~ 126 (336) .||..+.+ +|++.+...-....+..+.+.+.. .++. ....+.+...+.....|-.+.......-..+.+ T Consensus 128 ~~IP~~~~~----~Ii~~~~~~~~l~~~~~v~~~~~~-----~~p~~~~~~~~a~~v~E~~~~~~~~~~f~~v~~~~~k~ 198 (387) T protein:vir:93 128 KLLPKTLSK----EIVSEPFAKNQLREKARLTNIKGL-----EIPRVSYTLDDDDFITDVETAKELKLKGDTVKFTTNKF 198 (387) T ss_pred eeechhHHH----HHHHHHHhhchhhhheeeeecCCc-----eEEEEeecCCccccccCcccccccccccceeeeeheee Confidence 37777664 334444433334555555544432 2232 334456777888878888888878888888888 Q ss_pred EEEEEeCHHHHHHHHHhCCCHHHHHHHHHHHHHHHhhccEEE-eeccccceEEEEecCCCCcccccccccccccCHHHHH Q lcl|Aclame:pro 127 QTWTRWGERELEMAGAGRVDLASELNYSSALGLAKFLNGSYL-FGVAGLENYGLINDPSLSAPITATTPWSGSPAVEAVV 205 (336) Q Consensus 127 ~~~~~y~~~El~~A~~~g~~l~~~K~~aAr~a~e~~~n~i~~-~Gd~~~g~~GllN~Pnl~~~~~~~t~w~~~~T~~eI~ 205 (336) +..+.+|.+=|. ....++.+--....+.++.+.+++.+| .|++...-.|.++++.+... +....+ T Consensus 199 ~~~~~iS~ell~---Ds~~~l~~~i~~~la~~~~~~e~~~~~~~g~g~g~p~g~l~~~~~~~v-----------~~~~~~ 264 (387) T protein:vir:93 199 KVFAAISDTVIH---GSDVDLVNWVENALQSGLAAKERKDALAVSPKSGLDHMSFYNGSVKEV-----------EGADMY 264 (387) T ss_pred eeechhhHHHHh---hhHHHHHHHHHHHHHHHHHHHHHHhHhhcCCCccccceeeeccccccc-----------cccchH Confidence 888888855333 344567777777777777777666444 45555555788876554321 112235 Q ss_pred HHHHHHHHHHHHHhCCceeccCCcEEEecHH-HHHhcccCCCCCccHHHHHHHhCCccEEEEcccccCCC------CceE Q lcl|Aclame:pro 206 NEVVTLFQVLQTQSQGIITQEAVLHMGLPPT-AMSDLSKTNQYGLSAAAKLKEIFPKLEFVTIPEYDTAS------GRLV 278 (336) Q Consensus 206 ~Di~~l~~~l~~~t~g~v~~~~p~tL~Lp~~-~~~~Ls~~~~~~~Tvl~~l~~n~pnl~i~~~pel~~a~------G~~~ 278 (336) +||.+++.++...-. .+ -..+|.+. ....+....+.|-.++ .. .|+ +|-..|-..+++ |.-. T Consensus 265 d~i~~~~~~l~~~~~----~~--a~~~mn~~t~~~~~~~~~d~~~~~~---~~-~~~-~llG~PV~~~~~~~~~~~GDf~ 333 (387) T protein:vir:93 265 DAIINALADLHEDYR----DN--ATIYMRYADYVKIISVLSNGTTNFF---DT-PAE-KVFGKPVVFTDAAVKPIVGDFN 333 (387) T ss_pred HHHHHHHhccChhhh----cC--CEEEEechHHHHHHHHHhcCCCccc---cc-CCc-cccccceEEecCCCceeeeehh Confidence 667777776644321 11 13556544 3444443333332222 11 121 222222222211 1111 Q ss_pred EEEEEeeCCCceEEEEeCchhhcccceecCCceEEeeecceeeeEEecccceeeeccC Q lcl|Aclame:pro 279 QLWAPRVEGKDTATCGFTEKMRAHSIERYSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) Q Consensus 279 ~~~~~~~~~~~~~~~~~p~~~~~l~~~~~~~~~~v~~~~rt~Gv~ir~P~ai~~~~GI 336 (336) +.+.. ++ .+-+.+.. +.....+.+-+..|.+|.+ ++|-||+.+.-= T Consensus 334 ~~~~~-~~---------~~~~~~~~-~~~~~~~~~~~~~r~d~~v-~~~eA~~~l~~k 379 (387) T protein:vir:93 334 YFGIN-YD---------GTTYDTDK-DVKKGEYLFVLTAWYDQQR-TLDSAFRIAKAK 379 (387) T ss_pred hhhee-hh---------hheeeecc-cccCCceeEEEEeeeCcee-echhheEEEEee Confidence 11111 11 01111111 1122233444556776665 569999876432 No 100 >protein:vir:1025 Length: 408 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:20 # MgeName: bIL286 # Cross-refs: genbank:acc:NP_076679;genbank:gi:13095788;genbank:GeneID:920362 Probab=91.13 E-value=0.017 Score=30.24 Aligned_cols=298 Identities=9% Similarity=-0.043 Sum_probs=130.8 Q ss_pred Cch----HHHHHHHhhcceec----c--chhhhhh----hhhhhh-----hhhhhhhcCcc--ccCCcc--hHHHHHHHh Q lcl|Aclame:pro 1 MRD----AQRIQNLARAGVIL----P--RSVKNVS----TPLAEY-----AMDAADLSPHL--SSTGSS--GIPNYLTTY 57 (336) Q Consensus 1 m~~----~~~~~~l~~~g~~~----~--~~~~~~~----~~~~~~-----~~da~d~~~~l--~t~~~~--~i~~~l~~~ 57 (336) ++. ..+..+.++.+..- + ....... ..+..+ ..........+ .+.+++ .+|..+.+ T Consensus 56 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~t~~~gg~~vP~~~~~- 134 (408) T protein:vir:10 56 RDALREQLVEAQAEQVVNMREEEKGPLNKSENELKDKFVKDFVNMVRNPMAFMNTVSSKTETSGSDSAAGLTIPQDIRT- 134 (408) T ss_pred HHHHHHHHHHHHHHHHhccccccccccccchhhhHHHHHHHHHHHhhcchhhhhhhhhhhhhcccccCCceeccHhHHH- Confidence 100 00011111111110 0 0000000 000000 00001111112 222333 26766654 Q ss_pred hCceeeeeeccccchhhhcccccCCCcceeeEEE-eeeecccceEEeecccCCceee-eeeeeeeeeEEEEEEEEEeCHH Q lcl|Aclame:pro 58 VDPSVIDILVAPMKAAELVGESKKGDWTTLVAAF-ITAEPTTTVATYGDYSSDGDSG-TNINYPQRQSYFFQTWTRWGER 135 (336) Q Consensus 58 idp~v~~~~~~~~~~~~l~~v~t~g~w~~~t~~~-~v~e~~G~a~~ygd~~DiP~vd-~~~~~~~~~v~~~~~~~~y~~~ 135 (336) +|++.+........+..+.....-. ..+.+ ...+..+.+...+....+|-.+ ..........+.++..+.+|.+ T Consensus 135 ---~Ii~~~~~~~~l~~~~~~~~~~~~~-~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~i~~~~~k~~~~~~iS~e 210 (408) T protein:vir:10 135 ---MINTLVRQYDSLQQYVRVESVSTSN-GSRVYEKWTDVTPLTVMDAEDGKIPDLDNPQLTIIKYLIKRYAGIITATNT 210 (408) T ss_pred ---HHHHHHHhhchhhhhcceeeccCCc-ceEEEeeccccccceeeecCccccccccCcceeeEEeeeeeEEeeehhHHH Confidence 4556555555555655443221100 11222 2224456777888888888654 5778888888888888888866 Q ss_pred HHHHHHHhCCCHHHHHHHHHHHHHHHhhccEEEeeccccceEEEEecCCCCcccccccccccccCHHHHHHHHHHHHHHH Q lcl|Aclame:pro 136 ELEMAGAGRVDLASELNYSSALGLAKFLNGSYLFGVAGLENYGLINDPSLSAPITATTPWSGSPAVEAVVNEVVTLFQVL 215 (336) Q Consensus 136 El~~A~~~g~~l~~~K~~aAr~a~e~~~n~i~~~Gd~~~g~~GllN~Pnl~~~~~~~t~w~~~~T~~eI~~Di~~l~~~l 215 (336) =++- ...+|.+--....++++.+.+++-.+.|++... +. . ...+.++|++.++..+.. T Consensus 211 ll~d---s~~~l~~~i~~~l~~~~~~~~~~~il~g~g~~~----------~~--~------~~~~~~~l~~~~~~~~~~- 268 (408) T protein:vir:10 211 SLKD---TAENILAWLSSWIAKKVVVTRNQAIIEVMKAAP----------KK--P------TIAKFDDVITMINTAVDP- 268 (408) T ss_pred HHhh---chHHHHHHHHHHHHHHHHHHHHHHHhhcccccc----------cc--c------ccccHHHHHHHHHHhhhh- Confidence 4443 356777777888888888888887777776421 00 0 112344444333322211 Q ss_pred HHHhCCceeccCCcEEEecHHHHHhccc-CCCCCccHHHH-HHHhCC----ccEEEEcc--cccCCCCceEEEEEEeeCC Q lcl|Aclame:pro 216 QTQSQGIITQEAVLHMGLPPTAMSDLSK-TNQYGLSAAAK-LKEIFP----KLEFVTIP--EYDTASGRLVQLWAPRVEG 287 (336) Q Consensus 216 ~~~t~g~v~~~~p~tL~Lp~~~~~~Ls~-~~~~~~Tvl~~-l~~n~p----nl~i~~~p--el~~a~G~~~~~~~~~~~~ 287 (336) +. . ..-.++|.+..+..|.+ .+..|.-+++- +....| +..++... .+...+.+...+++-+. T Consensus 269 -----~~-~--~~a~~v~n~~~~~~l~~lkd~~G~~i~~~~~~~~~~~~l~G~PV~~~~~~~~~~~~~~~~~i~~gd~-- 338 (408) T protein:vir:10 269 -----AI-I--ATSSLLTNQSGLNKLALVKTAEGKYLLEPDPTKPNSYLIKGKQVIVVADRWLPNTGSTVYPLYYGDM-- 338 (408) T ss_pred -----hh-c--cCCEEEEcHHHHHHHHHhhccCCceEeccCcCCCCCceecceeeEEecccccCccCCCceEEEEEeh-- Confidence 11 1 12368999999998864 34344444321 111111 11122111 12121222222222211 Q ss_pred CceEEEEe--Cchhhcccc-ee--cCCceEEeeecceeeeEEecccceeeeccC Q lcl|Aclame:pro 288 KDTATCGF--TEKMRAHSI-ER--YSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) Q Consensus 288 ~~~~~~~~--p~~~~~l~~-~~--~~~~~~v~~~~rt~Gv~ir~P~ai~~~~GI 336 (336) .+...+.. .+.+...+. .. ..-....-+..|.+| .++.|.||+.+..- T Consensus 339 ~~~~~~~~~~~~~v~~~~~~~~~f~~~~~~~r~~~r~d~-~v~~~~a~~~~~~~ 391 (408) T protein:vir:10 339 SQAITLFDRENMSLLPTNIGAGAFETDTTKIRVIDRFDV-KATDSEALVAGSFS 391 (408) T ss_pred hccEEEEEecceEEEEcccccchhhcCceEEEEEEeecc-EEeccccEEEEEee Confidence 11000100 111111110 00 112345556667666 55669999988755 No 101 >protein:vir:79078 Length: 307 # NCBI annotation: gp8 # Family: family:all:908 # MgeID: mge:1862 # MgeName: phiE255 # Cross-refs: genbank:acc:YP_001111208;genbank:gi:134288798;genbank:GeneID:4960752 Probab=90.78 E-value=0.019 Score=30.01 Aligned_cols=263 Identities=13% Similarity=0.083 Sum_probs=105.4 Q ss_pred hhhhhhcCccccCCcchHHHHHHHhhCceeeeeeccccchhhhcccccCCCcceeeEEEeeeecccceEEeec-----cc Q lcl|Aclame:pro 33 MDAADLSPHLSSTGSSGIPNYLTTYVDPSVIDILVAPMKAAELVGESKKGDWTTLVAAFITAEPTTTVATYGD-----YS 107 (336) Q Consensus 33 ~da~d~~~~l~t~~~~~i~~~l~~~idp~v~~~~~~~~~~~~l~~v~t~g~w~~~t~~~~v~e~~G~a~~ygd-----~~ 107 (336) |-. -.+|- ..++-+.++-..|= -+++-+++|||....+.-.-...+| +. ++....| .+ T Consensus 1 m~~-~~~~~---~~dp~LT~~A~gy~--------n~~~Iad~lfP~vpV~~~~~k~~~f---~~--e~f~~~~t~ra~~~ 63 (307) T protein:vir:79 1 MGR-LSKLR---IVDPVLTNLAIGYT--------NAEFIGQTLMPVVEVEKEGGKIPKF---GK--ESFRLYQTERALRA 63 (307) T ss_pred CCC-CCCCc---ccCHHHHHHHhhcc--------chhhhhhhcCCcccccccccceeee---cc--ccccccccccccCC Confidence 111 11111 11333333333333 3457778888876554322222333 21 1110000 01 Q ss_pred CCceeee-eeeeeeeeEEEEEEEEEeCHHHHHHHHHhCCCHHHHHHHHH----HHHHHHhhccEEEeeccccceEEEEec Q lcl|Aclame:pro 108 SDGDSGT-NINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASELNYSS----ALGLAKFLNGSYLFGVAGLENYGLIND 182 (336) Q Consensus 108 DiP~vd~-~~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~K~~aA----r~a~e~~~n~i~~~Gd~~~g~~GllN~ 182 (336) +...++. ..+.....+. +.+..+.++ .+..+..++++.+++..-. .+..|...-++++-.. |. T Consensus 64 ~~~~v~~~~~~~~~~~~~--~~~l~~~id-~r~~~~~~~~~~~~Av~~l~d~I~l~~E~~~A~l~~~~~---------~y 131 (307) T protein:vir:79 64 KSNRMNPEDIDSVDVNLD--EHDLEYPID-YREDQESAFPLEQAAVQTATDAIQLRREKMIADLSQNPS---------SY 131 (307) T ss_pred Ccceeeeecccccccccc--ccchhhccc-chhcCCCCCCHHHHHHHHHHHHHHhHHHHHHHHHhcccc---------cc Confidence 1111110 0000000000 111111111 1233344555555443332 3444444444554322 12 Q ss_pred CCCCcc-cccccccccccCHHHHHHHHHHHHHHHHHHhCCceeccCCcEEEecHHHHHhccc---------CCCCC-ccH Q lcl|Aclame:pro 183 PSLSAP-ITATTPWSGSPAVEAVVNEVVTLFQVLQTQSQGIITQEAVLHMGLPPTAMSDLSK---------TNQYG-LSA 251 (336) Q Consensus 183 Pnl~~~-~~~~t~w~~~~T~~eI~~Di~~l~~~l~~~t~g~v~~~~p~tL~Lp~~~~~~Ls~---------~~~~~-~Tv 251 (336) |+-... .++++.|.+ .+.| ++.||.+....+...++ -.|++++|....+..|.. .+..+ +| T Consensus 132 ~~~~k~tLsgt~~Wsd-~~sD-Pi~di~~~~~ai~~~~g-----~~Pn~~vlg~~a~~~l~~h~~i~~~lk~~~~g~it- 203 (307) T protein:vir:79 132 AAGNKKQLSATEKFTA-ANSD-PVGVIEDGKEAIRTKIG-----RRPNTMVIGASAYKTLKAHPQLIEKIKYSMKGIVT- 203 (307) T ss_pred CCCceEEEccCcccCC-CCCC-cHHHHHHHHHHHHHhhC-----CccceEEeCHHHHHHHhcCHHHHHHhcCccccccC- Confidence 222211 223444544 5655 89999999999988775 369999999999998853 12223 34 Q ss_pred HHHHHHhCCccEEEEccc--ccCCCC-------ce-EEEEEEeeCCCceEEEEeC-chhhcccceecCCceEEeeec--- Q lcl|Aclame:pro 252 AAKLKEIFPKLEFVTIPE--YDTASG-------RL-VQLWAPRVEGKDTATCGFT-EKMRAHSIERYSSYFRQKKSA--- 317 (336) Q Consensus 252 l~~l~~n~pnl~i~~~pe--l~~a~G-------~~-~~~~~~~~~~~~~~~~~~p-~~~~~l~~~~~~~~~~v~~~~--- 317 (336) .++|++-+ .++.+.+-+ +.++++ +. ..+++...-+.....+.-| .-|.+ +.++..+..+..+ T Consensus 204 ~~~la~l~-~v~~V~vg~a~y~~~~~~~~~iw~~~~~l~y~~~~~~~~~~~~~~ps~Gyt~---~~~g~~~~d~~~~~~~ 279 (307) T protein:vir:79 204 VDLLKEIF-EVENIAVGEAIYADDKDRFTDIWGANIVLAYVPLQRGGQQRTPYEPSYGYTL---RKKGNPVVDTRIEDGK 279 (307) T ss_pred HHHHHHHh-CceeEEEeeeeeecccccchhcCCCceEEEecccccCCCCCcccccccceeE---EecCceEEecccCCCc Confidence 35565533 344332222 223332 22 2233322111111111011 11111 2222223333332 Q ss_pred --ceeeeEEecccceeeeccC Q lcl|Aclame:pro 318 --GTWGAVIFRPFAVAQMIGV 336 (336) Q Consensus 318 --rt~Gv~ir~P~ai~~~~GI 336 (336) .+...++..|.-++.-.|. T Consensus 280 ~~~vrv~~~~~~~i~~~~~G~ 300 (307) T protein:vir:79 280 LELVRATDIFRPYLLGADAGY 300 (307) T ss_pred eeEEeecccccceeeccccch Confidence 2233344567666666565 No 102 >protein:vir:95107 Length: 270 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1549 # MgeName: X2 # Cross-refs: genbank:acc:YP_240822;genbank:gi:66394683;genbank:GeneID:5133901 Probab=90.32 E-value=0.017 Score=30.23 Aligned_cols=253 Identities=10% Similarity=-0.023 Sum_probs=127.9 Q ss_pred hhhhhhhhcCccccCCcchHHHHHHHhhCceeeeeeccccchhhhcccccCCCc-ceeeEEEeeeecccceEEeecccCC Q lcl|Aclame:pro 31 YAMDAADLSPHLSSTGSSGIPNYLTTYVDPSVIDILVAPMKAAELVGESKKGDW-TTLVAAFITAEPTTTVATYGDYSSD 109 (336) Q Consensus 31 ~~~da~d~~~~l~t~~~~~i~~~l~~~idp~v~~~~~~~~~~~~l~~v~t~g~w-~~~t~~~~v~e~~G~a~~ygd~~Di 109 (336) || .++-++--+|..|..||-.++ -...+...+..+++..+- .-.+++++.++..|++..+.+++++ T Consensus 1 Ma---------~T~~~d~I~Pev~~~~V~e~~----~~~~~~~~~~~~d~~L~g~~G~ti~~P~~~~igdae~~~eg~~i 67 (270) T protein:vir:95 1 MT---------QTKKANLINPEVLANVVSAQM----QNAIRFTPYAVTDDTLVGQPGDTITRPKYAYIGAAEDLQEGVAM 67 (270) T ss_pred CC---------ceehhhhcchHHHHHHHHHHH----HhHHhhccccccccccCCCCCCEEEeeeecCCCccccccCCCcc Confidence 11 244556668999999984443 223344455555433211 2367999999999999999999999 Q ss_pred ceeeeeeeeeeeeEEEEEEEEEeCHHHHHHHHHhCCCHHHHHHHHHHHHHHHhhccEEEeeccccceEEEEecCCCCccc Q lcl|Aclame:pro 110 GDSGTNINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASELNYSSALGLAKFLNGSYLFGVAGLENYGLINDPSLSAPI 189 (336) Q Consensus 110 P~vd~~~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~K~~aAr~a~e~~~n~i~~~Gd~~~g~~GllN~Pnl~~~~ 189 (336) +..+........++.+.+-+++++.+ .+....+=+ ..+-.....+.+.+++++..+ +.+ .|... T Consensus 68 ~~~~lt~~~~~a~i~~~gk~~~itD~--a~~~~~~dp-~~~~~~q~a~~~a~~~d~~li---~~l--~~a~~-------- 131 (270) T protein:vir:95 68 DTTQMSMTTTKVTVKETGKAVEVTQT--AIITNVNGT-LQEASRQLAMSLADKVEIDYI---AEL--NKSKQ-------- 131 (270) T ss_pred chhhcccchheeeeehhhCcceecHH--HHhhhccch-HHHHHHHHHHHHHHHHHHHHH---HHh--ccccc-------- Confidence 99999999999999888766666554 444444434 444444555666666554332 000 01100 Q ss_pred ccccccccccCHHHHHHHHHHHHHHHHHHhCCceeccCCcEEEecHHHHHhcccCC--CCCccHHHHHHH-h---CCccE Q lcl|Aclame:pro 190 TATTPWSGSPAVEAVVNEVVTLFQVLQTQSQGIITQEAVLHMGLPPTAMSDLSKTN--QYGLSAAAKLKE-I---FPKLE 263 (336) Q Consensus 190 ~~~t~w~~~~T~~eI~~Di~~l~~~l~~~t~g~v~~~~p~tL~Lp~~~~~~Ls~~~--~~~~Tvl~~l~~-n---~pnl~ 263 (336) + .+ ...+.+ +|++++..+ |. ....+..|++.|..+..|.+-. .+.......+.. . |-+++ T Consensus 132 ~-~~---~~~t~~----~~~dA~~~l-----gd-~~~~~~~i~vhs~~~~~Lrk~~~~~~~~~~~~~~~~G~ig~~~G~~ 197 (270) T protein:vir:95 132 T-AT---VSADAT----GILDAIEVF-----NS-ENDEDYVLYVNPKDYNKLVKSLFKVGGNVQDRAISKGDLVEIVGVS 197 (270) T ss_pred c-cc---cccCHH----HHHHHHHHh-----cc-ccCCCcEEEEcHHHHHHHHhhhcccccccccchhcccccceeccee Confidence 0 00 112333 344443333 11 2345789999999998886421 010000011111 1 22344 Q ss_pred E-EEcccccCCCCceEEEEEEeeCCCceEEEEeCchhhcccceecCCceE--EeeecceeeeEEecccceeeeccC Q lcl|Aclame:pro 264 F-VTIPEYDTASGRLVQLWAPRVEGKDTATCGFTEKMRAHSIERYSSYFR--QKKSAGTWGAVIFRPFAVAQMIGV 336 (336) Q Consensus 264 i-~~~pel~~a~G~~~~~~~~~~~~~~~~~~~~p~~~~~l~~~~~~~~~~--v~~~~rt~Gv~ir~P~ai~~~~GI 336 (336) + ++-.-. .-...+++- +.-..+....+.+. ... |....+ .=..-+.+||-+..|..++.++== T Consensus 198 Viv~s~~~---~~~~~~l~~-----~gAi~~~~~~~~~v-Etd-Rd~~~~~d~i~~~~~y~v~~~~~skvv~~t~~ 263 (270) T protein:vir:95 198 DIVKSKRV---SENTAFLQR-----YGAMEIVNKKKPEA-YTD-FDILKRTHLLSTNYHYSVNLKDETGVVKVTFK 263 (270) T ss_pred EEEeCCCC---CceeEEEEe-----ccceeeeecCCcee-eec-cchhhcccEEEeeeEEEEEEEccceEEEEEec Confidence 2 321110 112233331 22222222222111 111 111111 111225688888888888866322 No 103 >protein:vir:98480 Length: 348 # NCBI annotation: ORFp38 # Family: family:all:1083 # MgeID: mge:1589 # MgeName: VWB # Cross-refs: genbank:acc:NP_958280;genbank:gi:41057254;uniprot:Q38595;genbank:GeneID:2732864 Probab=90.30 E-value=0.022 Score=29.72 Aligned_cols=275 Identities=9% Similarity=-0.017 Sum_probs=101.9 Q ss_pred hhhhhhcCccccCCcchHHHHHHHhhCceeeeeeccccchhhhcccccCCCcceeeEEEeeeecccceEEeec----ccC Q lcl|Aclame:pro 33 MDAADLSPHLSSTGSSGIPNYLTTYVDPSVIDILVAPMKAAELVGESKKGDWTTLVAAFITAEPTTTVATYGD----YSS 108 (336) Q Consensus 33 ~da~d~~~~l~t~~~~~i~~~l~~~idp~v~~~~~~~~~~~~l~~v~t~g~w~~~t~~~~v~e~~G~a~~ygd----~~D 108 (336) |.-. .. .+--=|..|+.+|.........+.+-.+.+||... +..+.|..+.........+. ... T Consensus 1 M~~~-~~------~d~~~~~~l~~~i~~~~~~~~~~~~l~~~~fp~~~-----~~~~~~~~~~~~~~~~~~a~~~~~~~~ 68 (348) T protein:vir:98 1 MSWT-LD------TEFIEPTQLTGLIREALRDLQVNRFRLARWLPNVD-----VDDITFEFLRGGGGLAETASYRSWDTE 68 (348) T ss_pred Ccch-hh------hhccCHHHHHHHHHHHhhccCcchhhHHhcCCCcc-----ccceEEEEEeccCCceeeeeeecCCCc Confidence 1100 00 00001455666662111122233355678888643 12234443332222111222 122 Q ss_pred Cceeee-eeeeeeeeEEEEEEEEEeCHHHHHHHHHhCC--------CHHHHHHHHHHHHHHHhhc------cEEEeeccc Q lcl|Aclame:pro 109 DGDSGT-NINYPQRQSYFFQTWTRWGERELEMAGAGRV--------DLASELNYSSALGLAKFLN------GSYLFGVAG 173 (336) Q Consensus 109 iP~vd~-~~~~~~~~v~~~~~~~~y~~~El~~A~~~g~--------~l~~~K~~aAr~a~e~~~n------~i~~~Gd~~ 173 (336) -|+.+- ..+..+.++-.++..+..+..|+...+.... +...+...++++..|.... ++.+-|.. T Consensus 69 ~~~~~r~g~~~~~~~~~~i~~~~~i~~~d~~~~~~~~~~~~~~~i~~d~~~l~~~i~~r~E~m~~qal~~Gki~~~g~~- 147 (348) T protein:vir:98 69 SKIGRREGLAKVMGELPPISEKIPLNEYDRLRLRKLSRDEALPFIARDAQRLARNIGARFEVARGSALVNATVPVTELQ- 147 (348) T ss_pred cceeecccceeeeeeccccccccccCHHHHHHhcCChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCeEEEecCc- Confidence 233332 2233334444455666777777665432211 0111222333443333322 34444433 Q ss_pred cce-EEEEecCCCCcccccccccccccCHHHHHHHHHHHHHHHHHHhCCceeccCCcEEEecHHHHHhcccC-------- Q lcl|Aclame:pro 174 LEN-YGLINDPSLSAPITATTPWSGSPAVEAVVNEVVTLFQVLQTQSQGIITQEAVLHMGLPPTAMSDLSKT-------- 244 (336) Q Consensus 174 ~g~-~GllN~Pnl~~~~~~~t~w~~~~T~~eI~~Di~~l~~~l~~~t~g~v~~~~p~tL~Lp~~~~~~Ls~~-------- 244 (336) +.+ ||. |+-. ..++++.|++..+++ ++.||.+....+...++ . .|++++|.+..+..|.+- T Consensus 148 ~~vDyg~---~~~~-~~t~~~~Ws~~~~ad-p~~di~~~~~~~~~~~G-~----~p~~~vm~~~~~~~l~~~~~i~~~~~ 217 (348) T protein:vir:98 148 QTVDFGR---IGSH-SVVAAVLWSVHATAT-PISDLESWVATYEDTNG-Q----SPGVILMPKAAVSHMRQCEEVIRQVF 217 (348) T ss_pred eEEcccc---Cccc-ccccccccCCCCCCC-HHHHHHHHHHHHHHccC-C----cceEEEeCHHHHHHHhcCHHHHHHHh Confidence 222 333 2222 235667887655655 88999999888876554 2 488999999999988431 Q ss_pred --CCC---C-cc--HHHHHHHhCCccEEEEcccccCCCCceEEEEEEeeCCCceEEEEeCc------------hhhcccc Q lcl|Aclame:pro 245 --NQY---G-LS--AAAKLKEIFPKLEFVTIPEYDTASGRLVQLWAPRVEGKDTATCGFTE------------KMRAHSI 304 (336) Q Consensus 245 --~~~---~-~T--vl~~l~~n~pnl~i~~~pel~~a~G~~~~~~~~~~~~~~~~~~~~p~------------~~~~l~~ 304 (336) +.. . ++ .+..++..+--..|+.--+.-...|....++-+ +.. +.+|. -.+.+++ T Consensus 218 ~~~~~~~~~~~~~~~~~~~~~~~g~~~i~~~d~~~~~~g~~~~~~p~-----~~i-~l~p~~~~~~~~~~~~~G~t~~G~ 291 (348) T protein:vir:98 218 PLAPSGTAPMVSVEQLNTVLSSMGLPPIEVYDAKVAVDGVSTRITPA-----NAI-ALLPEPGATDAAQPTELGATLLGT 291 (348) T ss_pred ccCccccccccCHHHHHHHHHhhCCeEEEEeeeEEEcCCceeceecC-----CeE-EEEecCCcccccccccccceeccc Confidence 100 0 11 122222222212222211111112222222110 111 11111 0011110 Q ss_pred --eecCCceEEeeec------------ceeeeEE---ecccce-eeeccC Q lcl|Aclame:pro 305 --ERYSSYFRQKKSA------------GTWGAVI---FRPFAV-AQMIGV 336 (336) Q Consensus 305 --~~~~~~~~v~~~~------------rt~Gv~i---r~P~ai-~~~~GI 336 (336) +...+.....+.. .-.|..+ -+|+-+ .+.+++ T Consensus 292 ~~e~~~~~~~~~~~~~~~i~~~~~~~~dP~~~~~~~~s~~lPv~~~~~~~ 341 (348) T protein:vir:98 292 TAESLEDDYALAPGEQPGIVAATWKTKDPVRLWTHAAAVGIPVLREPNLT 341 (348) T ss_pred chhhhccccccceeccCceeeeeeeecCCcEEEEEEeeeeeccccCCCcE Confidence 0011111111100 0001111 112211 112222 No 104 >protein:vir:78640 Length: 352 # NCBI annotation: phage capsid # Family: family:all:658 # MgeID: mge:1855 # MgeName: tp310-2 # Cross-refs: genbank:acc:YP_001429943;genbank:gi:156603997;genbank:GeneID:5525386 Probab=90.12 E-value=0.023 Score=29.61 Aligned_cols=283 Identities=9% Similarity=-0.004 Sum_probs=130.0 Q ss_pred Cc-----------------------------hHHHHHHHhhcceeccchhhhhhhhhhhhhhhhhhhcCccc--cCCcc- Q lcl|Aclame:pro 1 MR-----------------------------DAQRIQNLARAGVILPRSVKNVSTPLAEYAMDAADLSPHLS--STGSS- 48 (336) Q Consensus 1 m~-----------------------------~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~da~d~~~~l~--t~~~~- 48 (336) ++ .++.+.+..|.+. .........+........++ +.+++ T Consensus 21 l~~~~d~~e~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~--------~~~~~~~~~~~~~~~~~al~~~~~~~gG 92 (352) T protein:vir:78 21 VERQVQDIEEKEKAKVKDKGEAYQSLNDNEKLVKAKAEFYRHAI--------LPNEFEKPSMEAQRLLHALPTGNDSGGD 92 (352) T ss_pred HHHHHHHHHHHHHHHhhhccccccccchhhhHHHHHHHHHHHHh--------hhhHHHHHHhhHHHHHHHhccCCCCCCc Confidence 00 0011111111110 00011111111111111122 22222 Q ss_pred -hHHHHHHHhhCceeeeeeccccchhhhcccccCCCcceeeEEEeeeecccceEEeecccCCceeeeeeeeeeeeEEEEE Q lcl|Aclame:pro 49 -GIPNYLTTYVDPSVIDILVAPMKAAELVGESKKGDWTTLVAAFITAEPTTTVATYGDYSSDGDSGTNINYPQRQSYFFQ 127 (336) Q Consensus 49 -~i~~~l~~~idp~v~~~~~~~~~~~~l~~v~t~g~w~~~t~~~~v~e~~G~a~~ygd~~DiP~vd~~~~~~~~~v~~~~ 127 (336) .||..+.+ +|++.+......+.+..+.+.+... ...+....+.+.+.+....+|-.+.......-..+.++ T Consensus 93 ~lIP~~~~~----~Ii~~l~~~s~l~~~~~v~~~~~~~----~p~~~~~~~~a~~v~E~~~~~~~~~~f~~v~~~~~k~~ 164 (352) T protein:vir:78 93 KLLPKTLSK----EIVSEPFAKNQLREKARLTNIKGLE----IPRVSYTLDDDDFITDVETAKELKLKGDTVKFTTNKFK 164 (352) T ss_pred eeccHhHHH----HHHHHHHhhcchhhheeeEecCCce----EEEEecCCCcccccccccccccccccceeeeecceeEE Confidence 47766553 3344334444445556555555432 22333344677888888888888888888888889998 Q ss_pred EEEEeCHHHHHHHHHhCCCHHHHHHHHHHHHHHHhhccE-EEeeccccceEEEEecCCCCcccccccccccccCHHHHHH Q lcl|Aclame:pro 128 TWTRWGERELEMAGAGRVDLASELNYSSALGLAKFLNGS-YLFGVAGLENYGLINDPSLSAPITATTPWSGSPAVEAVVN 206 (336) Q Consensus 128 ~~~~y~~~El~~A~~~g~~l~~~K~~aAr~a~e~~~n~i-~~~Gd~~~g~~GllN~Pnl~~~~~~~t~w~~~~T~~eI~~ 206 (336) ..+.+|.+=|.-+ ..+|.+--....++++.+.++.. +..|++.....|.++++.+... + ....++ T Consensus 165 ~~i~is~ell~Ds---~~~l~~~i~~~la~~~~~~e~~~~~~~g~g~~~~~g~l~~~~~~~~-t----------~~~~~d 230 (352) T protein:vir:78 165 VFAAISDTVIHGS---DVDLVNWVENALQSGLAAKERKDALAVSPKSGLEHMSFYNGSVKEV-E----------GANMYD 230 (352) T ss_pred eechhhHHHHhhh---hHHHHHHHHHHHHHHHHHHHHHhhhhcCCCCcccccceeccccccc-c----------ccchHH Confidence 8888887644433 35677777776666666665554 4456655556777777665431 1 111356 Q ss_pred HHHHHHHHHHHHhCCceeccCCcEEEecHHHHHh-cccCCCCCccHHHHHHHhCCccEEEEcccccCCC------CceEE Q lcl|Aclame:pro 207 EVVTLFQVLQTQSQGIITQEAVLHMGLPPTAMSD-LSKTNQYGLSAAAKLKEIFPKLEFVTIPEYDTAS------GRLVQ 279 (336) Q Consensus 207 Di~~l~~~l~~~t~g~v~~~~p~tL~Lp~~~~~~-Ls~~~~~~~Tvl~~l~~n~pnl~i~~~pel~~a~------G~~~~ 279 (336) ||.+++..|...-. + .-+.+|-+..+.. +...++.|..++. .-|+ ++-..|-..+++ |.-.+ T Consensus 231 ~i~~~~~~l~~~~~-----~-~a~~~mn~~t~~~l~~~~~~~~~~~~~----~~~~-~llG~PV~~~~~~~~~~~Gdf~~ 299 (352) T protein:vir:78 231 AIINALADLHEDYR-----D-NATIYMRYADYVKIISVLSNGTTNFFD----TPAE-KVFGKPVVFTDAAVKPIVGDFNY 299 (352) T ss_pred HHHHHHhccChhhh-----c-CCEEEEehHHHHHHHHHHhccCCcccc----cCCc-cccccceEEecCCCceeEeehhh Confidence 66666665533211 1 1246665554433 3433333433331 1121 121222222221 11111 Q ss_pred EEEEeeCCCceEEEEeCchhhcccceecCCceEEeeecceeeeEEecccceeeeccC Q lcl|Aclame:pro 280 LWAPRVEGKDTATCGFTEKMRAHSIERYSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) Q Consensus 280 ~~~~~~~~~~~~~~~~p~~~~~l~~~~~~~~~~v~~~~rt~Gv~ir~P~ai~~~~GI 336 (336) .+.. +++ +-+..+- +........-+..|..|.+ ++|-||+.+.-= T Consensus 300 ~~~~-~~~---------~~~~~~~-~~~~g~~~f~~~~r~Dg~~-~~~eA~~~l~~~ 344 (352) T protein:vir:78 300 FGIN-YDG---------TTYDTDK-DVKKGEYLFVLTAWYDQQR-TLDSAFRIAKAK 344 (352) T ss_pred hhhh-hhh---------heeeeec-cccCCeeEEEEEeeeCcee-echhheEEEEee Confidence 1110 000 0011110 1112235556677777774 559998665433 No 105 >protein:vir:4511 Length: 409 # NCBI annotation: capsid # Family: family:all:21 # MgeID: mge:97 # MgeName: V # Cross-refs: genbank:acc:NP_599037;genbank:gi:19548995;genbank:GeneID:935211 Probab=89.51 E-value=0.026 Score=29.28 Aligned_cols=307 Identities=10% Similarity=0.100 Sum_probs=129.8 Q ss_pred CchHHHHH-------HHh---h----------cceeccchh-------------------hhhh-hhhhhhhhhhhhhcC Q lcl|Aclame:pro 1 MRDAQRIQ-------NLA---R----------AGVILPRSV-------------------KNVS-TPLAEYAMDAADLSP 40 (336) Q Consensus 1 m~~~~~~~-------~l~---~----------~g~~~~~~~-------------------~~~~-~~~~~~~~da~d~~~ 40 (336) +.+++++. ++. + ....-+... ..+. .+.. ++....+.+ T Consensus 41 ~~e~~~l~~~i~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~l~~~~~~~~~~e~~--~~~~~~a~~ 118 (409) T protein:vir:45 41 KSELEALDERIAREEELRRQDQAYIESNEEEQRQNLDPENNSQQDEKRAQVFDKWMRHGASELTSEERK--ALRELRAQG 118 (409) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhcccCCCCCcchhhHHHHHHHHHHHHhhhhhccHHHHH--HHHHHhhcc Confidence 11111100 000 0 000000000 0000 0011 111111111 Q ss_pred ccccCCcch--HHHHHHHhhCceeeeeeccccchhhhcccccCCCcceeeEEEeeeeccc-ceEEeecccCCceeeeeee Q lcl|Aclame:pro 41 HLSSTGSSG--IPNYLTTYVDPSVIDILVAPMKAAELVGESKKGDWTTLVAAFITAEPTT-TVATYGDYSSDGDSGTNIN 117 (336) Q Consensus 41 ~l~t~~~~~--i~~~l~~~idp~v~~~~~~~~~~~~l~~v~t~g~w~~~t~~~~v~e~~G-~a~~ygd~~DiP~vd~~~~ 117 (336) .++.+.+| ||..+.+ +|++.+........+..+.+... ...+.+...+..+ .+...+.....|-.+.... T Consensus 119 -~~~~~~gg~liP~~~~~----~ii~~~~~~~~l~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~v~E~~~~~~~~~~f~ 191 (409) T protein:vir:45 119 -VAQDEKGGYTVPETFLA----KVVEKMKSYGGIASVAQILTTSD--GRTMEWATADGTSEVGVLLGENEEAGEEDTDFG 191 (409) T ss_pred -CccCcCCceeccHhHHH----HHHHHHHhhhhhhhhceeeecCC--CceEEEEeeccCccccccccccccccccccccc Confidence 12333332 5665543 34444433333334433322221 1234444444443 3456677777777777655 Q ss_pred eeeeeEEEEE-EEEEeCHHHHHHHHHhCCCHHHHHHHHHHHHHHHhhccEEEeeccc---cceEEEEecCCCCccccccc Q lcl|Aclame:pro 118 YPQRQSYFFQ-TWTRWGERELEMAGAGRVDLASELNYSSALGLAKFLNGSYLFGVAG---LENYGLINDPSLSAPITATT 193 (336) Q Consensus 118 ~~~~~v~~~~-~~~~y~~~El~~A~~~g~~l~~~K~~aAr~a~e~~~n~i~~~Gd~~---~g~~GllN~Pnl~~~~~~~t 193 (336) ...-..+.+. ..+.+|.+=+.- ...+|.+.-......++.+.+++-.++|+.. .+..|+++.+....... . T Consensus 192 ~~~l~~~k~~~~~i~is~ell~d---s~~~l~~~i~~~la~a~~~~~~~a~l~G~G~~~~~~p~Gil~~~~~~~~~~-~- 266 (409) T protein:vir:45 192 MGSLGALKMTSKIIRVSNELLQD---SAIDMEAYLARRIAERIGRGEARYLIQGTGAGTPKQPKGLAASVTGTTQTA-A- 266 (409) T ss_pred eeeeeeeeeeeeehhhhHHHHhc---cHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCccccceeeeccccccccc-c- Confidence 5444444433 234566544433 3457888888888888889999999999964 46789999654322111 1 Q ss_pred ccccccCHHHHHHHHHHHHHHHHHHhCCceeccCCcEEEecHHHHHhccc-CCCCCccHHH-HHHHh----CCccEEEEc Q lcl|Aclame:pro 194 PWSGSPAVEAVVNEVVTLFQVLQTQSQGIITQEAVLHMGLPPTAMSDLSK-TNQYGLSAAA-KLKEI----FPKLEFVTI 267 (336) Q Consensus 194 ~w~~~~T~~eI~~Di~~l~~~l~~~t~g~v~~~~p~tL~Lp~~~~~~Ls~-~~~~~~Tvl~-~l~~n----~pnl~i~~~ 267 (336) .+..| ++||.+++..|...-. ....-.+++.+..+..|.+ .+..|.-++. -+... .-+..++.. T Consensus 267 --~~~~~----~d~i~~l~~~l~~~~~----~~a~~~~~~n~~~~~~l~~lkd~~G~~i~~~~~~~~~~~~l~G~PV~~~ 336 (409) T protein:vir:45 267 --ANAVK----WQEILALKHSIDPAYR----RGPKFRLAFNDNTLKLISEMEDGQGRPLWLPDIVGVAPASVLNVPYVID 336 (409) T ss_pred --ccccc----hHHHHHHHHhhhhhhc----cCCeEEEEECHHHHHHHHHhhcCCCceeeccCcCCCCCceecceeeEEe Confidence 11223 4566666666543221 1122246778877777643 2333332221 00011 111222222 Q ss_pred ccccC-CCCceEEEEEEeeCCCceEEEEe--Cchhhcccc-eecCCceEEeeecceeeeEEecccceeeeccC Q lcl|Aclame:pro 268 PEYDT-ASGRLVQLWAPRVEGKDTATCGF--TEKMRAHSI-ERYSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) Q Consensus 268 pel~~-a~G~~~~~~~~~~~~~~~~~~~~--p~~~~~l~~-~~~~~~~~v~~~~rt~Gv~ir~P~ai~~~~GI 336 (336) ..+.+ +.|....+|.+ .. +.. +.. ++-+..+.. ..+.....+-+..|.+|. +..|-||+.+.+= T Consensus 337 ~~~p~~~~~~~~i~~Gd-~~--~~~-i~~~~~~~~~~~~d~~~~~~~~~~~~~~r~d~~-~~~~~A~~~l~~k 404 (409) T protein:vir:45 337 QEIDDIGAGKKFMFCGD-FD--RFI-IRRVRYMILKRLVERYAEYDQTGFLAFHRFDCI-LEDTSAIKALVGK 404 (409) T ss_pred cCcCCccCCccEEEEee-hh--hhh-eeeccceEEEEeecccccCCcEEEEEEEEeccE-eechhheEEEEec Confidence 22221 12222222221 10 000 000 000111000 011223445566676554 8889999887765 No 106 >protein:vir:102873 Length: 392 # NCBI annotation: major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1492 # MgeName: Cherry # Cross-refs: genbank:acc:YP_338137;genbank:gi:77020198;genbank:GeneID:3703782 Probab=88.38 E-value=0.033 Score=28.73 Aligned_cols=295 Identities=7% Similarity=-0.056 Sum_probs=131.9 Q ss_pred CchHHH----------HHHHhh----cceecc----ch-hh------------hhhhhhhhhhhhhhhhcC--ccccCCc Q lcl|Aclame:pro 1 MRDAQR----------IQNLAR----AGVILP----RS-VK------------NVSTPLAEYAMDAADLSP--HLSSTGS 47 (336) Q Consensus 1 m~~~~~----------~~~l~~----~g~~~~----~~-~~------------~~~~~~~~~~~da~d~~~--~l~t~~~ 47 (336) +.++++ +.+.+. .+-... .+ .. .+....+. .......+. ...|.++ T Consensus 35 ~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~-~~~~~~~~~~~~~~t~~~ 113 (392) T protein:vir:10 35 MEEVRSLQKKIDLQRSLDEAETEERNNGREVETRNVDGEMEYRDVFMKALRNKPLNAEERE-FLEDDLEQRAMSGLTGED 113 (392) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhccccccccCccchHHHHHHHHHHHhcccccHHHHH-HHhhhhhhhhccccccCC Confidence 111100 000000 000000 00 00 00000000 000000010 1123334 Q ss_pred c--hHHHHHHHhhCceeeeeeccccchhhhcccccCCCcceeeEEEeeeecccceEEeecccCCceee-eeeeeeeeeEE Q lcl|Aclame:pro 48 S--GIPNYLTTYVDPSVIDILVAPMKAAELVGESKKGDWTTLVAAFITAEPTTTVATYGDYSSDGDSG-TNINYPQRQSY 124 (336) Q Consensus 48 ~--~i~~~l~~~idp~v~~~~~~~~~~~~l~~v~t~g~w~~~t~~~~v~e~~G~a~~ygd~~DiP~vd-~~~~~~~~~v~ 124 (336) + .+|..+. ++|++.+...-....+.++..... ......+......+.+...+.+...|-.+ ...+...-..+ T Consensus 114 gg~~vP~~~~----~~ii~~~~~~s~l~~~~~~~~~~~-~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~v~l~~~ 188 (392) T protein:vir:10 114 GGLVIPQDIQ----TQINELARSFDALEQYVTVEPVRT-RSGSRVLEKNSDMIPFAEITEMGEIPETDNPKFSNVQYAVK 188 (392) T ss_pred CceecchhHH----HHHHHHHHhhhhhhhhceeeeccC-CceeEEEEeecCCccceeecccccccccccccceeEEeeee Confidence 4 3666554 345555555555555555533221 11123344444455677778877777554 57778888888 Q ss_pred EEEEEEEeCHHHHHHHHHhCCCHHHHHHHHHHHHHHHhhccEEEeeccccceEEEEecCCCCcccccccccccccCHHHH Q lcl|Aclame:pro 125 FFQTWTRWGERELEMAGAGRVDLASELNYSSALGLAKFLNGSYLFGVAGLENYGLINDPSLSAPITATTPWSGSPAVEAV 204 (336) Q Consensus 125 ~~~~~~~y~~~El~~A~~~g~~l~~~K~~aAr~a~e~~~n~i~~~Gd~~~g~~GllN~Pnl~~~~~~~t~w~~~~T~~eI 204 (336) .++..+.+|.+=++.+ ..+|.+.-....+.++.+.++.-.+.|++.... . ...+.+.| T Consensus 189 k~~~~~~iS~ell~ds---~~~l~~~i~~~l~~~i~~~~d~~~~~g~g~~~~---------------~----~~~~~d~i 246 (392) T protein:vir:10 189 DRAGILPLSRSLLQDS---DQNILKYVTKWLGKKSKVTRNVLILGVIEKLTK---------------Q----AIKSLDDI 246 (392) T ss_pred eEEEeehhhHHHHhhh---HHHHHHHHHHHHHHHHHHHHHHHHhhccccccc---------------c----CccCHHHH Confidence 8998999988766543 456888888888888888888777776653211 0 01233434 Q ss_pred HHHHHHHHHHHHHHhCCceeccCCcEEEecHHHHHhccc-CCCCCccHHHH-HHHhCC----ccEEE---Ec--cc-ccC Q lcl|Aclame:pro 205 VNEVVTLFQVLQTQSQGIITQEAVLHMGLPPTAMSDLSK-TNQYGLSAAAK-LKEIFP----KLEFV---TI--PE-YDT 272 (336) Q Consensus 205 ~~Di~~l~~~l~~~t~g~v~~~~p~tL~Lp~~~~~~Ls~-~~~~~~Tvl~~-l~~n~p----nl~i~---~~--pe-l~~ 272 (336) .+-++..+... . . ..-+++|.++.+..|.+ .+..|.-++.- +....+ ++.++ .. |. ... T Consensus 247 ~~~~~~~l~~~------~-~--~~a~~vm~~~~~~~L~~lkd~~G~~l~~~~~~~~~~~tllG~~~v~~~~~~~~~~~~~ 317 (392) T protein:vir:10 247 KDVLNVKLDPA------I-S--PNAILLTNQDGFNYLDKLKDKDGKYILQSDPTQKNKKLFAGTNPVVVVSNRFLKSKGT 317 (392) T ss_pred HHHHHHhhhhh------h-c--cCCEEEEcHHHHHHHHHhhccCCCeEeecCccCCccccccCcccEEEecccccCCCcc Confidence 33333222111 1 1 12368999999888854 23333322210 111111 11211 11 11 122 Q ss_pred CCCceEEEEEEeeCCCceEEE--EeCchhhcccc---eecCCceEEeeecceeeeEEecccceeeeccC Q lcl|Aclame:pro 273 ASGRLVQLWAPRVEGKDTATC--GFTEKMRAHSI---ERYSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) Q Consensus 273 a~G~~~~~~~~~~~~~~~~~~--~~p~~~~~l~~---~~~~~~~~v~~~~rt~Gv~ir~P~ai~~~~GI 336 (336) +.+....++.+- .+...+ .-.+.+...+. ....-...+-|..|.+| .+++|.+|+.+..- T Consensus 318 ~~~~~~~~~gdf---s~~~~i~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~-~v~~~~a~~~l~~~ 382 (392) T protein:vir:10 318 TAKKAPLIIGDL---KEAIVLFKREDMELASTDVGGKAFTRNTLDLRAIQRDDV-QMWDNEAAVYGEID 382 (392) T ss_pred cCCceEEEEEeh---hceEEEEeecceEEEEeccccchhhcCceEEEEEEeecc-EEecccceEEEEec Confidence 234444444321 110000 01111111111 01112345667777776 67779999998765 No 107 >protein:vir:107593 Length: 392 # NCBI annotation: major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1491 # MgeName: Gamma # Cross-refs: genbank:acc:YP_338188;genbank:gi:77020144;genbank:GeneID:3703724 Probab=88.38 E-value=0.033 Score=28.73 Aligned_cols=295 Identities=7% Similarity=-0.056 Sum_probs=131.9 Q ss_pred CchHHH----------HHHHhh----cceecc----ch-hh------------hhhhhhhhhhhhhhhhcC--ccccCCc Q lcl|Aclame:pro 1 MRDAQR----------IQNLAR----AGVILP----RS-VK------------NVSTPLAEYAMDAADLSP--HLSSTGS 47 (336) Q Consensus 1 m~~~~~----------~~~l~~----~g~~~~----~~-~~------------~~~~~~~~~~~da~d~~~--~l~t~~~ 47 (336) +.++++ +.+.+. .+-... .+ .. .+....+. .......+. ...|.++ T Consensus 35 ~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~-~~~~~~~~~~~~~~t~~~ 113 (392) T protein:vir:10 35 MEEVRSLQKKIDLQRSLDEAETEERNNGREVETRNVDGEMEYRDVFMKALRNKPLNAEERE-FLEDDLEQRAMSGLTGED 113 (392) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhccccccccCccchHHHHHHHHHHHhcccccHHHHH-HHhhhhhhhhccccccCC Confidence 111100 000000 000000 00 00 00000000 000000010 1123334 Q ss_pred c--hHHHHHHHhhCceeeeeeccccchhhhcccccCCCcceeeEEEeeeecccceEEeecccCCceee-eeeeeeeeeEE Q lcl|Aclame:pro 48 S--GIPNYLTTYVDPSVIDILVAPMKAAELVGESKKGDWTTLVAAFITAEPTTTVATYGDYSSDGDSG-TNINYPQRQSY 124 (336) Q Consensus 48 ~--~i~~~l~~~idp~v~~~~~~~~~~~~l~~v~t~g~w~~~t~~~~v~e~~G~a~~ygd~~DiP~vd-~~~~~~~~~v~ 124 (336) + .+|..+. ++|++.+...-....+.++..... ......+......+.+...+.+...|-.+ ...+...-..+ T Consensus 114 gg~~vP~~~~----~~ii~~~~~~s~l~~~~~~~~~~~-~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~v~l~~~ 188 (392) T protein:vir:10 114 GGLVIPQDIQ----TQINELARSFDALEQYVTVEPVRT-RSGSRVLEKNSDMIPFAEITEMGEIPETDNPKFSNVQYAVK 188 (392) T ss_pred CceecchhHH----HHHHHHHHhhhhhhhhceeeeccC-CceeEEEEeecCCccceeecccccccccccccceeEEeeee Confidence 4 3666554 345555555555555555533221 11123344444455677778877777554 57778888888 Q ss_pred EEEEEEEeCHHHHHHHHHhCCCHHHHHHHHHHHHHHHhhccEEEeeccccceEEEEecCCCCcccccccccccccCHHHH Q lcl|Aclame:pro 125 FFQTWTRWGERELEMAGAGRVDLASELNYSSALGLAKFLNGSYLFGVAGLENYGLINDPSLSAPITATTPWSGSPAVEAV 204 (336) Q Consensus 125 ~~~~~~~y~~~El~~A~~~g~~l~~~K~~aAr~a~e~~~n~i~~~Gd~~~g~~GllN~Pnl~~~~~~~t~w~~~~T~~eI 204 (336) .++..+.+|.+=++.+ ..+|.+.-....+.++.+.++.-.+.|++.... . ...+.+.| T Consensus 189 k~~~~~~iS~ell~ds---~~~l~~~i~~~l~~~i~~~~d~~~~~g~g~~~~---------------~----~~~~~d~i 246 (392) T protein:vir:10 189 DRAGILPLSRSLLQDS---DQNILKYVTKWLGKKSKVTRNVLILGVIEKLTK---------------Q----AIKSLDDI 246 (392) T ss_pred eEEEeehhhHHHHhhh---HHHHHHHHHHHHHHHHHHHHHHHHhhccccccc---------------c----CccCHHHH Confidence 8998999988766543 456888888888888888888777776653211 0 01233434 Q ss_pred HHHHHHHHHHHHHHhCCceeccCCcEEEecHHHHHhccc-CCCCCccHHHH-HHHhCC----ccEEE---Ec--cc-ccC Q lcl|Aclame:pro 205 VNEVVTLFQVLQTQSQGIITQEAVLHMGLPPTAMSDLSK-TNQYGLSAAAK-LKEIFP----KLEFV---TI--PE-YDT 272 (336) Q Consensus 205 ~~Di~~l~~~l~~~t~g~v~~~~p~tL~Lp~~~~~~Ls~-~~~~~~Tvl~~-l~~n~p----nl~i~---~~--pe-l~~ 272 (336) .+-++..+... . . ..-+++|.++.+..|.+ .+..|.-++.- +....+ ++.++ .. |. ... T Consensus 247 ~~~~~~~l~~~------~-~--~~a~~vm~~~~~~~L~~lkd~~G~~l~~~~~~~~~~~tllG~~~v~~~~~~~~~~~~~ 317 (392) T protein:vir:10 247 KDVLNVKLDPA------I-S--PNAILLTNQDGFNYLDKLKDKDGKYILQSDPTQKNKKLFAGTNPVVVVSNRFLKSKGT 317 (392) T ss_pred HHHHHHhhhhh------h-c--cCCEEEEcHHHHHHHHHhhccCCCeEeecCccCCccccccCcccEEEecccccCCCcc Confidence 33333222111 1 1 12368999999888854 23333322210 111111 11211 11 11 122 Q ss_pred CCCceEEEEEEeeCCCceEEE--EeCchhhcccc---eecCCceEEeeecceeeeEEecccceeeeccC Q lcl|Aclame:pro 273 ASGRLVQLWAPRVEGKDTATC--GFTEKMRAHSI---ERYSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) Q Consensus 273 a~G~~~~~~~~~~~~~~~~~~--~~p~~~~~l~~---~~~~~~~~v~~~~rt~Gv~ir~P~ai~~~~GI 336 (336) +.+....++.+- .+...+ .-.+.+...+. ....-...+-|..|.+| .+++|.+|+.+..- T Consensus 318 ~~~~~~~~~gdf---s~~~~i~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~-~v~~~~a~~~l~~~ 382 (392) T protein:vir:10 318 TAKKAPLIIGDL---KEAIVLFKREDMELASTDVGGKAFTRNTLDLRAIQRDDV-QMWDNEAAVYGEID 382 (392) T ss_pred cCCceEEEEEeh---hceEEEEeecceEEEEeccccchhhcCceEEEEEEeecc-EEecccceEEEEec Confidence 234444444321 110000 01111111111 01112345667777776 67779999998765 No 108 >protein:vir:105004 Length: 392 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:1490 # MgeName: W Beta # Cross-refs: genbank:acc:YP_459969;genbank:gi:85701384;genbank:GeneID:3882145 Probab=88.38 E-value=0.033 Score=28.73 Aligned_cols=295 Identities=7% Similarity=-0.056 Sum_probs=131.9 Q ss_pred CchHHH----------HHHHhh----cceecc----ch-hh------------hhhhhhhhhhhhhhhhcC--ccccCCc Q lcl|Aclame:pro 1 MRDAQR----------IQNLAR----AGVILP----RS-VK------------NVSTPLAEYAMDAADLSP--HLSSTGS 47 (336) Q Consensus 1 m~~~~~----------~~~l~~----~g~~~~----~~-~~------------~~~~~~~~~~~da~d~~~--~l~t~~~ 47 (336) +.++++ +.+.+. .+-... .+ .. .+....+. .......+. ...|.++ T Consensus 35 ~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~-~~~~~~~~~~~~~~t~~~ 113 (392) T protein:vir:10 35 MEEVRSLQKKIDLQRSLDEAETEERNNGREVETRNVDGEMEYRDVFMKALRNKPLNAEERE-FLEDDLEQRAMSGLTGED 113 (392) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhccccccccCccchHHHHHHHHHHHhcccccHHHHH-HHhhhhhhhhccccccCC Confidence 111100 000000 000000 00 00 00000000 000000010 1123334 Q ss_pred c--hHHHHHHHhhCceeeeeeccccchhhhcccccCCCcceeeEEEeeeecccceEEeecccCCceee-eeeeeeeeeEE Q lcl|Aclame:pro 48 S--GIPNYLTTYVDPSVIDILVAPMKAAELVGESKKGDWTTLVAAFITAEPTTTVATYGDYSSDGDSG-TNINYPQRQSY 124 (336) Q Consensus 48 ~--~i~~~l~~~idp~v~~~~~~~~~~~~l~~v~t~g~w~~~t~~~~v~e~~G~a~~ygd~~DiP~vd-~~~~~~~~~v~ 124 (336) + .+|..+. ++|++.+...-....+.++..... ......+......+.+...+.+...|-.+ ...+...-..+ T Consensus 114 gg~~vP~~~~----~~ii~~~~~~s~l~~~~~~~~~~~-~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~v~l~~~ 188 (392) T protein:vir:10 114 GGLVIPQDIQ----TQINELARSFDALEQYVTVEPVRT-RSGSRVLEKNSDMIPFAEITEMGEIPETDNPKFSNVQYAVK 188 (392) T ss_pred CceecchhHH----HHHHHHHHhhhhhhhhceeeeccC-CceeEEEEeecCCccceeecccccccccccccceeEEeeee Confidence 4 3666554 345555555555555555533221 11123344444455677778877777554 57778888888 Q ss_pred EEEEEEEeCHHHHHHHHHhCCCHHHHHHHHHHHHHHHhhccEEEeeccccceEEEEecCCCCcccccccccccccCHHHH Q lcl|Aclame:pro 125 FFQTWTRWGERELEMAGAGRVDLASELNYSSALGLAKFLNGSYLFGVAGLENYGLINDPSLSAPITATTPWSGSPAVEAV 204 (336) Q Consensus 125 ~~~~~~~y~~~El~~A~~~g~~l~~~K~~aAr~a~e~~~n~i~~~Gd~~~g~~GllN~Pnl~~~~~~~t~w~~~~T~~eI 204 (336) .++..+.+|.+=++.+ ..+|.+.-....+.++.+.++.-.+.|++.... . ...+.+.| T Consensus 189 k~~~~~~iS~ell~ds---~~~l~~~i~~~l~~~i~~~~d~~~~~g~g~~~~---------------~----~~~~~d~i 246 (392) T protein:vir:10 189 DRAGILPLSRSLLQDS---DQNILKYVTKWLGKKSKVTRNVLILGVIEKLTK---------------Q----AIKSLDDI 246 (392) T ss_pred eEEEeehhhHHHHhhh---HHHHHHHHHHHHHHHHHHHHHHHHhhccccccc---------------c----CccCHHHH Confidence 8998999988766543 456888888888888888888777776653211 0 01233434 Q ss_pred HHHHHHHHHHHHHHhCCceeccCCcEEEecHHHHHhccc-CCCCCccHHHH-HHHhCC----ccEEE---Ec--cc-ccC Q lcl|Aclame:pro 205 VNEVVTLFQVLQTQSQGIITQEAVLHMGLPPTAMSDLSK-TNQYGLSAAAK-LKEIFP----KLEFV---TI--PE-YDT 272 (336) Q Consensus 205 ~~Di~~l~~~l~~~t~g~v~~~~p~tL~Lp~~~~~~Ls~-~~~~~~Tvl~~-l~~n~p----nl~i~---~~--pe-l~~ 272 (336) .+-++..+... . . ..-+++|.++.+..|.+ .+..|.-++.- +....+ ++.++ .. |. ... T Consensus 247 ~~~~~~~l~~~------~-~--~~a~~vm~~~~~~~L~~lkd~~G~~l~~~~~~~~~~~tllG~~~v~~~~~~~~~~~~~ 317 (392) T protein:vir:10 247 KDVLNVKLDPA------I-S--PNAILLTNQDGFNYLDKLKDKDGKYILQSDPTQKNKKLFAGTNPVVVVSNRFLKSKGT 317 (392) T ss_pred HHHHHHhhhhh------h-c--cCCEEEEcHHHHHHHHHhhccCCCeEeecCccCCccccccCcccEEEecccccCCCcc Confidence 33333222111 1 1 12368999999888854 23333322210 111111 11211 11 11 122 Q ss_pred CCCceEEEEEEeeCCCceEEE--EeCchhhcccc---eecCCceEEeeecceeeeEEecccceeeeccC Q lcl|Aclame:pro 273 ASGRLVQLWAPRVEGKDTATC--GFTEKMRAHSI---ERYSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) Q Consensus 273 a~G~~~~~~~~~~~~~~~~~~--~~p~~~~~l~~---~~~~~~~~v~~~~rt~Gv~ir~P~ai~~~~GI 336 (336) +.+....++.+- .+...+ .-.+.+...+. ....-...+-|..|.+| .+++|.+|+.+..- T Consensus 318 ~~~~~~~~~gdf---s~~~~i~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~-~v~~~~a~~~l~~~ 382 (392) T protein:vir:10 318 TAKKAPLIIGDL---KEAIVLFKREDMELASTDVGGKAFTRNTLDLRAIQRDDV-QMWDNEAAVYGEID 382 (392) T ss_pred cCCceEEEEEeh---hceEEEEeecceEEEEeccccchhhcCceEEEEEEeecc-EEecccceEEEEec Confidence 234444444321 110000 01111111111 01112345667777776 67779999998765 No 109 >protein:vir:102082 Length: 392 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:1503 # MgeName: Fah # Cross-refs: genbank:acc:YP_512315;genbank:gi:89152484;genbank:GeneID:3953075 Probab=88.38 E-value=0.033 Score=28.73 Aligned_cols=295 Identities=7% Similarity=-0.056 Sum_probs=131.9 Q ss_pred CchHHH----------HHHHhh----cceecc----ch-hh------------hhhhhhhhhhhhhhhhcC--ccccCCc Q lcl|Aclame:pro 1 MRDAQR----------IQNLAR----AGVILP----RS-VK------------NVSTPLAEYAMDAADLSP--HLSSTGS 47 (336) Q Consensus 1 m~~~~~----------~~~l~~----~g~~~~----~~-~~------------~~~~~~~~~~~da~d~~~--~l~t~~~ 47 (336) +.++++ +.+.+. .+-... .+ .. .+....+. .......+. ...|.++ T Consensus 35 ~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~-~~~~~~~~~~~~~~t~~~ 113 (392) T protein:vir:10 35 MEEVRSLQKKIDLQRSLDEAETEERNNGREVETRNVDGEMEYRDVFMKALRNKPLNAEERE-FLEDDLEQRAMSGLTGED 113 (392) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhccccccccCccchHHHHHHHHHHHhcccccHHHHH-HHhhhhhhhhccccccCC Confidence 111100 000000 000000 00 00 00000000 000000010 1123334 Q ss_pred c--hHHHHHHHhhCceeeeeeccccchhhhcccccCCCcceeeEEEeeeecccceEEeecccCCceee-eeeeeeeeeEE Q lcl|Aclame:pro 48 S--GIPNYLTTYVDPSVIDILVAPMKAAELVGESKKGDWTTLVAAFITAEPTTTVATYGDYSSDGDSG-TNINYPQRQSY 124 (336) Q Consensus 48 ~--~i~~~l~~~idp~v~~~~~~~~~~~~l~~v~t~g~w~~~t~~~~v~e~~G~a~~ygd~~DiP~vd-~~~~~~~~~v~ 124 (336) + .+|..+. ++|++.+...-....+.++..... ......+......+.+...+.+...|-.+ ...+...-..+ T Consensus 114 gg~~vP~~~~----~~ii~~~~~~s~l~~~~~~~~~~~-~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~v~l~~~ 188 (392) T protein:vir:10 114 GGLVIPQDIQ----TQINELARSFDALEQYVTVEPVRT-RSGSRVLEKNSDMIPFAEITEMGEIPETDNPKFSNVQYAVK 188 (392) T ss_pred CceecchhHH----HHHHHHHHhhhhhhhhceeeeccC-CceeEEEEeecCCccceeecccccccccccccceeEEeeee Confidence 4 3666554 345555555555555555533221 11123344444455677778877777554 57778888888 Q ss_pred EEEEEEEeCHHHHHHHHHhCCCHHHHHHHHHHHHHHHhhccEEEeeccccceEEEEecCCCCcccccccccccccCHHHH Q lcl|Aclame:pro 125 FFQTWTRWGERELEMAGAGRVDLASELNYSSALGLAKFLNGSYLFGVAGLENYGLINDPSLSAPITATTPWSGSPAVEAV 204 (336) Q Consensus 125 ~~~~~~~y~~~El~~A~~~g~~l~~~K~~aAr~a~e~~~n~i~~~Gd~~~g~~GllN~Pnl~~~~~~~t~w~~~~T~~eI 204 (336) .++..+.+|.+=++.+ ..+|.+.-....+.++.+.++.-.+.|++.... . ...+.+.| T Consensus 189 k~~~~~~iS~ell~ds---~~~l~~~i~~~l~~~i~~~~d~~~~~g~g~~~~---------------~----~~~~~d~i 246 (392) T protein:vir:10 189 DRAGILPLSRSLLQDS---DQNILKYVTKWLGKKSKVTRNVLILGVIEKLTK---------------Q----AIKSLDDI 246 (392) T ss_pred eEEEeehhhHHHHhhh---HHHHHHHHHHHHHHHHHHHHHHHHhhccccccc---------------c----CccCHHHH Confidence 8998999988766543 456888888888888888888777776653211 0 01233434 Q ss_pred HHHHHHHHHHHHHHhCCceeccCCcEEEecHHHHHhccc-CCCCCccHHHH-HHHhCC----ccEEE---Ec--cc-ccC Q lcl|Aclame:pro 205 VNEVVTLFQVLQTQSQGIITQEAVLHMGLPPTAMSDLSK-TNQYGLSAAAK-LKEIFP----KLEFV---TI--PE-YDT 272 (336) Q Consensus 205 ~~Di~~l~~~l~~~t~g~v~~~~p~tL~Lp~~~~~~Ls~-~~~~~~Tvl~~-l~~n~p----nl~i~---~~--pe-l~~ 272 (336) .+-++..+... . . ..-+++|.++.+..|.+ .+..|.-++.- +....+ ++.++ .. |. ... T Consensus 247 ~~~~~~~l~~~------~-~--~~a~~vm~~~~~~~L~~lkd~~G~~l~~~~~~~~~~~tllG~~~v~~~~~~~~~~~~~ 317 (392) T protein:vir:10 247 KDVLNVKLDPA------I-S--PNAILLTNQDGFNYLDKLKDKDGKYILQSDPTQKNKKLFAGTNPVVVVSNRFLKSKGT 317 (392) T ss_pred HHHHHHhhhhh------h-c--cCCEEEEcHHHHHHHHHhhccCCCeEeecCccCCccccccCcccEEEecccccCCCcc Confidence 33333222111 1 1 12368999999888854 23333322210 111111 11211 11 11 122 Q ss_pred CCCceEEEEEEeeCCCceEEE--EeCchhhcccc---eecCCceEEeeecceeeeEEecccceeeeccC Q lcl|Aclame:pro 273 ASGRLVQLWAPRVEGKDTATC--GFTEKMRAHSI---ERYSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) Q Consensus 273 a~G~~~~~~~~~~~~~~~~~~--~~p~~~~~l~~---~~~~~~~~v~~~~rt~Gv~ir~P~ai~~~~GI 336 (336) +.+....++.+- .+...+ .-.+.+...+. ....-...+-|..|.+| .+++|.+|+.+..- T Consensus 318 ~~~~~~~~~gdf---s~~~~i~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~-~v~~~~a~~~l~~~ 382 (392) T protein:vir:10 318 TAKKAPLIIGDL---KEAIVLFKREDMELASTDVGGKAFTRNTLDLRAIQRDDV-QMWDNEAAVYGEID 382 (392) T ss_pred cCCceEEEEEeh---hceEEEEeecceEEEEeccccchhhcCceEEEEEEeecc-EEecccceEEEEec Confidence 234444444321 110000 01111111111 01112345667777776 67779999998765 No 110 >protein:vir:101607 Length: 379 # NCBI annotation: major capsid protein precursor # Family: family:all:585 # MgeID: mge:1646 # MgeName: 11b # Cross-refs: genbank:acc:YP_112497;genbank:gi:53793597;uniprot:Q5ZGF6;genbank:GeneID:3101715 Probab=87.63 E-value=0.037 Score=28.40 Aligned_cols=293 Identities=12% Similarity=0.051 Sum_probs=131.8 Q ss_pred Cc---------------hH-HHHHH----Hhhcceeccchh---hhhhh------hhhhhhhhhhhhcCccccCCcc--h Q lcl|Aclame:pro 1 MR---------------DA-QRIQN----LARAGVILPRSV---KNVST------PLAEYAMDAADLSPHLSSTGSS--G 49 (336) Q Consensus 1 m~---------------~~-~~~~~----l~~~g~~~~~~~---~~~~~------~~~~~~~da~d~~~~l~t~~~~--~ 49 (336) |+ +. ..+.+ +++.+-.-+... ..... ..+.+..-.+..++.++++.+. . T Consensus 39 ~~~~~~~~~~e~~~~~~~l~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 118 (379) T protein:vir:10 39 MTSEKDLAVNELKSDMAALQAHADKLDVKLKEKAKSEDKSDSLVKSITENFNDIKEVRNGKSIQVKAVGDMTLPVNLTGA 118 (379) T ss_pred hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccchhHHHHHHHHHHhHHHHHhhhhhhhhhhcccccCCCCccc Confidence 10 00 01111 111111111110 00000 0111100112333444444433 4 Q ss_pred HHHHHHHhhCceeeeeeccccchhhhcccccCCCcceeeEEEeeeecccce--EEeecccCCceeeeeeeeeeeeEEEEE Q lcl|Aclame:pro 50 IPNYLTTYVDPSVIDILVAPMKAAELVGESKKGDWTTLVAAFITAEPTTTV--ATYGDYSSDGDSGTNINYPQRQSYFFQ 127 (336) Q Consensus 50 i~~~l~~~idp~v~~~~~~~~~~~~l~~v~t~g~w~~~t~~~~v~e~~G~a--~~ygd~~DiP~vd~~~~~~~~~v~~~~ 127 (336) ||.... +.|++..-.......++.+.+.. ..++.|+.....+.+ ...+.+...|..+..........+.++ T Consensus 119 ip~~~~----~~ii~~~~~~~~i~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~v~Eg~~~~~~~~~f~~i~~~~~k~~ 191 (379) T protein:vir:10 119 QPKDYN----FDVVLNPSQMLNVSDIVGAVSIS---GGTYTFVRENGAGEGAIGAQVEGATKGQKDYDISMIDVNTDFIA 191 (379) T ss_pred cchhhh----hHHHHhHHhhhhHHhhceeeecc---CCceEEEEeecCCCcccccccCCccccccccceeeeEeeeeeEE Confidence 555544 34455555555555665554332 234566665544433 345777888999999999999999999 Q ss_pred EEEEeCHHHHHHHHHhCCCHHHHHHHHHHHHHHHhhccEEEeeccccceEEEEecCCCCcccccccccccccCHHHHHHH Q lcl|Aclame:pro 128 TWTRWGERELEMAGAGRVDLASELNYSSALGLAKFLNGSYLFGVAGLENYGLINDPSLSAPITATTPWSGSPAVEAVVNE 207 (336) Q Consensus 128 ~~~~y~~~El~~A~~~g~~l~~~K~~aAr~a~e~~~n~i~~~Gd~~~g~~GllN~Pnl~~~~~~~t~w~~~~T~~eI~~D 207 (336) ..+.+|.+=|+-+. .|.+--....++++.+.+|.-.+.|+...+..+. ...+ +..+ ++| T Consensus 192 ~~~~iS~ell~D~~----~l~~~i~~~la~~~~~~~~~~~~~g~~~~~~~~~----------~~~~---~~~~----~d~ 250 (379) T protein:vir:10 192 GFTRYSKKMANNLP----FLTSFIPNALRRDYAKAENAAFNAVLAANATAST----------EIIT---NKNK----VEM 250 (379) T ss_pred eeehhhHHHHhhHH----HHHHHHHHHHHHHHHHHHHHHHhccccccccccc----------cccc---Cccc----HHH Confidence 99998875444432 3666666666666666666644444432221110 1111 1122 356 Q ss_pred HHHHHHHHHHHhCCceeccCCcEEEecHHHHHhccc-CCCCCccHHH--HHHHh-----CCccEEEEcccccCCCCceEE Q lcl|Aclame:pro 208 VVTLFQVLQTQSQGIITQEAVLHMGLPPTAMSDLSK-TNQYGLSAAA--KLKEI-----FPKLEFVTIPEYDTASGRLVQ 279 (336) Q Consensus 208 i~~l~~~l~~~t~g~v~~~~p~tL~Lp~~~~~~Ls~-~~~~~~Tvl~--~l~~n-----~pnl~i~~~pel~~a~G~~~~ 279 (336) |.+++..+... + ..+..++|.|..+..|.+ .+..|.-++. ...++ .-++.++..+.+. .|. . T Consensus 251 i~~~~~~~~~~--~----~~~~~~vmn~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~~~l~G~pvv~s~~~~--ag~--~ 320 (379) T protein:vir:10 251 LINEIAKQENL--D----FPVTAIVLRPTDYYDILVTQKSVGAGYGLPGVVTQDNGVLRINGIPLFRATWLA--ANK--Y 320 (379) T ss_pred HHHHHHhhhhc--c----CCCCEEEEcHHHHHHHHHhhccCCceeccCCccCCCCCcceecceeeEecCCCC--CCc--e Confidence 66665555322 1 135579999988888753 3333433321 00011 1122333333322 221 1 Q ss_pred EEEEeeCCCceEEEEeC----chhhcccc-eecCCceEEeeecceeeeEEecccceee--eccC Q lcl|Aclame:pro 280 LWAPRVEGKDTATCGFT----EKMRAHSI-ERYSSYFRQKKSAGTWGAVIFRPFAVAQ--MIGV 336 (336) Q Consensus 280 ~~~~~~~~~~~~~~~~p----~~~~~l~~-~~~~~~~~v~~~~rt~Gv~ir~P~ai~~--~~GI 336 (336) ++.+- .. ..+.+- ..+...+. ....-...+-++.|. |+.+++|.||++ +.+| T Consensus 321 ~~gdf---~~-~~~~~~~~~~i~~~~~~~~~f~~~~~~~r~~~R~-~~~v~~p~a~v~~~~~~~ 379 (379) T protein:vir:10 321 YVGDW---TR-VTKVTTEGLSLEFSEVEGTNFVKNNITARIEAQV-ALAVEQPAALIFGDFTAV 379 (379) T ss_pred EEeec---cc-EEEEEEeceEEEEeecccccccCCcEEEEEEEEe-ccEEecCccEEEEEecCC Confidence 11111 00 001110 11111110 011223445556676 556668999998 7788 No 111 >protein:vir:81160 Length: 371 # NCBI annotation: major capsid protein # Family: family:all:21 # MgeID: mge:1892 # MgeName: Geobacillus virus E2 # Cross-refs: genbank:acc:YP_001285811;genbank:gi:148747732;genbank:GeneID:5247203 Probab=87.50 E-value=0.038 Score=28.35 Aligned_cols=292 Identities=9% Similarity=-0.004 Sum_probs=134.8 Q ss_pred CchHHHHHHHhhcceeccc----hhhhh-------hhhhhhhhhhhhhhcCccccCCcch--HHHHHHHhhCceeeeeec Q lcl|Aclame:pro 1 MRDAQRIQNLARAGVILPR----SVKNV-------STPLAEYAMDAADLSPHLSSTGSSG--IPNYLTTYVDPSVIDILV 67 (336) Q Consensus 1 m~~~~~~~~l~~~g~~~~~----~~~~~-------~~~~~~~~~da~d~~~~l~t~~~~~--i~~~l~~~idp~v~~~~~ 67 (336) +...+...+.++....... ..... .+..+.....+. ...+.+++| +|..+. +++++.+- T Consensus 45 i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~a~----~~~t~~~gg~~vP~~~~----~~ii~~~~ 116 (371) T protein:vir:81 45 FDVAKELYEEQKQTIEDKEPLKPTVQVKENEVEAFVNHIRTRFRNAM----SEGSNQDGGYTVPQDIQ----TRINELRE 116 (371) T ss_pred HHHHHHHHHHHHHhhccccccccchhhHHHHHHHHHHHHHHHHHHhh----ccCCCccCceeecHhHH----HHHHHHHH Confidence 2222222222111111000 00000 000111111111 112233333 665554 45566555 Q ss_pred cccchhhhcccccCCCcceeeEEEeeeecccceEEeecccCCce-eeeeeeeeeeeEEEEEEEEEeCHHHHHHHHHhCCC Q lcl|Aclame:pro 68 APMKAAELVGESKKGDWTTLVAAFITAEPTTTVATYGDYSSDGD-SGTNINYPQRQSYFFQTWTRWGERELEMAGAGRVD 146 (336) Q Consensus 68 ~~~~~~~l~~v~t~g~w~~~t~~~~v~e~~G~a~~ygd~~DiP~-vd~~~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~ 146 (336) ..-....++++...+. ...++.+......+.+...+.++++|- .+......+.+.+.++..+.+|.+=++.+. .+ T Consensus 117 ~~s~i~~~~~~~~~~~-~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~ds~---~~ 192 (371) T protein:vir:81 117 SKDALQNLITVEPVTT-LSGSRVFKKRSQQTGFVEVAEGAAIGEKATPQFTLLQYQVKKYAGFFRVTNELLNDST---EA 192 (371) T ss_pred hhhhhhhhceeeeccC-CceeEEEEeecCCcceeeeccccccccccccceeeEEeeeeEEEEeehhhHHHHhhhh---HH Confidence 5555666666543322 123344555555677888888888884 567888999999999999999877665443 46 Q ss_pred HHHHHHHHHHHHHHHhhccEEEeeccccceEEEEecCCCCcccccccccccccCHHHHHHHHHHHHHHHHHHhCCceecc Q lcl|Aclame:pro 147 LASELNYSSALGLAKFLNGSYLFGVAGLENYGLINDPSLSAPITATTPWSGSPAVEAVVNEVVTLFQVLQTQSQGIITQE 226 (336) Q Consensus 147 l~~~K~~aAr~a~e~~~n~i~~~Gd~~~g~~GllN~Pnl~~~~~~~t~w~~~~T~~eI~~Di~~l~~~l~~~t~g~v~~~ 226 (336) |.+--....+.++.+.+|+..+.|++...-.| ..+.+.|...++..+.... . T Consensus 193 l~~~i~~~l~~a~~~~~~~~i~~g~g~~~~~~-------------------~~~~~~i~~~~~~~l~~~~---------~ 244 (371) T protein:vir:81 193 IVNTLVRWIGDESRVTRNGLIINVLNTKAKTA-------------------IADLDGLKQIINVQLDPVF---------R 244 (371) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhhcccccccc-------------------cccHHHHHHHHHhhcchhh---------h Confidence 77777777788888888887777765422110 1233444444432221111 1 Q ss_pred CCcEEEecHHHHHhccc-CCCCCccHHH-HHHHh-------CCccEEEEccccc----CCCCceEEEEEEeeCCCceEEE Q lcl|Aclame:pro 227 AVLHMGLPPTAMSDLSK-TNQYGLSAAA-KLKEI-------FPKLEFVTIPEYD----TASGRLVQLWAPRVEGKDTATC 293 (336) Q Consensus 227 ~p~tL~Lp~~~~~~Ls~-~~~~~~Tvl~-~l~~n-------~pnl~i~~~pel~----~a~G~~~~~~~~~~~~~~~~~~ 293 (336) ....++|.+..+..|.+ .+..|.-++. =+... +|=+....+|... +.+.+...+++-.. .+-..+ T Consensus 245 ~~a~~vmn~~~~~~L~~lkd~~g~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~~~~~~~i~~Gd~--~~~~~~ 322 (371) T protein:vir:81 245 STSSVIVNQDAFNWLDTLKDQNGQYLLQPSISSPTGRQLLGLPVVIVSNKVLANRVDGGTGAQFAPIIVGDL--KEAVVM 322 (371) T ss_pred cCCEEEEcHHHHHHHHHhhccCCCeeeecccCCCCCceecceeEEEecccccCccccccccCCcceEEEEeh--hceEEE Confidence 23468999988888854 2333322210 00111 1211112223211 11122222222111 000111 Q ss_pred EeCchh--hcccce---ecCCceEEeeecceeeeEEecccceeeeccC Q lcl|Aclame:pro 294 GFTEKM--RAHSIE---RYSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) Q Consensus 294 ~~p~~~--~~l~~~---~~~~~~~v~~~~rt~Gv~ir~P~ai~~~~GI 336 (336) ...+.+ ...... ...-...+-+..|.+| .+++|.||+.+. + T Consensus 323 ~~~~~~~i~~~~~~~~~f~~~~v~~~~~~r~d~-~~~~~~a~~~~~-~ 368 (371) T protein:vir:81 323 FDRQRTEIMSSNVAMDAFETDATLWRAIERMDV-KMRDDEAFVFGE-V 368 (371) T ss_pred EeecceEEEEeccccchhhcCceEEEEEEeecc-EEecccceEEEE-E Confidence 111111 111110 0112345556666665 566799888776 5 No 112 >protein:vir:102655 Length: 322 # NCBI annotation: Hypothetical protein # Family: family:all:6384 # MgeID: mge:1624 # MgeName: VP2 # Cross-refs: genbank:acc:YP_052979;genbank:gi:50282923;genbank:GeneID:2948122 Probab=86.79 E-value=0.043 Score=28.07 Aligned_cols=283 Identities=12% Similarity=0.080 Sum_probs=115.9 Q ss_pred hhhhhhhhc-CccccCCcchHHHHHHHhhCceeeeeeccccchhhhcccccCCCccee--e-EEEeee--ecccceE--- Q lcl|Aclame:pro 31 YAMDAADLS-PHLSSTGSSGIPNYLTTYVDPSVIDILVAPMKAAELVGESKKGDWTTL--V-AAFITA--EPTTTVA--- 101 (336) Q Consensus 31 ~~~da~d~~-~~l~t~~~~~i~~~l~~~idp~v~~~~~~~~~~~~l~~v~t~g~w~~~--t-~~~~v~--e~~G~a~--- 101 (336) |++-+-.++ |.|++.- .-+|.++|-+ -++..+.... ..|-|..+...-... + ..|.+. ..+|+.. T Consensus 1 ~~~~~~~~~~~~Ms~~i---~~~fv~qy~~--~v~~~~qq~~-s~L~~tV~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 74 (322) T protein:vir:10 1 MKLNAIMSMLPLIAGDI---DQAFVQTYET--TLRILSQQKS-AKLKQYCQHKNESSESHNWETLASMDPDAVKRKRSRQ 74 (322) T ss_pred Ccccceeeeeeeeechh---hhHHHHHHHH--HHHHHHHHhh-hhhhcccccccccccccceeecccccccccccccccc Confidence 555555555 5555421 2345666652 2333333322 333333221111111 0 111211 1233332 Q ss_pred Eeeccc-CCceeeeeeeeeeeeEEEEEEEEEeCHHHHHHHHHhCCCHHHHHHHHHHHHHHHhhccEEEe---eccccceE Q lcl|Aclame:pro 102 TYGDYS-SDGDSGTNINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASELNYSSALGLAKFLNGSYLF---GVAGLENY 177 (336) Q Consensus 102 ~ygd~~-DiP~vd~~~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~K~~aAr~a~e~~~n~i~~~---Gd~~~g~~ 177 (336) ..+|.. |.|...............+..++.+...++.+ +..+..+.-.+++..|++++.+++.+- |.+..+.. T Consensus 75 ~~~d~~~dtp~~~~~~~~r~~~~~d~~~~~~VDd~D~~k---~~~D~~~~~~~~~a~AL~R~~D~~I~~a~~g~a~~~~~ 151 (322) T protein:vir:10 75 QSADGTYPTPVNNKPFAKRRTNVDTYDTGHVVEQEDISQ---MLLDPNSALITSQAYAMARKTDDLIIAGAWKPASIKGT 151 (322) T ss_pred cccCcccCCCccccccceEEEeecccccceecchHHHHH---hhcCchHHHHHHHHHHhhhHHHHHHHhhhhcccccccc Confidence 334543 66766654444444444555566555555443 345666666667777777777774432 44332211 Q ss_pred EEEecCCCCcccccccccccccCHHHHHHHHHHHHHHHHHHhCCceeccCCcEEEecHHHHHhcccCC---CCCccHHHH Q lcl|Aclame:pro 178 GLINDPSLSAPITATTPWSGSPAVEAVVNEVVTLFQVLQTQSQGIITQEAVLHMGLPPTAMSDLSKTN---QYGLSAAAK 254 (336) Q Consensus 178 GllN~Pnl~~~~~~~t~w~~~~T~~eI~~Di~~l~~~l~~~t~g~v~~~~p~tL~Lp~~~~~~Ls~~~---~~~~Tvl~~ 254 (336) | .+....++..--+..++. -++.|.++...+.... +..+.+-.++++|+++..|-.-. +.+..--+. T Consensus 152 g------t~v~~~ss~~i~~g~~g~-t~~kl~~a~~~l~~~d---vp~d~~R~~vv~p~~~~~LL~d~~~ts~D~~~~~~ 221 (322) T protein:vir:10 152 G------QPVEFLATQEIGDGTKPI-SFDYVTEITERFLENE---IEPEVSKVIVIGPTQARKLLQITEATSADYTSAMD 221 (322) T ss_pred c------cccccCCCcccccCccch-hHHHHHHHHHHHHhcC---CCCCCCeEEEeCHHHHHHHhcchhhhhhhcccchh Confidence 1 111101110000011111 2333444444443332 22223457999999998875321 111111233 Q ss_pred HHHh--------CCccEEEEccccc----------CCCCceEEEEEEeeCCCceEEE-EeCchhhcccceecCC-ceEEe Q lcl|Aclame:pro 255 LKEI--------FPKLEFVTIPEYD----------TASGRLVQLWAPRVEGKDTATC-GFTEKMRAHSIERYSS-YFRQK 314 (336) Q Consensus 255 l~~n--------~pnl~i~~~pel~----------~a~G~~~~~~~~~~~~~~~~~~-~~p~~~~~l~~~~~~~-~~~v~ 314 (336) |..+ |..|....+|.-+ .+++.+..+++.+...-..... .+..++--+| ... .+.+- T Consensus 222 l~~~G~ig~~lGf~~i~s~~lp~~~~t~~~~~~~~~~~~~~~~~~a~~k~Av~~a~~~dv~~~i~~~~---~~~~a~~I~ 298 (322) T protein:vir:10 222 LQSKGIITNWMGYTWIVSTRLDKFDPTQWGMAAEDGPQGDEIWCIAMTDMALGYHSCKDIWTKVAEDP---SASFAWRIY 298 (322) T ss_pred hhhcCeeeeeeeEEEEEeccCCccccccccccccCCCCccceeEEEEecCceeEEEeeeeeEEeeccC---Ccchhhhhh Confidence 3322 2223333344221 1234455566655322111110 1222221111 111 12333 Q ss_pred eecceeeeEEecccceeeeccC Q lcl|Aclame:pro 315 KSAGTWGAVIFRPFAVAQMIGV 336 (336) Q Consensus 315 ~~~rt~Gv~ir~P~ai~~~~GI 336 (336) .....|.+. -.|..|+.++=- T Consensus 299 ~~~~~Ga~r-i~~~gVv~i~~~ 319 (322) T protein:vir:10 299 SAFTADCVR-VEDEHIFKLRLK 319 (322) T ss_pred hhhhhCceE-eccCcEEEEEEe Confidence 334444444 477776665544 No 113 >protein:vir:4997 Length: 397 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:109 # MgeName: Sfi21 # Cross-refs: genbank:acc:NP_049971;genbank:gi:9632943;genbank:GeneID:1262106 Probab=85.96 E-value=0.049 Score=27.77 Aligned_cols=295 Identities=10% Similarity=-0.024 Sum_probs=133.2 Q ss_pred CchHHH-------HH----HHh-----------hcceeccchh--hhhhhhhhhh----hhhhhhhcCccccCCc--chH Q lcl|Aclame:pro 1 MRDAQR-------IQ----NLA-----------RAGVILPRSV--KNVSTPLAEY----AMDAADLSPHLSSTGS--SGI 50 (336) Q Consensus 1 m~~~~~-------~~----~l~-----------~~g~~~~~~~--~~~~~~~~~~----~~da~d~~~~l~t~~~--~~i 50 (336) ++++.. +. ..+ +.+..-.... ......+..+ ..++ ..+....+.+. ..| T Consensus 43 ~~ei~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~-~~~~~~~t~~~gg~~i 121 (397) T protein:vir:49 43 KNERDTAKMKRDLFKEQYTEARANEVANMSEEEKKPLTKNEEEVKANFVKDFKNLVRGRYQNL-LDSKTDGSGSDAGLTI 121 (397) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhhhcccccccccccchhhHHHHHHHHHHHHHhhcchhhH-HHhhhccCCccCccee Confidence 111100 00 000 0000000000 0000000000 0111 11111222222 335 Q ss_pred HHHHHHhhCceeeeeeccccchhhhcccccCCCcceeeEEEeee-ecccceEEeecccCCceee-eeeeeeeeeEEEEEE Q lcl|Aclame:pro 51 PNYLTTYVDPSVIDILVAPMKAAELVGESKKGDWTTLVAAFITA-EPTTTVATYGDYSSDGDSG-TNINYPQRQSYFFQT 128 (336) Q Consensus 51 ~~~l~~~idp~v~~~~~~~~~~~~l~~v~t~g~w~~~t~~~~v~-e~~G~a~~ygd~~DiP~vd-~~~~~~~~~v~~~~~ 128 (336) |..+.+ .|++.+.......++..+..... ....+.+... +..+.+...+....+|-.+ ......+...+.++. T Consensus 122 P~~~~~----~ii~~~~~~~~l~~~~~~~~~~~-~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~v~~~~~k~~~ 196 (397) T protein:vir:49 122 PQDIRT----AINTLVRQFDSLQEYVNVENVTT-LTGSRVYEKWADITGLAKLDDEGGQIGQNDDPKLSLIRYAIKRYAG 196 (397) T ss_pred cHHHHH----HHHHHHHhhhhHhhhcceeeccC-CcceEEEEeeccCCcceeeeccccccccccccceeeeEeeeeeeEe Confidence 666554 44454455555555554432221 1122344433 3446677777777787665 356777888888888 Q ss_pred EEEeCHHHHHHHHHhCCCHHHHHHHHHHHHHHHhhccEEEeeccccceEEEEecCCCCcccccccccccccCHHHHHHHH Q lcl|Aclame:pro 129 WTRWGERELEMAGAGRVDLASELNYSSALGLAKFLNGSYLFGVAGLENYGLINDPSLSAPITATTPWSGSPAVEAVVNEV 208 (336) Q Consensus 129 ~~~y~~~El~~A~~~g~~l~~~K~~aAr~a~e~~~n~i~~~Gd~~~g~~GllN~Pnl~~~~~~~t~w~~~~T~~eI~~Di 208 (336) .+.+|.+=++. ..+++.+.-.....+++.+.+|+-.++|++... +. + ...+ ++|| T Consensus 197 ~~~iS~ell~d---s~~~l~~~i~~~l~~~~~~~~d~ail~G~g~~~----------~~--~------~~~~----~d~i 251 (397) T protein:vir:49 197 ISTVTNSLLAD---SAENILAWLSGWIAKKVVVTRNKAILEAIGTLP----------NK--P------TLAK----WDDI 251 (397) T ss_pred ehhhHHHHHhh---hhHHHHHHHHHHHHHHHHHHHHHHHHhcccccc----------cc--c------cccC----HHHH Confidence 88887654433 346788888888888888888888888876421 11 0 1123 3466 Q ss_pred HHHHHHHHHHhCCceeccCCcEEEecHHHHHhccc-CCCCCccHHH-HHHHhCC----ccEEEEcc--cc-cCCCCceEE Q lcl|Aclame:pro 209 VTLFQVLQTQSQGIITQEAVLHMGLPPTAMSDLSK-TNQYGLSAAA-KLKEIFP----KLEFVTIP--EY-DTASGRLVQ 279 (336) Q Consensus 209 ~~l~~~l~~~t~g~v~~~~p~tL~Lp~~~~~~Ls~-~~~~~~Tvl~-~l~~n~p----nl~i~~~p--el-~~a~G~~~~ 279 (336) .+++..+...- ..+..++|.+..+..|.+ .+..|.-++. =+....+ ++.++... .+ .+.++.... T Consensus 252 ~~~~~~l~~~~------~~~a~~v~n~~~~~~l~~lkd~~g~~l~~~~~~~g~~~~l~G~pV~~~~~~~~~~~~~~~~~~ 325 (397) T protein:vir:49 252 IDLQAKVDPAI------KQTSLFLTNTSGFTALKKVKNAMGDYLMERDVKSPTGYSIDGFVVKEISDRFLPNGTGGAMPL 325 (397) T ss_pred HHHHHhhhhhh------cCCCEEEEcHHHHHHHHHhhccCCceeecccccCCCCceecceeeEEecccccccccCCceeE Confidence 66666664321 135589999999998854 3333432221 0111111 11111111 11 222333333 Q ss_pred EEEEeeCCCceEEEE--eCchhhcccc---eecCCceEEeeecceeeeEEecccceeeeccC Q lcl|Aclame:pro 280 LWAPRVEGKDTATCG--FTEKMRAHSI---ERYSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) Q Consensus 280 ~~~~~~~~~~~~~~~--~p~~~~~l~~---~~~~~~~~v~~~~rt~Gv~ir~P~ai~~~~GI 336 (336) +|.+- .+...+. -...+...+. ....-....-+..|.+|. +++|.||+.+..= T Consensus 326 ~~gd~---~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~r~d~~-~~~~~a~~~~~~~ 383 (397) T protein:vir:49 326 YFGDL---KQAVTLFDRQHLSLLSTNIGGGAFETDTTKVRVIDRFDVV-STDTEAFVPASFK 383 (397) T ss_pred EEeec---cceEEEEeecccEEEEeccccchhhcCeeeEEEEEeeccE-EecccceEEEEec Confidence 33321 1100000 0111111111 111223445667777776 6779999887633 No 114 >protein:vir:8843 Length: 317 # NCBI annotation: major head protein # Family: family:all:3919 # MgeID: mge:158 # MgeName: PaP3 # Cross-refs: genbank:acc:NP_775251;genbank:gi:27476049;genbank:GeneID:2700597 Probab=85.17 E-value=0.055 Score=27.50 Aligned_cols=283 Identities=9% Similarity=-0.037 Sum_probs=115.6 Q ss_pred hhhhhhhhcCccccCCcchHHHHHHHhhCceeeeeeccccchhhhcccccCCCcceeeEEEeeeecccceEEe-ecccCC Q lcl|Aclame:pro 31 YAMDAADLSPHLSSTGSSGIPNYLTTYVDPSVIDILVAPMKAAELVGESKKGDWTTLVAAFITAEPTTTVATY-GDYSSD 109 (336) Q Consensus 31 ~~~da~d~~~~l~t~~~~~i~~~l~~~idp~v~~~~~~~~~~~~l~~v~t~g~w~~~t~~~~v~e~~G~a~~y-gd~~Di 109 (336) ||.=|+ ..+|.-+.|.--=|...| +.+--.+.-...+++-.+ -+..++.|+.-+....+... ..++|- T Consensus 1 ma~~~~----~~~t~~~~g~~~dl~~~I----~~isp~dTPf~S~i~~~~---a~~~~~~W~~d~l~~~~~~~~~EG~da 69 (317) T protein:vir:88 1 MATPTN----AVSTVEINGKREDLIDII----YNIAPYDTPFMSAIGKGV---ATAITHEWQTDELRQPGKNTRVEGEDA 69 (317) T ss_pred CCcccc----ceEeeeeeeeeechhhhh----eecCCccCcceeeecCce---ecccEEEEEeeecCCccccccccCccc Confidence 222111 122222222222222222 222222222333444322 22334555544444333211 122333 Q ss_pred ceeeeeee---eeeeeEEEEEEEEEeCHHHHHHHHHhCCCHHHHHHHHHHHHHHHhhccEEEeecc---------ccceE Q lcl|Aclame:pro 110 GDSGTNIN---YPQRQSYFFQTWTRWGERELEMAGAGRVDLASELNYSSALGLAKFLNGSYLFGVA---------GLENY 177 (336) Q Consensus 110 P~vd~~~~---~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~K~~aAr~a~e~~~n~i~~~Gd~---------~~g~~ 177 (336) |....... .-.-+|++=...+.++.+-...++. -++.+....-+...+.+.++...+.|.. ...+- T Consensus 70 ~~~~~~~r~~~~N~tQIf~k~v~VSgTa~av~~~G~--~~ela~q~~kk~~EikrdmE~~li~g~~a~~~~~~t~~r~~~ 147 (317) T protein:vir:88 70 TIKAGSFTTMLNNYCQISDETLQVTGTADRVKKAGR--KNELAYQLAKKSKELKLDMEYALVGAPQAKVQRNTTTPGQMA 147 (317) T ss_pred ccccccCCEEeccEEEEEEeEEEEeehhhhhhhcCc--cchhHHHHHHHHHHHHHHHHHHHhcCeeeccCCCCccchhhh Confidence 22222111 1123466666666666666655442 2322222222223333333333333332 24556 Q ss_pred EEEec--CC-CC-c----ccccccccccccCHHHH-HHHHHHHHHHHHHHhCCceeccCCcEEEecHHHHHhcccCCCCC Q lcl|Aclame:pro 178 GLIND--PS-LS-A----PITATTPWSGSPAVEAV-VNEVVTLFQVLQTQSQGIITQEAVLHMGLPPTAMSDLSKTNQYG 248 (336) Q Consensus 178 GllN~--Pn-l~-~----~~~~~t~w~~~~T~~eI-~~Di~~l~~~l~~~t~g~v~~~~p~tL~Lp~~~~~~Ls~~~~~~ 248 (336) |+++- +| +- . .+...+.-|...|+..+ -+||++++.++|..-+ .|.++.+++..-..|+.-...+ T Consensus 148 Gl~~~i~t~~~~~~~g~~~~~~~~~~~t~~t~~~lte~~l~~~l~~i~~~Gg------~~~~i~v~a~~k~~i~~~~~~~ 221 (317) T protein:vir:88 148 NIFAYYKTNGSLGANGVAPVGDGSNTGTAGDLRLLTEDMLLNASESIWRNGG------QANSIQTSSSIKKAISKNMKGR 221 (317) T ss_pred hHHHHhccCceeccCccccccCCCccccccccccccHHHHHHHHHHHHhcCC------CCCEEEeChHHHHHHHHHhcCC Confidence 76652 11 10 0 00011111222222223 3568889999998543 3667899998888876321111 Q ss_pred ccHHHHHHHhCCccEEEEcccccCCCCceEEEEEEeeC--------CCceEEEEeCchhhcccceecCCceEEeeeccee Q lcl|Aclame:pro 249 LSAAAKLKEIFPKLEFVTIPEYDTASGRLVQLWAPRVE--------GKDTATCGFTEKMRAHSIERYSSYFRQKKSAGTW 320 (336) Q Consensus 249 ~Tvl~~l~~n~pnl~i~~~pel~~a~G~~~~~~~~~~~--------~~~~~~~~~p~~~~~l~~~~~~~~~~v~~~~rt~ 320 (336) .+..... .-.+.-...+-.+.+.-|. +.++..+.- +++.+++++=.+|..-+ ..+....+--....=+ T Consensus 222 ~~~i~~~--~~~~~~g~~v~~~~tdfG~-v~ii~~r~lp~~~~~~~D~~~~~l~~Lr~~~~e~-laKtGd~~k~~i~~E~ 297 (317) T protein:vir:88 222 ATEITLD--ASDNRIAQTVDVYESDFGK-YTIRANRWFHENTLFVFDPKMHSLCYLRPFFQHE-LAKTGDSEKRQLLVEY 297 (317) T ss_pred ceeEEEc--ccCeEEEEEEEEEEeCCeE-EEEEeCCCCCCCeEEEEcccccceeecccceeec-cCCCcccceeEEEEEE Confidence 1100000 0011122233333333332 222222221 23334443323332222 2233334444555678 Q ss_pred eeEEecccceeeeccC Q lcl|Aclame:pro 321 GAVIFRPFAVAQMIGV 336 (336) Q Consensus 321 Gv~ir~P~ai~~~~GI 336 (336) |++++-|.|.+...|| T Consensus 298 tLe~~N~~a~a~i~~l 313 (317) T protein:vir:88 298 TFRVNNEKSGALIRDV 313 (317) T ss_pred EEEEcCccceeEEEEe Confidence 9999999999999999 No 115 >protein:vir:1268 Length: 397 # NCBI annotation: hypothetical protein # Family: family:all:21 # MgeID: mge:329 # MgeName: phi-105 # Cross-refs: genbank:acc:NP_690760;genbank:gi:22855000;genbank:GeneID:955203 Probab=84.47 E-value=0.06 Score=27.27 Aligned_cols=287 Identities=8% Similarity=-0.053 Sum_probs=132.8 Q ss_pred CchHHHHHHHhhcceeccchhhhhhhhhhhhhhhhhhhcCccccCCcch--HHHHHHHhhCceeeeeeccccchhhhccc Q lcl|Aclame:pro 1 MRDAQRIQNLARAGVILPRSVKNVSTPLAEYAMDAADLSPHLSSTGSSG--IPNYLTTYVDPSVIDILVAPMKAAELVGE 78 (336) Q Consensus 1 m~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~da~d~~~~l~t~~~~~--i~~~l~~~idp~v~~~~~~~~~~~~l~~v 78 (336) +..++.+...-+-+......+..+ ......+|. -++.+++| +|..+. ++|++.+...-....++++ T Consensus 92 ~~~~~a~~~~~~~~~~~~~~~~~~-~~~~~~a~~-------~~~~~~gg~lvP~~~~----~~ii~~~~~~~~l~~~~~~ 159 (397) T protein:vir:12 92 QQYSKAFLKGLRGKRLTDEERDLL-DSPEFRAMS-------GINDEDGGILIPEDIG----RQIHEFKRQFEPLEQYVTV 159 (397) T ss_pred HHHHHHHHHHHhccCCcHHHHHHH-hhhhhhhcc-------ccccccCcccCchhHH----HHHHHhhhhhhhHHhhcce Confidence 111111111001111000000000 000000110 12223332 455444 4556666655556666555 Q ss_pred ccCCCcceeeEEEeeeecccceEEeecccCCcee-eeeeeeeeeeEEEEEEEEEeCHHHHHHHHHhCCCHHHHHHHHHHH Q lcl|Aclame:pro 79 SKKGDWTTLVAAFITAEPTTTVATYGDYSSDGDS-GTNINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASELNYSSAL 157 (336) Q Consensus 79 ~t~g~w~~~t~~~~v~e~~G~a~~ygd~~DiP~v-d~~~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~K~~aAr~ 157 (336) ..... ....+.+......+.+...+.+..+|-. ....+......+.++..+.+|.+=+. ....+|.+--....++ T Consensus 160 ~~~~~-~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~~v~~~~~k~~~~~~is~e~l~---ds~~~l~~~i~~~l~~ 235 (397) T protein:vir:12 160 EPVTT-RSGTRLLEKNADMVPFSPVEELGNLPEIDQPRFTKVSYSIIDYGGIMTLSNSMLN---DSDQAIMTYVAKWFAK 235 (397) T ss_pred eeccC-CceeEEEEEecCCcceeeecccccccccccccceeEEeeheeeEeeehhhHHHHh---hchHHHHHHHHHHHHH Confidence 33221 1123445555556678888888888754 45778888888888888888866443 3345777777778888 Q ss_pred HHHHhhccEEEeeccccceEEEEecCCCCcccccccccccccCHHHHHHHHHHHHH-HHHHHhCCceeccCCcEEEecHH Q lcl|Aclame:pro 158 GLAKFLNGSYLFGVAGLENYGLINDPSLSAPITATTPWSGSPAVEAVVNEVVTLFQ-VLQTQSQGIITQEAVLHMGLPPT 236 (336) Q Consensus 158 a~e~~~n~i~~~Gd~~~g~~GllN~Pnl~~~~~~~t~w~~~~T~~eI~~Di~~l~~-~l~~~t~g~v~~~~p~tL~Lp~~ 236 (336) ++.+.++.-.+.|++...-.|. .+ ++||.+++. .+... . .....++|.+. T Consensus 236 ~~~~~~d~~il~G~g~~~~~g~-------------------~~----~~~i~~~~~~~l~~~----~--~~~a~~~~n~~ 286 (397) T protein:vir:12 236 KSVVTRNNLILAAIASLKKVDI-------------------DG----LDGIKKALNVTLDPM----V--APGSIVLTNQD 286 (397) T ss_pred HHHHHHHHHHHhcccccccccc-------------------cc----HHHHHHHHhhccchh----h--hCCCEEEEcHH Confidence 8888888888888764321111 12 334444433 22111 1 12346899998 Q ss_pred HHHhccc-CCCCCccHHH-HHHHhCC----ccEEEEcccc--cCCCCceEEEEEEeeCCCceEEEEeCch--hhcc--c- Q lcl|Aclame:pro 237 AMSDLSK-TNQYGLSAAA-KLKEIFP----KLEFVTIPEY--DTASGRLVQLWAPRVEGKDTATCGFTEK--MRAH--S- 303 (336) Q Consensus 237 ~~~~Ls~-~~~~~~Tvl~-~l~~n~p----nl~i~~~pel--~~a~G~~~~~~~~~~~~~~~~~~~~p~~--~~~l--~- 303 (336) .+..|.+ .+..|.-++. -+....| ++.+...+.. ..+.|....++.+- .+-..+..-+. +... + T Consensus 287 ~~~~L~~lkd~~G~~l~~~~~~~g~~~~l~G~pv~~~~~~~~~~~~~~~~~~~gd~---~~~~~~~~~~~~~i~~~~~~~ 363 (397) T protein:vir:12 287 GYDWLDTLKDGTGRYLLQPDPTNPTKKLLDGRPVVPFTNRVLKTQKGKAPLIIGNL---KEAIVLFDREQQSIASTDTGA 363 (397) T ss_pred HHHHHHHhhccCCceeecccccCCCCccccceeeEEecccccccCCCccEEEEEeh---hceEEEEeecceEEEEecccc Confidence 8888854 3333432211 0111111 1222222221 12233333333221 11111111111 1111 1 Q ss_pred ceecCCceEEeeecceeeeEEecccceeeeccC Q lcl|Aclame:pro 304 IERYSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) Q Consensus 304 ~~~~~~~~~v~~~~rt~Gv~ir~P~ai~~~~GI 336 (336) .....-...+-+..|.+| .++.|-||+.++-= T Consensus 364 ~~f~~~~~~~r~~~r~d~-~~~~~~a~~~~~~t 395 (397) T protein:vir:12 364 GAFETNSTKVRGIEREDV-RKWDEDAVVFGQIT 395 (397) T ss_pred chhhcCceEEEEEEeecc-EEecccceEEEEEe Confidence 011122355667777766 55889999887765 No 116 >protein:vir:1383 Length: 421 # NCBI annotation: major capsid protein # Family: family:all:21 # MgeID: mge:314 # MgeName: phi3626 # Cross-refs: genbank:acc:NP_612835;genbank:gi:20065969;genbank:GeneID:935826 Probab=83.23 E-value=0.07 Score=26.91 Aligned_cols=295 Identities=8% Similarity=-0.060 Sum_probs=118.7 Q ss_pred CchHH-------HHHHHhhccee-cc-------chhh-hhhhhhhhh--hh---hhhhhcCccccCCcc--hHHHHHHHh Q lcl|Aclame:pro 1 MRDAQ-------RIQNLARAGVI-LP-------RSVK-NVSTPLAEY--AM---DAADLSPHLSSTGSS--GIPNYLTTY 57 (336) Q Consensus 1 m~~~~-------~~~~l~~~g~~-~~-------~~~~-~~~~~~~~~--~~---da~d~~~~l~t~~~~--~i~~~l~~~ 57 (336) |...+ +.....+.... .. .... ......+.+ .+ ........+.+.+++ .||..+.+ T Consensus 54 i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ra~~t~~~gg~liP~~~~~- 132 (421) T protein:vir:13 54 MEIIEEEIESVMTAIDEERKNTNFTGGRVIINGDSKEEKRSLQLSAMSKTIRGIQLSEEERDIMSSTNNGAVIPQEFVN- 132 (421) T ss_pred HHHHHHHHHHHHHHHHHHHhhhcccccccccccchhHHHHHHHHHHHHHhhhccchhHHHhhccccCCcceecchhhHH- Confidence 11110 00000000000 00 0000 000000000 00 000001112333333 36655543 Q ss_pred hCceeeeeeccccchhhhcccccCCCcceeeEEEeeeeccc--ceEEeecccCCceeeeeeeeeeeeEEEEEEEEEeCHH Q lcl|Aclame:pro 58 VDPSVIDILVAPMKAAELVGESKKGDWTTLVAAFITAEPTT--TVATYGDYSSDGDSGTNINYPQRQSYFFQTWTRWGER 135 (336) Q Consensus 58 idp~v~~~~~~~~~~~~l~~v~t~g~w~~~t~~~~v~e~~G--~a~~ygd~~DiP~vd~~~~~~~~~v~~~~~~~~y~~~ 135 (336) +|++.+........++.+.+... .+..|++..... .+...+...++|..+.......-.++.++..+.+|.+ T Consensus 133 ---~Ii~~~~~~~~l~~l~~~~~~~~---~~~~~~~~~~~~~~~~~~~~E~~~~~~s~~~f~~i~~~~~k~~~~v~iS~e 206 (421) T protein:vir:13 133 ---EFEKLKEGYPSLKEHCHVIPVNR---NAGKMPVRAGASVDKLANLAKDTELVKAMLKTQPMAYDIDDYGLLAPIDNS 206 (421) T ss_pred ---HHHHHHHhhhhhhhhceeeeccC---CceEEEEeecCCccceeeccccccccccccceeEEEeeeeeeEeehhhhHH Confidence 44444444444445544432221 223444433332 2344566778888888777788888888888888865 Q ss_pred HHHHHHHhCCCHHHHHHHHHHHHHHHhhccEEEeeccccceEEEEecCCCCcccccccccccccCHHHHHHHHHHHHHHH Q lcl|Aclame:pro 136 ELEMAGAGRVDLASELNYSSALGLAKFLNGSYLFGVAGLENYGLINDPSLSAPITATTPWSGSPAVEAVVNEVVTLFQVL 215 (336) Q Consensus 136 El~~A~~~g~~l~~~K~~aAr~a~e~~~n~i~~~Gd~~~g~~GllN~Pnl~~~~~~~t~w~~~~T~~eI~~Di~~l~~~l 215 (336) =|+-+ ..+|.+--....++++...+|.-.. ....|+++.+ ...+ ++||.+++..+ T Consensus 207 ll~ds---~~~l~~~i~~~la~~~~~~~~~~i~-----~~~~g~~~~~-------------~~~~----~d~i~~~~~~l 261 (421) T protein:vir:13 207 LLEDS---EINFLEFVNEEFAEFAVNTENAEIV-----KQAKAVLAEE-------------TIND----YAGLVKTINSL 261 (421) T ss_pred HHhhh---HHHHHHHHHHHHHHHHHHHhhhhHh-----hhhhhccccc-------------cccc----hHHHHHHHHHh Confidence 44433 3356655555556666665553211 1122332211 0123 45666677766 Q ss_pred HHHhCCceeccCCcEEEecHHHHHhccc-CCCCCccHHHHHHHhCC----ccEEEEccccc-CCCCceEEEEEEeeCCCc Q lcl|Aclame:pro 216 QTQSQGIITQEAVLHMGLPPTAMSDLSK-TNQYGLSAAAKLKEIFP----KLEFVTIPEYD-TASGRLVQLWAPRVEGKD 289 (336) Q Consensus 216 ~~~t~g~v~~~~p~tL~Lp~~~~~~Ls~-~~~~~~Tvl~~l~~n~p----nl~i~~~pel~-~a~G~~~~~~~~~~~~~~ 289 (336) ...- .....++|.+..+..|.+ .+..|.=++.-+...-| ++.++..+..- ++++....++.+- .+ T Consensus 262 ~~~~------~~~a~~v~n~~~~~~l~~lkd~~G~~i~~~~~~~~~~tl~G~pV~~~~~~~~~~~~~~~~~~gd~---~~ 332 (421) T protein:vir:13 262 VPNA------RKRAIIVTNSDGRAYLDGLMDKQGRPLLKELSDGGDLVFKGRPVIELEESIFDVGDETKFIVSDF---KT 332 (421) T ss_pred hhhh------cCCCEEEEcHHHHHHHHHhhcCCCceeecCcCCCCCceecceeeEEeccccccCCCceEEEEEec---cc Confidence 4321 124579999999988864 34434333322221111 12333333222 2222222222221 11 Q ss_pred eEEEEeCchhhcc--c-ceecCCceEEeeecceeee----------EEecccceeeeccC Q lcl|Aclame:pro 290 TATCGFTEKMRAH--S-IERYSSYFRQKKSAGTWGA----------VIFRPFAVAQMIGV 336 (336) Q Consensus 290 ~~~~~~p~~~~~l--~-~~~~~~~~~v~~~~rt~Gv----------~ir~P~ai~~~~GI 336 (336) ...+...+.++.. . .....-.+.+-+..|.+|. .+.+|.+++...+. T Consensus 333 ~~~~~~~~~~~v~~~~~~~f~~~~~~~r~~~r~d~~~~~~~a~~~~~~~~~~a~v~~~~~ 392 (421) T protein:vir:13 333 LIKFMDRKQYLIDQSKEAGYTKNETIARIIERFDVNSPLDKSSDAEKIRKFGVIVKLQEV 392 (421) T ss_pred cEEEEEecceEEEeecccccccCeeEEEEEeeecceeecchhhheeeecccceeeccccc Confidence 0111111111110 0 0011122334445555444 44455556666665 No 117 >protein:vir:9361 Length: 402 # NCBI annotation: SLT orf 37-like protein # Family: family:all:658 # MgeID: mge:166 # MgeName: phi 12 # Cross-refs: genbank:acc:NP_803339;genbank:gi:29028650;genbank:GeneID:1258088 Probab=80.73 E-value=0.092 Score=26.26 Aligned_cols=295 Identities=8% Similarity=-0.026 Sum_probs=127.0 Q ss_pred CchHHHH----HHHh------hcceecc----chhhh-------------hhhhhhhhhhhhhhhcCccc--cCCcc--h Q lcl|Aclame:pro 1 MRDAQRI----QNLA------RAGVILP----RSVKN-------------VSTPLAEYAMDAADLSPHLS--STGSS--G 49 (336) Q Consensus 1 m~~~~~~----~~l~------~~g~~~~----~~~~~-------------~~~~~~~~~~da~d~~~~l~--t~~~~--~ 49 (336) ...++.+ ..++ ..+..-+ ..... .........+.+......++ +.+++ . T Consensus 65 ~~~~~~l~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~~~~~a~~~~t~~~GG~l 144 (402) T protein:vir:93 65 QQRFNIVERQVQDIEEKEKAKVKDKGEAYQSLSDNEKMVKAKAEFYRHAILPNEFEKPSMEAQRLLHALPTGNDSGGDKL 144 (402) T ss_pred HHHHHHHHHHHHHHHHHHHhhhhhccccCCCCchhHHHHHHHHHHHHHHHhhhhHHHHHHhHHHHHhhhccCCCcCCccc Confidence 0000000 0000 0000000 00000 00011111112111111222 22333 3 Q ss_pred HHHHHHHhhCceeeeeeccccchhhhcccccCCCcceeeEEEeeee-cccceEEeecccCCceeeeeeeeeeeeEEEEEE Q lcl|Aclame:pro 50 IPNYLTTYVDPSVIDILVAPMKAAELVGESKKGDWTTLVAAFITAE-PTTTVATYGDYSSDGDSGTNINYPQRQSYFFQT 128 (336) Q Consensus 50 i~~~l~~~idp~v~~~~~~~~~~~~l~~v~t~g~w~~~t~~~~v~e-~~G~a~~ygd~~DiP~vd~~~~~~~~~v~~~~~ 128 (336) ||..+.+ +|++.+...-..+.+..+.+.+.. .++.++ ..+.+...+.....|-.+.......-..+.++. T Consensus 145 IP~~~~~----~Ii~~~~~~~~l~~~~~v~~~~~~-----~~p~~~~~~~~a~~v~Eg~~~~~~~~~f~~i~~~~~k~~~ 215 (402) T protein:vir:93 145 LPKTLSK----EIVSEPFAKNQLREKARLTNIKGL-----EIPRVSYTLDDDDFITDVETAKELKAKGDTVKFTTNKFKV 215 (402) T ss_pred cchhHHH----HHHHhHHhhhhhhhhceeeecCCc-----eeeeeeccCCccccccccccccccccccceeeecceeeee Confidence 7776654 334444444444555555544432 223332 345567778877888888888888888888888 Q ss_pred EEEeCHHHHHHHHHhCCCHHHHHHHHHHHHHHHhhccE-EEeeccccceEEEEecCCCCcccccccccccccCHHHHHHH Q lcl|Aclame:pro 129 WTRWGERELEMAGAGRVDLASELNYSSALGLAKFLNGS-YLFGVAGLENYGLINDPSLSAPITATTPWSGSPAVEAVVNE 207 (336) Q Consensus 129 ~~~y~~~El~~A~~~g~~l~~~K~~aAr~a~e~~~n~i-~~~Gd~~~g~~GllN~Pnl~~~~~~~t~w~~~~T~~eI~~D 207 (336) .+.+|.+=|.- ...++.+.-....+.++...+++. +..|++...-.|.++++.+... +....++| T Consensus 216 ~i~iS~ell~D---s~~~l~~~i~~~la~~~~~~e~~~~~~~g~g~g~p~g~~~~~~~~~~-----------~~~~~~d~ 281 (402) T protein:vir:93 216 FAAISDTVIHG---SDVDLVNWVENALQSGLAAKERKDALAVSPKSGLEHMSFYNGSVKEV-----------EGADMYDA 281 (402) T ss_pred echhhHHHHhh---hHHHHHHHHHHHHHHHHHHHHHHhHhhcCCCccccceeeeccccccc-----------cccchHHH Confidence 88888553432 345566666666666666665543 4445555555677776544321 11223667 Q ss_pred HHHHHHHHHHHhCCceeccCCcEEEecHH-HHHhcccCCCCCccHHHHHHHhCCccEEEEcccccCCCCceEEEEEEeeC Q lcl|Aclame:pro 208 VVTLFQVLQTQSQGIITQEAVLHMGLPPT-AMSDLSKTNQYGLSAAAKLKEIFPKLEFVTIPEYDTASGRLVQLWAPRVE 286 (336) Q Consensus 208 i~~l~~~l~~~t~g~v~~~~p~tL~Lp~~-~~~~Ls~~~~~~~Tvl~~l~~n~pnl~i~~~pel~~a~G~~~~~~~~~~~ 286 (336) |.+++..|...-. + .-..+|-+. ....+....+.|-.++. .-|+ ++-..|-..+++- ....|-+- T Consensus 282 l~~~~~~l~~~y~-----~-na~~imn~~t~~~~~~~~~d~~~~~~~----~~~~-~llG~PV~~t~~~-~~i~~GDf-- 347 (402) T protein:vir:93 282 IINALADLHEDYR-----D-NATIYMRYADYVKIISVLSNGTTNFFD----TPAE-KVFGKPVVFTDAA-VKPIVGDF-- 347 (402) T ss_pred HHHHHhccChhhh-----c-CCEEEEechHHHHHHHHHhcCCCcccc----cCCc-cccccceEEecCC-Cceeeech-- Confidence 7777776643211 1 114566544 44444443333333321 1122 2222222222210 11111100 Q ss_pred CCceEEEEe-CchhhcccceecCCceEEeeecceeeeEEecccceeeec--cC Q lcl|Aclame:pro 287 GKDTATCGF-TEKMRAHSIERYSSYFRQKKSAGTWGAVIFRPFAVAQMI--GV 336 (336) Q Consensus 287 ~~~~~~~~~-p~~~~~l~~~~~~~~~~v~~~~rt~Gv~ir~P~ai~~~~--GI 336 (336) .. ..+.+ .+-++.. -+.......+-+..|.+|.++ .|-||+.+. +- T Consensus 348 -~~-~~~~~~~~~~~~~-~~~~~~~~~~~~~~r~Dg~v~-~~~A~~~l~ik~~ 396 (402) T protein:vir:93 348 -NY-FGINYDGTTYDTD-KDVKKGEYLFVLTAWYDQQRT-LDSAFRIAKAKEN 396 (402) T ss_pred -hh-hhhhhhhhhhhhh-hcccCCceEEEEEEEeCcEEe-chhheEEEEeecC Confidence 00 00000 0111111 011223455566677776665 599887543 22 No 118 >protein:vir:80128 Length: 466 # NCBI annotation: Phage capsid protein # Family: family:all:635 # MgeID: mge:1877 # MgeName: bacteriophage bv1 # Cross-refs: genbank:acc:YP_001425603;genbank:gi:155042936;genbank:GeneID:5469556 Probab=79.83 E-value=0.1 Score=26.05 Aligned_cols=300 Identities=13% Similarity=0.094 Sum_probs=128.3 Q ss_pred CchH--HHHHHHh--hcceeccchhhhhhhhhhhh---hhhhhhhcCccccCCcchHHHHHHHhhCceeeeeeccccchh Q lcl|Aclame:pro 1 MRDA--QRIQNLA--RAGVILPRSVKNVSTPLAEY---AMDAADLSPHLSSTGSSGIPNYLTTYVDPSVIDILVAPMKAA 73 (336) Q Consensus 1 m~~~--~~~~~l~--~~g~~~~~~~~~~~~~~~~~---~~da~d~~~~l~t~~~~~i~~~l~~~idp~v~~~~~~~~~~~ 73 (336) .++. ..++.+. +.+.+ ....+.+.. ..+.+....+ .+.....+|.++.+-|-..+ +...+-++.. T Consensus 112 ~~~~~~~~~~~~~~~~~~~~------~~~~~~~~~~~~~~~~~~~~~~-~~g~~~~vP~~~~~~i~~~l-~~~~~l~~~~ 183 (466) T protein:vir:80 112 GETRMKGFFRNMPYEQRAAL------IARSEVKEFLAQVRTLAQQKRA-VSGAELTIPDVMLELLRDNM-HRYSKLISKV 183 (466) T ss_pred HHHHHHHHHHhhhhhhHHHH------HHHHHHHHHHHHHHHHhhhhhh-hccccccccHHHHHHHHHhh-hhhhhhhhhe Confidence 0000 0000000 00000 000000000 0111100001 11112346776665442222 1111222223 Q ss_pred hhcccccCCCcceeeEEEeeeecccceEEeecccCCceeeeeeeeeeeeEEEEEEEEEeCHHHHHHHHHhCCCHHHHHHH Q lcl|Aclame:pro 74 ELVGESKKGDWTTLVAAFITAEPTTTVATYGDYSSDGDSGTNINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASELNY 153 (336) Q Consensus 74 ~l~~v~t~g~w~~~t~~~~v~e~~G~a~~ygd~~DiP~vd~~~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~K~~ 153 (336) ...|++. +..+.+......+.+.+...++|..+.......-.++.++.-+.+|.+=|. ....++.+--.. T Consensus 184 ~v~~~~g-------~~~~~~~~~~~~a~wv~E~~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~---ds~~~l~~~i~~ 253 (466) T protein:vir:80 184 RLRPLKG-------TARQNIAGAIPEGVWTEAVANLNELSLSFSQIEVDGYKVGGFIPIPNSTLE---DSDLNLADEILD 253 (466) T ss_pred eeeecCc-------eeEeeeecCCcceeecccccccccccccccceeecceeeeeehhhhHHHHh---cchHHHHHHHHH Confidence 3333221 234455445556677777888888888777788888888887778766554 344578888888 Q ss_pred HHHHHHHHhhccEEEeeccccceEEEEecCCCCccccc---ccccccccCHHH-------------HHHHHHHHHHHHHH Q lcl|Aclame:pro 154 SSALGLAKFLNGSYLFGVAGLENYGLINDPSLSAPITA---TTPWSGSPAVEA-------------VVNEVVTLFQVLQT 217 (336) Q Consensus 154 aAr~a~e~~~n~i~~~Gd~~~g~~GllN~Pnl~~~~~~---~t~w~~~~T~~e-------------I~~Di~~l~~~l~~ 217 (336) ..+.++...+|.-++.|++...-.|+||+......... ..+.+...+... .+.|+..++..+.. T Consensus 254 ~la~~~~~~~~~ail~G~G~~~P~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 333 (466) T protein:vir:80 254 AIGQAIGFALDKAILYGTGTKMPVGIVTRLAQTTQPPNWGTKAPAWTNLSTTNLLKIDPTGKSAEEFFSELVLKLSKARA 333 (466) T ss_pred HHHHHHHHHHhhheeeccCCCCcceeeecccccccccccccccccccccchhhhhhhhhhccchhhHHHHHHHHHHhhhc Confidence 88999999999999999988888899997543211111 111122112111 12222111111100 Q ss_pred HhCCceeccCCcEE-EecHHHHHhcc-cC---CCCCccHHHHHHHh-CC--ccEEEE---cccccCCCCce-EEEEEEee Q lcl|Aclame:pro 218 QSQGIITQEAVLHM-GLPPTAMSDLS-KT---NQYGLSAAAKLKEI-FP--KLEFVT---IPEYDTASGRL-VQLWAPRV 285 (336) Q Consensus 218 ~t~g~v~~~~p~tL-~Lp~~~~~~Ls-~~---~~~~~Tvl~~l~~n-~p--nl~i~~---~pel~~a~G~~-~~~~~~~~ 285 (336) . ...+.-+ ++.+..+..|. .. +..|. +-+--.| .| +..|+. +|+-.--.|.. .+.+.++ T Consensus 334 ~------~~~~~~~w~~~~~~~~~l~~~~~~~~~~g~--~~~~~~~~~~i~G~pvv~s~~~~~~~~~~g~~~~y~i~~r- 404 (466) T protein:vir:80 334 N------YSNGMKFWAMSSNTHAVLMSKAITFNSAGA--LVASLNNTMPIVGGDIVILDFIPDNDIIGGYGSLYLLAER- 404 (466) T ss_pred c------ccCCceeEEecchhHHHhhcccccccCCcc--ccccCCCcccccccceeecCccCccceeeeccccEEEEee- Confidence 0 0123333 33334443332 11 11111 0000001 01 122222 22211112222 2233222 Q ss_pred CCCceEEEEeCchhhcccceecCCceEEeeecceeeeEEecccceeeeccC Q lcl|Aclame:pro 286 EGKDTATCGFTEKMRAHSIERYSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) Q Consensus 286 ~~~~~~~~~~p~~~~~l~~~~~~~~~~v~~~~rt~Gv~ir~P~ai~~~~GI 336 (336) .+ .++.......+ ..-...+-+..|.+|- ++.|.||+.+++= T Consensus 405 ~~---~~i~~~~~~~f-----~~d~~~~r~~~r~dg~-~~~~~afv~~~~~ 446 (466) T protein:vir:80 405 AD---IKLAQSEHVRF-----IEDQTVFKGTARYDGK-PVFGEGFVAVNIA 446 (466) T ss_pred cc---eEEEechhhhh-----hcCcEEEEEEEEEccE-EeccCceEEEEec Confidence 11 22222221111 1123456667777554 4799999998744 No 119 >protein:vir:100172 Length: 394 # NCBI annotation: putative major head protein # Family: family:all:21 # MgeID: mge:1524 # MgeName: phi AT3 # Cross-refs: genbank:acc:YP_025031;genbank:gi:48697264;genbank:GeneID:2948270 Probab=76.48 E-value=0.14 Score=25.35 Aligned_cols=287 Identities=10% Similarity=0.035 Sum_probs=127.6 Q ss_pred CchHHHHHH---Hhhccee--ccc---hhhh-hh---hhhhhh------hhhhhhhcCccccCCc--chHHHHHHHhhCc Q lcl|Aclame:pro 1 MRDAQRIQN---LARAGVI--LPR---SVKN-VS---TPLAEY------AMDAADLSPHLSSTGS--SGIPNYLTTYVDP 60 (336) Q Consensus 1 m~~~~~~~~---l~~~g~~--~~~---~~~~-~~---~~~~~~------~~da~d~~~~l~t~~~--~~i~~~l~~~idp 60 (336) ..+.+.... ....++. .+. .... .. ..+..+ ..+.+ ....|.++ ..+|..+. . T Consensus 57 i~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~---~~~~t~~~gg~~vP~~~~----~ 129 (394) T protein:vir:10 57 IKDLEAENKANSDPDKPVDNAQPNGTDLKKKPIDAKKKAINDFIHSHGKVIDNA---AGHVTSTEAGVLIPEEII----Y 129 (394) T ss_pred HHHHHHHHHhhcchhhhhhhhcccccchhhhHHHHHHHHHHHHHhccchhhhhh---hcccccccCceeccHHHH----H Confidence 111111000 0000000 000 0000 00 001111 11111 11122333 23555544 3 Q ss_pred eeeeeeccccchhhhcccccCCCcceeeEEEeeeec-ccceEEeecccCCce-eeeeeeeeeeeEEEEEEEEEeCHHHHH Q lcl|Aclame:pro 61 SVIDILVAPMKAAELVGESKKGDWTTLVAAFITAEP-TTTVATYGDYSSDGD-SGTNINYPQRQSYFFQTWTRWGERELE 138 (336) Q Consensus 61 ~v~~~~~~~~~~~~l~~v~t~g~w~~~t~~~~v~e~-~G~a~~ygd~~DiP~-vd~~~~~~~~~v~~~~~~~~y~~~El~ 138 (336) +|++.+.+......++.+...+. .+..|++... .+.+...+...+.|- .+...+.....++.++.-+.+|.+=|+ T Consensus 130 ~ii~~~~~~~~l~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~~~~v~l~~~k~~~~~~iS~ell~ 206 (394) T protein:vir:10 130 DPTAEVNSVVDLSTLVTKTPVTT---PKGTYPILKRATDRFSSVAELAENPALAEPEFEQVDWSVSTYRGAIPLSEEAIA 206 (394) T ss_pred HHHHHHHhhhhhhhhceeeeccC---CceEEEEEecCCCccccccccccccccccccceeEEeeeeeeEeeehhHHHHHh Confidence 55666666666666665543321 2345555554 466778888888884 556788888888888888888877666 Q ss_pred HHHHhCCCHHHHHHHHHHHHHHHhhccEEEeeccccceEEEEecCCCCcccccccccccccCHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 139 MAGAGRVDLASELNYSSALGLAKFLNGSYLFGVAGLENYGLINDPSLSAPITATTPWSGSPAVEAVVNEVVTLFQVLQTQ 218 (336) Q Consensus 139 ~A~~~g~~l~~~K~~aAr~a~e~~~n~i~~~Gd~~~g~~GllN~Pnl~~~~~~~t~w~~~~T~~eI~~Di~~l~~~l~~~ 218 (336) .+ ..+|.+.-....+.++...+|+-.+.|.+.. .+..++ +..+.+ ||..++...... T Consensus 207 ds---~~~l~~~i~~~la~~~~~~~~~~il~g~g~~----------~~~~~~------~~~~~d----~l~~~~~~~~~~ 263 (394) T protein:vir:10 207 DS---AVDLTSLVGQSINEKSVNTYNAMIAPVLQSF----------TAKATT------TDTLVD----SLKHILNVDLDP 263 (394) T ss_pred hh---hHHHHHHHHHHHHHHHHHHHHHHHhhccccc----------cccccc------ccccHH----HHHHHHHhhhhh Confidence 54 3467777777777777777877666665421 111011 122333 344443322111 Q ss_pred hCCceeccCCcEEEecHHHHHhccc-CCCCCccHHHHHH-----HhCC----ccEEEEcc--cccCCCCceEEEEEEeeC Q lcl|Aclame:pro 219 SQGIITQEAVLHMGLPPTAMSDLSK-TNQYGLSAAAKLK-----EIFP----KLEFVTIP--EYDTASGRLVQLWAPRVE 286 (336) Q Consensus 219 t~g~v~~~~p~tL~Lp~~~~~~Ls~-~~~~~~Tvl~~l~-----~n~p----nl~i~~~p--el~~a~G~~~~~~~~~~~ 286 (336) - + .-.++|.++.+..|.+ .+..|.-++.--. ...| ++.++... .+.++.|....++.+-.+ T Consensus 264 ~-~------~a~~vmn~~~~~~l~~lkd~~G~~i~~~~~~~~~~~~~~~~L~G~PV~~~~~~~~~~~~~~~~i~~gd~s~ 336 (394) T protein:vir:10 264 A-Y------SRALVVTQSLFNTLDTLKDKNGRYLLHDASDSITDGTAKGTVLGVPVYVVGDALLGSAAGDQKAFVGDLKR 336 (394) T ss_pred h-c------cCEEEecHHHHHHHHHhhccCCCeeeeccccccccCCcccccccceeEEecccccCCCCCceEEEEeeccc Confidence 1 1 1268999988888864 3333432221100 0011 12232222 223334444333332100 Q ss_pred -----CCceEEEEeCchhhcccceecCCceEEeeecceeeeEEecccceeeeccC Q lcl|Aclame:pro 287 -----GKDTATCGFTEKMRAHSIERYSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) Q Consensus 287 -----~~~~~~~~~p~~~~~l~~~~~~~~~~v~~~~rt~Gv~ir~P~ai~~~~GI 336 (336) ...-.++.+ .++.. -...+-...|.+| .+++|.+|+.+..= T Consensus 337 ~~~~~~~~~~~v~~-~~~~~-------~~~~~~~~~r~d~-~~~~~~ai~~~~~~ 382 (394) T protein:vir:10 337 GVLFADRQQVTLAW-EDSKI-------YGRYLGAAFRFGV-KQADSNAGYFVTNT 382 (394) T ss_pred cEEEEeecceEEEE-ecccc-------cceeEEEEEEecc-EEeccccEEEEEee Confidence 001111111 11100 0111233456654 56669999887644 No 120 >protein:vir:94933 Length: 330 # NCBI annotation: putative phage structural protein # Family: family:all:1120 # MgeID: mge:1538 # MgeName: Xp15 # Cross-refs: genbank:acc:YP_239278;genbank:gi:66392060;genbank:GeneID:5076578 Probab=76.27 E-value=0.14 Score=25.31 Aligned_cols=289 Identities=13% Similarity=0.104 Sum_probs=116.6 Q ss_pred CchHHHHHHHhh---cceeccc-hhhhhhhhhhhhhhhhhhhcCccccCCcchHHHHHHHhhCceeeeeeccccchhhhc Q lcl|Aclame:pro 1 MRDAQRIQNLAR---AGVILPR-SVKNVSTPLAEYAMDAADLSPHLSSTGSSGIPNYLTTYVDPSVIDILVAPMKAAELV 76 (336) Q Consensus 1 m~~~~~~~~l~~---~g~~~~~-~~~~~~~~~~~~~~da~d~~~~l~t~~~~~i~~~l~~~idp~v~~~~~~~~~~~~l~ 76 (336) |-..-.-..--| +--.||. +..-++ ..+|+ -+ -+..+..+++|.+-..-...+.+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~p~l~m~alT------Laea~----~l-----------~~d~~~~~VIE~l~~~s~iL~~l 59 (330) T protein:vir:94 1 MVRICTPPLRGRWRTLTHQFPELKMPTVT------LAESA----KL-----------SQDHLVSGLIETIVEVNPLYEMM 59 (330) T ss_pred CceecCCccccceeehhccccccchhhhh------hhHHh----hc-----------CchhhHHHHHHhhhccchHHhhc Confidence 110000000000 0001111 000000 00111 01 11223345566555555566666 Q ss_pred cccc-CCCcceeeEEEeeeecccceEEee---cccC-CceeeeeeeeeeeeEEEEEEEEEeCHHHHHHHHHhCC--CHHH Q lcl|Aclame:pro 77 GESK-KGDWTTLVAAFITAEPTTTVATYG---DYSS-DGDSGTNINYPQRQSYFFQTWTRWGERELEMAGAGRV--DLAS 149 (336) Q Consensus 77 ~v~t-~g~w~~~t~~~~v~e~~G~a~~yg---d~~D-iP~vd~~~~~~~~~v~~~~~~~~y~~~El~~A~~~g~--~l~~ 149 (336) |-+. ++. ...|.....-+.+.... -+.. .| ......+-....++... .-+-+-|...|- ++.. T Consensus 60 pf~~ve~~----~~~~~r~~~lp~a~~r~~n~~~~~~~~---~Tf~q~t~~l~~l~~~~---~Vd~~iadl~g~~~d~~~ 129 (330) T protein:vir:94 60 PFTEIEGN----ALAYNRENVLGDVQFLAVGGTITAKNP---ATFTKVTSELTTLIGDA---EVNGLIQATRSDFMDQTS 129 (330) T ss_pred ccccccCC----cceeeeeecCCcceeeeccccccccCc---ceeeeeeechhhhhhhH---HHHHHHHHhcCCHHHHHH Confidence 6432 221 23343322223332221 1111 12 11111111222233222 223333344453 3333 Q ss_pred HHHHHHHHHHHHhhccEEEeeccc-cceEEEEecCCCCcccccccc-cccccCHHHHHHHHHHHHHHHHHHhCCceeccC Q lcl|Aclame:pro 150 ELNYSSALGLAKFLNGSYLFGVAG-LENYGLINDPSLSAPITATTP-WSGSPAVEAVVNEVVTLFQVLQTQSQGIITQEA 227 (336) Q Consensus 150 ~K~~aAr~a~e~~~n~i~~~Gd~~-~g~~GllN~Pnl~~~~~~~t~-w~~~~T~~eI~~Di~~l~~~l~~~t~g~v~~~~ 227 (336) ....+-.+++.+.+..-.++||+. .++.||++ ++.......+. --+.-| ++|+.+|+..+++.-+ . T Consensus 130 ~q~~~~ieal~~~~e~~linGDs~~~~F~GL~~--~~~~~q~i~tg~~gg~~T----~d~LDeLl~~v~~~~g------~ 197 (330) T protein:vir:94 130 VQVASKAKSIGRQYQASMITGDGTGNSFQGMMG--LVAASQTISAGANGGTLT----FELLDQLLDLVKDKDG------Q 197 (330) T ss_pred HHHHHHHHHHHHHHHHHhhccCCCCccccchhh--cCCcccEEecCCCCCCCC----HHHHHHHHHHhcCCCC------C Confidence 334455667888888888999865 67779987 34322221110 012334 4677778777755321 4 Q ss_pred CcEEEecHHHHHhccc--C--CCCC---ccHHHHHH--HhCCccEEEEc---ccccC---CCCceEEEEEEeeCCC--ce Q lcl|Aclame:pro 228 VLHMGLPPTAMSDLSK--T--NQYG---LSAAAKLK--EIFPKLEFVTI---PEYDT---ASGRLVQLWAPRVEGK--DT 290 (336) Q Consensus 228 p~tL~Lp~~~~~~Ls~--~--~~~~---~Tvl~~l~--~n~pnl~i~~~---pel~~---a~G~~~~~~~~~~~~~--~~ 290 (336) |+.|+|+......+.. + +.++ .++..+=+ ..|-.+-|... |.=.+ ++|... +|+-+.... +. T Consensus 198 ~~~~l~n~a~~r~I~a~~R~~~~~~v~~~~~~~~G~~v~~~~GvPi~~~d~ip~~~~~~~~~~tts-Iyav~~G~~~~~q 276 (330) T protein:vir:94 198 VDYLMSSFAMRRKYFSLLRALGGAAIGEVMTLPSGRQIPTYRGVPWFVNDFIPSNMTQGTATNATA-IFAGTFDDGSNKY 276 (330) T ss_pred CcEEEechhHHHHHHHHHHhccCCCCCCcccccCCCEEeeeCCeEEEecccccCCCCcccCCCcee-EEEEeeccccccc Confidence 7789988776665532 1 1112 11111000 11223333322 22111 233333 333333211 11 Q ss_pred EEEEeCch------hhccc-ceecC-CceEEeeecceeeeEEecccceeeeccC Q lcl|Aclame:pro 291 ATCGFTEK------MRAHS-IERYS-SYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) Q Consensus 291 ~~~~~p~~------~~~l~-~~~~~-~~~~v~~~~rt~Gv~ir~P~ai~~~~GI 336 (336) +-+.+.++ .|..+ .+.+. .+|.+ ...-|+.+.-|.|++.+.|| T Consensus 277 gV~Gl~~~g~~glsVr~~G~~~~k~v~~~~v---~~y~~~av~~~~a~~~L~~V 327 (330) T protein:vir:94 277 GIAGLTARGSAGLRVQNVGAKENADETITRV---KMYCGFANFSQLGLAAIKGL 327 (330) T ss_pred ceEeecCCCCCcceeeeCCCccccceeeEEE---EEeeeeEEechhheeeeccc Confidence 22233221 23333 11111 33444 34678999999999999999 No 121 >protein:vir:96978 Length: 387 # NCBI annotation: ORF009 # Family: family:all:658 # MgeID: mge:1643 # MgeName: 42e # Cross-refs: genbank:acc:YP_239859;genbank:gi:66395517;genbank:GeneID:5133011 Probab=76.01 E-value=0.14 Score=25.26 Aligned_cols=295 Identities=8% Similarity=-0.010 Sum_probs=124.7 Q ss_pred CchHHHHH----HHh---h-----cce--eccchhhhhh-------------hhhhhhhhhhhhhcCcc--ccCCcc--h Q lcl|Aclame:pro 1 MRDAQRIQ----NLA---R-----AGV--ILPRSVKNVS-------------TPLAEYAMDAADLSPHL--SSTGSS--G 49 (336) Q Consensus 1 m~~~~~~~----~l~---~-----~g~--~~~~~~~~~~-------------~~~~~~~~da~d~~~~l--~t~~~~--~ 49 (336) ...++.++ .++ + .+- .-+....... .........+......+ .+.+++ . T Consensus 50 ~~~~~~l~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~~~~~a~~~~~~~~gG~l 129 (387) T protein:vir:96 50 QQRFNIVERQVQDIEEKEKAKVKDKGEAYQSLSDNEKMVKAKAEFYRHAILPNEFEKPSMEAQRLLHALPTGNDSGGDKL 129 (387) T ss_pred HHHHHHHHHHHHHHHHHHHhhhhhccccCCCCchhHHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHhhhccCCCCCCcee Confidence 11111110 000 0 000 0000000000 00000011111111111 222222 3 Q ss_pred HHHHHHHhhCceeeeeeccccchhhhcccccCCCcceeeEEEee-eecccceEEeecccCCceeeeeeeeeeeeEEEEEE Q lcl|Aclame:pro 50 IPNYLTTYVDPSVIDILVAPMKAAELVGESKKGDWTTLVAAFIT-AEPTTTVATYGDYSSDGDSGTNINYPQRQSYFFQT 128 (336) Q Consensus 50 i~~~l~~~idp~v~~~~~~~~~~~~l~~v~t~g~w~~~t~~~~v-~e~~G~a~~ygd~~DiP~vd~~~~~~~~~v~~~~~ 128 (336) ||..+.+ +|++.+...-....+..+.+.+.. .++. ....+.+...+.....|-.+.......-..+.++. T Consensus 130 IP~~~~~----~Ii~~~~~~~~l~~~~~~~~~~~~-----~~p~~~~~~~~a~~v~Eg~~~~~~~~~f~~v~l~~~k~~~ 200 (387) T protein:vir:96 130 LPKTLSK----EIVSEPFAKNQLREKARLTNIKGL-----EIPRVSYTLDDDDFITDVETAKELKAKGDTVKFTTNKFKV 200 (387) T ss_pred echhHHH----HHHHHHHhhchhhhhceeeecCCc-----eeeeeeccCCccccccccccccccccccceeeechheeee Confidence 6776653 445444444444566555554432 2222 23345667778887888888888888888888888 Q ss_pred EEEeCHHHHHHHHHhCCCHHHHHHHHHHHHHHHhhccEE-EeeccccceEEEEecCCCCcccccccccccccCHHHHHHH Q lcl|Aclame:pro 129 WTRWGERELEMAGAGRVDLASELNYSSALGLAKFLNGSY-LFGVAGLENYGLINDPSLSAPITATTPWSGSPAVEAVVNE 207 (336) Q Consensus 129 ~~~y~~~El~~A~~~g~~l~~~K~~aAr~a~e~~~n~i~-~~Gd~~~g~~GllN~Pnl~~~~~~~t~w~~~~T~~eI~~D 207 (336) .+.+|.+=|. ....++.+--....++++...+++.+ ..|++...-.|.++++.+... +.+..++| T Consensus 201 ~i~iS~ell~---ds~~~l~~~i~~~la~~~~~~e~~~~~~~g~g~g~~~g~~~~~~~~~~-----------~~~~~~d~ 266 (387) T protein:vir:96 201 FAAISDTVIH---GSDVDLVNWVENALQSGLAAKERKDALAVSPKSGLEHMSFYNGSVKEV-----------EGADMYDA 266 (387) T ss_pred echhhHHHHh---hhHHHHHHHHHHHHHHHHHHHHHHhHhhcCCCccccceeeeccccccc-----------cccchHHH Confidence 8888855343 23455666666666666666655543 345544445677776544321 11223667 Q ss_pred HHHHHHHHHHHhCCceeccCCcEEEecH-HHHHhcccCCCCCccHHHHHHHhCCccEEEEcccccCCCCceEEEEEEeeC Q lcl|Aclame:pro 208 VVTLFQVLQTQSQGIITQEAVLHMGLPP-TAMSDLSKTNQYGLSAAAKLKEIFPKLEFVTIPEYDTASGRLVQLWAPRVE 286 (336) Q Consensus 208 i~~l~~~l~~~t~g~v~~~~p~tL~Lp~-~~~~~Ls~~~~~~~Tvl~~l~~n~pnl~i~~~pel~~a~G~~~~~~~~~~~ 286 (336) |.+++.+|...-. .+ -..+|-+ .....+....+.|-.++. .-|+ ++-..|-.-+.+ ....+|-+ . T Consensus 267 i~~~~~~l~~~y~----~n--a~~imn~~t~~~~~~~~~~~~~~~~~----~~~~-~llG~PV~~~~~-~~~~~~GD-f- 332 (387) T protein:vir:96 267 IINALADLHEDYR----DN--ATIYMRYADYVKIISVLSNGTTNFFD----TPAE-KVFGKPVVFTDA-AVKPIVGD-F- 332 (387) T ss_pred HHHHHhccChhhh----cC--CEEEEechHHHHHHHHHhcCCCcccc----cCCc-cccccceEEecC-CCceeeec-h- Confidence 7777776644321 11 1355543 444444433333332221 1121 111112111111 00111110 0 Q ss_pred CCceEEEEeCchhhccc-ceecCCceEEeeecceeeeEEecccceeeeccC Q lcl|Aclame:pro 287 GKDTATCGFTEKMRAHS-IERYSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) Q Consensus 287 ~~~~~~~~~p~~~~~l~-~~~~~~~~~v~~~~rt~Gv~ir~P~ai~~~~GI 336 (336) . -..+.+ ..+...+ -+.......+-+..|..|.+ ++|-||+.+.-= T Consensus 333 -~-~~~~~~-~~~~~~~~~~~~~~~~~~~~~~r~Dg~v-~~~~A~~~l~~k 379 (387) T protein:vir:96 333 -N-YFGINY-DGTTYDTDKDVKKGEYLFVLTAWYDQQR-TLDSAFRIAKAK 379 (387) T ss_pred -h-hhhhhh-hhhhheecccccCCceEEEEEEEeCcEe-echhheEEEEee Confidence 0 000000 0011100 01112344555566766655 469998875432 No 122 >protein:vir:2685 Length: 387 # NCBI annotation: hypothetical protein # Family: family:all:658 # MgeID: mge:57 # MgeName: phiSLT # Cross-refs: genbank:acc:NP_075504;genbank:gi:12719433;genbank:GeneID:920169 Probab=76.01 E-value=0.14 Score=25.26 Aligned_cols=295 Identities=8% Similarity=-0.010 Sum_probs=124.7 Q ss_pred CchHHHHH----HHh---h-----cce--eccchhhhhh-------------hhhhhhhhhhhhhcCcc--ccCCcc--h Q lcl|Aclame:pro 1 MRDAQRIQ----NLA---R-----AGV--ILPRSVKNVS-------------TPLAEYAMDAADLSPHL--SSTGSS--G 49 (336) Q Consensus 1 m~~~~~~~----~l~---~-----~g~--~~~~~~~~~~-------------~~~~~~~~da~d~~~~l--~t~~~~--~ 49 (336) ...++.++ .++ + .+- .-+....... .........+......+ .+.+++ . T Consensus 50 ~~~~~~l~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~~~~~a~~~~~~~~gG~l 129 (387) T protein:vir:26 50 QQRFNIVERQVQDIEEKEKAKVKDKGEAYQSLSDNEKMVKAKAEFYRHAILPNEFEKPSMEAQRLLHALPTGNDSGGDKL 129 (387) T ss_pred HHHHHHHHHHHHHHHHHHHhhhhhccccCCCCchhHHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHhhhccCCCCCCcee Confidence 11111110 000 0 000 0000000000 00000011111111111 222222 3 Q ss_pred HHHHHHHhhCceeeeeeccccchhhhcccccCCCcceeeEEEee-eecccceEEeecccCCceeeeeeeeeeeeEEEEEE Q lcl|Aclame:pro 50 IPNYLTTYVDPSVIDILVAPMKAAELVGESKKGDWTTLVAAFIT-AEPTTTVATYGDYSSDGDSGTNINYPQRQSYFFQT 128 (336) Q Consensus 50 i~~~l~~~idp~v~~~~~~~~~~~~l~~v~t~g~w~~~t~~~~v-~e~~G~a~~ygd~~DiP~vd~~~~~~~~~v~~~~~ 128 (336) ||..+.+ +|++.+...-....+..+.+.+.. .++. ....+.+...+.....|-.+.......-..+.++. T Consensus 130 IP~~~~~----~Ii~~~~~~~~l~~~~~~~~~~~~-----~~p~~~~~~~~a~~v~Eg~~~~~~~~~f~~v~l~~~k~~~ 200 (387) T protein:vir:26 130 LPKTLSK----EIVSEPFAKNQLREKARLTNIKGL-----EIPRVSYTLDDDDFITDVETAKELKAKGDTVKFTTNKFKV 200 (387) T ss_pred echhHHH----HHHHHHHhhchhhhhceeeecCCc-----eeeeeeccCCccccccccccccccccccceeeechheeee Confidence 6776653 445444444444566555554432 2222 23345667778887888888888888888888888 Q ss_pred EEEeCHHHHHHHHHhCCCHHHHHHHHHHHHHHHhhccEE-EeeccccceEEEEecCCCCcccccccccccccCHHHHHHH Q lcl|Aclame:pro 129 WTRWGERELEMAGAGRVDLASELNYSSALGLAKFLNGSY-LFGVAGLENYGLINDPSLSAPITATTPWSGSPAVEAVVNE 207 (336) Q Consensus 129 ~~~y~~~El~~A~~~g~~l~~~K~~aAr~a~e~~~n~i~-~~Gd~~~g~~GllN~Pnl~~~~~~~t~w~~~~T~~eI~~D 207 (336) .+.+|.+=|. ....++.+--....++++...+++.+ ..|++...-.|.++++.+... +.+..++| T Consensus 201 ~i~iS~ell~---ds~~~l~~~i~~~la~~~~~~e~~~~~~~g~g~g~~~g~~~~~~~~~~-----------~~~~~~d~ 266 (387) T protein:vir:26 201 FAAISDTVIH---GSDVDLVNWVENALQSGLAAKERKDALAVSPKSGLEHMSFYNGSVKEV-----------EGADMYDA 266 (387) T ss_pred echhhHHHHh---hhHHHHHHHHHHHHHHHHHHHHHHhHhhcCCCccccceeeeccccccc-----------cccchHHH Confidence 8888855343 23455666666666666666655543 345544445677776544321 11223667 Q ss_pred HHHHHHHHHHHhCCceeccCCcEEEecH-HHHHhcccCCCCCccHHHHHHHhCCccEEEEcccccCCCCceEEEEEEeeC Q lcl|Aclame:pro 208 VVTLFQVLQTQSQGIITQEAVLHMGLPP-TAMSDLSKTNQYGLSAAAKLKEIFPKLEFVTIPEYDTASGRLVQLWAPRVE 286 (336) Q Consensus 208 i~~l~~~l~~~t~g~v~~~~p~tL~Lp~-~~~~~Ls~~~~~~~Tvl~~l~~n~pnl~i~~~pel~~a~G~~~~~~~~~~~ 286 (336) |.+++.+|...-. .+ -..+|-+ .....+....+.|-.++. .-|+ ++-..|-.-+.+ ....+|-+ . T Consensus 267 i~~~~~~l~~~y~----~n--a~~imn~~t~~~~~~~~~~~~~~~~~----~~~~-~llG~PV~~~~~-~~~~~~GD-f- 332 (387) T protein:vir:26 267 IINALADLHEDYR----DN--ATIYMRYADYVKIISVLSNGTTNFFD----TPAE-KVFGKPVVFTDA-AVKPIVGD-F- 332 (387) T ss_pred HHHHHhccChhhh----cC--CEEEEechHHHHHHHHHhcCCCcccc----cCCc-cccccceEEecC-CCceeeec-h- Confidence 7777776644321 11 1355543 444444433333332221 1121 111112111111 00111110 0 Q ss_pred CCceEEEEeCchhhccc-ceecCCceEEeeecceeeeEEecccceeeeccC Q lcl|Aclame:pro 287 GKDTATCGFTEKMRAHS-IERYSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) Q Consensus 287 ~~~~~~~~~p~~~~~l~-~~~~~~~~~v~~~~rt~Gv~ir~P~ai~~~~GI 336 (336) . -..+.+ ..+...+ -+.......+-+..|..|.+ ++|-||+.+.-= T Consensus 333 -~-~~~~~~-~~~~~~~~~~~~~~~~~~~~~~r~Dg~v-~~~~A~~~l~~k 379 (387) T protein:vir:26 333 -N-YFGINY-DGTTYDTDKDVKKGEYLFVLTAWYDQQR-TLDSAFRIAKAK 379 (387) T ss_pred -h-hhhhhh-hhhhheecccccCCceEEEEEEEeCcEe-echhheEEEEee Confidence 0 000000 0011100 01112344555566766655 469998875432 No 123 >protein:vir:94424 Length: 387 # NCBI annotation: ORF010 # Family: family:all:658 # MgeID: mge:1506 # MgeName: 47 # Cross-refs: genbank:acc:YP_240005;genbank:gi:66395666;genbank:GeneID:5133084 Probab=76.01 E-value=0.14 Score=25.26 Aligned_cols=295 Identities=8% Similarity=-0.010 Sum_probs=124.7 Q ss_pred CchHHHHH----HHh---h-----cce--eccchhhhhh-------------hhhhhhhhhhhhhcCcc--ccCCcc--h Q lcl|Aclame:pro 1 MRDAQRIQ----NLA---R-----AGV--ILPRSVKNVS-------------TPLAEYAMDAADLSPHL--SSTGSS--G 49 (336) Q Consensus 1 m~~~~~~~----~l~---~-----~g~--~~~~~~~~~~-------------~~~~~~~~da~d~~~~l--~t~~~~--~ 49 (336) ...++.++ .++ + .+- .-+....... .........+......+ .+.+++ . T Consensus 50 ~~~~~~l~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~~~~~a~~~~~~~~gG~l 129 (387) T protein:vir:94 50 QQRFNIVERQVQDIEEKEKAKVKDKGEAYQSLSDNEKMVKAKAEFYRHAILPNEFEKPSMEAQRLLHALPTGNDSGGDKL 129 (387) T ss_pred HHHHHHHHHHHHHHHHHHHhhhhhccccCCCCchhHHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHhhhccCCCCCCcee Confidence 11111110 000 0 000 0000000000 00000011111111111 222222 3 Q ss_pred HHHHHHHhhCceeeeeeccccchhhhcccccCCCcceeeEEEee-eecccceEEeecccCCceeeeeeeeeeeeEEEEEE Q lcl|Aclame:pro 50 IPNYLTTYVDPSVIDILVAPMKAAELVGESKKGDWTTLVAAFIT-AEPTTTVATYGDYSSDGDSGTNINYPQRQSYFFQT 128 (336) Q Consensus 50 i~~~l~~~idp~v~~~~~~~~~~~~l~~v~t~g~w~~~t~~~~v-~e~~G~a~~ygd~~DiP~vd~~~~~~~~~v~~~~~ 128 (336) ||..+.+ +|++.+...-....+..+.+.+.. .++. ....+.+...+.....|-.+.......-..+.++. T Consensus 130 IP~~~~~----~Ii~~~~~~~~l~~~~~~~~~~~~-----~~p~~~~~~~~a~~v~Eg~~~~~~~~~f~~v~l~~~k~~~ 200 (387) T protein:vir:94 130 LPKTLSK----EIVSEPFAKNQLREKARLTNIKGL-----EIPRVSYTLDDDDFITDVETAKELKAKGDTVKFTTNKFKV 200 (387) T ss_pred echhHHH----HHHHHHHhhchhhhhceeeecCCc-----eeeeeeccCCccccccccccccccccccceeeechheeee Confidence 6776653 445444444444566555554432 2222 23345667778887888888888888888888888 Q ss_pred EEEeCHHHHHHHHHhCCCHHHHHHHHHHHHHHHhhccEE-EeeccccceEEEEecCCCCcccccccccccccCHHHHHHH Q lcl|Aclame:pro 129 WTRWGERELEMAGAGRVDLASELNYSSALGLAKFLNGSY-LFGVAGLENYGLINDPSLSAPITATTPWSGSPAVEAVVNE 207 (336) Q Consensus 129 ~~~y~~~El~~A~~~g~~l~~~K~~aAr~a~e~~~n~i~-~~Gd~~~g~~GllN~Pnl~~~~~~~t~w~~~~T~~eI~~D 207 (336) .+.+|.+=|. ....++.+--....++++...+++.+ ..|++...-.|.++++.+... +.+..++| T Consensus 201 ~i~iS~ell~---ds~~~l~~~i~~~la~~~~~~e~~~~~~~g~g~g~~~g~~~~~~~~~~-----------~~~~~~d~ 266 (387) T protein:vir:94 201 FAAISDTVIH---GSDVDLVNWVENALQSGLAAKERKDALAVSPKSGLEHMSFYNGSVKEV-----------EGADMYDA 266 (387) T ss_pred echhhHHHHh---hhHHHHHHHHHHHHHHHHHHHHHHhHhhcCCCccccceeeeccccccc-----------cccchHHH Confidence 8888855343 23455666666666666666655543 345544445677776544321 11223667 Q ss_pred HHHHHHHHHHHhCCceeccCCcEEEecH-HHHHhcccCCCCCccHHHHHHHhCCccEEEEcccccCCCCceEEEEEEeeC Q lcl|Aclame:pro 208 VVTLFQVLQTQSQGIITQEAVLHMGLPP-TAMSDLSKTNQYGLSAAAKLKEIFPKLEFVTIPEYDTASGRLVQLWAPRVE 286 (336) Q Consensus 208 i~~l~~~l~~~t~g~v~~~~p~tL~Lp~-~~~~~Ls~~~~~~~Tvl~~l~~n~pnl~i~~~pel~~a~G~~~~~~~~~~~ 286 (336) |.+++.+|...-. .+ -..+|-+ .....+....+.|-.++. .-|+ ++-..|-.-+.+ ....+|-+ . T Consensus 267 i~~~~~~l~~~y~----~n--a~~imn~~t~~~~~~~~~~~~~~~~~----~~~~-~llG~PV~~~~~-~~~~~~GD-f- 332 (387) T protein:vir:94 267 IINALADLHEDYR----DN--ATIYMRYADYVKIISVLSNGTTNFFD----TPAE-KVFGKPVVFTDA-AVKPIVGD-F- 332 (387) T ss_pred HHHHHhccChhhh----cC--CEEEEechHHHHHHHHHhcCCCcccc----cCCc-cccccceEEecC-CCceeeec-h- Confidence 7777776644321 11 1355543 444444433333332221 1121 111112111111 00111110 0 Q ss_pred CCceEEEEeCchhhccc-ceecCCceEEeeecceeeeEEecccceeeeccC Q lcl|Aclame:pro 287 GKDTATCGFTEKMRAHS-IERYSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) Q Consensus 287 ~~~~~~~~~p~~~~~l~-~~~~~~~~~v~~~~rt~Gv~ir~P~ai~~~~GI 336 (336) . -..+.+ ..+...+ -+.......+-+..|..|.+ ++|-||+.+.-= T Consensus 333 -~-~~~~~~-~~~~~~~~~~~~~~~~~~~~~~r~Dg~v-~~~~A~~~l~~k 379 (387) T protein:vir:94 333 -N-YFGINY-DGTTYDTDKDVKKGEYLFVLTAWYDQQR-TLDSAFRIAKAK 379 (387) T ss_pred -h-hhhhhh-hhhhheecccccCCceEEEEEEEeCcEe-echhheEEEEee Confidence 0 000000 0011100 01112344555566766655 469998875432 No 124 >protein:vir:78350 Length: 383 # NCBI annotation: Cps # Family: family:all:635 # MgeID: mge:1850 # MgeName: B025 # Cross-refs: genbank:acc:YP_001468644;genbank:gi:157325222;genbank:GeneID:5601696 Probab=74.45 E-value=0.16 Score=24.98 Aligned_cols=304 Identities=11% Similarity=0.032 Sum_probs=122.6 Q ss_pred Cch-----HHHHHHHhhcceeccchhhhhhhhhhhhhhhhhhhcCccccCCcchHHHHHHHhhCceeeeeeccccchhhh Q lcl|Aclame:pro 1 MRD-----AQRIQNLARAGVILPRSVKNVSTPLAEYAMDAADLSPHLSSTGSSGIPNYLTTYVDPSVIDILVAPMKAAEL 75 (336) Q Consensus 1 m~~-----~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~da~d~~~~l~t~~~~~i~~~l~~~idp~v~~~~~~~~~~~~l 75 (336) .++ ++..++-.+.-+........++.+-+.... +.-.+ -.+...-.+|..+.+-| ++.+...-....+ T Consensus 44 ~~~~~~~~~~~~~~~~~~~~~~~~g~~~lt~~e~~~~~-~~~~~--~~~~gg~lvP~~~~~~I----~~~l~~~s~l~~~ 116 (383) T protein:vir:78 44 AADIMEQAKKEARQEADAYISASRTDKNITNEEIKFFN-DINKE--VGYKEETLLPQTVVDEI----FEDLTTEHPFLAS 116 (383) T ss_pred HHHHHHHHHHHHHHHHHHHHHhcCChhhhhHHHHHHHH-HHhcc--CCCCCccccCHHHHHHH----HHHHHhhccceee Confidence 001 111111101000001111222222222111 11010 01112233566555433 3322222122223 Q ss_pred cccccCCCcceeeEEEeeeecccceEEeecccCCc-eeeeeeeeeeeeEEEEEEEEEeCHHHHHHHHHhCCCHHHHHHHH Q lcl|Aclame:pro 76 VGESKKGDWTTLVAAFITAEPTTTVATYGDYSSDG-DSGTNINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASELNYS 154 (336) Q Consensus 76 ~~v~t~g~w~~~t~~~~v~e~~G~a~~ygd~~DiP-~vd~~~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~K~~a 154 (336) ..+.+.+. ...++..+..+.+...+-...++ ..+.......-..+.+..-...+.+=|. -..+++.+--... T Consensus 117 ~~v~~~~~----~~~i~~~~~~~~a~w~~e~~~~~~~~~~~f~~i~l~~~kl~~~i~is~ell~---Ds~~~ie~~i~~~ 189 (383) T protein:vir:78 117 IGMRTTGL----RTKFLKSETSGVAVWGKIFGEIKGQLDATFSDEESIQNKLTAFVVVPKDLEK---FGPAWVKRFVVTQ 189 (383) T ss_pred eeeEecCC----ceEEEEEcCCcceEEeecccccccccCcceeeEeecceeeEeeccchHHHhh---ccHHHHHHHHHHH Confidence 33332222 13566777777777766555553 4455566666677777766666644333 3345788888999 Q ss_pred HHHHHHHhhccEEEeeccccceEEEEecCCCCcccccc-cccccccCHHHHHHHHHHHHHHHHHHhCC-ceecc------ Q lcl|Aclame:pro 155 SALGLAKFLNGSYLFGVAGLENYGLINDPSLSAPITAT-TPWSGSPAVEAVVNEVVTLFQVLQTQSQG-IITQE------ 226 (336) Q Consensus 155 Ar~a~e~~~n~i~~~Gd~~~g~~GllN~Pnl~~~~~~~-t~w~~~~T~~eI~~Di~~l~~~l~~~t~g-~v~~~------ 226 (336) ..+++.+.+++-.+.|++..+-.|++++.+.....+.. .+.+. ++..--.+|+..++..+...... ....+ T Consensus 190 l~~~~a~~~~~a~i~G~G~~qP~Gil~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~ 268 (383) T protein:vir:78 190 IEEAFAVALESAYIVGDGNDKPIGLNRKVGKGSTVVDGVYAEKA-ATGTLTFANPKTTVNELTDVYKYHSVKENGHPLNV 268 (383) T ss_pred HHHHHHHHHhhheEeccCCCCceeeeeccCCccccccccccccc-ccchhhhhhhHHHHHHHHHHHhccchhcccchhhh Confidence 99999999999999999988999999975432222211 11221 11111223333333322211110 00001 Q ss_pred -CCcEEEecHHH-HHhccc---CCCCCc--cHHHHHHHhCCccEEEEccccc---CCCCce-EEEEEEeeCCCceEEEEe Q lcl|Aclame:pro 227 -AVLHMGLPPTA-MSDLSK---TNQYGL--SAAAKLKEIFPKLEFVTIPEYD---TASGRL-VQLWAPRVEGKDTATCGF 295 (336) Q Consensus 227 -~p~tL~Lp~~~-~~~Ls~---~~~~~~--Tvl~~l~~n~pnl~i~~~pel~---~a~G~~-~~~~~~~~~~~~~~~~~~ 295 (336) ...+.++-+.. +..+.. -+..|. +++- | .++|+..+... -.-|.- .+++.++ .+ .++.. T Consensus 269 ~~~~~~~~n~~~~~~~~~~~~~~~~~G~~~t~l~-----~-~~~iv~s~~~p~~~iifgdfs~Y~i~~r-~~---~~i~~ 338 (383) T protein:vir:78 269 AGKVTLLVNPTDAWDVKKQYTSLNANGVYVTALP-----F-NLNIIESLFVPEKKAISYVAERYDALIG-GP---LDIGT 338 (383) T ss_pred cCceEEEEcCcchhhhccchhccCCCCceeeecC-----C-CceEEecCCCCcccEEEeeccceEEEec-cc---ceEEe Confidence 11233333321 111111 111121 1110 1 23333322111 111211 1333322 11 11111 Q ss_pred CchhhcccceecCCceEEeeecceeeeEEecccceeeeccC Q lcl|Aclame:pro 296 TEKMRAHSIERYSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) Q Consensus 296 p~~~~~l~~~~~~~~~~v~~~~rt~Gv~ir~P~ai~~~~GI 336 (336) .....+ ..-....-...|..| .++.|.|++.++ | T Consensus 339 ~~~~~f-----~~d~~~f~~~~r~dG-~~~~~~A~~vl~-~ 372 (383) T protein:vir:78 339 YDQTLA-----IEDLNLYAAKQFAYG-KAKDDKAAAVWT-L 372 (383) T ss_pred cchhhh-----hcCceEEEEEEEEcC-EEecCCeEEEEE-E Confidence 111110 111233444556655 566778766654 4 No 125 >protein:vir:3845 Length: 395 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:322 # MgeName: phi adh # Cross-refs: genbank:acc:NP_050151;swissprot:trembl:q9t1f6;genbank:gi:9633043;uniprot:Q9T1F6;genbank:GeneID:1262163 Probab=72.50 E-value=0.18 Score=24.64 Aligned_cols=295 Identities=12% Similarity=-0.031 Sum_probs=127.0 Q ss_pred CchHHHH----HHH---hhcce----------e-ccchh--hhhhhhhhhhhhhhhhhcCccccCCcch--HHHHHHHhh Q lcl|Aclame:pro 1 MRDAQRI----QNL---ARAGV----------I-LPRSV--KNVSTPLAEYAMDAADLSPHLSSTGSSG--IPNYLTTYV 58 (336) Q Consensus 1 m~~~~~~----~~l---~~~g~----------~-~~~~~--~~~~~~~~~~~~da~d~~~~l~t~~~~~--i~~~l~~~i 58 (336) ..+.+.. ++. ++..+ . .+... ..+........... ..... ++.+++| +|..+. T Consensus 50 i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~-~~~~~gg~~vP~~~~--- 124 (395) T protein:vir:38 50 LKNAKMAQELAKSAYEDARANLNAEPVNKKPLPVKDGKPDAQAMKNQFVKDFKNL-VTSGT-TGTGNAGLTIPEDIQ--- 124 (395) T ss_pred HHHHHHHHHHHHHHHHHHHhhhhhccccccccchhhhhHHHHHHHHHHHHHHHHH-Hhhcc-CccCCCceecchhHh--- Confidence 0000000 000 00000 0 00000 00000111111111 11111 2233333 566554 Q ss_pred CceeeeeeccccchhhhcccccCCCcceeeEEEeee-ecccceEEeecccCCcee-eeeeeeeeeeEEEEEEEEEeCHHH Q lcl|Aclame:pro 59 DPSVIDILVAPMKAAELVGESKKGDWTTLVAAFITA-EPTTTVATYGDYSSDGDS-GTNINYPQRQSYFFQTWTRWGERE 136 (336) Q Consensus 59 dp~v~~~~~~~~~~~~l~~v~t~g~w~~~t~~~~v~-e~~G~a~~ygd~~DiP~v-d~~~~~~~~~v~~~~~~~~y~~~E 136 (336) ++|++.....-....+..+.....-. ..+.+... +..+.+...+....+|-. +........+.+.++..+.+|.+= T Consensus 125 -~~ii~~~~~~~~l~~~~~~~~~~~~~-~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~f~~v~~~~~k~~~~~~iS~el 202 (395) T protein:vir:38 125 -LQIRTLTRSFTSLESLANVENVTTSH-GSRVYEKLADITPLKDLDDESALIGDNDDPELTVVKYLIHRYAGITTVTNTL 202 (395) T ss_pred -hHHHHHHHhhcchhhhcceeeccCCc-ceEEEEeeccCCccccccccccccccccccceeeEEeeeeeeEeehhhHHHH Confidence 34555555555555554432221111 12233332 333445566777777744 467777788888888888888653 Q ss_pred HHHHHHhCCCHHHHHHHHHHHHHHHhhccEEEeeccccceEEEEecCCCCcccccccccccccCHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 137 LEMAGAGRVDLASELNYSSALGLAKFLNGSYLFGVAGLENYGLINDPSLSAPITATTPWSGSPAVEAVVNEVVTLFQVLQ 216 (336) Q Consensus 137 l~~A~~~g~~l~~~K~~aAr~a~e~~~n~i~~~Gd~~~g~~GllN~Pnl~~~~~~~t~w~~~~T~~eI~~Di~~l~~~l~ 216 (336) ++. ...+|.+--......++.+.+++-.++|++.... .+ + ..+.+ ||.++++... T Consensus 203 l~d---s~~~l~~~i~~~la~~~~~~~~~~il~g~g~~~~--------~~---~-------~~~~~----~i~~~~~~~l 257 (395) T protein:vir:38 203 LKD---TVDNIIQWLVNWAAKKDVVTRNAKILEVMGKAPK--------KP---T-------ISQFD----NIKDLENNTL 257 (395) T ss_pred Hhh---hHHHHHHHHHHHHHHHHHHHHHHHHhhccccccc--------cc---c-------cccHH----HHHHHHHHhh Confidence 332 3456777778888888888888888888764321 00 0 12223 4444443221 Q ss_pred HHhCCceeccCCcEEEecHHHHHhccc-CCCCCccHHHH-HHHhCCc----cEEEEcc--cccCCCCceEEEEEEeeCCC Q lcl|Aclame:pro 217 TQSQGIITQEAVLHMGLPPTAMSDLSK-TNQYGLSAAAK-LKEIFPK----LEFVTIP--EYDTASGRLVQLWAPRVEGK 288 (336) Q Consensus 217 ~~t~g~v~~~~p~tL~Lp~~~~~~Ls~-~~~~~~Tvl~~-l~~n~pn----l~i~~~p--el~~a~G~~~~~~~~~~~~~ 288 (336) ... . .....++|.+..+..|.+ .+..|.-++.- +....|+ ..+.... .+..+++....++.+-. T Consensus 258 ~~~---~--~~~a~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~l~G~pV~~~~~~~~~~~~~~~~i~~gd~~--- 329 (395) T protein:vir:38 258 DPA---I--ESTSSFITNQSGYNILSKVKDADGRYLMQPDVTSPDKYLIDGKPVIRIADKWLPDVSGSHPLYFGDLK--- 329 (395) T ss_pred hhh---h--cCCCEEEEcHHHHHHHHHhhccCCceeeccCcCCCCcceeccceeEEecccccCcCCCcceEEEEecc--- Confidence 111 0 123468999999988854 34334333211 1111111 1111111 12223333333333211 Q ss_pred ceEEEEe--Cchhhccc---ceecCCceEEeeecceeeeEEecccceeeeccC Q lcl|Aclame:pro 289 DTATCGF--TEKMRAHS---IERYSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) Q Consensus 289 ~~~~~~~--p~~~~~l~---~~~~~~~~~v~~~~rt~Gv~ir~P~ai~~~~GI 336 (336) +...+.. ...+.... .....-.+.+-+..|.+| .+.+|.||+.++.- T Consensus 330 ~~~~i~~~~~~~i~~~~~~~~~~~~~~~~~r~~~r~d~-~~~~~~a~~~~~~~ 381 (395) T protein:vir:38 330 QGITLFDRQQMQIDTTNVGAGSFEHDTTKLRFIDRFDV-QLIDDGAFAAASFK 381 (395) T ss_pred ccEEEEEecceEEEEeccccchhhcCceEEEEEEeecc-EEecccceEEEEee Confidence 1011100 00111111 001112344556666665 55669999999977 No 126 >protein:vir:97255 Length: 310 # NCBI annotation: hypothetical protein ORF017 # Family: family:all:1120 # MgeID: mge:1657 # MgeName: M6 # Cross-refs: genbank:acc:YP_001294525;genbank:gi:149408246;genbank:GeneID:5237120 Probab=71.80 E-value=0.19 Score=24.53 Aligned_cols=267 Identities=14% Similarity=0.138 Sum_probs=118.7 Q ss_pred ccchhhhhhhhhhhhhhhhhhhcCccccCCcchHHHHHHHhhCceeeeeeccccchhhhccccc-CCCcceeeEEEeee- Q lcl|Aclame:pro 17 LPRSVKNVSTPLAEYAMDAADLSPHLSSTGSSGIPNYLTTYVDPSVIDILVAPMKAAELVGESK-KGDWTTLVAAFITA- 94 (336) Q Consensus 17 ~~~~~~~~~~~~~~~~~da~d~~~~l~t~~~~~i~~~l~~~idp~v~~~~~~~~~~~~l~~v~t-~g~w~~~t~~~~v~- 94 (336) +|. |+ .+ +|+. ++ +.-+..+|+|..-..-...+.+|=+. +| .++.|.-. T Consensus 1 mpa----lt--La----ea~k----~~-----------~d~l~~~ViE~~~~~s~lL~~LpF~~veg----~~~~ynR~~ 51 (310) T protein:vir:97 1 MAS----VT--LA----ESAK----LA-----------QDELVAGVIENIITVNRMFDVLPFDSIEG----NSLAYNREN 51 (310) T ss_pred Ccc----cc--hH----HHhh----cC-----------cchHHHHHHHHHhccchHHHhCCcccccC----CcceeeEee Confidence 220 10 00 1110 00 00111233333333333445554321 22 12333322 Q ss_pred ecccce--EEeecccCCceeeeeeeeeeeeEEEEEEEEEeCHHHHHHH--HHh-CCCHHH--HHHHHHHHHHHHhhccEE Q lcl|Aclame:pro 95 EPTTTV--ATYGDYSSDGDSGTNINYPQRQSYFFQTWTRWGERELEMA--GAG-RVDLAS--ELNYSSALGLAKFLNGSY 167 (336) Q Consensus 95 e~~G~a--~~ygd~~DiP~vd~~~~~~~~~v~~~~~~~~y~~~El~~A--~~~-g~~l~~--~K~~aAr~a~e~~~n~i~ 167 (336) +..|.+ .+.-.+++.|.......+ ....++...--+..|+-+. ... +-+.+. ..-....+++.+...... T Consensus 52 ~~~~~~~~~v~~~~~~~g~~~~~~t~---~~~~~~L~i~~g~~~Vd~~i~dl~~~~~~dq~~~Ql~~~iea~~~~~e~~l 128 (310) T protein:vir:97 52 VLGDVIMAGVGTTFSGAGAGKAAATF---TKVNSNLTTIMGDAEVNGLIQATRSGDGNDQTAVQIASKAKSAGRKYQDQL 128 (310) T ss_pred ccCCcccccccccccCCCcccccccc---ceeeeeeeeeeehhhhhhHHHhhhcCChHHHHHHHHHHHHHHHHHHHHHHh Confidence 222222 122223333333333333 2344555566666666643 322 323233 333445667778888888 Q ss_pred Eeecc-ccceEEEEecCCCCcccccccccccccCHHHHHHHHHHHHHHHHHHhCCceeccCCcEEEecHHH---HHhccc Q lcl|Aclame:pro 168 LFGVA-GLENYGLINDPSLSAPITATTPWSGSPAVEAVVNEVVTLFQVLQTQSQGIITQEAVLHMGLPPTA---MSDLSK 243 (336) Q Consensus 168 ~~Gd~-~~g~~GllN~Pnl~~~~~~~t~w~~~~T~~eI~~Di~~l~~~l~~~t~g~v~~~~p~tL~Lp~~~---~~~Ls~ 243 (336) ++||. ...++||+..-.-...+.+.+. -+.-|+ +|+.+|+..+++.-+ .|..|++.|.. +..+.+ T Consensus 129 INGD~a~n~F~GL~~~~~~~q~i~~~~~-gg~~t~----d~LDeLl~~v~~~~g------~p~~~l~~~~~~r~i~A~~R 197 (310) T protein:vir:97 129 INGNGAGNEFAGLIQLCASGQKATTGAT-GSAISF----AILDELMDLVVDKDG------QVDYLTMHARTLRSYKALLR 197 (310) T ss_pred hccccCCCcccchhhcCCccceeecCCC-CCCCCH----HHHHHHHHHHhcCCC------CCCEEEecHHHHHHHHHHHH Confidence 89987 5677899984211111111111 123354 678888888865432 47789999965 333332 Q ss_pred C-----------CCCCccHHHHHHHhCCccEEEEccccc------CCCCceEEEEEEeeCCCce---EEEEeC------c Q lcl|Aclame:pro 244 T-----------NQYGLSAAAKLKEIFPKLEFVTIPEYD------TASGRLVQLWAPRVEGKDT---ATCGFT------E 297 (336) Q Consensus 244 ~-----------~~~~~Tvl~~l~~n~pnl~i~~~pel~------~a~G~~~~~~~~~~~~~~~---~~~~~p------~ 297 (336) . +.+|.-|+ .|-++-|...-... +++|. .-+|+-+.. .+. +-+.++ . T Consensus 198 ~~~~~g~~~~~~~~~G~~v~-----~~~GiPi~~~d~ip~~~~~~~~~gt-TsIya~r~G-e~~~~~Gv~Gl~~~~~~gl 270 (310) T protein:vir:97 198 ALGGASINEVVELPSGAEVP-----AYSGTPIFRNDYIPTNQTKGGTTGC-TTIFAGTLD-DGSRTHGIAGLTATQAAGI 270 (310) T ss_pred HhcCCCCCCccccCCCCEEe-----eeCCeEEEEeCccCCCccccccCCc-eeEEEEeeC-ccccccceeccccCCccce Confidence 1 11222221 23344443332221 12333 333333332 221 111111 1 Q ss_pred hhhccc-ceecC-CceEEeeecceeeeEEecccceeeeccC Q lcl|Aclame:pro 298 KMRAHS-IERYS-SYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) Q Consensus 298 ~~~~l~-~~~~~-~~~~v~~~~rt~Gv~ir~P~ai~~~~GI 336 (336) ..|.++ .+.+. .+|.|. .+-|+.+.-|.|++.+.|| T Consensus 271 sVr~~G~~~~~~v~~~~V~---~Y~~~av~~~~A~a~L~~V 308 (310) T protein:vir:97 271 QVVDVGESEDSDEHIWRVK---WYCGLALFSEKGLACADGI 308 (310) T ss_pred eEEeCCcccCCcceeEEEE---EeeeEEEecccceeeeccc Confidence 234444 22222 345553 3689999999999999999 No 127 >protein:vir:739 Length: 231 # NCBI annotation: major structural protein 4 # Family: family:all:522 # MgeID: mge:14 # MgeName: Tuc2009 # Cross-refs: genbank:acc:NP_108716;genbank:gi:13487838;genbank:GeneID:920884 Probab=67.85 E-value=0.25 Score=23.92 Aligned_cols=219 Identities=10% Similarity=0.039 Sum_probs=108.7 Q ss_pred ccCCCcceeeEEEeeeecccceEEeecccCCceeeeeeeeeeeeEEEEEEEEEeCHHHHHHHHHhCCCHHHHHHHHHHHH Q lcl|Aclame:pro 79 SKKGDWTTLVAAFITAEPTTTVATYGDYSSDGDSGTNINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASELNYSSALG 158 (336) Q Consensus 79 ~t~g~w~~~t~~~~v~e~~G~a~~ygd~~DiP~vd~~~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~K~~aAr~a 158 (336) .. |=-.-.|++|+.+ .|.|..+++++.+|...........+|.+.+-+++++..+... ..|=++ .+-.....++ T Consensus 1 ~~-~~~~Gdtit~P~~--iGda~~v~eG~~i~~~~l~~t~~~atIk~~gk~~~itD~a~l~--~~gDp~-~ea~~Q~~~~ 74 (231) T protein:vir:73 1 EN-GINLANLCEYPND--IGDAADVAEGGEISLDKIGTTTKSVTIKKAAKGTEITDEAALS--GYGDPI-GESNKQLGLS 74 (231) T ss_pred Cc-cccCCceEEeccc--ccchhhhcCCCcCChhhccccceeeeEeeeccceeeeHHHHhh--ccCchH-HHHHHHHHHH Confidence 11 1111267889865 8999999999999999999999999999988888887665544 355443 4444455556 Q ss_pred HHHhhccEEEeeccccceEEEEecCCCCcccccccccccccCHHHHHHHHHHHHHHHHHHhCCceeccCCcEEEecHHHH Q lcl|Aclame:pro 159 LAKFLNGSYLFGVAGLENYGLINDPSLSAPITATTPWSGSPAVEAVVNEVVTLFQVLQTQSQGIITQEAVLHMGLPPTAM 238 (336) Q Consensus 159 ~e~~~n~i~~~Gd~~~g~~GllN~Pnl~~~~~~~t~w~~~~T~~eI~~Di~~l~~~l~~~t~g~v~~~~p~tL~Lp~~~~ 238 (336) +..++|+-++ +-+. .+.| ..+++. =++.|++++..+-.. ...+..+++.|..+ T Consensus 75 iA~kvD~di~---------~~~~----------~a~l-~~~~~~-t~d~i~~A~~~fgde------~~~~~vivv~p~~~ 127 (231) T protein:vir:73 75 LANKVDDDLL---------KAAK----------TTSQ-TVSTKA-NVDGVQAALDIFNDE------DAQAYVLIVNPKDA 127 (231) T ss_pred HHHhhhHHHH---------Hhhc----------cccc-cccccc-cHHHHHHHHHHhccc------cccceEEEEcchHH Confidence 6555554211 0000 0111 111111 244455555544221 23577899999988 Q ss_pred HhcccCCCCCccHHHHHHH----h-----CCccEEEEcccccCCCCceEEEEEEeeCCCceEEEEeCchhhcccceecCC Q lcl|Aclame:pro 239 SDLSKTNQYGLSAAAKLKE----I-----FPKLEFVTIPEYDTASGRLVQLWAPRVEGKDTATCGFTEKMRAHSIERYSS 309 (336) Q Consensus 239 ~~Ls~~~~~~~Tvl~~l~~----n-----~pnl~i~~~pel~~a~G~~~~~~~~~~~~~~~~~~~~p~~~~~l~~~~~~~ 309 (336) ..|-+--....+ -..... | +-+++|...+.+....+.... |+. .+.-+.+..-...+. ..+...+ T Consensus 128 ~~Lrk~~~~~~~-~~~~g~~i~~~G~iG~i~G~~Vi~S~~~~~~~~~~~~-~i~---~~gAl~~~~k~~~~v-EtdRd~~ 201 (231) T protein:vir:73 128 AKIRKDANAKNI-GSEVGANALINGTYADVLGAQIVRSKKLAEGSALMFK-IVS---NSPALKLVLKRGVQV-ETDRDIV 201 (231) T ss_pred Hhhhhccchhhh-hhhhccceeeecccceEcceEEEEcCCCCCCceeeee-EEe---eccceeeeeccccee-ecccccc Confidence 887541111000 000000 0 123455544444322221111 111 111111111111110 1111111 Q ss_pred c-eEEeeecceeeeEEecccceeee--ccC Q lcl|Aclame:pro 310 Y-FRQKKSAGTWGAVIFRPFAVAQM--IGV 336 (336) Q Consensus 310 ~-~~v~~~~rt~Gv~ir~P~ai~~~--~GI 336 (336) . -..=.....+||-++.|..++.+ .|+ T Consensus 202 ~k~~~i~~~~~y~v~l~~~~~vv~~t~~g~ 231 (231) T protein:vir:73 202 TKTTVITADEHYAAYLYDLTKVVNITFTGV 231 (231) T ss_pred ccccEEEEeEEEEEEEEcCccEEEEEeecC Confidence 1 11111224579999999988876 688 No 128 >protein:vir:99888 Length: 309 # NCBI annotation: capsid protein # Family: family:all:908 # MgeID: mge:1480 # MgeName: B3 # Cross-refs: genbank:acc:YP_164075;genbank:gi:56692607;genbank:GeneID:3192616 Probab=60.04 E-value=0.38 Score=22.89 Aligned_cols=265 Identities=11% Similarity=0.029 Sum_probs=121.0 Q ss_pred hhhcCccccCCcchHHHHHHHhhCceeeeeeccccchhhhcccccCCCcceeeEEEeeeecccceE-EeecccCCceeee Q lcl|Aclame:pro 36 ADLSPHLSSTGSSGIPNYLTTYVDPSVIDILVAPMKAAELVGESKKGDWTTLVAAFITAEPTTTVA-TYGDYSSDGDSGT 114 (336) Q Consensus 36 ~d~~~~l~t~~~~~i~~~l~~~idp~v~~~~~~~~~~~~l~~v~t~g~w~~~t~~~~v~e~~G~a~-~ygd~~DiP~vd~ 114 (336) +-..| -..++.+.++-+.|=++ ++-+++|||...++--.-...+|+-.|.-=... ..+=.++.-.++. T Consensus 1 ~~~~~---~~~dp~LT~~A~gy~n~--------~~Ia~~l~P~vpV~~~~~~~~~f~~~e~F~~~~t~r~~~~~~~~v~~ 69 (309) T protein:vir:99 1 MSNAP---FPIDPELTAIAIAYRNG--------RMISDEVLPRVPVGKQEFKFWKYDLAQGFTVPETLVGRKSKPNEVEF 69 (309) T ss_pred CCCCC---cCcCHhHHHHHhhccCh--------hhhhhhcCCccccCccccceeeechhhcccccchhhccCCCcceEee Confidence 11111 11233344555555433 356788999876654333444443222110000 0011233345555 Q ss_pred eeeeeeeeEEEEEEEEEeCHHHHHHHHHhCCCHHHHHHHHHHHHH----HHhhccEEEeeccccceEEEEecCCCC-ccc Q lcl|Aclame:pro 115 NINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASELNYSSALGL----AKFLNGSYLFGVAGLENYGLINDPSLS-API 189 (336) Q Consensus 115 ~~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~K~~aAr~a~----e~~~n~i~~~Gd~~~g~~GllN~Pnl~-~~~ 189 (336) ........+...+.-+.+...|...|. .++++.++....++..+ |...-++++--. |.|+=. ... T Consensus 70 ~~~~~~~~~~~~~L~~~i~~~~~~~a~-~~~d~~~~Av~~l~~~i~l~rE~~~A~lv~~~a---------~y~~~~k~~L 139 (309) T protein:vir:99 70 SATDETGSTEDHGLDAPVPQADIDNAP-TNYNPLGHATEQTTNLILLDREARTSKLVFSPN---------SYAAGNKTTL 139 (309) T ss_pred cccCceeeecccceeecCCchhhhhcc-CCCCHHHHHHHHHHHHHHHHHHHHHHHHhcChh---------hcCCCceEEe Confidence 555555555555565666666666553 35776666665554433 333333322111 212111 112 Q ss_pred ccccccccccCHHHHHHHHHHHHHHHHHHhCCceeccCCcEEEecHHHHHhccc---------CCC--CCccHHHHHHHh Q lcl|Aclame:pro 190 TATTPWSGSPAVEAVVNEVVTLFQVLQTQSQGIITQEAVLHMGLPPTAMSDLSK---------TNQ--YGLSAAAKLKEI 258 (336) Q Consensus 190 ~~~t~w~~~~T~~eI~~Di~~l~~~l~~~t~g~v~~~~p~tL~Lp~~~~~~Ls~---------~~~--~~~Tvl~~l~~n 258 (336) +++.+|.+ .+.| ++.||.+....+ | -.|++++|....+.+|.+ .+. .+.=-.++|++- T Consensus 140 sgt~~wsd-~~SD-Pi~~i~~~~~~~-----g----~~PN~~vlg~~~~~~l~~hp~i~~~ik~~~~~~g~it~~~la~l 208 (309) T protein:vir:99 140 SGADQWSD-PTSN-PLPVITDALDSV-----I----LRPNIGVLGRRTATILRRHPKIVKAYNGSLGDEGMVPMAFLQEL 208 (309) T ss_pred cCccccCC-CCCC-cHHHHHHHHHhh-----C----CCcceEEechHHHHHHhhCHHHHHHhcCCCccccccCHHHHHHH Confidence 33445554 5555 899999887664 3 269999999999998853 111 122225677765 Q ss_pred CCcc-EEEE-cccccC----CC-------CceEEEEEEeeCCCceEEEEeCc-hhhcccceecCCceEEeeecceeeeEE Q lcl|Aclame:pro 259 FPKL-EFVT-IPEYDT----AS-------GRLVQLWAPRVEGKDTATCGFTE-KMRAHSIERYSSYFRQKKSAGTWGAVI 324 (336) Q Consensus 259 ~pnl-~i~~-~pel~~----a~-------G~~~~~~~~~~~~~~~~~~~~p~-~~~~l~~~~~~~~~~v~~~~rt~Gv~i 324 (336) |- + +|.- -.-+.+ .+ |+.+.|.+.... .++.+ -|. -|.+-=-......|..|....-||-.| T Consensus 209 ~~-ve~V~vg~a~~n~a~~g~~~~~~~iwg~~~~L~y~~~~-~~~~~--~ps~G~t~~~~~r~~g~~~d~~~~~~g~~~v 284 (309) T protein:vir:99 209 LE-LDAIYIGEARLNIARPGQNPNLIRAWGPHASFIYRDRL-ADTRN--GTTFGLTAQWGDRVSGSIADPNIGLRGGQRV 284 (309) T ss_pred hC-cceEEeecceeeccccccccccccccCCcEEEEEcCCC-CCCcc--cccccceeecccccCCceeeeeeccCCceEE Confidence 43 3 2221 011111 11 333333332211 22211 111 000000012233566676666665444 Q ss_pred e-----cccceeeeccC Q lcl|Aclame:pro 325 F-----RPFAVAQMIGV 336 (336) Q Consensus 325 r-----~P~ai~~~~GI 336 (336) | .|.-++.-.|- T Consensus 285 r~~~~~k~~i~~~d~G~ 301 (309) T protein:vir:99 285 RVGESVKELVTAPDLGF 301 (309) T ss_pred EEeccccchhcchhcch Confidence 4 55555555565 No 129 >protein:vir:6324 Length: 335 # NCBI annotation: capsid protein # Family: family:all:2806 # MgeID: mge:132 # MgeName: phiKMV # Cross-refs: genbank:acc:NP_877471;genbank:gi:33300843;uniprot:Q7Y2D3;genbank:GeneID:1482613 Probab=59.67 E-value=0.39 Score=22.84 Aligned_cols=274 Identities=14% Similarity=0.106 Sum_probs=109.2 Q ss_pred cccCCcchHHHH----------HHHhhCceeeeeeccccchhhhcccccCCCcceeeEEEeeeecccceEEeec--c--- Q lcl|Aclame:pro 42 LSSTGSSGIPNY----------LTTYVDPSVIDILVAPMKAAELVGESKKGDWTTLVAAFITAEPTTTVATYGD--Y--- 106 (336) Q Consensus 42 l~t~~~~~i~~~----------l~~~idp~v~~~~~~~~~~~~l~~v~t~g~w~~~t~~~~v~e~~G~a~~ygd--~--- 106 (336) |++.++.+.|.+ +-.|- -+|.+.....-+.+.++.+.+.- ...++.|+. .|+.+..+- + T Consensus 1 ms~~~~~tr~~~~~s~~d~al~le~f~-geV~~af~~~s~~~~~~~~rti~--~g~s~~~~~---iG~~~~~~~~pG~~l 74 (335) T protein:vir:63 1 MSFLNDLTRPNYAGKNADVDIHLEEHL-GIVDKHFAYTSKFAPLMNIRDLR--GSNVVRLDR---LGNVEAKGRRAGEEL 74 (335) T ss_pred CCCcccchhhhcccccchhheehhhhh-hhHHHHHHhhhhhccccceeeec--cceeEEEee---eeeeeeecccCCcCc Confidence 333333222221 11111 11111111122233444443321 135566654 587776632 2 Q ss_pred cCCceeeeeeeeee--eeEEEEEEEEEeCHHHHHHHHHhCCCHHHHHHHHHHHHHHHhhccEEE------eec-cccceE Q lcl|Aclame:pro 107 SSDGDSGTNINYPQ--RQSYFFQTWTRWGERELEMAGAGRVDLASELNYSSALGLAKFLNGSYL------FGV-AGLENY 177 (336) Q Consensus 107 ~DiP~vd~~~~~~~--~~v~~~~~~~~y~~~El~~A~~~g~~l~~~K~~aAr~a~e~~~n~i~~------~Gd-~~~g~~ 177 (336) +..|... ++.. ..-..+.-.+=|.++|.+ +..++-++-....-.++.++.|+.++ .+. +..... T Consensus 75 ~~~~~~~---~k~~itVD~ll~a~~~I~dlDe~~----~~yDvRse~s~e~G~aLA~~~D~~~~~~i~~aa~~~a~~~~~ 147 (335) T protein:vir:63 75 ERSRVVN---DKWNLTVDTLLYLRHQFDHQDEWT----QSFDMRKEVAELDGQELARKFDQACLIQVIKAAAMDAPVDLE 147 (335) T ss_pred CCCCccc---cceEEEecceeechhhhhhHHHHh----cCchhHHHHHHHHHHHHHHHHHHHHHHHHHhhccccCccccC Confidence 2223211 1211 111223333334444433 23344444444444455555554332 111 122223 Q ss_pred EEEecCCCCcccccccccccccCHHHHHHHHHHHHHHHHHHhCCceec-cCCcEEEecHHHHHhcccC-----CCCCc-- Q lcl|Aclame:pro 178 GLINDPSLSAPITATTPWSGSPAVEAVVNEVVTLFQVLQTQSQGIITQ-EAVLHMGLPPTAMSDLSKT-----NQYGL-- 249 (336) Q Consensus 178 GllN~Pnl~~~~~~~t~w~~~~T~~eI~~Di~~l~~~l~~~t~g~v~~-~~p~tL~Lp~~~~~~Ls~~-----~~~~~-- 249 (336) |.++ |+....+..++. -+.+.++.+.+=+..+..++..+- +-+. .++-..+++|.+|..|-.- ++++. T Consensus 148 ~~~~-~G~~~~~~~tg~-~~~~~~~~l~~a~~~a~~~L~e~d--VP~~~~~dr~~vv~P~~y~~Ll~~~~l~n~~~~~s~ 223 (335) T protein:vir:63 148 DAFS-PGVLEKLDLTGL-TAKQAADKIVRMHRRVVETFIDRD--LGDAVYSEGLTPMSPRVFSLLLEHDKLMNVEYQATG 223 (335) T ss_pred CCcC-CCcceeeeeccC-cccccHHHHHHHHHHHHHHHHhcc--CCCcccCceEEEeChHHHHHHhcccccccccccccc Confidence 3333 233222222222 223458888777777777775442 1011 1336899999999988542 12221 Q ss_pred cHHHHHHHh---CCccEEEEcccccCCCC--------------ce---EEEEEEeeCCCceEEEEeCchhhcccceecCC Q lcl|Aclame:pro 250 SAAAKLKEI---FPKLEFVTIPEYDTASG--------------RL---VQLWAPRVEGKDTATCGFTEKMRAHSIERYSS 309 (336) Q Consensus 250 Tvl~~l~~n---~pnl~i~~~pel~~a~G--------------~~---~~~~~~~~~~~~~~~~~~p~~~~~l~~~~~~~ 309 (336) +.-.+.+.. .-+++|+..+.|-+.++ +- +.++... .---+++. .+...+.. -+.+.- T Consensus 224 ~~~~~~~g~v~~v~Gv~V~~sn~lP~~~~t~~~lg~a~n~~~~d~~~~~~~~~~~-~Al~t~~~-~~vt~e~~-~~~~~~ 300 (335) T protein:vir:63 224 ATNDYVKSRVAILNGVKVLETPRFATKAIAAHPLGRHFNVSAEESERQIALFLPS-KTLITAQV-APVQAKLW-EDNEKF 300 (335) T ss_pred ccccccCceeEEeeceEEEeeccCCCCCcccccccccCCccccccceeEEEEEec-ceEEEEEE-eeccccee-eccchh Confidence 111122111 12356666666632211 10 1111110 00001110 11111110 011222 Q ss_pred ceEEeeecceeeeEEecc--cceeeeccC Q lcl|Aclame:pro 310 YFRQKKSAGTWGAVIFRP--FAVAQMIGV 336 (336) Q Consensus 310 ~~~v~~~~rt~Gv~ir~P--~ai~~~~GI 336 (336) .|.+.+... .|+-++|| .++....|| T Consensus 301 ~~~i~~~~a-~G~g~lRPe~a~~i~~tg~ 328 (335) T protein:vir:63 301 SWVLDTFQM-YNIGARRPDTAGAIELKGI 328 (335) T ss_pred hHHhHHHHH-cCCcccccceEEEEEEcCC Confidence 344444443 79999999 456678899 No 130 >protein:vir:95376 Length: 425 # NCBI annotation: phage major capsid protein # Family: family:all:635 # MgeID: mge:1567 # MgeName: GBSV1 # Cross-refs: genbank:acc:YP_764476;genbank:gi:115334630;genbank:GeneID:5179263 Probab=55.54 E-value=0.48 Score=22.35 Aligned_cols=302 Identities=12% Similarity=0.093 Sum_probs=128.1 Q ss_pred Cch-HHHHH-HHhhc--------------ce---eccchh----hhhh-hh--hhhhhhhhhhhcCccccCCcc--hHHH Q lcl|Aclame:pro 1 MRD-AQRIQ-NLARA--------------GV---ILPRSV----KNVS-TP--LAEYAMDAADLSPHLSSTGSS--GIPN 52 (336) Q Consensus 1 m~~-~~~~~-~l~~~--------------g~---~~~~~~----~~~~-~~--~~~~~~da~d~~~~l~t~~~~--~i~~ 52 (336) +++ ..++. +|++. |. .+.... ..+. .. .+.............++++++ .+|. T Consensus 73 le~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~~vP~ 152 (425) T protein:vir:95 73 LEGEIAQLEDELEQINSKQPSNQSRQKMQGSKGDVVEMNRLQVREMLKTGEYYKRSEVVEFYEKFRNLRAVAGGELTIPE 152 (425) T ss_pred HHHHHHHHHHHHHHhhhhccchhhhhhhhhhhhhHHHHHHHHHHHHHhhhhhhhhhHHHHHHHHHHhhcccccCceeccH Confidence 110 00000 00000 00 000000 0000 00 000000000000112222232 3566 Q ss_pred HHHHhhCceeeeeeccccchhhhcccccCCCcceeeEEEeeeecccceEEeecccCCceeee-eeeeeeeeEEEEEEEEE Q lcl|Aclame:pro 53 YLTTYVDPSVIDILVAPMKAAELVGESKKGDWTTLVAAFITAEPTTTVATYGDYSSDGDSGT-NINYPQRQSYFFQTWTR 131 (336) Q Consensus 53 ~l~~~idp~v~~~~~~~~~~~~l~~v~t~g~w~~~t~~~~v~e~~G~a~~ygd~~DiP~vd~-~~~~~~~~v~~~~~~~~ 131 (336) .+.+- |++.+-.......++.+.... ....+++....+.+...+.+..+|..+. ......-..+.++..+. T Consensus 153 ~~~~~----Ii~~l~~~~~i~~~~~~~~~~----g~~~ip~~~~~~~a~~v~E~~~~~~~~~~~f~~i~l~~~k~~~~~~ 224 (425) T protein:vir:95 153 VVVNR----IMDIMGDYTTLYPLVDKIRVK----GTTRILVDTDTSPATWIEQSGALPTGDVGTIASIDFDGFKVGKVTF 224 (425) T ss_pred HHHHH----HHHHHHhhhhHHHhhceeecC----ceeEEEEecCCccccccccccccccccccccceeeeeheeeeeeeh Confidence 55543 333332222233333332211 1346677777788888888888888776 46777788888888888 Q ss_pred eCHHHHHHHHHhCCCHHHHHHHHHHHHHHHhhccEEEeeccc--cceEEEEecCCCCcccccccccccccCHHHHHHHHH Q lcl|Aclame:pro 132 WGERELEMAGAGRVDLASELNYSSALGLAKFLNGSYLFGVAG--LENYGLINDPSLSAPITATTPWSGSPAVEAVVNEVV 209 (336) Q Consensus 132 y~~~El~~A~~~g~~l~~~K~~aAr~a~e~~~n~i~~~Gd~~--~g~~GllN~Pnl~~~~~~~t~w~~~~T~~eI~~Di~ 209 (336) +|.+=|..+. .++.+--....+.++.+.+++-.++|++. ..-.|++++ ++.... .+...++. .++|+. T Consensus 225 iS~ell~ds~---~~l~~~i~~~l~~~i~~~~d~~il~G~G~~~~~p~Gil~~--~~~~~~-~~~~~~~~----~~~~~~ 294 (425) T protein:vir:95 225 VDNYLLQDSI---INLDDYVTKKIARAIAKALDLAIVKGTGAANKQPLGIIPS--LPPENQ-VTVEADNN----LLKNLV 294 (425) T ss_pred hhHHHHhccH---HHHHHHHHHHHHHHHHHHHHHHhhccCCCCccccceeecc--cccccc-cccccccc----hHHHHH Confidence 8876554433 36888888888899999999999999864 345799985 222111 11111222 345666 Q ss_pred HHHHHHHHHhCCceeccCCcEEEecHH-HHHhcc----cCCCCCccHHHHHHHhC--C---ccEEEEcccccCC---CCc Q lcl|Aclame:pro 210 TLFQVLQTQSQGIITQEAVLHMGLPPT-AMSDLS----KTNQYGLSAAAKLKEIF--P---KLEFVTIPEYDTA---SGR 276 (336) Q Consensus 210 ~l~~~l~~~t~g~v~~~~p~tL~Lp~~-~~~~Ls----~~~~~~~Tvl~~l~~n~--p---nl~i~~~pel~~a---~G~ 276 (336) +++..+..... .. .....+|.+. .+..|. ..+..|.-++. ..+. | +..++....+... -|. T Consensus 295 ~~~~~~~~~~~---~~-~~~~~v~~~~~~~~~l~~l~~~kd~~g~~i~~--~~~~~~~~l~G~pvv~~~~~~~~~i~~Gd 368 (425) T protein:vir:95 295 KQIGLIDTGDD---SV-GEIVAVMKRSTYYNRLVEFSIQVDSNGNVVGK--LPNLRTPDLLGLRVVFNNFLDDDTVLFGE 368 (425) T ss_pred HHHHhhhhhcc---cc-CceEEEEeChHHHHHHHHHHhhcCCCCceeec--cCCCCCccccceeeEEcCcCCCccEEEEe Confidence 66655433221 11 1123445544 343332 12333332211 0111 1 1122211111100 021 Q ss_pred eEEEEEEeeCCCceEEEEeCchhhcccceecCCceEEeeecceeeeEEecccceeeeccC Q lcl|Aclame:pro 277 LVQLWAPRVEGKDTATCGFTEKMRAHSIERYSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) Q Consensus 277 ~~~~~~~~~~~~~~~~~~~p~~~~~l~~~~~~~~~~v~~~~rt~Gv~ir~P~ai~~~~GI 336 (336) -.+.++-.+.+ ..+.+... .....-...+-+..|.. +.+++|-||++++ | T Consensus 369 ~~~~~~~~~~~---~~i~~~~~-----~~f~~~~~~~~~~~r~d-~~~~~~~a~~~~~-i 418 (425) T protein:vir:95 369 FEQYTLVEREN---ITIDSSTH-----VKFTEDQTAFRGKGRFD-GKPVKPEAFVLVT-I 418 (425) T ss_pred cccEEEEeecc---eEEEeecc-----cccccCceEEEEEEeeC-cEeecccceEEEE-e Confidence 11211111111 11111110 00011123334444554 4677888888874 4 No 131 >protein:vir:101291 Length: 381 # NCBI annotation: hypothetical protein # Family: family:all:635 # MgeID: mge:1591 # MgeName: phiNM3 # Cross-refs: genbank:acc:YP_908831;genbank:gi:118725095;genbank:GeneID:4555862 Probab=55.27 E-value=0.48 Score=22.31 Aligned_cols=298 Identities=12% Similarity=0.070 Sum_probs=128.6 Q ss_pred Cc-----hHHHHHH--HhhcceeccchhhhhhhhhhhhhhhhhhhcCccccCCcch--HHHHHHHhhCceeeeeeccccc Q lcl|Aclame:pro 1 MR-----DAQRIQN--LARAGVILPRSVKNVSTPLAEYAMDAADLSPHLSSTGSSG--IPNYLTTYVDPSVIDILVAPMK 71 (336) Q Consensus 1 m~-----~~~~~~~--l~~~g~~~~~~~~~~~~~~~~~~~da~d~~~~l~t~~~~~--i~~~l~~~idp~v~~~~~~~~~ 71 (336) .+ ..++++. +.+.| ...++.+-+.... +.. -.+.+++| +|..+.+ +|++.+...-. T Consensus 41 ~~~~~~~~~~e~~~~~~~~~~------~~~lt~~e~~~~~-~~~----~~~~~~gg~lvP~~~~~----~I~~~l~~~s~ 105 (381) T protein:vir:10 41 FEETKLQAKAEAERVSSLPKS------AQSLSANQRSFFM-DIN----KNVNYKEEKLLPEETID----RIFEDLTTNHP 105 (381) T ss_pred hhhHHHHHHHHHHHHHHhccC------cccccHHHHHHHH-HHh----cccCCCCceecCHHHHH----HHHHHHHhhcc Confidence 00 0000000 00011 0111111111100 000 01122222 5665554 33443333222 Q ss_pred hhhhcccccCCCcceeeEEEeeeecccceEEeecccCCc-eeeeeeeeeeeeEEEEEEEEEeCHHHHHHHHHhCCCHHHH Q lcl|Aclame:pro 72 AAELVGESKKGDWTTLVAAFITAEPTTTVATYGDYSSDG-DSGTNINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASE 150 (336) Q Consensus 72 ~~~l~~v~t~g~w~~~t~~~~v~e~~G~a~~ygd~~DiP-~vd~~~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~ 150 (336) .+.+..+.+.+. ...+...+..+.|...+....++ -.+.......-..+.+.....+|.+=|. ...+++.+- T Consensus 106 i~~~~~v~~~~~----~~~i~~~~~~~~a~w~~e~~~~~~~~~~~f~~i~l~~~kl~~~~~is~elL~---Ds~~~ie~~ 178 (381) T protein:vir:10 106 LLADLGIKNAGL----RLKFLKSETSGVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLND---FGPAWIERF 178 (381) T ss_pred ceeheeeEecCc----ceEEEEecCCcceeeecccccccccccccceeeeecceeEEeechhhHHHhh---cCHHHHHHH Confidence 333333333232 13456667777777776655553 4455566666677777766676644333 244578888 Q ss_pred HHHHHHHHHHHhhccEEEeeccccceEEEEecCCCCccccccc-ccc------cccCHHHHHHHHHHHHHHHHHHhCCce Q lcl|Aclame:pro 151 LNYSSALGLAKFLNGSYLFGVAGLENYGLINDPSLSAPITATT-PWS------GSPAVEAVVNEVVTLFQVLQTQSQGII 223 (336) Q Consensus 151 K~~aAr~a~e~~~n~i~~~Gd~~~g~~GllN~Pnl~~~~~~~t-~w~------~~~T~~eI~~Di~~l~~~l~~~t~g~v 223 (336) -......++.+.+++-++.|++..+-.|+++++......+..+ ++. ...++.-.++.+..++..+-..-.+.. T Consensus 179 i~~~la~~~a~~~~~a~i~G~G~~qP~Gil~~~~~~~~~~~g~~~~~~~~~t~t~~~~~~~~~~l~~~~~~~~~~~~~~~ 258 (381) T protein:vir:10 179 VRVQIEEAFAVALETAFLKGTGKDQPIGLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKS 258 (381) T ss_pred HHHHHHHHHHHHhhheeEeccCCCCceeeeeccCcccccccccccccccccccccccchhhHHHHHHHHHhhcccccccc Confidence 8888889999999999999999999999999765432222111 110 111223334555555555433222211 Q ss_pred ec-cCCcEEEecHHHHHhcc----cCCCCCccHHHHHHHhCCccEEEEcccccC---CCCce-EEEEEEeeCCCceEEEE Q lcl|Aclame:pro 224 TQ-EAVLHMGLPPTAMSDLS----KTNQYGLSAAAKLKEIFPKLEFVTIPEYDT---ASGRL-VQLWAPRVEGKDTATCG 294 (336) Q Consensus 224 ~~-~~p~tL~Lp~~~~~~Ls----~~~~~~~Tvl~~l~~n~pnl~i~~~pel~~---a~G~~-~~~~~~~~~~~~~~~~~ 294 (336) .. ..--+++|-+.-+..|- ..++.|.-+... -| +++|+..+.... .-|.- .+.+.++ .+ T Consensus 259 ~~~~~~a~~~mn~~t~~~l~~~~~~~~~~G~~v~~l---~~-g~~vv~s~~~p~~~iifgDfs~Y~i~~r-~~------- 326 (381) T protein:vir:10 259 VAVKGNVTMVVNPSDAFEVQAQYTHLNANGVYVTAL---PF-NLNVIESTVQEAGKVLTYVKGLYDGYLA-GG------- 326 (381) T ss_pred ccccCceEEEEccccHHhhccccccCCCCCceeecC---CC-CceEEecCCCCcCcEEEEecccEEEEEe-cc------- Confidence 00 01124566655443332 112323211000 01 244433222110 01211 1233222 11 Q ss_pred eCchhhcccc-eecCCceEEeeecceeeeEEecccceeeeccC Q lcl|Aclame:pro 295 FTEKMRAHSI-ERYSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) Q Consensus 295 ~p~~~~~l~~-~~~~~~~~v~~~~rt~Gv~ir~P~ai~~~~GI 336 (336) +.+..+.. ....-....-...|..|. ++.|.|++.++ | T Consensus 327 --~~i~~~~~~~~~~d~~~f~a~~r~dg~-~~~~~A~~v~~-l 365 (381) T protein:vir:10 327 --INVQKFKETLALDDMDLYTAKQFAYGK-AKDNKVAAVWK-L 365 (381) T ss_pred --cEEEeechhHhhcCCeEEEEEEEEcCE-EecCceEEEEE-E Confidence 11111110 001112334455565554 46777776654 4 No 132 >protein:vir:9509 Length: 381 # NCBI annotation: hypothetical protein # Family: family:all:635 # MgeID: mge:170 # MgeName: phiN315 # Cross-refs: genbank:acc:NP_835556;genbank:gi:30043951;genbank:GeneID:1260537 Probab=55.27 E-value=0.48 Score=22.31 Aligned_cols=298 Identities=12% Similarity=0.070 Sum_probs=128.6 Q ss_pred Cc-----hHHHHHH--HhhcceeccchhhhhhhhhhhhhhhhhhhcCccccCCcch--HHHHHHHhhCceeeeeeccccc Q lcl|Aclame:pro 1 MR-----DAQRIQN--LARAGVILPRSVKNVSTPLAEYAMDAADLSPHLSSTGSSG--IPNYLTTYVDPSVIDILVAPMK 71 (336) Q Consensus 1 m~-----~~~~~~~--l~~~g~~~~~~~~~~~~~~~~~~~da~d~~~~l~t~~~~~--i~~~l~~~idp~v~~~~~~~~~ 71 (336) .+ ..++++. +.+.| ...++.+-+.... +.. -.+.+++| +|..+.+ +|++.+...-. T Consensus 41 ~~~~~~~~~~e~~~~~~~~~~------~~~lt~~e~~~~~-~~~----~~~~~~gg~lvP~~~~~----~I~~~l~~~s~ 105 (381) T protein:vir:95 41 FEETKLQAKAEAERVSSLPKS------AQSLSANQRSFFM-DIN----KNVNYKEEKLLPEETID----RIFEDLTTNHP 105 (381) T ss_pred hhhHHHHHHHHHHHHHHhccC------cccccHHHHHHHH-HHh----cccCCCCceecCHHHHH----HHHHHHHhhcc Confidence 00 0000000 00011 0111111111100 000 01122222 5665554 33443333222 Q ss_pred hhhhcccccCCCcceeeEEEeeeecccceEEeecccCCc-eeeeeeeeeeeeEEEEEEEEEeCHHHHHHHHHhCCCHHHH Q lcl|Aclame:pro 72 AAELVGESKKGDWTTLVAAFITAEPTTTVATYGDYSSDG-DSGTNINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASE 150 (336) Q Consensus 72 ~~~l~~v~t~g~w~~~t~~~~v~e~~G~a~~ygd~~DiP-~vd~~~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~ 150 (336) .+.+..+.+.+. ...+...+..+.|...+....++ -.+.......-..+.+.....+|.+=|. ...+++.+- T Consensus 106 i~~~~~v~~~~~----~~~i~~~~~~~~a~w~~e~~~~~~~~~~~f~~i~l~~~kl~~~~~is~elL~---Ds~~~ie~~ 178 (381) T protein:vir:95 106 LLADLGIKNAGL----RLKFLKSETSGVAVWGKIYGEIKGQLDAAFSEETAIQNKLTAFVVLPKDLND---FGPAWIERF 178 (381) T ss_pred ceeheeeEecCc----ceEEEEecCCcceeeecccccccccccccceeeeecceeEEeechhhHHHhh---cCHHHHHHH Confidence 333333333232 13456667777777776655553 4455566666677777766676644333 244578888 Q ss_pred HHHHHHHHHHHhhccEEEeeccccceEEEEecCCCCccccccc-ccc------cccCHHHHHHHHHHHHHHHHHHhCCce Q lcl|Aclame:pro 151 LNYSSALGLAKFLNGSYLFGVAGLENYGLINDPSLSAPITATT-PWS------GSPAVEAVVNEVVTLFQVLQTQSQGII 223 (336) Q Consensus 151 K~~aAr~a~e~~~n~i~~~Gd~~~g~~GllN~Pnl~~~~~~~t-~w~------~~~T~~eI~~Di~~l~~~l~~~t~g~v 223 (336) -......++.+.+++-++.|++..+-.|+++++......+..+ ++. ...++.-.++.+..++..+-..-.+.. T Consensus 179 i~~~la~~~a~~~~~a~i~G~G~~qP~Gil~~~~~~~~~~~g~~~~~~~~~t~t~~~~~~~~~~l~~~~~~~~~~~~~~~ 258 (381) T protein:vir:95 179 VRVQIEEAFAVALETAFLKGTGKDQPIGLNRQVQKGVSVTEGAYPEKEEQGTLTFANPRATVNELTQVFKYHSTNEKGKS 258 (381) T ss_pred HHHHHHHHHHHHhhheeEeccCCCCceeeeeccCcccccccccccccccccccccccchhhHHHHHHHHHhhcccccccc Confidence 8888889999999999999999999999999765432222111 110 111223334555555555433222211 Q ss_pred ec-cCCcEEEecHHHHHhcc----cCCCCCccHHHHHHHhCCccEEEEcccccC---CCCce-EEEEEEeeCCCceEEEE Q lcl|Aclame:pro 224 TQ-EAVLHMGLPPTAMSDLS----KTNQYGLSAAAKLKEIFPKLEFVTIPEYDT---ASGRL-VQLWAPRVEGKDTATCG 294 (336) Q Consensus 224 ~~-~~p~tL~Lp~~~~~~Ls----~~~~~~~Tvl~~l~~n~pnl~i~~~pel~~---a~G~~-~~~~~~~~~~~~~~~~~ 294 (336) .. ..--+++|-+.-+..|- ..++.|.-+... -| +++|+..+.... .-|.- .+.+.++ .+ T Consensus 259 ~~~~~~a~~~mn~~t~~~l~~~~~~~~~~G~~v~~l---~~-g~~vv~s~~~p~~~iifgDfs~Y~i~~r-~~------- 326 (381) T protein:vir:95 259 VAVKGNVTMVVNPSDAFEVQAQYTHLNANGVYVTAL---PF-NLNVIESTVQEAGKVLTYVKGLYDGYLA-GG------- 326 (381) T ss_pred ccccCceEEEEccccHHhhccccccCCCCCceeecC---CC-CceEEecCCCCcCcEEEEecccEEEEEe-cc------- Confidence 00 01124566655443332 112323211000 01 244433222110 01211 1233222 11 Q ss_pred eCchhhcccc-eecCCceEEeeecceeeeEEecccceeeeccC Q lcl|Aclame:pro 295 FTEKMRAHSI-ERYSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) Q Consensus 295 ~p~~~~~l~~-~~~~~~~~v~~~~rt~Gv~ir~P~ai~~~~GI 336 (336) +.+..+.. ....-....-...|..|. ++.|.|++.++ | T Consensus 327 --~~i~~~~~~~~~~d~~~f~a~~r~dg~-~~~~~A~~v~~-l 365 (381) T protein:vir:95 327 --INVQKFKETLALDDMDLYTAKQFAYGK-AKDNKVAAVWK-L 365 (381) T ss_pred --cEEEeechhHhhcCCeEEEEEEEEcCE-EecCceEEEEE-E Confidence 11111110 001112334455565554 46777776654 4 No 133 >protein:vir:9704 Length: 394 # NCBI annotation: hypothetical protein # Family: family:all:21 # MgeID: mge:174 # MgeName: 315.2 # Cross-refs: genbank:acc:NP_795466;genbank:gi:28876225;genbank:GeneID:1257769 Probab=53.14 E-value=0.54 Score=22.07 Aligned_cols=284 Identities=9% Similarity=0.008 Sum_probs=117.7 Q ss_pred CchHHHHHHHhh-cceeccchhhhhh-hh-hhhh-hhhhhhhcCccccCCcch--HHHHHHHhhCceeeeeeccccchhh Q lcl|Aclame:pro 1 MRDAQRIQNLAR-AGVILPRSVKNVS-TP-LAEY-AMDAADLSPHLSSTGSSG--IPNYLTTYVDPSVIDILVAPMKAAE 74 (336) Q Consensus 1 m~~~~~~~~l~~-~g~~~~~~~~~~~-~~-~~~~-~~da~d~~~~l~t~~~~~--i~~~l~~~idp~v~~~~~~~~~~~~ 74 (336) ++-...+....+ .+-.......... .+ .+.. ..........-.|..++| +|..+. ..|++.+........ T Consensus 85 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~~~gg~liP~~~~----~~ii~~~~~~~~l~~ 160 (394) T protein:vir:97 85 KTYRESVNDFIRSKGKIVNDSLRFEGKDEVLMPINETTPVEPQKDGIKKENAKPVSSEEIL----YTPAREVKTVVDLKP 160 (394) T ss_pred HHHHHHHHHHHHHHHHHhhhhhhhhhHHHHHHHHHhhhhhhhhccccccccccccChHHHH----HHHHHHhhhhhhhhh Confidence 111111111000 0100000000000 00 0000 000011111112333333 666554 345665555555555 Q ss_pred hcccccCCCcceeeEEEeeeec-ccceEEeecccCCce-eeeeeeeeeeeEEEEEEEEEeCHHHHHHHHHhCCCHHHHHH Q lcl|Aclame:pro 75 LVGESKKGDWTTLVAAFITAEP-TTTVATYGDYSSDGD-SGTNINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASELN 152 (336) Q Consensus 75 l~~v~t~g~w~~~t~~~~v~e~-~G~a~~ygd~~DiP~-vd~~~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~K~ 152 (336) +..+.+... .+..+++... .+.+...+.....|- .+...+...-..+.++..+.+|.+=++-+ ..++.+.-. T Consensus 161 ~~~~~~~~~---~~~~~~~~~~~~~~~~~v~E~~~~~~~~~~~~~~v~l~~~k~~~~i~is~ell~ds---~~~~~~~i~ 234 (394) T protein:vir:97 161 FTTVYQAKK---ASGKYPVLQRATTKMVTVAELEKNPALAKPDFKDVAWNIDTYRGAIPLSQESIDDA---DVDLVGIVS 234 (394) T ss_pred hceeeeccC---cceEEEEEecCCCccceecccccccccccccceeEEeehhheeeehhhHHHHHhhh---hHHHHHHHH Confidence 555432211 2244555543 345677788777874 44567777777777887777776544333 345666666 Q ss_pred HHHHHHHHHhhccEEEeeccccceEEEEecCCCCcccccccccccccCHHHHHHHHHHHHHHHHHHhCCceeccCCcEEE Q lcl|Aclame:pro 153 YSSALGLAKFLNGSYLFGVAGLENYGLINDPSLSAPITATTPWSGSPAVEAVVNEVVTLFQVLQTQSQGIITQEAVLHMG 232 (336) Q Consensus 153 ~aAr~a~e~~~n~i~~~Gd~~~g~~GllN~Pnl~~~~~~~t~w~~~~T~~eI~~Di~~l~~~l~~~t~g~v~~~~p~tL~ 232 (336) ...+.++...+|.-.+.|.... ++. ...+.++ |..+++...... ..-.++ T Consensus 235 ~~la~~~~~~~~~~i~~g~~~~---------------~~~----~~~~~~~----~~~~~~~~~~~~-------~~a~~v 284 (394) T protein:vir:97 235 ESISQIKVNTTNDAIAKVLKSF---------------TTK----TVKNLDE----IKALLNGGFDPA-------YNVSLI 284 (394) T ss_pred HHHHHHHHHHHHHHHhhccccc---------------ccc----ccccHHH----HHHHHHhhhhhh-------hCCEEE Confidence 6666666666665444443210 111 1123444 444443322211 123689 Q ss_pred ecHHHHHhccc-CCCCCccHHH-HHHHhCC----ccEEEEcccccCCCCceEEEEEEeeC------CCceEEEEeCchhh Q lcl|Aclame:pro 233 LPPTAMSDLSK-TNQYGLSAAA-KLKEIFP----KLEFVTIPEYDTASGRLVQLWAPRVE------GKDTATCGFTEKMR 300 (336) Q Consensus 233 Lp~~~~~~Ls~-~~~~~~Tvl~-~l~~n~p----nl~i~~~pel~~a~G~~~~~~~~~~~------~~~~~~~~~p~~~~ 300 (336) |.+..+..|.. .+..|.-++. -+...-+ +..++..+ +.+.|....++.+-.. ..+ ..+.+ - T Consensus 285 ~n~~~~~~l~~lkd~~G~~i~~~~~~~~~~~~l~G~pv~~~~--~~~~~~~~~~~gd~~~~~~~~~~~~-~~~~~----~ 357 (394) T protein:vir:97 285 VSQSFYQTLDTLKDGNGRYLLQDDITAVSGKVLLGKPVFVLS--DEVLGANKAFIGDFKRGVLFADRKD-LGLRW----A 357 (394) T ss_pred EcHHHHHHHHHhhccCCCeeeecCcCCCCCceeccceeEEec--ccccCCccEEEeeccccEEEEEecc-eEEEE----e Confidence 99998888853 3333433321 0111111 12222222 1222322222211000 000 11111 0 Q ss_pred cccceecCCceEEeeecceeeeEEecccceeeeccC Q lcl|Aclame:pro 301 AHSIERYSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) Q Consensus 301 ~l~~~~~~~~~~v~~~~rt~Gv~ir~P~ai~~~~GI 336 (336) .+ ..... -.-+..|.+| .+.+|.||+.++.= T Consensus 358 ~~--~~~~~--~~~~~~r~d~-~v~~~~a~~~~~~~ 388 (394) T protein:vir:97 358 DN--EIYGQ--YLQAVLRFGV-SKVDDKAGYYVTFT 388 (394) T ss_pred cc--cccce--eEEEEEEEcc-EEecccceEEEEec Confidence 00 00011 2345566655 56689999988776 No 134 >protein:vir:3158 Length: 321 # NCBI annotation: capsid protein gpE # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:316 # MgeName: PhiCh1 # Cross-refs: genbank:acc:NP_665929;genbank:gi:22091115;genbank:GeneID:951342 Probab=46.24 E-value=0.74 Score=21.29 Aligned_cols=284 Identities=9% Similarity=-0.021 Sum_probs=112.5 Q ss_pred CchHHHHHHH---hh-cceeccchhhhhhhhhhhhhhhhhhhcCccccCCcchHHHHHHHhhCceeeeeeccccchhhhc Q lcl|Aclame:pro 1 MRDAQRIQNL---AR-AGVILPRSVKNVSTPLAEYAMDAADLSPHLSSTGSSGIPNYLTTYVDPSVIDILVAPMKAAELV 76 (336) Q Consensus 1 m~~~~~~~~l---~~-~g~~~~~~~~~~~~~~~~~~~da~d~~~~l~t~~~~~i~~~l~~~idp~v~~~~~~~~~~~~l~ 76 (336) |-+...-+.| ++ .++..+ ++..+ -.-.|.+.+.+++ .+.+- -+-++...++ T Consensus 1 ~~~k~~~~~l~~~~~~~~~~~~------------------~~~~g-----~~v~~~~~~~l~~-~i~e~-s~~l~~i~v~ 55 (321) T protein:vir:31 1 MASRTINNDLSRITEKNALTVD------------------DLDAG-----GTLPDPLWDEFWT-DMIEE-TPLLDAIRTE 55 (321) T ss_pred CchHHHHHHHHHHHHhcccccc------------------ccCCc-----ceeCHHHHHHHHH-HHHHh-hhhhhhceee Confidence 3322222211 11 111111 11111 0011333333332 22221 2223333333 Q ss_pred ccccCCCcceeeEEEeeeecccceEEeec-c-cCCceeeeeeeeeeeeEEEEEEEEEeCHHHHHHHHHhCCCHHHHHHHH Q lcl|Aclame:pro 77 GESKKGDWTTLVAAFITAEPTTTVATYGD-Y-SSDGDSGTNINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASELNYS 154 (336) Q Consensus 77 ~v~t~g~w~~~t~~~~v~e~~G~a~~ygd-~-~DiP~vd~~~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~K~~a 154 (336) |+++ ..- ........|.+...++ . ...+..+.......-..+.+..-...+.+-|. ..+.+-++.+.-... T Consensus 56 ~v~~---~~~---~i~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~k~~~~~~it~e~L~-d~a~~~d~e~~i~~~ 128 (321) T protein:vir:31 56 TVGA---KKT---RIPTLNIGERHRRPQDEGEWNENESDVSTGTIDISTEKATVAWDLPREVVQ-ENPEGEALADRILNL 128 (321) T ss_pred eccC---cce---eeeeeccCCcccccccccccccccccceeeeeeeeeEEEEeehhccHHHHH-hhhcchhHHHHHHHH Confidence 3321 110 1111111122211121 1 11122233333333334444444444444343 333467899999999 Q ss_pred HHHHHHHhhccEEEeeccccc------eEEEEecCCCCcccccccccccccCHHHHHHHHHHHHHHHHHHhCCceeccCC Q lcl|Aclame:pro 155 SALGLAKFLNGSYLFGVAGLE------NYGLINDPSLSAPITATTPWSGSPAVEAVVNEVVTLFQVLQTQSQGIITQEAV 228 (336) Q Consensus 155 Ar~a~e~~~n~i~~~Gd~~~g------~~GllN~Pnl~~~~~~~t~w~~~~T~~eI~~Di~~l~~~l~~~t~g~v~~~~p 228 (336) .++++...+..+.|+|++... +.|+++.+.-.......+ .+..+. +++.+++..|-..- .+.+ T Consensus 129 ia~~~a~~~~~~~~nGd~~~~~~~~~~n~G~l~~a~~~~~~~~~~--~~~~~~----d~l~~l~~~l~~~y-----r~~~ 197 (321) T protein:vir:31 129 MTDAWSADVEDLAANGDEDAEDSFENQNDGFITVAEGDVETIDAA--DDILDN----DLVIRTIAGLDSKY-----RARM 197 (321) T ss_pred HHHHHHHHHHhheeeccccCCCcccccchhhhhhhcccccccccc--ccccCH----HHHHHHHHhccHhH-----hcCC Confidence 999999999999999997533 346666432221111001 112222 23344444432211 1122 Q ss_pred -cEEEecHHHHHhc----ccCCC-CCccHHHH-HHHhCCccEEEEcccccCCCCceEEEEEEeeCCCceEEEEeC--chh Q lcl|Aclame:pro 229 -LHMGLPPTAMSDL----SKTNQ-YGLSAAAK-LKEIFPKLEFVTIPEYDTASGRLVQLWAPRVEGKDTATCGFT--EKM 299 (336) Q Consensus 229 -~tL~Lp~~~~~~L----s~~~~-~~~Tvl~~-l~~n~pnl~i~~~pel~~a~G~~~~~~~~~~~~~~~~~~~~p--~~~ 299 (336) ...+|....+..+ ...+. .+...+.- -..++=++.++.+|.+-.. ..++.+ .+.+.+.+- ..+ T Consensus 198 ~~v~im~~~~~~~~~~~l~~~~~~~~~~~l~~~~~~tl~G~pvv~~~~mP~~----~il~t~----~~nl~~~~~~~~~~ 269 (321) T protein:vir:31 198 NPALIVSEDQLLSYHYTLTDRDTPLGDNVIMGEADVNPFSFPIIGSGLWPDD----KAMFTD----PQNLIYALYRDLEI 269 (321) T ss_pred CeEEEechHHHHHHHHHHhcCCCccccchhhccccccccceeEEEcCCCCCC----cEEEec----cccEEEEEeeccEE Confidence 2567877655432 22111 11122111 0112335667777766432 112211 111111111 122 Q ss_pred hccc--cee--cCCceEEeeecceeeeEEecccceeeeccC Q lcl|Aclame:pro 300 RAHS--IER--YSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) Q Consensus 300 ~~l~--~~~--~~~~~~v~~~~rt~Gv~ir~P~ai~~~~GI 336 (336) +... .+. +...++--++ +--|.+|..+-+++.+.|| T Consensus 270 ~~~~~~~~~~~~~~~~~~~~~-~~~~~~ve~~~a~a~~~~i 309 (321) T protein:vir:31 270 DVLTESDKVSERDLHARYFMR-GDDDFAIENTEAVVLAEGL 309 (321) T ss_pred EEeecCccccccceeeEeeee-eecceeEeccccEEEEecC Confidence 2211 111 1122222223 3356888999999999999 No 135 >protein:vir:94622 Length: 341 # NCBI annotation: PfWMP4_37 # Family: family:all:2203 # MgeID: mge:1525 # MgeName: Pf-WMP4 # Cross-refs: genbank:acc:YP_762667;genbank:gi:115304375;genbank:GeneID:5142322 Probab=41.49 E-value=0.92 Score=20.77 Aligned_cols=279 Identities=13% Similarity=0.042 Sum_probs=110.1 Q ss_pred hhhhhhhhcCccccCCcchHHHHHHHhhCcee-----eeeeccccchhhhcccccCCCcc-eeeEEEeeeecccceEEee Q lcl|Aclame:pro 31 YAMDAADLSPHLSSTGSSGIPNYLTTYVDPSV-----IDILVAPMKAAELVGESKKGDWT-TLVAAFITAEPTTTVATYG 104 (336) Q Consensus 31 ~~~da~d~~~~l~t~~~~~i~~~l~~~idp~v-----~~~~~~~~~~~~l~~v~t~g~w~-~~t~~~~v~e~~G~a~~yg 104 (336) |+|-=...++.++| ++-++|| |++ .+.+........++. +..++.. -+++.++..- ...++-|. T Consensus 1 ~~~~~~~~~~~~~t-------~~v~~fi-pei~s~~i~~~l~~~~v~~~~~~-d~~~~~~~Gdtv~ip~~g-~~~~~d~~ 70 (341) T protein:vir:94 1 MALGNTITGPSINT-------QRGQQFI-PEQWLSEVQMFRKAKMLDTSVVK-TWGAQVKKGDTFHVPRIS-ELGVEDKA 70 (341) T ss_pred Ccchhhhccccccc-------hhHHHHH-HHHHHHHHHHHHHhhcchhhccc-cccccccCCceEEEeccC-cceeeeec Confidence 55533333344332 2233444 443 233344455555553 2233332 3678888642 33455554 Q ss_pred cccCCceeeeeeeeeeeeEEE-EEEEEEeCHHHHHHHHHhCCCHHHHHHHHHHHHHHHhhccEEEeeccccceEEEEecC Q lcl|Aclame:pro 105 DYSSDGDSGTNINYPQRQSYF-FQTWTRWGERELEMAGAGRVDLASELNYSSALGLAKFLNGSYLFGVAGLENYGLINDP 183 (336) Q Consensus 105 d~~DiP~vd~~~~~~~~~v~~-~~~~~~y~~~El~~A~~~g~~l~~~K~~aAr~a~e~~~n~i~~~Gd~~~g~~GllN~P 183 (336) -...++.-+.+..+....+-. -..++.++..|... ...++-.+-.+.+..++.+..++..+---+...... -+ T Consensus 71 ~~~~i~~~~~~~~~~~itiD~~~~~~~~i~d~d~~~---~~~d~~~~~~~~~~~aLA~~~D~~i~~~~a~~~~~~---~~ 144 (341) T protein:vir:94 71 TDVPVGVQPVNDTDFVITVDTDRTTAVALDDLLEIQ---ASYDLRAPYLEAMGYALAKDMTGSILGLRAAVQNTA---SQ 144 (341) T ss_pred CCCccccccccCceEEEEEeeeeecceeechHHHHh---hccchHHHHHHHHHHHHHHHHHHHHHHHhhhccccc---cC Confidence 344555545444444444422 24456666555432 345777776666767776666654221001000000 01 Q ss_pred CCCcccccccccccccCHHH-HHHHHHHHHHHHHHHhCCceeccCCcEEEecHHHHHhcccCCC------CCccHHHHHH Q lcl|Aclame:pro 184 SLSAPITATTPWSGSPAVEA-VVNEVVTLFQVLQTQSQGIITQEAVLHMGLPPTAMSDLSKTNQ------YGLSAAAKLK 256 (336) Q Consensus 184 nl~~~~~~~t~w~~~~T~~e-I~~Di~~l~~~l~~~t~g~v~~~~p~tL~Lp~~~~~~Ls~~~~------~~~Tvl~~l~ 256 (336) +.. + +.......+++. .++.|.++...+-.. + + +...-.++++|..+..|.+-+. .+.. -++ T Consensus 145 ~~~---~-~~~~~~t~~~~~~~~~~i~~a~~~Lde~--~-V-P~~gR~lvv~P~~~~~Ll~~~~~~~~~~~g~~---~l~ 213 (341) T protein:vir:94 145 NVF---S-SSNGAITGNGQAFSFAVFLAARRLLLEA--D-V-PEEKIVLLISPGQESALFTIPQFISKDFINNA---PIA 213 (341) T ss_pred ccc---c-CccccccCchhhhhHHHHHHHHHHHhhc--C-C-CccCCEEEeCHHHHHHHhhchhhhhhhccccc---hhh Confidence 100 0 000111111222 234444444444322 1 1 3344579999999999864221 1111 122 Q ss_pred H----hCCccEEEEcccccCCCCce----EEEEEE----------------------------eeCCCceEEEEeCchhh Q lcl|Aclame:pro 257 E----IFPKLEFVTIPEYDTASGRL----VQLWAP----------------------------RVEGKDTATCGFTEKMR 300 (336) Q Consensus 257 ~----n~pnl~i~~~pel~~a~G~~----~~~~~~----------------------------~~~~~~~~~~~~p~~~~ 300 (336) + +.-.++|...+.+-..++.. +..... +.+---..++.=|+.++ T Consensus 214 ~G~ig~i~G~~V~~Sn~lp~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~gl~~~~~av~~~k~~~~~~~~ 293 (341) T protein:vir:94 214 QGQIGSLMGVRVIRTSLIGNNSATGWRNGAPTIAPAEATPGFTGSRYLPKQDSFTSLPATFTGNSRPVHTAVMCHMDWAA 293 (341) T ss_pred eeeeeeEeceEEEEeccccccccccccccccceecccccccccccccccccccccccEEEEEEecccccceeeecchhhh Confidence 1 12234555444442211110 000000 00000011111233333 Q ss_pred ccccee-cCCceEEeeec-------ceeeeEEecccceeeeccC Q lcl|Aclame:pro 301 AHSIER-YSSYFRQKKSA-------GTWGAVIFRPFAVAQMIGV 336 (336) Q Consensus 301 ~l~~~~-~~~~~~v~~~~-------rt~Gv~ir~P~ai~~~~GI 336 (336) ...+|. +......+.+. -..|+-+.||.+++.+.=- T Consensus 294 ~~~~~~~~~~~~~~~~~~~~~i~~~~~~G~~~lrp~~~v~~~~~ 337 (341) T protein:vir:94 294 AVVSKAPRVTQSFENREQVWLMVGRQAYGARLYRPLHAVNIHTT 337 (341) T ss_pred ccccccccccccchhhhhhhhhhhhhhhcccccCcceeEEEecC Confidence 332221 11111111111 1346666666665433333 No 136 >protein:vir:99675 Length: 324 # NCBI annotation: Major capsid protein # Family: family:all:975 # MgeID: mge:1523 # MgeName: VP4 # Cross-refs: genbank:acc:YP_249589;genbank:gi:68299740;genbank:GeneID:3799990 Probab=36.72 E-value=1.2 Score=20.24 Aligned_cols=244 Identities=13% Similarity=-0.018 Sum_probs=99.9 Q ss_pred hcccccCCCcceeeEEEeeeecccceEEeec--ccCC--ceeeeeeeeeeeeE--EEEEEEEEeCHHHHHHHHHhCCCHH Q lcl|Aclame:pro 75 LVGESKKGDWTTLVAAFITAEPTTTVATYGD--YSSD--GDSGTNINYPQRQS--YFFQTWTRWGERELEMAGAGRVDLA 148 (336) Q Consensus 75 l~~v~t~g~w~~~t~~~~v~e~~G~a~~ygd--~~Di--P~vd~~~~~~~~~v--~~~~~~~~y~~~El~~A~~~g~~l~ 148 (336) ++=--+.| .++.|+. .|++++..- +.++ +.-+...++....| ..+. +.-+.++-.+| +..++- T Consensus 1 ~vr~i~~g----~s~~~~~---iG~~~~~~~~~G~~l~~~~~~~~~~e~~itID~~l~~---~~~VdDiD~~q-a~~Dlr 69 (324) T protein:vir:99 1 MTRTITSG----KSAQFPV---MGRTKARYLKQGQSLDDGREDIKHTEKVITIDGLLTT---DVLIYDIEDAM-NHYDVR 69 (324) T ss_pred CeeeeecC----ceEEEee---eeeeEeccccCCCCcCCCcCCcCcccEEEEecchhhh---hhhhhhHHHHh-cCccch Confidence 11111222 3445543 577665431 2332 11222222211111 1111 11223444444 446677 Q ss_pred HHHHHHHHHHHHHhhccEEEe-----e--ccccceEEEEecCCCCcccccccccccccCHHHHHHHHHHHHHHHHHHhCC Q lcl|Aclame:pro 149 SELNYSSALGLAKFLNGSYLF-----G--VAGLENYGLINDPSLSAPITATTPWSGSPAVEAVVNEVVTLFQVLQTQSQG 221 (336) Q Consensus 149 ~~K~~aAr~a~e~~~n~i~~~-----G--d~~~g~~GllN~Pnl~~~~~~~t~w~~~~T~~eI~~Di~~l~~~l~~~t~g 221 (336) ++-.+.+..++.+..|+..+. . .+.....+..............+.--...+++.+++-|..+-..|..+. T Consensus 70 ~e~s~~~G~aLA~~~Dq~i~~~~a~~~~~~a~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~dai~~a~~~Lde~~-- 147 (324) T protein:vir:99 70 SEYSTQMGEALAMAADVANYAEMAKLVNSRKETTNENIEGLGAASLVKITGKKEDPAKYGTQVIQALTYARAAFAKKY-- 147 (324) T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccCCcccCCccceecccccccccccCHHHHHHHHHHHHHHHhhcC-- Confidence 777777777777777754321 1 0111111111111111111112222224567888888888877775543 Q ss_pred ceeccCCcEEEecHHHHHhcccCC-----CCCccHHHHHHHh---CCccEEEEcccccCCC---------CceEEEE--- Q lcl|Aclame:pro 222 IITQEAVLHMGLPPTAMSDLSKTN-----QYGLSAAAKLKEI---FPKLEFVTIPEYDTAS---------GRLVQLW--- 281 (336) Q Consensus 222 ~v~~~~p~tL~Lp~~~~~~Ls~~~-----~~~~Tvl~~l~~n---~pnl~i~~~pel~~a~---------G~~~~~~--- 281 (336) + +...-.+++||..|..|.... .++ +...+.+.. .-+++|...+.|-..+ +..+.+= T Consensus 148 -V-P~~gR~~vv~P~~y~~Ll~~~~~~~~~~~-~~~~~~~G~V~~i~Gf~V~~Sn~lp~~~~t~~~~a~~~~~~~~~~~~ 224 (324) T protein:vir:99 148 -I-PAGDRTFYTDPDTYSAILAALMPNAANYA-ALIDPETGNIRNVMGFEVVETPHMTAQMVTNPTDAFDGTGHIFPATG 224 (324) T ss_pred -C-CCCCCEEEeChHHHHHHhhcccccccccc-cccceecceEEEEeceEEEecCCcccccccccccccccccccccccc Confidence 2 445678999999999885321 111 111111111 1235666555553211 1111000 Q ss_pred ------EEeeCCCceEEEE-----------eCchhhcccceecCCceEEeeecceeeeEEecccceeeec-------cC Q lcl|Aclame:pro 282 ------APRVEGKDTATCG-----------FTEKMRAHSIERYSSYFRQKKSAGTWGAVIFRPFAVAQMI-------GV 336 (336) Q Consensus 282 ------~~~~~~~~~~~~~-----------~p~~~~~l~~~~~~~~~~v~~~~rt~Gv~ir~P~ai~~~~-------GI 336 (336) -+..+...+.-+. ++........+.+ -.+.+...... |+.+.||-+++... |+ T Consensus 225 ~~~~~~ky~~d~~~~~gl~~~~~a~~tv~~~~~~~e~~~~~~~-~~d~i~~~~a~-G~~~lRPe~a~~v~l~~~~~~~~ 301 (324) T protein:vir:99 225 DSTTTGKMTVGADNVVGLFVHRSAVATLKLKDMALERARRPEY-QADQIIAKYAM-GHGGLRPEAVGAIIFEDGETPAV 301 (324) T ss_pred ccccccccccccCceeEEEEehhheEEEeeecceecceechhh-HHHhhhhhhhh-cCcccccceEEEEEEccCccccc Confidence 0000011111111 1112222222222 22333333333 88888998775443 33 No 137 >protein:vir:7990 Length: 273 # NCBI annotation: gp6 # Family: family:all:2203 # MgeID: mge:151 # MgeName: Che8 # Cross-refs: genbank:acc:NP_817344;genbank:gi:29565772;genbank:GeneID:1258978 Probab=31.77 E-value=1.5 Score=19.66 Aligned_cols=257 Identities=12% Similarity=0.033 Sum_probs=108.7 Q ss_pred hhhhhhhhcCccccCCcchHHHHHHHhhCceeeeeeccccchhhhcccccCCCc-ceeeEEEeeeecccceEEeecccCC Q lcl|Aclame:pro 31 YAMDAADLSPHLSSTGSSGIPNYLTTYVDPSVIDILVAPMKAAELVGESKKGDW-TTLVAAFITAEPTTTVATYGDYSSD 109 (336) Q Consensus 31 ~~~da~d~~~~l~t~~~~~i~~~l~~~idp~v~~~~~~~~~~~~l~~v~t~g~w-~~~t~~~~v~e~~G~a~~ygd~~Di 109 (336) ||. +.-+|+.+...|... +...+....|+....++.- --.|++++..-..+.+..-+....+ T Consensus 1 MA~-------------~~~~pei~~~~v~~~----~~~~lv~~~l~~~~~~~~~~~GdTv~ip~~~~~~~~d~~~~~~~~ 63 (273) T protein:vir:79 1 MAF-------------NNFIPELWSDMLLEE----WTAQTVFANLVNREYEGIASKGNVVHIAGVVAPTVKDYKAAGRQT 63 (273) T ss_pred Ccc-------------hhhhHHHHHHHHHHH----HHhhccchhhhhccccccccCCcEEEEeecCcccccccccCCCcc Confidence 111 122455555444222 2333334444322211110 0247888886655533322333334 Q ss_pred ceeeeeeeeeeeeEEE-EEEEEEeCHHHHHHHHHhCCCHHHHHHHHHHHHHHHhhccEEEeeccccceEEEEecCCCCcc Q lcl|Aclame:pro 110 GDSGTNINYPQRQSYF-FQTWTRWGERELEMAGAGRVDLASELNYSSALGLAKFLNGSYLFGVAGLENYGLINDPSLSAP 188 (336) Q Consensus 110 P~vd~~~~~~~~~v~~-~~~~~~y~~~El~~A~~~g~~l~~~K~~aAr~a~e~~~n~i~~~Gd~~~g~~GllN~Pnl~~~ 188 (336) +.-+.+..+...++-. -..++.+...|... ...++.. -...+..++.+.+++..+ +++-.-... T Consensus 64 ~~~~~~~~~~~~tid~~~~~~~~i~d~d~~~---~~~~~~~-~~~~~~~ala~~vD~~i~---------~~~~~a~~~-- 128 (273) T protein:vir:79 64 SADAISDTGVDLLIDQEKSIDFLVDDIDRVQ---VAGSLEA-YTRAGATALATDTDKFIA---------DMLVDNGTA-- 128 (273) T ss_pred CccccccceEEEEEeeecccceeeccHHHHh---hcccHHH-HHHHHHHHHHHHHHHHHH---------HHHhhcccc-- Confidence 4445555555666643 35566776655433 2335643 333445566666654211 111000000 Q ss_pred cccccccccccCHHHHHHHHHHHHHHHHHHhCCceeccCCcEEEecHHHHHhcccCCCC----CccH-HHHHHH----hC Q lcl|Aclame:pro 189 ITATTPWSGSPAVEAVVNEVVTLFQVLQTQSQGIITQEAVLHMGLPPTAMSDLSKTNQY----GLSA-AAKLKE----IF 259 (336) Q Consensus 189 ~~~~t~w~~~~T~~eI~~Di~~l~~~l~~~t~g~v~~~~p~tL~Lp~~~~~~Ls~~~~~----~~Tv-l~~l~~----n~ 259 (336) .+.+ ...+++.+++.|.++...+-..- + |...-.|+++|..+..|.+..+. ...- ..-|++ +. T Consensus 129 ~~~~----~~~~~~~~~~~i~~a~~~ld~~~---v-P~~~R~lvv~p~~~~~Ll~~~~~~~~~~~~~~~~~l~~G~ig~~ 200 (273) T protein:vir:79 129 LTGS----APSDADDAFDLIASALKELTKAN---V-PNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNL 200 (273) T ss_pred cccc----cccchhhHHHHHHHHHHHhhhcc---C-CccCcEEEECHHHHHHHhhchhhhhhhhhcccccceeeeEeeEE Confidence 0101 12345667777777766553321 1 33344899999998877442110 0000 000111 12 Q ss_pred CccEEEEcccccCCCCceEEEEEEeeCCCceEEEEeCc---hhhcccceecCCceEEeeecceeeeEEecccceeeeccC Q lcl|Aclame:pro 260 PKLEFVTIPEYDTASGRLVQLWAPRVEGKDTATCGFTE---KMRAHSIERYSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) Q Consensus 260 pnl~i~~~pel~~a~G~~~~~~~~~~~~~~~~~~~~p~---~~~~l~~~~~~~~~~v~~~~rt~Gv~ir~P~ai~~~~GI 336 (336) -+++|.....+...++.....+.... +.+.. .+..+..+.+. .-.+... -..|+-+.+|-+++.+.== T Consensus 201 ~G~~i~~s~~lp~~~~~~~~a~~~~A-------~~~a~~~~~~e~~r~~~~~-~~~v~~~-~~yg~~v~~p~~vv~~~~~ 271 (273) T protein:vir:79 201 LGARIVESNNLRDTDDEQFVAFHPSA-------AAYVSQIDTVEALRDQDSF-SDRIRAL-HVYGGKVVRPTGVVVFNKT 271 (273) T ss_pred eceEEEecccccccCceEEEEEeccc-------eeeeeehhhhhcccCcccc-eeeeeee-eeeeeEEecCceEEEEecc Confidence 23555555445433333332222111 11111 12222222222 2222232 3367777788887775433 No 138 >protein:vir:2736 Length: 348 # NCBI annotation: putative structural protein # Family: family:all:1083 # MgeID: mge:58 # MgeName: O1205 # Cross-refs: genbank:acc:NP_695109;genbank:gi:23455878;genbank:GeneID:955608 Probab=25.51 E-value=2 Score=18.88 Aligned_cols=262 Identities=7% Similarity=0.011 Sum_probs=90.8 Q ss_pred hhhhhhhhcCccccCCcchHHHHHHHhhCceeeeeeccccchhhhcccccCCCcceeeEEEeeeecccc---eEEeec-c Q lcl|Aclame:pro 31 YAMDAADLSPHLSSTGSSGIPNYLTTYVDPSVIDILVAPMKAAELVGESKKGDWTTLVAAFITAEPTTT---VATYGD-Y 106 (336) Q Consensus 31 ~~~da~d~~~~l~t~~~~~i~~~l~~~idp~v~~~~~~~~~~~~l~~v~t~g~w~~~t~~~~v~e~~G~---a~~ygd-~ 106 (336) |+.-.. +. =++.|+.||. ++ ...+.++-...+||....- .+.|...+.... +..+-. . T Consensus 1 M~~i~d-----~f------~~~~l~~~v~-~~-~~~~~~~l~~~~Fp~~~~~-----~~~~~~~~~~~~~~~~a~~v~~~ 62 (348) T protein:vir:27 1 MGLIYD-----KV------TASNIAGYFN-AL-QENVSSTLGESIFPARKQL-----GTKLSYIKGASGQSVALKAAAFD 62 (348) T ss_pred Ccchhh-----hc------CHHHHHHHHH-hc-cchhhhhhHhhcCCCcccc-----ceeEEEEeeccCceeEeeeecCC Confidence 221110 11 0345555552 11 1122334445677743211 122222222221 122211 1 Q ss_pred cCCceeee-eeeeeeeeEEEEEEEEEeCHHHHHHHHHhCC--CHHH-------------HHHHHHHHHHHHhhcc----- Q lcl|Aclame:pro 107 SSDGDSGT-NINYPQRQSYFFQTWTRWGERELEMAGAGRV--DLAS-------------ELNYSSALGLAKFLNG----- 165 (336) Q Consensus 107 ~DiP~vd~-~~~~~~~~v~~~~~~~~y~~~El~~A~~~g~--~l~~-------------~K~~aAr~a~e~~~n~----- 165 (336) ..-|+.+- ..+..+.++-.+.-.+..+..|++.-..+.- +-.. +...+.++..|...-+ T Consensus 63 ~~~~~~~r~~~~~~~~~~p~i~~~~~i~~~d~~~~~~~~~~~~~~~~~~~~~~i~~d~~~l~~~i~~r~E~m~~~al~~G 142 (348) T protein:vir:27 63 TNVTIRDRVSAEMHDEQMPFFKEAMLVKENDRQQLNLVKDSGNAVLVNTIVAGIFNDNLTLVNGARARLEAMRMQVLATG 142 (348) T ss_pred CCcceecccceeeeeeecCccccccccCHHHHHHHHHhhccCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcC Confidence 11122211 1222233333344555666666543222211 1111 1112233333433333 Q ss_pred -EEEeeccccceEEE-EecCCCCcccccccccccccCHHHHHHHHHHHHHHHHHHhCCceeccCCcEEEecHHHHHhccc Q lcl|Aclame:pro 166 -SYLFGVAGLENYGL-INDPSLSAPITATTPWSGSPAVEAVVNEVVTLFQVLQTQSQGIITQEAVLHMGLPPTAMSDLSK 243 (336) Q Consensus 166 -i~~~Gd~~~g~~Gl-lN~Pnl~~~~~~~t~w~~~~T~~eI~~Di~~l~~~l~~~t~g~v~~~~p~tL~Lp~~~~~~Ls~ 243 (336) +.+-|+.. . +.+ ++.|.-. .++++++ |+.++++ +++||.+....+.. + |. .|.+++|.+..+..|.+ T Consensus 143 ki~i~~~~~-~-~~vdfg~~~~~-~~t~~~~-W~~~~ad-p~~di~~~~~~~~~-~-G~----~~~~ii~~~~~~~~l~~ 211 (348) T protein:vir:27 143 KIAFTSDGV-N-KDIDYGVKPDH-KKQVSKS-WAEPGAT-PLADLEDAIETARE-L-GL----NPERAVMNAKTFGLIRK 211 (348) T ss_pred eeEEecCCe-e-EEEeecCCccc-ceeeeec-cCCCCCC-HHHHHHHHHHHHHh-c-CC----cccEEEECHHHHHHHhc Confidence 33333221 1 111 1222221 2344555 4555655 88999998877643 4 42 58899999999999854 Q ss_pred C---------CC---CCcc---HHHHHHHhCCccEEEEcc-cccCCCCceEEEEEEeeCCCceEEEEeCch-hhccccee Q lcl|Aclame:pro 244 T---------NQ---YGLS---AAAKLKEIFPKLEFVTIP-EYDTASGRLVQLWAPRVEGKDTATCGFTEK-MRAHSIER 306 (336) Q Consensus 244 ~---------~~---~~~T---vl~~l~~n~pnl~i~~~p-el~~a~G~~~~~~~~~~~~~~~~~~~~p~~-~~~l~~~~ 306 (336) - +. ..++ +.+++.. +-.++|+.-- .+.+.+|....+ +|.. +..+|... T Consensus 212 ~~~v~~~~~~~~~~~~~i~~~~~~~~~~~-~~g~~i~~yd~~y~d~~G~~~~~--------------~p~~~vvl~~~~~ 276 (348) T protein:vir:27 212 AASTVKVIKPLAGDGSAVTKAELENYIAD-NFGVSIVLENGTYRNDKGEVSKF--------------YPDGHLTLIPNGP 276 (348) T ss_pred CHHHHHHhcccCccccccCHHHHHHHHHh-hcCceEEEEeeEEEcCCCcCccc--------------ccCCeEEEEcCCc Confidence 1 00 0111 2222211 1222322211 122223322221 2321 11222111 Q ss_pred cCCc-eEE------eeecceeeeEE------ecccceeeeccC Q lcl|Aclame:pro 307 YSSY-FRQ------KKSAGTWGAVI------FRPFAVAQMIGV 336 (336) Q Consensus 307 ~~~~-~~v------~~~~rt~Gv~i------r~P~ai~~~~GI 336 (336) .+.. |-. +....+....+ ..+.....-+-. T Consensus 277 ~G~~~yG~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~dP~ 319 (348) T protein:vir:27 277 LGNTVFGTTPEESDLFADNTVNAEVEIVDNGIAVTTTKTTDPV 319 (348) T ss_pred ceeEEeccCcchhhhhhccccccceeeeCCeeEEEeeecCCCc Confidence 1100 000 00111111111 111111222222 No 139 >protein:vir:102605 Length: 273 # NCBI annotation: gp6 # Family: family:all:2203 # MgeID: mge:1661 # MgeName: Llij # Cross-refs: genbank:acc:YP_655002;genbank:gi:109392192;genbank:GeneID:4157227 Probab=23.39 E-value=2.3 Score=18.59 Aligned_cols=256 Identities=11% Similarity=0.006 Sum_probs=104.7 Q ss_pred hhhhhhhhcCccccCCcchHHHHHHHhhCceeeeeeccccchhhhcccccC--CCcceeeEEEeeeecccceEEeecccC Q lcl|Aclame:pro 31 YAMDAADLSPHLSSTGSSGIPNYLTTYVDPSVIDILVAPMKAAELVGESKK--GDWTTLVAAFITAEPTTTVATYGDYSS 108 (336) Q Consensus 31 ~~~da~d~~~~l~t~~~~~i~~~l~~~idp~v~~~~~~~~~~~~l~~v~t~--g~w~~~t~~~~v~e~~G~a~~ygd~~D 108 (336) ||. +.-+|..+..-|...+ ........|+..... +..+ .++.++.....+.+.--+.... T Consensus 1 MA~-------------~~~~pe~~~~~v~~~~----~~~lv~~~l~~~~~~~~~~~G-dtv~ip~~~~~~~~d~~~~~~~ 62 (273) T protein:vir:10 1 MAF-------------NNFIPELWSDMLLEEW----TAQTVFANLVNREYEGTASKG-NVVHIAGVVAPTVKDYKAAGRQ 62 (273) T ss_pred Ccc-------------hhhhHHHHHHHHHHHH----HhhhccchhhccccccccccC-ceEEEeecccccccccccCCCc Confidence 111 2224555554333232 333334445443222 2222 5788887554443221121122 Q ss_pred CceeeeeeeeeeeeEE-EEEEEEEeCHHHHHHHHHhCCCHHHHHHHHHHHHHHHhhccEEEeeccccceEEEEecCCCCc Q lcl|Aclame:pro 109 DGDSGTNINYPQRQSY-FFQTWTRWGERELEMAGAGRVDLASELNYSSALGLAKFLNGSYLFGVAGLENYGLINDPSLSA 187 (336) Q Consensus 109 iP~vd~~~~~~~~~v~-~~~~~~~y~~~El~~A~~~g~~l~~~K~~aAr~a~e~~~n~i~~~Gd~~~g~~GllN~Pnl~~ 187 (336) ++.-+.+..+...++- .-..++.++..|...+ . .++.+ -...+..++.+.+++..+ + . +.+- ... T Consensus 63 ~~~~~~~~~~~~~tid~~~~~~~~i~d~d~~~~--~-~~~~~-~~~~~~~alA~~vD~~i~-~--~--~~~a----~~~- 128 (273) T protein:vir:10 63 TSADAISDTGVDLLIDQEKSIDFLVDDIDRVQV--A-GSLEA-YTRAGATALATDTDKFIA-D--M--LVDN----GTA- 128 (273) T ss_pred cCccccccceEEEEEeeeeecceEeecHHHhhh--h-ccHHH-HHHHHHHHHHHHHHHHHH-H--H--Hhcc----ccc- Confidence 2223333333344442 2344555554443322 2 24533 233344455555553221 0 0 0000 000 Q ss_pred ccccccccccccCHHHHHHHHHHHHHHHHHHhCCceeccCCcEEEecHHHHHhcccCCCC----Ccc-HHHHHHH----h Q lcl|Aclame:pro 188 PITATTPWSGSPAVEAVVNEVVTLFQVLQTQSQGIITQEAVLHMGLPPTAMSDLSKTNQY----GLS-AAAKLKE----I 258 (336) Q Consensus 188 ~~~~~t~w~~~~T~~eI~~Di~~l~~~l~~~t~g~v~~~~p~tL~Lp~~~~~~Ls~~~~~----~~T-vl~~l~~----n 258 (336) .+.+ ...|++.+++.|.++...+-..- + +...-.|+++|..+..|.+.+++ ... -..-+++ + T Consensus 129 -~~~~----~~~~~~~~~~~i~~a~~~ld~~~---v-P~~~R~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~~G~ig~ 199 (273) T protein:vir:10 129 -LTGS----APTDADDAFDLIAKALKELTKAN---V-PNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGN 199 (273) T ss_pred -cccc----cccchhHHHHHHHHHHHHhhhcC---C-CcCCCEEEECHHHHHHHhcchhhhhhhhccccccceeeeeeeE Confidence 0101 12356778888888877764332 1 33345899999999988543210 000 0011111 1 Q ss_pred CCccEEEEcccccCCCCceEEEEEEeeCCCceEEEEeCc---hhhcccceecCCceEEeeecceeeeEEecccceeeecc Q lcl|Aclame:pro 259 FPKLEFVTIPEYDTASGRLVQLWAPRVEGKDTATCGFTE---KMRAHSIERYSSYFRQKKSAGTWGAVIFRPFAVAQMIG 335 (336) Q Consensus 259 ~pnl~i~~~pel~~a~G~~~~~~~~~~~~~~~~~~~~p~---~~~~l~~~~~~~~~~v~~~~rt~Gv~ir~P~ai~~~~G 335 (336) .-+++|.....+-..++.....+.... +.++. .+..+..+.+. .-.+... -..|+-+.||-+++.+.= T Consensus 200 i~G~~v~~s~~lp~~~~~~~~~~~~~A-------~~~a~q~~~~e~~r~~~~~-~~~v~~~-~~yg~~v~~~~~~~~l~~ 270 (273) T protein:vir:10 200 LLGARIVESNNLRDTDDEQFVAFHPSA-------AAYVSQIDTVEALRDQDSF-SDRIRAL-HVYGGKVVRPTGVVVFNK 270 (273) T ss_pred EeceEEEEecccccCCccEEEEEeccc-------eeeeeeeehhhcccCCCcc-eeeeeee-eeeeeeEeccceEEEEec Confidence 223455554444322333333222111 11222 12222222222 2222222 236777778888776543 Q ss_pred C Q lcl|Aclame:pro 336 V 336 (336) Q Consensus 336 I 336 (336) = T Consensus 271 ~ 271 (273) T protein:vir:10 271 T 271 (273) T ss_pred c Confidence 3 No 140 >protein:vir:105822 Length: 273 # NCBI annotation: gp6 # Family: family:all:2203 # MgeID: mge:1636 # MgeName: PMC # Cross-refs: genbank:acc:YP_655767;genbank:gi:109522090;genbank:GeneID:4157630 Probab=23.39 E-value=2.3 Score=18.59 Aligned_cols=256 Identities=11% Similarity=0.006 Sum_probs=104.7 Q ss_pred hhhhhhhhcCccccCCcchHHHHHHHhhCceeeeeeccccchhhhcccccC--CCcceeeEEEeeeecccceEEeecccC Q lcl|Aclame:pro 31 YAMDAADLSPHLSSTGSSGIPNYLTTYVDPSVIDILVAPMKAAELVGESKK--GDWTTLVAAFITAEPTTTVATYGDYSS 108 (336) Q Consensus 31 ~~~da~d~~~~l~t~~~~~i~~~l~~~idp~v~~~~~~~~~~~~l~~v~t~--g~w~~~t~~~~v~e~~G~a~~ygd~~D 108 (336) ||. +.-+|..+..-|...+ ........|+..... +..+ .++.++.....+.+.--+.... T Consensus 1 MA~-------------~~~~pe~~~~~v~~~~----~~~lv~~~l~~~~~~~~~~~G-dtv~ip~~~~~~~~d~~~~~~~ 62 (273) T protein:vir:10 1 MAF-------------NNFIPELWSDMLLEEW----TAQTVFANLVNREYEGTASKG-NVVHIAGVVAPTVKDYKAAGRQ 62 (273) T ss_pred Ccc-------------hhhhHHHHHHHHHHHH----HhhhccchhhccccccccccC-ceEEEeecccccccccccCCCc Confidence 111 2224555554333232 333334445443222 2222 5788887554443221121122 Q ss_pred CceeeeeeeeeeeeEE-EEEEEEEeCHHHHHHHHHhCCCHHHHHHHHHHHHHHHhhccEEEeeccccceEEEEecCCCCc Q lcl|Aclame:pro 109 DGDSGTNINYPQRQSY-FFQTWTRWGERELEMAGAGRVDLASELNYSSALGLAKFLNGSYLFGVAGLENYGLINDPSLSA 187 (336) Q Consensus 109 iP~vd~~~~~~~~~v~-~~~~~~~y~~~El~~A~~~g~~l~~~K~~aAr~a~e~~~n~i~~~Gd~~~g~~GllN~Pnl~~ 187 (336) ++.-+.+..+...++- .-..++.++..|...+ . .++.+ -...+..++.+.+++..+ + . +.+- ... T Consensus 63 ~~~~~~~~~~~~~tid~~~~~~~~i~d~d~~~~--~-~~~~~-~~~~~~~alA~~vD~~i~-~--~--~~~a----~~~- 128 (273) T protein:vir:10 63 TSADAISDTGVDLLIDQEKSIDFLVDDIDRVQV--A-GSLEA-YTRAGATALATDTDKFIA-D--M--LVDN----GTA- 128 (273) T ss_pred cCccccccceEEEEEeeeeecceEeecHHHhhh--h-ccHHH-HHHHHHHHHHHHHHHHHH-H--H--Hhcc----ccc- Confidence 2223333333344442 2344555554443322 2 24533 233344455555553221 0 0 0000 000 Q ss_pred ccccccccccccCHHHHHHHHHHHHHHHHHHhCCceeccCCcEEEecHHHHHhcccCCCC----Ccc-HHHHHHH----h Q lcl|Aclame:pro 188 PITATTPWSGSPAVEAVVNEVVTLFQVLQTQSQGIITQEAVLHMGLPPTAMSDLSKTNQY----GLS-AAAKLKE----I 258 (336) Q Consensus 188 ~~~~~t~w~~~~T~~eI~~Di~~l~~~l~~~t~g~v~~~~p~tL~Lp~~~~~~Ls~~~~~----~~T-vl~~l~~----n 258 (336) .+.+ ...|++.+++.|.++...+-..- + +...-.|+++|..+..|.+.+++ ... -..-+++ + T Consensus 129 -~~~~----~~~~~~~~~~~i~~a~~~ld~~~---v-P~~~R~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~~G~ig~ 199 (273) T protein:vir:10 129 -LTGS----APTDADDAFDLIAKALKELTKAN---V-PNVGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGN 199 (273) T ss_pred -cccc----cccchhHHHHHHHHHHHHhhhcC---C-CcCCCEEEECHHHHHHHhcchhhhhhhhccccccceeeeeeeE Confidence 0101 12356778888888877764332 1 33345899999999988543210 000 0011111 1 Q ss_pred CCccEEEEcccccCCCCceEEEEEEeeCCCceEEEEeCc---hhhcccceecCCceEEeeecceeeeEEecccceeeecc Q lcl|Aclame:pro 259 FPKLEFVTIPEYDTASGRLVQLWAPRVEGKDTATCGFTE---KMRAHSIERYSSYFRQKKSAGTWGAVIFRPFAVAQMIG 335 (336) Q Consensus 259 ~pnl~i~~~pel~~a~G~~~~~~~~~~~~~~~~~~~~p~---~~~~l~~~~~~~~~~v~~~~rt~Gv~ir~P~ai~~~~G 335 (336) .-+++|.....+-..++.....+.... +.++. .+..+..+.+. .-.+... -..|+-+.||-+++.+.= T Consensus 200 i~G~~v~~s~~lp~~~~~~~~~~~~~A-------~~~a~q~~~~e~~r~~~~~-~~~v~~~-~~yg~~v~~~~~~~~l~~ 270 (273) T protein:vir:10 200 LLGARIVESNNLRDTDDEQFVAFHPSA-------AAYVSQIDTVEALRDQDSF-SDRIRAL-HVYGGKVVRPTGVVVFNK 270 (273) T ss_pred EeceEEEEecccccCCccEEEEEeccc-------eeeeeeeehhhcccCCCcc-eeeeeee-eeeeeeEeccceEEEEec Confidence 223455554444322333333222111 11222 12222222222 2222222 236777778888776543 Q ss_pred C Q lcl|Aclame:pro 336 V 336 (336) Q Consensus 336 I 336 (336) = T Consensus 271 ~ 271 (273) T protein:vir:10 271 T 271 (273) T ss_pred c Confidence 3 No 141 >protein:vir:962 Length: 397 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:19 # MgeName: bIL285 # Cross-refs: genbank:acc:NP_076616;genbank:gi:13095724;genbank:GeneID:920264 Probab=20.59 E-value=2.7 Score=18.18 Aligned_cols=289 Identities=9% Similarity=-0.004 Sum_probs=112.0 Q ss_pred CchHHH-HHHHhhcceeccchh---hhhhhhhhhhhhhhhhhcCccccCCcchHHHHHHHhhCceeeeeeccccchhhhc Q lcl|Aclame:pro 1 MRDAQR-IQNLARAGVILPRSV---KNVSTPLAEYAMDAADLSPHLSSTGSSGIPNYLTTYVDPSVIDILVAPMKAAELV 76 (336) Q Consensus 1 m~~~~~-~~~l~~~g~~~~~~~---~~~~~~~~~~~~da~d~~~~l~t~~~~~i~~~l~~~idp~v~~~~~~~~~~~~l~ 76 (336) .+.... -...++......... ..+........... ....-.......+|..+.+.| .+. -.......++ T Consensus 93 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~vp~~~~~~i----~~~-~~~~~l~~~~ 165 (397) T protein:vir:96 93 QKPKDGEKRKMKKFKVTEEELAEKRSAINAFVKSKGAEK--RDGFTSVEGGALIPQELLQPQ----LEP-KDIVDLSKYV 165 (397) T ss_pred hhhHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHhhhhhh--hhcccccccccchhHHHHHHH----HHh-hhhhhHHHhh Confidence 000000 000000000000000 00000011111111 111112233344555544333 332 1222223333 Q ss_pred ccccCCCcceeeEEEeeeec-ccceEEeecccCCce-eeeeeeeeeeeEEEEEEEEEeCHHHHHHHHHhCCCHHHHHHHH Q lcl|Aclame:pro 77 GESKKGDWTTLVAAFITAEP-TTTVATYGDYSSDGD-SGTNINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASELNYS 154 (336) Q Consensus 77 ~v~t~g~w~~~t~~~~v~e~-~G~a~~ygd~~DiP~-vd~~~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~K~~a 154 (336) .+.+. ......+++... .+.+...+.....|- .+.......-.++.++..+.+|.+=++.+. .++.+.-... T Consensus 166 ~~~~~---~~~~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~~~~i~~~~~~~~~~~~~s~ell~ds~---~~l~~~i~~~ 239 (397) T protein:vir:96 166 RSVPV---NSASGKFPVISKSGSKMATVQQLEKNPQLANPKMVEIDYSVATRRGYIPISQEMIDDAS---YDVTGLIADE 239 (397) T ss_pred hhccc---cccceeEEEEeccCCccccccccccccccccccccceeecHhHhhcchhhHHHHHhhhH---HHHHHHHHHH Confidence 32211 122344555443 345556677777763 566666667777777777777765555433 3455555566 Q ss_pred HHHHHHHhhccEEEeeccccceEEEEecCCCCcccccccccccccCHHHHHHHHHHHHHHHHHHhCCceeccCCcEEEec Q lcl|Aclame:pro 155 SALGLAKFLNGSYLFGVAGLENYGLINDPSLSAPITATTPWSGSPAVEAVVNEVVTLFQVLQTQSQGIITQEAVLHMGLP 234 (336) Q Consensus 155 Ar~a~e~~~n~i~~~Gd~~~g~~GllN~Pnl~~~~~~~t~w~~~~T~~eI~~Di~~l~~~l~~~t~g~v~~~~p~tL~Lp 234 (336) .+.++...++.-.+.|+.... +.+ ..| ++||.+++....... ..-+++|. T Consensus 240 l~~~~~~~~~~~i~~g~g~~~---------------~~~----~~~----~d~~~~~~~~~~~~~-------~~a~~v~n 289 (397) T protein:vir:96 240 IQDQSLNTKNADIAAVLKTAT---------------AKS----VVG----VDGLKDLINKEIKKV-------YDVKLFIS 289 (397) T ss_pred HHHHHHHHHHHHHhhcccccc---------------ccc----ccc----hHHHHHHHHHhhhhh-------cCcEEEEc Confidence 666666666665555543211 111 123 334555544322211 12369999 Q ss_pred HHHHHhccc-CCCCCccHHH-HHHHhCC----ccEEEEccc--ccCCCCceEEEEEEeeCCCceEEEEeCchhhccccee Q lcl|Aclame:pro 235 PTAMSDLSK-TNQYGLSAAA-KLKEIFP----KLEFVTIPE--YDTASGRLVQLWAPRVEGKDTATCGFTEKMRAHSIER 306 (336) Q Consensus 235 ~~~~~~Ls~-~~~~~~Tvl~-~l~~n~p----nl~i~~~pe--l~~a~G~~~~~~~~~~~~~~~~~~~~p~~~~~l~~~~ 306 (336) ++.+..|.. .+..|.-++. -+...-| +..++..+. +.++.|..+.+|.+- .+...+..-+.+....... T Consensus 290 ~~~~~~l~~lkd~~G~~~~~~~~~~~~~~~l~G~pv~~~~~~~~~~~~~~~~~~~gd~---~~~~~~~~~~~~~~~~~~~ 366 (397) T protein:vir:96 290 ASMYSELDKLKDKNGRYLLQDSITAASGKQLLGKEVVVLDDDVIGKSVGNVVGFIGDA---KAFASFFDRKQVSVSWVDN 366 (397) T ss_pred HHHHHHHHHhhccCCCeEeccCccCCCcccccccceEEecccccCCCCCceEEEEeeh---hcceEeEeecceEEEEecc Confidence 999998854 3444443321 1111111 122332222 122333333333321 1000000001111110000 Q ss_pred cCCceEEeeecceeeeEEecccceeeecc-C Q lcl|Aclame:pro 307 YSSYFRQKKSAGTWGAVIFRPFAVAQMIG-V 336 (336) Q Consensus 307 ~~~~~~v~~~~rt~Gv~ir~P~ai~~~~G-I 336 (336) ..-....-.+.|.+| .++.|-||+.+.- + T Consensus 367 ~~~~~~~~~~~r~d~-~~~~~~a~~~~~~~~ 396 (397) T protein:vir:96 367 NIYGQLLAGIIRYDV-KATDKKAGFYVTFTI 396 (397) T ss_pred cccceeEEEEEEEcc-EEecccceEEEEeec Confidence 011122334456555 5668888887752 2 Done!