Query lcl|Aclame:protein:vir:99576|NCBI_annot:hypothetical protein|genbank:acc:YP_001039801;genbank:gi:126011051;genbank:GeneID:4818271 Match_columns 388 No_of_seqs 106 out of 110 Neff 6.3 Searched_HMMs 1612 Date Mon Dec 2 20:17:19 2013 Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_35 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_35_vs_rec_db.hhr No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 protein:vir:99576 Length: 388 100.0 8E-147 5E-150 821.5 34.2 388 1-388 1-388 (388) 2 protein:vir:96079 Length: 382 100.0 1E-131 7E-135 738.4 30.1 378 1-388 1-382 (382) 3 protein:vir:107732 Length: 379 100.0 1E-126 6E-130 711.4 32.9 370 1-388 1-379 (379) 4 protein:vir:106734 Length: 336 100.0 2E-115 1E-118 650.0 31.2 336 29-388 1-336 (336) 5 protein:vir:78558 Length: 336 100.0 9E-115 5E-118 645.9 31.1 336 29-388 1-336 (336) 6 protein:vir:101557 Length: 336 100.0 4E-114 3E-117 642.2 31.0 336 29-388 1-336 (336) 7 protein:vir:3643 Length: 336 # 100.0 6E-114 4E-117 641.4 31.2 336 29-388 1-336 (336) 8 protein:vir:94070 Length: 339 100.0 7E-114 4E-117 640.9 30.7 337 26-388 1-339 (339) 9 protein:vir:79642 Length: 329 100.0 6.4E-92 4E-95 520.5 28.1 319 40-388 1-326 (329) 10 protein:vir:104342 Length: 314 100.0 9E-90 5.6E-93 508.8 28.2 309 41-388 1-311 (314) 11 protein:vir:80068 Length: 301 100.0 2.4E-88 1.5E-91 500.9 27.5 296 74-388 1-301 (301) 12 protein:vir:107687 Length: 319 100.0 1.1E-87 7.1E-91 497.2 27.9 316 30-388 1-319 (319) 13 protein:vir:5255 Length: 304 # 100.0 5.4E-85 3.4E-88 482.6 27.4 292 77-387 1-304 (304) 14 protein:vir:103285 Length: 296 100.0 6.2E-85 3.9E-88 482.2 25.9 291 65-388 1-293 (296) 15 protein:vir:7771 Length: 330 # 98.5 6.7E-09 4.2E-12 65.4 13.2 297 62-388 1-321 (330) 16 protein:vir:105778 Length: 358 98.5 3.6E-09 2.2E-12 66.9 10.8 332 1-388 1-357 (358) 17 protein:vir:1638 Length: 298 # 98.4 2E-08 1.2E-11 62.8 12.3 279 65-388 1-297 (298) 18 protein:vir:1433 Length: 435 # 98.3 1.3E-07 8.1E-11 58.3 14.8 348 1-388 65-431 (435) 19 protein:vir:9574 Length: 300 # 98.3 4.7E-08 2.9E-11 60.8 12.2 285 65-388 1-298 (300) 20 protein:vir:2504 Length: 305 # 98.2 2.2E-07 1.3E-10 57.1 14.4 283 72-388 1-296 (305) 21 protein:vir:94771 Length: 298 98.2 7.2E-08 4.5E-11 59.7 11.8 279 65-388 1-297 (298) 22 protein:vir:9759 Length: 303 # 98.2 1.4E-07 8.7E-11 58.2 12.9 284 65-388 1-301 (303) 23 protein:vir:104085 Length: 320 98.1 9.1E-08 5.6E-11 59.2 11.5 297 44-388 1-315 (320) 24 protein:vir:80376 Length: 435 98.1 5E-07 3.1E-10 55.1 15.4 352 1-388 65-431 (435) 25 protein:vir:5739 Length: 366 # 98.1 1.5E-07 9.3E-11 58.0 12.5 345 1-388 1-364 (366) 26 protein:vir:8187 Length: 311 # 98.1 1.5E-07 9E-11 58.1 12.2 290 64-388 1-308 (311) 27 protein:vir:94142 Length: 304 98.0 5.7E-07 3.5E-10 54.8 14.0 286 62-388 1-303 (304) 28 protein:vir:105905 Length: 304 98.0 5.7E-07 3.5E-10 54.8 14.0 286 62-388 1-303 (304) 29 protein:vir:99920 Length: 311 98.0 3.3E-07 2E-10 56.1 12.3 294 64-388 1-310 (311) 30 protein:vir:78523 Length: 338 98.0 1E-06 6.2E-10 53.5 14.4 304 21-388 1-333 (338) 31 protein:vir:95763 Length: 297 98.0 1.2E-06 7.4E-10 53.1 14.8 283 62-388 1-294 (297) 32 protein:vir:80684 Length: 315 98.0 8.9E-07 5.5E-10 53.8 13.7 288 65-388 1-304 (315) 33 protein:vir:96392 Length: 324 98.0 1.4E-06 8.6E-10 52.7 14.6 304 16-388 1-313 (324) 34 protein:vir:78830 Length: 324 98.0 1.4E-06 8.6E-10 52.7 14.6 304 16-388 1-313 (324) 35 protein:vir:9309 Length: 324 # 97.9 1.9E-06 1.2E-09 52.0 14.1 305 21-388 1-313 (324) 36 protein:vir:108211 Length: 318 97.9 1.3E-07 7.9E-11 58.4 7.6 279 30-388 1-315 (318) 37 protein:vir:103955 Length: 324 97.9 3.9E-06 2.4E-09 50.2 15.7 302 16-388 1-313 (324) 38 protein:vir:41 Length: 299 # N 97.9 1.7E-06 1.1E-09 52.2 13.6 281 65-388 1-296 (299) 39 protein:vir:78223 Length: 333 97.8 1.2E-06 7.7E-10 53.0 12.4 306 21-388 1-330 (333) 40 protein:vir:97148 Length: 324 97.8 5.6E-06 3.5E-09 49.4 15.9 304 16-388 1-313 (324) 41 protein:vir:99749 Length: 324 97.8 5.4E-06 3.3E-09 49.5 15.8 302 16-388 1-313 (324) 42 protein:vir:96223 Length: 324 97.8 3.3E-06 2.1E-09 50.6 14.3 304 23-388 1-313 (324) 43 protein:vir:105038 Length: 428 97.3 6.9E-05 4.3E-08 43.4 15.7 352 1-388 43-426 (428) 44 protein:vir:4226 Length: 326 # 97.3 2.4E-05 1.5E-08 45.9 13.2 306 21-388 1-321 (326) 45 protein:vir:2430 Length: 318 # 97.3 2.5E-05 1.6E-08 45.8 12.9 295 38-388 1-311 (318) 46 protein:vir:2344 Length: 397 # 97.2 2E-05 1.2E-08 46.3 11.7 288 41-388 1-304 (397) 47 protein:vir:94673 Length: 419 97.1 3.9E-05 2.4E-08 44.8 12.6 334 1-388 50-415 (419) 48 protein:vir:8420 Length: 477 # 96.9 5.6E-05 3.5E-08 43.9 11.6 346 1-388 93-469 (477) 49 protein:vir:7855 Length: 497 # 96.4 0.00044 2.7E-07 39.0 13.4 349 1-388 67-491 (497) 50 protein:vir:101650 Length: 497 96.4 0.00044 2.7E-07 39.0 13.4 349 1-388 67-491 (497) 51 protein:vir:97053 Length: 390 96.3 0.00012 7.2E-08 42.2 9.8 327 1-388 44-390 (390) 52 protein:vir:8102 Length: 543 # 96.0 0.00015 9.2E-08 41.6 8.9 343 1-388 173-540 (543) 53 protein:vir:100135 Length: 418 96.0 0.0011 7E-07 36.8 14.1 317 1-388 84-413 (418) 54 protein:vir:191 Length: 385 # 96.0 0.0001 6.4E-08 42.4 7.8 342 1-388 1-382 (385) 55 protein:vir:1886 Length: 385 # 96.0 0.0001 6.4E-08 42.4 7.8 342 1-388 1-382 (385) 56 protein:vir:10364 Length: 390 95.9 0.00025 1.5E-07 40.3 9.6 343 1-388 19-390 (390) 57 protein:vir:6212 Length: 434 # 95.6 0.00025 1.6E-07 40.3 8.6 331 1-388 73-427 (434) 58 protein:vir:97255 Length: 310 95.6 0.00025 1.6E-07 40.3 8.5 277 67-388 1-308 (310) 59 protein:vir:100247 Length: 425 95.6 0.00035 2.2E-07 39.5 9.2 342 1-388 71-422 (425) 60 protein:vir:4700 Length: 415 # 95.4 0.0012 7.6E-07 36.6 11.6 321 1-388 71-402 (415) 61 protein:vir:4600 Length: 415 # 95.4 0.0012 7.6E-07 36.6 11.6 321 1-388 71-402 (415) 62 protein:vir:4456 Length: 401 # 95.3 0.00061 3.8E-07 38.2 9.7 343 1-388 51-399 (401) 63 protein:vir:485 Length: 407 # 95.1 0.001 6.4E-07 37.0 10.5 345 1-388 40-398 (407) 64 protein:vir:96123 Length: 274 95.1 0.0027 1.7E-06 34.7 13.2 255 65-388 1-268 (274) 65 protein:vir:93616 Length: 645 95.0 0.0014 8.9E-07 36.2 10.8 335 1-388 272-637 (645) 66 protein:vir:98339 Length: 415 94.9 0.002 1.2E-06 35.4 11.4 324 1-388 68-402 (415) 67 protein:vir:81100 Length: 415 94.9 0.002 1.2E-06 35.4 11.4 324 1-388 68-402 (415) 68 protein:vir:79987 Length: 415 94.9 0.002 1.2E-06 35.4 11.4 324 1-388 68-402 (415) 69 protein:vir:104256 Length: 458 94.8 0.0034 2.1E-06 34.1 13.2 333 1-388 99-456 (458) 70 protein:vir:9410 Length: 415 # 94.8 0.00084 5.2E-07 37.5 9.0 342 1-388 41-402 (415) 71 protein:vir:81227 Length: 413 94.8 0.0035 2.2E-06 34.0 13.3 332 1-388 51-408 (413) 72 protein:vir:81070 Length: 390 94.7 0.0013 7.9E-07 36.5 9.9 326 1-388 54-390 (390) 73 protein:vir:4339 Length: 395 # 94.7 0.0031 1.9E-06 34.4 11.9 315 1-388 36-393 (395) 74 protein:vir:94933 Length: 330 93.7 0.0037 2.3E-06 33.9 10.3 298 1-388 1-327 (330) 75 protein:vir:94494 Length: 274 93.6 0.0071 4.4E-06 32.4 12.8 257 65-388 1-268 (274) 76 protein:vir:97433 Length: 274 93.6 0.0071 4.4E-06 32.4 12.8 257 65-388 1-268 (274) 77 protein:vir:4197 Length: 314 # 93.1 0.0087 5.4E-06 31.9 13.4 292 35-388 1-310 (314) 78 protein:vir:80930 Length: 278 93.0 0.009 5.6E-06 31.8 12.1 263 65-388 1-275 (278) 79 protein:vir:96833 Length: 275 92.5 0.0032 2E-06 34.2 8.1 256 65-388 1-275 (275) 80 protein:vir:93742 Length: 274 92.4 0.012 7.2E-06 31.2 12.7 257 65-388 1-268 (274) 81 protein:vir:3613 Length: 272 # 92.3 0.012 7.5E-06 31.1 11.2 259 65-388 1-272 (272) 82 protein:vir:4159 Length: 315 # 91.3 0.016 1E-05 30.4 13.2 298 50-388 1-315 (315) 83 protein:vir:79078 Length: 307 91.1 0.0088 5.4E-06 31.9 8.9 268 64-388 1-300 (307) 84 protein:vir:96762 Length: 632 90.7 0.019 1.2E-05 30.0 10.4 329 1-388 269-631 (632) 85 protein:vir:9820 Length: 272 # 90.4 0.021 1.3E-05 29.8 14.2 252 65-388 1-267 (272) 86 protein:vir:3033 Length: 272 # 90.4 0.021 1.3E-05 29.8 14.2 252 65-388 1-267 (272) 87 protein:vir:99888 Length: 309 90.1 0.023 1.4E-05 29.6 11.6 268 60-388 1-301 (309) 88 protein:vir:107882 Length: 307 89.9 0.011 6.6E-06 31.4 8.3 269 64-388 1-300 (307) 89 protein:vir:4856 Length: 293 # 89.7 0.025 1.5E-05 29.4 13.5 270 63-388 1-279 (293) 90 protein:vir:1328 Length: 392 # 88.4 0.02 1.3E-05 29.9 8.7 331 1-388 44-389 (392) 91 protein:vir:102655 Length: 322 84.6 0.059 3.7E-05 27.3 11.0 282 65-388 1-319 (322) 92 protein:vir:6242 Length: 390 # 83.1 0.071 4.4E-05 26.9 9.9 334 1-388 44-387 (390) 93 protein:vir:102119 Length: 404 83.0 0.072 4.5E-05 26.8 13.2 329 1-388 44-398 (404) 94 protein:vir:6324 Length: 335 # 81.3 0.073 4.6E-05 26.8 8.4 286 30-388 1-328 (335) 95 protein:vir:1025 Length: 408 # 79.0 0.11 6.7E-05 25.9 12.3 312 1-388 72-391 (408) 96 protein:vir:105334 Length: 276 77.2 0.13 7.9E-05 25.5 11.7 253 65-388 1-268 (276) 97 protein:vir:96262 Length: 274 77.2 0.13 7.9E-05 25.5 14.7 256 65-388 1-268 (274) 98 protein:vir:95898 Length: 274 77.2 0.13 7.9E-05 25.5 14.7 256 65-388 1-268 (274) 99 protein:vir:3991 Length: 404 # 76.1 0.14 8.6E-05 25.3 12.3 325 1-388 42-391 (404) 100 protein:vir:80213 Length: 334 71.6 0.19 0.00012 24.5 8.4 297 65-388 1-332 (334) 101 protein:vir:100172 Length: 394 71.3 0.16 9.6E-05 25.0 7.3 330 1-388 1-382 (394) 102 protein:vir:8843 Length: 317 # 70.7 0.21 0.00013 24.4 10.9 281 65-388 1-313 (317) 103 protein:vir:1239 Length: 274 # 64.7 0.3 0.00018 23.5 14.2 255 65-388 1-268 (274) 104 protein:vir:3870 Length: 400 # 62.5 0.33 0.00021 23.2 8.9 318 1-388 72-397 (400) 105 protein:vir:3845 Length: 395 # 61.0 0.36 0.00022 23.0 13.2 313 1-388 43-381 (395) 106 protein:vir:4511 Length: 409 # 60.7 0.37 0.00023 23.0 12.7 340 1-388 41-404 (409) 107 protein:vir:105645 Length: 400 59.9 0.38 0.00024 22.9 10.1 290 65-388 1-331 (400) 108 protein:vir:3158 Length: 321 # 57.1 0.44 0.00027 22.5 15.6 291 21-388 1-309 (321) 109 protein:vir:99675 Length: 324 53.3 0.42 0.00026 22.7 6.1 256 107-388 1-301 (324) 110 protein:vir:95376 Length: 425 52.9 0.54 0.00034 22.0 9.9 328 1-388 63-418 (425) 111 protein:vir:7019 Length: 401 # 52.4 0.55 0.00034 22.0 9.0 295 65-388 1-335 (401) 112 protein:vir:1268 Length: 397 # 51.6 0.58 0.00036 21.9 10.4 328 1-388 39-395 (397) 113 protein:vir:7990 Length: 273 # 49.9 0.62 0.00039 21.7 13.0 256 77-388 1-271 (273) 114 protein:vir:4830 Length: 397 # 48.2 0.68 0.00042 21.5 9.8 309 1-388 63-385 (397) 115 protein:vir:95963 Length: 395 47.8 0.69 0.00043 21.5 11.4 327 1-388 25-373 (395) 116 protein:vir:739 Length: 231 # 46.3 0.74 0.00046 21.3 11.5 220 112-388 1-231 (231) 117 protein:vir:95107 Length: 270 45.2 0.78 0.00048 21.2 12.9 250 72-388 1-263 (270) 118 protein:vir:78935 Length: 335 41.8 0.91 0.00056 20.8 9.1 287 65-388 1-328 (335) 119 protein:vir:106647 Length: 303 41.2 0.94 0.00058 20.7 9.2 269 65-388 1-288 (303) 120 protein:vir:80128 Length: 466 40.9 0.95 0.00059 20.7 10.7 334 1-388 84-446 (466) 121 protein:vir:103323 Length: 364 39.6 1 0.00063 20.6 11.0 291 65-388 1-337 (364) 122 protein:vir:7409 Length: 408 # 39.6 1 0.00063 20.6 12.5 324 1-388 39-391 (408) 123 protein:vir:1383 Length: 421 # 38.4 1.1 0.00066 20.4 12.0 311 1-388 54-392 (421) 124 protein:vir:105822 Length: 273 37.0 1.1 0.00071 20.3 14.3 257 77-388 1-271 (273) 125 protein:vir:102605 Length: 273 37.0 1.1 0.00071 20.3 14.3 257 77-388 1-271 (273) 126 protein:vir:96490 Length: 348 35.4 1.2 0.00077 20.1 6.7 287 65-388 1-330 (348) 127 protein:vir:9927 Length: 295 # 34.3 1.3 0.00081 20.0 9.2 259 62-388 1-280 (295) 128 protein:vir:102873 Length: 392 29.0 1.7 0.0011 19.3 11.7 333 1-388 35-382 (392) 129 protein:vir:102082 Length: 392 29.0 1.7 0.0011 19.3 11.7 333 1-388 35-382 (392) 130 protein:vir:105004 Length: 392 29.0 1.7 0.0011 19.3 11.7 333 1-388 35-382 (392) 131 protein:vir:107593 Length: 392 29.0 1.7 0.0011 19.3 11.7 333 1-388 35-382 (392) 132 protein:vir:93881 Length: 387 27.7 1.8 0.0011 19.2 11.0 314 1-388 43-379 (387) 133 protein:vir:95318 Length: 328 27.6 1.8 0.0011 19.2 9.1 223 67-388 1-239 (328) 134 protein:vir:94622 Length: 341 27.4 1.8 0.0011 19.1 9.8 294 65-388 1-337 (341) 135 protein:vir:101607 Length: 379 20.4 2.8 0.0017 18.2 12.7 313 1-388 39-379 (379) 136 protein:vir:107388 Length: 331 20.1 2.8 0.0018 18.1 8.1 222 67-388 1-240 (331) 137 protein:vir:107826 Length: 331 20.1 2.8 0.0018 18.1 8.1 222 67-388 1-240 (331) 138 protein:vir:98525 Length: 331 20.1 2.8 0.0018 18.1 8.1 222 67-388 1-240 (331) No 1 >protein:vir:99576 Length: 388 # NCBI annotation: hypothetical protein # Family: family:all:1653 # MgeID: mge:1544 # MgeName: BcepF1 # Cross-refs: genbank:acc:YP_001039801;genbank:gi:126011051;genbank:GeneID:4818271 Probab=100.00 E-value=8.1e-147 Score=821.51 Aligned_cols=388 Identities=100% Similarity=1.456 Sum_probs=383.3 Q ss_pred CCCcceeeeecCccccchhhhhhcccccccccCCHHHHhhcceecccchhhcchhhhhhhhhhhhccCcccccccccccc Q lcl|Aclame:pro 1 MKQLSKVHQSLAGRSVRAFDMANGKADYRLTDMAVRELKKFGLVFDHATVKRQIELLHEGGVATQAFDSAYVAPTTQASI 80 (388) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~amDaa~~~~~t~~~~ 80 (388) |+|+||+||+|+||++||++|+++++++++|+++++||+|+||+||+++.++..+++.+.+++++||||++++++|++|+ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~a~da~~~~~~t~~~~ 80 (388) T protein:vir:99 1 MKQLSKVHQSLAGRSVRAFDMANGKADYRLTDMAVRELKKFGLVFDHATVKRQIELLHEGGVATQAFDSAYVAPTTQASI 80 (388) T ss_pred CCCccceeeecCCcccchhhhhcCCcceeeechhhHhhhhcceeccCccchhhhhhhhhhhhhhcccCcccccccccCcc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hHHHHHHHhhcceeeeecccchhhhhhcccccCCCCceeeEEEeeeccccceEecccccCCceeeeeeeeeeeeEEEEEE Q lcl|Aclame:pro 81 PTPIQFLQQWLPGFVKVLTSARKIDEILGVKTVGSWEDQEIVQGIVEPAGTAMEYGDLTNIPLSSWNVNFERRTIVRGEM 160 (388) Q Consensus 81 g~l~~~l~~idp~v~e~l~~~~~~~~i~~v~t~g~w~~~t~~~~v~e~~G~a~~ygd~~diP~~~~n~~~~~~~v~~~~~ 160 (388) |+|++||+||||+|||++++|+++++||||+|+|+|++++++|+++|.+|+|++|||++|+|++|+|+++++|++|++++ T Consensus 81 gip~~~~~~~~p~~~~~~~~p~~~~~l~pv~t~g~W~~~~~~f~v~e~~G~A~~ygd~~D~Pl~d~~~~~~~r~v~~~~~ 160 (388) T protein:vir:99 81 PTPIQFLQQWLPGFVKVLTSARKIDEILGVKTVGSWEDQEIVQGIVEPAGTAMEYGDLTNIPLSSWNVNFERRTIVRGEM 160 (388) T ss_pred cHHHHHhhhhccceeeeeechhhhhhhccccccCCccceeEEEeeeecceeEEEeecccCCCceeccceeeeeeEEEEEe Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEEeecCccccceEEEeecCCCccccccccCCcccccccCCH Q lcl|Aclame:pro 161 GIQVGLLEEGRASAMRINSAEVKRQGAAVQLEIMRNAIGFYGWEGKNGNRTFGFLNDPSLLPAIASTTPGGWVSGGANAF 240 (388) Q Consensus 161 ~~~y~~~El~~A~~~g~~l~~~K~~aAr~a~~~~~n~i~~~G~a~~~~~g~~GllN~P~l~a~~~~~~~~~~t~Wa~kT~ 240 (388) ||+|+++|+++|+++|++|+++|+.+||+++|+++|+++|||+++++.+++|||||||||++++++++.+++++|++||+ T Consensus 161 g~~yg~~El~~A~~~g~~l~~~Ka~AA~~ale~~~N~i~f~G~~g~~~~~~yGllNdP~l~a~v~at~~~~~~~Wa~kT~ 240 (388) T protein:vir:99 161 GIQVGLLEEGRASAMRINSAEVKRQGAAVQLEIMRNAIGFYGWEGKNGNRTFGFLNDPSLLPAIASTTPGGWVSGGANAF 240 (388) T ss_pred eeeecHHHHHHHHhhCCCcHHHHHHHHHHHHHhhhceEEEEeecCCCccceEEEeeCCCcccccccccCCcCcccccCCH Confidence 99999999999999999999999999999999999999999999987789999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHHHHHhcCCeeccccccceEEcCHHHHHhhccCCCcCccHHHHHHHhCCccEEEEccccccccCCCCcc Q lcl|Aclame:pro 241 QGIVGDLRLMLITLRVQSEDNIDPEDVDITLVLPMNKVDMLSVVTDLGISVRDWLKQTYPRVRVMSAPELQGGNPDDGKD 320 (388) Q Consensus 241 ~eI~~DI~~~~~~l~~~s~g~v~~~~~p~tL~Lp~~~~~~Ls~~~~~~~Tvl~~lk~n~pnl~i~~~pel~~a~gtg~~~ 320 (388) +||++||+.++++|+.||+|+++++++|++|+||++++.+|+++|++|+||++|||+|||||+|+++|||++++++|+++ T Consensus 241 ~eI~~Di~~~~~~i~~qs~g~~~~~~~~~tL~LP~~~~~~Ls~~n~~g~Tvl~~lk~n~Pnl~i~t~pEl~~a~~tgg~~ 320 (388) T protein:vir:99 241 QGIVGDLRLMLITLRVQSEDNIDPEDVDITLVLPMNKVDMLSVVTDLGISVRDWLKQTYPRVRVMSAPELQGGNPDDGKD 320 (388) T ss_pred HHHHHHHHHHHHHHHHhcCCeeeecccceEEEechHHHHhccccCcCCccHHHHHHHhcCCcEEEEecccccccccCCce Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEEEEEcccccccccccCCCcceEeecchhhhccCceeccCceEEecccceeeeeeeccccceeeccC Q lcl|Aclame:pro 321 IAYMFLDSVDTAVDGSTDGGDTWAQLVQSKFVTLGVEKRVKNYVEAYSNATAGVMLKRPWAVVRLIGL 388 (388) Q Consensus 321 ~~~~~~~~~d~~~~~~~~~~~t~~~~~p~~~r~~~v~~~~~~~~~~~~~~t~G~ii~rP~ai~~~~GI 388 (388) ++++|+++++.+.++++++++++.|.||++||+||+|++.++|++||++|||||+||||+||++++|| T Consensus 321 ~~~~~~~~~~~~~~~~~~~~~t~~~~~p~~~~~l~vq~~~~~~~~~~~~rt~Gv~ir~P~Ai~~~~GI 388 (388) T protein:vir:99 321 IAYMFLDSVDTAVDGSTDGGDTWAQLVQSKFVTLGVEKRVKNYVEAYSNATAGVMLKRPWAVVRLIGL 388 (388) T ss_pred eEEEEecccccccccCccCcceeEEecccccccccceecCceeEeccccceeeeEEeccchhheeccC Confidence 99999999999999999999999999999999999999999999999999999999999999999999 No 2 >protein:vir:96079 Length: 382 # NCBI annotation: hypothetical protein ORF023 # Family: family:all:1653 # MgeID: mge:1597 # MgeName: F8 # Cross-refs: genbank:acc:YP_001294440;genbank:gi:149408337;genbank:GeneID:5237198 Probab=100.00 E-value=1.2e-131 Score=738.44 Aligned_cols=378 Identities=61% Similarity=0.996 Sum_probs=355.1 Q ss_pred CCCcceeeeecCccccchhhhhhcccccccccCCHHHHhhcceecccchhhcchhhhhh--hhhhhhccCcccccccccc Q lcl|Aclame:pro 1 MKQLSKVHQSLAGRSVRAFDMANGKADYRLTDMAVRELKKFGLVFDHATVKRQIELLHE--GGVATQAFDSAYVAPTTQA 78 (388) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~--~~~~~~amDaa~~~~~t~~ 78 (388) |+|+||+||||+||.|||+++.+ .++.++++|+|+||+||++..+.+.+...+ ......||||+..+++|++ T Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~------~~~~~~~~l~~~gi~~~~~~~~~~~~~~~~~~~~~~~~amDa~~~~~~t~~ 74 (382) T protein:vir:96 1 MSHISKTHSRLAGRHAKPFDLKN------VTHEAVAALGRIGLVFDHAVVQDQIKALAKAGAFRSGSAMDSNFTAPVTTP 74 (382) T ss_pred CCCcceeeeecCCccccchhhhc------ccHHHHHHHhccccccCcccchhHhhhhhhhhhhhhhcccccccCCccccC Confidence 99999999999999999999866 466679999999999999988777666554 3346789999999999999 Q ss_pred cchHHHHHHHhhcceeeeecccchhhhhhcccccCCCCceeeEEEeeeccccceEecccccCCceeeeeeeeeeeeEEEE Q lcl|Aclame:pro 79 SIPTPIQFLQQWLPGFVKVLTSARKIDEILGVKTVGSWEDQEIVQGIVEPAGTAMEYGDLTNIPLSSWNVNFERRTIVRG 158 (388) Q Consensus 79 ~~g~l~~~l~~idp~v~e~l~~~~~~~~i~~v~t~g~w~~~t~~~~v~e~~G~a~~ygd~~diP~~~~n~~~~~~~v~~~ 158 (388) |+|+|++||+||||++||++|+||++++||||+|+|+|++++++|+++|.+|+|++|||++|+|++|+++++++|++|++ T Consensus 75 ~~g~p~~~l~~~~p~~~~~~~~p~~~~~l~pv~t~g~W~~~t~ty~~~e~~G~A~~ygd~~D~Pl~d~~~~~~~r~v~~~ 154 (382) T protein:vir:96 75 SIPTPIQFLQTWLPGFVKVMTAARKIDEIIGIDTVGSWEDQEIVQGIVEPAGTAVEYGDHTNIPLTSWNANFERRTIVRG 154 (382) T ss_pred CccHHHHHHhhhhhhhhhhhhhhhhhhhhccccccCCccceEEEEeeeecccceEEeecccCCCccccccceeEEEEEEE Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEEEeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEEeecCccccceEEEeecCCCccccccccCCcccccccC Q lcl|Aclame:pro 159 EMGIQVGLLEEGRASAMRINSAEVKRQGAAVQLEIMRNAIGFYGWEGKNGNRTFGFLNDPSLLPAIASTTPGGWVSGGAN 238 (388) Q Consensus 159 ~~~~~y~~~El~~A~~~g~~l~~~K~~aAr~a~~~~~n~i~~~G~a~~~~~g~~GllN~P~l~a~~~~~~~~~~t~Wa~k 238 (388) ++||+|+.+|+++|+++|++|.++|+.+||+++|+++|+++|||+.++..+|+||||||||||+.+++. .++|++| T Consensus 155 ~~g~~yg~lE~~rAa~~~~~l~~~Ka~aA~~ale~~~N~i~f~G~~~g~~~~~yGllNdP~l~a~~t~a----~~~Wa~k 230 (382) T protein:vir:96 155 ELGLLVGTLEEGRASAIRLNSAETKRQQAAIGLEIFRNAIGFYGWQSGLGNRTYGFLNDPNLPPFQTPP----SQGWATA 230 (382) T ss_pred EEeeeecHHHHHHHHhhCCCcHHHHHHHHHHHHHHhhceEEEEeeecCcCcceEEEEeCCCcccccccC----CCCcccc Confidence 999999999999999999999999999999999999999999998665567999999999998765432 3559999 Q ss_pred CHHHHHHHHHHHHHHHHHhcCCeeccccccceEEcCHHHHHhhccCCCcCccHHHHHHHhCCccEEEEcccccccc--CC Q lcl|Aclame:pro 239 AFQGIVGDLRLMLITLRVQSEDNIDPEDVDITLVLPMNKVDMLSVVTDLGISVRDWLKQTYPRVRVMSAPELQGGN--PD 316 (388) Q Consensus 239 T~~eI~~DI~~~~~~l~~~s~g~v~~~~~p~tL~Lp~~~~~~Ls~~~~~~~Tvl~~lk~n~pnl~i~~~pel~~a~--gt 316 (388) |++||++||++++++|++||+|+++++.+|++|+|||+++.+|+++|++|+||++|||+|||||+|+++|||++++ +. T Consensus 231 T~~eI~~Di~~l~~~i~~qt~G~~~~~~~~~~L~LP~~~~~~Ls~~n~~g~Tvl~~lk~n~Pnl~i~t~peL~~a~~~g~ 310 (382) T protein:vir:96 231 DWAGIIGDIREAVRQLRIQSQDQIDPKAEKITMALATSKVDYLSVTTPYGISVSDWIEQTYPKMRIVSAPELSGVQMQGK 310 (382) T ss_pred cHHHHHHHHHHHHHHHHhccCCeeeecccceEEeechHHHhhccccCccCccHHHHHHHhcCCcEEEEccccccccCCCc Confidence 9999999999999999999999999888899999999999999999999999999999999999999999998764 44 Q ss_pred CCccEEEEEEcccccccccccCCCcceEeecchhhhccCceeccCceEEecccceeeeeeeccccceeeccC Q lcl|Aclame:pro 317 DGKDIAYMFLDSVDTAVDGSTDGGDTWAQLVQSKFVTLGVEKRVKNYVEAYSNATAGVMLKRPWAVVRLIGL 388 (388) Q Consensus 317 g~~~~~~~~~~~~d~~~~~~~~~~~t~~~~~p~~~r~~~v~~~~~~~~~~~~~~t~G~ii~rP~ai~~~~GI 388 (388) |+.+++++|.++++...+++++.+.+|.|.+|++++++|+|++.++|++||++|||||+||||+||++++|| T Consensus 311 g~~~~~~~~~~e~~~~~~~s~~~p~~f~q~~p~~~~~l~ve~~~~~~~~~~s~~t~Gv~i~~P~ai~~~~GI 382 (382) T protein:vir:96 311 TPEDALVLFVEEVDASVDGSTDGGSVFSQLVQSKFITLGVEKRAKSYVEDFSNGTAGALCKRPWAVVRYLGI 382 (382) T ss_pred cceeEEEEecchhhhhcccccccCcceeccccceeeeccceeecceeEeccccceeeeEEEcchhhhhccCC Confidence 578999999999988888999999999999999999999999999999999999999999999999999999 No 3 >protein:vir:107732 Length: 379 # NCBI annotation: gp23 # Family: family:all:1653 # MgeID: mge:1520 # MgeName: BcepB1A # Cross-refs: genbank:acc:YP_024871;genbank:gi:48697513;genbank:GeneID:2948349 Probab=100.00 E-value=9.9e-127 Score=711.39 Aligned_cols=370 Identities=42% Similarity=0.681 Sum_probs=335.2 Q ss_pred CCCcceeeeecCccccchhhhhhcccccccccCCHHHHhhcceecccchhhcchhhhhhhhhhhhccCcccccc------ Q lcl|Aclame:pro 1 MKQLSKVHQSLAGRSVRAFDMANGKADYRLTDMAVRELKKFGLVFDHATVKRQIELLHEGGVATQAFDSAYVAP------ 74 (388) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~amDaa~~~~------ 74 (388) |+|+||+||||+||++|| |.+.+.|+++++ +++|+|+||+|++...... ..+..||||+..++ T Consensus 1 ~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~--~~~l~~~gi~~~~~~~~~~-------~~~~~amd~~~~~~~~~~~~ 69 (379) T protein:vir:10 1 MPQISKIHSSLNARQMTQ--MVMDSADVTLDN--LKHLESYGIHLNGRKNKLF-------ELMQFAMDSNDIGPIPTPLS 69 (379) T ss_pred CCCcceeeeecCccccch--hhhccccccHHH--HHHHHhcCccccchhhhhh-------hhhhhhhccccccccccccC Confidence 999999999999999995 778888888876 7889999999997653322 22345778774443 Q ss_pred --cccccchHHHHHHHhhcceeeeecccchhhhhhcccccCCCCceeeEEEeeeccccceEecccccCCceeeeeeeeee Q lcl|Aclame:pro 75 --TTQASIPTPIQFLQQWLPGFVKVLTSARKIDEILGVKTVGSWEDQEIVQGIVEPAGTAMEYGDLTNIPLSSWNVNFER 152 (388) Q Consensus 75 --~t~~~~g~l~~~l~~idp~v~e~l~~~~~~~~i~~v~t~g~w~~~t~~~~v~e~~G~a~~ygd~~diP~~~~n~~~~~ 152 (388) +++++.|+| +||++|.|+++|++++||++.+||||+|+|+|++++++|+++|++|+|++|||++|+|++|+++++++ T Consensus 70 ~l~~~~~~g~~-~~l~~~~p~~i~~~tap~~a~~l~pv~t~g~W~~~~~~~~v~e~~G~A~~ygd~~d~pl~d~~~~~~~ 148 (379) T protein:vir:10 70 PLSPVSIPGLI-QFLQNWLPGHVRILTAVREADEFLGLSTVGQWDDEQIVQRVLEGLGTAQPYTDGGNMALMSWTPTFET 148 (379) T ss_pred ccccccccchH-HHHHhhcchHHHHHhhhhhhhhhcccccCCCceeeeEEEeeeeeeeeeEEeccccCCCeeeeeeeeee Confidence 234455555 57777779999999999999999999999999999999999999999999999999999999999999 Q ss_pred eeEEEEEEEEeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEEeecCccccceEEEeecCCCccccc-cccCCc Q lcl|Aclame:pro 153 RTIVRGEMGIQVGLLEEGRASAMRINSAEVKRQGAAVQLEIMRNAIGFYGWEGKNGNRTFGFLNDPSLLPAIA-STTPGG 231 (388) Q Consensus 153 ~~v~~~~~~~~y~~~El~~A~~~g~~l~~~K~~aAr~a~~~~~n~i~~~G~a~~~~~g~~GllN~P~l~a~~~-~~~~~~ 231 (388) +++|++++||+|+++|+++|+++|++|+++|+.+||+++|+++|+++|||+.+. .+++|||||||||+++++ ++++++ T Consensus 149 r~v~~~~~g~~yg~~El~~Aa~~g~~l~~~Ka~aA~~ale~~~N~i~f~G~~d~-~~~~yGllNdP~l~a~~t~atg~~~ 227 (379) T protein:vir:10 149 RTVVRFEAGLQVAPLEEARSSRVQVSSADEKRAMVGEALEVQRNRVAFYGYNDG-SGRTFGFLNDPNLPAYVAVPNGAGG 227 (379) T ss_pred eeeEEEEEEEeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEEeecCC-CcceEEEEeCCCCcccccccCCccc Confidence 999999999999999999999999999999999999999999999999997543 368999999999998866 455677 Q ss_pred ccccccCCHHHHHHHHHHHHHHHHHhcCCeeccccccceEEcCHHHHHhhccCCCcCccHHHHHHHhCCccEEEEccccc Q lcl|Aclame:pro 232 WVSGGANAFQGIVGDLRLMLITLRVQSEDNIDPEDVDITLVLPMNKVDMLSVVTDLGISVRDWLKQTYPRVRVMSAPELQ 311 (388) Q Consensus 232 ~t~Wa~kT~~eI~~DI~~~~~~l~~~s~g~v~~~~~p~tL~Lp~~~~~~Ls~~~~~~~Tvl~~lk~n~pnl~i~~~pel~ 311 (388) +++|++||++||++||+++++++|.||+|.++++++|++|+|||+++.+|+++|++|+||++|||+|||||+|+++|||+ T Consensus 228 ~t~Wa~kT~~eI~~Di~~~~~~l~~qs~g~~~~~~~~~tL~LP~~~~~~L~~~n~~g~Tvl~~lk~n~Pnl~i~t~pEL~ 307 (379) T protein:vir:10 228 SPLWAQKTTLEIIADLRNGLTALQVQSMGRIKSNKTPITIGIPNAYENYITTPTELGYSVAQYMRESYPNVTFVSAPELN 307 (379) T ss_pred ccccccCCHHHHHHHHHHHHHHHHHhhCCeecccccceeEEecHHHHHhhccccccCccHHHHHHHhcCCcEEEEccccc Confidence 88999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccCCCCccEEEEEEcccccccccccCCCcceEeecchhhhccCceeccCceEEecccceeeeeeeccccceeeccC Q lcl|Aclame:pro 312 GGNPDDGKDIAYMFLDSVDTAVDGSTDGGDTWAQLVQSKFVTLGVEKRVKNYVEAYSNATAGVMLKRPWAVVRLIGL 388 (388) Q Consensus 312 ~a~gtg~~~~~~~~~~~~d~~~~~~~~~~~t~~~~~p~~~r~~~v~~~~~~~~~~~~~~t~G~ii~rP~ai~~~~GI 388 (388) +|+ |+++.+++|.++++ ..+.+.++++.|+||||||+||+|++.++|+|||++|||||+||||+||+|++|- T Consensus 308 ~ag--gg~~~~~~~~~~~~---~~~t~~~~~~~~~~p~k~~~l~ve~~~~~~~~~~~~rt~Gv~ir~P~Ai~~~~G~ 379 (379) T protein:vir:10 308 DAN--GGSSAIYYYADAVE---NNGTDDGRTWLQVVPTKMFTLGVEKKIKGYAEGYTNATAGAMLKRPFATYRQTGA 379 (379) T ss_pred ccC--CCccEEEEEeeccC---CCccCCcceEEEecchhhhhccceecCceeEeccccceeeeeeecchhhheecCC Confidence 984 55678899999875 4566778999999999999999999999999999999999999999999999999 No 4 >protein:vir:106734 Length: 336 # NCBI annotation: gp13 # Family: family:all:1653 # MgeID: mge:1599 # MgeName: Bcep1 # Cross-refs: genbank:acc:NP_944321;genbank:gi:38638620;genbank:GeneID:2657363 Probab=100.00 E-value=1.6e-115 Score=650.00 Aligned_cols=336 Identities=22% Similarity=0.315 Sum_probs=309.6 Q ss_pred ccccCCHHHHhhcceecccchhhcchhhhhhhhhhhhccCcccccccccccchHHHHHHHhhcceeeeecccchhhhhhc Q lcl|Aclame:pro 29 RLTDMAVRELKKFGLVFDHATVKRQIELLHEGGVATQAFDSAYVAPTTQASIPTPIQFLQQWLPGFVKVLTSARKIDEIL 108 (388) Q Consensus 29 ~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~amDaa~~~~~t~~~~g~l~~~l~~idp~v~e~l~~~~~~~~i~ 108 (388) +++-..+++|+|+||+||++..++.+++ .+++++|+|+++ +++|.++.|++..+++|+||++||++|+|+++++|| T Consensus 1 ~~~~~~~~~l~~~gi~~~~~~~~~~~~~---~~~a~da~d~~~-~~~t~~~~g~~~~l~~~i~p~~~~~~~~~~~~~~l~ 76 (336) T protein:vir:10 1 MRDAQRIQNLARAGVILPRSVKNVSTPL---AEYAMDAADLSP-HLSSTGSSGIPNYLTTYVDPSVIDILVAPMKAAELV 76 (336) T ss_pred CchHHHHHHHhccCeecchhhhhhhHHH---HHHHHhhhhhcc-ccccCCCcchHHHHHhhcCcceeeeeechhchhhhc Confidence 2333458899999999999888888765 334555666543 246888999999999999999999999999999999 Q ss_pred ccccCCCCceeeEEEeeeccccceEecccccCCceeeeeeeeeeeeEEEEEEEEeecHHHHHHHHHhCCChHHHHHHHHH Q lcl|Aclame:pro 109 GVKTVGSWEDQEIVQGIVEPAGTAMEYGDLTNIPLSSWNVNFERRTIVRGEMGIQVGLLEEGRASAMRINSAEVKRQGAA 188 (388) Q Consensus 109 ~v~t~g~w~~~t~~~~v~e~~G~a~~ygd~~diP~~~~n~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~K~~aAr 188 (388) ||+|+|+|++++++|+++|++|++++|||++|+|++|+|++++++++|++++||+||++|+++|+++|++|+++|+.+|| T Consensus 77 ~v~t~g~w~~~~~~~~~~e~~G~a~~ygd~~d~P~~d~~~~~~~~~v~~~~~g~~yg~~El~~A~~~g~~l~~~Ka~aA~ 156 (336) T protein:vir:10 77 GESKKGDWTTLVAAFITAEPTTKVATYGDYSSDGDSGTNINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASELNYSSA 156 (336) T ss_pred ccccCCCcceeeEEEEeeeeeeeEEEccccCCCcceeeeeeeeeeeEEEEEEEEeeCHHHHHHHHHhCCCcHHHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHhhceEEEEeecCccccceEEEeecCCCccccccccCCcccccccCCHHHHHHHHHHHHHHHHHhcCCeecccccc Q lcl|Aclame:pro 189 VQLEIMRNAIGFYGWEGKNGNRTFGFLNDPSLLPAIASTTPGGWVSGGANAFQGIVGDLRLMLITLRVQSEDNIDPEDVD 268 (388) Q Consensus 189 ~a~~~~~n~i~~~G~a~~~~~g~~GllN~P~l~a~~~~~~~~~~t~Wa~kT~~eI~~DI~~~~~~l~~~s~g~v~~~~~p 268 (388) +++|+++|+++||||++ +|+|||||||||+++++++ ++.|++||+|||++||++++++|+.||+|.+++ ++| T Consensus 157 ~ale~~~N~~~~~Gd~~---~~~~GllN~P~l~a~~t~~----~~~w~~~T~~eI~~Di~~~~~~l~~qt~g~i~~-~~~ 228 (336) T protein:vir:10 157 LGLAKFLNGSYLFGVAG---LENYGLINDPSLSAPITAT----TPWSGSPAVEAVVNEVVTLFQVLQTQSQGIITQ-EAV 228 (336) T ss_pred HHHHHhhCeEEEEeecc---cceEEEeecCCCCcccccC----cCcccccCHHHHHHHHHHHHHHHHHhcCCeeee-ccc Confidence 99999999999999985 5899999999999876543 345899999999999999999999999999986 589 Q ss_pred ceEEcCHHHHHhhccCCCcCccHHHHHHHhCCccEEEEccccccccCCCCccEEEEEEcccccccccccCCCcceEeecc Q lcl|Aclame:pro 269 ITLVLPMNKVDMLSVVTDLGISVRDWLKQTYPRVRVMSAPELQGGNPDDGKDIAYMFLDSVDTAVDGSTDGGDTWAQLVQ 348 (388) Q Consensus 269 ~tL~Lp~~~~~~Ls~~~~~~~Tvl~~lk~n~pnl~i~~~pel~~a~gtg~~~~~~~~~~~~d~~~~~~~~~~~t~~~~~p 348 (388) ++|+||++++.+|+++|++|+|+++|||+|||||+|+++|||++| +++++++|++++ ++++++.++|| T Consensus 229 ~tL~Lp~~~~~~L~~~n~~g~tv~~~lk~n~Pnl~i~t~pel~~A----gg~~~~~~~~~~--------~~~~t~~~~~P 296 (336) T protein:vir:10 229 LHMGLPPTAMSDLSKTNQYGLSAAAKLKEIFPKLEFVTIPEYDTA----SGRLVQLWAPRV--------EGKDTATCGFT 296 (336) T ss_pred eEEEechHHHHhccCCCccCccHHHHHHHhCCccEEEEccccccc----CCceEEEEEecc--------cCCcceeeecC Confidence 999999999999999999999999999999999999999999876 357899999986 56889999999 Q ss_pred hhhhccCceeccCceEEecccceeeeeeeccccceeeccC Q lcl|Aclame:pro 349 SKFVTLGVEKRVKNYVEAYSNATAGVMLKRPWAVVRLIGL 388 (388) Q Consensus 349 ~~~r~~~v~~~~~~~~~~~~~~t~G~ii~rP~ai~~~~GI 388 (388) ++||+||+|++.++|++||++|||||+||||+||++++|| T Consensus 297 ~~f~~lpvq~~~~~~~v~~~~rt~Gv~i~rP~ai~~~~GI 336 (336) T protein:vir:10 297 EKMRAHSIERYSSYFRQKKSAGTWGAVIFRPFAVAQMLGV 336 (336) T ss_pred hhhhccceeecCceeEeccccceeeeeeeccchheeeccC Confidence 9999999999999999999999999999999999999999 No 5 >protein:vir:78558 Length: 336 # NCBI annotation: major capsid protein # Family: family:all:1653 # MgeID: mge:1854 # MgeName: BcepNY3 # Cross-refs: genbank:acc:YP_001294848;genbank:gi:149882911;genbank:GeneID:5291029 Probab=100.00 E-value=8.8e-115 Score=645.87 Aligned_cols=336 Identities=23% Similarity=0.318 Sum_probs=309.3 Q ss_pred ccccCCHHHHhhcceecccchhhcchhhhhhhhhhhhccCcccccccccccchHHHHHHHhhcceeeeecccchhhhhhc Q lcl|Aclame:pro 29 RLTDMAVRELKKFGLVFDHATVKRQIELLHEGGVATQAFDSAYVAPTTQASIPTPIQFLQQWLPGFVKVLTSARKIDEIL 108 (388) Q Consensus 29 ~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~amDaa~~~~~t~~~~g~l~~~l~~idp~v~e~l~~~~~~~~i~ 108 (388) +++-..+++|+|+||+||++..++.+++ .+++++|+|+++ +++|.++.|++..+++|+||++||++|+|+++++|| T Consensus 1 ~~~~~~~~~l~~~gi~~~~~~~~~~~~~---~~~a~da~d~~~-~~~t~~~~g~~~~l~~~i~p~~~~~~~~~~~~~~l~ 76 (336) T protein:vir:78 1 MRDAQRIQNLARAGVILPRSVKNVSTPL---AEYAMDAADLSP-HLSSTGSSGIPNYLTTYVDPSVIDILVAPMKAAELV 76 (336) T ss_pred CchHHHHHHHhccCeecchhhhhhhHHH---HHHHHhhhhhcc-ccccCCCcchHHHHHHhcccceeeehhhhhhhhhhc Confidence 2333458899999999999888888765 334555666543 246888999999999999999999999999999999 Q ss_pred ccccCCCCceeeEEEeeeccccceEecccccCCceeeeeeeeeeeeEEEEEEEEeecHHHHHHHHHhCCChHHHHHHHHH Q lcl|Aclame:pro 109 GVKTVGSWEDQEIVQGIVEPAGTAMEYGDLTNIPLSSWNVNFERRTIVRGEMGIQVGLLEEGRASAMRINSAEVKRQGAA 188 (388) Q Consensus 109 ~v~t~g~w~~~t~~~~v~e~~G~a~~ygd~~diP~~~~n~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~K~~aAr 188 (388) ||+|+|+|++++++|+++|.+|++++|||++|+|++|+|+++++++++++++||+||++|+++|+++|++|+++|+.+|| T Consensus 77 ~v~t~g~W~~~~~~~~~~e~~G~a~~ygd~~D~P~vd~~~~~~~~~v~~~~~g~~yg~~El~~A~~~g~~l~~~Ka~aA~ 156 (336) T protein:vir:78 77 GESKKGDWTTLVAAFITAEPTTTVATYGDYSSDGDSGTNINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASELNYSSA 156 (336) T ss_pred ccccCCCccccEEEEeeeecceeeEEeecccCCCeeecceeeEEEEEEEEEeeeeecHHHHHHHHHhCCCcHHHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHhhceEEEEeecCccccceEEEeecCCCccccccccCCcccccccCCHHHHHHHHHHHHHHHHHhcCCeecccccc Q lcl|Aclame:pro 189 VQLEIMRNAIGFYGWEGKNGNRTFGFLNDPSLLPAIASTTPGGWVSGGANAFQGIVGDLRLMLITLRVQSEDNIDPEDVD 268 (388) Q Consensus 189 ~a~~~~~n~i~~~G~a~~~~~g~~GllN~P~l~a~~~~~~~~~~t~Wa~kT~~eI~~DI~~~~~~l~~~s~g~v~~~~~p 268 (388) +++|+++|+++|||+++ +++|||||||||+++++++ ++.|++||+|||++||++++++|+.||+|.++. ++| T Consensus 157 ~ale~~~N~~~~~Gd~~---~~~~GllN~P~l~a~~t~~----~~~w~~~T~~~I~~Di~~~~~~l~~qt~g~~~~-~~~ 228 (336) T protein:vir:78 157 LGLAKFLNGSYLFGVAG---LENYGLINDPSLSAPITAT----TPWSGSPAVEAVVNEVVTLFQVLQTQSQGIITQ-EAV 228 (336) T ss_pred HHHHHhhCeEEEEeccc---cceEEEEeCCCCCcccccC----cCcccccCHHHHHHHHHHHHHHHHHhcCCeeee-ccc Confidence 99999999999999975 5899999999999876543 345899999999999999999999999999976 589 Q ss_pred ceEEcCHHHHHhhccCCCcCccHHHHHHHhCCccEEEEccccccccCCCCccEEEEEEcccccccccccCCCcceEeecc Q lcl|Aclame:pro 269 ITLVLPMNKVDMLSVVTDLGISVRDWLKQTYPRVRVMSAPELQGGNPDDGKDIAYMFLDSVDTAVDGSTDGGDTWAQLVQ 348 (388) Q Consensus 269 ~tL~Lp~~~~~~Ls~~~~~~~Tvl~~lk~n~pnl~i~~~pel~~a~gtg~~~~~~~~~~~~d~~~~~~~~~~~t~~~~~p 348 (388) ++|+||++++.+|+++|++|+|+++|||+|||||+|+++|||++| +++++++|++++ ++++++.++|| T Consensus 229 ~tL~Lp~~~~~~L~~~n~~g~tv~~~lk~n~Pnl~i~t~pel~~A----gg~~~~~~~~~~--------~~~~t~~~~~p 296 (336) T protein:vir:78 229 LHMGLPPTAMSDLSKTNQYGLSAAAKLKEIFPKLEFVTIPEYDTA----SGRLVQLWAPRV--------EGKDTATCGFT 296 (336) T ss_pred eEEEechHHHHhccCCCccCccHHHHHHHhcCccEEEEccccccc----CcceEEEEEeec--------cCCcceeeecc Confidence 999999999999999999999999999999999999999999876 357899999986 56889999999 Q ss_pred hhhhccCceeccCceEEecccceeeeeeeccccceeeccC Q lcl|Aclame:pro 349 SKFVTLGVEKRVKNYVEAYSNATAGVMLKRPWAVVRLIGL 388 (388) Q Consensus 349 ~~~r~~~v~~~~~~~~~~~~~~t~G~ii~rP~ai~~~~GI 388 (388) ++||+||+|++.++|++||++|||||+||||+||++++|| T Consensus 297 ~~f~~lpvq~~~~~~~v~~~~rt~Gv~i~~P~ai~~~~GI 336 (336) T protein:vir:78 297 EKMRAHSIERYSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) T ss_pred hhhhccceeecCceeEeccccceeeeeeeccchheeeccC Confidence 9999999999999999999999999999999999999999 No 6 >protein:vir:101557 Length: 336 # NCBI annotation: gp12 # Family: family:all:1653 # MgeID: mge:1477 # MgeName: Bcep43 # Cross-refs: genbank:acc:NP_958117;genbank:gi:41057663;genbank:GeneID:2716814 Probab=100.00 E-value=4.1e-114 Score=642.20 Aligned_cols=336 Identities=22% Similarity=0.310 Sum_probs=308.6 Q ss_pred ccccCCHHHHhhcceecccchhhcchhhhhhhhhhhhccCcccccccccccchHHHHHHHhhcceeeeecccchhhhhhc Q lcl|Aclame:pro 29 RLTDMAVRELKKFGLVFDHATVKRQIELLHEGGVATQAFDSAYVAPTTQASIPTPIQFLQQWLPGFVKVLTSARKIDEIL 108 (388) Q Consensus 29 ~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~amDaa~~~~~t~~~~g~l~~~l~~idp~v~e~l~~~~~~~~i~ 108 (388) ++|-..+++|+|+||+|+++..++..++. .++|+|||+++ +++|.+|.|+|..+++|+||++||++++||++++|| T Consensus 1 ~~~~~~~~~l~~~gi~~~~~~~~~~~~~~---~~~~da~d~~~-~~~~~~~~~i~~~l~~~i~p~~~~~~~~p~~a~~l~ 76 (336) T protein:vir:10 1 MRDAQRIQNLARAGVILPRSVQNVSTPLT---EYAMDAADLSP-HLSSTGSSGIPNYLTTYVDPAVIDILVAPMKAAELV 76 (336) T ss_pred CchHHHHHHHhhcCeeecchhhhhhhhHH---HhhhhhhhccC-ccccCCCchhHHHHHhhcccceeeehhhhhhhhhhc Confidence 34444689999999999999998887653 34555666543 346788999999999999999999999999999999 Q ss_pred ccccCCCCceeeEEEeeeccccceEecccccCCceeeeeeeeeeeeEEEEEEEEeecHHHHHHHHHhCCChHHHHHHHHH Q lcl|Aclame:pro 109 GVKTVGSWEDQEIVQGIVEPAGTAMEYGDLTNIPLSSWNVNFERRTIVRGEMGIQVGLLEEGRASAMRINSAEVKRQGAA 188 (388) Q Consensus 109 ~v~t~g~w~~~t~~~~v~e~~G~a~~ygd~~diP~~~~n~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~K~~aAr 188 (388) ||+|+|+|++++++|+++|++|+|++||||+|+|++|+++++++++++++++||+||++|+++|+++|++|+++|+.+|| T Consensus 77 pv~t~g~W~~~~~~~~~~e~~G~a~~ygd~~D~P~~d~~~~~~~~~v~~~~~g~~yg~~El~~A~~~g~~l~~~Ka~aA~ 156 (336) T protein:vir:10 77 GESKKGDWTTLVAAFITAEPTTKVATYGDYSSDGDSGANINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASELNYSSA 156 (336) T ss_pred cccccCCccceeEEEeeeeceeeEEEeeccCCCceeecccceeeeeEEEEEeeeeeCHHHHHHHHHhCCCcHHHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHhhceEEEEeecCccccceEEEeecCCCccccccccCCcccccccCCHHHHHHHHHHHHHHHHHhcCCeecccccc Q lcl|Aclame:pro 189 VQLEIMRNAIGFYGWEGKNGNRTFGFLNDPSLLPAIASTTPGGWVSGGANAFQGIVGDLRLMLITLRVQSEDNIDPEDVD 268 (388) Q Consensus 189 ~a~~~~~n~i~~~G~a~~~~~g~~GllN~P~l~a~~~~~~~~~~t~Wa~kT~~eI~~DI~~~~~~l~~~s~g~v~~~~~p 268 (388) +++|+++|+++||||++ +++|||||||+|+++++++ ++.|.+||+|||++||++++++|+.||+|+++. |+| T Consensus 157 ~ale~~~N~i~~~Gd~~---~~~yGllN~P~l~a~~t~~----t~~~~~~t~eei~~Di~~~~~~l~~qs~G~i~~-~~~ 228 (336) T protein:vir:10 157 LGLAKFLNGSYLFGVAG---LENYGLINDPSLSAPITAT----TPWSGSPAVEAVVNEVVALFQVLQTQSQGIITQ-EDV 228 (336) T ss_pred HHHHHhhCcEEEEeccc---cceEEEEeCCCCccccccC----CCcccccCHHHHHHHHHHHHHHHHHhcCCeecc-cCc Confidence 99999999999999975 6899999999999876543 345788999999999999999999999999975 789 Q ss_pred ceEEcCHHHHHhhccCCCcCccHHHHHHHhCCccEEEEccccccccCCCCccEEEEEEcccccccccccCCCcceEeecc Q lcl|Aclame:pro 269 ITLVLPMNKVDMLSVVTDLGISVRDWLKQTYPRVRVMSAPELQGGNPDDGKDIAYMFLDSVDTAVDGSTDGGDTWAQLVQ 348 (388) Q Consensus 269 ~tL~Lp~~~~~~Ls~~~~~~~Tvl~~lk~n~pnl~i~~~pel~~a~gtg~~~~~~~~~~~~d~~~~~~~~~~~t~~~~~p 348 (388) ++|+||++++.+|+++|++|+||++|||+|||||+|+++|||++|+ +++++||++++ ++++++.+.|| T Consensus 229 ~tL~LP~~~~~~Ls~~n~~g~Tvl~~lk~n~Pnl~i~t~pEl~~a~----G~~~~l~~~~~--------~~~~t~~~~~p 296 (336) T protein:vir:10 229 LRMGLPPTAMSDLSKTNQYGLAAAAKLKDIFPKLEFVTIPEYDTAS----GRLVQLWAPRV--------EGKDTATCGFT 296 (336) T ss_pred ceEEecHHHHHhccCCCccCccHHHHHHHhcCccEEEEccccccCC----CceEEEEEEec--------CCCcceeeecc Confidence 9999999999999999999999999999999999999999998763 45788888876 46889999999 Q ss_pred hhhhccCceeccCceEEecccceeeeeeeccccceeeccC Q lcl|Aclame:pro 349 SKFVTLGVEKRVKNYVEAYSNATAGVMLKRPWAVVRLIGL 388 (388) Q Consensus 349 ~~~r~~~v~~~~~~~~~~~~~~t~G~ii~rP~ai~~~~GI 388 (388) ++||+||+|++.++|++||++|||||+||||+||++++|| T Consensus 297 ~~~~~l~vq~~~~~~~v~~~~rt~Gv~i~~P~ai~~~~GI 336 (336) T protein:vir:10 297 EKMRAHSIERYSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) T ss_pred hhhhccceeecCceeEeccccceeeeeeeccchheeeecC Confidence 9999999999999999999999999999999999999999 No 7 >protein:vir:3643 Length: 336 # NCBI annotation: gp12 # Family: family:all:1653 # MgeID: mge:75 # MgeName: Bcep781 # Cross-refs: genbank:acc:NP_705638;genbank:gi:23752323;genbank:GeneID:955719 Probab=100.00 E-value=5.7e-114 Score=641.44 Aligned_cols=336 Identities=22% Similarity=0.311 Sum_probs=307.9 Q ss_pred ccccCCHHHHhhcceecccchhhcchhhhhhhhhhhhccCcccccccccccchHHHHHHHhhcceeeeecccchhhhhhc Q lcl|Aclame:pro 29 RLTDMAVRELKKFGLVFDHATVKRQIELLHEGGVATQAFDSAYVAPTTQASIPTPIQFLQQWLPGFVKVLTSARKIDEIL 108 (388) Q Consensus 29 ~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~amDaa~~~~~t~~~~g~l~~~l~~idp~v~e~l~~~~~~~~i~ 108 (388) ++|-..+++|+|+||+|+++..++..++.. ++|+|||+.+ +.+|.++.|+|..+++|+||++||++|+|+++++|| T Consensus 1 ~~~~~~~~~l~~~gi~~~~~~~~~~~~~~~---~~~da~d~~~-~~~~~~~~~~~~~l~~~i~p~~~~~~~~~~~~~~l~ 76 (336) T protein:vir:36 1 MRDAQRIQNLARAGVILPRSVQNVSTPLTE---YAMDAADLSP-HLSSTGSSGIPNYLTTYVDPSVIDILVAPMKAAELV 76 (336) T ss_pred CchHHHHHHHhhcCeeecchhhhhhhHHHH---hhhhhhhccC-ccccCCCcchHHHHHHhhccceEeeecchhhhhhhc Confidence 344446899999999999999988876533 3455555442 245788999999999999999999999999999999 Q ss_pred ccccCCCCceeeEEEeeeccccceEecccccCCceeeeeeeeeeeeEEEEEEEEeecHHHHHHHHHhCCChHHHHHHHHH Q lcl|Aclame:pro 109 GVKTVGSWEDQEIVQGIVEPAGTAMEYGDLTNIPLSSWNVNFERRTIVRGEMGIQVGLLEEGRASAMRINSAEVKRQGAA 188 (388) Q Consensus 109 ~v~t~g~w~~~t~~~~v~e~~G~a~~ygd~~diP~~~~n~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~K~~aAr 188 (388) ||+|+|+|++++++|+++|++|+|++||||+|+|++|+++++++++++++++||+||++|+++|+++|++|+++|+.+|| T Consensus 77 pv~t~g~W~~~~~~~~~~e~~G~a~~ygd~~D~P~~d~~~~~~~~~v~~~~~g~~yg~~E~~~Aa~~~~~l~~~Ka~aA~ 156 (336) T protein:vir:36 77 GESKKGDWTTLVAAFITAEPTTKVATYGDYSSDGDSGANINYPQRQSYFFQTWTRWGERELEMAGAGRVDLASELNYSSA 156 (336) T ss_pred cccccCCccceeEEEeeeeceeeEEEeeccCCCceeecccceeeeeEEEEEeeeeeCHHHHHHHHHhCCCcHHHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHhhceEEEEeecCccccceEEEeecCCCccccccccCCcccccccCCHHHHHHHHHHHHHHHHHhcCCeecccccc Q lcl|Aclame:pro 189 VQLEIMRNAIGFYGWEGKNGNRTFGFLNDPSLLPAIASTTPGGWVSGGANAFQGIVGDLRLMLITLRVQSEDNIDPEDVD 268 (388) Q Consensus 189 ~a~~~~~n~i~~~G~a~~~~~g~~GllN~P~l~a~~~~~~~~~~t~Wa~kT~~eI~~DI~~~~~~l~~~s~g~v~~~~~p 268 (388) +++|+++|+++||||++ +++|||||||+|+++++++ ++.|++||+|||++||++++++|+.||+|+++. |+| T Consensus 157 ~ale~~~N~i~~~Gd~~---~~~yGllNdP~l~a~~t~~----t~~~~~~t~~ei~~Di~~~~~~l~~qt~G~i~~-~~~ 228 (336) T protein:vir:36 157 LGLAKFLNGSYLFGVAG---LENYGLINDPSLSAPITAT----TPWSGSPAVEAVVNEVVALFQVLQTQSQGIITQ-EDV 228 (336) T ss_pred HHHHHhhCcEEEEeccc---cceEEEEecCCCccccccC----CCcccccCHHHHHHHHHHHHHHHHHhcCCeeee-ccc Confidence 99999999999999975 6899999999999876543 344788999999999999999999999999874 789 Q ss_pred ceEEcCHHHHHhhccCCCcCccHHHHHHHhCCccEEEEccccccccCCCCccEEEEEEcccccccccccCCCcceEeecc Q lcl|Aclame:pro 269 ITLVLPMNKVDMLSVVTDLGISVRDWLKQTYPRVRVMSAPELQGGNPDDGKDIAYMFLDSVDTAVDGSTDGGDTWAQLVQ 348 (388) Q Consensus 269 ~tL~Lp~~~~~~Ls~~~~~~~Tvl~~lk~n~pnl~i~~~pel~~a~gtg~~~~~~~~~~~~d~~~~~~~~~~~t~~~~~p 348 (388) ++|+||++++.+|+++|++|+||++|||+|||||+|+++|||++|+ +++++||++++ ++++++.+.|| T Consensus 229 ~tL~LP~~~~~~Ls~~n~~g~Tvl~~lk~n~Pnl~i~t~pEl~~a~----g~~~~l~~~~~--------~~~~t~~~~~p 296 (336) T protein:vir:36 229 LRMGLPPTAMSDLSKTNQYGLAAAAKLKDIFPKLEFVTIPEYDTAS----GRLVQLWAPRV--------EGKDTATCGFT 296 (336) T ss_pred cEEEechHHHHhccCCCccCccHHHHHHHhcCccEEEEccccccCC----CceEEEEEEec--------CCCcceeeecc Confidence 9999999999999999999999999999999999999999998763 46788888876 46889999999 Q ss_pred hhhhccCceeccCceEEecccceeeeeeeccccceeeccC Q lcl|Aclame:pro 349 SKFVTLGVEKRVKNYVEAYSNATAGVMLKRPWAVVRLIGL 388 (388) Q Consensus 349 ~~~r~~~v~~~~~~~~~~~~~~t~G~ii~rP~ai~~~~GI 388 (388) ++||+||+|++.++|++||++|||||+||||+||++++|| T Consensus 297 ~~~~~l~vq~~~~~~~v~~~~rt~Gv~i~~P~ai~~~~GI 336 (336) T protein:vir:36 297 EKMRAHSIERYSSYFRQKKSAGTWGAVIFRPFAVAQMIGV 336 (336) T ss_pred hhhhccceeecCceeEeccccceeeeeeeccchheeeecC Confidence 9999999999999999999999999999999999999999 No 8 >protein:vir:94070 Length: 339 # NCBI annotation: putative structural protein # Family: family:all:1653 # MgeID: mge:1493 # MgeName: OP2 # Cross-refs: genbank:acc:YP_453625;genbank:gi:84662661;genbank:GeneID:5142580 Probab=100.00 E-value=7.1e-114 Score=640.90 Aligned_cols=337 Identities=23% Similarity=0.312 Sum_probs=313.0 Q ss_pred cccccccCCHHHHhhcceecccchhhcchhhhhhhhhhhhccCccccc--ccccccchHHHHHHHhhcceeeeecccchh Q lcl|Aclame:pro 26 ADYRLTDMAVRELKKFGLVFDHATVKRQIELLHEGGVATQAFDSAYVA--PTTQASIPTPIQFLQQWLPGFVKVLTSARK 103 (388) Q Consensus 26 ~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~amDaa~~~--~~t~~~~g~l~~~l~~idp~v~e~l~~~~~ 103 (388) .-+++|+.+++||+|+||+||+...+....+. ..+||||+..+ .+|.++.|||+++|+||||++||++|++++ T Consensus 1 ~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~-----~~~a~d~~~~~~~~~~~~~~~i~a~~~~~i~~~vy~~~~~~~~ 75 (339) T protein:vir:94 1 MSINNDRTDIKQLEKVGIIFDGYSPKSISSEV-----SAYAMDAVNLTPTLQTTANAGIPAWMTTFVDRRVIDIQLAPMA 75 (339) T ss_pred CceechHHHHHHHHhhceeeccchhhhcchhh-----Hhhhccccccccccccccccchhhhhhhhhchhheeecccccc Confidence 45678999999999999999999888765433 34677776554 468889999999999999999999999999 Q ss_pred hhhhcccccCCCCceeeEEEeeeccccceEecccccCCceeeeeeeeeeeeEEEEEEEEeecHHHHHHHHHhCCChHHHH Q lcl|Aclame:pro 104 IDEILGVKTVGSWEDQEIVQGIVEPAGTAMEYGDLTNIPLSSWNVNFERRTIVRGEMGIQVGLLEEGRASAMRINSAEVK 183 (388) Q Consensus 104 ~~~i~~v~t~g~w~~~t~~~~v~e~~G~a~~ygd~~diP~~~~n~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~K 183 (388) +++|||++|+|+|++++++|+++|.+|+|++|||++|+|++++|+++++++++++++|++|+++|+++|+++|++|+++| T Consensus 76 ~~~l~pv~t~g~w~~~t~~y~~~e~~G~a~~ygd~ad~Pl~~~~v~~~~~~v~~~~~g~~y~~~E~~~A~~~g~~l~~~K 155 (339) T protein:vir:94 76 AAKIFPEVKKGDWTTTYGVFIIAEPVGQVATYSDWSANGMSKANVNFESRQNYRYQTWTEYGDLEMATYGEAGIDYVARQ 155 (339) T ss_pred hhhhcccccCCCCcccEEEEeeeecccceEEcccccCCCcccccceeeEEeEEEEEEEEeecHHHHHHHHhhCCChHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHhhceEEEEeecCccccceEEEeecCCCccccccccCCcccccccCCHHHHHHHHHHHHHHHHHhcCCeec Q lcl|Aclame:pro 184 RQGAAVQLEIMRNAIGFYGWEGKNGNRTFGFLNDPSLLPAIASTTPGGWVSGGANAFQGIVGDLRLMLITLRVQSEDNID 263 (388) Q Consensus 184 ~~aAr~a~~~~~n~i~~~G~a~~~~~g~~GllN~P~l~a~~~~~~~~~~t~Wa~kT~~eI~~DI~~~~~~l~~~s~g~v~ 263 (388) +.+||+++|+++|+++|||+++ +|+|||||||||+++++ ++++|++||++||++||++++++++.+|+|.++ T Consensus 156 a~aA~~al~~~~N~i~~~Gd~~---~~~~GLlN~P~l~~~v~-----~s~~Wa~kT~~eI~~Di~~~~~~l~~~s~g~~~ 227 (339) T protein:vir:94 156 EISASLVMAKFANSSYLLGVAG---IANYGLMNDPSLPAPVA-----ATVNWATAAPEDIANDVVAMVGRLISQSGGLIT 227 (339) T ss_pred HHHHHHHHHHhhceEEeeeecc---cceEEEEeCCCcccccc-----CCCCcccCCHHHHHHHHHHHHHHHHHhcCCeee Confidence 9999999999999999999986 58999999999987654 345799999999999999999999999999998 Q ss_pred cccccceEEcCHHHHHhhccCCCcCccHHHHHHHhCCccEEEEccccccccCCCCccEEEEEEcccccccccccCCCcce Q lcl|Aclame:pro 264 PEDVDITLVLPMNKVDMLSVVTDLGISVRDWLKQTYPRVRVMSAPELQGGNPDDGKDIAYMFLDSVDTAVDGSTDGGDTW 343 (388) Q Consensus 264 ~~~~p~tL~Lp~~~~~~Ls~~~~~~~Tvl~~lk~n~pnl~i~~~pel~~a~gtg~~~~~~~~~~~~d~~~~~~~~~~~t~ 343 (388) + ++|++|+|||+++.+|+++|++|+|+++|||+|||||+|+++|||+++ +++++++|++++ .+++++ T Consensus 228 ~-~~~~~L~LP~~~~~~L~~~n~~~~Tvl~~lk~n~pnl~i~~~~el~~a----~g~~~~~~~~~~--------~~~~~~ 294 (339) T protein:vir:94 228 G-QERMVMALAPSALNNVNRTNNFGLSAGAKIAQTYPNIQFVAVPEFDTA----SGRLVQLWVPEV--------NGQPTG 294 (339) T ss_pred e-ccCcEEEecHHHHHhcccCCcCCccHHHHHHHhcCCcEEEEccccccC----CCceEEEEEEec--------cCCcce Confidence 7 479999999999999999999999999999999999999999999865 357888888875 578999 Q ss_pred EeecchhhhccCceeccCceEEecccceeeeeeeccccceeeccC Q lcl|Aclame:pro 344 AQLVQSKFVTLGVEKRVKNYVEAYSNATAGVMLKRPWAVVRLIGL 388 (388) Q Consensus 344 ~~~~p~~~r~~~v~~~~~~~~~~~~~~t~G~ii~rP~ai~~~~GI 388 (388) .++|||+||.||+|++.++|++||++|||||+||||+||++++|| T Consensus 295 ~~~~p~~~~~lpvq~~~~~~~v~~~~rt~Gv~i~~P~ai~~~~GI 339 (339) T protein:vir:94 295 EVAFAEKLRSHSIERYSTTTRQKHSGATFGAVIYQPWAVTQELGV 339 (339) T ss_pred EEEcchhhhccccEEcCceEEecceeeeeeEEEEccceeeeeecC Confidence 999999999999999999999999999999999999999999999 No 9 >protein:vir:79642 Length: 329 # NCBI annotation: HsbB # Family: family:all:463 # MgeID: mge:1872 # MgeName: TLS # Cross-refs: genbank:acc:YP_001285525;genbank:gi:148734508;genbank:GeneID:5220000 Probab=100.00 E-value=6.4e-92 Score=520.54 Aligned_cols=319 Identities=13% Similarity=0.102 Sum_probs=281.0 Q ss_pred hcceecccchhhcchhhhhh--hhhhhhccCccccccccccc---chHHHHHHHhhcceeeeecccchhhhhhcccccCC Q lcl|Aclame:pro 40 KFGLVFDHATVKRQIELLHE--GGVATQAFDSAYVAPTTQAS---IPTPIQFLQQWLPGFVKVLTSARKIDEILGVKTVG 114 (388) Q Consensus 40 ~~g~~~~~~~~~~~~~~~~~--~~~~~~amDaa~~~~~t~~~---~g~l~~~l~~idp~v~e~l~~~~~~~~i~~v~t~g 114 (388) -.|. ++++|.+. +..+..+||+..... +..+ ..|++++|++|||+|||+.+++++++++||+.+++ T Consensus 1 ~~~~--------~~~~~~~~d~~~~~~~a~~~~~~~~-~~~~~~~~~f~~~ql~~id~~v~e~~~~~l~~~~~i~i~~~~ 71 (329) T protein:vir:79 1 MRGN--------IMSKEMKYDEFEANVIANHMQLRGA-KNDASDMGIWTSQELHKIKAQAYEKEYPAGSALRVFPVTSEL 71 (329) T ss_pred Cccc--------hhhhhhccchhhhhhHhhhcccccc-eeccchhhHHHHHHHHHHHHHHHhhhhcccchhhhcccccCC Confidence 1121 22222222 223344555443322 2222 45999999999999999999999999999999999 Q ss_pred CCceeeEEEeeeccccceEecccc-cCCceeeeeeeeeeeeEEEEEEEEeecHHHHHHHHHhCCChHHHHHHHHHHHHHH Q lcl|Aclame:pro 115 SWEDQEIVQGIVEPAGTAMEYGDL-TNIPLSSWNVNFERRTIVRGEMGIQVGLLEEGRASAMRINSAEVKRQGAAVQLEI 193 (388) Q Consensus 115 ~w~~~t~~~~v~e~~G~a~~ygd~-~diP~~~~n~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~K~~aAr~a~~~ 193 (388) +|++++++|+++|.+|++++|||+ +|+|+++++++++++++++++.+|+|+++|+++|+++|++|+++|+.+|++++++ T Consensus 72 ~~~~~~~t~~~~~~~G~a~~~~d~~~dip~vd~~~~~~~~~i~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aA~~~~~~ 151 (329) T protein:vir:79 72 SDTDKTFEYQTFDKVGHAKIIADYTDDLSTVDALMTSEFGKVFRLGNAFLISIDEIKAGQRTGKSLSTRKANAAQNAHDQ 151 (329) T ss_pred CCceeEEEeeeeecceeeeeecCcccccceeecccceeEEEEEEEEEEEEecHHHHHHHHHhCCChHHHHHHHHHHHHHH Confidence 999999999999999999999995 7899999999999999999999999999999999999999999999999999999 Q ss_pred hhceEEEEeecCccccceEEEeecCCCccccccccCCcccccccCCHHHHHHHHHHHHHHHHHhcCCeeccccccceEEc Q lcl|Aclame:pro 194 MRNAIGFYGWEGKNGNRTFGFLNDPSLLPAIASTTPGGWVSGGANAFQGIVGDLRLMLITLRVQSEDNIDPEDVDITLVL 273 (388) Q Consensus 194 ~~n~i~~~G~a~~~~~g~~GllN~P~l~a~~~~~~~~~~t~Wa~kT~~eI~~DI~~~~~~l~~~s~g~v~~~~~p~tL~L 273 (388) ++|+++|+|+++ +|+|||||||||++. +++++++++|++||++||++||+++++++|.+|+|. +.|++|+| T Consensus 152 ~~n~i~f~G~~~---~g~~GLlN~p~v~~~--~~~~~~~~~w~~kt~~ei~~di~~~~~~l~~~s~g~----~~p~~L~L 222 (329) T protein:vir:79 152 LVNHLVFKGSKP---HKIISVFEHPNLTTI--NSAGWNNAAGTGKKPETAQDELEQAIEKIETLTNGQ----HRANMILI 222 (329) T ss_pred hhccEEEeeccc---ccceeeecCCCcccc--ccCCCCCccccccCHHHHHHHHHHHHHHHHHhcCce----ecccEEEe Confidence 999999999875 589999999999753 345667789999999999999999999999999986 46889999 Q ss_pred CHHHHHhhcc-CCCcCccHHHHHHHhCCccEEEEccccccccCCCCccEEEEEEcccccccccccCCCcceEeecchhhh Q lcl|Aclame:pro 274 PMNKVDMLSV-VTDLGISVRDWLKQTYPRVRVMSAPELQGGNPDDGKDIAYMFLDSVDTAVDGSTDGGDTWAQLVQSKFV 352 (388) Q Consensus 274 p~~~~~~Ls~-~~~~~~Tvl~~lk~n~pnl~i~~~pel~~a~gtg~~~~~~~~~~~~d~~~~~~~~~~~t~~~~~p~~~r 352 (388) ||+++.+|++ .+.+|+|+++||++||||++|+++|||+++ +.++++++++|.++ ++++.+.+|++|| T Consensus 223 pp~~~~~L~~~~~~~~~tvl~~lk~~~~~l~I~~~~el~~a-g~~g~~~~v~y~~~-----------~~~~~~~vp~~~~ 290 (329) T protein:vir:79 223 PPSMRKVLMVRMPETTMSYLDYFKQQNGGITIESISELEDI-DGAGTKAALVYEKD-----------PMNMSIEIPEAFN 290 (329) T ss_pred cHHHHHHhhcccCCCCccHHHHHHHhCCCcEEEEccccccc-CCCCceEEEEEecC-----------CceEEEecCccee Confidence 9999999975 467899999999999999999999999998 56778999999765 5778999999999 Q ss_pred ccCceeccCceEEecccceeeeeeeccccceeeccC Q lcl|Aclame:pro 353 TLGVEKRVKNYVEAYSNATAGVMLKRPWAVVRLIGL 388 (388) Q Consensus 353 ~~~v~~~~~~~~~~~~~~t~G~ii~rP~ai~~~~GI 388 (388) +|++|++.++|++||++|||||+||||+||++++|| T Consensus 291 ~l~~q~~~~~~~v~~~~r~~Gv~i~~P~ai~~~dGI 326 (329) T protein:vir:79 291 MLTAQPKDLHFKVPCTSKCTGLTIYRPLTLVLIKGL 326 (329) T ss_pred eeeceecCceEEEceeeeEEEEEEECcceeeeeeee Confidence 999999999999999999999999999999999999 No 10 >protein:vir:104342 Length: 314 # NCBI annotation: hypothetical protein # Family: family:all:463 # MgeID: mge:1593 # MgeName: RTP # Cross-refs: genbank:acc:YP_398971;genbank:gi:81343955;genbank:GeneID:3778874 Probab=100.00 E-value=9e-90 Score=508.77 Aligned_cols=309 Identities=12% Similarity=0.083 Sum_probs=271.5 Q ss_pred cceecccchhhcchhhhhhhhhhhhccCcccccccccccchHHHHHHHhhcceeeeecccchhhhhhcccccCCCCceee Q lcl|Aclame:pro 41 FGLVFDHATVKRQIELLHEGGVATQAFDSAYVAPTTQASIPTPIQFLQQWLPGFVKVLTSARKIDEILGVKTVGSWEDQE 120 (388) Q Consensus 41 ~g~~~~~~~~~~~~~~~~~~~~~~~amDaa~~~~~t~~~~g~l~~~l~~idp~v~e~l~~~~~~~~i~~v~t~g~w~~~t 120 (388) .-++|+ ..++......+.||.+.. ...+.|++++|++|||+|||+++++++++++||++++++||+++ T Consensus 1 ~~~~~~--------~~~~~~~~~~~~~~~~~~----d~~~~fl~~ql~~id~~v~e~~~~~~~~~~~i~v~~~~~~~~et 68 (314) T protein:vir:10 1 MAIKFD--------AEQAKITTHLEQMGVEKA----DAAGIWAVSQLTAALNRAYEKEYAENSVVNIFPVTNEIPGHAKY 68 (314) T ss_pred CccchH--------HHHHHHHHHHHhhcccch----hhhHHHHHHHHHHHHHHHhhhhccccccceeeccccCCCCceeE Confidence 122222 112222223344442211 12345999999999999999999999999999999999999999 Q ss_pred EEEeeeccccceEecccc-cCCceeeeeeeeeeeeEEEEEEEEeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEE Q lcl|Aclame:pro 121 IVQGIVEPAGTAMEYGDL-TNIPLSSWNVNFERRTIVRGEMGIQVGLLEEGRASAMRINSAEVKRQGAAVQLEIMRNAIG 199 (388) Q Consensus 121 ~~~~v~e~~G~a~~ygd~-~diP~~~~n~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~K~~aAr~a~~~~~n~i~ 199 (388) ++|+++|.+|++++|||+ +|+|++|++++++++++++++.+|+|+++|+++|+++|++|+++|+.+|++++++++|+++ T Consensus 69 ~~~~~~e~~G~a~~~~d~~~dip~vd~~~~~~~~~i~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aA~~~~~~~~n~i~ 148 (314) T protein:vir:10 69 FEYPEFDGVGIAQIIADYSDDLPLVDAFMTEKQGKVFRFGNAFLISTDEIKAGAATGQSLSARKQALAFEAHDNLLDKLV 148 (314) T ss_pred EEeeeeccccceeeeCCcccccceeecccceeEEEEEEEEeeEEecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEE Confidence 999999999999999996 6799999999999999999999999999999999999999999999999999999999999 Q ss_pred EEeecCccccceEEEeecCCCccccccccCCcccccccCCHHHHHHHHHHHHHHHHHhcCCeeccccccceEEcCHHHHH Q lcl|Aclame:pro 200 FYGWEGKNGNRTFGFLNDPSLLPAIASTTPGGWVSGGANAFQGIVGDLRLMLITLRVQSEDNIDPEDVDITLVLPMNKVD 279 (388) Q Consensus 200 ~~G~a~~~~~g~~GllN~P~l~a~~~~~~~~~~t~Wa~kT~~eI~~DI~~~~~~l~~~s~g~v~~~~~p~tL~Lp~~~~~ 279 (388) |+|+++ +|+|||||||||+.. +++++| +|++||++||+++++++|.+|+|. +.|++|+|||+++. T Consensus 149 f~G~~~---~g~~GLlN~p~v~~~------~~~~~W--aT~~ei~~Di~~~~~~l~~~s~g~----~~p~~l~Lpp~~~~ 213 (314) T protein:vir:10 149 WSGSAP---HGIVSVFDQPNINNV------VATPNW--SVPQNAIDDVTAMIDAVESSTQGL----HHVTDILLPASARR 213 (314) T ss_pred Eeeccc---ccceeEeecCCCccc------cCCCCc--ccHHHHHHHHHHHHHHHHHhcCcc----ccceeEEecHHHHH Confidence 999875 589999999999642 234568 599999999999999999999986 46889999999999 Q ss_pred hhccCCC-cCccHHHHHHHhCCccEEEEccccccccCCCCccEEEEEEcccccccccccCCCcceEeecchhhhccCcee Q lcl|Aclame:pro 280 MLSVVTD-LGISVRDWLKQTYPRVRVMSAPELQGGNPDDGKDIAYMFLDSVDTAVDGSTDGGDTWAQLVQSKFVTLGVEK 358 (388) Q Consensus 280 ~Ls~~~~-~~~Tvl~~lk~n~pnl~i~~~pel~~a~gtg~~~~~~~~~~~~d~~~~~~~~~~~t~~~~~p~~~r~~~v~~ 358 (388) +|+++++ +|+|+++||++|||||+|+++|||+++ |.++++++++|.++ ++++.+.||++||.|++|+ T Consensus 214 ~L~~~~~~~~~tvl~~l~~n~~~l~I~~~~el~~a-g~~g~~~~v~y~~~-----------~~~~~~~vp~~~~~l~~e~ 281 (314) T protein:vir:10 214 VMQGLVPQTNLSYGELFTRNNPGLTIRFLQFLDNY-DGAGGKAALAFEKS-----------PLNMSIEIPEVTNVLPAQP 281 (314) T ss_pred hhcccccCCCccHHHHHHHhCCCcEEEEccccccc-CCCcceEEEEEecC-----------CcEEEEecCccceeeccee Confidence 9987754 599999999999999999999999998 56778999999765 5689999999999999999 Q ss_pred ccCceEEecccceeeeeeeccccceeeccC Q lcl|Aclame:pro 359 RVKNYVEAYSNATAGVMLKRPWAVVRLIGL 388 (388) Q Consensus 359 ~~~~~~~~~~~~t~G~ii~rP~ai~~~~GI 388 (388) +.++|++||++|||||+||||+||++++|| T Consensus 282 ~~~~~~~~~~~r~~Gv~i~~P~ai~~~dGI 311 (314) T protein:vir:10 282 KDLHFRYPVTSKATGLIVYRPLTMAVIKGI 311 (314) T ss_pred cCceEEEcceeeeEEEEEECcceeEeeeee Confidence 999999999999999999999999999999 No 11 >protein:vir:80068 Length: 301 # NCBI annotation: gp8 # Family: family:all:463 # MgeID: mge:1876 # MgeName: B054 # Cross-refs: genbank:acc:YP_001468712;genbank:gi:157325292;genbank:GeneID:5601759 Probab=100.00 E-value=2.4e-88 Score=500.92 Aligned_cols=296 Identities=15% Similarity=0.135 Sum_probs=277.4 Q ss_pred ccccccchHHHHHHHhhcceeeeecccchhhhhhcccccCCCCceeeEEEeeeccccceEecccc-cCCceeeeeeeeee Q lcl|Aclame:pro 74 PTTQASIPTPIQFLQQWLPGFVKVLTSARKIDEILGVKTVGSWEDQEIVQGIVEPAGTAMEYGDL-TNIPLSSWNVNFER 152 (388) Q Consensus 74 ~~t~~~~g~l~~~l~~idp~v~e~l~~~~~~~~i~~v~t~g~w~~~t~~~~v~e~~G~a~~ygd~-~diP~~~~n~~~~~ 152 (388) .++.++.+|++++|++|||++||++++++++++|||++++++|++++++|.++|.+|++++|||+ +|+|++++++++.+ T Consensus 1 ~~~~~~g~f~~~~l~~id~~v~e~~~~~l~~r~l~~v~~~~~~~~~~~~~~~~~~~G~~~~~~~~~~dip~~~~~~~~~~ 80 (301) T protein:vir:80 1 MQGKITATIEARDLQAIDNVIYEPKQEELTARSVFPQKFDVNEGAESYSFDVMTRSGAAKIIANGADDLPLVDVDMVRKS 80 (301) T ss_pred CCccccchhhHHHHHHHHHHHHHhhhhhhhhhhhcccccCCCCceEEEEEeeeccceeEEEecCcccccccccccceeEE Confidence 34556778999999999999999999999999999999999999999999999999999999995 67999999999999 Q ss_pred eeEEEEEEEEeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEEeecCccccceEEEeecCCCcccccc-ccCCc Q lcl|Aclame:pro 153 RTIVRGEMGIQVGLLEEGRASAMRINSAEVKRQGAAVQLEIMRNAIGFYGWEGKNGNRTFGFLNDPSLLPAIAS-TTPGG 231 (388) Q Consensus 153 ~~v~~~~~~~~y~~~El~~A~~~g~~l~~~K~~aAr~a~~~~~n~i~~~G~a~~~~~g~~GllN~P~l~a~~~~-~~~~~ 231 (388) +++++++.+|+|+++||++|+++|++|+++|+.+|++++++++|+++|+|+++ .|+|||||+||+++..++ ++.++ T Consensus 81 ~~i~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aa~~~~~~~~n~~~f~G~~~---~g~~GLlN~p~~~~~~~~~~~~~~ 157 (301) T protein:vir:80 81 VPIYSIGIGLSYTIQDLRAARMQGTTVDAAKATTVRRAIAEKENSIAFRGEKK---YAIKGAFEATGIQIDVSPTTGVGN 157 (301) T ss_pred EEEEEEEeeeeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEeeeccc---ccceeeecCCCcccccccCccccc Confidence 99999999999999999999999999999999999999999999999999875 589999999999888665 55567 Q ss_pred ccccccCCHHHHHHHHHHHHHHHHHhcCCeeccccccceEEcCHHHHHhhccC---CCcCccHHHHHHHhCCccEEEEcc Q lcl|Aclame:pro 232 WVSGGANAFQGIVGDLRLMLITLRVQSEDNIDPEDVDITLVLPMNKVDMLSVV---TDLGISVRDWLKQTYPRVRVMSAP 308 (388) Q Consensus 232 ~t~Wa~kT~~eI~~DI~~~~~~l~~~s~g~v~~~~~p~tL~Lp~~~~~~Ls~~---~~~~~Tvl~~lk~n~pnl~i~~~p 308 (388) .++|++||++||++||++++++++.+|+|. +.|++|+|||+++.+|+++ +.+|+|+++||++|||+++|+++| T Consensus 158 ~~~w~~~t~~ei~~di~~~~~~l~~~s~g~----~~p~~L~L~p~~~~~L~~~~~~~~~~~tvl~~l~~~~~~~~I~~~p 233 (301) T protein:vir:80 158 VSKWEKKTAEQIIDEIGEAHTKITVLPGYG----TASLKLCLPPKQFELINKKRYSNEDSRSVLKVLQDNAWFSAIVRVP 233 (301) T ss_pred ccccccCCHHHHHHHHHHHHHHHHHhcCce----ecccEEEecHHHHHhhhhccccCCCCeeHHHHHHHHcCcceEEEcc Confidence 889999999999999999999999999986 4678999999999999864 567999999999999999999999 Q ss_pred ccccccCCCCccEEEEEEcccccccccccCCCcceEeecchhhhccCceeccCceEEecccceeeeeeeccccceeeccC Q lcl|Aclame:pro 309 ELQGGNPDDGKDIAYMFLDSVDTAVDGSTDGGDTWAQLVQSKFVTLGVEKRVKNYVEAYSNATAGVMLKRPWAVVRLIGL 388 (388) Q Consensus 309 el~~a~gtg~~~~~~~~~~~~d~~~~~~~~~~~t~~~~~p~~~r~~~v~~~~~~~~~~~~~~t~G~ii~rP~ai~~~~GI 388 (388) ||+++ +.++++++++|+++ ++.+.+.+|++||+|+++++.++|+++|++|||||+||||+||++++|| T Consensus 234 ~L~~~-g~~g~~~~v~~~~~-----------~d~~~~~v~~~~~~~~~e~~~~~~~~~~~~r~~Gv~i~~P~ai~~~~GI 301 (301) T protein:vir:80 234 DLAGM-GTAGSDSFAVIHDS-----------NETAELIIPMDITRHPEEYSFPRTKVPFEERTAGVVVRFPAAIVRVDGI 301 (301) T ss_pred eeccC-CCCcccEEEEEecC-----------CcEEEEEecCceeeecceecCceeEeeeeeeeEEEEEEccceEEEEecC Confidence 99998 56789999999774 5678999999999999999999999999999999999999999999999 No 12 >protein:vir:107687 Length: 319 # NCBI annotation: hypothetical protein # Family: family:all:463 # MgeID: mge:1518 # MgeName: T1 # Cross-refs: genbank:acc:YP_003898;genbank:gi:45686314;genbank:GeneID:2773027 Probab=100.00 E-value=1.1e-87 Score=497.24 Aligned_cols=316 Identities=15% Similarity=0.149 Sum_probs=277.4 Q ss_pred cccCCHHHHhhcceecccchhhcchhhhhhhhhhhhccCcccccccccccch-HHHHHHHhhcceeeeecccchhhhhhc Q lcl|Aclame:pro 30 LTDMAVRELKKFGLVFDHATVKRQIELLHEGGVATQAFDSAYVAPTTQASIP-TPIQFLQQWLPGFVKVLTSARKIDEIL 108 (388) Q Consensus 30 ~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~amDaa~~~~~t~~~~g-~l~~~l~~idp~v~e~l~~~~~~~~i~ 108 (388) +.++++.+.+...|. .+.+ .+.||.+ ++.++| |++++|++|||++||++++++.++++| T Consensus 1 ~~~~~~~~~~~~~~~-------------~~~~--~~~~~~d-----a~~~~g~~~~~ql~~id~~v~e~~~~~l~~~~~i 60 (319) T protein:vir:10 1 MTTKKFDEADKSNVE-------------MYLI--QAGVKQD-----AAATMGIWTAQELHRIKSQSYEEDYPVGSALRVF 60 (319) T ss_pred CCCcchhHHhhHHHH-------------HHHh--hccchhh-----hhhhhhhHHHHHHHHHHHHHHhhhhcceechhhc Confidence 344444443332111 1111 1223322 223444 899999999999999999999999999 Q ss_pred ccccCCCCceeeEEEeeeccccceEecccc-cCCceeeeeeeeeeeeEEEEEEEEeecHHHHHHHHHhCCChHHHHHHHH Q lcl|Aclame:pro 109 GVKTVGSWEDQEIVQGIVEPAGTAMEYGDL-TNIPLSSWNVNFERRTIVRGEMGIQVGLLEEGRASAMRINSAEVKRQGA 187 (388) Q Consensus 109 ~v~t~g~w~~~t~~~~v~e~~G~a~~ygd~-~diP~~~~n~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~K~~aA 187 (388) |+.++++||+++++|.++|.+|++++|||+ +|+|+++++++++++++++++.+|+|+++||++|+++|++|+++|+.+| T Consensus 61 ~v~~~~~~~~~~~~~~~~~~~G~a~~~~d~~~dip~v~~~~~~~~~~i~~~~~~~~~~~~El~~a~~~g~~l~~~k~~aA 140 (319) T protein:vir:10 61 PVTTELSPTDKTFEYMTFDKVGTAQIIADYTDDLPLVDALGTSEFGKVFRLGNAYLISIDEIKAGQATGRPLSTRKASAC 140 (319) T ss_pred ccccCCCCceEEEEeeeeccccceeeecCccccccceeccceeeEEEEEEEEeeeeecHHHHHHHHHhCCChHHHHHHHH Confidence 999999999999999999999999999996 5789999999999999999999999999999999999999999999999 Q ss_pred HHHHHHhhceEEEEeecCccccceEEEeecCCCccccccccCCcccccccCCHHHHHHHHHHHHHHHHHhcCCeeccccc Q lcl|Aclame:pro 188 AVQLEIMRNAIGFYGWEGKNGNRTFGFLNDPSLLPAIASTTPGGWVSGGANAFQGIVGDLRLMLITLRVQSEDNIDPEDV 267 (388) Q Consensus 188 r~a~~~~~n~i~~~G~a~~~~~g~~GllN~P~l~a~~~~~~~~~~t~Wa~kT~~eI~~DI~~~~~~l~~~s~g~v~~~~~ 267 (388) ++++++++|+++|+|+++ .|+|||||+||+++.. .+.|+.|++||+|||++||++++++++.+|+|.+ . T Consensus 141 ~~~~~~~~n~i~f~G~~~---~g~~GLlN~p~~~~~~----~~~~~~~~t~t~~~i~~di~~~~~~l~~~s~g~~----~ 209 (319) T protein:vir:10 141 QLAHDQLVNRLVFKGSAP---HKIVSVFNHPNITKIT----SGKWIDVSTMKPETAEAELTQAIETIETITRGQH----R 209 (319) T ss_pred HHHHHHhhceEEEeeccc---ccceeEEeCCCceeee----cCCCCCccccCHHHHHHHHHHHHHHHHHhcCcee----e Confidence 999999999999999875 5899999999997543 3456679999999999999999999999999874 6 Q ss_pred cceEEcCHHHHHhhcc-CCCcCccHHHHHHHhCCccEEEEccccccccCCCCccEEEEEEcccccccccccCCCcceEee Q lcl|Aclame:pro 268 DITLVLPMNKVDMLSV-VTDLGISVRDWLKQTYPRVRVMSAPELQGGNPDDGKDIAYMFLDSVDTAVDGSTDGGDTWAQL 346 (388) Q Consensus 268 p~tL~Lp~~~~~~Ls~-~~~~~~Tvl~~lk~n~pnl~i~~~pel~~a~gtg~~~~~~~~~~~~d~~~~~~~~~~~t~~~~ 346 (388) |++|+|||+++.+|++ .+.+|+|+++|||+||||++|+++|||+++ +.++++++++|.++ ++.+.++ T Consensus 210 p~~L~L~p~~~~~L~~~~~~~~~t~l~~lk~~~~~l~I~~~pel~~a-g~~g~~~~v~y~~~-----------~~~~~~~ 277 (319) T protein:vir:10 210 ATNILIPPSMRKVLAIRMPETTMSYLDYFKSQNSGIEIDSIAELEDI-DGAGTKGVLVYEKN-----------PMNMSIE 277 (319) T ss_pred ceEEEecHHHHHhhhcccCCCCeeHHHHHHHhcCCceEEEeeeeccc-CCCcceEEEEEecC-----------CceEEEe Confidence 7899999999999975 467899999999999999999999999998 46678999999765 5689999 Q ss_pred cchhhhccCceeccCceEEecccceeeeeeeccccceeeccC Q lcl|Aclame:pro 347 VQSKFVTLGVEKRVKNYVEAYSNATAGVMLKRPWAVVRLIGL 388 (388) Q Consensus 347 ~p~~~r~~~v~~~~~~~~~~~~~~t~G~ii~rP~ai~~~~GI 388 (388) ||++||+|++|++.++|+++|++|||||+||||+||++++|| T Consensus 278 v~~~~~~~~~e~~~l~~~~~~~~r~~Gv~i~~P~ai~~~dGI 319 (319) T protein:vir:10 278 IPEAFNMLPAQPKDLHFKVPCTSKCTGLTIYRPMTIVLITGV 319 (319) T ss_pred cCcceeeeeeeecCceEEEeeeeeeEEEEEEccceeEeeecC Confidence 999999999999999999999999999999999999999999 No 13 >protein:vir:5255 Length: 304 # NCBI annotation: hypothetical protein # Family: family:all:463 # MgeID: mge:117 # MgeName: Aaphi23 # Cross-refs: genbank:acc:NP_852760;genbank:gi:31544035;uniprot:Q7Y5U0;genbank:GeneID:2753552 Probab=100.00 E-value=5.4e-85 Score=482.56 Aligned_cols=292 Identities=12% Similarity=0.062 Sum_probs=266.5 Q ss_pred cccchHHHHHHHhhcceeeeecccchhhhhhcccccCCCCceeeEEEeeeccccceE--eccc-ccCCceeeeeeeeeee Q lcl|Aclame:pro 77 QASIPTPIQFLQQWLPGFVKVLTSARKIDEILGVKTVGSWEDQEIVQGIVEPAGTAM--EYGD-LTNIPLSSWNVNFERR 153 (388) Q Consensus 77 ~~~~g~l~~~l~~idp~v~e~l~~~~~~~~i~~v~t~g~w~~~t~~~~v~e~~G~a~--~ygd-~~diP~~~~n~~~~~~ 153 (388) ++.++|++++|+++|++|||++++++.++++||++++++||+++++|+++|.+|+++ ++++ .+|+|++|++++++++ T Consensus 1 ~~~lafl~~qL~~id~~vye~~~~~~~~~~lipv~t~~~~~~~~~~~~~~d~~G~a~~~~i~~~a~dip~vd~~~~~~~~ 80 (304) T protein:vir:52 1 MSLLAYVKNGLTAVSKDIAETKYPEIVFPQFVYVDQQTAVGITEKLHYGADEHGSLDDGLITVGTSTLDQVEVGFTPTRS 80 (304) T ss_pred CchHHHHHHHHHHHhhhhhccccccchhhhhccccCCCCcccceEEEeeeeccCcccccccCCcCCccceeecccceeEE Confidence 567889999999999999999999999999999999999999999999999999998 4454 6899999999999999 Q ss_pred eEEEEEEEEeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEEeecCccccceEEEeecCCCccccccccCCccc Q lcl|Aclame:pro 154 TIVRGEMGIQVGLLEEGRASAMRINSAEVKRQGAAVQLEIMRNAIGFYGWEGKNGNRTFGFLNDPSLLPAIASTTPGGWV 233 (388) Q Consensus 154 ~v~~~~~~~~y~~~El~~A~~~g~~l~~~K~~aAr~a~~~~~n~i~~~G~a~~~~~g~~GllN~P~l~a~~~~~~~~~~t 233 (388) ++++++.+|+|+++||++|+++|++|+++|+++||+++++++|+++|||+++ .+|++||||||||+...++ +.+.++ T Consensus 81 ~i~~~~~~~~y~~~El~~a~~~g~~l~~~ka~aa~~a~~~~~n~v~~~Gd~~--~~g~~GllN~p~v~~~~~~-~~~a~~ 157 (304) T protein:vir:52 81 YIVPWAKSVTWTKPELEQGKLLGLALNTAKIMALNKNAQQTLQKVAFLGHAK--DSRLTGLLNNKSVEVYAIK-GAAQNT 157 (304) T ss_pred EEEEEeeeeeecHHHHHHHHHhCCCcHHHHHHHHHHHHHhhhceEEEEeecc--ccceEEEEeCCCcceeeec-CCccCC Confidence 9999999999999999999999999999999999999999999999999874 2589999999999876543 333457 Q ss_pred ccccCCHHHHHHHHHHHHHHHHHhcCCeeccccccceEEcCHHHHHhhcc--CCCcCccHHHHHHHhCC-----ccEEEE Q lcl|Aclame:pro 234 SGGANAFQGIVGDLRLMLITLRVQSEDNIDPEDVDITLVLPMNKVDMLSV--VTDLGISVRDWLKQTYP-----RVRVMS 306 (388) Q Consensus 234 ~Wa~kT~~eI~~DI~~~~~~l~~~s~g~v~~~~~p~tL~Lp~~~~~~Ls~--~~~~~~Tvl~~lk~n~p-----nl~i~~ 306 (388) +|++||++||++||+++++++|.+|+|. ++|++|+|||+++.+|+. .+++|+|+|+||++||| +|+|+. T Consensus 158 ~w~~~T~~eI~~di~~~~~~i~~~s~~~----~~p~tl~Lpp~~~~~l~~~~~~~~~~Tvl~~l~~n~~~~~g~~l~I~~ 233 (304) T protein:vir:52 158 KVQAMDFDKAVAFFKEIFLKGMEKTKRI----EAPNTFAIDSLDLAHLALVQRANTDTTALEFLTKHLSAAAGRQVAIKA 233 (304) T ss_pred ccccCCHHHHHHHHHHHHHHHHhccCce----ecCceEEeCHHHHHHHhhccCCCCCchHHHHHHHhcccccCCcceEEE Confidence 7999999999999999999999999985 578899999999999964 46789999999999988 778999 Q ss_pred ccc-cccccCCCCccEEEEEEcccccccccccCCCcceEeecchhhhccCceeccC-ceEEecccceeeeeeecccccee Q lcl|Aclame:pro 307 APE-LQGGNPDDGKDIAYMFLDSVDTAVDGSTDGGDTWAQLVQSKFVTLGVEKRVK-NYVEAYSNATAGVMLKRPWAVVR 384 (388) Q Consensus 307 ~pe-l~~a~gtg~~~~~~~~~~~~d~~~~~~~~~~~t~~~~~p~~~r~~~v~~~~~-~~~~~~~~~t~G~ii~rP~ai~~ 384 (388) +|+ +.++ |.||+++|++|.++ ++.+.+.+|++++.|+++++.. .|++||.+|||||.||||++++| T Consensus 234 v~~~~~~~-g~~g~~r~vvY~~d-----------~~~~~~~vP~p~~~l~~q~~~~~~~~vp~~~r~gGv~v~~P~a~~y 301 (304) T protein:vir:52 234 LPSNYGTR-VTDGKTRAMVYVNS-----------KEHVIFDVPMSPTVLDAQPKGLLAFESGLRMAFGGVTFMEPDSALY 301 (304) T ss_pred eccccccc-CCCCceEEEEEecC-----------hhheEEecCccccccchhhcCCceEEecceeeeeeEEEEccceeee Confidence 984 5554 77889999999776 4567888999999999999986 79999999999999999999999 Q ss_pred ecc Q lcl|Aclame:pro 385 LIG 387 (388) Q Consensus 385 ~~G 387 (388) .|= T Consensus 302 ~D~ 304 (304) T protein:vir:52 302 VDY 304 (304) T ss_pred ecC Confidence 999 No 14 >protein:vir:103285 Length: 296 # NCBI annotation: hypothetical protein # Family: family:all:463 # MgeID: mge:1605 # MgeName: JK06 # Cross-refs: genbank:acc:YP_277465;genbank:gi:71834107;genbank:GeneID:3562396 Probab=100.00 E-value=6.2e-85 Score=482.23 Aligned_cols=291 Identities=13% Similarity=0.113 Sum_probs=268.1 Q ss_pred hccCcccccccccccchHHHHHHHhhcceeeeecccchhhhhhcccccCCCCceeeEEEeeeccccceEecccc-cCCce Q lcl|Aclame:pro 65 QAFDSAYVAPTTQASIPTPIQFLQQWLPGFVKVLTSARKIDEILGVKTVGSWEDQEIVQGIVEPAGTAMEYGDL-TNIPL 143 (388) Q Consensus 65 ~amDaa~~~~~t~~~~g~l~~~l~~idp~v~e~l~~~~~~~~i~~v~t~g~w~~~t~~~~v~e~~G~a~~ygd~-~diP~ 143 (388) |.||.+. ..+.|++++|++|||+|||++++++.++++||+.+.++||+++++|+++|.+|++++|||+ +|+|+ T Consensus 1 ~~~~~a~------~~~~f~~~ql~~id~~v~e~~~~~l~~~~~i~v~~~~~~~~~~~~~~~~~~~G~a~~~~~~~~dip~ 74 (296) T protein:vir:10 1 MGVDKAD------AAGIWTVKQLTASLNKAYETEYDQNSVVNLFPVSNEIPGYAKYFEYPVFDGVGIAQIVADYTDDLPL 74 (296) T ss_pred Ccccchh------hhHHHHHHHHHHHHHHHHhhhhcccccceecccccCCCCceeEEEeeeeeccCceeEeCCCccccce Confidence 7888652 2456999999999999999999999999999999999999999999999999999999996 67999 Q ss_pred eeeeeeeeeeeEEEEEEEEeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEEeecCccccceEEEeecCCCccc Q lcl|Aclame:pro 144 SSWNVNFERRTIVRGEMGIQVGLLEEGRASAMRINSAEVKRQGAAVQLEIMRNAIGFYGWEGKNGNRTFGFLNDPSLLPA 223 (388) Q Consensus 144 ~~~n~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~K~~aAr~a~~~~~n~i~~~G~a~~~~~g~~GllN~P~l~a~ 223 (388) ++++++++++++++++.+|+|+++||++|+++|++|+++|+.+|++++++++|+++|+|+++ +|++||||+|+++.. T Consensus 75 v~~~~~~~~~~i~~~~~~~~~~~~El~~a~~~g~~l~~~ka~aA~~~~~~~~n~~~f~G~~~---~g~~GLlN~p~v~~~ 151 (296) T protein:vir:10 75 VDALATERQGKVFRFGNAFLISIDEIKVGQATGQSLSTRKQSLAFEAHDKLLDKLVWSGSTA---HGIPSVFDYPNINNV 151 (296) T ss_pred eeccceeEEEEEEEEEeeeeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEeeccc---ccceeEeecCCCccc Confidence 99999999999999999999999999999999999999999999999999999999999875 589999999999653 Q ss_pred cccccCCcccccccCCHHHHHHHHHHHHHHHHHhcCCeeccccccceEEcCHHHHHhhccC-CCcCccHHHHHHHhCCcc Q lcl|Aclame:pro 224 IASTTPGGWVSGGANAFQGIVGDLRLMLITLRVQSEDNIDPEDVDITLVLPMNKVDMLSVV-TDLGISVRDWLKQTYPRV 302 (388) Q Consensus 224 ~~~~~~~~~t~Wa~kT~~eI~~DI~~~~~~l~~~s~g~v~~~~~p~tL~Lp~~~~~~Ls~~-~~~~~Tvl~~lk~n~pnl 302 (388) .+ +++|+++ +||++||++++++++.+|+|.+ .|++|+|||+++.+|+++ +.+|+|+++||++||||+ T Consensus 152 ~~------~~~W~~~--t~i~~Di~~~~~~l~~~s~g~~----~p~~l~L~p~~~~~L~~~~~~~~~t~l~~ik~~~~~l 219 (296) T protein:vir:10 152 VS------GGSWSQP--TTAVSDITSLLDIIETSTNGQH----RATHLLLPTTARRIMQNLVPGTSVSYGEFFRQNNSGV 219 (296) T ss_pred cc------cCCccCH--HHHHHHHHHHHHHHHHhhCcee----cceeEEeCHHHHHHHhhccCCCCccHHHHHHHhcCCc Confidence 22 2358655 4999999999999999999874 567999999999999865 778999999999999999 Q ss_pred EEEEccccccccCCCCccEEEEEEcccccccccccCCCcceEeecchhhhccCceeccCceEEecccceeeeeeeccccc Q lcl|Aclame:pro 303 RVMSAPELQGGNPDDGKDIAYMFLDSVDTAVDGSTDGGDTWAQLVQSKFVTLGVEKRVKNYVEAYSNATAGVMLKRPWAV 382 (388) Q Consensus 303 ~i~~~pel~~a~gtg~~~~~~~~~~~~d~~~~~~~~~~~t~~~~~p~~~r~~~v~~~~~~~~~~~~~~t~G~ii~rP~ai 382 (388) +|+++|||+++ +.++++++++|.++ ++.+.+++|++||+|++|++.++|+++|++|||||+||||.|| T Consensus 220 ~i~~~~~l~~a-~~~g~~~~v~~~~~-----------~~~~~~~v~~~~~~~~~e~~~l~~~~~~~~~~~Gv~i~~P~ai 287 (296) T protein:vir:10 220 TVEFVQYLNDY-NGTGTSAAIAYEKD-----------PNNMAIEIPEATNALPAQPKDLHFKIPVTSKATGLIVYRPLTM 287 (296) T ss_pred eEEEeeeeccC-CCCcceEEEEEEcC-----------CceEEEEcCcceeeecccccCceEEEeeEeeEEEEEEECCcee Confidence 99999999998 45678999999764 5688999999999999999999999999999999999999999 Q ss_pred eeeccC Q lcl|Aclame:pro 383 VRLIGL 388 (388) Q Consensus 383 ~~~~GI 388 (388) ++++|| T Consensus 288 ~~~dGI 293 (296) T protein:vir:10 288 AVMKGI 293 (296) T ss_pred EEEeee Confidence 999999 No 15 >protein:vir:7771 Length: 330 # NCBI annotation: gp17 # Family: family:all:507 # MgeID: mge:149 # MgeName: Bxz2 # Cross-refs: genbank:acc:NP_817605;genbank:gi:29566035;genbank:GeneID:1259229 Probab=98.52 E-value=6.7e-09 Score=65.40 Aligned_cols=297 Identities=12% Similarity=0.024 Sum_probs=165.7 Q ss_pred hhhhccCcccccccccccchH-HHHHHHhhcceeeeecccchhhhhhcccccCCCCceeeEEEeeeccccceEecccccC Q lcl|Aclame:pro 62 VATQAFDSAYVAPTTQASIPT-PIQFLQQWLPGFVKVLTSARKIDEILGVKTVGSWEDQEIVQGIVEPAGTAMEYGDLTN 140 (388) Q Consensus 62 ~~~~amDaa~~~~~t~~~~g~-l~~~l~~idp~v~e~l~~~~~~~~i~~v~t~g~w~~~t~~~~v~e~~G~a~~ygd~~d 140 (388) |+...+.+... ..|..+.++ +-.+. .++++.+.+....+++.++..... ....|++.+..+.+...+.... T Consensus 1 m~~~~~~a~~~-~~t~~~g~~i~~~~~----~~ii~~~~~~s~l~~~~~~~~~~~---~~~~~p~~~~~~~a~~v~Eg~~ 72 (330) T protein:vir:77 1 MAGSTVPSTQV-ALTGDFSAFLTPEQS----QDYFAEIEKTSIVQRIARKVPMGP---TGISIPHWTGAVSASWTGEAER 72 (330) T ss_pred Ccccccchhhc-cccCCCcceechhHH----HHHHHHHHhccchhhhcceeeccC---CceEEEEEcCCcceeEecCCCc Confidence 34444444322 123333333 32222 356666666666667665544322 3356788877788888899999 Q ss_pred CceeeeeeeeeeeeEEEEEEEEeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEEeecCccccceEEEeecCCC Q lcl|Aclame:pro 141 IPLSSWNVNFERRTIVRGEMGIQVGLLEEGRASAMRINSAEVKRQGAAVQLEIMRNAIGFYGWEGKNGNRTFGFLNDPSL 220 (388) Q Consensus 141 iP~~~~n~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~K~~aAr~a~~~~~n~i~~~G~a~~~~~g~~GllN~P~l 220 (388) +|..+...+......+.++..+.++.+-++. ...++.+.-.....+++...+|+-.++|+.. .....|++|++.. T Consensus 73 ~~~~~~~f~~i~~~~~k~~~~~~is~ell~d---s~~~~~~~i~~~l~~ai~~~~~~~~l~G~g~--~~~~~g~~~~~~~ 147 (330) T protein:vir:77 73 KPITKGSFGKQELEPVKITTIFAESAEVVRL---NPLNYLNTMRTKIAEAIALKFDAAAIHGIDK--PSAFKGYLAETTK 147 (330) T ss_pred cccccceeeEEEEeEEEEEEeehhhHHHHhc---chHHHHHHHHHHHHHHHHHHHHHHhhcccCC--CCccccccccccc Confidence 9999999999999999999999998854433 3567889999999999999999999999864 3467899998643 Q ss_pred ccccccccCCcccccccCCHHHHHHHHHHHHHHHHHhcCCeeccccccceEEcCHHHHHhhccC-CCcCccHHHHHHH-- Q lcl|Aclame:pro 221 LPAIASTTPGGWVSGGANAFQGIVGDLRLMLITLRVQSEDNIDPEDVDITLVLPMNKVDMLSVV-TDLGISVRDWLKQ-- 297 (388) Q Consensus 221 ~a~~~~~~~~~~t~Wa~kT~~eI~~DI~~~~~~l~~~s~g~v~~~~~p~tL~Lp~~~~~~Ls~~-~~~~~Tvl~~lk~-- 297 (388) ......+ ...-.+++...+++||.+++..+...-. .+..++|.+..+..|.+. +..|.-++.--.. T Consensus 148 ~~~~~~~----~~~~~~~~~~~~~~~l~~~~~~~~~~~~-------~~~~~vmn~~~~~~l~~lkd~~G~~l~~~~~~~~ 216 (330) T protein:vir:77 148 VVSLADT----NLTTASGPQGNAYLAVNNALSLLVNSGK-------KWTGTLLDNVTEPILNTAVDGNGRPLFVESTYTE 216 (330) T ss_pred cceeecc----cccccccccchhHHHHHHHHHhhhhcCC-------CccEEEEcHHHHHHHHHHhccCCceeecCccccc Confidence 2211111 1112334556678999999888775521 234789999999988542 3334333211000 Q ss_pred ---hCCccEEEEcc-----ccccccCCCCccEEEEEEcccccccccccCCCcceEeecchhhhc-----------cCc-e Q lcl|Aclame:pro 298 ---TYPRVRVMSAP-----ELQGGNPDDGKDIAYMFLDSVDTAVDGSTDGGDTWAQLVQSKFVT-----------LGV-E 357 (388) Q Consensus 298 ---n~pnl~i~~~p-----el~~a~gtg~~~~~~~~~~~~d~~~~~~~~~~~t~~~~~p~~~r~-----------~~v-~ 357 (388) ...+.++...| .+.. ++.+....+++. +.......... .-+++. ..+..-. .++ - T Consensus 217 ~~~~~~~~~l~G~PV~~~~~~p~--~~~~~~~~~~~g-d~s~~~i~~~~-~~~i~~-~~e~~~~~~~~~~~~~~~~~~~~ 291 (330) T protein:vir:77 217 QVGAIREGRILGRPTYVADNVVN--GTVGNRVVGVMG-DFSQVIWGQIG-GLSFDV-TDQATLDFGEEQGGVWVPKLISL 291 (330) T ss_pred cccccCCceecceeeEEeccccC--CCCCCccEEEEE-ecceEEEEEec-CcEEEE-eecceeeecccccccccccccch Confidence 01112333222 2221 222222333322 11111111000 001110 0110000 000 0 Q ss_pred eccCceEEecccceeeeeeeccccceeeccC Q lcl|Aclame:pro 358 KRVKNYVEAYSNATAGVMLKRPWAVVRLIGL 388 (388) Q Consensus 358 ~~~~~~~~~~~~~t~G~ii~rP~ai~~~~GI 388 (388) ...-...+.+..|.++.+ .+|-||+++.+. T Consensus 292 f~~~~~~~r~~~r~d~~v-~~~~a~~~i~~~ 321 (330) T protein:vir:77 292 WQHNMVAVRCEAEFAFMV-NDKDAFVKLTDQ 321 (330) T ss_pred hhcCcEEEEEEEEeccEE-ecccceEEEEec Confidence 111124556666776666 559999999999 No 16 >protein:vir:105778 Length: 358 # NCBI annotation: gp9 # Family: family:all:10995 # MgeID: mge:1501 # MgeName: ES18 # Cross-refs: genbank:acc:YP_224147;genbank:gi:62362222;genbank:GeneID:3342531 Probab=98.48 E-value=3.6e-09 Score=66.87 Aligned_cols=332 Identities=12% Similarity=0.021 Sum_probs=176.6 Q ss_pred CCCcceeeeecCccccchhhhhhcccccccccCCHHHHhhcceecccchhhcchhhhhhhhhhhhccCccccccc-cccc Q lcl|Aclame:pro 1 MKQLSKVHQSLAGRSVRAFDMANGKADYRLTDMAVRELKKFGLVFDHATVKRQIELLHEGGVATQAFDSAYVAPT-TQAS 79 (388) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~amDaa~~~~~-t~~~ 79 (388) |-= || .+..-+.+.+-.|.+=...... .+....+++|-+++...+. ...| T Consensus 1 ~~f-~K--------------------------~~~an~~~~~~qw~~L~~~Rna--~n~~~~a~maan~a~~~~~~~~~N 51 (358) T protein:vir:10 1 MYF-SK--------------------------ETLATNSRLGGHWNELWANRNM--WNAQHDAMIAANRSNMTPEWLAVN 51 (358) T ss_pred Cee-ch--------------------------hhhhhHHHHHHHHHHHHHHHHH--hhhhhhhHHhhhHHHhhhhhheec Confidence 111 11 0111122222122211110000 0000112222222221111 1222 Q ss_pred --chHHHHHHHhhcceeeeeccc---chhhhhhcccccCCCCceeeEEEeeecc-ccceEe--cccc-cCCceeeeeeee Q lcl|Aclame:pro 80 --IPTPIQFLQQWLPGFVKVLTS---ARKIDEILGVKTVGSWEDQEIVQGIVEP-AGTAME--YGDL-TNIPLSSWNVNF 150 (388) Q Consensus 80 --~g~l~~~l~~idp~v~e~l~~---~~~~~~i~~v~t~g~w~~~t~~~~v~e~-~G~a~~--ygd~-~diP~~~~n~~~ 150 (388) .+|+..+--.||..+.+.-.+ -.-..+|.|+.++-+-......|.+..- .|++.. .|.. ...-- +.+++ T Consensus 52 Av~~v~~D~wr~~D~~~~q~fr~e~~~~l~NDLm~ls~sv~Igktv~~y~~~gd~~~~v~~SmsGQ~~~~lD~--~~y~~ 129 (358) T protein:vir:10 52 AVGGFTRDFWAEIDRQVLQLRDQEVGMEIVNDLIGVQTVLPVGKTAKLYNVIGDIADDVSVSIDGQAPFSFDH--TEYAS 129 (358) T ss_pred ccccCCHHHHHHHhhhhhhhcccchhHHHHhhhhhccccccHHHHHHHHhhhcCCCceEEEEecccCcccccc--eeeec Confidence 347777778888888776555 4456688888887776655555666533 666642 2432 12222 22333 Q ss_pred eeeeEEEEEEEEeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEEeecCccc--cceEEEeecCCCcccc-ccc Q lcl|Aclame:pro 151 ERRTIVRGEMGIQVGLLEEGRASAMRINSAEVKRQGAAVQLEIMRNAIGFYGWEGKNG--NRTFGFLNDPSLLPAI-AST 227 (388) Q Consensus 151 ~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~K~~aAr~a~~~~~n~i~~~G~a~~~~--~g~~GllN~P~l~a~~-~~~ 227 (388) .-.++=-+-.||..+++|+.-.+--|+++..+-+....+.+.+++-..+|-|+.+..- +-+|||-||||+-... .+. T Consensus 130 dGtpiPIfdsg~~f~WR~~~~~~~~g~d~~~daQ~~~~~kv~~~~vdy~lNG~~~I~v~g~t~~Glrn~~n~~qv~l~~~ 209 (358) T protein:vir:10 130 DGDPIPVFTAGYGVNWRHAAGLNSLGIDLVLDSQMAKMRKFNQKRVNYYLNGDPNIQVQSYPAQGIKNHRNTKKINLGSG 209 (358) T ss_pred cCCEeeeeccCccccccchhhcCccccchhHHHHHHHHHHHHHHHHhhhhccCCceeecCcccccccCCcceeEEEeccC Confidence 3344444567888888999888889999999999999999999999999999875322 3589999999974321 111 Q ss_pred cCCcccccccCCHHHHHHHH-HHHHHHHHHhcCCeeccccccceEEcCHHHHHhhccC-CCc---CccHHHHHHHhCCcc Q lcl|Aclame:pro 228 TPGGWVSGGANAFQGIVGDL-RLMLITLRVQSEDNIDPEDVDITLVLPMNKVDMLSVV-TDL---GISVRDWLKQTYPRV 302 (388) Q Consensus 228 ~~~~~t~Wa~kT~~eI~~DI-~~~~~~l~~~s~g~v~~~~~p~tL~Lp~~~~~~Ls~~-~~~---~~Tvl~~lk~n~pnl 302 (388) +.+-.-...++|+++++.-. ..++..+-.... +. .-.++..+|+.+..|.++ ... +-|||+++++--+=- T Consensus 210 s~g~NiDlttat~~a~~~~f~~~l~~~~~~~N~--~~---~~~~~~vs~ei~~n~~r~Y~~~~~~~gTIl~~vl~~~~va 284 (358) T protein:vir:10 210 SGGANIDLTTADMTALFAFFGKGAFGTLARANK--VA---QYDVMWVSPEIWANLAQPYVVNGVVSGNVLNAVLPFAPVR 284 (358) T ss_pred CCcceeeeccCCHHHHHHHHHHHHHHHHHhhcc--cc---eeeEEEEcHHHHhhhhcccccccccchhhHHHhhcccCcc Confidence 12223357889998888887 677887776664 22 234899999999999874 332 349999997654434 Q ss_pred EEEEccccccccCCCCccEEEEEEcccccccccccCCCcceEeecchhhhccCceeccCceEEeccccee---eeeeecc Q lcl|Aclame:pro 303 RVMSAPELQGGNPDDGKDIAYMFLDSVDTAVDGSTDGGDTWAQLVQSKFVTLGVEKRVKNYVEAYSNATA---GVMLKRP 379 (388) Q Consensus 303 ~i~~~pel~~a~gtg~~~~~~~~~~~~d~~~~~~~~~~~t~~~~~p~~~r~~~v~~~~~~~~~~~~~~t~---G~ii~rP 379 (388) .|.+.+.|. ++-++.|+..-+. ..-.-++......-| |.. +.-+|.=++| |+.|++= T Consensus 285 ~I~~~~~Ls-------gNeii~~~~~~~v--i~plvG~~~gt~~~p---R~~--------p~ddY~f~vwsA~glqik~D 344 (358) T protein:vir:10 285 EIRQTFALS-------GNEFIAYVRRQDI--ISPLVGMAVGVVPLP---RPL--------PNVNYNFQIMSAEGLQITAD 344 (358) T ss_pred cccccccCC-------CccEEEEEeCCce--eeeeecceeeeecCC---CCC--------CCcchhhhhhhhhceeeeec Confidence 577777775 4455566544211 000011111111111 111 1112222222 3334322 Q ss_pred c----cceeeccC Q lcl|Aclame:pro 380 W----AVVRLIGL 388 (388) Q Consensus 380 ~----ai~~~~GI 388 (388) . .|.+..-+ T Consensus 345 ~~Gks~Vv~~~~~ 357 (358) T protein:vir:10 345 DQGLSGVVYGANL 357 (358) T ss_pred cccceeeEeeccc Confidence 1 11222222 No 17 >protein:vir:1638 Length: 298 # NCBI annotation: Structural protein # Family: family:all:966 # MgeID: mge:33 # MgeName: r1t # Cross-refs: genbank:acc:NP_695059;genbank:gi:23455750;genbank:GeneID:955469 Probab=98.37 E-value=2e-08 Score=62.84 Aligned_cols=279 Identities=9% Similarity=-0.049 Sum_probs=148.3 Q ss_pred hccCcccccccccccchHHHHHHHhhcceeeeecccchhhhhhcccccCCCCceeeEEEeeeccccceEecccccCCcee Q lcl|Aclame:pro 65 QAFDSAYVAPTTQASIPTPIQFLQQWLPGFVKVLTSARKIDEILGVKTVGSWEDQEIVQGIVEPAGTAMEYGDLTNIPLS 144 (388) Q Consensus 65 ~amDaa~~~~~t~~~~g~l~~~l~~idp~v~e~l~~~~~~~~i~~v~t~g~w~~~t~~~~v~e~~G~a~~ygd~~diP~~ 144 (388) || +.++.-+|-.+.+ +|++.+......+++.++..... ....+++....+.+..++...++|.. T Consensus 1 ma---------~~gG~lvp~~~~~----~ii~~~~~~s~i~~l~~~~~~~~---~~~~ip~~~~~~~a~~v~E~~~~~~~ 64 (298) T protein:vir:16 1 MV---------LNKGTLFDPTLVT----DLISKVAGKSSIARLSAQKPIPF---NGEKVFTFTMDSEIDVVAESGKKTHG 64 (298) T ss_pred Cc---------ccCcceechhHHH----HHHHHHHhhhhhhhhcceeeccC---CceEEEEEecCcceEEecCCcccccc Confidence 22 1122224433333 45555555555566655443322 33567787778888899999999999 Q ss_pred eeeeeeeeeeEEEEEEEEeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEEeecCc--cccceEEEeecCCCcc Q lcl|Aclame:pro 145 SWNVNFERRTIVRGEMGIQVGLLEEGRASAMRINSAEVKRQGAAVQLEIMRNAIGFYGWEGK--NGNRTFGFLNDPSLLP 222 (388) Q Consensus 145 ~~n~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~K~~aAr~a~~~~~n~i~~~G~a~~--~~~g~~GllN~P~l~a 222 (388) +...+......+.++..+.++.+=++.......++.+.-+...++++.+.+++-.++|.... ...+..|+....+... T Consensus 65 ~~~f~~v~l~~~k~a~~~~iS~ell~~s~d~~~~l~~~i~~~la~ai~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~ 144 (298) T protein:vir:16 65 GVTLAPQTMVPIKVEYGARISDEFMYASDEEKINILQEFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVT 144 (298) T ss_pred ccceeEEEEeeeeEEEeehhhHHHhhcCcccHHHHHHHHHHHHHHHHHHHHHHHhhccccCCCCcccccccccccccccc Confidence 99999999999999999998885554444455678888888899999999999999996321 1123334333222111 Q ss_pred ccccccCCcccccccCCHHHHHHHHHHHHHHHHHhcCCeeccccccceEEcCHHHHHhhccC-CCcCccHHHHHHHh--- Q lcl|Aclame:pro 223 AIASTTPGGWVSGGANAFQGIVGDLRLMLITLRVQSEDNIDPEDVDITLVLPMNKVDMLSVV-TDLGISVRDWLKQT--- 298 (388) Q Consensus 223 ~~~~~~~~~~t~Wa~kT~~eI~~DI~~~~~~l~~~s~g~v~~~~~p~tL~Lp~~~~~~Ls~~-~~~~~Tvl~~lk~n--- 298 (388) .. .. ........++||.+++..+...-. .+..++|.+..+..|.+. +..|.-++.-.-.+ T Consensus 145 ~~--------~~-~~~~~~~~~~~i~~~~~~~~~~~~-------~~~~~vmn~~~~~~l~~lkd~~G~~i~~~~~~~~~~ 208 (298) T protein:vir:16 145 QK--------VE-APRGIADPNGAIENAVELLTGVDA-------DVTGIAINPSFRSALAKQKDLQDNALFPELKWGATP 208 (298) T ss_pred cc--------cc-cccccccHHHHHHHHHHHhhhcCC-------CccEEEEcHHHHHHHHHhhccCCCeeecCcccCCCC Confidence 10 00 111223357789999888775421 233689999999988653 34454443211111 Q ss_pred --CCccEEEEccccccccCCCCccEEEEEEccccccc-ccccCCCcceEeecch---------hhhccCceeccCceEEe Q lcl|Aclame:pro 299 --YPRVRVMSAPELQGGNPDDGKDIAYMFLDSVDTAV-DGSTDGGDTWAQLVQS---------KFVTLGVEKRVKNYVEA 366 (388) Q Consensus 299 --~pnl~i~~~pel~~a~gtg~~~~~~~~~~~~d~~~-~~~~~~~~t~~~~~p~---------~~r~~~v~~~~~~~~~~ 366 (388) ..++.++....+... .+++.. .+|.-+..... ....+ .-+++. .+. -|+...+ ... T Consensus 209 ~~l~G~PV~~~~~v~~~-~~~~~~--~~~~GDfs~~~~~~~~~-~~~~~~-~~~~~~~~~~~~~f~~~~v-------~~r 276 (298) T protein:vir:16 209 DTINGLPVDVNKTVSDM-SLTQRD--RAIIGDFANGFKWGYAK-EVPLEV-IQYGDPDNSGLDLKGYNQV-------YIR 276 (298) T ss_pred ceecceeeEEecccccc-cCCCcc--EEEEeeccceEEEEEec-CceEEE-eeccCCcCcchhhhhcCcE-------EEE Confidence 111222222222221 112222 22222211110 00000 000100 000 0111111 122 Q ss_pred cccceeeeeeeccccceeeccC Q lcl|Aclame:pro 367 YSNATAGVMLKRPWAVVRLIGL 388 (388) Q Consensus 367 ~~~~t~G~ii~rP~ai~~~~GI 388 (388) +..| .|..+.+|-||+++.|. T Consensus 277 a~~r-~d~~v~~~~a~~~l~~a 297 (298) T protein:vir:16 277 AELF-LGWGILDATKFARVTEA 297 (298) T ss_pred EEEE-EccEeecccceEEEeec Confidence 2333 45667779999999999 No 18 >protein:vir:1433 Length: 435 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:30 # MgeName: phiE125 # Cross-refs: genbank:acc:NP_536362;genbank:gi:17975167;genbank:GeneID:929171 Probab=98.27 E-value=1.3e-07 Score=58.33 Aligned_cols=348 Identities=10% Similarity=0.027 Sum_probs=153.5 Q ss_pred CCCcce------eeeecCccccchhhhhhcccccccccCCHHHHhhcceecccchh--hcchhhhhhhhh---hhhccCc Q lcl|Aclame:pro 1 MKQLSK------VHQSLAGRSVRAFDMANGKADYRLTDMAVRELKKFGLVFDHATV--KRQIELLHEGGV---ATQAFDS 69 (388) Q Consensus 1 ~~~~~~------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~g~~~~~~~~--~~~~~~~~~~~~---~~~amDa 69 (388) +.+... .+........++... ...-.++.++-..+-.+.. ....+....... ...++.. T Consensus 65 ~~~~~~~~~~~~~~~~~~~~~~~~~~~----------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 134 (435) T protein:vir:14 65 AAVPVDPNPTAVAAPAAAPVHAQPKAL----------EVKGAKMARMVRALAAARGDAQLASKLAIERGFGEEVAMSLNT 134 (435) T ss_pred hcccccchhhhhhhccccccccccchh----------hhhHHHHHHHHHHHHhhcchhhHHHHHHHhhhhhhhhhhhccc Confidence 111000 000000000000000 0000000010000000000 000000000000 0111111 Q ss_pred ccccccccccchHHHHHHHhhcceeeeecccchhhhhhc-ccccCCCCceeeEEEeeeccccceEecccccCCceeeeee Q lcl|Aclame:pro 70 AYVAPTTQASIPTPIQFLQQWLPGFVKVLTSARKIDEIL-GVKTVGSWEDQEIVQGIVEPAGTAMEYGDLTNIPLSSWNV 148 (388) Q Consensus 70 a~~~~~t~~~~g~l~~~l~~idp~v~e~l~~~~~~~~i~-~v~t~g~w~~~t~~~~v~e~~G~a~~ygd~~diP~~~~n~ 148 (388) .+..+.|+++- +.+..+|++.+....-.+.+. .+.+ .....+.+++.+..+.+...+....+|..+... T Consensus 135 -----~t~~~gg~~vP--~~~~~~ii~~l~~~~~i~~~~~~~~~---~~~~~~~~p~~~~~~~a~~v~E~~~~~~~~~~f 204 (435) T protein:vir:14 135 -----LSPGAGGVLVP--ENLSSEVIELLRPKSVVRKLGARTLP---LSNGNITIPRLKGGAIVGYIGADTDIPTTQQQF 204 (435) T ss_pred -----CCcCCCccccc--hhHHHHHHHHHhhhchhhhhcceeee---cCCCceEEEEEeCCcceeeeccCccccccccce Confidence 12233343322 112335555554433333331 1111 112245677777777777888888999999888 Q ss_pred eeeeeeEEEEEEEEeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEEeecCccccceEEEeecCCCcccccccc Q lcl|Aclame:pro 149 NFERRTIVRGEMGIQVGLLEEGRASAMRINSAEVKRQGAAVQLEIMRNAIGFYGWEGKNGNRTFGFLNDPSLLPAIASTT 228 (388) Q Consensus 149 ~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~K~~aAr~a~~~~~n~i~~~G~a~~~~~g~~GllN~P~l~a~~~~~~ 228 (388) ....-..+.++..+.++.+=|.-+ ..+.+|.+.-......++.+.+|+..+.|+.. .+...|++|....+...+. T Consensus 205 ~~i~~~~~k~~~~~~iS~ell~ds-~~~~~l~~~i~~~l~~ai~~~~d~a~l~G~G~--~~~p~Gi~~~~~~~~~~~~-- 279 (435) T protein:vir:14 205 DDLKLTAKKMAALVPIANDLIKYA-GVNPNVDQIVVGDLTAAIGAREDKAFIRDDGT--ANTPKGLRFWALPSNVITA-- 279 (435) T ss_pred eEEEeeeEEEEEeehhhHHHHHhh-ccCHHHHHHHHHHHHHHHHHHHHHHhhccCCC--Cccccceeecccccceecc-- Confidence 888888999999888887433332 22335777888888888888899989999742 2357798887544222111 Q ss_pred CCcccccccCCHHHHHHHHHHHHHHHHHhcCCeeccccccceEEcCHHHHHhhccC-CCcCccHHHHHHH-hCCccEEEE Q lcl|Aclame:pro 229 PGGWVSGGANAFQGIVGDLRLMLITLRVQSEDNIDPEDVDITLVLPMNKVDMLSVV-TDLGISVRDWLKQ-TYPRVRVMS 306 (388) Q Consensus 229 ~~~~t~Wa~kT~~eI~~DI~~~~~~l~~~s~g~v~~~~~p~tL~Lp~~~~~~Ls~~-~~~~~Tvl~~lk~-n~pnl~i~~ 306 (388) -...|.+.+.+|+.+++..+.....+. ....++|.+..+..|... +..|.-++.=+.. ...++.++. T Consensus 280 ------~~~~~~~~~~~~~~~l~~~~~~~~~~~-----~~~~~v~n~~~~~~L~~lkd~~G~~l~~~~~~g~l~G~Pv~~ 348 (435) T protein:vir:14 280 ------SDASTLQKIETDLGKVILALENADANL-----TQPGWIMAPRTFRFLEGLRDGNGNKVYPELANGMLKGYPVGK 348 (435) T ss_pred ------ccccchhhHHHHHHHHHHHhhhccccc-----cCCEEEEcHHHHHHHHHhhccCCceeccCCCCCeeecceeEe Confidence 122566778889999988888664321 234689999999988653 3334433310100 001112222 Q ss_pred ccccccccCCCCccEEEEEEcccccccccccCCCcceEeecchh-hhccC----ceeccCceEEecccceeeeeeecccc Q lcl|Aclame:pro 307 APELQGGNPDDGKDIAYMFLDSVDTAVDGSTDGGDTWAQLVQSK-FVTLG----VEKRVKNYVEAYSNATAGVMLKRPWA 381 (388) Q Consensus 307 ~pel~~a~gtg~~~~~~~~~~~~d~~~~~~~~~~~t~~~~~p~~-~r~~~----v~~~~~~~~~~~~~~t~G~ii~rP~a 381 (388) ...+-.-.++++....++|. +.......... +-.+.. -++. +..+. .-+..-...+.+..|.+| .+.+|.| T Consensus 349 ~~~~p~~~~~~~~~~~i~~g-d~s~~~i~~~~-~~~~~~-~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~-~~~~~~a 424 (435) T protein:vir:14 349 TTQVPINLGETGKESEIYFT-DFGDVFIGEEE-TLEIDY-SKEATYKDADGHMVSAFQRDQTLIRVIAKNDF-GPRHVES 424 (435) T ss_pred eccccccccCCCccceEEEe-ecccEEEEEec-ccEEEE-eccccccccccchhhhhhcChhheeeeeeeCc-eeecccc Confidence 22221100122222223332 11111111000 000000 0000 00000 000000112234445555 6777999 Q ss_pred ceeeccC Q lcl|Aclame:pro 382 VVRLIGL 388 (388) Q Consensus 382 i~~~~GI 388 (388) |+.+.|+ T Consensus 425 ~~~l~~~ 431 (435) T protein:vir:14 425 IAVLAGV 431 (435) T ss_pred eEEEecC Confidence 9999999 No 19 >protein:vir:9574 Length: 300 # NCBI annotation: gp40 # Family: family:all:966 # MgeID: mge:171 # MgeName: SM1 # Cross-refs: genbank:acc:NP_862879;genbank:gi:32469471;genbank:GeneID:1461316 Probab=98.27 E-value=4.7e-08 Score=60.76 Aligned_cols=285 Identities=9% Similarity=-0.059 Sum_probs=149.7 Q ss_pred hccCcccccccccccchHHHHHHHhhcceeeeecccchhhhhhcccccCCCCceeeEEEeeeccccceEecccccCCcee Q lcl|Aclame:pro 65 QAFDSAYVAPTTQASIPTPIQFLQQWLPGFVKVLTSARKIDEILGVKTVGSWEDQEIVQGIVEPAGTAMEYGDLTNIPLS 144 (388) Q Consensus 65 ~amDaa~~~~~t~~~~g~l~~~l~~idp~v~e~l~~~~~~~~i~~v~t~g~w~~~t~~~~v~e~~G~a~~ygd~~diP~~ 144 (388) ||-.. +..+.-+|..+.. +|++.+....-.+++.++..... ....+++.+..+.|...+...++|.. T Consensus 1 ma~~t------~~~G~lip~~~~~----~ii~~l~~~s~i~~l~~~~~~~~---~~~~~p~~~~~~~a~wv~Eg~~~~~s 67 (300) T protein:vir:95 1 MSEAQ------LSKGNLFNPELVT----KVINKVKGHSSIAKLSPQKPIPF---NGQREFVFDFDSDIDIVAENGKKTHG 67 (300) T ss_pred Ccccc------cCCcceechhhHH----HHHHHHHhhhhhhhhcceeeccC---CceEEEEEecCcceEEeeCCcccccc Confidence 33111 1111114444433 55666555555556655443222 24567787777888899999999999 Q ss_pred eeeeeeeeeeEEEEEEEEeecHHHHHHH-HHhCCChHHHHHHHHHHHHHHhhceEEEEeecC--ccccceEEEeecCCCc Q lcl|Aclame:pro 145 SWNVNFERRTIVRGEMGIQVGLLEEGRA-SAMRINSAEVKRQGAAVQLEIMRNAIGFYGWEG--KNGNRTFGFLNDPSLL 221 (388) Q Consensus 145 ~~n~~~~~~~v~~~~~~~~y~~~El~~A-~~~g~~l~~~K~~aAr~a~~~~~n~i~~~G~a~--~~~~g~~GllN~P~l~ 221 (388) +...+...-+.+.++....++.+ |.+. .-...++.+.-....++++...+++-.++|+.. ++.....|..+.++.. T Consensus 68 ~~~f~~v~l~~~k~~~~~~iS~e-ll~~~~d~~~~l~~~i~~~l~~aia~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~ 146 (300) T protein:vir:95 68 GVSLDPVTIVPLKVEYGARVSDE-FLHASEEAKVDMLTDFVEGFSKKLARGLDIMSIHGINPRTKQASTIIGDNCFDKKV 146 (300) T ss_pred cccceeeEeeeEEEEEeehhhHH-HhccCCCCHHHHHHHHHHHHHHHHHHHHHHhhhhcccCCCCCCccccccccccccc Confidence 99999999999999999999884 4433 234577888888899999999999999999631 1223345555554432 Q ss_pred cccccccCCcccccccCCHHHHHHHHHHHHHHHHHhcCCeeccccccceEEcCHHHHHhhcc-CCCcCccHHHHHHHh-- Q lcl|Aclame:pro 222 PAIASTTPGGWVSGGANAFQGIVGDLRLMLITLRVQSEDNIDPEDVDITLVLPMNKVDMLSV-VTDLGISVRDWLKQT-- 298 (388) Q Consensus 222 a~~~~~~~~~~t~Wa~kT~~eI~~DI~~~~~~l~~~s~g~v~~~~~p~tL~Lp~~~~~~Ls~-~~~~~~Tvl~~lk~n-- 298 (388) ..+++ .+....++||.+++..+...- ..+..++|.|..+..|.+ .+..|..++.-.... T Consensus 147 ~~~~~-----------~~~~~~~~~i~~~~~~~~~~~-------~~~~~~vmn~~~~~~L~~lkd~~G~~i~~~~~~~~~ 208 (300) T protein:vir:95 147 TQTVP-----------FKDTNPDESMEDAVGMIDGSE-------RDITGAILDPIFTTALSKMKNAEGGKLYPELAWGGV 208 (300) T ss_pred ceeec-----------ccccchHHHHHHHHHHhhhcC-------CCccEEEECHHHHHHHHHhhccCCCeeccCccccCC Confidence 21111 111122567888877775432 124479999999998865 344455454211111 Q ss_pred ---CCccEEEEccccccccCCCCccEEEEEEccccccc-ccccCCCcceEeecchhhh--ccCc-eeccCceEEecccce Q lcl|Aclame:pro 299 ---YPRVRVMSAPELQGGNPDDGKDIAYMFLDSVDTAV-DGSTDGGDTWAQLVQSKFV--TLGV-EKRVKNYVEAYSNAT 371 (388) Q Consensus 299 ---~pnl~i~~~pel~~a~gtg~~~~~~~~~~~~d~~~-~~~~~~~~t~~~~~p~~~r--~~~v-~~~~~~~~~~~~~~t 371 (388) ..++.++....+... .++....+ |.-+..... .+..+ -.+..+..... ..++ -++.-..-..+..|+ T Consensus 209 ~~~l~G~Pv~~s~~v~~~-~~~~~~~~--~~GDf~~~~~~~~~~---~~~~~v~~~~~~d~~~~~~f~~~~v~~r~~~r~ 282 (300) T protein:vir:95 209 PDAINGLAVDKNRTVSYS-QTDPKNTA--IVGDFETMFKWGYAK---EVPMEIIKYGDPDNSGRDLKGYNQIYIRCEAYI 282 (300) T ss_pred CceecceeeEEecCCCCC-CCCCccEE--EEeeccceEEEEEec---ccEEEEeeccCCCCcchhhhhcCcEEEEEEEee Confidence 111223222222211 12222222 222211000 00000 00000100000 0000 001111223344455 Q ss_pred eeeeeeccccceeeccC Q lcl|Aclame:pro 372 AGVMLKRPWAVVRLIGL 388 (388) Q Consensus 372 ~G~ii~rP~ai~~~~GI 388 (388) |+.|++|-||+++.|. T Consensus 283 -d~~v~~~~a~~~l~~~ 298 (300) T protein:vir:95 283 -GWGIMDAASFARIVKT 298 (300) T ss_pred -cceeecccceEEEecC Confidence 5566679999999999 No 20 >protein:vir:2504 Length: 305 # NCBI annotation: major capsid subunit gp9 # Family: family:all:507 # MgeID: mge:53 # MgeName: TM4 # Cross-refs: genbank:acc:NP_569745;genbank:gi:18496895;genbank:GeneID:932268 Probab=98.20 E-value=2.2e-07 Score=57.13 Aligned_cols=283 Identities=10% Similarity=-0.032 Sum_probs=144.9 Q ss_pred ccccccccch--HHHHHHHhhcceeeeecccchhhhhhcccccCCCCceeeEEEeeeccccceEeccccc-----CCcee Q lcl|Aclame:pro 72 VAPTTQASIP--TPIQFLQQWLPGFVKVLTSARKIDEILGVKTVGSWEDQEIVQGIVEPAGTAMEYGDLT-----NIPLS 144 (388) Q Consensus 72 ~~~~t~~~~g--~l~~~l~~idp~v~e~l~~~~~~~~i~~v~t~g~w~~~t~~~~v~e~~G~a~~ygd~~-----diP~~ 144 (388) +...+..+.| +|..+. ++|++.+......+++..+.+.+. .+..+++....+.+...|... ++|.. T Consensus 1 ma~~t~~~gg~liP~~~~----~~Ii~~~~~~s~l~~l~~~~~~~~---~~~~~p~~~~~~~a~wv~E~~~~~~~~~~~s 73 (305) T protein:vir:25 1 MADISRAEVASLIQEAYS----DTLLAAAKQGSTVLSAFQNVNMGT---KTTHLPVLATLPEADWVGESATDPKGVKPTS 73 (305) T ss_pred CCCccCCccceecCHHHH----HHHHHHHHhhchhhhhcceeeccC---CcEEEEEEeCCcceEEeeccccccccccccc Confidence 1112333344 444443 466666666665666665554332 235677776666777776543 46777 Q ss_pred eeeeeeeeeeEEEEEEEEeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEEeecCccccceEEEeecCCCcccc Q lcl|Aclame:pro 145 SWNVNFERRTIVRGEMGIQVGLLEEGRASAMRINSAEVKRQGAAVQLEIMRNAIGFYGWEGKNGNRTFGFLNDPSLLPAI 224 (388) Q Consensus 145 ~~n~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~K~~aAr~a~~~~~n~i~~~G~a~~~~~g~~GllN~P~l~a~~ 224 (388) +.......-..+.++..+.++.+=++ ....++.+.-.....+++.+.+++-.++|+... .|+.+...++... T Consensus 74 ~~~f~~i~~~~~k~~~~~~is~ell~---ds~~~~~~~i~~~l~~~~a~~~d~a~~~G~g~~-----~~~~~~~~~~~~~ 145 (305) T protein:vir:25 74 KVTWANRTLVAEEIAVIIPVHENVID---DATVAVLTEVAELGGQAIGKKLDQAVIFGTDKP-----ASWVSPALIPAAV 145 (305) T ss_pred ccceeeEEeeeEEEEEeehhhHHHHh---cchHHHHHHHHHHHHHHHHHHHhhhheeccCCC-----CCccccccccccc Confidence 88888888889999999999984332 234678899999999999999999999998532 2333332222221 Q ss_pred ccccCCccccc-ccCCHHHHHHHHHHHHHHHHHhcCCeeccccccceEEcCHHHHHhhcc-CCCcCccHHHHHHHhCCcc Q lcl|Aclame:pro 225 ASTTPGGWVSG-GANAFQGIVGDLRLMLITLRVQSEDNIDPEDVDITLVLPMNKVDMLSV-VTDLGISVRDWLKQTYPRV 302 (388) Q Consensus 225 ~~~~~~~~t~W-a~kT~~eI~~DI~~~~~~l~~~s~g~v~~~~~p~tL~Lp~~~~~~Ls~-~~~~~~Tvl~~lk~n~pnl 302 (388) ... .....+ ...+..++++++..+...+.... ..++.++|.+..+..|.+ .+..|.-++. -...-.+ T Consensus 146 ~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-------~~~~~~v~~~~~~~~l~~lkd~~G~~i~~--~~~l~G~ 214 (305) T protein:vir:25 146 TAG--QAVEVVGGVANESDIVGATNRAAKAVASAG-------WAPDTLLSSLALRYEVANIRDANGNPVFR--DDSFAGF 214 (305) T ss_pred ccc--ccccccccchhhhHHHHHHHHHHHhhhhcc-------cccceeEecHHHHHHHHHhhccCCceeec--CCccccc Confidence 111 111111 22334556777776665554321 234468999999998854 3444544331 0011111 Q ss_pred EEEEccccccccCCCCccEEEEEEcccccccccccCCCcceEeecch-hhhccCc---eeccCceEEecccceeeeeeec Q lcl|Aclame:pro 303 RVMSAPELQGGNPDDGKDIAYMFLDSVDTAVDGSTDGGDTWAQLVQS-KFVTLGV---EKRVKNYVEAYSNATAGVMLKR 378 (388) Q Consensus 303 ~i~~~pel~~a~gtg~~~~~~~~~~~~d~~~~~~~~~~~t~~~~~p~-~~r~~~v---~~~~~~~~~~~~~~t~G~ii~r 378 (388) .+.-.... ..+.++..+++.+ ......+...+ -+++. ..+ .+..... -++.-...+.+..|.+ ..|.+ T Consensus 215 Pv~~~~~~----~~~~~~~~~~~gd-~s~~~i~~~~~-~~i~~-~~~~~~~~~~~~~~~~~~~~~~~R~~~r~~-~~v~~ 286 (305) T protein:vir:25 215 RTFFNRNG----AWDADAAIEVIAD-SSRVKIGVRQD-ITVKF-LDQATLGTGENQINLAERDMVALRLKARFA-YVLGV 286 (305) T ss_pred ceEEcCcc----CCCCCccEEEEEe-cceEEEEEecC-eEEEE-eeeeeeecCCceeeeeecCcEEEEEEEeec-ceeeC Confidence 12211111 1112222223221 11111111100 00000 000 0000000 0111123344555665 45777 Q ss_pred cccceeeccC Q lcl|Aclame:pro 379 PWAVVRLIGL 388 (388) Q Consensus 379 P~ai~~~~GI 388 (388) |.+|+.++|+ T Consensus 287 p~a~v~~~~~ 296 (305) T protein:vir:25 287 SATAQGANKT 296 (305) T ss_pred cccEEEEccc Confidence 9999999999 No 21 >protein:vir:94771 Length: 298 # NCBI annotation: major head protein # Family: family:all:966 # MgeID: mge:1529 # MgeName: phi LC3 # Cross-refs: genbank:acc:NP_996706;genbank:gi:45597421;genbank:GeneID:2769044 Probab=98.19 E-value=7.2e-08 Score=59.74 Aligned_cols=279 Identities=9% Similarity=-0.044 Sum_probs=145.7 Q ss_pred hccCcccccccccccchHHHHHHHhhcceeeeecccchhhhhhcccccCCCCceeeEEEeeeccccceEecccccCCcee Q lcl|Aclame:pro 65 QAFDSAYVAPTTQASIPTPIQFLQQWLPGFVKVLTSARKIDEILGVKTVGSWEDQEIVQGIVEPAGTAMEYGDLTNIPLS 144 (388) Q Consensus 65 ~amDaa~~~~~t~~~~g~l~~~l~~idp~v~e~l~~~~~~~~i~~v~t~g~w~~~t~~~~v~e~~G~a~~ygd~~diP~~ 144 (388) ||.+. +.-+|-.+. .+|++.+......+++.++.+.+. ....+++....+.+...+...++|.. T Consensus 1 ma~~g---------G~lip~~~~----~~ii~~~~~~s~i~~~~~~~~~~~---~~~~~p~~~~~~~a~~v~Eg~~~~~~ 64 (298) T protein:vir:94 1 MVLNK---------GTLFDPELV----TDLISKVAGKSSIARLSAQKPIPF---NGEKVFTFTMDSEIDVVAESGKKTHG 64 (298) T ss_pred Ceecc---------ccccChhHH----HHHHHHHHhhchhhhhcceeeccC---CceEEEEEecCcceEEeeCCcccccc Confidence 33222 111333333 355666655555666665544333 23567777777788889999999999 Q ss_pred eeeeeeeeeeEEEEEEEEeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEEeecCcc--ccceEEEeecCCCcc Q lcl|Aclame:pro 145 SWNVNFERRTIVRGEMGIQVGLLEEGRASAMRINSAEVKRQGAAVQLEIMRNAIGFYGWEGKN--GNRTFGFLNDPSLLP 222 (388) Q Consensus 145 ~~n~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~K~~aAr~a~~~~~n~i~~~G~a~~~--~~g~~GllN~P~l~a 222 (388) +...+......+.++....++.+=|+...-...+|.+.-+...++++.+.+++-.++|..... .....|..+..+... T Consensus 65 ~~~f~~v~l~~~k~~~~~~iS~ell~~~~~~~~~l~~~i~~~la~ai~~~~d~~~l~G~~~~~g~~~~~~~~~~~~~~~~ 144 (298) T protein:vir:94 65 GVTLAPQTMVPIKVEYGARISDEFMYASDEEKINILQAFNDGFAKKVARGIDLMAFHGVNPRLGTASAVIGTNHFDSKVT 144 (298) T ss_pred ccceeEEEEeeeEEEEeeehhHHHhccCCccHHHHHHHHHHHHHHHHHHHHHHHhhcccccCCCcccccccccccccccc Confidence 999999999999999888888753333223445788888888999999999999999954221 111222211111100 Q ss_pred ccccccCCcccccccCCHHHHHHHHHHHHHHHHHhcCCeeccccccceEEcCHHHHHhhcc-CCCcCccHHHHHHHh--- Q lcl|Aclame:pro 223 AIASTTPGGWVSGGANAFQGIVGDLRLMLITLRVQSEDNIDPEDVDITLVLPMNKVDMLSV-VTDLGISVRDWLKQT--- 298 (388) Q Consensus 223 ~~~~~~~~~~t~Wa~kT~~eI~~DI~~~~~~l~~~s~g~v~~~~~p~tL~Lp~~~~~~Ls~-~~~~~~Tvl~~lk~n--- 298 (388) .. . -.......+++||.+++..+...-. .+..++|.+..+..|.+ .+..|.-++.=...+ T Consensus 145 ~~--------~-~~~~~~~~~~~~i~~~~~~~~~~~~-------~~~~~vmn~~~~~~l~~lkd~~G~~l~~~~~~~~~~ 208 (298) T protein:vir:94 145 QK--------V-EAPRGIADPNGAIENAVELLTGVDA-------DVTGIAINPSFRSALAKQKDLQGNALFPELKWGATP 208 (298) T ss_pred cc--------c-ccccccccHHHHHHHHHHhhhhcCC-------CccEEEEcHHHHHHHHHhhccCCCeeecCcccCCCC Confidence 00 0 0112234567899999888775421 23469999999998854 233343332111111 Q ss_pred --CCccEEEEccccccccCCCCccEEEEEEccccccc-ccccCCCcceEeecch---------hhhccCceeccCceEEe Q lcl|Aclame:pro 299 --YPRVRVMSAPELQGGNPDDGKDIAYMFLDSVDTAV-DGSTDGGDTWAQLVQS---------KFVTLGVEKRVKNYVEA 366 (388) Q Consensus 299 --~pnl~i~~~pel~~a~gtg~~~~~~~~~~~~d~~~-~~~~~~~~t~~~~~p~---------~~r~~~v~~~~~~~~~~ 366 (388) ...+.+.....+... .++..+. +|.-+..... .+.. ..-.++. .+. .|+... .... T Consensus 209 ~tl~G~PV~~~~~v~~~-~~~~~~~--~~~Gdfs~~~~~~~~-~~~~~~~-~~~~~~d~~~~~~f~~~~-------v~~r 276 (298) T protein:vir:94 209 DTINGLPVDVNKTVSDM-SLTQRDR--AIIGDFANGFKWGYA-KEVPLEV-IQYGDPDNSGLDLKGYNQ-------VYIR 276 (298) T ss_pred ceecceeeEEecccccc-cCCCccE--EEEeeccceEEEEEe-cCceEEE-eecCCCcCcchhhhhcCc-------EEEE Confidence 111222222222211 1111222 2222211100 0000 0000000 000 111111 1223 Q ss_pred cccceeeeeeeccccceeeccC Q lcl|Aclame:pro 367 YSNATAGVMLKRPWAVVRLIGL 388 (388) Q Consensus 367 ~~~~t~G~ii~rP~ai~~~~GI 388 (388) +..|. |+.+.+|-||+++.|. T Consensus 277 ~~~r~-~~~~~~~~a~~~l~~~ 297 (298) T protein:vir:94 277 AELFL-GWGILDATKFARVTEA 297 (298) T ss_pred EEEEe-ccEeecccceEEEEec Confidence 33344 5666779999999999 No 22 >protein:vir:9759 Length: 303 # NCBI annotation: putative structural protein # Family: family:all:966 # MgeID: mge:175 # MgeName: 315.3 # Cross-refs: genbank:acc:NP_795521;genbank:gi:28876283;genbank:GeneID:1257824 Probab=98.17 E-value=1.4e-07 Score=58.16 Aligned_cols=284 Identities=6% Similarity=-0.054 Sum_probs=144.1 Q ss_pred hccCcccccccccccchHHHHHHHhhcceeeeecccchhhhhhcccccCCCCceeeEEEeeeccccceEecccccCCcee Q lcl|Aclame:pro 65 QAFDSAYVAPTTQASIPTPIQFLQQWLPGFVKVLTSARKIDEILGVKTVGSWEDQEIVQGIVEPAGTAMEYGDLTNIPLS 144 (388) Q Consensus 65 ~amDaa~~~~~t~~~~g~l~~~l~~idp~v~e~l~~~~~~~~i~~v~t~g~w~~~t~~~~v~e~~G~a~~ygd~~diP~~ 144 (388) |+-+ +.++.-+|..+.+ +|++.+....-.+.+.++...+. .+..+++....+.+...+....+|.. T Consensus 1 m~t~-------t~gg~liP~~~~~----~ii~~l~~~s~i~~l~~~~~~~~---~~~~ip~~~~~~~a~wv~E~~~~~~s 66 (303) T protein:vir:97 1 MGTE-------TSKASLFDKHLVS----DLINKVKGHSSLAKLSSQKPIPF---NGSKEFTFTLDSDIDVVAENGKKTHG 66 (303) T ss_pred Cccc-------CCCCeEcchhHHH----HHHHHHHhhchhhhhcceeecCC---CceEEEEEecCcceEEeecCcccccc Confidence 2211 2223335555444 55565555555566654443222 34567777777888899999999999 Q ss_pred eeeeeeeeeeEEEEEEEEeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEEeecCccccceEEEeecCCCcccc Q lcl|Aclame:pro 145 SWNVNFERRTIVRGEMGIQVGLLEEGRASAMRINSAEVKRQGAAVQLEIMRNAIGFYGWEGKNGNRTFGFLNDPSLLPAI 224 (388) Q Consensus 145 ~~n~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~K~~aAr~a~~~~~n~i~~~G~a~~~~~g~~GllN~P~l~a~~ 224 (388) +...+....+.+.++..+.++.+=++.......++...-.....+++.+.+|+-.+.|+... .|.-+........... T Consensus 67 ~~~f~~v~l~~~kl~~~~~iS~ell~~~~d~~~~l~~~i~~~la~a~~~~ld~a~l~G~~~~--~g~~~~~~~~~~~~~~ 144 (303) T protein:vir:97 67 GLSLEPVTIVPIKVEYGARLSDEFLYATEEEKIDILKAFNEGFAKKLARGIDLMAMHGINPR--TKKASDVIGTNHFDSK 144 (303) T ss_pred ccceeeEEeeeEEEEEeehhhHHHhhcCccchHHHHHHHHHHHHHHHHHHHHhhhhcccccC--Cccccccccccccccc Confidence 99999999999999999999884333323445678888899999999999999999996421 1211111111110000 Q ss_pred ccccCCcccccccCCHHHHHHHHHHHHHHHHHhcCCeeccccccceEEcCHHHHHhhcc-CCCcCccHH-HHHHHh---- Q lcl|Aclame:pro 225 ASTTPGGWVSGGANAFQGIVGDLRLMLITLRVQSEDNIDPEDVDITLVLPMNKVDMLSV-VTDLGISVR-DWLKQT---- 298 (388) Q Consensus 225 ~~~~~~~~t~Wa~kT~~eI~~DI~~~~~~l~~~s~g~v~~~~~p~tL~Lp~~~~~~Ls~-~~~~~~Tvl-~~lk~n---- 298 (388) + +.. =...+.+..++||.+++..+...- ..+..++|.|..+..|.+ .+..|.-++ .=+... T Consensus 145 ~-~~~-----~~~~~~~~~~~~i~~~~~~~~~~~-------~~~~~~vmn~~~~~~L~~lkd~~g~~~~~~~~~~~~~~~ 211 (303) T protein:vir:97 145 V-TQV-----VKFTESEDADANIEAAVNLIQGAE-------GVVTGLAMDTEFSTALAKVTNGEMGPKMYPELAWGANPD 211 (303) T ss_pred c-ccc-----cccccccchHHHHHHHHHHHhhcC-------CCccEEEEcHHHHHHHHHhhccCCCeEEecCccCCCCCc Confidence 0 000 011122335788998988776432 234569999998888853 233332221 000101 Q ss_pred -CCccEEEEccccccccCCCCccEEEEEEccccccc-ccccCCCcceEeecch---------hhhccCceeccCceEEec Q lcl|Aclame:pro 299 -YPRVRVMSAPELQGGNPDDGKDIAYMFLDSVDTAV-DGSTDGGDTWAQLVQS---------KFVTLGVEKRVKNYVEAY 367 (388) Q Consensus 299 -~pnl~i~~~pel~~a~gtg~~~~~~~~~~~~d~~~-~~~~~~~~t~~~~~p~---------~~r~~~v~~~~~~~~~~~ 367 (388) ...+.++....+.+..+.+.... .+|.-+..... .+. ...-+++. .+. -|+..- .-+.+ T Consensus 212 ~l~G~Pv~~s~~v~~~~~~~~~~~-~~~~Gdf~~~~~~~~-~~~~~~~~-~~~~~~d~~~~~~~~~n~-------~~~r~ 281 (303) T protein:vir:97 212 SINGLKSSVNTTVGAGADEAESKD-LVIIGDFESMFKWGY-AKQIPMEI-IKYGDPDNSGKDLKGYNQ-------IYLRA 281 (303) T ss_pred eecceeeEEecccCCccccCCCcc-EEEEeeccccEEEEE-ecCcEEEE-eeccCCCCcchhhhhcCc-------EEEEE Confidence 11122222112211111111111 22222211000 000 00011111 000 011111 12233 Q ss_pred ccceeeeeeeccccceeeccC Q lcl|Aclame:pro 368 SNATAGVMLKRPWAVVRLIGL 388 (388) Q Consensus 368 ~~~t~G~ii~rP~ai~~~~GI 388 (388) +.|. |..|++|-||+++... T Consensus 282 ~~r~-~~~v~~p~af~~l~~~ 301 (303) T protein:vir:97 282 EAYI-GWGILDAKSFARVTKG 301 (303) T ss_pred EEEe-ccEeecccceEEeeCC Confidence 4444 4556669999999999 No 23 >protein:vir:104085 Length: 320 # NCBI annotation: gp17 # Family: family:all:507 # MgeID: mge:1656 # MgeName: Che12 # Cross-refs: genbank:acc:YP_655596;genbank:gi:109392467;genbank:GeneID:4156953 Probab=98.15 E-value=9.1e-08 Score=59.19 Aligned_cols=297 Identities=10% Similarity=-0.056 Sum_probs=150.0 Q ss_pred ecccchhhcchhhhhhhhhhhhccCcccccc---cccccch-HHHHHHHhhcceeeeecccchhhhhhcccccCCCCcee Q lcl|Aclame:pro 44 VFDHATVKRQIELLHEGGVATQAFDSAYVAP---TTQASIP-TPIQFLQQWLPGFVKVLTSARKIDEILGVKTVGSWEDQ 119 (388) Q Consensus 44 ~~~~~~~~~~~~~~~~~~~~~~amDaa~~~~---~t~~~~g-~l~~~l~~idp~v~e~l~~~~~~~~i~~v~t~g~w~~~ 119 (388) +. +..+||+..... .+..+.+ +|.... .++++.+......+++.++...+. . T Consensus 1 ~~-----------------~~~~~~~~~~~~~~t~~~~~~~~ip~~~~----~~ii~~~~~~s~l~~~~~~~~~~~---~ 56 (320) T protein:vir:10 1 MA-----------------AGTAFQVDHAQIAQTGDTMFKGYLEPEQA----KDYFAEAEKTSIVQQFAQKVPMGT---T 56 (320) T ss_pred CC-----------------CCccCCHHHHHhhccccccccccccHHHH----HHHHHHHHhccchhhhcceeeccC---C Confidence 00 112222111100 1112223 343333 466677666666677766554332 3 Q ss_pred eEEEeeeccccceEecccccCCceeeeeeeeeeeeEEEEEEEEeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEE Q lcl|Aclame:pro 120 EIVQGIVEPAGTAMEYGDLTNIPLSSWNVNFERRTIVRGEMGIQVGLLEEGRASAMRINSAEVKRQGAAVQLEIMRNAIG 199 (388) Q Consensus 120 t~~~~v~e~~G~a~~ygd~~diP~~~~n~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~K~~aAr~a~~~~~n~i~ 199 (388) +..+++.+..+.+...+...++|..+...++...+.+.++..+.++.+=++.+ ..++.+.-....++++.+.+|+-. T Consensus 57 ~~~~p~~~~~~~a~~v~E~~~~~~~~~~f~~v~~~~~k~~~~~~is~ell~ds---~~~l~~~i~~~l~~a~a~~~d~a~ 133 (320) T protein:vir:10 57 GQKIPHWIGDVSAQWIGEGDMKPITKGNMTSQNIAPHKIATIFVASAETVRAN---PANYLGTMRTKVATAFAMAFDSAA 133 (320) T ss_pred ceEEEEEeCCcceEEecCCccccccccceeEEEEeeEEEEEeehhhHHHHhcC---hHHHHHHHHHHHHHHHHHHHHHHh Confidence 45678877778888899999999999999999999999999999998655432 367888888899999999999999 Q ss_pred EEeecCccccceEEEeecCCCccccccccCCcccccccCCHHHHHHHHHHHHHHHHHhcCCeeccccccceEEcCHHHHH Q lcl|Aclame:pro 200 FYGWEGKNGNRTFGFLNDPSLLPAIASTTPGGWVSGGANAFQGIVGDLRLMLITLRVQSEDNIDPEDVDITLVLPMNKVD 279 (388) Q Consensus 200 ~~G~a~~~~~g~~GllN~P~l~a~~~~~~~~~~t~Wa~kT~~eI~~DI~~~~~~l~~~s~g~v~~~~~p~tL~Lp~~~~~ 279 (388) +.|+......+..|.++.-++.. .+..++++-+.. -+++..++..+...- ..+..++|.+..+. T Consensus 134 l~G~g~~~~~~~~~~~~~~~~~~-------~~~~~~~~~~~~--~~~~~~~~~~~~~~~-------~~~~~~v~n~~~~~ 197 (320) T protein:vir:10 134 LNGTDSPFPTYLAQTTKSVSLAD-------PGGATASDLTAY--DAVAVNGLSLLVNAK-------KKWTHTLLDDIVEP 197 (320) T ss_pred hcccCCCCCccccccccccccee-------cccccccccccH--HHHHHHHHhhhhccc-------CCCcEEEEcHHHHH Confidence 99976432223333333222111 111112221111 122333333333221 23458999999999 Q ss_pred hhccC-CCcCccHHHH-H----HHhCCccEEEEccccccccCCCCccEEEEEEcccccccccccCCCcceEeecchhh-h Q lcl|Aclame:pro 280 MLSVV-TDLGISVRDW-L----KQTYPRVRVMSAPELQGGNPDDGKDIAYMFLDSVDTAVDGSTDGGDTWAQLVQSKF-V 352 (388) Q Consensus 280 ~Ls~~-~~~~~Tvl~~-l----k~n~pnl~i~~~pel~~a~gtg~~~~~~~~~~~~d~~~~~~~~~~~t~~~~~p~~~-r 352 (388) .|.+. +..|..++.- + ..+++..++...|-+...+- ..++...+|.+ ..........+ -.++ ...+.. . T Consensus 198 ~L~~lkd~~G~~l~~~~~~~~~~~~~~~~~i~g~pv~~~~~~-~~~~~~~~~gd-~~~~~~~~~~~-~~i~-~~~~~~~~ 273 (320) T protein:vir:10 198 ILNGAKDKNGRPLFIESTYTDENSPFRAGRIVSRPTILSDHV-ADGTTVGYMGD-FRNVIWGQVGG-LSFD-VTDQATLN 273 (320) T ss_pred HHHHhhccCCceeeccccccCccccccCceeeeeeeEecCCC-CCCceEEEEee-cceEEEEEecC-eEEE-Eeecceee Confidence 99642 3334333221 1 11233345665554432211 22232323221 11011110000 0000 000000 0 Q ss_pred ccCce-------eccCceEEecccceeeeeeeccccceeeccC Q lcl|Aclame:pro 353 TLGVE-------KRVKNYVEAYSNATAGVMLKRPWAVVRLIGL 388 (388) Q Consensus 353 ~~~v~-------~~~~~~~~~~~~~t~G~ii~rP~ai~~~~GI 388 (388) ..... ++.-...+.+..|. |+.+.+|-||+++.|+ T Consensus 274 ~~~~~~~~~~~~f~~~~~~~r~~~~~-d~~v~~~~a~~~l~~~ 315 (320) T protein:vir:10 274 LGTPTEPNFVSLWQHNLVAVRVEAEY-AFHNNDKDAFVKLTNV 315 (320) T ss_pred eccccccccchhhhcCcEEEEEEEee-ccEEecccceEEEEec Confidence 00000 00001122333444 6666889999999999 No 24 >protein:vir:80376 Length: 435 # NCBI annotation: gp6, major capsid head protein # Family: family:all:21 # MgeID: mge:1881 # MgeName: phi644-2 # Cross-refs: genbank:acc:YP_001111085;genbank:gi:134288639;genbank:GeneID:4960624 Probab=98.14 E-value=5e-07 Score=55.13 Aligned_cols=352 Identities=9% Similarity=0.028 Sum_probs=153.4 Q ss_pred CCCcceeeeec-CccccchhhhhhcccccccccCCHHHHhhcc--eecccchhhcchhh-hhh--hhhhhhccCcccccc Q lcl|Aclame:pro 1 MKQLSKVHQSL-AGRSVRAFDMANGKADYRLTDMAVRELKKFG--LVFDHATVKRQIEL-LHE--GGVATQAFDSAYVAP 74 (388) Q Consensus 1 ~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~g--~~~~~~~~~~~~~~-~~~--~~~~~~amDaa~~~~ 74 (388) +.+.......- .....++..-... .....-..+.++- +............. ... ..-...++. . T Consensus 65 ~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~ 134 (435) T protein:vir:80 65 AAVPVDPNPAAVTASAAAPVYAQPK-----APEVKGAKMARMVRALAAARGDAQLASKLAIERGFGEEVAMSLN-----T 134 (435) T ss_pred hcccccchhhhhccccccccccccc-----hhhhhHHHHHHHHHHHHhccchhHHHHHHHHhhhhhhhhhhhhc-----c Confidence 11000000000 0000000000000 0000000011100 00000000000000 000 000001111 1 Q ss_pred cccccch--HHHHHHHhhcceeeeecccchhhhhhcccccCCCCceeeEEEeeeccccceEecccccCCceeeeeeeeee Q lcl|Aclame:pro 75 TTQASIP--TPIQFLQQWLPGFVKVLTSARKIDEILGVKTVGSWEDQEIVQGIVEPAGTAMEYGDLTNIPLSSWNVNFER 152 (388) Q Consensus 75 ~t~~~~g--~l~~~l~~idp~v~e~l~~~~~~~~i~~v~t~g~w~~~t~~~~v~e~~G~a~~ygd~~diP~~~~n~~~~~ 152 (388) .+....| +|.... .+|++.+.+..-.+.+-. +.-+.....+.+++.+..+.+...+....+|..+...+... T Consensus 135 ~~~~~gg~lvP~~~~----~~ii~~l~~~~~i~~~~~--~~v~~~~~~~~~p~~~~~~~a~~v~E~~~~~~~~~~f~~i~ 208 (435) T protein:vir:80 135 LSPGAGGVLVPENLS----SEVIELLRPKSVVRKLGA--RTLPLSNGNITIPRLKGGAIVGYIGADTDIPTTQQQFDDLK 208 (435) T ss_pred cCCCCCccccchhHH----HHHHHHHhhhchhhhccc--eeeecCCCceEEEEEeCCcceeeeccCccccccccceeeEE Confidence 1222333 333322 345554433333333311 01111122456777777777778888889999999988999 Q ss_pred eeEEEEEEEEeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEEeecCccccceEEEeecCCCccccccccCCcc Q lcl|Aclame:pro 153 RTIVRGEMGIQVGLLEEGRASAMRINSAEVKRQGAAVQLEIMRNAIGFYGWEGKNGNRTFGFLNDPSLLPAIASTTPGGW 232 (388) Q Consensus 153 ~~v~~~~~~~~y~~~El~~A~~~g~~l~~~K~~aAr~a~~~~~n~i~~~G~a~~~~~g~~GllN~P~l~a~~~~~~~~~~ 232 (388) -..+.++..+.++.+=|+. +..+-++.+.-......++...+++-.++|+.. .+...|++|+..+....++ T Consensus 209 ~~~~k~~~~~~is~ell~d-s~~~~~l~~~i~~~l~~a~~~~~d~a~l~G~G~--~~~p~Gi~~~~~~~~~~~~------ 279 (435) T protein:vir:80 209 LTAKKMAALVPIANDLIKY-AGVNPNVDQIVVGDLTAAIGAREDKAFIRDDGT--ANTPKGLRFWALPGNVITA------ 279 (435) T ss_pred EeeEEEEEeehhhHHHHHh-hcccHHHHHHHHHHHHHHHHHHHHHHhhccCCC--CCcccceeecccccceeec------ Confidence 9999999998888753333 233446777788888888888888888999642 2356799998754322211 Q ss_pred cccccCCHHHHHHHHHHHHHHHHHhcCCeeccccccceEEcCHHHHHhhccC-CCcCccHHHHHHHh-CCccEEEEcccc Q lcl|Aclame:pro 233 VSGGANAFQGIVGDLRLMLITLRVQSEDNIDPEDVDITLVLPMNKVDMLSVV-TDLGISVRDWLKQT-YPRVRVMSAPEL 310 (388) Q Consensus 233 t~Wa~kT~~eI~~DI~~~~~~l~~~s~g~v~~~~~p~tL~Lp~~~~~~Ls~~-~~~~~Tvl~~lk~n-~pnl~i~~~pel 310 (388) -...|.+.+..|+.+++..+.....+ ..+..++|.+..+..|... +..|.-++.-+..+ .-++.++....+ T Consensus 280 --~~~~~~~~~~~d~~~~~~~~~~~~~~-----~~~~~~vmn~~~~~~L~~lkd~~G~~l~~~~~~~~l~G~pv~~~~~~ 352 (435) T protein:vir:80 280 --SDGSTLQKIETDLGKAILALENADAN-----LTQPGWIMAPRTFRFLEGLRDGNGNKVYPELANGMLKGYPVGKTTQV 352 (435) T ss_pred --ccccchhhHHHHHHHHHHHhhccccc-----cccCEEEEcHHHHHHHHhhhccCCceeccCCCCCeEeeeeeEEeccc Confidence 13366677788999888888765432 1235789999999998653 44454443111110 111122222222 Q ss_pred ccccCCCCccEEEEEEcccccccccccCCCcceEeecchh-hhccC---c-eeccCceEEecccceeeeeeeccccceee Q lcl|Aclame:pro 311 QGGNPDDGKDIAYMFLDSVDTAVDGSTDGGDTWAQLVQSK-FVTLG---V-EKRVKNYVEAYSNATAGVMLKRPWAVVRL 385 (388) Q Consensus 311 ~~a~gtg~~~~~~~~~~~~d~~~~~~~~~~~t~~~~~p~~-~r~~~---v-~~~~~~~~~~~~~~t~G~ii~rP~ai~~~ 385 (388) ..-.++++....++|. +......+-. +.-.+.. .++- +.... + -+..-...+.+..|. ++.+++|.||+.+ T Consensus 353 p~~~~~~~~~~~i~~g-d~s~~~i~~~-~~~~i~~-~~~~~~~~~~~~~~~~f~~n~~~~r~~~r~-d~~~~~~~a~~~l 428 (435) T protein:vir:80 353 PINLGEAGKESEIYFT-DFGDVFIGEE-ETLEIDY-SKEATYKDADGHMVSAFQRDQTLIRVIAKN-DFGPRHVESIAVL 428 (435) T ss_pred cccccCCCCcceEEEE-EcccEEEEee-cceEEEE-eccccccccccchhhhhhcCcceeeeeeee-CcEeecccceEEE Confidence 1101122222223332 2111111000 0000000 0000 00000 0 000001223344444 5566779999999 Q ss_pred ccC Q lcl|Aclame:pro 386 IGL 388 (388) Q Consensus 386 ~GI 388 (388) .|+ T Consensus 429 ~~~ 431 (435) T protein:vir:80 429 SGV 431 (435) T ss_pred ecc Confidence 999 No 25 >protein:vir:5739 Length: 366 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:122 # MgeName: PY54 # Cross-refs: genbank:acc:NP_892050;genbank:gi:33770513;interpro:IPR006444;uniprot:Q7Y410;genbank:GeneID:1732928 Probab=98.14 E-value=1.5e-07 Score=57.99 Aligned_cols=345 Identities=12% Similarity=0.055 Sum_probs=153.4 Q ss_pred CC----CcceeeeecCccccchhhhhhcccccccccCCHHHH-----hhcceecccchhhcchhhhhhhhhhhhccCccc Q lcl|Aclame:pro 1 MK----QLSKVHQSLAGRSVRAFDMANGKADYRLTDMAVREL-----KKFGLVFDHATVKRQIELLHEGGVATQAFDSAY 71 (388) Q Consensus 1 ~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l-----~~~g~~~~~~~~~~~~~~~~~~~~~~~amDaa~ 71 (388) |- .+.+.|++.++.-+++-... .....+.++ +..|- +..+ .+...+...+...+ +++- T Consensus 1 ~a~~~a~~~~~~~~~~~~~~~~~~~~-------~kg~~~~~~~~a~a~~~g~-~~~a-~~~a~~~~~~~~~~-~a~~--- 67 (366) T protein:vir:57 1 MAAAVAVPVKAHSVAPGIIIKEELQQ-------YKGAGMTRMVMSIAAGKGN-LADA-AKFAATELGDTGLS-MAIS--- 67 (366) T ss_pred Cccccccccccccccccccccccccc-------ccchhHHHHHHHHHhcccc-hhHH-HHHHHHhhcchhhh-hhcc--- Confidence 11 11133443333222211100 001111111 11110 0000 00000000000000 1111 Q ss_pred ccccccccchH--HHHHHHhhcceeeeecccchhhhhh-cccccCCCCceeeEEEeeeccccceEecccccCCceeeeee Q lcl|Aclame:pro 72 VAPTTQASIPT--PIQFLQQWLPGFVKVLTSARKIDEI-LGVKTVGSWEDQEIVQGIVEPAGTAMEYGDLTNIPLSSWNV 148 (388) Q Consensus 72 ~~~~t~~~~g~--l~~~l~~idp~v~e~l~~~~~~~~i-~~v~t~g~w~~~t~~~~v~e~~G~a~~ygd~~diP~~~~n~ 148 (388) .+..+.|+ |..+ ..+|++.+....-.+.+ ..+...+ ...+.+++....+.+...+...++|..+... T Consensus 68 ---~~~~~Gg~lvP~~~----~~~ii~~l~~~s~l~~lg~~~v~~~---~g~~~~p~~t~~~~a~wv~E~~~~~~s~~~f 137 (366) T protein:vir:57 68 ---TAAGSGGALIPQNM----QNEVIELLRDRTVVRILGARSIPLP---NGNLSMPRLSGGATAGYVGEGKDVVATGATF 137 (366) T ss_pred ---ccccCCccccchhH----HHHHHHHHhhhcchhhhceeeeecC---CCceEEEEEeCCcceeeeccCccccccccce Confidence 12223443 4332 23455555433333333 1111111 1235677776666777888899999999998 Q ss_pred eeeeeeEEEEEEEEeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEEeecCccccceEEEeecCCCcccccccc Q lcl|Aclame:pro 149 NFERRTIVRGEMGIQVGLLEEGRASAMRINSAEVKRQGAAVQLEIMRNAIGFYGWEGKNGNRTFGFLNDPSLLPAIASTT 228 (388) Q Consensus 149 ~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~K~~aAr~a~~~~~n~i~~~G~a~~~~~g~~GllN~P~l~a~~~~~~ 228 (388) ....-+.+.++....++.+=|+. ...++.+.-+.....++.+.+|+-.++|+.. ...-.|++|.+.......... T Consensus 138 ~~i~~~~~k~~~~~~iS~ell~d---s~~~~~~~i~~~l~~a~~~~~d~a~l~G~G~--~~~p~Gi~~~~~~~~~~~~~~ 212 (366) T protein:vir:57 138 DDVKLSAKTMIALVPVSNQLIGR---AGFNVEQLLLGDILSAIATREDKAFLRDDGT--GDTPKGMKAVATAANRLVAWT 212 (366) T ss_pred eEEEEeeEEEEEeehhhHHHHhh---hhHHHHHHHHHHHHHHHHHHHHHHhhccCCC--Cccccceeeccccccceeecc Confidence 89999999999988888743332 3457788888888899999999999999752 235789999876532211111 Q ss_pred CCcccccccCCHHHHHHHHHHHHHHHHHhcCCeeccccccceEEcCHHHHHhhcc-CCCcCccHHHHHHHh-CCccEEEE Q lcl|Aclame:pro 229 PGGWVSGGANAFQGIVGDLRLMLITLRVQSEDNIDPEDVDITLVLPMNKVDMLSV-VTDLGISVRDWLKQT-YPRVRVMS 306 (388) Q Consensus 229 ~~~~t~Wa~kT~~eI~~DI~~~~~~l~~~s~g~v~~~~~p~tL~Lp~~~~~~Ls~-~~~~~~Tvl~~lk~n-~pnl~i~~ 306 (388) + ...+...+..++..+.......... ......+|.+..+..|.+ .+..|..++.-+... .-+..++. T Consensus 213 -~-----t~~~~~~~~~~~~~~~~~~~~~~~~-----~~~a~~vmn~~~~~~L~~lkd~~G~~l~~~~~~g~l~G~Pvv~ 281 (366) T protein:vir:57 213 -G-----TAINLTTIDEYLDSLILKHMDSNSN-----MIRCGWGLSNRTYMTLFGLRDGNGNKVYPEMSQGILKGYPIQR 281 (366) T ss_pred -c-----cccchhhHHHHHHHHHHhhhccccc-----cccCEEEecHHHHHHHHhhhccCCceeccCCCCCeecceeeEE Confidence 1 1233344444444333332222211 112367899999988865 344455554211111 11122333 Q ss_pred ccccccccCCCCccEEEEEEcccccccccccCCCcceEeecchh-hhc-cCc---eeccCceEEecccceeeeeeecccc Q lcl|Aclame:pro 307 APELQGGNPDDGKDIAYMFLDSVDTAVDGSTDGGDTWAQLVQSK-FVT-LGV---EKRVKNYVEAYSNATAGVMLKRPWA 381 (388) Q Consensus 307 ~pel~~a~gtg~~~~~~~~~~~~d~~~~~~~~~~~t~~~~~p~~-~r~-~~v---~~~~~~~~~~~~~~t~G~ii~rP~a 381 (388) ...+..-.++++....++| -+.......... .-.+.. .++. +.. .+. -++.-...+.+..++ ++.+++|.+ T Consensus 282 s~~ip~~~~~~~~~~~i~~-gdfs~~~i~~~~-~i~i~~-~~ea~~~~~~g~~~~~f~~~~~~iR~~~~~-d~~v~~~~a 357 (366) T protein:vir:57 282 TSAIPANLGDDGNESEIYF-CDFNDVVIGEDG-MMKVDF-STEATYKDADGQLVSAFARNQSLIRVVTEH-DIGFRHPEG 357 (366) T ss_pred ccccccccccCCCccEEEE-EecceEEEEEec-ceEEEE-eeccccccccccchhhhhcCceeEEeeeee-CcEeecccc Confidence 3222211122222233333 222111111100 000100 0000 000 000 011111233344444 455688999 Q ss_pred ceeeccC Q lcl|Aclame:pro 382 VVRLIGL 388 (388) Q Consensus 382 i~~~~GI 388 (388) |+.++|| T Consensus 358 ~~~lt~~ 364 (366) T protein:vir:57 358 LVLGTGV 364 (366) T ss_pred EEEEecc Confidence 9999999 No 26 >protein:vir:8187 Length: 311 # NCBI annotation: gp7 # Family: family:all:966 # MgeID: mge:153 # MgeName: Che9d # Cross-refs: genbank:acc:NP_817980;genbank:gi:29566414;genbank:GeneID:2700968 Probab=98.13 E-value=1.5e-07 Score=58.07 Aligned_cols=290 Identities=9% Similarity=-0.092 Sum_probs=147.7 Q ss_pred hhccCcccccccccccchHHHHHHHhhcceeeeecccchhhhhhcccccCCCCceeeEEEeeeccccceEecccccCCce Q lcl|Aclame:pro 64 TQAFDSAYVAPTTQASIPTPIQFLQQWLPGFVKVLTSARKIDEILGVKTVGSWEDQEIVQGIVEPAGTAMEYGDLTNIPL 143 (388) Q Consensus 64 ~~amDaa~~~~~t~~~~g~l~~~l~~idp~v~e~l~~~~~~~~i~~v~t~g~w~~~t~~~~v~e~~G~a~~ygd~~diP~ 143 (388) |.+.. ..+.-+|..+.+ +|++.+.+....+++.++.+... ....+++.+..+.+...+....+|. T Consensus 1 mat~~--------~gg~lvP~~~~~----~ii~~~~~~s~i~~~~~~i~~~~---~~~~~p~~~~~~~a~wv~Eg~~~~~ 65 (311) T protein:vir:81 1 MVALA--------TGTFQLPKHLVP----GVWQKAQGQSVLARLSMAEPQEF---GEQQYMTLTAPPRGEVVGEGAQKSE 65 (311) T ss_pred Cceec--------CCceEcchhHHH----HHHHHHHhcchhhhhcceeecCC---CceEEEEEeCCceeEEeecCccccc Confidence 22211 122224555544 55555555555566655443222 2457788888888888999999999 Q ss_pred eeeeeeeeeeeEEEEEEEEeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEEeecCccccceEEEeecCCCccc Q lcl|Aclame:pro 144 SSWNVNFERRTIVRGEMGIQVGLLEEGRASAMRINSAEVKRQGAAVQLEIMRNAIGFYGWEGKNGNRTFGFLNDPSLLPA 223 (388) Q Consensus 144 ~~~n~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~K~~aAr~a~~~~~n~i~~~G~a~~~~~g~~GllN~P~l~a~ 223 (388) .+...+...-..+.++..+.++.+=++...-...+|.+.-+...++++.+.+++-.++|+.........|+++...- . T Consensus 66 ~~~~f~~v~l~~~kl~~~~~iS~ell~~~~d~~~~l~~~i~~~la~ai~~~~d~a~l~G~~~~~~~~~~gi~~~~~~--~ 143 (311) T protein:vir:81 66 STATFAPVTAIPRKVQVTQRFSQEVKWADESRQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSGSPAKILD--T 143 (311) T ss_pred ccceeeEEEEeeEEEEEeehhhHHHhhcCcccHHHHHHHHHHHHHHHHHHHHHHhhhccccCCCCcccccccccccc--c Confidence 99998888888999988888887433333334567888888999999999999999999754333345566654211 0 Q ss_pred cccccCCcccccccCCHHHHHHHHHHHHHHHHHhcCCeeccccccceEEcCHHHHHhhccC-CCcCccHHHHHH-HhCCc Q lcl|Aclame:pro 224 IASTTPGGWVSGGANAFQGIVGDLRLMLITLRVQSEDNIDPEDVDITLVLPMNKVDMLSVV-TDLGISVRDWLK-QTYPR 301 (388) Q Consensus 224 ~~~~~~~~~t~Wa~kT~~eI~~DI~~~~~~l~~~s~g~v~~~~~p~tL~Lp~~~~~~Ls~~-~~~~~Tvl~~lk-~n~pn 301 (388) .. .......+...+..+|.+++..+.... ..+..++|.+..+..|.+- +..|.-++.-.. ...| T Consensus 144 ~~------~~~~~~~~~~~~~~~i~~~~~~~~~~~-------~~~~~~vmn~~~~~~l~~lkd~~G~~l~~~~~~~~~~- 209 (311) T protein:vir:81 144 TN------IVELTTGTSATPDLAVEAAVGLVLGDN-------LSPDGVALDNTFSFMLATQRDSQGRKLYPELGFGTDV- 209 (311) T ss_pred ce------eeeecccccchHHHHHHHHHHHhhhcC-------CCceEEEEcHHHHHHHHhhhccCCCeeecCccccCCC- Confidence 00 001122222234566777776665332 1244689999999988642 333433332111 0011 Q ss_pred cEEEEccc-----ccc---c-------cCCCCccEEEEEEcccccccccccCCCcceEeecchhhhccCc-eeccCceEE Q lcl|Aclame:pro 302 VRVMSAPE-----LQG---G-------NPDDGKDIAYMFLDSVDTAVDGSTDGGDTWAQLVQSKFVTLGV-EKRVKNYVE 365 (388) Q Consensus 302 l~i~~~pe-----l~~---a-------~gtg~~~~~~~~~~~~d~~~~~~~~~~~t~~~~~p~~~r~~~v-~~~~~~~~~ 365 (388) -++...|= +.+ . ...+.++..+++. +......+... .-+++. .++-.....+ -++.-.... T Consensus 210 ~tl~G~Pv~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~g-Dfs~~~i~~~~-~~~~~~-~~~~~~~~~~~~~~~~~v~~ 286 (311) T protein:vir:81 210 ASFAGLNAAVSDTVRGGPEAVTASTGVYRTTNPNVKAIAG-DFSAFRWGVQV-SIPLEL-IEFGDPDGLGDLKRQNQIAI 286 (311) T ss_pred ceecceeEEecccccccccccccccchhcccCCccEEEEE-ecccEEEEEec-cceEEE-eccCCCCcchhhhhcCcEEE Confidence 11111110 000 0 0011122222221 11111111100 011111 1110000000 000111233 Q ss_pred ecccceeeeeeeccccceeeccC Q lcl|Aclame:pro 366 AYSNATAGVMLKRPWAVVRLIGL 388 (388) Q Consensus 366 ~~~~~t~G~ii~rP~ai~~~~GI 388 (388) .+..|+++ .+.+|-||+++.|. T Consensus 287 r~~~r~d~-~v~~~~a~~~l~~a 308 (311) T protein:vir:81 287 RAEVVYGI-GIMSTDAFAVVRDA 308 (311) T ss_pred EEEEEecc-EeecccceEEEEee Confidence 44455555 45559999999999 No 27 >protein:vir:94142 Length: 304 # NCBI annotation: ORF013 # Family: family:all:507 # MgeID: mge:1494 # MgeName: 96 # Cross-refs: genbank:acc:YP_240234;genbank:gi:66395898;genbank:GeneID:5133311 Probab=98.05 E-value=5.7e-07 Score=54.82 Aligned_cols=286 Identities=11% Similarity=-0.049 Sum_probs=151.0 Q ss_pred hhhhccCcccccccccccch--HHHHHHHhhcceeeeecccchhhhhhcccccCCCCceeeEEEeeeccccceEeccccc Q lcl|Aclame:pro 62 VATQAFDSAYVAPTTQASIP--TPIQFLQQWLPGFVKVLTSARKIDEILGVKTVGSWEDQEIVQGIVEPAGTAMEYGDLT 139 (388) Q Consensus 62 ~~~~amDaa~~~~~t~~~~g--~l~~~l~~idp~v~e~l~~~~~~~~i~~v~t~g~w~~~t~~~~v~e~~G~a~~ygd~~ 139 (388) |+....++... .++.+.| +|-... ++|++.+......+++..+...+. ....+++.+..+.+...+... T Consensus 1 ma~~~~~~~~~--~~t~~gg~lip~~~~----~~ii~~~~~~~~l~~~~~~~~~~~---~~~~ip~~~~~~~a~~v~E~~ 71 (304) T protein:vir:94 1 MATPTYTPGNV--ILSDFKNGVIPAEQG----TLIMKDIMANSAIMKLAKNEPMTA---QKKKFTYLAKGVGAYWVSETE 71 (304) T ss_pred Ccccccccccc--cccCCCceecchhHH----HHHHHHHHhccchhhhcceeeccC---CceEEEEEeCCcceEEeecCc Confidence 34444444322 1223333 444433 456666655555566655544322 334677777777788888888 Q ss_pred CCceeeeeeeeeeeeEEEEEEEEeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEEeecCccccceEEEeecCC Q lcl|Aclame:pro 140 NIPLSSWNVNFERRTIVRGEMGIQVGLLEEGRASAMRINSAEVKRQGAAVQLEIMRNAIGFYGWEGKNGNRTFGFLNDPS 219 (388) Q Consensus 140 diP~~~~n~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~K~~aAr~a~~~~~n~i~~~G~a~~~~~g~~GllN~P~ 219 (388) .+|..+...+......+.++..+.++.+=++. ...++...-.....+++.+.+|+-.++|+... +-.|.+.+.. T Consensus 72 ~~~~~~~~~~~i~~~~~k~~~~~~iS~ell~d---s~~~l~~~i~~~l~~~ia~~~d~~~l~G~g~~---~~~~~~~~~~ 145 (304) T protein:vir:94 72 RIQTSKPEYAQAEMEAKKIGVIIPLSKEFLKW---TAKDFFNEVKPLIAEAFYKAFDQAVIFGTKSP---YNTSTSGKPL 145 (304) T ss_pred ccccccceeeEEEEEEEEEEEeehhhHHHHhc---chHHHHHHHHHHHHHHHHHHHHhhheeccCCC---cccccccccc Confidence 99999999999999999999999998854432 34778888889999999999999999998532 2334343333 Q ss_pred CccccccccCCcccccccCCHHHHHHHHHHHHHHHHHhcCCeeccccccceEEcCHHHHHhhccC-CCcCccHHHHHHHh Q lcl|Aclame:pro 220 LLPAIASTTPGGWVSGGANAFQGIVGDLRLMLITLRVQSEDNIDPEDVDITLVLPMNKVDMLSVV-TDLGISVRDWLKQT 298 (388) Q Consensus 220 l~a~~~~~~~~~~t~Wa~kT~~eI~~DI~~~~~~l~~~s~g~v~~~~~p~tL~Lp~~~~~~Ls~~-~~~~~Tvl~~lk~n 298 (388) +....+.. . ...+....++||.+++..+...- ..+..++|.++.+..|.+. +..|.-+++ .+ T Consensus 146 ~~~~~~~~------~-~~~~~~~~~~~i~~~~~~l~~~~-------~~~~~~v~~~~~~~~L~~lkd~~G~~l~~---~~ 208 (304) T protein:vir:94 146 VEGAEEKG------N-VVTDTNNLYVDLSALMATIEDEE-------LDPNGVLTTRSFRSKMRNALDANDRPLFD---AN 208 (304) T ss_pred cccccccc------c-ccccccchHHHHHHHHHHhhhcc-------CCcCEEEEcHHHHHHHHHhhccCCcEeec---CC Confidence 32211111 1 11222335888888888876432 1234799999999998642 333433221 11 Q ss_pred CC---ccEEEEccccccccCCCCccEEEEEEcccccccccccCCCcceEeecchhh--------hccCce---eccCceE Q lcl|Aclame:pro 299 YP---RVRVMSAPELQGGNPDDGKDIAYMFLDSVDTAVDGSTDGGDTWAQLVQSKF--------VTLGVE---KRVKNYV 364 (388) Q Consensus 299 ~p---nl~i~~~pel~~a~gtg~~~~~~~~~~~~d~~~~~~~~~~~t~~~~~p~~~--------r~~~v~---~~~~~~~ 364 (388) -. ++.++..+.+-. .+++..+++. +......+...+ -++. +-.+. +..+.. .+.--+. T Consensus 209 ~~~l~G~PV~~~~~~~~----~~~~~~~~~g-d~~~~~~~~~~~-~~i~--~~~e~~~~~~~~~~~~g~~~~~f~~~~~~ 280 (304) T protein:vir:94 209 GNEIMGLPLSYTGADVY----DKKKSLALMG-DWDYARYGILQG-IEYA--ISEDATLTTLQASDASGQPVSLFERDMFA 280 (304) T ss_pred CccccceeeEEeccccc----CCCCcEEEEE-ehhhEEEEEecc-eEEE--EeecceeeeecccccCccchhhhhcCcEE Confidence 01 122322222211 1122222222 111111111100 0000 00000 000000 0111123 Q ss_pred EecccceeeeeeeccccceeeccC Q lcl|Aclame:pro 365 EAYSNATAGVMLKRPWAVVRLIGL 388 (388) Q Consensus 365 ~~~~~~t~G~ii~rP~ai~~~~GI 388 (388) ..+..|.++. +.+|-||+.+..- T Consensus 281 ~r~~~r~~~~-v~~~~a~~~l~~a 303 (304) T protein:vir:94 281 LRATMHIAYM-NVKPEAFATLKPT 303 (304) T ss_pred EEEEEEeccE-eecccceEEEEec Confidence 3344455554 5559999999988 No 28 >protein:vir:105905 Length: 304 # NCBI annotation: major capsid protein # Family: family:all:507 # MgeID: mge:1514 # MgeName: phiETA3 # Cross-refs: genbank:acc:YP_001004375;genbank:gi:122891830;genbank:GeneID:4712376 Probab=98.05 E-value=5.7e-07 Score=54.82 Aligned_cols=286 Identities=11% Similarity=-0.049 Sum_probs=151.0 Q ss_pred hhhhccCcccccccccccch--HHHHHHHhhcceeeeecccchhhhhhcccccCCCCceeeEEEeeeccccceEeccccc Q lcl|Aclame:pro 62 VATQAFDSAYVAPTTQASIP--TPIQFLQQWLPGFVKVLTSARKIDEILGVKTVGSWEDQEIVQGIVEPAGTAMEYGDLT 139 (388) Q Consensus 62 ~~~~amDaa~~~~~t~~~~g--~l~~~l~~idp~v~e~l~~~~~~~~i~~v~t~g~w~~~t~~~~v~e~~G~a~~ygd~~ 139 (388) |+....++... .++.+.| +|-... ++|++.+......+++..+...+. ....+++.+..+.+...+... T Consensus 1 ma~~~~~~~~~--~~t~~gg~lip~~~~----~~ii~~~~~~~~l~~~~~~~~~~~---~~~~ip~~~~~~~a~~v~E~~ 71 (304) T protein:vir:10 1 MATPTYTPGNV--ILSDFKNGVIPAEQG----TLIMKDIMANSAIMKLAKNEPMTA---QKKKFTYLAKGVGAYWVSETE 71 (304) T ss_pred Ccccccccccc--cccCCCceecchhHH----HHHHHHHHhccchhhhcceeeccC---CceEEEEEeCCcceEEeecCc Confidence 34444444322 1223333 444433 456666655555566655544322 334677777777788888888 Q ss_pred CCceeeeeeeeeeeeEEEEEEEEeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEEeecCccccceEEEeecCC Q lcl|Aclame:pro 140 NIPLSSWNVNFERRTIVRGEMGIQVGLLEEGRASAMRINSAEVKRQGAAVQLEIMRNAIGFYGWEGKNGNRTFGFLNDPS 219 (388) Q Consensus 140 diP~~~~n~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~K~~aAr~a~~~~~n~i~~~G~a~~~~~g~~GllN~P~ 219 (388) .+|..+...+......+.++..+.++.+=++. ...++...-.....+++.+.+|+-.++|+... +-.|.+.+.. T Consensus 72 ~~~~~~~~~~~i~~~~~k~~~~~~iS~ell~d---s~~~l~~~i~~~l~~~ia~~~d~~~l~G~g~~---~~~~~~~~~~ 145 (304) T protein:vir:10 72 RIQTSKPEYAQAEMEAKKIGVIIPLSKEFLKW---TAKDFFNEVKPLIAEAFYKAFDQAVIFGTKSP---YNTSTSGKPL 145 (304) T ss_pred ccccccceeeEEEEEEEEEEEeehhhHHHHhc---chHHHHHHHHHHHHHHHHHHHHhhheeccCCC---cccccccccc Confidence 99999999999999999999999998854432 34778888889999999999999999998532 2334343333 Q ss_pred CccccccccCCcccccccCCHHHHHHHHHHHHHHHHHhcCCeeccccccceEEcCHHHHHhhccC-CCcCccHHHHHHHh Q lcl|Aclame:pro 220 LLPAIASTTPGGWVSGGANAFQGIVGDLRLMLITLRVQSEDNIDPEDVDITLVLPMNKVDMLSVV-TDLGISVRDWLKQT 298 (388) Q Consensus 220 l~a~~~~~~~~~~t~Wa~kT~~eI~~DI~~~~~~l~~~s~g~v~~~~~p~tL~Lp~~~~~~Ls~~-~~~~~Tvl~~lk~n 298 (388) +....+.. . ...+....++||.+++..+...- ..+..++|.++.+..|.+. +..|.-+++ .+ T Consensus 146 ~~~~~~~~------~-~~~~~~~~~~~i~~~~~~l~~~~-------~~~~~~v~~~~~~~~L~~lkd~~G~~l~~---~~ 208 (304) T protein:vir:10 146 VEGAEEKG------N-VVTDTNNLYVDLSALMATIEDEE-------LDPNGVLTTRSFRSKMRNALDANDRPLFD---AN 208 (304) T ss_pred cccccccc------c-ccccccchHHHHHHHHHHhhhcc-------CCcCEEEEcHHHHHHHHHhhccCCcEeec---CC Confidence 32211111 1 11222335888888888876432 1234799999999998642 333433221 11 Q ss_pred CC---ccEEEEccccccccCCCCccEEEEEEcccccccccccCCCcceEeecchhh--------hccCce---eccCceE Q lcl|Aclame:pro 299 YP---RVRVMSAPELQGGNPDDGKDIAYMFLDSVDTAVDGSTDGGDTWAQLVQSKF--------VTLGVE---KRVKNYV 364 (388) Q Consensus 299 ~p---nl~i~~~pel~~a~gtg~~~~~~~~~~~~d~~~~~~~~~~~t~~~~~p~~~--------r~~~v~---~~~~~~~ 364 (388) -. ++.++..+.+-. .+++..+++. +......+...+ -++. +-.+. +..+.. .+.--+. T Consensus 209 ~~~l~G~PV~~~~~~~~----~~~~~~~~~g-d~~~~~~~~~~~-~~i~--~~~e~~~~~~~~~~~~g~~~~~f~~~~~~ 280 (304) T protein:vir:10 209 GNEIMGLPLSYTGADVY----DKKKSLALMG-DWDYARYGILQG-IEYA--ISEDATLTTLQASDASGQPVSLFERDMFA 280 (304) T ss_pred CccccceeeEEeccccc----CCCCcEEEEE-ehhhEEEEEecc-eEEE--EeecceeeeecccccCccchhhhhcCcEE Confidence 01 122322222211 1122222222 111111111100 0000 00000 000000 0111123 Q ss_pred EecccceeeeeeeccccceeeccC Q lcl|Aclame:pro 365 EAYSNATAGVMLKRPWAVVRLIGL 388 (388) Q Consensus 365 ~~~~~~t~G~ii~rP~ai~~~~GI 388 (388) ..+..|.++. +.+|-||+.+..- T Consensus 281 ~r~~~r~~~~-v~~~~a~~~l~~a 303 (304) T protein:vir:10 281 LRATMHIAYM-NVKPEAFATLKPT 303 (304) T ss_pred EEEEEEeccE-eecccceEEEEec Confidence 3344455554 5559999999988 No 29 >protein:vir:99920 Length: 311 # NCBI annotation: gp7 # Family: family:all:966 # MgeID: mge:1611 # MgeName: Halo # Cross-refs: genbank:acc:YP_655524;genbank:gi:109392294;genbank:GeneID:4157089 Probab=98.03 E-value=3.3e-07 Score=56.15 Aligned_cols=294 Identities=10% Similarity=-0.042 Sum_probs=151.2 Q ss_pred hhccCcccccccccccchHHHHHHHhhcceeeeecccchhhhhhcccccCCCCceeeEEEeeeccccceEecccccCCce Q lcl|Aclame:pro 64 TQAFDSAYVAPTTQASIPTPIQFLQQWLPGFVKVLTSARKIDEILGVKTVGSWEDQEIVQGIVEPAGTAMEYGDLTNIPL 143 (388) Q Consensus 64 ~~amDaa~~~~~t~~~~g~l~~~l~~idp~v~e~l~~~~~~~~i~~v~t~g~w~~~t~~~~v~e~~G~a~~ygd~~diP~ 143 (388) |-++++. .+.-+|-.+.+ +|++.+.+....+++..+...+. ....|++....+.+...|....+|. T Consensus 1 Mat~tt~-------~g~~vP~~~~~----~ii~~~~~~s~l~~~~~~i~~~~---~~~~~p~~~~~~~a~wv~Eg~~~~~ 66 (311) T protein:vir:99 1 MATFGTG-------NLKNLPRNIAD----GMVKDVVQGSTVAVLSARKPQRF---GNEDIITFNGRPKAEFVGEGQQKSS 66 (311) T ss_pred CceecCC-------CceeccHHHHH----HHHHHHHhhchhhhhcceeeccC---CceEEEEEeCCceeEEeecCccccc Confidence 2233321 12224444433 55565555555555544332221 3356788777788888999999999 Q ss_pred eeeeeeeeeeeEEEEEEEEeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEEeecCccccceEEEeecCCCccc Q lcl|Aclame:pro 144 SSWNVNFERRTIVRGEMGIQVGLLEEGRASAMRINSAEVKRQGAAVQLEIMRNAIGFYGWEGKNGNRTFGFLNDPSLLPA 223 (388) Q Consensus 144 ~~~n~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~K~~aAr~a~~~~~n~i~~~G~a~~~~~g~~GllN~P~l~a~ 223 (388) .+....+..-..+.+...+.++.+=++.......+|...-....++++.+.+++-.++|+......+..|+.|-....+. T Consensus 67 ~~~~f~~v~l~~~k~~~~~~iS~ell~~~~d~~~~l~~~i~~~la~ai~~~~d~~~l~G~g~~~g~~~~g~~~~~~~~~~ 146 (311) T protein:vir:99 67 TTGEFDFVTSTPKKAQVTMRFNEEVQWADEDYQLGVLQTLSEAGAEALARALDLGLYHRINPLTGTVIPGWSNYLGAASK 146 (311) T ss_pred ccceeeEEEEeeEEEEEeehhhHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHhhcccCcccCccccccccccccccc Confidence 99999999999999999888888433333345678899999999999999999999999753322334454443322111 Q ss_pred cccccCCcccccccCCHHHHHHHHHHHHHHHHHhcCCeeccccccceEEcCHHHHHhhccC-CCcCccHHHHHHHhCCcc Q lcl|Aclame:pro 224 IASTTPGGWVSGGANAFQGIVGDLRLMLITLRVQSEDNIDPEDVDITLVLPMNKVDMLSVV-TDLGISVRDWLKQTYPRV 302 (388) Q Consensus 224 ~~~~~~~~~t~Wa~kT~~eI~~DI~~~~~~l~~~s~g~v~~~~~p~tL~Lp~~~~~~Ls~~-~~~~~Tvl~~lk~n~pnl 302 (388) .. +-...+......||..++..+...... ..++.++|.+..+..|.+. +..|.-+++-....-..- T Consensus 147 ~~--------~~~~~~~~~~~~~i~~~~~~~~~~~~~-----~~~~~~vmn~~~~~~L~~lkd~~G~~l~~~~~~~~~~~ 213 (311) T protein:vir:99 147 RV--------ELTADTIANPDLAIEAAVGLLVANGHP-----TPVNGLALHPSIAWGLSTARYTDGRKKFPELGLGIGVS 213 (311) T ss_pred ee--------eccccccchhHHHHHHHHHHHhhhccC-----CCccEEEEcHHHHHHHHhhhccCCCeeecCcccCCCCc Confidence 00 111223334466788787777655432 2344689999999988653 333443332111110000 Q ss_pred EEEEcc-----ccccccCCCCc-------cEEEEEEcccccccccccCCCcceEeecchhhhccCc---eeccCceEEec Q lcl|Aclame:pro 303 RVMSAP-----ELQGGNPDDGK-------DIAYMFLDSVDTAVDGSTDGGDTWAQLVQSKFVTLGV---EKRVKNYVEAY 367 (388) Q Consensus 303 ~i~~~p-----el~~a~gtg~~-------~~~~~~~~~~d~~~~~~~~~~~t~~~~~p~~~r~~~v---~~~~~~~~~~~ 367 (388) ++...| .+.+-.++... +...+|.-+......-.....-++.. .+.. .+.. -++.--.-..+ T Consensus 214 ~l~G~Pv~~s~~i~~~~~~~~~~~~~~~~~~~~~~~Gdf~~~~~~~~~~~~~~~~-~~~~--~~~~~~~~~~~d~~~~r~ 290 (311) T protein:vir:99 214 SFEGIDASVSDTVNGGDEADPDDEDLDAARAVRGIVGDFANGIHWGVQRDIPVEL-IKYG--DPDGQGDLKRHNQIALRL 290 (311) T ss_pred eecceeeEeecccccccccccccchhhccCcceEEEeeccccEEEEEecCceEEE-eecC--CCCcchhhhhcCcEEEEE Confidence 222211 11110011111 11112222111000000000001110 0000 0000 01111234566 Q ss_pred ccceeeeeeeccccceeeccC Q lcl|Aclame:pro 368 SNATAGVMLKRPWAVVRLIGL 388 (388) Q Consensus 368 ~~~t~G~ii~rP~ai~~~~GI 388 (388) ..|++|. |++|-+++..++. T Consensus 291 ~~r~d~~-v~~~~~v~~~~~~ 310 (311) T protein:vir:99 291 EIVYGWY-VFTDRFVVIENAV 310 (311) T ss_pred EEeecce-ecChhHeeeeccc Confidence 7888887 5669877777777 No 30 >protein:vir:78523 Length: 338 # NCBI annotation: Putative head structural protein # Family: family:all:507 # MgeID: mge:1853 # MgeName: U2 # Cross-refs: genbank:acc:YP_001491585;genbank:gi:157786408;genbank:GeneID:5625675 Probab=97.99 E-value=1e-06 Score=53.47 Aligned_cols=304 Identities=10% Similarity=-0.043 Sum_probs=151.1 Q ss_pred hhhcccccccccCCHHHHhhcceecccchhhcchhhhhhhhhhhhccCcccccccccccch-HHHHHHHhhcceeeeecc Q lcl|Aclame:pro 21 MANGKADYRLTDMAVRELKKFGLVFDHATVKRQIELLHEGGVATQAFDSAYVAPTTQASIP-TPIQFLQQWLPGFVKVLT 99 (388) Q Consensus 21 ~~~~~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~amDaa~~~~~t~~~~g-~l~~~l~~idp~v~e~l~ 99 (388) |+ .+++| +.+++-.+..+..+..+.+ +|-.+.+ +|++.+. T Consensus 1 ~~-----------~~~e~------------------------~~~~~~~~~~~~~~~~~~~liP~~~~~----~ii~~~~ 41 (338) T protein:vir:78 1 MA-----------TLNEL------------------------APNTAGSNHQGRLAHVPSDLLPKEIVG----PIFDKAQ 41 (338) T ss_pred Cc-----------chHHh------------------------hhhhcccccccceecccccccchHHHH----HHHHHHH Confidence 00 01222 2233333333332222222 5555544 5566665 Q ss_pred cchhhhhhcccccCCCCceeeEEEeeecc--------ccceEecccccCCceeeeeeeeeeeeEEEEEEEEeecHHHHHH Q lcl|Aclame:pro 100 SARKIDEILGVKTVGSWEDQEIVQGIVEP--------AGTAMEYGDLTNIPLSSWNVNFERRTIVRGEMGIQVGLLEEGR 171 (388) Q Consensus 100 ~~~~~~~i~~v~t~g~w~~~t~~~~v~e~--------~G~a~~ygd~~diP~~~~n~~~~~~~v~~~~~~~~y~~~El~~ 171 (388) .....+.+.++..... ....+++... .+.+...++...+|..+...+......+.++..+.++.+=++. T Consensus 42 ~~s~l~~l~~~~~~~~---~~~~ip~~~~~~~a~~v~~~~~~~~~Eg~~~~~~~~~f~~v~l~~~k~~~~~~is~ell~d 118 (338) T protein:vir:78 42 ESSLVLRLGENIPISY---GETIIPTTVKRPEVGQVGVGTSNEQREGGTKPLSGTAWDTRSVAPIKLATIVTVSEEFARM 118 (338) T ss_pred hhchhhhhcceeeccC---CceEEEEEecCccceeecccccccccccccccccccceeEEEEEEEEEEEeehhhHHHHhc Confidence 5555566655443322 2334444322 2344556778889999999999999999999998888843332 Q ss_pred HHHhCCChHHHHHHHHHHHHHHhhceEEEEeecCccccceEEEeecCCCccccccccCCcccccccCCHHHHHHHHHHHH Q lcl|Aclame:pro 172 ASAMRINSAEVKRQGAAVQLEIMRNAIGFYGWEGKNGNRTFGFLNDPSLLPAIASTTPGGWVSGGANAFQGIVGDLRLML 251 (388) Q Consensus 172 A~~~g~~l~~~K~~aAr~a~~~~~n~i~~~G~a~~~~~g~~GllN~P~l~a~~~~~~~~~~t~Wa~kT~~eI~~DI~~~~ 251 (388) ...++.+.-....++++.+.+|+-.+.|+......+..|++++..+....+.. . ..+.....++++.+++ T Consensus 119 ---s~~~~~~~i~~~la~a~~~~~d~~~l~G~g~~~~~~~~gi~~~~~~~~~~~~~-----~--~~~~~~~~~~~~~~~~ 188 (338) T protein:vir:78 119 ---NPSGLYTKLQADLAYAIGRGIDLAVFHGKSPLTGSALQGIDTNNVIVNTTNVD-----Y--LQTGTTPLLDRFLDGY 188 (338) T ss_pred ---CHHHHHHHHHHHHHHHHHHHHHHHhhcccCCCccccccccccccccccccccc-----c--ccccchhhHHHHHHHH Confidence 33678888888999999999999999998644344677877765542211110 1 1122345688888887 Q ss_pred HHHHHhcCCeeccccccceEEcCHHHHHhhcc----CCCcCccHHHHHHHhCCccEEEE--------ccccccccCCCCc Q lcl|Aclame:pro 252 ITLRVQSEDNIDPEDVDITLVLPMNKVDMLSV----VTDLGISVRDWLKQTYPRVRVMS--------APELQGGNPDDGK 319 (388) Q Consensus 252 ~~l~~~s~g~v~~~~~p~tL~Lp~~~~~~Ls~----~~~~~~Tvl~~lk~n~pnl~i~~--------~pel~~a~gtg~~ 319 (388) ..+...... .+..++|.+..+..|.. .+..|.-++.-.......-+|-. +|.-.++ ..++ T Consensus 189 ~~~~~~~~~------~~~~~~m~~~~~~~L~~~~~l~d~~g~~l~~~~~~~~~~~~l~G~PV~~~~~ip~~~~~--~~~~ 260 (338) T protein:vir:78 189 DLVSANTDV------DFNGWAADPRYRARLLRSQAYRDANGNVDPTRINLAASAGDLLGLPVQFGKAVGGDLGA--ATDS 260 (338) T ss_pred HHhhhhccc------cceEEEEchHHHHHHHHHhhhccCCCceeecccccCCCCceeeeeeEEEccccCccccc--cCCc Confidence 777644321 24468899888776632 23334333221111111112222 2221111 1222 Q ss_pred cEEEEEEcccccccccccCCCcceEeecchhhhccCce--------eccCceEEecccceeeeeeeccccceeeccC Q lcl|Aclame:pro 320 DIAYMFLDSVDTAVDGSTDGGDTWAQLVQSKFVTLGVE--------KRVKNYVEAYSNATAGVMLKRPWAVVRLIGL 388 (388) Q Consensus 320 ~~~~~~~~~~d~~~~~~~~~~~t~~~~~p~~~r~~~v~--------~~~~~~~~~~~~~t~G~ii~rP~ai~~~~GI 388 (388) +..+++. +......+... .-+++. .++.-...+.. +..--....|+.|. |+.+.+|-||+++... T Consensus 261 ~~~~~~g-dfs~~~~~~~~-~~~i~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~r~-d~~v~~~~a~~~l~~~ 333 (338) T protein:vir:78 261 KVRVVGG-DFSQLKYGFAD-EIRVKM-SDTATLTDNTSPTPQTVSMWQTNQIAILIEVTF-GWLLGDKQAFVKFVDD 333 (338) T ss_pred ccEEEEE-ecceEEEEeec-ccEEEE-eecccccccccccccchhhhhcCcEEEEEEEEe-ccEeecccceEEEecc Confidence 2333332 21111111000 000000 00000000000 00011223344455 4555669999999999 No 31 >protein:vir:95763 Length: 297 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1578 # MgeName: SMP # Cross-refs: genbank:acc:YP_950590;genbank:gi:119953785;genbank:GeneID:5076833 Probab=97.99 E-value=1.2e-06 Score=53.07 Aligned_cols=283 Identities=10% Similarity=-0.025 Sum_probs=148.3 Q ss_pred hhhhccCcccccccccccch--HHHHHHHhhcceeeeecccchhhhhhcccccCCCCceeeEEEeeeccccceEeccccc Q lcl|Aclame:pro 62 VATQAFDSAYVAPTTQASIP--TPIQFLQQWLPGFVKVLTSARKIDEILGVKTVGSWEDQEIVQGIVEPAGTAMEYGDLT 139 (388) Q Consensus 62 ~~~~amDaa~~~~~t~~~~g--~l~~~l~~idp~v~e~l~~~~~~~~i~~v~t~g~w~~~t~~~~v~e~~G~a~~ygd~~ 139 (388) |....||+.... +..+.| +|..+.+ +|++.+.+..-.+++.++...+.- ....+++......+...++.. T Consensus 1 m~~~~~~~~~~~--~t~~~~~lvP~~~~~----~ii~~~~~~s~l~~~~~~~~~~~~--~~~~~~~~~~~~~a~~v~Eg~ 72 (297) T protein:vir:95 1 MTVQTFNPENVL--VSQKKDGTLHKEFTD----IIMKEVAQNSLVMQLGQYQEMEGE--QEKTVYVQTDGISAYWVNETE 72 (297) T ss_pred CCcccccccccc--ccCCCcceechhHHH----HHHHHHHhhchhhhhcceeecCCC--ccEEEEEEcCCceeEEeecCc Confidence 333445554321 222222 4555444 556666555555666554432211 123456666667778889999 Q ss_pred CCceeeeeeeeeeeeEEEEEEEEeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEEeecCccccceEEEeecCC Q lcl|Aclame:pro 140 NIPLSSWNVNFERRTIVRGEMGIQVGLLEEGRASAMRINSAEVKRQGAAVQLEIMRNAIGFYGWEGKNGNRTFGFLNDPS 219 (388) Q Consensus 140 diP~~~~n~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~K~~aAr~a~~~~~n~i~~~G~a~~~~~g~~GllN~P~ 219 (388) ++|..+..........+.++..+.++.+-++.+ ..++.+.-....++++.+.+++-.++|+.. .+..|+++... T Consensus 73 ~~~~~~~~f~~v~l~~~k~~~~~~is~ell~ds---~~~l~~~i~~~la~ai~~~~d~a~l~G~g~---~~~~gi~~~~~ 146 (297) T protein:vir:95 73 KIKTDKPEVVPVTLKAHKLGIILVTSREALNYT---WKKFFEDMKPQIVEAFYKKIDEAGLLGHDT---PFANSVAKAAK 146 (297) T ss_pred cccccccceeEEEEeeEEEEEeehhhHHHHhcC---HHHHHHHHHHHHHHHHHHHHHHHHhcccCC---ccccccccccc Confidence 999999999999999999999999998655433 257888888999999999999999999753 24567766533 Q ss_pred CccccccccCCcccccccCCHHHHHHHHHHHHHHHHHhcCCeeccccccceEEcCHHHHHhhccC-CCcCccHHHHHHHh Q lcl|Aclame:pro 220 LLPAIASTTPGGWVSGGANAFQGIVGDLRLMLITLRVQSEDNIDPEDVDITLVLPMNKVDMLSVV-TDLGISVRDWLKQT 298 (388) Q Consensus 220 l~a~~~~~~~~~~t~Wa~kT~~eI~~DI~~~~~~l~~~s~g~v~~~~~p~tL~Lp~~~~~~Ls~~-~~~~~Tvl~~lk~n 298 (388) ... +..++. -| ++||.+++..+...-. .+..++|.+..+..|.+. +..|.-++ ... T Consensus 147 ~~~----~~~~~~-----~t----~~~i~~~~~~l~~~~~-------~~~~~v~~~~~~~~L~~l~d~~G~~i~---~~~ 203 (297) T protein:vir:95 147 DAN----KVIGGP-----IN----YDNILKLQDALYDADV-------EPNAFVSKIQNRSALREARDGNKVSIY---DKA 203 (297) T ss_pred ccc----eecccc-----cC----HHHHHHHHHHhhhccC-------CcCEEEEcHHHHHHHHHhhccCCceee---cCC Confidence 211 111111 11 6677888887765421 245799999999988643 33343222 111 Q ss_pred CCccEEEEccccccccCCCCccEEEEEEcccccccccccCCCcceEeecchhhh-----ccCce---eccCceEEecccc Q lcl|Aclame:pro 299 YPRVRVMSAPELQGGNPDDGKDIAYMFLDSVDTAVDGSTDGGDTWAQLVQSKFV-----TLGVE---KRVKNYVEAYSNA 370 (388) Q Consensus 299 ~pnl~i~~~pel~~a~gtg~~~~~~~~~~~~d~~~~~~~~~~~t~~~~~p~~~r-----~~~v~---~~~~~~~~~~~~~ 370 (388) .. ++-..|-...-+.......+ ++.+ ......+... .-.++. ..+-.. ..+.. .+.-.....+..| T Consensus 204 ~~--~l~G~Pv~~~~~~~~~~~~~-~~gd-~s~~~~~~~~-~~~i~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~ 277 (297) T protein:vir:95 204 AN--TIDGITTVDLKSARFEKGDL-LAGD-FDNLIYGVPY-NITYKI-SEEGQISTITNADGTPINLFEQEMIAIRATMD 277 (297) T ss_pred CC--cccceeeEeecCCCCCCceE-EEEe-cccEEEEEec-CeEEEE-eeccccccccccCccchhhhhcCcEEEEEEEE Confidence 11 22222221100001111111 2211 1111010000 000100 000000 00000 1111233344445 Q ss_pred eeeeeeeccccceeeccC Q lcl|Aclame:pro 371 TAGVMLKRPWAVVRLIGL 388 (388) Q Consensus 371 t~G~ii~rP~ai~~~~GI 388 (388) .++ .+.+|-||+++..- T Consensus 278 ~d~-~v~~~~a~~~l~~a 294 (297) T protein:vir:95 278 IAV-MITKTDAFAKLTPA 294 (297) T ss_pred ecc-EeecccceEEEeec Confidence 544 45569999998877 No 32 >protein:vir:80684 Length: 315 # NCBI annotation: gp6 # Family: family:all:966 # MgeID: mge:1884 # MgeName: PA6 # Cross-refs: genbank:acc:YP_001285582;genbank:gi:148727088;genbank:GeneID:5247055 Probab=97.96 E-value=8.9e-07 Score=53.77 Aligned_cols=288 Identities=10% Similarity=-0.075 Sum_probs=141.8 Q ss_pred hccCcccccccccccchHHHHHHHhhcceeeeecccchhhhhhcccccCCCCceeeEEEeeeccccceEecccccCCcee Q lcl|Aclame:pro 65 QAFDSAYVAPTTQASIPTPIQFLQQWLPGFVKVLTSARKIDEILGVKTVGSWEDQEIVQGIVEPAGTAMEYGDLTNIPLS 144 (388) Q Consensus 65 ~amDaa~~~~~t~~~~g~l~~~l~~idp~v~e~l~~~~~~~~i~~v~t~g~w~~~t~~~~v~e~~G~a~~ygd~~diP~~ 144 (388) ||..+. +.++.-+|..+.+ +|++.+......+.+..+...+. ....+++....+.|...|....+|.. T Consensus 1 Ma~~~~-----~~gg~~vP~~~~~----~ii~~l~~~s~i~~l~~~i~~~~---~~~~ip~~~~~~~a~wv~Eg~~~~~s 68 (315) T protein:vir:80 1 MADDFL-----SAGKLELPGSMIG----AVRDRAIDSGVLAKLSPEQPTIF---GPVKGAVFSGVPRAKIVGEGEVKPSA 68 (315) T ss_pred CCCCcC-----CcCceEcchHHHH----HHHHHHHhhchhhhhcceeecCC---CceEEEEEeCCcceEEeeCCcccccc Confidence 443321 1122325555444 55555555555555544332221 34578888888888899999999999 Q ss_pred eeeeeeeeeeEEEEEEEEeecHHHHHHHHHhC--CChHHHHHHHHHHHHHHhhceEEEEeecCccccceEEEeecCCCcc Q lcl|Aclame:pro 145 SWNVNFERRTIVRGEMGIQVGLLEEGRASAMR--INSAEVKRQGAAVQLEIMRNAIGFYGWEGKNGNRTFGFLNDPSLLP 222 (388) Q Consensus 145 ~~n~~~~~~~v~~~~~~~~y~~~El~~A~~~g--~~l~~~K~~aAr~a~~~~~n~i~~~G~a~~~~~g~~GllN~P~l~a 222 (388) +...+...-..+.++....++.+ +.+..... -.|.+.-....++++.+.+++-.|+|+......+..|+.+.-+. T Consensus 69 ~~~f~~v~l~~~kl~~~~~iS~e-ll~~s~~~~~~~l~~~i~~~la~ai~~~~d~a~~~G~~~~~~~~~~~~~~~~~~-- 145 (315) T protein:vir:80 69 SVDVSAFTAQPIKVVTQQRVSDE-FMWADADYRLGVLQDLISPALGASIGRAVDLIAFHGIDPATGKAASAVHTSLNK-- 145 (315) T ss_pred ccceeeeEeeeeeEEeeehhhHH-HhhcCchhHHHHHHHHHHHHHHHHHHHHHhhheeeccCCCCCcccccccccccc-- Confidence 99888888888888888888874 43322211 12567777888899999999999999753222223333222110 Q ss_pred ccccccCCcccccccCCHHHHHHHHHHHHHHHHHhcCCeeccccccceEEcCHHHHHhhccC-CC-----cCccHHHHHH Q lcl|Aclame:pro 223 AIASTTPGGWVSGGANAFQGIVGDLRLMLITLRVQSEDNIDPEDVDITLVLPMNKVDMLSVV-TD-----LGISVRDWLK 296 (388) Q Consensus 223 ~~~~~~~~~~t~Wa~kT~~eI~~DI~~~~~~l~~~s~g~v~~~~~p~tL~Lp~~~~~~Ls~~-~~-----~~~Tvl~~lk 296 (388) + + -........++||.+++..+..... ..+..++|.+..+..|.+. +. .+..++.=+. T Consensus 146 ----~-----~-~~~~~~~~~~~d~~~~~~~~~~~~~------~~~~~~imn~~~~~~L~~l~~~~g~~~~g~~~~~~~~ 209 (315) T protein:vir:80 146 ----T-----K-NIVDATDSATADLVKAVGLIAGAGL------QVPNGVALDPAFSFALSTEVYPKGSPLAGQPMYPAAG 209 (315) T ss_pred ----c-----c-ceeeccccchHHHHHHHHHHhhccC------ccceEEEEcHHHHHHHHHHhhccCCcccccccccccc Confidence 0 0 0122233457788888877654322 1234689999988888543 11 1222111111 Q ss_pred HhCCccEEEEcc-----ccccccCCCCccEEEEEEcccccccccccCCCcceEeecchhhhccCc---eeccCceEEecc Q lcl|Aclame:pro 297 QTYPRVRVMSAP-----ELQGGNPDDGKDIAYMFLDSVDTAVDGSTDGGDTWAQLVQSKFVTLGV---EKRVKNYVEAYS 368 (388) Q Consensus 297 ~n~pnl~i~~~p-----el~~a~gtg~~~~~~~~~~~~d~~~~~~~~~~~t~~~~~p~~~r~~~v---~~~~~~~~~~~~ 368 (388) ..-|+ +|-..| .+......+......+|.-+.........+ .-.++. .+.. ...+. -++.-...+.+. T Consensus 210 ~g~~~-tl~G~PV~~~~~~~~~~~~~~~~~~~~~~GDfs~~~~g~~~-~~~i~i-~~~~-~~~~~~~~~~~~~~v~~r~~ 285 (315) T protein:vir:80 210 FAGLD-NWRGLNVGASSTVSGAPEMSPASGVKAIVGDFSRVHWGFQR-NFPIEL-IEYG-DPDQTGRDLKGHNEVMVRAE 285 (315) T ss_pred cCCCc-eecceeeEecCcCCcccccccccccEEEEeecccEEEEEec-CeeEEE-eccc-cccCcccchhhcCcEEEEEE Confidence 11111 222222 111100011111112222221111010000 001111 0000 00000 011111334455 Q ss_pred cceeeeeeeccccceeeccC Q lcl|Aclame:pro 369 NATAGVMLKRPWAVVRLIGL 388 (388) Q Consensus 369 ~~t~G~ii~rP~ai~~~~GI 388 (388) .|. |..|++|-||+++.+. T Consensus 286 ~r~-~~~v~~~~a~~~l~~~ 304 (315) T protein:vir:80 286 AVL-YVAIESLDSFAVVKEK 304 (315) T ss_pred EEe-cceeecccceEEEeec Confidence 555 4556669999999999 No 33 >protein:vir:96392 Length: 324 # NCBI annotation: ORF011 # Family: family:all:507 # MgeID: mge:1613 # MgeName: 53 # Cross-refs: genbank:acc:YP_239648;genbank:gi:66395381;genbank:GeneID:5132868 Probab=97.95 E-value=1.4e-06 Score=52.69 Aligned_cols=304 Identities=9% Similarity=-0.073 Sum_probs=147.4 Q ss_pred cchhhhhhcccccccccCCHHHHhhcceecccchhhcchhhhhhhhhhhhccCcccccccccccchHHHHHHHhhcceee Q lcl|Aclame:pro 16 VRAFDMANGKADYRLTDMAVRELKKFGLVFDHATVKRQIELLHEGGVATQAFDSAYVAPTTQASIPTPIQFLQQWLPGFV 95 (388) Q Consensus 16 ~~~~~~~~~~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~amDaa~~~~~t~~~~g~l~~~l~~idp~v~ 95 (388) ++ + ++.. -..+ + .|-.. ... ....+|........++.-+|..+.+ +|+ T Consensus 1 ~~-------~----~~~~-~~~~---~-~~~~~-------~~~-----~~~~~a~~~~~~~~~~~~iP~~~~~----~ii 48 (324) T protein:vir:96 1 ME-------Q----TQKL-KLNL---Q-HFASN-------NVK-----PQVFNPDNVMMHEKKDGTLMNEFTT----PIL 48 (324) T ss_pred CC-------c----chhh-hHHH---H-HHHHH-------hhh-----hhhhccccccccCcCccccchhHHH----HHH Confidence 11 1 0100 0011 1 11111 100 1111211111111122224554443 555 Q ss_pred eecccchhhhhhcccccCCCCceeeEEEeeeccccceEecccccCCceeeeeeeeeeeeEEEEEEEEeecHHHHHHHHHh Q lcl|Aclame:pro 96 KVLTSARKIDEILGVKTVGSWEDQEIVQGIVEPAGTAMEYGDLTNIPLSSWNVNFERRTIVRGEMGIQVGLLEEGRASAM 175 (388) Q Consensus 96 e~l~~~~~~~~i~~v~t~g~w~~~t~~~~v~e~~G~a~~ygd~~diP~~~~n~~~~~~~v~~~~~~~~y~~~El~~A~~~ 175 (388) +.+......++++++.+... .+..+++.+..+.+...+....+|..+...+......+.++..+.++.+-++.+ T Consensus 49 ~~~~~~s~l~~l~~~~~~~~---~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~~~is~ell~ds--- 122 (324) T protein:vir:96 49 QEVMENSKIMQLGKYEPMEG---TEKKFTFWADKPGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYT--- 122 (324) T ss_pred HHHHhhchhhhhcceeeccC---CceEEEEEecCcceeEecCCccccccccceeEEEEeeEEEEEeehhhHHHHhcc--- Confidence 55555555666655544322 346788887788888999999999999999999999999999999998544433 Q ss_pred CCChHHHHHHHHHHHHHHhhceEEEEeecCccccceEEEeecCCCccccccccCCcccccccCCHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 176 RINSAEVKRQGAAVQLEIMRNAIGFYGWEGKNGNRTFGFLNDPSLLPAIASTTPGGWVSGGANAFQGIVGDLRLMLITLR 255 (388) Q Consensus 176 g~~l~~~K~~aAr~a~~~~~n~i~~~G~a~~~~~g~~GllN~P~l~a~~~~~~~~~~t~Wa~kT~~eI~~DI~~~~~~l~ 255 (388) ..++...-.....+++.+.+++-.++|+... ....|+++..+..... . . ...-++||.+++..+. T Consensus 123 ~~~l~~~i~~~la~ai~~~~d~a~l~G~g~~--~~~~gi~~~~~~~~~~----~------~---~~~t~~~i~~~~~~l~ 187 (324) T protein:vir:96 123 YSQFFEEMKPMIAEAFYKKFDEAGILNQGNN--PFGKSIAQSIEKTNKV----I------K---GDFTQDNIIDLEALLE 187 (324) T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHhccCCCC--CcCcccccccccccee----c------c---ccccHHHHHHHHHhhh Confidence 3578888888888888899999999997532 2334555543321110 0 0 1112677777877765 Q ss_pred HhcCCeeccccccceEEcCHHHHHhhccC-CCcCccHHHHHHHhCCccEEEEccccc-cccCCCCccEEEEEEccccccc Q lcl|Aclame:pro 256 VQSEDNIDPEDVDITLVLPMNKVDMLSVV-TDLGISVRDWLKQTYPRVRVMSAPELQ-GGNPDDGKDIAYMFLDSVDTAV 333 (388) Q Consensus 256 ~~s~g~v~~~~~p~tL~Lp~~~~~~Ls~~-~~~~~Tvl~~lk~n~pnl~i~~~pel~-~a~gtg~~~~~~~~~~~~d~~~ 333 (388) ..- ..+..++|.+..+..|.+. +..|..++. .-..-++-..|=.. .+...+.+. + ++.+ ..... T Consensus 188 ~~~-------~~~~~~vmn~~~~~~L~~l~d~~G~~~~~----~~~~~~l~G~PV~~~~~~~~~~~~-~-~~gd-~~~~~ 253 (324) T protein:vir:96 188 DDE-------LEANAFISKTQNRSLLRKIVDPETKERIY----DRNSDSLDGLPVVNLKSSNLKRGE-L-ITGD-FDKLI 253 (324) T ss_pred hcc-------CCCCEEEEcHHHHHHHHHhhccCCCeeec----CCCCCcccceeeEeeCCCCCCcce-E-EEEe-cceEE Confidence 432 2345799999999988643 333433221 11111222222111 111112222 1 1111 10000 Q ss_pred ccccCCCcceEe---ecchhhhccCce----eccCceEEecccceeeeeeeccccceeeccC Q lcl|Aclame:pro 334 DGSTDGGDTWAQ---LVQSKFVTLGVE----KRVKNYVEAYSNATAGVMLKRPWAVVRLIGL 388 (388) Q Consensus 334 ~~~~~~~~t~~~---~~p~~~r~~~v~----~~~~~~~~~~~~~t~G~ii~rP~ai~~~~GI 388 (388) .+... .-+++. ..-...+...-. ...-.....+..|. |+.+.+|-||+++.|. T Consensus 254 ~g~~~-~~~i~~~~~~~~~~~~~~~~~~~~~f~~d~~~~r~~~r~-d~~v~~~~A~~~l~~a 313 (324) T protein:vir:96 254 YGIPQ-LIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHV-ALHIADDKAFAKLVPA 313 (324) T ss_pred EEEec-CcEEEEeecccccccccccccchhhhhcCcEEEEEEEEE-ccEEecccceEEEecc Confidence 00000 000000 000000000000 00001222333344 5566669999999998 No 34 >protein:vir:78830 Length: 324 # NCBI annotation: major head protein # Family: family:all:507 # MgeID: mge:1858 # MgeName: 80alpha # Cross-refs: genbank:acc:YP_001285361;genbank:gi:148717889;genbank:GeneID:5246961 Probab=97.95 E-value=1.4e-06 Score=52.69 Aligned_cols=304 Identities=9% Similarity=-0.073 Sum_probs=147.4 Q ss_pred cchhhhhhcccccccccCCHHHHhhcceecccchhhcchhhhhhhhhhhhccCcccccccccccchHHHHHHHhhcceee Q lcl|Aclame:pro 16 VRAFDMANGKADYRLTDMAVRELKKFGLVFDHATVKRQIELLHEGGVATQAFDSAYVAPTTQASIPTPIQFLQQWLPGFV 95 (388) Q Consensus 16 ~~~~~~~~~~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~amDaa~~~~~t~~~~g~l~~~l~~idp~v~ 95 (388) ++ + ++.. -..+ + .|-.. ... ....+|........++.-+|..+.+ +|+ T Consensus 1 ~~-------~----~~~~-~~~~---~-~~~~~-------~~~-----~~~~~a~~~~~~~~~~~~iP~~~~~----~ii 48 (324) T protein:vir:78 1 ME-------Q----TQKL-KLNL---Q-HFASN-------NVK-----PQVFNPDNVMMHEKKDGTLMNEFTT----PIL 48 (324) T ss_pred CC-------c----chhh-hHHH---H-HHHHH-------hhh-----hhhhccccccccCcCccccchhHHH----HHH Confidence 11 1 0100 0011 1 11111 100 1111211111111122224554443 555 Q ss_pred eecccchhhhhhcccccCCCCceeeEEEeeeccccceEecccccCCceeeeeeeeeeeeEEEEEEEEeecHHHHHHHHHh Q lcl|Aclame:pro 96 KVLTSARKIDEILGVKTVGSWEDQEIVQGIVEPAGTAMEYGDLTNIPLSSWNVNFERRTIVRGEMGIQVGLLEEGRASAM 175 (388) Q Consensus 96 e~l~~~~~~~~i~~v~t~g~w~~~t~~~~v~e~~G~a~~ygd~~diP~~~~n~~~~~~~v~~~~~~~~y~~~El~~A~~~ 175 (388) +.+......++++++.+... .+..+++.+..+.+...+....+|..+...+......+.++..+.++.+-++.+ T Consensus 49 ~~~~~~s~l~~l~~~~~~~~---~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~~~is~ell~ds--- 122 (324) T protein:vir:78 49 QEVMENSKIMQLGKYEPMEG---TEKKFTFWADKPGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYT--- 122 (324) T ss_pred HHHHhhchhhhhcceeeccC---CceEEEEEecCcceeEecCCccccccccceeEEEEeeEEEEEeehhhHHHHhcc--- Confidence 55555555666655544322 346788887788888999999999999999999999999999999998544433 Q ss_pred CCChHHHHHHHHHHHHHHhhceEEEEeecCccccceEEEeecCCCccccccccCCcccccccCCHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 176 RINSAEVKRQGAAVQLEIMRNAIGFYGWEGKNGNRTFGFLNDPSLLPAIASTTPGGWVSGGANAFQGIVGDLRLMLITLR 255 (388) Q Consensus 176 g~~l~~~K~~aAr~a~~~~~n~i~~~G~a~~~~~g~~GllN~P~l~a~~~~~~~~~~t~Wa~kT~~eI~~DI~~~~~~l~ 255 (388) ..++...-.....+++.+.+++-.++|+... ....|+++..+..... . . ...-++||.+++..+. T Consensus 123 ~~~l~~~i~~~la~ai~~~~d~a~l~G~g~~--~~~~gi~~~~~~~~~~----~------~---~~~t~~~i~~~~~~l~ 187 (324) T protein:vir:78 123 YSQFFEEMKPMIAEAFYKKFDEAGILNQGNN--PFGKSIAQSIEKTNKV----I------K---GDFTQDNIIDLEALLE 187 (324) T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHhccCCCC--CcCcccccccccccee----c------c---ccccHHHHHHHHHhhh Confidence 3578888888888888899999999997532 2334555543321110 0 0 1112677777877765 Q ss_pred HhcCCeeccccccceEEcCHHHHHhhccC-CCcCccHHHHHHHhCCccEEEEccccc-cccCCCCccEEEEEEccccccc Q lcl|Aclame:pro 256 VQSEDNIDPEDVDITLVLPMNKVDMLSVV-TDLGISVRDWLKQTYPRVRVMSAPELQ-GGNPDDGKDIAYMFLDSVDTAV 333 (388) Q Consensus 256 ~~s~g~v~~~~~p~tL~Lp~~~~~~Ls~~-~~~~~Tvl~~lk~n~pnl~i~~~pel~-~a~gtg~~~~~~~~~~~~d~~~ 333 (388) ..- ..+..++|.+..+..|.+. +..|..++. .-..-++-..|=.. .+...+.+. + ++.+ ..... T Consensus 188 ~~~-------~~~~~~vmn~~~~~~L~~l~d~~G~~~~~----~~~~~~l~G~PV~~~~~~~~~~~~-~-~~gd-~~~~~ 253 (324) T protein:vir:78 188 DDE-------LEANAFISKTQNRSLLRKIVDPETKERIY----DRNSDSLDGLPVVNLKSSNLKRGE-L-ITGD-FDKLI 253 (324) T ss_pred hcc-------CCCCEEEEcHHHHHHHHHhhccCCCeeec----CCCCCcccceeeEeeCCCCCCcce-E-EEEe-cceEE Confidence 432 2345799999999988643 333433221 11111222222111 111112222 1 1111 10000 Q ss_pred ccccCCCcceEe---ecchhhhccCce----eccCceEEecccceeeeeeeccccceeeccC Q lcl|Aclame:pro 334 DGSTDGGDTWAQ---LVQSKFVTLGVE----KRVKNYVEAYSNATAGVMLKRPWAVVRLIGL 388 (388) Q Consensus 334 ~~~~~~~~t~~~---~~p~~~r~~~v~----~~~~~~~~~~~~~t~G~ii~rP~ai~~~~GI 388 (388) .+... .-+++. ..-...+...-. ...-.....+..|. |+.+.+|-||+++.|. T Consensus 254 ~g~~~-~~~i~~~~~~~~~~~~~~~~~~~~~f~~d~~~~r~~~r~-d~~v~~~~A~~~l~~a 313 (324) T protein:vir:78 254 YGIPQ-LIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHV-ALHIADDKAFAKLVPA 313 (324) T ss_pred EEEec-CcEEEEeecccccccccccccchhhhhcCcEEEEEEEEE-ccEEecccceEEEecc Confidence 00000 000000 000000000000 00001222333344 5566669999999998 No 35 >protein:vir:9309 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:165 # MgeName: phi 11 # Cross-refs: genbank:acc:NP_803287;genbank:gi:29028597;genbank:GeneID:1258044 Probab=97.88 E-value=1.9e-06 Score=51.99 Aligned_cols=305 Identities=9% Similarity=-0.053 Sum_probs=146.1 Q ss_pred hhhcccccccccCCHHHHhhcceecccchhhcchhhhhhhhhhhhccCcccccccccccchHHHHHHHhhcceeeeeccc Q lcl|Aclame:pro 21 MANGKADYRLTDMAVRELKKFGLVFDHATVKRQIELLHEGGVATQAFDSAYVAPTTQASIPTPIQFLQQWLPGFVKVLTS 100 (388) Q Consensus 21 ~~~~~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~amDaa~~~~~t~~~~g~l~~~l~~idp~v~e~l~~ 100 (388) |.. .++.|.-+. .+. ........+.|......+.++.-+|-.+.. +|++.+.. T Consensus 1 ~~~------------~~~~~~~~~------~f~-----~~~~~~~~~~a~~~~~~~~~~~liP~~~~~----~ii~~~~~ 53 (324) T protein:vir:93 1 MEQ------------TQKLKLNLQ------HFA-----SNNVKPQVFNPDNVMMHEKKDGTLLNDFTT----PILQEVME 53 (324) T ss_pred Cch------------hHHHHHHHH------HHH-----HhhhhhhhcccccccccCCCcceechhHHH----HHHHHHHh Confidence 100 111110000 011 111122222322111111112224444433 45555544 Q ss_pred chhhhhhcccccCCCCceeeEEEeeeccccceEecccccCCceeeeeeeeeeeeEEEEEEEEeecHHHHHHHHHhCCChH Q lcl|Aclame:pro 101 ARKIDEILGVKTVGSWEDQEIVQGIVEPAGTAMEYGDLTNIPLSSWNVNFERRTIVRGEMGIQVGLLEEGRASAMRINSA 180 (388) Q Consensus 101 ~~~~~~i~~v~t~g~w~~~t~~~~v~e~~G~a~~ygd~~diP~~~~n~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~ 180 (388) ....+++.++...+. .++.|++.+..+.+...+....+|..+...+...-..+.++..+.++.+-++.+ ..++. T Consensus 54 ~s~l~~l~~~~~~~~---~~~~ip~~~~~~~a~~v~Eg~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~ds---~~~l~ 127 (324) T protein:vir:93 54 NSKIMQLGKYEPMEG---TEKKFTFWADKPGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYT---YSQFF 127 (324) T ss_pred hchhhhhcceeeccC---CceEEEEEecCcceeeecCCccccccccceeEEEEEeEEEEEeehhhHHHHhcc---hHHHH Confidence 444555554443222 345778887778888889999999999999988999999999988988544333 35778 Q ss_pred HHHHHHHHHHHHHhhceEEEEeecCccccceEEEeecCCCccccccccCCcccccccCCHHHHHHHHHHHHHHHHHhcCC Q lcl|Aclame:pro 181 EVKRQGAAVQLEIMRNAIGFYGWEGKNGNRTFGFLNDPSLLPAIASTTPGGWVSGGANAFQGIVGDLRLMLITLRVQSED 260 (388) Q Consensus 181 ~~K~~aAr~a~~~~~n~i~~~G~a~~~~~g~~GllN~P~l~a~~~~~~~~~~t~Wa~kT~~eI~~DI~~~~~~l~~~s~g 260 (388) +.-......++.+.+++-.+.|+... ....|+++........ ..+ ..-++||.+++..+...- T Consensus 128 ~~i~~~l~~aia~~~d~a~l~G~g~~--~~~~~~~~~~~~~~~~----~~~---------~~~~~~i~~~~~~l~~~~-- 190 (324) T protein:vir:93 128 EEMKPMIAEAFYKKFDEAGILNQGNN--PFGKSIAQSIEKTNKV----IKG---------DFTQDNIIDLEALLEDDE-- 190 (324) T ss_pred HHHHHHHHHHHHHHHHHHHhcCCCCC--CcCcccccccccccee----ccc---------cccHHHHHHHHHhhhhcc-- Confidence 88888888888888999889997532 2234555543221110 000 112678888888776532 Q ss_pred eeccccccceEEcCHHHHHhhccC-CCcCccHHHHHHHhCCccEEEEccccccccCCCCccEEEEEEcccccccccccCC Q lcl|Aclame:pro 261 NIDPEDVDITLVLPMNKVDMLSVV-TDLGISVRDWLKQTYPRVRVMSAPELQGGNPDDGKDIAYMFLDSVDTAVDGSTDG 339 (388) Q Consensus 261 ~v~~~~~p~tL~Lp~~~~~~Ls~~-~~~~~Tvl~~lk~n~pnl~i~~~pel~~a~gtg~~~~~~~~~~~~d~~~~~~~~~ 339 (388) ..+..++|.++.+..|.+. +..|.-++. ....+ ++-..|=....+.......+ ++ -+......+...+ T Consensus 191 -----~~~~~~v~n~~~~~~L~~l~d~~G~~~~~--~~~~~--~l~G~PVv~~~~~~~~~~~i-~~-gdfs~~~~~~~~~ 259 (324) T protein:vir:93 191 -----LEANAFISKTQNRSLLRKIVDPETKERIY--DRNSD--SLDGLPVVNLKSSNLKRGEL-IT-GDFDKLIYGIPQL 259 (324) T ss_pred -----CCCCEEEEcHHHHHHHHHhhCCCCCeeec--CCCCC--cccceeeEeecCCCCCcceE-EE-EecceEEEEEecC Confidence 1244799999999999643 333432211 01111 22222211110011111111 11 1111111111100 Q ss_pred CcceE---eecchhhhccCce----eccCceEEecccceeeeeeeccccceeeccC Q lcl|Aclame:pro 340 GDTWA---QLVQSKFVTLGVE----KRVKNYVEAYSNATAGVMLKRPWAVVRLIGL 388 (388) Q Consensus 340 ~~t~~---~~~p~~~r~~~v~----~~~~~~~~~~~~~t~G~ii~rP~ai~~~~GI 388 (388) -+++ ++.....+..... +..-...+.+..|. |+.+.+|-||+++.+. T Consensus 260 -~~i~~~~~~~~~~~~~~~~~~~~~f~~n~~~~r~~~r~-d~~v~~~~a~~~l~~a 313 (324) T protein:vir:93 260 -IEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHV-ALHIADDKAFAKLVPA 313 (324) T ss_pred -cEEEEeecccccccccccccchhhhhcCcEEEEEEEEe-ccEEecccceEEEecc Confidence 0000 0000000000000 00011222333444 5667779999999998 No 36 >protein:vir:108211 Length: 318 # NCBI annotation: gp9 # Family: family:all:6420 # MgeID: mge:2004 # MgeName: Giles # Cross-refs: genbank:acc:YP_001552338;genbank:gi:160700658;genbank:GeneID:5758931 Probab=97.87 E-value=1.3e-07 Score=58.38 Aligned_cols=279 Identities=13% Similarity=0.121 Sum_probs=133.6 Q ss_pred cccCCHHHHhhccee--cccchhhcchhhhhhhhhhhhccCcccccccccccchHHHHHHHhhcceeeeecccchhhhhh Q lcl|Aclame:pro 30 LTDMAVRELKKFGLV--FDHATVKRQIELLHEGGVATQAFDSAYVAPTTQASIPTPIQFLQQWLPGFVKVLTSARKIDEI 107 (388) Q Consensus 30 ~~~~~~~~l~~~g~~--~~~~~~~~~~~~~~~~~~~~~amDaa~~~~~t~~~~g~l~~~l~~idp~v~e~l~~~~~~~~i 107 (388) +++ + -||+ .++. ..+.|.-...| . =++.+..+..+++++ +..| T Consensus 1 ~~~--~-----~~i~s~~~~~---------------~itv~~ll~~P----~-~I~~~i~e~~~~~~i--------ad~l 45 (318) T protein:vir:10 1 MTA--P-----TGIVSVSDGP---------------AITVRELVGNP----L-WIPTALKKMMVNQFI--------SESL 45 (318) T ss_pred CCC--C-----CcceeeecCC---------------ceehHHhhCCc----h-hHHHHHHHHHhccch--------hhhh Confidence 110 0 0111 0000 00000000000 0 023333333333222 2233 Q ss_pred cccccCCCCceeeEEEeeeccc---cceEecccccCCceeeeeeeeeee-eEEEEEEEEeecHHHHHHHHHhCCChHHHH Q lcl|Aclame:pro 108 LGVKTVGSWEDQEIVQGIVEPA---GTAMEYGDLTNIPLSSWNVNFERR-TIVRGEMGIQVGLLEEGRASAMRINSAEVK 183 (388) Q Consensus 108 ~~v~t~g~w~~~t~~~~v~e~~---G~a~~ygd~~diP~~~~n~~~~~~-~v~~~~~~~~y~~~El~~A~~~g~~l~~~K 183 (388) |- +.+.-....+.|.-..+. |.+.-...+..+|+++......+. .+..++.+++++.+.+. ..+++...+. T Consensus 46 f~--~~~a~~~~~v~f~~~~p~~~~~d~e~VaEggEiP~~~~~~G~~~ia~~~K~G~~~~vS~Em~~---~n~~~~v~r~ 120 (318) T protein:vir:10 46 FR--NGGANPNGVVAYNEGNPSFLEDDVADVAEFGEIPVSAGARGLPRTAFAVKKALGVRVSKEMID---ENRVGAVNDQ 120 (318) T ss_pred hh--cccccccceeEEEecccccccCcHhhccCcccccccCCCCCchhhhhhehhccceeccHHHHh---hcChhHHHHH Confidence 32 112212233344332222 555555667889999877766554 55789999999996443 3567888899 Q ss_pred HHHHHHHHHHhhceEEEEeecCccccceEEEeecCCCccccccccCCcccccccCC-----HHHH----HHHHHHHHHHH Q lcl|Aclame:pro 184 RQGAAVQLEIMRNAIGFYGWEGKNGNRTFGFLNDPSLLPAIASTTPGGWVSGGANA-----FQGI----VGDLRLMLITL 254 (388) Q Consensus 184 ~~aAr~a~~~~~n~i~~~G~a~~~~~g~~GllN~P~l~a~~~~~~~~~~t~Wa~kT-----~~eI----~~DI~~~~~~l 254 (388) ...+++++..+.|+.++ ..|.+++++.-..+ ++|..|.... +.|. ..|++.+-..= T Consensus 121 ~~~l~Nti~r~~d~~a~------------dal~sa~t~~~~~s---~~w~~~~~~~~d~~~A~e~v~~a~~~~~~a~~~~ 185 (318) T protein:vir:10 121 MLQLRNTFIRANDRSAK------------ALLQSPIVPTLAVP---TAWDNGGKVRTDIAIAIEQISTAAPTAYPAGVGS 185 (318) T ss_pred HHHHHHHHHHHHHHHHH------------HHHhccccccccCC---cCCCCcccccccchhhhhhhhhhhhhhhhhhhhh Confidence 99999999999888765 23555554332111 1222222111 1111 11222211111 Q ss_pred HHhcCCeeccccccceEEcCHHHHHhhccCCC----c---CccHHHHH--HHhCC----ccEEEEccccccccCCCCccE Q lcl|Aclame:pro 255 RVQSEDNIDPEDVDITLVLPMNKVDMLSVVTD----L---GISVRDWL--KQTYP----RVRVMSAPELQGGNPDDGKDI 321 (388) Q Consensus 255 ~~~s~g~v~~~~~p~tL~Lp~~~~~~Ls~~~~----~---~~Tvl~~l--k~n~p----nl~i~~~pel~~a~gtg~~~~ 321 (388) .....| -.|++|+|.|..+..|.+... | +..+..-+ .-+|| +|+++..|-+. .+. T Consensus 186 ~~~~~G-----Y~pdtIVlhP~~~~~l~~n~~~~~~y~~~a~~~~~~~~~tg~~~g~~lGl~vi~s~~~p-------~~~ 253 (318) T protein:vir:10 186 SDEYFG-----FIPDTIVMHYALLPILMDNENFMKVYERNANYVSTAPDWTGNFPGSVMGLNVIRSRTFP-------IDR 253 (318) T ss_pred hhhccC-----ccceeeEECHHHHHHHhcchhhhhhhhccchhhhhcccccccccceeeceEEeecCccC-------CCe Confidence 122222 257899999999999954311 1 11111111 12233 36777777664 233 Q ss_pred EEEEEcccccccccccCCCcceEeecchhhhccCce-----e---ccCceEEecccceeeeeeeccccceeeccC Q lcl|Aclame:pro 322 AYMFLDSVDTAVDGSTDGGDTWAQLVQSKFVTLGVE-----K---RVKNYVEAYSNATAGVMLKRPWAVVRLIGL 388 (388) Q Consensus 322 ~~~~~~~~d~~~~~~~~~~~t~~~~~p~~~r~~~v~-----~---~~~~~~~~~~~~t~G~ii~rP~ai~~~~GI 388 (388) ++++.+-.. +....++.+...+.. + ++.+|....+..+ ...|.+|.|+..++|| T Consensus 254 alvlq~g~v------------G~~~d~~pl~~t~~~~egg~~~g~~~~s~~~~~~~~~-~~~V~~PkA~~~itgi 315 (318) T protein:vir:10 254 VLIMERGTV------------GFYSDTRPLQFTALYPEGNGPNGGPTESYRADASHKR-ALAVDQPKAALWLTGI 315 (318) T ss_pred eEEEecCCc------------ceeeccccceeeecccCCCCCCCCcchhhheehheee-eeeeeCcceeEEEeec Confidence 555544321 112122222222222 1 4567887776655 4667789999999999 No 37 >protein:vir:103955 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1662 # MgeName: phiNM # Cross-refs: genbank:acc:YP_873992;genbank:gi:118430767;genbank:GeneID:4525449 Probab=97.87 E-value=3.9e-06 Score=50.24 Aligned_cols=302 Identities=9% Similarity=-0.072 Sum_probs=148.3 Q ss_pred cchhhhhhcccccccccCCHHHHhhcceecccchhhcchhhhhhhhhhhhccCcccccccccccchHHHHHHHhhcceee Q lcl|Aclame:pro 16 VRAFDMANGKADYRLTDMAVRELKKFGLVFDHATVKRQIELLHEGGVATQAFDSAYVAPTTQASIPTPIQFLQQWLPGFV 95 (388) Q Consensus 16 ~~~~~~~~~~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~amDaa~~~~~t~~~~g~l~~~l~~idp~v~ 95 (388) ++ + .+. .-.+++++--. ......++|........++.-+|..+.+ +|+ T Consensus 1 ~~-------~----~~~-~~~~~~~f~~~----------------~~~~~~~~a~~~~~~~~~~~liP~~~~~----~ii 48 (324) T protein:vir:10 1 ME-------Q----TQK-LKLNLQHFASN----------------NVKPQVFNPDNVMMHEKKDGTLLNDFTT----PIL 48 (324) T ss_pred CC-------C----chH-HHHHHHHHHHH----------------hhccceecccceeccCCCcceechhHHH----HHH Confidence 11 1 111 11123331100 1111222222111111112224544444 444 Q ss_pred eecccchhhhhhcccccCCCCceeeEEEeeeccccceEecccccCCceeeeeeeeeeeeEEEEEEEEeecHHHHHHHHHh Q lcl|Aclame:pro 96 KVLTSARKIDEILGVKTVGSWEDQEIVQGIVEPAGTAMEYGDLTNIPLSSWNVNFERRTIVRGEMGIQVGLLEEGRASAM 175 (388) Q Consensus 96 e~l~~~~~~~~i~~v~t~g~w~~~t~~~~v~e~~G~a~~ygd~~diP~~~~n~~~~~~~v~~~~~~~~y~~~El~~A~~~ 175 (388) +.+......++++++...+. .+..+++.+..+.+...+....+|..+.......-..+.++..+.++.+-++.+ T Consensus 49 ~~~~~~s~l~~~~~~~~~~~---~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds--- 122 (324) T protein:vir:10 49 QEVMENSKIMQLGKYEPMEG---TEKKFTFWADKPGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYT--- 122 (324) T ss_pred HHHHhhchhhhhcceeeccC---CceEEEEEeCCcceeEeccCccccccccceeEEEEeeEEEEEeehhhHHHHhcc--- Confidence 44444444555554443322 346778887778888999999999999999999999999999999998655433 Q ss_pred CCChHHHHHHHHHHHHHHhhceEEEEeecCccccceEEEeecCCCccccccccCCcccccccCCHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 176 RINSAEVKRQGAAVQLEIMRNAIGFYGWEGKNGNRTFGFLNDPSLLPAIASTTPGGWVSGGANAFQGIVGDLRLMLITLR 255 (388) Q Consensus 176 g~~l~~~K~~aAr~a~~~~~n~i~~~G~a~~~~~g~~GllN~P~l~a~~~~~~~~~~t~Wa~kT~~eI~~DI~~~~~~l~ 255 (388) ..++.+.-.....+++.+.+++-.++|+... ....|+++...... +... ....++||.+++..+. T Consensus 123 ~~~l~~~i~~~l~~ai~~~~d~a~l~G~g~~--~~~~~i~~~~~~~~----------~~~~---~~~t~~~i~~~~~~l~ 187 (324) T protein:vir:10 123 YSQFFEEMKPMIAEAFYKKFDEAGILNQGNN--PFGKSIAQSIEKTN----------KVIK---GDFTQDNIIDLEALLE 187 (324) T ss_pred hHHHHHHHHHHHHHHHHHHHHHHhhhcCCCC--ccCccccccccccc----------eecc---ccCCHHHHHHHHHhhh Confidence 3578888888888888899999999997532 12345554322110 0011 1112677777877775 Q ss_pred HhcCCeeccccccceEEcCHHHHHhhccC-CCcCccHHHHHHHhCCccEEEEccccc-cccCCCCccEEEEEEccccccc Q lcl|Aclame:pro 256 VQSEDNIDPEDVDITLVLPMNKVDMLSVV-TDLGISVRDWLKQTYPRVRVMSAPELQ-GGNPDDGKDIAYMFLDSVDTAV 333 (388) Q Consensus 256 ~~s~g~v~~~~~p~tL~Lp~~~~~~Ls~~-~~~~~Tvl~~lk~n~pnl~i~~~pel~-~a~gtg~~~~~~~~~~~~d~~~ 333 (388) ..- ..+..++|.+..+..|.+. +..|.-++ .-.+.. ++-..|=.. .+...+.+. +++.+ ..... T Consensus 188 ~~~-------~~~~~~v~n~~~~~~L~~l~d~~g~~~~--~~~~~~--~l~G~PV~~~~~~~~~~~~--~~~gd-~~~~~ 253 (324) T protein:vir:10 188 DDE-------LEANAFISKTQNRSLLRKIVDPETKERI--YDRNSD--TLDGLPVVNLKSSNLKRGE--LITGD-FDKLI 253 (324) T ss_pred hcc-------CCCCEEEEcHHHHHHHHHhhccCCceee--cCCCCc--cccceeEEeecCCCCCcce--EEEEe-cccEE Confidence 431 2345799999999988653 33333221 111111 222222111 111111121 22211 11111 Q ss_pred ccccCCCcceEeecchhhhccC--------c-eeccCceEEecccceeeeeeeccccceeeccC Q lcl|Aclame:pro 334 DGSTDGGDTWAQLVQSKFVTLG--------V-EKRVKNYVEAYSNATAGVMLKRPWAVVRLIGL 388 (388) Q Consensus 334 ~~~~~~~~t~~~~~p~~~r~~~--------v-~~~~~~~~~~~~~~t~G~ii~rP~ai~~~~GI 388 (388) .+... .-+++. -....... + -++.-.....+..|.++. +.+|-||+++.|. T Consensus 254 ~~~~~-~~~i~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~r~d~~-v~~~~A~~~l~~a 313 (324) T protein:vir:10 254 YGIPQ-LIEYKI--DETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALH-IADDKAFAKLVPA 313 (324) T ss_pred EEEec-CcEEEE--eecccccccccccccchhhhhcCcEEEEEEEEEccE-EecccceEEEEec Confidence 11111 011110 00000000 0 011112333444555554 4569999999999 No 38 >protein:vir:41 Length: 299 # NCBI annotation: major capsid protein # Family: family:all:507 # MgeID: mge:2 # MgeName: A118 # Cross-refs: genbank:acc:NP_463467;swissprot:trembl:q9t1b7;genbank:gi:16798789;uniprot:Q9T1B7;genbank:GeneID:922353 Probab=97.86 E-value=1.7e-06 Score=52.22 Aligned_cols=281 Identities=11% Similarity=0.019 Sum_probs=148.4 Q ss_pred hccCcccccccccccch--HHHHHHHhhcceeeeecccchhhhhhcccccCCCCceeeEEEeeeccccceEecccccCCc Q lcl|Aclame:pro 65 QAFDSAYVAPTTQASIP--TPIQFLQQWLPGFVKVLTSARKIDEILGVKTVGSWEDQEIVQGIVEPAGTAMEYGDLTNIP 142 (388) Q Consensus 65 ~amDaa~~~~~t~~~~g--~l~~~l~~idp~v~e~l~~~~~~~~i~~v~t~g~w~~~t~~~~v~e~~G~a~~ygd~~diP 142 (388) +..+|+.. .+....| +|..+.+ +|++.+......+++..+...+. .+..+++.+. ..+...+...++| T Consensus 1 ~g~~a~~~--~~~~~~~~~iP~~~~~----~ii~~~~~~s~l~~~~~~~~~~~---~~~~~~~~~~-~~a~~v~E~~~~~ 70 (299) T protein:vir:41 1 MGFNPDTT--TMQSAKTGSIPINISE----QIITGVKNGSAAMKLAKAVPMTK---PEEEFTFMSG-VGAFWVDEAERIQ 70 (299) T ss_pred CCcCCCcc--cccCCCceecchhHHH----HHHHHHHhcchhhhhceeeecCC---CcEEEEEEcC-CceeeeecCcccc Confidence 33343322 1222222 4544444 55565555555555554443322 2234555443 4567788899999 Q ss_pred eeeeeeeeeeeeEEEEEEEEeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEEeecCccccceEEEeecCCCcc Q lcl|Aclame:pro 143 LSSWNVNFERRTIVRGEMGIQVGLLEEGRASAMRINSAEVKRQGAAVQLEIMRNAIGFYGWEGKNGNRTFGFLNDPSLLP 222 (388) Q Consensus 143 ~~~~n~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~K~~aAr~a~~~~~n~i~~~G~a~~~~~g~~GllN~P~l~a 222 (388) ..+...+......+.++..+.++.+=++. ...++.+.-.....+++.+.+|+-.+.|+... ...|+++...... T Consensus 71 ~~~~~f~~v~l~~~k~~~~~~is~ell~d---s~~~~~~~i~~~l~~a~~~~~d~a~l~G~g~~---~~~gil~~~~~~~ 144 (299) T protein:vir:41 71 TSKPTFTKAKMRSKKMGVIIPTTKENLNY---SVTNFFSLMQAEIVEAFYKKFDQAVFTGVESP---YNWNILKSATDAS 144 (299) T ss_pred ccccceeEEEEeeEEEEEeehhhHHHHhc---CHHHHHHHHHHHHHHHHHHHHHHHHhhcccCc---ccccccccccccc Confidence 99999999999999999999999854432 23578889999999999999999999998532 3557777643211 Q ss_pred ccccccCCcccccccCCHHHHHHHHHHHHHHHHHhcCCeeccccccceEEcCHHHHHhhccC-CCcCccHHHHHHHhC-C Q lcl|Aclame:pro 223 AIASTTPGGWVSGGANAFQGIVGDLRLMLITLRVQSEDNIDPEDVDITLVLPMNKVDMLSVV-TDLGISVRDWLKQTY-P 300 (388) Q Consensus 223 ~~~~~~~~~~t~Wa~kT~~eI~~DI~~~~~~l~~~s~g~v~~~~~p~tL~Lp~~~~~~Ls~~-~~~~~Tvl~~lk~n~-p 300 (388) . +..+ .+. -++||.+++..+...-. .+..++|.+..+..|.+. +..|.-++.=--.+- + T Consensus 145 ~---~~~~-----~~~----~~~~l~~~~~~l~~~~~-------~~~~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~~~~ 205 (299) T protein:vir:41 145 N---LVEE-----TAN----KYDDLNEAIGLIEAEDL-------EPNGIATIRKQRVKYRSTKDGNGMPIFNTATSNGVD 205 (299) T ss_pred e---eecc-----ccc----cHHHHHHHHHhhhcccC-------CcCEEEEcHHHHHHHHHhhccCCceeecCCcCCCCc Confidence 1 1111 111 26788888887764321 244799999999988652 333333321000010 1 Q ss_pred ---ccEEEEccccccccCCCCccEEEEEEcccccccccccCCCcceEeecchhhhccCce--------eccCceEEeccc Q lcl|Aclame:pro 301 ---RVRVMSAPELQGGNPDDGKDIAYMFLDSVDTAVDGSTDGGDTWAQLVQSKFVTLGVE--------KRVKNYVEAYSN 369 (388) Q Consensus 301 ---nl~i~~~pel~~a~gtg~~~~~~~~~~~~d~~~~~~~~~~~t~~~~~p~~~r~~~v~--------~~~~~~~~~~~~ 369 (388) .+.+...+.+. .|+++..++|.+ .......... .-+++. ..+........ +..-...+.+.. T Consensus 206 ~l~G~PV~~~~~~~----~~~~~~~~~~gd-fs~~~i~~~~-~~~i~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~ 278 (299) T protein:vir:41 206 DVLGLPIAYTPKYT----FGDKDISELVGD-WNQAYYGILR-GVEYEI-LTEATLTTVADETGKPLNLAERDMAAIKATF 278 (299) T ss_pred eecceeeEEecccC----CCCCceEEEEEe-cccEEEEEec-CcEEEE-eecccccccccccccchhhhhcCcEEEEEEE Confidence 12233333322 233333333322 1111111000 001110 00100000000 011123344555 Q ss_pred ceeeeeeeccccceeeccC Q lcl|Aclame:pro 370 ATAGVMLKRPWAVVRLIGL 388 (388) Q Consensus 370 ~t~G~ii~rP~ai~~~~GI 388 (388) |. |..+++|-||+.+.+- T Consensus 279 ~~-d~~v~~~~A~~~l~~~ 296 (299) T protein:vir:41 279 EV-GFMVVKDEAFSAVQPK 296 (299) T ss_pred Ee-ccEEecccceEEEEec Confidence 55 5556669999999999 No 39 >protein:vir:78223 Length: 333 # NCBI annotation: Putative major head protein # Family: family:all:966 # MgeID: mge:1849 # MgeName: Bethlehem # Cross-refs: genbank:acc:YP_001491666;genbank:gi:157786490;genbank:GeneID:5625701 Probab=97.83 E-value=1.2e-06 Score=52.96 Aligned_cols=306 Identities=11% Similarity=-0.050 Sum_probs=146.1 Q ss_pred hhhcccccccccCCHHHHhhcceecccchhhcchhhhhhhhhhhhccCccccccccc-ccchHHHHHHHhhcceeeeecc Q lcl|Aclame:pro 21 MANGKADYRLTDMAVRELKKFGLVFDHATVKRQIELLHEGGVATQAFDSAYVAPTTQ-ASIPTPIQFLQQWLPGFVKVLT 99 (388) Q Consensus 21 ~~~~~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~amDaa~~~~~t~-~~~g~l~~~l~~idp~v~e~l~ 99 (388) |+ .+++|+. .++=++..+..+. ++.-+|-.+.+ +|++.+. T Consensus 1 ~a-----------~l~el~~------------------------~~~~~~~~g~~~~~~~~liP~~~~~----~ii~~l~ 41 (333) T protein:vir:78 1 MA-----------TLNELLP------------------------NSAGSNHQGRLAHVPSDLLPKEIVG----PIFDKAQ 41 (333) T ss_pred Cc-----------hhHHhhh------------------------hcccccccCceecCCccccchhHHH----HHHHHHH Confidence 00 0222221 1111111111111 11124544444 4455554 Q ss_pred cchhhhhhcccccCCCCceeeEEEeeecccc--------ceEecccccCCceeeeeeeeeeeeEEEEEEEEeecHHHHHH Q lcl|Aclame:pro 100 SARKIDEILGVKTVGSWEDQEIVQGIVEPAG--------TAMEYGDLTNIPLSSWNVNFERRTIVRGEMGIQVGLLEEGR 171 (388) Q Consensus 100 ~~~~~~~i~~v~t~g~w~~~t~~~~v~e~~G--------~a~~ygd~~diP~~~~n~~~~~~~v~~~~~~~~y~~~El~~ 171 (388) .....+++..+...+. ....+++..... .+...++...+|..+..........+.++....++.+=++. T Consensus 42 ~~s~l~~~~~~~~~~~---~~~~~p~~~~~~~a~~v~eg~~~~~~e~~~~~~~~~~f~~i~l~~~kl~~~~~is~ell~~ 118 (333) T protein:vir:78 42 ESSLVLRMGEQIPISY---GETIIPTTVKRPEVGQVGVGTSNEQREGGLKPLSGTAWDTRSVSPIKLATIVTVSEEFARM 118 (333) T ss_pred hhchhhhhcceeeccC---CceEEEEEeCCceeEeecCcccccccccccccccccceeEEEEeeEEEEEeehhhHHHHhc Confidence 4444455544433221 223444443332 33444566778999999999999999999999999853332 Q ss_pred HHHhCCChHHHHHHHHHHHHHHhhceEEEEeecCccccceEEEeecCCCccccccccCCcccccccCCHHHHHHHHHHHH Q lcl|Aclame:pro 172 ASAMRINSAEVKRQGAAVQLEIMRNAIGFYGWEGKNGNRTFGFLNDPSLLPAIASTTPGGWVSGGANAFQGIVGDLRLML 251 (388) Q Consensus 172 A~~~g~~l~~~K~~aAr~a~~~~~n~i~~~G~a~~~~~g~~GllN~P~l~a~~~~~~~~~~t~Wa~kT~~eI~~DI~~~~ 251 (388) ...++.+.-+....+++.+.+++-.+.|+......+..|++|...+..... ......+.+..++||.+++ T Consensus 119 ---s~~~~~~~i~~~la~ai~~~~d~~~l~G~g~~~~~~~~g~~~~~~~~~~~~-------~~~~~~~~~~~~~~i~~~~ 188 (333) T protein:vir:78 119 ---NPSGLYTKLQGDLAYAIGRGIDLAVFHGKSPLTGSALQGIDTDNVIANTTN-------VDYLQETGDPLLDRLLDGY 188 (333) T ss_pred ---CHHHHHHHHHHHHHHHHHHHHHHHHhcccCCCCCccccccccccccccccc-------ccccccccchhHHHHHHHH Confidence 335788888888899999999999999986544456778887766532211 0112233334577888887 Q ss_pred HHHHHhcCCeeccccccceEEcCHHHHHhhcc----CCCcCccHHHHHHHhCCccEEEEcc-----ccccccCCCCccEE Q lcl|Aclame:pro 252 ITLRVQSEDNIDPEDVDITLVLPMNKVDMLSV----VTDLGISVRDWLKQTYPRVRVMSAP-----ELQGGNPDDGKDIA 322 (388) Q Consensus 252 ~~l~~~s~g~v~~~~~p~tL~Lp~~~~~~Ls~----~~~~~~Tvl~~lk~n~pnl~i~~~p-----el~~a~gtg~~~~~ 322 (388) ..+..... ..+..++|.|..+..|.+ .+..|.-++.........-+|...| .+..-.+++..... T Consensus 189 ~~~~~~~~------~~~~~~vmn~~~~~~L~~~~~~~d~~G~~i~~~~~~~~~~~~l~G~Pv~~~~~i~~~~~~~~~~~~ 262 (333) T protein:vir:78 189 DLVSANTD------VEFNGWAVDPRFRAHLLRAQAYRDANGNVDPSRINLAAQTGDVLGLPAQFGRAVGGDLGAAVDSKT 262 (333) T ss_pred Hhhccccc------cCceEEEEcchHHHHHHHHhhhcCCCCceeecCccccCCCceeeceeeEEccccCCCccccCCCcc Confidence 77654432 224468888887776632 2333444443322221112233222 22211111222222 Q ss_pred EEEEcccccccccccCCCcceEeecchhhh--c-cCc---eeccCceEEecccceeeeeeeccccceeeccC Q lcl|Aclame:pro 323 YMFLDSVDTAVDGSTDGGDTWAQLVQSKFV--T-LGV---EKRVKNYVEAYSNATAGVMLKRPWAVVRLIGL 388 (388) Q Consensus 323 ~~~~~~~d~~~~~~~~~~~t~~~~~p~~~r--~-~~v---~~~~~~~~~~~~~~t~G~ii~rP~ai~~~~GI 388 (388) .+|.-+.......... .++..+..... . .+. -++.--...-+..|. |+.|++|-||+++.+- T Consensus 263 ~~~~gD~~~~~~g~~~---~~~i~~~~~~~~~~~~~~~~~~~~~~~v~~r~~~r~-d~~v~~~~a~~~l~~~ 330 (333) T protein:vir:78 263 RIIGGDFSQLKFGFAD---EIRIKMSDTATLTDSGSATVSMWQTNQIAILIEVTF-GWLLGDKQAFVKFVDD 330 (333) T ss_pred EEEEEecccEEEEEee---ccEEEEeccccccccccceeehhhcCcEEEEEEEEE-ccEEecccceEEEecc Confidence 2222222111111100 01110100000 0 000 000001112233344 5556889999999998 No 40 >protein:vir:97148 Length: 324 # NCBI annotation: ORF010 # Family: family:all:507 # MgeID: mge:1654 # MgeName: 85 # Cross-refs: genbank:acc:YP_239726;genbank:gi:66394880;genbank:GeneID:5130881 Probab=97.83 E-value=5.6e-06 Score=49.39 Aligned_cols=304 Identities=9% Similarity=-0.054 Sum_probs=147.5 Q ss_pred cchhhhhhcccccccccCCHHHHhhcceecccchhhcchhhhhhhhhhhhccCcccccccccccchHHHHHHHhhcceee Q lcl|Aclame:pro 16 VRAFDMANGKADYRLTDMAVRELKKFGLVFDHATVKRQIELLHEGGVATQAFDSAYVAPTTQASIPTPIQFLQQWLPGFV 95 (388) Q Consensus 16 ~~~~~~~~~~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~amDaa~~~~~t~~~~g~l~~~l~~idp~v~ 95 (388) ++ + ++. .-.+++++ ...+.... ..+|......+.++.-+|..+.+ +|+ T Consensus 1 ~~-------~----~~~-~~~~~~~f-----------~~~~~~~~-----~~~a~~~~~~~~~~~~iP~~~~~----~ii 48 (324) T protein:vir:97 1 ME-------Q----TQK-LKLNLQHF-----------ASNNVKPQ-----VFNPDNVMMHEKKDGTLMNEFTT----PIL 48 (324) T ss_pred Cc-------c----chh-HHHHHHHH-----------HHhhhhhh-----hhccccccccCCCcceechhHHH----HHH Confidence 11 1 110 01112221 11111111 11211111111222224554444 445 Q ss_pred eecccchhhhhhcccccCCCCceeeEEEeeeccccceEecccccCCceeeeeeeeeeeeEEEEEEEEeecHHHHHHHHHh Q lcl|Aclame:pro 96 KVLTSARKIDEILGVKTVGSWEDQEIVQGIVEPAGTAMEYGDLTNIPLSSWNVNFERRTIVRGEMGIQVGLLEEGRASAM 175 (388) Q Consensus 96 e~l~~~~~~~~i~~v~t~g~w~~~t~~~~v~e~~G~a~~ygd~~diP~~~~n~~~~~~~v~~~~~~~~y~~~El~~A~~~ 175 (388) +.+......++++.+...+ ..+..+++....+.+...+....+|..+...+......+.++..+.++.+-++.+ T Consensus 49 ~~~~~~s~l~~~~~~~~~~---~~~~~ip~~~~~~~a~~v~Eg~~~~~~~~~f~~v~~~~~k~~~~~~is~ell~ds--- 122 (324) T protein:vir:97 49 QEVMENSKIMQLGKYEPME---GTEKKFTFWADKPGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYT--- 122 (324) T ss_pred HHHHhhcchhhhcceeecc---CCceEEEEEecCcceeEeccCccccccccceeEEEEeeEEEEEeehhhHHHHhcc--- Confidence 5554444455554443322 2346778887788888999999999999999999999999999999998544433 Q ss_pred CCChHHHHHHHHHHHHHHhhceEEEEeecCccccceEEEeecCCCccccccccCCcccccccCCHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 176 RINSAEVKRQGAAVQLEIMRNAIGFYGWEGKNGNRTFGFLNDPSLLPAIASTTPGGWVSGGANAFQGIVGDLRLMLITLR 255 (388) Q Consensus 176 g~~l~~~K~~aAr~a~~~~~n~i~~~G~a~~~~~g~~GllN~P~l~a~~~~~~~~~~t~Wa~kT~~eI~~DI~~~~~~l~ 255 (388) ..++.+.-.....+++.+.+++..+.|+... ....|+++........+ . .+- .++||.+++..+. T Consensus 123 ~~~l~~~i~~~l~~aia~~~d~a~l~G~g~~--~~~~gi~~~~~~~~~~~----~-----~~~----~~~~i~~~~~~l~ 187 (324) T protein:vir:97 123 YSQFFEEMKPMIAEAFYKKFDEAGILNQGNN--PFGKSIAQSIEKTNKVI----K-----GDF----TQDNIIDLEALLE 187 (324) T ss_pred hHHHHHHHHHHHHHHHHHHHHHHhhccCCCC--ccCccccccccccceec----c-----ccC----CHHHHHHHHHhhh Confidence 4678888888999999999999999997643 23456666533211110 0 111 2567777777775 Q ss_pred HhcCCeeccccccceEEcCHHHHHhhccC-CCcCccHHHHHHHhCCccEEEEcccc-ccccCCCCccEEEEEEccccccc Q lcl|Aclame:pro 256 VQSEDNIDPEDVDITLVLPMNKVDMLSVV-TDLGISVRDWLKQTYPRVRVMSAPEL-QGGNPDDGKDIAYMFLDSVDTAV 333 (388) Q Consensus 256 ~~s~g~v~~~~~p~tL~Lp~~~~~~Ls~~-~~~~~Tvl~~lk~n~pnl~i~~~pel-~~a~gtg~~~~~~~~~~~~d~~~ 333 (388) ..- ..+.+++|.+..+..|.+. +..|..++. -.... ++-..|=. ..+...+.+. +++.+ ..... T Consensus 188 ~~~-------~~~~~~v~n~~~~~~L~~lkd~~g~~~~~--~~~~~--tl~G~PV~~~~~~~~~~~~--~~~gd-~~~~~ 253 (324) T protein:vir:97 188 DDE-------LEANAFISKTQNRSLLRKIVDPETKERIY--DRNSD--TLDGLPVVNLKSSNLKRGE--LITGD-FDKLI 253 (324) T ss_pred hcc-------CCCCEEEEcHHHHHHHHHhhcCCCceeec--CCCCc--cccceeeEeecCCCCCcce--EEEEe-cccEE Confidence 432 2345799999999988643 333433321 11111 12222211 1111111111 11111 10010 Q ss_pred ccccCCCcce---EeecchhhhccCce----eccCceEEecccceeeeeeeccccceeeccC Q lcl|Aclame:pro 334 DGSTDGGDTW---AQLVQSKFVTLGVE----KRVKNYVEAYSNATAGVMLKRPWAVVRLIGL 388 (388) Q Consensus 334 ~~~~~~~~t~---~~~~p~~~r~~~v~----~~~~~~~~~~~~~t~G~ii~rP~ai~~~~GI 388 (388) .+... .-++ .+......+..... ...-.....+..|. |+.+.+|.||+.+.+. T Consensus 254 i~~~~-~~~i~~~~~~~~~~~~~~~~~~~~~f~~d~~~~r~~~r~-d~~v~~~~a~~~l~~~ 313 (324) T protein:vir:97 254 YGIPQ-LIEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHV-ALHIADDKAFAKLVPA 313 (324) T ss_pred EEEec-CcEEEEeecccccccccccccchhhhhcCcEEEEEEEEe-ccEEecccceEEEEec Confidence 11000 0000 00000000000000 00001112223344 5555679999999999 No 41 >protein:vir:99749 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1497 # MgeName: phiETA2 # Cross-refs: genbank:acc:YP_001004307;genbank:gi:122891761;genbank:GeneID:4712304 Probab=97.83 E-value=5.4e-06 Score=49.48 Aligned_cols=302 Identities=10% Similarity=-0.056 Sum_probs=147.7 Q ss_pred cchhhhhhcccccccccCCHHHHhhcceecccchhhcchhhhhhhhhhhhccCcccccccccccchHHHHHHHhhcceee Q lcl|Aclame:pro 16 VRAFDMANGKADYRLTDMAVRELKKFGLVFDHATVKRQIELLHEGGVATQAFDSAYVAPTTQASIPTPIQFLQQWLPGFV 95 (388) Q Consensus 16 ~~~~~~~~~~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~amDaa~~~~~t~~~~g~l~~~l~~idp~v~ 95 (388) ++ + ++.++ .+++++- ........++|.........+.-+|..+.. +|+ T Consensus 1 ~~-------k----~~~~~-~~~~~~~----------------~~~~~~~~~~a~~~~~~~~~~~lip~~~~~----~ii 48 (324) T protein:vir:99 1 ME-------Q----TQKLK-LNLQHFA----------------SNNVKPQVFNPDNVMMHEKKDGTLLNDFTT----PIL 48 (324) T ss_pred CC-------C----chHhh-HHHHHHH----------------HHhhhhhhccccceeccCCCcceechhHHH----HHH Confidence 22 1 11111 1123211 011112222322111111112224544433 455 Q ss_pred eecccchhhhhhcccccCCCCceeeEEEeeeccccceEecccccCCceeeeeeeeeeeeEEEEEEEEeecHHHHHHHHHh Q lcl|Aclame:pro 96 KVLTSARKIDEILGVKTVGSWEDQEIVQGIVEPAGTAMEYGDLTNIPLSSWNVNFERRTIVRGEMGIQVGLLEEGRASAM 175 (388) Q Consensus 96 e~l~~~~~~~~i~~v~t~g~w~~~t~~~~v~e~~G~a~~ygd~~diP~~~~n~~~~~~~v~~~~~~~~y~~~El~~A~~~ 175 (388) +.+......++++++...+. .+..+++.+..+.+...+....+|..+...+...-..+.++..+.++.+-++.+ T Consensus 49 ~~~~~~s~l~~~~~~~~~~~---~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds--- 122 (324) T protein:vir:99 49 QEVMENSKIMRLGKYEPMEG---TEKKFTFWADKPGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYT--- 122 (324) T ss_pred HHHHhhchhhhhcceeeccC---CceEEEEEecCcceeEeccCccccccccceeEEEEeeEEEEEeehhhHHHHhcc--- Confidence 55444444555554433222 345677877777888889999999999999999999999999999998655443 Q ss_pred CCChHHHHHHHHHHHHHHhhceEEEEeecCccccceEEEeecCCCccccccccCCcccccccCCHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 176 RINSAEVKRQGAAVQLEIMRNAIGFYGWEGKNGNRTFGFLNDPSLLPAIASTTPGGWVSGGANAFQGIVGDLRLMLITLR 255 (388) Q Consensus 176 g~~l~~~K~~aAr~a~~~~~n~i~~~G~a~~~~~g~~GllN~P~l~a~~~~~~~~~~t~Wa~kT~~eI~~DI~~~~~~l~ 255 (388) ..++.+.-.....+++.+.+++-.+.|+... ....|+++...... +... ...-++||.+++..+. T Consensus 123 ~~~l~~~i~~~l~~ai~~~~d~~~l~G~g~~--~~~~~~~~~~~~~~----------~~~~---~~~~~~~i~~~~~~l~ 187 (324) T protein:vir:99 123 YSQFFEEMKPMIAEAFYKKFDEAGILNQGNN--PFGKSIAQSIEKTN----------KVIK---GDFTQDNIIDLEALLE 187 (324) T ss_pred hHHHHHHHHHHHHHHHHHHHHHHhhhcCCCC--ccCccccccccccc----------eecc---ccCCHHHHHHHHHhhh Confidence 3578888888888888889999999997532 12345554322110 0011 1112677788877775 Q ss_pred HhcCCeeccccccceEEcCHHHHHhhccC-CCcCccHHHHHHHhCCccEEEEcccc-ccccCCCCccEEEEEEccccccc Q lcl|Aclame:pro 256 VQSEDNIDPEDVDITLVLPMNKVDMLSVV-TDLGISVRDWLKQTYPRVRVMSAPEL-QGGNPDDGKDIAYMFLDSVDTAV 333 (388) Q Consensus 256 ~~s~g~v~~~~~p~tL~Lp~~~~~~Ls~~-~~~~~Tvl~~lk~n~pnl~i~~~pel-~~a~gtg~~~~~~~~~~~~d~~~ 333 (388) ..- ..+..++|.+..+..|.+. +..|..++ .-.... ++-..|=. +.+... ++..+++.+ ..... T Consensus 188 ~~~-------~~~~~~v~n~~~~~~L~~l~d~~g~~~~--~~~~~~--~l~G~PVv~~~~~~~--~~~~~i~gd-~~~~~ 253 (324) T protein:vir:99 188 DDE-------LEANAFISKTQNRSLLRKIVDPETKERI--YDRNSD--TLDGLPVVNLKSSNL--KRGELITGD-FDKLI 253 (324) T ss_pred hcc-------CCCCEEEEcHHHHHHHHHhhcCCCceee--cCCCCc--cccceeEEeecCCCC--CcceEEEEe-cccEE Confidence 431 2345799999999988643 33333222 111111 12222211 111111 111122211 11111 Q ss_pred ccccCCCcceEeecchhhhccCce---------eccCceEEecccceeeeeeeccccceeeccC Q lcl|Aclame:pro 334 DGSTDGGDTWAQLVQSKFVTLGVE---------KRVKNYVEAYSNATAGVMLKRPWAVVRLIGL 388 (388) Q Consensus 334 ~~~~~~~~t~~~~~p~~~r~~~v~---------~~~~~~~~~~~~~t~G~ii~rP~ai~~~~GI 388 (388) .+... .-+++. .......... +..-.....+..|.+ +.+.+|.||+.+.|. T Consensus 254 ~~~~~-~~~i~~--~~~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d-~~v~~~~a~~~lt~a 313 (324) T protein:vir:99 254 YGIPQ-LIEYKI--DETAQLSTVKNEDGTPVNLFEQDMVALRATMHVA-LHIADDKAFAKLVPA 313 (324) T ss_pred EEEec-CcEEEE--eecccccccccccccchhhhhcCcEEEEEEEEEc-cEEecccceEEEEec Confidence 11111 011110 0000000000 011112333444554 455569999999999 No 42 >protein:vir:96223 Length: 324 # NCBI annotation: ORF011 # Family: family:all:507 # MgeID: mge:1607 # MgeName: 69 # Cross-refs: genbank:acc:YP_239571;genbank:gi:66395304;genbank:GeneID:5132771 Probab=97.80 E-value=3.3e-06 Score=50.63 Aligned_cols=304 Identities=9% Similarity=-0.083 Sum_probs=144.1 Q ss_pred hcccccccccCCHHHHhhcceecccchhhcchhhhhhhhhhhhccCcccccccccccchHHHHHHHhhcceeeeecccch Q lcl|Aclame:pro 23 NGKADYRLTDMAVRELKKFGLVFDHATVKRQIELLHEGGVATQAFDSAYVAPTTQASIPTPIQFLQQWLPGFVKVLTSAR 102 (388) Q Consensus 23 ~~~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~amDaa~~~~~t~~~~g~l~~~l~~idp~v~e~l~~~~ 102 (388) +.+ ++. .-.++++ |- .........+|.........+.-+|-...+ +|++.+.... T Consensus 1 ~~~----~~~-~~~~~~~----f~------------~~~~~~~~~~a~~~~~~~~~~~lip~~~~~----~ii~~~~~~s 55 (324) T protein:vir:96 1 MEQ----TQK-LKLNLQH----FA------------SNNVKPQVFNPDNVMMHEKKDGTLLNDFTT----PILQEVMENS 55 (324) T ss_pred CCc----chh-hhHHHHH----HH------------HhhhhhhhcccccccccCCCcceechhHHH----HHHHHHHhhc Confidence 111 111 1112222 11 011112222322111111112224444433 4444444444 Q ss_pred hhhhhcccccCCCCceeeEEEeeeccccceEecccccCCceeeeeeeeeeeeEEEEEEEEeecHHHHHHHHHhCCChHHH Q lcl|Aclame:pro 103 KIDEILGVKTVGSWEDQEIVQGIVEPAGTAMEYGDLTNIPLSSWNVNFERRTIVRGEMGIQVGLLEEGRASAMRINSAEV 182 (388) Q Consensus 103 ~~~~i~~v~t~g~w~~~t~~~~v~e~~G~a~~ygd~~diP~~~~n~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~ 182 (388) ..++++++...+. .++.|++.+..+.+...+....+|..+..........+.++..+.++.+-++.+ ..++.+. T Consensus 56 ~l~~l~~~~~~~~---~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~f~~v~~~~~k~~~~~~is~ell~ds---~~~l~~~ 129 (324) T protein:vir:96 56 KIMQLGKYEPMEG---TEKKFTFWADKPGAYWVGEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYT---YSQFFEE 129 (324) T ss_pred hhhhhcceeeccC---CceEEEEEecCcceeeecCCccccccccceeEEEEEeEEEEEeehhhHHHHhcc---hHHHHHH Confidence 4555555443322 346788887778888899999999999999999999999999999988544433 3578888 Q ss_pred HHHHHHHHHHHhhceEEEEeecCccccceEEEeecCCCccccccccCCcccccccCCHHHHHHHHHHHHHHHHHhcCCee Q lcl|Aclame:pro 183 KRQGAAVQLEIMRNAIGFYGWEGKNGNRTFGFLNDPSLLPAIASTTPGGWVSGGANAFQGIVGDLRLMLITLRVQSEDNI 262 (388) Q Consensus 183 K~~aAr~a~~~~~n~i~~~G~a~~~~~g~~GllN~P~l~a~~~~~~~~~~t~Wa~kT~~eI~~DI~~~~~~l~~~s~g~v 262 (388) -......++.+.+++..++|+... ....|+++.... ...+..++ ..++||.+++..+...- T Consensus 130 i~~~l~~aia~~~d~~~l~G~g~~--~~~~~~~~~~~~-----------~~~~~~~~--~~~~~i~~~~~~i~~~~---- 190 (324) T protein:vir:96 130 MKPMIAEAFYKKFDEAGILNQGNN--PFGKSIAQSIKK-----------TNKVIKGD--FTQDNIIDLEALLEDDE---- 190 (324) T ss_pred HHHHHHHHHHHHHHHHhhhcCCCC--CcCccccccccc-----------cceecccc--cchHHHHHHHHhhhhcc---- Confidence 888888999999999999997532 223344432111 01111111 12566777777665431 Q ss_pred ccccccceEEcCHHHHHhhccC-CCcCccHHHHHHHhCCccEEEEcccc-ccccCCCCccEEEEEEcccccccccccCCC Q lcl|Aclame:pro 263 DPEDVDITLVLPMNKVDMLSVV-TDLGISVRDWLKQTYPRVRVMSAPEL-QGGNPDDGKDIAYMFLDSVDTAVDGSTDGG 340 (388) Q Consensus 263 ~~~~~p~tL~Lp~~~~~~Ls~~-~~~~~Tvl~~lk~n~pnl~i~~~pel-~~a~gtg~~~~~~~~~~~~d~~~~~~~~~~ 340 (388) ..+..++|.+..+..|.+. +..|..++ ..-+.-++-..|=. ..+...+.+. + ++. +......+...+ T Consensus 191 ---~~~~~~i~n~~~~~~L~~lkd~~G~~~~----~~~~~~~l~G~PV~~~~~~~~~~~~-~-~~g-d~s~~~~~~~~~- 259 (324) T protein:vir:96 191 ---LEANAFISKTQNRSLLRKIVDPETKERI----YDRNSDSLDGLPVVNLKSSNLKRGE-L-ITG-DFDKLIYGIPQL- 259 (324) T ss_pred ---CCCCEEEEcHHHHHHHHHhhCCCCCeee----cCCCCCcccceeeEeecCCCCCcce-E-EEE-ecceEEEEEecC- Confidence 2345799999999988643 33343332 11111122222211 1111111111 1 111 110000100000 Q ss_pred cceE---eecchhhhccCce----eccCceEEecccceeeeeeeccccceeeccC Q lcl|Aclame:pro 341 DTWA---QLVQSKFVTLGVE----KRVKNYVEAYSNATAGVMLKRPWAVVRLIGL 388 (388) Q Consensus 341 ~t~~---~~~p~~~r~~~v~----~~~~~~~~~~~~~t~G~ii~rP~ai~~~~GI 388 (388) -+++ +..-......... +..-.....+..|. |+.+.+|-||+++.+- T Consensus 260 ~~i~~~~~~~~~~~~~~~~~~~~~~~~n~v~~r~~~r~-d~~v~~~~a~~~l~~a 313 (324) T protein:vir:96 260 IEYKIDETAQLSTVKNEDGTPVNLFEQDMVALRATMHV-ALHIADDKAFAKLVPA 313 (324) T ss_pred cEEEEeecccccccccccccchhhhhcCcEEEEEEEEe-ccEEecccceEEEecc Confidence 0000 0000000000000 00001122333344 5567779999999988 No 43 >protein:vir:105038 Length: 428 # NCBI annotation: major capsid head protein precursor # Family: family:all:21 # MgeID: mge:1465 # MgeName: phiKO2 # Cross-refs: genbank:acc:YP_006586;genbank:gi:46402092;genbank:GeneID:2777903 Probab=97.32 E-value=6.9e-05 Score=43.39 Aligned_cols=352 Identities=12% Similarity=0.100 Sum_probs=144.6 Q ss_pred CCCcceeeeecCc------c------ccchhhhhhcccccccccCCH---------HHH-hhcceecccchhhcchhhhh Q lcl|Aclame:pro 1 MKQLSKVHQSLAG------R------SVRAFDMANGKADYRLTDMAV---------REL-KKFGLVFDHATVKRQIELLH 58 (388) Q Consensus 1 ~~~~~~~~~~~~~------~------~~~~~~~~~~~~~~~~~~~~~---------~~l-~~~g~~~~~~~~~~~~~~~~ 58 (388) ++++......|.- + .+++... ......+...... ..+ +..| .+..+. +...+... T Consensus 43 ~~e~~~l~~~i~~~e~~e~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~-~~~~~~~~ 119 (428) T protein:vir:10 43 QQQFTDISAKMDRMEATERAAALVAKPVKATQH-GPAVIVKAEPKQYTGAGMTRMVMSIAAAQG-NLQDAA-KFASDELN 119 (428) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhhhhchhh-ccccccccccchhhhHHHHHHHHHHHHhhh-hHHHHH-HHhhhhhh Confidence 1111111111100 0 0000000 0000000000000 000 0000 000000 00000000 Q ss_pred hhhhhhhccCcccccccccccch--HHHHHHHhhcceeeeecccchhhhhhcccccCCCCceeeEEEeeeccccceEecc Q lcl|Aclame:pro 59 EGGVATQAFDSAYVAPTTQASIP--TPIQFLQQWLPGFVKVLTSARKIDEILGVKTVGSWEDQEIVQGIVEPAGTAMEYG 136 (388) Q Consensus 59 ~~~~~~~amDaa~~~~~t~~~~g--~l~~~l~~idp~v~e~l~~~~~~~~i~~v~t~g~w~~~t~~~~v~e~~G~a~~yg 136 (388) . ....+++.. +..+.| +|-... ++|++.+......+.+ +..+ .+.....+.+++....+.+...+ T Consensus 120 ~-~~~~~~~~~------~~~~gg~liP~~~~----~~ii~~l~~~~~l~~~-~~~~-~~~~~g~~~~p~~~~~~~a~~v~ 186 (428) T protein:vir:10 120 D-QSVSMAIST------AAGSGGVLIPQNIH----SEVIELLRDRTIVRKL-GARS-IPLPNGNMSLPRLAGGATASYTG 186 (428) T ss_pred h-hhHhhhhcc------cccCCccccchhHH----HHHHHHHhhhchhhhh-ccee-eecCCcceEEEEEeCCcceeeec Confidence 0 000011111 112233 343322 3555555443333444 1111 11111235677766666777888 Q ss_pred cccCCceeeeeeeeeeeeEEEEEEEEeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEEeecCccccceEEEee Q lcl|Aclame:pro 137 DLTNIPLSSWNVNFERRTIVRGEMGIQVGLLEEGRASAMRINSAEVKRQGAAVQLEIMRNAIGFYGWEGKNGNRTFGFLN 216 (388) Q Consensus 137 d~~diP~~~~n~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~K~~aAr~a~~~~~n~i~~~G~a~~~~~g~~GllN 216 (388) ....+|..+...+...-..+.+...+.++.+=+..+ ..++.+--......++...+|+..++|+.. .....|++| T Consensus 187 Eg~~~~~~~~~f~~i~~~~~k~~~~v~is~ell~ds---~~~l~~~i~~~l~~ai~~~~d~~~l~G~G~--~~~p~Gi~~ 261 (428) T protein:vir:10 187 ENQDAKVSEARFDDVKLTAKTMIAMVPISNALIGRA---GFNVEQLVLQDILTAISVREDKAFMRDDGT--GDTPIGMKA 261 (428) T ss_pred cCccccccccceeeEEeeeEEEEEeehhhHHHHhhh---hHHHHHHHHHHHHHHHHHHHHHHHhccCCC--Ccccccccc Confidence 889999999988888999999999999998655433 356788888888889999999999999753 234679999 Q ss_pred cCCCccccccccCCcccccccCCHHHHHHHHHHHHHHHHHhcCCeeccccccceEEcCHHHHHhhccC-CCcCccHHHHH Q lcl|Aclame:pro 217 DPSLLPAIASTTPGGWVSGGANAFQGIVGDLRLMLITLRVQSEDNIDPEDVDITLVLPMNKVDMLSVV-TDLGISVRDWL 295 (388) Q Consensus 217 ~P~l~a~~~~~~~~~~t~Wa~kT~~eI~~DI~~~~~~l~~~s~g~v~~~~~p~tL~Lp~~~~~~Ls~~-~~~~~Tvl~~l 295 (388) .......+.+++.. +..+.+.+- .++..+....... .........+|.+..+..|... +..|.-++.-. T Consensus 262 ~~~~~~~~~~~~~~-----~~~~~~~~~----~~~~~~~~~~~~~-~~~~~~~~~v~n~~~~~~L~~lkd~~G~~i~~~~ 331 (428) T protein:vir:10 262 RATQWNRLLPWAAD-----AAVNLDTID----TYLDSIILMSMDG-NSNMISSGWGMSNRTYMKLFGLRDGNGNKVYPEM 331 (428) T ss_pred cccccccccccccc-----ccccHHHHH----HHHHHHHHhhhcc-ccccccCEEEEcHHHHHHHHHhhccCCceeccCC Confidence 76543322222211 222333222 2222222211100 0111234678899988888643 44455443211 Q ss_pred HH-hCCccEEEEccccccccCCCCccEEEEEEcccccccccccCCCcceEeec-chh-hh-ccCc---eeccCceEEecc Q lcl|Aclame:pro 296 KQ-TYPRVRVMSAPELQGGNPDDGKDIAYMFLDSVDTAVDGSTDGGDTWAQLV-QSK-FV-TLGV---EKRVKNYVEAYS 368 (388) Q Consensus 296 k~-n~pnl~i~~~pel~~a~gtg~~~~~~~~~~~~d~~~~~~~~~~~t~~~~~-p~~-~r-~~~v---~~~~~~~~~~~~ 368 (388) .. .+..+.++....+-.-.++++....++|. +.......... .....+ ++. +. ..+. -...-...+.+. T Consensus 332 ~~g~l~G~pv~~~~~~p~~~~~~~~~~~i~~g-d~s~~~i~~~~---~i~i~~~~~~~~~~~~~~~~~~f~~~~~~~R~~ 407 (428) T protein:vir:10 332 AQGMLKGYPIQRTSAIPANLGEGGKESEIYFA-DFNDVVIGEDG---NMKVDFSKEASYIDTDGKLVSAFSRNQSLIRVV 407 (428) T ss_pred CCCeeeceeeEEeccccccccCCCccceEEEE-ecceEEEEEec---ceEEEeecccccccccccccchhhcchhheeee Confidence 00 01122222221111101223333333332 22111111110 001000 000 00 0000 000000111222 Q ss_pred cceeeeeeeccccceeeccC Q lcl|Aclame:pro 369 NATAGVMLKRPWAVVRLIGL 388 (388) Q Consensus 369 ~~t~G~ii~rP~ai~~~~GI 388 (388) . -.|+.|++|-||+.++|| T Consensus 408 ~-r~d~~v~~p~a~~~~t~~ 426 (428) T protein:vir:10 408 T-EHDIGFRHPEGLVLGTGV 426 (428) T ss_pred e-eeCceeeccceEEEEecc Confidence 3 346677889999999999 No 44 >protein:vir:4226 Length: 326 # NCBI annotation: observed 35.2Kd protein # Family: family:all:507 # MgeID: mge:89 # MgeName: L5 # Cross-refs: genbank:acc:NP_039681;swissprot:sw:q05223;genbank:gi:9625447;uniprot:Q05223;genbank:GeneID:2942929 Probab=97.31 E-value=2.4e-05 Score=45.88 Aligned_cols=306 Identities=10% Similarity=-0.023 Sum_probs=145.0 Q ss_pred hhhcccccccccCCHHHHhhcceecccchhhcchhhhhhhhhhhhccCcccccccccccch-HHHHHHHhhcceeeeecc Q lcl|Aclame:pro 21 MANGKADYRLTDMAVRELKKFGLVFDHATVKRQIELLHEGGVATQAFDSAYVAPTTQASIP-TPIQFLQQWLPGFVKVLT 99 (388) Q Consensus 21 ~~~~~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~amDaa~~~~~t~~~~g-~l~~~l~~idp~v~e~l~ 99 (388) |.- +. .+.+++.+. + ...++- . .+..+.+ +|.... .+|++.+. T Consensus 1 ~~~------~~---~r~~~~~~~-----------~-------e~~a~~---~--~~~~~g~~ip~~~~----~~ii~~~~ 44 (326) T protein:vir:42 1 MAV------NP---DRTTPFLGV-----------N-------DPKVAQ---T--GDSMFEGYLEPEQA----QDYFAEAE 44 (326) T ss_pred CCC------Cc---cchhhhcCc-----------c-------hhhhee---c--cccCCcceechhhH----HHHHHHHH Confidence 000 00 011111110 0 001110 0 1112222 343333 35666665 Q ss_pred cchhhhhhcccccCCCCceeeEEEeeeccccceEecccccCCceeeeeeeeeeeeEEEEEEEEeecHHHHHHHHHhCCCh Q lcl|Aclame:pro 100 SARKIDEILGVKTVGSWEDQEIVQGIVEPAGTAMEYGDLTNIPLSSWNVNFERRTIVRGEMGIQVGLLEEGRASAMRINS 179 (388) Q Consensus 100 ~~~~~~~i~~v~t~g~w~~~t~~~~v~e~~G~a~~ygd~~diP~~~~n~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l 179 (388) ...-.+++..+...+ ..+..|++.+..+.+...+....+|..+...+...-..+.++..+.++.+-++. ...++ T Consensus 45 ~~s~i~~~~~~~~~~---~~~~~~p~~~~~~~a~~v~Eg~~~~~~~~~f~~i~~~~~k~~~~v~iS~ell~~---s~~~~ 118 (326) T protein:vir:42 45 KISIVQQFAQKIPMG---TTGQKIPHWTGDVSASWIGEGDMKPITKGNMTSQTIAPHKIATIFVASAETVRA---NPANY 118 (326) T ss_pred hcchhhhhcceeecc---CCceEEEEEeCCcceEEecCCccccccccceeEEEEeeEEEEEeehhhHHHHhc---CHHHH Confidence 555556654444322 234567777777788888999999999999999999999999999999854433 24678 Q ss_pred HHHHHHHHHHHHHHhhceEEEEeecCccccceEEEeecCCCccccccccCCcccccccCCHHHHHHHHHHHHHHHHHhcC Q lcl|Aclame:pro 180 AEVKRQGAAVQLEIMRNAIGFYGWEGKNGNRTFGFLNDPSLLPAIASTTPGGWVSGGANAFQGIVGDLRLMLITLRVQSE 259 (388) Q Consensus 180 ~~~K~~aAr~a~~~~~n~i~~~G~a~~~~~g~~GllN~P~l~a~~~~~~~~~~t~Wa~kT~~eI~~DI~~~~~~l~~~s~ 259 (388) .+.-....++++...+++-.+.|+.. .+-.|++|.+.....+.. .+...++..+..++ ++..++..+...- T Consensus 119 ~~~i~~~l~~a~~~~~d~a~l~G~gs---~~p~gi~~~~~~~~~~~~---~~~~~~~~~~~~~~--~~~~~~~~~~~~~- 189 (326) T protein:vir:42 119 LGTMRTKVATAFAMAFDNAAINGTDS---PFPTFLAQTTKEVSLVDP---DGTGSNADLTVYDA--VAVNALSLLVNAG- 189 (326) T ss_pred HHHHHHHHHHHHHHHHHHHhhcccCC---Cccccccccccccceeec---ccccccccchhHHH--HHHHHHhhhhhhc- Confidence 88888888999999999999999753 234577776543221111 11111222222222 1222222222111 Q ss_pred CeeccccccceEEcCHHHHHhhccC-CCcCccHHHHHHHh-----CCccEEEEccccccccCCCCccEEEEEEccccccc Q lcl|Aclame:pro 260 DNIDPEDVDITLVLPMNKVDMLSVV-TDLGISVRDWLKQT-----YPRVRVMSAPELQGGNPDDGKDIAYMFLDSVDTAV 333 (388) Q Consensus 260 g~v~~~~~p~tL~Lp~~~~~~Ls~~-~~~~~Tvl~~lk~n-----~pnl~i~~~pel~~a~gtg~~~~~~~~~~~~d~~~ 333 (388) .....++|.+..+..|.+. +..|.-++.=-..+ ++.-++...|-..... ...++...++ -+..... T Consensus 190 ------~~~a~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~~~~~~l~G~pv~~~~~-~~~~~~~~~~-Gd~s~~~ 261 (326) T protein:vir:42 190 ------KKWTHTLLDDITEPILNGAKDKSGRPLFIESTYTEENSPFRLGRIVARPTILSDH-VASGTVVGYQ-GDFRQLV 261 (326) T ss_pred ------cCccEEEEeHHHHHHHHHhhccCCceeeccccccCccccccCceeeeeeEEEcCC-CCCCceEEEE-eecceEE Confidence 1123688999999988642 33343232111111 1112333333221110 0112222111 1110000 Q ss_pred ccccCCCcceEeecchhhhccCc-e-------eccCceEEecccceeeeeeeccccceeeccC Q lcl|Aclame:pro 334 DGSTDGGDTWAQLVQSKFVTLGV-E-------KRVKNYVEAYSNATAGVMLKRPWAVVRLIGL 388 (388) Q Consensus 334 ~~~~~~~~t~~~~~p~~~r~~~v-~-------~~~~~~~~~~~~~t~G~ii~rP~ai~~~~GI 388 (388) ..... .-.++. ..+.....+. + +..-...+.+..+. |+.+.+|-||+++.++ T Consensus 262 ~~~~~-~~~v~~-~~e~~~~~~~~~~~~~~~~~~~d~~~~r~~~~~-d~~v~~~~a~~~l~~~ 321 (326) T protein:vir:42 262 WGQVG-GLSFDV-TDQATLNLGTPQAPNFVSLWQHNLVAVRVEAEY-AFHCNDKDAFVKLTNV 321 (326) T ss_pred EEEec-ceEEEE-eecceeeecccccccchhhhhcCcEEEEEEEEe-ccEEecccceEEEeec Confidence 00000 000000 0000000000 0 01112334455555 5566789999999999 No 45 >protein:vir:2430 Length: 318 # NCBI annotation: major head subunit # Family: family:all:507 # MgeID: mge:52 # MgeName: D29 # Cross-refs: genbank:acc:NP_046832;genbank:gi:9630400;genbank:GeneID:1261582 Probab=97.27 E-value=2.5e-05 Score=45.80 Aligned_cols=295 Identities=10% Similarity=-0.039 Sum_probs=143.2 Q ss_pred HhhcceecccchhhcchhhhhhhhhhhhccCcccccccccccch-HHHHHHHhhcceeeeecccchhhhhhcccccCCCC Q lcl|Aclame:pro 38 LKKFGLVFDHATVKRQIELLHEGGVATQAFDSAYVAPTTQASIP-TPIQFLQQWLPGFVKVLTSARKIDEILGVKTVGSW 116 (388) Q Consensus 38 l~~~g~~~~~~~~~~~~~~~~~~~~~~~amDaa~~~~~t~~~~g-~l~~~l~~idp~v~e~l~~~~~~~~i~~v~t~g~w 116 (388) |++ |-.|+ .+.... ..+ .++.+.+ +|..+.+ +|++.+.+....+++.++.... T Consensus 1 ~~~-~~~~~-------~e~~~~----~~~--------~~~~~~~~ip~~~~~----~ii~~~~~~~~l~~~~~~~~~~-- 54 (318) T protein:vir:24 1 MAA-GTAFA-------VDHAQI----AQT--------GDTMFKGYLEPEQAK----DYFAEAEKTSIVQQFAQKVPMG-- 54 (318) T ss_pred CCC-CCCCC-------HHHHHh----hcc--------cCcccceeechhHHH----HHHHHHHhhchhhhhcceeecc-- Confidence 111 21221 111110 001 1112222 4554444 4445554444445554443322 Q ss_pred ceeeEEEeeeccccceEecccccCCceeeeeeeeeeeeEEEEEEEEeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhc Q lcl|Aclame:pro 117 EDQEIVQGIVEPAGTAMEYGDLTNIPLSSWNVNFERRTIVRGEMGIQVGLLEEGRASAMRINSAEVKRQGAAVQLEIMRN 196 (388) Q Consensus 117 ~~~t~~~~v~e~~G~a~~ygd~~diP~~~~n~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~K~~aAr~a~~~~~n 196 (388) ..+..+++....+.+...+....+|..+...+...-..+.++..+.++.+-++. ...++.+.-.....+++...+| T Consensus 55 -~~~~~ip~~~~~~~a~~v~Eg~~~~~~~~~f~~i~~~~~k~~~~~~iS~e~l~d---s~~~~~~~i~~~l~~~~~~~~d 130 (318) T protein:vir:24 55 -TTGQKIPHWVGDVSAQWIGEGDMKPITKGNMTSQTIAPHKIATIFVASAETVRA---NPANYLGTMRTKVATAFAMAFD 130 (318) T ss_pred -CCceEEEEEeCCcceEEecCCccccccccceeEEEEeeEEEEEeehhhHHHhhc---ChHHHHHHHHHHHHHHHHHHHH Confidence 234567777777788888999999999999988999999999999998864443 3357888888999999999999 Q ss_pred eEEEEeecCccccceEEEeecCCCccccccccCCcccccccCCHHHHHHHHHHHHHHHHHhcCCeeccccccceEEcCHH Q lcl|Aclame:pro 197 AIGFYGWEGKNGNRTFGFLNDPSLLPAIASTTPGGWVSGGANAFQGIVGDLRLMLITLRVQSEDNIDPEDVDITLVLPMN 276 (388) Q Consensus 197 ~i~~~G~a~~~~~g~~GllN~P~l~a~~~~~~~~~~t~Wa~kT~~eI~~DI~~~~~~l~~~s~g~v~~~~~p~tL~Lp~~ 276 (388) +-.+.|+... .-.|+++.... +..+...+.+.+ . .+++.+++..+...- ..+..++|.++ T Consensus 131 ~a~l~G~g~~---~~~~~~~~~~~---~~~~~~~~~~~~----~---~~~~~~~~~~~~~~~-------~~~~~~v~n~~ 190 (318) T protein:vir:24 131 GAAMHGTDSP---FPTYIGQTTKA---ISIADTTGATTV----Y---DQVAVNGLSLLVNDG-------KKWTHTLLDDI 190 (318) T ss_pred HhhhcccCCC---CCccccccccc---ccccccccccch----H---HHHHHHHHHhhcccc-------CCCCEEEEcHH Confidence 9999997432 23355544221 111111111111 1 233344444333221 23457999999 Q ss_pred HHHhhccC-CCcCccHHHHHHHh-----CCccEEEEccccccccCCCCccEEEEEEcccccccccccCCCcceEeecchh Q lcl|Aclame:pro 277 KVDMLSVV-TDLGISVRDWLKQT-----YPRVRVMSAPELQGGNPDDGKDIAYMFLDSVDTAVDGSTDGGDTWAQLVQSK 350 (388) Q Consensus 277 ~~~~Ls~~-~~~~~Tvl~~lk~n-----~pnl~i~~~pel~~a~gtg~~~~~~~~~~~~d~~~~~~~~~~~t~~~~~p~~ 350 (388) .+..|.+. +..|..++.-...+ +...++...|-....+. ..++...++.+ ......... ..+...+... T Consensus 191 ~~~~L~~lkd~~G~~l~~~~~~~~~~~~~~~~~i~g~pv~~~~~~-~~~~~~~~~gd-fs~~~~~~~---~~l~i~~~~~ 265 (318) T protein:vir:24 191 TEPILNGAKDQNGRPLFIESTYGEAASPFRSGRIVARPTILSDHV-VEGTTVGFMGD-FSQLIWGQI---GGLSFDVTDQ 265 (318) T ss_pred HHHHHHHhhccCCceeecCccccCccccccCceEEEEeeEEeCCC-CCCccEEEEee-cceEEEEEe---cCeEEEEeec Confidence 99998642 44444332211111 11123444443322111 11222222211 100000000 0001000000 Q ss_pred hhcc--Cce-------eccCceEEecccceeeeeeeccccceeeccC Q lcl|Aclame:pro 351 FVTL--GVE-------KRVKNYVEAYSNATAGVMLKRPWAVVRLIGL 388 (388) Q Consensus 351 ~r~~--~v~-------~~~~~~~~~~~~~t~G~ii~rP~ai~~~~GI 388 (388) ...+ ..+ ++.-...+.+..|. |+.+.+|.||+.+.++ T Consensus 266 ~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~-d~~v~~~~a~~~i~~~ 311 (318) T protein:vir:24 266 ATLNLGTVESPNFVSLWQHNLVAVRVEAEY-AFHCNDAEAFVALTNV 311 (318) T ss_pred cceeccccccccchhhhhcCcEEEEEEEEE-ccEEecccceEEEEee Confidence 0000 000 01111233444455 5556779999999999 No 46 >protein:vir:2344 Length: 397 # NCBI annotation: gp14 # Family: family:all:507 # MgeID: mge:51 # MgeName: Bxb1 # Cross-refs: genbank:acc:NP_075281;genbank:gi:12657868;genbank:GeneID:920118 Probab=97.21 E-value=2e-05 Score=46.33 Aligned_cols=288 Identities=13% Similarity=0.052 Sum_probs=142.0 Q ss_pred cceecccchhhcchhhhhhhhhhhhccCcccccccccccch-HHHHHHHhhcceeeeecccchhhhhhcccccCCCCcee Q lcl|Aclame:pro 41 FGLVFDHATVKRQIELLHEGGVATQAFDSAYVAPTTQASIP-TPIQFLQQWLPGFVKVLTSARKIDEILGVKTVGSWEDQ 119 (388) Q Consensus 41 ~g~~~~~~~~~~~~~~~~~~~~~~~amDaa~~~~~t~~~~g-~l~~~l~~idp~v~e~l~~~~~~~~i~~v~t~g~w~~~ 119 (388) .|+. .+... ...+.+ ..+.| ++-.+.+ ++++.+......+++.++...+. . T Consensus 1 ~g~~---------~e~~~----~~~~~t--------~~~~g~l~~~~~~----~ii~~l~~~s~i~~l~~~~~~~~---~ 52 (397) T protein:vir:23 1 MGFS---------ADHSQ----IAQTKD--------TMFTGYLDPVQAK----DYFAEAEKTSIVQRVAQKIPMGA---T 52 (397) T ss_pred CCcC---------HHHHH----HhhccC--------CCCccccchhHHH----HHHHHHHhccchhhhcceeeccC---C Confidence 2211 11111 111112 12233 3333333 34444444444555555444322 3 Q ss_pred eEEEeeeccccceEecccccCCceeeeeeeeeeeeEEEEEEEEeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEE Q lcl|Aclame:pro 120 EIVQGIVEPAGTAMEYGDLTNIPLSSWNVNFERRTIVRGEMGIQVGLLEEGRASAMRINSAEVKRQGAAVQLEIMRNAIG 199 (388) Q Consensus 120 t~~~~v~e~~G~a~~ygd~~diP~~~~n~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~K~~aAr~a~~~~~n~i~ 199 (388) +..|++.+....+...++...+|..+..........+.++..+.++.+=++.+ ..++.+.-+...++++.+.+|+-. T Consensus 53 ~~~ip~~~~~~~a~wv~Eg~~~~~s~~~f~~v~l~~~k~~~~v~iS~ell~ds---~~~l~~~i~~~l~~aia~~~d~a~ 129 (397) T protein:vir:23 53 GIVIPHWTGDVSAQWIGEGDMKPITKGNMTKRDVHPAKIATIFVASAETVRAN---PANYLGTMRTKVATAIAMAFDNAA 129 (397) T ss_pred ceEEEEEcCCcceEEecCCccccccccceeEEEEeeEEEEEeehhhHHHHhcc---hHHHHHHHHHHHHHHHHHHHHHHH Confidence 45678877777888889999999999999999999999999999998544433 467899999999999999999999 Q ss_pred EEeecCccccceEEEeecCCCccccccccCCcccccccCCHHHHHHHHHHHHHHHHHhcCCeeccccccceEEcCHHHHH Q lcl|Aclame:pro 200 FYGWEGKNGNRTFGFLNDPSLLPAIASTTPGGWVSGGANAFQGIVGDLRLMLITLRVQSEDNIDPEDVDITLVLPMNKVD 279 (388) Q Consensus 200 ~~G~a~~~~~g~~GllN~P~l~a~~~~~~~~~~t~Wa~kT~~eI~~DI~~~~~~l~~~s~g~v~~~~~p~tL~Lp~~~~~ 279 (388) ++|+.. ..+.-|+++..+.... +..+. .++|+..++..+...- ..+..++|.+..+. T Consensus 130 l~G~gt--~~~~~~~~~~~~~~~~-----------~~~~~---~~~~~~~~~~~l~~~~-------~~~a~~vmn~~~~~ 186 (397) T protein:vir:23 130 LHGTNA--PSAFQGYLDQSNKTQS-----------ISPNA---YQGLGVSGLTKLVTDG-------KKWTHTLLDDTVEP 186 (397) T ss_pred hhcccC--Ccccccccccccceee-----------ecccc---hhHHHHHHHHhhhhcc-------cCCCEEEEcHHHHH Confidence 999854 2355566554432111 11111 1334444444444322 12357899999998 Q ss_pred hhccC-CCcCccHHHHHHHh-CC----ccEEEEccccccccCCCCccEEEEEEcccccccccccCCCcceEeecchhh-h Q lcl|Aclame:pro 280 MLSVV-TDLGISVRDWLKQT-YP----RVRVMSAPELQGGNPDDGKDIAYMFLDSVDTAVDGSTDGGDTWAQLVQSKF-V 352 (388) Q Consensus 280 ~Ls~~-~~~~~Tvl~~lk~n-~p----nl~i~~~pel~~a~gtg~~~~~~~~~~~~d~~~~~~~~~~~t~~~~~p~~~-r 352 (388) .|.+. +..|.-++.=-..+ .| .-++...|-....+- ..++...++.+ .......... .-+++ +.... . T Consensus 187 ~L~~lkd~~G~~i~~~~~~~~~~~~~~~~tl~G~Pv~~s~~~-~~g~~~~~~gD-fs~~~i~~~~-~i~i~--~~~e~~~ 261 (397) T protein:vir:23 187 VLNGSVDANGRPLFVESTYESLTTPFREGRILGRPTILSDHV-AEGDVVGYAGD-FSQIIWGQVG-GLSFD--VTDQATL 261 (397) T ss_pred HHHHhhccCCceeecccccccccccccCceeeeeeEEEeCCC-CCCceEEEEee-cceEEEEEEe-ceEEE--Eeeeeee Confidence 88653 33344332211111 11 123444442221111 11222222211 1000000000 00000 00000 0 Q ss_pred ccCc---eeccCce-----EEecccceeeeeeeccccceeeccC Q lcl|Aclame:pro 353 TLGV---EKRVKNY-----VEAYSNATAGVMLKRPWAVVRLIGL 388 (388) Q Consensus 353 ~~~v---~~~~~~~-----~~~~~~~t~G~ii~rP~ai~~~~GI 388 (388) ..+. ......| ...+..|. |+.+++|-+|++..+- T Consensus 262 ~~~~~~~~~~~~lf~~d~v~~ra~~r~-d~~v~~~~a~~~~~~~ 304 (397) T protein:vir:23 262 NLGSQESPNFVSLWQHNLVAVRVEAEY-GLLINDVNAFVKLTFD 304 (397) T ss_pred eeccccccceeeeeeccceeEEEEeee-ccceecccceEEEeec Confidence 0000 0000001 11222333 5577789999999886 No 47 >protein:vir:94673 Length: 419 # NCBI annotation: major capsid protein # Family: family:all:585 # MgeID: mge:1527 # MgeName: mu1/6 # Cross-refs: genbank:acc:YP_579208;genbank:gi:93007444;genbank:GeneID:5076792 Probab=97.14 E-value=3.9e-05 Score=44.76 Aligned_cols=334 Identities=10% Similarity=-0.013 Sum_probs=143.6 Q ss_pred CCCc----------------ceeeeecCccccchhhhhhcccccccccCCHHHHhhcceecccchhhcchhhhhhhhhhh Q lcl|Aclame:pro 1 MKQL----------------SKVHQSLAGRSVRAFDMANGKADYRLTDMAVRELKKFGLVFDHATVKRQIELLHEGGVAT 64 (388) Q Consensus 1 ~~~~----------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~ 64 (388) +.++ ...-..-.....+.+..... +.. ....+. +....+....... ..... .. T Consensus 50 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~--~~~~~~--~~~~~~~~~~~~~-~~~~~--~~ 118 (419) T protein:vir:94 50 AARAALLRTAPPAPKGPADGGTPLTPAEAGTFRSLAQRFA----DSD--GLREYR--ARDKRGQFQVEMR-DIDPN--RL 118 (419) T ss_pred HHHHHHHHHHHHHHHHHhhhhccccccccccccchhhhhh----hHH--HHHHHH--HhhhhhhhhHHHH-HHHHH--Hh Confidence 0000 00000000000000000000 000 000000 0000000000000 00000 11 Q ss_pred hccCcccccccccccch-HHHHHHHhhcceeeeecccchhhhhhcccccCCCCc----ee-eEEEeeeccccceEecccc Q lcl|Aclame:pro 65 QAFDSAYVAPTTQASIP-TPIQFLQQWLPGFVKVLTSARKIDEILGVKTVGSWE----DQ-EIVQGIVEPAGTAMEYGDL 138 (388) Q Consensus 65 ~amDaa~~~~~t~~~~g-~l~~~l~~idp~v~e~l~~~~~~~~i~~v~t~g~w~----~~-t~~~~v~e~~G~a~~ygd~ 138 (388) ..++ ...+..+.++.. +|..... .+..........++++.+....... .+ ..+..+....+.+...+.. T Consensus 119 ~~~~-~~~~~~~~~~~~~~p~~~~~----~i~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg 193 (419) T protein:vir:94 119 LSRD-APAGTITNPNVPHLPQLVPG----IVPTTPDLPLLVADLLDQQNADYNVLEYIRDTSGTAGAGSTWNKAAVVPEG 193 (419) T ss_pred hccc-cccccccCCcccccchhhhH----HHHHHHhhhhhhhhcceeeeccCCceeeeeeccccccccccCcccceecCC Confidence 1111 111111222221 1222222 2222222222334444333221111 11 0111222333456677888 Q ss_pred cCCceeeeeeeeeeeeEEEEEEEEeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEEeecCccccceEEEeecC Q lcl|Aclame:pro 139 TNIPLSSWNVNFERRTIVRGEMGIQVGLLEEGRASAMRINSAEVKRQGAAVQLEIMRNAIGFYGWEGKNGNRTFGFLNDP 218 (388) Q Consensus 139 ~diP~~~~n~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~K~~aAr~a~~~~~n~i~~~G~a~~~~~g~~GllN~P 218 (388) ...|..+..........+.++..+.++.+=++-+ .++.+.-....+.++...+|+-.++|+.. ....|++|.+ T Consensus 194 ~~~~~~~~~~~~i~~~~~k~~~~~~is~ell~d~----~~l~~~i~~~la~a~~~~~d~aii~G~G~---~~p~Gi~~~~ 266 (419) T protein:vir:94 194 TAKPQSTLSFDTITTTLKTVAHWLPITRQAADDN----SQLMGYIQGRLTYGLRFLRDRQLLNGNGS---TEMQGILTTP 266 (419) T ss_pred ccccccccceeeEEeeeeeEEEeehhhHHHHHhH----HHHHHHHHHHHHHHHHHHHHHHHHhccCc---ccccceeccc Confidence 8899999999999999999999999998655433 14778777888888888899999999763 3588999999 Q ss_pred CCccccccccCCcccccccCCHHHHHHHHHHHHHHHHHhcCCeeccccccceEEcCHHHHHhhccC-CCcCccH-HH-HH Q lcl|Aclame:pro 219 SLLPAIASTTPGGWVSGGANAFQGIVGDLRLMLITLRVQSEDNIDPEDVDITLVLPMNKVDMLSVV-TDLGISV-RD-WL 295 (388) Q Consensus 219 ~l~a~~~~~~~~~~t~Wa~kT~~eI~~DI~~~~~~l~~~s~g~v~~~~~p~tL~Lp~~~~~~Ls~~-~~~~~Tv-l~-~l 295 (388) ++.+..++. .+...|....++||.+++..+...-. .+..++|.+..+..|... +..|..+ +. .+ T Consensus 267 ~~~~~~~~~------~~~~~t~~~~~~~l~~~~~~~~~~~~-------~~~~~v~n~~~~~~l~~~k~~~~~~~~~~~~~ 333 (419) T protein:vir:94 267 GIGTYQQPK------PTAPATDEPPLVDIRRAKTVAEIAGF-------PPDGVVVHPQDWESIELDQAPGSGVFRVIANV 333 (419) T ss_pred ccccccccc------cccccccchhHHHHHHHHHhhhhccC-------CCCEEEEcHHHHHHHHHHhhcCCCceeecCCc Confidence 875443322 24556677789999999988874321 244789999988888532 2222111 10 01 Q ss_pred HH----hCCccEEEEccccccccCC---CCccEEEEEEcccccccccccCCCcceEeecchhhhccCceeccCceEEecc Q lcl|Aclame:pro 296 KQ----TYPRVRVMSAPELQGGNPD---DGKDIAYMFLDSVDTAVDGSTDGGDTWAQLVQSKFVTLGVEKRVKNYVEAYS 368 (388) Q Consensus 296 k~----n~pnl~i~~~pel~~a~gt---g~~~~~~~~~~~~d~~~~~~~~~~~t~~~~~p~~~r~~~v~~~~~~~~~~~~ 368 (388) .. ..-++.++....+. + ++ |.-+..+.+....+.....+.+.. ..|+. -.....+. T Consensus 334 ~~~~~~~l~G~pV~~~~~~~-~-~~~~~gd~~~~~~~~~~~~~~v~~~~~~~--------~~~~~-------~~~~~r~~ 396 (419) T protein:vir:94 334 QGEATPRIWGLNVVSTVAIA-Q-GTALVGGFRQGATLWSRQGITVLMTDSHA--------DFFTA-------NTLVILAE 396 (419) T ss_pred ccCCCccccceeeEEcCCCC-C-ccEEEeeccceEEEEEecceEEEEecccc--------chhhc-------CcEEEEEE Confidence 00 00111222222211 0 00 000111111110000000000000 00111 11223344 Q ss_pred cceeeeeeeccccceeeccC Q lcl|Aclame:pro 369 NATAGVMLKRPWAVVRLIGL 388 (388) Q Consensus 369 ~~t~G~ii~rP~ai~~~~GI 388 (388) .|.+|. +++|-||+++..- T Consensus 397 ~r~d~~-v~~~~a~~~~~~~ 415 (419) T protein:vir:94 397 FRANLA-VYQPKAFVRVTFA 415 (419) T ss_pred EeeccE-EeccccEEEEEec Confidence 455554 4669999998877 No 48 >protein:vir:8420 Length: 477 # NCBI annotation: gp15 # Family: family:all:21 # MgeID: mge:155 # MgeName: Omega # Cross-refs: genbank:acc:NP_818316;genbank:gi:29566752;genbank:GeneID:1260033 Probab=96.90 E-value=5.6e-05 Score=43.90 Aligned_cols=346 Identities=10% Similarity=0.016 Sum_probs=142.0 Q ss_pred CCCcceeeeecCccccchh--hhhhcccccccccCCHHHHhhcceecccchhhcchhhhhh--hhhhhhccCcccccccc Q lcl|Aclame:pro 1 MKQLSKVHQSLAGRSVRAF--DMANGKADYRLTDMAVRELKKFGLVFDHATVKRQIELLHE--GGVATQAFDSAYVAPTT 76 (388) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~--~~~~~~amDaa~~~~~t 76 (388) .......+........+.+ .+... ...+.......++.+....... ..+.... ......++.. + T Consensus 93 ~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~------~ 160 (477) T protein:vir:84 93 ATVEVNEALTYEKGNGQSYFRDLAMQ-TVGMADEPAKERLRRHMVDVES-----DKEIRKIAKVGEEYRDLDR------N 160 (477) T ss_pred cccccccchhhhhhHHHHHHHHHHHH-HhhhhhhHHHHHHHHHHhhhhh-----hhhHHHHHHhhhhhccccc------c Confidence 0000000000000000000 00000 0000000000111110000000 0000000 0001111111 1 Q ss_pred cccch--HHHHHHHhhcceeeeecccchhhhhhcccccCCCCceeeEEEeeeccccce-Eecccc-----cCCceeeeee Q lcl|Aclame:pro 77 QASIP--TPIQFLQQWLPGFVKVLTSARKIDEILGVKTVGSWEDQEIVQGIVEPAGTA-MEYGDL-----TNIPLSSWNV 148 (388) Q Consensus 77 ~~~~g--~l~~~l~~idp~v~e~l~~~~~~~~i~~v~t~g~w~~~t~~~~v~e~~G~a-~~ygd~-----~diP~~~~n~ 148 (388) ....| +|-.++ -.+|++.+-.....+++++...... ....+.++..+..+.. ...++. ...|..+... T Consensus 161 ~~~gg~lv~~~~~---~~~ii~~l~~~~~i~~~~~~~~~~~-~~~~~~ip~~~~~~~~a~~~~Eg~~~~~~~~~~s~~~f 236 (477) T protein:vir:84 161 GGTGGYAVPPLWM---MNRFIELARAGRTYANLCPTEPLPG-GTSSINIPKILTGTSTAIQAADNAALTAPSAHEVDLTD 236 (477) T ss_pred CCCcceeeccchh---HHHHHHHhhhcchHHHhhceeeecC-CcceeEEEEEecCcceeeeeccCcccccccccccccce Confidence 11222 222221 2345565555444555554433211 1123455555433322 234443 3557778888 Q ss_pred eeeeeeEEEEEEEEeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEEeecCccccceEEEeecCCCcccccccc Q lcl|Aclame:pro 149 NFERRTIVRGEMGIQVGLLEEGRASAMRINSAEVKRQGAAVQLEIMRNAIGFYGWEGKNGNRTFGFLNDPSLLPAIASTT 228 (388) Q Consensus 149 ~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~K~~aAr~a~~~~~n~i~~~G~a~~~~~g~~GllN~P~l~a~~~~~~ 228 (388) +...-+.+.++..+.++.+=|+ ....++.+--....+.++...+|+-.++|+.- .....|++|.+++.....+ T Consensus 237 ~~i~~~~~k~~~~~~iS~ell~---ds~~~l~~~i~~~l~~~~~~~~d~~~l~G~Gt--~~~p~Gi~~~~~~~~~~~~-- 309 (477) T protein:vir:84 237 GFVQANVKTIAGQQGIAIQLLD---QAAVSVDEFVFRDLAADYANKLNVQVISGTGS--NNQVVGVRATAGITQVTAT-- 309 (477) T ss_pred eeEEEeeeeEEeeeHHHHHHHh---ccchhHHHHHHHHHHHHHHHHHHHHHhccCCC--CCccceeeecccccccccc-- Confidence 8888888888888888875333 23467888888999999999999999999742 2357899999887543221 Q ss_pred CCcccccccCCHHHHHHHHHHHHHHHHHhcCCeeccccccceEEcCHHHHHhhcc-CCCcCccHHH----------HHHH Q lcl|Aclame:pro 229 PGGWVSGGANAFQGIVGDLRLMLITLRVQSEDNIDPEDVDITLVLPMNKVDMLSV-VTDLGISVRD----------WLKQ 297 (388) Q Consensus 229 ~~~~t~Wa~kT~~eI~~DI~~~~~~l~~~s~g~v~~~~~p~tL~Lp~~~~~~Ls~-~~~~~~Tvl~----------~lk~ 297 (388) +.+++|+. .+..+++|..++..+-.... ..+...+|.|..+..|.. .+..|.-+++ ++.. T Consensus 310 -~~~~t~~~--~~~~~~~i~~~~~~~~~~~~------~~~~~~v~~~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~~~~~ 380 (477) T protein:vir:84 310 -SAGSALEK--HQIIYQKIADAIQRVHTSRF------LEPEVIVMHPRRWASFHAIFAGDDRPLIVPSGPGFNNLGVLTE 380 (477) T ss_pred -ccccchhh--HHHHHHHHHHHHhhcccccc------CCccEEEEcHHHHHHHHHhhccCCCeeeecCcccccccccccc Confidence 12223332 33455555555554443221 123357778887777743 2222322211 1111 Q ss_pred h--------CCccEEEEccccccccCCCCccEEEEEEcccccccccccCCCcceEeecchhhhccCceeccCceEEeccc Q lcl|Aclame:pro 298 T--------YPRVRVMSAPELQGGNPDDGKDIAYMFLDSVDTAVDGSTDGGDTWAQLVQSKFVTLGVEKRVKNYVEAYSN 369 (388) Q Consensus 298 n--------~pnl~i~~~pel~~a~gtg~~~~~~~~~~~~d~~~~~~~~~~~t~~~~~p~~~r~~~v~~~~~~~~~~~~~ 369 (388) + .....++..+.+-.-.++++....++|.+--+ ...+ ++.-.. ...+..+..... ..|+ ..+ T Consensus 381 ~~~~~~~~~l~G~pVv~s~~~p~~~~~~~d~~~i~~gd~~~-~~i~--~~~~~~-~~~~~~~~~~~~----~~~~--v~~ 450 (477) T protein:vir:84 381 VASQRVVGQMHGLPVVTDPTLPTTLGTGTDQDVIHVLRASD-LALF--ESSVRM-RALQETRAENLS----VLLQ--VYG 450 (477) T ss_pred cccccccchhcccceEecCcccccccccCCcceEEEEEece-EEEE--eeceeE-Eeccccccccce----eeee--ehh Confidence 0 11223333333321112233333334432211 1111 111111 111221111100 0111 111 Q ss_pred ceeeeeeeccccceeeccC Q lcl|Aclame:pro 370 ATAGVMLKRPWAVVRLIGL 388 (388) Q Consensus 370 ~t~G~ii~rP~ai~~~~GI 388 (388) ......+|+|-||+.++|. T Consensus 451 ~~~~~~~r~~~afv~~t~~ 469 (477) T protein:vir:84 451 YLAFTAARFPQSVVEIGGT 469 (477) T ss_pred hhhhhhhccccceEEeecc Confidence 1223567889999999999 No 49 >protein:vir:7855 Length: 497 # NCBI annotation: gp12 # Family: family:all:585 # MgeID: mge:150 # MgeName: CJW1 # Cross-refs: genbank:acc:NP_817462;genbank:gi:29565891;genbank:GeneID:1259081 Probab=96.41 E-value=0.00044 Score=38.97 Aligned_cols=349 Identities=10% Similarity=0.006 Sum_probs=153.5 Q ss_pred CCCccee---eeecCccccchh------------hhhhcccccccccCCHHHHhhcceecccchhhcchhhhhhhhhhhh Q lcl|Aclame:pro 1 MKQLSKV---HQSLAGRSVRAF------------DMANGKADYRLTDMAVRELKKFGLVFDHATVKRQIELLHEGGVATQ 65 (388) Q Consensus 1 ~~~~~~~---~~~~~~~~~~~~------------~~~~~~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~ 65 (388) +.++... ...+..+..+.. ++.+........... .++.+--..........+......-... T Consensus 67 ~a~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 143 (497) T protein:vir:78 67 DAAKDGLDNDIPEVEVRNLKQIRKHLARAVIMNPELKNATSFEKGTKFD---VSFNVSAKAADPGTAAAELMGAFADGET 143 (497) T ss_pred HHHHHHHHHHHHHHHhhhhhhHHHHHHHHHhhhHHHHhhhhhhhhhhhh---hhhhhhhhhhhhHHHHHHHHHHHhhhhh Confidence 0000000 000000000000 000000000000000 0000000000000000000000000000 Q ss_pred ccCccc-ccccccccchHHHHHHHhhcceeeeecccchhhhhhcccccCCCCceeeEEEeeecc-ccceEecccccCCce Q lcl|Aclame:pro 66 AFDSAY-VAPTTQASIPTPIQFLQQWLPGFVKVLTSARKIDEILGVKTVGSWEDQEIVQGIVEP-AGTAMEYGDLTNIPL 143 (388) Q Consensus 66 amDaa~-~~~~t~~~~g~l~~~l~~idp~v~e~l~~~~~~~~i~~v~t~g~w~~~t~~~~v~e~-~G~a~~ygd~~diP~ 143 (388) +....- ....+.+..|+++- +-+.++|++.+......++++++.+.+. ..+.|++... .+.+...+....+|. T Consensus 144 ~~~~~~~~~~~~~~~gg~~vp--~~~~~~ii~~~~~~~~i~~l~~~~~~~~---~~~~~~~~~~~~~~a~wv~E~~~~~~ 218 (497) T protein:vir:78 144 APAAIGQNPFGSTGTFAPGIL--PTFLPGIVEQLFYELSLADLISSRPVTS---PNLSYLTESAAHNNAAAVAEAGTYPF 218 (497) T ss_pred hHHHHHhhhcccCcccccccc--hhhhHHHHHHHHhhhhHHhhccccccCC---CceEEEEEcCCCCcceeeccCccccc Confidence 000000 00112233333322 2234577887777777788876655433 2355655433 356778899999999 Q ss_pred eeeeeeeeeeeEEEEEEEEeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEEeecCccccceEEEeecCCCccc Q lcl|Aclame:pro 144 SSWNVNFERRTIVRGEMGIQVGLLEEGRASAMRINSAEVKRQGAAVQLEIMRNAIGFYGWEGKNGNRTFGFLNDPSLLPA 223 (388) Q Consensus 144 ~~~n~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~K~~aAr~a~~~~~n~i~~~G~a~~~~~g~~GllN~P~l~a~ 223 (388) .+...+......+.+...+.++.+=|+-+ . .|.+--....++++...+|+-.++|+.- .+..|++|++...+. T Consensus 219 s~~~f~~i~~~~~k~a~~~~iS~ell~d~--~--~l~~~i~~~l~~~i~~~~d~~~l~G~G~---~~p~Gil~~~~~~~~ 291 (497) T protein:vir:78 219 SSEEFARVYEQVGKVANALTITDEGLRDA--P--ELFNFVQGRLLEGIQRKEEVQLLAGGGY---PGVNGLLQRSTGFTA 291 (497) T ss_pred ccccceeeEeeeeeeEeecHhHHHHHHhH--H--HHHHHHHHHHHHHHHHHHHHHhhcCCCc---ccccccccccccccc Confidence 99999999999999999888888533322 2 4778888888899999999999999742 258899999876432 Q ss_pred cccccCCcc---------------ccc----------------------------ccCCHHHHHHHHHHHHHHHHHhcCC Q lcl|Aclame:pro 224 IASTTPGGW---------------VSG----------------------------GANAFQGIVGDLRLMLITLRVQSED 260 (388) Q Consensus 224 ~~~~~~~~~---------------t~W----------------------------a~kT~~eI~~DI~~~~~~l~~~s~g 260 (388) ......... ..| ...+...++.++..++..+..... T Consensus 292 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~- 370 (497) T protein:vir:78 292 SSASSLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLF- 370 (497) T ss_pred cccccchhhhhhhhhhhhhhcccccchhhhhhHHHHHHHHHhhhhhhhhccchhccccchhhhhhHHHHHHhhhhhhcc- Confidence 211110000 000 011334455666666666655432 Q ss_pred eeccccccceEEcCHHHHHhhccC-CCcCccHHHHH---------HH--hCCccEEEEccccccccCC---CC-ccEEEE Q lcl|Aclame:pro 261 NIDPEDVDITLVLPMNKVDMLSVV-TDLGISVRDWL---------KQ--TYPRVRVMSAPELQGGNPD---DG-KDIAYM 324 (388) Q Consensus 261 ~v~~~~~p~tL~Lp~~~~~~Ls~~-~~~~~Tvl~~l---------k~--n~pnl~i~~~pel~~a~gt---g~-~~~~~~ 324 (388) ..|+.++|.+..+..|.+. +..|.-++.-- .. ......++..+.+. + ++ |. ....+. T Consensus 371 -----~~~~~~vmn~~~~~~l~~lkd~~G~~i~~~~~~~~~~~~~~~~~~l~G~pV~~t~~~~-~-~~~~~Gd~~~~~~~ 443 (497) T protein:vir:78 371 -----QTPNAVVMNPRDWELLRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIP-L-GTILVGHFAPSVIQ 443 (497) T ss_pred -----cCCCeEEEchHHHHHHHHhhcCCCceeccCcccccccccccCCceeeceeeEecCCCC-C-CceEEeecccceEE Confidence 2355788888888877542 33344332110 00 00112222222221 1 00 00 001111 Q ss_pred EEcccccccccccCCCcceEeecchhhhccCceeccCceEEecccceeeeeeeccccceeeccC Q lcl|Aclame:pro 325 FLDSVDTAVDGSTDGGDTWAQLVQSKFVTLGVEKRVKNYVEAYSNATAGVMLKRPWAVVRLIGL 388 (388) Q Consensus 325 ~~~~~d~~~~~~~~~~~t~~~~~p~~~r~~~v~~~~~~~~~~~~~~t~G~ii~rP~ai~~~~GI 388 (388) +.++.+.....+.+ ....+..-...+-+..|.+| .|++|.||+++.-. T Consensus 444 i~~r~~~~v~~~~~---------------~~~~f~~n~v~~r~~~r~~~-~v~~p~A~~~l~~~ 491 (497) T protein:vir:78 444 TARREGVTMQMTNS---------------NGTDFVDGKVTVRAEERLGL-LVYRPSAFQLIQLK 491 (497) T ss_pred EEEecccEEEeecc---------------cchhhhcCcEEEEEEEeecc-eeeccccEEEEEec Confidence 11111111111110 00011111233444556655 66689999999888 No 50 >protein:vir:101650 Length: 497 # NCBI annotation: gp13 # Family: family:all:585 # MgeID: mge:1515 # MgeName: 244 # Cross-refs: genbank:acc:YP_654768;genbank:gi:109302766;genbank:GeneID:4156084 Probab=96.41 E-value=0.00044 Score=38.97 Aligned_cols=349 Identities=10% Similarity=0.006 Sum_probs=153.5 Q ss_pred CCCccee---eeecCccccchh------------hhhhcccccccccCCHHHHhhcceecccchhhcchhhhhhhhhhhh Q lcl|Aclame:pro 1 MKQLSKV---HQSLAGRSVRAF------------DMANGKADYRLTDMAVRELKKFGLVFDHATVKRQIELLHEGGVATQ 65 (388) Q Consensus 1 ~~~~~~~---~~~~~~~~~~~~------------~~~~~~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~ 65 (388) +.++... ...+..+..+.. ++.+........... .++.+--..........+......-... T Consensus 67 ~a~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 143 (497) T protein:vir:10 67 DAAKDGLDNDIPEVEVRNLKQIRKHLARAVIMNPELKNATSFEKGTKFD---VSFNVSAKAADPGTAAAELMGAFADGET 143 (497) T ss_pred HHHHHHHHHHHHHHHhhhhhhHHHHHHHHHhhhHHHHhhhhhhhhhhhh---hhhhhhhhhhhhHHHHHHHHHHHhhhhh Confidence 0000000 000000000000 000000000000000 0000000000000000000000000000 Q ss_pred ccCccc-ccccccccchHHHHHHHhhcceeeeecccchhhhhhcccccCCCCceeeEEEeeecc-ccceEecccccCCce Q lcl|Aclame:pro 66 AFDSAY-VAPTTQASIPTPIQFLQQWLPGFVKVLTSARKIDEILGVKTVGSWEDQEIVQGIVEP-AGTAMEYGDLTNIPL 143 (388) Q Consensus 66 amDaa~-~~~~t~~~~g~l~~~l~~idp~v~e~l~~~~~~~~i~~v~t~g~w~~~t~~~~v~e~-~G~a~~ygd~~diP~ 143 (388) +....- ....+.+..|+++- +-+.++|++.+......++++++.+.+. ..+.|++... .+.+...+....+|. T Consensus 144 ~~~~~~~~~~~~~~~gg~~vp--~~~~~~ii~~~~~~~~i~~l~~~~~~~~---~~~~~~~~~~~~~~a~wv~E~~~~~~ 218 (497) T protein:vir:10 144 APAAIGQNPFGSTGTFAPGIL--PTFLPGIVEQLFYELSLADLISSRPVTS---PNLSYLTESAAHNNAAAVAEAGTYPF 218 (497) T ss_pred hHHHHHhhhcccCcccccccc--hhhhHHHHHHHHhhhhHHhhccccccCC---CceEEEEEcCCCCcceeeccCccccc Confidence 000000 00112233333322 2234577887777777788876655433 2355655433 356778899999999 Q ss_pred eeeeeeeeeeeEEEEEEEEeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEEeecCccccceEEEeecCCCccc Q lcl|Aclame:pro 144 SSWNVNFERRTIVRGEMGIQVGLLEEGRASAMRINSAEVKRQGAAVQLEIMRNAIGFYGWEGKNGNRTFGFLNDPSLLPA 223 (388) Q Consensus 144 ~~~n~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~K~~aAr~a~~~~~n~i~~~G~a~~~~~g~~GllN~P~l~a~ 223 (388) .+...+......+.+...+.++.+=|+-+ . .|.+--....++++...+|+-.++|+.- .+..|++|++...+. T Consensus 219 s~~~f~~i~~~~~k~a~~~~iS~ell~d~--~--~l~~~i~~~l~~~i~~~~d~~~l~G~G~---~~p~Gil~~~~~~~~ 291 (497) T protein:vir:10 219 SSEEFARVYEQVGKVANALTITDEGLRDA--P--ELFNFVQGRLLEGIQRKEEVQLLAGGGY---PGVNGLLQRSTGFTA 291 (497) T ss_pred ccccceeeEeeeeeeEeecHhHHHHHHhH--H--HHHHHHHHHHHHHHHHHHHHHhhcCCCc---ccccccccccccccc Confidence 99999999999999999888888533322 2 4778888888899999999999999742 258899999876432 Q ss_pred cccccCCcc---------------ccc----------------------------ccCCHHHHHHHHHHHHHHHHHhcCC Q lcl|Aclame:pro 224 IASTTPGGW---------------VSG----------------------------GANAFQGIVGDLRLMLITLRVQSED 260 (388) Q Consensus 224 ~~~~~~~~~---------------t~W----------------------------a~kT~~eI~~DI~~~~~~l~~~s~g 260 (388) ......... ..| ...+...++.++..++..+..... T Consensus 292 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~- 370 (497) T protein:vir:10 292 SSASSLFGATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLF- 370 (497) T ss_pred cccccchhhhhhhhhhhhhhcccccchhhhhhHHHHHHHHHhhhhhhhhccchhccccchhhhhhHHHHHHhhhhhhcc- Confidence 211110000 000 011334455666666666655432 Q ss_pred eeccccccceEEcCHHHHHhhccC-CCcCccHHHHH---------HH--hCCccEEEEccccccccCC---CC-ccEEEE Q lcl|Aclame:pro 261 NIDPEDVDITLVLPMNKVDMLSVV-TDLGISVRDWL---------KQ--TYPRVRVMSAPELQGGNPD---DG-KDIAYM 324 (388) Q Consensus 261 ~v~~~~~p~tL~Lp~~~~~~Ls~~-~~~~~Tvl~~l---------k~--n~pnl~i~~~pel~~a~gt---g~-~~~~~~ 324 (388) ..|+.++|.+..+..|.+. +..|.-++.-- .. ......++..+.+. + ++ |. ....+. T Consensus 371 -----~~~~~~vmn~~~~~~l~~lkd~~G~~i~~~~~~~~~~~~~~~~~~l~G~pV~~t~~~~-~-~~~~~Gd~~~~~~~ 443 (497) T protein:vir:10 371 -----QTPNAVVMNPRDWELLRLTKDANGQYMGGNFFGNAYGNPVNGGKNIWGVPVVTTPLIP-L-GTILVGHFAPSVIQ 443 (497) T ss_pred -----cCCCeEEEchHHHHHHHHhhcCCCceeccCcccccccccccCCceeeceeeEecCCCC-C-CceEEeecccceEE Confidence 2355788888888877542 33344332110 00 00112222222221 1 00 00 001111 Q ss_pred EEcccccccccccCCCcceEeecchhhhccCceeccCceEEecccceeeeeeeccccceeeccC Q lcl|Aclame:pro 325 FLDSVDTAVDGSTDGGDTWAQLVQSKFVTLGVEKRVKNYVEAYSNATAGVMLKRPWAVVRLIGL 388 (388) Q Consensus 325 ~~~~~d~~~~~~~~~~~t~~~~~p~~~r~~~v~~~~~~~~~~~~~~t~G~ii~rP~ai~~~~GI 388 (388) +.++.+.....+.+ ....+..-...+-+..|.+| .|++|.||+++.-. T Consensus 444 i~~r~~~~v~~~~~---------------~~~~f~~n~v~~r~~~r~~~-~v~~p~A~~~l~~~ 491 (497) T protein:vir:10 444 TARREGVTMQMTNS---------------NGTDFVDGKVTVRAEERLGL-LVYRPSAFQLIQLK 491 (497) T ss_pred EEEecccEEEeecc---------------cchhhhcCcEEEEEEEeecc-eeeccccEEEEEec Confidence 11111111111110 00011111233444556655 66689999999888 No 51 >protein:vir:97053 Length: 390 # NCBI annotation: putative head protein # Family: family:all:585 # MgeID: mge:1653 # MgeName: OP1 # Cross-refs: genbank:acc:YP_453565;genbank:gi:84662600;genbank:GeneID:5142468 Probab=96.34 E-value=0.00012 Score=42.17 Aligned_cols=327 Identities=12% Similarity=0.053 Sum_probs=143.5 Q ss_pred CCCcceeeeecCccccchhhhhhcccc----------cccccCCHHHHhhcceecccchhhcchhhhhhhhhhhhccCcc Q lcl|Aclame:pro 1 MKQLSKVHQSLAGRSVRAFDMANGKAD----------YRLTDMAVRELKKFGLVFDHATVKRQIELLHEGGVATQAFDSA 70 (388) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~----------~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~amDaa 70 (388) ..++......+...+-+--++...... ....+.....+.+.+ .........+ ......+. T Consensus 44 ~~e~~~l~~~i~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~-------~~~~~~~~ 113 (390) T protein:vir:97 44 FATVGNLSAEVQAARQRVAELEGNGAGGDVQHVSVGDMFVASEQFQASTGRW---NDRSARATMN-------IKAALNTA 113 (390) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhcccccccccccchhhhhhhHHHHHHHHHh---hhhhhhhhhH-------HHHHHHhh Confidence 000000000000000000000000000 000000000000000 0000000000 00111111 Q ss_pred cccccccccchHHHHHHHhhcceeeeecccchhhhhhcccccCCCCceeeEEEeeecc-ccceEecccccCCceeeeeee Q lcl|Aclame:pro 71 YVAPTTQASIPTPIQFLQQWLPGFVKVLTSARKIDEILGVKTVGSWEDQEIVQGIVEP-AGTAMEYGDLTNIPLSSWNVN 149 (388) Q Consensus 71 ~~~~~t~~~~g~l~~~l~~idp~v~e~l~~~~~~~~i~~v~t~g~w~~~t~~~~v~e~-~G~a~~ygd~~diP~~~~n~~ 149 (388) .. .+..+.|.++-.. +.++|++.+......+.++++...+. .+..|++... .+.+...+....+|-.+.... T Consensus 114 ~~--~~~~~~g~lip~~--~~~~ii~~~~~~~~i~~~~~~~~~~~---~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~ 186 (390) T protein:vir:97 114 ST--DAAGSAGALTTPN--RLPGFITPPDARLTVRDLIGSGRTDS---ALIEYVQETGFVNNAAIVAEGALKPESSLKFA 186 (390) T ss_pred hc--ccccccccccchh--hhHHHHHHHhhhhhhHhhcceeeccC---CceEEEEEecCCcceeeecCCcccccccccee Confidence 11 1233344333221 12466776666666666666554332 2345666544 356778888889999999999 Q ss_pred eeeeeEEEEEEEEeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEEeecCccccceEEEeecCCCccccccccC Q lcl|Aclame:pro 150 FERRTIVRGEMGIQVGLLEEGRASAMRINSAEVKRQGAAVQLEIMRNAIGFYGWEGKNGNRTFGFLNDPSLLPAIASTTP 229 (388) Q Consensus 150 ~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~K~~aAr~a~~~~~n~i~~~G~a~~~~~g~~GllN~P~l~a~~~~~~~ 229 (388) ......+.++..+.++.+ +-.-. .++.+.-....++++.+.+|+-.++|+.. .....|++|.++..... T Consensus 187 ~i~~~~~k~~~~~~is~e-ll~ds---~~l~~~i~~~la~a~~~~~d~a~l~G~g~--~~~p~Gi~~~~~~~~~~----- 255 (390) T protein:vir:97 187 KKTDTTHVIAHTMKATRQ-ILSDA---PQLASYMNNRLIRGLKVKEDAEILRGTGA--NDGLLGLIPQATTYAAP----- 255 (390) T ss_pred EEEEeeeeEEEeehhhHH-HHHhH---HHHHHHHHHHHHHHHHHHHHHHHhhcCCC--Cccccceeecccccccc----- Confidence 999999999998888885 43322 25788888888899999999999999643 23578999987653221 Q ss_pred CcccccccCCHHHHHHHHHHHHHHHHHhcCCeeccccccceEEcCHHHHHhhccC-CCcCccHHHHHHHh----CCccEE Q lcl|Aclame:pro 230 GGWVSGGANAFQGIVGDLRLMLITLRVQSEDNIDPEDVDITLVLPMNKVDMLSVV-TDLGISVRDWLKQT----YPRVRV 304 (388) Q Consensus 230 ~~~t~Wa~kT~~eI~~DI~~~~~~l~~~s~g~v~~~~~p~tL~Lp~~~~~~Ls~~-~~~~~Tvl~~lk~n----~pnl~i 304 (388) ...+.+..++||..++..+...-. .+..++|.|..+..|.+. +..|.-++.-.... .-++.+ T Consensus 256 ------~~~~~~~~~d~~~~~~~~~~~~~~-------~~~~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~~~~~l~G~pV 322 (390) T protein:vir:97 256 ------TTIAGATRVDQLRLAMLQASLAEY-------PASGIVINPIDWAAIELAKDANNQYLIGNARGTLTPTLWGLPV 322 (390) T ss_pred ------ccccccchHHHHHHHHHhhccccC-------CCCEEEEcHHHHHHHHHhhcCCCceeecCccCCCCceecceee Confidence 112233346777777776653321 234789999999988643 33343332111000 111222 Q ss_pred EEccccccccCC---CCccEEEEEEcccccccccccCCCcceEee-cchhhhccCceeccCceEEecccceeeeeeeccc Q lcl|Aclame:pro 305 MSAPELQGGNPD---DGKDIAYMFLDSVDTAVDGSTDGGDTWAQL-VQSKFVTLGVEKRVKNYVEAYSNATAGVMLKRPW 380 (388) Q Consensus 305 ~~~pel~~a~gt---g~~~~~~~~~~~~d~~~~~~~~~~~t~~~~-~p~~~r~~~v~~~~~~~~~~~~~~t~G~ii~rP~ 380 (388) +..+.+. + ++ |.-+..+.+....+ -++... -...|+..- ...-+..|. |..+++|. T Consensus 323 ~~~~~~~-~-~~~~~gd~~~~~~~~~~~~----------~~i~~~~~~~~f~~~~-------~~~r~~~r~-d~~v~~~~ 382 (390) T protein:vir:97 323 VATQAMA-P-GEFLVGAFDLAAQIFDQWD----------ARVEIGYVNDDFQRNM-------VTVLAEERL-ALVVYRPE 382 (390) T ss_pred EEcCCCC-C-CcEEEEeccceEEEEEecc----------eEEEEeecccccccCc-------EEEEEEEee-ccEEeccc Confidence 2222221 1 00 00011111111100 001100 001122111 111222233 45566688 Q ss_pred cceeeccC Q lcl|Aclame:pro 381 AVVRLIGL 388 (388) Q Consensus 381 ai~~~~GI 388 (388) ||+..+== T Consensus 383 a~v~~~~a 390 (390) T protein:vir:97 383 ALITGSFA 390 (390) T ss_pred cEEEEEeC Confidence 88664322 No 52 >protein:vir:8102 Length: 543 # NCBI annotation: gp6 # Family: family:all:21 # MgeID: mge:152 # MgeName: Che9c # Cross-refs: genbank:acc:NP_817683;genbank:gi:29566114;genbank:GeneID:1259308 Probab=96.02 E-value=0.00015 Score=41.56 Aligned_cols=343 Identities=10% Similarity=-0.007 Sum_probs=148.1 Q ss_pred CC-----CcceeeeecC------ccccchhhhhhcccccccccCCHHHHhhcce--ecccchhhcchhhhhhhhhhhhcc Q lcl|Aclame:pro 1 MK-----QLSKVHQSLA------GRSVRAFDMANGKADYRLTDMAVRELKKFGL--VFDHATVKRQIELLHEGGVATQAF 67 (388) Q Consensus 1 ~~-----~~~~~~~~~~------~~~~~~~~~~~~~~~~~~~~~~~~~l~~~g~--~~~~~~~~~~~~~~~~~~~~~~am 67 (388) |. ++.+....+. .+..+.++.......-......-.+..+... ........+.. . ...++ T Consensus 173 ~~~~~~~~~e~l~~~~e~~~~~~~~~~~~~d~~e~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~l~~-----~--e~~~~ 245 (543) T protein:vir:81 173 LRARALSAIEKMQGASDNVRAAATKIIERFDDEDSTLARQCLATSSPAYLRAWSKMARNPHAAILTE-----E--EKRAI 245 (543) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhhhHHHHHHHhhHHHHhhh-----h--hhhhh Confidence 00 0000000000 0000000000000000000000000000000 00000000000 0 00111 Q ss_pred CcccccccccccchH--HHHHHHhhcceee-eecccchhhhhhcccccCCCCceeeEEEeeeccccceEecccccCCcee Q lcl|Aclame:pro 68 DSAYVAPTTQASIPT--PIQFLQQWLPGFV-KVLTSARKIDEILGVKTVGSWEDQEIVQGIVEPAGTAMEYGDLTNIPLS 144 (388) Q Consensus 68 Daa~~~~~t~~~~g~--l~~~l~~idp~v~-e~l~~~~~~~~i~~v~t~g~w~~~t~~~~v~e~~G~a~~ygd~~diP~~ 144 (388) ........+.++.|+ |..+. ++++ +.+..-...+.+..+.+. ...+.+++....+.+...+....+|.. T Consensus 246 ~~~~~~~~t~~~gg~lip~~~~----~~ii~~~~~~~~~l~~~~~~~~~----~g~~~~~~~~~~~~a~~v~Eg~~~~~~ 317 (543) T protein:vir:81 246 NEVRAMGLTKADGGYLVPFQLD----PTVIITSNGSLNDIRRFARQVVA----TGDVWHGVSSAAVQWSWDAEFEEVSDD 317 (543) T ss_pred hhhhhcccccccCcccCchhhh----hHHHHHHHhhhchhhhhcccccC----CcceEEEEecCCcceeecccCcccccc Confidence 111111223344443 33333 2333 222221223344333222 123456666666677788889999999 Q ss_pred eeeeeeeeeeEEEEEEEEeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEEeecCccccceEEEeecCCCcccc Q lcl|Aclame:pro 145 SWNVNFERRTIVRGEMGIQVGLLEEGRASAMRINSAEVKRQGAAVQLEIMRNAIGFYGWEGKNGNRTFGFLNDPSLLPAI 224 (388) Q Consensus 145 ~~n~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~K~~aAr~a~~~~~n~i~~~G~a~~~~~g~~GllN~P~l~a~~ 224 (388) +.........++.++..+.++.+ +-.- ..++.+.-......++...+|.-.++|+.. .....|+++++...... T Consensus 318 ~~~~~~i~~~~~k~~~~~~is~e-ll~d---~~~~~~~i~~~l~~~~~~~~d~ail~G~Gt--~~~p~Gi~~~~~~~~~~ 391 (543) T protein:vir:81 318 SPEFGQPEIPVKKAQGFVPISIE-ALQD---EANVTETVALLFAEGKDELEAVTLTTGTGQ--GNQPTGIVTALAGTAAE 391 (543) T ss_pred ccccceeeeeeeeeEeeehhhHH-HHhc---cHHHHHHHHHHHHHHHHHHHHHHHhccCCC--Ccccccchhhccccccc Confidence 99999999999999999999985 4332 248899999999999999999999999743 23578999876542211 Q ss_pred ccccCCcccccccCCHHHHHHHHHHHHHHHHHhcCCeeccccccceEEcCHHHHHhhccC-CCcCccHHHHHHHhCC--- Q lcl|Aclame:pro 225 ASTTPGGWVSGGANAFQGIVGDLRLMLITLRVQSEDNIDPEDVDITLVLPMNKVDMLSVV-TDLGISVRDWLKQTYP--- 300 (388) Q Consensus 225 ~~~~~~~~t~Wa~kT~~eI~~DI~~~~~~l~~~s~g~v~~~~~p~tL~Lp~~~~~~Ls~~-~~~~~Tvl~~lk~n~p--- 300 (388) .. ...+..-.++|+.+++..+-..-. ....++|.+..+..|.+. +..|.=++.-+...-| T Consensus 392 ~~---------~~~~~~~~~~~~~~~~~~l~~~~~-------~~~~~v~n~~~~~~l~~lkd~~G~~l~~~~~~g~~~~l 455 (543) T protein:vir:81 392 IA---------PVTAETFALADVYAVYEQLAARHR-------RQGAWLANNLIYNKIRQFDTQGGAGLWTTIGNGEPSQL 455 (543) T ss_pred cc---------ccccccccHHHHHHHHHhhhcccc-------CCcEEEEcHHHHHHHHHhhcCCCceeccCcCCCCCccc Confidence 11 111222346788888777653221 123689999999988542 3333322221111111 Q ss_pred -ccEEEEc---cccccccCCCCccEEEEEEcccccccccccCCCcceEeec-chhhhccCceeccCceEEecccceeeee Q lcl|Aclame:pro 301 -RVRVMSA---PELQGGNPDDGKDIAYMFLDSVDTAVDGSTDGGDTWAQLV-QSKFVTLGVEKRVKNYVEAYSNATAGVM 375 (388) Q Consensus 301 -nl~i~~~---pel~~a~gtg~~~~~~~~~~~~d~~~~~~~~~~~t~~~~~-p~~~r~~~v~~~~~~~~~~~~~~t~G~i 375 (388) ++.++.. |..... +...++..++|. +..........+ +...+ |..+ ..-....-...+-...+.+| . T Consensus 456 ~G~pv~~~~~~~~~~~~-~~~~~~~~i~~g-d~~~~~i~~~~~---~~i~~~~~~~--~~~~~~~~~~~~~~~~r~d~-~ 527 (543) T protein:vir:81 456 LGRPVGEAEAMDANWNT-SASADNFVLLYG-NFQNYVIADRIG---MTVEFIPHLF--GTNRRPNGSRGWFAYYRMGA-D 527 (543) T ss_pred cceeeEEeccccccccc-cccCCcceEEEe-eccceeEEeecc---cEEEEecccc--ccchhhcCceEEEEEEeecc-E Confidence 1223322 222211 112233333432 222211111111 11111 1100 00000011122233334444 5 Q ss_pred eeccccceeeccC Q lcl|Aclame:pro 376 LKRPWAVVRLIGL 388 (388) Q Consensus 376 i~rP~ai~~~~GI 388 (388) +++|-||+.+.-- T Consensus 528 v~~~~A~~~l~~~ 540 (543) T protein:vir:81 528 VVNPNAFRLLNVE 540 (543) T ss_pred eecccceEEEEec Confidence 5669999887766 No 53 >protein:vir:100135 Length: 418 # NCBI annotation: gp5 # Family: family:all:585 # MgeID: mge:1639 # MgeName: phi1026b # Cross-refs: genbank:acc:NP_945035;genbank:gi:38707895;genbank:GeneID:2744182 Probab=96.01 E-value=0.0011 Score=36.75 Aligned_cols=317 Identities=11% Similarity=0.020 Sum_probs=146.3 Q ss_pred CCCcceeeeecCccccchhhhhhcccccccccCCHHHHhhcceecccchhhcchhhhhhhhhhhhccCcccccccccccc Q lcl|Aclame:pro 1 MKQLSKVHQSLAGRSVRAFDMANGKADYRLTDMAVRELKKFGLVFDHATVKRQIELLHEGGVATQAFDSAYVAPTTQASI 80 (388) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~amDaa~~~~~t~~~~ 80 (388) ..+.......-..+..... .. .-.+++.+.-.+. .......+....... -+.. ..+.++. T Consensus 84 ~~~~~~~~~~~~~~~~~~~--~~----------~~~~~~~~~~~~~-~~~~~~~~~~~~~~~-----~~~~--~~~~~~~ 143 (418) T protein:vir:10 84 LARGGGSAELETPKTLGQL--VT----------ESEEMKGMDGSAR-KSVRVRVDRKSIMNV-----PATV--GSGVSGS 143 (418) T ss_pred HhhcccccccchhhhhhHH--hh----------hHHHHHHHHHHHh-hhhhhhhHHHHHHHh-----hhhc--cCCCCCC Confidence 0000000000000000000 00 0001110000000 000011111110000 0010 1123334 Q ss_pred h--HHHHHHHhhcceeeeecccchhhhhhcccccCCCCceeeEEEeeecc-ccceEecccccCCceeeeeeeeeeeeEEE Q lcl|Aclame:pro 81 P--TPIQFLQQWLPGFVKVLTSARKIDEILGVKTVGSWEDQEIVQGIVEP-AGTAMEYGDLTNIPLSSWNVNFERRTIVR 157 (388) Q Consensus 81 g--~l~~~l~~idp~v~e~l~~~~~~~~i~~v~t~g~w~~~t~~~~v~e~-~G~a~~ygd~~diP~~~~n~~~~~~~v~~ 157 (388) | +|..+ .++|++.+......++++++...+. .+..+.+... .+.+...+....+|..+...+......+. T Consensus 144 g~lvp~~~----~~~ii~~~~~~~~l~~~~~~~~~~~---~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~f~~v~~~~~k 216 (418) T protein:vir:10 144 NSLVVADR----QAGIIAPPQRKMTIRDLLMPGQTSS---SSIEYTVETGFTNNAAAVAEGAQKPTSDLKFNLKNQPVRT 216 (418) T ss_pred ccccchhH----HHHHHHHHhhhhhHHhhcceeeccC---CceeEEEEecCCCceeeeccCccccccccceeeEEEeeee Confidence 4 34433 3467777777777777766654433 2344555433 34556778888999999999999999999 Q ss_pred EEEEEeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEEeecCccccceEEEeecCCCccccccccCCccccccc Q lcl|Aclame:pro 158 GEMGIQVGLLEEGRASAMRINSAEVKRQGAAVQLEIMRNAIGFYGWEGKNGNRTFGFLNDPSLLPAIASTTPGGWVSGGA 237 (388) Q Consensus 158 ~~~~~~y~~~El~~A~~~g~~l~~~K~~aAr~a~~~~~n~i~~~G~a~~~~~g~~GllN~P~l~a~~~~~~~~~~t~Wa~ 237 (388) +...+.++.+=++.+ .++.+--......++...+|+-.++|+... ....|++|..++...... T Consensus 217 ~~~~~~is~ell~ds----~~l~~~i~~~l~~a~~~~~d~a~l~G~g~~--~~p~Gi~~~~~~~~~~~~----------- 279 (418) T protein:vir:10 217 IAHLFKASRQILDDA----PALQSYIDGRARYGLQLTEEGQILKGDGTG--ANILGILPQASAFMPSIT----------- 279 (418) T ss_pred EEEeehhhHHHHHhH----HHHHHHHHHHHHHHHHHHHHHHHhccCCCC--cccccccccccccccccc----------- Confidence 999999988544332 267888888888899999999999997532 247899998765322111 Q ss_pred CCHHHHHHHHHHHHHHHHHhcCCeeccccccceEEcCHHHHHhhccC-CCcCccHHHHHHHh----CCccEEEEcccccc Q lcl|Aclame:pro 238 NAFQGIVGDLRLMLITLRVQSEDNIDPEDVDITLVLPMNKVDMLSVV-TDLGISVRDWLKQT----YPRVRVMSAPELQG 312 (388) Q Consensus 238 kT~~eI~~DI~~~~~~l~~~s~g~v~~~~~p~tL~Lp~~~~~~Ls~~-~~~~~Tvl~~lk~n----~pnl~i~~~pel~~ 312 (388) .+...-++||..++..+...- ..+..++|.+..+..|... +..|.-++.=.... +-++.++..+.+.. T Consensus 280 ~~~~~~~~~i~~~~~~~~~~~-------~~~~~~v~n~~~~~~L~~lkd~~G~~i~~~~~~~~~~~l~G~pV~~~~~~p~ 352 (418) T protein:vir:10 280 LANATPIDKIRLALLQAVLAE-------FPATGIVLNPIDWASIELTKDSQGRYIVGNPVNGTTPRLWNLPVVETQAMTA 352 (418) T ss_pred ccccccHHHHHHHHHhhcccc-------CCCCEEEEcHHHHHHHHHhhcCCCceeccccccCCCceecceeeEEcCCCCC Confidence 111112567777776664321 1234699999999888543 33344333211110 11122222222210 Q ss_pred cc-CCCC-ccEEEEEEcccccccccccCCCcceEeecch---hhhccCceeccCceEEecccceeeeeeeccccceeecc Q lcl|Aclame:pro 313 GN-PDDG-KDIAYMFLDSVDTAVDGSTDGGDTWAQLVQS---KFVTLGVEKRVKNYVEAYSNATAGVMLKRPWAVVRLIG 387 (388) Q Consensus 313 a~-gtg~-~~~~~~~~~~~d~~~~~~~~~~~t~~~~~p~---~~r~~~v~~~~~~~~~~~~~~t~G~ii~rP~ai~~~~G 387 (388) .. --|. +....++.+ . .-++.. -+. .|... ....-+..+.+ +.+++|-||++.+. T Consensus 353 ~~~~~gd~s~~~~~~~~-~----------~~~i~~-~~~~~~~f~~~-------~~~~r~~~~~d-~~~~~~~a~~~~~~ 412 (418) T protein:vir:10 353 NEFLVGAFSMAAQIFDR-M----------EIEVLL-STENVDDFEKN-------MVSIRAEERLA-LAVYRPESFVTGAL 412 (418) T ss_pred CcEEEeeccceEEEEEe-c----------ceEEEE-ecccchhhhcC-------ceEEEEEEeec-cEEecccceEEEEe Confidence 00 0000 011111110 0 000100 000 01111 11222333444 45778999999888 Q ss_pred C Q lcl|Aclame:pro 388 L 388 (388) Q Consensus 388 I 388 (388) . T Consensus 413 ~ 413 (418) T protein:vir:10 413 V 413 (418) T ss_pred c Confidence 8 No 54 >protein:vir:191 Length: 385 # NCBI annotation: major head subunit precursor # Family: family:all:585 # MgeID: mge:6 # MgeName: HK97 # Cross-refs: genbank:acc:NP_037701;genbank:gi:9634158;genbank:GeneID:1262530 Probab=95.96 E-value=0.0001 Score=42.43 Aligned_cols=342 Identities=11% Similarity=-0.030 Sum_probs=150.2 Q ss_pred CCCcceeeeecCccc-------------cchh-------hhhhcccccccc--cCCHHHHhhc---ceecccchh----h Q lcl|Aclame:pro 1 MKQLSKVHQSLAGRS-------------VRAF-------DMANGKADYRLT--DMAVRELKKF---GLVFDHATV----K 51 (388) Q Consensus 1 ~~~~~~~~~~~~~~~-------------~~~~-------~~~~~~~~~~~~--~~~~~~l~~~---g~~~~~~~~----~ 51 (388) |+++-...-.+.... ++.+ +.......-.+. +....++++. +..-+.... . T Consensus 1 M~~l~el~~~~~~~~~e~~~l~~~~~~e~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (385) T protein:vir:19 1 MSELALIQKAIEESQQKMTQLFDAQKAEIESTGQVSKQLQSDLMKVQEELTKSGTRLFDLEQKLASGAENPGEKKSFSER 80 (385) T ss_pred ChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccchhhhhHHH Confidence 111000000000000 0000 000000000000 0000001000 000000000 0 Q ss_pred cchhhhhhhhhhh-----hccCcccccccccccchHHHHHHHhhcceeeeecccchhhhhhcccccCCCCceeeEEEeee Q lcl|Aclame:pro 52 RQIELLHEGGVAT-----QAFDSAYVAPTTQASIPTPIQFLQQWLPGFVKVLTSARKIDEILGVKTVGSWEDQEIVQGIV 126 (388) Q Consensus 52 ~~~~~~~~~~~~~-----~amDaa~~~~~t~~~~g~l~~~l~~idp~v~e~l~~~~~~~~i~~v~t~g~w~~~t~~~~v~ 126 (388) ...++........ ...-++.. .+..+.|.++. +.+.+.|++.+......++++++...+. ..+.|++. T Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~g~~i~--~~~~~~ii~~~~~~~~l~~~~~~~~~~~---~~~~~~~~ 153 (385) T protein:vir:19 81 AAEELIKSWDGKQGTFGAKTFNKSLG--SDADSAGSLIQ--PMQIPGIIMPGLRRLTIRDLLAQGRTSS---NALEYVRE 153 (385) T ss_pred HHHHHHHHHHHhhccchhhHHHhhhc--cccccCCceec--chhhhHHHHHhhhccchhhhcceecccC---cceEEEEE Confidence 0000000000000 00001111 11122222221 1234577777777777777777654433 23456666 Q ss_pred cc-ccceEecccccCCceeeeeeeeeeeeEEEEEEEEeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEEeecC Q lcl|Aclame:pro 127 EP-AGTAMEYGDLTNIPLSSWNVNFERRTIVRGEMGIQVGLLEEGRASAMRINSAEVKRQGAAVQLEIMRNAIGFYGWEG 205 (388) Q Consensus 127 e~-~G~a~~ygd~~diP~~~~n~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~K~~aAr~a~~~~~n~i~~~G~a~ 205 (388) +. .+.+...+.+..+|..+..........+.++..+.++.+ +-.-. .++...-....+.++...+|+-.+.|+.. T Consensus 154 ~~~~~~a~~v~E~~~~~~~~~~~~~~~~~~~k~~~~~~is~e-ll~d~---~~l~~~i~~~la~a~~~~~d~~~l~G~g~ 229 (385) T protein:vir:19 154 EVFTNNADVVAEKALKPESDITFSKQTANVKTIAHWVQASRQ-VMDDA---PMLQSYINNRLMYGLALKEEGQLLNGDGT 229 (385) T ss_pred ecCCcceeeeccCccccccccceeEEEEeeeeEEEeehhhHH-HHhhH---HHHHHHHHHHHHHHHHHHHHHHHHhccCC Confidence 54 345667788889999999999999999999999999974 43322 25777777888888888889889999753 Q ss_pred ccccceEEEeecCCCccccccccCCcccccccCCHHHHHHHHHHHHHHHHHhcCCeeccccccceEEcCHHHHHhhccC- Q lcl|Aclame:pro 206 KNGNRTFGFLNDPSLLPAIASTTPGGWVSGGANAFQGIVGDLRLMLITLRVQSEDNIDPEDVDITLVLPMNKVDMLSVV- 284 (388) Q Consensus 206 ~~~~g~~GllN~P~l~a~~~~~~~~~~t~Wa~kT~~eI~~DI~~~~~~l~~~s~g~v~~~~~p~tL~Lp~~~~~~Ls~~- 284 (388) . ....|+++.++..+... ..+.+..++||..++..+...- ..+..++|++..+..|... T Consensus 230 ~--~~~~Gi~~~~~~~~~~~-----------~~~~~~~~d~i~~~~~~l~~~~-------~~~~~~~~~~~~~~~l~~lk 289 (385) T protein:vir:19 230 G--DNLEGLNKVATAYDTSL-----------NATGDTRADIIAHAIYQVTESE-------FSASGIVLNPRDWHNIALLK 289 (385) T ss_pred C--Ccccccccccccccccc-----------cccccchHHHHHHHHHhhcccc-------CCCCEEEEcHHHHHHHHHhh Confidence 2 34679999876532111 1122234677888877775432 1245799999999988542 Q ss_pred CCcCccHHHHHHHhCC----ccEEEEccccccccCCCCccEEEEEEcccccccccccCCCcceEeecchhhhccCceecc Q lcl|Aclame:pro 285 TDLGISVRDWLKQTYP----RVRVMSAPELQGGNPDDGKDIAYMFLDSVDTAVDGSTDGGDTWAQLVQSKFVTLGVEKRV 360 (388) Q Consensus 285 ~~~~~Tvl~~lk~n~p----nl~i~~~pel~~a~gtg~~~~~~~~~~~~d~~~~~~~~~~~t~~~~~p~~~r~~~v~~~~ 360 (388) +..|.-++.-.....+ .+.++..+.+. + + .+++..-+ .......... ....+....+ -.... T Consensus 290 d~~G~~l~~~~~~~~~~~l~G~pV~~~~~~p-~-----~-~~~~gd~~--~~~~~~~~~~--~~v~~~~~~~---~~~~~ 355 (385) T protein:vir:19 290 DNEGRYIFGGPQAFTSNIMWGLPVVPTKAQA-A-----G-TFTVGGFD--MASQVWDRMD--ATVEVSREDR---DNFVK 355 (385) T ss_pred cCCCceeccCcccCCCceecceeeEEcCcCC-C-----C-cEEEeecc--cEEEEEEecc--eEEEEecccc---chhhc Confidence 3344433321111111 12222222221 1 1 11111100 0000000000 0000000000 00011 Q ss_pred CceEEecccceeeeeeeccccceeeccC Q lcl|Aclame:pro 361 KNYVEAYSNATAGVMLKRPWAVVRLIGL 388 (388) Q Consensus 361 ~~~~~~~~~~t~G~ii~rP~ai~~~~GI 388 (388) -.+.+.+..|.+|. +++|-+|++++.- T Consensus 356 ~~~~~~~~~r~~~~-v~~~~a~~~~~~~ 382 (385) T protein:vir:19 356 NMLTILCEERLALA-HYRPTAIIKGTFS 382 (385) T ss_pred CcEEEEEEEeeccE-EecccceEEEEec Confidence 12334455566654 4679999999888 No 55 >protein:vir:1886 Length: 385 # NCBI annotation: major capsid subunit precursor # Family: family:all:585 # MgeID: mge:41 # MgeName: HK022 # Cross-refs: genbank:acc:NP_037666;genbank:gi:9634124;genbank:GeneID:1262513 Probab=95.96 E-value=0.0001 Score=42.43 Aligned_cols=342 Identities=11% Similarity=-0.030 Sum_probs=150.2 Q ss_pred CCCcceeeeecCccc-------------cchh-------hhhhcccccccc--cCCHHHHhhc---ceecccchh----h Q lcl|Aclame:pro 1 MKQLSKVHQSLAGRS-------------VRAF-------DMANGKADYRLT--DMAVRELKKF---GLVFDHATV----K 51 (388) Q Consensus 1 ~~~~~~~~~~~~~~~-------------~~~~-------~~~~~~~~~~~~--~~~~~~l~~~---g~~~~~~~~----~ 51 (388) |+++-...-.+.... ++.+ +.......-.+. +....++++. +..-+.... . T Consensus 1 M~~l~el~~~~~~~~~e~~~l~~~~~~e~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (385) T protein:vir:18 1 MSELALIQKAIEESQQKMTQLFDAQKAEIESTGQVSKQLQSDLMKVQEELTKSGTRLFDLEQKLASGAENPGEKKSFSER 80 (385) T ss_pred ChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccchhhhhHHH Confidence 111000000000000 0000 000000000000 0000001000 000000000 0 Q ss_pred cchhhhhhhhhhh-----hccCcccccccccccchHHHHHHHhhcceeeeecccchhhhhhcccccCCCCceeeEEEeee Q lcl|Aclame:pro 52 RQIELLHEGGVAT-----QAFDSAYVAPTTQASIPTPIQFLQQWLPGFVKVLTSARKIDEILGVKTVGSWEDQEIVQGIV 126 (388) Q Consensus 52 ~~~~~~~~~~~~~-----~amDaa~~~~~t~~~~g~l~~~l~~idp~v~e~l~~~~~~~~i~~v~t~g~w~~~t~~~~v~ 126 (388) ...++........ ...-++.. .+..+.|.++. +.+.+.|++.+......++++++...+. ..+.|++. T Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~g~~i~--~~~~~~ii~~~~~~~~l~~~~~~~~~~~---~~~~~~~~ 153 (385) T protein:vir:18 81 AAEELIKSWDGKQGTFGAKTFNKSLG--SDADSAGSLIQ--PMQIPGIIMPGLRRLTIRDLLAQGRTSS---NALEYVRE 153 (385) T ss_pred HHHHHHHHHHHhhccchhhHHHhhhc--cccccCCceec--chhhhHHHHHhhhccchhhhcceecccC---cceEEEEE Confidence 0000000000000 00001111 11122222221 1234577777777777777777654433 23456666 Q ss_pred cc-ccceEecccccCCceeeeeeeeeeeeEEEEEEEEeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEEeecC Q lcl|Aclame:pro 127 EP-AGTAMEYGDLTNIPLSSWNVNFERRTIVRGEMGIQVGLLEEGRASAMRINSAEVKRQGAAVQLEIMRNAIGFYGWEG 205 (388) Q Consensus 127 e~-~G~a~~ygd~~diP~~~~n~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~K~~aAr~a~~~~~n~i~~~G~a~ 205 (388) +. .+.+...+.+..+|..+..........+.++..+.++.+ +-.-. .++...-....+.++...+|+-.+.|+.. T Consensus 154 ~~~~~~a~~v~E~~~~~~~~~~~~~~~~~~~k~~~~~~is~e-ll~d~---~~l~~~i~~~la~a~~~~~d~~~l~G~g~ 229 (385) T protein:vir:18 154 EVFTNNADVVAEKALKPESDITFSKQTANVKTIAHWVQASRQ-VMDDA---PMLQSYINNRLMYGLALKEEGQLLNGDGT 229 (385) T ss_pred ecCCcceeeeccCccccccccceeEEEEeeeeEEEeehhhHH-HHhhH---HHHHHHHHHHHHHHHHHHHHHHHHhccCC Confidence 54 345667788889999999999999999999999999974 43322 25777777888888888889889999753 Q ss_pred ccccceEEEeecCCCccccccccCCcccccccCCHHHHHHHHHHHHHHHHHhcCCeeccccccceEEcCHHHHHhhccC- Q lcl|Aclame:pro 206 KNGNRTFGFLNDPSLLPAIASTTPGGWVSGGANAFQGIVGDLRLMLITLRVQSEDNIDPEDVDITLVLPMNKVDMLSVV- 284 (388) Q Consensus 206 ~~~~g~~GllN~P~l~a~~~~~~~~~~t~Wa~kT~~eI~~DI~~~~~~l~~~s~g~v~~~~~p~tL~Lp~~~~~~Ls~~- 284 (388) . ....|+++.++..+... ..+.+..++||..++..+...- ..+..++|++..+..|... T Consensus 230 ~--~~~~Gi~~~~~~~~~~~-----------~~~~~~~~d~i~~~~~~l~~~~-------~~~~~~~~~~~~~~~l~~lk 289 (385) T protein:vir:18 230 G--DNLEGLNKVATAYDTSL-----------NATGDTRADIIAHAIYQVTESE-------FSASGIVLNPRDWHNIALLK 289 (385) T ss_pred C--Ccccccccccccccccc-----------cccccchHHHHHHHHHhhcccc-------CCCCEEEEcHHHHHHHHHhh Confidence 2 34679999876532111 1122234677888877775432 1245799999999988542 Q ss_pred CCcCccHHHHHHHhCC----ccEEEEccccccccCCCCccEEEEEEcccccccccccCCCcceEeecchhhhccCceecc Q lcl|Aclame:pro 285 TDLGISVRDWLKQTYP----RVRVMSAPELQGGNPDDGKDIAYMFLDSVDTAVDGSTDGGDTWAQLVQSKFVTLGVEKRV 360 (388) Q Consensus 285 ~~~~~Tvl~~lk~n~p----nl~i~~~pel~~a~gtg~~~~~~~~~~~~d~~~~~~~~~~~t~~~~~p~~~r~~~v~~~~ 360 (388) +..|.-++.-.....+ .+.++..+.+. + + .+++..-+ .......... ....+....+ -.... T Consensus 290 d~~G~~l~~~~~~~~~~~l~G~pV~~~~~~p-~-----~-~~~~gd~~--~~~~~~~~~~--~~v~~~~~~~---~~~~~ 355 (385) T protein:vir:18 290 DNEGRYIFGGPQAFTSNIMWGLPVVPTKAQA-A-----G-TFTVGGFD--MASQVWDRMD--ATVEVSREDR---DNFVK 355 (385) T ss_pred cCCCceeccCcccCCCceecceeeEEcCcCC-C-----C-cEEEeecc--cEEEEEEecc--eEEEEecccc---chhhc Confidence 3344433321111111 12222222221 1 1 11111100 0000000000 0000000000 00011 Q ss_pred CceEEecccceeeeeeeccccceeeccC Q lcl|Aclame:pro 361 KNYVEAYSNATAGVMLKRPWAVVRLIGL 388 (388) Q Consensus 361 ~~~~~~~~~~t~G~ii~rP~ai~~~~GI 388 (388) -.+.+.+..|.+|. +++|-+|++++.- T Consensus 356 ~~~~~~~~~r~~~~-v~~~~a~~~~~~~ 382 (385) T protein:vir:18 356 NMLTILCEERLALA-HYRPTAIIKGTFS 382 (385) T ss_pred CcEEEEEEEeeccE-EecccceEEEEec Confidence 12334455566654 4679999999888 No 56 >protein:vir:10364 Length: 390 # NCBI annotation: head protein; major capsid subunit precursor # Family: family:all:585 # MgeID: mge:183 # MgeName: Xp10 # Cross-refs: genbank:acc:NP_858956;genbank:gi:32128421;genbank:GeneID:2648357 Probab=95.90 E-value=0.00025 Score=40.35 Aligned_cols=343 Identities=12% Similarity=0.033 Sum_probs=143.8 Q ss_pred CCCc---ceeeeecCccccchhhhhhc---cccccccc--CCHHHHhhcceecccchhhcc------------------h Q lcl|Aclame:pro 1 MKQL---SKVHQSLAGRSVRAFDMANG---KADYRLTD--MAVRELKKFGLVFDHATVKRQ------------------I 54 (388) Q Consensus 1 ~~~~---~~~~~~~~~~~~~~~~~~~~---~~~~~~~~--~~~~~l~~~g~~~~~~~~~~~------------------~ 54 (388) |+.+ .+....+..-....++.... ...-.++. ....++++.+...+....... . T Consensus 19 ~~~~~e~~~~~~~~~~e~~~~~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 98 (390) T protein:vir:10 19 LRAFGERAVRDGELNASARSKVDELFATVGNLSAEVQAARQRVAELEGNGAGGDVQHVSVGDLFVASEQFQASAGRWNDR 98 (390) T ss_pred HHHHHHHHHhhcccCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccccchhhhhhhhHHHHHHHHhhhhh Confidence 0000 00000000000000000000 00000000 000000000000000000000 0 Q ss_pred hhhhhhhhhhhccCcccccccccccch-HHHHHHHhhcceeeeecccchhhhhhcccccCCCCceeeEEEeeecc-ccce Q lcl|Aclame:pro 55 ELLHEGGVATQAFDSAYVAPTTQASIP-TPIQFLQQWLPGFVKVLTSARKIDEILGVKTVGSWEDQEIVQGIVEP-AGTA 132 (388) Q Consensus 55 ~~~~~~~~~~~amDaa~~~~~t~~~~g-~l~~~l~~idp~v~e~l~~~~~~~~i~~v~t~g~w~~~t~~~~v~e~-~G~a 132 (388) ........ ....+++... .+.++.+ +|-.+. +++++.+......++++.+.+.+. ..+.|++.+. .+.+ T Consensus 99 ~~~~~~~~-~~~~~~~~~~-~~~~~g~~~~~~~~----~~ii~~~~~~~~l~~~~~~~~~~~---~~~~~~~~~~~~~~a 169 (390) T protein:vir:10 99 SARATMNI-KAALNTASTD-AAGSAGALTTPNRL----PGFITQPDARLTVRDLIGSGRTDS---ALIEYVQETGFVNNA 169 (390) T ss_pred hhhhhhHH-HHHHHhhhcc-cccccccccchhHH----HHHHHHHHhhchhhhhcceeeccC---CceEEEEEecCCcce Confidence 00000000 0111111111 1222222 232222 456676666666677766655433 2345555543 4567 Q ss_pred EecccccCCceeeeeeeeeeeeEEEEEEEEeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEEeecCccccceE Q lcl|Aclame:pro 133 MEYGDLTNIPLSSWNVNFERRTIVRGEMGIQVGLLEEGRASAMRINSAEVKRQGAAVQLEIMRNAIGFYGWEGKNGNRTF 212 (388) Q Consensus 133 ~~ygd~~diP~~~~n~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~K~~aAr~a~~~~~n~i~~~G~a~~~~~g~~ 212 (388) ...+....+|-.+..........+.+...+.++.+ +-.-. .++.+.-....++++...+|+-.+.|+.. ..... T Consensus 170 ~~v~Eg~~~~~~~~~~~~i~~~~~k~~~~~~is~e-ll~d~---~~l~~~i~~~l~~~~~~~~~~~il~G~G~--~~~p~ 243 (390) T protein:vir:10 170 AIVAEGALKPESSLKFAKKTDTTHVIAHTMKATRQ-ILSDA---PQLASYMNNRLIRGLKVKEDAEILRGTGA--NDGLL 243 (390) T ss_pred eeecCCccccccccceeEEEEeeEEEEEeehhhHH-HHHhH---HHHHHHHHHHHHHHHHHHHHHHHhhcCCC--Ccccc Confidence 77788889999999999999999999999999985 43322 26788888888889999999999999642 34578 Q ss_pred EEeecCCCccccccccCCcccccccCCHHHHHHHHHHHHHHHHHhcCCeeccccccceEEcCHHHHHhhccC-CCcCccH Q lcl|Aclame:pro 213 GFLNDPSLLPAIASTTPGGWVSGGANAFQGIVGDLRLMLITLRVQSEDNIDPEDVDITLVLPMNKVDMLSVV-TDLGISV 291 (388) Q Consensus 213 GllN~P~l~a~~~~~~~~~~t~Wa~kT~~eI~~DI~~~~~~l~~~s~g~v~~~~~p~tL~Lp~~~~~~Ls~~-~~~~~Tv 291 (388) |++|.+++....+. .+....++||..++..+...-. .+..++|.|+.+..|.+. +..|.-+ T Consensus 244 Gi~~~~~~~~~~~~-----------~~~~~~~~~~~~~~~~l~~~~~-------~~~~~v~n~~~~~~L~~lkd~~g~~l 305 (390) T protein:vir:10 244 GLIPQATTYAAPTT-----------IAGATRVDQLRLAMLQASLAEY-------PASGIVINPIDWAAIELAKDANNQYL 305 (390) T ss_pred cccccccccccccc-----------ccccchHHHHHHHHHhhccccC-------CCCEEEEcHHHHHHHHHhhcCCCcee Confidence 99998765322111 1122235677777777754321 234789999999888653 3334433 Q ss_pred HHHHHHhCCccEEEEccccccccCCCCccEEEEEEcccccccccccCCCcceEeecchhhhccCceeccCceEEecccce Q lcl|Aclame:pro 292 RDWLKQTYPRVRVMSAPELQGGNPDDGKDIAYMFLDSVDTAVDGSTDGGDTWAQLVQSKFVTLGVEKRVKNYVEAYSNAT 371 (388) Q Consensus 292 l~~lk~n~pnl~i~~~pel~~a~gtg~~~~~~~~~~~~d~~~~~~~~~~~t~~~~~p~~~r~~~v~~~~~~~~~~~~~~t 371 (388) +.--...-+. +|-..|=..... -..+. + +| -+............-++. +. + ...-...-....-+..|. T Consensus 306 ~~~~~~~~~~-~l~G~pv~~~~~-~p~~~-~-~~-gdf~~~~~~~~~~~~~i~--~~---~-~~~~~~~~~~~~r~~~r~ 374 (390) T protein:vir:10 306 IGNARGTLTP-TLWGLPVVATQA-MAPGE-F-LV-GAFDLAAQIFDQWDARVE--IG---Y-VNDDFQRNMVTVLAEERL 374 (390) T ss_pred ecCCcCcCCc-eecceeeEEcCC-CCCCc-E-EE-EeccceEEEEEecceEEE--Ee---e-cccccccCcEEEEEEEee Confidence 2211111111 222222111110 00111 1 11 111000000000000000 00 0 000011111222233344 Q ss_pred eeeeeeccccceeeccC Q lcl|Aclame:pro 372 AGVMLKRPWAVVRLIGL 388 (388) Q Consensus 372 ~G~ii~rP~ai~~~~GI 388 (388) |+.+++|.||+..+== T Consensus 375 -d~~v~~~~a~~~~~~a 390 (390) T protein:vir:10 375 -ALVVYRPEALISGSFA 390 (390) T ss_pred -ccEEeccccEEEEEeC Confidence 4466778888654311 No 57 >protein:vir:6212 Length: 434 # NCBI annotation: prohead protease # Family: family:all:21 # MgeID: mge:128 # MgeName: phBC6A52 # Cross-refs: genbank:acc:NP_852592;genbank:gi:31415852;genbank:GeneID:1489210 Probab=95.60 E-value=0.00025 Score=40.30 Aligned_cols=331 Identities=11% Similarity=0.008 Sum_probs=146.9 Q ss_pred CCCcceeeeecC----ccccchhhhhhc--------cccccccc-CCHHHHhhcceecccchhhcchhhhhhhhhhhhcc Q lcl|Aclame:pro 1 MKQLSKVHQSLA----GRSVRAFDMANG--------KADYRLTD-MAVRELKKFGLVFDHATVKRQIELLHEGGVATQAF 67 (388) Q Consensus 1 ~~~~~~~~~~~~----~~~~~~~~~~~~--------~~~~~~~~-~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~am 67 (388) .....+...... .+.......... ....+... ....+.++ .|......... .....++ T Consensus 73 ~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~e~r~---a~~~~l~~~~~------~~e~~a~ 143 (434) T protein:vir:62 73 DDDPEKKEDPTAKENPNEKTELSEEQRSAISASIAAALSTKGHRTNKETEIRS---VFANYIVGNID------EKEARAL 143 (434) T ss_pred cchhhhhcchhhhcchhhhHHHHHHHHHHHHHHHHhhhhhccccchHHHHHHH---HHHHHhccccc------hhhhhhh Confidence 000000000000 000000000000 00000000 00000000 00000000000 0001111 Q ss_pred CcccccccccccchHHHHHHHhhcceeeeecccchhhhhhcccccCCCCceeeEEEeeeccccceEec---ccccCCcee Q lcl|Aclame:pro 68 DSAYVAPTTQASIPTPIQFLQQWLPGFVKVLTSARKIDEILGVKTVGSWEDQEIVQGIVEPAGTAMEY---GDLTNIPLS 144 (388) Q Consensus 68 Daa~~~~~t~~~~g~l~~~l~~idp~v~e~l~~~~~~~~i~~v~t~g~w~~~t~~~~v~e~~G~a~~y---gd~~diP~~ 144 (388) + .++++.|+++- +.+...|++.+......+.+..+...+ ....|++....+.+... +...+.|.. T Consensus 144 -----~-~~t~~GG~lvP--~~~~~~Ii~~l~~~~~i~~~~~~~~~~----~~~~~p~~~~~~~a~~~~~~~e~~~~~~~ 211 (434) T protein:vir:62 144 -----G-LVTGNGSVTIP--DFLSKEIITYAQEENFLRRLGTGVKTK----ENIKYPVLVKKAEAQGHKNERTNNEMPET 211 (434) T ss_pred -----c-ccccccceecc--hhhHHHHHHhhhhhhhhhhhcceeccC----CceEEEEEecCCcccceeccccccccccc Confidence 1 12234454332 123345666555444445554332211 23456666555555432 335688888 Q ss_pred eeeeeeeeeeEEEEEEEEeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEEeecCccccceEEEeecCCCcccc Q lcl|Aclame:pro 145 SWNVNFERRTIVRGEMGIQVGLLEEGRASAMRINSAEVKRQGAAVQLEIMRNAIGFYGWEGKNGNRTFGFLNDPSLLPAI 224 (388) Q Consensus 145 ~~n~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~K~~aAr~a~~~~~n~i~~~G~a~~~~~g~~GllN~P~l~a~~ 224 (388) +.......-..+.+...+.++.+=|+- ..++|.+.-....+.++...+++-.+.|+.-. ...-|+++.+.+... T Consensus 212 ~~~f~~v~~~~~k~~~~~~iS~ell~d---s~~~l~~~i~~~la~~~~~~~d~~~l~G~G~~--~~~~g~~~~~~~~~~- 285 (434) T protein:vir:62 212 DIEFDEIELSPTEFDALATVTKKLLAR---TGLPIEQIVMDELKKAYVRKETQYMVNGDEAN--NINDGALAKKAVEFK- 285 (434) T ss_pred ccceeeEEeeheeeEeehhhHHHHHhc---chHHHHHHHHHHHHHHHHHHHHHHHhccCCCC--ccccceeeccccccc- Confidence 888888899999999988888853332 35678888888899999999999999997522 235577776654211 Q ss_pred ccccCCcccccccCCHHHHHHHHHHHHHHHHHhcCCeeccccccceEEcCHHHHHhhcc-CCCcCccHHH-HHHHh---- Q lcl|Aclame:pro 225 ASTTPGGWVSGGANAFQGIVGDLRLMLITLRVQSEDNIDPEDVDITLVLPMNKVDMLSV-VTDLGISVRD-WLKQT---- 298 (388) Q Consensus 225 ~~~~~~~~t~Wa~kT~~eI~~DI~~~~~~l~~~s~g~v~~~~~p~tL~Lp~~~~~~Ls~-~~~~~~Tvl~-~lk~n---- 298 (388) .+..-.++||.+++..+...-. . + -.++|.+..+..|.. .+..|.-++. ...-+ T Consensus 286 -------------~~~~~~~d~l~~l~~~l~~~~~----~-~--a~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~~~g~~ 345 (434) T protein:vir:62 286 -------------TDEKNLYDALVKMKNTPVKEVR----K-K--ARWVLNTAALTKIETMKTDDGFPLLRPFNQAEGGIG 345 (434) T ss_pred -------------ccccchhhHHHHHHhhcchhhh----c-C--CEEEEcHHHHHHHHHhhccCCCEeeccCCCccCCCC Confidence 0111235677777777754321 1 1 157888888888854 3333433322 11100 Q ss_pred --CCccEEEEccccccccCCCCccEEEEEEcccccccccccCCCcceEeecchhhhccCceeccCceEEecccceeeeee Q lcl|Aclame:pro 299 --YPRVRVMSAPELQGGNPDDGKDIAYMFLDSVDTAVDGSTDGGDTWAQLVQSKFVTLGVEKRVKNYVEAYSNATAGVML 376 (388) Q Consensus 299 --~pnl~i~~~pel~~a~gtg~~~~~~~~~~~~d~~~~~~~~~~~t~~~~~p~~~r~~~v~~~~~~~~~~~~~~t~G~ii 376 (388) ..+..++....+.. +.++....++| -+.......-..+..++.. ..+.|..- ...-.-...|..|-.| T Consensus 346 ~tl~G~pV~~~~~~~~--~~~~~~~~i~~-Gdfs~~~i~~~~g~~~i~~-~~~~~~~~------~~v~~~~~~r~Dgk~i 415 (434) T protein:vir:62 346 YTLLGFPVEEEDAIDI--PDSPDTPVFYF-GDFSKFYIQDVIGSLEVQK-LVELFSRT------NRVGFRIWNLLDAQLI 415 (434) T ss_pred ceecceeeEEecCccC--ccCCCceEEEE-eeccceEEEEeeceeEEEe-ehhhhccc------CceEEEEEeeecceee Confidence 11122333322221 22223333333 2221110110111222222 12222111 1122344567788899 Q ss_pred eccccceeeccC Q lcl|Aclame:pro 377 KRPWAVVRLIGL 388 (388) Q Consensus 377 ~rP~ai~~~~GI 388 (388) |+|.+++...+. T Consensus 416 ~~~~~~~~~~~~ 427 (434) T protein:vir:62 416 HSPFEVPVYKYV 427 (434) T ss_pred cCcccceEEEEE Confidence 999999977555 No 58 >protein:vir:97255 Length: 310 # NCBI annotation: hypothetical protein ORF017 # Family: family:all:1120 # MgeID: mge:1657 # MgeName: M6 # Cross-refs: genbank:acc:YP_001294525;genbank:gi:149408246;genbank:GeneID:5237120 Probab=95.57 E-value=0.00025 Score=40.29 Aligned_cols=277 Identities=12% Similarity=0.072 Sum_probs=126.7 Q ss_pred cCcccccccccccchHHHHHHHhhcceeeeecccchhhhhhccccc-CCCCceeeEEEeee-ccccce--EecccccCCc Q lcl|Aclame:pro 67 FDSAYVAPTTQASIPTPIQFLQQWLPGFVKVLTSARKIDEILGVKT-VGSWEDQEIVQGIV-EPAGTA--MEYGDLTNIP 142 (388) Q Consensus 67 mDaa~~~~~t~~~~g~l~~~l~~idp~v~e~l~~~~~~~~i~~v~t-~g~w~~~t~~~~v~-e~~G~a--~~ygd~~diP 142 (388) |.| +|-+-++.+.+ ..+..+|+|.+...-+..+.+|=+. +|. ++.|.-. ...|.+ ...-.+++.| T Consensus 1 mpa-----ltLaea~k~~~--d~l~~~ViE~~~~~s~lL~~LpF~~veg~----~~~ynR~~~~~~~~~~~v~~~~~~~g 69 (310) T protein:vir:97 1 MAS-----VTLAESAKLAQ--DELVAGVIENIITVNRMFDVLPFDSIEGN----SLAYNRENVLGDVIMAGVGTTFSGAG 69 (310) T ss_pred Ccc-----cchHHHhhcCc--chHHHHHHHHHhccchHHHhCCcccccCC----cceeeEeeccCCcccccccccccCCC Confidence 221 11111111111 1123456666666666666666432 222 2333332 222222 2222233333 Q ss_pred eeeeeeeeeeeeEEEEEEEEeecHHHHHHH--HHh-CC--ChHHHHHHHHHHHHHHhhceEEEEeecCccccceEEEeec Q lcl|Aclame:pro 143 LSSWNVNFERRTIVRGEMGIQVGLLEEGRA--SAM-RI--NSAEVKRQGAAVQLEIMRNAIGFYGWEGKNGNRTFGFLND 217 (388) Q Consensus 143 ~~~~n~~~~~~~v~~~~~~~~y~~~El~~A--~~~-g~--~l~~~K~~aAr~a~~~~~n~i~~~G~a~~~~~g~~GllN~ 217 (388) .......+ .-+.++..+--+..|+-+. ... +- +.-+.+-....+++.++....-++||.. .+.++||+.. T Consensus 70 ~~~~~~t~---~~~~~~L~i~~g~~~Vd~~i~dl~~~~~~dq~~~Ql~~~iea~~~~~e~~lINGD~a--~n~F~GL~~~ 144 (310) T protein:vir:97 70 AGKAAATF---TKVNSNLTTIMGDAEVNGLIQATRSGDGNDQTAVQIASKAKSAGRKYQDQLINGNGA--GNEFAGLIQL 144 (310) T ss_pred cccccccc---ceeeeeeeeeeehhhhhhHHHhhhcCChHHHHHHHHHHHHHHHHHHHHHHhhccccC--CCcccchhhc Confidence 33322222 2223334444445554432 221 32 2333444555667778888888999875 3578999886 Q ss_pred CCCccccccccCCcccccccCCHHHHHHHHHHHHHHHHHhcCCeeccccccceEEcCHHH---HHhhccC---------- Q lcl|Aclame:pro 218 PSLLPAIASTTPGGWVSGGANAFQGIVGDLRLMLITLRVQSEDNIDPEDVDITLVLPMNK---VDMLSVV---------- 284 (388) Q Consensus 218 P~l~a~~~~~~~~~~t~Wa~kT~~eI~~DI~~~~~~l~~~s~g~v~~~~~p~tL~Lp~~~---~~~Ls~~---------- 284 (388) =.-..-+.+.+.++.. | ++|+.+++..+|..-+ .|..|++.|.. +..+.+. T Consensus 145 ~~~~q~i~~~~~gg~~-----t----~d~LDeLl~~v~~~~g-------~p~~~l~~~~~~r~i~A~~R~~~~~g~~~~~ 208 (310) T protein:vir:97 145 CASGQKATTGATGSAI-----S----FAILDELMDLVVDKDG-------QVDYLTMHARTLRSYKALLRALGGASINEVV 208 (310) T ss_pred CCccceeecCCCCCCC-----C----HHHHHHHHHHHhcCCC-------CCCEEEecHHHHHHHHHHHHHhcCCCCCCcc Confidence 3221112211122222 2 5789999999986543 35578899864 4444331 Q ss_pred -CCcCccHHHHHHHhCCccEEEEccccc---cccCCCCccEEEEEEcccc---cccccccCCCcceEeecchhhhccC-c Q lcl|Aclame:pro 285 -TDLGISVRDWLKQTYPRVRVMSAPELQ---GGNPDDGKDIAYMFLDSVD---TAVDGSTDGGDTWAQLVQSKFVTLG-V 356 (388) Q Consensus 285 -~~~~~Tvl~~lk~n~pnl~i~~~pel~---~a~gtg~~~~~~~~~~~~d---~~~~~~~~~~~t~~~~~p~~~r~~~-v 356 (388) +.+|.-|+ .|-.+-|..+-.+- ...+++++...++..-.-+ .=+.+...+...+. ..|..| + T Consensus 209 ~~~~G~~v~-----~~~GiPi~~~d~ip~~~~~~~~~gtTsIya~r~Ge~~~~~Gv~Gl~~~~~~gl-----sVr~~G~~ 278 (310) T protein:vir:97 209 ELPSGAEVP-----AYSGTPIFRNDYIPTNQTKGGTTGCTTIFAGTLDDGSRTHGIAGLTATQAAGI-----QVVDVGES 278 (310) T ss_pred ccCCCCEEe-----eeCCeEEEEeCccCCCccccccCCceeEEEEeeCccccccceeccccCCccce-----eEEeCCcc Confidence 12333332 23344444332221 1112233444444332111 11233332222222 234444 2 Q ss_pred -eeccCceEEecccceeeeeeeccccceeeccC Q lcl|Aclame:pro 357 -EKRVKNYVEAYSNATAGVMLKRPWAVVRLIGL 388 (388) Q Consensus 357 -~~~~~~~~~~~~~~t~G~ii~rP~ai~~~~GI 388 (388) +....+|.++. -.|+.+.-|.|++.+.|| T Consensus 279 ~~~~v~~~~V~~---Y~~~av~~~~A~a~L~~V 308 (310) T protein:vir:97 279 EDSDEHIWRVKW---YCGLALFSEKGLACADGI 308 (310) T ss_pred cCCcceeEEEEE---eeeEEEecccceeeeccc Confidence 12224555544 469999999999999999 No 59 >protein:vir:100247 Length: 425 # NCBI annotation: gp76 # Family: family:all:21 # MgeID: mge:1619 # MgeName: Bcep176 # Cross-refs: genbank:acc:YP_355412;genbank:gi:77864702;genbank:GeneID:3725969 Probab=95.56 E-value=0.00035 Score=39.50 Aligned_cols=342 Identities=10% Similarity=0.006 Sum_probs=150.1 Q ss_pred CCCcceeeeecCccccchhhhhhcccccccccCCHHHHhhcceecccchhhcchhhhhhhhhhhhccC--cccccccccc Q lcl|Aclame:pro 1 MKQLSKVHQSLAGRSVRAFDMANGKADYRLTDMAVRELKKFGLVFDHATVKRQIELLHEGGVATQAFD--SAYVAPTTQA 78 (388) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~amD--aa~~~~~t~~ 78 (388) +.+..+.+..+..++.+ ++ .... .+.+.-..-.+.......+......-.....+ ++.. ..+.+ T Consensus 71 ~~~~~~~~~ei~~~~~~-~~-----------~~~~-~~~~~~~~~~~~~~~~~~~~~~af~~~l~~~e~~~al~-~~t~~ 136 (425) T protein:vir:10 71 LAKVDKVSADLEALQAA-VD-----------EANI-KIAAAQMGANGVKPLRDPEYTEAFKAHVKRGDVQAALN-KGEDS 136 (425) T ss_pred HHHHHHHHHHHHHHHHH-HH-----------HHHH-HHHhhhcccccccccccHHHHHHHHHHhhhhhhHHHhh-cCcCC Confidence 11111111111111000 00 0000 00000000000000000010000000000001 1111 11334 Q ss_pred cchHHHHHHHhhcceeeeecccchhhhhhcccccCCCCceeeEEEeeeccccceEecccccCCceeee-eeeeeeeeEEE Q lcl|Aclame:pro 79 SIPTPIQFLQQWLPGFVKVLTSARKIDEILGVKTVGSWEDQEIVQGIVEPAGTAMEYGDLTNIPLSSW-NVNFERRTIVR 157 (388) Q Consensus 79 ~~g~l~~~l~~idp~v~e~l~~~~~~~~i~~v~t~g~w~~~t~~~~v~e~~G~a~~ygd~~diP~~~~-n~~~~~~~v~~ 157 (388) +.|+++- +.+.++|++.+......+++..+.+... ....+++......+...+....+|-.+. ..+...-..+. T Consensus 137 ~gG~lvP--~~~~~~ii~~~~~~s~l~~l~~~~~~~~---~~~~~~~~~~~~~a~wv~E~~~~~~~~~~~f~~v~~~~~k 211 (425) T protein:vir:10 137 EGGYLTP--IEWDRTITNKLVLISPMRQLCRVQPVSK---AGFSKLFNMGGTTSGWVGEASQRPQTNAATFQPLSFASGE 211 (425) T ss_pred CCceecc--HhHHHHHHHHHHhhhhhhhhceeeeccC---CceEEEEEcCCcceeeeccccccccccccccceeeeehee Confidence 4454432 2334567777766666666655443322 2234555444556667788888887764 67888888888 Q ss_pred EEEEEeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEEeecCccccceEEEeecCCCccccccccCCccc-ccc Q lcl|Aclame:pro 158 GEMGIQVGLLEEGRASAMRINSAEVKRQGAAVQLEIMRNAIGFYGWEGKNGNRTFGFLNDPSLLPAIASTTPGGWV-SGG 236 (388) Q Consensus 158 ~~~~~~y~~~El~~A~~~g~~l~~~K~~aAr~a~~~~~n~i~~~G~a~~~~~g~~GllN~P~l~a~~~~~~~~~~t-~Wa 236 (388) ++..+.++.+=++ ....++.+.-......++...+|+-.++|+.- ....|+||++...+.......+... .-. T Consensus 212 ~~~~i~iS~ell~---ds~~~l~~~i~~~la~ai~~~~d~~~l~G~G~---~~p~Gil~~~~~~~~~~~~~~~~~~~~~~ 285 (425) T protein:vir:10 212 IYANPAATQQILD---DAEIDLESWLATEVQTEFAKQEGKAFLAGDGT---NKPNGLLTYIAGGANAAKHPFGAIEVVNS 285 (425) T ss_pred eEeehHhHHHHHh---cchhHHHHHHHHHHHHHHHHHHHhhhhcccCC---CCcceeeeccccccccccccccccccccc Confidence 8888888875333 33568889999999999999999999999742 3678999988654322111100000 001 Q ss_pred cCCHHHHHHHHHHHHHHHHHhcCCeeccccccceEEcCHHHHHhhcc-CCCcCccHHHH-HHHhCC----ccEEEEcccc Q lcl|Aclame:pro 237 ANAFQGIVGDLRLMLITLRVQSEDNIDPEDVDITLVLPMNKVDMLSV-VTDLGISVRDW-LKQTYP----RVRVMSAPEL 310 (388) Q Consensus 237 ~kT~~eI~~DI~~~~~~l~~~s~g~v~~~~~p~tL~Lp~~~~~~Ls~-~~~~~~Tvl~~-lk~n~p----nl~i~~~pel 310 (388) ..+..--++||.+++..|...-. ..-+++|.+..+..|.. .+..|.-++.- +..-.| +.-++....+ T Consensus 286 ~~~~~~~~d~l~~l~~~l~~~~~-------~~a~~vmn~~~~~~L~~lkD~~G~~l~~~~~~~g~~~~l~G~PV~~~~~~ 358 (425) T protein:vir:10 286 GAAADITSDGIIDLVYDLPSAFT-------GNARFAMNRNTQRQVRKLKDGQGNYLWQPSYVAGQPATLAGYPVTEVPDM 358 (425) T ss_pred cccccccHHHHHHHHhhhhhhhc-------cCCEEEEchHHHHHHHHhhcCCCceeeccCccCCCCceecceeeEEecCc Confidence 12223346677777776653321 12368899999998854 23334322210 000011 1123332222 Q ss_pred ccccCCCCccEEEEEEcccccccccccCCCcceEeecchhhhccCceeccCceEEecccceeeeeeeccccceeeccC Q lcl|Aclame:pro 311 QGGNPDDGKDIAYMFLDSVDTAVDGSTDGGDTWAQLVQSKFVTLGVEKRVKNYVEAYSNATAGVMLKRPWAVVRLIGL 388 (388) Q Consensus 311 ~~a~gtg~~~~~~~~~~~~d~~~~~~~~~~~t~~~~~p~~~r~~~v~~~~~~~~~~~~~~t~G~ii~rP~ai~~~~GI 388 (388) .. .+.+...++|.+ ....... .+ +........++-. .-...+-...|.+|.+ .+|.||+.+..= T Consensus 359 p~---~~~~~~~i~~Gd-~~~~~~i-~~--~~~~~v~~d~~~~------~~~~~~~~~~r~d~~v-~~~~A~~~l~~~ 422 (425) T protein:vir:10 359 PD---VAANSTPILFGD-FQQTYLI-ID--RIGVRVLRDPYTA------KPYVLFYTTKRVGGGL-LNPEPMRAMKVA 422 (425) T ss_pred CC---ccCCccEEEEEe-hhccEEE-EE--ecceEEEeccccc------CCcEEEEEEEEeccEe-ecccceEEEEee Confidence 21 122233334422 1110000 00 0001111121211 1112233344555554 449998664433 No 60 >protein:vir:4700 Length: 415 # NCBI annotation: phi PVL ORF 7 homologue # Family: family:all:21 # MgeID: mge:102 # MgeName: phiPV83 # Cross-refs: genbank:acc:NP_061632;genbank:gi:9635719;genbank:GeneID:1262976 Probab=95.38 E-value=0.0012 Score=36.55 Aligned_cols=321 Identities=9% Similarity=-0.031 Sum_probs=139.6 Q ss_pred CCCcceeeeecCccccchhhhhhcccccccccCCHHHHhhcceecccchhhcchhhhhhhhhhhhccCcccccccccccc Q lcl|Aclame:pro 1 MKQLSKVHQSLAGRSVRAFDMANGKADYRLTDMAVRELKKFGLVFDHATVKRQIELLHEGGVATQAFDSAYVAPTTQASI 80 (388) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~amDaa~~~~~t~~~~ 80 (388) ..+.-. .-..|..+ + .......+..+... .........+.-.............+..+. T Consensus 71 ~~~~~~---~~~~~~~~--~--------------~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~t~~g 129 (415) T protein:vir:47 71 NQQSVE---VNEARTYR--N--------------QANINDLGISIQNT--KVTSQEVRDFTEYLETRNDIQGGSLKTDSG 129 (415) T ss_pred cccccc---cchhhhhH--H--------------HHHHHHHHHhhhhh--hhhHHHHHHHHHHHhhhhhhhhccccccCC Confidence 110000 00000000 0 00000111111000 000011111111111111111112222233 Q ss_pred h--HHHHHHHhhcceeeeecccchhhhhhcccccCCCCceeeEEEeee--ccccceEecccccCCcee-eeeeeeeeeeE Q lcl|Aclame:pro 81 P--TPIQFLQQWLPGFVKVLTSARKIDEILGVKTVGSWEDQEIVQGIV--EPAGTAMEYGDLTNIPLS-SWNVNFERRTI 155 (388) Q Consensus 81 g--~l~~~l~~idp~v~e~l~~~~~~~~i~~v~t~g~w~~~t~~~~v~--e~~G~a~~ygd~~diP~~-~~n~~~~~~~v 155 (388) + +|..+ .++|++.+......++++.+..... .+..+++. ...+.+...+....+|-. ....+...... T Consensus 130 ~~~iP~~~----~~~ii~~~~~~~~l~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~v~Eg~~~~~~~~~~~~~v~~~~ 202 (415) T protein:vir:47 130 FVVIPEEI----VTDILKLKEVEFNLDKYVTVKRVTN---GSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQLAYDI 202 (415) T ss_pred cccccHHH----HHHHHHHHHhhhhhhhhcceeeccC---CceeEEEEEecCCcceeecccccccccccccceeeEEeee Confidence 3 44333 3467777766666677755533221 11233433 233355667788888865 45788888999 Q ss_pred EEEEEEEeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEEeecCccccceEEEeecCCCccccccccCCccccc Q lcl|Aclame:pro 156 VRGEMGIQVGLLEEGRASAMRINSAEVKRQGAAVQLEIMRNAIGFYGWEGKNGNRTFGFLNDPSLLPAIASTTPGGWVSG 235 (388) Q Consensus 156 ~~~~~~~~y~~~El~~A~~~g~~l~~~K~~aAr~a~~~~~n~i~~~G~a~~~~~g~~GllN~P~l~a~~~~~~~~~~t~W 235 (388) +.++..+.++.+=++ ....+|.+.-....+.++.+.+|+-.+.|+.... ...++......+. . + T Consensus 203 ~k~~~~~~iS~ell~---ds~~~l~~~i~~~l~~~i~~~~d~~il~g~g~g~--~~~~~~~~~~~~~-----~------~ 266 (415) T protein:vir:47 203 NTHRGYFRISREAIE---DAKVNVLQELKLWMARTIAATRNKAIIDVITKGS--TGSTSSGFEKEGK-----K------L 266 (415) T ss_pred eeeEeeehhhHHHHh---hchHHHHHHHHHHHHHHHHHHHHHHHhhccccCC--ccccccccccccc-----e------e Confidence 999999988885443 3346788888888889999999999998874321 1112111111100 0 0 Q ss_pred ccCCHHHHHHHHHHHHHHHHHhcCCeeccccccceEEcCHHHHHhhccC-CCcCccHHH-HHHHh----CCccEEEEccc Q lcl|Aclame:pro 236 GANAFQGIVGDLRLMLITLRVQSEDNIDPEDVDITLVLPMNKVDMLSVV-TDLGISVRD-WLKQT----YPRVRVMSAPE 309 (388) Q Consensus 236 a~kT~~eI~~DI~~~~~~l~~~s~g~v~~~~~p~tL~Lp~~~~~~Ls~~-~~~~~Tvl~-~lk~n----~pnl~i~~~pe 309 (388) . .+...-++||.+++..+...-. .+..++|.++.+..|.+. +..|.-++. -+... ..+..++..+. T Consensus 267 ~-~~~~~~~~~i~~~~~~~~~~~~-------~~~~~v~n~~~~~~L~~lkd~~G~~i~~~~~~~~~~~~l~G~pV~~~~~ 338 (415) T protein:vir:47 267 E-VKKAKSLDDIKDAINLNVKPNY-------EHNVAIVSQTMFAKLDKMKDKLGNYLIQPDVKEKTQQRLLGAKIEILPD 338 (415) T ss_pred c-cccccchHHHHHHHHhhhhhcc-------CCCEEEEcHHHHHHHHHhhccCCCeeeccCcCCCCCccccceeeEEecc Confidence 1 1111126677777777764421 245789999999988643 333443321 00010 11122333322 Q ss_pred cccccCCCCccEEEEEEcccccccccccCCCcceEeecchhhhccCceeccCceEEecccceeeeeeeccccceeeccC Q lcl|Aclame:pro 310 LQGGNPDDGKDIAYMFLDSVDTAVDGSTDGGDTWAQLVQSKFVTLGVEKRVKNYVEAYSNATAGVMLKRPWAVVRLIGL 388 (388) Q Consensus 310 l~~a~gtg~~~~~~~~~~~~d~~~~~~~~~~~t~~~~~p~~~r~~~v~~~~~~~~~~~~~~t~G~ii~rP~ai~~~~GI 388 (388) +-. + .+++..++|.+ .......... ..+...+ .++.... ...-...|. |+.+.+|-||+.+.-- T Consensus 339 ~~~--~-~~~~~~~~~gd-~~~~~~~~~~--~~~~v~~-~~~~~~~-------~~~~~~~r~-d~~v~~~~a~~~~~~~ 402 (415) T protein:vir:47 339 EVL--G-QKGNNTLIIGN-LKDAIVLFDR--SQYQASW-TDYMHFG-------ECLMIAVRQ-DCRILDYKSAIVIEYD 402 (415) T ss_pred ccc--c-CCCccEEEEEe-hhccEEEEee--cceEEEe-eccccCc-------eEEEEEEEe-ccEEeccccEEEEEee Confidence 211 1 22333344432 1110010000 0011111 1111111 111223354 5566679999887644 No 61 >protein:vir:4600 Length: 415 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:101 # MgeName: PVL # Cross-refs: genbank:acc:NP_058445;genbank:gi:9635171;genbank:GeneID:1262708 Probab=95.38 E-value=0.0012 Score=36.55 Aligned_cols=321 Identities=9% Similarity=-0.031 Sum_probs=139.6 Q ss_pred CCCcceeeeecCccccchhhhhhcccccccccCCHHHHhhcceecccchhhcchhhhhhhhhhhhccCcccccccccccc Q lcl|Aclame:pro 1 MKQLSKVHQSLAGRSVRAFDMANGKADYRLTDMAVRELKKFGLVFDHATVKRQIELLHEGGVATQAFDSAYVAPTTQASI 80 (388) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~amDaa~~~~~t~~~~ 80 (388) ..+.-. .-..|..+ + .......+..+... .........+.-.............+..+. T Consensus 71 ~~~~~~---~~~~~~~~--~--------------~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~t~~g 129 (415) T protein:vir:46 71 NQQSVE---VNEARTYR--N--------------QANINDLGISIQNT--KVTSQEVRDFTEYLETRNDIQGGSLKTDSG 129 (415) T ss_pred cccccc---cchhhhhH--H--------------HHHHHHHHHhhhhh--hhhHHHHHHHHHHHhhhhhhhhccccccCC Confidence 110000 00000000 0 00000111111000 000011111111111111111112222233 Q ss_pred h--HHHHHHHhhcceeeeecccchhhhhhcccccCCCCceeeEEEeee--ccccceEecccccCCcee-eeeeeeeeeeE Q lcl|Aclame:pro 81 P--TPIQFLQQWLPGFVKVLTSARKIDEILGVKTVGSWEDQEIVQGIV--EPAGTAMEYGDLTNIPLS-SWNVNFERRTI 155 (388) Q Consensus 81 g--~l~~~l~~idp~v~e~l~~~~~~~~i~~v~t~g~w~~~t~~~~v~--e~~G~a~~ygd~~diP~~-~~n~~~~~~~v 155 (388) + +|..+ .++|++.+......++++.+..... .+..+++. ...+.+...+....+|-. ....+...... T Consensus 130 ~~~iP~~~----~~~ii~~~~~~~~l~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~v~Eg~~~~~~~~~~~~~v~~~~ 202 (415) T protein:vir:46 130 FVVIPEEI----VTDILKLKEVEFNLDKYVTVKRVTN---GSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQLAYDI 202 (415) T ss_pred cccccHHH----HHHHHHHHHhhhhhhhhcceeeccC---CceeEEEEEecCCcceeecccccccccccccceeeEEeee Confidence 3 44333 3467777766666677755533221 11233433 233355667788888865 45788888999 Q ss_pred EEEEEEEeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEEeecCccccceEEEeecCCCccccccccCCccccc Q lcl|Aclame:pro 156 VRGEMGIQVGLLEEGRASAMRINSAEVKRQGAAVQLEIMRNAIGFYGWEGKNGNRTFGFLNDPSLLPAIASTTPGGWVSG 235 (388) Q Consensus 156 ~~~~~~~~y~~~El~~A~~~g~~l~~~K~~aAr~a~~~~~n~i~~~G~a~~~~~g~~GllN~P~l~a~~~~~~~~~~t~W 235 (388) +.++..+.++.+=++ ....+|.+.-....+.++.+.+|+-.+.|+.... ...++......+. . + T Consensus 203 ~k~~~~~~iS~ell~---ds~~~l~~~i~~~l~~~i~~~~d~~il~g~g~g~--~~~~~~~~~~~~~-----~------~ 266 (415) T protein:vir:46 203 NTHRGYFRISREAIE---DAKVNVLQELKLWMARTIAATRNKAIIDVITKGS--TGSTSSGFEKEGK-----K------L 266 (415) T ss_pred eeeEeeehhhHHHHh---hchHHHHHHHHHHHHHHHHHHHHHHHhhccccCC--ccccccccccccc-----e------e Confidence 999999988885443 3346788888888889999999999998874321 1112111111100 0 0 Q ss_pred ccCCHHHHHHHHHHHHHHHHHhcCCeeccccccceEEcCHHHHHhhccC-CCcCccHHH-HHHHh----CCccEEEEccc Q lcl|Aclame:pro 236 GANAFQGIVGDLRLMLITLRVQSEDNIDPEDVDITLVLPMNKVDMLSVV-TDLGISVRD-WLKQT----YPRVRVMSAPE 309 (388) Q Consensus 236 a~kT~~eI~~DI~~~~~~l~~~s~g~v~~~~~p~tL~Lp~~~~~~Ls~~-~~~~~Tvl~-~lk~n----~pnl~i~~~pe 309 (388) . .+...-++||.+++..+...-. .+..++|.++.+..|.+. +..|.-++. -+... ..+..++..+. T Consensus 267 ~-~~~~~~~~~i~~~~~~~~~~~~-------~~~~~v~n~~~~~~L~~lkd~~G~~i~~~~~~~~~~~~l~G~pV~~~~~ 338 (415) T protein:vir:46 267 E-VKKAKSLDDIKDAINLNVKPNY-------EHNVAIVSQTMFAKLDKMKDKLGNYLIQPDVKEKTQQRLLGAKIEILPD 338 (415) T ss_pred c-cccccchHHHHHHHHhhhhhcc-------CCCEEEEcHHHHHHHHHhhccCCCeeeccCcCCCCCccccceeeEEecc Confidence 1 1111126677777777764421 245789999999988643 333443321 00010 11122333322 Q ss_pred cccccCCCCccEEEEEEcccccccccccCCCcceEeecchhhhccCceeccCceEEecccceeeeeeeccccceeeccC Q lcl|Aclame:pro 310 LQGGNPDDGKDIAYMFLDSVDTAVDGSTDGGDTWAQLVQSKFVTLGVEKRVKNYVEAYSNATAGVMLKRPWAVVRLIGL 388 (388) Q Consensus 310 l~~a~gtg~~~~~~~~~~~~d~~~~~~~~~~~t~~~~~p~~~r~~~v~~~~~~~~~~~~~~t~G~ii~rP~ai~~~~GI 388 (388) +-. + .+++..++|.+ .......... ..+...+ .++.... ...-...|. |+.+.+|-||+.+.-- T Consensus 339 ~~~--~-~~~~~~~~~gd-~~~~~~~~~~--~~~~v~~-~~~~~~~-------~~~~~~~r~-d~~v~~~~a~~~~~~~ 402 (415) T protein:vir:46 339 EVL--G-QKGNNTLIIGN-LKDAIVLFDR--SQYQASW-TDYMHFG-------ECLMIAVRQ-DCRILDYKSAIVIEYD 402 (415) T ss_pred ccc--c-CCCccEEEEEe-hhccEEEEee--cceEEEe-eccccCc-------eEEEEEEEe-ccEEeccccEEEEEee Confidence 211 1 22333344432 1110010000 0011111 1111111 111223354 5566679999887644 No 62 >protein:vir:4456 Length: 401 # NCBI annotation: Major capsid protein precursor # Family: family:all:21 # MgeID: mge:96 # MgeName: ST64B # Cross-refs: genbank:acc:NP_700379;genbank:gi:23505451;genbank:GeneID:955658 Probab=95.31 E-value=0.00061 Score=38.20 Aligned_cols=343 Identities=10% Similarity=-0.006 Sum_probs=143.5 Q ss_pred CCCcceeeeecCccccchhhhhhcccccccccCCHHHHhh-cceecccchhhcchhhhhhhhhhhhccCccccccccccc Q lcl|Aclame:pro 1 MKQLSKVHQSLAGRSVRAFDMANGKADYRLTDMAVRELKK-FGLVFDHATVKRQIELLHEGGVATQAFDSAYVAPTTQAS 79 (388) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~-~g~~~~~~~~~~~~~~~~~~~~~~~amDaa~~~~~t~~~ 79 (388) ++.+...-.-+. |+.. ...+. ....... .-.+.++ ++=.+..+.. ..+.... ..+|-. +..+.++ T Consensus 51 ~~~~~~~~~~~~-~~~~--~~~~~-~~~~~~~-~~~e~~~a~~~~lr~~~~----~~~~~~e--~~a~~~---~~~~~GG 116 (401) T protein:vir:44 51 LSELENLKSDLE-KELL--ELKRP-ARGAQNK-VAAEHKDAFVGFLRKGRE----DGLRDLE--RKALQV---GTDEDGG 116 (401) T ss_pred HHHHHHHHHHHH-HHHH--Hhhcc-ccccccc-hhHHHHHHHHHHHhhhhh----hhhHHHH--HHHhhc---CCCCCCc Confidence 100000000000 0000 00000 0000000 0001111 0000000000 0011111 111111 1011122 Q ss_pred chHHHHHHHhhcceeeeecccchhhhhhcccccCCCCceeeEEEeeeccccceEecccccCCceee-eeeeeeeeeEEEE Q lcl|Aclame:pro 80 IPTPIQFLQQWLPGFVKVLTSARKIDEILGVKTVGSWEDQEIVQGIVEPAGTAMEYGDLTNIPLSS-WNVNFERRTIVRG 158 (388) Q Consensus 80 ~g~l~~~l~~idp~v~e~l~~~~~~~~i~~v~t~g~w~~~t~~~~v~e~~G~a~~ygd~~diP~~~-~n~~~~~~~v~~~ 158 (388) .-+|..+ .++|++.+......+.+..+...+. ....+++......+...+....+|-.+ ...+...-.++.+ T Consensus 117 ~~iP~~~----~~~ii~~~~~~~~l~~~~~~~~~~~---~~~~~~~~~~~~~a~wv~E~~~~~~~~~~~~~~v~~~~~k~ 189 (401) T protein:vir:44 117 YAVPEEL----DRSILSLLKDEVVMRQEATVITVGG---SDYKKLVNLGGTASGWVGETDTRSQTATSRLGLIEPFMGEI 189 (401) T ss_pred eeccHhH----HHHHHHHHHhhhhhhhhceeeecCC---CceEEEEecCCccceeeccccccCccccccceeeeeehhhe Confidence 2245443 3466666554444555544433222 223455544444455667777777655 3677778888888 Q ss_pred EEEEeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEEeecCccccceEEEeecCCCccccccccCCcccccccC Q lcl|Aclame:pro 159 EMGIQVGLLEEGRASAMRINSAEVKRQGAAVQLEIMRNAIGFYGWEGKNGNRTFGFLNDPSLLPAIASTTPGGWVSGGAN 238 (388) Q Consensus 159 ~~~~~y~~~El~~A~~~g~~l~~~K~~aAr~a~~~~~n~i~~~G~a~~~~~g~~GllN~P~l~a~~~~~~~~~~t~Wa~k 238 (388) ...+.++.+=+.. ...++.+.-....+.++...++.-.++|+.. ....|+||.+...........+.-..-.+. T Consensus 190 ~~~~~iS~ell~d---s~~~l~~~i~~~la~ai~~~~~~~~l~G~G~---~~p~Gil~~~~~~~~~~~~~~~~~~~~~t~ 263 (401) T protein:vir:44 190 YGNPQATQKMLDD---AFFNVEAWINSELATEFAEQEEIAFTTGDGT---KKPKGFLAYESTEESDKARAFGKLQHIVSG 263 (401) T ss_pred eeehhhhHHHHhc---chHHHHHHHHHHHHHHHHHHHHhhhhccCCC---Cccceeeccccccccccccccccccccccc Confidence 8888888864433 3558888888899999999999999999743 357899999877543221111000000111 Q ss_pred CHH-HHHHHHHHHHHHHHHhcCCeeccccccceEEcCHHHHHhhccC-CCcCccHHHHHHHhCCccEEEEccccc--ccc Q lcl|Aclame:pro 239 AFQ-GIVGDLRLMLITLRVQSEDNIDPEDVDITLVLPMNKVDMLSVV-TDLGISVRDWLKQTYPRVRVMSAPELQ--GGN 314 (388) Q Consensus 239 T~~-eI~~DI~~~~~~l~~~s~g~v~~~~~p~tL~Lp~~~~~~Ls~~-~~~~~Tvl~~lk~n~pnl~i~~~pel~--~a~ 314 (388) +.. --++||.+++..|...-. ...+++|.++.+..|... +..|.-++.-=..+.+.-+|-..|=+. ... T Consensus 264 ~~~~~~~d~i~~~~~~l~~~~~-------~~a~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~g~~~~l~G~PVv~~~~~p 336 (401) T protein:vir:44 264 EATAVTADAIIKLIYTLRKAHR-------TGAKFMMNNNSLFAIRLLKDTEGNYLWRPGLELGQPSSLAGYGIAENEQMP 336 (401) T ss_pred cccccCHHHHHHHHHhcchhhh-------cCCEEEEcHHHHHHHHHhhccCCceeecCCcCCCCCceecceeeEEecCcC Confidence 111 226777777777653311 122688999999988543 333443321100111111222222110 000 Q ss_pred CCCCccEEEEEEcccccccccccCCCcceEeecchhhhccCceeccCceEEecccceeeeeeeccccceeeccC Q lcl|Aclame:pro 315 PDDGKDIAYMFLDSVDTAVDGSTDGGDTWAQLVQSKFVTLGVEKRVKNYVEAYSNATAGVMLKRPWAVVRLIGL 388 (388) Q Consensus 315 gtg~~~~~~~~~~~~d~~~~~~~~~~~t~~~~~p~~~r~~~v~~~~~~~~~~~~~~t~G~ii~rP~ai~~~~GI 388 (388) ..++++..++|. +........ + ...+.. .-+++. ..-...+.+..|.+|.++. |-||+.+..= T Consensus 337 ~~~~~~~~i~~G-d~~~~~~i~-~-~~~~~~-~~~~~~------~~~~v~~~a~~r~d~~~~~-~~a~~~l~~~ 399 (401) T protein:vir:44 337 DIAADAKAIAFG-NFKRGYTIV-D-RIGTRI-LRDPYT------NKPFVGFYTTKRTGGMLVD-SQAIKLLKIA 399 (401) T ss_pred CccCCccEEEEe-ehhccEEEE-E-ecceEE-eeeccc------cCCcEEEEEEEEeccEEec-ccceEEEEee Confidence 112233333442 211100000 0 000111 111111 1112233444455555544 8888765544 No 63 >protein:vir:485 Length: 407 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:11 # MgeName: P27 # Cross-refs: genbank:acc:NP_543092;swissprot:trembl:q8w627;genbank:gi:18249904;uniprot:Q8W627;genbank:GeneID:929693 Probab=95.15 E-value=0.001 Score=36.95 Aligned_cols=345 Identities=11% Similarity=0.027 Sum_probs=146.4 Q ss_pred CCCcceeeeecCccc------cchhhhhhcccccccccCCHHHHhhcceecccchhhcchhhhhhhhhhhhccCcccccc Q lcl|Aclame:pro 1 MKQLSKVHQSLAGRS------VRAFDMANGKADYRLTDMAVRELKKFGLVFDHATVKRQIELLHEGGVATQAFDSAYVAP 74 (388) Q Consensus 1 ~~~~~~~~~~~~~~~------~~~~~~~~~~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~amDaa~~~~ 74 (388) ..++-+....+...+ -+.+...+.. ....++-...+-++ .|.+.........+. .....++.. T Consensus 40 ~~~~e~~~~~~~~~e~~~~~~~~~~~~~~~~-~~~~~~~~~~e~~~---a~~~~l~~g~~~~~~--~~e~~a~~~----- 108 (407) T protein:vir:48 40 AGEVETLNGKLAELENLKSDLEAELAEVKRP-AGGTQNKVASEHKE---AFIGFMRKGREDGLR--ELERKALQV----- 108 (407) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhhcc-ccccccchhhHHHH---HHHHHHhccchhhhh--HHHHHhhhc----- Confidence 000000000000000 0000000000 00000000000000 000000000000000 001112221 Q ss_pred cccccchHHHHHHHhhcceeeeecccchhhhhhcccccCCCCceeeEEEeeeccccceEecccccCCceee-eeeeeeee Q lcl|Aclame:pro 75 TTQASIPTPIQFLQQWLPGFVKVLTSARKIDEILGVKTVGSWEDQEIVQGIVEPAGTAMEYGDLTNIPLSS-WNVNFERR 153 (388) Q Consensus 75 ~t~~~~g~l~~~l~~idp~v~e~l~~~~~~~~i~~v~t~g~w~~~t~~~~v~e~~G~a~~ygd~~diP~~~-~n~~~~~~ 153 (388) .+.++.|+++- +.+.++|++.+......+.+..+.+.+. ....+++......+...+....+|-.+ .......- T Consensus 109 ~t~~~gG~~iP--~~~~~~I~~~~~~~~~l~~~~~~~~~~~---~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~f~~i~~ 183 (407) T protein:vir:48 109 GNDEDGGYAIP--EELDRTILTLLKDEVVMRQEATVITLGG---SDYKKLVNLGGTTSGWVGETDARPETATSKLGLIEP 183 (407) T ss_pred ccCCCCccccc--HhHHHHHHHHHHhhhhhhhhceeeecCC---CceEEEEecCCcceeeecccccccccccccceeEEe Confidence 12233443322 2245566666544444455544333222 334555655555666777777888654 46777888 Q ss_pred eEEEEEEEEeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEEeecCccccceEEEeecCCCccccccccCCccc Q lcl|Aclame:pro 154 TIVRGEMGIQVGLLEEGRASAMRINSAEVKRQGAAVQLEIMRNAIGFYGWEGKNGNRTFGFLNDPSLLPAIASTTPGGWV 233 (388) Q Consensus 154 ~v~~~~~~~~y~~~El~~A~~~g~~l~~~K~~aAr~a~~~~~n~i~~~G~a~~~~~g~~GllN~P~l~a~~~~~~~~~~t 233 (388) .++.+...+.++.+=++. ...++.+.-......++...+++-.++|+.- ....|+|+++.+.........+... T Consensus 184 ~~~k~~~~~~iS~ell~d---s~~~l~~~i~~~l~~~i~~~~~~a~l~G~G~---~~p~Gil~~~~~~~~~~~~~~~~~~ 257 (407) T protein:vir:48 184 FMGEIYGNPQATQKMLDD---AFFNVEDWINSELALEFAEQEEIAFTSGDGS---KKPKGFLAYESTDEDDKTRAFGKLQ 257 (407) T ss_pred eeeeeEeehhhHHHHHhc---chHHHHHHHHHHHHHHHHHHHHhhhhccCCC---Cccceeeeccccccccccccccccc Confidence 889999988888864433 3457888888888888999999999999753 3578999998874432211111111 Q ss_pred ccccCCHHH-HHHHHHHHHHHHHHhcCCeeccccccceEEcCHHHHHhhccC-CCcCccHHHH-HHHhCC----ccEEEE Q lcl|Aclame:pro 234 SGGANAFQG-IVGDLRLMLITLRVQSEDNIDPEDVDITLVLPMNKVDMLSVV-TDLGISVRDW-LKQTYP----RVRVMS 306 (388) Q Consensus 234 ~Wa~kT~~e-I~~DI~~~~~~l~~~s~g~v~~~~~p~tL~Lp~~~~~~Ls~~-~~~~~Tvl~~-lk~n~p----nl~i~~ 306 (388) .-.++++.. -++||.+++..|...-. . .-.++|.+..+..|.+- +..|.-++.- +....| +..++. T Consensus 258 ~~~~~~~~~~~~d~i~~l~~~l~~~~~----~---~a~~v~n~~~~~~L~~lkD~~Gr~l~~~~~~~g~~~~l~G~PV~~ 330 (407) T protein:vir:48 258 HIASGAASGVTADAIIKLIYTLRKAHR----S---GAKFMMNNSSLFAIRLLKDNDGNYLWRPGIELGQPSSLAGYGIVE 330 (407) T ss_pred ccccccccccChHHHHHHHHhhchhhh----c---CCEEEEcHHHHHHHHHhhccCCceeeccCcCCCCCceecceeeEE Confidence 111122222 26777777776654321 1 12578999998888542 3334333210 011111 112222 Q ss_pred ccccccccCCCCccEEEEEEcccccccccccCCCcceEeecchhhhccCceeccCceEEecccceeeeeeeccccceeec Q lcl|Aclame:pro 307 APELQGGNPDDGKDIAYMFLDSVDTAVDGSTDGGDTWAQLVQSKFVTLGVEKRVKNYVEAYSNATAGVMLKRPWAVVRLI 386 (388) Q Consensus 307 ~pel~~a~gtg~~~~~~~~~~~~d~~~~~~~~~~~t~~~~~p~~~r~~~v~~~~~~~~~~~~~~t~G~ii~rP~ai~~~~ 386 (388) ...+.. .+++...++|.+ ....... .+ +....+...++- ..-...+.+..|.+|.+ .+|-||+.+. T Consensus 331 ~~~~p~---~~~~~~~i~~Gd-~~~~~~i-~~--~~~~~i~~d~~~------~~~~~~~~~~~r~d~~v-~~~~a~~~l~ 396 (407) T protein:vir:48 331 NEQMPD---IAADAKAIAFGN-FKRGYTI-VD--RIGTRILRDPYT------NKPFVGFYTTKRTGGML-VDSQAIKLMK 396 (407) T ss_pred ecCcCC---ccCCccEEEEEe-ccccEEE-EE--eeceEEEeeccc------cCCcEEEEEEEEeccEE-ecccceEEEE Confidence 222211 122333344422 1110000 00 000111111111 11112334455665555 4499997654 Q ss_pred cC Q lcl|Aclame:pro 387 GL 388 (388) Q Consensus 387 GI 388 (388) .= T Consensus 397 ~~ 398 (407) T protein:vir:48 397 IG 398 (407) T ss_pred ee Confidence 43 No 64 >protein:vir:96123 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1602 # MgeName: 37 # Cross-refs: genbank:acc:YP_240078;genbank:gi:66395742;genbank:GeneID:5133103 Probab=95.13 E-value=0.0027 Score=34.66 Aligned_cols=255 Identities=10% Similarity=0.019 Sum_probs=128.3 Q ss_pred hccCcccccccccccchHHHHHHHhhcceeeeecccchhhhhhcccccC--CCCceeeEEEeeeccccceEecccccCCc Q lcl|Aclame:pro 65 QAFDSAYVAPTTQASIPTPIQFLQQWLPGFVKVLTSARKIDEILGVKTV--GSWEDQEIVQGIVEPAGTAMEYGDLTNIP 142 (388) Q Consensus 65 ~amDaa~~~~~t~~~~g~l~~~l~~idp~v~e~l~~~~~~~~i~~v~t~--g~w~~~t~~~~v~e~~G~a~~ygd~~diP 142 (388) || ... .+.+.+=+|-.+..++..++ ...+....+..+++. |.- -.+++++.++..|.+.-|.++++++ T Consensus 1 ma--~~~---T~~~d~i~Pev~s~~v~~~~----~~~~~~~~~~~~~~~l~g~~-G~tv~ip~~~~~g~~~~~~~g~~i~ 70 (274) T protein:vir:96 1 MA--QGT---TKVSNLIVPEVLAPMMQAEL----DKKLRFAQFADIDSTLVGQP-GDTLTFPAFTYSGDAQVIAEGEKIP 70 (274) T ss_pred CC--ccc---cchhhhhhhHHHHHHHHHHH----HhhhhhcccccccccccCCC-CCEEEEEeeccCCCccccCCCCcCc Confidence 22 110 12234446666666554433 333444555544442 222 2578999999899999999999999 Q ss_pred eeeeeeeeeeeeEEEEEEEEeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEEeecCccccceEEEeecCCCcc Q lcl|Aclame:pro 143 LSSWNVNFERRTIVRGEMGIQVGLLEEGRASAMRINSAEVKRQGAAVQLEIMRNAIGFYGWEGKNGNRTFGFLNDPSLLP 222 (388) Q Consensus 143 ~~~~n~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~K~~aAr~a~~~~~n~i~~~G~a~~~~~g~~GllN~P~l~a 222 (388) ..++........+...+-+++++. +.+++ .+.++..+....+..++...+++..+-- ++.- T Consensus 71 ~~~it~~~~~~~i~~~~~~~~i~D--~~~~~-~~~d~~~~~~~~~~~~~a~~~d~~i~~~------------l~~a---- 131 (274) T protein:vir:96 71 VDQIGTSKREAKVRKIGKGTELTD--EAVLS-GFGDPQGEAVRQHGLAIANKVDNDVLEA------------LKGA---- 131 (274) T ss_pred hhhcccceeEEEEEeeeceeeecH--HHHHh-hcchHHHHHHHHHHHHHHHHHHHHHHHH------------HhcC---- Confidence 999999988888887766666655 44433 4557777778888888888887654411 1110 Q ss_pred ccccccCCcccccccCCHHHHHHHHHHHHHHHHHhcCCeeccccccceEEcCHHHHHhhccCC--------CcCccHH-H Q lcl|Aclame:pro 223 AIASTTPGGWVSGGANAFQGIVGDLRLMLITLRVQSEDNIDPEDVDITLVLPMNKVDMLSVVT--------DLGISVR-D 293 (388) Q Consensus 223 ~~~~~~~~~~t~Wa~kT~~eI~~DI~~~~~~l~~~s~g~v~~~~~p~tL~Lp~~~~~~Ls~~~--------~~~~Tvl-~ 293 (388) +....+... + ++.|..+...+-.. ...+..|+++|..+..|.+-+ +.|..++ . T Consensus 132 ---~~~~~~~~~----~----~d~i~dA~~~l~d~-------~~~~~~ivv~p~~~~~L~k~~~~~f~~~~~~g~~~~~~ 193 (274) T protein:vir:96 132 ---TLTVEADIT----K----LDGLQTAIDKFNDE-------DLEPMVLFVNPLDAGGLRTSASDNFTRPTQLGDNIIVK 193 (274) T ss_pred ---CCCcCcccc----c----HHHHHHHHHHhccc-------CCCceEEEeCHHHHHHHHhcccccccccccccccceee Confidence 011111111 2 44455555554322 124668999999999885421 1111110 0 Q ss_pred HHHHhCCccEEEEccccccccCCCCccEEEEEEcccccccccccCCCc-ceEeecchhhhccCceeccCceEEecccc-e Q lcl|Aclame:pro 294 WLKQTYPRVRVMSAPELQGGNPDDGKDIAYMFLDSVDTAVDGSTDGGD-TWAQLVQSKFVTLGVEKRVKNYVEAYSNA-T 371 (388) Q Consensus 294 ~lk~n~pnl~i~~~pel~~a~gtg~~~~~~~~~~~~d~~~~~~~~~~~-t~~~~~p~~~r~~~v~~~~~~~~~~~~~~-t 371 (388) -.--++-+++|..-..+- ....+++.+.- . .....+. +.|. .| ..+........+ . T Consensus 194 g~ig~~~G~~Vi~s~~~p-------~~t~~l~~~gA--~--~~~~~~~~~vE~-----~R------d~~~~~d~i~~~~~ 251 (274) T protein:vir:96 194 GAFGEALGAVIVRSNKLN-------KGEALLAKKGA--V--KLITKRDFFLEK-----DR------DASRKSTALYSDKH 251 (274) T ss_pred cccceecCeeEEEcCCCC-------cceEEEEeCcc--e--eeeecCCccccc-----cc------chhhcccEEEEeeE Confidence 000112234444322221 11223332210 0 0000000 0111 11 111111112222 5 Q ss_pred eeeeeeccccceeeccC Q lcl|Aclame:pro 372 AGVMLKRPWAVVRLIGL 388 (388) Q Consensus 372 ~G~ii~rP~ai~~~~GI 388 (388) +|+-+.+|-+++.+.-= T Consensus 252 yg~~~~~~~~vv~~t~~ 268 (274) T protein:vir:96 252 YVAYLYDESKVVKITKG 268 (274) T ss_pred EEEEEEcCccEEEEEcC Confidence 78889999887776544 No 65 >protein:vir:93616 Length: 645 # NCBI annotation: putative major head protein/prohead protease # Family: family:all:21 # MgeID: mge:157 # MgeName: phi 4795 # Cross-refs: genbank:acc:YP_001449293;genbank:gi:157166041;goa:Q6H9U8;interpro:IPR006433;uniprot:Q6H9U8;genbank:GeneID:5580438 Probab=94.99 E-value=0.0014 Score=36.17 Aligned_cols=335 Identities=12% Similarity=0.004 Sum_probs=135.4 Q ss_pred CCCcceeeeecC------------ccccchhhhhhcccccccccCCHHHHhhcceecccchhhcchhhhhhhhhhhhccC Q lcl|Aclame:pro 1 MKQLSKVHQSLA------------GRSVRAFDMANGKADYRLTDMAVRELKKFGLVFDHATVKRQIELLHEGGVATQAFD 68 (388) Q Consensus 1 ~~~~~~~~~~~~------------~~~~~~~~~~~~~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~amD 68 (388) +...++.+.... +|.+|.+-.... +...+...+++.+ ..+..... ....+..+..+.+ T Consensus 272 ~~~~~~~~~~~~~~~~~~~kg~~f~~~~~al~~~~g-----~~~~a~e~a~~~~--~~~~~~~~---~~~~a~~~~~~~~ 341 (645) T protein:vir:93 272 GNVAAVASAPVIRVEQKLDKGIGFARFAKSLAAAKG-----VRSEALEVARRQY--PDDSRLHH---VLKSAVGAGTTTD 341 (645) T ss_pred cccccccccccccchhhhhhhhhHHHHHHHHHhccc-----chhHHHHHHHhhc--ccchhhhh---hhhhhhhcccccc Confidence 100000000000 000111000000 0000001111111 11110000 0000000111112 Q ss_pred cccccccccccchH--HHHHHHhhcceeeeecccchhhhhhcccccCCCCc-eeeEEEeeeccccceEecccccCCceee Q lcl|Aclame:pro 69 SAYVAPTTQASIPT--PIQFLQQWLPGFVKVLTSARKIDEILGVKTVGSWE-DQEIVQGIVEPAGTAMEYGDLTNIPLSS 145 (388) Q Consensus 69 aa~~~~~t~~~~g~--l~~~l~~idp~v~e~l~~~~~~~~i~~v~t~g~w~-~~t~~~~v~e~~G~a~~ygd~~diP~~~ 145 (388) + ....|+ |..+. .+|++.+.+....+.+-.....+-.. ...+..+.....+.+...|....+|..+ T Consensus 342 ~-------~~~Gg~~vp~~~~----~~ii~~l~~~svv~~l~~~~~~~~~~~~~~~~ip~~t~~~~a~wv~Eg~~~~~s~ 410 (645) T protein:vir:93 342 P-------QWAGSLSEYQEYA----QDFIDYLRPQTIIGRFGQGGIPALRQVPFNIRVHAQVSGGAAGWVGEGKTKPLTK 410 (645) T ss_pred c-------cccCCccCchhhH----HHHHHhhhhhhhHHhhccccccccccccCceeeeeeecCcceEEeccCccccccc Confidence 1 111222 22222 24455554444444442221111111 1123445555556667788889999999 Q ss_pred eeeeeeeeeEEEEEEEEeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEEeecCccc-cceEEEeecCCCcccc Q lcl|Aclame:pro 146 WNVNFERRTIVRGEMGIQVGLLEEGRASAMRINSAEVKRQGAAVQLEIMRNAIGFYGWEGKNG-NRTFGFLNDPSLLPAI 224 (388) Q Consensus 146 ~n~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~K~~aAr~a~~~~~n~i~~~G~a~~~~-~g~~GllN~P~l~a~~ 224 (388) ...+...-+.+.+.....++.+=|+. ...++.+--....+.++...+++-.+.|+..... ..-.|++|.- T Consensus 411 ~~f~~v~l~~~kla~~~~iS~ell~d---s~~~~~~~i~~~l~~aia~~~d~a~l~g~g~~~~~~~p~gi~~~~------ 481 (645) T protein:vir:93 411 FDFESITFSHAKVSAIAVLTEELIRF---SSPAADALVRNALAEAVVARLDTDFVDPKKAAVADVSPASITHDV------ 481 (645) T ss_pred cceeEEEEeeEEEEEeehhHHHHHhh---chHHHHHHHHHHHHHHHHHHHHHHhhcCCCcccCCccccceeccc------ Confidence 99999999999999888888743332 3466777777888888888888888877532110 0122333310 Q ss_pred ccccCCcccccccCCHHHHHHHHHHHHHHHHHhcCCeeccccccceEEcCHHHHHhhccC-CCcCccHHHHHHHhCCccE Q lcl|Aclame:pro 225 ASTTPGGWVSGGANAFQGIVGDLRLMLITLRVQSEDNIDPEDVDITLVLPMNKVDMLSVV-TDLGISVRDWLKQTYPRVR 303 (388) Q Consensus 225 ~~~~~~~~t~Wa~kT~~eI~~DI~~~~~~l~~~s~g~v~~~~~p~tL~Lp~~~~~~Ls~~-~~~~~Tvl~~lk~n~pnl~ 303 (388) .. .++......|+..++..+..+.. .+.. ...+|.|..+..|.+. +..|.-++--+-.. +-+ T Consensus 482 -~~---------~~~~~~~~~d~~~~~~~~~~a~~---~~~~--a~~vmn~~~~~~L~~lkd~~G~~~~~~~~~~--~~t 544 (645) T protein:vir:93 482 -KG---------TASSGNPDADAEAAFGQFVAANL---QPTG--AVWLMSSTNALALSMRKNALGQKEYPDMTLL--GGS 544 (645) T ss_pred -cc---------cccccchHHHHHHHHHHHHhcCC---Cccc--cEEEEcHHHHHHHHhccccCCceeecCCCCC--Cce Confidence 00 11111234688888887765542 1111 2478999988888653 33343221000000 012 Q ss_pred EEEcccccc----cc--CCCCccEEEEEEcccccccccccCCCcceEee-cch-hh------hccCceeccCceEEeccc Q lcl|Aclame:pro 304 VMSAPELQG----GN--PDDGKDIAYMFLDSVDTAVDGSTDGGDTWAQL-VQS-KF------VTLGVEKRVKNYVEAYSN 369 (388) Q Consensus 304 i~~~pel~~----a~--gtg~~~~~~~~~~~~d~~~~~~~~~~~t~~~~-~p~-~~------r~~~v~~~~~~~~~~~~~ 369 (388) +-..|=+.. ++ -.+.+++.+ ...-+..+..+.+ -+++.. .|. .+ ..+.. ++.--+-+.++. T Consensus 545 L~G~PV~~s~~vp~~~~~gd~s~~~i--g~~~~v~i~~s~~--a~~~~~~~~~~~~~~~~~~~~v~l-f~~d~vaira~~ 619 (645) T protein:vir:93 545 FQGLPVIVSQYVGDQLVLVNAPDIYL--ADDGGVAVDMSRE--ASLEMQSEPTGDSTTPSPVELVSM-FQTGSVAIRAER 619 (645) T ss_pred eeceeeEEeccCCcceeEeccccEEE--EEecceEEEeecc--eeEEEeecccccccccccccchhH-hhcCceEEEEEE Confidence 222221111 00 001122221 1111111111110 001000 000 00 00000 111112334555 Q ss_pred ceeeeeeeccccceeeccC Q lcl|Aclame:pro 370 ATAGVMLKRPWAVVRLIGL 388 (388) Q Consensus 370 ~t~G~ii~rP~ai~~~~GI 388 (388) |+++. +++|-||++++|+ T Consensus 620 r~d~~-~~~p~a~~~lt~~ 637 (645) T protein:vir:93 620 WINWR-RRRTAAVAVITGV 637 (645) T ss_pred EEcce-eeCccceEEEecc Confidence 55544 5889999999999 No 66 >protein:vir:98339 Length: 415 # NCBI annotation: putative capsid protein # Family: family:all:21 # MgeID: mge:1581 # MgeName: phiPVL(108) # Cross-refs: genbank:acc:YP_918931;genbank:gi:119443693;genbank:GeneID:4594501 Probab=94.93 E-value=0.002 Score=35.39 Aligned_cols=324 Identities=8% Similarity=-0.034 Sum_probs=135.8 Q ss_pred CCCcceeeeecCccccchhhhhhcccccccccCCHHHHhhcceecccchhhcchhhhhhhhhhhhccCcccccccccccc Q lcl|Aclame:pro 1 MKQLSKVHQSLAGRSVRAFDMANGKADYRLTDMAVRELKKFGLVFDHATVKRQIELLHEGGVATQAFDSAYVAPTTQASI 80 (388) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~amDaa~~~~~t~~~~ 80 (388) +..-.+..-....+..+.. .+....+-.+.... ...+......-.....+.......+..+. T Consensus 68 ~~~~~~~~~~~~~~~~~~~----------------~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~g 129 (415) T protein:vir:98 68 SENNQQSVEVNEARTYRNQ----------------ANINDLGISIQNTK--VTSQEVRDFTEYLETRNDIQGGSLKTDSG 129 (415) T ss_pred hhhcccccccchhhhHHHH----------------HHHHHHhhhhhhhh--hHHHHHHHHHHHHhhhhhhhhcccccccc Confidence 1110000000000100000 00000000010000 00001111100111111111112233333 Q ss_pred h--HHHHHHHhhcceeeeecccchhhhhhcccccCCCCceeeEEEeeec--cccceEecccccCCceee-eeeeeeeeeE Q lcl|Aclame:pro 81 P--TPIQFLQQWLPGFVKVLTSARKIDEILGVKTVGSWEDQEIVQGIVE--PAGTAMEYGDLTNIPLSS-WNVNFERRTI 155 (388) Q Consensus 81 g--~l~~~l~~idp~v~e~l~~~~~~~~i~~v~t~g~w~~~t~~~~v~e--~~G~a~~ygd~~diP~~~-~n~~~~~~~v 155 (388) | +|..+ .++|++.+......++++.+..... ....+.+.. ..+.+...+...++|-.+ ...+.....+ T Consensus 130 g~~iP~~~----~~~ii~~~~~~~~l~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~v~E~~~~~~~~~~~~~~v~~~~ 202 (415) T protein:vir:98 130 FVVIPEEI----VTDILKLKEVEFNLDKYVTVKRVTN---GSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQLAYDI 202 (415) T ss_pred ccccchHH----HHHHHHHHHhhhhhhhheeeeeccC---CceeEEEEeecCCccceeeccccccCcccccceeeEEeee Confidence 4 55433 4466666666666666655533211 111233322 333445667778888554 5788888999 Q ss_pred EEEEEEEeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEEeecCccccceEEEeecCCCccccccccCCccccc Q lcl|Aclame:pro 156 VRGEMGIQVGLLEEGRASAMRINSAEVKRQGAAVQLEIMRNAIGFYGWEGKNGNRTFGFLNDPSLLPAIASTTPGGWVSG 235 (388) Q Consensus 156 ~~~~~~~~y~~~El~~A~~~g~~l~~~K~~aAr~a~~~~~n~i~~~G~a~~~~~g~~GllN~P~l~a~~~~~~~~~~t~W 235 (388) +.++..+.++.+=++ ....++.+.-......++.+.+|+-.+.|+.... ...++++...... + +. T Consensus 203 ~k~~~~~~iS~ell~---ds~~~l~~~i~~~l~~~~~~~~~~~il~g~g~g~--~~~~~~~~~~~~~--~-~~------- 267 (415) T protein:vir:98 203 NTHRGYFRISREAIE---DAKVNVLQELKLWMARTIAATRNKAIIDVITKGS--TGSTSSGFEKEGK--K-LE------- 267 (415) T ss_pred eeeEeeehhhHHHHh---hchHHHHHHHHHHHHHHHHHHHHHHHhhccccCc--ccccccccccccc--c-cc------- Confidence 999988888885332 2355788888888888888888988888874321 1223222211111 0 00 Q ss_pred ccCCHHHHHHHHHHHHHHHHHhcCCeeccccccceEEcCHHHHHhhccC-CCcCccHHHH-HHHhCC----ccEEEEccc Q lcl|Aclame:pro 236 GANAFQGIVGDLRLMLITLRVQSEDNIDPEDVDITLVLPMNKVDMLSVV-TDLGISVRDW-LKQTYP----RVRVMSAPE 309 (388) Q Consensus 236 a~kT~~eI~~DI~~~~~~l~~~s~g~v~~~~~p~tL~Lp~~~~~~Ls~~-~~~~~Tvl~~-lk~n~p----nl~i~~~pe 309 (388) .+...-++||.+++..+...- ..+..++|.++.+..|.+. +..|.-++.- +....+ +..++..+. T Consensus 268 --~~~~~~~~~i~~~~~~~~~~~-------~~~~~~v~n~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~l~G~pV~~~~~ 338 (415) T protein:vir:98 268 --VKKAKSLDDIKDAINLNVKPN-------YEHNVAIVSQTMFAKLDKMKDKLGNYLIQPDVKEKTQQRLLGAKIEILPD 338 (415) T ss_pred --cccccchhHHHHHHHhhhhhc-------cCCCEEEEcHHHHHHHHHhhccCCceeeccCcCCCCCceecceeeEEecc Confidence 011112667777777765321 1245789999999988642 3334322210 001011 112333332 Q ss_pred cccccCCCCccEEEEEEcccccccccccCCCcceEeecchhhhccCceeccCceEEecccceeeeeeeccccceeeccC Q lcl|Aclame:pro 310 LQGGNPDDGKDIAYMFLDSVDTAVDGSTDGGDTWAQLVQSKFVTLGVEKRVKNYVEAYSNATAGVMLKRPWAVVRLIGL 388 (388) Q Consensus 310 l~~a~gtg~~~~~~~~~~~~d~~~~~~~~~~~t~~~~~p~~~r~~~v~~~~~~~~~~~~~~t~G~ii~rP~ai~~~~GI 388 (388) +-. + ++++..++|.+ .......... ..+...+ .++.... ...-...|. |+.+++|-||+.+.-- T Consensus 339 ~~~--~-~~~~~~~~~Gd-~~~~~~~~~~--~~~~v~~-~~~~~~~-------~~~~~~~r~-d~~v~~~~a~~~~~~~ 402 (415) T protein:vir:98 339 EVL--G-QKGNNTLIIGN-LKDAIVLFDR--SQYQASW-TDYMHFG-------ECLMIAVRQ-DCRILDYKSAIVIEYD 402 (415) T ss_pred ccc--C-CCCccEEEEEe-hhccEEEEee--cceEEEE-eccccCc-------eEEEEEEEe-ccEEeccccEEEEEEe Confidence 211 1 22333344432 1110000000 0011111 1111111 011223354 4556679999888655 No 67 >protein:vir:81100 Length: 415 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:1891 # MgeName: tp310-1 # Cross-refs: genbank:acc:YP_001429874;genbank:gi:156603927;genbank:GeneID:5525320 Probab=94.93 E-value=0.002 Score=35.39 Aligned_cols=324 Identities=8% Similarity=-0.034 Sum_probs=135.8 Q ss_pred CCCcceeeeecCccccchhhhhhcccccccccCCHHHHhhcceecccchhhcchhhhhhhhhhhhccCcccccccccccc Q lcl|Aclame:pro 1 MKQLSKVHQSLAGRSVRAFDMANGKADYRLTDMAVRELKKFGLVFDHATVKRQIELLHEGGVATQAFDSAYVAPTTQASI 80 (388) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~amDaa~~~~~t~~~~ 80 (388) +..-.+..-....+..+.. .+....+-.+.... ...+......-.....+.......+..+. T Consensus 68 ~~~~~~~~~~~~~~~~~~~----------------~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~g 129 (415) T protein:vir:81 68 SENNQQSVEVNEARTYRNQ----------------ANINDLGISIQNTK--VTSQEVRDFTEYLETRNDIQGGSLKTDSG 129 (415) T ss_pred hhhcccccccchhhhHHHH----------------HHHHHHhhhhhhhh--hHHHHHHHHHHHHhhhhhhhhcccccccc Confidence 1110000000000100000 00000000010000 00001111100111111111112233333 Q ss_pred h--HHHHHHHhhcceeeeecccchhhhhhcccccCCCCceeeEEEeeec--cccceEecccccCCceee-eeeeeeeeeE Q lcl|Aclame:pro 81 P--TPIQFLQQWLPGFVKVLTSARKIDEILGVKTVGSWEDQEIVQGIVE--PAGTAMEYGDLTNIPLSS-WNVNFERRTI 155 (388) Q Consensus 81 g--~l~~~l~~idp~v~e~l~~~~~~~~i~~v~t~g~w~~~t~~~~v~e--~~G~a~~ygd~~diP~~~-~n~~~~~~~v 155 (388) | +|..+ .++|++.+......++++.+..... ....+.+.. ..+.+...+...++|-.+ ...+.....+ T Consensus 130 g~~iP~~~----~~~ii~~~~~~~~l~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~v~E~~~~~~~~~~~~~~v~~~~ 202 (415) T protein:vir:81 130 FVVIPEEI----VTDILKLKEVEFNLDKYVTVKRVTN---GSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQLAYDI 202 (415) T ss_pred ccccchHH----HHHHHHHHHhhhhhhhheeeeeccC---CceeEEEEeecCCccceeeccccccCcccccceeeEEeee Confidence 4 55433 4466666666666666655533211 111233322 333445667778888554 5788888999 Q ss_pred EEEEEEEeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEEeecCccccceEEEeecCCCccccccccCCccccc Q lcl|Aclame:pro 156 VRGEMGIQVGLLEEGRASAMRINSAEVKRQGAAVQLEIMRNAIGFYGWEGKNGNRTFGFLNDPSLLPAIASTTPGGWVSG 235 (388) Q Consensus 156 ~~~~~~~~y~~~El~~A~~~g~~l~~~K~~aAr~a~~~~~n~i~~~G~a~~~~~g~~GllN~P~l~a~~~~~~~~~~t~W 235 (388) +.++..+.++.+=++ ....++.+.-......++.+.+|+-.+.|+.... ...++++...... + +. T Consensus 203 ~k~~~~~~iS~ell~---ds~~~l~~~i~~~l~~~~~~~~~~~il~g~g~g~--~~~~~~~~~~~~~--~-~~------- 267 (415) T protein:vir:81 203 NTHRGYFRISREAIE---DAKVNVLQELKLWMARTIAATRNKAIIDVITKGS--TGSTSSGFEKEGK--K-LE------- 267 (415) T ss_pred eeeEeeehhhHHHHh---hchHHHHHHHHHHHHHHHHHHHHHHHhhccccCc--ccccccccccccc--c-cc------- Confidence 999988888885332 2355788888888888888888988888874321 1223222211111 0 00 Q ss_pred ccCCHHHHHHHHHHHHHHHHHhcCCeeccccccceEEcCHHHHHhhccC-CCcCccHHHH-HHHhCC----ccEEEEccc Q lcl|Aclame:pro 236 GANAFQGIVGDLRLMLITLRVQSEDNIDPEDVDITLVLPMNKVDMLSVV-TDLGISVRDW-LKQTYP----RVRVMSAPE 309 (388) Q Consensus 236 a~kT~~eI~~DI~~~~~~l~~~s~g~v~~~~~p~tL~Lp~~~~~~Ls~~-~~~~~Tvl~~-lk~n~p----nl~i~~~pe 309 (388) .+...-++||.+++..+...- ..+..++|.++.+..|.+. +..|.-++.- +....+ +..++..+. T Consensus 268 --~~~~~~~~~i~~~~~~~~~~~-------~~~~~~v~n~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~l~G~pV~~~~~ 338 (415) T protein:vir:81 268 --VKKAKSLDDIKDAINLNVKPN-------YEHNVAIVSQTMFAKLDKMKDKLGNYLIQPDVKEKTQQRLLGAKIEILPD 338 (415) T ss_pred --cccccchhHHHHHHHhhhhhc-------cCCCEEEEcHHHHHHHHHhhccCCceeeccCcCCCCCceecceeeEEecc Confidence 011112667777777765321 1245789999999988642 3334322210 001011 112333332 Q ss_pred cccccCCCCccEEEEEEcccccccccccCCCcceEeecchhhhccCceeccCceEEecccceeeeeeeccccceeeccC Q lcl|Aclame:pro 310 LQGGNPDDGKDIAYMFLDSVDTAVDGSTDGGDTWAQLVQSKFVTLGVEKRVKNYVEAYSNATAGVMLKRPWAVVRLIGL 388 (388) Q Consensus 310 l~~a~gtg~~~~~~~~~~~~d~~~~~~~~~~~t~~~~~p~~~r~~~v~~~~~~~~~~~~~~t~G~ii~rP~ai~~~~GI 388 (388) +-. + ++++..++|.+ .......... ..+...+ .++.... ...-...|. |+.+++|-||+.+.-- T Consensus 339 ~~~--~-~~~~~~~~~Gd-~~~~~~~~~~--~~~~v~~-~~~~~~~-------~~~~~~~r~-d~~v~~~~a~~~~~~~ 402 (415) T protein:vir:81 339 EVL--G-QKGNNTLIIGN-LKDAIVLFDR--SQYQASW-TDYMHFG-------ECLMIAVRQ-DCRILDYKSAIVIEYD 402 (415) T ss_pred ccc--C-CCCccEEEEEe-hhccEEEEee--cceEEEE-eccccCc-------eEEEEEEEe-ccEEeccccEEEEEEe Confidence 211 1 22333344432 1110000000 0011111 1111111 011223354 4556679999888655 No 68 >protein:vir:79987 Length: 415 # NCBI annotation: head protein # Family: family:all:21 # MgeID: mge:1875 # MgeName: tp310-3 # Cross-refs: genbank:acc:YP_001430002;genbank:gi:156604057;genbank:GeneID:5525447 Probab=94.93 E-value=0.002 Score=35.39 Aligned_cols=324 Identities=8% Similarity=-0.034 Sum_probs=135.8 Q ss_pred CCCcceeeeecCccccchhhhhhcccccccccCCHHHHhhcceecccchhhcchhhhhhhhhhhhccCcccccccccccc Q lcl|Aclame:pro 1 MKQLSKVHQSLAGRSVRAFDMANGKADYRLTDMAVRELKKFGLVFDHATVKRQIELLHEGGVATQAFDSAYVAPTTQASI 80 (388) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~amDaa~~~~~t~~~~ 80 (388) +..-.+..-....+..+.. .+....+-.+.... ...+......-.....+.......+..+. T Consensus 68 ~~~~~~~~~~~~~~~~~~~----------------~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~g 129 (415) T protein:vir:79 68 SENNQQSVEVNEARTYRNQ----------------ANINDLGISIQNTK--VTSQEVRDFTEYLETRNDIQGGSLKTDSG 129 (415) T ss_pred hhhcccccccchhhhHHHH----------------HHHHHHhhhhhhhh--hHHHHHHHHHHHHhhhhhhhhcccccccc Confidence 1110000000000100000 00000000010000 00001111100111111111112233333 Q ss_pred h--HHHHHHHhhcceeeeecccchhhhhhcccccCCCCceeeEEEeeec--cccceEecccccCCceee-eeeeeeeeeE Q lcl|Aclame:pro 81 P--TPIQFLQQWLPGFVKVLTSARKIDEILGVKTVGSWEDQEIVQGIVE--PAGTAMEYGDLTNIPLSS-WNVNFERRTI 155 (388) Q Consensus 81 g--~l~~~l~~idp~v~e~l~~~~~~~~i~~v~t~g~w~~~t~~~~v~e--~~G~a~~ygd~~diP~~~-~n~~~~~~~v 155 (388) | +|..+ .++|++.+......++++.+..... ....+.+.. ..+.+...+...++|-.+ ...+.....+ T Consensus 130 g~~iP~~~----~~~ii~~~~~~~~l~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~v~E~~~~~~~~~~~~~~v~~~~ 202 (415) T protein:vir:79 130 FVVIPEEI----VTDILKLKEVEFNLDKYVTVKRVTN---GSGKYPVVRQSEVAALEKVEELEENPELAVKPFFQLAYDI 202 (415) T ss_pred ccccchHH----HHHHHHHHHhhhhhhhheeeeeccC---CceeEEEEeecCCccceeeccccccCcccccceeeEEeee Confidence 4 55433 4466666666666666655533211 111233322 333445667778888554 5788888999 Q ss_pred EEEEEEEeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEEeecCccccceEEEeecCCCccccccccCCccccc Q lcl|Aclame:pro 156 VRGEMGIQVGLLEEGRASAMRINSAEVKRQGAAVQLEIMRNAIGFYGWEGKNGNRTFGFLNDPSLLPAIASTTPGGWVSG 235 (388) Q Consensus 156 ~~~~~~~~y~~~El~~A~~~g~~l~~~K~~aAr~a~~~~~n~i~~~G~a~~~~~g~~GllN~P~l~a~~~~~~~~~~t~W 235 (388) +.++..+.++.+=++ ....++.+.-......++.+.+|+-.+.|+.... ...++++...... + +. T Consensus 203 ~k~~~~~~iS~ell~---ds~~~l~~~i~~~l~~~~~~~~~~~il~g~g~g~--~~~~~~~~~~~~~--~-~~------- 267 (415) T protein:vir:79 203 NTHRGYFRISREAIE---DAKVNVLQELKLWMARTIAATRNKAIIDVITKGS--TGSTSSGFEKEGK--K-LE------- 267 (415) T ss_pred eeeEeeehhhHHHHh---hchHHHHHHHHHHHHHHHHHHHHHHHhhccccCc--ccccccccccccc--c-cc------- Confidence 999988888885332 2355788888888888888888988888874321 1223222211111 0 00 Q ss_pred ccCCHHHHHHHHHHHHHHHHHhcCCeeccccccceEEcCHHHHHhhccC-CCcCccHHHH-HHHhCC----ccEEEEccc Q lcl|Aclame:pro 236 GANAFQGIVGDLRLMLITLRVQSEDNIDPEDVDITLVLPMNKVDMLSVV-TDLGISVRDW-LKQTYP----RVRVMSAPE 309 (388) Q Consensus 236 a~kT~~eI~~DI~~~~~~l~~~s~g~v~~~~~p~tL~Lp~~~~~~Ls~~-~~~~~Tvl~~-lk~n~p----nl~i~~~pe 309 (388) .+...-++||.+++..+...- ..+..++|.++.+..|.+. +..|.-++.- +....+ +..++..+. T Consensus 268 --~~~~~~~~~i~~~~~~~~~~~-------~~~~~~v~n~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~l~G~pV~~~~~ 338 (415) T protein:vir:79 268 --VKKAKSLDDIKDAINLNVKPN-------YEHNVAIVSQTMFAKLDKMKDKLGNYLIQPDVKEKTQQRLLGAKIEILPD 338 (415) T ss_pred --cccccchhHHHHHHHhhhhhc-------cCCCEEEEcHHHHHHHHHhhccCCceeeccCcCCCCCceecceeeEEecc Confidence 011112667777777765321 1245789999999988642 3334322210 001011 112333332 Q ss_pred cccccCCCCccEEEEEEcccccccccccCCCcceEeecchhhhccCceeccCceEEecccceeeeeeeccccceeeccC Q lcl|Aclame:pro 310 LQGGNPDDGKDIAYMFLDSVDTAVDGSTDGGDTWAQLVQSKFVTLGVEKRVKNYVEAYSNATAGVMLKRPWAVVRLIGL 388 (388) Q Consensus 310 l~~a~gtg~~~~~~~~~~~~d~~~~~~~~~~~t~~~~~p~~~r~~~v~~~~~~~~~~~~~~t~G~ii~rP~ai~~~~GI 388 (388) +-. + ++++..++|.+ .......... ..+...+ .++.... ...-...|. |+.+++|-||+.+.-- T Consensus 339 ~~~--~-~~~~~~~~~Gd-~~~~~~~~~~--~~~~v~~-~~~~~~~-------~~~~~~~r~-d~~v~~~~a~~~~~~~ 402 (415) T protein:vir:79 339 EVL--G-QKGNNTLIIGN-LKDAIVLFDR--SQYQASW-TDYMHFG-------ECLMIAVRQ-DCRILDYKSAIVIEYD 402 (415) T ss_pred ccc--C-CCCccEEEEEe-hhccEEEEee--cceEEEE-eccccCc-------eEEEEEEEe-ccEEeccccEEEEEEe Confidence 211 1 22333344432 1110000000 0011111 1111111 011223354 4556679999888655 No 69 >protein:vir:104256 Length: 458 # NCBI annotation: major head protein precursor # Family: family:all:27070 # MgeID: mge:1504 # MgeName: T5 # Cross-refs: genbank:acc:YP_006977;genbank:gi:46401878;genbank:GeneID:2777673 Probab=94.84 E-value=0.0034 Score=34.14 Aligned_cols=333 Identities=11% Similarity=-0.024 Sum_probs=136.9 Q ss_pred CCC----ccee-eeecCccccchhhhhhcccccccccCCHHHHhhcceecccchh-hcchhhhhhhhhhhhccCcccccc Q lcl|Aclame:pro 1 MKQ----LSKV-HQSLAGRSVRAFDMANGKADYRLTDMAVRELKKFGLVFDHATV-KRQIELLHEGGVATQAFDSAYVAP 74 (388) Q Consensus 1 ~~~----~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~g~~~~~~~~-~~~~~~~~~~~~~~~amDaa~~~~ 74 (388) +.+ .... .-....+......... ......++++.. +.+... ....+... ...+..++..+ T Consensus 99 ~~~~~~~~~~~e~~~~~~~~~~~~~~~~-------~~~~~~~~e~~~--~~~~~~~~~~~~~~~-~~~~~~a~~~~---- 164 (458) T protein:vir:10 99 QDEIKSLLTAREGRSFVGDSVAKALYGT-------QENFEDEVEKLV--LLSYVMEKGVFETEH-GQRHLKAVNQS---- 164 (458) T ss_pred HHHHHHHHHHHHhhhhhhhhhhccchhh-------hhhHHHHHHHHH--HHHHHHhhccchhhh-hhhhhhhhhhc---- Confidence 000 0000 0000000000000000 000001111100 000000 00000000 00011122211 Q ss_pred cccccch--HHHHHHHhhcceeeeecccchhhhhhcccccCCCCceeeEEEeeeccccceEecccccCCc------eeee Q lcl|Aclame:pro 75 TTQASIP--TPIQFLQQWLPGFVKVLTSARKIDEILGVKTVGSWEDQEIVQGIVEPAGTAMEYGDLTNIP------LSSW 146 (388) Q Consensus 75 ~t~~~~g--~l~~~l~~idp~v~e~l~~~~~~~~i~~v~t~g~w~~~t~~~~v~e~~G~a~~ygd~~diP------~~~~ 146 (388) .+.+..| +|.. +.+.|++.+......+++..+...+. ....+.+....+.+...+.+...| ..+. T Consensus 165 ~~~~~g~~~ip~~----~~~~ii~~~~~~~~l~~~~~~~~~~~---~~~~~~~~~~~~~a~~v~e~~~~~~~~~~~~~~~ 237 (458) T protein:vir:10 165 SSVEVSSESYETI----FSQRIIRDLQKELVVGALFEELPMSS---KILTMLVEPDAGKATWVAASTYGTDTTTGEEVKG 237 (458) T ss_pred ccCccccceehhh----HhHHHHHHHHhhhhHHhhcceeecCC---cceEEEEecCCcceeecccccccccccccccccc Confidence 1112222 4433 34456666655555566654433221 233455544445555666554444 3344 Q ss_pred eeeeeeeeEEEEEEEEeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEEeecCccccceEEEeecCCCcccccc Q lcl|Aclame:pro 147 NVNFERRTIVRGEMGIQVGLLEEGRASAMRINSAEVKRQGAAVQLEIMRNAIGFYGWEGKNGNRTFGFLNDPSLLPAIAS 226 (388) Q Consensus 147 n~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~K~~aAr~a~~~~~n~i~~~G~a~~~~~g~~GllN~P~l~a~~~~ 226 (388) .........+.++..+.++.+=+.. ...++.+.-......++...+|+-.++|+.- ....|++|+++.....++ T Consensus 238 ~~~~i~~~~~k~~~~v~is~ell~d---s~~~~~~~i~~~l~~~i~~~~d~~~l~G~G~---~~p~Gi~~~~~~~~~~~~ 311 (458) T protein:vir:10 238 ALKEIHFSTYKLAAKSFITDETEED---AIFSLLPLLRKRLIEAHAVSIEEAFMTGDGS---GKPKGLLTLASEDSAKVV 311 (458) T ss_pred cceeeEeeeeeEEeeehhhHHHHhc---chHHHHHHHHHHHHHHHHHHHHHHhhcCCCC---Cccceeeeccccccccee Confidence 4566667778888888888853332 2356888888888888899999999999742 357899999887543222 Q ss_pred ccCCcccccccCCHHHHHHHHHHHHHHHHHhcCCeeccccccceEEcCHHHHHhhccC-CCcCccHHHHHHHh------- Q lcl|Aclame:pro 227 TTPGGWVSGGANAFQGIVGDLRLMLITLRVQSEDNIDPEDVDITLVLPMNKVDMLSVV-TDLGISVRDWLKQT------- 298 (388) Q Consensus 227 ~~~~~~t~Wa~kT~~eI~~DI~~~~~~l~~~s~g~v~~~~~p~tL~Lp~~~~~~Ls~~-~~~~~Tvl~~lk~n------- 298 (388) ....+. .-...| ++||.+++..+...-. .+..++|.+..+..|... +..|.-++..-..+ T Consensus 312 ~~~~~~-~~~~~~----~~~i~~~~~~l~~~~~-------~~~~~v~~~~~~~~l~~lkd~~G~~i~~~~~~~~~~~~~~ 379 (458) T protein:vir:10 312 TEAKAD-GSVLVT----AKTISKLRRKLGRHGL-------KLSKLVLIVSMDAYYDLLEDEEWQDVAQVGNDSVKLQGQV 379 (458) T ss_pred eccccc-cccccc----HHHHHHHHHhhhhhhc-------CCCEEEEcHHHHHHHHhhcccCCceeeccccccccccCcC Confidence 211110 111122 4666667766643211 234689999999988643 33333232211111 Q ss_pred --CCccEEEEccccccccCCCCccEEEEEEcccccccccccCCCcceEeecchhhhccCceeccCce-EEecccceeeee Q lcl|Aclame:pro 299 --YPRVRVMSAPELQGGNPDDGKDIAYMFLDSVDTAVDGSTDGGDTWAQLVQSKFVTLGVEKRVKNY-VEAYSNATAGVM 375 (388) Q Consensus 299 --~pnl~i~~~pel~~a~gtg~~~~~~~~~~~~d~~~~~~~~~~~t~~~~~p~~~r~~~v~~~~~~~-~~~~~~~t~G~i 375 (388) +....|+....+ .+ +++.++. +|..-.+..... + +....+...+|.. +.+ .+-...| -|.. T Consensus 380 ~~l~G~pv~~~~~~-p~-~~~~~~~--~~~~f~~~~~~~--~--~~~~~v~~d~~~~-------~~~~~~~~~~r-~~~~ 443 (458) T protein:vir:10 380 GRIYGLPVVVSEYF-PA-KANSAEF--AVIVYKDNFVMP--R--QRAVTVERERQAG-------KQRDAYYVTQR-VNLQ 443 (458) T ss_pred ceecceeeEEcccc-cc-ccCCcce--EEEEecccEEEE--E--eeceEEEeecccC-------CCceEEEEEEE-ecce Confidence 111223322111 11 1122222 222111110000 0 0001111111111 111 2222233 5778 Q ss_pred eeccccceeeccC Q lcl|Aclame:pro 376 LKRPWAVVRLIGL 388 (388) Q Consensus 376 i~rP~ai~~~~GI 388 (388) +++|.+|+..+== T Consensus 444 v~~~~a~v~~~~a 456 (458) T protein:vir:10 444 RYFANGVVSGTYA 456 (458) T ss_pred EecccceEEEeec Confidence 8889998772211 No 70 >protein:vir:9410 Length: 415 # NCBI annotation: head protein # Family: family:all:21 # MgeID: mge:167 # MgeName: phi 13 # Cross-refs: genbank:acc:NP_803388;genbank:gi:29028700;genbank:GeneID:1258136 Probab=94.78 E-value=0.00084 Score=37.45 Aligned_cols=342 Identities=7% Similarity=-0.040 Sum_probs=137.6 Q ss_pred CCCcceeeeecCccc-----cch---hhhhhccccccccc---CCHHHHhhcceecccchhhcchhhhhhhhhhhhccCc Q lcl|Aclame:pro 1 MKQLSKVHQSLAGRS-----VRA---FDMANGKADYRLTD---MAVRELKKFGLVFDHATVKRQIELLHEGGVATQAFDS 69 (388) Q Consensus 1 ~~~~~~~~~~~~~~~-----~~~---~~~~~~~~~~~~~~---~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~amDa 69 (388) +.++......+..+. ++. ....+......... .........+..+.+... ..+....+.-....... T Consensus 41 ~~ei~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~e~~~~~~~~~~~~~ 118 (415) T protein:vir:94 41 EQEITDLRSQIQEKQEELDKLKEKDGTSENNQQSVEVNEASTYRNQANINDLGISIQNTKV--TSQEVRDFTEYLETRND 118 (415) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccccchhhHHHHHHHHHHHhhhhhhhh--hHHHHHHHHHHhhhhhh Confidence 000000000000000 000 00000000000000 000000001111110000 00000000000000010 Q ss_pred ccccccccccch--HHHHHHHhhcceeeeecccchhhhhhcccccCCCCceeeEEEeeeccccceEecccccCCcee-ee Q lcl|Aclame:pro 70 AYVAPTTQASIP--TPIQFLQQWLPGFVKVLTSARKIDEILGVKTVGSWEDQEIVQGIVEPAGTAMEYGDLTNIPLS-SW 146 (388) Q Consensus 70 a~~~~~t~~~~g--~l~~~l~~idp~v~e~l~~~~~~~~i~~v~t~g~w~~~t~~~~v~e~~G~a~~ygd~~diP~~-~~ 146 (388) ...+..+..+.| +|. .+.++|++.+......++++++..... ....+.+......+.+...+...++|-. .. T Consensus 119 ~~~~~~~~~~g~~~iP~----~~~~~ii~~~~~~~~l~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~v~Eg~~~~~~~~~ 193 (415) T protein:vir:94 119 IQGGSLKTDSGFVVIPE----EIVTDILKLKEVEFNLDKYVTVKRVTN-GSGKYPVVRQSEVAALEKVEELEENPELAVK 193 (415) T ss_pred hhhhccccccccccCcH----HHHHHHHHHHHhhhhhhhhcceeeccC-CceeEEEEeecCCccceeccccccccccccc Confidence 111111222333 453 345577777777777777765543221 0112222233344456677778888854 45 Q ss_pred eeeeeeeeEEEEEEEEeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEEeecCccccceEEEeecCCCcccccc Q lcl|Aclame:pro 147 NVNFERRTIVRGEMGIQVGLLEEGRASAMRINSAEVKRQGAAVQLEIMRNAIGFYGWEGKNGNRTFGFLNDPSLLPAIAS 226 (388) Q Consensus 147 n~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~K~~aAr~a~~~~~n~i~~~G~a~~~~~g~~GllN~P~l~a~~~~ 226 (388) ..+.....++.++..+.++.+=++ ....++.+.-....+.++...+|+-.+.|+.... ...++.+...... + T Consensus 194 ~~~~i~~~~~k~~~~~~is~ell~---ds~~~~~~~i~~~l~~~~~~~~~~~il~g~g~g~--~~~~~~~~~~~~~--~- 265 (415) T protein:vir:94 194 PFFQLAYDINTHRGYFRISREAIE---DAKVNVLQELKLWMARTIAATRNKAIIDVITKGS--TGSTSSGFEKEGK--K- 265 (415) T ss_pred cceeeEeeheeeeeechhhHHHHh---hchHHHHHHHHHHHHHHHHHHHHHHHhhccccCc--ccccccccccccc--c- Confidence 788889999999998888885333 2345788888888888888899988888865322 1222222111110 0 Q ss_pred ccCCcccccccCCHHHHHHHHHHHHHHHHHhcCCeeccccccceEEcCHHHHHhhccC-CCcCccHHH-HHHHh----CC Q lcl|Aclame:pro 227 TTPGGWVSGGANAFQGIVGDLRLMLITLRVQSEDNIDPEDVDITLVLPMNKVDMLSVV-TDLGISVRD-WLKQT----YP 300 (388) Q Consensus 227 ~~~~~~t~Wa~kT~~eI~~DI~~~~~~l~~~s~g~v~~~~~p~tL~Lp~~~~~~Ls~~-~~~~~Tvl~-~lk~n----~p 300 (388) +. .+...-++||.+++..+...- ..+..++|.++.+..|... +..|.-++. -+... .. T Consensus 266 --------~~-~~~~~~~~~i~~~~~~~~~~~-------~~~~~~vmn~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~l~ 329 (415) T protein:vir:94 266 --------LE-VKKAKSLDDIKDAINLNVKPN-------YEHNVAIVSQTMFAKLDKMKDKLGNYLIQPDVKEKTQQRLL 329 (415) T ss_pred --------cc-cccccchHHHHHHHHhhhhhc-------cCCCEEEEcHHHHHHHHHhhccCCCeeeccCcCCCCCceec Confidence 01 011112667777877765321 1245799999999988653 434443321 01100 11 Q ss_pred ccEEEEccccccccCCCCccEEEEEEcccccccccccCCCcceEeecchhhhccCceeccCceEEecccceeeeeeeccc Q lcl|Aclame:pro 301 RVRVMSAPELQGGNPDDGKDIAYMFLDSVDTAVDGSTDGGDTWAQLVQSKFVTLGVEKRVKNYVEAYSNATAGVMLKRPW 380 (388) Q Consensus 301 nl~i~~~pel~~a~gtg~~~~~~~~~~~~d~~~~~~~~~~~t~~~~~p~~~r~~~v~~~~~~~~~~~~~~t~G~ii~rP~ 380 (388) ...++..+.+-. + .+++..++|.+ ........... .+...+ .++.... ...-...|. |+.+.+|- T Consensus 330 G~pV~~~~~~~~--~-~~~~~~i~~gd-~~~~~~~~~~~--~~~v~~-~~~~~~~-------~~~r~~~r~-d~~~~~~~ 394 (415) T protein:vir:94 330 GAKIEILPDEVL--G-QKGNNTLIIGN-LKDAIVLFDRS--QYQASW-TDYMHFG-------ECLMIAVRQ-DCRILDYK 394 (415) T ss_pred ceeeEEeccccc--C-CCCccEEEEEe-hhccEEEEeec--ceEEEE-eccccCc-------eEEEEEEEe-ccEEeccc Confidence 122333332221 1 12333334432 10000000000 011111 1111111 011122343 55566799 Q ss_pred cceeeccC Q lcl|Aclame:pro 381 AVVRLIGL 388 (388) Q Consensus 381 ai~~~~GI 388 (388) ||+.+.-- T Consensus 395 a~~~~~~~ 402 (415) T protein:vir:94 395 SAIVIEYD 402 (415) T ss_pred cEEEEEEe Confidence 99888644 No 71 >protein:vir:81227 Length: 413 # NCBI annotation: gp6, major capsid protein # Family: family:all:585 # MgeID: mge:1893 # MgeName: BFK20 # Cross-refs: genbank:acc:YP_001456736;genbank:gi:157168379;hssp:P49861;interpro:IPR006444;uniprot:Q9MBJ9;genbank:GeneID:5580350 Probab=94.77 E-value=0.0035 Score=34.03 Aligned_cols=332 Identities=10% Similarity=0.021 Sum_probs=143.9 Q ss_pred CCC-----c-ceeeeecC--ccccchhhhh-hcccccccccCCHHHHhhcceecccchhhcchhhhhhhhhhhhccCccc Q lcl|Aclame:pro 1 MKQ-----L-SKVHQSLA--GRSVRAFDMA-NGKADYRLTDMAVRELKKFGLVFDHATVKRQIELLHEGGVATQAFDSAY 71 (388) Q Consensus 1 ~~~-----~-~~~~~~~~--~~~~~~~~~~-~~~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~amDaa~ 71 (388) +.+ . ........ +...+..... .... ..........+..++.... ....+. . ...+.+. T Consensus 51 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~---~~~~~~-~------~~~~~~~ 118 (413) T protein:vir:81 51 LQEATAGSVDSEKSGELTRKGEGYKSIGEFFAKRA--GDQIKQQAGGAQLNYSVGE---YVAPRV-K------AASDPAS 118 (413) T ss_pred HHHHHHhHHhHHHhhhHhhhhhhhhhhhhhhhhhh--hhHHHHHHHHHHhhhhhhh---hhhhHH-H------hhhhhhh Confidence 000 0 00000000 0000000000 0000 0000000001111111000 000000 0 0011111 Q ss_pred c-cccccccchHHHHHHHhhcceeeeecccchhhhhhcccccCCCCceeeEEEeeecc----ccceEecccccCCceeee Q lcl|Aclame:pro 72 V-APTTQASIPTPIQFLQQWLPGFVKVLTSARKIDEILGVKTVGSWEDQEIVQGIVEP----AGTAMEYGDLTNIPLSSW 146 (388) Q Consensus 72 ~-~~~t~~~~g~l~~~l~~idp~v~e~l~~~~~~~~i~~v~t~g~w~~~t~~~~v~e~----~G~a~~ygd~~diP~~~~ 146 (388) . +..+..+..+|.. +.++|++.+......++++++..... .+..|++... .+.+...+....+|-.+. T Consensus 119 ~~~~~~~~~~~vp~~----~~~~ii~~~~~~~~l~~~~~~~~~~~---~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~ 191 (413) T protein:vir:81 119 TATLTDEFQGGYGTT----WNRNIIYRRREKLVVADLMDNLTMTN---TTIKYLMEKANRVVEGGFKTVAEGGKKPYMRF 191 (413) T ss_pred hcccccccccccchh----hHHHHHHHHhhhhhHHhhcceeeccC---CceeEEEeccccccccccceecCcccccccCc Confidence 1 1111222224433 44678888888877888876544332 2333443321 234566777788887774 Q ss_pred -eeeeeeeeEEEEEEEEeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEEeecCccccceEEEeecCCCccccc Q lcl|Aclame:pro 147 -NVNFERRTIVRGEMGIQVGLLEEGRASAMRINSAEVKRQGAAVQLEIMRNAIGFYGWEGKNGNRTFGFLNDPSLLPAIA 225 (388) Q Consensus 147 -n~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~K~~aAr~a~~~~~n~i~~~G~a~~~~~g~~GllN~P~l~a~~~ 225 (388) ........++.++..+.++.+=|+.+. .|.+--....+.++...+|+-.++|+.. .....|++|.+++.+.. T Consensus 192 ~~f~~i~~~~~k~~~~~~iS~ell~ds~----~l~~~i~~~la~~~~~~~d~~~l~G~G~--~~~~~Gi~~~~~~~~~~- 264 (413) T protein:vir:81 192 ADFDIVTESLSKIAGLTKITDEMIEDYD----FLVSYINARLLEELAIEEERQLLLGDGT--GNNLTGLLKRDGIQTLA- 264 (413) T ss_pred ccceeeEeeeeeEEEeehhhHHHHHHHH----HHHHHHHHHHHHHHHHHHHHHHhccCCC--CCccccccccccccccc- Confidence 678888888999888889986444332 2677777777788888888888999743 23467999988764221 Q ss_pred cccCCcccccccCCHHHHHHHHHHHHHHHHHhcCCeeccccccceEEcCHHHHHhhcc-CCCcCccHHHH-HHHh--CC- Q lcl|Aclame:pro 226 STTPGGWVSGGANAFQGIVGDLRLMLITLRVQSEDNIDPEDVDITLVLPMNKVDMLSV-VTDLGISVRDW-LKQT--YP- 300 (388) Q Consensus 226 ~~~~~~~t~Wa~kT~~eI~~DI~~~~~~l~~~s~g~v~~~~~p~tL~Lp~~~~~~Ls~-~~~~~~Tvl~~-lk~n--~p- 300 (388) ..+.+.+++||..++..+....+ ..++.++|.++.+..|.+ .+..|.-++.- +... .+ T Consensus 265 -----------~~~~~~~~~~i~~~~~~~~~~~~------~~~~~~vmn~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~~ 327 (413) T protein:vir:81 265 -----------VSNKDELADSIYKAMTNISLATP------FQADALVINPLDYQELRLAKDANGQYYGGGVFQGQYGSGG 327 (413) T ss_pred -----------ccccchhHHHHHHHHHHhhhhcc------CCCcEEEEcHHHHHHHHHhhccCCceeccccccccccccc Confidence 12233457778777776654433 124568999999888854 23334433211 1100 00 Q ss_pred ---ccEEEEccccccccCCCCccEEEEEEcccccccccccCCCcceEeecch---hhhccCceeccCceEEecccceeee Q lcl|Aclame:pro 301 ---RVRVMSAPELQGGNPDDGKDIAYMFLDSVDTAVDGSTDGGDTWAQLVQS---KFVTLGVEKRVKNYVEAYSNATAGV 374 (388) Q Consensus 301 ---nl~i~~~pel~~a~gtg~~~~~~~~~~~~d~~~~~~~~~~~t~~~~~p~---~~r~~~v~~~~~~~~~~~~~~t~G~ 374 (388) .-++-..|-..... .-.+.. +| -+............-+++. .+. .|+.. ....-+..|.+ + T Consensus 328 ~~~~~~l~G~pv~~s~~-~~~~~~--~~-gd~~~~~~~~~~~~~~v~~-~~~~~~~~~~~-------~~~~r~~~r~d-~ 394 (413) T protein:vir:81 328 IMLDPAPWGLRTVQSQV-VPVGKP--VV-GAFRSAASVLRKGGVRIDS-TNTNVDDFENN-------LITVRAEERVG-L 394 (413) T ss_pred cccCceecceeeEEcCC-CCcccE--EE-EecccEEEEEEecceEEEE-eccccchhhcC-------cEEEEEEEeec-c Confidence 00122222111100 001111 11 1100000000000000100 010 01111 12233344444 4 Q ss_pred eeeccccceeeccC Q lcl|Aclame:pro 375 MLKRPWAVVRLIGL 388 (388) Q Consensus 375 ii~rP~ai~~~~GI 388 (388) .+++|-||+.++.= T Consensus 395 ~~~~~~a~~~l~~~ 408 (413) T protein:vir:81 395 MVTFPEAIVQLDVA 408 (413) T ss_pred EEecccceEEEEec Confidence 55779999988766 No 72 >protein:vir:81070 Length: 390 # NCBI annotation: p09 # Family: family:all:585 # MgeID: mge:1889 # MgeName: Xop411 # Cross-refs: genbank:acc:YP_001285679;genbank:gi:148727187;genbank:GeneID:5247115 Probab=94.75 E-value=0.0013 Score=36.46 Aligned_cols=326 Identities=13% Similarity=0.034 Sum_probs=143.2 Q ss_pred CC----CcceeeeecCccccchhhhhhcccccccccCCHHHHhhcceecccchhhcchhhhhhhhhhhhccCcccccccc Q lcl|Aclame:pro 1 MK----QLSKVHQSLAGRSVRAFDMANGKADYRLTDMAVRELKKFGLVFDHATVKRQIELLHEGGVATQAFDSAYVAPTT 76 (388) Q Consensus 1 ~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~amDaa~~~~~t 76 (388) +. +.........+....+...... ... ......+...+-... .... .... .+...+.. .+ T Consensus 54 i~~~e~~~~~~~~~~~~~~~~~~~~~~~--~~~--~~~~~~~~~~~~~~~---~~~~------~~~~-~~~~~~~~--~~ 117 (390) T protein:vir:81 54 VQAARQRVAELEGNGAGGDVQHVSVGDM--FVA--SEQFQASAGRWNDRS---ARAT------MNIK-AALNTAST--DA 117 (390) T ss_pred HHHHHHHHHHHHhcccccccccccchhh--hhh--hHHHHHHHHHHhhhh---hhhh------hHHH-HHHHhhcc--cc Confidence 00 0000000000000000000000 000 000011111000000 0000 0000 00011111 12 Q ss_pred cccchHHHHHHHhhcceeeeecccchhhhhhcccccCCCCceeeEEEeeecc-ccceEecccccCCceeeeeeeeeeeeE Q lcl|Aclame:pro 77 QASIPTPIQFLQQWLPGFVKVLTSARKIDEILGVKTVGSWEDQEIVQGIVEP-AGTAMEYGDLTNIPLSSWNVNFERRTI 155 (388) Q Consensus 77 ~~~~g~l~~~l~~idp~v~e~l~~~~~~~~i~~v~t~g~w~~~t~~~~v~e~-~G~a~~ygd~~diP~~~~n~~~~~~~v 155 (388) .++.|.++.. + +.+++++.+......++++++...+. ....+++... .+.+...+....+|-.+.........+ T Consensus 118 ~~~~g~~~~~-~-~~~~ii~~~~~~~~l~~~~~~~~~~~---~~~~~~~~~~~~~~a~~v~Eg~~~~~~~~~~~~i~~~~ 192 (390) T protein:vir:81 118 AGSAGALTTP-N-RLPGFITPPDARLTVRDLIGSGRTDS---ALIEYVQETGFVNNAAIVAEGALKPESSLKFAKKTDTT 192 (390) T ss_pred ccCCcceech-h-hhHHHHHHHhhhhhhhhhcceeeccC---CceEEEEEecCCcceeeecCCcccccccceeeEEEEee Confidence 2333322211 1 12457777766666677766544332 2344555443 456677788889999999999999999 Q ss_pred EEEEEEEeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEEeecCccccceEEEeecCCCccccccccCCccccc Q lcl|Aclame:pro 156 VRGEMGIQVGLLEEGRASAMRINSAEVKRQGAAVQLEIMRNAIGFYGWEGKNGNRTFGFLNDPSLLPAIASTTPGGWVSG 235 (388) Q Consensus 156 ~~~~~~~~y~~~El~~A~~~g~~l~~~K~~aAr~a~~~~~n~i~~~G~a~~~~~g~~GllN~P~l~a~~~~~~~~~~t~W 235 (388) +.++..+.++.+=++.+ .++.+.-....+.++...+|+-.++|+.. .....|++|.++...... T Consensus 193 ~k~~~~~~is~ell~d~----~~~~~~i~~~l~~~~~~~~d~a~l~G~g~--~~~~~Gi~~~~~~~~~~~---------- 256 (390) T protein:vir:81 193 HVIAHTMKATRQILSDA----PQLASYMNNRLIRGLKVKEDAEILRGTGA--NDGLLGLIPQATTYAAPT---------- 256 (390) T ss_pred eEEEEeehhhHHHHHhH----HHHHHHHHHHHHHHHHHHHHHHHHhcCCC--CCcccceeeccccccccc---------- Confidence 99999999988533322 25778888888888888888888999753 235889999876532211 Q ss_pred ccCCHHHHHHHHHHHHHHHHHhcCCeeccccccceEEcCHHHHHhhccC-CCcCccHHHHHHHhC-C---ccEEEEcccc Q lcl|Aclame:pro 236 GANAFQGIVGDLRLMLITLRVQSEDNIDPEDVDITLVLPMNKVDMLSVV-TDLGISVRDWLKQTY-P---RVRVMSAPEL 310 (388) Q Consensus 236 a~kT~~eI~~DI~~~~~~l~~~s~g~v~~~~~p~tL~Lp~~~~~~Ls~~-~~~~~Tvl~~lk~n~-p---nl~i~~~pel 310 (388) ..+....++||..++..+...- ..+..++|.|..+..|.+. +..|.-++.-..... + ++.++..+.+ T Consensus 257 -~~~~~~~~~~~~~~~~~~~~~~-------~~~~~~v~~~~~~~~l~~lkd~~G~~l~~~~~~~~~~~l~G~pv~~~~~~ 328 (390) T protein:vir:81 257 -TIAGATRVDQLRLAMLQASLAE-------YNPSGIVINPIDWAAIELAKDANNQYLIGNARGTLTPTLWGLPVVATQAM 328 (390) T ss_pred -ccccchhHHHHHHHHHhhcccc-------CCCCEEEEcHHHHHHHHHhhcCCCceeecCcccccCceecceeeEEcCCC Confidence 1112223567777777665432 2345799999999888653 444443332111111 1 1222222222 Q ss_pred ccccCCCCccEEEEEEcccccccccccCCCcceEee-cchhhhccCceeccCceEEecccceeeeeeeccccceeeccC Q lcl|Aclame:pro 311 QGGNPDDGKDIAYMFLDSVDTAVDGSTDGGDTWAQL-VQSKFVTLGVEKRVKNYVEAYSNATAGVMLKRPWAVVRLIGL 388 (388) Q Consensus 311 ~~a~gtg~~~~~~~~~~~~d~~~~~~~~~~~t~~~~-~p~~~r~~~v~~~~~~~~~~~~~~t~G~ii~rP~ai~~~~GI 388 (388) . .+. ++ | -+............-++... .+..|+.. ....-+..|.+| .+++|-||++.+== T Consensus 329 p------~~~-~~-~-gd~~~~~~~~~~~~~~v~~~~~~~~~~~~-------~v~~r~~~r~d~-~v~~~~a~v~~t~a 390 (390) T protein:vir:81 329 A------PGE-FL-V-GAFDLAAQIFDQWDARVEIGYVGEDFQRN-------MITVLAEERLAL-VVYRPEALISGSFA 390 (390) T ss_pred C------CCc-EE-E-EehhceEEEEEecceEEEEecccchhhcC-------cEEEEEEEeecc-EEecccceEEEEeC Confidence 1 111 11 1 11000000000000000000 00111111 112233445544 55568887665311 No 73 >protein:vir:4339 Length: 395 # NCBI annotation: major head protein # Family: family:all:585 # MgeID: mge:93 # MgeName: D3 # Cross-refs: genbank:acc:NP_061502;genbank:gi:9635591;genbank:GeneID:1262860 Probab=94.69 E-value=0.0031 Score=34.36 Aligned_cols=315 Identities=12% Similarity=0.034 Sum_probs=148.6 Q ss_pred CCCc------------------------------ceeeeecCccccchhhhhhcccccccccCCHHHHhhcceecccchh Q lcl|Aclame:pro 1 MKQL------------------------------SKVHQSLAGRSVRAFDMANGKADYRLTDMAVRELKKFGLVFDHATV 50 (388) Q Consensus 1 ~~~~------------------------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~g~~~~~~~~ 50 (388) +.+. ............+...... .......++. T Consensus 36 ~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----------~~~~~~~~~~------ 98 (395) T protein:vir:43 36 FGEMNKETRAKVDELLTAQGELQARLSAAEQAMLANEKRDGGEEAPKTAGQMV-----------AESLKEQGVT------ 98 (395) T ss_pred HhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccchhhhHHHHH-----------HHHHHHHHHH------ Confidence 0000 0000000000000000000 0000000100 Q ss_pred hcchhhhhh--hhhhhhccCcccccccccccchHHHHHHHhhcceeeeecccchhhhhhcccccCCCCceeeEEEeeecc Q lcl|Aclame:pro 51 KRQIELLHE--GGVATQAFDSAYVAPTTQASIPTPIQFLQQWLPGFVKVLTSARKIDEILGVKTVGSWEDQEIVQGIVEP 128 (388) Q Consensus 51 ~~~~~~~~~--~~~~~~amDaa~~~~~t~~~~g~l~~~l~~idp~v~e~l~~~~~~~~i~~v~t~g~w~~~t~~~~v~e~ 128 (388) ...... ......++. .+..+.|.++... +.++|++.+......++++++.+.+. .++.|++... T Consensus 99 ---~~~~~~~~~~~~~~~~~------~~~~~~g~~vp~~--~~~~ii~~~~~~~~l~~l~~~~~~~~---~~~~~~~~~~ 164 (395) T protein:vir:43 99 ---SSLRGSHRVSMPRSAIT------SIDGSGGALVAPD--RRPGVVAAPQRRLTIRDLVAPGTTES---NSVEYVRETG 164 (395) T ss_pred ---HHhhhhhhhhhhhhhhc------ccCCCCccccchh--hHHHHHHHHHhhhhHHhhccceecCC---CceEEEEEec Confidence 000000 000001111 1223334332221 23567777777777777777665443 2355666433 Q ss_pred -ccceEecccccCCceeeeeeeeeeeeEEEEEEEEeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEEeecCcc Q lcl|Aclame:pro 129 -AGTAMEYGDLTNIPLSSWNVNFERRTIVRGEMGIQVGLLEEGRASAMRINSAEVKRQGAAVQLEIMRNAIGFYGWEGKN 207 (388) Q Consensus 129 -~G~a~~ygd~~diP~~~~n~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~K~~aAr~a~~~~~n~i~~~G~a~~~ 207 (388) .+.+...+....+|..+...+......+.++..+.++.+=++.+ . ++.+--....+.++...+|+-.++|+.- T Consensus 165 ~~~~a~~v~E~~~~~~~~~~~~~i~~~~~k~~~~~~is~ell~d~---~-~l~~~v~~~la~a~~~~~d~~~l~G~g~-- 238 (395) T protein:vir:43 165 FVNNAAPVSEGTQKPYSDLTFELENAPVRTIAHLFKASRQILDDA---S-ALQSYIDARARYGLMLVEECQLLYGNGT-- 238 (395) T ss_pred CCCceeeecCCccccccccceeEEEEeeeeEEEeehhhHHHHHhH---H-HHHHHHHHHHHHHHHHHHHHHHHhccCC-- Confidence 45677888888999999999999999999999999997544322 2 5777777788888888888888999642 Q ss_pred ccceEEEeecCCCccccccccCCcccccccCCHHHHHHHHHHHHHHHHHhcCCeeccccccceEEcCHHHHHhhcc-CCC Q lcl|Aclame:pro 208 GNRTFGFLNDPSLLPAIASTTPGGWVSGGANAFQGIVGDLRLMLITLRVQSEDNIDPEDVDITLVLPMNKVDMLSV-VTD 286 (388) Q Consensus 208 ~~g~~GllN~P~l~a~~~~~~~~~~t~Wa~kT~~eI~~DI~~~~~~l~~~s~g~v~~~~~p~tL~Lp~~~~~~Ls~-~~~ 286 (388) .....|+++.+++...... ...+.+..++||.+++..+...-. .+..++|.|..+..|.. .+. T Consensus 239 ~~~~~Gi~~~~~~~~~~~~---------~~~~~~~~~~~i~~~~~~~~~~~~-------~~~~~vmn~~~~~~l~~lkd~ 302 (395) T protein:vir:43 239 GANLHGIIPQAQAYAPPSG---------VVVTAEQRIDRIRLAILQAQLAEF-------PASGIVLNPIDWALIELNKDA 302 (395) T ss_pred CCccccccccccccccccc---------cccccchhHHHHHHHHHhhccccC-------CCcEEEEcHHHHHHHHHhhcc Confidence 2346799988765321111 123445568888888877754321 23479999999988854 233 Q ss_pred cCccHHHHHHHh----CCccEEEEccccccccC-CCC-ccEEEEEEcccccccccccCCCcceEeecch---hhhccCce Q lcl|Aclame:pro 287 LGISVRDWLKQT----YPRVRVMSAPELQGGNP-DDG-KDIAYMFLDSVDTAVDGSTDGGDTWAQLVQS---KFVTLGVE 357 (388) Q Consensus 287 ~~~Tvl~~lk~n----~pnl~i~~~pel~~a~g-tg~-~~~~~~~~~~~d~~~~~~~~~~~t~~~~~p~---~~r~~~v~ 357 (388) .|.-++.=.... .-++.|+..+.+..... -|. ++...++.+. .-+++.. +. .|+.. T Consensus 303 ~G~~i~~~~~~~~~~~l~G~pVv~~~~~~~~~~~~gd~~~~~~~~~~~-----------~~~i~~~-~~~~~~f~~~--- 367 (395) T protein:vir:43 303 ENRYIIGSPQNGTTPTLWRLPVVETQAITQDEFLTGAFSLGAQIFDRM-----------DIEVLVS-TENDKDFENN--- 367 (395) T ss_pred CCceeccccccCCCceecceeeEEcCCCCCCcEEEEeccceEEEEEec-----------ceEEEEe-ccccchhhcC--- Confidence 344333211111 11122333322211000 000 0111111110 0011110 00 01111 Q ss_pred eccCceEEecccceeeeeeeccccceeeccC Q lcl|Aclame:pro 358 KRVKNYVEAYSNATAGVMLKRPWAVVRLIGL 388 (388) Q Consensus 358 ~~~~~~~~~~~~~t~G~ii~rP~ai~~~~GI 388 (388) .+..-+..| .|+.+++|-||++++-= T Consensus 368 ----~~~~r~~~r-~d~~v~~~~a~~~~~~t 393 (395) T protein:vir:43 368 ----MVTIRAEER-LAFAVYRPEAFVTGSLT 393 (395) T ss_pred ----cEEEEEEEe-eccEEecccceEEEEec Confidence 111122223 35566779998887544 No 74 >protein:vir:94933 Length: 330 # NCBI annotation: putative phage structural protein # Family: family:all:1120 # MgeID: mge:1538 # MgeName: Xp15 # Cross-refs: genbank:acc:YP_239278;genbank:gi:66392060;genbank:GeneID:5076578 Probab=93.72 E-value=0.0037 Score=33.93 Aligned_cols=298 Identities=13% Similarity=0.104 Sum_probs=127.1 Q ss_pred CCCcceeeeecCccccchhhhhhcccccccccCCHHHHhhcceecccchhhcchhhhhhhhhhhhccCcccccccccccc Q lcl|Aclame:pro 1 MKQLSKVHQSLAGRSVRAFDMANGKADYRLTDMAVRELKKFGLVFDHATVKRQIELLHEGGVATQAFDSAYVAPTTQASI 80 (388) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~amDaa~~~~~t~~~~ 80 (388) |-++..-.-| +| =..+...+|.-.|..++..| . T Consensus 1 ~~~~~~~~~~--~~---~~~~~~~~p~l~m~alTLae------------------------------------------a 33 (330) T protein:vir:94 1 MVRICTPPLR--GR---WRTLTHQFPELKMPTVTLAE------------------------------------------S 33 (330) T ss_pred CceecCCccc--cc---eeehhccccccchhhhhhhH------------------------------------------H Confidence 4332211111 11 01111112222222222222 1 Q ss_pred hHHHHHHHhhcceeeeecccchhhhhhccccc-CCCCceeeEEEeeeccccceE---ecccccC-CceeeeeeeeeeeeE Q lcl|Aclame:pro 81 PTPIQFLQQWLPGFVKVLTSARKIDEILGVKT-VGSWEDQEIVQGIVEPAGTAM---EYGDLTN-IPLSSWNVNFERRTI 155 (388) Q Consensus 81 g~l~~~l~~idp~v~e~l~~~~~~~~i~~v~t-~g~w~~~t~~~~v~e~~G~a~---~ygd~~d-iP~~~~n~~~~~~~v 155 (388) +.+.. ..+..+|+|.+...-...+.+|-+. ++. ...|.....-+.+. .++.+.. .| ....+.+..+ T Consensus 34 ~~l~~--d~~~~~VIE~l~~~s~iL~~lpf~~ve~~----~~~~~r~~~lp~a~~r~~n~~~~~~~~---~Tf~q~t~~l 104 (330) T protein:vir:94 34 AKLSQ--DHLVSGLIETIVEVNPLYEMMPFTEIEGN----ALAYNRENVLGDVQFLAVGGTITAKNP---ATFTKVTSEL 104 (330) T ss_pred hhcCc--hhhHHHHHHhhhccchHHhhcccccccCC----cceeeeeecCCcceeeeccccccccCc---ceeeeeeech Confidence 11111 1123455555555555556655322 122 23333322222222 2222221 12 1122223334 Q ss_pred EEEEEEEeecHHHHHHHHHhC--CChHHHHHHHHHHHHHHhhceEEEEeecCccccceEEEeecCCCccccccccCCccc Q lcl|Aclame:pro 156 VRGEMGIQVGLLEEGRASAMR--INSAEVKRQGAAVQLEIMRNAIGFYGWEGKNGNRTFGFLNDPSLLPAIASTTPGGWV 233 (388) Q Consensus 156 ~~~~~~~~y~~~El~~A~~~g--~~l~~~K~~aAr~a~~~~~n~i~~~G~a~~~~~g~~GllN~P~l~a~~~~~~~~~~t 233 (388) ..++..+++..+ -|...| .++......+-.+++.+++....++||... +++.||++.=.-...+.+-+.++.. T Consensus 105 ~~l~~~~~Vd~~---iadl~g~~~d~~~~q~~~~ieal~~~~e~~linGDs~~--~~F~GL~~~~~~~q~i~tg~~gg~~ 179 (330) T protein:vir:94 105 TTLIGDAEVNGL---IQATRSDFMDQTSVQVASKAKSIGRQYQASMITGDGTG--NSFQGMMGLVAASQTISAGANGGTL 179 (330) T ss_pred hhhhhhHHHHHH---HHHhcCCHHHHHHHHHHHHHHHHHHHHHHHhhccCCCC--ccccchhhcCCcccEEecCCCCCCC Confidence 445554444443 222334 244555555666688888888899998653 4788998642211111111111222 Q ss_pred ccccCCHHHHHHHHHHHHHHHHHhcCCeeccccccceEEcCHHHHHhhc---cC-C----------CcCccHHHHHHHhC Q lcl|Aclame:pro 234 SGGANAFQGIVGDLRLMLITLRVQSEDNIDPEDVDITLVLPMNKVDMLS---VV-T----------DLGISVRDWLKQTY 299 (388) Q Consensus 234 ~Wa~kT~~eI~~DI~~~~~~l~~~s~g~v~~~~~p~tL~Lp~~~~~~Ls---~~-~----------~~~~Tvl~~lk~n~ 299 (388) + ++|+.+|+..+|..-+ .|..|+|+......|. +. + .+|.-|+. | T Consensus 180 T---------~d~LDeLl~~v~~~~g-------~~~~~l~n~a~~r~I~a~~R~~~~~~v~~~~~~~~G~~v~~-----~ 238 (330) T protein:vir:94 180 T---------FELLDQLLDLVKDKDG-------QVDYLMSSFAMRRKYFSLLRALGGAAIGEVMTLPSGRQIPT-----Y 238 (330) T ss_pred C---------HHHHHHHHHHhcCCCC-------CCcEEEechhHHHHHHHHHHhccCCCCCCcccccCCCEEee-----e Confidence 2 6888899888886543 3556887666544442 21 1 22332222 2 Q ss_pred CccEEEEc---cccccccCCCCccEEEEEE--c-ccccccccccCCCcceEeecchhhhccC-cee-ccCceEEecccce Q lcl|Aclame:pro 300 PRVRVMSA---PELQGGNPDDGKDIAYMFL--D-SVDTAVDGSTDGGDTWAQLVQSKFVTLG-VEK-RVKNYVEAYSNAT 371 (388) Q Consensus 300 pnl~i~~~---pel~~a~gtg~~~~~~~~~--~-~~d~~~~~~~~~~~t~~~~~p~~~r~~~-v~~-~~~~~~~~~~~~t 371 (388) -.+-|... |.=++...+++....++.. + +.++-..+.......+.. .|..| ++. ...+|.++. - T Consensus 239 ~GvPi~~~d~ip~~~~~~~~~~ttsIyav~~G~~~~~qgV~Gl~~~g~~gls-----Vr~~G~~~~k~v~~~~v~~---y 310 (330) T protein:vir:94 239 RGVPWFVNDFIPSNMTQGTATNATAIFAGTFDDGSNKYGIAGLTARGSAGLR-----VQNVGAKENADETITRVKM---Y 310 (330) T ss_pred CCeEEEecccccCCCCcccCCCceeEEEEeecccccccceEeecCCCCCcce-----eeeCCCccccceeeEEEEE---e Confidence 23333322 2212221223444444433 1 112222333322221211 23333 111 123455443 4 Q ss_pred eeeeeeccccceeeccC Q lcl|Aclame:pro 372 AGVMLKRPWAVVRLIGL 388 (388) Q Consensus 372 ~G~ii~rP~ai~~~~GI 388 (388) .|+.+.-|.|++.+.|| T Consensus 311 ~~~av~~~~a~~~L~~V 327 (330) T protein:vir:94 311 CGFANFSQLGLAAIKGL 327 (330) T ss_pred eeeEEechhheeeeccc Confidence 68889999999999999 No 75 >protein:vir:94494 Length: 274 # NCBI annotation: ORF015 # Family: family:all:522 # MgeID: mge:1508 # MgeName: 88 # Cross-refs: genbank:acc:YP_240676;genbank:gi:66396348;genbank:GeneID:5133758 Probab=93.57 E-value=0.0071 Score=32.36 Aligned_cols=257 Identities=10% Similarity=-0.028 Sum_probs=124.8 Q ss_pred hccCcccccccccccchHHHHHHHhhcceeeeecccchhhhhhcccccC--CCCceeeEEEeeeccccceEecccccCCc Q lcl|Aclame:pro 65 QAFDSAYVAPTTQASIPTPIQFLQQWLPGFVKVLTSARKIDEILGVKTV--GSWEDQEIVQGIVEPAGTAMEYGDLTNIP 142 (388) Q Consensus 65 ~amDaa~~~~~t~~~~g~l~~~l~~idp~v~e~l~~~~~~~~i~~v~t~--g~w~~~t~~~~v~e~~G~a~~ygd~~diP 142 (388) || ... ...+++=+|-.+..++..++ ...+....|..++.. |.. -.+++++.++..|.+.-|.++++++ T Consensus 1 ma--~~~---T~~~d~iiPev~~~~v~~~~----~~~l~~~~~~~~d~~l~g~~-G~tv~iP~~~~~g~a~~~~~g~~i~ 70 (274) T protein:vir:94 1 MP--QGL---TKTSDQIIPEVLAPMMQAQL----EKKLRFASFAEVDSTLQGQP-GDTLTFPAFVYSGDAQVVAEGEKIP 70 (274) T ss_pred CC--ccc---eehhheechHHHHHHHHHhh----hhhhhhcccceecccccCCC-CCEEEEeeecCCCccccccCCCccc Confidence 22 111 12234446666666655443 333444555444432 221 3588999999999999999999999 Q ss_pred eeeeeeeeeeeeEEEEEEEEeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEEeecCccccceEEEeecCCCcc Q lcl|Aclame:pro 143 LSSWNVNFERRTIVRGEMGIQVGLLEEGRASAMRINSAEVKRQGAAVQLEIMRNAIGFYGWEGKNGNRTFGFLNDPSLLP 222 (388) Q Consensus 143 ~~~~n~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~K~~aAr~a~~~~~n~i~~~G~a~~~~~g~~GllN~P~l~a 222 (388) .-++........+...+-+ |.+.++.+++. +-++..+-...+.+++..++++..+- .++.-.+ T Consensus 71 ~~~lt~~~~~~~i~~~~~~--~~i~D~~~~~~-~~dp~~~~~~~~a~a~a~~vd~~~~~------------~l~~a~~-- 133 (274) T protein:vir:94 71 TDILETKKREAKIRKIAKG--TSITDEALLSG-YGDPQGEQVRQHGLAHANKVDNDVLE------------ALMGAKL-- 133 (274) T ss_pred ccccccceeEEEeeeecce--ecccHHHHHhc-cchHHHHHHHHHHHHHHHHHHHHHHH------------HHhccCc-- Confidence 9999998888888776655 44545555544 44666777777778888877765431 1111000 Q ss_pred ccccccCCcccccccCCHHHHHHHHHHHHHHHHHhcCCeeccccccceEEcCHHHHHhhccC--------CCcCccHHH- Q lcl|Aclame:pro 223 AIASTTPGGWVSGGANAFQGIVGDLRLMLITLRVQSEDNIDPEDVDITLVLPMNKVDMLSVV--------TDLGISVRD- 293 (388) Q Consensus 223 ~~~~~~~~~~t~Wa~kT~~eI~~DI~~~~~~l~~~s~g~v~~~~~p~tL~Lp~~~~~~Ls~~--------~~~~~Tvl~- 293 (388) ... ++++. +++|.++...+-.. ...+..|+++|..+..|.+. +.+|..++. T Consensus 134 -----~~~-----~~~~~---~d~i~dA~~~l~d~-------~~~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~~~~ 193 (274) T protein:vir:94 134 -----TVN-----ADITK---LNGLQSAIDKFNDE-------DLEPMVLFVNPLDAGKLRGDASTNFTRATELGDDIIVK 193 (274) T ss_pred -----ccc-----ccccC---HHHHHHHHHHhhcc-------CCCceEEEeCHHHHHHHHhhhhhhccccCcccccceec Confidence 000 11111 44555555555332 12456899999999988542 112211110 Q ss_pred HHHHhCCccEEEEccccccccCCCCccEEEEEEcccccccccccCCCcceEeecchhhhccCceeccCceEEecccceee Q lcl|Aclame:pro 294 WLKQTYPRVRVMSAPELQGGNPDDGKDIAYMFLDSVDTAVDGSTDGGDTWAQLVQSKFVTLGVEKRVKNYVEAYSNATAG 373 (388) Q Consensus 294 ~lk~n~pnl~i~~~pel~~a~gtg~~~~~~~~~~~~d~~~~~~~~~~~t~~~~~p~~~r~~~v~~~~~~~~~~~~~~t~G 373 (388) =.--++-+++|..-+.+- ....+++.+.- .......+.. .|.-=.++.+.- ...- -..+| T Consensus 194 G~ig~~~G~~Vi~s~~~p-------~~t~~l~~~gA--~~~~~~~~~~-vE~~Rd~~~~~d-~i~~---------~~~y~ 253 (274) T protein:vir:94 194 GAFGEALGAIIVRTNKLE-------AGTAILAKKGA--VKLILKRDFF-LEVARDASTKTT-ALYS---------DKHYV 253 (274) T ss_pred cccceecCeeEEEcCCCC-------cceEEEEeCcc--eEeeecCCce-eccccchhhccc-EEEE---------EEEEE Confidence 000012244554332221 11223332210 0000000000 011001111110 0000 12567 Q ss_pred eeeeccccceeeccC Q lcl|Aclame:pro 374 VMLKRPWAVVRLIGL 388 (388) Q Consensus 374 ~ii~rP~ai~~~~GI 388 (388) +-+.+|-.++.+.-= T Consensus 254 ~~~~~~~~vv~~t~~ 268 (274) T protein:vir:94 254 AYLYDESKAVKITKG 268 (274) T ss_pred EEEEcCCceEEEecC Confidence 777777766665533 No 76 >protein:vir:97433 Length: 274 # NCBI annotation: ORF014 # Family: family:all:522 # MgeID: mge:1676 # MgeName: 92 # Cross-refs: genbank:acc:YP_240749;genbank:gi:66396420;genbank:GeneID:5133789 Probab=93.57 E-value=0.0071 Score=32.36 Aligned_cols=257 Identities=10% Similarity=-0.028 Sum_probs=124.8 Q ss_pred hccCcccccccccccchHHHHHHHhhcceeeeecccchhhhhhcccccC--CCCceeeEEEeeeccccceEecccccCCc Q lcl|Aclame:pro 65 QAFDSAYVAPTTQASIPTPIQFLQQWLPGFVKVLTSARKIDEILGVKTV--GSWEDQEIVQGIVEPAGTAMEYGDLTNIP 142 (388) Q Consensus 65 ~amDaa~~~~~t~~~~g~l~~~l~~idp~v~e~l~~~~~~~~i~~v~t~--g~w~~~t~~~~v~e~~G~a~~ygd~~diP 142 (388) || ... ...+++=+|-.+..++..++ ...+....|..++.. |.. -.+++++.++..|.+.-|.++++++ T Consensus 1 ma--~~~---T~~~d~iiPev~~~~v~~~~----~~~l~~~~~~~~d~~l~g~~-G~tv~iP~~~~~g~a~~~~~g~~i~ 70 (274) T protein:vir:97 1 MP--QGL---TKTSDQIIPEVLAPMMQAQL----EKKLRFASFAEVDSTLQGQP-GDTLTFPAFVYSGDAQVVAEGEKIP 70 (274) T ss_pred CC--ccc---eehhheechHHHHHHHHHhh----hhhhhhcccceecccccCCC-CCEEEEeeecCCCccccccCCCccc Confidence 22 111 12234446666666655443 333444555444432 221 3588999999999999999999999 Q ss_pred eeeeeeeeeeeeEEEEEEEEeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEEeecCccccceEEEeecCCCcc Q lcl|Aclame:pro 143 LSSWNVNFERRTIVRGEMGIQVGLLEEGRASAMRINSAEVKRQGAAVQLEIMRNAIGFYGWEGKNGNRTFGFLNDPSLLP 222 (388) Q Consensus 143 ~~~~n~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~K~~aAr~a~~~~~n~i~~~G~a~~~~~g~~GllN~P~l~a 222 (388) .-++........+...+-+ |.+.++.+++. +-++..+-...+.+++..++++..+- .++.-.+ T Consensus 71 ~~~lt~~~~~~~i~~~~~~--~~i~D~~~~~~-~~dp~~~~~~~~a~a~a~~vd~~~~~------------~l~~a~~-- 133 (274) T protein:vir:97 71 TDILETKKREAKIRKIAKG--TSITDEALLSG-YGDPQGEQVRQHGLAHANKVDNDVLE------------ALMGAKL-- 133 (274) T ss_pred ccccccceeEEEeeeecce--ecccHHHHHhc-cchHHHHHHHHHHHHHHHHHHHHHHH------------HHhccCc-- Confidence 9999998888888776655 44545555544 44666777777778888877765431 1111000 Q ss_pred ccccccCCcccccccCCHHHHHHHHHHHHHHHHHhcCCeeccccccceEEcCHHHHHhhccC--------CCcCccHHH- Q lcl|Aclame:pro 223 AIASTTPGGWVSGGANAFQGIVGDLRLMLITLRVQSEDNIDPEDVDITLVLPMNKVDMLSVV--------TDLGISVRD- 293 (388) Q Consensus 223 ~~~~~~~~~~t~Wa~kT~~eI~~DI~~~~~~l~~~s~g~v~~~~~p~tL~Lp~~~~~~Ls~~--------~~~~~Tvl~- 293 (388) ... ++++. +++|.++...+-.. ...+..|+++|..+..|.+. +.+|..++. T Consensus 134 -----~~~-----~~~~~---~d~i~dA~~~l~d~-------~~~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~~~~ 193 (274) T protein:vir:97 134 -----TVN-----ADITK---LNGLQSAIDKFNDE-------DLEPMVLFVNPLDAGKLRGDASTNFTRATELGDDIIVK 193 (274) T ss_pred -----ccc-----ccccC---HHHHHHHHHHhhcc-------CCCceEEEeCHHHHHHHHhhhhhhccccCcccccceec Confidence 000 11111 44555555555332 12456899999999988542 112211110 Q ss_pred HHHHhCCccEEEEccccccccCCCCccEEEEEEcccccccccccCCCcceEeecchhhhccCceeccCceEEecccceee Q lcl|Aclame:pro 294 WLKQTYPRVRVMSAPELQGGNPDDGKDIAYMFLDSVDTAVDGSTDGGDTWAQLVQSKFVTLGVEKRVKNYVEAYSNATAG 373 (388) Q Consensus 294 ~lk~n~pnl~i~~~pel~~a~gtg~~~~~~~~~~~~d~~~~~~~~~~~t~~~~~p~~~r~~~v~~~~~~~~~~~~~~t~G 373 (388) =.--++-+++|..-+.+- ....+++.+.- .......+.. .|.-=.++.+.- ...- -..+| T Consensus 194 G~ig~~~G~~Vi~s~~~p-------~~t~~l~~~gA--~~~~~~~~~~-vE~~Rd~~~~~d-~i~~---------~~~y~ 253 (274) T protein:vir:97 194 GAFGEALGAIIVRTNKLE-------AGTAILAKKGA--VKLILKRDFF-LEVARDASTKTT-ALYS---------DKHYV 253 (274) T ss_pred cccceecCeeEEEcCCCC-------cceEEEEeCcc--eEeeecCCce-eccccchhhccc-EEEE---------EEEEE Confidence 000012244554332221 11223332210 0000000000 011001111110 0000 12567 Q ss_pred eeeeccccceeeccC Q lcl|Aclame:pro 374 VMLKRPWAVVRLIGL 388 (388) Q Consensus 374 ~ii~rP~ai~~~~GI 388 (388) +-+.+|-.++.+.-= T Consensus 254 ~~~~~~~~vv~~t~~ 268 (274) T protein:vir:97 254 AYLYDESKAVKITKG 268 (274) T ss_pred EEEEcCCceEEEecC Confidence 777777766665533 No 77 >protein:vir:4197 Length: 314 # NCBI annotation: putative structural protein # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:88 # MgeName: psiM100 # Cross-refs: genbank:acc:NP_071822;genbank:gi:11863105;genbank:GeneID:1257607 Probab=93.11 E-value=0.0087 Score=31.87 Aligned_cols=292 Identities=9% Similarity=-0.072 Sum_probs=138.3 Q ss_pred HHHHhhcceecccchhhcchhhhhhhhhhhhccCcccccccccccchHHHH-HHHhhcceeeeecccchhhhhhcccc-c Q lcl|Aclame:pro 35 VRELKKFGLVFDHATVKRQIELLHEGGVATQAFDSAYVAPTTQASIPTPIQ-FLQQWLPGFVKVLTSARKIDEILGVK-T 112 (388) Q Consensus 35 ~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~amDaa~~~~~t~~~~g~l~~-~l~~idp~v~e~l~~~~~~~~i~~v~-t 112 (388) ..+|+| ... .--++... ..+.|+|.- +++ ++++.+....-.+.+..+. + T Consensus 1 ~~~~~~------------------~~~-~~k~it~~------d~~gG~L~P~~~~----~~i~~l~e~s~i~~~a~vi~t 51 (314) T protein:vir:41 1 MDFLNK------------------PFQ-ITPKIDVP------DLGKGILAVQRFG----EFVREVRENSAIIKDARVLNA 51 (314) T ss_pred Cchhhh------------------HHH-hhcccccc------cCCCceeChHHHH----HHHHHHHhccchhhheeeecc Confidence 011222 111 11122211 112333332 122 3555555555555555543 3 Q ss_pred CCCCceeeEEEeeecc----ccceEecccccCCceeeeeeeeeeeeEEEEEEEEeecHHHHHHHHHhCCChHHHHHHHHH Q lcl|Aclame:pro 113 VGSWEDQEIVQGIVEP----AGTAMEYGDLTNIPLSSWNVNFERRTIVRGEMGIQVGLLEEGRASAMRINSAEVKRQGAA 188 (388) Q Consensus 113 ~g~w~~~t~~~~v~e~----~G~a~~ygd~~diP~~~~n~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~K~~aAr 188 (388) .+... ..+..+.. ...+...|+.++.|-.+.......-..+.+..-+.++.+.|+-. +-|.++...-....+ T Consensus 52 ~~s~~---~~i~~i~~g~~~~~~~~~~~~~~~~~~~~~tf~~~~l~~~kl~~~v~is~e~L~D~-a~~~~le~~i~~~~A 127 (314) T protein:vir:41 52 LKSYE---VDISRISLGVELEPGRNTSGTKVAPTADEVTVSTNTLEMKELVTKVVLEDEALEDN-IEQSAFEQTITSLLA 127 (314) T ss_pred cCccc---eeecccccCcccccccccccCCccCCcccccccceeeeeEEEEEeecccHHHHHhh-hchhhHHHHHHHHHH Confidence 33311 11222211 11122345566677777776666777777777777777666544 456789999999999 Q ss_pred HHHHHhhceEEEEeecCcc-----ccceEEEeecCCCccccccccCCcccccccCCHHHHHHHHHHHHHHHHHhcCCeec Q lcl|Aclame:pro 189 VQLEIMRNAIGFYGWEGKN-----GNRTFGFLNDPSLLPAIASTTPGGWVSGGANAFQGIVGDLRLMLITLRVQSEDNID 263 (388) Q Consensus 189 ~a~~~~~n~i~~~G~a~~~-----~~g~~GllN~P~l~a~~~~~~~~~~t~Wa~kT~~eI~~DI~~~~~~l~~~s~g~v~ 263 (388) ..+...+....|.||.... .+...|+|+...... +.. +.-..+.+++.+.|+...+..-+.+.++ T Consensus 128 e~~g~~~~~~~~nGdg~~~s~~~~~~~p~G~l~~a~~~~--~~~-----~~~~~~~~~~~~~~l~~sl~~~yr~~~~--- 197 (314) T protein:vir:41 128 SGVTYDLECFFLHADSSLTTGRELYRINDGWMKLAGNQY--TDA-----EPEDENWPLNLFDGMMDELDTRYLQLKP--- 197 (314) T ss_pred HHHHHHHHHHhhccccCCcCcccchhcchhhhhhcccce--eec-----CccccccHHHHHHHHHHhcCchhhcCCC--- Confidence 9999999999999985321 124557776533211 100 0112234455555544444443434321 Q ss_pred cccccceEEcCHHHHHhhcc-CCCcCccHHHHHHHh-----CCccEEEEccccccccCCCCccEEEEEEccccccccccc Q lcl|Aclame:pro 264 PEDVDITLVLPMNKVDMLSV-VTDLGISVRDWLKQT-----YPRVRVMSAPELQGGNPDDGKDIAYMFLDSVDTAVDGST 337 (388) Q Consensus 264 ~~~~p~tL~Lp~~~~~~Ls~-~~~~~~Tvl~~lk~n-----~pnl~i~~~pel~~a~gtg~~~~~~~~~~~~d~~~~~~~ 337 (388) ....+|++..+..+.+ -.+-+..+++..... +-+..++.+|.+.+. +.....+||.+- .. T Consensus 198 ----~~~~~m~~~t~~~~r~~l~~~~~~l~~~~~~~~~~~~l~G~PV~~~~~~~~~---~~~~~~i~fgd~-~n------ 263 (314) T protein:vir:41 198 ----RMKFYVSNEIYNGYRKQLLVRETGLGDSALIGATGLQYDGIPIQYVPALDAL---GDDKARALLTVP-TN------ 263 (314) T ss_pred ----ceEEEecHHHHHHHHHHHhccCCcccchhhhCCCCceecceeeEeccccccc---CCCCceEEEech-hh------ Confidence 2367888877665532 111122222222222 223457777777654 344555566442 11 Q ss_pred CCCcceEeecchhhhccCce-eccCceEEecccceeeeeeeccccceeeccC Q lcl|Aclame:pro 338 DGGDTWAQLVQSKFVTLGVE-KRVKNYVEAYSNATAGVMLKRPWAVVRLIGL 388 (388) Q Consensus 338 ~~~~t~~~~~p~~~r~~~v~-~~~~~~~~~~~~~t~G~ii~rP~ai~~~~GI 388 (388) +-..+...+|.++-. .+...+.+..+-|+...+.-.+.++..+.+= T Consensus 264 -----lv~~~~~~ir~~~~~~a~~~~~~~~~~~r~d~~~~~~~aa~~~~~~~ 310 (314) T protein:vir:41 264 -----LVYGFWRNIRIEPKRDAAMRRTEYIASLRADCNYEDENAAVAAVIDM 310 (314) T ss_pred -----eEEEeeceeEEeecccCcCCeEEEEEEEEeceEEEEcCcEEEEEeec Confidence 111222223332111 1111233333444544444456776666655 No 78 >protein:vir:80930 Length: 278 # NCBI annotation: Cps # Family: family:all:522 # MgeID: mge:1886 # MgeName: A500 # Cross-refs: genbank:acc:YP_001468392;genbank:gi:157324966;genbank:GeneID:5601363 Probab=93.04 E-value=0.009 Score=31.80 Aligned_cols=263 Identities=12% Similarity=-0.001 Sum_probs=130.3 Q ss_pred hccCcccccccccccchHHHHHHHhhcceeeeecccchhhhhhcccccC--CCCceeeEEEeeeccccceEecccccCCc Q lcl|Aclame:pro 65 QAFDSAYVAPTTQASIPTPIQFLQQWLPGFVKVLTSARKIDEILGVKTV--GSWEDQEIVQGIVEPAGTAMEYGDLTNIP 142 (388) Q Consensus 65 ~amDaa~~~~~t~~~~g~l~~~l~~idp~v~e~l~~~~~~~~i~~v~t~--g~w~~~t~~~~v~e~~G~a~~ygd~~diP 142 (388) ||--. ...+++=+|-.+..++..++.+ -.....+..++.. |.- -.+++++.++..|.+..|.++++++ T Consensus 1 Ma~~~-----T~~~~~iiPev~s~~v~~~~~~----~~v~~~~~~~~~~l~g~~-G~tv~ip~~~~~g~a~~~~~g~~i~ 70 (278) T protein:vir:80 1 MADLT-----TKLANLIDPEVMGPMISAKLPK----AIKFGKIAPIDNSLEGQP-GSEITVPKYKYIGDAQDVAEGAAID 70 (278) T ss_pred CCCcc-----eehhheecHHHHHHHHHHHHHH----hhhhcccceecccccCCC-CCEEEEeeeccCCcceeecCCCcCc Confidence 22101 1223444666666665444432 2222333322221 221 2678899999899999999999999 Q ss_pred eeeeeeeeeeeeEEEEEEEEeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEEeecCccccceEEEeecCCCcc Q lcl|Aclame:pro 143 LSSWNVNFERRTIVRGEMGIQVGLLEEGRASAMRINSAEVKRQGAAVQLEIMRNAIGFYGWEGKNGNRTFGFLNDPSLLP 222 (388) Q Consensus 143 ~~~~n~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~K~~aAr~a~~~~~n~i~~~G~a~~~~~g~~GllN~P~l~a 222 (388) ..+.........+...+-+++++. +.+ ...+.++...-...+..++.+.+++..+-. +.|..+.. T Consensus 71 ~~~lt~~~~~~~i~~~~~a~~v~D--~~~-~~~~~d~~~~~~~~~a~~~a~~~d~~l~~~--------l~~a~~~~---- 135 (278) T protein:vir:80 71 YSALETESVKHGIKKAGKGVKLTD--ESV-LSGYGDPVEEAQKQIRMAIASKVDNDILEE--------ALTTTLEV---- 135 (278) T ss_pred ccccccceeeEeeehhhccccccH--HHH-hhccccHHHHHHHHHHHHHHHHHHHHHHHH--------Hhcccccc---- Confidence 999999888888887765555544 433 345778888889999999999988765421 11211110 Q ss_pred ccccccCCcccccccCCHHHHHHHHHHHHHHHHHhcCCeeccccccceEEcCHHHHHhhccCC--------CcCccHH-H Q lcl|Aclame:pro 223 AIASTTPGGWVSGGANAFQGIVGDLRLMLITLRVQSEDNIDPEDVDITLVLPMNKVDMLSVVT--------DLGISVR-D 293 (388) Q Consensus 223 ~~~~~~~~~~t~Wa~kT~~eI~~DI~~~~~~l~~~s~g~v~~~~~p~tL~Lp~~~~~~Ls~~~--------~~~~Tvl-~ 293 (388) ++ . ....+.+..++.+..+...+-... + + .+..|+++|..+..|.+-+ .++..++ . T Consensus 136 ----~~---~--~t~~~~~~~~~~~~da~~~l~~~~---~-~--~~~~ivv~p~~~~~L~k~~~~~~~~~~~~g~~~~~~ 200 (278) T protein:vir:80 136 ----KG---A--INIGLIDKIENTFTDAPDAIEDES---I-T--TTGVLFLNYKDTAKLREEAAGSWTKASQLGDDLLVK 200 (278) T ss_pred ----cc---c--cccchhhhHHHHHHHHHHhhcccC---C-C--cccEEEECHHHHHHHHhhhhhhccccccccccceee Confidence 00 0 011123334444444444443222 1 1 2335889999998885321 1121110 0 Q ss_pred HHHHhCCccEEEEccccccccCCCCccEEEEEEcccccccccccCCCcceEeecchhhhccCceeccCceEEeccc-cee Q lcl|Aclame:pro 294 WLKQTYPRVRVMSAPELQGGNPDDGKDIAYMFLDSVDTAVDGSTDGGDTWAQLVQSKFVTLGVEKRVKNYVEAYSN-ATA 372 (388) Q Consensus 294 ~lk~n~pnl~i~~~pel~~a~gtg~~~~~~~~~~~~d~~~~~~~~~~~t~~~~~p~~~r~~~v~~~~~~~~~~~~~-~t~ 372 (388) -.--.+-+++|..-..+- ....+++.+.- .......+.. .|.--.++. ....... ..+ T Consensus 201 G~ig~~~G~~Vi~s~~~p-------~~t~~l~~~gA--i~~~~~~~~~-vE~~Rd~~~-----------~~d~i~~~~~y 259 (278) T protein:vir:80 201 GAFGELLGWEIVRTKKLA-------DGNALAVKAGA--LKTFLKRNLL-AESGRDMDH-----------KLTKFNADQHY 259 (278) T ss_pred ccceeecceeEEEcCCCC-------cceEEEEeccc--eeeeecCCcc-cccccchhh-----------ccceeeeeeEE Confidence 000112234544433221 12234443220 0000000000 111111111 1111111 256 Q ss_pred eeeeeccccceeeccC Q lcl|Aclame:pro 373 GVMLKRPWAVVRLIGL 388 (388) Q Consensus 373 G~ii~rP~ai~~~~GI 388 (388) |+-+.+|-+++.+.-- T Consensus 260 g~~v~~~~~~v~it~~ 275 (278) T protein:vir:80 260 AVALVDETKAVKVVPV 275 (278) T ss_pred EEEEEcCcceEEEeec Confidence 8999999999888766 No 79 >protein:vir:96833 Length: 275 # NCBI annotation: ORF015 # Family: family:all:522 # MgeID: mge:1642 # MgeName: EW # Cross-refs: genbank:acc:YP_240157;genbank:gi:66395822;genbank:GeneID:5133174 Probab=92.48 E-value=0.0032 Score=34.24 Aligned_cols=256 Identities=12% Similarity=0.027 Sum_probs=126.6 Q ss_pred hccCcccccccccccchHHHHHHHhhcceeeeecccchhhhhhcccccC--CCCceeeEEEeeeccccceEecccccCCc Q lcl|Aclame:pro 65 QAFDSAYVAPTTQASIPTPIQFLQQWLPGFVKVLTSARKIDEILGVKTV--GSWEDQEIVQGIVEPAGTAMEYGDLTNIP 142 (388) Q Consensus 65 ~amDaa~~~~~t~~~~g~l~~~l~~idp~v~e~l~~~~~~~~i~~v~t~--g~w~~~t~~~~v~e~~G~a~~ygd~~diP 142 (388) |||=+ . ...+++=+|-.+..++..++- .-.....|..+++. |.- -.+++++.++..|.+..|.++++++ T Consensus 1 ~~~~~-~---T~l~d~i~PEv~~~~v~~~~~----~~~~~~~~~~~~~~l~g~~-G~tv~iP~~~~ig~a~~~~~g~~i~ 71 (275) T protein:vir:96 1 MALEN-M---TKLANMVNPEVLAPMMQAELD----KKLKFAQFADIDNTLVGQP-GNTITFPAFVYSGDAKVVPEGEEIP 71 (275) T ss_pred CCCcc-c---chhhhhhchHHHHHHHHHHHH----HhhhhcccceecccccCCC-CCEEEeeeeccCCccccccCCCCcc Confidence 44332 1 123345456666666554442 33333444433332 221 3678999999999999999999999 Q ss_pred eeeeeeeeeeeeEEEEEEEEeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEEeecCccccceEEEeecCCCcc Q lcl|Aclame:pro 143 LSSWNVNFERRTIVRGEMGIQVGLLEEGRASAMRINSAEVKRQGAAVQLEIMRNAIGFYGWEGKNGNRTFGFLNDPSLLP 222 (388) Q Consensus 143 ~~~~n~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~K~~aAr~a~~~~~n~i~~~G~a~~~~~g~~GllN~P~l~a 222 (388) ..++........+...+-+++++. +...+ .+-++..+-...+..++...+++..+ .. ++.-.. T Consensus 72 ~~~lt~~~~~~~i~~~~~~~~i~D--~~~~~-~~~d~~~~~~~~~a~~~a~~~d~~ll-~~-----------l~~a~~-- 134 (275) T protein:vir:96 72 IDLIETKKRQATIRKIGKGTVLTD--EALLS-GYGDPKGEAVRQHGLAIANKVDNDVL-EA-----------LQGATL-- 134 (275) T ss_pred hhhcccceeeEEeehhcccccccH--HHHHh-hccchHHHHHHHHHHHHHHHHHHHHH-HH-----------Hhcccc-- Confidence 999999888888887766655555 43333 34466666677777777777776543 11 111000 Q ss_pred ccccccCCcccccccCCHHHHHHHHHHHHHHHHHhcCCeeccccccceEEcCHHHHHhhccCC--------CcCccHHH- Q lcl|Aclame:pro 223 AIASTTPGGWVSGGANAFQGIVGDLRLMLITLRVQSEDNIDPEDVDITLVLPMNKVDMLSVVT--------DLGISVRD- 293 (388) Q Consensus 223 ~~~~~~~~~~t~Wa~kT~~eI~~DI~~~~~~l~~~s~g~v~~~~~p~tL~Lp~~~~~~Ls~~~--------~~~~Tvl~- 293 (388) + .... .-+ ++.|..++..+-.. ...+..|+++|..+..|.+.. ..|..++. T Consensus 135 ----~-~~~~----~~~----~d~i~dA~~~lgd~-------~~~~~~ivv~p~~~~~L~k~~~~~f~~~~~~g~~~~~~ 194 (275) T protein:vir:96 135 ----K-VEAD----ITK----LAGLQTAIDKFNDE-------DLEPMVLFVNPLDAGKLRASATDNFTRATLLGDNVIVK 194 (275) T ss_pred ----c-cccc----ccC----HHHHHHHHHHhccc-------cCCccEEEeCHHHHHHHHhcccccccccccccccceec Confidence 0 0000 112 34444455444221 234678999999999884421 11111100 Q ss_pred HHHHhCCccEEEEccccccccCCCCccEEEEEEcccccccccccCCCc-ceEeecchhhhccCceeccCceEEeccc-ce Q lcl|Aclame:pro 294 WLKQTYPRVRVMSAPELQGGNPDDGKDIAYMFLDSVDTAVDGSTDGGD-TWAQLVQSKFVTLGVEKRVKNYVEAYSN-AT 371 (388) Q Consensus 294 ~lk~n~pnl~i~~~pel~~a~gtg~~~~~~~~~~~~d~~~~~~~~~~~-t~~~~~p~~~r~~~v~~~~~~~~~~~~~-~t 371 (388) =.--.+-+++|+.-..+. ....+++.+.- . .....+. .. |..| ..+...-.... .. T Consensus 195 G~ig~~~G~~Vi~s~~~p-------~~t~~i~~~gA--~--~~~~~~~~~v-----E~~R------d~~~~~d~i~~~~~ 252 (275) T protein:vir:96 195 GAFGEALGAIIVRSNKIK-------EGEAILAKRGA--V--KLITKRDFFL-----ETER------HASHKSTALFSDKH 252 (275) T ss_pred cccceecCeeEEEeCCCC-------cceEEEEeccc--e--eeeecCCccc-----cccc------chhhcCcEEEEeEE Confidence 000112244554433221 11223442210 0 0000000 00 1111 11111111112 35 Q ss_pred eeeeeeccccceee------ccC Q lcl|Aclame:pro 372 AGVMLKRPWAVVRL------IGL 388 (388) Q Consensus 372 ~G~ii~rP~ai~~~------~GI 388 (388) +|+-+++|-.++.+ +|+ T Consensus 253 y~~~~~~~~~vv~~t~~~~~~~~ 275 (275) T protein:vir:96 253 YVAYLYDESKVVKITKSASGLGV 275 (275) T ss_pred EEEEEEcCccEEEEEecccccCC Confidence 68888888877774 344 No 80 >protein:vir:93742 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1475 # MgeName: 55 # Cross-refs: genbank:acc:YP_240459;genbank:gi:66396126;genbank:GeneID:5133511 Probab=92.39 E-value=0.012 Score=31.20 Aligned_cols=257 Identities=9% Similarity=-0.040 Sum_probs=126.5 Q ss_pred hccCcccccccccccchHHHHHHHhhcceeeeecccchhhhhhcccccCCCC-ceeeEEEeeeccccceEecccccCCce Q lcl|Aclame:pro 65 QAFDSAYVAPTTQASIPTPIQFLQQWLPGFVKVLTSARKIDEILGVKTVGSW-EDQEIVQGIVEPAGTAMEYGDLTNIPL 143 (388) Q Consensus 65 ~amDaa~~~~~t~~~~g~l~~~l~~idp~v~e~l~~~~~~~~i~~v~t~g~w-~~~t~~~~v~e~~G~a~~ygd~~diP~ 143 (388) || ... ...+++=+|-.+..++..++ ...+....+..++....- .-.+++++.++..|.+..|.++++++. T Consensus 1 ma--~~~---T~~~~~iiPev~~~~v~~~~----~~~~~~~~~~~~~~~l~g~~G~tv~ip~~~~~g~~~~~~eg~~i~~ 71 (274) T protein:vir:93 1 MP--QGI---TKTSNQIIPEVLAPMMQAQL----EKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEKIPT 71 (274) T ss_pred CC--ccc---eehhheechHHHHHHHHHHH----HhhhhhcccccccccccCCCCCEEEEEeeccCCCcccccCCCcccc Confidence 22 111 12234446766666655443 233334444444332111 124789999998999999999999999 Q ss_pred eeeeeeeeeeeEEEEEEEEeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEEeecCccccceEEEeecCCCccc Q lcl|Aclame:pro 144 SSWNVNFERRTIVRGEMGIQVGLLEEGRASAMRINSAEVKRQGAAVQLEIMRNAIGFYGWEGKNGNRTFGFLNDPSLLPA 223 (388) Q Consensus 144 ~~~n~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~K~~aAr~a~~~~~n~i~~~G~a~~~~~g~~GllN~P~l~a~ 223 (388) .++........+...+-++.+.. +.+++ .+.++..+-...+.+++..++++..+-. ++.-. T Consensus 72 ~~it~~~~~~~i~~~~~~~~i~D--~~~~~-~~~d~~~~~~~~~~~~~a~~~d~~~~~~------------~~~a~---- 132 (274) T protein:vir:93 72 DILETKKREAKIRKIAKGTSITD--EALLS-GYGDPQGEQVRQHGLAHANKVDNDVLEA------------LMGAK---- 132 (274) T ss_pred cccccceeEEEeeeecccccccH--HHHHh-hccchHHHHHHHHHHHHHHHHHHHHHHH------------Hhccc---- Confidence 99999998888887765555555 44443 3456777777778888888887654311 11000 Q ss_pred cccccCCcccccccCCHHHHHHHHHHHHHHHHHhcCCeeccccccceEEcCHHHHHhhccCC--------CcCccHH-HH Q lcl|Aclame:pro 224 IASTTPGGWVSGGANAFQGIVGDLRLMLITLRVQSEDNIDPEDVDITLVLPMNKVDMLSVVT--------DLGISVR-DW 294 (388) Q Consensus 224 ~~~~~~~~~t~Wa~kT~~eI~~DI~~~~~~l~~~s~g~v~~~~~p~tL~Lp~~~~~~Ls~~~--------~~~~Tvl-~~ 294 (388) .+.. + ...+ +++|.+++..+-.. ...+..|+++|..+..|.+.+ ..|..++ .= T Consensus 133 --~~~~-~----~~~~----~d~i~dA~~~l~d~-------~~~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~~~~G 194 (274) T protein:vir:93 133 --LTVN-A----DITK----LNGLQSAIDKFNDE-------DLEPMVLFINPLDAGKLRGDASTNFTRATELGDDIIVKG 194 (274) T ss_pred --cccc-c----cccC----HHHHHHHHHHhhhc-------cCCccEEEeCHHHHHHHHhhhhhcccccccccccceeec Confidence 0000 0 0112 44555555554332 234668999999999885421 1111111 00 Q ss_pred HHHhCCccEEEEccccccccCCCCccEEEEEEcccccccccccCCCcceEeecchhhhccCceeccCceEEeccc-ceee Q lcl|Aclame:pro 295 LKQTYPRVRVMSAPELQGGNPDDGKDIAYMFLDSVDTAVDGSTDGGDTWAQLVQSKFVTLGVEKRVKNYVEAYSN-ATAG 373 (388) Q Consensus 295 lk~n~pnl~i~~~pel~~a~gtg~~~~~~~~~~~~d~~~~~~~~~~~t~~~~~p~~~r~~~v~~~~~~~~~~~~~-~t~G 373 (388) .--.+-+++|...+.+- ....+++.+.- .......+. ..|..-.++.+. ..... ..+| T Consensus 195 ~ig~~~G~~Vi~s~~~p-------~~t~~l~~~ga--i~~~~~~~~-~vE~~Rd~~~~~-----------d~i~~~~~y~ 253 (274) T protein:vir:93 195 AFGEALGAIIVRTNKLE-------AGTAILAKKGA--VKLILKRDF-FLEVARDASTKT-----------TALYSDKHYV 253 (274) T ss_pred ccceecCeeEEEcCCCC-------cceEEEEeCCe--EEEEecCCc-ccccccchhhcc-----------cEEEEEEEEE Confidence 00112245555433221 11223332210 000000000 011111111111 11111 2567 Q ss_pred eeeeccccceeeccC Q lcl|Aclame:pro 374 VMLKRPWAVVRLIGL 388 (388) Q Consensus 374 ~ii~rP~ai~~~~GI 388 (388) +-+.+|-+++.+.-= T Consensus 254 ~~~~~~~~~v~~t~~ 268 (274) T protein:vir:93 254 AYLYDESKAVKITKG 268 (274) T ss_pred EEEEcCCceEEEeeC Confidence 777777777766533 No 81 >protein:vir:3613 Length: 272 # NCBI annotation: MHP # Family: family:all:522 # MgeID: mge:74 # MgeName: TP901-1 # Cross-refs: genbank:acc:NP_112699;genbank:gi:13786567;genbank:GeneID:921035 Probab=92.28 E-value=0.012 Score=31.10 Aligned_cols=259 Identities=11% Similarity=0.033 Sum_probs=127.0 Q ss_pred hccCcccccccccccchHHHHHHHhhcceeeeecccchhhhhhcccccC--CCCceeeEEEeeeccccceEecccccCCc Q lcl|Aclame:pro 65 QAFDSAYVAPTTQASIPTPIQFLQQWLPGFVKVLTSARKIDEILGVKTV--GSWEDQEIVQGIVEPAGTAMEYGDLTNIP 142 (388) Q Consensus 65 ~amDaa~~~~~t~~~~g~l~~~l~~idp~v~e~l~~~~~~~~i~~v~t~--g~w~~~t~~~~v~e~~G~a~~ygd~~diP 142 (388) ||-- -.+.+++=+|--+..++..++ ..-.....+..++.. |.- =.+++++.++..|.+..+++.++++ T Consensus 1 ma~~-----~T~~~d~iiPev~~~~v~~~~----~~~~~~~~~~~~~~~l~g~~-G~ti~iP~~~~~gda~~~~eg~~i~ 70 (272) T protein:vir:36 1 MSKQ-----KTTLADLVNPEVLAPIVSYEL----NKALRFAPLAQVDTTLQGQP-GNTLKFPAFTYIGDAADVAEGGEIS 70 (272) T ss_pred CCCc-----ceehhhhhchHHHHHHHHHHH----HhhhhhccccccccccccCC-CCEEEEeeeccCccccccCCCCccC Confidence 2210 112234435655555543333 233333444444332 211 3578999999999999999999999 Q ss_pred eeeeeeeeeeeeEEEEEEEEeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEEeecCccccceEEEeecCCCcc Q lcl|Aclame:pro 143 LSSWNVNFERRTIVRGEMGIQVGLLEEGRASAMRINSAEVKRQGAAVQLEIMRNAIGFYGWEGKNGNRTFGFLNDPSLLP 222 (388) Q Consensus 143 ~~~~n~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~K~~aAr~a~~~~~n~i~~~G~a~~~~~g~~GllN~P~l~a 222 (388) ..+++.....-.+...+-++.+ .++.+++ ++-++..+-...+..++...+++..+ . ...| .+ T Consensus 71 ~~~lt~~~~~~~i~~~~k~~~v--tD~~~~~-~~~d~~~~~~~~~a~~~a~~~d~~i~-~-------~l~~---~~---- 132 (272) T protein:vir:36 71 LDKIGTTTKSVTIKKAAKGTEI--TDEAALS-GYGDPIGESNKQLGLSLANKVDDDLL-S-------AAKT---TS---- 132 (272) T ss_pred hhhcCCcceeEeeehhhccccc--cHHHHhh-ccchHHHHHHHHHHHHHHHHHHHHHH-H-------Hhcc---cc---- Confidence 9999999888888877655444 4554444 44566666666677777777665332 1 0111 00 Q ss_pred ccccccCCcccccccCCHHHHHHHHHHHHHHHHHhcCCeeccccccceEEcCHHHHHhhccCCCc-------CccHH-HH Q lcl|Aclame:pro 223 AIASTTPGGWVSGGANAFQGIVGDLRLMLITLRVQSEDNIDPEDVDITLVLPMNKVDMLSVVTDL-------GISVR-DW 294 (388) Q Consensus 223 ~~~~~~~~~~t~Wa~kT~~eI~~DI~~~~~~l~~~s~g~v~~~~~p~tL~Lp~~~~~~Ls~~~~~-------~~Tvl-~~ 294 (388) ... -...+ +++|..++..+-..- ..+..++++|..+..|.+-..+ +..++ .- T Consensus 133 ----~~~-----~~~~~----~d~i~~A~~~lgd~~-------~~~~~ivv~p~~~~~L~k~~~~~~~~~~~~~~~~~~G 192 (272) T protein:vir:36 133 ----QTV-----STKAN----VDGVQAALDIFNDED-------AQAYVLIVNPKDAAKIRKDANAKNIGSEVGANALING 192 (272) T ss_pred ----ccc-----ccccc----HHHHHHHHHHhhhcC-------CCceEEEEcHHHHHHHhcccccccccccccccceeee Confidence 000 01122 345555555543321 2356899999999998642211 11110 00 Q ss_pred HHHhCCccEEEEccccccccCCCCccEEEEEEcccccccccccCCCcceEeecchhhhccCceeccCceEEecc-cceee Q lcl|Aclame:pro 295 LKQTYPRVRVMSAPELQGGNPDDGKDIAYMFLDSVDTAVDGSTDGGDTWAQLVQSKFVTLGVEKRVKNYVEAYS-NATAG 373 (388) Q Consensus 295 lk~n~pnl~i~~~pel~~a~gtg~~~~~~~~~~~~d~~~~~~~~~~~t~~~~~p~~~r~~~v~~~~~~~~~~~~-~~t~G 373 (388) .--.+-+++|+.-..+-. +++ ....+++.+.- .......+.. . |..| ....+..... -..+| T Consensus 193 ~ig~~~G~~Vv~s~~~p~--~~~-~~~~~~~~~gA--~~~~~~~~~~-v-----E~~R------~~~~~~d~i~~~~~y~ 255 (272) T protein:vir:36 193 TYADVLGAQIVRSKKLAE--GSA-LMFKIVSNSPA--LKLVLKRGVQ-V-----ETDR------DIVTKTTVITADEHYA 255 (272) T ss_pred ccceecCeeEEEeCCCCC--Cce-eEEEEEecccc--eeeeecCCcc-c-----cccc------chhhcCcEEEEEEEEE Confidence 001233566654433321 111 11122222210 0000000000 0 1111 1111111111 13689 Q ss_pred eeeeccccceee--ccC Q lcl|Aclame:pro 374 VMLKRPWAVVRL--IGL 388 (388) Q Consensus 374 ~ii~rP~ai~~~--~GI 388 (388) +-+.+|-+++.+ .|+ T Consensus 256 ~~v~~~~~vv~~t~~g~ 272 (272) T protein:vir:36 256 AYLYDLTKVVNITFTGV 272 (272) T ss_pred EEEEcCccEEEEeecCC Confidence 999999987765 688 No 82 >protein:vir:4159 Length: 315 # NCBI annotation: structural protein # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:87 # MgeName: psiM2 # Cross-refs: genbank:acc:NP_046968;genbank:gi:9630538;genbank:GeneID:1261712 Probab=91.32 E-value=0.016 Score=30.37 Aligned_cols=298 Identities=11% Similarity=-0.047 Sum_probs=132.3 Q ss_pred hhcchhhhhh-hhhhhhccCcccccccccccchH--HHHHHHhhcceeeeecccchhhhhhcccc-cCCCCceee--EEE Q lcl|Aclame:pro 50 VKRQIELLHE-GGVATQAFDSAYVAPTTQASIPT--PIQFLQQWLPGFVKVLTSARKIDEILGVK-TVGSWEDQE--IVQ 123 (388) Q Consensus 50 ~~~~~~~~~~-~~~~~~amDaa~~~~~t~~~~g~--l~~~l~~idp~v~e~l~~~~~~~~i~~v~-t~g~w~~~t--~~~ 123 (388) ....++.+.. ..-..-++.. +..+.|+ |.+ ++. +++.+....-.+++..+. +.+++.-+. ..+ T Consensus 1 ~~~~~~~~~~~~~~~~k~~t~------~d~~Gg~l~P~~-~~~----~i~~~~e~s~~l~~~~vi~~~~~~~~~i~~~g~ 69 (315) T protein:vir:41 1 MLTIEDIRGGKPFEIVPKIDV------PDLGRGVLSVDR-FGE----FVKAVRDSAVIIPEARIDNALKSYEKDISRLSL 69 (315) T ss_pred CcccchhhcCChhhhhhhcCC------cCCCCceechHH-HHH----HHHHHHhhhhhhhhceeeecccccccccccccc Confidence 0011111100 0000112221 1123333 322 222 223222222234443332 122222111 111 Q ss_pred eeeccccceEecccccCCceeeeeeeeeeeeEEEEEEEEeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEEee Q lcl|Aclame:pro 124 GIVEPAGTAMEYGDLTNIPLSSWNVNFERRTIVRGEMGIQVGLLEEGRASAMRINSAEVKRQGAAVQLEIMRNAIGFYGW 203 (388) Q Consensus 124 ~v~e~~G~a~~ygd~~diP~~~~n~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~K~~aAr~a~~~~~n~i~~~G~ 203 (388) ...-..| ....|+....|-.+..........+.+..-...+.+.|+ -.+-+.++.+........++...++...|.|| T Consensus 70 ~~~~~~g-~~~~~~~~~~~~~~~~f~~~~l~~~~l~~~~~it~elL~-D~~~~~~~e~~l~~~~a~~~a~~~~~~~~nGd 147 (315) T protein:vir:41 70 VLDVGPG-RDETGQKLAPPESTAEVKTNTLYMREMVTKVVIHEDAIE-DNIEGKAFEQKIVTLLGEGISYVLEKYYLHGD 147 (315) T ss_pred Ccccccc-cccccCcCCCCCCccccceeeeceeeeeeeccccHHHHH-hhhccccHHHHHHHHHHHHHHHHHHHHhhccC Confidence 1100011 112344445555555555555555555555566665554 34457899999999999999999999999997 Q ss_pred cCc--c-ccceEEEeecCCCccccccccCCccccccc-CCHHHHHHHHHHHHHHHHHhcCCeeccccccceEEcCHHHHH Q lcl|Aclame:pro 204 EGK--N-GNRTFGFLNDPSLLPAIASTTPGGWVSGGA-NAFQGIVGDLRLMLITLRVQSEDNIDPEDVDITLVLPMNKVD 279 (388) Q Consensus 204 a~~--~-~~g~~GllN~P~l~a~~~~~~~~~~t~Wa~-kT~~eI~~DI~~~~~~l~~~s~g~v~~~~~p~tL~Lp~~~~~ 279 (388) ... + .+...|+|+.....+.. .+..++. ..+.+.+.|+...+..-..+.+ .....+|..+.+. T Consensus 148 g~s~~p~~~~~~G~l~~a~~~~~~------~~~~~~a~~~~~d~l~~l~~sl~~~yr~~~-------~~~~~imn~~t~~ 214 (315) T protein:vir:41 148 TSSSDPLLRMSDGWLKLASEKLTE------SDVDPEAEDWPMNLFDTMIESLPTPYRNNL-------PNMKFYVTWDIYR 214 (315) T ss_pred CcCcCccccccccceecccccccc------cccccccccccHHHHHHHHHhcChHHhhcC-------CceEEEEcHHHHH Confidence 531 0 13457888875432211 1111222 2244455555544444443322 1235788888776 Q ss_pred hhcc-CCCcCccHHHHHHHh-----CCccEEEEccccccccCCCCccEEEEEEcccccccccccCCCcceEeecchhhhc Q lcl|Aclame:pro 280 MLSV-VTDLGISVRDWLKQT-----YPRVRVMSAPELQGGNPDDGKDIAYMFLDSVDTAVDGSTDGGDTWAQLVQSKFVT 353 (388) Q Consensus 280 ~Ls~-~~~~~~Tvl~~lk~n-----~pnl~i~~~pel~~a~gtg~~~~~~~~~~~~d~~~~~~~~~~~t~~~~~p~~~r~ 353 (388) .+.+ .+.-|.-+++=.... +-+..|+.+|.+... +..+..++|.+-. . +...+...+|. T Consensus 215 ~~rklk~~~g~~lw~~~~~~g~~~tl~G~PV~~~~~m~~~---~~~~~~ilf~d~~-n-----------l~~~~~~~i~i 279 (315) T protein:vir:41 215 AYRDALKGRETGLGDQALTGANSILYDGRPVQYVPALEAL---NDGKSRALFVVPT-Q-----------LVYGFWRNIKV 279 (315) T ss_pred HHHHHhccCCCccccchhhcCCCceecccceEeccccccc---CCCCccEEEeccc-c-----------eEEEeccccEE Confidence 5533 122222222211111 112236666666533 3334455654421 1 11112222222 Q ss_pred cCce-eccCceEEecccceeeeeeeccccceeeccC Q lcl|Aclame:pro 354 LGVE-KRVKNYVEAYSNATAGVMLKRPWAVVRLIGL 388 (388) Q Consensus 354 ~~v~-~~~~~~~~~~~~~t~G~ii~rP~ai~~~~GI 388 (388) ++-. .+...+.+..+-|.+|-.+-...+++....| T Consensus 280 ~~~~~a~~~~~~~~~~~r~d~~~~~~~~~a~~~~~v 315 (315) T protein:vir:41 280 VPDYDAEMRLTKYVASLRTDNHYEDEEGAVSATITV 315 (315) T ss_pred EeeecCCCCceEEEEEEEeceeEEeccceeEeeeeC Confidence 2111 1112234444567777666678999999999 No 83 >protein:vir:79078 Length: 307 # NCBI annotation: gp8 # Family: family:all:908 # MgeID: mge:1862 # MgeName: phiE255 # Cross-refs: genbank:acc:YP_001111208;genbank:gi:134288798;genbank:GeneID:4960752 Probab=91.07 E-value=0.0088 Score=31.87 Aligned_cols=268 Identities=10% Similarity=0.024 Sum_probs=106.5 Q ss_pred hhccCcccccccccccchHHHHH-HHhhcceeeeecccchhhhhhcccccCCCCceeeEEEeeeccc--cce--Eecccc Q lcl|Aclame:pro 64 TQAFDSAYVAPTTQASIPTPIQF-LQQWLPGFVKVLTSARKIDEILGVKTVGSWEDQEIVQGIVEPA--GTA--MEYGDL 138 (388) Q Consensus 64 ~~amDaa~~~~~t~~~~g~l~~~-l~~idp~v~e~l~~~~~~~~i~~v~t~g~w~~~t~~~~v~e~~--G~a--~~ygd~ 138 (388) |+++..... . -+.|..+ +.|-. +++-+..|||..-.+.-.-...+|+- |.- ... .+-|.. T Consensus 1 m~~~~~~~~---~---dp~LT~~A~gy~n--------~~~Iad~lfP~vpV~~~~~k~~~f~~-e~f~~~~t~ra~~~~~ 65 (307) T protein:vir:79 1 MGRLSKLRI---V---DPVLTNLAIGYTN--------AEFIGQTLMPVVEVEKEGGKIPKFGK-ESFRLYQTERALRAKS 65 (307) T ss_pred CCCCCCCcc---c---CHHHHHHHhhccc--------hhhhhhhcCCcccccccccceeeecc-ccccccccccccCCCc Confidence 566664322 0 1112111 11212 23455666776544433323333321 000 000 011111 Q ss_pred cCCceeeeeeeeeeeeEEEEEEEEeecHHHHHHHHHhCCChHHHHH----HHHHHHHHHhhceEEEEeecCccccceEEE Q lcl|Aclame:pro 139 TNIPLSSWNVNFERRTIVRGEMGIQVGLLEEGRASAMRINSAEVKR----QGAAVQLEIMRNAIGFYGWEGKNGNRTFGF 214 (388) Q Consensus 139 ~diP~~~~n~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~K~----~aAr~a~~~~~n~i~~~G~a~~~~~g~~Gl 214 (388) +.+...++. +.....-+.+..+-+. .+..+..++++..++. ....+..|...-+++|-+. .| T Consensus 66 ~~v~~~~~~----~~~~~~~~~~l~~~id-~r~~~~~~~~~~~~Av~~l~d~I~l~~E~~~A~l~~~~~-------~y-- 131 (307) T protein:vir:79 66 NRMNPEDID----SVDVNLDEHDLEYPID-YREDQESAFPLEQAAVQTATDAIQLRREKMIADLSQNPS-------SY-- 131 (307) T ss_pred ceeeeeccc----cccccccccchhhccc-chhcCCCCCCHHHHHHHHHHHHHHhHHHHHHHHHhcccc-------cc-- Confidence 111110000 0000000111111111 1122233344333322 2334445555555555321 22 Q ss_pred eecCCCccccccccCCcccccccCCHHHHHHHHHHHHHHHHHhcCCeeccccccceEEcCHHHHHhhcc---------CC Q lcl|Aclame:pro 215 LNDPSLLPAIASTTPGGWVSGGANAFQGIVGDLRLMLITLRVQSEDNIDPEDVDITLVLPMNKVDMLSV---------VT 285 (388) Q Consensus 215 lN~P~l~a~~~~~~~~~~t~Wa~kT~~eI~~DI~~~~~~l~~~s~g~v~~~~~p~tL~Lp~~~~~~Ls~---------~~ 285 (388) |+- .-.+.+|...|.+++ -+++.||.++...+...++ ..|++++|....+.+|.+ .+ T Consensus 132 ---~~~----~k~tLsgt~~Wsd~~-sDPi~di~~~~~ai~~~~g------~~Pn~~vlg~~a~~~l~~h~~i~~~lk~~ 197 (307) T protein:vir:79 132 ---AAG----NKKQLSATEKFTAAN-SDPVGVIEDGKEAIRTKIG------RRPNTMVIGASAYKTLKAHPQLIEKIKYS 197 (307) T ss_pred ---CCC----ceEEEccCcccCCCC-CCcHHHHHHHHHHHHHhhC------CccceEEeCHHHHHHHhcCHHHHHHhcCc Confidence 110 011123445798865 5589999999999999886 468999999999998853 12 Q ss_pred CcC-ccHHHHHHHhCCccEEEEc--cccccccCC----CCccEEEEEEcccccccccccCCCcceEe--ecchhhhccCc Q lcl|Aclame:pro 286 DLG-ISVRDWLKQTYPRVRVMSA--PELQGGNPD----DGKDIAYMFLDSVDTAVDGSTDGGDTWAQ--LVQSKFVTLGV 356 (388) Q Consensus 286 ~~~-~Tvl~~lk~n~pnl~i~~~--pel~~a~gt----g~~~~~~~~~~~~d~~~~~~~~~~~t~~~--~~p~~~r~~~v 356 (388) ..+ +| .++|++-+ .++.+.+ .-+..+++. -+.+++.+|++..-. ...+...+. .+.- T Consensus 198 ~~g~it-~~~la~l~-~v~~V~vg~a~y~~~~~~~~~iw~~~~~l~y~~~~~~-----~~~~~~~~ps~Gyt~------- 263 (307) T protein:vir:79 198 MKGIVT-VDLLKEIF-EVENIAVGEAIYADDKDRFTDIWGANIVLAYVPLQRG-----GQQRTPYEPSYGYTL------- 263 (307) T ss_pred cccccC-HHHHHHHh-CceeEEEeeeeeecccccchhcCCCceEEEecccccC-----CCCCcccccccceeE------- Confidence 223 34 45665543 3432222 112222211 234677777644211 111111111 1111 Q ss_pred eeccCceEEeccccee-----eeeeeccccceeeccC Q lcl|Aclame:pro 357 EKRVKNYVEAYSNATA-----GVMLKRPWAVVRLIGL 388 (388) Q Consensus 357 ~~~~~~~~~~~~~~t~-----G~ii~rP~ai~~~~GI 388 (388) +.+......++.+..+ ...+.+|.-++.-.|. T Consensus 264 ~~~g~~~~d~~~~~~~~~~vrv~~~~~~~i~~~~~G~ 300 (307) T protein:vir:79 264 RKKGNPVVDTRIEDGKLELVRATDIFRPYLLGADAGY 300 (307) T ss_pred EecCceEEecccCCCceeEEeecccccceeeccccch Confidence 1111122223332211 1223456655555555 No 84 >protein:vir:96762 Length: 632 # NCBI annotation: putative phage-related protein # Family: family:all:21 # MgeID: mge:1628 # MgeName: VP882 # Cross-refs: genbank:acc:YP_001039818;genbank:gi:126010917;genbank:GeneID:5076272 Probab=90.68 E-value=0.019 Score=30.00 Aligned_cols=329 Identities=9% Similarity=-0.002 Sum_probs=136.3 Q ss_pred CCCcceeeeecCc--------------------cccchhhhhhc---cccccccc------CCHHHHhhcceecccchhh Q lcl|Aclame:pro 1 MKQLSKVHQSLAG--------------------RSVRAFDMANG---KADYRLTD------MAVRELKKFGLVFDHATVK 51 (388) Q Consensus 1 ~~~~~~~~~~~~~--------------------~~~~~~~~~~~---~~~~~~~~------~~~~~l~~~g~~~~~~~~~ 51 (388) |.+.+..+..-.. |.++-..+.+. ........ ++...+++.|-. +.. T Consensus 269 l~~~~~a~~~~~~a~~~~~~~~~~~~~~i~~~~re~~~~~l~rai~a~a~~~~~~a~~~~e~a~~~a~~~G~~---arg- 344 (632) T protein:vir:96 269 MNPGQPGNFEKPGAGDLPGKPAIHSARDLGIQHKELQQYSLMRAINAAATGDWSKAGFEREVSLAIADASGKE---ARG- 344 (632) T ss_pred HhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHHHHhhhccchhhhhhhhHHHHHHHHhhhhh---hhh- Confidence 3222221111110 11111111000 00000000 000001111100 000 Q ss_pred cchhhhhhhhhhhhccCcccccccccccchHHHHHHHhhcceeeeecccchhhhhhcccccCCCCceeeEEEeeeccccc Q lcl|Aclame:pro 52 RQIELLHEGGVATQAFDSAYVAPTTQASIPTPIQFLQQWLPGFVKVLTSARKIDEILGVKTVGSWEDQEIVQGIVEPAGT 131 (388) Q Consensus 52 ~~~~~~~~~~~~~~amDaa~~~~~t~~~~g~l~~~l~~idp~v~e~l~~~~~~~~i~~v~t~g~w~~~t~~~~v~e~~G~ 131 (388) +. .....+...++.. .|..+.|+++.. +..-.+|++.+.+....+++ +.... +-....+.+++....+. T Consensus 345 ~~---~~~~~l~~ra~~~-----~t~~~gg~lvp~-~~~~~~iie~lr~~s~i~~l-~~~~~-~~~~g~~~ip~~~~~~~ 413 (632) T protein:vir:96 345 FY---MPHEVLVQRQLEK-----KTAGKGGELVAT-ELLSEEFIDILRNKAIIGQM-GARML-PGLVGDVDIPKKTSGAN 413 (632) T ss_pred hh---hhHHHHHHhhhhc-----cccccccccccc-ccchHHHHHHHhhcchhhhh-cceEe-ecCCcceEEEEEeCCce Confidence 00 0000001122221 123334443321 11113445555443333343 21111 00112456777777677 Q ss_pred eEecccccCCceeeeeeeeeeeeEEEEEEEEeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEEeecCccccce Q lcl|Aclame:pro 132 AMEYGDLTNIPLSSWNVNFERRTIVRGEMGIQVGLLEEGRASAMRINSAEVKRQGAAVQLEIMRNAIGFYGWEGKNGNRT 211 (388) Q Consensus 132 a~~ygd~~diP~~~~n~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~K~~aAr~a~~~~~n~i~~~G~a~~~~~g~ 211 (388) +...|....+|..+...+...-..+.++..+.++.+=|.. ...++.+.-+.....++...+++-.++|+.- .... T Consensus 414 a~wv~E~~~~~~s~~~f~~i~l~~~k~~~~v~iS~ell~d---s~~~~~~~i~~~l~~a~~~~~d~a~l~G~G~--~~~p 488 (632) T protein:vir:96 414 FYWIGEDEDVQDSDFDFTTLSFSPKTIAGAVPVTRKLRKQ---SSIHVENLIREDLIEGIGVALDLAMLTGTGL--ANDP 488 (632) T ss_pred eEeecCCccccccccceeeEEeeeeEEEEehhhHHHHHhc---cchHHHHHHHHHHHHHHHHHHHHHhhcccCC--CCcc Confidence 7778888899999988888899999999888888753432 3567788888888888889999999999642 2357 Q ss_pred EEEeecCCCccccccccCCcccccccCCHHHHHHHHHHHHHHHHHhcCCeeccccccceEEcCHHHHHhhcc---CCCcC Q lcl|Aclame:pro 212 FGFLNDPSLLPAIASTTPGGWVSGGANAFQGIVGDLRLMLITLRVQSEDNIDPEDVDITLVLPMNKVDMLSV---VTDLG 288 (388) Q Consensus 212 ~GllN~P~l~a~~~~~~~~~~t~Wa~kT~~eI~~DI~~~~~~l~~~s~g~v~~~~~p~tL~Lp~~~~~~Ls~---~~~~~ 288 (388) .|++|..++++...+ +++.. ++||.++...+..... +. .....+|.+.....|.. .+..| T Consensus 489 ~Gi~~~~~~~~~~~~---~~~~~---------~~~i~~~~~~i~~~~~---~~--~~~~~~~~~~~~~~l~~~~l~d~~G 551 (632) T protein:vir:96 489 VGLLNMTGVPALTYP---AGGVD---------WASVVDMETKISTFNA---DA--GRLAYLTSVTQRGAAKKAQVFDNTG 551 (632) T ss_pred ceeeecccccceecc---cccCC---------HHHHHHHHHHHhhccc---cc--CccEEEEchhHHHHHHHHhccCCCC Confidence 799998877542211 11111 4466666666655432 11 12357788777766642 23334 Q ss_pred ccHHH--HHHHhCCccEEEEccccccccCCCCccEEEEEEcccccccccccCCCcceEeecchhhhccCceeccCceEEe Q lcl|Aclame:pro 289 ISVRD--WLKQTYPRVRVMSAPELQGGNPDDGKDIAYMFLDSVDTAVDGSTDGGDTWAQLVQSKFVTLGVEKRVKNYVEA 366 (388) Q Consensus 289 ~Tvl~--~lk~n~pnl~i~~~pel~~a~gtg~~~~~~~~~~~~d~~~~~~~~~~~t~~~~~p~~~r~~~v~~~~~~~~~~ 366 (388) .-+++ .| .-||-+.-..+|.-...- |.-+.+ ++...-...... -|..... .....+. T Consensus 552 ~~i~~~~~l-~G~pv~~s~~ip~~~~~~--gd~s~~-~i~~~~~~~i~~-----------~~~~~~~------~~~v~~~ 610 (632) T protein:vir:96 552 ERIWQNNEV-NGYRAEASNQIPADTWIF--GDWSQI-VIAMWGVLDLKV-----------DPYTKAA------SDGLVLR 610 (632) T ss_pred ceeecCCee-cccceEeccccccCcEEE--eecceE-EEEEecceEEEE-----------ccccccc------cCceEEE Confidence 33321 00 012211111122111100 000111 111100000000 0000000 0011111 Q ss_pred cccceeeeeeeccccceeeccC Q lcl|Aclame:pro 367 YSNATAGVMLKRPWAVVRLIGL 388 (388) Q Consensus 367 ~~~~t~G~ii~rP~ai~~~~GI 388 (388) +..+ .++-+++|-+|+...== T Consensus 611 ~~~~-~d~~v~~~~af~~~k~~ 631 (632) T protein:vir:96 611 VFQD-VDAGVRRKEAFCIAKKG 631 (632) T ss_pred EEee-cCceeechhhhhheeec Confidence 2222 35566778777643322 No 85 >protein:vir:9820 Length: 272 # NCBI annotation: putative major capsid/head protein # Family: family:all:522 # MgeID: mge:176 # MgeName: 315.4 # Cross-refs: genbank:acc:NP_795582;genbank:gi:28876339;genbank:GeneID:1257858 Probab=90.39 E-value=0.021 Score=29.77 Aligned_cols=252 Identities=13% Similarity=0.066 Sum_probs=130.9 Q ss_pred hccCcccccccccccchHHHHHHHhhcceeeeecccchhhhhhcccccC--CCCceeeEEEeeeccccceEecccccCCc Q lcl|Aclame:pro 65 QAFDSAYVAPTTQASIPTPIQFLQQWLPGFVKVLTSARKIDEILGVKTV--GSWEDQEIVQGIVEPAGTAMEYGDLTNIP 142 (388) Q Consensus 65 ~amDaa~~~~~t~~~~g~l~~~l~~idp~v~e~l~~~~~~~~i~~v~t~--g~w~~~t~~~~v~e~~G~a~~ygd~~diP 142 (388) ||.-- .+.+++=+|-.+..++...+ ..-.....+.-++.. |.- -.++.++.++..|.+..+++++++| T Consensus 1 MA~~~-----T~~~~~~iPev~s~~v~~~~----~~~~~~~~~~~~~~~~~g~~-G~tv~iP~~~~~~~a~~v~eg~~i~ 70 (272) T protein:vir:98 1 MAVGT-----TKMAQMLDPEVLADMIDAEV----GKAIRFAPLAEVDTTLEGQP-GTTLTVPKWDYIGDAEDVAEGEAIP 70 (272) T ss_pred CCCcc-----ccchheechHHHHHHHHHHH----HHHhhhhccccccccccCCC-CCEEEEEEecCCCCcccccCCCccc Confidence 22111 12233446766666654333 223333334322221 111 1367888888889999999999999 Q ss_pred eeeeeeeeeeeeEEEEEEEEeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEEeecCccccceEEEeecCCCcc Q lcl|Aclame:pro 143 LSSWNVNFERRTIVRGEMGIQVGLLEEGRASAMRINSAEVKRQGAAVQLEIMRNAIGFYGWEGKNGNRTFGFLNDPSLLP 222 (388) Q Consensus 143 ~~~~n~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~K~~aAr~a~~~~~n~i~~~G~a~~~~~g~~GllN~P~l~a 222 (388) ..+...+.....+..++..+.++.++... ...++.+.-...+.+++.+.+++..+ +- ..|-- T Consensus 71 ~~~~~~~~~~~~~~~~~~~~~itd~~~~~---s~~d~~~~~~~~~~~~~a~~~d~~i~-~~-------~~~a~------- 132 (272) T protein:vir:98 71 MTQLGFKKTTMTIKKAGKGVEITDEAILS---GYGDPVGQAAKQIVEAIDHKVDADVL-DA-------LSKST------- 132 (272) T ss_pred ccccccceEEEEeeeeeeeeeecHHHHhh---ccccHHHHHHHHHHHHHHHHHHHHHH-HH-------hcccc------- Confidence 99999999999999998888888766533 35578788888888888877776543 10 11100 Q ss_pred ccccccCCcccccccCCHHHHHHHHHHHHHHHHHhcCCeeccccccceEEcCHHHHHhhccC--------CCcCccHHHH Q lcl|Aclame:pro 223 AIASTTPGGWVSGGANAFQGIVGDLRLMLITLRVQSEDNIDPEDVDITLVLPMNKVDMLSVV--------TDLGISVRDW 294 (388) Q Consensus 223 ~~~~~~~~~~t~Wa~kT~~eI~~DI~~~~~~l~~~s~g~v~~~~~p~tL~Lp~~~~~~Ls~~--------~~~~~Tvl~~ 294 (388) .+.+ ...| +++|.+++..+-..- ..+..++++|..+..|.+. ++++. .. T Consensus 133 -~~~~--------~~~t----~d~i~da~~~l~~~~-------~~~~~~vv~p~~~~~L~k~~~~~~~~~~~~~~---~~ 189 (272) T protein:vir:98 133 -QTVE--------ATAT----VDGVSKALDIFNDED-------DAETVIVMNPADASTLRLDAAKEWLGATEVGA---NR 189 (272) T ss_pred -cccc--------cccC----HHHHHHHHHHHhccC-------CCccEEEEcHHHHHHHHHhccccccccccccc---cc Confidence 0001 0112 456666666554331 2345799999999887432 12221 11 Q ss_pred HH----HhCCccEEEEccccccccCCCCccEEEEEEcccccccccccCCCcceEeecchhhhccCceeccCceEEeccc- Q lcl|Aclame:pro 295 LK----QTYPRVRVMSAPELQGGNPDDGKDIAYMFLDSVDTAVDGSTDGGDTWAQLVQSKFVTLGVEKRVKNYVEAYSN- 369 (388) Q Consensus 295 lk----~n~pnl~i~~~pel~~a~gtg~~~~~~~~~~~~d~~~~~~~~~~~t~~~~~p~~~r~~~v~~~~~~~~~~~~~- 369 (388) +. .++-+++|...+.+. .+..+++.+.- .......+. +.+. .| ........... T Consensus 190 ~~~g~ig~i~G~~Vi~s~~~p-------~~t~~~~~~~a--~~~~~~~~~-~ve~-----~r------~~~~~~~~i~~~ 248 (272) T protein:vir:98 190 VVSGVYGEVLGVQIVRSRKCP-------KGTAYMVRKGA--LRIMLKRNT-MVET-----DR------DITKAINQIVAN 248 (272) T ss_pred cccccchhhcCeeEEEcCCCC-------cceEEEEcCCe--EEEEecCCc-eeee-----cc------ccccceeEEEEE Confidence 11 122345555443332 12233332220 000000000 1111 01 01111111121 Q ss_pred ceeeeeeeccccceeeccC Q lcl|Aclame:pro 370 ATAGVMLKRPWAVVRLIGL 388 (388) Q Consensus 370 ~t~G~ii~rP~ai~~~~GI 388 (388) +..|+-+.+|-+++..+-= T Consensus 249 ~~~~~~v~~~~~vv~~t~~ 267 (272) T protein:vir:98 249 KHYGVYLYKAEKAVKITLK 267 (272) T ss_pred EEEEEEEEcCCceEEEEec Confidence 3567888889888887544 No 86 >protein:vir:3033 Length: 272 # NCBI annotation: major capsid protein # Family: family:all:522 # MgeID: mge:61 # MgeName: PhiNIH1.1 # Cross-refs: genbank:acc:NP_438146;genbank:gi:16271809;genbank:GeneID:929235 Probab=90.39 E-value=0.021 Score=29.77 Aligned_cols=252 Identities=13% Similarity=0.066 Sum_probs=130.9 Q ss_pred hccCcccccccccccchHHHHHHHhhcceeeeecccchhhhhhcccccC--CCCceeeEEEeeeccccceEecccccCCc Q lcl|Aclame:pro 65 QAFDSAYVAPTTQASIPTPIQFLQQWLPGFVKVLTSARKIDEILGVKTV--GSWEDQEIVQGIVEPAGTAMEYGDLTNIP 142 (388) Q Consensus 65 ~amDaa~~~~~t~~~~g~l~~~l~~idp~v~e~l~~~~~~~~i~~v~t~--g~w~~~t~~~~v~e~~G~a~~ygd~~diP 142 (388) ||.-- .+.+++=+|-.+..++...+ ..-.....+.-++.. |.- -.++.++.++..|.+..+++++++| T Consensus 1 MA~~~-----T~~~~~~iPev~s~~v~~~~----~~~~~~~~~~~~~~~~~g~~-G~tv~iP~~~~~~~a~~v~eg~~i~ 70 (272) T protein:vir:30 1 MAVGT-----TKMAQMLDPEVLADMIDAEV----GKAIRFAPLAEVDTTLEGQP-GTTLTVPKWDYIGDAEDVAEGEAIP 70 (272) T ss_pred CCCcc-----ccchheechHHHHHHHHHHH----HHHhhhhccccccccccCCC-CCEEEEEEecCCCCcccccCCCccc Confidence 22111 12233446766666654333 223333334322221 111 1367888888889999999999999 Q ss_pred eeeeeeeeeeeeEEEEEEEEeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEEeecCccccceEEEeecCCCcc Q lcl|Aclame:pro 143 LSSWNVNFERRTIVRGEMGIQVGLLEEGRASAMRINSAEVKRQGAAVQLEIMRNAIGFYGWEGKNGNRTFGFLNDPSLLP 222 (388) Q Consensus 143 ~~~~n~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~K~~aAr~a~~~~~n~i~~~G~a~~~~~g~~GllN~P~l~a 222 (388) ..+...+.....+..++..+.++.++... ...++.+.-...+.+++.+.+++..+ +- ..|-- T Consensus 71 ~~~~~~~~~~~~~~~~~~~~~itd~~~~~---s~~d~~~~~~~~~~~~~a~~~d~~i~-~~-------~~~a~------- 132 (272) T protein:vir:30 71 MTQLGFKKTTMTIKKAGKGVEITDEAILS---GYGDPVGQAAKQIVEAIDHKVDADVL-DA-------LSKST------- 132 (272) T ss_pred ccccccceEEEEeeeeeeeeeecHHHHhh---ccccHHHHHHHHHHHHHHHHHHHHHH-HH-------hcccc------- Confidence 99999999999999998888888766533 35578788888888888877776543 10 11100 Q ss_pred ccccccCCcccccccCCHHHHHHHHHHHHHHHHHhcCCeeccccccceEEcCHHHHHhhccC--------CCcCccHHHH Q lcl|Aclame:pro 223 AIASTTPGGWVSGGANAFQGIVGDLRLMLITLRVQSEDNIDPEDVDITLVLPMNKVDMLSVV--------TDLGISVRDW 294 (388) Q Consensus 223 ~~~~~~~~~~t~Wa~kT~~eI~~DI~~~~~~l~~~s~g~v~~~~~p~tL~Lp~~~~~~Ls~~--------~~~~~Tvl~~ 294 (388) .+.+ ...| +++|.+++..+-..- ..+..++++|..+..|.+. ++++. .. T Consensus 133 -~~~~--------~~~t----~d~i~da~~~l~~~~-------~~~~~~vv~p~~~~~L~k~~~~~~~~~~~~~~---~~ 189 (272) T protein:vir:30 133 -QTVE--------ATAT----VDGVSKALDIFNDED-------DAETVIVMNPADASTLRLDAAKEWLGATEVGA---NR 189 (272) T ss_pred -cccc--------cccC----HHHHHHHHHHHhccC-------CCccEEEEcHHHHHHHHHhccccccccccccc---cc Confidence 0001 0112 456666666554331 2345799999999887432 12221 11 Q ss_pred HH----HhCCccEEEEccccccccCCCCccEEEEEEcccccccccccCCCcceEeecchhhhccCceeccCceEEeccc- Q lcl|Aclame:pro 295 LK----QTYPRVRVMSAPELQGGNPDDGKDIAYMFLDSVDTAVDGSTDGGDTWAQLVQSKFVTLGVEKRVKNYVEAYSN- 369 (388) Q Consensus 295 lk----~n~pnl~i~~~pel~~a~gtg~~~~~~~~~~~~d~~~~~~~~~~~t~~~~~p~~~r~~~v~~~~~~~~~~~~~- 369 (388) +. .++-+++|...+.+. .+..+++.+.- .......+. +.+. .| ........... T Consensus 190 ~~~g~ig~i~G~~Vi~s~~~p-------~~t~~~~~~~a--~~~~~~~~~-~ve~-----~r------~~~~~~~~i~~~ 248 (272) T protein:vir:30 190 VVSGVYGEVLGVQIVRSRKCP-------KGTAYMVRKGA--LRIMLKRNT-MVET-----DR------DITKAINQIVAN 248 (272) T ss_pred cccccchhhcCeeEEEcCCCC-------cceEEEEcCCe--EEEEecCCc-eeee-----cc------ccccceeEEEEE Confidence 11 122345555443332 12233332220 000000000 1111 01 01111111121 Q ss_pred ceeeeeeeccccceeeccC Q lcl|Aclame:pro 370 ATAGVMLKRPWAVVRLIGL 388 (388) Q Consensus 370 ~t~G~ii~rP~ai~~~~GI 388 (388) +..|+-+.+|-+++..+-= T Consensus 249 ~~~~~~v~~~~~vv~~t~~ 267 (272) T protein:vir:30 249 KHYGVYLYKAEKAVKITLK 267 (272) T ss_pred EEEEEEEEcCCceEEEEec Confidence 3567888889888887544 No 87 >protein:vir:99888 Length: 309 # NCBI annotation: capsid protein # Family: family:all:908 # MgeID: mge:1480 # MgeName: B3 # Cross-refs: genbank:acc:YP_164075;genbank:gi:56692607;genbank:GeneID:3192616 Probab=90.10 E-value=0.023 Score=29.60 Aligned_cols=268 Identities=12% Similarity=0.023 Sum_probs=114.8 Q ss_pred hhhhhhccCcccccccccccchHHHHHHHhhcceeeeecccchhhhhhcccccCCCCceeeEEEeeeccccceEeccc-- Q lcl|Aclame:pro 60 GGVATQAFDSAYVAPTTQASIPTPIQFLQQWLPGFVKVLTSARKIDEILGVKTVGSWEDQEIVQGIVEPAGTAMEYGD-- 137 (388) Q Consensus 60 ~~~~~~amDaa~~~~~t~~~~g~l~~~l~~idp~v~e~l~~~~~~~~i~~v~t~g~w~~~t~~~~v~e~~G~a~~ygd-- 137 (388) +.=+.+-.|..+ | ++ =+.|-.++ +-++.|||..-.+...-...+|+-.|.- ..+.+ T Consensus 1 ~~~~~~~~dp~L----T--~~-----A~gy~n~~--------~Ia~~l~P~vpV~~~~~~~~~f~~~e~F---~~~~t~r 58 (309) T protein:vir:99 1 MSNAPFPIDPEL----T--AI-----AIAYRNGR--------MISDEVLPRVPVGKQEFKFWKYDLAQGF---TVPETLV 58 (309) T ss_pred CCCCCcCcCHhH----H--HH-----HhhccChh--------hhhhhcCCccccCccccceeeechhhcc---cccchhh Confidence 000111122211 1 00 01222232 4456777776555433334344322211 01110 Q ss_pred --ccCCceeeeeeeeeeeeEEEEEEEEeecHHHHHHHHHhCCChHHHHHHHHHHHH----HHhhceEEEEeecCccccce Q lcl|Aclame:pro 138 --LTNIPLSSWNVNFERRTIVRGEMGIQVGLLEEGRASAMRINSAEVKRQGAAVQL----EIMRNAIGFYGWEGKNGNRT 211 (388) Q Consensus 138 --~~diP~~~~n~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~K~~aAr~a~----~~~~n~i~~~G~a~~~~~g~ 211 (388) ..+.-.++.........+...+.-.-+...|...| ..+.++.++....++..+ |...-++++-- . T Consensus 59 ~~~~~~~~v~~~~~~~~~~~~~~~L~~~i~~~~~~~a-~~~~d~~~~Av~~l~~~i~l~rE~~~A~lv~~~-------a- 129 (309) T protein:vir:99 59 GRKSKPNEVEFSATDETGSTEDHGLDAPVPQADIDNA-PTNYNPLGHATEQTTNLILLDREARTSKLVFSP-------N- 129 (309) T ss_pred ccCCCcceEeecccCceeeecccceeecCCchhhhhc-cCCCCHHHHHHHHHHHHHHHHHHHHHHHHhcCh-------h- Confidence 11222333333333333444444444555565544 235666655554444433 33333333311 0 Q ss_pred EEEeecCCCccccccccCCcccccccCCHHHHHHHHHHHHHHHHHhcCCeeccccccceEEcCHHHHHhhcc-------- Q lcl|Aclame:pro 212 FGFLNDPSLLPAIASTTPGGWVSGGANAFQGIVGDLRLMLITLRVQSEDNIDPEDVDITLVLPMNKVDMLSV-------- 283 (388) Q Consensus 212 ~GllN~P~l~a~~~~~~~~~~t~Wa~kT~~eI~~DI~~~~~~l~~~s~g~v~~~~~p~tL~Lp~~~~~~Ls~-------- 283 (388) |+| +.- -.+.+|...|.+++. +++.||.++...+ | -.|++++|....+.+|.+ T Consensus 130 ----~y~---~~~-k~~Lsgt~~wsd~~S-DPi~~i~~~~~~~-----g-----~~PN~~vlg~~~~~~l~~hp~i~~~i 190 (309) T protein:vir:99 130 ----SYA---AGN-KTTLSGADQWSDPTS-NPLPVITDALDSV-----I-----LRPNIGVLGRRTATILRRHPKIVKAY 190 (309) T ss_pred ----hcC---CCc-eEEecCccccCCCCC-CcHHHHHHHHHhh-----C-----CCcceEEechHHHHHHhhCHHHHHHh Confidence 111 100 011233446887553 4778888887654 2 368999999999998853 Q ss_pred -CCC--cCccHHHHHHHhCCccEEEEc-ccccccc-CC-------CCccEEEEEEcccccccccccCCCcceEeecchhh Q lcl|Aclame:pro 284 -VTD--LGISVRDWLKQTYPRVRVMSA-PELQGGN-PD-------DGKDIAYMFLDSVDTAVDGSTDGGDTWAQLVQSKF 351 (388) Q Consensus 284 -~~~--~~~Tvl~~lk~n~pnl~i~~~-pel~~a~-gt-------g~~~~~~~~~~~~d~~~~~~~~~~~t~~~~~p~~~ 351 (388) .+. .|.--.++|++-+-=-+|.-- .-+..+. +. -+.+++++|....-. +.++ .+|..-+.-.. T Consensus 191 k~~~~~~g~it~~~la~l~~ve~V~vg~a~~n~a~~g~~~~~~~iwg~~~~L~y~~~~~~----~~~~-ps~G~t~~~~~ 265 (309) T protein:vir:99 191 NGSLGDEGMVPMAFLQELLELDAIYIGEARLNIARPGQNPNLIRAWGPHASFIYRDRLAD----TRNG-TTFGLTAQWGD 265 (309) T ss_pred cCCCccccccCHHHHHHHhCcceEEeecceeeccccccccccccccCCcEEEEEcCCCCC----Cccc-ccccceeeccc Confidence 111 132225777766432233321 1121110 00 145667777654311 1111 22322221112 Q ss_pred hccCceeccCceEEecccceeeeeee-----ccccceeeccC Q lcl|Aclame:pro 352 VTLGVEKRVKNYVEAYSNATAGVMLK-----RPWAVVRLIGL 388 (388) Q Consensus 352 r~~~v~~~~~~~~~~~~~~t~G~ii~-----rP~ai~~~~GI 388 (388) |.- ..|..++...-||-.|| .|.-++.-.|. T Consensus 266 r~~------g~~~d~~~~~~g~~~vr~~~~~k~~i~~~d~G~ 301 (309) T protein:vir:99 266 RVS------GSIADPNIGLRGGQRVRVGESVKELVTAPDLGF 301 (309) T ss_pred ccC------CceeeeeeccCCceEEEEeccccchhcchhcch Confidence 222 24555665555554443 56666666665 No 88 >protein:vir:107882 Length: 307 # NCBI annotation: gp34 # Family: family:all:908 # MgeID: mge:1565 # MgeName: BcepMu # Cross-refs: genbank:acc:YP_024707;genbank:gi:48696944;genbank:GeneID:2845970 Probab=89.88 E-value=0.011 Score=31.40 Aligned_cols=269 Identities=12% Similarity=0.060 Sum_probs=107.9 Q ss_pred hhccCcccc-cccccccchHHHHHHHhhcceeeeecccchhhhhhcccccCCCCceeeEEEeeeccccc----eEecccc Q lcl|Aclame:pro 64 TQAFDSAYV-APTTQASIPTPIQFLQQWLPGFVKVLTSARKIDEILGVKTVGSWEDQEIVQGIVEPAGT----AMEYGDL 138 (388) Q Consensus 64 ~~amDaa~~-~~~t~~~~g~l~~~l~~idp~v~e~l~~~~~~~~i~~v~t~g~w~~~t~~~~v~e~~G~----a~~ygd~ 138 (388) |++|..... +| -+.-++ +.|-.| .+-+..+||..-.+--.-...+|+- |.-=. ..+-|+. T Consensus 1 m~~~~~~~~~dp----~LT~~A--~gy~n~--------~~ia~~l~P~vpv~~~~~k~~~f~~-eaF~~~~t~r~~~~~~ 65 (307) T protein:vir:10 1 MGRLSKLRIVDP----VLTNLA--IGYTNA--------EFIGQSLMPVVEVEKEGGKIPKFGK-ESFRLYKTERALRARS 65 (307) T ss_pred CCCCCCCcccCh----hHHHHH--Hhhcch--------hhhhhhcCCcccccccccceeeECc-ccccchhhhcccCCCc Confidence 566664332 11 111000 222223 2455566676554443333333321 11000 0000111 Q ss_pred cCCceeeeeeeeeeeeEEEEEEEEeecHHHHHHHHHhCCChHHHHHHHH----HHHHHHhhceEEEEeecCccccceEEE Q lcl|Aclame:pro 139 TNIPLSSWNVNFERRTIVRGEMGIQVGLLEEGRASAMRINSAEVKRQGA----AVQLEIMRNAIGFYGWEGKNGNRTFGF 214 (388) Q Consensus 139 ~diP~~~~n~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~K~~aA----r~a~~~~~n~i~~~G~a~~~~~g~~Gl 214 (388) +-+-..... ..+...-+-+..+-+. .+.++....++.+++...+ ++..|...-++++-. ..|+ T Consensus 66 ~~v~~~~~~----~~~~~~~~~~L~~~id-~r~~~~~~~~~~~~av~~l~d~I~l~~E~~~A~l~~~~-------~~y~- 132 (307) T protein:vir:10 66 NRMNPEDLG----SIDIVLDEHDLEYPID-YREDQESAFPLEQAAVQTATEAIQLRREKMVADLAQNP-------NSYA- 132 (307) T ss_pred ceeeccccc----ccccccccccccccCC-hhhcCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHhcCc-------cccC- Confidence 111000000 0000001111111111 1223333445444443333 333344444444421 1121 Q ss_pred eecCCCccccccccCCcccccccCCHHHHHHHHHHHHHHHHHhcCCeeccccccceEEcCHHHHHhhcc---------CC Q lcl|Aclame:pro 215 LNDPSLLPAIASTTPGGWVSGGANAFQGIVGDLRLMLITLRVQSEDNIDPEDVDITLVLPMNKVDMLSV---------VT 285 (388) Q Consensus 215 lN~P~l~a~~~~~~~~~~t~Wa~kT~~eI~~DI~~~~~~l~~~s~g~v~~~~~p~tL~Lp~~~~~~Ls~---------~~ 285 (388) + . ...+.+|...|.+++ -+++.||.++...+...++ ..|++++|....+.+|.+ .+ T Consensus 133 ----~---~-~k~tLsGt~~Wsd~~-sDPi~di~~~~~ai~~~~g------~~Pn~~vlg~~a~~al~~hp~i~e~lk~~ 197 (307) T protein:vir:10 133 ----G---G-NKKQLSATEKFTAAG-SDPVGVIEDGKEAIRTKIG------RRPNTMVIGASAYKTLKAHPQLIEKIKYS 197 (307) T ss_pred ----C---C-ceEEeccccccCCCC-CCcHHHHHHHHHHHHhhhC------CccceEEeCHHHHHHHhcCHHHHHHhCCc Confidence 1 0 011123445798865 5588999999999998886 468999999999998853 12 Q ss_pred CcC-ccHHHHHHHhCCccEEEEcc--ccccccCC----CCccEEEEEEcccccccccccCCCcceEeecchhhhccCcee Q lcl|Aclame:pro 286 DLG-ISVRDWLKQTYPRVRVMSAP--ELQGGNPD----DGKDIAYMFLDSVDTAVDGSTDGGDTWAQLVQSKFVTLGVEK 358 (388) Q Consensus 286 ~~~-~Tvl~~lk~n~pnl~i~~~p--el~~a~gt----g~~~~~~~~~~~~d~~~~~~~~~~~t~~~~~p~~~r~~~v~~ 358 (388) ..| +| .+.|++-+ .++.+.+- -+..+++. -+.+++.+|++........+... .+|- +... . T Consensus 198 ~~g~it-~~~la~ll-~v~~i~vg~a~~~~~~~~~~~iw~~~~vl~yv~~~~~~~~~~~~e-psfG--yT~~-------~ 265 (307) T protein:vir:10 198 MKGIVT-VDLLKEIF-EVENIAVGEAIYADDKDRFTDIWGANIVLAYVPLQRGGQQRTPYE-PSYG--YTLR-------K 265 (307) T ss_pred cccccC-HHHHHHHh-CceeEEEeeeeeeccCCccceeCCCceEEEecccccCCCCCcccc-cccc--eeEE-------E Confidence 223 44 34555433 44433332 12222110 14467777765421100000000 0111 1111 2 Q ss_pred ccCceEEecccceeeee------eeccccceeeccC Q lcl|Aclame:pro 359 RVKNYVEAYSNATAGVM------LKRPWAVVRLIGL 388 (388) Q Consensus 359 ~~~~~~~~~~~~t~G~i------i~rP~ai~~~~GI 388 (388) +...++.++.+. +|+- +.+|.-++...|. T Consensus 266 ~g~~~~d~~~~~-~~~~~~r~~~~~~~~i~~~~~G~ 300 (307) T protein:vir:10 266 KGNPVVDTRIED-GKLELVRSTDIFRPYLLGADAGY 300 (307) T ss_pred cCCeEeeceecC-CceeEEeccccccceeecccccc Confidence 223333333332 2322 3456666666665 No 89 >protein:vir:4856 Length: 293 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:106 # MgeName: DT1 # Cross-refs: genbank:acc:NP_049396;genbank:gi:9632424;genbank:GeneID:1258532 Probab=89.68 E-value=0.025 Score=29.37 Aligned_cols=270 Identities=11% Similarity=-0.065 Sum_probs=127.4 Q ss_pred hhhccCcccccccccccch--HHHHHHHhhcceeeeecccchhhhhhcccccCCCCceeeEEEeeec-cccceEeccccc Q lcl|Aclame:pro 63 ATQAFDSAYVAPTTQASIP--TPIQFLQQWLPGFVKVLTSARKIDEILGVKTVGSWEDQEIVQGIVE-PAGTAMEYGDLT 139 (388) Q Consensus 63 ~~~amDaa~~~~~t~~~~g--~l~~~l~~idp~v~e~l~~~~~~~~i~~v~t~g~w~~~t~~~~v~e-~~G~a~~ygd~~ 139 (388) =..+|-. .+..+.| +|..+.+ +|++.+......+++..+....... ....+...+ ..+.+...+... T Consensus 1 ~l~~~~~-----~t~~~gg~liP~~~~~----~Ii~~~~~~~~l~~~~~~~~~~~~~-g~~~~~~~~~~~~~a~~v~Eg~ 70 (293) T protein:vir:48 1 MLDSKTD-----HSGSDAGLTIPQDIRT----AINTLVRQYDSLQEYVNVENVTTLT-GSRVYEKWTDITGLANIDDEAG 70 (293) T ss_pred Cceeecc-----cccCcCceEechhHHH----HHHHHHHhhhhhhhhceeeeccCCc-ceEEEEeecCCCcceeeecCCc Confidence 1123332 1222333 4555544 5566655544455553332211111 122333333 345677888888 Q ss_pred CCce-eeeeeeeeeeeEEEEEEEEeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEEeecCccccceEEEeecC Q lcl|Aclame:pro 140 NIPL-SSWNVNFERRTIVRGEMGIQVGLLEEGRASAMRINSAEVKRQGAAVQLEIMRNAIGFYGWEGKNGNRTFGFLNDP 218 (388) Q Consensus 140 diP~-~~~n~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~K~~aAr~a~~~~~n~i~~~G~a~~~~~g~~GllN~P 218 (388) .+|- .+....+..-..+.++..+.++.+=++- ...+|.+.-....++++...+|+-.+.|.... T Consensus 71 ~~~~~~~~~~~~i~l~~~k~~~~~~iS~ell~d---s~~~l~~~i~~~la~~~~~~~~~~i~~g~~~~------------ 135 (293) T protein:vir:48 71 KIADIDDPKLSLIKYTIKRYAGISTVTNSLLAD---SAENILAWLSGWIAKKVVVTRNKAILGVVDKL------------ 135 (293) T ss_pred ccccccccceeEEEEeeeEEEEeehhhHHHHhh---hhHHHHHHHHHHHHHHHHHHHHhHHhhccccc------------ Confidence 8885 4567888888999999988888854433 34678888888888888888887666553210 Q ss_pred CCccccccccCCcccccccCCHHHHHHHHHHHHHHHHHhcCCeeccccccceEEcCHHHHHhhcc-CCCcCccHHHHHHH Q lcl|Aclame:pro 219 SLLPAIASTTPGGWVSGGANAFQGIVGDLRLMLITLRVQSEDNIDPEDVDITLVLPMNKVDMLSV-VTDLGISVRDWLKQ 297 (388) Q Consensus 219 ~l~a~~~~~~~~~~t~Wa~kT~~eI~~DI~~~~~~l~~~s~g~v~~~~~p~tL~Lp~~~~~~Ls~-~~~~~~Tvl~~lk~ 297 (388) ++.. ...+ ++||.+++..+...-. ....++|.++.+..|.+ .+..|.-+++=--. T Consensus 136 --------~~~~-----~~~~----~d~i~~~~~~l~~~~~-------~~a~~vmn~~~~~~L~~lkd~~g~~l~~~~~~ 191 (293) T protein:vir:48 136 --------PTKP-----TLTK----WDDIIDLEAKVDPAIK-------QTSFFLTNTSGFTALKKVKNALGDYLMERDVK 191 (293) T ss_pred --------cccc-----cccC----HHHHHHHHHhhhhhhc-------CCCEEEEcHHHHHHHHHhhccCCceEeecCcC Confidence 0000 1122 5677777777754321 12368999999998854 23334322210001 Q ss_pred hCCccEEEEcccc--cc--ccCCCCccEEEEEEcccccccccccCCCcceEeecchhhhccCceeccCceEEecccceee Q lcl|Aclame:pro 298 TYPRVRVMSAPEL--QG--GNPDDGKDIAYMFLDSVDTAVDGSTDGGDTWAQLVQSKFVTLGVEKRVKNYVEAYSNATAG 373 (388) Q Consensus 298 n~pnl~i~~~pel--~~--a~gtg~~~~~~~~~~~~d~~~~~~~~~~~t~~~~~p~~~r~~~v~~~~~~~~~~~~~~t~G 373 (388) +...-+|-..|=. .. ....+.++..++|.+ .......... ..+...+... .......-...+-+..|-+| T Consensus 192 ~~~~~~l~G~Pv~~~~~~~~~~~~~~~~~~~~gd-~~~~~~~~~~--~~~~i~~~~~---~~~~~~~~~~~~r~~~r~d~ 265 (293) T protein:vir:48 192 SPTGYSIAGFAVKEISDRWLPNASSGVMPLYFGD-LKQAVTLFDR--QQMSLLSTNI---GGGAFETDTTKVRVIDRFDV 265 (293) T ss_pred CCCCceecceeeEEecccccCCccCCceEEEEEe-ccceEEEEEe--cceEEEEecc---cchhhhcCeEEEEEEEeeCc Confidence 1111123322211 10 001122333333322 1110000000 0011111000 00000011122233344444 Q ss_pred eeeeccccceeeccC Q lcl|Aclame:pro 374 VMLKRPWAVVRLIGL 388 (388) Q Consensus 374 ~ii~rP~ai~~~~GI 388 (388) .+++|-||+.+..= T Consensus 266 -~~~~~~a~~~l~~~ 279 (293) T protein:vir:48 266 -VATDTEAFVPASFK 279 (293) T ss_pred -EEecccceEEEEee Confidence 56779999977643 No 90 >protein:vir:1328 Length: 392 # NCBI annotation: gp36 # Family: family:all:21 # MgeID: mge:28 # MgeName: phi-C31 # Cross-refs: genbank:acc:NP_047927;swissprot:trembl:q9zwv6;genbank:gi:9631145;uniprot:Q9ZWV6;genbank:GeneID:2715889 Probab=88.40 E-value=0.02 Score=29.87 Aligned_cols=331 Identities=10% Similarity=-0.015 Sum_probs=135.3 Q ss_pred CCCcceeeeecCccccch--------hhhhhcc--cccccccC--CHHHHhhcceecccchhhcchhhhhhhhhhhhccC Q lcl|Aclame:pro 1 MKQLSKVHQSLAGRSVRA--------FDMANGK--ADYRLTDM--AVRELKKFGLVFDHATVKRQIELLHEGGVATQAFD 68 (388) Q Consensus 1 ~~~~~~~~~~~~~~~~~~--------~~~~~~~--~~~~~~~~--~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~amD 68 (388) +..+..+-.+|. |.... ....+.. ........ ......+.|.. ...+....+....+ T Consensus 44 ~~e~~~l~~~i~-~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~g~~----------~~~~~~~~~~~~~~ 112 (392) T protein:vir:13 44 LTAVADFDGRIK-RGIDAIKATDAVTSLLSGLQGSGSGAQRSADHDDDAVLRAGNL----------GEARSFEFAPEKRD 112 (392) T ss_pred HHHHHHHHHHHH-HHHHHHHHHHHHHHHhcccCCcccchhhhhhHHHHHHHhccch----------hhhHHHHhhhhhhc Confidence 000000000000 00000 0000000 00000000 00000011100 00111111111111 Q ss_pred cccccccccccchHHH-HHHHhhcceeeeecccchhhhhhcccccCCCCceeeEEEeeeccccceEecccccCCceeeee Q lcl|Aclame:pro 69 SAYVAPTTQASIPTPI-QFLQQWLPGFVKVLTSARKIDEILGVKTVGSWEDQEIVQGIVEPAGTAMEYGDLTNIPLSSWN 147 (388) Q Consensus 69 aa~~~~~t~~~~g~l~-~~l~~idp~v~e~l~~~~~~~~i~~v~t~g~w~~~t~~~~v~e~~G~a~~ygd~~diP~~~~n 147 (388) . .+.++.+++. .+.. +.|.+.+-...-.+.+..+.... ....+.+++....+.+...+....+|..+.. T Consensus 113 ~-----t~~~~g~~~~~~~~~---~~i~~~~~~~~~l~~~~~~~~~~--~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~ 182 (392) T protein:vir:13 113 G-----TKAGNPNVLSRTLYG---QLIAQAVERSAIMRGGASTFTTS--DANPMDFTVITGRATAGIVGETAEIPESYPA 182 (392) T ss_pred c-----cccCCCccccccchH---HHHHHHHhhhhhhhhcceeeecC--CCceeEEEEEcCCcceeeecccccccccccc Confidence 1 1222222211 1111 11112111111122222211111 1133456677777777788999999999999 Q ss_pred eeeeeeeEEEEEEEEeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEEeecCccccceEEEeecCCCccccccc Q lcl|Aclame:pro 148 VNFERRTIVRGEMGIQVGLLEEGRASAMRINSAEVKRQGAAVQLEIMRNAIGFYGWEGKNGNRTFGFLNDPSLLPAIAST 227 (388) Q Consensus 148 ~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~K~~aAr~a~~~~~n~i~~~G~a~~~~~g~~GllN~P~l~a~~~~~ 227 (388) .+...-..+.+...+.++.+=|+. ...++.+--....+.++.+.+|.-.+.|+.- ..-.|+|+++...... T Consensus 183 f~~v~~~~~k~~~~~~iS~ell~d---s~~~l~~~i~~~l~~~i~~~~d~~~l~G~Gt---~~p~Gil~~~~~~~~~--- 253 (392) T protein:vir:13 183 TTQRSMGGFKYGFASVVSYEFATD---QVLDLVGFLVSDAGPAIGDAMGRHFLTGTGT---GQPRGILTDATGANAA--- 253 (392) T ss_pred eeeEEeeeeeEEeeehhHHHHHhc---chHHHHHHHHHHHHHHHHHHHHHHHhcccCC---cccccccccccccccc--- Confidence 999999999999888888764442 3557888888888888899999999999742 3467999886542111 Q ss_pred cCCcccccccCCHHHHHHHHHHHHHHHHHhcCCeeccccccceEEcCHHHHHhhcc-CCCcCccHHHH-HHHhCCccEEE Q lcl|Aclame:pro 228 TPGGWVSGGANAFQGIVGDLRLMLITLRVQSEDNIDPEDVDITLVLPMNKVDMLSV-VTDLGISVRDW-LKQTYPRVRVM 305 (388) Q Consensus 228 ~~~~~t~Wa~kT~~eI~~DI~~~~~~l~~~s~g~v~~~~~p~tL~Lp~~~~~~Ls~-~~~~~~Tvl~~-lk~n~pnl~i~ 305 (388) ..|++.+ .-.++||.+++..|...-. ....++|.+..+..|.. .+..|.-++.= +...-| -+|- T Consensus 254 -----~~~~~~~-~~~~d~l~~~~~~l~~~~~-------~~a~~v~n~~~~~~l~~lkd~~G~~l~~~~~~~g~~-~~l~ 319 (392) T protein:vir:13 254 -----FGEADAD-SKVSDALIDLFHEVPSAYR-------KNAKFVVNDLRAAQMRKLKDANGQYLWQSALTVGAP-DTFN 319 (392) T ss_pred -----ccccccc-cccHHHHHHHHHhhhhhhh-------cCCEEEEcHHHHHHHHHhhccCCceeecCCcCCCCC-ceec Confidence 1111111 1125566667666643321 12358899998888854 33334322210 000001 1222 Q ss_pred EccccccccCCCCccEEEEEEcccccccccccCCCcceEeecchhhhccCceeccCceEEecccceeeeeeeccccceee Q lcl|Aclame:pro 306 SAPELQGGNPDDGKDIAYMFLDSVDTAVDGSTDGGDTWAQLVQSKFVTLGVEKRVKNYVEAYSNATAGVMLKRPWAVVRL 385 (388) Q Consensus 306 ~~pel~~a~gtg~~~~~~~~~~~~d~~~~~~~~~~~t~~~~~p~~~r~~~v~~~~~~~~~~~~~~t~G~ii~rP~ai~~~ 385 (388) ..|=....+ -. .+. ++|. +......... +.-++... .+.+ ...-...+-+..|.+|. +.+|-||+.+ T Consensus 320 G~Pv~~~~~-~~-~~~-i~~G-df~~~~i~~~-~~~~i~~~-~~~~------~~~~~~~~r~~~r~d~~-~~~~~A~~~~ 386 (392) T protein:vir:13 320 GKVVETDDG-MP-ADK-VLFA-DLSKYRVRFA-GSLRVDRS-VDAK------FSTDQIVYRFLQRADGL-LVDARGAKVL 386 (392) T ss_pred ceeeEEcCC-CC-CCc-EEEe-eccceeEEee-cceEEEee-cccc------ccCCcEEEEEEEEeccE-EecccceEEE Confidence 222221110 01 111 1221 1111001000 01111110 0100 11112233444555544 6669998866 Q ss_pred ccC Q lcl|Aclame:pro 386 IGL 388 (388) Q Consensus 386 ~GI 388 (388) ..- T Consensus 387 ~~~ 389 (392) T protein:vir:13 387 TVT 389 (392) T ss_pred Eee Confidence 655 No 91 >protein:vir:102655 Length: 322 # NCBI annotation: Hypothetical protein # Family: family:all:6384 # MgeID: mge:1624 # MgeName: VP2 # Cross-refs: genbank:acc:YP_052979;genbank:gi:50282923;genbank:GeneID:2948122 Probab=84.58 E-value=0.059 Score=27.31 Aligned_cols=282 Identities=11% Similarity=0.027 Sum_probs=109.4 Q ss_pred hccCccccc-ccccccchHHHHHHHhhcceeeeecccchhhhhhccccc------CCC-Cceee-EEEeeeccccceE-- Q lcl|Aclame:pro 65 QAFDSAYVA-PTTQASIPTPIQFLQQWLPGFVKVLTSARKIDEILGVKT------VGS-WEDQE-IVQGIVEPAGTAM-- 133 (388) Q Consensus 65 ~amDaa~~~-~~t~~~~g~l~~~l~~idp~v~e~l~~~~~~~~i~~v~t------~g~-w~~~t-~~~~v~e~~G~a~-- 133 (388) |++.+-..+ +.+.. .+...|.+.+..++ +.+++... ..|-+..+ +++ |.... ..++++ |+.. T Consensus 1 ~~~~~~~~~~~~Ms~--~i~~~fv~qy~~~v-~~~~qq~~-s~L~~tV~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~ 73 (322) T protein:vir:10 1 MKLNAIMSMLPLIAG--DIDQAFVQTYETTL-RILSQQKS-AKLKQYCQHKNESSESHNWETLASMDPDAV---KRKRSR 73 (322) T ss_pred Ccccceeeeeeeeec--hhhhHHHHHHHHHH-HHHHHHhh-hhhhcccccccccccccceeeccccccccc---cccccc Confidence 455544433 12111 13333333332222 22222222 22222211 111 11111 122222 2222 Q ss_pred -eccccc-CCceeeeeeeeeeeeEEEEEEEEeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEE---eecCccc Q lcl|Aclame:pro 134 -EYGDLT-NIPLSSWNVNFERRTIVRGEMGIQVGLLEEGRASAMRINSAEVKRQGAAVQLEIMRNAIGFY---GWEGKNG 208 (388) Q Consensus 134 -~ygd~~-diP~~~~n~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~K~~aAr~a~~~~~n~i~~~---G~a~~~~ 208 (388) ..+|.. |+|..........-...-+..+..+..+++. ++..+..+.-.+++..|++++.|++.+- |.+. T Consensus 74 ~~~~d~~~dtp~~~~~~~~r~~~~~d~~~~~~VDd~D~~---k~~~D~~~~~~~~~a~AL~R~~D~~I~~a~~g~a~--- 147 (322) T protein:vir:10 74 QQSADGTYPTPVNNKPFAKRRTNVDTYDTGHVVEQEDIS---QMLLDPNSALITSQAYAMARKTDDLIIAGAWKPAS--- 147 (322) T ss_pred ccccCcccCCCccccccceEEEeecccccceecchHHHH---HhhcCchHHHHHHHHHHhhhHHHHHHHhhhhcccc--- Confidence 223422 6776665554444555555555555555543 3455777777778888888888875543 3321 Q ss_pred cceEEEeecCCCcc--ccccccCCcccccccCCHHHHHHHHHHHHHHHHHhcCCeeccccccceEEcCHHHHHhhccCC- Q lcl|Aclame:pro 209 NRTFGFLNDPSLLP--AIASTTPGGWVSGGANAFQGIVGDLRLMLITLRVQSEDNIDPEDVDITLVLPMNKVDMLSVVT- 285 (388) Q Consensus 209 ~g~~GllN~P~l~a--~~~~~~~~~~t~Wa~kT~~eI~~DI~~~~~~l~~~s~g~v~~~~~p~tL~Lp~~~~~~Ls~~~- 285 (388) .| .++.+. +.+.....+ +..--++.|..+...+.... + |++.+-.++++|+++..|-.-+ T Consensus 148 ---~~---~~gt~v~~~ss~~i~~g-------~~g~t~~kl~~a~~~l~~~d---v-p~d~~R~~vv~p~~~~~LL~d~~ 210 (322) T protein:vir:10 148 ---IK---GTGQPVEFLATQEIGDG-------TKPISFDYVTEITERFLENE---I-EPEVSKVIVIGPTQARKLLQITE 210 (322) T ss_pred ---cc---ccccccccCCCcccccC-------ccchhHHHHHHHHHHHHhcC---C-CCCCCeEEEeCHHHHHHHhcchh Confidence 11 111111 011101111 11112334555555555443 2 3333446899999988874321 Q ss_pred --CcCccHHHHHHHh--------CCccEEEEccccc------cccCCCC--ccEEEEEEcccccccccccCCCcceEeec Q lcl|Aclame:pro 286 --DLGISVRDWLKQT--------YPRVRVMSAPELQ------GGNPDDG--KDIAYMFLDSVDTAVDGSTDGGDTWAQLV 347 (388) Q Consensus 286 --~~~~Tvl~~lk~n--------~pnl~i~~~pel~------~a~gtg~--~~~~~~~~~~~d~~~~~~~~~~~t~~~~~ 347 (388) .....--+.|..+ |..+.-..+|.-. +..+..+ ....++|.++= ..-....+-.+-..-. T Consensus 211 ~ts~D~~~~~~l~~~G~ig~~lGf~~i~s~~lp~~~~t~~~~~~~~~~~~~~~~~~a~~k~A--v~~a~~~dv~~~i~~~ 288 (322) T protein:vir:10 211 ATSADYTSAMDLQSKGIITNWMGYTWIVSTRLDKFDPTQWGMAAEDGPQGDEIWCIAMTDMA--LGYHSCKDIWTKVAED 288 (322) T ss_pred hhhhhcccchhhhhcCeeeeeeeEEEEEeccCCccccccccccccCCCCccceeEEEEecCc--eeEEEeeeeeEEeecc Confidence 1111112333222 1112222333111 1111122 23334554431 1011000011111112 Q ss_pred chhhhccCceeccCceEEecccceeeeeeeccccceeeccC Q lcl|Aclame:pro 348 QSKFVTLGVEKRVKNYVEAYSNATAGVMLKRPWAVVRLIGL 388 (388) Q Consensus 348 p~~~r~~~v~~~~~~~~~~~~~~t~G~ii~rP~ai~~~~GI 388 (388) |.+.+.+.+ ... ...|+.+-+|-.|+.+.=- T Consensus 289 ~~~~~a~~I---------~~~-~~~Ga~ri~~~gVv~i~~~ 319 (322) T protein:vir:10 289 PSASFAWRI---------YSA-FTADCVRVEDEHIFKLRLK 319 (322) T ss_pred CCcchhhhh---------hhh-hhhCceEeccCcEEEEEEe Confidence 333332222 111 2345555567776665544 No 92 >protein:vir:6242 Length: 390 # NCBI annotation: gp36 # Family: family:all:21 # MgeID: mge:131 # MgeName: phi-BT1 # Cross-refs: genbank:acc:NP_813696;swissprot:trembl:q859c1;genbank:gi:29366756;interpro:IPR006444;uniprot:Q859C1;genbank:GeneID:1258897 Probab=83.08 E-value=0.071 Score=26.87 Aligned_cols=334 Identities=11% Similarity=0.013 Sum_probs=133.3 Q ss_pred CCCcceeeeecCccccchhhh---h----hcccccccccCCHHHHhhcceecccchhhcchhhhhhhhhhhhccCccccc Q lcl|Aclame:pro 1 MKQLSKVHQSLAGRSVRAFDM---A----NGKADYRLTDMAVRELKKFGLVFDHATVKRQIELLHEGGVATQAFDSAYVA 73 (388) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~---~----~~~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~amDaa~~~ 73 (388) +.++.+.--.|. |.....+. . +........+..... ...............+....+....+ T Consensus 44 ~~e~~~l~~~i~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~r~~~~~~~r~~~~~~~~~~----- 112 (390) T protein:vir:62 44 ITAVSDYDARIK-RGIEAIKAIDPVTSLLSGLQGSGSGAQRSAD-----VDDDATLRAGNLGEARSFEFAPEKRD----- 112 (390) T ss_pred HHHHHHHHHHHH-HHHHHHHHHHHHHHHHhhcccccccchhhcc-----hHHHHHHhhhhhhhhHHHHhhhhhhc----- Confidence 000000000000 00000000 0 000000000000000 00000000000000111111111111 Q ss_pred ccccccchH-HHHHHHhhcceeeeecccchhhhhhcccccCCCCceeeEEEeeeccccceEecccccCCceeeeeeeeee Q lcl|Aclame:pro 74 PTTQASIPT-PIQFLQQWLPGFVKVLTSARKIDEILGVKTVGSWEDQEIVQGIVEPAGTAMEYGDLTNIPLSSWNVNFER 152 (388) Q Consensus 74 ~~t~~~~g~-l~~~l~~idp~v~e~l~~~~~~~~i~~v~t~g~w~~~t~~~~v~e~~G~a~~ygd~~diP~~~~n~~~~~ 152 (388) ..+.++.++ +-.+.+ ..|.+.+-...-.+.+..+.+... ...+.+++....+.+...+....+|-.+....... T Consensus 113 ~t~~~~g~~~~~~~~~---~~i~~~~~~~~~l~~~~~~~~~~~--~~~~~~p~~~~~~~a~wv~E~~~~~~~~~~f~~i~ 187 (390) T protein:vir:62 113 GTKAGNPNVLSRTLYG---QLIAQAVERSAIMRGGATTFTTSD--ANPLDFTVITGRSSASIVGETAEIPESYPATAQRS 187 (390) T ss_pred ccccCCCccccccchH---HHHHHHHhhhhhhhhcceeeecCC--CceeEEEEEcCCcceeeecccccccccccceeeeE Confidence 122223332 211222 112222211111222222211111 12345777777777778888899999999999999 Q ss_pred eeEEEEEEEEeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEEeecCccccceEEEeecCCCccccccccCCcc Q lcl|Aclame:pro 153 RTIVRGEMGIQVGLLEEGRASAMRINSAEVKRQGAAVQLEIMRNAIGFYGWEGKNGNRTFGFLNDPSLLPAIASTTPGGW 232 (388) Q Consensus 153 ~~v~~~~~~~~y~~~El~~A~~~g~~l~~~K~~aAr~a~~~~~n~i~~~G~a~~~~~g~~GllN~P~l~a~~~~~~~~~~ 232 (388) -..+.+...+.++.+=|+. ..+++.+.-....+.++...+|+-.+.|+.. -.|++|+++........+. T Consensus 188 ~~~~k~~~~~~iS~ell~d---s~~~l~~~i~~~l~~~i~~~~d~~~l~G~G~-----p~Gi~~~~~~~~~~~~~~~--- 256 (390) T protein:vir:62 188 MGGFKYGFASVVSYEFATD---QVLDLVGFLVSDAGPAIGDAMGRHFITGTGQ-----PRGILTDASPATATFLATD--- 256 (390) T ss_pred eeeeeEEeehHHHHHHHhh---hhHHHHHHHHHHHHHHHHHHHHhhhhccCCc-----cccccccccccccceeccc--- Confidence 9999999988888765443 4557888888888999999999999999742 2489998765322111110 Q ss_pred cccccCCHHHHHHHHHHHHHHHHHhcCCeeccccccceEEcCHHHHHhhccC-CCcCccHHH-HHHHhCCccEEEEcccc Q lcl|Aclame:pro 233 VSGGANAFQGIVGDLRLMLITLRVQSEDNIDPEDVDITLVLPMNKVDMLSVV-TDLGISVRD-WLKQTYPRVRVMSAPEL 310 (388) Q Consensus 233 t~Wa~kT~~eI~~DI~~~~~~l~~~s~g~v~~~~~p~tL~Lp~~~~~~Ls~~-~~~~~Tvl~-~lk~n~pnl~i~~~pel 310 (388) -...+ ++||.+++..|...-. . --.++|.++.+..|.+- +..|.=++. -+...-| -+|-..|=. T Consensus 257 --~~~~~----~~~l~~~~~~l~~~~~----~---~a~~vmn~~~~~~L~~lkd~~g~~l~~~~~~~g~~-~~l~G~Pv~ 322 (390) T protein:vir:62 257 --TDSKV----SDALIDLFHEVPSAYR----A---NAKYVVNDLRAAQMRKLKDANGQYLWQSGLTVGAP-SLFNGKVVE 322 (390) T ss_pred --ccccc----hHHHHHHHHhhhhhhh----c---CCEEEEchHHHHHHHHhhccCCCeeecCCcCCCcc-ceecccceE Confidence 01233 4555666665543211 1 11578999988888542 222321210 0110011 012111111 Q ss_pred ccccCCCCccEEEEEEcccccccccccCCCcceEeecchhhhccCceeccCceEEecccceeeeeeeccccceeeccC Q lcl|Aclame:pro 311 QGGNPDDGKDIAYMFLDSVDTAVDGSTDGGDTWAQLVQSKFVTLGVEKRVKNYVEAYSNATAGVMLKRPWAVVRLIGL 388 (388) Q Consensus 311 ~~a~gtg~~~~~~~~~~~~d~~~~~~~~~~~t~~~~~p~~~r~~~v~~~~~~~~~~~~~~t~G~ii~rP~ai~~~~GI 388 (388) ...+ ..... ++|. +........ ...-.+....-..|+ .-...+....|.+| .+.+|-||+.+..= T Consensus 323 ~~~~-~p~~~--i~~g-d~s~~~i~~-~~~~~v~~~~~~~~~-------~~~~~~~~~~r~d~-~~~~~~A~~~l~~~ 387 (390) T protein:vir:62 323 TDDG-MPADK--ILFA-DLSKYRVRF-AGSLRVDRSVDAKFS-------TDQIVYRFLQRADG-LLVDARGAKVLTVT 387 (390) T ss_pred EecC-CCCcc--EEEe-eccceeEEe-ecceEEEeecccccc-------CCcEEEEEEEEeCc-EeechhheEEEEee Confidence 1000 01111 1121 110000000 000000000000011 11122334445554 56669998777744 No 93 >protein:vir:102119 Length: 404 # NCBI annotation: phage major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1641 # MgeName: phiSM101 # Cross-refs: genbank:acc:YP_699941;genbank:gi:110804052;genbank:GeneID:4206662 Probab=82.99 E-value=0.072 Score=26.84 Aligned_cols=329 Identities=8% Similarity=-0.034 Sum_probs=136.1 Q ss_pred CCCcceee---------------eecCccccchhhhhhcccccccccCCHHHHhhcceecccchhhcchhhhhhhhhhhh Q lcl|Aclame:pro 1 MKQLSKVH---------------QSLAGRSVRAFDMANGKADYRLTDMAVRELKKFGLVFDHATVKRQIELLHEGGVATQ 65 (388) Q Consensus 1 ~~~~~~~~---------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~ 65 (388) +.|+...+ ....++.......... .++.++-.+ .............. T Consensus 44 ~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------------~~~~~~~~~------~~~~~~~~~~~~~~ 105 (404) T protein:vir:10 44 QAKIEAQKRKENIENNFNEDNVKSLNTGKEENVIYNGAL------------FVRAIADNL------LKQKNQRGLNLSEK 105 (404) T ss_pred HHHHHHHHHHHHHHHHHhhhhccccccccchhhHHHHHH------------HHHHHHHHH------HHHHHhhhhcchhh Confidence 11111000 0001111100000000 000000000 00000000000000 Q ss_pred ccCcccccccccccch--HHHHHHHhhcceeeeecccchhhhhhcccccCCCCceeeEEEeeeccccceEecccccCCce Q lcl|Aclame:pro 66 AFDSAYVAPTTQASIP--TPIQFLQQWLPGFVKVLTSARKIDEILGVKTVGSWEDQEIVQGIVEPAGTAMEYGDLTNIPL 143 (388) Q Consensus 66 amDaa~~~~~t~~~~g--~l~~~l~~idp~v~e~l~~~~~~~~i~~v~t~g~w~~~t~~~~v~e~~G~a~~ygd~~diP~ 143 (388) ...+. ...+.++.| +|..+ .++|++.+.......+++++.....-. ..+.+........+...+....+|. T Consensus 106 e~~a~--~~~~~~~gg~~vP~~~----~~~ii~~~~~~~~l~~l~~~~~~~~~~-g~~~~~~~~~~~~~~~v~e~~~~~~ 178 (404) T protein:vir:10 106 EINAI--SENIDEDGGYAVPEDI----QTKINTRLKDTTDLYNMVDYEPVFTRS-GSRTYEKRSKQKPMKPLSENQQIPT 178 (404) T ss_pred HHhhh--ccccCCCCceeechhH----HHHHHHHHhhhhhHhhhhceeeccCCc-cceEEEEecCCcceeeccccccccc Confidence 11111 011223333 34333 346666666666666666554332111 1233444444445556666667776 Q ss_pred e--eeeeeeeeeeEEEEEEEEeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEEeecCccccceEEEeecCCCc Q lcl|Aclame:pro 144 S--SWNVNFERRTIVRGEMGIQVGLLEEGRASAMRINSAEVKRQGAAVQLEIMRNAIGFYGWEGKNGNRTFGFLNDPSLL 221 (388) Q Consensus 144 ~--~~n~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~K~~aAr~a~~~~~n~i~~~G~a~~~~~g~~GllN~P~l~ 221 (388) . +..........+.++..+.++.+=+. ....+|.+.-....++++...+|+-.++|+... ....|+++.+.+. T Consensus 179 ~~~~~~f~~i~~~~~k~~~~~~iS~ell~---ds~~~l~~~i~~~la~~~~~~~~~~il~G~g~~--~~~~gi~~~~~~~ 253 (404) T protein:vir:10 179 NGDNGKLERFNFKLKDLADFMSIPNDLLK---FADKSLEDWIINWFVDKVRITRNAEILYGAGGD--EHATGIMTANKFK 253 (404) T ss_pred cccccceeeeEeeheeeEeeehhhHHHHh---hcHHHHHHHHHHHHHHHHHHHHHHHHhhcCCCC--Ccccceeeccccc Confidence 4 34556667778888888888884332 233578888888888888899999999997532 3577888877653 Q ss_pred cccccccCCcccccccCCHHHHHHHHHHHHHHHHHhcCCeeccccccceEEcCHHHHHhhccC-CCcCccHHH-HHHHhC Q lcl|Aclame:pro 222 PAIASTTPGGWVSGGANAFQGIVGDLRLMLITLRVQSEDNIDPEDVDITLVLPMNKVDMLSVV-TDLGISVRD-WLKQTY 299 (388) Q Consensus 222 a~~~~~~~~~~t~Wa~kT~~eI~~DI~~~~~~l~~~s~g~v~~~~~p~tL~Lp~~~~~~Ls~~-~~~~~Tvl~-~lk~n~ 299 (388) +.... ...+ ++|+..+++.... .+ +. ....++|.+..+..|.+. +..|.-++. -+.... T Consensus 254 ~~~~~---------~~~~----~~~~~~~~~~~l~-~~--~~---~~~~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~~~ 314 (404) T protein:vir:10 254 KITLP---------KSPA----LKDFKKCKNVELL-NV--FK---ATSSWIVNQDGFNYLDSLEDKTGRPYLQPDPKDPT 314 (404) T ss_pred eeecc---------cccc----HHHHHHHHHhhhh-cc--cc---CCCEEEEcHHHHHHHHHhhccCCceeeccCcCCCC Confidence 22111 1112 4455555543222 11 11 123688999998888543 333432321 011111 Q ss_pred C----ccEEEEccccccccCCCCccEEEEEEcccccccccccCCCcceEeec-chhhhccCceeccCceEEecccceeee Q lcl|Aclame:pro 300 P----RVRVMSAPELQGGNPDDGKDIAYMFLDSVDTAVDGSTDGGDTWAQLV-QSKFVTLGVEKRVKNYVEAYSNATAGV 374 (388) Q Consensus 300 p----nl~i~~~pel~~a~gtg~~~~~~~~~~~~d~~~~~~~~~~~t~~~~~-p~~~r~~~v~~~~~~~~~~~~~~t~G~ 374 (388) + +.-++.++..... + ++++..++|.+-.+........ .++..+ +..+. ....-....-+..|. |+ T Consensus 315 ~~~l~G~PV~~~~~~~~~-~-~~~~~~~~~gd~s~~~~~~~~~---~~~i~~~~~~~~----~~~~~~~~~~~~~r~-d~ 384 (404) T protein:vir:10 315 QYRFLGLPVIELPNDLLL-S-TESAIPVLLGDTKEAYKYVSDG---AYELATTNIGAG----AFETNTTKARIIMRI-DG 384 (404) T ss_pred CccccceeeEEecccccC-C-CCCccEEEEEeccccEEEEEec---ceEEEEeccccc----hhhcCceEEEEEEee-cc Confidence 1 1223333322111 1 2233333443211100010000 011110 00000 000001112233333 55 Q ss_pred eeeccccceeeccC Q lcl|Aclame:pro 375 MLKRPWAVVRLIGL 388 (388) Q Consensus 375 ii~rP~ai~~~~GI 388 (388) .+.+|-||+.+.=- T Consensus 385 ~v~~~~a~~~~~~~ 398 (404) T protein:vir:10 385 NVKDSEALLIAEIP 398 (404) T ss_pred EEecccceEEEEee Confidence 67778888866644 No 94 >protein:vir:6324 Length: 335 # NCBI annotation: capsid protein # Family: family:all:2806 # MgeID: mge:132 # MgeName: phiKMV # Cross-refs: genbank:acc:NP_877471;genbank:gi:33300843;uniprot:Q7Y2D3;genbank:GeneID:1482613 Probab=81.28 E-value=0.073 Score=26.80 Aligned_cols=286 Identities=13% Similarity=0.078 Sum_probs=117.9 Q ss_pred cccCCHHHHhhcceecccchhhcchhhhhhhhhhhhccCcccccccccccchHHHHHHHhhcceeeeecccchhhhhhcc Q lcl|Aclame:pro 30 LTDMAVRELKKFGLVFDHATVKRQIELLHEGGVATQAFDSAYVAPTTQASIPTPIQFLQQWLPGFVKVLTSARKIDEILG 109 (388) Q Consensus 30 ~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~amDaa~~~~~t~~~~g~l~~~l~~idp~v~e~l~~~~~~~~i~~ 109 (388) |+++ +.|-| .+.+..+= -..-||+-+.-+|.+....-.+.+++.. T Consensus 1 ms~~--~~~tr-------------------~~~~~s~~--------------d~al~le~f~geV~~af~~~s~~~~~~~ 45 (335) T protein:vir:63 1 MSFL--NDLTR-------------------PNYAGKNA--------------DVDIHLEEHLGIVDKHFAYTSKFAPLMN 45 (335) T ss_pred CCCc--ccchh-------------------hhcccccc--------------hhheehhhhhhhHHHHHHhhhhhccccc Confidence 1111 11111 11111111 2223455555555444444445556666 Q ss_pred cccCCCCceeeEEEeeeccccceEec----ccc-cCCceeeeeeeeeeeeE--EEEEEEEeecHHHHHHHHHhCCChHHH Q lcl|Aclame:pro 110 VKTVGSWEDQEIVQGIVEPAGTAMEY----GDL-TNIPLSSWNVNFERRTI--VRGEMGIQVGLLEEGRASAMRINSAEV 182 (388) Q Consensus 110 v~t~g~w~~~t~~~~v~e~~G~a~~y----gd~-~diP~~~~n~~~~~~~v--~~~~~~~~y~~~El~~A~~~g~~l~~~ 182 (388) +.+.- ...++.|+.+ |+.... |.. +..|... ++....| ..+.-.+=|.+.|. ++..++-++ T Consensus 46 ~rti~--~g~s~~~~~i---G~~~~~~~~pG~~l~~~~~~~---~k~~itVD~ll~a~~~I~dlDe~----~~~yDvRse 113 (335) T protein:vir:63 46 IRDLR--GSNVVRLDRL---GNVEAKGRRAGEELERSRVVN---DKWNLTVDTLLYLRHQFDHQDEW----TQSFDMRKE 113 (335) T ss_pred eeeec--cceeEEEeee---eeeeeecccCCcCcCCCCccc---cceEEEecceeechhhhhhHHHH----hcCchhHHH Confidence 55421 1255556554 666655 221 2223211 2211111 11222222333333 344455555 Q ss_pred HHHHHHHHHHHhhceEEE------EeecCccccceEEEeecCCCccccccccCCcccccccCCHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 183 KRQGAAVQLEIMRNAIGF------YGWEGKNGNRTFGFLNDPSLLPAIASTTPGGWVSGGANAFQGIVGDLRLMLITLRV 256 (388) Q Consensus 183 K~~aAr~a~~~~~n~i~~------~G~a~~~~~g~~GllN~P~l~a~~~~~~~~~~t~Wa~kT~~eI~~DI~~~~~~l~~ 256 (388) -....-.++.++.|+..+ .+... .....|.++ |++...+..++. -+...++.+.+=+..+..++.. T Consensus 114 ~s~e~G~aLA~~~D~~~~~~i~~aa~~~a--~~~~~~~~~-~G~~~~~~~tg~-----~~~~~~~~l~~a~~~a~~~L~e 185 (335) T protein:vir:63 114 VAELDGQELARKFDQACLIQVIKAAAMDA--PVDLEDAFS-PGVLEKLDLTGL-----TAKQAADKIVRMHRRVVETFID 185 (335) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhhccccC--ccccCCCcC-CCcceeeeeccC-----cccccHHHHHHHHHHHHHHHHh Confidence 555556666666665432 11110 012223333 233222222221 1223578887777777777775 Q ss_pred hcCCeeccc--cccceEEcCHHHHHhhccC-----CCcCcc--HHHHHHHh---CCccEEEEccccccccCCC----C-- Q lcl|Aclame:pro 257 QSEDNIDPE--DVDITLVLPMNKVDMLSVV-----TDLGIS--VRDWLKQT---YPRVRVMSAPELQGGNPDD----G-- 318 (388) Q Consensus 257 ~s~g~v~~~--~~p~tL~Lp~~~~~~Ls~~-----~~~~~T--vl~~lk~n---~pnl~i~~~pel~~a~gtg----~-- 318 (388) +- + |+ ..+...+++|.+|..|-.- ++|+.+ .-.+.+.. --+++|...+.|-..++++ . T Consensus 186 ~d---V-P~~~~~dr~~vv~P~~y~~Ll~~~~l~n~~~~~s~~~~~~~~g~v~~v~Gv~V~~sn~lP~~~~t~~~lg~a~ 261 (335) T protein:vir:63 186 RD---L-GDAVYSEGLTPMSPRVFSLLLEHDKLMNVEYQATGATNDYVKSRVAILNGVKVLETPRFATKAIAAHPLGRHF 261 (335) T ss_pred cc---C-CCcccCceEEEeChHHHHHHhccccccccccccccccccccCceeEEeeceEEEeeccCCCCCcccccccccC Confidence 53 1 21 1235789999999988532 222211 11122111 1135666666663222221 0 Q ss_pred --------ccEEEEEEcccccccccccCCCcceEe-ecchhhhccCceeccCceEEecccceeeeeeecc--ccceeecc Q lcl|Aclame:pro 319 --------KDIAYMFLDSVDTAVDGSTDGGDTWAQ-LVQSKFVTLGVEKRVKNYVEAYSNATAGVMLKRP--WAVVRLIG 387 (388) Q Consensus 319 --------~~~~~~~~~~~d~~~~~~~~~~~t~~~-~~p~~~r~~~v~~~~~~~~~~~~~~t~G~ii~rP--~ai~~~~G 387 (388) ..+.++|-++- . -+.++ .+..++.. +.+...|.+++-.. .|+-++|| .++...+| T Consensus 262 n~~~~d~~~~~~~~~~~~A--l--------~t~~~~~vt~e~~~---~~~~~~~~i~~~~a-~G~g~lRPe~a~~i~~tg 327 (335) T protein:vir:63 262 NVSAEESERQIALFLPSKT--L--------ITAQVAPVQAKLWE---DNEKFSWVLDTFQM-YNIGARRPDTAGAIELKG 327 (335) T ss_pred CccccccceeEEEEEecce--E--------EEEEEeecccceee---ccchhhHHhHHHHH-cCCcccccceEEEEEEcC Confidence 01222221110 0 00000 00000000 11223455555443 79999999 45566789 Q ss_pred C Q lcl|Aclame:pro 388 L 388 (388) Q Consensus 388 I 388 (388) | T Consensus 328 ~ 328 (335) T protein:vir:63 328 I 328 (335) T ss_pred C Confidence 9 No 95 >protein:vir:1025 Length: 408 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:20 # MgeName: bIL286 # Cross-refs: genbank:acc:NP_076679;genbank:gi:13095788;genbank:GeneID:920362 Probab=79.05 E-value=0.11 Score=25.88 Aligned_cols=312 Identities=10% Similarity=-0.036 Sum_probs=128.9 Q ss_pred CCCcceeeeecCccccchhhhhhcccccccccCCHHHHhhcceecccchhhcchhhhhhhhhhhhccCcccccccccccc Q lcl|Aclame:pro 1 MKQLSKVHQSLAGRSVRAFDMANGKADYRLTDMAVRELKKFGLVFDHATVKRQIELLHEGGVATQAFDSAYVAPTTQASI 80 (388) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~amDaa~~~~~t~~~~ 80 (388) ....+..+.....+.-..... -...+.++ ....... .. .....++ ...+.++. T Consensus 72 ~~~~~~~~~~~~~~~~~~~~~------------~~~~~~~~---~~~~~~~-----~~--~~~~~a~-----~~~t~~~g 124 (408) T protein:vir:10 72 VNMREEEKGPLNKSENELKDK------------FVKDFVNM---VRNPMAF-----MN--TVSSKTE-----TSGSDSAA 124 (408) T ss_pred hccccccccccccchhhhHHH------------HHHHHHHH---hhcchhh-----hh--hhhhhhh-----hcccccCC Confidence 100000000000000000000 00000000 0000000 00 0011111 11123334 Q ss_pred hHHHHHHHhhcceeeeecccchhhhhhcccccCCCCceeeEEEe-eeccccceEecccccCCceee-eeeeeeeeeEEEE Q lcl|Aclame:pro 81 PTPIQFLQQWLPGFVKVLTSARKIDEILGVKTVGSWEDQEIVQG-IVEPAGTAMEYGDLTNIPLSS-WNVNFERRTIVRG 158 (388) Q Consensus 81 g~l~~~l~~idp~v~e~l~~~~~~~~i~~v~t~g~w~~~t~~~~-v~e~~G~a~~ygd~~diP~~~-~n~~~~~~~v~~~ 158 (388) |+++- +.+.++|++.+......+++..+.....-. ..+.+. ..+..+.+...+....+|-.+ ...+....+.+.+ T Consensus 125 g~~vP--~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~-~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~i~~~~~k~ 201 (408) T protein:vir:10 125 GLTIP--QDIRTMINTLVRQYDSLQQYVRVESVSTSN-GSRVYEKWTDVTPLTVMDAEDGKIPDLDNPQLTIIKYLIKRY 201 (408) T ss_pred ceecc--HhHHHHHHHHHHhhchhhhhcceeeccCCc-ceEEEeeccccccceeeecCccccccccCcceeeEEeeeeeE Confidence 43332 123356777776666666664443321111 111121 223445667788888888654 5788888899999 Q ss_pred EEEEeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEEeecCccccceEEEeecCCCccccccccCCcccccccC Q lcl|Aclame:pro 159 EMGIQVGLLEEGRASAMRINSAEVKRQGAAVQLEIMRNAIGFYGWEGKNGNRTFGFLNDPSLLPAIASTTPGGWVSGGAN 238 (388) Q Consensus 159 ~~~~~y~~~El~~A~~~g~~l~~~K~~aAr~a~~~~~n~i~~~G~a~~~~~g~~GllN~P~l~a~~~~~~~~~~t~Wa~k 238 (388) ...+.++.+=++- ...+|.+--....++++...+|+-.+.|+.... +. . ... T Consensus 202 ~~~~~iS~ell~d---s~~~l~~~i~~~l~~~~~~~~~~~il~g~g~~~------------------~~--~-----~~~ 253 (408) T protein:vir:10 202 AGIITATNTSLKD---TAENILAWLSSWIAKKVVVTRNQAIIEVMKAAP------------------KK--P-----TIA 253 (408) T ss_pred EeeehhHHHHHhh---chHHHHHHHHHHHHHHHHHHHHHHHhhcccccc------------------cc--c-----ccc Confidence 9888888854432 355778888888888888888888887764210 00 0 112 Q ss_pred CHHHHHHHHHHHHHHHHHhcCCeeccccccceEEcCHHHHHhhccC-CCcCccHHHH-HHHhCCccEEEEcccc--c--c Q lcl|Aclame:pro 239 AFQGIVGDLRLMLITLRVQSEDNIDPEDVDITLVLPMNKVDMLSVV-TDLGISVRDW-LKQTYPRVRVMSAPEL--Q--G 312 (388) Q Consensus 239 T~~eI~~DI~~~~~~l~~~s~g~v~~~~~p~tL~Lp~~~~~~Ls~~-~~~~~Tvl~~-lk~n~pnl~i~~~pel--~--~ 312 (388) +.+ ||..++....... +.. .-.++|.+..+..|.+. +..|.-+++- +....| -+|-..|=. + . T Consensus 254 ~~~----~l~~~~~~~~~~~---~~~---~a~~v~n~~~~~~l~~lkd~~G~~i~~~~~~~~~~-~~l~G~PV~~~~~~~ 322 (408) T protein:vir:10 254 KFD----DVITMINTAVDPA---IIA---TSSLLTNQSGLNKLALVKTAEGKYLLEPDPTKPNS-YLIKGKQVIVVADRW 322 (408) T ss_pred cHH----HHHHHHHHhhhhh---hcc---CCEEEEcHHHHHHHHHhhccCCceEeccCcCCCCC-ceecceeeEEecccc Confidence 333 4443332211111 111 12688999999988653 3345544321 111111 123222211 1 1 Q ss_pred ccCCCCccEEEEEEcccccccccccCCCcceEeecchhhhccCceeccCceEEecccceeeeeeeccccceeeccC Q lcl|Aclame:pro 313 GNPDDGKDIAYMFLDSVDTAVDGSTDGGDTWAQLVQSKFVTLGVEKRVKNYVEAYSNATAGVMLKRPWAVVRLIGL 388 (388) Q Consensus 313 a~gtg~~~~~~~~~~~~d~~~~~~~~~~~t~~~~~p~~~r~~~v~~~~~~~~~~~~~~t~G~ii~rP~ai~~~~GI 388 (388) ....+.++..++|.+=.+........+ -++.. -+... .....-.....+..|.+| .+++|-+|+.+..- T Consensus 323 ~~~~~~~~~~i~~gd~~~~~~~~~~~~-~~v~~-~~~~~----~~f~~~~~~~r~~~r~d~-~v~~~~a~~~~~~~ 391 (408) T protein:vir:10 323 LPNTGSTVYPLYYGDMSQAITLFDREN-MSLLP-TNIGA----GAFETDTTKIRVIDRFDV-KATDSEALVAGSFS 391 (408) T ss_pred cCccCCCceEEEEEehhccEEEEEecc-eEEEE-ccccc----chhhcCceEEEEEEeecc-EEeccccEEEEEee Confidence 111133333334432111000100000 01110 01000 000011123334444544 55669999987755 No 96 >protein:vir:105334 Length: 276 # NCBI annotation: putative phage major capsid protein # Family: family:all:522 # MgeID: mge:1679 # MgeName: PH15 # Cross-refs: genbank:acc:YP_950669;genbank:gi:119967839;genbank:GeneID:4643213 Probab=77.23 E-value=0.13 Score=25.50 Aligned_cols=253 Identities=11% Similarity=0.021 Sum_probs=124.8 Q ss_pred hccCcccccccccccchHHHHHHHhhcceeeeecccchhhhhhcccccCCC-CceeeEEEeeeccccceEecccccCCce Q lcl|Aclame:pro 65 QAFDSAYVAPTTQASIPTPIQFLQQWLPGFVKVLTSARKIDEILGVKTVGS-WEDQEIVQGIVEPAGTAMEYGDLTNIPL 143 (388) Q Consensus 65 ~amDaa~~~~~t~~~~g~l~~~l~~idp~v~e~l~~~~~~~~i~~v~t~g~-w~~~t~~~~v~e~~G~a~~ygd~~diP~ 143 (388) ||.- -.+.+++=+|--+..++..++ ..-.....|..+++... -.-.+++++.++..|.+..+++++++|. T Consensus 1 Ma~~-----~T~l~d~i~Pev~~~~v~~~~----~~~~~~~~~~~~~~~l~g~~G~ti~iP~~~~igda~~~~eg~~i~~ 71 (276) T protein:vir:10 1 MAQG-----TTTKSTQIVPEVLAPMMQAEL----DKKLRFAQFADIDSTLVGQPGDTLTFPAFVYSGDATVVPEGQKIPV 71 (276) T ss_pred CCcc-----eeehhhhhchHHHHHHHHHHH----HhhhhhcccceecccccCCCCCEEEeeeecCCCccccccCCCccCc Confidence 2210 112233335555555544433 22333344444443211 1236789999999999999999999999 Q ss_pred eeeeeeeeeeeEEEEEEEEeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEEeecCccccceEEEeecCCCccc Q lcl|Aclame:pro 144 SSWNVNFERRTIVRGEMGIQVGLLEEGRASAMRINSAEVKRQGAAVQLEIMRNAIGFYGWEGKNGNRTFGFLNDPSLLPA 223 (388) Q Consensus 144 ~~~n~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~K~~aAr~a~~~~~n~i~~~G~a~~~~~g~~GllN~P~l~a~ 223 (388) ..+........+..++-++.++.++ .. ..+.+.-..-...+..++...+++..+ . .++.-.. T Consensus 72 ~~lt~~~~~a~i~~~~k~~~~tD~a--~~-~~~~dp~~~~~~~~~~~~a~~~d~~~~-~-----------~l~~~~~--- 133 (276) T protein:vir:10 72 DKIETNRREAKIHKIGKGTDITDEA--LL-SGYGDPQGEAVRQHGLAIANKVDNDVL-E-----------ALRGTKL--- 133 (276) T ss_pred cccccceeeEEeehccccccccHHH--HH-hhccchHHHHHHHHHHHHHHHHHHHHH-H-----------HHhcccc--- Confidence 9999999888888876666655543 33 335566677777777777777775432 1 1111000 Q ss_pred cccccCCcccccccCCHHHHHHHHHHHHHHHHHhcCCeeccccccceEEcCHHHHHhhccC--------CCcCccHHHHH Q lcl|Aclame:pro 224 IASTTPGGWVSGGANAFQGIVGDLRLMLITLRVQSEDNIDPEDVDITLVLPMNKVDMLSVV--------TDLGISVRDWL 295 (388) Q Consensus 224 ~~~~~~~~~t~Wa~kT~~eI~~DI~~~~~~l~~~s~g~v~~~~~p~tL~Lp~~~~~~Ls~~--------~~~~~Tvl~~l 295 (388) +. .-..-| ++.|..++..+-.+ ...+..|++.|..+..|.+- +++|.. .+ T Consensus 134 ---~~-----~~~~~t----~d~i~~A~~~lgd~-------~~~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~---~~ 191 (276) T protein:vir:10 134 ---TV-----SADIGT----LAGLEAAIDTFDDE-------DLEPMVLFINPKDAGKLRSSASDNFTRATELGDN---II 191 (276) T ss_pred ---cc-----cccccC----HHHHHHHHHHhccc-------cCcccEEEEcHHHHHHHHHhcccccccccccccc---ce Confidence 00 001112 34444454444322 12456899999999888432 111111 11 Q ss_pred H----HhCCccEEEEccccccccCCCCccEEEEEEcccccccccccCCCc-ceEeecchhhhccCceeccCceEEecc-c Q lcl|Aclame:pro 296 K----QTYPRVRVMSAPELQGGNPDDGKDIAYMFLDSVDTAVDGSTDGGD-TWAQLVQSKFVTLGVEKRVKNYVEAYS-N 369 (388) Q Consensus 296 k----~n~pnl~i~~~pel~~a~gtg~~~~~~~~~~~~d~~~~~~~~~~~-t~~~~~p~~~r~~~v~~~~~~~~~~~~-~ 369 (388) . -.|-+++|+..+.+. ....+++.+.- . .....+. +.| ..| ......-..+ - T Consensus 192 ~~G~ig~~~G~~Vi~s~~~p-------~~t~~l~~~gA--i--~~~~~~~~~vE-----~dR------d~~~~~d~i~~~ 249 (276) T protein:vir:10 192 VKGAFGEALGAVIVRSKKLD-------EGEAILAKRGA--V--KLITKRDFFLE-----TDR------DPSTKTTALYSD 249 (276) T ss_pred eccccceecceeEEEcCCCC-------cceEEEEeccc--e--eeeecCCceee-----ccc------chhhcccEEEEe Confidence 0 012245555443331 12233433210 0 0000000 001 011 0001111111 1 Q ss_pred ceeeeeeeccccceeeccC Q lcl|Aclame:pro 370 ATAGVMLKRPWAVVRLIGL 388 (388) Q Consensus 370 ~t~G~ii~rP~ai~~~~GI 388 (388) ..+|+-+++|..++.+.=- T Consensus 250 ~~y~~~~~~~~~vv~~t~~ 268 (276) T protein:vir:10 250 KHYVAYLYDESKAVKVTKG 268 (276) T ss_pred eEEEEEEEcCcceEEEecC Confidence 3678888888877776633 No 97 >protein:vir:96262 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1612 # MgeName: ROSA # Cross-refs: genbank:acc:YP_240311;genbank:gi:66395978;genbank:GeneID:5133339 Probab=77.16 E-value=0.13 Score=25.48 Aligned_cols=256 Identities=11% Similarity=0.022 Sum_probs=123.8 Q ss_pred hccCcccccccccccchHHHHHHHhhcceeeeecccchhhhhhcccccC--CCCceeeEEEeeeccccceEecccccCCc Q lcl|Aclame:pro 65 QAFDSAYVAPTTQASIPTPIQFLQQWLPGFVKVLTSARKIDEILGVKTV--GSWEDQEIVQGIVEPAGTAMEYGDLTNIP 142 (388) Q Consensus 65 ~amDaa~~~~~t~~~~g~l~~~l~~idp~v~e~l~~~~~~~~i~~v~t~--g~w~~~t~~~~v~e~~G~a~~ygd~~diP 142 (388) || .+. ...+++=+|-.+..++.-++ ..-+....|..++.. |.- -.+++++.+...|.+.-|.++++++ T Consensus 1 m~--~~~---T~l~d~i~Pev~~~~v~~~~----~~~l~~~~~~~~~~~l~g~~-G~tv~iP~~~~ig~a~~~~~g~~i~ 70 (274) T protein:vir:96 1 MA--QGM---TKLTNQIVPEVLAPMMQAEL----EKKLRFASFAEIDNTLVGQP-GDTLTFPAFIYSGDAKVVAEGEKIP 70 (274) T ss_pred CC--cce---eehhheechHHHHHHHHHHH----HhhhhccccceecccccCCC-CCEEEeeeecCCCccccccCCCccc Confidence 12 111 12334445655555554333 333333444333322 222 3688999999999999999999999 Q ss_pred eeeeeeeeeeeeEEEEEEEEeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEEeecCccccceEEEeecCCCcc Q lcl|Aclame:pro 143 LSSWNVNFERRTIVRGEMGIQVGLLEEGRASAMRINSAEVKRQGAAVQLEIMRNAIGFYGWEGKNGNRTFGFLNDPSLLP 222 (388) Q Consensus 143 ~~~~n~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~K~~aAr~a~~~~~n~i~~~G~a~~~~~g~~GllN~P~l~a 222 (388) .-..........+...+-++.+. ++.+.+ .+-++..+-...+..++...+++..+ + .++.... T Consensus 71 ~~~lt~~~~~~~i~~~~~a~~i~--D~~~~~-~~~d~~~~~~~~~~~~~a~~vd~~i~---~---------~l~~a~~-- 133 (274) T protein:vir:96 71 TDILETKKREAKIRKIAKGTSIS--DEALLS-GYGDPQGEQVRQHGLAHANKVDDDVL---E---------ALKSAKL-- 133 (274) T ss_pred hhhcccceeEEEeeeeecceeeh--HHHHhh-ccchHHHHHHHHHHHHHHHHHHHHHH---H---------HHhcccc-- Confidence 99999988888887766555555 454443 34466777777777888777776433 1 0110000 Q ss_pred ccccccCCcccccccCCHHHHHHHHHHHHHHHHHhcCCeeccccccceEEcCHHHHHhhccCC--------CcCccHHH- Q lcl|Aclame:pro 223 AIASTTPGGWVSGGANAFQGIVGDLRLMLITLRVQSEDNIDPEDVDITLVLPMNKVDMLSVVT--------DLGISVRD- 293 (388) Q Consensus 223 ~~~~~~~~~~t~Wa~kT~~eI~~DI~~~~~~l~~~s~g~v~~~~~p~tL~Lp~~~~~~Ls~~~--------~~~~Tvl~- 293 (388) +.. -..-+ ++.|..++..+-.. ...+..|+++|..+..|.+-. +.+..++- T Consensus 134 ----~~~-----~~~~~----~d~i~~A~~~lgd~-------~~~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~~~~ 193 (274) T protein:vir:96 134 ----TVE-----ADITK----LTGLQTAIDKFNDE-------DLEPMVLFISPLDAGKLRGDATTNFTRATELGDDVIVK 193 (274) T ss_pred ----ccc-----ccccC----HHHHHHHHHHhccc-------cccccEEEeCHHHHHHHHhhccccccccccccccceec Confidence 000 01112 44455555444322 234668999999999985421 11211110 Q ss_pred HHHHhCCccEEEEccccccccCCCCccEEEEEEcccccccccccCCCcceEeecchhhhccCceeccCceEEec-cccee Q lcl|Aclame:pro 294 WLKQTYPRVRVMSAPELQGGNPDDGKDIAYMFLDSVDTAVDGSTDGGDTWAQLVQSKFVTLGVEKRVKNYVEAY-SNATA 372 (388) Q Consensus 294 ~lk~n~pnl~i~~~pel~~a~gtg~~~~~~~~~~~~d~~~~~~~~~~~t~~~~~p~~~r~~~v~~~~~~~~~~~-~~~t~ 372 (388) =.--.+-+++|+.-..+. ....+++.+.- . .....+. .. -|..| ..+...-.. .-..+ T Consensus 194 G~ig~~~G~~Vi~s~~~~-------~~t~~l~~~gA--~--~~~~~~~-~~---vE~~R------d~~~~~d~i~~~~~y 252 (274) T protein:vir:96 194 GAFGEALGAVIVRSNKLE-------AGTAILAKKGA--V--KLITKRD-FF---LETDR------DPSTKTTALYSDKHY 252 (274) T ss_pred cccceecCeEEEEeCCCC-------CceEEEEeccc--e--eeeecCC-cc---ccccc------ccccccCEEEEeEEE Confidence 000012244544332221 11223332210 0 0000000 00 01111 001111111 11367 Q ss_pred eeeeeccccceeeccC Q lcl|Aclame:pro 373 GVMLKRPWAVVRLIGL 388 (388) Q Consensus 373 G~ii~rP~ai~~~~GI 388 (388) |+-+.+|-.++.+.-= T Consensus 253 ~~~~~~~~~~v~~tk~ 268 (274) T protein:vir:96 253 VAYLYDESKAVKITKG 268 (274) T ss_pred EEEEEcCCcEEEEEcC Confidence 8888888877776633 No 98 >protein:vir:95898 Length: 274 # NCBI annotation: ORF014 # Family: family:all:522 # MgeID: mge:1588 # MgeName: 71 # Cross-refs: genbank:acc:YP_240385;genbank:gi:66396054;genbank:GeneID:5133409 Probab=77.16 E-value=0.13 Score=25.48 Aligned_cols=256 Identities=11% Similarity=0.022 Sum_probs=123.8 Q ss_pred hccCcccccccccccchHHHHHHHhhcceeeeecccchhhhhhcccccC--CCCceeeEEEeeeccccceEecccccCCc Q lcl|Aclame:pro 65 QAFDSAYVAPTTQASIPTPIQFLQQWLPGFVKVLTSARKIDEILGVKTV--GSWEDQEIVQGIVEPAGTAMEYGDLTNIP 142 (388) Q Consensus 65 ~amDaa~~~~~t~~~~g~l~~~l~~idp~v~e~l~~~~~~~~i~~v~t~--g~w~~~t~~~~v~e~~G~a~~ygd~~diP 142 (388) || .+. ...+++=+|-.+..++.-++ ..-+....|..++.. |.- -.+++++.+...|.+.-|.++++++ T Consensus 1 m~--~~~---T~l~d~i~Pev~~~~v~~~~----~~~l~~~~~~~~~~~l~g~~-G~tv~iP~~~~ig~a~~~~~g~~i~ 70 (274) T protein:vir:95 1 MA--QGM---TKLTNQIVPEVLAPMMQAEL----EKKLRFASFAEIDNTLVGQP-GDTLTFPAFIYSGDAKVVAEGEKIP 70 (274) T ss_pred CC--cce---eehhheechHHHHHHHHHHH----HhhhhccccceecccccCCC-CCEEEeeeecCCCccccccCCCccc Confidence 12 111 12334445655555554333 333333444333322 222 3688999999999999999999999 Q ss_pred eeeeeeeeeeeeEEEEEEEEeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEEeecCccccceEEEeecCCCcc Q lcl|Aclame:pro 143 LSSWNVNFERRTIVRGEMGIQVGLLEEGRASAMRINSAEVKRQGAAVQLEIMRNAIGFYGWEGKNGNRTFGFLNDPSLLP 222 (388) Q Consensus 143 ~~~~n~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~K~~aAr~a~~~~~n~i~~~G~a~~~~~g~~GllN~P~l~a 222 (388) .-..........+...+-++.+. ++.+.+ .+-++..+-...+..++...+++..+ + .++.... T Consensus 71 ~~~lt~~~~~~~i~~~~~a~~i~--D~~~~~-~~~d~~~~~~~~~~~~~a~~vd~~i~---~---------~l~~a~~-- 133 (274) T protein:vir:95 71 TDILETKKREAKIRKIAKGTSIS--DEALLS-GYGDPQGEQVRQHGLAHANKVDDDVL---E---------ALKSAKL-- 133 (274) T ss_pred hhhcccceeEEEeeeeecceeeh--HHHHhh-ccchHHHHHHHHHHHHHHHHHHHHHH---H---------HHhcccc-- Confidence 99999988888887766555555 454443 34466777777777888777776433 1 0110000 Q ss_pred ccccccCCcccccccCCHHHHHHHHHHHHHHHHHhcCCeeccccccceEEcCHHHHHhhccCC--------CcCccHHH- Q lcl|Aclame:pro 223 AIASTTPGGWVSGGANAFQGIVGDLRLMLITLRVQSEDNIDPEDVDITLVLPMNKVDMLSVVT--------DLGISVRD- 293 (388) Q Consensus 223 ~~~~~~~~~~t~Wa~kT~~eI~~DI~~~~~~l~~~s~g~v~~~~~p~tL~Lp~~~~~~Ls~~~--------~~~~Tvl~- 293 (388) +.. -..-+ ++.|..++..+-.. ...+..|+++|..+..|.+-. +.+..++- T Consensus 134 ----~~~-----~~~~~----~d~i~~A~~~lgd~-------~~~~~~ivv~p~~~~~L~k~~~~~f~~~s~~g~~~~~~ 193 (274) T protein:vir:95 134 ----TVE-----ADITK----LTGLQTAIDKFNDE-------DLEPMVLFISPLDAGKLRGDATTNFTRATELGDDVIVK 193 (274) T ss_pred ----ccc-----ccccC----HHHHHHHHHHhccc-------cccccEEEeCHHHHHHHHhhccccccccccccccceec Confidence 000 01112 44455555444322 234668999999999985421 11211110 Q ss_pred HHHHhCCccEEEEccccccccCCCCccEEEEEEcccccccccccCCCcceEeecchhhhccCceeccCceEEec-cccee Q lcl|Aclame:pro 294 WLKQTYPRVRVMSAPELQGGNPDDGKDIAYMFLDSVDTAVDGSTDGGDTWAQLVQSKFVTLGVEKRVKNYVEAY-SNATA 372 (388) Q Consensus 294 ~lk~n~pnl~i~~~pel~~a~gtg~~~~~~~~~~~~d~~~~~~~~~~~t~~~~~p~~~r~~~v~~~~~~~~~~~-~~~t~ 372 (388) =.--.+-+++|+.-..+. ....+++.+.- . .....+. .. -|..| ..+...-.. .-..+ T Consensus 194 G~ig~~~G~~Vi~s~~~~-------~~t~~l~~~gA--~--~~~~~~~-~~---vE~~R------d~~~~~d~i~~~~~y 252 (274) T protein:vir:95 194 GAFGEALGAVIVRSNKLE-------AGTAILAKKGA--V--KLITKRD-FF---LETDR------DPSTKTTALYSDKHY 252 (274) T ss_pred cccceecCeEEEEeCCCC-------CceEEEEeccc--e--eeeecCC-cc---ccccc------ccccccCEEEEeEEE Confidence 000012244544332221 11223332210 0 0000000 00 01111 001111111 11367 Q ss_pred eeeeeccccceeeccC Q lcl|Aclame:pro 373 GVMLKRPWAVVRLIGL 388 (388) Q Consensus 373 G~ii~rP~ai~~~~GI 388 (388) |+-+.+|-.++.+.-= T Consensus 253 ~~~~~~~~~~v~~tk~ 268 (274) T protein:vir:95 253 VAYLYDESKAVKITKG 268 (274) T ss_pred EEEEEcCCcEEEEEcC Confidence 8888888877776633 No 99 >protein:vir:3991 Length: 404 # NCBI annotation: major structural protein # Family: family:all:21 # MgeID: mge:319 # MgeName: BK5-T # Cross-refs: genbank:acc:NP_116499;genbank:gi:14251132;genbank:GeneID:921252 Probab=76.10 E-value=0.14 Score=25.28 Aligned_cols=325 Identities=10% Similarity=0.007 Sum_probs=126.9 Q ss_pred CCC----cc-------eeeeecCcccc-chhhhhhccccc-ccccCCHHHHhhcceecccchhhcchhhhhhhhhhhhcc Q lcl|Aclame:pro 1 MKQ----LS-------KVHQSLAGRSV-RAFDMANGKADY-RLTDMAVRELKKFGLVFDHATVKRQIELLHEGGVATQAF 67 (388) Q Consensus 1 ~~~----~~-------~~~~~~~~~~~-~~~~~~~~~~~~-~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~am 67 (388) +.+ +. .+...+.-... +........... ...........+-.+. .... ...... ......++ T Consensus 42 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~-~~~~~~--~~~e~~a~ 116 (404) T protein:vir:39 42 MSELKNKRDNEKVRRDALREQLVEAQAEQVVNMREEEKGPLNKSEYELKDKFVKEFV--NMVR-NPMAFL--NTVSSKTE 116 (404) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccccchhhhHHHHHHHHH--HHHh-cchhhh--hhhhhhhh Confidence 000 00 00000000000 000000000000 0000000000000000 0000 000000 00011111 Q ss_pred Ccccccccccccch--HHHHHHHhhcceeeeecccchhhhhhcccccCCCCceeeEEEeeeccccceEecccccCCce-e Q lcl|Aclame:pro 68 DSAYVAPTTQASIP--TPIQFLQQWLPGFVKVLTSARKIDEILGVKTVGSWEDQEIVQGIVEPAGTAMEYGDLTNIPL-S 144 (388) Q Consensus 68 Daa~~~~~t~~~~g--~l~~~l~~idp~v~e~l~~~~~~~~i~~v~t~g~w~~~t~~~~v~e~~G~a~~ygd~~diP~-~ 144 (388) - ..+..+.| +|..+.+ .|++.+......++++.+.....-.-........+..+.+...+....+|- . T Consensus 117 ~-----~~t~~~gg~~iP~~~~~----~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~Eg~~~~~~~ 187 (404) T protein:vir:39 117 T-----SGSDSAAGLTIPQDIRT----MINTLVRQYDSLQQYVRVESVSTSNGSRVYEKWTDVTPLTVMDAEDGKIPDLD 187 (404) T ss_pred h-----cccccCCceeccHHHHH----HHHHHHHhhhhHHhhcceeeccCCcceEEEEeecCCccceeeecCcccccccc Confidence 1 11222333 4554444 555665555556666544332111111112222344466677888888885 5 Q ss_pred eeeeeeeeeeEEEEEEEEeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEEeecCccccceEEEeecCCCcccc Q lcl|Aclame:pro 145 SWNVNFERRTIVRGEMGIQVGLLEEGRASAMRINSAEVKRQGAAVQLEIMRNAIGFYGWEGKNGNRTFGFLNDPSLLPAI 224 (388) Q Consensus 145 ~~n~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~K~~aAr~a~~~~~n~i~~~G~a~~~~~g~~GllN~P~l~a~~ 224 (388) +..........+.++..+.++.+=++. ...+|.+.-......++...+|+-.+.|+... .| T Consensus 188 ~~~f~~i~~~~~k~~~~~~iS~ell~d---s~~~l~~~i~~~l~~~~~~~~d~~il~g~g~~----------~~------ 248 (404) T protein:vir:39 188 NPRLTIIKYLIKRYAGIITATNTLLKD---TAENILAWLSSWIAKKVVVTRNQAIIAAMGTV----------PK------ 248 (404) T ss_pred ccceeeEEeeeeeEEeeehhHHHHHhh---chHHHHHHHHHHHHHHHHHHHHHHHHhccccc----------cc------ Confidence 678888899999999888888854432 34677888888888888888888888886310 00 Q ss_pred ccccCCcccccccCCHHHHHHHHHHHHHHHHHhcCCeeccccccceEEcCHHHHHhhccC-CCcCccHHHHHHHhCCccE Q lcl|Aclame:pro 225 ASTTPGGWVSGGANAFQGIVGDLRLMLITLRVQSEDNIDPEDVDITLVLPMNKVDMLSVV-TDLGISVRDWLKQTYPRVR 303 (388) Q Consensus 225 ~~~~~~~~t~Wa~kT~~eI~~DI~~~~~~l~~~s~g~v~~~~~p~tL~Lp~~~~~~Ls~~-~~~~~Tvl~~lk~n~pnl~ 303 (388) . ++ ..+ ++||..++....... +.. ...++|.+..+..|... +..|.-++.--..+...-+ T Consensus 249 --~--~~-----~~~----~~~i~~~~~~~~~~~---~~~---~a~~v~n~~~~~~L~~lkd~~G~~l~~~~~~~~~~~~ 309 (404) T protein:vir:39 249 --K--PT-----IAK----FDDVITMINTSVDPA---IIA---TSSLLTNQSGLNKLALVKTAEGKYLLEPDPTKPNSYL 309 (404) T ss_pred --c--cc-----ccc----HHHHHHHHHHhhhhh---hcc---CCEEEEcHHHHHHHHHhhccCCceeeccCcCCCCcce Confidence 0 01 122 334444443222111 111 12689999999988643 3334433210000111112 Q ss_pred EEEcccc--c--cccCCCCccEEEEEEcccccccccccCCCcceEeec-ch---hhhccCceeccCceEEecccceeeee Q lcl|Aclame:pro 304 VMSAPEL--Q--GGNPDDGKDIAYMFLDSVDTAVDGSTDGGDTWAQLV-QS---KFVTLGVEKRVKNYVEAYSNATAGVM 375 (388) Q Consensus 304 i~~~pel--~--~a~gtg~~~~~~~~~~~~d~~~~~~~~~~~t~~~~~-p~---~~r~~~v~~~~~~~~~~~~~~t~G~i 375 (388) |-..|=. + .....+..+..++|. +.......... ......+ +. .|..+ .....+..|. |+. T Consensus 310 l~G~pV~~~~~~~~~~~~~~~~~~~~g-d~~~~~~~~~~--~~~~i~~~~~~~~~~~~~-------~~~~r~~~r~-d~~ 378 (404) T protein:vir:39 310 IKGKKVIVVADRWLPNSGSTVYPLYYG-DMSQAITLFDR--ENMSLLPTNIGAGAFETD-------TTKIRVIDRF-DVK 378 (404) T ss_pred ecceeEEEecccccCccCCCccEEEEE-eccccEEEEee--cceEEEEeccchhhhhhc-------eeeEEEEeee-ccE Confidence 3222211 0 010112222222222 11100000000 0011101 10 01111 1122233333 567 Q ss_pred eeccccceeeccC Q lcl|Aclame:pro 376 LKRPWAVVRLIGL 388 (388) Q Consensus 376 i~rP~ai~~~~GI 388 (388) +++|-+|+.+..- T Consensus 379 ~~~~~a~~~~~~~ 391 (404) T protein:vir:39 379 TTDSEALVAGSFT 391 (404) T ss_pred EecccceEEEEee Confidence 7889999998877 No 100 >protein:vir:80213 Length: 334 # NCBI annotation: capsid protein # Family: family:all:2806 # MgeID: mge:1879 # MgeName: LKA1 # Cross-refs: genbank:acc:YP_001522884;genbank:gi:158345177;genbank:GeneID:5687476 Probab=71.60 E-value=0.19 Score=24.49 Aligned_cols=297 Identities=9% Similarity=0.014 Sum_probs=115.1 Q ss_pred hccC--cccccccccccchHHHHHHHhhcceeeeecccchhhhhhcccccCCCCceeeEEEeeeccccceEecc--cccC Q lcl|Aclame:pro 65 QAFD--SAYVAPTTQASIPTPIQFLQQWLPGFVKVLTSARKIDEILGVKTVGSWEDQEIVQGIVEPAGTAMEYG--DLTN 140 (388) Q Consensus 65 ~amD--aa~~~~~t~~~~g~l~~~l~~idp~v~e~l~~~~~~~~i~~v~t~g~w~~~t~~~~v~e~~G~a~~yg--d~~d 140 (388) |+-- .....+.--...+-..-+|+-+.-+|.+....-...++++.+.+.-. -.++.|+.+ |++.... -+.. T Consensus 1 m~~~~~~~~t~~~~~~~~~~~~l~le~~~geV~~af~~~s~~~~~~~~r~i~~--G~s~~~~~i---G~~~~~~~~~g~~ 75 (334) T protein:vir:80 1 MTYPAANTHTRPGWGGANSDVSLHIEEHLGLVDASFMYSSKFASWMNVRSLRG--TNQLRVDRV---GASTIAGRKAGEE 75 (334) T ss_pred CCCCcCCCccccccccccchheehhhhhhhHHHHHHHHhhhhhccceeeeccc--cceEEEeee---cceeeeeecCCCC Confidence 1100 00111100001112223445555544333333344455555544211 255566554 5555432 1222 Q ss_pred CceeeeeeeeeeeeEEEEE-EEEeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEEeec----CccccceEEEe Q lcl|Aclame:pro 141 IPLSSWNVNFERRTIVRGE-MGIQVGLLEEGRASAMRINSAEVKRQGAAVQLEIMRNAIGFYGWE----GKNGNRTFGFL 215 (388) Q Consensus 141 iP~~~~n~~~~~~~v~~~~-~~~~y~~~El~~A~~~g~~l~~~K~~aAr~a~~~~~n~i~~~G~a----~~~~~g~~Gll 215 (388) +..-.+.-++ ..+..=+ .-++.-+.++..+ ++..++-++-.+.+..++.++.|+..+.-.. -........-+ T Consensus 76 l~~~~~~~~~--~~l~ID~~l~~~~~VddiD~~-q~~~D~rse~~~~~G~aLA~~~D~~~~~~l~kaa~~~~~~~~~~~~ 152 (334) T protein:vir:80 76 LVVQKNVSDK--LNLTVDTVLYARHFFDKFDEW-TSNLDVRKETAREDGIALARQYDQACIIQLQKCGDFLAPAHLKPAF 152 (334) T ss_pred CCCCCcccCc--eEEEEeeeeehhhhHhhHHHH-hcCcchHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccccccc Confidence 2111111111 1121111 1122233344444 3445666666666677777766654321100 00000011111 Q ss_pred ecCCCccccccccCCcccccccCCHHHHHHHHHHHHHHHHHhcCCeeccc--cccceEEcCHHHHHhhccC-----CCcC Q lcl|Aclame:pro 216 NDPSLLPAIASTTPGGWVSGGANAFQGIVGDLRLMLITLRVQSEDNIDPE--DVDITLVLPMNKVDMLSVV-----TDLG 288 (388) Q Consensus 216 N~P~l~a~~~~~~~~~~t~Wa~kT~~eI~~DI~~~~~~l~~~s~g~v~~~--~~p~tL~Lp~~~~~~Ls~~-----~~~~ 288 (388) ++... ..+.. .+.+.-...+++.+++=+..+...+..+.- |+ .....++++|.+|..|-.- .+|+ T Consensus 153 ~~G~~-~~~~~---~g~~~~~~~~~~~l~~a~~~a~~~L~e~dv----p~~~~~~R~~vv~P~~y~~Ll~~~r~~n~d~~ 224 (334) T protein:vir:80 153 HDGIL-LPSTI---SGLAADAAADADVLVAAHRQGVEAMVFRDL----GDQLMSEGVTLLDPVIFSFLLEHDRLMNVEFG 224 (334) T ss_pred cCCcc-eeecc---cccccchhhhHHHHHHHHHHHHHHHHhcCC----CCCcCCceEEEeChHHHHHHhcccccccceec Confidence 11111 11111 111222346688888888888777777653 21 1246899999999988432 1221 Q ss_pred c--cHHHHHHH---hCCccEEEEccccccccCCCC--ccEEEEEEcccccccccccCCCcceEeecchhh---hccCcee Q lcl|Aclame:pro 289 I--SVRDWLKQ---TYPRVRVMSAPELQGGNPDDG--KDIAYMFLDSVDTAVDGSTDGGDTWAQLVQSKF---VTLGVEK 358 (388) Q Consensus 289 ~--Tvl~~lk~---n~pnl~i~~~pel~~a~gtg~--~~~~~~~~~~~d~~~~~~~~~~~t~~~~~p~~~---r~~~v~~ 358 (388) - +...+-+. +.=.++|...+.|=..+.+.. +...-.|+.+. ......+.-++.. +..+ . T Consensus 225 ~s~~~~~~~~g~i~~v~G~~V~~Sn~~P~~~~t~~~~g~~~~~~agd~---------t~~~~~~~~~~Al~t~~~~~--~ 293 (334) T protein:vir:80 225 AKEGGNSFVGGRIAMLNGVRVVETPRFPQSAITANALGADFNVTDAEV---------RRKMITFIPSMALISAQVHP--V 293 (334) T ss_pred cccccccccceeEEEEeceEEEeecCCCCccccccccccccccccccc---------cceEEEEEeCceEEEEEEee--c Confidence 1 11112111 111345555444432211111 11111111110 0000000000000 0000 0 Q ss_pred ccCceEEecccc-------eeeeeeecc--ccceeeccC Q lcl|Aclame:pro 359 RVKNYVEAYSNA-------TAGVMLKRP--WAVVRLIGL 388 (388) Q Consensus 359 ~~~~~~~~~~~~-------t~G~ii~rP--~ai~~~~GI 388 (388) ....|..+.... -.|+-+.|| .++..++++ T Consensus 294 ~~e~~~~~~~~~d~i~~~~a~G~g~lRPeaa~vv~~~~~ 332 (334) T protein:vir:80 294 SAQFWEEKKDFGHYLDTFQSYNIGQRRPDAVAVHDITVT 332 (334) T ss_pred ceeeeechhhHHHHHHHHHHcCCceeccceEEEEEEeee Confidence 111233333332 579999999 666778888 No 101 >protein:vir:100172 Length: 394 # NCBI annotation: putative major head protein # Family: family:all:21 # MgeID: mge:1524 # MgeName: phi AT3 # Cross-refs: genbank:acc:YP_025031;genbank:gi:48697264;genbank:GeneID:2948270 Probab=71.32 E-value=0.16 Score=25.02 Aligned_cols=330 Identities=10% Similarity=-0.005 Sum_probs=129.5 Q ss_pred CCCcceeeeecCc--------------------cccchhhhhhcccccccccC--CHHHHhh----------------cc Q lcl|Aclame:pro 1 MKQLSKVHQSLAG--------------------RSVRAFDMANGKADYRLTDM--AVRELKK----------------FG 42 (388) Q Consensus 1 ~~~~~~~~~~~~~--------------------~~~~~~~~~~~~~~~~~~~~--~~~~l~~----------------~g 42 (388) |+++-.....+.. .+++..+-.-.......+.+ .+.+|++ .+ T Consensus 1 M~~l~~l~~~~~~~~~e~~~~~~~~~~~~~~~~ee~~~~~~~~~~~~~~~~~l~~~i~~~e~~~~~~~~~~~~~~~~~~~ 80 (394) T protein:vir:10 1 MDKLQTLFNEVSAKCADLNAQLNAKLQDENASVDDFQKIKDDLTAAKARRDAINDQIKDLEAENKANSDPDKPVDNAQPN 80 (394) T ss_pred ChHHHHHHHHHHHHHHHHHHHHHHHHhhhhccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcchhhhhhhhccc Confidence 1100000000000 00000000000000000000 0000000 00 Q ss_pred ee-cccchhhcchhhhhhh-hhhhhccCcccccccccccchHHHHHHHhhcceeeeecccchhhhhhcccccCCCCceee Q lcl|Aclame:pro 43 LV-FDHATVKRQIELLHEG-GVATQAFDSAYVAPTTQASIPTPIQFLQQWLPGFVKVLTSARKIDEILGVKTVGSWEDQE 120 (388) Q Consensus 43 ~~-~~~~~~~~~~~~~~~~-~~~~~amDaa~~~~~t~~~~g~l~~~l~~idp~v~e~l~~~~~~~~i~~v~t~g~w~~~t 120 (388) .. ..........+.+..+ .......+.+. ...+.++.|+++- +.+..+|++.+.+....+.++.+..... .+ T Consensus 81 ~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~-~~~t~~~gg~~vP--~~~~~~ii~~~~~~~~l~~~~~~~~~~~---~~ 154 (394) T protein:vir:10 81 GTDLKKKPIDAKKKAINDFIHSHGKVIDNAA-GHVTSTEAGVLIP--EEIIYDPTAEVNSVVDLSTLVTKTPVTT---PK 154 (394) T ss_pred ccchhhhHHHHHHHHHHHHHhccchhhhhhh-cccccccCceecc--HHHHHHHHHHHHhhhhhhhhceeeeccC---Cc Confidence 00 0000000000000000 00001111111 1123334443331 2234567777777666677755443222 23 Q ss_pred EEEeeecc-ccceEecccccCCce-eeeeeeeeeeeEEEEEEEEeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceE Q lcl|Aclame:pro 121 IVQGIVEP-AGTAMEYGDLTNIPL-SSWNVNFERRTIVRGEMGIQVGLLEEGRASAMRINSAEVKRQGAAVQLEIMRNAI 198 (388) Q Consensus 121 ~~~~v~e~-~G~a~~ygd~~diP~-~~~n~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~K~~aAr~a~~~~~n~i 198 (388) ..|++... .+.+...+....+|- .+...+.....++.+...+.++.+-|+.+ ..+|.+.-....+.++...+|+- T Consensus 155 ~~~~~~~~~~~~~~~~~E~~~~~~~~~~~~~~v~l~~~k~~~~~~iS~ell~ds---~~~l~~~i~~~la~~~~~~~~~~ 231 (394) T protein:vir:10 155 GTYPILKRATDRFSSVAELAENPALAEPEFEQVDWSVSTYRGAIPLSEEAIADS---AVDLTSLVGQSINEKSVNTYNAM 231 (394) T ss_pred eEEEEEecCCCccccccccccccccccccceeEEeeeeeeEeeehhHHHHHhhh---hHHHHHHHHHHHHHHHHHHHHHH Confidence 45555543 356667788888885 55688888888898988888888655543 34677777777888888888877 Q ss_pred EEEeecCccccceEEEeecCCCccccccccCCcccccccCCHHHHHHHHHHHHHHHHHhcCCeeccccccceEEcCHHHH Q lcl|Aclame:pro 199 GFYGWEGKNGNRTFGFLNDPSLLPAIASTTPGGWVSGGANAFQGIVGDLRLMLITLRVQSEDNIDPEDVDITLVLPMNKV 278 (388) Q Consensus 199 ~~~G~a~~~~~g~~GllN~P~l~a~~~~~~~~~~t~Wa~kT~~eI~~DI~~~~~~l~~~s~g~v~~~~~p~tL~Lp~~~~ 278 (388) .+.|... +. +.+.. ...+ ++||..++......- + ...++|.++.+ T Consensus 232 il~g~g~----~~--------------~~~~~-----~~~~----~d~l~~~~~~~~~~~---~-----~a~~vmn~~~~ 276 (394) T protein:vir:10 232 IAPVLQS----FT--------------AKATT-----TDTL----VDSLKHILNVDLDPA---Y-----SRALVVTQSLF 276 (394) T ss_pred Hhhcccc----cc--------------ccccc-----cccc----HHHHHHHHHhhhhhh---c-----cCEEEecHHHH Confidence 7766431 10 00000 1122 344444443222211 1 12589999988 Q ss_pred HhhccC-CCcCccHHHHHHH---------hCCccEEEEccccccccCCCCccEEEEEEcccccccccccCCCcceEeecc Q lcl|Aclame:pro 279 DMLSVV-TDLGISVRDWLKQ---------TYPRVRVMSAPELQGGNPDDGKDIAYMFLDSVDTAVDGSTDGGDTWAQLVQ 348 (388) Q Consensus 279 ~~Ls~~-~~~~~Tvl~~lk~---------n~pnl~i~~~pel~~a~gtg~~~~~~~~~~~~d~~~~~~~~~~~t~~~~~p 348 (388) ..|..- +..|.-++.--.. ....+.++..+. ... ++++++..++|.+=.+........ .++..+. T Consensus 277 ~~l~~lkd~~G~~i~~~~~~~~~~~~~~~~L~G~PV~~~~~-~~~-~~~~~~~~i~~gd~s~~~~~~~~~---~~~v~~~ 351 (394) T protein:vir:10 277 NTLDTLKDKNGRYLLHDASDSITDGTAKGTVLGVPVYVVGD-ALL-GSAAGDQKAFVGDLKRGVLFADRQ---QVTLAWE 351 (394) T ss_pred HHHHHhhccCCCeeeeccccccccCCcccccccceeEEecc-ccc-CCCCCceEEEEeeccccEEEEeec---ceEEEEe Confidence 888643 3334333211000 011122322221 111 233445444443211000010000 0111110 Q ss_pred hhhhccCceeccCceEEecccceeeeeeeccccceeeccC Q lcl|Aclame:pro 349 SKFVTLGVEKRVKNYVEAYSNATAGVMLKRPWAVVRLIGL 388 (388) Q Consensus 349 ~~~r~~~v~~~~~~~~~~~~~~t~G~ii~rP~ai~~~~GI 388 (388) +.-.. ... +-...|.+| .+++|-+|+.+..= T Consensus 352 -~~~~~-----~~~--~~~~~r~d~-~~~~~~ai~~~~~~ 382 (394) T protein:vir:10 352 -DSKIY-----GRY--LGAAFRFGV-KQADSNAGYFVTNT 382 (394) T ss_pred -ccccc-----cee--EEEEEEecc-EEeccccEEEEEee Confidence 00000 011 122335544 55559999886644 No 102 >protein:vir:8843 Length: 317 # NCBI annotation: major head protein # Family: family:all:3919 # MgeID: mge:158 # MgeName: PaP3 # Cross-refs: genbank:acc:NP_775251;genbank:gi:27476049;genbank:GeneID:2700597 Probab=70.69 E-value=0.21 Score=24.35 Aligned_cols=281 Identities=7% Similarity=-0.051 Sum_probs=117.6 Q ss_pred hccCcccccccccccchH---HHHHHHhhcceeeeecccchhhhhhcccccCCCCceeeEEEeeeccccceE-ecccccC Q lcl|Aclame:pro 65 QAFDSAYVAPTTQASIPT---PIQFLQQWLPGFVKVLTSARKIDEILGVKTVGSWEDQEIVQGIVEPAGTAM-EYGDLTN 140 (388) Q Consensus 65 ~amDaa~~~~~t~~~~g~---l~~~l~~idp~v~e~l~~~~~~~~i~~v~t~g~w~~~t~~~~v~e~~G~a~-~ygd~~d 140 (388) ||..|.. -.|..+.|- |..++..|.| ..+-..+++.- ..-...++.+..-+....+. ....+.| T Consensus 1 ma~~~~~--~~t~~~~g~~~dl~~~I~~isp-------~dTPf~S~i~~---~~a~~~~~~W~~d~l~~~~~~~~~EG~d 68 (317) T protein:vir:88 1 MATPTNA--VSTVEINGKREDLIDIIYNIAP-------YDTPFMSAIGK---GVATAITHEWQTDELRQPGKNTRVEGED 68 (317) T ss_pred CCccccc--eEeeeeeeeeechhhhheecCC-------ccCcceeeecC---ceecccEEEEEeeecCCccccccccCcc Confidence 3333321 112222321 2333333333 22222234332 12233344455443332221 1112333 Q ss_pred Cceeeeeeeee---eeeEEEEEEEEeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEEeec------Cccccce Q lcl|Aclame:pro 141 IPLSSWNVNFE---RRTIVRGEMGIQVGLLEEGRASAMRINSAEVKRQGAAVQLEIMRNAIGFYGWE------GKNGNRT 211 (388) Q Consensus 141 iP~~~~n~~~~---~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~K~~aAr~a~~~~~n~i~~~G~a------~~~~~g~ 211 (388) .|......... .-+|++=...+.++.+....++. -+..+....-+...+...++...+.|.. ......+ T Consensus 69 a~~~~~~~r~~~~N~tQIf~k~v~VSgTa~av~~~G~--~~ela~q~~kk~~EikrdmE~~li~g~~a~~~~~~t~~r~~ 146 (317) T protein:vir:88 69 ATIKAGSFTTMLNNYCQISDETLQVTGTADRVKKAGR--KNELAYQLAKKSKELKLDMEYALVGAPQAKVQRNTTTPGQM 146 (317) T ss_pred cccccccCCEEeccEEEEEEeEEEEeehhhhhhhcCc--cchhHHHHHHHHHHHHHHHHHHHhcCeeeccCCCCccchhh Confidence 33222211111 23455545555566655544332 2322332222333333444444444432 2122345 Q ss_pred EEEeec---CCCc-cccccccCCcccccccCCHHHH-HHHHHHHHHHHHHhcCCeeccccccceEEcCHHHHHhhccCCC Q lcl|Aclame:pro 212 FGFLND---PSLL-PAIASTTPGGWVSGGANAFQGI-VGDLRLMLITLRVQSEDNIDPEDVDITLVLPMNKVDMLSVVTD 286 (388) Q Consensus 212 ~GllN~---P~l~-a~~~~~~~~~~t~Wa~kT~~eI-~~DI~~~~~~l~~~s~g~v~~~~~p~tL~Lp~~~~~~Ls~~~~ 286 (388) -|+++- -++. +.......++...|-+.|+..+ .+||++++.++|..-+ .|..+.+++.....|+.-.. T Consensus 147 ~Gl~~~i~t~~~~~~~g~~~~~~~~~~~t~~t~~~lte~~l~~~l~~i~~~Gg-------~~~~i~v~a~~k~~i~~~~~ 219 (317) T protein:vir:88 147 ANIFAYYKTNGSLGANGVAPVGDGSNTGTAGDLRLLTEDMLLNASESIWRNGG-------QANSIQTSSSIKKAISKNMK 219 (317) T ss_pred hhHHHHhccCceeccCccccccCCCccccccccccccHHHHHHHHHHHHhcCC-------CCCEEEeChHHHHHHHHHhc Confidence 555543 1211 1111112233344544444443 4558899999998654 24578999998888864211 Q ss_pred c--------------CccHHHHHHHhCCccEEEEccccccccCCCCccEEEEEEcccccccccccCCCcceEeecchhhh Q lcl|Aclame:pro 287 L--------------GISVRDWLKQTYPRVRVMSAPELQGGNPDDGKDIAYMFLDSVDTAVDGSTDGGDTWAQLVQSKFV 352 (388) Q Consensus 287 ~--------------~~Tvl~~lk~n~pnl~i~~~pel~~a~gtg~~~~~~~~~~~~d~~~~~~~~~~~t~~~~~p~~~r 352 (388) . +.+|-.|. -+|-.++|+.-+.+. .+.++++..+ -+.+.+-.++. T Consensus 220 ~~~~~i~~~~~~~~~g~~v~~~~-tdfG~v~ii~~r~lp-------~~~~~~~D~~-------------~~~l~~Lr~~~ 278 (317) T protein:vir:88 220 GRATEITLDASDNRIAQTVDVYE-SDFGKYTIRANRWFH-------ENTLFVFDPK-------------MHSLCYLRPFF 278 (317) T ss_pred CCceeEEEcccCeEEEEEEEEEE-eCCeEEEEEeCCCCC-------CCeEEEEccc-------------ccceeecccce Confidence 1 11111111 123333444433332 2334444332 12222212222 Q ss_pred ccCceeccCceEEecccceeeeeeeccccceeeccC Q lcl|Aclame:pro 353 TLGVEKRVKNYVEAYSNATAGVMLKRPWAVVRLIGL 388 (388) Q Consensus 353 ~~~v~~~~~~~~~~~~~~t~G~ii~rP~ai~~~~GI 388 (388) ..+. .+....+--..-+-+|+-++-|.|.+...|| T Consensus 279 ~e~l-aKtGd~~k~~i~~E~tLe~~N~~a~a~i~~l 313 (317) T protein:vir:88 279 QHEL-AKTGDSEKRQLLVEYTFRVNNEKSGALIRDV 313 (317) T ss_pred eecc-CCCcccceeEEEEEEEEEEcCccceeEEEEe Confidence 1111 2222333333446789999999999999999 No 103 >protein:vir:1239 Length: 274 # NCBI annotation: similar to phage B1 major head protein # Family: family:all:522 # MgeID: mge:25 # MgeName: phi ETA # Cross-refs: genbank:acc:NP_510938;genbank:gi:17426272;genbank:GeneID:927376 Probab=64.71 E-value=0.3 Score=23.49 Aligned_cols=255 Identities=9% Similarity=-0.063 Sum_probs=120.1 Q ss_pred hccCcccccccccccchHHHHHHHhhcceeeeecccchhhhhhcccccCCCC-ceeeEEEeeeccccceEecccccCCce Q lcl|Aclame:pro 65 QAFDSAYVAPTTQASIPTPIQFLQQWLPGFVKVLTSARKIDEILGVKTVGSW-EDQEIVQGIVEPAGTAMEYGDLTNIPL 143 (388) Q Consensus 65 ~amDaa~~~~~t~~~~g~l~~~l~~idp~v~e~l~~~~~~~~i~~v~t~g~w-~~~t~~~~v~e~~G~a~~ygd~~diP~ 143 (388) || .+. ...+++=+|-.+..++..++ ...+....|..++....- .-.+++++.+...|.+..|.++++++. T Consensus 1 ma--~~~---T~l~d~iiPev~~~~v~~~~----~~~l~~~~~~~~d~~l~g~~G~tv~iP~~~~ig~a~~~~~g~~i~~ 71 (274) T protein:vir:12 1 MA--QGL---TKTSNQIIPEVLAPMMQAQL----EKKLRFASFAEVDSTLQGQPGDTLTFPAFVYSGDAQVVAEGEKIPT 71 (274) T ss_pred CC--cce---eehhhhhchHHHHHHHHHHH----HhhhhhcccceecccccCCCCCEEEEeeecCCCccccccCCCccch Confidence 11 111 12334446666666654443 333444455444433111 136789999999999999999999999 Q ss_pred eeeeeeeeeeeEEEEEEEEeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEEeecCccccceEEEeecCCCccc Q lcl|Aclame:pro 144 SSWNVNFERRTIVRGEMGIQVGLLEEGRASAMRINSAEVKRQGAAVQLEIMRNAIGFYGWEGKNGNRTFGFLNDPSLLPA 223 (388) Q Consensus 144 ~~~n~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~K~~aAr~a~~~~~n~i~~~G~a~~~~~g~~GllN~P~l~a~ 223 (388) .++........+...+-++.+.. +.+++. +-++..+-...+..++...+++-.+ . .++-. T Consensus 72 ~~lt~~~~~~~i~~~~~~~~i~D--~~~~~~-~~d~~~~~~~q~~~~~a~~vd~~~l-~-----------~~~~a----- 131 (274) T protein:vir:12 72 DILETKKREAKIRKIAKGTSITD--EALLSG-YGDPQGEQVRQHGLAHANKVDNDVL-E-----------ALMGA----- 131 (274) T ss_pred hhcccceeeEEeeeecceeeecH--HHHHhc-ccchHHHHHHHHHHHHHHHHHHHHH-H-----------HHhcc----- Confidence 99999988888887665555544 444443 4466666667777777777665432 1 01100 Q ss_pred cccccCCcccccccCCHHHHHHHHHHHHHHHHHhcCCeeccccccceEEcCHHHHHhhccCC--------CcCccHHH-H Q lcl|Aclame:pro 224 IASTTPGGWVSGGANAFQGIVGDLRLMLITLRVQSEDNIDPEDVDITLVLPMNKVDMLSVVT--------DLGISVRD-W 294 (388) Q Consensus 224 ~~~~~~~~~t~Wa~kT~~eI~~DI~~~~~~l~~~s~g~v~~~~~p~tL~Lp~~~~~~Ls~~~--------~~~~Tvl~-~ 294 (388) +...+ .+++. ++.|..++..+-.. ...+..|+++|..+..|.+-. +++..++- = T Consensus 132 ---~~~~~----~~a~~---~d~i~dA~~~lgd~-------~~~~~~ivv~p~~~~~L~k~~~~~fv~~s~~g~~~~~~G 194 (274) T protein:vir:12 132 ---KLTVN----ADITK---LNGLQSAIDKFNDE-------DLEPMVLFINPLDAGKLRGDASTNFTRATELGDDIIVKG 194 (274) T ss_pred ---ccccc----ccccC---HHHHHHHHHHhccc-------cccccEEEeCHHHHHHHHhhhhhhccccccccccceecc Confidence 00000 11111 33444454444322 234668999999999885421 12211110 0 Q ss_pred HHHhCCccEEEEc---cccccccCCCCccEEEEEEcccccccccccCCCcceEeecchhhhccCceeccCceEEecccce Q lcl|Aclame:pro 295 LKQTYPRVRVMSA---PELQGGNPDDGKDIAYMFLDSVDTAVDGSTDGGDTWAQLVQSKFVTLGVEKRVKNYVEAYSNAT 371 (388) Q Consensus 295 lk~n~pnl~i~~~---pel~~a~gtg~~~~~~~~~~~~d~~~~~~~~~~~t~~~~~p~~~r~~~v~~~~~~~~~~~~~~t 371 (388) .--.+-+++|+.- |+...- -. +...+.++.+.. . ..|.--.++.+. -...-. .. T Consensus 195 ~ig~~~G~~Vi~s~~~p~~t~~-l~-~~gA~~~~~~~~----------~-~vE~~Rd~~~~~-d~i~~~---------~~ 251 (274) T protein:vir:12 195 AFGEALGAIIVRSNKLEAGTAI-LA-KKGAVKLILKRD----------F-FLEVARDASTKT-TALYSD---------KH 251 (274) T ss_pred cceeecCeeEEEeCCCCcceEE-EE-eccceeeeecCC----------c-eeccccchhhcc-cEEEee---------eE Confidence 0001224454432 322110 00 111111111110 0 011100111111 111111 24 Q ss_pred eeeeeeccccceeeccC Q lcl|Aclame:pro 372 AGVMLKRPWAVVRLIGL 388 (388) Q Consensus 372 ~G~ii~rP~ai~~~~GI 388 (388) +||-+++|-.++.+..= T Consensus 252 y~~~~~~~~~vv~~t~~ 268 (274) T protein:vir:12 252 YVAYLYDESKAVKITKG 268 (274) T ss_pred EEEEEEcCCceEEEEcC Confidence 45555555555555433 No 104 >protein:vir:3870 Length: 400 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:82 # MgeName: A2 # Cross-refs: genbank:acc:NP_680487;swissprot:trembl:q8ltc0;genbank:gi:22296527;interpro:IPR006444;uniprot:Q8LTC0;genbank:GeneID:951713 Probab=62.49 E-value=0.33 Score=23.19 Aligned_cols=318 Identities=11% Similarity=-0.029 Sum_probs=126.1 Q ss_pred CCCcceeeeecCccccchhhhhhcccccccccCCHHHHhhcceecccchhhcchhhhhhhhhhhhccCcccccccccccc Q lcl|Aclame:pro 1 MKQLSKVHQSLAGRSVRAFDMANGKADYRLTDMAVRELKKFGLVFDHATVKRQIELLHEGGVATQAFDSAYVAPTTQASI 80 (388) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~amDaa~~~~~t~~~~ 80 (388) +....... ..+...+-............+....+ ..+..+-...............-...++. ...+.++. T Consensus 72 ~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~g 142 (400) T protein:vir:38 72 LKGNEQSS---GKKPDHPEEHSYRDALNAYLHTRGRN--TDGVNFEKTDVGTFAVLRAVPTDASDAVN----AGVKAADA 142 (400) T ss_pred HHHHhhcc---cccccchhhhhHHHHHHHHHhhHHHH--HHHHHHHHHHHHHHhhhhhhhHHHHHHHh----hcccccCC Confidence 00000000 00000000000000000000000000 00000000000000000000000111111 12233344 Q ss_pred hHHHHHHHhhcceeeeecccchhhhhhcccccCCCCceeeEEEeeec-cccceEecccccCCce-eeeeeeeeeeeEEEE Q lcl|Aclame:pro 81 PTPIQFLQQWLPGFVKVLTSARKIDEILGVKTVGSWEDQEIVQGIVE-PAGTAMEYGDLTNIPL-SSWNVNFERRTIVRG 158 (388) Q Consensus 81 g~l~~~l~~idp~v~e~l~~~~~~~~i~~v~t~g~w~~~t~~~~v~e-~~G~a~~ygd~~diP~-~~~n~~~~~~~v~~~ 158 (388) |+++- +.+.++|++.+......+.++++.+.+. .+..|++.. ..|.+...+....+|- .+...+...-..+.+ T Consensus 143 g~~vP--~~~~~~ii~~~~~~~~l~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~E~~~~~~~~~~~f~~i~~~~~k~ 217 (400) T protein:vir:38 143 ASTIP--ETISNTPQRELQTVVDLKPFTNVFQAST---QKGTYPTVANATTKMVTVAELEKNPAMAKPEFKPVNWSVETY 217 (400) T ss_pred ccccc--HHHHHHHHHHHHhhhhhhhcceeEeccC---cceEEEEEecCCCccccccccccccccccccceeeEeehhhe Confidence 43332 2234466666666556666655543321 244566654 4466667777777774 466777778888888 Q ss_pred EEEEeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEEeecCccccceEEEeecCCCccccccccCCcccccccC Q lcl|Aclame:pro 159 EMGIQVGLLEEGRASAMRINSAEVKRQGAAVQLEIMRNAIGFYGWEGKNGNRTFGFLNDPSLLPAIASTTPGGWVSGGAN 238 (388) Q Consensus 159 ~~~~~y~~~El~~A~~~g~~l~~~K~~aAr~a~~~~~n~i~~~G~a~~~~~g~~GllN~P~l~a~~~~~~~~~~t~Wa~k 238 (388) +..+.++.+=|. ....++.+.-....+.++...+|.-.++|.... ++++ .. T Consensus 218 ~~~~~is~ell~---ds~~~~~~~i~~~l~~~~~~~~~~~i~~~~~~~------------------~~~~--------~~ 268 (400) T protein:vir:38 218 RQALPVSQESID---DSAIDLVGLIAQNGQQIKVNTTNGAVATLLKGF------------------TAKT--------IS 268 (400) T ss_pred eeehhhHHHHHh---hhHHHHHHHHHHHHHHHHHHHHHHhhhhccccc------------------cccc--------cc Confidence 888888874332 234567777777788888888888777664311 0011 11 Q ss_pred CHHHHHHHHHHHHHHHHHhcCCeeccccccceEEcCHHHHHhhccC-CCcCccHHHH-HHHh----CCccEEEEcccccc Q lcl|Aclame:pro 239 AFQGIVGDLRLMLITLRVQSEDNIDPEDVDITLVLPMNKVDMLSVV-TDLGISVRDW-LKQT----YPRVRVMSAPELQG 312 (388) Q Consensus 239 T~~eI~~DI~~~~~~l~~~s~g~v~~~~~p~tL~Lp~~~~~~Ls~~-~~~~~Tvl~~-lk~n----~pnl~i~~~pel~~ 312 (388) + ++||..++......-. ...++|.|..+..|... +..|.-++.- +... ..+..++..+..- T Consensus 269 ~----~~~~~~~~~~~~~~~~--------~a~~v~~~~~~~~l~~lkd~~G~~i~~~~~~~~~~~~l~G~pv~~~~~~~- 335 (400) T protein:vir:38 269 S----VDDLKHINNVDLDPAY--------SRVIIASQSFYNFLDTVKDGNGRYLLQDSILTPSGKSVLGMPIAVVSDDT- 335 (400) T ss_pred c----HHHHHHHHHhhhhhhh--------CcEEEEcHHHHHHHHHhhccCCCeeeecCcCCCCccccccceeEEecccc- Confidence 2 3344444432222111 13689999999988653 3334433210 1111 1112232222111 Q ss_pred ccCCCCccEEEEEEcccccccccccCCCcceEeecchhhhccCceeccCceEEecccceeeeeeeccccceeeccC Q lcl|Aclame:pro 313 GNPDDGKDIAYMFLDSVDTAVDGSTDGGDTWAQLVQSKFVTLGVEKRVKNYVEAYSNATAGVMLKRPWAVVRLIGL 388 (388) Q Consensus 313 a~gtg~~~~~~~~~~~~d~~~~~~~~~~~t~~~~~p~~~r~~~v~~~~~~~~~~~~~~t~G~ii~rP~ai~~~~GI 388 (388) . .+.++..++|.+=-. ....... ..+...+.. +. .-...+....|.+|.+ .+|-+|+.+..= T Consensus 336 ~--~~~g~~~~~~gd~s~-~~~~~~~--~~~~~~~~~----~~----~~~~~~~~~~r~d~~~-~~~~a~~~l~~~ 397 (400) T protein:vir:38 336 L--GAAGEAHAFLGDIKR-AILFANR--ADFMVRWVD----DQ----IYGQFLQAGMRFGVSV-ADEKAGYFLTYT 397 (400) T ss_pred c--CCCCceEEEEEeccc-cEEEEee--cceEEEEec----cc----ccceeEEEEEEeccEE-ecccceEEEEee Confidence 1 123444444433110 0000000 001111100 00 0011223344554444 458888887666 No 105 >protein:vir:3845 Length: 395 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:322 # MgeName: phi adh # Cross-refs: genbank:acc:NP_050151;swissprot:trembl:q9t1f6;genbank:gi:9633043;uniprot:Q9T1F6;genbank:GeneID:1262163 Probab=60.96 E-value=0.36 Score=23.00 Aligned_cols=313 Identities=11% Similarity=-0.063 Sum_probs=127.4 Q ss_pred CCCc------------------ceeeeecCccccchhhhhhcccccccccCCHHHHhhcceecccchhhcchhhhhhhhh Q lcl|Aclame:pro 1 MKQL------------------SKVHQSLAGRSVRAFDMANGKADYRLTDMAVRELKKFGLVFDHATVKRQIELLHEGGV 62 (388) Q Consensus 1 ~~~~------------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~ 62 (388) +.++ +.....+.++...+..+. ..+..... ......++.... T Consensus 43 ~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---------------~~~~~~~~----~~~~~~~~~~~~- 102 (395) T protein:vir:38 43 INKLNASLKNAKMAQELAKSAYEDARANLNAEPVNKKPLP---------------VKDGKPDA----QAMKNQFVKDFK- 102 (395) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccccccc---------------hhhhhHHH----HHHHHHHHHHHH- Confidence 0000 000001111111000000 00000000 000001111111 Q ss_pred hhhccCcccccccccccch--HHHHHHHhhcceeeeecccchhhhhhcccccCCCCceeeEEEeee-ccccceEeccccc Q lcl|Aclame:pro 63 ATQAFDSAYVAPTTQASIP--TPIQFLQQWLPGFVKVLTSARKIDEILGVKTVGSWEDQEIVQGIV-EPAGTAMEYGDLT 139 (388) Q Consensus 63 ~~~amDaa~~~~~t~~~~g--~l~~~l~~idp~v~e~l~~~~~~~~i~~v~t~g~w~~~t~~~~v~-e~~G~a~~ygd~~ 139 (388) ...+.. ..+.++.| +|..+. ++|++.+......+.+..+.....-. ..+.+... +..+.+...+... T Consensus 103 ~~~~~~-----~~~~~~gg~~vP~~~~----~~ii~~~~~~~~l~~~~~~~~~~~~~-~~~~~~~~~~~~~~a~~v~E~~ 172 (395) T protein:vir:38 103 NLVTSG-----TTGTGNAGLTIPEDIQ----LQIRTLTRSFTSLESLANVENVTTSH-GSRVYEKLADITPLKDLDDESA 172 (395) T ss_pred HHHhhc-----cCccCCCceecchhHh----hHHHHHHHhhcchhhhcceeeccCCc-ceEEEEeeccCCcccccccccc Confidence 111111 12223344 454443 46666666665566664432211111 12223222 2233445567777 Q ss_pred CCcee-eeeeeeeeeeEEEEEEEEeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEEeecCccccceEEEeecC Q lcl|Aclame:pro 140 NIPLS-SWNVNFERRTIVRGEMGIQVGLLEEGRASAMRINSAEVKRQGAAVQLEIMRNAIGFYGWEGKNGNRTFGFLNDP 218 (388) Q Consensus 140 diP~~-~~n~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~K~~aAr~a~~~~~n~i~~~G~a~~~~~g~~GllN~P 218 (388) .+|-. ....+....+.+.+...+.++.+=++ ....+|.+--......++...+|+-.+.|+.... T Consensus 173 ~~~~~~~~~f~~v~~~~~k~~~~~~iS~ell~---ds~~~l~~~i~~~la~~~~~~~~~~il~g~g~~~----------- 238 (395) T protein:vir:38 173 LIGDNDDPELTVVKYLIHRYAGITTVTNTLLK---DTVDNIIQWLVNWAAKKDVVTRNAKILEVMGKAP----------- 238 (395) T ss_pred ccccccccceeeEEeeeeeeEeehhhHHHHHh---hhHHHHHHHHHHHHHHHHHHHHHHHHhhcccccc----------- Confidence 88754 46778888889999988888875332 2345677888888888888999988888864211 Q ss_pred CCccccccccCCcccccccCCHHHHHHHHHHHHHHHHHhcCCeeccccccceEEcCHHHHHhhccC-CCcCccHHHHHHH Q lcl|Aclame:pro 219 SLLPAIASTTPGGWVSGGANAFQGIVGDLRLMLITLRVQSEDNIDPEDVDITLVLPMNKVDMLSVV-TDLGISVRDWLKQ 297 (388) Q Consensus 219 ~l~a~~~~~~~~~~t~Wa~kT~~eI~~DI~~~~~~l~~~s~g~v~~~~~p~tL~Lp~~~~~~Ls~~-~~~~~Tvl~~lk~ 297 (388) +. . + ..+ ++||..+++...... +. ....++|.+..+..|.+. +..|.-++.---. T Consensus 239 ~~------~---~-----~~~----~~~i~~~~~~~l~~~---~~---~~a~~v~n~~~~~~L~~lkd~~G~~l~~~~~~ 294 (395) T protein:vir:38 239 KK------P---T-----ISQ----FDNIKDLENNTLDPA---IE---STSSFITNQSGYNILSKVKDADGRYLMQPDVT 294 (395) T ss_pred cc------c---c-----ccc----HHHHHHHHHHhhhhh---hc---CCCEEEEcHHHHHHHHHhhccCCceeeccCcC Confidence 00 0 0 012 344554444322211 11 123689999999988543 3344433211001 Q ss_pred hCCccEEEEccccc--c-ccCCCCccEEEEEEcccccccccccCCCcceEeecchhhhccCceeccCceEEecccceeee Q lcl|Aclame:pro 298 TYPRVRVMSAPELQ--G-GNPDDGKDIAYMFLDSVDTAVDGSTDGGDTWAQLVQSKFVTLGVEKRVKNYVEAYSNATAGV 374 (388) Q Consensus 298 n~pnl~i~~~pel~--~-a~gtg~~~~~~~~~~~~d~~~~~~~~~~~t~~~~~p~~~r~~~v~~~~~~~~~~~~~~t~G~ 374 (388) +...-+|-..|=+. . ..+.++++..++|.+-....... ....-+++. .+.. ...+..-.+..-+..|. |+ T Consensus 295 ~~~~~~l~G~pV~~~~~~~~~~~~~~~~i~~gd~~~~~~i~-~~~~~~i~~-~~~~----~~~~~~~~~~~r~~~r~-d~ 367 (395) T protein:vir:38 295 SPDKYLIDGKPVIRIADKWLPDVSGSHPLYFGDLKQGITLF-DRQQMQIDT-TNVG----AGSFEHDTTKLRFIDRF-DV 367 (395) T ss_pred CCCcceeccceeEEecccccCcCCCcceEEEEeccccEEEE-EecceEEEE-eccc----cchhhcCceEEEEEEee-cc Confidence 11111222222111 0 00112233333443211000000 000001111 0000 00000111222333344 55 Q ss_pred eeeccccceeeccC Q lcl|Aclame:pro 375 MLKRPWAVVRLIGL 388 (388) Q Consensus 375 ii~rP~ai~~~~GI 388 (388) .+.+|-||+.+..- T Consensus 368 ~~~~~~a~~~~~~~ 381 (395) T protein:vir:38 368 QLIDDGAFAAASFK 381 (395) T ss_pred EEecccceEEEEee Confidence 66679999999877 No 106 >protein:vir:4511 Length: 409 # NCBI annotation: capsid # Family: family:all:21 # MgeID: mge:97 # MgeName: V # Cross-refs: genbank:acc:NP_599037;genbank:gi:19548995;genbank:GeneID:935211 Probab=60.71 E-value=0.37 Score=22.97 Aligned_cols=340 Identities=11% Similarity=0.038 Sum_probs=135.3 Q ss_pred CCCcceeeeecCccc---------cchh-hhhhcccccccccCCHHHHhhcceecc----cchhhcchhhhhhhhhhhhc Q lcl|Aclame:pro 1 MKQLSKVHQSLAGRS---------VRAF-DMANGKADYRLTDMAVRELKKFGLVFD----HATVKRQIELLHEGGVATQA 66 (388) Q Consensus 1 ~~~~~~~~~~~~~~~---------~~~~-~~~~~~~~~~~~~~~~~~l~~~g~~~~----~~~~~~~~~~~~~~~~~~~a 66 (388) +.++......+.-.+ .+.. ............. .+.++..-.|. ++......+..... ....+ T Consensus 41 ~~e~~~l~~~i~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~a~~~~l~~~~~~~~~~e~~~~-~~~~a 116 (409) T protein:vir:45 41 KSELEALDERIAREEELRRQDQAYIESNEEEQRQNLDPENNS---QQDEKRAQVFDKWMRHGASELTSEERKAL-RELRA 116 (409) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhcccCCCCCcc---hhhHHHHHHHHHHHHhhhhhccHHHHHHH-HHHhh Confidence 111110000000000 0000 0000000000000 00011000110 00001111111111 11122 Q ss_pred cCcccccccccccch--HHHHHHHhhcceeeeecccchhhhhhcccccCCCCceeeEEEeeecccc-ceEecccccCCce Q lcl|Aclame:pro 67 FDSAYVAPTTQASIP--TPIQFLQQWLPGFVKVLTSARKIDEILGVKTVGSWEDQEIVQGIVEPAG-TAMEYGDLTNIPL 143 (388) Q Consensus 67 mDaa~~~~~t~~~~g--~l~~~l~~idp~v~e~l~~~~~~~~i~~v~t~g~w~~~t~~~~v~e~~G-~a~~ygd~~diP~ 143 (388) +- ..+.+..| +|..+. ++|++.+......+.+..+.+... .....+...+..+ .+...+....+|- T Consensus 117 ~~-----~~~~~~gg~liP~~~~----~~ii~~~~~~~~l~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~v~E~~~~~~ 185 (409) T protein:vir:45 117 QG-----VAQDEKGGYTVPETFL----AKVVEKMKSYGGIASVAQILTTSD--GRTMEWATADGTSEVGVLLGENEEAGE 185 (409) T ss_pred cc-----CccCcCCceeccHhHH----HHHHHHHHhhhhhhhhceeeecCC--CceEEEEeeccCccccccccccccccc Confidence 21 11223334 444443 456666655554555544332222 1223344444333 3346677777888 Q ss_pred eeeeeeeeeeeEEEEE-EEEeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEEeecCccccceEEEeecCCCcc Q lcl|Aclame:pro 144 SSWNVNFERRTIVRGE-MGIQVGLLEEGRASAMRINSAEVKRQGAAVQLEIMRNAIGFYGWEGKNGNRTFGFLNDPSLLP 222 (388) Q Consensus 144 ~~~n~~~~~~~v~~~~-~~~~y~~~El~~A~~~g~~l~~~K~~aAr~a~~~~~n~i~~~G~a~~~~~g~~GllN~P~l~a 222 (388) .+..........+... ..+.++.+=++- ...++.+.-......++...+|+-.++|+.-....+..|+++++.... T Consensus 186 ~~~~f~~~~l~~~k~~~~~i~is~ell~d---s~~~l~~~i~~~la~a~~~~~~~a~l~G~G~~~~~~p~Gil~~~~~~~ 262 (409) T protein:vir:45 186 EDTDFGMGSLGALKMTSKIIRVSNELLQD---SAIDMEAYLARRIAERIGRGEARYLIQGTGAGTPKQPKGLAASVTGTT 262 (409) T ss_pred cccccceeeeeeeeeeeeehhhhHHHHhc---cHHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCccccceeeecccccc Confidence 7776665555444443 334566643332 345788888888888888999999999985332345789999865321 Q ss_pred ccccccCCcccccccCCHHHHHHHHHHHHHHHHHhcCCeeccccccceEEcCHHHHHhhccC-CCcCccHHH-HHHHh-- Q lcl|Aclame:pro 223 AIASTTPGGWVSGGANAFQGIVGDLRLMLITLRVQSEDNIDPEDVDITLVLPMNKVDMLSVV-TDLGISVRD-WLKQT-- 298 (388) Q Consensus 223 ~~~~~~~~~~t~Wa~kT~~eI~~DI~~~~~~l~~~s~g~v~~~~~p~tL~Lp~~~~~~Ls~~-~~~~~Tvl~-~lk~n-- 298 (388) . +... .+-| ++||.+++..|-..-. ....-.+++.+..+..|..- +..|.-+++ -+... T Consensus 263 ~---~~~~-----~~~~----~d~i~~l~~~l~~~~~-----~~a~~~~~~n~~~~~~l~~lkd~~G~~i~~~~~~~~~~ 325 (409) T protein:vir:45 263 Q---TAAA-----NAVK----WQEILALKHSIDPAYR-----RGPKFRLAFNDNTLKLISEMEDGQGRPLWLPDIVGVAP 325 (409) T ss_pred c---cccc-----cccc----hHHHHHHHHhhhhhhc-----cCCeEEEEECHHHHHHHHHhhcCCCceeeccCcCCCCC Confidence 1 1111 1112 4566667776654321 11112356777777777542 333433321 00011 Q ss_pred --CCccEEEEccccccccCCCCccEEEEEEcccccccccccCCCcceEeecchhhhccCceeccCceEEecccceeeeee Q lcl|Aclame:pro 299 --YPRVRVMSAPELQGGNPDDGKDIAYMFLDSVDTAVDGSTDGGDTWAQLVQSKFVTLGVEKRVKNYVEAYSNATAGVML 376 (388) Q Consensus 299 --~pnl~i~~~pel~~a~gtg~~~~~~~~~~~~d~~~~~~~~~~~t~~~~~p~~~r~~~v~~~~~~~~~~~~~~t~G~ii 376 (388) ..+..++....+.. .+.++..++|-+ ....... ..+..+... ..+.|.. .....+.+..|.+|. + T Consensus 326 ~~l~G~PV~~~~~~p~---~~~~~~~i~~Gd-~~~~~i~-~~~~~~~~~-~~d~~~~------~~~~~~~~~~r~d~~-~ 392 (409) T protein:vir:45 326 ASVLNVPYVIDQEIDD---IGAGKKFMFCGD-FDRFIIR-RVRYMILKR-LVERYAE------YDQTGFLAFHRFDCI-L 392 (409) T ss_pred ceecceeeEEecCcCC---ccCCccEEEEee-hhhhhee-eccceEEEE-eeccccc------CCcEEEEEEEEeccE-e Confidence 11222332222211 122333344422 1111111 111111111 1122211 111233444455444 6 Q ss_pred eccccceeeccC Q lcl|Aclame:pro 377 KRPWAVVRLIGL 388 (388) Q Consensus 377 ~rP~ai~~~~GI 388 (388) .+|-||+.+.+= T Consensus 393 ~~~~A~~~l~~k 404 (409) T protein:vir:45 393 EDTSAIKALVGK 404 (409) T ss_pred echhheEEEEec Confidence 669988877765 No 107 >protein:vir:105645 Length: 400 # NCBI annotation: putative major capsid protein # Family: family:all:2806 # MgeID: mge:1674 # MgeName: K1E # Cross-refs: genbank:acc:YP_425009;genbank:gi:83571757;uniprot:Q2WC43;genbank:GeneID:3837286 Probab=59.87 E-value=0.38 Score=22.86 Aligned_cols=290 Identities=12% Similarity=0.025 Sum_probs=110.9 Q ss_pred hccCcccccccccccchHHHHHHHhhcceeeeecccchhhhhhcccccCCCCceeeEEEeeeccccceEeccc--ccCCc Q lcl|Aclame:pro 65 QAFDSAYVAPTTQASIPTPIQFLQQWLPGFVKVLTSARKIDEILGVKTVGSWEDQEIVQGIVEPAGTAMEYGD--LTNIP 142 (388) Q Consensus 65 ~amDaa~~~~~t~~~~g~l~~~l~~idp~v~e~l~~~~~~~~i~~v~t~g~w~~~t~~~~v~e~~G~a~~ygd--~~diP 142 (388) |+.-....-+...++..-..-+|+-+.-+|......--..++++.+.|.-. ..++.|+.. |+....+= +..+ T Consensus 1 Ms~~n~~t~p~~~gsg~~~aL~Le~f~GeV~taF~~~si~~~~~~vRtI~~--gkS~qf~~l---G~s~a~y~~pG~~l- 74 (400) T protein:vir:10 1 MSTPNNLTNVAVSASGEVDSLLIEKFNGKVNEQYLKGENIMSYFDVQTVTG--TNTVSNKYL---GETELQVLAPGQSP- 74 (400) T ss_pred CCCCccccccccccccchhhhHHhHhcchHHHHHHHHhhhcccceeeeecc--cceEEEEEe---eeeEEeeecCCCCc- Confidence 110001111111111112333444444433333222233445555554211 145555443 66654421 1111 Q ss_pred eee-eeeeee--eeeEEEEEEEEeecHHHHHHHHHhCCC-hHHHHHHHHHHHHHHhhce-----EEEEeecCc-cccceE Q lcl|Aclame:pro 143 LSS-WNVNFE--RRTIVRGEMGIQVGLLEEGRASAMRIN-SAEVKRQGAAVQLEIMRNA-----IGFYGWEGK-NGNRTF 212 (388) Q Consensus 143 ~~~-~n~~~~--~~~v~~~~~~~~y~~~El~~A~~~g~~-l~~~K~~aAr~a~~~~~n~-----i~~~G~a~~-~~~g~~ 212 (388) ... +.-++. +..--.+.-.+=|.+.|. +.-++ +-++-....-.++.++.|+ +..-|.+.. ...++. T Consensus 75 dg~~~~~dk~~ItIDtLL~a~~~V~dlDd~----q~~yD~vRse~s~e~G~ALA~~~Dq~iiq~i~~a~~a~t~~~~~~~ 150 (400) T protein:vir:10 75 AATSTQADKNQLVIDATVIARNTVAHLHDV----QGDIDSLKPKLATNQAKQLKKMEDEMLIQQMLLGGIANTQAKRTNP 150 (400) T ss_pred CCCCcccCcEEEEeCceeeecchhhhHHHH----hhccccccHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccC Confidence 111 111111 111111222222344333 12222 2222222222333333332 111111100 001122 Q ss_pred EEeecCCCccccccccCCcccccccCCHHHHHHHHHHHHHHHHHhcCCeeccccccceEEcCHHHHHhhccC-----CCc Q lcl|Aclame:pro 213 GFLNDPSLLPAIASTTPGGWVSGGANAFQGIVGDLRLMLITLRVQSEDNIDPEDVDITLVLPMNKVDMLSVV-----TDL 287 (388) Q Consensus 213 GllN~P~l~a~~~~~~~~~~t~Wa~kT~~eI~~DI~~~~~~l~~~s~g~v~~~~~p~tL~Lp~~~~~~Ls~~-----~~~ 287 (388) |..-++. +.+.. +...-+..+++++.+.|..+..++.... +. . ....+++||+.|..|... .++ T Consensus 151 ~g~~~g~-----s~~v~-~~~~~~~~~~~~l~~A~~~A~~~LdEkd---VP-~-~d~vvl~pp~~Ys~Ll~~dkLvnrdf 219 (400) T protein:vir:10 151 RVKGHGF-----SVNVE-VNEGEALVNPQYVMAAVEFALEQQLEQE---VD-I-SDVAILMPWRYFNVLRDADRIVDKSY 219 (400) T ss_pred Ccccccc-----ceeec-ccccccccCHHHHHHHHHHHHHHHHhcC---CC-c-cceEEEcCHHHHHHHHhCCcccchhc Confidence 2221111 11111 1122244588999999988888876443 33 2 246899999999887432 244 Q ss_pred CccH-HHHHHH---hCCccEEEEcccccccc--------------------CCCCccEEEEEEcccccccccccCCCcce Q lcl|Aclame:pro 288 GISV-RDWLKQ---TYPRVRVMSAPELQGGN--------------------PDDGKDIAYMFLDSVDTAVDGSTDGGDTW 343 (388) Q Consensus 288 ~~Tv-l~~lk~---n~pnl~i~~~pel~~a~--------------------gtg~~~~~~~~~~~~d~~~~~~~~~~~t~ 343 (388) +.+- .++.+. +--+++|+..+.|-... +.-.+.++++|-++--.. T Consensus 220 ~~s~~g~~~~g~v~~v~Gv~Iv~Sn~lP~~a~~~~~~~lS~a~~G~~y~~t~d~s~~~av~F~~sAv~t----------- 288 (400) T protein:vir:10 220 TISQSGATIQGFVLSSYNCPVIPSNRFPKYSQGQKHHLLSNEDNGYRYDPIAEMNGAIAVLFTADALLV----------- 288 (400) T ss_pred cccCCCccccceEEEEeceEEEeeCcCCcccCcccccccccCCCCccCCccccccceeEEEEehhheEE----------- Confidence 3221 223322 13356677666662211 111223444443331000 Q ss_pred EeecchhhhccCceeccCceEEecccceeeeeeeccccceeeccC Q lcl|Aclame:pro 344 AQLVQSKFVTLGVEKRVKNYVEAYSNATAGVMLKRPWAVVRLIGL 388 (388) Q Consensus 344 ~~~~p~~~r~~~v~~~~~~~~~~~~~~t~G~ii~rP~ai~~~~GI 388 (388) -++.+.-.+.. =+.+...|.+++-. -.|+..+||-|++-++=- T Consensus 289 vk~~~lt~~~~-~d~r~~~~~id~~~-a~G~g~~RPeaa~vv~~~ 331 (400) T protein:vir:10 289 GRSIDVIGDIF-YEKKEKTYYIDTFM-SEGAIPDRWEAVSVVTTK 331 (400) T ss_pred EEeeccccccc-cchhhHHHHHHHHH-HhCCcccchhheEEEEec Confidence 00000000000 02334455555433 468999999988766544 No 108 >protein:vir:3158 Length: 321 # NCBI annotation: capsid protein gpE # Family: family:all:1377 # ACLAME annotation(s): phi:0000161 - phage head/capsid # MgeID: mge:316 # MgeName: PhiCh1 # Cross-refs: genbank:acc:NP_665929;genbank:gi:22091115;genbank:GeneID:951342 Probab=57.10 E-value=0.44 Score=22.53 Aligned_cols=291 Identities=12% Similarity=0.005 Sum_probs=111.4 Q ss_pred hhhcccccccccCCHHHHhhcceecccchhhcchhhhhhhhhhhhccCcccccccccccch--HHHHHHHhhcceeeeec Q lcl|Aclame:pro 21 MANGKADYRLTDMAVRELKKFGLVFDHATVKRQIELLHEGGVATQAFDSAYVAPTTQASIP--TPIQFLQQWLPGFVKVL 98 (388) Q Consensus 21 ~~~~~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~amDaa~~~~~t~~~~g--~l~~~l~~idp~v~e~l 98 (388) |+.+ -+++ .+....-+ -.+. . .....| ++-.+.+.+...+.+ T Consensus 1 ~~~k------------~~~~---------------~l~~~~~~-~~~~-----~-~~~~~g~~v~~~~~~~l~~~i~e-- 44 (321) T protein:vir:31 1 MASR------------TINN---------------DLSRITEK-NALT-----V-DDLDAGGTLPDPLWDEFWTDMIE-- 44 (321) T ss_pred CchH------------HHHH---------------HHHHHHHh-cccc-----c-cccCCcceeCHHHHHHHHHHHHH-- Confidence 1111 1111 00000000 0010 0 111222 333444433333332 Q ss_pred ccchhhhhhcccccCCCCceeeEEEeeeccccceEeccc--ccCCceeeeeeeeeeeeEEEEEEEEeecHHHHHHHHHhC Q lcl|Aclame:pro 99 TSARKIDEILGVKTVGSWEDQEIVQGIVEPAGTAMEYGD--LTNIPLSSWNVNFERRTIVRGEMGIQVGLLEEGRASAMR 176 (388) Q Consensus 99 ~~~~~~~~i~~v~t~g~w~~~t~~~~v~e~~G~a~~ygd--~~diP~~~~n~~~~~~~v~~~~~~~~y~~~El~~A~~~g 176 (388) .-..+..+.+.......- ....++..|.+...++ ....+..+...+...-..+.+.....++.+-| ...+.+ T Consensus 45 --~s~~l~~i~v~~v~~~~~---~i~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~k~~~~~~it~e~L-~d~a~~ 118 (321) T protein:vir:31 45 --ETPLLDAIRTETVGAKKT---RIPTLNIGERHRRPQDEGEWNENESDVSTGTIDISTEKATVAWDLPREVV-QENPEG 118 (321) T ss_pred --hhhhhhhceeeeccCcce---eeeeeccCCcccccccccccccccccceeeeeeeeeEEEEeehhccHHHH-Hhhhcc Confidence 222233333322222111 1112222222222121 11222333334444444555555555555444 333456 Q ss_pred CChHHHHHHHHHHHHHHhhceEEEEeecCccc---cceEEEeecCCCccccccccCCccccccc-CCHHHHHHHHHHHHH Q lcl|Aclame:pro 177 INSAEVKRQGAAVQLEIMRNAIGFYGWEGKNG---NRTFGFLNDPSLLPAIASTTPGGWVSGGA-NAFQGIVGDLRLMLI 252 (388) Q Consensus 177 ~~l~~~K~~aAr~a~~~~~n~i~~~G~a~~~~---~g~~GllN~P~l~a~~~~~~~~~~t~Wa~-kT~~eI~~DI~~~~~ 252 (388) -++...-....++++...++.++|.|+..... ..+.|+|+.+.-.+ .+. .++. ... ++++.+++. T Consensus 119 ~d~e~~i~~~ia~~~a~~~~~~~~nGd~~~~~~~~~~n~G~l~~a~~~~---~~~-----~~~~~~~~---~d~l~~l~~ 187 (321) T protein:vir:31 119 EALADRILNLMTDAWSADVEDLAANGDEDAEDSFENQNDGFITVAEGDV---ETI-----DAADDILD---NDLVIRTIA 187 (321) T ss_pred hhHHHHHHHHHHHHHHHHHHhheeeccccCCCcccccchhhhhhhcccc---ccc-----cccccccC---HHHHHHHHH Confidence 78999999999999999999999999853211 11246655432110 010 1111 111 223333443 Q ss_pred HHHHhcCCeeccccccceEEcCHHHHHhhc----cC-CCcCccHHHH-HHHhCCccEEEEccccccccCCCCccEEEEEE Q lcl|Aclame:pro 253 TLRVQSEDNIDPEDVDITLVLPMNKVDMLS----VV-TDLGISVRDW-LKQTYPRVRVMSAPELQGGNPDDGKDIAYMFL 326 (388) Q Consensus 253 ~l~~~s~g~v~~~~~p~tL~Lp~~~~~~Ls----~~-~~~~~Tvl~~-lk~n~pnl~i~~~pel~~a~gtg~~~~~~~~~ 326 (388) .|-..- .++.-...+|....+..+. .. +..+...+.- -..++-++.++.+|.+-.. .+++. T Consensus 188 ~l~~~y-----r~~~~~v~im~~~~~~~~~~~l~~~~~~~~~~~l~~~~~~tl~G~pvv~~~~mP~~--------~il~t 254 (321) T protein:vir:31 188 GLDSKY-----RARMNPALIVSEDQLLSYHYTLTDRDTPLGDNVIMGEADVNPFSFPIIGSGLWPDD--------KAMFT 254 (321) T ss_pred hccHhH-----hcCCCeEEEechHHHHHHHHHHhcCCCccccchhhccccccccceeEEEcCCCCCC--------cEEEe Confidence 332210 0111235667777654321 11 1112222111 0112334556666666321 12222 Q ss_pred cccccccccccCCCcceEeecchhhhccCc--ee--ccCceEEecccceeeeeeeccccceeeccC Q lcl|Aclame:pro 327 DSVDTAVDGSTDGGDTWAQLVQSKFVTLGV--EK--RVKNYVEAYSNATAGVMLKRPWAVVRLIGL 388 (388) Q Consensus 327 ~~~d~~~~~~~~~~~t~~~~~p~~~r~~~v--~~--~~~~~~~~~~~~t~G~ii~rP~ai~~~~GI 388 (388) + .+........ ...+|...- .. +...++ .+..+--|.+|..+-+++.+.|| T Consensus 255 ~-~~nl~~~~~~---------~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~ve~~~a~a~~~~i 309 (321) T protein:vir:31 255 D-PQNLIYALYR---------DLEIDVLTESDKVSERDLHAR-YFMRGDDDFAIENTEAVVLAEGL 309 (321) T ss_pred c-cccEEEEEee---------ccEEEEeecCccccccceeeE-eeeeeecceeEeccccEEEEecC Confidence 1 1111111000 001111000 00 011111 11222357889999999999999 No 109 >protein:vir:99675 Length: 324 # NCBI annotation: Major capsid protein # Family: family:all:975 # MgeID: mge:1523 # MgeName: VP4 # Cross-refs: genbank:acc:YP_249589;genbank:gi:68299740;genbank:GeneID:3799990 Probab=53.32 E-value=0.42 Score=22.67 Aligned_cols=256 Identities=10% Similarity=0.015 Sum_probs=101.2 Q ss_pred hcccccCCCCceeeEEEeeeccccceEecc----c-----ccCCceeeeeeeeeeeeEEEEEEEEeecHHHHHHHHHhCC Q lcl|Aclame:pro 107 ILGVKTVGSWEDQEIVQGIVEPAGTAMEYG----D-----LTNIPLSSWNVNFERRTIVRGEMGIQVGLLEEGRASAMRI 177 (388) Q Consensus 107 i~~v~t~g~w~~~t~~~~v~e~~G~a~~yg----d-----~~diP~~~~n~~~~~~~v~~~~~~~~y~~~El~~A~~~g~ 177 (388) ++---++| .++.|+. .|++.... . -.+++-.+....--....++ .-+.++..+ ++.. T Consensus 1 ~vr~i~~g----~s~~~~~---iG~~~~~~~~~G~~l~~~~~~~~~~e~~itID~~l~~~------~~VdDiD~~-qa~~ 66 (324) T protein:vir:99 1 MTRTITSG----KSAQFPV---MGRTKARYLKQGQSLDDGREDIKHTEKVITIDGLLTTD------VLIYDIEDA-MNHY 66 (324) T ss_pred CeeeeecC----ceEEEee---eeeeEeccccCCCCcCCCcCCcCcccEEEEecchhhhh------hhhhhHHHH-hcCc Confidence 22222223 3444443 35555432 1 12334444222212222222 222233333 3556 Q ss_pred ChHHHHHHHHHHHHHHhhceEEE---EeecCcccc-ceEEEeecCCCccccccccCCcccccccCCHHHHHHHHHHHHHH Q lcl|Aclame:pro 178 NSAEVKRQGAAVQLEIMRNAIGF---YGWEGKNGN-RTFGFLNDPSLLPAIASTTPGGWVSGGANAFQGIVGDLRLMLIT 253 (388) Q Consensus 178 ~l~~~K~~aAr~a~~~~~n~i~~---~G~a~~~~~-g~~GllN~P~l~a~~~~~~~~~~t~Wa~kT~~eI~~DI~~~~~~ 253 (388) ++-++-.+.+..++.+..|+..+ .+.+...+. ...+.....+. .. ...++...-+..+++.+++-|..+-.. T Consensus 67 Dlr~e~s~~~G~aLA~~~Dq~i~~~~a~~~~~~a~~~~~~~~~~g~~---~~-~~~~~~~~~~~~~~~~~~dai~~a~~~ 142 (324) T protein:vir:99 67 DVRSEYSTQMGEALAMAADVANYAEMAKLVNSRKETTNENIEGLGAA---SL-VKITGKKEDPAKYGTQVIQALTYARAA 142 (324) T ss_pred cchhHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccCCcccCCcc---ce-ecccccccccccCHHHHHHHHHHHHHH Confidence 77778778888888888886543 110000000 00011111111 00 111222333457788999999888888 Q ss_pred HHHhcCCeeccccccceEEcCHHHHHhhccC---C--CcCccHHHHHHH---hCCccEEEEccccccccCCCCccEE--- Q lcl|Aclame:pro 254 LRVQSEDNIDPEDVDITLVLPMNKVDMLSVV---T--DLGISVRDWLKQ---TYPRVRVMSAPELQGGNPDDGKDIA--- 322 (388) Q Consensus 254 l~~~s~g~v~~~~~p~tL~Lp~~~~~~Ls~~---~--~~~~Tvl~~lk~---n~pnl~i~~~pel~~a~gtg~~~~~--- 322 (388) |-.+. + |+ ..-.+++||.+|..|... + .++ +...+.+- +.-+++|...+.|-...+++..+.+ T Consensus 143 Lde~~---V-P~-~gR~~vv~P~~y~~Ll~~~~~~~~~~~-~~~~~~~G~V~~i~Gf~V~~Sn~lp~~~~t~~~~a~~~~ 216 (324) T protein:vir:99 143 FAKKY---I-PA-GDRTFYTDPDTYSAILAALMPNAANYA-ALIDPETGNIRNVMGFEVVETPHMTAQMVTNPTDAFDGT 216 (324) T ss_pred HhhcC---C-CC-CCCEEEeChHHHHHHhhcccccccccc-cccceecceEEEEeceEEEecCCcccccccccccccccc Confidence 87665 3 33 346799999999988432 1 111 01111110 0123455555544322111111000 Q ss_pred ---EEEEcccccccccccCCCcceEeecchh-----------hhccCceeccCceEEecccceeeeeeeccccceeec-- Q lcl|Aclame:pro 323 ---YMFLDSVDTAVDGSTDGGDTWAQLVQSK-----------FVTLGVEKRVKNYVEAYSNATAGVMLKRPWAVVRLI-- 386 (388) Q Consensus 323 ---~~~~~~~d~~~~~~~~~~~t~~~~~p~~-----------~r~~~v~~~~~~~~~~~~~~t~G~ii~rP~ai~~~~-- 386 (388) .-+..+...-..--.+-..+.-..|+.+ .....- .+...|.++-.. -.|+.+.||-+++-.. T Consensus 217 ~~~~~~~~~~~~~~ky~~d~~~~~gl~~~~~a~~tv~~~~~~~e~~~~-~~~~~d~i~~~~-a~G~~~lRPe~a~~v~l~ 294 (324) T protein:vir:99 217 GHIFPATGDSTTTGKMTVGADNVVGLFVHRSAVATLKLKDMALERARR-PEYQADQIIAKY-AMGHGGLRPEAVGAIIFE 294 (324) T ss_pred ccccccccccccccccccccCceeEEEEehhheEEEeeecceecceec-hhhHHHhhhhhh-hhcCcccccceEEEEEEc Confidence 0000000000000000001111111111 111110 111223333222 3488899998775443 Q ss_pred -----cC Q lcl|Aclame:pro 387 -----GL 388 (388) Q Consensus 387 -----GI 388 (388) |+ T Consensus 295 ~~~~~~~ 301 (324) T protein:vir:99 295 DGETPAV 301 (324) T ss_pred cCccccc Confidence 33 No 110 >protein:vir:95376 Length: 425 # NCBI annotation: phage major capsid protein # Family: family:all:635 # MgeID: mge:1567 # MgeName: GBSV1 # Cross-refs: genbank:acc:YP_764476;genbank:gi:115334630;genbank:GeneID:5179263 Probab=52.93 E-value=0.54 Score=22.04 Aligned_cols=328 Identities=13% Similarity=0.065 Sum_probs=126.9 Q ss_pred CCCcceeeeecCcc-------------------ccchhhhhhcccccccccCCHHHHhhcceecccchhhcchhhhhhhh Q lcl|Aclame:pro 1 MKQLSKVHQSLAGR-------------------SVRAFDMANGKADYRLTDMAVRELKKFGLVFDHATVKRQIELLHEGG 61 (388) Q Consensus 1 ~~~~~~~~~~~~~~-------------------~~~~~~~~~~~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~ 61 (388) ...+...+..+-.. ..++.+...... ..........+.+.+..+-.. +. .... T Consensus 63 ~~~l~~~~~~le~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~------~~-~~~~ 134 (425) T protein:vir:95 63 RNELNEKKSKLEGEIAQLEDELEQINSKQPSNQSRQKMQGSKGDV-VEMNRLQVREMLKTGEYYKRS------EV-VEFY 134 (425) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhhhccchhhhhhhhhhhhhH-HHHHHHHHHHHHhhhhhhhhh------HH-HHHH Confidence 00000000000000 000000000000 000000001111111111100 00 0000 Q ss_pred hhhhccCcccccccccccch--HHHHHHHhhcceeeeecccchhhhhhcccccCCCCceeeEEEeeeccccceEeccccc Q lcl|Aclame:pro 62 VATQAFDSAYVAPTTQASIP--TPIQFLQQWLPGFVKVLTSARKIDEILGVKTVGSWEDQEIVQGIVEPAGTAMEYGDLT 139 (388) Q Consensus 62 ~~~~amDaa~~~~~t~~~~g--~l~~~l~~idp~v~e~l~~~~~~~~i~~v~t~g~w~~~t~~~~v~e~~G~a~~ygd~~ 139 (388) ....+. .++++.| +|..+.. +|++.+-...-...++.+.... ....+++....+.+...+... T Consensus 135 ~~~~~~-------~~~~~gg~~vP~~~~~----~Ii~~l~~~~~i~~~~~~~~~~----g~~~ip~~~~~~~a~~v~E~~ 199 (425) T protein:vir:95 135 EKFRNL-------RAVAGGELTIPEVVVN----RIMDIMGDYTTLYPLVDKIRVK----GTTRILVDTDTSPATWIEQSG 199 (425) T ss_pred HHHHhh-------cccccCceeccHHHHH----HHHHHHHhhhhHHHhhceeecC----ceeEEEEecCCcccccccccc Confidence 011111 1223333 4555444 4455443333334444332221 123566766777777788888 Q ss_pred CCceeee-eeeeeeeeEEEEEEEEeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEEeecCccccceEEEeecC Q lcl|Aclame:pro 140 NIPLSSW-NVNFERRTIVRGEMGIQVGLLEEGRASAMRINSAEVKRQGAAVQLEIMRNAIGFYGWEGKNGNRTFGFLNDP 218 (388) Q Consensus 140 diP~~~~-n~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~K~~aAr~a~~~~~n~i~~~G~a~~~~~g~~GllN~P 218 (388) .+|..+. ..+...-..+.+...+.++.+=|..+ ..++.+--....+.++...+++-.+.|+... ...-.|+|++- T Consensus 200 ~~~~~~~~~f~~i~l~~~k~~~~~~iS~ell~ds---~~~l~~~i~~~l~~~i~~~~d~~il~G~G~~-~~~p~Gil~~~ 275 (425) T protein:vir:95 200 ALPTGDVGTIASIDFDGFKVGKVTFVDNYLLQDS---IINLDDYVTKKIARAIAKALDLAIVKGTGAA-NKQPLGIIPSL 275 (425) T ss_pred ccccccccccceeeeeheeeeeeehhhHHHHhcc---HHHHHHHHHHHHHHHHHHHHHHHhhccCCCC-ccccceeeccc Confidence 8888876 47788888888888888888644333 3468888888899999999999999997532 22356888763 Q ss_pred CCccccccccCCcccccccCCHHHHHHHHHHHHHHHHHhcCCeeccccccceEEcCHH-HHHhhc---c-CCCcCccHHH Q lcl|Aclame:pro 219 SLLPAIASTTPGGWVSGGANAFQGIVGDLRLMLITLRVQSEDNIDPEDVDITLVLPMN-KVDMLS---V-VTDLGISVRD 293 (388) Q Consensus 219 ~l~a~~~~~~~~~~t~Wa~kT~~eI~~DI~~~~~~l~~~s~g~v~~~~~p~tL~Lp~~-~~~~Ls---~-~~~~~~Tvl~ 293 (388) .. ....+. ++++. .++||.+++..+....... . ...++|.+. .+..|. . .+..|.-++. T Consensus 276 ~~--~~~~~~------~~~~~---~~~~~~~~~~~~~~~~~~~---~--~~~~v~~~~~~~~~l~~l~~~kd~~g~~i~~ 339 (425) T protein:vir:95 276 PP--ENQVTV------EADNN---LLKNLVKQIGLIDTGDDSV---G--EIVAVMKRSTYYNRLVEFSIQVDSNGNVVGK 339 (425) T ss_pred cc--cccccc------ccccc---hHHHHHHHHHhhhhhcccc---C--ceEEEEeChHHHHHHHHHHhhcCCCCceeec Confidence 22 111111 11111 3456666665554433210 1 113444443 443332 1 2333432211 Q ss_pred HHHHhCCccEEEEccccccccCCCCccEEEEEEcccccccccccCCCcceEeecch-hhhccCceeccCceEEeccccee Q lcl|Aclame:pro 294 WLKQTYPRVRVMSAPELQGGNPDDGKDIAYMFLDSVDTAVDGSTDGGDTWAQLVQS-KFVTLGVEKRVKNYVEAYSNATA 372 (388) Q Consensus 294 ~lk~n~pnl~i~~~pel~~a~gtg~~~~~~~~~~~~d~~~~~~~~~~~t~~~~~p~-~~r~~~v~~~~~~~~~~~~~~t~ 372 (388) ..+.+.=+|-..|-....+ -+.+. ++|.+ ........ ..+.++... ++ .|.. -...+-...|.. T Consensus 340 --~~~~~~~~l~G~pvv~~~~-~~~~~--i~~Gd-~~~~~~~~-~~~~~i~~~-~~~~f~~-------~~~~~~~~~r~d 404 (425) T protein:vir:95 340 --LPNLRTPDLLGLRVVFNNF-LDDDT--VLFGE-FEQYTLVE-RENITIDSS-THVKFTE-------DQTAFRGKGRFD 404 (425) T ss_pred --cCCCCCccccceeeEEcCc-CCCcc--EEEEe-cccEEEEe-ecceEEEee-ccccccc-------CceEEEEEEeeC Confidence 1111111111111111000 01111 12211 00000000 000111110 00 0000 011222233444 Q ss_pred eeeeeccccceeeccC Q lcl|Aclame:pro 373 GVMLKRPWAVVRLIGL 388 (388) Q Consensus 373 G~ii~rP~ai~~~~GI 388 (388) | -+.+|-||+.++ | T Consensus 405 ~-~~~~~~a~~~~~-i 418 (425) T protein:vir:95 405 G-KPVKPEAFVLVT-I 418 (425) T ss_pred c-EeecccceEEEE-e Confidence 4 444588877763 3 No 111 >protein:vir:7019 Length: 401 # NCBI annotation: major capsid protein # Family: family:all:2806 # MgeID: mge:141 # MgeName: SP6 # Cross-refs: genbank:acc:NP_853592;genbank:gi:31711674;genbank:GeneID:1481800 Probab=52.43 E-value=0.55 Score=21.99 Aligned_cols=295 Identities=12% Similarity=0.028 Sum_probs=110.3 Q ss_pred hccCcccccccccccchHHHHHHHhhcceeeeecccchhhhhhcccccCCCCceeeEEEeeeccccceEecc--cccCCc Q lcl|Aclame:pro 65 QAFDSAYVAPTTQASIPTPIQFLQQWLPGFVKVLTSARKIDEILGVKTVGSWEDQEIVQGIVEPAGTAMEYG--DLTNIP 142 (388) Q Consensus 65 ~amDaa~~~~~t~~~~g~l~~~l~~idp~v~e~l~~~~~~~~i~~v~t~g~w~~~t~~~~v~e~~G~a~~yg--d~~diP 142 (388) |+.-.....+...++..-..-+|+-+.-+|......--..++++.+.+.-. ..++.|+.. |+....+ -+..+= T Consensus 1 Ms~~n~~t~~~~~~sg~~~al~Le~f~GeV~taF~~~si~~~~~~vRti~~--gkS~qf~~~---G~s~~~~~~pG~~ld 75 (401) T protein:vir:70 1 MSTPNNLTNVAVSASGEVDSLLIEKFNGKVNEQYLKGENIMSYFDVQTVTG--TNTVSNKYL---GETELQVLAPGQSPA 75 (401) T ss_pred CCCCccccccccccccchhHhHHhHhcchHHHHHHHHhhhcccceeeeecc--cceEEEEEe---eeeEeeeecCCCCcC Confidence 111111111111111112333444444433333222233445555554211 144455443 6665442 111110 Q ss_pred eeeeeeeeeeee--EEEEEEEEeecHHHHHHHH-HhCCChHHHHHHHHHHHHHHhhceEEE-EeecCccccceEEEeecC Q lcl|Aclame:pro 143 LSSWNVNFERRT--IVRGEMGIQVGLLEEGRAS-AMRINSAEVKRQGAAVQLEIMRNAIGF-YGWEGKNGNRTFGFLNDP 218 (388) Q Consensus 143 ~~~~n~~~~~~~--v~~~~~~~~y~~~El~~A~-~~g~~l~~~K~~aAr~a~~~~~n~i~~-~G~a~~~~~g~~GllN~P 218 (388) -.....++.... --.+.-.+=|.+.|.+.-- ..+-.+..+-..+-++.+++++=+... -|.+. .-+-=..| T Consensus 76 ~~~~~~dK~~ItID~lL~a~~~V~dlDe~q~~yD~vRse~s~e~G~ALA~~~Dq~iiq~i~~aa~an-----a~~~~~~p 150 (401) T protein:vir:70 76 ATSTQADKNQLVIDATVIARNTVAHLHDVQGDIDSLKPKLATNQAKQLKRMEDEMLIQQMMLGGIAN-----TQAKRTNP 150 (401) T ss_pred CCCcccccEEEEeCceeehhhhhhhHHHHHhcccccchHHHHHHHHHHHHHHHHHHHHHHHHhcccc-----ccccccCC Confidence 011111111111 1111122223343332211 123334444444445555554422221 11110 00000001 Q ss_pred CCcc-ccccccCCcccccccCCHHHHHHHHHHHHHHHHHhcCCeeccccccceEEcCHHHHHhhccC-----CCcCcc-H Q lcl|Aclame:pro 219 SLLP-AIASTTPGGWVSGGANAFQGIVGDLRLMLITLRVQSEDNIDPEDVDITLVLPMNKVDMLSVV-----TDLGIS-V 291 (388) Q Consensus 219 ~l~a-~~~~~~~~~~t~Wa~kT~~eI~~DI~~~~~~l~~~s~g~v~~~~~p~tL~Lp~~~~~~Ls~~-----~~~~~T-v 291 (388) ..-. ..+-+. ++...-...+++++.+-|..+..++..+. + |. ....+++||..|..|... .+|+.+ - T Consensus 151 ~~~~~G~~i~v-~~~~~~~~~~~~~l~~ai~dA~~~LdEkd---V-P~-~r~vvl~pp~~Ys~Ll~~d~L~nrd~~~s~~ 224 (401) T protein:vir:70 151 RVKGHGFSINV-EVAEGEALVNPQYVMAAVEFALEQQLEQE---V-DI-SDVAILMPWRYFNVLRDADRIVDKTYTISQS 224 (401) T ss_pred CcCCCceEEec-cccccccccCHHHHHHHHHHHHHHHHhcC---C-Cc-cceEEEcCHHHHHHHHhcCcccchhhccccC Confidence 0000 000011 11222245789999999999888877554 3 32 357888999999877432 233221 1 Q ss_pred HHHHHHh---CCccEEEEcccccc------------cc--------CCCCccEEEEEEcccccccccccCCCcceEeecc Q lcl|Aclame:pro 292 RDWLKQT---YPRVRVMSAPELQG------------GN--------PDDGKDIAYMFLDSVDTAVDGSTDGGDTWAQLVQ 348 (388) Q Consensus 292 l~~lk~n---~pnl~i~~~pel~~------------a~--------gtg~~~~~~~~~~~~d~~~~~~~~~~~t~~~~~p 348 (388) ..+.+.. --+++|+..+.|-. +. +.-.+.++++|-++--.. -++.+ T Consensus 225 g~~~~G~v~~vaGv~Vv~SnnlP~~a~~it~~~ls~a~~G~~y~~~~d~s~~~~v~f~~~Av~t-----------vk~~~ 293 (401) T protein:vir:70 225 GATIQGFTLSSYNCPVIPSNRFPKYSQGQTHHLLSNEDNGYRYDPLPAMNGAIAVLFTADALLV-----------GRSID 293 (401) T ss_pred CccccceEEEEeceEEEeeccccccccccccccccccCCCccCCCCccccceeEEEEehhheEE-----------EEeec Confidence 2222221 11344554444421 10 111223444443331000 00000 Q ss_pred hhhhccCceeccCceEEecccceeeeeeeccccceee----ccC Q lcl|Aclame:pro 349 SKFVTLGVEKRVKNYVEAYSNATAGVMLKRPWAVVRL----IGL 388 (388) Q Consensus 349 ~~~r~~~v~~~~~~~~~~~~~~t~G~ii~rP~ai~~~----~GI 388 (388) .-.+.. =+.+...|.+++- .-.|+..+||-|++-+ +|. T Consensus 294 lt~~~~-~d~r~~~~~id~~-~a~g~g~~RPeaa~vv~~k~~~~ 335 (401) T protein:vir:70 294 VTGDIF-YEKKEKTYYIDTF-MAEGAIPDRWEAVSVVTTKRNTT 335 (401) T ss_pred cccchh-hhhhhhHHHHHHH-HHhCCcccchhheEEEeecCccc Confidence 000000 0233345655543 3568999999988664 222 No 112 >protein:vir:1268 Length: 397 # NCBI annotation: hypothetical protein # Family: family:all:21 # MgeID: mge:329 # MgeName: phi-105 # Cross-refs: genbank:acc:NP_690760;genbank:gi:22855000;genbank:GeneID:955203 Probab=51.59 E-value=0.58 Score=21.89 Aligned_cols=328 Identities=9% Similarity=-0.068 Sum_probs=131.6 Q ss_pred CCCcceeeee--------------cCcccc-chhhhhhcc-cccccccCC------HHHHhhcceecccchhhcchhhhh Q lcl|Aclame:pro 1 MKQLSKVHQS--------------LAGRSV-RAFDMANGK-ADYRLTDMA------VRELKKFGLVFDHATVKRQIELLH 58 (388) Q Consensus 1 ~~~~~~~~~~--------------~~~~~~-~~~~~~~~~-~~~~~~~~~------~~~l~~~g~~~~~~~~~~~~~~~~ 58 (388) +..+.+.-.. +.++.- .+....+.. ...+.+... ...+.+ .+.+. ....+.+. T Consensus 39 ~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~---~~~~~--~~~~~~~~ 113 (397) T protein:vir:12 39 LDEVKQLKNQIELMTEGRSLDVPDLPGGVNFVPEQERNPEGQRSQGQGNEERQQQYSKAFLK---GLRGK--RLTDEERD 113 (397) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhcccccccchhhHHHHHHHHHHHH---HHhcc--CCcHHHHH Confidence 0000000000 000000 000000000 000000000 000111 00000 00011110 Q ss_pred hh-hhhhhccCcccccccccccchHHHHHHHhhcceeeeecccchhhhhhcccccCCCCceeeEEEeeeccccceEeccc Q lcl|Aclame:pro 59 EG-GVATQAFDSAYVAPTTQASIPTPIQFLQQWLPGFVKVLTSARKIDEILGVKTVGSWEDQEIVQGIVEPAGTAMEYGD 137 (388) Q Consensus 59 ~~-~~~~~amDaa~~~~~t~~~~g~l~~~l~~idp~v~e~l~~~~~~~~i~~v~t~g~w~~~t~~~~v~e~~G~a~~ygd 137 (388) .. .....+|.. .+.++.|+++- +.+.+.|++.+......++++++.....- ...+.+......+.+...+. T Consensus 114 ~~~~~~~~a~~~-----~~~~~gg~lvP--~~~~~~ii~~~~~~~~l~~~~~~~~~~~~-~~~~~~~~~~~~~~a~~v~E 185 (397) T protein:vir:12 114 LLDSPEFRAMSG-----INDEDGGILIP--EDIGRQIHEFKRQFEPLEQYVTVEPVTTR-SGTRLLEKNADMVPFSPVEE 185 (397) T ss_pred HHhhhhhhhccc-----cccccCcccCc--hhHHHHHHHhhhhhhhHHhhcceeeccCC-ceeEEEEEecCCcceeeecc Confidence 00 011122221 23334443321 23345677777666656666544322111 12233444445556778888 Q ss_pred ccCCcee-eeeeeeeeeeEEEEEEEEeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEEeecCccccceEEEee Q lcl|Aclame:pro 138 LTNIPLS-SWNVNFERRTIVRGEMGIQVGLLEEGRASAMRINSAEVKRQGAAVQLEIMRNAIGFYGWEGKNGNRTFGFLN 216 (388) Q Consensus 138 ~~diP~~-~~n~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~K~~aAr~a~~~~~n~i~~~G~a~~~~~g~~GllN 216 (388) ...+|-. ....+......+.++..+.++.+=++ ....+|.+--....+.++...+|.-.+.|+.... T Consensus 186 g~~~~~~~~~~~~~v~~~~~k~~~~~~is~e~l~---ds~~~l~~~i~~~l~~~~~~~~d~~il~G~g~~~--------- 253 (397) T protein:vir:12 186 LGNLPEIDQPRFTKVSYSIIDYGGIMTLSNSMLN---DSDQAIMTYVAKWFAKKSVVTRNNLILAAIASLK--------- 253 (397) T ss_pred cccccccccccceeEEeeheeeEeeehhhHHHHh---hchHHHHHHHHHHHHHHHHHHHHHHHHhcccccc--------- Confidence 8888854 46788888999999998888885443 2345777778888888888888888888864210 Q ss_pred cCCCccccccccCCcccccccCCHHHHHHHHHHHHH-HHHHhcCCeeccccccceEEcCHHHHHhhcc-CCCcCccHHHH Q lcl|Aclame:pro 217 DPSLLPAIASTTPGGWVSGGANAFQGIVGDLRLMLI-TLRVQSEDNIDPEDVDITLVLPMNKVDMLSV-VTDLGISVRDW 294 (388) Q Consensus 217 ~P~l~a~~~~~~~~~~t~Wa~kT~~eI~~DI~~~~~-~l~~~s~g~v~~~~~p~tL~Lp~~~~~~Ls~-~~~~~~Tvl~~ 294 (388) |.+. .+ ++||..++. .+... + .....++|.+..+..|.+ .+..|.-++.= T Consensus 254 ---------~~g~--------~~----~~~i~~~~~~~l~~~----~---~~~a~~~~n~~~~~~L~~lkd~~G~~l~~~ 305 (397) T protein:vir:12 254 ---------KVDI--------DG----LDGIKKALNVTLDPM----V---APGSIVLTNQDGYDWLDTLKDGTGRYLLQP 305 (397) T ss_pred ---------cccc--------cc----HHHHHHHHhhccchh----h---hCCCEEEEcHHHHHHHHHhhccCCceeecc Confidence 1111 11 345554443 22111 1 112368899998888854 23334322210 Q ss_pred HHHhCCccEEEEcccc--cc-ccCCCCccEEEEEEcccccccccccCCCcceEeecchhhhccCceeccCceEEecccce Q lcl|Aclame:pro 295 LKQTYPRVRVMSAPEL--QG-GNPDDGKDIAYMFLDSVDTAVDGSTDGGDTWAQLVQSKFVTLGVEKRVKNYVEAYSNAT 371 (388) Q Consensus 295 lk~n~pnl~i~~~pel--~~-a~gtg~~~~~~~~~~~~d~~~~~~~~~~~t~~~~~p~~~r~~~v~~~~~~~~~~~~~~t 371 (388) -..+...-+|-..|=+ .. ..+.++++..++|.+-.+........ .-++.. .... ......-.....+..|. T Consensus 306 ~~~~g~~~~l~G~pv~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~-~~~i~~-~~~~----~~~f~~~~~~~r~~~r~ 379 (397) T protein:vir:12 306 DPTNPTKKLLDGRPVVPFTNRVLKTQKGKAPLIIGNLKEAIVLFDRE-QQSIAS-TDTG----AGAFETNSTKVRGIERE 379 (397) T ss_pred cccCCCCccccceeeEEecccccccCCCccEEEEEehhceEEEEeec-ceEEEE-eccc----cchhhcCceEEEEEEee Confidence 0011111123222211 11 11223334333443211000010000 001110 0000 00001112233455555 Q ss_pred eeeeeeccccceeeccC Q lcl|Aclame:pro 372 AGVMLKRPWAVVRLIGL 388 (388) Q Consensus 372 ~G~ii~rP~ai~~~~GI 388 (388) +| .+++|-||+.+.-= T Consensus 380 d~-~~~~~~a~~~~~~t 395 (397) T protein:vir:12 380 DV-RKWDEDAVVFGQIT 395 (397) T ss_pred cc-EEecccceEEEEEe Confidence 55 45668888877655 No 113 >protein:vir:7990 Length: 273 # NCBI annotation: gp6 # Family: family:all:2203 # MgeID: mge:151 # MgeName: Che8 # Cross-refs: genbank:acc:NP_817344;genbank:gi:29565772;genbank:GeneID:1258978 Probab=49.93 E-value=0.62 Score=21.71 Aligned_cols=256 Identities=13% Similarity=0.062 Sum_probs=104.5 Q ss_pred cccchH-HHHHHHhhcceeeeecccchhhhhhcccccC--CCCceeeEEEeeeccccceEecccccCCceeeeeeeeeee Q lcl|Aclame:pro 77 QASIPT-PIQFLQQWLPGFVKVLTSARKIDEILGVKTV--GSWEDQEIVQGIVEPAGTAMEYGDLTNIPLSSWNVNFERR 153 (388) Q Consensus 77 ~~~~g~-l~~~l~~idp~v~e~l~~~~~~~~i~~v~t~--g~w~~~t~~~~v~e~~G~a~~ygd~~diP~~~~n~~~~~~ 153 (388) +++.-| |-. |.+.+.+.+........++..+-+ +.- -.|++++.....+.+..-+....++.-+.+..+... T Consensus 1 MA~~~~~pei----~~~~v~~~~~~~lv~~~l~~~~~~~~~~~-GdTv~ip~~~~~~~~d~~~~~~~~~~~~~~~~~~~~ 75 (273) T protein:vir:79 1 MAFNNFIPEL----WSDMLLEEWTAQTVFANLVNREYEGIASK-GNVVHIAGVVAPTVKDYKAAGRQTSADAISDTGVDL 75 (273) T ss_pred CcchhhhHHH----HHHHHHHHHHhhccchhhhhccccccccC-CcEEEEeecCcccccccccCCCccCccccccceEEE Confidence 223212 222 223333333333333444322211 111 237888887655444333333344455555566666 Q ss_pred eEEEE-EEEEeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEEeecCccccceEEEeecCCCccccccccCCcc Q lcl|Aclame:pro 154 TIVRG-EMGIQVGLLEEGRASAMRINSAEVKRQGAAVQLEIMRNAIGFYGWEGKNGNRTFGFLNDPSLLPAIASTTPGGW 232 (388) Q Consensus 154 ~v~~~-~~~~~y~~~El~~A~~~g~~l~~~K~~aAr~a~~~~~n~i~~~G~a~~~~~g~~GllN~P~l~a~~~~~~~~~~ 232 (388) ++-.. ..++.+...|... ...++.. -...+..++...+++..+ + ++-.-. +... + T Consensus 76 tid~~~~~~~~i~d~d~~~---~~~~~~~-~~~~~~~ala~~vD~~i~-~-----------~~~~a~-----~~~~--~- 131 (273) T protein:vir:79 76 LIDQEKSIDFLVDDIDRVQ---VAGSLEA-YTRAGATALATDTDKFIA-D-----------MLVDNG-----TALT--G- 131 (273) T ss_pred EEeeecccceeeccHHHHh---hcccHHH-HHHHHHHHHHHHHHHHHH-H-----------HHhhcc-----cccc--c- Confidence 66443 5566666655432 3345643 344455666666664321 1 010000 0000 0 Q ss_pred cccccCCHHHHHHHHHHHHHHHHHhcCCeeccccccceEEcCHHHHHhhccCCCcCccHHHH------HHH----hCCcc Q lcl|Aclame:pro 233 VSGGANAFQGIVGDLRLMLITLRVQSEDNIDPEDVDITLVLPMNKVDMLSVVTDLGISVRDW------LKQ----TYPRV 302 (388) Q Consensus 233 t~Wa~kT~~eI~~DI~~~~~~l~~~s~g~v~~~~~p~tL~Lp~~~~~~Ls~~~~~~~Tvl~~------lk~----n~pnl 302 (388) -..-+++.+++.|..+...+-.+. + |+. .-.|+++|..+..|-+..+. .+-.++ |++ +.-++ T Consensus 132 --~~~~~~~~~~~~i~~a~~~ld~~~---v-P~~-~R~lvv~p~~~~~Ll~~~~~-~~~~~~~~~~~~l~~G~ig~~~G~ 203 (273) T protein:vir:79 132 --SAPSDADDAFDLIASALKELTKAN---V-PNV-GRVVVVNAEMAFWLRSSGSK-LTSADTSGDAAGLRAGTIGNLLGA 203 (273) T ss_pred --ccccchhhHHHHHHHHHHHhhhcc---C-Ccc-CcEEEECHHHHHHHhhchhh-hhhhhhcccccceeeeEeeEEece Confidence 012345566777777766554332 1 221 23789999998877432110 000000 110 11134 Q ss_pred EEEEccccccccCCCCccEEEEEEcccccccccccCCCcceEeecchhhhccCceeccCceEEeccc-ceeeeeeecccc Q lcl|Aclame:pro 303 RVMSAPELQGGNPDDGKDIAYMFLDSVDTAVDGSTDGGDTWAQLVQSKFVTLGVEKRVKNYVEAYSN-ATAGVMLKRPWA 381 (388) Q Consensus 303 ~i~~~pel~~a~gtg~~~~~~~~~~~~d~~~~~~~~~~~t~~~~~p~~~r~~~v~~~~~~~~~~~~~-~t~G~ii~rP~a 381 (388) +|.....+.. +.+...+.+.++- +- +..+...-...+..+.|-.-+.+ -..|+-+.||-+ T Consensus 204 ~i~~s~~lp~----~~~~~~~a~~~~A-------------~~--~a~~~~~~e~~r~~~~~~~~v~~~~~yg~~v~~p~~ 264 (273) T protein:vir:79 204 RIVESNNLRD----TDDEQFVAFHPSA-------------AA--YVSQIDTVEALRDQDSFSDRIRALHVYGGKVVRPTG 264 (273) T ss_pred EEEecccccc----cCceEEEEEeccc-------------ee--eeeehhhhhcccCcccceeeeeeeeeeeeEEecCce Confidence 4444333321 1112223332221 00 00000000011111112111111 247888888998 Q ss_pred ceeeccC Q lcl|Aclame:pro 382 VVRLIGL 388 (388) Q Consensus 382 i~~~~GI 388 (388) ++.+.== T Consensus 265 vv~~~~~ 271 (273) T protein:vir:79 265 VVVFNKT 271 (273) T ss_pred EEEEecc Confidence 8875433 No 114 >protein:vir:4830 Length: 397 # NCBI annotation: MPL-7201 # Family: family:all:21 # MgeID: mge:105 # MgeName: 7201 # Cross-refs: genbank:acc:NP_038327;genbank:gi:9634653;genbank:GeneID:1262632 Probab=48.19 E-value=0.68 Score=21.51 Aligned_cols=309 Identities=10% Similarity=-0.009 Sum_probs=128.2 Q ss_pred CCCcceeeeecCccccchhhhhhcccccccccCCHHHHhhcceecccchhhcchhhhhhhhhhhhccCcccccccccccc Q lcl|Aclame:pro 1 MKQLSKVHQSLAGRSVRAFDMANGKADYRLTDMAVRELKKFGLVFDHATVKRQIELLHEGGVATQAFDSAYVAPTTQASI 80 (388) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~amDaa~~~~~t~~~~ 80 (388) +....... ...+..++.... ... ...+..+ .|... ...... . ...++. ..+..+. T Consensus 63 ~~~~~~~~--~~~~~~~~~~~~-------~~~-~~~~~~~---~~~~~----~~~~~~-~--~~~~~~-----~~t~~~g 117 (397) T protein:vir:48 63 ARANEVVN--MSEEEKKPLTKS-------EEE-VKAGFVK---DFKNL----VRGRYQ-N--LLDSKT-----DASGSDA 117 (397) T ss_pred HHHhhhhh--hhhhccccccch-------hhH-HHHHHHH---HHHHH----Hhhhhh-H--HHHHhh-----ccCCccc Confidence 00000000 000000000000 000 0001111 00000 000000 0 001111 1122233 Q ss_pred h--HHHHHHHhhcceeeeecccchhhhhhcccccCCCCceeeEEEeeeccccceEecccccCCcee-eeeeeeeeeeEEE Q lcl|Aclame:pro 81 P--TPIQFLQQWLPGFVKVLTSARKIDEILGVKTVGSWEDQEIVQGIVEPAGTAMEYGDLTNIPLS-SWNVNFERRTIVR 157 (388) Q Consensus 81 g--~l~~~l~~idp~v~e~l~~~~~~~~i~~v~t~g~w~~~t~~~~v~e~~G~a~~ygd~~diP~~-~~n~~~~~~~v~~ 157 (388) | +|..+. ++|++.+......++++++.......-....+...+..+.+...+....+|-. +...+...-..+. T Consensus 118 g~~iP~~~~----~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~v~~~~~k 193 (397) T protein:vir:48 118 GLTIPQDIQ----TAIHTLVRQYDSLQEYVNVENVTTLTGSRVYEKWADITGLAKLDDEAGSIGTNDDPKLYPIRYAIKR 193 (397) T ss_pred cccccHHHH----HHHHHHHHHHHHHHhhhceeeccCCcceEEEEeecCCCcceeeeccccccccccccceeeEEeehee Confidence 4 444443 46777766666666665554332222222223333444556677777888865 4678888888899 Q ss_pred EEEEEeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEEeecCccccceEEEeecCCCccccccccCCccccccc Q lcl|Aclame:pro 158 GEMGIQVGLLEEGRASAMRINSAEVKRQGAAVQLEIMRNAIGFYGWEGKNGNRTFGFLNDPSLLPAIASTTPGGWVSGGA 237 (388) Q Consensus 158 ~~~~~~y~~~El~~A~~~g~~l~~~K~~aAr~a~~~~~n~i~~~G~a~~~~~g~~GllN~P~l~a~~~~~~~~~~t~Wa~ 237 (388) ++..+.++.+=++. ...++.+.-......++...+|+-.+.|+.... ..++.. T Consensus 194 ~~~~~~iS~ell~d---s~~~l~~~v~~~l~~~~~~~~d~~il~G~g~~~--------------------~~~~~~---- 246 (397) T protein:vir:48 194 YAGISTVTNSLLAD---SAENILAWLSGWIAKKVVVTRNKAILEAIATLP--------------------TKPTLT---- 246 (397) T ss_pred eeeehhhHHHHHhh---chHHHHHHHHHHHHHHHHHHHHHHHhhcccccc--------------------cccccc---- Confidence 99888888864433 345777778888888888888888888863210 001111 Q ss_pred CCHHHHHHHHHHHHHHHHHhcCCeeccccccceEEcCHHHHHhhccC-CCcCccHHHHHHHhCCccEEEEccc------c Q lcl|Aclame:pro 238 NAFQGIVGDLRLMLITLRVQSEDNIDPEDVDITLVLPMNKVDMLSVV-TDLGISVRDWLKQTYPRVRVMSAPE------L 310 (388) Q Consensus 238 kT~~eI~~DI~~~~~~l~~~s~g~v~~~~~p~tL~Lp~~~~~~Ls~~-~~~~~Tvl~~lk~n~pnl~i~~~pe------l 310 (388) + ++||.+++..+...-. ....++|.+..+..|... +..|.-++.---.+...-+|-..|= + T Consensus 247 -~----~d~i~~~~~~l~~~~~-------~~a~~v~n~~~~~~L~~lkd~~G~~i~~~~~~~~~~~~l~G~PV~~~~~~~ 314 (397) T protein:vir:48 247 -K----WDDIIDLQAKVDPAIK-------QTSFFLTNTSGFTALKKVKNAFGDYLMERDVKSPTGYSIDGFAVKEVADRW 314 (397) T ss_pred -c----HHHHHHHHHHhhhhhc-------CCCEEEECHHHHHHHHHhhcCCCceeeccCcCCCCCceeccceeEEecccc Confidence 1 3455666665543321 124789999999988643 3334333211001111112222221 1 Q ss_pred ccccCCCCccEEEEEEcccccccccccCCCcceEee-cchh-hhccCceeccCceEEecccceeeeeeeccccceeec-- Q lcl|Aclame:pro 311 QGGNPDDGKDIAYMFLDSVDTAVDGSTDGGDTWAQL-VQSK-FVTLGVEKRVKNYVEAYSNATAGVMLKRPWAVVRLI-- 386 (388) Q Consensus 311 ~~a~gtg~~~~~~~~~~~~d~~~~~~~~~~~t~~~~-~p~~-~r~~~v~~~~~~~~~~~~~~t~G~ii~rP~ai~~~~-- 386 (388) ... + +.++..++|-+ ............-.++.. ...+ |... ....-+..|.+ +.+++|-+|+..+ T Consensus 315 ~~~-~-~~~~~~~~~gd-~~~~~~~~~~~~~~i~~~~~~~~~~~~~-------~~~~r~~~r~d-~~~~~~~a~~~~~~~ 383 (397) T protein:vir:48 315 LAN-A-SSGAMPLYFGD-LKQAVTLFDRQQMSLLSTNIGGGAFETD-------TTKIRVIDRFD-VVATDTESFVPASFK 383 (397) T ss_pred cCC-c-CCCceEEEEEe-ccceEEEEeecceEEEEeccchhhhhcC-------ceeEEEEeeec-cEEecccceEEEEec Confidence 111 1 12222233321 110000000000001100 0000 1100 11222333444 4556688886554 Q ss_pred cC Q lcl|Aclame:pro 387 GL 388 (388) Q Consensus 387 GI 388 (388) .. T Consensus 384 ~~ 385 (397) T protein:vir:48 384 AI 385 (397) T ss_pred cc Confidence 33 No 115 >protein:vir:95963 Length: 395 # NCBI annotation: ORF009 # Family: family:all:635 # MgeID: mge:1594 # MgeName: 2638A # Cross-refs: genbank:acc:YP_239802;genbank:gi:66395459;genbank:GeneID:5132880 Probab=47.76 E-value=0.69 Score=21.46 Aligned_cols=327 Identities=13% Similarity=0.027 Sum_probs=117.7 Q ss_pred CCCcceeeeecCccccchhhhhhcccccccccCCHH----------HHhhcceecccchhhcchhhhhhhhhhhhccCcc Q lcl|Aclame:pro 1 MKQLSKVHQSLAGRSVRAFDMANGKADYRLTDMAVR----------ELKKFGLVFDHATVKRQIELLHEGGVATQAFDSA 70 (388) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----------~l~~~g~~~~~~~~~~~~~~~~~~~~~~~amDaa 70 (388) +...-+.. .+.+.+...-....-+....... .+...|.. ....+...-. .++.. T Consensus 25 ~~~~~~~e-----~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~r~~~------~l~~ee~~~~----~~~~~- 88 (395) T protein:vir:95 25 VQNGASDE-----EQSKAFGAMFDALSNDLQEEITAEINNRVVDNGILAKRSQD------PLTSEERKFF----NDINY- 88 (395) T ss_pred HhhhhhHH-----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCcc------ccchHHHHHH----HHHhh- Confidence 00000000 00000000000000000000000 01111211 0111111100 11111 Q ss_pred cccccccccch--HHHHHHHhhcceeeeecccchhhhhhcccccCCCCceeeEEEeeeccccceEecccccCC-ceeeee Q lcl|Aclame:pro 71 YVAPTTQASIP--TPIQFLQQWLPGFVKVLTSARKIDEILGVKTVGSWEDQEIVQGIVEPAGTAMEYGDLTNI-PLSSWN 147 (388) Q Consensus 71 ~~~~~t~~~~g--~l~~~l~~idp~v~e~l~~~~~~~~i~~v~t~g~w~~~t~~~~v~e~~G~a~~ygd~~di-P~~~~n 147 (388) .+.++.| +|..+.+ +|++.+...-..+.+..+.+.+. ...+...+..+.+...+..+.+ +..+.. T Consensus 89 ----~t~~~gG~liP~~~~~----~Ii~~l~~~s~i~~~~~v~~~~~----~~~i~~~~~~~~a~w~~e~~~~~~~~~~~ 156 (395) T protein:vir:95 89 ----DVGYTDEKILPETVVE----RVFDDLQKDHPLLSKINFQNAGI----KTRVIKADPAGQAVWGKVFGEIKGQLDAA 156 (395) T ss_pred ----ccCCCCceeccHHHHH----HHHHHHHhhhhhhhhceeEecCC----ceEEEEecCCcceEEeecccccCcccccc Confidence 1222333 4444433 55555544444455544443332 1345666666776665444455 455666 Q ss_pred eeeeeeeEEEEEEEEeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEEeecCccccceEEEeecCCCccccccc Q lcl|Aclame:pro 148 VNFERRTIVRGEMGIQVGLLEEGRASAMRINSAEVKRQGAAVQLEIMRNAIGFYGWEGKNGNRTFGFLNDPSLLPAIAST 227 (388) Q Consensus 148 ~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~K~~aAr~a~~~~~n~i~~~G~a~~~~~g~~GllN~P~l~a~~~~~ 227 (388) .....-..+.+..-..++.+=| .....++.+--....+.++.+.+|+-.+.|+.-. .++-.|+||+.......... T Consensus 157 f~~i~l~~~kl~~~~~iS~ell---~ds~~~ie~~i~~~la~~ia~~~~~a~i~G~G~~-~~qP~Gil~~~~~~~~~~~~ 232 (395) T protein:vir:95 157 FREENFTQYKLTCFVVLPDDLS---TFGPAWIERFVRTQIQEAISVALESAIINGGGAA-KTQPVGLMKDVNTNSGAVTD 232 (395) T ss_pred ceeeeeceeeEEEeecccHHHH---hcchhHHHHHHHHHHHHHHHHHHhhheeeccCCC-CcCceeeeeccccccccccc Confidence 6677777888887777777433 2345788999999999999999999999997421 12357999986543211111 Q ss_pred cCCcccccccCCHHHH---HHHHHHHHHHHHHhcCCeeccccccceEEcCHHHHHhh-ccC---CCcC--ccHHHHHHHh Q lcl|Aclame:pro 228 TPGGWVSGGANAFQGI---VGDLRLMLITLRVQSEDNIDPEDVDITLVLPMNKVDML-SVV---TDLG--ISVRDWLKQT 298 (388) Q Consensus 228 ~~~~~t~Wa~kT~~eI---~~DI~~~~~~l~~~s~g~v~~~~~p~tL~Lp~~~~~~L-s~~---~~~~--~Tvl~~lk~n 298 (388) +.. -...|.+.+ +..+..++..+-...++.-....-..+++|.+.-+..+ ..+ +..| .|++= T Consensus 233 ~~~----~~~~t~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~mn~~t~~~~~g~~~~~~~~G~~~~~lg----- 303 (395) T protein:vir:95 233 KAS----SGTLTFADADTTILELNDVLKNLSVDEKGKELKIDGKVALVVNPRDSWDVQARYTYLTANGGFVTVLP----- 303 (395) T ss_pred ccc----cchhhhhhhHhhHHHHHHHHHhhccccccchhhhcCceEEEEcchhhhhcCCcceeccCCCcceeccC----- Confidence 110 011122222 22222333222111111100000112345554433222 110 1112 11110 Q ss_pred CCccEEEEccccccccCCCCccEEEEEEcccccccccccCCCcceEeecchhhhccCceeccCceEEecccceeeeeeec Q lcl|Aclame:pro 299 YPRVRVMSAPELQGGNPDDGKDIAYMFLDSVDTAVDGSTDGGDTWAQLVQSKFVTLGVEKRVKNYVEAYSNATAGVMLKR 378 (388) Q Consensus 299 ~pnl~i~~~pel~~a~gtg~~~~~~~~~~~~d~~~~~~~~~~~t~~~~~p~~~r~~~v~~~~~~~~~~~~~~t~G~ii~r 378 (388) | ++.++....+. + ++ ++|.+= ..-...- ...-.+. ..++.+-.. ....+....|.+|.++ . T Consensus 304 ~-g~~v~~~~~~p-~-----~~--i~fgdf-s~y~i~~-r~~~~i~-~~~~~~~~~------d~~~f~~~~r~dg~~~-~ 364 (395) T protein:vir:95 304 Y-NVTIITSEFVP-E-----GK--LVAFVT-DRYNAVR-GGGLTVK-KFDQTLALE------DAVLFTAKTFAYGQPD-D 364 (395) T ss_pred C-cceEEEcCCCC-C-----Cc--EEEEec-ccEEEEE-ecceEEE-eccchhhhC------CcEEEEEEEEECCEEe-c Confidence 1 23333221111 0 11 111110 0000000 0000000 011110000 0112223334443333 3 Q ss_pred cccceeeccC Q lcl|Aclame:pro 379 PWAVVRLIGL 388 (388) Q Consensus 379 P~ai~~~~GI 388 (388) |-||. .+-| T Consensus 365 ~~A~~-~l~i 373 (395) T protein:vir:95 365 NKASA-VYDL 373 (395) T ss_pred cccEE-EEEe Confidence 55554 2222 No 116 >protein:vir:739 Length: 231 # NCBI annotation: major structural protein 4 # Family: family:all:522 # MgeID: mge:14 # MgeName: Tuc2009 # Cross-refs: genbank:acc:NP_108716;genbank:gi:13487838;genbank:GeneID:920884 Probab=46.27 E-value=0.74 Score=21.30 Aligned_cols=220 Identities=9% Similarity=-0.028 Sum_probs=107.5 Q ss_pred cCCCCceeeEEEeeeccccceEecccccCCceeeeeeeeeeeeEEEEEEEEeecHHHHHHHHHhCCChHHHHHHHHHHHH Q lcl|Aclame:pro 112 TVGSWEDQEIVQGIVEPAGTAMEYGDLTNIPLSSWNVNFERRTIVRGEMGIQVGLLEEGRASAMRINSAEVKRQGAAVQL 191 (388) Q Consensus 112 t~g~w~~~t~~~~v~e~~G~a~~ygd~~diP~~~~n~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~K~~aAr~a~ 191 (388) ..|--.=.+++|+- ..|.+..+++++.+|..++.......++...+-++++...+... ..| +...+-.....+++ T Consensus 1 ~~~~~~Gdtit~P~--~iGda~~v~eG~~i~~~~l~~t~~~atIk~~gk~~~itD~a~l~--~~g-Dp~~ea~~Q~~~~i 75 (231) T protein:vir:73 1 ENGINLANLCEYPN--DIGDAADVAEGGEISLDKIGTTTKSVTIKKAAKGTEITDEAALS--GYG-DPIGESNKQLGLSL 75 (231) T ss_pred CccccCCceEEecc--cccchhhhcCCCcCChhhccccceeeeEeeeccceeeeHHHHhh--ccC-chHHHHHHHHHHHH Confidence 22222235778885 48999999999999999999999999999988888877765533 344 44455556666666 Q ss_pred HHhhceEEEEeecCccccceEEEeecCCCccccccccCCcccccccCCHHHHHHHHHHHHHHHHHhcCCeeccccccceE Q lcl|Aclame:pro 192 EIMRNAIGFYGWEGKNGNRTFGFLNDPSLLPAIASTTPGGWVSGGANAFQGIVGDLRLMLITLRVQSEDNIDPEDVDITL 271 (388) Q Consensus 192 ~~~~n~i~~~G~a~~~~~g~~GllN~P~l~a~~~~~~~~~~t~Wa~kT~~eI~~DI~~~~~~l~~~s~g~v~~~~~p~tL 271 (388) ..++|.-.+ +- +. .+ +|..+++ --+++|+.++..+-.. ...+..+ T Consensus 76 A~kvD~di~-~~-----------~~--------~a-------~l~~~~~-~t~d~i~~A~~~fgde-------~~~~~vi 120 (231) T protein:vir:73 76 ANKVDDDLL-KA-----------AK--------TT-------SQTVSTK-ANVDGVQAALDIFNDE-------DAQAYVL 120 (231) T ss_pred HHhhhHHHH-Hh-----------hc--------cc-------ccccccc-ccHHHHHHHHHHhccc-------cccceEE Confidence 666665322 10 00 00 0111111 0145555555554322 2356789 Q ss_pred EcCHHHHHhhccCCCcCccHHHHHH----Hh-----CCccEEEEccccccccCCCCccEEEEEEcccccccccccCCCcc Q lcl|Aclame:pro 272 VLPMNKVDMLSVVTDLGISVRDWLK----QT-----YPRVRVMSAPELQGGNPDDGKDIAYMFLDSVDTAVDGSTDGGDT 342 (388) Q Consensus 272 ~Lp~~~~~~Ls~~~~~~~Tvl~~lk----~n-----~pnl~i~~~pel~~a~gtg~~~~~~~~~~~~d~~~~~~~~~~~t 342 (388) ++.|..+..|.+--....+ -.... .| +-.++|..-+.+.. +++ -.+-++.. .+.+....-.+. . T Consensus 121 vv~p~~~~~Lrk~~~~~~~-~~~~g~~i~~~G~iG~i~G~~Vi~S~~~~~--~~~-~~~~~i~~--~gAl~~~~k~~~-~ 193 (231) T protein:vir:73 121 IVNPKDAAKIRKDANAKNI-GSEVGANALINGTYADVLGAQIVRSKKLAE--GSA-LMFKIVSN--SPALKLVLKRGV-Q 193 (231) T ss_pred EEcchHHHhhhhccchhhh-hhhhccceeeecccceEcceEEEEcCCCCC--Cce-eeeeEEee--ccceeeeecccc-e Confidence 9999998888541111100 00000 00 12345544333321 111 00001110 000000000000 0 Q ss_pred eEeecchhhhccCceeccCceEEecccceeeeeeeccccceee--ccC Q lcl|Aclame:pro 343 WAQLVQSKFVTLGVEKRVKNYVEAYSNATAGVMLKRPWAVVRL--IGL 388 (388) Q Consensus 343 ~~~~~p~~~r~~~v~~~~~~~~~~~~~~t~G~ii~rP~ai~~~--~GI 388 (388) .|. .| .+..+.-.+. ....+||-++.|..++.+ .|+ T Consensus 194 vEt-----dR--d~~~k~~~i~---~~~~y~v~l~~~~~vv~~t~~g~ 231 (231) T protein:vir:73 194 VET-----DR--DIVTKTTVIT---ADEHYAAYLYDLTKVVNITFTGV 231 (231) T ss_pred eec-----cc--cccccccEEE---EeEEEEEEEEcCccEEEEEeecC Confidence 010 01 0000111111 113679999999998876 677 No 117 >protein:vir:95107 Length: 270 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1549 # MgeName: X2 # Cross-refs: genbank:acc:YP_240822;genbank:gi:66394683;genbank:GeneID:5133901 Probab=45.20 E-value=0.78 Score=21.18 Aligned_cols=250 Identities=10% Similarity=-0.010 Sum_probs=120.4 Q ss_pred ccccccccchHHHHHHHhhcceeeeecccchhhhhhcccccCCCC-ceeeEEEeeeccccceEecccccCCceeeeeeee Q lcl|Aclame:pro 72 VAPTTQASIPTPIQFLQQWLPGFVKVLTSARKIDEILGVKTVGSW-EDQEIVQGIVEPAGTAMEYGDLTNIPLSSWNVNF 150 (388) Q Consensus 72 ~~~~t~~~~g~l~~~l~~idp~v~e~l~~~~~~~~i~~v~t~g~w-~~~t~~~~v~e~~G~a~~ygd~~diP~~~~n~~~ 150 (388) .....-+++=+|--+..|+-.++ ..-.+...+..+++...- .=.+++++.++..|.+..+.+.++++..++.... T Consensus 1 Ma~T~~~d~I~Pev~~~~V~e~~----~~~~~~~~~~~~d~~L~g~~G~ti~~P~~~~igdae~~~eg~~i~~~~lt~~~ 76 (270) T protein:vir:95 1 MTQTKKANLINPEVLANVVSAQM----QNAIRFTPYAVTDDTLVGQPGDTITRPKYAYIGAAEDLQEGVAMDTTQMSMTT 76 (270) T ss_pred CCceehhhhcchHHHHHHHHHHH----HhHHhhccccccccccCCCCCCEEEeeeecCCCccccccCCCccchhhcccch Confidence 11122234445655666554433 222223334444333111 1367899999999999999999999999999999 Q ss_pred eeeeEEEEEEEEeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEEeecCccccceEEEeecCCCccccccccCC Q lcl|Aclame:pro 151 ERRTIVRGEMGIQVGLLEEGRASAMRINSAEVKRQGAAVQLEIMRNAIGFYGWEGKNGNRTFGFLNDPSLLPAIASTTPG 230 (388) Q Consensus 151 ~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~K~~aAr~a~~~~~n~i~~~G~a~~~~~g~~GllN~P~l~a~~~~~~~~ 230 (388) ..-++.+.+-+++++.+ .+....+ +...+-.....+.+..++++..+ +- ..|.... .+ T Consensus 77 ~~a~i~~~gk~~~itD~--a~~~~~~-dp~~~~~~q~a~~~a~~~d~~li-~~-------l~~a~~~--------~~--- 134 (270) T protein:vir:95 77 TKVTVKETGKAVEVTQT--AIITNVN-GTLQEASRQLAMSLADKVEIDYI-AE-------LNKSKQT--------AT--- 134 (270) T ss_pred heeeeehhhCcceecHH--HHhhhcc-chHHHHHHHHHHHHHHHHHHHHH-HH-------hcccccc--------cc--- Confidence 99999888766666554 3333333 55555556666777666665433 11 1111000 00 Q ss_pred cccccccCCHHHHHHHHHHHHHHHHHhcCCeeccccccceEEcCHHHHHhhccCC-----CcCccHHHHHHHh-----CC Q lcl|Aclame:pro 231 GWVSGGANAFQGIVGDLRLMLITLRVQSEDNIDPEDVDITLVLPMNKVDMLSVVT-----DLGISVRDWLKQT-----YP 300 (388) Q Consensus 231 ~~t~Wa~kT~~eI~~DI~~~~~~l~~~s~g~v~~~~~p~tL~Lp~~~~~~Ls~~~-----~~~~Tvl~~lk~n-----~p 300 (388) ..-+. ++|+.++..+- +....+..|++.|..+..|.+.. ..+..+ + .| |- T Consensus 135 -----~~~t~----~~~~dA~~~lg-------d~~~~~~~i~vhs~~~~~Lrk~~~~~~~~~~~~~---~-~~G~ig~~~ 194 (270) T protein:vir:95 135 -----VSADA----TGILDAIEVFN-------SENDEDYVLYVNPKDYNKLVKSLFKVGGNVQDRA---I-SKGDLVEIV 194 (270) T ss_pred -----cccCH----HHHHHHHHHhc-------cccCCCcEEEEcHHHHHHHHhhhcccccccccch---h-cccccceec Confidence 11233 34444443331 12245778999999999885421 111111 1 11 22 Q ss_pred ccEE-EEccccccccCCCCccEEEEEEcccccccccccCCCcceEeecchhhhccCceeccCceEEec-ccceeeeeeec Q lcl|Aclame:pro 301 RVRV-MSAPELQGGNPDDGKDIAYMFLDSVDTAVDGSTDGGDTWAQLVQSKFVTLGVEKRVKNYVEAY-SNATAGVMLKR 378 (388) Q Consensus 301 nl~i-~~~pel~~a~gtg~~~~~~~~~~~~d~~~~~~~~~~~t~~~~~p~~~r~~~v~~~~~~~~~~~-~~~t~G~ii~r 378 (388) ++++ ++- .. -.....++|.+-- .....-.+.. .|. .| ..+...-.. .-..+||-++. T Consensus 195 G~~Viv~s----~~---~~~~~~~l~~~gA--i~~~~~~~~~-vEt-----dR------d~~~~~d~i~~~~~y~v~~~~ 253 (270) T protein:vir:95 195 GVSDIVKS----KR---VSENTAFLQRYGA--MEIVNKKKPE-AYT-----DF------DILKRTHLLSTNYHYSVNLKD 253 (270) T ss_pred ceeEEEeC----CC---CCceeEEEEeccc--eeeeecCCce-eee-----cc------chhhcccEEEeeeEEEEEEEc Confidence 3442 321 11 1122334443221 0000000000 111 11 000000011 12467888888 Q ss_pred cccceeeccC Q lcl|Aclame:pro 379 PWAVVRLIGL 388 (388) Q Consensus 379 P~ai~~~~GI 388 (388) |..++.++== T Consensus 254 ~skvv~~t~~ 263 (270) T protein:vir:95 254 ETGVVKVTFK 263 (270) T ss_pred cceEEEEEec Confidence 8877765311 No 118 >protein:vir:78935 Length: 335 # NCBI annotation: capsid protein # Family: family:all:2806 # MgeID: mge:1860 # MgeName: LKD16 # Cross-refs: genbank:acc:YP_001522824;genbank:gi:158345059;genbank:GeneID:5687425 Probab=41.82 E-value=0.91 Score=20.81 Aligned_cols=287 Identities=12% Similarity=0.060 Sum_probs=118.0 Q ss_pred hccCcccccccccccchHHHHHHHhhcceeeeecccchhhhhhcccccCCCCceeeEEEeeeccccceEec----ccc-c Q lcl|Aclame:pro 65 QAFDSAYVAPTTQASIPTPIQFLQQWLPGFVKVLTSARKIDEILGVKTVGSWEDQEIVQGIVEPAGTAMEY----GDL-T 139 (388) Q Consensus 65 ~amDaa~~~~~t~~~~g~l~~~l~~idp~v~e~l~~~~~~~~i~~v~t~g~w~~~t~~~~v~e~~G~a~~y----gd~-~ 139 (388) |+.-+..+-+.--++.+-..-||+-+.-+|.+....-...++++.+.+.- ...++.|+. .|++... |.. + T Consensus 1 ms~~~~~t~~~~~~s~~d~al~le~f~geV~~af~~~s~~~~~~~~rti~--~g~s~~~~~---iG~~~~~~~~pG~~l~ 75 (335) T protein:vir:78 1 MSFLNDLTRPNYAGKNADVDIHLEEHLGIVDKHFAYTSKFAPLMNIRDLR--GSNVVRLDR---LGNVEAKGRRAGEELE 75 (335) T ss_pred CCccccccccccccccchhhhhhhhhhhHHHHHHHHhhhhccccceeeec--cceeEEEee---eeeeeecccccCcccC Confidence 12122111110001111233456666665555554455556666665421 125556654 4676654 321 2 Q ss_pred CCceeeeeeeeeeeeE--EEEEEEEeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEE------EeecCccccce Q lcl|Aclame:pro 140 NIPLSSWNVNFERRTI--VRGEMGIQVGLLEEGRASAMRINSAEVKRQGAAVQLEIMRNAIGF------YGWEGKNGNRT 211 (388) Q Consensus 140 diP~~~~n~~~~~~~v--~~~~~~~~y~~~El~~A~~~g~~l~~~K~~aAr~a~~~~~n~i~~------~G~a~~~~~g~ 211 (388) ..|... ++....| ..+.-.+=|.+.|. ++..++-++-......++.++.|+..+ .+.+.. ... T Consensus 76 ~~~~~~---~k~~itID~ll~a~~~VddlDe~----~~~yDvR~e~s~~~G~aLA~~~Dq~~~~~l~~aa~~~a~--~~~ 146 (335) T protein:vir:78 76 RSRVVN---DKWNLTVDTLLYLRHQFDHQDEW----TQSFDMRKEVAELDGQELARKFDQACLIQVIKAAAMDAP--VDL 146 (335) T ss_pred CCCccc---CCeEEEecceeechhhHhhHHHh----hcCchhHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccc--ccc Confidence 222211 1111111 11222222333333 345566666666666666666666433 111000 001 Q ss_pred EEEeecCCCccccccccCCcccccccCCHHHHHHHHHHHHHHHHHhcCCeecccc--ccceEEcCHHHHHhhccC----- Q lcl|Aclame:pro 212 FGFLNDPSLLPAIASTTPGGWVSGGANAFQGIVGDLRLMLITLRVQSEDNIDPED--VDITLVLPMNKVDMLSVV----- 284 (388) Q Consensus 212 ~GllN~P~l~a~~~~~~~~~~t~Wa~kT~~eI~~DI~~~~~~l~~~s~g~v~~~~--~p~tL~Lp~~~~~~Ls~~----- 284 (388) .+.++ |++...+..++ .-+...++.+.+=+..+...+....- |+. .-...+++|.+|..|-.- T Consensus 147 ~~~~~-~G~~~~~~~tg-----~~~~~~~~~l~~a~~~a~~~l~ekdv----P~~~~~~rv~vv~P~~y~~Ll~~~~l~n 216 (335) T protein:vir:78 147 EDAFS-PGVLEKLDLTG-----LTAKEAAEKIVRMHRRVVETFIERDL----GDAVYSEGLTPMSPRVFSLLLEHDKLMS 216 (335) T ss_pred CCCcC-CCcceeeeecc-----ccccccHHHHHHHHHHHHHHHHhccC----CCCCCCccEEEeChHHHHHHhccccccc Confidence 11111 22211111111 11334677777777777777775542 211 114688999999988532 Q ss_pred CCcCcc--HHHHHHHh---CCccEEEEccccccccCCC-------------Ccc-EEEEEEcccccccccccCCCcceEe Q lcl|Aclame:pro 285 TDLGIS--VRDWLKQT---YPRVRVMSAPELQGGNPDD-------------GKD-IAYMFLDSVDTAVDGSTDGGDTWAQ 345 (388) Q Consensus 285 ~~~~~T--vl~~lk~n---~pnl~i~~~pel~~a~gtg-------------~~~-~~~~~~~~~d~~~~~~~~~~~t~~~ 345 (388) ++|+.| .-.+.+.. --+++|+..+.|-..++++ .+. ++++|-++- -..-+ T Consensus 217 ~~~~~s~~~~~~~~g~v~~v~Gv~V~~Sn~lP~~~~t~~~lg~a~n~~~~d~~~~~~~~~~~~A-----------l~t~~ 285 (335) T protein:vir:78 217 VEYQATGATNDYVKSRVAILNGVKVLETPRFATKAISAHPLGRHFNVSAEEAERQIALFLPSKT-----------LITAQ 285 (335) T ss_pred ccccccccccccccceeEEeeceEEEeeccCCCCCCccccccccCCcccccccceEEEEEecce-----------EEEEE Confidence 222211 11111111 1135566666554322221 111 222221110 00001 Q ss_pred ecchhhhccCceeccCceEEecccceeeeeeecc--ccceeeccC Q lcl|Aclame:pro 346 LVQSKFVTLGVEKRVKNYVEAYSNATAGVMLKRP--WAVVRLIGL 388 (388) Q Consensus 346 ~~p~~~r~~~v~~~~~~~~~~~~~~t~G~ii~rP--~ai~~~~GI 388 (388) +.+...+.. -+.+...|.+++-.. .|+-++|| .++...+|| T Consensus 286 ~~~~~~e~~-~~~~~~~~~i~~~~a-~G~g~lRPe~a~~i~~tg~ 328 (335) T protein:vir:78 286 VAPVQAKLW-EDHDQFSWVLDTFQM-YNIGARRPDTAGAIELKGI 328 (335) T ss_pred EEeccccee-eccchhhHhhhHHHH-cCCcccCcceEEEEEecCC Confidence 111111110 011223455555443 79999999 455677899 No 119 >protein:vir:106647 Length: 303 # NCBI annotation: ORF011 # Family: family:all:1178 # MgeID: mge:1557 # MgeName: 187 # Cross-refs: genbank:acc:YP_239493;genbank:gi:66395226;genbank:GeneID:4555801 Probab=41.20 E-value=0.94 Score=20.74 Aligned_cols=269 Identities=12% Similarity=0.054 Sum_probs=122.3 Q ss_pred hccCcccc---cccccccchHHHHHHHhhcceeeeecccchhhhhhcccccCCCCceeeEEEeeeccccceEecccccCC Q lcl|Aclame:pro 65 QAFDSAYV---APTTQASIPTPIQFLQQWLPGFVKVLTSARKIDEILGVKTVGSWEDQEIVQGIVEPAGTAMEYGDLTNI 141 (388) Q Consensus 65 ~amDaa~~---~~~t~~~~g~l~~~l~~idp~v~e~l~~~~~~~~i~~v~t~g~w~~~t~~~~v~e~~G~a~~ygd~~di 141 (388) |+-+.... +....-+.-|--+|-..|+. |...+-..+..|... | -.-+++.|++.++.|.+.-.+.+..| T Consensus 1 M~~e~nl~~~~dL~~a~siDF~~~f~~~i~~-----L~~~LGv~r~~pla~-G-t~iktyK~~~~~y~gda~dVaEGe~I 73 (303) T protein:vir:10 1 MSAENNLINVEALGKAKSIDFANKLGVGLNK-----LFEALAIQNKIPMNV-G-SALKQYRFKVEDSEKPNGDVAEGDVI 73 (303) T ss_pred CCCCcCCcchhhcccceeehhhhhhhhhHHH-----HHHHhhhhccccccC-C-ceeeeeeeeceeeccccccccCCccc Confidence 11111110 00011123244444433321 222333334445442 2 24467778888899999888899999 Q ss_pred ceeeeeeeee---eeeEEEEEEEEeecHHHHHHHHHhCCChHH-HHHHHHHHHHHHhhceEEEEeecCccccceEEEeec Q lcl|Aclame:pro 142 PLSSWNVNFE---RRTIVRGEMGIQVGLLEEGRASAMRINSAE-VKRQGAAVQLEIMRNAIGFYGWEGKNGNRTFGFLND 217 (388) Q Consensus 142 P~~~~n~~~~---~~~v~~~~~~~~y~~~El~~A~~~g~~l~~-~K~~aAr~a~~~~~n~i~~~G~a~~~~~g~~GllN~ 217 (388) |+..+..... ..++..+.-+. +.+-++ +.|...+- +-...-.+++.++++.-.|- .| T Consensus 74 plskvt~~~~~t~~~~~kK~rK~t--TdEAIq---lsGyg~aVgetd~qL~~~Iq~kIdnd~~~------------~l-- 134 (303) T protein:vir:10 74 PLTKVTREQVDITELQFAKYRKST--SAEAIQ---AHGYDLAINQTDNEMIKYVQKKFRAKFFE------------TL-- 134 (303) T ss_pred chhhheeeecceEEEEeecccccc--cHHHHH---hhcCCchhHHHHHHHHHHHHhhhhHHHHH------------HH-- Confidence 9999987643 34444444433 554433 34433221 11223344555554433320 00 Q ss_pred CCCccccccccCCcccccccCCHHHHHHHHHHHHHHHHHhcCCeeccccccceEEcCHHHHHhhccC------CCcCccH Q lcl|Aclame:pro 218 PSLLPAIASTTPGGWVSGGANAFQGIVGDLRLMLITLRVQSEDNIDPEDVDITLVLPMNKVDMLSVV------TDLGISV 291 (388) Q Consensus 218 P~l~a~~~~~~~~~~t~Wa~kT~~eI~~DI~~~~~~l~~~s~g~v~~~~~p~tL~Lp~~~~~~Ls~~------~~~~~Tv 291 (388) .++++... .+++.+--.+-|..++...|.+-....+.+..+..++=|-+...||... +++|.++ T Consensus 135 ------ktaT~t~~----~t~~t~~s~~glq~Al~~~~~kl~~~~ed~~~~V~FvNP~Daa~yl~~A~i~~~~t~fG~n~ 204 (303) T protein:vir:10 135 ------KSAIENGK----RTNKTKLSAENLQGALSKGRANLSVLLDDEITPIAFVNPNDTAEYLANGFINSTGAQFGVNL 204 (303) T ss_pred ------hhcccccc----cccceeecHHHHHHHHHhhhhhccccccccccEEEEEchHHHHHHhhcCCcchhhhhhhhhh Confidence 01111111 1111222256677788877766554333233343444566677778543 3457666 Q ss_pred HHHHHHhCCccEEEEccccccc--cCCCCccEEEEEEcccccccccccCCCcceEeecchhh----hccCceeccCceEE Q lcl|Aclame:pro 292 RDWLKQTYPRVRVMSAPELQGG--NPDDGKDIAYMFLDSVDTAVDGSTDGGDTWAQLVQSKF----VTLGVEKRVKNYVE 365 (388) Q Consensus 292 l~~lk~n~pnl~i~~~pel~~a--~gtg~~~~~~~~~~~~d~~~~~~~~~~~t~~~~~p~~~----r~~~v~~~~~~~~~ 365 (388) ++ ||-+++|+..+++... -.+-..+..+.|++-. . +..+.|.+-+- +- ..|....+...+.- T Consensus 205 L~----nfLG~~II~S~kv~~G~~~~T~~~Ni~~ay~~~~-g------~l~~~f~~t~D-~tglIGv~h~~~~~~~t~eT 272 (303) T protein:vir:10 205 LT----PYVGVKIVEFADVPQGEVWMTVAENLNVAYANPR-G------ELSRAFAFATD-ATGFVGVLHDIQPQRLTSDT 272 (303) T ss_pred hh----hhhcceEEEeccCCCceEEEeeccceEEEEecCc-h------hhhhhhhhccc-cccceEEEeccccceeeehh Confidence 65 7778888877766531 1334557777776531 1 22222222110 00 01222222222221 Q ss_pred ecccceeeeeeeccccceeeccC Q lcl|Aclame:pro 366 AYSNATAGVMLKRPWAVVRLIGL 388 (388) Q Consensus 366 ~~~~~t~G~ii~rP~ai~~~~GI 388 (388) -...|+. ..|. +.+|| T Consensus 273 ---~~~~~~~-lfpE---~~dgi 288 (303) T protein:vir:10 273 ---IYASAIS-MFPE---NIDAV 288 (303) T ss_pred ---HhHhHHH-hccc---ccceE Confidence 1122222 2243 44555 No 120 >protein:vir:80128 Length: 466 # NCBI annotation: Phage capsid protein # Family: family:all:635 # MgeID: mge:1877 # MgeName: bacteriophage bv1 # Cross-refs: genbank:acc:YP_001425603;genbank:gi:155042936;genbank:GeneID:5469556 Probab=40.87 E-value=0.95 Score=20.70 Aligned_cols=334 Identities=9% Similarity=-0.016 Sum_probs=124.6 Q ss_pred CCCcceeeeecCccccchhhhhhccc-ccccccC------CHHHHhhcceecccchhhcchhhhhhhhhhhhccCccccc Q lcl|Aclame:pro 1 MKQLSKVHQSLAGRSVRAFDMANGKA-DYRLTDM------AVRELKKFGLVFDHATVKRQIELLHEGGVATQAFDSAYVA 73 (388) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~------~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~amDaa~~~ 73 (388) +.++ ......++-+|-...+... ..+..+. ...+.++.+.+...... ..+.....-......+ T Consensus 84 l~e~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~---- 153 (466) T protein:vir:80 84 LEQL---NNKEPKNNSEPAQVSGARTQQFVGGETRMKGFFRNMPYEQRAALIARSEV---KEFLAQVRTLAQQKRA---- 153 (466) T ss_pred HHHH---HHhhhccCchhHHHHhhhhhHHhhHHHHHHHHHHhhhhhhHHHHHHHHHH---HHHHHHHHHHhhhhhh---- Confidence 0000 0000001001100000000 0000000 00000110100000000 0000000000000011 Q ss_pred ccccccchHHHHHHHhhcceeeeecccchhhhhhcccccCCCCceeeEEEeeeccccceEecccccCCceeeeeeeeeee Q lcl|Aclame:pro 74 PTTQASIPTPIQFLQQWLPGFVKVLTSARKIDEILGVKTVGSWEDQEIVQGIVEPAGTAMEYGDLTNIPLSSWNVNFERR 153 (388) Q Consensus 74 ~~t~~~~g~l~~~l~~idp~v~e~l~~~~~~~~i~~v~t~g~w~~~t~~~~v~e~~G~a~~ygd~~diP~~~~n~~~~~~ 153 (388) .+..+.-+|-.+.+.| ++.+......+.++.+..... +..+.+......+...+...++|-.+...+...- T Consensus 154 -~~g~~~~vP~~~~~~i----~~~l~~~~~l~~~~~v~~~~g----~~~~~~~~~~~~a~wv~E~~~~~~~~~~f~~i~~ 224 (466) T protein:vir:80 154 -VSGAELTIPDVMLELL----RDNMHRYSKLISKVRLRPLKG----TARQNIAGAIPEGVWTEAVANLNELSLSFSQIEV 224 (466) T ss_pred -hccccccccHHHHHHH----HHhhhhhhhhhhheeeeecCc----eeEeeeecCCcceeecccccccccccccccceee Confidence 0111122454444433 333322222233333322221 2234444444455666778888988888888888 Q ss_pred eEEEEEEEEeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEEeecCccccceEEEeecCCCccccccccCCccc Q lcl|Aclame:pro 154 TIVRGEMGIQVGLLEEGRASAMRINSAEVKRQGAAVQLEIMRNAIGFYGWEGKNGNRTFGFLNDPSLLPAIASTTPGGWV 233 (388) Q Consensus 154 ~v~~~~~~~~y~~~El~~A~~~g~~l~~~K~~aAr~a~~~~~n~i~~~G~a~~~~~g~~GllN~P~l~a~~~~~~~~~~t 233 (388) .++.+...+.++.+=|. ....++.+--....+.++...+|+-.+.|+... ...|+||+.+...... ......+ T Consensus 225 ~~~k~~~~~~iS~ell~---ds~~~l~~~i~~~la~~~~~~~~~ail~G~G~~---~P~Gil~~~~~~~~~~-~~~~~~~ 297 (466) T protein:vir:80 225 DGYKVGGFIPIPNSTLE---DSDLNLADEILDAIGQAIGFALDKAILYGTGTK---MPVGIVTRLAQTTQPP-NWGTKAP 297 (466) T ss_pred cceeeeeehhhhHHHHh---cchHHHHHHHHHHHHHHHHHHHhhheeeccCCC---Ccceeeeccccccccc-ccccccc Confidence 88999888888885443 244578888888999999999999999997532 4569999865432111 1111112 Q ss_pred ccccCCHHH-------------HHHHHHHHHHHHHHhcCCeeccccccceE-EcCHHHHHhhc-cC---CCcCccHHHHH Q lcl|Aclame:pro 234 SGGANAFQG-------------IVGDLRLMLITLRVQSEDNIDPEDVDITL-VLPMNKVDMLS-VV---TDLGISVRDWL 295 (388) Q Consensus 234 ~Wa~kT~~e-------------I~~DI~~~~~~l~~~s~g~v~~~~~p~tL-~Lp~~~~~~Ls-~~---~~~~~Tvl~~l 295 (388) .+.+.+... .+.|+...+..+... ...+..+ ++.+..+..|. .. +..|. +-+- T Consensus 298 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-------~~~~~~~w~~~~~~~~~l~~~~~~~~~~g~--~~~~ 368 (466) T protein:vir:80 298 AWTNLSTTNLLKIDPTGKSAEEFFSELVLKLSKARAN-------YSNGMKFWAMSSNTHAVLMSKAITFNSAGA--LVAS 368 (466) T ss_pred cccccchhhhhhhhhhccchhhHHHHHHHHHHhhhcc-------ccCCceeEEecchhHHHhhcccccccCCcc--cccc Confidence 233323222 222322111111111 1112222 33334443332 11 11111 1000 Q ss_pred HHh-C--CccEEEEcccccccc-CCCCccEEEEEEcccccccccccCCCcceEeecchhhhccCceeccCceEEecccce Q lcl|Aclame:pro 296 KQT-Y--PRVRVMSAPELQGGN-PDDGKDIAYMFLDSVDTAVDGSTDGGDTWAQLVQSKFVTLGVEKRVKNYVEAYSNAT 371 (388) Q Consensus 296 k~n-~--pnl~i~~~pel~~a~-gtg~~~~~~~~~~~~d~~~~~~~~~~~t~~~~~p~~~r~~~v~~~~~~~~~~~~~~t 371 (388) -.| . -+..|+..+-..... -.|-.+...++ ... .+...+...... ..-.+.+....|. T Consensus 369 ~~~~~~i~G~pvv~s~~~~~~~~~~g~~~~y~i~-~r~------------~~~i~~~~~~~f-----~~d~~~~r~~~r~ 430 (466) T protein:vir:80 369 LNNTMPIVGGDIVILDFIPDNDIIGGYGSLYLLA-ERA------------DIKLAQSEHVRF-----IEDQTVFKGTARY 430 (466) T ss_pred CCCcccccccceeecCccCccceeeeccccEEEE-eec------------ceEEEechhhhh-----hcCcEEEEEEEEE Confidence 011 0 112232221110000 00111111111 111 111111111110 0112334556666 Q ss_pred eeeeeeccccceeeccC Q lcl|Aclame:pro 372 AGVMLKRPWAVVRLIGL 388 (388) Q Consensus 372 ~G~ii~rP~ai~~~~GI 388 (388) +|.+ +.|-||+.+++= T Consensus 431 dg~~-~~~~afv~~~~~ 446 (466) T protein:vir:80 431 DGKP-VFGEGFVAVNIA 446 (466) T ss_pred ccEE-eccCceEEEEec Confidence 6655 569999988643 No 121 >protein:vir:103323 Length: 364 # NCBI annotation: major capsid-like protein # Family: family:all:2806 # MgeID: mge:1609 # MgeName: Era103 # Cross-refs: genbank:acc:YP_001039668;genbank:gi:125999997;genbank:GeneID:4818399 Probab=39.62 E-value=1 Score=20.56 Aligned_cols=291 Identities=11% Similarity=0.042 Sum_probs=108.7 Q ss_pred hccCcccccccccccchHHHHHHHhhcceeeeecccchhhhhhcccccCCCCceeeEEEeeeccccceEeccc--ccCC- Q lcl|Aclame:pro 65 QAFDSAYVAPTTQASIPTPIQFLQQWLPGFVKVLTSARKIDEILGVKTVGSWEDQEIVQGIVEPAGTAMEYGD--LTNI- 141 (388) Q Consensus 65 ~amDaa~~~~~t~~~~g~l~~~l~~idp~v~e~l~~~~~~~~i~~v~t~g~w~~~t~~~~v~e~~G~a~~ygd--~~di- 141 (388) |+.-.....+...+...-..-+|+...-+|.+....--..++++.+.+.- ...++.|+.+ |++...+- +..+ T Consensus 1 ms~~n~~t~~~~~~~~~~~al~le~f~geV~taf~~~s~~~~~~~~rti~--~gkS~q~~~i---G~~~~~~~~~G~~ld 75 (364) T protein:vir:10 1 MSNPNVLTQPAVSASGEVDSLLIEKFNNRVHEQYLKGENLLQWFDVQEVV--GTNSVSNKYI---GETELQVLSPGKSPD 75 (364) T ss_pred CCCcccccccccccccchhhhhhhhhhhhHHHHHHHHHhhcCcceeeeec--ccceEEeeee---eeeEEeeeccCcccC Confidence 12111111111111112333445544444433332333344555554321 1245555554 66655310 1111 Q ss_pred ceeeeeeeeeeeeE--EEEEEEEeecHHHHHHHHHhCCC-hHHHHHHHHHHHHHHhhceEEE----Ee-ecCccccceEE Q lcl|Aclame:pro 142 PLSSWNVNFERRTI--VRGEMGIQVGLLEEGRASAMRIN-SAEVKRQGAAVQLEIMRNAIGF----YG-WEGKNGNRTFG 213 (388) Q Consensus 142 P~~~~n~~~~~~~v--~~~~~~~~y~~~El~~A~~~g~~-l~~~K~~aAr~a~~~~~n~i~~----~G-~a~~~~~g~~G 213 (388) | ....-++....| ..+.-.+=|.+.|. ++.++ +.++-...+..++.++.|+..+ .+ .+ ..-+ T Consensus 76 ~-~~~~~~k~~itID~ll~a~~~V~diDe~----q~~~D~vR~e~s~e~G~ALA~~~Dq~i~~~v~~aa~a-----~~~~ 145 (364) T protein:vir:10 76 A-SPTEFDKNRLVVDTTVIARNTVAHFHDV----QNDIDGLKSKLSVNQAKKLKKMEDSMVIQQLVLGGIS-----NTEA 145 (364) T ss_pred C-CCcccCcEEEEecceeeechhhhhHHHH----hcCccchhHHHHHHHHHHHHHHHHHHHHHHHHhhhhh-----cccc Confidence 1 111111111111 11111222333332 23344 3333333444444443333221 11 01 0111 Q ss_pred EeecCCCccccccccCCcccccccCCHHHHHHHHHHHHHHHHHhcCCeeccccccceEEcCHHHHHhhccC-----CCcC Q lcl|Aclame:pro 214 FLNDPSLLPAIASTTPGGWVSGGANAFQGIVGDLRLMLITLRVQSEDNIDPEDVDITLVLPMNKVDMLSVV-----TDLG 288 (388) Q Consensus 214 llN~P~l~a~~~~~~~~~~t~Wa~kT~~eI~~DI~~~~~~l~~~s~g~v~~~~~p~tL~Lp~~~~~~Ls~~-----~~~~ 288 (388) -.+.|-+...-.....++..+-...+++.+++=|..+...|-.+. +..++ ..++|||.+|..|-.- .+|+ T Consensus 146 ~~~~~~~~~~g~~i~~~~~a~~~~~~~~~l~~ai~~a~~~LdEkd---VP~~~--R~~vv~P~~y~~Ll~~~~lvn~d~~ 220 (364) T protein:vir:10 146 IRKNPRVAGHGFSIHIVGLASSFLTSPQYMMAAIEMAMEQQTEQE---VDTSE--LCGLMPWTAFNCLRDADRIVDKSYT 220 (364) T ss_pred cccCCcccCCcceeeecccCcchhhhHHHHHHHHHHHHHHHhhcC---CCccc--cEEEeChHHHHHHhcCCcccccccc Confidence 112221111000000011122345667788877777777766554 33222 5789999988888432 2333 Q ss_pred cc-HHHHHHHh---CCccEEEEccccccccC----C--------------------C--CccEEEEEEcccccccccccC Q lcl|Aclame:pro 289 IS-VRDWLKQT---YPRVRVMSAPELQGGNP----D--------------------D--GKDIAYMFLDSVDTAVDGSTD 338 (388) Q Consensus 289 ~T-vl~~lk~n---~pnl~i~~~pel~~a~g----t--------------------g--~~~~~~~~~~~~d~~~~~~~~ 338 (388) .+ --.+.+.. --+++|+..+.|-..++ + + ...++++|-++- + T Consensus 221 ~~~~~~~~~G~v~~v~Gv~Vv~Sn~lP~~~~~~~~t~~~t~h~ls~~~~g~~y~v~~d~~~~~~~~f~~~A--l------ 292 (364) T protein:vir:10 221 IAASDNTVDGFVLKSWNTPIVPSNRFPKLSDNTEGTGNTKHHKLSNAGNGNRYDVTAGQTSAQAVLFTQDA--L------ 292 (364) T ss_pred ccCCCccccceeEEEeceEEEeccccccccccccccccccccccccccCCcccccccccceeEEEEEecce--E------ Confidence 21 11222221 12345555544421100 0 0 123344443320 0 Q ss_pred CCcceEeecchhhhccCceeccCceEEecccceeeeeeeccccceeeccC Q lcl|Aclame:pro 339 GGDTWAQLVQSKFVTLGVEKRVKNYVEAYSNATAGVMLKRPWAVVRLIGL 388 (388) Q Consensus 339 ~~~t~~~~~p~~~r~~~v~~~~~~~~~~~~~~t~G~ii~rP~ai~~~~GI 388 (388) -+.++ .+...+... +.+...|.+++-. ..|+-++||-+++-+.=- T Consensus 293 --~tv~~-~~~t~e~~~-~~~~~~~~ida~~-a~G~g~lRPeaa~~i~~~ 337 (364) T protein:vir:10 293 --LVGRT-ISITGDIFY-EKKEKTWYIDTFL-AEGAIPDRWEAVAVVTAA 337 (364) T ss_pred --EEEEE-ecceeeeee-ccceeeeeeeeeh-cccCcccCccceEEEEec Confidence 00111 111111000 1222356666533 479999999888766433 No 122 >protein:vir:7409 Length: 408 # NCBI annotation: major structural protein # Family: family:all:21 # MgeID: mge:146 # MgeName: P335 # Cross-refs: genbank:acc:NP_839926;genbank:gi:30089896;genbank:GeneID:1260683 Probab=39.56 E-value=1 Score=20.56 Aligned_cols=324 Identities=10% Similarity=-0.025 Sum_probs=125.3 Q ss_pred CCCcceeeeec-------Cc--cccchhh---hhhcccc-cccccCCHHHHhhcce-ecccchhhcchhhhhhhhhhhhc Q lcl|Aclame:pro 1 MKQLSKVHQSL-------AG--RSVRAFD---MANGKAD-YRLTDMAVRELKKFGL-VFDHATVKRQIELLHEGGVATQA 66 (388) Q Consensus 1 ~~~~~~~~~~~-------~~--~~~~~~~---~~~~~~~-~~~~~~~~~~l~~~g~-~~~~~~~~~~~~~~~~~~~~~~a 66 (388) +..+......+ .. .+.+..+ ....... .........+.+...+ .|-+.... ..... ......+ T Consensus 39 ~e~i~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~--~~~~~~a 115 (408) T protein:vir:74 39 AEAMSELKNKRDNEKVRRDALREQLVEAQAEQVVNMREEEKGPLNKSENELKDKFVKDFVNMVRN-PMAFL--NTVSSKT 115 (408) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccccchhhhhHHHHHHHHHHHHhc-chhhh--hhhhhhh Confidence 00000000000 00 0000000 0000000 0000000000000000 00000000 00000 0011111 Q ss_pred cCcccccccccccch--HHHHHHHhhcceeeeecccchhhhhhcccccCCCCceeeEEEeeecccc-ceEecccccCCce Q lcl|Aclame:pro 67 FDSAYVAPTTQASIP--TPIQFLQQWLPGFVKVLTSARKIDEILGVKTVGSWEDQEIVQGIVEPAG-TAMEYGDLTNIPL 143 (388) Q Consensus 67 mDaa~~~~~t~~~~g--~l~~~l~~idp~v~e~l~~~~~~~~i~~v~t~g~w~~~t~~~~v~e~~G-~a~~ygd~~diP~ 143 (388) +- ..+..+.| +|-.+ .+.|++.+......++++++..... ....+.+......+ .+...+...++|- T Consensus 116 ~~-----~~~~~~gg~~vP~~~----~~~Ii~~~~~~~~l~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~v~E~~~~~~ 185 (408) T protein:vir:74 116 ET-----SGSDSAAGLTIPQDI----RTMINTLVRQYDSLQQYVRVESVST-SSGSRVYEKWTDVTPLKAMDEEDGKIPD 185 (408) T ss_pred hc-----ccccCCCceeechhH----hhHHHHHHhhhcchhhhcceeeccC-CcceEEEEeecCCccccccccccccccc Confidence 11 11222334 44333 3466666666666666655433221 11222333333333 3345566778885 Q ss_pred -eeeeeeeeeeeEEEEEEEEeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEEeecCccccceEEEeecCCCcc Q lcl|Aclame:pro 144 -SSWNVNFERRTIVRGEMGIQVGLLEEGRASAMRINSAEVKRQGAAVQLEIMRNAIGFYGWEGKNGNRTFGFLNDPSLLP 222 (388) Q Consensus 144 -~~~n~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~K~~aAr~a~~~~~n~i~~~G~a~~~~~g~~GllN~P~l~a 222 (388) .+...+....+.+.+...+.++.+=++ ....+|.+.-.....+++...+|+-.+.|+... .| T Consensus 186 ~~~~~~~~i~~~~~k~~~~~~iS~ell~---ds~~~l~~~i~~~l~~~~~~~~d~~il~G~G~~----------~~---- 248 (408) T protein:vir:74 186 LDNPRLTIIKYLIKRYAGIITATNTLLK---DTAENILAWLSSWIAKKVVVTRNQAIIAAMGTV----------PK---- 248 (408) T ss_pred ccccceeeEEeeeeeEEeeehhHHHHHh---hchHHHHHHHHHHHHHHHHHHHHHHHhhccccc----------cc---- Confidence 557888899999999999999885442 345578888888888888888998888886310 00 Q ss_pred ccccccCCcccccccCCHHHHHHHHHHHHHHHHHhcCCeeccccccceEEcCHHHHHhhccC-CCcCccHHHH-HHHhCC Q lcl|Aclame:pro 223 AIASTTPGGWVSGGANAFQGIVGDLRLMLITLRVQSEDNIDPEDVDITLVLPMNKVDMLSVV-TDLGISVRDW-LKQTYP 300 (388) Q Consensus 223 ~~~~~~~~~~t~Wa~kT~~eI~~DI~~~~~~l~~~s~g~v~~~~~p~tL~Lp~~~~~~Ls~~-~~~~~Tvl~~-lk~n~p 300 (388) . + ...+.+.|++.++..+.... .. ...++|.+..+..|.+. +..|.-++.= +....| T Consensus 249 ----~--~-----~~~~~~~i~~~~~~~l~~~~-------~~---~a~~v~n~~~~~~l~~lkd~~G~~l~~~~~~~~~~ 307 (408) T protein:vir:74 249 ----K--P-----TIANFDDVITMINTSVDPAI-------IA---TSSLLTNQSGLNKLALVKTAEGKYLLEPDPTKPNS 307 (408) T ss_pred ----c--c-----ccccHHHHHHHHHHhhhhhh-------cC---CCEEEEcHHHHHHHHHhhcCCCceEeccCcCCCCC Confidence 0 0 11223334333322221111 11 12588999999988643 3334433210 111111 Q ss_pred ccEEEEccc------cccccCCCCccEEEEEEcccccccccccCCCcceEeecch---hhhccCceeccCceEEecccce Q lcl|Aclame:pro 301 RVRVMSAPE------LQGGNPDDGKDIAYMFLDSVDTAVDGSTDGGDTWAQLVQS---KFVTLGVEKRVKNYVEAYSNAT 371 (388) Q Consensus 301 nl~i~~~pe------l~~a~gtg~~~~~~~~~~~~d~~~~~~~~~~~t~~~~~p~---~~r~~~v~~~~~~~~~~~~~~t 371 (388) -+|-..|= .... .++++..++|.+-....... ....-.+.. -+. .|... ....-+..|. T Consensus 308 -~~l~G~pV~~~~~~~~~~--~~~~~~~i~~gd~~~~~~~~-~~~~~~i~~-~~~~~~~f~~~-------~~~~r~~~r~ 375 (408) T protein:vir:74 308 -YLIKGKQVIVVADRWLPN--SGSTVYPLYYGDMSQAITLF-DRENMSLLP-TNIGAGAFETD-------TTKIRVIDRF 375 (408) T ss_pred -ceecceeeEEecCccccc--ccCCcceEEEEehhccEEEE-EecceEEEE-eccccchhhcc-------eeeEEEEEee Confidence 12322221 1111 12222222332110000000 000001110 010 01111 1223344455 Q ss_pred eeeeeeccccceeeccC Q lcl|Aclame:pro 372 AGVMLKRPWAVVRLIGL 388 (388) Q Consensus 372 ~G~ii~rP~ai~~~~GI 388 (388) +| .+++|-||+..+.- T Consensus 376 d~-~~~~~~a~~~~~~~ 391 (408) T protein:vir:74 376 DV-KATDSEALVAGSFT 391 (408) T ss_pred Cc-EEecccceEEEEee Confidence 55 46669998887754 No 123 >protein:vir:1383 Length: 421 # NCBI annotation: major capsid protein # Family: family:all:21 # MgeID: mge:314 # MgeName: phi3626 # Cross-refs: genbank:acc:NP_612835;genbank:gi:20065969;genbank:GeneID:935826 Probab=38.44 E-value=1.1 Score=20.43 Aligned_cols=311 Identities=13% Similarity=0.040 Sum_probs=122.1 Q ss_pred CCCcce--------------eeeecCccccchhhhhhcccccccccCCHHHHhhcceecccchhhcchhhhhhhhhhhhc Q lcl|Aclame:pro 1 MKQLSK--------------VHQSLAGRSVRAFDMANGKADYRLTDMAVRELKKFGLVFDHATVKRQIELLHEGGVATQA 66 (388) Q Consensus 1 ~~~~~~--------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~g~~~~~~~~~~~~~~~~~~~~~~~a 66 (388) |....+ ......++...+..... ......+-++ . .+.......... T Consensus 54 i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----------~~~~~~~~~~-~---------~~~~~~~~~~~~ 113 (421) T protein:vir:13 54 MEIIEEEIESVMTAIDEERKNTNFTGGRVIINGDSKE----------EKRSLQLSAM-S---------KTIRGIQLSEEE 113 (421) T ss_pred HHHHHHHHHHHHHHHHHHHhhhcccccccccccchhH----------HHHHHHHHHH-H---------HhhhccchhHHH Confidence 111000 00000000000000000 0000000000 0 000000000001 Q ss_pred cCcccccccccccch--HHHHHHHhhcceeeeecccchhhhhhcccccCCCCceeeEEEeeeccccc--eEecccccCCc Q lcl|Aclame:pro 67 FDSAYVAPTTQASIP--TPIQFLQQWLPGFVKVLTSARKIDEILGVKTVGSWEDQEIVQGIVEPAGT--AMEYGDLTNIP 142 (388) Q Consensus 67 mDaa~~~~~t~~~~g--~l~~~l~~idp~v~e~l~~~~~~~~i~~v~t~g~w~~~t~~~~v~e~~G~--a~~ygd~~diP 142 (388) .+ ..+.++.| +|..+. +.|++.+......+.++.+..... .+..|++...... +...+....+| T Consensus 114 ra-----~~t~~~gg~liP~~~~----~~Ii~~~~~~~~l~~l~~~~~~~~---~~~~~~~~~~~~~~~~~~~~E~~~~~ 181 (421) T protein:vir:13 114 RD-----IMSSTNNGAVIPQEFV----NEFEKLKEGYPSLKEHCHVIPVNR---NAGKMPVRAGASVDKLANLAKDTELV 181 (421) T ss_pred hh-----ccccCCcceecchhhH----HHHHHHHHhhhhhhhhceeeeccC---CceEEEEeecCCccceeecccccccc Confidence 11 12233344 454433 345555544444555544332221 2233444322222 33456677888 Q ss_pred eeeeeeeeeeeeEEEEEEEEeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEEeecCccccceEEEeecCCCcc Q lcl|Aclame:pro 143 LSSWNVNFERRTIVRGEMGIQVGLLEEGRASAMRINSAEVKRQGAAVQLEIMRNAIGFYGWEGKNGNRTFGFLNDPSLLP 222 (388) Q Consensus 143 ~~~~n~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~K~~aAr~a~~~~~n~i~~~G~a~~~~~g~~GllN~P~l~a 222 (388) ..+.......-.++.+...+.++.+=|+-+ ..+|.+--....+.++..++|.-.. +...|+++.+.+ T Consensus 182 ~s~~~f~~i~~~~~k~~~~v~iS~ell~ds---~~~l~~~i~~~la~~~~~~~~~~i~--------~~~~g~~~~~~~-- 248 (421) T protein:vir:13 182 KAMLKTQPMAYDIDDYGLLAPIDNSLLEDS---EINFLEFVNEEFAEFAVNTENAEIV--------KQAKAVLAEETI-- 248 (421) T ss_pred ccccceeEEEeeeeeeEeehhhhHHHHhhh---HHHHHHHHHHHHHHHHHHHhhhhHh--------hhhhhccccccc-- Confidence 888888888888899988888887544332 3456666666666666666653221 122343332111 Q ss_pred ccccccCCcccccccCCHHHHHHHHHHHHHHHHHhcCCeeccccccceEEcCHHHHHhhccC-CCcCccHHHHHHHhCC- Q lcl|Aclame:pro 223 AIASTTPGGWVSGGANAFQGIVGDLRLMLITLRVQSEDNIDPEDVDITLVLPMNKVDMLSVV-TDLGISVRDWLKQTYP- 300 (388) Q Consensus 223 ~~~~~~~~~~t~Wa~kT~~eI~~DI~~~~~~l~~~s~g~v~~~~~p~tL~Lp~~~~~~Ls~~-~~~~~Tvl~~lk~n~p- 300 (388) .+ ++||.+++..+...-. ....++|.+..+..|... +..|.=++.-+....| T Consensus 249 ---------------~~----~d~i~~~~~~l~~~~~-------~~a~~v~n~~~~~~l~~lkd~~G~~i~~~~~~~~~~ 302 (421) T protein:vir:13 249 ---------------ND----YAGLVKTINSLVPNAR-------KRAIIVTNSDGRAYLDGLMDKQGRPLLKELSDGGDL 302 (421) T ss_pred ---------------cc----hHHHHHHHHHhhhhhc-------CCCEEEEcHHHHHHHHHhhcCCCceeecCcCCCCCc Confidence 12 5677777777754321 134789999999988643 4444433332221111 Q ss_pred ---ccEEEEccccccccCCCCccEEEEEEcccccccccccCCCcceEeecchh-hhc----cCceeccCceEEeccccee Q lcl|Aclame:pro 301 ---RVRVMSAPELQGGNPDDGKDIAYMFLDSVDTAVDGSTDGGDTWAQLVQSK-FVT----LGVEKRVKNYVEAYSNATA 372 (388) Q Consensus 301 ---nl~i~~~pel~~a~gtg~~~~~~~~~~~~d~~~~~~~~~~~t~~~~~p~~-~r~----~~v~~~~~~~~~~~~~~t~ 372 (388) ++.++..+.... ++ +++..++|.+-......+.. .+-++.. .... |.. +.+..+. .++....++.. T Consensus 303 tl~G~pV~~~~~~~~--~~-~~~~~~~~gd~~~~~~~~~~-~~~~v~~-~~~~~f~~~~~~~r~~~r~-d~~~~~~~a~~ 376 (421) T protein:vir:13 303 VFKGRPVIELEESIF--DV-GDETKFIVSDFKTLIKFMDR-KQYLIDQ-SKEAGYTKNETIARIIERF-DVNSPLDKSSD 376 (421) T ss_pred eecceeeEEeccccc--cC-CCceEEEEEeccccEEEEEe-cceEEEe-ecccccccCeeEEEEEeee-cceeecchhhh Confidence 223333332221 11 22333333221100101100 1111111 0111 111 1111110 12222233344 Q ss_pred eeeeeccccceeeccC Q lcl|Aclame:pro 373 GVMLKRPWAVVRLIGL 388 (388) Q Consensus 373 G~ii~rP~ai~~~~GI 388 (388) .+.+.+|-+++...+. T Consensus 377 ~~~~~~~~a~v~~~~~ 392 (421) T protein:vir:13 377 AEKIRKFGVIVKLQEV 392 (421) T ss_pred eeeecccceeeccccc Confidence 4555666667777666 No 124 >protein:vir:105822 Length: 273 # NCBI annotation: gp6 # Family: family:all:2203 # MgeID: mge:1636 # MgeName: PMC # Cross-refs: genbank:acc:YP_655767;genbank:gi:109522090;genbank:GeneID:4157630 Probab=37.05 E-value=1.1 Score=20.27 Aligned_cols=257 Identities=12% Similarity=0.028 Sum_probs=101.5 Q ss_pred cccchH-HHHHHHhhcceeeeecccchhhhhhcccccC--CCCceeeEEEeeeccccceEecccccCCceeeeeeeeeee Q lcl|Aclame:pro 77 QASIPT-PIQFLQQWLPGFVKVLTSARKIDEILGVKTV--GSWEDQEIVQGIVEPAGTAMEYGDLTNIPLSSWNVNFERR 153 (388) Q Consensus 77 ~~~~g~-l~~~l~~idp~v~e~l~~~~~~~~i~~v~t~--g~w~~~t~~~~v~e~~G~a~~ygd~~diP~~~~n~~~~~~ 153 (388) +++.-| |-.+...++ +.+........++..+.. +.. -.++.++.....+.+..-+....++.-+.+.++... T Consensus 1 MA~~~~~pe~~~~~v~----~~~~~~lv~~~l~~~~~~~~~~~-Gdtv~ip~~~~~~~~d~~~~~~~~~~~~~~~~~~~~ 75 (273) T protein:vir:10 1 MAFNNFIPELWSDMLL----EEWTAQTVFANLVNREYEGTASK-GNVVHIAGVVAPTVKDYKAAGRQTSADAISDTGVDL 75 (273) T ss_pred CcchhhhHHHHHHHHH----HHHHhhhccchhhcccccccccc-CceEEEeecccccccccccCCCccCccccccceEEE Confidence 233222 222333222 233233333344332211 222 257888876554433211222223333334444444 Q ss_pred eEEE-EEEEEeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEEeecCccccceEEEeecCCCccccccccCCcc Q lcl|Aclame:pro 154 TIVR-GEMGIQVGLLEEGRASAMRINSAEVKRQGAAVQLEIMRNAIGFYGWEGKNGNRTFGFLNDPSLLPAIASTTPGGW 232 (388) Q Consensus 154 ~v~~-~~~~~~y~~~El~~A~~~g~~l~~~K~~aAr~a~~~~~n~i~~~G~a~~~~~g~~GllN~P~l~a~~~~~~~~~~ 232 (388) ++-. -..++.+...|...+ . .++.. -...+..++...+++..+ + -+.+- .+...+ T Consensus 76 tid~~~~~~~~i~d~d~~~~--~-~~~~~-~~~~~~~alA~~vD~~i~-~-------~~~~a-----------~~~~~~- 131 (273) T protein:vir:10 76 LIDQEKSIDFLVDDIDRVQV--A-GSLEA-YTRAGATALATDTDKFIA-D-------MLVDN-----------GTALTG- 131 (273) T ss_pred EEeeeeecceEeecHHHhhh--h-ccHHH-HHHHHHHHHHHHHHHHHH-H-------HHhcc-----------cccccc- Confidence 4422 244455554443222 2 34533 233344556655554322 0 00010 000000 Q ss_pred cccccCCHHHHHHHHHHHHHHHHHhcCCeeccccccceEEcCHHHHHhhccCCCc----CccH-HHHHHHh----CCccE Q lcl|Aclame:pro 233 VSGGANAFQGIVGDLRLMLITLRVQSEDNIDPEDVDITLVLPMNKVDMLSVVTDL----GISV-RDWLKQT----YPRVR 303 (388) Q Consensus 233 t~Wa~kT~~eI~~DI~~~~~~l~~~s~g~v~~~~~p~tL~Lp~~~~~~Ls~~~~~----~~Tv-l~~lk~n----~pnl~ 303 (388) -+.-|++.+++.|.++...+-.+. + |+ ..-.|+++|..+..|.+.+++ ...- ..-+++- .-.++ T Consensus 132 --~~~~~~~~~~~~i~~a~~~ld~~~---v-P~-~~R~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~~G~ig~i~G~~ 204 (273) T protein:vir:10 132 --SAPTDADDAFDLIAKALKELTKAN---V-PN-VGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGAR 204 (273) T ss_pred --ccccchhHHHHHHHHHHHHhhhcC---C-Cc-CCCEEEECHHHHHHHhcchhhhhhhhccccccceeeeeeeEEeceE Confidence 123456678888888877775443 1 22 234799999999988442211 0000 0011110 11234 Q ss_pred EEEccccccccCCCCccEEEEEEcccccccccccCCCcceEeecchhhhccCceeccCceEEeccc-ceeeeeeeccccc Q lcl|Aclame:pro 304 VMSAPELQGGNPDDGKDIAYMFLDSVDTAVDGSTDGGDTWAQLVQSKFVTLGVEKRVKNYVEAYSN-ATAGVMLKRPWAV 382 (388) Q Consensus 304 i~~~pel~~a~gtg~~~~~~~~~~~~d~~~~~~~~~~~t~~~~~p~~~r~~~v~~~~~~~~~~~~~-~t~G~ii~rP~ai 382 (388) |.....|- .+.+..++.+.++-- .. ..|.. ++-.+..+ +.+-.-+.+ -..|+-|.||-++ T Consensus 205 v~~s~~lp----~~~~~~~~~~~~~A~----~~------a~q~~--~~e~~r~~---~~~~~~v~~~~~yg~~v~~~~~~ 265 (273) T protein:vir:10 205 IVESNNLR----DTDDEQFVAFHPSAA----AY------VSQID--TVEALRDQ---DSFSDRIRALHVYGGKVVRPTGV 265 (273) T ss_pred EEEecccc----cCCccEEEEEeccce----ee------eeeee--hhhcccCC---CcceeeeeeeeeeeeeEeccceE Confidence 44432331 122233344433210 00 01110 11111111 111111111 2478888889988 Q ss_pred eeeccC Q lcl|Aclame:pro 383 VRLIGL 388 (388) Q Consensus 383 ~~~~GI 388 (388) +.+.== T Consensus 266 ~~l~~~ 271 (273) T protein:vir:10 266 VVFNKT 271 (273) T ss_pred EEEecc Confidence 875433 No 125 >protein:vir:102605 Length: 273 # NCBI annotation: gp6 # Family: family:all:2203 # MgeID: mge:1661 # MgeName: Llij # Cross-refs: genbank:acc:YP_655002;genbank:gi:109392192;genbank:GeneID:4157227 Probab=37.05 E-value=1.1 Score=20.27 Aligned_cols=257 Identities=12% Similarity=0.028 Sum_probs=101.5 Q ss_pred cccchH-HHHHHHhhcceeeeecccchhhhhhcccccC--CCCceeeEEEeeeccccceEecccccCCceeeeeeeeeee Q lcl|Aclame:pro 77 QASIPT-PIQFLQQWLPGFVKVLTSARKIDEILGVKTV--GSWEDQEIVQGIVEPAGTAMEYGDLTNIPLSSWNVNFERR 153 (388) Q Consensus 77 ~~~~g~-l~~~l~~idp~v~e~l~~~~~~~~i~~v~t~--g~w~~~t~~~~v~e~~G~a~~ygd~~diP~~~~n~~~~~~ 153 (388) +++.-| |-.+...++ +.+........++..+.. +.. -.++.++.....+.+..-+....++.-+.+.++... T Consensus 1 MA~~~~~pe~~~~~v~----~~~~~~lv~~~l~~~~~~~~~~~-Gdtv~ip~~~~~~~~d~~~~~~~~~~~~~~~~~~~~ 75 (273) T protein:vir:10 1 MAFNNFIPELWSDMLL----EEWTAQTVFANLVNREYEGTASK-GNVVHIAGVVAPTVKDYKAAGRQTSADAISDTGVDL 75 (273) T ss_pred CcchhhhHHHHHHHHH----HHHHhhhccchhhcccccccccc-CceEEEeecccccccccccCCCccCccccccceEEE Confidence 233222 222333222 233233333344332211 222 257888876554433211222223333334444444 Q ss_pred eEEE-EEEEEeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEEeecCccccceEEEeecCCCccccccccCCcc Q lcl|Aclame:pro 154 TIVR-GEMGIQVGLLEEGRASAMRINSAEVKRQGAAVQLEIMRNAIGFYGWEGKNGNRTFGFLNDPSLLPAIASTTPGGW 232 (388) Q Consensus 154 ~v~~-~~~~~~y~~~El~~A~~~g~~l~~~K~~aAr~a~~~~~n~i~~~G~a~~~~~g~~GllN~P~l~a~~~~~~~~~~ 232 (388) ++-. -..++.+...|...+ . .++.. -...+..++...+++..+ + -+.+- .+...+ T Consensus 76 tid~~~~~~~~i~d~d~~~~--~-~~~~~-~~~~~~~alA~~vD~~i~-~-------~~~~a-----------~~~~~~- 131 (273) T protein:vir:10 76 LIDQEKSIDFLVDDIDRVQV--A-GSLEA-YTRAGATALATDTDKFIA-D-------MLVDN-----------GTALTG- 131 (273) T ss_pred EEeeeeecceEeecHHHhhh--h-ccHHH-HHHHHHHHHHHHHHHHHH-H-------HHhcc-----------cccccc- Confidence 4422 244455554443222 2 34533 233344556655554322 0 00010 000000 Q ss_pred cccccCCHHHHHHHHHHHHHHHHHhcCCeeccccccceEEcCHHHHHhhccCCCc----CccH-HHHHHHh----CCccE Q lcl|Aclame:pro 233 VSGGANAFQGIVGDLRLMLITLRVQSEDNIDPEDVDITLVLPMNKVDMLSVVTDL----GISV-RDWLKQT----YPRVR 303 (388) Q Consensus 233 t~Wa~kT~~eI~~DI~~~~~~l~~~s~g~v~~~~~p~tL~Lp~~~~~~Ls~~~~~----~~Tv-l~~lk~n----~pnl~ 303 (388) -+.-|++.+++.|.++...+-.+. + |+ ..-.|+++|..+..|.+.+++ ...- ..-+++- .-.++ T Consensus 132 --~~~~~~~~~~~~i~~a~~~ld~~~---v-P~-~~R~lvv~p~~~~~L~~~~~~~~~~~~~~~~~~l~~G~ig~i~G~~ 204 (273) T protein:vir:10 132 --SAPTDADDAFDLIAKALKELTKAN---V-PN-VGRVVVVNAEMAFWLRSSGSKLTSADTSGDAAGLRAGTIGNLLGAR 204 (273) T ss_pred --ccccchhHHHHHHHHHHHHhhhcC---C-Cc-CCCEEEECHHHHHHHhcchhhhhhhhccccccceeeeeeeEEeceE Confidence 123456678888888877775443 1 22 234799999999988442211 0000 0011110 11234 Q ss_pred EEEccccccccCCCCccEEEEEEcccccccccccCCCcceEeecchhhhccCceeccCceEEeccc-ceeeeeeeccccc Q lcl|Aclame:pro 304 VMSAPELQGGNPDDGKDIAYMFLDSVDTAVDGSTDGGDTWAQLVQSKFVTLGVEKRVKNYVEAYSN-ATAGVMLKRPWAV 382 (388) Q Consensus 304 i~~~pel~~a~gtg~~~~~~~~~~~~d~~~~~~~~~~~t~~~~~p~~~r~~~v~~~~~~~~~~~~~-~t~G~ii~rP~ai 382 (388) |.....|- .+.+..++.+.++-- .. ..|.. ++-.+..+ +.+-.-+.+ -..|+-|.||-++ T Consensus 205 v~~s~~lp----~~~~~~~~~~~~~A~----~~------a~q~~--~~e~~r~~---~~~~~~v~~~~~yg~~v~~~~~~ 265 (273) T protein:vir:10 205 IVESNNLR----DTDDEQFVAFHPSAA----AY------VSQID--TVEALRDQ---DSFSDRIRALHVYGGKVVRPTGV 265 (273) T ss_pred EEEecccc----cCCccEEEEEeccce----ee------eeeee--hhhcccCC---CcceeeeeeeeeeeeeEeccceE Confidence 44432331 122233344433210 00 01110 11111111 111111111 2478888889988 Q ss_pred eeeccC Q lcl|Aclame:pro 383 VRLIGL 388 (388) Q Consensus 383 ~~~~GI 388 (388) +.+.== T Consensus 266 ~~l~~~ 271 (273) T protein:vir:10 266 VVFNKT 271 (273) T ss_pred EEEecc Confidence 875433 No 126 >protein:vir:96490 Length: 348 # NCBI annotation: head protein # Family: family:all:1083 # MgeID: mge:1620 # MgeName: 2972 # Cross-refs: genbank:acc:YP_238492;genbank:gi:66391768;genbank:GeneID:5176912 Probab=35.39 E-value=1.2 Score=20.08 Aligned_cols=287 Identities=11% Similarity=0.036 Sum_probs=77.6 Q ss_pred hccCcccccccccccchHHHHHHHhhcceeeeecccchhhhhhcccccCCCCceeeEEEee-eccccceEecccc-cCCc Q lcl|Aclame:pro 65 QAFDSAYVAPTTQASIPTPIQFLQQWLPGFVKVLTSARKIDEILGVKTVGSWEDQEIVQGI-VEPAGTAMEYGDL-TNIP 142 (388) Q Consensus 65 ~amDaa~~~~~t~~~~g~l~~~l~~idp~v~e~l~~~~~~~~i~~v~t~g~w~~~t~~~~v-~e~~G~a~~ygd~-~diP 142 (388) |+.=.++ .++.. +....+.+. ..+.++-...+||....-+ ..+.+-. ......+.++-.. ..-| T Consensus 1 M~~i~d~---f~~~~---l~~~i~~~~-----~~~~~~l~~~~Fp~~~~~~---~~~~~~~~~~~~~~~a~~v~~~~~~~ 66 (348) T protein:vir:96 1 MGLIYDK---VTASN---IAGYFNTLQ-----ENVDSTLGESIFPARKQLG---TKLSYIKGASGQSVALKAAAFDTNVT 66 (348) T ss_pred Ccchhhc---cCHHH---HHHHHHhcc-----cchhhhhhhhcCCCccccc---eeEEEEeecCCceeEeeeecCCCCcc Confidence 1100001 11111 111111111 1122222345566432111 0111111 1111111111111 0111 Q ss_pred eee-eeeeeeeeeEEEEEEEEeecHHHH---HHHHHhCCCh------------HHHHHHHHHHHHHHhhceEEEEeecCc Q lcl|Aclame:pro 143 LSS-WNVNFERRTIVRGEMGIQVGLLEE---GRASAMRINS------------AEVKRQGAAVQLEIMRNAIGFYGWEGK 206 (388) Q Consensus 143 ~~~-~n~~~~~~~v~~~~~~~~y~~~El---~~A~~~g~~l------------~~~K~~aAr~a~~~~~n~i~~~G~a~~ 206 (388) ..+ -..+..+..+-.+......+..|+ +.+...+-+- ..+...++++-.|...-+..+.|--.. T Consensus 67 ~~~r~~~~~~~~~~p~i~~~~~i~~~d~~~l~~~~~~~~~~~~~~~~~~i~~d~~~l~~~i~~r~E~m~~qal~~Gki~~ 146 (348) T protein:vir:96 67 IRDRVSAEIHDEQMPFFKEALLVKENDRQQLNLVKDTGNEALINTIVAGIFNDDVTLINGARARLEAMRMQVLATGKIAF 146 (348) T ss_pred eecccceeeeeeecCccccccccCHHHHHHHHhhhccCCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCeeEe Confidence 111 001111111111222233343333 2222222111 111223344444444444333331100 Q ss_pred cccceEEEeecCCCccccccccCCcccccccCCHHHHHHHHHHHHHHHHHhcCCeeccccccceEEcCHHHHHhhccCCC Q lcl|Aclame:pro 207 NGNRTFGFLNDPSLLPAIASTTPGGWVSGGANAFQGIVGDLRLMLITLRVQSEDNIDPEDVDITLVLPMNKVDMLSVVTD 286 (388) Q Consensus 207 ~~~g~~GllN~P~l~a~~~~~~~~~~t~Wa~kT~~eI~~DI~~~~~~l~~~s~g~v~~~~~p~tL~Lp~~~~~~Ls~~~~ 286 (388) ...|..-.+.. +.++...-+ ...+|.+++. .++.||.++...+. .+ |. .|.+++|.+..+..|.+- T Consensus 147 ~~~~~~~~vdf-g~~~~~~~t---~~~~W~~~~a-dp~~di~~~~~~~~-~~-G~-----~~~~~i~~~~~~~~l~~~-- 212 (348) T protein:vir:96 147 TSDGVNKDIDY-GVKADHKKQ---VSKSWAEPGA-TPLADLEDAIETAR-EL-GL-----NPERAIMNAKTFGLIRKA-- 212 (348) T ss_pred ecCCeeEEEec-cCCccccee---eccccCCCCC-CHHHHHHHHHHHHH-hc-CC-----cccEEEeCHHHHHHHhcC-- Confidence 00111101111 222221111 1235887655 59999999987765 34 42 466899999999998431 Q ss_pred cCccHHHHHHHhCCccEEEEccccccccCCCCccEEEEEEcccccccccccCCCcceEeecchhhh-----------ccC Q lcl|Aclame:pro 287 LGISVRDWLKQTYPRVRVMSAPELQGGNPDDGKDIAYMFLDSVDTAVDGSTDGGDTWAQLVQSKFV-----------TLG 355 (388) Q Consensus 287 ~~~Tvl~~lk~n~pnl~i~~~pel~~a~gtg~~~~~~~~~~~~d~~~~~~~~~~~t~~~~~p~~~r-----------~~~ 355 (388) ..|.+.++-...+...++..++...-++-++=.++.|...+- . +.. +....+|+..- .++ T Consensus 213 --~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~i~~y~~~y~-d-----~~G-~~~~~~p~~~v~l~~~~~~G~~~yg 283 (348) T protein:vir:96 213 --ASTVKAIKPLAGDGSSVTKAELQNYVADNYGVEIVLENGTYR-N-----EKG-EVSKFFPDGHLTLIPNGPLGNTVFG 283 (348) T ss_pred --HHHHHHHhccCCccccccHHHHHHHHhhhcCceEEEEccEEE-e-----cCC-cEeccccCCeEEEEcCCCceeEEec Confidence 112222221111111111111111100000101112211110 0 000 00111221110 111 Q ss_pred ceeccCc--e----EEecccceeeeeee-----ccccc---eeeccC Q lcl|Aclame:pro 356 VEKRVKN--Y----VEAYSNATAGVMLK-----RPWAV---VRLIGL 388 (388) Q Consensus 356 v~~~~~~--~----~~~~~~~t~G~ii~-----rP~ai---~~~~GI 388 (388) ..++... + .......-.|..++ .|... +....+ T Consensus 284 ~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~dP~~~~~~~~s~pl 330 (348) T protein:vir:96 284 TTPEESDLFADNTVNADVEIVDSGIAVTTTKTTDPVNVQTKVSMVAL 330 (348) T ss_pred cChhhhhhhhcccccccceecCCeeEEEeeecCCCceEEEEEeeeee Confidence 1100000 0 00000001111111 11111 111111 No 127 >protein:vir:9927 Length: 295 # NCBI annotation: hypothetical protein # Family: family:all:1178 # MgeID: mge:178 # MgeName: 315.6 # Cross-refs: genbank:acc:NP_795689;genbank:gi:28876459;genbank:GeneID:1258000 Probab=34.33 E-value=1.3 Score=19.96 Aligned_cols=259 Identities=15% Similarity=0.061 Sum_probs=114.9 Q ss_pred hhhh-ccCcccccccccccchHHHHHHHhhcceeeeecccchhhhhhcccccCCCCc-eeeEEEeeeccccceEeccccc Q lcl|Aclame:pro 62 VATQ-AFDSAYVAPTTQASIPTPIQFLQQWLPGFVKVLTSARKIDEILGVKTVGSWE-DQEIVQGIVEPAGTAMEYGDLT 139 (388) Q Consensus 62 ~~~~-amDaa~~~~~t~~~~g~l~~~l~~idp~v~e~l~~~~~~~~i~~v~t~g~w~-~~t~~~~v~e~~G~a~~ygd~~ 139 (388) ||.- --.++. ....-++-|-.+|-..|+ ...++++|...-+-. -.+++++-++..|.+.-++.+. T Consensus 1 mAe~nlt~~~d--L~~~~sidfv~~f~~~i~-----------~L~~~Lgi~r~~p~a~G~tIt~pK~~~tgda~dVaEGe 67 (295) T protein:vir:99 1 MAEKNLNTMAD--LGDIKSIDFVNKFSKNIN-----------DLLKLLGVTRRETLTNDLKIQTYKWEVTLDQTDPGEGE 67 (295) T ss_pred CCCcccccHhh--ccCceeehhhHHhhhhHH-----------HHHHHhccccccccccCCeEEeeeeeeecccccccCCc Confidence 1100 000010 111123334444433222 223445554443332 3678888899999999999999 Q ss_pred CCceeeeeeee---eeeeEEEEEEEEeecHHHHHHHHHhCCChH-HHHHHHHHHHHHHhhceEEEEeecCccccceEEEe Q lcl|Aclame:pro 140 NIPLSSWNVNF---ERRTIVRGEMGIQVGLLEEGRASAMRINSA-EVKRQGAAVQLEIMRNAIGFYGWEGKNGNRTFGFL 215 (388) Q Consensus 140 diP~~~~n~~~---~~~~v~~~~~~~~y~~~El~~A~~~g~~l~-~~K~~aAr~a~~~~~n~i~~~G~a~~~~~g~~Gll 215 (388) .||+..+.... ...++..+.-+ ++.+-+++. |...+ .+-...-++++..++++-.|- .| T Consensus 68 ~Iplskvt~~~~~t~t~kikK~rK~--tTdEAIqls---Gygdpvgead~qL~~~ia~kId~D~~~------------~l 130 (295) T protein:vir:99 68 TIPLSKVTRTKDKDYTVKWFKKRRA--TTAEAIARH---GAARAITEADKRIMRELQNGIKDAFFT------------FL 130 (295) T ss_pred ccchhhheeeeeeeeEEEeeeeccc--ccHHHHHhc---CCCchhHHHHHHHHHHHHHhhhHHHHH------------Hh Confidence 99999999875 33445555444 455444333 43322 222233455566655543330 01 Q ss_pred ecCCCccccccccCCcccccccCCHHHHHHHHHHHHHHHHHhcCCeeccccccceEEc-CHHHHHhhcc-------CCCc Q lcl|Aclame:pro 216 NDPSLLPAIASTTPGGWVSGGANAFQGIVGDLRLMLITLRVQSEDNIDPEDVDITLVL-PMNKVDMLSV-------VTDL 287 (388) Q Consensus 216 N~P~l~a~~~~~~~~~~t~Wa~kT~~eI~~DI~~~~~~l~~~s~g~v~~~~~p~tL~L-p~~~~~~Ls~-------~~~~ 287 (388) - +.+.. . +. +.+...+..+|.+-....+..+.+..+.+ |.+...||.. .+++ T Consensus 131 k--------tat~t------~--tg----~~lq~a~a~~~~al~~f~Ee~~~~~V~FVnP~D~a~yl~~A~~~~~~a~~f 190 (295) T protein:vir:99 131 K--------TKPTK------V--KG----VGLQKALSASWAKLATFNEFEGSPLVSFVSPLDVANYLGDTKVGADASNVF 190 (295) T ss_pred c--------cCcee------e--eh----hhHHHHHHHhhhhhhhcccccCCceEEEEehHHHHHHHhccccccchhhhh Confidence 0 00110 1 11 23344555555554433333334545555 5555666642 2447 Q ss_pred CccHHHHHHHhCCccE-EEEccccccc--cCCCCccEEEEEEccc----ccccccccCCCcceEeecchhhhccCceecc Q lcl|Aclame:pro 288 GISVRDWLKQTYPRVR-VMSAPELQGG--NPDDGKDIAYMFLDSV----DTAVDGSTDGGDTWAQLVQSKFVTLGVEKRV 360 (388) Q Consensus 288 ~~Tvl~~lk~n~pnl~-i~~~pel~~a--~gtg~~~~~~~~~~~~----d~~~~~~~~~~~t~~~~~p~~~r~~~v~~~~ 360 (388) |.++|+ ||-+++ |+..+++... -.+-..+..+.|++-- ...++...| ++....| .|....+. T Consensus 191 G~~~L~----nfLG~q~II~S~kv~~G~~~aT~~~Ni~~ay~~~~~g~l~~~f~~~~D--~tglIg~-----~h~~~~~~ 259 (295) T protein:vir:99 191 GMTLLK----NFLGMQNVIVMPSVPEGKIYSTAVENLVFASLNVKGGDLGGLFADFTD--ETGLIAA-----ARNRQLSN 259 (295) T ss_pred hhhhhh----hhhccceEEEcccCCCceEEEeeccceEEEEecCCchhhhhhhhhccC--cccceEE-----Eeccccce Confidence 887775 666775 7777666531 1334556666665421 111111111 1111111 12222222 Q ss_pred CceEEecccceeeeeeeccccceeeccC Q lcl|Aclame:pro 361 KNYVEAYSNATAGVMLKRPWAVVRLIGL 388 (388) Q Consensus 361 ~~~~~~~~~~t~G~ii~rP~ai~~~~GI 388 (388) ..++- -...|+. ..|. +.+|| T Consensus 260 ~t~et---~~~~~~~-lfpE---~~dgi 280 (295) T protein:vir:99 260 LTYES---VFFGANV-LFAE---IPEGV 280 (295) T ss_pred eeehh---hhHhHHH-hccc---ccceE Confidence 22211 0111222 2232 23344 No 128 >protein:vir:102873 Length: 392 # NCBI annotation: major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1492 # MgeName: Cherry # Cross-refs: genbank:acc:YP_338137;genbank:gi:77020198;genbank:GeneID:3703782 Probab=29.05 E-value=1.7 Score=19.33 Aligned_cols=333 Identities=10% Similarity=-0.029 Sum_probs=129.9 Q ss_pred CCCcceeeeecCccccchhhh-----hhcccccccccCCHHHHhhccee-cccchhhcchhhhhhhhhhhhccCcccccc Q lcl|Aclame:pro 1 MKQLSKVHQSLAGRSVRAFDM-----ANGKADYRLTDMAVRELKKFGLV-FDHATVKRQIELLHEGGVATQAFDSAYVAP 74 (388) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~l~~~g~~-~~~~~~~~~~~~~~~~~~~~~amDaa~~~~ 74 (388) +.++...-..+- |.-+..+. .+......-......+.++.... +.+.. ...+... .........+. .. T Consensus 35 ~~e~~~l~~~i~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~--~~~~~~~-~~~~~~~~~~~--~~ 108 (392) T protein:vir:10 35 MEEVRSLQKKID-LQRSLDEAETEERNNGREVETRNVDGEMEYRDVFMKALRNKP--LNAEERE-FLEDDLEQRAM--SG 108 (392) T ss_pred HHHHHHHHHHHH-HHHHHHHHHHHHhhccccccccCccchHHHHHHHHHHHhccc--ccHHHHH-HHhhhhhhhhc--cc Confidence 000000000000 00000000 00000000000011111110000 00000 0000000 00000001111 11 Q ss_pred cccccchHHHHHHHhhcceeeeecccchhhhhhcccccCCCCceeeEEEeeeccccceEecccccCCceee-eeeeeeee Q lcl|Aclame:pro 75 TTQASIPTPIQFLQQWLPGFVKVLTSARKIDEILGVKTVGSWEDQEIVQGIVEPAGTAMEYGDLTNIPLSS-WNVNFERR 153 (388) Q Consensus 75 ~t~~~~g~l~~~l~~idp~v~e~l~~~~~~~~i~~v~t~g~w~~~t~~~~v~e~~G~a~~ygd~~diP~~~-~n~~~~~~ 153 (388) .|.++.|+++- +.+.++|++.+......+.+.++.....-. ....+......+.+...+....+|-.+ ...+.... T Consensus 109 ~t~~~gg~~vP--~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~-~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~v~l 185 (392) T protein:vir:10 109 LTGEDGGLVIP--QDIQTQINELARSFDALEQYVTVEPVRTRS-GSRVLEKNSDMIPFAEITEMGEIPETDNPKFSNVQY 185 (392) T ss_pred cccCCCceecc--hhHHHHHHHHHHhhhhhhhhceeeeccCCc-eeEEEEeecCCccceeecccccccccccccceeEEe Confidence 13334444332 123356677776666666665543322111 122333333444566777777887554 57788888 Q ss_pred eEEEEEEEEeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEEeecCccccceEEEeecCCCccccccccCCccc Q lcl|Aclame:pro 154 TIVRGEMGIQVGLLEEGRASAMRINSAEVKRQGAAVQLEIMRNAIGFYGWEGKNGNRTFGFLNDPSLLPAIASTTPGGWV 233 (388) Q Consensus 154 ~v~~~~~~~~y~~~El~~A~~~g~~l~~~K~~aAr~a~~~~~n~i~~~G~a~~~~~g~~GllN~P~l~a~~~~~~~~~~t 233 (388) ..+.+...+.++.+=|+.+ ..+|.+.-....+.++...+|.-.+.|+... . +.+ T Consensus 186 ~~~k~~~~~~iS~ell~ds---~~~l~~~i~~~l~~~i~~~~d~~~~~g~g~~---------~---------~~~----- 239 (392) T protein:vir:10 186 AVKDRAGILPLSRSLLQDS---DQNILKYVTKWLGKKSKVTRNVLILGVIEKL---------T---------KQA----- 239 (392) T ss_pred eeeeEEEeehhhHHHHhhh---HHHHHHHHHHHHHHHHHHHHHHHHhhccccc---------c---------ccC----- Confidence 8899999999998655443 4578888888888888888888777765311 0 000 Q ss_pred ccccCCHHHHHHHHHHHHHHHHHhcCCeeccccccceEEcCHHHHHhhccC-CCcCccHHHH-HHHhCC----ccEEEE- Q lcl|Aclame:pro 234 SGGANAFQGIVGDLRLMLITLRVQSEDNIDPEDVDITLVLPMNKVDMLSVV-TDLGISVRDW-LKQTYP----RVRVMS- 306 (388) Q Consensus 234 ~Wa~kT~~eI~~DI~~~~~~l~~~s~g~v~~~~~p~tL~Lp~~~~~~Ls~~-~~~~~Tvl~~-lk~n~p----nl~i~~- 306 (388) ..+ ++||..+++...... +.. .-.++|.++.+..|.+- +..|.-++.- +....+ +..++. T Consensus 240 ---~~~----~d~i~~~~~~~l~~~---~~~---~a~~vm~~~~~~~L~~lkd~~G~~l~~~~~~~~~~~tllG~~~v~~ 306 (392) T protein:vir:10 240 ---IKS----LDDIKDVLNVKLDPA---ISP---NAILLTNQDGFNYLDKLKDKDGKYILQSDPTQKNKKLFAGTNPVVV 306 (392) T ss_pred ---ccC----HHHHHHHHHHhhhhh---hcc---CCEEEEcHHHHHHHHHhhccCCCeEeecCccCCccccccCcccEEE Confidence 112 234444443221111 111 13589999999888542 3333322211 111111 111111 Q ss_pred cc-ccccccCCCCccEEEEEEcccccccccccCCCcceEeecchhhhccCceeccCceEEecccceeeeeeeccccceee Q lcl|Aclame:pro 307 AP-ELQGGNPDDGKDIAYMFLDSVDTAVDGSTDGGDTWAQLVQSKFVTLGVEKRVKNYVEAYSNATAGVMLKRPWAVVRL 385 (388) Q Consensus 307 ~p-el~~a~gtg~~~~~~~~~~~~d~~~~~~~~~~~t~~~~~p~~~r~~~v~~~~~~~~~~~~~~t~G~ii~rP~ai~~~ 385 (388) .. .+-...+.+.++..++|.+=.......... +-++.. -+.. ..-...-.....+..|.+| .+++|-+|+.+ T Consensus 307 ~~~~~~~~~~~~~~~~~~~~gdfs~~~~i~~~~-~~~~~~-~~~~----~~~f~~~~~~~r~~~r~d~-~v~~~~a~~~l 379 (392) T protein:vir:10 307 VSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKRE-DMELAS-TDVG----GKAFTRNTLDLRAIQRDDV-QMWDNEAAVYG 379 (392) T ss_pred ecccccCCCcccCCceEEEEEehhceEEEEeec-ceEEEE-eccc----cchhhcCceEEEEEEeecc-EEecccceEEE Confidence 10 111111334445444543210000000000 001110 0100 0000001122445556655 55669999998 Q ss_pred ccC Q lcl|Aclame:pro 386 IGL 388 (388) Q Consensus 386 ~GI 388 (388) ..= T Consensus 380 ~~~ 382 (392) T protein:vir:10 380 EID 382 (392) T ss_pred Eec Confidence 765 No 129 >protein:vir:102082 Length: 392 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:1503 # MgeName: Fah # Cross-refs: genbank:acc:YP_512315;genbank:gi:89152484;genbank:GeneID:3953075 Probab=29.05 E-value=1.7 Score=19.33 Aligned_cols=333 Identities=10% Similarity=-0.029 Sum_probs=129.9 Q ss_pred CCCcceeeeecCccccchhhh-----hhcccccccccCCHHHHhhccee-cccchhhcchhhhhhhhhhhhccCcccccc Q lcl|Aclame:pro 1 MKQLSKVHQSLAGRSVRAFDM-----ANGKADYRLTDMAVRELKKFGLV-FDHATVKRQIELLHEGGVATQAFDSAYVAP 74 (388) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~l~~~g~~-~~~~~~~~~~~~~~~~~~~~~amDaa~~~~ 74 (388) +.++...-..+- |.-+..+. .+......-......+.++.... +.+.. ...+... .........+. .. T Consensus 35 ~~e~~~l~~~i~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~--~~~~~~~-~~~~~~~~~~~--~~ 108 (392) T protein:vir:10 35 MEEVRSLQKKID-LQRSLDEAETEERNNGREVETRNVDGEMEYRDVFMKALRNKP--LNAEERE-FLEDDLEQRAM--SG 108 (392) T ss_pred HHHHHHHHHHHH-HHHHHHHHHHHHhhccccccccCccchHHHHHHHHHHHhccc--ccHHHHH-HHhhhhhhhhc--cc Confidence 000000000000 00000000 00000000000011111110000 00000 0000000 00000001111 11 Q ss_pred cccccchHHHHHHHhhcceeeeecccchhhhhhcccccCCCCceeeEEEeeeccccceEecccccCCceee-eeeeeeee Q lcl|Aclame:pro 75 TTQASIPTPIQFLQQWLPGFVKVLTSARKIDEILGVKTVGSWEDQEIVQGIVEPAGTAMEYGDLTNIPLSS-WNVNFERR 153 (388) Q Consensus 75 ~t~~~~g~l~~~l~~idp~v~e~l~~~~~~~~i~~v~t~g~w~~~t~~~~v~e~~G~a~~ygd~~diP~~~-~n~~~~~~ 153 (388) .|.++.|+++- +.+.++|++.+......+.+.++.....-. ....+......+.+...+....+|-.+ ...+.... T Consensus 109 ~t~~~gg~~vP--~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~-~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~v~l 185 (392) T protein:vir:10 109 LTGEDGGLVIP--QDIQTQINELARSFDALEQYVTVEPVRTRS-GSRVLEKNSDMIPFAEITEMGEIPETDNPKFSNVQY 185 (392) T ss_pred cccCCCceecc--hhHHHHHHHHHHhhhhhhhhceeeeccCCc-eeEEEEeecCCccceeecccccccccccccceeEEe Confidence 13334444332 123356677776666666665543322111 122333333444566777777887554 57788888 Q ss_pred eEEEEEEEEeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEEeecCccccceEEEeecCCCccccccccCCccc Q lcl|Aclame:pro 154 TIVRGEMGIQVGLLEEGRASAMRINSAEVKRQGAAVQLEIMRNAIGFYGWEGKNGNRTFGFLNDPSLLPAIASTTPGGWV 233 (388) Q Consensus 154 ~v~~~~~~~~y~~~El~~A~~~g~~l~~~K~~aAr~a~~~~~n~i~~~G~a~~~~~g~~GllN~P~l~a~~~~~~~~~~t 233 (388) ..+.+...+.++.+=|+.+ ..+|.+.-....+.++...+|.-.+.|+... . +.+ T Consensus 186 ~~~k~~~~~~iS~ell~ds---~~~l~~~i~~~l~~~i~~~~d~~~~~g~g~~---------~---------~~~----- 239 (392) T protein:vir:10 186 AVKDRAGILPLSRSLLQDS---DQNILKYVTKWLGKKSKVTRNVLILGVIEKL---------T---------KQA----- 239 (392) T ss_pred eeeeEEEeehhhHHHHhhh---HHHHHHHHHHHHHHHHHHHHHHHHhhccccc---------c---------ccC----- Confidence 8899999999998655443 4578888888888888888888777765311 0 000 Q ss_pred ccccCCHHHHHHHHHHHHHHHHHhcCCeeccccccceEEcCHHHHHhhccC-CCcCccHHHH-HHHhCC----ccEEEE- Q lcl|Aclame:pro 234 SGGANAFQGIVGDLRLMLITLRVQSEDNIDPEDVDITLVLPMNKVDMLSVV-TDLGISVRDW-LKQTYP----RVRVMS- 306 (388) Q Consensus 234 ~Wa~kT~~eI~~DI~~~~~~l~~~s~g~v~~~~~p~tL~Lp~~~~~~Ls~~-~~~~~Tvl~~-lk~n~p----nl~i~~- 306 (388) ..+ ++||..+++...... +.. .-.++|.++.+..|.+- +..|.-++.- +....+ +..++. T Consensus 240 ---~~~----~d~i~~~~~~~l~~~---~~~---~a~~vm~~~~~~~L~~lkd~~G~~l~~~~~~~~~~~tllG~~~v~~ 306 (392) T protein:vir:10 240 ---IKS----LDDIKDVLNVKLDPA---ISP---NAILLTNQDGFNYLDKLKDKDGKYILQSDPTQKNKKLFAGTNPVVV 306 (392) T ss_pred ---ccC----HHHHHHHHHHhhhhh---hcc---CCEEEEcHHHHHHHHHhhccCCCeEeecCccCCccccccCcccEEE Confidence 112 234444443221111 111 13589999999888542 3333322211 111111 111111 Q ss_pred cc-ccccccCCCCccEEEEEEcccccccccccCCCcceEeecchhhhccCceeccCceEEecccceeeeeeeccccceee Q lcl|Aclame:pro 307 AP-ELQGGNPDDGKDIAYMFLDSVDTAVDGSTDGGDTWAQLVQSKFVTLGVEKRVKNYVEAYSNATAGVMLKRPWAVVRL 385 (388) Q Consensus 307 ~p-el~~a~gtg~~~~~~~~~~~~d~~~~~~~~~~~t~~~~~p~~~r~~~v~~~~~~~~~~~~~~t~G~ii~rP~ai~~~ 385 (388) .. .+-...+.+.++..++|.+=.......... +-++.. -+.. ..-...-.....+..|.+| .+++|-+|+.+ T Consensus 307 ~~~~~~~~~~~~~~~~~~~~gdfs~~~~i~~~~-~~~~~~-~~~~----~~~f~~~~~~~r~~~r~d~-~v~~~~a~~~l 379 (392) T protein:vir:10 307 VSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKRE-DMELAS-TDVG----GKAFTRNTLDLRAIQRDDV-QMWDNEAAVYG 379 (392) T ss_pred ecccccCCCcccCCceEEEEEehhceEEEEeec-ceEEEE-eccc----cchhhcCceEEEEEEeecc-EEecccceEEE Confidence 10 111111334445444543210000000000 001110 0100 0000001122445556655 55669999998 Q ss_pred ccC Q lcl|Aclame:pro 386 IGL 388 (388) Q Consensus 386 ~GI 388 (388) ..= T Consensus 380 ~~~ 382 (392) T protein:vir:10 380 EID 382 (392) T ss_pred Eec Confidence 765 No 130 >protein:vir:105004 Length: 392 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:1490 # MgeName: W Beta # Cross-refs: genbank:acc:YP_459969;genbank:gi:85701384;genbank:GeneID:3882145 Probab=29.05 E-value=1.7 Score=19.33 Aligned_cols=333 Identities=10% Similarity=-0.029 Sum_probs=129.9 Q ss_pred CCCcceeeeecCccccchhhh-----hhcccccccccCCHHHHhhccee-cccchhhcchhhhhhhhhhhhccCcccccc Q lcl|Aclame:pro 1 MKQLSKVHQSLAGRSVRAFDM-----ANGKADYRLTDMAVRELKKFGLV-FDHATVKRQIELLHEGGVATQAFDSAYVAP 74 (388) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~l~~~g~~-~~~~~~~~~~~~~~~~~~~~~amDaa~~~~ 74 (388) +.++...-..+- |.-+..+. .+......-......+.++.... +.+.. ...+... .........+. .. T Consensus 35 ~~e~~~l~~~i~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~--~~~~~~~-~~~~~~~~~~~--~~ 108 (392) T protein:vir:10 35 MEEVRSLQKKID-LQRSLDEAETEERNNGREVETRNVDGEMEYRDVFMKALRNKP--LNAEERE-FLEDDLEQRAM--SG 108 (392) T ss_pred HHHHHHHHHHHH-HHHHHHHHHHHHhhccccccccCccchHHHHHHHHHHHhccc--ccHHHHH-HHhhhhhhhhc--cc Confidence 000000000000 00000000 00000000000011111110000 00000 0000000 00000001111 11 Q ss_pred cccccchHHHHHHHhhcceeeeecccchhhhhhcccccCCCCceeeEEEeeeccccceEecccccCCceee-eeeeeeee Q lcl|Aclame:pro 75 TTQASIPTPIQFLQQWLPGFVKVLTSARKIDEILGVKTVGSWEDQEIVQGIVEPAGTAMEYGDLTNIPLSS-WNVNFERR 153 (388) Q Consensus 75 ~t~~~~g~l~~~l~~idp~v~e~l~~~~~~~~i~~v~t~g~w~~~t~~~~v~e~~G~a~~ygd~~diP~~~-~n~~~~~~ 153 (388) .|.++.|+++- +.+.++|++.+......+.+.++.....-. ....+......+.+...+....+|-.+ ...+.... T Consensus 109 ~t~~~gg~~vP--~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~-~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~v~l 185 (392) T protein:vir:10 109 LTGEDGGLVIP--QDIQTQINELARSFDALEQYVTVEPVRTRS-GSRVLEKNSDMIPFAEITEMGEIPETDNPKFSNVQY 185 (392) T ss_pred cccCCCceecc--hhHHHHHHHHHHhhhhhhhhceeeeccCCc-eeEEEEeecCCccceeecccccccccccccceeEEe Confidence 13334444332 123356677776666666665543322111 122333333444566777777887554 57788888 Q ss_pred eEEEEEEEEeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEEeecCccccceEEEeecCCCccccccccCCccc Q lcl|Aclame:pro 154 TIVRGEMGIQVGLLEEGRASAMRINSAEVKRQGAAVQLEIMRNAIGFYGWEGKNGNRTFGFLNDPSLLPAIASTTPGGWV 233 (388) Q Consensus 154 ~v~~~~~~~~y~~~El~~A~~~g~~l~~~K~~aAr~a~~~~~n~i~~~G~a~~~~~g~~GllN~P~l~a~~~~~~~~~~t 233 (388) ..+.+...+.++.+=|+.+ ..+|.+.-....+.++...+|.-.+.|+... . +.+ T Consensus 186 ~~~k~~~~~~iS~ell~ds---~~~l~~~i~~~l~~~i~~~~d~~~~~g~g~~---------~---------~~~----- 239 (392) T protein:vir:10 186 AVKDRAGILPLSRSLLQDS---DQNILKYVTKWLGKKSKVTRNVLILGVIEKL---------T---------KQA----- 239 (392) T ss_pred eeeeEEEeehhhHHHHhhh---HHHHHHHHHHHHHHHHHHHHHHHHhhccccc---------c---------ccC----- Confidence 8899999999998655443 4578888888888888888888777765311 0 000 Q ss_pred ccccCCHHHHHHHHHHHHHHHHHhcCCeeccccccceEEcCHHHHHhhccC-CCcCccHHHH-HHHhCC----ccEEEE- Q lcl|Aclame:pro 234 SGGANAFQGIVGDLRLMLITLRVQSEDNIDPEDVDITLVLPMNKVDMLSVV-TDLGISVRDW-LKQTYP----RVRVMS- 306 (388) Q Consensus 234 ~Wa~kT~~eI~~DI~~~~~~l~~~s~g~v~~~~~p~tL~Lp~~~~~~Ls~~-~~~~~Tvl~~-lk~n~p----nl~i~~- 306 (388) ..+ ++||..+++...... +.. .-.++|.++.+..|.+- +..|.-++.- +....+ +..++. T Consensus 240 ---~~~----~d~i~~~~~~~l~~~---~~~---~a~~vm~~~~~~~L~~lkd~~G~~l~~~~~~~~~~~tllG~~~v~~ 306 (392) T protein:vir:10 240 ---IKS----LDDIKDVLNVKLDPA---ISP---NAILLTNQDGFNYLDKLKDKDGKYILQSDPTQKNKKLFAGTNPVVV 306 (392) T ss_pred ---ccC----HHHHHHHHHHhhhhh---hcc---CCEEEEcHHHHHHHHHhhccCCCeEeecCccCCccccccCcccEEE Confidence 112 234444443221111 111 13589999999888542 3333322211 111111 111111 Q ss_pred cc-ccccccCCCCccEEEEEEcccccccccccCCCcceEeecchhhhccCceeccCceEEecccceeeeeeeccccceee Q lcl|Aclame:pro 307 AP-ELQGGNPDDGKDIAYMFLDSVDTAVDGSTDGGDTWAQLVQSKFVTLGVEKRVKNYVEAYSNATAGVMLKRPWAVVRL 385 (388) Q Consensus 307 ~p-el~~a~gtg~~~~~~~~~~~~d~~~~~~~~~~~t~~~~~p~~~r~~~v~~~~~~~~~~~~~~t~G~ii~rP~ai~~~ 385 (388) .. .+-...+.+.++..++|.+=.......... +-++.. -+.. ..-...-.....+..|.+| .+++|-+|+.+ T Consensus 307 ~~~~~~~~~~~~~~~~~~~~gdfs~~~~i~~~~-~~~~~~-~~~~----~~~f~~~~~~~r~~~r~d~-~v~~~~a~~~l 379 (392) T protein:vir:10 307 VSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKRE-DMELAS-TDVG----GKAFTRNTLDLRAIQRDDV-QMWDNEAAVYG 379 (392) T ss_pred ecccccCCCcccCCceEEEEEehhceEEEEeec-ceEEEE-eccc----cchhhcCceEEEEEEeecc-EEecccceEEE Confidence 10 111111334445444543210000000000 001110 0100 0000001122445556655 55669999998 Q ss_pred ccC Q lcl|Aclame:pro 386 IGL 388 (388) Q Consensus 386 ~GI 388 (388) ..= T Consensus 380 ~~~ 382 (392) T protein:vir:10 380 EID 382 (392) T ss_pred Eec Confidence 765 No 131 >protein:vir:107593 Length: 392 # NCBI annotation: major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1491 # MgeName: Gamma # Cross-refs: genbank:acc:YP_338188;genbank:gi:77020144;genbank:GeneID:3703724 Probab=29.05 E-value=1.7 Score=19.33 Aligned_cols=333 Identities=10% Similarity=-0.029 Sum_probs=129.9 Q ss_pred CCCcceeeeecCccccchhhh-----hhcccccccccCCHHHHhhccee-cccchhhcchhhhhhhhhhhhccCcccccc Q lcl|Aclame:pro 1 MKQLSKVHQSLAGRSVRAFDM-----ANGKADYRLTDMAVRELKKFGLV-FDHATVKRQIELLHEGGVATQAFDSAYVAP 74 (388) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~l~~~g~~-~~~~~~~~~~~~~~~~~~~~~amDaa~~~~ 74 (388) +.++...-..+- |.-+..+. .+......-......+.++.... +.+.. ...+... .........+. .. T Consensus 35 ~~e~~~l~~~i~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~--~~~~~~~-~~~~~~~~~~~--~~ 108 (392) T protein:vir:10 35 MEEVRSLQKKID-LQRSLDEAETEERNNGREVETRNVDGEMEYRDVFMKALRNKP--LNAEERE-FLEDDLEQRAM--SG 108 (392) T ss_pred HHHHHHHHHHHH-HHHHHHHHHHHHhhccccccccCccchHHHHHHHHHHHhccc--ccHHHHH-HHhhhhhhhhc--cc Confidence 000000000000 00000000 00000000000011111110000 00000 0000000 00000001111 11 Q ss_pred cccccchHHHHHHHhhcceeeeecccchhhhhhcccccCCCCceeeEEEeeeccccceEecccccCCceee-eeeeeeee Q lcl|Aclame:pro 75 TTQASIPTPIQFLQQWLPGFVKVLTSARKIDEILGVKTVGSWEDQEIVQGIVEPAGTAMEYGDLTNIPLSS-WNVNFERR 153 (388) Q Consensus 75 ~t~~~~g~l~~~l~~idp~v~e~l~~~~~~~~i~~v~t~g~w~~~t~~~~v~e~~G~a~~ygd~~diP~~~-~n~~~~~~ 153 (388) .|.++.|+++- +.+.++|++.+......+.+.++.....-. ....+......+.+...+....+|-.+ ...+.... T Consensus 109 ~t~~~gg~~vP--~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~-~~~~~~~~~~~~~a~~v~E~~~~~~~~~~~~~~v~l 185 (392) T protein:vir:10 109 LTGEDGGLVIP--QDIQTQINELARSFDALEQYVTVEPVRTRS-GSRVLEKNSDMIPFAEITEMGEIPETDNPKFSNVQY 185 (392) T ss_pred cccCCCceecc--hhHHHHHHHHHHhhhhhhhhceeeeccCCc-eeEEEEeecCCccceeecccccccccccccceeEEe Confidence 13334444332 123356677776666666665543322111 122333333444566777777887554 57788888 Q ss_pred eEEEEEEEEeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEEeecCccccceEEEeecCCCccccccccCCccc Q lcl|Aclame:pro 154 TIVRGEMGIQVGLLEEGRASAMRINSAEVKRQGAAVQLEIMRNAIGFYGWEGKNGNRTFGFLNDPSLLPAIASTTPGGWV 233 (388) Q Consensus 154 ~v~~~~~~~~y~~~El~~A~~~g~~l~~~K~~aAr~a~~~~~n~i~~~G~a~~~~~g~~GllN~P~l~a~~~~~~~~~~t 233 (388) ..+.+...+.++.+=|+.+ ..+|.+.-....+.++...+|.-.+.|+... . +.+ T Consensus 186 ~~~k~~~~~~iS~ell~ds---~~~l~~~i~~~l~~~i~~~~d~~~~~g~g~~---------~---------~~~----- 239 (392) T protein:vir:10 186 AVKDRAGILPLSRSLLQDS---DQNILKYVTKWLGKKSKVTRNVLILGVIEKL---------T---------KQA----- 239 (392) T ss_pred eeeeEEEeehhhHHHHhhh---HHHHHHHHHHHHHHHHHHHHHHHHhhccccc---------c---------ccC----- Confidence 8899999999998655443 4578888888888888888888777765311 0 000 Q ss_pred ccccCCHHHHHHHHHHHHHHHHHhcCCeeccccccceEEcCHHHHHhhccC-CCcCccHHHH-HHHhCC----ccEEEE- Q lcl|Aclame:pro 234 SGGANAFQGIVGDLRLMLITLRVQSEDNIDPEDVDITLVLPMNKVDMLSVV-TDLGISVRDW-LKQTYP----RVRVMS- 306 (388) Q Consensus 234 ~Wa~kT~~eI~~DI~~~~~~l~~~s~g~v~~~~~p~tL~Lp~~~~~~Ls~~-~~~~~Tvl~~-lk~n~p----nl~i~~- 306 (388) ..+ ++||..+++...... +.. .-.++|.++.+..|.+- +..|.-++.- +....+ +..++. T Consensus 240 ---~~~----~d~i~~~~~~~l~~~---~~~---~a~~vm~~~~~~~L~~lkd~~G~~l~~~~~~~~~~~tllG~~~v~~ 306 (392) T protein:vir:10 240 ---IKS----LDDIKDVLNVKLDPA---ISP---NAILLTNQDGFNYLDKLKDKDGKYILQSDPTQKNKKLFAGTNPVVV 306 (392) T ss_pred ---ccC----HHHHHHHHHHhhhhh---hcc---CCEEEEcHHHHHHHHHhhccCCCeEeecCccCCccccccCcccEEE Confidence 112 234444443221111 111 13589999999888542 3333322211 111111 111111 Q ss_pred cc-ccccccCCCCccEEEEEEcccccccccccCCCcceEeecchhhhccCceeccCceEEecccceeeeeeeccccceee Q lcl|Aclame:pro 307 AP-ELQGGNPDDGKDIAYMFLDSVDTAVDGSTDGGDTWAQLVQSKFVTLGVEKRVKNYVEAYSNATAGVMLKRPWAVVRL 385 (388) Q Consensus 307 ~p-el~~a~gtg~~~~~~~~~~~~d~~~~~~~~~~~t~~~~~p~~~r~~~v~~~~~~~~~~~~~~t~G~ii~rP~ai~~~ 385 (388) .. .+-...+.+.++..++|.+=.......... +-++.. -+.. ..-...-.....+..|.+| .+++|-+|+.+ T Consensus 307 ~~~~~~~~~~~~~~~~~~~~gdfs~~~~i~~~~-~~~~~~-~~~~----~~~f~~~~~~~r~~~r~d~-~v~~~~a~~~l 379 (392) T protein:vir:10 307 VSNRFLKSKGTTAKKAPLIIGDLKEAIVLFKRE-DMELAS-TDVG----GKAFTRNTLDLRAIQRDDV-QMWDNEAAVYG 379 (392) T ss_pred ecccccCCCcccCCceEEEEEehhceEEEEeec-ceEEEE-eccc----cchhhcCceEEEEEEeecc-EEecccceEEE Confidence 10 111111334445444543210000000000 001110 0100 0000001122445556655 55669999998 Q ss_pred ccC Q lcl|Aclame:pro 386 IGL 388 (388) Q Consensus 386 ~GI 388 (388) ..= T Consensus 380 ~~~ 382 (392) T protein:vir:10 380 EID 382 (392) T ss_pred Eec Confidence 765 No 132 >protein:vir:93881 Length: 387 # NCBI annotation: ORF011 # Family: family:all:658 # MgeID: mge:1485 # MgeName: 3A # Cross-refs: genbank:acc:YP_239938;genbank:gi:66395599;genbank:GeneID:5130947 Probab=27.72 E-value=1.8 Score=19.16 Aligned_cols=314 Identities=8% Similarity=-0.026 Sum_probs=127.8 Q ss_pred CC-------Cc---ceeeee--------cCccccchhhhhhcccccccccCCHHH-Hhhcceecccchhhcchhhhhhhh Q lcl|Aclame:pro 1 MK-------QL---SKVHQS--------LAGRSVRAFDMANGKADYRLTDMAVRE-LKKFGLVFDHATVKRQIELLHEGG 61 (388) Q Consensus 1 ~~-------~~---~~~~~~--------~~~~~~~~~~~~~~~~~~~~~~~~~~~-l~~~g~~~~~~~~~~~~~~~~~~~ 61 (388) ++ ++ ...+-. +.....++..+..... +.-+..+ ++. ++.-............ T Consensus 43 ~~~~~~l~~~~~~l~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~r~-~~~~~~~~~~~~~~~~---- 113 (387) T protein:vir:93 43 ETEKAGLQQRFNIVERQVKDIEEKEKAKVKDTGEAYQSLNDHEK----MVKAKAEFYRH-AILPNEFEKPSMEAQR---- 113 (387) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccCCCcchhhH----HHHHHHHHHHH-HhhhhhhhhhhhhhHH---- Confidence 00 00 000000 0000000000000000 0000111 111 1000000000000000 Q ss_pred hhhhccCcccccccccccch--HHHHHHHhhcceeeeecccchhhhhhcccccCCCCceeeEEEeee-ccccceEecccc Q lcl|Aclame:pro 62 VATQAFDSAYVAPTTQASIP--TPIQFLQQWLPGFVKVLTSARKIDEILGVKTVGSWEDQEIVQGIV-EPAGTAMEYGDL 138 (388) Q Consensus 62 ~~~~amDaa~~~~~t~~~~g--~l~~~l~~idp~v~e~l~~~~~~~~i~~v~t~g~w~~~t~~~~v~-e~~G~a~~ygd~ 138 (388) -..+| ...+.++.| +|..+.. +|++.+......+++..+.+.++.. ++.. ...+.+...+.. T Consensus 114 -~~~al-----~~~t~s~gG~~IP~~~~~----~Ii~~~~~~~~l~~~~~v~~~~~~~-----~p~~~~~~~~a~~v~E~ 178 (387) T protein:vir:93 114 -LLHAL-----PTGNDSGGDKLLPKTLSK----EIVSEPFAKNQLREKARLTNIKGLE-----IPRVSYTLDDDDFITDV 178 (387) T ss_pred -HHHhh-----ccCcCCCCceeechhHHH----HHHHHHHhhchhhhheeeeecCCce-----EEEEeecCCccccccCc Confidence 00111 111223334 4554443 5555555444456666665554422 2322 234456677888 Q ss_pred cCCceeeeeeeeeeeeEEEEEEEEeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEEeecCccccceEEEeecC Q lcl|Aclame:pro 139 TNIPLSSWNVNFERRTIVRGEMGIQVGLLEEGRASAMRINSAEVKRQGAAVQLEIMRNAIGFYGWEGKNGNRTFGFLNDP 218 (388) Q Consensus 139 ~diP~~~~n~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~K~~aAr~a~~~~~n~i~~~G~a~~~~~g~~GllN~P 218 (388) ...|-.+...+...-..+.+...+.++.+=|+ -...++.+--....+.++...++..+|.+..+. ..-.|.++++ T Consensus 179 ~~~~~~~~~f~~v~~~~~k~~~~~~iS~ell~---Ds~~~l~~~i~~~la~~~~~~e~~~~~~~g~g~--g~p~g~l~~~ 253 (387) T protein:vir:93 179 ETAKELKLKGDTVKFTTNKFKVFAAISDTVIH---GSDVDLVNWVENALQSGLAAKERKDALAVSPKS--GLDHMSFYNG 253 (387) T ss_pred ccccccccccceeeeeheeeeeechhhHHHHh---hhHHHHHHHHHHHHHHHHHHHHHHhHhhcCCCc--cccceeeecc Confidence 88888888888888888999888888864332 234567777777777777777777666433321 2345777776 Q ss_pred CCccccccccCCcccccccCCHHHHHHHHHHHHHHHHHhcCCeeccccccceEEcCHHH-HHhhccCCCcCccHHHHHHH Q lcl|Aclame:pro 219 SLLPAIASTTPGGWVSGGANAFQGIVGDLRLMLITLRVQSEDNIDPEDVDITLVLPMNK-VDMLSVVTDLGISVRDWLKQ 297 (388) Q Consensus 219 ~l~a~~~~~~~~~~t~Wa~kT~~eI~~DI~~~~~~l~~~s~g~v~~~~~p~tL~Lp~~~-~~~Ls~~~~~~~Tvl~~lk~ 297 (388) .+.+. +....++||.+++..+-..-. . . -.++|.+.. ..++....+.|-.++ .. T Consensus 254 ~~~~v---------------~~~~~~d~i~~~~~~l~~~~~----~-~--a~~~mn~~t~~~~~~~~~d~~~~~~---~~ 308 (387) T protein:vir:93 254 SVKEV---------------EGADMYDAIINALADLHEDYR----D-N--ATIYMRYADYVKIISVLSNGTTNFF---DT 308 (387) T ss_pred ccccc---------------cccchHHHHHHHHhccChhhh----c-C--CEEEEechHHHHHHHHHhcCCCccc---cc Confidence 54221 111235677777776654321 1 1 135565443 444444333333222 11 Q ss_pred hCCccEEEEccccccccCCCCccEEEEEEcccccccccccCCCcceEeecchhhhccCceeccCceEEecccceeeeeee Q lcl|Aclame:pro 298 TYPRVRVMSAPELQGGNPDDGKDIAYMFLDSVDTAVDGSTDGGDTWAQLVQSKFVTLGVEKRVKNYVEAYSNATAGVMLK 377 (388) Q Consensus 298 n~pnl~i~~~pel~~a~gtg~~~~~~~~~~~~d~~~~~~~~~~~t~~~~~p~~~r~~~v~~~~~~~~~~~~~~t~G~ii~ 377 (388) .|+ +|-..|=.... +.... +| -+....... .+ ...+.+ . + +.....+.+-...|.+|.+ + T Consensus 309 -~~~-~llG~PV~~~~---~~~~~--~~-GDf~~~~~~-~~-~~~~~~-~----~----~~~~~~~~~~~~~r~d~~v-~ 368 (387) T protein:vir:93 309 -PAE-KVFGKPVVFTD---AAVKP--IV-GDFNYFGIN-YD-GTTYDT-D----K----DVKKGEYLFVLTAWYDQQR-T 368 (387) T ss_pred -CCc-cccccceEEec---CCCce--ee-eehhhhhee-hh-hheeee-c----c----cccCCceeEEEEeeeCcee-e Confidence 121 23333322211 11111 11 111100000 00 000000 0 0 0111122333445666665 4 Q ss_pred ccccceeeccC Q lcl|Aclame:pro 378 RPWAVVRLIGL 388 (388) Q Consensus 378 rP~ai~~~~GI 388 (388) +|-||+.+.-= T Consensus 369 ~~eA~~~l~~k 379 (387) T protein:vir:93 369 LDSAFRIAKAK 379 (387) T ss_pred chhheEEEEee Confidence 59999865432 No 133 >protein:vir:95318 Length: 328 # NCBI annotation: hypothetical protein # Family: family:all:1903 # MgeID: mge:1564 # MgeName: phiV10 # Cross-refs: genbank:acc:YP_512264;genbank:gi:89152431;genbank:GeneID:3952987 Probab=27.64 E-value=1.8 Score=19.15 Aligned_cols=223 Identities=12% Similarity=0.012 Sum_probs=105.0 Q ss_pred cCcccccccccccchHHHHHHHhhc-----ceeeeecccchhhhhhccccc--CCCCceeeEEEeeeccccceEeccccc Q lcl|Aclame:pro 67 FDSAYVAPTTQASIPTPIQFLQQWL-----PGFVKVLTSARKIDEILGVKT--VGSWEDQEIVQGIVEPAGTAMEYGDLT 139 (388) Q Consensus 67 mDaa~~~~~t~~~~g~l~~~l~~id-----p~v~e~l~~~~~~~~i~~v~t--~g~w~~~t~~~~v~e~~G~a~~ygd~~ 139 (388) |. ...+. +-.|.......+ +.|+|.+...-...+.+|-.. .|.|. .|.+....-.+...-=.. T Consensus 1 m~-----~~~~~-~~TL~e~Akr~~~d~~~~~VIE~l~~~n~IL~~lpf~e~n~gt~~----~~~v~~~LP~~~fR~lN~ 70 (328) T protein:vir:95 1 MA-----VKGLT-ALTLADWGKRVDPNGKVDKIIELLGQTNPILQDMPFVEGNLPTGH----RTTIRSGLPSATWRLLNY 70 (328) T ss_pred CC-----ccccc-cccHHHHHhhhCcchhHHHHHHHHhccchhHhhcceeecccCCcc----eeeEeeccCCceeeecCC Confidence 11 11111 124445444444 478888877666666666653 35554 344443333333333334 Q ss_pred CCceeeeeeeeeeeeEEEEEEEEeecHHHHHHHHHhC--CChHHHHHHHHHHHHHHhhceEEEEeecCccccceEEE--- Q lcl|Aclame:pro 140 NIPLSSWNVNFERRTIVRGEMGIQVGLLEEGRASAMR--INSAEVKRQGAAVQLEIMRNAIGFYGWEGKNGNRTFGF--- 214 (388) Q Consensus 140 diP~~~~n~~~~~~~v~~~~~~~~y~~~El~~A~~~g--~~l~~~K~~aAr~a~~~~~n~i~~~G~a~~~~~g~~Gl--- 214 (388) .+|-......+.+..+..++.-+++.....+ ..| -.+-+.+..+-.+++.+...+..||||...+..++-|| T Consensus 71 g~~~s~~tt~q~t~~l~ilgg~~eVDr~la~---~~Gn~~~~ra~q~~~~~ka~~~~~~~~~iyGdsa~~p~~F~GL~~R 147 (328) T protein:vir:95 71 GVQPSKSTTVQVTDSVGMLETYAEVDKSLAD---LNGNTAEFRLSEDRAFIEAMNQQMAQTLFYGDSSVNPQQFMGLSSR 147 (328) T ss_pred ccCcccceeEEEEEEEEEEecceeechHHHh---hcCCHHHHHHHHHHHHHHHHHHHHHHHHhcCCccCChhhhcchhhh Confidence 5677777788888888888887777774332 233 13345555666777778888888999876555677776 Q ss_pred eecCCCccc--cccccCCcccccccCCHHHHHHHHHHHHHHHHHhcCCeeccccccceEEcCHHHHHhhccCCCcCccHH Q lcl|Aclame:pro 215 LNDPSLLPA--IASTTPGGWVSGGANAFQGIVGDLRLMLITLRVQSEDNIDPEDVDITLVLPMNKVDMLSVVTDLGISVR 292 (388) Q Consensus 215 lN~P~l~a~--~~~~~~~~~t~Wa~kT~~eI~~DI~~~~~~l~~~s~g~v~~~~~p~tL~Lp~~~~~~Ls~~~~~~~Tvl 292 (388) +|+++..-+ +...++++++. ..||.-.-| +. -.+.+-|...-. | T Consensus 148 ~~~~s~~~a~qiidaGgtg~~~-----------------TSi~~v~~g---~~--~~~giyPkG~~~--------G---- 193 (328) T protein:vir:95 148 YSSLSAGNAQNIIDAGGTGTDN-----------------TSIWLVVWG---EN--TVHGIFPKGKKA--------G---- 193 (328) T ss_pred cCccccccccceeecccCCCCc-----------------eEEEEEEEc---CC--eEEEeccccccc--------C---- Confidence 444321100 00011111110 001111000 00 001111111000 1 Q ss_pred HHHHHhCCccEEEEccc--cccccCCCCccEEEEEEcccccccccccCCCcceEeecchhhhccCceeccCceEEecccc Q lcl|Aclame:pro 293 DWLKQTYPRVRVMSAPE--LQGGNPDDGKDIAYMFLDSVDTAVDGSTDGGDTWAQLVQSKFVTLGVEKRVKNYVEAYSNA 370 (388) Q Consensus 293 ~~lk~n~pnl~i~~~pe--l~~a~gtg~~~~~~~~~~~~d~~~~~~~~~~~t~~~~~p~~~r~~~v~~~~~~~~~~~~~~ 370 (388) |.++..-| +.++ .+.....|.. .|+=+ T Consensus 194 ---------l~~~d~g~~~~~~~----~g~~y~~y~~----------------------------------~~~w~---- 222 (328) T protein:vir:95 194 ---------IQMEDKGQVTLEDA----NGGKYEGYRT----------------------------------HYKWD---- 222 (328) T ss_pred ---------ceeeecCceeeecC----CCCeeeEEEE----------------------------------EEEee---- Confidence 12221110 0011 0111111111 12222 Q ss_pred eeeeeeeccccceeeccC Q lcl|Aclame:pro 371 TAGVMLKRPWAVVRLIGL 388 (388) Q Consensus 371 t~G~ii~rP~ai~~~~GI 388 (388) -|+.|+-|-+++|.-.| T Consensus 223 -~Gl~i~d~r~vvrI~NI 239 (328) T protein:vir:95 223 -NGLALRDWRYVVRIANI 239 (328) T ss_pred -eeeEEcCcccEEEEecC Confidence 29999999999999999 No 134 >protein:vir:94622 Length: 341 # NCBI annotation: PfWMP4_37 # Family: family:all:2203 # MgeID: mge:1525 # MgeName: Pf-WMP4 # Cross-refs: genbank:acc:YP_762667;genbank:gi:115304375;genbank:GeneID:5142322 Probab=27.42 E-value=1.8 Score=19.12 Aligned_cols=294 Identities=12% Similarity=0.018 Sum_probs=104.9 Q ss_pred hccCcccccc-cccccchHHHHH-HHhhcceeeeecccchhhhhhcccccCCCCc-eeeEEEeeeccccceEecccccCC Q lcl|Aclame:pro 65 QAFDSAYVAP-TTQASIPTPIQF-LQQWLPGFVKVLTSARKIDEILGVKTVGSWE-DQEIVQGIVEPAGTAMEYGDLTNI 141 (388) Q Consensus 65 ~amDaa~~~~-~t~~~~g~l~~~-l~~idp~v~e~l~~~~~~~~i~~v~t~g~w~-~~t~~~~v~e~~G~a~~ygd~~di 141 (388) |+|==.++++ .++++.. +| -+-|..++.+.+.....+.+++. +.++... -+++.++..- ...+.-|.-...+ T Consensus 1 ~~~~~~~~~~~~~t~~v~---~fipei~s~~i~~~l~~~~v~~~~~~-d~~~~~~~Gdtv~ip~~g-~~~~~d~~~~~~i 75 (341) T protein:vir:94 1 MALGNTITGPSINTQRGQ---QFIPEQWLSEVQMFRKAKMLDTSVVK-TWGAQVKKGDTFHVPRIS-ELGVEDKATDVPV 75 (341) T ss_pred CcchhhhccccccchhHH---HHHHHHHHHHHHHHHHhhcchhhccc-cccccccCCceEEEeccC-cceeeeecCCCcc Confidence 2222222222 1222222 11 13444444555555555556543 2223322 3577787653 2334444333455 Q ss_pred ceeeeeeeeeeeeEEEE-EEEEeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEEeecCccccceEEEeecCCC Q lcl|Aclame:pro 142 PLSSWNVNFERRTIVRG-EMGIQVGLLEEGRASAMRINSAEVKRQGAAVQLEIMRNAIGFYGWEGKNGNRTFGFLNDPSL 220 (388) Q Consensus 142 P~~~~n~~~~~~~v~~~-~~~~~y~~~El~~A~~~g~~l~~~K~~aAr~a~~~~~n~i~~~G~a~~~~~g~~GllN~P~l 220 (388) +.-+.+..+...++-.. ...+.+...|.. +...++-.+-.+.+..++.+..++..+--.+... + ..-++. T Consensus 76 ~~~~~~~~~~~itiD~~~~~~~~i~d~d~~---~~~~d~~~~~~~~~~~aLA~~~D~~i~~~~a~~~-----~-~~~~~~ 146 (341) T protein:vir:94 76 GVQPVNDTDFVITVDTDRTTAVALDDLLEI---QASYDLRAPYLEAMGYALAKDMTGSILGLRAAVQ-----N-TASQNV 146 (341) T ss_pred ccccccCceEEEEEeeeeecceeechHHHH---hhccchHHHHHHHHHHHHHHHHHHHHHHHhhhcc-----c-cccCcc Confidence 55454444444444222 345556655442 2355777777777777777776654331111000 0 000110 Q ss_pred ccccccccCCcccccccCCHHH-HHHHHHHHHHHHHHhcCCeeccccccceEEcCHHHHHhhccCCCc------CccHHH Q lcl|Aclame:pro 221 LPAIASTTPGGWVSGGANAFQG-IVGDLRLMLITLRVQSEDNIDPEDVDITLVLPMNKVDMLSVVTDL------GISVRD 293 (388) Q Consensus 221 ~a~~~~~~~~~~t~Wa~kT~~e-I~~DI~~~~~~l~~~s~g~v~~~~~p~tL~Lp~~~~~~Ls~~~~~------~~Tvl~ 293 (388) . ..++. -.+.+++. .++.|..+...+-... + |. ....++++|..+..|.+-+.+ +.. T Consensus 147 ~--~~~~~------~~t~~~~~~~~~~i~~a~~~Lde~~---V-P~-~gR~lvv~P~~~~~Ll~~~~~~~~~~~g~~--- 210 (341) T protein:vir:94 147 F--SSSNG------AITGNGQAFSFAVFLAARRLLLEAD---V-PE-EKIVLLISPGQESALFTIPQFISKDFINNA--- 210 (341) T ss_pred c--cCccc------cccCchhhhhHHHHHHHHHHHhhcC---C-Cc-cCCEEEeCHHHHHHHhhchhhhhhhccccc--- Confidence 0 00000 01111222 2344444444443332 2 32 234789999999988542211 111 Q ss_pred HHHH----hCCccEEEEccccccccCCCCccE--EEEEEcccccc--------cccccCCCcce----------Eeecch Q lcl|Aclame:pro 294 WLKQ----TYPRVRVMSAPELQGGNPDDGKDI--AYMFLDSVDTA--------VDGSTDGGDTW----------AQLVQS 349 (388) Q Consensus 294 ~lk~----n~pnl~i~~~pel~~a~gtg~~~~--~~~~~~~~d~~--------~~~~~~~~~t~----------~~~~p~ 349 (388) -+++ +.-.++|...+.+-..++++.... .........+. .....+....+ ...-|+ T Consensus 211 ~l~~G~ig~i~G~~V~~Sn~lp~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~gl~~~~~av~~~k~~~~~ 290 (341) T protein:vir:94 211 PIAQGQIGSLMGVRVIRTSLIGNNSATGWRNGAPTIAPAEATPGFTGSRYLPKQDSFTSLPATFTGNSRPVHTAVMCHMD 290 (341) T ss_pred hhheeeeeeEeceEEEEeccccccccccccccccceecccccccccccccccccccccccEEEEEEecccccceeeecch Confidence 1221 111344554444432211111000 00000000000 00000000000 000111 Q ss_pred hhhccCce-eccCceEEecccce-------eeeeeeccccceeeccC Q lcl|Aclame:pro 350 KFVTLGVE-KRVKNYVEAYSNAT-------AGVMLKRPWAVVRLIGL 388 (388) Q Consensus 350 ~~r~~~v~-~~~~~~~~~~~~~t-------~G~ii~rP~ai~~~~GI 388 (388) -++...++ .+..+-....+.+. .|+=+.||.+++.+.=- T Consensus 291 ~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~G~~~lrp~~~v~~~~~ 337 (341) T protein:vir:94 291 WAAAVVSKAPRVTQSFENREQVWLMVGRQAYGARLYRPLHAVNIHTT 337 (341) T ss_pred hhhccccccccccccchhhhhhhhhhhhhhhcccccCcceeEEEecC Confidence 12222111 11111111111211 24445555553322222 No 135 >protein:vir:101607 Length: 379 # NCBI annotation: major capsid protein precursor # Family: family:all:585 # MgeID: mge:1646 # MgeName: 11b # Cross-refs: genbank:acc:YP_112497;genbank:gi:53793597;uniprot:Q5ZGF6;genbank:GeneID:3101715 Probab=20.44 E-value=2.8 Score=18.16 Aligned_cols=313 Identities=9% Similarity=-0.025 Sum_probs=124.7 Q ss_pred CCCcceeeeecCccccchhhhhhcc--cccccccCCHHHHhhcceecccch---hhcc----hhhhhhhhhhhhccCccc Q lcl|Aclame:pro 1 MKQLSKVHQSLAGRSVRAFDMANGK--ADYRLTDMAVRELKKFGLVFDHAT---VKRQ----IELLHEGGVATQAFDSAY 71 (388) Q Consensus 1 ~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~l~~~g~~~~~~~---~~~~----~~~~~~~~~~~~amDaa~ 71 (388) |+..-+. .+. ++.... ..-+.+.+ ..++++.+-.-.... .... .....+. .....++++- T Consensus 39 ~~~~~~~-------~~~--e~~~~~~~l~~~~~~~-e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~ 107 (379) T protein:vir:10 39 MTSEKDL-------AVN--ELKSDMAALQAHADKL-DVKLKEKAKSEDKSDSLVKSITENFNDIKEVRN-GKSIQVKAVG 107 (379) T ss_pred hhHHHHH-------HHH--HHHHHHHHHHHHHHHH-HHHHHhcccccccchhHHHHHHHHHHhHHHHHh-hhhhhhhhhc Confidence 1110000 000 000000 00000000 000000000000000 0000 0000000 0001122211 Q ss_pred ccccccccc--hHHHHHHHhhcceeeeecccchhhhhhcccccCCCCceeeEEEeeeccccce--EecccccCCceeeee Q lcl|Aclame:pro 72 VAPTTQASI--PTPIQFLQQWLPGFVKVLTSARKIDEILGVKTVGSWEDQEIVQGIVEPAGTA--MEYGDLTNIPLSSWN 147 (388) Q Consensus 72 ~~~~t~~~~--g~l~~~l~~idp~v~e~l~~~~~~~~i~~v~t~g~w~~~t~~~~v~e~~G~a--~~ygd~~diP~~~~n 147 (388) +..+..+. .+|.. +.+.|++.+-.....++++.+.+... .++.|++....+.+ ...+....+|..+.. T Consensus 108 -~~~~~~~~~~~ip~~----~~~~ii~~~~~~~~i~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~v~Eg~~~~~~~~~ 179 (379) T protein:vir:10 108 -DMTLPVNLTGAQPKD----YNFDVVLNPSQMLNVSDIVGAVSISG---GTYTFVRENGAGEGAIGAQVEGATKGQKDYD 179 (379) T ss_pred -ccccCCCCccccchh----hhhHHHHhHHhhhhHHhhceeeeccC---CceEEEEeecCCCcccccccCCccccccccc Confidence 11122222 23333 33466666666666666665544322 34556655443333 345778899999999 Q ss_pred eeeeeeeEEEEEEEEeecHHHHHHHHHhCCChHHHHHHHHHHHHHHhhceEEEEeecCccccceEEEeecCCCccccccc Q lcl|Aclame:pro 148 VNFERRTIVRGEMGIQVGLLEEGRASAMRINSAEVKRQGAAVQLEIMRNAIGFYGWEGKNGNRTFGFLNDPSLLPAIAST 227 (388) Q Consensus 148 ~~~~~~~v~~~~~~~~y~~~El~~A~~~g~~l~~~K~~aAr~a~~~~~n~i~~~G~a~~~~~g~~GllN~P~l~a~~~~~ 227 (388) ........+.+...+.++.+=|+-+. +|.+--....++++...+|.-.+.|... .+..+. . T Consensus 180 f~~i~~~~~k~~~~~~iS~ell~D~~----~l~~~i~~~la~~~~~~~~~~~~~g~~~---~~~~~~------------~ 240 (379) T protein:vir:10 180 ISMIDVNTDFIAGFTRYSKKMANNLP----FLTSFIPNALRRDYAKAENAAFNAVLAA---NATAST------------E 240 (379) T ss_pred eeeeEeeeeeEEeeehhhHHHHhhHH----HHHHHHHHHHHHHHHHHHHHHHhccccc---cccccc------------c Confidence 99999999999999888875444332 3556665666666666666544433221 111110 0 Q ss_pred cCCcccccccCCHHHHHHHHHHHHHHHHHhcCCeeccccccceEEcCHHHHHhhccC-CCcCccHHH--HHHHh-----C Q lcl|Aclame:pro 228 TPGGWVSGGANAFQGIVGDLRLMLITLRVQSEDNIDPEDVDITLVLPMNKVDMLSVV-TDLGISVRD--WLKQT-----Y 299 (388) Q Consensus 228 ~~~~~t~Wa~kT~~eI~~DI~~~~~~l~~~s~g~v~~~~~p~tL~Lp~~~~~~Ls~~-~~~~~Tvl~--~lk~n-----~ 299 (388) +.. ...+ ++||.+++..+... + ..+..++|.|..+..|.+. +..|.-++. ...++ . T Consensus 241 ~~~-----~~~~----~d~i~~~~~~~~~~-~------~~~~~~vmn~~~~~~l~~lkd~~G~~l~~~~~~~~~~~~~~l 304 (379) T protein:vir:10 241 IIT-----NKNK----VEMLINEIAKQENL-D------FPVTAIVLRPTDYYDILVTQKSVGAGYGLPGVVTQDNGVLRI 304 (379) T ss_pred ccc-----Cccc----HHHHHHHHHhhhhc-c------CCCCEEEEcHHHHHHHHHhhccCCceeccCCccCCCCCccee Confidence 000 0111 45666666655432 1 2345699999988888542 333433321 01111 1 Q ss_pred CccEEEEccccccccCCCCccEEEEEEcccccccccccCCCcceEeecchhhh----ccCc-eeccCceEEecccceeee Q lcl|Aclame:pro 300 PRVRVMSAPELQGGNPDDGKDIAYMFLDSVDTAVDGSTDGGDTWAQLVQSKFV----TLGV-EKRVKNYVEAYSNATAGV 374 (388) Q Consensus 300 pnl~i~~~pel~~a~gtg~~~~~~~~~~~~d~~~~~~~~~~~t~~~~~p~~~r----~~~v-~~~~~~~~~~~~~~t~G~ 374 (388) .++.++..+.+. + ++ + +| -+.. + +...+-+..+ .... ....-...+.+..|. |+ T Consensus 305 ~G~pvv~s~~~~-a-----g~-~-~~-gdf~----------~-~~~~~~~~~~i~~~~~~~~~f~~~~~~~r~~~R~-~~ 363 (379) T protein:vir:10 305 NGIPLFRATWLA-A-----NK-Y-YV-GDWT----------R-VTKVTTEGLSLEFSEVEGTNFVKNNITARIEAQV-AL 363 (379) T ss_pred cceeeEecCCCC-C-----Cc-e-EE-eecc----------c-EEEEEEeceEEEEeecccccccCCcEEEEEEEEe-cc Confidence 122233322221 1 12 1 11 1110 0 0011100000 0000 011111222333344 56 Q ss_pred eeecccccee--eccC Q lcl|Aclame:pro 375 MLKRPWAVVR--LIGL 388 (388) Q Consensus 375 ii~rP~ai~~--~~GI 388 (388) .|++|-||++ +.+| T Consensus 364 ~v~~p~a~v~~~~~~~ 379 (379) T protein:vir:10 364 AVEQPAALIFGDFTAV 379 (379) T ss_pred EEecCccEEEEEecCC Confidence 6677999998 7788 No 136 >protein:vir:107388 Length: 331 # NCBI annotation: Bbp17 # Family: family:all:1903 # MgeID: mge:1537 # MgeName: BPP-1 # Cross-refs: genbank:acc:NP_958686;genbank:gi:41179378;genbank:GeneID:2717182 Probab=20.06 E-value=2.8 Score=18.10 Aligned_cols=222 Identities=14% Similarity=0.067 Sum_probs=100.7 Q ss_pred cCcccccccccccchHHHHHHHh------hcceeeeecccchhhhhhcccc--cCCCCceeeEEEeeeccccceEecccc Q lcl|Aclame:pro 67 FDSAYVAPTTQASIPTPIQFLQQ------WLPGFVKVLTSARKIDEILGVK--TVGSWEDQEIVQGIVEPAGTAMEYGDL 138 (388) Q Consensus 67 mDaa~~~~~t~~~~g~l~~~l~~------idp~v~e~l~~~~~~~~i~~v~--t~g~w~~~t~~~~v~e~~G~a~~ygd~ 138 (388) |. ...+ ++..|.....+ ++++|+|.+...-...+.+|-. +.+.|.-.+ +....-.+...-=. T Consensus 1 m~-----~~~~-~~~TL~e~Ak~~~~~~~l~~~IIE~l~~tn~IL~~lpf~e~N~~t~~~~~----vrt~LP~~~fR~lN 70 (331) T protein:vir:10 1 MP-----TLST-TNPTLADVAARMTPDGKIDPQIVEMLNETNEILDDMTVIEANGFTEHKTT----VRSGLPTGTWRKLN 70 (331) T ss_pred CC-----cccc-CcccHHHHHHhcCcchhHHHHHHHHHhcCchHHhhceeeeccCCccceee----EEeccCCchhhccC Confidence 11 0000 22244454433 4567888876655556666665 345554333 22111222222222 Q ss_pred cCCceeeeeeeeeeeeEEEEEEEEeecHHHHHHHHHhC--CChHHHHHHHHHHHHHHhhceEEEEeecCccccceEEE-- Q lcl|Aclame:pro 139 TNIPLSSWNVNFERRTIVRGEMGIQVGLLEEGRASAMR--INSAEVKRQGAAVQLEIMRNAIGFYGWEGKNGNRTFGF-- 214 (388) Q Consensus 139 ~diP~~~~n~~~~~~~v~~~~~~~~y~~~El~~A~~~g--~~l~~~K~~aAr~a~~~~~n~i~~~G~a~~~~~g~~Gl-- 214 (388) ..++-.....++.+..+..++.-+++... .|...| -.+-+....+-.+++.+...+..||||...+..++-|| T Consensus 71 ~g~~~s~~tt~q~t~~l~ilgg~~eVDk~---la~~~Gn~~~~ra~e~~~~ik~m~~~~~~~~iyGD~a~~p~~F~GL~k 147 (331) T protein:vir:10 71 YGVQPEKSRTVQVKDSMGMLETYAEVDKA---LADLNGNSAAWRLSEDRAFIEGMNQTQATTLFYGDSSIDAEKFMGLTP 147 (331) T ss_pred CccCcccceeEEEEEEEEEeccceeechH---HHhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhcCCcccChhhhccchh Confidence 34566666666777777777766666653 343444 12234455556677788888888999976666678787 Q ss_pred -eecCCCccc---cccccCCcccccccCCHHHHHHHHHHHHHHHHHhcCCeeccccccceEEcCHHHHHhhccCCCcCcc Q lcl|Aclame:pro 215 -LNDPSLLPA---IASTTPGGWVSGGANAFQGIVGDLRLMLITLRVQSEDNIDPEDVDITLVLPMNKVDMLSVVTDLGIS 290 (388) Q Consensus 215 -lN~P~l~a~---~~~~~~~~~t~Wa~kT~~eI~~DI~~~~~~l~~~s~g~v~~~~~p~tL~Lp~~~~~~Ls~~~~~~~T 290 (388) +|++..... +.+.+ ++++. ..||.-.-| +. -.+.+-|...- .| T Consensus 148 R~~~~~a~~~~q~IdaGg-tG~~~-----------------TSI~~v~~~---~~--~~~giyPkG~~--------~G-- 194 (331) T protein:vir:10 148 RFNSLSAENGQNIIDAGG-TGSDN-----------------ASIWLTVWG---PN--TLHTIYPKGSQ--------AG-- 194 (331) T ss_pred hccccccccccceeecCC-CCCCc-----------------eEEEEEEEc---CC--eeEEecccccc--------cC-- Confidence 555432110 11111 11110 011111000 00 00111111110 01 Q ss_pred HHHHHHHhCCccEEEEccc--cccccCCCCccEEEEEEcccccccccccCCCcceEeecchhhhccCceeccCceEEecc Q lcl|Aclame:pro 291 VRDWLKQTYPRVRVMSAPE--LQGGNPDDGKDIAYMFLDSVDTAVDGSTDGGDTWAQLVQSKFVTLGVEKRVKNYVEAYS 368 (388) Q Consensus 291 vl~~lk~n~pnl~i~~~pe--l~~a~gtg~~~~~~~~~~~~d~~~~~~~~~~~t~~~~~p~~~r~~~v~~~~~~~~~~~~ 368 (388) |+++..-| +..+ .+..+..|..+ |+=+ T Consensus 195 -----------l~~~d~g~~~~~~~----~G~~y~~y~~~----------------------------------~~w~-- 223 (331) T protein:vir:10 195 -----------LQSRDLGEDTLIDA----AGGRYQGYRTH----------------------------------YKWD-- 223 (331) T ss_pred -----------ceEeecCceeeecC----CCCeeeEEEEE----------------------------------EEee-- Confidence 22222111 0011 01222222111 2222 Q ss_pred cceeeeeeeccccceeeccC Q lcl|Aclame:pro 369 NATAGVMLKRPWAVVRLIGL 388 (388) Q Consensus 369 ~~t~G~ii~rP~ai~~~~GI 388 (388) -|+.|+-|-+++|.-.| T Consensus 224 ---~Gl~i~d~r~v~ri~NI 240 (331) T protein:vir:10 224 ---IGLTLRDWRYVVRIANV 240 (331) T ss_pred ---eeeEEcCcccEEEEecc Confidence 28999999999999999 No 137 >protein:vir:107826 Length: 331 # NCBI annotation: hypothetical protein predicted by GeneMark # Family: family:all:1903 # MgeID: mge:1673 # MgeName: BIP-1 # Cross-refs: genbank:acc:NP_996627;genbank:gi:45580761;genbank:GeneID:2767902 Probab=20.06 E-value=2.8 Score=18.10 Aligned_cols=222 Identities=14% Similarity=0.067 Sum_probs=100.7 Q ss_pred cCcccccccccccchHHHHHHHh------hcceeeeecccchhhhhhcccc--cCCCCceeeEEEeeeccccceEecccc Q lcl|Aclame:pro 67 FDSAYVAPTTQASIPTPIQFLQQ------WLPGFVKVLTSARKIDEILGVK--TVGSWEDQEIVQGIVEPAGTAMEYGDL 138 (388) Q Consensus 67 mDaa~~~~~t~~~~g~l~~~l~~------idp~v~e~l~~~~~~~~i~~v~--t~g~w~~~t~~~~v~e~~G~a~~ygd~ 138 (388) |. ...+ ++..|.....+ ++++|+|.+...-...+.+|-. +.+.|.-.+ +....-.+...-=. T Consensus 1 m~-----~~~~-~~~TL~e~Ak~~~~~~~l~~~IIE~l~~tn~IL~~lpf~e~N~~t~~~~~----vrt~LP~~~fR~lN 70 (331) T protein:vir:10 1 MP-----TLST-TNPTLADVAARMTPDGKIDPQIVEMLNETNEILDDMTVIEANGFTEHKTT----VRSGLPTGTWRKLN 70 (331) T ss_pred CC-----cccc-CcccHHHHHHhcCcchhHHHHHHHHHhcCchHHhhceeeeccCCccceee----EEeccCCchhhccC Confidence 11 0000 22244454433 4567888876655556666665 345554333 22111222222222 Q ss_pred cCCceeeeeeeeeeeeEEEEEEEEeecHHHHHHHHHhC--CChHHHHHHHHHHHHHHhhceEEEEeecCccccceEEE-- Q lcl|Aclame:pro 139 TNIPLSSWNVNFERRTIVRGEMGIQVGLLEEGRASAMR--INSAEVKRQGAAVQLEIMRNAIGFYGWEGKNGNRTFGF-- 214 (388) Q Consensus 139 ~diP~~~~n~~~~~~~v~~~~~~~~y~~~El~~A~~~g--~~l~~~K~~aAr~a~~~~~n~i~~~G~a~~~~~g~~Gl-- 214 (388) ..++-.....++.+..+..++.-+++... .|...| -.+-+....+-.+++.+...+..||||...+..++-|| T Consensus 71 ~g~~~s~~tt~q~t~~l~ilgg~~eVDk~---la~~~Gn~~~~ra~e~~~~ik~m~~~~~~~~iyGD~a~~p~~F~GL~k 147 (331) T protein:vir:10 71 YGVQPEKSRTVQVKDSMGMLETYAEVDKA---LADLNGNSAAWRLSEDRAFIEGMNQTQATTLFYGDSSIDAEKFMGLTP 147 (331) T ss_pred CccCcccceeEEEEEEEEEeccceeechH---HHhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhcCCcccChhhhccchh Confidence 34566666666777777777766666653 343444 12234455556677788888888999976666678787 Q ss_pred -eecCCCccc---cccccCCcccccccCCHHHHHHHHHHHHHHHHHhcCCeeccccccceEEcCHHHHHhhccCCCcCcc Q lcl|Aclame:pro 215 -LNDPSLLPA---IASTTPGGWVSGGANAFQGIVGDLRLMLITLRVQSEDNIDPEDVDITLVLPMNKVDMLSVVTDLGIS 290 (388) Q Consensus 215 -lN~P~l~a~---~~~~~~~~~t~Wa~kT~~eI~~DI~~~~~~l~~~s~g~v~~~~~p~tL~Lp~~~~~~Ls~~~~~~~T 290 (388) +|++..... +.+.+ ++++. ..||.-.-| +. -.+.+-|...- .| T Consensus 148 R~~~~~a~~~~q~IdaGg-tG~~~-----------------TSI~~v~~~---~~--~~~giyPkG~~--------~G-- 194 (331) T protein:vir:10 148 RFNSLSAENGQNIIDAGG-TGSDN-----------------ASIWLTVWG---PN--TLHTIYPKGSQ--------AG-- 194 (331) T ss_pred hccccccccccceeecCC-CCCCc-----------------eEEEEEEEc---CC--eeEEecccccc--------cC-- Confidence 555432110 11111 11110 011111000 00 00111111110 01 Q ss_pred HHHHHHHhCCccEEEEccc--cccccCCCCccEEEEEEcccccccccccCCCcceEeecchhhhccCceeccCceEEecc Q lcl|Aclame:pro 291 VRDWLKQTYPRVRVMSAPE--LQGGNPDDGKDIAYMFLDSVDTAVDGSTDGGDTWAQLVQSKFVTLGVEKRVKNYVEAYS 368 (388) Q Consensus 291 vl~~lk~n~pnl~i~~~pe--l~~a~gtg~~~~~~~~~~~~d~~~~~~~~~~~t~~~~~p~~~r~~~v~~~~~~~~~~~~ 368 (388) |+++..-| +..+ .+..+..|..+ |+=+ T Consensus 195 -----------l~~~d~g~~~~~~~----~G~~y~~y~~~----------------------------------~~w~-- 223 (331) T protein:vir:10 195 -----------LQSRDLGEDTLIDA----AGGRYQGYRTH----------------------------------YKWD-- 223 (331) T ss_pred -----------ceEeecCceeeecC----CCCeeeEEEEE----------------------------------EEee-- Confidence 22222111 0011 01222222111 2222 Q ss_pred cceeeeeeeccccceeeccC Q lcl|Aclame:pro 369 NATAGVMLKRPWAVVRLIGL 388 (388) Q Consensus 369 ~~t~G~ii~rP~ai~~~~GI 388 (388) -|+.|+-|-+++|.-.| T Consensus 224 ---~Gl~i~d~r~v~ri~NI 240 (331) T protein:vir:10 224 ---IGLTLRDWRYVVRIANV 240 (331) T ss_pred ---eeeEEcCcccEEEEecc Confidence 28999999999999999 No 138 >protein:vir:98525 Length: 331 # NCBI annotation: hypothetical protein predicted by GeneMark # Family: family:all:1903 # MgeID: mge:1592 # MgeName: BMP-1 # Cross-refs: genbank:acc:NP_996579;genbank:gi:45569510;genbank:GeneID:2767853 Probab=20.06 E-value=2.8 Score=18.10 Aligned_cols=222 Identities=14% Similarity=0.067 Sum_probs=100.7 Q ss_pred cCcccccccccccchHHHHHHHh------hcceeeeecccchhhhhhcccc--cCCCCceeeEEEeeeccccceEecccc Q lcl|Aclame:pro 67 FDSAYVAPTTQASIPTPIQFLQQ------WLPGFVKVLTSARKIDEILGVK--TVGSWEDQEIVQGIVEPAGTAMEYGDL 138 (388) Q Consensus 67 mDaa~~~~~t~~~~g~l~~~l~~------idp~v~e~l~~~~~~~~i~~v~--t~g~w~~~t~~~~v~e~~G~a~~ygd~ 138 (388) |. ...+ ++..|.....+ ++++|+|.+...-...+.+|-. +.+.|.-.+ +....-.+...-=. T Consensus 1 m~-----~~~~-~~~TL~e~Ak~~~~~~~l~~~IIE~l~~tn~IL~~lpf~e~N~~t~~~~~----vrt~LP~~~fR~lN 70 (331) T protein:vir:98 1 MP-----TLST-TNPTLADVAARMTPDGKIDPQIVEMLNETNEILDDMTVIEANGFTEHKTT----VRSGLPTGTWRKLN 70 (331) T ss_pred CC-----cccc-CcccHHHHHHhcCcchhHHHHHHHHHhcCchHHhhceeeeccCCccceee----EEeccCCchhhccC Confidence 11 0000 22244454433 4567888876655556666665 345554333 22111222222222 Q ss_pred cCCceeeeeeeeeeeeEEEEEEEEeecHHHHHHHHHhC--CChHHHHHHHHHHHHHHhhceEEEEeecCccccceEEE-- Q lcl|Aclame:pro 139 TNIPLSSWNVNFERRTIVRGEMGIQVGLLEEGRASAMR--INSAEVKRQGAAVQLEIMRNAIGFYGWEGKNGNRTFGF-- 214 (388) Q Consensus 139 ~diP~~~~n~~~~~~~v~~~~~~~~y~~~El~~A~~~g--~~l~~~K~~aAr~a~~~~~n~i~~~G~a~~~~~g~~Gl-- 214 (388) ..++-.....++.+..+..++.-+++... .|...| -.+-+....+-.+++.+...+..||||...+..++-|| T Consensus 71 ~g~~~s~~tt~q~t~~l~ilgg~~eVDk~---la~~~Gn~~~~ra~e~~~~ik~m~~~~~~~~iyGD~a~~p~~F~GL~k 147 (331) T protein:vir:98 71 YGVQPEKSRTVQVKDSMGMLETYAEVDKA---LADLNGNSAAWRLSEDRAFIEGMNQTQATTLFYGDSSIDAEKFMGLTP 147 (331) T ss_pred CccCcccceeEEEEEEEEEeccceeechH---HHhhcCCHHHHHHHHHHHHHHHHHHHHHHHHhcCCcccChhhhccchh Confidence 34566666666777777777766666653 343444 12234455556677788888888999976666678787 Q ss_pred -eecCCCccc---cccccCCcccccccCCHHHHHHHHHHHHHHHHHhcCCeeccccccceEEcCHHHHHhhccCCCcCcc Q lcl|Aclame:pro 215 -LNDPSLLPA---IASTTPGGWVSGGANAFQGIVGDLRLMLITLRVQSEDNIDPEDVDITLVLPMNKVDMLSVVTDLGIS 290 (388) Q Consensus 215 -lN~P~l~a~---~~~~~~~~~t~Wa~kT~~eI~~DI~~~~~~l~~~s~g~v~~~~~p~tL~Lp~~~~~~Ls~~~~~~~T 290 (388) +|++..... +.+.+ ++++. ..||.-.-| +. -.+.+-|...- .| T Consensus 148 R~~~~~a~~~~q~IdaGg-tG~~~-----------------TSI~~v~~~---~~--~~~giyPkG~~--------~G-- 194 (331) T protein:vir:98 148 RFNSLSAENGQNIIDAGG-TGSDN-----------------ASIWLTVWG---PN--TLHTIYPKGSQ--------AG-- 194 (331) T ss_pred hccccccccccceeecCC-CCCCc-----------------eEEEEEEEc---CC--eeEEecccccc--------cC-- Confidence 555432110 11111 11110 011111000 00 00111111110 01 Q ss_pred HHHHHHHhCCccEEEEccc--cccccCCCCccEEEEEEcccccccccccCCCcceEeecchhhhccCceeccCceEEecc Q lcl|Aclame:pro 291 VRDWLKQTYPRVRVMSAPE--LQGGNPDDGKDIAYMFLDSVDTAVDGSTDGGDTWAQLVQSKFVTLGVEKRVKNYVEAYS 368 (388) Q Consensus 291 vl~~lk~n~pnl~i~~~pe--l~~a~gtg~~~~~~~~~~~~d~~~~~~~~~~~t~~~~~p~~~r~~~v~~~~~~~~~~~~ 368 (388) |+++..-| +..+ .+..+..|..+ |+=+ T Consensus 195 -----------l~~~d~g~~~~~~~----~G~~y~~y~~~----------------------------------~~w~-- 223 (331) T protein:vir:98 195 -----------LQSRDLGEDTLIDA----AGGRYQGYRTH----------------------------------YKWD-- 223 (331) T ss_pred -----------ceEeecCceeeecC----CCCeeeEEEEE----------------------------------EEee-- Confidence 22222111 0011 01222222111 2222 Q ss_pred cceeeeeeeccccceeeccC Q lcl|Aclame:pro 369 NATAGVMLKRPWAVVRLIGL 388 (388) Q Consensus 369 ~~t~G~ii~rP~ai~~~~GI 388 (388) -|+.|+-|-+++|.-.| T Consensus 224 ---~Gl~i~d~r~v~ri~NI 240 (331) T protein:vir:98 224 ---IGLTLRDWRYVVRIANV 240 (331) T ss_pred ---eeeEEcCcccEEEEecc Confidence 28999999999999999 Done!