Query lcl|Aclame:protein:vir:5670|NCBI_annot:gp23|genbank:acc:NP_899609;genbank:gi:34419596;genbank:GeneID:2546039 Match_columns 514 No_of_seqs 157 out of 437 Neff 5.3 Searched_HMMs 1612 Date Sun Dec 1 04:24:01 2013 Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_361 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_361_vs_rec_db.hhr No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 protein:vir:5670 Length: 514 # 100.0 1E-257 8E-261 1429.1 37.6 514 1-514 1-514 (514) 2 protein:vir:80986 Length: 528 100.0 3E-243 2E-246 1350.5 36.4 511 1-514 5-528 (528) 3 protein:vir:100603 Length: 529 100.0 2E-241 1E-244 1340.4 36.1 511 1-514 6-529 (529) 4 protein:vir:98143 Length: 524 100.0 9E-241 6E-244 1336.5 36.4 509 1-514 5-524 (524) 5 protein:vir:101039 Length: 529 100.0 1E-240 7E-244 1336.0 36.4 511 1-514 6-529 (529) 6 protein:vir:101811 Length: 529 100.0 2E-240 2E-243 1334.2 36.7 510 1-514 6-529 (529) 7 protein:vir:106286 Length: 534 100.0 5E-240 3E-243 1332.7 37.5 511 1-514 4-534 (534) 8 protein:vir:6901 Length: 522 # 100.0 9E-240 6E-243 1331.1 36.3 509 1-514 8-522 (522) 9 protein:vir:6601 Length: 528 # 100.0 3E-239 2E-242 1328.0 35.8 508 1-514 5-528 (528) 10 protein:vir:107947 Length: 519 100.0 8E-238 5E-241 1320.6 36.7 510 1-514 4-519 (519) 11 protein:vir:103463 Length: 521 100.0 6E-237 4E-240 1315.7 36.4 509 1-514 7-521 (521) 12 protein:vir:7214 Length: 521 # 100.0 2E-236 1E-239 1312.7 36.5 509 1-514 7-521 (521) 13 protein:vir:104915 Length: 470 100.0 9E-221 6E-224 1226.9 34.8 454 1-514 7-469 (470) 14 protein:vir:106998 Length: 468 100.0 2E-220 1E-223 1225.4 35.0 455 1-514 5-467 (468) 15 protein:vir:104549 Length: 462 100.0 4E-217 2E-220 1207.0 34.5 450 1-514 4-461 (462) 16 protein:vir:103181 Length: 457 100.0 4E-213 2E-216 1185.3 34.1 444 1-514 4-456 (457) 17 protein:vir:5942 Length: 523 # 100.0 3E-193 2E-196 1076.0 30.1 441 1-493 8-523 (523) 18 protein:vir:3033 Length: 272 # 95.9 0.0013 8.3E-07 36.3 16.4 266 154-503 1-272 (272) 19 protein:vir:9820 Length: 272 # 95.9 0.0013 8.3E-07 36.3 16.4 266 154-503 1-272 (272) 20 protein:vir:81100 Length: 415 95.4 0.0021 1.3E-06 35.3 15.8 343 1-500 45-415 (415) 21 protein:vir:79987 Length: 415 95.4 0.0021 1.3E-06 35.3 15.8 343 1-500 45-415 (415) 22 protein:vir:98339 Length: 415 95.4 0.0021 1.3E-06 35.3 15.8 343 1-500 45-415 (415) 23 protein:vir:9410 Length: 415 # 95.3 0.0024 1.5E-06 34.9 14.6 348 1-500 45-415 (415) 24 protein:vir:41 Length: 299 # N 95.2 0.0026 1.6E-06 34.8 16.7 275 69-501 1-299 (299) 25 protein:vir:4856 Length: 293 # 93.5 0.0072 4.5E-06 32.3 15.9 261 52-488 1-293 (293) 26 protein:vir:93742 Length: 274 93.2 0.0083 5.2E-06 32.0 17.0 271 133-496 1-274 (274) 27 protein:vir:96123 Length: 274 92.2 0.013 7.8E-06 31.0 15.8 269 133-491 1-274 (274) 28 protein:vir:1886 Length: 385 # 90.3 0.021 1.3E-05 29.7 17.7 331 1-494 37-385 (385) 29 protein:vir:191 Length: 385 # 90.3 0.021 1.3E-05 29.7 17.7 331 1-494 37-385 (385) 30 protein:vir:100135 Length: 418 90.1 0.023 1.4E-05 29.6 14.4 349 1-501 39-418 (418) 31 protein:vir:78523 Length: 338 89.8 0.024 1.5E-05 29.4 14.4 311 57-498 1-338 (338) 32 protein:vir:4953 Length: 397 # 89.1 0.028 1.8E-05 29.1 12.5 336 1-500 1-397 (397) 33 protein:vir:8420 Length: 477 # 89.0 0.029 1.8E-05 29.0 17.9 355 1-500 71-477 (477) 34 protein:vir:4700 Length: 415 # 88.6 0.031 1.9E-05 28.8 17.9 341 1-500 52-415 (415) 35 protein:vir:4600 Length: 415 # 88.6 0.031 1.9E-05 28.8 17.9 341 1-500 52-415 (415) 36 protein:vir:10364 Length: 390 88.3 0.033 2.1E-05 28.7 18.7 333 1-493 34-390 (390) 37 protein:vir:8187 Length: 311 # 85.7 0.051 3.2E-05 27.7 14.4 286 77-496 1-311 (311) 38 protein:vir:96762 Length: 632 85.3 0.054 3.4E-05 27.5 15.8 331 1-487 245-632 (632) 39 protein:vir:7771 Length: 330 # 85.1 0.056 3.4E-05 27.5 14.5 299 149-514 1-323 (330) 40 protein:vir:97433 Length: 274 84.9 0.057 3.5E-05 27.4 17.4 270 133-496 1-274 (274) 41 protein:vir:94494 Length: 274 84.9 0.057 3.5E-05 27.4 17.4 270 133-496 1-274 (274) 42 protein:vir:96262 Length: 274 84.0 0.064 4E-05 27.1 13.9 267 154-500 1-274 (274) 43 protein:vir:95898 Length: 274 84.0 0.064 4E-05 27.1 13.9 267 154-500 1-274 (274) 44 protein:vir:81160 Length: 371 80.9 0.09 5.6E-05 26.3 15.6 321 1-488 20-371 (371) 45 protein:vir:9704 Length: 394 # 79.1 0.11 6.7E-05 25.9 14.2 320 1-490 60-394 (394) 46 protein:vir:9759 Length: 303 # 78.8 0.11 6.9E-05 25.8 16.6 284 76-490 1-303 (303) 47 protein:vir:78223 Length: 333 78.3 0.12 7.2E-05 25.7 16.8 307 38-489 1-333 (333) 48 protein:vir:1638 Length: 298 # 78.1 0.12 7.3E-05 25.7 12.3 279 161-494 1-298 (298) 49 protein:vir:100247 Length: 425 77.9 0.12 7.5E-05 25.6 16.0 324 1-489 68-425 (425) 50 protein:vir:739 Length: 231 # 77.3 0.13 7.9E-05 25.5 13.6 215 209-514 1-231 (231) 51 protein:vir:94142 Length: 304 76.5 0.13 8.3E-05 25.4 15.3 281 133-494 1-304 (304) 52 protein:vir:105905 Length: 304 76.5 0.13 8.3E-05 25.4 15.3 281 133-494 1-304 (304) 53 protein:vir:3613 Length: 272 # 75.9 0.14 8.8E-05 25.2 14.4 263 154-514 1-272 (272) 54 protein:vir:4997 Length: 397 # 75.7 0.14 8.9E-05 25.2 15.8 325 1-502 40-397 (397) 55 protein:vir:81227 Length: 413 75.2 0.15 9.2E-05 25.1 17.4 341 1-514 42-410 (413) 56 protein:vir:9309 Length: 324 # 74.5 0.16 9.8E-05 25.0 17.1 304 33-503 1-324 (324) 57 protein:vir:80930 Length: 278 74.5 0.16 9.8E-05 25.0 16.1 272 142-502 1-278 (278) 58 protein:vir:1433 Length: 435 # 73.7 0.17 0.0001 24.8 13.4 341 1-498 32-435 (435) 59 protein:vir:97053 Length: 390 72.7 0.18 0.00011 24.7 20.2 344 1-493 20-390 (390) 60 protein:vir:1268 Length: 397 # 71.8 0.19 0.00012 24.5 15.3 304 1-489 53-397 (397) 61 protein:vir:99749 Length: 324 70.6 0.21 0.00013 24.3 11.7 301 124-503 1-324 (324) 62 protein:vir:3870 Length: 400 # 69.5 0.22 0.00014 24.2 14.3 320 1-497 45-400 (400) 63 protein:vir:6212 Length: 434 # 66.0 0.27 0.00017 23.7 16.0 346 1-501 43-434 (434) 64 protein:vir:81070 Length: 390 65.3 0.29 0.00018 23.6 19.7 335 1-493 20-390 (390) 65 protein:vir:2430 Length: 318 # 60.9 0.36 0.00023 23.0 18.0 288 41-501 1-318 (318) 66 protein:vir:9574 Length: 300 # 59.8 0.39 0.00024 22.9 17.0 279 76-513 1-300 (300) 67 protein:vir:80684 Length: 315 59.3 0.4 0.00025 22.8 14.7 289 142-503 1-315 (315) 68 protein:vir:1239 Length: 274 # 54.9 0.49 0.00031 22.3 16.7 269 133-496 1-274 (274) 69 protein:vir:105038 Length: 428 54.0 0.51 0.00032 22.2 15.2 325 1-496 56-428 (428) 70 protein:vir:94771 Length: 298 53.4 0.53 0.00033 22.1 14.0 281 154-494 1-298 (298) 71 protein:vir:107593 Length: 392 52.8 0.54 0.00034 22.0 16.0 321 1-501 39-392 (392) 72 protein:vir:102873 Length: 392 52.8 0.54 0.00034 22.0 16.0 321 1-501 39-392 (392) 73 protein:vir:102082 Length: 392 52.8 0.54 0.00034 22.0 16.0 321 1-501 39-392 (392) 74 protein:vir:105004 Length: 392 52.8 0.54 0.00034 22.0 16.0 321 1-501 39-392 (392) 75 protein:vir:2504 Length: 305 # 52.3 0.56 0.00035 22.0 16.7 280 76-497 1-305 (305) 76 protein:vir:78830 Length: 324 52.3 0.56 0.00035 22.0 17.6 303 11-498 1-324 (324) 77 protein:vir:96392 Length: 324 52.3 0.56 0.00035 22.0 17.6 303 11-498 1-324 (324) 78 protein:vir:79928 Length: 393 48.2 0.68 0.00042 21.5 10.1 357 4-514 1-391 (393) 79 protein:vir:4339 Length: 395 # 47.1 0.71 0.00044 21.4 18.5 348 1-488 9-395 (395) 80 protein:vir:80376 Length: 435 43.6 0.84 0.00052 21.0 17.8 340 1-498 46-435 (435) 81 protein:vir:4092 Length: 390 # 43.2 0.85 0.00053 21.0 15.5 349 1-504 2-390 (390) 82 protein:vir:4830 Length: 397 # 41.8 0.91 0.00056 20.8 16.4 319 1-500 47-397 (397) 83 protein:vir:2344 Length: 397 # 41.6 0.92 0.00057 20.8 18.1 303 69-514 1-329 (397) 84 protein:vir:105334 Length: 276 38.1 1.1 0.00067 20.4 15.5 267 154-502 1-276 (276) 85 protein:vir:104256 Length: 458 32.9 1.4 0.00087 19.8 17.9 337 1-488 85-458 (458) 86 protein:vir:104085 Length: 320 32.4 1.4 0.00089 19.7 13.8 294 148-497 1-320 (320) 87 protein:vir:94622 Length: 341 30.5 1.6 0.00098 19.5 10.1 288 133-497 1-341 (341) 88 protein:vir:103955 Length: 324 30.5 1.6 0.00098 19.5 18.0 302 21-503 1-324 (324) 89 protein:vir:3845 Length: 395 # 30.3 1.6 0.00099 19.5 18.5 329 1-502 37-395 (395) 90 protein:vir:100884 Length: 389 30.0 1.6 0.001 19.5 17.8 323 1-502 37-389 (389) 91 protein:vir:94673 Length: 419 29.8 1.6 0.001 19.4 20.4 346 1-514 53-417 (419) 92 protein:vir:1025 Length: 408 # 26.9 1.9 0.0012 19.1 12.0 342 1-503 5-408 (408) 93 protein:vir:97148 Length: 324 26.9 1.9 0.0012 19.1 18.6 304 21-503 1-324 (324) 94 protein:vir:95107 Length: 270 25.7 2 0.0013 18.9 12.9 261 144-505 1-270 (270) 95 protein:vir:95763 Length: 297 24.2 2.2 0.0014 18.7 18.2 278 66-496 1-297 (297) 96 protein:vir:99920 Length: 311 21.9 2.5 0.0016 18.4 14.9 281 173-488 1-311 (311) No 1 >protein:vir:5670 Length: 514 # NCBI annotation: gp23 # Family: family:all:364 # MgeID: mge:119 # MgeName: KVP40 # Cross-refs: genbank:acc:NP_899609;genbank:gi:34419596;genbank:GeneID:2546039 Probab=100.00 E-value=1.2e-257 Score=1429.06 Aligned_cols=514 Identities=100% Similarity=1.410 Sum_probs=507.0 Q ss_pred CcchhhhhhhhccccccccccccchhhhhhhhhhhhHHHHHHhcccccchhhhhhhcccccccccccccccccccccccc Q lcl|Aclame:pro 1 MNLTEKWKDLLEAEGADMPEIATATKQKIMSKIFENQDRDINNDPMYRDPQLVEAFNAGLNEAVVNGDHGYDPANIAQGV 80 (514) Q Consensus 1 ~~l~~kw~p~l~~~~~~~~~i~~~~~~~~~~~~~enq~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~g~~~~~~~~st 80 (514) |||+|||+|||||||+|||||++.|||+|+++||||||||++|+++|+|++++|+|+++|+|+.++|+||++++||+||+ T Consensus 1 ~~l~~kw~p~l~~~~~~~~~i~~~~~~~~~~~l~enq~~~~~~~~~~~~~~~~~~~~~~l~e~~~~~~~~~~~~~ia~s~ 80 (514) T protein:vir:56 1 MNLTEKWKDLLEAEGADMPEIATATKQKIMSKIFENQDRDINNDPMYRDPQLVEAFNAGLNEAVVNGDHGYDPANIAQGV 80 (514) T ss_pred CchhhhhhHHhcccccccccccchhhhhhhhhhhhhHHHHHhcCCcccchhhhhhhhccccccccccccccccccccccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccccccccceeeehhhhhhhhhhhhcceeEEecCCcccceeeeeeeeecCCCCccccccccccCCCCccCcccccccccc Q lcl|Aclame:pro 81 TTGAVTNIGPTVMGMVRRAIPQLIAFDIAGVQPMTGPTSQVFTLRSVYGKDPLTGAEAFHPTRQADASFSGQAAASTIAD 160 (514) Q Consensus 81 ~tg~v~~~~P~l~~l~Rra~~~LIa~DI~GVQPmTgPTGLIFAMRSrY~~~~~tg~EA~~~~nEadt~fSG~~~~~~~~~ 160 (514) +|++|+++||+||+|||||+|||||+||||||||||||||||||||+|++++++|.||||++||+|++|||+.+..+..+ T Consensus 81 ~t~~v~~~~P~ll~lvRRa~~~LIa~DIwGVQPMTgPTGLIFAMRsrY~~~~~tg~EAf~~~nEadt~fSG~~~~~~~~~ 160 (514) T protein:vir:56 81 TTGAVTNIGPTVMGMVRRAIPQLIAFDIAGVQPMTGPTSQVFTLRSVYGKDPLTGAEAFHPTRQADASFSGQAAASTIAD 160 (514) T ss_pred ccccccccchhHHHHHHHHHHhhhhhhhheeccCCchhhhheeeeeeecCCCcccccccccccccCcCcccccccccccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999988888 Q ss_pred ccccccccccccccccccccccccccccccccccccccccCccccccccccccccccccccccccchhhhccccCCCCCC Q lcl|Aclame:pro 161 FPTTGAATDGTPYKAEVTTSGGDVSMRYFLALGAVTLAVAGQMTATEYTDGVAGGLLVEIDAGMATSQAELQENFNGSSN 240 (514) Q Consensus 161 ~~~~~~~t~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~y~~~~GmtTa~aEal~~~ggs~~ 240 (514) .+..+..+.+++..+..+...++++...+...+.....+.+..+.+++.+.++++.+|+++.||+|+.+|+++++|++++ T Consensus 161 ~~~~~~~~~G~~~~~~~t~~~gd~~~~~~~~~~~~~~~~~~~~~~t~~~~~~a~~~~y~~~~Gm~Ta~aEal~~lggs~~ 240 (514) T protein:vir:56 161 FPTTGAATDGTPYKAEVTTSGGDVSMRYFLALGAVTLAVAGQMTATEYTDGVAGGLLVEIDAGMATSQAELQENFNGSSN 240 (514) T ss_pred ccccccccccccccccccccccccccccccccccccccccccccccccccccccchhhhhhhhhhhhhhhhcccCCCCcc Confidence 88889999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccccceeEEEEEEEEeecccccccccHHHHHHHHhhcCCChhHHHHHHHHHHHHHHhhHHHHHHHhhheeecccccccc Q lcl|Aclame:pro 241 NEWNEMSFRIDKQVVEAKSRQLKAQYSIELAQDLRAVHGLDADAELSGILANEVMVELNREIVNLVNSQAQIGKSGWTQG 320 (514) Q Consensus 241 ~~f~EMsFsIEK~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAEaELanILStEImlEINReii~~l~~~a~v~~~~~~~~ 320 (514) ++|+||+|+||||+|||||||||||||||||||||||||||||+||+||||||||+|||||||++|+++|+|+|+||+++ T Consensus 241 ~~f~EMaFsIdK~tVtAKSRaLKAEYTiELAQDLKAVHGLDAEtELsNILSTEImlEINReii~~l~~~atv~~~~~~~~ 320 (514) T protein:vir:56 241 NEWNEMSFRIDKQVVEAKSRQLKAQYSIELAQDLRAVHGLDADAELSGILANEVMVELNREIVNLVNSQAQIGKSGWTQG 320 (514) T ss_pred cccceeeeEEEEEEEeeeccceeccccHHHHHHHHHhcCCChHHHHHHHHHHHHHHHhhHHHHHHHHhheeehhcccccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cCCcceeccccccccccchhhHHHHHHHHHHHHHHHHHHHHhcccccccEEEEChhHHhHHhhccccccchhccccCccc Q lcl|Aclame:pro 321 AGAAGVFDFSDAVDVKGARWAGEAYKALLIQIEKEANEIGRQTGRGNGNFIIASRNVVSALSMTDTLVGPAAQGMQDGSM 400 (514) Q Consensus 321 v~~~g~~dl~~~~d~~~~rwa~e~~r~L~~~i~~~a~~I~~~T~r~~~n~~v~S~~va~~L~~~g~~~~~~~~~~~~~~~ 400 (514) ++++|+|||++++|++|+||++||+|+|+++||||+|+|+|+|+||+||||||||+||++|+|+|||++++++|..++.. T Consensus 321 ~~~~G~~d~~~~~d~~~~~~~~e~~~~l~~~i~~~an~i~~~T~rg~gn~~i~S~~Va~~L~~sg~l~~~~~~g~~~~~~ 400 (514) T protein:vir:56 321 AGAAGVFDFSDAVDVKGARWAGEAYKALLIQIEKEANEIGRQTGRGNGNFIIASRNVVSALSMTDTLVGPAAQGMQDGSM 400 (514) T ss_pred cccccccccccccccccchHHHHHHHHHHHHHHHHHHHHHhhcccccccEEEEchhHHHHHHhhhhhccccccCcccccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccccCceEEEEecCceEEEecCCCccceEEEEEecCCCcccceeeccccccccccccCCccccceeeeeeeeeeeecCc Q lcl|Aclame:pro 401 NTDTNQTVFAGVLGGRFKVYIDQYAVNDYFTVGFKGSTEMDAGVFYSPYVPLTPLRGSDSKNFQPVIGFKTRYGVQVNPF 480 (514) Q Consensus 401 ~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~kG~~~~~~~~fy~PYv~~~~~~~~dp~s~qp~~~~~tRY~l~~nPf 480 (514) ++|+++.+|+|+|+|||+||||||+++|||+|||||++++|+||||||||||++++++||+||||+|||||||||++||| T Consensus 401 ~~d~~~~~~aG~l~~~~~vy~D~y~~~dy~~vG~KG~~~~~~glfyaPYv~l~~~~~~dp~sfqP~~g~~tRY~l~~NPy 480 (514) T protein:vir:56 401 NTDTNQTVFAGVLGGRFKVYIDQYAVNDYFTVGFKGSTEMDAGVFYSPYVPLTPLRGSDSKNFQPVIGFKTRYGVQVNPF 480 (514) T ss_pred ccccCcceEEEEecCceEEEecCCCCcceEEEEEecCcceecceeeccccccccccccCCccccceeeeeeeeceeeCCC Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccccCcceeecCcchhhhccccceeeeeeeecC Q lcl|Aclame:pro 481 ADPTASATKVGNGAPVAASMGKNAYFRRVFVKGL 514 (514) Q Consensus 481 ~~~~~~~~~i~~~~~~~~~~~~~~~~r~~~V~~~ 514 (514) +++++...++.|+|+..++.+||.|||||+|||| T Consensus 481 ~~~~~~~~~~~~~~~~~a~~~~n~y~r~v~v~~l 514 (514) T protein:vir:56 481 ADPTASATKVGNGAPVAASMGKNAYFRRVFVKGL 514 (514) T ss_pred CCccccccccCCcchhhhcccccceeeeEEEecC Confidence 9999988999999999999999999999999999 No 2 >protein:vir:80986 Length: 528 # NCBI annotation: gp23 major head protein # Family: family:all:364 # MgeID: mge:1888 # MgeName: Phi1 # Cross-refs: genbank:acc:YP_001469506;genbank:gi:157311463;genbank:GeneID:5602119 Probab=100.00 E-value=2.6e-243 Score=1350.55 Aligned_cols=511 Identities=58% Similarity=0.910 Sum_probs=456.6 Q ss_pred CcchhhhhhhhccccccccccccchhhhhhhhhhhhHHHHHHhcccccchhhhhhhcccccccccccccccccccccccc Q lcl|Aclame:pro 1 MNLTEKWKDLLEAEGADMPEIATATKQKIMSKIFENQDRDINNDPMYRDPQLVEAFNAGLNEAVVNGDHGYDPANIAQGV 80 (514) Q Consensus 1 ~~l~~kw~p~l~~~~~~~~~i~~~~~~~~~~~~~enq~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~g~~~~~~~~st 80 (514) .+|+|||+||||||| +|||++.+||+|+|+|||||||+++|+|+|+|++++++||.+|.||+++|+|||++.+|+||+ T Consensus 5 ~~l~~kw~p~l~~~~--~~~i~~~~~~~~~a~llenq~~~~~~~~~~~~~~~~~~~~~~l~ea~~~~~~~~~~~~i~es~ 82 (528) T protein:vir:80 5 KELMEKWSPLLENEK--LPEIATASKQKLVAKILESQEADFAVDPIYKDEKVVEAFGGFIAEAEVAGDHGYDASQIAAGQ 82 (528) T ss_pred HHHHHhhhHhhcCCc--cchhcchhhhhhhhhhhhhhhHHhhccccccchHHHHhhhhhccccccccccCCccccccccc Confidence 789999999999997 899999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccccccccceeeehhhhhhhhhhhhcceeEEecCCcccceeeeeeeeecCCCC--ccccccccccCCCCccCcccccccc Q lcl|Aclame:pro 81 TTGAVTNIGPTVMGMVRRAIPQLIAFDIAGVQPMTGPTSQVFTLRSVYGKDPL--TGAEAFHPTRQADASFSGQAAASTI 158 (514) Q Consensus 81 ~tg~v~~~~P~l~~l~Rra~~~LIa~DI~GVQPmTgPTGLIFAMRSrY~~~~~--tg~EA~~~~nEadt~fSG~~~~~~~ 158 (514) +|++|++|||+||+|||||+|||||+||||||||||||||||||||||+++++ +++||||+++++|+.||+....... T Consensus 83 ~t~~v~~~~P~Li~lvRra~p~LIa~DIwGVQPMTgPTGLIFAMRsrY~~~~~~~~~~ea~~~~~~~da~fS~~~t~~~a 162 (528) T protein:vir:80 83 TTGAITNVGPAVIGMVRRAIPNLIAFDICGVQPMSTPTSQIFAIRSVYGPNPLASQAKEAFHPMYAPDAFHSSLAAKGAA 162 (528) T ss_pred cccccccCCchhhhHHHHHHhhhhhhhhheeccCCchhhhheeeeeeecCCccccccccccccccccccccccccccccc Confidence 99999999999999999999999999999999999999999999999999864 4789999999999999998665443 Q ss_pred ccccccccccccc---cccc----cccccccccccccccccccccccccCcccc-ccccccccccccccccccccchhhh Q lcl|Aclame:pro 159 ADFPTTGAATDGT---PYKA----EVTTSGGDVSMRYFLALGAVTLAVAGQMTA-TEYTDGVAGGLLVEIDAGMATSQAE 230 (514) Q Consensus 159 ~~~~~~~~~t~~~---~~~~----~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~a~~~~y~~~~GmtTa~aE 230 (514) ......+...... .... .........+...............+.... .......+.+.+|+++.||+|+.+| T Consensus 163 ~~~ea~t~fs~~~~~~~~~~G~~~~~t~~~tg~~~~~~~~~~~~~~~~~gt~~~~~~~~~~~~~~~~~~~~~Gm~Ta~AE 242 (528) T protein:vir:80 163 VGSPTGTPFAKLAIGTQIEAGDIVHHTFAETGIAYLQNVTAEQVTPTKAGSESEDEVVMKLMEEGKLAEIAFGMATSIAE 242 (528) T ss_pred cccccccccccccccccccccceeccccccccccccccccccccCccccCCcccccccccccccccccccccccchhhhh Confidence 2221111111100 0000 000111111122222333333333333332 3445567788899999999999999 Q ss_pred ccccCCCCCCcccccceeEEEEEEEEeecccccccccHHHHHHHHhhcCCChhHHHHHHHHHHHHHHhhHHHHHHHhhhe Q lcl|Aclame:pro 231 LQENFNGSSNNEWNEMSFRIDKQVVEAKSRQLKAQYSIELAQDLRAVHGLDADAELSGILANEVMVELNREIVNLVNSQA 310 (514) Q Consensus 231 al~~~ggs~~~~f~EMsFsIEK~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAEaELanILStEImlEINReii~~l~~~a 310 (514) .++.+|++++++|+||+|+||||+|||||||||||||||||||||||||||||+||+||||+|||+|||||||++|+..| T Consensus 243 ~le~lg~ss~~~f~EMaFsIEKvTVtAKSRaLKAEYTiELAQDLKAIHGLDAEtELaNILStEImlEINReii~~i~~~a 322 (528) T protein:vir:80 243 IQEGFNGSSNNPWAEMSMRIDKQVVEAKSRQLKARYSIEVAQDLRAVHGMDADAELNAILANEVLLEINREIVDVINFTA 322 (528) T ss_pred hhcccCCCccccccceeeEEEEEEEeeeccceeccccHHHHHHHHHhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhhee Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eecccccccccCC-cceeccccccccccchhhHHHHHHHHHHHHHHHHHHHHhcccccccEEEEChhHHhHHhhcccccc Q lcl|Aclame:pro 311 QIGKSGWTQGAGA-AGVFDFSDAVDVKGARWAGEAYKALLIQIEKEANEIGRQTGRGNGNFIIASRNVVSALSMTDTLVG 389 (514) Q Consensus 311 ~v~~~~~~~~v~~-~g~~dl~~~~d~~~~rwa~e~~r~L~~~i~~~a~~I~~~T~r~~~n~~v~S~~va~~L~~~g~~~~ 389 (514) ++++.+|+++|++ +|+|||+++.|++|+||++||+|+|++|||+|||+|+|+|+||+||||||||+||++|+++|.+++ T Consensus 323 ~~~~~~~t~~~~~~~G~~dl~~~~d~~g~r~~~e~~k~L~~~i~~~an~I~~~T~~~~gn~vi~S~~Va~~L~~~g~~~~ 402 (528) T protein:vir:80 323 QVGKTGMTQTVGSKAGVFDLQDPIDTRGARWAGESFKSLIYQIDKEAAEIARQTGRGAGNFVIASRNVVNILASADQGIS 402 (528) T ss_pred eeeeeeeeeccccccceeeccccccccccchhHHHHHHHHHHHHHHHHHHHHhhccccccEEEEchHHHHHHhhcccccc Confidence 9999999999977 699999999999999999999999999999999999999999999999999999999999999999 Q ss_pred chhccccCccccccccCceEEEEecCceEEEecCCCccceEEEEEecCCCcccceeeccccccccccccCCccccceeee Q lcl|Aclame:pro 390 PAAQGMQDGSMNTDTNQTVFAGVLGGRFKVYIDQYAVNDYFTVGFKGSTEMDAGVFYSPYVPLTPLRGSDSKNFQPVIGF 469 (514) Q Consensus 390 ~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~kG~~~~~~~~fy~PYv~~~~~~~~dp~s~qp~~~~ 469 (514) +++++.... .++|+++.+|+|+|+|||+||||||+++|||+|||||++++|+||||||||||+|++++||+||||+||| T Consensus 403 ~~~~~~~~~-~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~~~glfy~PYv~l~~~~~~dp~sfqP~~g~ 481 (528) T protein:vir:80 403 LAMQGAAKG-LNTDTTKAVFAGVLAGKYKVFIDQYARQDYFTVGYKGDNEMDAGIYYAPYVALTPLRATDPQSFHPVLGF 481 (528) T ss_pred ccccccccc-cccCCCCceEEEEecCceEEEecCCCCcceEEEEEeCCcccccceeecccccceeeEeeCCccccceeee Confidence 999886544 7899999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eeeeeeeecCccccccC--cceeecCcchhhhccccceeeeeeeecC Q lcl|Aclame:pro 470 KTRYGVQVNPFADPTAS--ATKVGNGAPVAASMGKNAYFRRVFVKGL 514 (514) Q Consensus 470 ~tRY~l~~nPf~~~~~~--~~~i~~~~~~~~~~~~~~~~r~~~V~~~ 514 (514) ||||||++|||++..++ +.||+++++|++++|||+|||||+|||| T Consensus 482 ~tRY~l~~NP~~~~~~~~~~~r~~~g~~~~~~ag~n~~~r~~~Vk~~ 528 (528) T protein:vir:80 482 KTRYGIGINPFADSKSQAPSARITSGMLSKDSVGKNAYFRRVWVKGC 528 (528) T ss_pred eeeeceeecCcccccCCcccccccccchhhhhcCccceeEEeeeccC Confidence 99999999999998765 5899999999999999999999999999 No 3 >protein:vir:100603 Length: 529 # NCBI annotation: gp23 precursor of major head subunit # Family: family:all:364 # MgeID: mge:1488 # MgeName: 25 # Cross-refs: genbank:acc:YP_656387;genbank:gi:109290138;genbank:GeneID:4156581 Probab=100.00 E-value=1.8e-241 Score=1340.44 Aligned_cols=511 Identities=62% Similarity=0.981 Sum_probs=456.4 Q ss_pred CcchhhhhhhhccccccccccccchhhhhhhhhhhhHHHHHHhcccccchhhhhhhcccccccccccccccccccccccc Q lcl|Aclame:pro 1 MNLTEKWKDLLEAEGADMPEIATATKQKIMSKIFENQDRDINNDPMYRDPQLVEAFNAGLNEAVVNGDHGYDPANIAQGV 80 (514) Q Consensus 1 ~~l~~kw~p~l~~~~~~~~~i~~~~~~~~~~~~~enq~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~g~~~~~~~~st 80 (514) .+|+|||+||||||| +|||++.+||+|+++|||||||+|+|+|+|+|.+++|+|+.+|+|+.++|+||+++++|++|+ T Consensus 6 ~~l~~kw~p~l~~~~--~~~i~~~~~~~~~a~l~enq~~~~~~~~~~~~~~~~e~~~~~l~e~~~~~~~~~~~~~ia~s~ 83 (529) T protein:vir:10 6 KEILNKWTPLLEGEG--LPEIAGKNKQALVAQILEAQEKDSKTDPVYRDDKLIEAFGQSLMEAEVAGDHGYDPTNIAAGQ 83 (529) T ss_pred HHHHHHhhHhhcCCc--cchhcchhhhhhhhhhhhhHHHHhhcccccchhhhhhhhhhccchhhcccccccccccccccc Confidence 579999999999997 899999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccccccccceeeehhhhhhhhhhhhcceeEEecCCcccceeeeeeeeecCCCC--ccccccccccCCCCccCcccccccc Q lcl|Aclame:pro 81 TTGAVTNIGPTVMGMVRRAIPQLIAFDIAGVQPMTGPTSQVFTLRSVYGKDPL--TGAEAFHPTRQADASFSGQAAASTI 158 (514) Q Consensus 81 ~tg~v~~~~P~l~~l~Rra~~~LIa~DI~GVQPmTgPTGLIFAMRSrY~~~~~--tg~EA~~~~nEadt~fSG~~~~~~~ 158 (514) +|++|+++||+||+|||||+|||||+||||||||||||||||||||||+++.+ ++.|||++++|||+.|||...+... T Consensus 84 ~t~~v~~~~P~Li~lvRra~p~LIa~DIwGVQPMTgPTGLIFAMRsrY~~~~~~~~g~eaf~~~~e~dt~~SG~~~~~~~ 163 (529) T protein:vir:10 84 SSGAITNIGPAVIGMVRRAIPSLIAFDIAGVQPMTGPTGQVFALRSVYGKDPLAAGAKEAFHPMYAPDAWHSGLAAKGAT 163 (529) T ss_pred cccccccccchhhhhHHHHHHhHHhhhhheeccCCchhhhhhhheeeecCCcCCCccccccccccccccccccccccccc Confidence 99999999999999999999999999999999999999999999999999865 4789999999999999998776654 Q ss_pred cccccccccccccccc--------ccccccccccccccccccccccccccCccccccccccccccccccccccccchhhh Q lcl|Aclame:pro 159 ADFPTTGAATDGTPYK--------AEVTTSGGDVSMRYFLALGAVTLAVAGQMTATEYTDGVAGGLLVEIDAGMATSQAE 230 (514) Q Consensus 159 ~~~~~~~~~t~~~~~~--------~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~y~~~~GmtTa~aE 230 (514) ......+....+.... ......+..+................+...+..+...++.+.+|+++.||+|+.+| T Consensus 164 ~~~~~~~~~~~t~~~a~~~~~~~~~~~nea~t~~s~~~tg~~~~~g~~~tg~~~~~~~~~~~a~~~~~~~~~gmsTa~aE 243 (529) T protein:vir:10 164 TSSDGTPFAALTAGQAVATGDIVYHFFYESGSAYLQNVTGGNVTVGTNETGAALDALVSAKIAAGELAEIAEGMATSIAE 243 (529) T ss_pred ccccccccccccccceeeccccceeeecccccccccccccccccccccccCCccccccccccccccccccccccchhhhh Confidence 3333222221111110 00000000000000000111111222333445566678888999999999999999 Q ss_pred ccccCCCCCCcccccceeEEEEEEEEeecccccccccHHHHHHHHhhcCCChhHHHHHHHHHHHHHHhhHHHHHHHhhhe Q lcl|Aclame:pro 231 LQENFNGSSNNEWNEMSFRIDKQVVEAKSRQLKAQYSIELAQDLRAVHGLDADAELSGILANEVMVELNREIVNLVNSQA 310 (514) Q Consensus 231 al~~~ggs~~~~f~EMsFsIEK~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAEaELanILStEImlEINReii~~l~~~a 310 (514) +|++||++++++|+||+|+||||+|||||||||||||||||||||||||||||+||+||||+|||+|||||||++|+..| T Consensus 244 al~~~g~ss~~~f~EMaFsIeK~tVtAKSRaLKAEYTiELAQDLKAvHGLDAEtELsNILStEImlEINReii~~i~~~a 323 (529) T protein:vir:10 244 LRQGFNGTTDNPWNEMSFRIDKQTVEAKSRQLKAQYSIELAQDLRAVHGMDADSELNGILANEVMLEINREVIDWINYTA 323 (529) T ss_pred ccccCCCCccccccceeeEEEEEEEeeeccceeccccHHHHHHHHHhcCCChHHHHHHHHHHHHHHHhhHHHHHHhhhhc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eecccccccccC-CcceeccccccccccchhhHHHHHHHHHHHHHHHHHHHHhcccccccEEEEChhHHhHHhhcccccc Q lcl|Aclame:pro 311 QIGKSGWTQGAG-AAGVFDFSDAVDVKGARWAGEAYKALLIQIEKEANEIGRQTGRGNGNFIIASRNVVSALSMTDTLVG 389 (514) Q Consensus 311 ~v~~~~~~~~v~-~~g~~dl~~~~d~~~~rwa~e~~r~L~~~i~~~a~~I~~~T~r~~~n~~v~S~~va~~L~~~g~~~~ 389 (514) ++++.+|++.++ .+|+|||.++.|+.++||++||+|+|++|||+|||+|+|+|+||+||||||||+||++|+|.|.+++ T Consensus 324 ~~~~~g~~~~~~~~~gv~d~~~~~d~~~~~~~~e~~~~L~~~i~~~an~I~~~T~rg~~n~vi~S~~Va~~L~~~~~~~~ 403 (529) T protein:vir:10 324 QVGKSGWTQTVGSAAGVFDFQDPIDVRGARWAGESYKALLIQIDKEANEIARQTGRGAGNFIIASRNVVSALALVDAGIT 403 (529) T ss_pred eeeeeeeeccccccccceeccccccccccchhHHHHHHHHHHHHHHHHHHHHhhccccceEEEEchHHHHHHhhhccccc Confidence 999999998886 5799999999999999999999999999999999999999999999999999999999999999999 Q ss_pred chhccccCccccccccCceEEEEecCceEEEecCCCccceEEEEEecCCCcccceeeccccccccccccCCccccceeee Q lcl|Aclame:pro 390 PAAQGMQDGSMNTDTNQTVFAGVLGGRFKVYIDQYAVNDYFTVGFKGSTEMDAGVFYSPYVPLTPLRGSDSKNFQPVIGF 469 (514) Q Consensus 390 ~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~kG~~~~~~~~fy~PYv~~~~~~~~dp~s~qp~~~~ 469 (514) +++++...+ .++|+++++|||+|+|||+||||||+++|||+|||||++++|+||||||||||+|++++||+||||+||| T Consensus 404 ~~~~~~~sg-~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~~~glfy~PYv~l~~~~~~dp~sfqP~~g~ 482 (529) T protein:vir:10 404 PAAQGMASG-LNADTTKGVFAGVLGGRYKVYIDQYARQDYFTMGYRGANNLDAGIYYCPYVALTPLRGSDPKNFQPVMGF 482 (529) T ss_pred ccccccccc-ceeecCCceEEEEecCceEEEecCCCCcceEEEEEeCCcccccceeeccccccccccccCCCcccceeee Confidence 999887655 6689999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eeeeeeeecCccccccC--cceeecCcchhhhccccceeeeeeeecC Q lcl|Aclame:pro 470 KTRYGVQVNPFADPTAS--ATKVGNGAPVAASMGKNAYFRRVFVKGL 514 (514) Q Consensus 470 ~tRY~l~~nPf~~~~~~--~~~i~~~~~~~~~~~~~~~~r~~~V~~~ 514 (514) ||||||++|||+++.++ ..||+|++||++++|||+|||||+|||| T Consensus 483 ~tRY~l~~NP~~~~~~~~~~~r~~~g~~~~~~ag~n~~~r~~~Vk~l 529 (529) T protein:vir:10 483 KTRYAIGVNPFAESRTQAPTSRISNGMPGAHSVGKNAYFRRVWVKGL 529 (529) T ss_pred eeeeceeecCccccccccccccccCCcchhhhcCccceeeEeeeccC Confidence 99999999999998776 5799999999999999999999999999 No 4 >protein:vir:98143 Length: 524 # NCBI annotation: gp23 precursor of major head subunit # Family: family:all:364 # MgeID: mge:1667 # MgeName: RB43 # Cross-refs: genbank:acc:YP_239203;genbank:gi:66391678;genbank:GeneID:3416245 Probab=100.00 E-value=9.2e-241 Score=1336.54 Aligned_cols=509 Identities=59% Similarity=0.946 Sum_probs=450.1 Q ss_pred CcchhhhhhhhccccccccccccchhhhhhhhhhhhHHHHHHhcccccchhhhhhhcccccccccccccccccccccccc Q lcl|Aclame:pro 1 MNLTEKWKDLLEAEGADMPEIATATKQKIMSKIFENQDRDINNDPMYRDPQLVEAFNAGLNEAVVNGDHGYDPANIAQGV 80 (514) Q Consensus 1 ~~l~~kw~p~l~~~~~~~~~i~~~~~~~~~~~~~enq~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~g~~~~~~~~st 80 (514) .+|+|||+||||+|.. +|||++.|||+|+|+||||||||++|+++|+|++++++||.+|.||+++++||++++||++|+ T Consensus 5 ~~l~~kw~p~l~~~~~-~~~i~~~~~~~~~a~llenq~~~~~~~~~~~~~~~~~~~~~~l~ea~~~~~~~~~~~~i~~s~ 83 (524) T protein:vir:98 5 NELMEKWNDLLESQEG-LPDIATKSKKQLVAAILEAQEKDAETDPVYRDEKIVESFGGFLAEAEIAGDHNYDQTNIASGK 83 (524) T ss_pred HHHHHHhHHHhcCCcC-cchhcchhhHHHHHHHHhhHHHHHhcCccccchHHHHhhhccccccccccccccccccccccc Confidence 6799999999998532 899999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccccccccceeeehhhhhhhhhhhhcceeEEecCCcccceeeeeeeeecCCCCc------cccccccccCCCCccCcccc Q lcl|Aclame:pro 81 TTGAVTNIGPTVMGMVRRAIPQLIAFDIAGVQPMTGPTSQVFTLRSVYGKDPLT------GAEAFHPTRQADASFSGQAA 154 (514) Q Consensus 81 ~tg~v~~~~P~l~~l~Rra~~~LIa~DI~GVQPmTgPTGLIFAMRSrY~~~~~t------g~EA~~~~nEadt~fSG~~~ 154 (514) +|++|+++||+||+|||||+|||||+|||||||||||||||||||+||++++++ ..||||++++||++|||.++ T Consensus 84 ~t~~v~~~~P~Li~lvRra~p~LIa~DIwGVQPMTgPTGLIFAmRsrY~n~~~~~gteA~~nEAf~~~ye~dt~fSG~g~ 163 (524) T protein:vir:98 84 SSGAITNIGPAVIGMVRRAIPNLIAFDICGVQPMTGPTGQVFALRAVYGKDPLAGGTPADVREAFHPMFAPDTMYSGEGA 163 (524) T ss_pred cccccccccchhhhHHHHHHHhhhhhhhheeccCCchhhhhhhhheeecCCCCCcccccccccccccccccccccCCccc Confidence 999999999999999999999999999999999999999999999999998654 26888899999999999877 Q ss_pred ccccccccccccccccccccccccccccccccccccccccccccccCccccccccccccccccccccccccchhhhcccc Q lcl|Aclame:pro 155 ASTIADFPTTGAATDGTPYKAEVTTSGGDVSMRYFLALGAVTLAVAGQMTATEYTDGVAGGLLVEIDAGMATSQAELQEN 234 (514) Q Consensus 155 ~~~~~~~~~~~~~t~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~y~~~~GmtTa~aEal~~ 234 (514) .....+.+.......+..........+ ................+.............+.+..|+++.||+|+.+|+|++ T Consensus 164 ~t~~s~~~~g~~~~~g~~~~~~~~~~g-~~~~~~~~~g~~~~tgt~p~~~~~a~~~~~~~g~~~~~~~GmsTA~aEaL~~ 242 (524) T protein:vir:98 164 HTAFAKITTGTAIATGAIVYHIFQETG-IAYFQNVTSGNVTVTGADPAALDAAVIAENEKGTLAEISVGMATSVAELQEN 242 (524) T ss_pred ccccccccccccccccccccccccccc-ceeccccccCcccccccccccccccccccccccceeecccccchhhhhhhcc Confidence 766554443333332222222211111 1111111111111111112222233445667788999999999999999999 Q ss_pred CCCCCCcccccceeEEEEEEEEeecccccccccHHHHHHHHhhcCCChhHHHHHHHHHHHHHHhhHHHHHHHhhheeecc Q lcl|Aclame:pro 235 FNGSSNNEWNEMSFRIDKQVVEAKSRQLKAQYSIELAQDLRAVHGLDADAELSGILANEVMVELNREIVNLVNSQAQIGK 314 (514) Q Consensus 235 ~ggs~~~~f~EMsFsIEK~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAEaELanILStEImlEINReii~~l~~~a~v~~ 314 (514) ||++++++|+||+|+||||+|||||||||||||||||||||||||||||+||+||||||||+|||||||++|+..+++++ T Consensus 243 ~g~ss~~~f~EMaFsIeKvtVtAKSRaLKAEYTiELAQDLKAVHGLDAEtELsNILSTEImlEINReii~~i~~~a~~~~ 322 (524) T protein:vir:98 243 FNGSSANPWNEMAFRIDKQVIEARSRQLKAQYSVELAQDLRAVHGMDADAELSAILATEIMLEINREIVDLINYTAQVGK 322 (524) T ss_pred CCCCccccccceeeEEEEEEEeeecccccccccHHHHHHHHHhcCCChHHHHHHHHHHHHHHHhhHHHHHHHhhhheece Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccccccCC-cceeccccccccccchhhHHHHHHHHHHHHHHHHHHHHhcccccccEEEEChhHHhHHhh--ccccccch Q lcl|Aclame:pro 315 SGWTQGAGA-AGVFDFSDAVDVKGARWAGEAYKALLIQIEKEANEIGRQTGRGNGNFIIASRNVVSALSM--TDTLVGPA 391 (514) Q Consensus 315 ~~~~~~v~~-~g~~dl~~~~d~~~~rwa~e~~r~L~~~i~~~a~~I~~~T~r~~~n~~v~S~~va~~L~~--~g~~~~~~ 391 (514) .+|+.++.+ +|+|||+++.|..++||++||+|+|++|||+|||+|+|+|+||+|||||||||||++|+| .|++++++ T Consensus 323 ~g~t~~~~~~~G~~dl~~~~d~~~~r~~~e~~~~L~~~i~~~an~I~~~T~rg~~n~~i~S~~Va~~L~~~~~g~~~~s~ 402 (524) T protein:vir:98 323 SGFTQTVGSKAGSFDFQDPVDIRGARWAGESYKALLIQIDKEANEIARQTGRGAGNFIIASRNVVSALARIDSGITPASQ 402 (524) T ss_pred eecccccccccceeeccccccccccchhHHHHHHHHHHHHHHHHHHHHhhccccccEEEEchHHHHHHhhhhcccccccc Confidence 999998765 699999999999999999999999999999999999999999999999999999999999 78887655 Q ss_pred hccccCccccccccCceEEEEecCceEEEecCCCccceEEEEEecCCCcccceeeccccccccccccCCccccceeeeee Q lcl|Aclame:pro 392 AQGMQDGSMNTDTNQTVFAGVLGGRFKVYIDQYAVNDYFTVGFKGSTEMDAGVFYSPYVPLTPLRGSDSKNFQPVIGFKT 471 (514) Q Consensus 392 ~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~kG~~~~~~~~fy~PYv~~~~~~~~dp~s~qp~~~~~t 471 (514) .+ ++..++|+++.+|+|+|+|||+||||||+++|||+|||||++++|+||||||||||+|++++||+||||+||||| T Consensus 403 ~~---~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~~~glfyaPYv~l~~~~~~dp~sfqP~~g~~t 479 (524) T protein:vir:98 403 GL---QKTLNVDTTKAVFAGVLGGTYKVYIDQYARQDYFTVGFKGDNEMDAGIYYAPYVALTPLRGSDPKNFQPVMGFKT 479 (524) T ss_pred hh---hcccccCCccceEEEEecCceEEEecCCCCcceEEEEeeCCcccccceeeccccccccccccCCccccceeeeee Confidence 54 567899999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eeeeeecCccccccC--cceeecCcchhhhccccceeeeeeeecC Q lcl|Aclame:pro 472 RYGVQVNPFADPTAS--ATKVGNGAPVAASMGKNAYFRRVFVKGL 514 (514) Q Consensus 472 RY~l~~nPf~~~~~~--~~~i~~~~~~~~~~~~~~~~r~~~V~~~ 514 (514) ||||++|||++..++ +.||+++++|++++|+|+|||+|+|||| T Consensus 480 RY~l~~NP~~~~~~~~~~~ri~~g~~~~~~ag~n~~~r~~~Vk~l 524 (524) T protein:vir:98 480 RYGIGINPFANSRSQAPADRITSGMISKEMCGKNAYFRKVWVKGL 524 (524) T ss_pred eeceeecCcccccCCccccccccCcchHhhcCccceeeEeeeccC Confidence 999999999997765 3599999999999999999999999999 No 5 >protein:vir:101039 Length: 529 # NCBI annotation: major capsid protein # Family: family:all:364 # MgeID: mge:1582 # MgeName: 44RR2.8t # Cross-refs: genbank:acc:NP_932516;genbank:gi:37651642;genbank:GeneID:2610532 Probab=100.00 E-value=1.1e-240 Score=1336.02 Aligned_cols=511 Identities=61% Similarity=0.973 Sum_probs=453.7 Q ss_pred CcchhhhhhhhccccccccccccchhhhhhhhhhhhHHHHHHhcccccchhhhhhhcccccccccccccccccccccccc Q lcl|Aclame:pro 1 MNLTEKWKDLLEAEGADMPEIATATKQKIMSKIFENQDRDINNDPMYRDPQLVEAFNAGLNEAVVNGDHGYDPANIAQGV 80 (514) Q Consensus 1 ~~l~~kw~p~l~~~~~~~~~i~~~~~~~~~~~~~enq~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~g~~~~~~~~st 80 (514) .+|+|||+||||||| +|||++.+||+|+|+|||||||+++|+++|+|.+++|+|+.+|+|+.++|+|||++++|+||+ T Consensus 6 ~~l~~kw~p~l~~~~--~~~i~~~~~~~~~a~l~enq~~~~~~~~~~~~~~~~e~~~~~l~~~~~~~~~~~~~~~i~est 83 (529) T protein:vir:10 6 KEILNKWTPLLEGEG--LPEIAGKNKQALVAQILEAQEKDSKSDPVYRDDKLIEAFGQSLMEAEVAGDHGYDPTNIAAGQ 83 (529) T ss_pred HHHHHHhHHHhcCCc--cchhccchhhhhhhhhhhhhHHHHhhccccchhhhhhhhhcccchhhcccccccccccccccc Confidence 689999999999997 899999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccccccccceeeehhhhhhhhhhhhcceeEEecCCcccceeeeeeeeecCCCCc--cccccccccCCCCccCcccccccc Q lcl|Aclame:pro 81 TTGAVTNIGPTVMGMVRRAIPQLIAFDIAGVQPMTGPTSQVFTLRSVYGKDPLT--GAEAFHPTRQADASFSGQAAASTI 158 (514) Q Consensus 81 ~tg~v~~~~P~l~~l~Rra~~~LIa~DI~GVQPmTgPTGLIFAMRSrY~~~~~t--g~EA~~~~nEadt~fSG~~~~~~~ 158 (514) +|++|++|||+||+|||||+|||||+||||||||||||||||||||||++++++ +.|+||+++.|++.|||....... T Consensus 84 ~t~~v~~~~P~Li~lvRra~p~LIa~DIwGVQPMTgPTGLIFAMRsrY~~~~~~~~~~eaf~~~y~Pda~~sga~~~ga~ 163 (529) T protein:vir:10 84 SSGAITNIGPAVIGMVRRAIPSLIAFDIAGVQPMTGPTGQVFALRSVYGKDPLAAGAKEAFHPMYAPDAWHSSLATKGAT 163 (529) T ss_pred ccccccccCchhhhhHHHHHhhhhhheeeeeecCCchhhhhhhhheeecCCccccccccccccccccccccccccccccc Confidence 999999999999999999999999999999999999999999999999998764 689999999999999998776554 Q ss_pred cccccccccccc--cccc------ccccccccccccccccccccccccccCccccccccccccccccccccccccchhhh Q lcl|Aclame:pro 159 ADFPTTGAATDG--TPYK------AEVTTSGGDVSMRYFLALGAVTLAVAGQMTATEYTDGVAGGLLVEIDAGMATSQAE 230 (514) Q Consensus 159 ~~~~~~~~~t~~--~~~~------~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~y~~~~GmtTa~aE 230 (514) ......+..... .... +.........................+...........+.+..|+++.||+|+.+| T Consensus 164 ~~~~~~~~~~~t~~~~~a~~~g~ea~f~ea~t~fs~~~~g~~~~~g~~~~~~~~~~~~~~~~a~~~~~~~~~Gm~Ta~aE 243 (529) T protein:vir:10 164 TTTDGTPFAKLTAGQAIAEGDIVGHFFYESGTAFLQNVSGASVTVGTNETGEALDKLINAAIGEGKLAEIAEGMATSIAE 243 (529) T ss_pred cccCccccccccccccccccCcceeeeecccceecccccccccccCccccCcccccccccccccccccccccccchhhhh Confidence 332222111111 0000 00000000000000000011111122223334455677788999999999999999 Q ss_pred ccccCCCCCCcccccceeEEEEEEEEeecccccccccHHHHHHHHhhcCCChhHHHHHHHHHHHHHHhhHHHHHHHhhhe Q lcl|Aclame:pro 231 LQENFNGSSNNEWNEMSFRIDKQVVEAKSRQLKAQYSIELAQDLRAVHGLDADAELSGILANEVMVELNREIVNLVNSQA 310 (514) Q Consensus 231 al~~~ggs~~~~f~EMsFsIEK~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAEaELanILStEImlEINReii~~l~~~a 310 (514) +|+++|++++++|+||+|+||||+|||||||||||||||||||||||||||||+||+||||+|||+|||||||++|+++| T Consensus 244 aL~~~g~ss~~~f~EMaFsIeK~tVtAKSRaLKAEYTiELAQDLKAVHGLDAEtELsNILStEImlEINReii~~l~~~a 323 (529) T protein:vir:10 244 LRQGFNGSNDNPWNEMSFRIDKQTVEAKSRQLKAQYSIELAQDLRAVHGMDADSELNGILANEVMLEINREVIDWINYTA 323 (529) T ss_pred ccccCCCcccccccceeeEEEEEEEeeeccceeccccHHHHHHHHHhcCCChHHHHHHHHHHHHHHHhhHHHHHhHhhhh Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eecccccccccCC-cceeccccccccccchhhHHHHHHHHHHHHHHHHHHHHhcccccccEEEEChhHHhHHhhcccccc Q lcl|Aclame:pro 311 QIGKSGWTQGAGA-AGVFDFSDAVDVKGARWAGEAYKALLIQIEKEANEIGRQTGRGNGNFIIASRNVVSALSMTDTLVG 389 (514) Q Consensus 311 ~v~~~~~~~~v~~-~g~~dl~~~~d~~~~rwa~e~~r~L~~~i~~~a~~I~~~T~r~~~n~~v~S~~va~~L~~~g~~~~ 389 (514) +++|.+|++..+. +|+|||++++|+.++||++||+|+|++|||+|||+|+|+|+||+||||||||+||++|+|+|++++ T Consensus 324 ~~~k~~g~~~~~~~~Gv~d~~~~~~~~~~~~~~e~~k~L~~~i~~~an~I~~~T~rg~~n~vi~S~~Va~~L~~~~~~~~ 403 (529) T protein:vir:10 324 QVGKSGWTKTDGSASGVFDFQDPIDVRGARWAGESYKALLIQIDKEANEIARQTGRGAGNFIIASRNVVSALALIDTNIS 403 (529) T ss_pred hhhhcccccccccccceeecccCccccccchHHHHHHHHHHHHHHHHHHHHHhhccccceEEEEchHHHHHHHhhhhhcc Confidence 9999999987755 799999999999999999999999999999999999999999999999999999999999999999 Q ss_pred chhccccCccccccccCceEEEEecCceEEEecCCCccceEEEEEecCCCcccceeeccccccccccccCCccccceeee Q lcl|Aclame:pro 390 PAAQGMQDGSMNTDTNQTVFAGVLGGRFKVYIDQYAVNDYFTVGFKGSTEMDAGVFYSPYVPLTPLRGSDSKNFQPVIGF 469 (514) Q Consensus 390 ~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~kG~~~~~~~~fy~PYv~~~~~~~~dp~s~qp~~~~ 469 (514) +++++...+ .++|+++.+|+|+|+|||+||||||+++|||+|||||++++|+||||||||||+|++++||+||||+||| T Consensus 404 ~~~~~~~sg-~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~~~glfy~PYv~l~~~~~~dp~sfqP~~g~ 482 (529) T protein:vir:10 404 PAAQGMASG-LNADTTKGVFAGILGGRYKVYIDQYARQDYFTMGYRGANNLDAGIYYCPYVALTPLRGSDPKNFQPVMGF 482 (529) T ss_pred ccccccccc-cccccCCceEEEEecCceEEEecCCCCcceEEEEEeCCcccccceeeccccccccccccCCCcccceeee Confidence 999988777 4689999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eeeeeeeecCccccccC--cceeecCcchhhhccccceeeeeeeecC Q lcl|Aclame:pro 470 KTRYGVQVNPFADPTAS--ATKVGNGAPVAASMGKNAYFRRVFVKGL 514 (514) Q Consensus 470 ~tRY~l~~nPf~~~~~~--~~~i~~~~~~~~~~~~~~~~r~~~V~~~ 514 (514) ||||||++|||++..++ ..||++++||++++|+|+|||||+|||| T Consensus 483 ~tRY~l~~NP~~~~~~~~~~~r~~~g~~~~~~ag~n~~~r~~~Vk~l 529 (529) T protein:vir:10 483 KTRYAIGVNPFAESRTQAPQGRITSGMPGVNSVGKNAYFRRVWVKGL 529 (529) T ss_pred eeeeceeecCccccccccccccccCCcchhhhcCccceeEEeeeccC Confidence 99999999999987654 5799999999999999999999999999 No 6 >protein:vir:101811 Length: 529 # NCBI annotation: gp23 # Family: family:all:364 # MgeID: mge:1580 # MgeName: 31 # Cross-refs: genbank:acc:YP_238888;genbank:gi:66391963;genbank:GeneID:3416638 Probab=100.00 E-value=2.4e-240 Score=1334.22 Aligned_cols=510 Identities=61% Similarity=0.975 Sum_probs=453.3 Q ss_pred CcchhhhhhhhccccccccccccchhhhhhhhhhhhHHHHHHhcccccchhhhhhhcccccccccccccccccccccccc Q lcl|Aclame:pro 1 MNLTEKWKDLLEAEGADMPEIATATKQKIMSKIFENQDRDINNDPMYRDPQLVEAFNAGLNEAVVNGDHGYDPANIAQGV 80 (514) Q Consensus 1 ~~l~~kw~p~l~~~~~~~~~i~~~~~~~~~~~~~enq~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~g~~~~~~~~st 80 (514) .+|+|||+||||||| +|||++.+||+|+|+|||||||+++|+++|+|.+++|+|+.+|+|+.++|+|||++++|++|+ T Consensus 6 ~~l~~kw~p~l~~~~--~~~i~~~~~~~~~a~l~enq~~~~~~~~~~~~~~~~e~~~~~l~e~~~~~~~~~~~~~i~~st 83 (529) T protein:vir:10 6 KEILNKWTPLLEGEG--LPEIAGKNKQALVAQILEAQEKDSKSDPVYRDDKLIEAFGQSLMEAEVAGDHGYDPTNIAAGQ 83 (529) T ss_pred HHHHHHhhHhhcCCc--cchhccchhhhhhhhhhhhhHHHHhcccccchhhhhhhhhccchhhccccccccccccccccc Confidence 579999999999997 899999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccccccccceeeehhhhhhhhhhhhcceeEEecCCcccceeeeeeeeecCCCCc--cccccccccCCCCccCcccccccc Q lcl|Aclame:pro 81 TTGAVTNIGPTVMGMVRRAIPQLIAFDIAGVQPMTGPTSQVFTLRSVYGKDPLT--GAEAFHPTRQADASFSGQAAASTI 158 (514) Q Consensus 81 ~tg~v~~~~P~l~~l~Rra~~~LIa~DI~GVQPmTgPTGLIFAMRSrY~~~~~t--g~EA~~~~nEadt~fSG~~~~~~~ 158 (514) +|++|++|||+||+|||||+|||||+||||||||||||||||||||||++++++ +.|+||+++.|++.|||+...... T Consensus 84 ~t~~v~~~~P~Li~lvRra~p~LIa~DIwGVQPMTgPTGLIFAMRsrY~~~~~~~~~~eaf~~~~~pda~~sga~~~ga~ 163 (529) T protein:vir:10 84 SSGAITNIGPAVIGMVRRAIPSLIAFDIAGVQPMTGPTGQVFALRSVYGKDPLAAGAKEAFHPMYAPDAWHSSLATKGAT 163 (529) T ss_pred ccccccccCchhhhhHHHHHHhhhhhhhheeccCCchhhhhheeeeeecCCccccccccccccccccccccccccccccc Confidence 999999999999999999999999999999999999999999999999998754 689999999999999999876654 Q ss_pred cccccccccccccc--cc------ccccccccccccc-cccccccccccccCccccccccccccccccccccccccchhh Q lcl|Aclame:pro 159 ADFPTTGAATDGTP--YK------AEVTTSGGDVSMR-YFLALGAVTLAVAGQMTATEYTDGVAGGLLVEIDAGMATSQA 229 (514) Q Consensus 159 ~~~~~~~~~t~~~~--~~------~~~~~~~g~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~y~~~~GmtTa~a 229 (514) ......+....+.. .. +..... +..+.. .............+...........+.+..|+++.||+|+.+ T Consensus 164 t~~~~t~~~~~ta~~~~a~g~g~ea~f~ea-~t~fs~~~~g~~~~~g~~~t~~~~~~~~~~~~a~~~~~~~~~GmsTa~a 242 (529) T protein:vir:10 164 TTTDGTPFAKLTAGQAIAEGDIVGHFFYES-GTAFLQNVSGASVTVGTNETGEALDKLINAAIGEGKLAEIAEGMATSIA 242 (529) T ss_pred ccccccccccccccccccccccceeeeccc-CceeeccccccccccCccccCcccccccccccccccccccccchhhhhh Confidence 33322221111110 00 000000 000000 000001111122222334455667788899999999999999 Q ss_pred hccccCCCCCCcccccceeEEEEEEEEeecccccccccHHHHHHHHhhcCCChhHHHHHHHHHHHHHHhhHHHHHHHhhh Q lcl|Aclame:pro 230 ELQENFNGSSNNEWNEMSFRIDKQVVEAKSRQLKAQYSIELAQDLRAVHGLDADAELSGILANEVMVELNREIVNLVNSQ 309 (514) Q Consensus 230 Eal~~~ggs~~~~f~EMsFsIEK~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAEaELanILStEImlEINReii~~l~~~ 309 (514) |+|++||++++++|+||+|+||||+|||||||||||||||||||||||||||||+||+||||+|||+|||||||++|+++ T Consensus 243 EaL~~~ggss~~~f~EMaFsIeK~tVtAKSRaLKAEYTiELAQDLKAVHGLDAEtELsNILStEImlEINReii~~l~~~ 322 (529) T protein:vir:10 243 ELRQGFNGSNDNPWNEMSFRIDKQTVEAKSRQLKAQYSIELAQDLRAVHGMDADSELNGILANEVMLEINREVIDWINYT 322 (529) T ss_pred hccccCCCcccccccceeeEEEEEEEeeeccceeccccHHHHHHHHHhcCCChHHHHHHHHHHHHHHHhhHHHHHHHhhh Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eeecccccccccCC-cceeccccccccccchhhHHHHHHHHHHHHHHHHHHHHhcccccccEEEEChhHHhHHhhccccc Q lcl|Aclame:pro 310 AQIGKSGWTQGAGA-AGVFDFSDAVDVKGARWAGEAYKALLIQIEKEANEIGRQTGRGNGNFIIASRNVVSALSMTDTLV 388 (514) Q Consensus 310 a~v~~~~~~~~v~~-~g~~dl~~~~d~~~~rwa~e~~r~L~~~i~~~a~~I~~~T~r~~~n~~v~S~~va~~L~~~g~~~ 388 (514) |+++|.+|+..++. +|+|||++++|+.++||++||+|+|++|||+|||+|+|+|+||+||||||||+||++|+|+|++. T Consensus 323 a~~~~~~~~~~~~~~~Gv~d~~~~~~~~~~~~~~e~~~~L~~~i~~~an~I~~~T~rg~~n~vi~S~~Va~~L~~~~~~~ 402 (529) T protein:vir:10 323 AQVGKSGWTKTDGSASGVFDFQDPIDVRGARWAGESYKALLIQIDKEANEIARQTGRGAGNFIIASRNVVSALALIDTNI 402 (529) T ss_pred hhhhccccccccccccceeecccCccccccchHHHHHHHHHHHHHHHHHHHHHhhccccceEEEEchHHHHHHHhhcccc Confidence 99999999987755 79999999999999999999999999999999999999999999999999999999999999998 Q ss_pred cchhccccCccccccccCceEEEEecCceEEEecCCCccceEEEEEecCCCcccceeeccccccccccccCCccccceee Q lcl|Aclame:pro 389 GPAAQGMQDGSMNTDTNQTVFAGVLGGRFKVYIDQYAVNDYFTVGFKGSTEMDAGVFYSPYVPLTPLRGSDSKNFQPVIG 468 (514) Q Consensus 389 ~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~kG~~~~~~~~fy~PYv~~~~~~~~dp~s~qp~~~ 468 (514) .+++++...+ .++|+++.+|+|+|+|||+||||||+++|||+|||||++++|+||||||||||+|++++||+||||+|| T Consensus 403 ~~~~~~~~sg-~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~~~glfy~PYv~l~~~~~~dp~sfqP~~g 481 (529) T protein:vir:10 403 SPAAQGMASG-LNADTTKGVFAGILGGRYKVYIDQYARQDYFTMGYRGANNLDAGIYYCPYVALTPLRGFDPKNFQPVMG 481 (529) T ss_pred cccccccccc-cccccCCceEEEEecCceEEEecCCCCcceEEEEEeCCcccccceeeccccccccccccCCCcccceee Confidence 8888887776 468999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eeeeeeeeecCccccccC--cceeecCcchhhhccccceeeeeeeecC Q lcl|Aclame:pro 469 FKTRYGVQVNPFADPTAS--ATKVGNGAPVAASMGKNAYFRRVFVKGL 514 (514) Q Consensus 469 ~~tRY~l~~nPf~~~~~~--~~~i~~~~~~~~~~~~~~~~r~~~V~~~ 514 (514) |||||||++|||++..++ ..||++++||++++|+|+|||||+|||| T Consensus 482 ~~tRY~l~~NP~~~~~~~~~~~r~~~g~~~~~~ag~n~~~r~~~Vk~l 529 (529) T protein:vir:10 482 FKTRYAIGVNPFAESRTQAPQGRITSGMPGVNSVGKNAYFRRVWVKGL 529 (529) T ss_pred eeeeeceeecCccccccccccccccCCcchhhhcCccceeEEeeeccC Confidence 999999999999987654 5799999999999999999999999999 No 7 >protein:vir:106286 Length: 534 # NCBI annotation: gp23 major head protein # Family: family:all:364 # MgeID: mge:1474 # MgeName: Aeh1 # Cross-refs: genbank:acc:NP_944113;genbank:gi:38640157;genbank:GeneID:2658034 Probab=100.00 E-value=4.7e-240 Score=1332.67 Aligned_cols=511 Identities=58% Similarity=0.952 Sum_probs=451.4 Q ss_pred CcchhhhhhhhccccccccccccchhhhhhhhhhhhHHHHHHhc--ccccchhhhhhhccc--------ccccccccccc Q lcl|Aclame:pro 1 MNLTEKWKDLLEAEGADMPEIATATKQKIMSKIFENQDRDINND--PMYRDPQLVEAFNAG--------LNEAVVNGDHG 70 (514) Q Consensus 1 ~~l~~kw~p~l~~~~~~~~~i~~~~~~~~~~~~~enq~~~~~~~--~~~~~~~~~~~~~~~--------~~~a~~~~~~g 70 (514) .+|+|||+||||||| +|||++.+||+|+|+||||||||++|+ ++|+|++++++|+.+ |.|++++++|| T Consensus 4 ~~l~~kw~p~l~~~~--~~~i~~~~~~~~~a~l~enq~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~ea~~~~~~g 81 (534) T protein:vir:10 4 KSLLKKWQPLVESEG--MPAIASMKRKDIVARIFENQDEDIAHNEGGVYTDQVVVNSMVDVKGRIEEARLAEANIGGDHG 81 (534) T ss_pred hHHHHHhHHhhcCCc--cccccchhhhhhhhhhhhhHHHHHhhhcccccchhhhhhhhhccccchhhccccccccccccc Confidence 899999999999997 899999999999999999999999887 699999999999887 99999999999 Q ss_pred ccccccccccccccccccceeeehhhhhhhhhhhhcceeEEecCCcccceeeeeeeeecCCC--CccccccccccCCCCc Q lcl|Aclame:pro 71 YDPANIAQGVTTGAVTNIGPTVMGMVRRAIPQLIAFDIAGVQPMTGPTSQVFTLRSVYGKDP--LTGAEAFHPTRQADAS 148 (514) Q Consensus 71 ~~~~~~~~st~tg~v~~~~P~l~~l~Rra~~~LIa~DI~GVQPmTgPTGLIFAMRSrY~~~~--~tg~EA~~~~nEadt~ 148 (514) ||++||+||++|++|+++||+||+|||||+|||||+|||||||||||||||||||+||.++. .++.||||+.+.+|++ T Consensus 82 ~~~~~ia~s~~s~~v~~~~P~Li~lvRra~p~LIa~DIwGVQPMTgPTGLIFAMRsrY~n~~~~~s~~EAf~ne~~adt~ 161 (534) T protein:vir:10 82 YDATKIASGETSGSITNVGPAVMGLVRRAIPQLIAFDICGVQPMTSSTGQVFTLRAIYGGNSQDANAREAFHPTYGPDAD 161 (534) T ss_pred cccccccccccccccccccchhhhHHHHHHHhhhhhhhheeccCCchhhhheeeeeeecCCCCCcccccccccccccccc Confidence 99999999999999999999999999999999999999999999999999999999999875 3578999976679999 Q ss_pred cCccccccccccccccccccccccccccccc----cccccccccccccccccccccCccccccccccccccccccccccc Q lcl|Aclame:pro 149 FSGQAAASTIADFPTTGAATDGTPYKAEVTT----SGGDVSMRYFLALGAVTLAVAGQMTATEYTDGVAGGLLVEIDAGM 224 (514) Q Consensus 149 fSG~~~~~~~~~~~~~~~~t~~~~~~~~~~~----~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~y~~~~Gm 224 (514) |||+.+...........+...++........ ..+......................+.......+.+..|+++.|| T Consensus 162 fSG~~~a~~~~~~~~~~a~~~g~~~~~~~~~~t~~~~Gt~~~~~~~~~~v~~~~~~~~~ag~~~~~~~~~~~~y~~~~gm 241 (534) T protein:vir:10 162 FSGRGAAQDIAVFVRGTAVASGAFAKLHIEAATGVQAGTKTVQFIKDYAVDALPADQTEAGLAYKWLLANGYAVETSSAM 241 (534) T ss_pred ccccccccccccccccccccccccccccccccccccccccccccccccccccccCCccccccccccccccccceeccccc Confidence 9999877665544443433333322221111 111111111111111111112222233445566677889999999 Q ss_pred cchhhhccccCCCCCCcccccceeEEEEEEEEeecccccccccHHHHHHHHhhcCCChhHHHHHHHHHHHHHHhhHHHHH Q lcl|Aclame:pro 225 ATSQAELQENFNGSSNNEWNEMSFRIDKQVVEAKSRQLKAQYSIELAQDLRAVHGLDADAELSGILANEVMVELNREIVN 304 (514) Q Consensus 225 tTa~aEal~~~ggs~~~~f~EMsFsIEK~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAEaELanILStEImlEINReii~ 304 (514) +|+.+|+|+.+|++++++|+||+|+||||+|||||||||||||||||||||||||||||+||+||||||||+|||||||+ T Consensus 242 ~Ta~AE~lg~~ggs~~~~f~EMsFsIdKvtVtAKSRaLKAEYTiELAQDLKAIHGLDAEtELsNILSTEImlEINReii~ 321 (534) T protein:vir:10 242 ATAFAELQQGFNGSADNEWNEMSFRIDKQVVEAKSRQLKAQYSIEMAQDLRAVHGLDADSELSSILANEIMHEINREMVL 321 (534) T ss_pred chhhHhhhccCCCCcccchhhcceEEEEEEEeeeccceeccccHHHHHHHHHhcCCChHHHHHHHHHHHHHHHhhHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHhhheeecccccccc-cCCcceeccccccccccchhhHHHHHHHHHHHHHHHHHHHHhcccccccEEEEChhHHhHHhh Q lcl|Aclame:pro 305 LVNSQAQIGKSGWTQG-AGAAGVFDFSDAVDVKGARWAGEAYKALLIQIEKEANEIGRQTGRGNGNFIIASRNVVSALSM 383 (514) Q Consensus 305 ~l~~~a~v~~~~~~~~-v~~~g~~dl~~~~d~~~~rwa~e~~r~L~~~i~~~a~~I~~~T~r~~~n~~v~S~~va~~L~~ 383 (514) +|+++|+++|.+++.+ ++.+|+|||.++.|+.++||++||+|+|+++||+|||+|+|+|+||+|||||||||||++|+| T Consensus 322 ~l~~~a~~~k~~~~~~~~~~~G~~d~~~~~~~~~~~~~~e~~~~L~~~i~~~an~i~~~T~rg~~n~~v~S~~Va~~L~~ 401 (534) T protein:vir:10 322 WINATAKVGKTGWTNMHGGKAGVFDFQDTKDIRGARWAGESYKALVVQIDKEANEIARQTGRGQGNFIICSRNVAAALGH 401 (534) T ss_pred HHhhhhheeecccccccccccceeeeeccccccchhHHHHHHHHHHHHHHHHHHHHHHhhccccccEEEEchhHHHHHhh Confidence 9999999988888875 456899999999999999999999999999999999999999999999999999999999999 Q ss_pred ccccccchhccccCccccccccCceEEEEecCceEEEecCCCccceEEEEEecCCCcccceeeccccccccccccCCccc Q lcl|Aclame:pro 384 TDTLVGPAAQGMQDGSMNTDTNQTVFAGVLGGRFKVYIDQYAVNDYFTVGFKGSTEMDAGVFYSPYVPLTPLRGSDSKNF 463 (514) Q Consensus 384 ~g~~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~kG~~~~~~~~fy~PYv~~~~~~~~dp~s~ 463 (514) +|||++.|+++.. ...++|+++.+|+|+|+|||+||||||+++|||+|||||++++|+||||||||||+|++++||+|| T Consensus 402 ~g~l~~~~~~~~~-~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~~~glfyaPYv~l~~~~~~dp~sf 480 (534) T protein:vir:10 402 TDMLMTPAVMGAN-TTMNTDTTSSLFAGVLAGKYRVYIDQYAVEDYFTVGYKGASEMDAGLYYCPYVALTPLRGTDPKNF 480 (534) T ss_pred ccchhcccccccc-ccccccCCCceEEEEecCceEEEecCCCCcceEEEEEeCCcccccceeeccccccccccccCCccc Confidence 9999999988754 448999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cceeeeeeeeeeeecCccccccC--cceeecCcc-hhhhccccceeeeeeeecC Q lcl|Aclame:pro 464 QPVIGFKTRYGVQVNPFADPTAS--ATKVGNGAP-VAASMGKNAYFRRVFVKGL 514 (514) Q Consensus 464 qp~~~~~tRY~l~~nPf~~~~~~--~~~i~~~~~-~~~~~~~~~~~r~~~V~~~ 514 (514) ||+|||||||||++|||++..++ ..+|+|+++ |++++|+|.|||||+|||| T Consensus 481 qP~~g~~tRY~l~~NP~~~~~~~~~~~~i~~g~~~~~~~ag~n~~~~~~~Vk~l 534 (534) T protein:vir:10 481 QPVLGFKTRYGVKLHPMADATQNKGFAKISNGMPQHTNMFGKNAFFRRVLVAGV 534 (534) T ss_pred cceeeeeeeeceeecCcccccCCccccccccCCcchhhhcccccceeeeeeecC Confidence 99999999999999999998765 369999976 8899999999999999999 No 8 >protein:vir:6901 Length: 522 # NCBI annotation: gp23 major head protein # Family: family:all:364 # MgeID: mge:140 # MgeName: RB69 # Cross-refs: genbank:acc:NP_861877;genbank:gi:32453668;genbank:GeneID:1494303 Probab=100.00 E-value=9e-240 Score=1331.12 Aligned_cols=509 Identities=61% Similarity=0.991 Sum_probs=454.2 Q ss_pred CcchhhhhhhhccccccccccccchhhhhhhhhhhhHHHHHHhcccccchhhhhhhcccccccccccccccccccccccc Q lcl|Aclame:pro 1 MNLTEKWKDLLEAEGADMPEIATATKQKIMSKIFENQDRDINNDPMYRDPQLVEAFNAGLNEAVVNGDHGYDPANIAQGV 80 (514) Q Consensus 1 ~~l~~kw~p~l~~~~~~~~~i~~~~~~~~~~~~~enq~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~g~~~~~~~~st 80 (514) .+|+|||+||||||| +|+|++. ||+|+|+|||||||+++|+|+|+|++++++||++|+||+++|+||+++.+|+||+ T Consensus 8 e~l~~kw~p~l~~~~--~~~~~~~-~~~~~a~l~enq~~~~~~~~~~~~~~~~~~~~~~l~ea~~~~~~~~~~~~i~es~ 84 (522) T protein:vir:69 8 AQLVDKWKELLEGEG--LPEIANS-KQAIIAKIFENQEKDFEVSPEYKDEKIAQAFGSFLTEAEIGGDHGYNAQNIAAGQ 84 (522) T ss_pred HHHHHhhHHHhcCCC--CCccccc-hhhhhhhhhhhhhHHhhcccccchhHHHHhhhhhhhhhccccccCCCcccccccc Confidence 679999999999998 8999986 9999999999999999999999999999999999999999999999999999999 Q ss_pred ccccccccceeeehhhhhhhhhhhhcceeEEecCCcccceeeeeeeeecCCCCc--cccccccccCCCCccCcccccccc Q lcl|Aclame:pro 81 TTGAVTNIGPTVMGMVRRAIPQLIAFDIAGVQPMTGPTSQVFTLRSVYGKDPLT--GAEAFHPTRQADASFSGQAAASTI 158 (514) Q Consensus 81 ~tg~v~~~~P~l~~l~Rra~~~LIa~DI~GVQPmTgPTGLIFAMRSrY~~~~~t--g~EA~~~~nEadt~fSG~~~~~~~ 158 (514) +|++|++|||+||+|+|||+|||||+||||||||||||||||||||||+++.++ +.|||+++||+|++|||.+..... T Consensus 85 ~t~~v~~~~P~li~lvrRa~p~LIa~DIwGVQPMTgPTGLIFAMRsrY~~q~~~~~~~eaf~~~neadt~fSG~~~~t~~ 164 (522) T protein:vir:69 85 TSGAVTQIGPAVMGMVRRAIPNLIAFDICGVQPMNSPTGQVFALRAVYGKDPIAAGAKEAFHPMYAPDAMFSGQGAAKKF 164 (522) T ss_pred cccccccccchHHHHHHHHHhhhhhhhceeeccCCchhhhheeeeeeccCCcccCccccccccccccccccccccccccc Confidence 999999999999999999999999999999999999999999999999998654 789999999999999999777666 Q ss_pred ccccccccccccccccccccccccccccccccccccccccccCccccccccccccccccccccccccchhhhccccCCCC Q lcl|Aclame:pro 159 ADFPTTGAATDGTPYKAEVTTSGGDVSMRYFLALGAVTLAVAGQMTATEYTDGVAGGLLVEIDAGMATSQAELQENFNGS 238 (514) Q Consensus 159 ~~~~~~~~~t~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~y~~~~GmtTa~aEal~~~ggs 238 (514) ...+.......++.........+ .+.........................+..+.+.+|+++.||+|+.+|+++.+|++ T Consensus 165 ~~~~~~~~t~~G~~~~~~~~~~g-t~~~~~~a~~t~~~t~~~~~~~~~ai~s~~~~~~~y~~g~GmsTa~aEal~~lggs 243 (522) T protein:vir:69 165 PALAASTQTKVGDIYTHFFQETG-TVYLQASAQVTISSSADDAAKLDAEIIKQMEAGALVEIAEGMATSIAELQEGFNGS 243 (522) T ss_pred ccccccccccccccccccccccc-ceeeecccCCcCCCCCcccccccchhccccccccceeeccccchhhhhhcccCCCC Confidence 55544443333333222211111 11000000001111111111122344567788899999999999999999999999 Q ss_pred CCcccccceeEEEEEEEEeecccccccccHHHHHHHHhhcCCChhHHHHHHHHHHHHHHhhHHHHHHHhhheeecccccc Q lcl|Aclame:pro 239 SNNEWNEMSFRIDKQVVEAKSRQLKAQYSIELAQDLRAVHGLDADAELSGILANEVMVELNREIVNLVNSQAQIGKSGWT 318 (514) Q Consensus 239 ~~~~f~EMsFsIEK~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAEaELanILStEImlEINReii~~l~~~a~v~~~~~~ 318 (514) ++++|+||+|+||||+|||||||||||||||||||||||||||||+||+||||||||+|||||||++|+..|++++.+++ T Consensus 244 s~~~f~EMaFsIeKvTVtAKSRaLKAEYTiELAQDLKAIHGLDAEtELaNILSTEImlEINReii~~i~~sa~~~~~g~t 323 (522) T protein:vir:69 244 TDNPWNEMGFRIDKQVIEAKSRQLKAAYSIELAQDLRAVHGMDADAELSGILATEIMLEINREVVDWINYSAQVGKSGMT 323 (522) T ss_pred cccchhhhcceEeeEEEeeecccccccccHHHHHHHHHhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhhhheeeccccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccCC-cceeccccccccccchhhHHHHHHHHHHHHHHHHHHHHhcccccccEEEEChhHHhHHhhccccccchhccccC Q lcl|Aclame:pro 319 QGAGA-AGVFDFSDAVDVKGARWAGEAYKALLIQIEKEANEIGRQTGRGNGNFIIASRNVVSALSMTDTLVGPAAQGMQD 397 (514) Q Consensus 319 ~~v~~-~g~~dl~~~~d~~~~rwa~e~~r~L~~~i~~~a~~I~~~T~r~~~n~~v~S~~va~~L~~~g~~~~~~~~~~~~ 397 (514) +.+++ +|+|||+++.|+.++||++||+|+|++|||+|||+|+|+|+||+|||||||||||++|+|+|.++++++++... T Consensus 324 ~~~~~~~Gv~Dl~~~~~~~~~rw~~e~~k~L~~~i~~~an~i~~~T~rg~~n~~i~S~~Va~~L~~~~~~~~~~~~~~~~ 403 (522) T protein:vir:69 324 NIVGSKAGVFDFQDPIDIRGARWAGESFKALLFQIDKEAVEIARQTGRGEGNFIIASRNVVNVLASVDTGISYAAQGLAS 403 (522) T ss_pred cccccccceeecccccccccchhHHHHHHHHHHHHHHHHHHHHHhcccccccEEEEchhHHHHHhhcccccccccccccc Confidence 88855 89999999999999999999999999999999999999999999999999999999999999999999988665 Q ss_pred ccccccccCceEEEEecCceEEEecCCCccceEEEEEecCCCcccceeeccccccccccccCCccccceeeeeeeeeeee Q lcl|Aclame:pro 398 GSMNTDTNQTVFAGVLGGRFKVYIDQYAVNDYFTVGFKGSTEMDAGVFYSPYVPLTPLRGSDSKNFQPVIGFKTRYGVQV 477 (514) Q Consensus 398 ~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~kG~~~~~~~~fy~PYv~~~~~~~~dp~s~qp~~~~~tRY~l~~ 477 (514) + .++|+++++|||+|+|||+||||||+++|||+|||||++++|+||||||||||+|++++||+||||+|||||||||++ T Consensus 404 g-~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~~~glfyaPYv~l~~~~~~dp~sfqP~~g~~tRY~l~v 482 (522) T protein:vir:69 404 G-FNTDTTKSVFAGVLGGKYRVYIDQYAKQDYFTVGYKGANEMDAGIYYAPYVALTPLRGSDPKNFQPVMGFKTRYGIGV 482 (522) T ss_pred c-ccccCCCceEEEEecCceEEEecCCCCcceEEEEEeCCcccccceeeccccccccccccCCccccceeeeeeeeceee Confidence 5 678999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cCccccccC--cceeecCcc-hhhhccccceeeeeeeecC Q lcl|Aclame:pro 478 NPFADPTAS--ATKVGNGAP-VAASMGKNAYFRRVFVKGL 514 (514) Q Consensus 478 nPf~~~~~~--~~~i~~~~~-~~~~~~~~~~~r~~~V~~~ 514 (514) |||++..++ +.||+|++| +.+.+|||.|||||+|||| T Consensus 483 NP~~~~~~~~~~~ri~~g~p~~~~~~~~n~y~r~v~v~~~ 522 (522) T protein:vir:69 483 NPFAESSLQAPGARIQSGMPSILNSLGKNAYFRRVYVKGI 522 (522) T ss_pred cCcccccCCcccceeecccchhhcccCCcceeeEEEeecC Confidence 999986543 589999997 5589999999999999999 No 9 >protein:vir:6601 Length: 528 # NCBI annotation: major capsid protein # Family: family:all:364 # MgeID: mge:139 # MgeName: RB49 # Cross-refs: genbank:acc:NP_891732;genbank:gi:33620668;genbank:GeneID:1725275 Probab=100.00 E-value=3.3e-239 Score=1328.05 Aligned_cols=508 Identities=59% Similarity=0.930 Sum_probs=451.8 Q ss_pred CcchhhhhhhhccccccccccccchhhhhhhhhhhhHHHHHHhcccccchhhhhhhcccccccccccccccccccccccc Q lcl|Aclame:pro 1 MNLTEKWKDLLEAEGADMPEIATATKQKIMSKIFENQDRDINNDPMYRDPQLVEAFNAGLNEAVVNGDHGYDPANIAQGV 80 (514) Q Consensus 1 ~~l~~kw~p~l~~~~~~~~~i~~~~~~~~~~~~~enq~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~g~~~~~~~~st 80 (514) .+|+|||+||||||| +|||++.+||+|+|+|||||||+++|+|+|++++++|+|+.+|+||+++|+|||++.+|++|+ T Consensus 5 ~~l~~kw~p~l~~~~--~~~i~~~~~~~~~a~l~enq~~~~~~~~~~~~~~~~~~~~~~l~ea~~~~~~~~~~~~i~es~ 82 (528) T protein:vir:66 5 KELMEKWSPLLENEK--LPEIATASKQKLVAKILESQEADFAVDPIYKDEKVVEAFGGFIAEAEVAGDHGYDASQIAAGQ 82 (528) T ss_pred HHHHHHhHHhhcCCC--cchhcchhhhhhhhhhhhhhHHHhhcccchhhHHHHHhhhhhhhhhcccccccccchhccccc Confidence 789999999999997 899999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccccccccceeeehhhhhhhhhhhhcceeEEecCCcccceeeeeeeeecCCCCc--cccccccccCCCCccCcccccccc Q lcl|Aclame:pro 81 TTGAVTNIGPTVMGMVRRAIPQLIAFDIAGVQPMTGPTSQVFTLRSVYGKDPLT--GAEAFHPTRQADASFSGQAAASTI 158 (514) Q Consensus 81 ~tg~v~~~~P~l~~l~Rra~~~LIa~DI~GVQPmTgPTGLIFAMRSrY~~~~~t--g~EA~~~~nEadt~fSG~~~~~~~ 158 (514) +|++|++|||+||+|||||+|||||+||||||||||||||||||||+|++++++ +.||||+++.+++.||+....... T Consensus 83 ~t~~v~~~~P~Li~lvRRa~p~LIa~DIwGVQPMTgPTGlIFAmRs~Y~~~~~~~~~~eAfh~~~g~ea~fsea~t~~a~ 162 (528) T protein:vir:66 83 TTGAITNVGPAVIGMVRRAIPNLIAFDICGVQPMSTPTSQIFAIRSVYGGDPLKSGAREAFHPMYAPDAFHSSLAAKEAT 162 (528) T ss_pred cccccccCchhHHHHHHHHHHhhhhhhhheeecCCchhhhheeeeeeecCCccccccccccccccccccccccccccccc Confidence 999999999999999999999999999999999999999999999999998754 578999999999999987655443 Q ss_pred ccccccccccccccccccccccccccccccc----------cccccccccccCcc-ccccccccccccccccccccccch Q lcl|Aclame:pro 159 ADFPTTGAATDGTPYKAEVTTSGGDVSMRYF----------LALGAVTLAVAGQM-TATEYTDGVAGGLLVEIDAGMATS 227 (514) Q Consensus 159 ~~~~~~~~~t~~~~~~~~~~~~~g~~~~~~~----------~~~~~~~~~~~~~~-~~~~~~~~~a~~~~y~~~~GmtTa 227 (514) ..+++.-.....+... ....++.+.+.. ...........+.. .........+.+.+|+++.||+|+ T Consensus 163 ~gGpTGliFAm~s~y~---s~~~g~ea~~nea~t~fs~~~~~~~~~~~~~~~g~~~g~~~~~~~~a~~~~~~~~~Gm~Ta 239 (528) T protein:vir:66 163 VGSPTGTAFAKLTLSQ---AITAGDIVYHTFAETGIAYLQNVTGDSVTPQKVGSESEDEVVMKLIEEGKLAEIAFGMATS 239 (528) T ss_pred ccCCccceeecccccc---cccccceeeecccccceeeeccccccccccCcccccccccccccccccccceecccccchh Confidence 3333211111111100 001111111111 11111111111111 223344456777889999999999 Q ss_pred hhhccccCCCCCCcccccceeEEEEEEEEeecccccccccHHHHHHHHhhcCCChhHHHHHHHHHHHHHHhhHHHHHHHh Q lcl|Aclame:pro 228 QAELQENFNGSSNNEWNEMSFRIDKQVVEAKSRQLKAQYSIELAQDLRAVHGLDADAELSGILANEVMVELNREIVNLVN 307 (514) Q Consensus 228 ~aEal~~~ggs~~~~f~EMsFsIEK~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAEaELanILStEImlEINReii~~l~ 307 (514) .+|+++.+|++++++|+||+|+||||+|||||||||||||||||||||||||||||+||+||||+|||+|||||||++|+ T Consensus 240 ~aEale~lg~~s~~~f~EMaFsIeK~tVtAKSRaLKAEYTiELAQDLKAIHGLDAEtELsNILStEImlEINREii~~i~ 319 (528) T protein:vir:66 240 IAEIQEGFNGSSNNPWAEMSMRIDKQVVEAKSRQLKARYSIEVAQDLRAVHGMDADAELNAILANEVLLEINREIVDVIN 319 (528) T ss_pred hhhhhcccCCCcccchhhcceEEEeEEEEeeccceeccccHHHHHHHHHhcCCChHHHHHHHHHHHHHHHhhHHHHhhhh Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hheeecccccccccCC-cceeccccccccccchhhHHHHHHHHHHHHHHHHHHHHhcccccccEEEEChhHHhHHhhccc Q lcl|Aclame:pro 308 SQAQIGKSGWTQGAGA-AGVFDFSDAVDVKGARWAGEAYKALLIQIEKEANEIGRQTGRGNGNFIIASRNVVSALSMTDT 386 (514) Q Consensus 308 ~~a~v~~~~~~~~v~~-~g~~dl~~~~d~~~~rwa~e~~r~L~~~i~~~a~~I~~~T~r~~~n~~v~S~~va~~L~~~g~ 386 (514) ..|++++.+|+++|++ +|+|||++++|++|+||++||+|+|++|||+|||+|+|+|+||+||||||||+||++|+++|. T Consensus 320 ~~a~~~~~~~t~~~~~~aG~~dl~~~~d~~g~rw~~e~~k~L~~~i~~~an~I~~~T~r~~gn~vi~S~~Va~~L~~~g~ 399 (528) T protein:vir:66 320 FTAQVGKTGMTQTVGSKAGVFDLQDPIDTRGARWAGESFKSLIYQIDKEAAEIARQTGRGAGNFVIASRNVVNILASADQ 399 (528) T ss_pred heeeeeeeeeeeccccccceeecccccccccchhHHHHHHHHHHHHHHHHHHHHHhhccccccEEEEchHHHHHHhhccc Confidence 9999999999999977 699999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccchhccccCccccccccCceEEEEecCceEEEecCCCccceEEEEEecCCCcccceeeccccccccccccCCccccce Q lcl|Aclame:pro 387 LVGPAAQGMQDGSMNTDTNQTVFAGVLGGRFKVYIDQYAVNDYFTVGFKGSTEMDAGVFYSPYVPLTPLRGSDSKNFQPV 466 (514) Q Consensus 387 ~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~kG~~~~~~~~fy~PYv~~~~~~~~dp~s~qp~ 466 (514) ++++++++.... .++|+++.+|+|+|+|||+||||||+++|||+|||||++++|+||||||||||+|++++||+||||+ T Consensus 400 ~~~~~~~~~~~~-~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~~~glfyaPYv~l~~~~~~dp~sfqP~ 478 (528) T protein:vir:66 400 GISLAMQGAAKG-LNTDTTKAVFAGVLAGKYKVFIDQYARQDYFTVGYKGDNEMDAGIYYAPYVALTPLRATDPQSFHPV 478 (528) T ss_pred cccccccccccc-cccCCCCceeEEEecCceEEEecCCCCcceEEEEEeCCcccccceeecccccceeeEeeCCccccce Confidence 999999887555 7899999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eeeeeeeeeeecCccccccC--cceeecCcchhhhccccceeeeeeeecC Q lcl|Aclame:pro 467 IGFKTRYGVQVNPFADPTAS--ATKVGNGAPVAASMGKNAYFRRVFVKGL 514 (514) Q Consensus 467 ~~~~tRY~l~~nPf~~~~~~--~~~i~~~~~~~~~~~~~~~~r~~~V~~~ 514 (514) |||||||||++|||++..++ +.||+++++|++++|||+|||||+|||| T Consensus 479 ~g~~tRY~l~vNP~~~~~~~~~~~ri~~g~~~~~~ag~n~~~r~~~Vk~~ 528 (528) T protein:vir:66 479 LGFKTRYGIGINPFADSKSQEPSARITSGMLSKDSVGKNAYFRRVWVKGC 528 (528) T ss_pred eeeeeeeceeecCcccccCccccccccccchhhhhcCccceeEEeeeccC Confidence 99999999999999997643 6999999999999999999999999999 No 10 >protein:vir:107947 Length: 519 # NCBI annotation: gp23 major head protein # Family: family:all:364 # MgeID: mge:2002 # MgeName: JS98 # Cross-refs: genbank:acc:YP_001595301;genbank:gi:161622607;genbank:GeneID:5783666 Probab=100.00 E-value=7.6e-238 Score=1320.56 Aligned_cols=510 Identities=61% Similarity=0.985 Sum_probs=460.8 Q ss_pred CcchhhhhhhhccccccccccccchhhhhhhhhhhhHHHHHHhcccccchhhhhhhcccccccccccccccccccccccc Q lcl|Aclame:pro 1 MNLTEKWKDLLEAEGADMPEIATATKQKIMSKIFENQDRDINNDPMYRDPQLVEAFNAGLNEAVVNGDHGYDPANIAQGV 80 (514) Q Consensus 1 ~~l~~kw~p~l~~~~~~~~~i~~~~~~~~~~~~~enq~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~g~~~~~~~~st 80 (514) .+|+|||+||||||| +|+|++.|||+|+++|||||||+|+|+++|++++++++|+.+|+|++++++|||++++|++|+ T Consensus 4 ~~l~~kw~p~l~~~~--~~~i~~~~~~~i~~~~~en~~~~~~~~~~~~~~~~~~~~~~~l~e~~~~~~~~~~~t~i~~~~ 81 (519) T protein:vir:10 4 NALVQKWSALLENEA--LPEIVGASKQAIIAKIFENQEQDILTAPEYRDEKISEAFGSFLTEAEIGGDHGYDATNIAAGQ 81 (519) T ss_pred hHHHHHhHHhhcccc--cchhhhhhhHHHHHHHHHHHHHHhhhcccccchHHHHHHhhhcchhccCCccccCcccccccc Confidence 789999999999997 899999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccccccccceeeehhhhhhhhhhhhcceeEEecCCcccceeeeeeeeecCCCCc--cccccccccCCCCccCcccccccc Q lcl|Aclame:pro 81 TTGAVTNIGPTVMGMVRRAIPQLIAFDIAGVQPMTGPTSQVFTLRSVYGKDPLT--GAEAFHPTRQADASFSGQAAASTI 158 (514) Q Consensus 81 ~tg~v~~~~P~l~~l~Rra~~~LIa~DI~GVQPmTgPTGLIFAMRSrY~~~~~t--g~EA~~~~nEadt~fSG~~~~~~~ 158 (514) +|+++.++||+||+|+|||+|||||+||||||||||||||||||||||++++++ +.|+|+++||+|++|||+++.... T Consensus 82 ~t~~v~~~~P~l~~l~rRa~p~LIa~DIwGVQPMTgPTGLIFAMRsrY~n~~~~~~g~ea~~~~nEadt~fSG~~~~~~~ 161 (519) T protein:vir:10 82 TSGAVTQIGPAVMGMVRRAIPHLIAFDICGVQPLNNPTGQVFALRAVYGKDPIAAGAKEAFHPMYAPNAMFSGQGAAETF 161 (519) T ss_pred ccccccccchhHHHHHHHHHHhhhhhhhheeecCCchhhhhheeeeeecCCccccccccccccccccccccCcccccccc Confidence 999999999999999999999999999999999999999999999999998654 789999999999999999887665 Q ss_pred ccccccccccccccccccccccccccccccccccccccccccCccccccccccccccccccccccccchhhhccccCCCC Q lcl|Aclame:pro 159 ADFPTTGAATDGTPYKAEVTTSGGDVSMRYFLALGAVTLAVAGQMTATEYTDGVAGGLLVEIDAGMATSQAELQENFNGS 238 (514) Q Consensus 159 ~~~~~~~~~t~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~y~~~~GmtTa~aEal~~~ggs 238 (514) ...+.......+..........+ +..............++.....+.......+.+.+|+++.||+|+.+|+++.+|++ T Consensus 162 ~~~~~~~~~~~g~~~~~~~~~s~-~~~~~~~~~~t~~ag~t~~~~~~~a~~~~~~~~~~~~~~~gmsTa~aEal~~lggs 240 (519) T protein:vir:10 162 EALAASKVLEVGKIYSHFFEATG-SAHFQAVEAVTVDAGATDAAKLDAAVTALVEAGQLAEIAEGMATSIAELQEGFNGS 240 (519) T ss_pred ccccccccccccccccccccccc-cceeccccccccCCCCcCccccccccccccccccccccccccccchhhccccCCCc Confidence 55554444444433333322221 11111111112222222233344456678888899999999999999999999999 Q ss_pred CCcccccceeEEEEEEEEeecccccccccHHHHHHHHhhcCCChhHHHHHHHHHHHHHHhhHHHHHHHhhheeecccccc Q lcl|Aclame:pro 239 SNNEWNEMSFRIDKQVVEAKSRQLKAQYSIELAQDLRAVHGLDADAELSGILANEVMVELNREIVNLVNSQAQIGKSGWT 318 (514) Q Consensus 239 ~~~~f~EMsFsIEK~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAEaELanILStEImlEINReii~~l~~~a~v~~~~~~ 318 (514) ++++|+||+|+||||+|||||||||||||||||||||||||||||+||+||||||||+|||||||++|+..|++++.+++ T Consensus 241 s~~~f~EMaFsIeKvTVtAKSRaLKAEYTiELAQDLKAVHGLDAEtELaNILSTEImlEINReii~~i~~sa~~~~~g~t 320 (519) T protein:vir:10 241 TDNPWNEMGFRIDKQVIEAKSRQLKASYSIELAQDLRAVHGMDADAELSGILATEIMLEINREVIDWINYSAQVGKSGMT 320 (519) T ss_pred cccchhhhceeEEEEEEeeecccccccccHHHHHHHHHhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhhhhhcceeecc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cccCC-cceeccccccccccchhhHHHHHHHHHHHHHHHHHHHHhcccccccEEEEChhHHhHHhhccccccchhccccC Q lcl|Aclame:pro 319 QGAGA-AGVFDFSDAVDVKGARWAGEAYKALLIQIEKEANEIGRQTGRGNGNFIIASRNVVSALSMTDTLVGPAAQGMQD 397 (514) Q Consensus 319 ~~v~~-~g~~dl~~~~d~~~~rwa~e~~r~L~~~i~~~a~~I~~~T~r~~~n~~v~S~~va~~L~~~g~~~~~~~~~~~~ 397 (514) .+++. +|+|||+++.|+.++||++||+|+|++|||+|||+|+|+|+||+|||||||||||++|+++|.++++++++... T Consensus 321 ~~~~~~aGv~d~~~~~d~~~~rw~~e~~k~L~~~i~~~an~I~~~T~r~~gn~ii~S~~Va~~L~~~g~~~~~~~~~~~~ 400 (519) T protein:vir:10 321 NTVGAKAGVFDFQDPIDIRGARWAGESFKALLFQIDKEAAEIARQTGRGAGNFIIASRNVVNVLAAVDTSVSYAAQGLGQ 400 (519) T ss_pred cCcccccceeecccccccccchHHHHHHHHHHHHHHHHHHHHHHhhccccccEEEEchHHHHHHhhccchhccccccccc Confidence 88877 59999999999999999999999999999999999999999999999999999999999999999999988766 Q ss_pred ccccccccCceEEEEecCceEEEecCCCccceEEEEEecCCCcccceeeccccccccccccCCccccceeeeeeeeeeee Q lcl|Aclame:pro 398 GSMNTDTNQTVFAGVLGGRFKVYIDQYAVNDYFTVGFKGSTEMDAGVFYSPYVPLTPLRGSDSKNFQPVIGFKTRYGVQV 477 (514) Q Consensus 398 ~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~kG~~~~~~~~fy~PYv~~~~~~~~dp~s~qp~~~~~tRY~l~~ 477 (514) . .++|+++++|+|+|+|||+||||||+++|||+|||||++++|+||||||||||+|++++||+||||+|||||||||++ T Consensus 401 ~-~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~~~glfyaPYv~l~~~~~~dp~sfqP~~g~~tRY~l~~ 479 (519) T protein:vir:10 401 G-FNVDTTKAVFAGVLGGKYRVYIDQYARSDYFTIGYKGSNEMDAGIYYAPYVALTPLRGSDPKNFQPVMGFKTRYGIGI 479 (519) T ss_pred c-ccccCCCceEEEEecCceEEEecCCCCcceEEEEEecCcccccceeeccccccccccccCCccccceeeeeeeeceee Confidence 6 689999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cCccccccC--cceeecCcc-hhhhccccceeeeeeeecC Q lcl|Aclame:pro 478 NPFADPTAS--ATKVGNGAP-VAASMGKNAYFRRVFVKGL 514 (514) Q Consensus 478 nPf~~~~~~--~~~i~~~~~-~~~~~~~~~~~r~~~V~~~ 514 (514) |||++..++ ..||+|+|| |++..++|.|||||+|||| T Consensus 480 NP~~~~~~~~~~~~i~~g~~~~a~~~~~n~y~r~v~v~~~ 519 (519) T protein:vir:10 480 NPFADPAAQAPTKRIQNGMPDIVNSLGLNGYFRRVYVKGI 519 (519) T ss_pred cCcccccccCccceeccCchhhhccccCceeeeeeeeecC Confidence 999987543 479999987 6788899999999999999 No 11 >protein:vir:103463 Length: 521 # NCBI annotation: major head subunit precursor # Family: family:all:364 # MgeID: mge:1542 # MgeName: RB32 # Cross-refs: genbank:acc:YP_803115;genbank:gi:116326395;genbank:GeneID:4405492 Probab=100.00 E-value=5.8e-237 Score=1315.72 Aligned_cols=509 Identities=61% Similarity=0.975 Sum_probs=454.6 Q ss_pred CcchhhhhhhhccccccccccccchhhhhhhhhhhhHHHHHHhcccccchhhhhhhcccccccccccccccccccccccc Q lcl|Aclame:pro 1 MNLTEKWKDLLEAEGADMPEIATATKQKIMSKIFENQDRDINNDPMYRDPQLVEAFNAGLNEAVVNGDHGYDPANIAQGV 80 (514) Q Consensus 1 ~~l~~kw~p~l~~~~~~~~~i~~~~~~~~~~~~~enq~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~g~~~~~~~~st 80 (514) .+|+|||+||||||| +|+|++. ||+|+|+|||||||+++|+|+|++++++++|+++|+|++++++||++++||+||+ T Consensus 7 ~~l~~kw~p~l~~~~--~~~i~~~-~~~~~a~~~enq~~~~~~~~~~~~~~~~~~~~~~l~e~~~~~~~~~~~~~i~es~ 83 (521) T protein:vir:10 7 AELLNKWKPLLEGEG--LPEIANS-KQAIIAKIFENQEKDFQTAPEYKDEKIAQAFGSFLTEAEIGGDHGYNATNIAAGQ 83 (521) T ss_pred HHHHHhhhhhhccCC--CCccccc-hhhhhhhhhhhhhhhhhhccccchhHHHHHHhhhhhhhcccCccccccccccccc Confidence 679999999999997 8999987 9999999999999999999999999999999999999999999999999999999 Q ss_pred ccccccccceeeehhhhhhhhhhhhcceeEEecCCcccceeeeeeeeecCCCCc--cccccccccCCCCccCcccccccc Q lcl|Aclame:pro 81 TTGAVTNIGPTVMGMVRRAIPQLIAFDIAGVQPMTGPTSQVFTLRSVYGKDPLT--GAEAFHPTRQADASFSGQAAASTI 158 (514) Q Consensus 81 ~tg~v~~~~P~l~~l~Rra~~~LIa~DI~GVQPmTgPTGLIFAMRSrY~~~~~t--g~EA~~~~nEadt~fSG~~~~~~~ 158 (514) +|++|+++||+||+|||||+|||||+||||||||||||||||||||||++++++ +.|+||+++++|+.|||+.+.... T Consensus 84 ~t~~v~~~~P~Li~lvRra~p~LIa~DIwGVQPMTgPTGLIFAMRsrY~~q~~~~~g~eaf~~~~~ada~fSG~~~at~~ 163 (521) T protein:vir:10 84 TSGAVTQIGPAVMGMVRRAIPNLIAFDICGVQPMNSPTGQVFALRAVYGKDPIAAGAKEAFHPMYGPDAMFSGQGAAKKF 163 (521) T ss_pred cccccccCCchhhhHHHHHHhhhhhhhceeeccCCchhhhheeeeeeccCCccccccccccchhcccccccccccccccc Confidence 999999999999999999999999999999999999999999999999999765 789999999999999999887665 Q ss_pred ccccccccccccccccccccccccccccccccccccccccccCccccccccccccccccccccccccchhhhccccCCCC Q lcl|Aclame:pro 159 ADFPTTGAATDGTPYKAEVTTSGGDVSMRYFLALGAVTLAVAGQMTATEYTDGVAGGLLVEIDAGMATSQAELQENFNGS 238 (514) Q Consensus 159 ~~~~~~~~~t~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~y~~~~GmtTa~aEal~~~ggs 238 (514) ......+....+.......... +.+....................+.......+.+.+|+++.||+|+++|+|+.+|++ T Consensus 164 s~~~~~~~~~~Gd~~~~~~~~~-g~~~~~~~~~~t~~~t~~d~~~~~~~~~~~~~~~~~y~~~~GmsTa~aEal~~~g~s 242 (521) T protein:vir:10 164 AALAASTQTTVGDIYTHFFQDT-GTVYLQASAQVTISSTADDAAKLDAEIKKQMEAGALVEIAEGMATSIAELQESFNGS 242 (521) T ss_pred cccccccccccccccccccccc-ccceecccccccCCCcccccccccccccccccccceeecccccchhhHhhhccCCCC Confidence 5444433333333322221111 111111111111111111112233456667888899999999999999999999999 Q ss_pred CCcccccceeEEEEEEEEeecccccccccHHHHHHHHhhcCCChhHHHHHHHHHHHHHHhhHHHHHHHhhheeecccccc Q lcl|Aclame:pro 239 SNNEWNEMSFRIDKQVVEAKSRQLKAQYSIELAQDLRAVHGLDADAELSGILANEVMVELNREIVNLVNSQAQIGKSGWT 318 (514) Q Consensus 239 ~~~~f~EMsFsIEK~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAEaELanILStEImlEINReii~~l~~~a~v~~~~~~ 318 (514) ++++|+||+|+||||+|||||||||||||||||||||||||||||+||+||||||||+|||||||++|+..+++++.+++ T Consensus 243 s~~~f~EMaFsIeKvtVtAKSRaLKAEYTiELAQDLKAVHGLDAEtELaNILSTEImlEINReii~~i~~sa~~~~~g~t 322 (521) T protein:vir:10 243 TDNPWNEMGFRIDKQVIEAKSRQLKAAYSIELAQDLRAVHGMDADAELSGILATEIMLEINREVVDWINYSAQVGKSGMT 322 (521) T ss_pred ccccccceeeEEEEEEEeeeccceeccccHHHHHHHHHhcCCChHHHHHHHHHHHHHHHhhHHHhhhhhheeeeeeeeee Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999998 Q ss_pred cccCC-cceeccccccccccchhhHHHHHHHHHHHHHHHHHHHHhcccccccEEEEChhHHhHHhhccccccchhccccC Q lcl|Aclame:pro 319 QGAGA-AGVFDFSDAVDVKGARWAGEAYKALLIQIEKEANEIGRQTGRGNGNFIIASRNVVSALSMTDTLVGPAAQGMQD 397 (514) Q Consensus 319 ~~v~~-~g~~dl~~~~d~~~~rwa~e~~r~L~~~i~~~a~~I~~~T~r~~~n~~v~S~~va~~L~~~g~~~~~~~~~~~~ 397 (514) ..++. +|+|||+++.|+.++||++||+|+|++|||+|||+|+|+|+||+|||||||||||++|+|+|.++++++++... T Consensus 323 ~~~~~~~G~~d~~~~~d~~~~~~~~e~~k~L~~~i~~~an~i~~~T~r~~~n~~i~S~~Va~~L~~~~~~~~~~~~~~~~ 402 (521) T protein:vir:10 323 LTPGSKAGVFDFQDPIDIRGARWAGESFKALLFQIDKEAVEIARQTGRGEGNFIIASRNVVNVLASVDTGISYAAQGLAT 402 (521) T ss_pred eccCccccceecccccccccchHHHHHHHHHHHHHHHHHHHHHHhcccccceEEEEchHHHHHHhhcccccccccccccc Confidence 87744 89999999999999999999999999999999999999999999999999999999999999999999998665 Q ss_pred ccccccccCceEEEEecCceEEEecCCCccceEEEEEecCCCcccceeeccccccccccccCCccccceeeeeeeeeeee Q lcl|Aclame:pro 398 GSMNTDTNQTVFAGVLGGRFKVYIDQYAVNDYFTVGFKGSTEMDAGVFYSPYVPLTPLRGSDSKNFQPVIGFKTRYGVQV 477 (514) Q Consensus 398 ~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~kG~~~~~~~~fy~PYv~~~~~~~~dp~s~qp~~~~~tRY~l~~ 477 (514) + .++|+++++|||+|+|||+||||||+++|||+|||||++++|+||||||||||+|++++||+||||+|||||||||++ T Consensus 403 g-~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~~~glfyaPYv~l~~~~~~dp~sfqP~~g~~tRY~l~~ 481 (521) T protein:vir:10 403 G-FNTDTTKSVFAGVLGGKYRVYIDQYAKQDYFTVGYKGPNEMDAGIYYAPYVALTPLRGSDPKNFQPVMGFKTRYGIGI 481 (521) T ss_pred c-ccccCCCceEEEEecCceEEEecCCCCcceEEEEEeCCcccccceeeccccccccccccCCccccceeeeeeeeceee Confidence 5 678999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cCccccccC-cceeecCcchh--hhccccceeeeeeeecC Q lcl|Aclame:pro 478 NPFADPTAS-ATKVGNGAPVA--ASMGKNAYFRRVFVKGL 514 (514) Q Consensus 478 nPf~~~~~~-~~~i~~~~~~~--~~~~~~~~~r~~~V~~~ 514 (514) |||++..++ ..+++++++|+ +..++|.|||||+|||| T Consensus 482 NP~~~~~~~~~~~~i~~~~~~~~a~~~~~sy~r~v~v~~l 521 (521) T protein:vir:10 482 NPFAESAAQAPASRIQSGMPSILNSLGKNAYFRRVYVKGI 521 (521) T ss_pred cCcccccCCccceeecccchhhhccccccceeeeeeecCC Confidence 999998765 47788888887 46689999999999999 No 12 >protein:vir:7214 Length: 521 # NCBI annotation: gp23 major head protein # Family: family:all:364 # MgeID: mge:142 # MgeName: T4 # Cross-refs: genbank:acc:NP_049787;genbank:gi:9632597;genbank:GeneID:1258751 Probab=100.00 E-value=2.1e-236 Score=1312.70 Aligned_cols=509 Identities=61% Similarity=0.970 Sum_probs=458.1 Q ss_pred CcchhhhhhhhccccccccccccchhhhhhhhhhhhHHHHHHhcccccchhhhhhhcccccccccccccccccccccccc Q lcl|Aclame:pro 1 MNLTEKWKDLLEAEGADMPEIATATKQKIMSKIFENQDRDINNDPMYRDPQLVEAFNAGLNEAVVNGDHGYDPANIAQGV 80 (514) Q Consensus 1 ~~l~~kw~p~l~~~~~~~~~i~~~~~~~~~~~~~enq~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~g~~~~~~~~st 80 (514) .+|+|||+||||||| +|+|++. ||+|+|+|||||||+++|+|+|++++++++|+++|+|++++++||++++||+||+ T Consensus 7 ~~l~~kw~p~l~~~~--~~~i~~~-~~~~~a~~~enq~~~~~~~~~~~~~~~~~~~~~~l~e~~~~~~~~~~~~~iaes~ 83 (521) T protein:vir:72 7 AELLNKWKPLLEGEG--LPEIANS-KQAIIAKIFENQEKDFQTAPEYKDEKIAQAFGSFLTEAEIGGDHGYNATNIAAGQ 83 (521) T ss_pred HHHHHhhhhhhccCC--CCccccc-hhhhhhhhhhhhhhhhhhcccccchHHHHHHhhhhhhhcccCccccCcccccccc Confidence 679999999999997 8999987 9999999999999999999999999999999999999999999999999999999 Q ss_pred ccccccccceeeehhhhhhhhhhhhcceeEEecCCcccceeeeeeeeecCCCCc--cccccccccCCCCccCcccccccc Q lcl|Aclame:pro 81 TTGAVTNIGPTVMGMVRRAIPQLIAFDIAGVQPMTGPTSQVFTLRSVYGKDPLT--GAEAFHPTRQADASFSGQAAASTI 158 (514) Q Consensus 81 ~tg~v~~~~P~l~~l~Rra~~~LIa~DI~GVQPmTgPTGLIFAMRSrY~~~~~t--g~EA~~~~nEadt~fSG~~~~~~~ 158 (514) +|++|+++||+||+|||||+|||||+||||||||||||||||||||||++++++ ++||||+++++|+.|||+++.... T Consensus 84 ~t~~v~~~~P~Li~lvRra~p~LIa~DIwGVQPMTgPTGLIFAMRsrY~~q~~~~~g~ea~~~e~~~da~fSG~~~~~~~ 163 (521) T protein:vir:72 84 TSGAVTQIGPAVMGMVRRAIPNLIAFDICGVQPMNSPTGQVFALRAVYGKDPVAAGAKEAFHPMYGPDAMFSGQGAAKKF 163 (521) T ss_pred cccccccCCchhhhHHHHHHhhhhhhhceeeccCCchhhhheeeeeeecCCCCCcccccccchhcccccccccccccccc Confidence 999999999999999999999999999999999999999999999999998754 789999999999999999877655 Q ss_pred ccccccccccccccccccccccccccccccccccccccccccCccccccccccccccccccccccccchhhhccccCCCC Q lcl|Aclame:pro 159 ADFPTTGAATDGTPYKAEVTTSGGDVSMRYFLALGAVTLAVAGQMTATEYTDGVAGGLLVEIDAGMATSQAELQENFNGS 238 (514) Q Consensus 159 ~~~~~~~~~t~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~y~~~~GmtTa~aEal~~~ggs 238 (514) ......+....++...+..... +..+.............+....++......++.+..|+++.||+|+.+|+++.+|++ T Consensus 164 ~~~~~~~~~a~Gd~~~~~~~~~-gt~~~~~~~~~~~~~g~t~~~~t~~~v~~~~~a~~~y~~g~gm~Ta~aEal~~~g~s 242 (521) T protein:vir:72 164 PALAASTQTTVGDIYTHFFQET-GTVYLQASVQVTIDAGATDAAKLDAEIKKQMEAGALVEIAEGMATSIAELQEGFNGS 242 (521) T ss_pred cccccccccccccccccccccc-cccccccccccccCCCCCCccccccccccccccCceeeeecccchhhhhhhcccCCc Confidence 4444444444444333322221 122222222222222233333444556778888899999999999999999999999 Q ss_pred CCcccccceeEEEEEEEEeecccccccccHHHHHHHHhhcCCChhHHHHHHHHHHHHHHhhHHHHHHHhhheeecccccc Q lcl|Aclame:pro 239 SNNEWNEMSFRIDKQVVEAKSRQLKAQYSIELAQDLRAVHGLDADAELSGILANEVMVELNREIVNLVNSQAQIGKSGWT 318 (514) Q Consensus 239 ~~~~f~EMsFsIEK~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAEaELanILStEImlEINReii~~l~~~a~v~~~~~~ 318 (514) ++++|+||+|+||||+|||||||||||||||||||||||||||||+||+||||||||+|||||||++|+..+++++.+++ T Consensus 243 s~~~f~EMaFsIeK~tVtAKSRaLKAEYTiELAQDLKAVHGLDAEtELaNILSTEImlEINReii~~i~~sa~~g~~g~t 322 (521) T protein:vir:72 243 TDNPWNEMGFRIDKQVIEAKSRQLKAAYSIELAQDLRAVHGMDADAELSGILATEIMLEINREVVDWINYSAQVGKSGMT 322 (521) T ss_pred ccccccceeeEEEEEEEeeeccceeccccHHHHHHHHHhcCCChHHHHHHHHHHHHHHHhhHHHhhhhhheeeeeeeeee Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999998 Q ss_pred cccCC-cceeccccccccccchhhHHHHHHHHHHHHHHHHHHHHhcccccccEEEEChhHHhHHhhccccccchhccccC Q lcl|Aclame:pro 319 QGAGA-AGVFDFSDAVDVKGARWAGEAYKALLIQIEKEANEIGRQTGRGNGNFIIASRNVVSALSMTDTLVGPAAQGMQD 397 (514) Q Consensus 319 ~~v~~-~g~~dl~~~~d~~~~rwa~e~~r~L~~~i~~~a~~I~~~T~r~~~n~~v~S~~va~~L~~~g~~~~~~~~~~~~ 397 (514) ..++. +|+|||+++.|+.++||++||+|+|++|||+|||+|+|+|+||+|||||||||||++|+|+|.++++++++... T Consensus 323 ~~~~~~~G~~d~~~~~d~~~~~~~~e~~k~L~~~i~~~an~i~~~T~r~~~n~~i~S~~Va~~L~~~~~~~~~~~~~~~~ 402 (521) T protein:vir:72 323 LTPGSKAGVFDFQDPIDIRGARWAGESFKALLFQIDKEAVEIARQTGRGEGNFIIASRNVVNVLASVDTGISYAAQGLAT 402 (521) T ss_pred eccCccccceecccccccccchHHHHHHHHHHHHHHHHHHHHHHhcccccceEEEEchHHHHHHhhcccccccccccccc Confidence 87744 89999999999999999999999999999999999999999999999999999999999999999999998655 Q ss_pred ccccccccCceEEEEecCceEEEecCCCccceEEEEEecCCCcccceeeccccccccccccCCccccceeeeeeeeeeee Q lcl|Aclame:pro 398 GSMNTDTNQTVFAGVLGGRFKVYIDQYAVNDYFTVGFKGSTEMDAGVFYSPYVPLTPLRGSDSKNFQPVIGFKTRYGVQV 477 (514) Q Consensus 398 ~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~kG~~~~~~~~fy~PYv~~~~~~~~dp~s~qp~~~~~tRY~l~~ 477 (514) + .++|+++++|+|+|+|||+||||||+++|||+|||||++++|+||||||||||+|++++||+||||+|||||||||++ T Consensus 403 g-~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~KG~~~~~~glfyaPYv~l~~~~~~dp~sfqP~~g~~tRY~l~~ 481 (521) T protein:vir:72 403 G-FSTDTTKSVFAGVLGGKYRVYIDQYAKQDYFTVGYKGPNEMDAGIYYAPYVALTPLRGSDPKNFQPVMGFKTRYGIGI 481 (521) T ss_pred c-ccccCCCceEEEEccCceEEEecCCCCcceEEEEEeCCcccccceeeccccccccccccCCccccceeeeeeeeceee Confidence 5 678999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred cCccccccC-cceeecCcchh--hhccccceeeeeeeecC Q lcl|Aclame:pro 478 NPFADPTAS-ATKVGNGAPVA--ASMGKNAYFRRVFVKGL 514 (514) Q Consensus 478 nPf~~~~~~-~~~i~~~~~~~--~~~~~~~~~r~~~V~~~ 514 (514) |||++..++ +.+++++++|+ +..++|.|||||+|||| T Consensus 482 NP~~~~~~~~~a~~i~~~~~~~~a~~~~~sy~r~v~v~~l 521 (521) T protein:vir:72 482 NPFAESAAQAPASRIQSGMPSILNSLGKNAYFRRVYVKGI 521 (521) T ss_pred cCcccccCcccceeecCcChhhhcCccccceeeeeeecCC Confidence 999998664 58888888887 55689999999999999 No 13 >protein:vir:104915 Length: 470 # NCBI annotation: T4-like major capsid protein # Family: family:all:364 # MgeID: mge:1630 # MgeName: P-SSM2 # Cross-refs: genbank:acc:YP_214367;genbank:gi:61806007;genbank:GeneID:3294435 Probab=100.00 E-value=9.3e-221 Score=1226.88 Aligned_cols=454 Identities=39% Similarity=0.674 Sum_probs=401.1 Q ss_pred CcchhhhhhhhccccccccccccchhhhhhhhhhhhHHHHHHhcccccchhhhhhhccccccc-cccccccccccccccc Q lcl|Aclame:pro 1 MNLTEKWKDLLEAEGADMPEIATATKQKIMSKIFENQDRDINNDPMYRDPQLVEAFNAGLNEA-VVNGDHGYDPANIAQG 79 (514) Q Consensus 1 ~~l~~kw~p~l~~~~~~~~~i~~~~~~~~~~~~~enq~~~~~~~~~~~~~~~~~~~~~~~~~a-~~~~~~g~~~~~~~~s 79 (514) .+|+|||+||||||| +|||++.|||+|+++|||||||+|+|++. +|.|+ +++++||+++.+|+|| T Consensus 7 e~l~~kw~p~l~~~~--~~~i~~~~~~~v~a~l~enq~~~~~~~~~------------~l~e~~~~~~~~~~~~~~i~~s 72 (470) T protein:vir:10 7 EYLQEKWAPILDYDG--LDPIKDSHRRSVTAVLLENQEKELREERN------------FLSEAPNVNTNSGATAGFSADA 72 (470) T ss_pred HHHHHhhhhhhcCCc--cchhcchhhhhhhhhhhhhhHHHHhhccc------------hhhhhhhccccccccccccccc Confidence 689999999999997 89999999999999999999999999996 46666 7999999999999999 Q ss_pred cccccccccceeeehhhhhhhhhhhhcceeEEecCCcccceeeeeeeeecCCCCccccccccccCCCCccCccccccccc Q lcl|Aclame:pro 80 VTTGAVTNIGPTVMGMVRRAIPQLIAFDIAGVQPMTGPTSQVFTLRSVYGKDPLTGAEAFHPTRQADASFSGQAAASTIA 159 (514) Q Consensus 80 t~tg~v~~~~P~l~~l~Rra~~~LIa~DI~GVQPmTgPTGLIFAMRSrY~~~~~tg~EA~~~~nEadt~fSG~~~~~~~~ 159 (514) |+|++|++|||+||+|||||+|||||+|||||||||||||||||||+||.++ +|+|+|| +|+|++|||..++.... T Consensus 73 t~t~~v~~~~P~Li~lvRra~p~LIa~DIwGVQPMTgPTGLIFAmRsrY~n~--sG~Eaff--nEA~T~fSG~~~~~~~~ 148 (470) T protein:vir:10 73 TAAGPVAGFDPVLISLIRRSMPNLVAYDLAGVQPMNGPTGLIFAMRSRYKTQ--SGTEALF--NEADTAFSGQPDGLDDT 148 (470) T ss_pred cccccccccCchhhhhHHHHHhhhhhhhhheeecCCccceeeeEEEEEecCC--Cccceee--ecCCcccCccccccccc Confidence 9999999999999999999999999999999999999999999999999998 5889999 99999999987776543 Q ss_pred ccccccccc-ccccccccccccccccccccccccccccccccCccccccccccccccccccccccccchhhhccccCCCC Q lcl|Aclame:pro 160 DFPTTGAAT-DGTPYKAEVTTSGGDVSMRYFLALGAVTLAVAGQMTATEYTDGVAGGLLVEIDAGMATSQAELQENFNGS 238 (514) Q Consensus 160 ~~~~~~~~t-~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~y~~~~GmtTa~aEal~~~ggs 238 (514) .....+... .+.... .. .. +... ........+....|+++.||+|+.+|.+ |++ T Consensus 149 ~~~~~~~a~~~g~~~~---~~----------~g-------t~~~--~~~~~~~~a~~~~y~~~~GMsTa~aE~l---g~s 203 (470) T protein:vir:10 149 SGFTATGANNVGLGTT---AQ----------QG-------SNPG--LLNSTAAQTNATDYNVGQGMRTDSAEDL---GDG 203 (470) T ss_pred cccccccccccccccc---cc----------cc-------cccc--ccccccccccccccccccccchHHhhhc---CCC Confidence 322211110 000000 00 00 0000 0011122344567889999999999977 678 Q ss_pred CCcccccceeEEEEEEEEeecccccccccHHHHHHHHhhcCCChhHHHHHHHHHHHHHHhhHHHHHHHhhheeecccccc Q lcl|Aclame:pro 239 SNNEWNEMSFRIDKQVVEAKSRQLKAQYSIELAQDLRAVHGLDADAELSGILANEVMVELNREIVNLVNSQAQIGKSGWT 318 (514) Q Consensus 239 ~~~~f~EMsFsIEK~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAEaELanILStEImlEINReii~~l~~~a~v~~~~~~ 318 (514) ++++|+||+|+||||+|||||||||||||||||||||||||||||+||+||||+|||+|||||||++|+++|+ +||+ T Consensus 204 ~~~~f~EMaFsIeK~tVtAKSRaLKAeYTiELAQDLKAiHGLDAEtELaNILStEImlEINReii~~l~~~a~---~~k~ 280 (470) T protein:vir:10 204 TGDQFNQMAFSIEKVTVTAKSRALKAEYSLELAQDLKAIHGLNAEAELANILSTEILAEINREVIRTIYNVAE---PGAQ 280 (470) T ss_pred CCcccceeeeEEEEEEEEeeccceeccccHHHHHHHHHhcCCChhHHHHHHHHHHHHHHhcHHHHHHHhhhhh---hcee Confidence 8899999999999999999999999999999999999999999999999999999999999999999988876 6778 Q ss_pred cccCCcceeccccccccccchhhHHHHHHHHHHHHHHHHHHHHhcccccccEEEEChhHHhHHhhccccccchhccccCc Q lcl|Aclame:pro 319 QGAGAAGVFDFSDAVDVKGARWAGEAYKALLIQIEKEANEIGRQTGRGNGNFIIASRNVVSALSMTDTLVGPAAQGMQDG 398 (514) Q Consensus 319 ~~v~~~g~~dl~~~~d~~~~rwa~e~~r~L~~~i~~~a~~I~~~T~r~~~n~~v~S~~va~~L~~~g~~~~~~~~~~~~~ 398 (514) .+++++|+|||+++.| +||++|++|+|++||++++|+|+|+|+||+||||||||+||++|+++|||++.|... + T Consensus 281 ~~~~~~Gv~Dl~~~~~---gr~~~e~~~~l~~~i~~ean~i~~~t~r~~~n~~i~S~~Va~~La~sG~l~~~~~~~---~ 354 (470) T protein:vir:10 281 ANVAAAGTFDLDTDSN---GRWSVEKFKGLIFQIERDANAIAQRTRRGKGNMILCSADVASALTMAGVLDYTPALN---A 354 (470) T ss_pred ccccccceEEeecccc---hhHHHHHHHHHHHHHHHHHHHHHHhhccccceEEEEchhHHhHhhhccccccccccc---c Confidence 8889999999997776 799999999999999999999999999999999999999999999999999877754 4 Q ss_pred cccccccCceEEEEecCceEEEecCC------CccceEEEEEecCCCcccceeeccccccccccccCCccccceeeeeee Q lcl|Aclame:pro 399 SMNTDTNQTVFAGVLGGRFKVYIDQY------AVNDYFTVGFKGSTEMDAGVFYSPYVPLTPLRGSDSKNFQPVIGFKTR 472 (514) Q Consensus 399 ~~~~d~~~~~~~G~l~~~~~vy~D~y------~~~dy~~vG~kG~~~~~~~~fy~PYv~~~~~~~~dp~s~qp~~~~~tR 472 (514) .+++|+++++|+|+|+|||+|||||| +++|||+|||||++++|+||||||||||++++++||+||||+|||||| T Consensus 355 ~~~~D~t~~~~~G~l~~~~~vy~d~y~~~~~~a~~dy~~vG~KG~~~~~~glfy~PYv~l~~~~~~dp~sfqP~~g~~tR 434 (470) T protein:vir:10 355 NLNVDDTGNTFAGILQGKYRVYIDPFSASGGAAATQYYVVGYKGSSPYDAGLFYCPYVPLQMVRAVGQDTFQPKIGFKTR 434 (470) T ss_pred ccccCCCCceEEEEecCceEEEeeccccccCcccccEEEEEEecCcceecceeeccccccccCCCCCCccccceeeeeee Confidence 58999999999999999999999997 778999999999999999999999999999999999999999999999 Q ss_pred eeeeecCccccccCc-ceeecCcchhhhccccceeeeeeeecC Q lcl|Aclame:pro 473 YGVQVNPFADPTASA-TKVGNGAPVAASMGKNAYFRRVFVKGL 514 (514) Q Consensus 473 Y~l~~nPf~~~~~~~-~~i~~~~~~~~~~~~~~~~r~~~V~~~ 514 (514) |||++|||++..+++ .+|+ .|+|.|||||+|||| T Consensus 435 Y~l~~NP~~~~~~~~~~~i~--------~~~n~y~r~~~v~~l 469 (470) T protein:vir:10 435 YGLVENPFSQGTTQGLGTLT--------RNSNRYYRRVKVANL 469 (470) T ss_pred eceeecCcccCCCccccccc--------CCCCceeeEEEeecc Confidence 999999999987664 5555 489999999999999 No 14 >protein:vir:106998 Length: 468 # NCBI annotation: major capsid protein gp23 # Family: family:all:364 # MgeID: mge:1459 # MgeName: S-PM2 # Cross-refs: genbank:acc:YP_195142;genbank:gi:58532919;uniprot:Q5GQN0;genbank:GeneID:3260495 Probab=100.00 E-value=1.7e-220 Score=1225.42 Aligned_cols=455 Identities=38% Similarity=0.633 Sum_probs=397.5 Q ss_pred CcchhhhhhhhccccccccccccchhhhhhhhhhhhHHHHHHhcccccchhhhhhhcccccccccccccccccccccccc Q lcl|Aclame:pro 1 MNLTEKWKDLLEAEGADMPEIATATKQKIMSKIFENQDRDINNDPMYRDPQLVEAFNAGLNEAVVNGDHGYDPANIAQGV 80 (514) Q Consensus 1 ~~l~~kw~p~l~~~~~~~~~i~~~~~~~~~~~~~enq~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~g~~~~~~~~st 80 (514) .+|+|||+||||||| +|||++.|||+|+|+||||||||++|++.|+++.++++|++ .+......+++++ T Consensus 5 e~l~~kW~plLe~~~--~~~i~~~~k~~i~a~llENQe~~~~~~~~~~~~~~~~~~~~---------~~~~~~n~~~~~~ 73 (468) T protein:vir:10 5 EHLQEKWSPVLNHGE--APAIGDRYKRAVTSVLLENQERFLREERGMLNEVAVNSLGA---------GTIAPAGSALGSA 73 (468) T ss_pred HHHHHhhhHhhcCCc--cchhccchhhhhhhhhhhhHHHHHhccccccchhhHhhcCC---------cccchhhhhhhhc Confidence 899999999999997 89999999999999999999999999999999999999864 2244555678899 Q ss_pred ccccccccceeeehhhhhhhhhhhhcceeEEecCCcccceeeeeeeeecCCCCccccccccccCCCCccCcccccccccc Q lcl|Aclame:pro 81 TTGAVTNIGPTVMGMVRRAIPQLIAFDIAGVQPMTGPTSQVFTLRSVYGKDPLTGAEAFHPTRQADASFSGQAAASTIAD 160 (514) Q Consensus 81 ~tg~v~~~~P~l~~l~Rra~~~LIa~DI~GVQPmTgPTGLIFAMRSrY~~~~~tg~EA~~~~nEadt~fSG~~~~~~~~~ 160 (514) +|++|+++||+||+|||||+|||||+|||||||||||||||||||+||.++ +|+|||| ||||++|||......... T Consensus 74 ~t~~v~~~~P~Li~l~RRa~p~LIa~DIwGVQPMTgPTGLIFAmRsrY~n~--~g~EAf~--nEadt~fSg~~~~~~~~~ 149 (468) T protein:vir:10 74 NTGGLAGFDPVLISLVRRAMPNLMAYDVCGVQPMSGPTGLIFAMRSRYENQ--AGEEALF--NEPDTGFTGGYDASQGDY 149 (468) T ss_pred ccccccccCchhhhhHHHHHhhhhhhhceeeecCCccceeeeEEEEEecCC--CCcccee--cccccccccccccccccc Confidence 999999999999999999999999999999999999999999999999998 5889999 999999999754432211 Q ss_pred ccccccccccccccccccccccccccccccccccccccccCccccccccccccccccccccccccchhhhccccCCCCCC Q lcl|Aclame:pro 161 FPTTGAATDGTPYKAEVTTSGGDVSMRYFLALGAVTLAVAGQMTATEYTDGVAGGLLVEIDAGMATSQAELQENFNGSSN 240 (514) Q Consensus 161 ~~~~~~~t~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~y~~~~GmtTa~aEal~~~ggs~~ 240 (514) ....+.... ... ..........+...+|+++.||+|+.+|.++ +++ T Consensus 150 ~~~~~~~~~---------------------------~~~---~g~~~~~~~~a~~~~~~~g~gMsTa~aE~lG----~~~ 195 (468) T protein:vir:10 150 AVRTGAGVG---------------------------GDS---EGNNPALLNDAAPGTYEVGSKMPREDLERMG----EAN 195 (468) T ss_pred ccccccccc---------------------------cCC---CCCcccccccccccccccccccchHHHhhcC----CCC Confidence 100000000 000 0001111223445678899999999999883 456 Q ss_pred cccccceeEEEEEEEEeecccccccccHHHHHHHHhhcCCChhHHHHHHHHHHHHHHhhHHHHHHHhhheeecccccccc Q lcl|Aclame:pro 241 NEWNEMSFRIDKQVVEAKSRQLKAQYSIELAQDLRAVHGLDADAELSGILANEVMVELNREIVNLVNSQAQIGKSGWTQG 320 (514) Q Consensus 241 ~~f~EMsFsIEK~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAEaELanILStEImlEINReii~~l~~~a~v~~~~~~~~ 320 (514) ++|+||+|+||||+|||||||||||||||||||||||||||||+||+||||||||+|||||||++|+++|+ +|++++ T Consensus 196 ~~f~EMaFsIeK~tVtAKSRaLKAeYTiELAQDLKAiHGLDAEtELaNILStEImlEINReii~~l~~va~---~~k~~g 272 (468) T protein:vir:10 196 RLFREMSFSIEKTSVTAQSRALKAEYTLELAQDLKAIHGLDAEQELANILSSEVLAEINREVVRRVYTVAK---KGAQNN 272 (468) T ss_pred cccceeeeEEEEEEEeeeccceeccccHHHHHHHHHhcCCChhHHHHHHHHHHHHHHhcHHHHHhHhhhhh---heeccc Confidence 78999999999999999999999999999999999999999999999999999999999999999987776 567889 Q ss_pred cCCcceeccccccccccchhhHHHHHHHHHHHHHHHHHHHHhcccccccEEEEChhHHhHHhhccccccchhccccCcc- Q lcl|Aclame:pro 321 AGAAGVFDFSDAVDVKGARWAGEAYKALLIQIEKEANEIGRQTGRGNGNFIIASRNVVSALSMTDTLVGPAAQGMQDGS- 399 (514) Q Consensus 321 v~~~g~~dl~~~~d~~~~rwa~e~~r~L~~~i~~~a~~I~~~T~r~~~n~~v~S~~va~~L~~~g~~~~~~~~~~~~~~- 399 (514) ++++|+|||+++.| +||++|++|+|++|||+|+|+|+|+|+||+||||||||+||++|+++|||++.|.+....+. T Consensus 273 ~~~~Gv~d~~~~~~---~rw~~e~~k~L~~~i~~ean~i~~~T~rg~gn~ii~S~~Va~~L~~sG~l~~~~~~~~~~~~~ 349 (468) T protein:vir:10 273 VANAGIFDLDVDSN---GRWSVEKFKGLLFQVERDANAIAQETRRGKGNFLICSADVASALAMAGVLDYSSGLNGAGGPS 349 (468) T ss_pred cccccccccccccc---chhHHHHHHHHHHHHHHHHHHHHHhhccccccEEEechhHHHHHhhcCcceeccccccccccc Confidence 99999999997766 79999999999999999999999999999999999999999999999999998776655442 Q ss_pred -ccccccCceEEEEecCceEEEecCCCc----cceEEEEEecCCCcccceeeccccccccccccCCccccceeeeeeeee Q lcl|Aclame:pro 400 -MNTDTNQTVFAGVLGGRFKVYIDQYAV----NDYFTVGFKGSTEMDAGVFYSPYVPLTPLRGSDSKNFQPVIGFKTRYG 474 (514) Q Consensus 400 -~~~d~~~~~~~G~l~~~~~vy~D~y~~----~dy~~vG~kG~~~~~~~~fy~PYv~~~~~~~~dp~s~qp~~~~~tRY~ 474 (514) .++|+++++|+|+|+|||+||||||+. +|||+|||||++++|+||||||||||+|++++||+||||+|||||||| T Consensus 350 ~~~~D~tg~~~~G~l~~r~~vy~D~Ya~~~s~~dY~~vG~KG~~~~d~glfyaPYv~l~~~~~~dp~sfqP~~g~~tRY~ 429 (468) T protein:vir:10 350 IGEVDDTGNLAVGTINGRIKVFVDPYAANLSDKHYYVIGYKGTSPYDAGLFYCPYVPLQMVRSIDPNTFQPKIGFKTRYG 429 (468) T ss_pred ccccccCcceEEEEecCceEEEEccccccCCccceEEEEEecCcceeceeeeccccccccccccCCCcccceeeeeeeec Confidence 589999999999999999999999975 799999999999999999999999999999999999999999999999 Q ss_pred eeecCccccccCcceeecCcch--hhhccccceeeeeeeecC Q lcl|Aclame:pro 475 VQVNPFADPTASATKVGNGAPV--AASMGKNAYFRRVFVKGL 514 (514) Q Consensus 475 l~~nPf~~~~~~~~~i~~~~~~--~~~~~~~~~~r~~~V~~~ 514 (514) |++|||+...+ +.|+++. +..+|+|.|||||+|||| T Consensus 430 l~~NP~~~~~~----~~~g~~~~~~~~~~~N~y~r~~~v~~l 467 (468) T protein:vir:10 430 MVSNPFVTTNG----LYNGTPDGEALTPNANMYYRRVQVTNL 467 (468) T ss_pred eeecccceecc----ccCCCcccccccccccceeeeEEEecc Confidence 99999997543 4444433 245699999999999999 No 15 >protein:vir:104549 Length: 462 # NCBI annotation: gp23 # Family: family:all:364 # MgeID: mge:1548 # MgeName: P-SSM4 # Cross-refs: genbank:acc:YP_214669;genbank:gi:61806310;genbank:GeneID:3294604 Probab=100.00 E-value=3.9e-217 Score=1207.01 Aligned_cols=450 Identities=39% Similarity=0.681 Sum_probs=390.7 Q ss_pred CcchhhhhhhhccccccccccccchhhhhhhhhhhhHHHHHHhcccccchhhhhhhcccccccccccccccccccccccc Q lcl|Aclame:pro 1 MNLTEKWKDLLEAEGADMPEIATATKQKIMSKIFENQDRDINNDPMYRDPQLVEAFNAGLNEAVVNGDHGYDPANIAQGV 80 (514) Q Consensus 1 ~~l~~kw~p~l~~~~~~~~~i~~~~~~~~~~~~~enq~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~g~~~~~~~~st 80 (514) .+|+|||+||||||| +|+|++.|||+|+++|||||||||+|++. +|+|+. ++||++ .+++ T Consensus 4 ~~l~~~w~~~l~~~~--~~~i~~~~~~~~~~~~~enq~~~~~~~~~------------~l~ea~--~~~g~~----~~~~ 63 (462) T protein:vir:10 4 QQLQEKWAPVLNHES--VPEIKDSYKKGVVAQLLENQENAIREEGQ------------VLNETL--QTTGYT----TGDT 63 (462) T ss_pred HHHHHHhhhhhcccc--cchhhhhhHHHHHHHHhhhHHHHHHhccc------------chhccc--cccCCC----cCcc Confidence 689999999999997 89999999999999999999999999875 677875 889987 5678 Q ss_pred ccccccccceeeehhhhhhhhhhhhcceeEEecCCcccceeeeeeeeecCCCC----ccccccccccCCCCccCcccccc Q lcl|Aclame:pro 81 TTGAVTNIGPTVMGMVRRAIPQLIAFDIAGVQPMTGPTSQVFTLRSVYGKDPL----TGAEAFHPTRQADASFSGQAAAS 156 (514) Q Consensus 81 ~tg~v~~~~P~l~~l~Rra~~~LIa~DI~GVQPmTgPTGLIFAMRSrY~~~~~----tg~EA~~~~nEadt~fSG~~~~~ 156 (514) +|+++++|||+||+|||||+|||||+||||||||||||||||||||||++++. +|+|||| ||+|+.|||..+.. T Consensus 64 ~t~~~~~~~P~Li~l~Rra~p~LIa~DIwGVQPMTgPTGLIFAmRsrY~~~~~~~nq~gtEAlf--nEadt~fSg~~~~~ 141 (462) T protein:vir:10 64 ATGPVAGFDPVLISLIRRSMPQLIAYDVAGVQPMTGPTGLIFAMRSFYGSERRPANSDFREALF--NEPNAGFSGGAGTG 141 (462) T ss_pred cccccccccchhhhHHHHHHhhhhhhcceeeecCCcchhhhheeeeeccCCccccccccchhhh--ccCCcCcccccccc Confidence 89999999999999999999999999999999999999999999999998754 4789999 99999999987664 Q ss_pred ccccccccccccccccccccccccccccccccccccccccccccCccccccccccccccccccccccccchhhhccccCC Q lcl|Aclame:pro 157 TIADFPTTGAATDGTPYKAEVTTSGGDVSMRYFLALGAVTLAVAGQMTATEYTDGVAGGLLVEIDAGMATSQAELQENFN 236 (514) Q Consensus 157 ~~~~~~~~~~~t~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~y~~~~GmtTa~aEal~~~g 236 (514) ............... ..+. .....+ .........++.+.||+|+.+|+|+. T Consensus 142 ~~~~~~~~~~~~~~~-------~~g~-------------~~~~~~-------~~~~g~~~~~~~~~GM~Ta~aE~lg~-- 192 (462) T protein:vir:10 142 LSNYDPTASSSAVND-------AEGA-------------NPGLLN-------DSPAGTYEVTGDATGMATATAEALDD-- 192 (462) T ss_pred ccccccccccccccc-------cccc-------------cceeec-------CCCccceecccccccccchhccccCC-- Confidence 432221111000000 0000 000000 00111122455678999999999963 Q ss_pred CCCCcccccceeEEEEEEEEeecccccccccHHHHHHHHhhcCCChhHHHHHHHHHHHHHHhhHHHHHHHhhheeecccc Q lcl|Aclame:pro 237 GSSNNEWNEMSFRIDKQVVEAKSRQLKAQYSIELAQDLRAVHGLDADAELSGILANEVMVELNREIVNLVNSQAQIGKSG 316 (514) Q Consensus 237 gs~~~~f~EMsFsIEK~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAEaELanILStEImlEINReii~~l~~~a~v~~~~ 316 (514) ++++++|+||+|+||||+|||||||||||||||||||||||||||||+||+||||||||+|||||||++|+++|+ +| T Consensus 193 ~s~n~~f~EMaFsIeK~tVtAKSRaLKAEYTiELAQDLKAIHGLDAEtELaNILSTEImlEINReii~~l~~~a~---~~ 269 (462) T protein:vir:10 193 SSASTAFREMGFSIEKVTVTAKSRALKAEYSIEMAQDLKAIHGLDAESELANILSTEILAEINREVVRTIYVNAV---KG 269 (462) T ss_pred ccCCcchhhceeEEEEEEEeeeccceeccccHHHHHHHHHhcCCChhHHHHHHHHHHHHHHhhHHHHhhhhhhhe---ee Confidence 466789999999999999999999999999999999999999999999999999999999999999999988875 67 Q ss_pred cccccCCcceeccccccccccchhhHHHHHHHHHHHHHHHHHHHHhcccccccEEEEChhHHhHHhhccccccchhcccc Q lcl|Aclame:pro 317 WTQGAGAAGVFDFSDAVDVKGARWAGEAYKALLIQIEKEANEIGRQTGRGNGNFIIASRNVVSALSMTDTLVGPAAQGMQ 396 (514) Q Consensus 317 ~~~~v~~~g~~dl~~~~d~~~~rwa~e~~r~L~~~i~~~a~~I~~~T~r~~~n~~v~S~~va~~L~~~g~~~~~~~~~~~ 396 (514) |+.+++++|+|||+++.+ +||++|++|+|++||+++||+|+|+||||+|||||||||||++|+|+|||++.|+.... T Consensus 270 k~~~~~~~Gv~dl~~~~~---gr~~~e~~k~l~~qi~~ean~i~~~t~r~~~n~~i~S~~Va~~La~sG~l~~~p~~~~~ 346 (462) T protein:vir:10 270 AIANTATDGIFDLDVDSN---GRWSVEKFKGLLFQIERDSNAIGQETRRGKGNILICSADVASALGMAGVLDYAPGLQGN 346 (462) T ss_pred ecccccccceeeeccccc---hHHHHHHHHHHHHHHHHHHHHHHHHhccccceEEEEchhHHHHhhhccchhcccccccc Confidence 788889999999987765 79999999999999999999999999999999999999999999999999998876666 Q ss_pred CccccccccCceEEEEecCceEEEecCC----CccceEEEEEecCCCcccceeeccccccccccccCCccccceeeeeee Q lcl|Aclame:pro 397 DGSMNTDTNQTVFAGVLGGRFKVYIDQY----AVNDYFTVGFKGSTEMDAGVFYSPYVPLTPLRGSDSKNFQPVIGFKTR 472 (514) Q Consensus 397 ~~~~~~d~~~~~~~G~l~~~~~vy~D~y----~~~dy~~vG~kG~~~~~~~~fy~PYv~~~~~~~~dp~s~qp~~~~~tR 472 (514) .+..++|+++.+|+|+|+|||+|||||| +++|||+|||||++++|+||||||||||++++++||+||||+|||||| T Consensus 347 ~~~~~~d~~~~~~~G~l~~r~~vy~D~Y~~~ns~~dy~~vG~KG~~~~~~glfy~PYv~l~~~~~~dp~sfqP~~g~~tR 426 (462) T protein:vir:10 347 SALTGVDDTSSTLVGTLNGRIKVYVDPYSSNVADKHFYVAGYKGTSPYDAGLFYCPYVPLQQVRAINPNTFQPKIGFKTR 426 (462) T ss_pred ccccccccccceeEEEecCceEEEEecccCCCcccceEEEEEeCCcccccceeeccccccccccccCCccccceeeeeee Confidence 6666899999999999999999999998 668999999999999999999999999999999999999999999999 Q ss_pred eeeeecCccccccCcceeecCcchhhhccccceeeeeeeecC Q lcl|Aclame:pro 473 YGVQVNPFADPTASATKVGNGAPVAASMGKNAYFRRVFVKGL 514 (514) Q Consensus 473 Y~l~~nPf~~~~~~~~~i~~~~~~~~~~~~~~~~r~~~V~~~ 514 (514) |||++|||++..++. ...+..|+|.|||||+|||| T Consensus 427 Y~l~~NP~t~~~~~~-------~~~~~~~~n~y~r~~~v~~l 461 (462) T protein:vir:10 427 YGMVSNPFSGGLTQG-------SGALTANANKYYRRVQVANL 461 (462) T ss_pred eeeeecCCCCCcCCc-------cccccccCcceeeeEEeecc Confidence 999999999987764 12345799999999999999 No 16 >protein:vir:103181 Length: 457 # NCBI annotation: gp135 # Family: family:all:364 # MgeID: mge:1583 # MgeName: Syn9 # Cross-refs: genbank:acc:YP_717802;genbank:gi:113200639;genbank:GeneID:4239190 Probab=100.00 E-value=3.5e-213 Score=1185.34 Aligned_cols=444 Identities=41% Similarity=0.698 Sum_probs=390.7 Q ss_pred CcchhhhhhhhccccccccccccchhhhhhhhhhhhHHHHHHhcccccchhhhhhhcccccccccccccccccccccccc Q lcl|Aclame:pro 1 MNLTEKWKDLLEAEGADMPEIATATKQKIMSKIFENQDRDINNDPMYRDPQLVEAFNAGLNEAVVNGDHGYDPANIAQGV 80 (514) Q Consensus 1 ~~l~~kw~p~l~~~~~~~~~i~~~~~~~~~~~~~enq~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~g~~~~~~~~st 80 (514) .+|+|||+||||||| +|||++.|||+|+++|||||||||.|++. +|+||. ++||+++. |+ T Consensus 4 ~~l~~~w~~~l~~~~--~~~i~~~~~~~~~~~~lenq~~~~~~~~~------------~l~ea~--~~~g~~~~----s~ 63 (457) T protein:vir:10 4 QNLQEKWAPVLEHDS--LPEIGDSYKKGVVAQLLENQEKAIAEEGK------------ILTETL--QTTGYTGG----DT 63 (457) T ss_pred HHHHHHhhHhhccCc--cchhhhhHHHHHHHHHhhhHHHHHHhccc------------cccccc--cccCCCcc----cc Confidence 689999999999997 89999999999999999999999998875 677775 88998754 67 Q ss_pred ccccccccceeeehhhhhhhhhhhhcceeEEecCCcccceeeeeeeeecCCCC----ccccccccccCCCCccCcccccc Q lcl|Aclame:pro 81 TTGAVTNIGPTVMGMVRRAIPQLIAFDIAGVQPMTGPTSQVFTLRSVYGKDPL----TGAEAFHPTRQADASFSGQAAAS 156 (514) Q Consensus 81 ~tg~v~~~~P~l~~l~Rra~~~LIa~DI~GVQPmTgPTGLIFAMRSrY~~~~~----tg~EA~~~~nEadt~fSG~~~~~ 156 (514) +|++|+++||+||+|||||+|||||+|||||||||||||||||||+||.++.. +.+|||| ||||+.|||..+.. T Consensus 64 ~t~~v~~~~P~Li~l~Rra~p~LIa~DIwGVQPmTgPTGLIFAmRsrY~~q~~~~~a~~~EAl~--nEadt~fSg~~~~~ 141 (457) T protein:vir:10 64 VTGPVAGFDPVLISLIRRSMPQLIAYDIAGVQPMTGPTGLIFAMRTNYGAERNPAAAGYDEAFF--NEPNAGFSGGPGAY 141 (457) T ss_pred cccccccccchhhhhhHHHHhhhhhhhcceeecCCCcceeeeeeeeeecCccccccccccceee--eccCcccCcccccc Confidence 89999999999999999999999999999999999999999999999998753 2379998 99999999976654 Q ss_pred ccccccccccccccccccccccccccccccccccccccccccccCccccccccccccccccccccccccchhhhccccCC Q lcl|Aclame:pro 157 TIADFPTTGAATDGTPYKAEVTTSGGDVSMRYFLALGAVTLAVAGQMTATEYTDGVAGGLLVEIDAGMATSQAELQENFN 236 (514) Q Consensus 157 ~~~~~~~~~~~t~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~y~~~~GmtTa~aEal~~~g 236 (514) ........+...+. ..........+....++++.||+|+.+|.|++ T Consensus 142 ~~~~~~~~~~~~gt--------------------------------~~~~~~~~~~~~~~~~~~~~gmsTA~aE~lgd-- 187 (457) T protein:vir:10 142 DPGATGVTNDAEGT--------------------------------NPALLNDSPAGTYEQADDATGMSTATVEALDD-- 187 (457) T ss_pred cccccccccccccc--------------------------------cccccCccccccccccccccchhhhhhhccCC-- Confidence 33211110000000 00001111122334678899999999999963 Q ss_pred CCCCcccccceeEEEEEEEEeecccccccccHHHHHHHHhhcCCChhHHHHHHHHHHHHHHhhHHHHHHHhhheeecccc Q lcl|Aclame:pro 237 GSSNNEWNEMSFRIDKQVVEAKSRQLKAQYSIELAQDLRAVHGLDADAELSGILANEVMVELNREIVNLVNSQAQIGKSG 316 (514) Q Consensus 237 gs~~~~f~EMsFsIEK~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAEaELanILStEImlEINReii~~l~~~a~v~~~~ 316 (514) ++++++|+||+|+||||+|||||||||||||||||||||||||||||+||+||||||||+|||||||++|+++|+ +| T Consensus 188 ~~~n~~f~EMaFsIeK~tVtAKSRaLKAEYTiELAQDLKAiHGLDAEtELaNILStEImlEINReii~~l~~~a~---~~ 264 (457) T protein:vir:10 188 STANTAFREMGFSIEKVTVTARARALKAEYSIEMAQDLKAIHGLDAEQELANILSTEILAEINREVVRTIYTNAV---AG 264 (457) T ss_pred CCCccchhhheeEEEEEEEeeeccceeccccHHHHHHHHHhcCCChhHHHHHHHHHHHHHHhhHHHHHhHhhhhe---ee Confidence 566789999999999999999999999999999999999999999999999999999999999999999987775 77 Q ss_pred cccccCCcceeccccccccccchhhHHHHHHHHHHHHHHHHHHHHhcccccccEEEEChhHHhHHhhccccccchhcccc Q lcl|Aclame:pro 317 WTQGAGAAGVFDFSDAVDVKGARWAGEAYKALLIQIEKEANEIGRQTGRGNGNFIIASRNVVSALSMTDTLVGPAAQGMQ 396 (514) Q Consensus 317 ~~~~v~~~g~~dl~~~~d~~~~rwa~e~~r~L~~~i~~~a~~I~~~T~r~~~n~~v~S~~va~~L~~~g~~~~~~~~~~~ 396 (514) ++++++++|+|||+++.| +||++|++|+|++||++++|+|+|+|+||+||||||||+||++|+++|||++.|+.... T Consensus 265 ~~~~~~~~gv~dl~~~~~---g~~~~e~~k~L~~~i~~ean~i~~~T~rg~gn~~i~S~~Va~~L~~sg~l~~~p~~~~~ 341 (457) T protein:vir:10 265 AQNNTATAGVFDLDVDSN---GRWSVEKFKGLLFQIERDANAIGHQTRRGKGNILICSADVVSALGMAGVLDYTPALNGN 341 (457) T ss_pred eccccccceeeeeecccc---chhhHHHHHHHHHHHHHHHHHHHHhhccccceEEEEchhHHHHHhhcccccccchhhcc Confidence 788889999999986665 79999999999999999999999999999999999999999999999999998888877 Q ss_pred CccccccccCceEEEEecCceEEEecCCCc----cceEEEEEecCCCcccceeeccccccccccccCCccccceeeeeee Q lcl|Aclame:pro 397 DGSMNTDTNQTVFAGVLGGRFKVYIDQYAV----NDYFTVGFKGSTEMDAGVFYSPYVPLTPLRGSDSKNFQPVIGFKTR 472 (514) Q Consensus 397 ~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~----~dy~~vG~kG~~~~~~~~fy~PYv~~~~~~~~dp~s~qp~~~~~tR 472 (514) .+..++|+++.+|+|+|+|||+||||||+. +|||+|||||++++|+||||||||||++++++||+||||+|||||| T Consensus 342 ~~~~~~d~~~~~~~G~l~~r~~vy~D~Ya~~ns~~dy~~vG~KG~~~~~~glfy~PYv~l~~~~~~dp~sfqP~~g~~tR 421 (457) T protein:vir:10 342 NGLAGVDDTSSTLVGTLNGRIKVYVDPYSANVADKHFYVAGYKGTSPYDAGLFYCPYVPLQQVRAINPDTFQPKIGFKTR 421 (457) T ss_pred ccccccccccceeEEEecCCeEEEEecccccCCccceEEEEEeCCcceecceeecccccccccCccCCccccceeeeeee Confidence 777889999999999999999999998874 7999999999999999999999999999999999999999999999 Q ss_pred eeeeecCccccccCc-ceeecCcchhhhccccceeeeeeeecC Q lcl|Aclame:pro 473 YGVQVNPFADPTASA-TKVGNGAPVAASMGKNAYFRRVFVKGL 514 (514) Q Consensus 473 Y~l~~nPf~~~~~~~-~~i~~~~~~~~~~~~~~~~r~~~V~~~ 514 (514) |||++|||+...+++ .+++ .|+|.||||+.|+|| T Consensus 422 Y~l~~NP~~~~~~~~~~~~~--------~~~n~~~~rs~vs~l 456 (457) T protein:vir:10 422 YGMVSNPFAGGLTQGSGALT--------VNANKYYRRVQVANL 456 (457) T ss_pred eeeeeccccccccccccccc--------ccchhhcceeeeeec Confidence 999999999987664 5444 468899999999999 No 17 >protein:vir:5942 Length: 523 # NCBI annotation: similar to major head protein # Family: family:all:364 # MgeID: mge:123 # MgeName: RM 378 # Cross-refs: genbank:acc:NP_835728;genbank:gi:30044131 Probab=100.00 E-value=3e-193 Score=1076.05 Aligned_cols=441 Identities=24% Similarity=0.335 Sum_probs=341.1 Q ss_pred CcchhhhhhhhccccccccccccchhhhhhhhhhhhHHHHHHhcccccchhhhhhhcccccccccccccccccccccccc Q lcl|Aclame:pro 1 MNLTEKWKDLLEAEGADMPEIATATKQKIMSKIFENQDRDINNDPMYRDPQLVEAFNAGLNEAVVNGDHGYDPANIAQGV 80 (514) Q Consensus 1 ~~l~~kw~p~l~~~~~~~~~i~~~~~~~~~~~~~enq~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~g~~~~~~~~st 80 (514) .+|+|||+||||+|| +.|||+|+|+|||||||+-++ +|.|++ T Consensus 8 e~l~~kw~p~l~~~~-------~~~~~~~~a~llenq~~~~~~-------------------------------~l~e~~ 49 (523) T protein:vir:59 8 EQLIEKWQPLLEGCR-------NDWERHTLATLLENQYREAKK-------------------------------HLMETT 49 (523) T ss_pred HHHHHhhhhhhcccC-------ChhHHHHHHHHhhhhhHHHHH-------------------------------hhhhhh Confidence 569999999999876 447999999999999997332 345667 Q ss_pred ccccccccceeeehhhhhhhhhhhhcceeEEecCCcccceeeeeeeeecCCCCccccccc------------cccCCCCc Q lcl|Aclame:pro 81 TTGAVTNIGPTVMGMVRRAIPQLIAFDIAGVQPMTGPTSQVFTLRSVYGKDPLTGAEAFH------------PTRQADAS 148 (514) Q Consensus 81 ~tg~v~~~~P~l~~l~Rra~~~LIa~DI~GVQPmTgPTGLIFAMRSrY~~~~~tg~EA~~------------~~nEadt~ 148 (514) .+++|++|.| ||+|+||++|||||+||||||||||||||||||||||.++. |+||+| +++++++. T Consensus 50 ~~~~~~~~~~-~~~~v~r~~p~l~a~DIWGVQPMTGPTGLIFAMRSRY~~q~--gteA~yg~~~~~~~~a~~~~~ean~~ 126 (523) T protein:vir:59 50 QTTEVDGWNL-ALPIVRRVFANLRATDLVSVQPLSLPTGLVFYLDFKSPELP--GNGSVYGGTGLTTDTATGGLYDENAR 126 (523) T ss_pred hccccccccc-hhhhhhhHhhhhhhhhccccccCCCCcceeEEEEeeccCCC--CcccccCccccCcccccccccccccc Confidence 7999999996 99999999999999999999999999999999999999984 666665 23566677 Q ss_pred cCccccccccccccc-ccccccccc-ccccccccc------------------c------------------ccc----c Q lcl|Aclame:pro 149 FSGQAAASTIADFPT-TGAATDGTP-YKAEVTTSG------------------G------------------DVS----M 186 (514) Q Consensus 149 fSG~~~~~~~~~~~~-~~~~t~~~~-~~~~~~~~~------------------g------------------~~~----~ 186 (514) ||+..+......... ++....... ......... + ..+ . T Consensus 127 ~s~~~~~~~~~~d~~~sg~~~~~~~a~stg~A~a~~s~si~k~~vTa~s~agta~~~li~A~~~q~itg~tga~fa~s~~ 206 (523) T protein:vir:59 127 LSRREYETTITVDLATAQQATMRDVGFDTGIASLVSSGAVYYVDVPVASLPGVADVNTVRFWQYDDASGDPENTVAYPLP 206 (523) T ss_pred cccccccCccCCCcccccccccccccccccchhhccccceeeeeccccccccccccccccccccccccccccccccchhh Confidence 777655443321111 000000000 000000000 0 000 0 Q ss_pred ccccccccccccccCc-------c---ccccccccccccccccccccccchhhhccccCC--CCCCcccccceeEEEEEE Q lcl|Aclame:pro 187 RYFLALGAVTLAVAGQ-------M---TATEYTDGVAGGLLVEIDAGMATSQAELQENFN--GSSNNEWNEMSFRIDKQV 254 (514) Q Consensus 187 ~~~~~~~~~~~~~~~~-------~---~~~~~~~~~a~~~~y~~~~GmtTa~aEal~~~g--gs~~~~f~EMsFsIEK~t 254 (514) .+.........+..+. . ..............++.+.||+|+.+|.++.++ ++++++|+||+|+||||+ T Consensus 207 ~an~astAss~Al~gEA~t~~sTd~at~~~Gtt~t~~~~~lyt~~~g~~t~~~~~~~~~~~~~~~~~~~~eM~FsIeK~t 286 (523) T protein:vir:59 207 RYNRIVGAVGSALYARLFFVTGSDFATVAGGTPSTQDLDLVYYIDARNDFEDQSTDPDYPDPGFQSLDIPEINLELRSRP 286 (523) T ss_pred ccccccccccccccccccccccccccccCCCcccccccccccccccccchhhccccccccccccccccccceeeEEEeEE Confidence 0000000000000000 0 000001111223468889999999999998765 467899999999999999 Q ss_pred EEeecccccccccHHHHHHHHhhc-CCChhHHHHHHHHHHHHHHhhHHHHHHHhhheeecccccccccCCcceecccccc Q lcl|Aclame:pro 255 VEAKSRQLKAQYSIELAQDLRAVH-GLDADAELSGILANEVMVELNREIVNLVNSQAQIGKSGWTQGAGAAGVFDFSDAV 333 (514) Q Consensus 255 VtAKSRaLKAEYT~ELAQDLkAiH-GLDAEaELanILStEImlEINReii~~l~~~a~v~~~~~~~~v~~~g~~dl~~~~ 333 (514) |||||||||||||||||||||||| |||||+||+||||+|||||||||||++|+++|+ +|++.++.++|+|||.++. T Consensus 287 VtAkSRaLKAeYT~ELAQDLKAiH~GLDAE~ELanILStEImlEINR~ii~~~~~~a~---~~~~~~~~~~g~~~~~~~~ 363 (523) T protein:vir:59 287 VATKTRKLRAAWTPEAMQDLAAYHKGVDLENEIVTLMSQYIAREIDLEILSTIMAHAR---RTDNYGFWSEVVGEYYDET 363 (523) T ss_pred EeeecccccccccHHHHHHHHHHhcCCChhHHHHHHHHHHHHHHhhHHHHHhHhhhhe---eeeeccccccceeeecccc Confidence 999999999999999999999999 999999999999999999999999999988876 6667788899999999877 Q ss_pred c---cccchhh--HHHHHHHHHHHHHHHHHHHHhcccccccEEEEChhHHhHHhhccccccchhccccCccccccccCce Q lcl|Aclame:pro 334 D---VKGARWA--GEAYKALLIQIEKEANEIGRQTGRGNGNFIIASRNVVSALSMTDTLVGPAAQGMQDGSMNTDTNQTV 408 (514) Q Consensus 334 d---~~~~rwa--~e~~r~L~~~i~~~a~~I~~~T~r~~~n~~v~S~~va~~L~~~g~~~~~~~~~~~~~~~~~d~~~~~ 408 (514) | +.|.+|. +||+|+|+++||+|+|+|+|+|+||+|||||||||||++|+++|||+. .....+|+++.+ T Consensus 364 ~~~~~~~~~~~~~~e~~~~l~~~~~~~~n~i~~~t~~~~~~~~~~s~~v~~~l~~~~~~~~-------~~~~~~~~~~~~ 436 (523) T protein:vir:59 364 SGNFVAGNFYGSKQEWLATLMIELNKVSNRIQQKTAVAGANFLVTSPQVAALLESMPGFTP-------GNDNRDGGTGIF 436 (523) T ss_pred cchhhhhhhhhhhHHHHHHHHHHHHHHHHHHHHhcccccccEEEEchhHHHHHHhcccccc-------CCccccccccce Confidence 6 3334443 899999999999999999999999999999999999999999999963 223467889999 Q ss_pred EEEEecCceEEEecCCCccceEEEEEec-CCCcccceeecccccccccccc-CCccccceeeeeeeeeeee-cCcccccc Q lcl|Aclame:pro 409 FAGVLGGRFKVYIDQYAVNDYFTVGFKG-STEMDAGVFYSPYVPLTPLRGS-DSKNFQPVIGFKTRYGVQV-NPFADPTA 485 (514) Q Consensus 409 ~~G~l~~~~~vy~D~y~~~dy~~vG~kG-~~~~~~~~fy~PYv~~~~~~~~-dp~s~qp~~~~~tRY~l~~-nPf~~~~~ 485 (514) |+|+|+|||+||||||+++|||+||||| .+++|+||||||||||.+++++ ||+||||+|||||||||++ |||+...- T Consensus 437 ~~g~l~~~~~vy~d~~~~~dy~~~g~k~~~~~~~~~~~y~Py~~l~~~~~~~dp~s~qp~~~~~tRY~l~v~nP~~~~~~ 516 (523) T protein:vir:59 437 YVGMVQGRYRLYKNIYQNQPVIIMGNQDLNTPWQTGAVYAPYVPLLFTPTIVDPVNFSYRRGLMTRYALEVVRPEFYGLL 516 (523) T ss_pred eEEEecCceEEEecCCCCcceEEEEecccCCcccccceecccchhhcccccccCCcccceeeeeeehhheecchhHhhhh Confidence 9999999999999999999999999999 5699999999999999999985 9999999999999999986 99988654 Q ss_pred CcceeecC Q lcl|Aclame:pro 486 SATKVGNG 493 (514) Q Consensus 486 ~~~~i~~~ 493 (514) +-+ ...+ T Consensus 517 ~~~-~~~~ 523 (523) T protein:vir:59 517 YVK-LLQP 523 (523) T ss_pred hhh-hcCC Confidence 421 1111 No 18 >protein:vir:3033 Length: 272 # NCBI annotation: major capsid protein # Family: family:all:522 # MgeID: mge:61 # MgeName: PhiNIH1.1 # Cross-refs: genbank:acc:NP_438146;genbank:gi:16271809;genbank:GeneID:929235 Probab=95.87 E-value=0.0013 Score=36.35 Aligned_cols=266 Identities=11% Similarity=0.024 Sum_probs=117.4 Q ss_pred ccccccccccccccccccccccccccccccccccccc---ccccccccccCccccccccccccccccccccccccchhhh Q lcl|Aclame:pro 154 AASTIADFPTTGAATDGTPYKAEVTTSGGDVSMRYFL---ALGAVTLAVAGQMTATEYTDGVAGGLLVEIDAGMATSQAE 230 (514) Q Consensus 154 ~~~~~~~~~~~~~~t~~~~~~~~~~~~~g~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~a~~~~y~~~~GmtTa~aE 230 (514) .+.+. +...+... ......... ............. ..+. .. .+....+..--....++ T Consensus 1 MA~~~--------T~~~~~~i-------Pev~s~~v~~~~~~~~~~~~~~~~~--~~~~-g~-~G~tv~iP~~~~~~~a~ 61 (272) T protein:vir:30 1 MAVGT--------TKMAQMLD-------PEVLADMIDAEVGKAIRFAPLAEVD--TTLE-GQ-PGTTLTVPKWDYIGDAE 61 (272) T ss_pred CCCcc--------ccchheec-------hHHHHHHHHHHHHHHhhhhcccccc--cccc-CC-CCCEEEEEEecCCCCcc Confidence 11000 00000000 000000000 0000000000000 0000 00 11111111100111122 Q ss_pred ccccCCCCCCcccccceeEEEEEEEEeecccccccccHHHHHHHHhhcCCChhHHHHHHHHHHHHHHhhHHHHHHHhhhe Q lcl|Aclame:pro 231 LQENFNGSSNNEWNEMSFRIDKQVVEAKSRQLKAQYSIELAQDLRAVHGLDADAELSGILANEVMVELNREIVNLVNSQA 310 (514) Q Consensus 231 al~~~ggs~~~~f~EMsFsIEK~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAEaELanILStEImlEINReii~~l~~~a 310 (514) -. +. +..+..=..+.+.++++.|.++-.-++|=|++.+ -+-|.++++.+-|+..|..+|+.+|+..+.... T Consensus 62 ~v---~e--g~~i~~~~~~~~~~~~~~~~~~~~~~itd~~~~~----s~~d~~~~~~~~~~~~~a~~~d~~i~~~~~~a~ 132 (272) T protein:vir:30 62 DV---AE--GEAIPMTQLGFKKTTMTIKKAGKGVEITDEAILS----GYGDPVGQAAKQIVEAIDHKVDADVLDALSKST 132 (272) T ss_pred cc---cC--CCcccccccccceEEEEeeeeeeeeeecHHHHhh----ccccHHHHHHHHHHHHHHHHHHHHHHHHhcccc Confidence 11 10 2233444456777888888777666777666543 256899999999999999999999987653211 Q ss_pred eecccccccccCCcceeccccccccccchhhHHHHHHHHHHHHHHHHHHHHhcccccccEEEEChhHHhHHhhccccccc Q lcl|Aclame:pro 311 QIGKSGWTQGAGAAGVFDFSDAVDVKGARWAGEAYKALLIQIEKEANEIGRQTGRGNGNFIIASRNVVSALSMTDTLVGP 390 (514) Q Consensus 311 ~v~~~~~~~~v~~~g~~dl~~~~d~~~~rwa~e~~r~L~~~i~~~a~~I~~~T~r~~~n~~v~S~~va~~L~~~g~~~~~ 390 (514) +.+..... + +-.-.+..++.++ -...+++||+|+++..|.......+. T Consensus 133 --------~~~~~~~t--~-------------d~i~da~~~l~~~---------~~~~~~~vv~p~~~~~L~k~~~~~~~ 180 (272) T protein:vir:30 133 --------QTVEATAT--V-------------DGVSKALDIFNDE---------DDAETVIVMNPADASTLRLDAAKEWL 180 (272) T ss_pred --------cccccccC--H-------------HHHHHHHHHHhcc---------CCCccEEEEcHHHHHHHHHhcccccc Confidence 11111111 1 1122233333322 24567999999999998765443321 Q ss_pred hhccccCccccccccCceEEEEecCceEEEecCCCccceEEEEEecCCCcccceeeccccccccccccCCccccceeeee Q lcl|Aclame:pro 391 AAQGMQDGSMNTDTNQTVFAGVLGGRFKVYIDQYAVNDYFTVGFKGSTEMDAGVFYSPYVPLTPLRGSDSKNFQPVIGFK 470 (514) Q Consensus 391 ~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~kG~~~~~~~~fy~PYv~~~~~~~~dp~s~qp~~~~~ 470 (514) ..... .+ +...+-..|.+. |++|+++++.+.+=+++.-+|.- +++-..-+.. ....|+.+++=.+-.. T Consensus 181 ~~~~~----~~-~~~~~g~ig~i~-G~~Vi~s~~~p~~t~~~~~~~a~----~~~~~~~~~v--e~~r~~~~~~~~i~~~ 248 (272) T protein:vir:30 181 GATEV----GA-NRVVSGVYGEVL-GVQIVRSRKCPKGTAYMVRKGAL----RIMLKRNTMV--ETDRDITKAINQIVAN 248 (272) T ss_pred ccccc----cc-cccccccchhhc-CeeEEEcCCCCcceEEEEcCCeE----EEEecCCcee--eeccccccceeEEEEE Confidence 11110 00 111111124554 57999999998654444333311 1111211111 1235888888888888 Q ss_pred eeeee-eecCc--cccccCcceeecCcchhhhcccc Q lcl|Aclame:pro 471 TRYGV-QVNPF--ADPTASATKVGNGAPVAASMGKN 503 (514) Q Consensus 471 tRY~l-~~nPf--~~~~~~~~~i~~~~~~~~~~~~~ 503 (514) -|||+ ..||= ...+-. .++|- T Consensus 249 ~~~~~~v~~~~~vv~~t~~------------~a~~~ 272 (272) T protein:vir:30 249 KHYGVYLYKAEKAVKITLK------------DAAKK 272 (272) T ss_pred EEEEEEEEcCCceEEEEec------------ccccC Confidence 89998 45662 111111 12222 No 19 >protein:vir:9820 Length: 272 # NCBI annotation: putative major capsid/head protein # Family: family:all:522 # MgeID: mge:176 # MgeName: 315.4 # Cross-refs: genbank:acc:NP_795582;genbank:gi:28876339;genbank:GeneID:1257858 Probab=95.87 E-value=0.0013 Score=36.35 Aligned_cols=266 Identities=11% Similarity=0.024 Sum_probs=117.4 Q ss_pred ccccccccccccccccccccccccccccccccccccc---ccccccccccCccccccccccccccccccccccccchhhh Q lcl|Aclame:pro 154 AASTIADFPTTGAATDGTPYKAEVTTSGGDVSMRYFL---ALGAVTLAVAGQMTATEYTDGVAGGLLVEIDAGMATSQAE 230 (514) Q Consensus 154 ~~~~~~~~~~~~~~t~~~~~~~~~~~~~g~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~a~~~~y~~~~GmtTa~aE 230 (514) .+.+. +...+... ......... ............. ..+. .. .+....+..--....++ T Consensus 1 MA~~~--------T~~~~~~i-------Pev~s~~v~~~~~~~~~~~~~~~~~--~~~~-g~-~G~tv~iP~~~~~~~a~ 61 (272) T protein:vir:98 1 MAVGT--------TKMAQMLD-------PEVLADMIDAEVGKAIRFAPLAEVD--TTLE-GQ-PGTTLTVPKWDYIGDAE 61 (272) T ss_pred CCCcc--------ccchheec-------hHHHHHHHHHHHHHHhhhhcccccc--cccc-CC-CCCEEEEEEecCCCCcc Confidence 11000 00000000 000000000 0000000000000 0000 00 11111111100111122 Q ss_pred ccccCCCCCCcccccceeEEEEEEEEeecccccccccHHHHHHHHhhcCCChhHHHHHHHHHHHHHHhhHHHHHHHhhhe Q lcl|Aclame:pro 231 LQENFNGSSNNEWNEMSFRIDKQVVEAKSRQLKAQYSIELAQDLRAVHGLDADAELSGILANEVMVELNREIVNLVNSQA 310 (514) Q Consensus 231 al~~~ggs~~~~f~EMsFsIEK~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAEaELanILStEImlEINReii~~l~~~a 310 (514) -. +. +..+..=..+.+.++++.|.++-.-++|=|++.+ -+-|.++++.+-|+..|..+|+.+|+..+.... T Consensus 62 ~v---~e--g~~i~~~~~~~~~~~~~~~~~~~~~~itd~~~~~----s~~d~~~~~~~~~~~~~a~~~d~~i~~~~~~a~ 132 (272) T protein:vir:98 62 DV---AE--GEAIPMTQLGFKKTTMTIKKAGKGVEITDEAILS----GYGDPVGQAAKQIVEAIDHKVDADVLDALSKST 132 (272) T ss_pred cc---cC--CCcccccccccceEEEEeeeeeeeeeecHHHHhh----ccccHHHHHHHHHHHHHHHHHHHHHHHHhcccc Confidence 11 10 2233444456777888888777666777666543 256899999999999999999999987653211 Q ss_pred eecccccccccCCcceeccccccccccchhhHHHHHHHHHHHHHHHHHHHHhcccccccEEEEChhHHhHHhhccccccc Q lcl|Aclame:pro 311 QIGKSGWTQGAGAAGVFDFSDAVDVKGARWAGEAYKALLIQIEKEANEIGRQTGRGNGNFIIASRNVVSALSMTDTLVGP 390 (514) Q Consensus 311 ~v~~~~~~~~v~~~g~~dl~~~~d~~~~rwa~e~~r~L~~~i~~~a~~I~~~T~r~~~n~~v~S~~va~~L~~~g~~~~~ 390 (514) +.+..... + +-.-.+..++.++ -...+++||+|+++..|.......+. T Consensus 133 --------~~~~~~~t--~-------------d~i~da~~~l~~~---------~~~~~~~vv~p~~~~~L~k~~~~~~~ 180 (272) T protein:vir:98 133 --------QTVEATAT--V-------------DGVSKALDIFNDE---------DDAETVIVMNPADASTLRLDAAKEWL 180 (272) T ss_pred --------cccccccC--H-------------HHHHHHHHHHhcc---------CCCccEEEEcHHHHHHHHHhcccccc Confidence 11111111 1 1122233333322 24567999999999998765443321 Q ss_pred hhccccCccccccccCceEEEEecCceEEEecCCCccceEEEEEecCCCcccceeeccccccccccccCCccccceeeee Q lcl|Aclame:pro 391 AAQGMQDGSMNTDTNQTVFAGVLGGRFKVYIDQYAVNDYFTVGFKGSTEMDAGVFYSPYVPLTPLRGSDSKNFQPVIGFK 470 (514) Q Consensus 391 ~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~kG~~~~~~~~fy~PYv~~~~~~~~dp~s~qp~~~~~ 470 (514) ..... .+ +...+-..|.+. |++|+++++.+.+=+++.-+|.- +++-..-+.. ....|+.+++=.+-.. T Consensus 181 ~~~~~----~~-~~~~~g~ig~i~-G~~Vi~s~~~p~~t~~~~~~~a~----~~~~~~~~~v--e~~r~~~~~~~~i~~~ 248 (272) T protein:vir:98 181 GATEV----GA-NRVVSGVYGEVL-GVQIVRSRKCPKGTAYMVRKGAL----RIMLKRNTMV--ETDRDITKAINQIVAN 248 (272) T ss_pred ccccc----cc-cccccccchhhc-CeeEEEcCCCCcceEEEEcCCeE----EEEecCCcee--eeccccccceeEEEEE Confidence 11110 00 111111124554 57999999998654444333311 1111211111 1235888888888888 Q ss_pred eeeee-eecCc--cccccCcceeecCcchhhhcccc Q lcl|Aclame:pro 471 TRYGV-QVNPF--ADPTASATKVGNGAPVAASMGKN 503 (514) Q Consensus 471 tRY~l-~~nPf--~~~~~~~~~i~~~~~~~~~~~~~ 503 (514) -|||+ ..||= ...+-. .++|- T Consensus 249 ~~~~~~v~~~~~vv~~t~~------------~a~~~ 272 (272) T protein:vir:98 249 KHYGVYLYKAEKAVKITLK------------DAAKK 272 (272) T ss_pred EEEEEEEEcCCceEEEEec------------ccccC Confidence 89998 45662 111111 12222 No 20 >protein:vir:81100 Length: 415 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:1891 # MgeName: tp310-1 # Cross-refs: genbank:acc:YP_001429874;genbank:gi:156603927;genbank:GeneID:5525320 Probab=95.44 E-value=0.0021 Score=35.30 Aligned_cols=343 Identities=15% Similarity=0.134 Sum_probs=129.1 Q ss_pred Ccchhhhh-------hhhccccccccccccchhhhhhh-hhhhhHH------HHHHhcccccchhhhhhhcccccccccc Q lcl|Aclame:pro 1 MNLTEKWK-------DLLEAEGADMPEIATATKQKIMS-KIFENQD------RDINNDPMYRDPQLVEAFNAGLNEAVVN 66 (514) Q Consensus 1 ~~l~~kw~-------p~l~~~~~~~~~i~~~~~~~~~~-~~~enq~------~~~~~~~~~~~~~~~~~~~~~~~~a~~~ 66 (514) ..|.++.. .+.+..+ .++....+..... +..+++. ..+++... ...-..+|...+.... T Consensus 45 ~~l~~~i~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~-- 117 (415) T protein:vir:81 45 TDLRSQIQEKQEELDKLKEKDG---TSENNQQSVEVNEARTYRNQANINDLGISIQNTKV--TSQEVRDFTEYLETRN-- 117 (415) T ss_pred HHHHHHHHHHHHHHHHHHHHHh---hhhhcccccccchhhhHHHHHHHHHHhhhhhhhhh--HHHHHHHHHHHHhhhh-- Confidence 11111111 1110000 0111111111111 1111110 00010000 0001111111111100 Q ss_pred ccccccccccccccccccccccceeee--hhhhhhhhhhhhcceeEEecCCcccceeeeeeeeecCCCCccccccccccC Q lcl|Aclame:pro 67 GDHGYDPANIAQGVTTGAVTNIGPTVM--GMVRRAIPQLIAFDIAGVQPMTGPTSQVFTLRSVYGKDPLTGAEAFHPTRQ 144 (514) Q Consensus 67 ~~~g~~~~~~~~st~tg~v~~~~P~l~--~l~Rra~~~LIa~DI~GVQPmTgPTGLIFAMRSrY~~~~~tg~EA~~~~nE 144 (514) +.....-.+..|. ..-|.-+ .+++++..+..-.+++.|+||++..+-+--.+.. . +..+ T Consensus 118 -----~~~~~~~~~~~gg--~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~--~----~~~~------ 178 (415) T protein:vir:81 118 -----DIQGGSLKTDSGF--VVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQS--E----VAAL------ 178 (415) T ss_pred -----hhhhccccccccc--cccchHHHHHHHHHHHhhhhhhhheeeeeccCCceeEEEEeec--C----Cccc------ Confidence 0000000111111 1112111 2445555667788999999999887754322211 1 0000 Q ss_pred CCCccCccccccccccccccccccccccccccccccccccccccccccccccccccCccccccccccccccccccccccc Q lcl|Aclame:pro 145 ADASFSGQAAASTIADFPTTGAATDGTPYKAEVTTSGGDVSMRYFLALGAVTLAVAGQMTATEYTDGVAGGLLVEIDAGM 224 (514) Q Consensus 145 adt~fSG~~~~~~~~~~~~~~~~t~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~y~~~~Gm 224 (514) .|-+ ++ T Consensus 179 ---~~v~----------------------------------------------------------------------E~- 184 (415) T protein:vir:81 179 ---EKVE----------------------------------------------------------------------EL- 184 (415) T ss_pred ---eeec----------------------------------------------------------------------cc- Confidence 0000 00 Q ss_pred cchhhhccccCCCCCCcccccceeEEEEEEEEeecccccccccHHHHHHHHhhcCCChhHHHHHHHHHHHHHHhhHHHHH Q lcl|Aclame:pro 225 ATSQAELQENFNGSSNNEWNEMSFRIDKQVVEAKSRQLKAQYSIELAQDLRAVHGLDADAELSGILANEVMVELNREIVN 304 (514) Q Consensus 225 tTa~aEal~~~ggs~~~~f~EMsFsIEK~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAEaELanILStEImlEINReii~ 304 (514) ++. ...+...|.+..|.+.|. +-...+|-||.+|- ..|.+++|.+-|+..|...+|+.|+. T Consensus 185 ----~~~----~~~~~~~~~~v~~~~~k~-------~~~~~iS~ell~ds----~~~l~~~i~~~l~~~~~~~~~~~il~ 245 (415) T protein:vir:81 185 ----EEN----PELAVKPFFQLAYDINTH-------RGYFRISREAIEDA----KVNVLQELKLWMARTIAATRNKAIID 245 (415) T ss_pred ----ccc----CcccccceeeEEeeeeee-------EeeehhhHHHHhhc----hHHHHHHHHHHHHHHHHHHHHHHHhh Confidence 000 000112345555555555 44556999999984 35679999999999999999999965 Q ss_pred HHhhheeecccccccccCCcce-eccccccccccchhhHHHHHHHHHHHHHHHHHHHHhcccccccEEEEChhHHhHHhh Q lcl|Aclame:pro 305 LVNSQAQIGKSGWTQGAGAAGV-FDFSDAVDVKGARWAGEAYKALLIQIEKEANEIGRQTGRGNGNFIIASRNVVSALSM 383 (514) Q Consensus 305 ~l~~~a~v~~~~~~~~v~~~g~-~dl~~~~d~~~~rwa~e~~r~L~~~i~~~a~~I~~~T~r~~~n~~v~S~~va~~L~~ 383 (514) -.-.-.. ....... ...+. ...... . ..+....++..+... ..+.+.+||+|.....|.. T Consensus 246 g~g~g~~--~~~~~~~-~~~~~~~~~~~~-----~--~~~~i~~~~~~~~~~---------~~~~~~~v~n~~~~~~l~~ 306 (415) T protein:vir:81 246 VITKGST--GSTSSGF-EKEGKKLEVKKA-----K--SLDDIKDAINLNVKP---------NYEHNVAIVSQTMFAKLDK 306 (415) T ss_pred ccccCcc--ccccccc-cccccccccccc-----c--chhHHHHHHHhhhhh---------ccCCCEEEEcHHHHHHHHH Confidence 4421100 0000000 00000 000000 0 112233333333221 2456778999999888875 Q ss_pred ccccccchhccccCccccccccCceEEEEecCceEEEecCCCccceEEEEEecCCCcccceeecc----cccc----ccc Q lcl|Aclame:pro 384 TDTLVGPAAQGMQDGSMNTDTNQTVFAGVLGGRFKVYIDQYAVNDYFTVGFKGSTEMDAGVFYSP----YVPL----TPL 455 (514) Q Consensus 384 ~g~~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~kG~~~~~~~~fy~P----Yv~~----~~~ 455 (514) ..--..-+. ...+.++ -..++|.| ++|++.++.+.. -.|+ ..++|+- |+.. ..+ T Consensus 307 lkd~~G~~l-------~~~~~~~-~~~~~l~G-~pV~~~~~~~~~-----~~~~----~~~~~Gd~~~~~~~~~~~~~~v 368 (415) T protein:vir:81 307 MKDKLGNYL-------IQPDVKE-KTQQRLLG-AKIEILPDEVLG-----QKGN----NTLIIGNLKDAIVLFDRSQYQA 368 (415) T ss_pred hhccCCcee-------eccCcCC-CCCceecc-eeeEEecccccC-----CCCc----cEEEEEehhccEEEEeecceEE Confidence 311000000 0111111 11135544 478877665421 1111 1122221 1111 111 Q ss_pred cccCCccccceeeeeeeeee-eecC--ccccccCcceeecCcchhhhc Q lcl|Aclame:pro 456 RGSDSKNFQPVIGFKTRYGV-QVNP--FADPTASATKVGNGAPVAASM 500 (514) Q Consensus 456 ~~~dp~s~qp~~~~~tRY~l-~~nP--f~~~~~~~~~i~~~~~~~~~~ 500 (514) ...|-.+++..+....|++. ..+| |...+-..+--..|+ ..+-+ T Consensus 369 ~~~~~~~~~~~~~~~~r~d~~v~~~~a~~~~~~~~~~~~~~~-~~~~~ 415 (415) T protein:vir:81 369 SWTDYMHFGECLMIAVRQDCRILDYKSAIVIEYDDSERGEGD-LGLEA 415 (415) T ss_pred EEeccccCceEEEEEEEeccEEeccccEEEEEEeccCCCCCc-cccCC Confidence 22355667778888889987 4555 432211111001111 11111 No 21 >protein:vir:79987 Length: 415 # NCBI annotation: head protein # Family: family:all:21 # MgeID: mge:1875 # MgeName: tp310-3 # Cross-refs: genbank:acc:YP_001430002;genbank:gi:156604057;genbank:GeneID:5525447 Probab=95.44 E-value=0.0021 Score=35.30 Aligned_cols=343 Identities=15% Similarity=0.134 Sum_probs=129.1 Q ss_pred Ccchhhhh-------hhhccccccccccccchhhhhhh-hhhhhHH------HHHHhcccccchhhhhhhcccccccccc Q lcl|Aclame:pro 1 MNLTEKWK-------DLLEAEGADMPEIATATKQKIMS-KIFENQD------RDINNDPMYRDPQLVEAFNAGLNEAVVN 66 (514) Q Consensus 1 ~~l~~kw~-------p~l~~~~~~~~~i~~~~~~~~~~-~~~enq~------~~~~~~~~~~~~~~~~~~~~~~~~a~~~ 66 (514) ..|.++.. .+.+..+ .++....+..... +..+++. ..+++... ...-..+|...+.... T Consensus 45 ~~l~~~i~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~-- 117 (415) T protein:vir:79 45 TDLRSQIQEKQEELDKLKEKDG---TSENNQQSVEVNEARTYRNQANINDLGISIQNTKV--TSQEVRDFTEYLETRN-- 117 (415) T ss_pred HHHHHHHHHHHHHHHHHHHHHh---hhhhcccccccchhhhHHHHHHHHHHhhhhhhhhh--HHHHHHHHHHHHhhhh-- Confidence 11111111 1110000 0111111111111 1111110 00010000 0001111111111100 Q ss_pred ccccccccccccccccccccccceeee--hhhhhhhhhhhhcceeEEecCCcccceeeeeeeeecCCCCccccccccccC Q lcl|Aclame:pro 67 GDHGYDPANIAQGVTTGAVTNIGPTVM--GMVRRAIPQLIAFDIAGVQPMTGPTSQVFTLRSVYGKDPLTGAEAFHPTRQ 144 (514) Q Consensus 67 ~~~g~~~~~~~~st~tg~v~~~~P~l~--~l~Rra~~~LIa~DI~GVQPmTgPTGLIFAMRSrY~~~~~tg~EA~~~~nE 144 (514) +.....-.+..|. ..-|.-+ .+++++..+..-.+++.|+||++..+-+--.+.. . +..+ T Consensus 118 -----~~~~~~~~~~~gg--~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~--~----~~~~------ 178 (415) T protein:vir:79 118 -----DIQGGSLKTDSGF--VVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQS--E----VAAL------ 178 (415) T ss_pred -----hhhhccccccccc--cccchHHHHHHHHHHHhhhhhhhheeeeeccCCceeEEEEeec--C----Cccc------ Confidence 0000000111111 1112111 2445555667788999999999887754322211 1 0000 Q ss_pred CCCccCccccccccccccccccccccccccccccccccccccccccccccccccccCccccccccccccccccccccccc Q lcl|Aclame:pro 145 ADASFSGQAAASTIADFPTTGAATDGTPYKAEVTTSGGDVSMRYFLALGAVTLAVAGQMTATEYTDGVAGGLLVEIDAGM 224 (514) Q Consensus 145 adt~fSG~~~~~~~~~~~~~~~~t~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~y~~~~Gm 224 (514) .|-+ ++ T Consensus 179 ---~~v~----------------------------------------------------------------------E~- 184 (415) T protein:vir:79 179 ---EKVE----------------------------------------------------------------------EL- 184 (415) T ss_pred ---eeec----------------------------------------------------------------------cc- Confidence 0000 00 Q ss_pred cchhhhccccCCCCCCcccccceeEEEEEEEEeecccccccccHHHHHHHHhhcCCChhHHHHHHHHHHHHHHhhHHHHH Q lcl|Aclame:pro 225 ATSQAELQENFNGSSNNEWNEMSFRIDKQVVEAKSRQLKAQYSIELAQDLRAVHGLDADAELSGILANEVMVELNREIVN 304 (514) Q Consensus 225 tTa~aEal~~~ggs~~~~f~EMsFsIEK~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAEaELanILStEImlEINReii~ 304 (514) ++. ...+...|.+..|.+.|. +-...+|-||.+|- ..|.+++|.+-|+..|...+|+.|+. T Consensus 185 ----~~~----~~~~~~~~~~v~~~~~k~-------~~~~~iS~ell~ds----~~~l~~~i~~~l~~~~~~~~~~~il~ 245 (415) T protein:vir:79 185 ----EEN----PELAVKPFFQLAYDINTH-------RGYFRISREAIEDA----KVNVLQELKLWMARTIAATRNKAIID 245 (415) T ss_pred ----ccc----CcccccceeeEEeeeeee-------EeeehhhHHHHhhc----hHHHHHHHHHHHHHHHHHHHHHHHhh Confidence 000 000112345555555555 44556999999984 35679999999999999999999965 Q ss_pred HHhhheeecccccccccCCcce-eccccccccccchhhHHHHHHHHHHHHHHHHHHHHhcccccccEEEEChhHHhHHhh Q lcl|Aclame:pro 305 LVNSQAQIGKSGWTQGAGAAGV-FDFSDAVDVKGARWAGEAYKALLIQIEKEANEIGRQTGRGNGNFIIASRNVVSALSM 383 (514) Q Consensus 305 ~l~~~a~v~~~~~~~~v~~~g~-~dl~~~~d~~~~rwa~e~~r~L~~~i~~~a~~I~~~T~r~~~n~~v~S~~va~~L~~ 383 (514) -.-.-.. ....... ...+. ...... . ..+....++..+... ..+.+.+||+|.....|.. T Consensus 246 g~g~g~~--~~~~~~~-~~~~~~~~~~~~-----~--~~~~i~~~~~~~~~~---------~~~~~~~v~n~~~~~~l~~ 306 (415) T protein:vir:79 246 VITKGST--GSTSSGF-EKEGKKLEVKKA-----K--SLDDIKDAINLNVKP---------NYEHNVAIVSQTMFAKLDK 306 (415) T ss_pred ccccCcc--ccccccc-cccccccccccc-----c--chhHHHHHHHhhhhh---------ccCCCEEEEcHHHHHHHHH Confidence 4421100 0000000 00000 000000 0 112233333333221 2456778999999888875 Q ss_pred ccccccchhccccCccccccccCceEEEEecCceEEEecCCCccceEEEEEecCCCcccceeecc----cccc----ccc Q lcl|Aclame:pro 384 TDTLVGPAAQGMQDGSMNTDTNQTVFAGVLGGRFKVYIDQYAVNDYFTVGFKGSTEMDAGVFYSP----YVPL----TPL 455 (514) Q Consensus 384 ~g~~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~kG~~~~~~~~fy~P----Yv~~----~~~ 455 (514) ..--..-+. ...+.++ -..++|.| ++|++.++.+.. -.|+ ..++|+- |+.. ..+ T Consensus 307 lkd~~G~~l-------~~~~~~~-~~~~~l~G-~pV~~~~~~~~~-----~~~~----~~~~~Gd~~~~~~~~~~~~~~v 368 (415) T protein:vir:79 307 MKDKLGNYL-------IQPDVKE-KTQQRLLG-AKIEILPDEVLG-----QKGN----NTLIIGNLKDAIVLFDRSQYQA 368 (415) T ss_pred hhccCCcee-------eccCcCC-CCCceecc-eeeEEecccccC-----CCCc----cEEEEEehhccEEEEeecceEE Confidence 311000000 0111111 11135544 478877665421 1111 1122221 1111 111 Q ss_pred cccCCccccceeeeeeeeee-eecC--ccccccCcceeecCcchhhhc Q lcl|Aclame:pro 456 RGSDSKNFQPVIGFKTRYGV-QVNP--FADPTASATKVGNGAPVAASM 500 (514) Q Consensus 456 ~~~dp~s~qp~~~~~tRY~l-~~nP--f~~~~~~~~~i~~~~~~~~~~ 500 (514) ...|-.+++..+....|++. ..+| |...+-..+--..|+ ..+-+ T Consensus 369 ~~~~~~~~~~~~~~~~r~d~~v~~~~a~~~~~~~~~~~~~~~-~~~~~ 415 (415) T protein:vir:79 369 SWTDYMHFGECLMIAVRQDCRILDYKSAIVIEYDDSERGEGD-LGLEA 415 (415) T ss_pred EEeccccCceEEEEEEEeccEEeccccEEEEEEeccCCCCCc-cccCC Confidence 22355667778888889987 4555 432211111001111 11111 No 22 >protein:vir:98339 Length: 415 # NCBI annotation: putative capsid protein # Family: family:all:21 # MgeID: mge:1581 # MgeName: phiPVL(108) # Cross-refs: genbank:acc:YP_918931;genbank:gi:119443693;genbank:GeneID:4594501 Probab=95.44 E-value=0.0021 Score=35.30 Aligned_cols=343 Identities=15% Similarity=0.134 Sum_probs=129.1 Q ss_pred Ccchhhhh-------hhhccccccccccccchhhhhhh-hhhhhHH------HHHHhcccccchhhhhhhcccccccccc Q lcl|Aclame:pro 1 MNLTEKWK-------DLLEAEGADMPEIATATKQKIMS-KIFENQD------RDINNDPMYRDPQLVEAFNAGLNEAVVN 66 (514) Q Consensus 1 ~~l~~kw~-------p~l~~~~~~~~~i~~~~~~~~~~-~~~enq~------~~~~~~~~~~~~~~~~~~~~~~~~a~~~ 66 (514) ..|.++.. .+.+..+ .++....+..... +..+++. ..+++... ...-..+|...+.... T Consensus 45 ~~l~~~i~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~-- 117 (415) T protein:vir:98 45 TDLRSQIQEKQEELDKLKEKDG---TSENNQQSVEVNEARTYRNQANINDLGISIQNTKV--TSQEVRDFTEYLETRN-- 117 (415) T ss_pred HHHHHHHHHHHHHHHHHHHHHh---hhhhcccccccchhhhHHHHHHHHHHhhhhhhhhh--HHHHHHHHHHHHhhhh-- Confidence 11111111 1110000 0111111111111 1111110 00010000 0001111111111100 Q ss_pred ccccccccccccccccccccccceeee--hhhhhhhhhhhhcceeEEecCCcccceeeeeeeeecCCCCccccccccccC Q lcl|Aclame:pro 67 GDHGYDPANIAQGVTTGAVTNIGPTVM--GMVRRAIPQLIAFDIAGVQPMTGPTSQVFTLRSVYGKDPLTGAEAFHPTRQ 144 (514) Q Consensus 67 ~~~g~~~~~~~~st~tg~v~~~~P~l~--~l~Rra~~~LIa~DI~GVQPmTgPTGLIFAMRSrY~~~~~tg~EA~~~~nE 144 (514) +.....-.+..|. ..-|.-+ .+++++..+..-.+++.|+||++..+-+--.+.. . +..+ T Consensus 118 -----~~~~~~~~~~~gg--~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~--~----~~~~------ 178 (415) T protein:vir:98 118 -----DIQGGSLKTDSGF--VVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQS--E----VAAL------ 178 (415) T ss_pred -----hhhhccccccccc--cccchHHHHHHHHHHHhhhhhhhheeeeeccCCceeEEEEeec--C----Cccc------ Confidence 0000000111111 1112111 2445555667788999999999887754322211 1 0000 Q ss_pred CCCccCccccccccccccccccccccccccccccccccccccccccccccccccccCccccccccccccccccccccccc Q lcl|Aclame:pro 145 ADASFSGQAAASTIADFPTTGAATDGTPYKAEVTTSGGDVSMRYFLALGAVTLAVAGQMTATEYTDGVAGGLLVEIDAGM 224 (514) Q Consensus 145 adt~fSG~~~~~~~~~~~~~~~~t~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~y~~~~Gm 224 (514) .|-+ ++ T Consensus 179 ---~~v~----------------------------------------------------------------------E~- 184 (415) T protein:vir:98 179 ---EKVE----------------------------------------------------------------------EL- 184 (415) T ss_pred ---eeec----------------------------------------------------------------------cc- Confidence 0000 00 Q ss_pred cchhhhccccCCCCCCcccccceeEEEEEEEEeecccccccccHHHHHHHHhhcCCChhHHHHHHHHHHHHHHhhHHHHH Q lcl|Aclame:pro 225 ATSQAELQENFNGSSNNEWNEMSFRIDKQVVEAKSRQLKAQYSIELAQDLRAVHGLDADAELSGILANEVMVELNREIVN 304 (514) Q Consensus 225 tTa~aEal~~~ggs~~~~f~EMsFsIEK~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAEaELanILStEImlEINReii~ 304 (514) ++. ...+...|.+..|.+.|. +-...+|-||.+|- ..|.+++|.+-|+..|...+|+.|+. T Consensus 185 ----~~~----~~~~~~~~~~v~~~~~k~-------~~~~~iS~ell~ds----~~~l~~~i~~~l~~~~~~~~~~~il~ 245 (415) T protein:vir:98 185 ----EEN----PELAVKPFFQLAYDINTH-------RGYFRISREAIEDA----KVNVLQELKLWMARTIAATRNKAIID 245 (415) T ss_pred ----ccc----CcccccceeeEEeeeeee-------EeeehhhHHHHhhc----hHHHHHHHHHHHHHHHHHHHHHHHhh Confidence 000 000112345555555555 44556999999984 35679999999999999999999965 Q ss_pred HHhhheeecccccccccCCcce-eccccccccccchhhHHHHHHHHHHHHHHHHHHHHhcccccccEEEEChhHHhHHhh Q lcl|Aclame:pro 305 LVNSQAQIGKSGWTQGAGAAGV-FDFSDAVDVKGARWAGEAYKALLIQIEKEANEIGRQTGRGNGNFIIASRNVVSALSM 383 (514) Q Consensus 305 ~l~~~a~v~~~~~~~~v~~~g~-~dl~~~~d~~~~rwa~e~~r~L~~~i~~~a~~I~~~T~r~~~n~~v~S~~va~~L~~ 383 (514) -.-.-.. ....... ...+. ...... . ..+....++..+... ..+.+.+||+|.....|.. T Consensus 246 g~g~g~~--~~~~~~~-~~~~~~~~~~~~-----~--~~~~i~~~~~~~~~~---------~~~~~~~v~n~~~~~~l~~ 306 (415) T protein:vir:98 246 VITKGST--GSTSSGF-EKEGKKLEVKKA-----K--SLDDIKDAINLNVKP---------NYEHNVAIVSQTMFAKLDK 306 (415) T ss_pred ccccCcc--ccccccc-cccccccccccc-----c--chhHHHHHHHhhhhh---------ccCCCEEEEcHHHHHHHHH Confidence 4421100 0000000 00000 000000 0 112233333333221 2456778999999888875 Q ss_pred ccccccchhccccCccccccccCceEEEEecCceEEEecCCCccceEEEEEecCCCcccceeecc----cccc----ccc Q lcl|Aclame:pro 384 TDTLVGPAAQGMQDGSMNTDTNQTVFAGVLGGRFKVYIDQYAVNDYFTVGFKGSTEMDAGVFYSP----YVPL----TPL 455 (514) Q Consensus 384 ~g~~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~kG~~~~~~~~fy~P----Yv~~----~~~ 455 (514) ..--..-+. ...+.++ -..++|.| ++|++.++.+.. -.|+ ..++|+- |+.. ..+ T Consensus 307 lkd~~G~~l-------~~~~~~~-~~~~~l~G-~pV~~~~~~~~~-----~~~~----~~~~~Gd~~~~~~~~~~~~~~v 368 (415) T protein:vir:98 307 MKDKLGNYL-------IQPDVKE-KTQQRLLG-AKIEILPDEVLG-----QKGN----NTLIIGNLKDAIVLFDRSQYQA 368 (415) T ss_pred hhccCCcee-------eccCcCC-CCCceecc-eeeEEecccccC-----CCCc----cEEEEEehhccEEEEeecceEE Confidence 311000000 0111111 11135544 478877665421 1111 1122221 1111 111 Q ss_pred cccCCccccceeeeeeeeee-eecC--ccccccCcceeecCcchhhhc Q lcl|Aclame:pro 456 RGSDSKNFQPVIGFKTRYGV-QVNP--FADPTASATKVGNGAPVAASM 500 (514) Q Consensus 456 ~~~dp~s~qp~~~~~tRY~l-~~nP--f~~~~~~~~~i~~~~~~~~~~ 500 (514) ...|-.+++..+....|++. ..+| |...+-..+--..|+ ..+-+ T Consensus 369 ~~~~~~~~~~~~~~~~r~d~~v~~~~a~~~~~~~~~~~~~~~-~~~~~ 415 (415) T protein:vir:98 369 SWTDYMHFGECLMIAVRQDCRILDYKSAIVIEYDDSERGEGD-LGLEA 415 (415) T ss_pred EEeccccCceEEEEEEEeccEEeccccEEEEEEeccCCCCCc-cccCC Confidence 22355667778888889987 4555 432211111001111 11111 No 23 >protein:vir:9410 Length: 415 # NCBI annotation: head protein # Family: family:all:21 # MgeID: mge:167 # MgeName: phi 13 # Cross-refs: genbank:acc:NP_803388;genbank:gi:29028700;genbank:GeneID:1258136 Probab=95.27 E-value=0.0024 Score=34.94 Aligned_cols=348 Identities=14% Similarity=0.119 Sum_probs=127.5 Q ss_pred Ccchhhhhhh-------hccccccccccccchhhhhhh-hhhh------hHHHHHHhcccccchhhhhhhcccccccccc Q lcl|Aclame:pro 1 MNLTEKWKDL-------LEAEGADMPEIATATKQKIMS-KIFE------NQDRDINNDPMYRDPQLVEAFNAGLNEAVVN 66 (514) Q Consensus 1 ~~l~~kw~p~-------l~~~~~~~~~i~~~~~~~~~~-~~~e------nq~~~~~~~~~~~~~~~~~~~~~~~~~a~~~ 66 (514) ..|.++..-+ .+.... ++......+... ..-+ +....+++... . ..-..+|...+... T Consensus 45 ~~l~~~i~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~-~~e~~~~~~~~~~~--- 116 (415) T protein:vir:94 45 TDLRSQIQEKQEELDKLKEKDGT---SENNQQSVEVNEASTYRNQANINDLGISIQNTKV-T-SQEVRDFTEYLETR--- 116 (415) T ss_pred HHHHHHHHHHHHHHHHHHHHHHh---hhhccccccccchhhHHHHHHHHHHHhhhhhhhh-h-HHHHHHHHHHhhhh--- Confidence 1111111111 000000 000000000000 0000 00001111100 0 00011111111110 Q ss_pred ccccccccccccccccccccccceeeehhhhhhhhhhhhcceeEEecCCcccceeeeeeeeecCCCCccccccccccCCC Q lcl|Aclame:pro 67 GDHGYDPANIAQGVTTGAVTNIGPTVMGMVRRAIPQLIAFDIAGVQPMTGPTSQVFTLRSVYGKDPLTGAEAFHPTRQAD 146 (514) Q Consensus 67 ~~~g~~~~~~~~st~tg~v~~~~P~l~~l~Rra~~~LIa~DI~GVQPmTgPTGLIFAMRSrY~~~~~tg~EA~~~~nEad 146 (514) .+.......+.+|...--..+.=.+++.+-+..+-.+++.++||++..+-+--.+ ... +.++ T Consensus 117 ----~~~~~~~~~~~~g~~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~--~~~----~~~~-------- 178 (415) T protein:vir:94 117 ----NDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVR--QSE----VAAL-------- 178 (415) T ss_pred ----hhhhhhccccccccccCcHHHHHHHHHHHHhhhhhhhhcceeeccCCceeEEEEe--ecC----Cccc-------- Confidence 0000001111112111111112234455557778899999999988765432221 111 0000 Q ss_pred CccCccccccccccccccccccccccccccccccccccccccccccccccccccCccccccccccccccccccccccccc Q lcl|Aclame:pro 147 ASFSGQAAASTIADFPTTGAATDGTPYKAEVTTSGGDVSMRYFLALGAVTLAVAGQMTATEYTDGVAGGLLVEIDAGMAT 226 (514) Q Consensus 147 t~fSG~~~~~~~~~~~~~~~~t~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~y~~~~GmtT 226 (514) .|-+ +| T Consensus 179 -~~v~----------------------------------------------------------------------Eg--- 184 (415) T protein:vir:94 179 -EKVE----------------------------------------------------------------------EL--- 184 (415) T ss_pred -eecc----------------------------------------------------------------------cc--- Confidence 0000 00 Q ss_pred hhhhccccCCCCCCcccccceeEEEEEEEEeecccccccccHHHHHHHHhhcCCChhHHHHHHHHHHHHHHhhHHHHHHH Q lcl|Aclame:pro 227 SQAELQENFNGSSNNEWNEMSFRIDKQVVEAKSRQLKAQYSIELAQDLRAVHGLDADAELSGILANEVMVELNREIVNLV 306 (514) Q Consensus 227 a~aEal~~~ggs~~~~f~EMsFsIEK~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAEaELanILStEImlEINReii~~l 306 (514) ++. ...+...|.+..|.+.|.. -.-.+|-||.+|-- +|.+++|.+-|...|..-+|+.|+.-. T Consensus 185 --~~~----~~~~~~~~~~i~~~~~k~~-------~~~~is~ell~ds~----~~~~~~i~~~l~~~~~~~~~~~il~g~ 247 (415) T protein:vir:94 185 --EEN----PELAVKPFFQLAYDINTHR-------GYFRISREAIEDAK----VNVLQELKLWMARTIAATRNKAIIDVI 247 (415) T ss_pred --ccc----cccccccceeeEeeheeee-------eechhhHHHHhhch----HHHHHHHHHHHHHHHHHHHHHHHhhcc Confidence 000 0001123555555555554 44569999999864 467999999999999999999996543 Q ss_pred hhheeecccccccccCCcce-eccccccccccchhhHHHHHHHHHHHHHHHHHHHHhcccccccEEEEChhHHhHHhhcc Q lcl|Aclame:pro 307 NSQAQIGKSGWTQGAGAAGV-FDFSDAVDVKGARWAGEAYKALLIQIEKEANEIGRQTGRGNGNFIIASRNVVSALSMTD 385 (514) Q Consensus 307 ~~~a~v~~~~~~~~v~~~g~-~dl~~~~d~~~~rwa~e~~r~L~~~i~~~a~~I~~~T~r~~~n~~v~S~~va~~L~~~g 385 (514) -.-.. ..-.......+. ....... ..+....++..+.. ...+.+.+|++|.....|.... T Consensus 248 g~g~~---~~~~~~~~~~~~~~~~~~~~-------~~~~i~~~~~~~~~---------~~~~~~~~vmn~~~~~~l~~lk 308 (415) T protein:vir:94 248 TKGST---GSTSSGFEKEGKKLEVKKAK-------SLDDIKDAINLNVK---------PNYEHNVAIVSQTMFAKLDKMK 308 (415) T ss_pred ccCcc---cccccccccccccccccccc-------chHHHHHHHHhhhh---------hccCCCEEEEcHHHHHHHHHhh Confidence 21100 000000000000 0000000 11223333333321 1345778899999988887531 Q ss_pred ccccchhccccCccccccccCceEEEEecCceEEEecCCCcc----ce-EEEEEecCCCcccceeeccccccccccccCC Q lcl|Aclame:pro 386 TLVGPAAQGMQDGSMNTDTNQTVFAGVLGGRFKVYIDQYAVN----DY-FTVGFKGSTEMDAGVFYSPYVPLTPLRGSDS 460 (514) Q Consensus 386 ~~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~----dy-~~vG~kG~~~~~~~~fy~PYv~~~~~~~~dp 460 (514) --..-+. ...+.+. -..++|.| ++|++.+..+. +. +++|--.. . +..... ....+...|- T Consensus 309 d~~G~~l-------~~~~~~~-~~~~~l~G-~pV~~~~~~~~~~~~~~~i~~gd~~~----~-~~~~~~-~~~~v~~~~~ 373 (415) T protein:vir:94 309 DKLGNYL-------IQPDVKE-KTQQRLLG-AKIEILPDEVLGQKGNNTLIIGNLKD----A-IVLFDR-SQYQASWTDY 373 (415) T ss_pred ccCCCee-------eccCcCC-CCCceecc-eeeEEecccccCCCCccEEEEEehhc----c-EEEEee-cceEEEEecc Confidence 1000000 0111111 01134544 47887766542 11 23331000 0 000000 0011122345 Q ss_pred ccccceeeeeeeeee-eecC--ccccccCcceeecCcchhhhc Q lcl|Aclame:pro 461 KNFQPVIGFKTRYGV-QVNP--FADPTASATKVGNGAPVAASM 500 (514) Q Consensus 461 ~s~qp~~~~~tRY~l-~~nP--f~~~~~~~~~i~~~~~~~~~~ 500 (514) .+++-.+-...|+++ ..+| |...+-..+--..|+ ..+-+ T Consensus 374 ~~~~~~~r~~~r~d~~~~~~~a~~~~~~~~~~~~~~~-~~~~~ 415 (415) T protein:vir:94 374 MHFGECLMIAVRQDCRILDYKSAIVIEYDDSERGEGD-LGLEA 415 (415) T ss_pred ccCceEEEEEEEeccEEeccccEEEEEEeccCCCCCc-cccCC Confidence 566667777888887 4555 332211111001111 11111 No 24 >protein:vir:41 Length: 299 # NCBI annotation: major capsid protein # Family: family:all:507 # MgeID: mge:2 # MgeName: A118 # Cross-refs: genbank:acc:NP_463467;swissprot:trembl:q9t1b7;genbank:gi:16798789;uniprot:Q9T1B7;genbank:GeneID:922353 Probab=95.20 E-value=0.0026 Score=34.80 Aligned_cols=275 Identities=13% Similarity=0.069 Sum_probs=123.6 Q ss_pred cccccccccccc-ccccccccceeeehhhhhhhhhhhhcceeEEecCCcccceeeeeeeeecCCCCccccccccccCCCC Q lcl|Aclame:pro 69 HGYDPANIAQGV-TTGAVTNIGPTVMGMVRRAIPQLIAFDIAGVQPMTGPTSQVFTLRSVYGKDPLTGAEAFHPTRQADA 147 (514) Q Consensus 69 ~g~~~~~~~~st-~tg~v~~~~P~l~~l~Rra~~~LIa~DI~GVQPmTgPTGLIFAMRSrY~~~~~tg~EA~~~~nEadt 147 (514) -|+++.....+. ..+-|+. ...--+++++..+.+-.+++-+-||++.+--+ ...+ +.+| T Consensus 1 ~g~~a~~~~~~~~~~~~iP~--~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~-----~~~~----~~~a--------- 60 (299) T protein:vir:41 1 MGFNPDTTTMQSAKTGSIPI--NISEQIITGVKNGSAAMKLAKAVPMTKPEEEF-----TFMS----GVGA--------- 60 (299) T ss_pred CCcCCCcccccCCCceecch--hHHHHHHHHHHhcchhhhhceeeecCCCcEEE-----EEEc----CCce--------- Confidence 233222111111 1111111 11123555666788888999999998766321 1111 0000 Q ss_pred ccCccccccccccccccccccccccccccccccccccccccccccccccccccCccccccccccccccccccccccccch Q lcl|Aclame:pro 148 SFSGQAAASTIADFPTTGAATDGTPYKAEVTTSGGDVSMRYFLALGAVTLAVAGQMTATEYTDGVAGGLLVEIDAGMATS 227 (514) Q Consensus 148 ~fSG~~~~~~~~~~~~~~~~t~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~y~~~~GmtTa 227 (514) .|- T Consensus 61 ~~v----------------------------------------------------------------------------- 63 (299) T protein:vir:41 61 FWV----------------------------------------------------------------------------- 63 (299) T ss_pred eee----------------------------------------------------------------------------- Confidence 000 Q ss_pred hhhccccCCCCCCcccccceeEEEEEEEEeecccccccccHHHHHHHHhhcCCChhHHHHHHHHHHHHHHhhHHHHHHHh Q lcl|Aclame:pro 228 QAELQENFNGSSNNEWNEMSFRIDKQVVEAKSRQLKAQYSIELAQDLRAVHGLDADAELSGILANEVMVELNREIVNLVN 307 (514) Q Consensus 228 ~aEal~~~ggs~~~~f~EMsFsIEK~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAEaELanILStEImlEINReii~~l~ 307 (514) +| +.+++|...++++++...|..+-...+|.||.+|-. .|.++.|.+.|+..|...+|+.|+.- T Consensus 64 -~E---------~~~~~~~~~~f~~v~l~~~k~~~~~~is~ell~ds~----~~~~~~i~~~l~~a~~~~~d~a~l~G-- 127 (299) T protein:vir:41 64 -DE---------AERIQTSKPTFTKAKMRSKKMGVIIPTTKENLNYSV----TNFFSLMQAEIVEAFYKKFDQAVFTG-- 127 (299) T ss_pred -ec---------CccccccccceeEEEEeeEEEEEeehhhHHHHhcCH----HHHHHHHHHHHHHHHHHHHHHHHhhc-- Confidence 01 122444455667888888888888889999999854 35689999999999999999988532 Q ss_pred hheeecccccccccCCcceeccccccccccchhhHHHHHHHHHHHHHHHHHHHHhcccccccEEEEChhHHhHHhhcccc Q lcl|Aclame:pro 308 SQAQIGKSGWTQGAGAAGVFDFSDAVDVKGARWAGEAYKALLIQIEKEANEIGRQTGRGNGNFIIASRNVVSALSMTDTL 387 (514) Q Consensus 308 ~~a~v~~~~~~~~v~~~g~~dl~~~~d~~~~rwa~e~~r~L~~~i~~~a~~I~~~T~r~~~n~~v~S~~va~~L~~~g~~ 387 (514) ..+- .+.|++...... ... .......+.-|.++.+.+.. ..++++.+||+|+....|.... T Consensus 128 -~g~~---------~~~gil~~~~~~-~~~----~~~~~~~~~~l~~~~~~l~~--~~~~~~~~v~n~~~~~~L~~lk-- 188 (299) T protein:vir:41 128 -VESP---------YNWNILKSATDA-SNL----VEETANKYDDLNEAIGLIEA--EDLEPNGIATIRKQRVKYRSTK-- 188 (299) T ss_pred -ccCc---------cccccccccccc-cee----eccccccHHHHHHHHHhhhc--ccCCcCEEEEcHHHHHHHHHhh-- Confidence 1110 011221100000 000 00000011223333344432 2356778999999998888521 Q ss_pred ccchhccccCccccccccCceEEEEecCceEEEecCCCccce----EE--------EEEecCCCcccceeeccccccccc Q lcl|Aclame:pro 388 VGPAAQGMQDGSMNTDTNQTVFAGVLGGRFKVYIDQYAVNDY----FT--------VGFKGSTEMDAGVFYSPYVPLTPL 455 (514) Q Consensus 388 ~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy----~~--------vG~kG~~~~~~~~fy~PYv~~~~~ 455 (514) . ..|. .-...+.++.. ++|. |++|++.++.+.+= ++ +|..++.+++-. .+.... T Consensus 189 -d--~~G~--~l~~~~~~~~~--~~l~-G~PV~~~~~~~~~~~~~~~~~gdfs~~~i~~~~~~~i~~~------~~~~~~ 254 (299) T protein:vir:41 189 -D--GNGM--PIFNTATSNGV--DDVL-GLPIAYTPKYTFGDKDISELVGDWNQAYYGILRGVEYEIL------TEATLT 254 (299) T ss_pred -c--cCCc--eeecCCcCCCC--ceec-ceeeEEecccCCCCCceEEEEEecccEEEEEecCcEEEEe------eccccc Confidence 1 1110 00111111111 4555 57998888877541 22 222222111000 000000 Q ss_pred cccCCcc-----ccc-eee--eeeeeeee-ecC--ccccccCcceeecCcchhhhcc Q lcl|Aclame:pro 456 RGSDSKN-----FQP-VIG--FKTRYGVQ-VNP--FADPTASATKVGNGAPVAASMG 501 (514) Q Consensus 456 ~~~dp~s-----~qp-~~~--~~tRY~l~-~nP--f~~~~~~~~~i~~~~~~~~~~~ 501 (514) ...|++. ||- .+. ...|++.. .+| |+..+.-.+ + T Consensus 255 ~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~~A~~~l~~~aa------------~ 299 (299) T protein:vir:41 255 TVADETGKPLNLAERDMAAIKATFEVGFMVVKDEAFSAVQPKAG------------N 299 (299) T ss_pred ccccccccchhhhhcCcEEEEEEEEeccEEecccceEEEEeccC------------C Confidence 1112221 222 233 33577663 344 333211110 0 No 25 >protein:vir:4856 Length: 293 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:106 # MgeName: DT1 # Cross-refs: genbank:acc:NP_049396;genbank:gi:9632424;genbank:GeneID:1258532 Probab=93.53 E-value=0.0072 Score=32.32 Aligned_cols=261 Identities=15% Similarity=0.112 Sum_probs=110.4 Q ss_pred hhhhhccccccccccccccccccccccccccccccccceeeehhhhhhhhhhhhcceeEEecCCcccceeeeeeeeecCC Q lcl|Aclame:pro 52 LVEAFNAGLNEAVVNGDHGYDPANIAQGVTTGAVTNIGPTVMGMVRRAIPQLIAFDIAGVQPMTGPTSQVFTLRSVYGKD 131 (514) Q Consensus 52 ~~~~~~~~~~~a~~~~~~g~~~~~~~~st~tg~v~~~~P~l~~l~Rra~~~LIa~DI~GVQPmTgPTGLIFAMRSrY~~~ 131 (514) +++++.....+ ++|+ +-+ ..+.+ .+++.+-++.+-.+++.+-||++.+|-+ .+... T Consensus 1 ~l~~~~~~t~~-----~gg~----liP-------~~~~~---~Ii~~~~~~~~l~~~~~~~~~~~~~g~~-----~~~~~ 56 (293) T protein:vir:48 1 MLDSKTDHSGS-----DAGL----TIP-------QDIRT---AINTLVRQYDSLQEYVNVENVTTLTGSR-----VYEKW 56 (293) T ss_pred CceeecccccC-----cCce----Eec-------hhHHH---HHHHHHHhhhhhhhhceeeeccCCcceE-----EEEee Confidence 23332221111 0010 000 01111 1344444666677888888888766511 11111 Q ss_pred CCccccccccccCCCCccCccccccccccccccccccccccccccccccccccccccccccccccccccCcccccccccc Q lcl|Aclame:pro 132 PLTGAEAFHPTRQADASFSGQAAASTIADFPTTGAATDGTPYKAEVTTSGGDVSMRYFLALGAVTLAVAGQMTATEYTDG 211 (514) Q Consensus 132 ~~tg~EA~~~~nEadt~fSG~~~~~~~~~~~~~~~~t~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 211 (514) ...+. .+.| T Consensus 57 ~~~~~---------~a~~-------------------------------------------------------------- 65 (293) T protein:vir:48 57 TDITG---------LANI-------------------------------------------------------------- 65 (293) T ss_pred cCCCc---------ceee-------------------------------------------------------------- Confidence 00000 0000 Q ss_pred ccccccccccccccchhhhccccCCCCCCcccccce-eEEEEEEEEeecccccccccHHHHHHHHhhcCCChhHHHHHHH Q lcl|Aclame:pro 212 VAGGLLVEIDAGMATSQAELQENFNGSSNNEWNEMS-FRIDKQVVEAKSRQLKAQYSIELAQDLRAVHGLDADAELSGIL 290 (514) Q Consensus 212 ~a~~~~y~~~~GmtTa~aEal~~~ggs~~~~f~EMs-FsIEK~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAEaELanIL 290 (514) + +| +..++|.+ .++++++..+|.-+-...+|-||.+|. .+|.|++|.+-| T Consensus 66 --------v--------~E---------g~~~~~~~~~~~~~i~l~~~k~~~~~~iS~ell~ds----~~~l~~~i~~~l 116 (293) T protein:vir:48 66 --------D--------DE---------AGKIADIDDPKLSLIKYTIKRYAGISTVTNSLLADS----AENILAWLSGWI 116 (293) T ss_pred --------e--------cC---------CcccccccccceeEEEEeeeEEEEeehhhHHHHhhh----hHHHHHHHHHHH Confidence 0 01 11233332 345566666666666677999999986 367899999999 Q ss_pred HHHHHHHhhHHHHHHHhhheeecccccccccCCcceeccccccccccchhhHHHHHHHHHHHHHHHHHHHHhcccccccE Q lcl|Aclame:pro 291 ANEVMVELNREIVNLVNSQAQIGKSGWTQGAGAAGVFDFSDAVDVKGARWAGEAYKALLIQIEKEANEIGRQTGRGNGNF 370 (514) Q Consensus 291 StEImlEINReii~~l~~~a~v~~~~~~~~v~~~g~~dl~~~~d~~~~rwa~e~~r~L~~~i~~~a~~I~~~T~r~~~n~ 370 (514) +..|..-+|+.|+.-+...++ ..+.+++ +....|+.++.. .- ..... T Consensus 117 a~~~~~~~~~~i~~g~~~~~~-----------~~~~~~~-------------d~i~~~~~~l~~-------~~--~~~a~ 163 (293) T protein:vir:48 117 AKKVVVTRNKAILGVVDKLPT-----------KPTLTKW-------------DDIIDLEAKVDP-------AI--KQTSF 163 (293) T ss_pred HHHHHHHHHhHHhhccccccc-----------cccccCH-------------HHHHHHHHhhhh-------hh--cCCCE Confidence 999999999999654422211 1112211 223444444432 11 22346 Q ss_pred EEEChhHHhHHhhccccccchhccccCccccccccCceEEEEecCceEEEe--cCCCcc--------------ceEEEEE Q lcl|Aclame:pro 371 IIASRNVVSALSMTDTLVGPAAQGMQDGSMNTDTNQTVFAGVLGGRFKVYI--DQYAVN--------------DYFTVGF 434 (514) Q Consensus 371 ~v~S~~va~~L~~~g~~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~--D~y~~~--------------dy~~vG~ 434 (514) .+|++.....|.... . ..| ...-.+...+-..++|.|+ +|++ |.+.+. +++.++. T Consensus 164 ~vmn~~~~~~L~~lk---d--~~g---~~l~~~~~~~~~~~~l~G~-Pv~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~ 234 (293) T protein:vir:48 164 FLTNTSGFTALKKVK---N--ALG---DYLMERDVKSPTGYSIAGF-AVKEISDRWLPNASSGVMPLYFGDLKQAVTLFD 234 (293) T ss_pred EEEcHHHHHHHHHhh---c--cCC---ceEeecCcCCCCCceecce-eeEEecccccCCccCCceEEEEEeccceEEEEE Confidence 788999887776421 0 000 1111111111111355544 6665 333221 1222222 Q ss_pred ecCCCcccceeeccccccccccccCCccccceeeeeeeeee---------------eecCccccccCcc Q lcl|Aclame:pro 435 KGSTEMDAGVFYSPYVPLTPLRGSDSKNFQPVIGFKTRYGV---------------QVNPFADPTASAT 488 (514) Q Consensus 435 kG~~~~~~~~fy~PYv~~~~~~~~dp~s~qp~~~~~tRY~l---------------~~nPf~~~~~~~~ 488 (514) ++.... -..++.. .+-.+-|=.+-...||+. .+-|+.+..+-.. T Consensus 235 ~~~~~i----~~~~~~~------~~~~~~~~~~r~~~r~d~~~~~~~a~~~l~~~~~~~~~~~~~~~~~ 293 (293) T protein:vir:48 235 RQQMSL----LSTNIGG------GAFETDTTKVRVIDRFDVVATDTEAFVPASFKAIADQKGNIGSTAV 293 (293) T ss_pred ecceEE----EEecccc------hhhhcCeEEEEEEEeeCcEEecccceEEEEeeccccCCccccccCC Confidence 222111 1111100 011122233334444443 3334333221111 No 26 >protein:vir:93742 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1475 # MgeName: 55 # Cross-refs: genbank:acc:YP_240459;genbank:gi:66396126;genbank:GeneID:5133511 Probab=93.22 E-value=0.0083 Score=31.98 Aligned_cols=271 Identities=9% Similarity=-0.041 Sum_probs=114.1 Q ss_pred CccccccccccCCCCccCccccccccccccccccccccccccccccccccccccccccccccccccccCccccccccccc Q lcl|Aclame:pro 133 LTGAEAFHPTRQADASFSGQAAASTIADFPTTGAATDGTPYKAEVTTSGGDVSMRYFLALGAVTLAVAGQMTATEYTDGV 212 (514) Q Consensus 133 ~tg~EA~~~~nEadt~fSG~~~~~~~~~~~~~~~~t~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 212 (514) +. ...|..+- ...+..... .+.... .............. .+. .. T Consensus 1 ma---------~~~T~~~~---------------~iiPev~~~-------~v~~~~--~~~~~~~~~~~~~~--~l~-g~ 44 (274) T protein:vir:93 1 MP---------QGITKTSN---------------QIIPEVLAP-------MMQAQL--EKKLRFASFAEVDS--TLQ-GQ 44 (274) T ss_pred CC---------ccceehhh---------------eechHHHHH-------HHHHHH--Hhhhhhcccccccc--ccc-CC Confidence 00 00000000 000000000 000000 00000000000000 000 00 Q ss_pred cccccccccccccchhhhccccCCCCCCcccccceeEEEEEEEEeecccccccccHHHHHHHHhhcCCChhHHHHHHHHH Q lcl|Aclame:pro 213 AGGLLVEIDAGMATSQAELQENFNGSSNNEWNEMSFRIDKQVVEAKSRQLKAQYSIELAQDLRAVHGLDADAELSGILAN 292 (514) Q Consensus 213 a~~~~y~~~~GmtTa~aEal~~~ggs~~~~f~EMsFsIEK~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAEaELanILSt 292 (514) .+.+.+++.--.+..++.. ......++.++ +..+.+++-|-|+-.=+++=| +.+.+ +-|.-.+..+-++. T Consensus 45 -~G~tv~ip~~~~~g~~~~~---~eg~~i~~~~i--t~~~~~~~i~~~~~~~~i~D~--~~~~~--~~d~~~~~~~~~~~ 114 (274) T protein:vir:93 45 -PGDTLTFPAFVYSGDAQVV---AEGEKIPTDIL--ETKKREAKIRKIAKGTSITDE--ALLSG--YGDPQGEQVRQHGL 114 (274) T ss_pred -CCCEEEEEeeccCCCcccc---cCCCccccccc--ccceeEEEeeeecccccccHH--HHHhh--ccchHHHHHHHHHH Confidence 1222222211111222222 11122334444 445555555665532233322 22223 57889999999999 Q ss_pred HHHHHhhHHHHHHHhhheeecccccccccCCcceeccccccccccchhhHHHHHHHHHHHHHHHHHHHHhcccccccEEE Q lcl|Aclame:pro 293 EVMVELNREIVNLVNSQAQIGKSGWTQGAGAAGVFDFSDAVDVKGARWAGEAYKALLIQIEKEANEIGRQTGRGNGNFII 372 (514) Q Consensus 293 EImlEINReii~~l~~~a~v~~~~~~~~v~~~g~~dl~~~~d~~~~rwa~e~~r~L~~~i~~~a~~I~~~T~r~~~n~~v 372 (514) .+..+++++++..+..... .+ +...+ ..+-+-.+..++.++ -..+++++ T Consensus 115 ~~a~~~d~~~~~~~~~a~~--------~~-~~~~~-------------~~d~i~dA~~~l~d~---------~~~~~~iv 163 (274) T protein:vir:93 115 AHANKVDNDVLEALMGAKL--------TV-NADIT-------------KLNGLQSAIDKFNDE---------DLEPMVLF 163 (274) T ss_pred HHHHHHHHHHHHHHhcccc--------cc-ccccc-------------CHHHHHHHHHHhhhc---------cCCccEEE Confidence 9999999999877633210 01 11111 123344444444432 24678999 Q ss_pred EChhHHhHHhhccccccchhccccCccccccccCceEEEEecCceEEEecCCCccceEEEEEecCCCcccceeecccccc Q lcl|Aclame:pro 373 ASRNVVSALSMTDTLVGPAAQGMQDGSMNTDTNQTVFAGVLGGRFKVYIDQYAVNDYFTVGFKGSTEMDAGVFYSPYVPL 452 (514) Q Consensus 373 ~S~~va~~L~~~g~~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~kG~~~~~~~~fy~PYv~~ 452 (514) |+|.+++.|.......+.......+ +...+-..|.+. |++||+|+..|..-..+.-+|.-. |+. --+. T Consensus 164 v~p~~~~~L~k~~~~~f~~~s~~g~-----~~~~~G~ig~~~-G~~Vi~s~~~p~~t~~l~~~gai~-----~~~-~~~~ 231 (274) T protein:vir:93 164 INPLDAGKLRGDASTNFTRATELGD-----DIIVKGAFGEAL-GAIIVRTNKLEAGTAILAKKGAVK-----LIL-KRDF 231 (274) T ss_pred eCHHHHHHHHhhhhhcccccccccc-----cceeecccceec-CeeEEEcCCCCcceEEEEeCCeEE-----EEe-cCCc Confidence 9999999988653333222211110 111111235554 689999999885443333333211 111 1111 Q ss_pred ccccccCCccccceeeeeeeeee-eecC--ccccccCcceeecCcch Q lcl|Aclame:pro 453 TPLRGSDSKNFQPVIGFKTRYGV-QVNP--FADPTASATKVGNGAPV 496 (514) Q Consensus 453 ~~~~~~dp~s~qp~~~~~tRY~l-~~nP--f~~~~~~~~~i~~~~~~ 496 (514) .-....|++++.=.+-...|||+ ..|| ....+-... +-.| T Consensus 232 ~vE~~Rd~~~~~d~i~~~~~y~~~~~~~~~~v~~t~~~~----s~~~ 274 (274) T protein:vir:93 232 FLEVARDASTKTTALYSDKHYVAYLYDESKAVKITKGSG----SLEM 274 (274) T ss_pred ccccccchhhcccEEEEEEEEEEEEEcCCceEEEeeCcc----ccCC Confidence 11124699999999999999998 4566 211111110 0111 No 27 >protein:vir:96123 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1602 # MgeName: 37 # Cross-refs: genbank:acc:YP_240078;genbank:gi:66395742;genbank:GeneID:5133103 Probab=92.16 E-value=0.013 Score=31.00 Aligned_cols=269 Identities=8% Similarity=0.007 Sum_probs=113.8 Q ss_pred CccccccccccCCCCccCccccccccccccccccccccccccccccccccccccccccccccccccccCccccccccccc Q lcl|Aclame:pro 133 LTGAEAFHPTRQADASFSGQAAASTIADFPTTGAATDGTPYKAEVTTSGGDVSMRYFLALGAVTLAVAGQMTATEYTDGV 212 (514) Q Consensus 133 ~tg~EA~~~~nEadt~fSG~~~~~~~~~~~~~~~~t~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 212 (514) +. ...|.-+ ....+......... ...... .......... .+. . T Consensus 1 ma---------~~~T~~~---------------d~i~Pev~s~~v~~----~~~~~~-----~~~~~~~~~~--~l~-g- 43 (274) T protein:vir:96 1 MA---------QGTTKVS---------------NLIVPEVLAPMMQA----ELDKKL-----RFAQFADIDS--TLV-G- 43 (274) T ss_pred CC---------ccccchh---------------hhhhhHHHHHHHHH----HHHhhh-----hhcccccccc--ccc-C- Confidence 00 0000000 00000000000000 000000 0000000000 000 0 Q ss_pred cccccccccccccchhhhccccCCCCCCcccccceeEEEEEEEEeecccccccccHHHHHHHHhhcCCChhHHHHHHHHH Q lcl|Aclame:pro 213 AGGLLVEIDAGMATSQAELQENFNGSSNNEWNEMSFRIDKQVVEAKSRQLKAQYSIELAQDLRAVHGLDADAELSGILAN 292 (514) Q Consensus 213 a~~~~y~~~~GmtTa~aEal~~~ggs~~~~f~EMsFsIEK~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAEaELanILSt 292 (514) ..+.+.+++.--.+..+|.. ......++.++.+ ...+++.|-|.-.=+++=|. ++..+-|.-.+..+-++. T Consensus 44 ~~G~tv~ip~~~~~g~~~~~---~~g~~i~~~~it~--~~~~~~i~~~~~~~~i~D~~----~~~~~~d~~~~~~~~~~~ 114 (274) T protein:vir:96 44 QPGDTLTFPAFTYSGDAQVI---AEGEKIPVDQIGT--SKREAKVRKIGKGTELTDEA----VLSGFGDPQGEAVRQHGL 114 (274) T ss_pred CCCCEEEEEeeccCCCcccc---CCCCcCchhhccc--ceeEEEEEeeeceeeecHHH----HHhhcchHHHHHHHHHHH Confidence 01222222211112222221 1112334455443 33444445454222333222 122367889999999999 Q ss_pred HHHHHhhHHHHHHHhhheeecccccccccCCcceeccccccccccchhhHHHHHHHHHHHHHHHHHHHHhcccccccEEE Q lcl|Aclame:pro 293 EVMVELNREIVNLVNSQAQIGKSGWTQGAGAAGVFDFSDAVDVKGARWAGEAYKALLIQIEKEANEIGRQTGRGNGNFII 372 (514) Q Consensus 293 EImlEINReii~~l~~~a~v~~~~~~~~v~~~g~~dl~~~~d~~~~rwa~e~~r~L~~~i~~~a~~I~~~T~r~~~n~~v 372 (514) .++.+++++|+..+.... ..+ .+..+ ..+.+-.+..++.++ -...++++ T Consensus 115 ~~a~~~d~~i~~~l~~a~--------~~~-~~~~~-------------~~d~i~dA~~~l~d~---------~~~~~~iv 163 (274) T protein:vir:96 115 AIANKVDNDVLEALKGAT--------LTV-EADIT-------------KLDGLQTAIDKFNDE---------DLEPMVLF 163 (274) T ss_pred HHHHHHHHHHHHHHhcCC--------CCc-Ccccc-------------cHHHHHHHHHHhccc---------CCCceEEE Confidence 999999999987773321 011 11111 123344444444432 23678999 Q ss_pred EChhHHhHHhhccccccchhccccCccccccccCceEEEEecCceEEEecCCCccce-EEEEEecCCCcccceeeccccc Q lcl|Aclame:pro 373 ASRNVVSALSMTDTLVGPAAQGMQDGSMNTDTNQTVFAGVLGGRFKVYIDQYAVNDY-FTVGFKGSTEMDAGVFYSPYVP 451 (514) Q Consensus 373 ~S~~va~~L~~~g~~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy-~~vG~kG~~~~~~~~fy~PYv~ 451 (514) |+|.+++.|.......+.......++.. ..+. .|.+. |++||+|...|..= +++| +|.-. |+.. -+ T Consensus 164 v~p~~~~~L~k~~~~~f~~~~~~g~~~~---~~g~--ig~~~-G~~Vi~s~~~p~~t~~l~~-~gA~~-----~~~~-~~ 230 (274) T protein:vir:96 164 VNPLDAGGLRTSASDNFTRPTQLGDNII---VKGA--FGEAL-GAVIVRSNKLNKGEALLAK-KGAVK-----LITK-RD 230 (274) T ss_pred eCHHHHHHHHhcccccccccccccccce---eecc--cceec-CeeEEEcCCCCcceEEEEe-Cccee-----eeec-CC Confidence 9999999997754333222211110100 1111 25554 68999999988643 2332 22211 1111 11 Q ss_pred cccccccCCccccceeeeeeeeee-eecC--cccccc-Ccceee Q lcl|Aclame:pro 452 LTPLRGSDSKNFQPVIGFKTRYGV-QVNP--FADPTA-SATKVG 491 (514) Q Consensus 452 ~~~~~~~dp~s~qp~~~~~tRY~l-~~nP--f~~~~~-~~~~i~ 491 (514) ...-...|+..++-.|-...+||+ ..|| ....+. .+.++. T Consensus 231 ~~vE~~Rd~~~~~d~i~~~~~yg~~~~~~~~vv~~t~~~~~~~~ 274 (274) T protein:vir:96 231 FFLEKDRDASRKSTALYSDKHYVAYLYDESKVVKITKGAGDEVM 274 (274) T ss_pred cccccccchhhcccEEEEeeEEEEEEEcCccEEEEEcCcccccC Confidence 111124699999999999999999 4577 222222 222222 No 28 >protein:vir:1886 Length: 385 # NCBI annotation: major capsid subunit precursor # Family: family:all:585 # MgeID: mge:41 # MgeName: HK022 # Cross-refs: genbank:acc:NP_037666;genbank:gi:9634124;genbank:GeneID:1262513 Probab=90.33 E-value=0.021 Score=29.74 Aligned_cols=331 Identities=12% Similarity=0.059 Sum_probs=120.9 Q ss_pred CcchhhhhhhhccccccccccccchhhhhhhhhhhhHHHHHHhcccccch------hhhhhhcccccccccccccccccc Q lcl|Aclame:pro 1 MNLTEKWKDLLEAEGADMPEIATATKQKIMSKIFENQDRDINNDPMYRDP------QLVEAFNAGLNEAVVNGDHGYDPA 74 (514) Q Consensus 1 ~~l~~kw~p~l~~~~~~~~~i~~~~~~~~~~~~~enq~~~~~~~~~~~~~------~~~~~~~~~~~~a~~~~~~g~~~~ 74 (514) -+|.+ ...=++.+ + ++ . ...++..++...+.+..... ...+.+-.............-... T Consensus 37 ~~l~~-~~~~~~~~------~----~~-~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 103 (385) T protein:vir:18 37 KQLQS-DLMKVQEE------L----TK-S-GTRLFDLEQKLASGAENPGEKKSFSERAAEELIKSWDGKQGTFGAKTFNK 103 (385) T ss_pred HHHHH-HHHHHHHH------H----HH-H-HHHHHHHHHHhhccccccchhhhhHHHHHHHHHHHHHHhhccchhhHHHh Confidence 00000 00000000 0 00 0 00011111111111100000 000000000000000000000000 Q ss_pred ccccccc-ccc--ccccceeeehhhhhhhhhhhhcceeEEecCCcccceeeeeeeeecCCCCccccccccccCCCCccCc Q lcl|Aclame:pro 75 NIAQGVT-TGA--VTNIGPTVMGMVRRAIPQLIAFDIAGVQPMTGPTSQVFTLRSVYGKDPLTGAEAFHPTRQADASFSG 151 (514) Q Consensus 75 ~~~~st~-tg~--v~~~~P~l~~l~Rra~~~LIa~DI~GVQPmTgPTGLIFAMRSrY~~~~~tg~EA~~~~nEadt~fSG 151 (514) .+...++ .|. .....+.++ +.+..+..-.++|-++||++++.-+. ++... ...+ .|- T Consensus 104 ~~~~~~~~~g~~i~~~~~~~ii---~~~~~~~~l~~~~~~~~~~~~~~~~~----~~~~~---~~~a---------~~v- 163 (385) T protein:vir:18 104 SLGSDADSAGSLIQPMQIPGII---MPGLRRLTIRDLLAQGRTSSNALEYV----REEVF---TNNA---------DVV- 163 (385) T ss_pred hhccccccCCceecchhhhHHH---HHhhhccchhhhcceecccCcceEEE----EEecC---Ccce---------eee- Confidence 0000011 111 112233333 34445667788899999987753221 11110 0000 000 Q ss_pred cccccccccccccccccccccccccccccccccccccccccccccccccCccccccccccccccccccccccccchhhhc Q lcl|Aclame:pro 152 QAAASTIADFPTTGAATDGTPYKAEVTTSGGDVSMRYFLALGAVTLAVAGQMTATEYTDGVAGGLLVEIDAGMATSQAEL 231 (514) Q Consensus 152 ~~~~~~~~~~~~~~~~t~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~y~~~~GmtTa~aEa 231 (514) .| T Consensus 164 -----------------------------------------------------------------------------~E- 165 (385) T protein:vir:18 164 -----------------------------------------------------------------------------AE- 165 (385) T ss_pred -----------------------------------------------------------------------------cc- Confidence 01 Q ss_pred cccCCCCCCcccccceeEEEEEEEEeecccccccccHHHHHHHHhhcCCChhHHHHHHHHHHHHHHhhHHHHHHHhhhee Q lcl|Aclame:pro 232 QENFNGSSNNEWNEMSFRIDKQVVEAKSRQLKAQYSIELAQDLRAVHGLDADAELSGILANEVMVELNREIVNLVNSQAQ 311 (514) Q Consensus 232 l~~~ggs~~~~f~EMsFsIEK~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAEaELanILStEImlEINReii~~l~~~a~ 311 (514) +..+++-..++++++.+.|.-+-...+|.||.||-- +.++.|.+-|+..|..-+|+.||.- ..+ T Consensus 166 --------~~~~~~~~~~~~~~~~~~~k~~~~~~is~ell~d~~-----~l~~~i~~~la~a~~~~~d~~~l~G---~g~ 229 (385) T protein:vir:18 166 --------KALKPESDITFSKQTANVKTIAHWVQASRQVMDDAP-----MLQSYINNRLMYGLALKEEGQLLNG---DGT 229 (385) T ss_pred --------CccccccccceeEEEEeeeeEEEeehhhHHHHhhHH-----HHHHHHHHHHHHHHHHHHHHHHHhc---cCC Confidence 112333345566666666666677789999999853 2477788888888888888777421 100 Q ss_pred ecccccccccCCcceeccccccccccchhhHHHHHHHHHHHHHHHHHHHHhcccccccEEEEChhHHhHHhhccccccch Q lcl|Aclame:pro 312 IGKSGWTQGAGAAGVFDFSDAVDVKGARWAGEAYKALLIQIEKEANEIGRQTGRGNGNFIIASRNVVSALSMTDTLVGPA 391 (514) Q Consensus 312 v~~~~~~~~v~~~g~~dl~~~~d~~~~rwa~e~~r~L~~~i~~~a~~I~~~T~r~~~n~~v~S~~va~~L~~~g~~~~~~ 391 (514) ++ .+.|++.......... -.. .-..+-.|..+...|. ...+..+.+||||+....|....--..-+ T Consensus 230 -~~-------~~~Gi~~~~~~~~~~~-~~~---~~~~~d~i~~~~~~l~--~~~~~~~~~~~~~~~~~~l~~lkd~~G~~ 295 (385) T protein:vir:18 230 -GD-------NLEGLNKVATAYDTSL-NAT---GDTRADIIAHAIYQVT--ESEFSASGIVLNPRDWHNIALLKDNEGRY 295 (385) T ss_pred -CC-------cccccccccccccccc-ccc---ccchHHHHHHHHHhhc--cccCCCCEEEEcHHHHHHHHHhhcCCCce Confidence 11 1223322111100000 000 0011222333333332 22466778999999998877532100000 Q ss_pred hccccCccccccccCceEEEEecCceEEEecCCCccceEEEEEecCCCcccceeeccccc-ccccccc--CCccc-ccee Q lcl|Aclame:pro 392 AQGMQDGSMNTDTNQTVFAGVLGGRFKVYIDQYAVNDYFTVGFKGSTEMDAGVFYSPYVP-LTPLRGS--DSKNF-QPVI 467 (514) Q Consensus 392 ~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~kG~~~~~~~~fy~PYv~-~~~~~~~--dp~s~-qp~~ 467 (514) . .+...+.++ ++|. |++|+++++.|..-+++|--- .+++. +.- ...+... +..-| +..+ T Consensus 296 l-----~~~~~~~~~----~~l~-G~pV~~~~~~p~~~~~~gd~~-----~~~~~--~~~~~~~v~~~~~~~~~~~~~~~ 358 (385) T protein:vir:18 296 I-----FGGPQAFTS----NIMW-GLPVVPTKAQAAGTFTVGGFD-----MASQV--WDRMDATVEVSREDRDNFVKNML 358 (385) T ss_pred e-----ccCcccCCC----ceec-ceeeEEcCcCCCCcEEEeecc-----cEEEE--EEecceEEEEeccccchhhcCcE Confidence 0 000011122 4564 479999999987655554210 00111 000 0000000 00111 2233 Q ss_pred e--eeeeeee-eecC--ccccccCcceeecCc Q lcl|Aclame:pro 468 G--FKTRYGV-QVNP--FADPTASATKVGNGA 494 (514) Q Consensus 468 ~--~~tRY~l-~~nP--f~~~~~~~~~i~~~~ 494 (514) + ...||+. ..+| |...+--. +. T Consensus 359 ~~~~~~r~~~~v~~~~a~~~~~~~a-----a~ 385 (385) T protein:vir:18 359 TILCEERLALAHYRPTAIIKGTFSS-----GS 385 (385) T ss_pred EEEEEEeeccEEecccceEEEEecc-----CC Confidence 3 3457776 4455 32211100 00 No 29 >protein:vir:191 Length: 385 # NCBI annotation: major head subunit precursor # Family: family:all:585 # MgeID: mge:6 # MgeName: HK97 # Cross-refs: genbank:acc:NP_037701;genbank:gi:9634158;genbank:GeneID:1262530 Probab=90.33 E-value=0.021 Score=29.74 Aligned_cols=331 Identities=12% Similarity=0.059 Sum_probs=120.9 Q ss_pred CcchhhhhhhhccccccccccccchhhhhhhhhhhhHHHHHHhcccccch------hhhhhhcccccccccccccccccc Q lcl|Aclame:pro 1 MNLTEKWKDLLEAEGADMPEIATATKQKIMSKIFENQDRDINNDPMYRDP------QLVEAFNAGLNEAVVNGDHGYDPA 74 (514) Q Consensus 1 ~~l~~kw~p~l~~~~~~~~~i~~~~~~~~~~~~~enq~~~~~~~~~~~~~------~~~~~~~~~~~~a~~~~~~g~~~~ 74 (514) -+|.+ ...=++.+ + ++ . ...++..++...+.+..... ...+.+-.............-... T Consensus 37 ~~l~~-~~~~~~~~------~----~~-~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 103 (385) T protein:vir:19 37 KQLQS-DLMKVQEE------L----TK-S-GTRLFDLEQKLASGAENPGEKKSFSERAAEELIKSWDGKQGTFGAKTFNK 103 (385) T ss_pred HHHHH-HHHHHHHH------H----HH-H-HHHHHHHHHHhhccccccchhhhhHHHHHHHHHHHHHHhhccchhhHHHh Confidence 00000 00000000 0 00 0 00011111111111100000 000000000000000000000000 Q ss_pred ccccccc-ccc--ccccceeeehhhhhhhhhhhhcceeEEecCCcccceeeeeeeeecCCCCccccccccccCCCCccCc Q lcl|Aclame:pro 75 NIAQGVT-TGA--VTNIGPTVMGMVRRAIPQLIAFDIAGVQPMTGPTSQVFTLRSVYGKDPLTGAEAFHPTRQADASFSG 151 (514) Q Consensus 75 ~~~~st~-tg~--v~~~~P~l~~l~Rra~~~LIa~DI~GVQPmTgPTGLIFAMRSrY~~~~~tg~EA~~~~nEadt~fSG 151 (514) .+...++ .|. .....+.++ +.+..+..-.++|-++||++++.-+. ++... ...+ .|- T Consensus 104 ~~~~~~~~~g~~i~~~~~~~ii---~~~~~~~~l~~~~~~~~~~~~~~~~~----~~~~~---~~~a---------~~v- 163 (385) T protein:vir:19 104 SLGSDADSAGSLIQPMQIPGII---MPGLRRLTIRDLLAQGRTSSNALEYV----REEVF---TNNA---------DVV- 163 (385) T ss_pred hhccccccCCceecchhhhHHH---HHhhhccchhhhcceecccCcceEEE----EEecC---Ccce---------eee- Confidence 0000011 111 112233333 34445667788899999987753221 11110 0000 000 Q ss_pred cccccccccccccccccccccccccccccccccccccccccccccccccCccccccccccccccccccccccccchhhhc Q lcl|Aclame:pro 152 QAAASTIADFPTTGAATDGTPYKAEVTTSGGDVSMRYFLALGAVTLAVAGQMTATEYTDGVAGGLLVEIDAGMATSQAEL 231 (514) Q Consensus 152 ~~~~~~~~~~~~~~~~t~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~y~~~~GmtTa~aEa 231 (514) .| T Consensus 164 -----------------------------------------------------------------------------~E- 165 (385) T protein:vir:19 164 -----------------------------------------------------------------------------AE- 165 (385) T ss_pred -----------------------------------------------------------------------------cc- Confidence 01 Q ss_pred cccCCCCCCcccccceeEEEEEEEEeecccccccccHHHHHHHHhhcCCChhHHHHHHHHHHHHHHhhHHHHHHHhhhee Q lcl|Aclame:pro 232 QENFNGSSNNEWNEMSFRIDKQVVEAKSRQLKAQYSIELAQDLRAVHGLDADAELSGILANEVMVELNREIVNLVNSQAQ 311 (514) Q Consensus 232 l~~~ggs~~~~f~EMsFsIEK~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAEaELanILStEImlEINReii~~l~~~a~ 311 (514) +..+++-..++++++.+.|.-+-...+|.||.||-- +.++.|.+-|+..|..-+|+.||.- ..+ T Consensus 166 --------~~~~~~~~~~~~~~~~~~~k~~~~~~is~ell~d~~-----~l~~~i~~~la~a~~~~~d~~~l~G---~g~ 229 (385) T protein:vir:19 166 --------KALKPESDITFSKQTANVKTIAHWVQASRQVMDDAP-----MLQSYINNRLMYGLALKEEGQLLNG---DGT 229 (385) T ss_pred --------CccccccccceeEEEEeeeeEEEeehhhHHHHhhHH-----HHHHHHHHHHHHHHHHHHHHHHHhc---cCC Confidence 112333345566666666666677789999999853 2477788888888888888777421 100 Q ss_pred ecccccccccCCcceeccccccccccchhhHHHHHHHHHHHHHHHHHHHHhcccccccEEEEChhHHhHHhhccccccch Q lcl|Aclame:pro 312 IGKSGWTQGAGAAGVFDFSDAVDVKGARWAGEAYKALLIQIEKEANEIGRQTGRGNGNFIIASRNVVSALSMTDTLVGPA 391 (514) Q Consensus 312 v~~~~~~~~v~~~g~~dl~~~~d~~~~rwa~e~~r~L~~~i~~~a~~I~~~T~r~~~n~~v~S~~va~~L~~~g~~~~~~ 391 (514) ++ .+.|++.......... -.. .-..+-.|..+...|. ...+..+.+||||+....|....--..-+ T Consensus 230 -~~-------~~~Gi~~~~~~~~~~~-~~~---~~~~~d~i~~~~~~l~--~~~~~~~~~~~~~~~~~~l~~lkd~~G~~ 295 (385) T protein:vir:19 230 -GD-------NLEGLNKVATAYDTSL-NAT---GDTRADIIAHAIYQVT--ESEFSASGIVLNPRDWHNIALLKDNEGRY 295 (385) T ss_pred -CC-------cccccccccccccccc-ccc---ccchHHHHHHHHHhhc--cccCCCCEEEEcHHHHHHHHHhhcCCCce Confidence 11 1223322111100000 000 0011222333333332 22466778999999998877532100000 Q ss_pred hccccCccccccccCceEEEEecCceEEEecCCCccceEEEEEecCCCcccceeeccccc-ccccccc--CCccc-ccee Q lcl|Aclame:pro 392 AQGMQDGSMNTDTNQTVFAGVLGGRFKVYIDQYAVNDYFTVGFKGSTEMDAGVFYSPYVP-LTPLRGS--DSKNF-QPVI 467 (514) Q Consensus 392 ~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~kG~~~~~~~~fy~PYv~-~~~~~~~--dp~s~-qp~~ 467 (514) . .+...+.++ ++|. |++|+++++.|..-+++|--- .+++. +.- ...+... +..-| +..+ T Consensus 296 l-----~~~~~~~~~----~~l~-G~pV~~~~~~p~~~~~~gd~~-----~~~~~--~~~~~~~v~~~~~~~~~~~~~~~ 358 (385) T protein:vir:19 296 I-----FGGPQAFTS----NIMW-GLPVVPTKAQAAGTFTVGGFD-----MASQV--WDRMDATVEVSREDRDNFVKNML 358 (385) T ss_pred e-----ccCcccCCC----ceec-ceeeEEcCcCCCCcEEEeecc-----cEEEE--EEecceEEEEeccccchhhcCcE Confidence 0 000011122 4564 479999999987655554210 00111 000 0000000 00111 2233 Q ss_pred e--eeeeeee-eecC--ccccccCcceeecCc Q lcl|Aclame:pro 468 G--FKTRYGV-QVNP--FADPTASATKVGNGA 494 (514) Q Consensus 468 ~--~~tRY~l-~~nP--f~~~~~~~~~i~~~~ 494 (514) + ...||+. ..+| |...+--. +. T Consensus 359 ~~~~~~r~~~~v~~~~a~~~~~~~a-----a~ 385 (385) T protein:vir:19 359 TILCEERLALAHYRPTAIIKGTFSS-----GS 385 (385) T ss_pred EEEEEEeeccEEecccceEEEEecc-----CC Confidence 3 3457776 4455 32211100 00 No 30 >protein:vir:100135 Length: 418 # NCBI annotation: gp5 # Family: family:all:585 # MgeID: mge:1639 # MgeName: phi1026b # Cross-refs: genbank:acc:NP_945035;genbank:gi:38707895;genbank:GeneID:2744182 Probab=90.05 E-value=0.023 Score=29.57 Aligned_cols=349 Identities=13% Similarity=0.071 Sum_probs=121.5 Q ss_pred Ccchhhhhhhhccccccccccccchhhhhhhh--hhhhHHHHHHhc-------ccccch----------hhhhhhccccc Q lcl|Aclame:pro 1 MNLTEKWKDLLEAEGADMPEIATATKQKIMSK--IFENQDRDINND-------PMYRDP----------QLVEAFNAGLN 61 (514) Q Consensus 1 ~~l~~kw~p~l~~~~~~~~~i~~~~~~~~~~~--~~enq~~~~~~~-------~~~~~~----------~~~~~~~~~~~ 61 (514) ..+.++..--++.-+....|+.... ....+. -|+.+.+++.+. ...... .-...|...+. T Consensus 39 ~~~~e~~~~e~~~~~~~~~e~~~~~-~~l~~~~~~l~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 117 (418) T protein:vir:10 39 KSAGEKALAEAKRAGDLGVETKATV-DELLIKQGELQARLLEAEQKLARGGGSAELETPKTLGQLVTESEEMKGMDGSAR 117 (418) T ss_pred HHHHHHHHHHHHhhhhhhHHHHHHH-HHHHHHHHHHHHHHHHHHHHHhhcccccccchhhhhhHHhhhHHHHHHHHHHHh Confidence 1111111111110000000100000 000000 001111111000 000000 00000000000 Q ss_pred ccc---ccccccccccccccccccccccccceeee-hhhhhhhhhhhhcceeEEecCCcccceeeeeeeeecCCCCcccc Q lcl|Aclame:pro 62 EAV---VNGDHGYDPANIAQGVTTGAVTNIGPTVM-GMVRRAIPQLIAFDIAGVQPMTGPTSQVFTLRSVYGKDPLTGAE 137 (514) Q Consensus 62 ~a~---~~~~~g~~~~~~~~st~tg~v~~~~P~l~-~l~Rra~~~LIa~DI~GVQPmTgPTGLIFAMRSrY~~~~~tg~E 137 (514) ... .....-........+++++.-...-|.+. .+++.+.+..+-.++|.+-||++++.-+ .|- ... + T Consensus 118 ~~~~~~~~~~~~~~~~~~~~~~~~~~g~lvp~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~--~~~--~~~---~-- 188 (418) T protein:vir:10 118 KSVRVRVDRKSIMNVPATVGSGVSGSNSLVVADRQAGIIAPPQRKMTIRDLLMPGQTSSSSIEY--TVE--TGF---T-- 188 (418) T ss_pred hhhhhhhHHHHHHHhhhhccCCCCCCccccchhHHHHHHHHHhhhhhHHhhcceeeccCCceeE--EEE--ecC---C-- Confidence 000 00000000000000111111111111111 2344455667778889999988765211 110 000 0 Q ss_pred ccccccCCCCccCccccccccccccccccccccccccccccccccccccccccccccccccccCcccccccccccccccc Q lcl|Aclame:pro 138 AFHPTRQADASFSGQAAASTIADFPTTGAATDGTPYKAEVTTSGGDVSMRYFLALGAVTLAVAGQMTATEYTDGVAGGLL 217 (514) Q Consensus 138 A~~~~nEadt~fSG~~~~~~~~~~~~~~~~t~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~ 217 (514) +.+.|- T Consensus 189 -------~~a~~v------------------------------------------------------------------- 194 (418) T protein:vir:10 189 -------NNAAAV------------------------------------------------------------------- 194 (418) T ss_pred -------Cceeee------------------------------------------------------------------- Confidence 000000 Q ss_pred ccccccccchhhhccccCCCCCCcccccceeEEEEEEEEeecccccccccHHHHHHHHhhcCCChhHHHHHHHHHHHHHH Q lcl|Aclame:pro 218 VEIDAGMATSQAELQENFNGSSNNEWNEMSFRIDKQVVEAKSRQLKAQYSIELAQDLRAVHGLDADAELSGILANEVMVE 297 (514) Q Consensus 218 y~~~~GmtTa~aEal~~~ggs~~~~f~EMsFsIEK~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAEaELanILStEImlE 297 (514) +| +...++-..++++++..+|.-+-...+|-||.||.- |.++.|.+-|+..|..- T Consensus 195 -----------~E---------~~~~~~~~~~f~~v~~~~~k~~~~~~is~ell~ds~-----~l~~~i~~~l~~a~~~~ 249 (418) T protein:vir:10 195 -----------AE---------GAQKPTSDLKFNLKNQPVRTIAHLFKASRQILDDAP-----ALQSYIDGRARYGLQLT 249 (418) T ss_pred -----------cc---------CccccccccceeeEEEeeeeEEEeehhhHHHHHhHH-----HHHHHHHHHHHHHHHHH Confidence 01 011222223455666666666666789999999863 45888888888888888 Q ss_pred hhHHHHHHHhhheeecccccccccCCcceeccccccccccchhhHHHHHHHHHHHHHHHHHHHHhcccccccEEEEChhH Q lcl|Aclame:pro 298 LNREIVNLVNSQAQIGKSGWTQGAGAAGVFDFSDAVDVKGARWAGEAYKALLIQIEKEANEIGRQTGRGNGNFIIASRNV 377 (514) Q Consensus 298 INReii~~l~~~a~v~~~~~~~~v~~~g~~dl~~~~d~~~~rwa~e~~r~L~~~i~~~a~~I~~~T~r~~~n~~v~S~~v 377 (514) +|+-||. -.-+ ... +.|++......-...+ . .....+..|..+...+. ...+..+.+||+|.. T Consensus 250 ~d~a~l~---G~g~---~~~-----p~Gi~~~~~~~~~~~~---~-~~~~~~~~i~~~~~~~~--~~~~~~~~~v~n~~~ 312 (418) T protein:vir:10 250 EEGQILK---GDGT---GAN-----ILGILPQASAFMPSIT---L-ANATPIDKIRLALLQAV--LAEFPATGIVLNPID 312 (418) T ss_pred HHHHHhc---cCCC---Ccc-----cccccccccccccccc---c-cccccHHHHHHHHHhhc--cccCCCCEEEEcHHH Confidence 8887742 1111 111 2233221111000000 0 00011222222222232 234566779999999 Q ss_pred HhHHhhccccccchhccccCccccccccCceEEEEecCceEEEecCCCccceEEEEEecCCC-----cccceeecccccc Q lcl|Aclame:pro 378 VSALSMTDTLVGPAAQGMQDGSMNTDTNQTVFAGVLGGRFKVYIDQYAVNDYFTVGFKGSTE-----MDAGVFYSPYVPL 452 (514) Q Consensus 378 a~~L~~~g~~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~kG~~~-----~~~~~fy~PYv~~ 452 (514) ...|...- + ..| ...-.+.+.. -.|+|. |++|+++++.+.+-+++|---..- .+-.+=..||.-. T Consensus 313 ~~~L~~lk--d---~~G---~~i~~~~~~~-~~~~l~-G~pV~~~~~~p~~~~~~gd~s~~~~~~~~~~~~i~~~~~~~~ 382 (418) T protein:vir:10 313 WASIELTK--D---SQG---RYIVGNPVNG-TTPRLW-NLPVVETQAMTANEFLVGAFSMAAQIFDRMEIEVLLSTENVD 382 (418) T ss_pred HHHHHHhh--c---CCC---ceeccccccC-CCceec-ceeeEEcCCCCCCcEEEeeccceEEEEEecceEEEEecccch Confidence 98876421 1 001 0111111100 115565 479999999886555555211000 0000111111110 Q ss_pred ccccccCCccccceeeeeeeeeee-ecC--ccccccCcceeecCcchhhhcc Q lcl|Aclame:pro 453 TPLRGSDSKNFQPVIGFKTRYGVQ-VNP--FADPTASATKVGNGAPVAASMG 501 (514) Q Consensus 453 ~~~~~~dp~s~qp~~~~~tRY~l~-~nP--f~~~~~~~~~i~~~~~~~~~~~ 501 (514) +-..-+=.+=+..|++.. .+| |.. ..-.....| T Consensus 383 ------~f~~~~~~~r~~~~~d~~~~~~~a~~~----------~~~~~~~~g 418 (418) T protein:vir:10 383 ------DFEKNMVSIRAEERLALAVYRPESFVT----------GALVEQAGG 418 (418) T ss_pred ------hhhcCceEEEEEEeeccEEecccceEE----------EEeccCCCC Confidence 011122233345567663 344 221 111112222 No 31 >protein:vir:78523 Length: 338 # NCBI annotation: Putative head structural protein # Family: family:all:507 # MgeID: mge:1853 # MgeName: U2 # Cross-refs: genbank:acc:YP_001491585;genbank:gi:157786408;genbank:GeneID:5625675 Probab=89.81 E-value=0.024 Score=29.44 Aligned_cols=311 Identities=15% Similarity=0.064 Sum_probs=122.6 Q ss_pred ccccccccccccccccccccccccccccccccceeee-hhhhhhhhhhhhcceeEEecCCcccceeeeeeeeecCCCCcc Q lcl|Aclame:pro 57 NAGLNEAVVNGDHGYDPANIAQGVTTGAVTNIGPTVM-GMVRRAIPQLIAFDIAGVQPMTGPTSQVFTLRSVYGKDPLTG 135 (514) Q Consensus 57 ~~~~~~a~~~~~~g~~~~~~~~st~tg~v~~~~P~l~-~l~Rra~~~LIa~DI~GVQPmTgPTGLIFAMRSrY~~~~~tg 135 (514) -+.++|...... |.+..+-..++.++-++. .+. -+++.+.+..+-..+|.+.||+++..-|.-. ... T Consensus 1 ~~~~~e~~~~~~-~~~~~~~~~~~~~~liP~---~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~ip~~----~~~---- 68 (338) T protein:vir:78 1 MATLNELAPNTA-GSNHQGRLAHVPSDLLPK---EIVGPIFDKAQESSLVLRLGENIPISYGETIIPTT----VKR---- 68 (338) T ss_pred CcchHHhhhhhc-ccccccceecccccccch---HHHHHHHHHHHhhchhhhhcceeeccCCceEEEEE----ecC---- Confidence 011111110000 000000011111111111 111 2445555677778899999999865443321 111 Q ss_pred ccccccccCCCCccCccccccccccccccccccccccccccccccccccccccccccccccccccCcccccccccccccc Q lcl|Aclame:pro 136 AEAFHPTRQADASFSGQAAASTIADFPTTGAATDGTPYKAEVTTSGGDVSMRYFLALGAVTLAVAGQMTATEYTDGVAGG 215 (514) Q Consensus 136 ~EA~~~~nEadt~fSG~~~~~~~~~~~~~~~~t~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~ 215 (514) +.+.+-+.... T Consensus 69 ---------~~a~~v~~~~~------------------------------------------------------------ 79 (338) T protein:vir:78 69 ---------PEVGQVGVGTS------------------------------------------------------------ 79 (338) T ss_pred ---------ccceeeccccc------------------------------------------------------------ Confidence 11111110000 Q ss_pred ccccccccccchhhhccccCCCCCCcccccceeEEEEEEEEeecccccccccHHHHHHHHhhcCCChhHHHHHHHHHHHH Q lcl|Aclame:pro 216 LLVEIDAGMATSQAELQENFNGSSNNEWNEMSFRIDKQVVEAKSRQLKAQYSIELAQDLRAVHGLDADAELSGILANEVM 295 (514) Q Consensus 216 ~~y~~~~GmtTa~aEal~~~ggs~~~~f~EMsFsIEK~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAEaELanILStEIm 295 (514) .-.+| +...++-.-+++.++...+..+-...+|-||.+|-. .|.+++|.+-|+..|. T Consensus 80 ----------~~~~E---------g~~~~~~~~~f~~v~l~~~k~~~~~~is~ell~ds~----~~~~~~i~~~la~a~~ 136 (338) T protein:vir:78 80 ----------NEQRE---------GGTKPLSGTAWDTRSVAPIKLATIVTVSEEFARMNP----SGLYTKLQADLAYAIG 136 (338) T ss_pred ----------ccccc---------cccccccccceeEEEEEEEEEEEeehhhHHHHhcCH----HHHHHHHHHHHHHHHH Confidence 00001 112222233344445555544555668999999833 5678999999999999 Q ss_pred HHhhHHHHHHHhhheeecccccccccCCcceeccccccccccchhhHHHHHHHHHHHHHHHHHHHHhcccccccEEEECh Q lcl|Aclame:pro 296 VELNREIVNLVNSQAQIGKSGWTQGAGAAGVFDFSDAVDVKGARWAGEAYKALLIQIEKEANEIGRQTGRGNGNFIIASR 375 (514) Q Consensus 296 lEINReii~~l~~~a~v~~~~~~~~v~~~g~~dl~~~~d~~~~rwa~e~~r~L~~~i~~~a~~I~~~T~r~~~n~~v~S~ 375 (514) ..||..||.---...-..-.+........+....+. .+. ....++..+.++...|... ..+..+.++++| T Consensus 137 ~~~d~~~l~G~g~~~~~~~~gi~~~~~~~~~~~~~~-------~~~--~~~~~~~~~~~~~~~~~~~-~~~~~~~~~m~~ 206 (338) T protein:vir:78 137 RGIDLAVFHGKSPLTGSALQGIDTNNVIVNTTNVDY-------LQT--GTTPLLDRFLDGYDLVSAN-TDVDFNGWAADP 206 (338) T ss_pred HHHHHHhhcccCCCcccccccccccccccccccccc-------ccc--cchhhHHHHHHHHHHhhhh-ccccceEEEEch Confidence 999998864321100000000000000001100000 000 0122333444444444322 246677899999 Q ss_pred hHHhHHhhccccccchhccccCccccccccCceEEEEecCceEEEecCCCccc---------eEEEE--------EecCC Q lcl|Aclame:pro 376 NVVSALSMTDTLVGPAAQGMQDGSMNTDTNQTVFAGVLGGRFKVYIDQYAVND---------YFTVG--------FKGST 438 (514) Q Consensus 376 ~va~~L~~~g~~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~d---------y~~vG--------~kG~~ 438 (514) +....|...--+.. .. +...-.+....--.++|.|+ +||++.+.+.+ -+++| ..+.- T Consensus 207 ~~~~~L~~~~~l~d--~~---g~~l~~~~~~~~~~~~l~G~-PV~~~~~ip~~~~~~~~~~~~~~~gdfs~~~~~~~~~~ 280 (338) T protein:vir:78 207 RYRARLLRSQAYRD--AN---GNVDPTRINLAASAGDLLGL-PVQFGKAVGGDLGAATDSKVRVVGGDFSQLKYGFADEI 280 (338) T ss_pred HHHHHHHHHhhhcc--CC---CceeecccccCCCCceeeee-eEEEccccCccccccCCcccEEEEEecceEEEEeeccc Confidence 99887754322211 00 00110010000011455544 99988775521 12223 22211 Q ss_pred CcccceeeccccccccccccCCcc-----cc-ceee--eeeeeee-eecCccccccCcceeecCcchhh Q lcl|Aclame:pro 439 EMDAGVFYSPYVPLTPLRGSDSKN-----FQ-PVIG--FKTRYGV-QVNPFADPTASATKVGNGAPVAA 498 (514) Q Consensus 439 ~~~~~~fy~PYv~~~~~~~~dp~s-----~q-p~~~--~~tRY~l-~~nPf~~~~~~~~~i~~~~~~~~ 498 (514) .++ .++|. ......||.. || --++ ...|++. ..+|= ...++.++..-++ T Consensus 281 ~i~----~~~~~--~~~~~~~~~~~~~~~~~~~~~~~r~~~r~d~~v~~~~-----a~~~l~~~~~~~~ 338 (338) T protein:vir:78 281 RVK----MSDTA--TLTDNTSPTPQTVSMWQTNQIAILIEVTFGWLLGDKQ-----AFVKFVDDEDPDA 338 (338) T ss_pred EEE----Eeecc--cccccccccccchhhhhcCcEEEEEEEEeccEeeccc-----ceEEEecccCCCC Confidence 110 00000 0001122221 11 1123 3567874 55661 1233333333333 No 32 >protein:vir:4953 Length: 397 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:108 # MgeName: Sfi19 # Cross-refs: genbank:acc:NP_049929;genbank:gi:9632900;genbank:GeneID:1262076 Probab=89.12 E-value=0.028 Score=29.08 Aligned_cols=336 Identities=13% Similarity=0.116 Sum_probs=120.0 Q ss_pred Ccchhhhhhhhccccccccccccchh----------------hhh---hhhhh---hhHHHHHHh---cc---------- Q lcl|Aclame:pro 1 MNLTEKWKDLLEAEGADMPEIATATK----------------QKI---MSKIF---ENQDRDINN---DP---------- 45 (514) Q Consensus 1 ~~l~~kw~p~l~~~~~~~~~i~~~~~----------------~~~---~~~~~---enq~~~~~~---~~---------- 45 (514) |+.+++=.-.++.... +|++..+ +++ +..+. |.+++.+.+ +. T Consensus 1 Mk~~~el~~~~~~~~~---~~~~l~~~~~~~~~~~~~~~ee~~~~~~~i~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~ 77 (397) T protein:vir:49 1 MKTSNELHDLWVAQGD---KVENLNEKLNVAMLDDSVSAEELQAIKNERDTAKMKRDMFKEQYTEARANEVANMSEEEKK 77 (397) T ss_pred CchHHHHHHHHHHHHH---HHHHHHHHHHHHHhhhhcCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccc Confidence 5533331111111000 0100000 000 00110 111111110 00 Q ss_pred ccc-ch-hh----hhhhccccccccccccccccccccccccc-cccccccceeeehhhhhhhhhhhhcceeEEecCCccc Q lcl|Aclame:pro 46 MYR-DP-QL----VEAFNAGLNEAVVNGDHGYDPANIAQGVT-TGAVTNIGPTVMGMVRRAIPQLIAFDIAGVQPMTGPT 118 (514) Q Consensus 46 ~~~-~~-~~----~~~~~~~~~~a~~~~~~g~~~~~~~~st~-tg~v~~~~P~l~~l~Rra~~~LIa~DI~GVQPmTgPT 118 (514) ... .. .. ..+|...+.. +.. ........+++ .|.+.--..+.=.+++.+.+..+-.++|.++||++++ T Consensus 78 ~~~~~~~~~~~~~~~~~~~~l~~----~~~-~~~~~~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~ 152 (397) T protein:vir:49 78 PLTKSEEEVKAGFVKDFKNLVRG----RYQ-NLLDSKTDASGSDAGLTIPQDIQTAIHTLVSQYDSLQEYVNVENVTTLT 152 (397) T ss_pred ccccchhHHHHHHHHHHHHHHhc----chh-HHHHHhhccccccCcccccHhHHHHHHHHHHhhhhHHhhhceeecccCc Confidence 000 00 00 0011111111 000 00000111111 1211110111112334444666778889999999998 Q ss_pred ceeeeeeeeecCCCCccccccccccCCCCccCcccccccccccccccccccccccccccccccccccccccccccccccc Q lcl|Aclame:pro 119 SQVFTLRSVYGKDPLTGAEAFHPTRQADASFSGQAAASTIADFPTTGAATDGTPYKAEVTTSGGDVSMRYFLALGAVTLA 198 (514) Q Consensus 119 GLIFAMRSrY~~~~~tg~EA~~~~nEadt~fSG~~~~~~~~~~~~~~~~t~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~ 198 (514) |-+.-++- .+. ... +.|-+ T Consensus 153 ~~~~~~~~--~~~---~~~---------a~~v~----------------------------------------------- 171 (397) T protein:vir:49 153 GSRVYEKW--TDI---TGL---------ANIDD----------------------------------------------- 171 (397) T ss_pred cceEEEee--ccC---Ccc---------eeeec----------------------------------------------- Confidence 84332211 110 000 00000 Q ss_pred ccCccccccccccccccccccccccccchhhhccccCCCCCCcccccceeEEEEEEEEeecccccccccHHHHHHHHhhc Q lcl|Aclame:pro 199 VAGQMTATEYTDGVAGGLLVEIDAGMATSQAELQENFNGSSNNEWNEMSFRIDKQVVEAKSRQLKAQYSIELAQDLRAVH 278 (514) Q Consensus 199 ~~~~~~~~~~~~~~a~~~~y~~~~GmtTa~aEal~~~ggs~~~~f~EMsFsIEK~tVtAKSRaLKAEYT~ELAQDLkAiH 278 (514) +|- +. ...+...|.++.|++.|..+ ...+|-||.+|-. T Consensus 172 -----------------------E~~-----~~----~~~~~~~~~~i~~~~~k~~~-------~~~iS~ell~ds~--- 209 (397) T protein:vir:49 172 -----------------------EAG-----KI----ADVDDPKLSLIKYTIKRYAG-------ISTVTNSLLADSA--- 209 (397) T ss_pred -----------------------Ccc-----cc----ccccccceeeEEeeeeeEEe-------eehhHHHHHhhhH--- Confidence 000 00 00012235555555555544 4559999999953 Q ss_pred CCChhHHHHHHHHHHHHHHhhHHHHHHHhhheeecccccccccCCcceeccccccccccchhhHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 279 GLDADAELSGILANEVMVELNREIVNLVNSQAQIGKSGWTQGAGAAGVFDFSDAVDVKGARWAGEAYKALLIQIEKEANE 358 (514) Q Consensus 279 GLDAEaELanILStEImlEINReii~~l~~~a~v~~~~~~~~v~~~g~~dl~~~~d~~~~rwa~e~~r~L~~~i~~~a~~ 358 (514) .|.+++|.+-|+..|...+|+.||.-.-.. ....+.+++ +-...|+..|... T Consensus 210 -~~l~~~i~~~l~~~~~~~~d~ai~~G~g~~-----------~~~~~~~~~-------------d~i~~~~~~l~~~--- 261 (397) T protein:vir:49 210 -ENILAWLSGWIAKKVVVTRNKAILEAIAAL-----------PTKPTLTKW-------------DDIIDLEAKVDPA--- 261 (397) T ss_pred -HHHHHHHHHHHHHHHHHHHHHHHHhhcccc-----------ccccccccH-------------HHHHHHHHhhhhh--- Confidence 567999999999999999999996543111 112233322 2244455554432 Q ss_pred HHHhcccccccEEEEChhHHhHHhhccccccchhccccCccccccccCceEEEEecCceEEEe--cCCCcc----c---- Q lcl|Aclame:pro 359 IGRQTGRGNGNFIIASRNVVSALSMTDTLVGPAAQGMQDGSMNTDTNQTVFAGVLGGRFKVYI--DQYAVN----D---- 428 (514) Q Consensus 359 I~~~T~r~~~n~~v~S~~va~~L~~~g~~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~--D~y~~~----d---- 428 (514) ......+|++|.....|...- . ..|. .-...+.+ .-..++|.| ++|++ |...+. + T Consensus 262 ------~~~~a~~vmn~~~~~~l~~lk---d--~~G~--~l~~~~~~-~~~~~~l~G-~PV~~~~~~~~~~~~~~~~~i~ 326 (397) T protein:vir:49 262 ------IKQTSFFLTNTSGFTALKKVK---N--ALGD--YLMERDVK-SPTGYSIDG-FAVKEVADRWLANGTGGAMPLY 326 (397) T ss_pred ------hcCCCEEEEcHHHHHHHHHhh---c--CCCc--eeeccCcC-CCCCceecc-eeeEEecccccccccCCceeEE Confidence 133457889999988887531 1 0110 00111111 111145644 47765 222221 1 Q ss_pred ------eEEEEEecCCCcccceeeccccccccccccCCccccceeeeeeeeee-eecC--ccccccCcceeecCcchhhh Q lcl|Aclame:pro 429 ------YFTVGFKGSTEMDAGVFYSPYVPLTPLRGSDSKNFQPVIGFKTRYGV-QVNP--FADPTASATKVGNGAPVAAS 499 (514) Q Consensus 429 ------y~~vG~kG~~~~~~~~fy~PYv~~~~~~~~dp~s~qp~~~~~tRY~l-~~nP--f~~~~~~~~~i~~~~~~~~~ 499 (514) |++++.++..+. =+.||.. .+-...+=.+-...|++. ..+| |...+-....-.-++..... T Consensus 327 ~gd~~~~~~~~~~~~~~i----~~~~~~~------~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~~~~~~~~~~~~~~ 396 (397) T protein:vir:49 327 FGDLKQAVTLFDRQHMSL----LSTNIGG------GAFETDTTKVRVIDRFDVVATDTEAFVPASFKAIADQKGNLGSTA 396 (397) T ss_pred EeeccceEEEEeecceEE----EEecccc------chhhcCceeEEEEeeeCcEEecccceEEEEeecccCCCCCccccc Confidence 222222221111 1122211 011222334444555554 2333 21110000000000000000 Q ss_pred c Q lcl|Aclame:pro 500 M 500 (514) Q Consensus 500 ~ 500 (514) . T Consensus 397 ~ 397 (397) T protein:vir:49 397 V 397 (397) T ss_pred C Confidence 1 No 33 >protein:vir:8420 Length: 477 # NCBI annotation: gp15 # Family: family:all:21 # MgeID: mge:155 # MgeName: Omega # Cross-refs: genbank:acc:NP_818316;genbank:gi:29566752;genbank:GeneID:1260033 Probab=88.96 E-value=0.029 Score=29.00 Aligned_cols=355 Identities=13% Similarity=0.079 Sum_probs=130.5 Q ss_pred Ccchh-----------hhhh-hhccccccccccccchhhhhhhhhhh---hHHHHHHh----cccccchhhhhhhccccc Q lcl|Aclame:pro 1 MNLTE-----------KWKD-LLEAEGADMPEIATATKQKIMSKIFE---NQDRDINN----DPMYRDPQLVEAFNAGLN 61 (514) Q Consensus 1 ~~l~~-----------kw~p-~l~~~~~~~~~i~~~~~~~~~~~~~e---nq~~~~~~----~~~~~~~~~~~~~~~~~~ 61 (514) ..|.+ .... -...+. +.....++......+. ++.+.... ++.-+.....+..... T Consensus 71 ~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-- 144 (477) T protein:vir:84 71 RELESEIERSGKLEAETKTVRKATVEV----NEALTYEKGNGQSYFRDLAMQTVGMADEPAKERLRRHMVDVESDKEI-- 144 (477) T ss_pred HHHHHHHHHhhcchhhhhhhccccccc----ccchhhhhhHHHHHHHHHHHHHhhhhhhHHHHHHHHHHhhhhhhhhH-- Confidence 00000 0000 000000 0000001000000000 00000000 0000000000000000 Q ss_pred cccccccccccccccccccccccccccceeee-h-hhhhhhhhhhhcceeEEecCCcccceeeeeeeeecCCCCcccccc Q lcl|Aclame:pro 62 EAVVNGDHGYDPANIAQGVTTGAVTNIGPTVM-G-MVRRAIPQLIAFDIAGVQPMTGPTSQVFTLRSVYGKDPLTGAEAF 139 (514) Q Consensus 62 ~a~~~~~~g~~~~~~~~st~tg~v~~~~P~l~-~-l~Rra~~~LIa~DI~GVQPmTgPTGLIFAMRSrY~~~~~tg~EA~ 139 (514) ......+.....+..++++|.. ..-|-.+ . ++...-++.+-.++|++.||++.+|-+--.+.. .. ...+. T Consensus 145 --~~~~~~~~~~~~~~~~~~~gg~-lv~~~~~~~~ii~~l~~~~~i~~~~~~~~~~~~~~~~~ip~~~--~~---~~~a~ 216 (477) T protein:vir:84 145 --RKIAKVGEEYRDLDRNGGTGGY-AVPPLWMMNRFIELARAGRTYANLCPTEPLPGGTSSINIPKIL--TG---TSTAI 216 (477) T ss_pred --HHHHHhhhhhccccccCCCcce-eeccchhHHHHHHHhhhcchHHHhhceeeecCCcceeEEEEEe--cC---cceee Confidence 0000000011111111111111 1112222 1 444445677778999999999988854322211 00 00000 Q ss_pred ccccCCCCccCccccccccccccccccccccccccccccccccccccccccccccccccccCcccccccccccccccccc Q lcl|Aclame:pro 140 HPTRQADASFSGQAAASTIADFPTTGAATDGTPYKAEVTTSGGDVSMRYFLALGAVTLAVAGQMTATEYTDGVAGGLLVE 219 (514) Q Consensus 140 ~~~nEadt~fSG~~~~~~~~~~~~~~~~t~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~y~ 219 (514) +.+. T Consensus 217 ---------~~~E------------------------------------------------------------------- 220 (477) T protein:vir:84 217 ---------QAAD------------------------------------------------------------------- 220 (477) T ss_pred ---------eecc------------------------------------------------------------------- Confidence 0000 Q ss_pred ccccccchhhhccccCCCCCCcccccceeEEEEEEEEeecccccccccHHHHHHHHhhcCCChhHHHHHHHHHHHHHHhh Q lcl|Aclame:pro 220 IDAGMATSQAELQENFNGSSNNEWNEMSFRIDKQVVEAKSRQLKAQYSIELAQDLRAVHGLDADAELSGILANEVMVELN 299 (514) Q Consensus 220 ~~~GmtTa~aEal~~~ggs~~~~f~EMsFsIEK~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAEaELanILStEImlEIN 299 (514) |-. ......++...+++.+++.+|.-+-...+|-||.+|-. .|.++.|.+-|+..|...|+ T Consensus 221 ---g~~------------~~~~~~~~s~~~f~~i~~~~~k~~~~~~iS~ell~ds~----~~l~~~i~~~l~~~~~~~~d 281 (477) T protein:vir:84 221 ---NAA------------LTAPSAHEVDLTDGFVQANVKTIAGQQGIAIQLLDQAA----VSVDEFVFRDLAADYANKLN 281 (477) T ss_pred ---Ccc------------cccccccccccceeeEEEeeeeEEeeeHHHHHHHhccc----hhHHHHHHHHHHHHHHHHHH Confidence 000 00112344446677788888888888889999999953 45799999999999999999 Q ss_pred HHHHHHHhhheeecccccccccCCcceecccccccc----ccchhhHHHHHHHHHHHHHHHHHHHHhcccccccEEEECh Q lcl|Aclame:pro 300 REIVNLVNSQAQIGKSGWTQGAGAAGVFDFSDAVDV----KGARWAGEAYKALLIQIEKEANEIGRQTGRGNGNFIIASR 375 (514) Q Consensus 300 Reii~~l~~~a~v~~~~~~~~v~~~g~~dl~~~~d~----~~~rwa~e~~r~L~~~i~~~a~~I~~~T~r~~~n~~v~S~ 375 (514) +.|+ +-.-+ .. .+.|++........ .+..| .....++..|-...+.+.... +-.+..+|++| T Consensus 282 ~~~l---~G~Gt---~~-----~p~Gi~~~~~~~~~~~~~~~~t~--~~~~~~~~~i~~~~~~~~~~~-~~~~~~~v~~~ 347 (477) T protein:vir:84 282 VQVI---SGTGS---NN-----QVVGVRATAGITQVTATSAGSAL--EKHQIIYQKIADAIQRVHTSR-FLEPEVIVMHP 347 (477) T ss_pred HHHh---ccCCC---CC-----ccceeeeccccccccccccccch--hhHHHHHHHHHHHHhhccccc-cCCccEEEEcH Confidence 9884 22111 01 13455433211100 01111 112233333333333333222 23456788888 Q ss_pred hHHhHHhhccccccch----hcccc-CccccccccCceEEEEecCceEEEecCCCccc--------eEEEEEecCCCc-c Q lcl|Aclame:pro 376 NVVSALSMTDTLVGPA----AQGMQ-DGSMNTDTNQTVFAGVLGGRFKVYIDQYAVND--------YFTVGFKGSTEM-D 441 (514) Q Consensus 376 ~va~~L~~~g~~~~~~----~~~~~-~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~d--------y~~vG~kG~~~~-~ 441 (514) +....|....--..-| ..... ........-.....|+|. |++|+++++.|.+ -|++|--.+.-. + T Consensus 348 ~~~~~l~~lkd~~G~~l~~~~~~~~~~~~~~~~~~~~~~~~~l~-G~pVv~s~~~p~~~~~~~d~~~i~~gd~~~~~i~~ 426 (477) T protein:vir:84 348 RRWASFHAIFAGDDRPLIVPSGPGFNNLGVLTEVASQRVVGQMH-GLPVVTDPTLPTTLGTGTDQDVIHVLRASDLALFE 426 (477) T ss_pred HHHHHHHHhhccCCCeeeecCcccccccccccccccccccchhc-ccceEecCcccccccccCCcceEEEEEeceEEEEe Confidence 8766654421100000 00000 000111112222335665 5699999998743 344443321100 0 Q ss_pred c--ceeeccccccccccccCCcccc--ceeeeeeeee-----eeecC--ccccccCc--ceeecCcchhhhc Q lcl|Aclame:pro 442 A--GVFYSPYVPLTPLRGSDSKNFQ--PVIGFKTRYG-----VQVNP--FADPTASA--TKVGNGAPVAASM 500 (514) Q Consensus 442 ~--~~fy~PYv~~~~~~~~dp~s~q--p~~~~~tRY~-----l~~nP--f~~~~~~~--~~i~~~~~~~~~~ 500 (514) . .+. +++.++. ....|.. || .+.+| |...+--. ++ .++ T Consensus 427 ~~~~~~------------~~~~~~~~~~~~~~~v-~~~~~~~~~r~~~afv~~t~~~~~~~--------~~~ 477 (477) T protein:vir:84 427 SSVRMR------------ALQETRAENLSVLLQV-YGYLAFTAARFPQSVVEIGGTALTAP--------TFA 477 (477) T ss_pred eceeEE------------eccccccccceeeeee-hhhhhhhhhccccceEEeeccccccc--------ccC Confidence 0 011 2222221 2222211 22 12245 33221100 00 011 No 34 >protein:vir:4700 Length: 415 # NCBI annotation: phi PVL ORF 7 homologue # Family: family:all:21 # MgeID: mge:102 # MgeName: phiPV83 # Cross-refs: genbank:acc:NP_061632;genbank:gi:9635719;genbank:GeneID:1262976 Probab=88.59 E-value=0.031 Score=28.83 Aligned_cols=341 Identities=13% Similarity=0.055 Sum_probs=128.8 Q ss_pred Ccchhh---hhhhhccccccccccccc--------hhhhhhhhhhhhHHHHHHhcccccchhhhhhhccccccccccccc Q lcl|Aclame:pro 1 MNLTEK---WKDLLEAEGADMPEIATA--------TKQKIMSKIFENQDRDINNDPMYRDPQLVEAFNAGLNEAVVNGDH 69 (514) Q Consensus 1 ~~l~~k---w~p~l~~~~~~~~~i~~~--------~~~~~~~~~~enq~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~ 69 (514) -.+.++ =.-.++............ -.+......+.+....-.|- ..|...+... T Consensus 52 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---------~~~~~~~~~~------ 116 (415) T protein:vir:47 52 QEKQEELDKLKEKDRTSENNQQSVEVNEARTYRNQANINDLGISIQNTKVTSQEV---------RDFTEYLETR------ 116 (415) T ss_pred HHHHHHHHHHHHHHHhhhhcccccccchhhhhHHHHHHHHHHHhhhhhhhhHHHH---------HHHHHHHhhh------ Confidence 111111 000111110000000000 00011111111110000000 0111111110 Q ss_pred cccccccccccccccccccceeeehhhhhhhhhhhhcceeEEecCCcccceeeeeeeeecCCCCccccccccccCCCCcc Q lcl|Aclame:pro 70 GYDPANIAQGVTTGAVTNIGPTVMGMVRRAIPQLIAFDIAGVQPMTGPTSQVFTLRSVYGKDPLTGAEAFHPTRQADASF 149 (514) Q Consensus 70 g~~~~~~~~st~tg~v~~~~P~l~~l~Rra~~~LIa~DI~GVQPmTgPTGLIFAMRSrY~~~~~tg~EA~~~~nEadt~f 149 (514) .+......++..|...--....=.+++.+.+...-.+++.+.||+++++-+--.+.. . +.++ .| T Consensus 117 -~~~~~~~~~t~~g~~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~--~----~~~~---------~~ 180 (415) T protein:vir:47 117 -NDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQS--E----VAAL---------EK 180 (415) T ss_pred -hhhhhccccccCCcccccHHHHHHHHHHHHhhhhhhhhcceeeccCCceeEEEEEec--C----Ccce---------ee Confidence 000000001111111100111113555566777888999999999988754222211 0 0000 00 Q ss_pred CccccccccccccccccccccccccccccccccccccccccccccccccccCccccccccccccccccccccccccchhh Q lcl|Aclame:pro 150 SGQAAASTIADFPTTGAATDGTPYKAEVTTSGGDVSMRYFLALGAVTLAVAGQMTATEYTDGVAGGLLVEIDAGMATSQA 229 (514) Q Consensus 150 SG~~~~~~~~~~~~~~~~t~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~y~~~~GmtTa~a 229 (514) - + T Consensus 181 v----------------------------------------------------------------------~-------- 182 (415) T protein:vir:47 181 V----------------------------------------------------------------------E-------- 182 (415) T ss_pred c----------------------------------------------------------------------c-------- Confidence 0 0 Q ss_pred hccccCCCCCCcccccce-eEEEEEEEEeecccccccccHHHHHHHHhhcCCChhHHHHHHHHHHHHHHhhHHHHHHHhh Q lcl|Aclame:pro 230 ELQENFNGSSNNEWNEMS-FRIDKQVVEAKSRQLKAQYSIELAQDLRAVHGLDADAELSGILANEVMVELNREIVNLVNS 308 (514) Q Consensus 230 Eal~~~ggs~~~~f~EMs-FsIEK~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAEaELanILStEImlEINReii~~l~~ 308 (514) | +...++.+ -++++++..++..+-...+|-||.+|-. .|.+++|.+-|+..|...+|+.|+.-.-. T Consensus 183 E---------g~~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~----~~l~~~i~~~l~~~i~~~~d~~il~g~g~ 249 (415) T protein:vir:47 183 E---------LEENPELAVKPFFQLAYDINTHRGYFRISREAIEDAK----VNVLQELKLWMARTIAATRNKAIIDVITK 249 (415) T ss_pred c---------ccccccccccceeeEEeeeeeeEeeehhhHHHHhhch----HHHHHHHHHHHHHHHHHHHHHHHhhcccc Confidence 0 01122222 2445555555555556679999999843 56789999999999999999999654311 Q ss_pred heeecccccccccCCcceeccccccccccchhhHHHHHHHHHHHHHHHHHHHHhcccccccEEEEChhHHhHHhhccccc Q lcl|Aclame:pro 309 QAQIGKSGWTQGAGAAGVFDFSDAVDVKGARWAGEAYKALLIQIEKEANEIGRQTGRGNGNFIIASRNVVSALSMTDTLV 388 (514) Q Consensus 309 ~a~v~~~~~~~~v~~~g~~dl~~~~d~~~~rwa~e~~r~L~~~i~~~a~~I~~~T~r~~~n~~v~S~~va~~L~~~g~~~ 388 (514) -.... ...........+...... ..+-...|+..+... .++.+.+|++|.....|.... + T Consensus 250 g~~~~--~~~~~~~~~~~~~~~~~~-------~~~~i~~~~~~~~~~---------~~~~~~~v~n~~~~~~L~~lk--d 309 (415) T protein:vir:47 250 GSTGS--TSSGFEKEGKKLEVKKAK-------SLDDIKDAINLNVKP---------NYEHNVAIVSQTMFAKLDKMK--D 309 (415) T ss_pred CCccc--cccccccccceecccccc-------chHHHHHHHHhhhhh---------ccCCCEEEEcHHHHHHHHHhh--c Confidence 10000 000000000111100000 112233444433322 356778999999988886421 0 Q ss_pred cchhccccCccccccccCceEEEEecCceEEEecCCCccceEEEEEecCCCcccceeeccccc--------cccccccCC Q lcl|Aclame:pro 389 GPAAQGMQDGSMNTDTNQTVFAGVLGGRFKVYIDQYAVNDYFTVGFKGSTEMDAGVFYSPYVP--------LTPLRGSDS 460 (514) Q Consensus 389 ~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~kG~~~~~~~~fy~PYv~--------~~~~~~~dp 460 (514) ..|. .-...+.+.. ..++|.| ++|++.++.+. |-.|+ ..++|+.|-. ...+...|- T Consensus 310 ---~~G~--~i~~~~~~~~-~~~~l~G-~pV~~~~~~~~-----~~~~~----~~~~~gd~~~~~~~~~~~~~~v~~~~~ 373 (415) T protein:vir:47 310 ---KLGN--YLIQPDVKEK-TQQRLLG-AKIEILPDEVL-----GQKGN----NTLIIGNLKDAIVLFDRSQYQASWTDY 373 (415) T ss_pred ---cCCC--eeeccCcCCC-CCccccc-eeeEEeccccc-----cCCCc----cEEEEEehhccEEEEeecceEEEeecc Confidence 0000 0011111111 1135544 47777665542 11111 1122222110 011112344 Q ss_pred ccccceeeeeeeeee-eecC--ccccccCcceeecCcchhhhc Q lcl|Aclame:pro 461 KNFQPVIGFKTRYGV-QVNP--FADPTASATKVGNGAPVAASM 500 (514) Q Consensus 461 ~s~qp~~~~~tRY~l-~~nP--f~~~~~~~~~i~~~~~~~~~~ 500 (514) .+++-.+-...|++. ..+| |...+-..+ ..-+.+..+-+ T Consensus 374 ~~~~~~~~~~~r~d~~v~~~~a~~~~~~~~~-~~~~~~~~~~~ 415 (415) T protein:vir:47 374 MHFGECLMIAVRQDCRILDYKSAIVIEYDDS-ERGEGDLGLEA 415 (415) T ss_pred ccCceEEEEEEEeccEEeccccEEEEEeecc-CCCCCCccCCC Confidence 566667777888887 4555 322110000 00001111111 No 35 >protein:vir:4600 Length: 415 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:101 # MgeName: PVL # Cross-refs: genbank:acc:NP_058445;genbank:gi:9635171;genbank:GeneID:1262708 Probab=88.59 E-value=0.031 Score=28.83 Aligned_cols=341 Identities=13% Similarity=0.055 Sum_probs=128.8 Q ss_pred Ccchhh---hhhhhccccccccccccc--------hhhhhhhhhhhhHHHHHHhcccccchhhhhhhccccccccccccc Q lcl|Aclame:pro 1 MNLTEK---WKDLLEAEGADMPEIATA--------TKQKIMSKIFENQDRDINNDPMYRDPQLVEAFNAGLNEAVVNGDH 69 (514) Q Consensus 1 ~~l~~k---w~p~l~~~~~~~~~i~~~--------~~~~~~~~~~enq~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~ 69 (514) -.+.++ =.-.++............ -.+......+.+....-.|- ..|...+... T Consensus 52 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---------~~~~~~~~~~------ 116 (415) T protein:vir:46 52 QEKQEELDKLKEKDRTSENNQQSVEVNEARTYRNQANINDLGISIQNTKVTSQEV---------RDFTEYLETR------ 116 (415) T ss_pred HHHHHHHHHHHHHHHhhhhcccccccchhhhhHHHHHHHHHHHhhhhhhhhHHHH---------HHHHHHHhhh------ Confidence 111111 000111110000000000 00011111111110000000 0111111110 Q ss_pred cccccccccccccccccccceeeehhhhhhhhhhhhcceeEEecCCcccceeeeeeeeecCCCCccccccccccCCCCcc Q lcl|Aclame:pro 70 GYDPANIAQGVTTGAVTNIGPTVMGMVRRAIPQLIAFDIAGVQPMTGPTSQVFTLRSVYGKDPLTGAEAFHPTRQADASF 149 (514) Q Consensus 70 g~~~~~~~~st~tg~v~~~~P~l~~l~Rra~~~LIa~DI~GVQPmTgPTGLIFAMRSrY~~~~~tg~EA~~~~nEadt~f 149 (514) .+......++..|...--....=.+++.+.+...-.+++.+.||+++++-+--.+.. . +.++ .| T Consensus 117 -~~~~~~~~~t~~g~~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~--~----~~~~---------~~ 180 (415) T protein:vir:46 117 -NDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQS--E----VAAL---------EK 180 (415) T ss_pred -hhhhhccccccCCcccccHHHHHHHHHHHHhhhhhhhhcceeeccCCceeEEEEEec--C----Ccce---------ee Confidence 000000001111111100111113555566777888999999999988754222211 0 0000 00 Q ss_pred CccccccccccccccccccccccccccccccccccccccccccccccccccCccccccccccccccccccccccccchhh Q lcl|Aclame:pro 150 SGQAAASTIADFPTTGAATDGTPYKAEVTTSGGDVSMRYFLALGAVTLAVAGQMTATEYTDGVAGGLLVEIDAGMATSQA 229 (514) Q Consensus 150 SG~~~~~~~~~~~~~~~~t~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~y~~~~GmtTa~a 229 (514) - + T Consensus 181 v----------------------------------------------------------------------~-------- 182 (415) T protein:vir:46 181 V----------------------------------------------------------------------E-------- 182 (415) T ss_pred c----------------------------------------------------------------------c-------- Confidence 0 0 Q ss_pred hccccCCCCCCcccccce-eEEEEEEEEeecccccccccHHHHHHHHhhcCCChhHHHHHHHHHHHHHHhhHHHHHHHhh Q lcl|Aclame:pro 230 ELQENFNGSSNNEWNEMS-FRIDKQVVEAKSRQLKAQYSIELAQDLRAVHGLDADAELSGILANEVMVELNREIVNLVNS 308 (514) Q Consensus 230 Eal~~~ggs~~~~f~EMs-FsIEK~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAEaELanILStEImlEINReii~~l~~ 308 (514) | +...++.+ -++++++..++..+-...+|-||.+|-. .|.+++|.+-|+..|...+|+.|+.-.-. T Consensus 183 E---------g~~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~----~~l~~~i~~~l~~~i~~~~d~~il~g~g~ 249 (415) T protein:vir:46 183 E---------LEENPELAVKPFFQLAYDINTHRGYFRISREAIEDAK----VNVLQELKLWMARTIAATRNKAIIDVITK 249 (415) T ss_pred c---------ccccccccccceeeEEeeeeeeEeeehhhHHHHhhch----HHHHHHHHHHHHHHHHHHHHHHHhhcccc Confidence 0 01122222 2445555555555556679999999843 56789999999999999999999654311 Q ss_pred heeecccccccccCCcceeccccccccccchhhHHHHHHHHHHHHHHHHHHHHhcccccccEEEEChhHHhHHhhccccc Q lcl|Aclame:pro 309 QAQIGKSGWTQGAGAAGVFDFSDAVDVKGARWAGEAYKALLIQIEKEANEIGRQTGRGNGNFIIASRNVVSALSMTDTLV 388 (514) Q Consensus 309 ~a~v~~~~~~~~v~~~g~~dl~~~~d~~~~rwa~e~~r~L~~~i~~~a~~I~~~T~r~~~n~~v~S~~va~~L~~~g~~~ 388 (514) -.... ...........+...... ..+-...|+..+... .++.+.+|++|.....|.... + T Consensus 250 g~~~~--~~~~~~~~~~~~~~~~~~-------~~~~i~~~~~~~~~~---------~~~~~~~v~n~~~~~~L~~lk--d 309 (415) T protein:vir:46 250 GSTGS--TSSGFEKEGKKLEVKKAK-------SLDDIKDAINLNVKP---------NYEHNVAIVSQTMFAKLDKMK--D 309 (415) T ss_pred CCccc--cccccccccceecccccc-------chHHHHHHHHhhhhh---------ccCCCEEEEcHHHHHHHHHhh--c Confidence 10000 000000000111100000 112233444433322 356778999999988886421 0 Q ss_pred cchhccccCccccccccCceEEEEecCceEEEecCCCccceEEEEEecCCCcccceeeccccc--------cccccccCC Q lcl|Aclame:pro 389 GPAAQGMQDGSMNTDTNQTVFAGVLGGRFKVYIDQYAVNDYFTVGFKGSTEMDAGVFYSPYVP--------LTPLRGSDS 460 (514) Q Consensus 389 ~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~kG~~~~~~~~fy~PYv~--------~~~~~~~dp 460 (514) ..|. .-...+.+.. ..++|.| ++|++.++.+. |-.|+ ..++|+.|-. ...+...|- T Consensus 310 ---~~G~--~i~~~~~~~~-~~~~l~G-~pV~~~~~~~~-----~~~~~----~~~~~gd~~~~~~~~~~~~~~v~~~~~ 373 (415) T protein:vir:46 310 ---KLGN--YLIQPDVKEK-TQQRLLG-AKIEILPDEVL-----GQKGN----NTLIIGNLKDAIVLFDRSQYQASWTDY 373 (415) T ss_pred ---cCCC--eeeccCcCCC-CCccccc-eeeEEeccccc-----cCCCc----cEEEEEehhccEEEEeecceEEEeecc Confidence 0000 0011111111 1135544 47777665542 11111 1122222110 011112344 Q ss_pred ccccceeeeeeeeee-eecC--ccccccCcceeecCcchhhhc Q lcl|Aclame:pro 461 KNFQPVIGFKTRYGV-QVNP--FADPTASATKVGNGAPVAASM 500 (514) Q Consensus 461 ~s~qp~~~~~tRY~l-~~nP--f~~~~~~~~~i~~~~~~~~~~ 500 (514) .+++-.+-...|++. ..+| |...+-..+ ..-+.+..+-+ T Consensus 374 ~~~~~~~~~~~r~d~~v~~~~a~~~~~~~~~-~~~~~~~~~~~ 415 (415) T protein:vir:46 374 MHFGECLMIAVRQDCRILDYKSAIVIEYDDS-ERGEGDLGLEA 415 (415) T ss_pred ccCceEEEEEEEeccEEeccccEEEEEeecc-CCCCCCccCCC Confidence 566667777888887 4555 322110000 00001111111 No 36 >protein:vir:10364 Length: 390 # NCBI annotation: head protein; major capsid subunit precursor # Family: family:all:585 # MgeID: mge:183 # MgeName: Xp10 # Cross-refs: genbank:acc:NP_858956;genbank:gi:32128421;genbank:GeneID:2648357 Probab=88.30 E-value=0.033 Score=28.70 Aligned_cols=333 Identities=13% Similarity=0.066 Sum_probs=120.4 Q ss_pred Ccchhhhhhhhccccccccccccchhhhhhhhh---hhhHHHHHH-hccc-------ccchhhhhhhccccccccccccc Q lcl|Aclame:pro 1 MNLTEKWKDLLEAEGADMPEIATATKQKIMSKI---FENQDRDIN-NDPM-------YRDPQLVEAFNAGLNEAVVNGDH 69 (514) Q Consensus 1 ~~l~~kw~p~l~~~~~~~~~i~~~~~~~~~~~~---~enq~~~~~-~~~~-------~~~~~~~~~~~~~~~~a~~~~~~ 69 (514) -...+++.-+..... ++ ++.+ ..+ ++.-++... .+.. .........+-............ T Consensus 34 ~e~~~~~~~~~~e~~----~l----~~~i-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 104 (390) T protein:vir:10 34 ASARSKVDELFATVG----NL----SAEV-QAARQRVAELEGNGAGGDVQHVSVGDLFVASEQFQASAGRWNDRSARATM 104 (390) T ss_pred HHHHHHHHHHHHHHH----HH----HHHH-HHHHHHHHHHHhhcccccccccchhhhhhhhHHHHHHHHhhhhhhhhhhh Confidence 111222222211100 00 0000 000 000000000 0000 00000000000000000000000 Q ss_pred cccc-cccc--ccc-cccc--ccccceeeehhhhhhhhhhhhcceeEEecCCcccceeeeeeeeecCCCCcccccccccc Q lcl|Aclame:pro 70 GYDP-ANIA--QGV-TTGA--VTNIGPTVMGMVRRAIPQLIAFDIAGVQPMTGPTSQVFTLRSVYGKDPLTGAEAFHPTR 143 (514) Q Consensus 70 g~~~-~~~~--~st-~tg~--v~~~~P~l~~l~Rra~~~LIa~DI~GVQPmTgPTGLIFAMRSrY~~~~~tg~EA~~~~n 143 (514) -... ..-. .++ ..|. ++...+-+|-++ -....-.++|.+.||++++.-+. +.... + .++ T Consensus 105 ~~~~~~~~~~~~~~~~~g~~~~~~~~~~ii~~~---~~~~~l~~~~~~~~~~~~~~~~~----~~~~~--~-~~a----- 169 (390) T protein:vir:10 105 NIKAALNTASTDAAGSAGALTTPNRLPGFITQP---DARLTVRDLIGSGRTDSALIEYV----QETGF--V-NNA----- 169 (390) T ss_pred HHHHHHHhhhcccccccccccchhHHHHHHHHH---HhhchhhhhcceeeccCCceEEE----EEecC--C-cce----- Confidence 0000 0000 000 0111 111122233333 34455667899999877652221 11110 0 000 Q ss_pred CCCCccCccccccccccccccccccccccccccccccccccccccccccccccccccCcccccccccccccccccccccc Q lcl|Aclame:pro 144 QADASFSGQAAASTIADFPTTGAATDGTPYKAEVTTSGGDVSMRYFLALGAVTLAVAGQMTATEYTDGVAGGLLVEIDAG 223 (514) Q Consensus 144 Eadt~fSG~~~~~~~~~~~~~~~~t~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~y~~~~G 223 (514) .|- T Consensus 170 ----~~v------------------------------------------------------------------------- 172 (390) T protein:vir:10 170 ----AIV------------------------------------------------------------------------- 172 (390) T ss_pred ----eee------------------------------------------------------------------------- Confidence 000 Q ss_pred ccchhhhccccCCCCCCcccccceeEEEEEEEEeecccccccccHHHHHHHHhhcCCChhHHHHHHHHHHHHHHhhHHHH Q lcl|Aclame:pro 224 MATSQAELQENFNGSSNNEWNEMSFRIDKQVVEAKSRQLKAQYSIELAQDLRAVHGLDADAELSGILANEVMVELNREIV 303 (514) Q Consensus 224 mtTa~aEal~~~ggs~~~~f~EMsFsIEK~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAEaELanILStEImlEINReii 303 (514) .| +...++-..+++++++.+|..+....+|-||.||-- |.++.|.+-|+..|...||+.|| T Consensus 173 -----~E---------g~~~~~~~~~~~~i~~~~~k~~~~~~is~ell~d~~-----~l~~~i~~~l~~~~~~~~~~~il 233 (390) T protein:vir:10 173 -----AE---------GALKPESSLKFAKKTDTTHVIAHTMKATRQILSDAP-----QLASYMNNRLIRGLKVKEDAEIL 233 (390) T ss_pred -----cC---------CccccccccceeEEEEeeEEEEEeehhhHHHHHhHH-----HHHHHHHHHHHHHHHHHHHHHHh Confidence 01 112233335566667777777777889999999853 46899999999999999999885 Q ss_pred HHHhhheeecccccccccCCcceeccccccccccchhhHHHHHHHHHHHHHHHHHHHHhcccccccEEEEChhHHhHHhh Q lcl|Aclame:pro 304 NLVNSQAQIGKSGWTQGAGAAGVFDFSDAVDVKGARWAGEAYKALLIQIEKEANEIGRQTGRGNGNFIIASRNVVSALSM 383 (514) Q Consensus 304 ~~l~~~a~v~~~~~~~~v~~~g~~dl~~~~d~~~~rwa~e~~r~L~~~i~~~a~~I~~~T~r~~~n~~v~S~~va~~L~~ 383 (514) .- .-+ +-.+.|++......-.... -....++..+..+...+. ......+.+|++|.....|.. T Consensus 234 ~G---~G~--------~~~p~Gi~~~~~~~~~~~~----~~~~~~~~~~~~~~~~l~--~~~~~~~~~v~n~~~~~~L~~ 296 (390) T protein:vir:10 234 RG---TGA--------NDGLLGLIPQATTYAAPTT----IAGATRVDQLRLAMLQAS--LAEYPASGIVINPIDWAAIEL 296 (390) T ss_pred hc---CCC--------Ccccccccccccccccccc----ccccchHHHHHHHHHhhc--cccCCCCEEEEcHHHHHHHHH Confidence 21 110 1123344332111100000 001111122222222222 224566788999998877764 Q ss_pred ccccccchhccccCccccccccCceEEEEecCceEEEecCCCccceEEEEEecCCCcccceeeccccccccccccC---- Q lcl|Aclame:pro 384 TDTLVGPAAQGMQDGSMNTDTNQTVFAGVLGGRFKVYIDQYAVNDYFTVGFKGSTEMDAGVFYSPYVPLTPLRGSD---- 459 (514) Q Consensus 384 ~g~~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~kG~~~~~~~~fy~PYv~~~~~~~~d---- 459 (514) .. + ..| ...-.+.... -.++| .|++|++++..|.+-+++|--- .+++.+... ...+...+ T Consensus 297 lk--d---~~g---~~l~~~~~~~-~~~~l-~G~pv~~~~~~p~~~~~~gdf~-----~~~~~~~~~-~~~i~~~~~~~~ 360 (390) T protein:vir:10 297 AK--D---ANN---QYLIGNARGT-LTPTL-WGLPVVATQAMAPGEFLVGAFD-----LAAQIFDQW-DARVEIGYVNDD 360 (390) T ss_pred hh--c---CCC---ceeecCCcCc-CCcee-cceeeEEcCCCCCCcEEEEecc-----ceEEEEEec-ceEEEEeecccc Confidence 21 1 001 0000000000 01345 4569999999887655555210 011111110 00011111 Q ss_pred Cccccceeeeeeeeee-eecC--ccccccCcceeecC Q lcl|Aclame:pro 460 SKNFQPVIGFKTRYGV-QVNP--FADPTASATKVGNG 493 (514) Q Consensus 460 p~s~qp~~~~~tRY~l-~~nP--f~~~~~~~~~i~~~ 493 (514) -.+-+=.+-...||+. +.+| |... .=+ T Consensus 361 ~~~~~~~~r~~~r~d~~v~~~~a~~~~-------~~a 390 (390) T protein:vir:10 361 FQRNMVTVLAEERLALVVYRPEALISG-------SFA 390 (390) T ss_pred cccCcEEEEEEEeeccEEeccccEEEE-------EeC Confidence 1122223334456665 3344 2111 100 No 37 >protein:vir:8187 Length: 311 # NCBI annotation: gp7 # Family: family:all:966 # MgeID: mge:153 # MgeName: Che9d # Cross-refs: genbank:acc:NP_817980;genbank:gi:29566414;genbank:GeneID:2700968 Probab=85.68 E-value=0.051 Score=27.67 Aligned_cols=286 Identities=11% Similarity=0.075 Sum_probs=115.5 Q ss_pred ccccccccccccceeeehhhhhhhhhhhhcceeEEecCCcccceeeeeeeeecCCCCccccccccccCCCCccCcccccc Q lcl|Aclame:pro 77 AQGVTTGAVTNIGPTVMGMVRRAIPQLIAFDIAGVQPMTGPTSQVFTLRSVYGKDPLTGAEAFHPTRQADASFSGQAAAS 156 (514) Q Consensus 77 ~~st~tg~v~~~~P~l~~l~Rra~~~LIa~DI~GVQPmTgPTGLIFAMRSrY~~~~~tg~EA~~~~nEadt~fSG~~~~~ 156 (514) ...+++|.+.--....=.+++++-++-+-.+++-+.||++..- +|+... .+.+|- | T Consensus 1 mat~~~gg~lvP~~~~~~ii~~~~~~s~i~~~~~~i~~~~~~~-------~~p~~~-~~~~a~---------w------- 56 (311) T protein:vir:81 1 MVALATGTFQLPKHLVPGVWQKAQGQSVLARLSMAEPQEFGEQ-------QYMTLT-APPRGE---------V------- 56 (311) T ss_pred CceecCCceEcchhHHHHHHHHHHhcchhhhhcceeecCCCce-------EEEEEe-CCceeE---------E------- Confidence 1222233221111111124455556777888999999865421 121110 000110 0 Q ss_pred ccccccccccccccccccccccccccccccccccccccccccccCccccccccccccccccccccccccchhhhccccCC Q lcl|Aclame:pro 157 TIADFPTTGAATDGTPYKAEVTTSGGDVSMRYFLALGAVTLAVAGQMTATEYTDGVAGGLLVEIDAGMATSQAELQENFN 236 (514) Q Consensus 157 ~~~~~~~~~~~t~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~y~~~~GmtTa~aEal~~~g 236 (514) + +| T Consensus 57 ---------------------------------------------------------------v--------~E------ 59 (311) T protein:vir:81 57 ---------------------------------------------------------------V--------GE------ 59 (311) T ss_pred ---------------------------------------------------------------e--------ec------ Confidence 0 01 Q ss_pred CCCCcccccceeEEEEEEEEeecccccccccHHHHHHHHhhcCCChhHHHHHHHHHHHHHHhhHHHHHHHhhheeecccc Q lcl|Aclame:pro 237 GSSNNEWNEMSFRIDKQVVEAKSRQLKAQYSIELAQDLRAVHGLDADAELSGILANEVMVELNREIVNLVNSQAQIGKSG 316 (514) Q Consensus 237 gs~~~~f~EMsFsIEK~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAEaELanILStEImlEINReii~~l~~~a~v~~~~ 316 (514) +..+++...++++++..+|.-+-....|-||.|+-.. -.++-|++|.+-|+..|...|+.-++.-.....-..-.+ T Consensus 60 ---g~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~~~~d-~~~~l~~~i~~~la~ai~~~~d~a~l~G~~~~~~~~~~g 135 (311) T protein:vir:81 60 ---GAQKSESTATFAPVTAIPRKVQVTQRFSQEVKWADES-RQLGVLQTMADLSGVALGRALDLIGIHGINPLTGAALSG 135 (311) T ss_pred ---CcccccccceeeEEEEeeEEEEEeehhhHHHhhcCcc-cHHHHHHHHHHHHHHHHHHHHHHhhhccccCCCCccccc Confidence 1122333334455555555444455799999875322 134457788888888888888888743321000000001 Q ss_pred cccccCC-cceeccccccccccchhhHHHHHHHHHHHHHHHHHHHHhcccccccEEEEChhHHhHHhhccccccchhccc Q lcl|Aclame:pro 317 WTQGAGA-AGVFDFSDAVDVKGARWAGEAYKALLIQIEKEANEIGRQTGRGNGNFIIASRNVVSALSMTDTLVGPAAQGM 395 (514) Q Consensus 317 ~~~~v~~-~g~~dl~~~~d~~~~rwa~e~~r~L~~~i~~~a~~I~~~T~r~~~n~~v~S~~va~~L~~~g~~~~~~~~~~ 395 (514) ....+.. ......... ....++.-|.++-..+. ..+++.+-+|++|+....|.... + ..| T Consensus 136 i~~~~~~~~~~~~~~~~-----------~~~~~~~~i~~~~~~~~--~~~~~~~~~vmn~~~~~~l~~lk--d---~~G- 196 (311) T protein:vir:81 136 SPAKILDTTNIVELTTG-----------TSATPDLAVEAAVGLVL--GDNLSPDGVALDNTFSFMLATQR--D---SQG- 196 (311) T ss_pred ccccccccceeeeeccc-----------ccchHHHHHHHHHHHhh--hcCCCceEEEEcHHHHHHHHhhh--c---cCC- Confidence 1111101 111111110 01112223444444442 22577777899999988886421 1 000 Q ss_pred cCccccccccCceEEEEecCceEEEecCCCccceE------EEEEecCCCc-----c-cceeeccccccc--cccccCCc Q lcl|Aclame:pro 396 QDGSMNTDTNQTVFAGVLGGRFKVYIDQYAVNDYF------TVGFKGSTEM-----D-AGVFYSPYVPLT--PLRGSDSK 461 (514) Q Consensus 396 ~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~------~vG~kG~~~~-----~-~~~fy~PYv~~~--~~~~~dp~ 461 (514) ...-.+....-..|+|.| ++|+++.+.+..-. .+...+.... | +.+++...-... ..+-.|+. T Consensus 197 --~~l~~~~~~~~~~~tl~G-~Pv~~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~gDfs~~~i~~~~~~~~~~~~~~~~~ 273 (311) T protein:vir:81 197 --RKLYPELGFGTDVASFAG-LNAAVSDTVRGGPEAVTASTGVYRTTNPNVKAIAGDFSAFRWGVQVSIPLELIEFGDPD 273 (311) T ss_pred --CeeecCccccCCCceecc-eeEEecccccccccccccccchhcccCCccEEEEEecccEEEEEeccceEEEeccCCCC Confidence 001001111111256665 58988876653211 1111111110 0 112222211111 11111322 Q ss_pred c----ccc-eeee--eeeeee-eecC--ccccccCcceeecCcch Q lcl|Aclame:pro 462 N----FQP-VIGF--KTRYGV-QVNP--FADPTASATKVGNGAPV 496 (514) Q Consensus 462 s----~qp-~~~~--~tRY~l-~~nP--f~~~~~~~~~i~~~~~~ 496 (514) . ||- .++| ..|++. +.+| |+..+.- ..- T Consensus 274 ~~~~~~~~~~v~~r~~~r~d~~v~~~~a~~~l~~a-------~~~ 311 (311) T protein:vir:81 274 GLGDLKRQNQIAIRAEVVYGIGIMSTDAFAVVRDA-------DES 311 (311) T ss_pred cchhhhhcCcEEEEEEEEeccEeecccceEEEEee-------ccC Confidence 1 221 1334 467775 4666 4432211 111 No 38 >protein:vir:96762 Length: 632 # NCBI annotation: putative phage-related protein # Family: family:all:21 # MgeID: mge:1628 # MgeName: VP882 # Cross-refs: genbank:acc:YP_001039818;genbank:gi:126010917;genbank:GeneID:5076272 Probab=85.26 E-value=0.054 Score=27.53 Aligned_cols=331 Identities=11% Similarity=0.103 Sum_probs=120.4 Q ss_pred Ccchhhh-----------hhhhcccc---------------ccccccccch-----hhhhhhhhhhhHHHHHHhcccccc Q lcl|Aclame:pro 1 MNLTEKW-----------KDLLEAEG---------------ADMPEIATAT-----KQKIMSKIFENQDRDINNDPMYRD 49 (514) Q Consensus 1 ~~l~~kw-----------~p~l~~~~---------------~~~~~i~~~~-----~~~~~~~~~enq~~~~~~~~~~~~ 49 (514) ..+..+| +-+||+-- ...|++.+.. +|....--|.+..+.+..... .. T Consensus 245 ~~~~~~ai~~g~sld~~ra~~ld~l~~~~~a~~~~~~a~~~~~~~~~~~~~~i~~~~re~~~~~l~rai~a~a~~~~-~~ 323 (632) T protein:vir:96 245 RSLAQEAIQKGHTVDQFRALVLERMNPGQPGNFEKPGAGDLPGKPAIHSARDLGIQHKELQQYSLMRAINAAATGDW-SK 323 (632) T ss_pred hhhHHHHHhccccHHHHHHHHHHHHhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHHHHhhhccch-hh Confidence 1111111 11111100 0011111110 011100001111111100000 00 Q ss_pred --------hhhhhhhccccccccccccccc-ccc-cccc-ccccccccccceeee--hhhhhhhhhhhhcceeEEecCCc Q lcl|Aclame:pro 50 --------PQLVEAFNAGLNEAVVNGDHGY-DPA-NIAQ-GVTTGAVTNIGPTVM--GMVRRAIPQLIAFDIAGVQPMTG 116 (514) Q Consensus 50 --------~~~~~~~~~~~~~a~~~~~~g~-~~~-~~~~-st~tg~v~~~~P~l~--~l~Rra~~~LIa~DI~GVQPmTg 116 (514) ..+.+..|...... ..+.. ... .+.. ++++|...-- |.++ .++....++.|...+ |++.+++ T Consensus 324 a~~~~e~a~~~a~~~G~~arg~---~~~~~~l~~ra~~~~t~~~gg~lvp-~~~~~~~iie~lr~~s~i~~l-~~~~~~~ 398 (632) T protein:vir:96 324 AGFEREVSLAIADASGKEARGF---YMPHEVLVQRQLEKKTAGKGGELVA-TELLSEEFIDILRNKAIIGQM-GARMLPG 398 (632) T ss_pred hhhhhHHHHHHHHhhhhhhhhh---hhhHHHHHHhhhhcccccccccccc-cccchHHHHHHHhhcchhhhh-cceEeec Confidence 00000100000000 00000 000 0000 0111110000 1111 122222345555554 5555544 Q ss_pred ccceeeeeeeeecCCCCccccccccccCCCCccCcccccccccccccccccccccccccccccccccccccccccccccc Q lcl|Aclame:pro 117 PTSQVFTLRSVYGKDPLTGAEAFHPTRQADASFSGQAAASTIADFPTTGAATDGTPYKAEVTTSGGDVSMRYFLALGAVT 196 (514) Q Consensus 117 PTGLIFAMRSrY~~~~~tg~EA~~~~nEadt~fSG~~~~~~~~~~~~~~~~t~~~~~~~~~~~~~g~~~~~~~~~~~~~~ 196 (514) .+|-+ +++.+. ++.++-| T Consensus 399 ~~g~~-----~ip~~~-~~~~a~w-------------------------------------------------------- 416 (632) T protein:vir:96 399 LVGDV-----DIPKKT-SGANFYW-------------------------------------------------------- 416 (632) T ss_pred CCcce-----EEEEEe-CCceeEe-------------------------------------------------------- Confidence 44421 111110 0000000 Q ss_pred ccccCccccccccccccccccccccccccchhhhccccCCCCCCcccccceeEEEEEEEEeecccccccccHHHHHHHHh Q lcl|Aclame:pro 197 LAVAGQMTATEYTDGVAGGLLVEIDAGMATSQAELQENFNGSSNNEWNEMSFRIDKQVVEAKSRQLKAQYSIELAQDLRA 276 (514) Q Consensus 197 ~~~~~~~~~~~~~~~~a~~~~y~~~~GmtTa~aEal~~~ggs~~~~f~EMsFsIEK~tVtAKSRaLKAEYT~ELAQDLkA 276 (514) + +| +...++-..+++++++.+|.=+-...+|-||..| T Consensus 417 -----------------------v--------~E---------~~~~~~s~~~f~~i~l~~~k~~~~v~iS~ell~d--- 453 (632) T protein:vir:96 417 -----------------------I--------GE---------DEDVQDSDFDFTTLSFSPKTIAGAVPVTRKLRKQ--- 453 (632) T ss_pred -----------------------e--------cC---------CccccccccceeeEEeeeeEEEEehhhHHHHHhc--- Confidence 0 01 1123444466777777787777777889998776 Q ss_pred hcCCChhHHHHHHHHHHHHHHhhHHHHHHHhhheeecccccccccCCcceecccccc----ccccchhhHHHHHHHHHHH Q lcl|Aclame:pro 277 VHGLDADAELSGILANEVMVELNREIVNLVNSQAQIGKSGWTQGAGAAGVFDFSDAV----DVKGARWAGEAYKALLIQI 352 (514) Q Consensus 277 iHGLDAEaELanILStEImlEINReii~~l~~~a~v~~~~~~~~v~~~g~~dl~~~~----d~~~~rwa~e~~r~L~~~i 352 (514) -.+|.|++|.+-|...|...+++.+|. -..+ .. .+.|++...... ...+..| +....|...| T Consensus 454 -s~~~~~~~i~~~l~~a~~~~~d~a~l~---G~G~---~~-----~p~Gi~~~~~~~~~~~~~~~~~~--~~i~~~~~~i 519 (632) T protein:vir:96 454 -SSIHVENLIREDLIEGIGVALDLAMLT---GTGL---AN-----DPVGLLNMTGVPALTYPAGGVDW--ASVVDMETKI 519 (632) T ss_pred -cchHHHHHHHHHHHHHHHHHHHHHhhc---ccCC---CC-----ccceeeecccccceecccccCCH--HHHHHHHHHH Confidence 257889999999999999999999942 2111 01 123443322111 0111111 2333343333 Q ss_pred HHHHHHHHHhcccccccEEEEChhHHhHHhhccccccchhccccCccccccccCceEE--EEecCceEEEecCCCccceE Q lcl|Aclame:pro 353 EKEANEIGRQTGRGNGNFIIASRNVVSALSMTDTLVGPAAQGMQDGSMNTDTNQTVFA--GVLGGRFKVYIDQYAVNDYF 430 (514) Q Consensus 353 ~~~a~~I~~~T~r~~~n~~v~S~~va~~L~~~g~~~~~~~~~~~~~~~~~d~~~~~~~--G~l~~~~~vy~D~y~~~dy~ 430 (514) ... -........||+|.....|......+ .++..-+ |+|. ||+|++.++.+.+-+ T Consensus 520 ~~~-------~~~~~~~~~~~~~~~~~~l~~~~l~d---------------~~G~~i~~~~~l~-G~pv~~s~~ip~~~~ 576 (632) T protein:vir:96 520 STF-------NADAGRLAYLTSVTQRGAAKKAQVFD---------------NTGERIWQNNEVN-GYRAEASNQIPADTW 576 (632) T ss_pred hhc-------ccccCccEEEEchhHHHHHHHHhccC---------------CCCceeecCCeec-ccceEeccccccCcE Confidence 222 12233445688988877776532221 1111111 4554 679999988876544 Q ss_pred EEEEecCCCcccceeeccccccccccccCC----ccccceeeeeeeeee-eecC--ccccccCc Q lcl|Aclame:pro 431 TVGFKGSTEMDAGVFYSPYVPLTPLRGSDS----KNFQPVIGFKTRYGV-QVNP--FADPTASA 487 (514) Q Consensus 431 ~vG~kG~~~~~~~~fy~PYv~~~~~~~~dp----~s~qp~~~~~tRY~l-~~nP--f~~~~~~~ 487 (514) ++|--. -+|+.-+-.+. -.+|| .+-+=.+=...|+++ +.+| |...+... T Consensus 577 ~~gd~s------~~~i~~~~~~~--i~~~~~~~~~~~~v~~~~~~~~d~~v~~~~af~~~k~~A 632 (632) T protein:vir:96 577 IFGDWS------QIVIAMWGVLD--LKVDPYTKAASDGLVLRVFQDVDAGVRRKEAFCIAKKGA 632 (632) T ss_pred EEeecc------eEEEEEecceE--EEEccccccccCceEEEEEeecCceeechhhhhheeecC Confidence 443210 01111000000 01222 223333344556655 3444 33322211 No 39 >protein:vir:7771 Length: 330 # NCBI annotation: gp17 # Family: family:all:507 # MgeID: mge:149 # MgeName: Bxz2 # Cross-refs: genbank:acc:NP_817605;genbank:gi:29566035;genbank:GeneID:1259229 Probab=85.07 E-value=0.056 Score=27.46 Aligned_cols=299 Identities=10% Similarity=0.015 Sum_probs=101.1 Q ss_pred cCccccccccccccccccccccccccccccccccccccccccccccccccccCccccccccccccccccccccccccchh Q lcl|Aclame:pro 149 FSGQAAASTIADFPTTGAATDGTPYKAEVTTSGGDVSMRYFLALGAVTLAVAGQMTATEYTDGVAGGLLVEIDAGMATSQ 228 (514) Q Consensus 149 fSG~~~~~~~~~~~~~~~~t~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~y~~~~GmtTa~ 228 (514) -.|......... .+...++. ........+... ... .......... .... .....+..- +.. T Consensus 1 m~~~~~~a~~~~---~t~~~g~~----i~~~~~~~ii~~-~~~----~s~l~~~~~~-~~~~--~~~~~~p~~----~~~ 61 (330) T protein:vir:77 1 MAGSTVPSTQVA---LTGDFSAF----LTPEQSQDYFAE-IEK----TSIVQRIARK-VPMG--PTGISIPHW----TGA 61 (330) T ss_pred Ccccccchhhcc---ccCCCcce----echhHHHHHHHH-HHh----ccchhhhcce-eecc--CCceEEEEE----cCC Confidence 011111100000 00000000 000000000000 000 0000000000 0000 000001100 011 Q ss_pred hhccccCCCCCCcccccceeEEEEEEEEeecccccccccHHHHHHHHhhcCCChhHHHHHHHHHHHHHHhhHHHHHHHhh Q lcl|Aclame:pro 229 AELQENFNGSSNNEWNEMSFRIDKQVVEAKSRQLKAQYSIELAQDLRAVHGLDADAELSGILANEVMVELNREIVNLVNS 308 (514) Q Consensus 229 aEal~~~ggs~~~~f~EMsFsIEK~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAEaELanILStEImlEINReii~~l~~ 308 (514) .++. + -..+.++++-..+++++++..|..+-+..+|-||.+|- ..|.|++|.+-|+..|...||+.||.- T Consensus 62 ~~a~--~-v~Eg~~~~~~~~~f~~i~~~~~k~~~~~~is~ell~ds----~~~~~~~i~~~l~~ai~~~~~~~~l~G--- 131 (330) T protein:vir:77 62 VSAS--W-TGEAERKPITKGSFGKQELEPVKITTIFAESAEVVRLN----PLNYLNTMRTKIAEAIALKFDAAAIHG--- 131 (330) T ss_pred ccee--E-ecCCCccccccceeeEEEEeEEEEEEeehhhHHHHhcc----hHHHHHHHHHHHHHHHHHHHHHHhhcc--- Confidence 1110 0 11245677878888999999998888889999999984 467899999999999999999999521 Q ss_pred heeec-ccccccccCCcceeccccccccccchhhHHHHHHHHHHHHHHHHHHHHhcccccccEEEEChhHHhHHhhcccc Q lcl|Aclame:pro 309 QAQIG-KSGWTQGAGAAGVFDFSDAVDVKGARWAGEAYKALLIQIEKEANEIGRQTGRGNGNFIIASRNVVSALSMTDTL 387 (514) Q Consensus 309 ~a~v~-~~~~~~~v~~~g~~dl~~~~d~~~~rwa~e~~r~L~~~i~~~a~~I~~~T~r~~~n~~v~S~~va~~L~~~g~~ 387 (514) .-+-. -.+..+.+.......-. ......- ....++..+.++-..+.+. ....+.+||+|+....|....-- T Consensus 132 ~g~~~~~~g~~~~~~~~~~~~~~--~~~~~~~----~~~~~~~~l~~~~~~~~~~--~~~~~~~vmn~~~~~~l~~lkd~ 203 (330) T protein:vir:77 132 IDKPSAFKGYLAETTKVVSLADT--NLTTASG----PQGNAYLAVNNALSLLVNS--GKKWTGTLLDNVTEPILNTAVDG 203 (330) T ss_pred cCCCCccccccccccccceeecc--ccccccc----ccchhHHHHHHHHHhhhhc--CCCccEEEEcHHHHHHHHHHhcc Confidence 10000 00000000000000000 0000000 1122233334443434322 34556789999999888742100 Q ss_pred ccchhccccCccccccccCceEEEEecCceEEEecCCCccc--------------eEEEEEecCCCc----ccceeeccc Q lcl|Aclame:pro 388 VGPAAQGMQDGSMNTDTNQTVFAGVLGGRFKVYIDQYAVND--------------YFTVGFKGSTEM----DAGVFYSPY 449 (514) Q Consensus 388 ~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~d--------------y~~vG~kG~~~~----~~~~fy~PY 449 (514) ...+.- ..............++|.| ++||++.+.+.+ ++++|-.+..+. ++.+.+. T Consensus 204 ~G~~l~---~~~~~~~~~~~~~~~~l~G-~PV~~~~~~p~~~~~~~~~~~~gd~s~~~i~~~~~~~i~~~~e~~~~~~-- 277 (330) T protein:vir:77 204 NGRPLF---VESTYTEQVGAIREGRILG-RPTYVADNVVNGTVGNRVVGVMGDFSQVIWGQIGGLSFDVTDQATLDFG-- 277 (330) T ss_pred CCceee---cCccccccccccCCceecc-eeeEEeccccCCCCCCccEEEEEecceEEEEEecCcEEEEeecceeeec-- Confidence 000000 0000000000111134444 688888876531 111121111110 0000000 Q ss_pred ccccc-c-cccCCcccc-ceee--eeeeeeeeecCccccccCcceeecCcchhhhccccceeeeeeeecC Q lcl|Aclame:pro 450 VPLTP-L-RGSDSKNFQ-PVIG--FKTRYGVQVNPFADPTASATKVGNGAPVAASMGKNAYFRRVFVKGL 514 (514) Q Consensus 450 v~~~~-~-~~~dp~s~q-p~~~--~~tRY~l~~nPf~~~~~~~~~i~~~~~~~~~~~~~~~~r~~~V~~~ 514 (514) .... . ...+-+-|+ -.++ ...|++..+ . .-.-+++|.+|.= T Consensus 278 -~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~v--~---------------------~~~a~~~i~~~~~ 323 (330) T protein:vir:77 278 -EEQGGVWVPKLISLWQHNMVAVRCEAEFAFMV--N---------------------DKDAFVKLTDQVA 323 (330) T ss_pred -ccccccccccccchhhcCcEEEEEEEEeccEE--e---------------------cccceEEEEeccC Confidence 0000 0 000000011 0111 112333221 0 0011112211111 No 40 >protein:vir:97433 Length: 274 # NCBI annotation: ORF014 # Family: family:all:522 # MgeID: mge:1676 # MgeName: 92 # Cross-refs: genbank:acc:YP_240749;genbank:gi:66396420;genbank:GeneID:5133789 Probab=84.90 E-value=0.057 Score=27.41 Aligned_cols=270 Identities=10% Similarity=-0.024 Sum_probs=111.2 Q ss_pred CccccccccccCCCCccCccccccccccccccccccccccccccccccccccccccccccccccccccCccccccccccc Q lcl|Aclame:pro 133 LTGAEAFHPTRQADASFSGQAAASTIADFPTTGAATDGTPYKAEVTTSGGDVSMRYFLALGAVTLAVAGQMTATEYTDGV 212 (514) Q Consensus 133 ~tg~EA~~~~nEadt~fSG~~~~~~~~~~~~~~~~t~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 212 (514) +. ..-|..+- -..+..... .+.... .............. .+. .. T Consensus 1 ma---------~~~T~~~d---------------~iiPev~~~-------~v~~~~--~~~l~~~~~~~~d~--~l~-g~ 44 (274) T protein:vir:97 1 MP---------QGLTKTSD---------------QIIPEVLAP-------MMQAQL--EKKLRFASFAEVDS--TLQ-GQ 44 (274) T ss_pred CC---------ccceehhh---------------eechHHHHH-------HHHHhh--hhhhhhcccceecc--ccc-CC Confidence 10 00000000 000000000 000000 00000000000000 000 00 Q ss_pred cccccccccccccchhhhccccCCCCCCcccccceeEEEEEEEEeecccccccccHHHHHHHHhhcCCChhHHHHHHHHH Q lcl|Aclame:pro 213 AGGLLVEIDAGMATSQAELQENFNGSSNNEWNEMSFRIDKQVVEAKSRQLKAQYSIELAQDLRAVHGLDADAELSGILAN 292 (514) Q Consensus 213 a~~~~y~~~~GmtTa~aEal~~~ggs~~~~f~EMsFsIEK~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAEaELanILSt 292 (514) .+.+.+++.--.+..+|.. .....-+..++. ..+.+++.+-|+ |+ |.+.=-..+.+ +-|.-.|..+-++. T Consensus 45 -~G~tv~iP~~~~~g~a~~~---~~g~~i~~~~lt--~~~~~~~i~~~~-~~-~~i~D~~~~~~--~~dp~~~~~~~~a~ 114 (274) T protein:vir:97 45 -PGDTLTFPAFVYSGDAQVV---AEGEKIPTDILE--TKKREAKIRKIA-KG-TSITDEALLSG--YGDPQGEQVRQHGL 114 (274) T ss_pred -CCCEEEEeeecCCCccccc---cCCCcccccccc--cceeEEEeeeec-ce-ecccHHHHHhc--cchHHHHHHHHHHH Confidence 1222222211112223322 111223344443 334444445555 32 32222223333 46788899999999 Q ss_pred HHHHHhhHHHHHHHhhheeecccccccccCCcceeccccccccccchhhHHHHHHHHHHHHHHHHHHHHhcccccccEEE Q lcl|Aclame:pro 293 EVMVELNREIVNLVNSQAQIGKSGWTQGAGAAGVFDFSDAVDVKGARWAGEAYKALLIQIEKEANEIGRQTGRGNGNFII 372 (514) Q Consensus 293 EImlEINReii~~l~~~a~v~~~~~~~~v~~~g~~dl~~~~d~~~~rwa~e~~r~L~~~i~~~a~~I~~~T~r~~~n~~v 372 (514) -|..+++.+++..+...... + +...++ .+-+-.+..++.++. ..+.+++ T Consensus 115 a~a~~vd~~~~~~l~~a~~~--------~-~~~~~~-------------~d~i~dA~~~l~d~~---------~~~~~iv 163 (274) T protein:vir:97 115 AHANKVDNDVLEALMGAKLT--------V-NADITK-------------LNGLQSAIDKFNDED---------LEPMVLF 163 (274) T ss_pred HHHHHHHHHHHHHHhccCcc--------c-cccccC-------------HHHHHHHHHHhhccC---------CCceEEE Confidence 99999999998777443211 1 111121 233444444444332 3678999 Q ss_pred EChhHHhHHhhccccccchhccccCccccccccCceEEEEecCceEEEecCCCccceEEEEEecCCCcccceeecccccc Q lcl|Aclame:pro 373 ASRNVVSALSMTDTLVGPAAQGMQDGSMNTDTNQTVFAGVLGGRFKVYIDQYAVNDYFTVGFKGSTEMDAGVFYSPYVPL 452 (514) Q Consensus 373 ~S~~va~~L~~~g~~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~kG~~~~~~~~fy~PYv~~ 452 (514) |+|.|++.|.......+........ ....+-..|.+. |++||+|+..|..-..+--+| .+-|.---+. T Consensus 164 v~p~~~~~L~k~~~~~f~~~s~~g~-----~~~~~G~ig~~~-G~~Vi~s~~~p~~t~~l~~~g------A~~~~~~~~~ 231 (274) T protein:vir:97 164 VNPLDAGKLRGDASTNFTRATELGD-----DIIVKGAFGEAL-GAIIVRTNKLEAGTAILAKKG------AVKLILKRDF 231 (274) T ss_pred eCHHHHHHHHhhhhhhccccCcccc-----cceeccccceec-CeeEEEcCCCCcceEEEEeCc------ceEeeecCCc Confidence 9999999888643322211111100 001111125554 679999999885432221122 2222101111 Q ss_pred cccc-ccCCccccceeeeeeeeee-eecC--ccccccCcceeecCcch Q lcl|Aclame:pro 453 TPLR-GSDSKNFQPVIGFKTRYGV-QVNP--FADPTASATKVGNGAPV 496 (514) Q Consensus 453 ~~~~-~~dp~s~qp~~~~~tRY~l-~~nP--f~~~~~~~~~i~~~~~~ 496 (514) . ++ ..|+..+.=.+-..-+||+ ..|| ....+-. .-+..| T Consensus 232 ~-vE~~Rd~~~~~d~i~~~~~y~~~~~~~~~vv~~t~~----~~~~~~ 274 (274) T protein:vir:97 232 F-LEVARDASTKTTALYSDKHYVAYLYDESKAVKITKG----SGSLEM 274 (274) T ss_pred e-eccccchhhcccEEEEEEEEEEEEEcCCceEEEecC----cccccC Confidence 1 23 3699999999999999998 4565 1111111 001111 No 41 >protein:vir:94494 Length: 274 # NCBI annotation: ORF015 # Family: family:all:522 # MgeID: mge:1508 # MgeName: 88 # Cross-refs: genbank:acc:YP_240676;genbank:gi:66396348;genbank:GeneID:5133758 Probab=84.90 E-value=0.057 Score=27.41 Aligned_cols=270 Identities=10% Similarity=-0.024 Sum_probs=111.2 Q ss_pred CccccccccccCCCCccCccccccccccccccccccccccccccccccccccccccccccccccccccCccccccccccc Q lcl|Aclame:pro 133 LTGAEAFHPTRQADASFSGQAAASTIADFPTTGAATDGTPYKAEVTTSGGDVSMRYFLALGAVTLAVAGQMTATEYTDGV 212 (514) Q Consensus 133 ~tg~EA~~~~nEadt~fSG~~~~~~~~~~~~~~~~t~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 212 (514) +. ..-|..+- -..+..... .+.... .............. .+. .. T Consensus 1 ma---------~~~T~~~d---------------~iiPev~~~-------~v~~~~--~~~l~~~~~~~~d~--~l~-g~ 44 (274) T protein:vir:94 1 MP---------QGLTKTSD---------------QIIPEVLAP-------MMQAQL--EKKLRFASFAEVDS--TLQ-GQ 44 (274) T ss_pred CC---------ccceehhh---------------eechHHHHH-------HHHHhh--hhhhhhcccceecc--ccc-CC Confidence 10 00000000 000000000 000000 00000000000000 000 00 Q ss_pred cccccccccccccchhhhccccCCCCCCcccccceeEEEEEEEEeecccccccccHHHHHHHHhhcCCChhHHHHHHHHH Q lcl|Aclame:pro 213 AGGLLVEIDAGMATSQAELQENFNGSSNNEWNEMSFRIDKQVVEAKSRQLKAQYSIELAQDLRAVHGLDADAELSGILAN 292 (514) Q Consensus 213 a~~~~y~~~~GmtTa~aEal~~~ggs~~~~f~EMsFsIEK~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAEaELanILSt 292 (514) .+.+.+++.--.+..+|.. .....-+..++. ..+.+++.+-|+ |+ |.+.=-..+.+ +-|.-.|..+-++. T Consensus 45 -~G~tv~iP~~~~~g~a~~~---~~g~~i~~~~lt--~~~~~~~i~~~~-~~-~~i~D~~~~~~--~~dp~~~~~~~~a~ 114 (274) T protein:vir:94 45 -PGDTLTFPAFVYSGDAQVV---AEGEKIPTDILE--TKKREAKIRKIA-KG-TSITDEALLSG--YGDPQGEQVRQHGL 114 (274) T ss_pred -CCCEEEEeeecCCCccccc---cCCCcccccccc--cceeEEEeeeec-ce-ecccHHHHHhc--cchHHHHHHHHHHH Confidence 1222222211112223322 111223344443 334444445555 32 32222223333 46788899999999 Q ss_pred HHHHHhhHHHHHHHhhheeecccccccccCCcceeccccccccccchhhHHHHHHHHHHHHHHHHHHHHhcccccccEEE Q lcl|Aclame:pro 293 EVMVELNREIVNLVNSQAQIGKSGWTQGAGAAGVFDFSDAVDVKGARWAGEAYKALLIQIEKEANEIGRQTGRGNGNFII 372 (514) Q Consensus 293 EImlEINReii~~l~~~a~v~~~~~~~~v~~~g~~dl~~~~d~~~~rwa~e~~r~L~~~i~~~a~~I~~~T~r~~~n~~v 372 (514) -|..+++.+++..+...... + +...++ .+-+-.+..++.++. ..+.+++ T Consensus 115 a~a~~vd~~~~~~l~~a~~~--------~-~~~~~~-------------~d~i~dA~~~l~d~~---------~~~~~iv 163 (274) T protein:vir:94 115 AHANKVDNDVLEALMGAKLT--------V-NADITK-------------LNGLQSAIDKFNDED---------LEPMVLF 163 (274) T ss_pred HHHHHHHHHHHHHHhccCcc--------c-cccccC-------------HHHHHHHHHHhhccC---------CCceEEE Confidence 99999999998777443211 1 111121 233444444444332 3678999 Q ss_pred EChhHHhHHhhccccccchhccccCccccccccCceEEEEecCceEEEecCCCccceEEEEEecCCCcccceeecccccc Q lcl|Aclame:pro 373 ASRNVVSALSMTDTLVGPAAQGMQDGSMNTDTNQTVFAGVLGGRFKVYIDQYAVNDYFTVGFKGSTEMDAGVFYSPYVPL 452 (514) Q Consensus 373 ~S~~va~~L~~~g~~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~kG~~~~~~~~fy~PYv~~ 452 (514) |+|.|++.|.......+........ ....+-..|.+. |++||+|+..|..-..+--+| .+-|.---+. T Consensus 164 v~p~~~~~L~k~~~~~f~~~s~~g~-----~~~~~G~ig~~~-G~~Vi~s~~~p~~t~~l~~~g------A~~~~~~~~~ 231 (274) T protein:vir:94 164 VNPLDAGKLRGDASTNFTRATELGD-----DIIVKGAFGEAL-GAIIVRTNKLEAGTAILAKKG------AVKLILKRDF 231 (274) T ss_pred eCHHHHHHHHhhhhhhccccCcccc-----cceeccccceec-CeeEEEcCCCCcceEEEEeCc------ceEeeecCCc Confidence 9999999888643322211111100 001111125554 679999999885432221122 2222101111 Q ss_pred cccc-ccCCccccceeeeeeeeee-eecC--ccccccCcceeecCcch Q lcl|Aclame:pro 453 TPLR-GSDSKNFQPVIGFKTRYGV-QVNP--FADPTASATKVGNGAPV 496 (514) Q Consensus 453 ~~~~-~~dp~s~qp~~~~~tRY~l-~~nP--f~~~~~~~~~i~~~~~~ 496 (514) . ++ ..|+..+.=.+-..-+||+ ..|| ....+-. .-+..| T Consensus 232 ~-vE~~Rd~~~~~d~i~~~~~y~~~~~~~~~vv~~t~~----~~~~~~ 274 (274) T protein:vir:94 232 F-LEVARDASTKTTALYSDKHYVAYLYDESKAVKITKG----SGSLEM 274 (274) T ss_pred e-eccccchhhcccEEEEEEEEEEEEEcCCceEEEecC----cccccC Confidence 1 23 3699999999999999998 4565 1111111 001111 No 42 >protein:vir:96262 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1612 # MgeName: ROSA # Cross-refs: genbank:acc:YP_240311;genbank:gi:66395978;genbank:GeneID:5133339 Probab=84.01 E-value=0.064 Score=27.13 Aligned_cols=267 Identities=12% Similarity=0.014 Sum_probs=112.3 Q ss_pred ccccccccccccccccccccccccccccccccccccc---ccccccccccCccccccccccccccccccccccccchhhh Q lcl|Aclame:pro 154 AASTIADFPTTGAATDGTPYKAEVTTSGGDVSMRYFL---ALGAVTLAVAGQMTATEYTDGVAGGLLVEIDAGMATSQAE 230 (514) Q Consensus 154 ~~~~~~~~~~~~~~t~~~~~~~~~~~~~g~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~a~~~~y~~~~GmtTa~aE 230 (514) .++.. +...+. ....++..+.. ............. ..+. +. .+.+.+++.--....+| T Consensus 1 m~~~~--------T~l~d~-------i~Pev~~~~v~~~~~~~l~~~~~~~~~--~~l~-g~-~G~tv~iP~~~~ig~a~ 61 (274) T protein:vir:96 1 MAQGM--------TKLTNQ-------IVPEVLAPMMQAELEKKLRFASFAEID--NTLV-GQ-PGDTLTFPAFIYSGDAK 61 (274) T ss_pred CCcce--------eehhhe-------echHHHHHHHHHHHHhhhhccccceec--cccc-CC-CCCEEEeeeecCCCccc Confidence 00000 000000 00000000000 0000000000000 0000 00 12222232211222333 Q ss_pred ccccCCCCCCcccccceeEEEEEEEEeecccccccccHHHHHHHHhhcC-CChhHHHHHHHHHHHHHHhhHHHHHHHhhh Q lcl|Aclame:pro 231 LQENFNGSSNNEWNEMSFRIDKQVVEAKSRQLKAQYSIELAQDLRAVHG-LDADAELSGILANEVMVELNREIVNLVNSQ 309 (514) Q Consensus 231 al~~~ggs~~~~f~EMsFsIEK~tVtAKSRaLKAEYT~ELAQDLkAiHG-LDAEaELanILStEImlEINReii~~l~~~ 309 (514) .. .....-...++.. .+.+++.+-|. |+ |.+. |+-+..+ -|.-.|..+-++..+..+++++++..+... T Consensus 62 ~~---~~g~~i~~~~lt~--~~~~~~i~~~~-~a-~~i~---D~~~~~~~~d~~~~~~~~~~~~~a~~vd~~i~~~l~~a 131 (274) T protein:vir:96 62 VV---AEGEKIPTDILET--KKREAKIRKIA-KG-TSIS---DEALLSGYGDPQGEQVRQHGLAHANKVDDDVLEALKSA 131 (274) T ss_pred cc---cCCCccchhhccc--ceeEEEeeeee-cc-eeeh---HHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHHhcc Confidence 22 1112233444433 33334434443 22 2222 5555553 588899999999999999999998777331 Q ss_pred eeecccccccccCCcceeccccccccccchhhHHHHHHHHHHHHHHHHHHHHhcccccccEEEEChhHHhHHhhcccccc Q lcl|Aclame:pro 310 AQIGKSGWTQGAGAAGVFDFSDAVDVKGARWAGEAYKALLIQIEKEANEIGRQTGRGNGNFIIASRNVVSALSMTDTLVG 389 (514) Q Consensus 310 a~v~~~~~~~~v~~~g~~dl~~~~d~~~~rwa~e~~r~L~~~i~~~a~~I~~~T~r~~~n~~v~S~~va~~L~~~g~~~~ 389 (514) . ..+. ...++ .+.+-....++.++. ..++++||+|+|++.|.......+ T Consensus 132 ~--------~~~~-~~~~~-------------~d~i~~A~~~lgd~~---------~~~~~ivv~p~~~~~L~k~~~~~f 180 (274) T protein:vir:96 132 K--------LTVE-ADITK-------------LTGLQTAIDKFNDED---------LEPMVLFISPLDAGKLRGDATTNF 180 (274) T ss_pred c--------cccc-ccccC-------------HHHHHHHHHHhcccc---------ccccEEEeCHHHHHHHHhhccccc Confidence 1 0111 11121 223333344444332 357899999999999987543332 Q ss_pred chhccccCccccccccCceEEEEecCceEEEecCCCccce-EEEEEecCCCcccceeecccccccccc-ccCCcccccee Q lcl|Aclame:pro 390 PAAQGMQDGSMNTDTNQTVFAGVLGGRFKVYIDQYAVNDY-FTVGFKGSTEMDAGVFYSPYVPLTPLR-GSDSKNFQPVI 467 (514) Q Consensus 390 ~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy-~~vG~kG~~~~~~~~fy~PYv~~~~~~-~~dp~s~qp~~ 467 (514) ........ ....+-..|.+. |++||+|...+..- +++| +|. -.||.. -+. .++ ..||++++=.+ T Consensus 181 ~~~s~~g~-----~~~~~G~ig~~~-G~~Vi~s~~~~~~t~~l~~-~gA-----~~~~~~-~~~-~vE~~Rd~~~~~d~i 246 (274) T protein:vir:96 181 TRATELGD-----DVIVKGAFGEAL-GAVIVRSNKLEAGTAILAK-KGA-----VKLITK-RDF-FLETDRDPSTKTTAL 246 (274) T ss_pred cccccccc-----cceeccccceec-CeEEEEeCCCCCceEEEEe-ccc-----eeeeec-CCc-ccccccccccccCEE Confidence 21111000 011111125554 68999999877432 2222 221 122221 111 123 36999999999 Q ss_pred eeeeeeee-eecCccccccCcceeecCcchhhhc Q lcl|Aclame:pro 468 GFKTRYGV-QVNPFADPTASATKVGNGAPVAASM 500 (514) Q Consensus 468 ~~~tRY~l-~~nPf~~~~~~~~~i~~~~~~~~~~ 500 (514) -..-+||+ ..||=. -.++.. .-|.+-. T Consensus 247 ~~~~~y~~~~~~~~~-----~v~~tk-~~~~~~~ 274 (274) T protein:vir:96 247 YSDKHYVAYLYDESK-----AVKITK-GSGSLEM 274 (274) T ss_pred EEeEEEEEEEEcCCc-----EEEEEc-CCccccC Confidence 99999998 456611 011111 1111111 No 43 >protein:vir:95898 Length: 274 # NCBI annotation: ORF014 # Family: family:all:522 # MgeID: mge:1588 # MgeName: 71 # Cross-refs: genbank:acc:YP_240385;genbank:gi:66396054;genbank:GeneID:5133409 Probab=84.01 E-value=0.064 Score=27.13 Aligned_cols=267 Identities=12% Similarity=0.014 Sum_probs=112.3 Q ss_pred ccccccccccccccccccccccccccccccccccccc---ccccccccccCccccccccccccccccccccccccchhhh Q lcl|Aclame:pro 154 AASTIADFPTTGAATDGTPYKAEVTTSGGDVSMRYFL---ALGAVTLAVAGQMTATEYTDGVAGGLLVEIDAGMATSQAE 230 (514) Q Consensus 154 ~~~~~~~~~~~~~~t~~~~~~~~~~~~~g~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~a~~~~y~~~~GmtTa~aE 230 (514) .++.. +...+. ....++..+.. ............. ..+. +. .+.+.+++.--....+| T Consensus 1 m~~~~--------T~l~d~-------i~Pev~~~~v~~~~~~~l~~~~~~~~~--~~l~-g~-~G~tv~iP~~~~ig~a~ 61 (274) T protein:vir:95 1 MAQGM--------TKLTNQ-------IVPEVLAPMMQAELEKKLRFASFAEID--NTLV-GQ-PGDTLTFPAFIYSGDAK 61 (274) T ss_pred CCcce--------eehhhe-------echHHHHHHHHHHHHhhhhccccceec--cccc-CC-CCCEEEeeeecCCCccc Confidence 00000 000000 00000000000 0000000000000 0000 00 12222232211222333 Q ss_pred ccccCCCCCCcccccceeEEEEEEEEeecccccccccHHHHHHHHhhcC-CChhHHHHHHHHHHHHHHhhHHHHHHHhhh Q lcl|Aclame:pro 231 LQENFNGSSNNEWNEMSFRIDKQVVEAKSRQLKAQYSIELAQDLRAVHG-LDADAELSGILANEVMVELNREIVNLVNSQ 309 (514) Q Consensus 231 al~~~ggs~~~~f~EMsFsIEK~tVtAKSRaLKAEYT~ELAQDLkAiHG-LDAEaELanILStEImlEINReii~~l~~~ 309 (514) .. .....-...++.. .+.+++.+-|. |+ |.+. |+-+..+ -|.-.|..+-++..+..+++++++..+... T Consensus 62 ~~---~~g~~i~~~~lt~--~~~~~~i~~~~-~a-~~i~---D~~~~~~~~d~~~~~~~~~~~~~a~~vd~~i~~~l~~a 131 (274) T protein:vir:95 62 VV---AEGEKIPTDILET--KKREAKIRKIA-KG-TSIS---DEALLSGYGDPQGEQVRQHGLAHANKVDDDVLEALKSA 131 (274) T ss_pred cc---cCCCccchhhccc--ceeEEEeeeee-cc-eeeh---HHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHHhcc Confidence 22 1112233444433 33334434443 22 2222 5555553 588899999999999999999998777331 Q ss_pred eeecccccccccCCcceeccccccccccchhhHHHHHHHHHHHHHHHHHHHHhcccccccEEEEChhHHhHHhhcccccc Q lcl|Aclame:pro 310 AQIGKSGWTQGAGAAGVFDFSDAVDVKGARWAGEAYKALLIQIEKEANEIGRQTGRGNGNFIIASRNVVSALSMTDTLVG 389 (514) Q Consensus 310 a~v~~~~~~~~v~~~g~~dl~~~~d~~~~rwa~e~~r~L~~~i~~~a~~I~~~T~r~~~n~~v~S~~va~~L~~~g~~~~ 389 (514) . ..+. ...++ .+.+-....++.++. ..++++||+|+|++.|.......+ T Consensus 132 ~--------~~~~-~~~~~-------------~d~i~~A~~~lgd~~---------~~~~~ivv~p~~~~~L~k~~~~~f 180 (274) T protein:vir:95 132 K--------LTVE-ADITK-------------LTGLQTAIDKFNDED---------LEPMVLFISPLDAGKLRGDATTNF 180 (274) T ss_pred c--------cccc-ccccC-------------HHHHHHHHHHhcccc---------ccccEEEeCHHHHHHHHhhccccc Confidence 1 0111 11121 223333344444332 357899999999999987543332 Q ss_pred chhccccCccccccccCceEEEEecCceEEEecCCCccce-EEEEEecCCCcccceeecccccccccc-ccCCcccccee Q lcl|Aclame:pro 390 PAAQGMQDGSMNTDTNQTVFAGVLGGRFKVYIDQYAVNDY-FTVGFKGSTEMDAGVFYSPYVPLTPLR-GSDSKNFQPVI 467 (514) Q Consensus 390 ~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy-~~vG~kG~~~~~~~~fy~PYv~~~~~~-~~dp~s~qp~~ 467 (514) ........ ....+-..|.+. |++||+|...+..- +++| +|. -.||.. -+. .++ ..||++++=.+ T Consensus 181 ~~~s~~g~-----~~~~~G~ig~~~-G~~Vi~s~~~~~~t~~l~~-~gA-----~~~~~~-~~~-~vE~~Rd~~~~~d~i 246 (274) T protein:vir:95 181 TRATELGD-----DVIVKGAFGEAL-GAVIVRSNKLEAGTAILAK-KGA-----VKLITK-RDF-FLETDRDPSTKTTAL 246 (274) T ss_pred cccccccc-----cceeccccceec-CeEEEEeCCCCCceEEEEe-ccc-----eeeeec-CCc-ccccccccccccCEE Confidence 21111000 011111125554 68999999877432 2222 221 122221 111 123 36999999999 Q ss_pred eeeeeeee-eecCccccccCcceeecCcchhhhc Q lcl|Aclame:pro 468 GFKTRYGV-QVNPFADPTASATKVGNGAPVAASM 500 (514) Q Consensus 468 ~~~tRY~l-~~nPf~~~~~~~~~i~~~~~~~~~~ 500 (514) -..-+||+ ..||=. -.++.. .-|.+-. T Consensus 247 ~~~~~y~~~~~~~~~-----~v~~tk-~~~~~~~ 274 (274) T protein:vir:95 247 YSDKHYVAYLYDESK-----AVKITK-GSGSLEM 274 (274) T ss_pred EEeEEEEEEEEcCCc-----EEEEEc-CCccccC Confidence 99999998 456611 011111 1111111 No 44 >protein:vir:81160 Length: 371 # NCBI annotation: major capsid protein # Family: family:all:21 # MgeID: mge:1892 # MgeName: Geobacillus virus E2 # Cross-refs: genbank:acc:YP_001285811;genbank:gi:148747732;genbank:GeneID:5247203 Probab=80.92 E-value=0.09 Score=26.31 Aligned_cols=321 Identities=16% Similarity=0.145 Sum_probs=125.2 Q ss_pred Ccc-----hhhhhhhhccccccccccccchhh-hhhhhhhhhHHHHHHhccccc-----chhhhhhhccccccccccccc Q lcl|Aclame:pro 1 MNL-----TEKWKDLLEAEGADMPEIATATKQ-KIMSKIFENQDRDINNDPMYR-----DPQLVEAFNAGLNEAVVNGDH 69 (514) Q Consensus 1 ~~l-----~~kw~p~l~~~~~~~~~i~~~~~~-~~~~~~~enq~~~~~~~~~~~-----~~~~~~~~~~~~~~a~~~~~~ 69 (514) .+| .|+|.-+.. ||....++ +....++|-+.+.+.....-. ......+|...+.. T Consensus 20 ~~~~~~~~~e~~~~~~~-------ei~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~------- 85 (371) T protein:vir:81 20 RKLLAENKIEEAKKLKE-------EIVALQEKFDVAKELYEEQKQTIEDKEPLKPTVQVKENEVEAFVNHIRT------- 85 (371) T ss_pred HHHhhHHHHHHHHHHHH-------HHHHHHHHHHHHHHHHHHHHHhhccccccccchhhHHHHHHHHHHHHHH------- Confidence 111 122322211 22221111 111122222222221111000 00011122111110 Q ss_pred ccccccccccc-cccc--cc-ccceeeehhhhhhhhhhhhcceeEEecCCcccceeeeeeeeecCCCCccccccccccCC Q lcl|Aclame:pro 70 GYDPANIAQGV-TTGA--VT-NIGPTVMGMVRRAIPQLIAFDIAGVQPMTGPTSQVFTLRSVYGKDPLTGAEAFHPTRQA 145 (514) Q Consensus 70 g~~~~~~~~st-~tg~--v~-~~~P~l~~l~Rra~~~LIa~DI~GVQPmTgPTGLIFAMRSrY~~~~~tg~EA~~~~nEa 145 (514) .....+..++ .+|. |+ .+.+ -+++.+.++.+-.+++.+.||++.++-+.-.+. .. +.++- T Consensus 86 -~~~~a~~~~t~~~gg~~vP~~~~~---~ii~~~~~~s~i~~~~~~~~~~~~~~~~~~~~~--~~----~~~a~------ 149 (371) T protein:vir:81 86 -RFRNAMSEGSNQDGGYTVPQDIQT---RINELRESKDALQNLITVEPVTTLSGSRVFKKR--SQ----QTGFV------ 149 (371) T ss_pred -HHHHhhccCCCccCceeecHhHHH---HHHHHHHhhhhhhhhceeeeccCCceeEEEEee--cC----Cccee------ Confidence 0000111111 1111 11 1122 245555677788899999999887765432221 11 11110 Q ss_pred CCccCccccccccccccccccccccccccccccccccccccccccccccccccccCcccccccccccccccccccccccc Q lcl|Aclame:pro 146 DASFSGQAAASTIADFPTTGAATDGTPYKAEVTTSGGDVSMRYFLALGAVTLAVAGQMTATEYTDGVAGGLLVEIDAGMA 225 (514) Q Consensus 146 dt~fSG~~~~~~~~~~~~~~~~t~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~y~~~~Gmt 225 (514) |-+ +|- T Consensus 150 ---~v~----------------------------------------------------------------------Eg~- 155 (371) T protein:vir:81 150 ---EVA----------------------------------------------------------------------EGA- 155 (371) T ss_pred ---eec----------------------------------------------------------------------ccc- Confidence 000 000 Q ss_pred chhhhccccCCCCCCcccccceeEEEEEEEEeecccccccccHHHHHHHHhhcCCChhHHHHHHHHHHHHHHhhHHHHHH Q lcl|Aclame:pro 226 TSQAELQENFNGSSNNEWNEMSFRIDKQVVEAKSRQLKAQYSIELAQDLRAVHGLDADAELSGILANEVMVELNREIVNL 305 (514) Q Consensus 226 Ta~aEal~~~ggs~~~~f~EMsFsIEK~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAEaELanILStEImlEINReii~~ 305 (514) +. ...+...|.+..++..|..+ ...+|-||.+|-. .|-++.|.+.|...|..-+|+.|+.- T Consensus 156 ----~~----~~~~~~~f~~i~~~~~k~~~-------~~~iS~ell~ds~----~~l~~~i~~~l~~a~~~~~~~~i~~g 216 (371) T protein:vir:81 156 ----AI----GEKATPQFTLLQYQVKKYAG-------FFRVTNELLNDST----EAIVNTLVRWIGDESRVTRNGLIINV 216 (371) T ss_pred ----cc----ccccccceeeEEeeeeEEEE-------eehhhHHHHhhhh----HHHHHHHHHHHHHHHHHHHHHHHHhh Confidence 00 00112235555555555554 4469999999853 45688999999999999999888553 Q ss_pred HhhheeecccccccccCCcceeccccccccccchhhHHHHHHHHHHHHHHHHHHHHhcccccccEEEEChhHHhHHhhcc Q lcl|Aclame:pro 306 VNSQAQIGKSGWTQGAGAAGVFDFSDAVDVKGARWAGEAYKALLIQIEKEANEIGRQTGRGNGNFIIASRNVVSALSMTD 385 (514) Q Consensus 306 l~~~a~v~~~~~~~~v~~~g~~dl~~~~d~~~~rwa~e~~r~L~~~i~~~a~~I~~~T~r~~~n~~v~S~~va~~L~~~g 385 (514) .-.. .+.|....+ ....++.... .........+|++|.....|... T Consensus 217 ~g~~------------~~~~~~~~~-------------~i~~~~~~~l--------~~~~~~~a~~vmn~~~~~~L~~l- 262 (371) T protein:vir:81 217 LNTK------------AKTAIADLD-------------GLKQIINVQL--------DPVFRSTSSVIVNQDAFNWLDTL- 262 (371) T ss_pred cccc------------cccccccHH-------------HHHHHHHhhc--------chhhhcCCEEEEcHHHHHHHHHh- Confidence 3111 123333221 1222221111 11111234688999988887742 Q ss_pred ccccchhccccCccccccccCceEEEEecCceEEEecCCCccceEEEEEecCCCcccceeeccccc----ccc--c-ccc Q lcl|Aclame:pro 386 TLVGPAAQGMQDGSMNTDTNQTVFAGVLGGRFKVYIDQYAVNDYFTVGFKGSTEMDAGVFYSPYVP----LTP--L-RGS 458 (514) Q Consensus 386 ~~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~kG~~~~~~~~fy~PYv~----~~~--~-~~~ 458 (514) .. ..| ...-......-..|+|.| ++||+..+.+...-.++--+. -...++|+.+.. ... + -.+ T Consensus 263 --kd--~~g---~~l~~~~~~~~~~~~l~G-~pV~~~~~~~~~~~~~~~~~~--~~~~i~~Gd~~~~~~~~~~~~~~i~~ 332 (371) T protein:vir:81 263 --KD--QNG---QYLLQPSISSPTGRQLLG-LPVVIVSNKVLANRVDGGTGA--QFAPIIVGDLKEAVVMFDRQRTEIMS 332 (371) T ss_pred --hc--cCC---CeeeecccCCCCCceecc-eeEEEecccccCccccccccC--CcceEEEEehhceEEEEeecceEEEE Confidence 11 000 000000001112256654 588887776643222111111 111234443211 100 0 012 Q ss_pred CCc------cccceeeeeeeeee-eecC--ccccccCcc Q lcl|Aclame:pro 459 DSK------NFQPVIGFKTRYGV-QVNP--FADPTASAT 488 (514) Q Consensus 459 dp~------s~qp~~~~~tRY~l-~~nP--f~~~~~~~~ 488 (514) ++. +-+=.+-...||+. ..+| |...+--.+ T Consensus 333 ~~~~~~~f~~~~v~~~~~~r~d~~~~~~~a~~~~~~~~A 371 (371) T protein:vir:81 333 SNVAMDAFETDATLWRAIERMDVKMRDDEAFVFGEVQLA 371 (371) T ss_pred eccccchhhcCceEEEEEEeeccEEecccceEEEEEecC Confidence 222 22345555566665 4455 322111111 No 45 >protein:vir:9704 Length: 394 # NCBI annotation: hypothetical protein # Family: family:all:21 # MgeID: mge:174 # MgeName: 315.2 # Cross-refs: genbank:acc:NP_795466;genbank:gi:28876225;genbank:GeneID:1257769 Probab=79.10 E-value=0.11 Score=25.89 Aligned_cols=320 Identities=16% Similarity=0.111 Sum_probs=119.7 Q ss_pred Ccchhhhhhhhcccc-----ccccccccchhhhhhhhhhhhHHHHHHhc--ccccchhhhhhhccccccccccccccccc Q lcl|Aclame:pro 1 MNLTEKWKDLLEAEG-----ADMPEIATATKQKIMSKIFENQDRDINND--PMYRDPQLVEAFNAGLNEAVVNGDHGYDP 73 (514) Q Consensus 1 ~~l~~kw~p~l~~~~-----~~~~~i~~~~~~~~~~~~~enq~~~~~~~--~~~~~~~~~~~~~~~~~~a~~~~~~g~~~ 73 (514) +...++ ..+... .....-.....+......+..+....+.. ....+......-.....+....+ T Consensus 60 ~~~~e~---~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------ 130 (394) T protein:vir:97 60 LKLYES---SVEVGGAENIGGKEVTQEEKTYRESVNDFIRSKGKIVNDSLRFEGKDEVLMPINETTPVEPQKDG------ 130 (394) T ss_pred HHHHHH---HhhhhccccccccccchhhHHHHHHHHHHHHHHHHHhhhhhhhhhHHHHHHHHHhhhhhhhhccc------ Confidence 111111 111100 00000011111122222222222211111 00000000000000000000000 Q ss_pred cccccccccccccccceeeehhhhhhhhhhhhcceeEEecCCcccceeeeeeeeecCCCCccccccccccCCCCccCccc Q lcl|Aclame:pro 74 ANIAQGVTTGAVTNIGPTVMGMVRRAIPQLIAFDIAGVQPMTGPTSQVFTLRSVYGKDPLTGAEAFHPTRQADASFSGQA 153 (514) Q Consensus 74 ~~~~~st~tg~v~~~~P~l~~l~Rra~~~LIa~DI~GVQPmTgPTGLIFAMRSrY~~~~~tg~EA~~~~nEadt~fSG~~ 153 (514) -...+|.+.--....-.+++.+-+..+...++.+.||+++++-+--++. . ++ . ..+ T Consensus 131 ----~t~~~gg~liP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~----~--~~-~---------~~~---- 186 (394) T protein:vir:97 131 ----IKKENAKPVSSEEILYTPAREVKTVVDLKPFTTVYQAKKASGKYPVLQR----A--TT-K---------MVT---- 186 (394) T ss_pred ----cccccccccChHHHHHHHHHHhhhhhhhhhhceeeeccCcceEEEEEec----C--CC-c---------cce---- Confidence 0011111111111112245545566677889999999887654321110 0 00 0 000 Q ss_pred cccccccccccccccccccccccccccccccccccccccccccccccCccccccccccccccccccccccccchhhhccc Q lcl|Aclame:pro 154 AASTIADFPTTGAATDGTPYKAEVTTSGGDVSMRYFLALGAVTLAVAGQMTATEYTDGVAGGLLVEIDAGMATSQAELQE 233 (514) Q Consensus 154 ~~~~~~~~~~~~~~t~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~y~~~~GmtTa~aEal~ 233 (514) ++ | T Consensus 187 ------------------------------------------------------------------v~--------E--- 189 (394) T protein:vir:97 187 ------------------------------------------------------------------VA--------E--- 189 (394) T ss_pred ------------------------------------------------------------------ec--------c--- Confidence 00 0 Q ss_pred cCCCCCCcccccc-eeEEEEEEEEeecccccccccHHHHHHHHhhcCCChhHHHHHHHHHHHHHHhhHHHHHHHhhheee Q lcl|Aclame:pro 234 NFNGSSNNEWNEM-SFRIDKQVVEAKSRQLKAQYSIELAQDLRAVHGLDADAELSGILANEVMVELNREIVNLVNSQAQI 312 (514) Q Consensus 234 ~~ggs~~~~f~EM-sFsIEK~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAEaELanILStEImlEINReii~~l~~~a~v 312 (514) +...++. ...++++++.++.-+....+|-||++|- +.|.+++|.+-|+..|..-+|..||.-+... T Consensus 190 ------~~~~~~~~~~~~~~v~l~~~k~~~~i~is~ell~ds----~~~~~~~i~~~la~~~~~~~~~~i~~g~~~~--- 256 (394) T protein:vir:97 190 ------LEKNPALAKPDFKDVAWNIDTYRGAIPLSQESIDDA----DVDLVGIVSESISQIKVNTTNDAIAKVLKSF--- 256 (394) T ss_pred ------cccccccccccceeEEeehhheeeehhhHHHHHhhh----hHHHHHHHHHHHHHHHHHHHHHHHhhccccc--- Confidence 0011111 1345566666666666778999999986 3456888888888888888888886543211 Q ss_pred cccccccccCCcceeccccccccccchhhHHHHHHHHHHHHHHHHHHHHhcccccccEEEEChhHHhHHhhccccccchh Q lcl|Aclame:pro 313 GKSGWTQGAGAAGVFDFSDAVDVKGARWAGEAYKALLIQIEKEANEIGRQTGRGNGNFIIASRNVVSALSMTDTLVGPAA 392 (514) Q Consensus 313 ~~~~~~~~v~~~g~~dl~~~~d~~~~rwa~e~~r~L~~~i~~~a~~I~~~T~r~~~n~~v~S~~va~~L~~~g~~~~~~~ 392 (514) .+.+...+ +....++.... .. +..+. +||+|.+...|... .. . T Consensus 257 ---------~~~~~~~~-------------~~~~~~~~~~~--------~~-~~~a~-~v~n~~~~~~l~~l---kd--~ 299 (394) T protein:vir:97 257 ---------TTKTVKNL-------------DEIKALLNGGF--------DP-AYNVS-LIVSQSFYQTLDTL---KD--G 299 (394) T ss_pred ---------cccccccH-------------HHHHHHHHhhh--------hh-hhCCE-EEEcHHHHHHHHHh---hc--c Confidence 11222111 11222221111 11 22333 67999998887653 10 0 Q ss_pred ccccCccccccccCceEEEEecCceEEEe--cCCCccceEEEEEecCCCcccceeeccccccccccccCCccccceeeee Q lcl|Aclame:pro 393 QGMQDGSMNTDTNQTVFAGVLGGRFKVYI--DQYAVNDYFTVGFKGSTEMDAGVFYSPYVPLTPLRGSDSKNFQPVIGFK 470 (514) Q Consensus 393 ~~~~~~~~~~d~~~~~~~G~l~~~~~vy~--D~y~~~dy~~vG~kG~~~~~~~~fy~PYv~~~~~~~~dp~s~qp~~~~~ 470 (514) .|. --...+.++. .-++|.| ++|++ |...+..-+++|-- . .+.++..-.. ..+...|...++-.+-.. T Consensus 300 ~G~--~i~~~~~~~~-~~~~l~G-~pv~~~~~~~~~~~~~~~gd~---~--~~~~~~~~~~-~~~~~~~~~~~~~~~~~~ 369 (394) T protein:vir:97 300 NGR--YLLQDDITAV-SGKVLLG-KPVFVLSDEVLGANKAFIGDF---K--RGVLFADRKD-LGLRWADNEIYGQYLQAV 369 (394) T ss_pred CCC--eeeecCcCCC-CCceecc-ceeEEecccccCCccEEEeec---c--ccEEEEEecc-eEEEEecccccceeEEEE Confidence 000 0011111111 1135655 46666 44444444444420 0 0111111101 111233445555555566 Q ss_pred eeeee-eecC--cccc--ccCccee Q lcl|Aclame:pro 471 TRYGV-QVNP--FADP--TASATKV 490 (514) Q Consensus 471 tRY~l-~~nP--f~~~--~~~~~~i 490 (514) .||+. +.+| |... +.-..+. T Consensus 370 ~r~d~~v~~~~a~~~~~~~~~~~p~ 394 (394) T protein:vir:97 370 LRFGVSKVDDKAGYYVTFTPEPLPL 394 (394) T ss_pred EEEccEEecccceEEEEecccccCC Confidence 77776 3344 2211 1111111 No 46 >protein:vir:9759 Length: 303 # NCBI annotation: putative structural protein # Family: family:all:966 # MgeID: mge:175 # MgeName: 315.3 # Cross-refs: genbank:acc:NP_795521;genbank:gi:28876283;genbank:GeneID:1257824 Probab=78.78 E-value=0.11 Score=25.82 Aligned_cols=284 Identities=13% Similarity=0.085 Sum_probs=117.0 Q ss_pred cccccccccccccceeee-hhhhhhhhhhhhcceeEEecCCcccceeeeeeeeecCCCCccccccccccCCCCccCcccc Q lcl|Aclame:pro 76 IAQGVTTGAVTNIGPTVM-GMVRRAIPQLIAFDIAGVQPMTGPTSQVFTLRSVYGKDPLTGAEAFHPTRQADASFSGQAA 154 (514) Q Consensus 76 ~~~st~tg~v~~~~P~l~-~l~Rra~~~LIa~DI~GVQPmTgPTGLIFAMRSrY~~~~~tg~EA~~~~nEadt~fSG~~~ 154 (514) ++-++ +++. ...|.+. .+++++.+..+-.+++.+.||++.+.-|. ++.. +.+|.+ = T Consensus 1 m~t~t-~gg~-liP~~~~~~ii~~l~~~s~i~~l~~~~~~~~~~~~ip----~~~~----~~~a~w---------v---- 57 (303) T protein:vir:97 1 MGTET-SKAS-LFDKHLVSDLINKVKGHSSLAKLSSQKPIPFNGSKEF----TFTL----DSDIDV---------V---- 57 (303) T ss_pred CcccC-CCCe-EcchhHHHHHHHHHHhhchhhhhcceeecCCCceEEE----EEec----CcceEE---------e---- Confidence 22111 2211 1112221 34555567778899999999876544331 1111 111111 0 Q ss_pred ccccccccccccccccccccccccccccccccccccccccccccccCccccccccccccccccccccccccchhhhcccc Q lcl|Aclame:pro 155 ASTIADFPTTGAATDGTPYKAEVTTSGGDVSMRYFLALGAVTLAVAGQMTATEYTDGVAGGLLVEIDAGMATSQAELQEN 234 (514) Q Consensus 155 ~~~~~~~~~~~~~t~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~y~~~~GmtTa~aEal~~ 234 (514) +| T Consensus 58 --------------------------------------------------------------------------~E---- 59 (303) T protein:vir:97 58 --------------------------------------------------------------------------AE---- 59 (303) T ss_pred --------------------------------------------------------------------------ec---- Confidence 01 Q ss_pred CCCCCCcccccceeEEEEEEEEeecccccccccHHHHHHHHhhcCCChhHHHHHHHHHHHHHHhhHHHHHHHhhheeecc Q lcl|Aclame:pro 235 FNGSSNNEWNEMSFRIDKQVVEAKSRQLKAQYSIELAQDLRAVHGLDADAELSGILANEVMVELNREIVNLVNSQAQIGK 314 (514) Q Consensus 235 ~ggs~~~~f~EMsFsIEK~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAEaELanILStEImlEINReii~~l~~~a~v~~ 314 (514) +.++++-..+++.++..+|.-+-...+|-||.|.... ..++-+++|.+-|+..|...|+..++.-..... +. T Consensus 60 -----~~~~~~s~~~f~~v~l~~~kl~~~~~iS~ell~~~~d-~~~~l~~~i~~~la~a~~~~ld~a~l~G~~~~~--g~ 131 (303) T protein:vir:97 60 -----NGKKTHGGLSLEPVTIVPIKVEYGARLSDEFLYATEE-EKIDILKAFNEGFAKKLARGIDLMAMHGINPRT--KK 131 (303) T ss_pred -----CccccccccceeeEEeeeEEEEEeehhhHHHhhcCcc-chHHHHHHHHHHHHHHHHHHHHhhhhcccccCC--cc Confidence 0112222233345555555555556799999863321 235568889999999999999988854321100 00 Q ss_pred cccccccCCcceeccccccccccchhhHHHHHHHHHHHHHHHHHHHHhcccccccEEEEChhHHhHHhhccccccchhcc Q lcl|Aclame:pro 315 SGWTQGAGAAGVFDFSDAVDVKGARWAGEAYKALLIQIEKEANEIGRQTGRGNGNFIIASRNVVSALSMTDTLVGPAAQG 394 (514) Q Consensus 315 ~~~~~~v~~~g~~dl~~~~d~~~~rwa~e~~r~L~~~i~~~a~~I~~~T~r~~~n~~v~S~~va~~L~~~g~~~~~~~~~ 394 (514) ..... +...+..+.. .-+... ....++.-|.++-+.+.. ..+..+.+|++|+....|.... + ..| T Consensus 132 ~~~~~--~~~~~~~~~~-~~~~~~-----~~~~~~~~i~~~~~~~~~--~~~~~~~~vmn~~~~~~L~~lk--d---~~g 196 (303) T protein:vir:97 132 ASDVI--GTNHFDSKVT-QVVKFT-----ESEDADANIEAAVNLIQG--AEGVVTGLAMDTEFSTALAKVT--N---GEM 196 (303) T ss_pred ccccc--cccccccccc-cccccc-----cccchHHHHHHHHHHHhh--cCCCccEEEEcHHHHHHHHHhh--c---cCC Confidence 00000 0000100100 000000 011223344444444432 2456667999999998886321 1 000 Q ss_pred ccCccccccccCceEEEEecCceEEEecCCCccce-----EEEEEecCCCcccceeecccc--ccccccccCCcc----- Q lcl|Aclame:pro 395 MQDGSMNTDTNQTVFAGVLGGRFKVYIDQYAVNDY-----FTVGFKGSTEMDAGVFYSPYV--PLTPLRGSDSKN----- 462 (514) Q Consensus 395 ~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy-----~~vG~kG~~~~~~~~fy~PYv--~~~~~~~~dp~s----- 462 (514) +.-...+.....-.|+|.| ++|+++.+.+... -.+.+-|+- ...+.+...- ++......|++. T Consensus 197 --~~~~~~~~~~~~~~~~l~G-~Pv~~s~~v~~~~~~~~~~~~~~~Gdf--~~~~~~~~~~~~~~~~~~~~~~d~~~~~~ 271 (303) T protein:vir:97 197 --GPKMYPELAWGANPDSING-LKSSVNTTVGAGADEAESKDLVIIGDF--ESMFKWGYAKQIPMEIIKYGDPDNSGKDL 271 (303) T ss_pred --CeEEecCccCCCCCceecc-eeeEEecccCCccccCCCccEEEEeec--cccEEEEEecCcEEEEeeccCCCCcchhh Confidence 0001111110111146664 7999987755311 011122221 0111111111 111111122221 Q ss_pred ccc-eeee--eeeeee-eecC--ccccccCccee Q lcl|Aclame:pro 463 FQP-VIGF--KTRYGV-QVNP--FADPTASATKV 490 (514) Q Consensus 463 ~qp-~~~~--~tRY~l-~~nP--f~~~~~~~~~i 490 (514) |+- .++| ..||+. +.+| |+..++.. | T Consensus 272 ~~~n~~~~r~~~r~~~~v~~p~af~~l~~~~--~ 303 (303) T protein:vir:97 272 KGYNQIYLRAEAYIGWGILDAKSFARVTKGE--V 303 (303) T ss_pred hhcCcEEEEEEEEeccEeecccceEEeeCCC--C Confidence 111 1333 456765 4555 43333222 1 No 47 >protein:vir:78223 Length: 333 # NCBI annotation: Putative major head protein # Family: family:all:966 # MgeID: mge:1849 # MgeName: Bethlehem # Cross-refs: genbank:acc:YP_001491666;genbank:gi:157786490;genbank:GeneID:5625701 Probab=78.31 E-value=0.12 Score=25.72 Aligned_cols=307 Identities=15% Similarity=0.095 Sum_probs=116.7 Q ss_pred HHHHHhcccccchhhhhhhccccccccccccccccccccccccccccccccceeee--hhhhhhhhhhhhcceeEEecCC Q lcl|Aclame:pro 38 DRDINNDPMYRDPQLVEAFNAGLNEAVVNGDHGYDPANIAQGVTTGAVTNIGPTVM--GMVRRAIPQLIAFDIAGVQPMT 115 (514) Q Consensus 38 ~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~g~~~~~~~~st~tg~v~~~~P~l~--~l~Rra~~~LIa~DI~GVQPmT 115 (514) ...|+| ...... |.....-..+..++-+ |--+ -+++.+.++.+...++-+.||+ T Consensus 1 ~a~l~e-------------------l~~~~~-~~~~~g~~~~~~~~li----P~~~~~~ii~~l~~~s~l~~~~~~~~~~ 56 (333) T protein:vir:78 1 MATLNE-------------------LLPNSA-GSNHQGRLAHVPSDLL----PKEIVGPIFDKAQESSLVLRMGEQIPIS 56 (333) T ss_pred CchhHH-------------------hhhhcc-cccccCceecCCcccc----chhHHHHHHHHHHhhchhhhhcceeecc Confidence 122222 211100 0000000111111111 1111 1444455677778889999987 Q ss_pred cccceeeeeeeeecCCCCccccccccccCCCCccCccccccccccccccccccccccccccccccccccccccccccccc Q lcl|Aclame:pro 116 GPTSQVFTLRSVYGKDPLTGAEAFHPTRQADASFSGQAAASTIADFPTTGAATDGTPYKAEVTTSGGDVSMRYFLALGAV 195 (514) Q Consensus 116 gPTGLIFAMRSrY~~~~~tg~EA~~~~nEadt~fSG~~~~~~~~~~~~~~~~t~~~~~~~~~~~~~g~~~~~~~~~~~~~ 195 (514) +-.--|.-. .. .+.+.|-+.+. T Consensus 57 ~~~~~~p~~----~~-------------~~~a~~v~eg~----------------------------------------- 78 (333) T protein:vir:78 57 YGETIIPTT----VK-------------RPEVGQVGVGT----------------------------------------- 78 (333) T ss_pred CCceEEEEE----eC-------------CceeEeecCcc----------------------------------------- Confidence 632221111 11 01111111100 Q ss_pred cccccCccccccccccccccccccccccccchhhhccccCCCCCCcccccceeEEEEEEEEeecccccccccHHHHHHHH Q lcl|Aclame:pro 196 TLAVAGQMTATEYTDGVAGGLLVEIDAGMATSQAELQENFNGSSNNEWNEMSFRIDKQVVEAKSRQLKAQYSIELAQDLR 275 (514) Q Consensus 196 ~~~~~~~~~~~~~~~~~a~~~~y~~~~GmtTa~aEal~~~ggs~~~~f~EMsFsIEK~tVtAKSRaLKAEYT~ELAQDLk 275 (514) .....|... -..+...|.+..++..|..+- ...|-||.+|-. T Consensus 79 -----------------------------~~~~~e~~~--~~~~~~~f~~i~l~~~kl~~~-------~~is~ell~~s~ 120 (333) T protein:vir:78 79 -----------------------------SNEQREGGL--KPLSGTAWDTRSVSPIKLATI-------VTVSEEFARMNP 120 (333) T ss_pred -----------------------------ccccccccc--ccccccceeEEEEeeEEEEEe-------ehhhHHHHhcCH Confidence 000111000 001123455555555555544 447888888754 Q ss_pred hhcCCChhHHHHHHHHHHHHHHhhHHHHHHHhhheeecccccccccCC-cceeccccccccccchhhHHHHHHHHHHHHH Q lcl|Aclame:pro 276 AVHGLDADAELSGILANEVMVELNREIVNLVNSQAQIGKSGWTQGAGA-AGVFDFSDAVDVKGARWAGEAYKALLIQIEK 354 (514) Q Consensus 276 AiHGLDAEaELanILStEImlEINReii~~l~~~a~v~~~~~~~~v~~-~g~~dl~~~~d~~~~rwa~e~~r~L~~~i~~ 354 (514) .|.+++|.+.|...|...|+..+|.--.... .. ...++.+ .++..-. .... ........+..|.+ T Consensus 121 ----~~~~~~i~~~la~ai~~~~d~~~l~G~g~~~---~~-~~~g~~~~~~~~~~~--~~~~----~~~~~~~~~~~i~~ 186 (333) T protein:vir:78 121 ----SGLYTKLQGDLAYAIGRGIDLAVFHGKSPLT---GS-ALQGIDTDNVIANTT--NVDY----LQETGDPLLDRLLD 186 (333) T ss_pred ----HHHHHHHHHHHHHHHHHHHHHHHhcccCCCC---Cc-ccccccccccccccc--cccc----cccccchhHHHHHH Confidence 4579999999999999999999953211110 00 0001100 1110000 0000 00011111222222 Q ss_pred HHHHHHHhcccccccEEEEChhHHhHHhhccccccchhccccCccccccccCceEEEEecCceEEEecCCCccce----- Q lcl|Aclame:pro 355 EANEIGRQTGRGNGNFIIASRNVVSALSMTDTLVGPAAQGMQDGSMNTDTNQTVFAGVLGGRFKVYIDQYAVNDY----- 429 (514) Q Consensus 355 ~a~~I~~~T~r~~~n~~v~S~~va~~L~~~g~~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy----- 429 (514) +-..+...- ...++.+|++|+....|.....+.. .+|..- ...+.. ..-.|+|.| ++|+++.+.+.+. T Consensus 187 ~~~~~~~~~-~~~~~~~vmn~~~~~~L~~~~~~~d--~~G~~i--~~~~~~-~~~~~~l~G-~Pv~~~~~i~~~~~~~~~ 259 (333) T protein:vir:78 187 GYDLVSANT-DVEFNGWAVDPRFRAHLLRAQAYRD--ANGNVD--PSRINL-AAQTGDVLG-LPAQFGRAVGGDLGAAVD 259 (333) T ss_pred HHHhhcccc-ccCceEEEEcchHHHHHHHHhhhcC--CCCcee--ecCccc-cCCCceeec-eeeEEccccCCCccccCC Confidence 222222222 4667788899988777754333211 000000 000000 001155664 5999988766442 Q ss_pred ----EEEE--------EecCCCcccceeeccccccccccccCCcccc-ceee--eeeeeee-eecC--ccccccCcce Q lcl|Aclame:pro 430 ----FTVG--------FKGSTEMDAGVFYSPYVPLTPLRGSDSKNFQ-PVIG--FKTRYGV-QVNP--FADPTASATK 489 (514) Q Consensus 430 ----~~vG--------~kG~~~~~~~~fy~PYv~~~~~~~~dp~s~q-p~~~--~~tRY~l-~~nP--f~~~~~~~~~ 489 (514) +++| ..+..+. -..+|.-.......--.-|| -.++ ...|++. ..+| |+..+...++ T Consensus 260 ~~~~~~~gD~~~~~~g~~~~~~i----~~~~~~~~~~~~~~~~~~~~~~~v~~r~~~r~d~~v~~~~a~~~l~~~~a~ 333 (333) T protein:vir:78 260 SKTRIIGGDFSQLKFGFADEIRI----KMSDTATLTDSGSATVSMWQTNQIAILIEVTFGWLLGDKQAFVKFVDDEQP 333 (333) T ss_pred CccEEEEEecccEEEEEeeccEE----EEeccccccccccceeehhhcCcEEEEEEEEEccEEecccceEEEeccCCC Confidence 3333 2222111 11222110000000000111 1122 2347765 3566 4433222222 No 48 >protein:vir:1638 Length: 298 # NCBI annotation: Structural protein # Family: family:all:966 # MgeID: mge:33 # MgeName: r1t # Cross-refs: genbank:acc:NP_695059;genbank:gi:23455750;genbank:GeneID:955469 Probab=78.09 E-value=0.12 Score=25.67 Aligned_cols=279 Identities=11% Similarity=0.020 Sum_probs=106.7 Q ss_pred ccccccccccccccccccccccccccccccccccccccccCccccccccccccccccccccccccchhhhccccCCCCCC Q lcl|Aclame:pro 161 FPTTGAATDGTPYKAEVTTSGGDVSMRYFLALGAVTLAVAGQMTATEYTDGVAGGLLVEIDAGMATSQAELQENFNGSSN 240 (514) Q Consensus 161 ~~~~~~~t~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~y~~~~GmtTa~aEal~~~ggs~~ 240 (514) -...++........... . ............ .....-...... ..+..+.+-..-.+| + T Consensus 1 ma~~gG~lvp~~~~~~i--i--~~~~~~s~i~~l-~~~~~~~~~~~~--------ip~~~~~~~a~~v~E---------~ 58 (298) T protein:vir:16 1 MVLNKGTLFDPTLVTDL--I--SKVAGKSSIARL-SAQKPIPFNGEK--------VFTFTMDSEIDVVAE---------S 58 (298) T ss_pred CcccCcceechhHHHHH--H--HHHHhhhhhhhh-cceeeccCCceE--------EEEEecCcceEEecC---------C Confidence 00000000000000000 0 000000000000 000000000000 001111111111112 3 Q ss_pred cccccceeEEEEEEEEeecccccccccHHHHHHHHhhcCCChhHHHHHHHHHHHHHHhhHHHHHHHhhheeecccccccc Q lcl|Aclame:pro 241 NEWNEMSFRIDKQVVEAKSRQLKAQYSIELAQDLRAVHGLDADAELSGILANEVMVELNREIVNLVNSQAQIGKSGWTQG 320 (514) Q Consensus 241 ~~f~EMsFsIEK~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAEaELanILStEImlEINReii~~l~~~a~v~~~~~~~~ 320 (514) .++++-..++++++..+|.-+-....|-||.++--- -..|-+++|.+-|+..|...|+..++.-...- .+.... T Consensus 59 ~~~~~~~~~f~~v~l~~~k~a~~~~iS~ell~~s~d-~~~~l~~~i~~~la~ai~~~~d~~~l~G~~~~-----~g~~~~ 132 (298) T protein:vir:16 59 GKKTHGGVTLAPQTMVPIKVEYGARISDEFMYASDE-EKINILQEFNDGFAKKVARGIDLMAFHGVNPR-----LGTASA 132 (298) T ss_pred ccccccccceeEEEEeeeeEEEeehhhHHHhhcCcc-cHHHHHHHHHHHHHHHHHHHHHHHhhccccCC-----CCcccc Confidence 466777777788888888878888999999875421 12445778888888888888888885331100 111110 Q ss_pred -cCCcceeccccccccccchhhHHHHHHHHHHHHHHHHHHHHhcccccccEEEEChhHHhHHhhccccccchhccccCcc Q lcl|Aclame:pro 321 -AGAAGVFDFSDAVDVKGARWAGEAYKALLIQIEKEANEIGRQTGRGNGNFIIASRNVVSALSMTDTLVGPAAQGMQDGS 399 (514) Q Consensus 321 -v~~~g~~dl~~~~d~~~~rwa~e~~r~L~~~i~~~a~~I~~~T~r~~~n~~v~S~~va~~L~~~g~~~~~~~~~~~~~~ 399 (514) .+..++......... ..+....++..|.++...+.. ...+...+|++|+....|.... + ..| .+ T Consensus 133 ~~~~~~~~~~~~~~~~-----~~~~~~~~~~~i~~~~~~~~~--~~~~~~~~vmn~~~~~~l~~lk--d---~~G---~~ 197 (298) T protein:vir:16 133 VIGTNHFDSKVTQKVE-----APRGIADPNGAIENAVELLTG--VDADVTGIAINPSFRSALAKQK--D---LQD---NA 197 (298) T ss_pred cccccccccccccccc-----cccccccHHHHHHHHHHHhhh--cCCCccEEEEcHHHHHHHHHhh--c---cCC---Ce Confidence 111111110000000 001122233344444444432 1355566899999988876521 1 011 11 Q ss_pred c-cccccCceEEEEecCceEEEecCCCcc------ceEEEEEecCCCcccceeecccc--ccccccccCCcc-----cc- Q lcl|Aclame:pro 400 M-NTDTNQTVFAGVLGGRFKVYIDQYAVN------DYFTVGFKGSTEMDAGVFYSPYV--PLTPLRGSDSKN-----FQ- 464 (514) Q Consensus 400 ~-~~d~~~~~~~G~l~~~~~vy~D~y~~~------dy~~vG~kG~~~~~~~~fy~PYv--~~~~~~~~dp~s-----~q- 464 (514) . ..+..+. -.|+|.| ++|+++.+.+. +.+++|- - ..++.|..-- .+...+..|+++ || T Consensus 198 i~~~~~~~~-~~~~l~G-~PV~~~~~v~~~~~~~~~~~~~GD---f--s~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~ 270 (298) T protein:vir:16 198 LFPELKWGA-TPDTING-LPVDVNKTVSDMSLTQRDRAIIGD---F--ANGFKWGYAKEVPLEVIQYGDPDNSGLDLKGY 270 (298) T ss_pred eecCcccCC-CCceecc-eeeEEecccccccCCCccEEEEee---c--cceEEEEEecCceEEEeeccCCcCcchhhhhc Confidence 1 1111110 0156765 49999887652 2344441 0 0111122110 111122223332 22 Q ss_pred ceeee--eeeeee-eecCccccccCcceeecCc Q lcl|Aclame:pro 465 PVIGF--KTRYGV-QVNPFADPTASATKVGNGA 494 (514) Q Consensus 465 p~~~~--~tRY~l-~~nPf~~~~~~~~~i~~~~ 494 (514) =.++| ..|++. ..+| +...++.+.+ T Consensus 271 ~~v~~ra~~r~d~~v~~~-----~a~~~l~~at 298 (298) T protein:vir:16 271 NQVYIRAELFLGWGILDA-----TKFARVTEAN 298 (298) T ss_pred CcEEEEEEEEEccEeecc-----cceEEEeecC Confidence 11333 457764 4555 1123333333 No 49 >protein:vir:100247 Length: 425 # NCBI annotation: gp76 # Family: family:all:21 # MgeID: mge:1619 # MgeName: Bcep176 # Cross-refs: genbank:acc:YP_355412;genbank:gi:77864702;genbank:GeneID:3725969 Probab=77.87 E-value=0.12 Score=25.63 Aligned_cols=324 Identities=14% Similarity=0.155 Sum_probs=118.6 Q ss_pred Ccchhhhhhhhccccccccccccchhhhhhhhhhh--hHHHHH-Hhcccccchhhhhhhccccccccccccccccccccc Q lcl|Aclame:pro 1 MNLTEKWKDLLEAEGADMPEIATATKQKIMSKIFE--NQDRDI-NNDPMYRDPQLVEAFNAGLNEAVVNGDHGYDPANIA 77 (514) Q Consensus 1 ~~l~~kw~p~l~~~~~~~~~i~~~~~~~~~~~~~e--nq~~~~-~~~~~~~~~~~~~~~~~~~~~a~~~~~~g~~~~~~~ 77 (514) ..++++ ++.... +|... ++. +....+ +.++.. ++...........+|..++...+ ....+. T Consensus 68 ~e~~~~----~~~~~~---ei~~~-~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~af~~~l~~~e-------~~~al~ 131 (425) T protein:vir:10 68 SDALAK----VDKVSA---DLEAL-QAA-VDEANIKIAAAQMGANGVKPLRDPEYTEAFKAHVKRGD-------VQAALN 131 (425) T ss_pred HHHHHH----HHHHHH---HHHHH-HHH-HHHHHHHHHhhhcccccccccccHHHHHHHHHHhhhhh-------hHHHhh Confidence 011111 000000 11100 000 000000 000000 01111112222223332221110 000111 Q ss_pred ccccc-ccc---cccceeeehhhhhhhhhhhhcceeEEecCCcccceeeeeeeeecCCCCccccccccccCCCCccCccc Q lcl|Aclame:pro 78 QGVTT-GAV---TNIGPTVMGMVRRAIPQLIAFDIAGVQPMTGPTSQVFTLRSVYGKDPLTGAEAFHPTRQADASFSGQA 153 (514) Q Consensus 78 ~st~t-g~v---~~~~P~l~~l~Rra~~~LIa~DI~GVQPmTgPTGLIFAMRSrY~~~~~tg~EA~~~~nEadt~fSG~~ 153 (514) .++++ |.+ +.+.+- +++.+-...+..++|.+.||+++..-+.- .. ++..+ .|-+. T Consensus 132 ~~t~~~gG~lvP~~~~~~---ii~~~~~~s~l~~l~~~~~~~~~~~~~~~-----~~---~~~~a---------~wv~E- 190 (425) T protein:vir:10 132 KGEDSEGGYLTPIEWDRT---ITNKLVLISPMRQLCRVQPVSKAGFSKLF-----NM---GGTTS---------GWVGE- 190 (425) T ss_pred cCcCCCCceeccHhHHHH---HHHHHHhhhhhhhhceeeeccCCceEEEE-----Ec---CCcce---------eeecc- Confidence 11111 111 111122 44444456677789999999877543321 11 01111 01000 Q ss_pred cccccccccccccccccccccccccccccccccccccccccccccccCccccccccccccccccccccccccchhhhccc Q lcl|Aclame:pro 154 AASTIADFPTTGAATDGTPYKAEVTTSGGDVSMRYFLALGAVTLAVAGQMTATEYTDGVAGGLLVEIDAGMATSQAELQE 233 (514) Q Consensus 154 ~~~~~~~~~~~~~~t~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~y~~~~GmtTa~aEal~ 233 (514) .+.. T Consensus 191 ---------------------------------------------------------------------------~~~~- 194 (425) T protein:vir:10 191 ---------------------------------------------------------------------------ASQR- 194 (425) T ss_pred ---------------------------------------------------------------------------cccc- Confidence 0000 Q ss_pred cCCCCCCcccccceeEEEEEEEEeecccccccccHHHHHHHHhhcCCChhHHHHHHHHHHHHHHhhHHHHHHHhhheeec Q lcl|Aclame:pro 234 NFNGSSNNEWNEMSFRIDKQVVEAKSRQLKAQYSIELAQDLRAVHGLDADAELSGILANEVMVELNREIVNLVNSQAQIG 313 (514) Q Consensus 234 ~~ggs~~~~f~EMsFsIEK~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAEaELanILStEImlEINReii~~l~~~a~v~ 313 (514) ..+....|.++.|.+.|..+ ...+|-||.+|-. +|.+++|.+-|+..|...+|+.||.- .-+- T Consensus 195 --~~~~~~~f~~v~~~~~k~~~-------~i~iS~ell~ds~----~~l~~~i~~~la~ai~~~~d~~~l~G---~G~~- 257 (425) T protein:vir:10 195 --PQTNAATFQPLSFASGEIYA-------NPAATQQILDDAE----IDLESWLATEVQTEFAKQEGKAFLAG---DGTN- 257 (425) T ss_pred --ccccccccceeeeeheeeEe-------ehHhHHHHHhcch----hHHHHHHHHHHHHHHHHHHHhhhhcc---cCCC- Confidence 00011236666666666655 4569999999853 45689999999999999999988531 1100 Q ss_pred ccccccccCCcceecccc---------------ccccccchhhHHHHHHHHHHHHHHHHHHHHhcccccccEEEEChhHH Q lcl|Aclame:pro 314 KSGWTQGAGAAGVFDFSD---------------AVDVKGARWAGEAYKALLIQIEKEANEIGRQTGRGNGNFIIASRNVV 378 (514) Q Consensus 314 ~~~~~~~v~~~g~~dl~~---------------~~d~~~~rwa~e~~r~L~~~i~~~a~~I~~~T~r~~~n~~v~S~~va 378 (514) .+.|++.... .+......-..+....|+..+...- +. ....|++|... T Consensus 258 --------~p~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~l~~l~~~l~~~~--------~~-~a~~vmn~~~~ 320 (425) T protein:vir:10 258 --------KPNGLLTYIAGGANAAKHPFGAIEVVNSGAAADITSDGIIDLVYDLPSAF--------TG-NARFAMNRNTQ 320 (425) T ss_pred --------CcceeeeccccccccccccccccccccccccccccHHHHHHHHhhhhhhh--------cc-CCEEEEchHHH Confidence 0112211100 0000000001123344444332211 22 23568999988 Q ss_pred hHHhhccccccchhccccCccc-cccccCceEEEEecCceEEEecCCCcc-----ceEEEEEecCCCcccceeecccccc Q lcl|Aclame:pro 379 SALSMTDTLVGPAAQGMQDGSM-NTDTNQTVFAGVLGGRFKVYIDQYAVN-----DYFTVGFKGSTEMDAGVFYSPYVPL 452 (514) Q Consensus 379 ~~L~~~g~~~~~~~~~~~~~~~-~~d~~~~~~~G~l~~~~~vy~D~y~~~-----dy~~vG~kG~~~~~~~~fy~PYv~~ 452 (514) ..|...- + .+| .+. ..+.+.. -.++|.| ++|+++.+.+. +.+++| +-.. ..+.. .- T Consensus 321 ~~L~~lk--D---~~G---~~l~~~~~~~g-~~~~l~G-~PV~~~~~~p~~~~~~~~i~~G---d~~~--~~~i~---~~ 382 (425) T protein:vir:10 321 RQVRKLK--D---GQG---NYLWQPSYVAG-QPATLAG-YPVTEVPDMPDVAANSTPILFG---DFQQ--TYLII---DR 382 (425) T ss_pred HHHHHhh--c---CCC---ceeeccCccCC-CCceecc-eeeEEecCcCCccCCccEEEEE---ehhc--cEEEE---Ee Confidence 8776421 1 000 111 0111110 1145664 58998887662 234433 1100 01111 00 Q ss_pred cccc-ccCCcc--ccceeeeeeeeee-eecCcc--ccccCcce Q lcl|Aclame:pro 453 TPLR-GSDSKN--FQPVIGFKTRYGV-QVNPFA--DPTASATK 489 (514) Q Consensus 453 ~~~~-~~dp~s--~qp~~~~~tRY~l-~~nPf~--~~~~~~~~ 489 (514) ..++ ..|+-. -+=.+-...||+. ..+|-+ ...-..++ T Consensus 383 ~~~~v~~d~~~~~~~~~~~~~~r~d~~v~~~~A~~~l~~~as~ 425 (425) T protein:vir:10 383 IGVRVLRDPYTAKPYVLFYTTKRVGGGLLNPEPMRAMKVAASE 425 (425) T ss_pred cceEEEecccccCCcEEEEEEEEeccEeecccceEEEEeeccC Confidence 0011 112222 1222233456665 455522 11111111 No 50 >protein:vir:739 Length: 231 # NCBI annotation: major structural protein 4 # Family: family:all:522 # MgeID: mge:14 # MgeName: Tuc2009 # Cross-refs: genbank:acc:NP_108716;genbank:gi:13487838;genbank:GeneID:920884 Probab=77.25 E-value=0.13 Score=25.50 Aligned_cols=215 Identities=10% Similarity=0.101 Sum_probs=94.5 Q ss_pred cccccccccccccccccchhhhccccCCCCCCcccccceeEEEEEEEEeecccccccccHHHHHHHHhhcCCChhHHHHH Q lcl|Aclame:pro 209 TDGVAGGLLVEIDAGMATSQAELQENFNGSSNNEWNEMSFRIDKQVVEAKSRQLKAQYSIELAQDLRAVHGLDADAELSG 288 (514) Q Consensus 209 ~~~~a~~~~y~~~~GmtTa~aEal~~~ggs~~~~f~EMsFsIEK~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAEaELan 288 (514) ..+...+.+-+++.- ...+|.+. ....-+..+|++ ..++++.|-+.=.=++|=| ..|.+ +| |.-.|..+ T Consensus 1 ~~~~~~Gdtit~P~~--iGda~~v~---eG~~i~~~~l~~--t~~~atIk~~gk~~~itD~--a~l~~-~g-Dp~~ea~~ 69 (231) T protein:vir:73 1 ENGINLANLCEYPND--IGDAADVA---EGGEISLDKIGT--TTKSVTIKKAAKGTEITDE--AALSG-YG-DPIGESNK 69 (231) T ss_pred CccccCCceEEeccc--ccchhhhc---CCCcCChhhccc--cceeeeEeeeccceeeeHH--HHhhc-cC-chHHHHHH Confidence 011111111111110 22333331 111223444544 4444454544332333322 23445 33 88999999 Q ss_pred HHHHHHHHHhhHHHHHHHhhheeecccccccccCCcceeccccccccccchhhHHHHHHHHHHHHHHHHHHHHhcccccc Q lcl|Aclame:pro 289 ILANEVMVELNREIVNLVNSQAQIGKSGWTQGAGAAGVFDFSDAVDVKGARWAGEAYKALLIQIEKEANEIGRQTGRGNG 368 (514) Q Consensus 289 ILStEImlEINReii~~l~~~a~v~~~~~~~~v~~~g~~dl~~~~d~~~~rwa~e~~r~L~~~i~~~a~~I~~~T~r~~~ 368 (514) -|+..|..++|.||+..+..... . +.. .+++. .+..+..++ .++ -... T Consensus 70 Q~~~~iA~kvD~di~~~~~~a~l-~-------~~~--~~t~d----------~i~~A~~~f---gde---------~~~~ 117 (231) T protein:vir:73 70 QLGLSLANKVDDDLLKAAKTTSQ-T-------VST--KANVD----------GVQAALDIF---NDE---------DAQA 117 (231) T ss_pred HHHHHHHHhhhHHHHHhhccccc-c-------ccc--cccHH----------HHHHHHHHh---ccc---------cccc Confidence 99999999999999877643221 0 111 11111 111222221 221 2456 Q ss_pred cEEEEChhHHhHHhhccccccchhccccCccccccccCceEE---EEecCceEEEecCCCccceEEEEEecCCCccccee Q lcl|Aclame:pro 369 NFIIASRNVVSALSMTDTLVGPAAQGMQDGSMNTDTNQTVFA---GVLGGRFKVYIDQYAVNDYFTVGFKGSTEMDAGVF 445 (514) Q Consensus 369 n~~v~S~~va~~L~~~g~~~~~~~~~~~~~~~~~d~~~~~~~---G~l~~~~~vy~D~y~~~dy~~vG~kG~~~~~~~~f 445 (514) .++||+|+++..|...--+...... -..+...- |.+ .|++|+++...+. ++.+ T Consensus 118 ~vivv~p~~~~~Lrk~~~~~~~~~~---------~g~~i~~~G~iG~i-~G~~Vi~S~~~~~--------------~~~~ 173 (231) T protein:vir:73 118 YVLIVNPKDAAKIRKDANAKNIGSE---------VGANALINGTYADV-LGAQIVRSKKLAE--------------GSAL 173 (231) T ss_pred eEEEEcchHHHhhhhccchhhhhhh---------hccceeeecccceE-cceEEEEcCCCCC--------------Ccee Confidence 7999999999888652111100000 00111111 344 3578888877663 2234 Q ss_pred eccccccc-----------ccc-ccCCccccceeeeeeeeeee-ecCccccccCcceeecCcchhhhccccceeeeeeee Q lcl|Aclame:pro 446 YSPYVPLT-----------PLR-GSDSKNFQPVIGFKTRYGVQ-VNPFADPTASATKVGNGAPVAASMGKNAYFRRVFVK 512 (514) Q Consensus 446 y~PYv~~~-----------~~~-~~dp~s~qp~~~~~tRY~l~-~nPf~~~~~~~~~i~~~~~~~~~~~~~~~~r~~~V~ 512 (514) +++|+... .++ ..|+..+.-.+--.-.|++. .|| .=...+.+| T Consensus 174 ~~~~i~~~gAl~~~~k~~~~vEtdRd~~~k~~~i~~~~~y~v~l~~~------------------------~~vv~~t~~ 229 (231) T protein:vir:73 174 MFKIVSNSPALKLVLKRGVQVETDRDIVTKTTVITADEHYAAYLYDL------------------------TKVVNITFT 229 (231) T ss_pred eeeEEeeccceeeeecccceeeccccccccccEEEEeEEEEEEEEcC------------------------ccEEEEEee Confidence 44443210 000 13455554444444444432 122 112345556 Q ss_pred cC Q lcl|Aclame:pro 513 GL 514 (514) Q Consensus 513 ~~ 514 (514) |+ T Consensus 230 g~ 231 (231) T protein:vir:73 230 GV 231 (231) T ss_pred cC Confidence 66 No 51 >protein:vir:94142 Length: 304 # NCBI annotation: ORF013 # Family: family:all:507 # MgeID: mge:1494 # MgeName: 96 # Cross-refs: genbank:acc:YP_240234;genbank:gi:66395898;genbank:GeneID:5133311 Probab=76.55 E-value=0.13 Score=25.36 Aligned_cols=281 Identities=11% Similarity=0.050 Sum_probs=108.0 Q ss_pred CccccccccccCCCCccCccccccccccccccccccccccccccccccccccccccccccccccccccCccccccccccc Q lcl|Aclame:pro 133 LTGAEAFHPTRQADASFSGQAAASTIADFPTTGAATDGTPYKAEVTTSGGDVSMRYFLALGAVTLAVAGQMTATEYTDGV 212 (514) Q Consensus 133 ~tg~EA~~~~nEadt~fSG~~~~~~~~~~~~~~~~t~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 212 (514) +++.+ ....+.... ..++...........- ................+.... ...+ T Consensus 1 ma~~~---------------~~~~~~~~t-~~gg~lip~~~~~~ii----~~~~~~~~l~~~~~~~~~~~~-~~~i---- 55 (304) T protein:vir:94 1 MATPT---------------YTPGNVILS-DFKNGVIPAEQGTLIM----KDIMANSAIMKLAKNEPMTAQ-KKKF---- 55 (304) T ss_pred Ccccc---------------ccccccccc-CCCceecchhHHHHHH----HHHHhccchhhhcceeeccCC-ceEE---- Confidence 11110 000000000 0000000000000000 000000000000000000000 0000 Q ss_pred cccccccccccccchhhhccccCCCCCCcccccceeEEEEEEEEeecccccccccHHHHHHHHhhcCCChhHHHHHHHHH Q lcl|Aclame:pro 213 AGGLLVEIDAGMATSQAELQENFNGSSNNEWNEMSFRIDKQVVEAKSRQLKAQYSIELAQDLRAVHGLDADAELSGILAN 292 (514) Q Consensus 213 a~~~~y~~~~GmtTa~aEal~~~ggs~~~~f~EMsFsIEK~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAEaELanILSt 292 (514) ....+.+-..-.+| +.++++-.-++++++++.|..+-...+|-||.+|- .+|.++.|.+-|.. T Consensus 56 ----p~~~~~~~a~~v~E---------~~~~~~~~~~~~~i~~~~~k~~~~~~iS~ell~ds----~~~l~~~i~~~l~~ 118 (304) T protein:vir:94 56 ----TYLAKGVGAYWVSE---------TERIQTSKPEYAQAEMEAKKIGVIIPLSKEFLKWT----AKDFFNEVKPLIAE 118 (304) T ss_pred ----EEEeCCcceEEeec---------CcccccccceeeEEEEEEEEEEEeehhhHHHHhcc----hHHHHHHHHHHHHH Confidence 00000000000111 34567777788888888888888999999999985 36678899999999 Q ss_pred HHHHHhhHHHHHHHhhheeecccccccccCCcceeccccccccccchhhHHHHHHHHHHHHHHHHHHHHhcccccccEEE Q lcl|Aclame:pro 293 EVMVELNREIVNLVNSQAQIGKSGWTQGAGAAGVFDFSDAVDVKGARWAGEAYKALLIQIEKEANEIGRQTGRGNGNFII 372 (514) Q Consensus 293 EImlEINReii~~l~~~a~v~~~~~~~~v~~~g~~dl~~~~d~~~~rwa~e~~r~L~~~i~~~a~~I~~~T~r~~~n~~v 372 (514) .|...||+.++.- .-+ ....+....+.+.-...... ........+..|+++...|... ..+...+| T Consensus 119 ~ia~~~d~~~l~G---~g~----~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~i~~~~~~l~~~--~~~~~~~v 184 (304) T protein:vir:94 119 AFYKAFDQAVIFG---TKS----PYNTSTSGKPLVEGAEEKGN-----VVTDTNNLYVDLSALMATIEDE--ELDPNGVL 184 (304) T ss_pred HHHHHHHhhheec---cCC----Cccccccccccccccccccc-----ccccccchHHHHHHHHHHhhhc--cCCcCEEE Confidence 9999998888432 110 00011111111110000000 0001112233344555555432 34455689 Q ss_pred EChhHHhHHhhccccccchhccccCccccccccCceEEEEecCceEEEecCCCccc------------eEEEEEecCCCc Q lcl|Aclame:pro 373 ASRNVVSALSMTDTLVGPAAQGMQDGSMNTDTNQTVFAGVLGGRFKVYIDQYAVND------------YFTVGFKGSTEM 440 (514) Q Consensus 373 ~S~~va~~L~~~g~~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~d------------y~~vG~kG~~~~ 440 (514) |+|.....|... .. ..| .+. ..++. |+|. |++||++++.+.+ ++++|..++.+. T Consensus 185 ~~~~~~~~L~~l---kd--~~G---~~l---~~~~~--~~l~-G~PV~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~i 250 (304) T protein:vir:94 185 TTRSFRSKMRNA---LD--AND---RPL---FDANG--NEIM-GLPLSYTGADVYDKKKSLALMGDWDYARYGILQGIEY 250 (304) T ss_pred EcHHHHHHHHHh---hc--cCC---cEe---ecCCC--cccc-ceeeEEecccccCCCCcEEEEEehhhEEEEEecceEE Confidence 999999888742 11 000 111 11111 4554 4699988887632 233343332221 Q ss_pred ccceeeccccc--cccccccCCc-----ccc---ceeeeeeeeee-eecCccccccCcceeecCc Q lcl|Aclame:pro 441 DAGVFYSPYVP--LTPLRGSDSK-----NFQ---PVIGFKTRYGV-QVNPFADPTASATKVGNGA 494 (514) Q Consensus 441 ~~~~fy~PYv~--~~~~~~~dp~-----s~q---p~~~~~tRY~l-~~nPf~~~~~~~~~i~~~~ 494 (514) +- ..+ ..+....|++ -|+ =.+=+..||++ ..+| ....++...+ T Consensus 251 ~~------~~e~~~~~~~~~~~~g~~~~~f~~~~~~~r~~~r~~~~v~~~-----~a~~~l~~a~ 304 (304) T protein:vir:94 251 AI------SEDATLTTLQASDASGQPVSLFERDMFALRATMHIAYMNVKP-----EAFATLKPTE 304 (304) T ss_pred EE------eecceeeeecccccCccchhhhhcCcEEEEEEEEeccEeecc-----cceEEEEecC Confidence 10 000 0011111222 122 22333457776 3444 1112222222 No 52 >protein:vir:105905 Length: 304 # NCBI annotation: major capsid protein # Family: family:all:507 # MgeID: mge:1514 # MgeName: phiETA3 # Cross-refs: genbank:acc:YP_001004375;genbank:gi:122891830;genbank:GeneID:4712376 Probab=76.55 E-value=0.13 Score=25.36 Aligned_cols=281 Identities=11% Similarity=0.050 Sum_probs=108.0 Q ss_pred CccccccccccCCCCccCccccccccccccccccccccccccccccccccccccccccccccccccccCccccccccccc Q lcl|Aclame:pro 133 LTGAEAFHPTRQADASFSGQAAASTIADFPTTGAATDGTPYKAEVTTSGGDVSMRYFLALGAVTLAVAGQMTATEYTDGV 212 (514) Q Consensus 133 ~tg~EA~~~~nEadt~fSG~~~~~~~~~~~~~~~~t~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 212 (514) +++.+ ....+.... ..++...........- ................+.... ...+ T Consensus 1 ma~~~---------------~~~~~~~~t-~~gg~lip~~~~~~ii----~~~~~~~~l~~~~~~~~~~~~-~~~i---- 55 (304) T protein:vir:10 1 MATPT---------------YTPGNVILS-DFKNGVIPAEQGTLIM----KDIMANSAIMKLAKNEPMTAQ-KKKF---- 55 (304) T ss_pred Ccccc---------------ccccccccc-CCCceecchhHHHHHH----HHHHhccchhhhcceeeccCC-ceEE---- Confidence 11110 000000000 0000000000000000 000000000000000000000 0000 Q ss_pred cccccccccccccchhhhccccCCCCCCcccccceeEEEEEEEEeecccccccccHHHHHHHHhhcCCChhHHHHHHHHH Q lcl|Aclame:pro 213 AGGLLVEIDAGMATSQAELQENFNGSSNNEWNEMSFRIDKQVVEAKSRQLKAQYSIELAQDLRAVHGLDADAELSGILAN 292 (514) Q Consensus 213 a~~~~y~~~~GmtTa~aEal~~~ggs~~~~f~EMsFsIEK~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAEaELanILSt 292 (514) ....+.+-..-.+| +.++++-.-++++++++.|..+-...+|-||.+|- .+|.++.|.+-|.. T Consensus 56 ----p~~~~~~~a~~v~E---------~~~~~~~~~~~~~i~~~~~k~~~~~~iS~ell~ds----~~~l~~~i~~~l~~ 118 (304) T protein:vir:10 56 ----TYLAKGVGAYWVSE---------TERIQTSKPEYAQAEMEAKKIGVIIPLSKEFLKWT----AKDFFNEVKPLIAE 118 (304) T ss_pred ----EEEeCCcceEEeec---------CcccccccceeeEEEEEEEEEEEeehhhHHHHhcc----hHHHHHHHHHHHHH Confidence 00000000000111 34567777788888888888888999999999985 36678899999999 Q ss_pred HHHHHhhHHHHHHHhhheeecccccccccCCcceeccccccccccchhhHHHHHHHHHHHHHHHHHHHHhcccccccEEE Q lcl|Aclame:pro 293 EVMVELNREIVNLVNSQAQIGKSGWTQGAGAAGVFDFSDAVDVKGARWAGEAYKALLIQIEKEANEIGRQTGRGNGNFII 372 (514) Q Consensus 293 EImlEINReii~~l~~~a~v~~~~~~~~v~~~g~~dl~~~~d~~~~rwa~e~~r~L~~~i~~~a~~I~~~T~r~~~n~~v 372 (514) .|...||+.++.- .-+ ....+....+.+.-...... ........+..|+++...|... ..+...+| T Consensus 119 ~ia~~~d~~~l~G---~g~----~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~i~~~~~~l~~~--~~~~~~~v 184 (304) T protein:vir:10 119 AFYKAFDQAVIFG---TKS----PYNTSTSGKPLVEGAEEKGN-----VVTDTNNLYVDLSALMATIEDE--ELDPNGVL 184 (304) T ss_pred HHHHHHHhhheec---cCC----Cccccccccccccccccccc-----ccccccchHHHHHHHHHHhhhc--cCCcCEEE Confidence 9999998888432 110 00011111111110000000 0001112233344555555432 34455689 Q ss_pred EChhHHhHHhhccccccchhccccCccccccccCceEEEEecCceEEEecCCCccc------------eEEEEEecCCCc Q lcl|Aclame:pro 373 ASRNVVSALSMTDTLVGPAAQGMQDGSMNTDTNQTVFAGVLGGRFKVYIDQYAVND------------YFTVGFKGSTEM 440 (514) Q Consensus 373 ~S~~va~~L~~~g~~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~d------------y~~vG~kG~~~~ 440 (514) |+|.....|... .. ..| .+. ..++. |+|. |++||++++.+.+ ++++|..++.+. T Consensus 185 ~~~~~~~~L~~l---kd--~~G---~~l---~~~~~--~~l~-G~PV~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~i 250 (304) T protein:vir:10 185 TTRSFRSKMRNA---LD--AND---RPL---FDANG--NEIM-GLPLSYTGADVYDKKKSLALMGDWDYARYGILQGIEY 250 (304) T ss_pred EcHHHHHHHHHh---hc--cCC---cEe---ecCCC--cccc-ceeeEEecccccCCCCcEEEEEehhhEEEEEecceEE Confidence 999999888742 11 000 111 11111 4554 4699988887632 233343332221 Q ss_pred ccceeeccccc--cccccccCCc-----ccc---ceeeeeeeeee-eecCccccccCcceeecCc Q lcl|Aclame:pro 441 DAGVFYSPYVP--LTPLRGSDSK-----NFQ---PVIGFKTRYGV-QVNPFADPTASATKVGNGA 494 (514) Q Consensus 441 ~~~~fy~PYv~--~~~~~~~dp~-----s~q---p~~~~~tRY~l-~~nPf~~~~~~~~~i~~~~ 494 (514) +- ..+ ..+....|++ -|+ =.+=+..||++ ..+| ....++...+ T Consensus 251 ~~------~~e~~~~~~~~~~~~g~~~~~f~~~~~~~r~~~r~~~~v~~~-----~a~~~l~~a~ 304 (304) T protein:vir:10 251 AI------SEDATLTTLQASDASGQPVSLFERDMFALRATMHIAYMNVKP-----EAFATLKPTE 304 (304) T ss_pred EE------eecceeeeecccccCccchhhhhcCcEEEEEEEEeccEeecc-----cceEEEEecC Confidence 10 000 0011111222 122 22333457776 3444 1112222222 No 53 >protein:vir:3613 Length: 272 # NCBI annotation: MHP # Family: family:all:522 # MgeID: mge:74 # MgeName: TP901-1 # Cross-refs: genbank:acc:NP_112699;genbank:gi:13786567;genbank:GeneID:921035 Probab=75.92 E-value=0.14 Score=25.24 Aligned_cols=263 Identities=11% Similarity=0.017 Sum_probs=109.5 Q ss_pred ccccccccccccccccccccccccccccccccccccc---ccccccccccCccccccccccccccccccccccccchhhh Q lcl|Aclame:pro 154 AASTIADFPTTGAATDGTPYKAEVTTSGGDVSMRYFL---ALGAVTLAVAGQMTATEYTDGVAGGLLVEIDAGMATSQAE 230 (514) Q Consensus 154 ~~~~~~~~~~~~~~t~~~~~~~~~~~~~g~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~a~~~~y~~~~GmtTa~aE 230 (514) .++.. +...+ .-...++..+.. .............. ... .. .+.+.+++.--....+| T Consensus 1 ma~~~--------T~~~d-------~iiPev~~~~v~~~~~~~~~~~~~~~~~~--~l~-g~-~G~ti~iP~~~~~gda~ 61 (272) T protein:vir:36 1 MSKQK--------TTLAD-------LVNPEVLAPIVSYELNKALRFAPLAQVDT--TLQ-GQ-PGNTLKFPAFTYIGDAA 61 (272) T ss_pred CCCcc--------eehhh-------hhchHHHHHHHHHHHHhhhhhcccccccc--ccc-cC-CCCEEEEeeeccCcccc Confidence 11100 00000 000000000000 00000000000000 000 01 12222232211222333 Q ss_pred ccccCCCCCCcccccceeEEEEEEEEeecccccccccHHHHHHHHhhcCCChhHHHHHHHHHHHHHHhhHHHHHHHhhhe Q lcl|Aclame:pro 231 LQENFNGSSNNEWNEMSFRIDKQVVEAKSRQLKAQYSIELAQDLRAVHGLDADAELSGILANEVMVELNREIVNLVNSQA 310 (514) Q Consensus 231 al~~~ggs~~~~f~EMsFsIEK~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAEaELanILStEImlEINReii~~l~~~a 310 (514) .+. ....-+..+ .+..+.+++-|-|+-.-++|=|. ++.-+-|.-.|..+-++..++.+++++|+..+... T Consensus 62 ~~~---eg~~i~~~~--lt~~~~~~~i~~~~k~~~vtD~~----~~~~~~d~~~~~~~~~a~~~a~~~d~~i~~~l~~~- 131 (272) T protein:vir:36 62 DVA---EGGEISLDK--IGTTTKSVTIKKAAKGTEITDEA----ALSGYGDPIGESNKQLGLSLANKVDDDLLSAAKTT- 131 (272) T ss_pred ccC---CCCccChhh--cCCcceeEeeehhhccccccHHH----HhhccchHHHHHHHHHHHHHHHHHHHHHHHHhccc- Confidence 221 111223333 34555666666555322232222 12225789999999999999999999997776321 Q ss_pred eecccccccccCCcceeccccccccccchhhHHHHHHHHHHHHHHHHHHHHhcccccccEEEEChhHHhHHhhccccccc Q lcl|Aclame:pro 311 QIGKSGWTQGAGAAGVFDFSDAVDVKGARWAGEAYKALLIQIEKEANEIGRQTGRGNGNFIIASRNVVSALSMTDTLVGP 390 (514) Q Consensus 311 ~v~~~~~~~~v~~~g~~dl~~~~d~~~~rwa~e~~r~L~~~i~~~a~~I~~~T~r~~~n~~v~S~~va~~L~~~g~~~~~ 390 (514) ...+.. .+++ +..-.+..++.++. ...++++|+|+++..|..-.-+... T Consensus 132 -------~~~~~~--~~~~-------------d~i~~A~~~lgd~~---------~~~~~ivv~p~~~~~L~k~~~~~~~ 180 (272) T protein:vir:36 132 -------SQTVST--KANV-------------DGVQAALDIFNDED---------AQAYVLIVNPKDAAKIRKDANAKNI 180 (272) T ss_pred -------cccccc--cccH-------------HHHHHHHHHhhhcC---------CCceEEEEcHHHHHHHhcccccccc Confidence 111111 1111 11222333333322 3467999999999888643332221 Q ss_pred hhccccCccccccccCceEEEEecCceEEEecCCCccc---eEEEEE-ecCCCcccceeecccccccccc-ccCCccccc Q lcl|Aclame:pro 391 AAQGMQDGSMNTDTNQTVFAGVLGGRFKVYIDQYAVND---YFTVGF-KGSTEMDAGVFYSPYVPLTPLR-GSDSKNFQP 465 (514) Q Consensus 391 ~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~d---y~~vG~-kG~~~~~~~~fy~PYv~~~~~~-~~dp~s~qp 465 (514) ..... .+.-.++. .|.+. |++|++|...|.+ |..+.. +|. -.+|..= .. .++ ..|+..++= T Consensus 181 ~~~~~----~~~~~~G~--ig~~~-G~~Vv~s~~~p~~~~~~~~~~~~~gA-----~~~~~~~-~~-~vE~~R~~~~~~d 246 (272) T protein:vir:36 181 GSEVG----ANALINGT--YADVL-GAQIVRSKKLAEGSALMFKIVSNSPA-----LKLVLKR-GV-QVETDRDIVTKTT 246 (272) T ss_pred ccccc----ccceeeec--cceec-CeeEEEeCCCCCCceeEEEEEecccc-----eeeeecC-Cc-ccccccchhhcCc Confidence 11000 00001111 24553 5899999998853 222222 221 1122211 11 122 358999998 Q ss_pred eeeeeeeeee-eecCccccccCcceeecCcchhhhccccceeeeeeeecC Q lcl|Aclame:pro 466 VIGFKTRYGV-QVNPFADPTASATKVGNGAPVAASMGKNAYFRRVFVKGL 514 (514) Q Consensus 466 ~~~~~tRY~l-~~nPf~~~~~~~~~i~~~~~~~~~~~~~~~~r~~~V~~~ 514 (514) .+--.-+||+ ..||=. ...+..||+ T Consensus 247 ~i~~~~~y~~~v~~~~~------------------------vv~~t~~g~ 272 (272) T protein:vir:36 247 VITADEHYAAYLYDLTK------------------------VVNITFTGV 272 (272) T ss_pred EEEEEEEEEEEEEcCcc------------------------EEEEeecCC Confidence 8888888887 345511 112223333 No 54 >protein:vir:4997 Length: 397 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:109 # MgeName: Sfi21 # Cross-refs: genbank:acc:NP_049971;genbank:gi:9632943;genbank:GeneID:1262106 Probab=75.70 E-value=0.14 Score=25.20 Aligned_cols=325 Identities=12% Similarity=0.093 Sum_probs=112.8 Q ss_pred Cc-------chhhhhhh---hcccccc-ccccccchhhhhh--hhhhhhHHHHHHhcccccchhhhhhhccccccccccc Q lcl|Aclame:pro 1 MN-------LTEKWKDL---LEAEGAD-MPEIATATKQKIM--SKIFENQDRDINNDPMYRDPQLVEAFNAGLNEAVVNG 67 (514) Q Consensus 1 ~~-------l~~kw~p~---l~~~~~~-~~~i~~~~~~~~~--~~~~enq~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~ 67 (514) .+ |.++=.-+ ++..... ........++.+. ....+..++ .+|...+... T Consensus 40 ~~l~~ei~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--------------~~~~~~l~~~---- 101 (397) T protein:vir:49 40 QAIKNERDTAKMKRDLFKEQYTEARANEVANMSEEEKKPLTKNEEEVKANFV--------------KDFKNLVRGR---- 101 (397) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccccccccchhhHHHHHHH--------------HHHHHHhhcc---- Confidence 00 00000000 0000000 0000000000000 000001110 1111111100 Q ss_pred ccccccccccccc-ccccccccceeeehhhhhhhhhhhhcceeEEecCCcccceeeeeeeeecCCCCccccccccccCCC Q lcl|Aclame:pro 68 DHGYDPANIAQGV-TTGAVTNIGPTVMGMVRRAIPQLIAFDIAGVQPMTGPTSQVFTLRSVYGKDPLTGAEAFHPTRQAD 146 (514) Q Consensus 68 ~~g~~~~~~~~st-~tg~v~~~~P~l~~l~Rra~~~LIa~DI~GVQPmTgPTGLIFAMRSrY~~~~~tg~EA~~~~nEad 146 (514) ....... ...++ +.|.+.--..+.=.+++.+-++..-.+++.|+||++.+|-+--.+ .... ...+ T Consensus 102 ~~~~~~~-~~~~t~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~--~~~~---~~~a-------- 167 (397) T protein:vir:49 102 YQNLLDS-KTDGSGSDAGLTIPQDIRTAINTLVRQFDSLQEYVNVENVTTLTGSRVYEK--WADI---TGLA-------- 167 (397) T ss_pred hhhHHHh-hhccCCccCcceecHHHHHHHHHHHHhhhhHhhhcceeeccCCcceEEEEe--eccC---Ccce-------- Confidence 0000000 00011 111111000001123344446667778999999999876432111 1110 0000 Q ss_pred CccCccccccccccccccccccccccccccccccccccccccccccccccccccCccccccccccccccccccccccccc Q lcl|Aclame:pro 147 ASFSGQAAASTIADFPTTGAATDGTPYKAEVTTSGGDVSMRYFLALGAVTLAVAGQMTATEYTDGVAGGLLVEIDAGMAT 226 (514) Q Consensus 147 t~fSG~~~~~~~~~~~~~~~~t~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~y~~~~GmtT 226 (514) .|-+. + T Consensus 168 -~~v~E----------------------------------------------------------------------~--- 173 (397) T protein:vir:49 168 -KLDDE----------------------------------------------------------------------G--- 173 (397) T ss_pred -eeecc----------------------------------------------------------------------c--- Confidence 01000 0 Q ss_pred hhhhccccCCCCCCcccccceeEEEEEEEEeecccccccccHHHHHHHHhhcCCChhHHHHHHHHHHHHHHhhHHHHHHH Q lcl|Aclame:pro 227 SQAELQENFNGSSNNEWNEMSFRIDKQVVEAKSRQLKAQYSIELAQDLRAVHGLDADAELSGILANEVMVELNREIVNLV 306 (514) Q Consensus 227 a~aEal~~~ggs~~~~f~EMsFsIEK~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAEaELanILStEImlEINReii~~l 306 (514) +.. ..+....|.++.|++.|. +-...+|-||.+|-. +|.+++|.+-|+..|..-+|+.||.-. T Consensus 174 ---~~~---~~~~~~~~~~v~~~~~k~-------~~~~~iS~ell~ds~----~~l~~~i~~~l~~~~~~~~d~ail~G~ 236 (397) T protein:vir:49 174 ---GQI---GQNDDPKLSLIRYAIKRY-------AGISTVTNSLLADSA----ENILAWLSGWIAKKVVVTRNKAILEAI 236 (397) T ss_pred ---ccc---ccccccceeeeEeeeeee-------EeehhhHHHHHhhhh----HHHHHHHHHHHHHHHHHHHHHHHHhcc Confidence 000 000112345544555444 445669999999853 567999999999999999999985332 Q ss_pred hhheeecccccccccCCcceeccccccccccchhhHHHHHHHHHHHHHHHHHHHHhcccccccEEEEChhHHhHHhhccc Q lcl|Aclame:pro 307 NSQAQIGKSGWTQGAGAAGVFDFSDAVDVKGARWAGEAYKALLIQIEKEANEIGRQTGRGNGNFIIASRNVVSALSMTDT 386 (514) Q Consensus 307 ~~~a~v~~~~~~~~v~~~g~~dl~~~~d~~~~rwa~e~~r~L~~~i~~~a~~I~~~T~r~~~n~~v~S~~va~~L~~~g~ 386 (514) -+ +....+.+++ +-...|+..+.. .......+|++|.....|.... T Consensus 237 ---g~--------~~~~~~~~~~-------------d~i~~~~~~l~~---------~~~~~a~~v~n~~~~~~l~~lk- 282 (397) T protein:vir:49 237 ---GT--------LPNKPTLAKW-------------DDIIDLQAKVDP---------AIKQTSLFLTNTSGFTALKKVK- 282 (397) T ss_pred ---cc--------ccccccccCH-------------HHHHHHHHhhhh---------hhcCCCEEEEcHHHHHHHHHhh- Confidence 11 0111222221 123334333322 2344567899999988887531 Q ss_pred cccchhccccCccccccccCceEEEEecCceEEEe--cCCCcc----c----------eEEEEEecCCCcccceeecccc Q lcl|Aclame:pro 387 LVGPAAQGMQDGSMNTDTNQTVFAGVLGGRFKVYI--DQYAVN----D----------YFTVGFKGSTEMDAGVFYSPYV 450 (514) Q Consensus 387 ~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~--D~y~~~----d----------y~~vG~kG~~~~~~~~fy~PYv 450 (514) + ..|. .-...+... -..++|.|. +|++ |...+. + |++++..+.-. +-..||. T Consensus 283 -d---~~g~--~l~~~~~~~-g~~~~l~G~-pV~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~----i~~~~~~ 350 (397) T protein:vir:49 283 -N---AMGD--YLMERDVKS-PTGYSIDGF-VVKEISDRFLPNGTGGAMPLYFGDLKQAVTLFDRQHLS----LLSTNIG 350 (397) T ss_pred -c---cCCc--eeecccccC-CCCceecce-eeEEecccccccccCCceeEEEeeccceEEEEeecccE----EEEeccc Confidence 0 0000 000001110 111456554 6554 222221 1 22222222111 1122221 Q ss_pred ccccccccCCccccceeeeeeeeeee-ecC--ccccccCcceeecCcchhhhccc Q lcl|Aclame:pro 451 PLTPLRGSDSKNFQPVIGFKTRYGVQ-VNP--FADPTASATKVGNGAPVAASMGK 502 (514) Q Consensus 451 ~~~~~~~~dp~s~qp~~~~~tRY~l~-~nP--f~~~~~~~~~i~~~~~~~~~~~~ 502 (514) - .+-...+=.+-...|++.. .+| |...+- +...+.-+.....|. T Consensus 351 ~------~~~~~~~~~~~~~~r~d~~~~~~~a~~~~~~--~~~~~~~~~~~~~~~ 397 (397) T protein:vir:49 351 G------GAFETDTTKVRVIDRFDVVSTDTEAFVPASF--KAIADQKAKLSTAGA 397 (397) T ss_pred c------chhhcCeeeEEEEEeeccEEecccceEEEEe--cccccccCcccccCC Confidence 1 1112333444555666652 233 221110 000000001111111 No 55 >protein:vir:81227 Length: 413 # NCBI annotation: gp6, major capsid protein # Family: family:all:585 # MgeID: mge:1893 # MgeName: BFK20 # Cross-refs: genbank:acc:YP_001456736;genbank:gi:157168379;hssp:P49861;interpro:IPR006444;uniprot:Q9MBJ9;genbank:GeneID:5580350 Probab=75.23 E-value=0.15 Score=25.12 Aligned_cols=341 Identities=13% Similarity=0.059 Sum_probs=116.4 Q ss_pred Ccchhhhhhh---------------hccccccccccccchhhhhhhhhhhhHHHHHHh--cccccchh-hhhhhcccccc Q lcl|Aclame:pro 1 MNLTEKWKDL---------------LEAEGADMPEIATATKQKIMSKIFENQDRDINN--DPMYRDPQ-LVEAFNAGLNE 62 (514) Q Consensus 1 ~~l~~kw~p~---------------l~~~~~~~~~i~~~~~~~~~~~~~enq~~~~~~--~~~~~~~~-~~~~~~~~~~~ 62 (514) ..+.+.=..+ ++..+...+.+.+. ..+-.+.+.+..+. ........ ...-+.+.... T Consensus 42 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 116 (413) T protein:vir:81 42 KANQDFLRELQEATAGSVDSEKSGELTRKGEGYKSIGEF-----FAKRAGDQIKQQAGGAQLNYSVGEYVAPRVKAASDP 116 (413) T ss_pred HHHHHHHHHHHHHHHhHHhHHHhhhHhhhhhhhhhhhhh-----hhhhhhhHHHHHHHHHHhhhhhhhhhhhHHHhhhhh Confidence 0000000000 00000000000000 00000111111000 00000000 00000001000 Q ss_pred cccccccccccccccccccccc--ccccceeeehhhhhhhhhhhhcceeEEecCCcccceeeeeeeeecCCCCccccccc Q lcl|Aclame:pro 63 AVVNGDHGYDPANIAQGVTTGA--VTNIGPTVMGMVRRAIPQLIAFDIAGVQPMTGPTSQVFTLRSVYGKDPLTGAEAFH 140 (514) Q Consensus 63 a~~~~~~g~~~~~~~~st~tg~--v~~~~P~l~~l~Rra~~~LIa~DI~GVQPmTgPTGLIFAMRSrY~~~~~tg~EA~~ 140 (514) +..++. ++..+. -+.+.+-++.++ -+..+-.+++.|+||++++.-+.-... .... T Consensus 117 ~~~~~~----------~~~~~~~vp~~~~~~ii~~~---~~~~~l~~~~~~~~~~~~~~~~~~~~~-~~~~--------- 173 (413) T protein:vir:81 117 ASTATL----------TDEFQGGYGTTWNRNIIYRR---REKLVVADLMDNLTMTNTTIKYLMEKA-NRVV--------- 173 (413) T ss_pred hhhccc----------ccccccccchhhHHHHHHHH---hhhhhHHhhcceeeccCCceeEEEecc-cccc--------- Confidence 000000 000110 112223334444 456677899999999997643321110 0000 Q ss_pred cccCCCCccCccccccccccccccccccccccccccccccccccccccccccccccccccCccccccccccccccccccc Q lcl|Aclame:pro 141 PTRQADASFSGQAAASTIADFPTTGAATDGTPYKAEVTTSGGDVSMRYFLALGAVTLAVAGQMTATEYTDGVAGGLLVEI 220 (514) Q Consensus 141 ~~nEadt~fSG~~~~~~~~~~~~~~~~t~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~y~~ 220 (514) ..++.| + T Consensus 174 ---~~~a~~----------------------------------------------------------------------v 180 (413) T protein:vir:81 174 ---EGGFKT----------------------------------------------------------------------V 180 (413) T ss_pred ---ccccce----------------------------------------------------------------------e Confidence 000000 0 Q ss_pred cccccchhhhccccCCCCCCcccccceeEEEEEEEEeecccccccccHHHHHHHHhhcCCChhHHHHHHHHHHHHHHhhH Q lcl|Aclame:pro 221 DAGMATSQAELQENFNGSSNNEWNEMSFRIDKQVVEAKSRQLKAQYSIELAQDLRAVHGLDADAELSGILANEVMVELNR 300 (514) Q Consensus 221 ~~GmtTa~aEal~~~ggs~~~~f~EMsFsIEK~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAEaELanILStEImlEINR 300 (514) ++|-.. .| +....|.+..|.+.|.. -...+|-||.+|--+ .++.|.+-|+..|...+|+ T Consensus 181 ~Eg~~~--~~-------~~~~~f~~i~~~~~k~~-------~~~~iS~ell~ds~~-----l~~~i~~~la~~~~~~~d~ 239 (413) T protein:vir:81 181 AEGGKK--PY-------MRFADFDIVTESLSKIA-------GLTKITDEMIEDYDF-----LVSYINARLLEELAIEEER 239 (413) T ss_pred cCcccc--cc-------cCcccceeeEeeeeeEE-------EeehhhHHHHHHHHH-----HHHHHHHHHHHHHHHHHHH Confidence 000000 00 00123555555555554 445689999998632 4788888888888888888 Q ss_pred HHHHHHhhheeecccccccccCCcceeccccccccccchhhHHHHHHHHHHHHHHHHHHHHhcccccccEEEEChhHHhH Q lcl|Aclame:pro 301 EIVNLVNSQAQIGKSGWTQGAGAAGVFDFSDAVDVKGARWAGEAYKALLIQIEKEANEIGRQTGRGNGNFIIASRNVVSA 380 (514) Q Consensus 301 eii~~l~~~a~v~~~~~~~~v~~~g~~dl~~~~d~~~~rwa~e~~r~L~~~i~~~a~~I~~~T~r~~~n~~v~S~~va~~ 380 (514) .||. -..+ ++ ...|+++......... .....++.-|.+....+.... .+..+.+|++|..... T Consensus 240 ~~l~---G~G~-~~-------~~~Gi~~~~~~~~~~~-----~~~~~~~~~i~~~~~~~~~~~-~~~~~~~vmn~~~~~~ 302 (413) T protein:vir:81 240 QLLL---GDGT-GN-------NLTGLLKRDGIQTLAV-----SNKDELADSIYKAMTNISLAT-PFQADALVINPLDYQE 302 (413) T ss_pred HHhc---cCCC-CC-------cccccccccccccccc-----cccchhHHHHHHHHHHhhhhc-cCCCcEEEEcHHHHHH Confidence 7742 1111 00 1234433221111100 011122222222223332222 3455668899998877 Q ss_pred HhhccccccchhccccCccccccccCceEEEEecCceEEEecCCCccceEEEEEecCC--Ccc---cceeeccccccccc Q lcl|Aclame:pro 381 LSMTDTLVGPAAQGMQDGSMNTDTNQTVFAGVLGGRFKVYIDQYAVNDYFTVGFKGST--EMD---AGVFYSPYVPLTPL 455 (514) Q Consensus 381 L~~~g~~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~kG~~--~~~---~~~fy~PYv~~~~~ 455 (514) |....--..-+...........+ -.....++|. |++|+++...+..-+++|---.. -.+ -.+=..+|... T Consensus 303 l~~lkd~~G~~l~~~~~~~~~~~-~~~~~~~~l~-G~pv~~s~~~~~~~~~~gd~~~~~~~~~~~~~~v~~~~~~~~--- 377 (413) T protein:vir:81 303 LRLAKDANGQYYGGGVFQGQYGS-GGIMLDPAPW-GLRTVQSQVVPVGKPVVGAFRSAASVLRKGGVRIDSTNTNVD--- 377 (413) T ss_pred HHHhhccCCceeccccccccccc-cccccCceec-ceeeEEcCCCCcccEEEEecccEEEEEEecceEEEEeccccc--- Confidence 75321000000000000000000 0000113444 66999998877655555421100 000 00111111110 Q ss_pred cccCCccccceeeeeeeeeee-ecC--ccccccCcceeecCcchhhhccccceeeeeeeecC Q lcl|Aclame:pro 456 RGSDSKNFQPVIGFKTRYGVQ-VNP--FADPTASATKVGNGAPVAASMGKNAYFRRVFVKGL 514 (514) Q Consensus 456 ~~~dp~s~qp~~~~~tRY~l~-~nP--f~~~~~~~~~i~~~~~~~~~~~~~~~~r~~~V~~~ 514 (514) +-.+-+=.+-+..||++. .+| |.. +.++.. T Consensus 378 ---~~~~~~~~~r~~~r~d~~~~~~~a~~~--------------------------l~~~~~ 410 (413) T protein:vir:81 378 ---DFENNLITVRAEERVGLMVTFPEAIVQ--------------------------LDVAEV 410 (413) T ss_pred ---hhhcCcEEEEEEEeeccEEecccceEE--------------------------EEecCC Confidence 112334455555666653 233 221 111111 No 56 >protein:vir:9309 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:165 # MgeName: phi 11 # Cross-refs: genbank:acc:NP_803287;genbank:gi:29028597;genbank:GeneID:1258044 Probab=74.49 E-value=0.16 Score=24.98 Aligned_cols=304 Identities=10% Similarity=0.026 Sum_probs=120.7 Q ss_pred hhhhHHHHHHhcccccchhhhhhhccccccccccccccccccccccccccccccccceeeehhhhhhhhhhhhcceeEEe Q lcl|Aclame:pro 33 IFENQDRDINNDPMYRDPQLVEAFNAGLNEAVVNGDHGYDPANIAQGVTTGAVTNIGPTVMGMVRRAIPQLIAFDIAGVQ 112 (514) Q Consensus 33 ~~enq~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~g~~~~~~~~st~tg~v~~~~P~l~~l~Rra~~~LIa~DI~GVQ 112 (514) ..|+|+....-. ++.. ....+..+. |+ +.. ++.+++..--....=-+++.+..+.+..+++-+- T Consensus 1 ~~~~~~~~~~~~-~f~~---~~~~~~~~~-a~----------~~~-~~~~~~~liP~~~~~~ii~~~~~~s~l~~l~~~~ 64 (324) T protein:vir:93 1 MEQTQKLKLNLQ-HFAS---NNVKPQVFN-PD----------NVM-MHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYE 64 (324) T ss_pred CchhHHHHHHHH-HHHH---hhhhhhhcc-cc----------ccc-ccCCCcceechhHHHHHHHHHHhhchhhhhccee Confidence 233333322111 0000 000001110 00 000 0111110000111112444455677888899999 Q ss_pred cCCcccceeeeeeeeecCCCCccccccccccCCCCccCcccccccccccccccccccccccccccccccccccccccccc Q lcl|Aclame:pro 113 PMTGPTSQVFTLRSVYGKDPLTGAEAFHPTRQADASFSGQAAASTIADFPTTGAATDGTPYKAEVTTSGGDVSMRYFLAL 192 (514) Q Consensus 113 PmTgPTGLIFAMRSrY~~~~~tg~EA~~~~nEadt~fSG~~~~~~~~~~~~~~~~t~~~~~~~~~~~~~g~~~~~~~~~~ 192 (514) ||++++--|.- ... +.+| .|- T Consensus 65 ~~~~~~~~ip~----~~~----~~~a---------~~v------------------------------------------ 85 (324) T protein:vir:93 65 PMEGTEKKFTF----WAD----KPGA---------YWV------------------------------------------ 85 (324) T ss_pred eccCCceEEEE----Eec----Ccce---------eee------------------------------------------ Confidence 99887532211 100 0000 000 Q ss_pred ccccccccCccccccccccccccccccccccccchhhhccccCCCCCCcccccceeEEEEEEEEeecccccccccHHHHH Q lcl|Aclame:pro 193 GAVTLAVAGQMTATEYTDGVAGGLLVEIDAGMATSQAELQENFNGSSNNEWNEMSFRIDKQVVEAKSRQLKAQYSIELAQ 272 (514) Q Consensus 193 ~~~~~~~~~~~~~~~~~~~~a~~~~y~~~~GmtTa~aEal~~~ggs~~~~f~EMsFsIEK~tVtAKSRaLKAEYT~ELAQ 272 (514) +| +..++|..-++++++++.|..+-....|-||.+ T Consensus 86 ------------------------------------~E---------g~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~ 120 (324) T protein:vir:93 86 ------------------------------------GE---------GQKIETSKATWVNATMRAFKLGVILPVTKEFLN 120 (324) T ss_pred ------------------------------------cC---------CccccccccceeEEEEEeEEEEEeehhhHHHHh Confidence 01 112233333455666666666666779999999 Q ss_pred HHHhhcCCChhHHHHHHHHHHHHHHhhHHHHHHHhhheeecccccccccCCcceeccccccccccchhhHHHHHHHHHHH Q lcl|Aclame:pro 273 DLRAVHGLDADAELSGILANEVMVELNREIVNLVNSQAQIGKSGWTQGAGAAGVFDFSDAVDVKGARWAGEAYKALLIQI 352 (514) Q Consensus 273 DLkAiHGLDAEaELanILStEImlEINReii~~l~~~a~v~~~~~~~~v~~~g~~dl~~~~d~~~~rwa~e~~r~L~~~i 352 (514) |-. .|.+++|.+.|+..|...+++.+|.- ..+- ..+.|+++.....-. .......+-.| T Consensus 121 ds~----~~l~~~i~~~l~~aia~~~d~a~l~G---~g~~--------~~~~~~~~~~~~~~~------~~~~~~~~~~i 179 (324) T protein:vir:93 121 YTY----SQFFEEMKPMIAEAFYKKFDEAGILN---QGNN--------PFGKSIAQSIEKTNK------VIKGDFTQDNI 179 (324) T ss_pred cch----HHHHHHHHHHHHHHHHHHHHHHHhcC---CCCC--------CcCccccccccccce------eccccccHHHH Confidence 953 45788999999999999999988432 2110 011222221100000 00000112223 Q ss_pred HHHHHHHHHhcccccccEEEEChhHHhHHhhccccccchhccccCccccccccCceEEEEecCceEEEecCCCc--cce- Q lcl|Aclame:pro 353 EKEANEIGRQTGRGNGNFIIASRNVVSALSMTDTLVGPAAQGMQDGSMNTDTNQTVFAGVLGGRFKVYIDQYAV--NDY- 429 (514) Q Consensus 353 ~~~a~~I~~~T~r~~~n~~v~S~~va~~L~~~g~~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~--~dy- 429 (514) .++-+.|.. ..+..+.+||+|.....|....- ..|. ..-.+..+ +.|. |++|++.+... ... T Consensus 180 ~~~~~~l~~--~~~~~~~~v~n~~~~~~L~~l~d-----~~G~---~~~~~~~~----~~l~-G~PVv~~~~~~~~~~~i 244 (324) T protein:vir:93 180 IDLEALLED--DELEANAFISKTQNRSLLRKIVD-----PETK---ERIYDRNS----DSLD-GLPVVNLKSSNLKRGEL 244 (324) T ss_pred HHHHHhhhh--ccCCCCEEEEcHHHHHHHHHhhC-----CCCC---eeecCCCC----Cccc-ceeeEeecCCCCCcceE Confidence 333333322 23556679999999988875311 1111 11111111 3454 45888765533 222 Q ss_pred -------EEEEEecCCCcccceeeccccccccccccCC------ccccceeeeeeeeeee-ecC--ccccccCcceeecC Q lcl|Aclame:pro 430 -------FTVGFKGSTEMDAGVFYSPYVPLTPLRGSDS------KNFQPVIGFKTRYGVQ-VNP--FADPTASATKVGNG 493 (514) Q Consensus 430 -------~~vG~kG~~~~~~~~fy~PYv~~~~~~~~dp------~s~qp~~~~~tRY~l~-~nP--f~~~~~~~~~i~~~ 493 (514) +++|..++.+.+- ..+..+......|. ..-|=.+=+..||+.. .+| |+.. ... T Consensus 245 ~~gdfs~~~~~~~~~~~i~~----~~~~~~~~~~~~~~~~~~~f~~n~~~~r~~~r~d~~v~~~~a~~~l-------~~a 313 (324) T protein:vir:93 245 ITGDFDKLIYGIPQLIEYKI----DETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKL-------VPA 313 (324) T ss_pred EEEecceEEEEEecCcEEEE----eecccccccccccccchhhhhcCcEEEEEEEEeccEEecccceEEE-------ecc Confidence 3333333222110 01111110000110 0112344455677763 344 3221 111 Q ss_pred cchh-hhcccc Q lcl|Aclame:pro 494 APVA-ASMGKN 503 (514) Q Consensus 494 ~~~~-~~~~~~ 503 (514) .... ...|+- T Consensus 314 ~~~~~~~~~~~ 324 (324) T protein:vir:93 314 DKRTDSVPGEV 324 (324) T ss_pred cccCCCCCCCC Confidence 1111 222332 No 57 >protein:vir:80930 Length: 278 # NCBI annotation: Cps # Family: family:all:522 # MgeID: mge:1886 # MgeName: A500 # Cross-refs: genbank:acc:YP_001468392;genbank:gi:157324966;genbank:GeneID:5601363 Probab=74.47 E-value=0.16 Score=24.98 Aligned_cols=272 Identities=12% Similarity=0.031 Sum_probs=111.6 Q ss_pred ccCCCCccCccccccccccccccccccccccccccccccccccccccccccccccccccCcccccccccccccccccccc Q lcl|Aclame:pro 142 TRQADASFSGQAAASTIADFPTTGAATDGTPYKAEVTTSGGDVSMRYFLALGAVTLAVAGQMTATEYTDGVAGGLLVEID 221 (514) Q Consensus 142 ~nEadt~fSG~~~~~~~~~~~~~~~~t~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~y~~~ 221 (514) |-+.-| -. .....+..... .+..... ............ ..+. . ..+.+.+++ T Consensus 1 Ma~~~T-~~--------------~~~iiPev~s~-------~v~~~~~--~~~v~~~~~~~~--~~l~-g-~~G~tv~ip 52 (278) T protein:vir:80 1 MADLTT-KL--------------ANLIDPEVMGP-------MISAKLP--KAIKFGKIAPID--NSLE-G-QPGSEITVP 52 (278) T ss_pred CCCcce-eh--------------hheecHHHHHH-------HHHHHHH--Hhhhhcccceec--cccc-C-CCCCEEEEe Confidence 000000 00 00000000000 0000000 000000000000 0000 0 012222222 Q ss_pred ccccchhhhccccCCCCCCcccccceeEEEEEEEEeecccccccccHHHHHHHHhh-cCCChhHHHHHHHHHHHHHHhhH Q lcl|Aclame:pro 222 AGMATSQAELQENFNGSSNNEWNEMSFRIDKQVVEAKSRQLKAQYSIELAQDLRAV-HGLDADAELSGILANEVMVELNR 300 (514) Q Consensus 222 ~GmtTa~aEal~~~ggs~~~~f~EMsFsIEK~tVtAKSRaLKAEYT~ELAQDLkAi-HGLDAEaELanILStEImlEINR 300 (514) .--....+|.. .. +..+..-.++..+++++-|-|+ |+ | + .-|+.+. -+-|.-.+..+-++.-+..++++ T Consensus 53 ~~~~~g~a~~~---~~--g~~i~~~~lt~~~~~~~i~~~~-~a-~--~-v~D~~~~~~~~d~~~~~~~~~a~~~a~~~d~ 122 (278) T protein:vir:80 53 KYKYIGDAQDV---AE--GAAIDYSALETESVKHGIKKAG-KG-V--K-LTDESVLSGYGDPVEEAQKQIRMAIASKVDN 122 (278) T ss_pred eeccCCcceee---cC--CCcCcccccccceeeEeeehhh-cc-c--c-ccHHHHhhccccHHHHHHHHHHHHHHHHHHH Confidence 11111222322 11 1233333455666677767665 22 2 2 3344443 36789999999999999999999 Q ss_pred HHHHHHhhheeecccccccccCCcceeccccccccccchhhHHHHHHHHHHHHHHHHHHHHhcccccccEEEEChhHHhH Q lcl|Aclame:pro 301 EIVNLVNSQAQIGKSGWTQGAGAAGVFDFSDAVDVKGARWAGEAYKALLIQIEKEANEIGRQTGRGNGNFIIASRNVVSA 380 (514) Q Consensus 301 eii~~l~~~a~v~~~~~~~~v~~~g~~dl~~~~d~~~~rwa~e~~r~L~~~i~~~a~~I~~~T~r~~~n~~v~S~~va~~ 380 (514) +++..+.... ..+..+-..|..+ -+.+.+-.+..++.++ - --...+++++|.++.. T Consensus 123 ~l~~~l~~a~--------~~~~~~~t~~~~~--------~~~~~~~da~~~l~~~-------~-~~~~~~ivv~p~~~~~ 178 (278) T protein:vir:80 123 DILEEALTTT--------LEVKGAINIGLID--------KIENTFTDAPDAIEDE-------S-ITTTGVLFLNYKDTAK 178 (278) T ss_pred HHHHHHhccc--------cccccccccchhh--------hHHHHHHHHHHhhccc-------C-CCcccEEEECHHHHHH Confidence 9988774321 1112211222110 0122222222222221 1 1123489999999999 Q ss_pred HhhccccccchhccccCccccccccCceEEEEecCceEEEecCCCccce-EEEEEecCCCcccceeecccccccccc-cc Q lcl|Aclame:pro 381 LSMTDTLVGPAAQGMQDGSMNTDTNQTVFAGVLGGRFKVYIDQYAVNDY-FTVGFKGSTEMDAGVFYSPYVPLTPLR-GS 458 (514) Q Consensus 381 L~~~g~~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy-~~vG~kG~~~~~~~~fy~PYv~~~~~~-~~ 458 (514) |.......+.......+ ....+-..|.+. |++||++...|..= ++++ +|. -.|+..= +. .++ .. T Consensus 179 L~k~~~~~~~~~~~~g~-----~~~~~G~ig~~~-G~~Vi~s~~~p~~t~~l~~-~gA-----i~~~~~~-~~-~vE~~R 244 (278) T protein:vir:80 179 LREEAAGSWTKASQLGD-----DLLVKGAFGELL-GWEIVRTKKLADGNALAVK-AGA-----LKTFLKR-NL-LAESGR 244 (278) T ss_pred HHhhhhhhccccccccc-----cceeeccceeec-ceeEEEcCCCCcceEEEEe-ccc-----eeeeecC-Cc-cccccc Confidence 97653332211100000 011111235664 68999999988521 2222 121 1122211 11 122 36 Q ss_pred CCccccceeeeeeeeeee-ecCcc--ccccCcceeecCcchhhhccc Q lcl|Aclame:pro 459 DSKNFQPVIGFKTRYGVQ-VNPFA--DPTASATKVGNGAPVAASMGK 502 (514) Q Consensus 459 dp~s~qp~~~~~tRY~l~-~nPf~--~~~~~~~~i~~~~~~~~~~~~ 502 (514) |+..++-.|-...+||+. .||-. ..+... |. T Consensus 245 d~~~~~d~i~~~~~yg~~v~~~~~~v~it~~a-------------~~ 278 (278) T protein:vir:80 245 DMDHKLTKFNADQHYAVALVDETKAVKVVPVA-------------GN 278 (278) T ss_pred chhhccceeeeeeEEEEEEEcCcceEEEeecc-------------CC Confidence 999999999999999984 46722 111111 11 No 58 >protein:vir:1433 Length: 435 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:30 # MgeName: phiE125 # Cross-refs: genbank:acc:NP_536362;genbank:gi:17975167;genbank:GeneID:929171 Probab=73.68 E-value=0.17 Score=24.84 Aligned_cols=341 Identities=13% Similarity=0.102 Sum_probs=109.7 Q ss_pred CcchhhhhhhhccccccccccccchhhhhhhhhhhhHHHHHHhc---------------------ccccchhhhhhhccc Q lcl|Aclame:pro 1 MNLTEKWKDLLEAEGADMPEIATATKQKIMSKIFENQDRDINND---------------------PMYRDPQLVEAFNAG 59 (514) Q Consensus 1 ~~l~~kw~p~l~~~~~~~~~i~~~~~~~~~~~~~enq~~~~~~~---------------------~~~~~~~~~~~~~~~ 59 (514) -.-.+++.-+.. ||++. +..| .+ +|.+++...+. ..-........|+.+ T Consensus 32 ~ee~~~~~~l~~-------ei~~l-~~~I-~~-~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 101 (435) T protein:vir:14 32 VEQQAEFDQLSS-------KFSEL-TAQI-ER-AEAAERMAAAAAVPVDPNPTAVAAPAAAPVHAQPKALEVKGAKMARM 101 (435) T ss_pred HHHHHHHHHHHH-------HHHHH-HHHH-HH-HHHHHHHHHhhcccccchhhhhhhccccccccccchhhhhHHHHHHH Confidence 001122222211 01100 0000 00 11111110000 000000000001110 Q ss_pred cc---ccc--------ccc--cccccccccccccccccccccceeeeh------hhhhhhhhhhhcce-eEEecCCcccc Q lcl|Aclame:pro 60 LN---EAV--------VNG--DHGYDPANIAQGVTTGAVTNIGPTVMG------MVRRAIPQLIAFDI-AGVQPMTGPTS 119 (514) Q Consensus 60 ~~---~a~--------~~~--~~g~~~~~~~~st~tg~v~~~~P~l~~------l~Rra~~~LIa~DI-~GVQPmTgPTG 119 (514) .. .+. ... ..+....+ .-.+++. .....||+ +++++.++.+..++ +-+.||+... T Consensus 102 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~t~---~~gg~~vP~~~~~~ii~~l~~~~~i~~~~~~~~~~~~~~- 176 (435) T protein:vir:14 102 VRALAAARGDAQLASKLAIERGFGEEVAM-SLNTLSP---GAGGVLVPENLSSEVIELLRPKSVVRKLGARTLPLSNGN- 176 (435) T ss_pred HHHHHhhcchhhHHHHHHHhhhhhhhhhh-hcccCCc---CCCccccchhHHHHHHHHHhhhchhhhhcceeeecCCCc- Confidence 00 000 000 00000000 0000010 11122222 22323344444443 2233332111 Q ss_pred eeeeeeeeecCCCCccccccccccCCCCccCccccccccccccccccccccccccccccccccccccccccccccccccc Q lcl|Aclame:pro 120 QVFTLRSVYGKDPLTGAEAFHPTRQADASFSGQAAASTIADFPTTGAATDGTPYKAEVTTSGGDVSMRYFLALGAVTLAV 199 (514) Q Consensus 120 LIFAMRSrY~~~~~tg~EA~~~~nEadt~fSG~~~~~~~~~~~~~~~~t~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~ 199 (514) +-+ +... ++.++. T Consensus 177 ~~~------p~~~-~~~~a~------------------------------------------------------------ 189 (435) T protein:vir:14 177 ITI------PRLK-GGAIVG------------------------------------------------------------ 189 (435) T ss_pred eEE------EEEe-CCccee------------------------------------------------------------ Confidence 000 0000 000000 Q ss_pred cCccccccccccccccccccccccccchhhhccccCCCCCCcccccceeEEEEEEEEeecccccccccHHHHHHHHhhcC Q lcl|Aclame:pro 200 AGQMTATEYTDGVAGGLLVEIDAGMATSQAELQENFNGSSNNEWNEMSFRIDKQVVEAKSRQLKAQYSIELAQDLRAVHG 279 (514) Q Consensus 200 ~~~~~~~~~~~~~a~~~~y~~~~GmtTa~aEal~~~ggs~~~~f~EMsFsIEK~tVtAKSRaLKAEYT~ELAQDLkAiHG 279 (514) -+ .| +..+++-.-++++++..++..+-....|-||.+|-. -. T Consensus 190 -------------------~v--------~E---------~~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~ds~--~~ 231 (435) T protein:vir:14 190 -------------------YI--------GA---------DTDIPTTQQQFDDLKLTAKKMAALVPIANDLIKYAG--VN 231 (435) T ss_pred -------------------ee--------cc---------CccccccccceeEEEeeeEEEEEeehhhHHHHHhhc--cC Confidence 00 01 112233334556666666666667779999999932 12 Q ss_pred CChhHHHHHHHHHHHHHHhhHHHHHHHhhheeecccccccccCCcceeccccccccccchhhHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 280 LDADAELSGILANEVMVELNREIVNLVNSQAQIGKSGWTQGAGAAGVFDFSDAVDVKGARWAGEAYKALLIQIEKEANEI 359 (514) Q Consensus 280 LDAEaELanILStEImlEINReii~~l~~~a~v~~~~~~~~v~~~g~~dl~~~~d~~~~rwa~e~~r~L~~~i~~~a~~I 359 (514) .+.|+.|.+-|+..|...+|+.|+. -.-+ +-.+.|++.......+...- ...-+..+...+.++-..+ T Consensus 232 ~~l~~~i~~~l~~ai~~~~d~a~l~---G~G~--------~~~p~Gi~~~~~~~~~~~~~-~~~~~~~~~~~~~~l~~~~ 299 (435) T protein:vir:14 232 PNVDQIVVGDLTAAIGAREDKAFIR---DDGT--------ANTPKGLRFWALPSNVITAS-DASTLQKIETDLGKVILAL 299 (435) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhhc---cCCC--------Cccccceeecccccceeccc-cccchhhHHHHHHHHHHHh Confidence 3467888888888888888887742 1100 00133443221110000000 0001111222222222222 Q ss_pred HHhcccccccEEEEChhHHhHHhhccccccchhccccCccccccccCceEEEEecCceEEEecCCCccc--------eEE Q lcl|Aclame:pro 360 GRQTGRGNGNFIIASRNVVSALSMTDTLVGPAAQGMQDGSMNTDTNQTVFAGVLGGRFKVYIDQYAVND--------YFT 431 (514) Q Consensus 360 ~~~T~r~~~n~~v~S~~va~~L~~~g~~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~d--------y~~ 431 (514) ...-........|++|.....|....- ..| ...-.+.+ .|+|.| ++|+++++.|.+ -++ T Consensus 300 ~~~~~~~~~~~~v~n~~~~~~L~~lkd-----~~G---~~l~~~~~----~g~l~G-~Pv~~~~~~p~~~~~~~~~~~i~ 366 (435) T protein:vir:14 300 ENADANLTQPGWIMAPRTFRFLEGLRD-----GNG---NKVYPELA----NGMLKG-YPVGKTTQVPINLGETGKESEIY 366 (435) T ss_pred hhccccccCCEEEEcHHHHHHHHHhhc-----cCC---ceeccCCC----CCeeec-ceeEeeccccccccCCCccceEE Confidence 211112234557899999988875321 001 01111112 256655 589988775431 122 Q ss_pred --------EEEecCCCcccceeeccccccccccccCCccc---cceeeeeeeeee-eecC--ccccccCcceeecCcchh Q lcl|Aclame:pro 432 --------VGFKGSTEMDAGVFYSPYVPLTPLRGSDSKNF---QPVIGFKTRYGV-QVNP--FADPTASATKVGNGAPVA 497 (514) Q Consensus 432 --------vG~kG~~~~~~~~fy~PYv~~~~~~~~dp~s~---qp~~~~~tRY~l-~~nP--f~~~~~~~~~i~~~~~~~ 497 (514) +|..+.-. +-.+||.-........-..| |=.+=+..|++. ..+| |.. .++-+|. T Consensus 367 ~gd~s~~~i~~~~~~~----~~~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~~~~~~a~~~--------l~~~~~~ 434 (435) T protein:vir:14 367 FTDFGDVFIGEEETLE----IDYSKEATYKDADGHMVSAFQRDQTLIRVIAKNDFGPRHVESIAV--------LAGVAWG 434 (435) T ss_pred EeecccEEEEEecccE----EEEeccccccccccchhhhhhcChhheeeeeeeCceeecccceEE--------EecCCCC Confidence 33333222 22333322111000000001 122334455555 2333 221 2222222 Q ss_pred h Q lcl|Aclame:pro 498 A 498 (514) Q Consensus 498 ~ 498 (514) + T Consensus 435 ~ 435 (435) T protein:vir:14 435 A 435 (435) T ss_pred C Confidence 2 No 59 >protein:vir:97053 Length: 390 # NCBI annotation: putative head protein # Family: family:all:585 # MgeID: mge:1653 # MgeName: OP1 # Cross-refs: genbank:acc:YP_453565;genbank:gi:84662600;genbank:GeneID:5142468 Probab=72.71 E-value=0.18 Score=24.68 Aligned_cols=344 Identities=10% Similarity=0.040 Sum_probs=118.1 Q ss_pred Ccchhhhhh---hhccccccccccccchhhhhhhhhhh--hHHHHH-Hhcccc-cchhh-------hhhhcccccccccc Q lcl|Aclame:pro 1 MNLTEKWKD---LLEAEGADMPEIATATKQKIMSKIFE--NQDRDI-NNDPMY-RDPQL-------VEAFNAGLNEAVVN 66 (514) Q Consensus 1 ~~l~~kw~p---~l~~~~~~~~~i~~~~~~~~~~~~~e--nq~~~~-~~~~~~-~~~~~-------~~~~~~~~~~a~~~ 66 (514) ..+.++=.- +.+.+...+.++.. -.+.+.++|=+ ....+. .++... ..... .+.+..+....... T Consensus 20 ~~~~e~~~~~~~~~~e~~~~~~~~~~-e~~~l~~~i~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 98 (390) T protein:vir:97 20 KAFGERAVRDGELNASARSKVDELFA-TVGNLSAEVQAARQRVAELEGNGAGGDVQHVSVGDMFVASEQFQASTGRWNDR 98 (390) T ss_pred HHHHHHHHhhcCCCHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHhcccccccccccchhhhhhhHHHHHHHHHhhhh Confidence 111111000 00000000000000 00111111111 000000 000000 00000 00000000000000 Q ss_pred ccccc----cc--cccccccccccccccceeeehhhhhhhhhhhhcceeEEecCCcccceeeeeeeeecCCCCccccccc Q lcl|Aclame:pro 67 GDHGY----DP--ANIAQGVTTGAVTNIGPTVMGMVRRAIPQLIAFDIAGVQPMTGPTSQVFTLRSVYGKDPLTGAEAFH 140 (514) Q Consensus 67 ~~~g~----~~--~~~~~st~tg~v~~~~P~l~~l~Rra~~~LIa~DI~GVQPmTgPTGLIFAMRSrY~~~~~tg~EA~~ 140 (514) ..... .. .....+++++...-....+=.+++++-++.+-.+++.+-||++++.-+--. ... + T Consensus 99 ~~~~~~~~~~~~~~~~~~~~~~~g~lip~~~~~~ii~~~~~~~~i~~~~~~~~~~~~~~~~~~~----~~~--~------ 166 (390) T protein:vir:97 99 SARATMNIKAALNTASTDAAGSAGALTTPNRLPGFITPPDARLTVRDLIGSGRTDSALIEYVQE----TGF--V------ 166 (390) T ss_pred hhhhhhHHHHHHHhhhcccccccccccchhhhHHHHHHHhhhhhhHhhcceeeccCCceEEEEE----ecC--C------ Confidence 00000 00 000001111111111111223344444555667788999988766322111 000 0 Q ss_pred cccCCCCccCccccccccccccccccccccccccccccccccccccccccccccccccccCccccccccccccccccccc Q lcl|Aclame:pro 141 PTRQADASFSGQAAASTIADFPTTGAATDGTPYKAEVTTSGGDVSMRYFLALGAVTLAVAGQMTATEYTDGVAGGLLVEI 220 (514) Q Consensus 141 ~~nEadt~fSG~~~~~~~~~~~~~~~~t~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~y~~ 220 (514) +.+.|- T Consensus 167 ----~~a~~v---------------------------------------------------------------------- 172 (390) T protein:vir:97 167 ----NNAAIV---------------------------------------------------------------------- 172 (390) T ss_pred ----cceeee---------------------------------------------------------------------- Confidence 000000 Q ss_pred cccccchhhhccccCCCCCCcccccceeEEEEEEEEeecccccccccHHHHHHHHhhcCCChhHHHHHHHHHHHHHHhhH Q lcl|Aclame:pro 221 DAGMATSQAELQENFNGSSNNEWNEMSFRIDKQVVEAKSRQLKAQYSIELAQDLRAVHGLDADAELSGILANEVMVELNR 300 (514) Q Consensus 221 ~~GmtTa~aEal~~~ggs~~~~f~EMsFsIEK~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAEaELanILStEImlEINR 300 (514) + | +..+++-..++++++...|.-+-...+|-||.+|-- +.++.|.+-|+..|...+|+ T Consensus 173 ~--------E---------g~~~~~~~~~~~~i~~~~~k~~~~~~is~ell~ds~-----~l~~~i~~~la~a~~~~~d~ 230 (390) T protein:vir:97 173 A--------E---------GALKPESSLKFAKKTDTTHVIAHTMKATRQILSDAP-----QLASYMNNRLIRGLKVKEDA 230 (390) T ss_pred c--------C---------CccccccccceeEEEEeeeeEEEeehhhHHHHHhHH-----HHHHHHHHHHHHHHHHHHHH Confidence 0 0 011222223344444444444556779999999852 46899999999999999998 Q ss_pred HHHHHHhhheeecccccccccCCcceeccccccccccchhhHHHHHHHHHHHHHHHHHHHHhcccccccEEEEChhHHhH Q lcl|Aclame:pro 301 EIVNLVNSQAQIGKSGWTQGAGAAGVFDFSDAVDVKGARWAGEAYKALLIQIEKEANEIGRQTGRGNGNFIIASRNVVSA 380 (514) Q Consensus 301 eii~~l~~~a~v~~~~~~~~v~~~g~~dl~~~~d~~~~rwa~e~~r~L~~~i~~~a~~I~~~T~r~~~n~~v~S~~va~~ 380 (514) .|+.- .-+ +-.+.|++..........+.-....+-. |..+...+ .......+.+|++|+.... T Consensus 231 a~l~G---~g~--------~~~p~Gi~~~~~~~~~~~~~~~~~~~d~----~~~~~~~~--~~~~~~~~~~v~n~~~~~~ 293 (390) T protein:vir:97 231 EILRG---TGA--------NDGLLGLIPQATTYAAPTTIAGATRVDQ----LRLAMLQA--SLAEYPASGIVINPIDWAA 293 (390) T ss_pred HHhhc---CCC--------CccccceeeccccccccccccccchHHH----HHHHHHhh--ccccCCCCEEEEcHHHHHH Confidence 88421 100 0113344321110000000000011111 22222222 2334567788999999888 Q ss_pred HhhccccccchhccccCccccccccCceEEEEecCceEEEecCCCccceEEEEEecCCCcccceeeccccccccccccC- Q lcl|Aclame:pro 381 LSMTDTLVGPAAQGMQDGSMNTDTNQTVFAGVLGGRFKVYIDQYAVNDYFTVGFKGSTEMDAGVFYSPYVPLTPLRGSD- 459 (514) Q Consensus 381 L~~~g~~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~kG~~~~~~~~fy~PYv~~~~~~~~d- 459 (514) |... .. ..| ...-.+.... --++|. |++|++++..+.+-+++|--- ..+++...-.+. +...+ T Consensus 294 L~~l---kd--~~G---~~l~~~~~~~-~~~~l~-G~pV~~~~~~~~~~~~~gd~~-----~~~~~~~~~~~~-i~~~~~ 357 (390) T protein:vir:97 294 IELA---KD--ANN---QYLIGNARGT-LTPTLW-GLPVVATQAMAPGEFLVGAFD-----LAAQIFDQWDAR-VEIGYV 357 (390) T ss_pred HHHh---hc--CCC---ceeecCccCC-CCceec-ceeeEEcCCCCCCcEEEEecc-----ceEEEEEecceE-EEEeec Confidence 8742 11 111 1111111111 013454 569999998887666555210 011111000000 01111 Q ss_pred C---ccccceeeeeeeeeee-ecC--ccccccCcceeecC Q lcl|Aclame:pro 460 S---KNFQPVIGFKTRYGVQ-VNP--FADPTASATKVGNG 493 (514) Q Consensus 460 p---~s~qp~~~~~tRY~l~-~nP--f~~~~~~~~~i~~~ 493 (514) . .+-+=.+-+..||++. .+| |... .=+ T Consensus 358 ~~~f~~~~~~~r~~~r~d~~v~~~~a~v~~-------~~a 390 (390) T protein:vir:97 358 NDDFQRNMVTVLAEERLALVVYRPEALITG-------SFA 390 (390) T ss_pred ccccccCcEEEEEEEeeccEEeccccEEEE-------EeC Confidence 1 1222233445577763 344 2211 100 No 60 >protein:vir:1268 Length: 397 # NCBI annotation: hypothetical protein # Family: family:all:21 # MgeID: mge:329 # MgeName: phi-105 # Cross-refs: genbank:acc:NP_690760;genbank:gi:22855000;genbank:GeneID:955203 Probab=71.83 E-value=0.19 Score=24.53 Aligned_cols=304 Identities=13% Similarity=0.065 Sum_probs=111.6 Q ss_pred Ccch----------------hhhhhhhccc-cccccccccchhhhh----hhhhhhhHHHHHHhcccccchhhhhhhccc Q lcl|Aclame:pro 1 MNLT----------------EKWKDLLEAE-GADMPEIATATKQKI----MSKIFENQDRDINNDPMYRDPQLVEAFNAG 59 (514) Q Consensus 1 ~~l~----------------~kw~p~l~~~-~~~~~~i~~~~~~~~----~~~~~enq~~~~~~~~~~~~~~~~~~~~~~ 59 (514) .... ....+..... +.........+++++ ....+-+-++....... T Consensus 53 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~------------- 119 (397) T protein:vir:12 53 TEGRSLDVPDLPGGVNFVPEQERNPEGQRSQGQGNEERQQQYSKAFLKGLRGKRLTDEERDLLDSPE------------- 119 (397) T ss_pred HHHHHHHHHHHHHHhhhhhhhhhhhcccccccchhhHHHHHHHHHHHHHHhccCCcHHHHHHHhhhh------------- Confidence 0000 0000000000 000000000011110 01111111111110000 Q ss_pred cccccccccccccccccccccccccc---cccceeeehhhhhhhhhhhhcceeEEecCCcccceeeeeeeeecCCCCccc Q lcl|Aclame:pro 60 LNEAVVNGDHGYDPANIAQGVTTGAV---TNIGPTVMGMVRRAIPQLIAFDIAGVQPMTGPTSQVFTLRSVYGKDPLTGA 136 (514) Q Consensus 60 ~~~a~~~~~~g~~~~~~~~st~tg~v---~~~~P~l~~l~Rra~~~LIa~DI~GVQPmTgPTGLIFAMRSrY~~~~~tg~ 136 (514) .-+. ...++++|.+ +.+.+. +++.+.++.+-.+++.+.||+++.|-+--.|.. + +. T Consensus 120 -~~a~-----------~~~~~~~gg~lvP~~~~~~---ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~--~----~~ 178 (397) T protein:vir:12 120 -FRAM-----------SGINDEDGGILIPEDIGRQ---IHEFKRQFEPLEQYVTVEPVTTRSGTRLLEKNA--D----MV 178 (397) T ss_pred -hhhc-----------cccccccCcccCchhHHHH---HHHhhhhhhhHHhhcceeeccCCceeEEEEEec--C----Cc Confidence 0000 0001112221 122233 344444666778999999999988753221111 0 00 Q ss_pred cccccccCCCCccCccccccccccccccccccccccccccccccccccccccccccccccccccCccccccccccccccc Q lcl|Aclame:pro 137 EAFHPTRQADASFSGQAAASTIADFPTTGAATDGTPYKAEVTTSGGDVSMRYFLALGAVTLAVAGQMTATEYTDGVAGGL 216 (514) Q Consensus 137 EA~~~~nEadt~fSG~~~~~~~~~~~~~~~~t~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~ 216 (514) .| .|-+. T Consensus 179 ~a---------~~v~E---------------------------------------------------------------- 185 (397) T protein:vir:12 179 PF---------SPVEE---------------------------------------------------------------- 185 (397) T ss_pred ce---------eeecc---------------------------------------------------------------- Confidence 00 01000 Q ss_pred cccccccccchhhhccccCCCCCCcccccceeEEEEEEEEeecccccccccHHHHHHHHhhcCCChhHHHHHHHHHHHHH Q lcl|Aclame:pro 217 LVEIDAGMATSQAELQENFNGSSNNEWNEMSFRIDKQVVEAKSRQLKAQYSIELAQDLRAVHGLDADAELSGILANEVMV 296 (514) Q Consensus 217 ~y~~~~GmtTa~aEal~~~ggs~~~~f~EMsFsIEK~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAEaELanILStEIml 296 (514) |- +. ...+...|.++.|+..|..+- ..+|-||.+|-- +|.++.|.+.|...|.. T Consensus 186 ------g~-----~~----~~~~~~~~~~v~~~~~k~~~~-------~~is~e~l~ds~----~~l~~~i~~~l~~~~~~ 239 (397) T protein:vir:12 186 ------LG-----NL----PEIDQPRFTKVSYSIIDYGGI-------MTLSNSMLNDSD----QAIMTYVAKWFAKKSVV 239 (397) T ss_pred ------cc-----cc----cccccccceeEEeeheeeEee-------ehhhHHHHhhch----HHHHHHHHHHHHHHHHH Confidence 00 00 000112355666666666554 459999998854 45688999999999999 Q ss_pred HhhHHHHHHHhhheeecccccccccCCcceeccccccccccchhhHHHHHH-HHHHHHHHHHHHHHhcccccccEEEECh Q lcl|Aclame:pro 297 ELNREIVNLVNSQAQIGKSGWTQGAGAAGVFDFSDAVDVKGARWAGEAYKA-LLIQIEKEANEIGRQTGRGNGNFIIASR 375 (514) Q Consensus 297 EINReii~~l~~~a~v~~~~~~~~v~~~g~~dl~~~~d~~~~rwa~e~~r~-L~~~i~~~a~~I~~~T~r~~~n~~v~S~ 375 (514) .+|+.|+.-. - .+ .+.|+..++ .... ++..++.. ...+..+||+| T Consensus 240 ~~d~~il~G~---g----~~-----~~~g~~~~~-------------~i~~~~~~~l~~~---------~~~~a~~~~n~ 285 (397) T protein:vir:12 240 TRNNLILAAI---A----SL-----KKVDIDGLD-------------GIKKALNVTLDPM---------VAPGSIVLTNQ 285 (397) T ss_pred HHHHHHHhcc---c----cc-----cccccccHH-------------HHHHHHhhccchh---------hhCCCEEEEcH Confidence 9999885432 1 11 233443321 1222 22233211 22345678999 Q ss_pred hHHhHHhhccccccchhccccCccccccccCceEEEEecCceEEEecCCCccceEEEEEecCCCcccceeeccccc---- Q lcl|Aclame:pro 376 NVVSALSMTDTLVGPAAQGMQDGSMNTDTNQTVFAGVLGGRFKVYIDQYAVNDYFTVGFKGSTEMDAGVFYSPYVP---- 451 (514) Q Consensus 376 ~va~~L~~~g~~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~kG~~~~~~~~fy~PYv~---- 451 (514) .....|... .. ..|.. -...+.+. -.-++|.| ++|++.+...... - .-+.-++|+.|-. T Consensus 286 ~~~~~L~~l---kd--~~G~~--l~~~~~~~-g~~~~l~G-~pv~~~~~~~~~~-----~---~~~~~~~~gd~~~~~~~ 348 (397) T protein:vir:12 286 DGYDWLDTL---KD--GTGRY--LLQPDPTN-PTKKLLDG-RPVVPFTNRVLKT-----Q---KGKAPLIIGNLKEAIVL 348 (397) T ss_pred HHHHHHHHh---hc--cCCce--eecccccC-CCCccccc-eeeEEeccccccc-----C---CCccEEEEEehhceEEE Confidence 988887653 10 00100 00001011 01134544 4777544321100 0 0000112221110 Q ss_pred ----cccccc-----cCCccccceeeeeeeeee-eecC--ccccccCcce Q lcl|Aclame:pro 452 ----LTPLRG-----SDSKNFQPVIGFKTRYGV-QVNP--FADPTASATK 489 (514) Q Consensus 452 ----~~~~~~-----~dp~s~qp~~~~~tRY~l-~~nP--f~~~~~~~~~ 489 (514) ...+.. .+-.+-+-.+-...|++. ..+| |... +-..+ T Consensus 349 ~~~~~~~i~~~~~~~~~f~~~~~~~r~~~r~d~~~~~~~a~~~~-~~t~~ 397 (397) T protein:vir:12 349 FDREQQSIASTDTGAGAFETNSTKVRGIEREDVRKWDEDAVVFG-QITVE 397 (397) T ss_pred EeecceEEEEeccccchhhcCceEEEEEEeeccEEecccceEEE-EEeeC Confidence 000000 011123345555666665 3344 2110 00011 No 61 >protein:vir:99749 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1497 # MgeName: phiETA2 # Cross-refs: genbank:acc:YP_001004307;genbank:gi:122891761;genbank:GeneID:4712304 Probab=70.62 E-value=0.21 Score=24.34 Aligned_cols=301 Identities=9% Similarity=-0.042 Sum_probs=112.1 Q ss_pred eeeeecCCCCccccccccccCCCCccCccccccccccccccccccccccccccccccccccccccccccccccccccCcc Q lcl|Aclame:pro 124 LRSVYGKDPLTGAEAFHPTRQADASFSGQAAASTIADFPTTGAATDGTPYKAEVTTSGGDVSMRYFLALGAVTLAVAGQM 203 (514) Q Consensus 124 MRSrY~~~~~tg~EA~~~~nEadt~fSG~~~~~~~~~~~~~~~~t~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~ 203 (514) |+.....+.--+..+. ..+....|........... +........ ...- ...-...+.... T Consensus 1 ~~k~~~~~~~~~~~~~--~~~~~~~~~a~~~~~~~~~-----~~lip~~~~-------~~ii------~~~~~~s~l~~~ 60 (324) T protein:vir:99 1 MEQTQKLKLNLQHFAS--NNVKPQVFNPDNVMMHEKK-----DGTLLNDFT-------TPIL------QEVMENSKIMRL 60 (324) T ss_pred CCCchHhhHHHHHHHH--HhhhhhhccccceeccCCC-----cceechhHH-------HHHH------HHHHhhchhhhh Confidence 8877554411111110 0111222211100000000 000000000 0000 000000000000 Q ss_pred ccccccccccccccccccccccchhhhccccCCCCCCcccccceeEEEEEEEEeecccccccccHHHHHHHHhhcCCChh Q lcl|Aclame:pro 204 TATEYTDGVAGGLLVEIDAGMATSQAELQENFNGSSNNEWNEMSFRIDKQVVEAKSRQLKAQYSIELAQDLRAVHGLDAD 283 (514) Q Consensus 204 ~~~~~~~~~a~~~~y~~~~GmtTa~aEal~~~ggs~~~~f~EMsFsIEK~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAE 283 (514) ..... ...+ ...+..--....++-. ..+..+++...+++++++..|.-+---..|-||.+|-. .|.+ T Consensus 61 ~~~~~---~~~~-~~~~p~~~~~~~a~~v-----~Eg~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~----~~l~ 127 (324) T protein:vir:99 61 GKYEP---MEGT-EKKFTFWADKPGAYWV-----GEGQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTY----SQFF 127 (324) T ss_pred cceee---ccCC-ceEEEEEecCcceeEe-----ccCccccccccceeEEEEeeEEEEEeehhhHHHHhcch----HHHH Confidence 00000 0000 0000000000011100 11355777788888888888888888889999999974 4579 Q ss_pred HHHHHHHHHHHHHHhhHHHHHHHhhheeecccccccccCCcceeccccccccccchhhHHHHHHHHHHHHHHHHHHHHhc Q lcl|Aclame:pro 284 AELSGILANEVMVELNREIVNLVNSQAQIGKSGWTQGAGAAGVFDFSDAVDVKGARWAGEAYKALLIQIEKEANEIGRQT 363 (514) Q Consensus 284 aELanILStEImlEINReii~~l~~~a~v~~~~~~~~v~~~g~~dl~~~~d~~~~rwa~e~~r~L~~~i~~~a~~I~~~T 363 (514) ++|.+.|+..|...+++.||. -..+-. .+.|++......- .. ....-.+..|.++.+.|. . T Consensus 128 ~~i~~~l~~ai~~~~d~~~l~---G~g~~~--------~~~~~~~~~~~~~-----~~-~~~~~~~~~i~~~~~~l~--~ 188 (324) T protein:vir:99 128 EEMKPMIAEAFYKKFDEAGIL---NQGNNP--------FGKSIAQSIEKTN-----KV-IKGDFTQDNIIDLEALLE--D 188 (324) T ss_pred HHHHHHHHHHHHHHHHHHhhh---cCCCCc--------cCccccccccccc-----ee-ccccCCHHHHHHHHHhhh--h Confidence 999999999999999999953 222100 1111111100000 00 000011222333434342 2 Q ss_pred ccccccEEEEChhHHhHHhhccccccchhccccCccccccccCceEEEEecCceEEEecCCCcc--ceEEEE-------- Q lcl|Aclame:pro 364 GRGNGNFIIASRNVVSALSMTDTLVGPAAQGMQDGSMNTDTNQTVFAGVLGGRFKVYIDQYAVN--DYFTVG-------- 433 (514) Q Consensus 364 ~r~~~n~~v~S~~va~~L~~~g~~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~--dy~~vG-------- 433 (514) .....+.+|++|.....|....- ..| +. .-.+..+ ++|.| ++|+|.+.... ..+++| T Consensus 189 ~~~~~~~~v~n~~~~~~L~~l~d-----~~g--~~-~~~~~~~----~~l~G-~PVv~~~~~~~~~~~~i~gd~~~~~~~ 255 (324) T protein:vir:99 189 DELEANAFISKTQNRSLLRKIVD-----PET--KE-RIYDRNS----DTLDG-LPVVNLKSSNLKRGELITGDFDKLIYG 255 (324) T ss_pred ccCCCCEEEEcHHHHHHHHHhhc-----CCC--ce-eecCCCC----ccccc-eeEEeecCCCCCcceEEEEecccEEEE Confidence 34566678999999988875311 011 00 1111122 34544 58888776553 223333 Q ss_pred EecCCCcccceeeccccccccccccCCc--------cccceeeeeeeeee-eecC--ccccc--cCcceeecCcchhhhc Q lcl|Aclame:pro 434 FKGSTEMDAGVFYSPYVPLTPLRGSDSK--------NFQPVIGFKTRYGV-QVNP--FADPT--ASATKVGNGAPVAASM 500 (514) Q Consensus 434 ~kG~~~~~~~~fy~PYv~~~~~~~~dp~--------s~qp~~~~~tRY~l-~~nP--f~~~~--~~~~~i~~~~~~~~~~ 500 (514) ..+.-.. -.....-+. ...|+. +-+=.+=...||+. ..|| |+..+ +.+.. ..+ T Consensus 256 ~~~~~~i----~~~~~~~~~--~~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~lt~a~~~~~--------~~~ 321 (324) T protein:vir:99 256 IPQLIEY----KIDETAQLS--TVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADKKTD--------SVP 321 (324) T ss_pred EecCcEE----EEeeccccc--ccccccccchhhhhcCcEEEEEEEEEccEEecccceEEEEeccCCCC--------CCC Confidence 2211110 000000000 000111 11122223456664 3444 33211 11100 001 Q ss_pred ccc Q lcl|Aclame:pro 501 GKN 503 (514) Q Consensus 501 ~~~ 503 (514) ++= T Consensus 322 ~~~ 324 (324) T protein:vir:99 322 GEV 324 (324) T ss_pred CCC Confidence 111 No 62 >protein:vir:3870 Length: 400 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:82 # MgeName: A2 # Cross-refs: genbank:acc:NP_680487;swissprot:trembl:q8ltc0;genbank:gi:22296527;interpro:IPR006444;uniprot:Q8LTC0;genbank:GeneID:951713 Probab=69.54 E-value=0.22 Score=24.17 Aligned_cols=320 Identities=14% Similarity=0.058 Sum_probs=118.1 Q ss_pred Ccchhhhhhhhccccccccccccchhh-hhhhhhhhhHHHHHHhccc-ccchhhhhhhccc-----------------cc Q lcl|Aclame:pro 1 MNLTEKWKDLLEAEGADMPEIATATKQ-KIMSKIFENQDRDINNDPM-YRDPQLVEAFNAG-----------------LN 61 (514) Q Consensus 1 ~~l~~kw~p~l~~~~~~~~~i~~~~~~-~~~~~~~enq~~~~~~~~~-~~~~~~~~~~~~~-----------------~~ 61 (514) -.+++++.-+.+. |.+.-++ .....+.+..++....... .......+.+... .. T Consensus 45 ~e~~~~~~~l~~e-------i~~l~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 117 (400) T protein:vir:38 45 EGVRAKYDKAGKE-------IKDLEEKRDLYEAALKGNEQSSGKKPDHPEEHSYRDALNAYLHTRGRNTDGVNFEKTDVG 117 (400) T ss_pred HHHHHHHHHHHHH-------HHHHHHHHHHHHHHHHHHhhcccccccchhhhhHHHHHHHHHhhHHHHHHHHHHHHHHHH Confidence 1111112111110 0000000 0000000000000000000 0000000000000 00 Q ss_pred cccccccccccc-ccccccc--ccccc---cccceeeehhhhhhhhhhhhcceeEEecCCcccceeeeeeeeecCCCCcc Q lcl|Aclame:pro 62 EAVVNGDHGYDP-ANIAQGV--TTGAV---TNIGPTVMGMVRRAIPQLIAFDIAGVQPMTGPTSQVFTLRSVYGKDPLTG 135 (514) Q Consensus 62 ~a~~~~~~g~~~-~~~~~st--~tg~v---~~~~P~l~~l~Rra~~~LIa~DI~GVQPmTgPTGLIFAMRSrY~~~~~tg 135 (514) .........-.. .....++ ..|.+ +.+.+. +++..-++.+..+++.+.||++.++-+--++.. ++ T Consensus 118 ~~~~~~~~~~~~~~~~~~~~~~~~gg~~vP~~~~~~---ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~------~~ 188 (400) T protein:vir:38 118 TFAVLRAVPTDASDAVNAGVKAADAASTIPETISNT---PQRELQTVVDLKPFTNVFQASTQKGTYPTVANA------TT 188 (400) T ss_pred HHhhhhhhhHHHHHHHhhcccccCCcccccHHHHHH---HHHHHHhhhhhhhcceeEeccCcceEEEEEecC------CC Confidence 000000000000 0000001 11111 112222 333444666788899999999887644322211 00 Q ss_pred ccccccccCCCCccCccccccccccccccccccccccccccccccccccccccccccccccccccCcccccccccccccc Q lcl|Aclame:pro 136 AEAFHPTRQADASFSGQAAASTIADFPTTGAATDGTPYKAEVTTSGGDVSMRYFLALGAVTLAVAGQMTATEYTDGVAGG 215 (514) Q Consensus 136 ~EA~~~~nEadt~fSG~~~~~~~~~~~~~~~~t~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~ 215 (514) . ..|-+ T Consensus 189 ~----------~~~~~---------------------------------------------------------------- 194 (400) T protein:vir:38 189 K----------MVTVA---------------------------------------------------------------- 194 (400) T ss_pred c----------ccccc---------------------------------------------------------------- Confidence 0 00000 Q ss_pred ccccccccccchhhhccccCCCCCCcccccceeEEEEEEEEeecccccccccHHHHHHHHhhcCCChhHHHHHHHHHHHH Q lcl|Aclame:pro 216 LLVEIDAGMATSQAELQENFNGSSNNEWNEMSFRIDKQVVEAKSRQLKAQYSIELAQDLRAVHGLDADAELSGILANEVM 295 (514) Q Consensus 216 ~~y~~~~GmtTa~aEal~~~ggs~~~~f~EMsFsIEK~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAEaELanILStEIm 295 (514) ++- +. ...+...|.+ ++...+.-+-...+|-||.+|- ..|.+++|.+-|+..|. T Consensus 195 ------E~~-----~~----~~~~~~~f~~-------i~~~~~k~~~~~~is~ell~ds----~~~~~~~i~~~l~~~~~ 248 (400) T protein:vir:38 195 ------ELE-----KN----PAMAKPEFKP-------VNWSVETYRQALPVSQESIDDS----AIDLVGLIAQNGQQIKV 248 (400) T ss_pred ------ccc-----cc----ccccccccee-------eEeehhheeeehhhHHHHHhhh----HHHHHHHHHHHHHHHHH Confidence 000 00 0001123444 4445555555677999999985 34678899999999999 Q ss_pred HHhhHHHHHHHhhheeecccccccccCCcceeccccccccccchhhHHHHHHHHHHHHHHHHHHHHhcccccccEEEECh Q lcl|Aclame:pro 296 VELNREIVNLVNSQAQIGKSGWTQGAGAAGVFDFSDAVDVKGARWAGEAYKALLIQIEKEANEIGRQTGRGNGNFIIASR 375 (514) Q Consensus 296 lEINReii~~l~~~a~v~~~~~~~~v~~~g~~dl~~~~d~~~~rwa~e~~r~L~~~i~~~a~~I~~~T~r~~~n~~v~S~ 375 (514) ..+|+.|+..... + .+.|+..++ ....++..... . .. ....|++| T Consensus 249 ~~~~~~i~~~~~~-------~-----~~~~~~~~~-------------~~~~~~~~~~~--------~-~~-~a~~v~~~ 293 (400) T protein:vir:38 249 NTTNGAVATLLKG-------F-----TAKTISSVD-------------DLKHINNVDLD--------P-AY-SRVIIASQ 293 (400) T ss_pred HHHHHhhhhcccc-------c-----cccccccHH-------------HHHHHHHhhhh--------h-hh-CcEEEEcH Confidence 9999888543311 1 122222111 12222221111 1 12 24567899 Q ss_pred hHHhHHhhccccccchhccccCccccccccCceEEEEecCceEEEecCCCccceEEEEEecCCCcccceeecccc----c Q lcl|Aclame:pro 376 NVVSALSMTDTLVGPAAQGMQDGSMNTDTNQTVFAGVLGGRFKVYIDQYAVNDYFTVGFKGSTEMDAGVFYSPYV----P 451 (514) Q Consensus 376 ~va~~L~~~g~~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~kG~~~~~~~~fy~PYv----~ 451 (514) .....|... .. ..|.. -...+.++. ..++|.| ++|++..+.+.. -.|+ .-++|..+- - T Consensus 294 ~~~~~l~~l---kd--~~G~~--i~~~~~~~~-~~~~l~G-~pv~~~~~~~~~-----~~g~----~~~~~gd~s~~~~~ 355 (400) T protein:vir:38 294 SFYNFLDTV---KD--GNGRY--LLQDSILTP-SGKSVLG-MPIAVVSDDTLG-----AAGE----AHAFLGDIKRAILF 355 (400) T ss_pred HHHHHHHHh---hc--cCCCe--eeecCcCCC-Ccccccc-ceeEEecccccC-----CCCc----eEEEEEeccccEEE Confidence 988777642 10 00000 000111111 1135554 477776655431 1111 112222211 1 Q ss_pred ----cccccccCCccccceeeeeeeeee-eecC--ccccccCcceeecCcchh Q lcl|Aclame:pro 452 ----LTPLRGSDSKNFQPVIGFKTRYGV-QVNP--FADPTASATKVGNGAPVA 497 (514) Q Consensus 452 ----~~~~~~~dp~s~qp~~~~~tRY~l-~~nP--f~~~~~~~~~i~~~~~~~ 497 (514) ...+...|-..|+..+-...||+. ..+| |...+ -.+.+ T Consensus 356 ~~~~~~~~~~~~~~~~~~~~~~~~r~d~~~~~~~a~~~l~--------~~~~a 400 (400) T protein:vir:38 356 ANRADFMVRWVDDQIYGQFLQAGMRFGVSVADEKAGYFLT--------YTPKA 400 (400) T ss_pred EeecceEEEEecccccceeEEEEEEeccEEecccceEEEE--------eecCC Confidence 112233456667777888889987 3455 32211 11111 No 63 >protein:vir:6212 Length: 434 # NCBI annotation: prohead protease # Family: family:all:21 # MgeID: mge:128 # MgeName: phBC6A52 # Cross-refs: genbank:acc:NP_852592;genbank:gi:31415852;genbank:GeneID:1489210 Probab=66.00 E-value=0.27 Score=23.66 Aligned_cols=346 Identities=13% Similarity=0.094 Sum_probs=116.3 Q ss_pred Ccchhhhhhhhc------------------------------cccccccccccchhhhhhhhhhhhHHHHHHhcccccch Q lcl|Aclame:pro 1 MNLTEKWKDLLE------------------------------AEGADMPEIATATKQKIMSKIFENQDRDINNDPMYRDP 50 (514) Q Consensus 1 ~~l~~kw~p~l~------------------------------~~~~~~~~i~~~~~~~~~~~~~enq~~~~~~~~~~~~~ 50 (514) -.|.++..-+-+ ++......+....+|+...+.+.+..+.- .+...... T Consensus 43 ~~l~~~~~~l~~~i~~le~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~-~~~~~~~~ 121 (434) T protein:vir:62 43 EQLTKEIQTISEELAKLEEKEKEEDPAKKKDDDPEKKEDPTAKENPNEKTELSEEQRSAISASIAAALSTK-GHRTNKET 121 (434) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcchhhhhcchhhhcchhhhHHHHHHHHHHHHHHHHhhhhhc-cccchHHH Confidence 111111111100 00000000000000000000000000000 00000000 Q ss_pred hhhhhhccccccccccccccccccccccccccccccccceeee--hhhhhhhhhhhhcceeEEecCCcccceeeeeeeee Q lcl|Aclame:pro 51 QLVEAFNAGLNEAVVNGDHGYDPANIAQGVTTGAVTNIGPTVM--GMVRRAIPQLIAFDIAGVQPMTGPTSQVFTLRSVY 128 (514) Q Consensus 51 ~~~~~~~~~~~~a~~~~~~g~~~~~~~~st~tg~v~~~~P~l~--~l~Rra~~~LIa~DI~GVQPmTgPTGLIFAMRSrY 128 (514) ..-.+|...+..- .... ..-+-++++++-.-.=|.-+ .+++..-+..+...++-|.|+++..- |- ++ T Consensus 122 e~r~a~~~~l~~~-----~~~~-e~~a~~~~t~~GG~lvP~~~~~~Ii~~l~~~~~i~~~~~~~~~~~~~~--~p---~~ 190 (434) T protein:vir:62 122 EIRSVFANYIVGN-----IDEK-EARALGLVTGNGSVTIPDFLSKEIITYAQEENFLRRLGTGVKTKENIK--YP---VL 190 (434) T ss_pred HHHHHHHHHhccc-----cchh-hhhhhcccccccceecchhhHHHHHHhhhhhhhhhhhcceeccCCceE--EE---EE Confidence 0011111111100 0000 00011112211000012222 14444445666677788877764311 00 01 Q ss_pred cCCCCccccccccccCCCCccCccccccccccccccccccccccccccccccccccccccccccccccccccCccccccc Q lcl|Aclame:pro 129 GKDPLTGAEAFHPTRQADASFSGQAAASTIADFPTTGAATDGTPYKAEVTTSGGDVSMRYFLALGAVTLAVAGQMTATEY 208 (514) Q Consensus 129 ~~~~~tg~EA~~~~nEadt~fSG~~~~~~~~~~~~~~~~t~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~ 208 (514) ...+ .+.+- T Consensus 191 ~~~~-------------~a~~~---------------------------------------------------------- 199 (434) T protein:vir:62 191 VKKA-------------EAQGH---------------------------------------------------------- 199 (434) T ss_pred ecCC-------------cccce---------------------------------------------------------- Confidence 1100 00000 Q ss_pred cccccccccccccccccchhhhccccCCCCCCcccccceeEEEEEEEEeecccccccccHHHHHHHHhhcCCChhHHHHH Q lcl|Aclame:pro 209 TDGVAGGLLVEIDAGMATSQAELQENFNGSSNNEWNEMSFRIDKQVVEAKSRQLKAQYSIELAQDLRAVHGLDADAELSG 288 (514) Q Consensus 209 ~~~~a~~~~y~~~~GmtTa~aEal~~~ggs~~~~f~EMsFsIEK~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAEaELan 288 (514) . + ...+...++-..++++++..+|.-+-...+|-||.+|- .+|-+++|.+ T Consensus 200 ---------~-----------~------~~e~~~~~~~~~~f~~v~~~~~k~~~~~~iS~ell~ds----~~~l~~~i~~ 249 (434) T protein:vir:62 200 ---------K-----------N------ERTNNEMPETDIEFDEIELSPTEFDALATVTKKLLART----GLPIEQIVMD 249 (434) T ss_pred ---------e-----------c------ccccccccccccceeeEEeeheeeEeehhhHHHHHhcc----hHHHHHHHHH Confidence 0 0 00011122222345566666666666778999999995 4567999999 Q ss_pred HHHHHHHHHhhHHHHHHHhhheeecccccccccCCcceeccccccccccchhhHHHHHHHHHHHHHHHHHHHHhcccccc Q lcl|Aclame:pro 289 ILANEVMVELNREIVNLVNSQAQIGKSGWTQGAGAAGVFDFSDAVDVKGARWAGEAYKALLIQIEKEANEIGRQTGRGNG 368 (514) Q Consensus 289 ILStEImlEINReii~~l~~~a~v~~~~~~~~v~~~g~~dl~~~~d~~~~rwa~e~~r~L~~~i~~~a~~I~~~T~r~~~ 368 (514) -|+..|...+++.||.- .-+ .....++.......+... +.. ..+....|+..+ ...- +..+ T Consensus 250 ~la~~~~~~~d~~~l~G---~G~---~~~~~g~~~~~~~~~~~~----~~~-~~d~l~~l~~~l-------~~~~-~~~a 310 (434) T protein:vir:62 250 ELKKAYVRKETQYMVNG---DEA---NNINDGALAKKAVEFKTD----EKN-LYDALVKMKNTP-------VKEV-RKKA 310 (434) T ss_pred HHHHHHHHHHHHHHhcc---CCC---Cccccceeeccccccccc----ccc-hhhHHHHHHhhc-------chhh-hcCC Confidence 99999999999999531 111 000001111000111100 000 112222333332 2211 2333 Q ss_pred cEEEEChhHHhHHhhccccccchhccccCccccccccC-ceEEEEecCceEEEecCCCccceEEEEEecCCCcccceee- Q lcl|Aclame:pro 369 NFIIASRNVVSALSMTDTLVGPAAQGMQDGSMNTDTNQ-TVFAGVLGGRFKVYIDQYAVNDYFTVGFKGSTEMDAGVFY- 446 (514) Q Consensus 369 n~~v~S~~va~~L~~~g~~~~~~~~~~~~~~~~~d~~~-~~~~G~l~~~~~vy~D~y~~~dy~~vG~kG~~~~~~~~fy- 446 (514) ..|++|.....|... .. ..|. --...+... .-.-.+|.| ++|+++.+.+.. -.|.. .-++| T Consensus 311 -~~v~n~~~~~~L~~l---kd--~~G~--~l~~~~~~~~~g~~~tl~G-~pV~~~~~~~~~-----~~~~~---~~i~~G 373 (434) T protein:vir:62 311 -RWVLNTAALTKIETM---KT--DDGF--PLLRPFNQAEGGIGYTLLG-FPVEEEDAIDIP-----DSPDT---PVFYFG 373 (434) T ss_pred -EEEEcHHHHHHHHHh---hc--cCCC--EeeccCCCccCCCCceecc-eeeEEecCccCc-----cCCCc---eEEEEe Confidence 457899988877642 11 0110 000000000 000024544 588888766521 11100 00111 Q ss_pred --cccccccc-----c-cccCC--ccccceeeeeeeeeee-e-cCccccccCcceeecCcchhhhcc Q lcl|Aclame:pro 447 --SPYVPLTP-----L-RGSDS--KNFQPVIGFKTRYGVQ-V-NPFADPTASATKVGNGAPVAASMG 501 (514) Q Consensus 447 --~PYv~~~~-----~-~~~dp--~s~qp~~~~~tRY~l~-~-nPf~~~~~~~~~i~~~~~~~~~~~ 501 (514) +-|...+. + +..+. .+-|=.+..+.|++-. + .||+.. +.....+.+..+ T Consensus 374 dfs~~~i~~~~g~~~i~~~~~~~~~~~~v~~~~~~r~Dgk~i~~~~~~~------~~~~~~~~~~~~ 434 (434) T protein:vir:62 374 DFSKFYIQDVIGSLEVQKLVELFSRTNRVGFRIWNLLDAQLIHSPFEVP------VYKYVLKAPTGA 434 (434) T ss_pred eccceEEEEeeceeEEEeehhhhcccCceEEEEEeeecceeecCcccce------EEEEEeccCCCC Confidence 22211110 0 11122 1223335556777533 4 487642 221222222233 No 64 >protein:vir:81070 Length: 390 # NCBI annotation: p09 # Family: family:all:585 # MgeID: mge:1889 # MgeName: Xop411 # Cross-refs: genbank:acc:YP_001285679;genbank:gi:148727187;genbank:GeneID:5247115 Probab=65.33 E-value=0.29 Score=23.57 Aligned_cols=335 Identities=11% Similarity=0.067 Sum_probs=116.4 Q ss_pred Ccchhh------hhhhhccccccccccccchhhhhhhhh--hhhHHHHHHhccc-------c-----cchhhhhhhcccc Q lcl|Aclame:pro 1 MNLTEK------WKDLLEAEGADMPEIATATKQKIMSKI--FENQDRDINNDPM-------Y-----RDPQLVEAFNAGL 60 (514) Q Consensus 1 ~~l~~k------w~p~l~~~~~~~~~i~~~~~~~~~~~~--~enq~~~~~~~~~-------~-----~~~~~~~~~~~~~ 60 (514) ..+.|+ ...-...+ ..++. .-.+.+.++| +|.+...+..... - ......+.+.... T Consensus 20 ~~~~e~~~~~~~~~~e~~~~---~~~l~-~e~~~l~~~i~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 95 (390) T protein:vir:81 20 RAFGERAVRDGELNASARSK---VDELF-ATVGNLSAEVQAARQRVAELEGNGAGGDVQHVSVGDMFVASEQFQASAGRW 95 (390) T ss_pred HHHHHHHHhhcCcCHHHHHH---HHHHH-HHHHHHHHHHHHHHHHHHHHHhcccccccccccchhhhhhhHHHHHHHHHH Confidence 111110 00000000 00000 0000111111 1111111100000 0 0000000000000 Q ss_pred cccccccccccccc---cccccccccc---ccccceeeehhhhhhhhhhhhcceeEEecCCcccceeeeeeeeecCCCCc Q lcl|Aclame:pro 61 NEAVVNGDHGYDPA---NIAQGVTTGA---VTNIGPTVMGMVRRAIPQLIAFDIAGVQPMTGPTSQVFTLRSVYGKDPLT 134 (514) Q Consensus 61 ~~a~~~~~~g~~~~---~~~~st~tg~---v~~~~P~l~~l~Rra~~~LIa~DI~GVQPmTgPTGLIFAMRSrY~~~~~t 134 (514) .........-.... .....++++. .+...+ .++++.-+..+-.++|.+.||++++.-+.- .... + T Consensus 96 ~~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~---~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~----~~~~--~ 166 (390) T protein:vir:81 96 NDRSARATMNIKAALNTASTDAAGSAGALTTPNRLP---GFITPPDARLTVRDLIGSGRTDSALIEYVQ----ETGF--V 166 (390) T ss_pred hhhhhhhhhHHHHHHHhhccccccCCcceechhhhH---HHHHHHhhhhhhhhhcceeeccCCceEEEE----EecC--C Confidence 00000000000000 0000001111 111222 234444455667889999999877632211 1110 0 Q ss_pred cccccccccCCCCccCccccccccccccccccccccccccccccccccccccccccccccccccccCccccccccccccc Q lcl|Aclame:pro 135 GAEAFHPTRQADASFSGQAAASTIADFPTTGAATDGTPYKAEVTTSGGDVSMRYFLALGAVTLAVAGQMTATEYTDGVAG 214 (514) Q Consensus 135 g~EA~~~~nEadt~fSG~~~~~~~~~~~~~~~~t~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~ 214 (514) + ++ .|- T Consensus 167 ~-~a---------~~v---------------------------------------------------------------- 172 (390) T protein:vir:81 167 N-NA---------AIV---------------------------------------------------------------- 172 (390) T ss_pred c-ce---------eee---------------------------------------------------------------- Confidence 0 00 000 Q ss_pred cccccccccccchhhhccccCCCCCCcccccceeEEEEEEEEeecccccccccHHHHHHHHhhcCCChhHHHHHHHHHHH Q lcl|Aclame:pro 215 GLLVEIDAGMATSQAELQENFNGSSNNEWNEMSFRIDKQVVEAKSRQLKAQYSIELAQDLRAVHGLDADAELSGILANEV 294 (514) Q Consensus 215 ~~~y~~~~GmtTa~aEal~~~ggs~~~~f~EMsFsIEK~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAEaELanILStEI 294 (514) ++| +.. ..+...|.++.+.+.|.. -...+|-||.+|-- +.++.|.+-|+..| T Consensus 173 ------~Eg------~~~----~~~~~~~~~i~~~~~k~~-------~~~~is~ell~d~~-----~~~~~i~~~l~~~~ 224 (390) T protein:vir:81 173 ------AEG------ALK----PESSLKFAKKTDTTHVIA-------HTMKATRQILSDAP-----QLASYMNNRLIRGL 224 (390) T ss_pred ------cCC------ccc----ccccceeeEEEEeeeEEE-------EeehhhHHHHHhHH-----HHHHHHHHHHHHHH Confidence 000 000 001123555555555554 45568999999842 46889999999999 Q ss_pred HHHhhHHHHHHHhhheeecccccccccCCcceecccccccc---ccchhhHHHHHHHHHHHHHHHHHHHHhcccccccEE Q lcl|Aclame:pro 295 MVELNREIVNLVNSQAQIGKSGWTQGAGAAGVFDFSDAVDV---KGARWAGEAYKALLIQIEKEANEIGRQTGRGNGNFI 371 (514) Q Consensus 295 mlEINReii~~l~~~a~v~~~~~~~~v~~~g~~dl~~~~d~---~~~rwa~e~~r~L~~~i~~~a~~I~~~T~r~~~n~~ 371 (514) ...+|+.||.- .-+ +-.+.|++........ .......+....++.++. ......+.+ T Consensus 225 ~~~~d~a~l~G---~g~--------~~~~~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---------~~~~~~~~~ 284 (390) T protein:vir:81 225 KVKEDAEILRG---TGA--------NDGLLGLIPQATTYAAPTTIAGATRVDQLRLAMLQAS---------LAEYNPSGI 284 (390) T ss_pred HHHHHHHHHhc---CCC--------CCcccceeecccccccccccccchhHHHHHHHHHhhc---------cccCCCCEE Confidence 99999988432 110 1112343322110000 001112233333333332 224566678 Q ss_pred EEChhHHhHHhhccccccchhccccCccccccccCceEEEEecCceEEEecCCCccceEEEEEecCCCcccceeeccccc Q lcl|Aclame:pro 372 IASRNVVSALSMTDTLVGPAAQGMQDGSMNTDTNQTVFAGVLGGRFKVYIDQYAVNDYFTVGFKGSTEMDAGVFYSPYVP 451 (514) Q Consensus 372 v~S~~va~~L~~~g~~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~kG~~~~~~~~fy~PYv~ 451 (514) |++|.....|.... + ..| ...-.+.... -.++|. |++|++.+..|.+-+++|---. .++.. ... T Consensus 285 v~~~~~~~~l~~lk--d---~~G---~~l~~~~~~~-~~~~l~-G~pv~~~~~~p~~~~~~gd~~~-----~~~~~-~~~ 348 (390) T protein:vir:81 285 VINPIDWAAIELAK--D---ANN---QYLIGNARGT-LTPTLW-GLPVVATQAMAPGEFLVGAFDL-----AAQIF-DQW 348 (390) T ss_pred EEcHHHHHHHHHhh--c---CCC---ceeecCcccc-cCceec-ceeeEEcCCCCCCcEEEEehhc-----eEEEE-Eec Confidence 99999988776421 1 001 0010111110 013453 5699999998876555553210 00000 000 Q ss_pred cccccccC----Cccccceeeeeeeeee-eecC--ccccccCcceeecC Q lcl|Aclame:pro 452 LTPLRGSD----SKNFQPVIGFKTRYGV-QVNP--FADPTASATKVGNG 493 (514) Q Consensus 452 ~~~~~~~d----p~s~qp~~~~~tRY~l-~~nP--f~~~~~~~~~i~~~ 493 (514) ...+...+ -.+-+=.+=...|++. ..+| |... .-+ T Consensus 349 ~~~v~~~~~~~~~~~~~v~~r~~~r~d~~v~~~~a~v~~-------t~a 390 (390) T protein:vir:81 349 DARVEIGYVGEDFQRNMITVLAEERLALVVYRPEALISG-------SFA 390 (390) T ss_pred ceEEEEecccchhhcCcEEEEEEEeeccEEecccceEEE-------EeC Confidence 00000111 0111223334556665 4444 2221 111 No 65 >protein:vir:2430 Length: 318 # NCBI annotation: major head subunit # Family: family:all:507 # MgeID: mge:52 # MgeName: D29 # Cross-refs: genbank:acc:NP_046832;genbank:gi:9630400;genbank:GeneID:1261582 Probab=60.91 E-value=0.36 Score=22.99 Aligned_cols=288 Identities=10% Similarity=0.046 Sum_probs=113.6 Q ss_pred HHhcccccchhhhhhhcccccccccccccccccccccccc-ccccccccce--eeehhhhhhhhhhhhcceeEEecCCcc Q lcl|Aclame:pro 41 INNDPMYRDPQLVEAFNAGLNEAVVNGDHGYDPANIAQGV-TTGAVTNIGP--TVMGMVRRAIPQLIAFDIAGVQPMTGP 117 (514) Q Consensus 41 ~~~~~~~~~~~~~~~~~~~~~~a~~~~~~g~~~~~~~~st-~tg~v~~~~P--~l~~l~Rra~~~LIa~DI~GVQPmTgP 117 (514) ++-+. ..++.+-...+ ++.+....=| ..=.+++.+-+..+-.+++.+.||+++ T Consensus 1 ~~~~~------------------------~~~~e~~~~~~~~~~~~~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~ 56 (318) T protein:vir:24 1 MAAGT------------------------AFAVDHAQIAQTGDTMFKGYLEPEQAKDYFAEAEKTSIVQQFAQKVPMGTT 56 (318) T ss_pred CCCCC------------------------CCCHHHHHhhcccCcccceeechhHHHHHHHHHHhhchhhhhcceeeccCC Confidence 11111 11111111111 1111111001 111133444466677888899999876 Q ss_pred cceeeeeeeeecCCCCccccccccccCCCCccCccccccccccccccccccccccccccccccccccccccccccccccc Q lcl|Aclame:pro 118 TSQVFTLRSVYGKDPLTGAEAFHPTRQADASFSGQAAASTIADFPTTGAATDGTPYKAEVTTSGGDVSMRYFLALGAVTL 197 (514) Q Consensus 118 TGLIFAMRSrY~~~~~tg~EA~~~~nEadt~fSG~~~~~~~~~~~~~~~~t~~~~~~~~~~~~~g~~~~~~~~~~~~~~~ 197 (514) +.-|- +... +.+|.+ T Consensus 57 ~~~ip----~~~~----~~~a~~--------------------------------------------------------- 71 (318) T protein:vir:24 57 GQKIP----HWVG----DVSAQW--------------------------------------------------------- 71 (318) T ss_pred ceEEE----EEeC----CcceEE--------------------------------------------------------- Confidence 43221 1110 000100 Q ss_pred cccCccccccccccccccccccccccccchhhhccccCCCCCCcccccceeEEEEEEEEeecccccccccHHHHHHHHhh Q lcl|Aclame:pro 198 AVAGQMTATEYTDGVAGGLLVEIDAGMATSQAELQENFNGSSNNEWNEMSFRIDKQVVEAKSRQLKAQYSIELAQDLRAV 277 (514) Q Consensus 198 ~~~~~~~~~~~~~~~a~~~~y~~~~GmtTa~aEal~~~ggs~~~~f~EMsFsIEK~tVtAKSRaLKAEYT~ELAQDLkAi 277 (514) ++ | +.++++...++++++.+.|..+....+|-||.+|-. T Consensus 72 ----------------------v~--------E---------g~~~~~~~~~f~~i~~~~~k~~~~~~iS~e~l~ds~-- 110 (318) T protein:vir:24 72 ----------------------IG--------E---------GDMKPITKGNMTSQTIAPHKIATIFVASAETVRANP-- 110 (318) T ss_pred ----------------------ec--------C---------CccccccccceeEEEEeeEEEEEeehhhHHHhhcCh-- Confidence 00 1 112333444556666666666667789999999844 Q ss_pred cCCChhHHHHHHHHHHHHHHhhHHHHHHHhhheeecccccccccCCcceecccccccc----ccchhhHHHHHHHHHHHH Q lcl|Aclame:pro 278 HGLDADAELSGILANEVMVELNREIVNLVNSQAQIGKSGWTQGAGAAGVFDFSDAVDV----KGARWAGEAYKALLIQIE 353 (514) Q Consensus 278 HGLDAEaELanILStEImlEINReii~~l~~~a~v~~~~~~~~v~~~g~~dl~~~~d~----~~~rwa~e~~r~L~~~i~ 353 (514) .|.+++|.+.|+..|...|++.++.-- -+ . .+.|++........ ...-+..+....++..+ T Consensus 111 --~~~~~~i~~~l~~~~~~~~d~a~l~G~---g~----~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~- 175 (318) T protein:vir:24 111 --ANYLGTMRTKVATAFAMAFDGAAMHGT---DS----P-----FPTYIGQTTKAISIADTTGATTVYDQVAVNGLSLL- 175 (318) T ss_pred --HHHHHHHHHHHHHHHHHHHHHhhhccc---CC----C-----CCcccccccccccccccccccchHHHHHHHHHHhh- Confidence 578999999999999999999995321 10 0 01111111100000 00011111222222222 Q ss_pred HHHHHHHHhcccccccEEEEChhHHhHHhhccccccchhccccCccccccc---cCceEE-EEecCceEEEecCCCccce Q lcl|Aclame:pro 354 KEANEIGRQTGRGNGNFIIASRNVVSALSMTDTLVGPAAQGMQDGSMNTDT---NQTVFA-GVLGGRFKVYIDQYAVNDY 429 (514) Q Consensus 354 ~~a~~I~~~T~r~~~n~~v~S~~va~~L~~~g~~~~~~~~~~~~~~~~~d~---~~~~~~-G~l~~~~~vy~D~y~~~dy 429 (514) .-.......+||+|.....|.... +. .|. .-...+. ....+. +.+ .+++|++.+..+..- T Consensus 176 --------~~~~~~~~~~v~n~~~~~~L~~lk--d~---~G~--~l~~~~~~~~~~~~~~~~~i-~g~pv~~~~~~~~~~ 239 (318) T protein:vir:24 176 --------VNDGKKWTHTLLDDITEPILNGAK--DQ---NGR--PLFIESTYGEAASPFRSGRI-VARPTILSDHVVEGT 239 (318) T ss_pred --------ccccCCCCEEEEcHHHHHHHHHhh--cc---CCc--eeecCccccCccccccCceE-EEEeeEEeCCCCCCc Confidence 222345567899999998887421 10 000 0000000 111111 112 245777777665311 Q ss_pred EEEEEecCCCcccceeecccccc--cccc------ccCCc----c-c---cceeeeeeeeeee-ecC--ccccccCccee Q lcl|Aclame:pro 430 FTVGFKGSTEMDAGVFYSPYVPL--TPLR------GSDSK----N-F---QPVIGFKTRYGVQ-VNP--FADPTASATKV 490 (514) Q Consensus 430 ~~vG~kG~~~~~~~~fy~PYv~~--~~~~------~~dp~----s-~---qp~~~~~tRY~l~-~nP--f~~~~~~~~~i 490 (514) . +++-|+- +.++|+-.-.+ ...+ ..|+. + | |=.+=...||+.. .+| |+..+.-. T Consensus 240 ~-~~~~gdf---s~~~~~~~~~l~i~~~~~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~i~~~~--- 312 (318) T protein:vir:24 240 T-VGFMGDF---SQLIWGQIGGLSFDVTDQATLNLGTVESPNFVSLWQHNLVAVRVEAEYAFHCNDAEAFVALTNVV--- 312 (318) T ss_pred c-EEEEeec---ceEEEEEecCeEEEEeeccceeccccccccchhhhhcCcEEEEEEEEEccEEecccceEEEEeec--- Confidence 0 0011111 01222211111 0000 01111 1 2 2333345677764 444 33211100 Q ss_pred ecCcchhhhcc Q lcl|Aclame:pro 491 GNGAPVAASMG 501 (514) Q Consensus 491 ~~~~~~~~~~~ 501 (514) .+-..| T Consensus 313 -----a~~~~~ 318 (318) T protein:vir:24 313 -----SGGGEG 318 (318) T ss_pred -----cCCCCC Confidence 000001 No 66 >protein:vir:9574 Length: 300 # NCBI annotation: gp40 # Family: family:all:966 # MgeID: mge:171 # MgeName: SM1 # Cross-refs: genbank:acc:NP_862879;genbank:gi:32469471;genbank:GeneID:1461316 Probab=59.79 E-value=0.39 Score=22.85 Aligned_cols=279 Identities=13% Similarity=0.109 Sum_probs=114.6 Q ss_pred cccccccccccccceeee-hhhhhhhhhhhhcceeEEecCCcccceeeeeeeeecCCCCccccccccccCCCCccCcccc Q lcl|Aclame:pro 76 IAQGVTTGAVTNIGPTVM-GMVRRAIPQLIAFDIAGVQPMTGPTSQVFTLRSVYGKDPLTGAEAFHPTRQADASFSGQAA 154 (514) Q Consensus 76 ~~~st~tg~v~~~~P~l~-~l~Rra~~~LIa~DI~GVQPmTgPTGLIFAMRSrY~~~~~tg~EA~~~~nEadt~fSG~~~ 154 (514) .++.++++... .-|.+. .++.++.+..+-.+++.+.||++-..-+. ++.. +.+| .|-+ T Consensus 1 ma~~t~~~G~l-ip~~~~~~ii~~l~~~s~i~~l~~~~~~~~~~~~~p----~~~~----~~~a---------~wv~--- 59 (300) T protein:vir:95 1 MSEAQLSKGNL-FNPELVTKVINKVKGHSSIAKLSPQKPIPFNGQREF----VFDF----DSDI---------DIVA--- 59 (300) T ss_pred CcccccCCcce-echhhHHHHHHHHHhhhhhhhhcceeeccCCceEEE----EEec----Ccce---------EEee--- Confidence 34444444332 122221 22233345556678899999876322111 1111 1111 1100 Q ss_pred ccccccccccccccccccccccccccccccccccccccccccccccCccccccccccccccccccccccccchhhhcccc Q lcl|Aclame:pro 155 ASTIADFPTTGAATDGTPYKAEVTTSGGDVSMRYFLALGAVTLAVAGQMTATEYTDGVAGGLLVEIDAGMATSQAELQEN 234 (514) Q Consensus 155 ~~~~~~~~~~~~~t~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~y~~~~GmtTa~aEal~~ 234 (514) | T Consensus 60 ---------------------------------------------------------------------------E---- 60 (300) T protein:vir:95 60 ---------------------------------------------------------------------------E---- 60 (300) T ss_pred ---------------------------------------------------------------------------C---- Confidence 1 Q ss_pred CCCCCCcccccceeEEEEEEEEeecccccccccHHHHHHHHhhcCCChhHHHHHHHHHHHHHHhhHHHHHHHhhheeecc Q lcl|Aclame:pro 235 FNGSSNNEWNEMSFRIDKQVVEAKSRQLKAQYSIELAQDLRAVHGLDADAELSGILANEVMVELNREIVNLVNSQAQIGK 314 (514) Q Consensus 235 ~ggs~~~~f~EMsFsIEK~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAEaELanILStEImlEINReii~~l~~~a~v~~ 314 (514) +.+.++...+++++++.+|.-+-...+|-||.+-... ..+|-+++|.+-|...|...+++.++.-... . T Consensus 61 -----g~~~~~s~~~f~~v~l~~~k~~~~~~iS~ell~~~~d-~~~~l~~~i~~~l~~aia~~~d~~~l~G~~~-----~ 129 (300) T protein:vir:95 61 -----NGKKTHGGVSLDPVTIVPLKVEYGARVSDEFLHASEE-AKVDMLTDFVEGFSKKLARGLDIMSIHGINP-----R 129 (300) T ss_pred -----CcccccccccceeeEeeeEEEEEeehhhHHHhccCCC-CHHHHHHHHHHHHHHHHHHHHHHhhhhcccC-----C Confidence 1123333444555666666556666799998753221 2355688888888888888888888533210 0 Q ss_pred cccccccCCcceeccccccccccchhhHHHHHHHHHHHHHHHHHHHHhcccccccEEEEChhHHhHHhhccccccchhcc Q lcl|Aclame:pro 315 SGWTQGAGAAGVFDFSDAVDVKGARWAGEAYKALLIQIEKEANEIGRQTGRGNGNFIIASRNVVSALSMTDTLVGPAAQG 394 (514) Q Consensus 315 ~~~~~~v~~~g~~dl~~~~d~~~~rwa~e~~r~L~~~i~~~a~~I~~~T~r~~~n~~v~S~~va~~L~~~g~~~~~~~~~ 394 (514) .+. +....|........... . .......+.-|.++...+.. -.++.+.+|++|+....|....- ..| T Consensus 130 ~g~--~~~~~~~~~~~~~~~~~-~---~~~~~~~~~~i~~~~~~~~~--~~~~~~~~vmn~~~~~~L~~lkd-----~~G 196 (300) T protein:vir:95 130 TKQ--ASTIIGDNCFDKKVTQT-V---PFKDTNPDESMEDAVGMIDG--SERDITGAILDPIFTTALSKMKN-----AEG 196 (300) T ss_pred CCC--Cccccccccccccccee-e---cccccchHHHHHHHHHHhhh--cCCCccEEEECHHHHHHHHHhhc-----cCC Confidence 110 00111111000000000 0 00011222333344333321 24667778999999887764211 111 Q ss_pred ccCccc-cccccCceEEEEecCceEEEecCCCcc------ceEEEEEecCCCcccceeeccccc--cccccccCCcc--- Q lcl|Aclame:pro 395 MQDGSM-NTDTNQTVFAGVLGGRFKVYIDQYAVN------DYFTVGFKGSTEMDAGVFYSPYVP--LTPLRGSDSKN--- 462 (514) Q Consensus 395 ~~~~~~-~~d~~~~~~~G~l~~~~~vy~D~y~~~------dy~~vG~kG~~~~~~~~fy~PYv~--~~~~~~~dp~s--- 462 (514) .+. ..+.++ -..++|.| ++|+++...+. +.+++|= +..+++|..... +....-.|+++ T Consensus 197 ---~~i~~~~~~~-~~~~~l~G-~Pv~~s~~v~~~~~~~~~~~~~GD-----f~~~~~~~~~~~~~~~v~~~~~~d~~~~ 266 (300) T protein:vir:95 197 ---GKLYPELAWG-GVPDAING-LAVDKNRTVSYSQTDPKNTAIVGD-----FETMFKWGYAKEVPMEIIKYGDPDNSGR 266 (300) T ss_pred ---CeeccCcccc-CCCceecc-eeeEEecCCCCCCCCCccEEEEee-----ccceEEEEEecccEEEEeeccCCCCcch Confidence 001 011111 11256655 59998887653 1222221 001111221110 11111112221 Q ss_pred --cc---ceeeeeeeeee-eecC--ccccccCcceeecCcchhhhccccceeeeeeeec Q lcl|Aclame:pro 463 --FQ---PVIGFKTRYGV-QVNP--FADPTASATKVGNGAPVAASMGKNAYFRRVFVKG 513 (514) Q Consensus 463 --~q---p~~~~~tRY~l-~~nP--f~~~~~~~~~i~~~~~~~~~~~~~~~~r~~~V~~ 513 (514) || =.+=+..|++. +.+| |... .++.| T Consensus 267 ~~f~~~~v~~r~~~r~d~~v~~~~a~~~l-------------------------~~~~g 300 (300) T protein:vir:95 267 DLKGYNQIYIRCEAYIGWGIMDAASFARI-------------------------VKTGG 300 (300) T ss_pred hhhhcCcEEEEEEEeecceeecccceEEE-------------------------ecCCC Confidence 11 22233446664 3355 3322 11111 No 67 >protein:vir:80684 Length: 315 # NCBI annotation: gp6 # Family: family:all:966 # MgeID: mge:1884 # MgeName: PA6 # Cross-refs: genbank:acc:YP_001285582;genbank:gi:148727088;genbank:GeneID:5247055 Probab=59.25 E-value=0.4 Score=22.79 Aligned_cols=289 Identities=12% Similarity=-0.018 Sum_probs=97.2 Q ss_pred ccCCCCccCccccccccccccccccccccccccccccccccccccccccccccccccccCcccccccccccccccccccc Q lcl|Aclame:pro 142 TRQADASFSGQAAASTIADFPTTGAATDGTPYKAEVTTSGGDVSMRYFLALGAVTLAVAGQMTATEYTDGVAGGLLVEID 221 (514) Q Consensus 142 ~nEadt~fSG~~~~~~~~~~~~~~~~t~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~y~~~ 221 (514) |.+.- ...++...........-. ........-.....-+.. .....+ ....+ T Consensus 1 Ma~~~---------------~~~gg~~vP~~~~~~ii~----~l~~~s~i~~l~~~i~~~-~~~~~i--------p~~~~ 52 (315) T protein:vir:80 1 MADDF---------------LSAGKLELPGSMIGAVRD----RAIDSGVLAKLSPEQPTI-FGPVKG--------AVFSG 52 (315) T ss_pred CCCCc---------------CCcCceEcchHHHHHHHH----HHHhhchhhhhcceeecC-CCceEE--------EEEeC Confidence 11100 000111111100000000 000000000000000000 000000 00011 Q ss_pred ccccchhhhccccCCCCCCcccccceeEEEEEEEEeecccccccccHHHHHHHHhhcCCChhHHHHHHHHHHHHHHhhHH Q lcl|Aclame:pro 222 AGMATSQAELQENFNGSSNNEWNEMSFRIDKQVVEAKSRQLKAQYSIELAQDLRAVHGLDADAELSGILANEVMVELNRE 301 (514) Q Consensus 222 ~GmtTa~aEal~~~ggs~~~~f~EMsFsIEK~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAEaELanILStEImlEINRe 301 (514) .+-..-.+| +..+++...+++++++.+|.-+-....|-||.+|-. .|+..+|.++|..++...|.|. T Consensus 53 ~~~a~wv~E---------g~~~~~s~~~f~~v~l~~~kl~~~~~iS~ell~~s~----~~~~~~l~~~i~~~la~ai~~~ 119 (315) T protein:vir:80 53 VPRAKIVGE---------GEVKPSASVDVSAFTAQPIKVVTQQRVSDEFMWADA----DYRLGVLQDLISPALGASIGRA 119 (315) T ss_pred CcceEEeeC---------CccccccccceeeeEeeeeeEEeeehhhHHHhhcCc----hhHHHHHHHHHHHHHHHHHHHH Confidence 111111122 334566666777777777766666679999998843 4566677777777776666666 Q ss_pred HHHHHhhheeecccccccccCCcceeccccccccccchhhHHHHHHHHHHHHHHHHHHHHhcccccccEEEEChhHHhHH Q lcl|Aclame:pro 302 IVNLVNSQAQIGKSGWTQGAGAAGVFDFSDAVDVKGARWAGEAYKALLIQIEKEANEIGRQTGRGNGNFIIASRNVVSAL 381 (514) Q Consensus 302 ii~~l~~~a~v~~~~~~~~v~~~g~~dl~~~~d~~~~rwa~e~~r~L~~~i~~~a~~I~~~T~r~~~n~~v~S~~va~~L 381 (514) +-..+.. .+ + .+. +-...|..+.-... .+ .++..-..+.-+.++...+.... ....+-.|++|+....| T Consensus 120 ~d~a~~~-G~-~-~~~--~~~~~~~~~~~~~~-~~----~~~~~~~~~~d~~~~~~~~~~~~-~~~~~~~imn~~~~~~L 188 (315) T protein:vir:80 120 VDLIAFH-GI-D-PAT--GKAASAVHTSLNKT-KN----IVDATDSATADLVKAVGLIAGAG-LQVPNGVALDPAFSFAL 188 (315) T ss_pred Hhhheee-cc-C-CCC--Cccccccccccccc-cc----eeeccccchHHHHHHHHHHhhcc-CccceEEEEcHHHHHHH Confidence 5333321 10 0 000 00111111110000 00 00000011111222222222111 12345688999998888 Q ss_pred hhccccccchhccccCccccccccCceEEEEecCceEEEecCCCccc---------e--------EEEEEecCCCcccce Q lcl|Aclame:pro 382 SMTDTLVGPAAQGMQDGSMNTDTNQTVFAGVLGGRFKVYIDQYAVND---------Y--------FTVGFKGSTEMDAGV 444 (514) Q Consensus 382 ~~~g~~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~d---------y--------~~vG~kG~~~~~~~~ 444 (514) ...-.....+..+..--+. ...+. .++|.| ++|+++.+.+.+ . +.+|+.+... + T Consensus 189 ~~l~~~~g~~~~g~~~~~~--~~~g~--~~tl~G-~PV~~~~~~~~~~~~~~~~~~~~~~GDfs~~~~g~~~~~~----i 259 (315) T protein:vir:80 189 STEVYPKGSPLAGQPMYPA--AGFAG--LDNWRG-LNVGASSTVSGAPEMSPASGVKAIVGDFSRVHWGFQRNFP----I 259 (315) T ss_pred HHHhhccCCcccccccccc--cccCC--Cceecc-eeeEecCcCCcccccccccccEEEEeecccEEEEEecCee----E Confidence 7542211111111100000 00111 145665 699988887531 1 2222222111 1 Q ss_pred eeccccccccccccC--Ccc-ccc-eeeee--eeeee-eecC--ccccccCcceeecCcchhhhcccc Q lcl|Aclame:pro 445 FYSPYVPLTPLRGSD--SKN-FQP-VIGFK--TRYGV-QVNP--FADPTASATKVGNGAPVAASMGKN 503 (514) Q Consensus 445 fy~PYv~~~~~~~~d--p~s-~qp-~~~~~--tRY~l-~~nP--f~~~~~~~~~i~~~~~~~~~~~~~ 503 (514) -..+|.- .| +.+ ||. .++|. .|+|. ..+| |...+.-. .+...-.+.| T Consensus 260 ~i~~~~~------~~~~~~~~~~~~~v~~r~~~r~~~~v~~~~a~~~l~~~~------a~~~~~~~~~ 315 (315) T protein:vir:80 260 ELIEYGD------PDQTGRDLKGHNEVMVRAEAVLYVAIESLDSFAVVKEKA------APKPNPPAEN 315 (315) T ss_pred EEecccc------ccCcccchhhcCcEEEEEEEEecceeecccceEEEeecc------CCCCCCCCCC Confidence 1112210 01 011 221 13333 45554 3455 33221111 1222223333 No 68 >protein:vir:1239 Length: 274 # NCBI annotation: similar to phage B1 major head protein # Family: family:all:522 # MgeID: mge:25 # MgeName: phi ETA # Cross-refs: genbank:acc:NP_510938;genbank:gi:17426272;genbank:GeneID:927376 Probab=54.85 E-value=0.49 Score=22.27 Aligned_cols=269 Identities=10% Similarity=-0.041 Sum_probs=109.3 Q ss_pred CccccccccccCCCCccCccccccccccccccccccccccccccccccccccccccccccccccccccCccccccccccc Q lcl|Aclame:pro 133 LTGAEAFHPTRQADASFSGQAAASTIADFPTTGAATDGTPYKAEVTTSGGDVSMRYFLALGAVTLAVAGQMTATEYTDGV 212 (514) Q Consensus 133 ~tg~EA~~~~nEadt~fSG~~~~~~~~~~~~~~~~t~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 212 (514) +. ...|..+-. ..+..... .+..... ............. .+. .. T Consensus 1 ma---------~~~T~l~d~---------------iiPev~~~-------~v~~~~~--~~l~~~~~~~~d~--~l~-g~ 44 (274) T protein:vir:12 1 MA---------QGLTKTSNQ---------------IIPEVLAP-------MMQAQLE--KKLRFASFAEVDS--TLQ-GQ 44 (274) T ss_pred CC---------cceeehhhh---------------hchHHHHH-------HHHHHHH--hhhhhcccceecc--ccc-CC Confidence 00 000000000 00000000 0000000 0000000000000 000 00 Q ss_pred cccccccccccccchhhhccccCCCCCCcccccceeEEEEEEEEeecccccccccHHHHHHHHhhcCCChhHHHHHHHHH Q lcl|Aclame:pro 213 AGGLLVEIDAGMATSQAELQENFNGSSNNEWNEMSFRIDKQVVEAKSRQLKAQYSIELAQDLRAVHGLDADAELSGILAN 292 (514) Q Consensus 213 a~~~~y~~~~GmtTa~aEal~~~ggs~~~~f~EMsFsIEK~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAEaELanILSt 292 (514) .+.+.+++.--....+|.. .....-...++..+ +.+++-+-|.-. |.+.=-..+.+ +-|.-.|..+-++. T Consensus 45 -~G~tv~iP~~~~ig~a~~~---~~g~~i~~~~lt~~--~~~~~i~~~~~~--~~i~D~~~~~~--~~d~~~~~~~q~~~ 114 (274) T protein:vir:12 45 -PGDTLTFPAFVYSGDAQVV---AEGEKIPTDILETK--KREAKIRKIAKG--TSITDEALLSG--YGDPQGEQVRQHGL 114 (274) T ss_pred -CCCEEEEeeecCCCccccc---cCCCccchhhcccc--eeeEEeeeecce--eeecHHHHHhc--ccchHHHHHHHHHH Confidence 1222222211112223322 11122334444433 334444444322 32211122333 46888999999999 Q ss_pred HHHHHhhHHHHHHHhhheeecccccccccCCcceeccccccccccchhhHHHHHHHHHHHHHHHHHHHHhcccccccEEE Q lcl|Aclame:pro 293 EVMVELNREIVNLVNSQAQIGKSGWTQGAGAAGVFDFSDAVDVKGARWAGEAYKALLIQIEKEANEIGRQTGRGNGNFII 372 (514) Q Consensus 293 EImlEINReii~~l~~~a~v~~~~~~~~v~~~g~~dl~~~~d~~~~rwa~e~~r~L~~~i~~~a~~I~~~T~r~~~n~~v 372 (514) -|..+++.+++..+..... . + ....++ .+-+-....++.++. ..++++| T Consensus 115 ~~a~~vd~~~l~~~~~a~~-----~---~-~~~a~~-------------~d~i~dA~~~lgd~~---------~~~~~iv 163 (274) T protein:vir:12 115 AHANKVDNDVLEALMGAKL-----T---V-NADITK-------------LNGLQSAIDKFNDED---------LEPMVLF 163 (274) T ss_pred HHHHHHHHHHHHHHhcccc-----c---c-cccccC-------------HHHHHHHHHHhcccc---------ccccEEE Confidence 9999999999877743211 0 1 111111 222333333333332 3578999 Q ss_pred EChhHHhHHhhccccccchhccccCccccccccCceEEEEecCceEEEecCCCccce-EEEEEecCCCcccceeeccccc Q lcl|Aclame:pro 373 ASRNVVSALSMTDTLVGPAAQGMQDGSMNTDTNQTVFAGVLGGRFKVYIDQYAVNDY-FTVGFKGSTEMDAGVFYSPYVP 451 (514) Q Consensus 373 ~S~~va~~L~~~g~~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy-~~vG~kG~~~~~~~~fy~PYv~ 451 (514) |+|.|++.|.......+....... . ....+-..|.+. |++||+|...|..- +++| +|.- .||. --+ T Consensus 164 v~p~~~~~L~k~~~~~fv~~s~~g---~--~~~~~G~ig~~~-G~~Vi~s~~~p~~t~~l~~-~gA~-----~~~~-~~~ 230 (274) T protein:vir:12 164 INPLDAGKLRGDASTNFTRATELG---D--DIIVKGAFGEAL-GAIIVRSNKLEAGTAILAK-KGAV-----KLIL-KRD 230 (274) T ss_pred eCHHHHHHHHhhhhhhcccccccc---c--cceecccceeec-CeeEEEeCCCCcceEEEEe-ccce-----eeee-cCC Confidence 999999998875433222211100 0 111111235564 67999999887532 2222 1211 1221 112 Q ss_pred ccccc-ccCCccccceeeeeeeeee-eecCc--cccccCcceeecCcch Q lcl|Aclame:pro 452 LTPLR-GSDSKNFQPVIGFKTRYGV-QVNPF--ADPTASATKVGNGAPV 496 (514) Q Consensus 452 ~~~~~-~~dp~s~qp~~~~~tRY~l-~~nPf--~~~~~~~~~i~~~~~~ 496 (514) .. ++ ..||..++=.+-..-+||+ ..||= ...+-... +..| T Consensus 231 ~~-vE~~Rd~~~~~d~i~~~~~y~~~~~~~~~vv~~t~~~~----~~~~ 274 (274) T protein:vir:12 231 FF-LEVARDASTKTTALYSDKHYVAYLYDESKAVKITKGSG----SLEM 274 (274) T ss_pred ce-eccccchhhcccEEEeeeEEEEEEEcCCceEEEEcCCc----cccC Confidence 11 23 3699999999999999996 45661 11111100 1111 No 69 >protein:vir:105038 Length: 428 # NCBI annotation: major capsid head protein precursor # Family: family:all:21 # MgeID: mge:1465 # MgeName: phiKO2 # Cross-refs: genbank:acc:YP_006586;genbank:gi:46402092;genbank:GeneID:2777903 Probab=53.99 E-value=0.51 Score=22.17 Aligned_cols=325 Identities=16% Similarity=0.175 Sum_probs=110.3 Q ss_pred Ccchhhh-----hhhhccccccccccc---cchhhhh----hhhhhh------hHHHHHHhcccccchhhhhhhcccccc Q lcl|Aclame:pro 1 MNLTEKW-----KDLLEAEGADMPEIA---TATKQKI----MSKIFE------NQDRDINNDPMYRDPQLVEAFNAGLNE 62 (514) Q Consensus 1 ~~l~~kw-----~p~l~~~~~~~~~i~---~~~~~~~----~~~~~e------nq~~~~~~~~~~~~~~~~~~~~~~~~~ 62 (514) +.=+|+= .|+-.......+.+. ...+..- ...+.+ |..+..++.. .++....+ T Consensus 56 ~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~------- 126 (428) T protein:vir:10 56 MEATERAAALVAKPVKATQHGPAVIVKAEPKQYTGAGMTRMVMSIAAAQGNLQDAAKFASDEL--NDQSVSMA------- 126 (428) T ss_pred HHHHHHHHHHHhhhhhchhhccccccccccchhhhHHHHHHHHHHHHhhhhHHHHHHHhhhhh--hhhhHhhh------- Confidence 0000000 011110000000000 0000000 011111 1111111110 00000000 Q ss_pred ccccccccccccccccccccccc--c-ccceeeehhhhhhhhhhhhcceeEEecCCcccceeeeeeeeecCCCCcccccc Q lcl|Aclame:pro 63 AVVNGDHGYDPANIAQGVTTGAV--T-NIGPTVMGMVRRAIPQLIAFDIAGVQPMTGPTSQVFTLRSVYGKDPLTGAEAF 139 (514) Q Consensus 63 a~~~~~~g~~~~~~~~st~tg~v--~-~~~P~l~~l~Rra~~~LIa~DI~GVQPmTgPTGLIFAMRSrY~~~~~tg~EA~ 139 (514) +...+++|.+ + .+.+-++.+.| +..+..++ |+..+++++|-+- |+-.. ++.++. T Consensus 127 -------------~~~~~~~gg~liP~~~~~~ii~~l~---~~~~l~~~-~~~~~~~~~g~~~-----~p~~~-~~~~a~ 183 (428) T protein:vir:10 127 -------------ISTAAGSGGVLIPQNIHSEVIELLR---DRTIVRKL-GARSIPLPNGNMS-----LPRLA-GGATAS 183 (428) T ss_pred -------------hcccccCCccccchhHHHHHHHHHh---hhchhhhh-cceeeecCCcceE-----EEEEe-CCccee Confidence 0111111111 1 11222233332 33344444 2222222223211 11000 000000 Q ss_pred ccccCCCCccCccccccccccccccccccccccccccccccccccccccccccccccccccCcccccccccccccccccc Q lcl|Aclame:pro 140 HPTRQADASFSGQAAASTIADFPTTGAATDGTPYKAEVTTSGGDVSMRYFLALGAVTLAVAGQMTATEYTDGVAGGLLVE 219 (514) Q Consensus 140 ~~~nEadt~fSG~~~~~~~~~~~~~~~~t~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~y~ 219 (514) | T Consensus 184 ---------~---------------------------------------------------------------------- 184 (428) T protein:vir:10 184 ---------Y---------------------------------------------------------------------- 184 (428) T ss_pred ---------e---------------------------------------------------------------------- Confidence 0 Q ss_pred ccccccchhhhccccCCCCCCcccccceeEEEEEEEEeecccccccccHHHHHHHHhhcCCChhHHHHHHHHHHHHHHhh Q lcl|Aclame:pro 220 IDAGMATSQAELQENFNGSSNNEWNEMSFRIDKQVVEAKSRQLKAQYSIELAQDLRAVHGLDADAELSGILANEVMVELN 299 (514) Q Consensus 220 ~~~GmtTa~aEal~~~ggs~~~~f~EMsFsIEK~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAEaELanILStEImlEIN 299 (514) + +| +...++...++++++...|.-+-...+|-||.+|- ..|.++.|.+-|...|...+| T Consensus 185 v--------~E---------g~~~~~~~~~f~~i~~~~~k~~~~v~is~ell~ds----~~~l~~~i~~~l~~ai~~~~d 243 (428) T protein:vir:10 185 T--------GE---------NQDAKVSEARFDDVKLTAKTMIAMVPISNALIGRA----GFNVEQLVLQDILTAISVRED 243 (428) T ss_pred e--------cc---------CccccccccceeeEEeeeEEEEEeehhhHHHHhhh----hHHHHHHHHHHHHHHHHHHHH Confidence 0 01 12334444556666666666666788999999984 245688888888888888888 Q ss_pred HHHHHHHhhheeecccccccccCCcceeccccccc-----cccchhhHHHHHHHHHHHHHHHHHHHHhcccccccEEEEC Q lcl|Aclame:pro 300 REIVNLVNSQAQIGKSGWTQGAGAAGVFDFSDAVD-----VKGARWAGEAYKALLIQIEKEANEIGRQTGRGNGNFIIAS 374 (514) Q Consensus 300 Reii~~l~~~a~v~~~~~~~~v~~~g~~dl~~~~d-----~~~~rwa~e~~r~L~~~i~~~a~~I~~~T~r~~~n~~v~S 374 (514) +.||. -.-+ +..+.|++.-..... ...+--..+....+...+ .+...+... ..+....|++ T Consensus 244 ~~~l~---G~G~--------~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~--~~~~~~~v~n 309 (428) T protein:vir:10 244 KAFMR---DDGT--------GDTPIGMKARATQWNRLLPWAADAAVNLDTIDTYLDSI-ILMSMDGNS--NMISSGWGMS 309 (428) T ss_pred HHHhc---cCCC--------CccccccccccccccccccccccccccHHHHHHHHHHH-HHhhhcccc--ccccCEEEEc Confidence 88842 1110 111233322110000 000000112222222222 222333222 2234456779 Q ss_pred hhHHhHHhhccccccchhccccCccccccccCceEEEEecCceEEEecCCCccc----------------eEEEEEecCC Q lcl|Aclame:pro 375 RNVVSALSMTDTLVGPAAQGMQDGSMNTDTNQTVFAGVLGGRFKVYIDQYAVND----------------YFTVGFKGST 438 (514) Q Consensus 375 ~~va~~L~~~g~~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~d----------------y~~vG~kG~~ 438 (514) +.....|.... + ..| .+.-.+.. .|+|. |++||++.+.+.+ ++++|..+.- T Consensus 310 ~~~~~~L~~lk--d---~~G---~~i~~~~~----~g~l~-G~pv~~~~~~p~~~~~~~~~~~i~~gd~s~~~i~~~~~i 376 (428) T protein:vir:10 310 NRTYMKLFGLR--D---GNG---NKVYPEMA----QGMLK-GYPIQRTSAIPANLGEGGKESEIYFADFNDVVIGEDGNM 376 (428) T ss_pred HHHHHHHHHhh--c---cCC---ceeccCCC----CCeee-ceeeEEeccccccccCCCccceEEEEecceEEEEEecce Confidence 98887776531 0 000 01111111 24554 5599998876543 1333433333 Q ss_pred CcccceeeccccccccccccCCccc---cceeeeeeeeeeeec-C--ccccccCcceeecCcch Q lcl|Aclame:pro 439 EMDAGVFYSPYVPLTPLRGSDSKNF---QPVIGFKTRYGVQVN-P--FADPTASATKVGNGAPV 496 (514) Q Consensus 439 ~~~~~~fy~PYv~~~~~~~~dp~s~---qp~~~~~tRY~l~~n-P--f~~~~~~~~~i~~~~~~ 496 (514) +.+ .+||..........-..| +=.+=...|+++.+- | |+. ..+-.| T Consensus 377 ~i~----~~~~~~~~~~~~~~~~~f~~~~~~~R~~~r~d~~v~~p~a~~~--------~t~~~~ 428 (428) T protein:vir:10 377 KVD----FSKEASYIDTDGKLVSAFSRNQSLIRVVTEHDIGFRHPEGLVL--------GTGVLF 428 (428) T ss_pred EEE----eecccccccccccccchhhcchhheeeeeeeCceeeccceEEE--------EeccCC Confidence 221 122211110000000011 122235566666432 5 322 222233 No 70 >protein:vir:94771 Length: 298 # NCBI annotation: major head protein # Family: family:all:966 # MgeID: mge:1529 # MgeName: phi LC3 # Cross-refs: genbank:acc:NP_996706;genbank:gi:45597421;genbank:GeneID:2769044 Probab=53.39 E-value=0.53 Score=22.10 Aligned_cols=281 Identities=11% Similarity=-0.005 Sum_probs=92.8 Q ss_pred cccccccccccccccccccccccccccccccccccccccccccccccCccccccccccccccccccccccccchhhhccc Q lcl|Aclame:pro 154 AASTIADFPTTGAATDGTPYKAEVTTSGGDVSMRYFLALGAVTLAVAGQMTATEYTDGVAGGLLVEIDAGMATSQAELQE 233 (514) Q Consensus 154 ~~~~~~~~~~~~~~t~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~y~~~~GmtTa~aEal~ 233 (514) .. ..++...........- ...............-.... ....+ .+..+.+-..-.+| T Consensus 1 ma-------~~gG~lip~~~~~~ii----~~~~~~s~i~~~~~~~~~~~-~~~~~--------p~~~~~~~a~~v~E--- 57 (298) T protein:vir:94 1 MV-------LNKGTLFDPELVTDLI----SKVAGKSSIARLSAQKPIPF-NGEKV--------FTFTMDSEIDVVAE--- 57 (298) T ss_pred Ce-------eccccccChhHHHHHH----HHHHhhchhhhhcceeeccC-CceEE--------EEEecCcceEEeeC--- Confidence 00 0000111000000000 00000000000000000000 00000 00001000001112 Q ss_pred cCCCCCCcccccceeEEEEEEEEeecccccccccHHHHHHHHhhcCCChhHHHHHHHHHHHHHHhhHHHHHHHhhheeec Q lcl|Aclame:pro 234 NFNGSSNNEWNEMSFRIDKQVVEAKSRQLKAQYSIELAQDLRAVHGLDADAELSGILANEVMVELNREIVNLVNSQAQIG 313 (514) Q Consensus 234 ~~ggs~~~~f~EMsFsIEK~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAEaELanILStEImlEINReii~~l~~~a~v~ 313 (514) +.+++|-..+++.++...|.-+-....|-||.|+--. -..+-+++|.+-|+..|..+|+..++.-...- + + T Consensus 58 ------g~~~~~~~~~f~~v~l~~~k~~~~~~iS~ell~~~~~-~~~~l~~~i~~~la~ai~~~~d~~~l~G~~~~-~-g 128 (298) T protein:vir:94 58 ------SGKKTHGGVTLAPQTMVPIKVEYGARISDEFMYASDE-EKINILQAFNDGFAKKVARGIDLMAFHGVNPR-L-G 128 (298) T ss_pred ------CccccccccceeEEEEeeeEEEEeeehhHHHhccCCc-cHHHHHHHHHHHHHHHHHHHHHHHhhcccccC-C-C Confidence 2345555555666666665555567789998764221 01223556666666666666666664321000 0 0 Q ss_pred ccccccccCCcceeccccccccccchhhHHHHHHHHHHHHHHHHHHHHhcccccccEEEEChhHHhHHhhccccccchhc Q lcl|Aclame:pro 314 KSGWTQGAGAAGVFDFSDAVDVKGARWAGEAYKALLIQIEKEANEIGRQTGRGNGNFIIASRNVVSALSMTDTLVGPAAQ 393 (514) Q Consensus 314 ~~~~~~~v~~~g~~dl~~~~d~~~~rwa~e~~r~L~~~i~~~a~~I~~~T~r~~~n~~v~S~~va~~L~~~g~~~~~~~~ 393 (514) . .....+..+.......... .......++.-+.++...+.. ...+....|++|+....|.... + .. T Consensus 129 ~--~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~i~~~~~~~~~--~~~~~~~~vmn~~~~~~l~~lk--d---~~ 194 (298) T protein:vir:94 129 T--ASAVIGTNHFDSKVTQKVE-----APRGIADPNGAIENAVELLTG--VDADVTGIAINPSFRSALAKQK--D---LQ 194 (298) T ss_pred c--ccccccccccccccccccc-----cccccccHHHHHHHHHHhhhh--cCCCccEEEEcHHHHHHHHHhh--c---cC Confidence 0 0011111111111000000 000111222333444333322 1345667999999998886521 1 11 Q ss_pred cccCccccccccCceEEEEecCceEEEecCCCcc------ceEEEEEecCCCcccceeeccccccc--cccccCCcc--- Q lcl|Aclame:pro 394 GMQDGSMNTDTNQTVFAGVLGGRFKVYIDQYAVN------DYFTVGFKGSTEMDAGVFYSPYVPLT--PLRGSDSKN--- 462 (514) Q Consensus 394 ~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~------dy~~vG~kG~~~~~~~~fy~PYv~~~--~~~~~dp~s--- 462 (514) |..- ...+.++ -..|+|.| ++|++++.-+. +.+++| +-. .++.|...-.+. ..+..||+. T Consensus 195 G~~l--~~~~~~~-~~~~tl~G-~PV~~~~~v~~~~~~~~~~~~~G---dfs--~~~~~~~~~~~~~~~~~~~~~d~~~~ 265 (298) T protein:vir:94 195 GNAL--FPELKWG-ATPDTING-LPVDVNKTVSDMSLTQRDRAIIG---DFA--NGFKWGYAKEVPLEVIQYGDPDNSGL 265 (298) T ss_pred CCee--ecCcccC-CCCceecc-eeeEEecccccccCCCccEEEEe---ecc--ceEEEEEecCceEEEeecCCCcCcch Confidence 1100 0111111 11145654 59998876542 222222 111 112222221111 111112221 Q ss_pred --cc-ceeee--eeeeeee-ecCccccccCcceeecCc Q lcl|Aclame:pro 463 --FQ-PVIGF--KTRYGVQ-VNPFADPTASATKVGNGA 494 (514) Q Consensus 463 --~q-p~~~~--~tRY~l~-~nPf~~~~~~~~~i~~~~ 494 (514) || =.++| ..|++.. .+| ....++...+ T Consensus 266 ~~f~~~~v~~r~~~r~~~~~~~~-----~a~~~l~~~t 298 (298) T protein:vir:94 266 DLKGYNQVYIRAELFLGWGILDA-----TKFARVTEAN 298 (298) T ss_pred hhhhcCcEEEEEEEEeccEeecc-----cceEEEEecC Confidence 22 12334 4566653 444 1123333333 No 71 >protein:vir:107593 Length: 392 # NCBI annotation: major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1491 # MgeName: Gamma # Cross-refs: genbank:acc:YP_338188;genbank:gi:77020144;genbank:GeneID:3703724 Probab=52.83 E-value=0.54 Score=22.03 Aligned_cols=321 Identities=12% Similarity=0.080 Sum_probs=118.9 Q ss_pred Ccchhhhhhh---hccccccc----------cccccchhhhhhhhhhhhHHHHHHhcccccchhhhhhhccccccccccc Q lcl|Aclame:pro 1 MNLTEKWKDL---LEAEGADM----------PEIATATKQKIMSKIFENQDRDINNDPMYRDPQLVEAFNAGLNEAVVNG 67 (514) Q Consensus 1 ~~l~~kw~p~---l~~~~~~~----------~~i~~~~~~~~~~~~~enq~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~ 67 (514) -.|.++.... .+.+.... .+-...+|++ ....|.+++..-. ..+.+... .+ T Consensus 39 ~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~l~~~~~~~~---------~~~~~~~~-~~----- 102 (392) T protein:vir:10 39 RSLQKKIDLQRSLDEAETEERNNGREVETRNVDGEMEYRDV-FMKALRNKPLNAE---------EREFLEDD-LE----- 102 (392) T ss_pred HHHHHHHHHHHHHHHHHHHHhhccccccccCccchHHHHHH-HHHHHhcccccHH---------HHHHHhhh-hh----- Confidence 1222222111 00000000 0011111111 1111111111000 00000000 00 Q ss_pred cccccccccccccc-ccc--c-cccceeeehhhhhhhhhhhhcceeEEecCCcccceeeeeeeeecCCCCcccccccccc Q lcl|Aclame:pro 68 DHGYDPANIAQGVT-TGA--V-TNIGPTVMGMVRRAIPQLIAFDIAGVQPMTGPTSQVFTLRSVYGKDPLTGAEAFHPTR 143 (514) Q Consensus 68 ~~g~~~~~~~~st~-tg~--v-~~~~P~l~~l~Rra~~~LIa~DI~GVQPmTgPTGLIFAMRSrY~~~~~tg~EA~~~~n 143 (514) .......++ .|. | ..+.+-++.++ ..+..-.+++.+.||++++|-+. ..+..+ +.++ T Consensus 103 -----~~~~~~~t~~~gg~~vP~~~~~~ii~~~---~~~s~l~~~~~~~~~~~~~~~~~--~~~~~~----~~~a----- 163 (392) T protein:vir:10 103 -----QRAMSGLTGEDGGLVIPQDIQTQINELA---RSFDALEQYVTVEPVRTRSGSRV--LEKNSD----MIPF----- 163 (392) T ss_pred -----hhhccccccCCCceecchhHHHHHHHHH---HhhhhhhhhceeeeccCCceeEE--EEeecC----Cccc----- Confidence 000011111 111 1 11223333333 44555668999999999877432 111111 0000 Q ss_pred CCCCccCccccccccccccccccccccccccccccccccccccccccccccccccccCcccccccccccccccccccccc Q lcl|Aclame:pro 144 QADASFSGQAAASTIADFPTTGAATDGTPYKAEVTTSGGDVSMRYFLALGAVTLAVAGQMTATEYTDGVAGGLLVEIDAG 223 (514) Q Consensus 144 Eadt~fSG~~~~~~~~~~~~~~~~t~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~y~~~~G 223 (514) .|-+ ++ T Consensus 164 ----~~v~----------------------------------------------------------------------E~ 169 (392) T protein:vir:10 164 ----AEIT----------------------------------------------------------------------EM 169 (392) T ss_pred ----eeec----------------------------------------------------------------------cc Confidence 0000 00 Q ss_pred ccchhhhccccCCCCCCcccccceeEEEEEEEEeecccccccccHHHHHHHHhhcCCChhHHHHHHHHHHHHHHhhHHHH Q lcl|Aclame:pro 224 MATSQAELQENFNGSSNNEWNEMSFRIDKQVVEAKSRQLKAQYSIELAQDLRAVHGLDADAELSGILANEVMVELNREIV 303 (514) Q Consensus 224 mtTa~aEal~~~ggs~~~~f~EMsFsIEK~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAEaELanILStEImlEINReii 303 (514) ++. ..+....|.++.|...|..+ ...+|-||.+|- ..|.+++|.+.|...|...+|..|+ T Consensus 170 -----~~~----~~~~~~~~~~v~l~~~k~~~-------~~~iS~ell~ds----~~~l~~~i~~~l~~~i~~~~d~~~~ 229 (392) T protein:vir:10 170 -----GEI----PETDNPKFSNVQYAVKDRAG-------ILPLSRSLLQDS----DQNILKYVTKWLGKKSKVTRNVLIL 229 (392) T ss_pred -----ccc----cccccccceeEEeeeeeEEE-------eehhhHHHHhhh----HHHHHHHHHHHHHHHHHHHHHHHHh Confidence 000 00112235555555555544 455999999994 2567889999999999999998885 Q ss_pred HHHhhheeecccccccccCCcceeccccccccccchhhHHHHHHHHHHHHHHHHHHHHhcccccccEEEEChhHHhHHhh Q lcl|Aclame:pro 304 NLVNSQAQIGKSGWTQGAGAAGVFDFSDAVDVKGARWAGEAYKALLIQIEKEANEIGRQTGRGNGNFIIASRNVVSALSM 383 (514) Q Consensus 304 ~~l~~~a~v~~~~~~~~v~~~g~~dl~~~~d~~~~rwa~e~~r~L~~~i~~~a~~I~~~T~r~~~n~~v~S~~va~~L~~ 383 (514) .-.... .+.+...+ +....++.... .+. ....-..|++|.....|.. T Consensus 230 ~g~g~~------------~~~~~~~~-------------d~i~~~~~~~l--~~~------~~~~a~~vm~~~~~~~L~~ 276 (392) T protein:vir:10 230 GVIEKL------------TKQAIKSL-------------DDIKDVLNVKL--DPA------ISPNAILLTNQDGFNYLDK 276 (392) T ss_pred hccccc------------cccCccCH-------------HHHHHHHHHhh--hhh------hccCCEEEEcHHHHHHHHH Confidence 433111 11222221 12333221111 111 1123457899999888875 Q ss_pred ccccccchhccccCccccccccCceEEEEecCceEEEecCCCccceEEEEEecCCCcccceeeccccc----cc--ccc- Q lcl|Aclame:pro 384 TDTLVGPAAQGMQDGSMNTDTNQTVFAGVLGGRFKVYIDQYAVNDYFTVGFKGSTEMDAGVFYSPYVP----LT--PLR- 456 (514) Q Consensus 384 ~g~~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~kG~~~~~~~~fy~PYv~----~~--~~~- 456 (514) . .. ..| ...-.+....-..++|.|...|+++.... ++.+|...-+..++|+.+-. .. .+. T Consensus 277 l---kd--~~G---~~l~~~~~~~~~~~tllG~~~v~~~~~~~-----~~~~~~~~~~~~~~~gdfs~~~~i~~~~~~~~ 343 (392) T protein:vir:10 277 L---KD--KDG---KYILQSDPTQKNKKLFAGTNPVVVVSNRF-----LKSKGTTAKKAPLIIGDLKEAIVLFKREDMEL 343 (392) T ss_pred h---hc--cCC---CeEeecCccCCccccccCcccEEEecccc-----cCCCcccCCceEEEEEehhceEEEEeecceEE Confidence 3 10 000 00100111111235677765666543221 11112111122233332211 00 000 Q ss_pred ccCC------ccccceeeeeeeeee-eecC--ccccccCcceeecCcchhhhcc Q lcl|Aclame:pro 457 GSDS------KNFQPVIGFKTRYGV-QVNP--FADPTASATKVGNGAPVAASMG 501 (514) Q Consensus 457 ~~dp------~s~qp~~~~~tRY~l-~~nP--f~~~~~~~~~i~~~~~~~~~~~ 501 (514) .+++ .+.+=.+-...|++. ..+| |...+-..+ .+-..-+| T Consensus 344 ~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~~-----a~~~~~~~ 392 (392) T protein:vir:10 344 ASTDVGGKAFTRNTLDLRAIQRDDVQMWDNEAAVYGEIDLS-----APVEQPQG 392 (392) T ss_pred EEeccccchhhcCceEEEEEEeeccEEecccceEEEEeccc-----ccccCCCC Confidence 1122 234455667778876 3344 332211110 01111122 No 72 >protein:vir:102873 Length: 392 # NCBI annotation: major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1492 # MgeName: Cherry # Cross-refs: genbank:acc:YP_338137;genbank:gi:77020198;genbank:GeneID:3703782 Probab=52.83 E-value=0.54 Score=22.03 Aligned_cols=321 Identities=12% Similarity=0.080 Sum_probs=118.9 Q ss_pred Ccchhhhhhh---hccccccc----------cccccchhhhhhhhhhhhHHHHHHhcccccchhhhhhhccccccccccc Q lcl|Aclame:pro 1 MNLTEKWKDL---LEAEGADM----------PEIATATKQKIMSKIFENQDRDINNDPMYRDPQLVEAFNAGLNEAVVNG 67 (514) Q Consensus 1 ~~l~~kw~p~---l~~~~~~~----------~~i~~~~~~~~~~~~~enq~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~ 67 (514) -.|.++.... .+.+.... .+-...+|++ ....|.+++..-. ..+.+... .+ T Consensus 39 ~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~l~~~~~~~~---------~~~~~~~~-~~----- 102 (392) T protein:vir:10 39 RSLQKKIDLQRSLDEAETEERNNGREVETRNVDGEMEYRDV-FMKALRNKPLNAE---------EREFLEDD-LE----- 102 (392) T ss_pred HHHHHHHHHHHHHHHHHHHHhhccccccccCccchHHHHHH-HHHHHhcccccHH---------HHHHHhhh-hh----- Confidence 1222222111 00000000 0011111111 1111111111000 00000000 00 Q ss_pred cccccccccccccc-ccc--c-cccceeeehhhhhhhhhhhhcceeEEecCCcccceeeeeeeeecCCCCcccccccccc Q lcl|Aclame:pro 68 DHGYDPANIAQGVT-TGA--V-TNIGPTVMGMVRRAIPQLIAFDIAGVQPMTGPTSQVFTLRSVYGKDPLTGAEAFHPTR 143 (514) Q Consensus 68 ~~g~~~~~~~~st~-tg~--v-~~~~P~l~~l~Rra~~~LIa~DI~GVQPmTgPTGLIFAMRSrY~~~~~tg~EA~~~~n 143 (514) .......++ .|. | ..+.+-++.++ ..+..-.+++.+.||++++|-+. ..+..+ +.++ T Consensus 103 -----~~~~~~~t~~~gg~~vP~~~~~~ii~~~---~~~s~l~~~~~~~~~~~~~~~~~--~~~~~~----~~~a----- 163 (392) T protein:vir:10 103 -----QRAMSGLTGEDGGLVIPQDIQTQINELA---RSFDALEQYVTVEPVRTRSGSRV--LEKNSD----MIPF----- 163 (392) T ss_pred -----hhhccccccCCCceecchhHHHHHHHHH---HhhhhhhhhceeeeccCCceeEE--EEeecC----Cccc----- Confidence 000011111 111 1 11223333333 44555668999999999877432 111111 0000 Q ss_pred CCCCccCccccccccccccccccccccccccccccccccccccccccccccccccccCcccccccccccccccccccccc Q lcl|Aclame:pro 144 QADASFSGQAAASTIADFPTTGAATDGTPYKAEVTTSGGDVSMRYFLALGAVTLAVAGQMTATEYTDGVAGGLLVEIDAG 223 (514) Q Consensus 144 Eadt~fSG~~~~~~~~~~~~~~~~t~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~y~~~~G 223 (514) .|-+ ++ T Consensus 164 ----~~v~----------------------------------------------------------------------E~ 169 (392) T protein:vir:10 164 ----AEIT----------------------------------------------------------------------EM 169 (392) T ss_pred ----eeec----------------------------------------------------------------------cc Confidence 0000 00 Q ss_pred ccchhhhccccCCCCCCcccccceeEEEEEEEEeecccccccccHHHHHHHHhhcCCChhHHHHHHHHHHHHHHhhHHHH Q lcl|Aclame:pro 224 MATSQAELQENFNGSSNNEWNEMSFRIDKQVVEAKSRQLKAQYSIELAQDLRAVHGLDADAELSGILANEVMVELNREIV 303 (514) Q Consensus 224 mtTa~aEal~~~ggs~~~~f~EMsFsIEK~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAEaELanILStEImlEINReii 303 (514) ++. ..+....|.++.|...|..+ ...+|-||.+|- ..|.+++|.+.|...|...+|..|+ T Consensus 170 -----~~~----~~~~~~~~~~v~l~~~k~~~-------~~~iS~ell~ds----~~~l~~~i~~~l~~~i~~~~d~~~~ 229 (392) T protein:vir:10 170 -----GEI----PETDNPKFSNVQYAVKDRAG-------ILPLSRSLLQDS----DQNILKYVTKWLGKKSKVTRNVLIL 229 (392) T ss_pred -----ccc----cccccccceeEEeeeeeEEE-------eehhhHHHHhhh----HHHHHHHHHHHHHHHHHHHHHHHHh Confidence 000 00112235555555555544 455999999994 2567889999999999999998885 Q ss_pred HHHhhheeecccccccccCCcceeccccccccccchhhHHHHHHHHHHHHHHHHHHHHhcccccccEEEEChhHHhHHhh Q lcl|Aclame:pro 304 NLVNSQAQIGKSGWTQGAGAAGVFDFSDAVDVKGARWAGEAYKALLIQIEKEANEIGRQTGRGNGNFIIASRNVVSALSM 383 (514) Q Consensus 304 ~~l~~~a~v~~~~~~~~v~~~g~~dl~~~~d~~~~rwa~e~~r~L~~~i~~~a~~I~~~T~r~~~n~~v~S~~va~~L~~ 383 (514) .-.... .+.+...+ +....++.... .+. ....-..|++|.....|.. T Consensus 230 ~g~g~~------------~~~~~~~~-------------d~i~~~~~~~l--~~~------~~~~a~~vm~~~~~~~L~~ 276 (392) T protein:vir:10 230 GVIEKL------------TKQAIKSL-------------DDIKDVLNVKL--DPA------ISPNAILLTNQDGFNYLDK 276 (392) T ss_pred hccccc------------cccCccCH-------------HHHHHHHHHhh--hhh------hccCCEEEEcHHHHHHHHH Confidence 433111 11222221 12333221111 111 1123457899999888875 Q ss_pred ccccccchhccccCccccccccCceEEEEecCceEEEecCCCccceEEEEEecCCCcccceeeccccc----cc--ccc- Q lcl|Aclame:pro 384 TDTLVGPAAQGMQDGSMNTDTNQTVFAGVLGGRFKVYIDQYAVNDYFTVGFKGSTEMDAGVFYSPYVP----LT--PLR- 456 (514) Q Consensus 384 ~g~~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~kG~~~~~~~~fy~PYv~----~~--~~~- 456 (514) . .. ..| ...-.+....-..++|.|...|+++.... ++.+|...-+..++|+.+-. .. .+. T Consensus 277 l---kd--~~G---~~l~~~~~~~~~~~tllG~~~v~~~~~~~-----~~~~~~~~~~~~~~~gdfs~~~~i~~~~~~~~ 343 (392) T protein:vir:10 277 L---KD--KDG---KYILQSDPTQKNKKLFAGTNPVVVVSNRF-----LKSKGTTAKKAPLIIGDLKEAIVLFKREDMEL 343 (392) T ss_pred h---hc--cCC---CeEeecCccCCccccccCcccEEEecccc-----cCCCcccCCceEEEEEehhceEEEEeecceEE Confidence 3 10 000 00100111111235677765666543221 11112111122233332211 00 000 Q ss_pred ccCC------ccccceeeeeeeeee-eecC--ccccccCcceeecCcchhhhcc Q lcl|Aclame:pro 457 GSDS------KNFQPVIGFKTRYGV-QVNP--FADPTASATKVGNGAPVAASMG 501 (514) Q Consensus 457 ~~dp------~s~qp~~~~~tRY~l-~~nP--f~~~~~~~~~i~~~~~~~~~~~ 501 (514) .+++ .+.+=.+-...|++. ..+| |...+-..+ .+-..-+| T Consensus 344 ~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~~-----a~~~~~~~ 392 (392) T protein:vir:10 344 ASTDVGGKAFTRNTLDLRAIQRDDVQMWDNEAAVYGEIDLS-----APVEQPQG 392 (392) T ss_pred EEeccccchhhcCceEEEEEEeeccEEecccceEEEEeccc-----ccccCCCC Confidence 1122 234455667778876 3344 332211110 01111122 No 73 >protein:vir:102082 Length: 392 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:1503 # MgeName: Fah # Cross-refs: genbank:acc:YP_512315;genbank:gi:89152484;genbank:GeneID:3953075 Probab=52.83 E-value=0.54 Score=22.03 Aligned_cols=321 Identities=12% Similarity=0.080 Sum_probs=118.9 Q ss_pred Ccchhhhhhh---hccccccc----------cccccchhhhhhhhhhhhHHHHHHhcccccchhhhhhhccccccccccc Q lcl|Aclame:pro 1 MNLTEKWKDL---LEAEGADM----------PEIATATKQKIMSKIFENQDRDINNDPMYRDPQLVEAFNAGLNEAVVNG 67 (514) Q Consensus 1 ~~l~~kw~p~---l~~~~~~~----------~~i~~~~~~~~~~~~~enq~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~ 67 (514) -.|.++.... .+.+.... .+-...+|++ ....|.+++..-. ..+.+... .+ T Consensus 39 ~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~l~~~~~~~~---------~~~~~~~~-~~----- 102 (392) T protein:vir:10 39 RSLQKKIDLQRSLDEAETEERNNGREVETRNVDGEMEYRDV-FMKALRNKPLNAE---------EREFLEDD-LE----- 102 (392) T ss_pred HHHHHHHHHHHHHHHHHHHHhhccccccccCccchHHHHHH-HHHHHhcccccHH---------HHHHHhhh-hh----- Confidence 1222222111 00000000 0011111111 1111111111000 00000000 00 Q ss_pred cccccccccccccc-ccc--c-cccceeeehhhhhhhhhhhhcceeEEecCCcccceeeeeeeeecCCCCcccccccccc Q lcl|Aclame:pro 68 DHGYDPANIAQGVT-TGA--V-TNIGPTVMGMVRRAIPQLIAFDIAGVQPMTGPTSQVFTLRSVYGKDPLTGAEAFHPTR 143 (514) Q Consensus 68 ~~g~~~~~~~~st~-tg~--v-~~~~P~l~~l~Rra~~~LIa~DI~GVQPmTgPTGLIFAMRSrY~~~~~tg~EA~~~~n 143 (514) .......++ .|. | ..+.+-++.++ ..+..-.+++.+.||++++|-+. ..+..+ +.++ T Consensus 103 -----~~~~~~~t~~~gg~~vP~~~~~~ii~~~---~~~s~l~~~~~~~~~~~~~~~~~--~~~~~~----~~~a----- 163 (392) T protein:vir:10 103 -----QRAMSGLTGEDGGLVIPQDIQTQINELA---RSFDALEQYVTVEPVRTRSGSRV--LEKNSD----MIPF----- 163 (392) T ss_pred -----hhhccccccCCCceecchhHHHHHHHHH---HhhhhhhhhceeeeccCCceeEE--EEeecC----Cccc----- Confidence 000011111 111 1 11223333333 44555668999999999877432 111111 0000 Q ss_pred CCCCccCccccccccccccccccccccccccccccccccccccccccccccccccccCcccccccccccccccccccccc Q lcl|Aclame:pro 144 QADASFSGQAAASTIADFPTTGAATDGTPYKAEVTTSGGDVSMRYFLALGAVTLAVAGQMTATEYTDGVAGGLLVEIDAG 223 (514) Q Consensus 144 Eadt~fSG~~~~~~~~~~~~~~~~t~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~y~~~~G 223 (514) .|-+ ++ T Consensus 164 ----~~v~----------------------------------------------------------------------E~ 169 (392) T protein:vir:10 164 ----AEIT----------------------------------------------------------------------EM 169 (392) T ss_pred ----eeec----------------------------------------------------------------------cc Confidence 0000 00 Q ss_pred ccchhhhccccCCCCCCcccccceeEEEEEEEEeecccccccccHHHHHHHHhhcCCChhHHHHHHHHHHHHHHhhHHHH Q lcl|Aclame:pro 224 MATSQAELQENFNGSSNNEWNEMSFRIDKQVVEAKSRQLKAQYSIELAQDLRAVHGLDADAELSGILANEVMVELNREIV 303 (514) Q Consensus 224 mtTa~aEal~~~ggs~~~~f~EMsFsIEK~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAEaELanILStEImlEINReii 303 (514) ++. ..+....|.++.|...|..+ ...+|-||.+|- ..|.+++|.+.|...|...+|..|+ T Consensus 170 -----~~~----~~~~~~~~~~v~l~~~k~~~-------~~~iS~ell~ds----~~~l~~~i~~~l~~~i~~~~d~~~~ 229 (392) T protein:vir:10 170 -----GEI----PETDNPKFSNVQYAVKDRAG-------ILPLSRSLLQDS----DQNILKYVTKWLGKKSKVTRNVLIL 229 (392) T ss_pred -----ccc----cccccccceeEEeeeeeEEE-------eehhhHHHHhhh----HHHHHHHHHHHHHHHHHHHHHHHHh Confidence 000 00112235555555555544 455999999994 2567889999999999999998885 Q ss_pred HHHhhheeecccccccccCCcceeccccccccccchhhHHHHHHHHHHHHHHHHHHHHhcccccccEEEEChhHHhHHhh Q lcl|Aclame:pro 304 NLVNSQAQIGKSGWTQGAGAAGVFDFSDAVDVKGARWAGEAYKALLIQIEKEANEIGRQTGRGNGNFIIASRNVVSALSM 383 (514) Q Consensus 304 ~~l~~~a~v~~~~~~~~v~~~g~~dl~~~~d~~~~rwa~e~~r~L~~~i~~~a~~I~~~T~r~~~n~~v~S~~va~~L~~ 383 (514) .-.... .+.+...+ +....++.... .+. ....-..|++|.....|.. T Consensus 230 ~g~g~~------------~~~~~~~~-------------d~i~~~~~~~l--~~~------~~~~a~~vm~~~~~~~L~~ 276 (392) T protein:vir:10 230 GVIEKL------------TKQAIKSL-------------DDIKDVLNVKL--DPA------ISPNAILLTNQDGFNYLDK 276 (392) T ss_pred hccccc------------cccCccCH-------------HHHHHHHHHhh--hhh------hccCCEEEEcHHHHHHHHH Confidence 433111 11222221 12333221111 111 1123457899999888875 Q ss_pred ccccccchhccccCccccccccCceEEEEecCceEEEecCCCccceEEEEEecCCCcccceeeccccc----cc--ccc- Q lcl|Aclame:pro 384 TDTLVGPAAQGMQDGSMNTDTNQTVFAGVLGGRFKVYIDQYAVNDYFTVGFKGSTEMDAGVFYSPYVP----LT--PLR- 456 (514) Q Consensus 384 ~g~~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~kG~~~~~~~~fy~PYv~----~~--~~~- 456 (514) . .. ..| ...-.+....-..++|.|...|+++.... ++.+|...-+..++|+.+-. .. .+. T Consensus 277 l---kd--~~G---~~l~~~~~~~~~~~tllG~~~v~~~~~~~-----~~~~~~~~~~~~~~~gdfs~~~~i~~~~~~~~ 343 (392) T protein:vir:10 277 L---KD--KDG---KYILQSDPTQKNKKLFAGTNPVVVVSNRF-----LKSKGTTAKKAPLIIGDLKEAIVLFKREDMEL 343 (392) T ss_pred h---hc--cCC---CeEeecCccCCccccccCcccEEEecccc-----cCCCcccCCceEEEEEehhceEEEEeecceEE Confidence 3 10 000 00100111111235677765666543221 11112111122233332211 00 000 Q ss_pred ccCC------ccccceeeeeeeeee-eecC--ccccccCcceeecCcchhhhcc Q lcl|Aclame:pro 457 GSDS------KNFQPVIGFKTRYGV-QVNP--FADPTASATKVGNGAPVAASMG 501 (514) Q Consensus 457 ~~dp------~s~qp~~~~~tRY~l-~~nP--f~~~~~~~~~i~~~~~~~~~~~ 501 (514) .+++ .+.+=.+-...|++. ..+| |...+-..+ .+-..-+| T Consensus 344 ~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~~-----a~~~~~~~ 392 (392) T protein:vir:10 344 ASTDVGGKAFTRNTLDLRAIQRDDVQMWDNEAAVYGEIDLS-----APVEQPQG 392 (392) T ss_pred EEeccccchhhcCceEEEEEEeeccEEecccceEEEEeccc-----ccccCCCC Confidence 1122 234455667778876 3344 332211110 01111122 No 74 >protein:vir:105004 Length: 392 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:1490 # MgeName: W Beta # Cross-refs: genbank:acc:YP_459969;genbank:gi:85701384;genbank:GeneID:3882145 Probab=52.83 E-value=0.54 Score=22.03 Aligned_cols=321 Identities=12% Similarity=0.080 Sum_probs=118.9 Q ss_pred Ccchhhhhhh---hccccccc----------cccccchhhhhhhhhhhhHHHHHHhcccccchhhhhhhccccccccccc Q lcl|Aclame:pro 1 MNLTEKWKDL---LEAEGADM----------PEIATATKQKIMSKIFENQDRDINNDPMYRDPQLVEAFNAGLNEAVVNG 67 (514) Q Consensus 1 ~~l~~kw~p~---l~~~~~~~----------~~i~~~~~~~~~~~~~enq~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~ 67 (514) -.|.++.... .+.+.... .+-...+|++ ....|.+++..-. ..+.+... .+ T Consensus 39 ~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~l~~~~~~~~---------~~~~~~~~-~~----- 102 (392) T protein:vir:10 39 RSLQKKIDLQRSLDEAETEERNNGREVETRNVDGEMEYRDV-FMKALRNKPLNAE---------EREFLEDD-LE----- 102 (392) T ss_pred HHHHHHHHHHHHHHHHHHHHhhccccccccCccchHHHHHH-HHHHHhcccccHH---------HHHHHhhh-hh----- Confidence 1222222111 00000000 0011111111 1111111111000 00000000 00 Q ss_pred cccccccccccccc-ccc--c-cccceeeehhhhhhhhhhhhcceeEEecCCcccceeeeeeeeecCCCCcccccccccc Q lcl|Aclame:pro 68 DHGYDPANIAQGVT-TGA--V-TNIGPTVMGMVRRAIPQLIAFDIAGVQPMTGPTSQVFTLRSVYGKDPLTGAEAFHPTR 143 (514) Q Consensus 68 ~~g~~~~~~~~st~-tg~--v-~~~~P~l~~l~Rra~~~LIa~DI~GVQPmTgPTGLIFAMRSrY~~~~~tg~EA~~~~n 143 (514) .......++ .|. | ..+.+-++.++ ..+..-.+++.+.||++++|-+. ..+..+ +.++ T Consensus 103 -----~~~~~~~t~~~gg~~vP~~~~~~ii~~~---~~~s~l~~~~~~~~~~~~~~~~~--~~~~~~----~~~a----- 163 (392) T protein:vir:10 103 -----QRAMSGLTGEDGGLVIPQDIQTQINELA---RSFDALEQYVTVEPVRTRSGSRV--LEKNSD----MIPF----- 163 (392) T ss_pred -----hhhccccccCCCceecchhHHHHHHHHH---HhhhhhhhhceeeeccCCceeEE--EEeecC----Cccc----- Confidence 000011111 111 1 11223333333 44555668999999999877432 111111 0000 Q ss_pred CCCCccCccccccccccccccccccccccccccccccccccccccccccccccccccCcccccccccccccccccccccc Q lcl|Aclame:pro 144 QADASFSGQAAASTIADFPTTGAATDGTPYKAEVTTSGGDVSMRYFLALGAVTLAVAGQMTATEYTDGVAGGLLVEIDAG 223 (514) Q Consensus 144 Eadt~fSG~~~~~~~~~~~~~~~~t~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~y~~~~G 223 (514) .|-+ ++ T Consensus 164 ----~~v~----------------------------------------------------------------------E~ 169 (392) T protein:vir:10 164 ----AEIT----------------------------------------------------------------------EM 169 (392) T ss_pred ----eeec----------------------------------------------------------------------cc Confidence 0000 00 Q ss_pred ccchhhhccccCCCCCCcccccceeEEEEEEEEeecccccccccHHHHHHHHhhcCCChhHHHHHHHHHHHHHHhhHHHH Q lcl|Aclame:pro 224 MATSQAELQENFNGSSNNEWNEMSFRIDKQVVEAKSRQLKAQYSIELAQDLRAVHGLDADAELSGILANEVMVELNREIV 303 (514) Q Consensus 224 mtTa~aEal~~~ggs~~~~f~EMsFsIEK~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAEaELanILStEImlEINReii 303 (514) ++. ..+....|.++.|...|..+ ...+|-||.+|- ..|.+++|.+.|...|...+|..|+ T Consensus 170 -----~~~----~~~~~~~~~~v~l~~~k~~~-------~~~iS~ell~ds----~~~l~~~i~~~l~~~i~~~~d~~~~ 229 (392) T protein:vir:10 170 -----GEI----PETDNPKFSNVQYAVKDRAG-------ILPLSRSLLQDS----DQNILKYVTKWLGKKSKVTRNVLIL 229 (392) T ss_pred -----ccc----cccccccceeEEeeeeeEEE-------eehhhHHHHhhh----HHHHHHHHHHHHHHHHHHHHHHHHh Confidence 000 00112235555555555544 455999999994 2567889999999999999998885 Q ss_pred HHHhhheeecccccccccCCcceeccccccccccchhhHHHHHHHHHHHHHHHHHHHHhcccccccEEEEChhHHhHHhh Q lcl|Aclame:pro 304 NLVNSQAQIGKSGWTQGAGAAGVFDFSDAVDVKGARWAGEAYKALLIQIEKEANEIGRQTGRGNGNFIIASRNVVSALSM 383 (514) Q Consensus 304 ~~l~~~a~v~~~~~~~~v~~~g~~dl~~~~d~~~~rwa~e~~r~L~~~i~~~a~~I~~~T~r~~~n~~v~S~~va~~L~~ 383 (514) .-.... .+.+...+ +....++.... .+. ....-..|++|.....|.. T Consensus 230 ~g~g~~------------~~~~~~~~-------------d~i~~~~~~~l--~~~------~~~~a~~vm~~~~~~~L~~ 276 (392) T protein:vir:10 230 GVIEKL------------TKQAIKSL-------------DDIKDVLNVKL--DPA------ISPNAILLTNQDGFNYLDK 276 (392) T ss_pred hccccc------------cccCccCH-------------HHHHHHHHHhh--hhh------hccCCEEEEcHHHHHHHHH Confidence 433111 11222221 12333221111 111 1123457899999888875 Q ss_pred ccccccchhccccCccccccccCceEEEEecCceEEEecCCCccceEEEEEecCCCcccceeeccccc----cc--ccc- Q lcl|Aclame:pro 384 TDTLVGPAAQGMQDGSMNTDTNQTVFAGVLGGRFKVYIDQYAVNDYFTVGFKGSTEMDAGVFYSPYVP----LT--PLR- 456 (514) Q Consensus 384 ~g~~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~kG~~~~~~~~fy~PYv~----~~--~~~- 456 (514) . .. ..| ...-.+....-..++|.|...|+++.... ++.+|...-+..++|+.+-. .. .+. T Consensus 277 l---kd--~~G---~~l~~~~~~~~~~~tllG~~~v~~~~~~~-----~~~~~~~~~~~~~~~gdfs~~~~i~~~~~~~~ 343 (392) T protein:vir:10 277 L---KD--KDG---KYILQSDPTQKNKKLFAGTNPVVVVSNRF-----LKSKGTTAKKAPLIIGDLKEAIVLFKREDMEL 343 (392) T ss_pred h---hc--cCC---CeEeecCccCCccccccCcccEEEecccc-----cCCCcccCCceEEEEEehhceEEEEeecceEE Confidence 3 10 000 00100111111235677765666543221 11112111122233332211 00 000 Q ss_pred ccCC------ccccceeeeeeeeee-eecC--ccccccCcceeecCcchhhhcc Q lcl|Aclame:pro 457 GSDS------KNFQPVIGFKTRYGV-QVNP--FADPTASATKVGNGAPVAASMG 501 (514) Q Consensus 457 ~~dp------~s~qp~~~~~tRY~l-~~nP--f~~~~~~~~~i~~~~~~~~~~~ 501 (514) .+++ .+.+=.+-...|++. ..+| |...+-..+ .+-..-+| T Consensus 344 ~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~~-----a~~~~~~~ 392 (392) T protein:vir:10 344 ASTDVGGKAFTRNTLDLRAIQRDDVQMWDNEAAVYGEIDLS-----APVEQPQG 392 (392) T ss_pred EEeccccchhhcCceEEEEEEeeccEEecccceEEEEeccc-----ccccCCCC Confidence 1122 234455667778876 3344 332211110 01111122 No 75 >protein:vir:2504 Length: 305 # NCBI annotation: major capsid subunit gp9 # Family: family:all:507 # MgeID: mge:53 # MgeName: TM4 # Cross-refs: genbank:acc:NP_569745;genbank:gi:18496895;genbank:GeneID:932268 Probab=52.32 E-value=0.56 Score=21.98 Aligned_cols=280 Identities=13% Similarity=0.114 Sum_probs=116.5 Q ss_pred cccccccccccccceee-ehhhhhhhhhhhhcceeEEecCCcccceeeeeeeeecCCCCccccccccccCCCCccCcccc Q lcl|Aclame:pro 76 IAQGVTTGAVTNIGPTV-MGMVRRAIPQLIAFDIAGVQPMTGPTSQVFTLRSVYGKDPLTGAEAFHPTRQADASFSGQAA 154 (514) Q Consensus 76 ~~~st~tg~v~~~~P~l-~~l~Rra~~~LIa~DI~GVQPmTgPTGLIFAMRSrY~~~~~tg~EA~~~~nEadt~fSG~~~ 154 (514) ++..+++..-.-.-+.+ =.+++++.++.+-.+++-+.||++++--|--. .. +.+| .|-+.+. T Consensus 1 ma~~t~~~gg~liP~~~~~~Ii~~~~~~s~l~~l~~~~~~~~~~~~~p~~----~~----~~~a---------~wv~E~~ 63 (305) T protein:vir:25 1 MADISRAEVASLIQEAYSDTLLAAAKQGSTVLSAFQNVNMGTKTTHLPVL----AT----LPEA---------DWVGESA 63 (305) T ss_pred CCCccCCccceecCHHHHHHHHHHHHhhchhhhhcceeeccCCcEEEEEE----eC----Ccce---------EEeeccc Confidence 12112111111111111 12344555666778889999998765222111 01 1111 1111100 Q ss_pred ccccccccccccccccccccccccccccccccccccccccccccccCccccccccccccccccccccccccchhhhcccc Q lcl|Aclame:pro 155 ASTIADFPTTGAATDGTPYKAEVTTSGGDVSMRYFLALGAVTLAVAGQMTATEYTDGVAGGLLVEIDAGMATSQAELQEN 234 (514) Q Consensus 155 ~~~~~~~~~~~~~t~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~y~~~~GmtTa~aEal~~ 234 (514) .. . T Consensus 64 ~~----------------------------------------------------------------------~------- 66 (305) T protein:vir:25 64 TD----------------------------------------------------------------------P------- 66 (305) T ss_pred cc----------------------------------------------------------------------c------- Confidence 00 0 Q ss_pred CCCCCCcccccceeEEEEEEEEeecccccccccHHHHHHHHhhcCCChhHHHHHHHHHHHHHHhhHHHHHHHhhheeecc Q lcl|Aclame:pro 235 FNGSSNNEWNEMSFRIDKQVVEAKSRQLKAQYSIELAQDLRAVHGLDADAELSGILANEVMVELNREIVNLVNSQAQIGK 314 (514) Q Consensus 235 ~ggs~~~~f~EMsFsIEK~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAEaELanILStEImlEINReii~~l~~~a~v~~ 314 (514) ...++.-..+++++...++..+-...+|-||.+|-. .|.|++|.+-|+..|...+++.++.-- ++ T Consensus 67 -----~~~~~~s~~~f~~i~~~~~k~~~~~~is~ell~ds~----~~~~~~i~~~l~~~~a~~~d~a~~~G~------g~ 131 (305) T protein:vir:25 67 -----KGVKPTSKVTWANRTLVAEEIAVIIPVHENVIDDAT----VAVLTEVAELGGQAIGKKLDQAVIFGT------DK 131 (305) T ss_pred -----cccccccccceeeEEeeeEEEEEeehhhHHHHhcch----HHHHHHHHHHHHHHHHHHHhhhheecc------CC Confidence 001111123344555555555556679999999843 568999999999999999999995321 11 Q ss_pred cccccccCCcceecc-----ccccccccchhhHHHHHHHHHHHHHHHHHHHHhcccccccEEEEChhHHhHHhhcccccc Q lcl|Aclame:pro 315 SGWTQGAGAAGVFDF-----SDAVDVKGARWAGEAYKALLIQIEKEANEIGRQTGRGNGNFIIASRNVVSALSMTDTLVG 389 (514) Q Consensus 315 ~~~~~~v~~~g~~dl-----~~~~d~~~~rwa~e~~r~L~~~i~~~a~~I~~~T~r~~~n~~v~S~~va~~L~~~g~~~~ 389 (514) ..+....+++.. ....... .......++.-+.++...+...- +..+-+|++|.....|... . T Consensus 132 ---~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~v~~~~~~~~l~~l---k- 198 (305) T protein:vir:25 132 ---PASWVSPALIPAAVTAGQAVEVVG----GVANESDIVGATNRAAKAVASAG--WAPDTLLSSLALRYEVANI---R- 198 (305) T ss_pred ---CCCccccccccccccccccccccc----cchhhhHHHHHHHHHHHhhhhcc--cccceeEecHHHHHHHHHh---h- Confidence 111111010000 0000000 11123334444444444443222 4445578899888777632 1 Q ss_pred chhccccCccccccccCce-EE-EEecCceEEEecCCCccc------------eEEEEEecCCCcccceeeccccccccc Q lcl|Aclame:pro 390 PAAQGMQDGSMNTDTNQTV-FA-GVLGGRFKVYIDQYAVND------------YFTVGFKGSTEMDAGVFYSPYVPLTPL 455 (514) Q Consensus 390 ~~~~~~~~~~~~~d~~~~~-~~-G~l~~~~~vy~D~y~~~d------------y~~vG~kG~~~~~~~~fy~PYv~~~~~ 455 (514) |..++. |. ++| .|++|+|..+.+.+ ++.+|..+..+.+- ..+.-+.+ T Consensus 199 -------------d~~G~~i~~~~~l-~G~Pv~~~~~~~~~~~~~~~~~gd~s~~~i~~~~~~~i~~----~~~~~~~~- 259 (305) T protein:vir:25 199 -------------DANGNPVFRDDSF-AGFRTFFNRNGAWDADAAIEVIADSSRVKIGVRQDITVKF----LDQATLGT- 259 (305) T ss_pred -------------ccCCceeecCCcc-cccceEEcCccCCCCCccEEEEEecceEEEEEecCeEEEE----eeeeeeec- Confidence 111111 11 355 34688888775532 12222222211111 10100000 Q ss_pred cccCCcc-cc-ceee--eeeeeee-eecCccccccCcceeecCcchh Q lcl|Aclame:pro 456 RGSDSKN-FQ-PVIG--FKTRYGV-QVNPFADPTASATKVGNGAPVA 497 (514) Q Consensus 456 ~~~dp~s-~q-p~~~--~~tRY~l-~~nPf~~~~~~~~~i~~~~~~~ 497 (514) .-.+.+ || ..++ ...|||+ +.||=+-..-.+.....-.|-+ T Consensus 260 -~~~~~~~~~~~~~~~R~~~r~~~~v~~p~a~v~~~~~~~~~~~pa~ 305 (305) T protein:vir:25 260 -GENQINLAERDMVALRLKARFAYVLGVSATAQGANKTPVAVVAPAA 305 (305) T ss_pred -CCceeeeeecCcEEEEEEEeecceeeCcccEEEEccccccccCCCC Confidence 001111 21 1222 4668996 5688432222111111111111 No 76 >protein:vir:78830 Length: 324 # NCBI annotation: major head protein # Family: family:all:507 # MgeID: mge:1858 # MgeName: 80alpha # Cross-refs: genbank:acc:YP_001285361;genbank:gi:148717889;genbank:GeneID:5246961 Probab=52.25 E-value=0.56 Score=21.97 Aligned_cols=303 Identities=11% Similarity=0.026 Sum_probs=115.8 Q ss_pred hccccccccccccchhhhhhhhhhhhHHHHHHhcccccchhhhhhhccccccccccccccccccccccccccccccccce Q lcl|Aclame:pro 11 LEAEGADMPEIATATKQKIMSKIFENQDRDINNDPMYRDPQLVEAFNAGLNEAVVNGDHGYDPANIAQGVTTGAVTNIGP 90 (514) Q Consensus 11 l~~~~~~~~~i~~~~~~~~~~~~~enq~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~g~~~~~~~~st~tg~v~~~~P 90 (514) .+. ++.....+++....+.+=+.+. + . +.. .+.+++..--.+ T Consensus 1 ~~~-----~~~~~~~~~~~~~~~~~~~~~~--------------a--------~----------~~~-~~~~~~~~iP~~ 42 (324) T protein:vir:78 1 MEQ-----TQKLKLNLQHFASNNVKPQVFN--------------P--------D----------NVM-MHEKKDGTLMNE 42 (324) T ss_pred CCc-----chhhhHHHHHHHHHhhhhhhhc--------------c--------c----------ccc-ccCcCccccchh Confidence 000 0111111111111111100000 0 0 000 001111100001 Q ss_pred eeehhhhhhhhhhhhcceeEEecCCcccceeeeeeeeecCCCCccccccccccCCCCccCcccccccccccccccccccc Q lcl|Aclame:pro 91 TVMGMVRRAIPQLIAFDIAGVQPMTGPTSQVFTLRSVYGKDPLTGAEAFHPTRQADASFSGQAAASTIADFPTTGAATDG 170 (514) Q Consensus 91 ~l~~l~Rra~~~LIa~DI~GVQPmTgPTGLIFAMRSrY~~~~~tg~EA~~~~nEadt~fSG~~~~~~~~~~~~~~~~t~~ 170 (514) +.=.+++.+..+....+++-+-||++++--|.-. .. +.+| .| T Consensus 43 ~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~~p~~----~~----~~~a---------~~--------------------- 84 (324) T protein:vir:78 43 FTTPILQEVMENSKIMQLGKYEPMEGTEKKFTFW----AD----KPGA---------YW--------------------- 84 (324) T ss_pred HHHHHHHHHHhhchhhhhcceeeccCCceEEEEE----ec----Ccce---------eE--------------------- Confidence 1112444455666678888888888765222110 00 0000 00 Q ss_pred ccccccccccccccccccccccccccccccCccccccccccccccccccccccccchhhhccccCCCCCCcccccceeEE Q lcl|Aclame:pro 171 TPYKAEVTTSGGDVSMRYFLALGAVTLAVAGQMTATEYTDGVAGGLLVEIDAGMATSQAELQENFNGSSNNEWNEMSFRI 250 (514) Q Consensus 171 ~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~y~~~~GmtTa~aEal~~~ggs~~~~f~EMsFsI 250 (514) + +| +..+++...++ T Consensus 85 -------------------------------------------------v--------~E---------g~~~~~~~~~~ 98 (324) T protein:vir:78 85 -------------------------------------------------V--------GE---------GQKIETSKATW 98 (324) T ss_pred -------------------------------------------------e--------cC---------Cccccccccce Confidence 0 01 11233333445 Q ss_pred EEEEEEeecccccccccHHHHHHHHhhcCCChhHHHHHHHHHHHHHHhhHHHHHHHhhheeecccccccccCCcceeccc Q lcl|Aclame:pro 251 DKQVVEAKSRQLKAQYSIELAQDLRAVHGLDADAELSGILANEVMVELNREIVNLVNSQAQIGKSGWTQGAGAAGVFDFS 330 (514) Q Consensus 251 EK~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAEaELanILStEImlEINReii~~l~~~a~v~~~~~~~~v~~~g~~dl~ 330 (514) +++++..+.-+.-..+|-||.+|-. .|.+++|.+-|+..|...|++.+|. -..+-. .+.|+.... T Consensus 99 ~~v~~~~~k~~~~~~is~ell~ds~----~~l~~~i~~~la~ai~~~~d~a~l~---G~g~~~--------~~~gi~~~~ 163 (324) T protein:vir:78 99 VNATMRAFKLGVILPVTKEFLNYTY----SQFFEEMKPMIAEAFYKKFDEAGIL---NQGNNP--------FGKSIAQSI 163 (324) T ss_pred eEEEEeeEEEEEeehhhHHHHhcch----HHHHHHHHHHHHHHHHHHHHHHHhc---cCCCCC--------cCccccccc Confidence 5555555555556669999999864 5679999999999999999998843 222110 112222111 Q ss_pred cccccccchhhHHHHHHHHHHHHHHHHHHHHhcccccccEEEEChhHHhHHhhccccccchhccccCccccccccCceEE Q lcl|Aclame:pro 331 DAVDVKGARWAGEAYKALLIQIEKEANEIGRQTGRGNGNFIIASRNVVSALSMTDTLVGPAAQGMQDGSMNTDTNQTVFA 410 (514) Q Consensus 331 ~~~d~~~~rwa~e~~r~L~~~i~~~a~~I~~~T~r~~~n~~v~S~~va~~L~~~g~~~~~~~~~~~~~~~~~d~~~~~~~ 410 (514) ...-.. ......+..|.++.+.|.. .....+.+|+||+....|....-- .| ...-.+..+ T Consensus 164 ~~~~~~------~~~~~t~~~i~~~~~~l~~--~~~~~~~~vmn~~~~~~L~~l~d~-----~G---~~~~~~~~~---- 223 (324) T protein:vir:78 164 EKTNKV------IKGDFTQDNIIDLEALLED--DELEANAFISKTQNRSLLRKIVDP-----ET---KERIYDRNS---- 223 (324) T ss_pred ccccee------ccccccHHHHHHHHHhhhh--ccCCCCEEEEcHHHHHHHHHhhcc-----CC---CeeecCCCC---- Confidence 000000 0001112233344444432 235566789999999888753111 11 011111122 Q ss_pred EEecCceEEEecCCCcc--ceEEEE--------EecCCCcccceeeccccccccccccCCc-----cc---cceeeeeee Q lcl|Aclame:pro 411 GVLGGRFKVYIDQYAVN--DYFTVG--------FKGSTEMDAGVFYSPYVPLTPLRGSDSK-----NF---QPVIGFKTR 472 (514) Q Consensus 411 G~l~~~~~vy~D~y~~~--dy~~vG--------~kG~~~~~~~~fy~PYv~~~~~~~~dp~-----s~---qp~~~~~tR 472 (514) ++|. |++|++++.... ..+++| ..++-...- ..+.-... ..|+. -| +=.+=...| T Consensus 224 ~~l~-G~PV~~~~~~~~~~~~~~~gd~~~~~~g~~~~~~i~~----~~~~~~~~--~~~~~~~~~~~f~~d~~~~r~~~r 296 (324) T protein:vir:78 224 DSLD-GLPVVNLKSSNLKRGELITGDFDKLIYGIPQLIEYKI----DETAQLST--VKNEDGTPVNLFEQDMVALRATMH 296 (324) T ss_pred Cccc-ceeeEeeCCCCCCcceEEEEecceEEEEEecCcEEEE----eecccccc--cccccccchhhhhcCcEEEEEEEE Confidence 3344 358887765442 223333 222111100 00000000 00110 01 122223356 Q ss_pred eee-eecC--ccccccCcceeecCcchhh Q lcl|Aclame:pro 473 YGV-QVNP--FADPTASATKVGNGAPVAA 498 (514) Q Consensus 473 Y~l-~~nP--f~~~~~~~~~i~~~~~~~~ 498 (514) |+. ..+| |+..+.- ..-.+..+..- T Consensus 297 ~d~~v~~~~A~~~l~~a-~~~~~~~~~~~ 324 (324) T protein:vir:78 297 VALHIADDKAFAKLVPA-DKRTDSVPGEV 324 (324) T ss_pred EccEEecccceEEEecc-cccCCCCCCCC Confidence 665 2334 3322110 00011111111 No 77 >protein:vir:96392 Length: 324 # NCBI annotation: ORF011 # Family: family:all:507 # MgeID: mge:1613 # MgeName: 53 # Cross-refs: genbank:acc:YP_239648;genbank:gi:66395381;genbank:GeneID:5132868 Probab=52.25 E-value=0.56 Score=21.97 Aligned_cols=303 Identities=11% Similarity=0.026 Sum_probs=115.8 Q ss_pred hccccccccccccchhhhhhhhhhhhHHHHHHhcccccchhhhhhhccccccccccccccccccccccccccccccccce Q lcl|Aclame:pro 11 LEAEGADMPEIATATKQKIMSKIFENQDRDINNDPMYRDPQLVEAFNAGLNEAVVNGDHGYDPANIAQGVTTGAVTNIGP 90 (514) Q Consensus 11 l~~~~~~~~~i~~~~~~~~~~~~~enq~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~g~~~~~~~~st~tg~v~~~~P 90 (514) .+. ++.....+++....+.+=+.+. + . +.. .+.+++..--.+ T Consensus 1 ~~~-----~~~~~~~~~~~~~~~~~~~~~~--------------a--------~----------~~~-~~~~~~~~iP~~ 42 (324) T protein:vir:96 1 MEQ-----TQKLKLNLQHFASNNVKPQVFN--------------P--------D----------NVM-MHEKKDGTLMNE 42 (324) T ss_pred CCc-----chhhhHHHHHHHHHhhhhhhhc--------------c--------c----------ccc-ccCcCccccchh Confidence 000 0111111111111111100000 0 0 000 001111100001 Q ss_pred eeehhhhhhhhhhhhcceeEEecCCcccceeeeeeeeecCCCCccccccccccCCCCccCcccccccccccccccccccc Q lcl|Aclame:pro 91 TVMGMVRRAIPQLIAFDIAGVQPMTGPTSQVFTLRSVYGKDPLTGAEAFHPTRQADASFSGQAAASTIADFPTTGAATDG 170 (514) Q Consensus 91 ~l~~l~Rra~~~LIa~DI~GVQPmTgPTGLIFAMRSrY~~~~~tg~EA~~~~nEadt~fSG~~~~~~~~~~~~~~~~t~~ 170 (514) +.=.+++.+..+....+++-+-||++++--|.-. .. +.+| .| T Consensus 43 ~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~~p~~----~~----~~~a---------~~--------------------- 84 (324) T protein:vir:96 43 FTTPILQEVMENSKIMQLGKYEPMEGTEKKFTFW----AD----KPGA---------YW--------------------- 84 (324) T ss_pred HHHHHHHHHHhhchhhhhcceeeccCCceEEEEE----ec----Ccce---------eE--------------------- Confidence 1112444455666678888888888765222110 00 0000 00 Q ss_pred ccccccccccccccccccccccccccccccCccccccccccccccccccccccccchhhhccccCCCCCCcccccceeEE Q lcl|Aclame:pro 171 TPYKAEVTTSGGDVSMRYFLALGAVTLAVAGQMTATEYTDGVAGGLLVEIDAGMATSQAELQENFNGSSNNEWNEMSFRI 250 (514) Q Consensus 171 ~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~y~~~~GmtTa~aEal~~~ggs~~~~f~EMsFsI 250 (514) + +| +..+++...++ T Consensus 85 -------------------------------------------------v--------~E---------g~~~~~~~~~~ 98 (324) T protein:vir:96 85 -------------------------------------------------V--------GE---------GQKIETSKATW 98 (324) T ss_pred -------------------------------------------------e--------cC---------Cccccccccce Confidence 0 01 11233333445 Q ss_pred EEEEEEeecccccccccHHHHHHHHhhcCCChhHHHHHHHHHHHHHHhhHHHHHHHhhheeecccccccccCCcceeccc Q lcl|Aclame:pro 251 DKQVVEAKSRQLKAQYSIELAQDLRAVHGLDADAELSGILANEVMVELNREIVNLVNSQAQIGKSGWTQGAGAAGVFDFS 330 (514) Q Consensus 251 EK~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAEaELanILStEImlEINReii~~l~~~a~v~~~~~~~~v~~~g~~dl~ 330 (514) +++++..+.-+.-..+|-||.+|-. .|.+++|.+-|+..|...|++.+|. -..+-. .+.|+.... T Consensus 99 ~~v~~~~~k~~~~~~is~ell~ds~----~~l~~~i~~~la~ai~~~~d~a~l~---G~g~~~--------~~~gi~~~~ 163 (324) T protein:vir:96 99 VNATMRAFKLGVILPVTKEFLNYTY----SQFFEEMKPMIAEAFYKKFDEAGIL---NQGNNP--------FGKSIAQSI 163 (324) T ss_pred eEEEEeeEEEEEeehhhHHHHhcch----HHHHHHHHHHHHHHHHHHHHHHHhc---cCCCCC--------cCccccccc Confidence 5555555555556669999999864 5679999999999999999998843 222110 112222111 Q ss_pred cccccccchhhHHHHHHHHHHHHHHHHHHHHhcccccccEEEEChhHHhHHhhccccccchhccccCccccccccCceEE Q lcl|Aclame:pro 331 DAVDVKGARWAGEAYKALLIQIEKEANEIGRQTGRGNGNFIIASRNVVSALSMTDTLVGPAAQGMQDGSMNTDTNQTVFA 410 (514) Q Consensus 331 ~~~d~~~~rwa~e~~r~L~~~i~~~a~~I~~~T~r~~~n~~v~S~~va~~L~~~g~~~~~~~~~~~~~~~~~d~~~~~~~ 410 (514) ...-.. ......+..|.++.+.|.. .....+.+|+||+....|....-- .| ...-.+..+ T Consensus 164 ~~~~~~------~~~~~t~~~i~~~~~~l~~--~~~~~~~~vmn~~~~~~L~~l~d~-----~G---~~~~~~~~~---- 223 (324) T protein:vir:96 164 EKTNKV------IKGDFTQDNIIDLEALLED--DELEANAFISKTQNRSLLRKIVDP-----ET---KERIYDRNS---- 223 (324) T ss_pred ccccee------ccccccHHHHHHHHHhhhh--ccCCCCEEEEcHHHHHHHHHhhcc-----CC---CeeecCCCC---- Confidence 000000 0001112233344444432 235566789999999888753111 11 011111122 Q ss_pred EEecCceEEEecCCCcc--ceEEEE--------EecCCCcccceeeccccccccccccCCc-----cc---cceeeeeee Q lcl|Aclame:pro 411 GVLGGRFKVYIDQYAVN--DYFTVG--------FKGSTEMDAGVFYSPYVPLTPLRGSDSK-----NF---QPVIGFKTR 472 (514) Q Consensus 411 G~l~~~~~vy~D~y~~~--dy~~vG--------~kG~~~~~~~~fy~PYv~~~~~~~~dp~-----s~---qp~~~~~tR 472 (514) ++|. |++|++++.... ..+++| ..++-...- ..+.-... ..|+. -| +=.+=...| T Consensus 224 ~~l~-G~PV~~~~~~~~~~~~~~~gd~~~~~~g~~~~~~i~~----~~~~~~~~--~~~~~~~~~~~f~~d~~~~r~~~r 296 (324) T protein:vir:96 224 DSLD-GLPVVNLKSSNLKRGELITGDFDKLIYGIPQLIEYKI----DETAQLST--VKNEDGTPVNLFEQDMVALRATMH 296 (324) T ss_pred Cccc-ceeeEeeCCCCCCcceEEEEecceEEEEEecCcEEEE----eecccccc--cccccccchhhhhcCcEEEEEEEE Confidence 3344 358887765442 223333 222111100 00000000 00110 01 122223356 Q ss_pred eee-eecC--ccccccCcceeecCcchhh Q lcl|Aclame:pro 473 YGV-QVNP--FADPTASATKVGNGAPVAA 498 (514) Q Consensus 473 Y~l-~~nP--f~~~~~~~~~i~~~~~~~~ 498 (514) |+. ..+| |+..+.- ..-.+..+..- T Consensus 297 ~d~~v~~~~A~~~l~~a-~~~~~~~~~~~ 324 (324) T protein:vir:96 297 VALHIADDKAFAKLVPA-DKRTDSVPGEV 324 (324) T ss_pred EccEEecccceEEEecc-cccCCCCCCCC Confidence 665 2334 3322110 00011111111 No 78 >protein:vir:79928 Length: 393 # NCBI annotation: major head protein # Family: family:all:30335 # MgeID: mge:1874 # MgeName: 0305phi8-36 # Cross-refs: genbank:acc:YP_001429616;genbank:gi:156564106;genbank:GeneID:5525693 Probab=48.23 E-value=0.68 Score=21.52 Aligned_cols=357 Identities=15% Similarity=0.146 Sum_probs=149.3 Q ss_pred hhhhhhhhccccccccccccchhhhhhhhhhhhHHHHHHhcc--cccchhhhhhhccccccccccccccccccccccccc Q lcl|Aclame:pro 4 TEKWKDLLEAEGADMPEIATATKQKIMSKIFENQDRDINNDP--MYRDPQLVEAFNAGLNEAVVNGDHGYDPANIAQGVT 81 (514) Q Consensus 4 ~~kw~p~l~~~~~~~~~i~~~~~~~~~~~~~enq~~~~~~~~--~~~~~~~~~~~~~~~~~a~~~~~~g~~~~~~~~st~ 81 (514) .|.|..-|...| . -|-|++++++-| |-+ +..+.|++.+--.= +..+ T Consensus 1 ~~~~~~~~~~~~--~---------------~~~~~~e~k~lr~~me~--------~et~~e~~~~~~~~-~~~e------ 48 (393) T protein:vir:79 1 MENWLKQLKESG--F---------------TETQVQEQKSLRTRMER--------GETLAEADANKLAL-NEEE------ 48 (393) T ss_pred CchHHHHHHhcc--C---------------chhHHHHHHHHHHHhhh--------hhhhhhhhhhhhhc-chhH------ Confidence 788877776655 2 244444444332 111 22223332111000 0000 Q ss_pred cccccccceeeehhhhhhhhhhhhcceeE--Eec---CCcccceeeeeeeeecCCCCcccccccc---ccCCCCccCccc Q lcl|Aclame:pro 82 TGAVTNIGPTVMGMVRRAIPQLIAFDIAG--VQP---MTGPTSQVFTLRSVYGKDPLTGAEAFHP---TRQADASFSGQA 153 (514) Q Consensus 82 tg~v~~~~P~l~~l~Rra~~~LIa~DI~G--VQP---mTgPTGLIFAMRSrY~~~~~tg~EA~~~---~nEadt~fSG~~ 153 (514) +-|.- -+.|+.+.|+-| ||- ||.|+|.|--=|+.-+.- .-..|.++. +.+.-+=-+|++ T Consensus 49 -----------~el~E-~f~Kmm~G~~p~~eV~~~e~mtt~~a~IliP~vis~v~-~Eaaepl~~~~kl~qk~~L~~Grs 115 (393) T protein:vir:79 49 -----------TQILE-SFAKMMEGETPTNEVNLREFMATPSAQILIPRVIVGTM-REAAEPLYIGTKMLQKIRLKSGQS 115 (393) T ss_pred -----------HHHHH-HHHHHhcCCCchhheehhhhhcCCCcceechhhhhhhh-hhcccchhHHHHHHHHHhhhcCcc Confidence 22333 355666666554 555 888888876655552210 001111110 000000001111 Q ss_pred cccccccccccccccccccccccccccccccccccccccccccccccCccccccccccccccccccccccccchhhhccc Q lcl|Aclame:pro 154 AASTIADFPTTGAATDGTPYKAEVTTSGGDVSMRYFLALGAVTLAVAGQMTATEYTDGVAGGLLVEIDAGMATSQAELQE 233 (514) Q Consensus 154 ~~~~~~~~~~~~~~t~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~y~~~~GmtTa~aEal~ 233 (514) ..-.. +..--.|++++| T Consensus 116 m~F~~-----------------------------------------------------~g~~Ra~~IgEG---------- 132 (393) T protein:vir:79 116 MIFPS-----------------------------------------------------IGIMRAYDVAEG---------- 132 (393) T ss_pred eeccc-----------------------------------------------------hheeeecccccc---------- Confidence 10000 000012333333 Q ss_pred cCCCCCCccccc--ce-eEEEEEEEEeecccccccccHHHHHHHHhhcCCChhHHHHHHHHHHHHHHhhHHHHHHHhhhe Q lcl|Aclame:pro 234 NFNGSSNNEWNE--MS-FRIDKQVVEAKSRQLKAQYSIELAQDLRAVHGLDADAELSGILANEVMVELNREIVNLVNSQA 310 (514) Q Consensus 234 ~~ggs~~~~f~E--Ms-FsIEK~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAEaELanILStEImlEINReii~~l~~~a 310 (514) .+|++ |. |+-|++.++.|--.++=+||=|+.-| .|+|--.-+....-.-|..--.-+.++...+++ T Consensus 133 -------gE~~~~sld~~T~dsv~~~~gK~G~~Ia~SqEmIsD----Sg~Dvin~~l~aA~RaMaRkKee~a~n~fk~~g 201 (393) T protein:vir:79 133 -------QEIPEDSIDWQTHESPEIRVGKSGIRLRFTDEMISD----SQWDLMSMMIKQAGRAMGRHKEQKAYHQFRSHG 201 (393) T ss_pred -------ccccccchhhhcCCceeEEechhhhhhhhHHHHhhc----chHHHHHHHHHHHHHHHHhhhHHHHHhhhhccc Confidence 23444 44 66789999888777777777777666 688766666555555666666666777777766 Q ss_pred e-ecccccccccCCcceeccccccccccchhhHHHHHHHHHHHHHHHHHHHHhcccccccEEEEChhHHhHHhhccccc- Q lcl|Aclame:pro 311 Q-IGKSGWTQGAGAAGVFDFSDAVDVKGARWAGEAYKALLIQIEKEANEIGRQTGRGNGNFIIASRNVVSALSMTDTLV- 388 (514) Q Consensus 311 ~-v~~~~~~~~v~~~g~~dl~~~~d~~~~rwa~e~~r~L~~~i~~~a~~I~~~T~r~~~n~~v~S~~va~~L~~~g~~~- 388 (514) + +...+-+..++..=-.||+. ...+..+.| -+.+|.-+|+ .--+.++.++--|=+-+..+---.|- T Consensus 202 htvfDa~st~t~ahptGr~~~~---~qNGTlSle-------DllDm~~av~--~~hyt~svi~MHPLAWnv~AKna~me~ 269 (393) T protein:vir:79 202 HTVFDNYSTNKLAHTTGLDKNG---VQNDTFSAE-------DFLDLIIAVM--ANEYTPSDLMMHPLAWTVFAKNELMGS 269 (393) T ss_pred ceeeeccccCccceeecCCccc---cccccccHH-------HHHHHHHHHh--cccCCcceEEEcCchhhhhhhhhhhcc Confidence 5 33333333333221122221 111222333 3344443332 23588888888876544443211110 Q ss_pred -cchhccccCccccccccC---ceEEE--EecCc----eEEEecCCCc-------cceEEEEEecCCCcccceeeccccc Q lcl|Aclame:pro 389 -GPAAQGMQDGSMNTDTNQ---TVFAG--VLGGR----FKVYIDQYAV-------NDYFTVGFKGSTEMDAGVFYSPYVP 451 (514) Q Consensus 389 -~~~~~~~~~~~~~~d~~~---~~~~G--~l~~~----~~vy~D~y~~-------~dy~~vG~kG~~~~~~~~fy~PYv~ 451 (514) +..+-+ |.+..+ ...-| .+.|| +.|.+-|.-| +||+.|-- +..+--|.-- T Consensus 270 ~~~na~g------N~~~~~~~ts~algp~~i~~~~~~nlnv~~sPfvp~d~k~~rFd~~~Vd~---NnvgvlLV~D---- 336 (393) T protein:vir:79 270 LQANPYG------NYPAKGAPSSMALGPDSIQGRLPFNFNVNLSPFIPLDKKSRRFDVYAVDR---NNVGVLLVRD---- 336 (393) T ss_pred eeecccc------ccCccccchhhhhchhhhccccccceeEEEecccccccccceeeEEEeec---CCceEEEEec---- Confidence 000000 111000 00001 23333 4444443322 34444421 1111111000 Q ss_pred ccccccc-CCccccceeeeeeeeee-eecCccccccCcceeecCcchhhhccccceeeeeeeecC Q lcl|Aclame:pro 452 LTPLRGS-DSKNFQPVIGFKTRYGV-QVNPFADPTASATKVGNGAPVAASMGKNAYFRRVFVKGL 514 (514) Q Consensus 452 ~~~~~~~-dp~s~qp~~~~~tRY~l-~~nPf~~~~~~~~~i~~~~~~~~~~~~~~~~r~~~V~~~ 514 (514) ...+.-. ||.----+|=++-|||+ +.|-=.-+. ..|+|.- .-.|+--..+||+ T Consensus 337 ~i~tdq~ddk~rdiq~iKl~ERYG~gvLn~gkaia--vakNI~~--------~k~y~~P~~~~~~ 391 (393) T protein:vir:79 337 DLKTDQWDEKARGLQNIKMIERYGIGILNEGKAIA--VAKNISM--------DKSYAEPMLIKNV 391 (393) T ss_pred CcceeccccccccceeeeeeeeeceeeeeCCceEE--EEeccee--------ecccccchhhhcc Confidence 0000001 33333446778889999 555411110 1222211 1235555666777 No 79 >protein:vir:4339 Length: 395 # NCBI annotation: major head protein # Family: family:all:585 # MgeID: mge:93 # MgeName: D3 # Cross-refs: genbank:acc:NP_061502;genbank:gi:9635591;genbank:GeneID:1262860 Probab=47.13 E-value=0.71 Score=21.39 Aligned_cols=348 Identities=11% Similarity=0.074 Sum_probs=121.7 Q ss_pred Ccchhhhhhhhc-------cccccc---cccccchhhh----------hhhhhhhhHHHHHHhcccccchhhhhhhcc-- Q lcl|Aclame:pro 1 MNLTEKWKDLLE-------AEGADM---PEIATATKQK----------IMSKIFENQDRDINNDPMYRDPQLVEAFNA-- 58 (514) Q Consensus 1 ~~l~~kw~p~l~-------~~~~~~---~~i~~~~~~~----------~~~~~~enq~~~~~~~~~~~~~~~~~~~~~-- 58 (514) -.|++++.-+-+ .-.++. -+.....++. +-+++.+-+.+.......-...+.....+. T Consensus 9 ~el~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 88 (395) T protein:vir:43 9 GELNASLKQVGDQIKSQAEQVNTQIANFGEMNKETRAKVDELLTAQGELQARLSAAEQAMLANEKRDGGEEAPKTAGQMV 88 (395) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccchhhhHHHHH Confidence 112222221111 000000 0000001111 011111110000000000000000000000 Q ss_pred --------cccccccccccccccccccccccccccccccee-eehhhhhhhhhhhhcceeEEecCCcccceeeeeeeeec Q lcl|Aclame:pro 59 --------GLNEAVVNGDHGYDPANIAQGVTTGAVTNIGPT-VMGMVRRAIPQLIAFDIAGVQPMTGPTSQVFTLRSVYG 129 (514) Q Consensus 59 --------~~~~a~~~~~~g~~~~~~~~st~tg~v~~~~P~-l~~l~Rra~~~LIa~DI~GVQPmTgPTGLIFAMRSrY~ 129 (514) +..... .+..-....+...++++.+-....|. .-.++++.-+..+..++|.++||.+++.-+ .| .. T Consensus 89 ~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~g~~vp~~~~~~ii~~~~~~~~l~~l~~~~~~~~~~~~~--~~--~~ 163 (395) T protein:vir:43 89 AESLKEQGVTSSLR-GSHRVSMPRSAITSIDGSGGALVAPDRRPGVVAAPQRRLTIRDLVAPGTTESNSVEY--VR--ET 163 (395) T ss_pred HHHHHHHHHHHHhh-hhhhhhhhhhhhcccCCCCccccchhhHHHHHHHHHhhhhHHhhccceecCCCceEE--EE--Ee Confidence 000000 00000000000000100000001111 112333344666778888888887664311 11 11 Q ss_pred CCCCccccccccccCCCCccCccccccccccccccccccccccccccccccccccccccccccccccccccCcccccccc Q lcl|Aclame:pro 130 KDPLTGAEAFHPTRQADASFSGQAAASTIADFPTTGAATDGTPYKAEVTTSGGDVSMRYFLALGAVTLAVAGQMTATEYT 209 (514) Q Consensus 130 ~~~~tg~EA~~~~nEadt~fSG~~~~~~~~~~~~~~~~t~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~ 209 (514) ... +.+.| T Consensus 164 ~~~------------~~a~~------------------------------------------------------------ 171 (395) T protein:vir:43 164 GFV------------NNAAP------------------------------------------------------------ 171 (395) T ss_pred cCC------------Cceee------------------------------------------------------------ Confidence 000 00000 Q ss_pred ccccccccccccccccchhhhccccCCCCCCcccccceeEEEEEEEEeecccccccccHHHHHHHHhhcCCChhHHHHHH Q lcl|Aclame:pro 210 DGVAGGLLVEIDAGMATSQAELQENFNGSSNNEWNEMSFRIDKQVVEAKSRQLKAQYSIELAQDLRAVHGLDADAELSGI 289 (514) Q Consensus 210 ~~~a~~~~y~~~~GmtTa~aEal~~~ggs~~~~f~EMsFsIEK~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAEaELanI 289 (514) + +| +...++-..+++++++..|.-+-...+|.||.||.- +.++.|.+- T Consensus 172 ----------v--------~E---------~~~~~~~~~~~~~i~~~~~k~~~~~~is~ell~d~~-----~l~~~v~~~ 219 (395) T protein:vir:43 172 ----------V--------SE---------GTQKPYSDLTFELENAPVRTIAHLFKASRQILDDAS-----ALQSYIDAR 219 (395) T ss_pred ----------e--------cC---------CccccccccceeEEEEeeeeEEEeehhhHHHHHhHH-----HHHHHHHHH Confidence 0 01 011223334455555555555566779999999863 358889999 Q ss_pred HHHHHHHHhhHHHHHHHhhheeecccccccccCCcceeccccccccccchhhHHHHHHHHHHHHHHHHHHHHhccccccc Q lcl|Aclame:pro 290 LANEVMVELNREIVNLVNSQAQIGKSGWTQGAGAAGVFDFSDAVDVKGARWAGEAYKALLIQIEKEANEIGRQTGRGNGN 369 (514) Q Consensus 290 LStEImlEINReii~~l~~~a~v~~~~~~~~v~~~g~~dl~~~~d~~~~rwa~e~~r~L~~~i~~~a~~I~~~T~r~~~n 369 (514) |+..+...+|+.||. -..+ ++ ...|++......-... -... ....++..|.++.+.+. ...+++. T Consensus 220 la~a~~~~~d~~~l~---G~g~-~~-------~~~Gi~~~~~~~~~~~-~~~~-~~~~~~~~i~~~~~~~~--~~~~~~~ 284 (395) T protein:vir:43 220 ARYGLMLVEECQLLY---GNGT-GA-------NLHGIIPQAQAYAPPS-GVVV-TAEQRIDRIRLAILQAQ--LAEFPAS 284 (395) T ss_pred HHHHHHHHHHHHHHh---ccCC-CC-------cccccccccccccccc-cccc-ccchhHHHHHHHHHhhc--cccCCCc Confidence 999999999998843 1111 11 1123322110000000 0000 01122333444444443 2245667 Q ss_pred EEEEChhHHhHHhhccccccchhccccCccccccccCceEEEEecCceEEEecCCCccceEEEEEecCCCcccceeeccc Q lcl|Aclame:pro 370 FIIASRNVVSALSMTDTLVGPAAQGMQDGSMNTDTNQTVFAGVLGGRFKVYIDQYAVNDYFTVGFKGSTEMDAGVFYSPY 449 (514) Q Consensus 370 ~~v~S~~va~~L~~~g~~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~kG~~~~~~~~fy~PY 449 (514) .+|+||.....|...- + ..| ...-.+... --.++|. |++|+++++.+.+=+++|--... -+++ . T Consensus 285 ~~vmn~~~~~~l~~lk--d---~~G---~~i~~~~~~-~~~~~l~-G~pVv~~~~~~~~~~~~gd~~~~----~~~~--~ 348 (395) T protein:vir:43 285 GIVLNPIDWALIELNK--D---AEN---RYIIGSPQN-GTTPTLW-RLPVVETQAITQDEFLTGAFSLG----AQIF--D 348 (395) T ss_pred EEEEcHHHHHHHHHhh--c---cCC---ceecccccc-CCCceec-ceeeEEcCCCCCCcEEEEeccce----EEEE--E Confidence 8999999987775321 1 111 111111111 1124565 47999999988655554421110 0000 0 Q ss_pred cccccccccC--Ccccc---ceeeeeeeeeee-ecC--ccccccCcc Q lcl|Aclame:pro 450 VPLTPLRGSD--SKNFQ---PVIGFKTRYGVQ-VNP--FADPTASAT 488 (514) Q Consensus 450 v~~~~~~~~d--p~s~q---p~~~~~tRY~l~-~nP--f~~~~~~~~ 488 (514) -....+...+ -..|+ =.+-+..|++.. .+| |...+=-.+ T Consensus 349 ~~~~~i~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~~~~taa 395 (395) T protein:vir:43 349 RMDIEVLVSTENDKDFENNMVTIRAEERLAFAVYRPEAFVTGSLTAS 395 (395) T ss_pred ecceEEEEeccccchhhcCcEEEEEEEeeccEEecccceEEEEeccC Confidence 0011111111 11232 233344577763 334 322110000 No 80 >protein:vir:80376 Length: 435 # NCBI annotation: gp6, major capsid head protein # Family: family:all:21 # MgeID: mge:1881 # MgeName: phi644-2 # Cross-refs: genbank:acc:YP_001111085;genbank:gi:134288639;genbank:GeneID:4960624 Probab=43.59 E-value=0.84 Score=21.00 Aligned_cols=340 Identities=15% Similarity=0.129 Sum_probs=111.2 Q ss_pred Ccchhhhhhhhcccc--ccccc-c--------------------ccchhhhhhhhhhhhHHHHHHhcccccchhhhhhhc Q lcl|Aclame:pro 1 MNLTEKWKDLLEAEG--ADMPE-I--------------------ATATKQKIMSKIFENQDRDINNDPMYRDPQLVEAFN 57 (514) Q Consensus 1 ~~l~~kw~p~l~~~~--~~~~~-i--------------------~~~~~~~~~~~~~enq~~~~~~~~~~~~~~~~~~~~ 57 (514) ..|.++..-+-+-+- +.... + .+..|..-.++++ +.+..... +.. .++. T Consensus 46 ~~l~~~i~~~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~--~~~--~~~~ 117 (435) T protein:vir:80 46 NELTAQIERAEAAERMAAAAAVPVDPNPAAVTASAAAPVYAQPKAPEVKGAKMARMV----RALAAARG--DAQ--LASK 117 (435) T ss_pred HHHHHHHHHHHHHHHHHHhhcccccchhhhhccccccccccccchhhhhHHHHHHHH----HHHHhccc--hhH--HHHH Confidence 222222221110000 00000 0 0000111111110 11100000 000 0000 Q ss_pred cccccccccccccccccccccccccccccccceeeeh------hhhhhhhhhhhcce-eEEecCCcccceeeeeeeeecC Q lcl|Aclame:pro 58 AGLNEAVVNGDHGYDPANIAQGVTTGAVTNIGPTVMG------MVRRAIPQLIAFDI-AGVQPMTGPTSQVFTLRSVYGK 130 (514) Q Consensus 58 ~~~~~a~~~~~~g~~~~~~~~st~tg~v~~~~P~l~~------l~Rra~~~LIa~DI-~GVQPmTgPTGLIFAMRSrY~~ 130 (514) .... ...+.+..+. -.+++ ......||+ +++++-+..+...+ +=+.||+.+.- +|+- T Consensus 118 ~~~~-----~~~~~~~~~~-~~~~~---~~~gg~lvP~~~~~~ii~~l~~~~~i~~~~~~~v~~~~~~~-------~~p~ 181 (435) T protein:vir:80 118 LAIE-----RGFGEEVAMS-LNTLS---PGAGGVLVPENLSSEVIELLRPKSVVRKLGARTLPLSNGNI-------TIPR 181 (435) T ss_pred HHHh-----hhhhhhhhhh-hcccC---CCCCccccchhHHHHHHHHHhhhchhhhccceeeecCCCce-------EEEE Confidence 0000 0000000000 00001 111112222 22222233333333 22334433211 1110 Q ss_pred CCCccccccccccCCCCccCccccccccccccccccccccccccccccccccccccccccccccccccccCccccccccc Q lcl|Aclame:pro 131 DPLTGAEAFHPTRQADASFSGQAAASTIADFPTTGAATDGTPYKAEVTTSGGDVSMRYFLALGAVTLAVAGQMTATEYTD 210 (514) Q Consensus 131 ~~~tg~EA~~~~nEadt~fSG~~~~~~~~~~~~~~~~t~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 210 (514) .. ++.++.| T Consensus 182 ~~-~~~~a~~---------------------------------------------------------------------- 190 (435) T protein:vir:80 182 LK-GGAIVGY---------------------------------------------------------------------- 190 (435) T ss_pred Ee-CCcceee---------------------------------------------------------------------- Confidence 00 0000000 Q ss_pred cccccccccccccccchhhhccccCCCCCCcccccceeEEEEEEEEeecccccccccHHHHHHHHhhcCCChhHHHHHHH Q lcl|Aclame:pro 211 GVAGGLLVEIDAGMATSQAELQENFNGSSNNEWNEMSFRIDKQVVEAKSRQLKAQYSIELAQDLRAVHGLDADAELSGIL 290 (514) Q Consensus 211 ~~a~~~~y~~~~GmtTa~aEal~~~ggs~~~~f~EMsFsIEK~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAEaELanIL 290 (514) + .| +..+++...++++++...+.-+-....|.||.+|-.- +.|.|+.|.+-| T Consensus 191 ---------v--------~E---------~~~~~~~~~~f~~i~~~~~k~~~~~~is~ell~ds~~--~~~l~~~i~~~l 242 (435) T protein:vir:80 191 ---------I--------GA---------DTDIPTTQQQFDDLKLTAKKMAALVPIANDLIKYAGV--NPNVDQIVVGDL 242 (435) T ss_pred ---------e--------cc---------CccccccccceeeEEEeeEEEEEeehhhHHHHHhhcc--cHHHHHHHHHHH Confidence 0 01 1123444455666666666666677799999999432 345778888888 Q ss_pred HHHHHHHhhHHHHHHHhhheeecccccccccCCcceeccccccccccchhhHHHHHHHHHHHHHHHHHHHHhcccccccE Q lcl|Aclame:pro 291 ANEVMVELNREIVNLVNSQAQIGKSGWTQGAGAAGVFDFSDAVDVKGARWAGEAYKALLIQIEKEANEIGRQTGRGNGNF 370 (514) Q Consensus 291 StEImlEINReii~~l~~~a~v~~~~~~~~v~~~g~~dl~~~~d~~~~rwa~e~~r~L~~~i~~~a~~I~~~T~r~~~n~ 370 (514) +.-|...+++.|+. -.-+ +-.+.|++......-+... -.......+...+.+.-..+...-....... T Consensus 243 ~~a~~~~~d~a~l~---G~G~--------~~~p~Gi~~~~~~~~~~~~-~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~ 310 (435) T protein:vir:80 243 TAAIGAREDKAFIR---DDGT--------ANTPKGLRFWALPGNVITA-SDGSTLQKIETDLGKAILALENADANLTQPG 310 (435) T ss_pred HHHHHHHHHHHhhc---cCCC--------CCcccceeecccccceeec-ccccchhhHHHHHHHHHHHhhccccccccCE Confidence 88888888887732 2110 0012333321111000000 0001122222222222222221111234556 Q ss_pred EEEChhHHhHHhhccccccchhccccCccccccccCceEEEEecCceEEEecCCCccc--------e--------EEEEE Q lcl|Aclame:pro 371 IIASRNVVSALSMTDTLVGPAAQGMQDGSMNTDTNQTVFAGVLGGRFKVYIDQYAVND--------Y--------FTVGF 434 (514) Q Consensus 371 ~v~S~~va~~L~~~g~~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~d--------y--------~~vG~ 434 (514) .|++|.....|....-- .| ...-.+..+ |+|. |++||++.+.|.+ . ++||- T Consensus 311 ~vmn~~~~~~L~~lkd~-----~G---~~l~~~~~~----~~l~-G~pv~~~~~~p~~~~~~~~~~~i~~gd~s~~~i~~ 377 (435) T protein:vir:80 311 WIMAPRTFRFLEGLRDG-----NG---NKVYPELAN----GMLK-GYPVGKTTQVPINLGEAGKESEIYFTDFGDVFIGE 377 (435) T ss_pred EEEcHHHHHHHHhhhcc-----CC---ceeccCCCC----CeEe-eeeeEEeccccccccCCCCcceEEEEEcccEEEEe Confidence 78999999888753211 11 111112222 4554 4699998886531 1 22333 Q ss_pred ecCCCcccceeeccccccccccccCCccc---cceeeeeeeeeeee-cCccccccCcceeecCcchhh Q lcl|Aclame:pro 435 KGSTEMDAGVFYSPYVPLTPLRGSDSKNF---QPVIGFKTRYGVQV-NPFADPTASATKVGNGAPVAA 498 (514) Q Consensus 435 kG~~~~~~~~fy~PYv~~~~~~~~dp~s~---qp~~~~~tRY~l~~-nPf~~~~~~~~~i~~~~~~~~ 498 (514) .+.-..+ ..+|.-...-...--..| +=.+=+.-|++..+ +|= .-.++++-.|.+ T Consensus 378 ~~~~~i~----~~~~~~~~~~~~~~~~~f~~n~~~~r~~~r~d~~~~~~~------a~~~l~~~~~~~ 435 (435) T protein:vir:80 378 EETLEID----YSKEATYKDADGHMVSAFQRDQTLIRVIAKNDFGPRHVE------SIAVLSGVAWGA 435 (435) T ss_pred ecceEEE----EeccccccccccchhhhhhcCcceeeeeeeeCcEeeccc------ceEEEeccCCCC Confidence 3322211 111111000000000001 12223445565532 341 111222233333 No 81 >protein:vir:4092 Length: 390 # NCBI annotation: major capsid protein a # Family: family:all:635 # MgeID: mge:86 # MgeName: 2389 # Cross-refs: genbank:acc:NP_510986;swissprot:trembl:q8w604;genbank:gi:17488508;uniprot:Q8W604;genbank:GeneID:1260361 Probab=43.24 E-value=0.85 Score=20.96 Aligned_cols=349 Identities=13% Similarity=0.081 Sum_probs=110.6 Q ss_pred Ccchh---hhhhhhccccccccccccchhhhhhhhhhhhHHHHHHh----cc-cc--cchhhhhhhcccccccccccccc Q lcl|Aclame:pro 1 MNLTE---KWKDLLEAEGADMPEIATATKQKIMSKIFENQDRDINN----DP-MY--RDPQLVEAFNAGLNEAVVNGDHG 70 (514) Q Consensus 1 ~~l~~---kw~p~l~~~~~~~~~i~~~~~~~~~~~~~enq~~~~~~----~~-~~--~~~~~~~~~~~~~~~a~~~~~~g 70 (514) .+|.| |..-+-... +=.+....+..-..+.+++..+.+.+ +. .. +.....+.+............-. T Consensus 2 k~L~e~~~e~~e~~~~~---~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~r~ 78 (390) T protein:vir:40 2 NNLDKKDSETLNISTAF---LNAIKEGATEAEQVTAFTNMAEQIQNNIIAQARKEVNREMNDNNVLASRGANALTSDESK 78 (390) T ss_pred chHHHHHHHHHHHHHHH---HHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCchhccHHHHH Confidence 11221 111111100 00011100100001111111111100 00 00 00000000000000000000000 Q ss_pred ccccccccccccccccccceeeeh------hhhhhhhhhhhcceeEEecCCcccceeeeeeeeecCCCCccccccccccC Q lcl|Aclame:pro 71 YDPANIAQGVTTGAVTNIGPTVMG------MVRRAIPQLIAFDIAGVQPMTGPTSQVFTLRSVYGKDPLTGAEAFHPTRQ 144 (514) Q Consensus 71 ~~~~~~~~st~tg~v~~~~P~l~~------l~Rra~~~LIa~DI~GVQPmTgPTGLIFAMRSrY~~~~~tg~EA~~~~nE 144 (514) .-...+...++++ ...||+ +.+.+-..-+-.++|-+.||++....|.. .... .++.+ T Consensus 79 ~~~~~~~~~~~~~-----gg~lvP~~~~~~I~~~~~~~s~i~~~~~~~~~~~~~~~i~~----~~~~----~~a~~---- 141 (390) T protein:vir:40 79 YYNEVIAGNGFAG-----VTALLPPTVFERVFEDLTVEHPLLSKINFVNTTATTEWIIS----VGDV----ATAWW---- 141 (390) T ss_pred HHHHHHhccCccc-----CcccccHHHHHHHHHHHHhhhhhhhhceeeecCCceeEEEE----EcCC----cceee---- Confidence 0000001111110 112222 22222333345678899998875544431 1110 01100 Q ss_pred CCCccCccccccccccccccccccccccccccccccccccccccccccccccccccCccccccccccccccccccccccc Q lcl|Aclame:pro 145 ADASFSGQAAASTIADFPTTGAATDGTPYKAEVTTSGGDVSMRYFLALGAVTLAVAGQMTATEYTDGVAGGLLVEIDAGM 224 (514) Q Consensus 145 adt~fSG~~~~~~~~~~~~~~~~t~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~y~~~~Gm 224 (514) -+. + T Consensus 142 -----~~E----------------------------------------------------------------------~- 145 (390) T protein:vir:40 142 -----GPL----------------------------------------------------------------------C- 145 (390) T ss_pred -----ecc----------------------------------------------------------------------c- Confidence 000 0 Q ss_pred cchhhhccccCCCCCCcccccceeEEEEEEEEeecccccccccHHHHHHHHhhcCCChhHHHHHHHHHHHHHHhhHHHHH Q lcl|Aclame:pro 225 ATSQAELQENFNGSSNNEWNEMSFRIDKQVVEAKSRQLKAQYSIELAQDLRAVHGLDADAELSGILANEVMVELNREIVN 304 (514) Q Consensus 225 tTa~aEal~~~ggs~~~~f~EMsFsIEK~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAEaELanILStEImlEINReii~ 304 (514) ++. .......|.+..|++.|..+-. ..|-||.+|-- .|.|++|.+.|+..|..-+|+.||. T Consensus 146 ----~~~----~~~~~~~f~~i~l~~~k~~~~i-------~iS~ell~ds~----~~l~~~i~~~la~~i~~~~~~a~l~ 206 (390) T protein:vir:40 146 ----AEI----KEVLDNGFDKIQTGMYKLSAYI-------PVCNAMLDLGP----SWLDQYVRTILGEAMALGLEAGIVN 206 (390) T ss_pred ----ccc----CccccccceeeEeeeeeEEEee-------hhhHHHHhcch----HHHHHHHHHHHHHHHHHHHHhhhhc Confidence 000 0112335777777777776543 48999999864 4579999999999999999999954 Q ss_pred HHhhheeecccccccccCCcceecccccc------ccccchhhHHHHHHHHHHHHHHHHHHHHhcccccccEEEEChhH- Q lcl|Aclame:pro 305 LVNSQAQIGKSGWTQGAGAAGVFDFSDAV------DVKGARWAGEAYKALLIQIEKEANEIGRQTGRGNGNFIIASRNV- 377 (514) Q Consensus 305 ~l~~~a~v~~~~~~~~v~~~g~~dl~~~~------d~~~~rwa~e~~r~L~~~i~~~a~~I~~~T~r~~~n~~v~S~~v- 377 (514) - .-+- .+.|++.-.... ........-+-.-.++..+......-.... ++++.| ||++.. T Consensus 207 G---~G~~---------~P~Gil~~~~~~~~~~~~~~~~~~~t~~~~~~~~~~l~~~~~~~~~~~-~~~a~~-i~n~~t~ 272 (390) T protein:vir:40 207 G---SGKD---------QPIGMMRDLNNVTAGEHPVKTATPLTDLTPATLATKVMLPLTDNGKKS-VSDAIL-VINPADY 272 (390) T ss_pred c---cCCC---------ccceeeeccccccccccccccccccchhhHHHHHHHHHHHhhcchhhh-hcCceE-EEcchhH Confidence 1 1100 011221100000 000000000112222222222211111111 233444 566554 Q ss_pred HhHHhhccccccchhccccCccccccccCceEEEEecCceEEEecCCCccceEEEEE--------ecCCCcccc--eeec Q lcl|Aclame:pro 378 VSALSMTDTLVGPAAQGMQDGSMNTDTNQTVFAGVLGGRFKVYIDQYAVNDYFTVGF--------KGSTEMDAG--VFYS 447 (514) Q Consensus 378 a~~L~~~g~~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~--------kG~~~~~~~--~fy~ 447 (514) +..|...-++ .|..++...+.+.-+++|+++++.+.+-++.|- .+....+.+ .+|. T Consensus 273 ~~~l~~~~~~--------------~d~~G~~v~~~~~~g~pvv~~~~~p~~~i~~Gd~s~~~i~~~~~~~v~~~~~~~f~ 338 (390) T protein:vir:40 273 WSKIYAATSY--------------MTPQGVWVTGILPVPLEIVQSVAVPVGKAVAGRAKDYFMGIGSEQVIRTSTEYRLL 338 (390) T ss_pred HHHHHHHhhc--------------cCCCCccccccCCCceeEEEcCCCCCCcEEEEeeceEEEEeecceEEEecchhhhh Confidence 4555432222 122233223334457899999998865444442 221111110 0000 Q ss_pred -ccc-----ccccccccCCccccceeeeeeeee-eeecCccccccCcceeecCcchhhhccccc Q lcl|Aclame:pro 448 -PYV-----PLTPLRGSDSKNFQPVIGFKTRYG-VQVNPFADPTASATKVGNGAPVAASMGKNA 504 (514) Q Consensus 448 -PYv-----~~~~~~~~dp~s~qp~~~~~tRY~-l~~nPf~~~~~~~~~i~~~~~~~~~~~~~~ 504 (514) ..+ .-.....+||+.|. ++=++.==+ -.+.||....+..+ -. ++. T Consensus 339 ~~~~~~r~~~r~dg~v~~~~A~~-~l~~~~~~~~~~~~~~~~~~~~~~---------~~--~~~ 390 (390) T protein:vir:40 339 DDETLYYAKQYANGRPKDNSSFL-VFDITGLEGSPAIDVNVVNNATPS---------ET--PAE 390 (390) T ss_pred cCcEEEEEEEEeCCEEecccceE-EEEeeccCCCCCCCcceeeCCCCC---------CC--CCC Confidence 000 00000012333332 000000000 01122221100000 00 000 No 82 >protein:vir:4830 Length: 397 # NCBI annotation: MPL-7201 # Family: family:all:21 # MgeID: mge:105 # MgeName: 7201 # Cross-refs: genbank:acc:NP_038327;genbank:gi:9634653;genbank:GeneID:1262632 Probab=41.82 E-value=0.91 Score=20.81 Aligned_cols=319 Identities=14% Similarity=0.114 Sum_probs=113.7 Q ss_pred Ccchhhhhhhhccc----cccccccccchhhhhhhhhhhhHHHHHHhcccccchhhhhhhcccccccccccccccccccc Q lcl|Aclame:pro 1 MNLTEKWKDLLEAE----GADMPEIATATKQKIMSKIFENQDRDINNDPMYRDPQLVEAFNAGLNEAVVNGDHGYDPANI 76 (514) Q Consensus 1 ~~l~~kw~p~l~~~----~~~~~~i~~~~~~~~~~~~~enq~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~g~~~~~~ 76 (514) -.|.++=.-+.+.. ....+......++ ....-.++++.+. ...|..++.... .-..... T Consensus 47 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~-----------~~~~~~~~~~~~----~~~~~~~- 109 (397) T protein:vir:48 47 DTAKMKRDMFKEQYTEARANEVVNMSEEEKK-PLTKSEEEVKAGF-----------VKDFKNLVRGRY----QNLLDSK- 109 (397) T ss_pred HHHHHHHHHHHHHHHHHHHhhhhhhhhhccc-cccchhhHHHHHH-----------HHHHHHHHhhhh----hHHHHHh- Confidence 01111100000000 0000000000000 0000001110000 011111111000 0000000 Q ss_pred cccc-ccccc---cccceeeehhhhhhhhhhhhcceeEEecCCcccceeeeeeeeecCCCCccccccccccCCCCccCcc Q lcl|Aclame:pro 77 AQGV-TTGAV---TNIGPTVMGMVRRAIPQLIAFDIAGVQPMTGPTSQVFTLRSVYGKDPLTGAEAFHPTRQADASFSGQ 152 (514) Q Consensus 77 ~~st-~tg~v---~~~~P~l~~l~Rra~~~LIa~DI~GVQPmTgPTGLIFAMRSrY~~~~~tg~EA~~~~nEadt~fSG~ 152 (514) ..++ +.|.+ ..+.+.++.++ .+...-.+++.++||++++|-+--.+. .+. . +.+.|-+ T Consensus 110 ~~~t~~~gg~~iP~~~~~~ii~~~---~~~~~l~~~~~~~~~~~~~~~~~~~~~--~~~---~---------~~a~~v~- 171 (397) T protein:vir:48 110 TDASGSDAGLTIPQDIQTAIHTLV---RQYDSLQEYVNVENVTTLTGSRVYEKW--ADI---T---------GLAKLDD- 171 (397) T ss_pred hccCCccccccccHHHHHHHHHHH---HHHHHHHhhhceeeccCCcceEEEEee--cCC---C---------cceeeec- Confidence 0011 11111 12222333333 355566888999999998885543221 111 0 0000000 Q ss_pred ccccccccccccccccccccccccccccccccccccccccccccccccCccccccccccccccccccccccccchhhhcc Q lcl|Aclame:pro 153 AAASTIADFPTTGAATDGTPYKAEVTTSGGDVSMRYFLALGAVTLAVAGQMTATEYTDGVAGGLLVEIDAGMATSQAELQ 232 (514) Q Consensus 153 ~~~~~~~~~~~~~~~t~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~y~~~~GmtTa~aEal 232 (514) ++ +.. T Consensus 172 ---------------------------------------------------------------------E~------~~~ 176 (397) T protein:vir:48 172 ---------------------------------------------------------------------EA------GSI 176 (397) T ss_pred ---------------------------------------------------------------------cc------ccc Confidence 00 000 Q ss_pred ccCCCCCCcccccceeEEEEEEEEeecccccccccHHHHHHHHhhcCCChhHHHHHHHHHHHHHHhhHHHHHHHhhheee Q lcl|Aclame:pro 233 ENFNGSSNNEWNEMSFRIDKQVVEAKSRQLKAQYSIELAQDLRAVHGLDADAELSGILANEVMVELNREIVNLVNSQAQI 312 (514) Q Consensus 233 ~~~ggs~~~~f~EMsFsIEK~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAEaELanILStEImlEINReii~~l~~~a~v 312 (514) ..+....|.++.|++.|..+ ...+|-||.+|-. .|.+++|.+-|+..|..-+|+.|+.-. -+ T Consensus 177 ---~~~~~~~~~~v~~~~~k~~~-------~~~iS~ell~ds~----~~l~~~v~~~l~~~~~~~~d~~il~G~---g~- 238 (397) T protein:vir:48 177 ---GTNDDPKLYPIRYAIKRYAG-------ISTVTNSLLADSA----ENILAWLSGWIAKKVVVTRNKAILEAI---AT- 238 (397) T ss_pred ---ccccccceeeEEeeheeeee-------ehhhHHHHHhhch----HHHHHHHHHHHHHHHHHHHHHHHhhcc---cc- Confidence 00112235555555555544 4579999999853 467999999999999999999995432 11 Q ss_pred cccccccccCCcceeccccccccccchhhHHHHHHHHHHHHHHHHHHHHhcccccccEEEEChhHHhHHhhccccccchh Q lcl|Aclame:pro 313 GKSGWTQGAGAAGVFDFSDAVDVKGARWAGEAYKALLIQIEKEANEIGRQTGRGNGNFIIASRNVVSALSMTDTLVGPAA 392 (514) Q Consensus 313 ~~~~~~~~v~~~g~~dl~~~~d~~~~rwa~e~~r~L~~~i~~~a~~I~~~T~r~~~n~~v~S~~va~~L~~~g~~~~~~~ 392 (514) +....+..++ +-...++..+.. . ...+..+||+|.....|.... . . T Consensus 239 -------~~~~~~~~~~-------------d~i~~~~~~l~~-------~--~~~~a~~v~n~~~~~~L~~lk---d--~ 284 (397) T protein:vir:48 239 -------LPTKPTLTKW-------------DDIIDLQAKVDP-------A--IKQTSFFLTNTSGFTALKKVK---N--A 284 (397) T ss_pred -------cccccccccH-------------HHHHHHHHHhhh-------h--hcCCCEEEECHHHHHHHHHhh---c--C Confidence 0011122211 123334433332 1 234567889999998887531 1 0 Q ss_pred ccccCccccccccCceEEEEecCceEEEe--cCCCcc--------------ceEEEEEecCCCcccceeecccccccccc Q lcl|Aclame:pro 393 QGMQDGSMNTDTNQTVFAGVLGGRFKVYI--DQYAVN--------------DYFTVGFKGSTEMDAGVFYSPYVPLTPLR 456 (514) Q Consensus 393 ~~~~~~~~~~d~~~~~~~G~l~~~~~vy~--D~y~~~--------------dy~~vG~kG~~~~~~~~fy~PYv~~~~~~ 456 (514) .| +.-...+.+.. ..++|.|. +|++ |...+. +|++++..+.-... ..++.. T Consensus 285 ~G--~~i~~~~~~~~-~~~~l~G~-PV~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~----~~~~~~----- 351 (397) T protein:vir:48 285 FG--DYLMERDVKSP-TGYSIDGF-AVKEVADRWLANASSGAMPLYFGDLKQAVTLFDRQQMSLL----STNIGG----- 351 (397) T ss_pred CC--ceeeccCcCCC-CCceeccc-eeEEecccccCCcCCCceEEEEEeccceEEEEeecceEEE----Eeccch----- Confidence 00 00011111111 11456554 6654 222211 12333333222111 111110 Q ss_pred ccCCccccceeeeeeeeee-eecC--cccc-----ccCcceeecCcchhhhc Q lcl|Aclame:pro 457 GSDSKNFQPVIGFKTRYGV-QVNP--FADP-----TASATKVGNGAPVAASM 500 (514) Q Consensus 457 ~~dp~s~qp~~~~~tRY~l-~~nP--f~~~-----~~~~~~i~~~~~~~~~~ 500 (514) -+-.+.+=.+-...||+. ..+| |... .+... ....... T Consensus 352 -~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~~~~~~~~-----~~~~~~~ 397 (397) T protein:vir:48 352 -GAFETDTTKIRVIDRFDVVATDTESFVPASFKAIADQKG-----NLGSTAV 397 (397) T ss_pred -hhhhcCceeEEEEeeeccEEecccceEEEEecccccCCC-----CccccCC Confidence 011222334444455544 2333 2111 01100 0000001 No 83 >protein:vir:2344 Length: 397 # NCBI annotation: gp14 # Family: family:all:507 # MgeID: mge:51 # MgeName: Bxb1 # Cross-refs: genbank:acc:NP_075281;genbank:gi:12657868;genbank:GeneID:920118 Probab=41.60 E-value=0.92 Score=20.78 Aligned_cols=303 Identities=12% Similarity=0.060 Sum_probs=121.7 Q ss_pred cccccccccccc-ccccccc-cceeee-hhhhhhhhhhhhcceeEEecCCcccceeeeeeeeecCCCCccccccccccCC Q lcl|Aclame:pro 69 HGYDPANIAQGV-TTGAVTN-IGPTVM-GMVRRAIPQLIAFDIAGVQPMTGPTSQVFTLRSVYGKDPLTGAEAFHPTRQA 145 (514) Q Consensus 69 ~g~~~~~~~~st-~tg~v~~-~~P~l~-~l~Rra~~~LIa~DI~GVQPmTgPTGLIFAMRSrY~~~~~tg~EA~~~~nEa 145 (514) -|+++.+-.... ++.+... .-|.++ .+++++..+.+-.+++-+.||++++.-|. .... +.++. T Consensus 1 ~g~~~e~~~~~~~~t~~~~g~l~~~~~~~ii~~l~~~s~i~~l~~~~~~~~~~~~ip-----~~~~---~~~a~------ 66 (397) T protein:vir:23 1 MGFSADHSQIAQTKDTMFTGYLDPVQAKDYFAEAEKTSIVQRVAQKIPMGATGIVIP-----HWTG---DVSAQ------ 66 (397) T ss_pred CCcCHHHHHHhhccCCCCccccchhHHHHHHHHHHhccchhhhcceeeccCCceEEE-----EEcC---CcceE------ Confidence 222222211111 1111110 111111 12233334555677888888877642111 1110 11110 Q ss_pred CCccCccccccccccccccccccccccccccccccccccccccccccccccccccCcccccccccccccccccccccccc Q lcl|Aclame:pro 146 DASFSGQAAASTIADFPTTGAATDGTPYKAEVTTSGGDVSMRYFLALGAVTLAVAGQMTATEYTDGVAGGLLVEIDAGMA 225 (514) Q Consensus 146 dt~fSG~~~~~~~~~~~~~~~~t~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~y~~~~Gmt 225 (514) |- T Consensus 67 ---wv--------------------------------------------------------------------------- 68 (397) T protein:vir:23 67 ---WI--------------------------------------------------------------------------- 68 (397) T ss_pred ---Ee--------------------------------------------------------------------------- Confidence 00 Q ss_pred chhhhccccCCCCCCcccccceeEEEEEEEEeecccccccccHHHHHHHHhhcCCChhHHHHHHHHHHHHHHhhHHHHHH Q lcl|Aclame:pro 226 TSQAELQENFNGSSNNEWNEMSFRIDKQVVEAKSRQLKAQYSIELAQDLRAVHGLDADAELSGILANEVMVELNREIVNL 305 (514) Q Consensus 226 Ta~aEal~~~ggs~~~~f~EMsFsIEK~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAEaELanILStEImlEINReii~~ 305 (514) +| +..+++-..+++++++..|..+-.-.+|-||.+|-. .|.+++|.+-|...|...+|+.+|.- T Consensus 69 ---~E---------g~~~~~s~~~f~~v~l~~~k~~~~v~iS~ell~ds~----~~l~~~i~~~l~~aia~~~d~a~l~G 132 (397) T protein:vir:23 69 ---GE---------GDMKPITKGNMTKRDVHPAKIATIFVASAETVRANP----ANYLGTMRTKVATAIAMAFDNAALHG 132 (397) T ss_pred ---cC---------CccccccccceeEEEEeeEEEEEeehhhHHHHhcch----HHHHHHHHHHHHHHHHHHHHHHHhhc Confidence 01 112333344566677777777777789999999863 56799999999999999999999642 Q ss_pred HhhheeecccccccccCCcceeccccccccccchhhHHHHHHHHHHHHHHHHHHHHhcccccccEEEEChhHHhHHhhcc Q lcl|Aclame:pro 306 VNSQAQIGKSGWTQGAGAAGVFDFSDAVDVKGARWAGEAYKALLIQIEKEANEIGRQTGRGNGNFIIASRNVVSALSMTD 385 (514) Q Consensus 306 l~~~a~v~~~~~~~~v~~~g~~dl~~~~d~~~~rwa~e~~r~L~~~i~~~a~~I~~~T~r~~~n~~v~S~~va~~L~~~g 385 (514) --. .. ...+..+..... . -+... ..+..+..+...+.. -....+.+|++++....|...- T Consensus 133 ~gt------~~-----~~~~~~~~~~~~--~--~~~~~---~~~~~~~~~~~~l~~--~~~~~a~~vmn~~~~~~L~~lk 192 (397) T protein:vir:23 133 TNA------PS-----AFQGYLDQSNKT--Q--SISPN---AYQGLGVSGLTKLVT--DGKKWTHTLLDDTVEPVLNGSV 192 (397) T ss_pred ccC------Cc-----ccccccccccce--e--eeccc---chhHHHHHHHHhhhh--cccCCCEEEEcHHHHHHHHHhh Confidence 110 00 000111000000 0 00000 001111222222222 2355677899999998887531 Q ss_pred ccccchhccccCccccccccCceEEEEecCceEEEecCCCccceEEEEEecCCCcccceeecccccccccccc------- Q lcl|Aclame:pro 386 TLVGPAAQGMQDGSMNTDTNQTVFAGVLGGRFKVYIDQYAVNDYFTVGFKGSTEMDAGVFYSPYVPLTPLRGS------- 458 (514) Q Consensus 386 ~~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~kG~~~~~~~~fy~PYv~~~~~~~~------- 458 (514) --..-+.-. .. ...........|+| .+++|+++++.+.+-+ +++.|+-. .+||.- .....++.. T Consensus 193 d~~G~~i~~--~~-~~~~~~~~~~~~tl-~G~Pv~~s~~~~~g~~-~~~~gDfs---~~~i~~-~~~i~i~~~~e~~~~~ 263 (397) T protein:vir:23 193 DANGRPLFV--ES-TYESLTTPFREGRI-LGRPTILSDHVAEGDV-VGYAGDFS---QIIWGQ-VGGLSFDVTDQATLNL 263 (397) T ss_pred ccCCceeec--cc-ccccccccccCcee-eeeeEEEeCCCCCCce-EEEEeecc---eEEEEE-EeceEEEEeeeeeeee Confidence 000000000 00 00000000111455 4669999988774321 12222211 111110 000001111 Q ss_pred --CCc----c-c---cceeeeeeeeee-eecC--ccccccC---cceeecCcchhhhccccceeeeeeeecC Q lcl|Aclame:pro 459 --DSK----N-F---QPVIGFKTRYGV-QVNP--FADPTAS---ATKVGNGAPVAASMGKNAYFRRVFVKGL 514 (514) Q Consensus 459 --dp~----s-~---qp~~~~~tRY~l-~~nP--f~~~~~~---~~~i~~~~~~~~~~~~~~~~r~~~V~~~ 514 (514) |+. + | |=.+=+..|++. ..+| |...... .+... ...+.......|.+++= T Consensus 264 ~~~~~~~~~~lf~~d~v~~ra~~r~d~~v~~~~a~~~~~~~~~~~~~~~------~~~~~~~~~~~~~~~~~ 329 (397) T protein:vir:23 264 GSQESPNFVSLWQHNLVAVRVEAEYGLLINDVNAFVKLTFDPVLTTYAL------DLDGASAGNFTLSLDGK 329 (397) T ss_pred ccccccceeeeeeccceeEEEEeeeccceecccceEEEeeccccceeee------cccccCcceEEEEecCc Confidence 111 0 1 122333455655 3333 3221111 11111 11123344455555443 No 84 >protein:vir:105334 Length: 276 # NCBI annotation: putative phage major capsid protein # Family: family:all:522 # MgeID: mge:1679 # MgeName: PH15 # Cross-refs: genbank:acc:YP_950669;genbank:gi:119967839;genbank:GeneID:4643213 Probab=38.15 E-value=1.1 Score=20.40 Aligned_cols=267 Identities=10% Similarity=0.034 Sum_probs=110.5 Q ss_pred ccccccccccccccccccccccccccccccccccccccc--cc-ccccccCccccccccccccccccccccccccchhhh Q lcl|Aclame:pro 154 AASTIADFPTTGAATDGTPYKAEVTTSGGDVSMRYFLAL--GA-VTLAVAGQMTATEYTDGVAGGLLVEIDAGMATSQAE 230 (514) Q Consensus 154 ~~~~~~~~~~~~~~t~~~~~~~~~~~~~g~~~~~~~~~~--~~-~~~~~~~~~~~~~~~~~~a~~~~y~~~~GmtTa~aE 230 (514) .++. .+...+.. ...++..+.... +. .......... ... + ..+.+.+++.--....+| T Consensus 1 Ma~~--------~T~l~d~i-------~Pev~~~~v~~~~~~~~~~~~~~~~~~--~l~-g-~~G~ti~iP~~~~igda~ 61 (276) T protein:vir:10 1 MAQG--------TTTKSTQI-------VPEVLAPMMQAELDKKLRFAQFADIDS--TLV-G-QPGDTLTFPAFVYSGDAT 61 (276) T ss_pred CCcc--------eeehhhhh-------chHHHHHHHHHHHHhhhhhcccceecc--ccc-C-CCCCEEEeeeecCCCccc Confidence 1100 00000000 000000000000 00 0000000000 000 0 112222322211122333 Q ss_pred ccccCCCCCCcccccceeEEEEEEEEeecccccccccHHHHHHHHhhc-CCChhHHHHHHHHHHHHHHhhHHHHHHHhhh Q lcl|Aclame:pro 231 LQENFNGSSNNEWNEMSFRIDKQVVEAKSRQLKAQYSIELAQDLRAVH-GLDADAELSGILANEVMVELNREIVNLVNSQ 309 (514) Q Consensus 231 al~~~ggs~~~~f~EMsFsIEK~tVtAKSRaLKAEYT~ELAQDLkAiH-GLDAEaELanILStEImlEINReii~~l~~~ 309 (514) ... . +.++..=..+..+.+++.+-|.-.=++| |+-+.. +.|.-.|..+-++.-|..+++.+++..+... T Consensus 62 ~~~---e--g~~i~~~~lt~~~~~a~i~~~~k~~~~t-----D~a~~~~~~dp~~~~~~~~~~~~a~~~d~~~~~~l~~~ 131 (276) T protein:vir:10 62 VVP---E--GQKIPVDKIETNRREAKIHKIGKGTDIT-----DEALLSGYGDPQGEAVRQHGLAIANKVDNDVLEALRGT 131 (276) T ss_pred ccc---C--CCccCccccccceeeEEeehcccccccc-----HHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHhcc Confidence 221 1 2223333344455555555554333333 333332 6799999999999999999999998776432 Q ss_pred eeecccccccccCCcceeccccccccccchhhHHHHHHHHHHHHHHHHHHHHhcccccccEEEEChhHHhHHhh---ccc Q lcl|Aclame:pro 310 AQIGKSGWTQGAGAAGVFDFSDAVDVKGARWAGEAYKALLIQIEKEANEIGRQTGRGNGNFIIASRNVVSALSM---TDT 386 (514) Q Consensus 310 a~v~~~~~~~~v~~~g~~dl~~~~d~~~~rwa~e~~r~L~~~i~~~a~~I~~~T~r~~~n~~v~S~~va~~L~~---~g~ 386 (514) .. .+ +.+.+.+ +.+-....++.++ -.+.++++|+|++++.|.. ..+ T Consensus 132 ~~--------~~-~~~~~t~-------------d~i~~A~~~lgd~---------~~~~~~ivv~p~~~~~L~k~~~~~f 180 (276) T protein:vir:10 132 KL--------TV-SADIGTL-------------AGLEAAIDTFDDE---------DLEPMVLFINPKDAGKLRSSASDNF 180 (276) T ss_pred cc--------cc-cccccCH-------------HHHHHHHHHhccc---------cCcccEEEEcHHHHHHHHHhccccc Confidence 11 01 1112221 1222222223222 2567899999999988854 344 Q ss_pred cccchhccccCccccccccCceEEEEecCceEEEecCCCccceEEEEEecCCCcccceeecccccccccc-ccCCccccc Q lcl|Aclame:pro 387 LVGPAAQGMQDGSMNTDTNQTVFAGVLGGRFKVYIDQYAVNDYFTVGFKGSTEMDAGVFYSPYVPLTPLR-GSDSKNFQP 465 (514) Q Consensus 387 ~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~kG~~~~~~~~fy~PYv~~~~~~-~~dp~s~qp 465 (514) ...+.. + .+...+-..|++. |++|++|...+..-..+--+|.-.+ +.. -+. .++ ..|++.++= T Consensus 181 ~~~s~~-g-------~~~~~~G~ig~~~-G~~Vi~s~~~p~~t~~l~~~gAi~~----~~~--~~~-~vE~dRd~~~~~d 244 (276) T protein:vir:10 181 TRATEL-G-------DNIIVKGAFGEAL-GAVIVRSKKLDEGEAILAKRGAVKL----ITK--RDF-FLETDRDPSTKTT 244 (276) T ss_pred cccccc-c-------ccceeccccceec-ceeEEEcCCCCcceEEEEeccceee----eec--CCc-eeecccchhhccc Confidence 321110 0 0111111235553 6799999998753322221222221 111 111 122 359999999 Q ss_pred eeeeeeeeee-eecCccccccCcceeecCcchhhhccc Q lcl|Aclame:pro 466 VIGFKTRYGV-QVNPFADPTASATKVGNGAPVAASMGK 502 (514) Q Consensus 466 ~~~~~tRY~l-~~nPf~~~~~~~~~i~~~~~~~~~~~~ 502 (514) .|-...+||+ ..||=.. .++.-+. +..-.|. T Consensus 245 ~i~~~~~y~~~~~~~~~v-----v~~t~~~-~~~~~~~ 276 (276) T protein:vir:10 245 ALYSDKHYVAYLYDESKA-----VKVTKGA-GTTDSGA 276 (276) T ss_pred EEEEeeEEEEEEEcCcce-----EEEecCC-cCCcCCC Confidence 9988899988 4455110 1111111 1111111 No 85 >protein:vir:104256 Length: 458 # NCBI annotation: major head protein precursor # Family: family:all:27070 # MgeID: mge:1504 # MgeName: T5 # Cross-refs: genbank:acc:YP_006977;genbank:gi:46401878;genbank:GeneID:2777673 Probab=32.86 E-value=1.4 Score=19.79 Aligned_cols=337 Identities=11% Similarity=0.129 Sum_probs=116.6 Q ss_pred Ccchhhhhhhhccccc----------cccccccchhhhhhhhhhhhHHHHHHhcccccchhhhhhhcccccccccccccc Q lcl|Aclame:pro 1 MNLTEKWKDLLEAEGA----------DMPEIATATKQKIMSKIFENQDRDINNDPMYRDPQLVEAFNAGLNEAVVNGDHG 70 (514) Q Consensus 1 ~~l~~kw~p~l~~~~~----------~~~~i~~~~~~~~~~~~~enq~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~g 70 (514) -...++...-+..+.. ..+.+....++... .-.+|-++++.. +.+. ...... ...+.. ..... T Consensus 85 a~~~e~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~-~~~~~~~~~~e~-~~~~-~~~~~~---~~~~~~-~~~~~ 157 (458) T protein:vir:10 85 AQTVEKQQETIVGLQDEIKSLLTAREGRSFVGDSVAKALY-GTQENFEDEVEK-LVLL-SYVMEK---GVFETE-HGQRH 157 (458) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhccch-hhhhhHHHHHHH-HHHH-HHHHhh---ccchhh-hhhhh Confidence 1111111110000000 00000000000000 000000001000 0000 000000 000000 00000 Q ss_pred cccccccccccc--c--cc-cccceeeehhhhhhhhhhhhcceeEEecCCcccceeeeeeeeecCCCCccccccccccCC Q lcl|Aclame:pro 71 YDPANIAQGVTT--G--AV-TNIGPTVMGMVRRAIPQLIAFDIAGVQPMTGPTSQVFTLRSVYGKDPLTGAEAFHPTRQA 145 (514) Q Consensus 71 ~~~~~~~~st~t--g--~v-~~~~P~l~~l~Rra~~~LIa~DI~GVQPmTgPTGLIFAMRSrY~~~~~tg~EA~~~~nEa 145 (514) ..+. ..+++. | .+ ..+.+-+ +.++.+..+..++|-++||+++..-++ .. . .+. T Consensus 158 ~~a~--~~~~~~~~g~~~ip~~~~~~i---i~~~~~~~~l~~~~~~~~~~~~~~~~~-~~----~---~~~--------- 215 (458) T protein:vir:10 158 LKAV--NQSSSVEVSSESYETIFSQRI---IRDLQKELVVGALFEELPMSSKILTML-VE----P---DAG--------- 215 (458) T ss_pred hhhh--hhcccCccccceehhhHhHHH---HHHHHhhhhHHhhcceeecCCcceEEE-Ee----c---CCc--------- Confidence 0000 000000 0 11 1122223 334446667889999999988642221 11 0 010 Q ss_pred CCccCccccccccccccccccccccccccccccccccccccccccccccccccccCcccccccccccccccccccccccc Q lcl|Aclame:pro 146 DASFSGQAAASTIADFPTTGAATDGTPYKAEVTTSGGDVSMRYFLALGAVTLAVAGQMTATEYTDGVAGGLLVEIDAGMA 225 (514) Q Consensus 146 dt~fSG~~~~~~~~~~~~~~~~t~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~y~~~~Gmt 225 (514) .+.|-+.+.... . T Consensus 216 ~a~~v~e~~~~~-------------------------------------------------------------------~ 228 (458) T protein:vir:10 216 KATWVAASTYGT-------------------------------------------------------------------D 228 (458) T ss_pred ceeecccccccc-------------------------------------------------------------------c Confidence 111111000000 0 Q ss_pred chhhhccccCCCCCCcccccceeEEEEEEEEeecccccccccHHHHHHHHhhcCCChhHHHHHHHHHHHHHHhhHHHHHH Q lcl|Aclame:pro 226 TSQAELQENFNGSSNNEWNEMSFRIDKQVVEAKSRQLKAQYSIELAQDLRAVHGLDADAELSGILANEVMVELNREIVNL 305 (514) Q Consensus 226 Ta~aEal~~~ggs~~~~f~EMsFsIEK~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAEaELanILStEImlEINReii~~ 305 (514) +.. ..... .+++++++.++.-+....+|-||.+|-- .|.+++|.+-|+.-|..-||+.||. T Consensus 229 ~~~-------~~~~~-------~~~~~i~~~~~k~~~~v~is~ell~ds~----~~~~~~i~~~l~~~i~~~~d~~~l~- 289 (458) T protein:vir:10 229 TTT-------GEEVK-------GALKEIHFSTYKLAAKSFITDETEEDAI----FSLLPLLRKRLIEAHAVSIEEAFMT- 289 (458) T ss_pred ccc-------ccccc-------ccceeeEeeeeeEEeeehhhHHHHhcch----HHHHHHHHHHHHHHHHHHHHHHhhc- Confidence 000 00111 2234455555555556779999998843 4568889999999999999998843 Q ss_pred HhhheeecccccccccCCcceecccccc------ccccc---hhhHHHHHHHHHHHHHHHHHHHHhcccccccEEEEChh Q lcl|Aclame:pro 306 VNSQAQIGKSGWTQGAGAAGVFDFSDAV------DVKGA---RWAGEAYKALLIQIEKEANEIGRQTGRGNGNFIIASRN 376 (514) Q Consensus 306 l~~~a~v~~~~~~~~v~~~g~~dl~~~~------d~~~~---rwa~e~~r~L~~~i~~~a~~I~~~T~r~~~n~~v~S~~ 376 (514) -.-+ + .+.|++...... ...+. -...+....|+..+.. ........||+|. T Consensus 290 --G~G~----~-----~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~l~~---------~~~~~~~~v~~~~ 349 (458) T protein:vir:10 290 --GDGS----G-----KPKGLLTLASEDSAKVVTEAKADGSVLVTAKTISKLRRKLGR---------HGLKLSKLVLIVS 349 (458) T ss_pred --CCCC----C-----ccceeeecccccccceeecccccccccccHHHHHHHHHhhhh---------hhcCCCEEEEcHH Confidence 1111 0 122332221100 00000 0011222333333221 1224457899999 Q ss_pred HHhHHhhccccccchhccccCccccccccCceEEEEecCceEEEecCCCcc-----ceEEEEEecCCCcccceeeccccc Q lcl|Aclame:pro 377 VVSALSMTDTLVGPAAQGMQDGSMNTDTNQTVFAGVLGGRFKVYIDQYAVN-----DYFTVGFKGSTEMDAGVFYSPYVP 451 (514) Q Consensus 377 va~~L~~~g~~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~-----dy~~vG~kG~~~~~~~~fy~PYv~ 451 (514) ....|...---.+.+...........+.++ ++|. |++|+++.+.|. +.++..++ + =|+. T Consensus 350 ~~~~l~~lkd~~G~~i~~~~~~~~~~~~~~----~~l~-G~pv~~~~~~p~~~~~~~~~~~~f~-~----------~~~~ 413 (458) T protein:vir:10 350 MDAYYDLLEDEEWQDVAQVGNDSVKLQGQV----GRIY-GLPVVVSEYFPAKANSAEFAVIVYK-D----------NFVM 413 (458) T ss_pred HHHHHHhhcccCCceeeccccccccccCcC----ceec-ceeeEEccccccccCCcceEEEEec-c----------cEEE Confidence 887776421100000000000000011111 3455 579999988764 22222121 1 0111 Q ss_pred c--cccc-ccCCccccceeeee--eeeee-eecC--ccccccCcc Q lcl|Aclame:pro 452 L--TPLR-GSDSKNFQPVIGFK--TRYGV-QVNP--FADPTASAT 488 (514) Q Consensus 452 ~--~~~~-~~dp~s~qp~~~~~--tRY~l-~~nP--f~~~~~~~~ 488 (514) . ..+. ..||-+-...++|. .|+|+ +.+| |...+--.+ T Consensus 414 ~~~~~~~v~~d~~~~~~~~~~~~~~r~~~~v~~~~a~v~~~~aa~ 458 (458) T protein:vir:10 414 PRQRAVTVERERQAGKQRDAYYVTQRVNLQRYFANGVVSGTYAAS 458 (458) T ss_pred EEeeceEEEeecccCCCceEEEEEEEecceEecccceEEEeeccC Confidence 1 0011 13544435556665 46654 4455 322211111 No 86 >protein:vir:104085 Length: 320 # NCBI annotation: gp17 # Family: family:all:507 # MgeID: mge:1656 # MgeName: Che12 # Cross-refs: genbank:acc:YP_655596;genbank:gi:109392467;genbank:GeneID:4156953 Probab=32.36 E-value=1.4 Score=19.73 Aligned_cols=294 Identities=14% Similarity=0.037 Sum_probs=103.8 Q ss_pred ccCccccccccccccccccccccccccccccccccccccccccccccccccccCccccccccccccccccccccccccch Q lcl|Aclame:pro 148 SFSGQAAASTIADFPTTGAATDGTPYKAEVTTSGGDVSMRYFLALGAVTLAVAGQMTATEYTDGVAGGLLVEIDAGMATS 227 (514) Q Consensus 148 ~fSG~~~~~~~~~~~~~~~~t~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~y~~~~GmtTa 227 (514) ..-|.............+....+.. ........+... ...........................+.+-..- T Consensus 1 ~~~~~~~~~~~~~~~~t~~~~~~~~---ip~~~~~~ii~~------~~~~s~l~~~~~~~~~~~~~~~~p~~~~~~~a~~ 71 (320) T protein:vir:10 1 MAAGTAFQVDHAQIAQTGDTMFKGY---LEPEQAKDYFAE------AEKTSIVQQFAQKVPMGTTGQKIPHWIGDVSAQW 71 (320) T ss_pred CCCCccCCHHHHHhhcccccccccc---ccHHHHHHHHHH------HHhccchhhhcceeeccCCceEEEEEeCCcceEE Confidence 2222211100000000000000000 000000000000 0000000000000000000000000001111111 Q ss_pred hhhccccCCCCCCcccccceeEEEEEEEEeecccccccccHHHHHHHHhhcCCChhHHHHHHHHHHHHHHhhHHHHHHHh Q lcl|Aclame:pro 228 QAELQENFNGSSNNEWNEMSFRIDKQVVEAKSRQLKAQYSIELAQDLRAVHGLDADAELSGILANEVMVELNREIVNLVN 307 (514) Q Consensus 228 ~aEal~~~ggs~~~~f~EMsFsIEK~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAEaELanILStEImlEINReii~~l~ 307 (514) .+| +..+++-..+++++++..|.......+|.||.+|-. .|.++.|.+.|...|...+|+.++. T Consensus 72 v~E---------~~~~~~~~~~f~~v~~~~~k~~~~~~is~ell~ds~----~~l~~~i~~~l~~a~a~~~d~a~l~--- 135 (320) T protein:vir:10 72 IGE---------GDMKPITKGNMTSQNIAPHKIATIFVASAETVRANP----ANYLGTMRTKVATAFAMAFDSAALN--- 135 (320) T ss_pred ecC---------CccccccccceeEEEEeeEEEEEeehhhHHHHhcCh----HHHHHHHHHHHHHHHHHHHHHHhhc--- Confidence 112 345666667778888888888888999999999865 4678899999999999999998842 Q ss_pred hheeecccc---ccc--ccCCcceeccccccccccchhhHHHHHHHHHHHHHHHHHHHHhcccccccEEEEChhHHhHHh Q lcl|Aclame:pro 308 SQAQIGKSG---WTQ--GAGAAGVFDFSDAVDVKGARWAGEAYKALLIQIEKEANEIGRQTGRGNGNFIIASRNVVSALS 382 (514) Q Consensus 308 ~~a~v~~~~---~~~--~v~~~g~~dl~~~~d~~~~rwa~e~~r~L~~~i~~~a~~I~~~T~r~~~n~~v~S~~va~~L~ 382 (514) -.-.....+ ..+ ++...+.....+ -+..+ .+ +..+...+ .........+||+|+....|. T Consensus 136 G~g~~~~~~~~~~~~~~~~~~~~~~~~~~-------~~~~~---~~---~~~~~~~~--~~~~~~~~~~v~n~~~~~~L~ 200 (320) T protein:vir:10 136 GTDSPFPTYLAQTTKSVSLADPGGATASD-------LTAYD---AV---AVNGLSLL--VNAKKKWTHTLLDDIVEPILN 200 (320) T ss_pred ccCCCCCcccccccccccceecccccccc-------cccHH---HH---HHHHHhhh--hcccCCCcEEEEcHHHHHHHH Confidence 111100000 000 001111111000 00111 11 11111112 222345668899999998887 Q ss_pred hccccccchhccccCccccccccCceEEEEecCceEEEecCCCccce----------EEEEEecCCCcccceeecccccc Q lcl|Aclame:pro 383 MTDTLVGPAAQGMQDGSMNTDTNQTVFAGVLGGRFKVYIDQYAVNDY----------FTVGFKGSTEMDAGVFYSPYVPL 452 (514) Q Consensus 383 ~~g~~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy----------~~vG~kG~~~~~~~~fy~PYv~~ 452 (514) ...--...+.- .+. ...........+++ .+++|+++++.+.+= +++|..+.-+++-+ .+. T Consensus 201 ~lkd~~G~~l~--~~~-~~~~~~~~~~~~~i-~g~pv~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~i~~~------~~~ 270 (320) T protein:vir:10 201 GAKDKNGRPLF--IES-TYTDENSPFRAGRI-VSRPTILSDHVADGTTVGYMGDFRNVIWGQVGGLSFDVT------DQA 270 (320) T ss_pred HhhccCCceee--ccc-cccCccccccCcee-eeeeeEecCCCCCCceEEEEeecceEEEEEecCeEEEEe------ecc Confidence 42110000000 000 00011111111333 366899998877542 22222222211100 000 Q ss_pred ccccccCCcc-----cc---ceeeeeeeeee-eecC--ccccccCcceeecCcchh Q lcl|Aclame:pro 453 TPLRGSDSKN-----FQ---PVIGFKTRYGV-QVNP--FADPTASATKVGNGAPVA 497 (514) Q Consensus 453 ~~~~~~dp~s-----~q---p~~~~~tRY~l-~~nP--f~~~~~~~~~i~~~~~~~ 497 (514) ......|+.. || =.+=...|++. ..+| |+..+.-. .|.+ T Consensus 271 ~~~~~~~~~~~~~~~f~~~~~~~r~~~~~d~~v~~~~a~~~l~~~~------ap~~ 320 (320) T protein:vir:10 271 TLNLGTPTEPNFVSLWQHNLVAVRVEAEYAFHNNDKDAFVKLTNVV------TPDA 320 (320) T ss_pred eeeeccccccccchhhhcCcEEEEEEEeeccEEecccceEEEEecc------CCCC Confidence 0000011111 11 12223356665 3444 33322111 1111 No 87 >protein:vir:94622 Length: 341 # NCBI annotation: PfWMP4_37 # Family: family:all:2203 # MgeID: mge:1525 # MgeName: Pf-WMP4 # Cross-refs: genbank:acc:YP_762667;genbank:gi:115304375;genbank:GeneID:5142322 Probab=30.47 E-value=1.6 Score=19.50 Aligned_cols=288 Identities=11% Similarity=0.052 Sum_probs=97.6 Q ss_pred CccccccccccCCCCccCccccccccccccccccccccccccccccccccccccccccccccccccccCcccccccc--- Q lcl|Aclame:pro 133 LTGAEAFHPTRQADASFSGQAAASTIADFPTTGAATDGTPYKAEVTTSGGDVSMRYFLALGAVTLAVAGQMTATEYT--- 209 (514) Q Consensus 133 ~tg~EA~~~~nEadt~fSG~~~~~~~~~~~~~~~~t~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~--- 209 (514) ++.. | ++.|..-.. ....... +....+.+..... ...+..... ...+.... T Consensus 1 ~~~~---------~-~~~~~~~~t----------~~v~~fi---pei~s~~i~~~l~--~~~v~~~~~-~d~~~~~~~Gd 54 (341) T protein:vir:94 1 MALG---------N-TITGPSINT----------QRGQQFI---PEQWLSEVQMFRK--AKMLDTSVV-KTWGAQVKKGD 54 (341) T ss_pred Ccch---------h-hhccccccc----------hhHHHHH---HHHHHHHHHHHHH--hhcchhhcc-ccccccccCCc Confidence 1110 0 111110000 0000000 0000000000000 000000000 00000000 Q ss_pred ----cccccccc--ccccccccchhhhccccCCCCCCcccccceeEEEEEEEEeecccccccccHHHHHHHHhhcCCChh Q lcl|Aclame:pro 210 ----DGVAGGLL--VEIDAGMATSQAELQENFNGSSNNEWNEMSFRIDKQVVEAKSRQLKAQYSIELAQDLRAVHGLDAD 283 (514) Q Consensus 210 ----~~~a~~~~--y~~~~GmtTa~aEal~~~ggs~~~~f~EMsFsIEK~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAE 283 (514) ........ |..+..++ .+ +..-.+..++|||...-+- .+ +-+|..| . | .|-- T Consensus 55 tv~ip~~g~~~~~d~~~~~~i~---~~---------~~~~~~~~itiD~~~~~~~--~i---~d~d~~~---~-~-~d~~ 112 (341) T protein:vir:94 55 TFHVPRISELGVEDKATDVPVG---VQ---------PVNDTDFVITVDTDRTTAV--AL---DDLLEIQ---A-S-YDLR 112 (341) T ss_pred eEEEeccCcceeeeecCCCccc---cc---------cccCceEEEEEeeeeecce--ee---chHHHHh---h-c-cchH Confidence 00000000 11011110 01 1112455677777543322 00 1223222 2 3 5778 Q ss_pred HHHHHHHHHHHHHHhhHHHHHHHhhheeecccccccccCCcceeccccccccccchhhHHHHHHHHHHHHHHHHHHHHhc Q lcl|Aclame:pro 284 AELSGILANEVMVELNREIVNLVNSQAQIGKSGWTQGAGAAGVFDFSDAVDVKGARWAGEAYKALLIQIEKEANEIGRQT 363 (514) Q Consensus 284 aELanILStEImlEINReii~~l~~~a~v~~~~~~~~v~~~g~~dl~~~~d~~~~rwa~e~~r~L~~~i~~~a~~I~~~T 363 (514) .|+..-....++.+++++|+..+-..+. + ...++... .+.....+ +....-+.+..+...+++.. T Consensus 113 ~~~~~~~~~aLA~~~D~~i~~~~a~~~~--~--~~~~~~~~--~~~~~t~~--~~~~~~~~i~~a~~~Lde~~------- 177 (341) T protein:vir:94 113 APYLEAMGYALAKDMTGSILGLRAAVQN--T--ASQNVFSS--SNGAITGN--GQAFSFAVFLAARRLLLEAD------- 177 (341) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhhhccc--c--ccCccccC--ccccccCc--hhhhhHHHHHHHHHHHhhcC------- Confidence 8888888899999999999776632221 0 00011110 01110011 11122234444444444321 Q ss_pred ccccccEEEEChhHHhHHhhccccccchhccccCccccccccCceEEEEecC--ceEEEecCCCccceEEEEEe------ Q lcl|Aclame:pro 364 GRGNGNFIIASRNVVSALSMTDTLVGPAAQGMQDGSMNTDTNQTVFAGVLGG--RFKVYIDQYAVNDYFTVGFK------ 435 (514) Q Consensus 364 ~r~~~n~~v~S~~va~~L~~~g~~~~~~~~~~~~~~~~~d~~~~~~~G~l~~--~~~vy~D~y~~~dy~~vG~k------ 435 (514) ---.|-|+|++|++...|-...-+......+ +.+ +.-|.+.. |+.||..++-|.+-.. +++ T Consensus 178 VP~~gR~lvv~P~~~~~Ll~~~~~~~~~~~g------~~~----l~~G~ig~i~G~~V~~Sn~lp~~~~~-~~~~~~~~~ 246 (341) T protein:vir:94 178 VPEEKIVLLISPGQESALFTIPQFISKDFIN------NAP----IAQGQIGSLMGVRVIRTSLIGNNSAT-GWRNGAPTI 246 (341) T ss_pred CCccCCEEEeCHHHHHHHhhchhhhhhhccc------cch----hheeeeeeEeceEEEEeccccccccc-cccccccce Confidence 1235789999999998887655443221111 111 22244444 8899999988753211 111 Q ss_pred -------------------cCCCcccceeeccccccccccccCCcccccee-----------------eeeeeeeeeecC Q lcl|Aclame:pro 436 -------------------GSTEMDAGVFYSPYVPLTPLRGSDSKNFQPVI-----------------GFKTRYGVQVNP 479 (514) Q Consensus 436 -------------------G~~~~~~~~fy~PYv~~~~~~~~dp~s~qp~~-----------------~~~tRY~l~~nP 479 (514) +......||++.+.- .-..+..||+.++... .+..||++-+-+ T Consensus 247 ~~~~~~~~i~~~~~~~~~~~~~~~~~gl~~~~~a-v~~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~G~~~ 325 (341) T protein:vir:94 247 APAEATPGFTGSRYLPKQDSFTSLPATFTGNSRP-VHTAVMCHMDWAAAVVSKAPRVTQSFENREQVWLMVGRQAYGARL 325 (341) T ss_pred ecccccccccccccccccccccccEEEEEEeccc-ccceeeecchhhhccccccccccccchhhhhhhhhhhhhhhcccc Confidence 111122333333211 1112223444333321 122333333333 Q ss_pred ccccccCcceeecCcchh Q lcl|Aclame:pro 480 FADPTASATKVGNGAPVA 497 (514) Q Consensus 480 f~~~~~~~~~i~~~~~~~ 497 (514) +.+... .-+..+.+-. T Consensus 326 lrp~~~--v~~~~~~~~~ 341 (341) T protein:vir:94 326 YRPLHA--VNIHTTGDTV 341 (341) T ss_pred cCccee--EEEecCcCCC Confidence 222110 0000000000 No 88 >protein:vir:103955 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1662 # MgeName: phiNM # Cross-refs: genbank:acc:YP_873992;genbank:gi:118430767;genbank:GeneID:4525449 Probab=30.46 E-value=1.6 Score=19.50 Aligned_cols=302 Identities=11% Similarity=0.038 Sum_probs=118.6 Q ss_pred cccchhhhhhhhhhhhHHHHHHhcccccchhhhhhhccccccccccccccccccccccccccccccccceeeehhhhhhh Q lcl|Aclame:pro 21 IATATKQKIMSKIFENQDRDINNDPMYRDPQLVEAFNAGLNEAVVNGDHGYDPANIAQGVTTGAVTNIGPTVMGMVRRAI 100 (514) Q Consensus 21 i~~~~~~~~~~~~~enq~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~g~~~~~~~~st~tg~v~~~~P~l~~l~Rra~ 100 (514) ++..++.+ .|.-++..... .+..+. ++ +. .++.+++..--..+.=.+++.+. T Consensus 1 ~~~~~~~~------~~~~~f~~~~~----------~~~~~~-a~----------~~-~~~~~~~~liP~~~~~~ii~~~~ 52 (324) T protein:vir:10 1 MEQTQKLK------LNLQHFASNNV----------KPQVFN-PD----------NV-MMHEKKDGTLLNDFTTPILQEVM 52 (324) T ss_pred CCCchHHH------HHHHHHHHHhh----------ccceec-cc----------ce-eccCCCcceechhHHHHHHHHHH Confidence 11111111 01111110000 000000 00 00 01111110000001111233344 Q ss_pred hhhhhcceeEEecCCcccceeeeeeeeecCCCCccccccccccCCCCccCcccccccccccccccccccccccccccccc Q lcl|Aclame:pro 101 PQLIAFDIAGVQPMTGPTSQVFTLRSVYGKDPLTGAEAFHPTRQADASFSGQAAASTIADFPTTGAATDGTPYKAEVTTS 180 (514) Q Consensus 101 ~~LIa~DI~GVQPmTgPTGLIFAMRSrY~~~~~tg~EA~~~~nEadt~fSG~~~~~~~~~~~~~~~~t~~~~~~~~~~~~ 180 (514) .+.+-.+++-+.||++.+.-|. +... +.+|- |- T Consensus 53 ~~s~l~~~~~~~~~~~~~~~~p----~~~~----~~~a~---------~v------------------------------ 85 (324) T protein:vir:10 53 ENSKIMQLGKYEPMEGTEKKFT----FWAD----KPGAY---------WV------------------------------ 85 (324) T ss_pred hhchhhhhcceeeccCCceEEE----EEeC----Cccee---------Ee------------------------------ Confidence 5556778888999887653221 1000 00110 00 Q ss_pred ccccccccccccccccccccCccccccccccccccccccccccccchhhhccccCCCCCCcccccceeEEEEEEEEeecc Q lcl|Aclame:pro 181 GGDVSMRYFLALGAVTLAVAGQMTATEYTDGVAGGLLVEIDAGMATSQAELQENFNGSSNNEWNEMSFRIDKQVVEAKSR 260 (514) Q Consensus 181 ~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~y~~~~GmtTa~aEal~~~ggs~~~~f~EMsFsIEK~tVtAKSR 260 (514) +| +..+++...+++++++..|.. T Consensus 86 ------------------------------------------------~E---------g~~~~~~~~~~~~v~~~~~k~ 108 (324) T protein:vir:10 86 ------------------------------------------------GE---------GQKIETSKATWVNATMRAFKL 108 (324) T ss_pred ------------------------------------------------cc---------CccccccccceeEEEEeeEEE Confidence 01 122344445566677777777 Q ss_pred cccccccHHHHHHHHhhcCCChhHHHHHHHHHHHHHHhhHHHHHHHhhheeecccccccccCCcceeccccccccccchh Q lcl|Aclame:pro 261 QLKAQYSIELAQDLRAVHGLDADAELSGILANEVMVELNREIVNLVNSQAQIGKSGWTQGAGAAGVFDFSDAVDVKGARW 340 (514) Q Consensus 261 aLKAEYT~ELAQDLkAiHGLDAEaELanILStEImlEINReii~~l~~~a~v~~~~~~~~v~~~g~~dl~~~~d~~~~rw 340 (514) +..-..|-||.+|-. .|.+++|.+.|+..|...+++.+|.- ..+ . ..+.|+++........ . T Consensus 109 ~~~~~iS~ell~ds~----~~l~~~i~~~l~~ai~~~~d~a~l~G---~g~---~-----~~~~~i~~~~~~~~~~---~ 170 (324) T protein:vir:10 109 GVILPVTKEFLNYTY----SQFFEEMKPMIAEAFYKKFDEAGILN---QGN---N-----PFGKSIAQSIEKTNKV---I 170 (324) T ss_pred EEeehhhHHHHhcch----HHHHHHHHHHHHHHHHHHHHHHhhhc---CCC---C-----ccCcccccccccccee---c Confidence 777779999999864 46799999999999999999999532 211 0 0111221110000000 0 Q ss_pred hHHHHHHHHHHHHHHHHHHHHhcccccccEEEEChhHHhHHhhccccccchhccccCccccccccCceEEEEecCceEEE Q lcl|Aclame:pro 341 AGEAYKALLIQIEKEANEIGRQTGRGNGNFIIASRNVVSALSMTDTLVGPAAQGMQDGSMNTDTNQTVFAGVLGGRFKVY 420 (514) Q Consensus 341 a~e~~r~L~~~i~~~a~~I~~~T~r~~~n~~v~S~~va~~L~~~g~~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy 420 (514) ...-.+..|.++.+.|. ......+.+|++|.....|....- ..|. ..-.+..+ ++|.| ++|+ T Consensus 171 ---~~~~t~~~i~~~~~~l~--~~~~~~~~~v~n~~~~~~L~~l~d-----~~g~---~~~~~~~~----~~l~G-~PV~ 232 (324) T protein:vir:10 171 ---KGDFTQDNIIDLEALLE--DDELEANAFISKTQNRSLLRKIVD-----PETK---ERIYDRNS----DTLDG-LPVV 232 (324) T ss_pred ---cccCCHHHHHHHHHhhh--hccCCCCEEEEcHHHHHHHHHhhc-----cCCc---eeecCCCC----ccccc-eeEE Confidence 00001222333333332 223566678999999988875311 1111 11111122 34444 4888 Q ss_pred ecCCCcc--ceEEEE--------EecCCCcccceeeccccccccccccCCc--------cccceeeeeeeeee-eecC-- Q lcl|Aclame:pro 421 IDQYAVN--DYFTVG--------FKGSTEMDAGVFYSPYVPLTPLRGSDSK--------NFQPVIGFKTRYGV-QVNP-- 479 (514) Q Consensus 421 ~D~y~~~--dy~~vG--------~kG~~~~~~~~fy~PYv~~~~~~~~dp~--------s~qp~~~~~tRY~l-~~nP-- 479 (514) +.+.... ..+++| ..++-..+ .....-+. ...|+. +-+=.+=...||+. ..+| T Consensus 233 ~~~~~~~~~~~~~~gd~~~~~~~~~~~~~i~----~~~~~~~~--~~~~~~~~~~~~~~~~~~~~r~~~r~d~~v~~~~A 306 (324) T protein:vir:10 233 NLKSSNLKRGELITGDFDKLIYGIPQLIEYK----IDETAQLS--TVKNEDGTPVNLFEQDMVALRATMHVALHIADDKA 306 (324) T ss_pred eecCCCCCcceEEEEecccEEEEEecCcEEE----Eeeccccc--ccccccccchhhhhcCcEEEEEEEEEccEEecccc Confidence 8776543 223333 22211110 00000000 001111 11223333457775 3445 Q ss_pred ccccccCcceeecCcchh-hhcccc Q lcl|Aclame:pro 480 FADPTASATKVGNGAPVA-ASMGKN 503 (514) Q Consensus 480 f~~~~~~~~~i~~~~~~~-~~~~~~ 503 (514) |+..+. ..... ..+++= T Consensus 307 ~~~l~~-------a~~~~~~~~~~~ 324 (324) T protein:vir:10 307 FAKLVP-------ADKKTDSVPGEV 324 (324) T ss_pred eEEEEe-------ccCCCCCCCCCC Confidence 432211 00000 111211 No 89 >protein:vir:3845 Length: 395 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:322 # MgeName: phi adh # Cross-refs: genbank:acc:NP_050151;swissprot:trembl:q9t1f6;genbank:gi:9633043;uniprot:Q9T1F6;genbank:GeneID:1262163 Probab=30.28 E-value=1.6 Score=19.48 Aligned_cols=329 Identities=14% Similarity=0.109 Sum_probs=116.1 Q ss_pred Ccchhhhhhhhccccccccccccch-hhhhhhhhhhhHHHHHHhccc-ccchh------hhhhh-ccccccccccccccc Q lcl|Aclame:pro 1 MNLTEKWKDLLEAEGADMPEIATAT-KQKIMSKIFENQDRDINNDPM-YRDPQ------LVEAF-NAGLNEAVVNGDHGY 71 (514) Q Consensus 1 ~~l~~kw~p~l~~~~~~~~~i~~~~-~~~~~~~~~enq~~~~~~~~~-~~~~~------~~~~~-~~~~~~a~~~~~~g~ 71 (514) ....++..-+.+ +|.... ++.......+.+......... -.... ....+ ..++... ... T Consensus 37 ~~~~ee~~~l~~-------~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~~~ 104 (395) T protein:vir:38 37 SHSVDDINKLNA-------SLKNAKMAQELAKSAYEDARANLNAEPVNKKPLPVKDGKPDAQAMKNQFVKDF-----KNL 104 (395) T ss_pred HHHHHHHHHHHH-------HHHHHHHHHHHHHHHHHHHHhhhhhccccccccchhhhhHHHHHHHHHHHHHH-----HHH Confidence 111111111110 010000 011111111111111111000 00000 00000 0000000 000 Q ss_pred cccccccccccccc---cccceeeehhhhhhhhhhhhcceeEEecCCcccceeeeeeeeecCCCCccccccccccCCCCc Q lcl|Aclame:pro 72 DPANIAQGVTTGAV---TNIGPTVMGMVRRAIPQLIAFDIAGVQPMTGPTSQVFTLRSVYGKDPLTGAEAFHPTRQADAS 148 (514) Q Consensus 72 ~~~~~~~st~tg~v---~~~~P~l~~l~Rra~~~LIa~DI~GVQPmTgPTGLIFAMRSrY~~~~~tg~EA~~~~nEadt~ 148 (514) . .....++++|.+ ..+.+- +++.+....+..++|.++||++++|-+--. +-.+. +. .+. T Consensus 105 ~-~~~~~~~~~gg~~vP~~~~~~---ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~--~~~~~---~~---------~a~ 166 (395) T protein:vir:38 105 V-TSGTTGTGNAGLTIPEDIQLQ---IRTLTRSFTSLESLANVENVTTSHGSRVYE--KLADI---TP---------LKD 166 (395) T ss_pred H-hhccCccCCCceecchhHhhH---HHHHHHhhcchhhhcceeeccCCcceEEEE--eeccC---Cc---------ccc Confidence 0 000001111211 122222 344444566778899999999998854111 10000 00 000 Q ss_pred cCccccccccccccccccccccccccccccccccccccccccccccccccccCccccccccccccccccccccccccchh Q lcl|Aclame:pro 149 FSGQAAASTIADFPTTGAATDGTPYKAEVTTSGGDVSMRYFLALGAVTLAVAGQMTATEYTDGVAGGLLVEIDAGMATSQ 228 (514) Q Consensus 149 fSG~~~~~~~~~~~~~~~~t~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~y~~~~GmtTa~ 228 (514) |- +.+ T Consensus 167 ~v----------------------------------------------------------------------~E~----- 171 (395) T protein:vir:38 167 LD----------------------------------------------------------------------DES----- 171 (395) T ss_pred cc----------------------------------------------------------------------ccc----- Confidence 00 000 Q ss_pred hhccccCCCCCCcccccceeEEEEEEEEeecccccccccHHHHHHHHhhcCCChhHHHHHHHHHHHHHHhhHHHHHHHhh Q lcl|Aclame:pro 229 AELQENFNGSSNNEWNEMSFRIDKQVVEAKSRQLKAQYSIELAQDLRAVHGLDADAELSGILANEVMVELNREIVNLVNS 308 (514) Q Consensus 229 aEal~~~ggs~~~~f~EMsFsIEK~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAEaELanILStEImlEINReii~~l~~ 308 (514) ++. ..+....|.+..|...|..+ ...+|.||.+|- +.|-++.|.+-|+..|..-||+.|+.-. T Consensus 172 ~~~----~~~~~~~f~~v~~~~~k~~~-------~~~iS~ell~ds----~~~l~~~i~~~la~~~~~~~~~~il~g~-- 234 (395) T protein:vir:38 172 ALI----GDNDDPELTVVKYLIHRYAG-------ITTVTNTLLKDT----VDNIIQWLVNWAAKKDVVTRNAKILEVM-- 234 (395) T ss_pred ccc----ccccccceeeEEeeeeeeEe-------ehhhHHHHHhhh----HHHHHHHHHHHHHHHHHHHHHHHHhhcc-- Confidence 000 00112235555555555554 445999999993 3456888989898888888888885322 Q ss_pred heeecccccccccCCcceeccccccccccchhhHHHHHHHHHHHHHHHHHHHHhcccccccEEEEChhHHhHHhhccccc Q lcl|Aclame:pro 309 QAQIGKSGWTQGAGAAGVFDFSDAVDVKGARWAGEAYKALLIQIEKEANEIGRQTGRGNGNFIIASRNVVSALSMTDTLV 388 (514) Q Consensus 309 ~a~v~~~~~~~~v~~~g~~dl~~~~d~~~~rwa~e~~r~L~~~i~~~a~~I~~~T~r~~~n~~v~S~~va~~L~~~g~~~ 388 (514) .+ +....+..++ +....++......+ - .....+||+|.....|... . T Consensus 235 -g~--------~~~~~~~~~~-------------~~i~~~~~~~l~~~------~--~~~a~~v~n~~~~~~L~~l---k 281 (395) T protein:vir:38 235 -GK--------APKKPTISQF-------------DNIKDLENNTLDPA------I--ESTSSFITNQSGYNILSKV---K 281 (395) T ss_pred -cc--------cccccccccH-------------HHHHHHHHHhhhhh------h--cCCCEEEEcHHHHHHHHHh---h Confidence 11 0011122211 12233332222221 1 1234578999998888642 1 Q ss_pred cchhccccCccccccccCceEEEEecCceEEEecCCCcc-----ce-EEEE---------EecCCCcccceeeccccccc Q lcl|Aclame:pro 389 GPAAQGMQDGSMNTDTNQTVFAGVLGGRFKVYIDQYAVN-----DY-FTVG---------FKGSTEMDAGVFYSPYVPLT 453 (514) Q Consensus 389 ~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~-----dy-~~vG---------~kG~~~~~~~~fy~PYv~~~ 453 (514) . ..| .+.-.+...+-..++|. |++|++....+. +. +++| .+... .+=+.++. T Consensus 282 d--~~G---~~l~~~~~~~~~~~~l~-G~pV~~~~~~~~~~~~~~~~i~~gd~~~~~~i~~~~~~----~i~~~~~~--- 348 (395) T protein:vir:38 282 D--ADG---RYLMQPDVTSPDKYLID-GKPVIRIADKWLPDVSGSHPLYFGDLKQGITLFDRQQM----QIDTTNVG--- 348 (395) T ss_pred c--cCC---ceeeccCcCCCCcceec-cceeEEecccccCcCCCcceEEEEeccccEEEEEecce----EEEEeccc--- Confidence 1 001 11111111111113454 457776543221 11 2222 11100 01111110 Q ss_pred cccccCCccccceeeeeeeeeee-ecC--ccccccCcceeecCcchhhhccc Q lcl|Aclame:pro 454 PLRGSDSKNFQPVIGFKTRYGVQ-VNP--FADPTASATKVGNGAPVAASMGK 502 (514) Q Consensus 454 ~~~~~dp~s~qp~~~~~tRY~l~-~nP--f~~~~~~~~~i~~~~~~~~~~~~ 502 (514) ..+-..-+=.+-+..||+.. .+| |...+- .......+..-..|| T Consensus 349 ---~~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~--~~~~~~~~~~~~~~~ 395 (395) T protein:vir:38 349 ---AGSFEHDTTKLRFIDRFDVQLIDDGAFAAASF--KTVANQAQGTAGTGK 395 (395) T ss_pred ---cchhhcCceEEEEEEeeccEEecccceEEEEe--ecccCCCCCccCCCC Confidence 01112333455666777763 334 332211 001111111223355 No 90 >protein:vir:100884 Length: 389 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:1473 # MgeName: Lc-Nu # Cross-refs: genbank:acc:YP_358764;genbank:gi:78000028;genbank:GeneID:3726155 Probab=30.03 E-value=1.6 Score=19.45 Aligned_cols=323 Identities=15% Similarity=0.134 Sum_probs=119.5 Q ss_pred CcchhhhhhhhccccccccccccchhhhhhhhhhhhHHHHHHhcc--------cc---------cchhh---hhhhcccc Q lcl|Aclame:pro 1 MNLTEKWKDLLEAEGADMPEIATATKQKIMSKIFENQDRDINNDP--------MY---------RDPQL---VEAFNAGL 60 (514) Q Consensus 1 ~~l~~kw~p~l~~~~~~~~~i~~~~~~~~~~~~~enq~~~~~~~~--------~~---------~~~~~---~~~~~~~~ 60 (514) -+|.+++.-+.+. + .=|+.|.+.+..+. .. ..... ..+|..++ T Consensus 37 ~~l~~ei~~~~~~---------------~--~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l 99 (389) T protein:vir:10 37 QKIKDDLTAAKAR---------------R--DAINDQIKALEAEKPAEPKTEPKDDGSKKGTDLSKKPIDAKKKAINDFI 99 (389) T ss_pred HHHHHHHHHHHHH---------------H--HHHHHHHHHHHHHHHhhhhccccccccccccccchhHHHHHHHHHHHHh Confidence 1111111111000 0 00112222211000 00 00000 01111111 Q ss_pred cccccccccccccccccccccc-ccccccceeeehhhhhhhhhhhhcceeEEecCCcccceeeeeeeeecCCCCcccccc Q lcl|Aclame:pro 61 NEAVVNGDHGYDPANIAQGVTT-GAVTNIGPTVMGMVRRAIPQLIAFDIAGVQPMTGPTSQVFTLRSVYGKDPLTGAEAF 139 (514) Q Consensus 61 ~~a~~~~~~g~~~~~~~~st~t-g~v~~~~P~l~~l~Rra~~~LIa~DI~GVQPmTgPTGLIFAMRSrY~~~~~tg~EA~ 139 (514) -. .+.....++.++++ |.+.--....=.++++..+..+-.++|.|.||+++++-+--++. .. +.-+ T Consensus 100 r~------~~~~~~~~~~~t~~~gg~~vP~~~~~~i~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~--~~----~~~~- 166 (389) T protein:vir:10 100 HS------HGKVIDATSKVTSTEAGVLIPEEIIYDPTAEVNSVVDLSTLVTKTPVTTPKGTYPILKR--AT----DRFS- 166 (389) T ss_pred hc------chhhhhhhcccccCCcceeehHHHHHHHHHHHHhhhhHHhhcceeeccCCeeEEEEEec--CC----Cccc- Confidence 10 00001111111211 11111001111245555566677899999999988654332221 10 0000 Q ss_pred ccccCCCCccCccccccccccccccccccccccccccccccccccccccccccccccccccCcccccccccccccccccc Q lcl|Aclame:pro 140 HPTRQADASFSGQAAASTIADFPTTGAATDGTPYKAEVTTSGGDVSMRYFLALGAVTLAVAGQMTATEYTDGVAGGLLVE 219 (514) Q Consensus 140 ~~~nEadt~fSG~~~~~~~~~~~~~~~~t~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~y~ 219 (514) +-+ T Consensus 167 ---------~~~-------------------------------------------------------------------- 169 (389) T protein:vir:10 167 ---------SVA-------------------------------------------------------------------- 169 (389) T ss_pred ---------ccc-------------------------------------------------------------------- Confidence 000 Q ss_pred ccccccchhhhccccCCCCCCcccccceeEEEEEEEEeecccccccccHHHHHHHHhhcCCChhHHHHHHHHHHHHHHhh Q lcl|Aclame:pro 220 IDAGMATSQAELQENFNGSSNNEWNEMSFRIDKQVVEAKSRQLKAQYSIELAQDLRAVHGLDADAELSGILANEVMVELN 299 (514) Q Consensus 220 ~~~GmtTa~aEal~~~ggs~~~~f~EMsFsIEK~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAEaELanILStEImlEIN 299 (514) ++ ++.. ..+...|.+..+++.|..+ -..+|-||.+|- ..|-+++|.+-|...+..-+| T Consensus 170 --E~-----~~~~----~~~~~~~~~i~~~~~k~~~-------~~~iS~ell~ds----~~~l~~~i~~~la~~~~~~~~ 227 (389) T protein:vir:10 170 --EL-----AENP----KLAEPEFNKVDWSVATYRG-------AIPLSEEAIADS----AVDLTALVGQSIKEKSVNTYN 227 (389) T ss_pred --cc-----cccc----ccccccceeeeeeheeeEe-------eehhhHHHHhhh----hHHHHHHHHHHHHHHHHHHHH Confidence 00 0000 0112345666666666544 445999999984 246688899999999999899 Q ss_pred HHHHHHHhhheeecccccccccCCcceeccccccccccchhhHHHHHHHHHHHHHHHHHHHHhcccccccEEEEChhHHh Q lcl|Aclame:pro 300 REIVNLVNSQAQIGKSGWTQGAGAAGVFDFSDAVDVKGARWAGEAYKALLIQIEKEANEIGRQTGRGNGNFIIASRNVVS 379 (514) Q Consensus 300 Reii~~l~~~a~v~~~~~~~~v~~~g~~dl~~~~d~~~~rwa~e~~r~L~~~i~~~a~~I~~~T~r~~~n~~v~S~~va~ 379 (514) ..|+..+... ...+ ..+... .+....++..... . .+ ...+||+|.... T Consensus 228 ~~i~~g~~~~-------~~~~--~~~~~~-------------~d~l~~~~~~~~~--------~-~~-~a~~~~n~~~~~ 275 (389) T protein:vir:10 228 AMIAPVLQSF-------TAKK--TTTDTL-------------VDSLKHILNVDLD--------P-AY-SRALVVTQSLFN 275 (389) T ss_pred HHHhhhhccc-------cccc--cccccc-------------HHHHHHHHHhhhh--------h-hh-CcEEEecHHHHH Confidence 8886544211 1111 111111 1122333221111 1 12 245789999888 Q ss_pred HHhhccccccchhccccCccccccccCceEEEEecCceEEEe-cC-CCcc---ce-EEEEEecCCCcccceeeccccccc Q lcl|Aclame:pro 380 ALSMTDTLVGPAAQGMQDGSMNTDTNQTVFAGVLGGRFKVYI-DQ-YAVN---DY-FTVGFKGSTEMDAGVFYSPYVPLT 453 (514) Q Consensus 380 ~L~~~g~~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~-D~-y~~~---dy-~~vG~kG~~~~~~~~fy~PYv~~~ 453 (514) .|...---.+-+... .+ . .+.+...+-++|.|. +||+ |. ..+. |. +++|= +..+.++... ... T Consensus 276 ~L~~lkd~~G~~i~~--~~-~-~~~~~~~~~~~l~G~-pV~~~~~~~~~~~~~~~~~~~gd-----~~~~~~~~~~-~~~ 344 (389) T protein:vir:10 276 TLDTLKDKNGRYLLH--DA-S-DSITDGTAKGTILGV-PVYVVGDTLLGSLAGDQKAFVGD-----LKRGVLFTDR-QQV 344 (389) T ss_pred HHHHhhccCCCeeee--cC-c-ccccccccccccccc-eeEEecccccCCCCCceEEEEee-----ccccEEEEee-cce Confidence 877531000000000 00 0 011111122456554 7664 32 2221 11 33330 0000000000 011 Q ss_pred cccccCCccccceeeeeeeeeee-ecC--ccccccCcceeecCcchhhhccc Q lcl|Aclame:pro 454 PLRGSDSKNFQPVIGFKTRYGVQ-VNP--FADPTASATKVGNGAPVAASMGK 502 (514) Q Consensus 454 ~~~~~dp~s~qp~~~~~tRY~l~-~nP--f~~~~~~~~~i~~~~~~~~~~~~ 502 (514) .+...|-..|.-.+-..-|++.. .|| |.- +.-..--...++| T Consensus 345 ~i~~~~~~~~~~~~~~~~r~d~~~~~~~a~~~-------~~~~~~~~~~~~~ 389 (389) T protein:vir:10 345 TLAWEDSKIYGKYLGAAFRFGVQKADSKAGYF-------VTNTDVPGSALGK 389 (389) T ss_pred EEEeeccccccceEEEEEEeccEEecccceEE-------EEeeccCCCCCCC Confidence 11223445556667777898873 344 211 1100111122344 No 91 >protein:vir:94673 Length: 419 # NCBI annotation: major capsid protein # Family: family:all:585 # MgeID: mge:1527 # MgeName: mu1/6 # Cross-refs: genbank:acc:YP_579208;genbank:gi:93007444;genbank:GeneID:5076792 Probab=29.81 E-value=1.6 Score=19.42 Aligned_cols=346 Identities=10% Similarity=-0.010 Sum_probs=121.0 Q ss_pred Ccchhhhhhhhcc----ccccc----cccccchhhhhhhhhhhhHHHHHHhcccccchhhhhhhcccccccccccccccc Q lcl|Aclame:pro 1 MNLTEKWKDLLEA----EGADM----PEIATATKQKIMSKIFENQDRDINNDPMYRDPQLVEAFNAGLNEAVVNGDHGYD 72 (514) Q Consensus 1 ~~l~~kw~p~l~~----~~~~~----~~i~~~~~~~~~~~~~enq~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~g~~ 72 (514) ....+.=..-++. ..... ...+...++..-..-+.+.....++... ......+.....+.. +. T Consensus 53 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~---~~--- 123 (419) T protein:vir:94 53 AALLRTAPPAPKGPADGGTPLTPAEAGTFRSLAQRFADSDGLREYRARDKRGQF---QVEMRDIDPNRLLSR---DA--- 123 (419) T ss_pred HHHHHHHHHHHHHHhhhhccccccccccccchhhhhhhHHHHHHHHHhhhhhhh---hHHHHHHHHHHhhcc---cc--- Confidence 1111100000000 00000 0111111110000011111111111100 000000000000000 00 Q ss_pred ccccccccccccccccceeeehhhhh--hhhhhhhcceeEEecCCcccceeeeeeeeecCCCCccccccccccCCCCccC Q lcl|Aclame:pro 73 PANIAQGVTTGAVTNIGPTVMGMVRR--AIPQLIAFDIAGVQPMTGPTSQVFTLRSVYGKDPLTGAEAFHPTRQADASFS 150 (514) Q Consensus 73 ~~~~~~st~tg~v~~~~P~l~~l~Rr--a~~~LIa~DI~GVQPmTgPTGLIFAMRSrY~~~~~tg~EA~~~~nEadt~fS 150 (514) ..++.+.+....-|.+++=... .-..++..++|.+.||++++.-+ +|..-.+. + ..+ T Consensus 124 ----~~~~~~~~~~~~~p~~~~~~i~~~~~~~~~i~~~~~~~~~~~~~~~~--~~~~~~~~------~---------~~~ 182 (419) T protein:vir:94 124 ----PAGTITNPNVPHLPQLVPGIVPTTPDLPLLVADLLDQQNADYNVLEY--IRDTSGTA------G---------AGS 182 (419) T ss_pred ----ccccccCCcccccchhhhHHHHHHHhhhhhhhhcceeeeccCCceee--eeeccccc------c---------ccc Confidence 0011111111112222221111 11234557899999998764322 22110000 0 000 Q ss_pred ccccccccccccccccccccccccccccccccccccccccccccccccccCccccccccccccccccccccccccchhhh Q lcl|Aclame:pro 151 GQAAASTIADFPTTGAATDGTPYKAEVTTSGGDVSMRYFLALGAVTLAVAGQMTATEYTDGVAGGLLVEIDAGMATSQAE 230 (514) Q Consensus 151 G~~~~~~~~~~~~~~~~t~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~y~~~~GmtTa~aE 230 (514) +.. -..-.+| T Consensus 183 ~~~----------------------------------------------------------------------~a~~v~E 192 (419) T protein:vir:94 183 TWN----------------------------------------------------------------------KAAVVPE 192 (419) T ss_pred cCc----------------------------------------------------------------------ccceecC Confidence 000 0000011 Q ss_pred ccccCCCCCCcccccceeEEEEEEEEeecccccccccHHHHHHHHhhcCCChhHHHHHHHHHHHHHHhhHHHHHHHhhhe Q lcl|Aclame:pro 231 LQENFNGSSNNEWNEMSFRIDKQVVEAKSRQLKAQYSIELAQDLRAVHGLDADAELSGILANEVMVELNREIVNLVNSQA 310 (514) Q Consensus 231 al~~~ggs~~~~f~EMsFsIEK~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAEaELanILStEImlEINReii~~l~~~a 310 (514) +..+++...++++++..+|.=+-...+|-||.||.- +.+++|.+-|+..|...+|+.||. -.- T Consensus 193 ---------g~~~~~~~~~~~~i~~~~~k~~~~~~is~ell~d~~-----~l~~~i~~~la~a~~~~~d~aii~---G~G 255 (419) T protein:vir:94 193 ---------GTAKPQSTLSFDTITTTLKTVAHWLPITRQAADDNS-----QLMGYIQGRLTYGLRFLRDRQLLN---GNG 255 (419) T ss_pred ---------CccccccccceeeEEeeeeeEEEeehhhHHHHHhHH-----HHHHHHHHHHHHHHHHHHHHHHHh---ccC Confidence 123445555566666666665666779999999963 358999999999999999999953 211 Q ss_pred eecccccccccCCcceeccccccccc-cchhhHHHHHHHHHHHHHHHHHHHHhcccccccEEEEChhHHhHHhhcccccc Q lcl|Aclame:pro 311 QIGKSGWTQGAGAAGVFDFSDAVDVK-GARWAGEAYKALLIQIEKEANEIGRQTGRGNGNFIIASRNVVSALSMTDTLVG 389 (514) Q Consensus 311 ~v~~~~~~~~v~~~g~~dl~~~~d~~-~~rwa~e~~r~L~~~i~~~a~~I~~~T~r~~~n~~v~S~~va~~L~~~g~~~~ 389 (514) + +. +.|++......... ..-+.....-..+..|.++-+.+. ......+.+||+|.....|...--=.. T Consensus 256 ~----~~-----p~Gi~~~~~~~~~~~~~~~~~~t~~~~~~~l~~~~~~~~--~~~~~~~~~v~n~~~~~~l~~~k~~~~ 324 (419) T protein:vir:94 256 S----TE-----MQGILTTPGIGTYQQPKPTAPATDEPPLVDIRRAKTVAE--IAGFPPDGVVVHPQDWESIELDQAPGS 324 (419) T ss_pred c----cc-----ccceecccccccccccccccccccchhHHHHHHHHHhhh--hccCCCCEEEEcHHHHHHHHHHhhcCC Confidence 1 11 22222111000000 000011111222333344433333 223567789999998877754211000 Q ss_pred chhccccCccccccccCceEEEEecCceEEEecCCCccceEEEEEecCCCcccceeecccccccccc-ccCCc------c Q lcl|Aclame:pro 390 PAAQGMQDGSMNTDTNQTVFAGVLGGRFKVYIDQYAVNDYFTVGFKGSTEMDAGVFYSPYVPLTPLR-GSDSK------N 462 (514) Q Consensus 390 ~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~kG~~~~~~~~fy~PYv~~~~~~-~~dp~------s 462 (514) -+. ...... .+..+ ++|. |++|+++...+..-+++|--.. +|- ......+. .+++. . T Consensus 325 ~~~--~~~~~~-~~~~~----~~l~-G~pV~~~~~~~~~~~~~gd~~~-------~~~-~~~~~~~~v~~~~~~~~~~~~ 388 (419) T protein:vir:94 325 GVF--RVIANV-QGEAT----PRIW-GLNVVSTVAIAQGTALVGGFRQ-------GAT-LWSRQGITVLMTDSHADFFTA 388 (419) T ss_pred Cce--eecCCc-ccCCC----cccc-ceeeEEcCCCCCccEEEeeccc-------eEE-EEEecceEEEEeccccchhhc Confidence 000 000000 01111 3554 4699999887754444441100 000 00000000 01111 1 Q ss_pred ccceeeeeeeeeee-ecCccccccCcceeecCcchhhhccccceeeeeeeecC Q lcl|Aclame:pro 463 FQPVIGFKTRYGVQ-VNPFADPTASATKVGNGAPVAASMGKNAYFRRVFVKGL 514 (514) Q Consensus 463 ~qp~~~~~tRY~l~-~nPf~~~~~~~~~i~~~~~~~~~~~~~~~~r~~~V~~~ 514 (514) -+=.+=+..||++. .+| .-|.++.++-. T Consensus 389 ~~~~~r~~~r~d~~v~~~------------------------~a~~~~~~~aa 417 (419) T protein:vir:94 389 NTLVILAEFRANLAVYQP------------------------KAFVRVTFAAA 417 (419) T ss_pred CcEEEEEEEeeccEEecc------------------------ccEEEEEeccC Confidence 22233344566552 233 01111111111 No 92 >protein:vir:1025 Length: 408 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:20 # MgeName: bIL286 # Cross-refs: genbank:acc:NP_076679;genbank:gi:13095788;genbank:GeneID:920362 Probab=26.94 E-value=1.9 Score=19.06 Aligned_cols=342 Identities=12% Similarity=0.136 Sum_probs=119.8 Q ss_pred Cc---chhhhhh--------------hhccccccccccccchhhhh---hhh--hhhhHHHHHHhc--------cccc-- Q lcl|Aclame:pro 1 MN---LTEKWKD--------------LLEAEGADMPEIATATKQKI---MSK--IFENQDRDINND--------PMYR-- 48 (514) Q Consensus 1 ~~---l~~kw~p--------------~l~~~~~~~~~i~~~~~~~~---~~~--~~enq~~~~~~~--------~~~~-- 48 (514) |+ |.++|.- .++.+....=+|... +..+ .++ -|+.|.+++.+. .... T Consensus 5 m~l~el~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ee~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 83 (408) T protein:vir:10 5 LTVNQLNEAWIASGDKVTDFNDQINMALNDDNFSAEAMSEL-KNKRDNEKVRRDALREQLVEAQAEQVVNMREEEKGPLN 83 (408) T ss_pred ccHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHhccccccccccc Confidence 22 2223321 121111111112111 1111 000 122222222111 0000 Q ss_pred ------chhhhhhhcccccccccccccccccccccccccc-cc--c-cccceeeehhhhhhhhhhhhcceeEEecCCccc Q lcl|Aclame:pro 49 ------DPQLVEAFNAGLNEAVVNGDHGYDPANIAQGVTT-GA--V-TNIGPTVMGMVRRAIPQLIAFDIAGVQPMTGPT 118 (514) Q Consensus 49 ------~~~~~~~~~~~~~~a~~~~~~g~~~~~~~~st~t-g~--v-~~~~P~l~~l~Rra~~~LIa~DI~GVQPmTgPT 118 (514) ......+|..++....--... -....+..++.+ |. | ..+.+. +++.+.......+++.+.||+++. T Consensus 84 ~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~a~~~~t~~~gg~~vP~~~~~~---Ii~~~~~~~~l~~~~~~~~~~~~~ 159 (408) T protein:vir:10 84 KSENELKDKFVKDFVNMVRNPMAFMNT-VSSKTETSGSDSAAGLTIPQDIRTM---INTLVRQYDSLQQYVRVESVSTSN 159 (408) T ss_pred cchhhhHHHHHHHHHHHhhcchhhhhh-hhhhhhhcccccCCceeccHhHHHH---HHHHHHhhchhhhhcceeeccCCc Confidence 011112222222110000000 000001111111 11 1 111222 444445566678899999999988 Q ss_pred ceeeeeeeeecCCCCccccccccccCCCCccCcccccccccccccccccccccccccccccccccccccccccccccccc Q lcl|Aclame:pro 119 SQVFTLRSVYGKDPLTGAEAFHPTRQADASFSGQAAASTIADFPTTGAATDGTPYKAEVTTSGGDVSMRYFLALGAVTLA 198 (514) Q Consensus 119 GLIFAMRSrY~~~~~tg~EA~~~~nEadt~fSG~~~~~~~~~~~~~~~~t~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~ 198 (514) |-+--.+-. +. + +.+.|-+. T Consensus 160 ~~~~~~~~~--~~--~----------~~a~~v~E---------------------------------------------- 179 (408) T protein:vir:10 160 GSRVYEKWT--DV--T----------PLTVMDAE---------------------------------------------- 179 (408) T ss_pred ceEEEeecc--cc--c----------cceeeecC---------------------------------------------- Confidence 865433210 00 0 00001000 Q ss_pred ccCccccccccccccccccccccccccchhhhccccCCCCCCcccccceeEEEEEEEEeecccccccccHHHHHHHHhhc Q lcl|Aclame:pro 199 VAGQMTATEYTDGVAGGLLVEIDAGMATSQAELQENFNGSSNNEWNEMSFRIDKQVVEAKSRQLKAQYSIELAQDLRAVH 278 (514) Q Consensus 199 ~~~~~~~~~~~~~~a~~~~y~~~~GmtTa~aEal~~~ggs~~~~f~EMsFsIEK~tVtAKSRaLKAEYT~ELAQDLkAiH 278 (514) + ++.. ..+...|.++.|.+.|..+- ..+|-||.+|- T Consensus 180 ------------------------~-----~~~~----~~~~~~~~~i~~~~~k~~~~-------~~iS~ell~ds---- 215 (408) T protein:vir:10 180 ------------------------D-----GKIP----DLDNPQLTIIKYLIKRYAGI-------ITATNTSLKDT---- 215 (408) T ss_pred ------------------------c-----cccc----cccCcceeeEEeeeeeEEee-------ehhHHHHHhhc---- Confidence 0 0000 01112356666666665544 55999999994 Q ss_pred CCChhHHHHHHHHHHHHHHhhHHHHHHHhhheeecccccccccCCcceeccccccccccchhhHHHHHHHH-HHHHHHHH Q lcl|Aclame:pro 279 GLDADAELSGILANEVMVELNREIVNLVNSQAQIGKSGWTQGAGAAGVFDFSDAVDVKGARWAGEAYKALL-IQIEKEAN 357 (514) Q Consensus 279 GLDAEaELanILStEImlEINReii~~l~~~a~v~~~~~~~~v~~~g~~dl~~~~d~~~~rwa~e~~r~L~-~~i~~~a~ 357 (514) .+|.+++|.+-|+..|..-+|+.|+.-. .+.. ...+..++ +....++ ..+. T Consensus 216 ~~~l~~~i~~~l~~~~~~~~~~~il~g~---g~~~--------~~~~~~~~-------------~~l~~~~~~~~~---- 267 (408) T protein:vir:10 216 AENILAWLSSWIAKKVVVTRNQAIIEVM---KAAP--------KKPTIAKF-------------DDVITMINTAVD---- 267 (408) T ss_pred hHHHHHHHHHHHHHHHHHHHHHHHhhcc---cccc--------cccccccH-------------HHHHHHHHHhhh---- Confidence 4567899999999999999999885433 2110 01122111 1122221 1111 Q ss_pred HHHHhcccccc-cEEEEChhHHhHHhhccccccchhccccCccccccccCceEEEEecCceEEEecCCCccceEEEEEec Q lcl|Aclame:pro 358 EIGRQTGRGNG-NFIIASRNVVSALSMTDTLVGPAAQGMQDGSMNTDTNQTVFAGVLGGRFKVYIDQYAVNDYFTVGFKG 436 (514) Q Consensus 358 ~I~~~T~r~~~-n~~v~S~~va~~L~~~g~~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~kG 436 (514) . .+.+ -.+||+|.....|...---..-+. ...+.+.. .-++|. |++|++-.+. .++-.| T Consensus 268 -----~-~~~~~a~~v~n~~~~~~l~~lkd~~G~~i-------~~~~~~~~-~~~~l~-G~PV~~~~~~-----~~~~~~ 327 (408) T protein:vir:10 268 -----P-AIIATSSLLTNQSGLNKLALVKTAEGKYL-------LEPDPTKP-NSYLIK-GKQVIVVADR-----WLPNTG 327 (408) T ss_pred -----h-hhccCCEEEEcHHHHHHHHHhhccCCceE-------eccCcCCC-CCceec-ceeeEEeccc-----ccCccC Confidence 1 1222 257899998888875311110000 00011110 012453 4466653221 111111 Q ss_pred CCCcccceeeccccc----cc--ccc-ccCC------ccccceeeeeeeeee-eecC--ccccc--cCcceeecCcchhh Q lcl|Aclame:pro 437 STEMDAGVFYSPYVP----LT--PLR-GSDS------KNFQPVIGFKTRYGV-QVNP--FADPT--ASATKVGNGAPVAA 498 (514) Q Consensus 437 ~~~~~~~~fy~PYv~----~~--~~~-~~dp------~s~qp~~~~~tRY~l-~~nP--f~~~~--~~~~~i~~~~~~~~ 498 (514) ++.. .+||+-+-. .+ .+. .+++ .+.+=.+-+..||++ ..+| |...+ .-..-. +..... T Consensus 328 ~~~~--~i~~gd~~~~~~~~~~~~~~v~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~~~~~~~~~~~--~~~~~~ 403 (408) T protein:vir:10 328 STVY--PLYYGDMSQAITLFDRENMSLLPTNIGAGAFETDTTKIRVIDRFDVKATDSEALVAGSFSAIADQV--GNFKTT 403 (408) T ss_pred CCce--EEEEEehhccEEEEEecceEEEEcccccchhhcCceEEEEEEeeccEEeccccEEEEEeeccccCC--CCCCCC Confidence 1111 123322111 00 000 1122 233445556677776 3444 22111 000000 000000 Q ss_pred hcccc Q lcl|Aclame:pro 499 SMGKN 503 (514) Q Consensus 499 ~~~~~ 503 (514) .++.- T Consensus 404 ~~~~~ 408 (408) T protein:vir:10 404 TSTAV 408 (408) T ss_pred CcccC Confidence 00000 No 93 >protein:vir:97148 Length: 324 # NCBI annotation: ORF010 # Family: family:all:507 # MgeID: mge:1654 # MgeName: 85 # Cross-refs: genbank:acc:YP_239726;genbank:gi:66394880;genbank:GeneID:5130881 Probab=26.90 E-value=1.9 Score=19.06 Aligned_cols=304 Identities=13% Similarity=0.080 Sum_probs=113.3 Q ss_pred cccchhhhhhhhhhhhHHHHHHhcccccchhhhhhhccccccccccccccccccccccccccccccccceeeehhhhhhh Q lcl|Aclame:pro 21 IATATKQKIMSKIFENQDRDINNDPMYRDPQLVEAFNAGLNEAVVNGDHGYDPANIAQGVTTGAVTNIGPTVMGMVRRAI 100 (514) Q Consensus 21 i~~~~~~~~~~~~~enq~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~g~~~~~~~~st~tg~v~~~~P~l~~l~Rra~ 100 (514) ...+++...-.+-+.+=.+...+ +.+ .+.. .+.+++..--..+.=-+++.+. T Consensus 1 ~~~~~~~~~~~~~f~~~~~~~~~------------~~a---------------~~~~-~~~~~~~~iP~~~~~~ii~~~~ 52 (324) T protein:vir:97 1 MEQTQKLKLNLQHFASNNVKPQV------------FNP---------------DNVM-MHEKKDGTLMNEFTTPILQEVM 52 (324) T ss_pred CccchhHHHHHHHHHHhhhhhhh------------hcc---------------cccc-ccCCCcceechhHHHHHHHHHH Confidence 00011111111111111111000 000 0000 1111111100011112344555 Q ss_pred hhhhhcceeEEecCCcccceeeeeeeeecCCCCccccccccccCCCCccCcccccccccccccccccccccccccccccc Q lcl|Aclame:pro 101 PQLIAFDIAGVQPMTGPTSQVFTLRSVYGKDPLTGAEAFHPTRQADASFSGQAAASTIADFPTTGAATDGTPYKAEVTTS 180 (514) Q Consensus 101 ~~LIa~DI~GVQPmTgPTGLIFAMRSrY~~~~~tg~EA~~~~nEadt~fSG~~~~~~~~~~~~~~~~t~~~~~~~~~~~~ 180 (514) .+.+-.+++-+.||++.+--|- +... +.+| .|-+ T Consensus 53 ~~s~l~~~~~~~~~~~~~~~ip----~~~~----~~~a---------~~v~----------------------------- 86 (324) T protein:vir:97 53 ENSKIMQLGKYEPMEGTEKKFT----FWAD----KPGA---------YWVG----------------------------- 86 (324) T ss_pred hhcchhhhcceeeccCCceEEE----EEec----Ccce---------eEec----------------------------- Confidence 6777888899999987653221 1110 0000 0000 Q ss_pred ccccccccccccccccccccCccccccccccccccccccccccccchhhhccccCCCCCCcccccceeEEEEEEEEeecc Q lcl|Aclame:pro 181 GGDVSMRYFLALGAVTLAVAGQMTATEYTDGVAGGLLVEIDAGMATSQAELQENFNGSSNNEWNEMSFRIDKQVVEAKSR 260 (514) Q Consensus 181 ~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~y~~~~GmtTa~aEal~~~ggs~~~~f~EMsFsIEK~tVtAKSR 260 (514) | +..+++...++++++.+.|.- T Consensus 87 -------------------------------------------------E---------g~~~~~~~~~f~~v~~~~~k~ 108 (324) T protein:vir:97 87 -------------------------------------------------E---------GQKIETSKATWVNATMRAFKL 108 (324) T ss_pred -------------------------------------------------c---------CccccccccceeEEEEeeEEE Confidence 1 011223333444444444444 Q ss_pred cccccccHHHHHHHHhhcCCChhHHHHHHHHHHHHHHhhHHHHHHHhhheeecccccccccCCcceeccccc-cccccch Q lcl|Aclame:pro 261 QLKAQYSIELAQDLRAVHGLDADAELSGILANEVMVELNREIVNLVNSQAQIGKSGWTQGAGAAGVFDFSDA-VDVKGAR 339 (514) Q Consensus 261 aLKAEYT~ELAQDLkAiHGLDAEaELanILStEImlEINReii~~l~~~a~v~~~~~~~~v~~~g~~dl~~~-~d~~~~r 339 (514) +.-..+|-||.+|-. .|.+++|.+-|+..|...+++.||.- ..+-. .+.|++..... +-...+ T Consensus 109 ~~~~~is~ell~ds~----~~l~~~i~~~l~~aia~~~d~a~l~G---~g~~~--------~~~gi~~~~~~~~~~~~~- 172 (324) T protein:vir:97 109 GVILPVTKEFLNYTY----SQFFEEMKPMIAEAFYKKFDEAGILN---QGNNP--------FGKSIAQSIEKTNKVIKG- 172 (324) T ss_pred EEeehhhHHHHhcch----HHHHHHHHHHHHHHHHHHHHHHhhcc---CCCCc--------cCccccccccccceeccc- Confidence 445559999999863 56799999999999999999999532 21100 11222211000 000000 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHhcccccccEEEEChhHHhHHhhccccccchhccccCccccccccCceEEEEecCceEE Q lcl|Aclame:pro 340 WAGEAYKALLIQIEKEANEIGRQTGRGNGNFIIASRNVVSALSMTDTLVGPAAQGMQDGSMNTDTNQTVFAGVLGGRFKV 419 (514) Q Consensus 340 wa~e~~r~L~~~i~~~a~~I~~~T~r~~~n~~v~S~~va~~L~~~g~~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~v 419 (514) .-.+..|+++.+.|.. .......+||+|.....|....- ..| ...-.+... ++|.| ++| T Consensus 173 ------~~~~~~i~~~~~~l~~--~~~~~~~~v~n~~~~~~L~~lkd-----~~g---~~~~~~~~~----~tl~G-~PV 231 (324) T protein:vir:97 173 ------DFTQDNIIDLEALLED--DELEANAFISKTQNRSLLRKIVD-----PET---KERIYDRNS----DTLDG-LPV 231 (324) T ss_pred ------cCCHHHHHHHHHhhhh--ccCCCCEEEEcHHHHHHHHHhhc-----CCC---ceeecCCCC----ccccc-eee Confidence 0112233444444432 23445568999999988874311 011 011111122 34544 478 Q ss_pred EecCCCcc--ceEEEE--------EecCCCcccceeeccccccccccccCCc---cc---cceeeeeeeeee-eecC--c Q lcl|Aclame:pro 420 YIDQYAVN--DYFTVG--------FKGSTEMDAGVFYSPYVPLTPLRGSDSK---NF---QPVIGFKTRYGV-QVNP--F 480 (514) Q Consensus 420 y~D~y~~~--dy~~vG--------~kG~~~~~~~~fy~PYv~~~~~~~~dp~---s~---qp~~~~~tRY~l-~~nP--f 480 (514) ++.+.... ..+++| ..++-..+-+ .+.-+......|.. -| +=.+=+..||+. ..+| | T Consensus 232 ~~~~~~~~~~~~~~~gd~~~~~i~~~~~~~i~~~----~~~~~~~~~~~~~~~~~~f~~d~~~~r~~~r~d~~v~~~~a~ 307 (324) T protein:vir:97 232 VNLKSSNLKRGELITGDFDKLIYGIPQLIEYKID----ETAQLSTVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAF 307 (324) T ss_pred EeecCCCCCcceEEEEecccEEEEEecCcEEEEe----ecccccccccccccchhhhhcCcEEEEEEEEeccEEecccce Confidence 87665442 123333 2221111000 00000000000000 01 112223456664 3444 3 Q ss_pred cccccCcceeecCcchhhhcccc Q lcl|Aclame:pro 481 ADPTASATKVGNGAPVAASMGKN 503 (514) Q Consensus 481 ~~~~~~~~~i~~~~~~~~~~~~~ 503 (514) +..+.- .... ....++- T Consensus 308 ~~l~~~-~~~~-----~~~~~~~ 324 (324) T protein:vir:97 308 AKLVPA-DKKT-----DSVPGEV 324 (324) T ss_pred EEEEec-cCCC-----CCCCCCC Confidence 321100 0000 0111111 No 94 >protein:vir:95107 Length: 270 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1549 # MgeName: X2 # Cross-refs: genbank:acc:YP_240822;genbank:gi:66394683;genbank:GeneID:5133901 Probab=25.71 E-value=2 Score=18.90 Aligned_cols=261 Identities=12% Similarity=-0.001 Sum_probs=114.8 Q ss_pred CCCCccCcccccccccccccccccccccccccccccccccccccccccc--cc-ccccccCccccccccccccccccccc Q lcl|Aclame:pro 144 QADASFSGQAAASTIADFPTTGAATDGTPYKAEVTTSGGDVSMRYFLAL--GA-VTLAVAGQMTATEYTDGVAGGLLVEI 220 (514) Q Consensus 144 Eadt~fSG~~~~~~~~~~~~~~~~t~~~~~~~~~~~~~g~~~~~~~~~~--~~-~~~~~~~~~~~~~~~~~~a~~~~y~~ 220 (514) =+-+.++--. ...++..+.... .. ........+. .. .+ ..+.+.++ T Consensus 1 Ma~T~~~d~I---------------------------~Pev~~~~V~e~~~~~~~~~~~~~~d~--~L-~g-~~G~ti~~ 49 (270) T protein:vir:95 1 MTQTKKANLI---------------------------NPEVLANVVSAQMQNAIRFTPYAVTDD--TL-VG-QPGDTITR 49 (270) T ss_pred CCceehhhhc---------------------------chHHHHHHHHHHHHhHHhhcccccccc--cc-CC-CCCCEEEe Confidence 0001000000 000000000000 00 0000000000 00 00 11222233 Q ss_pred cccccchhhhccccCCCCCCcccccceeEEEEEEEEeecccccccccHHHHHHHHhhc-CCChhHHHHHHHHHHHHHHhh Q lcl|Aclame:pro 221 DAGMATSQAELQENFNGSSNNEWNEMSFRIDKQVVEAKSRQLKAQYSIELAQDLRAVH-GLDADAELSGILANEVMVELN 299 (514) Q Consensus 221 ~~GmtTa~aEal~~~ggs~~~~f~EMsFsIEK~tVtAKSRaLKAEYT~ELAQDLkAiH-GLDAEaELanILStEImlEIN 299 (514) +.--.+..+|.+. ....-+..+ .+..+.+++.|-|.-.=++| ||.+.- |-|.-.|..+-++.-|+.+++ T Consensus 50 P~~~~igdae~~~---eg~~i~~~~--lt~~~~~a~i~~~gk~~~it-----D~a~~~~~~dp~~~~~~q~a~~~a~~~d 119 (270) T protein:vir:95 50 PKYAYIGAAEDLQ---EGVAMDTTQ--MSMTTTKVTVKETGKAVEVT-----QTAIITNVNGTLQEASRQLAMSLADKVE 119 (270) T ss_pred eeecCCCcccccc---CCCccchhh--cccchheeeeehhhCcceec-----HHHHhhhccchHHHHHHHHHHHHHHHHH Confidence 2211233444332 111223344 44566667777776555554 444433 459999999999999999999 Q ss_pred HHHHHHHhhheeecccccccccCCcceeccccccccccchhhHHHHHHHHHHHHHHHHHHHHhcccccccEEEEChhHHh Q lcl|Aclame:pro 300 REIVNLVNSQAQIGKSGWTQGAGAAGVFDFSDAVDVKGARWAGEAYKALLIQIEKEANEIGRQTGRGNGNFIIASRNVVS 379 (514) Q Consensus 300 Reii~~l~~~a~v~~~~~~~~v~~~g~~dl~~~~d~~~~rwa~e~~r~L~~~i~~~a~~I~~~T~r~~~n~~v~S~~va~ 379 (514) .++|..|.... +.. +..++....+ .+. .++..+ ...-++++|.|++++ T Consensus 120 ~~li~~l~~a~------~~~----~~~~t~~~~~----------dA~---~~lgd~---------~~~~~~i~vhs~~~~ 167 (270) T protein:vir:95 120 IDYIAELNKSK------QTA----TVSADATGIL----------DAI---EVFNSE---------NDEDYVLYVNPKDYN 167 (270) T ss_pred HHHHHHhcccc------ccc----ccccCHHHHH----------HHH---HHhccc---------cCCCcEEEEcHHHHH Confidence 99987774321 111 1112211111 111 112221 355679999999999 Q ss_pred HHhhccccccchhccccCccccccccCceEEEEecCceEEEecCCCccceEEEEEe-cCCCcccceeecccccccccc-c Q lcl|Aclame:pro 380 ALSMTDTLVGPAAQGMQDGSMNTDTNQTVFAGVLGGRFKVYIDQYAVNDYFTVGFK-GSTEMDAGVFYSPYVPLTPLR-G 457 (514) Q Consensus 380 ~L~~~g~~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~~dy~~vG~k-G~~~~~~~~fy~PYv~~~~~~-~ 457 (514) .|....+++..... .+.-.++. .|.+. |++|++|.+.+.+|-..-++ |.-. |+-.=.+. ++ . T Consensus 168 ~Lrk~~~~~~~~~~------~~~~~~G~--ig~~~-G~~Viv~s~~~~~~~~~l~~~gAi~-----~~~~~~~~--vEtd 231 (270) T protein:vir:95 168 KLVKSLFKVGGNVQ------DRAISKGD--LVEIV-GVSDIVKSKRVSENTAFLQRYGAME-----IVNKKKPE--AYTD 231 (270) T ss_pred HHHhhhcccccccc------cchhcccc--cceec-ceeEEEeCCCCCceeEEEEecccee-----eeecCCce--eeec Confidence 99887766532111 01111111 25554 57999998888777333333 2111 11111111 22 3 Q ss_pred cCCccccceeeeeeeeee-eecCc--cccccCcceeecCcchhhhccccce Q lcl|Aclame:pro 458 SDSKNFQPVIGFKTRYGV-QVNPF--ADPTASATKVGNGAPVAASMGKNAY 505 (514) Q Consensus 458 ~dp~s~qp~~~~~tRY~l-~~nPf--~~~~~~~~~i~~~~~~~~~~~~~~~ 505 (514) .|+..++-.+-...+|++ ..||= ...+-.+ +|.-.| T Consensus 232 Rd~~~~~d~i~~~~~y~v~~~~~skvv~~t~~~------------a~~~~~ 270 (270) T protein:vir:95 232 FDILKRTHLLSTNYHYSVNLKDETGVVKVTFKP------------SGSLEM 270 (270) T ss_pred cchhhcccEEEeeeEEEEEEEccceEEEEEecC------------CCCcCC Confidence 588888888888888887 23331 1111111 111122 No 95 >protein:vir:95763 Length: 297 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1578 # MgeName: SMP # Cross-refs: genbank:acc:YP_950590;genbank:gi:119953785;genbank:GeneID:5076833 Probab=24.25 E-value=2.2 Score=18.71 Aligned_cols=278 Identities=11% Similarity=0.046 Sum_probs=110.8 Q ss_pred cccccccccccccccccccccccceeeehhhhhhhhhhhhcceeEEecCCcccceeeeeeeeecCCCCccccccccccCC Q lcl|Aclame:pro 66 NGDHGYDPANIAQGVTTGAVTNIGPTVMGMVRRAIPQLIAFDIAGVQPMTGPTSQVFTLRSVYGKDPLTGAEAFHPTRQA 145 (514) Q Consensus 66 ~~~~g~~~~~~~~st~tg~v~~~~P~l~~l~Rra~~~LIa~DI~GVQPmTgPTGLIFAMRSrY~~~~~tg~EA~~~~nEa 145 (514) =.-...++.+...++ +++..--..+.=.+++.+.+.-+-..++.+.||++++...+-... . +.+|- T Consensus 1 m~~~~~~~~~~~~t~-~~~~lvP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~~~~~~--~-----~~~a~------ 66 (297) T protein:vir:95 1 MTVQTFNPENVLVSQ-KKDGTLHKEFTDIIMKEVAQNSLVMQLGQYQEMEGEQEKTVYVQT--D-----GISAY------ 66 (297) T ss_pred CCccccccccccccC-CCcceechhHHHHHHHHHHhhchhhhhcceeecCCCccEEEEEEc--C-----CceeE------ Confidence 001111222222111 111110001111233444456677788999999888765542211 0 00110 Q ss_pred CCccCccccccccccccccccccccccccccccccccccccccccccccccccccCcccccccccccccccccccccccc Q lcl|Aclame:pro 146 DASFSGQAAASTIADFPTTGAATDGTPYKAEVTTSGGDVSMRYFLALGAVTLAVAGQMTATEYTDGVAGGLLVEIDAGMA 225 (514) Q Consensus 146 dt~fSG~~~~~~~~~~~~~~~~t~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~y~~~~Gmt 225 (514) |- T Consensus 67 ---~v--------------------------------------------------------------------------- 68 (297) T protein:vir:95 67 ---WV--------------------------------------------------------------------------- 68 (297) T ss_pred ---Ee--------------------------------------------------------------------------- Confidence 00 Q ss_pred chhhhccccCCCCCCcccccceeEEEEEEEEeecccccccccHHHHHHHHhhcCCChhHHHHHHHHHHHHHHhhHHHHHH Q lcl|Aclame:pro 226 TSQAELQENFNGSSNNEWNEMSFRIDKQVVEAKSRQLKAQYSIELAQDLRAVHGLDADAELSGILANEVMVELNREIVNL 305 (514) Q Consensus 226 Ta~aEal~~~ggs~~~~f~EMsFsIEK~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAEaELanILStEImlEINReii~~ 305 (514) +| +..+++-..++++++...|..+-...+|.||.+|-. .|.++.|.+-|+..|...+++.||.- T Consensus 69 ---~E---------g~~~~~~~~~f~~v~l~~~k~~~~~~is~ell~ds~----~~l~~~i~~~la~ai~~~~d~a~l~G 132 (297) T protein:vir:95 69 ---NE---------TEKIKTDKPEVVPVTLKAHKLGIILVTSREALNYTW----KKFFEDMKPQIVEAFYKKIDEAGLLG 132 (297) T ss_pred ---ec---------CccccccccceeEEEEeeEEEEEeehhhHHHHhcCH----HHHHHHHHHHHHHHHHHHHHHHHhcc Confidence 01 112333334456666666666666779999999975 35789999999999999999999522 Q ss_pred HhhheeecccccccccCCcceeccccccccccchhhHHHHHHHHHHHHHHHHHHHHhcccccccEEEEChhHHhHHhhcc Q lcl|Aclame:pro 306 VNSQAQIGKSGWTQGAGAAGVFDFSDAVDVKGARWAGEAYKALLIQIEKEANEIGRQTGRGNGNFIIASRNVVSALSMTD 385 (514) Q Consensus 306 l~~~a~v~~~~~~~~v~~~g~~dl~~~~d~~~~rwa~e~~r~L~~~i~~~a~~I~~~T~r~~~n~~v~S~~va~~L~~~g 385 (514) - -. ..+.|++...... . .... ..-.+..|.++...|... ....+.+||+|+....|... T Consensus 133 ~---g~---------~~~~gi~~~~~~~--~--~~~~--~~~t~~~i~~~~~~l~~~--~~~~~~~v~~~~~~~~L~~l- 191 (297) T protein:vir:95 133 H---DT---------PFANSVAKAAKDA--N--KVIG--GPINYDNILKLQDALYDA--DVEPNAFVSKIQNRSALREA- 191 (297) T ss_pred c---CC---------ccccccccccccc--c--eecc--cccCHHHHHHHHHHhhhc--cCCcCEEEEcHHHHHHHHHh- Confidence 1 10 0122222211100 0 0000 001122334444444432 24456789999998888742 Q ss_pred ccccchhccccCccccccccCceEEEEecCceEEEecCCCc--cceEEE--------EEecCCCcccceeeccccccccc Q lcl|Aclame:pro 386 TLVGPAAQGMQDGSMNTDTNQTVFAGVLGGRFKVYIDQYAV--NDYFTV--------GFKGSTEMDAGVFYSPYVPLTPL 455 (514) Q Consensus 386 ~~~~~~~~~~~~~~~~~d~~~~~~~G~l~~~~~vy~D~y~~--~dy~~v--------G~kG~~~~~~~~fy~PYv~~~~~ 455 (514) .. ..|. .. ..+. .++|. |++|++-+... ..-+++ |..++-+.+- ..+ .... T Consensus 192 --~d--~~G~---~i---~~~~--~~~l~-G~Pv~~~~~~~~~~~~~~~gd~s~~~~~~~~~~~i~~----~~~--~~~~ 252 (297) T protein:vir:95 192 --RD--GNKV---SI---YDKA--ANTID-GITTVDLKSARFEKGDLLAGDFDNLIYGVPYNITYKI----SEE--GQIS 252 (297) T ss_pred --hc--cCCc---ee---ecCC--CCccc-ceeeEeecCCCCCCceEEEEecccEEEEEecCeEEEE----eec--cccc Confidence 10 0000 00 0111 02333 34665433322 111111 1111100000 000 0000 Q ss_pred cccCCc-----ccc-ceeee--eeeeeeee-cCccccccCcceeecCcch Q lcl|Aclame:pro 456 RGSDSK-----NFQ-PVIGF--KTRYGVQV-NPFADPTASATKVGNGAPV 496 (514) Q Consensus 456 ~~~dp~-----s~q-p~~~~--~tRY~l~~-nPf~~~~~~~~~i~~~~~~ 496 (514) ...|+. -|| =.++| ..|++..+ || +...++...++- T Consensus 253 ~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~-----~a~~~l~~at~~ 297 (297) T protein:vir:95 253 TITNADGTPINLFEQEMIAIRATMDIAVMITKT-----DAFAKLTPAERV 297 (297) T ss_pred cccccCccchhhhhcCcEEEEEEEEeccEeecc-----cceEEEeecCCC Confidence 000110 011 11111 12333321 11 122333333333 No 96 >protein:vir:99920 Length: 311 # NCBI annotation: gp7 # Family: family:all:966 # MgeID: mge:1611 # MgeName: Halo # Cross-refs: genbank:acc:YP_655524;genbank:gi:109392294;genbank:GeneID:4157089 Probab=21.86 E-value=2.5 Score=18.37 Aligned_cols=281 Identities=12% Similarity=0.057 Sum_probs=102.5 Q ss_pred cccccccccccccccc--ccccccccccccCccccccccccccccccccccccccchhhhccccCCCCCCcccccceeEE Q lcl|Aclame:pro 173 YKAEVTTSGGDVSMRY--FLALGAVTLAVAGQMTATEYTDGVAGGLLVEIDAGMATSQAELQENFNGSSNNEWNEMSFRI 250 (514) Q Consensus 173 ~~~~~~~~~g~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~a~~~~y~~~~GmtTa~aEal~~~ggs~~~~f~EMsFsI 250 (514) .. ..+..++..-... .......-....-........ .......+..- +...++- + -..+.++++...++ T Consensus 1 Ma-t~tt~~g~~vP~~~~~~ii~~~~~~s~l~~~~~~i~-~~~~~~~~p~~----~~~~~a~--w-v~Eg~~~~~~~~~f 71 (311) T protein:vir:99 1 MA-TFGTGNLKNLPRNIADGMVKDVVQGSTVAVLSARKP-QRFGNEDIITF----NGRPKAE--F-VGEGQQKSSTTGEF 71 (311) T ss_pred Cc-eecCCCceeccHHHHHHHHHHHHhhchhhhhcceee-ccCCceEEEEE----eCCceeE--E-eecCccccccccee Confidence 11 1111111100000 000000000000000000000 00000011000 0111100 0 01145677777888 Q ss_pred EEEEEEeecccccccccHHHHHHHHhhcCCChhHHHHHHHHHHHHHHhhHHHHHHHhhheeecccccccccC-Ccceecc Q lcl|Aclame:pro 251 DKQVVEAKSRQLKAQYSIELAQDLRAVHGLDADAELSGILANEVMVELNREIVNLVNSQAQIGKSGWTQGAG-AAGVFDF 329 (514) Q Consensus 251 EK~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAEaELanILStEImlEINReii~~l~~~a~v~~~~~~~~v~-~~g~~dl 329 (514) ++++..+|.-+-....|-||.|+-.- -..|-+++|.+-|...|+..|++.+|.-....--..-.+..+-.. ......+ T Consensus 72 ~~v~l~~~k~~~~~~iS~ell~~~~d-~~~~l~~~i~~~la~ai~~~~d~~~l~G~g~~~g~~~~g~~~~~~~~~~~~~~ 150 (311) T protein:vir:99 72 DFVTSTPKKAQVTMRFNEEVQWADED-YQLGVLQTLSEAGAEALARALDLGLYHRINPLTGTVIPGWSNYLGAASKRVEL 150 (311) T ss_pred eEEEEeeEEEEEeehhhHHHhhcccc-cHHHHHHHHHHHHHHHHHHHHHHHhhcccCcccCccccccccccccccceeec Confidence 88999888888888999999763221 124458888888888888888888854321000000000000000 0011111 Q ss_pred ccccccccchhhHHHHHHHHHHHHHHHHHHHHhcccccccEEEEChhHHhHHhhccccccchhccccCccccccccCceE Q lcl|Aclame:pro 330 SDAVDVKGARWAGEAYKALLIQIEKEANEIGRQTGRGNGNFIIASRNVVSALSMTDTLVGPAAQGMQDGSMNTDTNQTVF 409 (514) Q Consensus 330 ~~~~d~~~~rwa~e~~r~L~~~i~~~a~~I~~~T~r~~~n~~v~S~~va~~L~~~g~~~~~~~~~~~~~~~~~d~~~~~~ 409 (514) .... .-.+..-|+.+...+...-.++..+..|++|+....|.... + ..|.. -...+..+. - T Consensus 151 ~~~~-----------~~~~~~~i~~~~~~~~~~~~~~~~~~~vmn~~~~~~L~~lk--d---~~G~~--l~~~~~~~~-~ 211 (311) T protein:vir:99 151 TADT-----------IANPDLAIEAAVGLLVANGHPTPVNGLALHPSIAWGLSTAR--Y---TDGRK--KFPELGLGI-G 211 (311) T ss_pred cccc-----------cchhHHHHHHHHHHHhhhccCCCccEEEEcHHHHHHHHhhh--c---cCCCe--eecCcccCC-C Confidence 1100 01111223333333322233456667899999998886421 1 00000 000010000 0 Q ss_pred EEEecCceEEEecCCCc----------------cceEEEEEecCCCcccceeecccccc--ccccccCCcccc-----ce Q lcl|Aclame:pro 410 AGVLGGRFKVYIDQYAV----------------NDYFTVGFKGSTEMDAGVFYSPYVPL--TPLRGSDSKNFQ-----PV 466 (514) Q Consensus 410 ~G~l~~~~~vy~D~y~~----------------~dy~~vG~kG~~~~~~~~fy~PYv~~--~~~~~~dp~s~q-----p~ 466 (514) .++|.| ++|++..+-+ .+++++|= ...++.|.-.-.. ...+.-|++... -- T Consensus 212 ~~~l~G-~Pv~~s~~i~~~~~~~~~~~~~~~~~~~~~~~Gd-----f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~ 285 (311) T protein:vir:99 212 VSSFEG-IDASVSDTVNGGDEADPDDEDLDAARAVRGIVGD-----FANGIHWGVQRDIPVELIKYGDPDGQGDLKRHNQ 285 (311) T ss_pred Cceecc-eeeEeecccccccccccccchhhccCcceEEEee-----ccccEEEEEecCceEEEeecCCCCcchhhhhcCc Confidence 134444 4888766433 12223221 0011222211111 111111233211 12 Q ss_pred eee--eeeeeeee-cC-ccccccCcc Q lcl|Aclame:pro 467 IGF--KTRYGVQV-NP-FADPTASAT 488 (514) Q Consensus 467 ~~~--~tRY~l~~-nP-f~~~~~~~~ 488 (514) ++| ..||+..+ || |....+..+ T Consensus 286 ~~~r~~~r~d~~v~~~~~v~~~~~~A 311 (311) T protein:vir:99 286 IALRLEIVYGWYVFTDRFVVIENAVA 311 (311) T ss_pred EEEEEEEeecceecChhHeeeecccC Confidence 333 57888643 33 332222211 Done!