Query lcl|Aclame:protein:vir:106998|NCBI_annot:major capsid protein gp23|genbank:acc:YP_195142;genbank:gi:58532919;uniprot:Q5GQN0;genbank:GeneID:3260495 Match_columns 468 No_of_seqs 170 out of 434 Neff 4.8 Searched_HMMs 1612 Date Sun Dec 1 09:05:26 2013 Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_215 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_215_vs_rec_db.hhr No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 protein:vir:106998 Length: 468 100.0 5E-251 3E-254 1392.8 37.8 468 1-468 1-468 (468) 2 protein:vir:104915 Length: 470 100.0 2E-235 1E-238 1307.8 36.3 456 1-468 3-470 (470) 3 protein:vir:104549 Length: 462 100.0 3E-232 2E-235 1289.6 35.5 451 2-468 1-462 (462) 4 protein:vir:103181 Length: 457 100.0 2E-230 1E-233 1280.3 35.6 446 2-468 1-457 (457) 5 protein:vir:106286 Length: 534 100.0 4E-225 2E-228 1250.9 35.6 456 2-467 1-534 (534) 6 protein:vir:101039 Length: 529 100.0 2E-222 1E-225 1235.6 34.5 455 1-467 2-529 (529) 7 protein:vir:6901 Length: 522 # 100.0 1E-221 7E-225 1232.0 34.6 458 1-467 4-522 (522) 8 protein:vir:101811 Length: 529 100.0 2E-221 1E-224 1230.2 34.8 460 1-467 2-529 (529) 9 protein:vir:7214 Length: 521 # 100.0 3E-221 2E-224 1229.2 34.1 459 1-467 3-521 (521) 10 protein:vir:103463 Length: 521 100.0 4E-221 2E-224 1228.9 34.0 459 1-467 3-521 (521) 11 protein:vir:6601 Length: 528 # 100.0 6E-221 4E-224 1228.0 34.3 460 1-467 1-528 (528) 12 protein:vir:80986 Length: 528 100.0 2E-220 1E-223 1225.4 34.8 460 1-467 1-528 (528) 13 protein:vir:98143 Length: 524 100.0 7E-220 4E-223 1222.0 34.6 458 1-467 1-524 (524) 14 protein:vir:5670 Length: 514 # 100.0 2E-219 1E-222 1219.3 33.6 455 5-467 1-514 (514) 15 protein:vir:100603 Length: 529 100.0 1E-218 8E-222 1215.0 34.6 456 1-467 2-529 (529) 16 protein:vir:107947 Length: 519 100.0 1E-216 6E-220 1204.7 35.1 456 2-467 1-519 (519) 17 protein:vir:5942 Length: 523 # 100.0 2E-194 1E-197 1082.4 31.6 405 1-468 1-522 (523) 18 protein:vir:1886 Length: 385 # 96.8 0.00033 2E-07 39.7 21.4 333 1-445 1-385 (385) 19 protein:vir:191 Length: 385 # 96.8 0.00033 2E-07 39.7 21.4 333 1-445 1-385 (385) 20 protein:vir:79987 Length: 415 96.8 0.00036 2.2E-07 39.5 18.8 343 1-455 1-415 (415) 21 protein:vir:81100 Length: 415 96.8 0.00036 2.2E-07 39.5 18.8 343 1-455 1-415 (415) 22 protein:vir:98339 Length: 415 96.8 0.00036 2.2E-07 39.5 18.8 343 1-455 1-415 (415) 23 protein:vir:9410 Length: 415 # 95.9 0.0013 7.9E-07 36.5 17.8 343 1-455 28-415 (415) 24 protein:vir:4953 Length: 397 # 95.4 0.0022 1.4E-06 35.1 18.3 331 1-451 1-397 (397) 25 protein:vir:4600 Length: 415 # 95.3 0.0022 1.4E-06 35.1 17.9 343 1-455 28-415 (415) 26 protein:vir:4700 Length: 415 # 95.3 0.0022 1.4E-06 35.1 17.9 343 1-455 28-415 (415) 27 protein:vir:4830 Length: 397 # 95.3 0.0024 1.5E-06 35.0 19.0 332 1-453 1-397 (397) 28 protein:vir:7409 Length: 408 # 94.5 0.0044 2.7E-06 33.5 21.1 331 1-451 4-408 (408) 29 protein:vir:9574 Length: 300 # 94.4 0.0046 2.9E-06 33.4 17.4 281 69-444 1-300 (300) 30 protein:vir:78523 Length: 338 94.2 0.005 3.1E-06 33.2 17.5 309 32-448 1-338 (338) 31 protein:vir:41 Length: 299 # N 93.6 0.007 4.3E-06 32.4 18.6 279 58-456 1-299 (299) 32 protein:vir:4092 Length: 390 # 93.4 0.0076 4.7E-06 32.2 18.0 345 1-448 1-390 (390) 33 protein:vir:4997 Length: 397 # 93.2 0.0084 5.2E-06 32.0 21.0 333 1-455 1-397 (397) 34 protein:vir:7771 Length: 330 # 92.7 0.01 6.3E-06 31.5 18.8 294 49-450 1-330 (330) 35 protein:vir:3845 Length: 395 # 92.0 0.013 8.2E-06 30.9 19.1 332 2-453 1-395 (395) 36 protein:vir:8420 Length: 477 # 91.8 0.014 8.9E-06 30.7 20.0 366 1-459 66-477 (477) 37 protein:vir:3033 Length: 272 # 91.3 0.017 1E-05 30.3 17.0 268 117-454 1-272 (272) 38 protein:vir:9820 Length: 272 # 91.3 0.017 1E-05 30.3 17.0 268 117-454 1-272 (272) 39 protein:vir:1433 Length: 435 # 90.8 0.019 1.2E-05 30.0 21.1 339 2-453 1-435 (435) 40 protein:vir:99749 Length: 324 90.6 0.02 1.2E-05 29.9 17.9 299 32-450 1-324 (324) 41 protein:vir:94142 Length: 304 90.4 0.021 1.3E-05 29.8 17.0 274 63-442 1-304 (304) 42 protein:vir:105905 Length: 304 90.4 0.021 1.3E-05 29.8 17.0 274 63-442 1-304 (304) 43 protein:vir:93742 Length: 274 89.9 0.024 1.5E-05 29.5 15.6 261 117-451 1-274 (274) 44 protein:vir:96262 Length: 274 89.2 0.028 1.7E-05 29.1 13.4 263 107-455 1-274 (274) 45 protein:vir:95898 Length: 274 89.2 0.028 1.7E-05 29.1 13.4 263 107-455 1-274 (274) 46 protein:vir:4339 Length: 395 # 89.0 0.029 1.8E-05 29.0 20.5 334 1-467 1-395 (395) 47 protein:vir:104085 Length: 320 89.0 0.029 1.8E-05 29.0 16.7 295 43-448 1-320 (320) 48 protein:vir:96123 Length: 274 87.6 0.038 2.3E-05 28.4 14.5 256 121-446 1-274 (274) 49 protein:vir:1638 Length: 298 # 86.5 0.045 2.8E-05 28.0 18.0 277 71-456 1-298 (298) 50 protein:vir:104256 Length: 458 86.1 0.048 3E-05 27.8 18.8 341 1-445 81-458 (458) 51 protein:vir:10364 Length: 390 85.8 0.05 3.1E-05 27.7 17.2 326 1-465 30-390 (390) 52 protein:vir:103955 Length: 324 85.4 0.053 3.3E-05 27.6 18.2 299 23-450 1-324 (324) 53 protein:vir:100172 Length: 394 85.3 0.054 3.3E-05 27.5 15.2 323 1-461 30-394 (394) 54 protein:vir:6212 Length: 434 # 84.1 0.063 3.9E-05 27.2 16.5 345 1-449 58-434 (434) 55 protein:vir:9704 Length: 394 # 83.3 0.07 4.3E-05 26.9 18.4 325 1-447 30-394 (394) 56 protein:vir:9759 Length: 303 # 83.2 0.07 4.4E-05 26.9 16.3 278 69-446 1-303 (303) 57 protein:vir:3991 Length: 404 # 82.6 0.075 4.7E-05 26.7 19.9 333 1-449 4-404 (404) 58 protein:vir:81227 Length: 413 82.4 0.077 4.8E-05 26.7 19.7 339 1-446 31-413 (413) 59 protein:vir:1383 Length: 421 # 82.1 0.079 4.9E-05 26.6 18.7 340 2-468 1-414 (421) 60 protein:vir:9309 Length: 324 # 81.9 0.082 5.1E-05 26.5 19.2 297 35-449 1-324 (324) 61 protein:vir:81070 Length: 390 80.9 0.09 5.6E-05 26.3 19.5 323 1-465 30-390 (390) 62 protein:vir:2430 Length: 318 # 80.3 0.096 6E-05 26.2 17.4 295 43-448 1-318 (318) 63 protein:vir:99920 Length: 311 80.1 0.098 6.1E-05 26.1 17.1 284 69-450 1-311 (311) 64 protein:vir:2504 Length: 305 # 78.3 0.12 7.2E-05 25.7 17.4 282 69-455 1-305 (305) 65 protein:vir:105038 Length: 428 74.1 0.16 0.0001 24.9 16.8 340 1-451 30-428 (428) 66 protein:vir:1025 Length: 408 # 72.6 0.18 0.00011 24.7 21.5 332 1-451 4-408 (408) 67 protein:vir:94494 Length: 274 71.9 0.19 0.00012 24.5 14.7 261 117-455 1-274 (274) 68 protein:vir:97433 Length: 274 71.9 0.19 0.00012 24.5 14.7 261 117-455 1-274 (274) 69 protein:vir:105334 Length: 276 70.4 0.21 0.00013 24.3 14.3 268 121-457 1-276 (276) 70 protein:vir:8187 Length: 311 # 67.6 0.25 0.00016 23.9 18.4 287 69-444 1-311 (311) 71 protein:vir:1084 Length: 437 # 67.5 0.25 0.00016 23.9 15.5 328 1-457 65-437 (437) 72 protein:vir:4856 Length: 293 # 66.8 0.26 0.00016 23.8 19.0 276 50-462 1-293 (293) 73 protein:vir:4511 Length: 409 # 65.7 0.28 0.00017 23.6 18.5 324 1-444 41-409 (409) 74 protein:vir:102119 Length: 404 65.6 0.28 0.00017 23.6 19.3 324 1-447 27-404 (404) 75 protein:vir:100135 Length: 418 64.5 0.3 0.00019 23.5 18.7 338 1-448 35-418 (418) 76 protein:vir:81160 Length: 371 62.3 0.34 0.00021 23.2 19.8 321 1-467 22-371 (371) 77 protein:vir:80376 Length: 435 61.0 0.36 0.00022 23.0 19.4 344 1-453 41-435 (435) 78 protein:vir:7855 Length: 497 # 59.8 0.38 0.00024 22.9 20.7 355 1-449 53-497 (497) 79 protein:vir:101650 Length: 497 59.8 0.38 0.00024 22.9 20.7 355 1-449 53-497 (497) 80 protein:vir:96223 Length: 324 57.4 0.44 0.00027 22.6 18.6 297 23-449 1-324 (324) 81 protein:vir:3870 Length: 400 # 56.3 0.46 0.00029 22.4 18.9 324 1-455 41-400 (400) 82 protein:vir:4226 Length: 326 # 56.2 0.46 0.00029 22.4 16.7 299 35-454 1-326 (326) 83 protein:vir:1239 Length: 274 # 56.1 0.46 0.00029 22.4 14.3 260 117-455 1-274 (274) 84 protein:vir:107593 Length: 392 55.5 0.48 0.0003 22.3 18.3 317 1-448 35-392 (392) 85 protein:vir:102082 Length: 392 55.5 0.48 0.0003 22.3 18.3 317 1-448 35-392 (392) 86 protein:vir:105004 Length: 392 55.5 0.48 0.0003 22.3 18.3 317 1-448 35-392 (392) 87 protein:vir:102873 Length: 392 55.5 0.48 0.0003 22.3 18.3 317 1-448 35-392 (392) 88 protein:vir:97148 Length: 324 54.0 0.51 0.00032 22.2 19.0 297 36-450 1-324 (324) 89 protein:vir:96392 Length: 324 52.9 0.54 0.00034 22.0 18.0 298 36-451 1-324 (324) 90 protein:vir:78830 Length: 324 52.9 0.54 0.00034 22.0 18.0 298 36-451 1-324 (324) 91 protein:vir:100884 Length: 389 52.8 0.54 0.00034 22.0 18.5 330 1-453 33-389 (389) 92 protein:vir:80684 Length: 315 50.9 0.6 0.00037 21.8 17.8 280 69-450 1-315 (315) 93 protein:vir:80930 Length: 278 49.8 0.63 0.00039 21.7 16.9 275 107-457 1-278 (278) 94 protein:vir:95763 Length: 297 49.6 0.63 0.00039 21.7 16.9 273 55-447 1-297 (297) 95 protein:vir:101607 Length: 379 48.4 0.67 0.00042 21.5 20.1 329 2-467 1-379 (379) 96 protein:vir:739 Length: 231 # 47.6 0.7 0.00043 21.4 12.1 218 161-467 1-231 (231) 97 protein:vir:5739 Length: 366 # 45.9 0.75 0.00047 21.3 16.8 336 1-451 3-366 (366) 98 protein:vir:78223 Length: 333 45.8 0.76 0.00047 21.2 19.0 311 47-446 1-333 (333) 99 protein:vir:2344 Length: 397 # 45.4 0.77 0.00048 21.2 17.3 302 58-468 1-347 (397) 100 protein:vir:6242 Length: 390 # 44.8 0.79 0.00049 21.1 18.3 336 1-455 3-390 (390) 101 protein:vir:94424 Length: 387 40.2 0.98 0.00061 20.6 17.3 323 1-447 1-387 (387) 102 protein:vir:2685 Length: 387 # 40.2 0.98 0.00061 20.6 17.3 323 1-447 1-387 (387) 103 protein:vir:96978 Length: 387 40.2 0.98 0.00061 20.6 17.3 323 1-447 1-387 (387) 104 protein:vir:94711 Length: 347 39.2 1 0.00064 20.5 14.6 297 107-449 1-347 (347) 105 protein:vir:94771 Length: 298 34.8 1.3 0.00079 20.0 18.8 274 71-456 1-298 (298) 106 protein:vir:4456 Length: 401 # 34.7 1.3 0.00079 20.0 18.0 328 1-467 27-401 (401) 107 protein:vir:8102 Length: 543 # 33.3 1.4 0.00085 19.8 16.6 327 1-450 172-543 (543) 108 protein:vir:9361 Length: 402 # 31.8 1.5 0.00091 19.7 17.3 329 1-447 16-402 (402) 109 protein:vir:95963 Length: 395 30.1 1.6 0.00099 19.5 14.0 349 1-450 1-395 (395) 110 protein:vir:94673 Length: 419 29.6 1.6 0.001 19.4 19.2 351 1-447 32-419 (419) 111 protein:vir:97053 Length: 390 28.3 1.8 0.0011 19.2 18.5 331 1-465 31-390 (390) 112 protein:vir:100247 Length: 425 28.2 1.8 0.0011 19.2 18.5 341 1-468 64-425 (425) 113 protein:vir:96833 Length: 275 27.4 1.8 0.0011 19.1 15.4 261 115-453 1-275 (275) 114 protein:vir:1268 Length: 397 # 25.5 2.1 0.0013 18.9 17.8 321 1-467 52-397 (397) No 1 >protein:vir:106998 Length: 468 # NCBI annotation: major capsid protein gp23 # Family: family:all:364 # MgeID: mge:1459 # MgeName: S-PM2 # Cross-refs: genbank:acc:YP_195142;genbank:gi:58532919;uniprot:Q5GQN0;genbank:GeneID:3260495 Probab=100.00 E-value=5e-251 Score=1392.81 Aligned_cols=468 Identities=100% Similarity=1.465 Sum_probs=461.7 Q ss_pred CcchHHHHHhhhhhhCCCccchhcchhhhHHHHHHHhHHHHHHhhhhhhhhhhhhhhcCccccccccccccccccccccc Q lcl|Aclame:pro 1 MFNAEHLQEKWSPVLNHGEAPAIGDRYKRAVTSVLLENQERFLREERGMLNEVAVNSLGAGTIAPAGSALGSANTGGLAG 80 (468) Q Consensus 1 ~~~~~~l~~kw~p~l~~~~~~~i~~~~~~~~~~~llenq~~~~~~~~~~l~e~~~~~~~~~~~~~~~~i~~st~tg~i~~ 80 (468) |||+|+|+|||+|||||||+|||++.|||+|+++|||||||++++++.+|+|.+++++|++++...++++++++|++|++ T Consensus 1 ~~~~e~l~~kW~plLe~~~~~~i~~~~k~~i~a~llENQe~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~~t~~v~~ 80 (468) T protein:vir:10 1 MFNAEHLQEKWSPVLNHGEAPAIGDRYKRAVTSVLLENQERFLREERGMLNEVAVNSLGAGTIAPAGSALGSANTGGLAG 80 (468) T ss_pred CcchHHHHHhhhHhhcCCccchhccchhhhhhhhhhhhHHHHHhccccccchhhHhhcCCcccchhhhhhhhcccccccc Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred ccceehhhhHHhhhhhhhhheeeeecCCccceeeeeeeeeecCCCCcccccccCCccccccccccccccccccCccccCC Q lcl|Aclame:pro 81 FDPVLISLVRRAMPNLMAYDVCGVQPMSGPTGLIFAMRSRYENQAGEEALFNEPDTGFTGGYDASQGDYAVRTGAGVGGD 160 (468) Q Consensus 81 ~~P~Lv~l~RRa~~~LI~~DI~GVQPmTGPTGLIFAMRsrY~~qsG~EA~fnEa~t~fSg~~~~~~~~~~~~~~~~~~~~ 160 (468) +||+||+||||++|||||+|||||||||||||||||||+||.+|+|+|+||||||++|||.++.......+..+.+..++ T Consensus 81 ~~P~Li~l~RRa~p~LIa~DIwGVQPMTgPTGLIFAmRsrY~n~~g~EAf~nEadt~fSg~~~~~~~~~~~~~~~~~~~~ 160 (468) T protein:vir:10 81 FDPVLISLVRRAMPNLMAYDVCGVQPMSGPTGLIFAMRSRYENQAGEEALFNEPDTGFTGGYDASQGDYAVRTGAGVGGD 160 (468) T ss_pred cCchhhhhHHHHHhhhhhhhceeeecCCccceeeeEEEEEecCCCCccceeccccccccccccccccccccccccccccC Confidence 99999999999999999999999999999999999999999999999999999999999998888888888888888889 Q ss_pred CccccccccccccccccccccccchhhhhccCCCCcchhhcceEEEEEEEEeecccccccccHHHHHHHHHhcCCChhHH Q lcl|Aclame:pro 161 SEGNNPALLNDAAPGTYEVGSKMPREDLERMGEANRLFREMSFSIEKTSVTAQSRALKAEYTLELAQDLKAIHGLDAEQE 240 (468) Q Consensus 161 ~~gt~~~~~~~a~~~~~t~~~gm~Ta~aE~lG~~~~~f~EMaFsIeK~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAE~E 240 (468) +.++++...+.+..+.|+++.||+|+++|.||+++++|+||+|+||||+|||||||||||||||||||||||||||||+| T Consensus 161 ~~g~~~~~~~~a~~~~~~~g~gMsTa~aE~lG~~~~~f~EMaFsIeK~tVtAKSRaLKAeYTiELAQDLKAiHGLDAEtE 240 (468) T protein:vir:10 161 SEGNNPALLNDAAPGTYEVGSKMPREDLERMGEANRLFREMSFSIEKTSVTAQSRALKAEYTLELAQDLKAIHGLDAEQE 240 (468) T ss_pred CCCCcccccccccccccccccccchHHHhhcCCCCcccceeeeEEEEEEEeeeccceeccccHHHHHHHHHhcCCChhHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHhhHHHHHHHhhhhhccccccccccccccccccccchhHHHHHHHHHHHHHHHHHHHHHhhcCCCccEE Q lcl|Aclame:pro 241 LANILSSEVLAEINREVVRRVYTVAKKGAQNNVANAGIFDLDVDSNGRWSVEKFKGLLFQVERDANAIAQETRRGKGNFL 320 (468) Q Consensus 241 LanILStEImlEINREii~~l~~va~~~k~~~~~~~g~~Dl~~~~~grw~~e~~k~L~~~i~~ean~i~~~T~r~~~n~~ 320 (468) |+||||||||+||||||||+||+||+|||++|++++|+|||++++||||++|+||+|+|||+||||+|+|||+||+|||| T Consensus 241 LaNILStEImlEINReii~~l~~va~~~k~~g~~~~Gv~d~~~~~~~rw~~e~~k~L~~~i~~ean~i~~~T~rg~gn~i 320 (468) T protein:vir:10 241 LANILSSEVLAEINREVVRRVYTVAKKGAQNNVANAGIFDLDVDSNGRWSVEKFKGLLFQVERDANAIAQETRRGKGNFL 320 (468) T ss_pred HHHHHHHHHHHHhcHHHHHhHhhhhhheecccccccccccccccccchhHHHHHHHHHHHHHHHHHHHHHhhccccccEE Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEchhHHHHHHhhcccccccccccccccccccccccCceeEEEecCceEEEEcccccccCCcceEEEEEecCCcccceeE Q lcl|Aclame:pro 321 ICSADVASALAMAGVLDYSSGLNGAGGPSIGEVDDTGNLAVGTINGRIKVFVDPYAANLSDKHYYVIGYKGTSPYDAGLF 400 (468) Q Consensus 321 v~S~~Va~~L~~sG~~~~~~~~~~~~~~~~~~~d~t~~~~~G~l~g~~~vy~D~Ya~~~~~~dY~~vG~KG~~~~d~glf 400 (468) ||||+||++|+++|||++.|+++++.+.+++++|+|+++|+|+|+|||+|||||||++++|+||++|||||++++|+||| T Consensus 321 i~S~~Va~~L~~sG~l~~~~~~~~~~~~~~~~~D~tg~~~~G~l~~r~~vy~D~Ya~~~s~~dY~~vG~KG~~~~d~glf 400 (468) T protein:vir:10 321 ICSADVASALAMAGVLDYSSGLNGAGGPSIGEVDDTGNLAVGTINGRIKVFVDPYAANLSDKHYYVIGYKGTSPYDAGLF 400 (468) T ss_pred EechhHHHHHhhcCcceecccccccccccccccccCcceEEEEecCceEEEEccccccCCccceEEEEEecCcceeceee Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred eeccchhhcccccCCccccceeeeeeeeeeeecCcccccCccccccchhhhhhhcccceeeeeeeecC Q lcl|Aclame:pro 401 YCPYVPLQMVRSIDPNTFQPKIGFKTRYGMVSNPFVTTNGLYNGTPDGEALTPNANMYYRRVQVTNLM 468 (468) Q Consensus 401 yaPYv~l~~~~~~dp~s~qP~~g~~tRY~l~~nP~~~~~~~~~~~~~~~~~~~~an~y~~r~~v~~l~ 468 (468) |||||||+|++++||+||||++||||||||++|||++..+.++++|++++|.+++|+|||||+|+||| T Consensus 401 yaPYv~l~~~~~~dp~sfqP~~g~~tRY~l~~NP~~~~~~~~~g~~~~~~~~~~~N~y~r~~~v~~l~ 468 (468) T protein:vir:10 401 YCPYVPLQMVRSIDPNTFQPKIGFKTRYGMVSNPFVTTNGLYNGTPDGEALTPNANMYYRRVQVTNLM 468 (468) T ss_pred eccccccccccccCCCcccceeeeeeeeceeecccceeccccCCCcccccccccccceeeeEEEeccC Confidence 99999999999999999999999999999999999998888999999999999999999999999999 No 2 >protein:vir:104915 Length: 470 # NCBI annotation: T4-like major capsid protein # Family: family:all:364 # MgeID: mge:1630 # MgeName: P-SSM2 # Cross-refs: genbank:acc:YP_214367;genbank:gi:61806007;genbank:GeneID:3294435 Probab=100.00 E-value=1.6e-235 Score=1307.79 Aligned_cols=456 Identities=61% Similarity=0.981 Sum_probs=419.4 Q ss_pred CcchHHHHHhhhhhhCCCccchhcchhhhHHHHHHHhHHHHHHhhhhhhhhhhhhhhcCccccccccccccccccccccc Q lcl|Aclame:pro 1 MFNAEHLQEKWSPVLNHGEAPAIGDRYKRAVTSVLLENQERFLREERGMLNEVAVNSLGAGTIAPAGSALGSANTGGLAG 80 (468) Q Consensus 1 ~~~~~~l~~kw~p~l~~~~~~~i~~~~~~~~~~~llenq~~~~~~~~~~l~e~~~~~~~~~~~~~~~~i~~st~tg~i~~ 80 (468) |+++|+|+|||+|||||||+|||++.|||+|+++|||||+++++|++++|+|++ .++++|++++++|+|||+|++|++ T Consensus 3 ~~~~e~l~~kw~p~l~~~~~~~i~~~~~~~v~a~l~enq~~~~~~~~~~l~e~~--~~~~~~~~~~~~i~~st~t~~v~~ 80 (470) T protein:vir:10 3 MFNSEYLQEKWAPILDYDGLDPIKDSHRRSVTAVLLENQEKELREERNFLSEAP--NVNTNSGATAGFSADATAAGPVAG 80 (470) T ss_pred cchhHHHHHhhhhhhcCCccchhcchhhhhhhhhhhhhhHHHHhhccchhhhhh--hccccccccccccccccccccccc Confidence 999999999999999999999999999999999999999999999999999986 689999999999999999999999 Q ss_pred ccceehhhhHHhhhhhhhhheeeeecCCccceeeeeeeeeecCCCCcccccccCCccccccccccccccccc-----cCc Q lcl|Aclame:pro 81 FDPVLISLVRRAMPNLMAYDVCGVQPMSGPTGLIFAMRSRYENQAGEEALFNEPDTGFTGGYDASQGDYAVR-----TGA 155 (468) Q Consensus 81 ~~P~Lv~l~RRa~~~LI~~DI~GVQPmTGPTGLIFAMRsrY~~qsG~EA~fnEa~t~fSg~~~~~~~~~~~~-----~~~ 155 (468) |||+||+||||++|||||+|||||||||||||||||||+||.+|+|+|+||+||++.|||...+........ ... T Consensus 81 ~~P~Li~lvRra~p~LIa~DIwGVQPMTgPTGLIFAmRsrY~n~sG~EaffnEA~T~fSG~~~~~~~~~~~~~~~a~~~g 160 (470) T protein:vir:10 81 FDPVLISLIRRSMPNLVAYDLAGVQPMNGPTGLIFAMRSRYKTQSGTEALFNEADTAFSGQPDGLDDTSGFTATGANNVG 160 (470) T ss_pred cCchhhhhHHHHHhhhhhhhhheeecCCccceeeeEEEEEecCCCccceeeecCCcccCccccccccccccccccccccc Confidence 999999999999999999999999999999999999999999999999999999999999765543322211 111 Q ss_pred cccCCCcccccccc----ccccccccccccccchhhhhccCCC-CcchhhcceEEEEEEEEeecccccccccHHHHHHHH Q lcl|Aclame:pro 156 GVGGDSEGNNPALL----NDAAPGTYEVGSKMPREDLERMGEA-NRLFREMSFSIEKTSVTAQSRALKAEYTLELAQDLK 230 (468) Q Consensus 156 ~~~~~~~gt~~~~~----~~a~~~~~t~~~gm~Ta~aE~lG~~-~~~f~EMaFsIeK~tVtAKSRaLKAEYT~ELAQDLk 230 (468) .......+++++.. ..+....|+++.||+|+++|.||+. +++|+||+|+||||+||||||||||||||||||||| T Consensus 161 ~~~~~~~gt~~~~~~~~~~~a~~~~y~~~~GMsTa~aE~lg~s~~~~f~EMaFsIeK~tVtAKSRaLKAeYTiELAQDLK 240 (470) T protein:vir:10 161 LGTTAQQGSNPGLLNSTAAQTNATDYNVGQGMRTDSAEDLGDGTGDQFNQMAFSIEKVTVTAKSRALKAEYSLELAQDLK 240 (470) T ss_pred ccccccccccccccccccccccccccccccccchHHhhhcCCCCCcccceeeeEEEEEEEEeeccceeccccHHHHHHHH Confidence 11222334444433 3334566899999999999999964 678999999999999999999999999999999999 Q ss_pred HhcCCChhHHHHHHHHHHHHHHhhHHHHHHHhhhhhccccccccccccccccccccchhHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 231 AIHGLDAEQELANILSSEVLAEINREVVRRVYTVAKKGAQNNVANAGIFDLDVDSNGRWSVEKFKGLLFQVERDANAIAQ 310 (468) Q Consensus 231 AiHGLDAE~ELanILStEImlEINREii~~l~~va~~~k~~~~~~~g~~Dl~~~~~grw~~e~~k~L~~~i~~ean~i~~ 310 (468) ||||||||+||+||||||||+||||||||+||++|+|||+.+++++|+|||+++++|||++|+||.|+|||+||||+|+| T Consensus 241 AiHGLDAEtELaNILStEImlEINReii~~l~~~a~~~k~~~~~~~Gv~Dl~~~~~gr~~~e~~~~l~~~i~~ean~i~~ 320 (470) T protein:vir:10 241 AIHGLNAEAELANILSTEILAEINREVIRTIYNVAEPGAQANVAAAGTFDLDTDSNGRWSVEKFKGLIFQIERDANAIAQ 320 (470) T ss_pred HhcCCChhHHHHHHHHHHHHHHhcHHHHHHHhhhhhhceeccccccceEEeecccchhHHHHHHHHHHHHHHHHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hhcCCCccEEEEchhHHHHHHhhcccccccccccccccccccccccCceeEEEecCceEEEEccccc--ccCCcceEEEE Q lcl|Aclame:pro 311 ETRRGKGNFLICSADVASALAMAGVLDYSSGLNGAGGPSIGEVDDTGNLAVGTINGRIKVFVDPYAA--NLSDKHYYVIG 388 (468) Q Consensus 311 ~T~r~~~n~~v~S~~Va~~L~~sG~~~~~~~~~~~~~~~~~~~d~t~~~~~G~l~g~~~vy~D~Ya~--~~~~~dY~~vG 388 (468) ||+||+||||||||+||++|+|||||++.|+.+++ +++|+|+++|+|+|+|||+||||||+. +++++||++|| T Consensus 321 ~t~r~~~n~~i~S~~Va~~La~sG~l~~~~~~~~~-----~~~D~t~~~~~G~l~~~~~vy~d~y~~~~~~a~~dy~~vG 395 (470) T protein:vir:10 321 RTRRGKGNMILCSADVASALTMAGVLDYTPALNAN-----LNVDDTGNTFAGILQGKYRVYIDPFSASGGAAATQYYVVG 395 (470) T ss_pred hhccccceEEEEchhHHhHhhhccccccccccccc-----cccCCCCceEEEEecCceEEEeeccccccCcccccEEEEE Confidence 99999999999999999999999999999998864 689999999999999999999999987 58999999999 Q ss_pred EecCCcccceeEeeccchhhcccccCCccccceeeeeeeeeeeecCcccccCccccccchhhhhhhcccceeeeeeeecC Q lcl|Aclame:pro 389 YKGTSPYDAGLFYCPYVPLQMVRSIDPNTFQPKIGFKTRYGMVSNPFVTTNGLYNGTPDGEALTPNANMYYRRVQVTNLM 468 (468) Q Consensus 389 ~KG~~~~d~glfyaPYv~l~~~~~~dp~s~qP~~g~~tRY~l~~nP~~~~~~~~~~~~~~~~~~~~an~y~~r~~v~~l~ 468 (468) |||++++|+||||||||||++++++||+||||++||||||||++|||++..++..+. ..++.|.|||||+|+||| T Consensus 396 ~KG~~~~~~glfy~PYv~l~~~~~~dp~sfqP~~g~~tRY~l~~NP~~~~~~~~~~~-----i~~~~n~y~r~~~v~~l~ 470 (470) T protein:vir:10 396 YKGSSPYDAGLFYCPYVPLQMVRAVGQDTFQPKIGFKTRYGLVENPFSQGTTQGLGT-----LTRNSNRYYRRVKVANLM 470 (470) T ss_pred EecCcceecceeeccccccccCCCCCCccccceeeeeeeeceeecCcccCCCccccc-----ccCCCCceeeEEEeeccC Confidence 999999999999999999999999999999999999999999999999766554332 345788899999999999 No 3 >protein:vir:104549 Length: 462 # NCBI annotation: gp23 # Family: family:all:364 # MgeID: mge:1548 # MgeName: P-SSM4 # Cross-refs: genbank:acc:YP_214669;genbank:gi:61806310;genbank:GeneID:3294604 Probab=100.00 E-value=3.3e-232 Score=1289.65 Aligned_cols=451 Identities=66% Similarity=1.036 Sum_probs=418.8 Q ss_pred cchHHHHHhhhhhhCCCccchhcchhhhHHHHHHHhHHHHHHhhhhhhhhhhhhhhcCcccccccccccccccccccccc Q lcl|Aclame:pro 2 FNAEHLQEKWSPVLNHGEAPAIGDRYKRAVTSVLLENQERFLREERGMLNEVAVNSLGAGTIAPAGSALGSANTGGLAGF 81 (468) Q Consensus 2 ~~~~~l~~kw~p~l~~~~~~~i~~~~~~~~~~~llenq~~~~~~~~~~l~e~~~~~~~~~~~~~~~~i~~st~tg~i~~~ 81 (468) |+.|+|+|||+|||||||+|+|++.+||+|+++|||||||+++|++.+|+|++ .+|++. .++++|++++++ T Consensus 1 ms~~~l~~~w~~~l~~~~~~~i~~~~~~~~~~~~~enq~~~~~~~~~~l~ea~-----~~~g~~----~~~~~t~~~~~~ 71 (462) T protein:vir:10 1 MSIQQLQEKWAPVLNHESVPEIKDSYKKGVVAQLLENQENAIREEGQVLNETL-----QTTGYT----TGDTATGPVAGF 71 (462) T ss_pred CchHHHHHHhhhhhcccccchhhhhhHHHHHHHHhhhHHHHHHhcccchhccc-----cccCCC----cCcccccccccc Confidence 88999999999999999999999999999999999999999999999999985 445544 578889999999 Q ss_pred cceehhhhHHhhhhhhhhheeeeecCCccceeeeeeeeeecC------CCCcccccccCCccccccccccccccccccCc Q lcl|Aclame:pro 82 DPVLISLVRRAMPNLMAYDVCGVQPMSGPTGLIFAMRSRYEN------QAGEEALFNEPDTGFTGGYDASQGDYAVRTGA 155 (468) Q Consensus 82 ~P~Lv~l~RRa~~~LI~~DI~GVQPmTGPTGLIFAMRsrY~~------qsG~EA~fnEa~t~fSg~~~~~~~~~~~~~~~ 155 (468) ||+||+||||++|||||+|||||||||||||||||||+||.+ |+|+||||||+|+.|||.............+. T Consensus 72 ~P~Li~l~Rra~p~LIa~DIwGVQPMTgPTGLIFAmRsrY~~~~~~~nq~gtEAlfnEadt~fSg~~~~~~~~~~~~~~~ 151 (462) T protein:vir:10 72 DPVLISLIRRSMPQLIAYDVAGVQPMTGPTGLIFAMRSFYGSERRPANSDFREALFNEPNAGFSGGAGTGLSNYDPTASS 151 (462) T ss_pred cchhhhHHHHHHhhhhhhcceeeecCCcchhhhheeeeeccCCccccccccchhhhccCCcCcccccccccccccccccc Confidence 999999999999999999999999999999999999999975 57899999999999999877666555555555 Q ss_pred cccCCCccccccccccccccccc---cccccchhhhhccCCC--CcchhhcceEEEEEEEEeecccccccccHHHHHHHH Q lcl|Aclame:pro 156 GVGGDSEGNNPALLNDAAPGTYE---VGSKMPREDLERMGEA--NRLFREMSFSIEKTSVTAQSRALKAEYTLELAQDLK 230 (468) Q Consensus 156 ~~~~~~~gt~~~~~~~a~~~~~t---~~~gm~Ta~aE~lG~~--~~~f~EMaFsIeK~tVtAKSRaLKAEYT~ELAQDLk 230 (468) ....+..++++...++...+.++ .+.||+|+++|.||++ +++|+||+|+||||+||||||||||||||||||||| T Consensus 152 ~~~~~~~g~~~~~~~~~~~g~~~~~~~~~GM~Ta~aE~lg~~s~n~~f~EMaFsIeK~tVtAKSRaLKAEYTiELAQDLK 231 (462) T protein:vir:10 152 SAVNDAEGANPGLLNDSPAGTYEVTGDATGMATATAEALDDSSASTAFREMGFSIEKVTVTAKSRALKAEYSIEMAQDLK 231 (462) T ss_pred ccccccccccceeecCCCccceecccccccccchhccccCCccCCcchhhceeEEEEEEEeeeccceeccccHHHHHHHH Confidence 56677788888888887777765 4579999999999953 568999999999999999999999999999999999 Q ss_pred HhcCCChhHHHHHHHHHHHHHHhhHHHHHHHhhhhhccccccccccccccccccccchhHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 231 AIHGLDAEQELANILSSEVLAEINREVVRRVYTVAKKGAQNNVANAGIFDLDVDSNGRWSVEKFKGLLFQVERDANAIAQ 310 (468) Q Consensus 231 AiHGLDAE~ELanILStEImlEINREii~~l~~va~~~k~~~~~~~g~~Dl~~~~~grw~~e~~k~L~~~i~~ean~i~~ 310 (468) ||||||||+||+||||||||+||||||||+||++|+|||+.+++++|+|||+++++|||++|+||+|+|||+||||+|+| T Consensus 232 AIHGLDAEtELaNILSTEImlEINReii~~l~~~a~~~k~~~~~~~Gv~dl~~~~~gr~~~e~~k~l~~qi~~ean~i~~ 311 (462) T protein:vir:10 232 AIHGLDAESELANILSTEILAEINREVVRTIYVNAVKGAIANTATDGIFDLDVDSNGRWSVEKFKGLLFQIERDSNAIGQ 311 (462) T ss_pred HhcCCChhHHHHHHHHHHHHHHhhHHHHhhhhhhheeeecccccccceeeeccccchHHHHHHHHHHHHHHHHHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hhcCCCccEEEEchhHHHHHHhhcccccccccccccccccccccccCceeEEEecCceEEEEcccccccCCcceEEEEEe Q lcl|Aclame:pro 311 ETRRGKGNFLICSADVASALAMAGVLDYSSGLNGAGGPSIGEVDDTGNLAVGTINGRIKVFVDPYAANLSDKHYYVIGYK 390 (468) Q Consensus 311 ~T~r~~~n~~v~S~~Va~~L~~sG~~~~~~~~~~~~~~~~~~~d~t~~~~~G~l~g~~~vy~D~Ya~~~~~~dY~~vG~K 390 (468) ||+||+|||||||||||++|+|||||+++|+.+++.+. .++|+++.+|+|+|+|||+||||||+.+|+|+||++|||| T Consensus 312 ~t~r~~~n~~i~S~~Va~~La~sG~l~~~p~~~~~~~~--~~~d~~~~~~~G~l~~r~~vy~D~Y~~~ns~~dy~~vG~K 389 (462) T protein:vir:10 312 ETRRGKGNILICSADVASALGMAGVLDYAPGLQGNSAL--TGVDDTSSTLVGTLNGRIKVYVDPYSSNVADKHFYVAGYK 389 (462) T ss_pred HhccccceEEEEchhHHHHhhhccchhccccccccccc--cccccccceeEEEecCceEEEEecccCCCcccceEEEEEe Confidence 99999999999999999999999999999998887653 4899999999999999999999999999999999999999 Q ss_pred cCCcccceeEeeccchhhcccccCCccccceeeeeeeeeeeecCcccccCccccccchhhhhhhcccceeeeeeeecC Q lcl|Aclame:pro 391 GTSPYDAGLFYCPYVPLQMVRSIDPNTFQPKIGFKTRYGMVSNPFVTTNGLYNGTPDGEALTPNANMYYRRVQVTNLM 468 (468) Q Consensus 391 G~~~~d~glfyaPYv~l~~~~~~dp~s~qP~~g~~tRY~l~~nP~~~~~~~~~~~~~~~~~~~~an~y~~r~~v~~l~ 468 (468) |++++|+||||||||||++++++||+||||++||||||||++|||++..++..+ ...++.|.|||||+|+||| T Consensus 390 G~~~~~~glfy~PYv~l~~~~~~dp~sfqP~~g~~tRY~l~~NP~t~~~~~~~~-----~~~~~~n~y~r~~~v~~l~ 462 (462) T protein:vir:10 390 GTSPYDAGLFYCPYVPLQQVRAINPNTFQPKIGFKTRYGMVSNPFSGGLTQGSG-----ALTANANKYYRRVQVANLM 462 (462) T ss_pred CCcccccceeeccccccccccccCCccccceeeeeeeeeeeecCCCCCcCCccc-----cccccCcceeeeEEeeccC Confidence 999999999999999999999999999999999999999999999876554332 3456888999999999999 No 4 >protein:vir:103181 Length: 457 # NCBI annotation: gp135 # Family: family:all:364 # MgeID: mge:1583 # MgeName: Syn9 # Cross-refs: genbank:acc:YP_717802;genbank:gi:113200639;genbank:GeneID:4239190 Probab=100.00 E-value=1.7e-230 Score=1280.29 Aligned_cols=446 Identities=66% Similarity=1.052 Sum_probs=410.4 Q ss_pred cchHHHHHhhhhhhCCCccchhcchhhhHHHHHHHhHHHHHHhhhhhhhhhhhhhhcCcccccccccccccccccccccc Q lcl|Aclame:pro 2 FNAEHLQEKWSPVLNHGEAPAIGDRYKRAVTSVLLENQERFLREERGMLNEVAVNSLGAGTIAPAGSALGSANTGGLAGF 81 (468) Q Consensus 2 ~~~~~l~~kw~p~l~~~~~~~i~~~~~~~~~~~llenq~~~~~~~~~~l~e~~~~~~~~~~~~~~~~i~~st~tg~i~~~ 81 (468) |+.|+|+|||+|||||||+|||++.|||+|+++|||||||+++|++++|+|++ +.|++ +.+|++|++|+++ T Consensus 1 m~~~~l~~~w~~~l~~~~~~~i~~~~~~~~~~~~lenq~~~~~~~~~~l~ea~-----~~~g~----~~~s~~t~~v~~~ 71 (457) T protein:vir:10 1 MSFQNLQEKWAPVLEHDSLPEIGDSYKKGVVAQLLENQEKAIAEEGKILTETL-----QTTGY----TGGDTVTGPVAGF 71 (457) T ss_pred CchHHHHHHhhHhhccCccchhhhhHHHHHHHHHhhhHHHHHHhccccccccc-----cccCC----Ccccccccccccc Confidence 88999999999999999999999999999999999999999999999999985 44444 4467789999999 Q ss_pred cceehhhhHHhhhhhhhhheeeeecCCccceeeeeeeeeecCCCC------cccccccCCccccccccccccccccccCc Q lcl|Aclame:pro 82 DPVLISLVRRAMPNLMAYDVCGVQPMSGPTGLIFAMRSRYENQAG------EEALFNEPDTGFTGGYDASQGDYAVRTGA 155 (468) Q Consensus 82 ~P~Lv~l~RRa~~~LI~~DI~GVQPmTGPTGLIFAMRsrY~~qsG------~EA~fnEa~t~fSg~~~~~~~~~~~~~~~ 155 (468) ||+||+||||++|||||+|||||||||||||||||||+||.+|.+ +|||||||++.|||......... . T Consensus 72 ~P~Li~l~Rra~p~LIa~DIwGVQPmTgPTGLIFAmRsrY~~q~~~~~a~~~EAl~nEadt~fSg~~~~~~~~~-----~ 146 (457) T protein:vir:10 72 DPVLISLIRRSMPQLIAYDIAGVQPMTGPTGLIFAMRTNYGAERNPAAAGYDEAFFNEPNAGFSGGPGAYDPGA-----T 146 (457) T ss_pred cchhhhhhHHHHhhhhhhhcceeecCCCcceeeeeeeeeecCccccccccccceeeeccCcccCcccccccccc-----c Confidence 999999999999999999999999999999999999999999877 79999999999998765543322 2 Q ss_pred cccCCCcccccccccccccc---ccccccccchhhhhccCCC--CcchhhcceEEEEEEEEeecccccccccHHHHHHHH Q lcl|Aclame:pro 156 GVGGDSEGNNPALLNDAAPG---TYEVGSKMPREDLERMGEA--NRLFREMSFSIEKTSVTAQSRALKAEYTLELAQDLK 230 (468) Q Consensus 156 ~~~~~~~gt~~~~~~~a~~~---~~t~~~gm~Ta~aE~lG~~--~~~f~EMaFsIeK~tVtAKSRaLKAEYT~ELAQDLk 230 (468) ...++..+++++..++...+ .++++.||+|+++|.||++ +++|+||+|+||||+||||||||||||||||||||| T Consensus 147 ~~~~~~~gt~~~~~~~~~~~~~~~~~~~~gmsTA~aE~lgd~~~n~~f~EMaFsIeK~tVtAKSRaLKAEYTiELAQDLK 226 (457) T protein:vir:10 147 GVTNDAEGTNPALLNDSPAGTYEQADDATGMSTATVEALDDSTANTAFREMGFSIEKVTVTARARALKAEYSIEMAQDLK 226 (457) T ss_pred ccccccccccccccCccccccccccccccchhhhhhhccCCCCCccchhhheeEEEEEEEeeeccceeccccHHHHHHHH Confidence 23345566777777666555 4578899999999999953 457999999999999999999999999999999999 Q ss_pred HhcCCChhHHHHHHHHHHHHHHhhHHHHHHHhhhhhccccccccccccccccccccchhHHHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 231 AIHGLDAEQELANILSSEVLAEINREVVRRVYTVAKKGAQNNVANAGIFDLDVDSNGRWSVEKFKGLLFQVERDANAIAQ 310 (468) Q Consensus 231 AiHGLDAE~ELanILStEImlEINREii~~l~~va~~~k~~~~~~~g~~Dl~~~~~grw~~e~~k~L~~~i~~ean~i~~ 310 (468) ||||||||+||+||||||||+||||||||+||++|+|||+.+++++|+|||+++++|||++|+||+|+|||+||||+|+| T Consensus 227 AiHGLDAEtELaNILStEImlEINReii~~l~~~a~~~~~~~~~~~gv~dl~~~~~g~~~~e~~k~L~~~i~~ean~i~~ 306 (457) T protein:vir:10 227 AIHGLDAEQELANILSTEILAEINREVVRTIYTNAVAGAQNNTATAGVFDLDVDSNGRWSVEKFKGLLFQIERDANAIGH 306 (457) T ss_pred HhcCCChhHHHHHHHHHHHHHHhhHHHHHhHhhhheeeeccccccceeeeeeccccchhhHHHHHHHHHHHHHHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hhcCCCccEEEEchhHHHHHHhhcccccccccccccccccccccccCceeEEEecCceEEEEcccccccCCcceEEEEEe Q lcl|Aclame:pro 311 ETRRGKGNFLICSADVASALAMAGVLDYSSGLNGAGGPSIGEVDDTGNLAVGTINGRIKVFVDPYAANLSDKHYYVIGYK 390 (468) Q Consensus 311 ~T~r~~~n~~v~S~~Va~~L~~sG~~~~~~~~~~~~~~~~~~~d~t~~~~~G~l~g~~~vy~D~Ya~~~~~~dY~~vG~K 390 (468) ||+||+||||||||+||++|++||||+++|+++++.+. .++|+++.+|+|+|+|||+||||||+++|+|+|||+|||| T Consensus 307 ~T~rg~gn~~i~S~~Va~~L~~sg~l~~~p~~~~~~~~--~~~d~~~~~~~G~l~~r~~vy~D~Ya~~ns~~dy~~vG~K 384 (457) T protein:vir:10 307 QTRRGKGNILICSADVVSALGMAGVLDYTPALNGNNGL--AGVDDTSSTLVGTLNGRIKVYVDPYSANVADKHFYVAGYK 384 (457) T ss_pred hhccccceEEEEchhHHHHHhhcccccccchhhccccc--cccccccceeEEEecCCeEEEEecccccCCccceEEEEEe Confidence 99999999999999999999999999999999999775 4899999999999999999999999999999999999999 Q ss_pred cCCcccceeEeeccchhhcccccCCccccceeeeeeeeeeeecCcccccCccccccchhhhhhhcccceeeeeeeecC Q lcl|Aclame:pro 391 GTSPYDAGLFYCPYVPLQMVRSIDPNTFQPKIGFKTRYGMVSNPFVTTNGLYNGTPDGEALTPNANMYYRRVQVTNLM 468 (468) Q Consensus 391 G~~~~d~glfyaPYv~l~~~~~~dp~s~qP~~g~~tRY~l~~nP~~~~~~~~~~~~~~~~~~~~an~y~~r~~v~~l~ 468 (468) |++++|+||||||||||++++++||+||||++||||||||++|||+...++... ....+.|.|||||+|++|| T Consensus 385 G~~~~~~glfy~PYv~l~~~~~~dp~sfqP~~g~~tRY~l~~NP~~~~~~~~~~-----~~~~~~n~~~~rs~vs~ll 457 (457) T protein:vir:10 385 GTSPYDAGLFYCPYVPLQQVRAINPDTFQPKIGFKTRYGMVSNPFAGGLTQGSG-----ALTVNANKYYRRVQVANLM 457 (457) T ss_pred CCcceecceeecccccccccCccCCccccceeeeeeeeeeeecccccccccccc-----cccccchhhcceeeeeecC Confidence 999999999999999999999999999999999999999999999976554332 1234678899999999999 No 5 >protein:vir:106286 Length: 534 # NCBI annotation: gp23 major head protein # Family: family:all:364 # MgeID: mge:1474 # MgeName: Aeh1 # Cross-refs: genbank:acc:NP_944113;genbank:gi:38640157;genbank:GeneID:2658034 Probab=100.00 E-value=3.9e-225 Score=1250.86 Aligned_cols=456 Identities=37% Similarity=0.647 Sum_probs=402.4 Q ss_pred cchHHHHHhhhhhhCCCccchhcchhhhHHHHHHHhHHHHHHhhhh----------------------hhhhhhhhhhcC Q lcl|Aclame:pro 2 FNAEHLQEKWSPVLNHGEAPAIGDRYKRAVTSVLLENQERFLREER----------------------GMLNEVAVNSLG 59 (468) Q Consensus 2 ~~~~~l~~kw~p~l~~~~~~~i~~~~~~~~~~~llenq~~~~~~~~----------------------~~l~e~~~~~~~ 59 (468) |.+|+|+|||+|||||||+|||++.|||+|+++|||||||+++|++ .+|+|+ +.+ T Consensus 1 ~~~~~l~~kw~p~l~~~~~~~i~~~~~~~~~a~l~enq~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~ea---~~~ 77 (534) T protein:vir:10 1 MSKKSLLKKWQPLVESEGMPAIASMKRKDIVARIFENQDEDIAHNEGGVYTDQVVVNSMVDVKGRIEEARLAEA---NIG 77 (534) T ss_pred CchhHHHHHhHHhhcCCccccccchhhhhhhhhhhhhHHHHHhhhcccccchhhhhhhhhccccchhhcccccc---ccc Confidence 9999999999999999999999999999999999999999999985 558885 789 Q ss_pred cccccccccccccccccccccccceehhhhHHhhhhhhhhheeeeecCCccceeeeeeeeeecCCC----Cccccccc-- Q lcl|Aclame:pro 60 AGTIAPAGSALGSANTGGLAGFDPVLISLVRRAMPNLMAYDVCGVQPMSGPTGLIFAMRSRYENQA----GEEALFNE-- 133 (468) Q Consensus 60 ~~~~~~~~~i~~st~tg~i~~~~P~Lv~l~RRa~~~LI~~DI~GVQPmTGPTGLIFAMRsrY~~qs----G~EA~fnE-- 133 (468) ++|+|++++|+||++|++|+++||+||+||||++|||||+|||||||||||||||||||+||.+|. ++||||+| T Consensus 78 ~~~g~~~~~ia~s~~s~~v~~~~P~Li~lvRra~p~LIa~DIwGVQPMTgPTGLIFAMRsrY~n~~~~~s~~EAf~ne~~ 157 (534) T protein:vir:10 78 GDHGYDATKIASGETSGSITNVGPAVMGLVRRAIPQLIAFDICGVQPMTSSTGQVFTLRAIYGGNSQDANAREAFHPTYG 157 (534) T ss_pred cccccccccccccccccccccccchhhhHHHHHHHhhhhhhhheeccCCchhhhheeeeeeecCCCCCcccccccccccc Confidence 999999999999999999999999999999999999999999999999999999999999999875 67999999 Q ss_pred CCccccccccccccccccccCcc------------ccCCCccccc-------------c---------cccccccccccc Q lcl|Aclame:pro 134 PDTGFTGGYDASQGDYAVRTGAG------------VGGDSEGNNP-------------A---------LLNDAAPGTYEV 179 (468) Q Consensus 134 a~t~fSg~~~~~~~~~~~~~~~~------------~~~~~~gt~~-------------~---------~~~~a~~~~~t~ 179 (468) +|+.|||................ ..+.+.++.+ . .........|++ T Consensus 158 adt~fSG~~~a~~~~~~~~~~a~~~g~~~~~~~~~~t~~~~Gt~~~~~~~~~~v~~~~~~~~~ag~~~~~~~~~~~~y~~ 237 (534) T protein:vir:10 158 PDADFSGRGAAQDIAVFVRGTAVASGAFAKLHIEAATGVQAGTKTVQFIKDYAVDALPADQTEAGLAYKWLLANGYAVET 237 (534) T ss_pred ccccccccccccccccccccccccccccccccccccccccccccccccccccccccccCCccccccccccccccccceec Confidence 99999997654321111000000 0000111110 0 011122356889 Q ss_pred ccccchhhhhccC----CCCcchhhcceEEEEEEEEeecccccccccHHHHHHHHHhcCCChhHHHHHHHHHHHHHHhhH Q lcl|Aclame:pro 180 GSKMPREDLERMG----EANRLFREMSFSIEKTSVTAQSRALKAEYTLELAQDLKAIHGLDAEQELANILSSEVLAEINR 255 (468) Q Consensus 180 ~~gm~Ta~aE~lG----~~~~~f~EMaFsIeK~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAE~ELanILStEImlEINR 255 (468) +.||+|+.+|.+| +++++|+||+|+||||+|+|||||||||||||||||||||||||||+||+||||||||+|||| T Consensus 238 ~~gm~Ta~AE~lg~~ggs~~~~f~EMsFsIdKvtVtAKSRaLKAEYTiELAQDLKAIHGLDAEtELsNILSTEImlEINR 317 (534) T protein:vir:10 238 SSAMATAFAELQQGFNGSADNEWNEMSFRIDKQVVEAKSRQLKAQYSIEMAQDLRAVHGLDADSELSSILANEIMHEINR 317 (534) T ss_pred ccccchhhHhhhccCCCCcccchhhcceEEEEEEEeeeccceeccccHHHHHHHHHhcCCChHHHHHHHHHHHHHHHhhH Confidence 9999999999985 345789999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHhhhhhccccccc----ccccccccccccc---chhHHHHHHHHHHHHHHHHHHHHHhhcCCCccEEEEchhHHH Q lcl|Aclame:pro 256 EVVRRVYTVAKKGAQNNV----ANAGIFDLDVDSN---GRWSVEKFKGLLFQVERDANAIAQETRRGKGNFLICSADVAS 328 (468) Q Consensus 256 Eii~~l~~va~~~k~~~~----~~~g~~Dl~~~~~---grw~~e~~k~L~~~i~~ean~i~~~T~r~~~n~~v~S~~Va~ 328 (468) ||||+||++|+|||+.+. +++|+|||+++.| +||++|+||+|++|||+|||+|+|+|+||+|||||||||||+ T Consensus 318 eii~~l~~~a~~~k~~~~~~~~~~~G~~d~~~~~~~~~~~~~~e~~~~L~~~i~~~an~i~~~T~rg~~n~~v~S~~Va~ 397 (534) T protein:vir:10 318 EMVLWINATAKVGKTGWTNMHGGKAGVFDFQDTKDIRGARWAGESYKALVVQIDKEANEIARQTGRGQGNFIICSRNVAA 397 (534) T ss_pred HHHHHHhhhhheeecccccccccccceeeeeccccccchhHHHHHHHHHHHHHHHHHHHHHHhhccccccEEEEchhHHH Confidence 999999999999999874 5789999999999 999999999999999999999999999999999999999999 Q ss_pred HHHhhcccccccccccccccccccccccCceeEEEecCceEEEEcccccccCCcceEEEEEecCCcccceeEeeccchhh Q lcl|Aclame:pro 329 ALAMAGVLDYSSGLNGAGGPSIGEVDDTGNLAVGTINGRIKVFVDPYAANLSDKHYYVIGYKGTSPYDAGLFYCPYVPLQ 408 (468) Q Consensus 329 ~L~~sG~~~~~~~~~~~~~~~~~~~d~t~~~~~G~l~g~~~vy~D~Ya~~~~~~dY~~vG~KG~~~~d~glfyaPYv~l~ 408 (468) +|+|||||++.|+..++.+ +++|+++.+|+|+|+|||+||||||+. +|||+|||||++++|+||||||||||+ T Consensus 398 ~L~~~g~l~~~~~~~~~~~---~~~d~~~~~~~G~l~~~~~vy~D~y~~----~dy~~vG~KG~~~~~~glfyaPYv~l~ 470 (534) T protein:vir:10 398 ALGHTDMLMTPAVMGANTT---MNTDTTSSLFAGVLAGKYRVYIDQYAV----EDYFTVGYKGASEMDAGLYYCPYVALT 470 (534) T ss_pred HHhhccchhcccccccccc---ccccCCCceEEEEecCceEEEecCCCC----cceEEEEEeCCcccccceeeccccccc Confidence 9999999999999877765 789999999999999999999999965 899999999999999999999999999 Q ss_pred cccccCCccccceeeeeeeeeeeecCcccccCccc--cccch-hhhhh--hcccceeeeeeeec Q lcl|Aclame:pro 409 MVRSIDPNTFQPKIGFKTRYGMVSNPFVTTNGLYN--GTPDG-EALTP--NANMYYRRVQVTNL 467 (468) Q Consensus 409 ~~~~~dp~s~qP~~g~~tRY~l~~nP~~~~~~~~~--~~~~~-~~~~~--~an~y~~r~~v~~l 467 (468) |++++||+||||++||||||||++|||++..++.. .+.|+ .+|.+ +.|.||||++|+|| T Consensus 471 ~~~~~dp~sfqP~~g~~tRY~l~~NP~~~~~~~~~~~~i~~g~~~~~~~ag~n~~~~~~~Vk~l 534 (534) T protein:vir:10 471 PLRGTDPKNFQPVLGFKTRYGVKLHPMADATQNKGFAKISNGMPQHTNMFGKNAFFRRVLVAGV 534 (534) T ss_pred cccccCCccccceeeeeeeeceeecCcccccCCccccccccCCcchhhhcccccceeeeeeecC Confidence 99999999999999999999999999998765443 23433 23444 44889999999999 No 6 >protein:vir:101039 Length: 529 # NCBI annotation: major capsid protein # Family: family:all:364 # MgeID: mge:1582 # MgeName: 44RR2.8t # Cross-refs: genbank:acc:NP_932516;genbank:gi:37651642;genbank:GeneID:2610532 Probab=100.00 E-value=2.4e-222 Score=1235.57 Aligned_cols=455 Identities=35% Similarity=0.597 Sum_probs=397.5 Q ss_pred CcchHHHHHhhhhhhCCCccchhcchhhhHHHHHHHhHHHHHHhhhhhhhhhhhhh---------hcCcccccccccccc Q lcl|Aclame:pro 1 MFNAEHLQEKWSPVLNHGEAPAIGDRYKRAVTSVLLENQERFLREERGMLNEVAVN---------SLGAGTIAPAGSALG 71 (468) Q Consensus 1 ~~~~~~l~~kw~p~l~~~~~~~i~~~~~~~~~~~llenq~~~~~~~~~~l~e~~~~---------~~~~~~~~~~~~i~~ 71 (468) -+++|+|+|||+|||||||+|||++.|||+|+++|||||||++++++.+.++++.+ ..+++|+|++++|+| T Consensus 2 ~~~~~~l~~kw~p~l~~~~~~~i~~~~~~~~~a~l~enq~~~~~~~~~~~~~~~~e~~~~~l~~~~~~~~~~~~~~~i~e 81 (529) T protein:vir:10 2 SLKNKEILNKWTPLLEGEGLPEIAGKNKQALVAQILEAQEKDSKSDPVYRDDKLIEAFGQSLMEAEVAGDHGYDPTNIAA 81 (529) T ss_pred cccHHHHHHHhHHHhcCCccchhccchhhhhhhhhhhhhHHHHhhccccchhhhhhhhhcccchhhcccccccccccccc Confidence 57888999999999999999999999999999999999999999999774443333 335789999999999 Q ss_pred cccccccccccceehhhhHHhhhhhhhhheeeeecCCccceeeeeeeeeecCCC-------------------------- Q lcl|Aclame:pro 72 SANTGGLAGFDPVLISLVRRAMPNLMAYDVCGVQPMSGPTGLIFAMRSRYENQA-------------------------- 125 (468) Q Consensus 72 st~tg~i~~~~P~Lv~l~RRa~~~LI~~DI~GVQPmTGPTGLIFAMRsrY~~qs-------------------------- 125 (468) ||+|++|++|||+||+||||++|||||+|||||||||||||||||||+||.++. T Consensus 82 st~t~~v~~~~P~Li~lvRra~p~LIa~DIwGVQPMTgPTGLIFAMRsrY~~~~~~~~~~eaf~~~y~Pda~~sga~~~g 161 (529) T protein:vir:10 82 GQSSGAITNIGPAVIGMVRRAIPSLIAFDIAGVQPMTGPTGQVFALRSVYGKDPLAAGAKEAFHPMYAPDAWHSSLATKG 161 (529) T ss_pred ccccccccccCchhhhhHHHHHhhhhhheeeeeecCCchhhhhhhhheeecCCccccccccccccccccccccccccccc Confidence 999999999999999999999999999999999999999999999999998764 Q ss_pred -----------------------CcccccccCCccccccccccccccccccCccccCCCccccccccccccccccccccc Q lcl|Aclame:pro 126 -----------------------GEEALFNEPDTGFTGGYDASQGDYAVRTGAGVGGDSEGNNPALLNDAAPGTYEVGSK 182 (468) Q Consensus 126 -----------------------G~EA~fnEa~t~fSg~~~~~~~~~~~~~~~~~~~~~~gt~~~~~~~a~~~~~t~~~g 182 (468) |.|+||+|+++.||+.........+. ...+..++..........+.++++.| T Consensus 162 a~~~~~~~~~~~~t~~~~~a~~~g~ea~f~ea~t~fs~~~~g~~~~~g~-----~~~~~~~~~~~~~~~a~~~~~~~~~G 236 (529) T protein:vir:10 162 ATTTTDGTPFAKLTAGQAIAEGDIVGHFFYESGTAFLQNVSGASVTVGT-----NETGEALDKLINAAIGEGKLAEIAEG 236 (529) T ss_pred cccccCccccccccccccccccCcceeeeecccceecccccccccccCc-----cccCcccccccccccccccccccccc Confidence 23555555555555432221111100 00011111122223345577889999 Q ss_pred cchhhhhccC----CCCcchhhcceEEEEEEEEeecccccccccHHHHHHHHHhcCCChhHHHHHHHHHHHHHHhhHHHH Q lcl|Aclame:pro 183 MPREDLERMG----EANRLFREMSFSIEKTSVTAQSRALKAEYTLELAQDLKAIHGLDAEQELANILSSEVLAEINREVV 258 (468) Q Consensus 183 m~Ta~aE~lG----~~~~~f~EMaFsIeK~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAE~ELanILStEImlEINREii 258 (468) |+|+++|.|| +++++|+||+|+||||+|||||||||||||||||||||||||||||+||+||||+|||+||||||| T Consensus 237 m~Ta~aEaL~~~g~ss~~~f~EMaFsIeK~tVtAKSRaLKAEYTiELAQDLKAVHGLDAEtELsNILStEImlEINReii 316 (529) T protein:vir:10 237 MATSIAELRQGFNGSNDNPWNEMSFRIDKQTVEAKSRQLKAQYSIELAQDLRAVHGMDADSELNGILANEVMLEINREVI 316 (529) T ss_pred cchhhhhccccCCCcccccccceeeEEEEEEEeeeccceeccccHHHHHHHHHhcCCChHHHHHHHHHHHHHHHhhHHHH Confidence 9999999995 345789999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHhhhhhccccccc----ccccccccccccc---chhHHHHHHHHHHHHHHHHHHHHHhhcCCCccEEEEchhHHHHHH Q lcl|Aclame:pro 259 RRVYTVAKKGAQNNV----ANAGIFDLDVDSN---GRWSVEKFKGLLFQVERDANAIAQETRRGKGNFLICSADVASALA 331 (468) Q Consensus 259 ~~l~~va~~~k~~~~----~~~g~~Dl~~~~~---grw~~e~~k~L~~~i~~ean~i~~~T~r~~~n~~v~S~~Va~~L~ 331 (468) |+||++|+|||+.++ +++|+|||+++.| +||++|+||+|++|||+|||+|+|+|+||+||||||||+||++|+ T Consensus 317 ~~l~~~a~~~k~~g~~~~~~~~Gv~d~~~~~~~~~~~~~~e~~k~L~~~i~~~an~I~~~T~rg~~n~vi~S~~Va~~L~ 396 (529) T protein:vir:10 317 DWINYTAQVGKSGWTKTDGSASGVFDFQDPIDVRGARWAGESYKALLIQIDKEANEIARQTGRGAGNFIIASRNVVSALA 396 (529) T ss_pred HhHhhhhhhhhcccccccccccceeecccCccccccchHHHHHHHHHHHHHHHHHHHHHhhccccceEEEEchHHHHHHH Confidence 999999999998776 8889999999866 999999999999999999999999999999999999999999999 Q ss_pred hhcccccccccccccccccccccccCceeEEEecCceEEEEcccccccCCcceEEEEEecCCcccceeEeeccchhhccc Q lcl|Aclame:pro 332 MAGVLDYSSGLNGAGGPSIGEVDDTGNLAVGTINGRIKVFVDPYAANLSDKHYYVIGYKGTSPYDAGLFYCPYVPLQMVR 411 (468) Q Consensus 332 ~sG~~~~~~~~~~~~~~~~~~~d~t~~~~~G~l~g~~~vy~D~Ya~~~~~~dY~~vG~KG~~~~d~glfyaPYv~l~~~~ 411 (468) |+||+++++......+ .++|+++.+|+|+|+|||+||||||+ ++|||+|||||++++|+||||||||||+|++ T Consensus 397 ~~~~~~~~~~~~~~sg---~~~d~~~~~~~G~l~~~~~vy~D~y~----~~dy~~vG~KG~~~~~~glfy~PYv~l~~~~ 469 (529) T protein:vir:10 397 LIDTNISPAAQGMASG---LNADTTKGVFAGILGGRYKVYIDQYA----RQDYFTMGYRGANNLDAGIYYCPYVALTPLR 469 (529) T ss_pred hhhhhccccccccccc---cccccCCceEEEEecCceEEEecCCC----CcceEEEEEeCCcccccceeecccccccccc Confidence 9999998876555444 57899999999999999999999986 5899999999999999999999999999999 Q ss_pred ccCCccccceeeeeeeeeeeecCcccccCc--cccccchhhhhhhc--ccceeeeeeeec Q lcl|Aclame:pro 412 SIDPNTFQPKIGFKTRYGMVSNPFVTTNGL--YNGTPDGEALTPNA--NMYYRRVQVTNL 467 (468) Q Consensus 412 ~~dp~s~qP~~g~~tRY~l~~nP~~~~~~~--~~~~~~~~~~~~~a--n~y~~r~~v~~l 467 (468) ++||+||||++||||||||++|||++...+ +..++++.+|+++| |.||||++|+|| T Consensus 470 ~~dp~sfqP~~g~~tRY~l~~NP~~~~~~~~~~~r~~~g~~~~~~ag~n~~~r~~~Vk~l 529 (529) T protein:vir:10 470 GSDPKNFQPVMGFKTRYAIGVNPFAESRTQAPQGRITSGMPGVNSVGKNAYFRRVWVKGL 529 (529) T ss_pred ccCCCcccceeeeeeeeceeecCccccccccccccccCCcchhhhcCccceeEEeeeccC Confidence 999999999999999999999999976554 34589999998877 889999999999 No 7 >protein:vir:6901 Length: 522 # NCBI annotation: gp23 major head protein # Family: family:all:364 # MgeID: mge:140 # MgeName: RB69 # Cross-refs: genbank:acc:NP_861877;genbank:gi:32453668;genbank:GeneID:1494303 Probab=100.00 E-value=1.1e-221 Score=1232.01 Aligned_cols=458 Identities=36% Similarity=0.625 Sum_probs=401.2 Q ss_pred CcchHHHHHhhhhhhCCCccchhcchhhhHHHHHHHhHHHHHHhhhhhhhhhhhhhhc---------Ccccccccccccc Q lcl|Aclame:pro 1 MFNAEHLQEKWSPVLNHGEAPAIGDRYKRAVTSVLLENQERFLREERGMLNEVAVNSL---------GAGTIAPAGSALG 71 (468) Q Consensus 1 ~~~~~~l~~kw~p~l~~~~~~~i~~~~~~~~~~~llenq~~~~~~~~~~l~e~~~~~~---------~~~~~~~~~~i~~ 71 (468) |+++|+|+|||+|||||||+|+|.+ +||+|+++|||||||++++++.+++|...+++ +++|+|.+.+|+| T Consensus 4 ~~~~e~l~~kw~p~l~~~~~~~~~~-~~~~~~a~l~enq~~~~~~~~~~~~~~~~~~~~~~l~ea~~~~~~~~~~~~i~e 82 (522) T protein:vir:69 4 IKTKAQLVDKWKELLEGEGLPEIAN-SKQAIIAKIFENQEKDFEVSPEYKDEKIAQAFGSFLTEAEIGGDHGYNAQNIAA 82 (522) T ss_pred cchHHHHHHhhHHHhcCCCCCcccc-chhhhhhhhhhhhhHHhhcccccchhHHHHhhhhhhhhhccccccCCCcccccc Confidence 9999999999999999999999987 69999999999999999999977777665555 5699999999999 Q ss_pred cccccccccccceehhhhHHhhhhhhhhheeeeecCCccceeeeeeeeeecCCC----Ccccc--cccCCcccccccccc Q lcl|Aclame:pro 72 SANTGGLAGFDPVLISLVRRAMPNLMAYDVCGVQPMSGPTGLIFAMRSRYENQA----GEEAL--FNEPDTGFTGGYDAS 145 (468) Q Consensus 72 st~tg~i~~~~P~Lv~l~RRa~~~LI~~DI~GVQPmTGPTGLIFAMRsrY~~qs----G~EA~--fnEa~t~fSg~~~~~ 145 (468) |++|++|++|+|+||+|+||++|||||+|||||||||||||||||||+||.+|. ++|+| |||+|+.|||..... T Consensus 83 s~~t~~v~~~~P~li~lvrRa~p~LIa~DIwGVQPMTgPTGLIFAMRsrY~~q~~~~~~~eaf~~~neadt~fSG~~~~t 162 (522) T protein:vir:69 83 GQTSGAVTQIGPAVMGMVRRAIPNLIAFDICGVQPMNSPTGQVFALRAVYGKDPIAAGAKEAFHPMYAPDAMFSGQGAAK 162 (522) T ss_pred cccccccccccchHHHHHHHHHhhhhhhhceeeccCCchhhhheeeeeeccCCcccCccccccccccccccccccccccc Confidence 999999999999999999999999999999999999999999999999999875 66777 599999999976543 Q ss_pred ccccccccC-----------------------ccccCCCcccccccc------ccccccccccccccchhhhhcc---C- Q lcl|Aclame:pro 146 QGDYAVRTG-----------------------AGVGGDSEGNNPALL------NDAAPGTYEVGSKMPREDLERM---G- 192 (468) Q Consensus 146 ~~~~~~~~~-----------------------~~~~~~~~gt~~~~~------~~a~~~~~t~~~gm~Ta~aE~l---G- 192 (468) ......... ........++++... .....+.|+++.||+|+.+|.+ | T Consensus 163 ~~~~~~~~~~t~~G~~~~~~~~~~gt~~~~~~a~~t~~~t~~~~~~~~~ai~s~~~~~~~y~~g~GmsTa~aEal~~lgg 242 (522) T protein:vir:69 163 KFPALAASTQTKVGDIYTHFFQETGTVYLQASAQVTISSSADDAAKLDAEIIKQMEAGALVEIAEGMATSIAELQEGFNG 242 (522) T ss_pred cccccccccccccccccccccccccceeeecccCCcCCCCCcccccccchhccccccccceeeccccchhhhhhcccCCC Confidence 322111100 011111122222211 2234567899999999999985 3 Q ss_pred CCCcchhhcceEEEEEEEEeecccccccccHHHHHHHHHhcCCChhHHHHHHHHHHHHHHhhHHHHHHHhhhhhcccccc Q lcl|Aclame:pro 193 EANRLFREMSFSIEKTSVTAQSRALKAEYTLELAQDLKAIHGLDAEQELANILSSEVLAEINREVVRRVYTVAKKGAQNN 272 (468) Q Consensus 193 ~~~~~f~EMaFsIeK~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAE~ELanILStEImlEINREii~~l~~va~~~k~~~ 272 (468) +++++|+||+|+||||+|+|||||||||||||||||||||||||||+||+||||||||+|||||||++|+.+|+.+++.. T Consensus 243 ss~~~f~EMaFsIeKvTVtAKSRaLKAEYTiELAQDLKAIHGLDAEtELaNILSTEImlEINReii~~i~~sa~~~~~g~ 322 (522) T protein:vir:69 243 STDNPWNEMGFRIDKQVIEAKSRQLKAAYSIELAQDLRAVHGMDADAELSGILATEIMLEINREVVDWINYSAQVGKSGM 322 (522) T ss_pred CcccchhhhcceEeeEEEeeecccccccccHHHHHHHHHhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhhhheeecccc Confidence 34578999999999999999999999999999999999999999999999999999999999999998777777666543 Q ss_pred c----ccccccccccccc---chhHHHHHHHHHHHHHHHHHHHHHhhcCCCccEEEEchhHHHHHHhhcccccccccccc Q lcl|Aclame:pro 273 V----ANAGIFDLDVDSN---GRWSVEKFKGLLFQVERDANAIAQETRRGKGNFLICSADVASALAMAGVLDYSSGLNGA 345 (468) Q Consensus 273 ~----~~~g~~Dl~~~~~---grw~~e~~k~L~~~i~~ean~i~~~T~r~~~n~~v~S~~Va~~L~~sG~~~~~~~~~~~ 345 (468) . +++|+|||+++.| |||++|+||+|+||||||||+|+|+|+||+|||||||||||++|+|+|++++.++...+ T Consensus 323 t~~~~~~~Gv~Dl~~~~~~~~~rw~~e~~k~L~~~i~~~an~i~~~T~rg~~n~~i~S~~Va~~L~~~~~~~~~~~~~~~ 402 (522) T protein:vir:69 323 TNIVGSKAGVFDFQDPIDIRGARWAGESFKALLFQIDKEAVEIARQTGRGEGNFIIASRNVVNVLASVDTGISYAAQGLA 402 (522) T ss_pred ccccccccceeecccccccccchhHHHHHHHHHHHHHHHHHHHHHhcccccccEEEEchhHHHHHhhccccccccccccc Confidence 3 6899999999999 99999999999999999999999999999999999999999999999999999998866 Q ss_pred cccccccccccCceeEEEecCceEEEEcccccccCCcceEEEEEecCCcccceeEeeccchhhcccccCCccccceeeee Q lcl|Aclame:pro 346 GGPSIGEVDDTGNLAVGTINGRIKVFVDPYAANLSDKHYYVIGYKGTSPYDAGLFYCPYVPLQMVRSIDPNTFQPKIGFK 425 (468) Q Consensus 346 ~~~~~~~~d~t~~~~~G~l~g~~~vy~D~Ya~~~~~~dY~~vG~KG~~~~d~glfyaPYv~l~~~~~~dp~s~qP~~g~~ 425 (468) .+ .++|+++++|+|+|+|||+||||||+ ++|||+|||||++++|+||||||||||+|++++||+||||++||| T Consensus 403 ~g---~~~d~~~~~~~G~l~~~~~vy~D~y~----~~dy~~vG~KG~~~~~~glfyaPYv~l~~~~~~dp~sfqP~~g~~ 475 (522) T protein:vir:69 403 SG---FNTDTTKSVFAGVLGGKYRVYIDQYA----KQDYFTVGYKGANEMDAGIYYAPYVALTPLRGSDPKNFQPVMGFK 475 (522) T ss_pred cc---ccccCCCceEEEEecCceEEEecCCC----CcceEEEEEeCCcccccceeeccccccccccccCCccccceeeee Confidence 55 68899999999999999999999996 489999999999999999999999999999999999999999999 Q ss_pred eeeeeeecCcccccC------ccccccchhhhhhhcccceeeeeeeec Q lcl|Aclame:pro 426 TRYGMVSNPFVTTNG------LYNGTPDGEALTPNANMYYRRVQVTNL 467 (468) Q Consensus 426 tRY~l~~nP~~~~~~------~~~~~~~~~~~~~~an~y~~r~~v~~l 467 (468) |||||++|||++... ..+++|++.+ ..+.|.|||||+|+|| T Consensus 476 tRY~l~vNP~~~~~~~~~~~ri~~g~p~~~~-~~~~n~y~r~v~v~~~ 522 (522) T protein:vir:69 476 TRYGIGVNPFAESSLQAPGARIQSGMPSILN-SLGKNAYFRRVYVKGI 522 (522) T ss_pred eeeceeecCcccccCCcccceeecccchhhc-ccCCcceeeEEEeecC Confidence 999999999997432 2345566666 7788999999999999 No 8 >protein:vir:101811 Length: 529 # NCBI annotation: gp23 # Family: family:all:364 # MgeID: mge:1580 # MgeName: 31 # Cross-refs: genbank:acc:YP_238888;genbank:gi:66391963;genbank:GeneID:3416638 Probab=100.00 E-value=2.3e-221 Score=1230.15 Aligned_cols=460 Identities=35% Similarity=0.608 Sum_probs=396.4 Q ss_pred CcchHHHHHhhhhhhCCCccchhcchhhhHHHHHHHhHHHHHHhhhhhhhhhhhhh---------hcCcccccccccccc Q lcl|Aclame:pro 1 MFNAEHLQEKWSPVLNHGEAPAIGDRYKRAVTSVLLENQERFLREERGMLNEVAVN---------SLGAGTIAPAGSALG 71 (468) Q Consensus 1 ~~~~~~l~~kw~p~l~~~~~~~i~~~~~~~~~~~llenq~~~~~~~~~~l~e~~~~---------~~~~~~~~~~~~i~~ 71 (468) -+++|+|+|||+|||||||+|||++.|||+|+++|||||||+++++|.+.++++.+ ..+++|+|++++|++ T Consensus 2 ~~~~~~l~~kw~p~l~~~~~~~i~~~~~~~~~a~l~enq~~~~~~~~~~~~~~~~e~~~~~l~e~~~~~~~~~~~~~i~~ 81 (529) T protein:vir:10 2 SLKNKEILNKWTPLLEGEGLPEIAGKNKQALVAQILEAQEKDSKSDPVYRDDKLIEAFGQSLMEAEVAGDHGYDPTNIAA 81 (529) T ss_pred ccchHHHHHHhhHhhcCCccchhccchhhhhhhhhhhhhHHHHhcccccchhhhhhhhhccchhhccccccccccccccc Confidence 57788999999999999999999999999999999999999999998773332222 346789999999999 Q ss_pred cccccccccccceehhhhHHhhhhhhhhheeeeecCCccceeeeeeeeeecCCC----Cccccccc--CCcccccccccc Q lcl|Aclame:pro 72 SANTGGLAGFDPVLISLVRRAMPNLMAYDVCGVQPMSGPTGLIFAMRSRYENQA----GEEALFNE--PDTGFTGGYDAS 145 (468) Q Consensus 72 st~tg~i~~~~P~Lv~l~RRa~~~LI~~DI~GVQPmTGPTGLIFAMRsrY~~qs----G~EA~fnE--a~t~fSg~~~~~ 145 (468) ||+|++|++|||+||+||||++|||||+|||||||||||||||||||+||.++. +.|+||++ +++.||+.+... T Consensus 82 st~t~~v~~~~P~Li~lvRra~p~LIa~DIwGVQPMTgPTGLIFAMRsrY~~~~~~~~~~eaf~~~~~pda~~sga~~~g 161 (529) T protein:vir:10 82 GQSSGAITNIGPAVIGMVRRAIPSLIAFDIAGVQPMTGPTGQVFALRSVYGKDPLAAGAKEAFHPMYAPDAWHSSLATKG 161 (529) T ss_pred ccccccccccCchhhhhHHHHHHhhhhhhhheeccCCchhhhhheeeeeecCCccccccccccccccccccccccccccc Confidence 999999999999999999999999999999999999999999999999998874 45777764 566666654322 Q ss_pred cccccccc---------------------------------Ccccc-----CCCccccccccccccccccccccccchhh Q lcl|Aclame:pro 146 QGDYAVRT---------------------------------GAGVG-----GDSEGNNPALLNDAAPGTYEVGSKMPRED 187 (468) Q Consensus 146 ~~~~~~~~---------------------------------~~~~~-----~~~~gt~~~~~~~a~~~~~t~~~gm~Ta~ 187 (468) ........ +.... .+...+............++++.||+|+. T Consensus 162 a~t~~~~t~~~~~ta~~~~a~g~g~ea~f~ea~t~fs~~~~g~~~~~g~~~t~~~~~~~~~~~~a~~~~~~~~~GmsTa~ 241 (529) T protein:vir:10 162 ATTTTDGTPFAKLTAGQAIAEGDIVGHFFYESGTAFLQNVSGASVTVGTNETGEALDKLINAAIGEGKLAEIAEGMATSI 241 (529) T ss_pred ccccccccccccccccccccccccceeeecccCceeeccccccccccCccccCcccccccccccccccccccccchhhhh Confidence 21110000 00000 00011111122233456788999999999 Q ss_pred hhccC----CCCcchhhcceEEEEEEEEeecccccccccHHHHHHHHHhcCCChhHHHHHHHHHHHHHHhhHHHHHHHhh Q lcl|Aclame:pro 188 LERMG----EANRLFREMSFSIEKTSVTAQSRALKAEYTLELAQDLKAIHGLDAEQELANILSSEVLAEINREVVRRVYT 263 (468) Q Consensus 188 aE~lG----~~~~~f~EMaFsIeK~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAE~ELanILStEImlEINREii~~l~~ 263 (468) +|.|+ +++++|+||+|+||||+|||||||||||||||||||||||||||||+||+||||||||+||||||||+||+ T Consensus 242 aEaL~~~ggss~~~f~EMaFsIeK~tVtAKSRaLKAEYTiELAQDLKAVHGLDAEtELsNILStEImlEINReii~~l~~ 321 (529) T protein:vir:10 242 AELRQGFNGSNDNPWNEMSFRIDKQTVEAKSRQLKAQYSIELAQDLRAVHGMDADSELNGILANEVMLEINREVIDWINY 321 (529) T ss_pred hhccccCCCcccccccceeeEEEEEEEeeeccceeccccHHHHHHHHHhcCCChHHHHHHHHHHHHHHHhhHHHHHHHhh Confidence 99994 34678999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hhhccccccc----ccccccccccccc---chhHHHHHHHHHHHHHHHHHHHHHhhcCCCccEEEEchhHHHHHHhhccc Q lcl|Aclame:pro 264 VAKKGAQNNV----ANAGIFDLDVDSN---GRWSVEKFKGLLFQVERDANAIAQETRRGKGNFLICSADVASALAMAGVL 336 (468) Q Consensus 264 va~~~k~~~~----~~~g~~Dl~~~~~---grw~~e~~k~L~~~i~~ean~i~~~T~r~~~n~~v~S~~Va~~L~~sG~~ 336 (468) +|+|||+.++ +++|+|||+++.| +||++|+||+|++|||+|||+|+|+|+||+||||||||+||++|+|+|| T Consensus 322 ~a~~~~~~~~~~~~~~~Gv~d~~~~~~~~~~~~~~e~~~~L~~~i~~~an~I~~~T~rg~~n~vi~S~~Va~~L~~~~~- 400 (529) T protein:vir:10 322 TAQVGKSGWTKTDGSASGVFDFQDPIDVRGARWAGESYKALLIQIDKEANEIARQTGRGAGNFIIASRNVVSALALIDT- 400 (529) T ss_pred hhhhhccccccccccccceeecccCccccccchHHHHHHHHHHHHHHHHHHHHHhhccccceEEEEchHHHHHHHhhcc- Confidence 9999998877 6679999999866 9999999999999999999999999999999999999999999999985 Q ss_pred ccccccccccccccccccccCceeEEEecCceEEEEcccccccCCcceEEEEEecCCcccceeEeeccchhhcccccCCc Q lcl|Aclame:pro 337 DYSSGLNGAGGPSIGEVDDTGNLAVGTINGRIKVFVDPYAANLSDKHYYVIGYKGTSPYDAGLFYCPYVPLQMVRSIDPN 416 (468) Q Consensus 337 ~~~~~~~~~~~~~~~~~d~t~~~~~G~l~g~~~vy~D~Ya~~~~~~dY~~vG~KG~~~~d~glfyaPYv~l~~~~~~dp~ 416 (468) .+.|+..+..+ +.++|+++.+|+|+|+|||+||||||+ ++|||+|||||++++|+||||||||||+|++++||+ T Consensus 401 ~~~~~~~~~~s--g~~~d~~~~~~~G~l~~~~~vy~D~y~----~~dy~~vG~KG~~~~~~glfy~PYv~l~~~~~~dp~ 474 (529) T protein:vir:10 401 NISPAAQGMAS--GLNADTTKGVFAGILGGRYKVYIDQYA----RQDYFTMGYRGANNLDAGIYYCPYVALTPLRGFDPK 474 (529) T ss_pred ccccccccccc--ccccccCCceEEEEecCceEEEecCCC----CcceEEEEEeCCcccccceeeccccccccccccCCC Confidence 55666544433 357999999999999999999999986 589999999999999999999999999999999999 Q ss_pred cccceeeeeeeeeeeecCcccccCc--cccccchhhhhhhc--ccceeeeeeeec Q lcl|Aclame:pro 417 TFQPKIGFKTRYGMVSNPFVTTNGL--YNGTPDGEALTPNA--NMYYRRVQVTNL 467 (468) Q Consensus 417 s~qP~~g~~tRY~l~~nP~~~~~~~--~~~~~~~~~~~~~a--n~y~~r~~v~~l 467 (468) ||||++||||||||++|||++...+ +..++++.+|+++| |.||||++|+|| T Consensus 475 sfqP~~g~~tRY~l~~NP~~~~~~~~~~~r~~~g~~~~~~ag~n~~~r~~~Vk~l 529 (529) T protein:vir:10 475 NFQPVMGFKTRYAIGVNPFAESRTQAPQGRITSGMPGVNSVGKNAYFRRVWVKGL 529 (529) T ss_pred cccceeeeeeeeceeecCccccccccccccccCCcchhhhcCccceeEEeeeccC Confidence 9999999999999999999976554 34589999998877 889999999999 No 9 >protein:vir:7214 Length: 521 # NCBI annotation: gp23 major head protein # Family: family:all:364 # MgeID: mge:142 # MgeName: T4 # Cross-refs: genbank:acc:NP_049787;genbank:gi:9632597;genbank:GeneID:1258751 Probab=100.00 E-value=3.5e-221 Score=1229.23 Aligned_cols=459 Identities=35% Similarity=0.605 Sum_probs=400.9 Q ss_pred CcchHHHHHhhhhhhCCCccchhcchhhhHHHHHHHhHHHHHHhhhhhhhhhhhhh---------hcCcccccccccccc Q lcl|Aclame:pro 1 MFNAEHLQEKWSPVLNHGEAPAIGDRYKRAVTSVLLENQERFLREERGMLNEVAVN---------SLGAGTIAPAGSALG 71 (468) Q Consensus 1 ~~~~~~l~~kw~p~l~~~~~~~i~~~~~~~~~~~llenq~~~~~~~~~~l~e~~~~---------~~~~~~~~~~~~i~~ 71 (468) |+++|+|+|||+|||||||+|+|++ +||+|+++|||||||++++.+.+++|..++ .++++|++++.+|+| T Consensus 3 ~~~~~~l~~kw~p~l~~~~~~~i~~-~~~~~~a~~~enq~~~~~~~~~~~~~~~~~~~~~~l~e~~~~~~~~~~~~~iae 81 (521) T protein:vir:72 3 IKTKAELLNKWKPLLEGEGLPEIAN-SKQAIIAKIFENQEKDFQTAPEYKDEKIAQAFGSFLTEAEIGGDHGYNATNIAA 81 (521) T ss_pred cchhHHHHHhhhhhhccCCCCcccc-chhhhhhhhhhhhhhhhhhcccccchHHHHHHhhhhhhhcccCccccCcccccc Confidence 9999999999999999999999987 599999999999999999998555544333 346789999999999 Q ss_pred cccccccccccceehhhhHHhhhhhhhhheeeeecCCccceeeeeeeeeecCCC----Cccccccc--CCcccccccccc Q lcl|Aclame:pro 72 SANTGGLAGFDPVLISLVRRAMPNLMAYDVCGVQPMSGPTGLIFAMRSRYENQA----GEEALFNE--PDTGFTGGYDAS 145 (468) Q Consensus 72 st~tg~i~~~~P~Lv~l~RRa~~~LI~~DI~GVQPmTGPTGLIFAMRsrY~~qs----G~EA~fnE--a~t~fSg~~~~~ 145 (468) |++|++|+++||+||+||||++|||||+|||||||||||||||||||+||.+|. |+|+||+| +|+.|||.+... T Consensus 82 s~~t~~v~~~~P~Li~lvRra~p~LIa~DIwGVQPMTgPTGLIFAMRsrY~~q~~~~~g~ea~~~e~~~da~fSG~~~~~ 161 (521) T protein:vir:72 82 GQTSGAVTQIGPAVMGMVRRAIPNLIAFDICGVQPMNSPTGQVFALRAVYGKDPVAAGAKEAFHPMYGPDAMFSGQGAAK 161 (521) T ss_pred cccccccccCCchhhhHHHHHHhhhhhhhceeeccCCchhhhheeeeeeecCCCCCcccccccchhcccccccccccccc Confidence 999999999999999999999999999999999999999999999999999985 78999986 788999886543 Q ss_pred ccccccccCcc----------------------------ccCCCccccccc-cccccccccccccccchhhhhcc---C- Q lcl|Aclame:pro 146 QGDYAVRTGAG----------------------------VGGDSEGNNPAL-LNDAAPGTYEVGSKMPREDLERM---G- 192 (468) Q Consensus 146 ~~~~~~~~~~~----------------------------~~~~~~gt~~~~-~~~a~~~~~t~~~gm~Ta~aE~l---G- 192 (468) ........... ..++...+++.. .+....+.|+++.||+|+.+|.+ | T Consensus 162 ~~~~~~~~~~~a~Gd~~~~~~~~~gt~~~~~~~~~~~~~g~t~~~~t~~~v~~~~~a~~~y~~g~gm~Ta~aEal~~~g~ 241 (521) T protein:vir:72 162 KFPALAASTQTTVGDIYTHFFQETGTVYLQASVQVTIDAGATDAAKLDAEIKKQMEAGALVEIAEGMATSIAELQEGFNG 241 (521) T ss_pred cccccccccccccccccccccccccccccccccccccCCCCCCccccccccccccccCceeeeecccchhhhhhhcccCC Confidence 32221111110 001111111111 12233467899999999999985 3 Q ss_pred CCCcchhhcceEEEEEEEEeecccccccccHHHHHHHHHhcCCChhHHHHHHHHHHHHHHhhHHHHHHHhhhhhcccccc Q lcl|Aclame:pro 193 EANRLFREMSFSIEKTSVTAQSRALKAEYTLELAQDLKAIHGLDAEQELANILSSEVLAEINREVVRRVYTVAKKGAQNN 272 (468) Q Consensus 193 ~~~~~f~EMaFsIeK~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAE~ELanILStEImlEINREii~~l~~va~~~k~~~ 272 (468) +++++|+||+|+||||+|+|||||||||||||||||||||||||||+||+||||||||+|||||||++|+..|+.+++.. T Consensus 242 ss~~~f~EMaFsIeK~tVtAKSRaLKAEYTiELAQDLKAVHGLDAEtELaNILSTEImlEINReii~~i~~sa~~g~~g~ 321 (521) T protein:vir:72 242 STDNPWNEMGFRIDKQVIEAKSRQLKAAYSIELAQDLRAVHGMDADAELSGILATEIMLEINREVVDWINYSAQVGKSGM 321 (521) T ss_pred cccccccceeeEEEEEEEeeeccceeccccHHHHHHHHHhcCCChHHHHHHHHHHHHHHHhhHHHhhhhhheeeeeeeee Confidence 34578999999999999999999999999999999999999999999999999999999999999988766666655433 Q ss_pred c----ccccccccccccc---chhHHHHHHHHHHHHHHHHHHHHHhhcCCCccEEEEchhHHHHHHhhcccccccccccc Q lcl|Aclame:pro 273 V----ANAGIFDLDVDSN---GRWSVEKFKGLLFQVERDANAIAQETRRGKGNFLICSADVASALAMAGVLDYSSGLNGA 345 (468) Q Consensus 273 ~----~~~g~~Dl~~~~~---grw~~e~~k~L~~~i~~ean~i~~~T~r~~~n~~v~S~~Va~~L~~sG~~~~~~~~~~~ 345 (468) . +++|+|||+++.| +||++|+||+|+||||||||+|+|+|+||+|||||||||||++|+|+|.+++.++.+.+ T Consensus 322 t~~~~~~~G~~d~~~~~d~~~~~~~~e~~k~L~~~i~~~an~i~~~T~r~~~n~~i~S~~Va~~L~~~~~~~~~~~~~~~ 401 (521) T protein:vir:72 322 TLTPGSKAGVFDFQDPIDIRGARWAGESFKALLFQIDKEAVEIARQTGRGEGNFIIASRNVVNVLASVDTGISYAAQGLA 401 (521) T ss_pred eeccCccccceecccccccccchHHHHHHHHHHHHHHHHHHHHHHhcccccceEEEEchHHHHHHhhccccccccccccc Confidence 3 5799999999998 99999999999999999999999999999999999999999999999999999998776 Q ss_pred cccccccccccCceeEEEecCceEEEEcccccccCCcceEEEEEecCCcccceeEeeccchhhcccccCCccccceeeee Q lcl|Aclame:pro 346 GGPSIGEVDDTGNLAVGTINGRIKVFVDPYAANLSDKHYYVIGYKGTSPYDAGLFYCPYVPLQMVRSIDPNTFQPKIGFK 425 (468) Q Consensus 346 ~~~~~~~~d~t~~~~~G~l~g~~~vy~D~Ya~~~~~~dY~~vG~KG~~~~d~glfyaPYv~l~~~~~~dp~s~qP~~g~~ 425 (468) .+ +++|+|+.+|+|+|+|||+||||||+ ++|||+|||||++++|+||||||||||+|++++||+||||++||| T Consensus 402 ~g---~~~d~~~~~~~G~l~~~~~vy~D~y~----~~dy~~vG~KG~~~~~~glfyaPYv~l~~~~~~dp~sfqP~~g~~ 474 (521) T protein:vir:72 402 TG---FSTDTTKSVFAGVLGGKYRVYIDQYA----KQDYFTVGYKGPNEMDAGIYYAPYVALTPLRGSDPKNFQPVMGFK 474 (521) T ss_pred cc---ccccCCCceEEEEccCceEEEecCCC----CcceEEEEEeCCcccccceeeccccccccccccCCccccceeeee Confidence 65 67899999999999999999999996 489999999999999999999999999999999999999999999 Q ss_pred eeeeeeecCcccccCcc-ccccchhhhhh----hcccceeeeeeeec Q lcl|Aclame:pro 426 TRYGMVSNPFVTTNGLY-NGTPDGEALTP----NANMYYRRVQVTNL 467 (468) Q Consensus 426 tRY~l~~nP~~~~~~~~-~~~~~~~~~~~----~an~y~~r~~v~~l 467 (468) |||||++|||++..++. ...+++++|++ +.|.|||||+|+|| T Consensus 475 tRY~l~~NP~~~~~~~~~a~~i~~~~~~~~a~~~~~sy~r~v~v~~l 521 (521) T protein:vir:72 475 TRYGIGINPFAESAAQAPASRIQSGMPSILNSLGKNAYFRRVYVKGI 521 (521) T ss_pred eeeceeecCcccccCcccceeecCcChhhhcCccccceeeeeeecCC Confidence 99999999999865544 45899999998 44889999999999 No 10 >protein:vir:103463 Length: 521 # NCBI annotation: major head subunit precursor # Family: family:all:364 # MgeID: mge:1542 # MgeName: RB32 # Cross-refs: genbank:acc:YP_803115;genbank:gi:116326395;genbank:GeneID:4405492 Probab=100.00 E-value=3.9e-221 Score=1228.93 Aligned_cols=459 Identities=36% Similarity=0.612 Sum_probs=401.8 Q ss_pred CcchHHHHHhhhhhhCCCccchhcchhhhHHHHHHHhHHHHHHhhhhhhhhhhhhhh---------cCcccccccccccc Q lcl|Aclame:pro 1 MFNAEHLQEKWSPVLNHGEAPAIGDRYKRAVTSVLLENQERFLREERGMLNEVAVNS---------LGAGTIAPAGSALG 71 (468) Q Consensus 1 ~~~~~~l~~kw~p~l~~~~~~~i~~~~~~~~~~~llenq~~~~~~~~~~l~e~~~~~---------~~~~~~~~~~~i~~ 71 (468) |+++|+|+|||+|||||||+|+|++ +||+|+++|||||||++++.+.+++|..++. ++++|++++.+|+| T Consensus 3 ~~~~~~l~~kw~p~l~~~~~~~i~~-~~~~~~a~~~enq~~~~~~~~~~~~~~~~~~~~~~l~e~~~~~~~~~~~~~i~e 81 (521) T protein:vir:10 3 IKTKAELLNKWKPLLEGEGLPEIAN-SKQAIIAKIFENQEKDFQTAPEYKDEKIAQAFGSFLTEAEIGGDHGYNATNIAA 81 (521) T ss_pred cchhHHHHHhhhhhhccCCCCcccc-chhhhhhhhhhhhhhhhhhccccchhHHHHHHhhhhhhhcccCccccccccccc Confidence 9999999999999999999999987 5999999999999999999996655554444 45789999999999 Q ss_pred cccccccccccceehhhhHHhhhhhhhhheeeeecCCccceeeeeeeeeecCCC----Ccccccc--cCCcccccccccc Q lcl|Aclame:pro 72 SANTGGLAGFDPVLISLVRRAMPNLMAYDVCGVQPMSGPTGLIFAMRSRYENQA----GEEALFN--EPDTGFTGGYDAS 145 (468) Q Consensus 72 st~tg~i~~~~P~Lv~l~RRa~~~LI~~DI~GVQPmTGPTGLIFAMRsrY~~qs----G~EA~fn--Ea~t~fSg~~~~~ 145 (468) |++|++|+++||+||+||||++|||||+|||||||||||||||||||+||.+|. +.|+|++ ++|+.|||.+... T Consensus 82 s~~t~~v~~~~P~Li~lvRra~p~LIa~DIwGVQPMTgPTGLIFAMRsrY~~q~~~~~g~eaf~~~~~ada~fSG~~~at 161 (521) T protein:vir:10 82 GQTSGAVTQIGPAVMGMVRRAIPNLIAFDICGVQPMNSPTGQVFALRAVYGKDPIAAGAKEAFHPMYGPDAMFSGQGAAK 161 (521) T ss_pred cccccccccCCchhhhHHHHHHhhhhhhhceeeccCCchhhhheeeeeeccCCccccccccccchhcccccccccccccc Confidence 999999999999999999999999999999999999999999999999999985 6788875 5999999986553 Q ss_pred ccccccccCcc-----------------------ccCCCccccccccc------cccccccccccccchhhhhccC---- Q lcl|Aclame:pro 146 QGDYAVRTGAG-----------------------VGGDSEGNNPALLN------DAAPGTYEVGSKMPREDLERMG---- 192 (468) Q Consensus 146 ~~~~~~~~~~~-----------------------~~~~~~gt~~~~~~------~a~~~~~t~~~gm~Ta~aE~lG---- 192 (468) ........... ......++++...+ ....+.|+++.||+|+.+|.|+ T Consensus 162 ~~s~~~~~~~~~~Gd~~~~~~~~~g~~~~~~~~~~t~~~t~~d~~~~~~~~~~~~~~~~~y~~~~GmsTa~aEal~~~g~ 241 (521) T protein:vir:10 162 KFAALAASTQTTVGDIYTHFFQDTGTVYLQASAQVTISSTADDAAKLDAEIKKQMEAGALVEIAEGMATSIAELQESFNG 241 (521) T ss_pred ccccccccccccccccccccccccccceecccccccCCCcccccccccccccccccccceeecccccchhhHhhhccCCC Confidence 32221111110 11111112221111 2345678899999999999873 Q ss_pred CCCcchhhcceEEEEEEEEeecccccccccHHHHHHHHHhcCCChhHHHHHHHHHHHHHHhhHHHHHHHhhhhhcccccc Q lcl|Aclame:pro 193 EANRLFREMSFSIEKTSVTAQSRALKAEYTLELAQDLKAIHGLDAEQELANILSSEVLAEINREVVRRVYTVAKKGAQNN 272 (468) Q Consensus 193 ~~~~~f~EMaFsIeK~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAE~ELanILStEImlEINREii~~l~~va~~~k~~~ 272 (468) +++++|+||+|+||||+|+|||||||||||||||||||||||||||+||+||||||||+|||||||++|+..|+.+++.. T Consensus 242 ss~~~f~EMaFsIeKvtVtAKSRaLKAEYTiELAQDLKAVHGLDAEtELaNILSTEImlEINReii~~i~~sa~~~~~g~ 321 (521) T protein:vir:10 242 STDNPWNEMGFRIDKQVIEAKSRQLKAAYSIELAQDLRAVHGMDADAELSGILATEIMLEINREVVDWINYSAQVGKSGM 321 (521) T ss_pred CccccccceeeEEEEEEEeeeccceeccccHHHHHHHHHhcCCChHHHHHHHHHHHHHHHhhHHHhhhhhheeeeeeeee Confidence 45678999999999999999999999999999999999999999999999999999999999999988776666665433 Q ss_pred c----ccccccccccccc---chhHHHHHHHHHHHHHHHHHHHHHhhcCCCccEEEEchhHHHHHHhhcccccccccccc Q lcl|Aclame:pro 273 V----ANAGIFDLDVDSN---GRWSVEKFKGLLFQVERDANAIAQETRRGKGNFLICSADVASALAMAGVLDYSSGLNGA 345 (468) Q Consensus 273 ~----~~~g~~Dl~~~~~---grw~~e~~k~L~~~i~~ean~i~~~T~r~~~n~~v~S~~Va~~L~~sG~~~~~~~~~~~ 345 (468) . +++|+|||+++.| +||++|+||+|+||||||||+|+|+|+||+|||||||||||++|+|+|++++.++...+ T Consensus 322 t~~~~~~~G~~d~~~~~d~~~~~~~~e~~k~L~~~i~~~an~i~~~T~r~~~n~~i~S~~Va~~L~~~~~~~~~~~~~~~ 401 (521) T protein:vir:10 322 TLTPGSKAGVFDFQDPIDIRGARWAGESFKALLFQIDKEAVEIARQTGRGEGNFIIASRNVVNVLASVDTGISYAAQGLA 401 (521) T ss_pred eeccCccccceecccccccccchHHHHHHHHHHHHHHHHHHHHHHhcccccceEEEEchHHHHHHhhccccccccccccc Confidence 3 5799999999998 99999999999999999999999999999999999999999999999999999988766 Q ss_pred cccccccccccCceeEEEecCceEEEEcccccccCCcceEEEEEecCCcccceeEeeccchhhcccccCCccccceeeee Q lcl|Aclame:pro 346 GGPSIGEVDDTGNLAVGTINGRIKVFVDPYAANLSDKHYYVIGYKGTSPYDAGLFYCPYVPLQMVRSIDPNTFQPKIGFK 425 (468) Q Consensus 346 ~~~~~~~~d~t~~~~~G~l~g~~~vy~D~Ya~~~~~~dY~~vG~KG~~~~d~glfyaPYv~l~~~~~~dp~s~qP~~g~~ 425 (468) .+ .++|+|+++|+|+|+|||+||||||+ ++|||+|||||++++|+||||||||||+|++++||+||||++||| T Consensus 402 ~g---~~~d~~~~~~~G~l~~~~~vy~D~y~----~~dy~~vG~KG~~~~~~glfyaPYv~l~~~~~~dp~sfqP~~g~~ 474 (521) T protein:vir:10 402 TG---FNTDTTKSVFAGVLGGKYRVYIDQYA----KQDYFTVGYKGPNEMDAGIYYAPYVALTPLRGSDPKNFQPVMGFK 474 (521) T ss_pred cc---ccccCCCceEEEEecCceEEEecCCC----CcceEEEEEeCCcccccceeeccccccccccccCCccccceeeee Confidence 55 67899999999999999999999996 489999999999999999999999999999999999999999999 Q ss_pred eeeeeeecCcccccCcccc-ccchhhhhhhc----ccceeeeeeeec Q lcl|Aclame:pro 426 TRYGMVSNPFVTTNGLYNG-TPDGEALTPNA----NMYYRRVQVTNL 467 (468) Q Consensus 426 tRY~l~~nP~~~~~~~~~~-~~~~~~~~~~a----n~y~~r~~v~~l 467 (468) |||||++|||++..++... .+++++|++.| |.|||||+|+|| T Consensus 475 tRY~l~~NP~~~~~~~~~~~~i~~~~~~~~a~~~~~sy~r~v~v~~l 521 (521) T protein:vir:10 475 TRYGIGINPFAESAAQAPASRIQSGMPSILNSLGKNAYFRRVYVKGI 521 (521) T ss_pred eeeceeecCcccccCCccceeecccchhhhccccccceeeeeeecCC Confidence 9999999999987655544 88999998844 889999999999 No 11 >protein:vir:6601 Length: 528 # NCBI annotation: major capsid protein # Family: family:all:364 # MgeID: mge:139 # MgeName: RB49 # Cross-refs: genbank:acc:NP_891732;genbank:gi:33620668;genbank:GeneID:1725275 Probab=100.00 E-value=5.9e-221 Score=1227.96 Aligned_cols=460 Identities=36% Similarity=0.574 Sum_probs=408.0 Q ss_pred CcchHHHHHhhhhhhCCCccchhcchhhhHHHHHHHhHHHHHHhhhhhhhhhhhhhhc---------Ccccccccccccc Q lcl|Aclame:pro 1 MFNAEHLQEKWSPVLNHGEAPAIGDRYKRAVTSVLLENQERFLREERGMLNEVAVNSL---------GAGTIAPAGSALG 71 (468) Q Consensus 1 ~~~~~~l~~kw~p~l~~~~~~~i~~~~~~~~~~~llenq~~~~~~~~~~l~e~~~~~~---------~~~~~~~~~~i~~ 71 (468) |+++|+|+|||+|||||||+|||++.|||+|+++|||||||++++++.+++|...+++ +.+|+|++++|+| T Consensus 1 ~~~~~~l~~kw~p~l~~~~~~~i~~~~~~~~~a~l~enq~~~~~~~~~~~~~~~~~~~~~~l~ea~~~~~~~~~~~~i~e 80 (528) T protein:vir:66 1 MKTTKELMEKWSPLLENEKLPEIATASKQKLVAKILESQEADFAVDPIYKDEKVVEAFGGFIAEAEVAGDHGYDASQIAA 80 (528) T ss_pred CcchHHHHHHhHHhhcCCCcchhcchhhhhhhhhhhhhhHHHhhcccchhhHHHHHhhhhhhhhhcccccccccchhccc Confidence 9999999999999999999999999999999999999999999999988888766665 4689999999999 Q ss_pred cccccccccccceehhhhHHhhhhhhhhheeeeecCCccceeeeeeeeeecCCC-------------CcccccccCCccc Q lcl|Aclame:pro 72 SANTGGLAGFDPVLISLVRRAMPNLMAYDVCGVQPMSGPTGLIFAMRSRYENQA-------------GEEALFNEPDTGF 138 (468) Q Consensus 72 st~tg~i~~~~P~Lv~l~RRa~~~LI~~DI~GVQPmTGPTGLIFAMRsrY~~qs-------------G~EA~fnEa~t~f 138 (468) |++|++|+++||+||+|+||++|||||+|||||||||||||||||||++|.++. +.|++|+|+++.| T Consensus 81 s~~t~~v~~~~P~Li~lvRRa~p~LIa~DIwGVQPMTgPTGlIFAmRs~Y~~~~~~~~~~eAfh~~~g~ea~fsea~t~~ 160 (528) T protein:vir:66 81 GQTTGAITNVGPAVIGMVRRAIPNLIAFDICGVQPMSTPTSQIFAIRSVYGGDPLKSGAREAFHPMYAPDAFHSSLAAKE 160 (528) T ss_pred cccccccccCchhHHHHHHHHHHhhhhhhhheeecCCchhhhheeeeeeecCCccccccccccccccccccccccccccc Confidence 999999999999999999999999999999999999999999999999998764 4577777777776 Q ss_pred cccccccc-------------------------cc------cccccCccccCCCccccccccccccccccccccccchhh Q lcl|Aclame:pro 139 TGGYDASQ-------------------------GD------YAVRTGAGVGGDSEGNNPALLNDAAPGTYEVGSKMPRED 187 (468) Q Consensus 139 Sg~~~~~~-------------------------~~------~~~~~~~~~~~~~~gt~~~~~~~a~~~~~t~~~gm~Ta~ 187 (468) +..+..+. .. ..........++..++++...+....+.|+++.||+|++ T Consensus 161 a~~gGpTGliFAm~s~y~s~~~g~ea~~nea~t~fs~~~~~~~~~~~~~~~g~~~g~~~~~~~~a~~~~~~~~~Gm~Ta~ 240 (528) T protein:vir:66 161 ATVGSPTGTAFAKLTLSQAITAGDIVYHTFAETGIAYLQNVTGDSVTPQKVGSESEDEVVMKLIEEGKLAEIAFGMATSI 240 (528) T ss_pred ccccCCccceeecccccccccccceeeecccccceeeeccccccccccCcccccccccccccccccccceecccccchhh Confidence 64321100 00 000111112233345556666667778899999999999 Q ss_pred hhcc---C-CCCcchhhcceEEEEEEEEeecccccccccHHHHHHHHHhcCCChhHHHHHHHHHHHHHHhhHHHHHHHhh Q lcl|Aclame:pro 188 LERM---G-EANRLFREMSFSIEKTSVTAQSRALKAEYTLELAQDLKAIHGLDAEQELANILSSEVLAEINREVVRRVYT 263 (468) Q Consensus 188 aE~l---G-~~~~~f~EMaFsIeK~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAE~ELanILStEImlEINREii~~l~~ 263 (468) +|.+ | +++++|+||+|+||||+|||||||||||||||||||||||||||||+||+||||||||+|||||||++|+. T Consensus 241 aEale~lg~~s~~~f~EMaFsIeK~tVtAKSRaLKAEYTiELAQDLKAIHGLDAEtELsNILStEImlEINREii~~i~~ 320 (528) T protein:vir:66 241 AEIQEGFNGSSNNPWAEMSMRIDKQVVEAKSRQLKARYSIEVAQDLRAVHGMDADAELNAILANEVLLEINREIVDVINF 320 (528) T ss_pred hhhhcccCCCcccchhhcceEEEeEEEEeeccceeccccHHHHHHHHHhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhh Confidence 9975 4 34678999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hhhccccccc----ccccccccccccc---chhHHHHHHHHHHHHHHHHHHHHHhhcCCCccEEEEchhHHHHHHhhccc Q lcl|Aclame:pro 264 VAKKGAQNNV----ANAGIFDLDVDSN---GRWSVEKFKGLLFQVERDANAIAQETRRGKGNFLICSADVASALAMAGVL 336 (468) Q Consensus 264 va~~~k~~~~----~~~g~~Dl~~~~~---grw~~e~~k~L~~~i~~ean~i~~~T~r~~~n~~v~S~~Va~~L~~sG~~ 336 (468) .|+++++..+ +++|+|||+++.| +||++|+||+|+||||||||+|+|+|+||+||||||||+||++|+|||++ T Consensus 321 ~a~~~~~~~t~~~~~~aG~~dl~~~~d~~g~rw~~e~~k~L~~~i~~~an~I~~~T~r~~gn~vi~S~~Va~~L~~~g~~ 400 (528) T protein:vir:66 321 TAQVGKTGMTQTVGSKAGVFDLQDPIDTRGARWAGESFKSLIYQIDKEAAEIARQTGRGAGNFVIASRNVVNILASADQG 400 (528) T ss_pred eeeeeeeeeeeccccccceeecccccccccchhHHHHHHHHHHHHHHHHHHHHHhhccccccEEEEchHHHHHHhhcccc Confidence 9998886543 5789999998876 69999999999999999999999999999999999999999999999999 Q ss_pred ccccccccccccccccccccCceeEEEecCceEEEEcccccccCCcceEEEEEecCCcccceeEeeccchhhcccccCCc Q lcl|Aclame:pro 337 DYSSGLNGAGGPSIGEVDDTGNLAVGTINGRIKVFVDPYAANLSDKHYYVIGYKGTSPYDAGLFYCPYVPLQMVRSIDPN 416 (468) Q Consensus 337 ~~~~~~~~~~~~~~~~~d~t~~~~~G~l~g~~~vy~D~Ya~~~~~~dY~~vG~KG~~~~d~glfyaPYv~l~~~~~~dp~ 416 (468) ++.+....+.+ +++|+++.+|+|+|+|||+||||||+ ++|||+|||||++++|+||||||||||+|++++||+ T Consensus 401 ~~~~~~~~~~~---~~~d~~~~~~~G~l~~~~~vy~D~y~----~~dy~~vG~KG~~~~~~glfyaPYv~l~~~~~~dp~ 473 (528) T protein:vir:66 401 ISLAMQGAAKG---LNTDTTKAVFAGVLAGKYKVFIDQYA----RQDYFTVGYKGDNEMDAGIYYAPYVALTPLRATDPQ 473 (528) T ss_pred ccccccccccc---cccCCCCceeEEEecCceEEEecCCC----CcceEEEEEeCCcccccceeecccccceeeEeeCCc Confidence 98888776655 78999999999999999999999986 589999999999999999999999999999999999 Q ss_pred cccceeeeeeeeeeeecCcccccCc--cccccchhhhhhhc--ccceeeeeeeec Q lcl|Aclame:pro 417 TFQPKIGFKTRYGMVSNPFVTTNGL--YNGTPDGEALTPNA--NMYYRRVQVTNL 467 (468) Q Consensus 417 s~qP~~g~~tRY~l~~nP~~~~~~~--~~~~~~~~~~~~~a--n~y~~r~~v~~l 467 (468) ||||++||||||||++|||+++..+ +...+++.+|.++| |.||||++|||| T Consensus 474 sfqP~~g~~tRY~l~vNP~~~~~~~~~~~ri~~g~~~~~~ag~n~~~r~~~Vk~~ 528 (528) T protein:vir:66 474 SFHPVLGFKTRYGIGINPFADSKSQEPSARITSGMLSKDSVGKNAYFRRVWVKGC 528 (528) T ss_pred cccceeeeeeeeceeecCcccccCccccccccccchhhhhcCccceeEEeeeccC Confidence 9999999999999999999986643 34589999998766 889999999999 No 12 >protein:vir:80986 Length: 528 # NCBI annotation: gp23 major head protein # Family: family:all:364 # MgeID: mge:1888 # MgeName: Phi1 # Cross-refs: genbank:acc:YP_001469506;genbank:gi:157311463;genbank:GeneID:5602119 Probab=100.00 E-value=1.7e-220 Score=1225.43 Aligned_cols=460 Identities=37% Similarity=0.590 Sum_probs=403.3 Q ss_pred CcchHHHHHhhhhhhCCCccchhcchhhhHHHHHHHhHHHHHHhhhhhhhhhhhhhhc---------Ccccccccccccc Q lcl|Aclame:pro 1 MFNAEHLQEKWSPVLNHGEAPAIGDRYKRAVTSVLLENQERFLREERGMLNEVAVNSL---------GAGTIAPAGSALG 71 (468) Q Consensus 1 ~~~~~~l~~kw~p~l~~~~~~~i~~~~~~~~~~~llenq~~~~~~~~~~l~e~~~~~~---------~~~~~~~~~~i~~ 71 (468) |+++|+|+|||+|||||||+|||++.|||+|+++|||||||++++++.+++|...+++ +++|+|++.+|+| T Consensus 1 ~~~~~~l~~kw~p~l~~~~~~~i~~~~~~~~~a~llenq~~~~~~~~~~~~~~~~~~~~~~l~ea~~~~~~~~~~~~i~e 80 (528) T protein:vir:80 1 MKTTKELMEKWSPLLENEKLPEIATASKQKLVAKILESQEADFAVDPIYKDEKVVEAFGGFIAEAEVAGDHGYDASQIAA 80 (528) T ss_pred CcchHHHHHhhhHhhcCCccchhcchhhhhhhhhhhhhhhHHhhccccccchHHHHhhhhhccccccccccCCccccccc Confidence 9999999999999999999999999999999999999999999999977777666555 4689999999999 Q ss_pred cccccccccccceehhhhHHhhhhhhhhheeeeecCCccceeeeeeeeeecCCC----Ccccc--cccCCcccccccccc Q lcl|Aclame:pro 72 SANTGGLAGFDPVLISLVRRAMPNLMAYDVCGVQPMSGPTGLIFAMRSRYENQA----GEEAL--FNEPDTGFTGGYDAS 145 (468) Q Consensus 72 st~tg~i~~~~P~Lv~l~RRa~~~LI~~DI~GVQPmTGPTGLIFAMRsrY~~qs----G~EA~--fnEa~t~fSg~~~~~ 145 (468) |++|++|++|||+||+||||++|||||+|||||||||||||||||||+||.++. ++|+| |+++++.||+..+.. T Consensus 81 s~~t~~v~~~~P~Li~lvRra~p~LIa~DIwGVQPMTgPTGLIFAMRsrY~~~~~~~~~~ea~~~~~~~da~fS~~~t~~ 160 (528) T protein:vir:80 81 GQTTGAITNVGPAVIGMVRRAIPNLIAFDICGVQPMSTPTSQIFAIRSVYGPNPLASQAKEAFHPMYAPDAFHSSLAAKG 160 (528) T ss_pred cccccccccCCchhhhHHHHHHhhhhhhhhheeccCCchhhhheeeeeeecCCccccccccccccccccccccccccccc Confidence 999999999999999999999999999999999999999999999999999874 45666 457888887654321 Q ss_pred cccccc--------c------cC------------------------ccccCCCccccccccccccccccccccccchhh Q lcl|Aclame:pro 146 QGDYAV--------R------TG------------------------AGVGGDSEGNNPALLNDAAPGTYEVGSKMPRED 187 (468) Q Consensus 146 ~~~~~~--------~------~~------------------------~~~~~~~~gt~~~~~~~a~~~~~t~~~gm~Ta~ 187 (468) ...... . .+ ....++..++..........+.|+++.||+|+. T Consensus 161 ~a~~~ea~t~fs~~~~~~~~~~G~~~~~t~~~tg~~~~~~~~~~~~~~~~~gt~~~~~~~~~~~~~~~~~~~~~Gm~Ta~ 240 (528) T protein:vir:80 161 AAVGSPTGTPFAKLAIGTQIEAGDIVHHTFAETGIAYLQNVTAEQVTPTKAGSESEDEVVMKLMEEGKLAEIAFGMATSI 240 (528) T ss_pred cccccccccccccccccccccccceeccccccccccccccccccccCccccCCcccccccccccccccccccccccchhh Confidence 111000 0 00 001111112222223334456788999999999 Q ss_pred hhcc---C-CCCcchhhcceEEEEEEEEeecccccccccHHHHHHHHHhcCCChhHHHHHHHHHHHHHHhhHHHHHHHhh Q lcl|Aclame:pro 188 LERM---G-EANRLFREMSFSIEKTSVTAQSRALKAEYTLELAQDLKAIHGLDAEQELANILSSEVLAEINREVVRRVYT 263 (468) Q Consensus 188 aE~l---G-~~~~~f~EMaFsIeK~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAE~ELanILStEImlEINREii~~l~~ 263 (468) +|.+ | +++++|+||+|+||||+|||||||||||||||||||||||||||||+||+||||+|||+|||||||++|+. T Consensus 241 AE~le~lg~ss~~~f~EMaFsIEKvTVtAKSRaLKAEYTiELAQDLKAIHGLDAEtELaNILStEImlEINReii~~i~~ 320 (528) T protein:vir:80 241 AEIQEGFNGSSNNPWAEMSMRIDKQVVEAKSRQLKARYSIEVAQDLRAVHGMDADAELNAILANEVLLEINREIVDVINF 320 (528) T ss_pred hhhhcccCCCccccccceeeEEEEEEEeeeccceeccccHHHHHHHHHhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhh Confidence 9965 3 35678999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hhhccccccc----ccccccccccccc---chhHHHHHHHHHHHHHHHHHHHHHhhcCCCccEEEEchhHHHHHHhhccc Q lcl|Aclame:pro 264 VAKKGAQNNV----ANAGIFDLDVDSN---GRWSVEKFKGLLFQVERDANAIAQETRRGKGNFLICSADVASALAMAGVL 336 (468) Q Consensus 264 va~~~k~~~~----~~~g~~Dl~~~~~---grw~~e~~k~L~~~i~~ean~i~~~T~r~~~n~~v~S~~Va~~L~~sG~~ 336 (468) .|+++++..+ +++|+|||+++.| +||++|+||+|+||||||||+|+|+|+||+||||||||+||++|+|||++ T Consensus 321 ~a~~~~~~~t~~~~~~~G~~dl~~~~d~~g~r~~~e~~k~L~~~i~~~an~I~~~T~~~~gn~vi~S~~Va~~L~~~g~~ 400 (528) T protein:vir:80 321 TAQVGKTGMTQTVGSKAGVFDLQDPIDTRGARWAGESFKSLIYQIDKEAAEIARQTGRGAGNFVIASRNVVNILASADQG 400 (528) T ss_pred eeeeeeeeeeeccccccceeeccccccccccchhHHHHHHHHHHHHHHHHHHHHhhccccccEEEEchHHHHHHhhcccc Confidence 9999886543 5789999998876 89999999999999999999999999999999999999999999999998 Q ss_pred ccccccccccccccccccccCceeEEEecCceEEEEcccccccCCcceEEEEEecCCcccceeEeeccchhhcccccCCc Q lcl|Aclame:pro 337 DYSSGLNGAGGPSIGEVDDTGNLAVGTINGRIKVFVDPYAANLSDKHYYVIGYKGTSPYDAGLFYCPYVPLQMVRSIDPN 416 (468) Q Consensus 337 ~~~~~~~~~~~~~~~~~d~t~~~~~G~l~g~~~vy~D~Ya~~~~~~dY~~vG~KG~~~~d~glfyaPYv~l~~~~~~dp~ 416 (468) ++.+....+.+ +++|+++.+|+|+|+|||+||||||+ ++|||+|||||++++|+||||||||||+|++++||+ T Consensus 401 ~~~~~~~~~~~---~~~d~~~~~~~G~l~~~~~vy~D~y~----~~dy~~vG~KG~~~~~~glfy~PYv~l~~~~~~dp~ 473 (528) T protein:vir:80 401 ISLAMQGAAKG---LNTDTTKAVFAGVLAGKYKVFIDQYA----RQDYFTVGYKGDNEMDAGIYYAPYVALTPLRATDPQ 473 (528) T ss_pred ccccccccccc---cccCCCCceEEEEecCceEEEecCCC----CcceEEEEEeCCcccccceeecccccceeeEeeCCc Confidence 88877766554 78999999999999999999999986 589999999999999999999999999999999999 Q ss_pred cccceeeeeeeeeeeecCcccccCc--cccccchhhhhhhc--ccceeeeeeeec Q lcl|Aclame:pro 417 TFQPKIGFKTRYGMVSNPFVTTNGL--YNGTPDGEALTPNA--NMYYRRVQVTNL 467 (468) Q Consensus 417 s~qP~~g~~tRY~l~~nP~~~~~~~--~~~~~~~~~~~~~a--n~y~~r~~v~~l 467 (468) ||||++||||||||++|||+++.++ +...+++.+|.++| |.||||++|+|| T Consensus 474 sfqP~~g~~tRY~l~~NP~~~~~~~~~~~r~~~g~~~~~~ag~n~~~r~~~Vk~~ 528 (528) T protein:vir:80 474 SFHPVLGFKTRYGIGINPFADSKSQAPSARITSGMLSKDSVGKNAYFRRVWVKGC 528 (528) T ss_pred cccceeeeeeeeceeecCcccccCCcccccccccchhhhhcCccceeEEeeeccC Confidence 9999999999999999999987654 34589999998765 889999999999 No 13 >protein:vir:98143 Length: 524 # NCBI annotation: gp23 precursor of major head subunit # Family: family:all:364 # MgeID: mge:1667 # MgeName: RB43 # Cross-refs: genbank:acc:YP_239203;genbank:gi:66391678;genbank:GeneID:3416245 Probab=100.00 E-value=7.1e-220 Score=1222.04 Aligned_cols=458 Identities=37% Similarity=0.632 Sum_probs=403.9 Q ss_pred CcchHHHHHhhhhhhCC-CccchhcchhhhHHHHHHHhHHHHHHhhhhhhhhhhhhhhcC---------ccccccccccc Q lcl|Aclame:pro 1 MFNAEHLQEKWSPVLNH-GEAPAIGDRYKRAVTSVLLENQERFLREERGMLNEVAVNSLG---------AGTIAPAGSAL 70 (468) Q Consensus 1 ~~~~~~l~~kw~p~l~~-~~~~~i~~~~~~~~~~~llenq~~~~~~~~~~l~e~~~~~~~---------~~~~~~~~~i~ 70 (468) |+++|+|+|||+||||+ ||+|||++.+||+|+++|||||||++++++.+++|...+++| ++|+|++.+|+ T Consensus 1 ~~~~~~l~~kw~p~l~~~~~~~~i~~~~~~~~~a~llenq~~~~~~~~~~~~~~~~~~~~~~l~ea~~~~~~~~~~~~i~ 80 (524) T protein:vir:98 1 MSKKNELMEKWNDLLESQEGLPDIATKSKKQLVAAILEAQEKDAETDPVYRDEKIVESFGGFLAEAEIAGDHNYDQTNIA 80 (524) T ss_pred CcchHHHHHHhHHHhcCCcCcchhcchhhHHHHHHHHhhHHHHHhcCccccchHHHHhhhcccccccccccccccccccc Confidence 99999999999999996 899999999999999999999999999999888887777765 57999999999 Q ss_pred ccccccccccccceehhhhHHhhhhhhhhheeeeecCCccceeeeeeeeeecCC---CCcccccccC-------Cccccc Q lcl|Aclame:pro 71 GSANTGGLAGFDPVLISLVRRAMPNLMAYDVCGVQPMSGPTGLIFAMRSRYENQ---AGEEALFNEP-------DTGFTG 140 (468) Q Consensus 71 ~st~tg~i~~~~P~Lv~l~RRa~~~LI~~DI~GVQPmTGPTGLIFAMRsrY~~q---sG~EA~fnEa-------~t~fSg 140 (468) ||++|++|+++||+||+||||++|||||+|||||||||||||||||||+||.++ .|+|++|||| |+.||| T Consensus 81 ~s~~t~~v~~~~P~Li~lvRra~p~LIa~DIwGVQPMTgPTGLIFAmRsrY~n~~~~~gteA~~nEAf~~~ye~dt~fSG 160 (524) T protein:vir:98 81 SGKSSGAITNIGPAVIGMVRRAIPNLIAFDICGVQPMTGPTGQVFALRAVYGKDPLAGGTPADVREAFHPMFAPDTMYSG 160 (524) T ss_pred ccccccccccccchhhhHHHHHHHhhhhhhhheeccCCchhhhhhhhheeecCCCCCcccccccccccccccccccccCC Confidence 999999999999999999999999999999999999999999999999999998 4679999886 889998 Q ss_pred ccccccccccccc-----------------------CccccCCCccccccccccc------cccccccccccchhhhhcc Q lcl|Aclame:pro 141 GYDASQGDYAVRT-----------------------GAGVGGDSEGNNPALLNDA------APGTYEVGSKMPREDLERM 191 (468) Q Consensus 141 ~~~~~~~~~~~~~-----------------------~~~~~~~~~gt~~~~~~~a------~~~~~t~~~gm~Ta~aE~l 191 (468) .+........... .........+++|...+.+ ....++++.||+|+.+|.| T Consensus 161 ~g~~t~~s~~~~g~~~~~g~~~~~~~~~~g~~~~~~~~~g~~~~tgt~p~~~~~a~~~~~~~g~~~~~~~GmsTA~aEaL 240 (524) T protein:vir:98 161 EGAHTAFAKITTGTAIATGAIVYHIFQETGIAYFQNVTSGNVTVTGADPAALDAAVIAENEKGTLAEISVGMATSVAELQ 240 (524) T ss_pred ccccccccccccccccccccccccccccccceeccccccCcccccccccccccccccccccccceeecccccchhhhhhh Confidence 7543322211111 1111123344555544433 2346788999999999988 Q ss_pred C----CCCcchhhcceEEEEEEEEeecccccccccHHHHHHHHHhcCCChhHHHHHHHHHHHHHHhhHHHHHHHhhhhhc Q lcl|Aclame:pro 192 G----EANRLFREMSFSIEKTSVTAQSRALKAEYTLELAQDLKAIHGLDAEQELANILSSEVLAEINREVVRRVYTVAKK 267 (468) Q Consensus 192 G----~~~~~f~EMaFsIeK~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAE~ELanILStEImlEINREii~~l~~va~~ 267 (468) + +++++|+||+|+||||+|||||||||||||||||||||||||||||+||+||||||||+|||||||++|+..|+. T Consensus 241 ~~~g~ss~~~f~EMaFsIeKvtVtAKSRaLKAEYTiELAQDLKAVHGLDAEtELsNILSTEImlEINReii~~i~~~a~~ 320 (524) T protein:vir:98 241 ENFNGSSANPWNEMAFRIDKQVIEARSRQLKAQYSVELAQDLRAVHGMDADAELSAILATEIMLEINREIVDLINYTAQV 320 (524) T ss_pred ccCCCCccccccceeeEEEEEEEeeecccccccccHHHHHHHHHhcCCChHHHHHHHHHHHHHHHhhHHHHHHHhhhhee Confidence 4 346789999999999999999999999999999999999999999999999999999999999999887555555 Q ss_pred cccc----ccccccccccccccc---chhHHHHHHHHHHHHHHHHHHHHHhhcCCCccEEEEchhHHHHHHh--hccccc Q lcl|Aclame:pro 268 GAQN----NVANAGIFDLDVDSN---GRWSVEKFKGLLFQVERDANAIAQETRRGKGNFLICSADVASALAM--AGVLDY 338 (468) Q Consensus 268 ~k~~----~~~~~g~~Dl~~~~~---grw~~e~~k~L~~~i~~ean~i~~~T~r~~~n~~v~S~~Va~~L~~--sG~~~~ 338 (468) +++. .++++|+|||+++.| +||++|+||+|++|||+|||+|+|+|+||+|||||||||||++|+| +||+++ T Consensus 321 ~~~g~t~~~~~~~G~~dl~~~~d~~~~r~~~e~~~~L~~~i~~~an~I~~~T~rg~~n~~i~S~~Va~~L~~~~~g~~~~ 400 (524) T protein:vir:98 321 GKSGFTQTVGSKAGSFDFQDPVDIRGARWAGESYKALLIQIDKEANEIARQTGRGAGNFIIASRNVVSALARIDSGITPA 400 (524) T ss_pred ceeecccccccccceeeccccccccccchhHHHHHHHHHHHHHHHHHHHHhhccccccEEEEchHHHHHHhhhhcccccc Confidence 4442 356679999999865 9999999999999999999999999999999999999999999999 799998 Q ss_pred ccccccccccccccccccCceeEEEecCceEEEEcccccccCCcceEEEEEecCCcccceeEeeccchhhcccccCCccc Q lcl|Aclame:pro 339 SSGLNGAGGPSIGEVDDTGNLAVGTINGRIKVFVDPYAANLSDKHYYVIGYKGTSPYDAGLFYCPYVPLQMVRSIDPNTF 418 (468) Q Consensus 339 ~~~~~~~~~~~~~~~d~t~~~~~G~l~g~~~vy~D~Ya~~~~~~dY~~vG~KG~~~~d~glfyaPYv~l~~~~~~dp~s~ 418 (468) ++.+... +++|+++.+|+|+|+|||+||||||+ ++|||+|||||++++|+||||||||||+|++++||+|| T Consensus 401 s~~~~~~-----~~~d~~~~~~~G~l~~~~~vy~D~y~----~~dy~~vG~KG~~~~~~glfyaPYv~l~~~~~~dp~sf 471 (524) T protein:vir:98 401 SQGLQKT-----LNVDTTKAVFAGVLGGTYKVYIDQYA----RQDYFTVGFKGDNEMDAGIYYAPYVALTPLRGSDPKNF 471 (524) T ss_pred cchhhcc-----cccCCccceEEEEecCceEEEecCCC----CcceEEEEeeCCcccccceeeccccccccccccCCccc Confidence 8887653 68999999999999999999999996 48999999999999999999999999999999999999 Q ss_pred cceeeeeeeeeeeecCcccccCcc--ccccchhhhhhhc--ccceeeeeeeec Q lcl|Aclame:pro 419 QPKIGFKTRYGMVSNPFVTTNGLY--NGTPDGEALTPNA--NMYYRRVQVTNL 467 (468) Q Consensus 419 qP~~g~~tRY~l~~nP~~~~~~~~--~~~~~~~~~~~~a--n~y~~r~~v~~l 467 (468) ||++||||||||++|||+++.++. ..++++.+|.++| |.||||++|+|| T Consensus 472 qP~~g~~tRY~l~~NP~~~~~~~~~~~ri~~g~~~~~~ag~n~~~r~~~Vk~l 524 (524) T protein:vir:98 472 QPVMGFKTRYGIGINPFANSRSQAPADRITSGMISKEMCGKNAYFRKVWVKGL 524 (524) T ss_pred cceeeeeeeeceeecCcccccCCccccccccCcchHhhcCccceeeEeeeccC Confidence 999999999999999999866543 3589999998755 889999999999 No 14 >protein:vir:5670 Length: 514 # NCBI annotation: gp23 # Family: family:all:364 # MgeID: mge:119 # MgeName: KVP40 # Cross-refs: genbank:acc:NP_899609;genbank:gi:34419596;genbank:GeneID:2546039 Probab=100.00 E-value=2.2e-219 Score=1219.34 Aligned_cols=455 Identities=38% Similarity=0.638 Sum_probs=393.3 Q ss_pred HHHHHhhhhhhCCCc--cchhcchhhhHHHHHHHhHHHHHHhhhhhhhhhhhhhh---------cCcccccccccccccc Q lcl|Aclame:pro 5 EHLQEKWSPVLNHGE--APAIGDRYKRAVTSVLLENQERFLREERGMLNEVAVNS---------LGAGTIAPAGSALGSA 73 (468) Q Consensus 5 ~~l~~kw~p~l~~~~--~~~i~~~~~~~~~~~llenq~~~~~~~~~~l~e~~~~~---------~~~~~~~~~~~i~~st 73 (468) -+|+|||+||||||| +|||++.+||+|+++|||||||+++|++.+.++.+.++ ++++|+|++.+|+||+ T Consensus 1 ~~l~~kw~p~l~~~~~~~~~i~~~~~~~~~~~l~enq~~~~~~~~~~~~~~~~~~~~~~l~e~~~~~~~~~~~~~ia~s~ 80 (514) T protein:vir:56 1 MNLTEKWKDLLEAEGADMPEIATATKQKIMSKIFENQDRDINNDPMYRDPQLVEAFNAGLNEAVVNGDHGYDPANIAQGV 80 (514) T ss_pred CchhhhhhHHhcccccccccccchhhhhhhhhhhhhHHHHHhcCCcccchhhhhhhhccccccccccccccccccccccc Confidence 689999999999998 89999999999999999999999999997644433333 4688999999999999 Q ss_pred cccccccccceehhhhHHhhhhhhhhheeeeecCCccceeeeeeeeeecCC--CCccccc--ccCCcccccccccccccc Q lcl|Aclame:pro 74 NTGGLAGFDPVLISLVRRAMPNLMAYDVCGVQPMSGPTGLIFAMRSRYENQ--AGEEALF--NEPDTGFTGGYDASQGDY 149 (468) Q Consensus 74 ~tg~i~~~~P~Lv~l~RRa~~~LI~~DI~GVQPmTGPTGLIFAMRsrY~~q--sG~EA~f--nEa~t~fSg~~~~~~~~~ 149 (468) +|++|+++||+||+|+||++|||||+|||||||||||||||||||+||.+| +|+|||| ||+|+.|||.....+... T Consensus 81 ~t~~v~~~~P~ll~lvRRa~~~LIa~DIwGVQPMTgPTGLIFAMRsrY~~~~~tg~EAf~~~nEadt~fSG~~~~~~~~~ 160 (514) T protein:vir:56 81 TTGAVTNIGPTVMGMVRRAIPQLIAFDIAGVQPMTGPTSQVFTLRSVYGKDPLTGAEAFHPTRQADASFSGQAAASTIAD 160 (514) T ss_pred ccccccccchhHHHHHHHHHHhhhhhhhheeccCCchhhhheeeeeeecCCCcccccccccccccCcCcccccccccccc Confidence 999999999999999999999999999999999999999999999999988 7889999 999999998765433221 Q ss_pred ccccCccccCCC------------------------------ccccccccccccccccccccccchhhhhcc---C-CCC Q lcl|Aclame:pro 150 AVRTGAGVGGDS------------------------------EGNNPALLNDAAPGTYEVGSKMPREDLERM---G-EAN 195 (468) Q Consensus 150 ~~~~~~~~~~~~------------------------------~gt~~~~~~~a~~~~~t~~~gm~Ta~aE~l---G-~~~ 195 (468) ....+....++. ........+.+....|+++.||+|+.+|.+ | +++ T Consensus 161 ~~~~~~~~~G~~~~~~~t~~~gd~~~~~~~~~~~~~~~~~~~~~~t~~~~~~a~~~~y~~~~Gm~Ta~aEal~~lggs~~ 240 (514) T protein:vir:56 161 FPTTGAATDGTPYKAEVTTSGGDVSMRYFLALGAVTLAVAGQMTATEYTDGVAGGLLVEIDAGMATSQAELQENFNGSSN 240 (514) T ss_pred ccccccccccccccccccccccccccccccccccccccccccccccccccccccchhhhhhhhhhhhhhhhcccCCCCcc Confidence 111111000000 000011112233456788999999999985 3 356 Q ss_pred cchhhcceEEEEEEEEeecccccccccHHHHHHHHHhcCCChhHHHHHHHHHHHHHHhhHHHHHHHh---hhhhcccccc Q lcl|Aclame:pro 196 RLFREMSFSIEKTSVTAQSRALKAEYTLELAQDLKAIHGLDAEQELANILSSEVLAEINREVVRRVY---TVAKKGAQNN 272 (468) Q Consensus 196 ~~f~EMaFsIeK~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAE~ELanILStEImlEINREii~~l~---~va~~~k~~~ 272 (468) ++|+||+|+||||+|||||||||||||||||||||||||||||+||+||||||||+|||||||++|+ +|+++||+.| T Consensus 241 ~~f~EMaFsIdK~tVtAKSRaLKAEYTiELAQDLKAVHGLDAEtELsNILSTEImlEINReii~~l~~~atv~~~~~~~~ 320 (514) T protein:vir:56 241 NEWNEMSFRIDKQVVEAKSRQLKAQYSIELAQDLRAVHGLDADAELSGILANEVMVELNREIVNLVNSQAQIGKSGWTQG 320 (514) T ss_pred cccceeeeEEEEEEEeeeccceeccccHHHHHHHHHhcCCChHHHHHHHHHHHHHHHhhHHHHHHHHhheeehhcccccc Confidence 7899999999999999999999999999999999999999999999999999999999999998887 5567788899 Q ss_pred cccccccccccccc---chhHHHHHHHHHHHHHHHHHHHHHhhcCCCccEEEEchhHHHHHHhhcccccccccccccccc Q lcl|Aclame:pro 273 VANAGIFDLDVDSN---GRWSVEKFKGLLFQVERDANAIAQETRRGKGNFLICSADVASALAMAGVLDYSSGLNGAGGPS 349 (468) Q Consensus 273 ~~~~g~~Dl~~~~~---grw~~e~~k~L~~~i~~ean~i~~~T~r~~~n~~v~S~~Va~~L~~sG~~~~~~~~~~~~~~~ 349 (468) ++++|+|||+++.| +||++|+||.|+||||+|||+|+|+|+||+||||||||+||++|+|+||+++.++...... T Consensus 321 ~~~~G~~d~~~~~d~~~~~~~~e~~~~l~~~i~~~an~i~~~T~rg~gn~~i~S~~Va~~L~~sg~l~~~~~~g~~~~-- 398 (514) T protein:vir:56 321 AGAAGVFDFSDAVDVKGARWAGEAYKALLIQIEKEANEIGRQTGRGNGNFIIASRNVVSALSMTDTLVGPAAQGMQDG-- 398 (514) T ss_pred cccccccccccccccccchHHHHHHHHHHHHHHHHHHHHHhhcccccccEEEEchhHHHHHHhhhhhccccccCcccc-- Confidence 99999999998776 8999999999999999999999999999999999999999999999999988766654332 Q ss_pred cccccccCceeEEEecCceEEEEcccccccCCcceEEEEEecCCcccceeEeeccchhhcccccCCccccceeeeeeeee Q lcl|Aclame:pro 350 IGEVDDTGNLAVGTINGRIKVFVDPYAANLSDKHYYVIGYKGTSPYDAGLFYCPYVPLQMVRSIDPNTFQPKIGFKTRYG 429 (468) Q Consensus 350 ~~~~d~t~~~~~G~l~g~~~vy~D~Ya~~~~~~dY~~vG~KG~~~~d~glfyaPYv~l~~~~~~dp~s~qP~~g~~tRY~ 429 (468) .+++|+++.+|+|+|+|||+||||||+. +|||+|||||++++|+||||||||||+|++++||+||||++||||||| T Consensus 399 ~~~~d~~~~~~aG~l~~~~~vy~D~y~~----~dy~~vG~KG~~~~~~glfyaPYv~l~~~~~~dp~sfqP~~g~~tRY~ 474 (514) T protein:vir:56 399 SMNTDTNQTVFAGVLGGRFKVYIDQYAV----NDYFTVGFKGSTEMDAGVFYSPYVPLTPLRGSDSKNFQPVIGFKTRYG 474 (514) T ss_pred ccccccCcceEEEEecCceEEEecCCCC----cceEEEEEecCcceecceeeccccccccccccCCccccceeeeeeeec Confidence 3689999999999999999999999865 899999999999999999999999999999999999999999999999 Q ss_pred eeecCcccccCccccccchhhhhhhc----ccceeeeeeeec Q lcl|Aclame:pro 430 MVSNPFVTTNGLYNGTPDGEALTPNA----NMYYRRVQVTNL 467 (468) Q Consensus 430 l~~nP~~~~~~~~~~~~~~~~~~~~a----n~y~~r~~v~~l 467 (468) |++|||++..+ ...+.+.+|.+++ |.|||||+|+|| T Consensus 475 l~~NPy~~~~~--~~~~~~~~~~~~a~~~~n~y~r~v~v~~l 514 (514) T protein:vir:56 475 VQVNPFADPTA--SATKVGNGAPVAASMGKNAYFRRVFVKGL 514 (514) T ss_pred eeeCCCCCccc--cccccCCcchhhhcccccceeeeEEEecC Confidence 99999986432 2334455555544 679999999999 No 15 >protein:vir:100603 Length: 529 # NCBI annotation: gp23 precursor of major head subunit # Family: family:all:364 # MgeID: mge:1488 # MgeName: 25 # Cross-refs: genbank:acc:YP_656387;genbank:gi:109290138;genbank:GeneID:4156581 Probab=100.00 E-value=1.3e-218 Score=1215.02 Aligned_cols=456 Identities=37% Similarity=0.623 Sum_probs=397.9 Q ss_pred CcchHHHHHhhhhhhCCCccchhcchhhhHHHHHHHhHHHHHHhhhhhh------------hhhhhhhhcCccccccccc Q lcl|Aclame:pro 1 MFNAEHLQEKWSPVLNHGEAPAIGDRYKRAVTSVLLENQERFLREERGM------------LNEVAVNSLGAGTIAPAGS 68 (468) Q Consensus 1 ~~~~~~l~~kw~p~l~~~~~~~i~~~~~~~~~~~llenq~~~~~~~~~~------------l~e~~~~~~~~~~~~~~~~ 68 (468) -+++|+|+|||+|||||||+|||++.|||+|+++|||||||+++|++.+ |+|+ ..+++|+|++.+ T Consensus 2 ~~~~~~l~~kw~p~l~~~~~~~i~~~~~~~~~a~l~enq~~~~~~~~~~~~~~~~e~~~~~l~e~---~~~~~~~~~~~~ 78 (529) T protein:vir:10 2 SLKTKEILNKWTPLLEGEGLPEIAGKNKQALVAQILEAQEKDSKTDPVYRDDKLIEAFGQSLMEA---EVAGDHGYDPTN 78 (529) T ss_pred ccchHHHHHHhhHhhcCCccchhcchhhhhhhhhhhhhHHHHhhcccccchhhhhhhhhhccchh---hccccccccccc Confidence 5788999999999999999999999999999999999999999999866 5554 456889999999 Q ss_pred ccccccccccccccceehhhhHHhhhhhhhhheeeeecCCccceeeeeeeeeecCCC----Ccccc--cccCCccccccc Q lcl|Aclame:pro 69 ALGSANTGGLAGFDPVLISLVRRAMPNLMAYDVCGVQPMSGPTGLIFAMRSRYENQA----GEEAL--FNEPDTGFTGGY 142 (468) Q Consensus 69 i~~st~tg~i~~~~P~Lv~l~RRa~~~LI~~DI~GVQPmTGPTGLIFAMRsrY~~qs----G~EA~--fnEa~t~fSg~~ 142 (468) |++|++|++|+++||+||+||||++|||||+|||||||||||||||||||+||.+|. +.|+| ++|+|+.|||.+ T Consensus 79 ia~s~~t~~v~~~~P~Li~lvRra~p~LIa~DIwGVQPMTgPTGLIFAMRsrY~~~~~~~~g~eaf~~~~e~dt~~SG~~ 158 (529) T protein:vir:10 79 IAAGQSSGAITNIGPAVIGMVRRAIPSLIAFDIAGVQPMTGPTGQVFALRSVYGKDPLAAGAKEAFHPMYAPDAWHSGLA 158 (529) T ss_pred ccccccccccccccchhhhhHHHHHHhHHhhhhheeccCCchhhhhhhheeeecCCcCCCcccccccccccccccccccc Confidence 999999999999999999999999999999999999999999999999999999874 55665 689999999876 Q ss_pred cccccccccccC------------------------------ccccCCCcccc-----c---cccccccccccccccccc Q lcl|Aclame:pro 143 DASQGDYAVRTG------------------------------AGVGGDSEGNN-----P---ALLNDAAPGTYEVGSKMP 184 (468) Q Consensus 143 ~~~~~~~~~~~~------------------------------~~~~~~~~gt~-----~---~~~~~a~~~~~t~~~gm~ 184 (468) ............ ......+.+++ + ...+.+....++++.||+ T Consensus 159 ~~~~~~~~~~~~~~~~t~~~a~~~~~~~~~~~nea~t~~s~~~tg~~~~~g~~~tg~~~~~~~~~~~a~~~~~~~~~gms 238 (529) T protein:vir:10 159 AKGATTSSDGTPFAALTAGQAVATGDIVYHFFYESGSAYLQNVTGGNVTVGTNETGAALDALVSAKIAAGELAEIAEGMA 238 (529) T ss_pred cccccccccccccccccccceeeccccceeeecccccccccccccccccccccccCCccccccccccccccccccccccc Confidence 443211110000 00000000111 0 011223345688999999 Q ss_pred hhhhhccC----CCCcchhhcceEEEEEEEEeecccccccccHHHHHHHHHhcCCChhHHHHHHHHHHHHHHhhHHHHHH Q lcl|Aclame:pro 185 REDLERMG----EANRLFREMSFSIEKTSVTAQSRALKAEYTLELAQDLKAIHGLDAEQELANILSSEVLAEINREVVRR 260 (468) Q Consensus 185 Ta~aE~lG----~~~~~f~EMaFsIeK~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAE~ELanILStEImlEINREii~~ 260 (468) |+.+|.|+ +++++|+||+|+||||+|||||||||||||||||||||||||||||+||+||||||||+||||||||+ T Consensus 239 Ta~aEal~~~g~ss~~~f~EMaFsIeK~tVtAKSRaLKAEYTiELAQDLKAvHGLDAEtELsNILStEImlEINReii~~ 318 (529) T protein:vir:10 239 TSIAELRQGFNGTTDNPWNEMSFRIDKQTVEAKSRQLKAQYSIELAQDLRAVHGMDADSELNGILANEVMLEINREVIDW 318 (529) T ss_pred hhhhhccccCCCCccccccceeeEEEEEEEeeeccceeccccHHHHHHHHHhcCCChHHHHHHHHHHHHHHHhhHHHHHH Confidence 99999883 45678999999999999999999999999999999999999999999999999999999999999997 Q ss_pred Hhhhhhccccccc-----ccccccccccccc---chhHHHHHHHHHHHHHHHHHHHHHhhcCCCccEEEEchhHHHHHHh Q lcl|Aclame:pro 261 VYTVAKKGAQNNV-----ANAGIFDLDVDSN---GRWSVEKFKGLLFQVERDANAIAQETRRGKGNFLICSADVASALAM 332 (468) Q Consensus 261 l~~va~~~k~~~~-----~~~g~~Dl~~~~~---grw~~e~~k~L~~~i~~ean~i~~~T~r~~~n~~v~S~~Va~~L~~ 332 (468) |+.+|+.++ +++ +.+|+|||+++.| +||++|+||+|++|||+|||+|+|+|+||+||||||||+||++|+| T Consensus 319 i~~~a~~~~-~g~~~~~~~~~gv~d~~~~~d~~~~~~~~e~~~~L~~~i~~~an~I~~~T~rg~~n~vi~S~~Va~~L~~ 397 (529) T protein:vir:10 319 INYTAQVGK-SGWTQTVGSAAGVFDFQDPIDVRGARWAGESYKALLIQIDKEANEIARQTGRGAGNFIIASRNVVSALAL 397 (529) T ss_pred hhhhceeee-eeeeccccccccceeccccccccccchhHHHHHHHHHHHHHHHHHHHHhhccccceEEEEchHHHHHHhh Confidence 666665544 443 6899999998876 8999999999999999999999999999999999999999999999 Q ss_pred hcccccccccccccccccccccccCceeEEEecCceEEEEcccccccCCcceEEEEEecCCcccceeEeeccchhhcccc Q lcl|Aclame:pro 333 AGVLDYSSGLNGAGGPSIGEVDDTGNLAVGTINGRIKVFVDPYAANLSDKHYYVIGYKGTSPYDAGLFYCPYVPLQMVRS 412 (468) Q Consensus 333 sG~~~~~~~~~~~~~~~~~~~d~t~~~~~G~l~g~~~vy~D~Ya~~~~~~dY~~vG~KG~~~~d~glfyaPYv~l~~~~~ 412 (468) .|+++++++...+.+ .++|+++.+|+|+|+|||+||||||+ ++|||+|||||++++|+||||||||||+|+++ T Consensus 398 ~~~~~~~~~~~~~sg---~~~d~~~~~~~G~l~~~~~vy~D~y~----~~dy~~vG~KG~~~~~~glfy~PYv~l~~~~~ 470 (529) T protein:vir:10 398 VDAGITPAAQGMASG---LNADTTKGVFAGVLGGRYKVYIDQYA----RQDYFTMGYRGANNLDAGIYYCPYVALTPLRG 470 (529) T ss_pred hcccccccccccccc---ceeecCCceEEEEecCceEEEecCCC----CcceEEEEEeCCcccccceeeccccccccccc Confidence 999998888776655 67899999999999999999999996 48999999999999999999999999999999 Q ss_pred cCCccccceeeeeeeeeeeecCcccccCc--cccccchhhhhhhc--ccceeeeeeeec Q lcl|Aclame:pro 413 IDPNTFQPKIGFKTRYGMVSNPFVTTNGL--YNGTPDGEALTPNA--NMYYRRVQVTNL 467 (468) Q Consensus 413 ~dp~s~qP~~g~~tRY~l~~nP~~~~~~~--~~~~~~~~~~~~~a--n~y~~r~~v~~l 467 (468) +||+||||++||||||||++|||++..++ .+..+++.+|.++| |.||||++|+|| T Consensus 471 ~dp~sfqP~~g~~tRY~l~~NP~~~~~~~~~~~r~~~g~~~~~~ag~n~~~r~~~Vk~l 529 (529) T protein:vir:10 471 SDPKNFQPVMGFKTRYAIGVNPFAESRTQAPTSRISNGMPGAHSVGKNAYFRRVWVKGL 529 (529) T ss_pred cCCCcccceeeeeeeeceeecCccccccccccccccCCcchhhhcCccceeeEeeeccC Confidence 99999999999999999999999987655 45689999999877 889999999999 No 16 >protein:vir:107947 Length: 519 # NCBI annotation: gp23 major head protein # Family: family:all:364 # MgeID: mge:2002 # MgeName: JS98 # Cross-refs: genbank:acc:YP_001595301;genbank:gi:161622607;genbank:GeneID:5783666 Probab=100.00 E-value=1e-216 Score=1204.74 Aligned_cols=456 Identities=35% Similarity=0.605 Sum_probs=392.6 Q ss_pred cchHHHHHhhhhhhCCCccchhcchhhhHHHHHHHhHHHHHHhhhhhhhhhhhhhhc---------Cccccccccccccc Q lcl|Aclame:pro 2 FNAEHLQEKWSPVLNHGEAPAIGDRYKRAVTSVLLENQERFLREERGMLNEVAVNSL---------GAGTIAPAGSALGS 72 (468) Q Consensus 2 ~~~~~l~~kw~p~l~~~~~~~i~~~~~~~~~~~llenq~~~~~~~~~~l~e~~~~~~---------~~~~~~~~~~i~~s 72 (468) |+.|+|+|||+|||||||+|+|++.|||+|+++||||||+++++.+.+++|..++.+ +++|+|++++|++| T Consensus 1 ~~~~~l~~kw~p~l~~~~~~~i~~~~~~~i~~~~~en~~~~~~~~~~~~~~~~~~~~~~~l~e~~~~~~~~~~~t~i~~~ 80 (519) T protein:vir:10 1 MKKNALVQKWSALLENEALPEIVGASKQAIIAKIFENQEQDILTAPEYRDEKISEAFGSFLTEAEIGGDHGYDATNIAAG 80 (519) T ss_pred CchhHHHHHhHHhhcccccchhhhhhhHHHHHHHHHHHHHHhhhcccccchHHHHHHhhhcchhccCCccccCccccccc Confidence 999999999999999999999999999999999999999999999866666555444 56999999999999 Q ss_pred ccccccccccceehhhhHHhhhhhhhhheeeeecCCccceeeeeeeeeecCCC----Ccccc--cccCCccccccccccc Q lcl|Aclame:pro 73 ANTGGLAGFDPVLISLVRRAMPNLMAYDVCGVQPMSGPTGLIFAMRSRYENQA----GEEAL--FNEPDTGFTGGYDASQ 146 (468) Q Consensus 73 t~tg~i~~~~P~Lv~l~RRa~~~LI~~DI~GVQPmTGPTGLIFAMRsrY~~qs----G~EA~--fnEa~t~fSg~~~~~~ 146 (468) ++|++|++++|+||+|+||++|||||+|||||||||||||||||||+||.++. +.|+| |+|+|+.|||.+.... T Consensus 81 ~~t~~v~~~~P~l~~l~rRa~p~LIa~DIwGVQPMTgPTGLIFAMRsrY~n~~~~~~g~ea~~~~nEadt~fSG~~~~~~ 160 (519) T protein:vir:10 81 QTSGAVTQIGPAVMGMVRRAIPHLIAFDICGVQPLNNPTGQVFALRAVYGKDPIAAGAKEAFHPMYAPNAMFSGQGAAET 160 (519) T ss_pred cccccccccchhHHHHHHHHHHhhhhhhhheeecCCchhhhhheeeeeecCCccccccccccccccccccccCccccccc Confidence 99999999999999999999999999999999999999999999999999875 45555 6999999999865433 Q ss_pred cccccccCccc----------------------cC-CCccccc------cccccccccccccccccchhhhhcc---C-C Q lcl|Aclame:pro 147 GDYAVRTGAGV----------------------GG-DSEGNNP------ALLNDAAPGTYEVGSKMPREDLERM---G-E 193 (468) Q Consensus 147 ~~~~~~~~~~~----------------------~~-~~~gt~~------~~~~~a~~~~~t~~~gm~Ta~aE~l---G-~ 193 (468) ........... .+ ...++++ ........+.++++.||+|+.+|.+ | + T Consensus 161 ~~~~~~~~~~~~g~~~~~~~~~s~~~~~~~~~~~t~~ag~t~~~~~~~a~~~~~~~~~~~~~~~gmsTa~aEal~~lggs 240 (519) T protein:vir:10 161 FEALAASKVLEVGKIYSHFFEATGSAHFQAVEAVTVDAGATDAAKLDAAVTALVEAGQLAEIAEGMATSIAELQEGFNGS 240 (519) T ss_pred cccccccccccccccccccccccccceeccccccccCCCCcCccccccccccccccccccccccccccchhhccccCCCc Confidence 22111111000 00 0001111 1112233466889999999999975 3 3 Q ss_pred CCcchhhcceEEEEEEEEeecccccccccHHHHHHHHHhcCCChhHHHHHHHHHHHHHHhhHHHHHHHhhhhhccccccc Q lcl|Aclame:pro 194 ANRLFREMSFSIEKTSVTAQSRALKAEYTLELAQDLKAIHGLDAEQELANILSSEVLAEINREVVRRVYTVAKKGAQNNV 273 (468) Q Consensus 194 ~~~~f~EMaFsIeK~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAE~ELanILStEImlEINREii~~l~~va~~~k~~~~ 273 (468) ++++|+||+|+||||+|||||||||||||||||||||||||||||+||+||||||||+|||||||++|+. +.|+++.++ T Consensus 241 s~~~f~EMaFsIeKvTVtAKSRaLKAEYTiELAQDLKAVHGLDAEtELaNILSTEImlEINReii~~i~~-sa~~~~~g~ 319 (519) T protein:vir:10 241 TDNPWNEMGFRIDKQVIEAKSRQLKASYSIELAQDLRAVHGMDADAELSGILATEIMLEINREVIDWINY-SAQVGKSGM 319 (519) T ss_pred cccchhhhceeEEEEEEeeecccccccccHHHHHHHHHhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhh-hhhcceeec Confidence 4678999999999999999999999999999999999999999999999999999999999999987554 445555666 Q ss_pred cc-----ccccccccccc---chhHHHHHHHHHHHHHHHHHHHHHhhcCCCccEEEEchhHHHHHHhhcccccccccccc Q lcl|Aclame:pro 274 AN-----AGIFDLDVDSN---GRWSVEKFKGLLFQVERDANAIAQETRRGKGNFLICSADVASALAMAGVLDYSSGLNGA 345 (468) Q Consensus 274 ~~-----~g~~Dl~~~~~---grw~~e~~k~L~~~i~~ean~i~~~T~r~~~n~~v~S~~Va~~L~~sG~~~~~~~~~~~ 345 (468) ++ .|+|||+++.| +||++|+||+|+||||||||+|+|+|+||+|||||||||||++|+|+|++++.++...+ T Consensus 320 t~~~~~~aGv~d~~~~~d~~~~rw~~e~~k~L~~~i~~~an~I~~~T~r~~gn~ii~S~~Va~~L~~~g~~~~~~~~~~~ 399 (519) T protein:vir:10 320 TNTVGAKAGVFDFQDPIDIRGARWAGESFKALLFQIDKEAAEIARQTGRGAGNFIIASRNVVNVLAAVDTSVSYAAQGLG 399 (519) T ss_pred ccCcccccceeecccccccccchHHHHHHHHHHHHHHHHHHHHHHhhccccccEEEEchHHHHHHhhccchhcccccccc Confidence 65 59999999966 99999999999999999999999999999999999999999999999999999987776 Q ss_pred cccccccccccCceeEEEecCceEEEEcccccccCCcceEEEEEecCCcccceeEeeccchhhcccccCCccccceeeee Q lcl|Aclame:pro 346 GGPSIGEVDDTGNLAVGTINGRIKVFVDPYAANLSDKHYYVIGYKGTSPYDAGLFYCPYVPLQMVRSIDPNTFQPKIGFK 425 (468) Q Consensus 346 ~~~~~~~~d~t~~~~~G~l~g~~~vy~D~Ya~~~~~~dY~~vG~KG~~~~d~glfyaPYv~l~~~~~~dp~s~qP~~g~~ 425 (468) .+ .++|+++.+|+|+|+|||+||||||+. +|||+|||||++++|+||||||||||+|++++||+||||++||| T Consensus 400 ~~---~~~d~~~~~~~G~l~~~~~vy~D~y~~----~dy~~vG~KG~~~~~~glfyaPYv~l~~~~~~dp~sfqP~~g~~ 472 (519) T protein:vir:10 400 QG---FNVDTTKAVFAGVLGGKYRVYIDQYAR----SDYFTIGYKGSNEMDAGIYYAPYVALTPLRGSDPKNFQPVMGFK 472 (519) T ss_pred cc---ccccCCCceEEEEecCceEEEecCCCC----cceEEEEEecCcccccceeeccccccccccccCCccccceeeee Confidence 65 689999999999999999999999965 79999999999999999999999999999999999999999999 Q ss_pred eeeeeeecCcccccCcccc--ccchhhhhh-----hcccceeeeeeeec Q lcl|Aclame:pro 426 TRYGMVSNPFVTTNGLYNG--TPDGEALTP-----NANMYYRRVQVTNL 467 (468) Q Consensus 426 tRY~l~~nP~~~~~~~~~~--~~~~~~~~~-----~an~y~~r~~v~~l 467 (468) |||||++|||++...+... +.++ |++ +.|.|||||+|+|| T Consensus 473 tRY~l~~NP~~~~~~~~~~~~i~~g--~~~~a~~~~~n~y~r~v~v~~~ 519 (519) T protein:vir:10 473 TRYGIGINPFADPAAQAPTKRIQNG--MPDIVNSLGLNGYFRRVYVKGI 519 (519) T ss_pred eeeceeecCcccccccCccceeccC--chhhhccccCceeeeeeeeecC Confidence 9999999999964433222 3443 443 44889999999999 No 17 >protein:vir:5942 Length: 523 # NCBI annotation: similar to major head protein # Family: family:all:364 # MgeID: mge:123 # MgeName: RM 378 # Cross-refs: genbank:acc:NP_835728;genbank:gi:30044131 Probab=100.00 E-value=2.1e-194 Score=1082.40 Aligned_cols=405 Identities=27% Similarity=0.425 Sum_probs=336.7 Q ss_pred Ccc---hHHHHHhhhhhhCCCccchhcchhhhHHHHHHHhHHHHHHhhhhhhhhhhhhhhcCcccccccccccccccccc Q lcl|Aclame:pro 1 MFN---AEHLQEKWSPVLNHGEAPAIGDRYKRAVTSVLLENQERFLREERGMLNEVAVNSLGAGTIAPAGSALGSANTGG 77 (468) Q Consensus 1 ~~~---~~~l~~kw~p~l~~~~~~~i~~~~~~~~~~~llenq~~~~~~~~~~l~e~~~~~~~~~~~~~~~~i~~st~tg~ 77 (468) |.. +|+|+|||+||||+ |++.|||+||++|||||++ |++++|+|+ +.+++ T Consensus 1 ~~~~~~~e~l~~kw~p~l~~-----~~~~~~~~~~a~llenq~~---~~~~~l~e~-------------------~~~~~ 53 (523) T protein:vir:59 1 MSQPKINEQLIEKWQPLLEG-----CRNDWERHTLATLLENQYR---EAKKHLMET-------------------TQTTE 53 (523) T ss_pred CCcchhhHHHHHhhhhhhcc-----cCChhHHHHHHHHhhhhhH---HHHHhhhhh-------------------hhccc Confidence 443 57899999999997 6677999999999999985 677888884 45888 Q ss_pred cccccceehhhhHHhhhhhhhhheeeeecCCccceeeeeeeeeecCCCCcccccccCCc--------------ccccccc Q lcl|Aclame:pro 78 LAGFDPVLISLVRRAMPNLMAYDVCGVQPMSGPTGLIFAMRSRYENQAGEEALFNEPDT--------------GFTGGYD 143 (468) Q Consensus 78 i~~~~P~Lv~l~RRa~~~LI~~DI~GVQPmTGPTGLIFAMRsrY~~qsG~EA~fnEa~t--------------~fSg~~~ 143 (468) |++|.| ||+|+||++|||||+||||||||||||||||||||||.+|.|+|++|+++.+ .|++... T Consensus 54 ~~~~~~-~~~~v~r~~p~l~a~DIWGVQPMTGPTGLIFAMRSRY~~q~gteA~yg~~~~~~~~a~~~~~ean~~~s~~~~ 132 (523) T protein:vir:59 54 VDGWNL-ALPIVRRVFANLRATDLVSVQPLSLPTGLVFYLDFKSPELPGNGSVYGGTGLTTDTATGGLYDENARLSRREY 132 (523) T ss_pred cccccc-hhhhhhhHhhhhhhhhccccccCCCCcceeEEEEeeccCCCCcccccCccccCcccccccccccccccccccc Confidence 999996 9999999999999999999999999999999999999999999999876444 3333211 Q ss_pred cccccccccc-C------------------cc-------------ccCC---------------C--------------- Q lcl|Aclame:pro 144 ASQGDYAVRT-G------------------AG-------------VGGD---------------S--------------- 161 (468) Q Consensus 144 ~~~~~~~~~~-~------------------~~-------------~~~~---------------~--------------- 161 (468) .......... + +. .... . T Consensus 133 ~~~~~~d~~~sg~~~~~~~a~stg~A~a~~s~si~k~~vTa~s~agta~~~li~A~~~q~itg~tga~fa~s~~~an~as 212 (523) T protein:vir:59 133 ETTITVDLATAQQATMRDVGFDTGIASLVSSGAVYYVDVPVASLPGVADVNTVRFWQYDDASGDPENTVAYPLPRYNRIV 212 (523) T ss_pred cCccCCCcccccccccccccccccchhhccccceeeeeccccccccccccccccccccccccccccccccchhhcccccc Confidence 1100000000 0 00 0000 0 Q ss_pred c--------------ccccccc------ccccccccccccccchhhhhccCC------CCcchhhcceEEEEEEEEeecc Q lcl|Aclame:pro 162 E--------------GNNPALL------NDAAPGTYEVGSKMPREDLERMGE------ANRLFREMSFSIEKTSVTAQSR 215 (468) Q Consensus 162 ~--------------gt~~~~~------~~a~~~~~t~~~gm~Ta~aE~lG~------~~~~f~EMaFsIeK~tVtAKSR 215 (468) . +++.... .......|+.+.||+|+.+|.+|+ ++++|+||+|+||||+|||||| T Consensus 213 tAss~Al~gEA~t~~sTd~at~~~Gtt~t~~~~~lyt~~~g~~t~~~~~~~~~~~~~~~~~~~~eM~FsIeK~tVtAkSR 292 (523) T protein:vir:59 213 GAVGSALYARLFFVTGSDFATVAGGTPSTQDLDLVYYIDARNDFEDQSTDPDYPDPGFQSLDIPEINLELRSRPVATKTR 292 (523) T ss_pred ccccccccccccccccccccccCCCcccccccccccccccccchhhccccccccccccccccccceeeEEEeEEEeeecc Confidence 0 0000000 000112366678999999998864 3568999999999999999999 Q ss_pred cccccccHHHHHHHHHhc-CCChhHHHHHHHHHHHHHHhhHHHHHHHhhhhhccccccccccccccccccccchhH---- Q lcl|Aclame:pro 216 ALKAEYTLELAQDLKAIH-GLDAEQELANILSSEVLAEINREVVRRVYTVAKKGAQNNVANAGIFDLDVDSNGRWS---- 290 (468) Q Consensus 216 aLKAEYT~ELAQDLkAiH-GLDAE~ELanILStEImlEINREii~~l~~va~~~k~~~~~~~g~~Dl~~~~~grw~---- 290 (468) |||||||||||||||||| |||||+||+|||||||||||||||||+||++|+|||+.+++++|+|||+++.++||. T Consensus 293 aLKAeYT~ELAQDLKAiH~GLDAE~ELanILStEImlEINR~ii~~~~~~a~~~~~~~~~~~g~~~~~~~~~~~~~~~~~ 372 (523) T protein:vir:59 293 KLRAAWTPEAMQDLAAYHKGVDLENEIVTLMSQYIAREIDLEILSTIMAHARRTDNYGFWSEVVGEYYDETSGNFVAGNF 372 (523) T ss_pred cccccccHHHHHHHHHHhcCCChhHHHHHHHHHHHHHHhhHHHHHhHhhhheeeeeccccccceeeecccccchhhhhhh Confidence 999999999999999999 999999999999999999999999999999999999999999999999999999997 Q ss_pred ----HHHHHHHHHHHHHHHHHHHHhhcCCCccEEEEchhHHHHHHhhcccccccccccccccccccccccCceeEEEecC Q lcl|Aclame:pro 291 ----VEKFKGLLFQVERDANAIAQETRRGKGNFLICSADVASALAMAGVLDYSSGLNGAGGPSIGEVDDTGNLAVGTING 366 (468) Q Consensus 291 ----~e~~k~L~~~i~~ean~i~~~T~r~~~n~~v~S~~Va~~L~~sG~~~~~~~~~~~~~~~~~~~d~t~~~~~G~l~g 366 (468) +|.||.|++|||+|||+|+|+|+||+|||||||||||++|++||||++.+. ..+|+++.+|+|+|+| T Consensus 373 ~~~~~e~~~~l~~~~~~~~n~i~~~t~~~~~~~~~~s~~v~~~l~~~~~~~~~~~---------~~~~~~~~~~~g~l~~ 443 (523) T protein:vir:59 373 YGSKQEWLATLMIELNKVSNRIQQKTAVAGANFLVTSPQVAALLESMPGFTPGND---------NRDGGTGIFYVGMVQG 443 (523) T ss_pred hhhhHHHHHHHHHHHHHHHHHHHHhcccccccEEEEchhHHHHHHhccccccCCc---------cccccccceeEEEecC Confidence 899999999999999999999999999999999999999999999975533 3578999999999999 Q ss_pred ceEEEEcccccccCCcceEEEEEecC-CcccceeEeeccchhhccccc-CCccccceeeeeeeeeeee-cCcccccCccc Q lcl|Aclame:pro 367 RIKVFVDPYAANLSDKHYYVIGYKGT-SPYDAGLFYCPYVPLQMVRSI-DPNTFQPKIGFKTRYGMVS-NPFVTTNGLYN 443 (468) Q Consensus 367 ~~~vy~D~Ya~~~~~~dY~~vG~KG~-~~~d~glfyaPYv~l~~~~~~-dp~s~qP~~g~~tRY~l~~-nP~~~~~~~~~ 443 (468) ||+||||||+ ++|||+|||||+ +++|+||||||||||++++.+ ||+||||++||||||||++ |||+..-- T Consensus 444 ~~~vy~d~~~----~~dy~~~g~k~~~~~~~~~~~y~Py~~l~~~~~~~dp~s~qp~~~~~tRY~l~v~nP~~~~~~--- 516 (523) T protein:vir:59 444 RYRLYKNIYQ----NQPVIIMGNQDLNTPWQTGAVYAPYVPLLFTPTIVDPVNFSYRRGLMTRYALEVVRPEFYGLL--- 516 (523) T ss_pred ceEEEecCCC----CcceEEEEecccCCcccccceecccchhhcccccccCCcccceeeeeeehhheecchhHhhhh--- Confidence 9999999985 589999999994 699999999999999999996 9999999999999999986 99987421 Q ss_pred cccchhhhhhhcccceeeeeeeecC Q lcl|Aclame:pro 444 GTPDGEALTPNANMYYRRVQVTNLM 468 (468) Q Consensus 444 ~~~~~~~~~~~an~y~~r~~v~~l~ 468 (468) | |+-|- T Consensus 517 --------------~-----~~~~~ 522 (523) T protein:vir:59 517 --------------Y-----VKLLQ 522 (523) T ss_pred --------------h-----hhhcC Confidence 0 00000 No 18 >protein:vir:1886 Length: 385 # NCBI annotation: major capsid subunit precursor # Family: family:all:585 # MgeID: mge:41 # MgeName: HK022 # Cross-refs: genbank:acc:NP_037666;genbank:gi:9634124;genbank:GeneID:1262513 Probab=96.80 E-value=0.00033 Score=39.68 Aligned_cols=333 Identities=12% Similarity=0.070 Sum_probs=130.0 Q ss_pred CcchHHHHHhhhhhhCCCccchhcchhhhHH--HH---HHHhHH----HHHHhhhhhhhhhhhhhhcC------------ Q lcl|Aclame:pro 1 MFNAEHLQEKWSPVLNHGEAPAIGDRYKRAV--TS---VLLENQ----ERFLREERGMLNEVAVNSLG------------ 59 (468) Q Consensus 1 ~~~~~~l~~kw~p~l~~~~~~~i~~~~~~~~--~~---~llenq----~~~~~~~~~~l~e~~~~~~~------------ 59 (468) |-+-++|.++..-+.+. +-++.+.-+..+ +. .=|+.+ ++.+++....+.+..+.... T Consensus 1 M~~l~el~~~~~~~~~e--~~~l~~~~~~e~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 78 (385) T protein:vir:18 1 MSELALIQKAIEESQQK--MTQLFDAQKAEIESTGQVSKQLQSDLMKVQEELTKSGTRLFDLEQKLASGAENPGEKKSFS 78 (385) T ss_pred ChHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccchhhhhH Confidence 77777888887766543 222222222111 00 001111 11111111111111100000 Q ss_pred ----------------c-cccccccccccccc-cccc--ccccceehhhhHHhhhhhhhhheeeeecCCccceeeeeeee Q lcl|Aclame:pro 60 ----------------A-GTIAPAGSALGSAN-TGGL--AGFDPVLISLVRRAMPNLMAYDVCGVQPMSGPTGLIFAMRS 119 (468) Q Consensus 60 ----------------~-~~~~~~~~i~~st~-tg~i--~~~~P~Lv~l~RRa~~~LI~~DI~GVQPmTGPTGLIFAMRs 119 (468) . ........+...++ .|.+ ....+.+ +++......-.+++.++||+++..-+.- T Consensus 79 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~i~~~~~~~i---i~~~~~~~~l~~~~~~~~~~~~~~~~~~--- 152 (385) T protein:vir:18 79 ERAAEELIKSWDGKQGTFGAKTFNKSLGSDADSAGSLIQPMQIPGI---IMPGLRRLTIRDLLAQGRTSSNALEYVR--- 152 (385) T ss_pred HHHHHHHHHHHHHhhccchhhHHHhhhccccccCCceecchhhhHH---HHHhhhccchhhhcceecccCcceEEEE--- Confidence 0 00000001111111 1111 1122333 3334445566778888888776532211 Q ss_pred eecCCCCcccccccCCccccccccccccccccccCccccCCCccccccccccccccccccccccchhhhhccCCCCcchh Q lcl|Aclame:pro 120 RYENQAGEEALFNEPDTGFTGGYDASQGDYAVRTGAGVGGDSEGNNPALLNDAAPGTYEVGSKMPREDLERMGEANRLFR 199 (468) Q Consensus 120 rY~~qsG~EA~fnEa~t~fSg~~~~~~~~~~~~~~~~~~~~~~gt~~~~~~~a~~~~~t~~~gm~Ta~aE~lG~~~~~f~ 199 (468) +....+ .+ .| . +| +..++ T Consensus 153 -~~~~~~-~a-------~~----------------------------------------------v--~E-----~~~~~ 170 (385) T protein:vir:18 153 -EEVFTN-NA-------DV----------------------------------------------V--AE-----KALKP 170 (385) T ss_pred -EecCCc-ce-------ee----------------------------------------------e--cc-----Ccccc Confidence 100000 00 00 0 01 12345 Q ss_pred hcceEEEEEEEEeecccccccccHHHHHHHHHhcCCChhHHHHHHHHHHHHHHhhHHHHHHHhhhhhccccccccccccc Q lcl|Aclame:pro 200 EMSFSIEKTSVTAQSRALKAEYTLELAQDLKAIHGLDAEQELANILSSEVLAEINREVVRRVYTVAKKGAQNNVANAGIF 279 (468) Q Consensus 200 EMaFsIeK~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAE~ELanILStEImlEINREii~~l~~va~~~k~~~~~~~g~~ 279 (468) +-..++++++.+.|.-+-...+|-||.||-- +.++.|.+-|+..|..-+|+.||.- ...++ ...|++ T Consensus 171 ~~~~~~~~~~~~~~k~~~~~~is~ell~d~~-----~l~~~i~~~la~a~~~~~d~~~l~G----~g~~~----~~~Gi~ 237 (385) T protein:vir:18 171 ESDITFSKQTANVKTIAHWVQASRQVMDDAP-----MLQSYINNRLMYGLALKEEGQLLNG----DGTGD----NLEGLN 237 (385) T ss_pred ccccceeEEEEeeeeEEEeehhhHHHHhhHH-----HHHHHHHHHHHHHHHHHHHHHHHhc----cCCCC----cccccc Confidence 5556667777777777777889999999842 3566777777777777777766632 11111 111221 Q ss_pred cccccccchhHHHHHHHHHHH-HHHHHHHHHHhhcCCCccEEEEchhHHHHHHhhcccccccccccccccccccccccCc Q lcl|Aclame:pro 280 DLDVDSNGRWSVEKFKGLLFQ-VERDANAIAQETRRGKGNFLICSADVASALAMAGVLDYSSGLNGAGGPSIGEVDDTGN 358 (468) Q Consensus 280 Dl~~~~~grw~~e~~k~L~~~-i~~ean~i~~~T~r~~~n~~v~S~~Va~~L~~sG~~~~~~~~~~~~~~~~~~~d~t~~ 358 (468) ........-+. ..-...+. |......+ ...-+.++-+||||+....|... ..+.+ + ........+. T Consensus 238 ~~~~~~~~~~~--~~~~~~~d~i~~~~~~l--~~~~~~~~~~~~~~~~~~~l~~l---kd~~G---~---~l~~~~~~~~ 304 (385) T protein:vir:18 238 KVATAYDTSLN--ATGDTRADIIAHAIYQV--TESEFSASGIVLNPRDWHNIALL---KDNEG---R---YIFGGPQAFT 304 (385) T ss_pred ccccccccccc--ccccchHHHHHHHHHhh--ccccCCCCEEEEcHHHHHHHHHh---hcCCC---c---eeccCcccCC Confidence 11100000000 00000111 22222222 12235667899999999988752 21111 1 1111111111 Q ss_pred eeEEEecCceEEEEcccccccCCcceEEEEE-ecCCcccceeEeeccchhhcccccCC---ccc-cceee--eeeeeee- Q lcl|Aclame:pro 359 LAVGTINGRIKVFVDPYAANLSDKHYYVIGY-KGTSPYDAGLFYCPYVPLQMVRSIDP---NTF-QPKIG--FKTRYGM- 430 (468) Q Consensus 359 ~~~G~l~g~~~vy~D~Ya~~~~~~dY~~vG~-KG~~~~d~glfyaPYv~l~~~~~~dp---~s~-qP~~g--~~tRY~l- 430 (468) .++|. |++|+++.+.. ..=+++|- +. +|--+....+...++. +-| +..++ ...||+. T Consensus 305 --~~~l~-G~pV~~~~~~p----~~~~~~gd~~~--------~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~r~~~~ 369 (385) T protein:vir:18 305 --SNIMW-GLPVVPTKAQA----AGTFTVGGFDM--------ASQVWDRMDATVEVSREDRDNFVKNMLTILCEERLALA 369 (385) T ss_pred --Cceec-ceeeEEcCcCC----CCcEEEeeccc--------EEEEEEecceEEEEeccccchhhcCcEEEEEEEeeccE Confidence 25665 48999997653 22233331 10 0110111111111100 011 22333 3347766 Q ss_pred eecC--cccccCccccc Q lcl|Aclame:pro 431 VSNP--FVTTNGLYNGT 445 (468) Q Consensus 431 ~~nP--~~~~~~~~~~~ 445 (468) +.+| |+..+- .... T Consensus 370 v~~~~a~~~~~~-~aa~ 385 (385) T protein:vir:18 370 HYRPTAIIKGTF-SSGS 385 (385) T ss_pred EecccceEEEEe-ccCC Confidence 3444 322110 0000 No 19 >protein:vir:191 Length: 385 # NCBI annotation: major head subunit precursor # Family: family:all:585 # MgeID: mge:6 # MgeName: HK97 # Cross-refs: genbank:acc:NP_037701;genbank:gi:9634158;genbank:GeneID:1262530 Probab=96.80 E-value=0.00033 Score=39.68 Aligned_cols=333 Identities=12% Similarity=0.070 Sum_probs=130.0 Q ss_pred CcchHHHHHhhhhhhCCCccchhcchhhhHH--HH---HHHhHH----HHHHhhhhhhhhhhhhhhcC------------ Q lcl|Aclame:pro 1 MFNAEHLQEKWSPVLNHGEAPAIGDRYKRAV--TS---VLLENQ----ERFLREERGMLNEVAVNSLG------------ 59 (468) Q Consensus 1 ~~~~~~l~~kw~p~l~~~~~~~i~~~~~~~~--~~---~llenq----~~~~~~~~~~l~e~~~~~~~------------ 59 (468) |-+-++|.++..-+.+. +-++.+.-+..+ +. .=|+.+ ++.+++....+.+..+.... T Consensus 1 M~~l~el~~~~~~~~~e--~~~l~~~~~~e~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 78 (385) T protein:vir:19 1 MSELALIQKAIEESQQK--MTQLFDAQKAEIESTGQVSKQLQSDLMKVQEELTKSGTRLFDLEQKLASGAENPGEKKSFS 78 (385) T ss_pred ChHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccchhhhhH Confidence 77777888887766543 222222222111 00 001111 11111111111111100000 Q ss_pred ----------------c-cccccccccccccc-cccc--ccccceehhhhHHhhhhhhhhheeeeecCCccceeeeeeee Q lcl|Aclame:pro 60 ----------------A-GTIAPAGSALGSAN-TGGL--AGFDPVLISLVRRAMPNLMAYDVCGVQPMSGPTGLIFAMRS 119 (468) Q Consensus 60 ----------------~-~~~~~~~~i~~st~-tg~i--~~~~P~Lv~l~RRa~~~LI~~DI~GVQPmTGPTGLIFAMRs 119 (468) . ........+...++ .|.+ ....+.+ +++......-.+++.++||+++..-+.- T Consensus 79 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~i~~~~~~~i---i~~~~~~~~l~~~~~~~~~~~~~~~~~~--- 152 (385) T protein:vir:19 79 ERAAEELIKSWDGKQGTFGAKTFNKSLGSDADSAGSLIQPMQIPGI---IMPGLRRLTIRDLLAQGRTSSNALEYVR--- 152 (385) T ss_pred HHHHHHHHHHHHHhhccchhhHHHhhhccccccCCceecchhhhHH---HHHhhhccchhhhcceecccCcceEEEE--- Confidence 0 00000001111111 1111 1122333 3334445566778888888776532211 Q ss_pred eecCCCCcccccccCCccccccccccccccccccCccccCCCccccccccccccccccccccccchhhhhccCCCCcchh Q lcl|Aclame:pro 120 RYENQAGEEALFNEPDTGFTGGYDASQGDYAVRTGAGVGGDSEGNNPALLNDAAPGTYEVGSKMPREDLERMGEANRLFR 199 (468) Q Consensus 120 rY~~qsG~EA~fnEa~t~fSg~~~~~~~~~~~~~~~~~~~~~~gt~~~~~~~a~~~~~t~~~gm~Ta~aE~lG~~~~~f~ 199 (468) +....+ .+ .| . +| +..++ T Consensus 153 -~~~~~~-~a-------~~----------------------------------------------v--~E-----~~~~~ 170 (385) T protein:vir:19 153 -EEVFTN-NA-------DV----------------------------------------------V--AE-----KALKP 170 (385) T ss_pred -EecCCc-ce-------ee----------------------------------------------e--cc-----Ccccc Confidence 100000 00 00 0 01 12345 Q ss_pred hcceEEEEEEEEeecccccccccHHHHHHHHHhcCCChhHHHHHHHHHHHHHHhhHHHHHHHhhhhhccccccccccccc Q lcl|Aclame:pro 200 EMSFSIEKTSVTAQSRALKAEYTLELAQDLKAIHGLDAEQELANILSSEVLAEINREVVRRVYTVAKKGAQNNVANAGIF 279 (468) Q Consensus 200 EMaFsIeK~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAE~ELanILStEImlEINREii~~l~~va~~~k~~~~~~~g~~ 279 (468) +-..++++++.+.|.-+-...+|-||.||-- +.++.|.+-|+..|..-+|+.||.- ...++ ...|++ T Consensus 171 ~~~~~~~~~~~~~~k~~~~~~is~ell~d~~-----~l~~~i~~~la~a~~~~~d~~~l~G----~g~~~----~~~Gi~ 237 (385) T protein:vir:19 171 ESDITFSKQTANVKTIAHWVQASRQVMDDAP-----MLQSYINNRLMYGLALKEEGQLLNG----DGTGD----NLEGLN 237 (385) T ss_pred ccccceeEEEEeeeeEEEeehhhHHHHhhHH-----HHHHHHHHHHHHHHHHHHHHHHHhc----cCCCC----cccccc Confidence 5556667777777777777889999999842 3566777777777777777766632 11111 111221 Q ss_pred cccccccchhHHHHHHHHHHH-HHHHHHHHHHhhcCCCccEEEEchhHHHHHHhhcccccccccccccccccccccccCc Q lcl|Aclame:pro 280 DLDVDSNGRWSVEKFKGLLFQ-VERDANAIAQETRRGKGNFLICSADVASALAMAGVLDYSSGLNGAGGPSIGEVDDTGN 358 (468) Q Consensus 280 Dl~~~~~grw~~e~~k~L~~~-i~~ean~i~~~T~r~~~n~~v~S~~Va~~L~~sG~~~~~~~~~~~~~~~~~~~d~t~~ 358 (468) ........-+. ..-...+. |......+ ...-+.++-+||||+....|... ..+.+ + ........+. T Consensus 238 ~~~~~~~~~~~--~~~~~~~d~i~~~~~~l--~~~~~~~~~~~~~~~~~~~l~~l---kd~~G---~---~l~~~~~~~~ 304 (385) T protein:vir:19 238 KVATAYDTSLN--ATGDTRADIIAHAIYQV--TESEFSASGIVLNPRDWHNIALL---KDNEG---R---YIFGGPQAFT 304 (385) T ss_pred ccccccccccc--ccccchHHHHHHHHHhh--ccccCCCCEEEEcHHHHHHHHHh---hcCCC---c---eeccCcccCC Confidence 11100000000 00000111 22222222 12235667899999999988752 21111 1 1111111111 Q ss_pred eeEEEecCceEEEEcccccccCCcceEEEEE-ecCCcccceeEeeccchhhcccccCC---ccc-cceee--eeeeeee- Q lcl|Aclame:pro 359 LAVGTINGRIKVFVDPYAANLSDKHYYVIGY-KGTSPYDAGLFYCPYVPLQMVRSIDP---NTF-QPKIG--FKTRYGM- 430 (468) Q Consensus 359 ~~~G~l~g~~~vy~D~Ya~~~~~~dY~~vG~-KG~~~~d~glfyaPYv~l~~~~~~dp---~s~-qP~~g--~~tRY~l- 430 (468) .++|. |++|+++.+.. ..=+++|- +. +|--+....+...++. +-| +..++ ...||+. T Consensus 305 --~~~l~-G~pV~~~~~~p----~~~~~~gd~~~--------~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~r~~~~ 369 (385) T protein:vir:19 305 --SNIMW-GLPVVPTKAQA----AGTFTVGGFDM--------ASQVWDRMDATVEVSREDRDNFVKNMLTILCEERLALA 369 (385) T ss_pred --Cceec-ceeeEEcCcCC----CCcEEEeeccc--------EEEEEEecceEEEEeccccchhhcCcEEEEEEEeeccE Confidence 25665 48999997653 22233331 10 0110111111111100 011 22333 3347766 Q ss_pred eecC--cccccCccccc Q lcl|Aclame:pro 431 VSNP--FVTTNGLYNGT 445 (468) Q Consensus 431 ~~nP--~~~~~~~~~~~ 445 (468) +.+| |+..+- .... T Consensus 370 v~~~~a~~~~~~-~aa~ 385 (385) T protein:vir:19 370 HYRPTAIIKGTF-SSGS 385 (385) T ss_pred EecccceEEEEe-ccCC Confidence 3444 322110 0000 No 20 >protein:vir:79987 Length: 415 # NCBI annotation: head protein # Family: family:all:21 # MgeID: mge:1875 # MgeName: tp310-3 # Cross-refs: genbank:acc:YP_001430002;genbank:gi:156604057;genbank:GeneID:5525447 Probab=96.76 E-value=0.00036 Score=39.47 Aligned_cols=343 Identities=16% Similarity=0.110 Sum_probs=138.5 Q ss_pred CcchHHHHHhhhhhhCCCccchhcchhhhHHHHHH----------HhHHHHHHhhhhhhhhhh---hh---hhc------ Q lcl|Aclame:pro 1 MFNAEHLQEKWSPVLNHGEAPAIGDRYKRAVTSVL----------LENQERFLREERGMLNEV---AV---NSL------ 58 (468) Q Consensus 1 ~~~~~~l~~kw~p~l~~~~~~~i~~~~~~~~~~~l----------lenq~~~~~~~~~~l~e~---~~---~~~------ 58 (468) |=..++|.++=..+.+. +-+.+..-++.+...- ++.-++++.+....+.+. .. ... T Consensus 1 mk~~~el~~~l~el~~~--~~~~~~e~~~~l~~~~~~~~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~ 78 (415) T protein:vir:79 1 MKTKEELQSEISDIKRQ--IDLKVKYATRALNNDELEKAEKLEQEITDLRSQIQEKQEELDKLKEKDGTSENNQQSVEVN 78 (415) T ss_pred CchHHHHHHHHHHHHHH--HHHHHHHHHHHhchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccccc Confidence 55555666665555432 1111111111111000 111111111100000000 00 000 Q ss_pred ------------------Ccc-------------cccccccccccccccccccccceeh--hhhHHhhhhhhhhheeeee Q lcl|Aclame:pro 59 ------------------GAG-------------TIAPAGSALGSANTGGLAGFDPVLI--SLVRRAMPNLMAYDVCGVQ 105 (468) Q Consensus 59 ------------------~~~-------------~~~~~~~i~~st~tg~i~~~~P~Lv--~l~RRa~~~LI~~DI~GVQ 105 (468) ... -.......+.++++.+-...-|.-+ .++++..+...-.+++.|. T Consensus 79 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~ 158 (415) T protein:vir:79 79 EARTYRNQANINDLGISIQNTKVTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVK 158 (415) T ss_pred hhhhHHHHHHHHHHhhhhhhhhhHHHHHHHHHHHHhhhhhhhhccccccccccccchHHHHHHHHHHHhhhhhhhheeee Confidence 000 0000000001111111111123221 2445555667788999999 Q ss_pred cCCccceeeeeeeeeecCCCCcccccccCCccccccccccccccccccCccccCCCccccccccccccccccccccccch Q lcl|Aclame:pro 106 PMSGPTGLIFAMRSRYENQAGEEALFNEPDTGFTGGYDASQGDYAVRTGAGVGGDSEGNNPALLNDAAPGTYEVGSKMPR 185 (468) Q Consensus 106 PmTGPTGLIFAMRsrY~~qsG~EA~fnEa~t~fSg~~~~~~~~~~~~~~~~~~~~~~gt~~~~~~~a~~~~~t~~~gm~T 185 (468) ||++..+-+--.+ ..+.. .+ .+ . T Consensus 159 ~~~~~~~~~~~~~--~~~~~--~~-------~~----------------------------------------------v 181 (415) T protein:vir:79 159 RVTNGSGKYPVVR--QSEVA--AL-------EK----------------------------------------------V 181 (415) T ss_pred eccCCceeEEEEe--ecCCc--cc-------ee----------------------------------------------e Confidence 9999887654443 11100 00 00 0 Q ss_pred hhhhccCCCC-cchhhcceEEEEEEEEeecccccccccHHHHHHHHHhcCCChhHHHHHHHHHHHHHHhhHHHHHHHhhh Q lcl|Aclame:pro 186 EDLERMGEAN-RLFREMSFSIEKTSVTAQSRALKAEYTLELAQDLKAIHGLDAEQELANILSSEVLAEINREVVRRVYTV 264 (468) Q Consensus 186 a~aE~lG~~~-~~f~EMaFsIeK~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAE~ELanILStEImlEINREii~~l~~v 264 (468) ++.....+.+ ..|.+..|++.|. +-...+|-||.+|- .+|.+++|.+-|+..|..-+|+.||.-.-+- T Consensus 182 ~E~~~~~~~~~~~~~~v~~~~~k~-------~~~~~iS~ell~ds----~~~l~~~i~~~l~~~~~~~~~~~il~g~g~g 250 (415) T protein:vir:79 182 EELEENPELAVKPFFQLAYDINTH-------RGYFRISREAIEDA----KVNVLQELKLWMARTIAATRNKAIIDVITKG 250 (415) T ss_pred ccccccCcccccceeeEEeeeeee-------EeeehhhHHHHhhc----hHHHHHHHHHHHHHHHHHHHHHHHhhccccC Confidence 0001111111 2344455555444 44566999999984 3578999999999999999999998765332 Q ss_pred hhccccccccccccccccccccchhHHHHHHHHHHHHHHHHHHHHHhhcCCCccEEEEchhHHHHHHhhccccccccccc Q lcl|Aclame:pro 265 AKKGAQNNVANAGIFDLDVDSNGRWSVEKFKGLLFQVERDANAIAQETRRGKGNFLICSADVASALAMAGVLDYSSGLNG 344 (468) Q Consensus 265 a~~~k~~~~~~~g~~Dl~~~~~grw~~e~~k~L~~~i~~ean~i~~~T~r~~~n~~v~S~~Va~~L~~sG~~~~~~~~~~ 344 (468) ...+-..+....++. ...++--.++....++..+... -.+++.+||++.....|... ..+- T Consensus 251 ~~~~~~~~~~~~~~~---~~~~~~~~~~~i~~~~~~~~~~---------~~~~~~~v~n~~~~~~l~~l---kd~~---- 311 (415) T protein:vir:79 251 STGSTSSGFEKEGKK---LEVKKAKSLDDIKDAINLNVKP---------NYEHNVAIVSQTMFAKLDKM---KDKL---- 311 (415) T ss_pred ccccccccccccccc---cccccccchhHHHHHHHhhhhh---------ccCCCEEEEcHHHHHHHHHh---hccC---- Confidence 111100000000000 0001111122333343333221 13456789999999988762 2111 Q ss_pred ccccccccccccCceeEEEecCceEEEEcccccccCCcceEEEEEecCCcccceeEeec----cc-----hhhcccccCC Q lcl|Aclame:pro 345 AGGPSIGEVDDTGNLAVGTINGRIKVFVDPYAANLSDKHYYVIGYKGTSPYDAGLFYCP----YV-----PLQMVRSIDP 415 (468) Q Consensus 345 ~~~~~~~~~d~t~~~~~G~l~g~~~vy~D~Ya~~~~~~dY~~vG~KG~~~~d~glfyaP----Yv-----~l~~~~~~dp 415 (468) +..+...+-++. ..++|+ |++|++.++.. .|-.|+. .++|+- |+ .+.+ ...|- T Consensus 312 --G~~l~~~~~~~~-~~~~l~-G~pV~~~~~~~---------~~~~~~~----~~~~Gd~~~~~~~~~~~~~~v-~~~~~ 373 (415) T protein:vir:79 312 --GNYLIQPDVKEK-TQQRLL-GAKIEILPDEV---------LGQKGNN----TLIIGNLKDAIVLFDRSQYQA-SWTDY 373 (415) T ss_pred --CceeeccCcCCC-CCceec-ceeeEEecccc---------cCCCCcc----EEEEEehhccEEEEeecceEE-EEecc Confidence 111111111111 124553 55676653321 1111111 122221 11 1111 11244 Q ss_pred ccccceeeeeeeeeee-ecC--cccc----cCccccccchhhhhhhc Q lcl|Aclame:pro 416 NTFQPKIGFKTRYGMV-SNP--FVTT----NGLYNGTPDGEALTPNA 455 (468) Q Consensus 416 ~s~qP~~g~~tRY~l~-~nP--~~~~----~~~~~~~~~~~~~~~~a 455 (468) .+++..+....|++.. .+| |... ...++| +++..| T Consensus 374 ~~~~~~~~~~~r~d~~v~~~~a~~~~~~~~~~~~~~-----~~~~~~ 415 (415) T protein:vir:79 374 MHFGECLMIAVRQDCRILDYKSAIVIEYDDSERGEG-----DLGLEA 415 (415) T ss_pred ccCceEEEEEEEeccEEeccccEEEEEEeccCCCCC-----ccccCC Confidence 5677778888899764 355 4322 222223 333333 No 21 >protein:vir:81100 Length: 415 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:1891 # MgeName: tp310-1 # Cross-refs: genbank:acc:YP_001429874;genbank:gi:156603927;genbank:GeneID:5525320 Probab=96.76 E-value=0.00036 Score=39.47 Aligned_cols=343 Identities=16% Similarity=0.110 Sum_probs=138.5 Q ss_pred CcchHHHHHhhhhhhCCCccchhcchhhhHHHHHH----------HhHHHHHHhhhhhhhhhh---hh---hhc------ Q lcl|Aclame:pro 1 MFNAEHLQEKWSPVLNHGEAPAIGDRYKRAVTSVL----------LENQERFLREERGMLNEV---AV---NSL------ 58 (468) Q Consensus 1 ~~~~~~l~~kw~p~l~~~~~~~i~~~~~~~~~~~l----------lenq~~~~~~~~~~l~e~---~~---~~~------ 58 (468) |=..++|.++=..+.+. +-+.+..-++.+...- ++.-++++.+....+.+. .. ... T Consensus 1 mk~~~el~~~l~el~~~--~~~~~~e~~~~l~~~~~~~~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~ 78 (415) T protein:vir:81 1 MKTKEELQSEISDIKRQ--IDLKVKYATRALNNDELEKAEKLEQEITDLRSQIQEKQEELDKLKEKDGTSENNQQSVEVN 78 (415) T ss_pred CchHHHHHHHHHHHHHH--HHHHHHHHHHHhchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccccc Confidence 55555666665555432 1111111111111000 111111111100000000 00 000 Q ss_pred ------------------Ccc-------------cccccccccccccccccccccceeh--hhhHHhhhhhhhhheeeee Q lcl|Aclame:pro 59 ------------------GAG-------------TIAPAGSALGSANTGGLAGFDPVLI--SLVRRAMPNLMAYDVCGVQ 105 (468) Q Consensus 59 ------------------~~~-------------~~~~~~~i~~st~tg~i~~~~P~Lv--~l~RRa~~~LI~~DI~GVQ 105 (468) ... -.......+.++++.+-...-|.-+ .++++..+...-.+++.|. T Consensus 79 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~ 158 (415) T protein:vir:81 79 EARTYRNQANINDLGISIQNTKVTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVK 158 (415) T ss_pred hhhhHHHHHHHHHHhhhhhhhhhHHHHHHHHHHHHhhhhhhhhccccccccccccchHHHHHHHHHHHhhhhhhhheeee Confidence 000 0000000001111111111123221 2445555667788999999 Q ss_pred cCCccceeeeeeeeeecCCCCcccccccCCccccccccccccccccccCccccCCCccccccccccccccccccccccch Q lcl|Aclame:pro 106 PMSGPTGLIFAMRSRYENQAGEEALFNEPDTGFTGGYDASQGDYAVRTGAGVGGDSEGNNPALLNDAAPGTYEVGSKMPR 185 (468) Q Consensus 106 PmTGPTGLIFAMRsrY~~qsG~EA~fnEa~t~fSg~~~~~~~~~~~~~~~~~~~~~~gt~~~~~~~a~~~~~t~~~gm~T 185 (468) ||++..+-+--.+ ..+.. .+ .+ . T Consensus 159 ~~~~~~~~~~~~~--~~~~~--~~-------~~----------------------------------------------v 181 (415) T protein:vir:81 159 RVTNGSGKYPVVR--QSEVA--AL-------EK----------------------------------------------V 181 (415) T ss_pred eccCCceeEEEEe--ecCCc--cc-------ee----------------------------------------------e Confidence 9999887654443 11100 00 00 0 Q ss_pred hhhhccCCCC-cchhhcceEEEEEEEEeecccccccccHHHHHHHHHhcCCChhHHHHHHHHHHHHHHhhHHHHHHHhhh Q lcl|Aclame:pro 186 EDLERMGEAN-RLFREMSFSIEKTSVTAQSRALKAEYTLELAQDLKAIHGLDAEQELANILSSEVLAEINREVVRRVYTV 264 (468) Q Consensus 186 a~aE~lG~~~-~~f~EMaFsIeK~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAE~ELanILStEImlEINREii~~l~~v 264 (468) ++.....+.+ ..|.+..|++.|. +-...+|-||.+|- .+|.+++|.+-|+..|..-+|+.||.-.-+- T Consensus 182 ~E~~~~~~~~~~~~~~v~~~~~k~-------~~~~~iS~ell~ds----~~~l~~~i~~~l~~~~~~~~~~~il~g~g~g 250 (415) T protein:vir:81 182 EELEENPELAVKPFFQLAYDINTH-------RGYFRISREAIEDA----KVNVLQELKLWMARTIAATRNKAIIDVITKG 250 (415) T ss_pred ccccccCcccccceeeEEeeeeee-------EeeehhhHHHHhhc----hHHHHHHHHHHHHHHHHHHHHHHHhhccccC Confidence 0001111111 2344455555444 44566999999984 3578999999999999999999998765332 Q ss_pred hhccccccccccccccccccccchhHHHHHHHHHHHHHHHHHHHHHhhcCCCccEEEEchhHHHHHHhhccccccccccc Q lcl|Aclame:pro 265 AKKGAQNNVANAGIFDLDVDSNGRWSVEKFKGLLFQVERDANAIAQETRRGKGNFLICSADVASALAMAGVLDYSSGLNG 344 (468) Q Consensus 265 a~~~k~~~~~~~g~~Dl~~~~~grw~~e~~k~L~~~i~~ean~i~~~T~r~~~n~~v~S~~Va~~L~~sG~~~~~~~~~~ 344 (468) ...+-..+....++. ...++--.++....++..+... -.+++.+||++.....|... ..+- T Consensus 251 ~~~~~~~~~~~~~~~---~~~~~~~~~~~i~~~~~~~~~~---------~~~~~~~v~n~~~~~~l~~l---kd~~---- 311 (415) T protein:vir:81 251 STGSTSSGFEKEGKK---LEVKKAKSLDDIKDAINLNVKP---------NYEHNVAIVSQTMFAKLDKM---KDKL---- 311 (415) T ss_pred ccccccccccccccc---cccccccchhHHHHHHHhhhhh---------ccCCCEEEEcHHHHHHHHHh---hccC---- Confidence 111100000000000 0001111122333343333221 13456789999999988762 2111 Q ss_pred ccccccccccccCceeEEEecCceEEEEcccccccCCcceEEEEEecCCcccceeEeec----cc-----hhhcccccCC Q lcl|Aclame:pro 345 AGGPSIGEVDDTGNLAVGTINGRIKVFVDPYAANLSDKHYYVIGYKGTSPYDAGLFYCP----YV-----PLQMVRSIDP 415 (468) Q Consensus 345 ~~~~~~~~~d~t~~~~~G~l~g~~~vy~D~Ya~~~~~~dY~~vG~KG~~~~d~glfyaP----Yv-----~l~~~~~~dp 415 (468) +..+...+-++. ..++|+ |++|++.++.. .|-.|+. .++|+- |+ .+.+ ...|- T Consensus 312 --G~~l~~~~~~~~-~~~~l~-G~pV~~~~~~~---------~~~~~~~----~~~~Gd~~~~~~~~~~~~~~v-~~~~~ 373 (415) T protein:vir:81 312 --GNYLIQPDVKEK-TQQRLL-GAKIEILPDEV---------LGQKGNN----TLIIGNLKDAIVLFDRSQYQA-SWTDY 373 (415) T ss_pred --CceeeccCcCCC-CCceec-ceeeEEecccc---------cCCCCcc----EEEEEehhccEEEEeecceEE-EEecc Confidence 111111111111 124553 55676653321 1111111 122221 11 1111 11244 Q ss_pred ccccceeeeeeeeeee-ecC--cccc----cCccccccchhhhhhhc Q lcl|Aclame:pro 416 NTFQPKIGFKTRYGMV-SNP--FVTT----NGLYNGTPDGEALTPNA 455 (468) Q Consensus 416 ~s~qP~~g~~tRY~l~-~nP--~~~~----~~~~~~~~~~~~~~~~a 455 (468) .+++..+....|++.. .+| |... ...++| +++..| T Consensus 374 ~~~~~~~~~~~r~d~~v~~~~a~~~~~~~~~~~~~~-----~~~~~~ 415 (415) T protein:vir:81 374 MHFGECLMIAVRQDCRILDYKSAIVIEYDDSERGEG-----DLGLEA 415 (415) T ss_pred ccCceEEEEEEEeccEEeccccEEEEEEeccCCCCC-----ccccCC Confidence 5677778888899764 355 4322 222223 333333 No 22 >protein:vir:98339 Length: 415 # NCBI annotation: putative capsid protein # Family: family:all:21 # MgeID: mge:1581 # MgeName: phiPVL(108) # Cross-refs: genbank:acc:YP_918931;genbank:gi:119443693;genbank:GeneID:4594501 Probab=96.76 E-value=0.00036 Score=39.47 Aligned_cols=343 Identities=16% Similarity=0.110 Sum_probs=138.5 Q ss_pred CcchHHHHHhhhhhhCCCccchhcchhhhHHHHHH----------HhHHHHHHhhhhhhhhhh---hh---hhc------ Q lcl|Aclame:pro 1 MFNAEHLQEKWSPVLNHGEAPAIGDRYKRAVTSVL----------LENQERFLREERGMLNEV---AV---NSL------ 58 (468) Q Consensus 1 ~~~~~~l~~kw~p~l~~~~~~~i~~~~~~~~~~~l----------lenq~~~~~~~~~~l~e~---~~---~~~------ 58 (468) |=..++|.++=..+.+. +-+.+..-++.+...- ++.-++++.+....+.+. .. ... T Consensus 1 mk~~~el~~~l~el~~~--~~~~~~e~~~~l~~~~~~~~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~ 78 (415) T protein:vir:98 1 MKTKEELQSEISDIKRQ--IDLKVKYATRALNNDELEKAEKLEQEITDLRSQIQEKQEELDKLKEKDGTSENNQQSVEVN 78 (415) T ss_pred CchHHHHHHHHHHHHHH--HHHHHHHHHHHhchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccccc Confidence 55555666665555432 1111111111111000 111111111100000000 00 000 Q ss_pred ------------------Ccc-------------cccccccccccccccccccccceeh--hhhHHhhhhhhhhheeeee Q lcl|Aclame:pro 59 ------------------GAG-------------TIAPAGSALGSANTGGLAGFDPVLI--SLVRRAMPNLMAYDVCGVQ 105 (468) Q Consensus 59 ------------------~~~-------------~~~~~~~i~~st~tg~i~~~~P~Lv--~l~RRa~~~LI~~DI~GVQ 105 (468) ... -.......+.++++.+-...-|.-+ .++++..+...-.+++.|. T Consensus 79 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~ 158 (415) T protein:vir:98 79 EARTYRNQANINDLGISIQNTKVTSQEVRDFTEYLETRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVK 158 (415) T ss_pred hhhhHHHHHHHHHHhhhhhhhhhHHHHHHHHHHHHhhhhhhhhccccccccccccchHHHHHHHHHHHhhhhhhhheeee Confidence 000 0000000001111111111123221 2445555667788999999 Q ss_pred cCCccceeeeeeeeeecCCCCcccccccCCccccccccccccccccccCccccCCCccccccccccccccccccccccch Q lcl|Aclame:pro 106 PMSGPTGLIFAMRSRYENQAGEEALFNEPDTGFTGGYDASQGDYAVRTGAGVGGDSEGNNPALLNDAAPGTYEVGSKMPR 185 (468) Q Consensus 106 PmTGPTGLIFAMRsrY~~qsG~EA~fnEa~t~fSg~~~~~~~~~~~~~~~~~~~~~~gt~~~~~~~a~~~~~t~~~gm~T 185 (468) ||++..+-+--.+ ..+.. .+ .+ . T Consensus 159 ~~~~~~~~~~~~~--~~~~~--~~-------~~----------------------------------------------v 181 (415) T protein:vir:98 159 RVTNGSGKYPVVR--QSEVA--AL-------EK----------------------------------------------V 181 (415) T ss_pred eccCCceeEEEEe--ecCCc--cc-------ee----------------------------------------------e Confidence 9999887654443 11100 00 00 0 Q ss_pred hhhhccCCCC-cchhhcceEEEEEEEEeecccccccccHHHHHHHHHhcCCChhHHHHHHHHHHHHHHhhHHHHHHHhhh Q lcl|Aclame:pro 186 EDLERMGEAN-RLFREMSFSIEKTSVTAQSRALKAEYTLELAQDLKAIHGLDAEQELANILSSEVLAEINREVVRRVYTV 264 (468) Q Consensus 186 a~aE~lG~~~-~~f~EMaFsIeK~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAE~ELanILStEImlEINREii~~l~~v 264 (468) ++.....+.+ ..|.+..|++.|. +-...+|-||.+|- .+|.+++|.+-|+..|..-+|+.||.-.-+- T Consensus 182 ~E~~~~~~~~~~~~~~v~~~~~k~-------~~~~~iS~ell~ds----~~~l~~~i~~~l~~~~~~~~~~~il~g~g~g 250 (415) T protein:vir:98 182 EELEENPELAVKPFFQLAYDINTH-------RGYFRISREAIEDA----KVNVLQELKLWMARTIAATRNKAIIDVITKG 250 (415) T ss_pred ccccccCcccccceeeEEeeeeee-------EeeehhhHHHHhhc----hHHHHHHHHHHHHHHHHHHHHHHHhhccccC Confidence 0001111111 2344455555444 44566999999984 3578999999999999999999998765332 Q ss_pred hhccccccccccccccccccccchhHHHHHHHHHHHHHHHHHHHHHhhcCCCccEEEEchhHHHHHHhhccccccccccc Q lcl|Aclame:pro 265 AKKGAQNNVANAGIFDLDVDSNGRWSVEKFKGLLFQVERDANAIAQETRRGKGNFLICSADVASALAMAGVLDYSSGLNG 344 (468) Q Consensus 265 a~~~k~~~~~~~g~~Dl~~~~~grw~~e~~k~L~~~i~~ean~i~~~T~r~~~n~~v~S~~Va~~L~~sG~~~~~~~~~~ 344 (468) ...+-..+....++. ...++--.++....++..+... -.+++.+||++.....|... ..+- T Consensus 251 ~~~~~~~~~~~~~~~---~~~~~~~~~~~i~~~~~~~~~~---------~~~~~~~v~n~~~~~~l~~l---kd~~---- 311 (415) T protein:vir:98 251 STGSTSSGFEKEGKK---LEVKKAKSLDDIKDAINLNVKP---------NYEHNVAIVSQTMFAKLDKM---KDKL---- 311 (415) T ss_pred ccccccccccccccc---cccccccchhHHHHHHHhhhhh---------ccCCCEEEEcHHHHHHHHHh---hccC---- Confidence 111100000000000 0001111122333343333221 13456789999999988762 2111 Q ss_pred ccccccccccccCceeEEEecCceEEEEcccccccCCcceEEEEEecCCcccceeEeec----cc-----hhhcccccCC Q lcl|Aclame:pro 345 AGGPSIGEVDDTGNLAVGTINGRIKVFVDPYAANLSDKHYYVIGYKGTSPYDAGLFYCP----YV-----PLQMVRSIDP 415 (468) Q Consensus 345 ~~~~~~~~~d~t~~~~~G~l~g~~~vy~D~Ya~~~~~~dY~~vG~KG~~~~d~glfyaP----Yv-----~l~~~~~~dp 415 (468) +..+...+-++. ..++|+ |++|++.++.. .|-.|+. .++|+- |+ .+.+ ...|- T Consensus 312 --G~~l~~~~~~~~-~~~~l~-G~pV~~~~~~~---------~~~~~~~----~~~~Gd~~~~~~~~~~~~~~v-~~~~~ 373 (415) T protein:vir:98 312 --GNYLIQPDVKEK-TQQRLL-GAKIEILPDEV---------LGQKGNN----TLIIGNLKDAIVLFDRSQYQA-SWTDY 373 (415) T ss_pred --CceeeccCcCCC-CCceec-ceeeEEecccc---------cCCCCcc----EEEEEehhccEEEEeecceEE-EEecc Confidence 111111111111 124553 55676653321 1111111 122221 11 1111 11244 Q ss_pred ccccceeeeeeeeeee-ecC--cccc----cCccccccchhhhhhhc Q lcl|Aclame:pro 416 NTFQPKIGFKTRYGMV-SNP--FVTT----NGLYNGTPDGEALTPNA 455 (468) Q Consensus 416 ~s~qP~~g~~tRY~l~-~nP--~~~~----~~~~~~~~~~~~~~~~a 455 (468) .+++..+....|++.. .+| |... ...++| +++..| T Consensus 374 ~~~~~~~~~~~r~d~~v~~~~a~~~~~~~~~~~~~~-----~~~~~~ 415 (415) T protein:vir:98 374 MHFGECLMIAVRQDCRILDYKSAIVIEYDDSERGEG-----DLGLEA 415 (415) T ss_pred ccCceEEEEEEEeccEEeccccEEEEEEeccCCCCC-----ccccCC Confidence 5677778888899764 355 4322 222223 333333 No 23 >protein:vir:9410 Length: 415 # NCBI annotation: head protein # Family: family:all:21 # MgeID: mge:167 # MgeName: phi 13 # Cross-refs: genbank:acc:NP_803388;genbank:gi:29028700;genbank:GeneID:1258136 Probab=95.91 E-value=0.0013 Score=36.45 Aligned_cols=343 Identities=14% Similarity=0.100 Sum_probs=130.1 Q ss_pred CcchHHHHHhhhhhhCCCccchhcchhh--hHHHHHHHhHHHHHH----------------------------------h Q lcl|Aclame:pro 1 MFNAEHLQEKWSPVLNHGEAPAIGDRYK--RAVTSVLLENQERFL----------------------------------R 44 (468) Q Consensus 1 ~~~~~~l~~kw~p~l~~~~~~~i~~~~~--~~~~~~llenq~~~~----------------------------------~ 44 (468) +++.+++ ++...+.+. +-++....+ ++-+..+.|...... . T Consensus 28 ~~~~~~~-e~~~~~~~e--i~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 104 (415) T protein:vir:94 28 ALNNDEL-EKAEKLEQE--ITDLRSQIQEKQEELDKLKEKDGTSENNQQSVEVNEASTYRNQANINDLGISIQNTKVTSQ 104 (415) T ss_pred HhchhHH-HHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHhhhhccccccccchhhHHHHHHHHHHHhhhhhhhhhHH Confidence 2222221 111111110 000000000 000111111000000 0 Q ss_pred hhhhhhhhhhhhhcCcccccccccccccccccccccccceehhhhHHhhhhhhhhheeeeecCCccceeeeeeeeeecCC Q lcl|Aclame:pro 45 EERGMLNEVAVNSLGAGTIAPAGSALGSANTGGLAGFDPVLISLVRRAMPNLMAYDVCGVQPMSGPTGLIFAMRSRYENQ 124 (468) Q Consensus 45 ~~~~~l~e~~~~~~~~~~~~~~~~i~~st~tg~i~~~~P~Lv~l~RRa~~~LI~~DI~GVQPmTGPTGLIFAMRsrY~~q 124 (468) +.+.+... +.... ........+++|+..--....-.+++...+..+-.++++++||++..+-+--.+ ..+. T Consensus 105 e~~~~~~~-----~~~~~--~~~~~~~~~~~g~~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~--~~~~ 175 (415) T protein:vir:94 105 EVRDFTEY-----LETRN--DIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVR--QSEV 175 (415) T ss_pred HHHHHHHH-----hhhhh--hhhhhccccccccccCcHHHHHHHHHHHHhhhhhhhhcceeeccCCceeEEEEe--ecCC Confidence 00011000 00000 000001112222222111122234555556778899999999998776543333 1110 Q ss_pred CCcccccccCCccccccccccccccccccCccccCCCccccccccccccccccccccccchhhhhccCCCC-cchhhcce Q lcl|Aclame:pro 125 AGEEALFNEPDTGFTGGYDASQGDYAVRTGAGVGGDSEGNNPALLNDAAPGTYEVGSKMPREDLERMGEAN-RLFREMSF 203 (468) Q Consensus 125 sG~EA~fnEa~t~fSg~~~~~~~~~~~~~~~~~~~~~~gt~~~~~~~a~~~~~t~~~gm~Ta~aE~lG~~~-~~f~EMaF 203 (468) . ++ .| ..++....+.+ ..|.+..| T Consensus 176 ~--~~-------~~----------------------------------------------v~Eg~~~~~~~~~~~~~i~~ 200 (415) T protein:vir:94 176 A--AL-------EK----------------------------------------------VEELEENPELAVKPFFQLAY 200 (415) T ss_pred c--cc-------ee----------------------------------------------ccccccccccccccceeeEe Confidence 0 00 00 00001111111 13555555 Q ss_pred EEEEEEEEeecccccccccHHHHHHHHHhcCCChhHHHHHHHHHHHHHHhhHHHHHHHhhhhhccccccccccccccccc Q lcl|Aclame:pro 204 SIEKTSVTAQSRALKAEYTLELAQDLKAIHGLDAEQELANILSSEVLAEINREVVRRVYTVAKKGAQNNVANAGIFDLDV 283 (468) Q Consensus 204 sIeK~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAE~ELanILStEImlEINREii~~l~~va~~~k~~~~~~~g~~Dl~~ 283 (468) ++.|.. -.-.+|-||.+|-- +|.+++|.+-|...|..-+|+.||.-.-+-.-.+-..+....++. . T Consensus 201 ~~~k~~-------~~~~is~ell~ds~----~~~~~~i~~~l~~~~~~~~~~~il~g~g~g~~~~~~~~~~~~~~~---~ 266 (415) T protein:vir:94 201 DINTHR-------GYFRISREAIEDAK----VNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGFEKEGKK---L 266 (415) T ss_pred eheeee-------eechhhHHHHhhch----HHHHHHHHHHHHHHHHHHHHHHHhhccccCccccccccccccccc---c Confidence 555554 44569999999864 478999999999999999999998765432111100000000000 0 Q ss_pred cccchhHHHHHHHHHHHHHHHHHHHHHhhcCCCccEEEEchhHHHHHHhhcccccccccccccccccccccccCceeEEE Q lcl|Aclame:pro 284 DSNGRWSVEKFKGLLFQVERDANAIAQETRRGKGNFLICSADVASALAMAGVLDYSSGLNGAGGPSIGEVDDTGNLAVGT 363 (468) Q Consensus 284 ~~~grw~~e~~k~L~~~i~~ean~i~~~T~r~~~n~~v~S~~Va~~L~~sG~~~~~~~~~~~~~~~~~~~d~t~~~~~G~ 363 (468) ..++--.++....++..+.. . -.+.+.+|++|.....|... ..+. +......+-++. ..++ T Consensus 267 ~~~~~~~~~~i~~~~~~~~~--------~-~~~~~~~vmn~~~~~~l~~l---kd~~------G~~l~~~~~~~~-~~~~ 327 (415) T protein:vir:94 267 EVKKAKSLDDIKDAINLNVK--------P-NYEHNVAIVSQTMFAKLDKM---KDKL------GNYLIQPDVKEK-TQQR 327 (415) T ss_pred ccccccchHHHHHHHHhhhh--------h-ccCCCEEEEcHHHHHHHHHh---hccC------CCeeeccCcCCC-CCce Confidence 00000112233334333321 1 23567889999999998762 2111 011111111111 1245 Q ss_pred ecCceEEEEcccccccCCcc-eEEEEEecCCcccceeEeeccchhhcccccCCccccceeeeeeeeeee-ecC--cccc- Q lcl|Aclame:pro 364 INGRIKVFVDPYAANLSDKH-YYVIGYKGTSPYDAGLFYCPYVPLQMVRSIDPNTFQPKIGFKTRYGMV-SNP--FVTT- 438 (468) Q Consensus 364 l~g~~~vy~D~Ya~~~~~~d-Y~~vG~KG~~~~d~glfyaPYv~l~~~~~~dp~s~qP~~g~~tRY~l~-~nP--~~~~- 438 (468) |+ |++|++.+....-..-+ -+++|--.. .+.......+.+ ...|-.+++-.+-...|++.. .+| |... T Consensus 328 l~-G~pV~~~~~~~~~~~~~~~i~~gd~~~-----~~~~~~~~~~~v-~~~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~ 400 (415) T protein:vir:94 328 LL-GAKIEILPDEVLGQKGNNTLIIGNLKD-----AIVLFDRSQYQA-SWTDYMHFGECLMIAVRQDCRILDYKSAIVIE 400 (415) T ss_pred ec-ceeeEEecccccCCCCccEEEEEehhc-----cEEEEeecceEE-EEeccccCceEEEEEEEeccEEeccccEEEEE Confidence 54 45666653321100001 122221000 000000001111 112445566667777888764 355 3321 Q ss_pred ---cCccccccchhhhhhhc Q lcl|Aclame:pro 439 ---NGLYNGTPDGEALTPNA 455 (468) Q Consensus 439 ---~~~~~~~~~~~~~~~~a 455 (468) ...++| +++..| T Consensus 401 ~~~~~~~~~-----~~~~~~ 415 (415) T protein:vir:94 401 YDDSERGEG-----DLGLEA 415 (415) T ss_pred EeccCCCCC-----ccccCC Confidence 112222 233333 No 24 >protein:vir:4953 Length: 397 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:108 # MgeName: Sfi19 # Cross-refs: genbank:acc:NP_049929;genbank:gi:9632900;genbank:GeneID:1262076 Probab=95.36 E-value=0.0022 Score=35.14 Aligned_cols=331 Identities=14% Similarity=0.138 Sum_probs=135.0 Q ss_pred CcchHHHHHhhhhhhCCCccchhcchhhh----------------HH---HHHHHh---HHHHHHhhhhhh----hhhhh Q lcl|Aclame:pro 1 MFNAEHLQEKWSPVLNHGEAPAIGDRYKR----------------AV---TSVLLE---NQERFLREERGM----LNEVA 54 (468) Q Consensus 1 ~~~~~~l~~kw~p~l~~~~~~~i~~~~~~----------------~~---~~~lle---nq~~~~~~~~~~----l~e~~ 54 (468) |=..++|.++|.-+-+. |++..++ ++ +..+.+ .+++.+.+.+.. ..+.. T Consensus 1 Mk~~~el~~~~~~~~~~-----~~~l~~~~~~~~~~~~~~~ee~~~~~~~i~~~~~~~e~~~~~~~~~~~~~~~~~~~~~ 75 (397) T protein:vir:49 1 MKTSNELHDLWVAQGDK-----VENLNEKLNVAMLDDSVSAEELQAIKNERDTAKMKRDMFKEQYTEARANEVANMSEEE 75 (397) T ss_pred CchHHHHHHHHHHHHHH-----HHHHHHHHHHHHhhhhcCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccc Confidence 77788888888765433 2111000 00 011111 011111110000 00000 Q ss_pred h-------------------hhcCccccccccccccccc-ccccccccceehhhhHHhhhhhhhhheeeeecCCccceee Q lcl|Aclame:pro 55 V-------------------NSLGAGTIAPAGSALGSAN-TGGLAGFDPVLISLVRRAMPNLMAYDVCGVQPMSGPTGLI 114 (468) Q Consensus 55 ~-------------------~~~~~~~~~~~~~i~~st~-tg~i~~~~P~Lv~l~RRa~~~LI~~DI~GVQPmTGPTGLI 114 (468) . ..+..+...-....+.+++ .|++.--....-.+++...+..+-.++|.++||++++|-+ T Consensus 76 ~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~ 155 (397) T protein:vir:49 76 KKPLTKSEEEVKAGFVKDFKNLVRGRYQNLLDSKTDASGSDAGLTIPQDIQTAIHTLVSQYDSLQEYVNVENVTTLTGSR 155 (397) T ss_pred ccccccchhHHHHHHHHHHHHHHhcchhHHHHHhhccccccCcccccHhHHHHHHHHHHhhhhHHhhhceeecccCccce Confidence 0 0000000000001111222 2222111111222344444566778889999999998854 Q ss_pred eeeeeeecCCCCcccccccCCccccccccccccccccccCccccCCCccccccccccccccccccccccchhhhhccCCC Q lcl|Aclame:pro 115 FAMRSRYENQAGEEALFNEPDTGFTGGYDASQGDYAVRTGAGVGGDSEGNNPALLNDAAPGTYEVGSKMPREDLERMGEA 194 (468) Q Consensus 115 FAMRsrY~~qsG~EA~fnEa~t~fSg~~~~~~~~~~~~~~~~~~~~~~gt~~~~~~~a~~~~~t~~~gm~Ta~aE~lG~~ 194 (468) .-++ ..+..+. ..| .++++...+. T Consensus 156 ~~~~--~~~~~~~--------a~~----------------------------------------------v~E~~~~~~~ 179 (397) T protein:vir:49 156 VYEK--WTDITGL--------ANI----------------------------------------------DDEAGKIADV 179 (397) T ss_pred EEEe--eccCCcc--------eee----------------------------------------------ecCccccccc Confidence 4333 1111100 000 0000111111 Q ss_pred -CcchhhcceEEEEEEEEeecccccccccHHHHHHHHHhcCCChhHHHHHHHHHHHHHHhhHHHHHHHhhhhhccccccc Q lcl|Aclame:pro 195 -NRLFREMSFSIEKTSVTAQSRALKAEYTLELAQDLKAIHGLDAEQELANILSSEVLAEINREVVRRVYTVAKKGAQNNV 273 (468) Q Consensus 195 -~~~f~EMaFsIeK~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAE~ELanILStEImlEINREii~~l~~va~~~k~~~~ 273 (468) ...|.++.|++.|. +-...+|-||.+|-. .|.+++|.+-|+..|..-+|+.||.-.-+. . T Consensus 180 ~~~~~~~i~~~~~k~-------~~~~~iS~ell~ds~----~~l~~~i~~~l~~~~~~~~d~ai~~G~g~~--------~ 240 (397) T protein:vir:49 180 DDPKLSLIKYTIKRY-------AGISTVTNSLLADSA----ENILAWLSGWIAKKVVVTRNKAILEAIAAL--------P 240 (397) T ss_pred cccceeeEEeeeeeE-------EeeehhHHHHHhhhH----HHHHHHHHHHHHHHHHHHHHHHHHhhcccc--------c Confidence 12355555555444 444678999999853 578999999999999999999988643222 2 Q ss_pred cccccccccccccchhHHHHHHHHHHHHHHHHHHHHHhhcCCCccEEEEchhHHHHHHhhcccccccccccccccccccc Q lcl|Aclame:pro 274 ANAGIFDLDVDSNGRWSVEKFKGLLFQVERDANAIAQETRRGKGNFLICSADVASALAMAGVLDYSSGLNGAGGPSIGEV 353 (468) Q Consensus 274 ~~~g~~Dl~~~~~grw~~e~~k~L~~~i~~ean~i~~~T~r~~~n~~v~S~~Va~~L~~sG~~~~~~~~~~~~~~~~~~~ 353 (468) ...++++++ ....+++.+... -.....+|++|.....|... ..+- +..+++. T Consensus 241 ~~~~~~~~d----------~i~~~~~~l~~~---------~~~~a~~vmn~~~~~~l~~l---kd~~------G~~l~~~ 292 (397) T protein:vir:49 241 TKPTLTKWD----------DIIDLEAKVDPA---------IKQTSFFLTNTSGFTALKKV---KNAL------GDYLMER 292 (397) T ss_pred cccccccHH----------HHHHHHHhhhhh---------hcCCCEEEEcHHHHHHHHHh---hcCC------Cceeecc Confidence 233344332 233444444321 12345788999999998763 2110 1122222 Q ss_pred cccCceeEEEecCceEEEE--cccccccCCc----------ceEEEEEecCCcccceeEeeccchhhcccccCCccccce Q lcl|Aclame:pro 354 DDTGNLAVGTINGRIKVFV--DPYAANLSDK----------HYYVIGYKGTSPYDAGLFYCPYVPLQMVRSIDPNTFQPK 421 (468) Q Consensus 354 d~t~~~~~G~l~g~~~vy~--D~Ya~~~~~~----------dY~~vG~KG~~~~d~glfyaPYv~l~~~~~~dp~s~qP~ 421 (468) +-++. ..++|. |++|++ |.+..+.... +|++++.++... +=+.||.... -...+-. T Consensus 293 ~~~~~-~~~~l~-G~PV~~~~~~~~~~~~~~~~~i~~gd~~~~~~~~~~~~~~----i~~~~~~~~~------~~~~~~~ 360 (397) T protein:vir:49 293 DVKSP-TGYSID-GFAVKEVADRWLANGTGGAMPLYFGDLKQAVTLFDRQHMS----LLSTNIGGGA------FETDTTK 360 (397) T ss_pred CcCCC-CCceec-ceeeEEecccccccccCCceeEEEeeccceEEEEeecceE----EEEeccccch------hhcCcee Confidence 22221 124564 445554 3333222211 233333333222 2233332211 1223333 Q ss_pred eeeeeeeeee-ecC--cccc--cC--ccccccchhhh Q lcl|Aclame:pro 422 IGFKTRYGMV-SNP--FVTT--NG--LYNGTPDGEAL 451 (468) Q Consensus 422 ~g~~tRY~l~-~nP--~~~~--~~--~~~~~~~~~~~ 451 (468) +-...|++.. .|| |... .+ ...+.-...+. T Consensus 361 ~r~~~r~d~~~~~~~a~~~~~~~~~~~~~~~~~~~~~ 397 (397) T protein:vir:49 361 VRVIDRFDVVATDTEAFVPASFKAIADQKGNLGSTAV 397 (397) T ss_pred EEEEeeeCcEEecccceEEEEeecccCCCCCcccccC Confidence 4444555442 233 2211 11 11111100111 No 25 >protein:vir:4600 Length: 415 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:101 # MgeName: PVL # Cross-refs: genbank:acc:NP_058445;genbank:gi:9635171;genbank:GeneID:1262708 Probab=95.34 E-value=0.0022 Score=35.10 Aligned_cols=343 Identities=13% Similarity=0.070 Sum_probs=132.1 Q ss_pred CcchHHH-------------HHhhhhhhCC----------Cc--c--c------hhcchhhhHHHHHHHhHHHHHHhhhh Q lcl|Aclame:pro 1 MFNAEHL-------------QEKWSPVLNH----------GE--A--P------AIGDRYKRAVTSVLLENQERFLREER 47 (468) Q Consensus 1 ~~~~~~l-------------~~kw~p~l~~----------~~--~--~------~i~~~~~~~~~~~llenq~~~~~~~~ 47 (468) +++.+++ .++..-+-+. .. . . ...+...+......+.+....-.+.+ T Consensus 28 ~~~~~~~e~~~~~~~ev~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 107 (415) T protein:vir:46 28 ALNNDELEKAEKLEQEITDLRSQIQEKQEELDKLKEKDRTSENNQQSVEVNEARTYRNQANINDLGISIQNTKVTSQEVR 107 (415) T ss_pred HhchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccccchhhhhHHHHHHHHHHHhhhhhhhhHHHHH Confidence 2222222 1111110000 00 0 0 00000000000000000000000000 Q ss_pred hhhhhhhhhhcCccccccccccccc--ccccccccccceehhhhHHhhhhhhhhheeeeecCCccceeeeeeeeeecCCC Q lcl|Aclame:pro 48 GMLNEVAVNSLGAGTIAPAGSALGS--ANTGGLAGFDPVLISLVRRAMPNLMAYDVCGVQPMSGPTGLIFAMRSRYENQA 125 (468) Q Consensus 48 ~~l~e~~~~~~~~~~~~~~~~i~~s--t~tg~i~~~~P~Lv~l~RRa~~~LI~~DI~GVQPmTGPTGLIFAMRsrY~~qs 125 (468) . +.+. ... ........ +..|+..--....-.+++.+.+...-.+++.+.||+++++-+.-.+.. . T Consensus 108 ~-~~~~----~~~----~~~~~~~~~~t~~g~~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~--~-- 174 (415) T protein:vir:46 108 D-FTEY----LET----RNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQS--E-- 174 (415) T ss_pred H-HHHH----Hhh----hhhhhhccccccCCcccccHHHHHHHHHHHHhhhhhhhhcceeeccCCceeEEEEEec--C-- Confidence 0 0000 000 00001111 112221111111123455556777888999999999988765433310 0 Q ss_pred CcccccccCCccccccccccccccccccCccccCCCccccccccccccccccccccccchhhhhccCCCCcchhhcc-eE Q lcl|Aclame:pro 126 GEEALFNEPDTGFTGGYDASQGDYAVRTGAGVGGDSEGNNPALLNDAAPGTYEVGSKMPREDLERMGEANRLFREMS-FS 204 (468) Q Consensus 126 G~EA~fnEa~t~fSg~~~~~~~~~~~~~~~~~~~~~~gt~~~~~~~a~~~~~t~~~gm~Ta~aE~lG~~~~~f~EMa-Fs 204 (468) +.++ .| . +| +..+++.+ -+ T Consensus 175 ~~~~-------~~----------------------------------------------v--~E-----g~~~~~~~~~~ 194 (415) T protein:vir:46 175 VAAL-------EK----------------------------------------------V--EE-----LEENPELAVKP 194 (415) T ss_pred Ccce-------ee----------------------------------------------c--cc-----ccccccccccc Confidence 0000 00 0 01 11233332 24 Q ss_pred EEEEEEEeecccccccccHHHHHHHHHhcCCChhHHHHHHHHHHHHHHhhHHHHHHHhhhhhccccccccc-cccccccc Q lcl|Aclame:pro 205 IEKTSVTAQSRALKAEYTLELAQDLKAIHGLDAEQELANILSSEVLAEINREVVRRVYTVAKKGAQNNVAN-AGIFDLDV 283 (468) Q Consensus 205 IeK~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAE~ELanILStEImlEINREii~~l~~va~~~k~~~~~~-~g~~Dl~~ 283 (468) +++++..++..+-...+|-||.+|-. .|.+++|.+-|+..|..-+|+.||.-.-+-...+--..... ...+ . T Consensus 195 ~~~v~~~~~k~~~~~~iS~ell~ds~----~~l~~~i~~~l~~~i~~~~d~~il~g~g~g~~~~~~~~~~~~~~~~--~- 267 (415) T protein:vir:46 195 FFQLAYDINTHRGYFRISREAIEDAK----VNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGFEKEGKKL--E- 267 (415) T ss_pred eeeEEeeeeeeEeeehhhHHHHhhch----HHHHHHHHHHHHHHHHHHHHHHHhhccccCCcccccccccccccee--c- Confidence 55566666666666789999999843 57889999999999999999999876533111110000000 0000 0 Q ss_pred cccchhHHHHHHHHHHHHHHHHHHHHHhhcCCCccEEEEchhHHHHHHhhcccccccccccccccccccccccCceeEEE Q lcl|Aclame:pro 284 DSNGRWSVEKFKGLLFQVERDANAIAQETRRGKGNFLICSADVASALAMAGVLDYSSGLNGAGGPSIGEVDDTGNLAVGT 363 (468) Q Consensus 284 ~~~grw~~e~~k~L~~~i~~ean~i~~~T~r~~~n~~v~S~~Va~~L~~sG~~~~~~~~~~~~~~~~~~~d~t~~~~~G~ 363 (468) ..+--..+....++.++.. --++.+.+|++|.....|... ..+. +..+...+-++.. .++ T Consensus 268 -~~~~~~~~~i~~~~~~~~~---------~~~~~~~~v~n~~~~~~L~~l---kd~~------G~~i~~~~~~~~~-~~~ 327 (415) T protein:vir:46 268 -VKKAKSLDDIKDAINLNVK---------PNYEHNVAIVSQTMFAKLDKM---KDKL------GNYLIQPDVKEKT-QQR 327 (415) T ss_pred -cccccchHHHHHHHHhhhh---------hccCCCEEEEcHHHHHHHHHh---hccC------CCeeeccCcCCCC-Ccc Confidence 0010112233344333332 124567889999999988752 2111 1111111111111 245 Q ss_pred ecCceEEEEcccccc-cCCcceEEEEEecCCcccceeEeeccchhhcccccCCccccceeeeeeeeeee-ecC--cccc- Q lcl|Aclame:pro 364 INGRIKVFVDPYAAN-LSDKHYYVIGYKGTSPYDAGLFYCPYVPLQMVRSIDPNTFQPKIGFKTRYGMV-SNP--FVTT- 438 (468) Q Consensus 364 l~g~~~vy~D~Ya~~-~~~~dY~~vG~KG~~~~d~glfyaPYv~l~~~~~~dp~s~qP~~g~~tRY~l~-~nP--~~~~- 438 (468) |+ |++|++..++.. .....-+++|---. .+.......+.+ ...|-.+++-.+-...|++.. .+| |... T Consensus 328 l~-G~pV~~~~~~~~~~~~~~~~~~gd~~~-----~~~~~~~~~~~v-~~~~~~~~~~~~~~~~r~d~~v~~~~a~~~~~ 400 (415) T protein:vir:46 328 LL-GAKIEILPDEVLGQKGNNTLIIGNLKD-----AIVLFDRSQYQA-SWTDYMHFGECLMIAVRQDCRILDYKSAIVIE 400 (415) T ss_pred cc-ceeeEEeccccccCCCccEEEEEehhc-----cEEEEeecceEE-EeeccccCceEEEEEEEeccEEeccccEEEEE Confidence 64 445655432210 00001122221000 000000001111 112445566667777888764 355 3221 Q ss_pred ---cCccccccchhhhhhhc Q lcl|Aclame:pro 439 ---NGLYNGTPDGEALTPNA 455 (468) Q Consensus 439 ---~~~~~~~~~~~~~~~~a 455 (468) ...+.| +++..| T Consensus 401 ~~~~~~~~~-----~~~~~~ 415 (415) T protein:vir:46 401 YDDSERGEG-----DLGLEA 415 (415) T ss_pred eeccCCCCC-----CccCCC Confidence 122222 233333 No 26 >protein:vir:4700 Length: 415 # NCBI annotation: phi PVL ORF 7 homologue # Family: family:all:21 # MgeID: mge:102 # MgeName: phiPV83 # Cross-refs: genbank:acc:NP_061632;genbank:gi:9635719;genbank:GeneID:1262976 Probab=95.34 E-value=0.0022 Score=35.10 Aligned_cols=343 Identities=13% Similarity=0.070 Sum_probs=132.1 Q ss_pred CcchHHH-------------HHhhhhhhCC----------Cc--c--c------hhcchhhhHHHHHHHhHHHHHHhhhh Q lcl|Aclame:pro 1 MFNAEHL-------------QEKWSPVLNH----------GE--A--P------AIGDRYKRAVTSVLLENQERFLREER 47 (468) Q Consensus 1 ~~~~~~l-------------~~kw~p~l~~----------~~--~--~------~i~~~~~~~~~~~llenq~~~~~~~~ 47 (468) +++.+++ .++..-+-+. .. . . ...+...+......+.+....-.+.+ T Consensus 28 ~~~~~~~e~~~~~~~ev~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 107 (415) T protein:vir:47 28 ALNNDELEKAEKLEQEITDLRSQIQEKQEELDKLKEKDRTSENNQQSVEVNEARTYRNQANINDLGISIQNTKVTSQEVR 107 (415) T ss_pred HhchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccccchhhhhHHHHHHHHHHHhhhhhhhhHHHHH Confidence 2222222 1111110000 00 0 0 00000000000000000000000000 Q ss_pred hhhhhhhhhhcCccccccccccccc--ccccccccccceehhhhHHhhhhhhhhheeeeecCCccceeeeeeeeeecCCC Q lcl|Aclame:pro 48 GMLNEVAVNSLGAGTIAPAGSALGS--ANTGGLAGFDPVLISLVRRAMPNLMAYDVCGVQPMSGPTGLIFAMRSRYENQA 125 (468) Q Consensus 48 ~~l~e~~~~~~~~~~~~~~~~i~~s--t~tg~i~~~~P~Lv~l~RRa~~~LI~~DI~GVQPmTGPTGLIFAMRsrY~~qs 125 (468) . +.+. ... ........ +..|+..--....-.+++.+.+...-.+++.+.||+++++-+.-.+.. . T Consensus 108 ~-~~~~----~~~----~~~~~~~~~~t~~g~~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~--~-- 174 (415) T protein:vir:47 108 D-FTEY----LET----RNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQS--E-- 174 (415) T ss_pred H-HHHH----Hhh----hhhhhhccccccCCcccccHHHHHHHHHHHHhhhhhhhhcceeeccCCceeEEEEEec--C-- Confidence 0 0000 000 00001111 112221111111123455556777888999999999988765433310 0 Q ss_pred CcccccccCCccccccccccccccccccCccccCCCccccccccccccccccccccccchhhhhccCCCCcchhhcc-eE Q lcl|Aclame:pro 126 GEEALFNEPDTGFTGGYDASQGDYAVRTGAGVGGDSEGNNPALLNDAAPGTYEVGSKMPREDLERMGEANRLFREMS-FS 204 (468) Q Consensus 126 G~EA~fnEa~t~fSg~~~~~~~~~~~~~~~~~~~~~~gt~~~~~~~a~~~~~t~~~gm~Ta~aE~lG~~~~~f~EMa-Fs 204 (468) +.++ .| . +| +..+++.+ -+ T Consensus 175 ~~~~-------~~----------------------------------------------v--~E-----g~~~~~~~~~~ 194 (415) T protein:vir:47 175 VAAL-------EK----------------------------------------------V--EE-----LEENPELAVKP 194 (415) T ss_pred Ccce-------ee----------------------------------------------c--cc-----ccccccccccc Confidence 0000 00 0 01 11233332 24 Q ss_pred EEEEEEEeecccccccccHHHHHHHHHhcCCChhHHHHHHHHHHHHHHhhHHHHHHHhhhhhccccccccc-cccccccc Q lcl|Aclame:pro 205 IEKTSVTAQSRALKAEYTLELAQDLKAIHGLDAEQELANILSSEVLAEINREVVRRVYTVAKKGAQNNVAN-AGIFDLDV 283 (468) Q Consensus 205 IeK~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAE~ELanILStEImlEINREii~~l~~va~~~k~~~~~~-~g~~Dl~~ 283 (468) +++++..++..+-...+|-||.+|-. .|.+++|.+-|+..|..-+|+.||.-.-+-...+--..... ...+ . T Consensus 195 ~~~v~~~~~k~~~~~~iS~ell~ds~----~~l~~~i~~~l~~~i~~~~d~~il~g~g~g~~~~~~~~~~~~~~~~--~- 267 (415) T protein:vir:47 195 FFQLAYDINTHRGYFRISREAIEDAK----VNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGFEKEGKKL--E- 267 (415) T ss_pred eeeEEeeeeeeEeeehhhHHHHhhch----HHHHHHHHHHHHHHHHHHHHHHHhhccccCCcccccccccccccee--c- Confidence 55566666666666789999999843 57889999999999999999999876533111110000000 0000 0 Q ss_pred cccchhHHHHHHHHHHHHHHHHHHHHHhhcCCCccEEEEchhHHHHHHhhcccccccccccccccccccccccCceeEEE Q lcl|Aclame:pro 284 DSNGRWSVEKFKGLLFQVERDANAIAQETRRGKGNFLICSADVASALAMAGVLDYSSGLNGAGGPSIGEVDDTGNLAVGT 363 (468) Q Consensus 284 ~~~grw~~e~~k~L~~~i~~ean~i~~~T~r~~~n~~v~S~~Va~~L~~sG~~~~~~~~~~~~~~~~~~~d~t~~~~~G~ 363 (468) ..+--..+....++.++.. --++.+.+|++|.....|... ..+. +..+...+-++.. .++ T Consensus 268 -~~~~~~~~~i~~~~~~~~~---------~~~~~~~~v~n~~~~~~L~~l---kd~~------G~~i~~~~~~~~~-~~~ 327 (415) T protein:vir:47 268 -VKKAKSLDDIKDAINLNVK---------PNYEHNVAIVSQTMFAKLDKM---KDKL------GNYLIQPDVKEKT-QQR 327 (415) T ss_pred -cccccchHHHHHHHHhhhh---------hccCCCEEEEcHHHHHHHHHh---hccC------CCeeeccCcCCCC-Ccc Confidence 0010112233344333332 124567889999999988752 2111 1111111111111 245 Q ss_pred ecCceEEEEcccccc-cCCcceEEEEEecCCcccceeEeeccchhhcccccCCccccceeeeeeeeeee-ecC--cccc- Q lcl|Aclame:pro 364 INGRIKVFVDPYAAN-LSDKHYYVIGYKGTSPYDAGLFYCPYVPLQMVRSIDPNTFQPKIGFKTRYGMV-SNP--FVTT- 438 (468) Q Consensus 364 l~g~~~vy~D~Ya~~-~~~~dY~~vG~KG~~~~d~glfyaPYv~l~~~~~~dp~s~qP~~g~~tRY~l~-~nP--~~~~- 438 (468) |+ |++|++..++.. .....-+++|---. .+.......+.+ ...|-.+++-.+-...|++.. .+| |... T Consensus 328 l~-G~pV~~~~~~~~~~~~~~~~~~gd~~~-----~~~~~~~~~~~v-~~~~~~~~~~~~~~~~r~d~~v~~~~a~~~~~ 400 (415) T protein:vir:47 328 LL-GAKIEILPDEVLGQKGNNTLIIGNLKD-----AIVLFDRSQYQA-SWTDYMHFGECLMIAVRQDCRILDYKSAIVIE 400 (415) T ss_pred cc-ceeeEEeccccccCCCccEEEEEehhc-----cEEEEeecceEE-EeeccccCceEEEEEEEeccEEeccccEEEEE Confidence 64 445655432210 00001122221000 000000001111 112445566667777888764 355 3221 Q ss_pred ---cCccccccchhhhhhhc Q lcl|Aclame:pro 439 ---NGLYNGTPDGEALTPNA 455 (468) Q Consensus 439 ---~~~~~~~~~~~~~~~~a 455 (468) ...+.| +++..| T Consensus 401 ~~~~~~~~~-----~~~~~~ 415 (415) T protein:vir:47 401 YDDSERGEG-----DLGLEA 415 (415) T ss_pred eeccCCCCC-----CccCCC Confidence 122222 233333 No 27 >protein:vir:4830 Length: 397 # NCBI annotation: MPL-7201 # Family: family:all:21 # MgeID: mge:105 # MgeName: 7201 # Cross-refs: genbank:acc:NP_038327;genbank:gi:9634653;genbank:GeneID:1262632 Probab=95.28 E-value=0.0024 Score=34.96 Aligned_cols=332 Identities=13% Similarity=0.124 Sum_probs=130.6 Q ss_pred CcchHHHHHhhhhhhCCCccchhcc-------------hhhhHHHHHHHhHHHHHHhhhhhhhhhhhhhhc--------- Q lcl|Aclame:pro 1 MFNAEHLQEKWSPVLNHGEAPAIGD-------------RYKRAVTSVLLENQERFLREERGMLNEVAVNSL--------- 58 (468) Q Consensus 1 ~~~~~~l~~kw~p~l~~~~~~~i~~-------------~~~~~~~~~llenq~~~~~~~~~~l~e~~~~~~--------- 58 (468) |=+.++|.+.|.-+=+. +-++.. .-.+++-+.+-+.+++ ++..+....+...... T Consensus 1 Mk~~~el~~~~~~~~~~--i~~~~~~~~~~~~~~~~~~ee~~~l~~ei~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~ 77 (397) T protein:vir:48 1 MKTSNELHDLWVAQGDK--VENLNEKLNVAMLDDSVTAEELQAIKNERDTAKMK-RDMFKEQYTEARANEVVNMSEEEKK 77 (397) T ss_pred CchHHHHHHHHHHHHHH--HHHHHHHHHHhhcchhhhHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHhhhhhhhhhccc Confidence 88888887777665221 111100 0001111111111110 0000000000000000 Q ss_pred ---------------------Ccccccccccccccc-cccccc---cccceehhhhHHhhhhhhhhheeeeecCCcccee Q lcl|Aclame:pro 59 ---------------------GAGTIAPAGSALGSA-NTGGLA---GFDPVLISLVRRAMPNLMAYDVCGVQPMSGPTGL 113 (468) Q Consensus 59 ---------------------~~~~~~~~~~i~~st-~tg~i~---~~~P~Lv~l~RRa~~~LI~~DI~GVQPmTGPTGL 113 (468) ...........+.++ +.|++. .+.+.+ ++...+...-.+++.++||++++|- T Consensus 78 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~~~gg~~iP~~~~~~i---i~~~~~~~~l~~~~~~~~~~~~~~~ 154 (397) T protein:vir:48 78 PLTKSEEEVKAGFVKDFKNLVRGRYQNLLDSKTDASGSDAGLTIPQDIQTAI---HTLVRQYDSLQEYVNVENVTTLTGS 154 (397) T ss_pred cccchhhHHHHHHHHHHHHHHhhhhhHHHHHhhccCCccccccccHHHHHHH---HHHHHHHHHHHhhhceeeccCCcce Confidence 000000000011111 222221 222333 3333455567888999999999886 Q ss_pred eeeeeeeecCCCCcccccccCCccccccccccccccccccCccccCCCccccccccccccccccccccccchhhhhccCC Q lcl|Aclame:pro 114 IFAMRSRYENQAGEEALFNEPDTGFTGGYDASQGDYAVRTGAGVGGDSEGNNPALLNDAAPGTYEVGSKMPREDLERMGE 193 (468) Q Consensus 114 IFAMRsrY~~qsG~EA~fnEa~t~fSg~~~~~~~~~~~~~~~~~~~~~~gt~~~~~~~a~~~~~t~~~gm~Ta~aE~lG~ 193 (468) +--.+ ..+..+. ..| .++++...+ T Consensus 155 ~~~~~--~~~~~~~--------a~~----------------------------------------------v~E~~~~~~ 178 (397) T protein:vir:48 155 RVYEK--WADITGL--------AKL----------------------------------------------DDEAGSIGT 178 (397) T ss_pred EEEEe--ecCCCcc--------eee----------------------------------------------ecccccccc Confidence 65444 1111100 000 000011111 Q ss_pred C-CcchhhcceEEEEEEEEeecccccccccHHHHHHHHHhcCCChhHHHHHHHHHHHHHHhhHHHHHHHhhhhhcccccc Q lcl|Aclame:pro 194 A-NRLFREMSFSIEKTSVTAQSRALKAEYTLELAQDLKAIHGLDAEQELANILSSEVLAEINREVVRRVYTVAKKGAQNN 272 (468) Q Consensus 194 ~-~~~f~EMaFsIeK~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAE~ELanILStEImlEINREii~~l~~va~~~k~~~ 272 (468) . ...|.++.|++.|..+ ...+|-||.+|-. +|.+++|.+-|+..|..-+|+.||.-.- ++ T Consensus 179 ~~~~~~~~v~~~~~k~~~-------~~~iS~ell~ds~----~~l~~~v~~~l~~~~~~~~d~~il~G~g--------~~ 239 (397) T protein:vir:48 179 NDDPKLYPIRYAIKRYAG-------ISTVTNSLLADSA----ENILAWLSGWIAKKVVVTRNKAILEAIA--------TL 239 (397) T ss_pred ccccceeeEEeeheeeee-------ehhhHHHHHhhch----HHHHHHHHHHHHHHHHHHHHHHHhhccc--------cc Confidence 1 1235555555555543 4679999999843 5789999999999999999999875321 11 Q ss_pred ccccccccccccccchhHHHHHHHHHHHHHHHHHHHHHhhcCCCccEEEEchhHHHHHHhhccccccccccccccccccc Q lcl|Aclame:pro 273 VANAGIFDLDVDSNGRWSVEKFKGLLFQVERDANAIAQETRRGKGNFLICSADVASALAMAGVLDYSSGLNGAGGPSIGE 352 (468) Q Consensus 273 ~~~~g~~Dl~~~~~grw~~e~~k~L~~~i~~ean~i~~~T~r~~~n~~v~S~~Va~~L~~sG~~~~~~~~~~~~~~~~~~ 352 (468) ....++.+++ ....++..+. .. -..+..+||+|...+.|... ..+ .+..+++ T Consensus 240 ~~~~~~~~~d----------~i~~~~~~l~-------~~--~~~~a~~v~n~~~~~~L~~l---kd~------~G~~i~~ 291 (397) T protein:vir:48 240 PTKPTLTKWD----------DIIDLQAKVD-------PA--IKQTSFFLTNTSGFTALKKV---KNA------FGDYLME 291 (397) T ss_pred ccccccccHH----------HHHHHHHHhh-------hh--hcCCCEEEECHHHHHHHHHh---hcC------CCceeec Confidence 2222333221 2223333332 11 12346778999999999762 211 1112222 Q ss_pred ccccCceeEEEecCceEEEE--cccccccC-Cc---------ceEEEEEecCCcccceeEeeccchhhcccccCCccccc Q lcl|Aclame:pro 353 VDDTGNLAVGTINGRIKVFV--DPYAANLS-DK---------HYYVIGYKGTSPYDAGLFYCPYVPLQMVRSIDPNTFQP 420 (468) Q Consensus 353 ~d~t~~~~~G~l~g~~~vy~--D~Ya~~~~-~~---------dY~~vG~KG~~~~d~glfyaPYv~l~~~~~~dp~s~qP 420 (468) .+-++.. -++|+| ++|++ |.+..+.. +. +|++++..+..... ..++.. .+-...+- T Consensus 292 ~~~~~~~-~~~l~G-~PV~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~----~~~~~~------~~~~~~~~ 359 (397) T protein:vir:48 292 RDVKSPT-GYSIDG-FAVKEVADRWLANASSGAMPLYFGDLKQAVTLFDRQQMSLL----STNIGG------GAFETDTT 359 (397) T ss_pred cCcCCCC-Cceecc-ceeEEecccccCCcCCCceEEEEEeccceEEEEeecceEEE----Eeccch------hhhhcCce Confidence 2222211 245644 44432 32221111 11 23333333222211 112111 01122223 Q ss_pred eeeeeeeeeee-ecC--cccc--cCccccccchhhhhh Q lcl|Aclame:pro 421 KIGFKTRYGMV-SNP--FVTT--NGLYNGTPDGEALTP 453 (468) Q Consensus 421 ~~g~~tRY~l~-~nP--~~~~--~~~~~~~~~~~~~~~ 453 (468) .+-...|++.. .|| |... .+.....++....+- T Consensus 360 ~~r~~~r~d~~~~~~~a~~~~~~~~~~~~~~~~~~~~~ 397 (397) T protein:vir:48 360 KIRVIDRFDVVATDTESFVPASFKAIADQKGNLGSTAV 397 (397) T ss_pred eEEEEeeeccEEecccceEEEEecccccCCCCccccCC Confidence 34444444432 233 2111 110011111111110 No 28 >protein:vir:7409 Length: 408 # NCBI annotation: major structural protein # Family: family:all:21 # MgeID: mge:146 # MgeName: P335 # Cross-refs: genbank:acc:NP_839926;genbank:gi:30089896;genbank:GeneID:1260683 Probab=94.45 E-value=0.0044 Score=33.52 Aligned_cols=331 Identities=14% Similarity=0.123 Sum_probs=130.0 Q ss_pred CcchHHHHHhhhhhhCCCccchhcchhhhHHHHH---------HHhHHHHHH---hhhhh----hhhhhh---hhhc--- Q lcl|Aclame:pro 1 MFNAEHLQEKWSPVLNHGEAPAIGDRYKRAVTSV---------LLENQERFL---REERG----MLNEVA---VNSL--- 58 (468) Q Consensus 1 ~~~~~~l~~kw~p~l~~~~~~~i~~~~~~~~~~~---------llenq~~~~---~~~~~----~l~e~~---~~~~--- 58 (468) ||+-|+|.++|..+.+. ++...+ .+-.. -++...+.+ ++.+. .+.+.. .... T Consensus 4 ~m~i~el~~~~~~~~~~-----~~~~~~-e~~~~~~~~~~~~e~i~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 77 (408) T protein:vir:74 4 KLTVNQLNEAWIASGDK-----VTDFND-QINMALNDDNFSAEAMSELKNKRDNEKVRRDALREQLVEAQAEQVVNMREE 77 (408) T ss_pred hhhHHHHHHHHHHHHHH-----HHHHHH-HHHHHHhhhcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccc Confidence 88999999999988654 222111 11000 000000000 00000 011100 0000 Q ss_pred ------------------------Cccccc-----ccccccccccccccccccceehhhhHHhhhhhhhhheeeeecCCc Q lcl|Aclame:pro 59 ------------------------GAGTIA-----PAGSALGSANTGGLAGFDPVLISLVRRAMPNLMAYDVCGVQPMSG 109 (468) Q Consensus 59 ------------------------~~~~~~-----~~~~i~~st~tg~i~~~~P~Lv~l~RRa~~~LI~~DI~GVQPmTG 109 (468) ..+... ...+...++..|++.--....-.+++...+.....++++++||++ T Consensus 78 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~gg~~vP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~ 157 (408) T protein:vir:74 78 EKGPLNKSENELKDKFVKDFVNMVRNPMAFLNTVSSKTETSGSDSAAGLTIPQDIRTMINTLVRQYDSLQQYVRVESVST 157 (408) T ss_pred ccccccchhhhhHHHHHHHHHHHHhcchhhhhhhhhhhhcccccCCCceeechhHhhHHHHHHhhhcchhhhcceeeccC Confidence 000000 000001111112211111111133444445556788899999999 Q ss_pred cceeeeeeeeeecCCCCcccccccCCccccccccccccccccccCccccCCCccccccccccccccccccccccchhhhh Q lcl|Aclame:pro 110 PTGLIFAMRSRYENQAGEEALFNEPDTGFTGGYDASQGDYAVRTGAGVGGDSEGNNPALLNDAAPGTYEVGSKMPREDLE 189 (468) Q Consensus 110 PTGLIFAMRsrY~~qsG~EA~fnEa~t~fSg~~~~~~~~~~~~~~~~~~~~~~gt~~~~~~~a~~~~~t~~~gm~Ta~aE 189 (468) .+|-+--.+ ..+.. ..+ . -.+ | T Consensus 158 ~~~~~~~~~--~~~~~-~~~-------~----------------------------------------------~v~--E 179 (408) T protein:vir:74 158 SSGSRVYEK--WTDVT-PLK-------A----------------------------------------------MDE--E 179 (408) T ss_pred CcceEEEEe--ecCCc-ccc-------c----------------------------------------------ccc--c Confidence 887553333 10000 000 0 000 0 Q ss_pred ccCCCCcchhhcc-eEEEEEEEEeecccccccccHHHHHHHHHhcCCChhHHHHHHHHHHHHHHhhHHHHHHHhhhhhcc Q lcl|Aclame:pro 190 RMGEANRLFREMS-FSIEKTSVTAQSRALKAEYTLELAQDLKAIHGLDAEQELANILSSEVLAEINREVVRRVYTVAKKG 268 (468) Q Consensus 190 ~lG~~~~~f~EMa-FsIeK~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAE~ELanILStEImlEINREii~~l~~va~~~ 268 (468) +...++.+ .+++++++..+.-+-...+|-||.+|- .+|.++.|.+-|+..|..-+|+.||.- T Consensus 180 -----~~~~~~~~~~~~~~i~~~~~k~~~~~~iS~ell~ds----~~~l~~~i~~~l~~~~~~~~d~~il~G-------- 242 (408) T protein:vir:74 180 -----DGKIPDLDNPRLTIIKYLIKRYAGIITATNTLLKDT----AENILAWLSSWIAKKVVVTRNQAIIAA-------- 242 (408) T ss_pred -----ccccccccccceeeEEeeeeeEEeeehhHHHHHhhc----hHHHHHHHHHHHHHHHHHHHHHHHhhc-------- Confidence 11122222 334445555555555566999999983 357889999999999999888887742 Q ss_pred ccccccccccccccccccchhHHHHHHHHHHHHHHHHHHHHHhhcCCCccEEEEchhHHHHHHhhccccccccccccccc Q lcl|Aclame:pro 269 AQNNVANAGIFDLDVDSNGRWSVEKFKGLLFQVERDANAIAQETRRGKGNFLICSADVASALAMAGVLDYSSGLNGAGGP 348 (468) Q Consensus 269 k~~~~~~~g~~Dl~~~~~grw~~e~~k~L~~~i~~ean~i~~~T~r~~~n~~v~S~~Va~~L~~sG~~~~~~~~~~~~~~ 348 (468) .-++....++.+++ +++-.+. ..+.. --+.. -.+||+|.....|... ..+. +. T Consensus 243 ~G~~~~~~~~~~~~-------------~i~~~~~---~~l~~-~~~~~-a~~v~n~~~~~~l~~l---kd~~------G~ 295 (408) T protein:vir:74 243 MGTVPKKPTIANFD-------------DVITMIN---TSVDP-AIIAT-SSLLTNQSGLNKLALV---KTAE------GK 295 (408) T ss_pred ccccccccccccHH-------------HHHHHHH---Hhhhh-hhcCC-CEEEEcHHHHHHHHHh---hcCC------Cc Confidence 11222222333221 1211111 01111 11222 3578899999999862 2111 11 Q ss_pred ccccccccCceeEEEecCceEEEE--cccccccCCcce-EEEE---------EecCCcccceeEeeccchhhcccccCCc Q lcl|Aclame:pro 349 SIGEVDDTGNLAVGTINGRIKVFV--DPYAANLSDKHY-YVIG---------YKGTSPYDAGLFYCPYVPLQMVRSIDPN 416 (468) Q Consensus 349 ~~~~~d~t~~~~~G~l~g~~~vy~--D~Ya~~~~~~dY-~~vG---------~KG~~~~d~glfyaPYv~l~~~~~~dp~ 416 (468) .+...+-++.. .++|. |++|++ |-...+....++ +++| -+++ -.+=..||.-. +-. T Consensus 296 ~l~~~~~~~~~-~~~l~-G~pV~~~~~~~~~~~~~~~~~i~~gd~~~~~~~~~~~~----~~i~~~~~~~~------~f~ 363 (408) T protein:vir:74 296 YLLEPDPTKPN-SYLIK-GKQVIVVADRWLPNSGSTVYPLYYGDMSQAITLFDREN----MSLLPTNIGAG------AFE 363 (408) T ss_pred eEeccCcCCCC-Cceec-ceeeEEecCcccccccCCcceEEEEehhccEEEEEecc----eEEEEeccccc------hhh Confidence 12222222221 24564 455554 211111111111 2222 1111 11222232211 112 Q ss_pred cccceeeeeeeeeeee-cC--cccc-----cCccccc--cchhhh Q lcl|Aclame:pro 417 TFQPKIGFKTRYGMVS-NP--FVTT-----NGLYNGT--PDGEAL 451 (468) Q Consensus 417 s~qP~~g~~tRY~l~~-nP--~~~~-----~~~~~~~--~~~~~~ 451 (468) ..+-.+-+..||+..+ +| |... ....+.. +..... T Consensus 364 ~~~~~~r~~~r~d~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~ 408 (408) T protein:vir:74 364 TDTTKIRVIDRFDVKATDSEALVAGSFTAIADQVGNFKTTTSTAV 408 (408) T ss_pred cceeeEEEEEeeCcEEecccceEEEEeecccCCCCCCCCCccccC Confidence 3455555666666532 33 1110 1111111 111111 No 29 >protein:vir:9574 Length: 300 # NCBI annotation: gp40 # Family: family:all:966 # MgeID: mge:171 # MgeName: SM1 # Cross-refs: genbank:acc:NP_862879;genbank:gi:32469471;genbank:GeneID:1461316 Probab=94.37 E-value=0.0046 Score=33.40 Aligned_cols=281 Identities=11% Similarity=0.062 Sum_probs=122.7 Q ss_pred ccccccccccccccceehhhhHHhhhhhhhhheeeeecCCccceeeeeeeeeecCCCCcccccccCCccccccccccccc Q lcl|Aclame:pro 69 ALGSANTGGLAGFDPVLISLVRRAMPNLMAYDVCGVQPMSGPTGLIFAMRSRYENQAGEEALFNEPDTGFTGGYDASQGD 148 (468) Q Consensus 69 i~~st~tg~i~~~~P~Lv~l~RRa~~~LI~~DI~GVQPmTGPTGLIFAMRsrY~~qsG~EA~fnEa~t~fSg~~~~~~~~ 148 (468) .+++|++++..--....-.++.++.+..+-.+++.+.||++-..-+. . +.. +.+| .| T Consensus 1 ma~~t~~~G~lip~~~~~~ii~~l~~~s~i~~l~~~~~~~~~~~~~p-~---~~~--~~~a-------~w---------- 57 (300) T protein:vir:95 1 MSEAQLSKGNLFNPELVTKVINKVKGHSSIAKLSPQKPIPFNGQREF-V---FDF--DSDI-------DI---------- 57 (300) T ss_pred CcccccCCcceechhhHHHHHHHHHhhhhhhhhcceeeccCCceEEE-E---Eec--Ccce-------EE---------- Confidence 45566555543222222233343445556678999999876432221 1 110 0000 00 Q ss_pred cccccCccccCCCccccccccccccccccccccccchhhhhccCCCCcchhhcceEEEEEEEEeecccccccccHHHHHH Q lcl|Aclame:pro 149 YAVRTGAGVGGDSEGNNPALLNDAAPGTYEVGSKMPREDLERMGEANRLFREMSFSIEKTSVTAQSRALKAEYTLELAQD 228 (468) Q Consensus 149 ~~~~~~~~~~~~~~gt~~~~~~~a~~~~~t~~~gm~Ta~aE~lG~~~~~f~EMaFsIeK~tVtAKSRaLKAEYT~ELAQD 228 (468) .+| +.+.++...+++.++..+|.=+-...+|-||.+. T Consensus 58 --------------------------------------v~E-----g~~~~~s~~~f~~v~l~~~k~~~~~~iS~ell~~ 94 (300) T protein:vir:95 58 --------------------------------------VAE-----NGKKTHGGVSLDPVTIVPLKVEYGARVSDEFLHA 94 (300) T ss_pred --------------------------------------eeC-----CcccccccccceeeEeeeEEEEEeehhhHHHhcc Confidence 001 1234444555566666666666667789998753 Q ss_pred HHHhcCCChhHHHHHHHHHHHHHHhhHHHHHHHhhhhhccccccc----cccccccccccccchhHHHHHHHHHHHHHHH Q lcl|Aclame:pro 229 LKAIHGLDAEQELANILSSEVLAEINREVVRRVYTVAKKGAQNNV----ANAGIFDLDVDSNGRWSVEKFKGLLFQVERD 304 (468) Q Consensus 229 LkAiHGLDAE~ELanILStEImlEINREii~~l~~va~~~k~~~~----~~~g~~Dl~~~~~grw~~e~~k~L~~~i~~e 304 (468) ... ..+|-+++|.+-|...|...+++.++.-... ..+...++ ...+.........+-- .+..... T Consensus 95 ~~d-~~~~l~~~i~~~l~~aia~~~d~~~l~G~~~--~~g~~~~~~~~~~~~~~~~~~~~~~~~~--------~~~~i~~ 163 (300) T protein:vir:95 95 SEE-AKVDMLTDFVEGFSKKLARGLDIMSIHGINP--RTKQASTIIGDNCFDKKVTQTVPFKDTN--------PDESMED 163 (300) T ss_pred CCC-CHHHHHHHHHHHHHHHHHHHHHHhhhhcccC--CCCCCcccccccccccccceeecccccc--------hHHHHHH Confidence 222 2356788888888888888888888754311 11111100 0011111111111100 1111111 Q ss_pred HHHHHHhhcCCCccEEEEchhHHHHHHhhcccccccccccccccccccccccCceeEEEecCceEEEEccccccc--CCc Q lcl|Aclame:pro 305 ANAIAQETRRGKGNFLICSADVASALAMAGVLDYSSGLNGAGGPSIGEVDDTGNLAVGTINGRIKVFVDPYAANL--SDK 382 (468) Q Consensus 305 an~i~~~T~r~~~n~~v~S~~Va~~L~~sG~~~~~~~~~~~~~~~~~~~d~t~~~~~G~l~g~~~vy~D~Ya~~~--~~~ 382 (468) +-... ..-.++.+-+|++|+....|... ..+ .+..+..-+.++. ..++|.| ++|+++.+.... .+. T Consensus 164 ~~~~~-~~~~~~~~~~vmn~~~~~~L~~l---kd~------~G~~i~~~~~~~~-~~~~l~G-~Pv~~s~~v~~~~~~~~ 231 (300) T protein:vir:95 164 AVGMI-DGSERDITGAILDPIFTTALSKM---KNA------EGGKLYPELAWGG-VPDAING-LAVDKNRTVSYSQTDPK 231 (300) T ss_pred HHHHh-hhcCCCccEEEECHHHHHHHHHh---hcc------CCCeeccCccccC-CCceecc-eeeEEecCCCCCCCCCc Confidence 11111 12246667789999999888652 211 1111111112211 2367754 688877553211 122 Q ss_pred ceEEEEEecCCcccceeEeeccchhhccc--ccCCcc-----c---cceeeeeeeeeeee-cC--cccccCcccc Q lcl|Aclame:pro 383 HYYVIGYKGTSPYDAGLFYCPYVPLQMVR--SIDPNT-----F---QPKIGFKTRYGMVS-NP--FVTTNGLYNG 444 (468) Q Consensus 383 dY~~vG~KG~~~~d~glfyaPYv~l~~~~--~~dp~s-----~---qP~~g~~tRY~l~~-nP--~~~~~~~~~~ 444 (468) +.+++|= +..+++|......++.. -.|+++ | |=.+-+..|+|..+ +| |+..... .| T Consensus 232 ~~~~~GD-----f~~~~~~~~~~~~~~~v~~~~~~d~~~~~~f~~~~v~~r~~~r~d~~v~~~~a~~~l~~~-~g 300 (300) T protein:vir:95 232 NTAIVGD-----FETMFKWGYAKEVPMEIIKYGDPDNSGRDLKGYNQIYIRCEAYIGWGIMDAASFARIVKT-GG 300 (300) T ss_pred cEEEEee-----ccceEEEEEecccEEEEeeccCCCCcchhhhhcCcEEEEEEEeecceeecccceEEEecC-CC Confidence 3333331 00112222222222211 123332 2 12333455887544 66 5443221 11 No 30 >protein:vir:78523 Length: 338 # NCBI annotation: Putative head structural protein # Family: family:all:507 # MgeID: mge:1853 # MgeName: U2 # Cross-refs: genbank:acc:YP_001491585;genbank:gi:157786408;genbank:GeneID:5625675 Probab=94.23 E-value=0.005 Score=33.19 Aligned_cols=309 Identities=14% Similarity=0.094 Sum_probs=125.2 Q ss_pred HHHHHhHHHHHHhhhhhhhhhhhhhhcCccccccccccccccccccccccccee-hhhhHHhhhhhhhhheeeeecCCcc Q lcl|Aclame:pro 32 TSVLLENQERFLREERGMLNEVAVNSLGAGTIAPAGSALGSANTGGLAGFDPVL-ISLVRRAMPNLMAYDVCGVQPMSGP 110 (468) Q Consensus 32 ~~~llenq~~~~~~~~~~l~e~~~~~~~~~~~~~~~~i~~st~tg~i~~~~P~L-v~l~RRa~~~LI~~DI~GVQPmTGP 110 (468) |+ -|+|....+.+.+.. ...+++++- -.-+.+ -.+++.+.+..+-..+|.+.||+++ T Consensus 1 ~~---------------~~~e~~~~~~~~~~~------~~~~~~~~~-liP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~ 58 (338) T protein:vir:78 1 MA---------------TLNELAPNTAGSNHQ------GRLAHVPSD-LLPKEIVGPIFDKAQESSLVLRLGENIPISYG 58 (338) T ss_pred Cc---------------chHHhhhhhcccccc------cceeccccc-ccchHHHHHHHHHHHhhchhhhhcceeeccCC Confidence 11 233322222222211 011111111 111111 1345555566677889999999987 Q ss_pred ceeeeeeeeeecCCCCcccccccCCccccccccccccccccccCccccCCCccccccccccccccccccccccchhhhhc Q lcl|Aclame:pro 111 TGLIFAMRSRYENQAGEEALFNEPDTGFTGGYDASQGDYAVRTGAGVGGDSEGNNPALLNDAAPGTYEVGSKMPREDLER 190 (468) Q Consensus 111 TGLIFAMRsrY~~qsG~EA~fnEa~t~fSg~~~~~~~~~~~~~~~~~~~~~~gt~~~~~~~a~~~~~t~~~gm~Ta~aE~ 190 (468) ..-|.-.. ... ...+-+. . . ..-.+ | T Consensus 59 ~~~ip~~~----~~~---------~a~~v~~------------------~----~----------------~~~~~--E- 84 (338) T protein:vir:78 59 ETIIPTTV----KRP---------EVGQVGV------------------G----T----------------SNEQR--E- 84 (338) T ss_pred ceEEEEEe----cCc---------cceeecc------------------c----c----------------ccccc--c- Confidence 55444322 100 0000000 0 0 00001 1 Q ss_pred cCCCCcchhhcceEEEEEEEEeecccccccccHHHHHHHHHhcCCChhHHHHHHHHHHHHHHhhHHHHHHHhhhhhcccc Q lcl|Aclame:pro 191 MGEANRLFREMSFSIEKTSVTAQSRALKAEYTLELAQDLKAIHGLDAEQELANILSSEVLAEINREVVRRVYTVAKKGAQ 270 (468) Q Consensus 191 lG~~~~~f~EMaFsIeK~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAE~ELanILStEImlEINREii~~l~~va~~~k~ 270 (468) +..+++-.-+++.++...+..+-...+|-||.+|-. .|.|++|.+-|...|...||..||.---...-. .. T Consensus 85 ----g~~~~~~~~~f~~v~l~~~k~~~~~~is~ell~ds~----~~~~~~i~~~la~a~~~~~d~~~l~G~g~~~~~-~~ 155 (338) T protein:vir:78 85 ----GGTKPLSGTAWDTRSVAPIKLATIVTVSEEFARMNP----SGLYTKLQADLAYAIGRGIDLAVFHGKSPLTGS-AL 155 (338) T ss_pred ----cccccccccceeEEEEEEEEEEEeehhhHHHHhcCH----HHHHHHHHHHHHHHHHHHHHHHhhcccCCCccc-cc Confidence 122333334445555555555556678999999833 678899999999999999998888532211000 00 Q ss_pred ccccc----cccccccccccchhHHHHHHHHHHHHHHHHHHHHHhhcCCCccEEEEchhHHHHHHhhccccccccccccc Q lcl|Aclame:pro 271 NNVAN----AGIFDLDVDSNGRWSVEKFKGLLFQVERDANAIAQETRRGKGNFLICSADVASALAMAGVLDYSSGLNGAG 346 (468) Q Consensus 271 ~~~~~----~g~~Dl~~~~~grw~~e~~k~L~~~i~~ean~i~~~T~r~~~n~~v~S~~Va~~L~~sG~~~~~~~~~~~~ 346 (468) .++.. .+....+....+ -...|.....+..-......+..+-++++|+....|...-.+...-+ T Consensus 156 ~gi~~~~~~~~~~~~~~~~~~-------~~~~~~~~~~~~~~~~~~~~~~~~~~~m~~~~~~~L~~~~~l~d~~g----- 223 (338) T protein:vir:78 156 QGIDTNNVIVNTTNVDYLQTG-------TTPLLDRFLDGYDLVSANTDVDFNGWAADPRYRARLLRSQAYRDANG----- 223 (338) T ss_pred ccccccccccccccccccccc-------chhhHHHHHHHHHHhhhhccccceEEEEchHHHHHHHHHhhhccCCC----- Confidence 00000 000001100000 00112222222222223335677889999999888765321211100 Q ss_pred ccccccccccCceeEEEecCceEEEEccccccc-----C--------CcceEEEEEecCCcccceeEeeccchhhccccc Q lcl|Aclame:pro 347 GPSIGEVDDTGNLAVGTINGRIKVFVDPYAANL-----S--------DKHYYVIGYKGTSPYDAGLFYCPYVPLQMVRSI 413 (468) Q Consensus 347 ~~~~~~~d~t~~~~~G~l~g~~~vy~D~Ya~~~-----~--------~~dY~~vG~KG~~~~d~glfyaPYv~l~~~~~~ 413 (468) .....-+.++. ..++|. |++|+++.+...+ . ++.++++|..++...+ ..+| ..+.... T Consensus 224 -~~l~~~~~~~~-~~~~l~-G~PV~~~~~ip~~~~~~~~~~~~~~~gdfs~~~~~~~~~~~i~----~~~~--~~~~~~~ 294 (338) T protein:vir:78 224 -NVDPTRINLAA-SAGDLL-GLPVQFGKAVGGDLGAATDSKVRVVGGDFSQLKYGFADEIRVK----MSDT--ATLTDNT 294 (338) T ss_pred -ceeecccccCC-CCceee-eeeEEEccccCccccccCCcccEEEEEecceEEEEeecccEEE----Eeec--ccccccc Confidence 00011111111 125564 4588877443211 0 1122223333222111 0111 1122222 Q ss_pred CCcc-----c---cceeeeeeeee-eeecC--cccccCccccccch Q lcl|Aclame:pro 414 DPNT-----F---QPKIGFKTRYG-MVSNP--FVTTNGLYNGTPDG 448 (468) Q Consensus 414 dp~s-----~---qP~~g~~tRY~-l~~nP--~~~~~~~~~~~~~~ 448 (468) ||.. | |=.+=...|++ .+.+| |+.... ...+++ T Consensus 295 ~~~~~~~~~~~~~~~~~r~~~r~d~~v~~~~a~~~l~~--~~~~~~ 338 (338) T protein:vir:78 295 SPTPQTVSMWQTNQIAILIEVTFGWLLGDKQAFVKFVD--DEDPDA 338 (338) T ss_pred cccccchhhhhcCcEEEEEEEEeccEeecccceEEEec--ccCCCC Confidence 3322 1 11222356787 45566 543322 122332 No 31 >protein:vir:41 Length: 299 # NCBI annotation: major capsid protein # Family: family:all:507 # MgeID: mge:2 # MgeName: A118 # Cross-refs: genbank:acc:NP_463467;swissprot:trembl:q9t1b7;genbank:gi:16798789;uniprot:Q9T1B7;genbank:GeneID:922353 Probab=93.61 E-value=0.007 Score=32.41 Aligned_cols=279 Identities=10% Similarity=0.042 Sum_probs=125.8 Q ss_pred cCcccccccccccccccccccccccc-eehhhhHHhhhhhhhhheeeeecCCccceeeeeeeeeecCCCCcccccccCCc Q lcl|Aclame:pro 58 LGAGTIAPAGSALGSANTGGLAGFDP-VLISLVRRAMPNLMAYDVCGVQPMSGPTGLIFAMRSRYENQAGEEALFNEPDT 136 (468) Q Consensus 58 ~~~~~~~~~~~i~~st~tg~i~~~~P-~Lv~l~RRa~~~LI~~DI~GVQPmTGPTGLIFAMRsrY~~qsG~EA~fnEa~t 136 (468) +|.+... ..+++++... .-| ..-.++++..+..+-.+++-+-||++.+.-+- .. ++.++ T Consensus 1 ~g~~a~~-----~~~~~~~~~~-iP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~~-----~~--~~~~a------- 60 (299) T protein:vir:41 1 MGFNPDT-----TTMQSAKTGS-IPINISEQIITGVKNGSAAMKLAKAVPMTKPEEEFT-----FM--SGVGA------- 60 (299) T ss_pred CCcCCCc-----ccccCCCcee-cchhHHHHHHHHHHhcchhhhhceeeecCCCcEEEE-----EE--cCCce------- Confidence 3333221 1122222221 112 22345666677888889999999988763221 00 00000 Q ss_pred cccccccccccccccccCccccCCCccccccccccccccccccccccchhhhhccCCCCcchhhcceEEEEEEEEeeccc Q lcl|Aclame:pro 137 GFTGGYDASQGDYAVRTGAGVGGDSEGNNPALLNDAAPGTYEVGSKMPREDLERMGEANRLFREMSFSIEKTSVTAQSRA 216 (468) Q Consensus 137 ~fSg~~~~~~~~~~~~~~~~~~~~~~gt~~~~~~~a~~~~~t~~~gm~Ta~aE~lG~~~~~f~EMaFsIeK~tVtAKSRa 216 (468) .| . +| +..+++...++++++...|..+ T Consensus 61 ~~----------------------------------------------v--~E-----~~~~~~~~~~f~~v~l~~~k~~ 87 (299) T protein:vir:41 61 FW----------------------------------------------V--DE-----AERIQTSKPTFTKAKMRSKKMG 87 (299) T ss_pred ee----------------------------------------------e--ec-----CccccccccceeEEEEeeEEEE Confidence 00 0 01 2234555666778888888888 Q ss_pred ccccccHHHHHHHHHhcCCChhHHHHHHHHHHHHHHhhHHHHHHHhhhhhccccccccccccccccccccchhHHHHHHH Q lcl|Aclame:pro 217 LKAEYTLELAQDLKAIHGLDAEQELANILSSEVLAEINREVVRRVYTVAKKGAQNNVANAGIFDLDVDSNGRWSVEKFKG 296 (468) Q Consensus 217 LKAEYT~ELAQDLkAiHGLDAE~ELanILStEImlEINREii~~l~~va~~~k~~~~~~~g~~Dl~~~~~grw~~e~~k~ 296 (468) -...+|-||.+|-. .|.++.|.+.|...|...+|+.||.--- .++..|+...-...-.....+--..+.... T Consensus 88 ~~~~is~ell~ds~----~~~~~~i~~~l~~a~~~~~d~a~l~G~g----~~~~~gil~~~~~~~~~~~~~~~~~~~l~~ 159 (299) T protein:vir:41 88 VIIPTTKENLNYSV----TNFFSLMQAEIVEAFYKKFDQAVFTGVE----SPYNWNILKSATDASNLVEETANKYDDLNE 159 (299) T ss_pred EeehhhHHHHhcCH----HHHHHHHHHHHHHHHHHHHHHHHhhccc----CcccccccccccccceeeccccccHHHHHH Confidence 88899999999754 4678889999999999888888874211 111111110000000000000001122222 Q ss_pred HHHHHHHHHHHHHHhhcCCCccEEEEchhHHHHHHhhcccccccccccccccccccccccCceeEEEecCceEEEEcccc Q lcl|Aclame:pro 297 LLFQVERDANAIAQETRRGKGNFLICSADVASALAMAGVLDYSSGLNGAGGPSIGEVDDTGNLAVGTINGRIKVFVDPYA 376 (468) Q Consensus 297 L~~~i~~ean~i~~~T~r~~~n~~v~S~~Va~~L~~sG~~~~~~~~~~~~~~~~~~~d~t~~~~~G~l~g~~~vy~D~Ya 376 (468) ++ +.+.. --++++.+||+|+....|..- -+ + .+...++.+-++. .++|. +++|++.... T Consensus 160 ~~-------~~l~~--~~~~~~~~v~n~~~~~~L~~l--kd-~------~G~~l~~~~~~~~--~~~l~-G~PV~~~~~~ 218 (299) T protein:vir:41 160 AI-------GLIEA--EDLEPNGIATIRKQRVKYRST--KD-G------NGMPIFNTATSNG--VDDVL-GLPIAYTPKY 218 (299) T ss_pred HH-------Hhhhc--ccCCcCEEEEcHHHHHHHHHh--hc-c------CCceeecCCcCCC--Cceec-ceeeEEeccc Confidence 32 22222 234567799999999999862 11 1 1112222222222 24665 4777766443 Q ss_pred cccCC--------cceEEEEEecCCcccceeEeeccchhhcccccCCcc-----ccc-eeee--eeeeeeee-cC--ccc Q lcl|Aclame:pro 377 ANLSD--------KHYYVIGYKGTSPYDAGLFYCPYVPLQMVRSIDPNT-----FQP-KIGF--KTRYGMVS-NP--FVT 437 (468) Q Consensus 377 ~~~~~--------~dY~~vG~KG~~~~d~glfyaPYv~l~~~~~~dp~s-----~qP-~~g~--~tRY~l~~-nP--~~~ 437 (468) ..... +.++++|..++...+- -.+..+....||+. ||- .++| ..|++..+ || |+. T Consensus 219 ~~~~~~~~~~~gdfs~~~i~~~~~~~i~~------~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~~A~~~ 292 (299) T protein:vir:41 219 TFGDKDISELVGDWNQAYYGILRGVEYEI------LTEATLTTVADETGKPLNLAERDMAAIKATFEVGFMVVKDEAFSA 292 (299) T ss_pred CCCCCceEEEEEecccEEEEEecCcEEEE------eecccccccccccccchhhhhcCcEEEEEEEEeccEEecccceEE Confidence 32111 1122233333221110 00000111123321 222 2333 35666554 33 433 Q ss_pred ccCccccccchhhhhhhcc Q lcl|Aclame:pro 438 TNGLYNGTPDGEALTPNAN 456 (468) Q Consensus 438 ~~~~~~~~~~~~~~~~~an 456 (468) .. .+.|| T Consensus 293 l~------------~~aa~ 299 (299) T protein:vir:41 293 VQ------------PKAGN 299 (299) T ss_pred EE------------eccCC Confidence 22 11222 No 32 >protein:vir:4092 Length: 390 # NCBI annotation: major capsid protein a # Family: family:all:635 # MgeID: mge:86 # MgeName: 2389 # Cross-refs: genbank:acc:NP_510986;swissprot:trembl:q8w604;genbank:gi:17488508;uniprot:Q8W604;genbank:GeneID:1260361 Probab=93.43 E-value=0.0076 Score=32.21 Aligned_cols=345 Identities=13% Similarity=0.045 Sum_probs=124.7 Q ss_pred CcchHHHHHhhhhhhCCCccchhcchhhh-HHHHHHHhH----HHHHHhhhhhhhhhhh--hhhc--------Ccccc-c Q lcl|Aclame:pro 1 MFNAEHLQEKWSPVLNHGEAPAIGDRYKR-AVTSVLLEN----QERFLREERGMLNEVA--VNSL--------GAGTI-A 64 (468) Q Consensus 1 ~~~~~~l~~kw~p~l~~~~~~~i~~~~~~-~~~~~llen----q~~~~~~~~~~l~e~~--~~~~--------~~~~~-~ 64 (468) |=+-+++.+|...+-... +-++.+.-+. .....+.|. |++-.++.+....... .... ..... . T Consensus 1 ik~L~e~~~e~~e~~~~~-~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~r~~ 79 (390) T protein:vir:40 1 MNNLDKKDSETLNISTAF-LNAIKEGATEAEQVTAFTNMAEQIQNNIIAQARKEVNREMNDNNVLASRGANALTSDESKY 79 (390) T ss_pred CchHHHHHHHHHHHHHHH-HHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCchhccHHHHHH Confidence 444444444443321110 1111111110 011111110 1111111110000000 0000 00000 0 Q ss_pred ccccccccccccccccccceehh-hhHHhhhhhhhhheeeeecCCccceeeeeeeeeecCCCCcccccccCCcccccccc Q lcl|Aclame:pro 65 PAGSALGSANTGGLAGFDPVLIS-LVRRAMPNLMAYDVCGVQPMSGPTGLIFAMRSRYENQAGEEALFNEPDTGFTGGYD 143 (468) Q Consensus 65 ~~~~i~~st~tg~i~~~~P~Lv~-l~RRa~~~LI~~DI~GVQPmTGPTGLIFAMRsrY~~qsG~EA~fnEa~t~fSg~~~ 143 (468) ....+.+.++++.-.-.-+.+.. ++++.-..-+-.+++-+.||++....|.... + .+ ++ .|. T Consensus 80 ~~~~~~~~~~~~gg~lvP~~~~~~I~~~~~~~s~i~~~~~~~~~~~~~~~i~~~~----~-~~-~a-------~~~---- 142 (390) T protein:vir:40 80 YNEVIAGNGFAGVTALLPPTVFERVFEDLTVEHPLLSKINFVNTTATTEWIISVG----D-VA-TA-------WWG---- 142 (390) T ss_pred HHHHHhccCcccCcccccHHHHHHHHHHHHhhhhhhhhceeeecCCceeEEEEEc----C-Cc-ce-------eee---- Confidence 01111222221111111111111 2222223334567899999988655443111 1 00 00 000 Q ss_pred ccccccccccCccccCCCccccccccccccccccccccccchhhhhccCCCCcchhhcceEEEEEEEEeecccccccccH Q lcl|Aclame:pro 144 ASQGDYAVRTGAGVGGDSEGNNPALLNDAAPGTYEVGSKMPREDLERMGEANRLFREMSFSIEKTSVTAQSRALKAEYTL 223 (468) Q Consensus 144 ~~~~~~~~~~~~~~~~~~~gt~~~~~~~a~~~~~t~~~gm~Ta~aE~lG~~~~~f~EMaFsIeK~tVtAKSRaLKAEYT~ 223 (468) ...++.-.+....|.+..|++.|..+- ...|- T Consensus 143 -----------------------------------------~E~~~~~~~~~~~f~~i~l~~~k~~~~-------i~iS~ 174 (390) T protein:vir:40 143 -----------------------------------------PLCAEIKEVLDNGFDKIQTGMYKLSAY-------IPVCN 174 (390) T ss_pred -----------------------------------------ccccccCccccccceeeEeeeeeEEEe-------ehhhH Confidence 000000011123577777777777653 45788 Q ss_pred HHHHHHHHhcCCChhHHHHHHHHHHHHHHhhHHHHHH---------Hhhhhhc--cccccccccccccccccccchhHHH Q lcl|Aclame:pro 224 ELAQDLKAIHGLDAEQELANILSSEVLAEINREVVRR---------VYTVAKK--GAQNNVANAGIFDLDVDSNGRWSVE 292 (468) Q Consensus 224 ELAQDLkAiHGLDAE~ELanILStEImlEINREii~~---------l~~va~~--~k~~~~~~~g~~Dl~~~~~grw~~e 292 (468) ||.+|-- .|.|++|.+.|+..|..-+|+.||.- |...+.. ++... ...+.+ . .. T Consensus 175 ell~ds~----~~l~~~i~~~la~~i~~~~~~a~l~G~G~~~P~Gil~~~~~~~~~~~~~-~~~~~~--t--------~~ 239 (390) T protein:vir:40 175 AMLDLGP----SWLDQYVRTILGEAMALGLEAGIVNGSGKDQPIGMMRDLNNVTAGEHPV-KTATPL--T--------DL 239 (390) T ss_pred HHHhcch----HHHHHHHHHHHHHHHHHHHHhhhhcccCCCccceeeecccccccccccc-cccccc--c--------hh Confidence 9998863 47899999999999999999998852 1111100 00000 011111 0 11 Q ss_pred HHHHHHHHHHHHHHHHHHhhcCCCccEEEEchhHHHHHHhhcccccccccccccccccccccccCceeEEEecCceEEEE Q lcl|Aclame:pro 293 KFKGLLFQVERDANAIAQETRRGKGNFLICSADVASALAMAGVLDYSSGLNGAGGPSIGEVDDTGNLAVGTINGRIKVFV 372 (468) Q Consensus 293 ~~k~L~~~i~~ean~i~~~T~r~~~n~~v~S~~Va~~L~~sG~~~~~~~~~~~~~~~~~~~d~t~~~~~G~l~g~~~vy~ 372 (468) ....++.++......-..+.. +++.|++-....+..|...-++. |.+|....+.+.-+++|++ T Consensus 240 ~~~~~~~~l~~~~~~~~~~~~-~~a~~i~n~~t~~~~l~~~~~~~----------------d~~G~~v~~~~~~g~pvv~ 302 (390) T protein:vir:40 240 TPATLATKVMLPLTDNGKKSV-SDAILVINPADYWSKIYAATSYM----------------TPQGVWVTGILPVPLEIVQ 302 (390) T ss_pred hHHHHHHHHHHHhhcchhhhh-cCceEEEcchhHHHHHHHHhhcc----------------CCCCccccccCCCceeEEE Confidence 122233344333333333332 45556544445566665543332 2223322233334678887 Q ss_pred ccccccc----CCcceEEEEEecCCcccceeEeeccc--hhh----------cccccCCccccceeeeeeeee-eeecCc Q lcl|Aclame:pro 373 DPYAANL----SDKHYYVIGYKGTSPYDAGLFYCPYV--PLQ----------MVRSIDPNTFQPKIGFKTRYG-MVSNPF 435 (468) Q Consensus 373 D~Ya~~~----~~~dY~~vG~KG~~~~d~glfyaPYv--~l~----------~~~~~dp~s~qP~~g~~tRY~-l~~nP~ 435 (468) +.++... -++.++++|-.++...+.+ ++. .-+ -...+||+.|. ++=+..==| -.+.|| T Consensus 303 ~~~~p~~~i~~Gd~s~~~i~~~~~~~v~~~----~~~~f~~~~~~~r~~~r~dg~v~~~~A~~-~l~~~~~~~~~~~~~~ 377 (390) T protein:vir:40 303 SVAVPVGKAVAGRAKDYFMGIGSEQVIRTS----TEYRLLDDETLYYAKQYANGRPKDNSSFL-VFDITGLEGSPAIDVN 377 (390) T ss_pred cCCCCCCcEEEEeeceEEEEeecceEEEec----chhhhhcCcEEEEEEEEeCCEEecccceE-EEEeeccCCCCCCCcc Confidence 7554321 0111122333332222211 110 000 00112555544 111111001 133445 Q ss_pred ccccCccccccch Q lcl|Aclame:pro 436 VTTNGLYNGTPDG 448 (468) Q Consensus 436 ~~~~~~~~~~~~~ 448 (468) .++...++..+.. T Consensus 378 ~~~~~~~~~~~~~ 390 (390) T protein:vir:40 378 VVNNATPSETPAE 390 (390) T ss_pred eeeCCCCCCCCCC Confidence 5443333332222 No 33 >protein:vir:4997 Length: 397 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:109 # MgeName: Sfi21 # Cross-refs: genbank:acc:NP_049971;genbank:gi:9632943;genbank:GeneID:1262106 Probab=93.20 E-value=0.0084 Score=31.97 Aligned_cols=333 Identities=14% Similarity=0.093 Sum_probs=131.0 Q ss_pred CcchHHHHHhhhhhhCCCccchhcch-------------hhhHHHHHHHh------HHHHHHhhhhhhh----------- Q lcl|Aclame:pro 1 MFNAEHLQEKWSPVLNHGEAPAIGDR-------------YKRAVTSVLLE------NQERFLREERGML----------- 50 (468) Q Consensus 1 ~~~~~~l~~kw~p~l~~~~~~~i~~~-------------~~~~~~~~lle------nq~~~~~~~~~~l----------- 50 (468) |-+.++|.+.|.-+.+. +-++.+. --+++.+.+-+ .+.+.+.+.+..- T Consensus 1 Mk~~~eL~~~~~~~~~~--~~~l~~~~~~~~~~~~~~~ee~~~l~~ei~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 78 (397) T protein:vir:49 1 MKTSNELHDLWIAQGDK--VENLNEKLNVAMLDDSVSAEELQAIKNERDTAKMKRDLFKEQYTEARANEVANMSEEEKKP 78 (397) T ss_pred CchHHHHHHHHHHHHHH--HHHHHHHHHHHHhcchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccccc Confidence 88899999999887764 1111000 00111111110 0000011100000 Q ss_pred ---hhhh---------hhhcCc-ccccccccccccccccccccccceehhhhHHhhhhhhhhheeeeecCCccceeeeee Q lcl|Aclame:pro 51 ---NEVA---------VNSLGA-GTIAPAGSALGSANTGGLAGFDPVLISLVRRAMPNLMAYDVCGVQPMSGPTGLIFAM 117 (468) Q Consensus 51 ---~e~~---------~~~~~~-~~~~~~~~i~~st~tg~i~~~~P~Lv~l~RRa~~~LI~~DI~GVQPmTGPTGLIFAM 117 (468) .+.. ...+.+ ...........+++.|++.--....-.+++..-+...-.+++.|+||++.+|-+--. T Consensus 79 ~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~t~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~ 158 (397) T protein:vir:49 79 LTKNEEEVKANFVKDFKNLVRGRYQNLLDSKTDGSGSDAGLTIPQDIRTAINTLVRQFDSLQEYVNVENVTTLTGSRVYE 158 (397) T ss_pred ccchhhHHHHHHHHHHHHHhhcchhhHHHhhhccCCccCcceecHHHHHHHHHHHHhhhhHhhhcceeeccCCcceEEEE Confidence 0000 000000 000000000011111211111111113444455666777899999999888753222 Q ss_pred eeeecCCCCcccccccCCccccccccccccccccccCccccCCCccccccccccccccccccccccchhhhhccCCCCcc Q lcl|Aclame:pro 118 RSRYENQAGEEALFNEPDTGFTGGYDASQGDYAVRTGAGVGGDSEGNNPALLNDAAPGTYEVGSKMPREDLERMGEANRL 197 (468) Q Consensus 118 RsrY~~qsG~EA~fnEa~t~fSg~~~~~~~~~~~~~~~~~~~~~~gt~~~~~~~a~~~~~t~~~gm~Ta~aE~lG~~~~~ 197 (468) + .....+ .+ .| .+ | +.. T Consensus 159 ~--~~~~~~-~a-------~~----------------------------------------------v~--E-----~~~ 175 (397) T protein:vir:49 159 K--WADITG-LA-------KL----------------------------------------------DD--E-----GGQ 175 (397) T ss_pred e--eccCCc-ce-------ee----------------------------------------------ec--c-----ccc Confidence 2 111100 00 00 00 1 011 Q ss_pred hhhcce-EEEEEEEEeecccccccccHHHHHHHHHhcCCChhHHHHHHHHHHHHHHhhHHHHHHHhhhhhcccccccccc Q lcl|Aclame:pro 198 FREMSF-SIEKTSVTAQSRALKAEYTLELAQDLKAIHGLDAEQELANILSSEVLAEINREVVRRVYTVAKKGAQNNVANA 276 (468) Q Consensus 198 f~EMaF-sIeK~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAE~ELanILStEImlEINREii~~l~~va~~~k~~~~~~~ 276 (468) +++-.. +++.++..++.-+-...+|-||.+|-. +|.+++|.+-|+..|..-+|+.||.-.- ++.... T Consensus 176 ~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~----~~l~~~i~~~l~~~~~~~~d~ail~G~g--------~~~~~~ 243 (397) T protein:vir:49 176 IGQNDDPKLSLIRYAIKRYAGISTVTNSLLADSA----ENILAWLSGWIAKKVVVTRNKAILEAIG--------TLPNKP 243 (397) T ss_pred cccccccceeeeEeeeeeeEeehhhHHHHHhhhh----HHHHHHHHHHHHHHHHHHHHHHHHhccc--------cccccc Confidence 222221 233444444444445679999999853 5789999999999999999998874321 222233 Q ss_pred ccccccccccchhHHHHHHHHHHHHHHHHHHHHHhhcCCCccEEEEchhHHHHHHhhccccccccccccccccccccccc Q lcl|Aclame:pro 277 GIFDLDVDSNGRWSVEKFKGLLFQVERDANAIAQETRRGKGNFLICSADVASALAMAGVLDYSSGLNGAGGPSIGEVDDT 356 (468) Q Consensus 277 g~~Dl~~~~~grw~~e~~k~L~~~i~~ean~i~~~T~r~~~n~~v~S~~Va~~L~~sG~~~~~~~~~~~~~~~~~~~d~t 356 (468) +++.++ ....++..+. +.-.....+|++|.....|... ..+- +..++..+-+ T Consensus 244 ~~~~~d----------~i~~~~~~l~---------~~~~~~a~~v~n~~~~~~l~~l---kd~~------g~~l~~~~~~ 295 (397) T protein:vir:49 244 TLAKWD----------DIIDLQAKVD---------PAIKQTSLFLTNTSGFTALKKV---KNAM------GDYLMERDVK 295 (397) T ss_pred cccCHH----------HHHHHHHhhh---------hhhcCCCEEEEcHHHHHHHHHh---hccC------Cceeeccccc Confidence 333322 1223333332 1123456789999999988763 2110 1111111111 Q ss_pred CceeEEEecCceEEE-E-cccccccC-Cc---------ceEEEEEecCCcccceeEeeccchhhcccccCCccccceeee Q lcl|Aclame:pro 357 GNLAVGTINGRIKVF-V-DPYAANLS-DK---------HYYVIGYKGTSPYDAGLFYCPYVPLQMVRSIDPNTFQPKIGF 424 (468) Q Consensus 357 ~~~~~G~l~g~~~vy-~-D~Ya~~~~-~~---------dY~~vG~KG~~~~d~glfyaPYv~l~~~~~~dp~s~qP~~g~ 424 (468) .. ..++|+|+ +|+ + |.+..+.. +. +|++++..+... +-..||... +-...+-.+-. T Consensus 296 ~g-~~~~l~G~-pV~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~----i~~~~~~~~------~~~~~~~~~~~ 363 (397) T protein:vir:49 296 SP-TGYSIDGF-VVKEISDRFLPNGTGGAMPLYFGDLKQAVTLFDRQHLS----LLSTNIGGG------AFETDTTKVRV 363 (397) T ss_pred CC-CCceecce-eeEEecccccccccCCceeEEEeeccceEEEEeecccE----EEEeccccc------hhhcCeeeEEE Confidence 11 12456554 444 3 32111101 11 122222222221 222333211 11233334445 Q ss_pred eeeeeeee-cC--ccccc-----Cccccccchhhhhhhc Q lcl|Aclame:pro 425 KTRYGMVS-NP--FVTTN-----GLYNGTPDGEALTPNA 455 (468) Q Consensus 425 ~tRY~l~~-nP--~~~~~-----~~~~~~~~~~~~~~~a 455 (468) ..|++..+ +| |.... ........ ..| T Consensus 364 ~~r~d~~~~~~~a~~~~~~~~~~~~~~~~~~-----~~~ 397 (397) T protein:vir:49 364 IDRFDVVSTDTEAFVPASFKAIADQKAKLST-----AGA 397 (397) T ss_pred EEeeccEEecccceEEEEecccccccCcccc-----cCC Confidence 55665443 33 22110 00000000 011 No 34 >protein:vir:7771 Length: 330 # NCBI annotation: gp17 # Family: family:all:507 # MgeID: mge:149 # MgeName: Bxz2 # Cross-refs: genbank:acc:NP_817605;genbank:gi:29566035;genbank:GeneID:1259229 Probab=92.73 E-value=0.01 Score=31.50 Aligned_cols=294 Identities=12% Similarity=0.067 Sum_probs=124.8 Q ss_pred hhhhhhhhhcCccccccccccccccccccccccccee-hhhhHHhhhhhhhhheeeeecCCccceeeeeeeeeecCCCCc Q lcl|Aclame:pro 49 MLNEVAVNSLGAGTIAPAGSALGSANTGGLAGFDPVL-ISLVRRAMPNLMAYDVCGVQPMSGPTGLIFAMRSRYENQAGE 127 (468) Q Consensus 49 ~l~e~~~~~~~~~~~~~~~~i~~st~tg~i~~~~P~L-v~l~RRa~~~LI~~DI~GVQPmTGPTGLIFAMRsrY~~qsG~ 127 (468) |=.|. ..+... .++.+++.. .-|.+ -.++++..++.+-.+++-+.||+++.--| - +..+ +. T Consensus 1 m~~~~----------~~a~~~-~~t~~~g~~-i~~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~-p---~~~~--~~ 62 (330) T protein:vir:77 1 MAGST----------VPSTQV-ALTGDFSAF-LTPEQSQDYFAEIEKTSIVQRIARKVPMGPTGISI-P---HWTG--AV 62 (330) T ss_pred Ccccc----------cchhhc-cccCCCcce-echhHHHHHHHHHHhccchhhhcceeeccCCceEE-E---EEcC--Cc Confidence 11110 000000 111111111 11222 23556666777888899999998765321 1 1110 00 Q ss_pred ccccccCCccccccccccccccccccCccccCCCccccccccccccccccccccccchhhhhccCCCCcchhhcceEEEE Q lcl|Aclame:pro 128 EALFNEPDTGFTGGYDASQGDYAVRTGAGVGGDSEGNNPALLNDAAPGTYEVGSKMPREDLERMGEANRLFREMSFSIEK 207 (468) Q Consensus 128 EA~fnEa~t~fSg~~~~~~~~~~~~~~~~~~~~~~gt~~~~~~~a~~~~~t~~~gm~Ta~aE~lG~~~~~f~EMaFsIeK 207 (468) ++ .| . +| +..+++-..++++ T Consensus 63 ~a-------~~----------------------------------------------v--~E-----g~~~~~~~~~f~~ 82 (330) T protein:vir:77 63 SA-------SW----------------------------------------------T--GE-----AERKPITKGSFGK 82 (330) T ss_pred ce-------eE----------------------------------------------e--cC-----CCccccccceeeE Confidence 00 00 0 01 2234445556677 Q ss_pred EEEEeecccccccccHHHHHHHHHhcCCChhHHHHHHHHHHHHHHhhHHHHHH---------Hhhhhhcccccccccccc Q lcl|Aclame:pro 208 TSVTAQSRALKAEYTLELAQDLKAIHGLDAEQELANILSSEVLAEINREVVRR---------VYTVAKKGAQNNVANAGI 278 (468) Q Consensus 208 ~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAE~ELanILStEImlEINREii~~---------l~~va~~~k~~~~~~~g~ 278 (468) ++...|..+-...+|-||.+|- ..|.|++|.+-|+..|...||+-||.- |+..+... ........ T Consensus 83 i~~~~~k~~~~~~is~ell~ds----~~~~~~~i~~~l~~ai~~~~~~~~l~G~g~~~~~~g~~~~~~~~--~~~~~~~~ 156 (330) T protein:vir:77 83 QELEPVKITTIFAESAEVVRLN----PLNYLNTMRTKIAEAIALKFDAAAIHGIDKPSAFKGYLAETTKV--VSLADTNL 156 (330) T ss_pred EEEeEEEEEEeehhhHHHHhcc----hHHHHHHHHHHHHHHHHHHHHHHhhcccCCCCcccccccccccc--ceeecccc Confidence 7777777777778999999984 568999999999999999999988831 11111110 00001111 Q ss_pred ccccccccchhHHHHHHHHHHHHHHHHHHHHHhhcCCCccEEEEchhHHHHHHhhcccccccccccccccccccc----c Q lcl|Aclame:pro 279 FDLDVDSNGRWSVEKFKGLLFQVERDANAIAQETRRGKGNFLICSADVASALAMAGVLDYSSGLNGAGGPSIGEV----D 354 (468) Q Consensus 279 ~Dl~~~~~grw~~e~~k~L~~~i~~ean~i~~~T~r~~~n~~v~S~~Va~~L~~sG~~~~~~~~~~~~~~~~~~~----d 354 (468) .+..... ...+. .+......+... -...+.+||+|+....|... ..+.+ ..+.+. . T Consensus 157 ~~~~~~~-----~~~~~----~l~~~~~~~~~~--~~~~~~~vmn~~~~~~l~~l---kd~~G------~~l~~~~~~~~ 216 (330) T protein:vir:77 157 TTASGPQ-----GNAYL----AVNNALSLLVNS--GKKWTGTLLDNVTEPILNTA---VDGNG------RPLFVESTYTE 216 (330) T ss_pred ccccccc-----chhHH----HHHHHHHhhhhc--CCCccEEEEcHHHHHHHHHH---hccCC------ceeecCccccc Confidence 1111100 01111 222223333222 24455789999999998752 21110 000010 1 Q ss_pred ccCceeEEEecCceEEEEcccccccC----------CcceEEEEEecCCcc----cceeEee--ccchhhcccccCCccc Q lcl|Aclame:pro 355 DTGNLAVGTINGRIKVFVDPYAANLS----------DKHYYVIGYKGTSPY----DAGLFYC--PYVPLQMVRSIDPNTF 418 (468) Q Consensus 355 ~t~~~~~G~l~g~~~vy~D~Ya~~~~----------~~dY~~vG~KG~~~~----d~glfya--PYv~l~~~~~~dp~s~ 418 (468) ......-++|. |++|++.......+ ++.++++|-.++... ++.+.+. .|.. ....+-.-| T Consensus 217 ~~~~~~~~~l~-G~PV~~~~~~p~~~~~~~~~~~~gd~s~~~i~~~~~~~i~~~~e~~~~~~~~~~~~---~~~~~~~~f 292 (330) T protein:vir:77 217 QVGAIREGRIL-GRPTYVADNVVNGTVGNRVVGVMGDFSQVIWGQIGGLSFDVTDQATLDFGEEQGGV---WVPKLISLW 292 (330) T ss_pred cccccCCceec-ceeeEEeccccCCCCCCccEEEEEecceEEEEEecCcEEEEeecceeeeccccccc---ccccccchh Confidence 11111224554 47888885543211 112233443333222 1111110 0000 000000111 Q ss_pred ---cceeeeeeeeeee-ecC--cccccCccccccchhh Q lcl|Aclame:pro 419 ---QPKIGFKTRYGMV-SNP--FVTTNGLYNGTPDGEA 450 (468) Q Consensus 419 ---qP~~g~~tRY~l~-~nP--~~~~~~~~~~~~~~~~ 450 (468) +=.+=...|++.. .+| |+.......+.+..+. T Consensus 293 ~~~~~~~r~~~r~d~~v~~~~a~~~i~~~~~~~~~~~~ 330 (330) T protein:vir:77 293 QHNMVAVRCEAEFAFMVNDKDAFVKLTDQVAGTDPEEE 330 (330) T ss_pred hcCcEEEEEEEEeccEEecccceEEEEeccCCcCCCCC Confidence 1122233466543 345 4433333333222222 No 35 >protein:vir:3845 Length: 395 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:322 # MgeName: phi adh # Cross-refs: genbank:acc:NP_050151;swissprot:trembl:q9t1f6;genbank:gi:9633043;uniprot:Q9T1F6;genbank:GeneID:1262163 Probab=92.01 E-value=0.013 Score=30.89 Aligned_cols=332 Identities=11% Similarity=0.052 Sum_probs=127.4 Q ss_pred cchHHHHHhhhhhhCCCccchhcchhhhHHHH-------HHHhHHHH---HH---h----hhhhhhhhhhhhhcCcc--- Q lcl|Aclame:pro 2 FNAEHLQEKWSPVLNHGEAPAIGDRYKRAVTS-------VLLENQER---FL---R----EERGMLNEVAVNSLGAG--- 61 (468) Q Consensus 2 ~~~~~l~~kw~p~l~~~~~~~i~~~~~~~~~~-------~llenq~~---~~---~----~~~~~l~e~~~~~~~~~--- 61 (468) |+-++|+++|.-+.+. +.++.+.-++.... ...|..++ .+ + +.+....+. ....... T Consensus 1 M~~~eL~~~~~~~~~~--~~~l~e~~~~~~~~~~~~~~~~~~ee~~~l~~~i~~~~~~~~~~~~~~~~~-~~~~~~~~~~ 77 (395) T protein:vir:38 1 MNINQLKDAFDMAGQK--VQDLEDKRAQFAIDLGNDASSHSVDDINKLNASLKNAKMAQELAKSAYEDA-RANLNAEPVN 77 (395) T ss_pred CCHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-Hhhhhhcccc Confidence 9999999999888643 33343322221111 11111100 00 0 000000000 0000000 Q ss_pred ------------------c--ccccccccccc---ccccc---ccccceehhhhHHhhhhhhhhheeeeecCCccceeee Q lcl|Aclame:pro 62 ------------------T--IAPAGSALGSA---NTGGL---AGFDPVLISLVRRAMPNLMAYDVCGVQPMSGPTGLIF 115 (468) Q Consensus 62 ------------------~--~~~~~~i~~st---~tg~i---~~~~P~Lv~l~RRa~~~LI~~DI~GVQPmTGPTGLIF 115 (468) . .......+.++ ++|++ ..+.+. +++...+..+..+++.+.||++++|-+- T Consensus 78 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~~vP~~~~~~---ii~~~~~~~~l~~~~~~~~~~~~~~~~~ 154 (395) T protein:vir:38 78 KKPLPVKDGKPDAQAMKNQFVKDFKNLVTSGTTGTGNAGLTIPEDIQLQ---IRTLTRSFTSLESLANVENVTTSHGSRV 154 (395) T ss_pred ccccchhhhhHHHHHHHHHHHHHHHHHHhhccCccCCCceecchhHhhH---HHHHHHhhcchhhhcceeeccCCcceEE Confidence 0 00001111111 12221 122223 4444445667888899999999988542 Q ss_pred eeeeeecCCCCcccccccCCccccccccccccccccccCccccCCCccccccccccccccccccccccchhhhhccCCC- Q lcl|Aclame:pro 116 AMRSRYENQAGEEALFNEPDTGFTGGYDASQGDYAVRTGAGVGGDSEGNNPALLNDAAPGTYEVGSKMPREDLERMGEA- 194 (468) Q Consensus 116 AMRsrY~~qsG~EA~fnEa~t~fSg~~~~~~~~~~~~~~~~~~~~~~gt~~~~~~~a~~~~~t~~~gm~Ta~aE~lG~~- 194 (468) -.+ -.+..+ .+ . -.++++...+. T Consensus 155 ~~~--~~~~~~-~a-------~----------------------------------------------~v~E~~~~~~~~ 178 (395) T protein:vir:38 155 YEK--LADITP-LK-------D----------------------------------------------LDDESALIGDND 178 (395) T ss_pred EEe--eccCCc-cc-------c----------------------------------------------cccccccccccc Confidence 211 000000 00 0 00000111111 Q ss_pred CcchhhcceEEEEEEEEeecccccccccHHHHHHHHHhcCCChhHHHHHHHHHHHHHHhhHHHHHHHhhhhhcccccccc Q lcl|Aclame:pro 195 NRLFREMSFSIEKTSVTAQSRALKAEYTLELAQDLKAIHGLDAEQELANILSSEVLAEINREVVRRVYTVAKKGAQNNVA 274 (468) Q Consensus 195 ~~~f~EMaFsIeK~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAE~ELanILStEImlEINREii~~l~~va~~~k~~~~~ 274 (468) ...|.+..|+..|..+ ...+|-||.+|- +.|-++.|.+-|+..|..-||+.|+.-.= .+.. T Consensus 179 ~~~f~~v~~~~~k~~~-------~~~iS~ell~ds----~~~l~~~i~~~la~~~~~~~~~~il~g~g--------~~~~ 239 (395) T protein:vir:38 179 DPELTVVKYLIHRYAG-------ITTVTNTLLKDT----VDNIIQWLVNWAAKKDVVTRNAKILEVMG--------KAPK 239 (395) T ss_pred ccceeeEEeeeeeeEe-------ehhhHHHHHhhh----HHHHHHHHHHHHHHHHHHHHHHHHhhccc--------cccc Confidence 1235555555555544 455999999983 35678888888888888888887775211 1111 Q ss_pred ccccccccccccchhHHHHHHHHHHHHHHHHHHHHHhhcCCCccEEEEchhHHHHHHhhccccccccccccccccccccc Q lcl|Aclame:pro 275 NAGIFDLDVDSNGRWSVEKFKGLLFQVERDANAIAQETRRGKGNFLICSADVASALAMAGVLDYSSGLNGAGGPSIGEVD 354 (468) Q Consensus 275 ~~g~~Dl~~~~~grw~~e~~k~L~~~i~~ean~i~~~T~r~~~n~~v~S~~Va~~L~~sG~~~~~~~~~~~~~~~~~~~d 354 (468) ..+...+ .....+++.... .--+. ...+||+|.....|... ..+. +....+.+ T Consensus 240 ~~~~~~~----------~~i~~~~~~~l~-------~~~~~-~a~~v~n~~~~~~L~~l---kd~~------G~~l~~~~ 292 (395) T protein:vir:38 240 KPTISQF----------DNIKDLENNTLD-------PAIES-TSSFITNQSGYNILSKV---KDAD------GRYLMQPD 292 (395) T ss_pred ccccccH----------HHHHHHHHHhhh-------hhhcC-CCEEEEcHHHHHHHHHh---hccC------CceeeccC Confidence 1222211 122223222111 11112 23578999999888752 2111 11111111 Q ss_pred ccCceeEEEecCceEEEEcccc--cccCCc---------ceEEEEEecCCcccceeEeeccchhhcccccCCccccceee Q lcl|Aclame:pro 355 DTGNLAVGTINGRIKVFVDPYA--ANLSDK---------HYYVIGYKGTSPYDAGLFYCPYVPLQMVRSIDPNTFQPKIG 423 (468) Q Consensus 355 ~t~~~~~G~l~g~~~vy~D~Ya--~~~~~~---------dY~~vG~KG~~~~d~glfyaPYv~l~~~~~~dp~s~qP~~g 423 (468) -++ -..++|. |++|++.... ....+. +|++++.+... .+=+.++.. .+-..-+=.+- T Consensus 293 ~~~-~~~~~l~-G~pV~~~~~~~~~~~~~~~~i~~gd~~~~~~i~~~~~~----~i~~~~~~~------~~~~~~~~~~r 360 (395) T protein:vir:38 293 VTS-PDKYLID-GKPVIRIADKWLPDVSGSHPLYFGDLKQGITLFDRQQM----QIDTTNVGA------GSFEHDTTKLR 360 (395) T ss_pred cCC-CCcceec-cceeEEecccccCcCCCcceEEEEeccccEEEEEecce----EEEEecccc------chhhcCceEEE Confidence 111 1124554 4566553211 000010 11112211110 111111110 01122234455 Q ss_pred eeeeeeeee-cC--cccccCcc--ccccchhhhhh Q lcl|Aclame:pro 424 FKTRYGMVS-NP--FVTTNGLY--NGTPDGEALTP 453 (468) Q Consensus 424 ~~tRY~l~~-nP--~~~~~~~~--~~~~~~~~~~~ 453 (468) +..||+..+ +| |....-.. ...+....-++ T Consensus 361 ~~~r~d~~~~~~~a~~~~~~~~~~~~~~~~~~~~~ 395 (395) T protein:vir:38 361 FIDRFDVQLIDDGAFAAASFKTVANQAQGTAGTGK 395 (395) T ss_pred EEEeeccEEecccceEEEEeecccCCCCCccCCCC Confidence 566666543 24 33211000 00000001111 No 36 >protein:vir:8420 Length: 477 # NCBI annotation: gp15 # Family: family:all:21 # MgeID: mge:155 # MgeName: Omega # Cross-refs: genbank:acc:NP_818316;genbank:gi:29566752;genbank:GeneID:1260033 Probab=91.76 E-value=0.014 Score=30.69 Aligned_cols=366 Identities=13% Similarity=0.129 Sum_probs=132.2 Q ss_pred Ccc-hHHHHHhhhhh------------hCCCccchhcchhhhHHHHHHHhH---HHHH-----H--hhhhhhhhhhhhhh Q lcl|Aclame:pro 1 MFN-AEHLQEKWSPV------------LNHGEAPAIGDRYKRAVTSVLLEN---QERF-----L--REERGMLNEVAVNS 57 (468) Q Consensus 1 ~~~-~~~l~~kw~p~------------l~~~~~~~i~~~~~~~~~~~llen---q~~~-----~--~~~~~~l~e~~~~~ 57 (468) +-. .++|.++=... ...+..+ ....++.-....+.+ +.+. . +..+.+........ T Consensus 66 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 143 (477) T protein:vir:84 66 LDEQIRELESEIERSGKLEAETKTVRKATVEVNE--ALTYEKGNGQSYFRDLAMQTVGMADEPAKERLRRHMVDVESDKE 143 (477) T ss_pred HHHHHHHHHHHHHHhhcchhhhhhhccccccccc--chhhhhhHHHHHHHHHHHHHhhhhhhHHHHHHHHHHhhhhhhhh Confidence 100 00110000000 0000000 000111000011100 0000 0 00000000000000 Q ss_pred cC--cccccccccccccccccccccccceeh--hhhHHhhhhhhhhheeeeecCCccceeeeeeeeeecCCCCccccccc Q lcl|Aclame:pro 58 LG--AGTIAPAGSALGSANTGGLAGFDPVLI--SLVRRAMPNLMAYDVCGVQPMSGPTGLIFAMRSRYENQAGEEALFNE 133 (468) Q Consensus 58 ~~--~~~~~~~~~i~~st~tg~i~~~~P~Lv--~l~RRa~~~LI~~DI~GVQPmTGPTGLIFAMRsrY~~qsG~EA~fnE 133 (468) .. .-...-...+..++++|+. -.-|..+ .++...-+..+..+++++.||++.+|-+-=-|.. +|.. T Consensus 144 ~~~~~~~~~~~~~~~~~~~~gg~-lv~~~~~~~~ii~~l~~~~~i~~~~~~~~~~~~~~~~~ip~~~----~~~~----- 213 (477) T protein:vir:84 144 IRKIAKVGEEYRDLDRNGGTGGY-AVPPLWMMNRFIELARAGRTYANLCPTEPLPGGTSSINIPKIL----TGTS----- 213 (477) T ss_pred HHHHHHhhhhhccccccCCCcce-eeccchhHHHHHHHhhhcchHHHhhceeeecCCcceeEEEEEe----cCcc----- Confidence 00 0000001111112222211 1122221 2445455677778999999999988854332211 1000 Q ss_pred CCccccccccccccccccccCccccCCCccccccccccccccccccccccchhhhhccCCCCcchhhcceEEEEEEEEee Q lcl|Aclame:pro 134 PDTGFTGGYDASQGDYAVRTGAGVGGDSEGNNPALLNDAAPGTYEVGSKMPREDLERMGEANRLFREMSFSIEKTSVTAQ 213 (468) Q Consensus 134 a~t~fSg~~~~~~~~~~~~~~~~~~~~~~gt~~~~~~~a~~~~~t~~~gm~Ta~aE~lG~~~~~f~EMaFsIeK~tVtAK 213 (468) ..+ . .+++... .....++...+++.++..+| T Consensus 214 --~a~----------------------~-----------------------~~Eg~~~--~~~~~~~s~~~f~~i~~~~~ 244 (477) T protein:vir:84 214 --TAI----------------------Q-----------------------AADNAAL--TAPSAHEVDLTDGFVQANVK 244 (477) T ss_pred --eee----------------------e-----------------------eccCccc--ccccccccccceeeEEEeee Confidence 000 0 0000000 01124555566777888888 Q ss_pred cccccccccHHHHHHHHHhcCCChhHHHHHHHHHHHHHHhhHHHHHHHhhhhhccccccccc-ccccccccccc-chhHH Q lcl|Aclame:pro 214 SRALKAEYTLELAQDLKAIHGLDAEQELANILSSEVLAEINREVVRRVYTVAKKGAQNNVAN-AGIFDLDVDSN-GRWSV 291 (468) Q Consensus 214 SRaLKAEYT~ELAQDLkAiHGLDAE~ELanILStEImlEINREii~~l~~va~~~k~~~~~~-~g~~Dl~~~~~-grw~~ 291 (468) .-+-...+|-||.+|-. .|.++.|.+-|+..|..-|++.||.- .-..++..|+.+ .|+.-...... .-|. T Consensus 245 k~~~~~~iS~ell~ds~----~~l~~~i~~~l~~~~~~~~d~~~l~G---~Gt~~~p~Gi~~~~~~~~~~~~~~~~t~~- 316 (477) T protein:vir:84 245 TIAGQQGIAIQLLDQAA----VSVDEFVFRDLAADYANKLNVQVISG---TGSNNQVVGVRATAGITQVTATSAGSALE- 316 (477) T ss_pred eEEeeeHHHHHHHhccc----hhHHHHHHHHHHHHHHHHHHHHHhcc---CCCCCccceeeeccccccccccccccchh- Confidence 88888889999999843 57899999999999999999887742 101111222211 11111111110 1111 Q ss_pred HHHHHHHHHHHHHHHHHHHhhcCCCccEEEEchhHHHHHHhh----cccccccccccccccccccccccCceeEEEecCc Q lcl|Aclame:pro 292 EKFKGLLFQVERDANAIAQETRRGKGNFLICSADVASALAMA----GVLDYSSGLNGAGGPSIGEVDDTGNLAVGTINGR 367 (468) Q Consensus 292 e~~k~L~~~i~~ean~i~~~T~r~~~n~~v~S~~Va~~L~~s----G~~~~~~~~~~~~~~~~~~~d~t~~~~~G~l~g~ 367 (468) ....+.-.|....+.+ ....+-.+..+|++|...+.|... |-.-+.|......+ .......-.....|+|. + T Consensus 317 -~~~~~~~~i~~~~~~~-~~~~~~~~~~~v~~~~~~~~l~~lkd~~G~~l~~~~~~~~~~-~~~~~~~~~~~~~~~l~-G 392 (477) T protein:vir:84 317 -KHQIIYQKIADAIQRV-HTSRFLEPEVIVMHPRRWASFHAIFAGDDRPLIVPSGPGFNN-LGVLTEVASQRVVGQMH-G 392 (477) T ss_pred -hHHHHHHHHHHHHhhc-cccccCCccEEEEcHHHHHHHHHhhccCCCeeeecCcccccc-cccccccccccccchhc-c Confidence 1111111122222221 112233456788888877766552 22111111111000 00011111122346764 6 Q ss_pred eEEEEccccccc----CCcceEEEEEecCCcccceeEeeccchhhcccccCCccc--cceeeeeeeee-----eeecC-- Q lcl|Aclame:pro 368 IKVFVDPYAANL----SDKHYYVIGYKGTSPYDAGLFYCPYVPLQMVRSIDPNTF--QPKIGFKTRYG-----MVSNP-- 434 (468) Q Consensus 368 ~~vy~D~Ya~~~----~~~dY~~vG~KG~~~~d~glfyaPYv~l~~~~~~dp~s~--qP~~g~~tRY~-----l~~nP-- 434 (468) ++|+++.+.-.+ .+..-+++|--.+--. - +..+..-++|.++ .....|.+ || .+.+| T Consensus 393 ~pVv~s~~~p~~~~~~~d~~~i~~gd~~~~~i------~---~~~~~~~~~~~~~~~~~~~~~~v-~~~~~~~~~r~~~a 462 (477) T protein:vir:84 393 LPVVTDPTLPTTLGTGTDQDVIHVLRASDLAL------F---ESSVRMRALQETRAENLSVLLQV-YGYLAFTAARFPQS 462 (477) T ss_pred cceEecCcccccccccCCcceEEEEEeceEEE------E---eeceeEEeccccccccceeeeee-hhhhhhhhhccccc Confidence 799999664311 1122344444322100 0 0001111222221 12222221 22 22356 Q ss_pred cccccCccccccchhhhhhhcccce Q lcl|Aclame:pro 435 FVTTNGLYNGTPDGEALTPNANMYY 459 (468) Q Consensus 435 ~~~~~~~~~~~~~~~~~~~~an~y~ 459 (468) |+......-..|+ |. T Consensus 463 fv~~t~~~~~~~~----------~~ 477 (477) T protein:vir:84 463 VVEIGGTALTAPT----------FA 477 (477) T ss_pred eEEeecccccccc----------cC Confidence 5543322212222 22 No 37 >protein:vir:3033 Length: 272 # NCBI annotation: major capsid protein # Family: family:all:522 # MgeID: mge:61 # MgeName: PhiNIH1.1 # Cross-refs: genbank:acc:NP_438146;genbank:gi:16271809;genbank:GeneID:929235 Probab=91.26 E-value=0.017 Score=30.33 Aligned_cols=268 Identities=14% Similarity=0.088 Sum_probs=117.3 Q ss_pred eeeeecCCCCcccccccCCcccccc-ccccccccccccCccccCCCccccccccccccccccccccccchhhhhccCCCC Q lcl|Aclame:pro 117 MRSRYENQAGEEALFNEPDTGFTGG-YDASQGDYAVRTGAGVGGDSEGNNPALLNDAAPGTYEVGSKMPREDLERMGEAN 195 (468) Q Consensus 117 MRsrY~~qsG~EA~fnEa~t~fSg~-~~~~~~~~~~~~~~~~~~~~~gt~~~~~~~a~~~~~t~~~gm~Ta~aE~lG~~~ 195 (468) |= ...++.++ .+..|--...--. ........... .......+ .+ +..+++..--....++-.++ + T Consensus 1 MA-~~~T~~~~-~~iPev~s~~v~~~~~~~~~~~~~~---~~~~~~~g-~~-------G~tv~iP~~~~~~~a~~v~e-g 66 (272) T protein:vir:30 1 MA-VGTTKMAQ-MLDPEVLADMIDAEVGKAIRFAPLA---EVDTTLEG-QP-------GTTLTVPKWDYIGDAEDVAE-G 66 (272) T ss_pred CC-Cccccchh-eechHHHHHHHHHHHHHHhhhhccc---cccccccC-CC-------CCEEEEEEecCCCCcccccC-C Confidence 11 11111111 1111000000000 00000000000 00000000 00 00111110001112222322 2 Q ss_pred cchhhcceEEEEEEEEeecccccccccHHHHHHHHHhcCCChhHHHHHHHHHHHHHHhhHHHHHHHhhhhhccccccccc Q lcl|Aclame:pro 196 RLFREMSFSIEKTSVTAQSRALKAEYTLELAQDLKAIHGLDAEQELANILSSEVLAEINREVVRRVYTVAKKGAQNNVAN 275 (468) Q Consensus 196 ~~f~EMaFsIeK~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAE~ELanILStEImlEINREii~~l~~va~~~k~~~~~~ 275 (468) .+++.-..+.+..+++.|.++-.-++|=|++.+ -+-|.++++.+-|+..|+.+|+++|+..+...... ++. T Consensus 67 ~~i~~~~~~~~~~~~~~~~~~~~~~itd~~~~~----s~~d~~~~~~~~~~~~~a~~~d~~i~~~~~~a~~~-----~~~ 137 (272) T protein:vir:30 67 EAIPMTQLGFKKTTMTIKKAGKGVEITDEAILS----GYGDPVGQAAKQIVEAIDHKVDADVLDALSKSTQT-----VEA 137 (272) T ss_pred CcccccccccceEEEEeeeeeeeeeecHHHHhh----ccccHHHHHHHHHHHHHHHHHHHHHHHHhcccccc-----ccc Confidence 334444556777788888887666777666533 35799999999999999999999999876543221 111 Q ss_pred cccccccccccchhHHHHHHHHHHHHHHHHHHHHHhhcCCCccEEEEchhHHHHHHhhcccccccccccccccccccccc Q lcl|Aclame:pro 276 AGIFDLDVDSNGRWSVEKFKGLLFQVERDANAIAQETRRGKGNFLICSADVASALAMAGVLDYSSGLNGAGGPSIGEVDD 355 (468) Q Consensus 276 ~g~~Dl~~~~~grw~~e~~k~L~~~i~~ean~i~~~T~r~~~n~~v~S~~Va~~L~~sG~~~~~~~~~~~~~~~~~~~d~ 355 (468) ... .+.+-.++.++..+ -...+++||+|++++.|......++..... .+. +.-. T Consensus 138 ~~t------------~d~i~da~~~l~~~---------~~~~~~~vv~p~~~~~L~k~~~~~~~~~~~--~~~---~~~~ 191 (272) T protein:vir:30 138 TAT------------VDGVSKALDIFNDE---------DDAETVIVMNPADASTLRLDAAKEWLGATE--VGA---NRVV 191 (272) T ss_pred ccC------------HHHHHHHHHHHhcc---------CCCccEEEEcHHHHHHHHHhcccccccccc--ccc---cccc Confidence 111 11222333333322 245679999999999997765444332211 111 1111 Q ss_pred cCceeEEEecCceEEEEcccccccCCcceEEEEEecCCcccceeEeeccchhhcccccCCccccceeeeeeeeeeee-cC Q lcl|Aclame:pro 356 TGNLAVGTINGRIKVFVDPYAANLSDKHYYVIGYKGTSPYDAGLFYCPYVPLQMVRSIDPNTFQPKIGFKTRYGMVS-NP 434 (468) Q Consensus 356 t~~~~~G~l~g~~~vy~D~Ya~~~~~~dY~~vG~KG~~~~d~glfyaPYv~l~~~~~~dp~s~qP~~g~~tRY~l~~-nP 434 (468) +| .+|++. |++|+++.+. |.+=+++.-+|.- +++-..-+. ...--|+.+++-.+-..-|||+.+ || T Consensus 192 ~g--~ig~i~-G~~Vi~s~~~----p~~t~~~~~~~a~----~~~~~~~~~--ve~~r~~~~~~~~i~~~~~~~~~v~~~ 258 (272) T protein:vir:30 192 SG--VYGEVL-GVQIVRSRKC----PKGTAYMVRKGAL----RIMLKRNTM--VETDRDITKAINQIVANKHYGVYLYKA 258 (272) T ss_pred cc--cchhhc-CeeEEEcCCC----CcceEEEEcCCeE----EEEecCCce--eeeccccccceeEEEEEEEEEEEEEcC Confidence 22 246774 5799999543 3222222222211 111122222 222247888888888888888753 44 Q ss_pred --cccccCccccccchhhhhhh Q lcl|Aclame:pro 435 --FVTTNGLYNGTPDGEALTPN 454 (468) Q Consensus 435 --~~~~~~~~~~~~~~~~~~~~ 454 (468) +....-..++ +. T Consensus 259 ~~vv~~t~~~a~--------~~ 272 (272) T protein:vir:30 259 EKAVKITLKDAA--------KK 272 (272) T ss_pred CceEEEEecccc--------cC Confidence 1111111111 11 No 38 >protein:vir:9820 Length: 272 # NCBI annotation: putative major capsid/head protein # Family: family:all:522 # MgeID: mge:176 # MgeName: 315.4 # Cross-refs: genbank:acc:NP_795582;genbank:gi:28876339;genbank:GeneID:1257858 Probab=91.26 E-value=0.017 Score=30.33 Aligned_cols=268 Identities=14% Similarity=0.088 Sum_probs=117.3 Q ss_pred eeeeecCCCCcccccccCCcccccc-ccccccccccccCccccCCCccccccccccccccccccccccchhhhhccCCCC Q lcl|Aclame:pro 117 MRSRYENQAGEEALFNEPDTGFTGG-YDASQGDYAVRTGAGVGGDSEGNNPALLNDAAPGTYEVGSKMPREDLERMGEAN 195 (468) Q Consensus 117 MRsrY~~qsG~EA~fnEa~t~fSg~-~~~~~~~~~~~~~~~~~~~~~gt~~~~~~~a~~~~~t~~~gm~Ta~aE~lG~~~ 195 (468) |= ...++.++ .+..|--...--. ........... .......+ .+ +..+++..--....++-.++ + T Consensus 1 MA-~~~T~~~~-~~iPev~s~~v~~~~~~~~~~~~~~---~~~~~~~g-~~-------G~tv~iP~~~~~~~a~~v~e-g 66 (272) T protein:vir:98 1 MA-VGTTKMAQ-MLDPEVLADMIDAEVGKAIRFAPLA---EVDTTLEG-QP-------GTTLTVPKWDYIGDAEDVAE-G 66 (272) T ss_pred CC-Cccccchh-eechHHHHHHHHHHHHHHhhhhccc---cccccccC-CC-------CCEEEEEEecCCCCcccccC-C Confidence 11 11111111 1111000000000 00000000000 00000000 00 00111110001112222322 2 Q ss_pred cchhhcceEEEEEEEEeecccccccccHHHHHHHHHhcCCChhHHHHHHHHHHHHHHhhHHHHHHHhhhhhccccccccc Q lcl|Aclame:pro 196 RLFREMSFSIEKTSVTAQSRALKAEYTLELAQDLKAIHGLDAEQELANILSSEVLAEINREVVRRVYTVAKKGAQNNVAN 275 (468) Q Consensus 196 ~~f~EMaFsIeK~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAE~ELanILStEImlEINREii~~l~~va~~~k~~~~~~ 275 (468) .+++.-..+.+..+++.|.++-.-++|=|++.+ -+-|.++++.+-|+..|+.+|+++|+..+...... ++. T Consensus 67 ~~i~~~~~~~~~~~~~~~~~~~~~~itd~~~~~----s~~d~~~~~~~~~~~~~a~~~d~~i~~~~~~a~~~-----~~~ 137 (272) T protein:vir:98 67 EAIPMTQLGFKKTTMTIKKAGKGVEITDEAILS----GYGDPVGQAAKQIVEAIDHKVDADVLDALSKSTQT-----VEA 137 (272) T ss_pred CcccccccccceEEEEeeeeeeeeeecHHHHhh----ccccHHHHHHHHHHHHHHHHHHHHHHHHhcccccc-----ccc Confidence 334444556777788888887666777666533 35799999999999999999999999876543221 111 Q ss_pred cccccccccccchhHHHHHHHHHHHHHHHHHHHHHhhcCCCccEEEEchhHHHHHHhhcccccccccccccccccccccc Q lcl|Aclame:pro 276 AGIFDLDVDSNGRWSVEKFKGLLFQVERDANAIAQETRRGKGNFLICSADVASALAMAGVLDYSSGLNGAGGPSIGEVDD 355 (468) Q Consensus 276 ~g~~Dl~~~~~grw~~e~~k~L~~~i~~ean~i~~~T~r~~~n~~v~S~~Va~~L~~sG~~~~~~~~~~~~~~~~~~~d~ 355 (468) ... .+.+-.++.++..+ -...+++||+|++++.|......++..... .+. +.-. T Consensus 138 ~~t------------~d~i~da~~~l~~~---------~~~~~~~vv~p~~~~~L~k~~~~~~~~~~~--~~~---~~~~ 191 (272) T protein:vir:98 138 TAT------------VDGVSKALDIFNDE---------DDAETVIVMNPADASTLRLDAAKEWLGATE--VGA---NRVV 191 (272) T ss_pred ccC------------HHHHHHHHHHHhcc---------CCCccEEEEcHHHHHHHHHhcccccccccc--ccc---cccc Confidence 111 11222333333322 245679999999999997765444332211 111 1111 Q ss_pred cCceeEEEecCceEEEEcccccccCCcceEEEEEecCCcccceeEeeccchhhcccccCCccccceeeeeeeeeeee-cC Q lcl|Aclame:pro 356 TGNLAVGTINGRIKVFVDPYAANLSDKHYYVIGYKGTSPYDAGLFYCPYVPLQMVRSIDPNTFQPKIGFKTRYGMVS-NP 434 (468) Q Consensus 356 t~~~~~G~l~g~~~vy~D~Ya~~~~~~dY~~vG~KG~~~~d~glfyaPYv~l~~~~~~dp~s~qP~~g~~tRY~l~~-nP 434 (468) +| .+|++. |++|+++.+. |.+=+++.-+|.- +++-..-+. ...--|+.+++-.+-..-|||+.+ || T Consensus 192 ~g--~ig~i~-G~~Vi~s~~~----p~~t~~~~~~~a~----~~~~~~~~~--ve~~r~~~~~~~~i~~~~~~~~~v~~~ 258 (272) T protein:vir:98 192 SG--VYGEVL-GVQIVRSRKC----PKGTAYMVRKGAL----RIMLKRNTM--VETDRDITKAINQIVANKHYGVYLYKA 258 (272) T ss_pred cc--cchhhc-CeeEEEcCCC----CcceEEEEcCCeE----EEEecCCce--eeeccccccceeEEEEEEEEEEEEEcC Confidence 22 246774 5799999543 3222222222211 111122222 222247888888888888888753 44 Q ss_pred --cccccCccccccchhhhhhh Q lcl|Aclame:pro 435 --FVTTNGLYNGTPDGEALTPN 454 (468) Q Consensus 435 --~~~~~~~~~~~~~~~~~~~~ 454 (468) +....-..++ +. T Consensus 259 ~~vv~~t~~~a~--------~~ 272 (272) T protein:vir:98 259 EKAVKITLKDAA--------KK 272 (272) T ss_pred CceEEEEecccc--------cC Confidence 1111111111 11 No 39 >protein:vir:1433 Length: 435 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:30 # MgeName: phiE125 # Cross-refs: genbank:acc:NP_536362;genbank:gi:17975167;genbank:GeneID:929171 Probab=90.83 E-value=0.019 Score=30.04 Aligned_cols=339 Identities=15% Similarity=0.110 Sum_probs=116.7 Q ss_pred cchHHHHHhhhhhhCCCccchhcc----------hhhhHH---HHHH--HhHHHHHHhhhhhhhhhhhhhhcCcc----- Q lcl|Aclame:pro 2 FNAEHLQEKWSPVLNHGEAPAIGD----------RYKRAV---TSVL--LENQERFLREERGMLNEVAVNSLGAG----- 61 (468) Q Consensus 2 ~~~~~l~~kw~p~l~~~~~~~i~~----------~~~~~~---~~~l--lenq~~~~~~~~~~l~e~~~~~~~~~----- 61 (468) |+-++|+|+++.+++. +-++.+ .-++.+ .+.+ |++|-+.+++......+......... T Consensus 1 M~i~eL~e~r~~~~~~--~~~l~~~~~e~~~lt~ee~~~~~~l~~ei~~l~~~I~~~e~~~~~~~~~~~~~~~~~~~~~~ 78 (435) T protein:vir:14 1 MNVNELRRERAAVNQR--VQALAQIEVGGTALSVEQQAEFDQLSSKFSELTAQIERAEAAERMAAAAAVPVDPNPTAVAA 78 (435) T ss_pred CCHHHHHHHHHHHHHH--HHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccchhhhhhh Confidence 9999999999998763 111111 111110 0000 11111111111000000000000000 Q ss_pred ----cccc--------------------------------------cccccccccccccccccceehh------hhHHhh Q lcl|Aclame:pro 62 ----TIAP--------------------------------------AGSALGSANTGGLAGFDPVLIS------LVRRAM 93 (468) Q Consensus 62 ----~~~~--------------------------------------~~~i~~st~tg~i~~~~P~Lv~------l~RRa~ 93 (468) .... ....+...+++.- .....|++ ++.+.. T Consensus 79 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~t~-~~gg~~vP~~~~~~ii~~l~ 157 (435) T protein:vir:14 79 PAAAPVHAQPKALEVKGAKMARMVRALAAARGDAQLASKLAIERGFGEEVAMSLNTLSP-GAGGVLVPENLSSEVIELLR 157 (435) T ss_pred ccccccccccchhhhhHHHHHHHHHHHHhhcchhhHHHHHHHhhhhhhhhhhhcccCCc-CCCccccchhHHHHHHHHHh Confidence 0000 0000000000000 00001111 111122 Q ss_pred hhhhhhhe-eeeecCCccceeeeeeeeeecCCCCcccccccCCccccccccccccccccccCccccCCCccccccccccc Q lcl|Aclame:pro 94 PNLMAYDV-CGVQPMSGPTGLIFAMRSRYENQAGEEALFNEPDTGFTGGYDASQGDYAVRTGAGVGGDSEGNNPALLNDA 172 (468) Q Consensus 94 ~~LI~~DI-~GVQPmTGPTGLIFAMRsrY~~qsG~EA~fnEa~t~fSg~~~~~~~~~~~~~~~~~~~~~~gt~~~~~~~a 172 (468) ++.+..++ +=+-||+... +-| |.... T Consensus 158 ~~~~i~~~~~~~~~~~~~~-~~~--------------------------------------------------p~~~~-- 184 (435) T protein:vir:14 158 PKSVVRKLGARTLPLSNGN-ITI--------------------------------------------------PRLKG-- 184 (435) T ss_pred hhchhhhhcceeeecCCCc-eEE--------------------------------------------------EEEeC-- Confidence 22222222 1111111100 000 00000 Q ss_pred cccccccccccchhhhhccCCCCcchhhcceEEEEEEEEeecccccccccHHHHHHHHHhcCCChhHHHHHHHHHHHHHH Q lcl|Aclame:pro 173 APGTYEVGSKMPREDLERMGEANRLFREMSFSIEKTSVTAQSRALKAEYTLELAQDLKAIHGLDAEQELANILSSEVLAE 252 (468) Q Consensus 173 ~~~~~t~~~gm~Ta~aE~lG~~~~~f~EMaFsIeK~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAE~ELanILStEImlE 252 (468) .+.+.-.+ .+..+++..-++++++..++.-+-....|-||.+|-. .+.+.|+.|.+-|+..|... T Consensus 185 ------------~~~a~~v~-E~~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~ds~--~~~~l~~~i~~~l~~ai~~~ 249 (435) T protein:vir:14 185 ------------GAIVGYIG-ADTDIPTTQQQFDDLKLTAKKMAALVPIANDLIKYAG--VNPNVDQIVVGDLTAAIGAR 249 (435) T ss_pred ------------Ccceeeec-cCccccccccceeEEEeeeEEEEEeehhhHHHHHhhc--cCHHHHHHHHHHHHHHHHHH Confidence 00000001 1223455555667777777777777889999999932 12347778888888888877 Q ss_pred hhHHHHHHHhhhhhcccccccccccccccccc------cc-chh--HHHHHHHHHHHHHHHHHHHHHhhcCCCccEEEEc Q lcl|Aclame:pro 253 INREVVRRVYTVAKKGAQNNVANAGIFDLDVD------SN-GRW--SVEKFKGLLFQVERDANAIAQETRRGKGNFLICS 323 (468) Q Consensus 253 INREii~~l~~va~~~k~~~~~~~g~~Dl~~~------~~-grw--~~e~~k~L~~~i~~ean~i~~~T~r~~~n~~v~S 323 (468) +|+-||.- .-.+ -...|++..... .+ +-+ ....+..|+..+. +. .. ......+|++ T Consensus 250 ~d~a~l~G----~G~~----~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~---~~---~~-~~~~~~~v~n 314 (435) T protein:vir:14 250 EDKAFIRD----DGTA----NTPKGLRFWALPSNVITASDASTLQKIETDLGKVILALE---NA---DA-NLTQPGWIMA 314 (435) T ss_pred HHHHhhcc----CCCC----ccccceeecccccceeccccccchhhHHHHHHHHHHHhh---hc---cc-cccCCEEEEc Confidence 77777632 1110 112222211100 00 000 0111112211111 11 11 1233457899 Q ss_pred hhHHHHHHhhcccccccccccccccccccccccCceeEEEecCceEEEEccccccc----CC--------cceEEEEEec Q lcl|Aclame:pro 324 ADVASALAMAGVLDYSSGLNGAGGPSIGEVDDTGNLAVGTINGRIKVFVDPYAANL----SD--------KHYYVIGYKG 391 (468) Q Consensus 324 ~~Va~~L~~sG~~~~~~~~~~~~~~~~~~~d~t~~~~~G~l~g~~~vy~D~Ya~~~----~~--------~dY~~vG~KG 391 (468) |.....|... ..+ ++ ..+.. +.+ -|+|. +++|+++.+.-.+ .+ +.++++|..+ T Consensus 315 ~~~~~~L~~l---kd~---~G---~~l~~-~~~----~g~l~-G~Pv~~~~~~p~~~~~~~~~~~i~~gd~s~~~i~~~~ 379 (435) T protein:vir:14 315 PRTFRFLEGL---RDG---NG---NKVYP-ELA----NGMLK-GYPVGKTTQVPINLGETGKESEIYFTDFGDVFIGEEE 379 (435) T ss_pred HHHHHHHHHh---hcc---CC---ceecc-CCC----CCeee-cceeEeeccccccccCCCccceEEEeecccEEEEEec Confidence 9999998763 211 11 11111 111 25665 4677776543110 01 1112233333 Q ss_pred CCcccceeEeeccchhhcccccCCccc---cceeeeeeeeeeee-cC--cccccCccccccchhhhhh Q lcl|Aclame:pro 392 TSPYDAGLFYCPYVPLQMVRSIDPNTF---QPKIGFKTRYGMVS-NP--FVTTNGLYNGTPDGEALTP 453 (468) Q Consensus 392 ~~~~d~glfyaPYv~l~~~~~~dp~s~---qP~~g~~tRY~l~~-nP--~~~~~~~~~~~~~~~~~~~ 453 (468) +.. +-.+||.......+..-..| |=.+=...|++..+ +| |+ ..++-.|++ T Consensus 380 ~~~----~~~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~~~~~~a~~--------~l~~~~~~~ 435 (435) T protein:vir:14 380 TLE----IDYSKEATYKDADGHMVSAFQRDQTLIRVIAKNDFGPRHVESIA--------VLAGVAWGA 435 (435) T ss_pred ccE----EEEeccccccccccchhhhhhcChhheeeeeeeCceeecccceE--------EEecCCCCC Confidence 222 22333321111000000001 12222445555432 22 22 233344444 No 40 >protein:vir:99749 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1497 # MgeName: phiETA2 # Cross-refs: genbank:acc:YP_001004307;genbank:gi:122891761;genbank:GeneID:4712304 Probab=90.58 E-value=0.02 Score=29.89 Aligned_cols=299 Identities=10% Similarity=0.032 Sum_probs=124.2 Q ss_pred HHHHHhHHHHHHhhhhhhhhhhhhhhcCcccccccccccccccccccccccceehhhhHHhhhhhhhhheeeeecCCccc Q lcl|Aclame:pro 32 TSVLLENQERFLREERGMLNEVAVNSLGAGTIAPAGSALGSANTGGLAGFDPVLISLVRRAMPNLMAYDVCGVQPMSGPT 111 (468) Q Consensus 32 ~~~llenq~~~~~~~~~~l~e~~~~~~~~~~~~~~~~i~~st~tg~i~~~~P~Lv~l~RRa~~~LI~~DI~GVQPmTGPT 111 (468) |.+- ||.+..++.....+-+ .+++..++. .++.+++..--....-.+++.+..+.+-.+++.+.||++.+ T Consensus 1 ~~k~-~~~~~~~~~~~~~~~~--~~~~~a~~~-------~~~~~~~~lip~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~ 70 (324) T protein:vir:99 1 MEQT-QKLKLNLQHFASNNVK--PQVFNPDNV-------MMHEKKDGTLLNDFTTPILQEVMENSKIMRLGKYEPMEGTE 70 (324) T ss_pred CCCc-hHhhHHHHHHHHHhhh--hhhccccce-------eccCCCcceechhHHHHHHHHHHhhchhhhhcceeeccCCc Confidence 1111 2222223332222221 123333221 11111111000111122334444556678889999988765 Q ss_pred eeeeeeeeeecCCCCcccccccCCccccccccccccccccccCccccCCCccccccccccccccccccccccchhhhhcc Q lcl|Aclame:pro 112 GLIFAMRSRYENQAGEEALFNEPDTGFTGGYDASQGDYAVRTGAGVGGDSEGNNPALLNDAAPGTYEVGSKMPREDLERM 191 (468) Q Consensus 112 GLIFAMRsrY~~qsG~EA~fnEa~t~fSg~~~~~~~~~~~~~~~~~~~~~~gt~~~~~~~a~~~~~t~~~gm~Ta~aE~l 191 (468) .-|. . +.. +.+ ..| . +| T Consensus 71 ~~~p-~---~~~--~~~-------a~~----------------------------------------------v--~E-- 87 (324) T protein:vir:99 71 KKFT-F---WAD--KPG-------AYW----------------------------------------------V--GE-- 87 (324) T ss_pred eEEE-E---Eec--Ccc-------eeE----------------------------------------------e--cc-- Confidence 3221 1 100 000 000 0 01 Q ss_pred CCCCcchhhcceEEEEEEEEeecccccccccHHHHHHHHHhcCCChhHHHHHHHHHHHHHHhhHHHHHHHhhhhhccccc Q lcl|Aclame:pro 192 GEANRLFREMSFSIEKTSVTAQSRALKAEYTLELAQDLKAIHGLDAEQELANILSSEVLAEINREVVRRVYTVAKKGAQN 271 (468) Q Consensus 192 G~~~~~f~EMaFsIeK~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAE~ELanILStEImlEINREii~~l~~va~~~k~~ 271 (468) +..+++...++++++...|.-+--...|-||.+|-. .|.+++|.+.|+..|...+++.||.---+ T Consensus 88 ---g~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~----~~l~~~i~~~l~~ai~~~~d~~~l~G~g~-------- 152 (324) T protein:vir:99 88 ---GQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTY----SQFFEEMKPMIAEAFYKKFDEAGILNQGN-------- 152 (324) T ss_pred ---CccccccccceeEEEEeeEEEEEeehhhHHHHhcch----HHHHHHHHHHHHHHHHHHHHHHhhhcCCC-------- Confidence 223455556667777777777777789999999974 46899999999999999999998843111 Q ss_pred ccccccccccccc----ccchhHHHHHHHHHHHHHHHHHHHHHhhcCCCccEEEEchhHHHHHHhhcccccccccccccc Q lcl|Aclame:pro 272 NVANAGIFDLDVD----SNGRWSVEKFKGLLFQVERDANAIAQETRRGKGNFLICSADVASALAMAGVLDYSSGLNGAGG 347 (468) Q Consensus 272 ~~~~~g~~Dl~~~----~~grw~~e~~k~L~~~i~~ean~i~~~T~r~~~n~~v~S~~Va~~L~~sG~~~~~~~~~~~~~ 347 (468) +....|+...... ..+.-..+....+ ...+ ...-...+.+|++|.....|.... .+ ++ T Consensus 153 ~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~-------~~~l--~~~~~~~~~~v~n~~~~~~L~~l~---d~---~g--- 214 (324) T protein:vir:99 153 NPFGKSIAQSIEKTNKVIKGDFTQDNIIDL-------EALL--EDDELEANAFISKTQNRSLLRKIV---DP---ET--- 214 (324) T ss_pred CccCccccccccccceeccccCCHHHHHHH-------HHhh--hhccCCCCEEEEcHHHHHHHHHhh---cC---CC--- Confidence 1111111110000 0011112222222 2222 222345567899999999998631 11 11 Q ss_pred cccccccccCceeEEEecCceEEEEcccccccCCcceEEEEEecCCcccceeEeeccchhhc--------ccccCCc--- Q lcl|Aclame:pro 348 PSIGEVDDTGNLAVGTINGRIKVFVDPYAANLSDKHYYVIGYKGTSPYDAGLFYCPYVPLQM--------VRSIDPN--- 416 (468) Q Consensus 348 ~~~~~~d~t~~~~~G~l~g~~~vy~D~Ya~~~~~~dY~~vG~KG~~~~d~glfyaPYv~l~~--------~~~~dp~--- 416 (468) ..++. +.. .++|. +++|++.+.+. .+...+++|-... +++..--...+ ....|+. T Consensus 215 ~~~~~-~~~----~~~l~-G~PVv~~~~~~--~~~~~~i~gd~~~------~~~~~~~~~~i~~~~~~~~~~~~~~~~~~ 280 (324) T protein:vir:99 215 KERIY-DRN----SDTLD-GLPVVNLKSSN--LKRGELITGDFDK------LIYGIPQLIEYKIDETAQLSTVKNEDGTP 280 (324) T ss_pred ceeec-CCC----Ccccc-ceeEEeecCCC--CCcceEEEEeccc------EEEEEecCcEEEEeecccccccccccccc Confidence 11111 111 14454 46777765432 1222333332211 11111111111 1111111 Q ss_pred -----cccceeeeeeeeeee-ecC--cccccCcccc--ccchhh Q lcl|Aclame:pro 417 -----TFQPKIGFKTRYGMV-SNP--FVTTNGLYNG--TPDGEA 450 (468) Q Consensus 417 -----s~qP~~g~~tRY~l~-~nP--~~~~~~~~~~--~~~~~~ 450 (468) +-+=.+=...|++.. .|| |+.......+ .+.++= T Consensus 281 ~~~f~~~~~~~r~~~r~d~~v~~~~a~~~lt~a~~~~~~~~~~~ 324 (324) T protein:vir:99 281 VNLFEQDMVALRATMHVALHIADDKAFAKLVPADKKTDSVPGEV 324 (324) T ss_pred hhhhhcCcEEEEEEEEEccEEecccceEEEEeccCCCCCCCCCC Confidence 112222334567643 444 4432211111 111111 No 41 >protein:vir:94142 Length: 304 # NCBI annotation: ORF013 # Family: family:all:507 # MgeID: mge:1494 # MgeName: 96 # Cross-refs: genbank:acc:YP_240234;genbank:gi:66395898;genbank:GeneID:5133311 Probab=90.40 E-value=0.021 Score=29.77 Aligned_cols=274 Identities=10% Similarity=0.047 Sum_probs=118.8 Q ss_pred cccccccccc---cccccccccccee-hhhhHHhhhhhhhhheeeeecCCccceeeeeeeeeecCCCCcccccccCCccc Q lcl|Aclame:pro 63 IAPAGSALGS---ANTGGLAGFDPVL-ISLVRRAMPNLMAYDVCGVQPMSGPTGLIFAMRSRYENQAGEEALFNEPDTGF 138 (468) Q Consensus 63 ~~~~~~i~~s---t~tg~i~~~~P~L-v~l~RRa~~~LI~~DI~GVQPmTGPTGLIFAMRsrY~~qsG~EA~fnEa~t~f 138 (468) +..+..-+.. |++|+. ..-+.+ -.++++..++.+..+++-+=||++.+--|. ++.+ +.++ .| T Consensus 1 ma~~~~~~~~~~~t~~gg~-lip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~ip----~~~~--~~~a-------~~ 66 (304) T protein:vir:94 1 MATPTYTPGNVILSDFKNG-VIPAEQGTLIMKDIMANSAIMKLAKNEPMTAQKKKFT----YLAK--GVGA-------YW 66 (304) T ss_pred CcccccccccccccCCCce-ecchhHHHHHHHHHHhccchhhhcceeeccCCceEEE----EEeC--Ccce-------EE Confidence 1111111111 112221 122222 235555666777788888888876542211 1110 0000 00 Q ss_pred cccccccccccccccCccccCCCccccccccccccccccccccccchhhhhccCCCCcchhhcceEEEEEEEEeeccccc Q lcl|Aclame:pro 139 TGGYDASQGDYAVRTGAGVGGDSEGNNPALLNDAAPGTYEVGSKMPREDLERMGEANRLFREMSFSIEKTSVTAQSRALK 218 (468) Q Consensus 139 Sg~~~~~~~~~~~~~~~~~~~~~~gt~~~~~~~a~~~~~t~~~gm~Ta~aE~lG~~~~~f~EMaFsIeK~tVtAKSRaLK 218 (468) . +| +.++++-.-+++++++..|..+-. T Consensus 67 ----------------------------------------------v--~E-----~~~~~~~~~~~~~i~~~~~k~~~~ 93 (304) T protein:vir:94 67 ----------------------------------------------V--SE-----TERIQTSKPEYAQAEMEAKKIGVI 93 (304) T ss_pred ----------------------------------------------e--ec-----CcccccccceeeEEEEEEEEEEEe Confidence 0 01 123444455566677777777777 Q ss_pred ccccHHHHHHHHHhcCCChhHHHHHHHHHHHHHHhhHHHHHHHhhhhhccccccccccccccccc-----cccchhHHHH Q lcl|Aclame:pro 219 AEYTLELAQDLKAIHGLDAEQELANILSSEVLAEINREVVRRVYTVAKKGAQNNVANAGIFDLDV-----DSNGRWSVEK 293 (468) Q Consensus 219 AEYT~ELAQDLkAiHGLDAE~ELanILStEImlEINREii~~l~~va~~~k~~~~~~~g~~Dl~~-----~~~grw~~e~ 293 (468) ..+|-||.+|- .+|.|+.|.+-|...|...||+.+|.---+ .+..+....+.+.-.. ..++....+. T Consensus 94 ~~iS~ell~ds----~~~l~~~i~~~l~~~ia~~~d~~~l~G~g~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 165 (304) T protein:vir:94 94 IPLSKEFLKWT----AKDFFNEVKPLIAEAFYKAFDQAVIFGTKS----PYNTSTSGKPLVEGAEEKGNVVTDTNNLYVD 165 (304) T ss_pred ehhhHHHHhcc----hHHHHHHHHHHHHHHHHHHHHhhheeccCC----CcccccccccccccccccccccccccchHHH Confidence 88999999875 367888999999999999998888753111 1111111111110000 0011111222 Q ss_pred HHHHHHHHHHHHHHHHHhhcCCCccEEEEchhHHHHHHhhcccccccccccccccccccccccCceeEEEecCceEEEEc Q lcl|Aclame:pro 294 FKGLLFQVERDANAIAQETRRGKGNFLICSADVASALAMAGVLDYSSGLNGAGGPSIGEVDDTGNLAVGTINGRIKVFVD 373 (468) Q Consensus 294 ~k~L~~~i~~ean~i~~~T~r~~~n~~v~S~~Va~~L~~sG~~~~~~~~~~~~~~~~~~~d~t~~~~~G~l~g~~~vy~D 373 (468) |......+... -....-+||+|.....|... ..+ + +....+- ..|+|. +++||++ T Consensus 166 -------i~~~~~~l~~~--~~~~~~~v~~~~~~~~L~~l---kd~-----~-G~~l~~~------~~~~l~-G~PV~~~ 220 (304) T protein:vir:94 166 -------LSALMATIEDE--ELDPNGVLTTRSFRSKMRNA---LDA-----N-DRPLFDA------NGNEIM-GLPLSYT 220 (304) T ss_pred -------HHHHHHHhhhc--cCCcCEEEEcHHHHHHHHHh---hcc-----C-CcEeecC------CCcccc-ceeeEEe Confidence 22222333221 23444688999999999752 111 1 1111111 024554 5788877 Q ss_pred ccccccCC--------cceEEEEEecCCcccceeEeeccchhh--cccccCCcc-----cc---ceeeeeeeeeeee-cC Q lcl|Aclame:pro 374 PYAANLSD--------KHYYVIGYKGTSPYDAGLFYCPYVPLQ--MVRSIDPNT-----FQ---PKIGFKTRYGMVS-NP 434 (468) Q Consensus 374 ~Ya~~~~~--------~dY~~vG~KG~~~~d~glfyaPYv~l~--~~~~~dp~s-----~q---P~~g~~tRY~l~~-nP 434 (468) .+.....+ +.++++|..++...+ ...+.. +....|++. || =.+=+..||++.+ || T Consensus 221 ~~~~~~~~~~~~~~gd~~~~~~~~~~~~~i~------~~~e~~~~~~~~~~~~g~~~~~f~~~~~~~r~~~r~~~~v~~~ 294 (304) T protein:vir:94 221 GADVYDKKKSLALMGDWDYARYGILQGIEYA------ISEDATLTTLQASDASGQPVSLFERDMFALRATMHIAYMNVKP 294 (304) T ss_pred cccccCCCCcEEEEEehhhEEEEEecceEEE------EeecceeeeecccccCccchhhhhcCcEEEEEEEEeccEeecc Confidence 55432221 122334443332211 000111 111112221 22 2233345776543 33 Q ss_pred --cccccCcc Q lcl|Aclame:pro 435 --FVTTNGLY 442 (468) Q Consensus 435 --~~~~~~~~ 442 (468) |+...... T Consensus 295 ~a~~~l~~a~ 304 (304) T protein:vir:94 295 EAFATLKPTE 304 (304) T ss_pred cceEEEEecC Confidence 33211111 No 42 >protein:vir:105905 Length: 304 # NCBI annotation: major capsid protein # Family: family:all:507 # MgeID: mge:1514 # MgeName: phiETA3 # Cross-refs: genbank:acc:YP_001004375;genbank:gi:122891830;genbank:GeneID:4712376 Probab=90.40 E-value=0.021 Score=29.77 Aligned_cols=274 Identities=10% Similarity=0.047 Sum_probs=118.8 Q ss_pred cccccccccc---cccccccccccee-hhhhHHhhhhhhhhheeeeecCCccceeeeeeeeeecCCCCcccccccCCccc Q lcl|Aclame:pro 63 IAPAGSALGS---ANTGGLAGFDPVL-ISLVRRAMPNLMAYDVCGVQPMSGPTGLIFAMRSRYENQAGEEALFNEPDTGF 138 (468) Q Consensus 63 ~~~~~~i~~s---t~tg~i~~~~P~L-v~l~RRa~~~LI~~DI~GVQPmTGPTGLIFAMRsrY~~qsG~EA~fnEa~t~f 138 (468) +..+..-+.. |++|+. ..-+.+ -.++++..++.+..+++-+=||++.+--|. ++.+ +.++ .| T Consensus 1 ma~~~~~~~~~~~t~~gg~-lip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~ip----~~~~--~~~a-------~~ 66 (304) T protein:vir:10 1 MATPTYTPGNVILSDFKNG-VIPAEQGTLIMKDIMANSAIMKLAKNEPMTAQKKKFT----YLAK--GVGA-------YW 66 (304) T ss_pred CcccccccccccccCCCce-ecchhHHHHHHHHHHhccchhhhcceeeccCCceEEE----EEeC--Ccce-------EE Confidence 1111111111 112221 122222 235555666777788888888876542211 1110 0000 00 Q ss_pred cccccccccccccccCccccCCCccccccccccccccccccccccchhhhhccCCCCcchhhcceEEEEEEEEeeccccc Q lcl|Aclame:pro 139 TGGYDASQGDYAVRTGAGVGGDSEGNNPALLNDAAPGTYEVGSKMPREDLERMGEANRLFREMSFSIEKTSVTAQSRALK 218 (468) Q Consensus 139 Sg~~~~~~~~~~~~~~~~~~~~~~gt~~~~~~~a~~~~~t~~~gm~Ta~aE~lG~~~~~f~EMaFsIeK~tVtAKSRaLK 218 (468) . +| +.++++-.-+++++++..|..+-. T Consensus 67 ----------------------------------------------v--~E-----~~~~~~~~~~~~~i~~~~~k~~~~ 93 (304) T protein:vir:10 67 ----------------------------------------------V--SE-----TERIQTSKPEYAQAEMEAKKIGVI 93 (304) T ss_pred ----------------------------------------------e--ec-----CcccccccceeeEEEEEEEEEEEe Confidence 0 01 123444455566677777777777 Q ss_pred ccccHHHHHHHHHhcCCChhHHHHHHHHHHHHHHhhHHHHHHHhhhhhccccccccccccccccc-----cccchhHHHH Q lcl|Aclame:pro 219 AEYTLELAQDLKAIHGLDAEQELANILSSEVLAEINREVVRRVYTVAKKGAQNNVANAGIFDLDV-----DSNGRWSVEK 293 (468) Q Consensus 219 AEYT~ELAQDLkAiHGLDAE~ELanILStEImlEINREii~~l~~va~~~k~~~~~~~g~~Dl~~-----~~~grw~~e~ 293 (468) ..+|-||.+|- .+|.|+.|.+-|...|...||+.+|.---+ .+..+....+.+.-.. ..++....+. T Consensus 94 ~~iS~ell~ds----~~~l~~~i~~~l~~~ia~~~d~~~l~G~g~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 165 (304) T protein:vir:10 94 IPLSKEFLKWT----AKDFFNEVKPLIAEAFYKAFDQAVIFGTKS----PYNTSTSGKPLVEGAEEKGNVVTDTNNLYVD 165 (304) T ss_pred ehhhHHHHhcc----hHHHHHHHHHHHHHHHHHHHHhhheeccCC----CcccccccccccccccccccccccccchHHH Confidence 88999999875 367888999999999999998888753111 1111111111110000 0011111222 Q ss_pred HHHHHHHHHHHHHHHHHhhcCCCccEEEEchhHHHHHHhhcccccccccccccccccccccccCceeEEEecCceEEEEc Q lcl|Aclame:pro 294 FKGLLFQVERDANAIAQETRRGKGNFLICSADVASALAMAGVLDYSSGLNGAGGPSIGEVDDTGNLAVGTINGRIKVFVD 373 (468) Q Consensus 294 ~k~L~~~i~~ean~i~~~T~r~~~n~~v~S~~Va~~L~~sG~~~~~~~~~~~~~~~~~~~d~t~~~~~G~l~g~~~vy~D 373 (468) |......+... -....-+||+|.....|... ..+ + +....+- ..|+|. +++||++ T Consensus 166 -------i~~~~~~l~~~--~~~~~~~v~~~~~~~~L~~l---kd~-----~-G~~l~~~------~~~~l~-G~PV~~~ 220 (304) T protein:vir:10 166 -------LSALMATIEDE--ELDPNGVLTTRSFRSKMRNA---LDA-----N-DRPLFDA------NGNEIM-GLPLSYT 220 (304) T ss_pred -------HHHHHHHhhhc--cCCcCEEEEcHHHHHHHHHh---hcc-----C-CcEeecC------CCcccc-ceeeEEe Confidence 22222333221 23444688999999999752 111 1 1111111 024554 5788877 Q ss_pred ccccccCC--------cceEEEEEecCCcccceeEeeccchhh--cccccCCcc-----cc---ceeeeeeeeeeee-cC Q lcl|Aclame:pro 374 PYAANLSD--------KHYYVIGYKGTSPYDAGLFYCPYVPLQ--MVRSIDPNT-----FQ---PKIGFKTRYGMVS-NP 434 (468) Q Consensus 374 ~Ya~~~~~--------~dY~~vG~KG~~~~d~glfyaPYv~l~--~~~~~dp~s-----~q---P~~g~~tRY~l~~-nP 434 (468) .+.....+ +.++++|..++...+ ...+.. +....|++. || =.+=+..||++.+ || T Consensus 221 ~~~~~~~~~~~~~~gd~~~~~~~~~~~~~i~------~~~e~~~~~~~~~~~~g~~~~~f~~~~~~~r~~~r~~~~v~~~ 294 (304) T protein:vir:10 221 GADVYDKKKSLALMGDWDYARYGILQGIEYA------ISEDATLTTLQASDASGQPVSLFERDMFALRATMHIAYMNVKP 294 (304) T ss_pred cccccCCCCcEEEEEehhhEEEEEecceEEE------EeecceeeeecccccCccchhhhhcCcEEEEEEEEeccEeecc Confidence 55432221 122334443332211 000111 111112221 22 2233345776543 33 Q ss_pred --cccccCcc Q lcl|Aclame:pro 435 --FVTTNGLY 442 (468) Q Consensus 435 --~~~~~~~~ 442 (468) |+...... T Consensus 295 ~a~~~l~~a~ 304 (304) T protein:vir:10 295 EAFATLKPTE 304 (304) T ss_pred cceEEEEecC Confidence 33211111 No 43 >protein:vir:93742 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1475 # MgeName: 55 # Cross-refs: genbank:acc:YP_240459;genbank:gi:66396126;genbank:GeneID:5133511 Probab=89.88 E-value=0.024 Score=29.48 Aligned_cols=261 Identities=14% Similarity=0.082 Sum_probs=114.4 Q ss_pred eeeeecCCC---Cccccc---cc---CCccccccccccccccccccCccccCCCccccccccccccccccccccccchhh Q lcl|Aclame:pro 117 MRSRYENQA---GEEALF---NE---PDTGFTGGYDASQGDYAVRTGAGVGGDSEGNNPALLNDAAPGTYEVGSKMPRED 187 (468) Q Consensus 117 MRsrY~~qs---G~EA~f---nE---a~t~fSg~~~~~~~~~~~~~~~~~~~~~~gt~~~~~~~a~~~~~t~~~gm~Ta~ 187 (468) |=.....-+ -.|-+- .+ ...-|++-... .. ...+ .+ +.+.++..--.... T Consensus 1 ma~~~T~~~~~iiPev~~~~v~~~~~~~~~~~~~~~~-~~------------~l~g-~~-------G~tv~ip~~~~~g~ 59 (274) T protein:vir:93 1 MPQGITKTSNQIIPEVLAPMMQAQLEKKLRFASFAEV-DS------------TLQG-QP-------GDTLTFPAFVYSGD 59 (274) T ss_pred CCccceehhheechHHHHHHHHHHHHhhhhhcccccc-cc------------cccC-CC-------CCEEEEEeeccCCC Confidence 221111000 011110 00 00001100000 00 0000 00 00111110001112 Q ss_pred hhccCCC-CcchhhcceEEEEEEEEeecccccccccHHHHHHHHHhcCCChhHHHHHHHHHHHHHHhhHHHHHHHhhhhh Q lcl|Aclame:pro 188 LERMGEA-NRLFREMSFSIEKTSVTAQSRALKAEYTLELAQDLKAIHGLDAEQELANILSSEVLAEINREVVRRVYTVAK 266 (468) Q Consensus 188 aE~lG~~-~~~f~EMaFsIeK~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAE~ELanILStEImlEINREii~~l~~va~ 266 (468) ++.+.++ .-++.++. ....+++-|-|+-.-+++=|. .+. -+-|.-.+..+-++..+...++++++..+..... T Consensus 60 ~~~~~eg~~i~~~~it--~~~~~~~i~~~~~~~~i~D~~--~~~--~~~d~~~~~~~~~~~~~a~~~d~~~~~~~~~a~~ 133 (274) T protein:vir:93 60 AQVVAEGEKIPTDILE--TKKREAKIRKIAKGTSITDEA--LLS--GYGDPQGEQVRQHGLAHANKVDNDVLEALMGAKL 133 (274) T ss_pred cccccCCCcccccccc--cceeEEEeeeecccccccHHH--HHh--hccchHHHHHHHHHHHHHHHHHHHHHHHHhcccc Confidence 2323221 22344443 444555556665332333332 222 3578999999999999999999999987754322 Q ss_pred ccccccccccccccccccccchhHHHHHHHHHHHHHHHHHHHHHhhcCCCccEEEEchhHHHHHHhhccccccccccccc Q lcl|Aclame:pro 267 KGAQNNVANAGIFDLDVDSNGRWSVEKFKGLLFQVERDANAIAQETRRGKGNFLICSADVASALAMAGVLDYSSGLNGAG 346 (468) Q Consensus 267 ~~k~~~~~~~g~~Dl~~~~~grw~~e~~k~L~~~i~~ean~i~~~T~r~~~n~~v~S~~Va~~L~~sG~~~~~~~~~~~~ 346 (468) .. +...+ ..+.+-.++.++..+ -..+++++|+|.+++.|.......|.+.... T Consensus 134 ~~------~~~~~----------~~d~i~dA~~~l~d~---------~~~~~~ivv~p~~~~~L~k~~~~~f~~~s~~-- 186 (274) T protein:vir:93 134 TV------NADIT----------KLNGLQSAIDKFNDE---------DLEPMVLFINPLDAGKLRGDASTNFTRATEL-- 186 (274) T ss_pred cc------ccccc----------CHHHHHHHHHHhhhc---------cCCccEEEeCHHHHHHHHhhhhhcccccccc-- Confidence 11 11111 123333333333321 2467899999999999987544444332211 Q ss_pred ccccccccccCceeEEEecCceEEEEcccccccCCcceEEEEEecCCcccceeEeeccchhhcccccCCccccceeeeee Q lcl|Aclame:pro 347 GPSIGEVDDTGNLAVGTINGRIKVFVDPYAANLSDKHYYVIGYKGTSPYDAGLFYCPYVPLQMVRSIDPNTFQPKIGFKT 426 (468) Q Consensus 347 ~~~~~~~d~t~~~~~G~l~g~~~vy~D~Ya~~~~~~dY~~vG~KG~~~~d~glfyaPYv~l~~~~~~dp~s~qP~~g~~t 426 (468) +.. .-.+| .+|++. |++||+| ++-|..-.++.-+|.-. .+..+ +.....--|+.++.=.+-... T Consensus 187 g~~---~~~~G--~ig~~~-G~~Vi~s----~~~p~~t~~l~~~gai~----~~~~~--~~~vE~~Rd~~~~~d~i~~~~ 250 (274) T protein:vir:93 187 GDD---IIVKG--AFGEAL-GAIIVRT----NKLEAGTAILAKKGAVK----LILKR--DFFLEVARDASTKTTALYSDK 250 (274) T ss_pred ccc---ceeec--ccceec-CeeEEEc----CCCCcceEEEEeCCeEE----EEecC--CcccccccchhhcccEEEEEE Confidence 111 01112 357774 6899999 56664333332223211 11111 222223348999999999999 Q ss_pred eeeeee-cC--cccccCccccccchhhh Q lcl|Aclame:pro 427 RYGMVS-NP--FVTTNGLYNGTPDGEAL 451 (468) Q Consensus 427 RY~l~~-nP--~~~~~~~~~~~~~~~~~ 451 (468) +||+.+ || ..... ..++... | T Consensus 251 ~y~~~~~~~~~~v~~t-~~~~s~~---~ 274 (274) T protein:vir:93 251 HYVAYLYDESKAVKIT-KGSGSLE---M 274 (274) T ss_pred EEEEEEEcCCceEEEe-eCccccC---C Confidence 999864 44 11100 1111100 0 No 44 >protein:vir:96262 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1612 # MgeName: ROSA # Cross-refs: genbank:acc:YP_240311;genbank:gi:66395978;genbank:GeneID:5133339 Probab=89.18 E-value=0.028 Score=29.11 Aligned_cols=263 Identities=14% Similarity=0.064 Sum_probs=111.0 Q ss_pred CCc-cceeeeeeeeeecCCCCcccccc---c---CCccccccccccccccccccCccccCCCcccccccccccccccccc Q lcl|Aclame:pro 107 MSG-PTGLIFAMRSRYENQAGEEALFN---E---PDTGFTGGYDASQGDYAVRTGAGVGGDSEGNNPALLNDAAPGTYEV 179 (468) Q Consensus 107 mTG-PTGLIFAMRsrY~~qsG~EA~fn---E---a~t~fSg~~~~~~~~~~~~~~~~~~~~~~gt~~~~~~~a~~~~~t~ 179 (468) |.. .|- -.+.--.|-|-. + ..--|++-.. ... ...+ .++ .+.++ T Consensus 1 m~~~~T~--------l~d~i~Pev~~~~v~~~~~~~l~~~~~~~-~~~------------~l~g-~~G-------~tv~i 51 (274) T protein:vir:96 1 MAQGMTK--------LTNQIVPEVLAPMMQAELEKKLRFASFAE-IDN------------TLVG-QPG-------DTLTF 51 (274) T ss_pred CCcceee--------hhheechHHHHHHHHHHHHhhhhccccce-ecc------------cccC-CCC-------CEEEe Confidence 111 010 000000110000 0 0000110000 000 0000 000 01111 Q ss_pred ccccchhhhhccCCC-CcchhhcceEEEEEEEEeecccccccccHHHHHHHHHhc-CCChhHHHHHHHHHHHHHHhhHHH Q lcl|Aclame:pro 180 GSKMPREDLERMGEA-NRLFREMSFSIEKTSVTAQSRALKAEYTLELAQDLKAIH-GLDAEQELANILSSEVLAEINREV 257 (468) Q Consensus 180 ~~gm~Ta~aE~lG~~-~~~f~EMaFsIeK~tVtAKSRaLKAEYT~ELAQDLkAiH-GLDAE~ELanILStEImlEINREi 257 (468) ..--...++|.+..+ +-...++..+ +.+++.+-|+- + |.+ -|+.+.- +-|.-.|..+-++..++.++++++ T Consensus 52 P~~~~ig~a~~~~~g~~i~~~~lt~~--~~~~~i~~~~~-a-~~i---~D~~~~~~~~d~~~~~~~~~~~~~a~~vd~~i 124 (274) T protein:vir:96 52 PAFIYSGDAKVVAEGEKIPTDILETK--KREAKIRKIAK-G-TSI---SDEALLSGYGDPQGEQVRQHGLAHANKVDDDV 124 (274) T ss_pred eeecCCCccccccCCCccchhhcccc--eeEEEeeeeec-c-eee---hHHHHhhccchHHHHHHHHHHHHHHHHHHHHH Confidence 000011122222221 1223444333 33333344432 2 222 2555544 458899999999999999999999 Q ss_pred HHHHhhhhhccccccccccccccccccccchhHHHHHHHHHHHHHHHHHHHHHhhcCCCccEEEEchhHHHHHHhhcccc Q lcl|Aclame:pro 258 VRRVYTVAKKGAQNNVANAGIFDLDVDSNGRWSVEKFKGLLFQVERDANAIAQETRRGKGNFLICSADVASALAMAGVLD 337 (468) Q Consensus 258 i~~l~~va~~~k~~~~~~~g~~Dl~~~~~grw~~e~~k~L~~~i~~ean~i~~~T~r~~~n~~v~S~~Va~~L~~sG~~~ 337 (468) +..+.+.... .+...++ .+.+-.++.++..+ -..+++++++|+|++.|....... T Consensus 125 ~~~l~~a~~~------~~~~~~~----------~d~i~~A~~~lgd~---------~~~~~~ivv~p~~~~~L~k~~~~~ 179 (274) T protein:vir:96 125 LEALKSAKLT------VEADITK----------LTGLQTAIDKFNDE---------DLEPMVLFISPLDAGKLRGDATTN 179 (274) T ss_pred HHHHhccccc------ccccccC----------HHHHHHHHHHhccc---------cccccEEEeCHHHHHHHHhhcccc Confidence 9877653222 1111121 22233333444322 136789999999999998865444 Q ss_pred cccccccccccccccccccCceeEEEecCceEEEEcccccccCCcceEEEEEe-cCCcccceeEeeccchhhcccccCCc Q lcl|Aclame:pro 338 YSSGLNGAGGPSIGEVDDTGNLAVGTINGRIKVFVDPYAANLSDKHYYVIGYK-GTSPYDAGLFYCPYVPLQMVRSIDPN 416 (468) Q Consensus 338 ~~~~~~~~~~~~~~~~d~t~~~~~G~l~g~~~vy~D~Ya~~~~~~dY~~vG~K-G~~~~d~glfyaPYv~l~~~~~~dp~ 416 (468) |...... +. +.-..| .+|++. |++||+| ++.|. |-.+-++ |.-. ||.. .+...-.--||. T Consensus 180 f~~~s~~--g~---~~~~~G--~ig~~~-G~~Vi~s----~~~~~-~t~~l~~~gA~~-----~~~~-~~~~vE~~Rd~~ 240 (274) T protein:vir:96 180 FTRATEL--GD---DVIVKG--AFGEAL-GAVIVRS----NKLEA-GTAILAKKGAVK-----LITK-RDFFLETDRDPS 240 (274) T ss_pred ccccccc--cc---cceecc--ccceec-CeEEEEe----CCCCC-ceEEEEecccee-----eeec-CCcccccccccc Confidence 4432221 10 111112 357774 6899999 55553 2222222 2111 1111 111122224899 Q ss_pred cccceeeeeeeeeeee-cCcccccCccccccchhhhhhhc Q lcl|Aclame:pro 417 TFQPKIGFKTRYGMVS-NPFVTTNGLYNGTPDGEALTPNA 455 (468) Q Consensus 417 s~qP~~g~~tRY~l~~-nP~~~~~~~~~~~~~~~~~~~~a 455 (468) +++=.+-..-+||+.+ || ..--...-..|..-. T Consensus 241 ~~~d~i~~~~~y~~~~~~~------~~~v~~tk~~~~~~~ 274 (274) T protein:vir:96 241 TKTTALYSDKHYVAYLYDE------SKAVKITKGSGSLEM 274 (274) T ss_pred cccCEEEEeEEEEEEEEcC------CcEEEEEcCCccccC Confidence 9999999999998865 44 111111111222211 No 45 >protein:vir:95898 Length: 274 # NCBI annotation: ORF014 # Family: family:all:522 # MgeID: mge:1588 # MgeName: 71 # Cross-refs: genbank:acc:YP_240385;genbank:gi:66396054;genbank:GeneID:5133409 Probab=89.18 E-value=0.028 Score=29.11 Aligned_cols=263 Identities=14% Similarity=0.064 Sum_probs=111.0 Q ss_pred CCc-cceeeeeeeeeecCCCCcccccc---c---CCccccccccccccccccccCccccCCCcccccccccccccccccc Q lcl|Aclame:pro 107 MSG-PTGLIFAMRSRYENQAGEEALFN---E---PDTGFTGGYDASQGDYAVRTGAGVGGDSEGNNPALLNDAAPGTYEV 179 (468) Q Consensus 107 mTG-PTGLIFAMRsrY~~qsG~EA~fn---E---a~t~fSg~~~~~~~~~~~~~~~~~~~~~~gt~~~~~~~a~~~~~t~ 179 (468) |.. .|- -.+.--.|-|-. + ..--|++-.. ... ...+ .++ .+.++ T Consensus 1 m~~~~T~--------l~d~i~Pev~~~~v~~~~~~~l~~~~~~~-~~~------------~l~g-~~G-------~tv~i 51 (274) T protein:vir:95 1 MAQGMTK--------LTNQIVPEVLAPMMQAELEKKLRFASFAE-IDN------------TLVG-QPG-------DTLTF 51 (274) T ss_pred CCcceee--------hhheechHHHHHHHHHHHHhhhhccccce-ecc------------cccC-CCC-------CEEEe Confidence 111 010 000000110000 0 0000110000 000 0000 000 01111 Q ss_pred ccccchhhhhccCCC-CcchhhcceEEEEEEEEeecccccccccHHHHHHHHHhc-CCChhHHHHHHHHHHHHHHhhHHH Q lcl|Aclame:pro 180 GSKMPREDLERMGEA-NRLFREMSFSIEKTSVTAQSRALKAEYTLELAQDLKAIH-GLDAEQELANILSSEVLAEINREV 257 (468) Q Consensus 180 ~~gm~Ta~aE~lG~~-~~~f~EMaFsIeK~tVtAKSRaLKAEYT~ELAQDLkAiH-GLDAE~ELanILStEImlEINREi 257 (468) ..--...++|.+..+ +-...++..+ +.+++.+-|+- + |.+ -|+.+.- +-|.-.|..+-++..++.++++++ T Consensus 52 P~~~~ig~a~~~~~g~~i~~~~lt~~--~~~~~i~~~~~-a-~~i---~D~~~~~~~~d~~~~~~~~~~~~~a~~vd~~i 124 (274) T protein:vir:95 52 PAFIYSGDAKVVAEGEKIPTDILETK--KREAKIRKIAK-G-TSI---SDEALLSGYGDPQGEQVRQHGLAHANKVDDDV 124 (274) T ss_pred eeecCCCccccccCCCccchhhcccc--eeEEEeeeeec-c-eee---hHHHHhhccchHHHHHHHHHHHHHHHHHHHHH Confidence 000011122222221 1223444333 33333344432 2 222 2555544 458899999999999999999999 Q ss_pred HHHHhhhhhccccccccccccccccccccchhHHHHHHHHHHHHHHHHHHHHHhhcCCCccEEEEchhHHHHHHhhcccc Q lcl|Aclame:pro 258 VRRVYTVAKKGAQNNVANAGIFDLDVDSNGRWSVEKFKGLLFQVERDANAIAQETRRGKGNFLICSADVASALAMAGVLD 337 (468) Q Consensus 258 i~~l~~va~~~k~~~~~~~g~~Dl~~~~~grw~~e~~k~L~~~i~~ean~i~~~T~r~~~n~~v~S~~Va~~L~~sG~~~ 337 (468) +..+.+.... .+...++ .+.+-.++.++..+ -..+++++++|+|++.|....... T Consensus 125 ~~~l~~a~~~------~~~~~~~----------~d~i~~A~~~lgd~---------~~~~~~ivv~p~~~~~L~k~~~~~ 179 (274) T protein:vir:95 125 LEALKSAKLT------VEADITK----------LTGLQTAIDKFNDE---------DLEPMVLFISPLDAGKLRGDATTN 179 (274) T ss_pred HHHHhccccc------ccccccC----------HHHHHHHHHHhccc---------cccccEEEeCHHHHHHHHhhcccc Confidence 9877653222 1111121 22233333444322 136789999999999998865444 Q ss_pred cccccccccccccccccccCceeEEEecCceEEEEcccccccCCcceEEEEEe-cCCcccceeEeeccchhhcccccCCc Q lcl|Aclame:pro 338 YSSGLNGAGGPSIGEVDDTGNLAVGTINGRIKVFVDPYAANLSDKHYYVIGYK-GTSPYDAGLFYCPYVPLQMVRSIDPN 416 (468) Q Consensus 338 ~~~~~~~~~~~~~~~~d~t~~~~~G~l~g~~~vy~D~Ya~~~~~~dY~~vG~K-G~~~~d~glfyaPYv~l~~~~~~dp~ 416 (468) |...... +. +.-..| .+|++. |++||+| ++.|. |-.+-++ |.-. ||.. .+...-.--||. T Consensus 180 f~~~s~~--g~---~~~~~G--~ig~~~-G~~Vi~s----~~~~~-~t~~l~~~gA~~-----~~~~-~~~~vE~~Rd~~ 240 (274) T protein:vir:95 180 FTRATEL--GD---DVIVKG--AFGEAL-GAVIVRS----NKLEA-GTAILAKKGAVK-----LITK-RDFFLETDRDPS 240 (274) T ss_pred ccccccc--cc---cceecc--ccceec-CeEEEEe----CCCCC-ceEEEEecccee-----eeec-CCcccccccccc Confidence 4432221 10 111112 357774 6899999 55553 2222222 2111 1111 111122224899 Q ss_pred cccceeeeeeeeeeee-cCcccccCccccccchhhhhhhc Q lcl|Aclame:pro 417 TFQPKIGFKTRYGMVS-NPFVTTNGLYNGTPDGEALTPNA 455 (468) Q Consensus 417 s~qP~~g~~tRY~l~~-nP~~~~~~~~~~~~~~~~~~~~a 455 (468) +++=.+-..-+||+.+ || ..--...-..|..-. T Consensus 241 ~~~d~i~~~~~y~~~~~~~------~~~v~~tk~~~~~~~ 274 (274) T protein:vir:95 241 TKTTALYSDKHYVAYLYDE------SKAVKITKGSGSLEM 274 (274) T ss_pred cccCEEEEeEEEEEEEEcC------CcEEEEEcCCccccC Confidence 9999999999998865 44 111111111222211 No 46 >protein:vir:4339 Length: 395 # NCBI annotation: major head protein # Family: family:all:585 # MgeID: mge:93 # MgeName: D3 # Cross-refs: genbank:acc:NP_061502;genbank:gi:9635591;genbank:GeneID:1262860 Probab=89.04 E-value=0.029 Score=29.04 Aligned_cols=334 Identities=10% Similarity=0.037 Sum_probs=123.6 Q ss_pred Ccch----HHHHHhhhhhhCCCccchhcchhhhHH--HHHHHhHHHHHHhhhh-------hhhhhhhh-------hhc-- Q lcl|Aclame:pro 1 MFNA----EHLQEKWSPVLNHGEAPAIGDRYKRAV--TSVLLENQERFLREER-------GMLNEVAV-------NSL-- 58 (468) Q Consensus 1 ~~~~----~~l~~kw~p~l~~~~~~~i~~~~~~~~--~~~llenq~~~~~~~~-------~~l~e~~~-------~~~-- 58 (468) |-+. ++|++++.-+-+. +-+....-+..+ +..+.+.+++.+.+-+ ..+.+.-. ... T Consensus 1 m~~~~k~l~el~~~~~~~~~~--~~~~~e~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 78 (395) T protein:vir:43 1 MSDFEKQIGELNASLKQVGDQ--IKSQAEQVNTQIANFGEMNKETRAKVDELLTAQGELQARLSAAEQAMLANEKRDGGE 78 (395) T ss_pred ChhHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccc Confidence 5553 3444444433221 001100000000 0011122211111000 00000000 000 Q ss_pred -----------------------Ccccccc--cccccccccccccccccc-eehhhhHHhhhhhhhhheeeeecCCccce Q lcl|Aclame:pro 59 -----------------------GAGTIAP--AGSALGSANTGGLAGFDP-VLISLVRRAMPNLMAYDVCGVQPMSGPTG 112 (468) Q Consensus 59 -----------------------~~~~~~~--~~~i~~st~tg~i~~~~P-~Lv~l~RRa~~~LI~~DI~GVQPmTGPTG 112 (468) ..+..-. ...+...+.+++. -.-| ..-.++++..+..+..+++.++||.+++. T Consensus 79 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~-~vp~~~~~~ii~~~~~~~~l~~l~~~~~~~~~~~ 157 (395) T protein:vir:43 79 EAPKTAGQMVAESLKEQGVTSSLRGSHRVSMPRSAITSIDGSGGA-LVAPDRRPGVVAAPQRRLTIRDLVAPGTTESNSV 157 (395) T ss_pred chhhhHHHHHHHHHHHHHHHHHhhhhhhhhhhhhhhcccCCCCcc-ccchhhHHHHHHHHHhhhhHHhhccceecCCCce Confidence 0000000 0000001111110 1111 11234444556667788899999887653 Q ss_pred eeeeeeeeecCCCCcccccccCCccccccccccccccccccCccccCCCccccccccccccccccccccccchhhhhccC Q lcl|Aclame:pro 113 LIFAMRSRYENQAGEEALFNEPDTGFTGGYDASQGDYAVRTGAGVGGDSEGNNPALLNDAAPGTYEVGSKMPREDLERMG 192 (468) Q Consensus 113 LIFAMRsrY~~qsG~EA~fnEa~t~fSg~~~~~~~~~~~~~~~~~~~~~~gt~~~~~~~a~~~~~t~~~gm~Ta~aE~lG 192 (468) -+. | +...++. + .+ . +| T Consensus 158 ~~~--~--~~~~~~~---------------------------------------a--------~~-------v--~E--- 174 (395) T protein:vir:43 158 EYV--R--ETGFVNN---------------------------------------A--------AP-------V--SE--- 174 (395) T ss_pred EEE--E--EecCCCc---------------------------------------e--------ee-------e--cC--- Confidence 221 1 1010000 0 00 0 01 Q ss_pred CCCcchhhcceEEEEEEEEeecccccccccHHHHHHHHHhcCCChhHHHHHHHHHHHHHHhhHHHHHHHhhhhhcccccc Q lcl|Aclame:pro 193 EANRLFREMSFSIEKTSVTAQSRALKAEYTLELAQDLKAIHGLDAEQELANILSSEVLAEINREVVRRVYTVAKKGAQNN 272 (468) Q Consensus 193 ~~~~~f~EMaFsIeK~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAE~ELanILStEImlEINREii~~l~~va~~~k~~~ 272 (468) +..+++-..++++++...|.-+-...+|-||.||.- +.++.|.+-|+..+...+|+.||.- ...++. T Consensus 175 --~~~~~~~~~~~~~i~~~~~k~~~~~~is~ell~d~~-----~l~~~v~~~la~a~~~~~d~~~l~G----~g~~~~-- 241 (395) T protein:vir:43 175 --GTQKPYSDLTFELENAPVRTIAHLFKASRQILDDAS-----ALQSYIDARARYGLMLVEECQLLYG----NGTGAN-- 241 (395) T ss_pred --CccccccccceeEEEEeeeeEEEeehhhHHHHHhHH-----HHHHHHHHHHHHHHHHHHHHHHHhc----cCCCCc-- Confidence 112344445555666666666666789999999852 3678899999999999998888742 111111 Q ss_pred ccccccccccc----cccchhHHHHHHHHHHHHHHHHHHHHHhhcCCCccEEEEchhHHHHHHhhccccccccccccccc Q lcl|Aclame:pro 273 VANAGIFDLDV----DSNGRWSVEKFKGLLFQVERDANAIAQETRRGKGNFLICSADVASALAMAGVLDYSSGLNGAGGP 348 (468) Q Consensus 273 ~~~~g~~Dl~~----~~~grw~~e~~k~L~~~i~~ean~i~~~T~r~~~n~~v~S~~Va~~L~~sG~~~~~~~~~~~~~~ 348 (468) ..|++-... ....-.... ..+-.+......+ ...-+++..+|+||.....|... ..+ + +. T Consensus 242 --~~Gi~~~~~~~~~~~~~~~~~~---~~~~~i~~~~~~~--~~~~~~~~~~vmn~~~~~~l~~l---kd~-----~-G~ 305 (395) T protein:vir:43 242 --LHGIIPQAQAYAPPSGVVVTAE---QRIDRIRLAILQA--QLAEFPASGIVLNPIDWALIELN---KDA-----E-NR 305 (395) T ss_pred --cccccccccccccccccccccc---hhHHHHHHHHHhh--ccccCCCcEEEEcHHHHHHHHHh---hcc-----C-Cc Confidence 112111000 000000000 0111122222232 12234566889999998888642 111 0 11 Q ss_pred ccccccccCceeEEEecCceEEEEcccccccCCcceEEEEEecCCcccceeEeeccchhhcccccCCc---ccc-ceee- Q lcl|Aclame:pro 349 SIGEVDDTGNLAVGTINGRIKVFVDPYAANLSDKHYYVIGYKGTSPYDAGLFYCPYVPLQMVRSIDPN---TFQ-PKIG- 423 (468) Q Consensus 349 ~~~~~d~t~~~~~G~l~g~~~vy~D~Ya~~~~~~dY~~vG~KG~~~~d~glfyaPYv~l~~~~~~dp~---s~q-P~~g- 423 (468) .+. .+... ...++|. |++|+++.+... +=+++|--... |--+.-..+..-+++. .|+ -.++ T Consensus 306 ~i~-~~~~~-~~~~~l~-G~pVv~~~~~~~----~~~~~gd~~~~-------~~~~~~~~~~i~~~~~~~~~f~~~~~~~ 371 (395) T protein:vir:43 306 YII-GSPQN-GTTPTLW-RLPVVETQAITQ----DEFLTGAFSLG-------AQIFDRMDIEVLVSTENDKDFENNMVTI 371 (395) T ss_pred eec-ccccc-CCCceec-ceeeEEcCCCCC----CcEEEEeccce-------EEEEEecceEEEEeccccchhhcCcEEE Confidence 111 11111 1135665 479999866432 22333321100 0000000111111111 122 2233 Q ss_pred -eeeeeeeee-cC--cccccCccccccchhhhhhhcccceeeeeeeec Q lcl|Aclame:pro 424 -FKTRYGMVS-NP--FVTTNGLYNGTPDGEALTPNANMYYRRVQVTNL 467 (468) Q Consensus 424 -~~tRY~l~~-nP--~~~~~~~~~~~~~~~~~~~~an~y~~r~~v~~l 467 (468) +..|++..+ +| |+. +.++.= T Consensus 372 r~~~r~d~~v~~~~a~~~------------------------~~~taa 395 (395) T protein:vir:43 372 RAEERLAFAVYRPEAFVT------------------------GSLTAS 395 (395) T ss_pred EEEEeeccEEecccceEE------------------------EEeccC Confidence 345676654 23 221 111111 No 47 >protein:vir:104085 Length: 320 # NCBI annotation: gp17 # Family: family:all:507 # MgeID: mge:1656 # MgeName: Che12 # Cross-refs: genbank:acc:YP_655596;genbank:gi:109392467;genbank:GeneID:4156953 Probab=89.04 E-value=0.029 Score=29.04 Aligned_cols=295 Identities=12% Similarity=0.053 Sum_probs=116.5 Q ss_pred HhhhhhhhhhhhhhhcCcccccccccccccccccccccccceehh-hhHHhhhhhhhhheeeeecCCccceeeeeeeeee Q lcl|Aclame:pro 43 LREERGMLNEVAVNSLGAGTIAPAGSALGSANTGGLAGFDPVLIS-LVRRAMPNLMAYDVCGVQPMSGPTGLIFAMRSRY 121 (468) Q Consensus 43 ~~~~~~~l~e~~~~~~~~~~~~~~~~i~~st~tg~i~~~~P~Lv~-l~RRa~~~LI~~DI~GVQPmTGPTGLIFAMRsrY 121 (468) .++...+=.|. ......++++++- ..-|.+.. +++.+....+-.+++.+.||++.+.-|.- . T Consensus 1 ~~~~~~~~~~~------------~~~~~t~~~~~~~-~ip~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~p~----~ 63 (320) T protein:vir:10 1 MAAGTAFQVDH------------AQIAQTGDTMFKG-YLEPEQAKDYFAEAEKTSIVQQFAQKVPMGTTGQKIPH----W 63 (320) T ss_pred CCCCccCCHHH------------HHhhccccccccc-cccHHHHHHHHHHHHhccchhhhcceeeccCCceEEEE----E Confidence 11111110010 0011111111110 12222221 33334445567888899998876533221 1 Q ss_pred cCCCCcccccccCCccccccccccccccccccCccccCCCccccccccccccccccccccccchhhhhccCCCCcchhhc Q lcl|Aclame:pro 122 ENQAGEEALFNEPDTGFTGGYDASQGDYAVRTGAGVGGDSEGNNPALLNDAAPGTYEVGSKMPREDLERMGEANRLFREM 201 (468) Q Consensus 122 ~~qsG~EA~fnEa~t~fSg~~~~~~~~~~~~~~~~~~~~~~gt~~~~~~~a~~~~~t~~~gm~Ta~aE~lG~~~~~f~EM 201 (468) .+ +.++ .| . +| +..+++- T Consensus 64 ~~--~~~a-------~~----------------------------------------------v--~E-----~~~~~~~ 81 (320) T protein:vir:10 64 IG--DVSA-------QW----------------------------------------------I--GE-----GDMKPIT 81 (320) T ss_pred eC--Ccce-------EE----------------------------------------------e--cC-----Ccccccc Confidence 00 0000 00 0 01 1223444 Q ss_pred ceEEEEEEEEeecccccccccHHHHHHHHHhcCCChhHHHHHHHHHHHHHHhhHHHHHHH-------hhhhhcccccccc Q lcl|Aclame:pro 202 SFSIEKTSVTAQSRALKAEYTLELAQDLKAIHGLDAEQELANILSSEVLAEINREVVRRV-------YTVAKKGAQNNVA 274 (468) Q Consensus 202 aFsIeK~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAE~ELanILStEImlEINREii~~l-------~~va~~~k~~~~~ 274 (468) ..++++++...|..+-...+|.||.+|-. .|.|+.|.+.|...|...+|+-+|.-- ......+ ..+. T Consensus 82 ~~~f~~v~~~~~k~~~~~~is~ell~ds~----~~l~~~i~~~l~~a~a~~~d~a~l~G~g~~~~~~~~~~~~~--~~~~ 155 (320) T protein:vir:10 82 KGNMTSQNIAPHKIATIFVASAETVRANP----ANYLGTMRTKVATAFAMAFDSAALNGTDSPFPTYLAQTTKS--VSLA 155 (320) T ss_pred ccceeEEEEeeEEEEEeehhhHHHHhcCh----HHHHHHHHHHHHHHHHHHHHHHhhcccCCCCCccccccccc--ccce Confidence 44556667777777777889999999865 578888888888888888888886421 0000000 1111 Q ss_pred ccccccccccccchhHHHHHHHHHHHHHHHHHHHHHhhcCCCccEEEEchhHHHHHHhhccccccccccccccccccccc Q lcl|Aclame:pro 275 NAGIFDLDVDSNGRWSVEKFKGLLFQVERDANAIAQETRRGKGNFLICSADVASALAMAGVLDYSSGLNGAGGPSIGEVD 354 (468) Q Consensus 275 ~~g~~Dl~~~~~grw~~e~~k~L~~~i~~ean~i~~~T~r~~~n~~v~S~~Va~~L~~sG~~~~~~~~~~~~~~~~~~~d 354 (468) ..+..... .-+..+ .+ +...... ..........+||+|.....|..- ..+.+. .......... T Consensus 156 ~~~~~~~~----~~~~~~---~~---~~~~~~~--~~~~~~~~~~~v~n~~~~~~L~~l---kd~~G~--~l~~~~~~~~ 218 (320) T protein:vir:10 156 DPGGATAS----DLTAYD---AV---AVNGLSL--LVNAKKKWTHTLLDDIVEPILNGA---KDKNGR--PLFIESTYTD 218 (320) T ss_pred eccccccc----ccccHH---HH---HHHHHhh--hhcccCCCcEEEEcHHHHHHHHHh---hccCCc--eeeccccccC Confidence 11111111 111111 11 1111111 112234456889999999999752 211100 0000000001 Q ss_pred ccCceeEEEecCceEEEEcccccccC------CcceEEEEEecCCcccceeEeeccchhhcccccCCcc-----cc---c Q lcl|Aclame:pro 355 DTGNLAVGTINGRIKVFVDPYAANLS------DKHYYVIGYKGTSPYDAGLFYCPYVPLQMVRSIDPNT-----FQ---P 420 (468) Q Consensus 355 ~t~~~~~G~l~g~~~vy~D~Ya~~~~------~~dY~~vG~KG~~~~d~glfyaPYv~l~~~~~~dp~s-----~q---P 420 (468) ......-++| .+++|+++..+.... ++.++++|..++..++-+ -+.......|+.. || = T Consensus 219 ~~~~~~~~~i-~g~pv~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~i~~~------~~~~~~~~~~~~~~~~~~f~~~~~ 291 (320) T protein:vir:10 219 ENSPFRAGRI-VSRPTILSDHVADGTTVGYMGDFRNVIWGQVGGLSFDVT------DQATLNLGTPTEPNFVSLWQHNLV 291 (320) T ss_pred ccccccCcee-eeeeeEecCCCCCCceEEEEeecceEEEEEecCeEEEEe------ecceeeeccccccccchhhhcCcE Confidence 1111122333 367787775443211 112233444333222100 0000111111111 11 1 Q ss_pred eeeeeeeeeee-ecC--cccccCccccccch Q lcl|Aclame:pro 421 KIGFKTRYGMV-SNP--FVTTNGLYNGTPDG 448 (468) Q Consensus 421 ~~g~~tRY~l~-~nP--~~~~~~~~~~~~~~ 448 (468) .+=...|++.. .+| |+.... -..|++ T Consensus 292 ~~r~~~~~d~~v~~~~a~~~l~~--~~ap~~ 320 (320) T protein:vir:10 292 AVRVEAEYAFHNNDKDAFVKLTN--VVTPDA 320 (320) T ss_pred EEEEEEeeccEEecccceEEEEe--ccCCCC Confidence 12233566543 344 433321 122443 No 48 >protein:vir:96123 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1602 # MgeName: 37 # Cross-refs: genbank:acc:YP_240078;genbank:gi:66395742;genbank:GeneID:5133103 Probab=87.60 E-value=0.038 Score=28.39 Aligned_cols=256 Identities=13% Similarity=0.075 Sum_probs=111.2 Q ss_pred ecCCC---C----cccc---cccC---CccccccccccccccccccCccccCCCccccccccccccccccccccccchhh Q lcl|Aclame:pro 121 YENQA---G----EEAL---FNEP---DTGFTGGYDASQGDYAVRTGAGVGGDSEGNNPALLNDAAPGTYEVGSKMPRED 187 (468) Q Consensus 121 Y~~qs---G----~EA~---fnEa---~t~fSg~~~~~~~~~~~~~~~~~~~~~~gt~~~~~~~a~~~~~t~~~gm~Ta~ 187 (468) -++.. . .|-| ..+. .--|++-. ..... . .+. + +...++..--.+.. T Consensus 1 ma~~~T~~~d~i~Pev~s~~v~~~~~~~~~~~~~~-~~~~~--------l-~g~----~-------G~tv~ip~~~~~g~ 59 (274) T protein:vir:96 1 MAQGTTKVSNLIVPEVLAPMMQAELDKKLRFAQFA-DIDST--------L-VGQ----P-------GDTLTFPAFTYSGD 59 (274) T ss_pred CCccccchhhhhhhHHHHHHHHHHHHhhhhhcccc-ccccc--------c-cCC----C-------CCEEEEEeeccCCC Confidence 11100 0 0100 0000 00011000 00000 0 000 0 00111110001112 Q ss_pred hhccCC-CCcchhhcceEEEEEEEEeecccccccccHHHHHHHHHhcCCChhHHHHHHHHHHHHHHhhHHHHHHHhhhhh Q lcl|Aclame:pro 188 LERMGE-ANRLFREMSFSIEKTSVTAQSRALKAEYTLELAQDLKAIHGLDAEQELANILSSEVLAEINREVVRRVYTVAK 266 (468) Q Consensus 188 aE~lG~-~~~~f~EMaFsIeK~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAE~ELanILStEImlEINREii~~l~~va~ 266 (468) ++.... ..-++.++.++ ..+++.|-|+-.-+++=|. ++..+-|.-.+..+-++..++.+++++|+..|..... T Consensus 60 ~~~~~~g~~i~~~~it~~--~~~~~i~~~~~~~~i~D~~----~~~~~~d~~~~~~~~~~~~~a~~~d~~i~~~l~~a~~ 133 (274) T protein:vir:96 60 AQVIAEGEKIPVDQIGTS--KREAKVRKIGKGTELTDEA----VLSGFGDPQGEAVRQHGLAIANKVDNDVLEALKGATL 133 (274) T ss_pred ccccCCCCcCchhhcccc--eeEEEEEeeeceeeecHHH----HHhhcchHHHHHHHHHHHHHHHHHHHHHHHHHhcCCC Confidence 222221 12234444433 3344445554322333222 2334678999999999999999999999988754321 Q ss_pred ccccccccccccccccccccchhHHHHHHHHHHHHHHHHHHHHHhhcCCCccEEEEchhHHHHHHhhccccccccccccc Q lcl|Aclame:pro 267 KGAQNNVANAGIFDLDVDSNGRWSVEKFKGLLFQVERDANAIAQETRRGKGNFLICSADVASALAMAGVLDYSSGLNGAG 346 (468) Q Consensus 267 ~~k~~~~~~~g~~Dl~~~~~grw~~e~~k~L~~~i~~ean~i~~~T~r~~~n~~v~S~~Va~~L~~sG~~~~~~~~~~~~ 346 (468) .. +...+ .++.+-.++.++..+ -..+++++|+|.+++.|..-....|.+.... T Consensus 134 ~~------~~~~~----------~~d~i~dA~~~l~d~---------~~~~~~ivv~p~~~~~L~k~~~~~f~~~~~~-- 186 (274) T protein:vir:96 134 TV------EADIT----------KLDGLQTAIDKFNDE---------DLEPMVLFVNPLDAGGLRTSASDNFTRPTQL-- 186 (274) T ss_pred Cc------Ccccc----------cHHHHHHHHHHhccc---------CCCceEEEeCHHHHHHHHhcccccccccccc-- Confidence 11 11111 123333333333321 2467899999999999987654444433221 Q ss_pred ccccccccccCceeEEEecCceEEEEcccccccCCcceEEEEEecCCcccceeEeeccchhhcccccCCccccceeeeee Q lcl|Aclame:pro 347 GPSIGEVDDTGNLAVGTINGRIKVFVDPYAANLSDKHYYVIGYKGTSPYDAGLFYCPYVPLQMVRSIDPNTFQPKIGFKT 426 (468) Q Consensus 347 ~~~~~~~d~t~~~~~G~l~g~~~vy~D~Ya~~~~~~dY~~vG~KG~~~~d~glfyaPYv~l~~~~~~dp~s~qP~~g~~t 426 (468) +.. .-.+| .+|++. |++|++| ++-|..=..+-=+|.-. |+.. .+...-.-.||..++-.+-... T Consensus 187 g~~---~~~~g--~ig~~~-G~~Vi~s----~~~p~~t~~l~~~gA~~-----~~~~-~~~~vE~~Rd~~~~~d~i~~~~ 250 (274) T protein:vir:96 187 GDN---IIVKG--AFGEAL-GAVIVRS----NKLNKGEALLAKKGAVK-----LITK-RDFFLEKDRDASRKSTALYSDK 250 (274) T ss_pred ccc---ceeec--ccceec-CeeEEEc----CCCCcceEEEEeCccee-----eeec-CCcccccccchhhcccEEEEee Confidence 111 11122 257774 7899999 55553222111122211 1111 1112222348999999998889 Q ss_pred eeeeee-cC--cccc-cCcccccc Q lcl|Aclame:pro 427 RYGMVS-NP--FVTT-NGLYNGTP 446 (468) Q Consensus 427 RY~l~~-nP--~~~~-~~~~~~~~ 446 (468) +||+.+ || ..+. .+.-+... T Consensus 251 ~yg~~~~~~~~vv~~t~~~~~~~~ 274 (274) T protein:vir:96 251 HYVAYLYDESKVVKITKGAGDEVM 274 (274) T ss_pred EEEEEEEcCccEEEEEcCcccccC Confidence 999875 55 1111 11111110 No 49 >protein:vir:1638 Length: 298 # NCBI annotation: Structural protein # Family: family:all:966 # MgeID: mge:33 # MgeName: r1t # Cross-refs: genbank:acc:NP_695059;genbank:gi:23455750;genbank:GeneID:955469 Probab=86.48 E-value=0.045 Score=27.95 Aligned_cols=277 Identities=17% Similarity=0.147 Sum_probs=118.2 Q ss_pred cccccccccccccee-hhhhHHhhhhhhhhheeeeecCCccceeeeeeeeeecCCCCcccccccCCcccccccccccccc Q lcl|Aclame:pro 71 GSANTGGLAGFDPVL-ISLVRRAMPNLMAYDVCGVQPMSGPTGLIFAMRSRYENQAGEEALFNEPDTGFTGGYDASQGDY 149 (468) Q Consensus 71 ~st~tg~i~~~~P~L-v~l~RRa~~~LI~~DI~GVQPmTGPTGLIFAMRsrY~~qsG~EA~fnEa~t~fSg~~~~~~~~~ 149 (468) -.+++|.+ .-|.+ -.+++.+.+..+-.+++.+.||++...-|. .. .. +.++ .| T Consensus 1 ma~~gG~l--vp~~~~~~ii~~~~~~s~i~~l~~~~~~~~~~~~ip-~~---~~--~~~a-------~~----------- 54 (298) T protein:vir:16 1 MVLNKGTL--FDPTLVTDLISKVAGKSSIARLSAQKPIPFNGEKVF-TF---TM--DSEI-------DV----------- 54 (298) T ss_pred CcccCcce--echhHHHHHHHHHHhhhhhhhhcceeeccCCceEEE-EE---ec--Ccce-------EE----------- Confidence 12222322 11211 123444456778899999999976432221 11 00 0000 00 Q ss_pred ccccCccccCCCccccccccccccccccccccccchhhhhccCCCCcchhhcceEEEEEEEEeecccccccccHHHHHHH Q lcl|Aclame:pro 150 AVRTGAGVGGDSEGNNPALLNDAAPGTYEVGSKMPREDLERMGEANRLFREMSFSIEKTSVTAQSRALKAEYTLELAQDL 229 (468) Q Consensus 150 ~~~~~~~~~~~~~gt~~~~~~~a~~~~~t~~~gm~Ta~aE~lG~~~~~f~EMaFsIeK~tVtAKSRaLKAEYT~ELAQDL 229 (468) . +| +.++++-..++++++..+|.-+-....|-||.++- T Consensus 55 -----------------------------------v--~E-----~~~~~~~~~~f~~v~l~~~k~a~~~~iS~ell~~s 92 (298) T protein:vir:16 55 -----------------------------------V--AE-----SGKKTHGGVTLAPQTMVPIKVEYGARISDEFMYAS 92 (298) T ss_pred -----------------------------------e--cC-----CccccccccceeEEEEeeeeEEEeehhhHHHhhcC Confidence 0 01 12344445555666666666666788999998754 Q ss_pred HHhcCCChhHHHHHHHHHHHHHHhhHHHHHHHhhhhhccccccc-ccccccc---ccccc-cchhHHHHHHHHHHHHHHH Q lcl|Aclame:pro 230 KAIHGLDAEQELANILSSEVLAEINREVVRRVYTVAKKGAQNNV-ANAGIFD---LDVDS-NGRWSVEKFKGLLFQVERD 304 (468) Q Consensus 230 kAiHGLDAE~ELanILStEImlEINREii~~l~~va~~~k~~~~-~~~g~~D---l~~~~-~grw~~e~~k~L~~~i~~e 304 (468) -. -..|-+++|.+-|+..|...|+..++.-...- .+...++ ...++.. ..... ...+ .+...+... T Consensus 93 ~d-~~~~l~~~i~~~la~ai~~~~d~~~l~G~~~~--~g~~~~~~~~~~~~~~~~~~~~~~~~~~------~~~~~i~~~ 163 (298) T protein:vir:16 93 DE-EKINILQEFNDGFAKKVARGIDLMAFHGVNPR--LGTASAVIGTNHFDSKVTQKVEAPRGIA------DPNGAIENA 163 (298) T ss_pred cc-cHHHHHHHHHHHHHHHHHHHHHHHhhccccCC--CCcccccccccccccccccccccccccc------cHHHHHHHH Confidence 32 12456777888888888888877777542110 1111110 0001100 00001 0111 111122222 Q ss_pred HHHHHHhhcCCCccEEEEchhHHHHHHhhcccccccccccccccccccccccCceeEEEecCceEEEEcccccc--cCCc Q lcl|Aclame:pro 305 ANAIAQETRRGKGNFLICSADVASALAMAGVLDYSSGLNGAGGPSIGEVDDTGNLAVGTINGRIKVFVDPYAAN--LSDK 382 (468) Q Consensus 305 an~i~~~T~r~~~n~~v~S~~Va~~L~~sG~~~~~~~~~~~~~~~~~~~d~t~~~~~G~l~g~~~vy~D~Ya~~--~~~~ 382 (468) ...+. ....+..-+|++|+....|... ..+ + +..+.+-+.++.. .|+|.| ++|+++..... ..+. T Consensus 164 ~~~~~--~~~~~~~~~vmn~~~~~~l~~l---kd~-----~-G~~i~~~~~~~~~-~~~l~G-~PV~~~~~v~~~~~~~~ 230 (298) T protein:vir:16 164 VELLT--GVDADVTGIAINPSFRSALAKQ---KDL-----Q-DNALFPELKWGAT-PDTING-LPVDVNKTVSDMSLTQR 230 (298) T ss_pred HHHhh--hcCCCccEEEEcHHHHHHHHHh---hcc-----C-CCeeecCcccCCC-Cceecc-eeeEEecccccccCCCc Confidence 22221 1134555689999999988752 211 1 1111111111111 267764 68887754322 2334 Q ss_pred ceEEEEEecCCcccceeEeeccch--hhcccccCCcc-----cc-ceeee--eeeee-eeecC--cccccCccccccchh Q lcl|Aclame:pro 383 HYYVIGYKGTSPYDAGLFYCPYVP--LQMVRSIDPNT-----FQ-PKIGF--KTRYG-MVSNP--FVTTNGLYNGTPDGE 449 (468) Q Consensus 383 dY~~vG~KG~~~~d~glfyaPYv~--l~~~~~~dp~s-----~q-P~~g~--~tRY~-l~~nP--~~~~~~~~~~~~~~~ 449 (468) +.+++|-- ..++.|..--. +++.+..|+++ || =.++| ..|++ .+.+| |+.... T Consensus 231 ~~~~~GDf-----s~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~v~~ra~~r~d~~v~~~~a~~~l~~--------- 296 (298) T protein:vir:16 231 DRAIIGDF-----ANGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLGWGILDATKFARVTE--------- 296 (298) T ss_pred cEEEEeec-----cceEEEEEecCceEEEeeccCCcCcchhhhhcCcEEEEEEEEEccEeecccceEEEee--------- Confidence 45555511 01112222111 22222234432 22 11333 44666 34455 332211 Q ss_pred hhhhhcc Q lcl|Aclame:pro 450 ALTPNAN 456 (468) Q Consensus 450 ~~~~~an 456 (468) |+ T Consensus 297 -----at 298 (298) T protein:vir:16 297 -----AN 298 (298) T ss_pred -----cC Confidence 11 No 50 >protein:vir:104256 Length: 458 # NCBI annotation: major head protein precursor # Family: family:all:27070 # MgeID: mge:1504 # MgeName: T5 # Cross-refs: genbank:acc:YP_006977;genbank:gi:46401878;genbank:GeneID:2777673 Probab=86.10 E-value=0.048 Score=27.81 Aligned_cols=341 Identities=13% Similarity=0.096 Sum_probs=118.2 Q ss_pred CcchHHHHHhhhhhhCC------------CccchhcchhhhHHHHHHHhHHHHHHhhh-hhhhhhhhhhhcC--c-cccc Q lcl|Aclame:pro 1 MFNAEHLQEKWSPVLNH------------GEAPAIGDRYKRAVTSVLLENQERFLREE-RGMLNEVAVNSLG--A-GTIA 64 (468) Q Consensus 1 ~~~~~~l~~kw~p~l~~------------~~~~~i~~~~~~~~~~~llenq~~~~~~~-~~~l~e~~~~~~~--~-~~~~ 64 (468) .-..+...++...-+.. +..+...+..++ .+...++....+. +..+.+....... . .... T Consensus 81 ~~~~a~~~e~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~----~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~ 156 (458) T protein:vir:10 81 NELFAQTVEKQQETIVGLQDEIKSLLTAREGRSFVGDSVAK----ALYGTQENFEDEVEKLVLLSYVMEKGVFETEHGQR 156 (458) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhc----cchhhhhhHHHHHHHHHHHHHHHhhccchhhhhhh Confidence 00000111111100000 000111000000 0000000000000 0011110000000 0 0000 Q ss_pred cccccccccc--ccccc---cccceehhhhHHhhhhhhhhheeeeecCCccceeeeeeeeeecCCCCcccccccCCcccc Q lcl|Aclame:pro 65 PAGSALGSAN--TGGLA---GFDPVLISLVRRAMPNLMAYDVCGVQPMSGPTGLIFAMRSRYENQAGEEALFNEPDTGFT 139 (468) Q Consensus 65 ~~~~i~~st~--tg~i~---~~~P~Lv~l~RRa~~~LI~~DI~GVQPmTGPTGLIFAMRsrY~~qsG~EA~fnEa~t~fS 139 (468) .-.....+++ .|+.. .+.+.+ +.++.+..+..+++-++||+++..-++ ... .+ +...|- T Consensus 157 ~~~a~~~~~~~~~g~~~ip~~~~~~i---i~~~~~~~~l~~~~~~~~~~~~~~~~~-~~~-----~~-------~~a~~v 220 (458) T protein:vir:10 157 HLKAVNQSSSVEVSSESYETIFSQRI---IRDLQKELVVGALFEELPMSSKILTML-VEP-----DA-------GKATWV 220 (458) T ss_pred hhhhhhhcccCccccceehhhHhHHH---HHHHHhhhhHHhhcceeecCCcceEEE-Eec-----CC-------cceeec Confidence 0000011111 11111 122233 344446667889999999988653222 110 00 000000 Q ss_pred ccccccccccccccCccccCCCccccccccccccccccccccccchhhhhccCCCCcchhhcceEEEEEEEEeecccccc Q lcl|Aclame:pro 140 GGYDASQGDYAVRTGAGVGGDSEGNNPALLNDAAPGTYEVGSKMPREDLERMGEANRLFREMSFSIEKTSVTAQSRALKA 219 (468) Q Consensus 140 g~~~~~~~~~~~~~~~~~~~~~~gt~~~~~~~a~~~~~t~~~gm~Ta~aE~lG~~~~~f~EMaFsIeK~tVtAKSRaLKA 219 (468) + ++....+ +..-..-.-+++++++.++.-+-.. T Consensus 221 ~----------------------------------------------e~~~~~~-~~~~~~~~~~~~~i~~~~~k~~~~v 253 (458) T protein:vir:10 221 A----------------------------------------------ASTYGTD-TTTGEEVKGALKEIHFSTYKLAAKS 253 (458) T ss_pred c----------------------------------------------ccccccc-ccccccccccceeeEeeeeeEEeee Confidence 0 0000000 0000011122344555555555567 Q ss_pred cccHHHHHHHHHhcCCChhHHHHHHHHHHHHHHhhHHHHHHHhhhhhcccccccccc-c------cccccccccchhHHH Q lcl|Aclame:pro 220 EYTLELAQDLKAIHGLDAEQELANILSSEVLAEINREVVRRVYTVAKKGAQNNVANA-G------IFDLDVDSNGRWSVE 292 (468) Q Consensus 220 EYT~ELAQDLkAiHGLDAE~ELanILStEImlEINREii~~l~~va~~~k~~~~~~~-g------~~Dl~~~~~grw~~e 292 (468) .+|-||.+|-- .|.+++|.+-|...|..-||+.||.- .-.++..|+.+. + +.+.......-...+ T Consensus 254 ~is~ell~ds~----~~~~~~i~~~l~~~i~~~~d~~~l~G----~G~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~ 325 (458) T protein:vir:10 254 FITDETEEDAI----FSLLPLLRKRLIEAHAVSIEEAFMTG----DGSGKPKGLLTLASEDSAKVVTEAKADGSVLVTAK 325 (458) T ss_pred hhhHHHHhcch----HHHHHHHHHHHHHHHHHHHHHHhhcC----CCCCccceeeecccccccceeecccccccccccHH Confidence 88999988833 46788899999999999998888752 111222222111 1 111111111111122 Q ss_pred HHHHHHHHHHHHHHHHHHhhcCCCccEEEEchhHHHHHHhhcccccccccccccccccccccccCcee---EEEecCceE Q lcl|Aclame:pro 293 KFKGLLFQVERDANAIAQETRRGKGNFLICSADVASALAMAGVLDYSSGLNGAGGPSIGEVDDTGNLA---VGTINGRIK 369 (468) Q Consensus 293 ~~k~L~~~i~~ean~i~~~T~r~~~n~~v~S~~Va~~L~~sG~~~~~~~~~~~~~~~~~~~d~t~~~~---~G~l~g~~~ 369 (468) ....|++. +...- .+...+||+|.....|... ..+.+ ..+...+.+.... .++|+ |++ T Consensus 326 ~i~~~~~~-------l~~~~--~~~~~~v~~~~~~~~l~~l---kd~~G------~~i~~~~~~~~~~~~~~~~l~-G~p 386 (458) T protein:vir:10 326 TISKLRRK-------LGRHG--LKLSKLVLIVSMDAYYDLL---EDEEW------QDVAQVGNDSVKLQGQVGRIY-GLP 386 (458) T ss_pred HHHHHHHh-------hhhhh--cCCCEEEEcHHHHHHHHhh---cccCC------ceeeccccccccccCcCceec-cee Confidence 22223222 21111 2345679999988888642 21110 0001111111111 13565 689 Q ss_pred EEEccccccc-CCcceEEEEEecCCcccceeEeeccchhhcccccCCccccceeeee--eeeee-eecC--cccccCccc Q lcl|Aclame:pro 370 VFVDPYAANL-SDKHYYVIGYKGTSPYDAGLFYCPYVPLQMVRSIDPNTFQPKIGFK--TRYGM-VSNP--FVTTNGLYN 443 (468) Q Consensus 370 vy~D~Ya~~~-~~~dY~~vG~KG~~~~d~glfyaPYv~l~~~~~~dp~s~qP~~g~~--tRY~l-~~nP--~~~~~~~~~ 443 (468) |+++.+.-.. ...+.++..++ + +.++.. -..+....||-+-...++|. .|.|+ +.+| |+... .+ T Consensus 387 v~~~~~~p~~~~~~~~~~~~f~-~-----~~~~~~--~~~~~v~~d~~~~~~~~~~~~~~r~~~~v~~~~a~v~~~--~a 456 (458) T protein:vir:10 387 VVVSEYFPAKANSAEFAVIVYK-D-----NFVMPR--QRAVTVERERQAGKQRDAYYVTQRVNLQRYFANGVVSGT--YA 456 (458) T ss_pred eEEccccccccCCcceEEEEec-c-----cEEEEE--eeceEEEeecccCCCceEEEEEEEecceEecccceEEEe--ec Confidence 9998654221 11222222221 1 011110 11122223554445556665 45543 3445 32211 11 Q ss_pred cc Q lcl|Aclame:pro 444 GT 445 (468) Q Consensus 444 ~~ 445 (468) .. T Consensus 457 a~ 458 (458) T protein:vir:10 457 AS 458 (458) T ss_pred cC Confidence 11 No 51 >protein:vir:10364 Length: 390 # NCBI annotation: head protein; major capsid subunit precursor # Family: family:all:585 # MgeID: mge:183 # MgeName: Xp10 # Cross-refs: genbank:acc:NP_858956;genbank:gi:32128421;genbank:GeneID:2648357 Probab=85.76 E-value=0.05 Score=27.70 Aligned_cols=326 Identities=12% Similarity=0.030 Sum_probs=121.9 Q ss_pred CcchHHHHHhhhhhhCCCccchhcchhhhHHH--HHHHhHHHHHH--------------hhhhhhhhhhhhh--hcCccc Q lcl|Aclame:pro 1 MFNAEHLQEKWSPVLNHGEAPAIGDRYKRAVT--SVLLENQERFL--------------REERGMLNEVAVN--SLGAGT 62 (468) Q Consensus 1 ~~~~~~l~~kw~p~l~~~~~~~i~~~~~~~~~--~~llenq~~~~--------------~~~~~~l~e~~~~--~~~~~~ 62 (468) --.+++..+++.-+... +... ++.+- ...++..++.. ++...+..+.... ..+... T Consensus 30 ~~~~~e~~~~~~~~~~e-----~~~l-~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 103 (390) T protein:vir:10 30 GELNASARSKVDELFAT-----VGNL-SAEVQAARQRVAELEGNGAGGDVQHVSVGDLFVASEQFQASAGRWNDRSARAT 103 (390) T ss_pred cccCHHHHHHHHHHHHH-----HHHH-HHHHHHHHHHHHHHHhhcccccccccchhhhhhhhHHHHHHHHhhhhhhhhhh Confidence 11122333444333211 1110 00000 00011100000 0000000000000 000000 Q ss_pred cccccc----cccc-ccccccc--cccceehhhhHHhhhhhhhhheeeeecCCccceeeeeeeeeecCCCCcccccccCC Q lcl|Aclame:pro 63 IAPAGS----ALGS-ANTGGLA--GFDPVLISLVRRAMPNLMAYDVCGVQPMSGPTGLIFAMRSRYENQAGEEALFNEPD 135 (468) Q Consensus 63 ~~~~~~----i~~s-t~tg~i~--~~~P~Lv~l~RRa~~~LI~~DI~GVQPmTGPTGLIFAMRsrY~~qsG~EA~fnEa~ 135 (468) ...... ...+ +..|.+. ..-+.++.+. .....-.++|.+.||++++.-+.- ..+.++ ++ T Consensus 104 ~~~~~~~~~~~~~~~~~~g~~~~~~~~~~ii~~~---~~~~~l~~~~~~~~~~~~~~~~~~----~~~~~~-~a------ 169 (390) T protein:vir:10 104 MNIKAALNTASTDAAGSAGALTTPNRLPGFITQP---DARLTVRDLIGSGRTDSALIEYVQ----ETGFVN-NA------ 169 (390) T ss_pred hHHHHHHHhhhcccccccccccchhHHHHHHHHH---HhhchhhhhcceeeccCCceEEEE----EecCCc-ce------ Confidence 000000 0001 1111111 1122333333 344455678999998876532221 111000 00 Q ss_pred ccccccccccccccccccCccccCCCccccccccccccccccccccccchhhhhccCCCCcchhhcceEEEEEEEEeecc Q lcl|Aclame:pro 136 TGFTGGYDASQGDYAVRTGAGVGGDSEGNNPALLNDAAPGTYEVGSKMPREDLERMGEANRLFREMSFSIEKTSVTAQSR 215 (468) Q Consensus 136 t~fSg~~~~~~~~~~~~~~~~~~~~~~gt~~~~~~~a~~~~~t~~~gm~Ta~aE~lG~~~~~f~EMaFsIeK~tVtAKSR 215 (468) .| . +| +...++-..++++++..+|.. T Consensus 170 -~~----------------------------------------------v--~E-----g~~~~~~~~~~~~i~~~~~k~ 195 (390) T protein:vir:10 170 -AI----------------------------------------------V--AE-----GALKPESSLKFAKKTDTTHVI 195 (390) T ss_pred -ee----------------------------------------------e--cC-----CccccccccceeEEEEeeEEE Confidence 00 0 01 123444555666777777777 Q ss_pred cccccccHHHHHHHHHhcCCChhHHHHHHHHHHHHHHhhHHHHHHHhhhhhccccccccccccccccc------cccchh Q lcl|Aclame:pro 216 ALKAEYTLELAQDLKAIHGLDAEQELANILSSEVLAEINREVVRRVYTVAKKGAQNNVANAGIFDLDV------DSNGRW 289 (468) Q Consensus 216 aLKAEYT~ELAQDLkAiHGLDAE~ELanILStEImlEINREii~~l~~va~~~k~~~~~~~g~~Dl~~------~~~grw 289 (468) +....+|-||.||-- |.++.|.+-|+..|...||+.||.- .-. +-...|++.... ...+-- T Consensus 196 ~~~~~is~ell~d~~-----~l~~~i~~~l~~~~~~~~~~~il~G----~G~----~~~p~Gi~~~~~~~~~~~~~~~~~ 262 (390) T protein:vir:10 196 AHTMKATRQILSDAP-----QLASYMNNRLIRGLKVKEDAEILRG----TGA----NDGLLGLIPQATTYAAPTTIAGAT 262 (390) T ss_pred EEeehhhHHHHHhHH-----HHHHHHHHHHHHHHHHHHHHHHhhc----CCC----Cccccccccccccccccccccccc Confidence 778899999999852 5678899999999999999887732 111 111222221110 001110 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhhcCCCccEEEEchhHHHHHHhhcccccccccccccccccccccccCceeEEEecCceE Q lcl|Aclame:pro 290 SVEKFKGLLFQVERDANAIAQETRRGKGNFLICSADVASALAMAGVLDYSSGLNGAGGPSIGEVDDTGNLAVGTINGRIK 369 (468) Q Consensus 290 ~~e~~k~L~~~i~~ean~i~~~T~r~~~n~~v~S~~Va~~L~~sG~~~~~~~~~~~~~~~~~~~d~t~~~~~G~l~g~~~ 369 (468) .......++.++ ......++.+|++|.....|... ..+.+ ....+.+..+. .++| .|++ T Consensus 263 ~~~~~~~~~~~l---------~~~~~~~~~~v~n~~~~~~L~~l---kd~~g------~~l~~~~~~~~--~~~l-~G~p 321 (390) T protein:vir:10 263 RVDQLRLAMLQA---------SLAEYPASGIVINPIDWAAIELA---KDANN------QYLIGNARGTL--TPTL-WGLP 321 (390) T ss_pred hHHHHHHHHHhh---------ccccCCCCEEEEcHHHHHHHHHh---hcCCC------ceeecCCcCcC--Ccee-ccee Confidence 112222222222 12234566789999998888752 21111 01111111111 2345 3678 Q ss_pred EEEcccccccCCcceEEEEEecCCcccceeEeeccchhhcccccC---Cccccceeeeeeeeeeee-cCcccccCccccc Q lcl|Aclame:pro 370 VFVDPYAANLSDKHYYVIGYKGTSPYDAGLFYCPYVPLQMVRSID---PNTFQPKIGFKTRYGMVS-NPFVTTNGLYNGT 445 (468) Q Consensus 370 vy~D~Ya~~~~~~dY~~vG~KG~~~~d~glfyaPYv~l~~~~~~d---p~s~qP~~g~~tRY~l~~-nP~~~~~~~~~~~ 445 (468) |+++... |..-+++|-- . .+++.+...-+......+ -.+-+=.+-...|++..+ +|= T Consensus 322 v~~~~~~----p~~~~~~gdf---~--~~~~~~~~~~~~i~~~~~~~~~~~~~~~~r~~~r~d~~v~~~~---------- 382 (390) T protein:vir:10 322 VVATQAM----APGEFLVGAF---D--LAAQIFDQWDARVEIGYVNDDFQRNMVTVLAEERLALVVYRPE---------- 382 (390) T ss_pred eEEcCCC----CCCcEEEEec---c--ceEEEEEecceEEEEeecccccccCcEEEEEEEeeccEEeccc---------- Confidence 8888553 2333444421 0 111121111111111111 112222233335666543 330 Q ss_pred cchhhhhhhcccceeeeeee Q lcl|Aclame:pro 446 PDGEALTPNANMYYRRVQVT 465 (468) Q Consensus 446 ~~~~~~~~~an~y~~r~~v~ 465 (468) -|..+.++ T Consensus 383 ------------a~~~~~~a 390 (390) T protein:vir:10 383 ------------ALISGSFA 390 (390) T ss_pred ------------cEEEEEeC Confidence 11222222 No 52 >protein:vir:103955 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1662 # MgeName: phiNM # Cross-refs: genbank:acc:YP_873992;genbank:gi:118430767;genbank:GeneID:4525449 Probab=85.44 E-value=0.053 Score=27.59 Aligned_cols=299 Identities=10% Similarity=0.036 Sum_probs=125.0 Q ss_pred hcchhhhHHHHHHHhHHHHHHhhhhhhhhhhhhhhcCcccccccccccccccccccccccceehhhhHHhhhhhhhhhee Q lcl|Aclame:pro 23 IGDRYKRAVTSVLLENQERFLREERGMLNEVAVNSLGAGTIAPAGSALGSANTGGLAGFDPVLISLVRRAMPNLMAYDVC 102 (468) Q Consensus 23 i~~~~~~~~~~~llenq~~~~~~~~~~l~e~~~~~~~~~~~~~~~~i~~st~tg~i~~~~P~Lv~l~RRa~~~LI~~DI~ 102 (468) ++...+ ....++.....+-. .+++..... . ++.+++..--....-.+++.+....+..+++ T Consensus 1 ~~~~~~----------~~~~~~~f~~~~~~--~~~~~a~~~------~-~~~~~~~liP~~~~~~ii~~~~~~s~l~~~~ 61 (324) T protein:vir:10 1 MEQTQK----------LKLNLQHFASNNVK--PQVFNPDNV------M-MHEKKDGTLLNDFTTPILQEVMENSKIMQLG 61 (324) T ss_pred CCCchH----------HHHHHHHHHHHhhc--cceecccce------e-ccCCCcceechhHHHHHHHHHHhhchhhhhc Confidence 111111 11112222211111 122222211 0 1111111001111122344444566778888 Q ss_pred eeecCCccceeeeeeeeeecCCCCcccccccCCccccccccccccccccccCccccCCCccccccccccccccccccccc Q lcl|Aclame:pro 103 GVQPMSGPTGLIFAMRSRYENQAGEEALFNEPDTGFTGGYDASQGDYAVRTGAGVGGDSEGNNPALLNDAAPGTYEVGSK 182 (468) Q Consensus 103 GVQPmTGPTGLIFAMRsrY~~qsG~EA~fnEa~t~fSg~~~~~~~~~~~~~~~~~~~~~~gt~~~~~~~a~~~~~t~~~g 182 (468) -+-||++.+.-|.- ... +.++ .| T Consensus 62 ~~~~~~~~~~~~p~----~~~--~~~a-------~~-------------------------------------------- 84 (324) T protein:vir:10 62 KYEPMEGTEKKFTF----WAD--KPGA-------YW-------------------------------------------- 84 (324) T ss_pred ceeeccCCceEEEE----EeC--Ccce-------eE-------------------------------------------- Confidence 99998876533211 100 0000 00 Q ss_pred cchhhhhccCCCCcchhhcceEEEEEEEEeecccccccccHHHHHHHHHhcCCChhHHHHHHHHHHHHHHhhHHHHHHHh Q lcl|Aclame:pro 183 MPREDLERMGEANRLFREMSFSIEKTSVTAQSRALKAEYTLELAQDLKAIHGLDAEQELANILSSEVLAEINREVVRRVY 262 (468) Q Consensus 183 m~Ta~aE~lG~~~~~f~EMaFsIeK~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAE~ELanILStEImlEINREii~~l~ 262 (468) . +| +..+++...+++++++..|.-+-.-..|-||.+|-. .|.+++|.+.|+..|...+++.+|.--- T Consensus 85 --v--~E-----g~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~----~~l~~~i~~~l~~ai~~~~d~a~l~G~g 151 (324) T protein:vir:10 85 --V--GE-----GQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTY----SQFFEEMKPMIAEAFYKKFDEAGILNQG 151 (324) T ss_pred --e--cc-----CccccccccceeEEEEeeEEEEEeehhhHHHHhcch----HHHHHHHHHHHHHHHHHHHHHHhhhcCC Confidence 0 01 233555666677777778877778889999999864 4789999999999999999998875321 Q ss_pred hhhhcccccccccccccccccc----ccchhHHHHHHHHHHHHHHHHHHHHHhhcCCCccEEEEchhHHHHHHhhccccc Q lcl|Aclame:pro 263 TVAKKGAQNNVANAGIFDLDVD----SNGRWSVEKFKGLLFQVERDANAIAQETRRGKGNFLICSADVASALAMAGVLDY 338 (468) Q Consensus 263 ~va~~~k~~~~~~~g~~Dl~~~----~~grw~~e~~k~L~~~i~~ean~i~~~T~r~~~n~~v~S~~Va~~L~~sG~~~~ 338 (468) +. ....|++..... ..+--..+....++ ..+. ..-...+.+|++|.....|... .. T Consensus 152 ~~--------~~~~~i~~~~~~~~~~~~~~~t~~~i~~~~-------~~l~--~~~~~~~~~v~n~~~~~~L~~l---~d 211 (324) T protein:vir:10 152 NN--------PFGKSIAQSIEKTNKVIKGDFTQDNIIDLE-------ALLE--DDELEANAFISKTQNRSLLRKI---VD 211 (324) T ss_pred CC--------ccCccccccccccceeccccCCHHHHHHHH-------Hhhh--hccCCCCEEEEcHHHHHHHHHh---hc Confidence 11 011111111000 00100112223332 2221 2224556789999999998763 11 Q ss_pred ccccccccccccccccccCceeEEEecCceEEEEcccccccCCcceEEEEEecCCcccceeEeeccchhhc--------c Q lcl|Aclame:pro 339 SSGLNGAGGPSIGEVDDTGNLAVGTINGRIKVFVDPYAANLSDKHYYVIGYKGTSPYDAGLFYCPYVPLQM--------V 410 (468) Q Consensus 339 ~~~~~~~~~~~~~~~d~t~~~~~G~l~g~~~vy~D~Ya~~~~~~dY~~vG~KG~~~~d~glfyaPYv~l~~--------~ 410 (468) + + +..++. +..+ ++|. +++|++.+.+. .+..-+++|-.. .+++...-...+ . T Consensus 212 ~-----~-g~~~~~-~~~~----~~l~-G~PV~~~~~~~--~~~~~~~~gd~~------~~~~~~~~~~~i~~~~~~~~~ 271 (324) T protein:vir:10 212 P-----E-TKERIY-DRNS----DTLD-GLPVVNLKSSN--LKRGELITGDFD------KLIYGIPQLIEYKIDETAQLS 271 (324) T ss_pred c-----C-Cceeec-CCCC----cccc-ceeEEeecCCC--CCcceEEEEecc------cEEEEEecCcEEEEeeccccc Confidence 1 1 011111 1111 3443 46777765432 222233333211 111111111111 1 Q ss_pred cccCCc--------cccceeeeeeeeee-eecC--cccccCccccc--cchhh Q lcl|Aclame:pro 411 RSIDPN--------TFQPKIGFKTRYGM-VSNP--FVTTNGLYNGT--PDGEA 450 (468) Q Consensus 411 ~~~dp~--------s~qP~~g~~tRY~l-~~nP--~~~~~~~~~~~--~~~~~ 450 (468) ...|+. +-+=.+=...|||. +.+| |+.......+. +.++= T Consensus 272 ~~~~~~~~~~~~~~~~~~~~r~~~r~d~~v~~~~A~~~l~~a~~~~~~~~~~~ 324 (324) T protein:vir:10 272 TVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADKKTDSVPGEV 324 (324) T ss_pred ccccccccchhhhhcCcEEEEEEEEEccEEecccceEEEEeccCCCCCCCCCC Confidence 111111 11223333467775 3455 44332211111 11111 No 53 >protein:vir:100172 Length: 394 # NCBI annotation: putative major head protein # Family: family:all:21 # MgeID: mge:1524 # MgeName: phi AT3 # Cross-refs: genbank:acc:YP_025031;genbank:gi:48697264;genbank:GeneID:2948270 Probab=85.32 E-value=0.054 Score=27.55 Aligned_cols=323 Identities=11% Similarity=0.120 Sum_probs=119.9 Q ss_pred CcchH---HHHHhhhhhhCCCccchhcchhhhHHHHHHHhHHHHHHhhh-----------------------------hh Q lcl|Aclame:pro 1 MFNAE---HLQEKWSPVLNHGEAPAIGDRYKRAVTSVLLENQERFLREE-----------------------------RG 48 (468) Q Consensus 1 ~~~~~---~l~~kw~p~l~~~~~~~i~~~~~~~~~~~llenq~~~~~~~-----------------------------~~ 48 (468) -...| ++.+++.-+.+. + .-|+.|-+.+.+. +. T Consensus 30 ~~~~ee~~~~~~~~~~~~~~-----~----------~~l~~~i~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 94 (394) T protein:vir:10 30 NASVDDFQKIKDDLTAAKAR-----R----------DAINDQIKDLEAENKANSDPDKPVDNAQPNGTDLKKKPIDAKKK 94 (394) T ss_pred hccHHHHHHHHHHHHHHHHH-----H----------HHHHHHHHHHHHHHHhhcchhhhhhhhcccccchhhhHHHHHHH Confidence 00011 111122111110 0 0011111111000 00 Q ss_pred hhhhhhhhhcCcccc-cccccccccccccccccccceehhhhHHhhhhhhhhheeeeecCCccceeeeeeeeeecCCCCc Q lcl|Aclame:pro 49 MLNEVAVNSLGAGTI-APAGSALGSANTGGLAGFDPVLISLVRRAMPNLMAYDVCGVQPMSGPTGLIFAMRSRYENQAGE 127 (468) Q Consensus 49 ~l~e~~~~~~~~~~~-~~~~~i~~st~tg~i~~~~P~Lv~l~RRa~~~LI~~DI~GVQPmTGPTGLIFAMRsrY~~qsG~ 127 (468) .+.+. +..+.. .........++.|++.--.+..-.++++..+..+-.+++.+.||+++++-+--.+. .++. T Consensus 95 ~~~~~----l~~~~~~~~~~~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~----~~~~ 166 (394) T protein:vir:10 95 AINDF----IHSHGKVIDNAAGHVTSTEAGVLIPEEIIYDPTAEVNSVVDLSTLVTKTPVTTPKGTYPILKR----ATDR 166 (394) T ss_pred HHHHH----HhccchhhhhhhcccccccCceeccHHHHHHHHHHHHhhhhhhhhceeeeccCCceEEEEEec----CCCc Confidence 00000 000000 00000111122222222222223355556666677899999999998775554440 0000 Q ss_pred ccccccCCccccccccccccccccccCccccCCCccccccccccccccccccccccchhhhhccCC-CCcchhhcceEEE Q lcl|Aclame:pro 128 EALFNEPDTGFTGGYDASQGDYAVRTGAGVGGDSEGNNPALLNDAAPGTYEVGSKMPREDLERMGE-ANRLFREMSFSIE 206 (468) Q Consensus 128 EA~fnEa~t~fSg~~~~~~~~~~~~~~~~~~~~~~gt~~~~~~~a~~~~~t~~~gm~Ta~aE~lG~-~~~~f~EMaFsIe 206 (468) - .+ ..+.+...+ +...|.+..|++. T Consensus 167 ~--------~~----------------------------------------------~~E~~~~~~~~~~~~~~v~l~~~ 192 (394) T protein:vir:10 167 F--------SS----------------------------------------------VAELAENPALAEPEFEQVDWSVS 192 (394) T ss_pred c--------cc----------------------------------------------ccccccccccccccceeEEeeee Confidence 0 00 000000011 1123555555555 Q ss_pred EEEEEeecccccccccHHHHHHHHHhcCCChhHHHHHHHHHHHHHHhhHHHHHHHhhhhhcccccccccccccccccccc Q lcl|Aclame:pro 207 KTSVTAQSRALKAEYTLELAQDLKAIHGLDAEQELANILSSEVLAEINREVVRRVYTVAKKGAQNNVANAGIFDLDVDSN 286 (468) Q Consensus 207 K~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAE~ELanILStEImlEINREii~~l~~va~~~k~~~~~~~g~~Dl~~~~~ 286 (468) |. +-...+|-||.+|- ..|.+++|.+-|+..|..-+|+.|+.-.- .+...++...-. T Consensus 193 k~-------~~~~~iS~ell~ds----~~~l~~~i~~~la~~~~~~~~~~il~g~g----~~~~~~~~~~~~-------- 249 (394) T protein:vir:10 193 TY-------RGAIPLSEEAIADS----AVDLTSLVGQSINEKSVNTYNAMIAPVLQ----SFTAKATTTDTL-------- 249 (394) T ss_pred ee-------EeeehhHHHHHhhh----hHHHHHHHHHHHHHHHHHHHHHHHhhccc----cccccccccccc-------- Confidence 54 44567999999984 25788899999999999999998875442 221111111111 Q ss_pred chhHHHHHHHHHHHHHHHHHHHHHhhcCCCccEEEEchhHHHHHHhhccccccccccccccccccc---ccccCceeEEE Q lcl|Aclame:pro 287 GRWSVEKFKGLLFQVERDANAIAQETRRGKGNFLICSADVASALAMAGVLDYSSGLNGAGGPSIGE---VDDTGNLAVGT 363 (468) Q Consensus 287 grw~~e~~k~L~~~i~~ean~i~~~T~r~~~n~~v~S~~Va~~L~~sG~~~~~~~~~~~~~~~~~~---~d~t~~~~~G~ 363 (468) ......++......+ +. ..+|++|.....|... ..+. ++ .+.. ...+.....++ T Consensus 250 ----~d~l~~~~~~~~~~~---------~~-a~~vmn~~~~~~l~~l---kd~~---G~---~i~~~~~~~~~~~~~~~~ 306 (394) T protein:vir:10 250 ----VDSLKHILNVDLDPA---------YS-RALVVTQSLFNTLDTL---KDKN---GR---YLLHDASDSITDGTAKGT 306 (394) T ss_pred ----HHHHHHHHHhhhhhh---------cc-CEEEecHHHHHHHHHh---hccC---CC---eeeeccccccccCCcccc Confidence 111222211111111 22 3577999988888753 2111 10 0111 11112222356 Q ss_pred ecCceEEEE-c-ccccccCCcceEEEEEecCCcccceeEeeccchhhcccccCCccccceeeeeeeeeeee-cC--cccc Q lcl|Aclame:pro 364 INGRIKVFV-D-PYAANLSDKHYYVIGYKGTSPYDAGLFYCPYVPLQMVRSIDPNTFQPKIGFKTRYGMVS-NP--FVTT 438 (468) Q Consensus 364 l~g~~~vy~-D-~Ya~~~~~~dY~~vG~KG~~~~d~glfyaPYv~l~~~~~~dp~s~qP~~g~~tRY~l~~-nP--~~~~ 438 (468) |. |++|++ | .+..+.....-+++|--.. ++....- ...-....+...|.-.+-...|++..+ || |... T Consensus 307 L~-G~PV~~~~~~~~~~~~~~~~i~~gd~s~-----~~~~~~~-~~~~v~~~~~~~~~~~~~~~~r~d~~~~~~~ai~~~ 379 (394) T protein:vir:10 307 VL-GVPVYVVGDALLGSAAGDQKAFVGDLKR-----GVLFADR-QQVTLAWEDSKIYGRYLGAAFRFGVKQADSNAGYFV 379 (394) T ss_pred cc-cceeEEecccccCCCCCceEEEEeeccc-----cEEEEee-cceEEEEecccccceeEEEEEEeccEEeccccEEEE Confidence 64 455554 3 2211111101122221000 0000000 000011123444555566667887643 34 2111 Q ss_pred cCccccccchhhhhhhcccceee Q lcl|Aclame:pro 439 NGLYNGTPDGEALTPNANMYYRR 461 (468) Q Consensus 439 ~~~~~~~~~~~~~~~~an~y~~r 461 (468) . .....+.+=+|=+| T Consensus 380 ~--------~~~~~~~~~~~~~~ 394 (394) T protein:vir:10 380 T--------NTDAASGSTSGTGK 394 (394) T ss_pred E--------eecccCCCCCCCCC Confidence 0 00011111112223 No 54 >protein:vir:6212 Length: 434 # NCBI annotation: prohead protease # Family: family:all:21 # MgeID: mge:128 # MgeName: phBC6A52 # Cross-refs: genbank:acc:NP_852592;genbank:gi:31415852;genbank:GeneID:1489210 Probab=84.06 E-value=0.063 Score=27.15 Aligned_cols=345 Identities=11% Similarity=0.056 Sum_probs=122.0 Q ss_pred CcchHHHHHhhhhhh-----------CCCccch--hcchhhhHHHHHHHhHHH-------HHHhhhhhhhhhhhhhhcCc Q lcl|Aclame:pro 1 MFNAEHLQEKWSPVL-----------NHGEAPA--IGDRYKRAVTSVLLENQE-------RFLREERGMLNEVAVNSLGA 60 (468) Q Consensus 1 ~~~~~~l~~kw~p~l-----------~~~~~~~--i~~~~~~~~~~~llenq~-------~~~~~~~~~l~e~~~~~~~~ 60 (468) -+..+.-.++..-.. .+++..+ +.....|+.....+.+.. ....+.+..+.+.... +. T Consensus 58 ~le~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~e~r~a~~~~l~~--~~ 135 (434) T protein:vir:62 58 KLEEKEKEEDPAKKKDDDPEKKEDPTAKENPNEKTELSEEQRSAISASIAAALSTKGHRTNKETEIRSVFANYIVG--NI 135 (434) T ss_pred HHHHHHHHHHHHhhhcchhhhhcchhhhcchhhhHHHHHHHHHHHHHHHHhhhhhccccchHHHHHHHHHHHHhcc--cc Confidence 111111122221111 1111000 111111111111111110 0001111111111000 00 Q ss_pred ccccccccccccccccccccccceeh--hhhHHhhhhhhhhheeeeecCCccceeeeeeeeeecCCCCcccccccCCccc Q lcl|Aclame:pro 61 GTIAPAGSALGSANTGGLAGFDPVLI--SLVRRAMPNLMAYDVCGVQPMSGPTGLIFAMRSRYENQAGEEALFNEPDTGF 138 (468) Q Consensus 61 ~~~~~~~~i~~st~tg~i~~~~P~Lv--~l~RRa~~~LI~~DI~GVQPmTGPTGLIFAMRsrY~~qsG~EA~fnEa~t~f 138 (468) .... ...+..++..|+.. =|.-+ .+++...+..+...++-|.|++|..- |-. +..... ..+ T Consensus 136 ~~~e-~~a~~~~t~~GG~l--vP~~~~~~Ii~~l~~~~~i~~~~~~~~~~~~~~--~p~---~~~~~~---------a~~ 198 (434) T protein:vir:62 136 DEKE-ARALGLVTGNGSVT--IPDFLSKEIITYAQEENFLRRLGTGVKTKENIK--YPV---LVKKAE---------AQG 198 (434) T ss_pred chhh-hhhhccccccccee--cchhhHHHHHHhhhhhhhhhhhcceeccCCceE--EEE---EecCCc---------ccc Confidence 0000 00001111122211 12221 24454556667778888888765311 110 100000 000 Q ss_pred cccccccccccccccCccccCCCccccccccccccccccccccccchhhhhccCCCCcchhhcceEEEEEEEEeeccccc Q lcl|Aclame:pro 139 TGGYDASQGDYAVRTGAGVGGDSEGNNPALLNDAAPGTYEVGSKMPREDLERMGEANRLFREMSFSIEKTSVTAQSRALK 218 (468) Q Consensus 139 Sg~~~~~~~~~~~~~~~~~~~~~~gt~~~~~~~a~~~~~t~~~gm~Ta~aE~lG~~~~~f~EMaFsIeK~tVtAKSRaLK 218 (468) ...+..+...++-..++++++..+|.-+-. T Consensus 199 --------------------------------------------------~~~~~e~~~~~~~~~~f~~v~~~~~k~~~~ 228 (434) T protein:vir:62 199 --------------------------------------------------HKNERTNNEMPETDIEFDEIELSPTEFDAL 228 (434) T ss_pred --------------------------------------------------eecccccccccccccceeeEEeeheeeEee Confidence 000001122233334566667777777777 Q ss_pred ccccHHHHHHHHHhcCCChhHHHHHHHHHHHHHHhhHHHHHHHhhhhhccccccccccccccccccccchhHHHHHHHHH Q lcl|Aclame:pro 219 AEYTLELAQDLKAIHGLDAEQELANILSSEVLAEINREVVRRVYTVAKKGAQNNVANAGIFDLDVDSNGRWSVEKFKGLL 298 (468) Q Consensus 219 AEYT~ELAQDLkAiHGLDAE~ELanILStEImlEINREii~~l~~va~~~k~~~~~~~g~~Dl~~~~~grw~~e~~k~L~ 298 (468) ..+|-||.+|- .+|.+++|.+-|+..|..-+++.||.-==+ -....++.......+...... ..+....|. T Consensus 229 ~~iS~ell~ds----~~~l~~~i~~~la~~~~~~~d~~~l~G~G~---~~~~~g~~~~~~~~~~~~~~~--~~d~l~~l~ 299 (434) T protein:vir:62 229 ATVTKKLLART----GLPIEQIVMDELKKAYVRKETQYMVNGDEA---NNINDGALAKKAVEFKTDEKN--LYDALVKMK 299 (434) T ss_pred hhhHHHHHhcc----hHHHHHHHHHHHHHHHHHHHHHHHhccCCC---Cccccceeecccccccccccc--hhhHHHHHH Confidence 88999999995 467899999999999999999888841100 000111111111111111111 112222332 Q ss_pred HHHHHHHHHHHHhhcCCCccEEEEchhHHHHHHhhcccccccccccccccccccccccCc-eeEEEecCceEEEEccccc Q lcl|Aclame:pro 299 FQVERDANAIAQETRRGKGNFLICSADVASALAMAGVLDYSSGLNGAGGPSIGEVDDTGN-LAVGTINGRIKVFVDPYAA 377 (468) Q Consensus 299 ~~i~~ean~i~~~T~r~~~n~~v~S~~Va~~L~~sG~~~~~~~~~~~~~~~~~~~d~t~~-~~~G~l~g~~~vy~D~Ya~ 377 (468) +.+. .. -+..+ ..|+++.....|... ..+ + |....+-..+.. -.-.+|. |++|+++.++. T Consensus 300 ~~l~-------~~-~~~~a-~~v~n~~~~~~L~~l---kd~-----~-G~~l~~~~~~~~~g~~~tl~-G~pV~~~~~~~ 360 (434) T protein:vir:62 300 NTPV-------KE-VRKKA-RWVLNTAALTKIETM---KTD-----D-GFPLLRPFNQAEGGIGYTLL-GFPVEEEDAID 360 (434) T ss_pred hhcc-------hh-hhcCC-EEEEcHHHHHHHHHh---hcc-----C-CCEeeccCCCccCCCCceec-ceeeEEecCcc Confidence 3222 11 12334 457899988888752 211 1 111111111000 0012454 47777775432 Q ss_pred ccCC--cceEEEEEecCCcccceeE-eeccc-hhhcccccCCc--cccceeeeeeee-eeeec-CcccccCcccc-ccch Q lcl|Aclame:pro 378 NLSD--KHYYVIGYKGTSPYDAGLF-YCPYV-PLQMVRSIDPN--TFQPKIGFKTRY-GMVSN-PFVTTNGLYNG-TPDG 448 (468) Q Consensus 378 ~~~~--~dY~~vG~KG~~~~d~glf-yaPYv-~l~~~~~~dp~--s~qP~~g~~tRY-~l~~n-P~~~~~~~~~~-~~~~ 448 (468) .... ..-+.+| +- +-| ..... .+.+.+..++- +-|=.+..+.|. |-.++ ||+..-=...+ .+.+ T Consensus 361 ~~~~~~~~~i~~G---df----s~~~i~~~~g~~~i~~~~~~~~~~~~v~~~~~~r~Dgk~i~~~~~~~~~~~~~~~~~~ 433 (434) T protein:vir:62 361 IPDSPDTPVFYFG---DF----SKFYIQDVIGSLEVQKLVELFSRTNRVGFRIWNLLDAQLIHSPFEVPVYKYVLKAPTG 433 (434) T ss_pred CccCCCceEEEEe---ec----cceEEEEeeceeEEEeehhhhcccCceEEEEEeeecceeecCcccceEEEEEeccCCC Confidence 1100 0001111 11 000 00011 11222222332 233334555777 44344 87753111111 1221 Q ss_pred h Q lcl|Aclame:pro 449 E 449 (468) Q Consensus 449 ~ 449 (468) . T Consensus 434 ~ 434 (434) T protein:vir:62 434 A 434 (434) T ss_pred C Confidence 1 No 55 >protein:vir:9704 Length: 394 # NCBI annotation: hypothetical protein # Family: family:all:21 # MgeID: mge:174 # MgeName: 315.2 # Cross-refs: genbank:acc:NP_795466;genbank:gi:28876225;genbank:GeneID:1257769 Probab=83.28 E-value=0.07 Score=26.92 Aligned_cols=325 Identities=14% Similarity=0.122 Sum_probs=121.3 Q ss_pred CcchHHHHHhhhhhhCC------------------------Cccchhc-------chhhhHHHHHHHhHHHHHHhhhhhh Q lcl|Aclame:pro 1 MFNAEHLQEKWSPVLNH------------------------GEAPAIG-------DRYKRAVTSVLLENQERFLREERGM 49 (468) Q Consensus 1 ~~~~~~l~~kw~p~l~~------------------------~~~~~i~-------~~~~~~~~~~llenq~~~~~~~~~~ 49 (468) .++.+ -.++|.-+... +..+... ....+......+..+.+........ T Consensus 30 ~~~~~-~~~~~~~l~~eie~l~~ei~~l~~~~~~~e~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 108 (394) T protein:vir:97 30 ALESD-DLEAARSIKAEVEQAKANLVEAENDLKLYESSVEVGGAENIGGKEVTQEEKTYRESVNDFIRSKGKIVNDSLRF 108 (394) T ss_pred hhchh-hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccccccchhhHHHHHHHHHHHHHHHHHhhhhhhh Confidence 22211 12223322210 0000000 0000000111111111111000000 Q ss_pred -hhhhhhhhcCccccccccccccc--ccccccccccceehhhhHHhhhhhhhhheeeeecCCccceeeeeeeeeecCCCC Q lcl|Aclame:pro 50 -LNEVAVNSLGAGTIAPAGSALGS--ANTGGLAGFDPVLISLVRRAMPNLMAYDVCGVQPMSGPTGLIFAMRSRYENQAG 126 (468) Q Consensus 50 -l~e~~~~~~~~~~~~~~~~i~~s--t~tg~i~~~~P~Lv~l~RRa~~~LI~~DI~GVQPmTGPTGLIFAMRsrY~~qsG 126 (468) ..+......- ..........+ ..+|++.--....-.+++...+......++.+.||+++++-+--++ ..+ T Consensus 109 ~~~~~~~~~~~--~~~~~~~~~~~~t~~~gg~liP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~-----~~~ 181 (394) T protein:vir:97 109 EGKDEVLMPIN--ETTPVEPQKDGIKKENAKPVSSEEILYTPAREVKTVVDLKPFTTVYQAKKASGKYPVLQ-----RAT 181 (394) T ss_pred hhHHHHHHHHH--hhhhhhhhccccccccccccChHHHHHHHHHHhhhhhhhhhhceeeeccCcceEEEEEe-----cCC Confidence 0000000000 00000011111 1112221111122234555556667788899999988876442222 000 Q ss_pred cccccccCCccccccccccccccccccCccccCCCccccccccccccccccccccccchhhhhccCCCCcchhhc-ceEE Q lcl|Aclame:pro 127 EEALFNEPDTGFTGGYDASQGDYAVRTGAGVGGDSEGNNPALLNDAAPGTYEVGSKMPREDLERMGEANRLFREM-SFSI 205 (468) Q Consensus 127 ~EA~fnEa~t~fSg~~~~~~~~~~~~~~~~~~~~~~gt~~~~~~~a~~~~~t~~~gm~Ta~aE~lG~~~~~f~EM-aFsI 205 (468) .. ..+ . +| +...++. ..++ T Consensus 182 ~~-------~~~----------------------------------------------v--~E-----~~~~~~~~~~~~ 201 (394) T protein:vir:97 182 TK-------MVT----------------------------------------------V--AE-----LEKNPALAKPDF 201 (394) T ss_pred Cc-------cce----------------------------------------------e--cc-----cccccccccccc Confidence 00 000 0 01 0112222 2445 Q ss_pred EEEEEEeecccccccccHHHHHHHHHhcCCChhHHHHHHHHHHHHHHhhHHHHHHHhhhhhccccccccccccccccccc Q lcl|Aclame:pro 206 EKTSVTAQSRALKAEYTLELAQDLKAIHGLDAEQELANILSSEVLAEINREVVRRVYTVAKKGAQNNVANAGIFDLDVDS 285 (468) Q Consensus 206 eK~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAE~ELanILStEImlEINREii~~l~~va~~~k~~~~~~~g~~Dl~~~~ 285 (468) ++++..++.-+-...+|-||.+|- +.|.+++|.+-|+..|..-+|..||.-+-+. +..+...+ T Consensus 202 ~~v~l~~~k~~~~i~is~ell~ds----~~~~~~~i~~~la~~~~~~~~~~i~~g~~~~---------~~~~~~~~---- 264 (394) T protein:vir:97 202 KDVAWNIDTYRGAIPLSQESIDDA----DVDLVGIVSESISQIKVNTTNDAIAKVLKSF---------TTKTVKNL---- 264 (394) T ss_pred eeEEeehhheeeehhhHHHHHhhh----hHHHHHHHHHHHHHHHHHHHHHHHhhccccc---------cccccccH---- Confidence 566666666666788999999986 3467888888888888888888777543221 12222211 Q ss_pred cchhHHHHHHHHHHHHHHHHHHHHHhhcCCCccEEEEchhHHHHHHhhcccccccccccccccccccccccCceeEEEec Q lcl|Aclame:pro 286 NGRWSVEKFKGLLFQVERDANAIAQETRRGKGNFLICSADVASALAMAGVLDYSSGLNGAGGPSIGEVDDTGNLAVGTIN 365 (468) Q Consensus 286 ~grw~~e~~k~L~~~i~~ean~i~~~T~r~~~n~~v~S~~Va~~L~~sG~~~~~~~~~~~~~~~~~~~d~t~~~~~G~l~ 365 (468) +....++ +.. ... +..+. +||+|.+...|... ..+. |..+.+.+-++. .-++|. T Consensus 265 ------~~~~~~~-------~~~-~~~-~~~a~-~v~n~~~~~~l~~l---kd~~------G~~i~~~~~~~~-~~~~l~ 318 (394) T protein:vir:97 265 ------DEIKALL-------NGG-FDP-AYNVS-LIVSQSFYQTLDTL---KDGN------GRYLLQDDITAV-SGKVLL 318 (394) T ss_pred ------HHHHHHH-------Hhh-hhh-hhCCE-EEEcHHHHHHHHHh---hccC------CCeeeecCcCCC-CCceec Confidence 1111111 111 111 22344 57999999888763 1110 111111111111 124665 Q ss_pred CceEEEE--cccccccCCcceEEEEEecCCcccceeEeeccchhhcccccCCccccceeeeeeeeeeee-cC--cccccC Q lcl|Aclame:pro 366 GRIKVFV--DPYAANLSDKHYYVIGYKGTSPYDAGLFYCPYVPLQMVRSIDPNTFQPKIGFKTRYGMVS-NP--FVTTNG 440 (468) Q Consensus 366 g~~~vy~--D~Ya~~~~~~dY~~vG~KG~~~~d~glfyaPYv~l~~~~~~dp~s~qP~~g~~tRY~l~~-nP--~~~~~~ 440 (468) | ++|++ |... +..-+++|-- ..+.++..-..+.. ...|...++..+-...|++..+ +| |....- T Consensus 319 G-~pv~~~~~~~~----~~~~~~~gd~-----~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~r~d~~v~~~~a~~~~~~ 387 (394) T protein:vir:97 319 G-KPVFVLSDEVL----GANKAFIGDF-----KRGVLFADRKDLGL-RWADNEIYGQYLQAVLRFGVSKVDDKAGYYVTF 387 (394) T ss_pred c-ceeEEeccccc----CCccEEEeec-----cccEEEEEecceEE-EEecccccceeEEEEEEEccEEecccceEEEEe Confidence 5 55554 4322 2222333320 01111222211111 1234444555555667777643 44 322111 Q ss_pred ccccccc Q lcl|Aclame:pro 441 LYNGTPD 447 (468) Q Consensus 441 ~~~~~~~ 447 (468) .....|= T Consensus 388 ~~~~~p~ 394 (394) T protein:vir:97 388 TPEPLPL 394 (394) T ss_pred cccccCC Confidence 1111111 No 56 >protein:vir:9759 Length: 303 # NCBI annotation: putative structural protein # Family: family:all:966 # MgeID: mge:175 # MgeName: 315.3 # Cross-refs: genbank:acc:NP_795521;genbank:gi:28876283;genbank:GeneID:1257824 Probab=83.21 E-value=0.07 Score=26.90 Aligned_cols=278 Identities=14% Similarity=0.133 Sum_probs=119.1 Q ss_pred cccccccccccccccee-hhhhHHhhhhhhhhheeeeecCCccceeeeeeeeeecCCCCcccccccCCcccccccccccc Q lcl|Aclame:pro 69 ALGSANTGGLAGFDPVL-ISLVRRAMPNLMAYDVCGVQPMSGPTGLIFAMRSRYENQAGEEALFNEPDTGFTGGYDASQG 147 (468) Q Consensus 69 i~~st~tg~i~~~~P~L-v~l~RRa~~~LI~~DI~GVQPmTGPTGLIFAMRsrY~~qsG~EA~fnEa~t~fSg~~~~~~~ 147 (468) ++.++++|.+ ..|.+ -.+++++.+..+..+++.+-||++.+.-|. ++.. +.++ .| T Consensus 1 m~t~t~gg~l--iP~~~~~~ii~~l~~~s~i~~l~~~~~~~~~~~~ip----~~~~--~~~a-------~w--------- 56 (303) T protein:vir:97 1 MGTETSKASL--FDKHLVSDLINKVKGHSSLAKLSSQKPIPFNGSKEF----TFTL--DSDI-------DV--------- 56 (303) T ss_pred CcccCCCCeE--cchhHHHHHHHHHHhhchhhhhcceeecCCCceEEE----EEec--Ccce-------EE--------- Confidence 4434433322 23333 345666667888999999999986554332 1111 0000 00 Q ss_pred ccccccCccccCCCccccccccccccccccccccccchhhhhccCCCCcchhhcceEEEEEEEEeecccccccccHHHHH Q lcl|Aclame:pro 148 DYAVRTGAGVGGDSEGNNPALLNDAAPGTYEVGSKMPREDLERMGEANRLFREMSFSIEKTSVTAQSRALKAEYTLELAQ 227 (468) Q Consensus 148 ~~~~~~~~~~~~~~~gt~~~~~~~a~~~~~t~~~gm~Ta~aE~lG~~~~~f~EMaFsIeK~tVtAKSRaLKAEYT~ELAQ 227 (468) .+ | +..+++-..+++.++..+|.-+-....|-||.| T Consensus 57 -------------------------------------v~--E-----~~~~~~s~~~f~~v~l~~~kl~~~~~iS~ell~ 92 (303) T protein:vir:97 57 -------------------------------------VA--E-----NGKKTHGGLSLEPVTIVPIKVEYGARLSDEFLY 92 (303) T ss_pred -------------------------------------ee--c-----CccccccccceeeEEeeeEEEEEeehhhHHHhh Confidence 00 1 112333334445555555555556689999986 Q ss_pred HHHHhcCCChhHHHHHHHHHHHHHHhhHHHHHHHhhhhhcccccccccccccccccc-------ccchhHHHHHHHHHHH Q lcl|Aclame:pro 228 DLKAIHGLDAEQELANILSSEVLAEINREVVRRVYTVAKKGAQNNVANAGIFDLDVD-------SNGRWSVEKFKGLLFQ 300 (468) Q Consensus 228 DLkAiHGLDAE~ELanILStEImlEINREii~~l~~va~~~k~~~~~~~g~~Dl~~~-------~~grw~~e~~k~L~~~ 300 (468) .... ..++-+++|.+-|+..|...|+..+|.-..... + .+....+...+... ..+.-..+. T Consensus 93 ~~~d-~~~~l~~~i~~~la~a~~~~ld~a~l~G~~~~~--g--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------- 160 (303) T protein:vir:97 93 ATEE-EKIDILKAFNEGFAKKLARGIDLMAMHGINPRT--K--KASDVIGTNHFDSKVTQVVKFTESEDADAN------- 160 (303) T ss_pred cCcc-chHHHHHHHHHHHHHHHHHHHHhhhhcccccCC--c--cccccccccccccccccccccccccchHHH------- Confidence 3322 246678888888888888888888875542111 1 11111111111000 000001122 Q ss_pred HHHHHHHHHHhhcCCCccEEEEchhHHHHHHhhcccccccccccccccccccccccCceeEEEecCceEEEEcccccc-- Q lcl|Aclame:pro 301 VERDANAIAQETRRGKGNFLICSADVASALAMAGVLDYSSGLNGAGGPSIGEVDDTGNLAVGTINGRIKVFVDPYAAN-- 378 (468) Q Consensus 301 i~~ean~i~~~T~r~~~n~~v~S~~Va~~L~~sG~~~~~~~~~~~~~~~~~~~d~t~~~~~G~l~g~~~vy~D~Ya~~-- 378 (468) |....+.+ ...-+..+-+|++|+....|... ..+ ++ ..+..-+-....-.|+|.| ++|+++.+... T Consensus 161 i~~~~~~~--~~~~~~~~~~vmn~~~~~~L~~l---kd~-----~g-~~~~~~~~~~~~~~~~l~G-~Pv~~s~~v~~~~ 228 (303) T protein:vir:97 161 IEAAVNLI--QGAEGVVTGLAMDTEFSTALAKV---TNG-----EM-GPKMYPELAWGANPDSING-LKSSVNTTVGAGA 228 (303) T ss_pred HHHHHHHH--hhcCCCccEEEEcHHHHHHHHHh---hcc-----CC-CeEEecCccCCCCCceecc-eeeEEecccCCcc Confidence 22222222 12235556799999999888642 111 00 0001111111111256764 88888743221 Q ss_pred --cCCcceEEEEEecCCcccceeEeeccch--hhcccccCCcc-----ccc-eeee--eeeeeee-ecC--cccccCccc Q lcl|Aclame:pro 379 --LSDKHYYVIGYKGTSPYDAGLFYCPYVP--LQMVRSIDPNT-----FQP-KIGF--KTRYGMV-SNP--FVTTNGLYN 443 (468) Q Consensus 379 --~~~~dY~~vG~KG~~~~d~glfyaPYv~--l~~~~~~dp~s-----~qP-~~g~--~tRY~l~-~nP--~~~~~~~~~ 443 (468) ..+.+.+++| +- ...+.+...-. ++.....|++. ||- .++| ..||+.. .|| |+.... T Consensus 229 ~~~~~~~~~~~G---df--~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~n~~~~r~~~r~~~~v~~p~af~~l~~--- 300 (303) T protein:vir:97 229 DEAESKDLVIIG---DF--ESMFKWGYAKQIPMEIIKYGDPDNSGKDLKGYNQIYLRAEAYIGWGILDAKSFARVTK--- 300 (303) T ss_pred ccCCCccEEEEe---ec--cccEEEEEecCcEEEEeeccCCCCcchhhhhcCcEEEEEEEEeccEeecccceEEeeC--- Confidence 0112222222 10 11111222211 11222223321 111 1333 4566543 344 332221 Q ss_pred ccc Q lcl|Aclame:pro 444 GTP 446 (468) Q Consensus 444 ~~~ 446 (468) ..+ T Consensus 301 ~~~ 303 (303) T protein:vir:97 301 GEV 303 (303) T ss_pred CCC Confidence 111 No 57 >protein:vir:3991 Length: 404 # NCBI annotation: major structural protein # Family: family:all:21 # MgeID: mge:319 # MgeName: BK5-T # Cross-refs: genbank:acc:NP_116499;genbank:gi:14251132;genbank:GeneID:921252 Probab=82.64 E-value=0.075 Score=26.75 Aligned_cols=333 Identities=15% Similarity=0.134 Sum_probs=130.7 Q ss_pred CcchHHHHHhhhhhhCCCccchhcchh----------h---hHHHHHHHhHHHHHHhhhhhhhhhhhh------------ Q lcl|Aclame:pro 1 MFNAEHLQEKWSPVLNHGEAPAIGDRY----------K---RAVTSVLLENQERFLREERGMLNEVAV------------ 55 (468) Q Consensus 1 ~~~~~~l~~kw~p~l~~~~~~~i~~~~----------~---~~~~~~llenq~~~~~~~~~~l~e~~~------------ 55 (468) .|+-++|.++|.-+.+. +-++.+.- . +.+.+.+ +...+...+-+..+.+... T Consensus 4 ~m~l~el~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~ee~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (404) T protein:vir:39 4 KLTVNQLNEAWIASGDK--VTDFNDQINMALNDDNFSAEAMSELKNKR-DNEKVRRDALREQLVEAQAEQVVNMREEEKG 80 (404) T ss_pred HHHHHHHHHHHHHHHHH--HHHHHHHHHHHhccccccHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHhcccccccc Confidence 56888999999887654 11111100 0 0111111 0000000000001111000 Q ss_pred ------------------hhcCccccc----ccccccccc-cccccc---cccceehhhhHHhhhhhhhhheeeeecCCc Q lcl|Aclame:pro 56 ------------------NSLGAGTIA----PAGSALGSA-NTGGLA---GFDPVLISLVRRAMPNLMAYDVCGVQPMSG 109 (468) Q Consensus 56 ------------------~~~~~~~~~----~~~~i~~st-~tg~i~---~~~P~Lv~l~RRa~~~LI~~DI~GVQPmTG 109 (468) .-+..+... ....+..++ ++|+.. .+.+.+ ++...+.....+++.+.||++ T Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~a~~~~t~~~gg~~iP~~~~~~i---i~~~~~~~~l~~~~~~~~~~~ 157 (404) T protein:vir:39 81 PLNKSEYELKDKFVKEFVNMVRNPMAFLNTVSSKTETSGSDSAAGLTIPQDIRTMI---NTLVRQYDSLQQYVRVESVST 157 (404) T ss_pred ccccchhhhHHHHHHHHHHHHhcchhhhhhhhhhhhhcccccCCceeccHHHHHHH---HHHHHhhhhHHhhcceeeccC Confidence 000000000 000011111 122111 122233 333345567788899999999 Q ss_pred cceeeeeeeeeecCCCCcccccccCCccccccccccccccccccCccccCCCccccccccccccccccccccccchhhhh Q lcl|Aclame:pro 110 PTGLIFAMRSRYENQAGEEALFNEPDTGFTGGYDASQGDYAVRTGAGVGGDSEGNNPALLNDAAPGTYEVGSKMPREDLE 189 (468) Q Consensus 110 PTGLIFAMRsrY~~qsG~EA~fnEa~t~fSg~~~~~~~~~~~~~~~~~~~~~~gt~~~~~~~a~~~~~t~~~gm~Ta~aE 189 (468) ++|-+--.| ..+..+ . ..| .++++ T Consensus 158 ~~~~~~~~~--~~~~~~-~-------a~~----------------------------------------------v~Eg~ 181 (404) T protein:vir:39 158 SNGSRVYEK--WTDVTP-L-------TVM----------------------------------------------DAEDG 181 (404) T ss_pred CcceEEEEe--ecCCcc-c-------eee----------------------------------------------ecCcc Confidence 887654333 111000 0 000 00001 Q ss_pred ccCC-CCcchhhcceEEEEEEEEeecccccccccHHHHHHHHHhcCCChhHHHHHHHHHHHHHHhhHHHHHHHhhhhhcc Q lcl|Aclame:pro 190 RMGE-ANRLFREMSFSIEKTSVTAQSRALKAEYTLELAQDLKAIHGLDAEQELANILSSEVLAEINREVVRRVYTVAKKG 268 (468) Q Consensus 190 ~lG~-~~~~f~EMaFsIeK~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAE~ELanILStEImlEINREii~~l~~va~~~ 268 (468) ...+ ....|.++.|++.|..+-. .+|-||.+|- ..|.+++|.+-|+..|..-+|..||.-.- T Consensus 182 ~~~~~~~~~f~~i~~~~~k~~~~~-------~iS~ell~ds----~~~l~~~i~~~l~~~~~~~~d~~il~g~g------ 244 (404) T protein:vir:39 182 KIPDLDNPRLTIIKYLIKRYAGII-------TATNTLLKDT----AENILAWLSSWIAKKVVVTRNQAIIAAMG------ 244 (404) T ss_pred ccccccccceeeEEeeeeeEEeee-------hhHHHHHhhc----hHHHHHHHHHHHHHHHHHHHHHHHHhccc------ Confidence 1111 1235677777777766554 4999999984 25789999999999999999998875321 Q ss_pred ccccccccccccccccccchhHHHHHHHHHHHHHHHHHHHHHhhcCCCccEEEEchhHHHHHHhhccccccccccccccc Q lcl|Aclame:pro 269 AQNNVANAGIFDLDVDSNGRWSVEKFKGLLFQVERDANAIAQETRRGKGNFLICSADVASALAMAGVLDYSSGLNGAGGP 348 (468) Q Consensus 269 k~~~~~~~g~~Dl~~~~~grw~~e~~k~L~~~i~~ean~i~~~T~r~~~n~~v~S~~Va~~L~~sG~~~~~~~~~~~~~~ 348 (468) .+....+..+++ ....+++.....+ - + ....+||+|.....|... ..+. +. T Consensus 245 --~~~~~~~~~~~~----------~i~~~~~~~~~~~----~---~-~~a~~v~n~~~~~~L~~l---kd~~------G~ 295 (404) T protein:vir:39 245 --TVPKKPTIAKFD----------DVITMINTSVDPA----I---I-ATSSLLTNQSGLNKLALV---KTAE------GK 295 (404) T ss_pred --ccccccccccHH----------HHHHHHHHhhhhh----h---c-cCCEEEEcHHHHHHHHHh---hccC------Cc Confidence 122222333322 1112222111111 0 1 233589999999999863 2110 11 Q ss_pred ccccccccCceeEEEecCceEEEE-c-ccccccCCcce-EEEE-Eec----CCcccceeEeeccchhhcccccCCccccc Q lcl|Aclame:pro 349 SIGEVDDTGNLAVGTINGRIKVFV-D-PYAANLSDKHY-YVIG-YKG----TSPYDAGLFYCPYVPLQMVRSIDPNTFQP 420 (468) Q Consensus 349 ~~~~~d~t~~~~~G~l~g~~~vy~-D-~Ya~~~~~~dY-~~vG-~KG----~~~~d~glfyaPYv~l~~~~~~dp~s~qP 420 (468) ..+..+-++.. .++|.| ++|++ | ....+....++ +++| ++. .....-.+=..+|+...| ...+= T Consensus 296 ~l~~~~~~~~~-~~~l~G-~pV~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~~------~~~~~ 367 (404) T protein:vir:39 296 YLLEPDPTKPN-SYLIKG-KKVIVVADRWLPNSGSTVYPLYYGDMSQAITLFDRENMSLLPTNIGAGAF------ETDTT 367 (404) T ss_pred eeeccCcCCCC-cceecc-eeEEEecccccCccCCCccEEEEEeccccEEEEeecceEEEEeccchhhh------hhcee Confidence 11111111111 145644 45444 2 11111111111 1222 110 000011122222222111 12344 Q ss_pred eeeeeeeeeeee-cC--cccc----cCcccc-ccchh Q lcl|Aclame:pro 421 KIGFKTRYGMVS-NP--FVTT----NGLYNG-TPDGE 449 (468) Q Consensus 421 ~~g~~tRY~l~~-nP--~~~~----~~~~~~-~~~~~ 449 (468) .+-...||+..+ +| |... -+...+ .+.|- T Consensus 368 ~~r~~~r~d~~~~~~~a~~~~~~~~~a~~~~~~~~~~ 404 (404) T protein:vir:39 368 KIRVIDRFDVKTTDSEALVAGSFTAIADQVGNFTAGK 404 (404) T ss_pred eEEEEeeeccEEecccceEEEEeeccccCCCCCCCCC Confidence 555667777543 44 2211 111111 11111 No 58 >protein:vir:81227 Length: 413 # NCBI annotation: gp6, major capsid protein # Family: family:all:585 # MgeID: mge:1893 # MgeName: BFK20 # Cross-refs: genbank:acc:YP_001456736;genbank:gi:157168379;hssp:P49861;interpro:IPR006444;uniprot:Q9MBJ9;genbank:GeneID:5580350 Probab=82.42 E-value=0.077 Score=26.69 Aligned_cols=339 Identities=13% Similarity=0.034 Sum_probs=116.6 Q ss_pred CcchHHHHHhhhhhhCCCc-cch----hcchhhhHHH-------HHHHhHHHHH----Hhhhhh-----h-hhhhhhhhc Q lcl|Aclame:pro 1 MFNAEHLQEKWSPVLNHGE-APA----IGDRYKRAVT-------SVLLENQERF----LREERG-----M-LNEVAVNSL 58 (468) Q Consensus 1 ~~~~~~l~~kw~p~l~~~~-~~~----i~~~~~~~~~-------~~llenq~~~----~~~~~~-----~-l~e~~~~~~ 58 (468) |...+++.++=.-+.+... +.+ ..+.-++... ..+-++..+. .++.+. . ..+...... T Consensus 31 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 110 (413) T protein:vir:81 31 EDAKRERAKSVKANQDFLRELQEATAGSVDSEKSGELTRKGEGYKSIGEFFAKRAGDQIKQQAGGAQLNYSVGEYVAPRV 110 (413) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhHHhHHHhhhHhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHhhhhhhhhhhhHH Confidence 2222222222211111100 000 0000000000 0000000000 000000 0 000000000 Q ss_pred Cccccccccccccccccccc----ccccceehhhhHHhhhhhhhhheeeeecCCccceeeeeeeeeecCCCCcccccccC Q lcl|Aclame:pro 59 GAGTIAPAGSALGSANTGGL----AGFDPVLISLVRRAMPNLMAYDVCGVQPMSGPTGLIFAMRSRYENQAGEEALFNEP 134 (468) Q Consensus 59 ~~~~~~~~~~i~~st~tg~i----~~~~P~Lv~l~RRa~~~LI~~DI~GVQPmTGPTGLIFAMRsrY~~qsG~EA~fnEa 134 (468) . ..... ..+.++++..- ..+.+.+ ++..-+..+..+++.|+||++++.-+.-..+ . ... T Consensus 111 ~--~~~~~-~~~~~~~~~~~~~vp~~~~~~i---i~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~-~--~~~-------- 173 (413) T protein:vir:81 111 K--AASDP-ASTATLTDEFQGGYGTTWNRNI---IYRRREKLVVADLMDNLTMTNTTIKYLMEKA-N--RVV-------- 173 (413) T ss_pred H--hhhhh-hhhcccccccccccchhhHHHH---HHHHhhhhhHHhhcceeeccCCceeEEEecc-c--ccc-------- Confidence 0 00000 00111111111 1122334 3434456677899999999998753321110 0 000 Q ss_pred CccccccccccccccccccCccccCCCccccccccccccccccccccccchhhhhccCCCC-cchhhcceEEEEEEEEee Q lcl|Aclame:pro 135 DTGFTGGYDASQGDYAVRTGAGVGGDSEGNNPALLNDAAPGTYEVGSKMPREDLERMGEAN-RLFREMSFSIEKTSVTAQ 213 (468) Q Consensus 135 ~t~fSg~~~~~~~~~~~~~~~~~~~~~~gt~~~~~~~a~~~~~t~~~gm~Ta~aE~lG~~~-~~f~EMaFsIeK~tVtAK 213 (468) ...+ . -.++++...+++ ..|.+..|.+.|.. T Consensus 174 ~~~a--------------------------------------~------~v~Eg~~~~~~~~~~f~~i~~~~~k~~---- 205 (413) T protein:vir:81 174 EGGF--------------------------------------K------TVAEGGKKPYMRFADFDIVTESLSKIA---- 205 (413) T ss_pred cccc--------------------------------------c------eecCcccccccCcccceeeEeeeeeEE---- Confidence 0000 0 000011111111 23555555555444 Q ss_pred cccccccccHHHHHHHHHhcCCChhHHHHHHHHHHHHHHhhHHHHHHHhhhhhccccccccccccccccccc-----cch Q lcl|Aclame:pro 214 SRALKAEYTLELAQDLKAIHGLDAEQELANILSSEVLAEINREVVRRVYTVAKKGAQNNVANAGIFDLDVDS-----NGR 288 (468) Q Consensus 214 SRaLKAEYT~ELAQDLkAiHGLDAE~ELanILStEImlEINREii~~l~~va~~~k~~~~~~~g~~Dl~~~~-----~gr 288 (468) -....|-||.+|-- +.++.|.+-|+..|..-+|+.||.- .-.+ -...|++...... ++. T Consensus 206 ---~~~~iS~ell~ds~-----~l~~~i~~~la~~~~~~~d~~~l~G----~G~~----~~~~Gi~~~~~~~~~~~~~~~ 269 (413) T protein:vir:81 206 ---GLTKITDEMIEDYD-----FLVSYINARLLEELAIEEERQLLLG----DGTG----NNLTGLLKRDGIQTLAVSNKD 269 (413) T ss_pred ---EeehhhHHHHHHHH-----HHHHHHHHHHHHHHHHHHHHHHhcc----CCCC----Ccccccccccccccccccccc Confidence 44668899999862 2577777777777877777777632 1111 1122332211110 111 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHhhcCCCccEEEEchhHHHHHHhh----cccccccccccccccccccccccCceeEEEe Q lcl|Aclame:pro 289 WSVEKFKGLLFQVERDANAIAQETRRGKGNFLICSADVASALAMA----GVLDYSSGLNGAGGPSIGEVDDTGNLAVGTI 364 (468) Q Consensus 289 w~~e~~k~L~~~i~~ean~i~~~T~r~~~n~~v~S~~Va~~L~~s----G~~~~~~~~~~~~~~~~~~~d~t~~~~~G~l 364 (468) +. +.....+-.....-..+..+.+|++|.....|..- |-.-+.+......+ +-+....++| T Consensus 270 ~~--------~~~i~~~~~~~~~~~~~~~~~~vmn~~~~~~l~~lkd~~G~~l~~~~~~~~~~-------~~~~~~~~~l 334 (413) T protein:vir:81 270 EL--------ADSIYKAMTNISLATPFQADALVINPLDYQELRLAKDANGQYYGGGVFQGQYG-------SGGIMLDPAP 334 (413) T ss_pred hh--------HHHHHHHHHHhhhhccCCCcEEEEcHHHHHHHHHhhccCCceecccccccccc-------ccccccCcee Confidence 11 11111111111112234566688999988887642 11111111111000 0011112455 Q ss_pred cCceEEEEcccccccCCcceEEEEE-ecC-Ccc---cceeEeeccchhhcccccCCccccceeeeeeeeeeee-cC--cc Q lcl|Aclame:pro 365 NGRIKVFVDPYAANLSDKHYYVIGY-KGT-SPY---DAGLFYCPYVPLQMVRSIDPNTFQPKIGFKTRYGMVS-NP--FV 436 (468) Q Consensus 365 ~g~~~vy~D~Ya~~~~~~dY~~vG~-KG~-~~~---d~glfyaPYv~l~~~~~~dp~s~qP~~g~~tRY~l~~-nP--~~ 436 (468) . |++|+++.+.. ..-+++|- +.. .-. .-.+=..+|.... -.+-|=.+=+..||+..+ +| |+ T Consensus 335 ~-G~pv~~s~~~~----~~~~~~gd~~~~~~~~~~~~~~v~~~~~~~~~------~~~~~~~~r~~~r~d~~~~~~~a~~ 403 (413) T protein:vir:81 335 W-GLRTVQSQVVP----VGKPVVGAFRSAASVLRKGGVRIDSTNTNVDD------FENNLITVRAEERVGLMVTFPEAIV 403 (413) T ss_pred c-ceeeEEcCCCC----cccEEEEecccEEEEEEecceEEEEeccccch------hhcCcEEEEEEEeeccEEecccceE Confidence 4 56888775532 22233432 210 000 0112222222111 123344444555666543 33 33 Q ss_pred cccCcccccc Q lcl|Aclame:pro 437 TTNGLYNGTP 446 (468) Q Consensus 437 ~~~~~~~~~~ 446 (468) ...-.....| T Consensus 404 ~l~~~~~~~p 413 (413) T protein:vir:81 404 QLDVAEVVTP 413 (413) T ss_pred EEEecCCCCC Confidence 2221122233 No 59 >protein:vir:1383 Length: 421 # NCBI annotation: major capsid protein # Family: family:all:21 # MgeID: mge:314 # MgeName: phi3626 # Cross-refs: genbank:acc:NP_612835;genbank:gi:20065969;genbank:GeneID:935826 Probab=82.14 E-value=0.079 Score=26.61 Aligned_cols=340 Identities=12% Similarity=0.070 Sum_probs=124.4 Q ss_pred cch----HHHHHhhhhhhCCCc--cchhcchh-------hhHHHHHH--HhHHHHHHhhh----hhhhhhhhhhhcCc-- Q lcl|Aclame:pro 2 FNA----EHLQEKWSPVLNHGE--APAIGDRY-------KRAVTSVL--LENQERFLREE----RGMLNEVAVNSLGA-- 60 (468) Q Consensus 2 ~~~----~~l~~kw~p~l~~~~--~~~i~~~~-------~~~~~~~l--lenq~~~~~~~----~~~l~e~~~~~~~~-- 60 (468) |+- ++|+++.+-+.+... .-+++... -++..+.+ |+++-+.+++. ...+.+.....-.. T Consensus 1 Mn~~e~lkel~~~~~el~~~~~~~~~~~~~~~~e~~~~e~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (421) T protein:vir:13 1 MNLFERLKELRAKKKELEEKRCGIVEEIRSLAKEKKEEEARSKALEREKIEARMEIIEEEIESVMTAIDEERKNTNFTGG 80 (421) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccc Confidence 552 346677776654410 00011100 00011111 01110111111 11111110000000 Q ss_pred -------------------------ccc-ccccccccccccccc---ccccceehhhhHHhhhhhhhhheeeeecCCccc Q lcl|Aclame:pro 61 -------------------------GTI-APAGSALGSANTGGL---AGFDPVLISLVRRAMPNLMAYDVCGVQPMSGPT 111 (468) Q Consensus 61 -------------------------~~~-~~~~~i~~st~tg~i---~~~~P~Lv~l~RRa~~~LI~~DI~GVQPmTGPT 111 (468) ++. .....-.-+++.|+. ..+.+.++.+. .+...-.+++.+.||++++ T Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ra~~t~~~gg~liP~~~~~~Ii~~~---~~~~~l~~l~~~~~~~~~~ 157 (421) T protein:vir:13 81 RVIINGDSKEEKRSLQLSAMSKTIRGIQLSEEERDIMSSTNNGAVIPQEFVNEFEKLK---EGYPSLKEHCHVIPVNRNA 157 (421) T ss_pred ccccccchhHHHHHHHHHHHHHhhhccchhHHHhhccccCCcceecchhhHHHHHHHH---HhhhhhhhhceeeeccCCc Confidence 000 000000011111221 11223333333 3445667888899988876 Q ss_pred eeeeeeeeeecCCCCcccccccCCccccccccccccccccccCccccCCCccccccccccccccccccccccchhhhhcc Q lcl|Aclame:pro 112 GLIFAMRSRYENQAGEEALFNEPDTGFTGGYDASQGDYAVRTGAGVGGDSEGNNPALLNDAAPGTYEVGSKMPREDLERM 191 (468) Q Consensus 112 GLIFAMRsrY~~qsG~EA~fnEa~t~fSg~~~~~~~~~~~~~~~~~~~~~~gt~~~~~~~a~~~~~t~~~gm~Ta~aE~l 191 (468) +-+--.. .... ..+. ..+| T Consensus 158 ~~~~~~~----~~~~---------~~~~----------------------------------------------~~~E-- 176 (421) T protein:vir:13 158 GKMPVRA----GASV---------DKLA----------------------------------------------NLAK-- 176 (421) T ss_pred eEEEEee----cCCc---------ccee----------------------------------------------eccc-- Confidence 6332111 0000 0000 0000 Q ss_pred CCCCcchhhcceEEEEEEEEeecccccccccHHHHHHHHHhcCCChhHHHHHHHHHHHHHHhhHHHHHHHhhhhhccccc Q lcl|Aclame:pro 192 GEANRLFREMSFSIEKTSVTAQSRALKAEYTLELAQDLKAIHGLDAEQELANILSSEVLAEINREVVRRVYTVAKKGAQN 271 (468) Q Consensus 192 G~~~~~f~EMaFsIeK~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAE~ELanILStEImlEINREii~~l~~va~~~k~~ 271 (468) +...++-..++++++...+.-+-...+|-||.+|-- .|.++.|.+-|+..+..-+|..|+..+-.+ T Consensus 177 ---~~~~~~s~~~f~~i~~~~~k~~~~v~iS~ell~ds~----~~l~~~i~~~la~~~~~~~~~~i~~~~~g~------- 242 (421) T protein:vir:13 177 ---DTELVKAMLKTQPMAYDIDDYGLLAPIDNSLLEDSE----INFLEFVNEEFAEFAVNTENAEIVKQAKAV------- 242 (421) T ss_pred ---cccccccccceeEEEeeeeeeEeehhhhHHHHhhhH----HHHHHHHHHHHHHHHHHHhhhhHhhhhhhc------- Confidence 112233334444555555555555779999999853 467888888888888888888887643221 Q ss_pred cccccccccccccccchhHHHHHHHHHHHHHHHHHHHHHhhcCCCccEEEEchhHHHHHHhhcccccccccccccccccc Q lcl|Aclame:pro 272 NVANAGIFDLDVDSNGRWSVEKFKGLLFQVERDANAIAQETRRGKGNFLICSADVASALAMAGVLDYSSGLNGAGGPSIG 351 (468) Q Consensus 272 ~~~~~g~~Dl~~~~~grw~~e~~k~L~~~i~~ean~i~~~T~r~~~n~~v~S~~Va~~L~~sG~~~~~~~~~~~~~~~~~ 351 (468) .+..++.++ +..+.++..+... -..+..+|++|.....|... ..+. + ..+. T Consensus 243 -~~~~~~~~~----------d~i~~~~~~l~~~---------~~~~a~~v~n~~~~~~l~~l---kd~~---G---~~i~ 293 (421) T protein:vir:13 243 -LAEETINDY----------AGLVKTINSLVPN---------ARKRAIIVTNSDGRAYLDGL---MDKQ---G---RPLL 293 (421) T ss_pred -cccccccch----------HHHHHHHHHhhhh---------hcCCCEEEEcHHHHHHHHHh---hcCC---C---ceee Confidence 122333322 2344454444321 13445778899988888752 2110 0 0111 Q ss_pred cccccCceeEEEecCceEEEEcccccccCC----------cceEEEEEecCCcccceeEeeccchhhcccccCCccccce Q lcl|Aclame:pro 352 EVDDTGNLAVGTINGRIKVFVDPYAANLSD----------KHYYVIGYKGTSPYDAGLFYCPYVPLQMVRSIDPNTFQPK 421 (468) Q Consensus 352 ~~d~t~~~~~G~l~g~~~vy~D~Ya~~~~~----------~dY~~vG~KG~~~~d~glfyaPYv~l~~~~~~dp~s~qP~ 421 (468) . +.... --++|. |++|++..++..... .+|+.+|.++....+.+- + .+-..-+=. T Consensus 294 ~-~~~~~-~~~tl~-G~pV~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~v~~~~----~--------~~f~~~~~~ 358 (421) T protein:vir:13 294 K-ELSDG-GDLVFK-GRPVIELEESIFDVGDETKFIVSDFKTLIKFMDRKQYLIDQSK----E--------AGYTKNETI 358 (421) T ss_pred c-CcCCC-CCceec-ceeeEEeccccccCCCceEEEEEeccccEEEEEecceEEEeec----c--------cccccCeeE Confidence 1 10000 013554 445555533211000 112333333222221110 0 011122223 Q ss_pred eeeeeeeeeee-----------c---CcccccCccccccchhhhhhhcccceeeeeeeecC Q lcl|Aclame:pro 422 IGFKTRYGMVS-----------N---PFVTTNGLYNGTPDGEALTPNANMYYRRVQVTNLM 468 (468) Q Consensus 422 ~g~~tRY~l~~-----------n---P~~~~~~~~~~~~~~~~~~~~an~y~~r~~v~~l~ 468 (468) +-+..||+..+ . +|+...+....... .+=..++ +|-+|+.-= T Consensus 359 ~r~~~r~d~~~~~~~a~~~~~~~~~~a~v~~~~~~~~~~~-~~~~~~~----~~~~~~~~~ 414 (421) T protein:vir:13 359 ARIIERFDVNSPLDKSSDAEKIRKFGVIVKLQEVLKSSPR-SGKNKNE----SKEEIKEEG 414 (421) T ss_pred EEEEeeecceeecchhhheeeecccceeeccccccCCCCc-CCCCccc----cchheeecc Confidence 33444553322 1 12211111111100 0001111 222222211 No 60 >protein:vir:9309 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:165 # MgeName: phi 11 # Cross-refs: genbank:acc:NP_803287;genbank:gi:29028597;genbank:GeneID:1258044 Probab=81.86 E-value=0.082 Score=26.54 Aligned_cols=297 Identities=10% Similarity=0.034 Sum_probs=123.5 Q ss_pred HHhHHHHHHhhhhhh-hhhhhhhhcCcccccccccccccccccccccccceehhhhHHhhhhhhhhheeeeecCCcccee Q lcl|Aclame:pro 35 LLENQERFLREERGM-LNEVAVNSLGAGTIAPAGSALGSANTGGLAGFDPVLISLVRRAMPNLMAYDVCGVQPMSGPTGL 113 (468) Q Consensus 35 llenq~~~~~~~~~~-l~e~~~~~~~~~~~~~~~~i~~st~tg~i~~~~P~Lv~l~RRa~~~LI~~DI~GVQPmTGPTGL 113 (468) ..|+|+... +.+.+ .+....+++++++. .++++++..--....-.+++.+..+.+..+++.+-||++++-- T Consensus 1 ~~~~~~~~~-~~~~f~~~~~~~~~~~a~~~-------~~~~~~~~liP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~ 72 (324) T protein:vir:93 1 MEQTQKLKL-NLQHFASNNVKPQVFNPDNV-------MMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPMEGTEKK 72 (324) T ss_pred CchhHHHHH-HHHHHHHhhhhhhhcccccc-------cccCCCcceechhHHHHHHHHHHhhchhhhhcceeeccCCceE Confidence 222222111 11111 11111233333221 1111111111112222345555667788899999999887643 Q ss_pred eeeeeeeecCCCCcccccccCCccccccccccccccccccCccccCCCccccccccccccccccccccccchhhhhccCC Q lcl|Aclame:pro 114 IFAMRSRYENQAGEEALFNEPDTGFTGGYDASQGDYAVRTGAGVGGDSEGNNPALLNDAAPGTYEVGSKMPREDLERMGE 193 (468) Q Consensus 114 IFAMRsrY~~qsG~EA~fnEa~t~fSg~~~~~~~~~~~~~~~~~~~~~~gt~~~~~~~a~~~~~t~~~gm~Ta~aE~lG~ 193 (468) |.-.. . +.++ .| .+ | T Consensus 73 ip~~~----~--~~~a-------~~----------------------------------------------v~--E---- 87 (324) T protein:vir:93 73 FTFWA----D--KPGA-------YW----------------------------------------------VG--E---- 87 (324) T ss_pred EEEEe----c--Ccce-------ee----------------------------------------------ec--C---- Confidence 32110 0 0000 00 00 1 Q ss_pred CCcchhhcceEEEEEEEEeecccccccccHHHHHHHHHhcCCChhHHHHHHHHHHHHHHhhHHHHHHHhhhhhccccccc Q lcl|Aclame:pro 194 ANRLFREMSFSIEKTSVTAQSRALKAEYTLELAQDLKAIHGLDAEQELANILSSEVLAEINREVVRRVYTVAKKGAQNNV 273 (468) Q Consensus 194 ~~~~f~EMaFsIeK~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAE~ELanILStEImlEINREii~~l~~va~~~k~~~~ 273 (468) +..+++..-++++++++.|..+-....|-||.+|-. .|.+++|.+.|+..|...+++.+|.---.. . T Consensus 88 -g~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~ds~----~~l~~~i~~~l~~aia~~~d~a~l~G~g~~--------~ 154 (324) T protein:vir:93 88 -GQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTY----SQFFEEMKPMIAEAFYKKFDEAGILNQGNN--------P 154 (324) T ss_pred -CccccccccceeEEEEEeEEEEEeehhhHHHHhcch----HHHHHHHHHHHHHHHHHHHHHHHhcCCCCC--------C Confidence 123344444556666666666667789999999953 468889999999999999998887532110 0 Q ss_pred ccccccccccc----ccchhHHHHHHHHHHHHHHHHHHHHHhhcCCCccEEEEchhHHHHHHhhcccccccccccccccc Q lcl|Aclame:pro 274 ANAGIFDLDVD----SNGRWSVEKFKGLLFQVERDANAIAQETRRGKGNFLICSADVASALAMAGVLDYSSGLNGAGGPS 349 (468) Q Consensus 274 ~~~g~~Dl~~~----~~grw~~e~~k~L~~~i~~ean~i~~~T~r~~~n~~v~S~~Va~~L~~sG~~~~~~~~~~~~~~~ 349 (468) ...|+++.... ..+.-..+....++.++. ..-+....++|+|.....|... ..+ + +.. T Consensus 155 ~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~l~---------~~~~~~~~~v~n~~~~~~L~~l---~d~-----~-G~~ 216 (324) T protein:vir:93 155 FGKSIAQSIEKTNKVIKGDFTQDNIIDLEALLE---------DDELEANAFISKTQNRSLLRKI---VDP-----E-TKE 216 (324) T ss_pred cCccccccccccceeccccccHHHHHHHHHhhh---------hccCCCCEEEEcHHHHHHHHHh---hCC-----C-CCe Confidence 11111111000 001111222233322221 1224456799999999999863 111 1 111 Q ss_pred cccccccCceeEEEecCceEEEEcccccccCCcc--------eEEEEEecCCcccceeEeeccchhhcccccCCc----- Q lcl|Aclame:pro 350 IGEVDDTGNLAVGTINGRIKVFVDPYAANLSDKH--------YYVIGYKGTSPYDAGLFYCPYVPLQMVRSIDPN----- 416 (468) Q Consensus 350 ~~~~d~t~~~~~G~l~g~~~vy~D~Ya~~~~~~d--------Y~~vG~KG~~~~d~glfyaPYv~l~~~~~~dp~----- 416 (468) +.. +.. .++|. +++|++.+.+. .+.. ++++|..++...+- ..+..+. ...|+. T Consensus 217 ~~~-~~~----~~~l~-G~PVv~~~~~~--~~~~~i~~gdfs~~~~~~~~~~~i~~----~~~~~~~--~~~~~~~~~~~ 282 (324) T protein:vir:93 217 RIY-DRN----SDSLD-GLPVVNLKSSN--LKRGELITGDFDKLIYGIPQLIEYKI----DETAQLS--TVKNEDGTPVN 282 (324) T ss_pred eec-CCC----CCccc-ceeeEeecCCC--CCcceEEEEecceEEEEEecCcEEEE----eeccccc--ccccccccchh Confidence 111 111 23443 46777654321 1222 23333333322210 0000000 000111 Q ss_pred ---cccceeeeeeeeeeee-cC--cccccC---ccccccchh Q lcl|Aclame:pro 417 ---TFQPKIGFKTRYGMVS-NP--FVTTNG---LYNGTPDGE 449 (468) Q Consensus 417 ---s~qP~~g~~tRY~l~~-nP--~~~~~~---~~~~~~~~~ 449 (468) .-|=.+=...|||..+ +| |+.... ....+|..- T Consensus 283 ~f~~n~~~~r~~~r~d~~v~~~~a~~~l~~a~~~~~~~~~~~ 324 (324) T protein:vir:93 283 LFEQDMVALRATMHVALHIADDKAFAKLVPADKRTDSVPGEV 324 (324) T ss_pred hhhcCcEEEEEEEEeccEEecccceEEEecccccCCCCCCCC Confidence 1122333445666543 34 332211 111122111 No 61 >protein:vir:81070 Length: 390 # NCBI annotation: p09 # Family: family:all:585 # MgeID: mge:1889 # MgeName: Xop411 # Cross-refs: genbank:acc:YP_001285679;genbank:gi:148727187;genbank:GeneID:5247115 Probab=80.94 E-value=0.09 Score=26.31 Aligned_cols=323 Identities=11% Similarity=0.040 Sum_probs=115.1 Q ss_pred CcchHHHHHhhhhhh--------------------CCCc---cchhcchhh----hHHHHHHHhHHHHHHhhhhhhhhhh Q lcl|Aclame:pro 1 MFNAEHLQEKWSPVL--------------------NHGE---APAIGDRYK----RAVTSVLLENQERFLREERGMLNEV 53 (468) Q Consensus 1 ~~~~~~l~~kw~p~l--------------------~~~~---~~~i~~~~~----~~~~~~llenq~~~~~~~~~~l~e~ 53 (468) --.+++..++-.-+. +... .++-+...+ ..-...++.+..+........+... T Consensus 30 ~~~~~e~~~~~~~l~~e~~~l~~~i~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 109 (390) T protein:vir:81 30 GELNASARSKVDELFATVGNLSAEVQAARQRVAELEGNGAGGDVQHVSVGDMFVASEQFQASAGRWNDRSARATMNIKAA 109 (390) T ss_pred cCcCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccchhhhhhhHHHHHHHHHHhhhhhhhhhHHHHH Confidence 000011111111110 0000 000000000 0000001110000000000000000 Q ss_pred hhhhcCcccccccccccccccccccccccce-ehhhhHHhhhhhhhhheeeeecCCccceeeeeeeeeecCCCCcccccc Q lcl|Aclame:pro 54 AVNSLGAGTIAPAGSALGSANTGGLAGFDPV-LISLVRRAMPNLMAYDVCGVQPMSGPTGLIFAMRSRYENQAGEEALFN 132 (468) Q Consensus 54 ~~~~~~~~~~~~~~~i~~st~tg~i~~~~P~-Lv~l~RRa~~~LI~~DI~GVQPmTGPTGLIFAMRsrY~~qsG~EA~fn 132 (468) . ... ....+++++. -..|. .-.++++..+..+-.+++.+.||++++.-+.-.. +..+. T Consensus 110 ~-~~~----------~~~~~~~~g~-~~~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~----~~~~~----- 168 (390) T protein:vir:81 110 L-NTA----------STDAAGSAGA-LTTPNRLPGFITPPDARLTVRDLIGSGRTDSALIEYVQET----GFVNN----- 168 (390) T ss_pred H-Hhh----------ccccccCCcc-eechhhhHHHHHHHhhhhhhhhhcceeeccCCceEEEEEe----cCCcc----- Confidence 0 000 0001111111 11111 1223444445667788999999988764332111 10000 Q ss_pred cCCccccccccccccccccccCccccCCCccccccccccccccccccccccchhhhhccCCCCcchhhcceEEEEEEEEe Q lcl|Aclame:pro 133 EPDTGFTGGYDASQGDYAVRTGAGVGGDSEGNNPALLNDAAPGTYEVGSKMPREDLERMGEANRLFREMSFSIEKTSVTA 212 (468) Q Consensus 133 Ea~t~fSg~~~~~~~~~~~~~~~~~~~~~~gt~~~~~~~a~~~~~t~~~gm~Ta~aE~lG~~~~~f~EMaFsIeK~tVtA 212 (468) ..| .+ | +..+++-..++++++.+. T Consensus 169 ---a~~----------------------------------------------v~--E-----g~~~~~~~~~~~~i~~~~ 192 (390) T protein:vir:81 169 ---AAI----------------------------------------------VA--E-----GALKPESSLKFAKKTDTT 192 (390) T ss_pred ---eee----------------------------------------------ec--C-----CcccccccceeeEEEEee Confidence 000 00 1 112223333344444445 Q ss_pred ecccccccccHHHHHHHHHhcCCChhHHHHHHHHHHHHHHhhHHHHHHHhhhhhccc-cccccc-cccccccccccchhH Q lcl|Aclame:pro 213 QSRALKAEYTLELAQDLKAIHGLDAEQELANILSSEVLAEINREVVRRVYTVAKKGA-QNNVAN-AGIFDLDVDSNGRWS 290 (468) Q Consensus 213 KSRaLKAEYT~ELAQDLkAiHGLDAE~ELanILStEImlEINREii~~l~~va~~~k-~~~~~~-~g~~Dl~~~~~grw~ 290 (468) |.-+-...+|-||.+|- . +.++.|.+-|+..|...+|+-||.- ...++ ..|+.+ .+.........+-.. T Consensus 193 ~k~~~~~~is~ell~d~--~---~~~~~i~~~l~~~~~~~~d~a~l~G----~g~~~~~~Gi~~~~~~~~~~~~~~~~~~ 263 (390) T protein:vir:81 193 HVIAHTMKATRQILSDA--P---QLASYMNNRLIRGLKVKEDAEILRG----TGANDGLLGLIPQATTYAAPTTIAGATR 263 (390) T ss_pred eEEEEeehhhHHHHHhH--H---HHHHHHHHHHHHHHHHHHHHHHHhc----CCCCCcccceeecccccccccccccchh Confidence 54445567899999984 2 4788899989888888888877642 11111 112111 111111111112222 Q ss_pred HHHHHHHHHHHHHHHHHHHHhhcCCCccEEEEchhHHHHHHhhcccccccccccccccccccccccCceeEEEecCceEE Q lcl|Aclame:pro 291 VEKFKGLLFQVERDANAIAQETRRGKGNFLICSADVASALAMAGVLDYSSGLNGAGGPSIGEVDDTGNLAVGTINGRIKV 370 (468) Q Consensus 291 ~e~~k~L~~~i~~ean~i~~~T~r~~~n~~v~S~~Va~~L~~sG~~~~~~~~~~~~~~~~~~~d~t~~~~~G~l~g~~~v 370 (468) ++....+++++. ..-...+.+|++|.....|... ..+.+ ..+.. +.... -.++|. |++| T Consensus 264 ~~~~~~~~~~~~---------~~~~~~~~~v~~~~~~~~l~~l---kd~~G------~~l~~-~~~~~-~~~~l~-G~pv 322 (390) T protein:vir:81 264 VDQLRLAMLQAS---------LAEYNPSGIVINPIDWAAIELA---KDANN------QYLIG-NARGT-LTPTLW-GLPV 322 (390) T ss_pred HHHHHHHHHhhc---------cccCCCCEEEEcHHHHHHHHHh---hcCCC------ceeec-Ccccc-cCceec-ceee Confidence 333333333322 2234556789999999888752 21110 01111 11111 113553 6688 Q ss_pred EEcccccccCCcceEEEEEecCCcccceeEeeccchhhccccc--CC---ccccceeeeeeeeee-eecC--cccccCcc Q lcl|Aclame:pro 371 FVDPYAANLSDKHYYVIGYKGTSPYDAGLFYCPYVPLQMVRSI--DP---NTFQPKIGFKTRYGM-VSNP--FVTTNGLY 442 (468) Q Consensus 371 y~D~Ya~~~~~~dY~~vG~KG~~~~d~glfyaPYv~l~~~~~~--dp---~s~qP~~g~~tRY~l-~~nP--~~~~~~~~ 442 (468) ++..+. |.+-+++|--.. .++. +.-..+...+ .+ .+-+=.+=...|++. +.+| |+. T Consensus 323 ~~~~~~----p~~~~~~gd~~~-----~~~~--~~~~~~~v~~~~~~~~~~~~~v~~r~~~r~d~~v~~~~a~v~----- 386 (390) T protein:vir:81 323 VATQAM----APGEFLVGAFDL-----AAQI--FDQWDARVEIGYVGEDFQRNMITVLAEERLALVVYRPEALIS----- 386 (390) T ss_pred EEcCCC----CCCcEEEEehhc-----eEEE--EEecceEEEEecccchhhcCcEEEEEEEeeccEEecccceEE----- Confidence 877543 333344442110 0000 0000111000 01 111223334556665 3344 322 Q ss_pred ccccchhhhhhhcccceeeeeee Q lcl|Aclame:pro 443 NGTPDGEALTPNANMYYRRVQVT 465 (468) Q Consensus 443 ~~~~~~~~~~~~an~y~~r~~v~ 465 (468) +.++ T Consensus 387 -------------------~t~a 390 (390) T protein:vir:81 387 -------------------GSFA 390 (390) T ss_pred -------------------EEeC Confidence 1111 No 62 >protein:vir:2430 Length: 318 # NCBI annotation: major head subunit # Family: family:all:507 # MgeID: mge:52 # MgeName: D29 # Cross-refs: genbank:acc:NP_046832;genbank:gi:9630400;genbank:GeneID:1261582 Probab=80.32 E-value=0.096 Score=26.16 Aligned_cols=295 Identities=9% Similarity=0.006 Sum_probs=118.0 Q ss_pred HhhhhhhhhhhhhhhcCcccccccccccccccccccccccceehhhhHHhhhhhhhhheeeeecCCccceeeeeeeeeec Q lcl|Aclame:pro 43 LREERGMLNEVAVNSLGAGTIAPAGSALGSANTGGLAGFDPVLISLVRRAMPNLMAYDVCGVQPMSGPTGLIFAMRSRYE 122 (468) Q Consensus 43 ~~~~~~~l~e~~~~~~~~~~~~~~~~i~~st~tg~i~~~~P~Lv~l~RRa~~~LI~~DI~GVQPmTGPTGLIFAMRsrY~ 122 (468) ++...++-.|.... ...+++.+.-.--....-.+++...+..+..+++.+-||++++.-|. +.. T Consensus 1 ~~~~~~~~~e~~~~------------~~~~~~~~~~~ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~ip----~~~ 64 (318) T protein:vir:24 1 MAAGTAFAVDHAQI------------AQTGDTMFKGYLEPEQAKDYFAEAEKTSIVQQFAQKVPMGTTGQKIP----HWV 64 (318) T ss_pred CCCCCCCCHHHHHh------------hcccCcccceeechhHHHHHHHHHHhhchhhhhcceeeccCCceEEE----EEe Confidence 33222222221100 00111111111001111223344455667788899999987653321 110 Q ss_pred CCCCcccccccCCccccccccccccccccccCccccCCCccccccccccccccccccccccchhhhhccCCCCcchhhcc Q lcl|Aclame:pro 123 NQAGEEALFNEPDTGFTGGYDASQGDYAVRTGAGVGGDSEGNNPALLNDAAPGTYEVGSKMPREDLERMGEANRLFREMS 202 (468) Q Consensus 123 ~qsG~EA~fnEa~t~fSg~~~~~~~~~~~~~~~~~~~~~~gt~~~~~~~a~~~~~t~~~gm~Ta~aE~lG~~~~~f~EMa 202 (468) . +.+ ..| .+ | +.++++.. T Consensus 65 ~--~~~-------a~~----------------------------------------------v~--E-----g~~~~~~~ 82 (318) T protein:vir:24 65 G--DVS-------AQW----------------------------------------------IG--E-----GDMKPITK 82 (318) T ss_pred C--Ccc-------eEE----------------------------------------------ec--C-----Cccccccc Confidence 0 000 000 00 1 12344445 Q ss_pred eEEEEEEEEeecccccccccHHHHHHHHHhcCCChhHHHHHHHHHHHHHHhhHHHHHHHhhhhhccccccccc--ccccc Q lcl|Aclame:pro 203 FSIEKTSVTAQSRALKAEYTLELAQDLKAIHGLDAEQELANILSSEVLAEINREVVRRVYTVAKKGAQNNVAN--AGIFD 280 (468) Q Consensus 203 FsIeK~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAE~ELanILStEImlEINREii~~l~~va~~~k~~~~~~--~g~~D 280 (468) .++++++.+.|..+-...+|-||.+|-. .|.+++|.+.|+..|...|++.+|.---+ ++..++.. .++.- T Consensus 83 ~~f~~i~~~~~k~~~~~~iS~e~l~ds~----~~~~~~i~~~l~~~~~~~~d~a~l~G~g~----~~~~~~~~~~~~~~~ 154 (318) T protein:vir:24 83 GNMTSQTIAPHKIATIFVASAETVRANP----ANYLGTMRTKVATAFAMAFDGAAMHGTDS----PFPTYIGQTTKAISI 154 (318) T ss_pred cceeEEEEeeEEEEEeehhhHHHhhcCh----HHHHHHHHHHHHHHHHHHHHHhhhcccCC----CCCcccccccccccc Confidence 5566666666666667789999999844 57999999999999999999998743211 11111100 00000 Q ss_pred ccccccchhHHHHHHHHHHHHHHHHHHHHHhhcCCCccEEEEchhHHHHHHhhcccccccccccccccccccccccCce- Q lcl|Aclame:pro 281 LDVDSNGRWSVEKFKGLLFQVERDANAIAQETRRGKGNFLICSADVASALAMAGVLDYSSGLNGAGGPSIGEVDDTGNL- 359 (468) Q Consensus 281 l~~~~~grw~~e~~k~L~~~i~~ean~i~~~T~r~~~n~~v~S~~Va~~L~~sG~~~~~~~~~~~~~~~~~~~d~t~~~- 359 (468) .......-|..... .+.... ....-.....+||||.....|... ..+-+ ..+..-+.++.. T Consensus 155 ~~~~~~~~~~~~~~-------~~~~~~--~~~~~~~~~~~v~n~~~~~~L~~l---kd~~G------~~l~~~~~~~~~~ 216 (318) T protein:vir:24 155 ADTTGATTVYDQVA-------VNGLSL--LVNDGKKWTHTLLDDITEPILNGA---KDQNG------RPLFIESTYGEAA 216 (318) T ss_pred cccccccchHHHHH-------HHHHHh--hccccCCCCEEEEcHHHHHHHHHh---hccCC------ceeecCccccCcc Confidence 00000011110111 111111 122234556789999999999852 11100 000000111110 Q ss_pred ---eEEEecCceEEEEcccccccC------CcceEEEEEecCCcccceeEeeccchhhcccccCCcc-----c---ccee Q lcl|Aclame:pro 360 ---AVGTINGRIKVFVDPYAANLS------DKHYYVIGYKGTSPYDAGLFYCPYVPLQMVRSIDPNT-----F---QPKI 422 (468) Q Consensus 360 ---~~G~l~g~~~vy~D~Ya~~~~------~~dY~~vG~KG~~~~d~glfyaPYv~l~~~~~~dp~s-----~---qP~~ 422 (468) .-+.+. +++|++.+.+.... ++.++++|..++...+- -.+..+....|+.. | |=.+ T Consensus 217 ~~~~~~~i~-g~pv~~~~~~~~~~~~~~~gdfs~~~~~~~~~l~i~~------~~~~~~~~~~~~~~~~~~~f~~~~~~~ 289 (318) T protein:vir:24 217 SPFRSGRIV-ARPTILSDHVVEGTTVGFMGDFSQLIWGQIGGLSFDV------TDQATLNLGTVESPNFVSLWQHNLVAV 289 (318) T ss_pred ccccCceEE-EEeeEEeCCCCCCccEEEEeecceEEEEEecCeEEEE------eeccceeccccccccchhhhhcCcEEE Confidence 011221 35566654432110 11112233222221110 00001111112211 2 2333 Q ss_pred eeeeeeeee-ecC--cccccCccccccch Q lcl|Aclame:pro 423 GFKTRYGMV-SNP--FVTTNGLYNGTPDG 448 (468) Q Consensus 423 g~~tRY~l~-~nP--~~~~~~~~~~~~~~ 448 (468) =...|++.. .+| |+.......+.--+ T Consensus 290 r~~~r~d~~v~~~~a~~~i~~~~a~~~~~ 318 (318) T protein:vir:24 290 RVEAEYAFHCNDAEAFVALTNVVSGGGEG 318 (318) T ss_pred EEEEEEccEEecccceEEEEeeccCCCCC Confidence 345677765 444 43322211111111 No 63 >protein:vir:99920 Length: 311 # NCBI annotation: gp7 # Family: family:all:966 # MgeID: mge:1611 # MgeName: Halo # Cross-refs: genbank:acc:YP_655524;genbank:gi:109392294;genbank:GeneID:4157089 Probab=80.13 E-value=0.098 Score=26.12 Aligned_cols=284 Identities=12% Similarity=0.048 Sum_probs=112.6 Q ss_pred ccccccccccccccceehhhhHHhhhhhhhhheeeeecCCccceeeeeeeeeecCCCCcccccccCCccccccccccccc Q lcl|Aclame:pro 69 ALGSANTGGLAGFDPVLISLVRRAMPNLMAYDVCGVQPMSGPTGLIFAMRSRYENQAGEEALFNEPDTGFTGGYDASQGD 148 (468) Q Consensus 69 i~~st~tg~i~~~~P~Lv~l~RRa~~~LI~~DI~GVQPmTGPTGLIFAMRsrY~~qsG~EA~fnEa~t~fSg~~~~~~~~ 148 (468) .+..++++...--....-.+++++.+..+..+++.+-||+....-| -.. .+ +.++ .| T Consensus 1 Mat~tt~~g~~vP~~~~~~ii~~~~~~s~l~~~~~~i~~~~~~~~~-p~~---~~--~~~a-------~w---------- 57 (311) T protein:vir:99 1 MATFGTGNLKNLPRNIADGMVKDVVQGSTVAVLSARKPQRFGNEDI-ITF---NG--RPKA-------EF---------- 57 (311) T ss_pred CceecCCCceeccHHHHHHHHHHHHhhchhhhhcceeeccCCceEE-EEE---eC--Ccee-------EE---------- Confidence 2222333322211112233555566667778888888887533211 110 00 0000 00 Q ss_pred cccccCccccCCCccccccccccccccccccccccchhhhhccCCCCcchhhcceEEEEEEEEeecccccccccHHHHHH Q lcl|Aclame:pro 149 YAVRTGAGVGGDSEGNNPALLNDAAPGTYEVGSKMPREDLERMGEANRLFREMSFSIEKTSVTAQSRALKAEYTLELAQD 228 (468) Q Consensus 149 ~~~~~~~~~~~~~~gt~~~~~~~a~~~~~t~~~gm~Ta~aE~lG~~~~~f~EMaFsIeK~tVtAKSRaLKAEYT~ELAQD 228 (468) .+ | +..+++...+++.++..+|.-+-....|-||.|+ T Consensus 58 ------------------------------------v~--E-----g~~~~~~~~~f~~v~l~~~k~~~~~~iS~ell~~ 94 (311) T protein:vir:99 58 ------------------------------------VG--E-----GQQKSSTTGEFDFVTSTPKKAQVTMRFNEEVQWA 94 (311) T ss_pred ------------------------------------ee--c-----CcccccccceeeEEEEeeEEEEEeehhhHHHhhc Confidence 00 1 1234444445566666666666678899999763 Q ss_pred HHHhcCCChhHHHHHHHHHHHHHHhhHHHHHHHhhhhhcccc-cc---ccccccccccccccchhHHHHHHHHHHHHHHH Q lcl|Aclame:pro 229 LKAIHGLDAEQELANILSSEVLAEINREVVRRVYTVAKKGAQ-NN---VANAGIFDLDVDSNGRWSVEKFKGLLFQVERD 304 (468) Q Consensus 229 LkAiHGLDAE~ELanILStEImlEINREii~~l~~va~~~k~-~~---~~~~g~~Dl~~~~~grw~~e~~k~L~~~i~~e 304 (468) -.- -..|-+++|.+-|...|+..|++-+|.-.-. ..++. .+ ....+........++....+.....++..... T Consensus 95 ~~d-~~~~l~~~i~~~la~ai~~~~d~~~l~G~g~--~~g~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~ 171 (311) T protein:vir:99 95 DED-YQLGVLQTLSEAGAEALARALDLGLYHRINP--LTGTVIPGWSNYLGAASKRVELTADTIANPDLAIEAAVGLLVA 171 (311) T ss_pred ccc-cHHHHHHHHHHHHHHHHHHHHHHHhhcccCc--ccCccccccccccccccceeeccccccchhHHHHHHHHHHHhh Confidence 221 1355677888888888888888777754321 01110 00 00000100111111111111111111222222 Q ss_pred HHHHHHhhcCCCccEEEEchhHHHHHHhhcccccccccccccccccccccccCceeEEEecCceEEEEccccc------- Q lcl|Aclame:pro 305 ANAIAQETRRGKGNFLICSADVASALAMAGVLDYSSGLNGAGGPSIGEVDDTGNLAVGTINGRIKVFVDPYAA------- 377 (468) Q Consensus 305 an~i~~~T~r~~~n~~v~S~~Va~~L~~sG~~~~~~~~~~~~~~~~~~~d~t~~~~~G~l~g~~~vy~D~Ya~------- 377 (468) + -.++..+-.|++|+....|... ..+ + +..+.+-+.++.. .|+|. +++|++..+.. T Consensus 172 ~------~~~~~~~~~vmn~~~~~~L~~l---kd~-----~-G~~l~~~~~~~~~-~~~l~-G~Pv~~s~~i~~~~~~~~ 234 (311) T protein:vir:99 172 N------GHPTPVNGLALHPSIAWGLSTA---RYT-----D-GRKKFPELGLGIG-VSSFE-GIDASVSDTVNGGDEADP 234 (311) T ss_pred h------ccCCCccEEEEcHHHHHHHHhh---hcc-----C-CCeeecCcccCCC-Cceec-ceeeEeeccccccccccc Confidence 2 2245566689999999999752 211 0 1111111111111 24553 55677653210 Q ss_pred -----ccCCcceEEEEEecCCcccceeEeeccchhhcccc--cCCcccc-----ceeee--eeeeeeee-cC-cccccCc Q lcl|Aclame:pro 378 -----NLSDKHYYVIGYKGTSPYDAGLFYCPYVPLQMVRS--IDPNTFQ-----PKIGF--KTRYGMVS-NP-FVTTNGL 441 (468) Q Consensus 378 -----~~~~~dY~~vG~KG~~~~d~glfyaPYv~l~~~~~--~dp~s~q-----P~~g~--~tRY~l~~-nP-~~~~~~~ 441 (468) ...+.+++++|=- ..++.|.-.....+... -|++... --++| ..|||..+ || |+.... T Consensus 235 ~~~~~~~~~~~~~~~Gdf-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~r~~~r~d~~v~~~~~v~~~~- 308 (311) T protein:vir:99 235 DDEDLDAARAVRGIVGDF-----ANGIHWGVQRDIPVELIKYGDPDGQGDLKRHNQIALRLEIVYGWYVFTDRFVVIEN- 308 (311) T ss_pred ccchhhccCcceEEEeec-----cccEEEEEecCceEEEeecCCCCcchhhhhcCcEEEEEEEeecceecChhHeeeec- Confidence 0112223333210 01122222111111111 1223211 11233 56777544 33 332111 Q ss_pred cccccchhh Q lcl|Aclame:pro 442 YNGTPDGEA 450 (468) Q Consensus 442 ~~~~~~~~~ 450 (468) . .+ T Consensus 309 --~----~A 311 (311) T protein:vir:99 309 --A----VA 311 (311) T ss_pred --c----cC Confidence 0 01 No 64 >protein:vir:2504 Length: 305 # NCBI annotation: major capsid subunit gp9 # Family: family:all:507 # MgeID: mge:53 # MgeName: TM4 # Cross-refs: genbank:acc:NP_569745;genbank:gi:18496895;genbank:GeneID:932268 Probab=78.25 E-value=0.12 Score=25.71 Aligned_cols=282 Identities=12% Similarity=0.077 Sum_probs=116.4 Q ss_pred cccccccccccccccee-hhhhHHhhhhhhhhheeeeecCCccceeeeeeeeeecCCCCcccccccCCcccccccccccc Q lcl|Aclame:pro 69 ALGSANTGGLAGFDPVL-ISLVRRAMPNLMAYDVCGVQPMSGPTGLIFAMRSRYENQAGEEALFNEPDTGFTGGYDASQG 147 (468) Q Consensus 69 i~~st~tg~i~~~~P~L-v~l~RRa~~~LI~~DI~GVQPmTGPTGLIFAMRsrY~~qsG~EA~fnEa~t~fSg~~~~~~~ 147 (468) .+..+++..=.-.-+.+ -.+++++.+..+-.+++-+.||++++--|--.. .+.++ .|-+ T Consensus 1 ma~~t~~~gg~liP~~~~~~Ii~~~~~~s~l~~l~~~~~~~~~~~~~p~~~------~~~~a-------~wv~------- 60 (305) T protein:vir:25 1 MADISRAEVASLIQEAYSDTLLAAAKQGSTVLSAFQNVNMGTKTTHLPVLA------TLPEA-------DWVG------- 60 (305) T ss_pred CCCccCCccceecCHHHHHHHHHHHHhhchhhhhcceeeccCCcEEEEEEe------CCcce-------EEee------- Confidence 22222222111122222 334556666777788899999987753221111 01011 0100 Q ss_pred ccccccCccccCCCccccccccccccccccccccccchhhhhccCCCCcchhhcceEEEEEEEEeecccccccccHHHHH Q lcl|Aclame:pro 148 DYAVRTGAGVGGDSEGNNPALLNDAAPGTYEVGSKMPREDLERMGEANRLFREMSFSIEKTSVTAQSRALKAEYTLELAQ 227 (468) Q Consensus 148 ~~~~~~~~~~~~~~~gt~~~~~~~a~~~~~t~~~gm~Ta~aE~lG~~~~~f~EMaFsIeK~tVtAKSRaLKAEYT~ELAQ 227 (468) +++... ...++.-..++++++..++..+-...+|-||.+ T Consensus 61 ---------------------------------------E~~~~~--~~~~~~s~~~f~~i~~~~~k~~~~~~is~ell~ 99 (305) T protein:vir:25 61 ---------------------------------------ESATDP--KGVKPTSKVTWANRTLVAEEIAVIIPVHENVID 99 (305) T ss_pred ---------------------------------------cccccc--cccccccccceeeEEeeeEEEEEeehhhHHHHh Confidence 000000 011222233444455555555556779999999 Q ss_pred HHHHhcCCChhHHHHHHHHHHHHHHhhHHHHHHHhhhhhccccccccccccccc-----cccccchhHHHHHHHHHHHHH Q lcl|Aclame:pro 228 DLKAIHGLDAEQELANILSSEVLAEINREVVRRVYTVAKKGAQNNVANAGIFDL-----DVDSNGRWSVEKFKGLLFQVE 302 (468) Q Consensus 228 DLkAiHGLDAE~ELanILStEImlEINREii~~l~~va~~~k~~~~~~~g~~Dl-----~~~~~grw~~e~~k~L~~~i~ 302 (468) |-. .|.|++|.+-|+..|+..+++.+|.-- |+..+....++... ......- ..-.+-.++.-+. T Consensus 100 ds~----~~~~~~i~~~l~~~~a~~~d~a~~~G~------g~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~ 168 (305) T protein:vir:25 100 DAT----VAVLTEVAELGGQAIGKKLDQAVIFGT------DKPASWVSPALIPAAVTAGQAVEVVG-GVANESDIVGATN 168 (305) T ss_pred cch----HHHHHHHHHHHHHHHHHHHhhhheecc------CCCCCccccccccccccccccccccc-cchhhhHHHHHHH Confidence 843 578999999999999999999988421 11111111110000 0000000 0001111212222 Q ss_pred HHHHHHHHhhcCCCccEEEEchhHHHHHHhhcccccccccccccccccccccccCcee--EEEecCceEEEEcccccccC Q lcl|Aclame:pro 303 RDANAIAQETRRGKGNFLICSADVASALAMAGVLDYSSGLNGAGGPSIGEVDDTGNLA--VGTINGRIKVFVDPYAANLS 380 (468) Q Consensus 303 ~ean~i~~~T~r~~~n~~v~S~~Va~~L~~sG~~~~~~~~~~~~~~~~~~~d~t~~~~--~G~l~g~~~vy~D~Ya~~~~ 380 (468) .....+ .+. .+..|=++++|.-...|... . |.+|... -++| .+++|+|..+..... T Consensus 169 ~~~~~~-~~~-~~~~~~~v~~~~~~~~l~~l---k----------------d~~G~~i~~~~~l-~G~Pv~~~~~~~~~~ 226 (305) T protein:vir:25 169 RAAKAV-ASA-GWAPDTLLSSLALRYEVANI---R----------------DANGNPVFRDDSF-AGFRTFFNRNGAWDA 226 (305) T ss_pred HHHHhh-hhc-ccccceeEecHHHHHHHHHh---h----------------ccCCceeecCCcc-cccceEEcCccCCCC Confidence 222222 111 24445578899988888642 1 1111111 1345 346777765432111 Q ss_pred C--------cceEEEEEecCCcccceeEeeccchhhcccccCCcc-cc-ceee--eeeeeee-eecCcc--cccCccccc Q lcl|Aclame:pro 381 D--------KHYYVIGYKGTSPYDAGLFYCPYVPLQMVRSIDPNT-FQ-PKIG--FKTRYGM-VSNPFV--TTNGLYNGT 445 (468) Q Consensus 381 ~--------~dY~~vG~KG~~~~d~glfyaPYv~l~~~~~~dp~s-~q-P~~g--~~tRY~l-~~nP~~--~~~~~~~~~ 445 (468) . +..+++|..++.+.+- ..+. .+...-.|.+ || ..++ ...|||+ +.||-+ ......-+. T Consensus 227 ~~~~~~~gd~s~~~i~~~~~~~i~~----~~~~--~~~~~~~~~~~~~~~~~~~R~~~r~~~~v~~p~a~v~~~~~~~~~ 300 (305) T protein:vir:25 227 DAAIEVIADSSRVKIGVRQDITVKF----LDQA--TLGTGENQINLAERDMVALRLKARFAYVLGVSATAQGANKTPVAV 300 (305) T ss_pred CccEEEEEecceEEEEEecCeEEEE----eeee--eeecCCceeeeeecCcEEEEEEEeecceeeCcccEEEEccccccc Confidence 1 1122233332222111 1110 0011111111 22 1223 3568995 568843 222222221 Q ss_pred cchhhhhhhc Q lcl|Aclame:pro 446 PDGEALTPNA 455 (468) Q Consensus 446 ~~~~~~~~~a 455 (468) .. ..| T Consensus 301 ~~-----pa~ 305 (305) T protein:vir:25 301 VA-----PAA 305 (305) T ss_pred cC-----CCC Confidence 11 111 No 65 >protein:vir:105038 Length: 428 # NCBI annotation: major capsid head protein precursor # Family: family:all:21 # MgeID: mge:1465 # MgeName: phiKO2 # Cross-refs: genbank:acc:YP_006586;genbank:gi:46402092;genbank:GeneID:2777903 Probab=74.07 E-value=0.16 Score=24.91 Aligned_cols=340 Identities=15% Similarity=0.135 Sum_probs=115.4 Q ss_pred CcchHHHHHhhhhhhC-CCccc-hhcchhh------------------hHHHHHHHhHHH---H------HHhhhhhhhh Q lcl|Aclame:pro 1 MFNAEHLQEKWSPVLN-HGEAP-AIGDRYK------------------RAVTSVLLENQE---R------FLREERGMLN 51 (468) Q Consensus 1 ~~~~~~l~~kw~p~l~-~~~~~-~i~~~~~------------------~~~~~~llenq~---~------~~~~~~~~l~ 51 (468) +++.|+. ++...+.. -+.+- +|...-+ +....+--+++. . .+.+.++.+. T Consensus 30 ~lt~ee~-~~~~~l~~e~~~l~~~i~~~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 108 (428) T protein:vir:10 30 TLTAEQL-TEFAGLQQQFTDISAKMDRMEATERAAALVAKPVKATQHGPAVIVKAEPKQYTGAGMTRMVMSIAAAQGNLQ 108 (428) T ss_pred CCCHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhchhhccccccccccchhhhHHHHHHHHHHHHhhhhHH Confidence 3333321 11111110 00000 0100000 000000000000 0 0000010011 Q ss_pred hhh---hhhcCcccccccccccccccccccc---cccceehhhhHHhhhhhhhhheeeeecCCccceeeeeeeeeecCCC Q lcl|Aclame:pro 52 EVA---VNSLGAGTIAPAGSALGSANTGGLA---GFDPVLISLVRRAMPNLMAYDVCGVQPMSGPTGLIFAMRSRYENQA 125 (468) Q Consensus 52 e~~---~~~~~~~~~~~~~~i~~st~tg~i~---~~~P~Lv~l~RRa~~~LI~~DI~GVQPmTGPTGLIFAMRsrY~~qs 125 (468) .+. ....+ .......+..++++|++. ...+.++.+.| +..+..++ |+...++++|-+-=.| ..+ T Consensus 109 ~~~~~~~~~~~--~~~~~~~~~~~~~~gg~liP~~~~~~ii~~l~---~~~~l~~~-~~~~~~~~~g~~~~p~--~~~-- 178 (428) T protein:vir:10 109 DAAKFASDELN--DQSVSMAISTAAGSGGVLIPQNIHSEVIELLR---DRTIVRKL-GARSIPLPNGNMSLPR--LAG-- 178 (428) T ss_pred HHHHHhhhhhh--hhhHhhhhcccccCCccccchhHHHHHHHHHh---hhchhhhh-cceeeecCCcceEEEE--EeC-- Confidence 000 00000 000111112222233221 11223333333 44444555 3333333344321111 000 Q ss_pred CcccccccCCccccccccccccccccccCccccCCCccccccccccccccccccccccchhhhhccCCCCcchhhcceEE Q lcl|Aclame:pro 126 GEEALFNEPDTGFTGGYDASQGDYAVRTGAGVGGDSEGNNPALLNDAAPGTYEVGSKMPREDLERMGEANRLFREMSFSI 205 (468) Q Consensus 126 G~EA~fnEa~t~fSg~~~~~~~~~~~~~~~~~~~~~~gt~~~~~~~a~~~~~t~~~gm~Ta~aE~lG~~~~~f~EMaFsI 205 (468) +..+ .| .+ | +..+++...++ T Consensus 179 ~~~a-------~~----------------------------------------------v~--E-----g~~~~~~~~~f 198 (428) T protein:vir:10 179 GATA-------SY----------------------------------------------TG--E-----NQDAKVSEARF 198 (428) T ss_pred Ccce-------ee----------------------------------------------ec--c-----Cccccccccce Confidence 0000 00 00 1 22344555556 Q ss_pred EEEEEEeecccccccccHHHHHHHHHhcCCChhHHHHHHHHHHHHHHhhHHHHHHHhhhhhcc-cccccc-----ccccc Q lcl|Aclame:pro 206 EKTSVTAQSRALKAEYTLELAQDLKAIHGLDAEQELANILSSEVLAEINREVVRRVYTVAKKG-AQNNVA-----NAGIF 279 (468) Q Consensus 206 eK~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAE~ELanILStEImlEINREii~~l~~va~~~-k~~~~~-----~~g~~ 279 (468) ++++...|.-+-...+|-||.+|- ..|.++.|.+.|...|...+|+.||.- ...+ +..|+- ..+++ T Consensus 199 ~~i~~~~~k~~~~v~is~ell~ds----~~~l~~~i~~~l~~ai~~~~d~~~l~G----~G~~~~p~Gi~~~~~~~~~~~ 270 (428) T protein:vir:10 199 DDVKLTAKTMIAMVPISNALIGRA----GFNVEQLVLQDILTAISVREDKAFMRD----DGTGDTPIGMKARATQWNRLL 270 (428) T ss_pred eeEEeeeEEEEEeehhhHHHHhhh----hHHHHHHHHHHHHHHHHHHHHHHHhcc----CCCCccccccccccccccccc Confidence 666666666666789999999884 246788888888888888888887642 1110 111110 01111 Q ss_pred cccccccchhHHHHHHHHHHHHHHHHHHHHHhhcCCCccEEEEchhHHHHHHhhcccccccccccccccccccccccCce Q lcl|Aclame:pro 280 DLDVDSNGRWSVEKFKGLLFQVERDANAIAQETRRGKGNFLICSADVASALAMAGVLDYSSGLNGAGGPSIGEVDDTGNL 359 (468) Q Consensus 280 Dl~~~~~grw~~e~~k~L~~~i~~ean~i~~~T~r~~~n~~v~S~~Va~~L~~sG~~~~~~~~~~~~~~~~~~~d~t~~~ 359 (468) ........ .......+ ......+......- ......|+++.....|... ..+ + |..++..+. T Consensus 271 ~~~~~~~~--~~~~~~~~-~~~~~~~~~~~~~~--~~~~~~v~n~~~~~~L~~l---kd~---~---G~~i~~~~~---- 332 (428) T protein:vir:10 271 PWAADAAV--NLDTIDTY-LDSIILMSMDGNSN--MISSGWGMSNRTYMKLFGL---RDG---N---GNKVYPEMA---- 332 (428) T ss_pred cccccccc--cHHHHHHH-HHHHHHhhhccccc--cccCEEEEcHHHHHHHHHh---hcc---C---CceeccCCC---- Confidence 11111110 01111111 22222222222211 2234567799888888752 211 1 111111111 Q ss_pred eEEEecCceEEEEcccccccC------------CcceEEEEEecCCcccceeEeeccchhhcccccCCccc---cceeee Q lcl|Aclame:pro 360 AVGTINGRIKVFVDPYAANLS------------DKHYYVIGYKGTSPYDAGLFYCPYVPLQMVRSIDPNTF---QPKIGF 424 (468) Q Consensus 360 ~~G~l~g~~~vy~D~Ya~~~~------------~~dY~~vG~KG~~~~d~glfyaPYv~l~~~~~~dp~s~---qP~~g~ 424 (468) -|+| .|++||++.+...+. ++.++++|..++-..+ .+||..........-..| +=.+=. T Consensus 333 -~g~l-~G~pv~~~~~~p~~~~~~~~~~~i~~gd~s~~~i~~~~~i~i~----~~~~~~~~~~~~~~~~~f~~~~~~~R~ 406 (428) T protein:vir:10 333 -QGML-KGYPIQRTSAIPANLGEGGKESEIYFADFNDVVIGEDGNMKVD----FSKEASYIDTDGKLVSAFSRNQSLIRV 406 (428) T ss_pred -CCee-eceeeEEeccccccccCCCccceEEEEecceEEEEEecceEEE----eecccccccccccccchhhcchhheee Confidence 1455 367787764432110 1122334444333322 122221110000000011 112224 Q ss_pred eeeeeeeec-C--cccccCccccccchhhh Q lcl|Aclame:pro 425 KTRYGMVSN-P--FVTTNGLYNGTPDGEAL 451 (468) Q Consensus 425 ~tRY~l~~n-P--~~~~~~~~~~~~~~~~~ 451 (468) ..|+++.+. | |+. .++.+| T Consensus 407 ~~r~d~~v~~p~a~~~--------~t~~~~ 428 (428) T protein:vir:10 407 VTEHDIGFRHPEGLVL--------GTGVLF 428 (428) T ss_pred eeeeCceeeccceEEE--------EeccCC Confidence 566666553 4 322 233344 No 66 >protein:vir:1025 Length: 408 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:20 # MgeName: bIL286 # Cross-refs: genbank:acc:NP_076679;genbank:gi:13095788;genbank:GeneID:920362 Probab=72.62 E-value=0.18 Score=24.66 Aligned_cols=332 Identities=13% Similarity=0.090 Sum_probs=125.1 Q ss_pred CcchHHHHHhhhhhhCC--------------Cc--cchhcchhhhHHHHHHHhHH---HHHHhhhh-----------hhh Q lcl|Aclame:pro 1 MFNAEHLQEKWSPVLNH--------------GE--APAIGDRYKRAVTSVLLENQ---ERFLREER-----------GML 50 (468) Q Consensus 1 ~~~~~~l~~kw~p~l~~--------------~~--~~~i~~~~~~~~~~~llenq---~~~~~~~~-----------~~l 50 (468) ||+.++|.++|.-+.+. +. ..+|.. .+..+ ..+.+.+ ++.+.+.. +.. T Consensus 4 ~m~l~el~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ee~~~-~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 81 (408) T protein:vir:10 4 KLTVNQLNEAWIASGDKVTDFNDQINMALNDDNFSAEAMSE-LKNKR-DNEKVRRDALREQLVEAQAEQVVNMREEEKGP 81 (408) T ss_pred cccHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccHHHHHH-HHHHH-HHHHHHHHHHHHHHHHHHHHHHhccccccccc Confidence 88999999999665442 00 111110 01110 0111110 00010000 000 Q ss_pred hh--------hhh----hhcCccccccc----cccccccc-ccccc---cccceehhhhHHhhhhhhhhheeeeecCCcc Q lcl|Aclame:pro 51 NE--------VAV----NSLGAGTIAPA----GSALGSAN-TGGLA---GFDPVLISLVRRAMPNLMAYDVCGVQPMSGP 110 (468) Q Consensus 51 ~e--------~~~----~~~~~~~~~~~----~~i~~st~-tg~i~---~~~P~Lv~l~RRa~~~LI~~DI~GVQPmTGP 110 (468) .+ ... +-...+..... ..+..++. .|+.. ...+. +++.........+++.+.||+++ T Consensus 82 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~t~~~gg~~vP~~~~~~---Ii~~~~~~~~l~~~~~~~~~~~~ 158 (408) T protein:vir:10 82 LNKSENELKDKFVKDFVNMVRNPMAFMNTVSSKTETSGSDSAAGLTIPQDIRTM---INTLVRQYDSLQQYVRVESVSTS 158 (408) T ss_pred cccchhhhHHHHHHHHHHHhhcchhhhhhhhhhhhhcccccCCceeccHhHHHH---HHHHHHhhchhhhhcceeeccCC Confidence 00 000 00001110000 00111111 11111 11222 44445556667889999999998 Q ss_pred ceeeeeeeeeecCCCCcccccccCCccccccccccccccccccCccccCCCccccccccccccccccccccccchhhhhc Q lcl|Aclame:pro 111 TGLIFAMRSRYENQAGEEALFNEPDTGFTGGYDASQGDYAVRTGAGVGGDSEGNNPALLNDAAPGTYEVGSKMPREDLER 190 (468) Q Consensus 111 TGLIFAMRsrY~~qsG~EA~fnEa~t~fSg~~~~~~~~~~~~~~~~~~~~~~gt~~~~~~~a~~~~~t~~~gm~Ta~aE~ 190 (468) .|-+--.+ ..+.++ ...| .++++. T Consensus 159 ~~~~~~~~--~~~~~~--------~a~~----------------------------------------------v~E~~~ 182 (408) T protein:vir:10 159 NGSRVYEK--WTDVTP--------LTVM----------------------------------------------DAEDGK 182 (408) T ss_pred cceEEEee--cccccc--------ceee----------------------------------------------ecCccc Confidence 88765443 000000 0000 000011 Q ss_pred cCCCC-cchhhcceEEEEEEEEeecccccccccHHHHHHHHHhcCCChhHHHHHHHHHHHHHHhhHHHHHHHhhhhhccc Q lcl|Aclame:pro 191 MGEAN-RLFREMSFSIEKTSVTAQSRALKAEYTLELAQDLKAIHGLDAEQELANILSSEVLAEINREVVRRVYTVAKKGA 269 (468) Q Consensus 191 lG~~~-~~f~EMaFsIeK~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAE~ELanILStEImlEINREii~~l~~va~~~k 269 (468) ..+.+ ..|.++.|+..|..+ ...+|-||.+|- .+|.+++|.+-|+..|..-+|+.||.-.-+.. T Consensus 183 ~~~~~~~~~~~i~~~~~k~~~-------~~~iS~ell~ds----~~~l~~~i~~~l~~~~~~~~~~~il~g~g~~~---- 247 (408) T protein:vir:10 183 IPDLDNPQLTIIKYLIKRYAG-------IITATNTSLKDT----AENILAWLSSWIAKKVVVTRNQAIIEVMKAAP---- 247 (408) T ss_pred cccccCcceeeEEeeeeeEEe-------eehhHHHHHhhc----hHHHHHHHHHHHHHHHHHHHHHHHhhcccccc---- Confidence 11111 235566666655554 456999999994 45778899999999998888888775432211 Q ss_pred cccccccccccccccccchhHHHHHHHHHHHHHHHHHHHHHhhcCCCccEEEEchhHHHHHHhhcccccccccccccccc Q lcl|Aclame:pro 270 QNNVANAGIFDLDVDSNGRWSVEKFKGLLFQVERDANAIAQETRRGKGNFLICSADVASALAMAGVLDYSSGLNGAGGPS 349 (468) Q Consensus 270 ~~~~~~~g~~Dl~~~~~grw~~e~~k~L~~~i~~ean~i~~~T~r~~~n~~v~S~~Va~~L~~sG~~~~~~~~~~~~~~~ 349 (468) ...++..++ ....+++..... --+..+ .+|||+.....|... ..+. |.. T Consensus 248 ----~~~~~~~~~----------~l~~~~~~~~~~-------~~~~~a-~~v~n~~~~~~l~~l---kd~~------G~~ 296 (408) T protein:vir:10 248 ----KKPTIAKFD----------DVITMINTAVDP-------AIIATS-SLLTNQSGLNKLALV---KTAE------GKY 296 (408) T ss_pred ----cccccccHH----------HHHHHHHHhhhh-------hhccCC-EEEEcHHHHHHHHHh---hccC------Cce Confidence 112222221 122222111111 112222 478999999988763 2111 111 Q ss_pred cccccccCceeEEEecCceEEEE--cccccccCC----------cceEEEEEecCCcccceeEeeccchhhcccccCCcc Q lcl|Aclame:pro 350 IGEVDDTGNLAVGTINGRIKVFV--DPYAANLSD----------KHYYVIGYKGTSPYDAGLFYCPYVPLQMVRSIDPNT 417 (468) Q Consensus 350 ~~~~d~t~~~~~G~l~g~~~vy~--D~Ya~~~~~----------~dY~~vG~KG~~~~d~glfyaPYv~l~~~~~~dp~s 417 (468) +.+.+-+.. ..++|. |++|++ |...-+... .+|++++-++.... =+.++.- .+-.+ T Consensus 297 i~~~~~~~~-~~~~l~-G~PV~~~~~~~~~~~~~~~~~i~~gd~~~~~~~~~~~~~~v----~~~~~~~------~~f~~ 364 (408) T protein:vir:10 297 LLEPDPTKP-NSYLIK-GKQVIVVADRWLPNTGSTVYPLYYGDMSQAITLFDRENMSL----LPTNIGA------GAFET 364 (408) T ss_pred EeccCcCCC-CCceec-ceeeEEecccccCccCCCceEEEEEehhccEEEEEecceEE----EEccccc------chhhc Confidence 111111111 113554 445444 211111011 01222222211111 1111100 00112 Q ss_pred ccceeeeeeeeeeee-cC--ccccc-----C--ccccccchhhh Q lcl|Aclame:pro 418 FQPKIGFKTRYGMVS-NP--FVTTN-----G--LYNGTPDGEAL 451 (468) Q Consensus 418 ~qP~~g~~tRY~l~~-nP--~~~~~-----~--~~~~~~~~~~~ 451 (468) .+=.+-+..||+..+ +| |.... . ...+.+..... T Consensus 365 ~~~~~r~~~r~d~~v~~~~a~~~~~~~~~~~~~~~~~~~~~~~~ 408 (408) T protein:vir:10 365 DTTKIRVIDRFDVKATDSEALVAGSFSAIADQVGNFKTTTSTAV 408 (408) T ss_pred CceEEEEEEeeccEEeccccEEEEEeeccccCCCCCCCCCcccC Confidence 233444555665532 33 11100 0 01112222221 No 67 >protein:vir:94494 Length: 274 # NCBI annotation: ORF015 # Family: family:all:522 # MgeID: mge:1508 # MgeName: 88 # Cross-refs: genbank:acc:YP_240676;genbank:gi:66396348;genbank:GeneID:5133758 Probab=71.94 E-value=0.19 Score=24.55 Aligned_cols=261 Identities=14% Similarity=0.059 Sum_probs=110.7 Q ss_pred eeeeec---CCCCcccccc---c---CCccccccccccccccccccCccccCCCccccccccccccccccccccccchhh Q lcl|Aclame:pro 117 MRSRYE---NQAGEEALFN---E---PDTGFTGGYDASQGDYAVRTGAGVGGDSEGNNPALLNDAAPGTYEVGSKMPRED 187 (468) Q Consensus 117 MRsrY~---~qsG~EA~fn---E---a~t~fSg~~~~~~~~~~~~~~~~~~~~~~gt~~~~~~~a~~~~~t~~~gm~Ta~ 187 (468) |=.... +.--.|-|-. + ..--|++-... .. .. .+ .+ +.+.++..--...+ T Consensus 1 ma~~~T~~~d~iiPev~~~~v~~~~~~~l~~~~~~~~-d~--------~l----~g-~~-------G~tv~iP~~~~~g~ 59 (274) T protein:vir:94 1 MPQGLTKTSDQIIPEVLAPMMQAQLEKKLRFASFAEV-DS--------TL----QG-QP-------GDTLTFPAFVYSGD 59 (274) T ss_pred CCccceehhheechHHHHHHHHHhhhhhhhhccccee-cc--------cc----cC-CC-------CCEEEEeeecCCCc Confidence 221110 0000111100 0 00001110000 00 00 00 00 00111110001122 Q ss_pred hhccCCC-CcchhhcceEEEEEEEEeecccccccccHHHHHHHHHhcCCChhHHHHHHHHHHHHHHhhHHHHHHHhhhhh Q lcl|Aclame:pro 188 LERMGEA-NRLFREMSFSIEKTSVTAQSRALKAEYTLELAQDLKAIHGLDAEQELANILSSEVLAEINREVVRRVYTVAK 266 (468) Q Consensus 188 aE~lG~~-~~~f~EMaFsIeK~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAE~ELanILStEImlEINREii~~l~~va~ 266 (468) +|.+.++ .-+..++. ..+.+++.+-|+-.=+++=| ..+.+ +-|.-.+..+-++..|+.+++.+++..|.+... T Consensus 60 a~~~~~g~~i~~~~lt--~~~~~~~i~~~~~~~~i~D~--~~~~~--~~dp~~~~~~~~a~a~a~~vd~~~~~~l~~a~~ 133 (274) T protein:vir:94 60 AQVVAEGEKIPTDILE--TKKREAKIRKIAKGTSITDE--ALLSG--YGDPQGEQVRQHGLAHANKVDNDVLEALMGAKL 133 (274) T ss_pred cccccCCCcccccccc--cceeEEEeeeecceecccHH--HHHhc--cchHHHHHHHHHHHHHHHHHHHHHHHHHhccCc Confidence 2322221 22334443 33344444555522223222 22223 468888999999999999999999988765433 Q ss_pred ccccccccccccccccccccchhHHHHHHHHHHHHHHHHHHHHHhhcCCCccEEEEchhHHHHHHhhccccccccccccc Q lcl|Aclame:pro 267 KGAQNNVANAGIFDLDVDSNGRWSVEKFKGLLFQVERDANAIAQETRRGKGNFLICSADVASALAMAGVLDYSSGLNGAG 346 (468) Q Consensus 267 ~~k~~~~~~~g~~Dl~~~~~grw~~e~~k~L~~~i~~ean~i~~~T~r~~~n~~v~S~~Va~~L~~sG~~~~~~~~~~~~ 346 (468) .. +...+ ..+.+-.++.++..+ -..+.+++|+|.|++.|.......|..... . T Consensus 134 ~~------~~~~~----------~~d~i~dA~~~l~d~---------~~~~~~ivv~p~~~~~L~k~~~~~f~~~s~--~ 186 (274) T protein:vir:94 134 TV------NADIT----------KLNGLQSAIDKFNDE---------DLEPMVLFVNPLDAGKLRGDASTNFTRATE--L 186 (274) T ss_pred cc------ccccc----------CHHHHHHHHHHhhcc---------CCCceEEEeCHHHHHHHHhhhhhhccccCc--c Confidence 21 11112 123333344444322 236789999999999998754444433221 1 Q ss_pred ccccccccccCceeEEEecCceEEEEcccccccCCcceEEEEEecCCcccceeEeeccchhhcccccCCccccceeeeee Q lcl|Aclame:pro 347 GPSIGEVDDTGNLAVGTINGRIKVFVDPYAANLSDKHYYVIGYKGTSPYDAGLFYCPYVPLQMVRSIDPNTFQPKIGFKT 426 (468) Q Consensus 347 ~~~~~~~d~t~~~~~G~l~g~~~vy~D~Ya~~~~~~dY~~vG~KG~~~~d~glfyaPYv~l~~~~~~dp~s~qP~~g~~t 426 (468) +.. .-.+| .+|++. |++||+| ++-|. |-.+-++ -+.+-|.--.+...-.--||..+.-.+-..- T Consensus 187 g~~---~~~~G--~ig~~~-G~~Vi~s----~~~p~-~t~~l~~-----~gA~~~~~~~~~~vE~~Rd~~~~~d~i~~~~ 250 (274) T protein:vir:94 187 GDD---IIVKG--AFGEAL-GAIIVRT----NKLEA-GTAILAK-----KGAVKLILKRDFFLEVARDASTKTTALYSDK 250 (274) T ss_pred ccc---ceecc--ccceec-CeeEEEc----CCCCc-ceEEEEe-----CcceEeeecCCceeccccchhhcccEEEEEE Confidence 111 11122 257774 6899999 55663 3222222 1122221111112222248888888888888 Q ss_pred eeeeee-cC--cccccCccccccchhhhhhhc Q lcl|Aclame:pro 427 RYGMVS-NP--FVTTNGLYNGTPDGEALTPNA 455 (468) Q Consensus 427 RY~l~~-nP--~~~~~~~~~~~~~~~~~~~~a 455 (468) +||+.+ || ..... -..+ ..-. T Consensus 251 ~y~~~~~~~~~vv~~t-~~~~-------~~~~ 274 (274) T protein:vir:94 251 HYVAYLYDESKAVKIT-KGSG-------SLEM 274 (274) T ss_pred EEEEEEEcCCceEEEe-cCcc-------cccC Confidence 998864 44 11100 0001 0000 No 68 >protein:vir:97433 Length: 274 # NCBI annotation: ORF014 # Family: family:all:522 # MgeID: mge:1676 # MgeName: 92 # Cross-refs: genbank:acc:YP_240749;genbank:gi:66396420;genbank:GeneID:5133789 Probab=71.94 E-value=0.19 Score=24.55 Aligned_cols=261 Identities=14% Similarity=0.059 Sum_probs=110.7 Q ss_pred eeeeec---CCCCcccccc---c---CCccccccccccccccccccCccccCCCccccccccccccccccccccccchhh Q lcl|Aclame:pro 117 MRSRYE---NQAGEEALFN---E---PDTGFTGGYDASQGDYAVRTGAGVGGDSEGNNPALLNDAAPGTYEVGSKMPRED 187 (468) Q Consensus 117 MRsrY~---~qsG~EA~fn---E---a~t~fSg~~~~~~~~~~~~~~~~~~~~~~gt~~~~~~~a~~~~~t~~~gm~Ta~ 187 (468) |=.... +.--.|-|-. + ..--|++-... .. .. .+ .+ +.+.++..--...+ T Consensus 1 ma~~~T~~~d~iiPev~~~~v~~~~~~~l~~~~~~~~-d~--------~l----~g-~~-------G~tv~iP~~~~~g~ 59 (274) T protein:vir:97 1 MPQGLTKTSDQIIPEVLAPMMQAQLEKKLRFASFAEV-DS--------TL----QG-QP-------GDTLTFPAFVYSGD 59 (274) T ss_pred CCccceehhheechHHHHHHHHHhhhhhhhhccccee-cc--------cc----cC-CC-------CCEEEEeeecCCCc Confidence 221110 0000111100 0 00001110000 00 00 00 00 00111110001122 Q ss_pred hhccCCC-CcchhhcceEEEEEEEEeecccccccccHHHHHHHHHhcCCChhHHHHHHHHHHHHHHhhHHHHHHHhhhhh Q lcl|Aclame:pro 188 LERMGEA-NRLFREMSFSIEKTSVTAQSRALKAEYTLELAQDLKAIHGLDAEQELANILSSEVLAEINREVVRRVYTVAK 266 (468) Q Consensus 188 aE~lG~~-~~~f~EMaFsIeK~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAE~ELanILStEImlEINREii~~l~~va~ 266 (468) +|.+.++ .-+..++. ..+.+++.+-|+-.=+++=| ..+.+ +-|.-.+..+-++..|+.+++.+++..|.+... T Consensus 60 a~~~~~g~~i~~~~lt--~~~~~~~i~~~~~~~~i~D~--~~~~~--~~dp~~~~~~~~a~a~a~~vd~~~~~~l~~a~~ 133 (274) T protein:vir:97 60 AQVVAEGEKIPTDILE--TKKREAKIRKIAKGTSITDE--ALLSG--YGDPQGEQVRQHGLAHANKVDNDVLEALMGAKL 133 (274) T ss_pred cccccCCCcccccccc--cceeEEEeeeecceecccHH--HHHhc--cchHHHHHHHHHHHHHHHHHHHHHHHHHhccCc Confidence 2322221 22334443 33344444555522223222 22223 468888999999999999999999988765433 Q ss_pred ccccccccccccccccccccchhHHHHHHHHHHHHHHHHHHHHHhhcCCCccEEEEchhHHHHHHhhccccccccccccc Q lcl|Aclame:pro 267 KGAQNNVANAGIFDLDVDSNGRWSVEKFKGLLFQVERDANAIAQETRRGKGNFLICSADVASALAMAGVLDYSSGLNGAG 346 (468) Q Consensus 267 ~~k~~~~~~~g~~Dl~~~~~grw~~e~~k~L~~~i~~ean~i~~~T~r~~~n~~v~S~~Va~~L~~sG~~~~~~~~~~~~ 346 (468) .. +...+ ..+.+-.++.++..+ -..+.+++|+|.|++.|.......|..... . T Consensus 134 ~~------~~~~~----------~~d~i~dA~~~l~d~---------~~~~~~ivv~p~~~~~L~k~~~~~f~~~s~--~ 186 (274) T protein:vir:97 134 TV------NADIT----------KLNGLQSAIDKFNDE---------DLEPMVLFVNPLDAGKLRGDASTNFTRATE--L 186 (274) T ss_pred cc------ccccc----------CHHHHHHHHHHhhcc---------CCCceEEEeCHHHHHHHHhhhhhhccccCc--c Confidence 21 11112 123333344444322 236789999999999998754444433221 1 Q ss_pred ccccccccccCceeEEEecCceEEEEcccccccCCcceEEEEEecCCcccceeEeeccchhhcccccCCccccceeeeee Q lcl|Aclame:pro 347 GPSIGEVDDTGNLAVGTINGRIKVFVDPYAANLSDKHYYVIGYKGTSPYDAGLFYCPYVPLQMVRSIDPNTFQPKIGFKT 426 (468) Q Consensus 347 ~~~~~~~d~t~~~~~G~l~g~~~vy~D~Ya~~~~~~dY~~vG~KG~~~~d~glfyaPYv~l~~~~~~dp~s~qP~~g~~t 426 (468) +.. .-.+| .+|++. |++||+| ++-|. |-.+-++ -+.+-|.--.+...-.--||..+.-.+-..- T Consensus 187 g~~---~~~~G--~ig~~~-G~~Vi~s----~~~p~-~t~~l~~-----~gA~~~~~~~~~~vE~~Rd~~~~~d~i~~~~ 250 (274) T protein:vir:97 187 GDD---IIVKG--AFGEAL-GAIIVRT----NKLEA-GTAILAK-----KGAVKLILKRDFFLEVARDASTKTTALYSDK 250 (274) T ss_pred ccc---ceecc--ccceec-CeeEEEc----CCCCc-ceEEEEe-----CcceEeeecCCceeccccchhhcccEEEEEE Confidence 111 11122 257774 6899999 55663 3222222 1122221111112222248888888888888 Q ss_pred eeeeee-cC--cccccCccccccchhhhhhhc Q lcl|Aclame:pro 427 RYGMVS-NP--FVTTNGLYNGTPDGEALTPNA 455 (468) Q Consensus 427 RY~l~~-nP--~~~~~~~~~~~~~~~~~~~~a 455 (468) +||+.+ || ..... -..+ ..-. T Consensus 251 ~y~~~~~~~~~vv~~t-~~~~-------~~~~ 274 (274) T protein:vir:97 251 HYVAYLYDESKAVKIT-KGSG-------SLEM 274 (274) T ss_pred EEEEEEEcCCceEEEe-cCcc-------cccC Confidence 998864 44 11100 0001 0000 No 69 >protein:vir:105334 Length: 276 # NCBI annotation: putative phage major capsid protein # Family: family:all:522 # MgeID: mge:1679 # MgeName: PH15 # Cross-refs: genbank:acc:YP_950669;genbank:gi:119967839;genbank:GeneID:4643213 Probab=70.41 E-value=0.21 Score=24.31 Aligned_cols=268 Identities=12% Similarity=0.048 Sum_probs=110.6 Q ss_pred ecCC-CCccc-ccccCCccccc-cccccccccccccCccccCCCccccccccccccccccccccccchhhhhccCCCCcc Q lcl|Aclame:pro 121 YENQ-AGEEA-LFNEPDTGFTG-GYDASQGDYAVRTGAGVGGDSEGNNPALLNDAAPGTYEVGSKMPREDLERMGEANRL 197 (468) Q Consensus 121 Y~~q-sG~EA-~fnEa~t~fSg-~~~~~~~~~~~~~~~~~~~~~~gt~~~~~~~a~~~~~t~~~gm~Ta~aE~lG~~~~~ 197 (468) -++. ..-.. +..|.-+.+=- .........+..... ....+ .+ ....++..--...++|.+.++ .+ T Consensus 1 Ma~~~T~l~d~i~Pev~~~~v~~~~~~~~~~~~~~~~~---~~l~g-~~-------G~ti~iP~~~~igda~~~~eg-~~ 68 (276) T protein:vir:10 1 MAQGTTTKSTQIVPEVLAPMMQAELDKKLRFAQFADID---STLVG-QP-------GDTLTFPAFVYSGDATVVPEG-QK 68 (276) T ss_pred CCcceeehhhhhchHHHHHHHHHHHHhhhhhcccceec---ccccC-CC-------CCEEEeeeecCCCccccccCC-Cc Confidence 1111 00000 00110000000 000000000000000 00000 00 011111110011233444432 23 Q ss_pred hhhcceEEEEEEEEeecccccccccHHHHHHHHH-hcCCChhHHHHHHHHHHHHHHhhHHHHHHHhhhhhcccccccccc Q lcl|Aclame:pro 198 FREMSFSIEKTSVTAQSRALKAEYTLELAQDLKA-IHGLDAEQELANILSSEVLAEINREVVRRVYTVAKKGAQNNVANA 276 (468) Q Consensus 198 f~EMaFsIeK~tVtAKSRaLKAEYT~ELAQDLkA-iHGLDAE~ELanILStEImlEINREii~~l~~va~~~k~~~~~~~ 276 (468) +..-..+..+.+++.+-|.-.=++| |+-+ .-+.|.-.+..+-++..|+..++.+++..|....... .. T Consensus 69 i~~~~lt~~~~~a~i~~~~k~~~~t-----D~a~~~~~~dp~~~~~~~~~~~~a~~~d~~~~~~l~~~~~~~------~~ 137 (276) T protein:vir:10 69 IPVDKIETNRREAKIHKIGKGTDIT-----DEALLSGYGDPQGEAVRQHGLAIANKVDNDVLEALRGTKLTV------SA 137 (276) T ss_pred cCccccccceeeEEeehcccccccc-----HHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHhcccccc------cc Confidence 3333334455555555554333333 3333 2368999999999999999999999998776543221 11 Q ss_pred ccccccccccchhHHHHHHHHHHHHHHHHHHHHHhhcCCCccEEEEchhHHHHHHhhccccccccccccccccccccccc Q lcl|Aclame:pro 277 GIFDLDVDSNGRWSVEKFKGLLFQVERDANAIAQETRRGKGNFLICSADVASALAMAGVLDYSSGLNGAGGPSIGEVDDT 356 (468) Q Consensus 277 g~~Dl~~~~~grw~~e~~k~L~~~i~~ean~i~~~T~r~~~n~~v~S~~Va~~L~~sG~~~~~~~~~~~~~~~~~~~d~t 356 (468) +.+.+ +.+-..+.++.. .-.+.++++|+|++.+.|......+|...... +. +.-.+ T Consensus 138 ~~~t~----------d~i~~A~~~lgd---------~~~~~~~ivv~p~~~~~L~k~~~~~f~~~s~~--g~---~~~~~ 193 (276) T protein:vir:10 138 DIGTL----------AGLEAAIDTFDD---------EDLEPMVLFINPKDAGKLRSSASDNFTRATEL--GD---NIIVK 193 (276) T ss_pred cccCH----------HHHHHHHHHhcc---------ccCcccEEEEcHHHHHHHHHhccccccccccc--cc---cceec Confidence 22211 222222222221 12568899999999999965433343322221 11 11112 Q ss_pred CceeEEEecCceEEEEcccccccCCcceEEEEEe-cCCcccceeEeeccchhhcccccCCccccceeeeeeeeeeee-cC Q lcl|Aclame:pro 357 GNLAVGTINGRIKVFVDPYAANLSDKHYYVIGYK-GTSPYDAGLFYCPYVPLQMVRSIDPNTFQPKIGFKTRYGMVS-NP 434 (468) Q Consensus 357 ~~~~~G~l~g~~~vy~D~Ya~~~~~~dY~~vG~K-G~~~~d~glfyaPYv~l~~~~~~dp~s~qP~~g~~tRY~l~~-nP 434 (468) | .+|++ .|++|++| ++-|. |-.+-++ |.-.+ +... +...-.--|++.++-.+--.-+||+.. || T Consensus 194 G--~ig~~-~G~~Vi~s----~~~p~-~t~~l~~~gAi~~----~~~~--~~~vE~dRd~~~~~d~i~~~~~y~~~~~~~ 259 (276) T protein:vir:10 194 G--AFGEA-LGAVIVRS----KKLDE-GEAILAKRGAVKL----ITKR--DFFLETDRDPSTKTTALYSDKHYVAYLYDE 259 (276) T ss_pred c--cccee-cceeEEEc----CCCCc-ceEEEEeccceee----eecC--CceeecccchhhcccEEEEeeEEEEEEEcC Confidence 2 35777 46899999 44553 2222222 22211 1111 111112238888888888888888754 44 Q ss_pred c--ccccCccccccchhhhhhhccc Q lcl|Aclame:pro 435 F--VTTNGLYNGTPDGEALTPNANM 457 (468) Q Consensus 435 ~--~~~~~~~~~~~~~~~~~~~an~ 457 (468) = ....- ..+ ...+|. T Consensus 260 ~~vv~~t~-~~~-------~~~~~~ 276 (276) T protein:vir:10 260 SKAVKVTK-GAG-------TTDSGA 276 (276) T ss_pred cceEEEec-CCc-------CCcCCC Confidence 1 11010 011 111111 No 70 >protein:vir:8187 Length: 311 # NCBI annotation: gp7 # Family: family:all:966 # MgeID: mge:153 # MgeName: Che9d # Cross-refs: genbank:acc:NP_817980;genbank:gi:29566414;genbank:GeneID:2700968 Probab=67.56 E-value=0.25 Score=23.88 Aligned_cols=287 Identities=11% Similarity=0.086 Sum_probs=114.1 Q ss_pred ccccccccccccccceehhhhHHhhhhhhhhheeeeecCCccceeeeeeeeeecCCCCcccccccCCccccccccccccc Q lcl|Aclame:pro 69 ALGSANTGGLAGFDPVLISLVRRAMPNLMAYDVCGVQPMSGPTGLIFAMRSRYENQAGEEALFNEPDTGFTGGYDASQGD 148 (468) Q Consensus 69 i~~st~tg~i~~~~P~Lv~l~RRa~~~LI~~DI~GVQPmTGPTGLIFAMRsrY~~qsG~EA~fnEa~t~fSg~~~~~~~~ 148 (468) .+ .+++|++.--....-.+++++.+.-+-.+++.+-||++..- -+- ++.+ +.++ .| T Consensus 1 ma-t~~~gg~lvP~~~~~~ii~~~~~~s~i~~~~~~i~~~~~~~-~~p---~~~~--~~~a-------~w---------- 56 (311) T protein:vir:81 1 MV-ALATGTFQLPKHLVPGVWQKAQGQSVLARLSMAEPQEFGEQ-QYM---TLTA--PPRG-------EV---------- 56 (311) T ss_pred Cc-eecCCceEcchhHHHHHHHHHHhcchhhhhcceeecCCCce-EEE---EEeC--Ccee-------EE---------- Confidence 11 12223322111112234555667778889999999865421 110 1110 0000 00 Q ss_pred cccccCccccCCCccccccccccccccccccccccchhhhhccCCCCcchhhcceEEEEEEEEeecccccccccHHHHHH Q lcl|Aclame:pro 149 YAVRTGAGVGGDSEGNNPALLNDAAPGTYEVGSKMPREDLERMGEANRLFREMSFSIEKTSVTAQSRALKAEYTLELAQD 228 (468) Q Consensus 149 ~~~~~~~~~~~~~~gt~~~~~~~a~~~~~t~~~gm~Ta~aE~lG~~~~~f~EMaFsIeK~tVtAKSRaLKAEYT~ELAQD 228 (468) .+ | +..+++...++++++..+|.=+-....|-||.|+ T Consensus 57 ------------------------------------v~--E-----g~~~~~~~~~f~~v~l~~~kl~~~~~iS~ell~~ 93 (311) T protein:vir:81 57 ------------------------------------VG--E-----GAQKSESTATFAPVTAIPRKVQVTQRFSQEVKWA 93 (311) T ss_pred ------------------------------------ee--c-----CcccccccceeeEEEEeeEEEEEeehhhHHHhhc Confidence 00 1 1233334444455555555444556789999875 Q ss_pred HHHhcCCChhHHHHHHHHHHHHHHhhHHHHHHHhhhhhccccccccccccccc----cccccchhHHHHHHHHHHHHHHH Q lcl|Aclame:pro 229 LKAIHGLDAEQELANILSSEVLAEINREVVRRVYTVAKKGAQNNVANAGIFDL----DVDSNGRWSVEKFKGLLFQVERD 304 (468) Q Consensus 229 LkAiHGLDAE~ELanILStEImlEINREii~~l~~va~~~k~~~~~~~g~~Dl----~~~~~grw~~e~~k~L~~~i~~e 304 (468) --. -.++-|++|.+-|+..|+..|+.-++.-.- +..+.......+++++- ...+...+..+. -+.+. T Consensus 94 ~~d-~~~~l~~~i~~~la~ai~~~~d~a~l~G~~--~~~~~~~~gi~~~~~~~~~~~~~~~~~~~~~~~------~i~~~ 164 (311) T protein:vir:81 94 DES-RQLGVLQTMADLSGVALGRALDLIGIHGIN--PLTGAALSGSPAKILDTTNIVELTTGTSATPDL------AVEAA 164 (311) T ss_pred Ccc-cHHHHHHHHHHHHHHHHHHHHHHhhhcccc--CCCCcccccccccccccceeeeecccccchHHH------HHHHH Confidence 322 234567777777777777777776664321 11111111111111110 000001111111 12222 Q ss_pred HHHHHHhhcCCCccEEEEchhHHHHHHhhcccccccccccccccccccccccCceeEEEecCceEEEEcccccccC--Cc Q lcl|Aclame:pro 305 ANAIAQETRRGKGNFLICSADVASALAMAGVLDYSSGLNGAGGPSIGEVDDTGNLAVGTINGRIKVFVDPYAANLS--DK 382 (468) Q Consensus 305 an~i~~~T~r~~~n~~v~S~~Va~~L~~sG~~~~~~~~~~~~~~~~~~~d~t~~~~~G~l~g~~~vy~D~Ya~~~~--~~ 382 (468) ...+ ...++..+-+|++|+....|... ..+ + +.....-+.++. ..|+|.| ++|+++-+...+- .. T Consensus 165 ~~~~--~~~~~~~~~~vmn~~~~~~l~~l---kd~-----~-G~~l~~~~~~~~-~~~tl~G-~Pv~~~~~i~~~~~~~~ 231 (311) T protein:vir:81 165 VGLV--LGDNLSPDGVALDNTFSFMLATQ---RDS-----Q-GRKLYPELGFGT-DVASFAG-LNAAVSDTVRGGPEAVT 231 (311) T ss_pred HHHh--hhcCCCceEEEEcHHHHHHHHhh---hcc-----C-CCeeecCccccC-CCceecc-eeEEecccccccccccc Confidence 2222 23357777789999999888652 111 0 111111111111 1366654 7777763221100 00 Q ss_pred ceEEEEEecCCc-----cc-ceeEeeccchhhccccc--CCcc----ccc-eeee--eeeeee-eecC--cccccCcccc Q lcl|Aclame:pro 383 HYYVIGYKGTSP-----YD-AGLFYCPYVPLQMVRSI--DPNT----FQP-KIGF--KTRYGM-VSNP--FVTTNGLYNG 444 (468) Q Consensus 383 dY~~vG~KG~~~-----~d-~glfyaPYv~l~~~~~~--dp~s----~qP-~~g~--~tRY~l-~~nP--~~~~~~~~~~ 444 (468) +=+.+...+... -| +.+++...-++.+...- |+.. ||- .++| ..|+|. +.+| |+.....-.. T Consensus 232 ~~~~~~~~~~~~~~~~~gDfs~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~r~~~r~d~~v~~~~a~~~l~~a~~~ 311 (311) T protein:vir:81 232 ASTGVYRTTNPNVKAIAGDFSAFRWGVQVSIPLELIEFGDPDGLGDLKRQNQIAIRAEVVYGIGIMSTDAFAVVRDADES 311 (311) T ss_pred cccchhcccCCccEEEEEecccEEEEEeccceEEEeccCCCCcchhhhhcCcEEEEEEEEeccEeecccceEEEEeeccC Confidence 000000000000 01 22333333333333322 2221 222 1333 367774 3666 5432211111 No 71 >protein:vir:1084 Length: 437 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:21 # MgeName: bIL309 # Cross-refs: genbank:acc:NP_076738;genbank:gi:13095848;genbank:GeneID:920418 Probab=67.49 E-value=0.25 Score=23.87 Aligned_cols=328 Identities=11% Similarity=0.089 Sum_probs=110.3 Q ss_pred Cc--chHHH----------HHhhhhhhCCCccchhcchhhhHHHHHHHhHHHHHHhhhhhh-----------hhhhh--- Q lcl|Aclame:pro 1 MF--NAEHL----------QEKWSPVLNHGEAPAIGDRYKRAVTSVLLENQERFLREERGM-----------LNEVA--- 54 (468) Q Consensus 1 ~~--~~~~l----------~~kw~p~l~~~~~~~i~~~~~~~~~~~llenq~~~~~~~~~~-----------l~e~~--- 54 (468) .. ..+++ .++-....+.+...+..+ -.....+.+.+...+...+ ..+.. T Consensus 65 ~~~~~~e~~~~~~~~~~~e~~~~~~~~e~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 139 (437) T protein:vir:10 65 ASALKVEEKRDDSDLVAPELEENSADNEEDDPEKLKT-----ETKSEAEKDKKTVKDEEKRDAGGLQDMKLKVGGEIADK 139 (437) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----HHHHHHHHHHHHHHHHHHHhHHHHhHHHHHHHHHHHHh Confidence 00 00000 000000000000000000 0111111111111111000 00000 Q ss_pred -hhhcCccccc-cccccccc-ccccccccccceehhhhHHhhhhhhhhheeeeecCCccceeeeeeeeeecCCCCccccc Q lcl|Aclame:pro 55 -VNSLGAGTIA-PAGSALGS-ANTGGLAGFDPVLISLVRRAMPNLMAYDVCGVQPMSGPTGLIFAMRSRYENQAGEEALF 131 (468) Q Consensus 55 -~~~~~~~~~~-~~~~i~~s-t~tg~i~~~~P~Lv~l~RRa~~~LI~~DI~GVQPmTGPTGLIFAMRsrY~~qsG~EA~f 131 (468) ...+...... .......+ ++.++.. .-..+...++.........+++.|.||+.+.+-+--.+.. . ..+ T Consensus 140 ~~~~~~~~~~~~e~~~~~~~~~~~~g~l-vp~~~~~~i~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~----~-~~~-- 211 (437) T protein:vir:10 140 KVTAFADYLKTGEVRDVTGIALKDGKVI-IPETILTPEKEVHQFPRLGSLVRTESVTTTTGKLPIFNNS----T-DLL-- 211 (437) T ss_pred hhhhhHHHHHhhhhhhhhhccccccccc-chHHHHHHHHHhhhhhhhhhcceeEeeccCceeeEEeecc----c-ccc-- Confidence 0000000000 00000001 1111110 0011111122111122345668888887776644333210 0 000 Q ss_pred ccCCccccccccccccccccccCccccCCCccccccccccccccccccccccchhhhhccCC-CCcchhhcceEEEEEEE Q lcl|Aclame:pro 132 NEPDTGFTGGYDASQGDYAVRTGAGVGGDSEGNNPALLNDAAPGTYEVGSKMPREDLERMGE-ANRLFREMSFSIEKTSV 210 (468) Q Consensus 132 nEa~t~fSg~~~~~~~~~~~~~~~~~~~~~~gt~~~~~~~a~~~~~t~~~gm~Ta~aE~lG~-~~~~f~EMaFsIeK~tV 210 (468) .+ ..+.+...+ ....|.++.|.+.|..+ T Consensus 212 -----~~----------------------------------------------~~e~~~~~e~~~~~~~~v~~~~~k~~~ 240 (437) T protein:vir:10 212 -----TA----------------------------------------------HTEYGQTTKNATPVITPILWDLKTYTG 240 (437) T ss_pred -----cc----------------------------------------------ccccccccccccccceeeeeehhheee Confidence 00 000011111 11346666666666543 Q ss_pred EeecccccccccHHHHHHHHHhcCCChhHHHHHHHHHHHHHHhhHHHHHHHhhhhhccccccccccccccccccccchhH Q lcl|Aclame:pro 211 TAQSRALKAEYTLELAQDLKAIHGLDAEQELANILSSEVLAEINREVVRRVYTVAKKGAQNNVANAGIFDLDVDSNGRWS 290 (468) Q Consensus 211 tAKSRaLKAEYT~ELAQDLkAiHGLDAE~ELanILStEImlEINREii~~l~~va~~~k~~~~~~~g~~Dl~~~~~grw~ 290 (468) -..+|-||.+|- .+|.+++|.+.|+..|..-+|..||.-+-+. ...+.+....-| T Consensus 241 -------~~~is~ell~ds----~~~~~~~i~~~l~~~~~~~~~~~i~~g~g~~----~~~~~~~~~~~~---------- 295 (437) T protein:vir:10 241 -------GYVFSQELISDS----SYDWQAELQSRLIELRDNTDDSLIITALTDG----IKKTTSTYLLGD---------- 295 (437) T ss_pred -------ehhhhHHHHhhh----HHHHHHHHHHHHHHHHHHHHHHHHhhhhccc----ccccccccchhh---------- Confidence 467899999984 3578889999999999999999888754321 111111111111 Q ss_pred HHHHHHHH-HHHHHHHHHHHHhhcCCCccEEEEchhHHHHHHhhcccccccccccccccccccccccCceeEEEecCceE Q lcl|Aclame:pro 291 VEKFKGLL-FQVERDANAIAQETRRGKGNFLICSADVASALAMAGVLDYSSGLNGAGGPSIGEVDDTGNLAVGTINGRIK 369 (468) Q Consensus 291 ~e~~k~L~-~~i~~ean~i~~~T~r~~~n~~v~S~~Va~~L~~sG~~~~~~~~~~~~~~~~~~~d~t~~~~~G~l~g~~~ 369 (468) ...++ +.+. ... +.. -.+||+|.....|... ..+ .|..+...+-++.. -++|.| ++ T Consensus 296 ---~~~~~~~~l~---~~~-----~~~-~~~~~~~~~~~~l~~l---kd~------~g~~~~~~~~~~~~-~~~l~G-~p 352 (437) T protein:vir:10 296 ---LKKVLNVTLK---PQD-----SAA-ASIVMSQSAYNLFDMA---TDA------MGRPLLQPNVTAAT-GYTLLG-KT 352 (437) T ss_pred ---HHHHHHhhhh---hhh-----hcC-CEEEEcHHHHHHHHHh---hcc------CCCeeeccCccCCC-Cccccc-ce Confidence 11111 0111 111 122 2569999998888763 111 11111222212111 246655 45 Q ss_pred EEEcccc--cccCCcceEEEEEecCCcccceeEeeccch---------hhcccccCCccccceeeeeeeeeee-ecC--c Q lcl|Aclame:pro 370 VFVDPYA--ANLSDKHYYVIGYKGTSPYDAGLFYCPYVP---------LQMVRSIDPNTFQPKIGFKTRYGMV-SNP--F 435 (468) Q Consensus 370 vy~D~Ya--~~~~~~dY~~vG~KG~~~~d~glfyaPYv~---------l~~~~~~dp~s~qP~~g~~tRY~l~-~nP--~ 435 (468) |++...+ .+...-++ .+||+.+-. ..+...-+-+.+...+.+..||+.. ++| | T Consensus 353 v~~~~~~~~~~~~~~~~-------------~~~~gd~~~~~~~~~r~~~~~~~~~~~~~~~~~~~~~~r~d~~~~~~~a~ 419 (437) T protein:vir:10 353 VVIVDDKLFPSASAGDV-------------NIVVAPLKKAVINFKLTEITGQFQDTYDIWYKQLGIFLRQNVVQASKDLI 419 (437) T ss_pred eEEecccccCCcCCCce-------------EEEEeeccccEEEEeeeceEEEEecccccccceeeEEEEEccEEecccce Confidence 5542110 01111111 122222211 0111111334455566666788653 344 3 Q ss_pred ccccCccccccchhhhhhhccc Q lcl|Aclame:pro 436 VTTNGLYNGTPDGEALTPNANM 457 (468) Q Consensus 436 ~~~~~~~~~~~~~~~~~~~an~ 457 (468) +........... ...|-. T Consensus 420 ~~l~~~~~~~~~----~~~~~~ 437 (437) T protein:vir:10 420 VNLTGKLKAVTV----VQSTAV 437 (437) T ss_pred EEEEeecccccc----CCCCCC Confidence 321111011000 000100 No 72 >protein:vir:4856 Length: 293 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:106 # MgeName: DT1 # Cross-refs: genbank:acc:NP_049396;genbank:gi:9632424;genbank:GeneID:1258532 Probab=66.75 E-value=0.26 Score=23.77 Aligned_cols=276 Identities=14% Similarity=0.086 Sum_probs=114.9 Q ss_pred hhhhhhhhcCcccccccccccccccccccccccceeh-hhhHHhhhhhhhhheeeeecCCccceeeeeeeeeecCCCCcc Q lcl|Aclame:pro 50 LNEVAVNSLGAGTIAPAGSALGSANTGGLAGFDPVLI-SLVRRAMPNLMAYDVCGVQPMSGPTGLIFAMRSRYENQAGEE 128 (468) Q Consensus 50 l~e~~~~~~~~~~~~~~~~i~~st~tg~i~~~~P~Lv-~l~RRa~~~LI~~DI~GVQPmTGPTGLIFAMRsrY~~qsG~E 128 (468) +.|+. +.++++.+-.-.-+.+. .+++..-+..+-.+++.+=||++.+|-+=-.+ ..+..+ + T Consensus 1 ~l~~~---------------~~~t~~~gg~liP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~g~~~~~~--~~~~~~-~ 62 (293) T protein:vir:48 1 MLDSK---------------TDHSGSDAGLTIPQDIRTAINTLVRQYDSLQEYVNVENVTTLTGSRVYEK--WTDITG-L 62 (293) T ss_pred Cceee---------------cccccCcCceEechhHHHHHHHHHHhhhhhhhhceeeeccCCcceEEEEe--ecCCCc-c Confidence 22211 11111111111111111 24444455666778888888887665211111 000000 0 Q ss_pred cccccCCccccccccccccccccccCccccCCCccccccccccccccccccccccchhhhhccCCCCcchhhcc-eEEEE Q lcl|Aclame:pro 129 ALFNEPDTGFTGGYDASQGDYAVRTGAGVGGDSEGNNPALLNDAAPGTYEVGSKMPREDLERMGEANRLFREMS-FSIEK 207 (468) Q Consensus 129 A~fnEa~t~fSg~~~~~~~~~~~~~~~~~~~~~~gt~~~~~~~a~~~~~t~~~gm~Ta~aE~lG~~~~~f~EMa-FsIeK 207 (468) + .| . +| +..++|.+ .++++ T Consensus 63 a-------~~----------------------------------------------v--~E-----g~~~~~~~~~~~~~ 82 (293) T protein:vir:48 63 A-------NI----------------------------------------------D--DE-----AGKIADIDDPKLSL 82 (293) T ss_pred e-------ee----------------------------------------------e--cC-----CcccccccccceeE Confidence 0 00 0 01 12344443 45666 Q ss_pred EEEEeecccccccccHHHHHHHHHhcCCChhHHHHHHHHHHHHHHhhHHHHHHHhhhhhccccccccccccccccccccc Q lcl|Aclame:pro 208 TSVTAQSRALKAEYTLELAQDLKAIHGLDAEQELANILSSEVLAEINREVVRRVYTVAKKGAQNNVANAGIFDLDVDSNG 287 (468) Q Consensus 208 ~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAE~ELanILStEImlEINREii~~l~~va~~~k~~~~~~~g~~Dl~~~~~g 287 (468) ++..+|.-+-...+|-||.+|. .+|.|++|.+-|+..|..-+|+.|+.-+-..+. ..+.+.+ T Consensus 83 i~l~~~k~~~~~~iS~ell~ds----~~~l~~~i~~~la~~~~~~~~~~i~~g~~~~~~--------~~~~~~~------ 144 (293) T protein:vir:48 83 IKYTIKRYAGISTVTNSLLADS----AENILAWLSGWIAKKVVVTRNKAILGVVDKLPT--------KPTLTKW------ 144 (293) T ss_pred EEEeeeEEEEeehhhHHHHhhh----hHHHHHHHHHHHHHHHHHHHHhHHhhccccccc--------cccccCH------ Confidence 6667777777788999999986 367899999999999999999988865433221 2222222 Q ss_pred hhHHHHHHHHHHHHHHHHHHHHHhhcCCCccEEEEchhHHHHHHhhcccccccccccccccccccccccCceeEEEecCc Q lcl|Aclame:pro 288 RWSVEKFKGLLFQVERDANAIAQETRRGKGNFLICSADVASALAMAGVLDYSSGLNGAGGPSIGEVDDTGNLAVGTINGR 367 (468) Q Consensus 288 rw~~e~~k~L~~~i~~ean~i~~~T~r~~~n~~v~S~~Va~~L~~sG~~~~~~~~~~~~~~~~~~~d~t~~~~~G~l~g~ 367 (468) +....|+.++. ..- +. ....+|++.....|... ..+. +..+.+-+-++. ..++|.| T Consensus 145 ----d~i~~~~~~l~-------~~~-~~-~a~~vmn~~~~~~L~~l---kd~~------g~~l~~~~~~~~-~~~~l~G- 200 (293) T protein:vir:48 145 ----DDIIDLEAKVD-------PAI-KQ-TSFFLTNTSGFTALKKV---KNAL------GDYLMERDVKSP-TGYSIAG- 200 (293) T ss_pred ----HHHHHHHHhhh-------hhh-cC-CCEEEEcHHHHHHHHHh---hccC------CceEeecCcCCC-CCceecc- Confidence 22333433332 111 22 33678899998888752 2111 111111111111 1245644 Q ss_pred eEEEE--cccccccCCcc----------eEEEEEecCCcccceeEeeccchhhcccccCCccccceeeeeeeeeee-ecC Q lcl|Aclame:pro 368 IKVFV--DPYAANLSDKH----------YYVIGYKGTSPYDAGLFYCPYVPLQMVRSIDPNTFQPKIGFKTRYGMV-SNP 434 (468) Q Consensus 368 ~~vy~--D~Ya~~~~~~d----------Y~~vG~KG~~~~d~glfyaPYv~l~~~~~~dp~s~qP~~g~~tRY~l~-~nP 434 (468) ++|++ |.+..+....+ |+.++.++.... -..++.. .+-.+-|=.+-...||+.. .+| T Consensus 201 ~Pv~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i----~~~~~~~------~~~~~~~~~~r~~~r~d~~~~~~ 270 (293) T protein:vir:48 201 FAVKEISDRWLPNASSGVMPLYFGDLKQAVTLFDRQQMSL----LSTNIGG------GAFETDTTKVRVIDRFDVVATDT 270 (293) T ss_pred eeeEEecccccCCccCCceEEEEEeccceEEEEEecceEE----EEecccc------hhhhcCeEEEEEEEeeCcEEecc Confidence 45543 33322111111 222222222111 1111110 0112223344455555543 233 Q ss_pred --cccccCccccccchhhhhhhcccceeee Q lcl|Aclame:pro 435 --FVTTNGLYNGTPDGEALTPNANMYYRRV 462 (468) Q Consensus 435 --~~~~~~~~~~~~~~~~~~~~an~y~~r~ 462 (468) |...+-.....+.+.. +.-| | T Consensus 271 ~a~~~l~~~~~~~~~~~~-~~~~------~ 293 (293) T protein:vir:48 271 EAFVPASFKAIADQKGNI-GSTA------V 293 (293) T ss_pred cceEEEEeeccccCCccc-cccC------C Confidence 2110000000000000 0000 0 No 73 >protein:vir:4511 Length: 409 # NCBI annotation: capsid # Family: family:all:21 # MgeID: mge:97 # MgeName: V # Cross-refs: genbank:acc:NP_599037;genbank:gi:19548995;genbank:GeneID:935211 Probab=65.74 E-value=0.28 Score=23.63 Aligned_cols=324 Identities=16% Similarity=0.180 Sum_probs=115.7 Q ss_pred CcchHHHHHhh---------------------hhhhCCCccchhcchhhhHHHHHHHhH-HHHHHhhhhhhhhhhhhhhc Q lcl|Aclame:pro 1 MFNAEHLQEKW---------------------SPVLNHGEAPAIGDRYKRAVTSVLLEN-QERFLREERGMLNEVAVNSL 58 (468) Q Consensus 1 ~~~~~~l~~kw---------------------~p~l~~~~~~~i~~~~~~~~~~~llen-q~~~~~~~~~~l~e~~~~~~ 58 (468) +-.-+.|.++. .+.+..+.- .-.+..+++.....+.+ +.....+++..+.|...... T Consensus 41 ~~e~~~l~~~i~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~a~~~~l~~~~~~~~~~e~~~~~~~~a~~~ 119 (409) T protein:vir:45 41 KSELEALDERIAREEELRRQDQAYIESNEEEQRQNLDPENN-SQQDEKRAQVFDKWMRHGASELTSEERKALRELRAQGV 119 (409) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhcccCCCCCc-chhhHHHHHHHHHHHHhhhhhccHHHHHHHHHHhhccC Confidence 11111111111 122222111 11111222222222211 11111222223333211111 Q ss_pred Cccccccccccccccccccc---ccccceehhhhHHhhhhhhhhheeeeecCCccceeeeeeeeeecCCCCcccccccCC Q lcl|Aclame:pro 59 GAGTIAPAGSALGSANTGGL---AGFDPVLISLVRRAMPNLMAYDVCGVQPMSGPTGLIFAMRSRYENQAGEEALFNEPD 135 (468) Q Consensus 59 ~~~~~~~~~~i~~st~tg~i---~~~~P~Lv~l~RRa~~~LI~~DI~GVQPmTGPTGLIFAMRsrY~~qsG~EA~fnEa~ 135 (468) +. .+.|+. ..+.+.++.+.| +..+-.+++-|-|+++.....+-... ..+ .. T Consensus 120 ~~------------~~~gg~liP~~~~~~ii~~~~---~~~~l~~~~~~~~~~~~~~~~~~~~~---~~~-~~------- 173 (409) T protein:vir:45 120 AQ------------DEKGGYTVPETFLAKVVEKMK---SYGGIASVAQILTTSDGRTMEWATAD---GTS-EV------- 173 (409) T ss_pred cc------------CcCCceeccHhHHHHHHHHHH---hhhhhhhhceeeecCCCceEEEEeec---cCc-cc------- Confidence 00 011111 112233444444 33344677888888765544432221 000 00 Q ss_pred ccccccccccccccccccCccccCCCccccccccccccccccccccccchhhhhccCCCCcchhhcceEEEEEEEEeecc Q lcl|Aclame:pro 136 TGFTGGYDASQGDYAVRTGAGVGGDSEGNNPALLNDAAPGTYEVGSKMPREDLERMGEANRLFREMSFSIEKTSVTAQSR 215 (468) Q Consensus 136 t~fSg~~~~~~~~~~~~~~~~~~~~~~gt~~~~~~~a~~~~~t~~~gm~Ta~aE~lG~~~~~f~EMaFsIeK~tVtAKSR 215 (468) . --.++++........|.+..|.--|.. T Consensus 174 -----------------------~-----------------------~~v~E~~~~~~~~~~f~~~~l~~~k~~------ 201 (409) T protein:vir:45 174 -----------------------G-----------------------VLLGENEEAGEEDTDFGMGSLGALKMT------ 201 (409) T ss_pred -----------------------c-----------------------ccccccccccccccccceeeeeeeeee------ Confidence 0 000011111111122333333222211 Q ss_pred cccccccHHHHHHHHHhcCCChhHHHHHHHHHHHHHHhhHHHHHH-----------Hhhhhhcccccccccccccccccc Q lcl|Aclame:pro 216 ALKAEYTLELAQDLKAIHGLDAEQELANILSSEVLAEINREVVRR-----------VYTVAKKGAQNNVANAGIFDLDVD 284 (468) Q Consensus 216 aLKAEYT~ELAQDLkAiHGLDAE~ELanILStEImlEINREii~~-----------l~~va~~~k~~~~~~~g~~Dl~~~ 284 (468) +-=..+|-||.+|- .+|.+++|.+-|+..|.+-+|+.||.- |...+.. .......+.++ T Consensus 202 ~~~i~is~ell~ds----~~~l~~~i~~~la~a~~~~~~~a~l~G~G~~~~~~p~Gil~~~~~--~~~~~~~~~~~---- 271 (409) T protein:vir:45 202 SKIIRVSNELLQDS----AIDMEAYLARRIAERIGRGEARYLIQGTGAGTPKQPKGLAASVTG--TTQTAAANAVK---- 271 (409) T ss_pred eeehhhhHHHHhcc----HHHHHHHHHHHHHHHHHHHHHHHhhccCCCCCccccceeeecccc--ccccccccccc---- Confidence 11135799999994 257899999999999999999998841 1111110 01111111111 Q ss_pred ccchhHHHHHHHHHHHHHHHHHHHHHhhcCCCccE-EEEchhHHHHHHhhcccccccccccccccccccccccCceeEEE Q lcl|Aclame:pro 285 SNGRWSVEKFKGLLFQVERDANAIAQETRRGKGNF-LICSADVASALAMAGVLDYSSGLNGAGGPSIGEVDDTGNLAVGT 363 (468) Q Consensus 285 ~~grw~~e~~k~L~~~i~~ean~i~~~T~r~~~n~-~v~S~~Va~~L~~sG~~~~~~~~~~~~~~~~~~~d~t~~~~~G~ 363 (468) .+....|++. +.-.= +..+.| ++|++.....|.. |..+. +..+++.+.+... .++ T Consensus 272 ------~d~i~~l~~~-------l~~~~-~~~a~~~~~~n~~~~~~l~~---lkd~~------G~~i~~~~~~~~~-~~~ 327 (409) T protein:vir:45 272 ------WQEILALKHS-------IDPAY-RRGPKFRLAFNDNTLKLISE---MEDGQ------GRPLWLPDIVGVA-PAS 327 (409) T ss_pred ------hHHHHHHHHh-------hhhhh-ccCCeEEEEECHHHHHHHHH---hhcCC------CceeeccCcCCCC-Cce Confidence 1222233232 21111 345666 5789988887764 22111 1111222221111 146 Q ss_pred ecCceEEEEcccccccCCcce-EEEEEecCCcccceeEeeccchhhcccccCCccccceeeee--eeeeee-ecC--ccc Q lcl|Aclame:pro 364 INGRIKVFVDPYAANLSDKHY-YVIGYKGTSPYDAGLFYCPYVPLQMVRSIDPNTFQPKIGFK--TRYGMV-SNP--FVT 437 (468) Q Consensus 364 l~g~~~vy~D~Ya~~~~~~dY-~~vG~KG~~~~d~glfyaPYv~l~~~~~~dp~s~qP~~g~~--tRY~l~-~nP--~~~ 437 (468) |.| ++|+++.+......-++ +++| +-. ..+...--........||-.-...++|. .||+.. .|| |.. T Consensus 328 l~G-~PV~~~~~~p~~~~~~~~i~~G---d~~---~~~i~~~~~~~~~~~~d~~~~~~~~~~~~~~r~d~~~~~~~A~~~ 400 (409) T protein:vir:45 328 VLN-VPYVIDQEIDDIGAGKKFMFCG---DFD---RFIIRRVRYMILKRLVERYAEYDQTGFLAFHRFDCILEDTSAIKA 400 (409) T ss_pred ecc-eeeEEecCcCCccCCccEEEEe---ehh---hhheeeccceEEEEeecccccCCcEEEEEEEEeccEeechhheEE Confidence 755 68888754321111111 2222 110 0011110011122223544323444443 366543 344 322 Q ss_pred cc--Ccccc Q lcl|Aclame:pro 438 TN--GLYNG 444 (468) Q Consensus 438 ~~--~~~~~ 444 (468) .. ...++ T Consensus 401 l~~k~s~~~ 409 (409) T protein:vir:45 401 LVGKGSVGG 409 (409) T ss_pred EEeccCCCC Confidence 11 10111 No 74 >protein:vir:102119 Length: 404 # NCBI annotation: phage major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1641 # MgeName: phiSM101 # Cross-refs: genbank:acc:YP_699941;genbank:gi:110804052;genbank:GeneID:4206662 Probab=65.61 E-value=0.28 Score=23.61 Aligned_cols=324 Identities=12% Similarity=0.037 Sum_probs=113.2 Q ss_pred Cc----------chHHHHHhhhhhhCCCcc---------chh-------cchhhhHHHHHHHhHHHHHHhhhhhhhhhhh Q lcl|Aclame:pro 1 MF----------NAEHLQEKWSPVLNHGEA---------PAI-------GDRYKRAVTSVLLENQERFLREERGMLNEVA 54 (468) Q Consensus 1 ~~----------~~~~l~~kw~p~l~~~~~---------~~i-------~~~~~~~~~~~llenq~~~~~~~~~~l~e~~ 54 (468) -. .-+.|.++..-..+.+.. ... ....++.....+.+++ +.+.. T Consensus 27 ~~~~ee~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----------~~~~~ 96 (404) T protein:vir:10 27 GVTAEELNKTSNEIDILQAKIEAQKRKENIENNFNEDNVKSLNTGKEENVIYNGALFVRAIADNL----------LKQKN 96 (404) T ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccccchhhHHHHHHHHHHHHHHH----------HHHHH Confidence 00 011233332211100000 000 0000001111111111 00000 Q ss_pred hhhcCccccccccccccc-ccccccc---cccceehhhhHHhhhhhhhhheeeeecCCccceeeeeeeeeecCCCCcccc Q lcl|Aclame:pro 55 VNSLGAGTIAPAGSALGS-ANTGGLA---GFDPVLISLVRRAMPNLMAYDVCGVQPMSGPTGLIFAMRSRYENQAGEEAL 130 (468) Q Consensus 55 ~~~~~~~~~~~~~~i~~s-t~tg~i~---~~~P~Lv~l~RRa~~~LI~~DI~GVQPmTGPTGLIFAMRsrY~~qsG~EA~ 130 (468) ...+........ .+..+ +++|++. .+.+. +++.+.......+++++.||+++.|-+-=.| ..... T Consensus 97 ~~~~~~~~~e~~-a~~~~~~~~gg~~vP~~~~~~---ii~~~~~~~~l~~l~~~~~~~~~~g~~~~~~--~~~~~----- 165 (404) T protein:vir:10 97 QRGLNLSEKEIN-AISENIDEDGGYAVPEDIQTK---INTRLKDTTDLYNMVDYEPVFTRSGSRTYEK--RSKQK----- 165 (404) T ss_pred hhhhcchhhHHh-hhccccCCCCceeechhHHHH---HHHHHhhhhhHhhhhceeeccCCccceEEEE--ecCCc----- Confidence 000000000000 11111 2223221 12233 3444445567788999999999998543222 11100 Q ss_pred cccCCccccccccccccccccccCccccCCCccccccccccccccccccccccchhhhhccCCCCcchhhcceEEEEEEE Q lcl|Aclame:pro 131 FNEPDTGFTGGYDASQGDYAVRTGAGVGGDSEGNNPALLNDAAPGTYEVGSKMPREDLERMGEANRLFREMSFSIEKTSV 210 (468) Q Consensus 131 fnEa~t~fSg~~~~~~~~~~~~~~~~~~~~~~gt~~~~~~~a~~~~~t~~~gm~Ta~aE~lG~~~~~f~EMaFsIeK~tV 210 (468) ...|- ++++..... ....++++++. T Consensus 166 ----~~~~v----------------------------------------------~e~~~~~~~-----~~~~~f~~i~~ 190 (404) T protein:vir:10 166 ----PMKPL----------------------------------------------SENQQIPTN-----GDNGKLERFNF 190 (404) T ss_pred ----ceeec----------------------------------------------ccccccccc-----ccccceeeeEe Confidence 00000 000000000 01122344444 Q ss_pred EeecccccccccHHHHHHHHHhcCCChhHHHHHHHHHHHHHHhhHHHHHHHhhhhhcccccccccccccccc------cc Q lcl|Aclame:pro 211 TAQSRALKAEYTLELAQDLKAIHGLDAEQELANILSSEVLAEINREVVRRVYTVAKKGAQNNVANAGIFDLD------VD 284 (468) Q Consensus 211 tAKSRaLKAEYT~ELAQDLkAiHGLDAE~ELanILStEImlEINREii~~l~~va~~~k~~~~~~~g~~Dl~------~~ 284 (468) +.|.-+-...+|-||.+|-. .+.++.|.+.|+..|...+|+.||.-. ..+ -...|+.... .. T Consensus 191 ~~~k~~~~~~iS~ell~ds~----~~l~~~i~~~la~~~~~~~~~~il~G~----g~~----~~~~gi~~~~~~~~~~~~ 258 (404) T protein:vir:10 191 KLKDLADFMSIPNDLLKFAD----KSLEDWIINWFVDKVRITRNAEILYGA----GGD----EHATGIMTANKFKKITLP 258 (404) T ss_pred eheeeEeeehhhHHHHhhcH----HHHHHHHHHHHHHHHHHHHHHHHhhcC----CCC----Ccccceeeccccceeecc Confidence 44444445678999998843 357777888888888888888776321 111 1111221111 11 Q ss_pred ccchhHHHHHHHHHHHHHHHHHHHHHhhcCCCcc-EEEEchhHHHHHHhhcccccccccccccccccccccccCceeEEE Q lcl|Aclame:pro 285 SNGRWSVEKFKGLLFQVERDANAIAQETRRGKGN-FLICSADVASALAMAGVLDYSSGLNGAGGPSIGEVDDTGNLAVGT 363 (468) Q Consensus 285 ~~grw~~e~~k~L~~~i~~ean~i~~~T~r~~~n-~~v~S~~Va~~L~~sG~~~~~~~~~~~~~~~~~~~d~t~~~~~G~ 363 (468) ...- ......+ .+. .... .+.+| .+||||+..+.|... ..+. +...+..+-++. ..++ T Consensus 259 ~~~~--~~~~~~~-------~~~-~l~~-~~~~~~~~v~n~~~~~~L~~l---kd~~------G~~l~~~~~~~~-~~~~ 317 (404) T protein:vir:10 259 KSPA--LKDFKKC-------KNV-ELLN-VFKATSSWIVNQDGFNYLDSL---EDKT------GRPYLQPDPKDP-TQYR 317 (404) T ss_pred cccc--HHHHHHH-------HHh-hhhc-cccCCCEEEEcHHHHHHHHHh---hccC------CceeeccCcCCC-CCcc Confidence 1111 1111111 111 1112 23333 468999999998863 1110 111111111111 1245 Q ss_pred ecCceEEEE-cccccccCCcceEEEEEecCCcccceeEeeccc---------hhhcccccCC----ccccceeeeeeeee Q lcl|Aclame:pro 364 INGRIKVFV-DPYAANLSDKHYYVIGYKGTSPYDAGLFYCPYV---------PLQMVRSIDP----NTFQPKIGFKTRYG 429 (468) Q Consensus 364 l~g~~~vy~-D~Ya~~~~~~dY~~vG~KG~~~~d~glfyaPYv---------~l~~~~~~dp----~s~qP~~g~~tRY~ 429 (468) |+| ++|++ +....... ..+..++|+.+- .+......++ ...+=.+-...|++ T Consensus 318 l~G-~PV~~~~~~~~~~~-------------~~~~~~~~gd~s~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~r~d 383 (404) T protein:vir:10 318 FLG-LPVIELPNDLLLST-------------ESAIPVLLGDTKEAYKYVSDGAYELATTNIGAGAFETNTTKARIIMRID 383 (404) T ss_pred ccc-eeeEEecccccCCC-------------CCccEEEEEeccccEEEEEecceEEEEeccccchhhcCceEEEEEEeec Confidence 654 56664 31110000 001112222111 1111111122 23344566667777 Q ss_pred eee-cC--cccccCccccccc Q lcl|Aclame:pro 430 MVS-NP--FVTTNGLYNGTPD 447 (468) Q Consensus 430 l~~-nP--~~~~~~~~~~~~~ 447 (468) ..+ +| |+..+-.....|- T Consensus 384 ~~v~~~~a~~~~~~~~aa~~~ 404 (404) T protein:vir:10 384 GNVKDSEALLIAEIPVESVQA 404 (404) T ss_pred cEEecccceEEEEeecccCCC Confidence 643 33 3321110111111 No 75 >protein:vir:100135 Length: 418 # NCBI annotation: gp5 # Family: family:all:585 # MgeID: mge:1639 # MgeName: phi1026b # Cross-refs: genbank:acc:NP_945035;genbank:gi:38707895;genbank:GeneID:2744182 Probab=64.48 E-value=0.3 Score=23.46 Aligned_cols=338 Identities=9% Similarity=0.027 Sum_probs=119.8 Q ss_pred CcchHHHHHhhhhhhCC--Cccchhcchhh---h---HHHHHHHhHHH------------------------HHHhhhhh Q lcl|Aclame:pro 1 MFNAEHLQEKWSPVLNH--GEAPAIGDRYK---R---AVTSVLLENQE------------------------RFLREERG 48 (468) Q Consensus 1 ~~~~~~l~~kw~p~l~~--~~~~~i~~~~~---~---~~~~~llenq~------------------------~~~~~~~~ 48 (468) .=..+++.++..--++. +...++..... + .+-+++-+--+ .+++.... T Consensus 35 ~~e~~~~~e~~~~e~~~~~~~~~e~~~~~~~l~~~~~~l~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 114 (418) T protein:vir:10 35 GDEVKSAGEKALAEAKRAGDLGVETKATVDELLIKQGELQARLLEAEQKLARGGGSAELETPKTLGQLVTESEEMKGMDG 114 (418) T ss_pred HHHHHHHHHHHHHHHHhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccchhhhhhHHhhhHHHHHHHHH Confidence 11112222222211110 00111111000 0 00000000000 00000000 Q ss_pred hhhhhhhhhcC-cccccccccccccccccccccccceeh-hhhHHhhhhhhhhheeeeecCCccceeeeeeeeeecCCCC Q lcl|Aclame:pro 49 MLNEVAVNSLG-AGTIAPAGSALGSANTGGLAGFDPVLI-SLVRRAMPNLMAYDVCGVQPMSGPTGLIFAMRSRYENQAG 126 (468) Q Consensus 49 ~l~e~~~~~~~-~~~~~~~~~i~~st~tg~i~~~~P~Lv-~l~RRa~~~LI~~DI~GVQPmTGPTGLIFAMRsrY~~qsG 126 (468) ++.+....... ..-.........++++++. -.-|.+. .+++...+..+-.+++.+-||++++.-+ .| ..+.. T Consensus 115 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~-lvp~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~--~~--~~~~~- 188 (418) T protein:vir:10 115 SARKSVRVRVDRKSIMNVPATVGSGVSGSNS-LVVADRQAGIIAPPQRKMTIRDLLMPGQTSSSSIEY--TV--ETGFT- 188 (418) T ss_pred HHhhhhhhhhHHHHHHHhhhhccCCCCCCcc-ccchhHHHHHHHHHhhhhhHHhhcceeeccCCceeE--EE--EecCC- Confidence 00000000000 0000000111111111111 1222221 3445555667788899999998775321 11 00000 Q ss_pred cccccccCCccccccccccccccccccCccccCCCccccccccccccccccccccccchhhhhccCCCCcchhhcceEEE Q lcl|Aclame:pro 127 EEALFNEPDTGFTGGYDASQGDYAVRTGAGVGGDSEGNNPALLNDAAPGTYEVGSKMPREDLERMGEANRLFREMSFSIE 206 (468) Q Consensus 127 ~EA~fnEa~t~fSg~~~~~~~~~~~~~~~~~~~~~~gt~~~~~~~a~~~~~t~~~gm~Ta~aE~lG~~~~~f~EMaFsIe 206 (468) . ...| ++ | +...++-..+++ T Consensus 189 ~-------~a~~----------------------------------------------v~--E-----~~~~~~~~~~f~ 208 (418) T protein:vir:10 189 N-------NAAA----------------------------------------------VA--E-----GAQKPTSDLKFN 208 (418) T ss_pred C-------ceee----------------------------------------------ec--c-----Ccccccccccee Confidence 0 0000 00 1 112233344556 Q ss_pred EEEEEeecccccccccHHHHHHHHHhcCCChhHHHHHHHHHHHHHHhhHHHHHHHhhhhhcc-cccccc-cccc--cccc Q lcl|Aclame:pro 207 KTSVTAQSRALKAEYTLELAQDLKAIHGLDAEQELANILSSEVLAEINREVVRRVYTVAKKG-AQNNVA-NAGI--FDLD 282 (468) Q Consensus 207 K~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAE~ELanILStEImlEINREii~~l~~va~~~-k~~~~~-~~g~--~Dl~ 282 (468) +++..+|.-+-...+|-||.||.- |.++.|.+-|+..|..-+|+-||.- ...+ +..|+. ..++ .... T Consensus 209 ~v~~~~~k~~~~~~is~ell~ds~-----~l~~~i~~~l~~a~~~~~d~a~l~G----~g~~~~p~Gi~~~~~~~~~~~~ 279 (418) T protein:vir:10 209 LKNQPVRTIAHLFKASRQILDDAP-----ALQSYIDGRARYGLQLTEEGQILKG----DGTGANILGILPQASAFMPSIT 279 (418) T ss_pred eEEEeeeeEEEeehhhHHHHHhHH-----HHHHHHHHHHHHHHHHHHHHHHhcc----CCCCcccccccccccccccccc Confidence 666666666667789999999852 4677788878777777777777632 1111 111210 0111 1111 Q ss_pred ccccchhHHHHHHHHHHHHHHHHHHHHHhhcCCCccEEEEchhHHHHHHhhcccccccccccccccccccccccCceeEE Q lcl|Aclame:pro 283 VDSNGRWSVEKFKGLLFQVERDANAIAQETRRGKGNFLICSADVASALAMAGVLDYSSGLNGAGGPSIGEVDDTGNLAVG 362 (468) Q Consensus 283 ~~~~grw~~e~~k~L~~~i~~ean~i~~~T~r~~~n~~v~S~~Va~~L~~sG~~~~~~~~~~~~~~~~~~~d~t~~~~~G 362 (468) +.... .+.....+++.+ ...-+..+-+||+|.....|... -+ +. +..+.. +.+.. -.| T Consensus 280 ~~~~~--~~~~i~~~~~~~---------~~~~~~~~~~v~n~~~~~~L~~l--kd-~~------G~~i~~-~~~~~-~~~ 337 (418) T protein:vir:10 280 LANAT--PIDKIRLALLQA---------VLAEFPATGIVLNPIDWASIELT--KD-SQ------GRYIVG-NPVNG-TTP 337 (418) T ss_pred ccccc--cHHHHHHHHHhh---------ccccCCCCEEEEcHHHHHHHHHh--hc-CC------Cceecc-ccccC-CCc Confidence 11111 122222232222 12235566799999999888752 11 10 111111 11111 125 Q ss_pred EecCceEEEEcccccccCCcceEEEEEecCCc-----ccceeEeeccchhhcccccCCccccceeeeeeeeeeee-cC-- Q lcl|Aclame:pro 363 TINGRIKVFVDPYAANLSDKHYYVIGYKGTSP-----YDAGLFYCPYVPLQMVRSIDPNTFQPKIGFKTRYGMVS-NP-- 434 (468) Q Consensus 363 ~l~g~~~vy~D~Ya~~~~~~dY~~vG~KG~~~-----~d~glfyaPYv~l~~~~~~dp~s~qP~~g~~tRY~l~~-nP-- 434 (468) +|. |++|+++.+.. .+=+++|--.... .+-.+=..||....| ...+=.+=+..|++..+ +| T Consensus 338 ~l~-G~pV~~~~~~p----~~~~~~gd~s~~~~~~~~~~~~i~~~~~~~~~f------~~~~~~~r~~~~~d~~~~~~~a 406 (418) T protein:vir:10 338 RLW-NLPVVETQAMT----ANEFLVGAFSMAAQIFDRMEIEVLLSTENVDDF------EKNMVSIRAEERLALAVYRPES 406 (418) T ss_pred eec-ceeeEEcCCCC----CCcEEEeeccceEEEEEecceEEEEecccchhh------hcCceEEEEEEeeccEEecccc Confidence 665 47888886543 2223333210000 000111122211111 11222333445666543 34 Q ss_pred cccccCccccccch Q lcl|Aclame:pro 435 FVTTNGLYNGTPDG 448 (468) Q Consensus 435 ~~~~~~~~~~~~~~ 448 (468) |+...- .....| T Consensus 407 ~~~~~~--~~~~~g 418 (418) T protein:vir:10 407 FVTGAL--VEQAGG 418 (418) T ss_pred eEEEEe--ccCCCC Confidence 321110 000111 No 76 >protein:vir:81160 Length: 371 # NCBI annotation: major capsid protein # Family: family:all:21 # MgeID: mge:1892 # MgeName: Geobacillus virus E2 # Cross-refs: genbank:acc:YP_001285811;genbank:gi:148747732;genbank:GeneID:5247203 Probab=62.27 E-value=0.34 Score=23.17 Aligned_cols=321 Identities=13% Similarity=0.067 Sum_probs=128.3 Q ss_pred CcchHHHHHhhhhhhCCCccchhcchhhh-HHHHHHHhHHHHHHhhhhhh------hh---hhhhhhcCccccccccccc Q lcl|Aclame:pro 1 MFNAEHLQEKWSPVLNHGEAPAIGDRYKR-AVTSVLLENQERFLREERGM------LN---EVAVNSLGAGTIAPAGSAL 70 (468) Q Consensus 1 ~~~~~~l~~kw~p~l~~~~~~~i~~~~~~-~~~~~llenq~~~~~~~~~~------l~---e~~~~~~~~~~~~~~~~i~ 70 (468) +.+.++ .+.|.-+.. ||.+..++ +....+.|.+.+..+..... .. .+....+..+. ..... T Consensus 22 ~~~~~~-~e~~~~~~~-----ei~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~---~~a~~ 92 (371) T protein:vir:81 22 LLAENK-IEEAKKLKE-----EIVALQEKFDVAKELYEEQKQTIEDKEPLKPTVQVKENEVEAFVNHIRTRF---RNAMS 92 (371) T ss_pred HhhHHH-HHHHHHHHH-----HHHHHHHHHHHHHHHHHHHHHhhccccccccchhhHHHHHHHHHHHHHHHH---HHhhc Confidence 222222 344544332 23322111 11122222221111110000 00 00000000000 01111 Q ss_pred ccc-cccccccccceehhhhHHhhhhhhhhheeeeecCCccceeeeeeeeeecCCCCcccccccCCcccccccccccccc Q lcl|Aclame:pro 71 GSA-NTGGLAGFDPVLISLVRRAMPNLMAYDVCGVQPMSGPTGLIFAMRSRYENQAGEEALFNEPDTGFTGGYDASQGDY 149 (468) Q Consensus 71 ~st-~tg~i~~~~P~Lv~l~RRa~~~LI~~DI~GVQPmTGPTGLIFAMRsrY~~qsG~EA~fnEa~t~fSg~~~~~~~~~ 149 (468) .++ ++|++.--....-.+++...+...-.+++++.||++.++-+.-.+ ..+. .++ .| T Consensus 93 ~~t~~~gg~~vP~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~~~~~~~~--~~~~--~~a-------~~----------- 150 (371) T protein:vir:81 93 EGSNQDGGYTVPQDIQTRINELRESKDALQNLITVEPVTTLSGSRVFKK--RSQQ--TGF-------VE----------- 150 (371) T ss_pred cCCCccCceeecHhHHHHHHHHHHhhhhhhhhceeeeccCCceeEEEEe--ecCC--cce-------ee----------- Confidence 122 222221111111235555667778889999999998877654333 1110 000 00 Q ss_pred ccccCccccCCCccccccccccccccccccccccchhhhhccCCC-CcchhhcceEEEEEEEEeecccccccccHHHHHH Q lcl|Aclame:pro 150 AVRTGAGVGGDSEGNNPALLNDAAPGTYEVGSKMPREDLERMGEA-NRLFREMSFSIEKTSVTAQSRALKAEYTLELAQD 228 (468) Q Consensus 150 ~~~~~~~~~~~~~gt~~~~~~~a~~~~~t~~~gm~Ta~aE~lG~~-~~~f~EMaFsIeK~tVtAKSRaLKAEYT~ELAQD 228 (468) .++++...+. ...|.+..++..|..+ ...+|-||.+| T Consensus 151 -----------------------------------v~Eg~~~~~~~~~~f~~i~~~~~k~~~-------~~~iS~ell~d 188 (371) T protein:vir:81 151 -----------------------------------VAEGAAIGEKATPQFTLLQYQVKKYAG-------FFRVTNELLND 188 (371) T ss_pred -----------------------------------eccccccccccccceeeEEeeeeEEEE-------eehhhHHHHhh Confidence 0000111111 1235555555555554 45799999998 Q ss_pred HHHhcCCChhHHHHHHHHHHHHHHhhHHHHHHHhhhhhccccccccccccccccccccchhHHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 229 LKAIHGLDAEQELANILSSEVLAEINREVVRRVYTVAKKGAQNNVANAGIFDLDVDSNGRWSVEKFKGLLFQVERDANAI 308 (468) Q Consensus 229 LkAiHGLDAE~ELanILStEImlEINREii~~l~~va~~~k~~~~~~~g~~Dl~~~~~grw~~e~~k~L~~~i~~ean~i 308 (468) -. .|.++.|.+.|...|..-+|+.|+.-.-+.+ ..|+..++ ..+.++..... . T Consensus 189 s~----~~l~~~i~~~l~~a~~~~~~~~i~~g~g~~~---------~~~~~~~~----------~i~~~~~~~l~---~- 241 (371) T protein:vir:81 189 ST----EAIVNTLVRWIGDESRVTRNGLIINVLNTKA---------KTAIADLD----------GLKQIINVQLD---P- 241 (371) T ss_pred hh----HHHHHHHHHHHHHHHHHHHHHHHHhhccccc---------ccccccHH----------HHHHHHHhhcc---h- Confidence 53 4678889999999998888888877433221 22222221 11222111000 0 Q ss_pred HHhhcCCCccEEEEchhHHHHHHhhcccccccccccccccccccccccCceeEEEecCceEEEEcccccccCCcceEEEE Q lcl|Aclame:pro 309 AQETRRGKGNFLICSADVASALAMAGVLDYSSGLNGAGGPSIGEVDDTGNLAVGTINGRIKVFVDPYAANLSDKHYYVIG 388 (468) Q Consensus 309 ~~~T~r~~~n~~v~S~~Va~~L~~sG~~~~~~~~~~~~~~~~~~~d~t~~~~~G~l~g~~~vy~D~Ya~~~~~~dY~~vG 388 (468) --+ ....+|++|.....|... ..+. +....+.+-+.. ..|+|. |++|++.. ++..| T Consensus 242 ---~~~-~~a~~vmn~~~~~~L~~l---kd~~------g~~l~~~~~~~~-~~~~l~-G~pV~~~~---------~~~~~ 297 (371) T protein:vir:81 242 ---VFR-STSSVIVNQDAFNWLDTL---KDQN------GQYLLQPSISSP-TGRQLL-GLPVVIVS---------NKVLA 297 (371) T ss_pred ---hhh-cCCEEEEcHHHHHHHHHh---hccC------CCeeeecccCCC-CCceec-ceeEEEec---------ccccC Confidence 011 223688999998888752 2110 111111111111 236775 56676652 23333 Q ss_pred EecC---CcccceeEeeccch-------hhcccccCCcc------ccceeeeeeeeeeee-cCcccccCccccccchhhh Q lcl|Aclame:pro 389 YKGT---SPYDAGLFYCPYVP-------LQMVRSIDPNT------FQPKIGFKTRYGMVS-NPFVTTNGLYNGTPDGEAL 451 (468) Q Consensus 389 ~KG~---~~~d~glfyaPYv~-------l~~~~~~dp~s------~qP~~g~~tRY~l~~-nP~~~~~~~~~~~~~~~~~ 451 (468) ..+. ..-...++|+.+.. ..+...+++.. -|=.+-...|++..+ +|= T Consensus 298 ~~~~~~~~~~~~~i~~Gd~~~~~~~~~~~~~~i~~~~~~~~~f~~~~v~~~~~~r~d~~~~~~~---------------- 361 (371) T protein:vir:81 298 NRVDGGTGAQFAPIIVGDLKEAVVMFDRQRTEIMSSNVAMDAFETDATLWRAIERMDVKMRDDE---------------- 361 (371) T ss_pred ccccccccCCcceEEEEehhceEEEEeecceEEEEeccccchhhcCceEEEEEEeeccEEeccc---------------- Confidence 3321 11122344444321 11222223322 233445555666533 330 Q ss_pred hhhcccceeeeeeeec Q lcl|Aclame:pro 452 TPNANMYYRRVQVTNL 467 (468) Q Consensus 452 ~~~an~y~~r~~v~~l 467 (468) .|.++.++.= T Consensus 362 ------a~~~~~~~~A 371 (371) T protein:vir:81 362 ------AFVFGEVQLA 371 (371) T ss_pred ------ceEEEEEecC Confidence 1222222222 No 77 >protein:vir:80376 Length: 435 # NCBI annotation: gp6, major capsid head protein # Family: family:all:21 # MgeID: mge:1881 # MgeName: phi644-2 # Cross-refs: genbank:acc:YP_001111085;genbank:gi:134288639;genbank:GeneID:4960624 Probab=60.99 E-value=0.36 Score=23.00 Aligned_cols=344 Identities=14% Similarity=0.078 Sum_probs=113.6 Q ss_pred Cc-chHHHHHhhhhhh-----------------------------CCCccchhcchhhhHHHHHHHhHHHHHHhhhhhhh Q lcl|Aclame:pro 1 MF-NAEHLQEKWSPVL-----------------------------NHGEAPAIGDRYKRAVTSVLLENQERFLREERGML 50 (468) Q Consensus 1 ~~-~~~~l~~kw~p~l-----------------------------~~~~~~~i~~~~~~~~~~~llenq~~~~~~~~~~l 50 (468) |. .-++|.++..-+- ..+..++ .|..-.++++ +.+...++.+ T Consensus 41 l~~ei~~l~~~i~~~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~----~~~~~~~~~~ 112 (435) T protein:vir:80 41 LSSKFNELTAQIERAEAAERMAAAAAVPVDPNPAAVTASAAAPVYAQPKAPE----VKGAKMARMV----RALAAARGDA 112 (435) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhcccccchhhhhccccccccccccchhh----hhHHHHHHHH----HHHHhccchh Confidence 11 1122332222111 0000000 1111111111 1111111111 Q ss_pred hhhhhhhcCccccccc-ccccc-cccccccccccceehhhhHHhhhhhhhhhe-eeeecCCccceeeeeeeeeecCCCCc Q lcl|Aclame:pro 51 NEVAVNSLGAGTIAPA-GSALG-SANTGGLAGFDPVLISLVRRAMPNLMAYDV-CGVQPMSGPTGLIFAMRSRYENQAGE 127 (468) Q Consensus 51 ~e~~~~~~~~~~~~~~-~~i~~-st~tg~i~~~~P~Lv~l~RRa~~~LI~~DI-~GVQPmTGPTGLIFAMRsrY~~qsG~ 127 (468) .++....+........ ..+.. +...|++.--....-.++++..+..+...+ +=+-||+.+. +-+... . ++. T Consensus 113 ~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~lvP~~~~~~ii~~l~~~~~i~~~~~~~v~~~~~~-~~~p~~---~--~~~ 186 (435) T protein:vir:80 113 QLASKLAIERGFGEEVAMSLNTLSPGAGGVLVPENLSSEVIELLRPKSVVRKLGARTLPLSNGN-ITIPRL---K--GGA 186 (435) T ss_pred HHHHHHHHhhhhhhhhhhhhcccCCCCCccccchhHHHHHHHHHhhhchhhhccceeeecCCCc-eEEEEE---e--CCc Confidence 1110000000000000 00111 111121111011101133333344444444 2233443322 111100 0 000 Q ss_pred ccccccCCccccccccccccccccccCccccCCCccccccccccccccccccccccchhhhhccCCCCcchhhcceEEEE Q lcl|Aclame:pro 128 EALFNEPDTGFTGGYDASQGDYAVRTGAGVGGDSEGNNPALLNDAAPGTYEVGSKMPREDLERMGEANRLFREMSFSIEK 207 (468) Q Consensus 128 EA~fnEa~t~fSg~~~~~~~~~~~~~~~~~~~~~~gt~~~~~~~a~~~~~t~~~gm~Ta~aE~lG~~~~~f~EMaFsIeK 207 (468) ++ .| . +| +..+++...++++ T Consensus 187 ~a-------~~----------------------------------------------v--~E-----~~~~~~~~~~f~~ 206 (435) T protein:vir:80 187 IV-------GY----------------------------------------------I--GA-----DTDIPTTQQQFDD 206 (435) T ss_pred ce-------ee----------------------------------------------e--cc-----Cccccccccceee Confidence 00 00 0 01 1234555556666 Q ss_pred EEEEeecccccccccHHHHHHHHHhcCCChhHHHHHHHHHHHHHHhhHHHHHHHhhhhhcc-cccccccc-ccccccccc Q lcl|Aclame:pro 208 TSVTAQSRALKAEYTLELAQDLKAIHGLDAEQELANILSSEVLAEINREVVRRVYTVAKKG-AQNNVANA-GIFDLDVDS 285 (468) Q Consensus 208 ~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAE~ELanILStEImlEINREii~~l~~va~~~-k~~~~~~~-g~~Dl~~~~ 285 (468) ++...+.-+-....|-||.+|-.- +.|.|+.|.+-|+..|...+++-||.- ...+ ...|+.+. ++.-....+ T Consensus 207 i~~~~~k~~~~~~is~ell~ds~~--~~~l~~~i~~~l~~a~~~~~d~a~l~G----~G~~~~p~Gi~~~~~~~~~~~~~ 280 (435) T protein:vir:80 207 LKLTAKKMAALVPIANDLIKYAGV--NPNVDQIVVGDLTAAIGAREDKAFIRD----DGTANTPKGLRFWALPGNVITAS 280 (435) T ss_pred EEEeeEEEEEeehhhHHHHHhhcc--cHHHHHHHHHHHHHHHHHHHHHHhhcc----CCCCCcccceeecccccceeecc Confidence 777777666777899999999432 456788888888888888888777643 1111 11221110 000000111 Q ss_pred cchhHHHHHHHHHHHHHHHHHHHHHhhcCCCccEEEEchhHHHHHHhhcccccccccccccccccccccccCceeEEEec Q lcl|Aclame:pro 286 NGRWSVEKFKGLLFQVERDANAIAQETRRGKGNFLICSADVASALAMAGVLDYSSGLNGAGGPSIGEVDDTGNLAVGTIN 365 (468) Q Consensus 286 ~grw~~e~~k~L~~~i~~ean~i~~~T~r~~~n~~v~S~~Va~~L~~sG~~~~~~~~~~~~~~~~~~~d~t~~~~~G~l~ 365 (468) ++ .........+.+....+...........+|++|.....|... ..+ + |..+.. +.++ |+|. T Consensus 281 ~~----~~~~~~~~d~~~~~~~~~~~~~~~~~~~~vmn~~~~~~L~~l---kd~---~---G~~l~~-~~~~----~~l~ 342 (435) T protein:vir:80 281 DG----STLQKIETDLGKAILALENADANLTQPGWIMAPRTFRFLEGL---RDG---N---GNKVYP-ELAN----GMLK 342 (435) T ss_pred cc----cchhhHHHHHHHHHHHhhccccccccCEEEEcHHHHHHHHhh---hcc---C---Cceecc-CCCC----CeEe Confidence 11 011111111112111111111122345678999999998763 211 1 111121 1222 4554 Q ss_pred CceEEEEcccccccC------------CcceEEEEEecCCcccceeEeeccchhhcccccCCccc---cceeeeeeeeee Q lcl|Aclame:pro 366 GRIKVFVDPYAANLS------------DKHYYVIGYKGTSPYDAGLFYCPYVPLQMVRSIDPNTF---QPKIGFKTRYGM 430 (468) Q Consensus 366 g~~~vy~D~Ya~~~~------------~~dY~~vG~KG~~~~d~glfyaPYv~l~~~~~~dp~s~---qP~~g~~tRY~l 430 (468) +++||++.+.-.+. ++.+++||-.++...+ ..+|.-+......--..| +=.+=..-|++. T Consensus 343 -G~pv~~~~~~p~~~~~~~~~~~i~~gd~s~~~i~~~~~~~i~----~~~~~~~~~~~~~~~~~f~~n~~~~r~~~r~d~ 417 (435) T protein:vir:80 343 -GYPVGKTTQVPINLGEAGKESEIYFTDFGDVFIGEEETLEID----YSKEATYKDADGHMVSAFQRDQTLIRVIAKNDF 417 (435) T ss_pred -eeeeEEeccccccccCCCCcceEEEEEcccEEEEeecceEEE----EeccccccccccchhhhhhcCcceeeeeeeeCc Confidence 47888875532110 0111223333322211 111111000000000001 112223445554 Q ss_pred ee-cCcccccCccccccchhhhhh Q lcl|Aclame:pro 431 VS-NPFVTTNGLYNGTPDGEALTP 453 (468) Q Consensus 431 ~~-nP~~~~~~~~~~~~~~~~~~~ 453 (468) .+ +|=+ =..+++-.|++ T Consensus 418 ~~~~~~a------~~~l~~~~~~~ 435 (435) T protein:vir:80 418 GPRHVES------IAVLSGVAWGA 435 (435) T ss_pred Eeecccc------eEEEeccCCCC Confidence 44 2311 01234444554 No 78 >protein:vir:7855 Length: 497 # NCBI annotation: gp12 # Family: family:all:585 # MgeID: mge:150 # MgeName: CJW1 # Cross-refs: genbank:acc:NP_817462;genbank:gi:29565891;genbank:GeneID:1259081 Probab=59.83 E-value=0.38 Score=22.86 Aligned_cols=355 Identities=13% Similarity=0.050 Sum_probs=135.4 Q ss_pred CcchHHHHHhhhhhhCCCc--cchhcch------hhhHHHHH---HHhHHHHHHhhhh-hhhh-hhh----------h-- Q lcl|Aclame:pro 1 MFNAEHLQEKWSPVLNHGE--APAIGDR------YKRAVTSV---LLENQERFLREER-GMLN-EVA----------V-- 55 (468) Q Consensus 1 ~~~~~~l~~kw~p~l~~~~--~~~i~~~------~~~~~~~~---llenq~~~~~~~~-~~l~-e~~----------~-- 55 (468) ....+++.+++..++.... ..+|... ..+.-..+ +.|...+...... .... +.. . T Consensus 53 ~~~~~~~~~~~~~~~a~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 132 (497) T protein:vir:78 53 HERAQEMLKSLGGADAAKDGLDNDIPEVEVRNLKQIRKHLARAVIMNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAA 132 (497) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhHHHHHHHHHhhhHHHHhhhhhhhhhhhhhhhhhhhhhhhhHHHHH Confidence 3333444444444433211 0111110 00000011 0010000000000 0000 000 0 Q ss_pred ---hhcCccc---cccccccccccccccc---ccccceehhhhHHhhhhhhhhheeeeecCCccceeeeeeeeeecCCCC Q lcl|Aclame:pro 56 ---NSLGAGT---IAPAGSALGSANTGGL---AGFDPVLISLVRRAMPNLMAYDVCGVQPMSGPTGLIFAMRSRYENQAG 126 (468) Q Consensus 56 ---~~~~~~~---~~~~~~i~~st~tg~i---~~~~P~Lv~l~RRa~~~LI~~DI~GVQPmTGPTGLIFAMRsrY~~qsG 126 (468) ..+..+. .........++++++. ..+.+.++.+.| +..+..+++.+-||+++..- |... .+... T Consensus 133 ~~~~~~~~~~~~~~~~~~~~~~~~~~gg~~vp~~~~~~ii~~~~---~~~~i~~l~~~~~~~~~~~~-~~~~--~~~~~- 205 (497) T protein:vir:78 133 ELMGAFADGETAPAAIGQNPFGSTGTFAPGILPTFLPGIVEQLF---YELSLADLISSRPVTSPNLS-YLTE--SAAHN- 205 (497) T ss_pred HHHHHHhhhhhhHHHHHhhhcccCcccccccchhhhHHHHHHHH---hhhhHHhhccccccCCCceE-EEEE--cCCCC- Confidence 0000000 0011111122233332 133344444444 55567899999999887532 2111 00000 Q ss_pred cccccccCCccccccccccccccccccCccccCCCccccccccccccccccccccccchhhhhccCCCCcchhhcceEEE Q lcl|Aclame:pro 127 EEALFNEPDTGFTGGYDASQGDYAVRTGAGVGGDSEGNNPALLNDAAPGTYEVGSKMPREDLERMGEANRLFREMSFSIE 206 (468) Q Consensus 127 ~EA~fnEa~t~fSg~~~~~~~~~~~~~~~~~~~~~~gt~~~~~~~a~~~~~t~~~gm~Ta~aE~lG~~~~~f~EMaFsIe 206 (468) ++ . + . +| +..+++...+++ T Consensus 206 -~a-------~-----------------------------w-----------------v--~E-----~~~~~~s~~~f~ 224 (497) T protein:vir:78 206 -NA-------A-----------------------------A-----------------V--AE-----AGTYPFSSEEFA 224 (497) T ss_pred -cc-------e-----------------------------e-----------------e--cc-----Ccccccccccce Confidence 00 0 0 0 01 223455566677 Q ss_pred EEEEEeecccccccccHHHHHHHHHhcCCChhHHHHHHHHHHHHHHhhHHHHHH--------Hhhhhhcccccccc---- Q lcl|Aclame:pro 207 KTSVTAQSRALKAEYTLELAQDLKAIHGLDAEQELANILSSEVLAEINREVVRR--------VYTVAKKGAQNNVA---- 274 (468) Q Consensus 207 K~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAE~ELanILStEImlEINREii~~--------l~~va~~~k~~~~~---- 274 (468) ++++.+|.-+-...+|-||++|-- +.++.|.+-|...|..-+|+.||.- |.+.+....+.... T Consensus 225 ~i~~~~~k~a~~~~iS~ell~d~~-----~l~~~i~~~l~~~i~~~~d~~~l~G~G~~~p~Gil~~~~~~~~~~~~~~~~ 299 (497) T protein:vir:78 225 RVYEQVGKVANALTITDEGLRDAP-----ELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFG 299 (497) T ss_pred eeEeeeeeeEeecHhHHHHHHhHH-----HHHHHHHHHHHHHHHHHHHHHhhcCCCcccccccccccccccccccccchh Confidence 788888877777899999999942 3789999999999999999988862 22211111110000 Q ss_pred ----ccccccccccccchhHHHH-----HHH----------------------HHHHHHHHHHHHHHhhcCCCccEEEEc Q lcl|Aclame:pro 275 ----NAGIFDLDVDSNGRWSVEK-----FKG----------------------LLFQVERDANAIAQETRRGKGNFLICS 323 (468) Q Consensus 275 ----~~g~~Dl~~~~~grw~~e~-----~k~----------------------L~~~i~~ean~i~~~T~r~~~n~~v~S 323 (468) ..+..++..+..+.|.+.. .+. ...--...+-....++....++.+|.+ T Consensus 300 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vmn 379 (497) T protein:vir:78 300 ATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMN 379 (497) T ss_pred hhhhhhhhhhhhcccccchhhhhhHHHHHHHHHhhhhhhhhccchhccccchhhhhhHHHHHHhhhhhhcccCCCeEEEc Confidence 0000111111111111110 000 000011122222334555677778888 Q ss_pred hhHHHHHHhh----cccccccccccccccccccccccCceeEEEecCceEEEEcccccccCCcceEEEEEecCC------ Q lcl|Aclame:pro 324 ADVASALAMA----GVLDYSSGLNGAGGPSIGEVDDTGNLAVGTINGRIKVFVDPYAANLSDKHYYVIGYKGTS------ 393 (468) Q Consensus 324 ~~Va~~L~~s----G~~~~~~~~~~~~~~~~~~~d~t~~~~~G~l~g~~~vy~D~Ya~~~~~~dY~~vG~KG~~------ 393 (468) |.-...|... |-..+.+......+. .....++|. |++|++.+... .-++ ++|--... T Consensus 380 ~~~~~~l~~lkd~~G~~i~~~~~~~~~~~--------~~~~~~~l~-G~pV~~t~~~~---~~~~-~~Gd~~~~~~~i~~ 446 (497) T protein:vir:78 380 PRDWELLRLTKDANGQYMGGNFFGNAYGN--------PVNGGKNIW-GVPVVTTPLIP---LGTI-LVGHFAPSVIQTAR 446 (497) T ss_pred hHHHHHHHHhhcCCCceeccCcccccccc--------cccCCceee-ceeeEecCCCC---CCce-EEeecccceEEEEE Confidence 8877777642 222222111111110 000112565 47777765432 1222 23311100 Q ss_pred cccceeEeeccchhhcccccCCccccceeeeeeeeee-eecC--cccccCccccccchh Q lcl|Aclame:pro 394 PYDAGLFYCPYVPLQMVRSIDPNTFQPKIGFKTRYGM-VSNP--FVTTNGLYNGTPDGE 449 (468) Q Consensus 394 ~~d~glfyaPYv~l~~~~~~dp~s~qP~~g~~tRY~l-~~nP--~~~~~~~~~~~~~~~ 449 (468) ..+-.+-..||....| .+.+=.+=+..|+++ +.+| |...+ -.....+. T Consensus 447 r~~~~v~~~~~~~~~f------~~n~v~~r~~~r~~~~v~~p~A~~~l~--~~~~~~~~ 497 (497) T protein:vir:78 447 REGVTMQMTNSNGTDF------VDGKVTVRAEERLGLLVYRPSAFQLIQ--LKKGATGS 497 (497) T ss_pred ecccEEEeecccchhh------hcCcEEEEEEEeecceeeccccEEEEE--ecCCccCC Confidence 0011122223211111 122333444678866 6778 43221 11111111 No 79 >protein:vir:101650 Length: 497 # NCBI annotation: gp13 # Family: family:all:585 # MgeID: mge:1515 # MgeName: 244 # Cross-refs: genbank:acc:YP_654768;genbank:gi:109302766;genbank:GeneID:4156084 Probab=59.83 E-value=0.38 Score=22.86 Aligned_cols=355 Identities=13% Similarity=0.050 Sum_probs=135.4 Q ss_pred CcchHHHHHhhhhhhCCCc--cchhcch------hhhHHHHH---HHhHHHHHHhhhh-hhhh-hhh----------h-- Q lcl|Aclame:pro 1 MFNAEHLQEKWSPVLNHGE--APAIGDR------YKRAVTSV---LLENQERFLREER-GMLN-EVA----------V-- 55 (468) Q Consensus 1 ~~~~~~l~~kw~p~l~~~~--~~~i~~~------~~~~~~~~---llenq~~~~~~~~-~~l~-e~~----------~-- 55 (468) ....+++.+++..++.... ..+|... ..+.-..+ +.|...+...... .... +.. . T Consensus 53 ~~~~~~~~~~~~~~~a~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 132 (497) T protein:vir:10 53 HERAQEMLKSLGGADAAKDGLDNDIPEVEVRNLKQIRKHLARAVIMNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAA 132 (497) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhHHHHHHHHHhhhHHHHhhhhhhhhhhhhhhhhhhhhhhhhHHHHH Confidence 3333444444444433211 0111110 00000011 0010000000000 0000 000 0 Q ss_pred ---hhcCccc---cccccccccccccccc---ccccceehhhhHHhhhhhhhhheeeeecCCccceeeeeeeeeecCCCC Q lcl|Aclame:pro 56 ---NSLGAGT---IAPAGSALGSANTGGL---AGFDPVLISLVRRAMPNLMAYDVCGVQPMSGPTGLIFAMRSRYENQAG 126 (468) Q Consensus 56 ---~~~~~~~---~~~~~~i~~st~tg~i---~~~~P~Lv~l~RRa~~~LI~~DI~GVQPmTGPTGLIFAMRsrY~~qsG 126 (468) ..+..+. .........++++++. ..+.+.++.+.| +..+..+++.+-||+++..- |... .+... T Consensus 133 ~~~~~~~~~~~~~~~~~~~~~~~~~~gg~~vp~~~~~~ii~~~~---~~~~i~~l~~~~~~~~~~~~-~~~~--~~~~~- 205 (497) T protein:vir:10 133 ELMGAFADGETAPAAIGQNPFGSTGTFAPGILPTFLPGIVEQLF---YELSLADLISSRPVTSPNLS-YLTE--SAAHN- 205 (497) T ss_pred HHHHHHhhhhhhHHHHHhhhcccCcccccccchhhhHHHHHHHH---hhhhHHhhccccccCCCceE-EEEE--cCCCC- Confidence 0000000 0011111122233332 133344444444 55567899999999887532 2111 00000 Q ss_pred cccccccCCccccccccccccccccccCccccCCCccccccccccccccccccccccchhhhhccCCCCcchhhcceEEE Q lcl|Aclame:pro 127 EEALFNEPDTGFTGGYDASQGDYAVRTGAGVGGDSEGNNPALLNDAAPGTYEVGSKMPREDLERMGEANRLFREMSFSIE 206 (468) Q Consensus 127 ~EA~fnEa~t~fSg~~~~~~~~~~~~~~~~~~~~~~gt~~~~~~~a~~~~~t~~~gm~Ta~aE~lG~~~~~f~EMaFsIe 206 (468) ++ . + . +| +..+++...+++ T Consensus 206 -~a-------~-----------------------------w-----------------v--~E-----~~~~~~s~~~f~ 224 (497) T protein:vir:10 206 -NA-------A-----------------------------A-----------------V--AE-----AGTYPFSSEEFA 224 (497) T ss_pred -cc-------e-----------------------------e-----------------e--cc-----Ccccccccccce Confidence 00 0 0 0 01 223455566677 Q ss_pred EEEEEeecccccccccHHHHHHHHHhcCCChhHHHHHHHHHHHHHHhhHHHHHH--------Hhhhhhcccccccc---- Q lcl|Aclame:pro 207 KTSVTAQSRALKAEYTLELAQDLKAIHGLDAEQELANILSSEVLAEINREVVRR--------VYTVAKKGAQNNVA---- 274 (468) Q Consensus 207 K~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAE~ELanILStEImlEINREii~~--------l~~va~~~k~~~~~---- 274 (468) ++++.+|.-+-...+|-||++|-- +.++.|.+-|...|..-+|+.||.- |.+.+....+.... T Consensus 225 ~i~~~~~k~a~~~~iS~ell~d~~-----~l~~~i~~~l~~~i~~~~d~~~l~G~G~~~p~Gil~~~~~~~~~~~~~~~~ 299 (497) T protein:vir:10 225 RVYEQVGKVANALTITDEGLRDAP-----ELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFG 299 (497) T ss_pred eeEeeeeeeEeecHhHHHHHHhHH-----HHHHHHHHHHHHHHHHHHHHHhhcCCCcccccccccccccccccccccchh Confidence 788888877777899999999942 3789999999999999999988862 22211111110000 Q ss_pred ----ccccccccccccchhHHHH-----HHH----------------------HHHHHHHHHHHHHHhhcCCCccEEEEc Q lcl|Aclame:pro 275 ----NAGIFDLDVDSNGRWSVEK-----FKG----------------------LLFQVERDANAIAQETRRGKGNFLICS 323 (468) Q Consensus 275 ----~~g~~Dl~~~~~grw~~e~-----~k~----------------------L~~~i~~ean~i~~~T~r~~~n~~v~S 323 (468) ..+..++..+..+.|.+.. .+. ...--...+-....++....++.+|.+ T Consensus 300 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vmn 379 (497) T protein:vir:10 300 ATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMN 379 (497) T ss_pred hhhhhhhhhhhhcccccchhhhhhHHHHHHHHHhhhhhhhhccchhccccchhhhhhHHHHHHhhhhhhcccCCCeEEEc Confidence 0000111111111111110 000 000011122222334555677778888 Q ss_pred hhHHHHHHhh----cccccccccccccccccccccccCceeEEEecCceEEEEcccccccCCcceEEEEEecCC------ Q lcl|Aclame:pro 324 ADVASALAMA----GVLDYSSGLNGAGGPSIGEVDDTGNLAVGTINGRIKVFVDPYAANLSDKHYYVIGYKGTS------ 393 (468) Q Consensus 324 ~~Va~~L~~s----G~~~~~~~~~~~~~~~~~~~d~t~~~~~G~l~g~~~vy~D~Ya~~~~~~dY~~vG~KG~~------ 393 (468) |.-...|... |-..+.+......+. .....++|. |++|++.+... .-++ ++|--... T Consensus 380 ~~~~~~l~~lkd~~G~~i~~~~~~~~~~~--------~~~~~~~l~-G~pV~~t~~~~---~~~~-~~Gd~~~~~~~i~~ 446 (497) T protein:vir:10 380 PRDWELLRLTKDANGQYMGGNFFGNAYGN--------PVNGGKNIW-GVPVVTTPLIP---LGTI-LVGHFAPSVIQTAR 446 (497) T ss_pred hHHHHHHHHhhcCCCceeccCcccccccc--------cccCCceee-ceeeEecCCCC---CCce-EEeecccceEEEEE Confidence 8877777642 222222111111110 000112565 47777765432 1222 23311100 Q ss_pred cccceeEeeccchhhcccccCCccccceeeeeeeeee-eecC--cccccCccccccchh Q lcl|Aclame:pro 394 PYDAGLFYCPYVPLQMVRSIDPNTFQPKIGFKTRYGM-VSNP--FVTTNGLYNGTPDGE 449 (468) Q Consensus 394 ~~d~glfyaPYv~l~~~~~~dp~s~qP~~g~~tRY~l-~~nP--~~~~~~~~~~~~~~~ 449 (468) ..+-.+-..||....| .+.+=.+=+..|+++ +.+| |...+ -.....+. T Consensus 447 r~~~~v~~~~~~~~~f------~~n~v~~r~~~r~~~~v~~p~A~~~l~--~~~~~~~~ 497 (497) T protein:vir:10 447 REGVTMQMTNSNGTDF------VDGKVTVRAEERLGLLVYRPSAFQLIQ--LKKGATGS 497 (497) T ss_pred ecccEEEeecccchhh------hcCcEEEEEEEeecceeeccccEEEEE--ecCCccCC Confidence 0011122223211111 122333444678866 6778 43221 11111111 No 80 >protein:vir:96223 Length: 324 # NCBI annotation: ORF011 # Family: family:all:507 # MgeID: mge:1607 # MgeName: 69 # Cross-refs: genbank:acc:YP_239571;genbank:gi:66395304;genbank:GeneID:5132771 Probab=57.37 E-value=0.44 Score=22.56 Aligned_cols=297 Identities=11% Similarity=0.091 Sum_probs=116.7 Q ss_pred hcchhhhHHHHHHHhHHHHHHhhhhhhhhhhhhhhcCcccccccccccccccccccccccceeh-hhhHHhhhhhhhhhe Q lcl|Aclame:pro 23 IGDRYKRAVTSVLLENQERFLREERGMLNEVAVNSLGAGTIAPAGSALGSANTGGLAGFDPVLI-SLVRRAMPNLMAYDV 101 (468) Q Consensus 23 i~~~~~~~~~~~llenq~~~~~~~~~~l~e~~~~~~~~~~~~~~~~i~~st~tg~i~~~~P~Lv-~l~RRa~~~LI~~DI 101 (468) |+...+ .+..+++....+-+ .++++.... ..++..+.+ .-|.+. .+++.+..+.+..++ T Consensus 1 ~~~~~~----------~~~~~~~f~~~~~~--~~~~~a~~~------~~~~~~~~l--ip~~~~~~ii~~~~~~s~l~~l 60 (324) T protein:vir:96 1 MEQTQK----------LKLNLQHFASNNVK--PQVFNPDNV------MMHEKKDGT--LLNDFTTPILQEVMENSKIMQL 60 (324) T ss_pred CCcchh----------hhHHHHHHHHhhhh--hhhcccccc------cccCCCcce--echhHHHHHHHHHHhhchhhhh Confidence 111111 11112211111111 112222211 111111211 112222 234555566778889 Q ss_pred eeeecCCccceeeeeeeeeecCCCCcccccccCCccccccccccccccccccCccccCCCcccccccccccccccccccc Q lcl|Aclame:pro 102 CGVQPMSGPTGLIFAMRSRYENQAGEEALFNEPDTGFTGGYDASQGDYAVRTGAGVGGDSEGNNPALLNDAAPGTYEVGS 181 (468) Q Consensus 102 ~GVQPmTGPTGLIFAMRsrY~~qsG~EA~fnEa~t~fSg~~~~~~~~~~~~~~~~~~~~~~gt~~~~~~~a~~~~~t~~~ 181 (468) +.+-||++++.-|.-.. . +.++ .| T Consensus 61 ~~~~~~~~~~~~~p~~~----~--~~~a-------~~------------------------------------------- 84 (324) T protein:vir:96 61 GKYEPMEGTEKKFTFWA----D--KPGA-------YW------------------------------------------- 84 (324) T ss_pred cceeeccCCceEEEEEe----c--Ccce-------ee------------------------------------------- Confidence 99999988764332111 0 0000 00 Q ss_pred ccchhhhhccCCCCcchhhcceEEEEEEEEeecccccccccHHHHHHHHHhcCCChhHHHHHHHHHHHHHHhhHHHHHHH Q lcl|Aclame:pro 182 KMPREDLERMGEANRLFREMSFSIEKTSVTAQSRALKAEYTLELAQDLKAIHGLDAEQELANILSSEVLAEINREVVRRV 261 (468) Q Consensus 182 gm~Ta~aE~lG~~~~~f~EMaFsIeK~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAE~ELanILStEImlEINREii~~l 261 (468) .++++........|.+..+.+.|..+ ....|-||.+|-. .|.+++|.+.|...|...+++.||.-- T Consensus 85 ---v~Eg~~~~~~~~~f~~v~~~~~k~~~-------~~~is~ell~ds~----~~l~~~i~~~l~~aia~~~d~~~l~G~ 150 (324) T protein:vir:96 85 ---VGEGQKIETSKATWVNATMRAFKLGV-------ILPVTKEFLNYTY----SQFFEEMKPMIAEAFYKKFDEAGILNQ 150 (324) T ss_pred ---ecCCccccccccceeEEEEEeEEEEE-------eehhhHHHHhcch----HHHHHHHHHHHHHHHHHHHHHHhhhcC Confidence 00111111122335555555555444 4558999999853 568889999999999999998888531 Q ss_pred hhhhhcccccccccccccccccccc----chhHHHHHHHHHHHHHHHHHHHHHhhcCCCccEEEEchhHHHHHHhhcccc Q lcl|Aclame:pro 262 YTVAKKGAQNNVANAGIFDLDVDSN----GRWSVEKFKGLLFQVERDANAIAQETRRGKGNFLICSADVASALAMAGVLD 337 (468) Q Consensus 262 ~~va~~~k~~~~~~~g~~Dl~~~~~----grw~~e~~k~L~~~i~~ean~i~~~T~r~~~n~~v~S~~Va~~L~~sG~~~ 337 (468) - .+....|++....... +--..+....+ ...+. ..-+..+.++|||.....|...- + T Consensus 151 g--------~~~~~~~~~~~~~~~~~~~~~~~~~~~i~~~-------~~~i~--~~~~~~~~~i~n~~~~~~L~~lk--d 211 (324) T protein:vir:96 151 G--------NNPFGKSIAQSIKKTNKVIKGDFTQDNIIDL-------EALLE--DDELEANAFISKTQNRSLLRKIV--D 211 (324) T ss_pred C--------CCCcCccccccccccceecccccchHHHHHH-------HHhhh--hccCCCCEEEEcHHHHHHHHHhh--C Confidence 1 1111111111100000 00001222222 22221 22355667999999999887641 1 Q ss_pred cccccccccccccccccccCceeEEEecCceEEEEcccccccCCcceE--------EEEEecCCcccceeEeeccchhhc Q lcl|Aclame:pro 338 YSSGLNGAGGPSIGEVDDTGNLAVGTINGRIKVFVDPYAANLSDKHYY--------VIGYKGTSPYDAGLFYCPYVPLQM 409 (468) Q Consensus 338 ~~~~~~~~~~~~~~~~d~t~~~~~G~l~g~~~vy~D~Ya~~~~~~dY~--------~vG~KG~~~~d~glfyaPYv~l~~ 409 (468) + ++ ...+. +..+ ++| .+++|++++... .+..-+ ++|..++-..+.+ .+ ..+ T Consensus 212 -~---~G---~~~~~-~~~~----~~l-~G~PV~~~~~~~--~~~~~~~~gd~s~~~~~~~~~~~i~~~----~~--~~~ 270 (324) T protein:vir:96 212 -P---ET---KERIY-DRNS----DSL-DGLPVVNLKSSN--LKRGELITGDFDKLIYGIPQLIEYKID----ET--AQL 270 (324) T ss_pred -C---CC---Ceeec-CCCC----Ccc-cceeeEeecCCC--CCcceEEEEecceEEEEEecCcEEEEe----ec--ccc Confidence 1 11 11111 1111 334 356777754321 112223 3333332211100 00 001 Q ss_pred ccccCCcc-----c---cceeeeeeeeee-eecC--cccccCc---cccccchh Q lcl|Aclame:pro 410 VRSIDPNT-----F---QPKIGFKTRYGM-VSNP--FVTTNGL---YNGTPDGE 449 (468) Q Consensus 410 ~~~~dp~s-----~---qP~~g~~tRY~l-~~nP--~~~~~~~---~~~~~~~~ 449 (468) ....|+.. | |=.+=..-||+. ..+| |+..... ...+|..- T Consensus 271 ~~~~~~~~~~~~~~~~n~v~~r~~~r~d~~v~~~~a~~~l~~a~~~~~~~~~~~ 324 (324) T protein:vir:96 271 STVKNEDGTPVNLFEQDMVALRATMHVALHIADDKAFAKLVPADKRTDSVPGEV 324 (324) T ss_pred cccccccccchhhhhcCcEEEEEEEEeccEEecccceEEEecccccCCCCCCCC Confidence 11111110 1 223334456766 4455 4322111 11112111 No 81 >protein:vir:3870 Length: 400 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:82 # MgeName: A2 # Cross-refs: genbank:acc:NP_680487;swissprot:trembl:q8ltc0;genbank:gi:22296527;interpro:IPR006444;uniprot:Q8LTC0;genbank:GeneID:951713 Probab=56.29 E-value=0.46 Score=22.43 Aligned_cols=324 Identities=13% Similarity=0.066 Sum_probs=124.1 Q ss_pred CcchHHHHHhhhhhhCCCccchhcchhhh-HHHHHHHhHHHHHHhhh---------h----------hh-hhh---h--h Q lcl|Aclame:pro 1 MFNAEHLQEKWSPVLNHGEAPAIGDRYKR-AVTSVLLENQERFLREE---------R----------GM-LNE---V--A 54 (468) Q Consensus 1 ~~~~~~l~~kw~p~l~~~~~~~i~~~~~~-~~~~~llenq~~~~~~~---------~----------~~-l~e---~--~ 54 (468) +-..+++++++.-+.+. |.+.-++ .....+.+..++..... + .. ..+ . . T Consensus 41 ~~~~~e~~~~~~~l~~e-----i~~l~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 115 (400) T protein:vir:38 41 LKKAEGVRAKYDKAGKE-----IKDLEEKRDLYEAALKGNEQSSGKKPDHPEEHSYRDALNAYLHTRGRNTDGVNFEKTD 115 (400) T ss_pred HHHHHHHHHHHHHHHHH-----HHHHHHHHHHHHHHHHHHhhcccccccchhhhhHHHHHHHHHhhHHHHHHHHHHHHHH Confidence 22334455555444322 2211111 01111111110000000 0 00 000 0 0 Q ss_pred hhhcCccc---cccccccccc--ccccccccccceehhhhHHhhhhhhhhheeeeecCCccceeeeeeeeeecCCCCccc Q lcl|Aclame:pro 55 VNSLGAGT---IAPAGSALGS--ANTGGLAGFDPVLISLVRRAMPNLMAYDVCGVQPMSGPTGLIFAMRSRYENQAGEEA 129 (468) Q Consensus 55 ~~~~~~~~---~~~~~~i~~s--t~tg~i~~~~P~Lv~l~RRa~~~LI~~DI~GVQPmTGPTGLIFAMRsrY~~qsG~EA 129 (468) ........ .........+ +++|++.--....-.++++..+..+..+++.+.||++.++-+--++. .++.-+ T Consensus 116 ~~~~~~~~~~~~~~~~~~~~~~~~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~----~~~~~~ 191 (400) T protein:vir:38 116 VGTFAVLRAVPTDASDAVNAGVKAADAASTIPETISNTPQRELQTVVDLKPFTNVFQASTQKGTYPTVAN----ATTKMV 191 (400) T ss_pred HHHHhhhhhhhHHHHHHHhhcccccCCcccccHHHHHHHHHHHHhhhhhhhcceeEeccCcceEEEEEec----CCCccc Confidence 00000000 0000111111 11122211111122234444466678889999999988775543331 000000 Q ss_pred ccccCCccccccccccccccccccCccccCCCccccccccccccccccccccccchhhhhccCCCCcchhhc-ceEEEEE Q lcl|Aclame:pro 130 LFNEPDTGFTGGYDASQGDYAVRTGAGVGGDSEGNNPALLNDAAPGTYEVGSKMPREDLERMGEANRLFREM-SFSIEKT 208 (468) Q Consensus 130 ~fnEa~t~fSg~~~~~~~~~~~~~~~~~~~~~~gt~~~~~~~a~~~~~t~~~gm~Ta~aE~lG~~~~~f~EM-aFsIeK~ 208 (468) .+. | +...++. ..+++.+ T Consensus 192 ~~~--------------------------------------------------------E-----~~~~~~~~~~~f~~i 210 (400) T protein:vir:38 192 TVA--------------------------------------------------------E-----LEKNPAMAKPEFKPV 210 (400) T ss_pred ccc--------------------------------------------------------c-----cccccccccccceee Confidence 000 0 0001111 1233344 Q ss_pred EEEeecccccccccHHHHHHHHHhcCCChhHHHHHHHHHHHHHHhhHHHHHHHhhhhhccccccccccccccccccccch Q lcl|Aclame:pro 209 SVTAQSRALKAEYTLELAQDLKAIHGLDAEQELANILSSEVLAEINREVVRRVYTVAKKGAQNNVANAGIFDLDVDSNGR 288 (468) Q Consensus 209 tVtAKSRaLKAEYT~ELAQDLkAiHGLDAE~ELanILStEImlEINREii~~l~~va~~~k~~~~~~~g~~Dl~~~~~gr 288 (468) +..++.-+-...+|-||.+|- ..|.+++|.+-|...|...+|+-|+.-.-. .+..++..++ T Consensus 211 ~~~~~k~~~~~~is~ell~ds----~~~~~~~i~~~l~~~~~~~~~~~i~~~~~~---------~~~~~~~~~~------ 271 (400) T protein:vir:38 211 NWSVETYRQALPVSQESIDDS----AIDLVGLIAQNGQQIKVNTTNGAVATLLKG---------FTAKTISSVD------ 271 (400) T ss_pred EeehhheeeehhhHHHHHhhh----HHHHHHHHHHHHHHHHHHHHHHhhhhcccc---------ccccccccHH------ Confidence 445555555678999999985 347888999999999998888888754322 1122222111 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHhhcCCCccEEEEchhHHHHHHhhcccccccccccccccccccccccCceeEEEecCce Q lcl|Aclame:pro 289 WSVEKFKGLLFQVERDANAIAQETRRGKGNFLICSADVASALAMAGVLDYSSGLNGAGGPSIGEVDDTGNLAVGTINGRI 368 (468) Q Consensus 289 w~~e~~k~L~~~i~~ean~i~~~T~r~~~n~~v~S~~Va~~L~~sG~~~~~~~~~~~~~~~~~~~d~t~~~~~G~l~g~~ 368 (468) ....++... . .. ... ..+|++|.....|... ..+ + |..+.+.+-++.. .++|. |+ T Consensus 272 ----~~~~~~~~~-~-------~~-~~~-a~~v~~~~~~~~l~~l---kd~---~---G~~i~~~~~~~~~-~~~l~-G~ 326 (400) T protein:vir:38 272 ----DLKHINNVD-L-------DP-AYS-RVIIASQSFYNFLDTV---KDG---N---GRYLLQDSILTPS-GKSVL-GM 326 (400) T ss_pred ----HHHHHHHhh-h-------hh-hhC-cEEEEcHHHHHHHHHh---hcc---C---CCeeeecCcCCCC-ccccc-cc Confidence 112221111 1 11 122 3567899888888752 111 0 1111111111111 24564 45 Q ss_pred EEEEcccccc-cCCcceEEEEEecCCcccceeEeeccchhhcccccCCccccceeeeeeeeeeee-cC--cccccCcccc Q lcl|Aclame:pro 369 KVFVDPYAAN-LSDKHYYVIGYKGTSPYDAGLFYCPYVPLQMVRSIDPNTFQPKIGFKTRYGMVS-NP--FVTTNGLYNG 444 (468) Q Consensus 369 ~vy~D~Ya~~-~~~~dY~~vG~KG~~~~d~glfyaPYv~l~~~~~~dp~s~qP~~g~~tRY~l~~-nP--~~~~~~~~~~ 444 (468) +|++...... ......+++|--.. .+.......+ -....|-..|+..+-...|++..+ +| |.... T Consensus 327 pv~~~~~~~~~~~g~~~~~~gd~s~-----~~~~~~~~~~-~~~~~~~~~~~~~~~~~~r~d~~~~~~~a~~~l~----- 395 (400) T protein:vir:38 327 PIAVVSDDTLGAAGEAHAFLGDIKR-----AILFANRADF-MVRWVDDQIYGQFLQAGMRFGVSVADEKAGYFLT----- 395 (400) T ss_pred eeEEecccccCCCCceEEEEEeccc-----cEEEEeecce-EEEEecccccceeEEEEEEeccEEecccceEEEE----- Confidence 5555422110 01111222221100 0011111111 112235556666777788988654 33 22211 Q ss_pred ccchhhhhhhc Q lcl|Aclame:pro 445 TPDGEALTPNA 455 (468) Q Consensus 445 ~~~~~~~~~~a 455 (468) . ...| T Consensus 396 -~-----~~~a 400 (400) T protein:vir:38 396 -Y-----TPKA 400 (400) T ss_pred -e-----ecCC Confidence 0 1111 No 82 >protein:vir:4226 Length: 326 # NCBI annotation: observed 35.2Kd protein # Family: family:all:507 # MgeID: mge:89 # MgeName: L5 # Cross-refs: genbank:acc:NP_039681;swissprot:sw:q05223;genbank:gi:9625447;uniprot:Q05223;genbank:GeneID:2942929 Probab=56.18 E-value=0.46 Score=22.42 Aligned_cols=299 Identities=12% Similarity=0.023 Sum_probs=121.3 Q ss_pred HHhHHHHHHhhhhhhhhhhhhhhcCcccccccccccccccccccccccceehhhhHHhhhhhhhhheeeeecCCccceee Q lcl|Aclame:pro 35 LLENQERFLREERGMLNEVAVNSLGAGTIAPAGSALGSANTGGLAGFDPVLISLVRRAMPNLMAYDVCGVQPMSGPTGLI 114 (468) Q Consensus 35 llenq~~~~~~~~~~l~e~~~~~~~~~~~~~~~~i~~st~tg~i~~~~P~Lv~l~RRa~~~LI~~DI~GVQPmTGPTGLI 114 (468) |.=|-++. -++-...|.. .+..++++++-.--.+.+-.+++.+.+..+-..++-+-||++++.-+ T Consensus 1 ~~~~~~r~--~~~~~~~e~~-------------a~~~~~~~~g~~ip~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~~~~ 65 (326) T protein:vir:42 1 MAVNPDRT--TPFLGVNDPK-------------VAQTGDSMFEGYLEPEQAQDYFAEAEKISIVQQFAQKIPMGTTGQKI 65 (326) T ss_pred CCCCccch--hhhcCcchhh-------------heeccccCCcceechhhHHHHHHHHHhcchhhhhcceeeccCCceEE Confidence 11110000 0000011110 01111111111112223333455555555667788888888765322 Q ss_pred eeeeeeecCCCCcccccccCCccccccccccccccccccCccccCCCccccccccccccccccccccccchhhhhccCCC Q lcl|Aclame:pro 115 FAMRSRYENQAGEEALFNEPDTGFTGGYDASQGDYAVRTGAGVGGDSEGNNPALLNDAAPGTYEVGSKMPREDLERMGEA 194 (468) Q Consensus 115 FAMRsrY~~qsG~EA~fnEa~t~fSg~~~~~~~~~~~~~~~~~~~~~~gt~~~~~~~a~~~~~t~~~gm~Ta~aE~lG~~ 194 (468) .-. . ++.++ .| .+| T Consensus 66 p~~----~--~~~~a-------~~------------------------------------------------v~E----- 79 (326) T protein:vir:42 66 PHW----T--GDVSA-------SW------------------------------------------------IGE----- 79 (326) T ss_pred EEE----e--CCcce-------EE------------------------------------------------ecC----- Confidence 100 0 00000 00 001 Q ss_pred CcchhhcceEEEEEEEEeecccccccccHHHHHHHHHhcCCChhHHHHHHHHHHHHHHhhHHHHHHHhhhhhcccccccc Q lcl|Aclame:pro 195 NRLFREMSFSIEKTSVTAQSRALKAEYTLELAQDLKAIHGLDAEQELANILSSEVLAEINREVVRRVYTVAKKGAQNNVA 274 (468) Q Consensus 195 ~~~f~EMaFsIeK~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAE~ELanILStEImlEINREii~~l~~va~~~k~~~~~ 274 (468) +..++|-..+++++++.+|...-.-.+|-||.+|-. .|.++.|.+-|+..|+..+++.+|.--- .+...++. T Consensus 80 g~~~~~~~~~f~~i~~~~~k~~~~v~iS~ell~~s~----~~~~~~i~~~l~~a~~~~~d~a~l~G~g----s~~p~gi~ 151 (326) T protein:vir:42 80 GDMKPITKGNMTSQTIAPHKIATIFVASAETVRANP----ANYLGTMRTKVATAFAMAFDNAAINGTD----SPFPTFLA 151 (326) T ss_pred CccccccccceeEEEEeeEEEEEeehhhHHHHhcCH----HHHHHHHHHHHHHHHHHHHHHHhhcccC----CCcccccc Confidence 234555666777788888887888889999999843 5789999999999999999999874211 01111100 Q ss_pred ----ccccccccccccchhHHHHHHHHHHHHHHHHHHHHHhhcCCCccEEEEchhHHHHHHhhccccccccccccccccc Q lcl|Aclame:pro 275 ----NAGIFDLDVDSNGRWSVEKFKGLLFQVERDANAIAQETRRGKGNFLICSADVASALAMAGVLDYSSGLNGAGGPSI 350 (468) Q Consensus 275 ----~~g~~Dl~~~~~grw~~e~~k~L~~~i~~ean~i~~~T~r~~~n~~v~S~~Va~~L~~sG~~~~~~~~~~~~~~~~ 350 (468) ..+..... ..+-+..-....+. +..+... .......++.+|++|.....|..- ..+.+ ..+ T Consensus 152 ~~~~~~~~~~~~--~~~~~~~~~~~~~~--~~~~~~~--~~~~~~~~a~~v~n~~~~~~L~~l---kd~~G------~~l 216 (326) T protein:vir:42 152 QTTKEVSLVDPD--GTGSNADLTVYDAV--AVNALSL--LVNAGKKWTHTLLDDITEPILNGA---KDKSG------RPL 216 (326) T ss_pred ccccccceeecc--cccccccchhHHHH--HHHHHhh--hhhhccCccEEEEeHHHHHHHHHh---hccCC------cee Confidence 00000000 00000000011110 1111111 112245667789999999999852 21110 011 Q ss_pred ccccc----cCceeEEEecCceEEEEcccccccCCcceEEEEEecCCcccceeEeeccchhhccc--------ccCCcc- Q lcl|Aclame:pro 351 GEVDD----TGNLAVGTINGRIKVFVDPYAANLSDKHYYVIGYKGTSPYDAGLFYCPYVPLQMVR--------SIDPNT- 417 (468) Q Consensus 351 ~~~d~----t~~~~~G~l~g~~~vy~D~Ya~~~~~~dY~~vG~KG~~~~d~glfyaPYv~l~~~~--------~~dp~s- 417 (468) ...+. .....-|+| .+++|+++.+.... + ++++-|+-. -+||...-.+.+.+ ..|+.. T Consensus 217 ~~~~~~~~~~~~~~~~~l-~G~pv~~~~~~~~~---~--~~~~~Gd~s---~~~~~~~~~~~v~~~~e~~~~~~~~~~~~ 287 (326) T protein:vir:42 217 FIESTYTEENSPFRLGRI-VARPTILSDHVASG---T--VVGYQGDFR---QLVWGQVGGLSFDVTDQATLNLGTPQAPN 287 (326) T ss_pred eccccccCccccccCcee-eeeeEEEcCCCCCC---c--eEEEEeecc---eEEEEEecceEEEEeecceeeeccccccc Confidence 11111 111122344 36888888765421 1 112222211 12222222221111 111111 Q ss_pred ----cc---ceeeeeeeeeeee-cC--cccccCccccccchhhhhhh Q lcl|Aclame:pro 418 ----FQ---PKIGFKTRYGMVS-NP--FVTTNGLYNGTPDGEALTPN 454 (468) Q Consensus 418 ----~q---P~~g~~tRY~l~~-nP--~~~~~~~~~~~~~~~~~~~~ 454 (468) || =.+=...|++..+ +| |+... .-.+.+. T Consensus 288 ~~~~~~~d~~~~r~~~~~d~~v~~~~a~~~l~--------~~~~~~~ 326 (326) T protein:vir:42 288 FVSLWQHNLVAVRVEAEYAFHCNDKDAFVKLT--------NVDATEA 326 (326) T ss_pred chhhhhcCcEEEEEEEEeccEEecccceEEEe--------eccccCC Confidence 22 2233456666543 33 22211 1111111 No 83 >protein:vir:1239 Length: 274 # NCBI annotation: similar to phage B1 major head protein # Family: family:all:522 # MgeID: mge:25 # MgeName: phi ETA # Cross-refs: genbank:acc:NP_510938;genbank:gi:17426272;genbank:GeneID:927376 Probab=56.11 E-value=0.46 Score=22.41 Aligned_cols=260 Identities=14% Similarity=0.065 Sum_probs=109.7 Q ss_pred eee---eecCCCCcccccccCC------ccccccccccccccccccCccccCCCccccccccccccccccccccccchhh Q lcl|Aclame:pro 117 MRS---RYENQAGEEALFNEPD------TGFTGGYDASQGDYAVRTGAGVGGDSEGNNPALLNDAAPGTYEVGSKMPRED 187 (468) Q Consensus 117 MRs---rY~~qsG~EA~fnEa~------t~fSg~~~~~~~~~~~~~~~~~~~~~~gt~~~~~~~a~~~~~t~~~gm~Ta~ 187 (468) |=. +..+.--.|-|-.... --|++-... ... . .+ .++ .+.++..--...+ T Consensus 1 ma~~~T~l~d~iiPev~~~~v~~~~~~~l~~~~~~~~-d~~--------l----~g-~~G-------~tv~iP~~~~ig~ 59 (274) T protein:vir:12 1 MAQGLTKTSNQIIPEVLAPMMQAQLEKKLRFASFAEV-DST--------L----QG-QPG-------DTLTFPAFVYSGD 59 (274) T ss_pred CCcceeehhhhhchHHHHHHHHHHHHhhhhhccccee-ccc--------c----cC-CCC-------CEEEEeeecCCCc Confidence 110 0000000111100000 001100000 000 0 00 000 0111110001112 Q ss_pred hhccCCC-CcchhhcceEEEEEEEEeecccccccccHHHHHHHHHhcCCChhHHHHHHHHHHHHHHhhHHHHHHHhhhhh Q lcl|Aclame:pro 188 LERMGEA-NRLFREMSFSIEKTSVTAQSRALKAEYTLELAQDLKAIHGLDAEQELANILSSEVLAEINREVVRRVYTVAK 266 (468) Q Consensus 188 aE~lG~~-~~~f~EMaFsIeK~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAE~ELanILStEImlEINREii~~l~~va~ 266 (468) +|.+..+ .-...++..+= .+++.+-|+-.=+++=| ..+.+ +-|.-.+..+-++..|+.+++.+++..+.+... T Consensus 60 a~~~~~g~~i~~~~lt~~~--~~~~i~~~~~~~~i~D~--~~~~~--~~d~~~~~~~q~~~~~a~~vd~~~l~~~~~a~~ 133 (274) T protein:vir:12 60 AQVVAEGEKIPTDILETKK--REAKIRKIAKGTSITDE--ALLSG--YGDPQGEQVRQHGLAHANKVDNDVLEALMGAKL 133 (274) T ss_pred cccccCCCccchhhcccce--eeEEeeeecceeeecHH--HHHhc--ccchHHHHHHHHHHHHHHHHHHHHHHHHhcccc Confidence 2323221 22344444333 33333444422222221 12233 568889999999999999999999988765332 Q ss_pred ccccccccccccccccccccchhHHHHHHHHHHHHHHHHHHHHHhhcCCCccEEEEchhHHHHHHhhccccccccccccc Q lcl|Aclame:pro 267 KGAQNNVANAGIFDLDVDSNGRWSVEKFKGLLFQVERDANAIAQETRRGKGNFLICSADVASALAMAGVLDYSSGLNGAG 346 (468) Q Consensus 267 ~~k~~~~~~~g~~Dl~~~~~grw~~e~~k~L~~~i~~ean~i~~~T~r~~~n~~v~S~~Va~~L~~sG~~~~~~~~~~~~ 346 (468) .. +...+ ..+.+-..+.++..+ -..+++++++|.|++.|......+|...... T Consensus 134 ~~------~~~a~----------~~d~i~dA~~~lgd~---------~~~~~~ivv~p~~~~~L~k~~~~~fv~~s~~-- 186 (274) T protein:vir:12 134 TV------NADIT----------KLNGLQSAIDKFNDE---------DLEPMVLFINPLDAGKLRGDASTNFTRATEL-- 186 (274) T ss_pred cc------ccccc----------CHHHHHHHHHHhccc---------cccccEEEeCHHHHHHHHhhhhhhccccccc-- Confidence 21 11111 123333333333322 1367899999999999988654444433221 Q ss_pred ccccccccccCceeEEEecCceEEEEcccccccCCcceEEEEEe-cCCcccceeEeeccchhhcccccCCccccceeeee Q lcl|Aclame:pro 347 GPSIGEVDDTGNLAVGTINGRIKVFVDPYAANLSDKHYYVIGYK-GTSPYDAGLFYCPYVPLQMVRSIDPNTFQPKIGFK 425 (468) Q Consensus 347 ~~~~~~~d~t~~~~~G~l~g~~~vy~D~Ya~~~~~~dY~~vG~K-G~~~~d~glfyaPYv~l~~~~~~dp~s~qP~~g~~ 425 (468) +. +.-.+| .+|++ .|++||+| +.-|. |-.+-++ |+-. ||. -.+...-.--||..++-.+-.. T Consensus 187 g~---~~~~~G--~ig~~-~G~~Vi~s----~~~p~-~t~~l~~~gA~~-----~~~-~~~~~vE~~Rd~~~~~d~i~~~ 249 (274) T protein:vir:12 187 GD---DIIVKG--AFGEA-LGAIIVRS----NKLEA-GTAILAKKGAVK-----LIL-KRDFFLEVARDASTKTTALYSD 249 (274) T ss_pred cc---cceecc--cceee-cCeeEEEe----CCCCc-ceEEEEecccee-----eee-cCCceeccccchhhcccEEEee Confidence 11 111122 35777 46899999 55553 2222222 2111 111 1122222224888999888888 Q ss_pred eeeeeee-cC--cccccCccccccchhhhhhhc Q lcl|Aclame:pro 426 TRYGMVS-NP--FVTTNGLYNGTPDGEALTPNA 455 (468) Q Consensus 426 tRY~l~~-nP--~~~~~~~~~~~~~~~~~~~~a 455 (468) -+||..+ || -.... -.++ ..-. T Consensus 250 ~~y~~~~~~~~~vv~~t-~~~~-------~~~~ 274 (274) T protein:vir:12 250 KHYVAYLYDESKAVKIT-KGSG-------SLEM 274 (274) T ss_pred eEEEEEEEcCCceEEEE-cCCc-------cccC Confidence 8998654 55 11100 0111 1100 No 84 >protein:vir:107593 Length: 392 # NCBI annotation: major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1491 # MgeName: Gamma # Cross-refs: genbank:acc:YP_338188;genbank:gi:77020144;genbank:GeneID:3703724 Probab=55.50 E-value=0.48 Score=22.34 Aligned_cols=317 Identities=18% Similarity=0.118 Sum_probs=126.6 Q ss_pred CcchHHHHHhhhhh----------hCCCc-----cchhcchhhhHHHHHHHhHHHHH--HhhhhhhhhhhhhhhcCcccc Q lcl|Aclame:pro 1 MFNAEHLQEKWSPV----------LNHGE-----APAIGDRYKRAVTSVLLENQERF--LREERGMLNEVAVNSLGAGTI 63 (468) Q Consensus 1 ~~~~~~l~~kw~p~----------l~~~~-----~~~i~~~~~~~~~~~llenq~~~--~~~~~~~l~e~~~~~~~~~~~ 63 (468) +=..+.|.++.... .+... ..+-...+|+... ..|.+++.. .++......|.. T Consensus 35 ~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~l~~~~~~~~~~~~~~~~~~~~--------- 104 (392) T protein:vir:10 35 MEEVRSLQKKIDLQRSLDEAETEERNNGREVETRNVDGEMEYRDVFM-KALRNKPLNAEEREFLEDDLEQR--------- 104 (392) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhccccccccCccchHHHHHHHH-HHHhcccccHHHHHHHhhhhhhh--------- Confidence 11111222222111 00000 1111222333322 222222110 011111111110 Q ss_pred ccccccccccc-ccccc---cccceehhhhHHhhhhhhhhheeeeecCCccceeeeeeeeeecCCCCcccccccCCcccc Q lcl|Aclame:pro 64 APAGSALGSAN-TGGLA---GFDPVLISLVRRAMPNLMAYDVCGVQPMSGPTGLIFAMRSRYENQAGEEALFNEPDTGFT 139 (468) Q Consensus 64 ~~~~~i~~st~-tg~i~---~~~P~Lv~l~RRa~~~LI~~DI~GVQPmTGPTGLIFAMRsrY~~qsG~EA~fnEa~t~fS 139 (468) ..+..++ .|+.. ...+.++.+.| ....-.+++++.||++++|-+.-.+ ..+ +.++ .| T Consensus 105 ----~~~~~t~~~gg~~vP~~~~~~ii~~~~---~~s~l~~~~~~~~~~~~~~~~~~~~--~~~--~~~a-------~~- 165 (392) T protein:vir:10 105 ----AMSGLTGEDGGLVIPQDIQTQINELAR---SFDALEQYVTVEPVRTRSGSRVLEK--NSD--MIPF-------AE- 165 (392) T ss_pred ----hccccccCCCceecchhHHHHHHHHHH---hhhhhhhhceeeeccCCceeEEEEe--ecC--Cccc-------ee- Confidence 0111111 12211 22334444444 4455668999999999887432111 110 0000 00 Q ss_pred ccccccccccccccCccccCCCccccccccccccccccccccccchhhhhccCCC-CcchhhcceEEEEEEEEeeccccc Q lcl|Aclame:pro 140 GGYDASQGDYAVRTGAGVGGDSEGNNPALLNDAAPGTYEVGSKMPREDLERMGEA-NRLFREMSFSIEKTSVTAQSRALK 218 (468) Q Consensus 140 g~~~~~~~~~~~~~~~~~~~~~~gt~~~~~~~a~~~~~t~~~gm~Ta~aE~lG~~-~~~f~EMaFsIeK~tVtAKSRaLK 218 (468) .+++....+. ...|.++.|+..|. +-. T Consensus 166 ---------------------------------------------v~E~~~~~~~~~~~~~~v~l~~~k~-------~~~ 193 (392) T protein:vir:10 166 ---------------------------------------------ITEMGEIPETDNPKFSNVQYAVKDR-------AGI 193 (392) T ss_pred ---------------------------------------------ecccccccccccccceeEEeeeeeE-------EEe Confidence 0000001111 12355555555554 444 Q ss_pred ccccHHHHHHHHHhcCCChhHHHHHHHHHHHHHHhhHHHHHHHhhhhhccccccccccccccccccccchhHHHHHHHHH Q lcl|Aclame:pro 219 AEYTLELAQDLKAIHGLDAEQELANILSSEVLAEINREVVRRVYTVAKKGAQNNVANAGIFDLDVDSNGRWSVEKFKGLL 298 (468) Q Consensus 219 AEYT~ELAQDLkAiHGLDAE~ELanILStEImlEINREii~~l~~va~~~k~~~~~~~g~~Dl~~~~~grw~~e~~k~L~ 298 (468) ..+|-||.+|- ..|.+++|.+-|...|..-+|.-|+.-.-+. ...+.+.+ +....++ T Consensus 194 ~~iS~ell~ds----~~~l~~~i~~~l~~~i~~~~d~~~~~g~g~~---------~~~~~~~~----------d~i~~~~ 250 (392) T protein:vir:10 194 LPLSRSLLQDS----DQNILKYVTKWLGKKSKVTRNVLILGVIEKL---------TKQAIKSL----------DDIKDVL 250 (392) T ss_pred ehhhHHHHhhh----HHHHHHHHHHHHHHHHHHHHHHHHhhccccc---------cccCccCH----------HHHHHHH Confidence 67899999984 2567889999999999998888887433221 12223222 2222222 Q ss_pred HHHHHHHHHHHHhhcCCCccEEEEchhHHHHHHhhcccccccccccccccccccccccCceeEEEecCceEEEEcccccc Q lcl|Aclame:pro 299 FQVERDANAIAQETRRGKGNFLICSADVASALAMAGVLDYSSGLNGAGGPSIGEVDDTGNLAVGTINGRIKVFVDPYAAN 378 (468) Q Consensus 299 ~~i~~ean~i~~~T~r~~~n~~v~S~~Va~~L~~sG~~~~~~~~~~~~~~~~~~~d~t~~~~~G~l~g~~~vy~D~Ya~~ 378 (468) .. ...... + ..-..|++|.....|... ..+. +..+.+.+-+. ...++|.|...|+++. + T Consensus 251 ~~--~l~~~~-----~-~~a~~vm~~~~~~~L~~l---kd~~------G~~l~~~~~~~-~~~~tllG~~~v~~~~---~ 309 (392) T protein:vir:10 251 NV--KLDPAI-----S-PNAILLTNQDGFNYLDKL---KDKD------GKYILQSDPTQ-KNKKLFAGTNPVVVVS---N 309 (392) T ss_pred HH--hhhhhh-----c-cCCEEEEcHHHHHHHHHh---hccC------CCeEeecCccC-CccccccCcccEEEec---c Confidence 11 111111 1 224478999999999763 2111 11111112111 1236777766666542 1 Q ss_pred cCCcceEEEEEecCCcccceeEeeccch-------hhcccccCC------ccccceeeeeeeeeeee-cC--ccccc--- Q lcl|Aclame:pro 379 LSDKHYYVIGYKGTSPYDAGLFYCPYVP-------LQMVRSIDP------NTFQPKIGFKTRYGMVS-NP--FVTTN--- 439 (468) Q Consensus 379 ~~~~dY~~vG~KG~~~~d~glfyaPYv~-------l~~~~~~dp------~s~qP~~g~~tRY~l~~-nP--~~~~~--- 439 (468) ..++.+|...-+..++|+.+-. ..+...+++ .+.+=.+-...|+|..+ +| |.... T Consensus 310 ------~~~~~~~~~~~~~~~~~gdfs~~~~i~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~ 383 (392) T protein:vir:10 310 ------RFLKSKGTTAKKAPLIIGDLKEAIVLFKREDMELASTDVGGKAFTRNTLDLRAIQRDDVQMWDNEAAVYGEIDL 383 (392) T ss_pred ------cccCCCcccCCceEEEEEehhceEEEEeecceEEEEeccccchhhcCceEEEEEEeeccEEecccceEEEEecc Confidence 1222223222233344433211 111111222 23445566677777543 34 33211 Q ss_pred Cccccccch Q lcl|Aclame:pro 440 GLYNGTPDG 448 (468) Q Consensus 440 ~~~~~~~~~ 448 (468) ......|-| T Consensus 384 ~a~~~~~~~ 392 (392) T protein:vir:10 384 SAPVEQPQG 392 (392) T ss_pred cccccCCCC Confidence 111112333 No 85 >protein:vir:102082 Length: 392 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:1503 # MgeName: Fah # Cross-refs: genbank:acc:YP_512315;genbank:gi:89152484;genbank:GeneID:3953075 Probab=55.50 E-value=0.48 Score=22.34 Aligned_cols=317 Identities=18% Similarity=0.118 Sum_probs=126.6 Q ss_pred CcchHHHHHhhhhh----------hCCCc-----cchhcchhhhHHHHHHHhHHHHH--HhhhhhhhhhhhhhhcCcccc Q lcl|Aclame:pro 1 MFNAEHLQEKWSPV----------LNHGE-----APAIGDRYKRAVTSVLLENQERF--LREERGMLNEVAVNSLGAGTI 63 (468) Q Consensus 1 ~~~~~~l~~kw~p~----------l~~~~-----~~~i~~~~~~~~~~~llenq~~~--~~~~~~~l~e~~~~~~~~~~~ 63 (468) +=..+.|.++.... .+... ..+-...+|+... ..|.+++.. .++......|.. T Consensus 35 ~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~l~~~~~~~~~~~~~~~~~~~~--------- 104 (392) T protein:vir:10 35 MEEVRSLQKKIDLQRSLDEAETEERNNGREVETRNVDGEMEYRDVFM-KALRNKPLNAEEREFLEDDLEQR--------- 104 (392) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhccccccccCccchHHHHHHHH-HHHhcccccHHHHHHHhhhhhhh--------- Confidence 11111222222111 00000 1111222333322 222222110 011111111110 Q ss_pred ccccccccccc-ccccc---cccceehhhhHHhhhhhhhhheeeeecCCccceeeeeeeeeecCCCCcccccccCCcccc Q lcl|Aclame:pro 64 APAGSALGSAN-TGGLA---GFDPVLISLVRRAMPNLMAYDVCGVQPMSGPTGLIFAMRSRYENQAGEEALFNEPDTGFT 139 (468) Q Consensus 64 ~~~~~i~~st~-tg~i~---~~~P~Lv~l~RRa~~~LI~~DI~GVQPmTGPTGLIFAMRsrY~~qsG~EA~fnEa~t~fS 139 (468) ..+..++ .|+.. ...+.++.+.| ....-.+++++.||++++|-+.-.+ ..+ +.++ .| T Consensus 105 ----~~~~~t~~~gg~~vP~~~~~~ii~~~~---~~s~l~~~~~~~~~~~~~~~~~~~~--~~~--~~~a-------~~- 165 (392) T protein:vir:10 105 ----AMSGLTGEDGGLVIPQDIQTQINELAR---SFDALEQYVTVEPVRTRSGSRVLEK--NSD--MIPF-------AE- 165 (392) T ss_pred ----hccccccCCCceecchhHHHHHHHHHH---hhhhhhhhceeeeccCCceeEEEEe--ecC--Cccc-------ee- Confidence 0111111 12211 22334444444 4455668999999999887432111 110 0000 00 Q ss_pred ccccccccccccccCccccCCCccccccccccccccccccccccchhhhhccCCC-CcchhhcceEEEEEEEEeeccccc Q lcl|Aclame:pro 140 GGYDASQGDYAVRTGAGVGGDSEGNNPALLNDAAPGTYEVGSKMPREDLERMGEA-NRLFREMSFSIEKTSVTAQSRALK 218 (468) Q Consensus 140 g~~~~~~~~~~~~~~~~~~~~~~gt~~~~~~~a~~~~~t~~~gm~Ta~aE~lG~~-~~~f~EMaFsIeK~tVtAKSRaLK 218 (468) .+++....+. ...|.++.|+..|. +-. T Consensus 166 ---------------------------------------------v~E~~~~~~~~~~~~~~v~l~~~k~-------~~~ 193 (392) T protein:vir:10 166 ---------------------------------------------ITEMGEIPETDNPKFSNVQYAVKDR-------AGI 193 (392) T ss_pred ---------------------------------------------ecccccccccccccceeEEeeeeeE-------EEe Confidence 0000001111 12355555555554 444 Q ss_pred ccccHHHHHHHHHhcCCChhHHHHHHHHHHHHHHhhHHHHHHHhhhhhccccccccccccccccccccchhHHHHHHHHH Q lcl|Aclame:pro 219 AEYTLELAQDLKAIHGLDAEQELANILSSEVLAEINREVVRRVYTVAKKGAQNNVANAGIFDLDVDSNGRWSVEKFKGLL 298 (468) Q Consensus 219 AEYT~ELAQDLkAiHGLDAE~ELanILStEImlEINREii~~l~~va~~~k~~~~~~~g~~Dl~~~~~grw~~e~~k~L~ 298 (468) ..+|-||.+|- ..|.+++|.+-|...|..-+|.-|+.-.-+. ...+.+.+ +....++ T Consensus 194 ~~iS~ell~ds----~~~l~~~i~~~l~~~i~~~~d~~~~~g~g~~---------~~~~~~~~----------d~i~~~~ 250 (392) T protein:vir:10 194 LPLSRSLLQDS----DQNILKYVTKWLGKKSKVTRNVLILGVIEKL---------TKQAIKSL----------DDIKDVL 250 (392) T ss_pred ehhhHHHHhhh----HHHHHHHHHHHHHHHHHHHHHHHHhhccccc---------cccCccCH----------HHHHHHH Confidence 67899999984 2567889999999999998888887433221 12223222 2222222 Q ss_pred HHHHHHHHHHHHhhcCCCccEEEEchhHHHHHHhhcccccccccccccccccccccccCceeEEEecCceEEEEcccccc Q lcl|Aclame:pro 299 FQVERDANAIAQETRRGKGNFLICSADVASALAMAGVLDYSSGLNGAGGPSIGEVDDTGNLAVGTINGRIKVFVDPYAAN 378 (468) Q Consensus 299 ~~i~~ean~i~~~T~r~~~n~~v~S~~Va~~L~~sG~~~~~~~~~~~~~~~~~~~d~t~~~~~G~l~g~~~vy~D~Ya~~ 378 (468) .. ...... + ..-..|++|.....|... ..+. +..+.+.+-+. ...++|.|...|+++. + T Consensus 251 ~~--~l~~~~-----~-~~a~~vm~~~~~~~L~~l---kd~~------G~~l~~~~~~~-~~~~tllG~~~v~~~~---~ 309 (392) T protein:vir:10 251 NV--KLDPAI-----S-PNAILLTNQDGFNYLDKL---KDKD------GKYILQSDPTQ-KNKKLFAGTNPVVVVS---N 309 (392) T ss_pred HH--hhhhhh-----c-cCCEEEEcHHHHHHHHHh---hccC------CCeEeecCccC-CccccccCcccEEEec---c Confidence 11 111111 1 224478999999999763 2111 11111112111 1236777766666542 1 Q ss_pred cCCcceEEEEEecCCcccceeEeeccch-------hhcccccCC------ccccceeeeeeeeeeee-cC--ccccc--- Q lcl|Aclame:pro 379 LSDKHYYVIGYKGTSPYDAGLFYCPYVP-------LQMVRSIDP------NTFQPKIGFKTRYGMVS-NP--FVTTN--- 439 (468) Q Consensus 379 ~~~~dY~~vG~KG~~~~d~glfyaPYv~-------l~~~~~~dp------~s~qP~~g~~tRY~l~~-nP--~~~~~--- 439 (468) ..++.+|...-+..++|+.+-. ..+...+++ .+.+=.+-...|+|..+ +| |.... T Consensus 310 ------~~~~~~~~~~~~~~~~~gdfs~~~~i~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~ 383 (392) T protein:vir:10 310 ------RFLKSKGTTAKKAPLIIGDLKEAIVLFKREDMELASTDVGGKAFTRNTLDLRAIQRDDVQMWDNEAAVYGEIDL 383 (392) T ss_pred ------cccCCCcccCCceEEEEEehhceEEEEeecceEEEEeccccchhhcCceEEEEEEeeccEEecccceEEEEecc Confidence 1222223222233344433211 111111222 23445566677777543 34 33211 Q ss_pred Cccccccch Q lcl|Aclame:pro 440 GLYNGTPDG 448 (468) Q Consensus 440 ~~~~~~~~~ 448 (468) ......|-| T Consensus 384 ~a~~~~~~~ 392 (392) T protein:vir:10 384 SAPVEQPQG 392 (392) T ss_pred cccccCCCC Confidence 111112333 No 86 >protein:vir:105004 Length: 392 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:1490 # MgeName: W Beta # Cross-refs: genbank:acc:YP_459969;genbank:gi:85701384;genbank:GeneID:3882145 Probab=55.50 E-value=0.48 Score=22.34 Aligned_cols=317 Identities=18% Similarity=0.118 Sum_probs=126.6 Q ss_pred CcchHHHHHhhhhh----------hCCCc-----cchhcchhhhHHHHHHHhHHHHH--HhhhhhhhhhhhhhhcCcccc Q lcl|Aclame:pro 1 MFNAEHLQEKWSPV----------LNHGE-----APAIGDRYKRAVTSVLLENQERF--LREERGMLNEVAVNSLGAGTI 63 (468) Q Consensus 1 ~~~~~~l~~kw~p~----------l~~~~-----~~~i~~~~~~~~~~~llenq~~~--~~~~~~~l~e~~~~~~~~~~~ 63 (468) +=..+.|.++.... .+... ..+-...+|+... ..|.+++.. .++......|.. T Consensus 35 ~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~l~~~~~~~~~~~~~~~~~~~~--------- 104 (392) T protein:vir:10 35 MEEVRSLQKKIDLQRSLDEAETEERNNGREVETRNVDGEMEYRDVFM-KALRNKPLNAEEREFLEDDLEQR--------- 104 (392) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhccccccccCccchHHHHHHHH-HHHhcccccHHHHHHHhhhhhhh--------- Confidence 11111222222111 00000 1111222333322 222222110 011111111110 Q ss_pred ccccccccccc-ccccc---cccceehhhhHHhhhhhhhhheeeeecCCccceeeeeeeeeecCCCCcccccccCCcccc Q lcl|Aclame:pro 64 APAGSALGSAN-TGGLA---GFDPVLISLVRRAMPNLMAYDVCGVQPMSGPTGLIFAMRSRYENQAGEEALFNEPDTGFT 139 (468) Q Consensus 64 ~~~~~i~~st~-tg~i~---~~~P~Lv~l~RRa~~~LI~~DI~GVQPmTGPTGLIFAMRsrY~~qsG~EA~fnEa~t~fS 139 (468) ..+..++ .|+.. ...+.++.+.| ....-.+++++.||++++|-+.-.+ ..+ +.++ .| T Consensus 105 ----~~~~~t~~~gg~~vP~~~~~~ii~~~~---~~s~l~~~~~~~~~~~~~~~~~~~~--~~~--~~~a-------~~- 165 (392) T protein:vir:10 105 ----AMSGLTGEDGGLVIPQDIQTQINELAR---SFDALEQYVTVEPVRTRSGSRVLEK--NSD--MIPF-------AE- 165 (392) T ss_pred ----hccccccCCCceecchhHHHHHHHHHH---hhhhhhhhceeeeccCCceeEEEEe--ecC--Cccc-------ee- Confidence 0111111 12211 22334444444 4455668999999999887432111 110 0000 00 Q ss_pred ccccccccccccccCccccCCCccccccccccccccccccccccchhhhhccCCC-CcchhhcceEEEEEEEEeeccccc Q lcl|Aclame:pro 140 GGYDASQGDYAVRTGAGVGGDSEGNNPALLNDAAPGTYEVGSKMPREDLERMGEA-NRLFREMSFSIEKTSVTAQSRALK 218 (468) Q Consensus 140 g~~~~~~~~~~~~~~~~~~~~~~gt~~~~~~~a~~~~~t~~~gm~Ta~aE~lG~~-~~~f~EMaFsIeK~tVtAKSRaLK 218 (468) .+++....+. ...|.++.|+..|. +-. T Consensus 166 ---------------------------------------------v~E~~~~~~~~~~~~~~v~l~~~k~-------~~~ 193 (392) T protein:vir:10 166 ---------------------------------------------ITEMGEIPETDNPKFSNVQYAVKDR-------AGI 193 (392) T ss_pred ---------------------------------------------ecccccccccccccceeEEeeeeeE-------EEe Confidence 0000001111 12355555555554 444 Q ss_pred ccccHHHHHHHHHhcCCChhHHHHHHHHHHHHHHhhHHHHHHHhhhhhccccccccccccccccccccchhHHHHHHHHH Q lcl|Aclame:pro 219 AEYTLELAQDLKAIHGLDAEQELANILSSEVLAEINREVVRRVYTVAKKGAQNNVANAGIFDLDVDSNGRWSVEKFKGLL 298 (468) Q Consensus 219 AEYT~ELAQDLkAiHGLDAE~ELanILStEImlEINREii~~l~~va~~~k~~~~~~~g~~Dl~~~~~grw~~e~~k~L~ 298 (468) ..+|-||.+|- ..|.+++|.+-|...|..-+|.-|+.-.-+. ...+.+.+ +....++ T Consensus 194 ~~iS~ell~ds----~~~l~~~i~~~l~~~i~~~~d~~~~~g~g~~---------~~~~~~~~----------d~i~~~~ 250 (392) T protein:vir:10 194 LPLSRSLLQDS----DQNILKYVTKWLGKKSKVTRNVLILGVIEKL---------TKQAIKSL----------DDIKDVL 250 (392) T ss_pred ehhhHHHHhhh----HHHHHHHHHHHHHHHHHHHHHHHHhhccccc---------cccCccCH----------HHHHHHH Confidence 67899999984 2567889999999999998888887433221 12223222 2222222 Q ss_pred HHHHHHHHHHHHhhcCCCccEEEEchhHHHHHHhhcccccccccccccccccccccccCceeEEEecCceEEEEcccccc Q lcl|Aclame:pro 299 FQVERDANAIAQETRRGKGNFLICSADVASALAMAGVLDYSSGLNGAGGPSIGEVDDTGNLAVGTINGRIKVFVDPYAAN 378 (468) Q Consensus 299 ~~i~~ean~i~~~T~r~~~n~~v~S~~Va~~L~~sG~~~~~~~~~~~~~~~~~~~d~t~~~~~G~l~g~~~vy~D~Ya~~ 378 (468) .. ...... + ..-..|++|.....|... ..+. +..+.+.+-+. ...++|.|...|+++. + T Consensus 251 ~~--~l~~~~-----~-~~a~~vm~~~~~~~L~~l---kd~~------G~~l~~~~~~~-~~~~tllG~~~v~~~~---~ 309 (392) T protein:vir:10 251 NV--KLDPAI-----S-PNAILLTNQDGFNYLDKL---KDKD------GKYILQSDPTQ-KNKKLFAGTNPVVVVS---N 309 (392) T ss_pred HH--hhhhhh-----c-cCCEEEEcHHHHHHHHHh---hccC------CCeEeecCccC-CccccccCcccEEEec---c Confidence 11 111111 1 224478999999999763 2111 11111112111 1236777766666542 1 Q ss_pred cCCcceEEEEEecCCcccceeEeeccch-------hhcccccCC------ccccceeeeeeeeeeee-cC--ccccc--- Q lcl|Aclame:pro 379 LSDKHYYVIGYKGTSPYDAGLFYCPYVP-------LQMVRSIDP------NTFQPKIGFKTRYGMVS-NP--FVTTN--- 439 (468) Q Consensus 379 ~~~~dY~~vG~KG~~~~d~glfyaPYv~-------l~~~~~~dp------~s~qP~~g~~tRY~l~~-nP--~~~~~--- 439 (468) ..++.+|...-+..++|+.+-. ..+...+++ .+.+=.+-...|+|..+ +| |.... T Consensus 310 ------~~~~~~~~~~~~~~~~~gdfs~~~~i~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~ 383 (392) T protein:vir:10 310 ------RFLKSKGTTAKKAPLIIGDLKEAIVLFKREDMELASTDVGGKAFTRNTLDLRAIQRDDVQMWDNEAAVYGEIDL 383 (392) T ss_pred ------cccCCCcccCCceEEEEEehhceEEEEeecceEEEEeccccchhhcCceEEEEEEeeccEEecccceEEEEecc Confidence 1222223222233344433211 111111222 23445566677777543 34 33211 Q ss_pred Cccccccch Q lcl|Aclame:pro 440 GLYNGTPDG 448 (468) Q Consensus 440 ~~~~~~~~~ 448 (468) ......|-| T Consensus 384 ~a~~~~~~~ 392 (392) T protein:vir:10 384 SAPVEQPQG 392 (392) T ss_pred cccccCCCC Confidence 111112333 No 87 >protein:vir:102873 Length: 392 # NCBI annotation: major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1492 # MgeName: Cherry # Cross-refs: genbank:acc:YP_338137;genbank:gi:77020198;genbank:GeneID:3703782 Probab=55.50 E-value=0.48 Score=22.34 Aligned_cols=317 Identities=18% Similarity=0.118 Sum_probs=126.6 Q ss_pred CcchHHHHHhhhhh----------hCCCc-----cchhcchhhhHHHHHHHhHHHHH--HhhhhhhhhhhhhhhcCcccc Q lcl|Aclame:pro 1 MFNAEHLQEKWSPV----------LNHGE-----APAIGDRYKRAVTSVLLENQERF--LREERGMLNEVAVNSLGAGTI 63 (468) Q Consensus 1 ~~~~~~l~~kw~p~----------l~~~~-----~~~i~~~~~~~~~~~llenq~~~--~~~~~~~l~e~~~~~~~~~~~ 63 (468) +=..+.|.++.... .+... ..+-...+|+... ..|.+++.. .++......|.. T Consensus 35 ~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~l~~~~~~~~~~~~~~~~~~~~--------- 104 (392) T protein:vir:10 35 MEEVRSLQKKIDLQRSLDEAETEERNNGREVETRNVDGEMEYRDVFM-KALRNKPLNAEEREFLEDDLEQR--------- 104 (392) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhccccccccCccchHHHHHHHH-HHHhcccccHHHHHHHhhhhhhh--------- Confidence 11111222222111 00000 1111222333322 222222110 011111111110 Q ss_pred ccccccccccc-ccccc---cccceehhhhHHhhhhhhhhheeeeecCCccceeeeeeeeeecCCCCcccccccCCcccc Q lcl|Aclame:pro 64 APAGSALGSAN-TGGLA---GFDPVLISLVRRAMPNLMAYDVCGVQPMSGPTGLIFAMRSRYENQAGEEALFNEPDTGFT 139 (468) Q Consensus 64 ~~~~~i~~st~-tg~i~---~~~P~Lv~l~RRa~~~LI~~DI~GVQPmTGPTGLIFAMRsrY~~qsG~EA~fnEa~t~fS 139 (468) ..+..++ .|+.. ...+.++.+.| ....-.+++++.||++++|-+.-.+ ..+ +.++ .| T Consensus 105 ----~~~~~t~~~gg~~vP~~~~~~ii~~~~---~~s~l~~~~~~~~~~~~~~~~~~~~--~~~--~~~a-------~~- 165 (392) T protein:vir:10 105 ----AMSGLTGEDGGLVIPQDIQTQINELAR---SFDALEQYVTVEPVRTRSGSRVLEK--NSD--MIPF-------AE- 165 (392) T ss_pred ----hccccccCCCceecchhHHHHHHHHHH---hhhhhhhhceeeeccCCceeEEEEe--ecC--Cccc-------ee- Confidence 0111111 12211 22334444444 4455668999999999887432111 110 0000 00 Q ss_pred ccccccccccccccCccccCCCccccccccccccccccccccccchhhhhccCCC-CcchhhcceEEEEEEEEeeccccc Q lcl|Aclame:pro 140 GGYDASQGDYAVRTGAGVGGDSEGNNPALLNDAAPGTYEVGSKMPREDLERMGEA-NRLFREMSFSIEKTSVTAQSRALK 218 (468) Q Consensus 140 g~~~~~~~~~~~~~~~~~~~~~~gt~~~~~~~a~~~~~t~~~gm~Ta~aE~lG~~-~~~f~EMaFsIeK~tVtAKSRaLK 218 (468) .+++....+. ...|.++.|+..|. +-. T Consensus 166 ---------------------------------------------v~E~~~~~~~~~~~~~~v~l~~~k~-------~~~ 193 (392) T protein:vir:10 166 ---------------------------------------------ITEMGEIPETDNPKFSNVQYAVKDR-------AGI 193 (392) T ss_pred ---------------------------------------------ecccccccccccccceeEEeeeeeE-------EEe Confidence 0000001111 12355555555554 444 Q ss_pred ccccHHHHHHHHHhcCCChhHHHHHHHHHHHHHHhhHHHHHHHhhhhhccccccccccccccccccccchhHHHHHHHHH Q lcl|Aclame:pro 219 AEYTLELAQDLKAIHGLDAEQELANILSSEVLAEINREVVRRVYTVAKKGAQNNVANAGIFDLDVDSNGRWSVEKFKGLL 298 (468) Q Consensus 219 AEYT~ELAQDLkAiHGLDAE~ELanILStEImlEINREii~~l~~va~~~k~~~~~~~g~~Dl~~~~~grw~~e~~k~L~ 298 (468) ..+|-||.+|- ..|.+++|.+-|...|..-+|.-|+.-.-+. ...+.+.+ +....++ T Consensus 194 ~~iS~ell~ds----~~~l~~~i~~~l~~~i~~~~d~~~~~g~g~~---------~~~~~~~~----------d~i~~~~ 250 (392) T protein:vir:10 194 LPLSRSLLQDS----DQNILKYVTKWLGKKSKVTRNVLILGVIEKL---------TKQAIKSL----------DDIKDVL 250 (392) T ss_pred ehhhHHHHhhh----HHHHHHHHHHHHHHHHHHHHHHHHhhccccc---------cccCccCH----------HHHHHHH Confidence 67899999984 2567889999999999998888887433221 12223222 2222222 Q ss_pred HHHHHHHHHHHHhhcCCCccEEEEchhHHHHHHhhcccccccccccccccccccccccCceeEEEecCceEEEEcccccc Q lcl|Aclame:pro 299 FQVERDANAIAQETRRGKGNFLICSADVASALAMAGVLDYSSGLNGAGGPSIGEVDDTGNLAVGTINGRIKVFVDPYAAN 378 (468) Q Consensus 299 ~~i~~ean~i~~~T~r~~~n~~v~S~~Va~~L~~sG~~~~~~~~~~~~~~~~~~~d~t~~~~~G~l~g~~~vy~D~Ya~~ 378 (468) .. ...... + ..-..|++|.....|... ..+. +..+.+.+-+. ...++|.|...|+++. + T Consensus 251 ~~--~l~~~~-----~-~~a~~vm~~~~~~~L~~l---kd~~------G~~l~~~~~~~-~~~~tllG~~~v~~~~---~ 309 (392) T protein:vir:10 251 NV--KLDPAI-----S-PNAILLTNQDGFNYLDKL---KDKD------GKYILQSDPTQ-KNKKLFAGTNPVVVVS---N 309 (392) T ss_pred HH--hhhhhh-----c-cCCEEEEcHHHHHHHHHh---hccC------CCeEeecCccC-CccccccCcccEEEec---c Confidence 11 111111 1 224478999999999763 2111 11111112111 1236777766666542 1 Q ss_pred cCCcceEEEEEecCCcccceeEeeccch-------hhcccccCC------ccccceeeeeeeeeeee-cC--ccccc--- Q lcl|Aclame:pro 379 LSDKHYYVIGYKGTSPYDAGLFYCPYVP-------LQMVRSIDP------NTFQPKIGFKTRYGMVS-NP--FVTTN--- 439 (468) Q Consensus 379 ~~~~dY~~vG~KG~~~~d~glfyaPYv~-------l~~~~~~dp------~s~qP~~g~~tRY~l~~-nP--~~~~~--- 439 (468) ..++.+|...-+..++|+.+-. ..+...+++ .+.+=.+-...|+|..+ +| |.... T Consensus 310 ------~~~~~~~~~~~~~~~~~gdfs~~~~i~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~ 383 (392) T protein:vir:10 310 ------RFLKSKGTTAKKAPLIIGDLKEAIVLFKREDMELASTDVGGKAFTRNTLDLRAIQRDDVQMWDNEAAVYGEIDL 383 (392) T ss_pred ------cccCCCcccCCceEEEEEehhceEEEEeecceEEEEeccccchhhcCceEEEEEEeeccEEecccceEEEEecc Confidence 1222223222233344433211 111111222 23445566677777543 34 33211 Q ss_pred Cccccccch Q lcl|Aclame:pro 440 GLYNGTPDG 448 (468) Q Consensus 440 ~~~~~~~~~ 448 (468) ......|-| T Consensus 384 ~a~~~~~~~ 392 (392) T protein:vir:10 384 SAPVEQPQG 392 (392) T ss_pred cccccCCCC Confidence 111112333 No 88 >protein:vir:97148 Length: 324 # NCBI annotation: ORF010 # Family: family:all:507 # MgeID: mge:1654 # MgeName: 85 # Cross-refs: genbank:acc:YP_239726;genbank:gi:66394880;genbank:GeneID:5130881 Probab=53.98 E-value=0.51 Score=22.16 Aligned_cols=297 Identities=11% Similarity=0.064 Sum_probs=116.7 Q ss_pred HhHHHHHHhhhhhhhhhh--hhhhcCcccccccccccccccccccccccceehhhhHHhhhhhhhhheeeeecCCcccee Q lcl|Aclame:pro 36 LENQERFLREERGMLNEV--AVNSLGAGTIAPAGSALGSANTGGLAGFDPVLISLVRRAMPNLMAYDVCGVQPMSGPTGL 113 (468) Q Consensus 36 lenq~~~~~~~~~~l~e~--~~~~~~~~~~~~~~~i~~st~tg~i~~~~P~Lv~l~RRa~~~LI~~DI~GVQPmTGPTGL 113 (468) +|.- +.++.+.++.... ..+.+..... .++++++..--....-.+++.+....+..+++-+.||++.+-- T Consensus 1 ~~~~-~~~~~~~~~f~~~~~~~~~~~a~~~-------~~~~~~~~~iP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~ 72 (324) T protein:vir:97 1 MEQT-QKLKLNLQHFASNNVKPQVFNPDNV-------MMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPMEGTEKK 72 (324) T ss_pred Cccc-hhHHHHHHHHHHhhhhhhhhccccc-------cccCCCcceechhHHHHHHHHHHhhcchhhhcceeeccCCceE Confidence 1111 1111111111110 0011111111 1112222211111122345556667788889999999876532 Q ss_pred eeeeeeeecCCCCcccccccCCccccccccccccccccccCccccCCCccccccccccccccccccccccchhhhhccCC Q lcl|Aclame:pro 114 IFAMRSRYENQAGEEALFNEPDTGFTGGYDASQGDYAVRTGAGVGGDSEGNNPALLNDAAPGTYEVGSKMPREDLERMGE 193 (468) Q Consensus 114 IFAMRsrY~~qsG~EA~fnEa~t~fSg~~~~~~~~~~~~~~~~~~~~~~gt~~~~~~~a~~~~~t~~~gm~Ta~aE~lG~ 193 (468) |-- +.. +.++ .| .+ | T Consensus 73 ip~----~~~--~~~a-------~~----------------------------------------------v~--E---- 87 (324) T protein:vir:97 73 FTF----WAD--KPGA-------YW----------------------------------------------VG--E---- 87 (324) T ss_pred EEE----Eec--Ccce-------eE----------------------------------------------ec--c---- Confidence 211 100 0000 00 00 1 Q ss_pred CCcchhhcceEEEEEEEEeecccccccccHHHHHHHHHhcCCChhHHHHHHHHHHHHHHhhHHHHHHHhhhhhccccccc Q lcl|Aclame:pro 194 ANRLFREMSFSIEKTSVTAQSRALKAEYTLELAQDLKAIHGLDAEQELANILSSEVLAEINREVVRRVYTVAKKGAQNNV 273 (468) Q Consensus 194 ~~~~f~EMaFsIeK~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAE~ELanILStEImlEINREii~~l~~va~~~k~~~~ 273 (468) +..+++...++++++.++|.=+--..+|-||.+|-. .|.+++|.+-|+..|...+++.||.---.. . T Consensus 88 -g~~~~~~~~~f~~v~~~~~k~~~~~~is~ell~ds~----~~l~~~i~~~l~~aia~~~d~a~l~G~g~~--------~ 154 (324) T protein:vir:97 88 -GQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTY----SQFFEEMKPMIAEAFYKKFDEAGILNQGNN--------P 154 (324) T ss_pred -CccccccccceeEEEEeeEEEEEeehhhHHHHhcch----HHHHHHHHHHHHHHHHHHHHHHhhccCCCC--------c Confidence 112333344444555555555555569999999863 578999999999999999999988632111 0 Q ss_pred ccccccccccc----ccchhHHHHHHHHHHHHHHHHHHHHHhhcCCCccEEEEchhHHHHHHhhcccccccccccccccc Q lcl|Aclame:pro 274 ANAGIFDLDVD----SNGRWSVEKFKGLLFQVERDANAIAQETRRGKGNFLICSADVASALAMAGVLDYSSGLNGAGGPS 349 (468) Q Consensus 274 ~~~g~~Dl~~~----~~grw~~e~~k~L~~~i~~ean~i~~~T~r~~~n~~v~S~~Va~~L~~sG~~~~~~~~~~~~~~~ 349 (468) ...|++..... ..+....+....++ ..+. .--.....+||+|.....|...- + + ++ .. T Consensus 155 ~~~gi~~~~~~~~~~~~~~~~~~~i~~~~-------~~l~--~~~~~~~~~v~n~~~~~~L~~lk--d-~---~g---~~ 216 (324) T protein:vir:97 155 FGKSIAQSIEKTNKVIKGDFTQDNIIDLE-------ALLE--DDELEANAFISKTQNRSLLRKIV--D-P---ET---KE 216 (324) T ss_pred cCccccccccccceeccccCCHHHHHHHH-------Hhhh--hccCCCCEEEEcHHHHHHHHHhh--c-C---CC---ce Confidence 11111111000 00111112222232 2221 11234446789999999988531 1 1 11 11 Q ss_pred cccccccCceeEEEecCceEEEEcccccccCCcceEEEEEecCCcccceeEeeccchhhccc--------ccCCc----- Q lcl|Aclame:pro 350 IGEVDDTGNLAVGTINGRIKVFVDPYAANLSDKHYYVIGYKGTSPYDAGLFYCPYVPLQMVR--------SIDPN----- 416 (468) Q Consensus 350 ~~~~d~t~~~~~G~l~g~~~vy~D~Ya~~~~~~dY~~vG~KG~~~~d~glfyaPYv~l~~~~--------~~dp~----- 416 (468) .+. +.. .|+|. +++|++.+-. . .+...+++|-. +.+++...-...+.. ..|+. T Consensus 217 ~~~-~~~----~~tl~-G~PV~~~~~~-~-~~~~~~~~gd~------~~~~i~~~~~~~i~~~~~~~~~~~~~~~~~~~~ 282 (324) T protein:vir:97 217 RIY-DRN----SDTLD-GLPVVNLKSS-N-LKRGELITGDF------DKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVN 282 (324) T ss_pred eec-CCC----Ccccc-ceeeEeecCC-C-CCcceEEEEec------ccEEEEEecCcEEEEeecccccccccccccchh Confidence 111 111 14454 4566665321 1 12222333311 011111111111100 00111 Q ss_pred cc---cceeeeeeeeee-eecC--cccccCcccc--ccchhh Q lcl|Aclame:pro 417 TF---QPKIGFKTRYGM-VSNP--FVTTNGLYNG--TPDGEA 450 (468) Q Consensus 417 s~---qP~~g~~tRY~l-~~nP--~~~~~~~~~~--~~~~~~ 450 (468) -| +=.+=+..||+. ..|| |+.......+ .+.++- T Consensus 283 ~f~~d~~~~r~~~r~d~~v~~~~a~~~l~~~~~~~~~~~~~~ 324 (324) T protein:vir:97 283 LFEQDMVALRATMHVALHIADDKAFAKLVPADKKTDSVPGEV 324 (324) T ss_pred hhhcCcEEEEEEEEeccEEecccceEEEEeccCCCCCCCCCC Confidence 01 122223467764 4455 4432221111 111111 No 89 >protein:vir:96392 Length: 324 # NCBI annotation: ORF011 # Family: family:all:507 # MgeID: mge:1613 # MgeName: 53 # Cross-refs: genbank:acc:YP_239648;genbank:gi:66395381;genbank:GeneID:5132868 Probab=52.91 E-value=0.54 Score=22.04 Aligned_cols=298 Identities=10% Similarity=0.033 Sum_probs=119.8 Q ss_pred HhHHHHHHhhhhhhhhhhhh--hhcCcccccccccccccccccccccccceehhhhHHhhhhhhhhheeeeecCCcccee Q lcl|Aclame:pro 36 LENQERFLREERGMLNEVAV--NSLGAGTIAPAGSALGSANTGGLAGFDPVLISLVRRAMPNLMAYDVCGVQPMSGPTGL 113 (468) Q Consensus 36 lenq~~~~~~~~~~l~e~~~--~~~~~~~~~~~~~i~~st~tg~i~~~~P~Lv~l~RRa~~~LI~~DI~GVQPmTGPTGL 113 (468) +| |.+.++++.++...... +++.+. ... .+.+++..--....-.+++.+.......+++-+-||++++-- T Consensus 1 ~~-~~~~~~~~~~~~~~~~~~~~~~~a~------~~~-~~~~~~~~iP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~ 72 (324) T protein:vir:96 1 ME-QTQKLKLNLQHFASNNVKPQVFNPD------NVM-MHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPMEGTEKK 72 (324) T ss_pred CC-cchhhhHHHHHHHHHhhhhhhhccc------ccc-ccCcCccccchhHHHHHHHHHHhhchhhhhcceeeccCCceE Confidence 11 11122222222211100 011111 011 111111111111222345555566677888888888876533 Q ss_pred eeeeeeeecCCCCcccccccCCccccccccccccccccccCccccCCCccccccccccccccccccccccchhhhhccCC Q lcl|Aclame:pro 114 IFAMRSRYENQAGEEALFNEPDTGFTGGYDASQGDYAVRTGAGVGGDSEGNNPALLNDAAPGTYEVGSKMPREDLERMGE 193 (468) Q Consensus 114 IFAMRsrY~~qsG~EA~fnEa~t~fSg~~~~~~~~~~~~~~~~~~~~~~gt~~~~~~~a~~~~~t~~~gm~Ta~aE~lG~ 193 (468) |.-.. .+.+ ..| . +| T Consensus 73 ~p~~~------~~~~-------a~~----------------------------------------------v--~E---- 87 (324) T protein:vir:96 73 FTFWA------DKPG-------AYW----------------------------------------------V--GE---- 87 (324) T ss_pred EEEEe------cCcc-------eeE----------------------------------------------e--cC---- Confidence 22110 0000 000 0 01 Q ss_pred CCcchhhcceEEEEEEEEeecccccccccHHHHHHHHHhcCCChhHHHHHHHHHHHHHHhhHHHHHHHhhhhhccccccc Q lcl|Aclame:pro 194 ANRLFREMSFSIEKTSVTAQSRALKAEYTLELAQDLKAIHGLDAEQELANILSSEVLAEINREVVRRVYTVAKKGAQNNV 273 (468) Q Consensus 194 ~~~~f~EMaFsIeK~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAE~ELanILStEImlEINREii~~l~~va~~~k~~~~ 273 (468) +..+++...+++++++..+.-+.-..+|-||.+|-. .|.+++|.+-|+..|...|++-+|.---+. . T Consensus 88 -g~~~~~~~~~~~~v~~~~~k~~~~~~is~ell~ds~----~~l~~~i~~~la~ai~~~~d~a~l~G~g~~--------~ 154 (324) T protein:vir:96 88 -GQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTY----SQFFEEMKPMIAEAFYKKFDEAGILNQGNN--------P 154 (324) T ss_pred -CccccccccceeEEEEeeEEEEEeehhhHHHHhcch----HHHHHHHHHHHHHHHHHHHHHHHhccCCCC--------C Confidence 123444455555666666666666679999999864 578999999999999999999887532111 0 Q ss_pred ccccccccccc----ccchhHHHHHHHHHHHHHHHHHHHHHhhcCCCccEEEEchhHHHHHHhhcccccccccccccccc Q lcl|Aclame:pro 274 ANAGIFDLDVD----SNGRWSVEKFKGLLFQVERDANAIAQETRRGKGNFLICSADVASALAMAGVLDYSSGLNGAGGPS 349 (468) Q Consensus 274 ~~~g~~Dl~~~----~~grw~~e~~k~L~~~i~~ean~i~~~T~r~~~n~~v~S~~Va~~L~~sG~~~~~~~~~~~~~~~ 349 (468) ...|+...... ..+-...+....++.++. ..-...+.+|+||+....|.... + + ++ .. T Consensus 155 ~~~gi~~~~~~~~~~~~~~~t~~~i~~~~~~l~---------~~~~~~~~~vmn~~~~~~L~~l~--d-~---~G---~~ 216 (324) T protein:vir:96 155 FGKSIAQSIEKTNKVIKGDFTQDNIIDLEALLE---------DDELEANAFISKTQNRSLLRKIV--D-P---ET---KE 216 (324) T ss_pred cCccccccccccceeccccccHHHHHHHHHhhh---------hccCCCCEEEEcHHHHHHHHHhh--c-c---CC---Ce Confidence 11111111000 111111233333333222 22345557899999999987631 1 1 11 11 Q ss_pred cccccccCceeEEEecCceEEEEcccccccCCcceEEEEEecCCcccceeEeeccchhhccc--------ccCCc----- Q lcl|Aclame:pro 350 IGEVDDTGNLAVGTINGRIKVFVDPYAANLSDKHYYVIGYKGTSPYDAGLFYCPYVPLQMVR--------SIDPN----- 416 (468) Q Consensus 350 ~~~~d~t~~~~~G~l~g~~~vy~D~Ya~~~~~~dY~~vG~KG~~~~d~glfyaPYv~l~~~~--------~~dp~----- 416 (468) ++. +..+ ++|. +++|++++... .+..-+++|-. +.+++...-...+.. ..|+. T Consensus 217 ~~~-~~~~----~~l~-G~PV~~~~~~~--~~~~~~~~gd~------~~~~~g~~~~~~i~~~~~~~~~~~~~~~~~~~~ 282 (324) T protein:vir:96 217 RIY-DRNS----DSLD-GLPVVNLKSSN--LKRGELITGDF------DKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVN 282 (324) T ss_pred eec-CCCC----Cccc-ceeeEeeCCCC--CCcceEEEEec------ceEEEEEecCcEEEEeecccccccccccccchh Confidence 111 1111 3343 46777765321 22223333311 111111111111100 00111 Q ss_pred cc---cceeeeeeeeeeee-cC--cccccCcc-ccccchhhh Q lcl|Aclame:pro 417 TF---QPKIGFKTRYGMVS-NP--FVTTNGLY-NGTPDGEAL 451 (468) Q Consensus 417 s~---qP~~g~~tRY~l~~-nP--~~~~~~~~-~~~~~~~~~ 451 (468) -| |=.+=...||+..+ +| |+...... ....+..+. T Consensus 283 ~f~~d~~~~r~~~r~d~~v~~~~A~~~l~~a~~~~~~~~~~~ 324 (324) T protein:vir:96 283 LFEQDMVALRATMHVALHIADDKAFAKLVPADKRTDSVPGEV 324 (324) T ss_pred hhhcCcEEEEEEEEEccEEecccceEEEecccccCCCCCCCC Confidence 01 12222334665543 34 33221111 111111121 No 90 >protein:vir:78830 Length: 324 # NCBI annotation: major head protein # Family: family:all:507 # MgeID: mge:1858 # MgeName: 80alpha # Cross-refs: genbank:acc:YP_001285361;genbank:gi:148717889;genbank:GeneID:5246961 Probab=52.91 E-value=0.54 Score=22.04 Aligned_cols=298 Identities=10% Similarity=0.033 Sum_probs=119.8 Q ss_pred HhHHHHHHhhhhhhhhhhhh--hhcCcccccccccccccccccccccccceehhhhHHhhhhhhhhheeeeecCCcccee Q lcl|Aclame:pro 36 LENQERFLREERGMLNEVAV--NSLGAGTIAPAGSALGSANTGGLAGFDPVLISLVRRAMPNLMAYDVCGVQPMSGPTGL 113 (468) Q Consensus 36 lenq~~~~~~~~~~l~e~~~--~~~~~~~~~~~~~i~~st~tg~i~~~~P~Lv~l~RRa~~~LI~~DI~GVQPmTGPTGL 113 (468) +| |.+.++++.++...... +++.+. ... .+.+++..--....-.+++.+.......+++-+-||++++-- T Consensus 1 ~~-~~~~~~~~~~~~~~~~~~~~~~~a~------~~~-~~~~~~~~iP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~ 72 (324) T protein:vir:78 1 ME-QTQKLKLNLQHFASNNVKPQVFNPD------NVM-MHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPMEGTEKK 72 (324) T ss_pred CC-cchhhhHHHHHHHHHhhhhhhhccc------ccc-ccCcCccccchhHHHHHHHHHHhhchhhhhcceeeccCCceE Confidence 11 11122222222211100 011111 011 111111111111222345555566677888888888876533 Q ss_pred eeeeeeeecCCCCcccccccCCccccccccccccccccccCccccCCCccccccccccccccccccccccchhhhhccCC Q lcl|Aclame:pro 114 IFAMRSRYENQAGEEALFNEPDTGFTGGYDASQGDYAVRTGAGVGGDSEGNNPALLNDAAPGTYEVGSKMPREDLERMGE 193 (468) Q Consensus 114 IFAMRsrY~~qsG~EA~fnEa~t~fSg~~~~~~~~~~~~~~~~~~~~~~gt~~~~~~~a~~~~~t~~~gm~Ta~aE~lG~ 193 (468) |.-.. .+.+ ..| . +| T Consensus 73 ~p~~~------~~~~-------a~~----------------------------------------------v--~E---- 87 (324) T protein:vir:78 73 FTFWA------DKPG-------AYW----------------------------------------------V--GE---- 87 (324) T ss_pred EEEEe------cCcc-------eeE----------------------------------------------e--cC---- Confidence 22110 0000 000 0 01 Q ss_pred CCcchhhcceEEEEEEEEeecccccccccHHHHHHHHHhcCCChhHHHHHHHHHHHHHHhhHHHHHHHhhhhhccccccc Q lcl|Aclame:pro 194 ANRLFREMSFSIEKTSVTAQSRALKAEYTLELAQDLKAIHGLDAEQELANILSSEVLAEINREVVRRVYTVAKKGAQNNV 273 (468) Q Consensus 194 ~~~~f~EMaFsIeK~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAE~ELanILStEImlEINREii~~l~~va~~~k~~~~ 273 (468) +..+++...+++++++..+.-+.-..+|-||.+|-. .|.+++|.+-|+..|...|++-+|.---+. . T Consensus 88 -g~~~~~~~~~~~~v~~~~~k~~~~~~is~ell~ds~----~~l~~~i~~~la~ai~~~~d~a~l~G~g~~--------~ 154 (324) T protein:vir:78 88 -GQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTY----SQFFEEMKPMIAEAFYKKFDEAGILNQGNN--------P 154 (324) T ss_pred -CccccccccceeEEEEeeEEEEEeehhhHHHHhcch----HHHHHHHHHHHHHHHHHHHHHHHhccCCCC--------C Confidence 123444455555666666666666679999999864 578999999999999999999887532111 0 Q ss_pred ccccccccccc----ccchhHHHHHHHHHHHHHHHHHHHHHhhcCCCccEEEEchhHHHHHHhhcccccccccccccccc Q lcl|Aclame:pro 274 ANAGIFDLDVD----SNGRWSVEKFKGLLFQVERDANAIAQETRRGKGNFLICSADVASALAMAGVLDYSSGLNGAGGPS 349 (468) Q Consensus 274 ~~~g~~Dl~~~----~~grw~~e~~k~L~~~i~~ean~i~~~T~r~~~n~~v~S~~Va~~L~~sG~~~~~~~~~~~~~~~ 349 (468) ...|+...... ..+-...+....++.++. ..-...+.+|+||+....|.... + + ++ .. T Consensus 155 ~~~gi~~~~~~~~~~~~~~~t~~~i~~~~~~l~---------~~~~~~~~~vmn~~~~~~L~~l~--d-~---~G---~~ 216 (324) T protein:vir:78 155 FGKSIAQSIEKTNKVIKGDFTQDNIIDLEALLE---------DDELEANAFISKTQNRSLLRKIV--D-P---ET---KE 216 (324) T ss_pred cCccccccccccceeccccccHHHHHHHHHhhh---------hccCCCCEEEEcHHHHHHHHHhh--c-c---CC---Ce Confidence 11111111000 111111233333333222 22345557899999999987631 1 1 11 11 Q ss_pred cccccccCceeEEEecCceEEEEcccccccCCcceEEEEEecCCcccceeEeeccchhhccc--------ccCCc----- Q lcl|Aclame:pro 350 IGEVDDTGNLAVGTINGRIKVFVDPYAANLSDKHYYVIGYKGTSPYDAGLFYCPYVPLQMVR--------SIDPN----- 416 (468) Q Consensus 350 ~~~~d~t~~~~~G~l~g~~~vy~D~Ya~~~~~~dY~~vG~KG~~~~d~glfyaPYv~l~~~~--------~~dp~----- 416 (468) ++. +..+ ++|. +++|++++... .+..-+++|-. +.+++...-...+.. ..|+. T Consensus 217 ~~~-~~~~----~~l~-G~PV~~~~~~~--~~~~~~~~gd~------~~~~~g~~~~~~i~~~~~~~~~~~~~~~~~~~~ 282 (324) T protein:vir:78 217 RIY-DRNS----DSLD-GLPVVNLKSSN--LKRGELITGDF------DKLIYGIPQLIEYKIDETAQLSTVKNEDGTPVN 282 (324) T ss_pred eec-CCCC----Cccc-ceeeEeeCCCC--CCcceEEEEec------ceEEEEEecCcEEEEeecccccccccccccchh Confidence 111 1111 3343 46777765321 22223333311 111111111111100 00111 Q ss_pred cc---cceeeeeeeeeeee-cC--cccccCcc-ccccchhhh Q lcl|Aclame:pro 417 TF---QPKIGFKTRYGMVS-NP--FVTTNGLY-NGTPDGEAL 451 (468) Q Consensus 417 s~---qP~~g~~tRY~l~~-nP--~~~~~~~~-~~~~~~~~~ 451 (468) -| |=.+=...||+..+ +| |+...... ....+..+. T Consensus 283 ~f~~d~~~~r~~~r~d~~v~~~~A~~~l~~a~~~~~~~~~~~ 324 (324) T protein:vir:78 283 LFEQDMVALRATMHVALHIADDKAFAKLVPADKRTDSVPGEV 324 (324) T ss_pred hhhcCcEEEEEEEEEccEEecccceEEEecccccCCCCCCCC Confidence 01 12222334665543 34 33221111 111111121 No 91 >protein:vir:100884 Length: 389 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:1473 # MgeName: Lc-Nu # Cross-refs: genbank:acc:YP_358764;genbank:gi:78000028;genbank:GeneID:3726155 Probab=52.80 E-value=0.54 Score=22.03 Aligned_cols=330 Identities=12% Similarity=0.142 Sum_probs=119.4 Q ss_pred CcchHHHHHhhhhhhCCCccchhcchhhhHHH-HHHHhHHH--------------HHHhhhhhhhhhhhhhhcCcccccc Q lcl|Aclame:pro 1 MFNAEHLQEKWSPVLNHGEAPAIGDRYKRAVT-SVLLENQE--------------RFLREERGMLNEVAVNSLGAGTIAP 65 (468) Q Consensus 1 ~~~~~~l~~kw~p~l~~~~~~~i~~~~~~~~~-~~llenq~--------------~~~~~~~~~l~e~~~~~~~~~~~~~ 65 (468) +=..++|.+++.-+.+. +-++....++.-. .....++. +.....+..+.+. +. +...- T Consensus 33 ~e~~~~l~~ei~~~~~~--~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----lr-~~~~~ 105 (389) T protein:vir:10 33 VDDFQKIKDDLTAAKAR--RDAINDQIKALEAEKPAEPKTEPKDDGSKKGTDLSKKPIDAKKKAINDF----IH-SHGKV 105 (389) T ss_pred HHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHhhhhccccccccccccccchhHHHHHHHHHHHH----hh-cchhh Confidence 11111222233222111 0000000000000 00000000 0000000001110 00 00001 Q ss_pred ccccccccc-ccccccccceehhhhHHhhhhhhhhheeeeecCCccceeeeeeeeeecCCCCcccccccCCccccccccc Q lcl|Aclame:pro 66 AGSALGSAN-TGGLAGFDPVLISLVRRAMPNLMAYDVCGVQPMSGPTGLIFAMRSRYENQAGEEALFNEPDTGFTGGYDA 144 (468) Q Consensus 66 ~~~i~~st~-tg~i~~~~P~Lv~l~RRa~~~LI~~DI~GVQPmTGPTGLIFAMRsrY~~qsG~EA~fnEa~t~fSg~~~~ 144 (468) ...++++++ .|++.--....-.++++..+..+..+++.|.||+++++-+--++. .+ +.-+.. T Consensus 106 ~~~~~~~t~~~gg~~vP~~~~~~i~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~--~~--~~~~~~------------- 168 (389) T protein:vir:10 106 IDATSKVTSTEAGVLIPEEIIYDPTAEVNSVVDLSTLVTKTPVTTPKGTYPILKR--AT--DRFSSV------------- 168 (389) T ss_pred hhhhcccccCCcceeehHHHHHHHHHHHHhhhhHHhhcceeeccCCeeEEEEEec--CC--Cccccc------------- Confidence 111122222 222211111122345555566777899999999988765443331 00 000000 Q ss_pred cccccccccCccccCCCccccccccccccccccccccccchhhhhccCCCCcchhhcceEEEEEEEEeecccccccccHH Q lcl|Aclame:pro 145 SQGDYAVRTGAGVGGDSEGNNPALLNDAAPGTYEVGSKMPREDLERMGEANRLFREMSFSIEKTSVTAQSRALKAEYTLE 224 (468) Q Consensus 145 ~~~~~~~~~~~~~~~~~~gt~~~~~~~a~~~~~t~~~gm~Ta~aE~lG~~~~~f~EMaFsIeK~tVtAKSRaLKAEYT~E 224 (468) +..++.-..+...|.+..+++.|.. --..+|-| T Consensus 169 ----------------------------------------~E~~~~~~~~~~~~~~i~~~~~k~~-------~~~~iS~e 201 (389) T protein:vir:10 169 ----------------------------------------AELAENPKLAEPEFNKVDWSVATYR-------GAIPLSEE 201 (389) T ss_pred ----------------------------------------cccccccccccccceeeeeeheeeE-------eeehhhHH Confidence 0000000001224555666665554 44568999 Q ss_pred HHHHHHHhcCCChhHHHHHHHHHHHHHHhhHHHHHHHhhhhhccccccccccccccccccccchhHHHHHHHHHHHHHHH Q lcl|Aclame:pro 225 LAQDLKAIHGLDAEQELANILSSEVLAEINREVVRRVYTVAKKGAQNNVANAGIFDLDVDSNGRWSVEKFKGLLFQVERD 304 (468) Q Consensus 225 LAQDLkAiHGLDAE~ELanILStEImlEINREii~~l~~va~~~k~~~~~~~g~~Dl~~~~~grw~~e~~k~L~~~i~~e 304 (468) |.+|- ..|.+++|.+-|...+..-+|..|+.-+-. +...++ .+... ......++ ... T Consensus 202 ll~ds----~~~l~~~i~~~la~~~~~~~~~~i~~g~~~----~~~~~~--~~~~~----------~d~l~~~~-~~~-- 258 (389) T protein:vir:10 202 AIADS----AVDLTALVGQSIKEKSVNTYNAMIAPVLQS----FTAKKT--TTDTL----------VDSLKHIL-NVD-- 258 (389) T ss_pred HHhhh----hHHHHHHHHHHHHHHHHHHHHHHHhhhhcc----cccccc--ccccc----------HHHHHHHH-Hhh-- Confidence 99984 346788899999988888888888754321 111111 11111 11122221 111 Q ss_pred HHHHHHhhcCCCccEEEEchhHHHHHHhh----cccccccccccccccccccccccCceeEEEecCceEEE-Ec-ccccc Q lcl|Aclame:pro 305 ANAIAQETRRGKGNFLICSADVASALAMA----GVLDYSSGLNGAGGPSIGEVDDTGNLAVGTINGRIKVF-VD-PYAAN 378 (468) Q Consensus 305 an~i~~~T~r~~~n~~v~S~~Va~~L~~s----G~~~~~~~~~~~~~~~~~~~d~t~~~~~G~l~g~~~vy-~D-~Ya~~ 378 (468) ... .+ ..-+||++.....|... |.+-+.|+.. +.+.....++|.| ++|| +| .+... T Consensus 259 -----~~~-~~-~a~~~~n~~~~~~L~~lkd~~G~~i~~~~~~----------~~~~~~~~~~l~G-~pV~~~~~~~~~~ 320 (389) T protein:vir:10 259 -----LDP-AY-SRALVVTQSLFNTLDTLKDKNGRYLLHDASD----------SITDGTAKGTILG-VPVYVVGDTLLGS 320 (389) T ss_pred -----hhh-hh-CcEEEecHHHHHHHHHhhccCCCeeeecCcc----------ccccccccccccc-ceeEEecccccCC Confidence 111 12 24578999998888863 2111111110 1111222346654 4554 33 21111 Q ss_pred cCCcceEEEEEecCCcccceeEeeccchhhcccccCCccccceeeeeeeeeeee-cC--cccc--cCccccccchhhhhh Q lcl|Aclame:pro 379 LSDKHYYVIGYKGTSPYDAGLFYCPYVPLQMVRSIDPNTFQPKIGFKTRYGMVS-NP--FVTT--NGLYNGTPDGEALTP 453 (468) Q Consensus 379 ~~~~dY~~vG~KG~~~~d~glfyaPYv~l~~~~~~dp~s~qP~~g~~tRY~l~~-nP--~~~~--~~~~~~~~~~~~~~~ 453 (468) ....-.+++|=- ..+..+...-.+. ....|-..|.-.+...-|++..+ || |... .......+ ++ T Consensus 321 ~~~~~~~~~gd~-----~~~~~~~~~~~~~-i~~~~~~~~~~~~~~~~r~d~~~~~~~a~~~~~~~~~~~~~~-----~~ 389 (389) T protein:vir:10 321 LAGDQKAFVGDL-----KRGVLFTDRQQVT-LAWEDSKIYGKYLGAAFRFGVQKADSKAGYFVTNTDVPGSAL-----GK 389 (389) T ss_pred CCCceEEEEeec-----cccEEEEeecceE-EEeeccccccceEEEEEEeccEEecccceEEEEeeccCCCCC-----CC Confidence 111111222200 0000000000000 11123344455666667888643 34 2211 11111111 11 No 92 >protein:vir:80684 Length: 315 # NCBI annotation: gp6 # Family: family:all:966 # MgeID: mge:1884 # MgeName: PA6 # Cross-refs: genbank:acc:YP_001285582;genbank:gi:148727088;genbank:GeneID:5247055 Probab=50.88 E-value=0.6 Score=21.81 Aligned_cols=280 Identities=14% Similarity=0.057 Sum_probs=113.2 Q ss_pred ccc-cccccccccccceeh-hhhHHhhhhhhhhheeeeecCCccceeeeeeeeeecCCCCcccccccCCccccccccccc Q lcl|Aclame:pro 69 ALG-SANTGGLAGFDPVLI-SLVRRAMPNLMAYDVCGVQPMSGPTGLIFAMRSRYENQAGEEALFNEPDTGFTGGYDASQ 146 (468) Q Consensus 69 i~~-st~tg~i~~~~P~Lv-~l~RRa~~~LI~~DI~GVQPmTGPTGLIFAMRsrY~~qsG~EA~fnEa~t~fSg~~~~~~ 146 (468) .++ ++++|+.. .-+.+. .+++++.+..+...++-|-||.+.. +-|-.. .+ +.+| .| T Consensus 1 Ma~~~~~~gg~~-vP~~~~~~ii~~l~~~s~i~~l~~~i~~~~~~-~~ip~~---~~--~~~a-------~w-------- 58 (315) T protein:vir:80 1 MADDFLSAGKLE-LPGSMIGAVRDRAIDSGVLAKLSPEQPTIFGP-VKGAVF---SG--VPRA-------KI-------- 58 (315) T ss_pred CCCCcCCcCceE-cchHHHHHHHHHHHhhchhhhhcceeecCCCc-eEEEEE---eC--Ccce-------EE-------- Confidence 122 22233332 222222 2455555666778888888886542 222221 00 0000 00 Q ss_pred cccccccCccccCCCccccccccccccccccccccccchhhhhccCCCCcchhhcceEEEEEEEEeecccccccccHHHH Q lcl|Aclame:pro 147 GDYAVRTGAGVGGDSEGNNPALLNDAAPGTYEVGSKMPREDLERMGEANRLFREMSFSIEKTSVTAQSRALKAEYTLELA 226 (468) Q Consensus 147 ~~~~~~~~~~~~~~~~gt~~~~~~~a~~~~~t~~~gm~Ta~aE~lG~~~~~f~EMaFsIeK~tVtAKSRaLKAEYT~ELA 226 (468) .+ | +..+++...++++++..+|.=+-....|-||. T Consensus 59 --------------------------------------v~--E-----g~~~~~s~~~f~~v~l~~~kl~~~~~iS~ell 93 (315) T protein:vir:80 59 --------------------------------------VG--E-----GEVKPSASVDVSAFTAQPIKVVTQQRVSDEFM 93 (315) T ss_pred --------------------------------------ee--C-----CccccccccceeeeEeeeeeEEeeehhhHHHh Confidence 00 1 12233334444445554444444567899998 Q ss_pred HHHHHhcCCChhHHHHHHHHHHHHHHhhHHHHHHHhhhhhccccccccccccccc-------cccccchhHHHHHHHHHH Q lcl|Aclame:pro 227 QDLKAIHGLDAEQELANILSSEVLAEINREVVRRVYTVAKKGAQNNVANAGIFDL-------DVDSNGRWSVEKFKGLLF 299 (468) Q Consensus 227 QDLkAiHGLDAE~ELanILStEImlEINREii~~l~~va~~~k~~~~~~~g~~Dl-------~~~~~grw~~e~~k~L~~ 299 (468) +|-. .|+..+|.++|..++...|.|.+=+.++.=...+. +-...|+... .+..+.-| ..+..++. T Consensus 94 ~~s~----~~~~~~l~~~i~~~la~ai~~~~d~a~~~G~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~--~d~~~~~~ 165 (315) T protein:vir:80 94 WADA----DYRLGVLQDLISPALGASIGRAVDLIAFHGIDPAT--GKAASAVHTSLNKTKNIVDATDSAT--ADLVKAVG 165 (315) T ss_pred hcCc----hhHHHHHHHHHHHHHHHHHHHHHhhheeeccCCCC--Cccccccccccccccceeeccccch--HHHHHHHH Confidence 8843 46777787777777777777766555553211100 0011111100 00011111 11112212 Q ss_pred HHHHHHHHHHHhhcCCCccEEEEchhHHHHHHhhcccccccccccccccccccccccCceeEEEecCceEEEEccccccc Q lcl|Aclame:pro 300 QVERDANAIAQETRRGKGNFLICSADVASALAMAGVLDYSSGLNGAGGPSIGEVDDTGNLAVGTINGRIKVFVDPYAANL 379 (468) Q Consensus 300 ~i~~ean~i~~~T~r~~~n~~v~S~~Va~~L~~sG~~~~~~~~~~~~~~~~~~~d~t~~~~~G~l~g~~~vy~D~Ya~~~ 379 (468) + +..... ...+-.|++|+....|... ....+...++. ....-...+. .|+|. +++|+++.++... T Consensus 166 ~-------~~~~~~-~~~~~~imn~~~~~~L~~l---~~~~g~~~~g~-~~~~~~~~g~--~~tl~-G~PV~~~~~~~~~ 230 (315) T protein:vir:80 166 L-------IAGAGL-QVPNGVALDPAFSFALSTE---VYPKGSPLAGQ-PMYPAAGFAG--LDNWR-GLNVGASSTVSGA 230 (315) T ss_pred H-------HhhccC-ccceEEEEcHHHHHHHHHH---hhccCCccccc-ccccccccCC--Cceec-ceeeEecCcCCcc Confidence 2 211111 2334588999999998753 11111111100 0000001111 25675 4788877654321 Q ss_pred C-------------CcceEEEEEecCCcccceeEeeccchhhcccccCCc----c-ccc-eeeee--eeeeee-ecC--c Q lcl|Aclame:pro 380 S-------------DKHYYVIGYKGTSPYDAGLFYCPYVPLQMVRSIDPN----T-FQP-KIGFK--TRYGMV-SNP--F 435 (468) Q Consensus 380 ~-------------~~dY~~vG~KG~~~~d~glfyaPYv~l~~~~~~dp~----s-~qP-~~g~~--tRY~l~-~nP--~ 435 (468) . ++.++.+|+.+...+ -..+| .|++ + ||. .++|+ .|+|.. .+| | T Consensus 231 ~~~~~~~~~~~~~GDfs~~~~g~~~~~~i----~i~~~--------~~~~~~~~~~~~~~~v~~r~~~r~~~~v~~~~a~ 298 (315) T protein:vir:80 231 PEMSPASGVKAIVGDFSRVHWGFQRNFPI----ELIEY--------GDPDQTGRDLKGHNEVMVRAEAVLYVAIESLDSF 298 (315) T ss_pred cccccccccEEEEeecccEEEEEecCeeE----EEecc--------ccccCcccchhhcCcEEEEEEEEecceeecccce Confidence 1 111222333322211 12222 1111 1 221 13333 455543 555 4 Q ss_pred ccccC--ccccccchhh Q lcl|Aclame:pro 436 VTTNG--LYNGTPDGEA 450 (468) Q Consensus 436 ~~~~~--~~~~~~~~~~ 450 (468) +.-.. .+-..|.+++ T Consensus 299 ~~l~~~~a~~~~~~~~~ 315 (315) T protein:vir:80 299 AVVKEKAAPKPNPPAEN 315 (315) T ss_pred EEEeeccCCCCCCCCCC Confidence 43221 1222344444 No 93 >protein:vir:80930 Length: 278 # NCBI annotation: Cps # Family: family:all:522 # MgeID: mge:1886 # MgeName: A500 # Cross-refs: genbank:acc:YP_001468392;genbank:gi:157324966;genbank:GeneID:5601363 Probab=49.84 E-value=0.63 Score=21.70 Aligned_cols=275 Identities=12% Similarity=0.054 Sum_probs=111.3 Q ss_pred CCcc-ceeeeeeeeeecCCCCcccccccCCccccccccccccccccccCccccCCCccccccccccccccccccccccch Q lcl|Aclame:pro 107 MSGP-TGLIFAMRSRYENQAGEEALFNEPDTGFTGGYDASQGDYAVRTGAGVGGDSEGNNPALLNDAAPGTYEVGSKMPR 185 (468) Q Consensus 107 mTGP-TGLIFAMRsrY~~qsG~EA~fnEa~t~fSg~~~~~~~~~~~~~~~~~~~~~~gt~~~~~~~a~~~~~t~~~gm~T 185 (468) |--+ |-+ .+.--.|-|-+.....+-.. ....... .......+ .++ ...++..--.. T Consensus 1 Ma~~~T~~--------~~~iiPev~s~~v~~~~~~~----~v~~~~~---~~~~~l~g-~~G-------~tv~ip~~~~~ 57 (278) T protein:vir:80 1 MADLTTKL--------ANLIDPEVMGPMISAKLPKA----IKFGKIA---PIDNSLEG-QPG-------SEITVPKYKYI 57 (278) T ss_pred CCCcceeh--------hheecHHHHHHHHHHHHHHh----hhhcccc---eecccccC-CCC-------CEEEEeeeccC Confidence 1100 000 00000010000000000000 0000000 00000000 000 01111100011 Q ss_pred hhhhccCCCCcchhhcceEEEEEEEEeecccccccccHHHHHHHHH-hcCCChhHHHHHHHHHHHHHHhhHHHHHHHhhh Q lcl|Aclame:pro 186 EDLERMGEANRLFREMSFSIEKTSVTAQSRALKAEYTLELAQDLKA-IHGLDAEQELANILSSEVLAEINREVVRRVYTV 264 (468) Q Consensus 186 a~aE~lG~~~~~f~EMaFsIeK~tVtAKSRaLKAEYT~ELAQDLkA-iHGLDAE~ELanILStEImlEINREii~~l~~v 264 (468) .++|.+.++ ..+..-..+..+.+++-|-|+- + ++ .-|+.+ .-+-|.-.+..+-++..+..+++++++..|... T Consensus 58 g~a~~~~~g-~~i~~~~lt~~~~~~~i~~~~~-a---~~-v~D~~~~~~~~d~~~~~~~~~a~~~a~~~d~~l~~~l~~a 131 (278) T protein:vir:80 58 GDAQDVAEG-AAIDYSALETESVKHGIKKAGK-G---VK-LTDESVLSGYGDPVEEAQKQIRMAIASKVDNDILEEALTT 131 (278) T ss_pred CcceeecCC-CcCcccccccceeeEeeehhhc-c---cc-ccHHHHhhccccHHHHHHHHHHHHHHHHHHHHHHHHHhcc Confidence 122333321 2333334455666666666652 2 22 234444 346789999999999999999999999887653 Q ss_pred hhccccccccccccccccccccchhHHHHHHHHHHHHHHHHHHHHHhhcCCCccEEEEchhHHHHHHhhccccccccccc Q lcl|Aclame:pro 265 AKKGAQNNVANAGIFDLDVDSNGRWSVEKFKGLLFQVERDANAIAQETRRGKGNFLICSADVASALAMAGVLDYSSGLNG 344 (468) Q Consensus 265 a~~~k~~~~~~~g~~Dl~~~~~grw~~e~~k~L~~~i~~ean~i~~~T~r~~~n~~v~S~~Va~~L~~sG~~~~~~~~~~ 344 (468) ... +...-+.|..+. +.+.+-++.-++. ... --...+++++|.+.+.|.......+...... T Consensus 132 ~~~-----~~~~~t~~~~~~-----~~~~~~da~~~l~-------~~~-~~~~~~ivv~p~~~~~L~k~~~~~~~~~~~~ 193 (278) T protein:vir:80 132 TLE-----VKGAINIGLIDK-----IENTFTDAPDAIE-------DES-ITTTGVLFLNYKDTAKLREEAAGSWTKASQL 193 (278) T ss_pred ccc-----cccccccchhhh-----HHHHHHHHHHhhc-------ccC-CCcccEEEECHHHHHHHHhhhhhhccccccc Confidence 221 111112221110 1111111111111 111 1123489999999999987654444433221 Q ss_pred ccccccccccccCceeEEEecCceEEEEcccccccCCcceEEEEEecCCcccceeEeeccchhhcccccCCccccceeee Q lcl|Aclame:pro 345 AGGPSIGEVDDTGNLAVGTINGRIKVFVDPYAANLSDKHYYVIGYKGTSPYDAGLFYCPYVPLQMVRSIDPNTFQPKIGF 424 (468) Q Consensus 345 ~~~~~~~~~d~t~~~~~G~l~g~~~vy~D~Ya~~~~~~dY~~vG~KG~~~~d~glfyaPYv~l~~~~~~dp~s~qP~~g~ 424 (468) ..+ .-. +-.+|++ .|++||++ ++-|. +-.+-++ ... =.|+..= +...-.--|+..++-.+-. T Consensus 194 g~~-----~~~--~G~ig~~-~G~~Vi~s----~~~p~-~t~~l~~-~gA---i~~~~~~-~~~vE~~Rd~~~~~d~i~~ 255 (278) T protein:vir:80 194 GDD-----LLV--KGAFGEL-LGWEIVRT----KKLAD-GNALAVK-AGA---LKTFLKR-NLLAESGRDMDHKLTKFNA 255 (278) T ss_pred ccc-----cee--eccceee-cceeEEEc----CCCCc-ceEEEEe-ccc---eeeeecC-Ccccccccchhhccceeee Confidence 111 111 1235777 47899999 55552 2111111 111 0122111 1112222489999998888 Q ss_pred eeeeeeee-cCcccccCccccccchhhhhhhccc Q lcl|Aclame:pro 425 KTRYGMVS-NPFVTTNGLYNGTPDGEALTPNANM 457 (468) Q Consensus 425 ~tRY~l~~-nP~~~~~~~~~~~~~~~~~~~~an~ 457 (468) ..+||+.+ ||-... . +.+.|.. T Consensus 256 ~~~yg~~v~~~~~~v------~-----it~~a~~ 278 (278) T protein:vir:80 256 DQHYAVALVDETKAV------K-----VVPVAGN 278 (278) T ss_pred eeEEEEEEEcCcceE------E-----EeeccCC Confidence 89999875 552210 0 1111111 No 94 >protein:vir:95763 Length: 297 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1578 # MgeName: SMP # Cross-refs: genbank:acc:YP_950590;genbank:gi:119953785;genbank:GeneID:5076833 Probab=49.61 E-value=0.63 Score=21.67 Aligned_cols=273 Identities=13% Similarity=0.090 Sum_probs=114.7 Q ss_pred hhhcCccccccccccccccccccccccccee-hhhhHHhhhhhhhhheeeeecCCccceeeeeeeeeecCCCCccccccc Q lcl|Aclame:pro 55 VNSLGAGTIAPAGSALGSANTGGLAGFDPVL-ISLVRRAMPNLMAYDVCGVQPMSGPTGLIFAMRSRYENQAGEEALFNE 133 (468) Q Consensus 55 ~~~~~~~~~~~~~~i~~st~tg~i~~~~P~L-v~l~RRa~~~LI~~DI~GVQPmTGPTGLIFAMRsrY~~qsG~EA~fnE 133 (468) |+...-++. +.. ++++++.. .-+.+ -.+++.+.+.-+-..++.+.||++++...+-.. .. +.++ T Consensus 1 m~~~~~~~~----~~~-~t~~~~~l-vP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~~~~~--~~---~~~a---- 65 (297) T protein:vir:95 1 MTVQTFNPE----NVL-VSQKKDGT-LHKEFTDIIMKEVAQNSLVMQLGQYQEMEGEQEKTVYVQ--TD---GISA---- 65 (297) T ss_pred CCccccccc----ccc-ccCCCcce-echhHHHHHHHHHHhhchhhhhcceeecCCCccEEEEEE--cC---Ccee---- Confidence 222222211 111 11122211 11222 233444455557778899999988877655332 00 0000 Q ss_pred CCccccccccccccccccccCccccCCCccccccccccccccccccccccchhhhhccCCCCcchhhcceEEEEEEEEee Q lcl|Aclame:pro 134 PDTGFTGGYDASQGDYAVRTGAGVGGDSEGNNPALLNDAAPGTYEVGSKMPREDLERMGEANRLFREMSFSIEKTSVTAQ 213 (468) Q Consensus 134 a~t~fSg~~~~~~~~~~~~~~~~~~~~~~gt~~~~~~~a~~~~~t~~~gm~Ta~aE~lG~~~~~f~EMaFsIeK~tVtAK 213 (468) .| . +| +..+++-..++++++...| T Consensus 66 ---~~----------------------------------------------v--~E-----g~~~~~~~~~f~~v~l~~~ 89 (297) T protein:vir:95 66 ---YW----------------------------------------------V--NE-----TEKIKTDKPEVVPVTLKAH 89 (297) T ss_pred ---EE----------------------------------------------e--ec-----CccccccccceeEEEEeeE Confidence 00 0 01 1233444445566666666 Q ss_pred cccccccccHHHHHHHHHhcCCChhHHHHHHHHHHHHHHhhHHHHHHHhhhhhcccccccccccccccccccc----chh Q lcl|Aclame:pro 214 SRALKAEYTLELAQDLKAIHGLDAEQELANILSSEVLAEINREVVRRVYTVAKKGAQNNVANAGIFDLDVDSN----GRW 289 (468) Q Consensus 214 SRaLKAEYT~ELAQDLkAiHGLDAE~ELanILStEImlEINREii~~l~~va~~~k~~~~~~~g~~Dl~~~~~----grw 289 (468) ..+-...+|.||.+|-. .|.+..|.+-|+..|...+++.||.---+. ...|++....... +.- T Consensus 90 k~~~~~~is~ell~ds~----~~l~~~i~~~la~ai~~~~d~a~l~G~g~~---------~~~gi~~~~~~~~~~~~~~~ 156 (297) T protein:vir:95 90 KLGIILVTSREALNYTW----KKFFEDMKPQIVEAFYKKIDEAGLLGHDTP---------FANSVAKAAKDANKVIGGPI 156 (297) T ss_pred EEEEeehhhHHHHhcCH----HHHHHHHHHHHHHHHHHHHHHHHhcccCCc---------ccccccccccccceeccccc Confidence 66667779999999875 468899999999999999999998421110 0111111110000 000 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhhcCCCccEEEEchhHHHHHHhhcccccccccccccccccccccccCceeEEEecCceE Q lcl|Aclame:pro 290 SVEKFKGLLFQVERDANAIAQETRRGKGNFLICSADVASALAMAGVLDYSSGLNGAGGPSIGEVDDTGNLAVGTINGRIK 369 (468) Q Consensus 290 ~~e~~k~L~~~i~~ean~i~~~T~r~~~n~~v~S~~Va~~L~~sG~~~~~~~~~~~~~~~~~~~d~t~~~~~G~l~g~~~ 369 (468) ..+. |.+....+...- ...+-+||+|+....|... -+ +. +..+++ .. .|+|. +++ T Consensus 157 t~~~-------i~~~~~~l~~~~--~~~~~~v~~~~~~~~L~~l--~d-~~------G~~i~~--~~----~~~l~-G~P 211 (297) T protein:vir:95 157 NYDN-------ILKLQDALYDAD--VEPNAFVSKIQNRSALREA--RD-GN------KVSIYD--KA----ANTID-GIT 211 (297) T ss_pred CHHH-------HHHHHHHhhhcc--CCcCEEEEcHHHHHHHHHh--hc-cC------Cceeec--CC----CCccc-cee Confidence 1122 223333332222 3445689999999998752 11 10 111111 11 13343 345 Q ss_pred EEEcccccccCCc--------ceEEEEEecCCcccceeEeeccchhhcccccCCc----c-cc-ceeee--eeeeeeee- Q lcl|Aclame:pro 370 VFVDPYAANLSDK--------HYYVIGYKGTSPYDAGLFYCPYVPLQMVRSIDPN----T-FQ-PKIGF--KTRYGMVS- 432 (468) Q Consensus 370 vy~D~Ya~~~~~~--------dY~~vG~KG~~~~d~glfyaPYv~l~~~~~~dp~----s-~q-P~~g~--~tRY~l~~- 432 (468) |+.-+... .+. .++++|..++.+.+-. . +.......|+. + || =.++| ..|++..+ T Consensus 212 v~~~~~~~--~~~~~~~~gd~s~~~~~~~~~~~i~~~----~--~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~ 283 (297) T protein:vir:95 212 TVDLKSAR--FEKGDLLAGDFDNLIYGVPYNITYKIS----E--EGQISTITNADGTPINLFEQEMIAIRATMDIAVMIT 283 (297) T ss_pred eEeecCCC--CCCceEEEEecccEEEEEecCeEEEEe----e--ccccccccccCccchhhhhcCcEEEEEEEEeccEee Confidence 55432211 111 2222333332211100 0 00011111211 0 11 11222 24555443 Q ss_pred cC--cccccCccccccc Q lcl|Aclame:pro 433 NP--FVTTNGLYNGTPD 447 (468) Q Consensus 433 nP--~~~~~~~~~~~~~ 447 (468) || |+... ...+- T Consensus 284 ~~~a~~~l~---~at~~ 297 (297) T protein:vir:95 284 KTDAFAKLT---PAERV 297 (297) T ss_pred cccceEEEe---ecCCC Confidence 33 33211 11111 No 95 >protein:vir:101607 Length: 379 # NCBI annotation: major capsid protein precursor # Family: family:all:585 # MgeID: mge:1646 # MgeName: 11b # Cross-refs: genbank:acc:YP_112497;genbank:gi:53793597;uniprot:Q5ZGF6;genbank:GeneID:3101715 Probab=48.41 E-value=0.67 Score=21.54 Aligned_cols=329 Identities=14% Similarity=0.104 Sum_probs=122.5 Q ss_pred cchHHHHHhhhhhhCCCc--------------------cchh----cchhhhHHHHHHHhHHH---HHHhhhhhh----- Q lcl|Aclame:pro 2 FNAEHLQEKWSPVLNHGE--------------------APAI----GDRYKRAVTSVLLENQE---RFLREERGM----- 49 (468) Q Consensus 2 ~~~~~l~~kw~p~l~~~~--------------------~~~i----~~~~~~~~~~~llenq~---~~~~~~~~~----- 49 (468) |+..++.++=.-+++.-. +.+. .+.-+.. .+.|.+..+ +..+...+. T Consensus 1 m~~~e~~~~~~~~~~~l~~~~~~~~~e~~~~~e~~~~~~~~~~~~~~~e~~~~-~~~l~~~~~~~e~~~~~~~~~~~~~~ 79 (379) T protein:vir:10 1 MEALEIKVALEAIKGQVDSKSSAQALEVKGLIEALEAKMTSEKDLAVNELKSD-MAALQAHADKLDVKLKEKAKSEDKSD 79 (379) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHhhHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHhcccccccch Confidence 555555554444433210 0000 0000000 011111100 000000000 Q ss_pred -hhhhhhhhcCcc-----ccccccccccc-cccccccc-----ccceehhhhHHhhhhhhhhheeeeecCCccceeeeee Q lcl|Aclame:pro 50 -LNEVAVNSLGAG-----TIAPAGSALGS-ANTGGLAG-----FDPVLISLVRRAMPNLMAYDVCGVQPMSGPTGLIFAM 117 (468) Q Consensus 50 -l~e~~~~~~~~~-----~~~~~~~i~~s-t~tg~i~~-----~~P~Lv~l~RRa~~~LI~~DI~GVQPmTGPTGLIFAM 117 (468) ..+......... ........+.+ +++.+... +.+.+ ++..-....-.+++.|.||++++.-|.- T Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ip~~~~~~i---i~~~~~~~~i~~~~~~~~~~~~~~~~~~- 155 (379) T protein:vir:10 80 SLVKSITENFNDIKEVRNGKSIQVKAVGDMTLPVNLTGAQPKDYNFDV---VLNPSQMLNVSDIVGAVSISGGTYTFVR- 155 (379) T ss_pred hHHHHHHHHHHhHHHHHhhhhhhhhhhcccccCCCCccccchhhhhHH---HHhHHhhhhHHhhceeeeccCCceEEEE- Confidence 000000000000 00000001111 11122111 22223 3333334456688888888877542211 Q ss_pred eeeecCCCCcccccccCCccccccccccccccccccCccccCCCccccccccccccccccccccccchhhhhccCCCCcc Q lcl|Aclame:pro 118 RSRYENQAGEEALFNEPDTGFTGGYDASQGDYAVRTGAGVGGDSEGNNPALLNDAAPGTYEVGSKMPREDLERMGEANRL 197 (468) Q Consensus 118 RsrY~~qsG~EA~fnEa~t~fSg~~~~~~~~~~~~~~~~~~~~~~gt~~~~~~~a~~~~~t~~~gm~Ta~aE~lG~~~~~ 197 (468) ..+ +.+ ..+ .-.+| +.. T Consensus 156 ------~~~-----------~~~------------------------~~~-----------------~~v~E-----g~~ 172 (379) T protein:vir:10 156 ------ENG-----------AGE------------------------GAI-----------------GAQVE-----GAT 172 (379) T ss_pred ------eec-----------CCC------------------------ccc-----------------ccccC-----Ccc Confidence 000 000 000 00011 223 Q ss_pred hhhcceEEEEEEEEeecccccccccHHHHHHHHHhcCCChhHHHHHHHHHHHHHHhhHHHHHHHhhhhhccccccccccc Q lcl|Aclame:pro 198 FREMSFSIEKTSVTAQSRALKAEYTLELAQDLKAIHGLDAEQELANILSSEVLAEINREVVRRVYTVAKKGAQNNVANAG 277 (468) Q Consensus 198 f~EMaFsIeK~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAE~ELanILStEImlEINREii~~l~~va~~~k~~~~~~~g 277 (468) .+++..++++++..+|.=+--...|-||.||-- +.++.|.+-|+..|+.-+|..++.-+.+.+.-+... . T Consensus 173 ~~~~~~~f~~i~~~~~k~~~~~~iS~ell~D~~-----~l~~~i~~~la~~~~~~~~~~~~~g~~~~~~~~~~~-~---- 242 (379) T protein:vir:10 173 KGQKDYDISMIDVNTDFIAGFTRYSKKMANNLP-----FLTSFIPNALRRDYAKAENAAFNAVLAANATASTEI-I---- 242 (379) T ss_pred ccccccceeeeEeeeeeEEeeehhhHHHHhhHH-----HHHHHHHHHHHHHHHHHHHHHHhccccccccccccc-c---- Confidence 445555555555555555555789999999963 277888898999998888888876554332111111 1 Q ss_pred cccccccccchhHHHHHHHHHHHHHHHHHHHHHhhcCCCccEEEEchhHHHHHHhhcccccccccccccccccccccc-- Q lcl|Aclame:pro 278 IFDLDVDSNGRWSVEKFKGLLFQVERDANAIAQETRRGKGNFLICSADVASALAMAGVLDYSSGLNGAGGPSIGEVDD-- 355 (468) Q Consensus 278 ~~Dl~~~~~grw~~e~~k~L~~~i~~ean~i~~~T~r~~~n~~v~S~~Va~~L~~sG~~~~~~~~~~~~~~~~~~~d~-- 355 (468) .+.- .+...+.+++++. ..-..++-+|++|.....|... ..+- + ....+.+. T Consensus 243 -------~~~~-~~d~i~~~~~~~~---------~~~~~~~~~vmn~~~~~~l~~l---kd~~---G---~~l~~~~~~~ 296 (379) T protein:vir:10 243 -------TNKN-KVEMLINEIAKQE---------NLDFPVTAIVLRPTDYYDILVT---QKSV---G---AGYGLPGVVT 296 (379) T ss_pred -------cCcc-cHHHHHHHHHhhh---------hccCCCCEEEEcHHHHHHHHHh---hccC---C---ceeccCCccC Confidence 1111 1233333333332 1134556788999988887642 1110 0 00111000 Q ss_pred -cCceeEEEecCceEEEEcccccccCCcceEEEEEecCCcccceeEeeccchhhccccc--CCccccceeeeeeeeeeee Q lcl|Aclame:pro 356 -TGNLAVGTINGRIKVFVDPYAANLSDKHYYVIGYKGTSPYDAGLFYCPYVPLQMVRSI--DPNTFQPKIGFKTRYGMVS 432 (468) Q Consensus 356 -t~~~~~G~l~g~~~vy~D~Ya~~~~~~dY~~vG~KG~~~~d~glfyaPYv~l~~~~~~--dp~s~qP~~g~~tRY~l~~ 432 (468) .+.. .+|. |++|+++++... .-+++|=-.. .-+++--=+..+..+.. +-.+-+=.+=+..|+|+.+ T Consensus 297 ~~~~~--~~l~-G~pvv~s~~~~a----g~~~~gdf~~----~~~~~~~~~~i~~~~~~~~~f~~~~~~~r~~~R~~~~v 365 (379) T protein:vir:10 297 QDNGV--LRIN-GIPLFRATWLAA----NKYYVGDWTR----VTKVTTEGLSLEFSEVEGTNFVKNNITARIEAQVALAV 365 (379) T ss_pred CCCCc--ceec-ceeeEecCCCCC----CceEEeeccc----EEEEEEeceEEEEeecccccccCCcEEEEEEEEeccEE Confidence 0111 1343 579999976542 2233321110 01111110001111110 1122222233345776543 Q ss_pred -cCcccccCccccccchhhhhhhcccceeeeeeeec Q lcl|Aclame:pro 433 -NPFVTTNGLYNGTPDGEALTPNANMYYRRVQVTNL 467 (468) Q Consensus 433 -nP~~~~~~~~~~~~~~~~~~~~an~y~~r~~v~~l 467 (468) +|=+ |-++.+..+ T Consensus 366 ~~p~a----------------------~v~~~~~~~ 379 (379) T protein:vir:10 366 EQPAA----------------------LIFGDFTAV 379 (379) T ss_pred ecCcc----------------------EEEEEecCC Confidence 4411 112222222 No 96 >protein:vir:739 Length: 231 # NCBI annotation: major structural protein 4 # Family: family:all:522 # MgeID: mge:14 # MgeName: Tuc2009 # Cross-refs: genbank:acc:NP_108716;genbank:gi:13487838;genbank:GeneID:920884 Probab=47.63 E-value=0.7 Score=21.45 Aligned_cols=218 Identities=16% Similarity=0.126 Sum_probs=95.6 Q ss_pred CccccccccccccccccccccccchhhhhccCCCC-cchhhcceEEEEEEEEeecccccccccHHHHHHHHHhcCCChhH Q lcl|Aclame:pro 161 SEGNNPALLNDAAPGTYEVGSKMPREDLERMGEAN-RLFREMSFSIEKTSVTAQSRALKAEYTLELAQDLKAIHGLDAEQ 239 (468) Q Consensus 161 ~~gt~~~~~~~a~~~~~t~~~gm~Ta~aE~lG~~~-~~f~EMaFsIeK~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAE~ 239 (468) ..+.+.++.. ++.. -..++|.+.++. -+..+|+++= .+++.|-+.=.-++|=|- .|.+ + -|.-. T Consensus 1 ~~~~~~Gdti-------t~P~--~iGda~~v~eG~~i~~~~l~~t~--~~atIk~~gk~~~itD~a--~l~~-~-gDp~~ 65 (231) T protein:vir:73 1 ENGINLANLC-------EYPN--DIGDAADVAEGGEISLDKIGTTT--KSVTIKKAAKGTEITDEA--ALSG-Y-GDPIG 65 (231) T ss_pred CccccCCceE-------Eecc--cccchhhhcCCCcCChhhccccc--eeeeEeeeccceeeeHHH--Hhhc-c-CchHH Confidence 1111111111 1110 023445555432 3455666544 444445543333444322 2444 3 38899 Q ss_pred HHHHHHHHHHHHHhhHHHHHHHhhhhhccccccccccccccccccccchhHHHHHHHHHHHHHHHHHHHHHhhcCCCccE Q lcl|Aclame:pro 240 ELANILSSEVLAEINREVVRRVYTVAKKGAQNNVANAGIFDLDVDSNGRWSVEKFKGLLFQVERDANAIAQETRRGKGNF 319 (468) Q Consensus 240 ELanILStEImlEINREii~~l~~va~~~k~~~~~~~g~~Dl~~~~~grw~~e~~k~L~~~i~~ean~i~~~T~r~~~n~ 319 (468) |..+-|+..|+..++.||+..+-+.+...+ ..+.++ .+-... -.+..| -....+ T Consensus 66 ea~~Q~~~~iA~kvD~di~~~~~~a~l~~~-------~~~t~d-------~i~~A~---~~fgde---------~~~~~v 119 (231) T protein:vir:73 66 ESNKQLGLSLANKVDDDLLKAAKTTSQTVS-------TKANVD-------GVQAAL---DIFNDE---------DAQAYV 119 (231) T ss_pred HHHHHHHHHHHHhhhHHHHHhhcccccccc-------ccccHH-------HHHHHH---HHhccc---------cccceE Confidence 999999999999999999987765443311 111110 111111 112111 256789 Q ss_pred EEEchhHHHHHHhhcccccccccccccccccccccccCceeEEEecCceEEEEcccccccCCcceEEEEEecCCccccee Q lcl|Aclame:pro 320 LICSADVASALAMAGVLDYSSGLNGAGGPSIGEVDDTGNLAVGTINGRIKVFVDPYAANLSDKHYYVIGYKGTSPYDAGL 399 (468) Q Consensus 320 ~v~S~~Va~~L~~sG~~~~~~~~~~~~~~~~~~~d~t~~~~~G~l~g~~~vy~D~Ya~~~~~~dY~~vG~KG~~~~d~gl 399 (468) ++|+|++++-|... .++.. .-.+.++. .=.+|. +|.+. |++|+++ ++-| +++. T Consensus 120 ivv~p~~~~~Lrk~--~~~~~-~~~~~g~~---i~~~G~--iG~i~-G~~Vi~S----~~~~--------------~~~~ 172 (231) T protein:vir:73 120 LIVNPKDAAKIRKD--ANAKN-IGSEVGAN---ALINGT--YADVL-GAQIVRS----KKLA--------------EGSA 172 (231) T ss_pred EEEcchHHHhhhhc--cchhh-hhhhhccc---eeeecc--cceEc-ceEEEEc----CCCC--------------CCce Confidence 99999999998762 11110 01112211 112232 35553 4777777 2222 2334 Q ss_pred Eeeccchhh------cccc------cCCccccceeeeeeeeeeeecCcccccCccccccchhhhhhhcccceeeeeeeec Q lcl|Aclame:pro 400 FYCPYVPLQ------MVRS------IDPNTFQPKIGFKTRYGMVSNPFVTTNGLYNGTPDGEALTPNANMYYRRVQVTNL 467 (468) Q Consensus 400 fyaPYv~l~------~~~~------~dp~s~qP~~g~~tRY~l~~nP~~~~~~~~~~~~~~~~~~~~an~y~~r~~v~~l 467 (468) ++++|+.-. .++. .|+..+.-.+----.|++.. +.++ +. =+..++++ T Consensus 173 ~~~~~i~~~gAl~~~~k~~~~vEtdRd~~~k~~~i~~~~~y~v~l--~~~~-----~v--------------v~~t~~g~ 231 (231) T protein:vir:73 173 LMFKIVSNSPALKLVLKRGVQVETDRDIVTKTTVITADEHYAAYL--YDLT-----KV--------------VNITFTGV 231 (231) T ss_pred eeeeEEeeccceeeeecccceeeccccccccccEEEEeEEEEEEE--EcCc-----cE--------------EEEEeecC Confidence 555654210 1111 14444444444444444432 1110 00 01111222 No 97 >protein:vir:5739 Length: 366 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:122 # MgeName: PY54 # Cross-refs: genbank:acc:NP_892050;genbank:gi:33770513;interpro:IPR006444;uniprot:Q7Y410;genbank:GeneID:1732928 Probab=45.94 E-value=0.75 Score=21.26 Aligned_cols=336 Identities=14% Similarity=0.097 Sum_probs=113.5 Q ss_pred CcchHHHHHhhhh--hhCCCccchhcchh-hhHHHHHHHhHHHHHHhhhhhhhhhhhhhhcCcccccccccccccccccc Q lcl|Aclame:pro 1 MFNAEHLQEKWSP--VLNHGEAPAIGDRY-KRAVTSVLLENQERFLREERGMLNEVAVNSLGAGTIAPAGSALGSANTGG 77 (468) Q Consensus 1 ~~~~~~l~~kw~p--~l~~~~~~~i~~~~-~~~~~~~llenq~~~~~~~~~~l~e~~~~~~~~~~~~~~~~i~~st~tg~ 77 (468) --+.+.-++++.+ .+...+.++-+..- .|.+.+ |...+.+..+.. |.....++.... ...+..++.+|+ T Consensus 3 ~~~a~~~~~~~~~~~~~~~~~~~~~kg~~~~~~~~a-~a~~~g~~~~a~-----~~a~~~~~~~~~--~~a~~~~~~~Gg 74 (366) T protein:vir:57 3 AAVAVPVKAHSVAPGIIIKEELQQYKGAGMTRMVMS-IAAGKGNLADAA-----KFAATELGDTGL--SMAISTAAGSGG 74 (366) T ss_pred ccccccccccccccccccccccccccchhHHHHHHH-HHhcccchhHHH-----HHHHHhhcchhh--hhhccccccCCc Confidence 1111112222211 11111111111100 111211 111111111000 000011111100 111122222222 Q ss_pred cc---cccceehhhhHHhhhhhhhhheeeeecCCccceeeeeeeeeecCCCCcccccccCCccccccccccccccccccC Q lcl|Aclame:pro 78 LA---GFDPVLISLVRRAMPNLMAYDVCGVQPMSGPTGLIFAMRSRYENQAGEEALFNEPDTGFTGGYDASQGDYAVRTG 154 (468) Q Consensus 78 i~---~~~P~Lv~l~RRa~~~LI~~DI~GVQPmTGPTGLIFAMRsrY~~qsG~EA~fnEa~t~fSg~~~~~~~~~~~~~~ 154 (468) .. .....++.+.| +..+...+ |++.+++++|-+-=.| ..+ +.++ T Consensus 75 ~lvP~~~~~~ii~~l~---~~s~l~~l-g~~~v~~~~g~~~~p~--~t~--~~~a------------------------- 121 (366) T protein:vir:57 75 ALIPQNMQNEVIELLR---DRTVVRIL-GARSIPLPNGNLSMPR--LSG--GATA------------------------- 121 (366) T ss_pred cccchhHHHHHHHHHh---hhcchhhh-ceeeeecCCCceEEEE--EeC--Ccce------------------------- Confidence 21 11112222222 22222222 3333333333211111 000 0000 Q ss_pred ccccCCCccccccccccccccccccccccchhhhhccCCCCcchhhcceEEEEEEEEeecccccccccHHHHHHHHHhcC Q lcl|Aclame:pro 155 AGVGGDSEGNNPALLNDAAPGTYEVGSKMPREDLERMGEANRLFREMSFSIEKTSVTAQSRALKAEYTLELAQDLKAIHG 234 (468) Q Consensus 155 ~~~~~~~~gt~~~~~~~a~~~~~t~~~gm~Ta~aE~lG~~~~~f~EMaFsIeK~tVtAKSRaLKAEYT~ELAQDLkAiHG 234 (468) .+ . +| +.++++...+++++++..|.-+-...+|-||.+|-- T Consensus 122 ---------------------~w-------v--~E-----~~~~~~s~~~f~~i~~~~~k~~~~~~iS~ell~ds~---- 162 (366) T protein:vir:57 122 ---------------------GY-------V--GE-----GKDVVATGATFDDVKLSAKTMIALVPVSNQLIGRAG---- 162 (366) T ss_pred ---------------------ee-------e--cc-----CccccccccceeEEEEeeEEEEEeehhhHHHHhhhh---- Confidence 00 0 01 123444555666677777776777789999998753 Q ss_pred CChhHHHHHHHHHHHHHHhhHHHHHHHhhhhhcccccccc-----ccccccccccccchh-HHHHHHHHHHHHHHHHHHH Q lcl|Aclame:pro 235 LDAEQELANILSSEVLAEINREVVRRVYTVAKKGAQNNVA-----NAGIFDLDVDSNGRW-SVEKFKGLLFQVERDANAI 308 (468) Q Consensus 235 LDAE~ELanILStEImlEINREii~~l~~va~~~k~~~~~-----~~g~~Dl~~~~~grw-~~e~~k~L~~~i~~ean~i 308 (468) .|.|+.|.+-|...|...+++.||.-=-+ + .+..|+. ..+.+.... +...| .+...-.++...... T Consensus 163 ~~~~~~i~~~l~~a~~~~~d~a~l~G~G~-~--~~p~Gi~~~~~~~~~~~~~~~-t~~~~~~~~~~~~~~~~~~~~---- 234 (366) T protein:vir:57 163 FNVEQLLLGDILSAIATREDKAFLRDDGT-G--DTPKGMKAVATAANRLVAWTG-TAINLTTIDEYLDSLILKHMD---- 234 (366) T ss_pred HHHHHHHHHHHHHHHHHHHHHHhhccCCC-C--ccccceeeccccccceeeccc-cccchhhHHHHHHHHHHhhhc---- Confidence 46889999999999999888888753110 0 0111110 011111100 00111 111111222211111 Q ss_pred HHhhcCCCccEEEEchhHHHHHHhhcccccccccccccccccccccccCceeEEEecCceEEEEccccccc--------- Q lcl|Aclame:pro 309 AQETRRGKGNFLICSADVASALAMAGVLDYSSGLNGAGGPSIGEVDDTGNLAVGTINGRIKVFVDPYAANL--------- 379 (468) Q Consensus 309 ~~~T~r~~~n~~v~S~~Va~~L~~sG~~~~~~~~~~~~~~~~~~~d~t~~~~~G~l~g~~~vy~D~Ya~~~--------- 379 (468) ......+...|+++.....|... ..+ + +..... +.+ -|+|. +++|+++.+...+ T Consensus 235 --~~~~~~~a~~vmn~~~~~~L~~l---kd~---~---G~~l~~-~~~----~g~l~-G~Pvv~s~~ip~~~~~~~~~~~ 297 (366) T protein:vir:57 235 --SNSNMIRCGWGLSNRTYMTLFGL---RDG---N---GNKVYP-EMS----QGILK-GYPIQRTSAIPANLGDDGNESE 297 (366) T ss_pred --cccccccCEEEecHHHHHHHHhh---hcc---C---Cceecc-CCC----CCeec-ceeeEEccccccccccCCCccE Confidence 12223344567999988888752 111 0 111111 111 25563 5788887543211 Q ss_pred ---CCcceEEEEEecCCcccceeEeeccchhhcccc---cCCccccceeeeeeeeeeee-cCcccccCccccccchhhh Q lcl|Aclame:pro 380 ---SDKHYYVIGYKGTSPYDAGLFYCPYVPLQMVRS---IDPNTFQPKIGFKTRYGMVS-NPFVTTNGLYNGTPDGEAL 451 (468) Q Consensus 380 ---~~~dY~~vG~KG~~~~d~glfyaPYv~l~~~~~---~dp~s~qP~~g~~tRY~l~~-nP~~~~~~~~~~~~~~~~~ 451 (468) -++.++++|-.+....+ .+++........ ..=.+-|=.+=...|+++.+ +| ..=...++..| T Consensus 298 i~~gdfs~~~i~~~~~i~i~----~~~ea~~~~~~g~~~~~f~~~~~~iR~~~~~d~~v~~~------~a~~~lt~~~~ 366 (366) T protein:vir:57 298 IYFCDFNDVVIGEDGMMKVD----FSTEATYKDADGQLVSAFARNQSLIRVVTEHDIGFRHP------EGLVLGTGVIW 366 (366) T ss_pred EEEEecceEEEEEecceEEE----EeeccccccccccchhhhhcCceeEEeeeeeCcEeecc------ccEEEEecccC Confidence 01222333333333322 111100000000 00001112233344555544 23 00012333444 No 98 >protein:vir:78223 Length: 333 # NCBI annotation: Putative major head protein # Family: family:all:966 # MgeID: mge:1849 # MgeName: Bethlehem # Cross-refs: genbank:acc:YP_001491666;genbank:gi:157786490;genbank:GeneID:5625701 Probab=45.75 E-value=0.76 Score=21.24 Aligned_cols=311 Identities=13% Similarity=0.048 Sum_probs=115.9 Q ss_pred hhhhhhhhhhhcCcccccccccccccccccccccccce-ehhhhHHhhhhhhhhheeeeecCCccceeeeeeeeeecCCC Q lcl|Aclame:pro 47 RGMLNEVAVNSLGAGTIAPAGSALGSANTGGLAGFDPV-LISLVRRAMPNLMAYDVCGVQPMSGPTGLIFAMRSRYENQA 125 (468) Q Consensus 47 ~~~l~e~~~~~~~~~~~~~~~~i~~st~tg~i~~~~P~-Lv~l~RRa~~~LI~~DI~GVQPmTGPTGLIFAMRsrY~~qs 125 (468) -..|.|-...+.+.+... ..++.++. ..-+. .-.+++...+..+..+++-+.||++..--|.-.. .. T Consensus 1 ~a~l~el~~~~~~~~~~g------~~~~~~~~-liP~~~~~~ii~~l~~~s~l~~~~~~~~~~~~~~~~p~~~----~~- 68 (333) T protein:vir:78 1 MATLNELLPNSAGSNHQG------RLAHVPSD-LLPKEIVGPIFDKAQESSLVLRMGEQIPISYGETIIPTTV----KR- 68 (333) T ss_pred CchhHHhhhhcccccccC------ceecCCcc-ccchhHHHHHHHHHHhhchhhhhcceeeccCCceEEEEEe----CC- Confidence 222333221222211110 00111110 11111 1124455556667788899999876333222111 00 Q ss_pred CcccccccCCccccccccccccccccccCccccCCCccccccccccccccccccccccchhhhhccCCCCcchhhcceEE Q lcl|Aclame:pro 126 GEEALFNEPDTGFTGGYDASQGDYAVRTGAGVGGDSEGNNPALLNDAAPGTYEVGSKMPREDLERMGEANRLFREMSFSI 205 (468) Q Consensus 126 G~EA~fnEa~t~fSg~~~~~~~~~~~~~~~~~~~~~~gt~~~~~~~a~~~~~t~~~gm~Ta~aE~lG~~~~~f~EMaFsI 205 (468) +...|-+.+ .....++++.....+..|.+..++. T Consensus 69 --------~~a~~v~eg--------------------------------------~~~~~~e~~~~~~~~~~f~~i~l~~ 102 (333) T protein:vir:78 69 --------PEVGQVGVG--------------------------------------TSNEQREGGLKPLSGTAWDTRSVSP 102 (333) T ss_pred --------ceeEeecCc--------------------------------------ccccccccccccccccceeEEEEee Confidence 000110000 0000011111112223455555555 Q ss_pred EEEEEEeecccccccccHHHHHHHHHhcCCChhHHHHHHHHHHHHHHhhHHHHHHHhhhhhccccccc-cccccccccc- Q lcl|Aclame:pro 206 EKTSVTAQSRALKAEYTLELAQDLKAIHGLDAEQELANILSSEVLAEINREVVRRVYTVAKKGAQNNV-ANAGIFDLDV- 283 (468) Q Consensus 206 eK~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAE~ELanILStEImlEINREii~~l~~va~~~k~~~~-~~~g~~Dl~~- 283 (468) .|..+ -...|-||.+|-. .|.+++|.+.|...|...|+..+|.---.....+ ..|+ ...++..... T Consensus 103 ~kl~~-------~~~is~ell~~s~----~~~~~~i~~~la~ai~~~~d~~~l~G~g~~~~~~-~~g~~~~~~~~~~~~~ 170 (333) T protein:vir:78 103 IKLAT-------IVTVSEEFARMNP----SGLYTKLQGDLAYAIGRGIDLAVFHGKSPLTGSA-LQGIDTDNVIANTTNV 170 (333) T ss_pred EEEEE-------eehhhHHHHhcCH----HHHHHHHHHHHHHHHHHHHHHHHhcccCCCCCcc-cccccccccccccccc Confidence 55554 3457778887754 4789999999999999999998874211110000 0000 0000000000 Q ss_pred cccchhHHHHHHHHHHHHHHHHHHHHHhhcCCCccEEEEchhHHHHHHhhcccccccccccccccccccccccCceeEEE Q lcl|Aclame:pro 284 DSNGRWSVEKFKGLLFQVERDANAIAQETRRGKGNFLICSADVASALAMAGVLDYSSGLNGAGGPSIGEVDDTGNLAVGT 363 (468) Q Consensus 284 ~~~grw~~e~~k~L~~~i~~ean~i~~~T~r~~~n~~v~S~~Va~~L~~sG~~~~~~~~~~~~~~~~~~~d~t~~~~~G~ 363 (468) ...+-...-.+..+ ......+ ..-....++.+|++|+-...|.....+..+.+ ..+...+-.+.. .|+ T Consensus 171 ~~~~~~~~~~~~~i----~~~~~~~-~~~~~~~~~~~vmn~~~~~~L~~~~~~~d~~G------~~i~~~~~~~~~-~~~ 238 (333) T protein:vir:78 171 DYLQETGDPLLDRL----LDGYDLV-SANTDVEFNGWAVDPRFRAHLLRAQAYRDANG------NVDPSRINLAAQ-TGD 238 (333) T ss_pred cccccccchhHHHH----HHHHHhh-ccccccCceEEEEcchHHHHHHHHhhhcCCCC------ceeecCccccCC-Cce Confidence 00000000011122 1111111 11224667788889988887765432221110 011111111111 256 Q ss_pred ecCceEEEEcccccccC-----CcceEE--------EEEecCCcccceeEeeccchhhcccccCCcccc---ceeeeeee Q lcl|Aclame:pro 364 INGRIKVFVDPYAANLS-----DKHYYV--------IGYKGTSPYDAGLFYCPYVPLQMVRSIDPNTFQ---PKIGFKTR 427 (468) Q Consensus 364 l~g~~~vy~D~Ya~~~~-----~~dY~~--------vG~KG~~~~d~glfyaPYv~l~~~~~~dp~s~q---P~~g~~tR 427 (468) |. +++|+++.+...+. +...++ +|..++.+.+ ..+|.-.......--.-|| =.+=...| T Consensus 239 l~-G~Pv~~~~~i~~~~~~~~~~~~~~~~gD~~~~~~g~~~~~~i~----~~~~~~~~~~~~~~~~~~~~~~v~~r~~~r 313 (333) T protein:vir:78 239 VL-GLPAQFGRAVGGDLGAAVDSKTRIIGGDFSQLKFGFADEIRIK----MSDTATLTDSGSATVSMWQTNQIAILIEVT 313 (333) T ss_pred ee-ceeeEEccccCCCccccCCCccEEEEEecccEEEEEeeccEEE----EeccccccccccceeehhhcCcEEEEEEEE Confidence 65 46888774432110 111233 3333322221 1222110000000000111 11122357 Q ss_pred eeee-ecC--cccccCcccccc Q lcl|Aclame:pro 428 YGMV-SNP--FVTTNGLYNGTP 446 (468) Q Consensus 428 Y~l~-~nP--~~~~~~~~~~~~ 446 (468) ++.. .+| |+...... .| T Consensus 314 ~d~~v~~~~a~~~l~~~~--a~ 333 (333) T protein:vir:78 314 FGWLLGDKQAFVKFVDDE--QP 333 (333) T ss_pred EccEEecccceEEEeccC--CC Confidence 7643 566 44332211 12 No 99 >protein:vir:2344 Length: 397 # NCBI annotation: gp14 # Family: family:all:507 # MgeID: mge:51 # MgeName: Bxb1 # Cross-refs: genbank:acc:NP_075281;genbank:gi:12657868;genbank:GeneID:920118 Probab=45.39 E-value=0.77 Score=21.20 Aligned_cols=302 Identities=12% Similarity=0.047 Sum_probs=115.6 Q ss_pred cCccccccccccccccccccccccccee-hhhhHHhhhhhhhhheeeeecCCccceeeeeeeeeecCCCCcccccccCCc Q lcl|Aclame:pro 58 LGAGTIAPAGSALGSANTGGLAGFDPVL-ISLVRRAMPNLMAYDVCGVQPMSGPTGLIFAMRSRYENQAGEEALFNEPDT 136 (468) Q Consensus 58 ~~~~~~~~~~~i~~st~tg~i~~~~P~L-v~l~RRa~~~LI~~DI~GVQPmTGPTGLIFAMRsrY~~qsG~EA~fnEa~t 136 (468) +|.+-- ....+..+++..-.-.-|.+ -.+++++....+-.+++.+.||++++.-|. +... +.++ T Consensus 1 ~g~~~e--~~~~~~~~t~~~~g~l~~~~~~~ii~~l~~~s~i~~l~~~~~~~~~~~~ip----~~~~--~~~a------- 65 (397) T protein:vir:23 1 MGFSAD--HSQIAQTKDTMFTGYLDPVQAKDYFAEAEKTSIVQRVAQKIPMGATGIVIP----HWTG--DVSA------- 65 (397) T ss_pred CCcCHH--HHHHhhccCCCCccccchhHHHHHHHHHHhccchhhhcceeeccCCceEEE----EEcC--Ccce------- Confidence 111100 00000011110000111211 122333444556677888888887653221 1100 0000 Q ss_pred cccccccccccccccccCccccCCCccccccccccccccccccccccchhhhhccCCCCcchhhcceEEEEEEEEeeccc Q lcl|Aclame:pro 137 GFTGGYDASQGDYAVRTGAGVGGDSEGNNPALLNDAAPGTYEVGSKMPREDLERMGEANRLFREMSFSIEKTSVTAQSRA 216 (468) Q Consensus 137 ~fSg~~~~~~~~~~~~~~~~~~~~~~gt~~~~~~~a~~~~~t~~~gm~Ta~aE~lG~~~~~f~EMaFsIeK~tVtAKSRa 216 (468) .| .+ | +..+++-..+++++++..|..+ T Consensus 66 ~w----------------------------------------------v~--E-----g~~~~~s~~~f~~v~l~~~k~~ 92 (397) T protein:vir:23 66 QW----------------------------------------------IG--E-----GDMKPITKGNMTKRDVHPAKIA 92 (397) T ss_pred EE----------------------------------------------ec--C-----CccccccccceeEEEEeeEEEE Confidence 00 00 1 1234444555667777777777 Q ss_pred ccccccHHHHHHHHHhcCCChhHHHHHHHHHHHHHHhhHHHHHHHhhhhhcccccc-ccccccccccccccchhHHHHHH Q lcl|Aclame:pro 217 LKAEYTLELAQDLKAIHGLDAEQELANILSSEVLAEINREVVRRVYTVAKKGAQNN-VANAGIFDLDVDSNGRWSVEKFK 295 (468) Q Consensus 217 LKAEYT~ELAQDLkAiHGLDAE~ELanILStEImlEINREii~~l~~va~~~k~~~-~~~~g~~Dl~~~~~grw~~e~~k 295 (468) -.-.+|-||.+|-. .|.|++|.+-|...|...+|+.+|.-.-+ .+... ........... .+-...+... T Consensus 93 ~~v~iS~ell~ds~----~~l~~~i~~~l~~aia~~~d~a~l~G~gt----~~~~~~~~~~~~~~~~~--~~~~~~~~~~ 162 (397) T protein:vir:23 93 TIFVASAETVRANP----ANYLGTMRTKVATAIAMAFDNAALHGTNA----PSAFQGYLDQSNKTQSI--SPNAYQGLGV 162 (397) T ss_pred EeehhhHHHHhcch----HHHHHHHHHHHHHHHHHHHHHHHhhcccC----Ccccccccccccceeee--cccchhHHHH Confidence 77889999999863 67899999999999999999999853221 10000 00000000000 0000011111 Q ss_pred HHHHHHHHHHHHHHHhhcCCCccEEEEchhHHHHHHhh----cccccccccccccccccccccccCceeEEEecCceEEE Q lcl|Aclame:pro 296 GLLFQVERDANAIAQETRRGKGNFLICSADVASALAMA----GVLDYSSGLNGAGGPSIGEVDDTGNLAVGTINGRIKVF 371 (468) Q Consensus 296 ~L~~~i~~ean~i~~~T~r~~~n~~v~S~~Va~~L~~s----G~~~~~~~~~~~~~~~~~~~d~t~~~~~G~l~g~~~vy 371 (468) .++. .+ ..--...+-+|++++....|... |-+-+.+..... .......|+| .+++|+ T Consensus 163 ~~~~-------~l--~~~~~~~a~~vmn~~~~~~L~~lkd~~G~~i~~~~~~~~---------~~~~~~~~tl-~G~Pv~ 223 (397) T protein:vir:23 163 SGLT-------KL--VTDGKKWTHTLLDDTVEPVLNGSVDANGRPLFVESTYES---------LTTPFREGRI-LGRPTI 223 (397) T ss_pred HHHH-------hh--hhcccCCCEEEEcHHHHHHHHHhhccCCceeeccccccc---------ccccccCcee-eeeeEE Confidence 1111 11 12234557789999999999863 111111111100 0111122455 477888 Q ss_pred EcccccccC------CcceEEEEEecCCcccceeEeeccchhhcccccCCcc-----c---cceeeeeeeeee-eecC-- Q lcl|Aclame:pro 372 VDPYAANLS------DKHYYVIGYKGTSPYDAGLFYCPYVPLQMVRSIDPNT-----F---QPKIGFKTRYGM-VSNP-- 434 (468) Q Consensus 372 ~D~Ya~~~~------~~dY~~vG~KG~~~~d~glfyaPYv~l~~~~~~dp~s-----~---qP~~g~~tRY~l-~~nP-- 434 (468) ++..+.... ++..+++|..+....+- .-+..+....|+.. | |=.+=+..|++. +.+| T Consensus 224 ~s~~~~~g~~~~~~gDfs~~~i~~~~~i~i~~------~~e~~~~~~~~~~~~~~~lf~~d~v~~ra~~r~d~~v~~~~a 297 (397) T protein:vir:23 224 LSDHVAEGDVVGYAGDFSQIIWGQVGGLSFDV------TDQATLNLGSQESPNFVSLWQHNLVAVRVEAEYGLLINDVNA 297 (397) T ss_pred EeCCCCCCceEEEEeecceEEEEEEeceEEEE------eeeeeeeeccccccceeeeeeccceeEEEEeeeccceecccc Confidence 886653211 12223344443322110 00000011111110 0 111112233333 1112 Q ss_pred cccccCcc----------------------ccccchhhhhhhcccceeeeeeeecC Q lcl|Aclame:pro 435 FVTTNGLY----------------------NGTPDGEALTPNANMYYRRVQVTNLM 468 (468) Q Consensus 435 ~~~~~~~~----------------------~~~~~~~~~~~~an~y~~r~~v~~l~ 468 (468) |...+... .....+.++.+.+ ..|+.-| T Consensus 298 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~------~~~~~~~ 347 (397) T protein:vir:23 298 FVKLTFDPVLTTYALDLDGASAGNFTLSLDGKTSANIAYNAST------ATVKSAI 347 (397) T ss_pred eEEEeeccccceeeecccccCcceEEEEecCccccCcccccch------hhhHHHh Confidence 11110000 0000111111000 1111111 No 100 >protein:vir:6242 Length: 390 # NCBI annotation: gp36 # Family: family:all:21 # MgeID: mge:131 # MgeName: phi-BT1 # Cross-refs: genbank:acc:NP_813696;swissprot:trembl:q859c1;genbank:gi:29366756;interpro:IPR006444;uniprot:Q859C1;genbank:GeneID:1258897 Probab=44.84 E-value=0.79 Score=21.14 Aligned_cols=336 Identities=14% Similarity=0.054 Sum_probs=110.3 Q ss_pred CcchHHHHHhhhhh-------hCCCc----cchhcchhhh--HHHHHHHhHHHHHHhhhhhhhhhhhh---hhcCcccc- Q lcl|Aclame:pro 1 MFNAEHLQEKWSPV-------LNHGE----APAIGDRYKR--AVTSVLLENQERFLREERGMLNEVAV---NSLGAGTI- 63 (468) Q Consensus 1 ~~~~~~l~~kw~p~-------l~~~~----~~~i~~~~~~--~~~~~llenq~~~~~~~~~~l~e~~~---~~~~~~~~- 63 (468) .++-+.|+|+=.-+ ++... -++.+..+++ .-+..| +.|-+...+....+.+... ...+.... T Consensus 3 ~~~l~~l~e~r~~~~~e~~~L~~~~~~~~lt~e~~~~~~~l~~e~~~l-~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~ 81 (390) T protein:vir:62 3 ATTLSANFEARERATAELRTLTDEFAGKEMTDEAREKEERLITAVSDY-DARIKRGIEAIKAIDPVTSLLSGLQGSGSGA 81 (390) T ss_pred hhHHHHHHHHHHHHHHHHHHHHHHhhcccccHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHhhcccccccc Confidence 33333444433222 22110 0111111110 000111 1111111111111111000 00000000 Q ss_pred -----------------------ccccccccccccccccccccee-hhhhHHhh-hhhhhhheeeeecCCccceeeeeee Q lcl|Aclame:pro 64 -----------------------APAGSALGSANTGGLAGFDPVL-ISLVRRAM-PNLMAYDVCGVQPMSGPTGLIFAMR 118 (468) Q Consensus 64 -----------------------~~~~~i~~st~tg~i~~~~P~L-v~l~RRa~-~~LI~~DI~GVQPmTGPTGLIFAMR 118 (468) .........+++++-.-.-|.+ -.++.... ...+...++-|-||++...+-+... T Consensus 82 ~~~~~~~~~~~~r~~~~~~~r~~~~~~~~~~~t~~~~g~~~~~~~~~~~i~~~~~~~~~l~~~~~~~~~~~~~~~~~p~~ 161 (390) T protein:vir:62 82 QRSADVDDDATLRAGNLGEARSFEFAPEKRDGTKAGNPNVLSRTLYGQLIAQAVERSAIMRGGATTFTTSDANPLDFTVI 161 (390) T ss_pred hhhcchHHHHHHhhhhhhhhHHHHhhhhhhcccccCCCccccccchHHHHHHHHhhhhhhhhcceeeecCCCceeEEEEE Confidence 0000000011111100001111 01111111 1112334444444443332222211 Q ss_pred eeecCCCCcccccccCCccccccccccccccccccCccccCCCccccccccccccccccccccccchhhhhccCCCCcch Q lcl|Aclame:pro 119 SRYENQAGEEALFNEPDTGFTGGYDASQGDYAVRTGAGVGGDSEGNNPALLNDAAPGTYEVGSKMPREDLERMGEANRLF 198 (468) Q Consensus 119 srY~~qsG~EA~fnEa~t~fSg~~~~~~~~~~~~~~~~~~~~~~gt~~~~~~~a~~~~~t~~~gm~Ta~aE~lG~~~~~f 198 (468) . + +.+ .. -.+ | +..+ T Consensus 162 ~---~--~~~----------------------------------------------a~-------wv~--E-----~~~~ 176 (390) T protein:vir:62 162 T---G--RSS----------------------------------------------AS-------IVG--E-----TAEI 176 (390) T ss_pred c---C--Ccc----------------------------------------------ee-------eec--c-----cccc Confidence 0 0 000 00 001 1 2234 Q ss_pred hhcceEEEEEEEEeecccccccccHHHHHHHHHhcCCChhHHHHHHHHHHHHHHhhHHHHHHHhhhhhcccccccccccc Q lcl|Aclame:pro 199 REMSFSIEKTSVTAQSRALKAEYTLELAQDLKAIHGLDAEQELANILSSEVLAEINREVVRRVYTVAKKGAQNNVANAGI 278 (468) Q Consensus 199 ~EMaFsIeK~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAE~ELanILStEImlEINREii~~l~~va~~~k~~~~~~~g~ 278 (468) ++-.-++++++..+|.-+-....|-||.+|- .+|.+++|.+-|+..|..-+|..||.- -|+ ..|+ T Consensus 177 ~~~~~~f~~i~~~~~k~~~~~~iS~ell~ds----~~~l~~~i~~~l~~~i~~~~d~~~l~G------~G~-----p~Gi 241 (390) T protein:vir:62 177 PESYPATAQRSMGGFKYGFASVVSYEFATDQ----VLDLVGFLVSDAGPAIGDAMGRHFITG------TGQ-----PRGI 241 (390) T ss_pred cccccceeeeEeeeeeEEeehHHHHHHHhhh----hHHHHHHHHHHHHHHHHHHHHhhhhcc------CCc-----cccc Confidence 4445555667777777777789999999993 468899999999999999999998842 111 1122 Q ss_pred ccccccccchh-----HHHHHHHHHHHHHHHHHHHHHhhcCCCccEEEEchhHHHHHHhhcccccccccccccccccccc Q lcl|Aclame:pro 279 FDLDVDSNGRW-----SVEKFKGLLFQVERDANAIAQETRRGKGNFLICSADVASALAMAGVLDYSSGLNGAGGPSIGEV 353 (468) Q Consensus 279 ~Dl~~~~~grw-----~~e~~k~L~~~i~~ean~i~~~T~r~~~n~~v~S~~Va~~L~~sG~~~~~~~~~~~~~~~~~~~ 353 (468) +........-. ..-.+..| ..+-+++...- +..+ ..||++.....|... ... + +..+++- T Consensus 242 ~~~~~~~~~~~~~~~~~~~~~~~l----~~~~~~l~~~~-~~~a-~~vmn~~~~~~L~~l---kd~-----~-g~~l~~~ 306 (390) T protein:vir:62 242 LTDASPATATFLATDTDSKVSDAL----IDLFHEVPSAY-RANA-KYVVNDLRAAQMRKL---KDA-----N-GQYLWQS 306 (390) T ss_pred cccccccccceecccccccchHHH----HHHHHhhhhhh-hcCC-EEEEchHHHHHHHHh---hcc-----C-CCeeecC Confidence 11110000000 00001122 11112221111 2333 357788888888652 211 1 1111111 Q ss_pred cccCceeEEEecCceEEEEcccccccCCcceEEEEEecCCcccceeEeeccchhhcccccCCcc--ccceeeeeeeeeee Q lcl|Aclame:pro 354 DDTGNLAVGTINGRIKVFVDPYAANLSDKHYYVIGYKGTSPYDAGLFYCPYVPLQMVRSIDPNT--FQPKIGFKTRYGMV 431 (468) Q Consensus 354 d~t~~~~~G~l~g~~~vy~D~Ya~~~~~~dY~~vG~KG~~~~d~glfyaPYv~l~~~~~~dp~s--~qP~~g~~tRY~l~ 431 (468) +-+... -++|.| ++|+++.++. .+=+++|- -.. .+...--.....+..|+-. -|=.+=+..|++.. T Consensus 307 ~~~~g~-~~~l~G-~Pv~~~~~~p----~~~i~~gd---~s~---~~i~~~~~~~v~~~~~~~~~~~~~~~~~~~r~d~~ 374 (390) T protein:vir:62 307 GLTVGA-PSLFNG-KVVETDDGMP----ADKILFAD---LSK---YRVRFAGSLRVDRSVDAKFSTDQIVYRFLQRADGL 374 (390) T ss_pred CcCCCc-cceecc-cceEEecCCC----CccEEEee---ccc---eeEEeecceEEEeeccccccCCcEEEEEEEEeCcE Confidence 111111 135654 5788775432 22233231 000 0000000111111122211 11222233455432 Q ss_pred -ecC--cccccCccccccchhhhhhhc Q lcl|Aclame:pro 432 -SNP--FVTTNGLYNGTPDGEALTPNA 455 (468) Q Consensus 432 -~nP--~~~~~~~~~~~~~~~~~~~~a 455 (468) .|| |.... +...| T Consensus 375 ~~~~~A~~~l~-----------~~~~a 390 (390) T protein:vir:62 375 LVDARGAKVLT-----------VTPGA 390 (390) T ss_pred eechhheEEEE-----------eecCC Confidence 233 21110 00111 No 101 >protein:vir:94424 Length: 387 # NCBI annotation: ORF010 # Family: family:all:658 # MgeID: mge:1506 # MgeName: 47 # Cross-refs: genbank:acc:YP_240005;genbank:gi:66395666;genbank:GeneID:5133084 Probab=40.18 E-value=0.98 Score=20.62 Aligned_cols=323 Identities=13% Similarity=0.107 Sum_probs=118.7 Q ss_pred CcchHHHHHhhhhhhCCCccchhcchh-----------------hhHHHHHH------HhHHHHHHhhhhhhhhhhhhhh Q lcl|Aclame:pro 1 MFNAEHLQEKWSPVLNHGEAPAIGDRY-----------------KRAVTSVL------LENQERFLREERGMLNEVAVNS 57 (468) Q Consensus 1 ~~~~~~l~~kw~p~l~~~~~~~i~~~~-----------------~~~~~~~l------lenq~~~~~~~~~~l~e~~~~~ 57 (468) |=..++|+++|.-+.+. +-++.+.. ++.+ ..| |++|-+.+.++.....+..... T Consensus 1 Mk~l~el~~~~~~~~~~--~~~~~~el~e~~~~~~~~~eei~~~~~~~-~~l~~~~~~l~~~~~~~e~~~~~~~~~~~~~ 77 (387) T protein:vir:94 1 MPTLYELKQSLGMIGQQ--LKNKNDELSQKATDPNIDMEDIKQLETEK-AGLQQRFNIVERQVQDIEEKEKAKVKDKGEA 77 (387) T ss_pred CchHHHHHHHHHHHHHH--HHHHHHHHHHHHhccCcCHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHhhhhhcccc Confidence 55567888777776543 11111111 1110 111 1111111100000000000000 Q ss_pred cC-----------------c----cc--------ccccccccccccccccccccceehh------hhHHhhhhhhhhhee Q lcl|Aclame:pro 58 LG-----------------A----GT--------IAPAGSALGSANTGGLAGFDPVLIS------LVRRAMPNLMAYDVC 102 (468) Q Consensus 58 ~~-----------------~----~~--------~~~~~~i~~st~tg~i~~~~P~Lv~------l~RRa~~~LI~~DI~ 102 (468) .. + .. ......+.+++.++ + ..||+ ++++....-.-.+++ T Consensus 78 ~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~~~~~a~~~~~~~~----g-G~lIP~~~~~~Ii~~~~~~~~l~~~~ 152 (387) T protein:vir:94 78 YQSLSDNEKMVKAKAEFYRHAILPNEFEKPSMEAQRLLHALPTGNDSG----G-DKLLPKTLSKEIVSEPFAKNQLREKA 152 (387) T ss_pred CCCCchhHHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHhhhccCCCCC----C-ceeechhHHHHHHHHHHhhchhhhhc Confidence 00 0 00 00000111222211 1 12222 222232333446788 Q ss_pred eeecCCccceeeeeeeeeecCCCCcccccccCCccccccccccccccccccCccccCCCccccccccccccccccccccc Q lcl|Aclame:pro 103 GVQPMSGPTGLIFAMRSRYENQAGEEALFNEPDTGFTGGYDASQGDYAVRTGAGVGGDSEGNNPALLNDAAPGTYEVGSK 182 (468) Q Consensus 103 GVQPmTGPTGLIFAMRsrY~~qsG~EA~fnEa~t~fSg~~~~~~~~~~~~~~~~~~~~~~gt~~~~~~~a~~~~~t~~~g 182 (468) .|.|+++.+.- |-.+... . ..| T Consensus 153 ~~~~~~~~~~p----~~~~~~~---~-------a~~-------------------------------------------- 174 (387) T protein:vir:94 153 RLTNIKGLEIP----RVSYTLD---D-------DDF-------------------------------------------- 174 (387) T ss_pred eeeecCCceee----eeeccCC---c-------ccc-------------------------------------------- Confidence 88877653321 1001000 0 000 Q ss_pred cchhhhhccCCCCcchhhcceEEEEEEEEeecccccccccHHHHHHHHHhcCCChhHHHHHHHHHHHHHHhhHHHHHHHh Q lcl|Aclame:pro 183 MPREDLERMGEANRLFREMSFSIEKTSVTAQSRALKAEYTLELAQDLKAIHGLDAEQELANILSSEVLAEINREVVRRVY 262 (468) Q Consensus 183 m~Ta~aE~lG~~~~~f~EMaFsIeK~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAE~ELanILStEImlEINREii~~l~ 262 (468) .++++...++...|.+..| .+|.-+-...+|-||.+|- ..|.|++|.+-|+..|..-.|..++-. T Consensus 175 --v~Eg~~~~~~~~~f~~v~l-------~~~k~~~~i~iS~ell~ds----~~~l~~~i~~~la~~~~~~e~~~~~~~-- 239 (387) T protein:vir:94 175 --ITDVETAKELKAKGDTVKF-------TTNKFKVFAAISDTVIHGS----DVDLVNWVENALQSGLAAKERKDALAV-- 239 (387) T ss_pred --ccccccccccccccceeee-------chheeeeechhhHHHHhhh----HHHHHHHHHHHHHHHHHHHHHHhHhhc-- Confidence 0011111111223444444 4444444578999999985 356788888888888876555555422 Q ss_pred hhhhccccccc-cccccccccccccchhHHHHHHHHHHHHHHHHHHHHHhhcCCCccEEEEchhHHHHHHhhcccccccc Q lcl|Aclame:pro 263 TVAKKGAQNNV-ANAGIFDLDVDSNGRWSVEKFKGLLFQVERDANAIAQETRRGKGNFLICSADVASALAMAGVLDYSSG 341 (468) Q Consensus 263 ~va~~~k~~~~-~~~g~~Dl~~~~~grw~~e~~k~L~~~i~~ean~i~~~T~r~~~n~~v~S~~Va~~L~~sG~~~~~~~ 341 (468) -...+...++ .+.++.-.. .+. .......|++.+.. --|..+.|++-+...+.++.. ++..- T Consensus 240 -g~g~g~~~g~~~~~~~~~~~--~~~--~~d~i~~~~~~l~~--------~y~~na~~imn~~t~~~~~~~---~~~~~- 302 (387) T protein:vir:94 240 -SPKSGLEHMSFYNGSVKEVE--GAD--MYDAIINALADLHE--------DYRDNATIYMRYADYVKIISV---LSNGT- 302 (387) T ss_pred -CCCccccceeeecccccccc--ccc--hHHHHHHHHhccCh--------hhhcCCEEEEechHHHHHHHH---HhcCC- Confidence 2222222222 222222111 111 11223333333221 123456676555554554432 32110 Q ss_pred cccccccccccccccCceeEEEecCceEEEEcccccccCCcceEEEEEecCCcccceeEeeccchhhcccccCCccccce Q lcl|Aclame:pro 342 LNGAGGPSIGEVDDTGNLAVGTINGRIKVFVDPYAANLSDKHYYVIGYKGTSPYDAGLFYCPYVPLQMVRSIDPNTFQPK 421 (468) Q Consensus 342 ~~~~~~~~~~~~d~t~~~~~G~l~g~~~vy~D~Ya~~~~~~dY~~vG~KG~~~~d~glfyaPYv~l~~~~~~dp~s~qP~ 421 (468) .+ .+ ...+ ++|. +++||+.-++.. +++| +- +-||.=|-...+.+..+..+.+-. T Consensus 303 ----~~--~~--~~~~----~~ll-G~PV~~~~~~~~------~~~G---Df----~~~~~~~~~~~~~~~~~~~~~~~~ 356 (387) T protein:vir:94 303 ----TN--FF--DTPA----EKVF-GKPVVFTDAAVK------PIVG---DF----NYFGINYDGTTYDTDKDVKKGEYL 356 (387) T ss_pred ----Cc--cc--ccCC----cccc-ccceEEecCCCc------eeee---ch----hhhhhhhhhhhheecccccCCceE Confidence 00 00 1111 3565 468887755431 3444 21 112222211112222233333333 Q ss_pred eeeeeeeeee-ecC--cc--cccCccccccc Q lcl|Aclame:pro 422 IGFKTRYGMV-SNP--FV--TTNGLYNGTPD 447 (468) Q Consensus 422 ~g~~tRY~l~-~nP--~~--~~~~~~~~~~~ 447 (468) +-...|++.. ++| |. +..+..+..|. T Consensus 357 ~~~~~r~Dg~v~~~~A~~~l~~ka~~~~~~~ 387 (387) T protein:vir:94 357 FVLTAWYDQQRTLDSAFRIAKAKENTGPLPS 387 (387) T ss_pred EEEEEEeCcEeechhheEEEEeecCCCCCCC Confidence 3334477643 445 32 22333333444 No 102 >protein:vir:2685 Length: 387 # NCBI annotation: hypothetical protein # Family: family:all:658 # MgeID: mge:57 # MgeName: phiSLT # Cross-refs: genbank:acc:NP_075504;genbank:gi:12719433;genbank:GeneID:920169 Probab=40.18 E-value=0.98 Score=20.62 Aligned_cols=323 Identities=13% Similarity=0.107 Sum_probs=118.7 Q ss_pred CcchHHHHHhhhhhhCCCccchhcchh-----------------hhHHHHHH------HhHHHHHHhhhhhhhhhhhhhh Q lcl|Aclame:pro 1 MFNAEHLQEKWSPVLNHGEAPAIGDRY-----------------KRAVTSVL------LENQERFLREERGMLNEVAVNS 57 (468) Q Consensus 1 ~~~~~~l~~kw~p~l~~~~~~~i~~~~-----------------~~~~~~~l------lenq~~~~~~~~~~l~e~~~~~ 57 (468) |=..++|+++|.-+.+. +-++.+.. ++.+ ..| |++|-+.+.++.....+..... T Consensus 1 Mk~l~el~~~~~~~~~~--~~~~~~el~e~~~~~~~~~eei~~~~~~~-~~l~~~~~~l~~~~~~~e~~~~~~~~~~~~~ 77 (387) T protein:vir:26 1 MPTLYELKQSLGMIGQQ--LKNKNDELSQKATDPNIDMEDIKQLETEK-AGLQQRFNIVERQVQDIEEKEKAKVKDKGEA 77 (387) T ss_pred CchHHHHHHHHHHHHHH--HHHHHHHHHHHHhccCcCHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHhhhhhcccc Confidence 55567888777776543 11111111 1110 111 1111111100000000000000 Q ss_pred cC-----------------c----cc--------ccccccccccccccccccccceehh------hhHHhhhhhhhhhee Q lcl|Aclame:pro 58 LG-----------------A----GT--------IAPAGSALGSANTGGLAGFDPVLIS------LVRRAMPNLMAYDVC 102 (468) Q Consensus 58 ~~-----------------~----~~--------~~~~~~i~~st~tg~i~~~~P~Lv~------l~RRa~~~LI~~DI~ 102 (468) .. + .. ......+.+++.++ + ..||+ ++++....-.-.+++ T Consensus 78 ~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~~~~~a~~~~~~~~----g-G~lIP~~~~~~Ii~~~~~~~~l~~~~ 152 (387) T protein:vir:26 78 YQSLSDNEKMVKAKAEFYRHAILPNEFEKPSMEAQRLLHALPTGNDSG----G-DKLLPKTLSKEIVSEPFAKNQLREKA 152 (387) T ss_pred CCCCchhHHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHhhhccCCCCC----C-ceeechhHHHHHHHHHHhhchhhhhc Confidence 00 0 00 00000111222211 1 12222 222232333446788 Q ss_pred eeecCCccceeeeeeeeeecCCCCcccccccCCccccccccccccccccccCccccCCCccccccccccccccccccccc Q lcl|Aclame:pro 103 GVQPMSGPTGLIFAMRSRYENQAGEEALFNEPDTGFTGGYDASQGDYAVRTGAGVGGDSEGNNPALLNDAAPGTYEVGSK 182 (468) Q Consensus 103 GVQPmTGPTGLIFAMRsrY~~qsG~EA~fnEa~t~fSg~~~~~~~~~~~~~~~~~~~~~~gt~~~~~~~a~~~~~t~~~g 182 (468) .|.|+++.+.- |-.+... . ..| T Consensus 153 ~~~~~~~~~~p----~~~~~~~---~-------a~~-------------------------------------------- 174 (387) T protein:vir:26 153 RLTNIKGLEIP----RVSYTLD---D-------DDF-------------------------------------------- 174 (387) T ss_pred eeeecCCceee----eeeccCC---c-------ccc-------------------------------------------- Confidence 88877653321 1001000 0 000 Q ss_pred cchhhhhccCCCCcchhhcceEEEEEEEEeecccccccccHHHHHHHHHhcCCChhHHHHHHHHHHHHHHhhHHHHHHHh Q lcl|Aclame:pro 183 MPREDLERMGEANRLFREMSFSIEKTSVTAQSRALKAEYTLELAQDLKAIHGLDAEQELANILSSEVLAEINREVVRRVY 262 (468) Q Consensus 183 m~Ta~aE~lG~~~~~f~EMaFsIeK~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAE~ELanILStEImlEINREii~~l~ 262 (468) .++++...++...|.+..| .+|.-+-...+|-||.+|- ..|.|++|.+-|+..|..-.|..++-. T Consensus 175 --v~Eg~~~~~~~~~f~~v~l-------~~~k~~~~i~iS~ell~ds----~~~l~~~i~~~la~~~~~~e~~~~~~~-- 239 (387) T protein:vir:26 175 --ITDVETAKELKAKGDTVKF-------TTNKFKVFAAISDTVIHGS----DVDLVNWVENALQSGLAAKERKDALAV-- 239 (387) T ss_pred --ccccccccccccccceeee-------chheeeeechhhHHHHhhh----HHHHHHHHHHHHHHHHHHHHHHhHhhc-- Confidence 0011111111223444444 4444444578999999985 356788888888888876555555422 Q ss_pred hhhhccccccc-cccccccccccccchhHHHHHHHHHHHHHHHHHHHHHhhcCCCccEEEEchhHHHHHHhhcccccccc Q lcl|Aclame:pro 263 TVAKKGAQNNV-ANAGIFDLDVDSNGRWSVEKFKGLLFQVERDANAIAQETRRGKGNFLICSADVASALAMAGVLDYSSG 341 (468) Q Consensus 263 ~va~~~k~~~~-~~~g~~Dl~~~~~grw~~e~~k~L~~~i~~ean~i~~~T~r~~~n~~v~S~~Va~~L~~sG~~~~~~~ 341 (468) -...+...++ .+.++.-.. .+. .......|++.+.. --|..+.|++-+...+.++.. ++..- T Consensus 240 -g~g~g~~~g~~~~~~~~~~~--~~~--~~d~i~~~~~~l~~--------~y~~na~~imn~~t~~~~~~~---~~~~~- 302 (387) T protein:vir:26 240 -SPKSGLEHMSFYNGSVKEVE--GAD--MYDAIINALADLHE--------DYRDNATIYMRYADYVKIISV---LSNGT- 302 (387) T ss_pred -CCCccccceeeecccccccc--ccc--hHHHHHHHHhccCh--------hhhcCCEEEEechHHHHHHHH---HhcCC- Confidence 2222222222 222222111 111 11223333333221 123456676555554554432 32110 Q ss_pred cccccccccccccccCceeEEEecCceEEEEcccccccCCcceEEEEEecCCcccceeEeeccchhhcccccCCccccce Q lcl|Aclame:pro 342 LNGAGGPSIGEVDDTGNLAVGTINGRIKVFVDPYAANLSDKHYYVIGYKGTSPYDAGLFYCPYVPLQMVRSIDPNTFQPK 421 (468) Q Consensus 342 ~~~~~~~~~~~~d~t~~~~~G~l~g~~~vy~D~Ya~~~~~~dY~~vG~KG~~~~d~glfyaPYv~l~~~~~~dp~s~qP~ 421 (468) .+ .+ ...+ ++|. +++||+.-++.. +++| +- +-||.=|-...+.+..+..+.+-. T Consensus 303 ----~~--~~--~~~~----~~ll-G~PV~~~~~~~~------~~~G---Df----~~~~~~~~~~~~~~~~~~~~~~~~ 356 (387) T protein:vir:26 303 ----TN--FF--DTPA----EKVF-GKPVVFTDAAVK------PIVG---DF----NYFGINYDGTTYDTDKDVKKGEYL 356 (387) T ss_pred ----Cc--cc--ccCC----cccc-ccceEEecCCCc------eeee---ch----hhhhhhhhhhhheecccccCCceE Confidence 00 00 1111 3565 468887755431 3444 21 112222211112222233333333 Q ss_pred eeeeeeeeee-ecC--cc--cccCccccccc Q lcl|Aclame:pro 422 IGFKTRYGMV-SNP--FV--TTNGLYNGTPD 447 (468) Q Consensus 422 ~g~~tRY~l~-~nP--~~--~~~~~~~~~~~ 447 (468) +-...|++.. ++| |. +..+..+..|. T Consensus 357 ~~~~~r~Dg~v~~~~A~~~l~~ka~~~~~~~ 387 (387) T protein:vir:26 357 FVLTAWYDQQRTLDSAFRIAKAKENTGPLPS 387 (387) T ss_pred EEEEEEeCcEeechhheEEEEeecCCCCCCC Confidence 3334477643 445 32 22333333444 No 103 >protein:vir:96978 Length: 387 # NCBI annotation: ORF009 # Family: family:all:658 # MgeID: mge:1643 # MgeName: 42e # Cross-refs: genbank:acc:YP_239859;genbank:gi:66395517;genbank:GeneID:5133011 Probab=40.18 E-value=0.98 Score=20.62 Aligned_cols=323 Identities=13% Similarity=0.107 Sum_probs=118.7 Q ss_pred CcchHHHHHhhhhhhCCCccchhcchh-----------------hhHHHHHH------HhHHHHHHhhhhhhhhhhhhhh Q lcl|Aclame:pro 1 MFNAEHLQEKWSPVLNHGEAPAIGDRY-----------------KRAVTSVL------LENQERFLREERGMLNEVAVNS 57 (468) Q Consensus 1 ~~~~~~l~~kw~p~l~~~~~~~i~~~~-----------------~~~~~~~l------lenq~~~~~~~~~~l~e~~~~~ 57 (468) |=..++|+++|.-+.+. +-++.+.. ++.+ ..| |++|-+.+.++.....+..... T Consensus 1 Mk~l~el~~~~~~~~~~--~~~~~~el~e~~~~~~~~~eei~~~~~~~-~~l~~~~~~l~~~~~~~e~~~~~~~~~~~~~ 77 (387) T protein:vir:96 1 MPTLYELKQSLGMIGQQ--LKNKNDELSQKATDPNIDMEDIKQLETEK-AGLQQRFNIVERQVQDIEEKEKAKVKDKGEA 77 (387) T ss_pred CchHHHHHHHHHHHHHH--HHHHHHHHHHHHhccCcCHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHhhhhhcccc Confidence 55567888777776543 11111111 1110 111 1111111100000000000000 Q ss_pred cC-----------------c----cc--------ccccccccccccccccccccceehh------hhHHhhhhhhhhhee Q lcl|Aclame:pro 58 LG-----------------A----GT--------IAPAGSALGSANTGGLAGFDPVLIS------LVRRAMPNLMAYDVC 102 (468) Q Consensus 58 ~~-----------------~----~~--------~~~~~~i~~st~tg~i~~~~P~Lv~------l~RRa~~~LI~~DI~ 102 (468) .. + .. ......+.+++.++ + ..||+ ++++....-.-.+++ T Consensus 78 ~~~~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~~~~~a~~~~~~~~----g-G~lIP~~~~~~Ii~~~~~~~~l~~~~ 152 (387) T protein:vir:96 78 YQSLSDNEKMVKAKAEFYRHAILPNEFEKPSMEAQRLLHALPTGNDSG----G-DKLLPKTLSKEIVSEPFAKNQLREKA 152 (387) T ss_pred CCCCchhHHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHhhhccCCCCC----C-ceeechhHHHHHHHHHHhhchhhhhc Confidence 00 0 00 00000111222211 1 12222 222232333446788 Q ss_pred eeecCCccceeeeeeeeeecCCCCcccccccCCccccccccccccccccccCccccCCCccccccccccccccccccccc Q lcl|Aclame:pro 103 GVQPMSGPTGLIFAMRSRYENQAGEEALFNEPDTGFTGGYDASQGDYAVRTGAGVGGDSEGNNPALLNDAAPGTYEVGSK 182 (468) Q Consensus 103 GVQPmTGPTGLIFAMRsrY~~qsG~EA~fnEa~t~fSg~~~~~~~~~~~~~~~~~~~~~~gt~~~~~~~a~~~~~t~~~g 182 (468) .|.|+++.+.- |-.+... . ..| T Consensus 153 ~~~~~~~~~~p----~~~~~~~---~-------a~~-------------------------------------------- 174 (387) T protein:vir:96 153 RLTNIKGLEIP----RVSYTLD---D-------DDF-------------------------------------------- 174 (387) T ss_pred eeeecCCceee----eeeccCC---c-------ccc-------------------------------------------- Confidence 88877653321 1001000 0 000 Q ss_pred cchhhhhccCCCCcchhhcceEEEEEEEEeecccccccccHHHHHHHHHhcCCChhHHHHHHHHHHHHHHhhHHHHHHHh Q lcl|Aclame:pro 183 MPREDLERMGEANRLFREMSFSIEKTSVTAQSRALKAEYTLELAQDLKAIHGLDAEQELANILSSEVLAEINREVVRRVY 262 (468) Q Consensus 183 m~Ta~aE~lG~~~~~f~EMaFsIeK~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAE~ELanILStEImlEINREii~~l~ 262 (468) .++++...++...|.+..| .+|.-+-...+|-||.+|- ..|.|++|.+-|+..|..-.|..++-. T Consensus 175 --v~Eg~~~~~~~~~f~~v~l-------~~~k~~~~i~iS~ell~ds----~~~l~~~i~~~la~~~~~~e~~~~~~~-- 239 (387) T protein:vir:96 175 --ITDVETAKELKAKGDTVKF-------TTNKFKVFAAISDTVIHGS----DVDLVNWVENALQSGLAAKERKDALAV-- 239 (387) T ss_pred --ccccccccccccccceeee-------chheeeeechhhHHHHhhh----HHHHHHHHHHHHHHHHHHHHHHhHhhc-- Confidence 0011111111223444444 4444444578999999985 356788888888888876555555422 Q ss_pred hhhhccccccc-cccccccccccccchhHHHHHHHHHHHHHHHHHHHHHhhcCCCccEEEEchhHHHHHHhhcccccccc Q lcl|Aclame:pro 263 TVAKKGAQNNV-ANAGIFDLDVDSNGRWSVEKFKGLLFQVERDANAIAQETRRGKGNFLICSADVASALAMAGVLDYSSG 341 (468) Q Consensus 263 ~va~~~k~~~~-~~~g~~Dl~~~~~grw~~e~~k~L~~~i~~ean~i~~~T~r~~~n~~v~S~~Va~~L~~sG~~~~~~~ 341 (468) -...+...++ .+.++.-.. .+. .......|++.+.. --|..+.|++-+...+.++.. ++..- T Consensus 240 -g~g~g~~~g~~~~~~~~~~~--~~~--~~d~i~~~~~~l~~--------~y~~na~~imn~~t~~~~~~~---~~~~~- 302 (387) T protein:vir:96 240 -SPKSGLEHMSFYNGSVKEVE--GAD--MYDAIINALADLHE--------DYRDNATIYMRYADYVKIISV---LSNGT- 302 (387) T ss_pred -CCCccccceeeecccccccc--ccc--hHHHHHHHHhccCh--------hhhcCCEEEEechHHHHHHHH---HhcCC- Confidence 2222222222 222222111 111 11223333333221 123456676555554554432 32110 Q ss_pred cccccccccccccccCceeEEEecCceEEEEcccccccCCcceEEEEEecCCcccceeEeeccchhhcccccCCccccce Q lcl|Aclame:pro 342 LNGAGGPSIGEVDDTGNLAVGTINGRIKVFVDPYAANLSDKHYYVIGYKGTSPYDAGLFYCPYVPLQMVRSIDPNTFQPK 421 (468) Q Consensus 342 ~~~~~~~~~~~~d~t~~~~~G~l~g~~~vy~D~Ya~~~~~~dY~~vG~KG~~~~d~glfyaPYv~l~~~~~~dp~s~qP~ 421 (468) .+ .+ ...+ ++|. +++||+.-++.. +++| +- +-||.=|-...+.+..+..+.+-. T Consensus 303 ----~~--~~--~~~~----~~ll-G~PV~~~~~~~~------~~~G---Df----~~~~~~~~~~~~~~~~~~~~~~~~ 356 (387) T protein:vir:96 303 ----TN--FF--DTPA----EKVF-GKPVVFTDAAVK------PIVG---DF----NYFGINYDGTTYDTDKDVKKGEYL 356 (387) T ss_pred ----Cc--cc--ccCC----cccc-ccceEEecCCCc------eeee---ch----hhhhhhhhhhhheecccccCCceE Confidence 00 00 1111 3565 468887755431 3444 21 112222211112222233333333 Q ss_pred eeeeeeeeee-ecC--cc--cccCccccccc Q lcl|Aclame:pro 422 IGFKTRYGMV-SNP--FV--TTNGLYNGTPD 447 (468) Q Consensus 422 ~g~~tRY~l~-~nP--~~--~~~~~~~~~~~ 447 (468) +-...|++.. ++| |. +..+..+..|. T Consensus 357 ~~~~~r~Dg~v~~~~A~~~l~~ka~~~~~~~ 387 (387) T protein:vir:96 357 FVLTAWYDQQRTLDSAFRIAKAKENTGPLPS 387 (387) T ss_pred EEEEEEeCcEeechhheEEEEeecCCCCCCC Confidence 3334477643 445 32 22333333444 No 104 >protein:vir:94711 Length: 347 # NCBI annotation: capsid # Family: family:all:975 # MgeID: mge:1528 # MgeName: K1F # Cross-refs: genbank:acc:YP_338120;genbank:gi:77118198;genbank:GeneID:3707734 Probab=39.24 E-value=1 Score=20.52 Aligned_cols=297 Identities=15% Similarity=0.078 Sum_probs=123.0 Q ss_pred CCccceeeeeeeeeecCCCCc------ccccccCCccccccccccccccccccCccccCCCccccccccccc---ccccc Q lcl|Aclame:pro 107 MSGPTGLIFAMRSRYENQAGE------EALFNEPDTGFTGGYDASQGDYAVRTGAGVGGDSEGNNPALLNDA---APGTY 177 (468) Q Consensus 107 mTGPTGLIFAMRsrY~~qsG~------EA~fnEa~t~fSg~~~~~~~~~~~~~~~~~~~~~~gt~~~~~~~a---~~~~~ 177 (468) |..-++.--+.|.-+++.+++ |.|-.|..+.|--..- .......- .. .+.+..-.+.. ....+ T Consensus 1 m~~~~~~~~~t~~g~~~~~~d~~al~ik~f~~eV~~~f~~~s~-~~~~~~~r---~i----~~G~sv~i~~iG~~tv~~~ 72 (347) T protein:vir:94 1 MANVPGQKIGTDQGKGKSSSDALALFLKVFAGEVLTAFTRRSV-TADKHIVR---TI----QNGKSAQFPVMGRTSGVYL 72 (347) T ss_pred CCCCCccccccccccCCccccHHHHHHHHHhHHHHHHHHHHHh-hhcccccc---cc----cccceEEEecccceeeeee Confidence 666566555555555444444 2333455555431110 00000000 00 00111111100 01111 Q ss_pred ccccccchhhhhcc-CCC-CcchhhcceEEEEEEEEeecccccccccHHHHHHHHHhcCCChhHHHHHHHHHHHHHHhhH Q lcl|Aclame:pro 178 EVGSKMPREDLERM-GEA-NRLFREMSFSIEKTSVTAQSRALKAEYTLELAQDLKAIHGLDAEQELANILSSEVLAEINR 255 (468) Q Consensus 178 t~~~gm~Ta~aE~l-G~~-~~~f~EMaFsIeK~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAE~ELanILStEImlEINR 255 (468) +. ++.+ ++. ...=.|+.++||++.+ +..-+.-.-|.++ | .|-..|++.-....++.++.+ T Consensus 73 t~--------G~~l~~~~~~~~~~e~~itID~~~~--------~~~~VddiD~~q~-~-~D~~~~~~~~~g~aLa~~~D~ 134 (347) T protein:vir:94 73 AP--------GERLSDKRKGIKHTEKVITIDGLLT--------ADVMIFDIEDAMN-H-YDVAGEYSNQLGEALAIAADG 134 (347) T ss_pred cC--------CCCcCCCCCCCCcceEEEEecchhh--------hhHHhhhHHHHhc-C-cchHHHHHHHHHHHHHHHHHH Confidence 11 1222 211 1234667788887532 3334444444444 3 788899999999999999999 Q ss_pred HHHHHHhhhhhc-cc----cccccccccccccccccchhHHHHHHHHHHHHHHHHHHH-HHhhcCCCccEEEEchhHHHH Q lcl|Aclame:pro 256 EVVRRVYTVAKK-GA----QNNVANAGIFDLDVDSNGRWSVEKFKGLLFQVERDANAI-AQETRRGKGNFLICSADVASA 329 (468) Q Consensus 256 Eii~~l~~va~~-~k----~~~~~~~g~~Dl~~~~~grw~~e~~k~L~~~i~~ean~i-~~~T~r~~~n~~v~S~~Va~~ 329 (468) -|+..+..++.. .. ..|....-+++.....+..-.......+ +....+|.+. -.+-.--.|-|+|++|+..+. T Consensus 135 ~i~~~~~~~aa~~~~~~~~~~g~~~~s~~~~~~~~~~~~~~~~~~~~-~~~i~~a~~~Lde~~VP~~~R~~vv~P~~~~~ 213 (347) T protein:vir:94 135 AVLAEMAILCNLPAASNENIAGLGTASVLEVGKKADLDTPAKLGEAI-IGQLTIARAKLTSNYVPAGDRYFYTTPDNYSA 213 (347) T ss_pred HHHHHHHHHhccccccccccCCCcccceeeccccccccchhhhHHHH-HHHHHHHHHHHhhcCCCCCCcEEEeCHHHHHH Confidence 999888654322 11 1121111122221111111011111222 3333333332 112222357899999999998 Q ss_pred HHhhcccccccccccccccccccccccCceeEEEecCceEEEEcccccccCCc----------ceEEEE----------- Q lcl|Aclame:pro 330 LAMAGVLDYSSGLNGAGGPSIGEVDDTGNLAVGTINGRIKVFVDPYAANLSDK----------HYYVIG----------- 388 (468) Q Consensus 330 L~~sG~~~~~~~~~~~~~~~~~~~d~t~~~~~G~l~g~~~vy~D~Ya~~~~~~----------dY~~vG----------- 388 (468) |-.. ..+....- ...+ .-..| .+|++ .+++||.- |+-|. .|-++. T Consensus 214 Ll~~--~~~~~~~~---~~~~--~~~~G--~Vg~i-~G~~V~~S----n~lp~~~~t~~~~~~~~~~~aG~~~~~~~~~~ 279 (347) T protein:vir:94 214 ILAA--LMPNAANY---AALI--DPETG--NIRNV-MGFVVVEV----PHLVQGGAGETRGDDGITIASGQKHAFPATAS 279 (347) T ss_pred Hhcc--chhhhhhc---cccc--ccccc--ceEEE-eceEEEec----CcccccccccccccCcceecCcccccccccch Confidence 8543 22222211 1110 11223 46777 57888876 55553 222221 Q ss_pred --EecCCcccceeEeeccchh----h---cccccCCccccceeeeeeeeeee-ecC-cc-cccCccccccchh Q lcl|Aclame:pro 389 --YKGTSPYDAGLFYCPYVPL----Q---MVRSIDPNTFQPKIGFKTRYGMV-SNP-FV-TTNGLYNGTPDGE 449 (468) Q Consensus 389 --~KG~~~~d~glfyaPYv~l----~---~~~~~dp~s~qP~~g~~tRY~l~-~nP-~~-~~~~~~~~~~~~~ 449 (468) |+++-.-..+|||-|=--+ . .-.-.|+..|-=.|==+..||-. .+| .+ ..... .++ T Consensus 280 ~~~~~~~~~~~~l~~h~~A~~~v~~~~~~~e~~r~~~~~~d~i~~~~~~G~~~~rP~~a~~~~~~-----~A~ 347 (347) T protein:vir:94 280 SDVKVTMDNVVGLFSHRSAVGTVKLRDLALERDRDVDAQGDLIVGKYAMGHGGLRPEAAGALVFS-----PAE 347 (347) T ss_pred hhhcccccceeEEEeehhhhhhhhcccccccchhchhhHHHHhhhhhhhcCcccccceeEEEEec-----CCC Confidence 2232223367787775211 1 11122444443322222222221 223 11 00000 000 No 105 >protein:vir:94771 Length: 298 # NCBI annotation: major head protein # Family: family:all:966 # MgeID: mge:1529 # MgeName: phi LC3 # Cross-refs: genbank:acc:NP_996706;genbank:gi:45597421;genbank:GeneID:2769044 Probab=34.75 E-value=1.3 Score=20.01 Aligned_cols=274 Identities=15% Similarity=0.153 Sum_probs=104.7 Q ss_pred ccccccccccccceeh-hhhHHhhhhhhhhheeeeecCCccceeeeeeeeeecCCCCcccccccCCcccccccccccccc Q lcl|Aclame:pro 71 GSANTGGLAGFDPVLI-SLVRRAMPNLMAYDVCGVQPMSGPTGLIFAMRSRYENQAGEEALFNEPDTGFTGGYDASQGDY 149 (468) Q Consensus 71 ~st~tg~i~~~~P~Lv-~l~RRa~~~LI~~DI~GVQPmTGPTGLIFAMRsrY~~qsG~EA~fnEa~t~fSg~~~~~~~~~ 149 (468) -.+++|.+ .-|.+. .+++...++.+-.+++.+.||++...-|. +..+ +.++ .| T Consensus 1 ma~~gG~l--ip~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~~~~p----~~~~--~~~a-------~~----------- 54 (298) T protein:vir:94 1 MVLNKGTL--FDPELVTDLISKVAGKSSIARLSAQKPIPFNGEKVF----TFTM--DSEI-------DV----------- 54 (298) T ss_pred Ceeccccc--cChhHHHHHHHHHHhhchhhhhcceeeccCCceEEE----EEec--Ccce-------EE----------- Confidence 12222222 223322 34555556778888999999876322111 1100 0000 00 Q ss_pred ccccCccccCCCccccccccccccccccccccccchhhhhccCCCCcchhhcceEEEEEEEEeecccccccccHHHHHHH Q lcl|Aclame:pro 150 AVRTGAGVGGDSEGNNPALLNDAAPGTYEVGSKMPREDLERMGEANRLFREMSFSIEKTSVTAQSRALKAEYTLELAQDL 229 (468) Q Consensus 150 ~~~~~~~~~~~~~gt~~~~~~~a~~~~~t~~~gm~Ta~aE~lG~~~~~f~EMaFsIeK~tVtAKSRaLKAEYT~ELAQDL 229 (468) .++++....+...|.++.|...|..+ ....|-||.|+- T Consensus 55 -----------------------------------v~Eg~~~~~~~~~f~~v~l~~~k~~~-------~~~iS~ell~~~ 92 (298) T protein:vir:94 55 -----------------------------------VAESGKKTHGGVTLAPQTMVPIKVEY-------GARISDEFMYAS 92 (298) T ss_pred -----------------------------------eeCCccccccccceeEEEEeeeEEEE-------eeehhHHHhccC Confidence 00111111222335555555555543 467888987642 Q ss_pred HHhcCCChhHHHHHHHHHHHHHHhhHHHHHHHhhhhhccccc-cccccccc-------cccccccchhHHHHHHHHHHHH Q lcl|Aclame:pro 230 KAIHGLDAEQELANILSSEVLAEINREVVRRVYTVAKKGAQN-NVANAGIF-------DLDVDSNGRWSVEKFKGLLFQV 301 (468) Q Consensus 230 kAiHGLDAE~ELanILStEImlEINREii~~l~~va~~~k~~-~~~~~g~~-------Dl~~~~~grw~~e~~k~L~~~i 301 (468) -. -..+-+++|.+-|...|...|+.-++.-... .-+... +....+.. ......+. ..+.+..+ T Consensus 93 ~~-~~~~l~~~i~~~la~ai~~~~d~~~l~G~~~--~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~i~~~---- 163 (298) T protein:vir:94 93 DE-EKINILQAFNDGFAKKVARGIDLMAFHGVNP--RLGTASAVIGTNHFDSKVTQKVEAPRGIAD--PNGAIENA---- 163 (298) T ss_pred Cc-cHHHHHHHHHHHHHHHHHHHHHHHhhccccc--CCCccccccccccccccccccccccccccc--HHHHHHHH---- Confidence 11 0123445555555555555555555432110 001000 00000000 00000000 01122222 Q ss_pred HHHHHHHHHhhcCCCccEEEEchhHHHHHHhhcccccccccccccccccccccccCceeEEEecCceEEEEcccccc--c Q lcl|Aclame:pro 302 ERDANAIAQETRRGKGNFLICSADVASALAMAGVLDYSSGLNGAGGPSIGEVDDTGNLAVGTINGRIKVFVDPYAAN--L 379 (468) Q Consensus 302 ~~ean~i~~~T~r~~~n~~v~S~~Va~~L~~sG~~~~~~~~~~~~~~~~~~~d~t~~~~~G~l~g~~~vy~D~Ya~~--~ 379 (468) ...+ .....+...+|++|+....|... ..+ + +....+-+.++. ..|+|.| ++|+++..... . T Consensus 164 ---~~~~--~~~~~~~~~~vmn~~~~~~l~~l---kd~---~---G~~l~~~~~~~~-~~~tl~G-~PV~~~~~v~~~~~ 227 (298) T protein:vir:94 164 ---VELL--TGVDADVTGIAINPSFRSALAKQ---KDL---Q---GNALFPELKWGA-TPDTING-LPVDVNKTVSDMSL 227 (298) T ss_pred ---HHhh--hhcCCCccEEEEcHHHHHHHHHh---hcc---C---CCeeecCcccCC-CCceecc-eeeEEecccccccC Confidence 2221 11134556799999999988752 111 0 111111111211 1256754 68887743211 1 Q ss_pred CCcceEEEEEecCCcccceeEeeccchhhc--ccccCCcc-----cc-ceeee--eeeeeee-ecC--cccccCcccccc Q lcl|Aclame:pro 380 SDKHYYVIGYKGTSPYDAGLFYCPYVPLQM--VRSIDPNT-----FQ-PKIGF--KTRYGMV-SNP--FVTTNGLYNGTP 446 (468) Q Consensus 380 ~~~dY~~vG~KG~~~~d~glfyaPYv~l~~--~~~~dp~s-----~q-P~~g~--~tRY~l~-~nP--~~~~~~~~~~~~ 446 (468) ++.+.+++|- -. .++.|...-.+++ .+..||+. || =.++| ..|+|.. .+| |+.... T Consensus 228 ~~~~~~~~Gd---fs--~~~~~~~~~~~~~~~~~~~~~d~~~~~~f~~~~v~~r~~~r~~~~~~~~~a~~~l~~------ 296 (298) T protein:vir:94 228 TQRDRAIIGD---FA--NGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLGWGILDATKFARVTE------ 296 (298) T ss_pred CCccEEEEee---cc--ceEEEEEecCceEEEeecCCCcCcchhhhhcCcEEEEEEEEeccEeecccceEEEEe------ Confidence 2233333331 10 1122333333222 22223321 22 11333 4566644 444 433221 Q ss_pred chhhhhhhcc Q lcl|Aclame:pro 447 DGEALTPNAN 456 (468) Q Consensus 447 ~~~~~~~~an 456 (468) ++ T Consensus 297 --------~t 298 (298) T protein:vir:94 297 --------AN 298 (298) T ss_pred --------cC Confidence 11 No 106 >protein:vir:4456 Length: 401 # NCBI annotation: Major capsid protein precursor # Family: family:all:21 # MgeID: mge:96 # MgeName: ST64B # Cross-refs: genbank:acc:NP_700379;genbank:gi:23505451;genbank:GeneID:955658 Probab=34.75 E-value=1.3 Score=20.01 Aligned_cols=328 Identities=13% Similarity=0.155 Sum_probs=114.4 Q ss_pred CcchHHHHHhhhhhh------CCC------------------------ccchhcchhhhHHHHHHHhHHHHHHhhhhhhh Q lcl|Aclame:pro 1 MFNAEHLQEKWSPVL------NHG------------------------EAPAIGDRYKRAVTSVLLENQERFLREERGML 50 (468) Q Consensus 1 ~~~~~~l~~kw~p~l------~~~------------------------~~~~i~~~~~~~~~~~llenq~~~~~~~~~~l 50 (468) =-..+++.+....+- +.+ ........+|+.+..-|...++..+++. T Consensus 27 ~~~~~~~e~~~~~l~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~a~~~~lr~~~~~~~~~~---- 102 (401) T protein:vir:44 27 DKRVEAIEQEKGKLAGQVETLNGKLSELENLKSDLEKELLELKRPARGAQNKVAAEHKDAFVGFLRKGREDGLRDL---- 102 (401) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccccchhHHHHHHHHHHHhhhhhhhhHHH---- Confidence 000011111111110 000 0011111122221111111111111100 Q ss_pred hhhhhhhcCccccccccccccccccccc---ccccceehhhhHHhhhhhhhhheeeeecCCccceeeeeeeeeecCCCCc Q lcl|Aclame:pro 51 NEVAVNSLGAGTIAPAGSALGSANTGGL---AGFDPVLISLVRRAMPNLMAYDVCGVQPMSGPTGLIFAMRSRYENQAGE 127 (468) Q Consensus 51 ~e~~~~~~~~~~~~~~~~i~~st~tg~i---~~~~P~Lv~l~RRa~~~LI~~DI~GVQPmTGPTGLIFAMRsrY~~qsG~ 127 (468) |.. .+ ...+.+.|++ ..+.+.++.+.| ...+-.+++.+.||++++..+.-.. .+. T Consensus 103 -e~~--a~----------~~~~~~~GG~~iP~~~~~~ii~~~~---~~~~l~~~~~~~~~~~~~~~~~~~~------~~~ 160 (401) T protein:vir:44 103 -ERK--AL----------QVGTDEDGGYAVPEELDRSILSLLK---DEVVMRQEATVITVGGSDYKKLVNL------GGT 160 (401) T ss_pred -HHH--Hh----------hcCCCCCCceeccHhHHHHHHHHHH---hhhhhhhhceeeecCCCceEEEEec------CCc Confidence 000 00 0001112221 244555666666 3335577899999998864432211 000 Q ss_pred ccccccCCccccccccccccccccccCccccCCCccccccccccccccccccccccchhhhhccCCC-CcchhhcceEEE Q lcl|Aclame:pro 128 EALFNEPDTGFTGGYDASQGDYAVRTGAGVGGDSEGNNPALLNDAAPGTYEVGSKMPREDLERMGEA-NRLFREMSFSIE 206 (468) Q Consensus 128 EA~fnEa~t~fSg~~~~~~~~~~~~~~~~~~~~~~gt~~~~~~~a~~~~~t~~~gm~Ta~aE~lG~~-~~~f~EMaFsIe 206 (468) . ..| .++.+..... ...|.+..|.+. T Consensus 161 ~-------a~w----------------------------------------------v~E~~~~~~~~~~~~~~v~~~~~ 187 (401) T protein:vir:44 161 A-------SGW----------------------------------------------VGETDTRSQTATSRLGLIEPFMG 187 (401) T ss_pred c-------cee----------------------------------------------eccccccCccccccceeeeeehh Confidence 0 000 0000111111 123555555555 Q ss_pred EEEEEeecccccccccHHHHHHHHHhcCCChhHHHHHHHHHHHHHHhhHHHHHH--------Hhhhhhcccccccccccc Q lcl|Aclame:pro 207 KTSVTAQSRALKAEYTLELAQDLKAIHGLDAEQELANILSSEVLAEINREVVRR--------VYTVAKKGAQNNVANAGI 278 (468) Q Consensus 207 K~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAE~ELanILStEImlEINREii~~--------l~~va~~~k~~~~~~~g~ 278 (468) |..+ -..+|-||.+|- .+|.+++|.+-|+..|...+++.||.- |.+.+......+....+. T Consensus 188 k~~~-------~~~iS~ell~ds----~~~l~~~i~~~la~ai~~~~~~~~l~G~G~~~p~Gil~~~~~~~~~~~~~~~~ 256 (401) T protein:vir:44 188 EIYG-------NPQATQKMLDDA----FFNVEAWINSELATEFAEQEEIAFTTGDGTKKPKGFLAYESTEESDKARAFGK 256 (401) T ss_pred heee-------ehhhhHHHHhcc----hHHHHHHHHHHHHHHHHHHHHhhhhccCCCCccceeecccccccccccccccc Confidence 5433 467899999984 457889999999999998888888742 111111111111000000 Q ss_pred cccccc-ccchhHHHHHHHHHHHHHHHHHHHHHhhcCCCccEEEEchhHHHHHHhhcccccccccccccccccccccccC Q lcl|Aclame:pro 279 FDLDVD-SNGRWSVEKFKGLLFQVERDANAIAQETRRGKGNFLICSADVASALAMAGVLDYSSGLNGAGGPSIGEVDDTG 357 (468) Q Consensus 279 ~Dl~~~-~~grw~~e~~k~L~~~i~~ean~i~~~T~r~~~n~~v~S~~Va~~L~~sG~~~~~~~~~~~~~~~~~~~d~t~ 357 (468) .+.... ..+.-..+....|++.+.. .= +. +...|+++.....|... ..+ + +....+.+-+. T Consensus 257 ~~~~~t~~~~~~~~d~i~~~~~~l~~-------~~-~~-~a~~v~n~~~~~~L~~l---kd~-----~-G~~l~~~~~~~ 318 (401) T protein:vir:44 257 LQHIVSGEATAVTADAIIKLIYTLRK-------AH-RT-GAKFMMNNNSLFAIRLL---KDT-----E-GNYLWRPGLEL 318 (401) T ss_pred ccccccccccccCHHHHHHHHHhcch-------hh-hc-CCEEEEcHHHHHHHHHh---hcc-----C-CceeecCCcCC Confidence 000000 0111112233333333321 11 12 23567999988888752 211 1 11111211111 Q ss_pred ceeEEEecCceEEEEcccccc-cCCcceEEEEEecCCcccceeEeeccchhhcccccCCccccceeeeee--eeeeee-c Q lcl|Aclame:pro 358 NLAVGTINGRIKVFVDPYAAN-LSDKHYYVIGYKGTSPYDAGLFYCPYVPLQMVRSIDPNTFQPKIGFKT--RYGMVS-N 433 (468) Q Consensus 358 ~~~~G~l~g~~~vy~D~Ya~~-~~~~dY~~vG~KG~~~~d~glfyaPYv~l~~~~~~dp~s~qP~~g~~t--RY~l~~-n 433 (468) .. -++|. |++|+++..... -+..+.+++| +... +|-=+..-.+....||-.=+-.++|.. |+|..+ + T Consensus 319 g~-~~~l~-G~PVv~~~~~p~~~~~~~~i~~G---d~~~----~~~i~~~~~~~~~~~~~~~~~~v~~~a~~r~d~~~~~ 389 (401) T protein:vir:44 319 GQ-PSSLA-GYGIAENEQMPDIAADAKAIAFG---NFKR----GYTIVDRIGTRILRDPYTNKPFVGFYTTKRTGGMLVD 389 (401) T ss_pred CC-Cceec-ceeeEEecCcCCccCCccEEEEe---ehhc----cEEEEEecceEEeeeccccCCcEEEEEEEEeccEEec Confidence 11 14564 566666532211 0111112222 1100 000000011222233332233333333 444332 2 Q ss_pred CcccccCccccccchhhhhhhcccceeeeeeeec Q lcl|Aclame:pro 434 PFVTTNGLYNGTPDGEALTPNANMYYRRVQVTNL 467 (468) Q Consensus 434 P~~~~~~~~~~~~~~~~~~~~an~y~~r~~v~~l 467 (468) |- -|+.++++.= T Consensus 390 ~~----------------------a~~~l~~~aa 401 (401) T protein:vir:44 390 SQ----------------------AIKLLKIAAA 401 (401) T ss_pred cc----------------------ceEEEEeecC Confidence 21 1222332222 No 107 >protein:vir:8102 Length: 543 # NCBI annotation: gp6 # Family: family:all:21 # MgeID: mge:152 # MgeName: Che9c # Cross-refs: genbank:acc:NP_817683;genbank:gi:29566114;genbank:GeneID:1259308 Probab=33.27 E-value=1.4 Score=19.84 Aligned_cols=327 Identities=12% Similarity=0.027 Sum_probs=107.7 Q ss_pred Ccch------HHHHHhhhhhhCCC-----------------ccchhcchhhhHHHHHHHhHHHHHHhhhh-hhhhhhhhh Q lcl|Aclame:pro 1 MFNA------EHLQEKWSPVLNHG-----------------EAPAIGDRYKRAVTSVLLENQERFLREER-GMLNEVAVN 56 (468) Q Consensus 1 ~~~~------~~l~~kw~p~l~~~-----------------~~~~i~~~~~~~~~~~llenq~~~~~~~~-~~l~e~~~~ 56 (468) -+.. +.|..+..-+-... ....-...+++.....+.+.+...++... ..+.+.. T Consensus 172 e~~~~~~~~~e~l~~~~e~~~~~~~~~~~~~d~~e~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~l~~~e~~~~~~~~-- 249 (543) T protein:vir:81 172 ELRARALSAIEKMQGASDNVRAAATKIIERFDDEDSTLARQCLATSSPAYLRAWSKMARNPHAAILTEEEKRAINEVR-- 249 (543) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhhhHHHHHHHhhHHHHhhhhhhhhhhhhh-- Confidence 0000 11111111100000 00000000111111111111111111110 1111110 Q ss_pred hcCcccccccccccccccccccccccceehhhhHHhh-hhhhhhheeeeecCCccceeeeeeeeeecCCCCcccccccCC Q lcl|Aclame:pro 57 SLGAGTIAPAGSALGSANTGGLAGFDPVLISLVRRAM-PNLMAYDVCGVQPMSGPTGLIFAMRSRYENQAGEEALFNEPD 135 (468) Q Consensus 57 ~~~~~~~~~~~~i~~st~tg~i~~~~P~Lv~l~RRa~-~~LI~~DI~GVQPmTGPTGLIFAMRsrY~~qsG~EA~fnEa~ 135 (468) .....+++|++.--....-.++.+.. +.-+...++-|.|++|..- + .+ . ..+.. T Consensus 250 -----------~~~~t~~~gg~lip~~~~~~ii~~~~~~~~~l~~~~~~~~~~g~~~--~-~~--~--~~~~~------- 304 (543) T protein:vir:81 250 -----------AMGLTKADGGYLVPFQLDPTVIITSNGSLNDIRRFARQVVATGDVW--H-GV--S--SAAVQ------- 304 (543) T ss_pred -----------hcccccccCcccCchhhhhHHHHHHHhhhchhhhhcccccCCcceE--E-EE--e--cCCcc------- Confidence 00001112221111111111222221 1123344455554443321 0 00 0 00000 Q ss_pred ccccccccccccccccccCccccCCCccccccccccccccccccccccchhhhhccCCCCcchhhcceEEEEEEEEeecc Q lcl|Aclame:pro 136 TGFTGGYDASQGDYAVRTGAGVGGDSEGNNPALLNDAAPGTYEVGSKMPREDLERMGEANRLFREMSFSIEKTSVTAQSR 215 (468) Q Consensus 136 t~fSg~~~~~~~~~~~~~~~~~~~~~~gt~~~~~~~a~~~~~t~~~gm~Ta~aE~lG~~~~~f~EMaFsIeK~tVtAKSR 215 (468) ..| .+ | +..+++-..+++.++++++.- T Consensus 305 a~~----------------------------------------------v~--E-----g~~~~~~~~~~~~i~~~~~k~ 331 (543) T protein:vir:81 305 WSW----------------------------------------------DA--E-----FEEVSDDSPEFGQPEIPVKKA 331 (543) T ss_pred eee----------------------------------------------cc--c-----Cccccccccccceeeeeeeee Confidence 000 00 1 112233334455566666666 Q ss_pred cccccccHHHHHHHHHhcCCChhHHHHHHHHHHHHHHhhHHHHHHHhhhhhccccccccc---cccccccccccchhHHH Q lcl|Aclame:pro 216 ALKAEYTLELAQDLKAIHGLDAEQELANILSSEVLAEINREVVRRVYTVAKKGAQNNVAN---AGIFDLDVDSNGRWSVE 292 (468) Q Consensus 216 aLKAEYT~ELAQDLkAiHGLDAE~ELanILStEImlEINREii~~l~~va~~~k~~~~~~---~g~~Dl~~~~~grw~~e 292 (468) +=...+|-||.+|- .|.++.|.+-|...|...+|+-||.- .-+-.+..|+.+ .......+...+-.... T Consensus 332 ~~~~~is~ell~d~-----~~~~~~i~~~l~~~~~~~~d~ail~G---~Gt~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~ 403 (543) T protein:vir:81 332 QGFVPISIEALQDE-----ANVTETVALLFAEGKDELEAVTLTTG---TGQGNQPTGIVTALAGTAAEIAPVTAETFALA 403 (543) T ss_pred EeeehhhHHHHhcc-----HHHHHHHHHHHHHHHHHHHHHHHhcc---CCCCcccccchhhcccccccccccccccccHH Confidence 66678999999873 27899999999999999999988742 000001111100 00011111111111223 Q ss_pred HHHHHHHHHHHHHHHHHHhhcCCCccEEEEchhHHHHHHhhcccccccccccccccccccccccCceeEEEecCceEEEE Q lcl|Aclame:pro 293 KFKGLLFQVERDANAIAQETRRGKGNFLICSADVASALAMAGVLDYSSGLNGAGGPSIGEVDDTGNLAVGTINGRIKVFV 372 (468) Q Consensus 293 ~~k~L~~~i~~ean~i~~~T~r~~~n~~v~S~~Va~~L~~sG~~~~~~~~~~~~~~~~~~~d~t~~~~~G~l~g~~~vy~ 372 (468) ....++..+. ..-.....+|++|.+...|... ..+. |.....-...+. -++|. |++|++ T Consensus 404 ~~~~~~~~l~---------~~~~~~~~~v~n~~~~~~l~~l---kd~~------G~~l~~~~~~g~--~~~l~-G~pv~~ 462 (543) T protein:vir:81 404 DVYAVYEQLA---------ARHRRQGAWLANNLIYNKIRQF---DTQG------GAGLWTTIGNGE--PSQLL-GRPVGE 462 (543) T ss_pred HHHHHHHhhh---------ccccCCcEEEEcHHHHHHHHHh---hcCC------CceeccCcCCCC--Ccccc-ceeeEE Confidence 3333333332 1101122578999999888752 2111 001111111121 24564 467777 Q ss_pred cccccccC--------------CcceEEEEEecCCcccceeEeeccchhhcccccCCccccceeeeeeeeeee-ecC--c Q lcl|Aclame:pro 373 DPYAANLS--------------DKHYYVIGYKGTSPYDAGLFYCPYVPLQMVRSIDPNTFQPKIGFKTRYGMV-SNP--F 435 (468) Q Consensus 373 D~Ya~~~~--------------~~dY~~vG~KG~~~~d~glfyaPYv~l~~~~~~dp~s~qP~~g~~tRY~l~-~nP--~ 435 (468) ..++..+. ++.++++|..++... =..||+-. ..|-...+=.+=+..|+|.. .|| | T Consensus 463 ~~~~~~~~~~~~~~~~~~i~~gd~~~~~i~~~~~~~i----~~~~~~~~----~~~~~~~~~~~~~~~r~d~~v~~~~A~ 534 (543) T protein:vir:81 463 AEAMDANWNTSASADNFVLLYGNFQNYVIADRIGMTV----EFIPHLFG----TNRRPNGSRGWFAYYRMGADVVNPNAF 534 (543) T ss_pred eccccccccccccCCcceEEEeeccceeEEeecccEE----EEeccccc----cchhhcCceEEEEEEeeccEeecccce Confidence 75432110 011122222221111 11122100 01112223344445566664 344 2 Q ss_pred ccccCccccccchhh Q lcl|Aclame:pro 436 VTTNGLYNGTPDGEA 450 (468) Q Consensus 436 ~~~~~~~~~~~~~~~ 450 (468) .... +...+ T Consensus 535 ~~l~------~~~~a 543 (543) T protein:vir:81 535 RLLN------VETAS 543 (543) T ss_pred EEEE------ecccC Confidence 2111 00001 No 108 >protein:vir:9361 Length: 402 # NCBI annotation: SLT orf 37-like protein # Family: family:all:658 # MgeID: mge:166 # MgeName: phi 12 # Cross-refs: genbank:acc:NP_803339;genbank:gi:29028650;genbank:GeneID:1258088 Probab=31.77 E-value=1.5 Score=19.66 Aligned_cols=329 Identities=13% Similarity=0.113 Sum_probs=117.0 Q ss_pred CcchHHHHHhhhhhhCC----C------------ccchhcchhh--hHHHHHH--HhHHHHHHhhhhhh-hhhh------ Q lcl|Aclame:pro 1 MFNAEHLQEKWSPVLNH----G------------EAPAIGDRYK--RAVTSVL--LENQERFLREERGM-LNEV------ 53 (468) Q Consensus 1 ~~~~~~l~~kw~p~l~~----~------------~~~~i~~~~~--~~~~~~l--lenq~~~~~~~~~~-l~e~------ 53 (468) |=.-++|+++|.-+.+. . ...+|....+ ..+.+++ |+.|-+.+.+.... ..+. T Consensus 16 mk~l~el~~~~~e~~~~~~~~~~el~~~~~~~~~~~ee~~~~~~~~~~l~~~~~~l~~~~~~~e~~~~~~~~~~~~~~~~ 95 (402) T protein:vir:93 16 MPTLYELKQSLGMIGQQLKNKNDELSQKATDPNIDMEDIKQLETEKAGLQQRFNIVERQVQDIEEKEKAKVKDKGEAYQS 95 (402) T ss_pred ChHHHHHHHHHHHHHHHHHHHHHHHHHHHhccCcCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccCCC Confidence 43346666666554322 0 0011211100 0000000 11111111100000 0000 Q ss_pred ------hhhh----cCc----ccc--------cccccccccccc-cccccccce-e-hhhhHHhhhhhhhhheeeeecCC Q lcl|Aclame:pro 54 ------AVNS----LGA----GTI--------APAGSALGSANT-GGLAGFDPV-L-ISLVRRAMPNLMAYDVCGVQPMS 108 (468) Q Consensus 54 ------~~~~----~~~----~~~--------~~~~~i~~st~t-g~i~~~~P~-L-v~l~RRa~~~LI~~DI~GVQPmT 108 (468) .... +.. ... .....+.+++.+ |+.. . |. + -.+++.....-+-.+++.|-|++ T Consensus 96 ~~~~~~~~~~~~~~~r~~~~~~~~~~~~~~~~~~~~a~~~~t~~~GG~l-I-P~~~~~~Ii~~~~~~~~l~~~~~v~~~~ 173 (402) T protein:vir:93 96 LSDNEKMVKAKAEFYRHAILPNEFEKPSMEAQRLLHALPTGNDSGGDKL-L-PKTLSKEIVSEPFAKNQLREKARLTNIK 173 (402) T ss_pred CchhHHHHHHHHHHHHHHHhhhhHHHHHHhHHHHHhhhccCCCcCCccc-c-chhHHHHHHHhHHhhhhhhhhceeeecC Confidence 0000 000 000 000001111111 1110 0 11 0 01222222233446777777665 Q ss_pred ccceeeeeeeeeecCCCCcccccccCCccccccccccccccccccCccccCCCccccccccccccccccccccccchhhh Q lcl|Aclame:pro 109 GPTGLIFAMRSRYENQAGEEALFNEPDTGFTGGYDASQGDYAVRTGAGVGGDSEGNNPALLNDAAPGTYEVGSKMPREDL 188 (468) Q Consensus 109 GPTGLIFAMRsrY~~qsG~EA~fnEa~t~fSg~~~~~~~~~~~~~~~~~~~~~~gt~~~~~~~a~~~~~t~~~gm~Ta~a 188 (468) +.+.- |-.+.... .. -.+++ T Consensus 174 ~~~~p----~~~~~~~~-------------------------------------------------a~-------~v~Eg 193 (402) T protein:vir:93 174 GLEIP----RVSYTLDD-------------------------------------------------DD-------FITDV 193 (402) T ss_pred Cceee----eeeccCCc-------------------------------------------------cc-------ccccc Confidence 43220 00000000 00 00111 Q ss_pred hccCCCCcchhhcceEEEEEEEEeecccccccccHHHHHHHHHhcCCChhHHHHHHHHHHHHHHhhHHHHHHHhhhhhcc Q lcl|Aclame:pro 189 ERMGEANRLFREMSFSIEKTSVTAQSRALKAEYTLELAQDLKAIHGLDAEQELANILSSEVLAEINREVVRRVYTVAKKG 268 (468) Q Consensus 189 E~lG~~~~~f~EMaFsIeK~tVtAKSRaLKAEYT~ELAQDLkAiHGLDAE~ELanILStEImlEINREii~~l~~va~~~ 268 (468) +...++...|.+..|.+.|. +-...+|-||.+|- .+|.+++|.+-|+..|+.-.|..++-.- ...+ T Consensus 194 ~~~~~~~~~f~~i~~~~~k~-------~~~i~iS~ell~Ds----~~~l~~~i~~~la~~~~~~e~~~~~~~g---~g~g 259 (402) T protein:vir:93 194 ETAKELKAKGDTVKFTTNKF-------KVFAAISDTVIHGS----DVDLVNWVENALQSGLAAKERKDALAVS---PKSG 259 (402) T ss_pred ccccccccccceeeecceee-------eeechhhHHHHhhh----HHHHHHHHHHHHHHHHHHHHHHhHhhcC---CCcc Confidence 11111122355555544444 44578999999985 3567889999999888876565554322 2222 Q ss_pred ccccc-cccccccccccccchhHHHHHHHHHHHHHHHHHHHHHhhcCCCccEEEEchhHHHHHHhhcccccccccccccc Q lcl|Aclame:pro 269 AQNNV-ANAGIFDLDVDSNGRWSVEKFKGLLFQVERDANAIAQETRRGKGNFLICSADVASALAMAGVLDYSSGLNGAGG 347 (468) Q Consensus 269 k~~~~-~~~g~~Dl~~~~~grw~~e~~k~L~~~i~~ean~i~~~T~r~~~n~~v~S~~Va~~L~~sG~~~~~~~~~~~~~ 347 (468) ...++ .+.++.-...+. .......|++.+.. --+..+.|++-+.....++.- ++..- . T Consensus 260 ~p~g~~~~~~~~~~~~~~----~~d~l~~~~~~l~~--------~y~~na~~imn~~t~~~~~~~---~~d~~-----~- 318 (402) T protein:vir:93 260 LEHMSFYNGSVKEVEGAD----MYDAIINALADLHE--------DYRDNATIYMRYADYVKIISV---LSNGT-----T- 318 (402) T ss_pred ccceeeeccccccccccc----hHHHHHHHHhccCh--------hhhcCCEEEEechHHHHHHHH---HhcCC-----C- Confidence 22222 122222111111 01223333332221 123566676555555555442 22110 0 Q ss_pred cccccccccCceeEEEecCceEEEEcccccccCCcceEEEEEecCCcccceeEeeccchhhcccccCCccccceeeeeee Q lcl|Aclame:pro 348 PSIGEVDDTGNLAVGTINGRIKVFVDPYAANLSDKHYYVIGYKGTSPYDAGLFYCPYVPLQMVRSIDPNTFQPKIGFKTR 427 (468) Q Consensus 348 ~~~~~~d~t~~~~~G~l~g~~~vy~D~Ya~~~~~~dY~~vG~KG~~~~d~glfyaPYv~l~~~~~~dp~s~qP~~g~~tR 427 (468) ..+ ...+ ++|. +++||+.-++.. +++|-- +-||.=|-...+.+..|+.+.+-.+-...| T Consensus 319 -~~~--~~~~----~~ll-G~PV~~t~~~~~------i~~GDf-------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r 377 (402) T protein:vir:93 319 -NFF--DTPA----EKVF-GKPVVFTDAAVK------PIVGDF-------NYFGINYDGTTYDTDKDVKKGEYLFVLTAW 377 (402) T ss_pred -ccc--ccCC----cccc-ccceEEecCCCc------eeeech-------hhhhhhhhhhhhhhhhcccCCceEEEEEEE Confidence 000 0111 3465 568888755431 344421 112222222222222344444433333446 Q ss_pred eee-eecC--cc--cccCccccccc Q lcl|Aclame:pro 428 YGM-VSNP--FV--TTNGLYNGTPD 447 (468) Q Consensus 428 Y~l-~~nP--~~--~~~~~~~~~~~ 447 (468) ++. ++|| |. +.+...+.+|. T Consensus 378 ~Dg~v~~~~A~~~l~ik~~~~~~~~ 402 (402) T protein:vir:93 378 YDQQRTLDSAFRIAKAKENTGPLPS 402 (402) T ss_pred eCcEEechhheEEEEeecCCCCCCC Confidence 654 3456 32 23333344454 No 109 >protein:vir:95963 Length: 395 # NCBI annotation: ORF009 # Family: family:all:635 # MgeID: mge:1594 # MgeName: 2638A # Cross-refs: genbank:acc:YP_239802;genbank:gi:66395459;genbank:GeneID:5132880 Probab=30.12 E-value=1.6 Score=19.46 Aligned_cols=349 Identities=11% Similarity=0.030 Sum_probs=109.5 Q ss_pred Cc-------chHHHHHhhhhhhCCCccchhcchhhhHHHHHHHhHHHHHHh-----hhhhhhhhhhhhhcCcc------c Q lcl|Aclame:pro 1 MF-------NAEHLQEKWSPVLNHGEAPAIGDRYKRAVTSVLLENQERFLR-----EERGMLNEVAVNSLGAG------T 62 (468) Q Consensus 1 ~~-------~~~~l~~kw~p~l~~~~~~~i~~~~~~~~~~~llenq~~~~~-----~~~~~l~e~~~~~~~~~------~ 62 (468) |- ..+++.|+++-+.+.-... ....-+.+.+..+++..++.++ +.+............+. . T Consensus 1 mt~~~~~~e~~~~~~e~~~~~~~~~~~~-~~~e~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~r~~~~l~~ee 79 (395) T protein:vir:95 1 MADMKQNNVKLKNYHEHKKQFANLVQNG-ASDEEQSKAFGAMFDALSNDLQEEITAEINNRVVDNGILAKRSQDPLTSEE 79 (395) T ss_pred ChhHHHHHHHHHHHHHHHHHHHHHHhhh-hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCccccchHH Confidence 21 1123445555444321100 0011111122222222111111 11111111110000000 0 Q ss_pred ccccccccccccccccccccceehh-hhHHhhhhhhhhheeeeecCCccceeeeeeeeeecCCCCcccccccCCcccccc Q lcl|Aclame:pro 63 IAPAGSALGSANTGGLAGFDPVLIS-LVRRAMPNLMAYDVCGVQPMSGPTGLIFAMRSRYENQAGEEALFNEPDTGFTGG 141 (468) Q Consensus 63 ~~~~~~i~~st~tg~i~~~~P~Lv~-l~RRa~~~LI~~DI~GVQPmTGPTGLIFAMRsrY~~qsG~EA~fnEa~t~fSg~ 141 (468) ...-+.+.+.+.++.-.-.-+.+.. ++++....-.-..++-|+|++|.+= ..+..... .+ .|. T Consensus 80 ~~~~~~~~~~t~~~gG~liP~~~~~~Ii~~l~~~s~i~~~~~v~~~~~~~~-----i~~~~~~~--~a-------~w~-- 143 (395) T protein:vir:95 80 RKFFNDINYDVGYTDEKILPETVVERVFDDLQKDHPLLSKINFQNAGIKTR-----VIKADPAG--QA-------VWG-- 143 (395) T ss_pred HHHHHHHhhccCCCCceeccHHHHHHHHHHHHhhhhhhhhceeEecCCceE-----EEEecCCc--ce-------EEe-- Confidence 0011122223332221222222222 2222233334566788999877531 11111100 00 000 Q ss_pred ccccccccccccCccccCCCccccccccccccccccccccccchhhhhccCCCCcchhhcceEEEEEEEEeecccccccc Q lcl|Aclame:pro 142 YDASQGDYAVRTGAGVGGDSEGNNPALLNDAAPGTYEVGSKMPREDLERMGEANRLFREMSFSIEKTSVTAQSRALKAEY 221 (468) Q Consensus 142 ~~~~~~~~~~~~~~~~~~~~~gt~~~~~~~a~~~~~t~~~gm~Ta~aE~lG~~~~~f~EMaFsIeK~tVtAKSRaLKAEY 221 (468) ...+|.-++....|.+..|...|..+- ... T Consensus 144 -------------------------------------------~e~~~~~~~~~~~f~~i~l~~~kl~~~-------~~i 173 (395) T protein:vir:95 144 -------------------------------------------KVFGEIKGQLDAAFREENFTQYKLTCF-------VVL 173 (395) T ss_pred -------------------------------------------ecccccCccccccceeeeeceeeEEEe-------ecc Confidence 000011111223466666666665443 357 Q ss_pred cHHHHHHHHHhcCCChhHHHHHHHHHHHHHHhhHHHHHH-----------Hhhhhhc--cccccccccccccccccccch Q lcl|Aclame:pro 222 TLELAQDLKAIHGLDAEQELANILSSEVLAEINREVVRR-----------VYTVAKK--GAQNNVANAGIFDLDVDSNGR 288 (468) Q Consensus 222 T~ELAQDLkAiHGLDAE~ELanILStEImlEINREii~~-----------l~~va~~--~k~~~~~~~g~~Dl~~~~~gr 288 (468) |-||.+|- ..|.|++|.+.|+..|...+|+.||.- |..+... .+..+ ...+++-+. +.- T Consensus 174 S~ell~ds----~~~ie~~i~~~la~~ia~~~~~a~i~G~G~~~~qP~Gil~~~~~~~~~~~~~-~~~~~~t~~---~~~ 245 (395) T protein:vir:95 174 PDDLSTFG----PAWIERFVRTQIQEAISVALESAIINGGGAAKTQPVGLMKDVNTNSGAVTDK-ASSGTLTFA---DAD 245 (395) T ss_pred cHHHHhcc----hhHHHHHHHHHHHHHHHHHHhhheeeccCCCCcCceeeeecccccccccccc-cccchhhhh---hhH Confidence 88888885 457899999999999999999888741 1111100 00000 011111110 000 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHhhcCCCccEEEEchh-HHHHHHhhcccccccccccccccccccccccCceeEEEecCc Q lcl|Aclame:pro 289 WSVEKFKGLLFQVERDANAIAQETRRGKGNFLICSAD-VASALAMAGVLDYSSGLNGAGGPSIGEVDDTGNLAVGTINGR 367 (468) Q Consensus 289 w~~e~~k~L~~~i~~ean~i~~~T~r~~~n~~v~S~~-Va~~L~~sG~~~~~~~~~~~~~~~~~~~d~t~~~~~G~l~g~ 367 (468) -.+.....+...+....+....+ -++++.|+ ++|. ...+....-|. ++ .|. ..-.|.=+ T Consensus 246 ~~~~~l~~~~~~~~~~~~~~~~~-~~~~~~~~-mn~~t~~~~~g~~~~~---~~--------------~G~-~~~~lg~g 305 (395) T protein:vir:95 246 TTILELNDVLKNLSVDEKGKELK-IDGKVALV-VNPRDSWDVQARYTYL---TA--------------NGG-FVTVLPYN 305 (395) T ss_pred hhHHHHHHHHHhhccccccchhh-hcCceEEE-EcchhhhhcCCcceec---cC--------------CCc-ceeccCCc Confidence 01111111111111111111111 13455554 4443 22221111111 10 111 01111114 Q ss_pred eEEEEcccccc----cCCcceEEEEEecCCcccceeEeeccchhh--c-------ccccCCccccceeeeeeeeeeeecC Q lcl|Aclame:pro 368 IKVFVDPYAAN----LSDKHYYVIGYKGTSPYDAGLFYCPYVPLQ--M-------VRSIDPNTFQPKIGFKTRYGMVSNP 434 (468) Q Consensus 368 ~~vy~D~Ya~~----~~~~dY~~vG~KG~~~~d~glfyaPYv~l~--~-------~~~~dp~s~qP~~g~~tRY~l~~nP 434 (468) ++||.+.++.. .-++-++++|-.++...+-+- ...+...+ | -..+||+.|- ++ .=-....| T Consensus 306 ~~v~~~~~~p~~~i~fgdfs~y~i~~r~~~~i~~~~-~~~~~~d~~~f~~~~r~dg~~~~~~A~~-~l----~i~~~~~~ 379 (395) T protein:vir:95 306 VTIITSEFVPEGKLVAFVTDRYNAVRGGGLTVKKFD-QTLALEDAVLFTAKTFAYGQPDDNKASA-VY----DLKVASAP 379 (395) T ss_pred ceEEEcCCCCCCcEEEEecccEEEEEecceEEEecc-chhhhCCcEEEEEEEEECCEEeccccEE-EE----EeeccCCC Confidence 55666643321 011222344554443332110 00000000 0 0112343331 10 01112222 Q ss_pred cccccCccccccchhh Q lcl|Aclame:pro 435 FVTTNGLYNGTPDGEA 450 (468) Q Consensus 435 ~~~~~~~~~~~~~~~~ 450 (468) ...+...-...+-.+. T Consensus 380 ~~~~~~~~~~~~~~~~ 395 (395) T protein:vir:95 380 RRQTSAGGTTDGIAEA 395 (395) T ss_pred CCCCCCCCCCCccccC Confidence 2211110000111111 No 110 >protein:vir:94673 Length: 419 # NCBI annotation: major capsid protein # Family: family:all:585 # MgeID: mge:1527 # MgeName: mu1/6 # Cross-refs: genbank:acc:YP_579208;genbank:gi:93007444;genbank:GeneID:5076792 Probab=29.61 E-value=1.6 Score=19.40 Aligned_cols=351 Identities=12% Similarity=0.054 Sum_probs=118.2 Q ss_pred CcchHHHHHhhhhhhCCCc--cchh--------------------cchhhhHHHHHHHhHHHHHHhhhhhhhhh-hhh-- Q lcl|Aclame:pro 1 MFNAEHLQEKWSPVLNHGE--APAI--------------------GDRYKRAVTSVLLENQERFLREERGMLNE-VAV-- 55 (468) Q Consensus 1 ~~~~~~l~~kw~p~l~~~~--~~~i--------------------~~~~~~~~~~~llenq~~~~~~~~~~l~e-~~~-- 55 (468) .-..+.+.++...-++.-. +..+ .....+.+ ...+.+ ...+++......+ ... T Consensus 32 ~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~-~~~~~~~~~~~~~~~~~~~ 109 (419) T protein:vir:94 32 VAEARGLADALQAESDRAAARAALLRTAPPAPKGPADGGTPLTPAEAGTFRSL-AQRFAD-SDGLREYRARDKRGQFQVE 109 (419) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccccccccch-hhhhhh-HHHHHHHHHhhhhhhhhHH Confidence 0011112222221111100 0000 00000000 000000 0000000000000 000 Q ss_pred -hhcCcccccccccccccccccccccccceehhhhHH--hhhhhhhhheeeeecCCccceeeeeeeeeecCCCCcccccc Q lcl|Aclame:pro 56 -NSLGAGTIAPAGSALGSANTGGLAGFDPVLISLVRR--AMPNLMAYDVCGVQPMSGPTGLIFAMRSRYENQAGEEALFN 132 (468) Q Consensus 56 -~~~~~~~~~~~~~i~~st~tg~i~~~~P~Lv~l~RR--a~~~LI~~DI~GVQPmTGPTGLIFAMRsrY~~qsG~EA~fn 132 (468) ................++ +.+-...-|.+++=... ....+...+++.+.||++++.-+ +|.. .... T Consensus 110 ~~~~~~~~~~~~~~~~~~~-~~~~~~~~p~~~~~~i~~~~~~~~~i~~~~~~~~~~~~~~~~--~~~~--~~~~------ 178 (419) T protein:vir:94 110 MRDIDPNRLLSRDAPAGTI-TNPNVPHLPQLVPGIVPTTPDLPLLVADLLDQQNADYNVLEY--IRDT--SGTA------ 178 (419) T ss_pred HHHHHHHHhhccccccccc-cCCcccccchhhhHHHHHHHhhhhhhhhcceeeeccCCceee--eeec--cccc------ Confidence 000000000000001111 11111222333221111 11233557899999998765322 2200 0000 Q ss_pred cCCccccccccccccccccccCccccCCCccccccccccccccccccccccchhhhhccCCCCcchhhcceEEEEEEEEe Q lcl|Aclame:pro 133 EPDTGFTGGYDASQGDYAVRTGAGVGGDSEGNNPALLNDAAPGTYEVGSKMPREDLERMGEANRLFREMSFSIEKTSVTA 212 (468) Q Consensus 133 Ea~t~fSg~~~~~~~~~~~~~~~~~~~~~~gt~~~~~~~a~~~~~t~~~gm~Ta~aE~lG~~~~~f~EMaFsIeK~tVtA 212 (468) ...+ .. +. ..-.+| +..+++...++++++..+ T Consensus 179 ---~~~~----------------------~~-~~-----------------a~~v~E-----g~~~~~~~~~~~~i~~~~ 210 (419) T protein:vir:94 179 ---GAGS----------------------TW-NK-----------------AAVVPE-----GTAKPQSTLSFDTITTTL 210 (419) T ss_pred ---cccc----------------------cC-cc-----------------cceecC-----CccccccccceeeEEeee Confidence 0000 00 00 000001 223555555666666666 Q ss_pred ecccccccccHHHHHHHHHhcCCChhHHHHHHHHHHHHHHhhHHHHHHHhhhhhccccccc-cccccccccccccchhHH Q lcl|Aclame:pro 213 QSRALKAEYTLELAQDLKAIHGLDAEQELANILSSEVLAEINREVVRRVYTVAKKGAQNNV-ANAGIFDLDVDSNGRWSV 291 (468) Q Consensus 213 KSRaLKAEYT~ELAQDLkAiHGLDAE~ELanILStEImlEINREii~~l~~va~~~k~~~~-~~~g~~Dl~~~~~grw~~ 291 (468) |.=+-...+|-||.||.- +.+++|.+-|+..|...+|+.||.- .-.++..|+ ...|+.-.... .+.+. T Consensus 211 ~k~~~~~~is~ell~d~~-----~l~~~i~~~la~a~~~~~d~aii~G----~G~~~p~Gi~~~~~~~~~~~~-~~~~~- 279 (419) T protein:vir:94 211 KTVAHWLPITRQAADDNS-----QLMGYIQGRLTYGLRFLRDRQLLNG----NGSTEMQGILTTPGIGTYQQP-KPTAP- 279 (419) T ss_pred eeEEEeehhhHHHHHhHH-----HHHHHHHHHHHHHHHHHHHHHHHhc----cCcccccceeccccccccccc-ccccc- Confidence 666666789999999952 3689999999999999999999741 111122221 11111100000 00000 Q ss_pred HHHHHHHHHHHHHHHHHHHhhcCCCccEEEEchhHHHHHHhhcccccccccccccccccccccccCceeEEEecCceEEE Q lcl|Aclame:pro 292 EKFKGLLFQVERDANAIAQETRRGKGNFLICSADVASALAMAGVLDYSSGLNGAGGPSIGEVDDTGNLAVGTINGRIKVF 371 (468) Q Consensus 292 e~~k~L~~~i~~ean~i~~~T~r~~~n~~v~S~~Va~~L~~sG~~~~~~~~~~~~~~~~~~~d~t~~~~~G~l~g~~~vy 371 (468) ...-..+-.+.+....+ . ..-+.++.+||+|.....|... .+..- +......+-.+ -..++|. |++|+ T Consensus 280 ~t~~~~~~~l~~~~~~~-~-~~~~~~~~~v~n~~~~~~l~~~--k~~~~------~~~~~~~~~~~-~~~~~l~-G~pV~ 347 (419) T protein:vir:94 280 ATDEPPLVDIRRAKTVA-E-IAGFPPDGVVVHPQDWESIELD--QAPGS------GVFRVIANVQG-EATPRIW-GLNVV 347 (419) T ss_pred cccchhHHHHHHHHHhh-h-hccCCCCEEEEcHHHHHHHHHH--hhcCC------CceeecCCccc-CCCcccc-ceeeE Confidence 00000011122222222 1 1224567899999998888653 11000 00000111111 1124564 56888 Q ss_pred EcccccccCCcceEEEE-EecC-C---cccceeEeeccchhhcccccCCccccceeeeeeeeeeee-cC--cccccCccc Q lcl|Aclame:pro 372 VDPYAANLSDKHYYVIG-YKGT-S---PYDAGLFYCPYVPLQMVRSIDPNTFQPKIGFKTRYGMVS-NP--FVTTNGLYN 443 (468) Q Consensus 372 ~D~Ya~~~~~~dY~~vG-~KG~-~---~~d~glfyaPYv~l~~~~~~dp~s~qP~~g~~tRY~l~~-nP--~~~~~~~~~ 443 (468) ++.... ..+ +++| ++.. . ...-.+-..++....| ..-+=.+=+..|+++.+ +| |.... .. T Consensus 348 ~~~~~~---~~~-~~~gd~~~~~~~~~~~~~~v~~~~~~~~~~------~~~~~~~r~~~r~d~~v~~~~a~~~~~--~~ 415 (419) T protein:vir:94 348 STVAIA---QGT-ALVGGFRQGATLWSRQGITVLMTDSHADFF------TANTLVILAEFRANLAVYQPKAFVRVT--FA 415 (419) T ss_pred EcCCCC---Ccc-EEEeeccceEEEEEecceEEEEeccccchh------hcCcEEEEEEEeeccEEeccccEEEEE--ec Confidence 885432 122 3333 1100 0 0001111111111111 11222334455666543 22 21110 11 Q ss_pred cccc Q lcl|Aclame:pro 444 GTPD 447 (468) Q Consensus 444 ~~~~ 447 (468) ..++ T Consensus 416 aa~~ 419 (419) T protein:vir:94 416 AATT 419 (419) T ss_pred cCCC Confidence 1111 No 111 >protein:vir:97053 Length: 390 # NCBI annotation: putative head protein # Family: family:all:585 # MgeID: mge:1653 # MgeName: OP1 # Cross-refs: genbank:acc:YP_453565;genbank:gi:84662600;genbank:GeneID:5142468 Probab=28.28 E-value=1.8 Score=19.23 Aligned_cols=331 Identities=11% Similarity=0.045 Sum_probs=118.4 Q ss_pred CcchHHHHHhhhhhhCCCccchhcchhhhHHHH--HHHhHHHH--------------HHhhh---hhhhhhhhhhhcCcc Q lcl|Aclame:pro 1 MFNAEHLQEKWSPVLNHGEAPAIGDRYKRAVTS--VLLENQER--------------FLREE---RGMLNEVAVNSLGAG 61 (468) Q Consensus 1 ~~~~~~l~~kw~p~l~~~~~~~i~~~~~~~~~~--~llenq~~--------------~~~~~---~~~l~e~~~~~~~~~ 61 (468) =+. ++..++-.-+... +.. -+..+-+ ..++..++ ..++. +.++... ....+.. T Consensus 31 ~~~-~e~~~~~~~~~~e-----~~~-l~~~i~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~ 102 (390) T protein:vir:97 31 ELN-ASARSKVDELFAT-----VGN-LSAEVQAARQRVAELEGNGAGGDVQHVSVGDMFVASEQFQASTGRW-NDRSARA 102 (390) T ss_pred CCC-HHHHHHHHHHHHH-----HHH-HHHHHHHHHHHHHHHHhcccccccccccchhhhhhhHHHHHHHHHh-hhhhhhh Confidence 000 1111111111100 100 0000000 00000000 00000 0000000 0000000 Q ss_pred c----ccccccccccccccccccccceehhhhHHhhhhhhhhheeeeecCCccceeeeeeeeeecCCCCcccccccCCcc Q lcl|Aclame:pro 62 T----IAPAGSALGSANTGGLAGFDPVLISLVRRAMPNLMAYDVCGVQPMSGPTGLIFAMRSRYENQAGEEALFNEPDTG 137 (468) Q Consensus 62 ~----~~~~~~i~~st~tg~i~~~~P~Lv~l~RRa~~~LI~~DI~GVQPmTGPTGLIFAMRsrY~~qsG~EA~fnEa~t~ 137 (468) . .........++++++..-....+-.++++..+..+-.+++.+-||++++.-+.-.. +.++. .. T Consensus 103 ~~~~~~~~~~~~~~~~~~~g~lip~~~~~~ii~~~~~~~~i~~~~~~~~~~~~~~~~~~~~----~~~~~--------a~ 170 (390) T protein:vir:97 103 TMNIKAALNTASTDAAGSAGALTTPNRLPGFITPPDARLTVRDLIGSGRTDSALIEYVQET----GFVNN--------AA 170 (390) T ss_pred hhHHHHHHHhhhcccccccccccchhhhHHHHHHHhhhhhhHhhcceeeccCCceEEEEEe----cCCcc--------ee Confidence 0 00000111112222211111122334555555667778899999987764321111 10000 00 Q ss_pred ccccccccccccccccCccccCCCccccccccccccccccccccccchhhhhccCCCCcchhhcceEEEEEEEEeecccc Q lcl|Aclame:pro 138 FTGGYDASQGDYAVRTGAGVGGDSEGNNPALLNDAAPGTYEVGSKMPREDLERMGEANRLFREMSFSIEKTSVTAQSRAL 217 (468) Q Consensus 138 fSg~~~~~~~~~~~~~~~~~~~~~~gt~~~~~~~a~~~~~t~~~gm~Ta~aE~lG~~~~~f~EMaFsIeK~tVtAKSRaL 217 (468) | .+ | +..+++-..++++++...|.-+- T Consensus 171 ~----------------------------------------------v~--E-----g~~~~~~~~~~~~i~~~~~k~~~ 197 (390) T protein:vir:97 171 I----------------------------------------------VA--E-----GALKPESSLKFAKKTDTTHVIAH 197 (390) T ss_pred e----------------------------------------------ec--C-----CccccccccceeEEEEeeeeEEE Confidence 0 00 1 11223333344455555555555 Q ss_pred cccccHHHHHHHHHhcCCChhHHHHHHHHHHHHHHhhHHHHHHHhhhhhccc-cccccc-cccccccccccchhHHHHHH Q lcl|Aclame:pro 218 KAEYTLELAQDLKAIHGLDAEQELANILSSEVLAEINREVVRRVYTVAKKGA-QNNVAN-AGIFDLDVDSNGRWSVEKFK 295 (468) Q Consensus 218 KAEYT~ELAQDLkAiHGLDAE~ELanILStEImlEINREii~~l~~va~~~k-~~~~~~-~g~~Dl~~~~~grw~~e~~k 295 (468) ...+|-||.+|-- +.++.|.+-|+..|...+|+.||.- ...++ ..|+.+ .++.-......+--..+... T Consensus 198 ~~~is~ell~ds~-----~l~~~i~~~la~a~~~~~d~a~l~G----~g~~~~p~Gi~~~~~~~~~~~~~~~~~~~d~~~ 268 (390) T protein:vir:97 198 TMKATRQILSDAP-----QLASYMNNRLIRGLKVKEDAEILRG----TGANDGLLGLIPQATTYAAPTTIAGATRVDQLR 268 (390) T ss_pred eehhhHHHHHhHH-----HHHHHHHHHHHHHHHHHHHHHHhhc----CCCCccccceeeccccccccccccccchHHHHH Confidence 6789999999842 5788888888888888888877642 11111 112111 11111100000100111112 Q ss_pred HHHHHHHHHHHHHHHhhcCCCccEEEEchhHHHHHHhhcccccccccccccccccccccccCceeEEEecCceEEEEccc Q lcl|Aclame:pro 296 GLLFQVERDANAIAQETRRGKGNFLICSADVASALAMAGVLDYSSGLNGAGGPSIGEVDDTGNLAVGTINGRIKVFVDPY 375 (468) Q Consensus 296 ~L~~~i~~ean~i~~~T~r~~~n~~v~S~~Va~~L~~sG~~~~~~~~~~~~~~~~~~~d~t~~~~~G~l~g~~~vy~D~Y 375 (468) .++. . ....-...+.+|++|+....|.. +..+. |..+......+. .++|. |++|+++.. T Consensus 269 ~~~~-------~--~~~~~~~~~~~v~n~~~~~~L~~---lkd~~------G~~l~~~~~~~~--~~~l~-G~pV~~~~~ 327 (390) T protein:vir:97 269 LAML-------Q--ASLAEYPASGIVINPIDWAAIEL---AKDAN------NQYLIGNARGTL--TPTLW-GLPVVATQA 327 (390) T ss_pred HHHH-------h--hccccCCCCEEEEcHHHHHHHHH---hhcCC------CceeecCccCCC--Cceec-ceeeEEcCC Confidence 2211 1 12233456678999999988874 22111 111111111111 24564 668887754 Q ss_pred ccccCCcceEEEEEecCCcccceeEeeccchhhcccccCC---ccccceeeeeeeeeeee-cCcccccCccccccchhhh Q lcl|Aclame:pro 376 AANLSDKHYYVIGYKGTSPYDAGLFYCPYVPLQMVRSIDP---NTFQPKIGFKTRYGMVS-NPFVTTNGLYNGTPDGEAL 451 (468) Q Consensus 376 a~~~~~~dY~~vG~KG~~~~d~glfyaPYv~l~~~~~~dp---~s~qP~~g~~tRY~l~~-nP~~~~~~~~~~~~~~~~~ 451 (468) . |.+-+++|--. .++++.....+......+. .+-+=.+-...||++.+ +|= T Consensus 328 ~----~~~~~~~gd~~-----~~~~~~~~~~~~i~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~---------------- 382 (390) T protein:vir:97 328 M----APGEFLVGAFD-----LAAQIFDQWDARVEIGYVNDDFQRNMVTVLAEERLALVVYRPE---------------- 382 (390) T ss_pred C----CCCcEEEEecc-----ceEEEEEecceEEEEeecccccccCcEEEEEEEeeccEEeccc---------------- Confidence 3 33334444210 0111111111111111111 12222344456777654 231 Q ss_pred hhhcccceeeeeee Q lcl|Aclame:pro 452 TPNANMYYRRVQVT 465 (468) Q Consensus 452 ~~~an~y~~r~~v~ 465 (468) -|-++.++ T Consensus 383 ------a~v~~~~a 390 (390) T protein:vir:97 383 ------ALITGSFA 390 (390) T ss_pred ------cEEEEEeC Confidence 11222222 No 112 >protein:vir:100247 Length: 425 # NCBI annotation: gp76 # Family: family:all:21 # MgeID: mge:1619 # MgeName: Bcep176 # Cross-refs: genbank:acc:YP_355412;genbank:gi:77864702;genbank:GeneID:3725969 Probab=28.23 E-value=1.8 Score=19.23 Aligned_cols=341 Identities=13% Similarity=0.111 Sum_probs=117.7 Q ss_pred CcchHHHHHhhhhhhCCCccchhcchhhhHHHHHHHhHHHHH--Hhhhhhhhhh-hhhhhcCc--cccccccccccccc- Q lcl|Aclame:pro 1 MFNAEHLQEKWSPVLNHGEAPAIGDRYKRAVTSVLLENQERF--LREERGMLNE-VAVNSLGA--GTIAPAGSALGSAN- 74 (468) Q Consensus 1 ~~~~~~l~~kw~p~l~~~~~~~i~~~~~~~~~~~llenq~~~--~~~~~~~l~e-~~~~~~~~--~~~~~~~~i~~st~- 74 (468) .....+.+++=.-+ +. +|.. -++. +....+.+.+. ..+....+.+ .....+.. -++.....+..+++ T Consensus 64 ~~~~~e~~~~~~~~-~~----ei~~-~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~af~~~l~~~e~~~al~~~t~~ 136 (425) T protein:vir:10 64 GLPTSDALAKVDKV-SA----DLEA-LQAA-VDEANIKIAAAQMGANGVKPLRDPEYTEAFKAHVKRGDVQAALNKGEDS 136 (425) T ss_pred hhccHHHHHHHHHH-HH----HHHH-HHHH-HHHHHHHHHhhhcccccccccccHHHHHHHHHHhhhhhhHHHhhcCcCC Confidence 11111111110000 00 0100 0000 00000000000 0000000000 00000000 00000111122222 Q ss_pred ccccccccceeh-hhhHHhhhhhhhhheeeeecCCccceeeeeeeeeecCCCCcccccccCCcccccccccccccccccc Q lcl|Aclame:pro 75 TGGLAGFDPVLI-SLVRRAMPNLMAYDVCGVQPMSGPTGLIFAMRSRYENQAGEEALFNEPDTGFTGGYDASQGDYAVRT 153 (468) Q Consensus 75 tg~i~~~~P~Lv-~l~RRa~~~LI~~DI~GVQPmTGPTGLIFAMRsrY~~qsG~EA~fnEa~t~fSg~~~~~~~~~~~~~ 153 (468) .|++. .-+.+. .++++..+..+..+++.+-||+++.+-+.-.. ++.. ..| T Consensus 137 ~gG~l-vP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~~~~~~------~~~~-------a~w--------------- 187 (425) T protein:vir:10 137 EGGYL-TPIEWDRTITNKLVLISPMRQLCRVQPVSKAGFSKLFNM------GGTT-------SGW--------------- 187 (425) T ss_pred CCcee-ccHhHHHHHHHHHHhhhhhhhhceeeeccCCceEEEEEc------CCcc-------eee--------------- Confidence 22211 112221 24454555667788999999987765333100 0000 000 Q ss_pred CccccCCCccccccccccccccccccccccchhhhhccCCCC-cchhhcceEEEEEEEEeecccccccccHHHHHHHHHh Q lcl|Aclame:pro 154 GAGVGGDSEGNNPALLNDAAPGTYEVGSKMPREDLERMGEAN-RLFREMSFSIEKTSVTAQSRALKAEYTLELAQDLKAI 232 (468) Q Consensus 154 ~~~~~~~~~gt~~~~~~~a~~~~~t~~~gm~Ta~aE~lG~~~-~~f~EMaFsIeK~tVtAKSRaLKAEYT~ELAQDLkAi 232 (468) .++++.....+ ..|.++.|++-|..+ ...+|-||.+|- T Consensus 188 -------------------------------v~E~~~~~~~~~~~f~~v~~~~~k~~~-------~i~iS~ell~ds--- 226 (425) T protein:vir:10 188 -------------------------------VGEASQRPQTNAATFQPLSFASGEIYA-------NPAATQQILDDA--- 226 (425) T ss_pred -------------------------------eccccccccccccccceeeeeheeeEe-------ehHhHHHHHhcc--- Confidence 00011111111 246666666666655 456899999985 Q ss_pred cCCChhHHHHHHHHHHHHHHhhHHHHHH--------Hhhhhhcccccccccccccc-ccccccchhHHHHHHHHHHHHHH Q lcl|Aclame:pro 233 HGLDAEQELANILSSEVLAEINREVVRR--------VYTVAKKGAQNNVANAGIFD-LDVDSNGRWSVEKFKGLLFQVER 303 (468) Q Consensus 233 HGLDAE~ELanILStEImlEINREii~~--------l~~va~~~k~~~~~~~g~~D-l~~~~~grw~~e~~k~L~~~i~~ 303 (468) .+|.+++|.+-|+..|..-+|+-||.- |.+....+........|... ......+--..+....|++.+.. T Consensus 227 -~~~l~~~i~~~la~ai~~~~d~~~l~G~G~~~p~Gil~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~l~~l~~~l~~ 305 (425) T protein:vir:10 227 -EIDLESWLATEVQTEFAKQEGKAFLAGDGTNKPNGLLTYIAGGANAAKHPFGAIEVVNSGAAADITSDGIIDLVYDLPS 305 (425) T ss_pred -hhHHHHHHHHHHHHHHHHHHHhhhhcccCCCCcceeeeccccccccccccccccccccccccccccHHHHHHHHhhhhh Confidence 357889999999999999999888752 11111110000000000000 00000111112223333332211 Q ss_pred HHHHHHHhhcCCCccEEEEchhHHHHHHhhcccccccccccccccccccccccCceeEEEecCceEEEEccccccc-CCc Q lcl|Aclame:pro 304 DANAIAQETRRGKGNFLICSADVASALAMAGVLDYSSGLNGAGGPSIGEVDDTGNLAVGTINGRIKVFVDPYAANL-SDK 382 (468) Q Consensus 304 ean~i~~~T~r~~~n~~v~S~~Va~~L~~sG~~~~~~~~~~~~~~~~~~~d~t~~~~~G~l~g~~~vy~D~Ya~~~-~~~ 382 (468) . -+..+ .+|++|.....|... ..+ + |....+-+-+... .++|. |++|+++.++... +.. T Consensus 306 ---~-----~~~~a-~~vmn~~~~~~L~~l---kD~-----~-G~~l~~~~~~~g~-~~~l~-G~PV~~~~~~p~~~~~~ 365 (425) T protein:vir:10 306 ---A-----FTGNA-RFAMNRNTQRQVRKL---KDG-----Q-GNYLWQPSYVAGQ-PATLA-GYPVTEVPDMPDVAANS 365 (425) T ss_pred ---h-----hccCC-EEEEchHHHHHHHHh---hcC-----C-CceeeccCccCCC-Cceec-ceeeEEecCcCCccCCc Confidence 1 12233 468999998888752 211 1 1111111111111 25675 4677777443211 112 Q ss_pred ceEEEEEecCCcccceeEeeccchhhcccccCCccccceee--eeeeeeee-ecCcccccCccccccchhhhhhhcccce Q lcl|Aclame:pro 383 HYYVIGYKGTSPYDAGLFYCPYVPLQMVRSIDPNTFQPKIG--FKTRYGMV-SNPFVTTNGLYNGTPDGEALTPNANMYY 459 (468) Q Consensus 383 dY~~vG~KG~~~~d~glfyaPYv~l~~~~~~dp~s~qP~~g--~~tRY~l~-~nP~~~~~~~~~~~~~~~~~~~~an~y~ 459 (468) +.+++| +... ..+... ...+....||-.-+-.++ ...||+.. .+|-+.. T Consensus 366 ~~i~~G---d~~~--~~~i~~--~~~~~v~~d~~~~~~~~~~~~~~r~d~~v~~~~A~~--------------------- 417 (425) T protein:vir:10 366 TPILFG---DFQQ--TYLIID--RIGVRVLRDPYTAKPYVLFYTTKRVGGGLLNPEPMR--------------------- 417 (425) T ss_pred cEEEEE---ehhc--cEEEEE--ecceEEEecccccCCcEEEEEEEEeccEeecccceE--------------------- Confidence 334433 1100 011111 111222233333223333 33466543 3442210 Q ss_pred eeeeeeecC Q lcl|Aclame:pro 460 RRVQVTNLM 468 (468) Q Consensus 460 ~r~~v~~l~ 468 (468) .++|+.== T Consensus 418 -~l~~~as~ 425 (425) T protein:vir:10 418 -AMKVAASE 425 (425) T ss_pred -EEEeeccC Confidence 00000000 No 113 >protein:vir:96833 Length: 275 # NCBI annotation: ORF015 # Family: family:all:522 # MgeID: mge:1642 # MgeName: EW # Cross-refs: genbank:acc:YP_240157;genbank:gi:66395822;genbank:GeneID:5133174 Probab=27.39 E-value=1.8 Score=19.12 Aligned_cols=261 Identities=14% Similarity=0.058 Sum_probs=110.0 Q ss_pred eeeeee--ecCCCCcccccc---c---CCccccccccccccccccccCccccCCCccccccccccccccccccccccchh Q lcl|Aclame:pro 115 FAMRSR--YENQAGEEALFN---E---PDTGFTGGYDASQGDYAVRTGAGVGGDSEGNNPALLNDAAPGTYEVGSKMPRE 186 (468) Q Consensus 115 FAMRsr--Y~~qsG~EA~fn---E---a~t~fSg~~~~~~~~~~~~~~~~~~~~~~gt~~~~~~~a~~~~~t~~~gm~Ta 186 (468) -||=.. ..+---.|-|-. | ..--|++-.. .. ....+ .+ +...++..--... T Consensus 1 ~~~~~~T~l~d~i~PEv~~~~v~~~~~~~~~~~~~~~-~~------------~~l~g-~~-------G~tv~iP~~~~ig 59 (275) T protein:vir:96 1 MALENMTKLANMVNPEVLAPMMQAELDKKLKFAQFAD-ID------------NTLVG-QP-------GNTITFPAFVYSG 59 (275) T ss_pred CCCcccchhhhhhchHHHHHHHHHHHHHhhhhcccce-ec------------ccccC-CC-------CCEEEeeeeccCC Confidence 222110 000000010000 0 0000110000 00 00000 00 0111111000112 Q ss_pred hhhccCCC-CcchhhcceEEEEEEEEeecccccccccHHHHHHHHH-hcCCChhHHHHHHHHHHHHHHhhHHHHHHHhhh Q lcl|Aclame:pro 187 DLERMGEA-NRLFREMSFSIEKTSVTAQSRALKAEYTLELAQDLKA-IHGLDAEQELANILSSEVLAEINREVVRRVYTV 264 (468) Q Consensus 187 ~aE~lG~~-~~~f~EMaFsIeK~tVtAKSRaLKAEYT~ELAQDLkA-iHGLDAE~ELanILStEImlEINREii~~l~~v 264 (468) ++|.+.++ .-+..++. ....+++.|-|.-.-+++ |+-+ .-+-|.=.|..+-++..|+..++.+++..+.+. T Consensus 60 ~a~~~~~g~~i~~~~lt--~~~~~~~i~~~~~~~~i~-----D~~~~~~~~d~~~~~~~~~a~~~a~~~d~~ll~~l~~a 132 (275) T protein:vir:96 60 DAKVVPEGEEIPIDLIE--TKKRQATIRKIGKGTVLT-----DEALLSGYGDPKGEAVRQHGLAIANKVDNDVLEALQGA 132 (275) T ss_pred ccccccCCCCcchhhcc--cceeeEEeehhccccccc-----HHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHhcc Confidence 23333322 22344444 344445555554433333 3333 225688889999999999999999998877653 Q ss_pred hhccccccccccccccccccccchhHHHHHHHHHHHHHHHHHHHHHhhcCCCccEEEEchhHHHHHHhhccccccccccc Q lcl|Aclame:pro 265 AKKGAQNNVANAGIFDLDVDSNGRWSVEKFKGLLFQVERDANAIAQETRRGKGNFLICSADVASALAMAGVLDYSSGLNG 344 (468) Q Consensus 265 a~~~k~~~~~~~g~~Dl~~~~~grw~~e~~k~L~~~i~~ean~i~~~T~r~~~n~~v~S~~Va~~L~~sG~~~~~~~~~~ 344 (468) .... +...++ .+.+-..+.++..+ -..+++++++|++++.|.......+.+.... T Consensus 133 ~~~~------~~~~~~----------~d~i~dA~~~lgd~---------~~~~~~ivv~p~~~~~L~k~~~~~f~~~~~~ 187 (275) T protein:vir:96 133 TLKV------EADITK----------LAGLQTAIDKFNDE---------DLEPMVLFVNPLDAGKLRASATDNFTRATLL 187 (275) T ss_pred cccc------cccccC----------HHHHHHHHHHhccc---------cCCccEEEeCHHHHHHHHhcccccccccccc Confidence 2221 111121 23333333333322 1467899999999999966432333332221 Q ss_pred ccccccccccccCceeEEEecCceEEEEcccccccCCcceEEEEEe-cCCcccceeEeeccchhhcccccCCccccceee Q lcl|Aclame:pro 345 AGGPSIGEVDDTGNLAVGTINGRIKVFVDPYAANLSDKHYYVIGYK-GTSPYDAGLFYCPYVPLQMVRSIDPNTFQPKIG 423 (468) Q Consensus 345 ~~~~~~~~~d~t~~~~~G~l~g~~~vy~D~Ya~~~~~~dY~~vG~K-G~~~~d~glfyaPYv~l~~~~~~dp~s~qP~~g 423 (468) +.. .-.+| .+|++ .|++||+| ++.|. |-.+-++ |.-. |+.. -+...-.--|++.++=.+- T Consensus 188 --g~~---~~~~G--~ig~~-~G~~Vi~s----~~~p~-~t~~i~~~gA~~-----~~~~-~~~~vE~~Rd~~~~~d~i~ 248 (275) T protein:vir:96 188 --GDN---VIVKG--AFGEA-LGAIIVRS----NKIKE-GEAILAKRGAVK-----LITK-RDFFLETERHASHKSTALF 248 (275) T ss_pred --ccc---ceecc--cccee-cCeeEEEe----CCCCc-ceEEEEecccee-----eeec-CCcccccccchhhcCcEEE Confidence 111 11112 34666 57899999 54442 2222222 2111 1110 0111122238889999888 Q ss_pred eeeeeeee-ecC--cccccCccccccchhhhhh Q lcl|Aclame:pro 424 FKTRYGMV-SNP--FVTTNGLYNGTPDGEALTP 453 (468) Q Consensus 424 ~~tRY~l~-~nP--~~~~~~~~~~~~~~~~~~~ 453 (468) -..+||+. .|| ..+.....++. +- T Consensus 249 ~~~~y~~~~~~~~~vv~~t~~~~~~------~~ 275 (275) T protein:vir:96 249 SDKHYVAYLYDESKVVKITKSASGL------GV 275 (275) T ss_pred EeEEEEEEEEcCccEEEEEeccccc------CC Confidence 88899854 455 11111112221 11 No 114 >protein:vir:1268 Length: 397 # NCBI annotation: hypothetical protein # Family: family:all:21 # MgeID: mge:329 # MgeName: phi-105 # Cross-refs: genbank:acc:NP_690760;genbank:gi:22855000;genbank:GeneID:955203 Probab=25.48 E-value=2.1 Score=18.87 Aligned_cols=321 Identities=13% Similarity=0.044 Sum_probs=114.4 Q ss_pred CcchHHHH-HhhhhhhCCCccchhcchhhhHHHHHHHhHHHHHHhhhh-----hhhhhhhhhhcCccccccccccccccc Q lcl|Aclame:pro 1 MFNAEHLQ-EKWSPVLNHGEAPAIGDRYKRAVTSVLLENQERFLREER-----GMLNEVAVNSLGAGTIAPAGSALGSAN 74 (468) Q Consensus 1 ~~~~~~l~-~kw~p~l~~~~~~~i~~~~~~~~~~~llenq~~~~~~~~-----~~l~e~~~~~~~~~~~~~~~~i~~st~ 74 (468) +....++. +.=....+....++......+......-+.+.+..+... +.+.+....... ...........++ T Consensus 52 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~--~~~~~a~~~~~~~ 129 (397) T protein:vir:12 52 MTEGRSLDVPDLPGGVNFVPEQERNPEGQRSQGQGNEERQQQYSKAFLKGLRGKRLTDEERDLLD--SPEFRAMSGINDE 129 (397) T ss_pred HHHHHHHHHHHHHHHhhhhhhhhhhhcccccccchhhHHHHHHHHHHHHHHhccCCcHHHHHHHh--hhhhhhccccccc Confidence 11111000 000000000000000000000000011111111111110 011110000000 0000001111222 Q ss_pred cccc---ccccceehhhhHHhhhhhhhhheeeeecCCccceeeeeeeeeecCCCCcccccccCCcccccccccccccccc Q lcl|Aclame:pro 75 TGGL---AGFDPVLISLVRRAMPNLMAYDVCGVQPMSGPTGLIFAMRSRYENQAGEEALFNEPDTGFTGGYDASQGDYAV 151 (468) Q Consensus 75 tg~i---~~~~P~Lv~l~RRa~~~LI~~DI~GVQPmTGPTGLIFAMRsrY~~qsG~EA~fnEa~t~fSg~~~~~~~~~~~ 151 (468) +|++ ..+.+.+ ++...+..+-.+++.+.||+++.|-+--.|.. ++..+ .| T Consensus 130 ~gg~lvP~~~~~~i---i~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~----~~~~a-------~~------------- 182 (397) T protein:vir:12 130 DGGILIPEDIGRQI---HEFKRQFEPLEQYVTVEPVTTRSGTRLLEKNA----DMVPF-------SP------------- 182 (397) T ss_pred cCcccCchhHHHHH---HHhhhhhhhHHhhcceeeccCCceeEEEEEec----CCcce-------ee------------- Confidence 2332 2222334 44444666778999999999988854322210 00000 00 Q ss_pred ccCccccCCCccccccccccccccccccccccchhhhhccCCC-CcchhhcceEEEEEEEEeecccccccccHHHHHHHH Q lcl|Aclame:pro 152 RTGAGVGGDSEGNNPALLNDAAPGTYEVGSKMPREDLERMGEA-NRLFREMSFSIEKTSVTAQSRALKAEYTLELAQDLK 230 (468) Q Consensus 152 ~~~~~~~~~~~gt~~~~~~~a~~~~~t~~~gm~Ta~aE~lG~~-~~~f~EMaFsIeK~tVtAKSRaLKAEYT~ELAQDLk 230 (468) .++++...+. ...|.++.|+..|..+- ..+|-||.+|-- T Consensus 183 ---------------------------------v~Eg~~~~~~~~~~~~~v~~~~~k~~~~-------~~is~e~l~ds~ 222 (397) T protein:vir:12 183 ---------------------------------VEELGNLPEIDQPRFTKVSYSIIDYGGI-------MTLSNSMLNDSD 222 (397) T ss_pred ---------------------------------ecccccccccccccceeEEeeheeeEee-------ehhhHHHHhhch Confidence 0000001111 12456666666666554 458999998754 Q ss_pred HhcCCChhHHHHHHHHHHHHHHhhHHHHHHHhhhhhccccccccccccccccccccchhHHHHHHHHHH-HHHHHHHHHH Q lcl|Aclame:pro 231 AIHGLDAEQELANILSSEVLAEINREVVRRVYTVAKKGAQNNVANAGIFDLDVDSNGRWSVEKFKGLLF-QVERDANAIA 309 (468) Q Consensus 231 AiHGLDAE~ELanILStEImlEINREii~~l~~va~~~k~~~~~~~g~~Dl~~~~~grw~~e~~k~L~~-~i~~ean~i~ 309 (468) +|.++.|.+.|...|...+|+-|+.-.-+ + ...|+..++ ....+++ .+. .. T Consensus 223 ----~~l~~~i~~~l~~~~~~~~d~~il~G~g~----~-----~~~g~~~~~----------~i~~~~~~~l~---~~-- 274 (397) T protein:vir:12 223 ----QAIMTYVAKWFAKKSVVTRNNLILAAIAS----L-----KKVDIDGLD----------GIKKALNVTLD---PM-- 274 (397) T ss_pred ----HHHHHHHHHHHHHHHHHHHHHHHHhcccc----c-----cccccccHH----------HHHHHHhhccc---hh-- Confidence 46788899999999998888887754321 1 223343221 1222221 111 11 Q ss_pred HhhcCCCccEEEEchhHHHHHHhhcccccccccccccccccccccccCceeEEEecCceEEEEcccccccCCcceEEEEE Q lcl|Aclame:pro 310 QETRRGKGNFLICSADVASALAMAGVLDYSSGLNGAGGPSIGEVDDTGNLAVGTINGRIKVFVDPYAANLSDKHYYVIGY 389 (468) Q Consensus 310 ~~T~r~~~n~~v~S~~Va~~L~~sG~~~~~~~~~~~~~~~~~~~d~t~~~~~G~l~g~~~vy~D~Ya~~~~~~dY~~vG~ 389 (468) -..+..++|+|.....|... ..+ + |....+.+-+.. .-++|. |++|++...+ ..+. T Consensus 275 ----~~~~a~~~~n~~~~~~L~~l---kd~-----~-G~~l~~~~~~~g-~~~~l~-G~pv~~~~~~---------~~~~ 330 (397) T protein:vir:12 275 ----VAPGSIVLTNQDGYDWLDTL---KDG-----T-GRYLLQPDPTNP-TKKLLD-GRPVVPFTNR---------VLKT 330 (397) T ss_pred ----hhCCCEEEEcHHHHHHHHHh---hcc-----C-CceeecccccCC-CCcccc-ceeeEEeccc---------cccc Confidence 12334578999998888753 111 0 111111111111 114554 4466543110 0000 Q ss_pred ecCCcccceeEeeccc---------hhhcccccCC----ccccceeeeeeeeeeee-cCcccccCccccccchhhhhhhc Q lcl|Aclame:pro 390 KGTSPYDAGLFYCPYV---------PLQMVRSIDP----NTFQPKIGFKTRYGMVS-NPFVTTNGLYNGTPDGEALTPNA 455 (468) Q Consensus 390 KG~~~~d~glfyaPYv---------~l~~~~~~dp----~s~qP~~g~~tRY~l~~-nP~~~~~~~~~~~~~~~~~~~~a 455 (468) - .-+.-++|+.|- .+.+...-.+ .+-+-.+-...|++..+ ||= T Consensus 331 ~---~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~f~~~~~~~r~~~r~d~~~~~~~-------------------- 387 (397) T protein:vir:12 331 Q---KGKAPLIIGNLKEAIVLFDREQQSIASTDTGAGAFETNSTKVRGIEREDVRKWDED-------------------- 387 (397) T ss_pred C---CCccEEEEEehhceEEEEeecceEEEEeccccchhhcCceEEEEEEeeccEEeccc-------------------- Confidence 0 001112222211 0011100001 12334555666676543 331 Q ss_pred ccceeeeeeeec Q lcl|Aclame:pro 456 NMYYRRVQVTNL 467 (468) Q Consensus 456 n~y~~r~~v~~l 467 (468) -|..+.++.= T Consensus 388 --a~~~~~~t~~ 397 (397) T protein:vir:12 388 --AVVFGQITVE 397 (397) T ss_pred --ceEEEEEeeC Confidence 0111111111 Done!