Query lcl|NC_015288.1_cdsid_YP_004324495.1 [gene=gp23] [protein=precursor of major head subunit] [protein_id=YP_004324495.1] [location=118084..119490] Match_columns 468 No_of_seqs 162 out of 418 Neff 4.8 Searched_HMMs 1612 Date Thu Nov 7 14:51:54 2013 Command /home/guerois/workspace/virfam/python/lib/hhsearch//hhsearch2 -i .//seq/seq_123 -d /home/guerois/workspace/virfam/python/profile_database/capsid_neck_tail.hhm -glob -cpu 7 -o .//seq/HHR/seq_123_vs_rec_db.hhr No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 protein:vir:106998 Length: 468 100.0 3E-250 2E-253 1388.4 36.8 468 1-468 1-468 (468) 2 protein:vir:104915 Length: 470 100.0 2E-238 1E-241 1323.5 36.8 456 1-468 3-470 (470) 3 protein:vir:104549 Length: 462 100.0 1E-236 9E-240 1313.5 35.4 451 2-468 1-462 (462) 4 protein:vir:103181 Length: 457 100.0 4E-231 3E-234 1283.6 35.2 446 2-468 1-457 (457) 5 protein:vir:106286 Length: 534 100.0 1E-224 8E-228 1248.0 35.3 459 2-467 1-534 (534) 6 protein:vir:6901 Length: 522 # 100.0 2E-223 1E-226 1241.7 34.1 459 1-467 4-522 (522) 7 protein:vir:101039 Length: 529 100.0 9E-223 6E-226 1237.9 34.0 455 1-467 2-529 (529) 8 protein:vir:103463 Length: 521 100.0 2E-222 9E-226 1236.7 33.2 459 1-467 3-521 (521) 9 protein:vir:7214 Length: 521 # 100.0 2E-222 1E-225 1236.6 33.1 459 1-467 3-521 (521) 10 protein:vir:80986 Length: 528 100.0 9E-222 6E-225 1232.4 34.1 460 1-467 1-528 (528) 11 protein:vir:101811 Length: 529 100.0 1E-221 8E-225 1231.5 34.4 460 1-467 2-529 (529) 12 protein:vir:6601 Length: 528 # 100.0 2E-221 1E-224 1230.9 33.4 460 1-467 1-528 (528) 13 protein:vir:98143 Length: 524 100.0 5E-221 3E-224 1228.4 33.5 458 1-467 1-524 (524) 14 protein:vir:5670 Length: 514 # 100.0 2E-220 1E-223 1225.5 32.8 455 5-467 1-514 (514) 15 protein:vir:100603 Length: 529 100.0 1E-218 6E-222 1215.6 33.8 460 1-467 2-529 (529) 16 protein:vir:107947 Length: 519 100.0 1E-217 8E-221 1209.7 33.7 459 2-467 1-519 (519) 17 protein:vir:5942 Length: 523 # 100.0 5E-194 3E-197 1080.2 32.1 406 1-447 1-523 (523) 18 protein:vir:81100 Length: 415 95.6 0.0017 1.1E-06 35.8 16.8 341 1-455 28-415 (415) 19 protein:vir:79987 Length: 415 95.6 0.0017 1.1E-06 35.8 16.8 341 1-455 28-415 (415) 20 protein:vir:98339 Length: 415 95.6 0.0017 1.1E-06 35.8 16.8 341 1-455 28-415 (415) 21 protein:vir:4830 Length: 397 # 95.4 0.0014 8.6E-07 36.3 12.0 327 1-464 33-397 (397) 22 protein:vir:4953 Length: 397 # 95.4 0.0015 9.6E-07 36.0 12.2 327 1-464 33-397 (397) 23 protein:vir:9410 Length: 415 # 94.7 0.0038 2.3E-06 33.9 16.7 348 1-455 28-415 (415) 24 protein:vir:4997 Length: 397 # 94.3 0.0033 2E-06 34.2 11.2 327 1-455 21-397 (397) 25 protein:vir:7409 Length: 408 # 94.1 0.0053 3.3E-06 33.0 20.2 332 1-451 4-408 (408) 26 protein:vir:78523 Length: 338 94.1 0.0055 3.4E-06 33.0 16.3 307 47-448 1-338 (338) 27 protein:vir:4600 Length: 415 # 93.9 0.0062 3.8E-06 32.7 17.4 341 1-455 28-415 (415) 28 protein:vir:4700 Length: 415 # 93.9 0.0062 3.8E-06 32.7 17.4 341 1-455 28-415 (415) 29 protein:vir:1886 Length: 385 # 93.4 0.0078 4.9E-06 32.1 19.2 333 1-455 1-385 (385) 30 protein:vir:191 Length: 385 # 93.4 0.0078 4.9E-06 32.1 19.2 333 1-455 1-385 (385) 31 protein:vir:41 Length: 299 # N 92.0 0.013 8.1E-06 30.9 19.1 278 63-456 1-299 (299) 32 protein:vir:6212 Length: 434 # 91.8 0.0068 4.2E-06 32.5 9.1 344 1-449 58-434 (434) 33 protein:vir:3033 Length: 272 # 88.8 0.03 1.9E-05 28.9 17.7 269 117-454 1-272 (272) 34 protein:vir:9820 Length: 272 # 88.8 0.03 1.9E-05 28.9 17.7 269 117-454 1-272 (272) 35 protein:vir:9574 Length: 300 # 88.8 0.03 1.9E-05 28.9 17.9 280 69-444 1-300 (300) 36 protein:vir:1433 Length: 435 # 88.4 0.033 2E-05 28.7 21.1 347 2-453 1-435 (435) 37 protein:vir:104256 Length: 458 87.2 0.04 2.5E-05 28.2 18.2 337 1-443 81-458 (458) 38 protein:vir:105905 Length: 304 83.0 0.072 4.5E-05 26.8 15.7 280 58-449 1-304 (304) 39 protein:vir:94142 Length: 304 83.0 0.072 4.5E-05 26.8 15.7 280 58-449 1-304 (304) 40 protein:vir:96123 Length: 274 83.0 0.072 4.5E-05 26.8 13.7 255 121-468 1-274 (274) 41 protein:vir:81227 Length: 413 82.7 0.075 4.6E-05 26.8 19.9 349 1-468 31-411 (413) 42 protein:vir:3845 Length: 395 # 82.5 0.076 4.7E-05 26.7 18.8 338 2-455 1-395 (395) 43 protein:vir:99749 Length: 324 81.8 0.083 5.1E-05 26.5 19.4 299 32-451 1-324 (324) 44 protein:vir:4092 Length: 390 # 81.7 0.083 5.2E-05 26.5 18.3 346 1-449 1-390 (390) 45 protein:vir:104085 Length: 320 81.2 0.088 5.4E-05 26.4 16.3 293 43-448 1-320 (320) 46 protein:vir:96262 Length: 274 80.6 0.094 5.8E-05 26.2 12.6 260 107-457 1-274 (274) 47 protein:vir:95898 Length: 274 80.6 0.094 5.8E-05 26.2 12.6 260 107-457 1-274 (274) 48 protein:vir:7771 Length: 330 # 79.9 0.1 6.2E-05 26.1 17.3 297 49-450 1-330 (330) 49 protein:vir:80376 Length: 435 78.0 0.12 7.3E-05 25.7 17.9 343 1-453 41-435 (435) 50 protein:vir:4856 Length: 293 # 77.0 0.13 8E-05 25.5 16.4 276 54-464 1-293 (293) 51 protein:vir:78223 Length: 333 75.5 0.15 9.1E-05 25.2 16.9 305 47-444 1-333 (333) 52 protein:vir:8420 Length: 477 # 75.3 0.15 9.2E-05 25.1 19.7 359 1-449 66-477 (477) 53 protein:vir:105038 Length: 428 74.2 0.16 0.0001 24.9 16.0 337 1-451 30-428 (428) 54 protein:vir:6242 Length: 390 # 72.8 0.18 0.00011 24.7 12.1 337 1-455 4-390 (390) 55 protein:vir:1638 Length: 298 # 71.7 0.19 0.00012 24.5 18.2 278 71-456 1-298 (298) 56 protein:vir:3870 Length: 400 # 70.3 0.21 0.00013 24.3 15.4 326 1-455 41-400 (400) 57 protein:vir:4339 Length: 395 # 69.1 0.23 0.00014 24.1 18.7 330 1-467 37-395 (395) 58 protein:vir:93742 Length: 274 66.0 0.27 0.00017 23.7 15.7 257 117-445 1-274 (274) 59 protein:vir:80684 Length: 315 65.0 0.29 0.00018 23.5 15.1 294 66-466 1-315 (315) 60 protein:vir:9759 Length: 303 # 63.8 0.31 0.00019 23.4 16.5 280 58-443 1-303 (303) 61 protein:vir:100247 Length: 425 63.4 0.32 0.0002 23.3 14.2 309 1-444 64-425 (425) 62 protein:vir:103955 Length: 324 63.0 0.32 0.0002 23.3 19.7 298 36-451 1-324 (324) 63 protein:vir:94494 Length: 274 62.9 0.33 0.0002 23.2 13.2 257 117-445 1-274 (274) 64 protein:vir:97433 Length: 274 62.9 0.33 0.0002 23.2 13.2 257 117-445 1-274 (274) 65 protein:vir:1025 Length: 408 # 57.7 0.43 0.00027 22.6 21.3 333 1-451 4-408 (408) 66 protein:vir:97148 Length: 324 57.6 0.43 0.00027 22.6 18.5 301 35-450 1-324 (324) 67 protein:vir:105004 Length: 392 56.2 0.46 0.00029 22.4 17.3 318 1-448 35-392 (392) 68 protein:vir:102873 Length: 392 56.2 0.46 0.00029 22.4 17.3 318 1-448 35-392 (392) 69 protein:vir:102082 Length: 392 56.2 0.46 0.00029 22.4 17.3 318 1-448 35-392 (392) 70 protein:vir:107593 Length: 392 56.2 0.46 0.00029 22.4 17.3 318 1-448 35-392 (392) 71 protein:vir:96223 Length: 324 55.6 0.48 0.0003 22.3 20.1 299 35-450 1-324 (324) 72 protein:vir:81160 Length: 371 54.4 0.5 0.00031 22.2 17.9 316 1-467 22-371 (371) 73 protein:vir:2430 Length: 318 # 53.2 0.53 0.00033 22.1 16.0 298 43-448 1-318 (318) 74 protein:vir:5739 Length: 366 # 50.5 0.61 0.00038 21.8 17.1 321 1-451 3-366 (366) 75 protein:vir:105334 Length: 276 49.9 0.63 0.00039 21.7 14.6 266 117-457 1-276 (276) 76 protein:vir:4511 Length: 409 # 49.8 0.63 0.00039 21.7 16.8 334 1-449 41-409 (409) 77 protein:vir:9704 Length: 394 # 48.8 0.66 0.00041 21.6 16.5 324 1-468 30-391 (394) 78 protein:vir:2504 Length: 305 # 47.2 0.71 0.00044 21.4 17.3 282 69-455 1-305 (305) 79 protein:vir:100172 Length: 394 45.2 0.78 0.00048 21.2 15.2 341 1-457 7-394 (394) 80 protein:vir:1239 Length: 274 # 44.6 0.8 0.0005 21.1 13.8 261 109-445 1-274 (274) 81 protein:vir:80930 Length: 278 44.5 0.8 0.0005 21.1 17.8 266 121-456 1-278 (278) 82 protein:vir:3613 Length: 272 # 42.8 0.87 0.00054 20.9 15.7 266 117-467 1-272 (272) 83 protein:vir:8187 Length: 311 # 41.2 0.94 0.00058 20.7 18.8 283 66-444 1-311 (311) 84 protein:vir:9309 Length: 324 # 41.1 0.94 0.00059 20.7 20.4 298 35-450 1-324 (324) 85 protein:vir:102119 Length: 404 39.5 1 0.00063 20.5 16.3 337 1-447 9-404 (404) 86 protein:vir:99920 Length: 311 38.7 1.1 0.00065 20.5 17.9 284 66-443 1-311 (311) 87 protein:vir:96833 Length: 275 36.7 1.2 0.00072 20.2 13.8 257 115-451 1-275 (275) 88 protein:vir:101607 Length: 379 36.2 1.2 0.00074 20.2 18.2 332 1-467 16-379 (379) 89 protein:vir:1383 Length: 421 # 32.8 1.4 0.00087 19.8 17.6 330 2-468 1-391 (421) 90 protein:vir:10364 Length: 390 30.0 1.6 0.001 19.4 17.1 329 1-465 30-390 (390) 91 protein:vir:3991 Length: 404 # 28.6 1.7 0.0011 19.3 20.2 336 1-455 4-404 (404) 92 protein:vir:81070 Length: 390 28.3 1.8 0.0011 19.2 18.8 319 1-465 43-390 (390) 93 protein:vir:1328 Length: 392 # 26.9 1.9 0.0012 19.1 16.6 331 1-455 9-392 (392) 94 protein:vir:97053 Length: 390 26.5 1.9 0.0012 19.0 20.5 324 1-465 32-390 (390) 95 protein:vir:100135 Length: 418 26.5 1.9 0.0012 19.0 18.7 332 1-448 35-418 (418) 96 protein:vir:8102 Length: 543 # 26.1 2 0.0012 19.0 16.4 322 1-455 173-543 (543) 97 protein:vir:101650 Length: 497 25.1 2.1 0.0013 18.8 19.7 359 1-449 53-497 (497) 98 protein:vir:7855 Length: 497 # 25.1 2.1 0.0013 18.8 19.7 359 1-449 53-497 (497) 99 protein:vir:1268 Length: 397 # 23.9 2.2 0.0014 18.7 16.3 321 1-465 39-397 (397) 100 protein:vir:739 Length: 231 # 22.6 2.4 0.0015 18.5 13.1 220 161-467 1-231 (231) 101 protein:vir:96762 Length: 632 22.6 2.4 0.0015 18.5 16.7 328 1-442 260-632 (632) 102 protein:vir:1084 Length: 437 # 21.0 2.7 0.0017 18.2 14.0 326 1-451 65-437 (437) 103 protein:vir:95107 Length: 270 20.2 2.8 0.0017 18.1 15.1 262 117-457 1-270 (270) No 1 >protein:vir:106998 Length: 468 # NCBI annotation: major capsid protein gp23 # Family: family:all:364 # MgeID: mge:1459 # MgeName: S-PM2 # Cross-refs: genbank:acc:YP_195142;genbank:gi:58532919;uniprot:Q5GQN0;genbank:GeneID:3260495 Probab=100.00 E-value=3.2e-250 Score=1388.43 Aligned_cols=468 Identities=89% Similarity=1.351 Sum_probs=458.4 Q ss_pred CcchHHHHHhhhhhhcCCccccccchhhhhhhhhhhhhHHHHHhhhhhhhhhccccccCccccccccccccccccccccc Q lcl|NC_015288. 1 MFNAEHLQEKWSPVLNNEAANPIADRYKKAVTSVLLENQERFLREERGMLQEVAVNSLGAGTVSPGGSALGSANTAGLAG 80 (468) Q Consensus 1 ~~~~~~l~~kw~p~l~~~~~~~i~~~~~~~~~~~llenq~~~~~e~~~~l~e~~~~~~g~~~~~~~~~~~~st~tg~~~~ 80 (468) ||++|+|+|||+|||||||+|+|++.|||+|+++||||||||++|++++|+|.+.+++|.+.....+.+.++++|+++++ T Consensus 1 ~~~~e~l~~kW~plLe~~~~~~i~~~~k~~i~a~llENQe~~~~~~~~~~~~~~~~~~~~~~~~~~n~~~~~~~t~~v~~ 80 (468) T protein:vir:10 1 MFNAEHLQEKWSPVLNHGEAPAIGDRYKRAVTSVLLENQERFLREERGMLNEVAVNSLGAGTIAPAGSALGSANTGGLAG 80 (468) T ss_pred CcchHHHHHhhhHhhcCCccchhccchhhhhhhhhhhhHHHHHhccccccchhhHhhcCCcccchhhhhhhhcccccccc Confidence 99999999999999999999999999999999999999999999999999999999999988888899999999999999 Q ss_pred ccceehhhhHHhhhhhhhhheeeeecCCccceeeeeeeeeecCCCCCccccccccccccccccccccccccccccccccC Q lcl|NC_015288. 81 FDPVLISLVRRAMPNLMAYDVCGVQPMSGPTGLIFAMRSRYENQAGEEALFNEPDAGFTAGLDATTGAYTPRTGAGVGGD 160 (468) Q Consensus 81 ~~P~Lv~l~RRa~~~LIa~DI~GVQPmTGPTGLIFAMRsrY~~qsG~EA~fnEa~t~fSG~~~~~~~~~~~~~~~~~~~~ 160 (468) +||+||+||||++|||||+|||||||||||||||||||+||.+|+|+|+|||||+++|||..+.+.+.....++....++ T Consensus 81 ~~P~Li~l~RRa~p~LIa~DIwGVQPMTgPTGLIFAmRsrY~n~~g~EAf~nEadt~fSg~~~~~~~~~~~~~~~~~~~~ 160 (468) T protein:vir:10 81 FDPVLISLVRRAMPNLMAYDVCGVQPMSGPTGLIFAMRSRYENQAGEEALFNEPDTGFTGGYDASQGDYAVRTGAGVGGD 160 (468) T ss_pred cCchhhhhHHHHHhhhhhhhceeeecCCccceeeeEEEEEecCCCCccceeccccccccccccccccccccccccccccC Confidence 99999999999999999999999999999999999999999999999999999999999998888777777777788888 Q ss_pred cccccccccccccccccccccccchhhhhccCCCCCccccceeEEEEEEEEeecccccceecHHHHHhHHHhhCCChhHH Q lcl|NC_015288. 161 AEGNNPALLNDSSPGTYETPRGFSREDLEQAGDAGKLFREMSFSIEKTSVTAKSRALKAEYTLELAQDLKAIHGLDAEQE 240 (468) Q Consensus 161 ~~g~~~~~~~~a~~g~~t~~~gm~Ta~aE~lG~~g~~f~EMaFsIeK~tVtAKSRaLKAEYTvELAQDLkAiHGLDAE~E 240 (468) ..++++...+.+..++|+++.||+|+++|.||+++++|+||+|+||||+|||||||||||||||||||||||||||||+| T Consensus 161 ~~g~~~~~~~~a~~~~~~~g~gMsTa~aE~lG~~~~~f~EMaFsIeK~tVtAKSRaLKAeYTiELAQDLKAiHGLDAEtE 240 (468) T protein:vir:10 161 SEGNNPALLNDAAPGTYEVGSKMPREDLERMGEANRLFREMSFSIEKTSVTAQSRALKAEYTLELAQDLKAIHGLDAEQE 240 (468) T ss_pred CCCCcccccccccccccccccccchHHHhhcCCCCcccceeeeEEEEEEEeeeccceeccccHHHHHHHHHhcCCChhHH Confidence 89999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHHHHHHHHHHhhHHHHHHHhhhcchhhccccccceeeeeecCCcchhHHHHHHHHHHHHHHHHHHHHHHhccCCccEE Q lcl|NC_015288. 241 LANILSSEVLAEINREVVRRVYSVAKPGAANNVANAGIFDLDVDSNGRWSVEKFKGLLFQIERDCNAIAQDTRRGKGNFL 320 (468) Q Consensus 241 LanILStEImlEINREII~~l~~vA~~~k~~~~~~~Gv~Dl~~~~~~rw~~e~~~~l~~~i~~ean~i~q~T~rg~~n~~ 320 (468) |+||||||||+||||||||+||+||+|||+.+++++|+|||++++||||++|+||+|+|||+||||+|+|||+||+|||| T Consensus 241 LaNILStEImlEINReii~~l~~va~~~k~~g~~~~Gv~d~~~~~~~rw~~e~~k~L~~~i~~ean~i~~~T~rg~gn~i 320 (468) T protein:vir:10 241 LANILSSEVLAEINREVVRRVYTVAKKGAQNNVANAGIFDLDVDSNGRWSVEKFKGLLFQVERDANAIAQETRRGKGNFL 320 (468) T ss_pred HHHHHHHHHHHHhcHHHHHhHhhhhhheecccccccccccccccccchhHHHHHHHHHHHHHHHHHHHHHhhccccccEE Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EEchhHHHHHhhccccccccccccccccccccccCCCceeEEEecCCeEEEEccccccCCCcceEEEEEecCCcccceeE Q lcl|NC_015288. 321 ICSADVASALAMAGVLDYSSGLTGAGGPAIGTVDDTGNLAVGTINGRIKVYVDPYAANLSDKHYYVVGYKGTSPYDAGLF 400 (468) Q Consensus 321 v~S~~va~~L~~~G~~~~~~~~~~~~~~~~~~~D~t~~~~~G~l~~~~~vy~D~Ya~~~s~~dY~~vG~Kg~~~~d~glf 400 (468) |||++||++|+||||+++.|++.++.+..++++|+|+++|+|+|+|||+|||||||+|+||+|||+|||||++++|+||| T Consensus 321 i~S~~Va~~L~~sG~l~~~~~~~~~~~~~~~~~D~tg~~~~G~l~~r~~vy~D~Ya~~~s~~dY~~vG~KG~~~~d~glf 400 (468) T protein:vir:10 321 ICSADVASALAMAGVLDYSSGLNGAGGPSIGEVDDTGNLAVGTINGRIKVFVDPYAANLSDKHYYVIGYKGTSPYDAGLF 400 (468) T ss_pred EechhHHHHHhhcCcceecccccccccccccccccCcceEEEEecCceEEEEccccccCCccceEEEEEecCcceeceee Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred EccccccccccccCCccccceeeeeeecceeecCcccccCcccccCChhhhhhccCceeeeEEeeccC Q lcl|NC_015288. 401 YCPYVPLQMVRSIDPNNFQPKIGFKTRYGMVSNPFVTTNGLYSGTPDGETLTPSTNMYYRRVQVTNLM 468 (468) Q Consensus 401 yaPYv~~~~~~~~Dp~s~qP~~g~~tRY~l~~nPf~~~~~~~~~~~~~~~~~~~~N~y~r~~~v~~~~ 468 (468) |||||||+|++++||+||||++||||||||++|||++..+...++|++++|.+++|+|||||+||||| T Consensus 401 yaPYv~l~~~~~~dp~sfqP~~g~~tRY~l~~NP~~~~~~~~~g~~~~~~~~~~~N~y~r~~~v~~l~ 468 (468) T protein:vir:10 401 YCPYVPLQMVRSIDPNTFQPKIGFKTRYGMVSNPFVTTNGLYNGTPDGEALTPNANMYYRRVQVTNLM 468 (468) T ss_pred eccccccccccccCCCcccceeeeeeeeceeecccceeccccCCCcccccccccccceeeeEEEeccC Confidence 99999999999999999999999999999999999988888888999999999999999999999999 No 2 >protein:vir:104915 Length: 470 # NCBI annotation: T4-like major capsid protein # Family: family:all:364 # MgeID: mge:1630 # MgeName: P-SSM2 # Cross-refs: genbank:acc:YP_214367;genbank:gi:61806007;genbank:GeneID:3294435 Probab=100.00 E-value=2.2e-238 Score=1323.55 Aligned_cols=456 Identities=60% Similarity=0.976 Sum_probs=420.2 Q ss_pred CcchHHHHHhhhhhhcCCccccccchhhhhhhhhhhhhHHHHHhhhhhhhhhccccccCccccccccccccccccccccc Q lcl|NC_015288. 1 MFNAEHLQEKWSPVLNNEAANPIADRYKKAVTSVLLENQERFLREERGMLQEVAVNSLGAGTVSPGGSALGSANTAGLAG 80 (468) Q Consensus 1 ~~~~~~l~~kw~p~l~~~~~~~i~~~~~~~~~~~llenq~~~~~e~~~~l~e~~~~~~g~~~~~~~~~~~~st~tg~~~~ 80 (468) |+++|+|+|||+|||||||+|+|++.|||+|+++||||||+|++|++++|+|.+ +++++.++.++++.+||+|++|++ T Consensus 3 ~~~~e~l~~kw~p~l~~~~~~~i~~~~~~~v~a~l~enq~~~~~~~~~~l~e~~--~~~~~~~~~~~~i~~st~t~~v~~ 80 (470) T protein:vir:10 3 MFNSEYLQEKWAPILDYDGLDPIKDSHRRSVTAVLLENQEKELREERNFLSEAP--NVNTNSGATAGFSADATAAGPVAG 80 (470) T ss_pred cchhHHHHHhhhhhhcCCccchhcchhhhhhhhhhhhhhHHHHhhccchhhhhh--hccccccccccccccccccccccc Confidence 999999999999999999999999999999999999999999999999999986 578888888999999999999999 Q ss_pred ccceehhhhHHhhhhhhhhheeeeecCCccceeeeeeeeeecCCCCCcccccccccccccccccccccccccccc----- Q lcl|NC_015288. 81 FDPVLISLVRRAMPNLMAYDVCGVQPMSGPTGLIFAMRSRYENQAGEEALFNEPDAGFTAGLDATTGAYTPRTGA----- 155 (468) Q Consensus 81 ~~P~Lv~l~RRa~~~LIa~DI~GVQPmTGPTGLIFAMRsrY~~qsG~EA~fnEa~t~fSG~~~~~~~~~~~~~~~----- 155 (468) +||+||+||||++|||||+|||||||||||||||||||+||.+|+|+|+||+||++.|||..++........... T Consensus 81 ~~P~Li~lvRra~p~LIa~DIwGVQPMTgPTGLIFAmRsrY~n~sG~EaffnEA~T~fSG~~~~~~~~~~~~~~~a~~~g 160 (470) T protein:vir:10 81 FDPVLISLIRRSMPNLVAYDLAGVQPMNGPTGLIFAMRSRYKTQSGTEALFNEADTAFSGQPDGLDDTSGFTATGANNVG 160 (470) T ss_pred cCchhhhhHHHHHhhhhhhhhheeecCCccceeeeEEEEEecCCCccceeeecCCcccCccccccccccccccccccccc Confidence 999999999999999999999999999999999999999999999999999999999999776554332211111 Q ss_pred ccccCccccccccc----ccccccccccccccchhhhhccCC-CCCccccceeEEEEEEEEeecccccceecHHHHHhHH Q lcl|NC_015288. 156 GVGGDAEGNNPALL----NDSSPGTYETPRGFSREDLEQAGD-AGKLFREMSFSIEKTSVTAKSRALKAEYTLELAQDLK 230 (468) Q Consensus 156 ~~~~~~~g~~~~~~----~~a~~g~~t~~~gm~Ta~aE~lG~-~g~~f~EMaFsIeK~tVtAKSRaLKAEYTvELAQDLk 230 (468) .....+.++++... ..+..++|+++.||+|+++|.||+ ++++|+||+|+||||+||||||||||||||||||||| T Consensus 161 ~~~~~~~gt~~~~~~~~~~~a~~~~y~~~~GMsTa~aE~lg~s~~~~f~EMaFsIeK~tVtAKSRaLKAeYTiELAQDLK 240 (470) T protein:vir:10 161 LGTTAQQGSNPGLLNSTAAQTNATDYNVGQGMRTDSAEDLGDGTGDQFNQMAFSIEKVTVTAKSRALKAEYSLELAQDLK 240 (470) T ss_pred ccccccccccccccccccccccccccccccccchHHhhhcCCCCCcccceeeeEEEEEEEEeeccceeccccHHHHHHHH Confidence 11112334444333 334456789999999999999996 4678999999999999999999999999999999999 Q ss_pred HhhCCChhHHHHHHHHHHHHHHhhHHHHHHHhhhcchhhccccccceeeeeecCCcchhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_015288. 231 AIHGLDAEQELANILSSEVLAEINREVVRRVYSVAKPGAANNVANAGIFDLDVDSNGRWSVEKFKGLLFQIERDCNAIAQ 310 (468) Q Consensus 231 AiHGLDAE~ELanILStEImlEINREII~~l~~vA~~~k~~~~~~~Gv~Dl~~~~~~rw~~e~~~~l~~~i~~ean~i~q 310 (468) ||||||||+||+||||||||+||||||||+||+||+|||+.+++++|+|||+++++|||++|+||+|+|||+||||+|+| T Consensus 241 AiHGLDAEtELaNILStEImlEINReii~~l~~~a~~~k~~~~~~~Gv~Dl~~~~~gr~~~e~~~~l~~~i~~ean~i~~ 320 (470) T protein:vir:10 241 AIHGLNAEAELANILSTEILAEINREVIRTIYNVAEPGAQANVAAAGTFDLDTDSNGRWSVEKFKGLIFQIERDANAIAQ 320 (470) T ss_pred HhcCCChhHHHHHHHHHHHHHHhcHHHHHHHhhhhhhceeccccccceEEeecccchhHHHHHHHHHHHHHHHHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HhccCCccEEEEchhHHHHHhhccccccccccccccccccccccCCCceeEEEecCCeEEEEccccc--cCCCcceEEEE Q lcl|NC_015288. 311 DTRRGKGNFLICSADVASALAMAGVLDYSSGLTGAGGPAIGTVDDTGNLAVGTINGRIKVYVDPYAA--NLSDKHYYVVG 388 (468) Q Consensus 311 ~T~rg~~n~~v~S~~va~~L~~~G~~~~~~~~~~~~~~~~~~~D~t~~~~~G~l~~~~~vy~D~Ya~--~~s~~dY~~vG 388 (468) ||+||+|||||||++||++|+|||||++.|+..++ +++|+|+++|+|+|+|||+||||||+. +++|+|||+|| T Consensus 321 ~t~r~~~n~~i~S~~Va~~La~sG~l~~~~~~~~~-----~~~D~t~~~~~G~l~~~~~vy~d~y~~~~~~a~~dy~~vG 395 (470) T protein:vir:10 321 RTRRGKGNMILCSADVASALTMAGVLDYTPALNAN-----LNVDDTGNTFAGILQGKYRVYIDPFSASGGAAATQYYVVG 395 (470) T ss_pred hhccccceEEEEchhHHhHhhhccccccccccccc-----cccCCCCceEEEEecCceEEEeeccccccCcccccEEEEE Confidence 99999999999999999999999999999988764 578999999999999999999999987 68999999999 Q ss_pred EecCCcccceeEEccccccccccccCCccccceeeeeeecceeecCcccccCcccccCChhhhhhccCceeeeEEeeccC Q lcl|NC_015288. 389 YKGTSPYDAGLFYCPYVPLQMVRSIDPNNFQPKIGFKTRYGMVSNPFVTTNGLYSGTPDGETLTPSTNMYYRRVQVTNLM 468 (468) Q Consensus 389 ~Kg~~~~d~glfyaPYv~~~~~~~~Dp~s~qP~~g~~tRY~l~~nPf~~~~~~~~~~~~~~~~~~~~N~y~r~~~v~~~~ 468 (468) |||++++|+||||||||||++++++||+||||++||||||||++|||++..++.. ..+++++|||||||+||||| T Consensus 396 ~KG~~~~~~glfy~PYv~l~~~~~~dp~sfqP~~g~~tRY~l~~NP~~~~~~~~~-----~~i~~~~n~y~r~~~v~~l~ 470 (470) T protein:vir:10 396 YKGSSPYDAGLFYCPYVPLQMVRAVGQDTFQPKIGFKTRYGLVENPFSQGTTQGL-----GTLTRNSNRYYRRVKVANLM 470 (470) T ss_pred EecCcceecceeeccccccccCCCCCCccccceeeeeeeeceeecCcccCCCccc-----ccccCCCCceeeEEEeeccC Confidence 9999999999999999999999999999999999999999999999998877653 23667999999999999999 No 3 >protein:vir:104549 Length: 462 # NCBI annotation: gp23 # Family: family:all:364 # MgeID: mge:1548 # MgeName: P-SSM4 # Cross-refs: genbank:acc:YP_214669;genbank:gi:61806310;genbank:GeneID:3294604 Probab=100.00 E-value=1.4e-236 Score=1313.53 Aligned_cols=451 Identities=66% Similarity=1.023 Sum_probs=422.2 Q ss_pred cchHHHHHhhhhhhcCCccccccchhhhhhhhhhhhhHHHHHhhhhhhhhhccccccCcccccccccccccccccccccc Q lcl|NC_015288. 2 FNAEHLQEKWSPVLNNEAANPIADRYKKAVTSVLLENQERFLREERGMLQEVAVNSLGAGTVSPGGSALGSANTAGLAGF 81 (468) Q Consensus 2 ~~~~~l~~kw~p~l~~~~~~~i~~~~~~~~~~~llenq~~~~~e~~~~l~e~~~~~~g~~~~~~~~~~~~st~tg~~~~~ 81 (468) |++|+|+|||+|||||||+|+|++.+||+|+++|||||||||+|++.+|.|+. ++|+++ .++++|++++++ T Consensus 1 ms~~~l~~~w~~~l~~~~~~~i~~~~~~~~~~~~~enq~~~~~~~~~~l~ea~------~~~g~~---~~~~~t~~~~~~ 71 (462) T protein:vir:10 1 MSIQQLQEKWAPVLNHESVPEIKDSYKKGVVAQLLENQENAIREEGQVLNETL------QTTGYT---TGDTATGPVAGF 71 (462) T ss_pred CchHHHHHHhhhhhcccccchhhhhhHHHHHHHHhhhHHHHHHhcccchhccc------cccCCC---cCcccccccccc Confidence 88999999999999999999999999999999999999999999999999974 455544 578889999999 Q ss_pred cceehhhhHHhhhhhhhhheeeeecCCccceeeeeeeeeecC------CCCCcccccccccccccccccccccccccccc Q lcl|NC_015288. 82 DPVLISLVRRAMPNLMAYDVCGVQPMSGPTGLIFAMRSRYEN------QAGEEALFNEPDAGFTAGLDATTGAYTPRTGA 155 (468) Q Consensus 82 ~P~Lv~l~RRa~~~LIa~DI~GVQPmTGPTGLIFAMRsrY~~------qsG~EA~fnEa~t~fSG~~~~~~~~~~~~~~~ 155 (468) ||+||+||||++|||||+|||||||||||||||||||+||.+ |+|+||||||+++.|||..+..........+. T Consensus 72 ~P~Li~l~Rra~p~LIa~DIwGVQPMTgPTGLIFAmRsrY~~~~~~~nq~gtEAlfnEadt~fSg~~~~~~~~~~~~~~~ 151 (462) T protein:vir:10 72 DPVLISLIRRSMPQLIAYDVAGVQPMTGPTGLIFAMRSFYGSERRPANSDFREALFNEPNAGFSGGAGTGLSNYDPTASS 151 (462) T ss_pred cchhhhHHHHHHhhhhhhcceeeecCCcchhhhheeeeeccCCccccccccchhhhccCCcCcccccccccccccccccc Confidence 999999999999999999999999999999999999999975 56899999999999999877766665555566 Q ss_pred ccccCcccccccccccccccccc---cccccchhhhhccCC--CCCccccceeEEEEEEEEeecccccceecHHHHHhHH Q lcl|NC_015288. 156 GVGGDAEGNNPALLNDSSPGTYE---TPRGFSREDLEQAGD--AGKLFREMSFSIEKTSVTAKSRALKAEYTLELAQDLK 230 (468) Q Consensus 156 ~~~~~~~g~~~~~~~~a~~g~~t---~~~gm~Ta~aE~lG~--~g~~f~EMaFsIeK~tVtAKSRaLKAEYTvELAQDLk 230 (468) .......+.++...+++..++++ .+.||+|+++|.||+ ++++|+||+|+||||+||||||||||||||||||||| T Consensus 152 ~~~~~~~g~~~~~~~~~~~g~~~~~~~~~GM~Ta~aE~lg~~s~n~~f~EMaFsIeK~tVtAKSRaLKAEYTiELAQDLK 231 (462) T protein:vir:10 152 SAVNDAEGANPGLLNDSPAGTYEVTGDATGMATATAEALDDSSASTAFREMGFSIEKVTVTAKSRALKAEYSIEMAQDLK 231 (462) T ss_pred ccccccccccceeecCCCccceecccccccccchhccccCCccCCcchhhceeEEEEEEEeeeccceeccccHHHHHHHH Confidence 66677788888888877777665 467999999999985 3568999999999999999999999999999999999 Q ss_pred HhhCCChhHHHHHHHHHHHHHHhhHHHHHHHhhhcchhhccccccceeeeeecCCcchhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_015288. 231 AIHGLDAEQELANILSSEVLAEINREVVRRVYSVAKPGAANNVANAGIFDLDVDSNGRWSVEKFKGLLFQIERDCNAIAQ 310 (468) Q Consensus 231 AiHGLDAE~ELanILStEImlEINREII~~l~~vA~~~k~~~~~~~Gv~Dl~~~~~~rw~~e~~~~l~~~i~~ean~i~q 310 (468) ||||||||+||+||||||||+||||||||+||+||+|||+.|++++|+|||+++++|||++|+||+|+|||+||||+|+| T Consensus 232 AIHGLDAEtELaNILSTEImlEINReii~~l~~~a~~~k~~~~~~~Gv~dl~~~~~gr~~~e~~k~l~~qi~~ean~i~~ 311 (462) T protein:vir:10 232 AIHGLDAESELANILSTEILAEINREVVRTIYVNAVKGAIANTATDGIFDLDVDSNGRWSVEKFKGLLFQIERDSNAIGQ 311 (462) T ss_pred HhcCCChhHHHHHHHHHHHHHHhhHHHHhhhhhhheeeecccccccceeeeccccchHHHHHHHHHHHHHHHHHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HhccCCccEEEEchhHHHHHhhccccccccccccccccccccccCCCceeEEEecCCeEEEEccccccCCCcceEEEEEe Q lcl|NC_015288. 311 DTRRGKGNFLICSADVASALAMAGVLDYSSGLTGAGGPAIGTVDDTGNLAVGTINGRIKVYVDPYAANLSDKHYYVVGYK 390 (468) Q Consensus 311 ~T~rg~~n~~v~S~~va~~L~~~G~~~~~~~~~~~~~~~~~~~D~t~~~~~G~l~~~~~vy~D~Ya~~~s~~dY~~vG~K 390 (468) ||+||+|||||||+|||++|+|||||+++|+..++.++ .++|+++.+|+|+|+|||+||||||++||+|+|||+|||| T Consensus 312 ~t~r~~~n~~i~S~~Va~~La~sG~l~~~p~~~~~~~~--~~~d~~~~~~~G~l~~r~~vy~D~Y~~~ns~~dy~~vG~K 389 (462) T protein:vir:10 312 ETRRGKGNILICSADVASALGMAGVLDYAPGLQGNSAL--TGVDDTSSTLVGTLNGRIKVYVDPYSSNVADKHFYVAGYK 389 (462) T ss_pred HhccccceEEEEchhHHHHhhhccchhccccccccccc--cccccccceeEEEecCceEEEEecccCCCcccceEEEEEe Confidence 99999999999999999999999999999998888664 4799999999999999999999999999999999999999 Q ss_pred cCCcccceeEEccccccccccccCCccccceeeeeeecceeecCcccccCcccccCChhhhhhccCceeeeEEeeccC Q lcl|NC_015288. 391 GTSPYDAGLFYCPYVPLQMVRSIDPNNFQPKIGFKTRYGMVSNPFVTTNGLYSGTPDGETLTPSTNMYYRRVQVTNLM 468 (468) Q Consensus 391 g~~~~d~glfyaPYv~~~~~~~~Dp~s~qP~~g~~tRY~l~~nPf~~~~~~~~~~~~~~~~~~~~N~y~r~~~v~~~~ 468 (468) |++++|+||||||||||++++++||+||||++||||||||++|||+++.++.. +++++++|||||||+||||| T Consensus 390 G~~~~~~glfy~PYv~l~~~~~~dp~sfqP~~g~~tRY~l~~NP~t~~~~~~~-----~~~~~~~n~y~r~~~v~~l~ 462 (462) T protein:vir:10 390 GTSPYDAGLFYCPYVPLQQVRAINPNTFQPKIGFKTRYGMVSNPFSGGLTQGS-----GALTANANKYYRRVQVANLM 462 (462) T ss_pred CCcccccceeeccccccccccccCCccccceeeeeeeeeeeecCCCCCcCCcc-----ccccccCcceeeeEEeeccC Confidence 99999999999999999999999999999999999999999999998877653 46788999999999999999 No 4 >protein:vir:103181 Length: 457 # NCBI annotation: gp135 # Family: family:all:364 # MgeID: mge:1583 # MgeName: Syn9 # Cross-refs: genbank:acc:YP_717802;genbank:gi:113200639;genbank:GeneID:4239190 Probab=100.00 E-value=4.1e-231 Score=1283.64 Aligned_cols=446 Identities=65% Similarity=1.025 Sum_probs=412.3 Q ss_pred cchHHHHHhhhhhhcCCccccccchhhhhhhhhhhhhHHHHHhhhhhhhhhccccccCcccccccccccccccccccccc Q lcl|NC_015288. 2 FNAEHLQEKWSPVLNNEAANPIADRYKKAVTSVLLENQERFLREERGMLQEVAVNSLGAGTVSPGGSALGSANTAGLAGF 81 (468) Q Consensus 2 ~~~~~l~~kw~p~l~~~~~~~i~~~~~~~~~~~llenq~~~~~e~~~~l~e~~~~~~g~~~~~~~~~~~~st~tg~~~~~ 81 (468) |+.|+|+|||+|||||||+|||++.|||+|+++||||||||++|++++|.|+. ++|++. .+|++|++|+++ T Consensus 1 m~~~~l~~~w~~~l~~~~~~~i~~~~~~~~~~~~lenq~~~~~~~~~~l~ea~------~~~g~~---~~s~~t~~v~~~ 71 (457) T protein:vir:10 1 MSFQNLQEKWAPVLEHDSLPEIGDSYKKGVVAQLLENQEKAIAEEGKILTETL------QTTGYT---GGDTVTGPVAGF 71 (457) T ss_pred CchHHHHHHhhHhhccCccchhhhhHHHHHHHHHhhhHHHHHHhccccccccc------cccCCC---cccccccccccc Confidence 89999999999999999999999999999999999999999999999999964 455544 356789999999 Q ss_pred cceehhhhHHhhhhhhhhheeeeecCCccceeeeeeeeeecCCCC------Ccccccccccccccccccccccccccccc Q lcl|NC_015288. 82 DPVLISLVRRAMPNLMAYDVCGVQPMSGPTGLIFAMRSRYENQAG------EEALFNEPDAGFTAGLDATTGAYTPRTGA 155 (468) Q Consensus 82 ~P~Lv~l~RRa~~~LIa~DI~GVQPmTGPTGLIFAMRsrY~~qsG------~EA~fnEa~t~fSG~~~~~~~~~~~~~~~ 155 (468) ||+||+||||++|||||+|||||||||||||||||||+||.++++ +|||||||++.|||..++..... . T Consensus 72 ~P~Li~l~Rra~p~LIa~DIwGVQPmTgPTGLIFAmRsrY~~q~~~~~a~~~EAl~nEadt~fSg~~~~~~~~~-----~ 146 (457) T protein:vir:10 72 DPVLISLIRRSMPQLIAYDIAGVQPMTGPTGLIFAMRTNYGAERNPAAAGYDEAFFNEPNAGFSGGPGAYDPGA-----T 146 (457) T ss_pred cchhhhhhHHHHhhhhhhhcceeecCCCcceeeeeeeeeecCccccccccccceeeeccCcccCcccccccccc-----c Confidence 999999999999999999999999999999999999999999876 79999999999999766544322 2 Q ss_pred ccccCccccccccccccccc---ccccccccchhhhhccCCC--CCccccceeEEEEEEEEeecccccceecHHHHHhHH Q lcl|NC_015288. 156 GVGGDAEGNNPALLNDSSPG---TYETPRGFSREDLEQAGDA--GKLFREMSFSIEKTSVTAKSRALKAEYTLELAQDLK 230 (468) Q Consensus 156 ~~~~~~~g~~~~~~~~a~~g---~~t~~~gm~Ta~aE~lG~~--g~~f~EMaFsIeK~tVtAKSRaLKAEYTvELAQDLk 230 (468) ...+.+.++++...++...+ .++++.||+|+++|.||++ +++|+||+|+||||+||||||||||||||||||||| T Consensus 147 ~~~~~~~gt~~~~~~~~~~~~~~~~~~~~gmsTA~aE~lgd~~~n~~f~EMaFsIeK~tVtAKSRaLKAEYTiELAQDLK 226 (457) T protein:vir:10 147 GVTNDAEGTNPALLNDSPAGTYEQADDATGMSTATVEALDDSTANTAFREMGFSIEKVTVTARARALKAEYSIEMAQDLK 226 (457) T ss_pred ccccccccccccccCccccccccccccccchhhhhhhccCCCCCccchhhheeEEEEEEEeeeccceeccccHHHHHHHH Confidence 23345566677766665554 4578999999999999853 457999999999999999999999999999999999 Q ss_pred HhhCCChhHHHHHHHHHHHHHHhhHHHHHHHhhhcchhhccccccceeeeeecCCcchhHHHHHHHHHHHHHHHHHHHHH Q lcl|NC_015288. 231 AIHGLDAEQELANILSSEVLAEINREVVRRVYSVAKPGAANNVANAGIFDLDVDSNGRWSVEKFKGLLFQIERDCNAIAQ 310 (468) Q Consensus 231 AiHGLDAE~ELanILStEImlEINREII~~l~~vA~~~k~~~~~~~Gv~Dl~~~~~~rw~~e~~~~l~~~i~~ean~i~q 310 (468) ||||||||+||+||||||||+||||||||+||+||+|||++|++++|+|||++++||||++|+||+|+|||+||||+|+| T Consensus 227 AiHGLDAEtELaNILStEImlEINReii~~l~~~a~~~~~~~~~~~gv~dl~~~~~g~~~~e~~k~L~~~i~~ean~i~~ 306 (457) T protein:vir:10 227 AIHGLDAEQELANILSTEILAEINREVVRTIYTNAVAGAQNNTATAGVFDLDVDSNGRWSVEKFKGLLFQIERDANAIGH 306 (457) T ss_pred HhcCCChhHHHHHHHHHHHHHHhhHHHHHhHhhhheeeeccccccceeeeeeccccchhhHHHHHHHHHHHHHHHHHHHH Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred HhccCCccEEEEchhHHHHHhhccccccccccccccccccccccCCCceeEEEecCCeEEEEccccccCCCcceEEEEEe Q lcl|NC_015288. 311 DTRRGKGNFLICSADVASALAMAGVLDYSSGLTGAGGPAIGTVDDTGNLAVGTINGRIKVYVDPYAANLSDKHYYVVGYK 390 (468) Q Consensus 311 ~T~rg~~n~~v~S~~va~~L~~~G~~~~~~~~~~~~~~~~~~~D~t~~~~~G~l~~~~~vy~D~Ya~~~s~~dY~~vG~K 390 (468) ||+||+|||||||++||++|+|||||+++|++.+..+. .++|+++.+|+|+|+|||+|||||||++|||+|||+|||| T Consensus 307 ~T~rg~gn~~i~S~~Va~~L~~sg~l~~~p~~~~~~~~--~~~d~~~~~~~G~l~~r~~vy~D~Ya~~ns~~dy~~vG~K 384 (457) T protein:vir:10 307 QTRRGKGNILICSADVVSALGMAGVLDYTPALNGNNGL--AGVDDTSSTLVGTLNGRIKVYVDPYSANVADKHFYVAGYK 384 (457) T ss_pred hhccccceEEEEchhHHHHHhhcccccccchhhccccc--cccccccceeEEEecCCeEEEEecccccCCccceEEEEEe Confidence 99999999999999999999999999999999998764 5899999999999999999999999999999999999999 Q ss_pred cCCcccceeEEccccccccccccCCccccceeeeeeecceeecCcccccCcccccCChhhhhhccCceeeeEEeeccC Q lcl|NC_015288. 391 GTSPYDAGLFYCPYVPLQMVRSIDPNNFQPKIGFKTRYGMVSNPFVTTNGLYSGTPDGETLTPSTNMYYRRVQVTNLM 468 (468) Q Consensus 391 g~~~~d~glfyaPYv~~~~~~~~Dp~s~qP~~g~~tRY~l~~nPf~~~~~~~~~~~~~~~~~~~~N~y~r~~~v~~~~ 468 (468) |++++|+||||||||||++++++||+||||++||||||||++|||+++.++.. +.++.+.|.||||++|+||| T Consensus 385 G~~~~~~glfy~PYv~l~~~~~~dp~sfqP~~g~~tRY~l~~NP~~~~~~~~~-----~~~~~~~n~~~~rs~vs~ll 457 (457) T protein:vir:10 385 GTSPYDAGLFYCPYVPLQQVRAINPDTFQPKIGFKTRYGMVSNPFAGGLTQGS-----GALTVNANKYYRRVQVANLM 457 (457) T ss_pred CCcceecceeecccccccccCccCCccccceeeeeeeeeeeeccccccccccc-----ccccccchhhcceeeeeecC Confidence 99999999999999999999999999999999999999999999998877653 23456789999999999999 No 5 >protein:vir:106286 Length: 534 # NCBI annotation: gp23 major head protein # Family: family:all:364 # MgeID: mge:1474 # MgeName: Aeh1 # Cross-refs: genbank:acc:NP_944113;genbank:gi:38640157;genbank:GeneID:2658034 Probab=100.00 E-value=1.3e-224 Score=1248.01 Aligned_cols=459 Identities=37% Similarity=0.636 Sum_probs=403.3 Q ss_pred cchHHHHHhhhhhhcCCccccccchhhhhhhhhhhhhHHHHHhhhh--hhhhhccccccC----------------cccc Q lcl|NC_015288. 2 FNAEHLQEKWSPVLNNEAANPIADRYKKAVTSVLLENQERFLREER--GMLQEVAVNSLG----------------AGTV 63 (468) Q Consensus 2 ~~~~~l~~kw~p~l~~~~~~~i~~~~~~~~~~~llenq~~~~~e~~--~~l~e~~~~~~g----------------~~~~ 63 (468) |.+|+|+|||+|||||||+|||++.|||+|+++||||||||++|++ .|.+++++|++| +++| T Consensus 1 ~~~~~l~~kw~p~l~~~~~~~i~~~~~~~~~a~l~enq~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~ea~~~~~~ 80 (534) T protein:vir:10 1 MSKKSLLKKWQPLVESEGMPAIASMKRKDIVARIFENQDEDIAHNEGGVYTDQVVVNSMVDVKGRIEEARLAEANIGGDH 80 (534) T ss_pred CchhHHHHHhHHhhcCCccccccchhhhhhhhhhhhhHHHHHhhhcccccchhhhhhhhhccccchhhcccccccccccc Confidence 9999999999999999999999999999999999999999999985 456776666665 4677 Q ss_pred ccccccc-ccccccccccccceehhhhHHhhhhhhhhheeeeecCCccceeeeeeeeeecCCC----CCcccccc--ccc Q lcl|NC_015288. 64 SPGGSAL-GSANTAGLAGFDPVLISLVRRAMPNLMAYDVCGVQPMSGPTGLIFAMRSRYENQA----GEEALFNE--PDA 136 (468) Q Consensus 64 ~~~~~~~-~st~tg~~~~~~P~Lv~l~RRa~~~LIa~DI~GVQPmTGPTGLIFAMRsrY~~qs----G~EA~fnE--a~t 136 (468) ++++.++ +|++|++|+++||+||+||||++|||||+|||||||||||||||||||+||.+|+ ++||||+| +|+ T Consensus 81 g~~~~~ia~s~~s~~v~~~~P~Li~lvRra~p~LIa~DIwGVQPMTgPTGLIFAMRsrY~n~~~~~s~~EAf~ne~~adt 160 (534) T protein:vir:10 81 GYDATKIASGETSGSITNVGPAVMGLVRRAIPQLIAFDICGVQPMTSSTGQVFTLRAIYGGNSQDANAREAFHPTYGPDA 160 (534) T ss_pred ccccccccccccccccccccchhhhHHHHHHHhhhhhhhheeccCCchhhhheeeeeeecCCCCCccccccccccccccc Confidence 7776665 7889999999999999999999999999999999999999999999999999875 67999999 999 Q ss_pred ccccccccccccccccccccccc------------Cccccccc----------------------ccccccccccccccc Q lcl|NC_015288. 137 GFTAGLDATTGAYTPRTGAGVGG------------DAEGNNPA----------------------LLNDSSPGTYETPRG 182 (468) Q Consensus 137 ~fSG~~~~~~~~~~~~~~~~~~~------------~~~g~~~~----------------------~~~~a~~g~~t~~~g 182 (468) +|||..+...............+ ...++.+. .......+.|+++.| T Consensus 161 ~fSG~~~a~~~~~~~~~~a~~~g~~~~~~~~~~t~~~~Gt~~~~~~~~~~v~~~~~~~~~ag~~~~~~~~~~~~y~~~~g 240 (534) T protein:vir:10 161 DFSGRGAAQDIAVFVRGTAVASGAFAKLHIEAATGVQAGTKTVQFIKDYAVDALPADQTEAGLAYKWLLANGYAVETSSA 240 (534) T ss_pred cccccccccccccccccccccccccccccccccccccccccccccccccccccccCCccccccccccccccccceecccc Confidence 99997655432211111110000 01111110 011123456889999 Q ss_pred cchhhhhccC----CCCCccccceeEEEEEEEEeecccccceecHHHHHhHHHhhCCChhHHHHHHHHHHHHHHhhHHHH Q lcl|NC_015288. 183 FSREDLEQAG----DAGKLFREMSFSIEKTSVTAKSRALKAEYTLELAQDLKAIHGLDAEQELANILSSEVLAEINREVV 258 (468) Q Consensus 183 m~Ta~aE~lG----~~g~~f~EMaFsIeK~tVtAKSRaLKAEYTvELAQDLkAiHGLDAE~ELanILStEImlEINREII 258 (468) |+|+.+|.|| +++++|+||+|+||||+|||||||||||||||||||||||||||||+||+||||||||+||||||| T Consensus 241 m~Ta~AE~lg~~ggs~~~~f~EMsFsIdKvtVtAKSRaLKAEYTiELAQDLKAIHGLDAEtELsNILSTEImlEINReii 320 (534) T protein:vir:10 241 MATAFAELQQGFNGSADNEWNEMSFRIDKQVVEAKSRQLKAQYSIEMAQDLRAVHGLDADSELSSILANEIMHEINREMV 320 (534) T ss_pred cchhhHhhhccCCCCcccchhhcceEEEEEEEeeeccceeccccHHHHHHHHHhcCCChHHHHHHHHHHHHHHHhhHHHH Confidence 9999999984 456789999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHhhhcchhhcccc----ccceeeeeecCCc---chhHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEchhHHHHHh Q lcl|NC_015288. 259 RRVYSVAKPGAANNV----ANAGIFDLDVDSN---GRWSVEKFKGLLFQIERDCNAIAQDTRRGKGNFLICSADVASALA 331 (468) Q Consensus 259 ~~l~~vA~~~k~~~~----~~~Gv~Dl~~~~~---~rw~~e~~~~l~~~i~~ean~i~q~T~rg~~n~~v~S~~va~~L~ 331 (468) |+||+||+|||+.++ +++|+|||.++.| +||++|+||+|++||++|+|+|+|+|+||+|||||||+|||++|+ T Consensus 321 ~~l~~~a~~~k~~~~~~~~~~~G~~d~~~~~~~~~~~~~~e~~~~L~~~i~~~an~i~~~T~rg~~n~~v~S~~Va~~L~ 400 (534) T protein:vir:10 321 LWINATAKVGKTGWTNMHGGKAGVFDFQDTKDIRGARWAGESYKALVVQIDKEANEIARQTGRGQGNFIICSRNVAAALG 400 (534) T ss_pred HHHhhhhheeecccccccccccceeeeeccccccchhHHHHHHHHHHHHHHHHHHHHHHhhccccccEEEEchhHHHHHh Confidence 999999999999985 5789999999999 999999999999999999999999999999999999999999999 Q ss_pred hccccccccccccccccccccccCCCceeEEEecCCeEEEEccccccCCCcceEEEEEecCCcccceeEEcccccccccc Q lcl|NC_015288. 332 MAGVLDYSSGLTGAGGPAIGTVDDTGNLAVGTINGRIKVYVDPYAANLSDKHYYVVGYKGTSPYDAGLFYCPYVPLQMVR 411 (468) Q Consensus 332 ~~G~~~~~~~~~~~~~~~~~~~D~t~~~~~G~l~~~~~vy~D~Ya~~~s~~dY~~vG~Kg~~~~d~glfyaPYv~~~~~~ 411 (468) |+|||++.++...+.+ .++|+++.+|+|+|+|||+||||||+. +|||+|||||++++|+||||||||||+|++ T Consensus 401 ~~g~l~~~~~~~~~~~---~~~d~~~~~~~G~l~~~~~vy~D~y~~----~dy~~vG~KG~~~~~~glfyaPYv~l~~~~ 473 (534) T protein:vir:10 401 HTDMLMTPAVMGANTT---MNTDTTSSLFAGVLAGKYRVYIDQYAV----EDYFTVGYKGASEMDAGLYYCPYVALTPLR 473 (534) T ss_pred hccchhcccccccccc---ccccCCCceEEEEecCceEEEecCCCC----cceEEEEEeCCcccccceeecccccccccc Confidence 9999999998877654 689999999999999999999999865 899999999999999999999999999999 Q ss_pred ccCCccccceeeeeeecceeecCcccccCcc--cccCChh---hhhhccCceeeeEEeecc Q lcl|NC_015288. 412 SIDPNNFQPKIGFKTRYGMVSNPFVTTNGLY--SGTPDGE---TLTPSTNMYYRRVQVTNL 467 (468) Q Consensus 412 ~~Dp~s~qP~~g~~tRY~l~~nPf~~~~~~~--~~~~~~~---~~~~~~N~y~r~~~v~~~ 467 (468) ++||+||||++||||||||++|||++..++. .++.||. +...++|.|||||+|||| T Consensus 474 ~~dp~sfqP~~g~~tRY~l~~NP~~~~~~~~~~~~i~~g~~~~~~~ag~n~~~~~~~Vk~l 534 (534) T protein:vir:10 474 GTDPKNFQPVLGFKTRYGVKLHPMADATQNKGFAKISNGMPQHTNMFGKNAFFRRVLVAGV 534 (534) T ss_pred ccCCccccceeeeeeeeceeecCcccccCCccccccccCCcchhhhcccccceeeeeeecC Confidence 9999999999999999999999999987664 3666642 234589999999999999 No 6 >protein:vir:6901 Length: 522 # NCBI annotation: gp23 major head protein # Family: family:all:364 # MgeID: mge:140 # MgeName: RB69 # Cross-refs: genbank:acc:NP_861877;genbank:gi:32453668;genbank:GeneID:1494303 Probab=100.00 E-value=1.8e-223 Score=1241.72 Aligned_cols=459 Identities=36% Similarity=0.617 Sum_probs=405.7 Q ss_pred CcchHHHHHhhhhhhcCCccccccchhhhhhhhhhhhhHHHHHhhhhhhhhhccccccCc--------cccccccccc-c Q lcl|NC_015288. 1 MFNAEHLQEKWSPVLNNEAANPIADRYKKAVTSVLLENQERFLREERGMLQEVAVNSLGA--------GTVSPGGSAL-G 71 (468) Q Consensus 1 ~~~~~~l~~kw~p~l~~~~~~~i~~~~~~~~~~~llenq~~~~~e~~~~l~e~~~~~~g~--------~~~~~~~~~~-~ 71 (468) |+++|+|+|||+|||||||+|+|.+ +||+|+++|||||||+++|++.|++++++|++|+ |+|++++..+ + T Consensus 4 ~~~~e~l~~kw~p~l~~~~~~~~~~-~~~~~~a~l~enq~~~~~~~~~~~~~~~~~~~~~~l~ea~~~~~~~~~~~~i~e 82 (522) T protein:vir:69 4 IKTKAQLVDKWKELLEGEGLPEIAN-SKQAIIAKIFENQEKDFEVSPEYKDEKIAQAFGSFLTEAEIGGDHGYNAQNIAA 82 (522) T ss_pred cchHHHHHHhhHHHhcCCCCCcccc-chhhhhhhhhhhhhHHhhcccccchhHHHHhhhhhhhhhccccccCCCcccccc Confidence 8999999999999999999999986 6999999999999999999999999999888885 7888887766 7 Q ss_pred cccccccccccceehhhhHHhhhhhhhhheeeeecCCccceeeeeeeeeecCCC----CCccc--ccccccccccccccc Q lcl|NC_015288. 72 SANTAGLAGFDPVLISLVRRAMPNLMAYDVCGVQPMSGPTGLIFAMRSRYENQA----GEEAL--FNEPDAGFTAGLDAT 145 (468) Q Consensus 72 st~tg~~~~~~P~Lv~l~RRa~~~LIa~DI~GVQPmTGPTGLIFAMRsrY~~qs----G~EA~--fnEa~t~fSG~~~~~ 145 (468) |++|++|+++||+||+|+||++|||||+|||||||||||||||||||+||.+|. ++|+| |||+++.|||....+ T Consensus 83 s~~t~~v~~~~P~li~lvrRa~p~LIa~DIwGVQPMTgPTGLIFAMRsrY~~q~~~~~~~eaf~~~neadt~fSG~~~~t 162 (522) T protein:vir:69 83 GQTSGAVTQIGPAVMGMVRRAIPNLIAFDICGVQPMNSPTGQVFALRAVYGKDPIAAGAKEAFHPMYAPDAMFSGQGAAK 162 (522) T ss_pred cccccccccccchHHHHHHHHHhhhhhhhceeeccCCchhhhheeeeeeccCCcccCccccccccccccccccccccccc Confidence 788999999999999999999999999999999999999999999999999875 56777 499999999986654 Q ss_pred ccccccccccccc-----------------------cCccccccc------ccccccccccccccccchhhhhcc---C- Q lcl|NC_015288. 146 TGAYTPRTGAGVG-----------------------GDAEGNNPA------LLNDSSPGTYETPRGFSREDLEQA---G- 192 (468) Q Consensus 146 ~~~~~~~~~~~~~-----------------------~~~~g~~~~------~~~~a~~g~~t~~~gm~Ta~aE~l---G- 192 (468) ............. ....+.++. ....+..+.|+++.||+|+.+|++ | T Consensus 163 ~~~~~~~~~~t~~G~~~~~~~~~~gt~~~~~~a~~t~~~t~~~~~~~~~ai~s~~~~~~~y~~g~GmsTa~aEal~~lgg 242 (522) T protein:vir:69 163 KFPALAASTQTKVGDIYTHFFQETGTVYLQASAQVTISSSADDAAKLDAEIIKQMEAGALVEIAEGMATSIAELQEGFNG 242 (522) T ss_pred cccccccccccccccccccccccccceeeecccCCcCCCCCcccccccchhccccccccceeeccccchhhhhhcccCCC Confidence 4322221111100 011111111 112234567899999999999986 3 Q ss_pred CCCCccccceeEEEEEEEEeecccccceecHHHHHhHHHhhCCChhHHHHHHHHHHHHHHhhHHHHHHHhhhcchhhccc Q lcl|NC_015288. 193 DAGKLFREMSFSIEKTSVTAKSRALKAEYTLELAQDLKAIHGLDAEQELANILSSEVLAEINREVVRRVYSVAKPGAANN 272 (468) Q Consensus 193 ~~g~~f~EMaFsIeK~tVtAKSRaLKAEYTvELAQDLkAiHGLDAE~ELanILStEImlEINREII~~l~~vA~~~k~~~ 272 (468) +++++|+||+|+||||+|||||||||||||||||||||||||||||+||+||||||||+|||||||++|+.+|+.+++.+ T Consensus 243 ss~~~f~EMaFsIeKvTVtAKSRaLKAEYTiELAQDLKAIHGLDAEtELaNILSTEImlEINReii~~i~~sa~~~~~g~ 322 (522) T protein:vir:69 243 STDNPWNEMGFRIDKQVIEAKSRQLKAAYSIELAQDLRAVHGMDADAELSGILATEIMLEINREVVDWINYSAQVGKSGM 322 (522) T ss_pred CcccchhhhcceEeeEEEeeecccccccccHHHHHHHHHhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhhhheeecccc Confidence 45678999999999999999999999999999999999999999999999999999999999999999998888888866 Q ss_pred c----ccceeeeeecCCc---chhHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEchhHHHHHhhccccccccccccc Q lcl|NC_015288. 273 V----ANAGIFDLDVDSN---GRWSVEKFKGLLFQIERDCNAIAQDTRRGKGNFLICSADVASALAMAGVLDYSSGLTGA 345 (468) Q Consensus 273 ~----~~~Gv~Dl~~~~~---~rw~~e~~~~l~~~i~~ean~i~q~T~rg~~n~~v~S~~va~~L~~~G~~~~~~~~~~~ 345 (468) + +++|+|||++++| |||++|+||+|+|||+||||+|+|+|+||+|||||||+|||++|+|+|++++.++...+ T Consensus 323 t~~~~~~~Gv~Dl~~~~~~~~~rw~~e~~k~L~~~i~~~an~i~~~T~rg~~n~~i~S~~Va~~L~~~~~~~~~~~~~~~ 402 (522) T protein:vir:69 323 TNIVGSKAGVFDFQDPIDIRGARWAGESFKALLFQIDKEAVEIARQTGRGEGNFIIASRNVVNVLASVDTGISYAAQGLA 402 (522) T ss_pred ccccccccceeecccccccccchhHHHHHHHHHHHHHHHHHHHHHhcccccccEEEEchhHHHHHhhccccccccccccc Confidence 5 7899999999999 99999999999999999999999999999999999999999999999999999887766 Q ss_pred cccccccccCCCceeEEEecCCeEEEEccccccCCCcceEEEEEecCCcccceeEEccccccccccccCCccccceeeee Q lcl|NC_015288. 346 GGPAIGTVDDTGNLAVGTINGRIKVYVDPYAANLSDKHYYVVGYKGTSPYDAGLFYCPYVPLQMVRSIDPNNFQPKIGFK 425 (468) Q Consensus 346 ~~~~~~~~D~t~~~~~G~l~~~~~vy~D~Ya~~~s~~dY~~vG~Kg~~~~d~glfyaPYv~~~~~~~~Dp~s~qP~~g~~ 425 (468) .+ .++|+++++|+|+|+|||+||||||+ ++|||+|||||++++|+||||||||||+|+|++||+||||++||| T Consensus 403 ~g---~~~d~~~~~~~G~l~~~~~vy~D~y~----~~dy~~vG~KG~~~~~~glfyaPYv~l~~~~~~dp~sfqP~~g~~ 475 (522) T protein:vir:69 403 SG---FNTDTTKSVFAGVLGGKYRVYIDQYA----KQDYFTVGYKGANEMDAGIYYAPYVALTPLRGSDPKNFQPVMGFK 475 (522) T ss_pred cc---ccccCCCceEEEEecCceEEEecCCC----CcceEEEEEeCCcccccceeeccccccccccccCCccccceeeee Confidence 55 57899999999999999999999986 489999999999999999999999999999999999999999999 Q ss_pred eecceeecCcccccCc--ccccCChh-hh--hhccCceeeeEEeecc Q lcl|NC_015288. 426 TRYGMVSNPFVTTNGL--YSGTPDGE-TL--TPSTNMYYRRVQVTNL 467 (468) Q Consensus 426 tRY~l~~nPf~~~~~~--~~~~~~~~-~~--~~~~N~y~r~~~v~~~ 467 (468) |||||++|||++..++ .++++||. +| ..++|+|||||+|||| T Consensus 476 tRY~l~vNP~~~~~~~~~~~ri~~g~p~~~~~~~~n~y~r~v~v~~~ 522 (522) T protein:vir:69 476 TRYGIGVNPFAESSLQAPGARIQSGMPSILNSLGKNAYFRRVYVKGI 522 (522) T ss_pred eeeceeecCcccccCCcccceeecccchhhcccCCcceeeEEEeecC Confidence 9999999999986432 46766664 22 7899999999999999 No 7 >protein:vir:101039 Length: 529 # NCBI annotation: major capsid protein # Family: family:all:364 # MgeID: mge:1582 # MgeName: 44RR2.8t # Cross-refs: genbank:acc:NP_932516;genbank:gi:37651642;genbank:GeneID:2610532 Probab=100.00 E-value=9.1e-223 Score=1237.88 Aligned_cols=455 Identities=36% Similarity=0.618 Sum_probs=398.4 Q ss_pred CcchHHHHHhhhhhhcCCccccccchhhhhhhhhhhhhHHHHHhhhhhhhhhccccccC--------ccccccccc-ccc Q lcl|NC_015288. 1 MFNAEHLQEKWSPVLNNEAANPIADRYKKAVTSVLLENQERFLREERGMLQEVAVNSLG--------AGTVSPGGS-ALG 71 (468) Q Consensus 1 ~~~~~~l~~kw~p~l~~~~~~~i~~~~~~~~~~~llenq~~~~~e~~~~l~e~~~~~~g--------~~~~~~~~~-~~~ 71 (468) -|++|+|+|||+|||||||+|||++.|||+|+++|||||||+++|++.|+++++.|+++ +|+|.+.+. +.+ T Consensus 2 ~~~~~~l~~kw~p~l~~~~~~~i~~~~~~~~~a~l~enq~~~~~~~~~~~~~~~~e~~~~~l~~~~~~~~~~~~~~~i~e 81 (529) T protein:vir:10 2 SLKNKEILNKWTPLLEGEGLPEIAGKNKQALVAQILEAQEKDSKSDPVYRDDKLIEAFGQSLMEAEVAGDHGYDPTNIAA 81 (529) T ss_pred cccHHHHHHHhHHHhcCCccchhccchhhhhhhhhhhhhHHHHhhccccchhhhhhhhhcccchhhcccccccccccccc Confidence 67888999999999999999999999999999999999999999999998888887776 456655554 558 Q ss_pred cccccccccccceehhhhHHhhhhhhhhheeeeecCCccceeeeeeeeeecCCC-------------------------- Q lcl|NC_015288. 72 SANTAGLAGFDPVLISLVRRAMPNLMAYDVCGVQPMSGPTGLIFAMRSRYENQA-------------------------- 125 (468) Q Consensus 72 st~tg~~~~~~P~Lv~l~RRa~~~LIa~DI~GVQPmTGPTGLIFAMRsrY~~qs-------------------------- 125 (468) |++|++|+++||+||+||||++|||||+|||||||||||||||||||+||.++. T Consensus 82 st~t~~v~~~~P~Li~lvRra~p~LIa~DIwGVQPMTgPTGLIFAMRsrY~~~~~~~~~~eaf~~~y~Pda~~sga~~~g 161 (529) T protein:vir:10 82 GQSSGAITNIGPAVIGMVRRAIPSLIAFDIAGVQPMTGPTGQVFALRSVYGKDPLAAGAKEAFHPMYAPDAWHSSLATKG 161 (529) T ss_pred ccccccccccCchhhhhHHHHHhhhhhheeeeeecCCchhhhhhhhheeecCCccccccccccccccccccccccccccc Confidence 899999999999999999999999999999999999999999999999998763 Q ss_pred -----------------------CCccccccccccccccccccccccccccccccccCcccccccccccccccccccccc Q lcl|NC_015288. 126 -----------------------GEEALFNEPDAGFTAGLDATTGAYTPRTGAGVGGDAEGNNPALLNDSSPGTYETPRG 182 (468) Q Consensus 126 -----------------------G~EA~fnEa~t~fSG~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~a~~g~~t~~~g 182 (468) |.|+||+|+++.||+...+..... +.... +..+...........+.++++.| T Consensus 162 a~~~~~~~~~~~~t~~~~~a~~~g~ea~f~ea~t~fs~~~~g~~~~~----g~~~~-~~~~~~~~~~~~a~~~~~~~~~G 236 (529) T protein:vir:10 162 ATTTTDGTPFAKLTAGQAIAEGDIVGHFFYESGTAFLQNVSGASVTV----GTNET-GEALDKLINAAIGEGKLAEIAEG 236 (529) T ss_pred cccccCccccccccccccccccCcceeeeecccceeccccccccccc----Ccccc-Ccccccccccccccccccccccc Confidence 235555555555554322211110 00000 00111112223345677889999 Q ss_pred cchhhhhccC----CCCCccccceeEEEEEEEEeecccccceecHHHHHhHHHhhCCChhHHHHHHHHHHHHHHhhHHHH Q lcl|NC_015288. 183 FSREDLEQAG----DAGKLFREMSFSIEKTSVTAKSRALKAEYTLELAQDLKAIHGLDAEQELANILSSEVLAEINREVV 258 (468) Q Consensus 183 m~Ta~aE~lG----~~g~~f~EMaFsIeK~tVtAKSRaLKAEYTvELAQDLkAiHGLDAE~ELanILStEImlEINREII 258 (468) |+|+.+|+|| +++++|+||+|+||||+|||||||||||||||||||||||||||||+||+||||+|||+||||||| T Consensus 237 m~Ta~aEaL~~~g~ss~~~f~EMaFsIeK~tVtAKSRaLKAEYTiELAQDLKAVHGLDAEtELsNILStEImlEINReii 316 (529) T protein:vir:10 237 MATSIAELRQGFNGSNDNPWNEMSFRIDKQTVEAKSRQLKAQYSIELAQDLRAVHGMDADSELNGILANEVMLEINREVI 316 (529) T ss_pred cchhhhhccccCCCcccccccceeeEEEEEEEeeeccceeccccHHHHHHHHHhcCCChHHHHHHHHHHHHHHHhhHHHH Confidence 9999999994 356789999999999999999999999999999999999999999999999999999999999999 Q ss_pred HHHhhhcchhhcccc----ccceeeeeecCCc---chhHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEchhHHHHHh Q lcl|NC_015288. 259 RRVYSVAKPGAANNV----ANAGIFDLDVDSN---GRWSVEKFKGLLFQIERDCNAIAQDTRRGKGNFLICSADVASALA 331 (468) Q Consensus 259 ~~l~~vA~~~k~~~~----~~~Gv~Dl~~~~~---~rw~~e~~~~l~~~i~~ean~i~q~T~rg~~n~~v~S~~va~~L~ 331 (468) |+||+||+|||+.|+ +++|+|||+++.| +||++|+||+|+|||++|+|+|+|+|+||+|||||||++||++|+ T Consensus 317 ~~l~~~a~~~k~~g~~~~~~~~Gv~d~~~~~~~~~~~~~~e~~k~L~~~i~~~an~I~~~T~rg~~n~vi~S~~Va~~L~ 396 (529) T protein:vir:10 317 DWINYTAQVGKSGWTKTDGSASGVFDFQDPIDVRGARWAGESYKALLIQIDKEANEIARQTGRGAGNFIIASRNVVSALA 396 (529) T ss_pred HhHhhhhhhhhcccccccccccceeecccCccccccchHHHHHHHHHHHHHHHHHHHHHhhccccceEEEEchHHHHHHH Confidence 999999999999988 7889999999876 999999999999999999999999999999999999999999999 Q ss_pred hccccccccccccccccccccccCCCceeEEEecCCeEEEEccccccCCCcceEEEEEecCCcccceeEEcccccccccc Q lcl|NC_015288. 332 MAGVLDYSSGLTGAGGPAIGTVDDTGNLAVGTINGRIKVYVDPYAANLSDKHYYVVGYKGTSPYDAGLFYCPYVPLQMVR 411 (468) Q Consensus 332 ~~G~~~~~~~~~~~~~~~~~~~D~t~~~~~G~l~~~~~vy~D~Ya~~~s~~dY~~vG~Kg~~~~d~glfyaPYv~~~~~~ 411 (468) |+|++++++......+ .++|+++.+|+|+|+|||+||||||+ ++|||+|||||++++|+|||||||||++|+| T Consensus 397 ~~~~~~~~~~~~~~sg---~~~d~~~~~~~G~l~~~~~vy~D~y~----~~dy~~vG~KG~~~~~~glfy~PYv~l~~~~ 469 (529) T protein:vir:10 397 LIDTNISPAAQGMASG---LNADTTKGVFAGILGGRYKVYIDQYA----RQDYFTMGYRGANNLDAGIYYCPYVALTPLR 469 (529) T ss_pred hhhhhccccccccccc---cccccCCceEEEEecCceEEEecCCC----CcceEEEEEeCCcccccceeecccccccccc Confidence 9999998876555444 47899999999999999999999986 4899999999999999999999999999999 Q ss_pred ccCCccccceeeeeeecceeecCcccccCc--ccccCChhhhhh--ccCceeeeEEeecc Q lcl|NC_015288. 412 SIDPNNFQPKIGFKTRYGMVSNPFVTTNGL--YSGTPDGETLTP--STNMYYRRVQVTNL 467 (468) Q Consensus 412 ~~Dp~s~qP~~g~~tRY~l~~nPf~~~~~~--~~~~~~~~~~~~--~~N~y~r~~~v~~~ 467 (468) ++||+||||++||||||||++|||+++.++ +.++++|.+|.+ +.|.|||||+|||| T Consensus 470 ~~dp~sfqP~~g~~tRY~l~~NP~~~~~~~~~~~r~~~g~~~~~~ag~n~~~r~~~Vk~l 529 (529) T protein:vir:10 470 GSDPKNFQPVMGFKTRYAIGVNPFAESRTQAPQGRITSGMPGVNSVGKNAYFRRVWVKGL 529 (529) T ss_pred ccCCCcccceeeeeeeeceeecCccccccccccccccCCcchhhhcCccceeEEeeeccC Confidence 999999999999999999999999987655 578999988876 57899999999999 No 8 >protein:vir:103463 Length: 521 # NCBI annotation: major head subunit precursor # Family: family:all:364 # MgeID: mge:1542 # MgeName: RB32 # Cross-refs: genbank:acc:YP_803115;genbank:gi:116326395;genbank:GeneID:4405492 Probab=100.00 E-value=1.5e-222 Score=1236.67 Aligned_cols=459 Identities=37% Similarity=0.627 Sum_probs=407.8 Q ss_pred CcchHHHHHhhhhhhcCCccccccchhhhhhhhhhhhhHHHHHhhhhhhhhhccccccCc--------cccccccccc-c Q lcl|NC_015288. 1 MFNAEHLQEKWSPVLNNEAANPIADRYKKAVTSVLLENQERFLREERGMLQEVAVNSLGA--------GTVSPGGSAL-G 71 (468) Q Consensus 1 ~~~~~~l~~kw~p~l~~~~~~~i~~~~~~~~~~~llenq~~~~~e~~~~l~e~~~~~~g~--------~~~~~~~~~~-~ 71 (468) |+++|+|+|||+|||||||+|+|++ +||+|+++|||||||+++|++.|+++++.|++|. ++|.+++.++ + T Consensus 3 ~~~~~~l~~kw~p~l~~~~~~~i~~-~~~~~~a~~~enq~~~~~~~~~~~~~~~~~~~~~~l~e~~~~~~~~~~~~~i~e 81 (521) T protein:vir:10 3 IKTKAELLNKWKPLLEGEGLPEIAN-SKQAIIAKIFENQEKDFQTAPEYKDEKIAQAFGSFLTEAEIGGDHGYNATNIAA 81 (521) T ss_pred cchhHHHHHhhhhhhccCCCCcccc-chhhhhhhhhhhhhhhhhhccccchhHHHHHHhhhhhhhcccCccccccccccc Confidence 9999999999999999999999987 5999999999999999999999999999888874 6777776655 7 Q ss_pred cccccccccccceehhhhHHhhhhhhhhheeeeecCCccceeeeeeeeeecCCC----CCccccc--ccccccccccccc Q lcl|NC_015288. 72 SANTAGLAGFDPVLISLVRRAMPNLMAYDVCGVQPMSGPTGLIFAMRSRYENQA----GEEALFN--EPDAGFTAGLDAT 145 (468) Q Consensus 72 st~tg~~~~~~P~Lv~l~RRa~~~LIa~DI~GVQPmTGPTGLIFAMRsrY~~qs----G~EA~fn--Ea~t~fSG~~~~~ 145 (468) |++|++|+++||+||+||||++|||||+|||||||||||||||||||+||.+|. ++|+|++ ++++.|||..+.+ T Consensus 82 s~~t~~v~~~~P~Li~lvRra~p~LIa~DIwGVQPMTgPTGLIFAMRsrY~~q~~~~~g~eaf~~~~~ada~fSG~~~at 161 (521) T protein:vir:10 82 GQTSGAVTQIGPAVMGMVRRAIPNLIAFDICGVQPMNSPTGQVFALRAVYGKDPIAAGAKEAFHPMYGPDAMFSGQGAAK 161 (521) T ss_pred cccccccccCCchhhhHHHHHHhhhhhhhceeeccCCchhhhheeeeeeccCCccccccccccchhcccccccccccccc Confidence 889999999999999999999999999999999999999999999999999985 6688876 4899999987655 Q ss_pred ccccccccccccccC-----------------------ccccccccc------ccccccccccccccchhhhhccC---- Q lcl|NC_015288. 146 TGAYTPRTGAGVGGD-----------------------AEGNNPALL------NDSSPGTYETPRGFSREDLEQAG---- 192 (468) Q Consensus 146 ~~~~~~~~~~~~~~~-----------------------~~g~~~~~~------~~a~~g~~t~~~gm~Ta~aE~lG---- 192 (468) .............++ ..++++... .....+.|+++.||+|+.+|+|+ T Consensus 162 ~~s~~~~~~~~~~Gd~~~~~~~~~g~~~~~~~~~~t~~~t~~d~~~~~~~~~~~~~~~~~y~~~~GmsTa~aEal~~~g~ 241 (521) T protein:vir:10 162 KFAALAASTQTTVGDIYTHFFQDTGTVYLQASAQVTISSTADDAAKLDAEIKKQMEAGALVEIAEGMATSIAELQESFNG 241 (521) T ss_pred ccccccccccccccccccccccccccceecccccccCCCcccccccccccccccccccceeecccccchhhHhhhccCCC Confidence 433222211111111 111111111 12345678899999999999883 Q ss_pred CCCCccccceeEEEEEEEEeecccccceecHHHHHhHHHhhCCChhHHHHHHHHHHHHHHhhHHHHHHHhhhcchhhccc Q lcl|NC_015288. 193 DAGKLFREMSFSIEKTSVTAKSRALKAEYTLELAQDLKAIHGLDAEQELANILSSEVLAEINREVVRRVYSVAKPGAANN 272 (468) Q Consensus 193 ~~g~~f~EMaFsIeK~tVtAKSRaLKAEYTvELAQDLkAiHGLDAE~ELanILStEImlEINREII~~l~~vA~~~k~~~ 272 (468) +++++|+||+|+||||+|||||||||||||||||||||||||||||+||+||||||||+|||||||++|+.+|+.++..+ T Consensus 242 ss~~~f~EMaFsIeKvtVtAKSRaLKAEYTiELAQDLKAVHGLDAEtELaNILSTEImlEINReii~~i~~sa~~~~~g~ 321 (521) T protein:vir:10 242 STDNPWNEMGFRIDKQVIEAKSRQLKAAYSIELAQDLRAVHGMDADAELSGILATEIMLEINREVVDWINYSAQVGKSGM 321 (521) T ss_pred CccccccceeeEEEEEEEeeeccceeccccHHHHHHHHHhcCCChHHHHHHHHHHHHHHHhhHHHhhhhhheeeeeeeee Confidence 45678999999999999999999999999999999999999999999999999999999999999999888888887766 Q ss_pred c----ccceeeeeecCCc---chhHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEchhHHHHHhhccccccccccccc Q lcl|NC_015288. 273 V----ANAGIFDLDVDSN---GRWSVEKFKGLLFQIERDCNAIAQDTRRGKGNFLICSADVASALAMAGVLDYSSGLTGA 345 (468) Q Consensus 273 ~----~~~Gv~Dl~~~~~---~rw~~e~~~~l~~~i~~ean~i~q~T~rg~~n~~v~S~~va~~L~~~G~~~~~~~~~~~ 345 (468) + +++|+|||+++.| +||++|+||+|+|||+||||+|+|+|+||+|||||||+|||++|+|+|.+++.++...+ T Consensus 322 t~~~~~~~G~~d~~~~~d~~~~~~~~e~~k~L~~~i~~~an~i~~~T~r~~~n~~i~S~~Va~~L~~~~~~~~~~~~~~~ 401 (521) T protein:vir:10 322 TLTPGSKAGVFDFQDPIDIRGARWAGESFKALLFQIDKEAVEIARQTGRGEGNFIIASRNVVNVLASVDTGISYAAQGLA 401 (521) T ss_pred eeccCccccceecccccccccchHHHHHHHHHHHHHHHHHHHHHHhcccccceEEEEchHHHHHHhhccccccccccccc Confidence 6 6799999999998 99999999999999999999999999999999999999999999999999999887766 Q ss_pred cccccccccCCCceeEEEecCCeEEEEccccccCCCcceEEEEEecCCcccceeEEccccccccccccCCccccceeeee Q lcl|NC_015288. 346 GGPAIGTVDDTGNLAVGTINGRIKVYVDPYAANLSDKHYYVVGYKGTSPYDAGLFYCPYVPLQMVRSIDPNNFQPKIGFK 425 (468) Q Consensus 346 ~~~~~~~~D~t~~~~~G~l~~~~~vy~D~Ya~~~s~~dY~~vG~Kg~~~~d~glfyaPYv~~~~~~~~Dp~s~qP~~g~~ 425 (468) .+ .++|+|+++|+|+|+|||+||||||+. +|||+|||||++++|+||||||||||+|+|++||+||||++||| T Consensus 402 ~g---~~~d~~~~~~~G~l~~~~~vy~D~y~~----~dy~~vG~KG~~~~~~glfyaPYv~l~~~~~~dp~sfqP~~g~~ 474 (521) T protein:vir:10 402 TG---FNTDTTKSVFAGVLGGKYRVYIDQYAK----QDYFTVGYKGPNEMDAGIYYAPYVALTPLRGSDPKNFQPVMGFK 474 (521) T ss_pred cc---ccccCCCceEEEEecCceEEEecCCCC----cceEEEEEeCCcccccceeeccccccccccccCCccccceeeee Confidence 54 578999999999999999999999864 89999999999999999999999999999999999999999999 Q ss_pred eecceeecCcccccCc-ccccCChhhhhh----ccCceeeeEEeecc Q lcl|NC_015288. 426 TRYGMVSNPFVTTNGL-YSGTPDGETLTP----STNMYYRRVQVTNL 467 (468) Q Consensus 426 tRY~l~~nPf~~~~~~-~~~~~~~~~~~~----~~N~y~r~~~v~~~ 467 (468) |||||++|||+++.++ +.+.+++++|++ ++|.|||||+|||| T Consensus 475 tRY~l~~NP~~~~~~~~~~~~i~~~~~~~~a~~~~~sy~r~v~v~~l 521 (521) T protein:vir:10 475 TRYGIGINPFAESAAQAPASRIQSGMPSILNSLGKNAYFRRVYVKGI 521 (521) T ss_pred eeeceeecCcccccCCccceeecccchhhhccccccceeeeeeecCC Confidence 9999999999998655 467889999866 88999999999999 No 9 >protein:vir:7214 Length: 521 # NCBI annotation: gp23 major head protein # Family: family:all:364 # MgeID: mge:142 # MgeName: T4 # Cross-refs: genbank:acc:NP_049787;genbank:gi:9632597;genbank:GeneID:1258751 Probab=100.00 E-value=1.6e-222 Score=1236.60 Aligned_cols=459 Identities=36% Similarity=0.621 Sum_probs=407.0 Q ss_pred CcchHHHHHhhhhhhcCCccccccchhhhhhhhhhhhhHHHHHhhhhhhhhhccccccCc--------ccccccccc-cc Q lcl|NC_015288. 1 MFNAEHLQEKWSPVLNNEAANPIADRYKKAVTSVLLENQERFLREERGMLQEVAVNSLGA--------GTVSPGGSA-LG 71 (468) Q Consensus 1 ~~~~~~l~~kw~p~l~~~~~~~i~~~~~~~~~~~llenq~~~~~e~~~~l~e~~~~~~g~--------~~~~~~~~~-~~ 71 (468) |+++|+|+|||+|||||||+|+|++ +||+|+++|||||||+++|++.|+++++.|+++. ++|..++.+ .+ T Consensus 3 ~~~~~~l~~kw~p~l~~~~~~~i~~-~~~~~~a~~~enq~~~~~~~~~~~~~~~~~~~~~~l~e~~~~~~~~~~~~~iae 81 (521) T protein:vir:72 3 IKTKAELLNKWKPLLEGEGLPEIAN-SKQAIIAKIFENQEKDFQTAPEYKDEKIAQAFGSFLTEAEIGGDHGYNATNIAA 81 (521) T ss_pred cchhHHHHHhhhhhhccCCCCcccc-chhhhhhhhhhhhhhhhhhcccccchHHHHHHhhhhhhhcccCccccCcccccc Confidence 9999999999999999999999987 5999999999999999999999999988877763 566666654 48 Q ss_pred cccccccccccceehhhhHHhhhhhhhhheeeeecCCccceeeeeeeeeecCCC----CCcccccc--cccccccccccc Q lcl|NC_015288. 72 SANTAGLAGFDPVLISLVRRAMPNLMAYDVCGVQPMSGPTGLIFAMRSRYENQA----GEEALFNE--PDAGFTAGLDAT 145 (468) Q Consensus 72 st~tg~~~~~~P~Lv~l~RRa~~~LIa~DI~GVQPmTGPTGLIFAMRsrY~~qs----G~EA~fnE--a~t~fSG~~~~~ 145 (468) |++|++|+++||+||+||||++|||||+|||||||||||||||||||+||.+|+ |+|+||+| +++.|||..+.. T Consensus 82 s~~t~~v~~~~P~Li~lvRra~p~LIa~DIwGVQPMTgPTGLIFAMRsrY~~q~~~~~g~ea~~~e~~~da~fSG~~~~~ 161 (521) T protein:vir:72 82 GQTSGAVTQIGPAVMGMVRRAIPNLIAFDICGVQPMNSPTGQVFALRAVYGKDPVAAGAKEAFHPMYGPDAMFSGQGAAK 161 (521) T ss_pred cccccccccCCchhhhHHHHHHhhhhhhhceeeccCCchhhhheeeeeeecCCCCCcccccccchhcccccccccccccc Confidence 899999999999999999999999999999999999999999999999999885 78999987 788999987654 Q ss_pred ccccccccccccccC----------------------cccc------cccc-cccccccccccccccchhhhhccC---- Q lcl|NC_015288. 146 TGAYTPRTGAGVGGD----------------------AEGN------NPAL-LNDSSPGTYETPRGFSREDLEQAG---- 192 (468) Q Consensus 146 ~~~~~~~~~~~~~~~----------------------~~g~------~~~~-~~~a~~g~~t~~~gm~Ta~aE~lG---- 192 (468) .............++ ..+. ++.. .+....+.|+++.||+|+.+|+++ T Consensus 162 ~~~~~~~~~~~a~Gd~~~~~~~~~gt~~~~~~~~~~~~~g~t~~~~t~~~v~~~~~a~~~y~~g~gm~Ta~aEal~~~g~ 241 (521) T protein:vir:72 162 KFPALAASTQTTVGDIYTHFFQETGTVYLQASVQVTIDAGATDAAKLDAEIKKQMEAGALVEIAEGMATSIAELQEGFNG 241 (521) T ss_pred cccccccccccccccccccccccccccccccccccccCCCCCCccccccccccccccCceeeeecccchhhhhhhcccCC Confidence 332222211111110 0011 1111 112235678899999999999863 Q ss_pred CCCCccccceeEEEEEEEEeecccccceecHHHHHhHHHhhCCChhHHHHHHHHHHHHHHhhHHHHHHHhhhcchhhccc Q lcl|NC_015288. 193 DAGKLFREMSFSIEKTSVTAKSRALKAEYTLELAQDLKAIHGLDAEQELANILSSEVLAEINREVVRRVYSVAKPGAANN 272 (468) Q Consensus 193 ~~g~~f~EMaFsIeK~tVtAKSRaLKAEYTvELAQDLkAiHGLDAE~ELanILStEImlEINREII~~l~~vA~~~k~~~ 272 (468) +++++|+||+|+||||+|||||||||||||||||||||||||||||+||+||||||||+|||||||++|+.+|+.++..+ T Consensus 242 ss~~~f~EMaFsIeK~tVtAKSRaLKAEYTiELAQDLKAVHGLDAEtELaNILSTEImlEINReii~~i~~sa~~g~~g~ 321 (521) T protein:vir:72 242 STDNPWNEMGFRIDKQVIEAKSRQLKAAYSIELAQDLRAVHGMDADAELSGILATEIMLEINREVVDWINYSAQVGKSGM 321 (521) T ss_pred cccccccceeeEEEEEEEeeeccceeccccHHHHHHHHHhcCCChHHHHHHHHHHHHHHHhhHHHhhhhhheeeeeeeee Confidence 35678999999999999999999999999999999999999999999999999999999999999999888888887766 Q ss_pred c----ccceeeeeecCCc---chhHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEchhHHHHHhhccccccccccccc Q lcl|NC_015288. 273 V----ANAGIFDLDVDSN---GRWSVEKFKGLLFQIERDCNAIAQDTRRGKGNFLICSADVASALAMAGVLDYSSGLTGA 345 (468) Q Consensus 273 ~----~~~Gv~Dl~~~~~---~rw~~e~~~~l~~~i~~ean~i~q~T~rg~~n~~v~S~~va~~L~~~G~~~~~~~~~~~ 345 (468) + +++|+|||+++.| +||++|+||+|+|||+||||+|+|+|+||+|||||||+|||++|+|+|.+++.++...+ T Consensus 322 t~~~~~~~G~~d~~~~~d~~~~~~~~e~~k~L~~~i~~~an~i~~~T~r~~~n~~i~S~~Va~~L~~~~~~~~~~~~~~~ 401 (521) T protein:vir:72 322 TLTPGSKAGVFDFQDPIDIRGARWAGESFKALLFQIDKEAVEIARQTGRGEGNFIIASRNVVNVLASVDTGISYAAQGLA 401 (521) T ss_pred eeccCccccceecccccccccchHHHHHHHHHHHHHHHHHHHHHHhcccccceEEEEchHHHHHHhhccccccccccccc Confidence 6 6799999999998 99999999999999999999999999999999999999999999999999999887766 Q ss_pred cccccccccCCCceeEEEecCCeEEEEccccccCCCcceEEEEEecCCcccceeEEccccccccccccCCccccceeeee Q lcl|NC_015288. 346 GGPAIGTVDDTGNLAVGTINGRIKVYVDPYAANLSDKHYYVVGYKGTSPYDAGLFYCPYVPLQMVRSIDPNNFQPKIGFK 425 (468) Q Consensus 346 ~~~~~~~~D~t~~~~~G~l~~~~~vy~D~Ya~~~s~~dY~~vG~Kg~~~~d~glfyaPYv~~~~~~~~Dp~s~qP~~g~~ 425 (468) .+ .++|+|+++|+|+|+|||+||||||+. +|||+|||||++++|+||||||||||+|+|++||+||||++||| T Consensus 402 ~g---~~~d~~~~~~~G~l~~~~~vy~D~y~~----~dy~~vG~KG~~~~~~glfyaPYv~l~~~~~~dp~sfqP~~g~~ 474 (521) T protein:vir:72 402 TG---FSTDTTKSVFAGVLGGKYRVYIDQYAK----QDYFTVGYKGPNEMDAGIYYAPYVALTPLRGSDPKNFQPVMGFK 474 (521) T ss_pred cc---ccccCCCceEEEEccCceEEEecCCCC----cceEEEEEeCCcccccceeeccccccccccccCCccccceeeee Confidence 55 578999999999999999999999964 89999999999999999999999999999999999999999999 Q ss_pred eecceeecCcccccCc-ccccCChhhhhh----ccCceeeeEEeecc Q lcl|NC_015288. 426 TRYGMVSNPFVTTNGL-YSGTPDGETLTP----STNMYYRRVQVTNL 467 (468) Q Consensus 426 tRY~l~~nPf~~~~~~-~~~~~~~~~~~~----~~N~y~r~~~v~~~ 467 (468) |||||++|||++..++ +++.+++++|++ ++|.|||||+|||| T Consensus 475 tRY~l~~NP~~~~~~~~~a~~i~~~~~~~~a~~~~~sy~r~v~v~~l 521 (521) T protein:vir:72 475 TRYGIGINPFAESAAQAPASRIQSGMPSILNSLGKNAYFRRVYVKGI 521 (521) T ss_pred eeeceeecCcccccCcccceeecCcChhhhcCccccceeeeeeecCC Confidence 9999999999998654 588899999988 89999999999999 No 10 >protein:vir:80986 Length: 528 # NCBI annotation: gp23 major head protein # Family: family:all:364 # MgeID: mge:1888 # MgeName: Phi1 # Cross-refs: genbank:acc:YP_001469506;genbank:gi:157311463;genbank:GeneID:5602119 Probab=100.00 E-value=9e-222 Score=1232.44 Aligned_cols=460 Identities=37% Similarity=0.593 Sum_probs=404.2 Q ss_pred CcchHHHHHhhhhhhcCCccccccchhhhhhhhhhhhhHHHHHhhhhhhhhhccccccC--------ccccccccccc-c Q lcl|NC_015288. 1 MFNAEHLQEKWSPVLNNEAANPIADRYKKAVTSVLLENQERFLREERGMLQEVAVNSLG--------AGTVSPGGSAL-G 71 (468) Q Consensus 1 ~~~~~~l~~kw~p~l~~~~~~~i~~~~~~~~~~~llenq~~~~~e~~~~l~e~~~~~~g--------~~~~~~~~~~~-~ 71 (468) |+++|+|+|||+|||||||+|||++.|||+|+++|||||||+++|++.|++++++|++| +|+|++++..+ | T Consensus 1 ~~~~~~l~~kw~p~l~~~~~~~i~~~~~~~~~a~llenq~~~~~~~~~~~~~~~~~~~~~~l~ea~~~~~~~~~~~~i~e 80 (528) T protein:vir:80 1 MKTTKELMEKWSPLLENEKLPEIATASKQKLVAKILESQEADFAVDPIYKDEKVVEAFGGFIAEAEVAGDHGYDASQIAA 80 (528) T ss_pred CcchHHHHHhhhHhhcCCccchhcchhhhhhhhhhhhhhhHHhhccccccchHHHHhhhhhccccccccccCCccccccc Confidence 99999999999999999999999999999999999999999999999999999888887 46787777655 8 Q ss_pred cccccccccccceehhhhHHhhhhhhhhheeeeecCCccceeeeeeeeeecCCC----CCccc--ccccccccccccccc Q lcl|NC_015288. 72 SANTAGLAGFDPVLISLVRRAMPNLMAYDVCGVQPMSGPTGLIFAMRSRYENQA----GEEAL--FNEPDAGFTAGLDAT 145 (468) Q Consensus 72 st~tg~~~~~~P~Lv~l~RRa~~~LIa~DI~GVQPmTGPTGLIFAMRsrY~~qs----G~EA~--fnEa~t~fSG~~~~~ 145 (468) |++|++|+++||+||+||||++|||||+|||||||||||||||||||+||.++. ++|+| ++++++.||+..+.. T Consensus 81 s~~t~~v~~~~P~Li~lvRra~p~LIa~DIwGVQPMTgPTGLIFAMRsrY~~~~~~~~~~ea~~~~~~~da~fS~~~t~~ 160 (528) T protein:vir:80 81 GQTTGAITNVGPAVIGMVRRAIPNLIAFDICGVQPMSTPTSQIFAIRSVYGPNPLASQAKEAFHPMYAPDAFHSSLAAKG 160 (528) T ss_pred cccccccccCCchhhhHHHHHHhhhhhhhhheeccCCchhhhheeeeeeecCCccccccccccccccccccccccccccc Confidence 899999999999999999999999999999999999999999999999999874 45665 457888887754432 Q ss_pred ccccccccc--------------c------------------------ccccCcccccccccccccccccccccccchhh Q lcl|NC_015288. 146 TGAYTPRTG--------------A------------------------GVGGDAEGNNPALLNDSSPGTYETPRGFSRED 187 (468) Q Consensus 146 ~~~~~~~~~--------------~------------------------~~~~~~~g~~~~~~~~a~~g~~t~~~gm~Ta~ 187 (468) ......... . ...++..++..........+.|+++.||+|+. T Consensus 161 ~a~~~ea~t~fs~~~~~~~~~~G~~~~~t~~~tg~~~~~~~~~~~~~~~~~gt~~~~~~~~~~~~~~~~~~~~~Gm~Ta~ 240 (528) T protein:vir:80 161 AAVGSPTGTPFAKLAIGTQIEAGDIVHHTFAETGIAYLQNVTAEQVTPTKAGSESEDEVVMKLMEEGKLAEIAFGMATSI 240 (528) T ss_pred cccccccccccccccccccccccceeccccccccccccccccccccCccccCCcccccccccccccccccccccccchhh Confidence 211100000 0 00000011111112223455688999999999 Q ss_pred hhcc---C-CCCCccccceeEEEEEEEEeecccccceecHHHHHhHHHhhCCChhHHHHHHHHHHHHHHhhHHHHHHHhh Q lcl|NC_015288. 188 LEQA---G-DAGKLFREMSFSIEKTSVTAKSRALKAEYTLELAQDLKAIHGLDAEQELANILSSEVLAEINREVVRRVYS 263 (468) Q Consensus 188 aE~l---G-~~g~~f~EMaFsIeK~tVtAKSRaLKAEYTvELAQDLkAiHGLDAE~ELanILStEImlEINREII~~l~~ 263 (468) +|.+ | +++++|+||+|+||||+|||||||||||||||||||||||||||||+||+||||+|||+|||||||++|+. T Consensus 241 AE~le~lg~ss~~~f~EMaFsIEKvTVtAKSRaLKAEYTiELAQDLKAIHGLDAEtELaNILStEImlEINReii~~i~~ 320 (528) T protein:vir:80 241 AEIQEGFNGSSNNPWAEMSMRIDKQVVEAKSRQLKARYSIEVAQDLRAVHGMDADAELNAILANEVLLEINREIVDVINF 320 (528) T ss_pred hhhhcccCCCccccccceeeEEEEEEEeeeccceeccccHHHHHHHHHhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhh Confidence 9965 3 45788999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hcchhhcccc----ccceeeeeecCCc---chhHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEchhHHHHHhhcccc Q lcl|NC_015288. 264 VAKPGAANNV----ANAGIFDLDVDSN---GRWSVEKFKGLLFQIERDCNAIAQDTRRGKGNFLICSADVASALAMAGVL 336 (468) Q Consensus 264 vA~~~k~~~~----~~~Gv~Dl~~~~~---~rw~~e~~~~l~~~i~~ean~i~q~T~rg~~n~~v~S~~va~~L~~~G~~ 336 (468) +|+.||+.++ +++|+|||+++.| +||++|+||+|+|||+||+|+|+|+|+||+|||||||++||++|+|||.+ T Consensus 321 ~a~~~~~~~t~~~~~~~G~~dl~~~~d~~g~r~~~e~~k~L~~~i~~~an~I~~~T~~~~gn~vi~S~~Va~~L~~~g~~ 400 (528) T protein:vir:80 321 TAQVGKTGMTQTVGSKAGVFDLQDPIDTRGARWAGESFKSLIYQIDKEAAEIARQTGRGAGNFVIASRNVVNILASADQG 400 (528) T ss_pred eeeeeeeeeeeccccccceeeccccccccccchhHHHHHHHHHHHHHHHHHHHHhhccccccEEEEchHHHHHHhhcccc Confidence 9999998776 6789999998877 89999999999999999999999999999999999999999999999999 Q ss_pred ccccccccccccccccccCCCceeEEEecCCeEEEEccccccCCCcceEEEEEecCCcccceeEEccccccccccccCCc Q lcl|NC_015288. 337 DYSSGLTGAGGPAIGTVDDTGNLAVGTINGRIKVYVDPYAANLSDKHYYVVGYKGTSPYDAGLFYCPYVPLQMVRSIDPN 416 (468) Q Consensus 337 ~~~~~~~~~~~~~~~~~D~t~~~~~G~l~~~~~vy~D~Ya~~~s~~dY~~vG~Kg~~~~d~glfyaPYv~~~~~~~~Dp~ 416 (468) ++.+....+. ..++|+|+.+|+|+|+|||+||||||+ ++|||+|||||++++|+|||||||||++|++++||+ T Consensus 401 ~~~~~~~~~~---~~~~d~~~~~~~G~l~~~~~vy~D~y~----~~dy~~vG~KG~~~~~~glfy~PYv~l~~~~~~dp~ 473 (528) T protein:vir:80 401 ISLAMQGAAK---GLNTDTTKAVFAGVLAGKYKVFIDQYA----RQDYFTVGYKGDNEMDAGIYYAPYVALTPLRATDPQ 473 (528) T ss_pred cccccccccc---ccccCCCCceEEEEecCceEEEecCCC----CcceEEEEEeCCcccccceeecccccceeeEeeCCc Confidence 8887766654 468999999999999999999999986 489999999999999999999999999999999999 Q ss_pred cccceeeeeeecceeecCcccccCc--ccccCChhhhhh--ccCceeeeEEeecc Q lcl|NC_015288. 417 NFQPKIGFKTRYGMVSNPFVTTNGL--YSGTPDGETLTP--STNMYYRRVQVTNL 467 (468) Q Consensus 417 s~qP~~g~~tRY~l~~nPf~~~~~~--~~~~~~~~~~~~--~~N~y~r~~~v~~~ 467 (468) ||||++||||||||++|||+++.++ +++++++.+|.+ ++|.|||||+|||| T Consensus 474 sfqP~~g~~tRY~l~~NP~~~~~~~~~~~r~~~g~~~~~~ag~n~~~r~~~Vk~~ 528 (528) T protein:vir:80 474 SFHPVLGFKTRYGIGINPFADSKSQAPSARITSGMLSKDSVGKNAYFRRVWVKGC 528 (528) T ss_pred cccceeeeeeeeceeecCcccccCCcccccccccchhhhhcCccceeEEeeeccC Confidence 9999999999999999999998765 578999999974 67999999999999 No 11 >protein:vir:101811 Length: 529 # NCBI annotation: gp23 # Family: family:all:364 # MgeID: mge:1580 # MgeName: 31 # Cross-refs: genbank:acc:YP_238888;genbank:gi:66391963;genbank:GeneID:3416638 Probab=100.00 E-value=1.3e-221 Score=1231.48 Aligned_cols=460 Identities=37% Similarity=0.622 Sum_probs=397.2 Q ss_pred CcchHHHHHhhhhhhcCCccccccchhhhhhhhhhhhhHHHHHhhhhhhhhhccccccC--------ccccccccc-ccc Q lcl|NC_015288. 1 MFNAEHLQEKWSPVLNNEAANPIADRYKKAVTSVLLENQERFLREERGMLQEVAVNSLG--------AGTVSPGGS-ALG 71 (468) Q Consensus 1 ~~~~~~l~~kw~p~l~~~~~~~i~~~~~~~~~~~llenq~~~~~e~~~~l~e~~~~~~g--------~~~~~~~~~-~~~ 71 (468) -+++|+|+|||+|||||||+|||++.|||+|+++|||||||+++|++.|+++++.|+++ +|+|.+.+. +.+ T Consensus 2 ~~~~~~l~~kw~p~l~~~~~~~i~~~~~~~~~a~l~enq~~~~~~~~~~~~~~~~e~~~~~l~e~~~~~~~~~~~~~i~~ 81 (529) T protein:vir:10 2 SLKNKEILNKWTPLLEGEGLPEIAGKNKQALVAQILEAQEKDSKSDPVYRDDKLIEAFGQSLMEAEVAGDHGYDPTNIAA 81 (529) T ss_pred ccchHHHHHHhhHhhcCCccchhccchhhhhhhhhhhhhHHHHhcccccchhhhhhhhhccchhhccccccccccccccc Confidence 57888999999999999999999999999999999999999999999998888777765 456655554 558 Q ss_pred cccccccccccceehhhhHHhhhhhhhhheeeeecCCccceeeeeeeeeecCCC----CCcccccc--cccccccccccc Q lcl|NC_015288. 72 SANTAGLAGFDPVLISLVRRAMPNLMAYDVCGVQPMSGPTGLIFAMRSRYENQA----GEEALFNE--PDAGFTAGLDAT 145 (468) Q Consensus 72 st~tg~~~~~~P~Lv~l~RRa~~~LIa~DI~GVQPmTGPTGLIFAMRsrY~~qs----G~EA~fnE--a~t~fSG~~~~~ 145 (468) |++|++|+++||+||+||||++|||||+|||||||||||||||||||+||.++. +.|+||++ +++.||+..... T Consensus 82 st~t~~v~~~~P~Li~lvRra~p~LIa~DIwGVQPMTgPTGLIFAMRsrY~~~~~~~~~~eaf~~~~~pda~~sga~~~g 161 (529) T protein:vir:10 82 GQSSGAITNIGPAVIGMVRRAIPSLIAFDIAGVQPMTGPTGQVFALRSVYGKDPLAAGAKEAFHPMYAPDAWHSSLATKG 161 (529) T ss_pred ccccccccccCchhhhhHHHHHHhhhhhhhheeccCCchhhhhheeeeeecCCccccccccccccccccccccccccccc Confidence 899999999999999999999999999999999999999999999999999874 45777765 566666654332 Q ss_pred cccccccc------------------------cccc------ccCcc--------cccccccccccccccccccccchhh Q lcl|NC_015288. 146 TGAYTPRT------------------------GAGV------GGDAE--------GNNPALLNDSSPGTYETPRGFSRED 187 (468) Q Consensus 146 ~~~~~~~~------------------------~~~~------~~~~~--------g~~~~~~~~a~~g~~t~~~gm~Ta~ 187 (468) ........ +... ..... +.............++++.||+|+. T Consensus 162 a~t~~~~t~~~~~ta~~~~a~g~g~ea~f~ea~t~fs~~~~g~~~~~g~~~t~~~~~~~~~~~~a~~~~~~~~~GmsTa~ 241 (529) T protein:vir:10 162 ATTTTDGTPFAKLTAGQAIAEGDIVGHFFYESGTAFLQNVSGASVTVGTNETGEALDKLINAAIGEGKLAEIAEGMATSI 241 (529) T ss_pred ccccccccccccccccccccccccceeeecccCceeeccccccccccCccccCcccccccccccccccccccccchhhhh Confidence 21110000 0000 00000 1111112223456788999999999 Q ss_pred hhccC----CCCCccccceeEEEEEEEEeecccccceecHHHHHhHHHhhCCChhHHHHHHHHHHHHHHhhHHHHHHHhh Q lcl|NC_015288. 188 LEQAG----DAGKLFREMSFSIEKTSVTAKSRALKAEYTLELAQDLKAIHGLDAEQELANILSSEVLAEINREVVRRVYS 263 (468) Q Consensus 188 aE~lG----~~g~~f~EMaFsIeK~tVtAKSRaLKAEYTvELAQDLkAiHGLDAE~ELanILStEImlEINREII~~l~~ 263 (468) +|+|+ +++++|+||+|+||||+|||||||||||||||||||||||||||||+||+||||||||+||||||||+||+ T Consensus 242 aEaL~~~ggss~~~f~EMaFsIeK~tVtAKSRaLKAEYTiELAQDLKAVHGLDAEtELsNILStEImlEINReii~~l~~ 321 (529) T protein:vir:10 242 AELRQGFNGSNDNPWNEMSFRIDKQTVEAKSRQLKAQYSIELAQDLRAVHGMDADSELNGILANEVMLEINREVIDWINY 321 (529) T ss_pred hhccccCCCcccccccceeeEEEEEEEeeeccceeccccHHHHHHHHHhcCCChHHHHHHHHHHHHHHHhhHHHHHHHhh Confidence 99994 45678999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hcchhhcccc----ccceeeeeecCCc---chhHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEchhHHHHHhhcccc Q lcl|NC_015288. 264 VAKPGAANNV----ANAGIFDLDVDSN---GRWSVEKFKGLLFQIERDCNAIAQDTRRGKGNFLICSADVASALAMAGVL 336 (468) Q Consensus 264 vA~~~k~~~~----~~~Gv~Dl~~~~~---~rw~~e~~~~l~~~i~~ean~i~q~T~rg~~n~~v~S~~va~~L~~~G~~ 336 (468) +|+|||+.|+ +.+|+|||+++.| +||++|+||+|++||++|+|+|+|+|+||+|||||||++||++|+|+|+ T Consensus 322 ~a~~~~~~~~~~~~~~~Gv~d~~~~~~~~~~~~~~e~~~~L~~~i~~~an~I~~~T~rg~~n~vi~S~~Va~~L~~~~~- 400 (529) T protein:vir:10 322 TAQVGKSGWTKTDGSASGVFDFQDPIDVRGARWAGESYKALLIQIDKEANEIARQTGRGAGNFIIASRNVVSALALIDT- 400 (529) T ss_pred hhhhhccccccccccccceeecccCccccccchHHHHHHHHHHHHHHHHHHHHHhhccccceEEEEchHHHHHHHhhcc- Confidence 9999999998 5569999999876 9999999999999999999999999999999999999999999999995 Q ss_pred ccccccccccccccccccCCCceeEEEecCCeEEEEccccccCCCcceEEEEEecCCcccceeEEccccccccccccCCc Q lcl|NC_015288. 337 DYSSGLTGAGGPAIGTVDDTGNLAVGTINGRIKVYVDPYAANLSDKHYYVVGYKGTSPYDAGLFYCPYVPLQMVRSIDPN 416 (468) Q Consensus 337 ~~~~~~~~~~~~~~~~~D~t~~~~~G~l~~~~~vy~D~Ya~~~s~~dY~~vG~Kg~~~~d~glfyaPYv~~~~~~~~Dp~ 416 (468) ++.|+..+.. .+.++|+++.+|+|+|+|||+||||||+ ++|||+|||||++++|+|||||||||++|++++||+ T Consensus 401 ~~~~~~~~~~--sg~~~d~~~~~~~G~l~~~~~vy~D~y~----~~dy~~vG~KG~~~~~~glfy~PYv~l~~~~~~dp~ 474 (529) T protein:vir:10 401 NISPAAQGMA--SGLNADTTKGVFAGILGGRYKVYIDQYA----RQDYFTMGYRGANNLDAGIYYCPYVALTPLRGFDPK 474 (529) T ss_pred cccccccccc--cccccccCCceEEEEecCceEEEecCCC----CcceEEEEEeCCcccccceeeccccccccccccCCC Confidence 5555544443 3457899999999999999999999986 489999999999999999999999999999999999 Q ss_pred cccceeeeeeecceeecCcccccCc--ccccCChhhhhh--ccCceeeeEEeecc Q lcl|NC_015288. 417 NFQPKIGFKTRYGMVSNPFVTTNGL--YSGTPDGETLTP--STNMYYRRVQVTNL 467 (468) Q Consensus 417 s~qP~~g~~tRY~l~~nPf~~~~~~--~~~~~~~~~~~~--~~N~y~r~~~v~~~ 467 (468) ||||++||||||||++|||+++.++ +.++++|.+|.+ +.|.|||||+|||| T Consensus 475 sfqP~~g~~tRY~l~~NP~~~~~~~~~~~r~~~g~~~~~~ag~n~~~r~~~Vk~l 529 (529) T protein:vir:10 475 NFQPVMGFKTRYAIGVNPFAESRTQAPQGRITSGMPGVNSVGKNAYFRRVWVKGL 529 (529) T ss_pred cccceeeeeeeeceeecCccccccccccccccCCcchhhhcCccceeEEeeeccC Confidence 9999999999999999999987554 578999988876 57899999999999 No 12 >protein:vir:6601 Length: 528 # NCBI annotation: major capsid protein # Family: family:all:364 # MgeID: mge:139 # MgeName: RB49 # Cross-refs: genbank:acc:NP_891732;genbank:gi:33620668;genbank:GeneID:1725275 Probab=100.00 E-value=1.7e-221 Score=1230.89 Aligned_cols=460 Identities=37% Similarity=0.576 Sum_probs=405.6 Q ss_pred CcchHHHHHhhhhhhcCCccccccchhhhhhhhhhhhhHHHHHhhhhhhhhhccccccC--------ccccccc-ccccc Q lcl|NC_015288. 1 MFNAEHLQEKWSPVLNNEAANPIADRYKKAVTSVLLENQERFLREERGMLQEVAVNSLG--------AGTVSPG-GSALG 71 (468) Q Consensus 1 ~~~~~~l~~kw~p~l~~~~~~~i~~~~~~~~~~~llenq~~~~~e~~~~l~e~~~~~~g--------~~~~~~~-~~~~~ 71 (468) |+++|+|+|||+|||||||+|||++.|||+|+++|||||||+++|++.|+++++.|++| +++|++. .++.+ T Consensus 1 ~~~~~~l~~kw~p~l~~~~~~~i~~~~~~~~~a~l~enq~~~~~~~~~~~~~~~~~~~~~~l~ea~~~~~~~~~~~~i~e 80 (528) T protein:vir:66 1 MKTTKELMEKWSPLLENEKLPEIATASKQKLVAKILESQEADFAVDPIYKDEKVVEAFGGFIAEAEVAGDHGYDASQIAA 80 (528) T ss_pred CcchHHHHHHhHHhhcCCCcchhcchhhhhhhhhhhhhhHHHhhcccchhhHHHHHhhhhhhhhhcccccccccchhccc Confidence 99999999999999999999999999999999999999999999999999999888886 4566555 45568 Q ss_pred cccccccccccceehhhhHHhhhhhhhhheeeeecCCccceeeeeeeeeecCCC-------------CCccccccccccc Q lcl|NC_015288. 72 SANTAGLAGFDPVLISLVRRAMPNLMAYDVCGVQPMSGPTGLIFAMRSRYENQA-------------GEEALFNEPDAGF 138 (468) Q Consensus 72 st~tg~~~~~~P~Lv~l~RRa~~~LIa~DI~GVQPmTGPTGLIFAMRsrY~~qs-------------G~EA~fnEa~t~f 138 (468) |++|++|+++||+||+|+||++|||||+|||||||||||||||||||++|.++. |+|++|+|+++.| T Consensus 81 s~~t~~v~~~~P~Li~lvRRa~p~LIa~DIwGVQPMTgPTGlIFAmRs~Y~~~~~~~~~~eAfh~~~g~ea~fsea~t~~ 160 (528) T protein:vir:66 81 GQTTGAITNVGPAVIGMVRRAIPNLIAFDICGVQPMSTPTSQIFAIRSVYGGDPLKSGAREAFHPMYAPDAFHSSLAAKE 160 (528) T ss_pred cccccccccCchhHHHHHHHHHHhhhhhhhheeecCCchhhhheeeeeeecCCccccccccccccccccccccccccccc Confidence 899999999999999999999999999999999999999999999999998764 4577777777766 Q ss_pred ccccccccc-------------cc------------------ccccccccccCcccccccccccccccccccccccchhh Q lcl|NC_015288. 139 TAGLDATTG-------------AY------------------TPRTGAGVGGDAEGNNPALLNDSSPGTYETPRGFSRED 187 (468) Q Consensus 139 SG~~~~~~~-------------~~------------------~~~~~~~~~~~~~g~~~~~~~~a~~g~~t~~~gm~Ta~ 187 (468) ++.++.+.- .. .........+...++.+...+.+..+.++++.||+|++ T Consensus 161 a~~gGpTGliFAm~s~y~s~~~g~ea~~nea~t~fs~~~~~~~~~~~~~~~g~~~g~~~~~~~~a~~~~~~~~~Gm~Ta~ 240 (528) T protein:vir:66 161 ATVGSPTGTAFAKLTLSQAITAGDIVYHTFAETGIAYLQNVTGDSVTPQKVGSESEDEVVMKLIEEGKLAEIAFGMATSI 240 (528) T ss_pred ccccCCccceeecccccccccccceeeecccccceeeeccccccccccCcccccccccccccccccccceecccccchhh Confidence 643221100 00 00000011122233444555556677899999999999 Q ss_pred hhcc---C-CCCCccccceeEEEEEEEEeecccccceecHHHHHhHHHhhCCChhHHHHHHHHHHHHHHhhHHHHHHHhh Q lcl|NC_015288. 188 LEQA---G-DAGKLFREMSFSIEKTSVTAKSRALKAEYTLELAQDLKAIHGLDAEQELANILSSEVLAEINREVVRRVYS 263 (468) Q Consensus 188 aE~l---G-~~g~~f~EMaFsIeK~tVtAKSRaLKAEYTvELAQDLkAiHGLDAE~ELanILStEImlEINREII~~l~~ 263 (468) +|++ | +++++|+||+|+||||+|||||||||||||||||||||||||||||+||+||||||||+|||||||++|+. T Consensus 241 aEale~lg~~s~~~f~EMaFsIeK~tVtAKSRaLKAEYTiELAQDLKAIHGLDAEtELsNILStEImlEINREii~~i~~ 320 (528) T protein:vir:66 241 AEIQEGFNGSSNNPWAEMSMRIDKQVVEAKSRQLKARYSIEVAQDLRAVHGMDADAELNAILANEVLLEINREIVDVINF 320 (528) T ss_pred hhhhcccCCCcccchhhcceEEEeEEEEeeccceeccccHHHHHHHHHhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhh Confidence 9975 3 45678999999999999999999999999999999999999999999999999999999999999999999 Q ss_pred hcchhhcccc----ccceeeeeecCCc---chhHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEchhHHHHHhhcccc Q lcl|NC_015288. 264 VAKPGAANNV----ANAGIFDLDVDSN---GRWSVEKFKGLLFQIERDCNAIAQDTRRGKGNFLICSADVASALAMAGVL 336 (468) Q Consensus 264 vA~~~k~~~~----~~~Gv~Dl~~~~~---~rw~~e~~~~l~~~i~~ean~i~q~T~rg~~n~~v~S~~va~~L~~~G~~ 336 (468) +|+.||+.++ +++|+|||+++.| +||++|+||+|+|||+||+|+|+|+|+||+|||||||++||++|+|||.+ T Consensus 321 ~a~~~~~~~t~~~~~~aG~~dl~~~~d~~g~rw~~e~~k~L~~~i~~~an~I~~~T~r~~gn~vi~S~~Va~~L~~~g~~ 400 (528) T protein:vir:66 321 TAQVGKTGMTQTVGSKAGVFDLQDPIDTRGARWAGESFKSLIYQIDKEAAEIARQTGRGAGNFVIASRNVVNILASADQG 400 (528) T ss_pred eeeeeeeeeeeccccccceeecccccccccchhHHHHHHHHHHHHHHHHHHHHHhhccccccEEEEchHHHHHHhhcccc Confidence 9999998776 5689999998876 69999999999999999999999999999999999999999999999999 Q ss_pred ccccccccccccccccccCCCceeEEEecCCeEEEEccccccCCCcceEEEEEecCCcccceeEEccccccccccccCCc Q lcl|NC_015288. 337 DYSSGLTGAGGPAIGTVDDTGNLAVGTINGRIKVYVDPYAANLSDKHYYVVGYKGTSPYDAGLFYCPYVPLQMVRSIDPN 416 (468) Q Consensus 337 ~~~~~~~~~~~~~~~~~D~t~~~~~G~l~~~~~vy~D~Ya~~~s~~dY~~vG~Kg~~~~d~glfyaPYv~~~~~~~~Dp~ 416 (468) ++.+....+.+ .++|+++.+|+|+|+|||+||||||+ ++|||+|||||++++|+|||||||||++|++++||+ T Consensus 401 ~~~~~~~~~~~---~~~d~~~~~~~G~l~~~~~vy~D~y~----~~dy~~vG~KG~~~~~~glfyaPYv~l~~~~~~dp~ 473 (528) T protein:vir:66 401 ISLAMQGAAKG---LNTDTTKAVFAGVLAGKYKVFIDQYA----RQDYFTVGYKGDNEMDAGIYYAPYVALTPLRATDPQ 473 (528) T ss_pred ccccccccccc---cccCCCCceeEEEecCceEEEecCCC----CcceEEEEEeCCcccccceeecccccceeeEeeCCc Confidence 99887766554 68999999999999999999999986 489999999999999999999999999999999999 Q ss_pred cccceeeeeeecceeecCcccccCc--ccccCChhhhhh--ccCceeeeEEeecc Q lcl|NC_015288. 417 NFQPKIGFKTRYGMVSNPFVTTNGL--YSGTPDGETLTP--STNMYYRRVQVTNL 467 (468) Q Consensus 417 s~qP~~g~~tRY~l~~nPf~~~~~~--~~~~~~~~~~~~--~~N~y~r~~~v~~~ 467 (468) ||||++||||||||++|||+++..+ +++++++.+|.+ ++|.|||||+|||| T Consensus 474 sfqP~~g~~tRY~l~vNP~~~~~~~~~~~ri~~g~~~~~~ag~n~~~r~~~Vk~~ 528 (528) T protein:vir:66 474 SFHPVLGFKTRYGIGINPFADSKSQEPSARITSGMLSKDSVGKNAYFRRVWVKGC 528 (528) T ss_pred cccceeeeeeeeceeecCcccccCccccccccccchhhhhcCccceeEEeeeccC Confidence 9999999999999999999998744 689999999974 67999999999999 No 13 >protein:vir:98143 Length: 524 # NCBI annotation: gp23 precursor of major head subunit # Family: family:all:364 # MgeID: mge:1667 # MgeName: RB43 # Cross-refs: genbank:acc:YP_239203;genbank:gi:66391678;genbank:GeneID:3416245 Probab=100.00 E-value=4.9e-221 Score=1228.41 Aligned_cols=458 Identities=38% Similarity=0.637 Sum_probs=407.5 Q ss_pred CcchHHHHHhhhhhhcC-CccccccchhhhhhhhhhhhhHHHHHhhhhhhhhhccccccC--------cccccccccc-c Q lcl|NC_015288. 1 MFNAEHLQEKWSPVLNN-EAANPIADRYKKAVTSVLLENQERFLREERGMLQEVAVNSLG--------AGTVSPGGSA-L 70 (468) Q Consensus 1 ~~~~~~l~~kw~p~l~~-~~~~~i~~~~~~~~~~~llenq~~~~~e~~~~l~e~~~~~~g--------~~~~~~~~~~-~ 70 (468) |+++|+|+|||+||||+ ||+|||++.+||+|+++||||||||++|++.|++++++|++| +|+|++++.+ . T Consensus 1 ~~~~~~l~~kw~p~l~~~~~~~~i~~~~~~~~~a~llenq~~~~~~~~~~~~~~~~~~~~~~l~ea~~~~~~~~~~~~i~ 80 (524) T protein:vir:98 1 MSKKNELMEKWNDLLESQEGLPDIATKSKKQLVAAILEAQEKDAETDPVYRDEKIVESFGGFLAEAEIAGDHNYDQTNIA 80 (524) T ss_pred CcchHHHHHHhHHHhcCCcCcchhcchhhHHHHHHHHhhHHHHHhcCccccchHHHHhhhcccccccccccccccccccc Confidence 99999999999999996 899999999999999999999999999999999999998887 5788888887 5 Q ss_pred ccccccccccccceehhhhHHhhhhhhhhheeeeecCCccceeeeeeeeeecCC---CCCccccccc-------cccccc Q lcl|NC_015288. 71 GSANTAGLAGFDPVLISLVRRAMPNLMAYDVCGVQPMSGPTGLIFAMRSRYENQ---AGEEALFNEP-------DAGFTA 140 (468) Q Consensus 71 ~st~tg~~~~~~P~Lv~l~RRa~~~LIa~DI~GVQPmTGPTGLIFAMRsrY~~q---sG~EA~fnEa-------~t~fSG 140 (468) +|++|++|+++||+||+||||++|||||+|||||||||||||||||||+||.++ .|+|++|||| ++.||| T Consensus 81 ~s~~t~~v~~~~P~Li~lvRra~p~LIa~DIwGVQPMTgPTGLIFAmRsrY~n~~~~~gteA~~nEAf~~~ye~dt~fSG 160 (524) T protein:vir:98 81 SGKSSGAITNIGPAVIGMVRRAIPNLIAFDICGVQPMTGPTGQVFALRAVYGKDPLAGGTPADVREAFHPMFAPDTMYSG 160 (524) T ss_pred ccccccccccccchhhhHHHHHHHhhhhhhhheeccCCchhhhhhhhheeecCCCCCcccccccccccccccccccccCC Confidence 799999999999999999999999999999999999999999999999999998 4779999886 889998 Q ss_pred cccccccccccccccc-----------------------cccCccccccccccc------ccccccccccccchhhhhcc Q lcl|NC_015288. 141 GLDATTGAYTPRTGAG-----------------------VGGDAEGNNPALLND------SSPGTYETPRGFSREDLEQA 191 (468) Q Consensus 141 ~~~~~~~~~~~~~~~~-----------------------~~~~~~g~~~~~~~~------a~~g~~t~~~gm~Ta~aE~l 191 (468) ....+.....+..... ......+++|...+. .....++++.||+|+.+|+| T Consensus 161 ~g~~t~~s~~~~g~~~~~g~~~~~~~~~~g~~~~~~~~~g~~~~tgt~p~~~~~a~~~~~~~g~~~~~~~GmsTA~aEaL 240 (524) T protein:vir:98 161 EGAHTAFAKITTGTAIATGAIVYHIFQETGIAYFQNVTSGNVTVTGADPAALDAAVIAENEKGTLAEISVGMATSVAELQ 240 (524) T ss_pred ccccccccccccccccccccccccccccccceeccccccCcccccccccccccccccccccccceeecccccchhhhhhh Confidence 7654433222221111 111223444433332 23446788999999999998 Q ss_pred C----CCCCccccceeEEEEEEEEeecccccceecHHHHHhHHHhhCCChhHHHHHHHHHHHHHHhhHHHHHHHhhhcch Q lcl|NC_015288. 192 G----DAGKLFREMSFSIEKTSVTAKSRALKAEYTLELAQDLKAIHGLDAEQELANILSSEVLAEINREVVRRVYSVAKP 267 (468) Q Consensus 192 G----~~g~~f~EMaFsIeK~tVtAKSRaLKAEYTvELAQDLkAiHGLDAE~ELanILStEImlEINREII~~l~~vA~~ 267 (468) + +++++|+||+|+||||+|||||||||||||||||||||||||||||+||+||||||||+|||||||++|+.+|+. T Consensus 241 ~~~g~ss~~~f~EMaFsIeKvtVtAKSRaLKAEYTiELAQDLKAVHGLDAEtELsNILSTEImlEINReii~~i~~~a~~ 320 (524) T protein:vir:98 241 ENFNGSSANPWNEMAFRIDKQVIEARSRQLKAQYSVELAQDLRAVHGMDADAELSAILATEIMLEINREIVDLINYTAQV 320 (524) T ss_pred ccCCCCccccccceeeEEEEEEEeeecccccccccHHHHHHHHHhcCCChHHHHHHHHHHHHHHHhhHHHHHHHhhhhee Confidence 3 456789999999999999999999999999999999999999999999999999999999999999988777777 Q ss_pred hhccc----cccceeeeeecCCc---chhHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEchhHHHHHhh--cccccc Q lcl|NC_015288. 268 GAANN----VANAGIFDLDVDSN---GRWSVEKFKGLLFQIERDCNAIAQDTRRGKGNFLICSADVASALAM--AGVLDY 338 (468) Q Consensus 268 ~k~~~----~~~~Gv~Dl~~~~~---~rw~~e~~~~l~~~i~~ean~i~q~T~rg~~n~~v~S~~va~~L~~--~G~~~~ 338 (468) ++..+ ++++|+|||+++.| +||++|+||+|++||++|+|+|+|+|+||+|||||||+|||++|+| +||+++ T Consensus 321 ~~~g~t~~~~~~~G~~dl~~~~d~~~~r~~~e~~~~L~~~i~~~an~I~~~T~rg~~n~~i~S~~Va~~L~~~~~g~~~~ 400 (524) T protein:vir:98 321 GKSGFTQTVGSKAGSFDFQDPVDIRGARWAGESYKALLIQIDKEANEIARQTGRGAGNFIIASRNVVSALARIDSGITPA 400 (524) T ss_pred ceeecccccccccceeeccccccccccchhHHHHHHHHHHHHHHHHHHHHhhccccccEEEEchHHHHHHhhhhcccccc Confidence 66653 35579999999965 9999999999999999999999999999999999999999999998 999999 Q ss_pred ccccccccccccccccCCCceeEEEecCCeEEEEccccccCCCcceEEEEEecCCcccceeEEccccccccccccCCccc Q lcl|NC_015288. 339 SSGLTGAGGPAIGTVDDTGNLAVGTINGRIKVYVDPYAANLSDKHYYVVGYKGTSPYDAGLFYCPYVPLQMVRSIDPNNF 418 (468) Q Consensus 339 ~~~~~~~~~~~~~~~D~t~~~~~G~l~~~~~vy~D~Ya~~~s~~dY~~vG~Kg~~~~d~glfyaPYv~~~~~~~~Dp~s~ 418 (468) ++++.. .+++|+++.+|+|+|+|||+||||||+ ++|||+|||||++++|+||||||||||+|+|++||+|| T Consensus 401 s~~~~~-----~~~~d~~~~~~~G~l~~~~~vy~D~y~----~~dy~~vG~KG~~~~~~glfyaPYv~l~~~~~~dp~sf 471 (524) T protein:vir:98 401 SQGLQK-----TLNVDTTKAVFAGVLGGTYKVYIDQYA----RQDYFTVGFKGDNEMDAGIYYAPYVALTPLRGSDPKNF 471 (524) T ss_pred cchhhc-----ccccCCccceEEEEecCceEEEecCCC----CcceEEEEeeCCcccccceeeccccccccccccCCccc Confidence 887755 368999999999999999999999986 48999999999999999999999999999999999999 Q ss_pred cceeeeeeecceeecCcccccCc--ccccCChhhhhh--ccCceeeeEEeecc Q lcl|NC_015288. 419 QPKIGFKTRYGMVSNPFVTTNGL--YSGTPDGETLTP--STNMYYRRVQVTNL 467 (468) Q Consensus 419 qP~~g~~tRY~l~~nPf~~~~~~--~~~~~~~~~~~~--~~N~y~r~~~v~~~ 467 (468) ||++||||||||++|||+++.++ +.|+++|.+|.+ ++|.|||||+|||| T Consensus 472 qP~~g~~tRY~l~~NP~~~~~~~~~~~ri~~g~~~~~~ag~n~~~r~~~Vk~l 524 (524) T protein:vir:98 472 QPVMGFKTRYGIGINPFANSRSQAPADRITSGMISKEMCGKNAYFRKVWVKGL 524 (524) T ss_pred cceeeeeeeeceeecCcccccCCccccccccCcchHhhcCccceeeEeeeccC Confidence 99999999999999999988665 358999999985 68999999999999 No 14 >protein:vir:5670 Length: 514 # NCBI annotation: gp23 # Family: family:all:364 # MgeID: mge:119 # MgeName: KVP40 # Cross-refs: genbank:acc:NP_899609;genbank:gi:34419596;genbank:GeneID:2546039 Probab=100.00 E-value=1.6e-220 Score=1225.53 Aligned_cols=455 Identities=38% Similarity=0.639 Sum_probs=394.4 Q ss_pred HHHHHhhhhhhcCCc--cccccchhhhhhhhhhhhhHHHHHhhhhhhhhhccccccC--------ccccccccccc-ccc Q lcl|NC_015288. 5 EHLQEKWSPVLNNEA--ANPIADRYKKAVTSVLLENQERFLREERGMLQEVAVNSLG--------AGTVSPGGSAL-GSA 73 (468) Q Consensus 5 ~~l~~kw~p~l~~~~--~~~i~~~~~~~~~~~llenq~~~~~e~~~~l~e~~~~~~g--------~~~~~~~~~~~-~st 73 (468) -+|+|||+||||||| +|||++.+||+|+++||||||||++|++.|.+++++|+++ +|+|++++.++ +|+ T Consensus 1 ~~l~~kw~p~l~~~~~~~~~i~~~~~~~~~~~l~enq~~~~~~~~~~~~~~~~~~~~~~l~e~~~~~~~~~~~~~ia~s~ 80 (514) T protein:vir:56 1 MNLTEKWKDLLEAEGADMPEIATATKQKIMSKIFENQDRDINNDPMYRDPQLVEAFNAGLNEAVVNGDHGYDPANIAQGV 80 (514) T ss_pred CchhhhhhHHhcccccccccccchhhhhhhhhhhhhHHHHHhcCCcccchhhhhhhhccccccccccccccccccccccc Confidence 689999999999998 8999999999999999999999999999998887777654 67887777665 789 Q ss_pred cccccccccceehhhhHHhhhhhhhhheeeeecCCccceeeeeeeeeecCC--CCCcccc--cccccccccccccccccc Q lcl|NC_015288. 74 NTAGLAGFDPVLISLVRRAMPNLMAYDVCGVQPMSGPTGLIFAMRSRYENQ--AGEEALF--NEPDAGFTAGLDATTGAY 149 (468) Q Consensus 74 ~tg~~~~~~P~Lv~l~RRa~~~LIa~DI~GVQPmTGPTGLIFAMRsrY~~q--sG~EA~f--nEa~t~fSG~~~~~~~~~ 149 (468) +|++|+++||+||+|+||++|||||+|||||||||||||||||||++|.+| +|+|||| ||+|++|||..+...... T Consensus 81 ~t~~v~~~~P~ll~lvRRa~~~LIa~DIwGVQPMTgPTGLIFAMRsrY~~~~~tg~EAf~~~nEadt~fSG~~~~~~~~~ 160 (514) T protein:vir:56 81 TTGAVTNIGPTVMGMVRRAIPQLIAFDIAGVQPMTGPTSQVFTLRSVYGKDPLTGAEAFHPTRQADASFSGQAAASTIAD 160 (514) T ss_pred ccccccccchhHHHHHHHHHHhhhhhhhheeccCCchhhhheeeeeeecCCCcccccccccccccCcCcccccccccccc Confidence 999999999999999999999999999999999999999999999999988 7889999 999999999765443322 Q ss_pred ccccccccccCc------------c------------------cccccccccccccccccccccchhhhhcc---C-CCC Q lcl|NC_015288. 150 TPRTGAGVGGDA------------E------------------GNNPALLNDSSPGTYETPRGFSREDLEQA---G-DAG 195 (468) Q Consensus 150 ~~~~~~~~~~~~------------~------------------g~~~~~~~~a~~g~~t~~~gm~Ta~aE~l---G-~~g 195 (468) ....+....+.. . .......+.+....|+++.||+|+.+|.+ | +++ T Consensus 161 ~~~~~~~~~G~~~~~~~t~~~gd~~~~~~~~~~~~~~~~~~~~~~t~~~~~~a~~~~y~~~~Gm~Ta~aEal~~lggs~~ 240 (514) T protein:vir:56 161 FPTTGAATDGTPYKAEVTTSGGDVSMRYFLALGAVTLAVAGQMTATEYTDGVAGGLLVEIDAGMATSQAELQENFNGSSN 240 (514) T ss_pred ccccccccccccccccccccccccccccccccccccccccccccccccccccccchhhhhhhhhhhhhhhhcccCCCCcc Confidence 221111111000 0 00001111233456788999999999985 3 456 Q ss_pred CccccceeEEEEEEEEeecccccceecHHHHHhHHHhhCCChhHHHHHHHHHHHHHHhhHHHHHHHh---hhcchhhccc Q lcl|NC_015288. 196 KLFREMSFSIEKTSVTAKSRALKAEYTLELAQDLKAIHGLDAEQELANILSSEVLAEINREVVRRVY---SVAKPGAANN 272 (468) Q Consensus 196 ~~f~EMaFsIeK~tVtAKSRaLKAEYTvELAQDLkAiHGLDAE~ELanILStEImlEINREII~~l~---~vA~~~k~~~ 272 (468) ++|+||+|+||||+|||||||||||||||||||||||||||||+||+||||||||+|||||||++|+ +|+++||+++ T Consensus 241 ~~f~EMaFsIdK~tVtAKSRaLKAEYTiELAQDLKAVHGLDAEtELsNILSTEImlEINReii~~l~~~atv~~~~~~~~ 320 (514) T protein:vir:56 241 NEWNEMSFRIDKQVVEAKSRQLKAQYSIELAQDLRAVHGLDADAELSGILANEVMVELNREIVNLVNSQAQIGKSGWTQG 320 (514) T ss_pred cccceeeeEEEEEEEeeeccceeccccHHHHHHHHHhcCCChHHHHHHHHHHHHHHHhhHHHHHHHHhheeehhcccccc Confidence 7899999999999999999999999999999999999999999999999999999999999998887 5557888899 Q ss_pred cccceeeeeecCCc---chhHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEchhHHHHHhhccccccccccccccccc Q lcl|NC_015288. 273 VANAGIFDLDVDSN---GRWSVEKFKGLLFQIERDCNAIAQDTRRGKGNFLICSADVASALAMAGVLDYSSGLTGAGGPA 349 (468) Q Consensus 273 ~~~~Gv~Dl~~~~~---~rw~~e~~~~l~~~i~~ean~i~q~T~rg~~n~~v~S~~va~~L~~~G~~~~~~~~~~~~~~~ 349 (468) ++++|+|||+++.| +||++|+||+|+|||++|+|+|+|+|+||+|||||||++||++|+|+||+++.++..... . T Consensus 321 ~~~~G~~d~~~~~d~~~~~~~~e~~~~l~~~i~~~an~i~~~T~rg~gn~~i~S~~Va~~L~~sg~l~~~~~~g~~~--~ 398 (514) T protein:vir:56 321 AGAAGVFDFSDAVDVKGARWAGEAYKALLIQIEKEANEIGRQTGRGNGNFIIASRNVVSALSMTDTLVGPAAQGMQD--G 398 (514) T ss_pred cccccccccccccccccchHHHHHHHHHHHHHHHHHHHHHhhcccccccEEEEchhHHHHHHhhhhhccccccCccc--c Confidence 99999999998876 899999999999999999999999999999999999999999999999999976655433 3 Q ss_pred cccccCCCceeEEEecCCeEEEEccccccCCCcceEEEEEecCCcccceeEEccccccccccccCCccccceeeeeeecc Q lcl|NC_015288. 350 IGTVDDTGNLAVGTINGRIKVYVDPYAANLSDKHYYVVGYKGTSPYDAGLFYCPYVPLQMVRSIDPNNFQPKIGFKTRYG 429 (468) Q Consensus 350 ~~~~D~t~~~~~G~l~~~~~vy~D~Ya~~~s~~dY~~vG~Kg~~~~d~glfyaPYv~~~~~~~~Dp~s~qP~~g~~tRY~ 429 (468) .+++|+++.+|+|+|+|||+||||||+. +|||+|||||++++|+||||||||||+|++++||+||||++||||||| T Consensus 399 ~~~~d~~~~~~aG~l~~~~~vy~D~y~~----~dy~~vG~KG~~~~~~glfyaPYv~l~~~~~~dp~sfqP~~g~~tRY~ 474 (514) T protein:vir:56 399 SMNTDTNQTVFAGVLGGRFKVYIDQYAV----NDYFTVGFKGSTEMDAGVFYSPYVPLTPLRGSDSKNFQPVIGFKTRYG 474 (514) T ss_pred ccccccCcceEEEEecCceEEEecCCCC----cceEEEEEecCcceecceeeccccccccccccCCccccceeeeeeeec Confidence 4689999999999999999999999865 899999999999999999999999999999999999999999999999 Q ss_pred eeecCcccccCcccccCChhhhh----hccCceeeeEEeecc Q lcl|NC_015288. 430 MVSNPFVTTNGLYSGTPDGETLT----PSTNMYYRRVQVTNL 467 (468) Q Consensus 430 l~~nPf~~~~~~~~~~~~~~~~~----~~~N~y~r~~~v~~~ 467 (468) |++|||++......+ .+.+|. .++|.|||||+|||| T Consensus 475 l~~NPy~~~~~~~~~--~~~~~~~~a~~~~n~y~r~v~v~~l 514 (514) T protein:vir:56 475 VQVNPFADPTASATK--VGNGAPVAASMGKNAYFRRVFVKGL 514 (514) T ss_pred eeeCCCCCccccccc--cCCcchhhhcccccceeeeEEEecC Confidence 999999976544432 223333 357899999999999 No 15 >protein:vir:100603 Length: 529 # NCBI annotation: gp23 precursor of major head subunit # Family: family:all:364 # MgeID: mge:1488 # MgeName: 25 # Cross-refs: genbank:acc:YP_656387;genbank:gi:109290138;genbank:GeneID:4156581 Probab=100.00 E-value=1e-218 Score=1215.63 Aligned_cols=460 Identities=37% Similarity=0.622 Sum_probs=402.2 Q ss_pred CcchHHHHHhhhhhhcCCccccccchhhhhhhhhhhhhHHHHHhhhhhhhhhccccccC--------cccccccccc-cc Q lcl|NC_015288. 1 MFNAEHLQEKWSPVLNNEAANPIADRYKKAVTSVLLENQERFLREERGMLQEVAVNSLG--------AGTVSPGGSA-LG 71 (468) Q Consensus 1 ~~~~~~l~~kw~p~l~~~~~~~i~~~~~~~~~~~llenq~~~~~e~~~~l~e~~~~~~g--------~~~~~~~~~~-~~ 71 (468) -+++|+|+|||+|||||||+|+|++.|||+|+++||||||||++|++.|++..+.|+++ +++|++++.+ .+ T Consensus 2 ~~~~~~l~~kw~p~l~~~~~~~i~~~~~~~~~a~l~enq~~~~~~~~~~~~~~~~e~~~~~l~e~~~~~~~~~~~~~ia~ 81 (529) T protein:vir:10 2 SLKTKEILNKWTPLLEGEGLPEIAGKNKQALVAQILEAQEKDSKTDPVYRDDKLIEAFGQSLMEAEVAGDHGYDPTNIAA 81 (529) T ss_pred ccchHHHHHHhhHhhcCCccchhcchhhhhhhhhhhhhHHHHhhcccccchhhhhhhhhhccchhhcccccccccccccc Confidence 57889999999999999999999999999999999999999999999998888777765 5677665555 68 Q ss_pred cccccccccccceehhhhHHhhhhhhhhheeeeecCCccceeeeeeeeeecCCC----CCccc--ccccccccccccccc Q lcl|NC_015288. 72 SANTAGLAGFDPVLISLVRRAMPNLMAYDVCGVQPMSGPTGLIFAMRSRYENQA----GEEAL--FNEPDAGFTAGLDAT 145 (468) Q Consensus 72 st~tg~~~~~~P~Lv~l~RRa~~~LIa~DI~GVQPmTGPTGLIFAMRsrY~~qs----G~EA~--fnEa~t~fSG~~~~~ 145 (468) |++|++|+++||+||+||||++|||||+|||||||||||||||||||+||.+|. |.|+| ++|+|+.|||...+. T Consensus 82 s~~t~~v~~~~P~Li~lvRra~p~LIa~DIwGVQPMTgPTGLIFAMRsrY~~~~~~~~g~eaf~~~~e~dt~~SG~~~~~ 161 (529) T protein:vir:10 82 GQSSGAITNIGPAVIGMVRRAIPSLIAFDIAGVQPMTGPTGQVFALRSVYGKDPLAAGAKEAFHPMYAPDAWHSGLAAKG 161 (529) T ss_pred cccccccccccchhhhhHHHHHHhHHhhhhheeccCCchhhhhhhheeeecCCcCCCccccccccccccccccccccccc Confidence 999999999999999999999999999999999999999999999999999874 55555 689999999976543 Q ss_pred ccccccccc------------------------ccccc------Ccccccc--------cccccccccccccccccchhh Q lcl|NC_015288. 146 TGAYTPRTG------------------------AGVGG------DAEGNNP--------ALLNDSSPGTYETPRGFSRED 187 (468) Q Consensus 146 ~~~~~~~~~------------------------~~~~~------~~~g~~~--------~~~~~a~~g~~t~~~gm~Ta~ 187 (468) ......... ..... ...+.+. ...+.+....++++.||+|+. T Consensus 162 ~~~~~~~~~~~~~t~~~a~~~~~~~~~~~nea~t~~s~~~tg~~~~~g~~~tg~~~~~~~~~~~a~~~~~~~~~gmsTa~ 241 (529) T protein:vir:10 162 ATTSSDGTPFAALTAGQAVATGDIVYHFFYESGSAYLQNVTGGNVTVGTNETGAALDALVSAKIAAGELAEIAEGMATSI 241 (529) T ss_pred ccccccccccccccccceeeccccceeeecccccccccccccccccccccccCCccccccccccccccccccccccchhh Confidence 221111000 00000 0001100 011223345688999999999 Q ss_pred hhccC----CCCCccccceeEEEEEEEEeecccccceecHHHHHhHHHhhCCChhHHHHHHHHHHHHHHhhHHHHHHHhh Q lcl|NC_015288. 188 LEQAG----DAGKLFREMSFSIEKTSVTAKSRALKAEYTLELAQDLKAIHGLDAEQELANILSSEVLAEINREVVRRVYS 263 (468) Q Consensus 188 aE~lG----~~g~~f~EMaFsIeK~tVtAKSRaLKAEYTvELAQDLkAiHGLDAE~ELanILStEImlEINREII~~l~~ 263 (468) +|+|+ ++++.|+||+|+||||+|||||||||||||||||||||||||||||+||+||||+|||+||||||||+|+. T Consensus 242 aEal~~~g~ss~~~f~EMaFsIeK~tVtAKSRaLKAEYTiELAQDLKAvHGLDAEtELsNILStEImlEINReii~~i~~ 321 (529) T protein:vir:10 242 AELRQGFNGTTDNPWNEMSFRIDKQTVEAKSRQLKAQYSIELAQDLRAVHGMDADSELNGILANEVMLEINREVIDWINY 321 (529) T ss_pred hhccccCCCCccccccceeeEEEEEEEeeeccceeccccHHHHHHHHHhcCCChHHHHHHHHHHHHHHHhhHHHHHHhhh Confidence 99983 45678999999999999999999999999999999999999999999999999999999999999998888 Q ss_pred hcchhhcccc----ccceeeeeecCCc---chhHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEchhHHHHHhhcccc Q lcl|NC_015288. 264 VAKPGAANNV----ANAGIFDLDVDSN---GRWSVEKFKGLLFQIERDCNAIAQDTRRGKGNFLICSADVASALAMAGVL 336 (468) Q Consensus 264 vA~~~k~~~~----~~~Gv~Dl~~~~~---~rw~~e~~~~l~~~i~~ean~i~q~T~rg~~n~~v~S~~va~~L~~~G~~ 336 (468) +|+.++..++ +.+|+|||+++.| +||++|+||+|++||++|+|+|+|+|+||+|||||||++||++|+|.|.+ T Consensus 322 ~a~~~~~g~~~~~~~~~gv~d~~~~~d~~~~~~~~e~~~~L~~~i~~~an~I~~~T~rg~~n~vi~S~~Va~~L~~~~~~ 401 (529) T protein:vir:10 322 TAQVGKSGWTQTVGSAAGVFDFQDPIDVRGARWAGESYKALLIQIDKEANEIARQTGRGAGNFIIASRNVVSALALVDAG 401 (529) T ss_pred hceeeeeeeeccccccccceeccccccccccchhHHHHHHHHHHHHHHHHHHHHhhccccceEEEEchHHHHHHhhhccc Confidence 8877776543 6889999998876 89999999999999999999999999999999999999999999999999 Q ss_pred ccccccccccccccccccCCCceeEEEecCCeEEEEccccccCCCcceEEEEEecCCcccceeEEccccccccccccCCc Q lcl|NC_015288. 337 DYSSGLTGAGGPAIGTVDDTGNLAVGTINGRIKVYVDPYAANLSDKHYYVVGYKGTSPYDAGLFYCPYVPLQMVRSIDPN 416 (468) Q Consensus 337 ~~~~~~~~~~~~~~~~~D~t~~~~~G~l~~~~~vy~D~Ya~~~s~~dY~~vG~Kg~~~~d~glfyaPYv~~~~~~~~Dp~ 416 (468) ++.++...+.+ .++|+++.+|+|+|+|||+||||||+ ++|||+|||||++++|+||||||||||+|+|++||+ T Consensus 402 ~~~~~~~~~sg---~~~d~~~~~~~G~l~~~~~vy~D~y~----~~dy~~vG~KG~~~~~~glfy~PYv~l~~~~~~dp~ 474 (529) T protein:vir:10 402 ITPAAQGMASG---LNADTTKGVFAGVLGGRYKVYIDQYA----RQDYFTMGYRGANNLDAGIYYCPYVALTPLRGSDPK 474 (529) T ss_pred ccccccccccc---ceeecCCceEEEEecCceEEEecCCC----CcceEEEEEeCCcccccceeeccccccccccccCCC Confidence 99888776654 56899999999999999999999986 489999999999999999999999999999999999 Q ss_pred cccceeeeeeecceeecCcccccCc--ccccCChhhhhh--ccCceeeeEEeecc Q lcl|NC_015288. 417 NFQPKIGFKTRYGMVSNPFVTTNGL--YSGTPDGETLTP--STNMYYRRVQVTNL 467 (468) Q Consensus 417 s~qP~~g~~tRY~l~~nPf~~~~~~--~~~~~~~~~~~~--~~N~y~r~~~v~~~ 467 (468) ||||++||||||||++|||+++.++ .++++|+.+|.+ ++|.|||||+|||| T Consensus 475 sfqP~~g~~tRY~l~~NP~~~~~~~~~~~r~~~g~~~~~~ag~n~~~r~~~Vk~l 529 (529) T protein:vir:10 475 NFQPVMGFKTRYAIGVNPFAESRTQAPTSRISNGMPGAHSVGKNAYFRRVWVKGL 529 (529) T ss_pred cccceeeeeeeeceeecCccccccccccccccCCcchhhhcCccceeeEeeeccC Confidence 9999999999999999999998766 578999998876 67899999999999 No 16 >protein:vir:107947 Length: 519 # NCBI annotation: gp23 major head protein # Family: family:all:364 # MgeID: mge:2002 # MgeName: JS98 # Cross-refs: genbank:acc:YP_001595301;genbank:gi:161622607;genbank:GeneID:5783666 Probab=100.00 E-value=1.2e-217 Score=1209.75 Aligned_cols=459 Identities=35% Similarity=0.598 Sum_probs=397.1 Q ss_pred cchHHHHHhhhhhhcCCccccccchhhhhhhhhhhhhHHHHHhhhhhhhhhccccccCc---------cccccccccccc Q lcl|NC_015288. 2 FNAEHLQEKWSPVLNNEAANPIADRYKKAVTSVLLENQERFLREERGMLQEVAVNSLGA---------GTVSPGGSALGS 72 (468) Q Consensus 2 ~~~~~l~~kw~p~l~~~~~~~i~~~~~~~~~~~llenq~~~~~e~~~~l~e~~~~~~g~---------~~~~~~~~~~~s 72 (468) |+.|+|+|||+|||||||+|+|++.|||+|+++||||||+++.|.+.|+++.+.|++|. ++++....+.++ T Consensus 1 ~~~~~l~~kw~p~l~~~~~~~i~~~~~~~i~~~~~en~~~~~~~~~~~~~~~~~~~~~~~l~e~~~~~~~~~~~t~i~~~ 80 (519) T protein:vir:10 1 MKKNALVQKWSALLENEALPEIVGASKQAIIAKIFENQEQDILTAPEYRDEKISEAFGSFLTEAEIGGDHGYDATNIAAG 80 (519) T ss_pred CchhHHHHHhHHhhcccccchhhhhhhHHHHHHHHHHHHHHhhhcccccchHHHHHHhhhcchhccCCccccCccccccc Confidence 99999999999999999999999999999999999999999999999999988877763 444555556789 Q ss_pred ccccccccccceehhhhHHhhhhhhhhheeeeecCCccceeeeeeeeeecCCC----CCccc--cccccccccccccccc Q lcl|NC_015288. 73 ANTAGLAGFDPVLISLVRRAMPNLMAYDVCGVQPMSGPTGLIFAMRSRYENQA----GEEAL--FNEPDAGFTAGLDATT 146 (468) Q Consensus 73 t~tg~~~~~~P~Lv~l~RRa~~~LIa~DI~GVQPmTGPTGLIFAMRsrY~~qs----G~EA~--fnEa~t~fSG~~~~~~ 146 (468) ++|+++++++|+||+|+||++|||||+|||||||||||||||||||+||.++. ++|+| |+|+++.|||..+.+. T Consensus 81 ~~t~~v~~~~P~l~~l~rRa~p~LIa~DIwGVQPMTgPTGLIFAMRsrY~n~~~~~~g~ea~~~~nEadt~fSG~~~~~~ 160 (519) T protein:vir:10 81 QTSGAVTQIGPAVMGMVRRAIPHLIAFDICGVQPLNNPTGQVFALRAVYGKDPIAAGAKEAFHPMYAPNAMFSGQGAAET 160 (519) T ss_pred cccccccccchhHHHHHHHHHHhhhhhhhheeecCCchhhhhheeeeeecCCccccccccccccccccccccCccccccc Confidence 99999999999999999999999999999999999999999999999999885 45555 6999999999876544 Q ss_pred cccccccccccccC----------------------cc-cccc------cccccccccccccccccchhhhhcc---C-C Q lcl|NC_015288. 147 GAYTPRTGAGVGGD----------------------AE-GNNP------ALLNDSSPGTYETPRGFSREDLEQA---G-D 193 (468) Q Consensus 147 ~~~~~~~~~~~~~~----------------------~~-g~~~------~~~~~a~~g~~t~~~gm~Ta~aE~l---G-~ 193 (468) ............++ .. ++++ .....+..+.++++.||+|+.+|++ | + T Consensus 161 ~~~~~~~~~~~~g~~~~~~~~~s~~~~~~~~~~~t~~ag~t~~~~~~~a~~~~~~~~~~~~~~~gmsTa~aEal~~lggs 240 (519) T protein:vir:10 161 FEALAASKVLEVGKIYSHFFEATGSAHFQAVEAVTVDAGATDAAKLDAAVTALVEAGQLAEIAEGMATSIAELQEGFNGS 240 (519) T ss_pred cccccccccccccccccccccccccceeccccccccCCCCcCccccccccccccccccccccccccccchhhccccCCCc Confidence 33222111111000 00 1111 1112233467889999999999985 3 4 Q ss_pred CCCccccceeEEEEEEEEeecccccceecHHHHHhHHHhhCCChhHHHHHHHHHHHHHHhhHHHHHHHhhhcchhhcccc Q lcl|NC_015288. 194 AGKLFREMSFSIEKTSVTAKSRALKAEYTLELAQDLKAIHGLDAEQELANILSSEVLAEINREVVRRVYSVAKPGAANNV 273 (468) Q Consensus 194 ~g~~f~EMaFsIeK~tVtAKSRaLKAEYTvELAQDLkAiHGLDAE~ELanILStEImlEINREII~~l~~vA~~~k~~~~ 273 (468) ++++|+||+|+||||+|||||||||||||||||||||||||||||+||+||||||||+|||||||++|+.+|+.++...+ T Consensus 241 s~~~f~EMaFsIeKvTVtAKSRaLKAEYTiELAQDLKAVHGLDAEtELaNILSTEImlEINReii~~i~~sa~~~~~g~t 320 (519) T protein:vir:10 241 TDNPWNEMGFRIDKQVIEAKSRQLKASYSIELAQDLRAVHGMDADAELSGILATEIMLEINREVIDWINYSAQVGKSGMT 320 (519) T ss_pred cccchhhhceeEEEEEEeeecccccccccHHHHHHHHHhcCCChHHHHHHHHHHHHHHHhhHHHHhhhhhhhhcceeecc Confidence 56789999999999999999999999999999999999999999999999999999999999999988777777665444 Q ss_pred cc----ceeeeeecCCc---chhHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEchhHHHHHhhcccccccccccccc Q lcl|NC_015288. 274 AN----AGIFDLDVDSN---GRWSVEKFKGLLFQIERDCNAIAQDTRRGKGNFLICSADVASALAMAGVLDYSSGLTGAG 346 (468) Q Consensus 274 ~~----~Gv~Dl~~~~~---~rw~~e~~~~l~~~i~~ean~i~q~T~rg~~n~~v~S~~va~~L~~~G~~~~~~~~~~~~ 346 (468) .+ +|+|||+++.| +||++|+||+|+|||+||+|+|+|+|+||+|||||||+|||++|+|+|++++.++...+. T Consensus 321 ~~~~~~aGv~d~~~~~d~~~~rw~~e~~k~L~~~i~~~an~I~~~T~r~~gn~ii~S~~Va~~L~~~g~~~~~~~~~~~~ 400 (519) T protein:vir:10 321 NTVGAKAGVFDFQDPIDIRGARWAGESFKALLFQIDKEAAEIARQTGRGAGNFIIASRNVVNVLAAVDTSVSYAAQGLGQ 400 (519) T ss_pred cCcccccceeecccccccccchHHHHHHHHHHHHHHHHHHHHHHhhccccccEEEEchHHHHHHhhccchhccccccccc Confidence 22 69999999976 999999999999999999999999999999999999999999999999999998776665 Q ss_pred ccccccccCCCceeEEEecCCeEEEEccccccCCCcceEEEEEecCCcccceeEEccccccccccccCCccccceeeeee Q lcl|NC_015288. 347 GPAIGTVDDTGNLAVGTINGRIKVYVDPYAANLSDKHYYVVGYKGTSPYDAGLFYCPYVPLQMVRSIDPNNFQPKIGFKT 426 (468) Q Consensus 347 ~~~~~~~D~t~~~~~G~l~~~~~vy~D~Ya~~~s~~dY~~vG~Kg~~~~d~glfyaPYv~~~~~~~~Dp~s~qP~~g~~t 426 (468) + .++|+++++|+|+|+|||+||||||+. +|||+|||||++++|+||||||||||+|+|++||+||||++|||| T Consensus 401 ~---~~~d~~~~~~~G~l~~~~~vy~D~y~~----~dy~~vG~KG~~~~~~glfyaPYv~l~~~~~~dp~sfqP~~g~~t 473 (519) T protein:vir:10 401 G---FNVDTTKAVFAGVLGGKYRVYIDQYAR----SDYFTIGYKGSNEMDAGIYYAPYVALTPLRGSDPKNFQPVMGFKT 473 (519) T ss_pred c---ccccCCCceEEEEecCceEEEecCCCC----cceEEEEEecCcccccceeeccccccccccccCCccccceeeeee Confidence 4 589999999999999999999999865 899999999999999999999999999999999999999999999 Q ss_pred ecceeecCcccccCc--ccccCChhh-h--hhccCceeeeEEeecc Q lcl|NC_015288. 427 RYGMVSNPFVTTNGL--YSGTPDGET-L--TPSTNMYYRRVQVTNL 467 (468) Q Consensus 427 RY~l~~nPf~~~~~~--~~~~~~~~~-~--~~~~N~y~r~~~v~~~ 467 (468) ||||++|||++..++ ..+++|+-+ | ..+.|.|||||+|||| T Consensus 474 RY~l~~NP~~~~~~~~~~~~i~~g~~~~a~~~~~n~y~r~v~v~~~ 519 (519) T protein:vir:10 474 RYGIGINPFADPAAQAPTKRIQNGMPDIVNSLGLNGYFRRVYVKGI 519 (519) T ss_pred eeceeecCcccccccCccceeccCchhhhccccCceeeeeeeeecC Confidence 999999999976433 456777632 2 2367999999999999 No 17 >protein:vir:5942 Length: 523 # NCBI annotation: similar to major head protein # Family: family:all:364 # MgeID: mge:123 # MgeName: RM 378 # Cross-refs: genbank:acc:NP_835728;genbank:gi:30044131 Probab=100.00 E-value=5.2e-194 Score=1080.24 Aligned_cols=406 Identities=27% Similarity=0.415 Sum_probs=336.6 Q ss_pred Ccc---hHHHHHhhhhhhcCCccccccchhhhhhhhhhhhhHHHHHhhhhhhhhhccccccCcccccccccccccccccc Q lcl|NC_015288. 1 MFN---AEHLQEKWSPVLNNEAANPIADRYKKAVTSVLLENQERFLREERGMLQEVAVNSLGAGTVSPGGSALGSANTAG 77 (468) Q Consensus 1 ~~~---~~~l~~kw~p~l~~~~~~~i~~~~~~~~~~~llenq~~~~~e~~~~l~e~~~~~~g~~~~~~~~~~~~st~tg~ 77 (468) |.. +|+|+|||+||||+ |++.|||+|+++|||||+| |++++|.| ++.|++ T Consensus 1 ~~~~~~~e~l~~kw~p~l~~-----~~~~~~~~~~a~llenq~~---~~~~~l~e-------------------~~~~~~ 53 (523) T protein:vir:59 1 MSQPKINEQLIEKWQPLLEG-----CRNDWERHTLATLLENQYR---EAKKHLME-------------------TTQTTE 53 (523) T ss_pred CCcchhhHHHHHhhhhhhcc-----cCChhHHHHHHHHhhhhhH---HHHHhhhh-------------------hhhccc Confidence 443 57999999999997 5677999999999999986 56677766 355788 Q ss_pred cccccceehhhhHHhhhhhhhhheeeeecCCccceeeeeeeeeecCCCCCcccccccc--------------cccccccc Q lcl|NC_015288. 78 LAGFDPVLISLVRRAMPNLMAYDVCGVQPMSGPTGLIFAMRSRYENQAGEEALFNEPD--------------AGFTAGLD 143 (468) Q Consensus 78 ~~~~~P~Lv~l~RRa~~~LIa~DI~GVQPmTGPTGLIFAMRsrY~~qsG~EA~fnEa~--------------t~fSG~~~ 143 (468) +++|.| ||+|+||++|||||+||||||||||||||||||||||.+|+|+|++|+++. +.|++... T Consensus 54 ~~~~~~-~~~~v~r~~p~l~a~DIWGVQPMTGPTGLIFAMRSRY~~q~gteA~yg~~~~~~~~a~~~~~ean~~~s~~~~ 132 (523) T protein:vir:59 54 VDGWNL-ALPIVRRVFANLRATDLVSVQPLSLPTGLVFYLDFKSPELPGNGSVYGGTGLTTDTATGGLYDENARLSRREY 132 (523) T ss_pred cccccc-hhhhhhhHhhhhhhhhccccccCCCCcceeEEEEeeccCCCCcccccCccccCcccccccccccccccccccc Confidence 999996 999999999999999999999999999999999999999999999987544 34443221 Q ss_pred ccccccccc-cc------------------cc-------------cccC-----------cc------------------ Q lcl|NC_015288. 144 ATTGAYTPR-TG------------------AG-------------VGGD-----------AE------------------ 162 (468) Q Consensus 144 ~~~~~~~~~-~~------------------~~-------------~~~~-----------~~------------------ 162 (468) ......... .+ +. ..+. .. T Consensus 133 ~~~~~~d~~~sg~~~~~~~a~stg~A~a~~s~si~k~~vTa~s~agta~~~li~A~~~q~itg~tga~fa~s~~~an~as 212 (523) T protein:vir:59 133 ETTITVDLATAQQATMRDVGFDTGIASLVSSGAVYYVDVPVASLPGVADVNTVRFWQYDDASGDPENTVAYPLPRYNRIV 212 (523) T ss_pred cCccCCCcccccccccccccccccchhhccccceeeeeccccccccccccccccccccccccccccccccchhhcccccc Confidence 111000000 00 00 0000 00 Q ss_pred ---------------ccccccc------ccccccccccccccchhhhhccCC------CCCccccceeEEEEEEEEeecc Q lcl|NC_015288. 163 ---------------GNNPALL------NDSSPGTYETPRGFSREDLEQAGD------AGKLFREMSFSIEKTSVTAKSR 215 (468) Q Consensus 163 ---------------g~~~~~~------~~a~~g~~t~~~gm~Ta~aE~lG~------~g~~f~EMaFsIeK~tVtAKSR 215 (468) +++.... .......++.+.||+++.+|.+|+ +++.|+||+|+||||+|||||| T Consensus 213 tAss~Al~gEA~t~~sTd~at~~~Gtt~t~~~~~lyt~~~g~~t~~~~~~~~~~~~~~~~~~~~eM~FsIeK~tVtAkSR 292 (523) T protein:vir:59 213 GAVGSALYARLFFVTGSDFATVAGGTPSTQDLDLVYYIDARNDFEDQSTDPDYPDPGFQSLDIPEINLELRSRPVATKTR 292 (523) T ss_pred ccccccccccccccccccccccCCCcccccccccccccccccchhhccccccccccccccccccceeeEEEeEEEeeecc Confidence 0000000 000122356678999999998863 4578999999999999999999 Q ss_pred cccceecHHHHHhHHHhh-CCChhHHHHHHHHHHHHHHhhHHHHHHHhhhcchhhccccccceeeeeecCCcchhH---- Q lcl|NC_015288. 216 ALKAEYTLELAQDLKAIH-GLDAEQELANILSSEVLAEINREVVRRVYSVAKPGAANNVANAGIFDLDVDSNGRWS---- 290 (468) Q Consensus 216 aLKAEYTvELAQDLkAiH-GLDAE~ELanILStEImlEINREII~~l~~vA~~~k~~~~~~~Gv~Dl~~~~~~rw~---- 290 (468) |||||||||||||||||| |||||+||+||||+||||||||||||+||+||+|||+.+++++|||||++++|++|. T Consensus 293 aLKAeYT~ELAQDLKAiH~GLDAE~ELanILStEImlEINR~ii~~~~~~a~~~~~~~~~~~g~~~~~~~~~~~~~~~~~ 372 (523) T protein:vir:59 293 KLRAAWTPEAMQDLAAYHKGVDLENEIVTLMSQYIAREIDLEILSTIMAHARRTDNYGFWSEVVGEYYDETSGNFVAGNF 372 (523) T ss_pred cccccccHHHHHHHHHHhcCCChhHHHHHHHHHHHHHHhhHHHHHhHhhhheeeeeccccccceeeecccccchhhhhhh Confidence 999999999999999999 999999999999999999999999999999999999999999999999999999997 Q ss_pred ----HHHHHHHHHHHHHHHHHHHHHhccCCccEEEEchhHHHHHhhccccccccccccccccccccccCCCceeEEEecC Q lcl|NC_015288. 291 ----VEKFKGLLFQIERDCNAIAQDTRRGKGNFLICSADVASALAMAGVLDYSSGLTGAGGPAIGTVDDTGNLAVGTING 366 (468) Q Consensus 291 ----~e~~~~l~~~i~~ean~i~q~T~rg~~n~~v~S~~va~~L~~~G~~~~~~~~~~~~~~~~~~~D~t~~~~~G~l~~ 366 (468) +|.||+|+|||++|+|+|+|+|+||+|||||||+|||++|++||||++... ...|+++.+|+|+|+| T Consensus 373 ~~~~~e~~~~l~~~~~~~~n~i~~~t~~~~~~~~~~s~~v~~~l~~~~~~~~~~~---------~~~~~~~~~~~g~l~~ 443 (523) T protein:vir:59 373 YGSKQEWLATLMIELNKVSNRIQQKTAVAGANFLVTSPQVAALLESMPGFTPGND---------NRDGGTGIFYVGMVQG 443 (523) T ss_pred hhhhHHHHHHHHHHHHHHHHHHHHhcccccccEEEEchhHHHHHHhccccccCCc---------cccccccceeEEEecC Confidence 899999999999999999999999999999999999999999999976532 3468899999999999 Q ss_pred CeEEEEccccccCCCcceEEEEEec-CCcccceeEEcccccccccccc-CCccccceeeeeeecceee-cCcccccCccc Q lcl|NC_015288. 367 RIKVYVDPYAANLSDKHYYVVGYKG-TSPYDAGLFYCPYVPLQMVRSI-DPNNFQPKIGFKTRYGMVS-NPFVTTNGLYS 443 (468) Q Consensus 367 ~~~vy~D~Ya~~~s~~dY~~vG~Kg-~~~~d~glfyaPYv~~~~~~~~-Dp~s~qP~~g~~tRY~l~~-nPf~~~~~~~~ 443 (468) ||+||||||+ ++|||+||||| .+++|+|||||||||+.+++.+ ||+||||++||||||||++ |||+.+.---. T Consensus 444 ~~~vy~d~~~----~~dy~~~g~k~~~~~~~~~~~y~Py~~l~~~~~~~dp~s~qp~~~~~tRY~l~v~nP~~~~~~~~~ 519 (523) T protein:vir:59 444 RYRLYKNIYQ----NQPVIIMGNQDLNTPWQTGAVYAPYVPLLFTPTIVDPVNFSYRRGLMTRYALEVVRPEFYGLLYVK 519 (523) T ss_pred ceEEEecCCC----CcceEEEEecccCCcccccceecccchhhcccccccCCcccceeeeeeehhheecchhHhhhhhhh Confidence 9999999986 48999999999 4699999999999999999996 9999999999999999986 99998731110 Q ss_pred ccCC Q lcl|NC_015288. 444 GTPD 447 (468) Q Consensus 444 ~~~~ 447 (468) -... T Consensus 520 ~~~~ 523 (523) T protein:vir:59 520 LLQP 523 (523) T ss_pred hcCC Confidence 0000 No 18 >protein:vir:81100 Length: 415 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:1891 # MgeName: tp310-1 # Cross-refs: genbank:acc:YP_001429874;genbank:gi:156603927;genbank:GeneID:5525320 Probab=95.64 E-value=0.0017 Score=35.78 Aligned_cols=341 Identities=15% Similarity=0.113 Sum_probs=129.6 Q ss_pred CcchH-------------HHHHhhhhhhc------CCccccccchhhhhhh-hhhhhhH-------------HHHHhhhh Q lcl|NC_015288. 1 MFNAE-------------HLQEKWSPVLN------NEAANPIADRYKKAVT-SVLLENQ-------------ERFLREER 47 (468) Q Consensus 1 ~~~~~-------------~l~~kw~p~l~------~~~~~~i~~~~~~~~~-~~llenq-------------~~~~~e~~ 47 (468) +++.+ .|.++..-+-+ .+....+...-+.... .+..+++ ...-.+.+ T Consensus 28 ~l~~~~~~~~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 107 (415) T protein:vir:81 28 ALNNDELEKAEKLEQEITDLRSQIQEKQEELDKLKEKDGTSENNQQSVEVNEARTYRNQANINDLGISIQNTKVTSQEVR 107 (415) T ss_pred HhchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccccchhhhHHHHHHHHHHhhhhhhhhhHHHHHH Confidence 11111 22222211100 0000000000000000 0000000 00000111 Q ss_pred hhhhhccccccCcccccccccccccccccccccccceeh--hhhHHhhhhhhhhheeeeecCCccceeeeeeeeeecCCC Q lcl|NC_015288. 48 GMLQEVAVNSLGAGTVSPGGSALGSANTAGLAGFDPVLI--SLVRRAMPNLMAYDVCGVQPMSGPTGLIFAMRSRYENQA 125 (468) Q Consensus 48 ~~l~e~~~~~~g~~~~~~~~~~~~st~tg~~~~~~P~Lv--~l~RRa~~~LIa~DI~GVQPmTGPTGLIFAMRsrY~~qs 125 (468) .+..... .+.. ....++++.+-...-|.-+ .++++..+...-.+++.|.||++..+-+--.| ..+. T Consensus 108 ~~~~~~~-----~~~~----~~~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~--~~~~- 175 (415) T protein:vir:81 108 DFTEYLE-----TRND----IQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVR--QSEV- 175 (415) T ss_pred HHHHHHh-----hhhh----hhhccccccccccccchHHHHHHHHHHHhhhhhhhheeeeeccCCceeEEEEe--ecCC- Confidence 1100000 0000 0000111111111122221 24455556677889999999999887654443 1110 Q ss_pred CCccccccccccccccccccccccccccccccccCcccccccccccccccccccccccchhhhhccCC-CCCccccceeE Q lcl|NC_015288. 126 GEEALFNEPDAGFTAGLDATTGAYTPRTGAGVGGDAEGNNPALLNDSSPGTYETPRGFSREDLEQAGD-AGKLFREMSFS 204 (468) Q Consensus 126 G~EA~fnEa~t~fSG~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~a~~g~~t~~~gm~Ta~aE~lG~-~g~~f~EMaFs 204 (468) ..+ .+-+ +.....+ +...|.+..|+ T Consensus 176 -~~~-------~~v~----------------------------------------------E~~~~~~~~~~~~~~v~~~ 201 (415) T protein:vir:81 176 -AAL-------EKVE----------------------------------------------ELEENPELAVKPFFQLAYD 201 (415) T ss_pred -ccc-------eeec----------------------------------------------cccccCcccccceeeEEee Confidence 000 0000 0000000 01234455555 Q ss_pred EEEEEEEeecccccceecHHHHHhHHHhhCCChhHHHHHHHHHHHHHHhhHHHHHHHhhhcchhhccccccceeeeeecC Q lcl|NC_015288. 205 IEKTSVTAKSRALKAEYTLELAQDLKAIHGLDAEQELANILSSEVLAEINREVVRRVYSVAKPGAANNVANAGIFDLDVD 284 (468) Q Consensus 205 IeK~tVtAKSRaLKAEYTvELAQDLkAiHGLDAE~ELanILStEImlEINREII~~l~~vA~~~k~~~~~~~Gv~Dl~~~ 284 (468) +.|. +-...+|-||.+|- ..|.+++|.+-|+..|..-+|+.||.-.-+-...+-.......++ -... T Consensus 202 ~~k~-------~~~~~iS~ell~ds----~~~l~~~i~~~l~~~~~~~~~~~il~g~g~g~~~~~~~~~~~~~~-~~~~- 268 (415) T protein:vir:81 202 INTH-------RGYFRISREAIEDA----KVNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGFEKEGK-KLEV- 268 (415) T ss_pred eeee-------EeeehhhHHHHhhc----hHHHHHHHHHHHHHHHHHHHHHHHhhccccCcccccccccccccc-cccc- Confidence 5444 44566999999984 357899999999999999999999875532211110000011011 0001 Q ss_pred CcchhHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEchhHHHHHhhccccccccccccccccccccccCCCceeEEEe Q lcl|NC_015288. 285 SNGRWSVEKFKGLLFQIERDCNAIAQDTRRGKGNFLICSADVASALAMAGVLDYSSGLTGAGGPAIGTVDDTGNLAVGTI 364 (468) Q Consensus 285 ~~~rw~~e~~~~l~~~i~~ean~i~q~T~rg~~n~~v~S~~va~~L~~~G~~~~~~~~~~~~~~~~~~~D~t~~~~~G~l 364 (468) ++.-..+.+..++..+. . .-.+.+.+||++.....|.. +..+.+ ......+.++ ...++| T Consensus 269 -~~~~~~~~i~~~~~~~~-------~--~~~~~~~~v~n~~~~~~l~~---lkd~~G------~~l~~~~~~~-~~~~~l 328 (415) T protein:vir:81 269 -KKAKSLDDIKDAINLNV-------K--PNYEHNVAIVSQTMFAKLDK---MKDKLG------NYLIQPDVKE-KTQQRL 328 (415) T ss_pred -ccccchhHHHHHHHhhh-------h--hccCCCEEEEcHHHHHHHHH---hhccCC------ceeeccCcCC-CCCcee Confidence 11111122222222221 1 11345568899999888853 222211 0011111111 122455 Q ss_pred cCCeEEEEccccccCCCcceEEEEEecCCcccceeEEcc----ccccc---cc-cccCCccccceeeeeeeccee-ecC- Q lcl|NC_015288. 365 NGRIKVYVDPYAANLSDKHYYVVGYKGTSPYDAGLFYCP----YVPLQ---MV-RSIDPNNFQPKIGFKTRYGMV-SNP- 434 (468) Q Consensus 365 ~~~~~vy~D~Ya~~~s~~dY~~vG~Kg~~~~d~glfyaP----Yv~~~---~~-~~~Dp~s~qP~~g~~tRY~l~-~nP- 434 (468) + +++|++.++.. .|-.|+. .++|+- |+-.+ +. ...|-..++..+....|++.. .+| T Consensus 329 ~-G~pV~~~~~~~---------~~~~~~~----~~~~Gd~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~r~d~~v~~~~ 394 (415) T protein:vir:81 329 L-GAKIEILPDEV---------LGQKGNN----TLIIGNLKDAIVLFDRSQYQASWTDYMHFGECLMIAVRQDCRILDYK 394 (415) T ss_pred c-ceeeEEecccc---------cCCCCcc----EEEEEehhccEEEEeecceEEEEeccccCceEEEEEEEeccEEeccc Confidence 3 46777664321 1111111 122221 21111 11 112445677778888899764 455 Q ss_pred -cccccCcccccCChhhhhhcc Q lcl|NC_015288. 435 -FVTTNGLYSGTPDGETLTPST 455 (468) Q Consensus 435 -f~~~~~~~~~~~~~~~~~~~~ 455 (468) |...+-..+-.+.| ++.--+ T Consensus 395 a~~~~~~~~~~~~~~-~~~~~~ 415 (415) T protein:vir:81 395 SAIVIEYDDSERGEG-DLGLEA 415 (415) T ss_pred cEEEEEEeccCCCCC-ccccCC Confidence 44332111111122 232222 No 19 >protein:vir:79987 Length: 415 # NCBI annotation: head protein # Family: family:all:21 # MgeID: mge:1875 # MgeName: tp310-3 # Cross-refs: genbank:acc:YP_001430002;genbank:gi:156604057;genbank:GeneID:5525447 Probab=95.64 E-value=0.0017 Score=35.78 Aligned_cols=341 Identities=15% Similarity=0.113 Sum_probs=129.6 Q ss_pred CcchH-------------HHHHhhhhhhc------CCccccccchhhhhhh-hhhhhhH-------------HHHHhhhh Q lcl|NC_015288. 1 MFNAE-------------HLQEKWSPVLN------NEAANPIADRYKKAVT-SVLLENQ-------------ERFLREER 47 (468) Q Consensus 1 ~~~~~-------------~l~~kw~p~l~------~~~~~~i~~~~~~~~~-~~llenq-------------~~~~~e~~ 47 (468) +++.+ .|.++..-+-+ .+....+...-+.... .+..+++ ...-.+.+ T Consensus 28 ~l~~~~~~~~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 107 (415) T protein:vir:79 28 ALNNDELEKAEKLEQEITDLRSQIQEKQEELDKLKEKDGTSENNQQSVEVNEARTYRNQANINDLGISIQNTKVTSQEVR 107 (415) T ss_pred HhchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccccchhhhHHHHHHHHHHhhhhhhhhhHHHHHH Confidence 11111 22222211100 0000000000000000 0000000 00000111 Q ss_pred hhhhhccccccCcccccccccccccccccccccccceeh--hhhHHhhhhhhhhheeeeecCCccceeeeeeeeeecCCC Q lcl|NC_015288. 48 GMLQEVAVNSLGAGTVSPGGSALGSANTAGLAGFDPVLI--SLVRRAMPNLMAYDVCGVQPMSGPTGLIFAMRSRYENQA 125 (468) Q Consensus 48 ~~l~e~~~~~~g~~~~~~~~~~~~st~tg~~~~~~P~Lv--~l~RRa~~~LIa~DI~GVQPmTGPTGLIFAMRsrY~~qs 125 (468) .+..... .+.. ....++++.+-...-|.-+ .++++..+...-.+++.|.||++..+-+--.| ..+. T Consensus 108 ~~~~~~~-----~~~~----~~~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~--~~~~- 175 (415) T protein:vir:79 108 DFTEYLE-----TRND----IQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVR--QSEV- 175 (415) T ss_pred HHHHHHh-----hhhh----hhhccccccccccccchHHHHHHHHHHHhhhhhhhheeeeeccCCceeEEEEe--ecCC- Confidence 1100000 0000 0000111111111122221 24455556677889999999999887654443 1110 Q ss_pred CCccccccccccccccccccccccccccccccccCcccccccccccccccccccccccchhhhhccCC-CCCccccceeE Q lcl|NC_015288. 126 GEEALFNEPDAGFTAGLDATTGAYTPRTGAGVGGDAEGNNPALLNDSSPGTYETPRGFSREDLEQAGD-AGKLFREMSFS 204 (468) Q Consensus 126 G~EA~fnEa~t~fSG~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~a~~g~~t~~~gm~Ta~aE~lG~-~g~~f~EMaFs 204 (468) ..+ .+-+ +.....+ +...|.+..|+ T Consensus 176 -~~~-------~~v~----------------------------------------------E~~~~~~~~~~~~~~v~~~ 201 (415) T protein:vir:79 176 -AAL-------EKVE----------------------------------------------ELEENPELAVKPFFQLAYD 201 (415) T ss_pred -ccc-------eeec----------------------------------------------cccccCcccccceeeEEee Confidence 000 0000 0000000 01234455555 Q ss_pred EEEEEEEeecccccceecHHHHHhHHHhhCCChhHHHHHHHHHHHHHHhhHHHHHHHhhhcchhhccccccceeeeeecC Q lcl|NC_015288. 205 IEKTSVTAKSRALKAEYTLELAQDLKAIHGLDAEQELANILSSEVLAEINREVVRRVYSVAKPGAANNVANAGIFDLDVD 284 (468) Q Consensus 205 IeK~tVtAKSRaLKAEYTvELAQDLkAiHGLDAE~ELanILStEImlEINREII~~l~~vA~~~k~~~~~~~Gv~Dl~~~ 284 (468) +.|. +-...+|-||.+|- ..|.+++|.+-|+..|..-+|+.||.-.-+-...+-.......++ -... T Consensus 202 ~~k~-------~~~~~iS~ell~ds----~~~l~~~i~~~l~~~~~~~~~~~il~g~g~g~~~~~~~~~~~~~~-~~~~- 268 (415) T protein:vir:79 202 INTH-------RGYFRISREAIEDA----KVNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGFEKEGK-KLEV- 268 (415) T ss_pred eeee-------EeeehhhHHHHhhc----hHHHHHHHHHHHHHHHHHHHHHHHhhccccCcccccccccccccc-cccc- Confidence 5444 44566999999984 357899999999999999999999875532211110000011011 0001 Q ss_pred CcchhHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEchhHHHHHhhccccccccccccccccccccccCCCceeEEEe Q lcl|NC_015288. 285 SNGRWSVEKFKGLLFQIERDCNAIAQDTRRGKGNFLICSADVASALAMAGVLDYSSGLTGAGGPAIGTVDDTGNLAVGTI 364 (468) Q Consensus 285 ~~~rw~~e~~~~l~~~i~~ean~i~q~T~rg~~n~~v~S~~va~~L~~~G~~~~~~~~~~~~~~~~~~~D~t~~~~~G~l 364 (468) ++.-..+.+..++..+. . .-.+.+.+||++.....|.. +..+.+ ......+.++ ...++| T Consensus 269 -~~~~~~~~i~~~~~~~~-------~--~~~~~~~~v~n~~~~~~l~~---lkd~~G------~~l~~~~~~~-~~~~~l 328 (415) T protein:vir:79 269 -KKAKSLDDIKDAINLNV-------K--PNYEHNVAIVSQTMFAKLDK---MKDKLG------NYLIQPDVKE-KTQQRL 328 (415) T ss_pred -ccccchhHHHHHHHhhh-------h--hccCCCEEEEcHHHHHHHHH---hhccCC------ceeeccCcCC-CCCcee Confidence 11111122222222221 1 11345568899999888853 222211 0011111111 122455 Q ss_pred cCCeEEEEccccccCCCcceEEEEEecCCcccceeEEcc----ccccc---cc-cccCCccccceeeeeeeccee-ecC- Q lcl|NC_015288. 365 NGRIKVYVDPYAANLSDKHYYVVGYKGTSPYDAGLFYCP----YVPLQ---MV-RSIDPNNFQPKIGFKTRYGMV-SNP- 434 (468) Q Consensus 365 ~~~~~vy~D~Ya~~~s~~dY~~vG~Kg~~~~d~glfyaP----Yv~~~---~~-~~~Dp~s~qP~~g~~tRY~l~-~nP- 434 (468) + +++|++.++.. .|-.|+. .++|+- |+-.+ +. ...|-..++..+....|++.. .+| T Consensus 329 ~-G~pV~~~~~~~---------~~~~~~~----~~~~Gd~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~r~d~~v~~~~ 394 (415) T protein:vir:79 329 L-GAKIEILPDEV---------LGQKGNN----TLIIGNLKDAIVLFDRSQYQASWTDYMHFGECLMIAVRQDCRILDYK 394 (415) T ss_pred c-ceeeEEecccc---------cCCCCcc----EEEEEehhccEEEEeecceEEEEeccccCceEEEEEEEeccEEeccc Confidence 3 46777664321 1111111 122221 21111 11 112445677778888899764 455 Q ss_pred -cccccCcccccCChhhhhhcc Q lcl|NC_015288. 435 -FVTTNGLYSGTPDGETLTPST 455 (468) Q Consensus 435 -f~~~~~~~~~~~~~~~~~~~~ 455 (468) |...+-..+-.+.| ++.--+ T Consensus 395 a~~~~~~~~~~~~~~-~~~~~~ 415 (415) T protein:vir:79 395 SAIVIEYDDSERGEG-DLGLEA 415 (415) T ss_pred cEEEEEEeccCCCCC-ccccCC Confidence 44332111111122 232222 No 20 >protein:vir:98339 Length: 415 # NCBI annotation: putative capsid protein # Family: family:all:21 # MgeID: mge:1581 # MgeName: phiPVL(108) # Cross-refs: genbank:acc:YP_918931;genbank:gi:119443693;genbank:GeneID:4594501 Probab=95.64 E-value=0.0017 Score=35.78 Aligned_cols=341 Identities=15% Similarity=0.113 Sum_probs=129.6 Q ss_pred CcchH-------------HHHHhhhhhhc------CCccccccchhhhhhh-hhhhhhH-------------HHHHhhhh Q lcl|NC_015288. 1 MFNAE-------------HLQEKWSPVLN------NEAANPIADRYKKAVT-SVLLENQ-------------ERFLREER 47 (468) Q Consensus 1 ~~~~~-------------~l~~kw~p~l~------~~~~~~i~~~~~~~~~-~~llenq-------------~~~~~e~~ 47 (468) +++.+ .|.++..-+-+ .+....+...-+.... .+..+++ ...-.+.+ T Consensus 28 ~l~~~~~~~~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 107 (415) T protein:vir:98 28 ALNNDELEKAEKLEQEITDLRSQIQEKQEELDKLKEKDGTSENNQQSVEVNEARTYRNQANINDLGISIQNTKVTSQEVR 107 (415) T ss_pred HhchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcccccccchhhhHHHHHHHHHHhhhhhhhhhHHHHHH Confidence 11111 22222211100 0000000000000000 0000000 00000111 Q ss_pred hhhhhccccccCcccccccccccccccccccccccceeh--hhhHHhhhhhhhhheeeeecCCccceeeeeeeeeecCCC Q lcl|NC_015288. 48 GMLQEVAVNSLGAGTVSPGGSALGSANTAGLAGFDPVLI--SLVRRAMPNLMAYDVCGVQPMSGPTGLIFAMRSRYENQA 125 (468) Q Consensus 48 ~~l~e~~~~~~g~~~~~~~~~~~~st~tg~~~~~~P~Lv--~l~RRa~~~LIa~DI~GVQPmTGPTGLIFAMRsrY~~qs 125 (468) .+..... .+.. ....++++.+-...-|.-+ .++++..+...-.+++.|.||++..+-+--.| ..+. T Consensus 108 ~~~~~~~-----~~~~----~~~~~~~~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~--~~~~- 175 (415) T protein:vir:98 108 DFTEYLE-----TRND----IQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVR--QSEV- 175 (415) T ss_pred HHHHHHh-----hhhh----hhhccccccccccccchHHHHHHHHHHHhhhhhhhheeeeeccCCceeEEEEe--ecCC- Confidence 1100000 0000 0000111111111122221 24455556677889999999999887654443 1110 Q ss_pred CCccccccccccccccccccccccccccccccccCcccccccccccccccccccccccchhhhhccCC-CCCccccceeE Q lcl|NC_015288. 126 GEEALFNEPDAGFTAGLDATTGAYTPRTGAGVGGDAEGNNPALLNDSSPGTYETPRGFSREDLEQAGD-AGKLFREMSFS 204 (468) Q Consensus 126 G~EA~fnEa~t~fSG~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~a~~g~~t~~~gm~Ta~aE~lG~-~g~~f~EMaFs 204 (468) ..+ .+-+ +.....+ +...|.+..|+ T Consensus 176 -~~~-------~~v~----------------------------------------------E~~~~~~~~~~~~~~v~~~ 201 (415) T protein:vir:98 176 -AAL-------EKVE----------------------------------------------ELEENPELAVKPFFQLAYD 201 (415) T ss_pred -ccc-------eeec----------------------------------------------cccccCcccccceeeEEee Confidence 000 0000 0000000 01234455555 Q ss_pred EEEEEEEeecccccceecHHHHHhHHHhhCCChhHHHHHHHHHHHHHHhhHHHHHHHhhhcchhhccccccceeeeeecC Q lcl|NC_015288. 205 IEKTSVTAKSRALKAEYTLELAQDLKAIHGLDAEQELANILSSEVLAEINREVVRRVYSVAKPGAANNVANAGIFDLDVD 284 (468) Q Consensus 205 IeK~tVtAKSRaLKAEYTvELAQDLkAiHGLDAE~ELanILStEImlEINREII~~l~~vA~~~k~~~~~~~Gv~Dl~~~ 284 (468) +.|. +-...+|-||.+|- ..|.+++|.+-|+..|..-+|+.||.-.-+-...+-.......++ -... T Consensus 202 ~~k~-------~~~~~iS~ell~ds----~~~l~~~i~~~l~~~~~~~~~~~il~g~g~g~~~~~~~~~~~~~~-~~~~- 268 (415) T protein:vir:98 202 INTH-------RGYFRISREAIEDA----KVNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGFEKEGK-KLEV- 268 (415) T ss_pred eeee-------EeeehhhHHHHhhc----hHHHHHHHHHHHHHHHHHHHHHHHhhccccCcccccccccccccc-cccc- Confidence 5444 44566999999984 357899999999999999999999875532211110000011011 0001 Q ss_pred CcchhHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEchhHHHHHhhccccccccccccccccccccccCCCceeEEEe Q lcl|NC_015288. 285 SNGRWSVEKFKGLLFQIERDCNAIAQDTRRGKGNFLICSADVASALAMAGVLDYSSGLTGAGGPAIGTVDDTGNLAVGTI 364 (468) Q Consensus 285 ~~~rw~~e~~~~l~~~i~~ean~i~q~T~rg~~n~~v~S~~va~~L~~~G~~~~~~~~~~~~~~~~~~~D~t~~~~~G~l 364 (468) ++.-..+.+..++..+. . .-.+.+.+||++.....|.. +..+.+ ......+.++ ...++| T Consensus 269 -~~~~~~~~i~~~~~~~~-------~--~~~~~~~~v~n~~~~~~l~~---lkd~~G------~~l~~~~~~~-~~~~~l 328 (415) T protein:vir:98 269 -KKAKSLDDIKDAINLNV-------K--PNYEHNVAIVSQTMFAKLDK---MKDKLG------NYLIQPDVKE-KTQQRL 328 (415) T ss_pred -ccccchhHHHHHHHhhh-------h--hccCCCEEEEcHHHHHHHHH---hhccCC------ceeeccCcCC-CCCcee Confidence 11111122222222221 1 11345568899999888853 222211 0011111111 122455 Q ss_pred cCCeEEEEccccccCCCcceEEEEEecCCcccceeEEcc----ccccc---cc-cccCCccccceeeeeeeccee-ecC- Q lcl|NC_015288. 365 NGRIKVYVDPYAANLSDKHYYVVGYKGTSPYDAGLFYCP----YVPLQ---MV-RSIDPNNFQPKIGFKTRYGMV-SNP- 434 (468) Q Consensus 365 ~~~~~vy~D~Ya~~~s~~dY~~vG~Kg~~~~d~glfyaP----Yv~~~---~~-~~~Dp~s~qP~~g~~tRY~l~-~nP- 434 (468) + +++|++.++.. .|-.|+. .++|+- |+-.+ +. ...|-..++..+....|++.. .+| T Consensus 329 ~-G~pV~~~~~~~---------~~~~~~~----~~~~Gd~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~r~d~~v~~~~ 394 (415) T protein:vir:98 329 L-GAKIEILPDEV---------LGQKGNN----TLIIGNLKDAIVLFDRSQYQASWTDYMHFGECLMIAVRQDCRILDYK 394 (415) T ss_pred c-ceeeEEecccc---------cCCCCcc----EEEEEehhccEEEEeecceEEEEeccccCceEEEEEEEeccEEeccc Confidence 3 46777664321 1111111 122221 21111 11 112445677778888899764 455 Q ss_pred -cccccCcccccCChhhhhhcc Q lcl|NC_015288. 435 -FVTTNGLYSGTPDGETLTPST 455 (468) Q Consensus 435 -f~~~~~~~~~~~~~~~~~~~~ 455 (468) |...+-..+-.+.| ++.--+ T Consensus 395 a~~~~~~~~~~~~~~-~~~~~~ 415 (415) T protein:vir:98 395 SAIVIEYDDSERGEG-DLGLEA 415 (415) T ss_pred cEEEEEEeccCCCCC-ccccCC Confidence 44332111111122 232222 No 21 >protein:vir:4830 Length: 397 # NCBI annotation: MPL-7201 # Family: family:all:21 # MgeID: mge:105 # MgeName: 7201 # Cross-refs: genbank:acc:NP_038327;genbank:gi:9634653;genbank:GeneID:1262632 Probab=95.42 E-value=0.0014 Score=36.25 Aligned_cols=327 Identities=14% Similarity=0.112 Sum_probs=116.7 Q ss_pred CcchHH----------HHHhhhhhhcC------CccccccchhhhhhhhhhhhhHHHHHhhhhhhhhhccccccCccccc Q lcl|NC_015288. 1 MFNAEH----------LQEKWSPVLNN------EAANPIADRYKKAVTSVLLENQERFLREERGMLQEVAVNSLGAGTVS 64 (468) Q Consensus 1 ~~~~~~----------l~~kw~p~l~~------~~~~~i~~~~~~~~~~~llenq~~~~~e~~~~l~e~~~~~~g~~~~~ 64 (468) -...|+ |.++=.-+.+. ..........++ ....-.+++ ..+....+.+.+... ... T Consensus 33 ~~~~ee~~~l~~ei~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~---~~~~~~~~~~~~~~~----~~~ 104 (397) T protein:vir:48 33 SVTAEELQAIKNERDTAKMKRDMFKEQYTEARANEVVNMSEEEKK-PLTKSEEEV---KAGFVKDFKNLVRGR----YQN 104 (397) T ss_pred hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhccc-cccchhhHH---HHHHHHHHHHHHhhh----hhH Confidence 000011 11110000000 000000000000 000000111 111111111111000 000 Q ss_pred cccccccccc-ccccc---cccceehhhhHHhhhhhhhhheeeeecCCccceeeeeeeeeecCCCCCccccccccccccc Q lcl|NC_015288. 65 PGGSALGSAN-TAGLA---GFDPVLISLVRRAMPNLMAYDVCGVQPMSGPTGLIFAMRSRYENQAGEEALFNEPDAGFTA 140 (468) Q Consensus 65 ~~~~~~~st~-tg~~~---~~~P~Lv~l~RRa~~~LIa~DI~GVQPmTGPTGLIFAMRsrY~~qsG~EA~fnEa~t~fSG 140 (468) .......+++ .|+.. .+.+.++.+. .+...-.+++.++||++++|-+--.+ ..+..+.-. |- T Consensus 105 ~~~~~~~~t~~~gg~~iP~~~~~~ii~~~---~~~~~l~~~~~~~~~~~~~~~~~~~~--~~~~~~~a~--------~v- 170 (397) T protein:vir:48 105 LLDSKTDASGSDAGLTIPQDIQTAIHTLV---RQYDSLQEYVNVENVTTLTGSRVYEK--WADITGLAK--------LD- 170 (397) T ss_pred HHHHhhccCCccccccccHHHHHHHHHHH---HHHHHHHhhhceeeccCCcceEEEEe--ecCCCccee--------ee- Confidence 0000111111 12211 2223343333 45556688899999999988665444 111111000 00 Q ss_pred cccccccccccccccccccCcccccccccccccccccccccccchhhhhccCCC-CCccccceeEEEEEEEEeecccccc Q lcl|NC_015288. 141 GLDATTGAYTPRTGAGVGGDAEGNNPALLNDSSPGTYETPRGFSREDLEQAGDA-GKLFREMSFSIEKTSVTAKSRALKA 219 (468) Q Consensus 141 ~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~a~~g~~t~~~gm~Ta~aE~lG~~-g~~f~EMaFsIeK~tVtAKSRaLKA 219 (468) ++.+...+. ...|.+..|++.|..+ .. T Consensus 171 ---------------------------------------------~E~~~~~~~~~~~~~~v~~~~~k~~~-------~~ 198 (397) T protein:vir:48 171 ---------------------------------------------DEAGSIGTNDDPKLYPIRYAIKRYAG-------IS 198 (397) T ss_pred ---------------------------------------------ccccccccccccceeeEEeeheeeee-------eh Confidence 000000111 1245555555555543 46 Q ss_pred eecHHHHHhHHHhhCCChhHHHHHHHHHHHHHHhhHHHHHHHhhhcchhhccccccceeeeeecCCcchhHHHHHHHHHH Q lcl|NC_015288. 220 EYTLELAQDLKAIHGLDAEQELANILSSEVLAEINREVVRRVYSVAKPGAANNVANAGIFDLDVDSNGRWSVEKFKGLLF 299 (468) Q Consensus 220 EYTvELAQDLkAiHGLDAE~ELanILStEImlEINREII~~l~~vA~~~k~~~~~~~Gv~Dl~~~~~~rw~~e~~~~l~~ 299 (468) .+|-||.+|-. +|.+++|.+-|+..|..-+|+.||.-.-+ +....++.++ +....++. T Consensus 199 ~iS~ell~ds~----~~l~~~v~~~l~~~~~~~~d~~il~G~g~--------~~~~~~~~~~----------d~i~~~~~ 256 (397) T protein:vir:48 199 TVTNSLLADSA----ENILAWLSGWIAKKVVVTRNKAILEAIAT--------LPTKPTLTKW----------DDIIDLQA 256 (397) T ss_pred hhHHHHHhhch----HHHHHHHHHHHHHHHHHHHHHHHhhcccc--------cccccccccH----------HHHHHHHH Confidence 79999999853 57899999999999999999998863211 1111122211 11222222 Q ss_pred HHHHHHHHHHHHhccCCccEEEEchhHHHHHhhccccccccccccccccccccccCCCceeEEEecCCeEEEE--ccccc Q lcl|NC_015288. 300 QIERDCNAIAQDTRRGKGNFLICSADVASALAMAGVLDYSSGLTGAGGPAIGTVDDTGNLAVGTINGRIKVYV--DPYAA 377 (468) Q Consensus 300 ~i~~ean~i~q~T~rg~~n~~v~S~~va~~L~~~G~~~~~~~~~~~~~~~~~~~D~t~~~~~G~l~~~~~vy~--D~Ya~ 377 (468) .+. .-- ..+..+||++...+.|.. +..+.+ ..+...|-+.. .-++|+| ++|++ |.+.. T Consensus 257 ~l~--------~~~-~~~a~~v~n~~~~~~L~~---lkd~~G------~~i~~~~~~~~-~~~~l~G-~PV~~~~~~~~~ 316 (397) T protein:vir:48 257 KVD--------PAI-KQTSFFLTNTSGFTALKK---VKNAFG------DYLMERDVKSP-TGYSIDG-FAVKEVADRWLA 316 (397) T ss_pred Hhh--------hhh-cCCCEEEECHHHHHHHHH---hhcCCC------ceeeccCcCCC-CCceecc-ceeEEecccccC Confidence 221 111 234567899999988853 222111 11111221111 1245544 45443 22221 Q ss_pred cCC-C---------cceEEEEEecCCcccceeEEccccccccccccCCccccceeeeeeeccee-ecC--cccc--cCcc Q lcl|NC_015288. 378 NLS-D---------KHYYVVGYKGTSPYDAGLFYCPYVPLQMVRSIDPNNFQPKIGFKTRYGMV-SNP--FVTT--NGLY 442 (468) Q Consensus 378 ~~s-~---------~dY~~vG~Kg~~~~d~glfyaPYv~~~~~~~~Dp~s~qP~~g~~tRY~l~-~nP--f~~~--~~~~ 442 (468) +.. + .+|++++..+..... ..++.. .+-...+-.+-...|++.. .|| |... ..-. T Consensus 317 ~~~~~~~~~~~gd~~~~~~~~~~~~~~i~----~~~~~~------~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~~~~ 386 (397) T protein:vir:48 317 NASSGAMPLYFGDLKQAVTLFDRQQMSLL----STNIGG------GAFETDTTKIRVIDRFDVVATDTESFVPASFKAIA 386 (397) T ss_pred CcCCCceEEEEEeccceEEEEeecceEEE----Eeccch------hhhhcCceeEEEEeeeccEEecccceEEEEecccc Confidence 111 1 123333333222211 111110 0112223344444455432 233 2211 1111 Q ss_pred cccCChhhhhhccCceeeeEEe Q lcl|NC_015288. 443 SGTPDGETLTPSTNMYYRRVQV 464 (468) Q Consensus 443 ~~~~~~~~~~~~~N~y~r~~~v 464 (468) +..++.... -| T Consensus 387 ~~~~~~~~~-----------~~ 397 (397) T protein:vir:48 387 DQKGNLGST-----------AV 397 (397) T ss_pred cCCCCcccc-----------CC Confidence 111111111 11 No 22 >protein:vir:4953 Length: 397 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:108 # MgeName: Sfi19 # Cross-refs: genbank:acc:NP_049929;genbank:gi:9632900;genbank:GeneID:1262076 Probab=95.39 E-value=0.0015 Score=36.00 Aligned_cols=327 Identities=13% Similarity=0.117 Sum_probs=122.6 Q ss_pred CcchHHHHHhhhhhhcCCcc----------------ccccchhhhhhhhhhhhhHHHHHhhhhhhhhhccccccCccccc Q lcl|NC_015288. 1 MFNAEHLQEKWSPVLNNEAA----------------NPIADRYKKAVTSVLLENQERFLREERGMLQEVAVNSLGAGTVS 64 (468) Q Consensus 1 ~~~~~~l~~kw~p~l~~~~~----------------~~i~~~~~~~~~~~llenq~~~~~e~~~~l~e~~~~~~g~~~~~ 64 (468) -...|++.+...-+-+.+.. .......++.+... +++ ...+.+..+.+.+.. +... T Consensus 33 ~~~~ee~~~~~~~i~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~-~~~~~~~~~~~~l~~----~~~~ 104 (397) T protein:vir:49 33 SVSAEELQAIKNERDTAKMKRDMFKEQYTEARANEVANMSEEEKKPLTKS---EEE-VKAGFVKDFKNLVRG----RYQN 104 (397) T ss_pred hcCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccccccccccc---hhH-HHHHHHHHHHHHHhc----chhH Confidence 11222232222222110000 00000000000000 000 000111111111000 0000 Q ss_pred ccccccccccc-cccc---cccceehhhhHHhhhhhhhhheeeeecCCccceeeeeeeeeecCCCCCccccccccccccc Q lcl|NC_015288. 65 PGGSALGSANT-AGLA---GFDPVLISLVRRAMPNLMAYDVCGVQPMSGPTGLIFAMRSRYENQAGEEALFNEPDAGFTA 140 (468) Q Consensus 65 ~~~~~~~st~t-g~~~---~~~P~Lv~l~RRa~~~LIa~DI~GVQPmTGPTGLIFAMRsrY~~qsG~EA~fnEa~t~fSG 140 (468) .-.....++++ |+.. .+.+.+ ++...+..+-.++|.++||++++|-+.=++ ..+.++. + .|- T Consensus 105 ~~~~~~~~t~~~gg~~vP~~~~~~i---i~~~~~~~~l~~~~~~~~~~~~~~~~~~~~--~~~~~~~-a-------~~v- 170 (397) T protein:vir:49 105 LLDSKTDASGSDAGLTIPQDIQTAI---HTLVSQYDSLQEYVNVENVTTLTGSRVYEK--WTDITGL-A-------NID- 170 (397) T ss_pred HHHHhhccccccCcccccHhHHHHH---HHHHHhhhhHHhhhceeecccCccceEEEe--eccCCcc-e-------eee- Confidence 00000111111 2221 122333 444445667788899999999988544333 1111110 0 000 Q ss_pred cccccccccccccccccccCcccccccccccccccccccccccchhhhhccCC-CCCccccceeEEEEEEEEeecccccc Q lcl|NC_015288. 141 GLDATTGAYTPRTGAGVGGDAEGNNPALLNDSSPGTYETPRGFSREDLEQAGD-AGKLFREMSFSIEKTSVTAKSRALKA 219 (468) Q Consensus 141 ~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~a~~g~~t~~~gm~Ta~aE~lG~-~g~~f~EMaFsIeK~tVtAKSRaLKA 219 (468) ++++...+ +...|.++.|++.|. +-.. T Consensus 171 ---------------------------------------------~E~~~~~~~~~~~~~~i~~~~~k~-------~~~~ 198 (397) T protein:vir:49 171 ---------------------------------------------DEAGKIADVDDPKLSLIKYTIKRY-------AGIS 198 (397) T ss_pred ---------------------------------------------cCccccccccccceeeEEeeeeeE-------Eeee Confidence 00000000 112355555555544 4446 Q ss_pred eecHHHHHhHHHhhCCChhHHHHHHHHHHHHHHhhHHHHHHHhhhcchhhccccccceeeeeecCCcchhHHHHHHHHHH Q lcl|NC_015288. 220 EYTLELAQDLKAIHGLDAEQELANILSSEVLAEINREVVRRVYSVAKPGAANNVANAGIFDLDVDSNGRWSVEKFKGLLF 299 (468) Q Consensus 220 EYTvELAQDLkAiHGLDAE~ELanILStEImlEINREII~~l~~vA~~~k~~~~~~~Gv~Dl~~~~~~rw~~e~~~~l~~ 299 (468) .+|-||.+|-. .|.+++|.+-|+..|..-+|+.||.-.-+.. ...|+.++ +....+.+ T Consensus 199 ~iS~ell~ds~----~~l~~~i~~~l~~~~~~~~d~ai~~G~g~~~--------~~~~~~~~----------d~i~~~~~ 256 (397) T protein:vir:49 199 TVTNSLLADSA----ENILAWLSGWIAKKVVVTRNKAILEAIAALP--------TKPTLTKW----------DDIIDLEA 256 (397) T ss_pred hhHHHHHhhhH----HHHHHHHHHHHHHHHHHHHHHHHHhhccccc--------cccccccH----------HHHHHHHH Confidence 68999999853 5789999999999999999999987432211 22233222 22223333 Q ss_pred HHHHHHHHHHHHhccCCccEEEEchhHHHHHhhccccccccccccccccccccccCCCceeEEEecCCeEEEE--ccccc Q lcl|NC_015288. 300 QIERDCNAIAQDTRRGKGNFLICSADVASALAMAGVLDYSSGLTGAGGPAIGTVDDTGNLAVGTINGRIKVYV--DPYAA 377 (468) Q Consensus 300 ~i~~ean~i~q~T~rg~~n~~v~S~~va~~L~~~G~~~~~~~~~~~~~~~~~~~D~t~~~~~G~l~~~~~vy~--D~Ya~ 377 (468) .+.. . - .....+|+++.....|.. +..+.+ + .....|.++ ...++|+ |++|++ |.+.. T Consensus 257 ~l~~-------~-~-~~~a~~vmn~~~~~~l~~---lkd~~G---~---~l~~~~~~~-~~~~~l~-G~PV~~~~~~~~~ 316 (397) T protein:vir:49 257 KVDP-------A-I-KQTSFFLTNTSGFTALKK---VKNALG---D---YLMERDVKS-PTGYSID-GFAVKEVADRWLA 316 (397) T ss_pred hhhh-------h-h-cCCCEEEEcHHHHHHHHH---hhcCCC---c---eeeccCcCC-CCCceec-ceeeEEecccccc Confidence 3321 1 1 223567889999888854 222211 1 111112111 1224564 456654 33332 Q ss_pred cCCCc----------ceEEEEEecCCcccceeEEccccccccccccCCccccceeeeeeeccee-ecC--cccc--cCcc Q lcl|NC_015288. 378 NLSDK----------HYYVVGYKGTSPYDAGLFYCPYVPLQMVRSIDPNNFQPKIGFKTRYGMV-SNP--FVTT--NGLY 442 (468) Q Consensus 378 ~~s~~----------dY~~vG~Kg~~~~d~glfyaPYv~~~~~~~~Dp~s~qP~~g~~tRY~l~-~nP--f~~~--~~~~ 442 (468) +.... +|++++.++..+ +=+.+|... +-...+-.+-...|++.. .|| |... ..-. T Consensus 317 ~~~~~~~~i~~gd~~~~~~~~~~~~~~----i~~~~~~~~------~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~~~~~ 386 (397) T protein:vir:49 317 NGTGGAMPLYFGDLKQAVTLFDRQHMS----LLSTNIGGG------AFETDTTKVRVIDRFDVVATDTEAFVPASFKAIA 386 (397) T ss_pred cccCCceeEEEeeccceEEEEeecceE----EEEeccccc------hhhcCceeEEEEeeeCcEEecccceEEEEeeccc Confidence 22211 233333332222 222333211 112333344445555543 233 2221 1111 Q ss_pred cccCChhhhhhccCceeeeEEe Q lcl|NC_015288. 443 SGTPDGETLTPSTNMYYRRVQV 464 (468) Q Consensus 443 ~~~~~~~~~~~~~N~y~r~~~v 464 (468) +.-++-.. +.| T Consensus 387 ~~~~~~~~-----------~~~ 397 (397) T protein:vir:49 387 DQKGNLGS-----------TAV 397 (397) T ss_pred CCCCCccc-----------ccC Confidence 10011100 011 No 23 >protein:vir:9410 Length: 415 # NCBI annotation: head protein # Family: family:all:21 # MgeID: mge:167 # MgeName: phi 13 # Cross-refs: genbank:acc:NP_803388;genbank:gi:29028700;genbank:GeneID:1258136 Probab=94.68 E-value=0.0038 Score=33.87 Aligned_cols=348 Identities=13% Similarity=0.099 Sum_probs=127.1 Q ss_pred CcchH-------------HHHHhhhhh-------hcCCccccccchhhhhh-------------hhhhhhhHHHHHhhhh Q lcl|NC_015288. 1 MFNAE-------------HLQEKWSPV-------LNNEAANPIADRYKKAV-------------TSVLLENQERFLREER 47 (468) Q Consensus 1 ~~~~~-------------~l~~kw~p~-------l~~~~~~~i~~~~~~~~-------------~~~llenq~~~~~e~~ 47 (468) +++.+ .|.++..-+ .+.+...+-........ ...-+.+....-.|.+ T Consensus 28 ~~~~~~~e~~~~~~~ei~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~ 107 (415) T protein:vir:94 28 ALNNDELEKAEKLEQEITDLRSQIQEKQEELDKLKEKDGTSENNQQSVEVNEASTYRNQANINDLGISIQNTKVTSQEVR 107 (415) T ss_pred HhchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccccchhhHHHHHHHHHHHhhhhhhhhhHHHHH Confidence 11111 111111111 00000000000000000 0000000000001111 Q ss_pred hhhhhccccccCccccccccccccc--ccccccccccceehhhhHHhhhhhhhhheeeeecCCccceeeeeeeeeecCCC Q lcl|NC_015288. 48 GMLQEVAVNSLGAGTVSPGGSALGS--ANTAGLAGFDPVLISLVRRAMPNLMAYDVCGVQPMSGPTGLIFAMRSRYENQA 125 (468) Q Consensus 48 ~~l~e~~~~~~g~~~~~~~~~~~~s--t~tg~~~~~~P~Lv~l~RRa~~~LIa~DI~GVQPmTGPTGLIFAMRsrY~~qs 125 (468) .+...... ........ +.+|+..--....-.+++...+..+-.+++.++||++..+-+--.+ ..+. T Consensus 108 ~~~~~~~~---------~~~~~~~~~~~~~g~~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~--~~~~- 175 (415) T protein:vir:94 108 DFTEYLET---------RNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVR--QSEV- 175 (415) T ss_pred HHHHHhhh---------hhhhhhhccccccccccCcHHHHHHHHHHHHhhhhhhhhcceeeccCCceeEEEEe--ecCC- Confidence 11111000 00000011 1112222111122234555556778899999999998776543333 1110 Q ss_pred CCccccccccccccccccccccccccccccccccCcccccccccccccccccccccccchhhhhccCC-CCCccccceeE Q lcl|NC_015288. 126 GEEALFNEPDAGFTAGLDATTGAYTPRTGAGVGGDAEGNNPALLNDSSPGTYETPRGFSREDLEQAGD-AGKLFREMSFS 204 (468) Q Consensus 126 G~EA~fnEa~t~fSG~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~a~~g~~t~~~gm~Ta~aE~lG~-~g~~f~EMaFs 204 (468) .++ .|-+ ++....+ +...|.+..|+ T Consensus 176 -~~~-------~~v~----------------------------------------------Eg~~~~~~~~~~~~~i~~~ 201 (415) T protein:vir:94 176 -AAL-------EKVE----------------------------------------------ELEENPELAVKPFFQLAYD 201 (415) T ss_pred -ccc-------eecc----------------------------------------------ccccccccccccceeeEee Confidence 000 0000 0000000 01235555555 Q ss_pred EEEEEEEeecccccceecHHHHHhHHHhhCCChhHHHHHHHHHHHHHHhhHHHHHHHhhhcchhhccccccceeeeeecC Q lcl|NC_015288. 205 IEKTSVTAKSRALKAEYTLELAQDLKAIHGLDAEQELANILSSEVLAEINREVVRRVYSVAKPGAANNVANAGIFDLDVD 284 (468) Q Consensus 205 IeK~tVtAKSRaLKAEYTvELAQDLkAiHGLDAE~ELanILStEImlEINREII~~l~~vA~~~k~~~~~~~Gv~Dl~~~ 284 (468) +.|.. -.-.+|-||.+|-- +|.+++|.+-|...|..-+|+.||.-.-+-.-.+-.......++ -...+ T Consensus 202 ~~k~~-------~~~~is~ell~ds~----~~~~~~i~~~l~~~~~~~~~~~il~g~g~g~~~~~~~~~~~~~~-~~~~~ 269 (415) T protein:vir:94 202 INTHR-------GYFRISREAIEDAK----VNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGFEKEGK-KLEVK 269 (415) T ss_pred heeee-------eechhhHHHHhhch----HHHHHHHHHHHHHHHHHHHHHHHhhccccCcccccccccccccc-ccccc Confidence 55554 44569999999864 47899999999999999999999875432221110000010000 00000 Q ss_pred CcchhHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEchhHHHHHhhccccccccccccccccccccccCCCceeEEEe Q lcl|NC_015288. 285 SNGRWSVEKFKGLLFQIERDCNAIAQDTRRGKGNFLICSADVASALAMAGVLDYSSGLTGAGGPAIGTVDDTGNLAVGTI 364 (468) Q Consensus 285 ~~~rw~~e~~~~l~~~i~~ean~i~q~T~rg~~n~~v~S~~va~~L~~~G~~~~~~~~~~~~~~~~~~~D~t~~~~~G~l 364 (468) +--..+.+..++.. + . ..-.+.+.+|+++.....|.. +..+.+. .....+.+. ...++| T Consensus 270 --~~~~~~~i~~~~~~-------~-~-~~~~~~~~~vmn~~~~~~l~~---lkd~~G~------~l~~~~~~~-~~~~~l 328 (415) T protein:vir:94 270 --KAKSLDDIKDAINL-------N-V-KPNYEHNVAIVSQTMFAKLDK---MKDKLGN------YLIQPDVKE-KTQQRL 328 (415) T ss_pred --cccchHHHHHHHHh-------h-h-hhccCCCEEEEcHHHHHHHHH---hhccCCC------eeeccCcCC-CCCcee Confidence 00111222222221 1 1 122346678899999888854 2222110 011111111 122455 Q ss_pred cCCeEEEEccccccCCCcc-eEEEEEecCCcccceeEEccccccccccccCCccccceeeeeeeccee-ecC--cccccC Q lcl|NC_015288. 365 NGRIKVYVDPYAANLSDKH-YYVVGYKGTSPYDAGLFYCPYVPLQMVRSIDPNNFQPKIGFKTRYGMV-SNP--FVTTNG 440 (468) Q Consensus 365 ~~~~~vy~D~Ya~~~s~~d-Y~~vG~Kg~~~~d~glfyaPYv~~~~~~~~Dp~s~qP~~g~~tRY~l~-~nP--f~~~~~ 440 (468) + |++|++.+....-..-+ -+++|--.. .+.......+. ....|-.++|-.+-...|++.. .+| |...+- T Consensus 329 ~-G~pV~~~~~~~~~~~~~~~i~~gd~~~-----~~~~~~~~~~~-v~~~~~~~~~~~~r~~~r~d~~~~~~~a~~~~~~ 401 (415) T protein:vir:94 329 L-GAKIEILPDEVLGQKGNNTLIIGNLKD-----AIVLFDRSQYQ-ASWTDYMHFGECLMIAVRQDCRILDYKSAIVIEY 401 (415) T ss_pred c-ceeeEEecccccCCCCccEEEEEehhc-----cEEEEeecceE-EEEeccccCceEEEEEEEeccEEeccccEEEEEE Confidence 4 45677654321100001 122221000 00000000011 1112445566677777888764 355 433211 Q ss_pred cccccCChhhhhhcc Q lcl|NC_015288. 441 LYSGTPDGETLTPST 455 (468) Q Consensus 441 ~~~~~~~~~~~~~~~ 455 (468) ...-.+.| ++.--+ T Consensus 402 ~~~~~~~~-~~~~~~ 415 (415) T protein:vir:94 402 DDSERGEG-DLGLEA 415 (415) T ss_pred eccCCCCC-ccccCC Confidence 11111111 222222 No 24 >protein:vir:4997 Length: 397 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:109 # MgeName: Sfi21 # Cross-refs: genbank:acc:NP_049971;genbank:gi:9632943;genbank:GeneID:1262106 Probab=94.31 E-value=0.0033 Score=34.19 Aligned_cols=327 Identities=14% Similarity=0.099 Sum_probs=114.9 Q ss_pred Cc----------------------chHHHHHhhhhh------hcCCccccccchhhhhhhhhhhhhHHHHHhhhhhhhhh Q lcl|NC_015288. 1 MF----------------------NAEHLQEKWSPV------LNNEAANPIADRYKKAVTSVLLENQERFLREERGMLQE 52 (468) Q Consensus 1 ~~----------------------~~~~l~~kw~p~------l~~~~~~~i~~~~~~~~~~~llenq~~~~~e~~~~l~e 52 (468) +- .-+.+.++=.-+ .+.+.........++.+...-.+-.....+.-.+++.. T Consensus 21 l~~~~~~~~~~~~~~~ee~~~l~~ei~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~ 100 (397) T protein:vir:49 21 LNEKLNVAMLDDSVSAEELQAIKNERDTAKMKRDLFKEQYTEARANEVANMSEEEKKPLTKNEEEVKANFVKDFKNLVRG 100 (397) T ss_pred HHHHHHHHHhcchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccccccccchhhHHHHHHHHHHHHHhhc Confidence 00 000000000000 00000000000000000000000000000000111110 Q ss_pred ccccccCcccccccccccccc-cccccccccceehhhhHHhhhhhhhhheeeeecCCccceeeeeeeeeecCCCCCcccc Q lcl|NC_015288. 53 VAVNSLGAGTVSPGGSALGSA-NTAGLAGFDPVLISLVRRAMPNLMAYDVCGVQPMSGPTGLIFAMRSRYENQAGEEALF 131 (468) Q Consensus 53 ~~~~~~g~~~~~~~~~~~~st-~tg~~~~~~P~Lv~l~RRa~~~LIa~DI~GVQPmTGPTGLIFAMRsrY~~qsG~EA~f 131 (468) ...+.. ......+ +.|+..--....-.+++..-+...-.+++.|+||++.+|-+-=.+ ..+..+ .+ T Consensus 101 ~~~~~~--------~~~~~~t~~~gg~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~--~~~~~~-~a-- 167 (397) T protein:vir:49 101 RYQNLL--------DSKTDGSGSDAGLTIPQDIRTAINTLVRQFDSLQEYVNVENVTTLTGSRVYEK--WADITG-LA-- 167 (397) T ss_pred chhhHH--------HhhhccCCccCcceecHHHHHHHHHHHHhhhhHhhhcceeeccCCcceEEEEe--eccCCc-ce-- Confidence 000000 0000111 112111101111123444556667778999999999887532222 111100 00 Q ss_pred ccccccccccccccccccccccccccccCcccccccccccccccccccccccchhhhhccCCCCCcccccee-EEEEEEE Q lcl|NC_015288. 132 NEPDAGFTAGLDATTGAYTPRTGAGVGGDAEGNNPALLNDSSPGTYETPRGFSREDLEQAGDAGKLFREMSF-SIEKTSV 210 (468) Q Consensus 132 nEa~t~fSG~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~a~~g~~t~~~gm~Ta~aE~lG~~g~~f~EMaF-sIeK~tV 210 (468) .|-+ | +..+++-.. +++.++. T Consensus 168 -----~~v~------------------------------------------------E-----~~~~~~~~~~~~~~v~~ 189 (397) T protein:vir:49 168 -----KLDD------------------------------------------------E-----GGQIGQNDDPKLSLIRY 189 (397) T ss_pred -----eeec------------------------------------------------c-----ccccccccccceeeeEe Confidence 0000 0 011122211 2334444 Q ss_pred EeecccccceecHHHHHhHHHhhCCChhHHHHHHHHHHHHHHhhHHHHHHHhhhcchhhccccccceeeeeecCCcchhH Q lcl|NC_015288. 211 TAKSRALKAEYTLELAQDLKAIHGLDAEQELANILSSEVLAEINREVVRRVYSVAKPGAANNVANAGIFDLDVDSNGRWS 290 (468) Q Consensus 211 tAKSRaLKAEYTvELAQDLkAiHGLDAE~ELanILStEImlEINREII~~l~~vA~~~k~~~~~~~Gv~Dl~~~~~~rw~ 290 (468) .+|.-+-...+|-||.+|-. +|.+++|.+-|+..|..-+|+.||.-.- .+....+++++ T Consensus 190 ~~~k~~~~~~iS~ell~ds~----~~l~~~i~~~l~~~~~~~~d~ail~G~g--------~~~~~~~~~~~--------- 248 (397) T protein:vir:49 190 AIKRYAGISTVTNSLLADSA----ENILAWLSGWIAKKVVVTRNKAILEAIG--------TLPNKPTLAKW--------- 248 (397) T ss_pred eeeeeEeehhhHHHHHhhhh----HHHHHHHHHHHHHHHHHHHHHHHHhccc--------cccccccccCH--------- Confidence 44444445678999999853 5789999999999999999998885321 11122233222 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhccCCccEEEEchhHHHHHhhccccccccccccccccccccccCCCceeEEEecCCeEE Q lcl|NC_015288. 291 VEKFKGLLFQIERDCNAIAQDTRRGKGNFLICSADVASALAMAGVLDYSSGLTGAGGPAIGTVDDTGNLAVGTINGRIKV 370 (468) Q Consensus 291 ~e~~~~l~~~i~~ean~i~q~T~rg~~n~~v~S~~va~~L~~~G~~~~~~~~~~~~~~~~~~~D~t~~~~~G~l~~~~~v 370 (468) +....+...+. ..-.....+|++|.....|.. +..+.+ + .....+-+. -..++|+|+ +| T Consensus 249 -d~i~~~~~~l~---------~~~~~~a~~v~n~~~~~~l~~---lkd~~g---~---~l~~~~~~~-g~~~~l~G~-pV 307 (397) T protein:vir:49 249 -DDIIDLQAKVD---------PAIKQTSLFLTNTSGFTALKK---VKNAMG---D---YLMERDVKS-PTGYSIDGF-VV 307 (397) T ss_pred -HHHHHHHHhhh---------hhhcCCCEEEEcHHHHHHHHH---hhccCC---c---eeecccccC-CCCceecce-ee Confidence 11212222221 112234568899999888854 222211 0 011111111 112456554 44 Q ss_pred EE--ccccccC-CC---------cceEEEEEecCCcccceeEEccccccccccccCCccccceeeeeeecceee-cC--c Q lcl|NC_015288. 371 YV--DPYAANL-SD---------KHYYVVGYKGTSPYDAGLFYCPYVPLQMVRSIDPNNFQPKIGFKTRYGMVS-NP--F 435 (468) Q Consensus 371 y~--D~Ya~~~-s~---------~dY~~vG~Kg~~~~d~glfyaPYv~~~~~~~~Dp~s~qP~~g~~tRY~l~~-nP--f 435 (468) ++ |.+..+. .+ .+|++++..+... +-..||... +-...+-.+-...|++..+ +| | T Consensus 308 ~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~----i~~~~~~~~------~~~~~~~~~~~~~r~d~~~~~~~a~ 377 (397) T protein:vir:49 308 KEISDRFLPNGTGGAMPLYFGDLKQAVTLFDRQHLS----LLSTNIGGG------AFETDTTKVRVIDRFDVVSTDTEAF 377 (397) T ss_pred EEecccccccccCCceeEEEeeccceEEEEeecccE----EEEeccccc------hhhcCeeeEEEEEeeccEEecccce Confidence 43 3221111 11 1233333333222 223344221 1123333444455555432 33 2 Q ss_pred ccc-----cCcccccCChhhhhhcc Q lcl|NC_015288. 436 VTT-----NGLYSGTPDGETLTPST 455 (468) Q Consensus 436 ~~~-----~~~~~~~~~~~~~~~~~ 455 (468) ... .+.+... -..+| T Consensus 378 ~~~~~~~~~~~~~~~-----~~~~~ 397 (397) T protein:vir:49 378 VPASFKAIADQKAKL-----STAGA 397 (397) T ss_pred EEEEecccccccCcc-----cccCC Confidence 221 1111100 01112 No 25 >protein:vir:7409 Length: 408 # NCBI annotation: major structural protein # Family: family:all:21 # MgeID: mge:146 # MgeName: P335 # Cross-refs: genbank:acc:NP_839926;genbank:gi:30089896;genbank:GeneID:1260683 Probab=94.12 E-value=0.0053 Score=33.04 Aligned_cols=332 Identities=14% Similarity=0.118 Sum_probs=128.3 Q ss_pred CcchHHHHHhhhhhhcCCccccccchhhhhhhhhh---------hhhHHHHH---hhhhhhhhhccc-----------cc Q lcl|NC_015288. 1 MFNAEHLQEKWSPVLNNEAANPIADRYKKAVTSVL---------LENQERFL---REERGMLQEVAV-----------NS 57 (468) Q Consensus 1 ~~~~~~l~~kw~p~l~~~~~~~i~~~~~~~~~~~l---------lenq~~~~---~e~~~~l~e~~~-----------~~ 57 (468) ||+-|+|.++|..+.+. ++...+ ++-..+ ++....++ .++...+.+.+. +. T Consensus 4 ~m~i~el~~~~~~~~~~-----~~~~~~-e~~~~~~~~~~~~e~i~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 77 (408) T protein:vir:74 4 KLTVNQLNEAWIASGDK-----VTDFND-QINMALNDDNFSAEAMSELKNKRDNEKVRRDALREQLVEAQAEQVVNMREE 77 (408) T ss_pred hhhHHHHHHHHHHHHHH-----HHHHHH-HHHHHHhhhcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccc Confidence 88999999999988654 222111 110000 00000000 000000000000 00 Q ss_pred cC-----------------------cccc-----cccccccccccccccc---cccceehhhhHHhhhhhhhhheeeeec Q lcl|NC_015288. 58 LG-----------------------AGTV-----SPGGSALGSANTAGLA---GFDPVLISLVRRAMPNLMAYDVCGVQP 106 (468) Q Consensus 58 ~g-----------------------~~~~-----~~~~~~~~st~tg~~~---~~~P~Lv~l~RRa~~~LIa~DI~GVQP 106 (468) .. .+.+ ...+....++..|++. .+.+. +++...+.....++++++| T Consensus 78 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~gg~~vP~~~~~~---Ii~~~~~~~~l~~~~~~~~ 154 (408) T protein:vir:74 78 EKGPLNKSENELKDKFVKDFVNMVRNPMAFLNTVSSKTETSGSDSAAGLTIPQDIRTM---INTLVRQYDSLQQYVRVES 154 (408) T ss_pred ccccccchhhhhHHHHHHHHHHHHhcchhhhhhhhhhhhcccccCCCceeechhHhhH---HHHHHhhhcchhhhcceee Confidence 00 0000 0000001111112211 11222 3444445556788999999 Q ss_pred CCccceeeeeeeeeecCCCCCccccccccccccccccccccccccccccccccCcccccccccccccccccccccccchh Q lcl|NC_015288. 107 MSGPTGLIFAMRSRYENQAGEEALFNEPDAGFTAGLDATTGAYTPRTGAGVGGDAEGNNPALLNDSSPGTYETPRGFSRE 186 (468) Q Consensus 107 mTGPTGLIFAMRsrY~~qsG~EA~fnEa~t~fSG~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~a~~g~~t~~~gm~Ta 186 (468) |++.+|-+--.| ..+. +..+ .+ .+ T Consensus 155 ~~~~~~~~~~~~--~~~~-~~~~-------~~----------------------------------------------v~ 178 (408) T protein:vir:74 155 VSTSSGSRVYEK--WTDV-TPLK-------AM----------------------------------------------DE 178 (408) T ss_pred ccCCcceEEEEe--ecCC-cccc-------cc----------------------------------------------cc Confidence 999887653333 1110 0000 00 00 Q ss_pred hhhccCCCCCccccce-eEEEEEEEEeecccccceecHHHHHhHHHhhCCChhHHHHHHHHHHHHHHhhHHHHHHHhhhc Q lcl|NC_015288. 187 DLEQAGDAGKLFREMS-FSIEKTSVTAKSRALKAEYTLELAQDLKAIHGLDAEQELANILSSEVLAEINREVVRRVYSVA 265 (468) Q Consensus 187 ~aE~lG~~g~~f~EMa-FsIeK~tVtAKSRaLKAEYTvELAQDLkAiHGLDAE~ELanILStEImlEINREII~~l~~vA 265 (468) .+...++.+ .+++++++..+.-+-...+|-||.+|- .+|.++.|.+-|+..|..-+|+.||.- T Consensus 179 -------E~~~~~~~~~~~~~~i~~~~~k~~~~~~iS~ell~ds----~~~l~~~i~~~l~~~~~~~~d~~il~G----- 242 (408) T protein:vir:74 179 -------EDGKIPDLDNPRLTIIKYLIKRYAGIITATNTLLKDT----AENILAWLSSWIAKKVVVTRNQAIIAA----- 242 (408) T ss_pred -------cccccccccccceeeEEeeeeeEEeeehhHHHHHhhc----hHHHHHHHHHHHHHHHHHHHHHHHhhc----- Confidence 011122222 334445555555555566999999983 357889999999999999999888752 Q ss_pred chhhccccccceeeeeecCCcchhHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEchhHHHHHhhccccccccccccc Q lcl|NC_015288. 266 KPGAANNVANAGIFDLDVDSNGRWSVEKFKGLLFQIERDCNAIAQDTRRGKGNFLICSADVASALAMAGVLDYSSGLTGA 345 (468) Q Consensus 266 ~~~k~~~~~~~Gv~Dl~~~~~~rw~~e~~~~l~~~i~~ean~i~q~T~rg~~n~~v~S~~va~~L~~~G~~~~~~~~~~~ 345 (468) ...+....++.++ .+++-. .+......-+..+ .+||++.....|.. +..+.+ + T Consensus 243 ---~G~~~~~~~~~~~-------------~~i~~~----~~~~l~~~~~~~a-~~v~n~~~~~~l~~---lkd~~G---~ 295 (408) T protein:vir:74 243 ---MGTVPKKPTIANF-------------DDVITM----INTSVDPAIIATS-SLLTNQSGLNKLAL---VKTAEG---K 295 (408) T ss_pred ---ccccccccccccH-------------HHHHHH----HHHhhhhhhcCCC-EEEEcHHHHHHHHH---hhcCCC---c Confidence 1111122222222 122110 1111111122223 46789999888853 222211 1 Q ss_pred cccccccccCCCceeEEEecCCeEEEEcc--ccccCCCcce-EEEEE-ec----CCcccceeEEccccccccccccCCcc Q lcl|NC_015288. 346 GGPAIGTVDDTGNLAVGTINGRIKVYVDP--YAANLSDKHY-YVVGY-KG----TSPYDAGLFYCPYVPLQMVRSIDPNN 417 (468) Q Consensus 346 ~~~~~~~~D~t~~~~~G~l~~~~~vy~D~--Ya~~~s~~dY-~~vG~-Kg----~~~~d~glfyaPYv~~~~~~~~Dp~s 417 (468) .....|.++. ..++| .|++|++-. ...+....++ +++|- +. -....-.+=..||.-. +-.. T Consensus 296 ---~l~~~~~~~~-~~~~l-~G~pV~~~~~~~~~~~~~~~~~i~~gd~~~~~~~~~~~~~~i~~~~~~~~------~f~~ 364 (408) T protein:vir:74 296 ---YLLEPDPTKP-NSYLI-KGKQVIVVADRWLPNSGSTVYPLYYGDMSQAITLFDRENMSLLPTNIGAG------AFET 364 (408) T ss_pred ---eEeccCcCCC-CCcee-cceeeEEecCcccccccCCcceEEEEehhccEEEEEecceEEEEeccccc------hhhc Confidence 1111121111 12455 345666421 1111100111 22220 10 0000011112222111 1134 Q ss_pred ccceeeeeeecceee-cC--cccc-----cCccc--ccCChhhh Q lcl|NC_015288. 418 FQPKIGFKTRYGMVS-NP--FVTT-----NGLYS--GTPDGETL 451 (468) Q Consensus 418 ~qP~~g~~tRY~l~~-nP--f~~~-----~~~~~--~~~~~~~~ 451 (468) .+-.+-+..||+..+ +| |... ..... +.+....+ T Consensus 365 ~~~~~r~~~r~d~~~~~~~a~~~~~~~~~~~~~~~~~~~~~~~~ 408 (408) T protein:vir:74 365 DTTKIRVIDRFDVKATDSEALVAGSFTAIADQVGNFKTTTSTAV 408 (408) T ss_pred ceeeEEEEEeeCcEEecccceEEEEeecccCCCCCCCCCccccC Confidence 555566666666542 33 2111 11111 11111111 No 26 >protein:vir:78523 Length: 338 # NCBI annotation: Putative head structural protein # Family: family:all:507 # MgeID: mge:1853 # MgeName: U2 # Cross-refs: genbank:acc:YP_001491585;genbank:gi:157786408;genbank:GeneID:5625675 Probab=94.07 E-value=0.0055 Score=32.98 Aligned_cols=307 Identities=14% Similarity=0.064 Sum_probs=127.7 Q ss_pred hhhhhhccccccCccccccccccccccccccccccccee-hhhhHHhhhhhhhhheeeeecCCccceeeeeeeeeecCCC Q lcl|NC_015288. 47 RGMLQEVAVNSLGAGTVSPGGSALGSANTAGLAGFDPVL-ISLVRRAMPNLMAYDVCGVQPMSGPTGLIFAMRSRYENQA 125 (468) Q Consensus 47 ~~~l~e~~~~~~g~~~~~~~~~~~~st~tg~~~~~~P~L-v~l~RRa~~~LIa~DI~GVQPmTGPTGLIFAMRsrY~~qs 125 (468) -..|.|.-+++.|.+.-+. .+++++. -.-+.+ -.+++.+.+..+-..+|.+.||+++..-|.-.. . T Consensus 1 ~~~~~e~~~~~~~~~~~~~------~~~~~~~-liP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~ip~~~----~-- 67 (338) T protein:vir:78 1 MATLNELAPNTAGSNHQGR------LAHVPSD-LLPKEIVGPIFDKAQESSLVLRLGENIPISYGETIIPTTV----K-- 67 (338) T ss_pred CcchHHhhhhhcccccccc------eeccccc-ccchHHHHHHHHHHHhhchhhhhcceeeccCCceEEEEEe----c-- Confidence 2223333333333222111 1111110 111111 134555556667788999999998755554332 1 Q ss_pred CCccccccccccccccccccccccccccccccccCcccccccccccccccccccccccchhhhhccCCCCCccccceeEE Q lcl|NC_015288. 126 GEEALFNEPDAGFTAGLDATTGAYTPRTGAGVGGDAEGNNPALLNDSSPGTYETPRGFSREDLEQAGDAGKLFREMSFSI 205 (468) Q Consensus 126 G~EA~fnEa~t~fSG~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~a~~g~~t~~~gm~Ta~aE~lG~~g~~f~EMaFsI 205 (468) +.++ .+-+...+ .-.++ +..+++-.-++ T Consensus 68 ~~~a-------~~v~~~~~--------------------------------------~~~~E-------g~~~~~~~~~f 95 (338) T protein:vir:78 68 RPEV-------GQVGVGTS--------------------------------------NEQRE-------GGTKPLSGTAW 95 (338) T ss_pred Cccc-------eeeccccc--------------------------------------ccccc-------cccccccccce Confidence 0100 01000000 00001 12233333444 Q ss_pred EEEEEEeecccccceecHHHHHhHHHhhCCChhHHHHHHHHHHHHHHhhHHHHHHHhhhcchhhccccccceeeee---- Q lcl|NC_015288. 206 EKTSVTAKSRALKAEYTLELAQDLKAIHGLDAEQELANILSSEVLAEINREVVRRVYSVAKPGAANNVANAGIFDL---- 281 (468) Q Consensus 206 eK~tVtAKSRaLKAEYTvELAQDLkAiHGLDAE~ELanILStEImlEINREII~~l~~vA~~~k~~~~~~~Gv~Dl---- 281 (468) +.++...|..+-...+|-||.+|-. .|.+++|.+-|...|...||..||.---.. +.+ ...|+... T Consensus 96 ~~v~l~~~k~~~~~~is~ell~ds~----~~~~~~i~~~la~a~~~~~d~~~l~G~g~~----~~~--~~~gi~~~~~~~ 165 (338) T protein:vir:78 96 DTRSVAPIKLATIVTVSEEFARMNP----SGLYTKLQADLAYAIGRGIDLAVFHGKSPL----TGS--ALQGIDTNNVIV 165 (338) T ss_pred eEEEEEEEEEEEeehhhHHHHhcCH----HHHHHHHHHHHHHHHHHHHHHHhhcccCCC----ccc--cccccccccccc Confidence 5555555555555678899999833 578899999999999999998888522210 000 01111100 Q ss_pred -ecCCcchhHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEchhHHHHHhh-ccccccccccccccccccccccCCCce Q lcl|NC_015288. 282 -DVDSNGRWSVEKFKGLLFQIERDCNAIAQDTRRGKGNFLICSADVASALAM-AGVLDYSSGLTGAGGPAIGTVDDTGNL 359 (468) Q Consensus 282 -~~~~~~rw~~e~~~~l~~~i~~ean~i~q~T~rg~~n~~v~S~~va~~L~~-~G~~~~~~~~~~~~~~~~~~~D~t~~~ 359 (468) ....+..+... ..+|.....|-.-......+..+-+++|++....|.. ..+.|-.-...- ..+.+ .. T Consensus 166 ~~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~m~~~~~~~L~~~~~l~d~~g~~l~-------~~~~~-~~ 234 (338) T protein:vir:78 166 NTTNVDYLQTGT---TPLLDRFLDGYDLVSANTDVDFNGWAADPRYRARLLRSQAYRDANGNVDP-------TRINL-AA 234 (338) T ss_pred cccccccccccc---hhhHHHHHHHHHHhhhhccccceEEEEchHHHHHHHHHhhhccCCCceee-------ccccc-CC Confidence 00000001000 0122233333333344455677789999998877743 222221111110 01111 11 Q ss_pred eEEEecCCeEEEEccccccC-----C--------CcceEEEEEecCCcccceeEEccccccccccccCCcc-----c--- Q lcl|NC_015288. 360 AVGTINGRIKVYVDPYAANL-----S--------DKHYYVVGYKGTSPYDAGLFYCPYVPLQMVRSIDPNN-----F--- 418 (468) Q Consensus 360 ~~G~l~~~~~vy~D~Ya~~~-----s--------~~dY~~vG~Kg~~~~d~glfyaPYv~~~~~~~~Dp~s-----~--- 418 (468) ..++|. |++|+++.+...+ . ++.++++|..+....+ ..+| ..+....||.. | T Consensus 235 ~~~~l~-G~PV~~~~~ip~~~~~~~~~~~~~~~gdfs~~~~~~~~~~~i~----~~~~--~~~~~~~~~~~~~~~~~~~~ 307 (338) T protein:vir:78 235 SAGDLL-GLPVQFGKAVGGDLGAATDSKVRVVGGDFSQLKYGFADEIRVK----MSDT--ATLTDNTSPTPQTVSMWQTN 307 (338) T ss_pred CCceee-eeeEEEccccCccccccCCcccEEEEEecceEEEEeecccEEE----Eeec--ccccccccccccchhhhhcC Confidence 135564 4588877543211 0 1122222322222110 0111 11222234432 1 Q ss_pred cceeeeeeecc-eeecC--cccccCcccccCCh Q lcl|NC_015288. 419 QPKIGFKTRYG-MVSNP--FVTTNGLYSGTPDG 448 (468) Q Consensus 419 qP~~g~~tRY~-l~~nP--f~~~~~~~~~~~~~ 448 (468) |=.+=...|++ .+.|| |+......+ ++. T Consensus 308 ~~~~r~~~r~d~~v~~~~a~~~l~~~~~--~~~ 338 (338) T protein:vir:78 308 QIAILIEVTFGWLLGDKQAFVKFVDDED--PDA 338 (338) T ss_pred cEEEEEEEEeccEeecccceEEEecccC--CCC Confidence 11222356887 45666 554433222 222 No 27 >protein:vir:4600 Length: 415 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:101 # MgeName: PVL # Cross-refs: genbank:acc:NP_058445;genbank:gi:9635171;genbank:GeneID:1262708 Probab=93.85 E-value=0.0062 Score=32.70 Aligned_cols=341 Identities=13% Similarity=0.068 Sum_probs=131.5 Q ss_pred CcchH-------------HHHHhhhhhhc----------CCcc--c--------cccchhhhhhhhhhhhhHHHHHhhhh Q lcl|NC_015288. 1 MFNAE-------------HLQEKWSPVLN----------NEAA--N--------PIADRYKKAVTSVLLENQERFLREER 47 (468) Q Consensus 1 ~~~~~-------------~l~~kw~p~l~----------~~~~--~--------~i~~~~~~~~~~~llenq~~~~~e~~ 47 (468) +++.+ +|.++..-+-+ .... . .-.+...+......+.+....-.|.+ T Consensus 28 ~~~~~~~e~~~~~~~ev~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 107 (415) T protein:vir:46 28 ALNNDELEKAEKLEQEITDLRSQIQEKQEELDKLKEKDRTSENNQQSVEVNEARTYRNQANINDLGISIQNTKVTSQEVR 107 (415) T ss_pred HhchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccccchhhhhHHHHHHHHHHHhhhhhhhhHHHHH Confidence 11111 12222211100 0000 0 00000000111111111111111111 Q ss_pred hhhhhccccccCccccccccccccccccccccccccee--hhhhHHhhhhhhhhheeeeecCCccceeeeeeeeeecCCC Q lcl|NC_015288. 48 GMLQEVAVNSLGAGTVSPGGSALGSANTAGLAGFDPVL--ISLVRRAMPNLMAYDVCGVQPMSGPTGLIFAMRSRYENQA 125 (468) Q Consensus 48 ~~l~e~~~~~~g~~~~~~~~~~~~st~tg~~~~~~P~L--v~l~RRa~~~LIa~DI~GVQPmTGPTGLIFAMRsrY~~qs 125 (468) .+..... .........++|.+-...=|.. -.+++.+.+...-.+++.+.||+++++-+.-.+.. . T Consensus 108 ~~~~~~~---------~~~~~~~~~~~t~~g~~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~----~ 174 (415) T protein:vir:46 108 DFTEYLE---------TRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQS----E 174 (415) T ss_pred HHHHHHh---------hhhhhhhccccccCCcccccHHHHHHHHHHHHhhhhhhhhcceeeccCCceeEEEEEec----C Confidence 1111100 0000000111111111111211 13455556777788999999999988765433311 0 Q ss_pred CCccccccccccccccccccccccccccccccccCcccccccccccccccccccccccchhhhhccCCCCCccccce-eE Q lcl|NC_015288. 126 GEEALFNEPDAGFTAGLDATTGAYTPRTGAGVGGDAEGNNPALLNDSSPGTYETPRGFSREDLEQAGDAGKLFREMS-FS 204 (468) Q Consensus 126 G~EA~fnEa~t~fSG~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~a~~g~~t~~~gm~Ta~aE~lG~~g~~f~EMa-Fs 204 (468) +.++ .|- + .+..+++.+ -+ T Consensus 175 ~~~~-------~~v------------------------------------------------~-----Eg~~~~~~~~~~ 194 (415) T protein:vir:46 175 VAAL-------EKV------------------------------------------------E-----ELEENPELAVKP 194 (415) T ss_pred Ccce-------eec------------------------------------------------c-----cccccccccccc Confidence 0000 000 0 012233332 24 Q ss_pred EEEEEEEeecccccceecHHHHHhHHHhhCCChhHHHHHHHHHHHHHHhhHHHHHHHhhhcchhhccccccceeeeeecC Q lcl|NC_015288. 205 IEKTSVTAKSRALKAEYTLELAQDLKAIHGLDAEQELANILSSEVLAEINREVVRRVYSVAKPGAANNVANAGIFDLDVD 284 (468) Q Consensus 205 IeK~tVtAKSRaLKAEYTvELAQDLkAiHGLDAE~ELanILStEImlEINREII~~l~~vA~~~k~~~~~~~Gv~Dl~~~ 284 (468) +++++..++..+-...+|-||.+|-. .|.+++|.+-|+..|..-+|+.||.-.-+-...+-.......+. -...+ T Consensus 195 ~~~v~~~~~k~~~~~~iS~ell~ds~----~~l~~~i~~~l~~~i~~~~d~~il~g~g~g~~~~~~~~~~~~~~-~~~~~ 269 (415) T protein:vir:46 195 FFQLAYDINTHRGYFRISREAIEDAK----VNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGFEKEGK-KLEVK 269 (415) T ss_pred eeeEEeeeeeeEeeehhhHHHHhhch----HHHHHHHHHHHHHHHHHHHHHHHhhccccCCccccccccccccc-eeccc Confidence 55666666666666789999999843 57889999999999999999999875432211111000000000 01111 Q ss_pred CcchhHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEchhHHHHHhhccccccccccccccccccccccCCCceeEEEe Q lcl|NC_015288. 285 SNGRWSVEKFKGLLFQIERDCNAIAQDTRRGKGNFLICSADVASALAMAGVLDYSSGLTGAGGPAIGTVDDTGNLAVGTI 364 (468) Q Consensus 285 ~~~rw~~e~~~~l~~~i~~ean~i~q~T~rg~~n~~v~S~~va~~L~~~G~~~~~~~~~~~~~~~~~~~D~t~~~~~G~l 364 (468) ... ..+....++..+. .--++.+.+|+++.....|.. +..+.+ ......+-+.. ..++| T Consensus 270 ~~~--~~~~i~~~~~~~~---------~~~~~~~~~v~n~~~~~~L~~---lkd~~G------~~i~~~~~~~~-~~~~l 328 (415) T protein:vir:46 270 KAK--SLDDIKDAINLNV---------KPNYEHNVAIVSQTMFAKLDK---MKDKLG------NYLIQPDVKEK-TQQRL 328 (415) T ss_pred ccc--chHHHHHHHHhhh---------hhccCCCEEEEcHHHHHHHHH---hhccCC------CeeeccCcCCC-CCccc Confidence 111 1122222322221 122356678899999888853 222111 01111121111 12456 Q ss_pred cCCeEEEEccccccCCCcceEEEEEecCCcccceeEEccccc----c---c-cccccCCccccceeeeeeeccee-ecC- Q lcl|NC_015288. 365 NGRIKVYVDPYAANLSDKHYYVVGYKGTSPYDAGLFYCPYVP----L---Q-MVRSIDPNNFQPKIGFKTRYGMV-SNP- 434 (468) Q Consensus 365 ~~~~~vy~D~Ya~~~s~~dY~~vG~Kg~~~~d~glfyaPYv~----~---~-~~~~~Dp~s~qP~~g~~tRY~l~-~nP- 434 (468) + |++|++..++. +|-.|+ ..++|+.|-. . . .....|-.++|-.+-...|++.. .+| T Consensus 329 ~-G~pV~~~~~~~---------~~~~~~----~~~~~gd~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~r~d~~v~~~~ 394 (415) T protein:vir:46 329 L-GAKIEILPDEV---------LGQKGN----NTLIIGNLKDAIVLFDRSQYQASWTDYMHFGECLMIAVRQDCRILDYK 394 (415) T ss_pred c-ceeeEEecccc---------ccCCCc----cEEEEEehhccEEEEeecceEEEeeccccCceEEEEEEEeccEEeccc Confidence 4 45666553221 111111 1122222110 0 0 11112445566677777888764 355 Q ss_pred -cccccCcccccCChhhhhhcc Q lcl|NC_015288. 435 -FVTTNGLYSGTPDGETLTPST 455 (468) Q Consensus 435 -f~~~~~~~~~~~~~~~~~~~~ 455 (468) |...+-..+--+. .++.--+ T Consensus 395 a~~~~~~~~~~~~~-~~~~~~~ 415 (415) T protein:vir:46 395 SAIVIEYDDSERGE-GDLGLEA 415 (415) T ss_pred cEEEEEeeccCCCC-CCccCCC Confidence 3322111110011 1222222 No 28 >protein:vir:4700 Length: 415 # NCBI annotation: phi PVL ORF 7 homologue # Family: family:all:21 # MgeID: mge:102 # MgeName: phiPV83 # Cross-refs: genbank:acc:NP_061632;genbank:gi:9635719;genbank:GeneID:1262976 Probab=93.85 E-value=0.0062 Score=32.70 Aligned_cols=341 Identities=13% Similarity=0.068 Sum_probs=131.5 Q ss_pred CcchH-------------HHHHhhhhhhc----------CCcc--c--------cccchhhhhhhhhhhhhHHHHHhhhh Q lcl|NC_015288. 1 MFNAE-------------HLQEKWSPVLN----------NEAA--N--------PIADRYKKAVTSVLLENQERFLREER 47 (468) Q Consensus 1 ~~~~~-------------~l~~kw~p~l~----------~~~~--~--------~i~~~~~~~~~~~llenq~~~~~e~~ 47 (468) +++.+ +|.++..-+-+ .... . .-.+...+......+.+....-.|.+ T Consensus 28 ~~~~~~~e~~~~~~~ev~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 107 (415) T protein:vir:47 28 ALNNDELEKAEKLEQEITDLRSQIQEKQEELDKLKEKDRTSENNQQSVEVNEARTYRNQANINDLGISIQNTKVTSQEVR 107 (415) T ss_pred HhchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccccchhhhhHHHHHHHHHHHhhhhhhhhHHHHH Confidence 11111 12222211100 0000 0 00000000111111111111111111 Q ss_pred hhhhhccccccCccccccccccccccccccccccccee--hhhhHHhhhhhhhhheeeeecCCccceeeeeeeeeecCCC Q lcl|NC_015288. 48 GMLQEVAVNSLGAGTVSPGGSALGSANTAGLAGFDPVL--ISLVRRAMPNLMAYDVCGVQPMSGPTGLIFAMRSRYENQA 125 (468) Q Consensus 48 ~~l~e~~~~~~g~~~~~~~~~~~~st~tg~~~~~~P~L--v~l~RRa~~~LIa~DI~GVQPmTGPTGLIFAMRsrY~~qs 125 (468) .+..... .........++|.+-...=|.. -.+++.+.+...-.+++.+.||+++++-+.-.+.. . T Consensus 108 ~~~~~~~---------~~~~~~~~~~~t~~g~~~iP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~----~ 174 (415) T protein:vir:47 108 DFTEYLE---------TRNDIQGGSLKTDSGFVVIPEEIVTDILKLKEVEFNLDKYVTVKRVTNGSGKYPVVRQS----E 174 (415) T ss_pred HHHHHHh---------hhhhhhhccccccCCcccccHHHHHHHHHHHHhhhhhhhhcceeeccCCceeEEEEEec----C Confidence 1111100 0000000111111111111211 13455556777788999999999988765433311 0 Q ss_pred CCccccccccccccccccccccccccccccccccCcccccccccccccccccccccccchhhhhccCCCCCccccce-eE Q lcl|NC_015288. 126 GEEALFNEPDAGFTAGLDATTGAYTPRTGAGVGGDAEGNNPALLNDSSPGTYETPRGFSREDLEQAGDAGKLFREMS-FS 204 (468) Q Consensus 126 G~EA~fnEa~t~fSG~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~a~~g~~t~~~gm~Ta~aE~lG~~g~~f~EMa-Fs 204 (468) +.++ .|- + .+..+++.+ -+ T Consensus 175 ~~~~-------~~v------------------------------------------------~-----Eg~~~~~~~~~~ 194 (415) T protein:vir:47 175 VAAL-------EKV------------------------------------------------E-----ELEENPELAVKP 194 (415) T ss_pred Ccce-------eec------------------------------------------------c-----cccccccccccc Confidence 0000 000 0 012233332 24 Q ss_pred EEEEEEEeecccccceecHHHHHhHHHhhCCChhHHHHHHHHHHHHHHhhHHHHHHHhhhcchhhccccccceeeeeecC Q lcl|NC_015288. 205 IEKTSVTAKSRALKAEYTLELAQDLKAIHGLDAEQELANILSSEVLAEINREVVRRVYSVAKPGAANNVANAGIFDLDVD 284 (468) Q Consensus 205 IeK~tVtAKSRaLKAEYTvELAQDLkAiHGLDAE~ELanILStEImlEINREII~~l~~vA~~~k~~~~~~~Gv~Dl~~~ 284 (468) +++++..++..+-...+|-||.+|-. .|.+++|.+-|+..|..-+|+.||.-.-+-...+-.......+. -...+ T Consensus 195 ~~~v~~~~~k~~~~~~iS~ell~ds~----~~l~~~i~~~l~~~i~~~~d~~il~g~g~g~~~~~~~~~~~~~~-~~~~~ 269 (415) T protein:vir:47 195 FFQLAYDINTHRGYFRISREAIEDAK----VNVLQELKLWMARTIAATRNKAIIDVITKGSTGSTSSGFEKEGK-KLEVK 269 (415) T ss_pred eeeEEeeeeeeEeeehhhHHHHhhch----HHHHHHHHHHHHHHHHHHHHHHHhhccccCCccccccccccccc-eeccc Confidence 55666666666666789999999843 57889999999999999999999875432211111000000000 01111 Q ss_pred CcchhHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEchhHHHHHhhccccccccccccccccccccccCCCceeEEEe Q lcl|NC_015288. 285 SNGRWSVEKFKGLLFQIERDCNAIAQDTRRGKGNFLICSADVASALAMAGVLDYSSGLTGAGGPAIGTVDDTGNLAVGTI 364 (468) Q Consensus 285 ~~~rw~~e~~~~l~~~i~~ean~i~q~T~rg~~n~~v~S~~va~~L~~~G~~~~~~~~~~~~~~~~~~~D~t~~~~~G~l 364 (468) ... ..+....++..+. .--++.+.+|+++.....|.. +..+.+ ......+-+.. ..++| T Consensus 270 ~~~--~~~~i~~~~~~~~---------~~~~~~~~~v~n~~~~~~L~~---lkd~~G------~~i~~~~~~~~-~~~~l 328 (415) T protein:vir:47 270 KAK--SLDDIKDAINLNV---------KPNYEHNVAIVSQTMFAKLDK---MKDKLG------NYLIQPDVKEK-TQQRL 328 (415) T ss_pred ccc--chHHHHHHHHhhh---------hhccCCCEEEEcHHHHHHHHH---hhccCC------CeeeccCcCCC-CCccc Confidence 111 1122222322221 122356678899999888853 222111 01111121111 12456 Q ss_pred cCCeEEEEccccccCCCcceEEEEEecCCcccceeEEccccc----c---c-cccccCCccccceeeeeeeccee-ecC- Q lcl|NC_015288. 365 NGRIKVYVDPYAANLSDKHYYVVGYKGTSPYDAGLFYCPYVP----L---Q-MVRSIDPNNFQPKIGFKTRYGMV-SNP- 434 (468) Q Consensus 365 ~~~~~vy~D~Ya~~~s~~dY~~vG~Kg~~~~d~glfyaPYv~----~---~-~~~~~Dp~s~qP~~g~~tRY~l~-~nP- 434 (468) + |++|++..++. +|-.|+ ..++|+.|-. . . .....|-.++|-.+-...|++.. .+| T Consensus 329 ~-G~pV~~~~~~~---------~~~~~~----~~~~~gd~~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~r~d~~v~~~~ 394 (415) T protein:vir:47 329 L-GAKIEILPDEV---------LGQKGN----NTLIIGNLKDAIVLFDRSQYQASWTDYMHFGECLMIAVRQDCRILDYK 394 (415) T ss_pred c-ceeeEEecccc---------ccCCCc----cEEEEEehhccEEEEeecceEEEeeccccCceEEEEEEEeccEEeccc Confidence 4 45666553221 111111 1122222110 0 0 11112445566677777888764 355 Q ss_pred -cccccCcccccCChhhhhhcc Q lcl|NC_015288. 435 -FVTTNGLYSGTPDGETLTPST 455 (468) Q Consensus 435 -f~~~~~~~~~~~~~~~~~~~~ 455 (468) |...+-..+--+. .++.--+ T Consensus 395 a~~~~~~~~~~~~~-~~~~~~~ 415 (415) T protein:vir:47 395 SAIVIEYDDSERGE-GDLGLEA 415 (415) T ss_pred cEEEEEeeccCCCC-CCccCCC Confidence 3322111110011 1222222 No 29 >protein:vir:1886 Length: 385 # NCBI annotation: major capsid subunit precursor # Family: family:all:585 # MgeID: mge:41 # MgeName: HK022 # Cross-refs: genbank:acc:NP_037666;genbank:gi:9634124;genbank:GeneID:1262513 Probab=93.36 E-value=0.0078 Score=32.13 Aligned_cols=333 Identities=13% Similarity=0.058 Sum_probs=130.0 Q ss_pred CcchHHHHHhhhhhhcCCccccccchhhhhhh-----hhhhhhHHHHHhhh----hhhhhh---cccccc---C------ Q lcl|NC_015288. 1 MFNAEHLQEKWSPVLNNEAANPIADRYKKAVT-----SVLLENQERFLREE----RGMLQE---VAVNSL---G------ 59 (468) Q Consensus 1 ~~~~~~l~~kw~p~l~~~~~~~i~~~~~~~~~-----~~llenq~~~~~e~----~~~l~e---~~~~~~---g------ 59 (468) |-+-++|.++..-+.+. +-++.+..+..+- ..=|++|.+.+.++ ...+.+ +..+.. + T Consensus 1 M~~l~el~~~~~~~~~e--~~~l~~~~~~e~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 78 (385) T protein:vir:18 1 MSELALIQKAIEESQQK--MTQLFDAQKAEIESTGQVSKQLQSDLMKVQEELTKSGTRLFDLEQKLASGAENPGEKKSFS 78 (385) T ss_pred ChHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccchhhhhH Confidence 77777888887766542 1112221111110 01112221111000 000110 000000 0 Q ss_pred ----------------cccc-cccccccc-cccccccc--cccceehhhhHHhhhhhhhhheeeeecCCccceeeeeeee Q lcl|NC_015288. 60 ----------------AGTV-SPGGSALG-SANTAGLA--GFDPVLISLVRRAMPNLMAYDVCGVQPMSGPTGLIFAMRS 119 (468) Q Consensus 60 ----------------~~~~-~~~~~~~~-st~tg~~~--~~~P~Lv~l~RRa~~~LIa~DI~GVQPmTGPTGLIFAMRs 119 (468) .... .....+.. ++..|.+. ...+.+ +++......-.+++.++||+++..-+.- T Consensus 79 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~i~~~~~~~i---i~~~~~~~~l~~~~~~~~~~~~~~~~~~--- 152 (385) T protein:vir:18 79 ERAAEELIKSWDGKQGTFGAKTFNKSLGSDADSAGSLIQPMQIPGI---IMPGLRRLTIRDLLAQGRTSSNALEYVR--- 152 (385) T ss_pred HHHHHHHHHHHHHhhccchhhHHHhhhccccccCCceecchhhhHH---HHHhhhccchhhhcceecccCcceEEEE--- Confidence 0000 00000000 11111111 112333 3444455567778888888876532211 Q ss_pred eecCCCCCccccccccccccccccccccccccccccccccCcccccccccccccccccccccccchhhhhccCCCCCccc Q lcl|NC_015288. 120 RYENQAGEEALFNEPDAGFTAGLDATTGAYTPRTGAGVGGDAEGNNPALLNDSSPGTYETPRGFSREDLEQAGDAGKLFR 199 (468) Q Consensus 120 rY~~qsG~EA~fnEa~t~fSG~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~a~~g~~t~~~gm~Ta~aE~lG~~g~~f~ 199 (468) +....+ .+ .| . +| +..++ T Consensus 153 -~~~~~~-~a-------~~----------------------------------------------v--~E-----~~~~~ 170 (385) T protein:vir:18 153 -EEVFTN-NA-------DV----------------------------------------------V--AE-----KALKP 170 (385) T ss_pred -EecCCc-ce-------ee----------------------------------------------e--cc-----Ccccc Confidence 111000 00 00 0 00 22345 Q ss_pred cceeEEEEEEEEeecccccceecHHHHHhHHHhhCCChhHHHHHHHHHHHHHHhhHHHHHHHhhhcchhhccccccceee Q lcl|NC_015288. 200 EMSFSIEKTSVTAKSRALKAEYTLELAQDLKAIHGLDAEQELANILSSEVLAEINREVVRRVYSVAKPGAANNVANAGIF 279 (468) Q Consensus 200 EMaFsIeK~tVtAKSRaLKAEYTvELAQDLkAiHGLDAE~ELanILStEImlEINREII~~l~~vA~~~k~~~~~~~Gv~ 279 (468) +-..++++++.+.|.-+-...+|-||.||-- +.++.|.+-|+..|..-+|+.||.- .- .+-...|++ T Consensus 171 ~~~~~~~~~~~~~~k~~~~~~is~ell~d~~-----~l~~~i~~~la~a~~~~~d~~~l~G----~g----~~~~~~Gi~ 237 (385) T protein:vir:18 171 ESDITFSKQTANVKTIAHWVQASRQVMDDAP-----MLQSYINNRLMYGLALKEEGQLLNG----DG----TGDNLEGLN 237 (385) T ss_pred ccccceeEEEEeeeeEEEeehhhHHHHhhHH-----HHHHHHHHHHHHHHHHHHHHHHHhc----cC----CCCcccccc Confidence 5555666777777777777889999999842 3566777777777777777776631 11 111223333 Q ss_pred eeecCCcchhHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEchhHHHHHhhccccccccc-cccccccccccccCCCc Q lcl|NC_015288. 280 DLDVDSNGRWSVEKFKGLLFQIERDCNAIAQDTRRGKGNFLICSADVASALAMAGVLDYSSG-LTGAGGPAIGTVDDTGN 358 (468) Q Consensus 280 Dl~~~~~~rw~~e~~~~l~~~i~~ean~i~q~T~rg~~n~~v~S~~va~~L~~~G~~~~~~~-~~~~~~~~~~~~D~t~~ 358 (468) .........+... ....+.....+.... ...-+..+-+|||++....|.. +..+.+ ...+. -.+.+ T Consensus 238 ~~~~~~~~~~~~~--~~~~~d~i~~~~~~l-~~~~~~~~~~~~~~~~~~~l~~---lkd~~G~~l~~~-----~~~~~-- 304 (385) T protein:vir:18 238 KVATAYDTSLNAT--GDTRADIIAHAIYQV-TESEFSASGIVLNPRDWHNIAL---LKDNEGRYIFGG-----PQAFT-- 304 (385) T ss_pred ccccccccccccc--ccchHHHHHHHHHhh-ccccCCCCEEEEcHHHHHHHHH---hhcCCCceeccC-----cccCC-- Confidence 2221111000000 000122112222211 2233456678999999988853 222211 11100 01111 Q ss_pred eeEEEecCCeEEEEccccccCCCcceEEEEE-ecCCcccceeEEccccccccccccCC---ccc-cceee--eeeecce- Q lcl|NC_015288. 359 LAVGTINGRIKVYVDPYAANLSDKHYYVVGY-KGTSPYDAGLFYCPYVPLQMVRSIDP---NNF-QPKIG--FKTRYGM- 430 (468) Q Consensus 359 ~~~G~l~~~~~vy~D~Ya~~~s~~dY~~vG~-Kg~~~~d~glfyaPYv~~~~~~~~Dp---~s~-qP~~g--~~tRY~l- 430 (468) .++|.| ++|+++.+.. ..=+++|- |. +|--+....+...++. +-| +..++ ...||+. T Consensus 305 --~~~l~G-~pV~~~~~~p----~~~~~~gd~~~--------~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~r~~~~ 369 (385) T protein:vir:18 305 --SNIMWG-LPVVPTKAQA----AGTFTVGGFDM--------ASQVWDRMDATVEVSREDRDNFVKNMLTILCEERLALA 369 (385) T ss_pred --Cceecc-eeeEEcCcCC----CCcEEEeeccc--------EEEEEEecceEEEEeccccchhhcCcEEEEEEEeeccE Confidence 256654 8999997653 22233331 10 0110111111111110 111 22333 3447776 Q ss_pred eecC--cccccCcccccCChhhhhhcc Q lcl|NC_015288. 431 VSNP--FVTTNGLYSGTPDGETLTPST 455 (468) Q Consensus 431 ~~nP--f~~~~~~~~~~~~~~~~~~~~ 455 (468) +.+| |+..+-.. ++ T Consensus 370 v~~~~a~~~~~~~a-----------a~ 385 (385) T protein:vir:18 370 HYRPTAIIKGTFSS-----------GS 385 (385) T ss_pred EecccceEEEEecc-----------CC Confidence 4455 33221111 11 No 30 >protein:vir:191 Length: 385 # NCBI annotation: major head subunit precursor # Family: family:all:585 # MgeID: mge:6 # MgeName: HK97 # Cross-refs: genbank:acc:NP_037701;genbank:gi:9634158;genbank:GeneID:1262530 Probab=93.36 E-value=0.0078 Score=32.13 Aligned_cols=333 Identities=13% Similarity=0.058 Sum_probs=130.0 Q ss_pred CcchHHHHHhhhhhhcCCccccccchhhhhhh-----hhhhhhHHHHHhhh----hhhhhh---cccccc---C------ Q lcl|NC_015288. 1 MFNAEHLQEKWSPVLNNEAANPIADRYKKAVT-----SVLLENQERFLREE----RGMLQE---VAVNSL---G------ 59 (468) Q Consensus 1 ~~~~~~l~~kw~p~l~~~~~~~i~~~~~~~~~-----~~llenq~~~~~e~----~~~l~e---~~~~~~---g------ 59 (468) |-+-++|.++..-+.+. +-++.+..+..+- ..=|++|.+.+.++ ...+.+ +..+.. + T Consensus 1 M~~l~el~~~~~~~~~e--~~~l~~~~~~e~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 78 (385) T protein:vir:19 1 MSELALIQKAIEESQQK--MTQLFDAQKAEIESTGQVSKQLQSDLMKVQEELTKSGTRLFDLEQKLASGAENPGEKKSFS 78 (385) T ss_pred ChHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccccccchhhhhH Confidence 77777888887766542 1112221111110 01112221111000 000110 000000 0 Q ss_pred ----------------cccc-cccccccc-cccccccc--cccceehhhhHHhhhhhhhhheeeeecCCccceeeeeeee Q lcl|NC_015288. 60 ----------------AGTV-SPGGSALG-SANTAGLA--GFDPVLISLVRRAMPNLMAYDVCGVQPMSGPTGLIFAMRS 119 (468) Q Consensus 60 ----------------~~~~-~~~~~~~~-st~tg~~~--~~~P~Lv~l~RRa~~~LIa~DI~GVQPmTGPTGLIFAMRs 119 (468) .... .....+.. ++..|.+. ...+.+ +++......-.+++.++||+++..-+.- T Consensus 79 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g~~i~~~~~~~i---i~~~~~~~~l~~~~~~~~~~~~~~~~~~--- 152 (385) T protein:vir:19 79 ERAAEELIKSWDGKQGTFGAKTFNKSLGSDADSAGSLIQPMQIPGI---IMPGLRRLTIRDLLAQGRTSSNALEYVR--- 152 (385) T ss_pred HHHHHHHHHHHHHhhccchhhHHHhhhccccccCCceecchhhhHH---HHHhhhccchhhhcceecccCcceEEEE--- Confidence 0000 00000000 11111111 112333 3444455567778888888876532211 Q ss_pred eecCCCCCccccccccccccccccccccccccccccccccCcccccccccccccccccccccccchhhhhccCCCCCccc Q lcl|NC_015288. 120 RYENQAGEEALFNEPDAGFTAGLDATTGAYTPRTGAGVGGDAEGNNPALLNDSSPGTYETPRGFSREDLEQAGDAGKLFR 199 (468) Q Consensus 120 rY~~qsG~EA~fnEa~t~fSG~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~a~~g~~t~~~gm~Ta~aE~lG~~g~~f~ 199 (468) +....+ .+ .| . +| +..++ T Consensus 153 -~~~~~~-~a-------~~----------------------------------------------v--~E-----~~~~~ 170 (385) T protein:vir:19 153 -EEVFTN-NA-------DV----------------------------------------------V--AE-----KALKP 170 (385) T ss_pred -EecCCc-ce-------ee----------------------------------------------e--cc-----Ccccc Confidence 111000 00 00 0 00 22345 Q ss_pred cceeEEEEEEEEeecccccceecHHHHHhHHHhhCCChhHHHHHHHHHHHHHHhhHHHHHHHhhhcchhhccccccceee Q lcl|NC_015288. 200 EMSFSIEKTSVTAKSRALKAEYTLELAQDLKAIHGLDAEQELANILSSEVLAEINREVVRRVYSVAKPGAANNVANAGIF 279 (468) Q Consensus 200 EMaFsIeK~tVtAKSRaLKAEYTvELAQDLkAiHGLDAE~ELanILStEImlEINREII~~l~~vA~~~k~~~~~~~Gv~ 279 (468) +-..++++++.+.|.-+-...+|-||.||-- +.++.|.+-|+..|..-+|+.||.- .- .+-...|++ T Consensus 171 ~~~~~~~~~~~~~~k~~~~~~is~ell~d~~-----~l~~~i~~~la~a~~~~~d~~~l~G----~g----~~~~~~Gi~ 237 (385) T protein:vir:19 171 ESDITFSKQTANVKTIAHWVQASRQVMDDAP-----MLQSYINNRLMYGLALKEEGQLLNG----DG----TGDNLEGLN 237 (385) T ss_pred ccccceeEEEEeeeeEEEeehhhHHHHhhHH-----HHHHHHHHHHHHHHHHHHHHHHHhc----cC----CCCcccccc Confidence 5555666777777777777889999999842 3566777777777777777776631 11 111223333 Q ss_pred eeecCCcchhHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEchhHHHHHhhccccccccc-cccccccccccccCCCc Q lcl|NC_015288. 280 DLDVDSNGRWSVEKFKGLLFQIERDCNAIAQDTRRGKGNFLICSADVASALAMAGVLDYSSG-LTGAGGPAIGTVDDTGN 358 (468) Q Consensus 280 Dl~~~~~~rw~~e~~~~l~~~i~~ean~i~q~T~rg~~n~~v~S~~va~~L~~~G~~~~~~~-~~~~~~~~~~~~D~t~~ 358 (468) .........+... ....+.....+.... ...-+..+-+|||++....|.. +..+.+ ...+. -.+.+ T Consensus 238 ~~~~~~~~~~~~~--~~~~~d~i~~~~~~l-~~~~~~~~~~~~~~~~~~~l~~---lkd~~G~~l~~~-----~~~~~-- 304 (385) T protein:vir:19 238 KVATAYDTSLNAT--GDTRADIIAHAIYQV-TESEFSASGIVLNPRDWHNIAL---LKDNEGRYIFGG-----PQAFT-- 304 (385) T ss_pred ccccccccccccc--ccchHHHHHHHHHhh-ccccCCCCEEEEcHHHHHHHHH---hhcCCCceeccC-----cccCC-- Confidence 2221111000000 000122112222211 2233456678999999988853 222211 11100 01111 Q ss_pred eeEEEecCCeEEEEccccccCCCcceEEEEE-ecCCcccceeEEccccccccccccCC---ccc-cceee--eeeecce- Q lcl|NC_015288. 359 LAVGTINGRIKVYVDPYAANLSDKHYYVVGY-KGTSPYDAGLFYCPYVPLQMVRSIDP---NNF-QPKIG--FKTRYGM- 430 (468) Q Consensus 359 ~~~G~l~~~~~vy~D~Ya~~~s~~dY~~vG~-Kg~~~~d~glfyaPYv~~~~~~~~Dp---~s~-qP~~g--~~tRY~l- 430 (468) .++|.| ++|+++.+.. ..=+++|- |. +|--+....+...++. +-| +..++ ...||+. T Consensus 305 --~~~l~G-~pV~~~~~~p----~~~~~~gd~~~--------~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~r~~~~ 369 (385) T protein:vir:19 305 --SNIMWG-LPVVPTKAQA----AGTFTVGGFDM--------ASQVWDRMDATVEVSREDRDNFVKNMLTILCEERLALA 369 (385) T ss_pred --Cceecc-eeeEEcCcCC----CCcEEEeeccc--------EEEEEEecceEEEEeccccchhhcCcEEEEEEEeeccE Confidence 256654 8999997653 22233331 10 0110111111111110 111 22333 3447776 Q ss_pred eecC--cccccCcccccCChhhhhhcc Q lcl|NC_015288. 431 VSNP--FVTTNGLYSGTPDGETLTPST 455 (468) Q Consensus 431 ~~nP--f~~~~~~~~~~~~~~~~~~~~ 455 (468) +.+| |+..+-.. ++ T Consensus 370 v~~~~a~~~~~~~a-----------a~ 385 (385) T protein:vir:19 370 HYRPTAIIKGTFSS-----------GS 385 (385) T ss_pred EecccceEEEEecc-----------CC Confidence 4455 33221111 11 No 31 >protein:vir:41 Length: 299 # NCBI annotation: major capsid protein # Family: family:all:507 # MgeID: mge:2 # MgeName: A118 # Cross-refs: genbank:acc:NP_463467;swissprot:trembl:q9t1b7;genbank:gi:16798789;uniprot:Q9T1B7;genbank:GeneID:922353 Probab=92.04 E-value=0.013 Score=30.91 Aligned_cols=278 Identities=12% Similarity=0.040 Sum_probs=125.9 Q ss_pred ccccccccccccccccccccceehhhhHHhhhhhhhhheeeeecCCccceeeeeeeeeecCCCCCccccccccccccccc Q lcl|NC_015288. 63 VSPGGSALGSANTAGLAGFDPVLISLVRRAMPNLMAYDVCGVQPMSGPTGLIFAMRSRYENQAGEEALFNEPDAGFTAGL 142 (468) Q Consensus 63 ~~~~~~~~~st~tg~~~~~~P~Lv~l~RRa~~~LIa~DI~GVQPmTGPTGLIFAMRsrY~~qsG~EA~fnEa~t~fSG~~ 142 (468) .+.++....++++++..--....-.++++..+..+-.+++-+-||++.+.-+- . . ++.++ .| T Consensus 1 ~g~~a~~~~~~~~~~~~iP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~~-~----~--~~~~a-------~~---- 62 (299) T protein:vir:41 1 MGFNPDTTTMQSAKTGSIPINISEQIITGVKNGSAAMKLAKAVPMTKPEEEFT-F----M--SGVGA-------FW---- 62 (299) T ss_pred CCcCCCcccccCCCceecchhHHHHHHHHHHhcchhhhhceeeecCCCcEEEE-E----E--cCCce-------ee---- Confidence 22333222333333322111122345666777888899999999988763221 1 0 11000 00 Q ss_pred cccccccccccccccccCcccccccccccccccccccccccchhhhhccCCCCCccccceeEEEEEEEEeecccccceec Q lcl|NC_015288. 143 DATTGAYTPRTGAGVGGDAEGNNPALLNDSSPGTYETPRGFSREDLEQAGDAGKLFREMSFSIEKTSVTAKSRALKAEYT 222 (468) Q Consensus 143 ~~~~~~~~~~~~~~~~~~~~g~~~~~~~~a~~g~~t~~~gm~Ta~aE~lG~~g~~f~EMaFsIeK~tVtAKSRaLKAEYT 222 (468) .+| +..++|...++++++...|..+-...+| T Consensus 63 --------------------------------------------v~E-----~~~~~~~~~~f~~v~l~~~k~~~~~~is 93 (299) T protein:vir:41 63 --------------------------------------------VDE-----AERIQTSKPTFTKAKMRSKKMGVIIPTT 93 (299) T ss_pred --------------------------------------------eec-----CccccccccceeEEEEeeEEEEEeehhh Confidence 001 2334555566678888888888888999 Q ss_pred HHHHHhHHHhhCCChhHHHHHHHHHHHHHHhhHHHHHHHhhhcchhhccccccceeeeeecCC-c-chhHHHHHHHHHHH Q lcl|NC_015288. 223 LELAQDLKAIHGLDAEQELANILSSEVLAEINREVVRRVYSVAKPGAANNVANAGIFDLDVDS-N-GRWSVEKFKGLLFQ 300 (468) Q Consensus 223 vELAQDLkAiHGLDAE~ELanILStEImlEINREII~~l~~vA~~~k~~~~~~~Gv~Dl~~~~-~-~rw~~e~~~~l~~~ 300 (468) -||.+|-. .|.++.|.+.|...|...+++.||.---+ + .+.|++-..... . .-...-.+..+ T Consensus 94 ~ell~ds~----~~~~~~i~~~l~~a~~~~~d~a~l~G~g~----~-----~~~gil~~~~~~~~~~~~~~~~~~~l--- 157 (299) T protein:vir:41 94 KENLNYSV----TNFFSLMQAEIVEAFYKKFDQAVFTGVES----P-----YNWNILKSATDASNLVEETANKYDDL--- 157 (299) T ss_pred HHHHhcCH----HHHHHHHHHHHHHHHHHHHHHHHhhcccC----c-----ccccccccccccceeeccccccHHHH--- Confidence 99999754 46788899999999999888888742110 1 111221110000 0 00000112222 Q ss_pred HHHHHHHHHHHhccCCccEEEEchhHHHHHhhccccccccccccccccccccccCCCceeEEEecCCeEEEEccccccCC Q lcl|NC_015288. 301 IERDCNAIAQDTRRGKGNFLICSADVASALAMAGVLDYSSGLTGAGGPAIGTVDDTGNLAVGTINGRIKVYVDPYAANLS 380 (468) Q Consensus 301 i~~ean~i~q~T~rg~~n~~v~S~~va~~L~~~G~~~~~~~~~~~~~~~~~~~D~t~~~~~G~l~~~~~vy~D~Ya~~~s 380 (468) ..-...+. .--++++.+||+++....|.. +...- |......+.++. .++|. +++|++........ T Consensus 158 -~~~~~~l~--~~~~~~~~~v~n~~~~~~L~~---lkd~~------G~~l~~~~~~~~--~~~l~-G~PV~~~~~~~~~~ 222 (299) T protein:vir:41 158 -NEAIGLIE--AEDLEPNGIATIRKQRVKYRS---TKDGN------GMPIFNTATSNG--VDDVL-GLPIAYTPKYTFGD 222 (299) T ss_pred -HHHHHhhh--cccCCcCEEEEcHHHHHHHHH---hhccC------CceeecCCcCCC--Cceec-ceeeEEecccCCCC Confidence 11112222 223456678999999988864 22211 011111111111 24665 47887765443211 Q ss_pred C--------cceEEEEEecCCcccceeEEccccccccccccCCcc-----ccc-eeee--eeecceee-cC--cccccCc Q lcl|NC_015288. 381 D--------KHYYVVGYKGTSPYDAGLFYCPYVPLQMVRSIDPNN-----FQP-KIGF--KTRYGMVS-NP--FVTTNGL 441 (468) Q Consensus 381 ~--------~dY~~vG~Kg~~~~d~glfyaPYv~~~~~~~~Dp~s-----~qP-~~g~--~tRY~l~~-nP--f~~~~~~ 441 (468) . +.++++|..++.+.+- -.+..+....||+. ||- .++| ..|+|..+ || |+..+.- T Consensus 223 ~~~~~~~gdfs~~~i~~~~~~~i~~------~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~~d~~v~~~~A~~~l~~~ 296 (299) T protein:vir:41 223 KDISELVGDWNQAYYGILRGVEYEI------LTEATLTTVADETGKPLNLAERDMAAIKATFEVGFMVVKDEAFSAVQPK 296 (299) T ss_pred CceEEEEEecccEEEEEecCcEEEE------eecccccccccccccchhhhhcCcEEEEEEEEeccEEecccceEEEEec Confidence 1 1122233333221110 00000111223322 222 2333 35666554 33 4433211 Q ss_pred ccccCChhhhhhccC Q lcl|NC_015288. 442 YSGTPDGETLTPSTN 456 (468) Q Consensus 442 ~~~~~~~~~~~~~~N 456 (468) .+| T Consensus 297 ------------aa~ 299 (299) T protein:vir:41 297 ------------AGN 299 (299) T ss_pred ------------cCC Confidence 222 No 32 >protein:vir:6212 Length: 434 # NCBI annotation: prohead protease # Family: family:all:21 # MgeID: mge:128 # MgeName: phBC6A52 # Cross-refs: genbank:acc:NP_852592;genbank:gi:31415852;genbank:GeneID:1489210 Probab=91.84 E-value=0.0068 Score=32.48 Aligned_cols=344 Identities=10% Similarity=0.053 Sum_probs=121.1 Q ss_pred CcchHHHHHhhhhhh-----------cCCcccc--ccchhhhhhhhhhhhh-------HHHHHhhhhhhhhhccccccCc Q lcl|NC_015288. 1 MFNAEHLQEKWSPVL-----------NNEAANP--IADRYKKAVTSVLLEN-------QERFLREERGMLQEVAVNSLGA 60 (468) Q Consensus 1 ~~~~~~l~~kw~p~l-----------~~~~~~~--i~~~~~~~~~~~llen-------q~~~~~e~~~~l~e~~~~~~g~ 60 (468) -+..+.-.++..-.. .+++..+ +....+++.....+.+ +.....|.+..+.+.+.. . T Consensus 58 ~le~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~e~r~a~~~~l~~---~ 134 (434) T protein:vir:62 58 KLEEKEKEEDPAKKKDDDPEKKEDPTAKENPNEKTELSEEQRSAISASIAAALSTKGHRTNKETEIRSVFANYIVG---N 134 (434) T ss_pred HHHHHHHHHHHHhhhcchhhhhcchhhhcchhhhHHHHHHHHHHHHHHHHhhhhhccccchHHHHHHHHHHHHhcc---c Confidence 111111112221111 1111100 1111111111111111 101111222221111100 0 Q ss_pred ccccccccccccccc--cccccccceeh--hhhHHhhhhhhhhheeeeecCCccceeeeeeeeeecCCCCCccccccccc Q lcl|NC_015288. 61 GTVSPGGSALGSANT--AGLAGFDPVLI--SLVRRAMPNLMAYDVCGVQPMSGPTGLIFAMRSRYENQAGEEALFNEPDA 136 (468) Q Consensus 61 ~~~~~~~~~~~st~t--g~~~~~~P~Lv--~l~RRa~~~LIa~DI~GVQPmTGPTGLIFAMRsrY~~qsG~EA~fnEa~t 136 (468) .... .....+++| |+.. =|.-+ .+++...+..+...++-|.|++|..- |-. +... +. + T Consensus 135 ~~~~--e~~a~~~~t~~GG~l--vP~~~~~~Ii~~l~~~~~i~~~~~~~~~~~~~~--~p~---~~~~-~~-a------- 196 (434) T protein:vir:62 135 IDEK--EARALGLVTGNGSVT--IPDFLSKEIITYAQEENFLRRLGTGVKTKENIK--YPV---LVKK-AE-A------- 196 (434) T ss_pred cchh--hhhhhccccccccee--cchhhHHHHHHhhhhhhhhhhhcceeccCCceE--EEE---EecC-Cc-c------- Confidence 0000 000001111 1111 12221 24454556667778888888765311 111 1100 00 0 Q ss_pred cccccccccccccccccccccccCcccccccccccccccccccccccchhhhhccCCCCCccccceeEEEEEEEEeeccc Q lcl|NC_015288. 137 GFTAGLDATTGAYTPRTGAGVGGDAEGNNPALLNDSSPGTYETPRGFSREDLEQAGDAGKLFREMSFSIEKTSVTAKSRA 216 (468) Q Consensus 137 ~fSG~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~a~~g~~t~~~gm~Ta~aE~lG~~g~~f~EMaFsIeK~tVtAKSRa 216 (468) .+ ...+..+...++-..++++++..+|.-+ T Consensus 197 ~~--------------------------------------------------~~~~~e~~~~~~~~~~f~~v~~~~~k~~ 226 (434) T protein:vir:62 197 QG--------------------------------------------------HKNERTNNEMPETDIEFDEIELSPTEFD 226 (434) T ss_pred cc--------------------------------------------------eecccccccccccccceeeEEeeheeeE Confidence 00 0000001222333345666677777777 Q ss_pred ccceecHHHHHhHHHhhCCChhHHHHHHHHHHHHHHhhHHHHHHHhhhcchhhccccccceeeeeecCCcchhHHHHHHH Q lcl|NC_015288. 217 LKAEYTLELAQDLKAIHGLDAEQELANILSSEVLAEINREVVRRVYSVAKPGAANNVANAGIFDLDVDSNGRWSVEKFKG 296 (468) Q Consensus 217 LKAEYTvELAQDLkAiHGLDAE~ELanILStEImlEINREII~~l~~vA~~~k~~~~~~~Gv~Dl~~~~~~rw~~e~~~~ 296 (468) -...+|-||.+|- .+|.+++|.+-|+..|..-+++.||.-==+. ....++.......+..+.... .+.... T Consensus 227 ~~~~iS~ell~ds----~~~l~~~i~~~la~~~~~~~d~~~l~G~G~~---~~~~g~~~~~~~~~~~~~~~~--~d~l~~ 297 (434) T protein:vir:62 227 ALATVTKKLLART----GLPIEQIVMDELKKAYVRKETQYMVNGDEAN---NINDGALAKKAVEFKTDEKNL--YDALVK 297 (434) T ss_pred eehhhHHHHHhcc----hHHHHHHHHHHHHHHHHHHHHHHHhccCCCC---ccccceeecccccccccccch--hhHHHH Confidence 7788999999995 3578999999999999999999888411000 000011110111111111111 122222 Q ss_pred HHHHHHHHHHHHHHHhccCCccEEEEchhHHHHHhhccccccccccccccccccccccCCC-ceeEEEecCCeEEEEccc Q lcl|NC_015288. 297 LLFQIERDCNAIAQDTRRGKGNFLICSADVASALAMAGVLDYSSGLTGAGGPAIGTVDDTG-NLAVGTINGRIKVYVDPY 375 (468) Q Consensus 297 l~~~i~~ean~i~q~T~rg~~n~~v~S~~va~~L~~~G~~~~~~~~~~~~~~~~~~~D~t~-~~~~G~l~~~~~vy~D~Y 375 (468) +.+.+ ..--+..+- .|+++.....|.. |..+- ++ .....+.+. .-.-.+|. |++|+++.+ T Consensus 298 l~~~l--------~~~~~~~a~-~v~n~~~~~~L~~---lkd~~---G~---~l~~~~~~~~~g~~~tl~-G~pV~~~~~ 358 (434) T protein:vir:62 298 MKNTP--------VKEVRKKAR-WVLNTAALTKIET---MKTDD---GF---PLLRPFNQAEGGIGYTLL-GFPVEEEDA 358 (434) T ss_pred HHhhc--------chhhhcCCE-EEEcHHHHHHHHH---hhccC---CC---EeeccCCCccCCCCceec-ceeeEEecC Confidence 22222 111223343 4778888877753 22221 11 001111100 00112454 477777755 Q ss_pred cccCC--CcceEEEEEecCCcccceeEEcccc-ccccccccCCc--cccceeeeeeec-ceeec-CcccccC-cccccCC Q lcl|NC_015288. 376 AANLS--DKHYYVVGYKGTSPYDAGLFYCPYV-PLQMVRSIDPN--NFQPKIGFKTRY-GMVSN-PFVTTNG-LYSGTPD 447 (468) Q Consensus 376 a~~~s--~~dY~~vG~Kg~~~~d~glfyaPYv-~~~~~~~~Dp~--s~qP~~g~~tRY-~l~~n-Pf~~~~~-~~~~~~~ 447 (468) +.... ...-|++| +-.. . +..... ...+.+..++- .-|=.+..+.|. |..++ ||+..-- ..-+.+. T Consensus 359 ~~~~~~~~~~~i~~G---dfs~--~-~i~~~~g~~~i~~~~~~~~~~~~v~~~~~~r~Dgk~i~~~~~~~~~~~~~~~~~ 432 (434) T protein:vir:62 359 IDIPDSPDTPVFYFG---DFSK--F-YIQDVIGSLEVQKLVELFSRTNRVGFRIWNLLDAQLIHSPFEVPVYKYVLKAPT 432 (434) T ss_pred ccCccCCCceEEEEe---eccc--e-EEEEeeceeEEEeehhhhcccCceEEEEEeeecceeecCcccceEEEEEeccCC Confidence 42111 11112222 1100 0 000010 11222222332 223334455666 44343 7775411 1111222 Q ss_pred hh Q lcl|NC_015288. 448 GE 449 (468) Q Consensus 448 ~~ 449 (468) ++ T Consensus 433 ~~ 434 (434) T protein:vir:62 433 GA 434 (434) T ss_pred CC Confidence 22 No 33 >protein:vir:3033 Length: 272 # NCBI annotation: major capsid protein # Family: family:all:522 # MgeID: mge:61 # MgeName: PhiNIH1.1 # Cross-refs: genbank:acc:NP_438146;genbank:gi:16271809;genbank:GeneID:929235 Probab=88.83 E-value=0.03 Score=28.94 Aligned_cols=269 Identities=14% Similarity=0.071 Sum_probs=115.6 Q ss_pred eeeeecCCCCCccccccccccccccccccccccccccccccccCcccccccccccccccccccccccchhhhhccCCCCC Q lcl|NC_015288. 117 MRSRYENQAGEEALFNEPDAGFTAGLDATTGAYTPRTGAGVGGDAEGNNPALLNDSSPGTYETPRGFSREDLEQAGDAGK 196 (468) Q Consensus 117 MRsrY~~qsG~EA~fnEa~t~fSG~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~a~~g~~t~~~gm~Ta~aE~lG~~g~ 196 (468) |= ...++.+ ..+..|-...+--..- .. ..........+. .+......+.+...--....++-.++ +. T Consensus 1 MA-~~~T~~~-~~~iPev~s~~v~~~~-~~--------~~~~~~~~~~~~-~~~g~~G~tv~iP~~~~~~~a~~v~e-g~ 67 (272) T protein:vir:30 1 MA-VGTTKMA-QMLDPEVLADMIDAEV-GK--------AIRFAPLAEVDT-TLEGQPGTTLTVPKWDYIGDAEDVAE-GE 67 (272) T ss_pred CC-Cccccch-heechHHHHHHHHHHH-HH--------Hhhhhccccccc-cccCCCCCEEEEEEecCCCCcccccC-CC Confidence 11 1111111 1111111000000000 00 000000000000 00000000111111111122222222 23 Q ss_pred ccccceeEEEEEEEEeecccccceecHHHHHhHHHhhCCChhHHHHHHHHHHHHHHhhHHHHHHHhhhcchhhccccccc Q lcl|NC_015288. 197 LFREMSFSIEKTSVTAKSRALKAEYTLELAQDLKAIHGLDAEQELANILSSEVLAEINREVVRRVYSVAKPGAANNVANA 276 (468) Q Consensus 197 ~f~EMaFsIeK~tVtAKSRaLKAEYTvELAQDLkAiHGLDAE~ELanILStEImlEINREII~~l~~vA~~~k~~~~~~~ 276 (468) .+..-..+.+..+++.|.++-.-++|=|++.+ -+-|..+++.+-|+..|+.+|+++|+..+...... +... T Consensus 68 ~i~~~~~~~~~~~~~~~~~~~~~~itd~~~~~----s~~d~~~~~~~~~~~~~a~~~d~~i~~~~~~a~~~-----~~~~ 138 (272) T protein:vir:30 68 AIPMTQLGFKKTTMTIKKAGKGVEITDEAILS----GYGDPVGQAAKQIVEAIDHKVDADVLDALSKSTQT-----VEAT 138 (272) T ss_pred cccccccccceEEEEeeeeeeeeeecHHHHhh----ccccHHHHHHHHHHHHHHHHHHHHHHHHhcccccc-----cccc Confidence 34444556777888888887666777666543 24799999999999999999999999876543211 1111 Q ss_pred eeeeeecCCcchhHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEchhHHHHHhhccccccccccccccccccccccCC Q lcl|NC_015288. 277 GIFDLDVDSNGRWSVEKFKGLLFQIERDCNAIAQDTRRGKGNFLICSADVASALAMAGVLDYSSGLTGAGGPAIGTVDDT 356 (468) Q Consensus 277 Gv~Dl~~~~~~rw~~e~~~~l~~~i~~ean~i~q~T~rg~~n~~v~S~~va~~L~~~G~~~~~~~~~~~~~~~~~~~D~t 356 (468) .. .+.+-.++.++..+ -...+++|++|++++.|......++... ++.+. +.. T Consensus 139 ~t------------~d~i~da~~~l~~~---------~~~~~~~vv~p~~~~~L~k~~~~~~~~~--~~~~~-----~~~ 190 (272) T protein:vir:30 139 AT------------VDGVSKALDIFNDE---------DDAETVIVMNPADASTLRLDAAKEWLGA--TEVGA-----NRV 190 (272) T ss_pred cC------------HHHHHHHHHHHhcc---------CCCccEEEEcHHHHHHHHHhcccccccc--ccccc-----ccc Confidence 11 11122222222211 2456799999999999965543333211 11110 111 Q ss_pred CceeEEEecCCeEEEEccccccCCCcceEEEEEecCCcccceeEEccccccccccccCCccccceeeeeeeccee-ecC- Q lcl|NC_015288. 357 GNLAVGTINGRIKVYVDPYAANLSDKHYYVVGYKGTSPYDAGLFYCPYVPLQMVRSIDPNNFQPKIGFKTRYGMV-SNP- 434 (468) Q Consensus 357 ~~~~~G~l~~~~~vy~D~Ya~~~s~~dY~~vG~Kg~~~~d~glfyaPYv~~~~~~~~Dp~s~qP~~g~~tRY~l~-~nP- 434 (468) .+-.+|++. |++|+++.+. |..=+++.-+|.- +++-..-+..+.. =|+.+++-.+-..-|||+. .|| T Consensus 191 ~~g~ig~i~-G~~Vi~s~~~----p~~t~~~~~~~a~----~~~~~~~~~ve~~--r~~~~~~~~i~~~~~~~~~v~~~~ 259 (272) T protein:vir:30 191 VSGVYGEVL-GVQIVRSRKC----PKGTAYMVRKGAL----RIMLKRNTMVETD--RDITKAINQIVANKHYGVYLYKAE 259 (272) T ss_pred ccccchhhc-CeeEEEcCCC----CcceEEEEcCCeE----EEEecCCceeeec--cccccceeEEEEEEEEEEEEEcCC Confidence 112357774 5799999554 3222222222211 1111222222211 2778888778778888875 355 Q ss_pred -cccccCcccccCChhhhhhc Q lcl|NC_015288. 435 -FVTTNGLYSGTPDGETLTPS 454 (468) Q Consensus 435 -f~~~~~~~~~~~~~~~~~~~ 454 (468) +...+-.+++ +- T Consensus 260 ~vv~~t~~~a~--------~~ 272 (272) T protein:vir:30 260 KAVKITLKDAA--------KK 272 (272) T ss_pred ceEEEEecccc--------cC Confidence 2222111110 00 No 34 >protein:vir:9820 Length: 272 # NCBI annotation: putative major capsid/head protein # Family: family:all:522 # MgeID: mge:176 # MgeName: 315.4 # Cross-refs: genbank:acc:NP_795582;genbank:gi:28876339;genbank:GeneID:1257858 Probab=88.83 E-value=0.03 Score=28.94 Aligned_cols=269 Identities=14% Similarity=0.071 Sum_probs=115.6 Q ss_pred eeeeecCCCCCccccccccccccccccccccccccccccccccCcccccccccccccccccccccccchhhhhccCCCCC Q lcl|NC_015288. 117 MRSRYENQAGEEALFNEPDAGFTAGLDATTGAYTPRTGAGVGGDAEGNNPALLNDSSPGTYETPRGFSREDLEQAGDAGK 196 (468) Q Consensus 117 MRsrY~~qsG~EA~fnEa~t~fSG~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~a~~g~~t~~~gm~Ta~aE~lG~~g~ 196 (468) |= ...++.+ ..+..|-...+--..- .. ..........+. .+......+.+...--....++-.++ +. T Consensus 1 MA-~~~T~~~-~~~iPev~s~~v~~~~-~~--------~~~~~~~~~~~~-~~~g~~G~tv~iP~~~~~~~a~~v~e-g~ 67 (272) T protein:vir:98 1 MA-VGTTKMA-QMLDPEVLADMIDAEV-GK--------AIRFAPLAEVDT-TLEGQPGTTLTVPKWDYIGDAEDVAE-GE 67 (272) T ss_pred CC-Cccccch-heechHHHHHHHHHHH-HH--------Hhhhhccccccc-cccCCCCCEEEEEEecCCCCcccccC-CC Confidence 11 1111111 1111111000000000 00 000000000000 00000000111111111122222222 23 Q ss_pred ccccceeEEEEEEEEeecccccceecHHHHHhHHHhhCCChhHHHHHHHHHHHHHHhhHHHHHHHhhhcchhhccccccc Q lcl|NC_015288. 197 LFREMSFSIEKTSVTAKSRALKAEYTLELAQDLKAIHGLDAEQELANILSSEVLAEINREVVRRVYSVAKPGAANNVANA 276 (468) Q Consensus 197 ~f~EMaFsIeK~tVtAKSRaLKAEYTvELAQDLkAiHGLDAE~ELanILStEImlEINREII~~l~~vA~~~k~~~~~~~ 276 (468) .+..-..+.+..+++.|.++-.-++|=|++.+ -+-|..+++.+-|+..|+.+|+++|+..+...... +... T Consensus 68 ~i~~~~~~~~~~~~~~~~~~~~~~itd~~~~~----s~~d~~~~~~~~~~~~~a~~~d~~i~~~~~~a~~~-----~~~~ 138 (272) T protein:vir:98 68 AIPMTQLGFKKTTMTIKKAGKGVEITDEAILS----GYGDPVGQAAKQIVEAIDHKVDADVLDALSKSTQT-----VEAT 138 (272) T ss_pred cccccccccceEEEEeeeeeeeeeecHHHHhh----ccccHHHHHHHHHHHHHHHHHHHHHHHHhcccccc-----cccc Confidence 34444556777888888887666777666543 24799999999999999999999999876543211 1111 Q ss_pred eeeeeecCCcchhHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEchhHHHHHhhccccccccccccccccccccccCC Q lcl|NC_015288. 277 GIFDLDVDSNGRWSVEKFKGLLFQIERDCNAIAQDTRRGKGNFLICSADVASALAMAGVLDYSSGLTGAGGPAIGTVDDT 356 (468) Q Consensus 277 Gv~Dl~~~~~~rw~~e~~~~l~~~i~~ean~i~q~T~rg~~n~~v~S~~va~~L~~~G~~~~~~~~~~~~~~~~~~~D~t 356 (468) .. .+.+-.++.++..+ -...+++|++|++++.|......++... ++.+. +.. T Consensus 139 ~t------------~d~i~da~~~l~~~---------~~~~~~~vv~p~~~~~L~k~~~~~~~~~--~~~~~-----~~~ 190 (272) T protein:vir:98 139 AT------------VDGVSKALDIFNDE---------DDAETVIVMNPADASTLRLDAAKEWLGA--TEVGA-----NRV 190 (272) T ss_pred cC------------HHHHHHHHHHHhcc---------CCCccEEEEcHHHHHHHHHhcccccccc--ccccc-----ccc Confidence 11 11122222222211 2456799999999999965543333211 11110 111 Q ss_pred CceeEEEecCCeEEEEccccccCCCcceEEEEEecCCcccceeEEccccccccccccCCccccceeeeeeeccee-ecC- Q lcl|NC_015288. 357 GNLAVGTINGRIKVYVDPYAANLSDKHYYVVGYKGTSPYDAGLFYCPYVPLQMVRSIDPNNFQPKIGFKTRYGMV-SNP- 434 (468) Q Consensus 357 ~~~~~G~l~~~~~vy~D~Ya~~~s~~dY~~vG~Kg~~~~d~glfyaPYv~~~~~~~~Dp~s~qP~~g~~tRY~l~-~nP- 434 (468) .+-.+|++. |++|+++.+. |..=+++.-+|.- +++-..-+..+.. =|+.+++-.+-..-|||+. .|| T Consensus 191 ~~g~ig~i~-G~~Vi~s~~~----p~~t~~~~~~~a~----~~~~~~~~~ve~~--r~~~~~~~~i~~~~~~~~~v~~~~ 259 (272) T protein:vir:98 191 VSGVYGEVL-GVQIVRSRKC----PKGTAYMVRKGAL----RIMLKRNTMVETD--RDITKAINQIVANKHYGVYLYKAE 259 (272) T ss_pred ccccchhhc-CeeEEEcCCC----CcceEEEEcCCeE----EEEecCCceeeec--cccccceeEEEEEEEEEEEEEcCC Confidence 112357774 5799999554 3222222222211 1111222222211 2778888778778888875 355 Q ss_pred -cccccCcccccCChhhhhhc Q lcl|NC_015288. 435 -FVTTNGLYSGTPDGETLTPS 454 (468) Q Consensus 435 -f~~~~~~~~~~~~~~~~~~~ 454 (468) +...+-.+++ +- T Consensus 260 ~vv~~t~~~a~--------~~ 272 (272) T protein:vir:98 260 KAVKITLKDAA--------KK 272 (272) T ss_pred ceEEEEecccc--------cC Confidence 2222111110 00 No 35 >protein:vir:9574 Length: 300 # NCBI annotation: gp40 # Family: family:all:966 # MgeID: mge:171 # MgeName: SM1 # Cross-refs: genbank:acc:NP_862879;genbank:gi:32469471;genbank:GeneID:1461316 Probab=88.81 E-value=0.03 Score=28.93 Aligned_cols=280 Identities=13% Similarity=0.080 Sum_probs=117.7 Q ss_pred cccccccccccccccee-hhhhHHhhhhhhhhheeeeecCCccceeeeeeeeeecCCCCCcccccccccccccccccccc Q lcl|NC_015288. 69 ALGSANTAGLAGFDPVL-ISLVRRAMPNLMAYDVCGVQPMSGPTGLIFAMRSRYENQAGEEALFNEPDAGFTAGLDATTG 147 (468) Q Consensus 69 ~~~st~tg~~~~~~P~L-v~l~RRa~~~LIa~DI~GVQPmTGPTGLIFAMRsrY~~qsG~EA~fnEa~t~fSG~~~~~~~ 147 (468) -.++|++++.. ..|.+ -.++.++.+..+-.+++.+.||++-..- |-.. .. +.++ .|- T Consensus 1 ma~~t~~~G~l-ip~~~~~~ii~~l~~~s~i~~l~~~~~~~~~~~~-~p~~---~~--~~~a-------~wv-------- 58 (300) T protein:vir:95 1 MSEAQLSKGNL-FNPELVTKVINKVKGHSSIAKLSPQKPIPFNGQR-EFVF---DF--DSDI-------DIV-------- 58 (300) T ss_pred CcccccCCcce-echhhHHHHHHHHHhhhhhhhhcceeeccCCceE-EEEE---ec--Ccce-------EEe-------- Confidence 22233332222 12222 1223334445566789999998764322 2211 10 1111 000 Q ss_pred ccccccccccccCcccccccccccccccccccccccchhhhhccCCCCCccccceeEEEEEEEEeecccccceecHHHHH Q lcl|NC_015288. 148 AYTPRTGAGVGGDAEGNNPALLNDSSPGTYETPRGFSREDLEQAGDAGKLFREMSFSIEKTSVTAKSRALKAEYTLELAQ 227 (468) Q Consensus 148 ~~~~~~~~~~~~~~~g~~~~~~~~a~~g~~t~~~gm~Ta~aE~lG~~g~~f~EMaFsIeK~tVtAKSRaLKAEYTvELAQ 227 (468) +| +...++...+++.++..+|.=+-...+|-||.+ T Consensus 59 ----------------------------------------~E-----g~~~~~s~~~f~~v~l~~~k~~~~~~iS~ell~ 93 (300) T protein:vir:95 59 ----------------------------------------AE-----NGKKTHGGVSLDPVTIVPLKVEYGARVSDEFLH 93 (300) T ss_pred ----------------------------------------eC-----CcccccccccceeeEeeeEEEEEeehhhHHHhc Confidence 00 123344445556666666666666778999875 Q ss_pred hHHHhhCCChhHHHHHHHHHHHHHHhhHHHHHHHhhhcchhhcccc----ccceeeeeecCCcchhHHHHHHHHHHHHHH Q lcl|NC_015288. 228 DLKAIHGLDAEQELANILSSEVLAEINREVVRRVYSVAKPGAANNV----ANAGIFDLDVDSNGRWSVEKFKGLLFQIER 303 (468) Q Consensus 228 DLkAiHGLDAE~ELanILStEImlEINREII~~l~~vA~~~k~~~~----~~~Gv~Dl~~~~~~rw~~e~~~~l~~~i~~ 303 (468) .... ..+|-+++|.+-|...|...+++.+|.-.. +..|....+ ...+.........+. . .+.... T Consensus 94 ~~~d-~~~~l~~~i~~~l~~aia~~~d~~~l~G~~--~~~g~~~~~~~~~~~~~~~~~~~~~~~~---~-----~~~~i~ 162 (300) T protein:vir:95 94 ASEE-AKVDMLTDFVEGFSKKLARGLDIMSIHGIN--PRTKQASTIIGDNCFDKKVTQTVPFKDT---N-----PDESME 162 (300) T ss_pred cCCC-CHHHHHHHHHHHHHHHHHHHHHHhhhhccc--CCCCCCcccccccccccccceeeccccc---c-----hHHHHH Confidence 3222 235677888888888888888888885421 111111111 111111111111110 0 111111 Q ss_pred HHHHHHHHhccCCccEEEEchhHHHHHhhccccccccccccccccccccccCCCceeEEEecCCeEEEEccccccC--CC Q lcl|NC_015288. 304 DCNAIAQDTRRGKGNFLICSADVASALAMAGVLDYSSGLTGAGGPAIGTVDDTGNLAVGTINGRIKVYVDPYAANL--SD 381 (468) Q Consensus 304 ean~i~q~T~rg~~n~~v~S~~va~~L~~~G~~~~~~~~~~~~~~~~~~~D~t~~~~~G~l~~~~~vy~D~Ya~~~--s~ 381 (468) .+-.. ...-.++.+-+|++|+....|.. +....+ ..+...+.++ -..++|.| ++|+++.+.... .+ T Consensus 163 ~~~~~-~~~~~~~~~~~vmn~~~~~~L~~---lkd~~G------~~i~~~~~~~-~~~~~l~G-~Pv~~s~~v~~~~~~~ 230 (300) T protein:vir:95 163 DAVGM-IDGSERDITGAILDPIFTTALSK---MKNAEG------GKLYPELAWG-GVPDAING-LAVDKNRTVSYSQTDP 230 (300) T ss_pred HHHHH-hhhcCCCccEEEECHHHHHHHHH---hhccCC------CeeccCcccc-CCCceecc-eeeEEecCCCCCCCCC Confidence 11111 12234566668899999887743 222111 1111111111 12467755 688887554211 12 Q ss_pred cceEEEEEecCCcccceeEEcccccccccc--ccCCcc-----c---cceeeeeeecceee-cC--cccccCcccc Q lcl|NC_015288. 382 KHYYVVGYKGTSPYDAGLFYCPYVPLQMVR--SIDPNN-----F---QPKIGFKTRYGMVS-NP--FVTTNGLYSG 444 (468) Q Consensus 382 ~dY~~vG~Kg~~~~d~glfyaPYv~~~~~~--~~Dp~s-----~---qP~~g~~tRY~l~~-nP--f~~~~~~~~~ 444 (468) .+.+++|= +..+++|.......+.. -.|+++ | |=.+=+..|+|..+ || |+... ..++ T Consensus 231 ~~~~~~GD-----f~~~~~~~~~~~~~~~v~~~~~~d~~~~~~f~~~~v~~r~~~r~d~~v~~~~a~~~l~-~~~g 300 (300) T protein:vir:95 231 KNTAIVGD-----FETMFKWGYAKEVPMEIIKYGDPDNSGRDLKGYNQIYIRCEAYIGWGIMDAASFARIV-KTGG 300 (300) T ss_pred ccEEEEee-----ccceEEEEEecccEEEEeeccCCCCcchhhhhcCcEEEEEEEeecceeecccceEEEe-cCCC Confidence 23333331 00112222222222211 123332 2 12333455777544 66 44322 1111 No 36 >protein:vir:1433 Length: 435 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:30 # MgeName: phiE125 # Cross-refs: genbank:acc:NP_536362;genbank:gi:17975167;genbank:GeneID:929171 Probab=88.38 E-value=0.033 Score=28.73 Aligned_cols=347 Identities=14% Similarity=0.076 Sum_probs=117.9 Q ss_pred cchHHHHHhhhhhhcC-Cccc-------cccchhhhhh---hhh--hhhhHHHHHhhhhhhhhhccccccCcc----ccc Q lcl|NC_015288. 2 FNAEHLQEKWSPVLNN-EAAN-------PIADRYKKAV---TSV--LLENQERFLREERGMLQEVAVNSLGAG----TVS 64 (468) Q Consensus 2 ~~~~~l~~kw~p~l~~-~~~~-------~i~~~~~~~~---~~~--llenq~~~~~e~~~~l~e~~~~~~g~~----~~~ 64 (468) |+-++|+|+++.+++. +.|- ++...-++.+ .+. =|++|.+.+.+..+.... .......+ ... T Consensus 1 M~i~eL~e~r~~~~~~~~~l~~~~~e~~~lt~ee~~~~~~l~~ei~~l~~~I~~~e~~~~~~~~-~~~~~~~~~~~~~~~ 79 (435) T protein:vir:14 1 MNVNELRRERAAVNQRVQALAQIEVGGTALSVEQQAEFDQLSSKFSELTAQIERAEAAERMAAA-AAVPVDPNPTAVAAP 79 (435) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh-hcccccchhhhhhhc Confidence 9999999999998763 1110 0111111111 111 122222222211111111 00000000 000 Q ss_pred cccc--------------------------------------------ccccccccccccccceehh------hhHHhhh Q lcl|NC_015288. 65 PGGS--------------------------------------------ALGSANTAGLAGFDPVLIS------LVRRAMP 94 (468) Q Consensus 65 ~~~~--------------------------------------------~~~st~tg~~~~~~P~Lv~------l~RRa~~ 94 (468) .... .....+++ .......|++ ++++..+ T Consensus 80 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-t~~~gg~~vP~~~~~~ii~~l~~ 158 (435) T protein:vir:14 80 AAAPVHAQPKALEVKGAKMARMVRALAAARGDAQLASKLAIERGFGEEVAMSLNTL-SPGAGGVLVPENLSSEVIELLRP 158 (435) T ss_pred cccccccccchhhhhHHHHHHHHHHHHhhcchhhHHHHHHHhhhhhhhhhhhcccC-CcCCCccccchhHHHHHHHHHhh Confidence 0000 00000000 0000001110 1111122 Q ss_pred hhhhhhe-eeeecCCccceeeeeeeeeecCCCCCccccccccccccccccccccccccccccccccCccccccccccccc Q lcl|NC_015288. 95 NLMAYDV-CGVQPMSGPTGLIFAMRSRYENQAGEEALFNEPDAGFTAGLDATTGAYTPRTGAGVGGDAEGNNPALLNDSS 173 (468) Q Consensus 95 ~LIa~DI-~GVQPmTGPTGLIFAMRsrY~~qsG~EA~fnEa~t~fSG~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~a~ 173 (468) +.+..++ +=+-||+... +-| |..... T Consensus 159 ~~~i~~~~~~~~~~~~~~-~~~--------------------------------------------------p~~~~~-- 185 (435) T protein:vir:14 159 KSVVRKLGARTLPLSNGN-ITI--------------------------------------------------PRLKGG-- 185 (435) T ss_pred hchhhhhcceeeecCCCc-eEE--------------------------------------------------EEEeCC-- Confidence 2222222 1111111100 000 000000 Q ss_pred ccccccccccchhhhhccCCCCCccccceeEEEEEEEEeecccccceecHHHHHhHHHhhCCChhHHHHHHHHHHHHHHh Q lcl|NC_015288. 174 PGTYETPRGFSREDLEQAGDAGKLFREMSFSIEKTSVTAKSRALKAEYTLELAQDLKAIHGLDAEQELANILSSEVLAEI 253 (468) Q Consensus 174 ~g~~t~~~gm~Ta~aE~lG~~g~~f~EMaFsIeK~tVtAKSRaLKAEYTvELAQDLkAiHGLDAE~ELanILStEImlEI 253 (468) +..+- . +| +..+++-.-++++++..++.-+-....|-||.+|-. .+.+.|+.|.+-|+..|...+ T Consensus 186 ~~a~~------v--~E-----~~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~ds~--~~~~l~~~i~~~l~~ai~~~~ 250 (435) T protein:vir:14 186 AIVGY------I--GA-----DTDIPTTQQQFDDLKLTAKKMAALVPIANDLIKYAG--VNPNVDQIVVGDLTAAIGARE 250 (435) T ss_pred cceee------e--cc-----CccccccccceeEEEeeeEEEEEeehhhHHHHHhhc--cCHHHHHHHHHHHHHHHHHHH Confidence 00000 0 01 223444555666677777777777889999999932 123477778888888888777 Q ss_pred hHHHHHHHhhhcchhhccccccceeeeeecCCcchh--HHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEchhHHHHHh Q lcl|NC_015288. 254 NREVVRRVYSVAKPGAANNVANAGIFDLDVDSNGRW--SVEKFKGLLFQIERDCNAIAQDTRRGKGNFLICSADVASALA 331 (468) Q Consensus 254 NREII~~l~~vA~~~k~~~~~~~Gv~Dl~~~~~~rw--~~e~~~~l~~~i~~ean~i~q~T~rg~~n~~v~S~~va~~L~ 331 (468) |+-||.- .-.+-.+.|++.......--. ....+....-.+.+-...+..--......-+|+++.....|. T Consensus 251 d~a~l~G--------~G~~~~p~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~v~n~~~~~~L~ 322 (435) T protein:vir:14 251 DKAFIRD--------DGTANTPKGLRFWALPSNVITASDASTLQKIETDLGKVILALENADANLTQPGWIMAPRTFRFLE 322 (435) T ss_pred HHHhhcc--------CCCCccccceeecccccceeccccccchhhHHHHHHHHHHHhhhccccccCCEEEEcHHHHHHHH Confidence 7777631 111112445543221110000 000010000011111111111111223345688999998885 Q ss_pred hccccccccccccccccccccccCCCceeEEEecCCeEEEEccccccC----CC--------cceEEEEEecCCccccee Q lcl|NC_015288. 332 MAGVLDYSSGLTGAGGPAIGTVDDTGNLAVGTINGRIKVYVDPYAANL----SD--------KHYYVVGYKGTSPYDAGL 399 (468) Q Consensus 332 ~~G~~~~~~~~~~~~~~~~~~~D~t~~~~~G~l~~~~~vy~D~Ya~~~----s~--------~dY~~vG~Kg~~~~d~gl 399 (468) . +..+.+ + ... .+.+ -|+|+| ++|+++.+.-.+ .+ +.++++|..+... + T Consensus 323 ~---lkd~~G---~---~l~-~~~~----~g~l~G-~Pv~~~~~~p~~~~~~~~~~~i~~gd~s~~~i~~~~~~~----~ 383 (435) T protein:vir:14 323 G---LRDGNG---N---KVY-PELA----NGMLKG-YPVGKTTQVPINLGETGKESEIYFTDFGDVFIGEEETLE----I 383 (435) T ss_pred H---hhccCC---c---eec-cCCC----CCeeec-ceeEeeccccccccCCCccceEEEeecccEEEEEecccE----E Confidence 4 222211 1 111 1112 256654 688877543110 01 1112233332222 2 Q ss_pred EEccccccccccccCCccc---cceeeeeeecceee-cC--cccccCcccccCChhhhhh Q lcl|NC_015288. 400 FYCPYVPLQMVRSIDPNNF---QPKIGFKTRYGMVS-NP--FVTTNGLYSGTPDGETLTP 453 (468) Q Consensus 400 fyaPYv~~~~~~~~Dp~s~---qP~~g~~tRY~l~~-nP--f~~~~~~~~~~~~~~~~~~ 453 (468) -.+||.......+..-..| |=.+=...|++..+ +| |+ ...+..|.+ T Consensus 384 ~~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~~~~~~a~~--------~l~~~~~~~ 435 (435) T protein:vir:14 384 DYSKEATYKDADGHMVSAFQRDQTLIRVIAKNDFGPRHVESIA--------VLAGVAWGA 435 (435) T ss_pred EEeccccccccccchhhhhhcChhheeeeeeeCceeecccceE--------EEecCCCCC Confidence 2333321110000000001 12222344555432 22 22 122333333 No 37 >protein:vir:104256 Length: 458 # NCBI annotation: major head protein precursor # Family: family:all:27070 # MgeID: mge:1504 # MgeName: T5 # Cross-refs: genbank:acc:YP_006977;genbank:gi:46401878;genbank:GeneID:2777673 Probab=87.21 E-value=0.04 Score=28.23 Aligned_cols=337 Identities=15% Similarity=0.096 Sum_probs=118.9 Q ss_pred CcchHHHHHhhhhh-----------hc-CCccccccchhhhhhhhhhhhhHHHHHhhhhhhhhhccccccC---c-cccc Q lcl|NC_015288. 1 MFNAEHLQEKWSPV-----------LN-NEAANPIADRYKKAVTSVLLENQERFLREERGMLQEVAVNSLG---A-GTVS 64 (468) Q Consensus 1 ~~~~~~l~~kw~p~-----------l~-~~~~~~i~~~~~~~~~~~llenq~~~~~e~~~~l~e~~~~~~g---~-~~~~ 64 (468) .-..+...++...- ++ .+..+...+..++.. ..-.++-+.++. .+ .+.+...+... . .-.. T Consensus 81 ~~~~a~~~e~~~~~~~~~~~~~~~~~~~~e~~~~~~~~~~~~~-~~~~~~~~~~~e-~~-~~~~~~~~~~~~~~~~~~~~ 157 (458) T protein:vir:10 81 NELFAQTVEKQQETIVGLQDEIKSLLTAREGRSFVGDSVAKAL-YGTQENFEDEVE-KL-VLLSYVMEKGVFETEHGQRH 157 (458) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhccc-hhhhhhHHHHHH-HH-HHHHHHHhhccchhhhhhhh Confidence 00000111111100 00 000111101000000 000011111110 00 01110000000 0 0000 Q ss_pred ccccccccccccccc----cccceehhhhHHhhhhhhhhheeeeecCCccceeeeeeeeeecCCCCCccccccccccccc Q lcl|NC_015288. 65 PGGSALGSANTAGLA----GFDPVLISLVRRAMPNLMAYDVCGVQPMSGPTGLIFAMRSRYENQAGEEALFNEPDAGFTA 140 (468) Q Consensus 65 ~~~~~~~st~tg~~~----~~~P~Lv~l~RRa~~~LIa~DI~GVQPmTGPTGLIFAMRsrY~~qsG~EA~fnEa~t~fSG 140 (468) ........+.+.+.. .+.+.+ +.++.+..+..+++-++||+++..-++ ... .+..+ .|-+ T Consensus 158 ~~a~~~~~~~~~g~~~ip~~~~~~i---i~~~~~~~~l~~~~~~~~~~~~~~~~~-~~~-----~~~~a-------~~v~ 221 (458) T protein:vir:10 158 LKAVNQSSSVEVSSESYETIFSQRI---IRDLQKELVVGALFEELPMSSKILTML-VEP-----DAGKA-------TWVA 221 (458) T ss_pred hhhhhhcccCccccceehhhHhHHH---HHHHHhhhhHHhhcceeecCCcceEEE-Eec-----CCcce-------eecc Confidence 000111111111111 122233 344446667889999999988643222 110 00000 0000 Q ss_pred cccccccccccccccccccCcccccccccccccccccccccccchhhhhccCCCCCccccceeEEEEEEEEeecccccce Q lcl|NC_015288. 141 GLDATTGAYTPRTGAGVGGDAEGNNPALLNDSSPGTYETPRGFSREDLEQAGDAGKLFREMSFSIEKTSVTAKSRALKAE 220 (468) Q Consensus 141 ~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~a~~g~~t~~~gm~Ta~aE~lG~~g~~f~EMaFsIeK~tVtAKSRaLKAE 220 (468) ..... .. +.....-..+++++++.++.-+-... T Consensus 222 e~~~~----------------------------------------------~~-~~~~~~~~~~~~~i~~~~~k~~~~v~ 254 (458) T protein:vir:10 222 ASTYG----------------------------------------------TD-TTTGEEVKGALKEIHFSTYKLAAKSF 254 (458) T ss_pred ccccc----------------------------------------------cc-ccccccccccceeeEeeeeeEEeeeh Confidence 00000 00 00000111123444555555555578 Q ss_pred ecHHHHHhHHHhhCCChhHHHHHHHHHHHHHHhhHHHHHHHhhhcchhhccccccceeeeeecCCc------------ch Q lcl|NC_015288. 221 YTLELAQDLKAIHGLDAEQELANILSSEVLAEINREVVRRVYSVAKPGAANNVANAGIFDLDVDSN------------GR 288 (468) Q Consensus 221 YTvELAQDLkAiHGLDAE~ELanILStEImlEINREII~~l~~vA~~~k~~~~~~~Gv~Dl~~~~~------------~r 288 (468) +|-||.+|-- .|.+++|.+-|...|..-||+.||.- .-.+ -+.|++......+ .- T Consensus 255 is~ell~ds~----~~~~~~i~~~l~~~i~~~~d~~~l~G----~G~~-----~p~Gi~~~~~~~~~~~~~~~~~~~~~~ 321 (458) T protein:vir:10 255 ITDETEEDAI----FSLLPLLRKRLIEAHAVSIEEAFMTG----DGSG-----KPKGLLTLASEDSAKVVTEAKADGSVL 321 (458) T ss_pred hhHHHHhcch----HHHHHHHHHHHHHHHHHHHHHHhhcC----CCCC-----ccceeeecccccccceeeccccccccc Confidence 8999988843 46788899999999999999888751 0001 1233332211110 00 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEchhHHHHHhhccccccccccccccccccccccCCCce---eEEEec Q lcl|NC_015288. 289 WSVEKFKGLLFQIERDCNAIAQDTRRGKGNFLICSADVASALAMAGVLDYSSGLTGAGGPAIGTVDDTGNL---AVGTIN 365 (468) Q Consensus 289 w~~e~~~~l~~~i~~ean~i~q~T~rg~~n~~v~S~~va~~L~~~G~~~~~~~~~~~~~~~~~~~D~t~~~---~~G~l~ 365 (468) ...+....+.+ .+... -+ ....+|+++.....|.. +..+.+. .+...+.+... -.++|+ T Consensus 322 ~~~~~i~~~~~-------~l~~~-~~-~~~~~v~~~~~~~~l~~---lkd~~G~------~i~~~~~~~~~~~~~~~~l~ 383 (458) T protein:vir:10 322 VTAKTISKLRR-------KLGRH-GL-KLSKLVLIVSMDAYYDL---LEDEEWQ------DVAQVGNDSVKLQGQVGRIY 383 (458) T ss_pred ccHHHHHHHHH-------hhhhh-hc-CCCEEEEcHHHHHHHHh---hcccCCc------eeeccccccccccCcCceec Confidence 11121111211 12111 11 34567889988877753 2222110 00011111111 123565 Q ss_pred CCeEEEEccccccC-CCcceEEEEEecCCcccceeEEccccccccccccCCccccceeeee--eecce-eecC--ccccc Q lcl|NC_015288. 366 GRIKVYVDPYAANL-SDKHYYVVGYKGTSPYDAGLFYCPYVPLQMVRSIDPNNFQPKIGFK--TRYGM-VSNP--FVTTN 439 (468) Q Consensus 366 ~~~~vy~D~Ya~~~-s~~dY~~vG~Kg~~~~d~glfyaPYv~~~~~~~~Dp~s~qP~~g~~--tRY~l-~~nP--f~~~~ 439 (468) |++|+++.+.... ...+.++..++ + +.++.. -..+....||-+-...++|. .|.|+ +.+| |+.+. T Consensus 384 -G~pv~~~~~~p~~~~~~~~~~~~f~-~-----~~~~~~--~~~~~v~~d~~~~~~~~~~~~~~r~~~~v~~~~a~v~~~ 454 (458) T protein:vir:10 384 -GLPVVVSEYFPAKANSAEFAVIVYK-D-----NFVMPR--QRAVTVERERQAGKQRDAYYVTQRVNLQRYFANGVVSGT 454 (458) T ss_pred -ceeeEEccccccccCCcceEEEEec-c-----cEEEEE--eeceEEEeecccCCCceEEEEEEEecceEecccceEEEe Confidence 6899998654221 11232222222 1 011110 11122234555445556665 46643 3455 44332 Q ss_pred Cccc Q lcl|NC_015288. 440 GLYS 443 (468) Q Consensus 440 ~~~~ 443 (468) --.+ T Consensus 455 ~aa~ 458 (458) T protein:vir:10 455 YAAS 458 (458) T ss_pred eccC Confidence 2112 No 38 >protein:vir:105905 Length: 304 # NCBI annotation: major capsid protein # Family: family:all:507 # MgeID: mge:1514 # MgeName: phiETA3 # Cross-refs: genbank:acc:YP_001004375;genbank:gi:122891830;genbank:GeneID:4712376 Probab=82.98 E-value=0.072 Score=26.84 Aligned_cols=280 Identities=11% Similarity=0.053 Sum_probs=118.5 Q ss_pred cCccccccccccccccccccccccccee-hhhhHHhhhhhhhhheeeeecCCccceeeeeeeeeecCCCCCccccccccc Q lcl|NC_015288. 58 LGAGTVSPGGSALGSANTAGLAGFDPVL-ISLVRRAMPNLMAYDVCGVQPMSGPTGLIFAMRSRYENQAGEEALFNEPDA 136 (468) Q Consensus 58 ~g~~~~~~~~~~~~st~tg~~~~~~P~L-v~l~RRa~~~LIa~DI~GVQPmTGPTGLIFAMRsrY~~qsG~EA~fnEa~t 136 (468) +-.+... +.....|+.|+. ..-+.+ -.++++..++.+..+++-+=||++.+--|. ++.. +.++ T Consensus 1 ma~~~~~--~~~~~~t~~gg~-lip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~ip----~~~~--~~~a------- 64 (304) T protein:vir:10 1 MATPTYT--PGNVILSDFKNG-VIPAEQGTLIMKDIMANSAIMKLAKNEPMTAQKKKFT----YLAK--GVGA------- 64 (304) T ss_pred Ccccccc--cccccccCCCce-ecchhHHHHHHHHHHhccchhhhcceeeccCCceEEE----EEeC--Ccce------- Confidence 1111111 111112222221 112222 235555666777788888888877542221 1110 0000 Q ss_pred cccccccccccccccccccccccCcccccccccccccccccccccccchhhhhccCCCCCccccceeEEEEEEEEeeccc Q lcl|NC_015288. 137 GFTAGLDATTGAYTPRTGAGVGGDAEGNNPALLNDSSPGTYETPRGFSREDLEQAGDAGKLFREMSFSIEKTSVTAKSRA 216 (468) Q Consensus 137 ~fSG~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~a~~g~~t~~~gm~Ta~aE~lG~~g~~f~EMaFsIeK~tVtAKSRa 216 (468) .|- +| +..+++-.-+++++++..|..+ T Consensus 65 ~~v------------------------------------------------~E-----~~~~~~~~~~~~~i~~~~~k~~ 91 (304) T protein:vir:10 65 YWV------------------------------------------------SE-----TERIQTSKPEYAQAEMEAKKIG 91 (304) T ss_pred EEe------------------------------------------------ec-----CcccccccceeeEEEEEEEEEE Confidence 000 01 1233444455666777777777 Q ss_pred ccceecHHHHHhHHHhhCCChhHHHHHHHHHHHHHHhhHHHHHHHhhhcchhhccccccceeeeeecC--CcchhHHHHH Q lcl|NC_015288. 217 LKAEYTLELAQDLKAIHGLDAEQELANILSSEVLAEINREVVRRVYSVAKPGAANNVANAGIFDLDVD--SNGRWSVEKF 294 (468) Q Consensus 217 LKAEYTvELAQDLkAiHGLDAE~ELanILStEImlEINREII~~l~~vA~~~k~~~~~~~Gv~Dl~~~--~~~rw~~e~~ 294 (468) -...+|-||.+|- .+|.++.|.+-|...|...||+.+|.---+ ++..+....+++.-... ..+-...-.+ T Consensus 92 ~~~~iS~ell~ds----~~~l~~~i~~~l~~~ia~~~d~~~l~G~g~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 163 (304) T protein:vir:10 92 VIIPLSKEFLKWT----AKDFFNEVKPLIAEAFYKAFDQAVIFGTKS----PYNTSTSGKPLVEGAEEKGNVVTDTNNLY 163 (304) T ss_pred EeehhhHHHHhcc----hHHHHHHHHHHHHHHHHHHHHhhheeccCC----CcccccccccccccccccccccccccchH Confidence 7788999999875 367888999999999999998888753111 00111111121110000 0000111112 Q ss_pred HHHHHHHHHHHHHHHHHhccCCccEEEEchhHHHHHhhccccccccccccccccccccccCCCceeEEEecCCeEEEEcc Q lcl|NC_015288. 295 KGLLFQIERDCNAIAQDTRRGKGNFLICSADVASALAMAGVLDYSSGLTGAGGPAIGTVDDTGNLAVGTINGRIKVYVDP 374 (468) Q Consensus 295 ~~l~~~i~~ean~i~q~T~rg~~n~~v~S~~va~~L~~~G~~~~~~~~~~~~~~~~~~~D~t~~~~~G~l~~~~~vy~D~ 374 (468) ..+ +.+-.-.... -....-++|++.....|.. +..+.+ + + .. .. ..|+|. +++||++. T Consensus 164 ~~i-----~~~~~~l~~~-~~~~~~~v~~~~~~~~L~~---lkd~~G---~-~--l~--~~----~~~~l~-G~PV~~~~ 221 (304) T protein:vir:10 164 VDL-----SALMATIEDE-ELDPNGVLTTRSFRSKMRN---ALDAND---R-P--LF--DA----NGNEIM-GLPLSYTG 221 (304) T ss_pred HHH-----HHHHHHhhhc-cCCcCEEEEcHHHHHHHHH---hhccCC---c-E--ee--cC----CCcccc-ceeeEEec Confidence 222 2222222222 2233457899999988863 222111 0 0 00 00 124554 57888876 Q ss_pred ccccCCC--------cceEEEEEecCCcccceeEEccccccc--cccccCCcc-----cc---ceeeeeeecceee-cC- Q lcl|NC_015288. 375 YAANLSD--------KHYYVVGYKGTSPYDAGLFYCPYVPLQ--MVRSIDPNN-----FQ---PKIGFKTRYGMVS-NP- 434 (468) Q Consensus 375 Ya~~~s~--------~dY~~vG~Kg~~~~d~glfyaPYv~~~--~~~~~Dp~s-----~q---P~~g~~tRY~l~~-nP- 434 (468) +.....+ +.++++|..++.+.+ ...+.. +....|++. || =.+=...||++.+ || T Consensus 222 ~~~~~~~~~~~~~gd~~~~~~~~~~~~~i~------~~~e~~~~~~~~~~~~g~~~~~f~~~~~~~r~~~r~~~~v~~~~ 295 (304) T protein:vir:10 222 ADVYDKKKSLALMGDWDYARYGILQGIEYA------ISEDATLTTLQASDASGQPVSLFERDMFALRATMHIAYMNVKPE 295 (304) T ss_pred ccccCCCCcEEEEEehhhEEEEEecceEEE------EeecceeeeecccccCccchhhhhcCcEEEEEEEEeccEeeccc Confidence 5432221 222333433322211 000111 111112221 22 2233345776543 33 Q ss_pred -cccccCcccccCChh Q lcl|NC_015288. 435 -FVTTNGLYSGTPDGE 449 (468) Q Consensus 435 -f~~~~~~~~~~~~~~ 449 (468) |+..+ +.+ T Consensus 296 a~~~l~-------~a~ 304 (304) T protein:vir:10 296 AFATLK-------PTE 304 (304) T ss_pred ceEEEE-------ecC Confidence 33211 111 No 39 >protein:vir:94142 Length: 304 # NCBI annotation: ORF013 # Family: family:all:507 # MgeID: mge:1494 # MgeName: 96 # Cross-refs: genbank:acc:YP_240234;genbank:gi:66395898;genbank:GeneID:5133311 Probab=82.98 E-value=0.072 Score=26.84 Aligned_cols=280 Identities=11% Similarity=0.053 Sum_probs=118.5 Q ss_pred cCccccccccccccccccccccccccee-hhhhHHhhhhhhhhheeeeecCCccceeeeeeeeeecCCCCCccccccccc Q lcl|NC_015288. 58 LGAGTVSPGGSALGSANTAGLAGFDPVL-ISLVRRAMPNLMAYDVCGVQPMSGPTGLIFAMRSRYENQAGEEALFNEPDA 136 (468) Q Consensus 58 ~g~~~~~~~~~~~~st~tg~~~~~~P~L-v~l~RRa~~~LIa~DI~GVQPmTGPTGLIFAMRsrY~~qsG~EA~fnEa~t 136 (468) +-.+... +.....|+.|+. ..-+.+ -.++++..++.+..+++-+=||++.+--|. ++.. +.++ T Consensus 1 ma~~~~~--~~~~~~t~~gg~-lip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~ip----~~~~--~~~a------- 64 (304) T protein:vir:94 1 MATPTYT--PGNVILSDFKNG-VIPAEQGTLIMKDIMANSAIMKLAKNEPMTAQKKKFT----YLAK--GVGA------- 64 (304) T ss_pred Ccccccc--cccccccCCCce-ecchhHHHHHHHHHHhccchhhhcceeeccCCceEEE----EEeC--Ccce------- Confidence 1111111 111112222221 112222 235555666777788888888877542221 1110 0000 Q ss_pred cccccccccccccccccccccccCcccccccccccccccccccccccchhhhhccCCCCCccccceeEEEEEEEEeeccc Q lcl|NC_015288. 137 GFTAGLDATTGAYTPRTGAGVGGDAEGNNPALLNDSSPGTYETPRGFSREDLEQAGDAGKLFREMSFSIEKTSVTAKSRA 216 (468) Q Consensus 137 ~fSG~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~a~~g~~t~~~gm~Ta~aE~lG~~g~~f~EMaFsIeK~tVtAKSRa 216 (468) .|- +| +..+++-.-+++++++..|..+ T Consensus 65 ~~v------------------------------------------------~E-----~~~~~~~~~~~~~i~~~~~k~~ 91 (304) T protein:vir:94 65 YWV------------------------------------------------SE-----TERIQTSKPEYAQAEMEAKKIG 91 (304) T ss_pred EEe------------------------------------------------ec-----CcccccccceeeEEEEEEEEEE Confidence 000 01 1233444455666777777777 Q ss_pred ccceecHHHHHhHHHhhCCChhHHHHHHHHHHHHHHhhHHHHHHHhhhcchhhccccccceeeeeecC--CcchhHHHHH Q lcl|NC_015288. 217 LKAEYTLELAQDLKAIHGLDAEQELANILSSEVLAEINREVVRRVYSVAKPGAANNVANAGIFDLDVD--SNGRWSVEKF 294 (468) Q Consensus 217 LKAEYTvELAQDLkAiHGLDAE~ELanILStEImlEINREII~~l~~vA~~~k~~~~~~~Gv~Dl~~~--~~~rw~~e~~ 294 (468) -...+|-||.+|- .+|.++.|.+-|...|...||+.+|.---+ ++..+....+++.-... ..+-...-.+ T Consensus 92 ~~~~iS~ell~ds----~~~l~~~i~~~l~~~ia~~~d~~~l~G~g~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 163 (304) T protein:vir:94 92 VIIPLSKEFLKWT----AKDFFNEVKPLIAEAFYKAFDQAVIFGTKS----PYNTSTSGKPLVEGAEEKGNVVTDTNNLY 163 (304) T ss_pred EeehhhHHHHhcc----hHHHHHHHHHHHHHHHHHHHHhhheeccCC----CcccccccccccccccccccccccccchH Confidence 7788999999875 367888999999999999998888753111 00111111121110000 0000111112 Q ss_pred HHHHHHHHHHHHHHHHHhccCCccEEEEchhHHHHHhhccccccccccccccccccccccCCCceeEEEecCCeEEEEcc Q lcl|NC_015288. 295 KGLLFQIERDCNAIAQDTRRGKGNFLICSADVASALAMAGVLDYSSGLTGAGGPAIGTVDDTGNLAVGTINGRIKVYVDP 374 (468) Q Consensus 295 ~~l~~~i~~ean~i~q~T~rg~~n~~v~S~~va~~L~~~G~~~~~~~~~~~~~~~~~~~D~t~~~~~G~l~~~~~vy~D~ 374 (468) ..+ +.+-.-.... -....-++|++.....|.. +..+.+ + + .. .. ..|+|. +++||++. T Consensus 164 ~~i-----~~~~~~l~~~-~~~~~~~v~~~~~~~~L~~---lkd~~G---~-~--l~--~~----~~~~l~-G~PV~~~~ 221 (304) T protein:vir:94 164 VDL-----SALMATIEDE-ELDPNGVLTTRSFRSKMRN---ALDAND---R-P--LF--DA----NGNEIM-GLPLSYTG 221 (304) T ss_pred HHH-----HHHHHHhhhc-cCCcCEEEEcHHHHHHHHH---hhccCC---c-E--ee--cC----CCcccc-ceeeEEec Confidence 222 2222222222 2233457899999988863 222111 0 0 00 00 124554 57888876 Q ss_pred ccccCCC--------cceEEEEEecCCcccceeEEccccccc--cccccCCcc-----cc---ceeeeeeecceee-cC- Q lcl|NC_015288. 375 YAANLSD--------KHYYVVGYKGTSPYDAGLFYCPYVPLQ--MVRSIDPNN-----FQ---PKIGFKTRYGMVS-NP- 434 (468) Q Consensus 375 Ya~~~s~--------~dY~~vG~Kg~~~~d~glfyaPYv~~~--~~~~~Dp~s-----~q---P~~g~~tRY~l~~-nP- 434 (468) +.....+ +.++++|..++.+.+ ...+.. +....|++. || =.+=...||++.+ || T Consensus 222 ~~~~~~~~~~~~~gd~~~~~~~~~~~~~i~------~~~e~~~~~~~~~~~~g~~~~~f~~~~~~~r~~~r~~~~v~~~~ 295 (304) T protein:vir:94 222 ADVYDKKKSLALMGDWDYARYGILQGIEYA------ISEDATLTTLQASDASGQPVSLFERDMFALRATMHIAYMNVKPE 295 (304) T ss_pred ccccCCCCcEEEEEehhhEEEEEecceEEE------EeecceeeeecccccCccchhhhhcCcEEEEEEEEeccEeeccc Confidence 5432221 222333433322211 000111 111112221 22 2233345776543 33 Q ss_pred -cccccCcccccCChh Q lcl|NC_015288. 435 -FVTTNGLYSGTPDGE 449 (468) Q Consensus 435 -f~~~~~~~~~~~~~~ 449 (468) |+..+ +.+ T Consensus 296 a~~~l~-------~a~ 304 (304) T protein:vir:94 296 AFATLK-------PTE 304 (304) T ss_pred ceEEEE-------ecC Confidence 33211 111 No 40 >protein:vir:96123 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1602 # MgeName: 37 # Cross-refs: genbank:acc:YP_240078;genbank:gi:66395742;genbank:GeneID:5133103 Probab=82.98 E-value=0.072 Score=26.84 Aligned_cols=255 Identities=12% Similarity=0.041 Sum_probs=109.0 Q ss_pred ecCCC-C-Cccccccccc-----------cccccccccccccccccccccccCcccccccccccccccccccccccchhh Q lcl|NC_015288. 121 YENQA-G-EEALFNEPDA-----------GFTAGLDATTGAYTPRTGAGVGGDAEGNNPALLNDSSPGTYETPRGFSRED 187 (468) Q Consensus 121 Y~~qs-G-~EA~fnEa~t-----------~fSG~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~a~~g~~t~~~gm~Ta~ 187 (468) -.+.. . .+-+..|..+ -|++... .. +...+ ....+.+...--.+.+ T Consensus 1 ma~~~T~~~d~i~Pev~s~~v~~~~~~~~~~~~~~~-~~------------~~l~g--------~~G~tv~ip~~~~~g~ 59 (274) T protein:vir:96 1 MAQGTTKVSNLIVPEVLAPMMQAELDKKLRFAQFAD-ID------------STLVG--------QPGDTLTFPAFTYSGD 59 (274) T ss_pred CCccccchhhhhhhHHHHHHHHHHHHhhhhhccccc-cc------------ccccC--------CCCCEEEEEeeccCCC Confidence 11110 0 0111111100 0111000 00 00000 0001111111001122 Q ss_pred hhccCC-CCCccccceeEEEEEEEEeecccccceecHHHHHhHH-HhhCCChhHHHHHHHHHHHHHHhhHHHHHHHhhhc Q lcl|NC_015288. 188 LEQAGD-AGKLFREMSFSIEKTSVTAKSRALKAEYTLELAQDLK-AIHGLDAEQELANILSSEVLAEINREVVRRVYSVA 265 (468) Q Consensus 188 aE~lG~-~g~~f~EMaFsIeK~tVtAKSRaLKAEYTvELAQDLk-AiHGLDAE~ELanILStEImlEINREII~~l~~vA 265 (468) ++.... ...++.++.++ ..+++.|-|+-.-+++ |+. +..+-|.-.+..+-++..++.+++++|+..|.... T Consensus 60 ~~~~~~g~~i~~~~it~~--~~~~~i~~~~~~~~i~-----D~~~~~~~~d~~~~~~~~~~~~~a~~~d~~i~~~l~~a~ 132 (274) T protein:vir:96 60 AQVIAEGEKIPVDQIGTS--KREAKVRKIGKGTELT-----DEAVLSGFGDPQGEAVRQHGLAIANKVDNDVLEALKGAT 132 (274) T ss_pred ccccCCCCcCchhhcccc--eeEEEEEeeeceeeec-----HHHHHhhcchHHHHHHHHHHHHHHHHHHHHHHHHHhcCC Confidence 222211 12334444433 3444445554322333 332 23467899999999999999999999998875432 Q ss_pred chhhccccccceeeeeecCCcchhHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEchhHHHHHhhccccccccccccc Q lcl|NC_015288. 266 KPGAANNVANAGIFDLDVDSNGRWSVEKFKGLLFQIERDCNAIAQDTRRGKGNFLICSADVASALAMAGVLDYSSGLTGA 345 (468) Q Consensus 266 ~~~k~~~~~~~Gv~Dl~~~~~~rw~~e~~~~l~~~i~~ean~i~q~T~rg~~n~~v~S~~va~~L~~~G~~~~~~~~~~~ 345 (468) ... ..+.-| .+.+-..+.++.. .-...++++|+|.+++.|..-...++.+.... T Consensus 133 ~~~---------------~~~~~~-~d~i~dA~~~l~d---------~~~~~~~ivv~p~~~~~L~k~~~~~f~~~~~~- 186 (274) T protein:vir:96 133 LTV---------------EADITK-LDGLQTAIDKFND---------EDLEPMVLFVNPLDAGGLRTSASDNFTRPTQL- 186 (274) T ss_pred CCc---------------Cccccc-HHHHHHHHHHhcc---------cCCCceEEEeCHHHHHHHHhcccccccccccc- Confidence 211 111112 2222222222221 12467899999999999965443333322111 Q ss_pred cccccccccCCCceeEEEecCCeEEEEccccccCCCcceEEEEEecCCcccceeEEccccccccccccCCccccceeeee Q lcl|NC_015288. 346 GGPAIGTVDDTGNLAVGTINGRIKVYVDPYAANLSDKHYYVVGYKGTSPYDAGLFYCPYVPLQMVRSIDPNNFQPKIGFK 425 (468) Q Consensus 346 ~~~~~~~~D~t~~~~~G~l~~~~~vy~D~Ya~~~s~~dY~~vG~Kg~~~~d~glfyaPYv~~~~~~~~Dp~s~qP~~g~~ 425 (468) +. ....+-.+|++. |++|++| ++-|..=..+-=+|.-. |+.. .+...-.-=||.+++-.+-.. T Consensus 187 -g~-----~~~~~g~ig~~~-G~~Vi~s----~~~p~~t~~l~~~gA~~-----~~~~-~~~~vE~~Rd~~~~~d~i~~~ 249 (274) T protein:vir:96 187 -GD-----NIIVKGAFGEAL-GAVIVRS----NKLNKGEALLAKKGAVK-----LITK-RDFFLEKDRDASRKSTALYSD 249 (274) T ss_pred -cc-----cceeecccceec-CeeEEEc----CCCCcceEEEEeCccee-----eeec-CCcccccccchhhcccEEEEe Confidence 00 111122467774 6899999 55553221111122211 1100 011111112888899888888 Q ss_pred eecceee-cC--cccccCcc-cccCChhhhhhccCceeeeEEeeccC Q lcl|NC_015288. 426 TRYGMVS-NP--FVTTNGLY-SGTPDGETLTPSTNMYYRRVQVTNLM 468 (468) Q Consensus 426 tRY~l~~-nP--f~~~~~~~-~~~~~~~~~~~~~N~y~r~~~v~~~~ 468 (468) .+||+.+ || ........ .++ | T Consensus 250 ~~yg~~~~~~~~vv~~t~~~~~~~----------------------~ 274 (274) T protein:vir:96 250 KHYVAYLYDESKVVKITKGAGDEV----------------------M 274 (274) T ss_pred eEEEEEEEcCccEEEEEcCccccc----------------------C Confidence 8898865 55 22222221 111 1 No 41 >protein:vir:81227 Length: 413 # NCBI annotation: gp6, major capsid protein # Family: family:all:585 # MgeID: mge:1893 # MgeName: BFK20 # Cross-refs: genbank:acc:YP_001456736;genbank:gi:157168379;hssp:P49861;interpro:IPR006444;uniprot:Q9MBJ9;genbank:GeneID:5580350 Probab=82.70 E-value=0.075 Score=26.76 Aligned_cols=349 Identities=12% Similarity=0.014 Sum_probs=116.0 Q ss_pred CcchHHHHHhhhhhhcCCcc-----ccccchhhhhhh-------hhhhhhHHHH----Hhhhhhh--hhhccccccCccc Q lcl|NC_015288. 1 MFNAEHLQEKWSPVLNNEAA-----NPIADRYKKAVT-------SVLLENQERF----LREERGM--LQEVAVNSLGAGT 62 (468) Q Consensus 1 ~~~~~~l~~kw~p~l~~~~~-----~~i~~~~~~~~~-------~~llenq~~~----~~e~~~~--l~e~~~~~~g~~~ 62 (468) |...+++.++=.-+.+.... ....+.-+++.. ..+-++..+. ..+.+.. +.....+...... T Consensus 31 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 110 (413) T protein:vir:81 31 EDAKRERAKSVKANQDFLRELQEATAGSVDSEKSGELTRKGEGYKSIGEFFAKRAGDQIKQQAGGAQLNYSVGEYVAPRV 110 (413) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHhHHhHHHhhhHhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHhhhhhhhhhhhHH Confidence 22222222221111111000 000000000000 0000000000 0000000 0000000000000 Q ss_pred cccccc-cccccccccc----ccccceehhhhHHhhhhhhhhheeeeecCCccceeeeeeeeeecCCCCCcccccccccc Q lcl|NC_015288. 63 VSPGGS-ALGSANTAGL----AGFDPVLISLVRRAMPNLMAYDVCGVQPMSGPTGLIFAMRSRYENQAGEEALFNEPDAG 137 (468) Q Consensus 63 ~~~~~~-~~~st~tg~~----~~~~P~Lv~l~RRa~~~LIa~DI~GVQPmTGPTGLIFAMRsrY~~qsG~EA~fnEa~t~ 137 (468) ...... ...++++..- ..+.+.++.+ .-+..+..+++.|+||++++.-+.-.+. .....+ ... T Consensus 111 ~~~~~~~~~~~~~~~~~~~vp~~~~~~ii~~---~~~~~~l~~~~~~~~~~~~~~~~~~~~~-~~~~~~--------~a~ 178 (413) T protein:vir:81 111 KAASDPASTATLTDEFQGGYGTTWNRNIIYR---RREKLVVADLMDNLTMTNTTIKYLMEKA-NRVVEG--------GFK 178 (413) T ss_pred HhhhhhhhhcccccccccccchhhHHHHHHH---HhhhhhHHhhcceeeccCCceeEEEecc-cccccc--------ccc Confidence 000000 0111111111 1122334444 4456677899999999998653322110 000000 000 Q ss_pred ccccccccccccccccccccccCcccccccccccccccccccccccchhhhhccCCCC-CccccceeEEEEEEEEeeccc Q lcl|NC_015288. 138 FTAGLDATTGAYTPRTGAGVGGDAEGNNPALLNDSSPGTYETPRGFSREDLEQAGDAG-KLFREMSFSIEKTSVTAKSRA 216 (468) Q Consensus 138 fSG~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~a~~g~~t~~~gm~Ta~aE~lG~~g-~~f~EMaFsIeK~tVtAKSRa 216 (468) .. ++++...+.+ ..|.+..|.+.|.. T Consensus 179 ----------------------------------------~v------~Eg~~~~~~~~~~f~~i~~~~~k~~------- 205 (413) T protein:vir:81 179 ----------------------------------------TV------AEGGKKPYMRFADFDIVTESLSKIA------- 205 (413) T ss_pred ----------------------------------------ee------cCcccccccCcccceeeEeeeeeEE------- Confidence 00 0000001111 23555555555544 Q ss_pred ccceecHHHHHhHHHhhCCChhHHHHHHHHHHHHHHhhHHHHHHHhhhcchhhccccccceeeeeecCCc-chhHHHHHH Q lcl|NC_015288. 217 LKAEYTLELAQDLKAIHGLDAEQELANILSSEVLAEINREVVRRVYSVAKPGAANNVANAGIFDLDVDSN-GRWSVEKFK 295 (468) Q Consensus 217 LKAEYTvELAQDLkAiHGLDAE~ELanILStEImlEINREII~~l~~vA~~~k~~~~~~~Gv~Dl~~~~~-~rw~~e~~~ 295 (468) -....|-||.+|-- +.++.|.+-|+..|..-+|+.||.- . | .+-...|++....... .-..... T Consensus 206 ~~~~iS~ell~ds~-----~l~~~i~~~la~~~~~~~d~~~l~G----~--G--~~~~~~Gi~~~~~~~~~~~~~~~~-- 270 (413) T protein:vir:81 206 GLTKITDEMIEDYD-----FLVSYINARLLEELAIEEERQLLLG----D--G--TGNNLTGLLKRDGIQTLAVSNKDE-- 270 (413) T ss_pred EeehhhHHHHHHHH-----HHHHHHHHHHHHHHHHHHHHHHhcc----C--C--CCCcccccccccccccccccccch-- Confidence 44568899999862 2577788888888888888777641 1 1 1111234433211110 0000011 Q ss_pred HHHHHHHHHHHHHHHHhccCCccEEEEchhHHHHHhhccccccccc-cccccccccccccCCCceeEEEecCCeEEEEcc Q lcl|NC_015288. 296 GLLFQIERDCNAIAQDTRRGKGNFLICSADVASALAMAGVLDYSSG-LTGAGGPAIGTVDDTGNLAVGTINGRIKVYVDP 374 (468) Q Consensus 296 ~l~~~i~~ean~i~q~T~rg~~n~~v~S~~va~~L~~~G~~~~~~~-~~~~~~~~~~~~D~t~~~~~G~l~~~~~vy~D~ 374 (468) ++.....+-..+..-..+..+-+|+++.....|.. +..+.+ ..-+.+.... .-+.+....++|. +++|+++. T Consensus 271 --~~~~i~~~~~~~~~~~~~~~~~~vmn~~~~~~l~~---lkd~~G~~l~~~~~~~~-~~~~~~~~~~~l~-G~pv~~s~ 343 (413) T protein:vir:81 271 --LADSIYKAMTNISLATPFQADALVINPLDYQELRL---AKDANGQYYGGGVFQGQ-YGSGGIMLDPAPW-GLRTVQSQ 343 (413) T ss_pred --hHHHHHHHHHHhhhhccCCCcEEEEcHHHHHHHHH---hhccCCceecccccccc-ccccccccCceec-ceeeEEcC Confidence 12121122222222234455667889988877742 111111 0000000000 0001111234565 56888875 Q ss_pred ccccCCCcceEEEEE-ecC-Cccc---ceeEEccccccccccccCCccccceeeeeeecceee-cCcccccCcccccCCh Q lcl|NC_015288. 375 YAANLSDKHYYVVGY-KGT-SPYD---AGLFYCPYVPLQMVRSIDPNNFQPKIGFKTRYGMVS-NPFVTTNGLYSGTPDG 448 (468) Q Consensus 375 Ya~~~s~~dY~~vG~-Kg~-~~~d---~glfyaPYv~~~~~~~~Dp~s~qP~~g~~tRY~l~~-nPf~~~~~~~~~~~~~ 448 (468) +.. ..-+++|- +.. .-.+ -.+=..+|... +-.+.|=.+=...||+..+ +| T Consensus 344 ~~~----~~~~~~gd~~~~~~~~~~~~~~v~~~~~~~~------~~~~~~~~~r~~~r~d~~~~~~-------------- 399 (413) T protein:vir:81 344 VVP----VGKPVVGAFRSAASVLRKGGVRIDSTNTNVD------DFENNLITVRAEERVGLMVTFP-------------- 399 (413) T ss_pred CCC----cccEEEEecccEEEEEEecceEEEEeccccc------hhhcCcEEEEEEEeeccEEecc-------------- Confidence 532 22233332 210 0000 01111111110 1123344555556776544 33 Q ss_pred hhhhhccCceeeeEEeeccC Q lcl|NC_015288. 449 ETLTPSTNMYYRRVQVTNLM 468 (468) Q Consensus 449 ~~~~~~~N~y~r~~~v~~~~ 468 (468) ..|+++.++..- T Consensus 400 --------~a~~~l~~~~~~ 411 (413) T protein:vir:81 400 --------EAIVQLDVAEVV 411 (413) T ss_pred --------cceEEEEecCCC Confidence 011222222222 No 42 >protein:vir:3845 Length: 395 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:322 # MgeName: phi adh # Cross-refs: genbank:acc:NP_050151;swissprot:trembl:q9t1f6;genbank:gi:9633043;uniprot:Q9T1F6;genbank:GeneID:1262163 Probab=82.55 E-value=0.076 Score=26.72 Aligned_cols=338 Identities=11% Similarity=0.043 Sum_probs=127.0 Q ss_pred cchHHHHHhhhhhhcCCccccccchhhhhhhh-------hhh------hhHHHHHhhhhhhhhhcc-----------ccc Q lcl|NC_015288. 2 FNAEHLQEKWSPVLNNEAANPIADRYKKAVTS-------VLL------ENQERFLREERGMLQEVA-----------VNS 57 (468) Q Consensus 2 ~~~~~l~~kw~p~l~~~~~~~i~~~~~~~~~~-------~ll------enq~~~~~e~~~~l~e~~-----------~~~ 57 (468) |+.++|+++|+-+.+. +.++.+.-++.... ..+ ..+.+.+.+......+.. .+. T Consensus 1 M~~~eL~~~~~~~~~~--~~~l~e~~~~~~~~~~~~~~~~~~ee~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 78 (395) T protein:vir:38 1 MNINQLKDAFDMAGQK--VQDLEDKRAQFAIDLGNDASSHSVDDINKLNASLKNAKMAQELAKSAYEDARANLNAEPVNK 78 (395) T ss_pred CCHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccc Confidence 8999999999887542 22222211111100 000 011111111111000000 000 Q ss_pred cCcc------------c---ccccccccccc-cccccccccceeh--hhhHHhhhhhhhhheeeeecCCccceeeeeeee Q lcl|NC_015288. 58 LGAG------------T---VSPGGSALGSA-NTAGLAGFDPVLI--SLVRRAMPNLMAYDVCGVQPMSGPTGLIFAMRS 119 (468) Q Consensus 58 ~g~~------------~---~~~~~~~~~st-~tg~~~~~~P~Lv--~l~RRa~~~LIa~DI~GVQPmTGPTGLIFAMRs 119 (468) .... . .........++ ++++-...=|.-+ .+++...+..+..+++.++||++++|-+-=.+ T Consensus 79 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~- 157 (395) T protein:vir:38 79 KPLPVKDGKPDAQAMKNQFVKDFKNLVTSGTTGTGNAGLTIPEDIQLQIRTLTRSFTSLESLANVENVTTSHGSRVYEK- 157 (395) T ss_pred cccchhhhhHHHHHHHHHHHHHHHHHHhhccCccCCCceecchhHhhHHHHHHHhhcchhhhcceeeccCCcceEEEEe- Confidence 0000 0 00000111111 1111111112221 24444445667888899999999988642111 Q ss_pred eecCCCCCccccccccccccccccccccccccccccccccCcccccccccccccccccccccccchhhhhccCC-CCCcc Q lcl|NC_015288. 120 RYENQAGEEALFNEPDAGFTAGLDATTGAYTPRTGAGVGGDAEGNNPALLNDSSPGTYETPRGFSREDLEQAGD-AGKLF 198 (468) Q Consensus 120 rY~~qsG~EA~fnEa~t~fSG~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~a~~g~~t~~~gm~Ta~aE~lG~-~g~~f 198 (468) -.+..+ .+ .| .++.+...+ ....| T Consensus 158 -~~~~~~-~a-------~~----------------------------------------------v~E~~~~~~~~~~~f 182 (395) T protein:vir:38 158 -LADITP-LK-------DL----------------------------------------------DDESALIGDNDDPEL 182 (395) T ss_pred -eccCCc-cc-------cc----------------------------------------------cccccccccccccce Confidence 000000 00 00 000000010 11235 Q ss_pred ccceeEEEEEEEEeecccccceecHHHHHhHHHhhCCChhHHHHHHHHHHHHHHhhHHHHHHHhhhcchhhcccccccee Q lcl|NC_015288. 199 REMSFSIEKTSVTAKSRALKAEYTLELAQDLKAIHGLDAEQELANILSSEVLAEINREVVRRVYSVAKPGAANNVANAGI 278 (468) Q Consensus 199 ~EMaFsIeK~tVtAKSRaLKAEYTvELAQDLkAiHGLDAE~ELanILStEImlEINREII~~l~~vA~~~k~~~~~~~Gv 278 (468) .+..|+..|..+ ...+|-||.+|- +.|-++.|.+-|+..|..-||+.||.-.=+ +....|. T Consensus 183 ~~v~~~~~k~~~-------~~~iS~ell~ds----~~~l~~~i~~~la~~~~~~~~~~il~g~g~--------~~~~~~~ 243 (395) T protein:vir:38 183 TVVKYLIHRYAG-------ITTVTNTLLKDT----VDNIIQWLVNWAAKKDVVTRNAKILEVMGK--------APKKPTI 243 (395) T ss_pred eeEEeeeeeeEe-------ehhhHHHHHhhh----HHHHHHHHHHHHHHHHHHHHHHHHhhcccc--------ccccccc Confidence 555555555554 455999999983 356788888888888888888888752111 1111122 Q ss_pred eeeecCCcchhHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEchhHHHHHhhccccccccccccccccccccccCCCc Q lcl|NC_015288. 279 FDLDVDSNGRWSVEKFKGLLFQIERDCNAIAQDTRRGKGNFLICSADVASALAMAGVLDYSSGLTGAGGPAIGTVDDTGN 358 (468) Q Consensus 279 ~Dl~~~~~~rw~~e~~~~l~~~i~~ean~i~q~T~rg~~n~~v~S~~va~~L~~~G~~~~~~~~~~~~~~~~~~~D~t~~ 358 (468) .++ +....++ +.....--+. ...+||++.....|.. +..+.+ + .....+. .. T Consensus 244 ~~~----------~~i~~~~-------~~~l~~~~~~-~a~~v~n~~~~~~L~~---lkd~~G---~---~l~~~~~-~~ 295 (395) T protein:vir:38 244 SQF----------DNIKDLE-------NNTLDPAIES-TSSFITNQSGYNILSK---VKDADG---R---YLMQPDV-TS 295 (395) T ss_pred ccH----------HHHHHHH-------HHhhhhhhcC-CCEEEEcHHHHHHHHH---hhccCC---c---eeeccCc-CC Confidence 111 1112221 2111111222 2347899999888853 222211 0 0001111 11 Q ss_pred eeEEEecCCeEEEEcccc--ccCCCcceEEEE---------EecCCcccceeEEccccccccccccCCccccceeeeeee Q lcl|NC_015288. 359 LAVGTINGRIKVYVDPYA--ANLSDKHYYVVG---------YKGTSPYDAGLFYCPYVPLQMVRSIDPNNFQPKIGFKTR 427 (468) Q Consensus 359 ~~~G~l~~~~~vy~D~Ya--~~~s~~dY~~vG---------~Kg~~~~d~glfyaPYv~~~~~~~~Dp~s~qP~~g~~tR 427 (468) ...++|. +++|++.... ....+..-+++| .+.. ..+=+.++. ..+-...+=.+-+..| T Consensus 296 ~~~~~l~-G~pV~~~~~~~~~~~~~~~~i~~gd~~~~~~i~~~~~----~~i~~~~~~------~~~~~~~~~~~r~~~r 364 (395) T protein:vir:38 296 PDKYLID-GKPVIRIADKWLPDVSGSHPLYFGDLKQGITLFDRQQ----MQIDTTNVG------AGSFEHDTTKLRFIDR 364 (395) T ss_pred CCcceec-cceeEEecccccCcCCCcceEEEEeccccEEEEEecc----eEEEEeccc------cchhhcCceEEEEEEe Confidence 1224554 4566654211 000011112222 1111 011111110 0111233345556667 Q ss_pred cceee-cC--cccccCcccccCChhhhhhcc Q lcl|NC_015288. 428 YGMVS-NP--FVTTNGLYSGTPDGETLTPST 455 (468) Q Consensus 428 Y~l~~-nP--f~~~~~~~~~~~~~~~~~~~~ 455 (468) |+..+ +| |+..+-.............++ T Consensus 365 ~d~~~~~~~a~~~~~~~~~~~~~~~~~~~~~ 395 (395) T protein:vir:38 365 FDVQLIDDGAFAAASFKTVANQAQGTAGTGK 395 (395) T ss_pred eccEEecccceEEEEeecccCCCCCccCCCC Confidence 76654 24 443211111011111111122 No 43 >protein:vir:99749 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1497 # MgeName: phiETA2 # Cross-refs: genbank:acc:YP_001004307;genbank:gi:122891761;genbank:GeneID:4712304 Probab=81.78 E-value=0.083 Score=26.52 Aligned_cols=299 Identities=11% Similarity=0.069 Sum_probs=121.7 Q ss_pred hhhhhhhHHHHHhhhhhhhhhccccccCcccccccccccccccccccccccceehhhhHHhhhhhhhhheeeeecCCccc Q lcl|NC_015288. 32 TSVLLENQERFLREERGMLQEVAVNSLGAGTVSPGGSALGSANTAGLAGFDPVLISLVRRAMPNLMAYDVCGVQPMSGPT 111 (468) Q Consensus 32 ~~~llenq~~~~~e~~~~l~e~~~~~~g~~~~~~~~~~~~st~tg~~~~~~P~Lv~l~RRa~~~LIa~DI~GVQPmTGPT 111 (468) |.+- ||.+..+++-...+-+ .+.+++.. ..++++++..--....-.+++.+..+.+-.+++.+.||++.+ T Consensus 1 ~~k~-~~~~~~~~~~~~~~~~--~~~~~a~~-------~~~~~~~~~lip~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~ 70 (324) T protein:vir:99 1 MEQT-QKLKLNLQHFASNNVK--PQVFNPDN-------VMMHEKKDGTLLNDFTTPILQEVMENSKIMRLGKYEPMEGTE 70 (324) T ss_pred CCCc-hHhhHHHHHHHHHhhh--hhhccccc-------eeccCCCcceechhHHHHHHHHHHhhchhhhhcceeeccCCc Confidence 2111 1111112211111111 11222111 111111111000111122334444556678889999988765 Q ss_pred eeeeeeeeeecCCCCCccccccccccccccccccccccccccccccccCcccccccccccccccccccccccchhhhhcc Q lcl|NC_015288. 112 GLIFAMRSRYENQAGEEALFNEPDAGFTAGLDATTGAYTPRTGAGVGGDAEGNNPALLNDSSPGTYETPRGFSREDLEQA 191 (468) Q Consensus 112 GLIFAMRsrY~~qsG~EA~fnEa~t~fSG~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~a~~g~~t~~~gm~Ta~aE~l 191 (468) .-|. . +.. +.++ .| .+| T Consensus 71 ~~~p-~---~~~--~~~a-------~~------------------------------------------------v~E-- 87 (324) T protein:vir:99 71 KKFT-F---WAD--KPGA-------YW------------------------------------------------VGE-- 87 (324) T ss_pred eEEE-E---Eec--Ccce-------eE------------------------------------------------ecc-- Confidence 3321 1 110 0000 00 001 Q ss_pred CCCCCccccceeEEEEEEEEeecccccceecHHHHHhHHHhhCCChhHHHHHHHHHHHHHHhhHHHHHHHhhhcchhhcc Q lcl|NC_015288. 192 GDAGKLFREMSFSIEKTSVTAKSRALKAEYTLELAQDLKAIHGLDAEQELANILSSEVLAEINREVVRRVYSVAKPGAAN 271 (468) Q Consensus 192 G~~g~~f~EMaFsIeK~tVtAKSRaLKAEYTvELAQDLkAiHGLDAE~ELanILStEImlEINREII~~l~~vA~~~k~~ 271 (468) +..+++...++++++.+.|.-+--...|-||.+|-. .|.+++|.+.|+..|...+++.||.--- . T Consensus 88 ---g~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~----~~l~~~i~~~l~~ai~~~~d~~~l~G~g--------~ 152 (324) T protein:vir:99 88 ---GQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTY----SQFFEEMKPMIAEAFYKKFDEAGILNQG--------N 152 (324) T ss_pred ---CccccccccceeEEEEeeEEEEEeehhhHHHHhcch----HHHHHHHHHHHHHHHHHHHHHHhhhcCC--------C Confidence 233455555666667777666666789999999974 4689999999999999999999985211 1 Q ss_pred ccccceeeeeecCCc----chhHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEchhHHHHHhhccccccccccccccc Q lcl|NC_015288. 272 NVANAGIFDLDVDSN----GRWSVEKFKGLLFQIERDCNAIAQDTRRGKGNFLICSADVASALAMAGVLDYSSGLTGAGG 347 (468) Q Consensus 272 ~~~~~Gv~Dl~~~~~----~rw~~e~~~~l~~~i~~ean~i~q~T~rg~~n~~v~S~~va~~L~~~G~~~~~~~~~~~~~ 347 (468) +..+.|++....... +.-..+... ++-... ...-...+.+|+|+.....|... ...- ++. T Consensus 153 ~~~~~~~~~~~~~~~~~~~~~~~~~~i~--------~~~~~l-~~~~~~~~~~v~n~~~~~~L~~l---~d~~---g~~- 216 (324) T protein:vir:99 153 NPFGKSIAQSIEKTNKVIKGDFTQDNII--------DLEALL-EDDELEANAFISKTQNRSLLRKI---VDPE---TKE- 216 (324) T ss_pred CccCccccccccccceeccccCCHHHHH--------HHHHhh-hhccCCCCEEEEcHHHHHHHHHh---hcCC---Cce- Confidence 111122222111100 111112221 222221 22234555689999999888642 1111 110 Q ss_pred cccccccCCCceeEEEecCCeEEEEccccccCCCcceEEEEEecCCcccceeEEccccccc--------cccccCCc--- Q lcl|NC_015288. 348 PAIGTVDDTGNLAVGTINGRIKVYVDPYAANLSDKHYYVVGYKGTSPYDAGLFYCPYVPLQ--------MVRSIDPN--- 416 (468) Q Consensus 348 ~~~~~~D~t~~~~~G~l~~~~~vy~D~Ya~~~s~~dY~~vG~Kg~~~~d~glfyaPYv~~~--------~~~~~Dp~--- 416 (468) ... +.+ .++|.| ++|++.+.+. .+...+++|-... +++..--... .....|+. T Consensus 217 --~~~-~~~----~~~l~G-~PVv~~~~~~--~~~~~~i~gd~~~------~~~~~~~~~~i~~~~~~~~~~~~~~~~~~ 280 (324) T protein:vir:99 217 --RIY-DRN----SDTLDG-LPVVNLKSSN--LKRGELITGDFDK------LIYGIPQLIEYKIDETAQLSTVKNEDGTP 280 (324) T ss_pred --eec-CCC----Cccccc-eeEEeecCCC--CCcceEEEEeccc------EEEEEecCcEEEEeecccccccccccccc Confidence 000 111 134544 6777765432 1222344332211 1111111111 11111111 Q ss_pred -----cccceeeeeeeccee-ecC--ccccc--CcccccCChhhh Q lcl|NC_015288. 417 -----NFQPKIGFKTRYGMV-SNP--FVTTN--GLYSGTPDGETL 451 (468) Q Consensus 417 -----s~qP~~g~~tRY~l~-~nP--f~~~~--~~~~~~~~~~~~ 451 (468) +-|=.+=...|++.. .|| |+... +-....+.++ + T Consensus 281 ~~~f~~~~~~~r~~~r~d~~v~~~~a~~~lt~a~~~~~~~~~~-~ 324 (324) T protein:vir:99 281 VNLFEQDMVALRATMHVALHIADDKAFAKLVPADKKTDSVPGE-V 324 (324) T ss_pred hhhhhcCcEEEEEEEEEccEEecccceEEEEeccCCCCCCCCC-C Confidence 112223334667643 455 44321 1111112222 1 No 44 >protein:vir:4092 Length: 390 # NCBI annotation: major capsid protein a # Family: family:all:635 # MgeID: mge:86 # MgeName: 2389 # Cross-refs: genbank:acc:NP_510986;swissprot:trembl:q8w604;genbank:gi:17488508;uniprot:Q8W604;genbank:GeneID:1260361 Probab=81.68 E-value=0.083 Score=26.50 Aligned_cols=346 Identities=13% Similarity=0.067 Sum_probs=119.8 Q ss_pred CcchHHHHHhhhhhhcCCccccccchhhhhhhhhhhhhHHHHHh-----hhhh-----hhhhccccccCccccc------ Q lcl|NC_015288. 1 MFNAEHLQEKWSPVLNNEAANPIADRYKKAVTSVLLENQERFLR-----EERG-----MLQEVAVNSLGAGTVS------ 64 (468) Q Consensus 1 ~~~~~~l~~kw~p~l~~~~~~~i~~~~~~~~~~~llenq~~~~~-----e~~~-----~l~e~~~~~~g~~~~~------ 64 (468) |=+-+++++|..-+-..+ +-.+.+.-+..-....+++..+.+. +.+. ..+.......|...-. T Consensus 1 ik~L~e~~~e~~e~~~~~-~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~r~~ 79 (390) T protein:vir:40 1 MNNLDKKDSETLNISTAF-LNAIKEGATEAEQVTAFTNMAEQIQNNIIAQARKEVNREMNDNNVLASRGANALTSDESKY 79 (390) T ss_pred CchHHHHHHHHHHHHHHH-HHHHhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCchhccHHHHHH Confidence 444444444433321110 0001111100000111111111110 0000 0000000000000000 Q ss_pred ccccccccccc-cccccccceehh-hhHHhhhhhhhhheeeeecCCccceeeeeeeeeecCCCCCccccccccccccccc Q lcl|NC_015288. 65 PGGSALGSANT-AGLAGFDPVLIS-LVRRAMPNLMAYDVCGVQPMSGPTGLIFAMRSRYENQAGEEALFNEPDAGFTAGL 142 (468) Q Consensus 65 ~~~~~~~st~t-g~~~~~~P~Lv~-l~RRa~~~LIa~DI~GVQPmTGPTGLIFAMRsrY~~qsG~EA~fnEa~t~fSG~~ 142 (468) ......+.+++ |+. -.-+.+.. ++++.-..-+-.+++-+.||++....|... .. . .++. |-+. T Consensus 80 ~~~~~~~~~~~~gg~-lvP~~~~~~I~~~~~~~s~i~~~~~~~~~~~~~~~i~~~----~~-~-~~a~-------~~~E- 144 (390) T protein:vir:40 80 YNEVIAGNGFAGVTA-LLPPTVFERVFEDLTVEHPLLSKINFVNTTATTEWIISV----GD-V-ATAW-------WGPL- 144 (390) T ss_pred HHHHHhccCcccCcc-cccHHHHHHHHHHHHhhhhhhhhceeeecCCceeEEEEE----cC-C-ccee-------eecc- Confidence 00000111111 111 00111111 222222333456789999998865554311 11 0 0010 0000 Q ss_pred cccccccccccccccccCcccccccccccccccccccccccchhhhhccCCCCCccccceeEEEEEEEEeecccccceec Q lcl|NC_015288. 143 DATTGAYTPRTGAGVGGDAEGNNPALLNDSSPGTYETPRGFSREDLEQAGDAGKLFREMSFSIEKTSVTAKSRALKAEYT 222 (468) Q Consensus 143 ~~~~~~~~~~~~~~~~~~~~g~~~~~~~~a~~g~~t~~~gm~Ta~aE~lG~~g~~f~EMaFsIeK~tVtAKSRaLKAEYT 222 (468) .++.-.+....|.+..|++.|..+- ...| T Consensus 145 --------------------------------------------~~~~~~~~~~~f~~i~l~~~k~~~~-------i~iS 173 (390) T protein:vir:40 145 --------------------------------------------CAEIKEVLDNGFDKIQTGMYKLSAY-------IPVC 173 (390) T ss_pred --------------------------------------------ccccCccccccceeeEeeeeeEEEe-------ehhh Confidence 0000001124577777877777653 4578 Q ss_pred HHHHHhHHHhhCCChhHHHHHHHHHHHHHHhhHHHHHHHhhhcchhhccccccceeeee-e--------cCCcchhHHHH Q lcl|NC_015288. 223 LELAQDLKAIHGLDAEQELANILSSEVLAEINREVVRRVYSVAKPGAANNVANAGIFDL-D--------VDSNGRWSVEK 293 (468) Q Consensus 223 vELAQDLkAiHGLDAE~ELanILStEImlEINREII~~l~~vA~~~k~~~~~~~Gv~Dl-~--------~~~~~rw~~e~ 293 (468) -||.+|-- .|.|++|.+.|+..|..-+|+.||.- .-.+ .+.|++-- . ....+-..-.. T Consensus 174 ~ell~ds~----~~l~~~i~~~la~~i~~~~~~a~l~G--------~G~~-~P~Gil~~~~~~~~~~~~~~~~~~~t~~~ 240 (390) T protein:vir:40 174 NAMLDLGP----SWLDQYVRTILGEAMALGLEAGIVNG--------SGKD-QPIGMMRDLNNVTAGEHPVKTATPLTDLT 240 (390) T ss_pred HHHHhcch----HHHHHHHHHHHHHHHHHHHHhhhhcc--------cCCC-ccceeeeccccccccccccccccccchhh Confidence 89999864 47899999999999999999999862 1000 12222210 0 00000000011 Q ss_pred HHHHHHHHHHHHHHHHHHhccCCccEEEEchhHHHHHhhccccccccccccccccccccccCCCceeEEEecCCeEEEEc Q lcl|NC_015288. 294 FKGLLFQIERDCNAIAQDTRRGKGNFLICSADVASALAMAGVLDYSSGLTGAGGPAIGTVDDTGNLAVGTINGRIKVYVD 373 (468) Q Consensus 294 ~~~l~~~i~~ean~i~q~T~rg~~n~~v~S~~va~~L~~~G~~~~~~~~~~~~~~~~~~~D~t~~~~~G~l~~~~~vy~D 373 (468) ...++..+..-......+. .+++.|++-....+..|...-++.. .+|....+.+.-+++|+++ T Consensus 241 ~~~~~~~l~~~~~~~~~~~-~~~a~~i~n~~t~~~~l~~~~~~~d----------------~~G~~v~~~~~~g~pvv~~ 303 (390) T protein:vir:40 241 PATLATKVMLPLTDNGKKS-VSDAILVINPADYWSKIYAATSYMT----------------PQGVWVTGILPVPLEIVQS 303 (390) T ss_pred HHHHHHHHHHHhhcchhhh-hcCceEEEcchhHHHHHHHHhhccC----------------CCCccccccCCCceeEEEc Confidence 1122232322222222222 2345555444445555643222322 1222212222236788777 Q ss_pred cccccC----CCcceEEEEEecCCcccceeEEcccc--ccc----------cccccCCccccceeeeeeecc-eeecCcc Q lcl|NC_015288. 374 PYAANL----SDKHYYVVGYKGTSPYDAGLFYCPYV--PLQ----------MVRSIDPNNFQPKIGFKTRYG-MVSNPFV 436 (468) Q Consensus 374 ~Ya~~~----s~~dY~~vG~Kg~~~~d~glfyaPYv--~~~----------~~~~~Dp~s~qP~~g~~tRY~-l~~nPf~ 436 (468) .++... -++.++++|-.+....+.+ ++. .-+ -...+||++|. ++=++.==| -.+.||. T Consensus 304 ~~~p~~~i~~Gd~s~~~i~~~~~~~v~~~----~~~~f~~~~~~~r~~~r~dg~v~~~~A~~-~l~~~~~~~~~~~~~~~ 378 (390) T protein:vir:40 304 VAVPVGKAVAGRAKDYFMGIGSEQVIRTS----TEYRLLDDETLYYAKQYANGRPKDNSSFL-VFDITGLEGSPAIDVNV 378 (390) T ss_pred CCCCCCcEEEEeeceEEEEeecceEEEec----chhhhhcCcEEEEEEEEeCCEEecccceE-EEEeeccCCCCCCCcce Confidence 554320 1111222333332222211 110 000 01112555554 111111101 1344555 Q ss_pred cccCcccccCChh Q lcl|NC_015288. 437 TTNGLYSGTPDGE 449 (468) Q Consensus 437 ~~~~~~~~~~~~~ 449 (468) ....-++..+ ++ T Consensus 379 ~~~~~~~~~~-~~ 390 (390) T protein:vir:40 379 VNNATPSETP-AE 390 (390) T ss_pred eeCCCCCCCC-CC Confidence 5433333211 11 No 45 >protein:vir:104085 Length: 320 # NCBI annotation: gp17 # Family: family:all:507 # MgeID: mge:1656 # MgeName: Che12 # Cross-refs: genbank:acc:YP_655596;genbank:gi:109392467;genbank:GeneID:4156953 Probab=81.23 E-value=0.088 Score=26.38 Aligned_cols=293 Identities=10% Similarity=0.064 Sum_probs=116.1 Q ss_pred HhhhhhhhhhccccccCcccccccccccccccccccc-cccceehh-hhHHhhhhhhhhheeeeecCCccceeeeeeeee Q lcl|NC_015288. 43 LREERGMLQEVAVNSLGAGTVSPGGSALGSANTAGLA-GFDPVLIS-LVRRAMPNLMAYDVCGVQPMSGPTGLIFAMRSR 120 (468) Q Consensus 43 ~~e~~~~l~e~~~~~~g~~~~~~~~~~~~st~tg~~~-~~~P~Lv~-l~RRa~~~LIa~DI~GVQPmTGPTGLIFAMRsr 120 (468) .++...+=.|. ....+|+++... ..-|.+.. +++.+....+-.+++.+.||++.+.-|.- T Consensus 1 ~~~~~~~~~~~--------------~~~~~t~~~~~~~~ip~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~p~---- 62 (320) T protein:vir:10 1 MAAGTAFQVDH--------------AQIAQTGDTMFKGYLEPEQAKDYFAEAEKTSIVQQFAQKVPMGTTGQKIPH---- 62 (320) T ss_pred CCCCccCCHHH--------------HHhhccccccccccccHHHHHHHHHHHHhccchhhhcceeeccCCceEEEE---- Confidence 11111110000 000111111111 11222221 33334445567888899999876533221 Q ss_pred ecCCCCCccccccccccccccccccccccccccccccccCcccccccccccccccccccccccchhhhhccCCCCCcccc Q lcl|NC_015288. 121 YENQAGEEALFNEPDAGFTAGLDATTGAYTPRTGAGVGGDAEGNNPALLNDSSPGTYETPRGFSREDLEQAGDAGKLFRE 200 (468) Q Consensus 121 Y~~qsG~EA~fnEa~t~fSG~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~a~~g~~t~~~gm~Ta~aE~lG~~g~~f~E 200 (468) .. ++.++ .|- +| +..+++ T Consensus 63 ~~--~~~~a-------~~v------------------------------------------------~E-----~~~~~~ 80 (320) T protein:vir:10 63 WI--GDVSA-------QWI------------------------------------------------GE-----GDMKPI 80 (320) T ss_pred Ee--CCcce-------EEe------------------------------------------------cC-----Cccccc Confidence 11 01000 000 00 122334 Q ss_pred ceeEEEEEEEEeecccccceecHHHHHhHHHhhCCChhHHHHHHHHHHHHHHhhHHHHHHHhhhcchhhccccccceeee Q lcl|NC_015288. 201 MSFSIEKTSVTAKSRALKAEYTLELAQDLKAIHGLDAEQELANILSSEVLAEINREVVRRVYSVAKPGAANNVANAGIFD 280 (468) Q Consensus 201 MaFsIeK~tVtAKSRaLKAEYTvELAQDLkAiHGLDAE~ELanILStEImlEINREII~~l~~vA~~~k~~~~~~~Gv~D 280 (468) -..++++++...|..+-...+|-||.+|-. .|.++.|.+.|...|...+|+-+|.-= ..+...+. .+..+ T Consensus 81 ~~~~f~~v~~~~~k~~~~~~is~ell~ds~----~~l~~~i~~~l~~a~a~~~d~a~l~G~----g~~~~~~~--~~~~~ 150 (320) T protein:vir:10 81 TKGNMTSQNIAPHKIATIFVASAETVRANP----ANYLGTMRTKVATAFAMAFDSAALNGT----DSPFPTYL--AQTTK 150 (320) T ss_pred cccceeEEEEeeEEEEEeehhhHHHHhcCh----HHHHHHHHHHHHHHHHHHHHHHhhccc----CCCCCccc--ccccc Confidence 444456666666777777789999999865 468888888888888888888886410 01110000 01100 Q ss_pred ee---c----CCcchhHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEchhHHHHHhhccccccccc-ccccccccccc Q lcl|NC_015288. 281 LD---V----DSNGRWSVEKFKGLLFQIERDCNAIAQDTRRGKGNFLICSADVASALAMAGVLDYSSG-LTGAGGPAIGT 352 (468) Q Consensus 281 l~---~----~~~~rw~~e~~~~l~~~i~~ean~i~q~T~rg~~n~~v~S~~va~~L~~~G~~~~~~~-~~~~~~~~~~~ 352 (468) .. . ..+.-+..+. .+. .+... ..........+|++++....|.. +..+.+ ...+.. .. T Consensus 151 ~~~~~~~~~~~~~~~~~~~~---~~~----~~~~~-~~~~~~~~~~~v~n~~~~~~L~~---lkd~~G~~l~~~~---~~ 216 (320) T protein:vir:10 151 SVSLADPGGATASDLTAYDA---VAV----NGLSL-LVNAKKKWTHTLLDDIVEPILNG---AKDKNGRPLFIES---TY 216 (320) T ss_pred cccceecccccccccccHHH---HHH----HHHhh-hhcccCCCcEEEEcHHHHHHHHH---hhccCCceeeccc---cc Confidence 00 0 0001111111 111 11111 12223345578999999998853 222211 000000 00 Q ss_pred ccCCCceeEEEecCCeEEEEccccccCC------CcceEEEEEecCCcccceeEEccccccccccccCCcc-----c--- Q lcl|NC_015288. 353 VDDTGNLAVGTINGRIKVYVDPYAANLS------DKHYYVVGYKGTSPYDAGLFYCPYVPLQMVRSIDPNN-----F--- 418 (468) Q Consensus 353 ~D~t~~~~~G~l~~~~~vy~D~Ya~~~s------~~dY~~vG~Kg~~~~d~glfyaPYv~~~~~~~~Dp~s-----~--- 418 (468) ........-+++ .+++|+++..+.... ++.++++|..+..+++-+ -+.......|+.. | T Consensus 217 ~~~~~~~~~~~i-~g~pv~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~i~~~------~~~~~~~~~~~~~~~~~~f~~~ 289 (320) T protein:vir:10 217 TDENSPFRAGRI-VSRPTILSDHVADGTTVGYMGDFRNVIWGQVGGLSFDVT------DQATLNLGTPTEPNFVSLWQHN 289 (320) T ss_pred cCccccccCcee-eeeeeEecCCCCCCceEEEEeecceEEEEEecCeEEEEe------ecceeeeccccccccchhhhcC Confidence 111122223444 367788876543211 122233344333222100 0000111111111 1 Q ss_pred cceeeeeeeccee-ecC--cccccCcccccCCh Q lcl|NC_015288. 419 QPKIGFKTRYGMV-SNP--FVTTNGLYSGTPDG 448 (468) Q Consensus 419 qP~~g~~tRY~l~-~nP--f~~~~~~~~~~~~~ 448 (468) |=.+=...|++.. .+| |+....-.+ |+. T Consensus 290 ~~~~r~~~~~d~~v~~~~a~~~l~~~~a--p~~ 320 (320) T protein:vir:10 290 LVAVRVEAEYAFHNNDKDAFVKLTNVVT--PDA 320 (320) T ss_pred cEEEEEEEeeccEEecccceEEEEeccC--CCC Confidence 1122233566543 344 443332222 443 No 46 >protein:vir:96262 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1612 # MgeName: ROSA # Cross-refs: genbank:acc:YP_240311;genbank:gi:66395978;genbank:GeneID:5133339 Probab=80.56 E-value=0.094 Score=26.22 Aligned_cols=260 Identities=15% Similarity=0.069 Sum_probs=104.8 Q ss_pred CCc-cceeeeeeeeeecCCCCCccccc---c---ccccccccccccccccccccccccccCccccccccccccccccccc Q lcl|NC_015288. 107 MSG-PTGLIFAMRSRYENQAGEEALFN---E---PDAGFTAGLDATTGAYTPRTGAGVGGDAEGNNPALLNDSSPGTYET 179 (468) Q Consensus 107 mTG-PTGLIFAMRsrY~~qsG~EA~fn---E---a~t~fSG~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~a~~g~~t~ 179 (468) |.. .|- -.+.--.|=|-. + ..--|++-.. ... ...| .+ ..+.+. T Consensus 1 m~~~~T~--------l~d~i~Pev~~~~v~~~~~~~l~~~~~~~-~~~------------~l~g-~~-------G~tv~i 51 (274) T protein:vir:96 1 MAQGMTK--------LTNQIVPEVLAPMMQAELEKKLRFASFAE-IDN------------TLVG-QP-------GDTLTF 51 (274) T ss_pred CCcceee--------hhheechHHHHHHHHHHHHhhhhccccce-ecc------------cccC-CC-------CCEEEe Confidence 111 000 000000010000 0 0000111000 000 0000 00 001111 Q ss_pred ccccchhhhhccCC-CCCccccceeEEEEEEEEeecccccceecHHHHHhHHHhh-CCChhHHHHHHHHHHHHHHhhHHH Q lcl|NC_015288. 180 PRGFSREDLEQAGD-AGKLFREMSFSIEKTSVTAKSRALKAEYTLELAQDLKAIH-GLDAEQELANILSSEVLAEINREV 257 (468) Q Consensus 180 ~~gm~Ta~aE~lG~-~g~~f~EMaFsIeK~tVtAKSRaLKAEYTvELAQDLkAiH-GLDAE~ELanILStEImlEINREI 257 (468) +.--...++|.+.. .+-+..++..+ +.+++.+-|+- + |.+ -|+.+.- +-|.-.+..+-++..++.++++++ T Consensus 52 P~~~~ig~a~~~~~g~~i~~~~lt~~--~~~~~i~~~~~-a-~~i---~D~~~~~~~~d~~~~~~~~~~~~~a~~vd~~i 124 (274) T protein:vir:96 52 PAFIYSGDAKVVAEGEKIPTDILETK--KREAKIRKIAK-G-TSI---SDEALLSGYGDPQGEQVRQHGLAHANKVDDDV 124 (274) T ss_pred eeecCCCccccccCCCccchhhcccc--eeEEEeeeeec-c-eee---hHHHHhhccchHHHHHHHHHHHHHHHHHHHHH Confidence 11001122222221 11223343333 33333344432 2 222 2555544 458889999999999999999999 Q ss_pred HHHHhhhcchhhccccccceeeeeecCCcchhHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEchhHHHHHhhccccc Q lcl|NC_015288. 258 VRRVYSVAKPGAANNVANAGIFDLDVDSNGRWSVEKFKGLLFQIERDCNAIAQDTRRGKGNFLICSADVASALAMAGVLD 337 (468) Q Consensus 258 I~~l~~vA~~~k~~~~~~~Gv~Dl~~~~~~rw~~e~~~~l~~~i~~ean~i~q~T~rg~~n~~v~S~~va~~L~~~G~~~ 337 (468) +..+.+....- +...+ + .+.+-..+.++..| -..+++++++|++++.|.-....+ T Consensus 125 ~~~l~~a~~~~------~~~~~------~----~d~i~~A~~~lgd~---------~~~~~~ivv~p~~~~~L~k~~~~~ 179 (274) T protein:vir:96 125 LEALKSAKLTV------EADIT------K----LTGLQTAIDKFNDE---------DLEPMVLFISPLDAGKLRGDATTN 179 (274) T ss_pred HHHHhcccccc------ccccc------C----HHHHHHHHHHhccc---------cccccEEEeCHHHHHHHHhhcccc Confidence 98776432211 11111 1 12222233333322 136789999999999996543333 Q ss_pred cccccccccccccccccCCCceeEEEecCCeEEEEccccccCCCcceEEEEEe-cCCcccceeEEc-cccccccccccCC Q lcl|NC_015288. 338 YSSGLTGAGGPAIGTVDDTGNLAVGTINGRIKVYVDPYAANLSDKHYYVVGYK-GTSPYDAGLFYC-PYVPLQMVRSIDP 415 (468) Q Consensus 338 ~~~~~~~~~~~~~~~~D~t~~~~~G~l~~~~~vy~D~Ya~~~s~~dY~~vG~K-g~~~~d~glfya-PYv~~~~~~~~Dp 415 (468) +...... + ..-..+-.+|++. |++||+| ++.|. |-.+-++ |+-. ||. +=++.+.. =|| T Consensus 180 f~~~s~~--g-----~~~~~~G~ig~~~-G~~Vi~s----~~~~~-~t~~l~~~gA~~-----~~~~~~~~vE~~--Rd~ 239 (274) T protein:vir:96 180 FTRATEL--G-----DDVIVKGAFGEAL-GAVIVRS----NKLEA-GTAILAKKGAVK-----LITKRDFFLETD--RDP 239 (274) T ss_pred ccccccc--c-----ccceeccccceec-CeEEEEe----CCCCC-ceEEEEecccee-----eeecCCcccccc--ccc Confidence 3321111 0 0011122467774 6999999 55553 3222222 2111 111 00011111 188 Q ss_pred ccccceeeeeeecceee-cC--cccccCcccccCChhhhhhccCc Q lcl|NC_015288. 416 NNFQPKIGFKTRYGMVS-NP--FVTTNGLYSGTPDGETLTPSTNM 457 (468) Q Consensus 416 ~s~qP~~g~~tRY~l~~-nP--f~~~~~~~~~~~~~~~~~~~~N~ 457 (468) .+++-.+-..-+||+.+ || -....... |+ + -| T Consensus 240 ~~~~d~i~~~~~y~~~~~~~~~~v~~tk~~-----~~-~----~~ 274 (274) T protein:vir:96 240 STKTTALYSDKHYVAYLYDESKAVKITKGS-----GS-L----EM 274 (274) T ss_pred ccccCEEEEeEEEEEEEEcCCcEEEEEcCC-----cc-c----cC Confidence 88888888888888754 44 11111110 10 0 00 No 47 >protein:vir:95898 Length: 274 # NCBI annotation: ORF014 # Family: family:all:522 # MgeID: mge:1588 # MgeName: 71 # Cross-refs: genbank:acc:YP_240385;genbank:gi:66396054;genbank:GeneID:5133409 Probab=80.56 E-value=0.094 Score=26.22 Aligned_cols=260 Identities=15% Similarity=0.069 Sum_probs=104.8 Q ss_pred CCc-cceeeeeeeeeecCCCCCccccc---c---ccccccccccccccccccccccccccCccccccccccccccccccc Q lcl|NC_015288. 107 MSG-PTGLIFAMRSRYENQAGEEALFN---E---PDAGFTAGLDATTGAYTPRTGAGVGGDAEGNNPALLNDSSPGTYET 179 (468) Q Consensus 107 mTG-PTGLIFAMRsrY~~qsG~EA~fn---E---a~t~fSG~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~a~~g~~t~ 179 (468) |.. .|- -.+.--.|=|-. + ..--|++-.. ... ...| .+ ..+.+. T Consensus 1 m~~~~T~--------l~d~i~Pev~~~~v~~~~~~~l~~~~~~~-~~~------------~l~g-~~-------G~tv~i 51 (274) T protein:vir:95 1 MAQGMTK--------LTNQIVPEVLAPMMQAELEKKLRFASFAE-IDN------------TLVG-QP-------GDTLTF 51 (274) T ss_pred CCcceee--------hhheechHHHHHHHHHHHHhhhhccccce-ecc------------cccC-CC-------CCEEEe Confidence 111 000 000000010000 0 0000111000 000 0000 00 001111 Q ss_pred ccccchhhhhccCC-CCCccccceeEEEEEEEEeecccccceecHHHHHhHHHhh-CCChhHHHHHHHHHHHHHHhhHHH Q lcl|NC_015288. 180 PRGFSREDLEQAGD-AGKLFREMSFSIEKTSVTAKSRALKAEYTLELAQDLKAIH-GLDAEQELANILSSEVLAEINREV 257 (468) Q Consensus 180 ~~gm~Ta~aE~lG~-~g~~f~EMaFsIeK~tVtAKSRaLKAEYTvELAQDLkAiH-GLDAE~ELanILStEImlEINREI 257 (468) +.--...++|.+.. .+-+..++..+ +.+++.+-|+- + |.+ -|+.+.- +-|.-.+..+-++..++.++++++ T Consensus 52 P~~~~ig~a~~~~~g~~i~~~~lt~~--~~~~~i~~~~~-a-~~i---~D~~~~~~~~d~~~~~~~~~~~~~a~~vd~~i 124 (274) T protein:vir:95 52 PAFIYSGDAKVVAEGEKIPTDILETK--KREAKIRKIAK-G-TSI---SDEALLSGYGDPQGEQVRQHGLAHANKVDDDV 124 (274) T ss_pred eeecCCCccccccCCCccchhhcccc--eeEEEeeeeec-c-eee---hHHHHhhccchHHHHHHHHHHHHHHHHHHHHH Confidence 11001122222221 11223343333 33333344432 2 222 2555544 458889999999999999999999 Q ss_pred HHHHhhhcchhhccccccceeeeeecCCcchhHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEchhHHHHHhhccccc Q lcl|NC_015288. 258 VRRVYSVAKPGAANNVANAGIFDLDVDSNGRWSVEKFKGLLFQIERDCNAIAQDTRRGKGNFLICSADVASALAMAGVLD 337 (468) Q Consensus 258 I~~l~~vA~~~k~~~~~~~Gv~Dl~~~~~~rw~~e~~~~l~~~i~~ean~i~q~T~rg~~n~~v~S~~va~~L~~~G~~~ 337 (468) +..+.+....- +...+ + .+.+-..+.++..| -..+++++++|++++.|.-....+ T Consensus 125 ~~~l~~a~~~~------~~~~~------~----~d~i~~A~~~lgd~---------~~~~~~ivv~p~~~~~L~k~~~~~ 179 (274) T protein:vir:95 125 LEALKSAKLTV------EADIT------K----LTGLQTAIDKFNDE---------DLEPMVLFISPLDAGKLRGDATTN 179 (274) T ss_pred HHHHhcccccc------ccccc------C----HHHHHHHHHHhccc---------cccccEEEeCHHHHHHHHhhcccc Confidence 98776432211 11111 1 12222233333322 136789999999999996543333 Q ss_pred cccccccccccccccccCCCceeEEEecCCeEEEEccccccCCCcceEEEEEe-cCCcccceeEEc-cccccccccccCC Q lcl|NC_015288. 338 YSSGLTGAGGPAIGTVDDTGNLAVGTINGRIKVYVDPYAANLSDKHYYVVGYK-GTSPYDAGLFYC-PYVPLQMVRSIDP 415 (468) Q Consensus 338 ~~~~~~~~~~~~~~~~D~t~~~~~G~l~~~~~vy~D~Ya~~~s~~dY~~vG~K-g~~~~d~glfya-PYv~~~~~~~~Dp 415 (468) +...... + ..-..+-.+|++. |++||+| ++.|. |-.+-++ |+-. ||. +=++.+.. =|| T Consensus 180 f~~~s~~--g-----~~~~~~G~ig~~~-G~~Vi~s----~~~~~-~t~~l~~~gA~~-----~~~~~~~~vE~~--Rd~ 239 (274) T protein:vir:95 180 FTRATEL--G-----DDVIVKGAFGEAL-GAVIVRS----NKLEA-GTAILAKKGAVK-----LITKRDFFLETD--RDP 239 (274) T ss_pred ccccccc--c-----ccceeccccceec-CeEEEEe----CCCCC-ceEEEEecccee-----eeecCCcccccc--ccc Confidence 3321111 0 0011122467774 6999999 55553 3222222 2111 111 00011111 188 Q ss_pred ccccceeeeeeecceee-cC--cccccCcccccCChhhhhhccCc Q lcl|NC_015288. 416 NNFQPKIGFKTRYGMVS-NP--FVTTNGLYSGTPDGETLTPSTNM 457 (468) Q Consensus 416 ~s~qP~~g~~tRY~l~~-nP--f~~~~~~~~~~~~~~~~~~~~N~ 457 (468) .+++-.+-..-+||+.+ || -....... |+ + -| T Consensus 240 ~~~~d~i~~~~~y~~~~~~~~~~v~~tk~~-----~~-~----~~ 274 (274) T protein:vir:95 240 STKTTALYSDKHYVAYLYDESKAVKITKGS-----GS-L----EM 274 (274) T ss_pred ccccCEEEEeEEEEEEEEcCCcEEEEEcCC-----cc-c----cC Confidence 88888888888888754 44 11111110 10 0 00 No 48 >protein:vir:7771 Length: 330 # NCBI annotation: gp17 # Family: family:all:507 # MgeID: mge:149 # MgeName: Bxz2 # Cross-refs: genbank:acc:NP_817605;genbank:gi:29566035;genbank:GeneID:1259229 Probab=79.90 E-value=0.1 Score=26.07 Aligned_cols=297 Identities=11% Similarity=0.054 Sum_probs=122.2 Q ss_pred hhhhccccccCccccccccccccccccccccccccee-hhhhHHhhhhhhhhheeeeecCCccceeeeeeeeeecCCCCC Q lcl|NC_015288. 49 MLQEVAVNSLGAGTVSPGGSALGSANTAGLAGFDPVL-ISLVRRAMPNLMAYDVCGVQPMSGPTGLIFAMRSRYENQAGE 127 (468) Q Consensus 49 ~l~e~~~~~~g~~~~~~~~~~~~st~tg~~~~~~P~L-v~l~RRa~~~LIa~DI~GVQPmTGPTGLIFAMRsrY~~qsG~ 127 (468) |-.+..... ....|.+++.. .-|.+ -.++++..++.+-.+++-+.||+++.--| - +... +. T Consensus 1 m~~~~~~a~-----------~~~~t~~~g~~-i~~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~-p---~~~~--~~ 62 (330) T protein:vir:77 1 MAGSTVPST-----------QVALTGDFSAF-LTPEQSQDYFAEIEKTSIVQRIARKVPMGPTGISI-P---HWTG--AV 62 (330) T ss_pred Ccccccchh-----------hccccCCCcce-echhHHHHHHHHHHhccchhhhcceeeccCCceEE-E---EEcC--Cc Confidence 222211000 01111111111 11222 22556666777888889999998765221 1 1110 11 Q ss_pred ccccccccccccccccccccccccccccccccCcccccccccccccccccccccccchhhhhccCCCCCccccceeEEEE Q lcl|NC_015288. 128 EALFNEPDAGFTAGLDATTGAYTPRTGAGVGGDAEGNNPALLNDSSPGTYETPRGFSREDLEQAGDAGKLFREMSFSIEK 207 (468) Q Consensus 128 EA~fnEa~t~fSG~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~a~~g~~t~~~gm~Ta~aE~lG~~g~~f~EMaFsIeK 207 (468) ++ .|- +| +..+++-..++++ T Consensus 63 ~a-------~~v------------------------------------------------~E-----g~~~~~~~~~f~~ 82 (330) T protein:vir:77 63 SA-------SWT------------------------------------------------GE-----AERKPITKGSFGK 82 (330) T ss_pred ce-------eEe------------------------------------------------cC-----CCccccccceeeE Confidence 00 000 01 2334445556677 Q ss_pred EEEEeecccccceecHHHHHhHHHhhCCChhHHHHHHHHHHHHHHhhHHHHHH---------Hhhhcchhhcccccccee Q lcl|NC_015288. 208 TSVTAKSRALKAEYTLELAQDLKAIHGLDAEQELANILSSEVLAEINREVVRR---------VYSVAKPGAANNVANAGI 278 (468) Q Consensus 208 ~tVtAKSRaLKAEYTvELAQDLkAiHGLDAE~ELanILStEImlEINREII~~---------l~~vA~~~k~~~~~~~Gv 278 (468) ++...|..+-...+|-||.+|- ..|.|+.|.+-|+..|...||+-||.- |...+.... ....... T Consensus 83 i~~~~~k~~~~~~is~ell~ds----~~~~~~~i~~~l~~ai~~~~~~~~l~G~g~~~~~~g~~~~~~~~~--~~~~~~~ 156 (330) T protein:vir:77 83 QELEPVKITTIFAESAEVVRLN----PLNYLNTMRTKIAEAIALKFDAAAIHGIDKPSAFKGYLAETTKVV--SLADTNL 156 (330) T ss_pred EEEeEEEEEEeehhhHHHHhcc----hHHHHHHHHHHHHHHHHHHHHHHhhcccCCCCccccccccccccc--eeecccc Confidence 7777777777778999999984 468999999999999999999988841 111110000 0000000 Q ss_pred eeeecCCcchhHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEchhHHHHHhhccccccccc-cccccccccccccCCC Q lcl|NC_015288. 279 FDLDVDSNGRWSVEKFKGLLFQIERDCNAIAQDTRRGKGNFLICSADVASALAMAGVLDYSSG-LTGAGGPAIGTVDDTG 357 (468) Q Consensus 279 ~Dl~~~~~~rw~~e~~~~l~~~i~~ean~i~q~T~rg~~n~~v~S~~va~~L~~~G~~~~~~~-~~~~~~~~~~~~D~t~ 357 (468) .+.. -.....+..+ ..-+..+.+ .-...+.+|++++....|.. +..+.+ ..-+.+. ..+... T Consensus 157 ~~~~-----~~~~~~~~~l----~~~~~~~~~--~~~~~~~~vmn~~~~~~l~~---lkd~~G~~l~~~~~---~~~~~~ 219 (330) T protein:vir:77 157 TTAS-----GPQGNAYLAV----NNALSLLVN--SGKKWTGTLLDNVTEPILNT---AVDGNGRPLFVEST---YTEQVG 219 (330) T ss_pred cccc-----cccchhHHHH----HHHHHhhhh--cCCCccEEEEcHHHHHHHHH---HhccCCceeecCcc---cccccc Confidence 1110 0111112122 111122221 22344568999999988853 222111 1101000 001111 Q ss_pred ceeEEEecCCeEEEEccccccCC----------CcceEEEEEecCCcc----cceeEEcc--ccccccccccCCccc--- Q lcl|NC_015288. 358 NLAVGTINGRIKVYVDPYAANLS----------DKHYYVVGYKGTSPY----DAGLFYCP--YVPLQMVRSIDPNNF--- 418 (468) Q Consensus 358 ~~~~G~l~~~~~vy~D~Ya~~~s----------~~dY~~vG~Kg~~~~----d~glfyaP--Yv~~~~~~~~Dp~s~--- 418 (468) ...-++|. |++|++.......+ ++.++++|-.+..+. ++.+.+.- |.. ....+-+-| T Consensus 220 ~~~~~~l~-G~PV~~~~~~p~~~~~~~~~~~~gd~s~~~i~~~~~~~i~~~~e~~~~~~~~~~~~---~~~~~~~~f~~~ 295 (330) T protein:vir:77 220 AIREGRIL-GRPTYVADNVVNGTVGNRVVGVMGDFSQVIWGQIGGLSFDVTDQATLDFGEEQGGV---WVPKLISLWQHN 295 (330) T ss_pred ccCCceec-ceeeEEeccccCCCCCCccEEEEEecceEEEEEecCcEEEEeecceeeeccccccc---ccccccchhhcC Confidence 11234554 48888886543211 122333444333222 12211110 000 000000111 Q ss_pred cceeeeeeeccee-ecC--cccccCcccccCChhh Q lcl|NC_015288. 419 QPKIGFKTRYGMV-SNP--FVTTNGLYSGTPDGET 450 (468) Q Consensus 419 qP~~g~~tRY~l~-~nP--f~~~~~~~~~~~~~~~ 450 (468) +=.+=...|++.. .+| |+..+....+.+.-+. T Consensus 296 ~~~~r~~~r~d~~v~~~~a~~~i~~~~~~~~~~~~ 330 (330) T protein:vir:77 296 MVAVRCEAEFAFMVNDKDAFVKLTDQVAGTDPEEE 330 (330) T ss_pred cEEEEEEEEeccEEecccceEEEEeccCCcCCCCC Confidence 1122233455543 345 4433222121111121 No 49 >protein:vir:80376 Length: 435 # NCBI annotation: gp6, major capsid head protein # Family: family:all:21 # MgeID: mge:1881 # MgeName: phi644-2 # Cross-refs: genbank:acc:YP_001111085;genbank:gi:134288639;genbank:GeneID:4960624 Probab=78.04 E-value=0.12 Score=25.66 Aligned_cols=343 Identities=13% Similarity=0.061 Sum_probs=112.9 Q ss_pred Cc-chHHHHHhhhhhhcC-----Ccccc--------------------ccchhhhhhhh----hhhhhHHHHHhhh-hhh Q lcl|NC_015288. 1 MF-NAEHLQEKWSPVLNN-----EAANP--------------------IADRYKKAVTS----VLLENQERFLREE-RGM 49 (468) Q Consensus 1 ~~-~~~~l~~kw~p~l~~-----~~~~~--------------------i~~~~~~~~~~----~llenq~~~~~e~-~~~ 49 (468) |. .-++|.++..-+-+- +..-+ -....|..-.+ .+...+. ...+. ..+ T Consensus 41 l~~ei~~l~~~i~~~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~ 119 (435) T protein:vir:80 41 LSSKFNELTAQIERAEAAERMAAAAAVPVDPNPAAVTASAAAPVYAQPKAPEVKGAKMARMVRALAAARG-DAQLASKLA 119 (435) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhhcccccchhhhhccccccccccccchhhhhHHHHHHHHHHHHhccc-hhHHHHHHH Confidence 11 112333333221100 00000 00001111111 1111110 00000 000 Q ss_pred hhhccccccCcccccccccccccccccccccccceehhhhHHhhhhhhhhhe-eeeecCCccceeeeeeeeeecCCCCCc Q lcl|NC_015288. 50 LQEVAVNSLGAGTVSPGGSALGSANTAGLAGFDPVLISLVRRAMPNLMAYDV-CGVQPMSGPTGLIFAMRSRYENQAGEE 128 (468) Q Consensus 50 l~e~~~~~~g~~~~~~~~~~~~st~tg~~~~~~P~Lv~l~RRa~~~LIa~DI-~GVQPmTGPTGLIFAMRsrY~~qsG~E 128 (468) +.....+. ........++..|+..--....-.++++..+..+...+ +=+-||+.+. +-+... . ++.+ T Consensus 120 ~~~~~~~~------~~~~~~~~~~~~gg~lvP~~~~~~ii~~l~~~~~i~~~~~~~v~~~~~~-~~~p~~---~--~~~~ 187 (435) T protein:vir:80 120 IERGFGEE------VAMSLNTLSPGAGGVLVPENLSSEVIELLRPKSVVRKLGARTLPLSNGN-ITIPRL---K--GGAI 187 (435) T ss_pred Hhhhhhhh------hhhhhcccCCCCCccccchhHHHHHHHHHhhhchhhhccceeeecCCCc-eEEEEE---e--CCcc Confidence 00000000 00000000111121110011101133333344444444 2234443332 111111 0 0000 Q ss_pred cccccccccccccccccccccccccccccccCcccccccccccccccccccccccchhhhhccCCCCCccccceeEEEEE Q lcl|NC_015288. 129 ALFNEPDAGFTAGLDATTGAYTPRTGAGVGGDAEGNNPALLNDSSPGTYETPRGFSREDLEQAGDAGKLFREMSFSIEKT 208 (468) Q Consensus 129 A~fnEa~t~fSG~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~a~~g~~t~~~gm~Ta~aE~lG~~g~~f~EMaFsIeK~ 208 (468) + .| . +| +..+++...+++++ T Consensus 188 a-------~~----------------------------------------------v--~E-----~~~~~~~~~~f~~i 207 (435) T protein:vir:80 188 V-------GY----------------------------------------------I--GA-----DTDIPTTQQQFDDL 207 (435) T ss_pred e-------ee----------------------------------------------e--cc-----CccccccccceeeE Confidence 0 00 0 00 12344555566666 Q ss_pred EEEeecccccceecHHHHHhHHHhhCCChhHHHHHHHHHHHHHHhhHHHHHHHhhhcchhhccccccceeeeeecCCc-- Q lcl|NC_015288. 209 SVTAKSRALKAEYTLELAQDLKAIHGLDAEQELANILSSEVLAEINREVVRRVYSVAKPGAANNVANAGIFDLDVDSN-- 286 (468) Q Consensus 209 tVtAKSRaLKAEYTvELAQDLkAiHGLDAE~ELanILStEImlEINREII~~l~~vA~~~k~~~~~~~Gv~Dl~~~~~-- 286 (468) +...+.-+-....|-||.+|-.- +.|.|+.|.+-|+..|...+++-||.- . | .+-.+.|++....... T Consensus 208 ~~~~~k~~~~~~is~ell~ds~~--~~~l~~~i~~~l~~a~~~~~d~a~l~G----~--G--~~~~p~Gi~~~~~~~~~~ 277 (435) T protein:vir:80 208 KLTAKKMAALVPIANDLIKYAGV--NPNVDQIVVGDLTAAIGAREDKAFIRD----D--G--TANTPKGLRFWALPGNVI 277 (435) T ss_pred EEeeEEEEEeehhhHHHHHhhcc--cHHHHHHHHHHHHHHHHHHHHHHhhcc----C--C--CCCcccceeeccccccee Confidence 66666666677899999998432 356788888888888888888877652 1 1 0012334433211110 Q ss_pred chhHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEchhHHHHHhhccccccccccccccccccccccCCCceeEEEecC Q lcl|NC_015288. 287 GRWSVEKFKGLLFQIERDCNAIAQDTRRGKGNFLICSADVASALAMAGVLDYSSGLTGAGGPAIGTVDDTGNLAVGTING 366 (468) Q Consensus 287 ~rw~~e~~~~l~~~i~~ean~i~q~T~rg~~n~~v~S~~va~~L~~~G~~~~~~~~~~~~~~~~~~~D~t~~~~~G~l~~ 366 (468) ..-.+...+.....+.+....+...........+|+++.....|.. +..+.+ + ... .+.++ |+|. T Consensus 278 ~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~vmn~~~~~~L~~---lkd~~G---~---~l~-~~~~~----~~l~- 342 (435) T protein:vir:80 278 TASDGSTLQKIETDLGKAILALENADANLTQPGWIMAPRTFRFLEG---LRDGNG---N---KVY-PELAN----GMLK- 342 (435) T ss_pred ecccccchhhHHHHHHHHHHHhhccccccccCEEEEcHHHHHHHHh---hhccCC---c---eec-cCCCC----CeEe- Confidence 0000011111111122211222111122334557899999988854 222211 1 111 12222 3554 Q ss_pred CeEEEEccccccCC------------CcceEEEEEecCCcccceeEEccccccccccccCCccc---cceeeeeeeccee Q lcl|NC_015288. 367 RIKVYVDPYAANLS------------DKHYYVVGYKGTSPYDAGLFYCPYVPLQMVRSIDPNNF---QPKIGFKTRYGMV 431 (468) Q Consensus 367 ~~~vy~D~Ya~~~s------------~~dY~~vG~Kg~~~~d~glfyaPYv~~~~~~~~Dp~s~---qP~~g~~tRY~l~ 431 (468) +++||++.+...+. ++.++++|-.+....+ ..+|.-+......--..| +=.+=..-|++.. T Consensus 343 G~pv~~~~~~p~~~~~~~~~~~i~~gd~s~~~i~~~~~~~i~----~~~~~~~~~~~~~~~~~f~~n~~~~r~~~r~d~~ 418 (435) T protein:vir:80 343 GYPVGKTTQVPINLGEAGKESEIYFTDFGDVFIGEEETLEID----YSKEATYKDADGHMVSAFQRDQTLIRVIAKNDFG 418 (435) T ss_pred eeeeEEeccccccccCCCCcceEEEEEcccEEEEeecceEEE----EeccccccccccchhhhhhcCcceeeeeeeeCcE Confidence 47888875532110 1112223333322211 111111000000000001 1222244555544 Q ss_pred e-cC--cccccCcccccCChhhhhh Q lcl|NC_015288. 432 S-NP--FVTTNGLYSGTPDGETLTP 453 (468) Q Consensus 432 ~-nP--f~~~~~~~~~~~~~~~~~~ 453 (468) + +| |+ ...+..|.+ T Consensus 419 ~~~~~a~~--------~l~~~~~~~ 435 (435) T protein:vir:80 419 PRHVESIA--------VLSGVAWGA 435 (435) T ss_pred eecccceE--------EEeccCCCC Confidence 4 23 22 123334433 No 50 >protein:vir:4856 Length: 293 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:106 # MgeName: DT1 # Cross-refs: genbank:acc:NP_049396;genbank:gi:9632424;genbank:GeneID:1258532 Probab=77.04 E-value=0.13 Score=25.46 Aligned_cols=276 Identities=15% Similarity=0.112 Sum_probs=112.8 Q ss_pred cccccCccccccccccccccccccccccccee-hhhhHHhhhhhhhhheeeeecCCccceeeeeeeeeecCCCCCccccc Q lcl|NC_015288. 54 AVNSLGAGTVSPGGSALGSANTAGLAGFDPVL-ISLVRRAMPNLMAYDVCGVQPMSGPTGLIFAMRSRYENQAGEEALFN 132 (468) Q Consensus 54 ~~~~~g~~~~~~~~~~~~st~tg~~~~~~P~L-v~l~RRa~~~LIa~DI~GVQPmTGPTGLIFAMRsrY~~qsG~EA~fn 132 (468) +.++...++... |+. -.-+.+ -.+++..-+..+-.+++.+=||++.+|-+==.+ ..+..+ .+ T Consensus 1 ~l~~~~~~t~~~----------gg~-liP~~~~~~Ii~~~~~~~~l~~~~~~~~~~~~~g~~~~~~--~~~~~~-~a--- 63 (293) T protein:vir:48 1 MLDSKTDHSGSD----------AGL-TIPQDIRTAINTLVRQYDSLQEYVNVENVTTLTGSRVYEK--WTDITG-LA--- 63 (293) T ss_pred CceeecccccCc----------Cce-EechhHHHHHHHHHHhhhhhhhhceeeeccCCcceEEEEe--ecCCCc-ce--- Confidence 222222111111 111 111111 124444445666778888888887665211111 000000 00 Q ss_pred cccccccccccccccccccccccccccCcccccccccccccccccccccccchhhhhccCCCCCccccce-eEEEEEEEE Q lcl|NC_015288. 133 EPDAGFTAGLDATTGAYTPRTGAGVGGDAEGNNPALLNDSSPGTYETPRGFSREDLEQAGDAGKLFREMS-FSIEKTSVT 211 (468) Q Consensus 133 Ea~t~fSG~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~a~~g~~t~~~gm~Ta~aE~lG~~g~~f~EMa-FsIeK~tVt 211 (468) .| .+| +..++|.+ .++++.+.. T Consensus 64 ----~~------------------------------------------------v~E-----g~~~~~~~~~~~~~i~l~ 86 (293) T protein:vir:48 64 ----NI------------------------------------------------DDE-----AGKIADIDDPKLSLIKYT 86 (293) T ss_pred ----ee------------------------------------------------ecC-----CcccccccccceeEEEEe Confidence 00 001 22344443 456666666 Q ss_pred eecccccceecHHHHHhHHHhhCCChhHHHHHHHHHHHHHHhhHHHHHHHhhhcchhhccccccceeeeeecCCcchhHH Q lcl|NC_015288. 212 AKSRALKAEYTLELAQDLKAIHGLDAEQELANILSSEVLAEINREVVRRVYSVAKPGAANNVANAGIFDLDVDSNGRWSV 291 (468) Q Consensus 212 AKSRaLKAEYTvELAQDLkAiHGLDAE~ELanILStEImlEINREII~~l~~vA~~~k~~~~~~~Gv~Dl~~~~~~rw~~ 291 (468) +|.-+-...+|-||.+|.. +|.|++|.+-|+..|..-+|+.|+.-+-..+. ..+.+++ T Consensus 87 ~~k~~~~~~iS~ell~ds~----~~l~~~i~~~la~~~~~~~~~~i~~g~~~~~~--------~~~~~~~---------- 144 (293) T protein:vir:48 87 IKRYAGISTVTNSLLADSA----ENILAWLSGWIAKKVVVTRNKAILGVVDKLPT--------KPTLTKW---------- 144 (293) T ss_pred eeEEEEeehhhHHHHhhhh----HHHHHHHHHHHHHHHHHHHHhHHhhccccccc--------cccccCH---------- Confidence 7777777889999999863 67899999999999999999998864432221 1122111 Q ss_pred HHHHHHHHHHHHHHHHHHHHhccCCccEEEEchhHHHHHhhccccccccccccccccccccccCCCceeEEEecCCeEEE Q lcl|NC_015288. 292 EKFKGLLFQIERDCNAIAQDTRRGKGNFLICSADVASALAMAGVLDYSSGLTGAGGPAIGTVDDTGNLAVGTINGRIKVY 371 (468) Q Consensus 292 e~~~~l~~~i~~ean~i~q~T~rg~~n~~v~S~~va~~L~~~G~~~~~~~~~~~~~~~~~~~D~t~~~~~G~l~~~~~vy 371 (468) +....++..+. .. -+.. ...+|++.....|.. +..+.+ + .....+-+. -..++|.| ++|+ T Consensus 145 d~i~~~~~~l~-------~~-~~~~-a~~vmn~~~~~~L~~---lkd~~g---~---~l~~~~~~~-~~~~~l~G-~Pv~ 204 (293) T protein:vir:48 145 DDIIDLEAKVD-------PA-IKQT-SFFLTNTSGFTALKK---VKNALG---D---YLMERDVKS-PTGYSIAG-FAVK 204 (293) T ss_pred HHHHHHHHhhh-------hh-hcCC-CEEEEcHHHHHHHHH---hhccCC---c---eEeecCcCC-CCCceecc-eeeE Confidence 22222222221 11 1222 356789988888753 222111 0 011111111 12245544 4655 Q ss_pred E--ccccccCCCcc----------eEEEEEecCCcccceeEEccccccccccccCCccccceeeeeeeccee-ecC--cc Q lcl|NC_015288. 372 V--DPYAANLSDKH----------YYVVGYKGTSPYDAGLFYCPYVPLQMVRSIDPNNFQPKIGFKTRYGMV-SNP--FV 436 (468) Q Consensus 372 ~--D~Ya~~~s~~d----------Y~~vG~Kg~~~~d~glfyaPYv~~~~~~~~Dp~s~qP~~g~~tRY~l~-~nP--f~ 436 (468) + |.+..+....+ |+.++.++.... -..++.. .+-.+-|=.+-...||+.. .+| |. T Consensus 205 ~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i----~~~~~~~------~~~~~~~~~~r~~~r~d~~~~~~~a~~ 274 (293) T protein:vir:48 205 EISDRWLPNASSGVMPLYFGDLKQAVTLFDRQQMSL----LSTNIGG------GAFETDTTKVRVIDRFDVVATDTEAFV 274 (293) T ss_pred EecccccCCccCCceEEEEEeccceEEEEEecceEE----EEecccc------hhhhcCeEEEEEEEeeCcEEecccceE Confidence 4 33322211111 222222221111 1111110 0112334445555666543 233 22 Q ss_pred cccCcccccCChhhhhhccCceeeeEEe Q lcl|NC_015288. 437 TTNGLYSGTPDGETLTPSTNMYYRRVQV 464 (468) Q Consensus 437 ~~~~~~~~~~~~~~~~~~~N~y~r~~~v 464 (468) ..+--....+.+ +...-+ | T Consensus 275 ~l~~~~~~~~~~-~~~~~~--------~ 293 (293) T protein:vir:48 275 PASFKAIADQKG-NIGSTA--------V 293 (293) T ss_pred EEEeeccccCCc-cccccC--------C Confidence 111000000000 000000 0 No 51 >protein:vir:78223 Length: 333 # NCBI annotation: Putative major head protein # Family: family:all:966 # MgeID: mge:1849 # MgeName: Bethlehem # Cross-refs: genbank:acc:YP_001491666;genbank:gi:157786490;genbank:GeneID:5625701 Probab=75.46 E-value=0.15 Score=25.16 Aligned_cols=305 Identities=14% Similarity=0.049 Sum_probs=119.0 Q ss_pred hhhhhhccccccCccccccccccccccccccccccccee--hhhhHHhhhhhhhhheeeeecCCccceeeeeeeeeecCC Q lcl|NC_015288. 47 RGMLQEVAVNSLGAGTVSPGGSALGSANTAGLAGFDPVL--ISLVRRAMPNLMAYDVCGVQPMSGPTGLIFAMRSRYENQ 124 (468) Q Consensus 47 ~~~l~e~~~~~~g~~~~~~~~~~~~st~tg~~~~~~P~L--v~l~RRa~~~LIa~DI~GVQPmTGPTGLIFAMRsrY~~q 124 (468) -+.|.|..+++.|.+..+.. ++.++. .. |.- -.+++...+..+..+++-+.||++..--|.-.. . T Consensus 1 ~a~l~el~~~~~~~~~~g~~------~~~~~~-li-P~~~~~~ii~~l~~~s~l~~~~~~~~~~~~~~~~p~~~----~- 67 (333) T protein:vir:78 1 MATLNELLPNSAGSNHQGRL------AHVPSD-LL-PKEIVGPIFDKAQESSLVLRMGEQIPISYGETIIPTTV----K- 67 (333) T ss_pred CchhHHhhhhcccccccCce------ecCCcc-cc-chhHHHHHHHHHHhhchhhhhcceeeccCCceEEEEEe----C- Confidence 23333333333332211111 111110 11 211 124455556667788899999876333222111 0 Q ss_pred CCCccccccccccccccccccccccccccccccccCcccccccccccccccccccccccchhhhhccCCCCCccccceeE Q lcl|NC_015288. 125 AGEEALFNEPDAGFTAGLDATTGAYTPRTGAGVGGDAEGNNPALLNDSSPGTYETPRGFSREDLEQAGDAGKLFREMSFS 204 (468) Q Consensus 125 sG~EA~fnEa~t~fSG~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~a~~g~~t~~~gm~Ta~aE~lG~~g~~f~EMaFs 204 (468) ...+ .|-+.+. ....++++........|.+..++ T Consensus 68 -~~~a-------~~v~eg~--------------------------------------~~~~~e~~~~~~~~~~f~~i~l~ 101 (333) T protein:vir:78 68 -RPEV-------GQVGVGT--------------------------------------SNEQREGGLKPLSGTAWDTRSVS 101 (333) T ss_pred -Ccee-------EeecCcc--------------------------------------cccccccccccccccceeEEEEe Confidence 0000 1111000 00001111111123445555555 Q ss_pred EEEEEEEeecccccceecHHHHHhHHHhhCCChhHHHHHHHHHHHHHHhhHHHHHHHhhhcchhhccccccceeeeee-- Q lcl|NC_015288. 205 IEKTSVTAKSRALKAEYTLELAQDLKAIHGLDAEQELANILSSEVLAEINREVVRRVYSVAKPGAANNVANAGIFDLD-- 282 (468) Q Consensus 205 IeK~tVtAKSRaLKAEYTvELAQDLkAiHGLDAE~ELanILStEImlEINREII~~l~~vA~~~k~~~~~~~Gv~Dl~-- 282 (468) ..|..+ -...|-||.+|-. .|.+++|.+.|...|...|+..+|.---.. ......|+.... T Consensus 102 ~~kl~~-------~~~is~ell~~s~----~~~~~~i~~~la~ai~~~~d~~~l~G~g~~------~~~~~~g~~~~~~~ 164 (333) T protein:vir:78 102 PIKLAT-------IVTVSEEFARMNP----SGLYTKLQGDLAYAIGRGIDLAVFHGKSPL------TGSALQGIDTDNVI 164 (333) T ss_pred eEEEEE-------eehhhHHHHhcCH----HHHHHHHHHHHHHHHHHHHHHHHhcccCCC------CCcccccccccccc Confidence 555554 3457788888754 478999999999999999999998521110 011111211100 Q ss_pred --c---CCcchhHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEchhHHHHHhhccccccccccccccccccccccCCC Q lcl|NC_015288. 283 --V---DSNGRWSVEKFKGLLFQIERDCNAIAQDTRRGKGNFLICSADVASALAMAGVLDYSSGLTGAGGPAIGTVDDTG 357 (468) Q Consensus 283 --~---~~~~rw~~e~~~~l~~~i~~ean~i~q~T~rg~~n~~v~S~~va~~L~~~G~~~~~~~~~~~~~~~~~~~D~t~ 357 (468) . ...+....- .|.-...+-.....-....++.+|++|+-...|.....+....+. .....+..+ T Consensus 165 ~~~~~~~~~~~~~~~-----~~~~i~~~~~~~~~~~~~~~~~~vmn~~~~~~L~~~~~~~d~~G~------~i~~~~~~~ 233 (333) T protein:vir:78 165 ANTTNVDYLQETGDP-----LLDRLLDGYDLVSANTDVEFNGWAVDPRFRAHLLRAQAYRDANGN------VDPSRINLA 233 (333) T ss_pred cccccccccccccch-----hHHHHHHHHHhhccccccCceEEEEcchHHHHHHHHhhhcCCCCc------eeecCcccc Confidence 0 000000001 111111222222222356677788898877776432222221110 000111111 Q ss_pred ceeEEEecCCeEEEEccccccCC-----CcceEE--------EEEecCCcccceeEEccccccccccccCCcccc-ceee Q lcl|NC_015288. 358 NLAVGTINGRIKVYVDPYAANLS-----DKHYYV--------VGYKGTSPYDAGLFYCPYVPLQMVRSIDPNNFQ-PKIG 423 (468) Q Consensus 358 ~~~~G~l~~~~~vy~D~Ya~~~s-----~~dY~~--------vG~Kg~~~~d~glfyaPYv~~~~~~~~Dp~s~q-P~~g 423 (468) . -.|+|.| ++|+++.+...+. +...++ +|..++.+.+ ..+|.-.......--.-|| -.++ T Consensus 234 ~-~~~~l~G-~Pv~~~~~i~~~~~~~~~~~~~~~~gD~~~~~~g~~~~~~i~----~~~~~~~~~~~~~~~~~~~~~~v~ 307 (333) T protein:vir:78 234 A-QTGDVLG-LPAQFGRAVGGDLGAAVDSKTRIIGGDFSQLKFGFADEIRIK----MSDTATLTDSGSATVSMWQTNQIA 307 (333) T ss_pred C-CCceeec-eeeEEccccCCCccccCCCccEEEEEecccEEEEEeeccEEE----EeccccccccccceeehhhcCcEE Confidence 1 1256664 6888774432110 111233 3333322221 1222100000000000111 1122 Q ss_pred --eeeeccee-ecC--cccccCcccc Q lcl|NC_015288. 424 --FKTRYGMV-SNP--FVTTNGLYSG 444 (468) Q Consensus 424 --~~tRY~l~-~nP--f~~~~~~~~~ 444 (468) ...|++.. .+| |+......+. T Consensus 308 ~r~~~r~d~~v~~~~a~~~l~~~~a~ 333 (333) T protein:vir:78 308 ILIEVTFGWLLGDKQAFVKFVDDEQP 333 (333) T ss_pred EEEEEEEccEEecccceEEEeccCCC Confidence 23577643 566 5544333332 No 52 >protein:vir:8420 Length: 477 # NCBI annotation: gp15 # Family: family:all:21 # MgeID: mge:155 # MgeName: Omega # Cross-refs: genbank:acc:NP_818316;genbank:gi:29566752;genbank:GeneID:1260033 Probab=75.28 E-value=0.15 Score=25.12 Aligned_cols=359 Identities=13% Similarity=0.088 Sum_probs=132.2 Q ss_pred Ccc-hHHHHHhhhhhhc------------CCccccccchhhhhhhhhhhh---hHHH-------HHhhhhhhhhhcccc- Q lcl|NC_015288. 1 MFN-AEHLQEKWSPVLN------------NEAANPIADRYKKAVTSVLLE---NQER-------FLREERGMLQEVAVN- 56 (468) Q Consensus 1 ~~~-~~~l~~kw~p~l~------------~~~~~~i~~~~~~~~~~~lle---nq~~-------~~~e~~~~l~e~~~~- 56 (468) +-. .++|.++=..... .+..+ ....++.-....+. ++.+ .-+..+.+....... T Consensus 66 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 143 (477) T protein:vir:84 66 LDEQIRELESEIERSGKLEAETKTVRKATVEVNE--ALTYEKGNGQSYFRDLAMQTVGMADEPAKERLRRHMVDVESDKE 143 (477) T ss_pred HHHHHHHHHHHHHHhhcchhhhhhhccccccccc--chhhhhhHHHHHHHHHHHHHhhhhhhHHHHHHHHHHhhhhhhhh Confidence 110 0111111000000 00000 00000000000000 0000 000000000000000 Q ss_pred ccC-cccccccccccccccccccccccceeh--hhhHHhhhhhhhhheeeeecCCccceeeeeeeeeecCCCCC-ccccc Q lcl|NC_015288. 57 SLG-AGTVSPGGSALGSANTAGLAGFDPVLI--SLVRRAMPNLMAYDVCGVQPMSGPTGLIFAMRSRYENQAGE-EALFN 132 (468) Q Consensus 57 ~~g-~~~~~~~~~~~~st~tg~~~~~~P~Lv--~l~RRa~~~LIa~DI~GVQPmTGPTGLIFAMRsrY~~qsG~-EA~fn 132 (468) ... .....-...+..++++|+ .-.-|..+ .++...-+..+..+++++.||++.+|-+-=-|.. +|. .+. T Consensus 144 ~~~~~~~~~~~~~~~~~~~~gg-~lv~~~~~~~~ii~~l~~~~~i~~~~~~~~~~~~~~~~~ip~~~----~~~~~a~-- 216 (477) T protein:vir:84 144 IRKIAKVGEEYRDLDRNGGTGG-YAVPPLWMMNRFIELARAGRTYANLCPTEPLPGGTSSINIPKIL----TGTSTAI-- 216 (477) T ss_pred HHHHHHhhhhhccccccCCCcc-eeeccchhHHHHHHHhhhcchHHHhhceeeecCCcceeEEEEEe----cCcceee-- Confidence 000 000000000011111111 11223221 2455455667778999999999988864322211 111 000 Q ss_pred cccccccccccccccccccccccccccCcccccccccccccccccccccccchhhhhccCCCCCccccceeEEEEEEEEe Q lcl|NC_015288. 133 EPDAGFTAGLDATTGAYTPRTGAGVGGDAEGNNPALLNDSSPGTYETPRGFSREDLEQAGDAGKLFREMSFSIEKTSVTA 212 (468) Q Consensus 133 Ea~t~fSG~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~a~~g~~t~~~gm~Ta~aE~lG~~g~~f~EMaFsIeK~tVtA 212 (468) +.+. +... .....++...+++.++..+ T Consensus 217 -----~~~E----------------------------------------------g~~~--~~~~~~~s~~~f~~i~~~~ 243 (477) T protein:vir:84 217 -----QAAD----------------------------------------------NAAL--TAPSAHEVDLTDGFVQANV 243 (477) T ss_pred -----eecc----------------------------------------------Cccc--ccccccccccceeeEEEee Confidence 0000 0000 0123455556677788888 Q ss_pred ecccccceecHHHHHhHHHhhCCChhHHHHHHHHHHHHHHhhHHHHHHHhhhcchhhccccccceeeeeecCC------- Q lcl|NC_015288. 213 KSRALKAEYTLELAQDLKAIHGLDAEQELANILSSEVLAEINREVVRRVYSVAKPGAANNVANAGIFDLDVDS------- 285 (468) Q Consensus 213 KSRaLKAEYTvELAQDLkAiHGLDAE~ELanILStEImlEINREII~~l~~vA~~~k~~~~~~~Gv~Dl~~~~------- 285 (468) |.-+-...+|-||.+|-. .|.++.|.+-|+..|..-|++.||.- .-.+-.+.|++...... T Consensus 244 ~k~~~~~~iS~ell~ds~----~~l~~~i~~~l~~~~~~~~d~~~l~G--------~Gt~~~p~Gi~~~~~~~~~~~~~~ 311 (477) T protein:vir:84 244 KTIAGQQGIAIQLLDQAA----VSVDEFVFRDLAADYANKLNVQVISG--------TGSNNQVVGVRATAGITQVTATSA 311 (477) T ss_pred eeEEeeeHHHHHHHhccc----hhHHHHHHHHHHHHHHHHHHHHHhcc--------CCCCCccceeeecccccccccccc Confidence 888888889999999843 57899999999999999999988851 10011244555432111 Q ss_pred cchhHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEchhHHHHHhhcccccccccccccc-----ccccccccCCCcee Q lcl|NC_015288. 286 NGRWSVEKFKGLLFQIERDCNAIAQDTRRGKGNFLICSADVASALAMAGVLDYSSGLTGAG-----GPAIGTVDDTGNLA 360 (468) Q Consensus 286 ~~rw~~e~~~~l~~~i~~ean~i~q~T~rg~~n~~v~S~~va~~L~~~G~~~~~~~~~~~~-----~~~~~~~D~t~~~~ 360 (468) ...|..- ..++.-.-.+..-.....+-.+..+|++|...+.|... .|..-...-+. .........-.... T Consensus 312 ~~t~~~~---~~~~~~i~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~l--kd~~G~~l~~~~~~~~~~~~~~~~~~~~~~ 386 (477) T protein:vir:84 312 GSALEKH---QIIYQKIADAIQRVHTSRFLEPEVIVMHPRRWASFHAI--FAGDDRPLIVPSGPGFNNLGVLTEVASQRV 386 (477) T ss_pred ccchhhH---HHHHHHHHHHHhhccccccCCccEEEEcHHHHHHHHHh--hccCCCeeeecCcccccccccccccccccc Confidence 0112100 01121111222222222333455677788766665321 11111000000 00000111122224 Q ss_pred EEEecCCeEEEEccccccC----CCcceEEEEEecCCcccceeEEccccccccccccCCcc--ccceeeeeeecc----- Q lcl|NC_015288. 361 VGTINGRIKVYVDPYAANL----SDKHYYVVGYKGTSPYDAGLFYCPYVPLQMVRSIDPNN--FQPKIGFKTRYG----- 429 (468) Q Consensus 361 ~G~l~~~~~vy~D~Ya~~~----s~~dY~~vG~Kg~~~~d~glfyaPYv~~~~~~~~Dp~s--~qP~~g~~tRY~----- 429 (468) .|+|+ +++|+++.+.-.+ .+..-+++|--.+--. . +..+..-++|.+ -...+.|.+ || T Consensus 387 ~~~l~-G~pVv~s~~~p~~~~~~~d~~~i~~gd~~~~~i-~--------~~~~~~~~~~~~~~~~~~~~~~v-~~~~~~~ 455 (477) T protein:vir:84 387 VGQMH-GLPVVTDPTLPTTLGTGTDQDVIHVLRASDLAL-F--------ESSVRMRALQETRAENLSVLLQV-YGYLAFT 455 (477) T ss_pred cchhc-ccceEecCcccccccccCCcceEEEEEeceEEE-E--------eeceeEEeccccccccceeeeee-hhhhhhh Confidence 56774 6799999665311 1223444444321100 0 000111122222 122333322 22 Q ss_pred eeecC--cccccCcccccCChh Q lcl|NC_015288. 430 MVSNP--FVTTNGLYSGTPDGE 449 (468) Q Consensus 430 l~~nP--f~~~~~~~~~~~~~~ 449 (468) .+-+| |+...-...-.|.-+ T Consensus 456 ~~r~~~afv~~t~~~~~~~~~~ 477 (477) T protein:vir:84 456 AARFPQSVVEIGGTALTAPTFA 477 (477) T ss_pred hhccccceEEeecccccccccC Confidence 22356 554322211112211 No 53 >protein:vir:105038 Length: 428 # NCBI annotation: major capsid head protein precursor # Family: family:all:21 # MgeID: mge:1465 # MgeName: phiKO2 # Cross-refs: genbank:acc:YP_006586;genbank:gi:46402092;genbank:GeneID:2777903 Probab=74.20 E-value=0.16 Score=24.93 Aligned_cols=337 Identities=16% Similarity=0.126 Sum_probs=110.8 Q ss_pred CcchH-------------HHHHhhhhhhcCCcc-----ccccc-hhhhhhhhhhhhhHHHH---------Hhhhhhhhhh Q lcl|NC_015288. 1 MFNAE-------------HLQEKWSPVLNNEAA-----NPIAD-RYKKAVTSVLLENQERF---------LREERGMLQE 52 (468) Q Consensus 1 ~~~~~-------------~l~~kw~p~l~~~~~-----~~i~~-~~~~~~~~~llenq~~~---------~~e~~~~l~e 52 (468) +++.| +|.++..-+=..|.+ .++.. ..++.....--+.|.+. +.+.++-+.. T Consensus 30 ~lt~ee~~~~~~l~~e~~~l~~~i~~~e~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 109 (428) T protein:vir:10 30 TLTAEQLTEFAGLQQQFTDISAKMDRMEATERAAALVAKPVKATQHGPAVIVKAEPKQYTGAGMTRMVMSIAAAQGNLQD 109 (428) T ss_pred CCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhchhhccccccccccchhhhHHHHHHHHHHHHhhhhHHH Confidence 33332 222222211000000 00000 00000000000111100 0000000000 Q ss_pred cc---ccccCcccccccccccccccccccc---cccceehhhhHHhhhhhhhhheeeeecCCccceeeeeeeeeecCCCC Q lcl|NC_015288. 53 VA---VNSLGAGTVSPGGSALGSANTAGLA---GFDPVLISLVRRAMPNLMAYDVCGVQPMSGPTGLIFAMRSRYENQAG 126 (468) Q Consensus 53 ~~---~~~~g~~~~~~~~~~~~st~tg~~~---~~~P~Lv~l~RRa~~~LIa~DI~GVQPmTGPTGLIFAMRsrY~~qsG 126 (468) .. .+..+ .......+..++++|++. ...+.++.+.| +..+..++ |++..++++|-+-=.| ..+ + T Consensus 110 ~~~~~~~~~~--~~~~~~~~~~~~~~gg~liP~~~~~~ii~~l~---~~~~l~~~-~~~~~~~~~g~~~~p~--~~~--~ 179 (428) T protein:vir:10 110 AAKFASDELN--DQSVSMAISTAAGSGGVLIPQNIHSEVIELLR---DRTIVRKL-GARSIPLPNGNMSLPR--LAG--G 179 (428) T ss_pred HHHHhhhhhh--hhhHhhhhcccccCCccccchhHHHHHHHHHh---hhchhhhh-cceeeecCCcceEEEE--EeC--C Confidence 00 00000 000000011111122221 11223333333 34444454 3333333333321111 000 0 Q ss_pred CccccccccccccccccccccccccccccccccCcccccccccccccccccccccccchhhhhccCCCCCccccceeEEE Q lcl|NC_015288. 127 EEALFNEPDAGFTAGLDATTGAYTPRTGAGVGGDAEGNNPALLNDSSPGTYETPRGFSREDLEQAGDAGKLFREMSFSIE 206 (468) Q Consensus 127 ~EA~fnEa~t~fSG~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~a~~g~~t~~~gm~Ta~aE~lG~~g~~f~EMaFsIe 206 (468) ..+ .| . +| +..+++...+++ T Consensus 180 ~~a-------~~----------------------------------------------v--~E-----g~~~~~~~~~f~ 199 (428) T protein:vir:10 180 ATA-------SY----------------------------------------------T--GE-----NQDAKVSEARFD 199 (428) T ss_pred cce-------ee----------------------------------------------e--cc-----Ccccccccccee Confidence 000 00 0 01 223444555566 Q ss_pred EEEEEeecccccceecHHHHHhHHHhhCCChhHHHHHHHHHHHHHHhhHHHHHHHhhhcchhhccccccceeeeee---- Q lcl|NC_015288. 207 KTSVTAKSRALKAEYTLELAQDLKAIHGLDAEQELANILSSEVLAEINREVVRRVYSVAKPGAANNVANAGIFDLD---- 282 (468) Q Consensus 207 K~tVtAKSRaLKAEYTvELAQDLkAiHGLDAE~ELanILStEImlEINREII~~l~~vA~~~k~~~~~~~Gv~Dl~---- 282 (468) +++...|.-+-...+|-||.+|- ..|.++.|.+.|...|...+|+.||.- ...+..+.|++-.. T Consensus 200 ~i~~~~~k~~~~v~is~ell~ds----~~~l~~~i~~~l~~ai~~~~d~~~l~G--------~G~~~~p~Gi~~~~~~~~ 267 (428) T protein:vir:10 200 DVKLTAKTMIAMVPISNALIGRA----GFNVEQLVLQDILTAISVREDKAFMRD--------DGTGDTPIGMKARATQWN 267 (428) T ss_pred eEEeeeEEEEEeehhhHHHHhhh----hHHHHHHHHHHHHHHHHHHHHHHHhcc--------CCCCcccccccccccccc Confidence 66666666666788999999884 245788888888888888888888741 11111223332211 Q ss_pred ------cCCcchhHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEchhHHHHHhhccccccccccccccccccccccCC Q lcl|NC_015288. 283 ------VDSNGRWSVEKFKGLLFQIERDCNAIAQDTRRGKGNFLICSADVASALAMAGVLDYSSGLTGAGGPAIGTVDDT 356 (468) Q Consensus 283 ------~~~~~rw~~e~~~~l~~~i~~ean~i~q~T~rg~~n~~v~S~~va~~L~~~G~~~~~~~~~~~~~~~~~~~D~t 356 (468) ......+ ..... ++....-+...... .....-.|+++.....|.. +..+.+ + .....+. T Consensus 268 ~~~~~~~~~~~~~--~~~~~-~~~~~~~~~~~~~~--~~~~~~~v~n~~~~~~L~~---lkd~~G---~---~i~~~~~- 332 (428) T protein:vir:10 268 RLLPWAADAAVNL--DTIDT-YLDSIILMSMDGNS--NMISSGWGMSNRTYMKLFG---LRDGNG---N---KVYPEMA- 332 (428) T ss_pred ccccccccccccH--HHHHH-HHHHHHHhhhcccc--ccccCEEEEcHHHHHHHHH---hhccCC---c---eeccCCC- Confidence 1111101 11110 11111111111111 1223345678888877753 222111 1 0111111 Q ss_pred CceeEEEecCCeEEEEccccccCC------------CcceEEEEEecCCcccceeEEcccccccccccc---CCccccce Q lcl|NC_015288. 357 GNLAVGTINGRIKVYVDPYAANLS------------DKHYYVVGYKGTSPYDAGLFYCPYVPLQMVRSI---DPNNFQPK 421 (468) Q Consensus 357 ~~~~~G~l~~~~~vy~D~Ya~~~s------------~~dY~~vG~Kg~~~~d~glfyaPYv~~~~~~~~---Dp~s~qP~ 421 (468) -|+| .+++||++.+...+. ++.++++|..+.-+.+ ..+|......... .=..-+=. T Consensus 333 ----~g~l-~G~pv~~~~~~p~~~~~~~~~~~i~~gd~s~~~i~~~~~i~i~----~~~~~~~~~~~~~~~~~f~~~~~~ 403 (428) T protein:vir:10 333 ----QGML-KGYPIQRTSAIPANLGEGGKESEIYFADFNDVVIGEDGNMKVD----FSKEASYIDTDGKLVSAFSRNQSL 403 (428) T ss_pred ----CCee-eceeeEEeccccccccCCCccceEEEEecceEEEEEecceEEE----eecccccccccccccchhhcchhh Confidence 2455 367888875432110 1122334443333221 1222111100000 00001112 Q ss_pred eeeeeecceeec-C--cccccCcccccCChhhh Q lcl|NC_015288. 422 IGFKTRYGMVSN-P--FVTTNGLYSGTPDGETL 451 (468) Q Consensus 422 ~g~~tRY~l~~n-P--f~~~~~~~~~~~~~~~~ 451 (468) +=...|+++.+. | |+..+ +-.| T Consensus 404 ~R~~~r~d~~v~~p~a~~~~t--------~~~~ 428 (428) T protein:vir:10 404 IRVVTEHDIGFRHPEGLVLGT--------GVLF 428 (428) T ss_pred eeeeeeeCceeeccceEEEEe--------ccCC Confidence 224456665543 4 33222 2223 No 54 >protein:vir:6242 Length: 390 # NCBI annotation: gp36 # Family: family:all:21 # MgeID: mge:131 # MgeName: phi-BT1 # Cross-refs: genbank:acc:NP_813696;swissprot:trembl:q859c1;genbank:gi:29366756;interpro:IPR006444;uniprot:Q859C1;genbank:GeneID:1258897 Probab=72.75 E-value=0.18 Score=24.68 Aligned_cols=337 Identities=14% Similarity=0.057 Sum_probs=113.0 Q ss_pred Ccc---hHHHHHhhh---hhhcCCccccccchhhhhhh---h--hhhhhHHHHHhhhhhhhhhccc---cccCcccc--- Q lcl|NC_015288. 1 MFN---AEHLQEKWS---PVLNNEAANPIADRYKKAVT---S--VLLENQERFLREERGMLQEVAV---NSLGAGTV--- 63 (468) Q Consensus 1 ~~~---~~~l~~kw~---p~l~~~~~~~i~~~~~~~~~---~--~llenq~~~~~e~~~~l~e~~~---~~~g~~~~--- 63 (468) |-= .|+.-++|+ -|++....-+..+.-++.+- + .-|+.|.+...+..+.+++... ...+.+.. T Consensus 4 ~~l~~l~e~r~~~~~e~~~L~~~~~~~~lt~e~~~~~~~l~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 83 (390) T protein:vir:62 4 TTLSANFEARERATAELRTLTDEFAGKEMTDEAREKEERLITAVSDYDARIKRGIEAIKAIDPVTSLLSGLQGSGSGAQR 83 (390) T ss_pred hHHHHHHHHHHHHHHHHHHHHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccchh Confidence 110 011112222 22221111011111111110 0 1111222211122111111000 00000000 Q ss_pred ---------------c------cccccccccccccccccccee-hhhhHHhh-hhhhhhheeeeecCCccceeeeeeeee Q lcl|NC_015288. 64 ---------------S------PGGSALGSANTAGLAGFDPVL-ISLVRRAM-PNLMAYDVCGVQPMSGPTGLIFAMRSR 120 (468) Q Consensus 64 ---------------~------~~~~~~~st~tg~~~~~~P~L-v~l~RRa~-~~LIa~DI~GVQPmTGPTGLIFAMRsr 120 (468) . ........+++++-.-.-|.+ -.++.... ...+...++-|-||++...+-+.... T Consensus 84 ~~~~~~~~~~r~~~~~~~r~~~~~~~~~~~t~~~~g~~~~~~~~~~~i~~~~~~~~~l~~~~~~~~~~~~~~~~~p~~~- 162 (390) T protein:vir:62 84 SADVDDDATLRAGNLGEARSFEFAPEKRDGTKAGNPNVLSRTLYGQLIAQAVERSAIMRGGATTFTTSDANPLDFTVIT- 162 (390) T ss_pred hcchHHHHHHhhhhhhhhHHHHhhhhhhcccccCCCccccccchHHHHHHHHhhhhhhhhcceeeecCCCceeEEEEEc- Confidence 0 000000011111100001111 11111111 12233445555555444333222210 Q ss_pred ecCCCCCccccccccccccccccccccccccccccccccCcccccccccccccccccccccccchhhhhccCCCCCcccc Q lcl|NC_015288. 121 YENQAGEEALFNEPDAGFTAGLDATTGAYTPRTGAGVGGDAEGNNPALLNDSSPGTYETPRGFSREDLEQAGDAGKLFRE 200 (468) Q Consensus 121 Y~~qsG~EA~fnEa~t~fSG~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~a~~g~~t~~~gm~Ta~aE~lG~~g~~f~E 200 (468) .+.++ .-.+ | +..+++ T Consensus 163 ----~~~~a-----------------------------------------------------~wv~--E-----~~~~~~ 178 (390) T protein:vir:62 163 ----GRSSA-----------------------------------------------------SIVG--E-----TAEIPE 178 (390) T ss_pred ----CCcce-----------------------------------------------------eeec--c-----cccccc Confidence 00000 0001 1 223444 Q ss_pred ceeEEEEEEEEeecccccceecHHHHHhHHHhhCCChhHHHHHHHHHHHHHHhhHHHHHHHhhhcchhhccccccceeee Q lcl|NC_015288. 201 MSFSIEKTSVTAKSRALKAEYTLELAQDLKAIHGLDAEQELANILSSEVLAEINREVVRRVYSVAKPGAANNVANAGIFD 280 (468) Q Consensus 201 MaFsIeK~tVtAKSRaLKAEYTvELAQDLkAiHGLDAE~ELanILStEImlEINREII~~l~~vA~~~k~~~~~~~Gv~D 280 (468) -.-++++++..+|.-+-....|-||.+|- .+|.+++|.+-|+..|..-+|..||.- . | .+.|++. T Consensus 179 ~~~~f~~i~~~~~k~~~~~~iS~ell~ds----~~~l~~~i~~~l~~~i~~~~d~~~l~G--------~--G-~p~Gi~~ 243 (390) T protein:vir:62 179 SYPATAQRSMGGFKYGFASVVSYEFATDQ----VLDLVGFLVSDAGPAIGDAMGRHFITG--------T--G-QPRGILT 243 (390) T ss_pred cccceeeeEeeeeeEEeehHHHHHHHhhh----hHHHHHHHHHHHHHHHHHHHHhhhhcc--------C--C-ccccccc Confidence 44455666677777777788999999993 367899999999999999999998852 0 1 1233333 Q ss_pred eecCCcchhH----H-HHHHHHHHHHHHHHHHHHHHhccCCccEEEEchhHHHHHhhccccccccccccccccccccccC Q lcl|NC_015288. 281 LDVDSNGRWS----V-EKFKGLLFQIERDCNAIAQDTRRGKGNFLICSADVASALAMAGVLDYSSGLTGAGGPAIGTVDD 355 (468) Q Consensus 281 l~~~~~~rw~----~-e~~~~l~~~i~~ean~i~q~T~rg~~n~~v~S~~va~~L~~~G~~~~~~~~~~~~~~~~~~~D~ 355 (468) .......... . -.|..+ ..+. +++... -+..+. .|+++.....|.. |....+ + .....+- T Consensus 244 ~~~~~~~~~~~~~~~~~~~~~l-~~~~---~~l~~~-~~~~a~-~vmn~~~~~~L~~---lkd~~g---~---~l~~~~~ 308 (390) T protein:vir:62 244 DASPATATFLATDTDSKVSDAL-IDLF---HEVPSA-YRANAK-YVVNDLRAAQMRK---LKDANG---Q---YLWQSGL 308 (390) T ss_pred cccccccceecccccccchHHH-HHHH---Hhhhhh-hhcCCE-EEEchHHHHHHHH---hhccCC---C---eeecCCc Confidence 2111100000 0 001111 1111 122111 223343 5778887777742 222111 0 0111111 Q ss_pred CCceeEEEecCCeEEEEccccccCCCcceEEEEEecCCcccceeEEccccccccccccCCcc--ccceeeeeeeccee-e Q lcl|NC_015288. 356 TGNLAVGTINGRIKVYVDPYAANLSDKHYYVVGYKGTSPYDAGLFYCPYVPLQMVRSIDPNN--FQPKIGFKTRYGMV-S 432 (468) Q Consensus 356 t~~~~~G~l~~~~~vy~D~Ya~~~s~~dY~~vG~Kg~~~~d~glfyaPYv~~~~~~~~Dp~s--~qP~~g~~tRY~l~-~ 432 (468) +.. .-++|.| ++|+++.++. .+=+++|- -.. .+...--.....+..|+-. -|=.+=+..|++.. . T Consensus 309 ~~g-~~~~l~G-~Pv~~~~~~p----~~~i~~gd---~s~---~~i~~~~~~~v~~~~~~~~~~~~~~~~~~~r~d~~~~ 376 (390) T protein:vir:62 309 TVG-APSLFNG-KVVETDDGMP----ADKILFAD---LSK---YRVRFAGSLRVDRSVDAKFSTDQIVYRFLQRADGLLV 376 (390) T ss_pred CCC-ccceecc-cceEEecCCC----CccEEEee---ccc---eeEEeecceEEEeeccccccCCcEEEEEEEEeCcEee Confidence 111 1135654 6888885443 23233331 000 0000000111111223322 22223344566543 3 Q ss_pred cC--cccccCcccccCChhhhhhcc Q lcl|NC_015288. 433 NP--FVTTNGLYSGTPDGETLTPST 455 (468) Q Consensus 433 nP--f~~~~~~~~~~~~~~~~~~~~ 455 (468) || |.... +..++ T Consensus 377 ~~~A~~~l~-----------~~~~a 390 (390) T protein:vir:62 377 DARGAKVLT-----------VTPGA 390 (390) T ss_pred chhheEEEE-----------eecCC Confidence 44 22111 11111 No 55 >protein:vir:1638 Length: 298 # NCBI annotation: Structural protein # Family: family:all:966 # MgeID: mge:33 # MgeName: r1t # Cross-refs: genbank:acc:NP_695059;genbank:gi:23455750;genbank:GeneID:955469 Probab=71.70 E-value=0.19 Score=24.51 Aligned_cols=278 Identities=16% Similarity=0.160 Sum_probs=114.7 Q ss_pred ccccccccccccceeh-hhhHHhhhhhhhhheeeeecCCccceeeeeeeeeecCCCCCcccccccccccccccccccccc Q lcl|NC_015288. 71 GSANTAGLAGFDPVLI-SLVRRAMPNLMAYDVCGVQPMSGPTGLIFAMRSRYENQAGEEALFNEPDAGFTAGLDATTGAY 149 (468) Q Consensus 71 ~st~tg~~~~~~P~Lv-~l~RRa~~~LIa~DI~GVQPmTGPTGLIFAMRsrY~~qsG~EA~fnEa~t~fSG~~~~~~~~~ 149 (468) -.+++|.+. -|.+. .+++.+.+..+-.+++.+.||++...-|. .. . .+.++ .|- T Consensus 1 ma~~gG~lv--p~~~~~~ii~~~~~~s~i~~l~~~~~~~~~~~~ip-~~---~--~~~~a-------~~v---------- 55 (298) T protein:vir:16 1 MVLNKGTLF--DPTLVTDLISKVAGKSSIARLSAQKPIPFNGEKVF-TF---T--MDSEI-------DVV---------- 55 (298) T ss_pred CcccCccee--chhHHHHHHHHHHhhhhhhhhcceeeccCCceEEE-EE---e--cCcce-------EEe---------- Confidence 112222221 12111 23444446678899999999976432221 11 0 00000 000 Q ss_pred ccccccccccCcccccccccccccccccccccccchhhhhccCCCCCccccceeEEEEEEEEeecccccceecHHHHHhH Q lcl|NC_015288. 150 TPRTGAGVGGDAEGNNPALLNDSSPGTYETPRGFSREDLEQAGDAGKLFREMSFSIEKTSVTAKSRALKAEYTLELAQDL 229 (468) Q Consensus 150 ~~~~~~~~~~~~~g~~~~~~~~a~~g~~t~~~gm~Ta~aE~lG~~g~~f~EMaFsIeK~tVtAKSRaLKAEYTvELAQDL 229 (468) +| +.++++-..++++++..+|.-+-....|-||.++- T Consensus 56 --------------------------------------~E-----~~~~~~~~~~f~~v~l~~~k~a~~~~iS~ell~~s 92 (298) T protein:vir:16 56 --------------------------------------AE-----SGKKTHGGVTLAPQTMVPIKVEYGARISDEFMYAS 92 (298) T ss_pred --------------------------------------cC-----CccccccccceeEEEEeeeeEEEeehhhHHHhhcC Confidence 01 22344444555666666666666688999998754 Q ss_pred HHhhCCChhHHHHHHHHHHHHHHhhHHHHHHHhhhcchhhcccc-ccceeee---eecCCcchhHHHHHHHHHHHHHHHH Q lcl|NC_015288. 230 KAIHGLDAEQELANILSSEVLAEINREVVRRVYSVAKPGAANNV-ANAGIFD---LDVDSNGRWSVEKFKGLLFQIERDC 305 (468) Q Consensus 230 kAiHGLDAE~ELanILStEImlEINREII~~l~~vA~~~k~~~~-~~~Gv~D---l~~~~~~rw~~e~~~~l~~~i~~ea 305 (468) -. -..|-+++|.+-|+..|...|+..++.-... .-|+...+ ...++.. ..+.....+ ...+.. +..-. T Consensus 93 ~d-~~~~l~~~i~~~la~ai~~~~d~~~l~G~~~--~~g~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~----i~~~~ 164 (298) T protein:vir:16 93 DE-EKINILQEFNDGFAKKVARGIDLMAFHGVNP--RLGTASAVIGTNHFDSKVTQKVEAPRGI-ADPNGA----IENAV 164 (298) T ss_pred cc-cHHHHHHHHHHHHHHHHHHHHHHHhhccccC--CCCccccccccccccccccccccccccc-ccHHHH----HHHHH Confidence 32 1245677788888888888887777753210 01111111 0001100 011111000 011111 11111 Q ss_pred HHHHHHhccCCccEEEEchhHHHHHhhccccccccccccccccccccccCCCceeEEEecCCeEEEEcccccc--CCCcc Q lcl|NC_015288. 306 NAIAQDTRRGKGNFLICSADVASALAMAGVLDYSSGLTGAGGPAIGTVDDTGNLAVGTINGRIKVYVDPYAAN--LSDKH 383 (468) Q Consensus 306 n~i~q~T~rg~~n~~v~S~~va~~L~~~G~~~~~~~~~~~~~~~~~~~D~t~~~~~G~l~~~~~vy~D~Ya~~--~s~~d 383 (468) ..+ . ...++..-+|++++....|.. +...-+ + .....+.++. -.|+|+| ++|+++..... .++.+ T Consensus 165 ~~~-~-~~~~~~~~~vmn~~~~~~l~~---lkd~~G---~---~i~~~~~~~~-~~~~l~G-~PV~~~~~v~~~~~~~~~ 231 (298) T protein:vir:16 165 ELL-T-GVDADVTGIAINPSFRSALAK---QKDLQD---N---ALFPELKWGA-TPDTING-LPVDVNKTVSDMSLTQRD 231 (298) T ss_pred HHh-h-hcCCCccEEEEcHHHHHHHHH---hhccCC---C---eeecCcccCC-CCceecc-eeeEEecccccccCCCcc Confidence 111 1 123444558889998888753 122111 0 0111111111 1267765 68888754322 23344 Q ss_pred eEEEEEecCCcccceeEEccccc--cccccccCCcc-----cc-ceeee--eeecce-eecC--cccccCcccccCChhh Q lcl|NC_015288. 384 YYVVGYKGTSPYDAGLFYCPYVP--LQMVRSIDPNN-----FQ-PKIGF--KTRYGM-VSNP--FVTTNGLYSGTPDGET 450 (468) Q Consensus 384 Y~~vG~Kg~~~~d~glfyaPYv~--~~~~~~~Dp~s-----~q-P~~g~--~tRY~l-~~nP--f~~~~~~~~~~~~~~~ 450 (468) .+++|-- ..++.|..--. +.+.+..||++ || =.++| ..|++. +.+| |+..+ T Consensus 232 ~~~~GDf-----s~~~~~~~~~~~~~~~~~~~~~~~~~~~~f~~~~v~~ra~~r~d~~v~~~~a~~~l~----------- 295 (298) T protein:vir:16 232 RAIIGDF-----ANGFKWGYAKEVPLEVIQYGDPDNSGLDLKGYNQVYIRAELFLGWGILDATKFARVT----------- 295 (298) T ss_pred EEEEeec-----cceEEEEEecCceEEEeeccCCcCcchhhhhcCcEEEEEEEEEccEeecccceEEEe----------- Confidence 5555511 01112221111 22222234432 22 11333 446663 3444 33211 Q ss_pred hhhccC Q lcl|NC_015288. 451 LTPSTN 456 (468) Q Consensus 451 ~~~~~N 456 (468) .+| T Consensus 296 ---~at 298 (298) T protein:vir:16 296 ---EAN 298 (298) T ss_pred ---ecC Confidence 111 No 56 >protein:vir:3870 Length: 400 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:82 # MgeName: A2 # Cross-refs: genbank:acc:NP_680487;swissprot:trembl:q8ltc0;genbank:gi:22296527;interpro:IPR006444;uniprot:Q8LTC0;genbank:GeneID:951713 Probab=70.28 E-value=0.21 Score=24.29 Aligned_cols=326 Identities=13% Similarity=0.075 Sum_probs=118.9 Q ss_pred CcchHHHHHhh-------h----------hhhcCCccccc--cc----hhhhhhhhhhhhhHHHHHhhhhhhhhhccccc Q lcl|NC_015288. 1 MFNAEHLQEKW-------S----------PVLNNEAANPI--AD----RYKKAVTSVLLENQERFLREERGMLQEVAVNS 57 (468) Q Consensus 1 ~~~~~~l~~kw-------~----------p~l~~~~~~~i--~~----~~~~~~~~~llenq~~~~~e~~~~l~e~~~~~ 57 (468) +-..+++++++ . ...+.+..... .. ...+.....-++...+................ T Consensus 41 ~~~~~e~~~~~~~l~~ei~~l~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 120 (400) T protein:vir:38 41 LKKAEGVRAKYDKAGKEIKDLEEKRDLYEAALKGNEQSSGKKPDHPEEHSYRDALNAYLHTRGRNTDGVNFEKTDVGTFA 120 (400) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccchhhhhHHHHHHHHHhhHHHHHHHHHHHHHHHHHHh Confidence 11111111111 1 11111111000 00 00011111111111111000000000000000 Q ss_pred c-Cccccccccccccc--ccccccc---cccceehhhhHHhhhhhhhhheeeeecCCccceeeeeeeeeecCCCCCcccc Q lcl|NC_015288. 58 L-GAGTVSPGGSALGS--ANTAGLA---GFDPVLISLVRRAMPNLMAYDVCGVQPMSGPTGLIFAMRSRYENQAGEEALF 131 (468) Q Consensus 58 ~-g~~~~~~~~~~~~s--t~tg~~~---~~~P~Lv~l~RRa~~~LIa~DI~GVQPmTGPTGLIFAMRsrY~~qsG~EA~f 131 (468) . .............. ++.|++. .+.+.+ +++..+..+..+++.+.||++.++-+--++.. ++.-+++ T Consensus 121 ~~~~~~~~~~~~~~~~~~~~~gg~~vP~~~~~~i---i~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~ 193 (400) T protein:vir:38 121 VLRAVPTDASDAVNAGVKAADAASTIPETISNTP---QRELQTVVDLKPFTNVFQASTQKGTYPTVANA----TTKMVTV 193 (400) T ss_pred hhhhhhHHHHHHHhhcccccCCcccccHHHHHHH---HHHHHhhhhhhhcceeEeccCcceEEEEEecC----CCccccc Confidence 0 00000000000000 1112211 122333 44444666788899999999887755443311 1100000 Q ss_pred ccccccccccccccccccccccccccccCcccccccccccccccccccccccchhhhhccCCCCCccccc-eeEEEEEEE Q lcl|NC_015288. 132 NEPDAGFTAGLDATTGAYTPRTGAGVGGDAEGNNPALLNDSSPGTYETPRGFSREDLEQAGDAGKLFREM-SFSIEKTSV 210 (468) Q Consensus 132 nEa~t~fSG~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~a~~g~~t~~~gm~Ta~aE~lG~~g~~f~EM-aFsIeK~tV 210 (468) .| +...++. ..+++.++. T Consensus 194 ~E-------------------------------------------------------------~~~~~~~~~~~f~~i~~ 212 (400) T protein:vir:38 194 AE-------------------------------------------------------------LEKNPAMAKPEFKPVNW 212 (400) T ss_pred cc-------------------------------------------------------------cccccccccccceeeEe Confidence 00 0011111 123344445 Q ss_pred EeecccccceecHHHHHhHHHhhCCChhHHHHHHHHHHHHHHhhHHHHHHHhhhcchhhccccccceeeeeecCCcchhH Q lcl|NC_015288. 211 TAKSRALKAEYTLELAQDLKAIHGLDAEQELANILSSEVLAEINREVVRRVYSVAKPGAANNVANAGIFDLDVDSNGRWS 290 (468) Q Consensus 211 tAKSRaLKAEYTvELAQDLkAiHGLDAE~ELanILStEImlEINREII~~l~~vA~~~k~~~~~~~Gv~Dl~~~~~~rw~ 290 (468) .+|.-+-...+|-||.+|- ..|.+++|.+-|...|...+|+-|+.-.-. ....|+..+ T Consensus 213 ~~~k~~~~~~is~ell~ds----~~~~~~~i~~~l~~~~~~~~~~~i~~~~~~---------~~~~~~~~~--------- 270 (400) T protein:vir:38 213 SVETYRQALPVSQESIDDS----AIDLVGLIAQNGQQIKVNTTNGAVATLLKG---------FTAKTISSV--------- 270 (400) T ss_pred ehhheeeehhhHHHHHhhh----HHHHHHHHHHHHHHHHHHHHHHhhhhcccc---------ccccccccH--------- Confidence 5555555678999999985 347888999999999999999888753321 112222111 Q ss_pred HHHHHHHHHHHHHHHHHHHHHhccCCccEEEEchhHHHHHhhccccccccccccccccccccccCCCceeEEEecCCeEE Q lcl|NC_015288. 291 VEKFKGLLFQIERDCNAIAQDTRRGKGNFLICSADVASALAMAGVLDYSSGLTGAGGPAIGTVDDTGNLAVGTINGRIKV 370 (468) Q Consensus 291 ~e~~~~l~~~i~~ean~i~q~T~rg~~n~~v~S~~va~~L~~~G~~~~~~~~~~~~~~~~~~~D~t~~~~~G~l~~~~~v 370 (468) .....+ +.... . ..+.+ .+|+|+.....|.. +..+. ++ .+...+-++. ..++|.| ++| T Consensus 271 -~~~~~~-~~~~~-------~-~~~~a-~~v~~~~~~~~l~~---lkd~~---G~---~i~~~~~~~~-~~~~l~G-~pv 328 (400) T protein:vir:38 271 -DDLKHI-NNVDL-------D-PAYSR-VIIASQSFYNFLDT---VKDGN---GR---YLLQDSILTP-SGKSVLG-MPI 328 (400) T ss_pred -HHHHHH-HHhhh-------h-hhhCc-EEEEcHHHHHHHHH---hhccC---CC---eeeecCcCCC-Ccccccc-cee Confidence 111111 11110 1 11233 46778888877753 11111 00 0111111111 1245644 556 Q ss_pred EEcccccc-CCCcceEEEEEecCCcccceeEEccccccccccccCCccccceeeeeeecceee-cC--cccccCcccccC Q lcl|NC_015288. 371 YVDPYAAN-LSDKHYYVVGYKGTSPYDAGLFYCPYVPLQMVRSIDPNNFQPKIGFKTRYGMVS-NP--FVTTNGLYSGTP 446 (468) Q Consensus 371 y~D~Ya~~-~s~~dY~~vG~Kg~~~~d~glfyaPYv~~~~~~~~Dp~s~qP~~g~~tRY~l~~-nP--f~~~~~~~~~~~ 446 (468) ++...... ......+++|--.. .+........ -....|-..|+..+-...|++..+ +| |.... T Consensus 329 ~~~~~~~~~~~g~~~~~~gd~s~-----~~~~~~~~~~-~~~~~~~~~~~~~~~~~~r~d~~~~~~~a~~~l~------- 395 (400) T protein:vir:38 329 AVVSDDTLGAAGEAHAFLGDIKR-----AILFANRADF-MVRWVDDQIYGQFLQAGMRFGVSVADEKAGYFLT------- 395 (400) T ss_pred EEecccccCCCCceEEEEEeccc-----cEEEEeecce-EEEEecccccceeEEEEEEeccEEecccceEEEE------- Confidence 55422110 01111222221000 0001111011 112235556677777788988654 33 33221 Q ss_pred Chhhhhhcc Q lcl|NC_015288. 447 DGETLTPST 455 (468) Q Consensus 447 ~~~~~~~~~ 455 (468) +...| T Consensus 396 ----~~~~a 400 (400) T protein:vir:38 396 ----YTPKA 400 (400) T ss_pred ----eecCC Confidence 11111 No 57 >protein:vir:4339 Length: 395 # NCBI annotation: major head protein # Family: family:all:585 # MgeID: mge:93 # MgeName: D3 # Cross-refs: genbank:acc:NP_061502;genbank:gi:9635591;genbank:GeneID:1262860 Probab=69.14 E-value=0.23 Score=24.11 Aligned_cols=330 Identities=11% Similarity=0.019 Sum_probs=117.8 Q ss_pred CcchHHHHHhhhhhhcC------------------CccccccchhhhhhhhhhhhhHHHHHhhhhhhhhhccccccCccc Q lcl|NC_015288. 1 MFNAEHLQEKWSPVLNN------------------EAANPIADRYKKAVTSVLLENQERFLREERGMLQEVAVNSLGAGT 62 (468) Q Consensus 1 ~~~~~~l~~kw~p~l~~------------------~~~~~i~~~~~~~~~~~llenq~~~~~e~~~~l~e~~~~~~g~~~ 62 (468) .-.++++.++=..+... +.. .......+..-..+.+ ....+.+...... ... T Consensus 37 ~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~----~~~ 106 (395) T protein:vir:43 37 GEMNKETRAKVDELLTAQGELQARLSAAEQAMLANEKR-DGGEEAPKTAGQMVAE-----SLKEQGVTSSLRG----SHR 106 (395) T ss_pred hhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcc-ccccchhhhHHHHHHH-----HHHHHHHHHHhhh----hhh Confidence 00011111111111110 000 0000000000000000 0001111100000 000 Q ss_pred cccc-ccccccccccccccccce-ehhhhHHhhhhhhhhheeeeecCCccceeeeeeeeeecCCCCCccccccccccccc Q lcl|NC_015288. 63 VSPG-GSALGSANTAGLAGFDPV-LISLVRRAMPNLMAYDVCGVQPMSGPTGLIFAMRSRYENQAGEEALFNEPDAGFTA 140 (468) Q Consensus 63 ~~~~-~~~~~st~tg~~~~~~P~-Lv~l~RRa~~~LIa~DI~GVQPmTGPTGLIFAMRsrY~~qsG~EA~fnEa~t~fSG 140 (468) ..-. ......+.+++. -.-|. .-.++++..+..+..++|.++||.+++.-+. | +...++. + .| T Consensus 107 ~~~~~~~~~~~~~~~g~-~vp~~~~~~ii~~~~~~~~l~~l~~~~~~~~~~~~~~--~--~~~~~~~-a-------~~-- 171 (395) T protein:vir:43 107 VSMPRSAITSIDGSGGA-LVAPDRRPGVVAAPQRRLTIRDLVAPGTTESNSVEYV--R--ETGFVNN-A-------AP-- 171 (395) T ss_pred hhhhhhhhcccCCCCcc-ccchhhHHHHHHHHHhhhhHHhhccceecCCCceEEE--E--EecCCCc-e-------ee-- Confidence 0000 000000111110 01111 1234444556677889999999988753321 1 1111000 0 00 Q ss_pred cccccccccccccccccccCcccccccccccccccccccccccchhhhhccCCCCCccccceeEEEEEEEEeecccccce Q lcl|NC_015288. 141 GLDATTGAYTPRTGAGVGGDAEGNNPALLNDSSPGTYETPRGFSREDLEQAGDAGKLFREMSFSIEKTSVTAKSRALKAE 220 (468) Q Consensus 141 ~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~a~~g~~t~~~gm~Ta~aE~lG~~g~~f~EMaFsIeK~tVtAKSRaLKAE 220 (468) . +| +..+++-..++++.+...|.-+-... T Consensus 172 --------------------------------------------v--~E-----~~~~~~~~~~~~~i~~~~~k~~~~~~ 200 (395) T protein:vir:43 172 --------------------------------------------V--SE-----GTQKPYSDLTFELENAPVRTIAHLFK 200 (395) T ss_pred --------------------------------------------e--cC-----CccccccccceeEEEEeeeeEEEeeh Confidence 0 00 11233344455555666666666678 Q ss_pred ecHHHHHhHHHhhCCChhHHHHHHHHHHHHHHhhHHHHHHHhhhcchhhccccccceeeeeecCCc--chhHHHHHHHHH Q lcl|NC_015288. 221 YTLELAQDLKAIHGLDAEQELANILSSEVLAEINREVVRRVYSVAKPGAANNVANAGIFDLDVDSN--GRWSVEKFKGLL 298 (468) Q Consensus 221 YTvELAQDLkAiHGLDAE~ELanILStEImlEINREII~~l~~vA~~~k~~~~~~~Gv~Dl~~~~~--~rw~~e~~~~l~ 298 (468) +|-||.||.- +.++.|.+-|+..+...+|+.||.- +-. +-...|++....... .-..+.. ... T Consensus 201 is~ell~d~~-----~l~~~v~~~la~a~~~~~d~~~l~G----~g~----~~~~~Gi~~~~~~~~~~~~~~~~~--~~~ 265 (395) T protein:vir:43 201 ASRQILDDAS-----ALQSYIDARARYGLMLVEECQLLYG----NGT----GANLHGIIPQAQAYAPPSGVVVTA--EQR 265 (395) T ss_pred hhHHHHHhHH-----HHHHHHHHHHHHHHHHHHHHHHHhc----cCC----CCcccccccccccccccccccccc--chh Confidence 9999999852 3678889999999999998888742 100 111223322111000 0000000 001 Q ss_pred HHHHHHHHHHHHHhccCCccEEEEchhHHHHHhhccccccccccccccccccccccCCCceeEEEecCCeEEEEcccccc Q lcl|NC_015288. 299 FQIERDCNAIAQDTRRGKGNFLICSADVASALAMAGVLDYSSGLTGAGGPAIGTVDDTGNLAVGTINGRIKVYVDPYAAN 378 (468) Q Consensus 299 ~~i~~ean~i~q~T~rg~~n~~v~S~~va~~L~~~G~~~~~~~~~~~~~~~~~~~D~t~~~~~G~l~~~~~vy~D~Ya~~ 378 (468) |.....+..-.. ..-+.+..+|+|+.....|.. +..+. ++ . .. .|.. ..-.++|. |++|+++.+.. T Consensus 266 ~~~i~~~~~~~~-~~~~~~~~~vmn~~~~~~l~~---lkd~~---G~-~--i~-~~~~-~~~~~~l~-G~pVv~~~~~~- 331 (395) T protein:vir:43 266 IDRIRLAILQAQ-LAEFPASGIVLNPIDWALIEL---NKDAE---NR-Y--II-GSPQ-NGTTPTLW-RLPVVETQAIT- 331 (395) T ss_pred HHHHHHHHHhhc-cccCCCcEEEEcHHHHHHHHH---hhccC---Cc-e--ec-cccc-cCCCceec-ceeeEEcCCCC- Confidence 222222222212 223345678999999877742 11111 11 0 00 1111 11235665 47999986643 Q ss_pred CCCcceEEEEEecCCcccceeEEccccccccccccCCc---ccc-ceee--eeeecceee-cCcccccCcccccCChhhh Q lcl|NC_015288. 379 LSDKHYYVVGYKGTSPYDAGLFYCPYVPLQMVRSIDPN---NFQ-PKIG--FKTRYGMVS-NPFVTTNGLYSGTPDGETL 451 (468) Q Consensus 379 ~s~~dY~~vG~Kg~~~~d~glfyaPYv~~~~~~~~Dp~---s~q-P~~g--~~tRY~l~~-nPf~~~~~~~~~~~~~~~~ 451 (468) .+=+++|--... |--+.-..+..-+++. .|+ -.++ +..|++..+ +|= T Consensus 332 ---~~~~~~gd~~~~-------~~~~~~~~~~i~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~---------------- 385 (395) T protein:vir:43 332 ---QDEFLTGAFSLG-------AQIFDRMDIEVLVSTENDKDFENNMVTIRAEERLAFAVYRPE---------------- 385 (395) T ss_pred ---CCcEEEEeccce-------EEEEEecceEEEEeccccchhhcCcEEEEEEEeeccEEeccc---------------- Confidence 222333321100 0000000111111111 122 2233 345666554 230 Q ss_pred hhccCceeeeEEeecc Q lcl|NC_015288. 452 TPSTNMYYRRVQVTNL 467 (468) Q Consensus 452 ~~~~N~y~r~~~v~~~ 467 (468) -|.++.|+-= T Consensus 386 ------a~~~~~~taa 395 (395) T protein:vir:43 386 ------AFVTGSLTAS 395 (395) T ss_pred ------ceEEEEeccC Confidence 1222222222 No 58 >protein:vir:93742 Length: 274 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1475 # MgeName: 55 # Cross-refs: genbank:acc:YP_240459;genbank:gi:66396126;genbank:GeneID:5133511 Probab=66.05 E-value=0.27 Score=23.67 Aligned_cols=257 Identities=13% Similarity=0.053 Sum_probs=110.6 Q ss_pred eeeeecCCCCCcccccccc-----------ccccccccccccccccccccccccCcccccccccccccccccccccccch Q lcl|NC_015288. 117 MRSRYENQAGEEALFNEPD-----------AGFTAGLDATTGAYTPRTGAGVGGDAEGNNPALLNDSSPGTYETPRGFSR 185 (468) Q Consensus 117 MRsrY~~qsG~EA~fnEa~-----------t~fSG~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~a~~g~~t~~~gm~T 185 (468) |=.....- . .-+..|.. .-|++... ... ...+ .+ ..+.+...--.+ T Consensus 1 ma~~~T~~-~-~~iiPev~~~~v~~~~~~~~~~~~~~~-~~~------------~l~g-~~-------G~tv~ip~~~~~ 57 (274) T protein:vir:93 1 MPQGITKT-S-NQIIPEVLAPMMQAQLEKKLRFASFAE-VDS------------TLQG-QP-------GDTLTFPAFVYS 57 (274) T ss_pred CCccceeh-h-heechHHHHHHHHHHHHhhhhhccccc-ccc------------cccC-CC-------CCEEEEEeeccC Confidence 22111000 0 00111100 00111000 000 0000 00 001111110011 Q ss_pred hhhhccCC-CCCccccceeEEEEEEEEeecccccceecHHHHHhHHHhhCCChhHHHHHHHHHHHHHHhhHHHHHHHhhh Q lcl|NC_015288. 186 EDLEQAGD-AGKLFREMSFSIEKTSVTAKSRALKAEYTLELAQDLKAIHGLDAEQELANILSSEVLAEINREVVRRVYSV 264 (468) Q Consensus 186 a~aE~lG~-~g~~f~EMaFsIeK~tVtAKSRaLKAEYTvELAQDLkAiHGLDAE~ELanILStEImlEINREII~~l~~v 264 (468) .+++.+.. ..-++.++ +....+++.|-|+-.-+++=|. .+. -+-|.-.+..+-++..+...++++++..+.+. T Consensus 58 g~~~~~~eg~~i~~~~i--t~~~~~~~i~~~~~~~~i~D~~--~~~--~~~d~~~~~~~~~~~~~a~~~d~~~~~~~~~a 131 (274) T protein:vir:93 58 GDAQVVAEGEKIPTDIL--ETKKREAKIRKIAKGTSITDEA--LLS--GYGDPQGEQVRQHGLAHANKVDNDVLEALMGA 131 (274) T ss_pred CCcccccCCCccccccc--ccceeEEEeeeecccccccHHH--HHh--hccchHHHHHHHHHHHHHHHHHHHHHHHHhcc Confidence 22222221 12234444 3445555556665322333322 222 35788999999999999999999999877543 Q ss_pred cchhhccccccceeeeeecCCcchhHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEchhHHHHHhhcccccccccccc Q lcl|NC_015288. 265 AKPGAANNVANAGIFDLDVDSNGRWSVEKFKGLLFQIERDCNAIAQDTRRGKGNFLICSADVASALAMAGVLDYSSGLTG 344 (468) Q Consensus 265 A~~~k~~~~~~~Gv~Dl~~~~~~rw~~e~~~~l~~~i~~ean~i~q~T~rg~~n~~v~S~~va~~L~~~G~~~~~~~~~~ 344 (468) .... +... ...+.+-..+.++.. .-..+++++|+|.+++.|..-....+.+. . T Consensus 132 ~~~~------~~~~----------~~~d~i~dA~~~l~d---------~~~~~~~ivv~p~~~~~L~k~~~~~f~~~--s 184 (274) T protein:vir:93 132 KLTV------NADI----------TKLNGLQSAIDKFND---------EDLEPMVLFINPLDAGKLRGDASTNFTRA--T 184 (274) T ss_pred cccc------cccc----------cCHHHHHHHHHHhhh---------ccCCccEEEeCHHHHHHHHhhhhhccccc--c Confidence 3211 0001 112333223233322 12467899999999999964332222211 1 Q ss_pred ccccccccccCCCceeEEEecCCeEEEEccccccCCCcceEEEEEecCCcccceeEEccccccccccccCCccccceeee Q lcl|NC_015288. 345 AGGPAIGTVDDTGNLAVGTINGRIKVYVDPYAANLSDKHYYVVGYKGTSPYDAGLFYCPYVPLQMVRSIDPNNFQPKIGF 424 (468) Q Consensus 345 ~~~~~~~~~D~t~~~~~G~l~~~~~vy~D~Ya~~~s~~dY~~vG~Kg~~~~d~glfyaPYv~~~~~~~~Dp~s~qP~~g~ 424 (468) ..+ .+...+-.+|++. |++|++| ++-|..-.++.-+|.-. .+..+ +.....-=|+++++=.+-. T Consensus 185 ~~g-----~~~~~~G~ig~~~-G~~Vi~s----~~~p~~t~~l~~~gai~----~~~~~--~~~vE~~Rd~~~~~d~i~~ 248 (274) T protein:vir:93 185 ELG-----DDIIVKGAFGEAL-GAIIVRT----NKLEAGTAILAKKGAVK----LILKR--DFFLEVARDASTKTTALYS 248 (274) T ss_pred ccc-----ccceeecccceec-CeeEEEc----CCCCcceEEEEeCCeEE----EEecC--CcccccccchhhcccEEEE Confidence 111 0111222467774 6899999 66664333322223211 11111 1111111288999999999 Q ss_pred eeecceee-cC--ccccc-Cc-cccc Q lcl|NC_015288. 425 KTRYGMVS-NP--FVTTN-GL-YSGT 445 (468) Q Consensus 425 ~tRY~l~~-nP--f~~~~-~~-~~~~ 445 (468) ..+||+.+ || ..... .. +-+| T Consensus 249 ~~~y~~~~~~~~~~v~~t~~~~s~~~ 274 (274) T protein:vir:93 249 DKHYVAYLYDESKAVKITKGSGSLEM 274 (274) T ss_pred EEEEEEEEEcCCceEEEeeCccccCC Confidence 99999864 44 11111 11 1111 No 59 >protein:vir:80684 Length: 315 # NCBI annotation: gp6 # Family: family:all:966 # MgeID: mge:1884 # MgeName: PA6 # Cross-refs: genbank:acc:YP_001285582;genbank:gi:148727088;genbank:GeneID:5247055 Probab=65.04 E-value=0.29 Score=23.53 Aligned_cols=294 Identities=13% Similarity=0.070 Sum_probs=101.6 Q ss_pred cccccccccccccccccceeh-hhhHHhhhhhhhhheeeeecCCccceeeeeeeeeecCCCCCccccccccccccccccc Q lcl|NC_015288. 66 GGSALGSANTAGLAGFDPVLI-SLVRRAMPNLMAYDVCGVQPMSGPTGLIFAMRSRYENQAGEEALFNEPDAGFTAGLDA 144 (468) Q Consensus 66 ~~~~~~st~tg~~~~~~P~Lv-~l~RRa~~~LIa~DI~GVQPmTGPTGLIFAMRsrY~~qsG~EA~fnEa~t~fSG~~~~ 144 (468) ++ ..++++|+.. .-+.+. .+++++.+..+...++-|-||.+.. +-|-.. . .+.++ .|- T Consensus 1 Ma--~~~~~~gg~~-vP~~~~~~ii~~l~~~s~i~~l~~~i~~~~~~-~~ip~~---~--~~~~a-------~wv----- 59 (315) T protein:vir:80 1 MA--DDFLSAGKLE-LPGSMIGAVRDRAIDSGVLAKLSPEQPTIFGP-VKGAVF---S--GVPRA-------KIV----- 59 (315) T ss_pred CC--CCcCCcCceE-cchHHHHHHHHHHHhhchhhhhcceeecCCCc-eEEEEE---e--CCcce-------EEe----- Confidence 11 1122222222 122221 2455555666777888888887542 222221 0 00111 000 Q ss_pred cccccccccccccccCcccccccccccccccccccccccchhhhhccCCCCCccccceeEEEEEEEEeecccccceecHH Q lcl|NC_015288. 145 TTGAYTPRTGAGVGGDAEGNNPALLNDSSPGTYETPRGFSREDLEQAGDAGKLFREMSFSIEKTSVTAKSRALKAEYTLE 224 (468) Q Consensus 145 ~~~~~~~~~~~~~~~~~~g~~~~~~~~a~~g~~t~~~gm~Ta~aE~lG~~g~~f~EMaFsIeK~tVtAKSRaLKAEYTvE 224 (468) ++ +..+++...++++++..+|.=+-....|-| T Consensus 60 -----------------------------------------~E-------g~~~~~s~~~f~~v~l~~~kl~~~~~iS~e 91 (315) T protein:vir:80 60 -----------------------------------------GE-------GEVKPSASVDVSAFTAQPIKVVTQQRVSDE 91 (315) T ss_pred -----------------------------------------eC-------CccccccccceeeeEeeeeeEEeeehhhHH Confidence 01 122333334444444444444444578999 Q ss_pred HHHhHHHhhCCChhHHHHHHHHHHHHHHhhHHHHHHHhhhcchhhccccccceeeeeec-CCc-chhHHHHHHHHHHHHH Q lcl|NC_015288. 225 LAQDLKAIHGLDAEQELANILSSEVLAEINREVVRRVYSVAKPGAANNVANAGIFDLDV-DSN-GRWSVEKFKGLLFQIE 302 (468) Q Consensus 225 LAQDLkAiHGLDAE~ELanILStEImlEINREII~~l~~vA~~~k~~~~~~~Gv~Dl~~-~~~-~rw~~e~~~~l~~~i~ 302 (468) |.+|-. .|+..+|.++|..++...|.|.+=+.++.-...+. +....|+...-. ... ..-....+..+ . T Consensus 92 ll~~s~----~~~~~~l~~~i~~~la~ai~~~~d~a~~~G~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~d~----~ 161 (315) T protein:vir:80 92 FMWADA----DYRLGVLQDLISPALGASIGRAVDLIAFHGIDPAT--GKAASAVHTSLNKTKNIVDATDSATADL----V 161 (315) T ss_pred HhhcCc----hhHHHHHHHHHHHHHHHHHHHHHhhheeeccCCCC--CccccccccccccccceeeccccchHHH----H Confidence 988844 46777788888777777777776555553211111 111112111100 000 00000111111 0 Q ss_pred HHHHHHHHHhccCCccEEEEchhHHHHHhhccccccccccccccccccccccCCCceeEEEecCCeEEEEccccccCC-- Q lcl|NC_015288. 303 RDCNAIAQDTRRGKGNFLICSADVASALAMAGVLDYSSGLTGAGGPAIGTVDDTGNLAVGTINGRIKVYVDPYAANLS-- 380 (468) Q Consensus 303 ~ean~i~q~T~rg~~n~~v~S~~va~~L~~~G~~~~~~~~~~~~~~~~~~~D~t~~~~~G~l~~~~~vy~D~Ya~~~s-- 380 (468) .-...+.. -.-...+-.+++++....|... ....+...++..-. .....+. .++|. +++|+++.++.... T Consensus 162 ~~~~~~~~-~~~~~~~~~imn~~~~~~L~~l---~~~~g~~~~g~~~~-~~~~~g~--~~tl~-G~PV~~~~~~~~~~~~ 233 (315) T protein:vir:80 162 KAVGLIAG-AGLQVPNGVALDPAFSFALSTE---VYPKGSPLAGQPMY-PAAGFAG--LDNWR-GLNVGASSTVSGAPEM 233 (315) T ss_pred HHHHHHhh-ccCccceEEEEcHHHHHHHHHH---hhccCCcccccccc-cccccCC--Cceec-ceeeEecCcCCccccc Confidence 00011111 1112234578999998888532 11111111110000 0001111 25665 47888775542110 Q ss_pred ---CcceEEEEEecCCcccceeEEcccccccc--ccccCCcc-----ccc-eeeee--eeccee-ecC--cccccCcccc Q lcl|NC_015288. 381 ---DKHYYVVGYKGTSPYDAGLFYCPYVPLQM--VRSIDPNN-----FQP-KIGFK--TRYGMV-SNP--FVTTNGLYSG 444 (468) Q Consensus 381 ---~~dY~~vG~Kg~~~~d~glfyaPYv~~~~--~~~~Dp~s-----~qP-~~g~~--tRY~l~-~nP--f~~~~~~~~~ 444 (468) +...++.| +-. .++|...-...+ .+..|++. ||. .++|. .|+|.. .+| |+.-....+. T Consensus 234 ~~~~~~~~~~G---Dfs---~~~~g~~~~~~i~i~~~~~~~~~~~~~~~~~~v~~r~~~r~~~~v~~~~a~~~l~~~~a~ 307 (315) T protein:vir:80 234 SPASGVKAIVG---DFS---RVHWGFQRNFPIELIEYGDPDQTGRDLKGHNEVMVRAEAVLYVAIESLDSFAVVKEKAAP 307 (315) T ss_pred ccccccEEEEe---ecc---cEEEEEecCeeEEEeccccccCcccchhhcCcEEEEEEEEecceeecccceEEEeeccCC Confidence 00111111 000 011111111110 00011110 111 02221 233222 122 1111000000 Q ss_pred cCChhhhhhccCceeeeEEeec Q lcl|NC_015288. 445 TPDGETLTPSTNMYYRRVQVTN 466 (468) Q Consensus 445 ~~~~~~~~~~~N~y~r~~~v~~ 466 (468) .+. -.+ - | T Consensus 308 ~~~------~~~---~-----~ 315 (315) T protein:vir:80 308 KPN------PPA---E-----N 315 (315) T ss_pred CCC------CCC---C-----C Confidence 000 000 0 0 No 60 >protein:vir:9759 Length: 303 # NCBI annotation: putative structural protein # Family: family:all:966 # MgeID: mge:175 # MgeName: 315.3 # Cross-refs: genbank:acc:NP_795521;genbank:gi:28876283;genbank:GeneID:1257824 Probab=63.81 E-value=0.31 Score=23.37 Aligned_cols=280 Identities=14% Similarity=0.121 Sum_probs=117.3 Q ss_pred cCccccccccccccccccccccccccee-hhhhHHhhhhhhhhheeeeecCCccceeeeeeeeeecCCCCCccccccccc Q lcl|NC_015288. 58 LGAGTVSPGGSALGSANTAGLAGFDPVL-ISLVRRAMPNLMAYDVCGVQPMSGPTGLIFAMRSRYENQAGEEALFNEPDA 136 (468) Q Consensus 58 ~g~~~~~~~~~~~~st~tg~~~~~~P~L-v~l~RRa~~~LIa~DI~GVQPmTGPTGLIFAMRsrY~~qsG~EA~fnEa~t 136 (468) +| .+ ++++. -..|.+ -.+++++.+..+..+++.+-||++.+.-|. ++.. +.++- T Consensus 1 m~-----------t~-t~gg~-liP~~~~~~ii~~l~~~s~i~~l~~~~~~~~~~~~ip----~~~~--~~~a~------ 55 (303) T protein:vir:97 1 MG-----------TE-TSKAS-LFDKHLVSDLINKVKGHSSLAKLSSQKPIPFNGSKEF----TFTL--DSDID------ 55 (303) T ss_pred Cc-----------cc-CCCCe-EcchhHHHHHHHHHHhhchhhhhcceeecCCCceEEE----EEec--CcceE------ Confidence 11 11 22222 122222 235566667778899999999986554442 1111 11110 Q ss_pred cccccccccccccccccccccccCcccccccccccccccccccccccchhhhhccCCCCCccccceeEEEEEEEEeeccc Q lcl|NC_015288. 137 GFTAGLDATTGAYTPRTGAGVGGDAEGNNPALLNDSSPGTYETPRGFSREDLEQAGDAGKLFREMSFSIEKTSVTAKSRA 216 (468) Q Consensus 137 ~fSG~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~a~~g~~t~~~gm~Ta~aE~lG~~g~~f~EMaFsIeK~tVtAKSRa 216 (468) | .+ | +..+++-..+++.++..+|.-+ T Consensus 56 -w----------------------------------------------v~--E-----~~~~~~s~~~f~~v~l~~~kl~ 81 (303) T protein:vir:97 56 -V----------------------------------------------VA--E-----NGKKTHGGLSLEPVTIVPIKVE 81 (303) T ss_pred -E----------------------------------------------ee--c-----CccccccccceeeEEeeeEEEE Confidence 0 00 0 1122333334455555555555 Q ss_pred ccceecHHHHHhHHHhhCCChhHHHHHHHHHHHHHHhhHHHHHHHhhhcchhhccccccceeeeee---cC--CcchhHH Q lcl|NC_015288. 217 LKAEYTLELAQDLKAIHGLDAEQELANILSSEVLAEINREVVRRVYSVAKPGAANNVANAGIFDLD---VD--SNGRWSV 291 (468) Q Consensus 217 LKAEYTvELAQDLkAiHGLDAE~ELanILStEImlEINREII~~l~~vA~~~k~~~~~~~Gv~Dl~---~~--~~~rw~~ 291 (468) -...+|-||.|.... ..++-+++|.+-|+..|...|+..+|.-.. +.-+. +....+...+. .. ..+.... T Consensus 82 ~~~~iS~ell~~~~d-~~~~l~~~i~~~la~a~~~~ld~a~l~G~~--~~~g~--~~~~~~~~~~~~~~~~~~~~~~~~~ 156 (303) T protein:vir:97 82 YGARLSDEFLYATEE-EKIDILKAFNEGFAKKLARGIDLMAMHGIN--PRTKK--ASDVIGTNHFDSKVTQVVKFTESED 156 (303) T ss_pred EeehhhHHHhhcCcc-chHHHHHHHHHHHHHHHHHHHHhhhhcccc--cCCcc--ccccccccccccccccccccccccc Confidence 566799999863322 235678889999999999888888886432 11111 11111111110 00 0000000 Q ss_pred HHHHHHHHHHHHHHHHHHHHhccCCccEEEEchhHHHHHhhccccccccccccccccccccccCCCceeEEEecCCeEEE Q lcl|NC_015288. 292 EKFKGLLFQIERDCNAIAQDTRRGKGNFLICSADVASALAMAGVLDYSSGLTGAGGPAIGTVDDTGNLAVGTINGRIKVY 371 (468) Q Consensus 292 e~~~~l~~~i~~ean~i~q~T~rg~~n~~v~S~~va~~L~~~G~~~~~~~~~~~~~~~~~~~D~t~~~~~G~l~~~~~vy 371 (468) .+.. |..-++.+ ...-+..+-+|++|+....|.. +..+.+ . .....+.....-.|+|.| ++|+ T Consensus 157 -~~~~----i~~~~~~~--~~~~~~~~~~vmn~~~~~~L~~---lkd~~g--~----~~~~~~~~~~~~~~~l~G-~Pv~ 219 (303) T protein:vir:97 157 -ADAN----IEAAVNLI--QGAEGVVTGLAMDTEFSTALAK---VTNGEM--G----PKMYPELAWGANPDSING-LKSS 219 (303) T ss_pred -hHHH----HHHHHHHH--hhcCCCccEEEEcHHHHHHHHH---hhccCC--C----eEEecCccCCCCCceecc-eeeE Confidence 0111 11111111 1223555568899998887752 111110 0 001111111112357765 8998 Q ss_pred Ecccccc----CCCcceEEEEEecCCcccceeEEccccc--cccccccCCcc-----ccc-eeee--eeeccee-ecC-- Q lcl|NC_015288. 372 VDPYAAN----LSDKHYYVVGYKGTSPYDAGLFYCPYVP--LQMVRSIDPNN-----FQP-KIGF--KTRYGMV-SNP-- 434 (468) Q Consensus 372 ~D~Ya~~----~s~~dY~~vG~Kg~~~~d~glfyaPYv~--~~~~~~~Dp~s-----~qP-~~g~--~tRY~l~-~nP-- 434 (468) ++.+... ..+.+.+++| +- ...+.+...-. +......|++. ||- -++| ..||+.. .|| T Consensus 220 ~s~~v~~~~~~~~~~~~~~~G---df--~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~n~~~~r~~~r~~~~v~~p~a 294 (303) T protein:vir:97 220 VNTTVGAGADEAESKDLVIIG---DF--ESMFKWGYAKQIPMEIIKYGDPDNSGKDLKGYNQIYLRAEAYIGWGILDAKS 294 (303) T ss_pred EecccCCccccCCCccEEEEe---ec--cccEEEEEecCcEEEEeeccCCCCcchhhhhcCcEEEEEEEEeccEeecccc Confidence 8744321 1122222222 10 11111222211 11222223322 221 1333 4566643 345 Q ss_pred cccccCccc Q lcl|NC_015288. 435 FVTTNGLYS 443 (468) Q Consensus 435 f~~~~~~~~ 443 (468) |+..++.+- T Consensus 295 f~~l~~~~~ 303 (303) T protein:vir:97 295 FARVTKGEV 303 (303) T ss_pred eEEeeCCCC Confidence 433332221 No 61 >protein:vir:100247 Length: 425 # NCBI annotation: gp76 # Family: family:all:21 # MgeID: mge:1619 # MgeName: Bcep176 # Cross-refs: genbank:acc:YP_355412;genbank:gi:77864702;genbank:GeneID:3725969 Probab=63.41 E-value=0.32 Score=23.31 Aligned_cols=309 Identities=14% Similarity=0.154 Sum_probs=116.3 Q ss_pred CcchHHHHHhhhh-----------------hh------cCCccccccchhhhhhhhhhhhhHHHHHhhhhhhhhhccccc Q lcl|NC_015288. 1 MFNAEHLQEKWSP-----------------VL------NNEAANPIADRYKKAVTSVLLENQERFLREERGMLQEVAVNS 57 (468) Q Consensus 1 ~~~~~~l~~kw~p-----------------~l------~~~~~~~i~~~~~~~~~~~llenq~~~~~e~~~~l~e~~~~~ 57 (468) .....+++++=.- -+ .++..+.....+|+.....|... +.+ T Consensus 64 ~~~~~e~~~~~~~~~~ei~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~af~~~l~~~------e~~---------- 127 (425) T protein:vir:10 64 GLPTSDALAKVDKVSADLEALQAAVDEANIKIAAAQMGANGVKPLRDPEYTEAFKAHVKRG------DVQ---------- 127 (425) T ss_pred hhccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccccccHHHHHHHHHHhhhh------hhH---------- Confidence 1111111111000 00 00000000001111111100000 000 Q ss_pred cCcccccccccccccc-cccccccccceeh-hhhHHhhhhhhhhheeeeecCCccceeeeeeeeeecCCCCCcccccccc Q lcl|NC_015288. 58 LGAGTVSPGGSALGSA-NTAGLAGFDPVLI-SLVRRAMPNLMAYDVCGVQPMSGPTGLIFAMRSRYENQAGEEALFNEPD 135 (468) Q Consensus 58 ~g~~~~~~~~~~~~st-~tg~~~~~~P~Lv-~l~RRa~~~LIa~DI~GVQPmTGPTGLIFAMRsrY~~qsG~EA~fnEa~ 135 (468) ..+..++ +.|+.. .-+.+. .++++..+..+..++|.+-||+++..-+.-.. ++..+ T Consensus 128 ---------~al~~~t~~~gG~l-vP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~~~~~~------~~~~a------ 185 (425) T protein:vir:10 128 ---------AALNKGEDSEGGYL-TPIEWDRTITNKLVLISPMRQLCRVQPVSKAGFSKLFNM------GGTTS------ 185 (425) T ss_pred ---------HHhhcCcCCCCcee-ccHhHHHHHHHHHHhhhhhhhhceeeeccCCceEEEEEc------CCcce------ Confidence 0011111 112211 111221 24444555667788999999987765433110 11100 Q ss_pred ccccccccccccccccccccccccCcccccccccccccccccccccccchhhhhccCCC-CCccccceeEEEEEEEEeec Q lcl|NC_015288. 136 AGFTAGLDATTGAYTPRTGAGVGGDAEGNNPALLNDSSPGTYETPRGFSREDLEQAGDA-GKLFREMSFSIEKTSVTAKS 214 (468) Q Consensus 136 t~fSG~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~a~~g~~t~~~gm~Ta~aE~lG~~-g~~f~EMaFsIeK~tVtAKS 214 (468) .|-+ +++..... ...|.++.|++-|..+ T Consensus 186 -~wv~----------------------------------------------E~~~~~~~~~~~f~~v~~~~~k~~~---- 214 (425) T protein:vir:10 186 -GWVG----------------------------------------------EASQRPQTNAATFQPLSFASGEIYA---- 214 (425) T ss_pred -eeec----------------------------------------------cccccccccccccceeeeeheeeEe---- Confidence 0000 00000111 1246666666666655 Q ss_pred ccccceecHHHHHhHHHhhCCChhHHHHHHHHHHHHHHhhHHHHHHHhhhcchhhccccccceeeeeec----------- Q lcl|NC_015288. 215 RALKAEYTLELAQDLKAIHGLDAEQELANILSSEVLAEINREVVRRVYSVAKPGAANNVANAGIFDLDV----------- 283 (468) Q Consensus 215 RaLKAEYTvELAQDLkAiHGLDAE~ELanILStEImlEINREII~~l~~vA~~~k~~~~~~~Gv~Dl~~----------- 283 (468) ...+|-||.+|-. .|.+++|.+-|+..|..-+|+-||.- .-.+ .+.|++.... T Consensus 215 ---~i~iS~ell~ds~----~~l~~~i~~~la~ai~~~~d~~~l~G--------~G~~-~p~Gil~~~~~~~~~~~~~~~ 278 (425) T protein:vir:10 215 ---NPAATQQILDDAE----IDLESWLATEVQTEFAKQEGKAFLAG--------DGTN-KPNGLLTYIAGGANAAKHPFG 278 (425) T ss_pred ---ehHhHHHHHhcch----hHHHHHHHHHHHHHHHHHHHhhhhcc--------cCCC-Ccceeeecccccccccccccc Confidence 4568999999853 56888999999999999999888752 0000 1222222100 Q ss_pred -------CCcchhHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEchhHHHHHhhccccccccccccccccccccccCC Q lcl|NC_015288. 284 -------DSNGRWSVEKFKGLLFQIERDCNAIAQDTRRGKGNFLICSADVASALAMAGVLDYSSGLTGAGGPAIGTVDDT 356 (468) Q Consensus 284 -------~~~~rw~~e~~~~l~~~i~~ean~i~q~T~rg~~n~~v~S~~va~~L~~~G~~~~~~~~~~~~~~~~~~~D~t 356 (468) ...+.-..+....|.+ ..... -+..+ .+|+++.....|.. +....+ + + ....+.+ T Consensus 279 ~~~~~~~~~~~~~~~d~l~~l~~-------~l~~~-~~~~a-~~vmn~~~~~~L~~---lkD~~G---~-~--l~~~~~~ 340 (425) T protein:vir:10 279 AIEVVNSGAAADITSDGIIDLVY-------DLPSA-FTGNA-RFAMNRNTQRQVRK---LKDGQG---N-Y--LWQPSYV 340 (425) T ss_pred ccccccccccccccHHHHHHHHh-------hhhhh-hccCC-EEEEchHHHHHHHH---hhcCCC---c-e--eeccCcc Confidence 0001111122222221 11111 12233 45789988888753 222211 0 0 0011111 Q ss_pred CceeEEEecCCeEEEEcccccc-CCCcceEEEE-EecC----CcccceeEEccccccccccccCCccccceeeeeeecce Q lcl|NC_015288. 357 GNLAVGTINGRIKVYVDPYAAN-LSDKHYYVVG-YKGT----SPYDAGLFYCPYVPLQMVRSIDPNNFQPKIGFKTRYGM 430 (468) Q Consensus 357 ~~~~~G~l~~~~~vy~D~Ya~~-~s~~dY~~vG-~Kg~----~~~d~glfyaPYv~~~~~~~~Dp~s~qP~~g~~tRY~l 430 (468) .. ..++|.| ++|+++.++.. -+..+.+++| ++.. ....-.+.-.||. ...+-.+-...||+. T Consensus 341 ~g-~~~~l~G-~PV~~~~~~p~~~~~~~~i~~Gd~~~~~~i~~~~~~~v~~d~~~----------~~~~~~~~~~~r~d~ 408 (425) T protein:vir:10 341 AG-QPATLAG-YPVTEVPDMPDVAANSTPILFGDFQQTYLIIDRIGVRVLRDPYT----------AKPYVLFYTTKRVGG 408 (425) T ss_pred CC-CCceecc-eeeEEecCcCCccCCccEEEEEehhccEEEEEecceEEEecccc----------cCCcEEEEEEEEecc Confidence 11 1256654 67877754321 1112344444 1210 0000111223342 123333334557765 Q ss_pred -eecC--cccccCcccc Q lcl|NC_015288. 431 -VSNP--FVTTNGLYSG 444 (468) Q Consensus 431 -~~nP--f~~~~~~~~~ 444 (468) +.+| |....-..++ T Consensus 409 ~v~~~~A~~~l~~~as~ 425 (425) T protein:vir:10 409 GLLNPEPMRAMKVAASE 425 (425) T ss_pred EeecccceEEEEeeccC Confidence 3456 3322222222 No 62 >protein:vir:103955 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:1662 # MgeName: phiNM # Cross-refs: genbank:acc:YP_873992;genbank:gi:118430767;genbank:GeneID:4525449 Probab=63.03 E-value=0.32 Score=23.26 Aligned_cols=298 Identities=11% Similarity=0.082 Sum_probs=123.3 Q ss_pred hhhHHHHHhhhhhhhhh-ccccccCcccccccccccccccccccccccceehhhhHHhhhhhhhhheeeeecCCccceee Q lcl|NC_015288. 36 LENQERFLREERGMLQE-VAVNSLGAGTVSPGGSALGSANTAGLAGFDPVLISLVRRAMPNLMAYDVCGVQPMSGPTGLI 114 (468) Q Consensus 36 lenq~~~~~e~~~~l~e-~~~~~~g~~~~~~~~~~~~st~tg~~~~~~P~Lv~l~RRa~~~LIa~DI~GVQPmTGPTGLI 114 (468) +|.-++--.+-+++... ...+.+++.. ..++++++..--....-.+++.+....+-.+++-+-||++.+.-| T Consensus 1 ~~~~~~~~~~~~~f~~~~~~~~~~~a~~-------~~~~~~~~~liP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~~~ 73 (324) T protein:vir:10 1 MEQTQKLKLNLQHFASNNVKPQVFNPDN-------VMMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPMEGTEKKF 73 (324) T ss_pred CCCchHHHHHHHHHHHHhhccceecccc-------eeccCCCcceechhHHHHHHHHHHhhchhhhhcceeeccCCceEE Confidence 22111111111211111 0011111111 111111111000111122344444566778889999988765332 Q ss_pred eeeeeeecCCCCCccccccccccccccccccccccccccccccccCcccccccccccccccccccccccchhhhhccCCC Q lcl|NC_015288. 115 FAMRSRYENQAGEEALFNEPDAGFTAGLDATTGAYTPRTGAGVGGDAEGNNPALLNDSSPGTYETPRGFSREDLEQAGDA 194 (468) Q Consensus 115 FAMRsrY~~qsG~EA~fnEa~t~fSG~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~a~~g~~t~~~gm~Ta~aE~lG~~ 194 (468) . . ... +.++ .|- +| T Consensus 74 p-~---~~~--~~~a-------~~v------------------------------------------------~E----- 87 (324) T protein:vir:10 74 T-F---WAD--KPGA-------YWV------------------------------------------------GE----- 87 (324) T ss_pred E-E---EeC--Ccce-------eEe------------------------------------------------cc----- Confidence 2 1 110 0000 000 01 Q ss_pred CCccccceeEEEEEEEEeecccccceecHHHHHhHHHhhCCChhHHHHHHHHHHHHHHhhHHHHHHHhhhcchhhccccc Q lcl|NC_015288. 195 GKLFREMSFSIEKTSVTAKSRALKAEYTLELAQDLKAIHGLDAEQELANILSSEVLAEINREVVRRVYSVAKPGAANNVA 274 (468) Q Consensus 195 g~~f~EMaFsIeK~tVtAKSRaLKAEYTvELAQDLkAiHGLDAE~ELanILStEImlEINREII~~l~~vA~~~k~~~~~ 274 (468) +..+++...+++++++..|.-+-.-..|-||.+|-. .|.+++|.+.|+..|...+++.+|.---+ +.. T Consensus 88 g~~~~~~~~~~~~v~~~~~k~~~~~~iS~ell~ds~----~~l~~~i~~~l~~ai~~~~d~a~l~G~g~--------~~~ 155 (324) T protein:vir:10 88 GQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTY----SQFFEEMKPMIAEAFYKKFDEAGILNQGN--------NPF 155 (324) T ss_pred CccccccccceeEEEEeeEEEEEeehhhHHHHhcch----HHHHHHHHHHHHHHHHHHHHHHhhhcCCC--------Ccc Confidence 233455556677777777777777889999999864 46899999999999999999998853211 111 Q ss_pred cceeeeeecCCc----chhHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEchhHHHHHhhcccccccccccccccccc Q lcl|NC_015288. 275 NAGIFDLDVDSN----GRWSVEKFKGLLFQIERDCNAIAQDTRRGKGNFLICSADVASALAMAGVLDYSSGLTGAGGPAI 350 (468) Q Consensus 275 ~~Gv~Dl~~~~~----~rw~~e~~~~l~~~i~~ean~i~q~T~rg~~n~~v~S~~va~~L~~~G~~~~~~~~~~~~~~~~ 350 (468) +.|++....... +.-..+....++ ..+ ...-+..+.+|+|+.....|... .... ++. . T Consensus 156 ~~~i~~~~~~~~~~~~~~~t~~~i~~~~-------~~l--~~~~~~~~~~v~n~~~~~~L~~l---~d~~---g~~---~ 217 (324) T protein:vir:10 156 GKSIAQSIEKTNKVIKGDFTQDNIIDLE-------ALL--EDDELEANAFISKTQNRSLLRKI---VDPE---TKE---R 217 (324) T ss_pred CccccccccccceeccccCCHHHHHHHH-------Hhh--hhccCCCCEEEEcHHHHHHHHHh---hccC---Cce---e Confidence 122222111110 100112222221 111 12223455688999999888642 1111 110 0 Q ss_pred ccccCCCceeEEEecCCeEEEEccccccCCCcceEEEEEecCCcccceeEEcccccccc--------ccccCCc------ Q lcl|NC_015288. 351 GTVDDTGNLAVGTINGRIKVYVDPYAANLSDKHYYVVGYKGTSPYDAGLFYCPYVPLQM--------VRSIDPN------ 416 (468) Q Consensus 351 ~~~D~t~~~~~G~l~~~~~vy~D~Ya~~~s~~dY~~vG~Kg~~~~d~glfyaPYv~~~~--------~~~~Dp~------ 416 (468) . .+.++ ++|. +++|++.+.+. .+..-+++|-.. .+++...-...+ ....|+. T Consensus 218 ~-~~~~~----~~l~-G~PV~~~~~~~--~~~~~~~~gd~~------~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~ 283 (324) T protein:vir:10 218 I-YDRNS----DTLD-GLPVVNLKSSN--LKRGELITGDFD------KLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNL 283 (324) T ss_pred e-cCCCC----cccc-ceeEEeecCCC--CCcceEEEEecc------cEEEEEecCcEEEEeecccccccccccccchhh Confidence 0 01112 3443 46777765432 122233333211 011111111111 1111121 Q ss_pred --cccceeeeeeecce-eecC--cccccCcc--cccCChhhh Q lcl|NC_015288. 417 --NFQPKIGFKTRYGM-VSNP--FVTTNGLY--SGTPDGETL 451 (468) Q Consensus 417 --s~qP~~g~~tRY~l-~~nP--f~~~~~~~--~~~~~~~~~ 451 (468) +-+=.+=...|||. +.|| |+....-. ...+.++ + T Consensus 284 ~~~~~~~~r~~~r~d~~v~~~~A~~~l~~a~~~~~~~~~~-~ 324 (324) T protein:vir:10 284 FEQDMVALRATMHVALHIADDKAFAKLVPADKKTDSVPGE-V 324 (324) T ss_pred hhcCcEEEEEEEEEccEEecccceEEEEeccCCCCCCCCC-C Confidence 11233334467775 4455 44332111 1111111 1 No 63 >protein:vir:94494 Length: 274 # NCBI annotation: ORF015 # Family: family:all:522 # MgeID: mge:1508 # MgeName: 88 # Cross-refs: genbank:acc:YP_240676;genbank:gi:66396348;genbank:GeneID:5133758 Probab=62.91 E-value=0.33 Score=23.25 Aligned_cols=257 Identities=12% Similarity=0.040 Sum_probs=105.3 Q ss_pred eeeeecCCCCCccccccccc-----------cccccccccccccccccccccccCcccccccccccccccccccccccch Q lcl|NC_015288. 117 MRSRYENQAGEEALFNEPDA-----------GFTAGLDATTGAYTPRTGAGVGGDAEGNNPALLNDSSPGTYETPRGFSR 185 (468) Q Consensus 117 MRsrY~~qsG~EA~fnEa~t-----------~fSG~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~a~~g~~t~~~gm~T 185 (468) |=..... -. .-+..|... -|++-... ... ..+ .+ ..+.+...--.. T Consensus 1 ma~~~T~-~~-d~iiPev~~~~v~~~~~~~l~~~~~~~~-d~~------------l~g-~~-------G~tv~iP~~~~~ 57 (274) T protein:vir:94 1 MPQGLTK-TS-DQIIPEVLAPMMQAQLEKKLRFASFAEV-DST------------LQG-QP-------GDTLTFPAFVYS 57 (274) T ss_pred CCcccee-hh-heechHHHHHHHHHhhhhhhhhccccee-ccc------------ccC-CC-------CCEEEEeeecCC Confidence 2211000 00 001111100 01110000 000 000 00 011111110011 Q ss_pred hhhhccCC-CCCccccceeEEEEEEEEeecccccceecHHHHHhHHHhhCCChhHHHHHHHHHHHHHHhhHHHHHHHhhh Q lcl|NC_015288. 186 EDLEQAGD-AGKLFREMSFSIEKTSVTAKSRALKAEYTLELAQDLKAIHGLDAEQELANILSSEVLAEINREVVRRVYSV 264 (468) Q Consensus 186 a~aE~lG~-~g~~f~EMaFsIeK~tVtAKSRaLKAEYTvELAQDLkAiHGLDAE~ELanILStEImlEINREII~~l~~v 264 (468) .++|.+.. ..-+..++. ..+.+++.+-|+-.=+++=| ..+.+ +-|.-.+..+-++..|+.+++.+++..|.+. T Consensus 58 g~a~~~~~g~~i~~~~lt--~~~~~~~i~~~~~~~~i~D~--~~~~~--~~dp~~~~~~~~a~a~a~~vd~~~~~~l~~a 131 (274) T protein:vir:94 58 GDAQVVAEGEKIPTDILE--TKKREAKIRKIAKGTSITDE--ALLSG--YGDPQGEQVRQHGLAHANKVDNDVLEALMGA 131 (274) T ss_pred CccccccCCCcccccccc--cceeEEEeeeecceecccHH--HHHhc--cchHHHHHHHHHHHHHHHHHHHHHHHHHhcc Confidence 22232221 122334443 33444444555522222222 22223 4678889999999999999999999887654 Q ss_pred cchhhccccccceeeeeecCCcchhHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEchhHHHHHhhcccccccccccc Q lcl|NC_015288. 265 AKPGAANNVANAGIFDLDVDSNGRWSVEKFKGLLFQIERDCNAIAQDTRRGKGNFLICSADVASALAMAGVLDYSSGLTG 344 (468) Q Consensus 265 A~~~k~~~~~~~Gv~Dl~~~~~~rw~~e~~~~l~~~i~~ean~i~q~T~rg~~n~~v~S~~va~~L~~~G~~~~~~~~~~ 344 (468) +.... ...+ ..+.+-..+.++..+ -..+++++|+|.+++.|.-.....+... + T Consensus 132 ~~~~~------~~~~----------~~d~i~dA~~~l~d~---------~~~~~~ivv~p~~~~~L~k~~~~~f~~~--s 184 (274) T protein:vir:94 132 KLTVN------ADIT----------KLNGLQSAIDKFNDE---------DLEPMVLFVNPLDAGKLRGDASTNFTRA--T 184 (274) T ss_pred Ccccc------cccc----------CHHHHHHHHHHhhcc---------CCCceEEEeCHHHHHHHHhhhhhhcccc--C Confidence 43211 1111 123332233333221 2367899999999999954322222211 0 Q ss_pred ccccccccccCCCceeEEEecCCeEEEEccccccCCCcceEEEEEecCCcccceeEEccccccccccccCCccccceeee Q lcl|NC_015288. 345 AGGPAIGTVDDTGNLAVGTINGRIKVYVDPYAANLSDKHYYVVGYKGTSPYDAGLFYCPYVPLQMVRSIDPNNFQPKIGF 424 (468) Q Consensus 345 ~~~~~~~~~D~t~~~~~G~l~~~~~vy~D~Ya~~~s~~dY~~vG~Kg~~~~d~glfyaPYv~~~~~~~~Dp~s~qP~~g~ 424 (468) ..+. .-..+-.+|++. |++||+| ++-|. |-.+-++ -+.+-|.--.+...-.-=||..+.-.+-. T Consensus 185 ~~g~-----~~~~~G~ig~~~-G~~Vi~s----~~~p~-~t~~l~~-----~gA~~~~~~~~~~vE~~Rd~~~~~d~i~~ 248 (274) T protein:vir:94 185 ELGD-----DIIVKGAFGEAL-GAIIVRT----NKLEA-GTAILAK-----KGAVKLILKRDFFLEVARDASTKTTALYS 248 (274) T ss_pred cccc-----cceeccccceec-CeeEEEc----CCCCc-ceEEEEe-----CcceEeeecCCceeccccchhhcccEEEE Confidence 1110 011122367774 6899999 55663 3222222 01121210011111111178888888888 Q ss_pred eeecceee-cC--cccc-cCcc-ccc Q lcl|NC_015288. 425 KTRYGMVS-NP--FVTT-NGLY-SGT 445 (468) Q Consensus 425 ~tRY~l~~-nP--f~~~-~~~~-~~~ 445 (468) .-+||+.+ || .... .... -+| T Consensus 249 ~~~y~~~~~~~~~vv~~t~~~~~~~~ 274 (274) T protein:vir:94 249 DKHYVAYLYDESKAVKITKGSGSLEM 274 (274) T ss_pred EEEEEEEEEcCCceEEEecCcccccC Confidence 88888754 44 1111 1111 111 No 64 >protein:vir:97433 Length: 274 # NCBI annotation: ORF014 # Family: family:all:522 # MgeID: mge:1676 # MgeName: 92 # Cross-refs: genbank:acc:YP_240749;genbank:gi:66396420;genbank:GeneID:5133789 Probab=62.91 E-value=0.33 Score=23.25 Aligned_cols=257 Identities=12% Similarity=0.040 Sum_probs=105.3 Q ss_pred eeeeecCCCCCccccccccc-----------cccccccccccccccccccccccCcccccccccccccccccccccccch Q lcl|NC_015288. 117 MRSRYENQAGEEALFNEPDA-----------GFTAGLDATTGAYTPRTGAGVGGDAEGNNPALLNDSSPGTYETPRGFSR 185 (468) Q Consensus 117 MRsrY~~qsG~EA~fnEa~t-----------~fSG~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~a~~g~~t~~~gm~T 185 (468) |=..... -. .-+..|... -|++-... ... ..+ .+ ..+.+...--.. T Consensus 1 ma~~~T~-~~-d~iiPev~~~~v~~~~~~~l~~~~~~~~-d~~------------l~g-~~-------G~tv~iP~~~~~ 57 (274) T protein:vir:97 1 MPQGLTK-TS-DQIIPEVLAPMMQAQLEKKLRFASFAEV-DST------------LQG-QP-------GDTLTFPAFVYS 57 (274) T ss_pred CCcccee-hh-heechHHHHHHHHHhhhhhhhhccccee-ccc------------ccC-CC-------CCEEEEeeecCC Confidence 2211000 00 001111100 01110000 000 000 00 011111110011 Q ss_pred hhhhccCC-CCCccccceeEEEEEEEEeecccccceecHHHHHhHHHhhCCChhHHHHHHHHHHHHHHhhHHHHHHHhhh Q lcl|NC_015288. 186 EDLEQAGD-AGKLFREMSFSIEKTSVTAKSRALKAEYTLELAQDLKAIHGLDAEQELANILSSEVLAEINREVVRRVYSV 264 (468) Q Consensus 186 a~aE~lG~-~g~~f~EMaFsIeK~tVtAKSRaLKAEYTvELAQDLkAiHGLDAE~ELanILStEImlEINREII~~l~~v 264 (468) .++|.+.. ..-+..++. ..+.+++.+-|+-.=+++=| ..+.+ +-|.-.+..+-++..|+.+++.+++..|.+. T Consensus 58 g~a~~~~~g~~i~~~~lt--~~~~~~~i~~~~~~~~i~D~--~~~~~--~~dp~~~~~~~~a~a~a~~vd~~~~~~l~~a 131 (274) T protein:vir:97 58 GDAQVVAEGEKIPTDILE--TKKREAKIRKIAKGTSITDE--ALLSG--YGDPQGEQVRQHGLAHANKVDNDVLEALMGA 131 (274) T ss_pred CccccccCCCcccccccc--cceeEEEeeeecceecccHH--HHHhc--cchHHHHHHHHHHHHHHHHHHHHHHHHHhcc Confidence 22232221 122334443 33444444555522222222 22223 4678889999999999999999999887654 Q ss_pred cchhhccccccceeeeeecCCcchhHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEchhHHHHHhhcccccccccccc Q lcl|NC_015288. 265 AKPGAANNVANAGIFDLDVDSNGRWSVEKFKGLLFQIERDCNAIAQDTRRGKGNFLICSADVASALAMAGVLDYSSGLTG 344 (468) Q Consensus 265 A~~~k~~~~~~~Gv~Dl~~~~~~rw~~e~~~~l~~~i~~ean~i~q~T~rg~~n~~v~S~~va~~L~~~G~~~~~~~~~~ 344 (468) +.... ...+ ..+.+-..+.++..+ -..+++++|+|.+++.|.-.....+... + T Consensus 132 ~~~~~------~~~~----------~~d~i~dA~~~l~d~---------~~~~~~ivv~p~~~~~L~k~~~~~f~~~--s 184 (274) T protein:vir:97 132 KLTVN------ADIT----------KLNGLQSAIDKFNDE---------DLEPMVLFVNPLDAGKLRGDASTNFTRA--T 184 (274) T ss_pred Ccccc------cccc----------CHHHHHHHHHHhhcc---------CCCceEEEeCHHHHHHHHhhhhhhcccc--C Confidence 43211 1111 123332233333221 2367899999999999954322222211 0 Q ss_pred ccccccccccCCCceeEEEecCCeEEEEccccccCCCcceEEEEEecCCcccceeEEccccccccccccCCccccceeee Q lcl|NC_015288. 345 AGGPAIGTVDDTGNLAVGTINGRIKVYVDPYAANLSDKHYYVVGYKGTSPYDAGLFYCPYVPLQMVRSIDPNNFQPKIGF 424 (468) Q Consensus 345 ~~~~~~~~~D~t~~~~~G~l~~~~~vy~D~Ya~~~s~~dY~~vG~Kg~~~~d~glfyaPYv~~~~~~~~Dp~s~qP~~g~ 424 (468) ..+. .-..+-.+|++. |++||+| ++-|. |-.+-++ -+.+-|.--.+...-.-=||..+.-.+-. T Consensus 185 ~~g~-----~~~~~G~ig~~~-G~~Vi~s----~~~p~-~t~~l~~-----~gA~~~~~~~~~~vE~~Rd~~~~~d~i~~ 248 (274) T protein:vir:97 185 ELGD-----DIIVKGAFGEAL-GAIIVRT----NKLEA-GTAILAK-----KGAVKLILKRDFFLEVARDASTKTTALYS 248 (274) T ss_pred cccc-----cceeccccceec-CeeEEEc----CCCCc-ceEEEEe-----CcceEeeecCCceeccccchhhcccEEEE Confidence 1110 011122367774 6899999 55663 3222222 01121210011111111178888888888 Q ss_pred eeecceee-cC--cccc-cCcc-ccc Q lcl|NC_015288. 425 KTRYGMVS-NP--FVTT-NGLY-SGT 445 (468) Q Consensus 425 ~tRY~l~~-nP--f~~~-~~~~-~~~ 445 (468) .-+||+.+ || .... .... -+| T Consensus 249 ~~~y~~~~~~~~~vv~~t~~~~~~~~ 274 (274) T protein:vir:97 249 DKHYVAYLYDESKAVKITKGSGSLEM 274 (274) T ss_pred EEEEEEEEEcCCceEEEecCcccccC Confidence 88888754 44 1111 1111 111 No 65 >protein:vir:1025 Length: 408 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:20 # MgeName: bIL286 # Cross-refs: genbank:acc:NP_076679;genbank:gi:13095788;genbank:GeneID:920362 Probab=57.75 E-value=0.43 Score=22.61 Aligned_cols=333 Identities=11% Similarity=0.083 Sum_probs=124.4 Q ss_pred CcchHHHHHhhhhhhcC--------------Ccc--ccccchhhhhhh---hh--hhhhHHHHHhhhhhhhhhccccccC Q lcl|NC_015288. 1 MFNAEHLQEKWSPVLNN--------------EAA--NPIADRYKKAVT---SV--LLENQERFLREERGMLQEVAVNSLG 59 (468) Q Consensus 1 ~~~~~~l~~kw~p~l~~--------------~~~--~~i~~~~~~~~~---~~--llenq~~~~~e~~~~l~e~~~~~~g 59 (468) ||+.++|.++|.-+.+. +.. -++. ..+..+. ++ -++.|.++..+....-......... T Consensus 4 ~m~l~el~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ee~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 82 (408) T protein:vir:10 4 KLTVNQLNEAWIASGDKVTDFNDQINMALNDDNFSAEAMS-ELKNKRDNEKVRRDALREQLVEAQAEQVVNMREEEKGPL 82 (408) T ss_pred cccHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccc Confidence 88999999999665432 000 0010 0111110 00 0111111111110000000000000 Q ss_pred c-------------------ccccccc-----cccccc-cccccc---cccceehhhhHHhhhhhhhhheeeeecCCccc Q lcl|NC_015288. 60 A-------------------GTVSPGG-----SALGSA-NTAGLA---GFDPVLISLVRRAMPNLMAYDVCGVQPMSGPT 111 (468) Q Consensus 60 ~-------------------~~~~~~~-----~~~~st-~tg~~~---~~~P~Lv~l~RRa~~~LIa~DI~GVQPmTGPT 111 (468) . +...... .+..++ ..|+.. .+.+. +++.........+++.+.||+++. T Consensus 83 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~t~~~gg~~vP~~~~~~---Ii~~~~~~~~l~~~~~~~~~~~~~ 159 (408) T protein:vir:10 83 NKSENELKDKFVKDFVNMVRNPMAFMNTVSSKTETSGSDSAAGLTIPQDIRTM---INTLVRQYDSLQQYVRVESVSTSN 159 (408) T ss_pred ccchhhhHHHHHHHHHHHhhcchhhhhhhhhhhhhcccccCCceeccHhHHHH---HHHHHHhhchhhhhcceeeccCCc Confidence 0 0000000 000111 111111 11222 444455566678899999999988 Q ss_pred eeeeeeeeeecCCCCCccccccccccccccccccccccccccccccccCcccccccccccccccccccccccchhhhhcc Q lcl|NC_015288. 112 GLIFAMRSRYENQAGEEALFNEPDAGFTAGLDATTGAYTPRTGAGVGGDAEGNNPALLNDSSPGTYETPRGFSREDLEQA 191 (468) Q Consensus 112 GLIFAMRsrY~~qsG~EA~fnEa~t~fSG~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~a~~g~~t~~~gm~Ta~aE~l 191 (468) |-+--.| ..+.++. ..|- ++.+.. T Consensus 160 ~~~~~~~--~~~~~~~--------a~~v----------------------------------------------~E~~~~ 183 (408) T protein:vir:10 160 GSRVYEK--WTDVTPL--------TVMD----------------------------------------------AEDGKI 183 (408) T ss_pred ceEEEee--ccccccc--------eeee----------------------------------------------cCcccc Confidence 8765443 1110000 0000 000000 Q ss_pred CC-CCCccccceeEEEEEEEEeecccccceecHHHHHhHHHhhCCChhHHHHHHHHHHHHHHhhHHHHHHHhhhcchhhc Q lcl|NC_015288. 192 GD-AGKLFREMSFSIEKTSVTAKSRALKAEYTLELAQDLKAIHGLDAEQELANILSSEVLAEINREVVRRVYSVAKPGAA 270 (468) Q Consensus 192 G~-~g~~f~EMaFsIeK~tVtAKSRaLKAEYTvELAQDLkAiHGLDAE~ELanILStEImlEINREII~~l~~vA~~~k~ 270 (468) .+ +...|.++.|+..|..+- ..+|-||.+|- .+|.+++|.+-|+..|..-+|+.||.-.-+.. T Consensus 184 ~~~~~~~~~~i~~~~~k~~~~-------~~iS~ell~ds----~~~l~~~i~~~l~~~~~~~~~~~il~g~g~~~----- 247 (408) T protein:vir:10 184 PDLDNPQLTIIKYLIKRYAGI-------ITATNTSLKDT----AENILAWLSSWIAKKVVVTRNQAIIEVMKAAP----- 247 (408) T ss_pred ccccCcceeeEEeeeeeEEee-------ehhHHHHHhhc----hHHHHHHHHHHHHHHHHHHHHHHHhhcccccc----- Confidence 00 112456666666555544 55999999994 35778899999999999999888876332211 Q ss_pred cccccceeeeeecCCcchhHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEchhHHHHHhhcccccccccccccccccc Q lcl|NC_015288. 271 NNVANAGIFDLDVDSNGRWSVEKFKGLLFQIERDCNAIAQDTRRGKGNFLICSADVASALAMAGVLDYSSGLTGAGGPAI 350 (468) Q Consensus 271 ~~~~~~Gv~Dl~~~~~~rw~~e~~~~l~~~i~~ean~i~q~T~rg~~n~~v~S~~va~~L~~~G~~~~~~~~~~~~~~~~ 350 (468) ...|+.++ +....++ +.....--+..+ -+|||+.....|.. +..+.+ + .+ T Consensus 248 ---~~~~~~~~----------~~l~~~~-------~~~~~~~~~~~a-~~v~n~~~~~~l~~---lkd~~G---~---~i 297 (408) T protein:vir:10 248 ---KKPTIAKF----------DDVITMI-------NTAVDPAIIATS-SLLTNQSGLNKLAL---VKTAEG---K---YL 297 (408) T ss_pred ---cccccccH----------HHHHHHH-------HHhhhhhhccCC-EEEEcHHHHHHHHH---hhccCC---c---eE Confidence 11122111 1111111 111111112222 46799999888854 222211 0 01 Q ss_pred ccccCCCceeEEEecCCeEEEE--ccccccCCCc----------ceEEEEEecCCcccceeEEccccccccccccCCccc Q lcl|NC_015288. 351 GTVDDTGNLAVGTINGRIKVYV--DPYAANLSDK----------HYYVVGYKGTSPYDAGLFYCPYVPLQMVRSIDPNNF 418 (468) Q Consensus 351 ~~~D~t~~~~~G~l~~~~~vy~--D~Ya~~~s~~----------dY~~vG~Kg~~~~d~glfyaPYv~~~~~~~~Dp~s~ 418 (468) ...+-+.. ..++|. |++|++ |....+.... +|++++.++.... =+.++.- .+-.+. T Consensus 298 ~~~~~~~~-~~~~l~-G~PV~~~~~~~~~~~~~~~~~i~~gd~~~~~~~~~~~~~~v----~~~~~~~------~~f~~~ 365 (408) T protein:vir:10 298 LEPDPTKP-NSYLIK-GKQVIVVADRWLPNTGSTVYPLYYGDMSQAITLFDRENMSL----LPTNIGA------GAFETD 365 (408) T ss_pred eccCcCCC-CCceec-ceeeEEecccccCccCCCceEEEEEehhccEEEEEecceEE----EEccccc------chhhcC Confidence 11111111 123553 455554 2111111110 1122222211111 1111100 001123 Q ss_pred cceeeeeeeccee-ecC--ccccc--C-----cccccCChhhh Q lcl|NC_015288. 419 QPKIGFKTRYGMV-SNP--FVTTN--G-----LYSGTPDGETL 451 (468) Q Consensus 419 qP~~g~~tRY~l~-~nP--f~~~~--~-----~~~~~~~~~~~ 451 (468) +-.+-+..||+.. .+| |...+ . .....+....+ T Consensus 366 ~~~~r~~~r~d~~v~~~~a~~~~~~~~~~~~~~~~~~~~~~~~ 408 (408) T protein:vir:10 366 TTKIRVIDRFDVKATDSEALVAGSFSAIADQVGNFKTTTSTAV 408 (408) T ss_pred ceEEEEEEeeccEEeccccEEEEEeeccccCCCCCCCCCcccC Confidence 3344455666554 233 22110 0 00111111111 No 66 >protein:vir:97148 Length: 324 # NCBI annotation: ORF010 # Family: family:all:507 # MgeID: mge:1654 # MgeName: 85 # Cross-refs: genbank:acc:YP_239726;genbank:gi:66394880;genbank:GeneID:5130881 Probab=57.65 E-value=0.43 Score=22.59 Aligned_cols=301 Identities=12% Similarity=0.065 Sum_probs=115.5 Q ss_pred hhhhHHHHHhhhhhhhhhccc--cccCcccccccccccccccccccccccceehhhhHHhhhhhhhhheeeeecCCccce Q lcl|NC_015288. 35 LLENQERFLREERGMLQEVAV--NSLGAGTVSPGGSALGSANTAGLAGFDPVLISLVRRAMPNLMAYDVCGVQPMSGPTG 112 (468) Q Consensus 35 llenq~~~~~e~~~~l~e~~~--~~~g~~~~~~~~~~~~st~tg~~~~~~P~Lv~l~RRa~~~LIa~DI~GVQPmTGPTG 112 (468) .-++| .++.+.++...... +.+.+ ....++++++..--....-.+++.+....+..+++-+.||++.+- T Consensus 1 ~~~~~--~~~~~~~~f~~~~~~~~~~~a-------~~~~~~~~~~~~iP~~~~~~ii~~~~~~s~l~~~~~~~~~~~~~~ 71 (324) T protein:vir:97 1 MEQTQ--KLKLNLQHFASNNVKPQVFNP-------DNVMMHEKKDGTLMNEFTTPILQEVMENSKIMQLGKYEPMEGTEK 71 (324) T ss_pred Cccch--hHHHHHHHHHHhhhhhhhhcc-------ccccccCCCcceechhHHHHHHHHHHhhcchhhhcceeeccCCce Confidence 11111 11111111111000 11111 111112222211111112224555666778888999999987653 Q ss_pred eeeeeeeeecCCCCCccccccccccccccccccccccccccccccccCcccccccccccccccccccccccchhhhhccC Q lcl|NC_015288. 113 LIFAMRSRYENQAGEEALFNEPDAGFTAGLDATTGAYTPRTGAGVGGDAEGNNPALLNDSSPGTYETPRGFSREDLEQAG 192 (468) Q Consensus 113 LIFAMRsrY~~qsG~EA~fnEa~t~fSG~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~a~~g~~t~~~gm~Ta~aE~lG 192 (468) -|- | +.. +.++ .|- + | T Consensus 72 ~ip--~--~~~--~~~a-------~~v----------------------------------------------~--E--- 87 (324) T protein:vir:97 72 KFT--F--WAD--KPGA-------YWV----------------------------------------------G--E--- 87 (324) T ss_pred EEE--E--Eec--Ccce-------eEe----------------------------------------------c--c--- Confidence 321 1 110 0000 000 0 0 Q ss_pred CCCCccccceeEEEEEEEEeecccccceecHHHHHhHHHhhCCChhHHHHHHHHHHHHHHhhHHHHHHHhhhcchhhccc Q lcl|NC_015288. 193 DAGKLFREMSFSIEKTSVTAKSRALKAEYTLELAQDLKAIHGLDAEQELANILSSEVLAEINREVVRRVYSVAKPGAANN 272 (468) Q Consensus 193 ~~g~~f~EMaFsIeK~tVtAKSRaLKAEYTvELAQDLkAiHGLDAE~ELanILStEImlEINREII~~l~~vA~~~k~~~ 272 (468) +..+++...++++++.++|.=+--..+|-||.+|-. .|.+++|.+-|+..|...+++.||.--- .+ T Consensus 88 --g~~~~~~~~~f~~v~~~~~k~~~~~~is~ell~ds~----~~l~~~i~~~l~~aia~~~d~a~l~G~g--------~~ 153 (324) T protein:vir:97 88 --GQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTY----SQFFEEMKPMIAEAFYKKFDEAGILNQG--------NN 153 (324) T ss_pred --CccccccccceeEEEEeeEEEEEeehhhHHHHhcch----HHHHHHHHHHHHHHHHHHHHHHhhccCC--------CC Confidence 112333334444444444444445569999999863 5789999999999999999999986321 11 Q ss_pred cccceeeeeecCCcchhHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEchhHHHHHhhcccccccccccccccccccc Q lcl|NC_015288. 273 VANAGIFDLDVDSNGRWSVEKFKGLLFQIERDCNAIAQDTRRGKGNFLICSADVASALAMAGVLDYSSGLTGAGGPAIGT 352 (468) Q Consensus 273 ~~~~Gv~Dl~~~~~~rw~~e~~~~l~~~i~~ean~i~q~T~rg~~n~~v~S~~va~~L~~~G~~~~~~~~~~~~~~~~~~ 352 (468) ..+.|++........-.... ..|.-.+.+......--+ ....+|+|+.....|... ...- ++. ... T Consensus 154 ~~~~gi~~~~~~~~~~~~~~----~~~~~i~~~~~~l~~~~~-~~~~~v~n~~~~~~L~~l---kd~~---g~~---~~~ 219 (324) T protein:vir:97 154 PFGKSIAQSIEKTNKVIKGD----FTQDNIIDLEALLEDDEL-EANAFISKTQNRSLLRKI---VDPE---TKE---RIY 219 (324) T ss_pred ccCccccccccccceecccc----CCHHHHHHHHHhhhhccC-CCCEEEEcHHHHHHHHHh---hcCC---Cce---eec Confidence 11222222111111000000 001111222222222222 334578999999888532 1111 111 001 Q ss_pred ccCCCceeEEEecCCeEEEEccccccCCCcceEEEEEecCCcccceeEEcccccccc--------ccccCCc-----cc- Q lcl|NC_015288. 353 VDDTGNLAVGTINGRIKVYVDPYAANLSDKHYYVVGYKGTSPYDAGLFYCPYVPLQM--------VRSIDPN-----NF- 418 (468) Q Consensus 353 ~D~t~~~~~G~l~~~~~vy~D~Ya~~~s~~dY~~vG~Kg~~~~d~glfyaPYv~~~~--------~~~~Dp~-----s~- 418 (468) +.+ .|+|. +++|++.+-.. .+...+++|-.. .+++...-...+ ....|+. -| T Consensus 220 -~~~----~~tl~-G~PV~~~~~~~--~~~~~~~~gd~~------~~~i~~~~~~~i~~~~~~~~~~~~~~~~~~~~~f~ 285 (324) T protein:vir:97 220 -DRN----SDTLD-GLPVVNLKSSN--LKRGELITGDFD------KLIYGIPQLIEYKIDETAQLSTVKNEDGTPVNLFE 285 (324) T ss_pred -CCC----Ccccc-ceeeEeecCCC--CCcceEEEEecc------cEEEEEecCcEEEEeecccccccccccccchhhhh Confidence 111 13454 45676654211 112223333110 011111111010 0001111 01 Q ss_pred --cceeeeeeecce-eecC--cccccCc-c-cccCChhh Q lcl|NC_015288. 419 --QPKIGFKTRYGM-VSNP--FVTTNGL-Y-SGTPDGET 450 (468) Q Consensus 419 --qP~~g~~tRY~l-~~nP--f~~~~~~-~-~~~~~~~~ 450 (468) +=.+=+..||+. ..|| |+....- . ...+.++- T Consensus 286 ~d~~~~r~~~r~d~~v~~~~a~~~l~~~~~~~~~~~~~~ 324 (324) T protein:vir:97 286 QDMVALRATMHVALHIADDKAFAKLVPADKKTDSVPGEV 324 (324) T ss_pred cCcEEEEEEEEeccEEecccceEEEEeccCCCCCCCCCC Confidence 122223457764 4455 4432110 0 01112221 No 67 >protein:vir:105004 Length: 392 # NCBI annotation: putative major capsid protein # Family: family:all:21 # MgeID: mge:1490 # MgeName: W Beta # Cross-refs: genbank:acc:YP_459969;genbank:gi:85701384;genbank:GeneID:3882145 Probab=56.21 E-value=0.46 Score=22.42 Aligned_cols=318 Identities=17% Similarity=0.127 Sum_probs=126.7 Q ss_pred CcchHHHHHhhhhh---h--cCCcc----------ccccchhhhhhhhhhhhhHHHHHhhhhhhhhhccccccCcccccc Q lcl|NC_015288. 1 MFNAEHLQEKWSPV---L--NNEAA----------NPIADRYKKAVTSVLLENQERFLREERGMLQEVAVNSLGAGTVSP 65 (468) Q Consensus 1 ~~~~~~l~~kw~p~---l--~~~~~----------~~i~~~~~~~~~~~llenq~~~~~e~~~~l~e~~~~~~g~~~~~~ 65 (468) +=..+.|.++.... . +.+.. .+....+|+... ..|.+++..- +.+.++.... +.- T Consensus 35 ~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~l~~~~~~~-~~~~~~~~~~-~~~------- 104 (392) T protein:vir:10 35 MEEVRSLQKKIDLQRSLDEAETEERNNGREVETRNVDGEMEYRDVFM-KALRNKPLNA-EEREFLEDDL-EQR------- 104 (392) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhccccccccCccchHHHHHHHH-HHHhcccccH-HHHHHHhhhh-hhh------- Confidence 11111222222211 0 00000 011122333222 2222221100 1111111100 000 Q ss_pred cccccccc-cccccc---cccceehhhhHHhhhhhhhhheeeeecCCccceeeeeeeeeecCCCCCcccccccccccccc Q lcl|NC_015288. 66 GGSALGSA-NTAGLA---GFDPVLISLVRRAMPNLMAYDVCGVQPMSGPTGLIFAMRSRYENQAGEEALFNEPDAGFTAG 141 (468) Q Consensus 66 ~~~~~~st-~tg~~~---~~~P~Lv~l~RRa~~~LIa~DI~GVQPmTGPTGLIFAMRsrY~~qsG~EA~fnEa~t~fSG~ 141 (468) .....| +.|+.. .+.+.++.+.| ....-.+++++.||++++|-+.=.+ .. ++.++ .|-+ T Consensus 105 --~~~~~t~~~gg~~vP~~~~~~ii~~~~---~~s~l~~~~~~~~~~~~~~~~~~~~--~~--~~~~a-------~~v~- 167 (392) T protein:vir:10 105 --AMSGLTGEDGGLVIPQDIQTQINELAR---SFDALEQYVTVEPVRTRSGSRVLEK--NS--DMIPF-------AEIT- 167 (392) T ss_pred --hccccccCCCceecchhHHHHHHHHHH---hhhhhhhhceeeeccCCceeEEEEe--ec--CCccc-------eeec- Confidence 000011 112111 22334444444 4455668999999999887542222 11 11000 0000 Q ss_pred ccccccccccccccccccCcccccccccccccccccccccccchhhhhccCC-CCCccccceeEEEEEEEEeecccccce Q lcl|NC_015288. 142 LDATTGAYTPRTGAGVGGDAEGNNPALLNDSSPGTYETPRGFSREDLEQAGD-AGKLFREMSFSIEKTSVTAKSRALKAE 220 (468) Q Consensus 142 ~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~a~~g~~t~~~gm~Ta~aE~lG~-~g~~f~EMaFsIeK~tVtAKSRaLKAE 220 (468) ++....+ ....|.++.|...|. +-... T Consensus 168 ---------------------------------------------E~~~~~~~~~~~~~~v~l~~~k~-------~~~~~ 195 (392) T protein:vir:10 168 ---------------------------------------------EMGEIPETDNPKFSNVQYAVKDR-------AGILP 195 (392) T ss_pred ---------------------------------------------ccccccccccccceeEEeeeeeE-------EEeeh Confidence 0000000 112355555555554 44467 Q ss_pred ecHHHHHhHHHhhCCChhHHHHHHHHHHHHHHhhHHHHHHHhhhcchhhccccccceeeeeecCCcchhHHHHHHHHH-H Q lcl|NC_015288. 221 YTLELAQDLKAIHGLDAEQELANILSSEVLAEINREVVRRVYSVAKPGAANNVANAGIFDLDVDSNGRWSVEKFKGLL-F 299 (468) Q Consensus 221 YTvELAQDLkAiHGLDAE~ELanILStEImlEINREII~~l~~vA~~~k~~~~~~~Gv~Dl~~~~~~rw~~e~~~~l~-~ 299 (468) +|-||.+|- ..|.+++|.+-|...|...+|.-|+.-.-+. ...++..+ +....++ + T Consensus 196 iS~ell~ds----~~~l~~~i~~~l~~~i~~~~d~~~~~g~g~~---------~~~~~~~~----------d~i~~~~~~ 252 (392) T protein:vir:10 196 LSRSLLQDS----DQNILKYVTKWLGKKSKVTRNVLILGVIEKL---------TKQAIKSL----------DDIKDVLNV 252 (392) T ss_pred hhHHHHhhh----HHHHHHHHHHHHHHHHHHHHHHHHhhccccc---------cccCccCH----------HHHHHHHHH Confidence 899999984 2567889999999999999998887533221 12222221 2222221 1 Q ss_pred HHHHHHHHHHHHhccCCccEEEEchhHHHHHhhccccccccccccccccccccccCCCceeEEEecCCeEEEEccccccC Q lcl|NC_015288. 300 QIERDCNAIAQDTRRGKGNFLICSADVASALAMAGVLDYSSGLTGAGGPAIGTVDDTGNLAVGTINGRIKVYVDPYAANL 379 (468) Q Consensus 300 ~i~~ean~i~q~T~rg~~n~~v~S~~va~~L~~~G~~~~~~~~~~~~~~~~~~~D~t~~~~~G~l~~~~~vy~D~Ya~~~ 379 (468) .+ ... -+ ..-..|+|+.....|.. +..+.+ + .....+-+ ....++|.|...|+++. + T Consensus 253 ~l----~~~----~~-~~a~~vm~~~~~~~L~~---lkd~~G---~---~l~~~~~~-~~~~~tllG~~~v~~~~---~- 309 (392) T protein:vir:10 253 KL----DPA----IS-PNAILLTNQDGFNYLDK---LKDKDG---K---YILQSDPT-QKNKKLFAGTNPVVVVS---N- 309 (392) T ss_pred hh----hhh----hc-cCCEEEEcHHHHHHHHH---hhccCC---C---eEeecCcc-CCccccccCcccEEEec---c- Confidence 11 111 11 22336889999888854 222211 0 01111111 12346777777777542 1 Q ss_pred CCcceEEEEEecCCcccceeEEccccc-------cccccccCC------ccccceeeeeeecceee-cC--cccc---cC Q lcl|NC_015288. 380 SDKHYYVVGYKGTSPYDAGLFYCPYVP-------LQMVRSIDP------NNFQPKIGFKTRYGMVS-NP--FVTT---NG 440 (468) Q Consensus 380 s~~dY~~vG~Kg~~~~d~glfyaPYv~-------~~~~~~~Dp------~s~qP~~g~~tRY~l~~-nP--f~~~---~~ 440 (468) ..++.++...-+..++|+.+-. ..+...++| .+.|=.+-...|+|..+ +| |... .. T Consensus 310 -----~~~~~~~~~~~~~~~~~gdfs~~~~i~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~~ 384 (392) T protein:vir:10 310 -----RFLKSKGTTAKKAPLIIGDLKEAIVLFKREDMELASTDVGGKAFTRNTLDLRAIQRDDVQMWDNEAAVYGEIDLS 384 (392) T ss_pred -----cccCCCcccCCceEEEEEehhceEEEEeecceEEEEeccccchhhcCceEEEEEEeeccEEecccceEEEEeccc Confidence 1222233333333344443211 011111233 23445566777887643 44 4432 12 Q ss_pred cccccCCh Q lcl|NC_015288. 441 LYSGTPDG 448 (468) Q Consensus 441 ~~~~~~~~ 448 (468) .++..|-| T Consensus 385 a~~~~~~~ 392 (392) T protein:vir:10 385 APVEQPQG 392 (392) T ss_pred ccccCCCC Confidence 22222333 No 68 >protein:vir:102873 Length: 392 # NCBI annotation: major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1492 # MgeName: Cherry # Cross-refs: genbank:acc:YP_338137;genbank:gi:77020198;genbank:GeneID:3703782 Probab=56.21 E-value=0.46 Score=22.42 Aligned_cols=318 Identities=17% Similarity=0.127 Sum_probs=126.7 Q ss_pred CcchHHHHHhhhhh---h--cCCcc----------ccccchhhhhhhhhhhhhHHHHHhhhhhhhhhccccccCcccccc Q lcl|NC_015288. 1 MFNAEHLQEKWSPV---L--NNEAA----------NPIADRYKKAVTSVLLENQERFLREERGMLQEVAVNSLGAGTVSP 65 (468) Q Consensus 1 ~~~~~~l~~kw~p~---l--~~~~~----------~~i~~~~~~~~~~~llenq~~~~~e~~~~l~e~~~~~~g~~~~~~ 65 (468) +=..+.|.++.... . +.+.. .+....+|+... ..|.+++..- +.+.++.... +.- T Consensus 35 ~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~l~~~~~~~-~~~~~~~~~~-~~~------- 104 (392) T protein:vir:10 35 MEEVRSLQKKIDLQRSLDEAETEERNNGREVETRNVDGEMEYRDVFM-KALRNKPLNA-EEREFLEDDL-EQR------- 104 (392) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhccccccccCccchHHHHHHHH-HHHhcccccH-HHHHHHhhhh-hhh------- Confidence 11111222222211 0 00000 011122333222 2222221100 1111111100 000 Q ss_pred cccccccc-cccccc---cccceehhhhHHhhhhhhhhheeeeecCCccceeeeeeeeeecCCCCCcccccccccccccc Q lcl|NC_015288. 66 GGSALGSA-NTAGLA---GFDPVLISLVRRAMPNLMAYDVCGVQPMSGPTGLIFAMRSRYENQAGEEALFNEPDAGFTAG 141 (468) Q Consensus 66 ~~~~~~st-~tg~~~---~~~P~Lv~l~RRa~~~LIa~DI~GVQPmTGPTGLIFAMRsrY~~qsG~EA~fnEa~t~fSG~ 141 (468) .....| +.|+.. .+.+.++.+.| ....-.+++++.||++++|-+.=.+ .. ++.++ .|-+ T Consensus 105 --~~~~~t~~~gg~~vP~~~~~~ii~~~~---~~s~l~~~~~~~~~~~~~~~~~~~~--~~--~~~~a-------~~v~- 167 (392) T protein:vir:10 105 --AMSGLTGEDGGLVIPQDIQTQINELAR---SFDALEQYVTVEPVRTRSGSRVLEK--NS--DMIPF-------AEIT- 167 (392) T ss_pred --hccccccCCCceecchhHHHHHHHHHH---hhhhhhhhceeeeccCCceeEEEEe--ec--CCccc-------eeec- Confidence 000011 112111 22334444444 4455668999999999887542222 11 11000 0000 Q ss_pred ccccccccccccccccccCcccccccccccccccccccccccchhhhhccCC-CCCccccceeEEEEEEEEeecccccce Q lcl|NC_015288. 142 LDATTGAYTPRTGAGVGGDAEGNNPALLNDSSPGTYETPRGFSREDLEQAGD-AGKLFREMSFSIEKTSVTAKSRALKAE 220 (468) Q Consensus 142 ~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~a~~g~~t~~~gm~Ta~aE~lG~-~g~~f~EMaFsIeK~tVtAKSRaLKAE 220 (468) ++....+ ....|.++.|...|. +-... T Consensus 168 ---------------------------------------------E~~~~~~~~~~~~~~v~l~~~k~-------~~~~~ 195 (392) T protein:vir:10 168 ---------------------------------------------EMGEIPETDNPKFSNVQYAVKDR-------AGILP 195 (392) T ss_pred ---------------------------------------------ccccccccccccceeEEeeeeeE-------EEeeh Confidence 0000000 112355555555554 44467 Q ss_pred ecHHHHHhHHHhhCCChhHHHHHHHHHHHHHHhhHHHHHHHhhhcchhhccccccceeeeeecCCcchhHHHHHHHHH-H Q lcl|NC_015288. 221 YTLELAQDLKAIHGLDAEQELANILSSEVLAEINREVVRRVYSVAKPGAANNVANAGIFDLDVDSNGRWSVEKFKGLL-F 299 (468) Q Consensus 221 YTvELAQDLkAiHGLDAE~ELanILStEImlEINREII~~l~~vA~~~k~~~~~~~Gv~Dl~~~~~~rw~~e~~~~l~-~ 299 (468) +|-||.+|- ..|.+++|.+-|...|...+|.-|+.-.-+. ...++..+ +....++ + T Consensus 196 iS~ell~ds----~~~l~~~i~~~l~~~i~~~~d~~~~~g~g~~---------~~~~~~~~----------d~i~~~~~~ 252 (392) T protein:vir:10 196 LSRSLLQDS----DQNILKYVTKWLGKKSKVTRNVLILGVIEKL---------TKQAIKSL----------DDIKDVLNV 252 (392) T ss_pred hhHHHHhhh----HHHHHHHHHHHHHHHHHHHHHHHHhhccccc---------cccCccCH----------HHHHHHHHH Confidence 899999984 2567889999999999999998887533221 12222221 2222221 1 Q ss_pred HHHHHHHHHHHHhccCCccEEEEchhHHHHHhhccccccccccccccccccccccCCCceeEEEecCCeEEEEccccccC Q lcl|NC_015288. 300 QIERDCNAIAQDTRRGKGNFLICSADVASALAMAGVLDYSSGLTGAGGPAIGTVDDTGNLAVGTINGRIKVYVDPYAANL 379 (468) Q Consensus 300 ~i~~ean~i~q~T~rg~~n~~v~S~~va~~L~~~G~~~~~~~~~~~~~~~~~~~D~t~~~~~G~l~~~~~vy~D~Ya~~~ 379 (468) .+ ... -+ ..-..|+|+.....|.. +..+.+ + .....+-+ ....++|.|...|+++. + T Consensus 253 ~l----~~~----~~-~~a~~vm~~~~~~~L~~---lkd~~G---~---~l~~~~~~-~~~~~tllG~~~v~~~~---~- 309 (392) T protein:vir:10 253 KL----DPA----IS-PNAILLTNQDGFNYLDK---LKDKDG---K---YILQSDPT-QKNKKLFAGTNPVVVVS---N- 309 (392) T ss_pred hh----hhh----hc-cCCEEEEcHHHHHHHHH---hhccCC---C---eEeecCcc-CCccccccCcccEEEec---c- Confidence 11 111 11 22336889999888854 222211 0 01111111 12346777777777542 1 Q ss_pred CCcceEEEEEecCCcccceeEEccccc-------cccccccCC------ccccceeeeeeecceee-cC--cccc---cC Q lcl|NC_015288. 380 SDKHYYVVGYKGTSPYDAGLFYCPYVP-------LQMVRSIDP------NNFQPKIGFKTRYGMVS-NP--FVTT---NG 440 (468) Q Consensus 380 s~~dY~~vG~Kg~~~~d~glfyaPYv~-------~~~~~~~Dp------~s~qP~~g~~tRY~l~~-nP--f~~~---~~ 440 (468) ..++.++...-+..++|+.+-. ..+...++| .+.|=.+-...|+|..+ +| |... .. T Consensus 310 -----~~~~~~~~~~~~~~~~~gdfs~~~~i~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~~ 384 (392) T protein:vir:10 310 -----RFLKSKGTTAKKAPLIIGDLKEAIVLFKREDMELASTDVGGKAFTRNTLDLRAIQRDDVQMWDNEAAVYGEIDLS 384 (392) T ss_pred -----cccCCCcccCCceEEEEEehhceEEEEeecceEEEEeccccchhhcCceEEEEEEeeccEEecccceEEEEeccc Confidence 1222233333333344443211 011111233 23445566777887643 44 4432 12 Q ss_pred cccccCCh Q lcl|NC_015288. 441 LYSGTPDG 448 (468) Q Consensus 441 ~~~~~~~~ 448 (468) .++..|-| T Consensus 385 a~~~~~~~ 392 (392) T protein:vir:10 385 APVEQPQG 392 (392) T ss_pred ccccCCCC Confidence 22222333 No 69 >protein:vir:102082 Length: 392 # NCBI annotation: major head protein # Family: family:all:21 # MgeID: mge:1503 # MgeName: Fah # Cross-refs: genbank:acc:YP_512315;genbank:gi:89152484;genbank:GeneID:3953075 Probab=56.21 E-value=0.46 Score=22.42 Aligned_cols=318 Identities=17% Similarity=0.127 Sum_probs=126.7 Q ss_pred CcchHHHHHhhhhh---h--cCCcc----------ccccchhhhhhhhhhhhhHHHHHhhhhhhhhhccccccCcccccc Q lcl|NC_015288. 1 MFNAEHLQEKWSPV---L--NNEAA----------NPIADRYKKAVTSVLLENQERFLREERGMLQEVAVNSLGAGTVSP 65 (468) Q Consensus 1 ~~~~~~l~~kw~p~---l--~~~~~----------~~i~~~~~~~~~~~llenq~~~~~e~~~~l~e~~~~~~g~~~~~~ 65 (468) +=..+.|.++.... . +.+.. .+....+|+... ..|.+++..- +.+.++.... +.- T Consensus 35 ~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~l~~~~~~~-~~~~~~~~~~-~~~------- 104 (392) T protein:vir:10 35 MEEVRSLQKKIDLQRSLDEAETEERNNGREVETRNVDGEMEYRDVFM-KALRNKPLNA-EEREFLEDDL-EQR------- 104 (392) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhccccccccCccchHHHHHHHH-HHHhcccccH-HHHHHHhhhh-hhh------- Confidence 11111222222211 0 00000 011122333222 2222221100 1111111100 000 Q ss_pred cccccccc-cccccc---cccceehhhhHHhhhhhhhhheeeeecCCccceeeeeeeeeecCCCCCcccccccccccccc Q lcl|NC_015288. 66 GGSALGSA-NTAGLA---GFDPVLISLVRRAMPNLMAYDVCGVQPMSGPTGLIFAMRSRYENQAGEEALFNEPDAGFTAG 141 (468) Q Consensus 66 ~~~~~~st-~tg~~~---~~~P~Lv~l~RRa~~~LIa~DI~GVQPmTGPTGLIFAMRsrY~~qsG~EA~fnEa~t~fSG~ 141 (468) .....| +.|+.. .+.+.++.+.| ....-.+++++.||++++|-+.=.+ .. ++.++ .|-+ T Consensus 105 --~~~~~t~~~gg~~vP~~~~~~ii~~~~---~~s~l~~~~~~~~~~~~~~~~~~~~--~~--~~~~a-------~~v~- 167 (392) T protein:vir:10 105 --AMSGLTGEDGGLVIPQDIQTQINELAR---SFDALEQYVTVEPVRTRSGSRVLEK--NS--DMIPF-------AEIT- 167 (392) T ss_pred --hccccccCCCceecchhHHHHHHHHHH---hhhhhhhhceeeeccCCceeEEEEe--ec--CCccc-------eeec- Confidence 000011 112111 22334444444 4455668999999999887542222 11 11000 0000 Q ss_pred ccccccccccccccccccCcccccccccccccccccccccccchhhhhccCC-CCCccccceeEEEEEEEEeecccccce Q lcl|NC_015288. 142 LDATTGAYTPRTGAGVGGDAEGNNPALLNDSSPGTYETPRGFSREDLEQAGD-AGKLFREMSFSIEKTSVTAKSRALKAE 220 (468) Q Consensus 142 ~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~a~~g~~t~~~gm~Ta~aE~lG~-~g~~f~EMaFsIeK~tVtAKSRaLKAE 220 (468) ++....+ ....|.++.|...|. +-... T Consensus 168 ---------------------------------------------E~~~~~~~~~~~~~~v~l~~~k~-------~~~~~ 195 (392) T protein:vir:10 168 ---------------------------------------------EMGEIPETDNPKFSNVQYAVKDR-------AGILP 195 (392) T ss_pred ---------------------------------------------ccccccccccccceeEEeeeeeE-------EEeeh Confidence 0000000 112355555555554 44467 Q ss_pred ecHHHHHhHHHhhCCChhHHHHHHHHHHHHHHhhHHHHHHHhhhcchhhccccccceeeeeecCCcchhHHHHHHHHH-H Q lcl|NC_015288. 221 YTLELAQDLKAIHGLDAEQELANILSSEVLAEINREVVRRVYSVAKPGAANNVANAGIFDLDVDSNGRWSVEKFKGLL-F 299 (468) Q Consensus 221 YTvELAQDLkAiHGLDAE~ELanILStEImlEINREII~~l~~vA~~~k~~~~~~~Gv~Dl~~~~~~rw~~e~~~~l~-~ 299 (468) +|-||.+|- ..|.+++|.+-|...|...+|.-|+.-.-+. ...++..+ +....++ + T Consensus 196 iS~ell~ds----~~~l~~~i~~~l~~~i~~~~d~~~~~g~g~~---------~~~~~~~~----------d~i~~~~~~ 252 (392) T protein:vir:10 196 LSRSLLQDS----DQNILKYVTKWLGKKSKVTRNVLILGVIEKL---------TKQAIKSL----------DDIKDVLNV 252 (392) T ss_pred hhHHHHhhh----HHHHHHHHHHHHHHHHHHHHHHHHhhccccc---------cccCccCH----------HHHHHHHHH Confidence 899999984 2567889999999999999998887533221 12222221 2222221 1 Q ss_pred HHHHHHHHHHHHhccCCccEEEEchhHHHHHhhccccccccccccccccccccccCCCceeEEEecCCeEEEEccccccC Q lcl|NC_015288. 300 QIERDCNAIAQDTRRGKGNFLICSADVASALAMAGVLDYSSGLTGAGGPAIGTVDDTGNLAVGTINGRIKVYVDPYAANL 379 (468) Q Consensus 300 ~i~~ean~i~q~T~rg~~n~~v~S~~va~~L~~~G~~~~~~~~~~~~~~~~~~~D~t~~~~~G~l~~~~~vy~D~Ya~~~ 379 (468) .+ ... -+ ..-..|+|+.....|.. +..+.+ + .....+-+ ....++|.|...|+++. + T Consensus 253 ~l----~~~----~~-~~a~~vm~~~~~~~L~~---lkd~~G---~---~l~~~~~~-~~~~~tllG~~~v~~~~---~- 309 (392) T protein:vir:10 253 KL----DPA----IS-PNAILLTNQDGFNYLDK---LKDKDG---K---YILQSDPT-QKNKKLFAGTNPVVVVS---N- 309 (392) T ss_pred hh----hhh----hc-cCCEEEEcHHHHHHHHH---hhccCC---C---eEeecCcc-CCccccccCcccEEEec---c- Confidence 11 111 11 22336889999888854 222211 0 01111111 12346777777777542 1 Q ss_pred CCcceEEEEEecCCcccceeEEccccc-------cccccccCC------ccccceeeeeeecceee-cC--cccc---cC Q lcl|NC_015288. 380 SDKHYYVVGYKGTSPYDAGLFYCPYVP-------LQMVRSIDP------NNFQPKIGFKTRYGMVS-NP--FVTT---NG 440 (468) Q Consensus 380 s~~dY~~vG~Kg~~~~d~glfyaPYv~-------~~~~~~~Dp------~s~qP~~g~~tRY~l~~-nP--f~~~---~~ 440 (468) ..++.++...-+..++|+.+-. ..+...++| .+.|=.+-...|+|..+ +| |... .. T Consensus 310 -----~~~~~~~~~~~~~~~~~gdfs~~~~i~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~~ 384 (392) T protein:vir:10 310 -----RFLKSKGTTAKKAPLIIGDLKEAIVLFKREDMELASTDVGGKAFTRNTLDLRAIQRDDVQMWDNEAAVYGEIDLS 384 (392) T ss_pred -----cccCCCcccCCceEEEEEehhceEEEEeecceEEEEeccccchhhcCceEEEEEEeeccEEecccceEEEEeccc Confidence 1222233333333344443211 011111233 23445566777887643 44 4432 12 Q ss_pred cccccCCh Q lcl|NC_015288. 441 LYSGTPDG 448 (468) Q Consensus 441 ~~~~~~~~ 448 (468) .++..|-| T Consensus 385 a~~~~~~~ 392 (392) T protein:vir:10 385 APVEQPQG 392 (392) T ss_pred ccccCCCC Confidence 22222333 No 70 >protein:vir:107593 Length: 392 # NCBI annotation: major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1491 # MgeName: Gamma # Cross-refs: genbank:acc:YP_338188;genbank:gi:77020144;genbank:GeneID:3703724 Probab=56.21 E-value=0.46 Score=22.42 Aligned_cols=318 Identities=17% Similarity=0.127 Sum_probs=126.7 Q ss_pred CcchHHHHHhhhhh---h--cCCcc----------ccccchhhhhhhhhhhhhHHHHHhhhhhhhhhccccccCcccccc Q lcl|NC_015288. 1 MFNAEHLQEKWSPV---L--NNEAA----------NPIADRYKKAVTSVLLENQERFLREERGMLQEVAVNSLGAGTVSP 65 (468) Q Consensus 1 ~~~~~~l~~kw~p~---l--~~~~~----------~~i~~~~~~~~~~~llenq~~~~~e~~~~l~e~~~~~~g~~~~~~ 65 (468) +=..+.|.++.... . +.+.. .+....+|+... ..|.+++..- +.+.++.... +.- T Consensus 35 ~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~l~~~~~~~-~~~~~~~~~~-~~~------- 104 (392) T protein:vir:10 35 MEEVRSLQKKIDLQRSLDEAETEERNNGREVETRNVDGEMEYRDVFM-KALRNKPLNA-EEREFLEDDL-EQR------- 104 (392) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHhhccccccccCccchHHHHHHHH-HHHhcccccH-HHHHHHhhhh-hhh------- Confidence 11111222222211 0 00000 011122333222 2222221100 1111111100 000 Q ss_pred cccccccc-cccccc---cccceehhhhHHhhhhhhhhheeeeecCCccceeeeeeeeeecCCCCCcccccccccccccc Q lcl|NC_015288. 66 GGSALGSA-NTAGLA---GFDPVLISLVRRAMPNLMAYDVCGVQPMSGPTGLIFAMRSRYENQAGEEALFNEPDAGFTAG 141 (468) Q Consensus 66 ~~~~~~st-~tg~~~---~~~P~Lv~l~RRa~~~LIa~DI~GVQPmTGPTGLIFAMRsrY~~qsG~EA~fnEa~t~fSG~ 141 (468) .....| +.|+.. .+.+.++.+.| ....-.+++++.||++++|-+.=.+ .. ++.++ .|-+ T Consensus 105 --~~~~~t~~~gg~~vP~~~~~~ii~~~~---~~s~l~~~~~~~~~~~~~~~~~~~~--~~--~~~~a-------~~v~- 167 (392) T protein:vir:10 105 --AMSGLTGEDGGLVIPQDIQTQINELAR---SFDALEQYVTVEPVRTRSGSRVLEK--NS--DMIPF-------AEIT- 167 (392) T ss_pred --hccccccCCCceecchhHHHHHHHHHH---hhhhhhhhceeeeccCCceeEEEEe--ec--CCccc-------eeec- Confidence 000011 112111 22334444444 4455668999999999887542222 11 11000 0000 Q ss_pred ccccccccccccccccccCcccccccccccccccccccccccchhhhhccCC-CCCccccceeEEEEEEEEeecccccce Q lcl|NC_015288. 142 LDATTGAYTPRTGAGVGGDAEGNNPALLNDSSPGTYETPRGFSREDLEQAGD-AGKLFREMSFSIEKTSVTAKSRALKAE 220 (468) Q Consensus 142 ~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~a~~g~~t~~~gm~Ta~aE~lG~-~g~~f~EMaFsIeK~tVtAKSRaLKAE 220 (468) ++....+ ....|.++.|...|. +-... T Consensus 168 ---------------------------------------------E~~~~~~~~~~~~~~v~l~~~k~-------~~~~~ 195 (392) T protein:vir:10 168 ---------------------------------------------EMGEIPETDNPKFSNVQYAVKDR-------AGILP 195 (392) T ss_pred ---------------------------------------------ccccccccccccceeEEeeeeeE-------EEeeh Confidence 0000000 112355555555554 44467 Q ss_pred ecHHHHHhHHHhhCCChhHHHHHHHHHHHHHHhhHHHHHHHhhhcchhhccccccceeeeeecCCcchhHHHHHHHHH-H Q lcl|NC_015288. 221 YTLELAQDLKAIHGLDAEQELANILSSEVLAEINREVVRRVYSVAKPGAANNVANAGIFDLDVDSNGRWSVEKFKGLL-F 299 (468) Q Consensus 221 YTvELAQDLkAiHGLDAE~ELanILStEImlEINREII~~l~~vA~~~k~~~~~~~Gv~Dl~~~~~~rw~~e~~~~l~-~ 299 (468) +|-||.+|- ..|.+++|.+-|...|...+|.-|+.-.-+. ...++..+ +....++ + T Consensus 196 iS~ell~ds----~~~l~~~i~~~l~~~i~~~~d~~~~~g~g~~---------~~~~~~~~----------d~i~~~~~~ 252 (392) T protein:vir:10 196 LSRSLLQDS----DQNILKYVTKWLGKKSKVTRNVLILGVIEKL---------TKQAIKSL----------DDIKDVLNV 252 (392) T ss_pred hhHHHHhhh----HHHHHHHHHHHHHHHHHHHHHHHHhhccccc---------cccCccCH----------HHHHHHHHH Confidence 899999984 2567889999999999999998887533221 12222221 2222221 1 Q ss_pred HHHHHHHHHHHHhccCCccEEEEchhHHHHHhhccccccccccccccccccccccCCCceeEEEecCCeEEEEccccccC Q lcl|NC_015288. 300 QIERDCNAIAQDTRRGKGNFLICSADVASALAMAGVLDYSSGLTGAGGPAIGTVDDTGNLAVGTINGRIKVYVDPYAANL 379 (468) Q Consensus 300 ~i~~ean~i~q~T~rg~~n~~v~S~~va~~L~~~G~~~~~~~~~~~~~~~~~~~D~t~~~~~G~l~~~~~vy~D~Ya~~~ 379 (468) .+ ... -+ ..-..|+|+.....|.. +..+.+ + .....+-+ ....++|.|...|+++. + T Consensus 253 ~l----~~~----~~-~~a~~vm~~~~~~~L~~---lkd~~G---~---~l~~~~~~-~~~~~tllG~~~v~~~~---~- 309 (392) T protein:vir:10 253 KL----DPA----IS-PNAILLTNQDGFNYLDK---LKDKDG---K---YILQSDPT-QKNKKLFAGTNPVVVVS---N- 309 (392) T ss_pred hh----hhh----hc-cCCEEEEcHHHHHHHHH---hhccCC---C---eEeecCcc-CCccccccCcccEEEec---c- Confidence 11 111 11 22336889999888854 222211 0 01111111 12346777777777542 1 Q ss_pred CCcceEEEEEecCCcccceeEEccccc-------cccccccCC------ccccceeeeeeecceee-cC--cccc---cC Q lcl|NC_015288. 380 SDKHYYVVGYKGTSPYDAGLFYCPYVP-------LQMVRSIDP------NNFQPKIGFKTRYGMVS-NP--FVTT---NG 440 (468) Q Consensus 380 s~~dY~~vG~Kg~~~~d~glfyaPYv~-------~~~~~~~Dp------~s~qP~~g~~tRY~l~~-nP--f~~~---~~ 440 (468) ..++.++...-+..++|+.+-. ..+...++| .+.|=.+-...|+|..+ +| |... .. T Consensus 310 -----~~~~~~~~~~~~~~~~~gdfs~~~~i~~~~~~~~~~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~a~~~l~~~~~ 384 (392) T protein:vir:10 310 -----RFLKSKGTTAKKAPLIIGDLKEAIVLFKREDMELASTDVGGKAFTRNTLDLRAIQRDDVQMWDNEAAVYGEIDLS 384 (392) T ss_pred -----cccCCCcccCCceEEEEEehhceEEEEeecceEEEEeccccchhhcCceEEEEEEeeccEEecccceEEEEeccc Confidence 1222233333333344443211 011111233 23445566777887643 44 4432 12 Q ss_pred cccccCCh Q lcl|NC_015288. 441 LYSGTPDG 448 (468) Q Consensus 441 ~~~~~~~~ 448 (468) .++..|-| T Consensus 385 a~~~~~~~ 392 (392) T protein:vir:10 385 APVEQPQG 392 (392) T ss_pred ccccCCCC Confidence 22222333 No 71 >protein:vir:96223 Length: 324 # NCBI annotation: ORF011 # Family: family:all:507 # MgeID: mge:1607 # MgeName: 69 # Cross-refs: genbank:acc:YP_239571;genbank:gi:66395304;genbank:GeneID:5132771 Probab=55.57 E-value=0.48 Score=22.35 Aligned_cols=299 Identities=10% Similarity=0.042 Sum_probs=115.8 Q ss_pred hhhhHHHHHhhhhhhhhhcc-ccccCcccccccccccccccccccccccceeh-hhhHHhhhhhhhhheeeeecCCccce Q lcl|NC_015288. 35 LLENQERFLREERGMLQEVA-VNSLGAGTVSPGGSALGSANTAGLAGFDPVLI-SLVRRAMPNLMAYDVCGVQPMSGPTG 112 (468) Q Consensus 35 llenq~~~~~e~~~~l~e~~-~~~~g~~~~~~~~~~~~st~tg~~~~~~P~Lv-~l~RRa~~~LIa~DI~GVQPmTGPTG 112 (468) +-++|... .+.+++..... .+.+++.. ..++++++. -.-|.+. .+++.+..+.+..+++.+-||++++. T Consensus 1 ~~~~~~~~-~~~~~f~~~~~~~~~~~a~~-------~~~~~~~~~-lip~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~ 71 (324) T protein:vir:96 1 MEQTQKLK-LNLQHFASNNVKPQVFNPDN-------VMMHEKKDG-TLLNDFTTPILQEVMENSKIMQLGKYEPMEGTEK 71 (324) T ss_pred CCcchhhh-HHHHHHHHhhhhhhhccccc-------ccccCCCcc-eechhHHHHHHHHHHhhchhhhhcceeeccCCce Confidence 11122111 11111111110 01111111 111111111 1112222 24455556677888999999988764 Q ss_pred eeeeeeeeecCCCCCccccccccccccccccccccccccccccccccCcccccccccccccccccccccccchhhhhccC Q lcl|NC_015288. 113 LIFAMRSRYENQAGEEALFNEPDAGFTAGLDATTGAYTPRTGAGVGGDAEGNNPALLNDSSPGTYETPRGFSREDLEQAG 192 (468) Q Consensus 113 LIFAMRsrY~~qsG~EA~fnEa~t~fSG~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~a~~g~~t~~~gm~Ta~aE~lG 192 (468) -|.-. .. +.++ .|- ++++... T Consensus 72 ~~p~~----~~--~~~a-------~~v----------------------------------------------~Eg~~~~ 92 (324) T protein:vir:96 72 KFTFW----AD--KPGA-------YWV----------------------------------------------GEGQKIE 92 (324) T ss_pred EEEEE----ec--Ccce-------eee----------------------------------------------cCCcccc Confidence 33211 10 0000 000 0111111 Q ss_pred CCCCccccceeEEEEEEEEeecccccceecHHHHHhHHHhhCCChhHHHHHHHHHHHHHHhhHHHHHHHhhhcchhhccc Q lcl|NC_015288. 193 DAGKLFREMSFSIEKTSVTAKSRALKAEYTLELAQDLKAIHGLDAEQELANILSSEVLAEINREVVRRVYSVAKPGAANN 272 (468) Q Consensus 193 ~~g~~f~EMaFsIeK~tVtAKSRaLKAEYTvELAQDLkAiHGLDAE~ELanILStEImlEINREII~~l~~vA~~~k~~~ 272 (468) .....|.+..+.+.|..+- ...|-||.+|-. .|.+++|.+.|...|...+++.||.--- .+ T Consensus 93 ~~~~~f~~v~~~~~k~~~~-------~~is~ell~ds~----~~l~~~i~~~l~~aia~~~d~~~l~G~g--------~~ 153 (324) T protein:vir:96 93 TSKATWVNATMRAFKLGVI-------LPVTKEFLNYTY----SQFFEEMKPMIAEAFYKKFDEAGILNQG--------NN 153 (324) T ss_pred ccccceeEEEEEeEEEEEe-------ehhhHHHHhcch----HHHHHHHHHHHHHHHHHHHHHHhhhcCC--------CC Confidence 1123455555555555544 458999999853 4688899999999999999998885311 11 Q ss_pred cccceeeeeecCCcc-hhHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEchhHHHHHhhccccccccccccccccccc Q lcl|NC_015288. 273 VANAGIFDLDVDSNG-RWSVEKFKGLLFQIERDCNAIAQDTRRGKGNFLICSADVASALAMAGVLDYSSGLTGAGGPAIG 351 (468) Q Consensus 273 ~~~~Gv~Dl~~~~~~-rw~~e~~~~l~~~i~~ean~i~q~T~rg~~n~~v~S~~va~~L~~~G~~~~~~~~~~~~~~~~~ 351 (468) ..+.|++........ -...--+..+ ..+.... ...-+..+.++||+.....|... .... ++. .. T Consensus 154 ~~~~~~~~~~~~~~~~~~~~~~~~~i-----~~~~~~i-~~~~~~~~~~i~n~~~~~~L~~l---kd~~---G~~---~~ 218 (324) T protein:vir:96 154 PFGKSIAQSIKKTNKVIKGDFTQDNI-----IDLEALL-EDDELEANAFISKTQNRSLLRKI---VDPE---TKE---RI 218 (324) T ss_pred CcCccccccccccceecccccchHHH-----HHHHHhh-hhccCCCCEEEEcHHHHHHHHHh---hCCC---CCe---ee Confidence 122222221111100 0000001111 1111111 12234556789999998888542 1111 110 00 Q ss_pred cccCCCceeEEEecCCeEEEEccccccCCCcceEE--------EEEecCCcccceeEEccccccccccccCCcc-----c Q lcl|NC_015288. 352 TVDDTGNLAVGTINGRIKVYVDPYAANLSDKHYYV--------VGYKGTSPYDAGLFYCPYVPLQMVRSIDPNN-----F 418 (468) Q Consensus 352 ~~D~t~~~~~G~l~~~~~vy~D~Ya~~~s~~dY~~--------vG~Kg~~~~d~glfyaPYv~~~~~~~~Dp~s-----~ 418 (468) .+..+ ++| .+++|++++... .+..-++ +|..++-..+.+ .+ ..+....|+.. | T Consensus 219 -~~~~~----~~l-~G~PV~~~~~~~--~~~~~~~~gd~s~~~~~~~~~~~i~~~----~~--~~~~~~~~~~~~~~~~~ 284 (324) T protein:vir:96 219 -YDRNS----DSL-DGLPVVNLKSSN--LKRGELITGDFDKLIYGIPQLIEYKID----ET--AQLSTVKNEDGTPVNLF 284 (324) T ss_pred -cCCCC----Ccc-cceeeEeecCCC--CCcceEEEEecceEEEEEecCcEEEEe----ec--ccccccccccccchhhh Confidence 01112 334 357777764321 1122233 333322211100 00 00111111210 1 Q ss_pred ---cceeeeeeecce-eecC--ccccc---CcccccCChhh Q lcl|NC_015288. 419 ---QPKIGFKTRYGM-VSNP--FVTTN---GLYSGTPDGET 450 (468) Q Consensus 419 ---qP~~g~~tRY~l-~~nP--f~~~~---~~~~~~~~~~~ 450 (468) |=.+=..-||+. ..+| |+... ...... .|+- T Consensus 285 ~~n~v~~r~~~r~d~~v~~~~a~~~l~~a~~~~~~~-~~~~ 324 (324) T protein:vir:96 285 EQDMVALRATMHVALHIADDKAFAKLVPADKRTDSV-PGEV 324 (324) T ss_pred hcCcEEEEEEEEeccEEecccceEEEecccccCCCC-CCCC Confidence 223334567776 4455 44321 111111 1111 No 72 >protein:vir:81160 Length: 371 # NCBI annotation: major capsid protein # Family: family:all:21 # MgeID: mge:1892 # MgeName: Geobacillus virus E2 # Cross-refs: genbank:acc:YP_001285811;genbank:gi:148747732;genbank:GeneID:5247203 Probab=54.45 E-value=0.5 Score=22.22 Aligned_cols=316 Identities=12% Similarity=0.049 Sum_probs=124.5 Q ss_pred CcchHHHHHhhhhhhcCCccccccchhhhhh--hhhhhhhHHHHH-------------hhhhhhhhhccccccCcccccc Q lcl|NC_015288. 1 MFNAEHLQEKWSPVLNNEAANPIADRYKKAV--TSVLLENQERFL-------------REERGMLQEVAVNSLGAGTVSP 65 (468) Q Consensus 1 ~~~~~~l~~kw~p~l~~~~~~~i~~~~~~~~--~~~llenq~~~~-------------~e~~~~l~e~~~~~~g~~~~~~ 65 (468) +.+.++ .|.|.-+... |.+. ++++ ...+.|-+.+.. .+.+..+.+.+ -++ . T Consensus 22 ~~~~~~-~e~~~~~~~e-----i~~l-~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l----~~~---~ 87 (371) T protein:vir:81 22 LLAENK-IEEAKKLKEE-----IVAL-QEKFDVAKELYEEQKQTIEDKEPLKPTVQVKENEVEAFVNHI----RTR---F 87 (371) T ss_pred HhhHHH-HHHHHHHHHH-----HHHH-HHHHHHHHHHHHHHHHhhccccccccchhhHHHHHHHHHHHH----HHH---H Confidence 222222 2344333221 2211 1111 011111110000 00100000000 000 0 Q ss_pred cccccccc-cccccccccceehhhhHHhhhhhhhhheeeeecCCccceeeeeeeeeecCCCCCccccccccccccccccc Q lcl|NC_015288. 66 GGSALGSA-NTAGLAGFDPVLISLVRRAMPNLMAYDVCGVQPMSGPTGLIFAMRSRYENQAGEEALFNEPDAGFTAGLDA 144 (468) Q Consensus 66 ~~~~~~st-~tg~~~~~~P~Lv~l~RRa~~~LIa~DI~GVQPmTGPTGLIFAMRsrY~~qsG~EA~fnEa~t~fSG~~~~ 144 (468) -..+..++ +.|+..--....-.+++...++..-.+++++.||++.++-+.-.+ ..+ +.++ .|- T Consensus 88 ~~a~~~~t~~~gg~~vP~~~~~~ii~~~~~~s~i~~~~~~~~~~~~~~~~~~~~--~~~--~~~a-------~~v----- 151 (371) T protein:vir:81 88 RNAMSEGSNQDGGYTVPQDIQTRINELRESKDALQNLITVEPVTTLSGSRVFKK--RSQ--QTGF-------VEV----- 151 (371) T ss_pred HHhhccCCCccCceeecHhHHHHHHHHHHhhhhhhhhceeeeccCCceeEEEEe--ecC--Ccce-------eee----- Confidence 00011111 112211111111234555667778889999999998877654333 111 0100 000 Q ss_pred cccccccccccccccCcccccccccccccccccccccccchhhhhccCC-CCCccccceeEEEEEEEEeecccccceecH Q lcl|NC_015288. 145 TTGAYTPRTGAGVGGDAEGNNPALLNDSSPGTYETPRGFSREDLEQAGD-AGKLFREMSFSIEKTSVTAKSRALKAEYTL 223 (468) Q Consensus 145 ~~~~~~~~~~~~~~~~~~g~~~~~~~~a~~g~~t~~~gm~Ta~aE~lG~-~g~~f~EMaFsIeK~tVtAKSRaLKAEYTv 223 (468) ++++...+ +...|.+..++..|..+ ...+|- T Consensus 152 -----------------------------------------~Eg~~~~~~~~~~f~~i~~~~~k~~~-------~~~iS~ 183 (371) T protein:vir:81 152 -----------------------------------------AEGAAIGEKATPQFTLLQYQVKKYAG-------FFRVTN 183 (371) T ss_pred -----------------------------------------ccccccccccccceeeEEeeeeEEEE-------eehhhH Confidence 00000111 11245555555555554 457999 Q ss_pred HHHHhHHHhhCCChhHHHHHHHHHHHHHHhhHHHHHHHhhhcchhhccccccceeeeeecCCcchhHHHHHHHHHHHHHH Q lcl|NC_015288. 224 ELAQDLKAIHGLDAEQELANILSSEVLAEINREVVRRVYSVAKPGAANNVANAGIFDLDVDSNGRWSVEKFKGLLFQIER 303 (468) Q Consensus 224 ELAQDLkAiHGLDAE~ELanILStEImlEINREII~~l~~vA~~~k~~~~~~~Gv~Dl~~~~~~rw~~e~~~~l~~~i~~ 303 (468) ||.+|-. .|.++.|.+.|...|..-+|+.||.-.-+. .+.|+... +....++ T Consensus 184 ell~ds~----~~l~~~i~~~l~~a~~~~~~~~i~~g~g~~---------~~~~~~~~----------~~i~~~~----- 235 (371) T protein:vir:81 184 ELLNDST----EAIVNTLVRWIGDESRVTRNGLIINVLNTK---------AKTAIADL----------DGLKQII----- 235 (371) T ss_pred HHHhhhh----HHHHHHHHHHHHHHHHHHHHHHHHhhcccc---------cccccccH----------HHHHHHH----- Confidence 9999853 467889999999999999998888743321 22233221 1111111 Q ss_pred HHHHHHHHhccCCccEEEEchhHHHHHhhccccccccccccccccccccccCCCceeEEEecCCeEEEEccccccCCCcc Q lcl|NC_015288. 304 DCNAIAQDTRRGKGNFLICSADVASALAMAGVLDYSSGLTGAGGPAIGTVDDTGNLAVGTINGRIKVYVDPYAANLSDKH 383 (468) Q Consensus 304 ean~i~q~T~rg~~n~~v~S~~va~~L~~~G~~~~~~~~~~~~~~~~~~~D~t~~~~~G~l~~~~~vy~D~Ya~~~s~~d 383 (468) +......-+ ....+|+++.....|.. +..+.+ + .....+.+ .-..|+|. +++|++.. T Consensus 236 --~~~l~~~~~-~~a~~vmn~~~~~~L~~---lkd~~g---~---~l~~~~~~-~~~~~~l~-G~pV~~~~--------- 292 (371) T protein:vir:81 236 --NVQLDPVFR-STSSVIVNQDAFNWLDT---LKDQNG---Q---YLLQPSIS-SPTGRQLL-GLPVVIVS--------- 292 (371) T ss_pred --Hhhcchhhh-cCCEEEEcHHHHHHHHH---hhccCC---C---eeeecccC-CCCCceec-ceeEEEec--------- Confidence 111001111 22357889988887753 222110 0 00111111 11236775 46777662 Q ss_pred eEEEEEecC---CcccceeEEccccc-------cccccccCCc------cccceeeeeeecceee-cCcccccCcccccC Q lcl|NC_015288. 384 YYVVGYKGT---SPYDAGLFYCPYVP-------LQMVRSIDPN------NFQPKIGFKTRYGMVS-NPFVTTNGLYSGTP 446 (468) Q Consensus 384 Y~~vG~Kg~---~~~d~glfyaPYv~-------~~~~~~~Dp~------s~qP~~g~~tRY~l~~-nPf~~~~~~~~~~~ 446 (468) ++..|..+. ..-...++|+.+.. ..+...+++. .-|=.+-...|++..+ || T Consensus 293 ~~~~~~~~~~~~~~~~~~i~~Gd~~~~~~~~~~~~~~i~~~~~~~~~f~~~~v~~~~~~r~d~~~~~~------------ 360 (371) T protein:vir:81 293 NKVLANRVDGGTGAQFAPIIVGDLKEAVVMFDRQRTEIMSSNVAMDAFETDATLWRAIERMDVKMRDD------------ 360 (371) T ss_pred ccccCccccccccCCcceEEEEehhceEEEEeecceEEEEeccccchhhcCceEEEEEEeeccEEecc------------ Confidence 233333321 11122344444321 1122222332 2233445555665532 33 Q ss_pred ChhhhhhccCceeeeEEeecc Q lcl|NC_015288. 447 DGETLTPSTNMYYRRVQVTNL 467 (468) Q Consensus 447 ~~~~~~~~~N~y~r~~~v~~~ 467 (468) ..|.++.++-= T Consensus 361 ----------~a~~~~~~~~A 371 (371) T protein:vir:81 361 ----------EAFVFGEVQLA 371 (371) T ss_pred ----------cceEEEEEecC Confidence 12222222222 No 73 >protein:vir:2430 Length: 318 # NCBI annotation: major head subunit # Family: family:all:507 # MgeID: mge:52 # MgeName: D29 # Cross-refs: genbank:acc:NP_046832;genbank:gi:9630400;genbank:GeneID:1261582 Probab=53.23 E-value=0.53 Score=22.08 Aligned_cols=298 Identities=10% Similarity=0.049 Sum_probs=117.6 Q ss_pred HhhhhhhhhhccccccCcccccccccccccccccccccccc-eehhhhHHhhhhhhhhheeeeecCCccceeeeeeeeee Q lcl|NC_015288. 43 LREERGMLQEVAVNSLGAGTVSPGGSALGSANTAGLAGFDP-VLISLVRRAMPNLMAYDVCGVQPMSGPTGLIFAMRSRY 121 (468) Q Consensus 43 ~~e~~~~l~e~~~~~~g~~~~~~~~~~~~st~tg~~~~~~P-~Lv~l~RRa~~~LIa~DI~GVQPmTGPTGLIFAMRsrY 121 (468) ++-..++-.|....+ ..+ ++..+.. .-| ..-.+++...+..+..+++.+-||++++.-|. +. T Consensus 1 ~~~~~~~~~e~~~~~-~~~----------~~~~~~~--ip~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~ip----~~ 63 (318) T protein:vir:24 1 MAAGTAFAVDHAQIA-QTG----------DTMFKGY--LEPEQAKDYFAEAEKTSIVQQFAQKVPMGTTGQKIP----HW 63 (318) T ss_pred CCCCCCCCHHHHHhh-ccc----------Cccccee--echhHHHHHHHHHHhhchhhhhcceeeccCCceEEE----EE Confidence 211111111111000 000 0111111 111 11123344445667788899999987653321 11 Q ss_pred cCCCCCccccccccccccccccccccccccccccccccCcccccccccccccccccccccccchhhhhccCCCCCccccc Q lcl|NC_015288. 122 ENQAGEEALFNEPDAGFTAGLDATTGAYTPRTGAGVGGDAEGNNPALLNDSSPGTYETPRGFSREDLEQAGDAGKLFREM 201 (468) Q Consensus 122 ~~qsG~EA~fnEa~t~fSG~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~a~~g~~t~~~gm~Ta~aE~lG~~g~~f~EM 201 (468) .. +.++ .|- + | +.++++. T Consensus 64 ~~--~~~a-------~~v----------------------------------------------~--E-----g~~~~~~ 81 (318) T protein:vir:24 64 VG--DVSA-------QWI----------------------------------------------G--E-----GDMKPIT 81 (318) T ss_pred eC--Ccce-------EEe----------------------------------------------c--C-----Ccccccc Confidence 10 0000 000 0 0 2234444 Q ss_pred eeEEEEEEEEeecccccceecHHHHHhHHHhhCCChhHHHHHHHHHHHHHHhhHHHHHHHhhhcchhhccccccceeeee Q lcl|NC_015288. 202 SFSIEKTSVTAKSRALKAEYTLELAQDLKAIHGLDAEQELANILSSEVLAEINREVVRRVYSVAKPGAANNVANAGIFDL 281 (468) Q Consensus 202 aFsIeK~tVtAKSRaLKAEYTvELAQDLkAiHGLDAE~ELanILStEImlEINREII~~l~~vA~~~k~~~~~~~Gv~Dl 281 (468) ..++++++.+.|..+-...+|-||.+|-. .|.+++|.+.|+..|...|++.+|.---+ + .+.|++.. T Consensus 82 ~~~f~~i~~~~~k~~~~~~iS~e~l~ds~----~~~~~~i~~~l~~~~~~~~d~a~l~G~g~----~-----~~~~~~~~ 148 (318) T protein:vir:24 82 KGNMTSQTIAPHKIATIFVASAETVRANP----ANYLGTMRTKVATAFAMAFDGAAMHGTDS----P-----FPTYIGQT 148 (318) T ss_pred ccceeEEEEeeEEEEEeehhhHHHhhcCh----HHHHHHHHHHHHHHHHHHHHHhhhcccCC----C-----CCcccccc Confidence 55566666666666667789999999854 57999999999999999999998753211 1 11222221 Q ss_pred ecCCc-chhHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEchhHHHHHhhccccccccc-cccccccccccccCCCce Q lcl|NC_015288. 282 DVDSN-GRWSVEKFKGLLFQIERDCNAIAQDTRRGKGNFLICSADVASALAMAGVLDYSSG-LTGAGGPAIGTVDDTGNL 359 (468) Q Consensus 282 ~~~~~-~rw~~e~~~~l~~~i~~ean~i~q~T~rg~~n~~v~S~~va~~L~~~G~~~~~~~-~~~~~~~~~~~~D~t~~~ 359 (468) ..... +...... ........++... ....-.....+|+|+.....|.. +..+.+ ..-+.... ....... T Consensus 149 ~~~~~~~~~~~~~--~~~~~~~~~~~~~-~~~~~~~~~~~v~n~~~~~~L~~---lkd~~G~~l~~~~~~---~~~~~~~ 219 (318) T protein:vir:24 149 TKAISIADTTGAT--TVYDQVAVNGLSL-LVNDGKKWTHTLLDDITEPILNG---AKDQNGRPLFIESTY---GEAASPF 219 (318) T ss_pred ccccccccccccc--chHHHHHHHHHHh-hccccCCCCEEEEcHHHHHHHHH---hhccCCceeecCccc---cCccccc Confidence 11100 0000000 0001111111111 12223344568999999998863 222111 00000000 0001111 Q ss_pred eEEEecCCeEEEEccccccCC------CcceEEEEEecCCcccceeEEccccccccccccCCcc-----c---cceeeee Q lcl|NC_015288. 360 AVGTINGRIKVYVDPYAANLS------DKHYYVVGYKGTSPYDAGLFYCPYVPLQMVRSIDPNN-----F---QPKIGFK 425 (468) Q Consensus 360 ~~G~l~~~~~vy~D~Ya~~~s------~~dY~~vG~Kg~~~~d~glfyaPYv~~~~~~~~Dp~s-----~---qP~~g~~ 425 (468) .-+.+. +++|++.+.+.... ++.++++|..+..+.+- -.+..+....|+.. | |=.+=.. T Consensus 220 ~~~~i~-g~pv~~~~~~~~~~~~~~~gdfs~~~~~~~~~l~i~~------~~~~~~~~~~~~~~~~~~~f~~~~~~~r~~ 292 (318) T protein:vir:24 220 RSGRIV-ARPTILSDHVVEGTTVGFMGDFSQLIWGQIGGLSFDV------TDQATLNLGTVESPNFVSLWQHNLVAVRVE 292 (318) T ss_pred cCceEE-EEeeEEeCCCCCCccEEEEeecceEEEEEecCeEEEE------eeccceeccccccccchhhhhcCcEEEEEE Confidence 112222 35666655432110 11112223222211110 00001111122211 2 2333345 Q ss_pred eeccee-ecC--cccccCcccccCCh Q lcl|NC_015288. 426 TRYGMV-SNP--FVTTNGLYSGTPDG 448 (468) Q Consensus 426 tRY~l~-~nP--f~~~~~~~~~~~~~ 448 (468) .|++.. .+| |+..+...+..--| T Consensus 293 ~r~d~~v~~~~a~~~i~~~~a~~~~~ 318 (318) T protein:vir:24 293 AEYAFHCNDAEAFVALTNVVSGGGEG 318 (318) T ss_pred EEEccEEecccceEEEEeeccCCCCC Confidence 677765 445 44332222211111 No 74 >protein:vir:5739 Length: 366 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:122 # MgeName: PY54 # Cross-refs: genbank:acc:NP_892050;genbank:gi:33770513;interpro:IPR006444;uniprot:Q7Y410;genbank:GeneID:1732928 Probab=50.47 E-value=0.61 Score=21.77 Aligned_cols=321 Identities=15% Similarity=0.106 Sum_probs=110.3 Q ss_pred CcchHHHHHhhhh--hhcCCccccccch-hhhhhhhhhhh---hHHHHHhhhhhhhh-hccccccCcccccccccccccc Q lcl|NC_015288. 1 MFNAEHLQEKWSP--VLNNEAANPIADR-YKKAVTSVLLE---NQERFLREERGMLQ-EVAVNSLGAGTVSPGGSALGSA 73 (468) Q Consensus 1 ~~~~~~l~~kw~p--~l~~~~~~~i~~~-~~~~~~~~lle---nq~~~~~e~~~~l~-e~~~~~~g~~~~~~~~~~~~st 73 (468) --+++.-++++.+ .+....++.-+.. ..|.+.+ |.. |..+.....+..+. +.... .+..++ T Consensus 3 ~~~a~~~~~~~~~~~~~~~~~~~~~kg~~~~~~~~a-~a~~~g~~~~a~~~a~~~~~~~~~~~-----------a~~~~~ 70 (366) T protein:vir:57 3 AAVAVPVKAHSVAPGIIIKEELQQYKGAGMTRMVMS-IAAGKGNLADAAKFAATELGDTGLSM-----------AISTAA 70 (366) T ss_pred ccccccccccccccccccccccccccchhHHHHHHH-HHhcccchhHHHHHHHHhhcchhhhh-----------hccccc Confidence 1111111112211 0000000000000 0011111 111 11111110000000 00000 011111 Q ss_pred cccccc---cccceehhhhHHhhhhhhhhheeeeecCCccceeeeeeeeeecCCCCCccccccccccccccccccccccc Q lcl|NC_015288. 74 NTAGLA---GFDPVLISLVRRAMPNLMAYDVCGVQPMSGPTGLIFAMRSRYENQAGEEALFNEPDAGFTAGLDATTGAYT 150 (468) Q Consensus 74 ~tg~~~---~~~P~Lv~l~RRa~~~LIa~DI~GVQPmTGPTGLIFAMRsrY~~qsG~EA~fnEa~t~fSG~~~~~~~~~~ 150 (468) .+|+.. .....++.+.| +..+...+ |++.+++++|-+-=.| .. ++.++ .| T Consensus 71 ~~Gg~lvP~~~~~~ii~~l~---~~s~l~~l-g~~~v~~~~g~~~~p~--~t--~~~~a-------~w------------ 123 (366) T protein:vir:57 71 GSGGALIPQNMQNEVIELLR---DRTVVRIL-GARSIPLPNGNLSMPR--LS--GGATA-------GY------------ 123 (366) T ss_pred cCCccccchhHHHHHHHHHh---hhcchhhh-ceeeeecCCCceEEEE--Ee--CCcce-------ee------------ Confidence 122211 01112222322 22222222 3333333333211001 00 00000 00 Q ss_pred cccccccccCcccccccccccccccccccccccchhhhhccCCCCCccccceeEEEEEEEEeecccccceecHHHHHhHH Q lcl|NC_015288. 151 PRTGAGVGGDAEGNNPALLNDSSPGTYETPRGFSREDLEQAGDAGKLFREMSFSIEKTSVTAKSRALKAEYTLELAQDLK 230 (468) Q Consensus 151 ~~~~~~~~~~~~g~~~~~~~~a~~g~~t~~~gm~Ta~aE~lG~~g~~f~EMaFsIeK~tVtAKSRaLKAEYTvELAQDLk 230 (468) . +| +..+++...+++++++..|.-+-...+|-||.+|-- T Consensus 124 ----------------------------------v--~E-----~~~~~~s~~~f~~i~~~~~k~~~~~~iS~ell~ds~ 162 (366) T protein:vir:57 124 ----------------------------------V--GE-----GKDVVATGATFDDVKLSAKTMIALVPVSNQLIGRAG 162 (366) T ss_pred ----------------------------------e--cc-----CccccccccceeEEEEeeEEEEEeehhhHHHHhhhh Confidence 0 01 223444455566666777666667788999998753 Q ss_pred HhhCCChhHHHHHHHHHHHHHHhhHHHHHHHhhhcchhhccccccceeeeeecCC---------cchhH-HHHHHHHHHH Q lcl|NC_015288. 231 AIHGLDAEQELANILSSEVLAEINREVVRRVYSVAKPGAANNVANAGIFDLDVDS---------NGRWS-VEKFKGLLFQ 300 (468) Q Consensus 231 AiHGLDAE~ELanILStEImlEINREII~~l~~vA~~~k~~~~~~~Gv~Dl~~~~---------~~rw~-~e~~~~l~~~ 300 (468) .|.|+.|.+-|...|...+++.||.-=- .+-.+.|++-..... ...|. +..+..++. T Consensus 163 ----~~~~~~i~~~l~~a~~~~~d~a~l~G~G--------~~~~p~Gi~~~~~~~~~~~~~~~t~~~~~~~~~~~~~~~- 229 (366) T protein:vir:57 163 ----FNVEQLLLGDILSAIATREDKAFLRDDG--------TGDTPKGMKAVATAANRLVAWTGTAINLTTIDEYLDSLI- 229 (366) T ss_pred ----HHHHHHHHHHHHHHHHHHHHHHhhccCC--------CCccccceeeccccccceeeccccccchhhHHHHHHHHH- Confidence 4688999999999999999888885210 011223332211110 11111 111111111 Q ss_pred HHHHHHHHHHHhccCCccEEEEchhHHHHHhhccccccccccccccccccccccCCCceeEEEecCCeEEEEccccccC- Q lcl|NC_015288. 301 IERDCNAIAQDTRRGKGNFLICSADVASALAMAGVLDYSSGLTGAGGPAIGTVDDTGNLAVGTINGRIKVYVDPYAANL- 379 (468) Q Consensus 301 i~~ean~i~q~T~rg~~n~~v~S~~va~~L~~~G~~~~~~~~~~~~~~~~~~~D~t~~~~~G~l~~~~~vy~D~Ya~~~- 379 (468) ...............|+++.....|.. +..+.+ + ... .+.+ -|+|. +++|+++.+...+ T Consensus 230 -----~~~~~~~~~~~~a~~vmn~~~~~~L~~---lkd~~G---~---~l~-~~~~----~g~l~-G~Pvv~s~~ip~~~ 289 (366) T protein:vir:57 230 -----LKHMDSNSNMIRCGWGLSNRTYMTLFG---LRDGNG---N---KVY-PEMS----QGILK-GYPIQRTSAIPANL 289 (366) T ss_pred -----HhhhccccccccCEEEecHHHHHHHHh---hhccCC---c---eec-cCCC----CCeec-ceeeEEcccccccc Confidence 111111222233445788888877753 222111 1 011 1111 25664 5788887553211 Q ss_pred -----------CCcceEEEEEecCCcccceeEEccccccccccccCCc--------cccceeeeeeecceee-cC--ccc Q lcl|NC_015288. 380 -----------SDKHYYVVGYKGTSPYDAGLFYCPYVPLQMVRSIDPN--------NFQPKIGFKTRYGMVS-NP--FVT 437 (468) Q Consensus 380 -----------s~~dY~~vG~Kg~~~~d~glfyaPYv~~~~~~~~Dp~--------s~qP~~g~~tRY~l~~-nP--f~~ 437 (468) -++..+++|-.+..+.+- +++.. ..|+. +-|=.+=...|+++.+ +| |+. T Consensus 290 ~~~~~~~~i~~gdfs~~~i~~~~~i~i~~----~~ea~-----~~~~~g~~~~~f~~~~~~iR~~~~~d~~v~~~~a~~~ 360 (366) T protein:vir:57 290 GDDGNESEIYFCDFNDVVIGEDGMMKVDF----STEAT-----YKDADGQLVSAFARNQSLIRVVTEHDIGFRHPEGLVL 360 (366) T ss_pred ccCCCccEEEEEecceEEEEEecceEEEE----eeccc-----cccccccchhhhhcCceeEEeeeeeCcEeeccccEEE Confidence 122233344444333221 11110 01111 1112333445566554 23 221 Q ss_pred ccCcccccCChhhh Q lcl|NC_015288. 438 TNGLYSGTPDGETL 451 (468) Q Consensus 438 ~~~~~~~~~~~~~~ 451 (468) ..+..| T Consensus 361 --------lt~~~~ 366 (366) T protein:vir:57 361 --------GTGVIW 366 (366) T ss_pred --------EecccC Confidence 233344 No 75 >protein:vir:105334 Length: 276 # NCBI annotation: putative phage major capsid protein # Family: family:all:522 # MgeID: mge:1679 # MgeName: PH15 # Cross-refs: genbank:acc:YP_950669;genbank:gi:119967839;genbank:GeneID:4643213 Probab=49.90 E-value=0.63 Score=21.70 Aligned_cols=266 Identities=12% Similarity=0.006 Sum_probs=109.0 Q ss_pred eeeeecCCCCC-cccccccccccccccc-ccccccccccccccccCcccccccccccccccccccccccchhhhhccCCC Q lcl|NC_015288. 117 MRSRYENQAGE-EALFNEPDAGFTAGLD-ATTGAYTPRTGAGVGGDAEGNNPALLNDSSPGTYETPRGFSREDLEQAGDA 194 (468) Q Consensus 117 MRsrY~~qsG~-EA~fnEa~t~fSG~~~-~~~~~~~~~~~~~~~~~~~g~~~~~~~~a~~g~~t~~~gm~Ta~aE~lG~~ 194 (468) |=.. ++.- .-+..|..+.+=-... ......... .......+ . ...+.+...--...++|.+.+ T Consensus 1 Ma~~---~T~l~d~i~Pev~~~~v~~~~~~~~~~~~~~---~~~~~l~g-~-------~G~ti~iP~~~~igda~~~~e- 65 (276) T protein:vir:10 1 MAQG---TTTKSTQIVPEVLAPMMQAELDKKLRFAQFA---DIDSTLVG-Q-------PGDTLTFPAFVYSGDATVVPE- 65 (276) T ss_pred CCcc---eeehhhhhchHHHHHHHHHHHHhhhhhcccc---eecccccC-C-------CCCEEEeeeecCCCccccccC- Confidence 1100 0000 0011111111000000 000000000 00000000 0 011111111111123333333 Q ss_pred CCccccceeEEEEEEEEeecccccceecHHHHHhHHH-hhCCChhHHHHHHHHHHHHHHhhHHHHHHHhhhcchhhcccc Q lcl|NC_015288. 195 GKLFREMSFSIEKTSVTAKSRALKAEYTLELAQDLKA-IHGLDAEQELANILSSEVLAEINREVVRRVYSVAKPGAANNV 273 (468) Q Consensus 195 g~~f~EMaFsIeK~tVtAKSRaLKAEYTvELAQDLkA-iHGLDAE~ELanILStEImlEINREII~~l~~vA~~~k~~~~ 273 (468) +.++..-..+..+.+++.|-|.-.-++| |+-+ .-+.|.-.+..+-++..|+..++.+++..|....... T Consensus 66 g~~i~~~~lt~~~~~a~i~~~~k~~~~t-----D~a~~~~~~dp~~~~~~~~~~~~a~~~d~~~~~~l~~~~~~~----- 135 (276) T protein:vir:10 66 GQKIPVDKIETNRREAKIHKIGKGTDIT-----DEALLSGYGDPQGEAVRQHGLAIANKVDNDVLEALRGTKLTV----- 135 (276) T ss_pred CCccCccccccceeeEEeehcccccccc-----HHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHhcccccc----- Confidence 2233333344455555556554333333 3333 2367999999999999999999999998776533221 Q ss_pred ccceeeeeecCCcchhHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEchhHHHHHh---hcccccccccccccccccc Q lcl|NC_015288. 274 ANAGIFDLDVDSNGRWSVEKFKGLLFQIERDCNAIAQDTRRGKGNFLICSADVASALA---MAGVLDYSSGLTGAGGPAI 350 (468) Q Consensus 274 ~~~Gv~Dl~~~~~~rw~~e~~~~l~~~i~~ean~i~q~T~rg~~n~~v~S~~va~~L~---~~G~~~~~~~~~~~~~~~~ 350 (468) +.+.+. .+.+-..+..+.. .-.+.++++++|++++.|. +..|++.+.. + T Consensus 136 -~~~~~t----------~d~i~~A~~~lgd---------~~~~~~~ivv~p~~~~~L~k~~~~~f~~~s~~-----g--- 187 (276) T protein:vir:10 136 -SADIGT----------LAGLEAAIDTFDD---------EDLEPMVLFINPKDAGKLRSSASDNFTRATEL-----G--- 187 (276) T ss_pred -cccccC----------HHHHHHHHHHhcc---------ccCcccEEEEcHHHHHHHHHhccccccccccc-----c--- Confidence 111111 1211111111111 1346889999999999984 3444432211 0 Q ss_pred ccccCCCceeEEEecCCeEEEEccccccCCCcceEEEEEe-cCCcccceeEEccccccccccccCCccccceeeeeeecc Q lcl|NC_015288. 351 GTVDDTGNLAVGTINGRIKVYVDPYAANLSDKHYYVVGYK-GTSPYDAGLFYCPYVPLQMVRSIDPNNFQPKIGFKTRYG 429 (468) Q Consensus 351 ~~~D~t~~~~~G~l~~~~~vy~D~Ya~~~s~~dY~~vG~K-g~~~~d~glfyaPYv~~~~~~~~Dp~s~qP~~g~~tRY~ 429 (468) .+...+-.+|++. |++|++| ++-|. |-.+-++ |+-.+ +...=++.+.-| |++.++-.+--.-+|| T Consensus 188 --~~~~~~G~ig~~~-G~~Vi~s----~~~p~-~t~~l~~~gAi~~----~~~~~~~vE~dR--d~~~~~d~i~~~~~y~ 253 (276) T protein:vir:10 188 --DNIIVKGAFGEAL-GAVIVRS----KKLDE-GEAILAKRGAVKL----ITKRDFFLETDR--DPSTKTTALYSDKHYV 253 (276) T ss_pred --ccceeccccceec-ceeEEEc----CCCCc-ceEEEEeccceee----eecCCceeeccc--chhhcccEEEEeeEEE Confidence 0111222467874 6899999 54553 2222222 22211 111001111111 7888888888888887 Q ss_pred eee-cC--cccccCcccccCChhhhhhccCc Q lcl|NC_015288. 430 MVS-NP--FVTTNGLYSGTPDGETLTPSTNM 457 (468) Q Consensus 430 l~~-nP--f~~~~~~~~~~~~~~~~~~~~N~ 457 (468) +.. || ....+... +..-+|+ T Consensus 254 ~~~~~~~~vv~~t~~~--------~~~~~~~ 276 (276) T protein:vir:10 254 AYLYDESKAVKVTKGA--------GTTDSGA 276 (276) T ss_pred EEEEcCcceEEEecCC--------cCCcCCC Confidence 753 34 11111111 1112222 No 76 >protein:vir:4511 Length: 409 # NCBI annotation: capsid # Family: family:all:21 # MgeID: mge:97 # MgeName: V # Cross-refs: genbank:acc:NP_599037;genbank:gi:19548995;genbank:GeneID:935211 Probab=49.76 E-value=0.63 Score=21.69 Aligned_cols=334 Identities=15% Similarity=0.139 Sum_probs=117.2 Q ss_pred CcchHHHHHhh---------------------hhhhcCCccccccchhhhhhhhhhhhh-HHHHHhhhhhhhhhcccccc Q lcl|NC_015288. 1 MFNAEHLQEKW---------------------SPVLNNEAANPIADRYKKAVTSVLLEN-QERFLREERGMLQEVAVNSL 58 (468) Q Consensus 1 ~~~~~~l~~kw---------------------~p~l~~~~~~~i~~~~~~~~~~~llen-q~~~~~e~~~~l~e~~~~~~ 58 (468) +-.-+.|.++. .+.+..+.-. -.+..+++.....|.+ +.....++++.+.+.-+... T Consensus 41 ~~e~~~l~~~i~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~a~~~~l~~~~~~~~~~e~~~~~~~~a~~~ 119 (409) T protein:vir:45 41 KSELEALDERIAREEELRRQDQAYIESNEEEQRQNLDPENNS-QQDEKRAQVFDKWMRHGASELTSEERKALRELRAQGV 119 (409) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhcccCCCCCcc-hhhHHHHHHHHHHHHhhhhhccHHHHHHHHHHhhccC Confidence 11111121222 2222222111 1112222222222221 11111222332322211111 Q ss_pred CcccccccccccccccccccccccceehhhhHHhhhhhhhhheeeeecCCccceeeeeeeeeecCCCCCccccccccccc Q lcl|NC_015288. 59 GAGTVSPGGSALGSANTAGLAGFDPVLISLVRRAMPNLMAYDVCGVQPMSGPTGLIFAMRSRYENQAGEEALFNEPDAGF 138 (468) Q Consensus 59 g~~~~~~~~~~~~st~tg~~~~~~P~Lv~l~RRa~~~LIa~DI~GVQPmTGPTGLIFAMRsrY~~qsG~EA~fnEa~t~f 138 (468) +.+. .++.++ . ..+.+.++.+.| +..+-.+++-|-|+++.....+-... ..+ ..+ T Consensus 120 ~~~~-~gg~li-P-------~~~~~~ii~~~~---~~~~l~~~~~~~~~~~~~~~~~~~~~---~~~-~~~--------- 174 (409) T protein:vir:45 120 AQDE-KGGYTV-P-------ETFLAKVVEKMK---SYGGIASVAQILTTSDGRTMEWATAD---GTS-EVG--------- 174 (409) T ss_pred ccCc-CCceec-c-------HhHHHHHHHHHH---hhhhhhhhceeeecCCCceEEEEeec---cCc-ccc--------- Confidence 1100 001000 0 112233444444 33344677888888775544442221 000 000 Q ss_pred cccccccccccccccccccccCcccccccccccccccccccccccchhhhhccCCCCCccccceeEEEEEEEEeecc-cc Q lcl|NC_015288. 139 TAGLDATTGAYTPRTGAGVGGDAEGNNPALLNDSSPGTYETPRGFSREDLEQAGDAGKLFREMSFSIEKTSVTAKSR-AL 217 (468) Q Consensus 139 SG~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~a~~g~~t~~~gm~Ta~aE~lG~~g~~f~EMaFsIeK~tVtAKSR-aL 217 (468) --+++ +...++-...++.++.+++.. +- T Consensus 175 --------------------------------------------~~v~E-------~~~~~~~~~~f~~~~l~~~k~~~~ 203 (409) T protein:vir:45 175 --------------------------------------------VLLGE-------NEEAGEEDTDFGMGSLGALKMTSK 203 (409) T ss_pred --------------------------------------------ccccc-------cccccccccccceeeeeeeeeeee Confidence 00000 011122222222333332221 11 Q ss_pred cceecHHHHHhHHHhhCCChhHHHHHHHHHHHHHHhhHHHHHHHhhhcchhhccccccceeeeeecCCcchh-----HHH Q lcl|NC_015288. 218 KAEYTLELAQDLKAIHGLDAEQELANILSSEVLAEINREVVRRVYSVAKPGAANNVANAGIFDLDVDSNGRW-----SVE 292 (468) Q Consensus 218 KAEYTvELAQDLkAiHGLDAE~ELanILStEImlEINREII~~l~~vA~~~k~~~~~~~Gv~Dl~~~~~~rw-----~~e 292 (468) =..+|-||.+|- .+|.+++|.+-|+..|.+-+|+.||.-=-+ + ....+.|++.-........ ..+ T Consensus 204 ~i~is~ell~ds----~~~l~~~i~~~la~a~~~~~~~a~l~G~G~----~--~~~~p~Gil~~~~~~~~~~~~~~~~~d 273 (409) T protein:vir:45 204 IIRVSNELLQDS----AIDMEAYLARRIAERIGRGEARYLIQGTGA----G--TPKQPKGLAASVTGTTQTAAANAVKWQ 273 (409) T ss_pred ehhhhHHHHhcc----HHHHHHHHHHHHHHHHHHHHHHHhhccCCC----C--CccccceeeeccccccccccccccchH Confidence 135799999994 257899999999999999999998851000 0 0001223322111110000 011 Q ss_pred HHHHHHHHHHHHHHHHHHHhccCCccE-EEEchhHHHHHhhccccccccccccccccccccccCCCceeEEEecCCeEEE Q lcl|NC_015288. 293 KFKGLLFQIERDCNAIAQDTRRGKGNF-LICSADVASALAMAGVLDYSSGLTGAGGPAIGTVDDTGNLAVGTINGRIKVY 371 (468) Q Consensus 293 ~~~~l~~~i~~ean~i~q~T~rg~~n~-~v~S~~va~~L~~~G~~~~~~~~~~~~~~~~~~~D~t~~~~~G~l~~~~~vy 371 (468) ....++ +.+ ..--+..+.| +++++.....|.. |..+.+ ..+.+.|.+.. -.++|.| ++|+ T Consensus 274 ~i~~l~-------~~l-~~~~~~~a~~~~~~n~~~~~~l~~---lkd~~G------~~i~~~~~~~~-~~~~l~G-~PV~ 334 (409) T protein:vir:45 274 EILALK-------HSI-DPAYRRGPKFRLAFNDNTLKLISE---MEDGQG------RPLWLPDIVGV-APASVLN-VPYV 334 (409) T ss_pred HHHHHH-------Hhh-hhhhccCCeEEEEECHHHHHHHHH---hhcCCC------ceeeccCcCCC-CCceecc-eeeE Confidence 111121 122 2223445666 5789888777642 222111 01111121111 1146655 6888 Q ss_pred EccccccCCCcce-EEEEEecCCcccceeEEccccccccccccCCccccceeeee--eeccee-ecC--cccccCccccc Q lcl|NC_015288. 372 VDPYAANLSDKHY-YVVGYKGTSPYDAGLFYCPYVPLQMVRSIDPNNFQPKIGFK--TRYGMV-SNP--FVTTNGLYSGT 445 (468) Q Consensus 372 ~D~Ya~~~s~~dY-~~vG~Kg~~~~d~glfyaPYv~~~~~~~~Dp~s~qP~~g~~--tRY~l~-~nP--f~~~~~~~~~~ 445 (468) ++.+......-++ +++| +-. ..+...--........||-.-...++|. .||+.. .|| |....--.+ T Consensus 335 ~~~~~p~~~~~~~~i~~G---d~~---~~~i~~~~~~~~~~~~d~~~~~~~~~~~~~~r~d~~~~~~~A~~~l~~k~s-- 406 (409) T protein:vir:45 335 IDQEIDDIGAGKKFMFCG---DFD---RFIIRRVRYMILKRLVERYAEYDQTGFLAFHRFDCILEDTSAIKALVGKGS-- 406 (409) T ss_pred EecCcCCccCCccEEEEe---ehh---hhheeeccceEEEEeecccccCCcEEEEEEEEeccEeechhheEEEEeccC-- Confidence 8754321111111 2222 110 0011110011122223554333444444 366543 344 332111110 Q ss_pred CChh Q lcl|NC_015288. 446 PDGE 449 (468) Q Consensus 446 ~~~~ 449 (468) .|+ T Consensus 407 -~~~ 409 (409) T protein:vir:45 407 -VGG 409 (409) T ss_pred -CCC Confidence 011 No 77 >protein:vir:9704 Length: 394 # NCBI annotation: hypothetical protein # Family: family:all:21 # MgeID: mge:174 # MgeName: 315.2 # Cross-refs: genbank:acc:NP_795466;genbank:gi:28876225;genbank:GeneID:1257769 Probab=48.85 E-value=0.66 Score=21.58 Aligned_cols=324 Identities=14% Similarity=0.095 Sum_probs=119.1 Q ss_pred CcchHHHHHhhhhhh------------------------cCCccccccc-------hhhhhhhhhhhhhHHHHHhhhhh- Q lcl|NC_015288. 1 MFNAEHLQEKWSPVL------------------------NNEAANPIAD-------RYKKAVTSVLLENQERFLREERG- 48 (468) Q Consensus 1 ~~~~~~l~~kw~p~l------------------------~~~~~~~i~~-------~~~~~~~~~llenq~~~~~e~~~- 48 (468) .++.+ -.++|.-+. +.+..+.... ...++.....+..+......... T Consensus 30 ~~~~~-~~~~~~~l~~eie~l~~ei~~l~~~~~~~e~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 108 (394) T protein:vir:97 30 ALESD-DLEAARSIKAEVEQAKANLVEAENDLKLYESSVEVGGAENIGGKEVTQEEKTYRESVNDFIRSKGKIVNDSLRF 108 (394) T ss_pred hhchh-hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccccccchhhHHHHHHHHHHHHHHHHHhhhhhhh Confidence 11111 111222111 1111100000 00000011111111100000000 Q ss_pred -hhhhccccccCcccccccccccccc-cccccccccceehhhhHHhhhhhhhhheeeeecCCccceeeeeeeeeecCCCC Q lcl|NC_015288. 49 -MLQEVAVNSLGAGTVSPGGSALGSA-NTAGLAGFDPVLISLVRRAMPNLMAYDVCGVQPMSGPTGLIFAMRSRYENQAG 126 (468) Q Consensus 49 -~l~e~~~~~~g~~~~~~~~~~~~st-~tg~~~~~~P~Lv~l~RRa~~~LIa~DI~GVQPmTGPTGLIFAMRsrY~~qsG 126 (468) ...+.... +...........+.| ..|+..--....-.+++...+......++.+.||+++++-+--++. .++ T Consensus 109 ~~~~~~~~~--~~~~~~~~~~~~~~t~~~gg~liP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~----~~~ 182 (394) T protein:vir:97 109 EGKDEVLMP--INETTPVEPQKDGIKKENAKPVSSEEILYTPAREVKTVVDLKPFTTVYQAKKASGKYPVLQR----ATT 182 (394) T ss_pred hhHHHHHHH--HHhhhhhhhhccccccccccccChHHHHHHHHHHhhhhhhhhhhceeeeccCcceEEEEEec----CCC Confidence 00000000 000000000000011 1122111111122245545566677888999999888765422220 000 Q ss_pred CccccccccccccccccccccccccccccccccCcccccccccccccccccccccccchhhhhccCCCCCccccc-eeEE Q lcl|NC_015288. 127 EEALFNEPDAGFTAGLDATTGAYTPRTGAGVGGDAEGNNPALLNDSSPGTYETPRGFSREDLEQAGDAGKLFREM-SFSI 205 (468) Q Consensus 127 ~EA~fnEa~t~fSG~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~a~~g~~t~~~gm~Ta~aE~lG~~g~~f~EM-aFsI 205 (468) .-++. +| +...++. ..++ T Consensus 183 ~~~~v--------------------------------------------------------~E-----~~~~~~~~~~~~ 201 (394) T protein:vir:97 183 KMVTV--------------------------------------------------------AE-----LEKNPALAKPDF 201 (394) T ss_pred cccee--------------------------------------------------------cc-----cccccccccccc Confidence 00000 00 1112222 2345 Q ss_pred EEEEEEeecccccceecHHHHHhHHHhhCCChhHHHHHHHHHHHHHHhhHHHHHHHhhhcchhhccccccceeeeeecCC Q lcl|NC_015288. 206 EKTSVTAKSRALKAEYTLELAQDLKAIHGLDAEQELANILSSEVLAEINREVVRRVYSVAKPGAANNVANAGIFDLDVDS 285 (468) Q Consensus 206 eK~tVtAKSRaLKAEYTvELAQDLkAiHGLDAE~ELanILStEImlEINREII~~l~~vA~~~k~~~~~~~Gv~Dl~~~~ 285 (468) ++++..+|.-+-...+|-||.+|- +.|.+++|.+-|+..|..-+|..||.-+-+. .+.+...+ T Consensus 202 ~~v~l~~~k~~~~i~is~ell~ds----~~~~~~~i~~~la~~~~~~~~~~i~~g~~~~---------~~~~~~~~---- 264 (394) T protein:vir:97 202 KDVAWNIDTYRGAIPLSQESIDDA----DVDLVGIVSESISQIKVNTTNDAIAKVLKSF---------TTKTVKNL---- 264 (394) T ss_pred eeEEeehhheeeehhhHHHHHhhh----hHHHHHHHHHHHHHHHHHHHHHHHhhccccc---------cccccccH---- Confidence 666666666666788999999986 3467888888888888888888877643211 11222111 Q ss_pred cchhHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEchhHHHHHhhccccccccccccccccccccccCCCceeEEEec Q lcl|NC_015288. 286 NGRWSVEKFKGLLFQIERDCNAIAQDTRRGKGNFLICSADVASALAMAGVLDYSSGLTGAGGPAIGTVDDTGNLAVGTIN 365 (468) Q Consensus 286 ~~rw~~e~~~~l~~~i~~ean~i~q~T~rg~~n~~v~S~~va~~L~~~G~~~~~~~~~~~~~~~~~~~D~t~~~~~G~l~ 365 (468) +....+ + +.. .. ..+.+. +|+|+.+...|.. +..+.+ ..+...+-++. .-++|+ T Consensus 265 ------~~~~~~-~------~~~-~~-~~~~a~-~v~n~~~~~~l~~---lkd~~G------~~i~~~~~~~~-~~~~l~ 318 (394) T protein:vir:97 265 ------DEIKAL-L------NGG-FD-PAYNVS-LIVSQSFYQTLDT---LKDGNG------RYLLQDDITAV-SGKVLL 318 (394) T ss_pred ------HHHHHH-H------Hhh-hh-hhhCCE-EEEcHHHHHHHHH---hhccCC------CeeeecCcCCC-CCceec Confidence 111111 1 111 11 122344 5689999888753 222211 00111111111 124665 Q ss_pred CCeEEEE--ccccccCCCcceEEEEEecCCcccceeEEccccccccccccCCccccceeeeeeecceee-cCcccccCcc Q lcl|NC_015288. 366 GRIKVYV--DPYAANLSDKHYYVVGYKGTSPYDAGLFYCPYVPLQMVRSIDPNNFQPKIGFKTRYGMVS-NPFVTTNGLY 442 (468) Q Consensus 366 ~~~~vy~--D~Ya~~~s~~dY~~vG~Kg~~~~d~glfyaPYv~~~~~~~~Dp~s~qP~~g~~tRY~l~~-nPf~~~~~~~ 442 (468) | ++|++ |..+ +..-+++|-- ..++++..-..+.. ...|...++..+-...|++..+ +|= T Consensus 319 G-~pv~~~~~~~~----~~~~~~~gd~-----~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~r~d~~v~~~~------- 380 (394) T protein:vir:97 319 G-KPVFVLSDEVL----GANKAFIGDF-----KRGVLFADRKDLGL-RWADNEIYGQYLQAVLRFGVSKVDDK------- 380 (394) T ss_pred c-ceeEEeccccc----CCccEEEeec-----cccEEEEEecceEE-EEecccccceeEEEEEEEccEEeccc------- Confidence 5 56555 4322 2222333320 01111112111111 1234444555555566776543 330 Q ss_pred cccCChhhhhhccCceeeeEEeeccC Q lcl|NC_015288. 443 SGTPDGETLTPSTNMYYRRVQVTNLM 468 (468) Q Consensus 443 ~~~~~~~~~~~~~N~y~r~~~v~~~~ 468 (468) .|..+.+++.- T Consensus 381 ---------------a~~~~~~~~~~ 391 (394) T protein:vir:97 381 ---------------AGYYVTFTPEP 391 (394) T ss_pred ---------------ceEEEEecccc Confidence 11222222221 No 78 >protein:vir:2504 Length: 305 # NCBI annotation: major capsid subunit gp9 # Family: family:all:507 # MgeID: mge:53 # MgeName: TM4 # Cross-refs: genbank:acc:NP_569745;genbank:gi:18496895;genbank:GeneID:932268 Probab=47.21 E-value=0.71 Score=21.40 Aligned_cols=282 Identities=10% Similarity=0.043 Sum_probs=113.4 Q ss_pred ccccc-cccccccccceehhhhHHhhhhhhhhheeeeecCCccceeeeeeeeeecCCCCCcccccccccccccccccccc Q lcl|NC_015288. 69 ALGSA-NTAGLAGFDPVLISLVRRAMPNLMAYDVCGVQPMSGPTGLIFAMRSRYENQAGEEALFNEPDAGFTAGLDATTG 147 (468) Q Consensus 69 ~~~st-~tg~~~~~~P~Lv~l~RRa~~~LIa~DI~GVQPmTGPTGLIFAMRsrY~~qsG~EA~fnEa~t~fSG~~~~~~~ 147 (468) ....+ +.++..--....-.+++++.+..+-.+++-+.||++++--|--.. .+.++ .|-+.. T Consensus 1 ma~~t~~~gg~liP~~~~~~Ii~~~~~~s~l~~l~~~~~~~~~~~~~p~~~------~~~~a-------~wv~E~----- 62 (305) T protein:vir:25 1 MADISRAEVASLIQEAYSDTLLAAAKQGSTVLSAFQNVNMGTKTTHLPVLA------TLPEA-------DWVGES----- 62 (305) T ss_pred CCCccCCccceecCHHHHHHHHHHHHhhchhhhhcceeeccCCcEEEEEEe------CCcce-------EEeecc----- Confidence 11111 112211111112234555666777788899999987753322111 11111 111100 Q ss_pred ccccccccccccCcccccccccccccccccccccccchhhhhccCCCCCccccceeEEEEEEEEeecccccceecHHHHH Q lcl|NC_015288. 148 AYTPRTGAGVGGDAEGNNPALLNDSSPGTYETPRGFSREDLEQAGDAGKLFREMSFSIEKTSVTAKSRALKAEYTLELAQ 227 (468) Q Consensus 148 ~~~~~~~~~~~~~~~g~~~~~~~~a~~g~~t~~~gm~Ta~aE~lG~~g~~f~EMaFsIeK~tVtAKSRaLKAEYTvELAQ 227 (468) +... ...++.-..++++++..++..+-...+|-||.+ T Consensus 63 -----------------------------------------~~~~--~~~~~~s~~~f~~i~~~~~k~~~~~~is~ell~ 99 (305) T protein:vir:25 63 -----------------------------------------ATDP--KGVKPTSKVTWANRTLVAEEIAVIIPVHENVID 99 (305) T ss_pred -----------------------------------------cccc--cccccccccceeeEEeeeEEEEEeehhhHHHHh Confidence 0000 011222223344555555555556779999999 Q ss_pred hHHHhhCCChhHHHHHHHHHHHHHHhhHHHHHHHhhhcchhhccccccceeeeee-----cCCcchhHHHHHHHHHHHHH Q lcl|NC_015288. 228 DLKAIHGLDAEQELANILSSEVLAEINREVVRRVYSVAKPGAANNVANAGIFDLD-----VDSNGRWSVEKFKGLLFQIE 302 (468) Q Consensus 228 DLkAiHGLDAE~ELanILStEImlEINREII~~l~~vA~~~k~~~~~~~Gv~Dl~-----~~~~~rw~~e~~~~l~~~i~ 302 (468) |-. .|.|++|.+-|+..|+..+++.+|.-- |+..+....++.... ....... ...+..+ . T Consensus 100 ds~----~~~~~~i~~~l~~~~a~~~d~a~~~G~------g~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~----~ 164 (305) T protein:vir:25 100 DAT----VAVLTEVAELGGQAIGKKLDQAVIFGT------DKPASWVSPALIPAAVTAGQAVEVVGG-VANESDI----V 164 (305) T ss_pred cch----HHHHHHHHHHHHHHHHHHHhhhheecc------CCCCCcccccccccccccccccccccc-chhhhHH----H Confidence 843 578999999999999999999988521 111111111111100 0000000 0001111 1 Q ss_pred HHHHHHHHHh--ccCCccEEEEchhHHHHHhhccccccccccccccccccccccCCCceeEEEecCCeEEEEccccccCC Q lcl|NC_015288. 303 RDCNAIAQDT--RRGKGNFLICSADVASALAMAGVLDYSSGLTGAGGPAIGTVDDTGNLAVGTINGRIKVYVDPYAANLS 380 (468) Q Consensus 303 ~ean~i~q~T--~rg~~n~~v~S~~va~~L~~~G~~~~~~~~~~~~~~~~~~~D~t~~~~~G~l~~~~~vy~D~Ya~~~s 380 (468) .......... -.+..|=+++++.-...|.. +..+.+ + .... -++| .+++|++..+..... T Consensus 165 ~~~~~~~~~~~~~~~~~~~~v~~~~~~~~l~~---lkd~~G---~---~i~~--------~~~l-~G~Pv~~~~~~~~~~ 226 (305) T protein:vir:25 165 GATNRAAKAVASAGWAPDTLLSSLALRYEVAN---IRDANG---N---PVFR--------DDSF-AGFRTFFNRNGAWDA 226 (305) T ss_pred HHHHHHHHhhhhcccccceeEecHHHHHHHHH---hhccCC---c---eeec--------CCcc-cccceEEcCccCCCC Confidence 1111111111 12333447888888877742 111110 0 0000 1345 347777775432211 Q ss_pred Cc--------ceEEEEEecCCcccceeEEccccccccccccCCcc-cc-ceee--eeeecce-eecCcc--cccCccccc Q lcl|NC_015288. 381 DK--------HYYVVGYKGTSPYDAGLFYCPYVPLQMVRSIDPNN-FQ-PKIG--FKTRYGM-VSNPFV--TTNGLYSGT 445 (468) Q Consensus 381 ~~--------dY~~vG~Kg~~~~d~glfyaPYv~~~~~~~~Dp~s-~q-P~~g--~~tRY~l-~~nPf~--~~~~~~~~~ 445 (468) .. ..+++|..+..+.+- ..+. .+...-.|.+ || ..++ ...|||+ +.||=+ ....-+. T Consensus 227 ~~~~~~~gd~s~~~i~~~~~~~i~~----~~~~--~~~~~~~~~~~~~~~~~~~R~~~r~~~~v~~p~a~v~~~~~~~-- 298 (305) T protein:vir:25 227 DAAIEVIADSSRVKIGVRQDITVKF----LDQA--TLGTGENQINLAERDMVALRLKARFAYVLGVSATAQGANKTPV-- 298 (305) T ss_pred CccEEEEEecceEEEEEecCeEEEE----eeee--eeecCCceeeeeecCcEEEEEEEeecceeeCcccEEEEccccc-- Confidence 11 112222222221111 1110 0111111111 22 1223 3568995 568833 2221111 Q ss_pred CChhhhhhcc Q lcl|NC_015288. 446 PDGETLTPST 455 (468) Q Consensus 446 ~~~~~~~~~~ 455 (468) +.+...+ T Consensus 299 ---~~~~pa~ 305 (305) T protein:vir:25 299 ---AVVAPAA 305 (305) T ss_pred ---cccCCCC Confidence 1122223 No 79 >protein:vir:100172 Length: 394 # NCBI annotation: putative major head protein # Family: family:all:21 # MgeID: mge:1524 # MgeName: phi AT3 # Cross-refs: genbank:acc:YP_025031;genbank:gi:48697264;genbank:GeneID:2948270 Probab=45.25 E-value=0.78 Score=21.19 Aligned_cols=341 Identities=16% Similarity=0.181 Sum_probs=119.5 Q ss_pred Ccc-----hHHHHHhhhhhhcCC-----ccccccchhhhhh-hhhhhhhHHHHHhhhhhhhhhccc-------------- Q lcl|NC_015288. 1 MFN-----AEHLQEKWSPVLNNE-----AANPIADRYKKAV-TSVLLENQERFLREERGMLQEVAV-------------- 55 (468) Q Consensus 1 ~~~-----~~~l~~kw~p~l~~~-----~~~~i~~~~~~~~-~~~llenq~~~~~e~~~~l~e~~~-------------- 55 (468) +++ .+++.++=.-.++.+ .+.++.....+.. ...-|+.|.+++.+..+...+... T Consensus 7 l~~~~~~~~~e~~~~~~~~~~~~~~~~ee~~~~~~~~~~~~~~~~~l~~~i~~~e~~~~~~~~~~~~~~~~~~~~~~~~~ 86 (394) T protein:vir:10 7 LFNEVSAKCADLNAQLNAKLQDENASVDDFQKIKDDLTAAKARRDAINDQIKDLEAENKANSDPDKPVDNAQPNGTDLKK 86 (394) T ss_pred HHHHHHHHHHHHHHHHHHHHhhhhccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcchhhhhhhhcccccchhh Confidence 000 000000000000000 0000110000000 000112222222111110000000 Q ss_pred -------cccC-----cccccccccccccccccccccccceehhhhHHhhhhhhhhheeeeecCCccceeeeeeeeeecC Q lcl|NC_015288. 56 -------NSLG-----AGTVSPGGSALGSANTAGLAGFDPVLISLVRRAMPNLMAYDVCGVQPMSGPTGLIFAMRSRYEN 123 (468) Q Consensus 56 -------~~~g-----~~~~~~~~~~~~st~tg~~~~~~P~Lv~l~RRa~~~LIa~DI~GVQPmTGPTGLIFAMRsrY~~ 123 (468) .+++ .....-......+++.|++.--.+..-.++++..+..+-.+++.+.||+++++-+--.+. . T Consensus 87 ~~~~~~~~~~~~~l~~~~~~~~~~~~~~t~~~gg~~vP~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~--~- 163 (394) T protein:vir:10 87 KPIDAKKKAINDFIHSHGKVIDNAAGHVTSTEAGVLIPEEIIYDPTAEVNSVVDLSTLVTKTPVTTPKGTYPILKR--A- 163 (394) T ss_pred hHHHHHHHHHHHHHhccchhhhhhhcccccccCceeccHHHHHHHHHHHHhhhhhhhhceeeeccCCceEEEEEec--C- Confidence 0000 000000000001111222222122222355556666677899999999998766654441 0 Q ss_pred CCCCccccccccccccccccccccccccccccccccCcccccccccccccccccccccccchhhhhccCC-CCCccccce Q lcl|NC_015288. 124 QAGEEALFNEPDAGFTAGLDATTGAYTPRTGAGVGGDAEGNNPALLNDSSPGTYETPRGFSREDLEQAGD-AGKLFREMS 202 (468) Q Consensus 124 qsG~EA~fnEa~t~fSG~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~a~~g~~t~~~gm~Ta~aE~lG~-~g~~f~EMa 202 (468) ++.-+ +-+ +.+...+ +...|.+.. T Consensus 164 -~~~~~--------~~~----------------------------------------------E~~~~~~~~~~~~~~v~ 188 (394) T protein:vir:10 164 -TDRFS--------SVA----------------------------------------------ELAENPALAEPEFEQVD 188 (394) T ss_pred -CCccc--------ccc----------------------------------------------ccccccccccccceeEE Confidence 00000 000 0000000 112455555 Q ss_pred eEEEEEEEEeecccccceecHHHHHhHHHhhCCChhHHHHHHHHHHHHHHhhHHHHHHHhhhcchhhccccccceeeeee Q lcl|NC_015288. 203 FSIEKTSVTAKSRALKAEYTLELAQDLKAIHGLDAEQELANILSSEVLAEINREVVRRVYSVAKPGAANNVANAGIFDLD 282 (468) Q Consensus 203 FsIeK~tVtAKSRaLKAEYTvELAQDLkAiHGLDAE~ELanILStEImlEINREII~~l~~vA~~~k~~~~~~~Gv~Dl~ 282 (468) |.+.|. +-...+|-||.+|- ..|.++.|.+-|+..|..-+|+.|+.-+- .+....+.+.- T Consensus 189 l~~~k~-------~~~~~iS~ell~ds----~~~l~~~i~~~la~~~~~~~~~~il~g~g----~~~~~~~~~~~----- 248 (394) T protein:vir:10 189 WSVSTY-------RGAIPLSEEAIADS----AVDLTSLVGQSINEKSVNTYNAMIAPVLQ----SFTAKATTTDT----- 248 (394) T ss_pred eeeeee-------EeeehhHHHHHhhh----hHHHHHHHHHHHHHHHHHHHHHHHhhccc----ccccccccccc----- Confidence 555554 44567999999984 25788899999999999999999876432 12111111100 Q ss_pred cCCcchhHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEchhHHHHHhhccccccccc-cccccccccccccCCCceeE Q lcl|NC_015288. 283 VDSNGRWSVEKFKGLLFQIERDCNAIAQDTRRGKGNFLICSADVASALAMAGVLDYSSG-LTGAGGPAIGTVDDTGNLAV 361 (468) Q Consensus 283 ~~~~~rw~~e~~~~l~~~i~~ean~i~q~T~rg~~n~~v~S~~va~~L~~~G~~~~~~~-~~~~~~~~~~~~D~t~~~~~ 361 (468) . .+....++...... .+.+ .+|+++.....|.. +..+.+ ..-+.+ -...+..... T Consensus 249 -----~--~d~l~~~~~~~~~~---------~~~a-~~vmn~~~~~~l~~---lkd~~G~~i~~~~----~~~~~~~~~~ 304 (394) T protein:vir:10 249 -----L--VDSLKHILNVDLDP---------AYSR-ALVVTQSLFNTLDT---LKDKNGRYLLHDA----SDSITDGTAK 304 (394) T ss_pred -----c--HHHHHHHHHhhhhh---------hccC-EEEecHHHHHHHHH---hhccCCCeeeecc----ccccccCCcc Confidence 0 11121121111111 1222 47789888877753 222211 000000 0111222233 Q ss_pred EEecCCeEEEE-c-cccccCCCcceEEEEE-ecCCcccceeEEccccccccccccCCccccceeeeeeeccee-ecC--c Q lcl|NC_015288. 362 GTINGRIKVYV-D-PYAANLSDKHYYVVGY-KGTSPYDAGLFYCPYVPLQMVRSIDPNNFQPKIGFKTRYGMV-SNP--F 435 (468) Q Consensus 362 G~l~~~~~vy~-D-~Ya~~~s~~dY~~vG~-Kg~~~~d~glfyaPYv~~~~~~~~Dp~s~qP~~g~~tRY~l~-~nP--f 435 (468) ++|.| ++|++ | .+..+.....-+++|- +. ++....- ...-....|...|.-.+-...|++.. .|| | T Consensus 305 ~~L~G-~PV~~~~~~~~~~~~~~~~i~~gd~s~------~~~~~~~-~~~~v~~~~~~~~~~~~~~~~r~d~~~~~~~ai 376 (394) T protein:vir:10 305 GTVLG-VPVYVVGDALLGSAAGDQKAFVGDLKR------GVLFADR-QQVTLAWEDSKIYGRYLGAAFRFGVKQADSNAG 376 (394) T ss_pred ccccc-ceeEEecccccCCCCCceEEEEeeccc------cEEEEee-cceEEEEecccccceeEEEEEEeccEEeccccE Confidence 56654 55554 3 2221111111122221 10 0000000 00001113444455556666788754 344 2 Q ss_pred ccc--cCcccccCChhhhhhccCc Q lcl|NC_015288. 436 VTT--NGLYSGTPDGETLTPSTNM 457 (468) Q Consensus 436 ~~~--~~~~~~~~~~~~~~~~~N~ 457 (468) ... ..-.++. .+++.+ T Consensus 377 ~~~~~~~~~~~~------~~~~~~ 394 (394) T protein:vir:10 377 YFVTNTDAASGS------TSGTGK 394 (394) T ss_pred EEEEeecccCCC------CCCCCC Confidence 221 1111111 112221 No 80 >protein:vir:1239 Length: 274 # NCBI annotation: similar to phage B1 major head protein # Family: family:all:522 # MgeID: mge:25 # MgeName: phi ETA # Cross-refs: genbank:acc:NP_510938;genbank:gi:17426272;genbank:GeneID:927376 Probab=44.56 E-value=0.8 Score=21.11 Aligned_cols=261 Identities=14% Similarity=0.049 Sum_probs=104.5 Q ss_pred ccceeeeeeeeeecCCCCCccccc---c---ccccccccccccccccccccccccccCcccccccccccccccccccccc Q lcl|NC_015288. 109 GPTGLIFAMRSRYENQAGEEALFN---E---PDAGFTAGLDATTGAYTPRTGAGVGGDAEGNNPALLNDSSPGTYETPRG 182 (468) Q Consensus 109 GPTGLIFAMRsrY~~qsG~EA~fn---E---a~t~fSG~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~a~~g~~t~~~g 182 (468) +..+. -+..+.=-.|=|-. + ..--|++-... .... .+ .+ ..+.+...- T Consensus 1 ma~~~-----T~l~d~iiPev~~~~v~~~~~~~l~~~~~~~~-d~~l------------~g-~~-------G~tv~iP~~ 54 (274) T protein:vir:12 1 MAQGL-----TKTSNQIIPEVLAPMMQAQLEKKLRFASFAEV-DSTL------------QG-QP-------GDTLTFPAF 54 (274) T ss_pred CCcce-----eehhhhhchHHHHHHHHHHHHhhhhhccccee-cccc------------cC-CC-------CCEEEEeee Confidence 01100 00000000010000 0 00001111000 0000 00 00 001111110 Q ss_pred cchhhhhccCC-CCCccccceeEEEEEEEEeecccccceecHHHHHhHHHhhCCChhHHHHHHHHHHHHHHhhHHHHHHH Q lcl|NC_015288. 183 FSREDLEQAGD-AGKLFREMSFSIEKTSVTAKSRALKAEYTLELAQDLKAIHGLDAEQELANILSSEVLAEINREVVRRV 261 (468) Q Consensus 183 m~Ta~aE~lG~-~g~~f~EMaFsIeK~tVtAKSRaLKAEYTvELAQDLkAiHGLDAE~ELanILStEImlEINREII~~l 261 (468) -...++|.+.. .+-+..++..+ +.+++.+-|+-.=+++=| ..+.+ +-|.-.+..+-++..|+.+++.+++..+ T Consensus 55 ~~ig~a~~~~~g~~i~~~~lt~~--~~~~~i~~~~~~~~i~D~--~~~~~--~~d~~~~~~~q~~~~~a~~vd~~~l~~~ 128 (274) T protein:vir:12 55 VYSGDAQVVAEGEKIPTDILETK--KREAKIRKIAKGTSITDE--ALLSG--YGDPQGEQVRQHGLAHANKVDNDVLEAL 128 (274) T ss_pred cCCCccccccCCCccchhhcccc--eeeEEeeeecceeeecHH--HHHhc--ccchHHHHHHHHHHHHHHHHHHHHHHHH Confidence 01112222221 12233444333 334444444432222221 12223 5688899999999999999999999887 Q ss_pred hhhcchhhccccccceeeeeecCCcchhHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEchhHHHHHhhccccccccc Q lcl|NC_015288. 262 YSVAKPGAANNVANAGIFDLDVDSNGRWSVEKFKGLLFQIERDCNAIAQDTRRGKGNFLICSADVASALAMAGVLDYSSG 341 (468) Q Consensus 262 ~~vA~~~k~~~~~~~Gv~Dl~~~~~~rw~~e~~~~l~~~i~~ean~i~q~T~rg~~n~~v~S~~va~~L~~~G~~~~~~~ 341 (468) .+...... ...+ ..+.+-..+.++..+ -..+++++++|+|++.|.-....++... T Consensus 129 ~~a~~~~~------~~a~----------~~d~i~dA~~~lgd~---------~~~~~~ivv~p~~~~~L~k~~~~~fv~~ 183 (274) T protein:vir:12 129 MGAKLTVN------ADIT----------KLNGLQSAIDKFNDE---------DLEPMVLFINPLDAGKLRGDASTNFTRA 183 (274) T ss_pred hccccccc------cccc----------CHHHHHHHHHHhccc---------cccccEEEeCHHHHHHHHhhhhhhcccc Confidence 75332211 1111 122222222333221 1367899999999999965432333221 Q ss_pred cccccccccccccCCCceeEEEecCCeEEEEccccccCCCcceEEEEEe-cCCcccceeEEccccccccccccCCccccc Q lcl|NC_015288. 342 LTGAGGPAIGTVDDTGNLAVGTINGRIKVYVDPYAANLSDKHYYVVGYK-GTSPYDAGLFYCPYVPLQMVRSIDPNNFQP 420 (468) Q Consensus 342 ~~~~~~~~~~~~D~t~~~~~G~l~~~~~vy~D~Ya~~~s~~dY~~vG~K-g~~~~d~glfyaPYv~~~~~~~~Dp~s~qP 420 (468) .. .+ ..-..+-.+|++. |++||+| +.-|. |-.+-++ |+-. |+. -.+...-.-=||..++- T Consensus 184 s~--~g-----~~~~~~G~ig~~~-G~~Vi~s----~~~p~-~t~~l~~~gA~~-----~~~-~~~~~vE~~Rd~~~~~d 244 (274) T protein:vir:12 184 TE--LG-----DDIIVKGAFGEAL-GAIIVRS----NKLEA-GTAILAKKGAVK-----LIL-KRDFFLEVARDASTKTT 244 (274) T ss_pred cc--cc-----ccceecccceeec-CeeEEEe----CCCCc-ceEEEEecccee-----eee-cCCceeccccchhhccc Confidence 11 11 0111122477874 6899999 55553 2222222 2111 110 01111111118888888 Q ss_pred eeeeeeecceee-cC--ccccc-Ccc-ccc Q lcl|NC_015288. 421 KIGFKTRYGMVS-NP--FVTTN-GLY-SGT 445 (468) Q Consensus 421 ~~g~~tRY~l~~-nP--f~~~~-~~~-~~~ 445 (468) .+-..-+||..+ || -.... ... -+| T Consensus 245 ~i~~~~~y~~~~~~~~~vv~~t~~~~~~~~ 274 (274) T protein:vir:12 245 ALYSDKHYVAYLYDESKAVKITKGSGSLEM 274 (274) T ss_pred EEEeeeEEEEEEEcCCceEEEEcCCccccC Confidence 888888888543 55 11111 111 111 No 81 >protein:vir:80930 Length: 278 # NCBI annotation: Cps # Family: family:all:522 # MgeID: mge:1886 # MgeName: A500 # Cross-refs: genbank:acc:YP_001468392;genbank:gi:157324966;genbank:GeneID:5601363 Probab=44.53 E-value=0.8 Score=21.11 Aligned_cols=266 Identities=11% Similarity=0.036 Sum_probs=109.3 Q ss_pred ecCCCC--Ccccccccccc-----ccccccccccccccccccccccCcccccccccccccccccccccccchhhhhccCC Q lcl|NC_015288. 121 YENQAG--EEALFNEPDAG-----FTAGLDATTGAYTPRTGAGVGGDAEGNNPALLNDSSPGTYETPRGFSREDLEQAGD 193 (468) Q Consensus 121 Y~~qsG--~EA~fnEa~t~-----fSG~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~a~~g~~t~~~gm~Ta~aE~lG~ 193 (468) -.+... ..-|..|-.+. +-...-....... .....+ ....+.+...--...+++.+.. T Consensus 1 Ma~~~T~~~~~iiPev~s~~v~~~~~~~~v~~~~~~~-------~~~l~g--------~~G~tv~ip~~~~~g~a~~~~~ 65 (278) T protein:vir:80 1 MADLTTKLANLIDPEVMGPMISAKLPKAIKFGKIAPI-------DNSLEG--------QPGSEITVPKYKYIGDAQDVAE 65 (278) T ss_pred CCCcceehhheecHHHHHHHHHHHHHHhhhhccccee-------cccccC--------CCCCEEEEeeeccCCcceeecC Confidence 111000 00011111000 0000000000000 000000 0000111111001122333322 Q ss_pred CCCccccceeEEEEEEEEeecccccceecHHHHHhHHH-hhCCChhHHHHHHHHHHHHHHhhHHHHHHHhhhcchhhccc Q lcl|NC_015288. 194 AGKLFREMSFSIEKTSVTAKSRALKAEYTLELAQDLKA-IHGLDAEQELANILSSEVLAEINREVVRRVYSVAKPGAANN 272 (468) Q Consensus 194 ~g~~f~EMaFsIeK~tVtAKSRaLKAEYTvELAQDLkA-iHGLDAE~ELanILStEImlEINREII~~l~~vA~~~k~~~ 272 (468) +..+..-..+..+.+++-|-|+- + ++ .-|+.+ .-+-|.-.+..+-++..+..+++++++..|...... T Consensus 66 -g~~i~~~~lt~~~~~~~i~~~~~-a---~~-v~D~~~~~~~~d~~~~~~~~~a~~~a~~~d~~l~~~l~~a~~~----- 134 (278) T protein:vir:80 66 -GAAIDYSALETESVKHGIKKAGK-G---VK-LTDESVLSGYGDPVEEAQKQIRMAIASKVDNDILEEALTTTLE----- 134 (278) T ss_pred -CCcCcccccccceeeEeeehhhc-c---cc-ccHHHHhhccccHHHHHHHHHHHHHHHHHHHHHHHHHhccccc----- Confidence 22333334456666666676652 2 22 334444 346789999999999999999999999987643221 Q ss_pred cccceeeeeecCCcchhHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEchhHHHHHhhcccccccccccccccccccc Q lcl|NC_015288. 273 VANAGIFDLDVDSNGRWSVEKFKGLLFQIERDCNAIAQDTRRGKGNFLICSADVASALAMAGVLDYSSGLTGAGGPAIGT 352 (468) Q Consensus 273 ~~~~Gv~Dl~~~~~~rw~~e~~~~l~~~i~~ean~i~q~T~rg~~n~~v~S~~va~~L~~~G~~~~~~~~~~~~~~~~~~ 352 (468) +..+-+.|..+. . +..-.++....-.-.--...+++++|++.+.|..-...++........ T Consensus 135 ~~~~~t~~~~~~--------~-----~~~~~da~~~l~~~~~~~~~~ivv~p~~~~~L~k~~~~~~~~~~~~g~------ 195 (278) T protein:vir:80 135 VKGAINIGLIDK--------I-----ENTFTDAPDAIEDESITTTGVLFLNYKDTAKLREEAAGSWTKASQLGD------ 195 (278) T ss_pred cccccccchhhh--------H-----HHHHHHHHHhhcccCCCcccEEEECHHHHHHHHhhhhhhccccccccc------ Confidence 111111221110 0 111111111111001112347999999999996544334432111100 Q ss_pred ccCCCceeEEEecCCeEEEEccccccCCCcceEEEEEecCCccccee-EEccccccccccccCCccccceeeeeeeccee Q lcl|NC_015288. 353 VDDTGNLAVGTINGRIKVYVDPYAANLSDKHYYVVGYKGTSPYDAGL-FYCPYVPLQMVRSIDPNNFQPKIGFKTRYGMV 431 (468) Q Consensus 353 ~D~t~~~~~G~l~~~~~vy~D~Ya~~~s~~dY~~vG~Kg~~~~d~gl-fyaPYv~~~~~~~~Dp~s~qP~~g~~tRY~l~ 431 (468) ....+-.+|++ .|++||++ ++-|. +-.+-++ .. .+ |+.. .+...-.-=||..++-.+-...+||+. T Consensus 196 -~~~~~G~ig~~-~G~~Vi~s----~~~p~-~t~~l~~-~g----Ai~~~~~-~~~~vE~~Rd~~~~~d~i~~~~~yg~~ 262 (278) T protein:vir:80 196 -DLLVKGAFGEL-LGWEIVRT----KKLAD-GNALAVK-AG----ALKTFLK-RNLLAESGRDMDHKLTKFNADQHYAVA 262 (278) T ss_pred -cceeeccceee-cceeEEEc----CCCCc-ceEEEEe-cc----ceeeeec-CCcccccccchhhccceeeeeeEEEEE Confidence 01112347887 47899999 55552 2211111 11 11 1111 011111111888899888888888886 Q ss_pred e-cCcc--cccCcccccCChhhhhhccC Q lcl|NC_015288. 432 S-NPFV--TTNGLYSGTPDGETLTPSTN 456 (468) Q Consensus 432 ~-nPf~--~~~~~~~~~~~~~~~~~~~N 456 (468) + ||-. ....... | T Consensus 263 v~~~~~~v~it~~a~------------~ 278 (278) T protein:vir:80 263 LVDETKAVKVVPVAG------------N 278 (278) T ss_pred EEcCcceEEEeeccC------------C Confidence 5 5521 1111111 0 No 82 >protein:vir:3613 Length: 272 # NCBI annotation: MHP # Family: family:all:522 # MgeID: mge:74 # MgeName: TP901-1 # Cross-refs: genbank:acc:NP_112699;genbank:gi:13786567;genbank:GeneID:921035 Probab=42.79 E-value=0.87 Score=20.91 Aligned_cols=266 Identities=13% Similarity=0.023 Sum_probs=103.8 Q ss_pred eeeeecCCCCC-ccccccccccccccccccccccccccccccccCcccccccccccccccccccccccchhhhhccCCCC Q lcl|NC_015288. 117 MRSRYENQAGE-EALFNEPDAGFTAGLDATTGAYTPRTGAGVGGDAEGNNPALLNDSSPGTYETPRGFSREDLEQAGDAG 195 (468) Q Consensus 117 MRsrY~~qsG~-EA~fnEa~t~fSG~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~a~~g~~t~~~gm~Ta~aE~lG~~g 195 (468) |=.. ++.. .-+..|..+.+=-..-.. ..........+.. +......+.+...--...++|.+.+ + T Consensus 1 ma~~---~T~~~d~iiPev~~~~v~~~~~~---------~~~~~~~~~~~~~-l~g~~G~ti~iP~~~~~gda~~~~e-g 66 (272) T protein:vir:36 1 MSKQ---KTTLADLVNPEVLAPIVSYELNK---------ALRFAPLAQVDTT-LQGQPGNTLKFPAFTYIGDAADVAE-G 66 (272) T ss_pred CCCc---ceehhhhhchHHHHHHHHHHHHh---------hhhhccccccccc-cccCCCCEEEEeeeccCccccccCC-C Confidence 2110 0000 000111100000000000 0000000000000 0000001111111111223333332 2 Q ss_pred CccccceeEEEEEEEEeecccccceecHHHHHhHHHhhCCChhHHHHHHHHHHHHHHhhHHHHHHHhhhcchhhcccccc Q lcl|NC_015288. 196 KLFREMSFSIEKTSVTAKSRALKAEYTLELAQDLKAIHGLDAEQELANILSSEVLAEINREVVRRVYSVAKPGAANNVAN 275 (468) Q Consensus 196 ~~f~EMaFsIeK~tVtAKSRaLKAEYTvELAQDLkAiHGLDAE~ELanILStEImlEINREII~~l~~vA~~~k~~~~~~ 275 (468) .++..=..+..+.+++-|-|+-.-++|=|. ++.-+-|.-.+..+-++..++.+++++|+..+..... .++. T Consensus 67 ~~i~~~~lt~~~~~~~i~~~~k~~~vtD~~----~~~~~~d~~~~~~~~~a~~~a~~~d~~i~~~l~~~~~-----~~~~ 137 (272) T protein:vir:36 67 GEISLDKIGTTTKSVTIKKAAKGTEITDEA----ALSGYGDPIGESNKQLGLSLANKVDDDLLSAAKTTSQ-----TVST 137 (272) T ss_pred CccChhhcCCcceeEeeehhhccccccHHH----HhhccchHHHHHHHHHHHHHHHHHHHHHHHHhccccc-----cccc Confidence 222222334555566666665322232222 1233678999999999999999999999987754322 1111 Q ss_pred ceeeeeecCCcchhHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEchhHHHHHhhccccccccccccccccccccccC Q lcl|NC_015288. 276 AGIFDLDVDSNGRWSVEKFKGLLFQIERDCNAIAQDTRRGKGNFLICSADVASALAMAGVLDYSSGLTGAGGPAIGTVDD 355 (468) Q Consensus 276 ~Gv~Dl~~~~~~rw~~e~~~~l~~~i~~ean~i~q~T~rg~~n~~v~S~~va~~L~~~G~~~~~~~~~~~~~~~~~~~D~ 355 (468) + +. .+.+-..+..+. |+ -...++++|+|++++.|.--.-..+.....+ .+.-- T Consensus 138 ~--~~----------~d~i~~A~~~lg-d~--------~~~~~~ivv~p~~~~~L~k~~~~~~~~~~~~------~~~~~ 190 (272) T protein:vir:36 138 K--AN----------VDGVQAALDIFN-DE--------DAQAYVLIVNPKDAAKIRKDANAKNIGSEVG------ANALI 190 (272) T ss_pred c--cc----------HHHHHHHHHHhh-hc--------CCCceEEEEcHHHHHHHhccccccccccccc------cccee Confidence 1 11 111211222222 11 2346799999999998843221111111000 00111 Q ss_pred CCceeEEEecCCeEEEEccccccCCCcc---eEEEEE-ecCCcccceeEEccccccccccccCCccccceeeeeeeccee Q lcl|NC_015288. 356 TGNLAVGTINGRIKVYVDPYAANLSDKH---YYVVGY-KGTSPYDAGLFYCPYVPLQMVRSIDPNNFQPKIGFKTRYGMV 431 (468) Q Consensus 356 t~~~~~G~l~~~~~vy~D~Ya~~~s~~d---Y~~vG~-Kg~~~~d~glfyaPYv~~~~~~~~Dp~s~qP~~g~~tRY~l~ 431 (468) + -.+|++. +++|++| ++-|.+ |..+.+ +|.-- +|--.=++.+.. =|+..++-.+--..+||+. T Consensus 191 ~--G~ig~~~-G~~Vv~s----~~~p~~~~~~~~~~~~~gA~~----~~~~~~~~vE~~--R~~~~~~d~i~~~~~y~~~ 257 (272) T protein:vir:36 191 N--GTYADVL-GAQIVRS----KKLAEGSALMFKIVSNSPALK----LVLKRGVQVETD--RDIVTKTTVITADEHYAAY 257 (272) T ss_pred e--eccceec-CeeEEEe----CCCCCCceeEEEEEeccccee----eeecCCcccccc--cchhhcCcEEEEEEEEEEE Confidence 1 2357774 4899999 433321 111111 11110 110000011111 1788888877777778776 Q ss_pred e-cCcccccCcccccCChhhhhhccCceeeeEEeecc Q lcl|NC_015288. 432 S-NPFVTTNGLYSGTPDGETLTPSTNMYYRRVQVTNL 467 (468) Q Consensus 432 ~-nPf~~~~~~~~~~~~~~~~~~~~N~y~r~~~v~~~ 467 (468) + || +. .-++-+||+ T Consensus 258 v~~~--------------~~--------vv~~t~~g~ 272 (272) T protein:vir:36 258 LYDL--------------TK--------VVNITFTGV 272 (272) T ss_pred EEcC--------------cc--------EEEEeecCC Confidence 5 22 11 133444555 No 83 >protein:vir:8187 Length: 311 # NCBI annotation: gp7 # Family: family:all:966 # MgeID: mge:153 # MgeName: Che9d # Cross-refs: genbank:acc:NP_817980;genbank:gi:29566414;genbank:GeneID:2700968 Probab=41.20 E-value=0.94 Score=20.74 Aligned_cols=283 Identities=11% Similarity=0.064 Sum_probs=112.8 Q ss_pred cccccccccccccccccceehhhhHHhhhhhhhhheeeeecCCccceeeeeeeeeecCCCCCcccccccccccccccccc Q lcl|NC_015288. 66 GGSALGSANTAGLAGFDPVLISLVRRAMPNLMAYDVCGVQPMSGPTGLIFAMRSRYENQAGEEALFNEPDAGFTAGLDAT 145 (468) Q Consensus 66 ~~~~~~st~tg~~~~~~P~Lv~l~RRa~~~LIa~DI~GVQPmTGPTGLIFAMRsrY~~qsG~EA~fnEa~t~fSG~~~~~ 145 (468) ++ .+++|+..--....-.+++++.+.-+-.+++.+-||++..- -+- ++. ++.++ .| T Consensus 1 ma----t~~~gg~lvP~~~~~~ii~~~~~~s~i~~~~~~i~~~~~~~-~~p---~~~--~~~~a-------~w------- 56 (311) T protein:vir:81 1 MV----ALATGTFQLPKHLVPGVWQKAQGQSVLARLSMAEPQEFGEQ-QYM---TLT--APPRG-------EV------- 56 (311) T ss_pred Cc----eecCCceEcchhHHHHHHHHHHhcchhhhhcceeecCCCce-EEE---EEe--CCcee-------EE------- Confidence 11 22233332111122234555667778889999999875421 111 111 00000 00 Q ss_pred ccccccccccccccCcccccccccccccccccccccccchhhhhccCCCCCccccceeEEEEEEEEeecccccceecHHH Q lcl|NC_015288. 146 TGAYTPRTGAGVGGDAEGNNPALLNDSSPGTYETPRGFSREDLEQAGDAGKLFREMSFSIEKTSVTAKSRALKAEYTLEL 225 (468) Q Consensus 146 ~~~~~~~~~~~~~~~~~g~~~~~~~~a~~g~~t~~~gm~Ta~aE~lG~~g~~f~EMaFsIeK~tVtAKSRaLKAEYTvEL 225 (468) .+ | +..+++...++++++..+|.=+-....|-|| T Consensus 57 ---------------------------------------v~--E-----g~~~~~~~~~f~~v~l~~~kl~~~~~iS~el 90 (311) T protein:vir:81 57 ---------------------------------------VG--E-----GAQKSESTATFAPVTAIPRKVQVTQRFSQEV 90 (311) T ss_pred ---------------------------------------ee--c-----CcccccccceeeEEEEeeEEEEEeehhhHHH Confidence 00 1 1223333444455555554444456789999 Q ss_pred HHhHHHhhCCChhHHHHHHHHHHHHHHhhHHHHHHHhhhcchhhccccccceeeeee--------cCCcchhHHHHHHHH Q lcl|NC_015288. 226 AQDLKAIHGLDAEQELANILSSEVLAEINREVVRRVYSVAKPGAANNVANAGIFDLD--------VDSNGRWSVEKFKGL 297 (468) Q Consensus 226 AQDLkAiHGLDAE~ELanILStEImlEINREII~~l~~vA~~~k~~~~~~~Gv~Dl~--------~~~~~rw~~e~~~~l 297 (468) .|+--. -.++-+++|.+-|+..|+..|+.-++.-. ..+. +....|++... ..+......+.. T Consensus 91 l~~~~d-~~~~l~~~i~~~la~ai~~~~d~a~l~G~----~~~~--~~~~~gi~~~~~~~~~~~~~~~~~~~~~~~~--- 160 (311) T protein:vir:81 91 KWADES-RQLGVLQTMADLSGVALGRALDLIGIHGI----NPLT--GAALSGSPAKILDTTNIVELTTGTSATPDLA--- 160 (311) T ss_pred hhcCcc-cHHHHHHHHHHHHHHHHHHHHHHhhhccc----cCCC--CcccccccccccccceeeeecccccchHHHH--- Confidence 875322 23446677777777777777777776531 1111 11111221110 001011111110 Q ss_pred HHHHHHHHHHHHHHhccCCccEEEEchhHHHHHhhccccccccccccccccccccccCCCceeEEEecCCeEEEEccccc Q lcl|NC_015288. 298 LFQIERDCNAIAQDTRRGKGNFLICSADVASALAMAGVLDYSSGLTGAGGPAIGTVDDTGNLAVGTINGRIKVYVDPYAA 377 (468) Q Consensus 298 ~~~i~~ean~i~q~T~rg~~n~~v~S~~va~~L~~~G~~~~~~~~~~~~~~~~~~~D~t~~~~~G~l~~~~~vy~D~Ya~ 377 (468) |.+....+ ...++..+-+|++++....|.. +...-+ + .....+.++ -..|+|.| ++|+++-+.. T Consensus 161 ---i~~~~~~~--~~~~~~~~~~vmn~~~~~~l~~---lkd~~G---~---~l~~~~~~~-~~~~tl~G-~Pv~~~~~i~ 224 (311) T protein:vir:81 161 ---VEAAVGLV--LGDNLSPDGVALDNTFSFMLAT---QRDSQG---R---KLYPELGFG-TDVASFAG-LNAAVSDTVR 224 (311) T ss_pred ---HHHHHHHh--hhcCCCceEEEEcHHHHHHHHh---hhccCC---C---eeecCcccc-CCCceecc-eeEEeccccc Confidence 22222222 2345677768889988887743 221111 0 000011111 12466654 7777763321 Q ss_pred cCC--CcceEEEEEecCCcc-----c-ceeEEcccccccccccc--CCcc----ccc-eeee--eeecce-eecC--ccc Q lcl|NC_015288. 378 NLS--DKHYYVVGYKGTSPY-----D-AGLFYCPYVPLQMVRSI--DPNN----FQP-KIGF--KTRYGM-VSNP--FVT 437 (468) Q Consensus 378 ~~s--~~dY~~vG~Kg~~~~-----d-~glfyaPYv~~~~~~~~--Dp~s----~qP-~~g~--~tRY~l-~~nP--f~~ 437 (468) .+- ..+=+.+...+.... | +.+++....+..+...- |+.. ||- .++| ..|+|. +.+| |+. T Consensus 225 ~~~~~~~~~~~~~~~~~~~~~~~~gDfs~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~r~~~r~d~~v~~~~a~~~ 304 (311) T protein:vir:81 225 GGPEAVTASTGVYRTTNPNVKAIAGDFSAFRWGVQVSIPLELIEFGDPDGLGDLKRQNQIAIRAEVVYGIGIMSTDAFAV 304 (311) T ss_pred ccccccccccchhcccCCccEEEEEecccEEEEEeccceEEEeccCCCCcchhhhhcCcEEEEEEEEeccEeecccceEE Confidence 100 000000000000000 1 12333333333333222 2221 222 1333 367774 3666 553 Q ss_pred ccCcccc Q lcl|NC_015288. 438 TNGLYSG 444 (468) Q Consensus 438 ~~~~~~~ 444 (468) .+..-.- T Consensus 305 l~~a~~~ 311 (311) T protein:vir:81 305 VRDADES 311 (311) T ss_pred EEeeccC Confidence 3211110 No 84 >protein:vir:9309 Length: 324 # NCBI annotation: head protein # Family: family:all:507 # MgeID: mge:165 # MgeName: phi 11 # Cross-refs: genbank:acc:NP_803287;genbank:gi:29028597;genbank:GeneID:1258044 Probab=41.06 E-value=0.94 Score=20.72 Aligned_cols=298 Identities=11% Similarity=0.043 Sum_probs=123.2 Q ss_pred hhhhHHHHHhhhhhhhhhccccccCcccccccccccccccccccccccceehhhhHHhhhhhhhhheeeeecCCccceee Q lcl|NC_015288. 35 LLENQERFLREERGMLQEVAVNSLGAGTVSPGGSALGSANTAGLAGFDPVLISLVRRAMPNLMAYDVCGVQPMSGPTGLI 114 (468) Q Consensus 35 llenq~~~~~e~~~~l~e~~~~~~g~~~~~~~~~~~~st~tg~~~~~~P~Lv~l~RRa~~~LIa~DI~GVQPmTGPTGLI 114 (468) ..++|+....-.+-.......+.+++... .++++++..--....-.+++.+..+.+..+++.+-||++++--| T Consensus 1 ~~~~~~~~~~~~~f~~~~~~~~~~~a~~~-------~~~~~~~~liP~~~~~~ii~~~~~~s~l~~l~~~~~~~~~~~~i 73 (324) T protein:vir:93 1 MEQTQKLKLNLQHFASNNVKPQVFNPDNV-------MMHEKKDGTLLNDFTTPILQEVMENSKIMQLGKYEPMEGTEKKF 73 (324) T ss_pred CchhHHHHHHHHHHHHhhhhhhhcccccc-------cccCCCcceechhHHHHHHHHHHhhchhhhhcceeeccCCceEE Confidence 33333332221111111111122222111 11122111111112223455556677888999999998875433 Q ss_pred eeeeeeecCCCCCccccccccccccccccccccccccccccccccCcccccccccccccccccccccccchhhhhccCCC Q lcl|NC_015288. 115 FAMRSRYENQAGEEALFNEPDAGFTAGLDATTGAYTPRTGAGVGGDAEGNNPALLNDSSPGTYETPRGFSREDLEQAGDA 194 (468) Q Consensus 115 FAMRsrY~~qsG~EA~fnEa~t~fSG~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~a~~g~~t~~~gm~Ta~aE~lG~~ 194 (468) .-.. .+.++ .| . +| T Consensus 74 p~~~------~~~~a-------~~----------------------------------------------v--~E----- 87 (324) T protein:vir:93 74 TFWA------DKPGA-------YW----------------------------------------------V--GE----- 87 (324) T ss_pred EEEe------cCcce-------ee----------------------------------------------e--cC----- Confidence 2110 00000 00 0 01 Q ss_pred CCccccceeEEEEEEEEeecccccceecHHHHHhHHHhhCCChhHHHHHHHHHHHHHHhhHHHHHHHhhhcchhhccccc Q lcl|NC_015288. 195 GKLFREMSFSIEKTSVTAKSRALKAEYTLELAQDLKAIHGLDAEQELANILSSEVLAEINREVVRRVYSVAKPGAANNVA 274 (468) Q Consensus 195 g~~f~EMaFsIeK~tVtAKSRaLKAEYTvELAQDLkAiHGLDAE~ELanILStEImlEINREII~~l~~vA~~~k~~~~~ 274 (468) +..+++..-++++++++.|..+-....|-||.+|-. .|.++.|.+.|+..|...+++.+|.---. +.. T Consensus 88 g~~~~~~~~~f~~i~~~~~k~~~~~~iS~ell~ds~----~~l~~~i~~~l~~aia~~~d~a~l~G~g~--------~~~ 155 (324) T protein:vir:93 88 GQKIETSKATWVNATMRAFKLGVILPVTKEFLNYTY----SQFFEEMKPMIAEAFYKKFDEAGILNQGN--------NPF 155 (324) T ss_pred CccccccccceeEEEEEeEEEEEeehhhHHHHhcch----HHHHHHHHHHHHHHHHHHHHHHHhcCCCC--------CCc Confidence 123344444556666666666666789999999953 46888999999999999999988753211 111 Q ss_pred cceeeeeecCCc----chhHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEchhHHHHHhhcccccccccccccccccc Q lcl|NC_015288. 275 NAGIFDLDVDSN----GRWSVEKFKGLLFQIERDCNAIAQDTRRGKGNFLICSADVASALAMAGVLDYSSGLTGAGGPAI 350 (468) Q Consensus 275 ~~Gv~Dl~~~~~----~rw~~e~~~~l~~~i~~ean~i~q~T~rg~~n~~v~S~~va~~L~~~G~~~~~~~~~~~~~~~~ 350 (468) ..|+++...... +.-..+....++. .+ ...-+....++|++.....|... .... ++. . T Consensus 156 ~~~~~~~~~~~~~~~~~~~~~~~i~~~~~-------~l--~~~~~~~~~~v~n~~~~~~L~~l---~d~~---G~~---~ 217 (324) T protein:vir:93 156 GKSIAQSIEKTNKVIKGDFTQDNIIDLEA-------LL--EDDELEANAFISKTQNRSLLRKI---VDPE---TKE---R 217 (324) T ss_pred CccccccccccceeccccccHHHHHHHHH-------hh--hhccCCCCEEEEcHHHHHHHHHh---hCCC---CCe---e Confidence 122222111110 1001122222211 11 11223445689999999888642 1111 110 0 Q ss_pred ccccCCCceeEEEecCCeEEEEccccccCCCcceE--------EEEEecCCcccceeEEccccccccccccCCc------ Q lcl|NC_015288. 351 GTVDDTGNLAVGTINGRIKVYVDPYAANLSDKHYY--------VVGYKGTSPYDAGLFYCPYVPLQMVRSIDPN------ 416 (468) Q Consensus 351 ~~~D~t~~~~~G~l~~~~~vy~D~Ya~~~s~~dY~--------~vG~Kg~~~~d~glfyaPYv~~~~~~~~Dp~------ 416 (468) . .+.. .++|. +++|++.+.+. .+..-+ ++|..++.+.+- ..+..+ ....||. T Consensus 218 ~-~~~~----~~~l~-G~PVv~~~~~~--~~~~~i~~gdfs~~~~~~~~~~~i~~----~~~~~~--~~~~~~~~~~~~~ 283 (324) T protein:vir:93 218 I-YDRN----SDSLD-GLPVVNLKSSN--LKRGELITGDFDKLIYGIPQLIEYKI----DETAQL--STVKNEDGTPVNL 283 (324) T ss_pred e-cCCC----CCccc-ceeeEeecCCC--CCcceEEEEecceEEEEEecCcEEEE----eecccc--cccccccccchhh Confidence 0 1111 23443 46777754321 122223 333333222110 000000 0000111 Q ss_pred --cccceeeeeeecceee-cC--ccccc---CcccccCChhh Q lcl|NC_015288. 417 --NFQPKIGFKTRYGMVS-NP--FVTTN---GLYSGTPDGET 450 (468) Q Consensus 417 --s~qP~~g~~tRY~l~~-nP--f~~~~---~~~~~~~~~~~ 450 (468) .-|=.+=...|||..+ +| |+... .....+|. +- T Consensus 284 f~~n~~~~r~~~r~d~~v~~~~a~~~l~~a~~~~~~~~~-~~ 324 (324) T protein:vir:93 284 FEQDMVALRATMHVALHIADDKAFAKLVPADKRTDSVPG-EV 324 (324) T ss_pred hhcCcEEEEEEEEeccEEecccceEEEecccccCCCCCC-CC Confidence 1123334445666543 44 33221 11111111 11 No 85 >protein:vir:102119 Length: 404 # NCBI annotation: phage major capsid protein, HK97 family # Family: family:all:21 # MgeID: mge:1641 # MgeName: phiSM101 # Cross-refs: genbank:acc:YP_699941;genbank:gi:110804052;genbank:GeneID:4206662 Probab=39.48 E-value=1 Score=20.55 Aligned_cols=337 Identities=12% Similarity=0.037 Sum_probs=114.2 Q ss_pred CcchHHHHHhhhhhhcCCccc--cccchhhhhhhhhhhhhHHHHHhh---hhhhhhhcccccc----------------- Q lcl|NC_015288. 1 MFNAEHLQEKWSPVLNNEAAN--PIADRYKKAVTSVLLENQERFLRE---ERGMLQEVAVNSL----------------- 58 (468) Q Consensus 1 ~~~~~~l~~kw~p~l~~~~~~--~i~~~~~~~~~~~llenq~~~~~e---~~~~l~e~~~~~~----------------- 58 (468) .=..+++.+.=.-+++..... ++... ++.+ .. |+.|.+...+ ....+.+...... T Consensus 9 ~~~~~~~~~e~~~~~~~~~~~~ee~~~~-~~e~-~~-l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 85 (404) T protein:vir:10 9 LNQLDSKNKELNSLLNKDGVTAEELNKT-SNEI-DI-LQAKIEAQKRKENIENNFNEDNVKSLNTGKEENVIYNGALFVR 85 (404) T ss_pred HHHHHHHHHHHHHHHhhcCCCHHHHHHH-HHHH-HH-HHHHHHHHHHHHHHHHHHhhhhccccccccchhhHHHHHHHHH Confidence 000111111111122221111 11100 0000 00 1111100000 0000000000000 Q ss_pred ------------Cccccc--cccccccc-ccccccc---cccceehhhhHHhhhhhhhhheeeeecCCccceeeeeeeee Q lcl|NC_015288. 59 ------------GAGTVS--PGGSALGS-ANTAGLA---GFDPVLISLVRRAMPNLMAYDVCGVQPMSGPTGLIFAMRSR 120 (468) Q Consensus 59 ------------g~~~~~--~~~~~~~s-t~tg~~~---~~~P~Lv~l~RRa~~~LIa~DI~GVQPmTGPTGLIFAMRsr 120 (468) ...... -...+..+ +++|+.. .+.+. +++.+.......+++++.||+++.|-+-=.| T Consensus 86 ~~~~~~~~~~~~~~~~~~~~e~~a~~~~~~~~gg~~vP~~~~~~---ii~~~~~~~~l~~l~~~~~~~~~~g~~~~~~-- 160 (404) T protein:vir:10 86 AIADNLLKQKNQRGLNLSEKEINAISENIDEDGGYAVPEDIQTK---INTRLKDTTDLYNMVDYEPVFTRSGSRTYEK-- 160 (404) T ss_pred HHHHHHHHHHHhhhhcchhhHHhhhccccCCCCceeechhHHHH---HHHHHhhhhhHhhhhceeeccCCccceEEEE-- Confidence 000000 00001111 1222221 11222 3444445567788899999999988543222 Q ss_pred ecCCCCCccccccccccccccccccccccccccccccccCcccccccccccccccccccccccchhhhhccCCCCCcccc Q lcl|NC_015288. 121 YENQAGEEALFNEPDAGFTAGLDATTGAYTPRTGAGVGGDAEGNNPALLNDSSPGTYETPRGFSREDLEQAGDAGKLFRE 200 (468) Q Consensus 121 Y~~qsG~EA~fnEa~t~fSG~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~a~~g~~t~~~gm~Ta~aE~lG~~g~~f~E 200 (468) .... ..+ .|-+. ++.... .. T Consensus 161 ~~~~--~~~-------~~v~e----------------------------------------------~~~~~~-----~~ 180 (404) T protein:vir:10 161 RSKQ--KPM-------KPLSE----------------------------------------------NQQIPT-----NG 180 (404) T ss_pred ecCC--cce-------eeccc----------------------------------------------cccccc-----cc Confidence 1110 000 00000 000000 00 Q ss_pred ceeEEEEEEEEeecccccceecHHHHHhHHHhhCCChhHHHHHHHHHHHHHHhhHHHHHHHhhhcchhhccccccceeee Q lcl|NC_015288. 201 MSFSIEKTSVTAKSRALKAEYTLELAQDLKAIHGLDAEQELANILSSEVLAEINREVVRRVYSVAKPGAANNVANAGIFD 280 (468) Q Consensus 201 MaFsIeK~tVtAKSRaLKAEYTvELAQDLkAiHGLDAE~ELanILStEImlEINREII~~l~~vA~~~k~~~~~~~Gv~D 280 (468) ...++++++.+.|.-+-...+|-||.+|-. .+.++.|.+.|+..|...+|+.||.--- .+-...|++. T Consensus 181 ~~~~f~~i~~~~~k~~~~~~iS~ell~ds~----~~l~~~i~~~la~~~~~~~~~~il~G~g--------~~~~~~gi~~ 248 (404) T protein:vir:10 181 DNGKLERFNFKLKDLADFMSIPNDLLKFAD----KSLEDWIINWFVDKVRITRNAEILYGAG--------GDEHATGIMT 248 (404) T ss_pred cccceeeeEeeheeeEeeehhhHHHHhhcH----HHHHHHHHHHHHHHHHHHHHHHHhhcCC--------CCCcccceee Confidence 112234444444444445678999998843 3577788888888888888887774211 1112233332 Q ss_pred eecCCcchhHHH-HHHHHHHHHHHHHHHHHHHhccCCcc-EEEEchhHHHHHhhccccccccccccccccccccccCCCc Q lcl|NC_015288. 281 LDVDSNGRWSVE-KFKGLLFQIERDCNAIAQDTRRGKGN-FLICSADVASALAMAGVLDYSSGLTGAGGPAIGTVDDTGN 358 (468) Q Consensus 281 l~~~~~~rw~~e-~~~~l~~~i~~ean~i~q~T~rg~~n-~~v~S~~va~~L~~~G~~~~~~~~~~~~~~~~~~~D~t~~ 358 (468) ........+... .+..+ ...++.. . -..+.+| .+|||++..+.|..- ..+.+ ......|-++ T Consensus 249 ~~~~~~~~~~~~~~~~~~----~~~~~~~-l-~~~~~~~~~~v~n~~~~~~L~~l---kd~~G------~~l~~~~~~~- 312 (404) T protein:vir:10 249 ANKFKKITLPKSPALKDF----KKCKNVE-L-LNVFKATSSWIVNQDGFNYLDSL---EDKTG------RPYLQPDPKD- 312 (404) T ss_pred ccccceeeccccccHHHH----HHHHHhh-h-hccccCCCEEEEcHHHHHHHHHh---hccCC------ceeeccCcCC- Confidence 211111000000 01111 1111111 1 1223333 368999998888542 11111 0011111111 Q ss_pred eeEEEecCCeEEEE-ccccccCCCcceEEEEEecCCcccceeEEcccc---------ccccccccCC----ccccceeee Q lcl|NC_015288. 359 LAVGTINGRIKVYV-DPYAANLSDKHYYVVGYKGTSPYDAGLFYCPYV---------PLQMVRSIDP----NNFQPKIGF 424 (468) Q Consensus 359 ~~~G~l~~~~~vy~-D~Ya~~~s~~dY~~vG~Kg~~~~d~glfyaPYv---------~~~~~~~~Dp----~s~qP~~g~ 424 (468) ...++|+| ++|++ +....... ..+..++|+.+- .+......++ ...+=.+-. T Consensus 313 ~~~~~l~G-~PV~~~~~~~~~~~-------------~~~~~~~~gd~s~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~ 378 (404) T protein:vir:10 313 PTQYRFLG-LPVIELPNDLLLST-------------ESAIPVLLGDTKEAYKYVSDGAYELATTNIGAGAFETNTTKARI 378 (404) T ss_pred CCCccccc-eeeEEecccccCCC-------------CCccEEEEEeccccEEEEEecceEEEEeccccchhhcCceEEEE Confidence 11245654 56664 32111000 011112222111 1111111122 234445666 Q ss_pred eeecceee-cC--cccccCcccccCC Q lcl|NC_015288. 425 KTRYGMVS-NP--FVTTNGLYSGTPD 447 (468) Q Consensus 425 ~tRY~l~~-nP--f~~~~~~~~~~~~ 447 (468) ..|++..+ +| |+..+-..+-.|- T Consensus 379 ~~r~d~~v~~~~a~~~~~~~~aa~~~ 404 (404) T protein:vir:10 379 IMRIDGNVKDSEALLIAEIPVESVQA 404 (404) T ss_pred EEeeccEEecccceEEEEeecccCCC Confidence 77777643 44 4322111111111 No 86 >protein:vir:99920 Length: 311 # NCBI annotation: gp7 # Family: family:all:966 # MgeID: mge:1611 # MgeName: Halo # Cross-refs: genbank:acc:YP_655524;genbank:gi:109392294;genbank:GeneID:4157089 Probab=38.72 E-value=1.1 Score=20.46 Aligned_cols=284 Identities=14% Similarity=0.105 Sum_probs=110.5 Q ss_pred cccccccccccccccccceehhhhHHhhhhhhhhheeeeecCCccceeeeeeeeeecCCCCCcccccccccccccccccc Q lcl|NC_015288. 66 GGSALGSANTAGLAGFDPVLISLVRRAMPNLMAYDVCGVQPMSGPTGLIFAMRSRYENQAGEEALFNEPDAGFTAGLDAT 145 (468) Q Consensus 66 ~~~~~~st~tg~~~~~~P~Lv~l~RRa~~~LIa~DI~GVQPmTGPTGLIFAMRsrY~~qsG~EA~fnEa~t~fSG~~~~~ 145 (468) ++.. +++++..--....-.+++++.+..+..+++.+-||+....-| -.. . ++.++ .| T Consensus 1 Mat~---tt~~g~~vP~~~~~~ii~~~~~~s~l~~~~~~i~~~~~~~~~-p~~---~--~~~~a-------~w------- 57 (311) T protein:vir:99 1 MATF---GTGNLKNLPRNIADGMVKDVVQGSTVAVLSARKPQRFGNEDI-ITF---N--GRPKA-------EF------- 57 (311) T ss_pred Ccee---cCCCceeccHHHHHHHHHHHHhhchhhhhcceeeccCCceEE-EEE---e--CCcee-------EE------- Confidence 2211 111111111111123555555666778888888887543221 110 0 00000 00 Q ss_pred ccccccccccccccCcccccccccccccccccccccccchhhhhccCCCCCccccceeEEEEEEEEeecccccceecHHH Q lcl|NC_015288. 146 TGAYTPRTGAGVGGDAEGNNPALLNDSSPGTYETPRGFSREDLEQAGDAGKLFREMSFSIEKTSVTAKSRALKAEYTLEL 225 (468) Q Consensus 146 ~~~~~~~~~~~~~~~~~g~~~~~~~~a~~g~~t~~~gm~Ta~aE~lG~~g~~f~EMaFsIeK~tVtAKSRaLKAEYTvEL 225 (468) .+ | +..+++...+++.++..+|.-+-....|-|| T Consensus 58 ---------------------------------------v~--E-----g~~~~~~~~~f~~v~l~~~k~~~~~~iS~el 91 (311) T protein:vir:99 58 ---------------------------------------VG--E-----GQQKSSTTGEFDFVTSTPKKAQVTMRFNEEV 91 (311) T ss_pred ---------------------------------------ee--c-----CcccccccceeeEEEEeeEEEEEeehhhHHH Confidence 00 1 1233444445556666666656677899999 Q ss_pred HHhHHHhhCCChhHHHHHHHHHHHHHHhhHHHHHHHhhhc---chhhccccc-cceeeeeecCCcchhHHHHHHHHHHHH Q lcl|NC_015288. 226 AQDLKAIHGLDAEQELANILSSEVLAEINREVVRRVYSVA---KPGAANNVA-NAGIFDLDVDSNGRWSVEKFKGLLFQI 301 (468) Q Consensus 226 AQDLkAiHGLDAE~ELanILStEImlEINREII~~l~~vA---~~~k~~~~~-~~Gv~Dl~~~~~~rw~~e~~~~l~~~i 301 (468) .|+-.- -..|-+++|.+.|...|+..|++.+|.-.-..- ..+-.+... ..+.+.+. .++....+.. | T Consensus 92 l~~~~d-~~~~l~~~i~~~la~ai~~~~d~~~l~G~g~~~g~~~~g~~~~~~~~~~~~~~~--~~~~~~~~~~------i 162 (311) T protein:vir:99 92 QWADED-YQLGVLQTLSEAGAEALARALDLGLYHRINPLTGTVIPGWSNYLGAASKRVELT--ADTIANPDLA------I 162 (311) T ss_pred hhcccc-cHHHHHHHHHHHHHHHHHHHHHHHhhcccCcccCccccccccccccccceeecc--ccccchhHHH------H Confidence 763221 124567788888888888888887775322000 000000000 00111111 1111101110 1 Q ss_pred HHHHHHHHHHhccCCccEEEEchhHHHHHhhccccccccccccccccccccccCCCceeEEEecCCeEEEEccccc---- Q lcl|NC_015288. 302 ERDCNAIAQDTRRGKGNFLICSADVASALAMAGVLDYSSGLTGAGGPAIGTVDDTGNLAVGTINGRIKVYVDPYAA---- 377 (468) Q Consensus 302 ~~ean~i~q~T~rg~~n~~v~S~~va~~L~~~G~~~~~~~~~~~~~~~~~~~D~t~~~~~G~l~~~~~vy~D~Ya~---- 377 (468) ..-...+...-.++..+-.|++++....|.. +..+-+ + .....+.++. ..|+|. +++|++..+.. T Consensus 163 ~~~~~~~~~~~~~~~~~~~vmn~~~~~~L~~---lkd~~G---~---~l~~~~~~~~-~~~~l~-G~Pv~~s~~i~~~~~ 231 (311) T protein:vir:99 163 EAAVGLLVANGHPTPVNGLALHPSIAWGLST---ARYTDG---R---KKFPELGLGI-GVSSFE-GIDASVSDTVNGGDE 231 (311) T ss_pred HHHHHHHhhhccCCCccEEEEcHHHHHHHHh---hhccCC---C---eeecCcccCC-CCceec-ceeeEeecccccccc Confidence 1111112122234455668999999988853 222111 0 0001111111 124553 46777753210 Q ss_pred --------cCCCcceEEEEEecCCcccceeEEcccccccccc--ccCCcccc-----ceeee--eeecceee-cC-cccc Q lcl|NC_015288. 378 --------NLSDKHYYVVGYKGTSPYDAGLFYCPYVPLQMVR--SIDPNNFQ-----PKIGF--KTRYGMVS-NP-FVTT 438 (468) Q Consensus 378 --------~~s~~dY~~vG~Kg~~~~d~glfyaPYv~~~~~~--~~Dp~s~q-----P~~g~--~tRY~l~~-nP-f~~~ 438 (468) -..+.+++++|=- ..++.|.-.....+.. .-|++... --++| ..|||..+ || |+.. T Consensus 232 ~~~~~~~~~~~~~~~~~~Gdf-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~r~~~r~d~~v~~~~~v~~ 306 (311) T protein:vir:99 232 ADPDDEDLDAARAVRGIVGDF-----ANGIHWGVQRDIPVELIKYGDPDGQGDLKRHNQIALRLEIVYGWYVFTDRFVVI 306 (311) T ss_pred cccccchhhccCcceEEEeec-----cccEEEEEecCceEEEeecCCCCcchhhhhcCcEEEEEEEeecceecChhHeee Confidence 0112233333210 0112222111111111 11233211 11333 56777544 33 4332 Q ss_pred cCccc Q lcl|NC_015288. 439 NGLYS 443 (468) Q Consensus 439 ~~~~~ 443 (468) .+..+ T Consensus 307 ~~~~A 311 (311) T protein:vir:99 307 ENAVA 311 (311) T ss_pred ecccC Confidence 22222 No 87 >protein:vir:96833 Length: 275 # NCBI annotation: ORF015 # Family: family:all:522 # MgeID: mge:1642 # MgeName: EW # Cross-refs: genbank:acc:YP_240157;genbank:gi:66395822;genbank:GeneID:5133174 Probab=36.67 E-value=1.2 Score=20.23 Aligned_cols=257 Identities=12% Similarity=0.026 Sum_probs=107.0 Q ss_pred eeeeeeecCCCCCcc-cccccc-----------ccccccccccccccccccccccccCcccccccccccccccccccccc Q lcl|NC_015288. 115 FAMRSRYENQAGEEA-LFNEPD-----------AGFTAGLDATTGAYTPRTGAGVGGDAEGNNPALLNDSSPGTYETPRG 182 (468) Q Consensus 115 FAMRsrY~~qsG~EA-~fnEa~-----------t~fSG~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~a~~g~~t~~~g 182 (468) -||=. ++.-.. +..|.. .-|++.... . ....| .+ ..+.+.+.- T Consensus 1 ~~~~~----~T~l~d~i~PEv~~~~v~~~~~~~~~~~~~~~~-~------------~~l~g-~~-------G~tv~iP~~ 55 (275) T protein:vir:96 1 MALEN----MTKLANMVNPEVLAPMMQAELDKKLKFAQFADI-D------------NTLVG-QP-------GNTITFPAF 55 (275) T ss_pred CCCcc----cchhhhhhchHHHHHHHHHHHHHhhhhccccee-c------------ccccC-CC-------CCEEEeeee Confidence 12211 110000 001110 011110000 0 00000 00 011111111 Q ss_pred cchhhhhccCCC-CCccccceeEEEEEEEEeecccccceecHHHHHhHHH-hhCCChhHHHHHHHHHHHHHHhhHHHHHH Q lcl|NC_015288. 183 FSREDLEQAGDA-GKLFREMSFSIEKTSVTAKSRALKAEYTLELAQDLKA-IHGLDAEQELANILSSEVLAEINREVVRR 260 (468) Q Consensus 183 m~Ta~aE~lG~~-g~~f~EMaFsIeK~tVtAKSRaLKAEYTvELAQDLkA-iHGLDAE~ELanILStEImlEINREII~~ 260 (468) -...++|.+..+ .-+..++. ..+.+++.|-|.-.-+++ |+-+ .-+-|.-.|..+-++..|+.+++.+++.. T Consensus 56 ~~ig~a~~~~~g~~i~~~~lt--~~~~~~~i~~~~~~~~i~-----D~~~~~~~~d~~~~~~~~~a~~~a~~~d~~ll~~ 128 (275) T protein:vir:96 56 VYSGDAKVVPEGEEIPIDLIE--TKKRQATIRKIGKGTVLT-----DEALLSGYGDPKGEAVRQHGLAIANKVDNDVLEA 128 (275) T ss_pred ccCCccccccCCCCcchhhcc--cceeeEEeehhccccccc-----HHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHH Confidence 111233333221 22344443 444455556554443333 3333 22568888999999999999999999987 Q ss_pred HhhhcchhhccccccceeeeeecCCcchhHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEchhHHHHHhhcccccccc Q lcl|NC_015288. 261 VYSVAKPGAANNVANAGIFDLDVDSNGRWSVEKFKGLLFQIERDCNAIAQDTRRGKGNFLICSADVASALAMAGVLDYSS 340 (468) Q Consensus 261 l~~vA~~~k~~~~~~~Gv~Dl~~~~~~rw~~e~~~~l~~~i~~ean~i~q~T~rg~~n~~v~S~~va~~L~~~G~~~~~~ 340 (468) +.+..... +.+. ...+.+-..+.++..| -..+++++++|++++.|.-..-..+.+ T Consensus 129 l~~a~~~~---------------~~~~-~~~d~i~dA~~~lgd~---------~~~~~~ivv~p~~~~~L~k~~~~~f~~ 183 (275) T protein:vir:96 129 LQGATLKV---------------EADI-TKLAGLQTAIDKFNDE---------DLEPMVLFVNPLDAGKLRASATDNFTR 183 (275) T ss_pred Hhcccccc---------------cccc-cCHHHHHHHHHHhccc---------cCCccEEEeCHHHHHHHHhcccccccc Confidence 76533221 1111 1123232222333221 246889999999999884322122222 Q ss_pred ccccccccccccccCCCceeEEEecCCeEEEEccccccCCCcceEEEEEe-cCCcccceeEEccccccccccccCCcccc Q lcl|NC_015288. 341 GLTGAGGPAIGTVDDTGNLAVGTINGRIKVYVDPYAANLSDKHYYVVGYK-GTSPYDAGLFYCPYVPLQMVRSIDPNNFQ 419 (468) Q Consensus 341 ~~~~~~~~~~~~~D~t~~~~~G~l~~~~~vy~D~Ya~~~s~~dY~~vG~K-g~~~~d~glfyaPYv~~~~~~~~Dp~s~q 419 (468) .... + ...-.+-.+|++ .|++|++| +.-| .|-.+-++ |+-. ++-.+=++.+..| |+++++ T Consensus 184 ~~~~--g-----~~~~~~G~ig~~-~G~~Vi~s----~~~p-~~t~~i~~~gA~~----~~~~~~~~vE~~R--d~~~~~ 244 (275) T protein:vir:96 184 ATLL--G-----DNVIVKGAFGEA-LGAIIVRS----NKIK-EGEAILAKRGAVK----LITKRDFFLETER--HASHKS 244 (275) T ss_pred cccc--c-----ccceecccccee-cCeeEEEe----CCCC-cceEEEEecccee----eeecCCccccccc--chhhcC Confidence 1111 0 001112246776 57899999 5445 22222222 1111 1111101111111 888888 Q ss_pred ceeeeeeeccee-ecC--cccccCcccccCChhhh Q lcl|NC_015288. 420 PKIGFKTRYGMV-SNP--FVTTNGLYSGTPDGETL 451 (468) Q Consensus 420 P~~g~~tRY~l~-~nP--f~~~~~~~~~~~~~~~~ 451 (468) =.+--..+||+. .|| -......+++ -.+ T Consensus 245 d~i~~~~~y~~~~~~~~~vv~~t~~~~~----~~~ 275 (275) T protein:vir:96 245 TALFSDKHYVAYLYDESKVVKITKSASG----LGV 275 (275) T ss_pred cEEEEeEEEEEEEEcCccEEEEEecccc----cCC Confidence 888888888854 445 1122222222 111 No 88 >protein:vir:101607 Length: 379 # NCBI annotation: major capsid protein precursor # Family: family:all:585 # MgeID: mge:1646 # MgeName: 11b # Cross-refs: genbank:acc:YP_112497;genbank:gi:53793597;uniprot:Q5ZGF6;genbank:GeneID:3101715 Probab=36.18 E-value=1.2 Score=20.17 Aligned_cols=332 Identities=15% Similarity=0.095 Sum_probs=119.6 Q ss_pred Cc------chHHHHHhhhhhhcCCc--cccccchhhhhhhhhhhhhHHH---HHhhhhhh------hhhcccc------- Q lcl|NC_015288. 1 MF------NAEHLQEKWSPVLNNEA--ANPIADRYKKAVTSVLLENQER---FLREERGM------LQEVAVN------- 56 (468) Q Consensus 1 ~~------~~~~l~~kw~p~l~~~~--~~~i~~~~~~~~~~~llenq~~---~~~e~~~~------l~e~~~~------- 56 (468) .+ ..++..++...+.+... .-+..+.-+.++ ..|.+..++ ...+.... ..+...+ T Consensus 16 ~l~~~~~~~~~e~~~~~e~~~~~~~~~~~~~~~e~~~~~-~~l~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 94 (379) T protein:vir:10 16 QVDSKSSAQALEVKGLIEALEAKMTSEKDLAVNELKSDM-AALQAHADKLDVKLKEKAKSEDKSDSLVKSITENFNDIKE 94 (379) T ss_pred HHHHHHHHHHHHHHHHHHHHHhHhhHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHhcccccccchhHHHHHHHHHHhHHH Confidence 00 01112222221111000 000001111111 111111100 00000000 0000000 Q ss_pred --ccCccccc-cccccccccccccc-ccccceehhhhHHhhhhhhhhheeeeecCCccceeeeeeeeeecCCCCCccccc Q lcl|NC_015288. 57 --SLGAGTVS-PGGSALGSANTAGL-AGFDPVLISLVRRAMPNLMAYDVCGVQPMSGPTGLIFAMRSRYENQAGEEALFN 132 (468) Q Consensus 57 --~~g~~~~~-~~~~~~~st~tg~~-~~~~P~Lv~l~RRa~~~LIa~DI~GVQPmTGPTGLIFAMRsrY~~qsG~EA~fn 132 (468) ........ .+.....+..++.+ ..+.+.++.+.| ....-.+++.|.||++++.-|.- ..+ T Consensus 95 ~~~~~~~~~~~~~~~~~~~~~~~~ip~~~~~~ii~~~~---~~~~i~~~~~~~~~~~~~~~~~~-------~~~------ 158 (379) T protein:vir:10 95 VRNGKSIQVKAVGDMTLPVNLTGAQPKDYNFDVVLNPS---QMLNVSDIVGAVSISGGTYTFVR-------ENG------ 158 (379) T ss_pred HHhhhhhhhhhhcccccCCCCccccchhhhhHHHHhHH---hhhhHHhhceeeeccCCceEEEE-------eec------ Confidence 00000000 00000001111111 112333333333 34466788999999887543321 000 Q ss_pred cccccccccccccccccccccccccccCcccccccccccccccccccccccchhhhhccCCCCCccccceeEEEEEEEEe Q lcl|NC_015288. 133 EPDAGFTAGLDATTGAYTPRTGAGVGGDAEGNNPALLNDSSPGTYETPRGFSREDLEQAGDAGKLFREMSFSIEKTSVTA 212 (468) Q Consensus 133 Ea~t~fSG~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~a~~g~~t~~~gm~Ta~aE~lG~~g~~f~EMaFsIeK~tVtA 212 (468) +.+. . . .-.+| +...+++..++++++..+ T Consensus 159 -----~~~~---~----------------~----------------------~~v~E-----g~~~~~~~~~f~~i~~~~ 187 (379) T protein:vir:10 159 -----AGEG---A----------------I----------------------GAQVE-----GATKGQKDYDISMIDVNT 187 (379) T ss_pred -----CCCc---c----------------c----------------------ccccC-----CccccccccceeeeEeee Confidence 0000 0 0 00001 223444555555555555 Q ss_pred ecccccceecHHHHHhHHHhhCCChhHHHHHHHHHHHHHHhhHHHHHHHhhhcchhhccccccceeeeeecCCcchhHHH Q lcl|NC_015288. 213 KSRALKAEYTLELAQDLKAIHGLDAEQELANILSSEVLAEINREVVRRVYSVAKPGAANNVANAGIFDLDVDSNGRWSVE 292 (468) Q Consensus 213 KSRaLKAEYTvELAQDLkAiHGLDAE~ELanILStEImlEINREII~~l~~vA~~~k~~~~~~~Gv~Dl~~~~~~rw~~e 292 (468) |.=+--..+|-||.||-- +.++.|.+-|+..|+.-+|..++.-+.+.+.-+.. ...+..+ ++ T Consensus 188 ~k~~~~~~iS~ell~D~~-----~l~~~i~~~la~~~~~~~~~~~~~g~~~~~~~~~~------------~~~~~~~-~d 249 (379) T protein:vir:10 188 DFIAGFTRYSKKMANNLP-----FLTSFIPNALRRDYAKAENAAFNAVLAANATASTE------------IITNKNK-VE 249 (379) T ss_pred eeEEeeehhhHHHHhhHH-----HHHHHHHHHHHHHHHHHHHHHHhcccccccccccc------------cccCccc-HH Confidence 555555789999999963 27888999999999999998887654432211111 1111111 22 Q ss_pred HHHHHHHHHHHHHHHHHHHhccCCccEEEEchhHHHHHhhccccccccc-cccccccccccccCCCceeEEEecCCeEEE Q lcl|NC_015288. 293 KFKGLLFQIERDCNAIAQDTRRGKGNFLICSADVASALAMAGVLDYSSG-LTGAGGPAIGTVDDTGNLAVGTINGRIKVY 371 (468) Q Consensus 293 ~~~~l~~~i~~ean~i~q~T~rg~~n~~v~S~~va~~L~~~G~~~~~~~-~~~~~~~~~~~~D~t~~~~~G~l~~~~~vy 371 (468) ....+++++. ..-...+-+|++|.....|.. +..+.+ ...+ ++.+.++. -..+|. |++|+ T Consensus 250 ~i~~~~~~~~---------~~~~~~~~~vmn~~~~~~l~~---lkd~~G~~l~~-----~~~~~~~~-~~~~l~-G~pvv 310 (379) T protein:vir:10 250 MLINEIAKQE---------NLDFPVTAIVLRPTDYYDILV---TQKSVGAGYGL-----PGVVTQDN-GVLRIN-GIPLF 310 (379) T ss_pred HHHHHHHhhh---------hccCCCCEEEEcHHHHHHHHH---hhccCCceecc-----CCccCCCC-Ccceec-ceeeE Confidence 2222222221 123455568899988877743 211111 0011 01111110 011443 58999 Q ss_pred EccccccCCCcceEEEEEecCCcccceeEEcccccccccccc--CCccccceeeeeeecceee-cCcccccCcccccCCh Q lcl|NC_015288. 372 VDPYAANLSDKHYYVVGYKGTSPYDAGLFYCPYVPLQMVRSI--DPNNFQPKIGFKTRYGMVS-NPFVTTNGLYSGTPDG 448 (468) Q Consensus 372 ~D~Ya~~~s~~dY~~vG~Kg~~~~d~glfyaPYv~~~~~~~~--Dp~s~qP~~g~~tRY~l~~-nPf~~~~~~~~~~~~~ 448 (468) ++++.. ..-+++|=-. . .-+++--=+..+..+.. +-.+.+=.+=+..|+|+.+ +|= T Consensus 311 ~s~~~~----ag~~~~gdf~---~-~~~~~~~~~~i~~~~~~~~~f~~~~~~~r~~~R~~~~v~~p~------------- 369 (379) T protein:vir:10 311 RATWLA----ANKYYVGDWT---R-VTKVTTEGLSLEFSEVEGTNFVKNNITARIEAQVALAVEQPA------------- 369 (379) T ss_pred ecCCCC----CCceEEeecc---c-EEEEEEeceEEEEeecccccccCCcEEEEEEEEeccEEecCc------------- Confidence 997654 2223332111 0 11111110001111111 1122222333345776544 341 Q ss_pred hhhhhccCceeeeEEeecc Q lcl|NC_015288. 449 ETLTPSTNMYYRRVQVTNL 467 (468) Q Consensus 449 ~~~~~~~N~y~r~~~v~~~ 467 (468) .|-++-+..| T Consensus 370 ---------a~v~~~~~~~ 379 (379) T protein:vir:10 370 ---------ALIFGDFTAV 379 (379) T ss_pred ---------cEEEEEecCC Confidence 1233333344 No 89 >protein:vir:1383 Length: 421 # NCBI annotation: major capsid protein # Family: family:all:21 # MgeID: mge:314 # MgeName: phi3626 # Cross-refs: genbank:acc:NP_612835;genbank:gi:20065969;genbank:GeneID:935826 Probab=32.84 E-value=1.4 Score=19.79 Aligned_cols=330 Identities=12% Similarity=0.069 Sum_probs=120.3 Q ss_pred cc----hHHHHHhhhhhhcCCc-----cccccchh----hhhhhhhh--hhhHHHHHhhhhhh----hhhccccccCcc- Q lcl|NC_015288. 2 FN----AEHLQEKWSPVLNNEA-----ANPIADRY----KKAVTSVL--LENQERFLREERGM----LQEVAVNSLGAG- 61 (468) Q Consensus 2 ~~----~~~l~~kw~p~l~~~~-----~~~i~~~~----~~~~~~~l--lenq~~~~~e~~~~----l~e~~~~~~g~~- 61 (468) |+ -++|+++.+-+.+... +.+..+.. -++..+.+ |+++-+.+.+...- +.+........+ T Consensus 1 Mn~~e~lkel~~~~~el~~~~~~~~~~~~~~~~e~~~~e~~~~~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (421) T protein:vir:13 1 MNLFERLKELRAKKKELEEKRCGIVEEIRSLAKEKKEEEARSKALEREKIEARMEIIEEEIESVMTAIDEERKNTNFTGG 80 (421) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccccc Confidence 44 2346677776654310 00000000 00111111 11221222111111 111000000000 Q ss_pred --c---------------------ccc---cccccccc-ccccc---ccccceehhhhHHhhhhhhhhheeeeecCCccc Q lcl|NC_015288. 62 --T---------------------VSP---GGSALGSA-NTAGL---AGFDPVLISLVRRAMPNLMAYDVCGVQPMSGPT 111 (468) Q Consensus 62 --~---------------------~~~---~~~~~~st-~tg~~---~~~~P~Lv~l~RRa~~~LIa~DI~GVQPmTGPT 111 (468) . .+. .......+ +.|+. ..+.+.++.+. .+...-.+++.+.||++++ T Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ra~~t~~~gg~liP~~~~~~Ii~~~---~~~~~l~~l~~~~~~~~~~ 157 (421) T protein:vir:13 81 RVIINGDSKEEKRSLQLSAMSKTIRGIQLSEEERDIMSSTNNGAVIPQEFVNEFEKLK---EGYPSLKEHCHVIPVNRNA 157 (421) T ss_pred ccccccchhHHHHHHHHHHHHHhhhccchhHHHhhccccCCcceecchhhHHHHHHHH---HhhhhhhhhceeeeccCCc Confidence 0 000 00001111 11211 11122333333 3444567888888888776 Q ss_pred eeeeeeeeeecCCCCCccccccccccccccccccccccccccccccccCcccccccccccccccccccccccchhhhhcc Q lcl|NC_015288. 112 GLIFAMRSRYENQAGEEALFNEPDAGFTAGLDATTGAYTPRTGAGVGGDAEGNNPALLNDSSPGTYETPRGFSREDLEQA 191 (468) Q Consensus 112 GLIFAMRsrY~~qsG~EA~fnEa~t~fSG~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~a~~g~~t~~~gm~Ta~aE~l 191 (468) +-+--.+ .....+ +. ..+| T Consensus 158 ~~~~~~~----~~~~~~---------~~----------------------------------------------~~~E-- 176 (421) T protein:vir:13 158 GKMPVRA----GASVDK---------LA----------------------------------------------NLAK-- 176 (421) T ss_pred eEEEEee----cCCccc---------ee----------------------------------------------eccc-- Confidence 5332111 000000 00 0000 Q ss_pred CCCCCccccceeEEEEEEEEeecccccceecHHHHHhHHHhhCCChhHHHHHHHHHHHHHHhhHHHHHHHhhhcchhhcc Q lcl|NC_015288. 192 GDAGKLFREMSFSIEKTSVTAKSRALKAEYTLELAQDLKAIHGLDAEQELANILSSEVLAEINREVVRRVYSVAKPGAAN 271 (468) Q Consensus 192 G~~g~~f~EMaFsIeK~tVtAKSRaLKAEYTvELAQDLkAiHGLDAE~ELanILStEImlEINREII~~l~~vA~~~k~~ 271 (468) +...++-..++++++..++.-+-...+|-||.+|-- .|.++.|.+-|+..+..-+|..|+..+-.+ T Consensus 177 ---~~~~~~s~~~f~~i~~~~~k~~~~v~iS~ell~ds~----~~l~~~i~~~la~~~~~~~~~~i~~~~~g~------- 242 (421) T protein:vir:13 177 ---DTELVKAMLKTQPMAYDIDDYGLLAPIDNSLLEDSE----INFLEFVNEEFAEFAVNTENAEIVKQAKAV------- 242 (421) T ss_pred ---cccccccccceeEEEeeeeeeEeehhhhHHHHhhhH----HHHHHHHHHHHHHHHHHHhhhhHhhhhhhc------- Confidence 112233333444555555555555679999999853 467888888888888888888887533211 Q ss_pred ccccceeeeeecCCcchhHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEchhHHHHHhhccccccccccccccccccc Q lcl|NC_015288. 272 NVANAGIFDLDVDSNGRWSVEKFKGLLFQIERDCNAIAQDTRRGKGNFLICSADVASALAMAGVLDYSSGLTGAGGPAIG 351 (468) Q Consensus 272 ~~~~~Gv~Dl~~~~~~rw~~e~~~~l~~~i~~ean~i~q~T~rg~~n~~v~S~~va~~L~~~G~~~~~~~~~~~~~~~~~ 351 (468) .+..++.++ +....++..+.. .-+.+..+|+++.....|.. +..+.+ + +.... T Consensus 243 -~~~~~~~~~----------d~i~~~~~~l~~---------~~~~~a~~v~n~~~~~~l~~---lkd~~G---~-~i~~~ 295 (421) T protein:vir:13 243 -LAEETINDY----------AGLVKTINSLVP---------NARKRAIIVTNSDGRAYLDG---LMDKQG---R-PLLKE 295 (421) T ss_pred -cccccccch----------HHHHHHHHHhhh---------hhcCCCEEEEcHHHHHHHHH---hhcCCC---c-eeecC Confidence 122222222 233333333321 11234567888888877753 222211 1 00000 Q ss_pred cccCCCceeEEEecCCeEEEEccccccCCCcceEEEEEecCCcccceeEEccccc---------ccccccc--CCccccc Q lcl|NC_015288. 352 TVDDTGNLAVGTINGRIKVYVDPYAANLSDKHYYVVGYKGTSPYDAGLFYCPYVP---------LQMVRSI--DPNNFQP 420 (468) Q Consensus 352 ~~D~t~~~~~G~l~~~~~vy~D~Ya~~~s~~dY~~vG~Kg~~~~d~glfyaPYv~---------~~~~~~~--Dp~s~qP 420 (468) ..+.+ -++|+ |++|++..++ ..+-++ +..+||+-+-. +.+...- +-...+= T Consensus 296 ~~~~~----~~tl~-G~pV~~~~~~---------~~~~~~----~~~~~~gd~~~~~~~~~~~~~~v~~~~~~~f~~~~~ 357 (421) T protein:vir:13 296 LSDGG----DLVFK-GRPVIELEES---------IFDVGD----ETKFIVSDFKTLIKFMDRKQYLIDQSKEAGYTKNET 357 (421) T ss_pred cCCCC----Cceec-ceeeEEeccc---------cccCCC----ceEEEEEeccccEEEEEecceEEEeecccccccCee Confidence 00111 23554 4566655332 111111 12233332211 1111111 1112223 Q ss_pred eeeeeeecceeecCcccccCcccccCChhhhhhccCceeeeEEeeccC Q lcl|NC_015288. 421 KIGFKTRYGMVSNPFVTTNGLYSGTPDGETLTPSTNMYYRRVQVTNLM 468 (468) Q Consensus 421 ~~g~~tRY~l~~nPf~~~~~~~~~~~~~~~~~~~~N~y~r~~~v~~~~ 468 (468) .+-+..||+..+ .+.+..... ..+.=..+|+.-= T Consensus 358 ~~r~~~r~d~~~-------------~~~~a~~~~-~~~~~~a~v~~~~ 391 (421) T protein:vir:13 358 IARIIERFDVNS-------------PLDKSSDAE-KIRKFGVIVKLQE 391 (421) T ss_pred EEEEEeeeccee-------------ecchhhhee-eecccceeecccc Confidence 444555555443 111111000 0011112222211 No 90 >protein:vir:10364 Length: 390 # NCBI annotation: head protein; major capsid subunit precursor # Family: family:all:585 # MgeID: mge:183 # MgeName: Xp10 # Cross-refs: genbank:acc:NP_858956;genbank:gi:32128421;genbank:GeneID:2648357 Probab=29.99 E-value=1.6 Score=19.45 Aligned_cols=329 Identities=12% Similarity=0.023 Sum_probs=120.9 Q ss_pred CcchHHHHHhhhhhhcCCccccccchhhhhh---hhhhhhhHH--------------HHHhhhhhhhhh--ccccccCcc Q lcl|NC_015288. 1 MFNAEHLQEKWSPVLNNEAANPIADRYKKAV---TSVLLENQE--------------RFLREERGMLQE--VAVNSLGAG 61 (468) Q Consensus 1 ~~~~~~l~~kw~p~l~~~~~~~i~~~~~~~~---~~~llenq~--------------~~~~e~~~~l~e--~~~~~~g~~ 61 (468) --.+++..+++.-+... +.. -++.+ ..+ ++.-+ +...+...+... ......+.. T Consensus 30 ~~~~~e~~~~~~~~~~e-----~~~-l~~~i~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 102 (390) T protein:vir:10 30 GELNASARSKVDELFAT-----VGN-LSAEVQAARQR-VAELEGNGAGGDVQHVSVGDLFVASEQFQASAGRWNDRSARA 102 (390) T ss_pred cccCHHHHHHHHHHHHH-----HHH-HHHHHHHHHHH-HHHHHhhcccccccccchhhhhhhhHHHHHHHHhhhhhhhhh Confidence 01112223333322111 100 00000 000 00000 000000000000 000000000 Q ss_pred ccccccc----cccccc-ccccc--cccceehhhhHHhhhhhhhhheeeeecCCccceeeeeeeeeecCCCCCccccccc Q lcl|NC_015288. 62 TVSPGGS----ALGSAN-TAGLA--GFDPVLISLVRRAMPNLMAYDVCGVQPMSGPTGLIFAMRSRYENQAGEEALFNEP 134 (468) Q Consensus 62 ~~~~~~~----~~~st~-tg~~~--~~~P~Lv~l~RRa~~~LIa~DI~GVQPmTGPTGLIFAMRsrY~~qsG~EA~fnEa 134 (468) ....... ...++. .|.+. ..-+.++.+.| ....-.++|.+.||++++.-+.- ..+.++. + T Consensus 103 ~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~ii~~~~---~~~~l~~~~~~~~~~~~~~~~~~----~~~~~~~-a----- 169 (390) T protein:vir:10 103 TMNIKAALNTASTDAAGSAGALTTPNRLPGFITQPD---ARLTVRDLIGSGRTDSALIEYVQ----ETGFVNN-A----- 169 (390) T ss_pred hhHHHHHHHhhhcccccccccccchhHHHHHHHHHH---hhchhhhhcceeeccCCceEEEE----EecCCcc-e----- Confidence 0000000 000011 11111 11123333333 44455678999998876533321 1111000 0 Q ss_pred cccccccccccccccccccccccccCcccccccccccccccccccccccchhhhhccCCCCCccccceeEEEEEEEEeec Q lcl|NC_015288. 135 DAGFTAGLDATTGAYTPRTGAGVGGDAEGNNPALLNDSSPGTYETPRGFSREDLEQAGDAGKLFREMSFSIEKTSVTAKS 214 (468) Q Consensus 135 ~t~fSG~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~a~~g~~t~~~gm~Ta~aE~lG~~g~~f~EMaFsIeK~tVtAKS 214 (468) .|- +| +...++-..+++++++.+|. T Consensus 170 --~~v------------------------------------------------~E-----g~~~~~~~~~~~~i~~~~~k 194 (390) T protein:vir:10 170 --AIV------------------------------------------------AE-----GALKPESSLKFAKKTDTTHV 194 (390) T ss_pred --eee------------------------------------------------cC-----CccccccccceeEEEEeeEE Confidence 000 00 12334445566677777777 Q ss_pred ccccceecHHHHHhHHHhhCCChhHHHHHHHHHHHHHHhhHHHHHHHhhhcchhhccccccceeeeeecCCc--chhHHH Q lcl|NC_015288. 215 RALKAEYTLELAQDLKAIHGLDAEQELANILSSEVLAEINREVVRRVYSVAKPGAANNVANAGIFDLDVDSN--GRWSVE 292 (468) Q Consensus 215 RaLKAEYTvELAQDLkAiHGLDAE~ELanILStEImlEINREII~~l~~vA~~~k~~~~~~~Gv~Dl~~~~~--~rw~~e 292 (468) .+....+|-||.||-- |.++.|.+-|+..|...+|+.||.- ...+-.+.|++....... .-.... T Consensus 195 ~~~~~~is~ell~d~~-----~l~~~i~~~l~~~~~~~~~~~il~G--------~G~~~~p~Gi~~~~~~~~~~~~~~~~ 261 (390) T protein:vir:10 195 IAHTMKATRQILSDAP-----QLASYMNNRLIRGLKVKEDAEILRG--------TGANDGLLGLIPQATTYAAPTTIAGA 261 (390) T ss_pred EEEeehhhHHHHHhHH-----HHHHHHHHHHHHHHHHHHHHHHhhc--------CCCCcccccccccccccccccccccc Confidence 7778899999999852 4678899999999999999888742 111112344443221110 000000 Q ss_pred HHHHHHHHHHHHHHHHHHHhccCCccEEEEchhHHHHHhhccccccccccccccccccccccCCCceeEEEecCCeEEEE Q lcl|NC_015288. 293 KFKGLLFQIERDCNAIAQDTRRGKGNFLICSADVASALAMAGVLDYSSGLTGAGGPAIGTVDDTGNLAVGTINGRIKVYV 372 (468) Q Consensus 293 ~~~~l~~~i~~ean~i~q~T~rg~~n~~v~S~~va~~L~~~G~~~~~~~~~~~~~~~~~~~D~t~~~~~G~l~~~~~vy~ 372 (468) . .+.....+.... .......+-+|++|.....|.. +..+.+ + .....+..+. .++| .|++|++ T Consensus 262 ~----~~~~~~~~~~~l-~~~~~~~~~~v~n~~~~~~L~~---lkd~~g---~---~l~~~~~~~~--~~~l-~G~pv~~ 324 (390) T protein:vir:10 262 T----RVDQLRLAMLQA-SLAEYPASGIVINPIDWAAIEL---AKDANN---Q---YLIGNARGTL--TPTL-WGLPVVA 324 (390) T ss_pred c----hHHHHHHHHHhh-ccccCCCCEEEEcHHHHHHHHH---hhcCCC---c---eeecCCcCcC--Ccee-cceeeEE Confidence 0 111111222111 2233456678899998887753 222211 0 0001111111 2345 3678888 Q ss_pred ccccccCCCcceEEEEEecCCcccceeEEcccccccccccc---CCccccceeeeeeecceee-cCcccccCcccccCCh Q lcl|NC_015288. 373 DPYAANLSDKHYYVVGYKGTSPYDAGLFYCPYVPLQMVRSI---DPNNFQPKIGFKTRYGMVS-NPFVTTNGLYSGTPDG 448 (468) Q Consensus 373 D~Ya~~~s~~dY~~vG~Kg~~~~d~glfyaPYv~~~~~~~~---Dp~s~qP~~g~~tRY~l~~-nPf~~~~~~~~~~~~~ 448 (468) +... |..-+++|--. .+++.+...-+...... .-.+.+=.+-...|++..+ +|= T Consensus 325 ~~~~----p~~~~~~gdf~-----~~~~~~~~~~~~i~~~~~~~~~~~~~~~~r~~~r~d~~v~~~~------------- 382 (390) T protein:vir:10 325 TQAM----APGEFLVGAFD-----LAAQIFDQWDARVEIGYVNDDFQRNMVTVLAEERLALVVYRPE------------- 382 (390) T ss_pred cCCC----CCCcEEEEecc-----ceEEEEEecceEEEEeecccccccCcEEEEEEEeeccEEeccc------------- Confidence 8653 23334444210 11111111111111111 1112222233335666543 330 Q ss_pred hhhhhccCceeeeEEee Q lcl|NC_015288. 449 ETLTPSTNMYYRRVQVT 465 (468) Q Consensus 449 ~~~~~~~N~y~r~~~v~ 465 (468) -|.++-+. T Consensus 383 ---------a~~~~~~a 390 (390) T protein:vir:10 383 ---------ALISGSFA 390 (390) T ss_pred ---------cEEEEEeC Confidence 11222222 No 91 >protein:vir:3991 Length: 404 # NCBI annotation: major structural protein # Family: family:all:21 # MgeID: mge:319 # MgeName: BK5-T # Cross-refs: genbank:acc:NP_116499;genbank:gi:14251132;genbank:GeneID:921252 Probab=28.64 E-value=1.7 Score=19.28 Aligned_cols=336 Identities=14% Similarity=0.122 Sum_probs=130.0 Q ss_pred CcchHHHHHhhhhhhcCCccccccchh-------------hh---hhhhhh------hhhHHHHHhhhhhh-hhhccccc Q lcl|NC_015288. 1 MFNAEHLQEKWSPVLNNEAANPIADRY-------------KK---AVTSVL------LENQERFLREERGM-LQEVAVNS 57 (468) Q Consensus 1 ~~~~~~l~~kw~p~l~~~~~~~i~~~~-------------~~---~~~~~l------lenq~~~~~e~~~~-l~e~~~~~ 57 (468) .|+-++|.++|.-+.+. ++... .+ .+.+.+ ++.+++.+.+.... ......+. T Consensus 4 ~m~l~el~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~ee~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 78 (404) T protein:vir:39 4 KLTVNQLNEAWIASGDK-----VTDFNDQINMALNDDNFSAEAMSELKNKRDNEKVRRDALREQLVEAQAEQVVNMREEE 78 (404) T ss_pred HHHHHHHHHHHHHHHHH-----HHHHHHHHHHHhccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccc Confidence 56888999999887654 11111 00 010111 00000001000000 00000000 Q ss_pred cCc----------------------c-----cccccccccccccccccc---cccceehhhhHHhhhhhhhhheeeeecC Q lcl|NC_015288. 58 LGA----------------------G-----TVSPGGSALGSANTAGLA---GFDPVLISLVRRAMPNLMAYDVCGVQPM 107 (468) Q Consensus 58 ~g~----------------------~-----~~~~~~~~~~st~tg~~~---~~~P~Lv~l~RRa~~~LIa~DI~GVQPm 107 (468) ... + ..........+++.|+.. .+.+.+ ++...+.....+++.++|| T Consensus 79 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~~a~~~~t~~~gg~~iP~~~~~~i---i~~~~~~~~l~~~~~~~~~ 155 (404) T protein:vir:39 79 KGPLNKSEYELKDKFVKEFVNMVRNPMAFLNTVSSKTETSGSDSAAGLTIPQDIRTMI---NTLVRQYDSLQQYVRVESV 155 (404) T ss_pred ccccccchhhhHHHHHHHHHHHHhcchhhhhhhhhhhhhcccccCCceeccHHHHHHH---HHHHHhhhhHHhhcceeec Confidence 000 0 000000001111112211 112233 3333455677888999999 Q ss_pred CccceeeeeeeeeecCCCCCccccccccccccccccccccccccccccccccCcccccccccccccccccccccccchhh Q lcl|NC_015288. 108 SGPTGLIFAMRSRYENQAGEEALFNEPDAGFTAGLDATTGAYTPRTGAGVGGDAEGNNPALLNDSSPGTYETPRGFSRED 187 (468) Q Consensus 108 TGPTGLIFAMRsrY~~qsG~EA~fnEa~t~fSG~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~a~~g~~t~~~gm~Ta~ 187 (468) +++++-+--.| ..+.++. + .|-+ + T Consensus 156 ~~~~~~~~~~~--~~~~~~~-a-------~~v~----------------------------------------------E 179 (404) T protein:vir:39 156 STSNGSRVYEK--WTDVTPL-T-------VMDA----------------------------------------------E 179 (404) T ss_pred cCCcceEEEEe--ecCCccc-e-------eeec----------------------------------------------C Confidence 99887654333 1111000 0 0000 0 Q ss_pred hhccCC-CCCccccceeEEEEEEEEeecccccceecHHHHHhHHHhhCCChhHHHHHHHHHHHHHHhhHHHHHHHhhhcc Q lcl|NC_015288. 188 LEQAGD-AGKLFREMSFSIEKTSVTAKSRALKAEYTLELAQDLKAIHGLDAEQELANILSSEVLAEINREVVRRVYSVAK 266 (468) Q Consensus 188 aE~lG~-~g~~f~EMaFsIeK~tVtAKSRaLKAEYTvELAQDLkAiHGLDAE~ELanILStEImlEINREII~~l~~vA~ 266 (468) ++...+ +...|.++.|++.|..+-. .+|-||.+|-. .|.+++|.+-|+..|..-+|+.||.-.- T Consensus 180 g~~~~~~~~~~f~~i~~~~~k~~~~~-------~iS~ell~ds~----~~l~~~i~~~l~~~~~~~~d~~il~g~g---- 244 (404) T protein:vir:39 180 DGKIPDLDNPRLTIIKYLIKRYAGII-------TATNTLLKDTA----ENILAWLSSWIAKKVVVTRNQAIIAAMG---- 244 (404) T ss_pred ccccccccccceeeEEeeeeeEEeee-------hhHHHHHhhch----HHHHHHHHHHHHHHHHHHHHHHHHhccc---- Confidence 000001 1235777777777776554 49999999842 5789999999999999999998886321 Q ss_pred hhhccccccceeeeeecCCcchhHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEchhHHHHHhhcccccccccccccc Q lcl|NC_015288. 267 PGAANNVANAGIFDLDVDSNGRWSVEKFKGLLFQIERDCNAIAQDTRRGKGNFLICSADVASALAMAGVLDYSSGLTGAG 346 (468) Q Consensus 267 ~~k~~~~~~~Gv~Dl~~~~~~rw~~e~~~~l~~~i~~ean~i~q~T~rg~~n~~v~S~~va~~L~~~G~~~~~~~~~~~~ 346 (468) .+....+..++ +....+++ .....--+ ....+|||+.....|.. +..+.+ + T Consensus 245 ----~~~~~~~~~~~----------~~i~~~~~-------~~~~~~~~-~~a~~v~n~~~~~~L~~---lkd~~G---~- 295 (404) T protein:vir:39 245 ----TVPKKPTIAKF----------DDVITMIN-------TSVDPAII-ATSSLLTNQSGLNKLAL---VKTAEG---K- 295 (404) T ss_pred ----ccccccccccH----------HHHHHHHH-------Hhhhhhhc-cCCEEEEcHHHHHHHHH---hhccCC---c- Confidence 11122222222 11111111 10001011 12357899999888863 222111 0 Q ss_pred ccccccccCCCceeEEEecCCeEEEE-c-cccccCCCcce-EEEE-Eec----CCcccceeEEccccccccccccCCccc Q lcl|NC_015288. 347 GPAIGTVDDTGNLAVGTINGRIKVYV-D-PYAANLSDKHY-YVVG-YKG----TSPYDAGLFYCPYVPLQMVRSIDPNNF 418 (468) Q Consensus 347 ~~~~~~~D~t~~~~~G~l~~~~~vy~-D-~Ya~~~s~~dY-~~vG-~Kg----~~~~d~glfyaPYv~~~~~~~~Dp~s~ 418 (468) .....+-+.. ..++|.| ++|++ | ....+....++ +++| ++. ....+-.+=..+|+... =... T Consensus 296 --~l~~~~~~~~-~~~~l~G-~pV~~~~~~~~~~~~~~~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~------~~~~ 365 (404) T protein:vir:39 296 --YLLEPDPTKP-NSYLIKG-KKVIVVADRWLPNSGSTVYPLYYGDMSQAITLFDRENMSLLPTNIGAGA------FETD 365 (404) T ss_pred --eeeccCcCCC-Ccceecc-eeEEEecccccCccCCCccEEEEEeccccEEEEeecceEEEEeccchhh------hhhc Confidence 0001111111 1245544 45554 2 11111111111 1222 110 00000111122222111 1133 Q ss_pred cceeeeeeeccee-ecC--cccccCcccccCChhhhhhcc Q lcl|NC_015288. 419 QPKIGFKTRYGMV-SNP--FVTTNGLYSGTPDGETLTPST 455 (468) Q Consensus 419 qP~~g~~tRY~l~-~nP--f~~~~~~~~~~~~~~~~~~~~ 455 (468) |-.+-...||+.. .+| |+..+--.. .+.+.....++ T Consensus 366 ~~~~r~~~r~d~~~~~~~a~~~~~~~~~-a~~~~~~~~~~ 404 (404) T protein:vir:39 366 TTKIRVIDRFDVKTTDSEALVAGSFTAI-ADQVGNFTAGK 404 (404) T ss_pred eeeEEEEeeeccEEecccceEEEEeecc-ccCCCCCCCCC Confidence 4456666788764 355 332210000 01111122222 No 92 >protein:vir:81070 Length: 390 # NCBI annotation: p09 # Family: family:all:585 # MgeID: mge:1889 # MgeName: Xop411 # Cross-refs: genbank:acc:YP_001285679;genbank:gi:148727187;genbank:GeneID:5247115 Probab=28.35 E-value=1.8 Score=19.24 Aligned_cols=319 Identities=13% Similarity=0.066 Sum_probs=113.6 Q ss_pred Cc-chHHHHHhh------hhhhcCCcccc---ccchhhhh----hhhhhhhhHHHHHhhhhhhhhhccccccCccccccc Q lcl|NC_015288. 1 MF-NAEHLQEKW------SPVLNNEAANP---IADRYKKA----VTSVLLENQERFLREERGMLQEVAVNSLGAGTVSPG 66 (468) Q Consensus 1 ~~-~~~~l~~kw------~p~l~~~~~~~---i~~~~~~~----~~~~llenq~~~~~e~~~~l~e~~~~~~g~~~~~~~ 66 (468) |. .-++|.++= .--++.+.... -+...+.. -...++.+..........-+.... .. T Consensus 43 l~~e~~~l~~~i~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~--------- 112 (390) T protein:vir:81 43 LFATVGNLSAEVQAARQRVAELEGNGAGGDVQHVSVGDMFVASEQFQASAGRWNDRSARATMNIKAAL-NT--------- 112 (390) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHhcccccccccccchhhhhhhHHHHHHHHHHhhhhhhhhhHHHHHH-Hh--------- Confidence 00 000110000 00011111000 00000000 000000000000000000000000 00 Q ss_pred ccccccccccccccccce-ehhhhHHhhhhhhhhheeeeecCCccceeeeeeeeeecCCCCCcccccccccccccccccc Q lcl|NC_015288. 67 GSALGSANTAGLAGFDPV-LISLVRRAMPNLMAYDVCGVQPMSGPTGLIFAMRSRYENQAGEEALFNEPDAGFTAGLDAT 145 (468) Q Consensus 67 ~~~~~st~tg~~~~~~P~-Lv~l~RRa~~~LIa~DI~GVQPmTGPTGLIFAMRsrY~~qsG~EA~fnEa~t~fSG~~~~~ 145 (468) . ..++++.+-.-..|. .-.++++..+..+-.+++.+.||++++.-+.-.. +.++. + .+ T Consensus 113 -~-~~~~~~~~g~~~~~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~~~~~----~~~~~-a-------~~------- 171 (390) T protein:vir:81 113 -A-STDAAGSAGALTTPNRLPGFITPPDARLTVRDLIGSGRTDSALIEYVQET----GFVNN-A-------AI------- 171 (390) T ss_pred -h-ccccccCCcceechhhhHHHHHHHhhhhhhhhhcceeeccCCceEEEEEe----cCCcc-e-------ee------- Confidence 0 000111100011111 1123444445566788999999988764332111 11000 0 00 Q ss_pred ccccccccccccccCcccccccccccccccccccccccchhhhhccCCCCCccccceeEEEEEEEEeecccccceecHHH Q lcl|NC_015288. 146 TGAYTPRTGAGVGGDAEGNNPALLNDSSPGTYETPRGFSREDLEQAGDAGKLFREMSFSIEKTSVTAKSRALKAEYTLEL 225 (468) Q Consensus 146 ~~~~~~~~~~~~~~~~~g~~~~~~~~a~~g~~t~~~gm~Ta~aE~lG~~g~~f~EMaFsIeK~tVtAKSRaLKAEYTvEL 225 (468) .+ | +..+++-..++++.+.+.|.-+-...+|-|| T Consensus 172 ---------------------------------------v~--E-----g~~~~~~~~~~~~i~~~~~k~~~~~~is~el 205 (390) T protein:vir:81 172 ---------------------------------------VA--E-----GALKPESSLKFAKKTDTTHVIAHTMKATRQI 205 (390) T ss_pred ---------------------------------------ec--C-----CcccccccceeeEEEEeeeEEEEeehhhHHH Confidence 00 0 1122223333444444444444456679999 Q ss_pred HHhHHHhhCCChhHHHHHHHHHHHHHHhhHHHHHHHhhhcchhhccccccceeeeeecC------CcchhHHHHHHHHHH Q lcl|NC_015288. 226 AQDLKAIHGLDAEQELANILSSEVLAEINREVVRRVYSVAKPGAANNVANAGIFDLDVD------SNGRWSVEKFKGLLF 299 (468) Q Consensus 226 AQDLkAiHGLDAE~ELanILStEImlEINREII~~l~~vA~~~k~~~~~~~Gv~Dl~~~------~~~rw~~e~~~~l~~ 299 (468) .+|- . +.++.|.+-|+..|...+|+-||.- ...+-...|++..... ..+....+....+++ T Consensus 206 l~d~--~---~~~~~i~~~l~~~~~~~~d~a~l~G--------~g~~~~~~Gi~~~~~~~~~~~~~~~~~~~~~~~~~~~ 272 (390) T protein:vir:81 206 LSDA--P---QLASYMNNRLIRGLKVKEDAEILRG--------TGANDGLLGLIPQATTYAAPTTIAGATRVDQLRLAML 272 (390) T ss_pred HHhH--H---HHHHHHHHHHHHHHHHHHHHHHHhc--------CCCCCcccceeecccccccccccccchhHHHHHHHHH Confidence 9984 2 4788899999999998888887742 1111123444332111 111122232222222 Q ss_pred HHHHHHHHHHHHhccCCccEEEEchhHHHHHhhccccccccccccccccccccccCCCceeEEEecCCeEEEEccccccC Q lcl|NC_015288. 300 QIERDCNAIAQDTRRGKGNFLICSADVASALAMAGVLDYSSGLTGAGGPAIGTVDDTGNLAVGTINGRIKVYVDPYAANL 379 (468) Q Consensus 300 ~i~~ean~i~q~T~rg~~n~~v~S~~va~~L~~~G~~~~~~~~~~~~~~~~~~~D~t~~~~~G~l~~~~~vy~D~Ya~~~ 379 (468) .. ...-...+.+|+||.....|.. +..+.+ + . ... +... .-.++|. |++|++.... T Consensus 273 --------~~-~~~~~~~~~~v~~~~~~~~l~~---lkd~~G---~-~--l~~-~~~~-~~~~~l~-G~pv~~~~~~--- 328 (390) T protein:vir:81 273 --------QA-SLAEYNPSGIVINPIDWAAIEL---AKDANN---Q-Y--LIG-NARG-TLTPTLW-GLPVVATQAM--- 328 (390) T ss_pred --------hh-ccccCCCCEEEEcHHHHHHHHH---hhcCCC---c-e--eec-Cccc-ccCceec-ceeeEEcCCC--- Confidence 21 1223455668899999888853 222111 0 0 000 1111 1123553 5688887543 Q ss_pred CCcceEEEEEecCCcccceeEEcccccccccccc--CC---ccccceeeeeeecce-eecC--cccccCcccccCChhhh Q lcl|NC_015288. 380 SDKHYYVVGYKGTSPYDAGLFYCPYVPLQMVRSI--DP---NNFQPKIGFKTRYGM-VSNP--FVTTNGLYSGTPDGETL 451 (468) Q Consensus 380 s~~dY~~vG~Kg~~~~d~glfyaPYv~~~~~~~~--Dp---~s~qP~~g~~tRY~l-~~nP--f~~~~~~~~~~~~~~~~ 451 (468) |..-+++|--.. .++. +.-..+...+ .+ .+-+=.+=...|++. +.+| |+.. T Consensus 329 -p~~~~~~gd~~~-----~~~~--~~~~~~~v~~~~~~~~~~~~~v~~r~~~r~d~~v~~~~a~v~~------------- 387 (390) T protein:vir:81 329 -APGEFLVGAFDL-----AAQI--FDQWDARVEIGYVGEDFQRNMITVLAEERLALVVYRPEALISG------------- 387 (390) T ss_pred -CCCcEEEEehhc-----eEEE--EEecceEEEEecccchhhcCcEEEEEEEeeccEEecccceEEE------------- Confidence 333344442110 0000 0001111101 01 112223334556665 3344 3322 Q ss_pred hhccCceeeeEEee Q lcl|NC_015288. 452 TPSTNMYYRRVQVT 465 (468) Q Consensus 452 ~~~~N~y~r~~~v~ 465 (468) -+. T Consensus 388 -----------t~a 390 (390) T protein:vir:81 388 -----------SFA 390 (390) T ss_pred -----------EeC Confidence 111 No 93 >protein:vir:1328 Length: 392 # NCBI annotation: gp36 # Family: family:all:21 # MgeID: mge:28 # MgeName: phi-C31 # Cross-refs: genbank:acc:NP_047927;swissprot:trembl:q9zwv6;genbank:gi:9631145;uniprot:Q9ZWV6;genbank:GeneID:2715889 Probab=26.87 E-value=1.9 Score=19.05 Aligned_cols=331 Identities=13% Similarity=0.084 Sum_probs=108.2 Q ss_pred Ccc-----hHHHHHhhhhhhcCCccccccchhh------hhhhhh---hhhhHHHHHhhhhhhhhhccccccCccc---- Q lcl|NC_015288. 1 MFN-----AEHLQEKWSPVLNNEAANPIADRYK------KAVTSV---LLENQERFLREERGMLQEVAVNSLGAGT---- 62 (468) Q Consensus 1 ~~~-----~~~l~~kw~p~l~~~~~~~i~~~~~------~~~~~~---llenq~~~~~e~~~~l~e~~~~~~g~~~---- 62 (468) +.. .++|++...-+-+.+-..+.....+ +.+-.+ .+|.+++. ..+.+...+..+.++ T Consensus 9 l~e~r~~~~~e~~~l~~~~~~~~~~~e~~~~~~~l~~e~~~l~~~i~~~~e~~~~~-----~~~~~~~~~~~~~~~~~~~ 83 (392) T protein:vir:13 9 NFEARERATAELRSLTDEFAGKEMTAEAREKEERLLTAVADFDGRIKRGIDAIKAT-----DAVTSLLSGLQGSGSGAQR 83 (392) T ss_pred HHHHHHHHHHHHHHHHHHhhcccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----HHHHHHhcccCCcccchhh Confidence 111 1122222221111111111111111 111111 11111110 001100000111000 Q ss_pred --------------c------ccccccccccccccccccccee-hhhhHHhhh-hhhhhheeeeecCCccceeeeeeeee Q lcl|NC_015288. 63 --------------V------SPGGSALGSANTAGLAGFDPVL-ISLVRRAMP-NLMAYDVCGVQPMSGPTGLIFAMRSR 120 (468) Q Consensus 63 --------------~------~~~~~~~~st~tg~~~~~~P~L-v~l~RRa~~-~LIa~DI~GVQPmTGPTGLIFAMRsr 120 (468) . .........|++++-.-.-|.+ -.++.+... ..+...++-|=|+++...+-+-.. . T Consensus 84 ~~~~~~~~~~r~g~~~~~~~~~~~~~~~~~t~~~~g~~~~~~~~~~~i~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~-~ 162 (392) T protein:vir:13 84 SADHDDDAVLRAGNLGEARSFEFAPEKRDGTKAGNPNVLSRTLYGQLIAQAVERSAIMRGGASTFTTSDANPMDFTVI-T 162 (392) T ss_pred hhhHHHHHHHhccchhhhHHHHhhhhhhcccccCCCccccccchHHHHHHHHhhhhhhhhcceeeecCCCceeEEEEE-c Confidence 0 0000011112221111111111 111111111 123334444433332211111100 0 Q ss_pred ecCCCCCccccccccccccccccccccccccccccccccCcccccccccccccccccccccccchhhhhccCCCCCcccc Q lcl|NC_015288. 121 YENQAGEEALFNEPDAGFTAGLDATTGAYTPRTGAGVGGDAEGNNPALLNDSSPGTYETPRGFSREDLEQAGDAGKLFRE 200 (468) Q Consensus 121 Y~~qsG~EA~fnEa~t~fSG~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~a~~g~~t~~~gm~Ta~aE~lG~~g~~f~E 200 (468) . +..+ +. .++ +..+++ T Consensus 163 --~--~~~a----------------------------------------------------~~-v~E-------~~~~~~ 178 (392) T protein:vir:13 163 --G--RATA----------------------------------------------------GI-VGE-------TAEIPE 178 (392) T ss_pred --C--Ccce----------------------------------------------------ee-ecc-------cccccc Confidence 0 0000 00 011 223344 Q ss_pred ceeEEEEEEEEeecccccceecHHHHHhHHHhhCCChhHHHHHHHHHHHHHHhhHHHHHHHhhhcchhhccccccceeee Q lcl|NC_015288. 201 MSFSIEKTSVTAKSRALKAEYTLELAQDLKAIHGLDAEQELANILSSEVLAEINREVVRRVYSVAKPGAANNVANAGIFD 280 (468) Q Consensus 201 MaFsIeK~tVtAKSRaLKAEYTvELAQDLkAiHGLDAE~ELanILStEImlEINREII~~l~~vA~~~k~~~~~~~Gv~D 280 (468) -..++++++...|.-+-...+|-||.+|= ..|.++.|.+-|...|..-+|..||.- .-.+ .+.|++. T Consensus 179 ~~~~f~~v~~~~~k~~~~~~iS~ell~ds----~~~l~~~i~~~l~~~i~~~~d~~~l~G--------~Gt~-~p~Gil~ 245 (392) T protein:vir:13 179 SYPATTQRSMGGFKYGFASVVSYEFATDQ----VLDLVGFLVSDAGPAIGDAMGRHFLTG--------TGTG-QPRGILT 245 (392) T ss_pred cccceeeEEeeeeeEEeeehhHHHHHhcc----hHHHHHHHHHHHHHHHHHHHHHHHhcc--------cCCc-ccccccc Confidence 44455555555555555678899999983 357889999999999999999888851 1001 2334432 Q ss_pred eecCCcc--hh------HHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEchhHHHHHhhcccccccccccccccccccc Q lcl|NC_015288. 281 LDVDSNG--RW------SVEKFKGLLFQIERDCNAIAQDTRRGKGNFLICSADVASALAMAGVLDYSSGLTGAGGPAIGT 352 (468) Q Consensus 281 l~~~~~~--rw------~~e~~~~l~~~i~~ean~i~q~T~rg~~n~~v~S~~va~~L~~~G~~~~~~~~~~~~~~~~~~ 352 (468) .....+. .| ..+....+.+ ..... -+..+ ..|+++.....|.. +..+.+ + .... T Consensus 246 ~~~~~~~~~~~~~~~~~~~d~l~~~~~-------~l~~~-~~~~a-~~v~n~~~~~~l~~---lkd~~G---~---~l~~ 307 (392) T protein:vir:13 246 DATGANAAFGEADADSKVSDALIDLFH-------EVPSA-YRKNA-KFVVNDLRAAQMRK---LKDANG---Q---YLWQ 307 (392) T ss_pred ccccccccccccccccccHHHHHHHHH-------hhhhh-hhcCC-EEEEcHHHHHHHHH---hhccCC---c---eeec Confidence 2111110 00 0111111111 12111 23333 35778888777753 222211 0 0011 Q ss_pred ccCCCceeEEEecCCeEEEEccccccCCCcceEEEEEecCCcccceeEEccccccccccccCCccccceee--eeeecce Q lcl|NC_015288. 353 VDDTGNLAVGTINGRIKVYVDPYAANLSDKHYYVVGYKGTSPYDAGLFYCPYVPLQMVRSIDPNNFQPKIG--FKTRYGM 430 (468) Q Consensus 353 ~D~t~~~~~G~l~~~~~vy~D~Ya~~~s~~dY~~vG~Kg~~~~d~glfyaPYv~~~~~~~~Dp~s~qP~~g--~~tRY~l 430 (468) .+.+... -++|. |++|+++.++. .+=|++|-- +. .++.--......+..|+..-...++ ...|.+. T Consensus 308 ~~~~~g~-~~~l~-G~Pv~~~~~~~----~~~i~~Gdf--~~----~~i~~~~~~~i~~~~~~~~~~~~~~~r~~~r~d~ 375 (392) T protein:vir:13 308 SALTVGA-PDTFN-GKVVETDDGMP----ADKVLFADL--SK----YRVRFAGSLRVDRSVDAKFSTDQIVYRFLQRADG 375 (392) T ss_pred CCcCCCC-Cceec-ceeeEEcCCCC----CCcEEEeec--cc----eeEEeecceEEEeeccccccCCcEEEEEEEEecc Confidence 1111111 13564 58999986653 333433311 00 1111111112222223322222233 3334432 Q ss_pred -eecC--cccccCcccccCChhhhhhcc Q lcl|NC_015288. 431 -VSNP--FVTTNGLYSGTPDGETLTPST 455 (468) Q Consensus 431 -~~nP--f~~~~~~~~~~~~~~~~~~~~ 455 (468) ..|| |.... +..+| T Consensus 376 ~~~~~~A~~~~~-----------~~~aa 392 (392) T protein:vir:13 376 LLVDARGAKVLT-----------VTPAA 392 (392) T ss_pred EEecccceEEEE-----------eeccC Confidence 2334 22111 01111 No 94 >protein:vir:97053 Length: 390 # NCBI annotation: putative head protein # Family: family:all:585 # MgeID: mge:1653 # MgeName: OP1 # Cross-refs: genbank:acc:YP_453565;genbank:gi:84662600;genbank:GeneID:5142468 Probab=26.47 E-value=1.9 Score=19.00 Aligned_cols=324 Identities=13% Similarity=0.041 Sum_probs=116.4 Q ss_pred Cc------------chHHHHHhhhhh------hcCCcccc-----------ccchhhhhhhhhhhhhHHHHHhhhhhhhh Q lcl|NC_015288. 1 MF------------NAEHLQEKWSPV------LNNEAANP-----------IADRYKKAVTSVLLENQERFLREERGMLQ 51 (468) Q Consensus 1 ~~------------~~~~l~~kw~p~------l~~~~~~~-----------i~~~~~~~~~~~llenq~~~~~e~~~~l~ 51 (468) +. .-++|.++=.-+ ++.++.+. ......++......+-+ ....... . T Consensus 32 ~~~e~~~~~~~~~~e~~~l~~~i~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~-~ 107 (390) T protein:vir:97 32 LNASARSKVDELFATVGNLSAEVQAARQRVAELEGNGAGGDVQHVSVGDMFVASEQFQASTGRWNDRS---ARATMNI-K 107 (390) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccccccccccchhhhhhhHHHHHHHHHhhhhh---hhhhhHH-H Confidence 10 001111111100 00111100 00001111111111100 0000000 0 Q ss_pred hccccccCcccccccccccccccccccccccceehhhhHHhhhhhhhhheeeeecCCccceeeeeeeeeecCCCCCcccc Q lcl|NC_015288. 52 EVAVNSLGAGTVSPGGSALGSANTAGLAGFDPVLISLVRRAMPNLMAYDVCGVQPMSGPTGLIFAMRSRYENQAGEEALF 131 (468) Q Consensus 52 e~~~~~~g~~~~~~~~~~~~st~tg~~~~~~P~Lv~l~RRa~~~LIa~DI~GVQPmTGPTGLIFAMRsrY~~qsG~EA~f 131 (468) .. .......+++.++..-....+-.++++..+..+-.+++.+-||++++.-+--.. +.++. + T Consensus 108 ~~-----------~~~~~~~~~~~~g~lip~~~~~~ii~~~~~~~~i~~~~~~~~~~~~~~~~~~~~----~~~~~-a-- 169 (390) T protein:vir:97 108 AA-----------LNTASTDAAGSAGALTTPNRLPGFITPPDARLTVRDLIGSGRTDSALIEYVQET----GFVNN-A-- 169 (390) T ss_pred HH-----------HHhhhcccccccccccchhhhHHHHHHHhhhhhhHhhcceeeccCCceEEEEEe----cCCcc-e-- Confidence 00 000001111111111111122334444555666778899999987764332111 10000 0 Q ss_pred ccccccccccccccccccccccccccccCcccccccccccccccccccccccchhhhhccCCCCCccccceeEEEEEEEE Q lcl|NC_015288. 132 NEPDAGFTAGLDATTGAYTPRTGAGVGGDAEGNNPALLNDSSPGTYETPRGFSREDLEQAGDAGKLFREMSFSIEKTSVT 211 (468) Q Consensus 132 nEa~t~fSG~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~a~~g~~t~~~gm~Ta~aE~lG~~g~~f~EMaFsIeK~tVt 211 (468) .|- + | +..+++-..++++++.. T Consensus 170 -----~~v----------------------------------------------~--E-----g~~~~~~~~~~~~i~~~ 191 (390) T protein:vir:97 170 -----AIV----------------------------------------------A--E-----GALKPESSLKFAKKTDT 191 (390) T ss_pred -----eee----------------------------------------------c--C-----CccccccccceeEEEEe Confidence 000 0 0 11222333334444555 Q ss_pred eecccccceecHHHHHhHHHhhCCChhHHHHHHHHHHHHHHhhHHHHHHHhhhcchhhccccccceeeeeecCCc--chh Q lcl|NC_015288. 212 AKSRALKAEYTLELAQDLKAIHGLDAEQELANILSSEVLAEINREVVRRVYSVAKPGAANNVANAGIFDLDVDSN--GRW 289 (468) Q Consensus 212 AKSRaLKAEYTvELAQDLkAiHGLDAE~ELanILStEImlEINREII~~l~~vA~~~k~~~~~~~Gv~Dl~~~~~--~rw 289 (468) .|.-+-...+|-||.+|-- +.++.|.+-|+..|...+|+.||.- . | .+-.+.|++....... .-. T Consensus 192 ~~k~~~~~~is~ell~ds~-----~l~~~i~~~la~a~~~~~d~a~l~G----~--g--~~~~p~Gi~~~~~~~~~~~~~ 258 (390) T protein:vir:97 192 THVIAHTMKATRQILSDAP-----QLASYMNNRLIRGLKVKEDAEILRG----T--G--ANDGLLGLIPQATTYAAPTTI 258 (390) T ss_pred eeeEEEeehhhHHHHHhHH-----HHHHHHHHHHHHHHHHHHHHHHhhc----C--C--CCccccceeeccccccccccc Confidence 5554556789999999842 4788888888888888888877742 1 1 1112344433211110 000 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEchhHHHHHhhccccccccccccccccccccccCCCceeEEEecCCeE Q lcl|NC_015288. 290 SVEKFKGLLFQIERDCNAIAQDTRRGKGNFLICSADVASALAMAGVLDYSSGLTGAGGPAIGTVDDTGNLAVGTINGRIK 369 (468) Q Consensus 290 ~~e~~~~l~~~i~~ean~i~q~T~rg~~n~~v~S~~va~~L~~~G~~~~~~~~~~~~~~~~~~~D~t~~~~~G~l~~~~~ 369 (468) ...- .+.....+... ....-...+-+|+||+....|.. +..+.+ .........+ -.++|. |++ T Consensus 259 ~~~~----~~d~~~~~~~~-~~~~~~~~~~~v~n~~~~~~L~~---lkd~~G------~~l~~~~~~~--~~~~l~-G~p 321 (390) T protein:vir:97 259 AGAT----RVDQLRLAMLQ-ASLAEYPASGIVINPIDWAAIEL---AKDANN------QYLIGNARGT--LTPTLW-GLP 321 (390) T ss_pred cccc----hHHHHHHHHHh-hccccCCCCEEEEcHHHHHHHHH---hhcCCC------ceeecCccCC--CCceec-cee Confidence 0000 11111111111 12233455668899999888863 222211 0011111111 124554 678 Q ss_pred EEEccccccCCCcceEEEEEecCCcccceeEEccccccccccccCC---ccccceeeeeeecceee-cCcccccCccccc Q lcl|NC_015288. 370 VYVDPYAANLSDKHYYVVGYKGTSPYDAGLFYCPYVPLQMVRSIDP---NNFQPKIGFKTRYGMVS-NPFVTTNGLYSGT 445 (468) Q Consensus 370 vy~D~Ya~~~s~~dY~~vG~Kg~~~~d~glfyaPYv~~~~~~~~Dp---~s~qP~~g~~tRY~l~~-nPf~~~~~~~~~~ 445 (468) |+++... |.+-+++|--. .++++.....+.....-+. .+-+=.+-...||++.+ +|= T Consensus 322 V~~~~~~----~~~~~~~gd~~-----~~~~~~~~~~~~i~~~~~~~~f~~~~~~~r~~~r~d~~v~~~~---------- 382 (390) T protein:vir:97 322 VVATQAM----APGEFLVGAFD-----LAAQIFDQWDARVEIGYVNDDFQRNMVTVLAEERLALVVYRPE---------- 382 (390) T ss_pred eEEcCCC----CCCcEEEEecc-----ceEEEEEecceEEEEeecccccccCcEEEEEEEeeccEEeccc---------- Confidence 8887543 33334444210 0111111111111111111 12222334445777654 230 Q ss_pred CChhhhhhccCceeeeEEee Q lcl|NC_015288. 446 PDGETLTPSTNMYYRRVQVT 465 (468) Q Consensus 446 ~~~~~~~~~~N~y~r~~~v~ 465 (468) .|.++-+. T Consensus 383 ------------a~v~~~~a 390 (390) T protein:vir:97 383 ------------ALITGSFA 390 (390) T ss_pred ------------cEEEEEeC Confidence 11111111 No 95 >protein:vir:100135 Length: 418 # NCBI annotation: gp5 # Family: family:all:585 # MgeID: mge:1639 # MgeName: phi1026b # Cross-refs: genbank:acc:NP_945035;genbank:gi:38707895;genbank:GeneID:2744182 Probab=26.45 E-value=1.9 Score=19.00 Aligned_cols=332 Identities=11% Similarity=0.036 Sum_probs=116.8 Q ss_pred CcchHHHHHhhhhhhcCCccccccchhhhhhhh---h--hhhhHHHHHhh-------------------------hhhhh Q lcl|NC_015288. 1 MFNAEHLQEKWSPVLNNEAANPIADRYKKAVTS---V--LLENQERFLRE-------------------------ERGML 50 (468) Q Consensus 1 ~~~~~~l~~kw~p~l~~~~~~~i~~~~~~~~~~---~--llenq~~~~~e-------------------------~~~~l 50 (468) .-..+++.++..--++. +-+..+.-++.+-. . -++.+..++.+ +.+.+ T Consensus 35 ~~e~~~~~e~~~~e~~~--~~~~~~e~~~~~~~l~~~~~~l~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 112 (418) T protein:vir:10 35 GDEVKSAGEKALAEAKR--AGDLGVETKATVDELLIKQGELQARLLEAEQKLARGGGSAELETPKTLGQLVTESEEMKGM 112 (418) T ss_pred HHHHHHHHHHHHHHHHh--hhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccccchhhhhhHHhhhHHHHHHH Confidence 00111111221111100 00000000000000 0 00000000000 00000 Q ss_pred hhcccccc--Cc---ccccccccccccccccccccccceeh-hhhHHhhhhhhhhheeeeecCCccceeeeeeeeeecCC Q lcl|NC_015288. 51 QEVAVNSL--GA---GTVSPGGSALGSANTAGLAGFDPVLI-SLVRRAMPNLMAYDVCGVQPMSGPTGLIFAMRSRYENQ 124 (468) Q Consensus 51 ~e~~~~~~--g~---~~~~~~~~~~~st~tg~~~~~~P~Lv-~l~RRa~~~LIa~DI~GVQPmTGPTGLIFAMRsrY~~q 124 (468) +....... .. ....... ...++++.+-.-.-|.+. .+++...+..+-.+++.+-||++++.-+ .| ..+. T Consensus 113 ~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~g~lvp~~~~~~ii~~~~~~~~l~~~~~~~~~~~~~~~~--~~--~~~~ 187 (418) T protein:vir:10 113 DGSARKSVRVRVDRKSIMNVPA-TVGSGVSGSNSLVVADRQAGIIAPPQRKMTIRDLLMPGQTSSSSIEY--TV--ETGF 187 (418) T ss_pred HHHHhhhhhhhhHHHHHHHhhh-hccCCCCCCccccchhHHHHHHHHHhhhhhHHhhcceeeccCCceeE--EE--EecC Confidence 00000000 00 0000000 001111111111112221 3445555667788889999998775321 11 0000 Q ss_pred CCCccccccccccccccccccccccccccccccccCcccccccccccccccccccccccchhhhhccCCCCCccccceeE Q lcl|NC_015288. 125 AGEEALFNEPDAGFTAGLDATTGAYTPRTGAGVGGDAEGNNPALLNDSSPGTYETPRGFSREDLEQAGDAGKLFREMSFS 204 (468) Q Consensus 125 sG~EA~fnEa~t~fSG~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~a~~g~~t~~~gm~Ta~aE~lG~~g~~f~EMaFs 204 (468) . ..+ .|- +| +...++-..+ T Consensus 188 ~-~~a-------~~v------------------------------------------------~E-----~~~~~~~~~~ 206 (418) T protein:vir:10 188 T-NNA-------AAV------------------------------------------------AE-----GAQKPTSDLK 206 (418) T ss_pred C-Cce-------eee------------------------------------------------cc-----Cccccccccc Confidence 0 000 000 00 1122333345 Q ss_pred EEEEEEEeecccccceecHHHHHhHHHhhCCChhHHHHHHHHHHHHHHhhHHHHHHHhhhcchhhccccccceeeeee-- Q lcl|NC_015288. 205 IEKTSVTAKSRALKAEYTLELAQDLKAIHGLDAEQELANILSSEVLAEINREVVRRVYSVAKPGAANNVANAGIFDLD-- 282 (468) Q Consensus 205 IeK~tVtAKSRaLKAEYTvELAQDLkAiHGLDAE~ELanILStEImlEINREII~~l~~vA~~~k~~~~~~~Gv~Dl~-- 282 (468) +++++..+|.-+-...+|-||.||.- |.++.|.+-|+..|..-+|+-||.- . ..+-.+.|++-.. T Consensus 207 f~~v~~~~~k~~~~~~is~ell~ds~-----~l~~~i~~~l~~a~~~~~d~a~l~G----~----g~~~~p~Gi~~~~~~ 273 (418) T protein:vir:10 207 FNLKNQPVRTIAHLFKASRQILDDAP-----ALQSYIDGRARYGLQLTEEGQILKG----D----GTGANILGILPQASA 273 (418) T ss_pred eeeEEEeeeeEEEeehhhHHHHHhHH-----HHHHHHHHHHHHHHHHHHHHHHhcc----C----CCCcccccccccccc Confidence 56666666666666789999999852 4677788777777777777777631 1 0011122332211 Q ss_pred ------cCCcchhHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEchhHHHHHhhccccccccccccccccccccccCC Q lcl|NC_015288. 283 ------VDSNGRWSVEKFKGLLFQIERDCNAIAQDTRRGKGNFLICSADVASALAMAGVLDYSSGLTGAGGPAIGTVDDT 356 (468) Q Consensus 283 ------~~~~~rw~~e~~~~l~~~i~~ean~i~q~T~rg~~n~~v~S~~va~~L~~~G~~~~~~~~~~~~~~~~~~~D~t 356 (468) +.+... .+....+++. + ...-+..+-+|||+.....|.. +..+.+ + .+.. +.+ T Consensus 274 ~~~~~~~~~~~~--~~~i~~~~~~-------~--~~~~~~~~~~v~n~~~~~~L~~---lkd~~G---~---~i~~-~~~ 332 (418) T protein:vir:10 274 FMPSITLANATP--IDKIRLALLQ-------A--VLAEFPATGIVLNPIDWASIEL---TKDSQG---R---YIVG-NPV 332 (418) T ss_pred cccccccccccc--HHHHHHHHHh-------h--ccccCCCCEEEEcHHHHHHHHH---hhcCCC---c---eecc-ccc Confidence 111111 1212112111 1 2233455668999999888853 211111 0 0111 111 Q ss_pred CceeEEEecCCeEEEEccccccCCCcceEEEEEecCCc-----ccceeEEccccccccccccCCccccceeeeeeeccee Q lcl|NC_015288. 357 GNLAVGTINGRIKVYVDPYAANLSDKHYYVVGYKGTSP-----YDAGLFYCPYVPLQMVRSIDPNNFQPKIGFKTRYGMV 431 (468) Q Consensus 357 ~~~~~G~l~~~~~vy~D~Ya~~~s~~dY~~vG~Kg~~~-----~d~glfyaPYv~~~~~~~~Dp~s~qP~~g~~tRY~l~ 431 (468) . .-.|+|. |++|+++.+.. .+=+++|--.... .+-.+=..||....| ...+=.+=+..|++.. T Consensus 333 ~-~~~~~l~-G~pV~~~~~~p----~~~~~~gd~s~~~~~~~~~~~~i~~~~~~~~~f------~~~~~~~r~~~~~d~~ 400 (418) T protein:vir:10 333 N-GTTPRLW-NLPVVETQAMT----ANEFLVGAFSMAAQIFDRMEIEVLLSTENVDDF------EKNMVSIRAEERLALA 400 (418) T ss_pred c-CCCceec-ceeeEEcCCCC----CCcEEEeeccceEEEEEecceEEEEecccchhh------hcCceEEEEEEeeccE Confidence 1 1135665 47999886543 2223333210000 000011122211111 1122233344566654 Q ss_pred e-cC--cccccCcccccCCh Q lcl|NC_015288. 432 S-NP--FVTTNGLYSGTPDG 448 (468) Q Consensus 432 ~-nP--f~~~~~~~~~~~~~ 448 (468) + +| |+...--.. -.| T Consensus 401 ~~~~~a~~~~~~~~~--~~g 418 (418) T protein:vir:10 401 VYRPESFVTGALVEQ--AGG 418 (418) T ss_pred EecccceEEEEeccC--CCC Confidence 3 34 332211110 011 No 96 >protein:vir:8102 Length: 543 # NCBI annotation: gp6 # Family: family:all:21 # MgeID: mge:152 # MgeName: Che9c # Cross-refs: genbank:acc:NP_817683;genbank:gi:29566114;genbank:GeneID:1259308 Probab=26.08 E-value=2 Score=18.95 Aligned_cols=322 Identities=13% Similarity=0.052 Sum_probs=107.1 Q ss_pred Cc-----chHHHHHhhhhhhcC-----------------CccccccchhhhhhhhhhhhhHHHHHhhhhh-hhhhccccc Q lcl|NC_015288. 1 MF-----NAEHLQEKWSPVLNN-----------------EAANPIADRYKKAVTSVLLENQERFLREERG-MLQEVAVNS 57 (468) Q Consensus 1 ~~-----~~~~l~~kw~p~l~~-----------------~~~~~i~~~~~~~~~~~llenq~~~~~e~~~-~l~e~~~~~ 57 (468) |. .-|.|..+..-+-.. .....-....++.-...+.+.+...++.... .+.+.. . T Consensus 173 ~~~~~~~~~e~l~~~~e~~~~~~~~~~~~~d~~e~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~l~~~e~~~~~~~~--~ 250 (543) T protein:vir:81 173 LRARALSAIEKMQGASDNVRAAATKIIERFDDEDSTLARQCLATSSPAYLRAWSKMARNPHAAILTEEEKRAINEVR--A 250 (543) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhhhHHHHHHHhhHHHHhhhhhhhhhhhhh--h Confidence 10 001111111110000 0000000111111111111111111111110 011100 0 Q ss_pred cCcccccccccccccccccccccccceehhhhHHhh-hhhhhhheeeeecCCccceeeeeeeeeecCCCCCccccccccc Q lcl|NC_015288. 58 LGAGTVSPGGSALGSANTAGLAGFDPVLISLVRRAM-PNLMAYDVCGVQPMSGPTGLIFAMRSRYENQAGEEALFNEPDA 136 (468) Q Consensus 58 ~g~~~~~~~~~~~~st~tg~~~~~~P~Lv~l~RRa~-~~LIa~DI~GVQPmTGPTGLIFAMRsrY~~qsG~EA~fnEa~t 136 (468) .+ .++++|++.--....-.++.+.. +.-+...++-|.|++|..- +- + . ..+..+ T Consensus 251 ~~-----------~t~~~gg~lip~~~~~~ii~~~~~~~~~l~~~~~~~~~~g~~~--~~-~--~--~~~~~a------- 305 (543) T protein:vir:81 251 MG-----------LTKADGGYLVPFQLDPTVIITSNGSLNDIRRFARQVVATGDVW--HG-V--S--SAAVQW------- 305 (543) T ss_pred cc-----------cccccCcccCchhhhhHHHHHHHhhhchhhhhcccccCCcceE--EE-E--e--cCCcce------- Confidence 00 01111221110111111222221 1123344455555543321 00 1 0 000000 Q ss_pred cccccccccccccccccccccccCcccccccccccccccccccccccchhhhhccCCCCCccccceeEEEEEEEEeeccc Q lcl|NC_015288. 137 GFTAGLDATTGAYTPRTGAGVGGDAEGNNPALLNDSSPGTYETPRGFSREDLEQAGDAGKLFREMSFSIEKTSVTAKSRA 216 (468) Q Consensus 137 ~fSG~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~a~~g~~t~~~gm~Ta~aE~lG~~g~~f~EMaFsIeK~tVtAKSRa 216 (468) .| .+ | +..+++-..+++.+++++|.-+ T Consensus 306 ~~----------------------------------------------v~--E-----g~~~~~~~~~~~~i~~~~~k~~ 332 (543) T protein:vir:81 306 SW----------------------------------------------DA--E-----FEEVSDDSPEFGQPEIPVKKAQ 332 (543) T ss_pred ee----------------------------------------------cc--c-----CccccccccccceeeeeeeeeE Confidence 00 00 0 1122333344566666666666 Q ss_pred ccceecHHHHHhHHHhhCCChhHHHHHHHHHHHHHHhhHHHHHHHhhhcchhhccccccceeee--------eecCCcch Q lcl|NC_015288. 217 LKAEYTLELAQDLKAIHGLDAEQELANILSSEVLAEINREVVRRVYSVAKPGAANNVANAGIFD--------LDVDSNGR 288 (468) Q Consensus 217 LKAEYTvELAQDLkAiHGLDAE~ELanILStEImlEINREII~~l~~vA~~~k~~~~~~~Gv~D--------l~~~~~~r 288 (468) =...+|-||.+|- + |.++.|.+-|...|...+|+-||.- ...+-.+.|++. ..+...+- T Consensus 333 ~~~~is~ell~d~-~----~~~~~i~~~l~~~~~~~~d~ail~G--------~Gt~~~p~Gi~~~~~~~~~~~~~~~~~~ 399 (543) T protein:vir:81 333 GFVPISIEALQDE-A----NVTETVALLFAEGKDELEAVTLTTG--------TGQGNQPTGIVTALAGTAAEIAPVTAET 399 (543) T ss_pred eeehhhHHHHhcc-H----HHHHHHHHHHHHHHHHHHHHHHhcc--------CCCCcccccchhhccccccccccccccc Confidence 6778999999873 2 7899999999999999999988741 000001222211 11111111 Q ss_pred hHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEchhHHHHHhhccccccccccccccccccccccCCCceeEEEecCCe Q lcl|NC_015288. 289 WSVEKFKGLLFQIERDCNAIAQDTRRGKGNFLICSADVASALAMAGVLDYSSGLTGAGGPAIGTVDDTGNLAVGTINGRI 368 (468) Q Consensus 289 w~~e~~~~l~~~i~~ean~i~q~T~rg~~n~~v~S~~va~~L~~~G~~~~~~~~~~~~~~~~~~~D~t~~~~~G~l~~~~ 368 (468) ...+.+..+...+. .. -+.. ..+|+++.+...|.. +..+.+ + +. ......+. -++|. |+ T Consensus 400 ~~~~~~~~~~~~l~-------~~-~~~~-~~~v~n~~~~~~l~~---lkd~~G---~-~l--~~~~~~g~--~~~l~-G~ 458 (543) T protein:vir:81 400 FALADVYAVYEQLA-------AR-HRRQ-GAWLANNLIYNKIRQ---FDTQGG---A-GL--WTTIGNGE--PSQLL-GR 458 (543) T ss_pred ccHHHHHHHHHhhh-------cc-ccCC-cEEEEcHHHHHHHHH---hhcCCC---c-ee--ccCcCCCC--Ccccc-ce Confidence 11222222222211 11 1112 246889998888853 222111 0 00 01111111 24564 47 Q ss_pred EEEEccccccCC--------------CcceEEEEEecCCcccceeEEccccccccccccCCccccceeeeeeeccee-ec Q lcl|NC_015288. 369 KVYVDPYAANLS--------------DKHYYVVGYKGTSPYDAGLFYCPYVPLQMVRSIDPNNFQPKIGFKTRYGMV-SN 433 (468) Q Consensus 369 ~vy~D~Ya~~~s--------------~~dY~~vG~Kg~~~~d~glfyaPYv~~~~~~~~Dp~s~qP~~g~~tRY~l~-~n 433 (468) +|++..++..+. ++.++++|..++... =..||+-. ..|-...+=.+=+..|+|.. .| T Consensus 459 pv~~~~~~~~~~~~~~~~~~~~i~~gd~~~~~i~~~~~~~i----~~~~~~~~----~~~~~~~~~~~~~~~r~d~~v~~ 530 (543) T protein:vir:81 459 PVGEAEAMDANWNTSASADNFVLLYGNFQNYVIADRIGMTV----EFIPHLFG----TNRRPNGSRGWFAYYRMGADVVN 530 (543) T ss_pred eeEEeccccccccccccCCcceEEEeeccceeEEeecccEE----EEeccccc----cchhhcCceEEEEEEeeccEeec Confidence 888775432110 011112222221111 11122100 01112223344445567664 34 Q ss_pred C--cccccCcccccCChhhhhhcc Q lcl|NC_015288. 434 P--FVTTNGLYSGTPDGETLTPST 455 (468) Q Consensus 434 P--f~~~~~~~~~~~~~~~~~~~~ 455 (468) | |+..+-.. .+ T Consensus 531 ~~A~~~l~~~~-----------~a 543 (543) T protein:vir:81 531 PNAFRLLNVET-----------AS 543 (543) T ss_pred ccceEEEEecc-----------cC Confidence 4 32211111 11 No 97 >protein:vir:101650 Length: 497 # NCBI annotation: gp13 # Family: family:all:585 # MgeID: mge:1515 # MgeName: 244 # Cross-refs: genbank:acc:YP_654768;genbank:gi:109302766;genbank:GeneID:4156084 Probab=25.11 E-value=2.1 Score=18.82 Aligned_cols=359 Identities=13% Similarity=0.052 Sum_probs=131.0 Q ss_pred CcchHHHHHhhhhhhcCCcc--ccccchhh------hhhhhhhh---hhHHHHHhhhhh-hhh-h------------ccc Q lcl|NC_015288. 1 MFNAEHLQEKWSPVLNNEAA--NPIADRYK------KAVTSVLL---ENQERFLREERG-MLQ-E------------VAV 55 (468) Q Consensus 1 ~~~~~~l~~kw~p~l~~~~~--~~i~~~~~------~~~~~~ll---enq~~~~~e~~~-~l~-e------------~~~ 55 (468) ....++++.++.-++..... ..|...-. +.-..... |...+....... ... + ... T Consensus 53 ~~~~~~~~~~~~~~~a~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 132 (497) T protein:vir:10 53 HERAQEMLKSLGGADAAKDGLDNDIPEVEVRNLKQIRKHLARAVIMNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAA 132 (497) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhHHHHHHHHHhhhHHHHhhhhhhhhhhhhhhhhhhhhhhhhHHHHH Confidence 33333333333333322110 11111000 00000000 000000000000 000 0 000 Q ss_pred cccCccc--c----ccccccccccccccc---ccccceehhhhHHhhhhhhhhheeeeecCCccceeeeeeeeeecCCCC Q lcl|NC_015288. 56 NSLGAGT--V----SPGGSALGSANTAGL---AGFDPVLISLVRRAMPNLMAYDVCGVQPMSGPTGLIFAMRSRYENQAG 126 (468) Q Consensus 56 ~~~g~~~--~----~~~~~~~~st~tg~~---~~~~P~Lv~l~RRa~~~LIa~DI~GVQPmTGPTGLIFAMRsrY~~qsG 126 (468) +..+... . ........++++++. ..+.+.++.+.| +..+..+++.+-||+++..- |-.. .+.. + T Consensus 133 ~~~~~~~~~~~~~~~~~~~~~~~~~~gg~~vp~~~~~~ii~~~~---~~~~i~~l~~~~~~~~~~~~-~~~~--~~~~-~ 205 (497) T protein:vir:10 133 ELMGAFADGETAPAAIGQNPFGSTGTFAPGILPTFLPGIVEQLF---YELSLADLISSRPVTSPNLS-YLTE--SAAH-N 205 (497) T ss_pred HHHHHHhhhhhhHHHHHhhhcccCcccccccchhhhHHHHHHHH---hhhhHHhhccccccCCCceE-EEEE--cCCC-C Confidence 0000000 0 000001111222222 123344444444 45567899999999887532 2211 0000 0 Q ss_pred CccccccccccccccccccccccccccccccccCcccccccccccccccccccccccchhhhhccCCCCCccccceeEEE Q lcl|NC_015288. 127 EEALFNEPDAGFTAGLDATTGAYTPRTGAGVGGDAEGNNPALLNDSSPGTYETPRGFSREDLEQAGDAGKLFREMSFSIE 206 (468) Q Consensus 127 ~EA~fnEa~t~fSG~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~a~~g~~t~~~gm~Ta~aE~lG~~g~~f~EMaFsIe 206 (468) ++ .| .+| +..+++...+++ T Consensus 206 -~a-------~w------------------------------------------------v~E-----~~~~~~s~~~f~ 224 (497) T protein:vir:10 206 -NA-------AA------------------------------------------------VAE-----AGTYPFSSEEFA 224 (497) T ss_pred -cc-------ee------------------------------------------------ecc-----Ccccccccccce Confidence 00 00 001 233455556677 Q ss_pred EEEEEeecccccceecHHHHHhHHHhhCCChhHHHHHHHHHHHHHHhhHHHHHH--------Hhhhcchhhccccc---- Q lcl|NC_015288. 207 KTSVTAKSRALKAEYTLELAQDLKAIHGLDAEQELANILSSEVLAEINREVVRR--------VYSVAKPGAANNVA---- 274 (468) Q Consensus 207 K~tVtAKSRaLKAEYTvELAQDLkAiHGLDAE~ELanILStEImlEINREII~~--------l~~vA~~~k~~~~~---- 274 (468) ++++.+|.-+-...+|-||++|-- +.++.|.+-|...|..-+|+.||.- |.+.+......... T Consensus 225 ~i~~~~~k~a~~~~iS~ell~d~~-----~l~~~i~~~l~~~i~~~~d~~~l~G~G~~~p~Gil~~~~~~~~~~~~~~~~ 299 (497) T protein:vir:10 225 RVYEQVGKVANALTITDEGLRDAP-----ELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFG 299 (497) T ss_pred eeEeeeeeeEeecHhHHHHHHhHH-----HHHHHHHHHHHHHHHHHHHHHhhcCCCcccccccccccccccccccccchh Confidence 788888777777889999999942 3789999999999999999998862 11111111100000 Q ss_pred ----cceeeeeecCCcchhHHHH-------HHHH--------------------HHHHHHHHHHHHHHhccCCccEEEEc Q lcl|NC_015288. 275 ----NAGIFDLDVDSNGRWSVEK-------FKGL--------------------LFQIERDCNAIAQDTRRGKGNFLICS 323 (468) Q Consensus 275 ----~~Gv~Dl~~~~~~rw~~e~-------~~~l--------------------~~~i~~ean~i~q~T~rg~~n~~v~S 323 (468) ..+..++..+..+.|.+.. +... +..-...+-...+.+....++-+|.+ T Consensus 300 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vmn 379 (497) T protein:vir:10 300 ATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMN 379 (497) T ss_pred hhhhhhhhhhhhcccccchhhhhhHHHHHHHHHhhhhhhhhccchhccccchhhhhhHHHHHHhhhhhhcccCCCeEEEc Confidence 0000111111111111110 0000 00001111222344555566677788 Q ss_pred hhHHHHHhhccccccccccccccccccccccCCCceeEEEecCCeEEEEccccccCCCcceEEEEEecCC------cccc Q lcl|NC_015288. 324 ADVASALAMAGVLDYSSGLTGAGGPAIGTVDDTGNLAVGTINGRIKVYVDPYAANLSDKHYYVVGYKGTS------PYDA 397 (468) Q Consensus 324 ~~va~~L~~~G~~~~~~~~~~~~~~~~~~~D~t~~~~~G~l~~~~~vy~D~Ya~~~s~~dY~~vG~Kg~~------~~d~ 397 (468) +.-...|.. ++.+-+.--..+... ..........++|+ |++|++.+... .-++ ++|--... ..+- T Consensus 380 ~~~~~~l~~---lkd~~G~~i~~~~~~-~~~~~~~~~~~~l~-G~pV~~t~~~~---~~~~-~~Gd~~~~~~~i~~r~~~ 450 (497) T protein:vir:10 380 PRDWELLRL---TKDANGQYMGGNFFG-NAYGNPVNGGKNIW-GVPVVTTPLIP---LGTI-LVGHFAPSVIQTARREGV 450 (497) T ss_pred hHHHHHHHH---hhcCCCceeccCccc-ccccccccCCceee-ceeeEecCCCC---CCce-EEeecccceEEEEEeccc Confidence 877766642 111111000000000 00000011123565 47777775432 2222 23311000 0011 Q ss_pred eeEEccccccccccccCCccccceeeeeeecce-eecC--cccccCcccccCChh Q lcl|NC_015288. 398 GLFYCPYVPLQMVRSIDPNNFQPKIGFKTRYGM-VSNP--FVTTNGLYSGTPDGE 449 (468) Q Consensus 398 glfyaPYv~~~~~~~~Dp~s~qP~~g~~tRY~l-~~nP--f~~~~~~~~~~~~~~ 449 (468) .+-..||....| .+.|=.+=+..|+++ +.+| |...+-... ..++ T Consensus 451 ~v~~~~~~~~~f------~~n~v~~r~~~r~~~~v~~p~A~~~l~~~~~--~~~~ 497 (497) T protein:vir:10 451 TMQMTNSNGTDF------VDGKVTVRAEERLGLLVYRPSAFQLIQLKKG--ATGS 497 (497) T ss_pred EEEeecccchhh------hcCcEEEEEEEeecceeeccccEEEEEecCC--ccCC Confidence 122222211111 122334444678866 6788 443321111 1111 No 98 >protein:vir:7855 Length: 497 # NCBI annotation: gp12 # Family: family:all:585 # MgeID: mge:150 # MgeName: CJW1 # Cross-refs: genbank:acc:NP_817462;genbank:gi:29565891;genbank:GeneID:1259081 Probab=25.11 E-value=2.1 Score=18.82 Aligned_cols=359 Identities=13% Similarity=0.052 Sum_probs=131.0 Q ss_pred CcchHHHHHhhhhhhcCCcc--ccccchhh------hhhhhhhh---hhHHHHHhhhhh-hhh-h------------ccc Q lcl|NC_015288. 1 MFNAEHLQEKWSPVLNNEAA--NPIADRYK------KAVTSVLL---ENQERFLREERG-MLQ-E------------VAV 55 (468) Q Consensus 1 ~~~~~~l~~kw~p~l~~~~~--~~i~~~~~------~~~~~~ll---enq~~~~~e~~~-~l~-e------------~~~ 55 (468) ....++++.++.-++..... ..|...-. +.-..... |...+....... ... + ... T Consensus 53 ~~~~~~~~~~~~~~~a~~~~~~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 132 (497) T protein:vir:78 53 HERAQEMLKSLGGADAAKDGLDNDIPEVEVRNLKQIRKHLARAVIMNPELKNATSFEKGTKFDVSFNVSAKAADPGTAAA 132 (497) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhHHHHHHHHHhhhHHHHhhhhhhhhhhhhhhhhhhhhhhhhHHHHH Confidence 33333333333333322110 11111000 00000000 000000000000 000 0 000 Q ss_pred cccCccc--c----ccccccccccccccc---ccccceehhhhHHhhhhhhhhheeeeecCCccceeeeeeeeeecCCCC Q lcl|NC_015288. 56 NSLGAGT--V----SPGGSALGSANTAGL---AGFDPVLISLVRRAMPNLMAYDVCGVQPMSGPTGLIFAMRSRYENQAG 126 (468) Q Consensus 56 ~~~g~~~--~----~~~~~~~~st~tg~~---~~~~P~Lv~l~RRa~~~LIa~DI~GVQPmTGPTGLIFAMRsrY~~qsG 126 (468) +..+... . ........++++++. ..+.+.++.+.| +..+..+++.+-||+++..- |-.. .+.. + T Consensus 133 ~~~~~~~~~~~~~~~~~~~~~~~~~~gg~~vp~~~~~~ii~~~~---~~~~i~~l~~~~~~~~~~~~-~~~~--~~~~-~ 205 (497) T protein:vir:78 133 ELMGAFADGETAPAAIGQNPFGSTGTFAPGILPTFLPGIVEQLF---YELSLADLISSRPVTSPNLS-YLTE--SAAH-N 205 (497) T ss_pred HHHHHHhhhhhhHHHHHhhhcccCcccccccchhhhHHHHHHHH---hhhhHHhhccccccCCCceE-EEEE--cCCC-C Confidence 0000000 0 000001111222222 123344444444 45567899999999887532 2211 0000 0 Q ss_pred CccccccccccccccccccccccccccccccccCcccccccccccccccccccccccchhhhhccCCCCCccccceeEEE Q lcl|NC_015288. 127 EEALFNEPDAGFTAGLDATTGAYTPRTGAGVGGDAEGNNPALLNDSSPGTYETPRGFSREDLEQAGDAGKLFREMSFSIE 206 (468) Q Consensus 127 ~EA~fnEa~t~fSG~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~a~~g~~t~~~gm~Ta~aE~lG~~g~~f~EMaFsIe 206 (468) ++ .| .+| +..+++...+++ T Consensus 206 -~a-------~w------------------------------------------------v~E-----~~~~~~s~~~f~ 224 (497) T protein:vir:78 206 -NA-------AA------------------------------------------------VAE-----AGTYPFSSEEFA 224 (497) T ss_pred -cc-------ee------------------------------------------------ecc-----Ccccccccccce Confidence 00 00 001 233455556677 Q ss_pred EEEEEeecccccceecHHHHHhHHHhhCCChhHHHHHHHHHHHHHHhhHHHHHH--------Hhhhcchhhccccc---- Q lcl|NC_015288. 207 KTSVTAKSRALKAEYTLELAQDLKAIHGLDAEQELANILSSEVLAEINREVVRR--------VYSVAKPGAANNVA---- 274 (468) Q Consensus 207 K~tVtAKSRaLKAEYTvELAQDLkAiHGLDAE~ELanILStEImlEINREII~~--------l~~vA~~~k~~~~~---- 274 (468) ++++.+|.-+-...+|-||++|-- +.++.|.+-|...|..-+|+.||.- |.+.+......... T Consensus 225 ~i~~~~~k~a~~~~iS~ell~d~~-----~l~~~i~~~l~~~i~~~~d~~~l~G~G~~~p~Gil~~~~~~~~~~~~~~~~ 299 (497) T protein:vir:78 225 RVYEQVGKVANALTITDEGLRDAP-----ELFNFVQGRLLEGIQRKEEVQLLAGGGYPGVNGLLQRSTGFTASSASSLFG 299 (497) T ss_pred eeEeeeeeeEeecHhHHHHHHhHH-----HHHHHHHHHHHHHHHHHHHHHhhcCCCcccccccccccccccccccccchh Confidence 788888777777889999999942 3789999999999999999998862 11111111100000 Q ss_pred ----cceeeeeecCCcchhHHHH-------HHHH--------------------HHHHHHHHHHHHHHhccCCccEEEEc Q lcl|NC_015288. 275 ----NAGIFDLDVDSNGRWSVEK-------FKGL--------------------LFQIERDCNAIAQDTRRGKGNFLICS 323 (468) Q Consensus 275 ----~~Gv~Dl~~~~~~rw~~e~-------~~~l--------------------~~~i~~ean~i~q~T~rg~~n~~v~S 323 (468) ..+..++..+..+.|.+.. +... +..-...+-...+.+....++-+|.+ T Consensus 300 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vmn 379 (497) T protein:vir:78 300 ATSATVSNVKFPADGTNGAFVGQDTVASLKYGRVVTGAAGSGSGVAGSYPTAAEIAENVFDAFVDIQLTLFQTPNAVVMN 379 (497) T ss_pred hhhhhhhhhhhhcccccchhhhhhHHHHHHHHHhhhhhhhhccchhccccchhhhhhHHHHHHhhhhhhcccCCCeEEEc Confidence 0000111111111111110 0000 00001111222344555566677788 Q ss_pred hhHHHHHhhccccccccccccccccccccccCCCceeEEEecCCeEEEEccccccCCCcceEEEEEecCC------cccc Q lcl|NC_015288. 324 ADVASALAMAGVLDYSSGLTGAGGPAIGTVDDTGNLAVGTINGRIKVYVDPYAANLSDKHYYVVGYKGTS------PYDA 397 (468) Q Consensus 324 ~~va~~L~~~G~~~~~~~~~~~~~~~~~~~D~t~~~~~G~l~~~~~vy~D~Ya~~~s~~dY~~vG~Kg~~------~~d~ 397 (468) +.-...|.. ++.+-+.--..+... ..........++|+ |++|++.+... .-++ ++|--... ..+- T Consensus 380 ~~~~~~l~~---lkd~~G~~i~~~~~~-~~~~~~~~~~~~l~-G~pV~~t~~~~---~~~~-~~Gd~~~~~~~i~~r~~~ 450 (497) T protein:vir:78 380 PRDWELLRL---TKDANGQYMGGNFFG-NAYGNPVNGGKNIW-GVPVVTTPLIP---LGTI-LVGHFAPSVIQTARREGV 450 (497) T ss_pred hHHHHHHHH---hhcCCCceeccCccc-ccccccccCCceee-ceeeEecCCCC---CCce-EEeecccceEEEEEeccc Confidence 877766642 111111000000000 00000011123565 47777775432 2222 23311000 0011 Q ss_pred eeEEccccccccccccCCccccceeeeeeecce-eecC--cccccCcccccCChh Q lcl|NC_015288. 398 GLFYCPYVPLQMVRSIDPNNFQPKIGFKTRYGM-VSNP--FVTTNGLYSGTPDGE 449 (468) Q Consensus 398 glfyaPYv~~~~~~~~Dp~s~qP~~g~~tRY~l-~~nP--f~~~~~~~~~~~~~~ 449 (468) .+-..||....| .+.|=.+=+..|+++ +.+| |...+-... ..++ T Consensus 451 ~v~~~~~~~~~f------~~n~v~~r~~~r~~~~v~~p~A~~~l~~~~~--~~~~ 497 (497) T protein:vir:78 451 TMQMTNSNGTDF------VDGKVTVRAEERLGLLVYRPSAFQLIQLKKG--ATGS 497 (497) T ss_pred EEEeecccchhh------hcCcEEEEEEEeecceeeccccEEEEEecCC--ccCC Confidence 122222211111 122334444678866 6788 443321111 1111 No 99 >protein:vir:1268 Length: 397 # NCBI annotation: hypothetical protein # Family: family:all:21 # MgeID: mge:329 # MgeName: phi-105 # Cross-refs: genbank:acc:NP_690760;genbank:gi:22855000;genbank:GeneID:955203 Probab=23.90 E-value=2.2 Score=18.66 Aligned_cols=321 Identities=12% Similarity=0.056 Sum_probs=113.8 Q ss_pred CcchHHHHHhhhhhhcC--------------CccccccchhhhhhhhhhhhhHHHHHhhhhhhhhhc-ccccc-Cccccc Q lcl|NC_015288. 1 MFNAEHLQEKWSPVLNN--------------EAANPIADRYKKAVTSVLLENQERFLREERGMLQEV-AVNSL-GAGTVS 64 (468) Q Consensus 1 ~~~~~~l~~kw~p~l~~--------------~~~~~i~~~~~~~~~~~llenq~~~~~e~~~~l~e~-~~~~~-g~~~~~ 64 (468) +=.-+.|.++..-+-+. ....+......+......-+.+.+..+.-.+++... +.+-. ...... T Consensus 39 ~~e~~~l~~~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~~~~~~ 118 (397) T protein:vir:12 39 LDEVKQLKNQIELMTEGRSLDVPDLPGGVNFVPEQERNPEGQRSQGQGNEERQQQYSKAFLKGLRGKRLTDEERDLLDSP 118 (397) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhcccccccchhhHHHHHHHHHHHHHHhccCCcHHHHHHHhhh Confidence 11111222222111100 000000000000000001111111111111111100 00000 000000 Q ss_pred cccccccc-cccccc---ccccceehhhhHHhhhhhhhhheeeeecCCccceeeeeeeeeecCCCCCccccccccccccc Q lcl|NC_015288. 65 PGGSALGS-ANTAGL---AGFDPVLISLVRRAMPNLMAYDVCGVQPMSGPTGLIFAMRSRYENQAGEEALFNEPDAGFTA 140 (468) Q Consensus 65 ~~~~~~~s-t~tg~~---~~~~P~Lv~l~RRa~~~LIa~DI~GVQPmTGPTGLIFAMRsrY~~qsG~EA~fnEa~t~fSG 140 (468) ....+.++ ++.|+. ..+.+.+ ++...+..+-.+++.+.||+++.|-+--.|.. ++..+ .|-+ T Consensus 119 ~~~a~~~~~~~~gg~lvP~~~~~~i---i~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~----~~~~a-------~~v~ 184 (397) T protein:vir:12 119 EFRAMSGINDEDGGILIPEDIGRQI---HEFKRQFEPLEQYVTVEPVTTRSGTRLLEKNA----DMVPF-------SPVE 184 (397) T ss_pred hhhhccccccccCcccCchhHHHHH---HHhhhhhhhHHhhcceeeccCCceeEEEEEec----CCcce-------eeec Confidence 00001111 122222 1222334 44444666778999999999988764322211 10000 0000 Q ss_pred cccccccccccccccccccCcccccccccccccccccccccccchhhhhccCC-CCCccccceeEEEEEEEEeecccccc Q lcl|NC_015288. 141 GLDATTGAYTPRTGAGVGGDAEGNNPALLNDSSPGTYETPRGFSREDLEQAGD-AGKLFREMSFSIEKTSVTAKSRALKA 219 (468) Q Consensus 141 ~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~a~~g~~t~~~gm~Ta~aE~lG~-~g~~f~EMaFsIeK~tVtAKSRaLKA 219 (468) . ++...+ +...|.++.|+..|..+- . T Consensus 185 E----------------------------------------------g~~~~~~~~~~~~~v~~~~~k~~~~-------~ 211 (397) T protein:vir:12 185 E----------------------------------------------LGNLPEIDQPRFTKVSYSIIDYGGI-------M 211 (397) T ss_pred c----------------------------------------------cccccccccccceeEEeeheeeEee-------e Confidence 0 000000 113466666666666554 4 Q ss_pred eecHHHHHhHHHhhCCChhHHHHHHHHHHHHHHhhHHHHHHHhhhcchhhccccccceeeeeecCCcchhHHHHHHHHHH Q lcl|NC_015288. 220 EYTLELAQDLKAIHGLDAEQELANILSSEVLAEINREVVRRVYSVAKPGAANNVANAGIFDLDVDSNGRWSVEKFKGLLF 299 (468) Q Consensus 220 EYTvELAQDLkAiHGLDAE~ELanILStEImlEINREII~~l~~vA~~~k~~~~~~~Gv~Dl~~~~~~rw~~e~~~~l~~ 299 (468) .+|-||.+|-- +|.++.|.+.|...|...+|+-||.-.-+ ..+.|+..+ +....+++ T Consensus 212 ~is~e~l~ds~----~~l~~~i~~~l~~~~~~~~d~~il~G~g~---------~~~~g~~~~----------~~i~~~~~ 268 (397) T protein:vir:12 212 TLSNSMLNDSD----QAIMTYVAKWFAKKSVVTRNNLILAAIAS---------LKKVDIDGL----------DGIKKALN 268 (397) T ss_pred hhhHHHHhhch----HHHHHHHHHHHHHHHHHHHHHHHHhcccc---------ccccccccH----------HHHHHHHh Confidence 59999998854 46788899999999999998888763221 123344322 11111111 Q ss_pred HHHHHHHHHHHHhccCCccEEEEchhHHHHHhhccccccccccccccccccccccCCCceeEEEecCCeEEEEcc-cccc Q lcl|NC_015288. 300 QIERDCNAIAQDTRRGKGNFLICSADVASALAMAGVLDYSSGLTGAGGPAIGTVDDTGNLAVGTINGRIKVYVDP-YAAN 378 (468) Q Consensus 300 ~i~~ean~i~q~T~rg~~n~~v~S~~va~~L~~~G~~~~~~~~~~~~~~~~~~~D~t~~~~~G~l~~~~~vy~D~-Ya~~ 378 (468) ... ..--..+..++|++.....|.. +..+.+ + .....+-+. ..-++|+| ++|++.. .... T Consensus 269 -------~~l-~~~~~~~a~~~~n~~~~~~L~~---lkd~~G---~---~l~~~~~~~-g~~~~l~G-~pv~~~~~~~~~ 329 (397) T protein:vir:12 269 -------VTL-DPMVAPGSIVLTNQDGYDWLDT---LKDGTG---R---YLLQPDPTN-PTKKLLDG-RPVVPFTNRVLK 329 (397) T ss_pred -------hcc-chhhhCCCEEEEcHHHHHHHHH---hhccCC---c---eeecccccC-CCCccccc-eeeEEecccccc Confidence 110 1111123457889988887753 111111 0 011111111 11245544 5766431 1100 Q ss_pred CCCcceEEEEEecCCcccceeEEccccc---------cccccc--cC--Cccccceeeeeeecceee-cC--cccccCcc Q lcl|NC_015288. 379 LSDKHYYVVGYKGTSPYDAGLFYCPYVP---------LQMVRS--ID--PNNFQPKIGFKTRYGMVS-NP--FVTTNGLY 442 (468) Q Consensus 379 ~s~~dY~~vG~Kg~~~~d~glfyaPYv~---------~~~~~~--~D--p~s~qP~~g~~tRY~l~~-nP--f~~~~~~~ 442 (468) ... | +.-++|+.|-. +.+... .+ -.+-+-.+-...|++..+ || |... T Consensus 330 ~~~---------~----~~~~~~gd~~~~~~~~~~~~~~i~~~~~~~~~f~~~~~~~r~~~r~d~~~~~~~a~~~~---- 392 (397) T protein:vir:12 330 TQK---------G----KAPLIIGNLKEAIVLFDREQQSIASTDTGAGAFETNSTKVRGIEREDVRKWDEDAVVFG---- 392 (397) T ss_pred cCC---------C----ccEEEEEehhceEEEEeecceEEEEeccccchhhcCceEEEEEEeeccEEecccceEEE---- Confidence 000 0 11122222110 001000 00 112334555566666543 33 2111 Q ss_pred cccCChhhhhhccCceeeeEEee Q lcl|NC_015288. 443 SGTPDGETLTPSTNMYYRRVQVT 465 (468) Q Consensus 443 ~~~~~~~~~~~~~N~y~r~~~v~ 465 (468) ++-++ T Consensus 393 ------------------~~t~~ 397 (397) T protein:vir:12 393 ------------------QITVE 397 (397) T ss_pred ------------------EEeeC Confidence 11111 No 100 >protein:vir:739 Length: 231 # NCBI annotation: major structural protein 4 # Family: family:all:522 # MgeID: mge:14 # MgeName: Tuc2009 # Cross-refs: genbank:acc:NP_108716;genbank:gi:13487838;genbank:GeneID:920884 Probab=22.58 E-value=2.4 Score=18.47 Aligned_cols=220 Identities=15% Similarity=0.133 Sum_probs=97.9 Q ss_pred cccccccccccccccccccccccchhhhhccCCCC-CccccceeEEEEEEEEeecccccceecHHHHHhHHHhhCCChhH Q lcl|NC_015288. 161 AEGNNPALLNDSSPGTYETPRGFSREDLEQAGDAG-KLFREMSFSIEKTSVTAKSRALKAEYTLELAQDLKAIHGLDAEQ 239 (468) Q Consensus 161 ~~g~~~~~~~~a~~g~~t~~~gm~Ta~aE~lG~~g-~~f~EMaFsIeK~tVtAKSRaLKAEYTvELAQDLkAiHGLDAE~ 239 (468) -.|.+.+. +.+++. -..++|.+..+. -+..+|+++ ..+++.|-+.=.-++|=|- .|.+ + =|.-. T Consensus 1 ~~~~~~Gd-------tit~P~--~iGda~~v~eG~~i~~~~l~~t--~~~atIk~~gk~~~itD~a--~l~~-~-gDp~~ 65 (231) T protein:vir:73 1 ENGINLAN-------LCEYPN--DIGDAADVAEGGEISLDKIGTT--TKSVTIKKAAKGTEITDEA--ALSG-Y-GDPIG 65 (231) T ss_pred CccccCCc-------eEEecc--cccchhhhcCCCcCChhhcccc--ceeeeEeeeccceeeeHHH--Hhhc-c-CchHH Confidence 11111111 111111 133445554322 345556654 4444445544333444322 2444 3 38889 Q ss_pred HHHHHHHHHHHHHhhHHHHHHHhhhcchhhccccccceeeeeecCCcchhHHHHHHHHHHHHHHHHHHHHHHhccCCccE Q lcl|NC_015288. 240 ELANILSSEVLAEINREVVRRVYSVAKPGAANNVANAGIFDLDVDSNGRWSVEKFKGLLFQIERDCNAIAQDTRRGKGNF 319 (468) Q Consensus 240 ELanILStEImlEINREII~~l~~vA~~~k~~~~~~~Gv~Dl~~~~~~rw~~e~~~~l~~~i~~ean~i~q~T~rg~~n~ 319 (468) |..+-|+..|+..++.||+..+-+.+...+ ..+++ +.+-....+ | ++ --....+ T Consensus 66 ea~~Q~~~~iA~kvD~di~~~~~~a~l~~~-------~~~t~-------d~i~~A~~~-f------gd-----e~~~~~v 119 (231) T protein:vir:73 66 ESNKQLGLSLANKVDDDLLKAAKTTSQTVS-------TKANV-------DGVQAALDI-F------ND-----EDAQAYV 119 (231) T ss_pred HHHHHHHHHHHHhhhHHHHHhhcccccccc-------ccccH-------HHHHHHHHH-h------cc-----ccccceE Confidence 999999999999999999987765443311 11111 011111111 1 11 1246679 Q ss_pred EEEchhHHHHHhhccccccccccccccccccccccCCCceeEEEecCCeEEEEccccccCCCcceEEEEEecCCccccee Q lcl|NC_015288. 320 LICSADVASALAMAGVLDYSSGLTGAGGPAIGTVDDTGNLAVGTINGRIKVYVDPYAANLSDKHYYVVGYKGTSPYDAGL 399 (468) Q Consensus 320 ~v~S~~va~~L~~~G~~~~~~~~~~~~~~~~~~~D~t~~~~~G~l~~~~~vy~D~Ya~~~s~~dY~~vG~Kg~~~~d~gl 399 (468) ++|+|++++-|... ++.... -.+.+. +.=-+| .+|.+. |++|+++ ++-| +++. T Consensus 120 ivv~p~~~~~Lrk~--~~~~~~-~~~~g~---~i~~~G--~iG~i~-G~~Vi~S----~~~~--------------~~~~ 172 (231) T protein:vir:73 120 LIVNPKDAAKIRKD--ANAKNI-GSEVGA---NALING--TYADVL-GAQIVRS----KKLA--------------EGSA 172 (231) T ss_pred EEEcchHHHhhhhc--cchhhh-hhhhcc---ceeeec--ccceEc-ceEEEEc----CCCC--------------CCce Confidence 99999999888431 111100 001110 111112 256653 4777777 3222 2334 Q ss_pred EEccccccccccccCCccccceeeeeeecceeecCcccccCcccccCChhhhhhccCcee----------eeEEeecc Q lcl|NC_015288. 400 FYCPYVPLQMVRSIDPNNFQPKIGFKTRYGMVSNPFVTTNGLYSGTPDGETLTPSTNMYY----------RRVQVTNL 467 (468) Q Consensus 400 fyaPYv~~~~~~~~Dp~s~qP~~g~~tRY~l~~nPf~~~~~~~~~~~~~~~~~~~~N~y~----------r~~~v~~~ 467 (468) ++++|+. -+|.++++..=++.+=+..+.+.....+ ..+.+| =++-+||+ T Consensus 173 ~~~~~i~-----------~~gAl~~~~k~~~~vEtdRd~~~k~~~i--------~~~~~y~v~l~~~~~vv~~t~~g~ 231 (231) T protein:vir:73 173 LMFKIVS-----------NSPALKLVLKRGVQVETDRDIVTKTTVI--------TADEHYAAYLYDLTKVVNITFTGV 231 (231) T ss_pred eeeeEEe-----------eccceeeeecccceeeccccccccccEE--------EEeEEEEEEEEcCccEEEEEeecC Confidence 5666643 1344555444444443333222111110 111111 13445566 No 101 >protein:vir:96762 Length: 632 # NCBI annotation: putative phage-related protein # Family: family:all:21 # MgeID: mge:1628 # MgeName: VP882 # Cross-refs: genbank:acc:YP_001039818;genbank:gi:126010917;genbank:GeneID:5076272 Probab=22.58 E-value=2.4 Score=18.47 Aligned_cols=328 Identities=11% Similarity=0.080 Sum_probs=120.7 Q ss_pred CcchHHHHHhhhhhh----------cCCccccccch-----hhhhhhhhh-------hhhHHHHHhhhhhhhhhcccccc Q lcl|NC_015288. 1 MFNAEHLQEKWSPVL----------NNEAANPIADR-----YKKAVTSVL-------LENQERFLREERGMLQEVAVNSL 58 (468) Q Consensus 1 ~~~~~~l~~kw~p~l----------~~~~~~~i~~~-----~~~~~~~~l-------lenq~~~~~e~~~~l~e~~~~~~ 58 (468) .+++ ++.++.++-- +....+.+.+. .++.....- +.++..............+.+.. T Consensus 260 ~~ra-~~ld~l~~~~~a~~~~~~a~~~~~~~~~~~~~~i~~~~re~~~~~l~rai~a~a~~~~~~a~~~~e~a~~~a~~~ 338 (632) T protein:vir:96 260 QFRA-LVLERMNPGQPGNFEKPGAGDLPGKPAIHSARDLGIQHKELQQYSLMRAINAAATGDWSKAGFEREVSLAIADAS 338 (632) T ss_pred HHHH-HHHHHHhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhhHHHHHHHHHHHHHHhhhccchhhhhhhhHHHHHHHHhh Confidence 1111 1222221100 00001111110 011110000 01110000000000000111111 Q ss_pred Cccccc---------ccccccccccccccccccceeh-hhhHHhhhhhhhhheeeeecCCccceeeeeeeeeecCCC-CC Q lcl|NC_015288. 59 GAGTVS---------PGGSALGSANTAGLAGFDPVLI-SLVRRAMPNLMAYDVCGVQPMSGPTGLIFAMRSRYENQA-GE 127 (468) Q Consensus 59 g~~~~~---------~~~~~~~st~tg~~~~~~P~Lv-~l~RRa~~~LIa~DI~GVQPmTGPTGLIFAMRsrY~~qs-G~ 127 (468) |....+ ..++...++++|+..-....+- .++.+..+..|...+ |++.+++.+|- ++ +..+. |. T Consensus 339 G~~arg~~~~~~~l~~ra~~~~t~~~gg~lvp~~~~~~~iie~lr~~s~i~~l-~~~~~~~~~g~---~~--ip~~~~~~ 412 (632) T protein:vir:96 339 GKEARGFYMPHEVLVQRQLEKKTAGKGGELVATELLSEEFIDILRNKAIIGQM-GARMLPGLVGD---VD--IPKKTSGA 412 (632) T ss_pred hhhhhhhhhhHHHHHHhhhhcccccccccccccccchHHHHHHHhhcchhhhh-cceEeecCCcc---eE--EEEEeCCc Confidence 110000 0001111111222111111111 133333356666665 66666555543 11 11110 00 Q ss_pred ccccccccccccccccccccccccccccccccCcccccccccccccccccccccccchhhhhccCCCCCccccceeEEEE Q lcl|NC_015288. 128 EALFNEPDAGFTAGLDATTGAYTPRTGAGVGGDAEGNNPALLNDSSPGTYETPRGFSREDLEQAGDAGKLFREMSFSIEK 207 (468) Q Consensus 128 EA~fnEa~t~fSG~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~a~~g~~t~~~gm~Ta~aE~lG~~g~~f~EMaFsIeK 207 (468) ++ .|- +| +..+++-..++++ T Consensus 413 ~a-------~wv------------------------------------------------~E-----~~~~~~s~~~f~~ 432 (632) T protein:vir:96 413 NF-------YWI------------------------------------------------GE-----DEDVQDSDFDFTT 432 (632) T ss_pred ee-------Eee------------------------------------------------cC-----Cccccccccceee Confidence 00 000 01 2234555566777 Q ss_pred EEEEeecccccceecHHHHHhHHHhhCCChhHHHHHHHHHHHHHHhhHHHHHHHhhhcchhhccccccceeeeeec---- Q lcl|NC_015288. 208 TSVTAKSRALKAEYTLELAQDLKAIHGLDAEQELANILSSEVLAEINREVVRRVYSVAKPGAANNVANAGIFDLDV---- 283 (468) Q Consensus 208 ~tVtAKSRaLKAEYTvELAQDLkAiHGLDAE~ELanILStEImlEINREII~~l~~vA~~~k~~~~~~~Gv~Dl~~---- 283 (468) ++..+|+=+-...+|-||..| -++|.|++|.+-|...|...+++.+|.- .-.+-.+.|++.... T Consensus 433 i~l~~~k~~~~v~iS~ell~d----s~~~~~~~i~~~l~~a~~~~~d~a~l~G--------~G~~~~p~Gi~~~~~~~~~ 500 (632) T protein:vir:96 433 LSFSPKTIAGAVPVTRKLRKQ----SSIHVENLIREDLIEGIGVALDLAMLTG--------TGLANDPVGLLNMTGVPAL 500 (632) T ss_pred EEeeeeEEEEehhhHHHHHhc----cchHHHHHHHHHHHHHHHHHHHHHhhcc--------cCCCCccceeeecccccce Confidence 777777777777888888776 2578999999999999999999998852 110112334433211 Q ss_pred --CCcc-hhHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEchhHHHHHhhccccccccccccccccccccccCCCcee Q lcl|NC_015288. 284 --DSNG-RWSVEKFKGLLFQIERDCNAIAQDTRRGKGNFLICSADVASALAMAGVLDYSSGLTGAGGPAIGTVDDTGNLA 360 (468) Q Consensus 284 --~~~~-rw~~e~~~~l~~~i~~ean~i~q~T~rg~~n~~v~S~~va~~L~~~G~~~~~~~~~~~~~~~~~~~D~t~~~~ 360 (468) +..+ .| ..+..+ + ..+............|+++...+.|...-..|.... .+.+ T Consensus 501 ~~~~~~~~~--~~i~~~-~------~~i~~~~~~~~~~~~~~~~~~~~~l~~~~l~d~~G~-------~i~~-------- 556 (632) T protein:vir:96 501 TYPAGGVDW--ASVVDM-E------TKISTFNADAGRLAYLTSVTQRGAAKKAQVFDNTGE-------RIWQ-------- 556 (632) T ss_pred ecccccCCH--HHHHHH-H------HHHhhcccccCccEEEEchhHHHHHHHHhccCCCCc-------eeec-------- Confidence 1111 12 111111 1 122222233334456788887776653322211100 1110 Q ss_pred EEEecCCeEEEEccccccCCCcceEEEEEecCCcccceeEEccccccc--cccccCCccccceeeeeeecceee-cC--c Q lcl|NC_015288. 361 VGTINGRIKVYVDPYAANLSDKHYYVVGYKGTSPYDAGLFYCPYVPLQ--MVRSIDPNNFQPKIGFKTRYGMVS-NP--F 435 (468) Q Consensus 361 ~G~l~~~~~vy~D~Ya~~~s~~dY~~vG~Kg~~~~d~glfyaPYv~~~--~~~~~Dp~s~qP~~g~~tRY~l~~-nP--f 435 (468) -|+|+ +|+|++..+. |.+=+++|--. -+|+.-+-.+. ..+..+..+.|=.+=...|+++.+ +| | T Consensus 557 ~~~l~-G~pv~~s~~i----p~~~~~~gd~s------~~~i~~~~~~~i~~~~~~~~~~~~v~~~~~~~~d~~v~~~~af 625 (632) T protein:vir:96 557 NNEVN-GYRAEASNQI----PADTWIFGDWS------QIVIAMWGVLDLKVDPYTKAASDGLVLRVFQDVDAGVRRKEAF 625 (632) T ss_pred CCeec-ccceEecccc----ccCcEEEeecc------eEEEEEecceEEEEccccccccCceEEEEEeecCceeechhhh Confidence 14554 5788877433 22222222110 01111111111 111112233444444566666543 34 3 Q ss_pred ccccCcc Q lcl|NC_015288. 436 VTTNGLY 442 (468) Q Consensus 436 ~~~~~~~ 442 (468) +..+... T Consensus 626 ~~~k~~A 632 (632) T protein:vir:96 626 CIAKKGA 632 (632) T ss_pred hheeecC Confidence 3332222 No 102 >protein:vir:1084 Length: 437 # NCBI annotation: capsid protein # Family: family:all:21 # MgeID: mge:21 # MgeName: bIL309 # Cross-refs: genbank:acc:NP_076738;genbank:gi:13095848;genbank:GeneID:920418 Probab=20.98 E-value=2.7 Score=18.24 Aligned_cols=326 Identities=11% Similarity=0.069 Sum_probs=108.5 Q ss_pred Cc--chHHHH----------HhhhhhhcCCccccccchhhhhhhhhhhhhHHHHHhhhhhhhhh-----------ccc-- Q lcl|NC_015288. 1 MF--NAEHLQ----------EKWSPVLNNEAANPIADRYKKAVTSVLLENQERFLREERGMLQE-----------VAV-- 55 (468) Q Consensus 1 ~~--~~~~l~----------~kw~p~l~~~~~~~i~~~~~~~~~~~llenq~~~~~e~~~~l~e-----------~~~-- 55 (468) .. ..+++. ++-....+.+...+..+ -.....+.+.+...+....... ... T Consensus 65 ~~~~~~e~~~~~~~~~~~e~~~~~~~~e~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 139 (437) T protein:vir:10 65 ASALKVEEKRDDSDLVAPELEENSADNEEDDPEKLKT-----ETKSEAEKDKKTVKDEEKRDAGGLQDMKLKVGGEIADK 139 (437) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----HHHHHHHHHHHHHHHHHHHhHHHHhHHHHHHHHHHHHh Confidence 00 000000 00000000000000000 0011111111111111000000 000 Q ss_pred --cccCcccccc-cccccccc-cccccccccce-ehhhhHHhhhhhhhhheeeeecCCccceeeeeeeeeecCCCCCccc Q lcl|NC_015288. 56 --NSLGAGTVSP-GGSALGSA-NTAGLAGFDPV-LISLVRRAMPNLMAYDVCGVQPMSGPTGLIFAMRSRYENQAGEEAL 130 (468) Q Consensus 56 --~~~g~~~~~~-~~~~~~st-~tg~~~~~~P~-Lv~l~RRa~~~LIa~DI~GVQPmTGPTGLIFAMRsrY~~qsG~EA~ 130 (468) ..+....... .......+ ..++.. -|. +...++.........+++.|.||+.+.+-+--.+.. .+.-++ T Consensus 140 ~~~~~~~~~~~~e~~~~~~~~~~~~g~l--vp~~~~~~i~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~----~~~~~~ 213 (437) T protein:vir:10 140 KVTAFADYLKTGEVRDVTGIALKDGKVI--IPETILTPEKEVHQFPRLGSLVRTESVTTTTGKLPIFNNS----TDLLTA 213 (437) T ss_pred hhhhhHHHHHhhhhhhhhhccccccccc--chHHHHHHHHHhhhhhhhhhcceeEeeccCceeeEEeecc----cccccc Confidence 0000000000 00000011 111110 111 111122111122345668888888776554433310 000000 Q ss_pred cccccccccccccccccccccccccccccCcccccccccccccccccccccccchhhhhccC-CCCCccccceeEEEEEE Q lcl|NC_015288. 131 FNEPDAGFTAGLDATTGAYTPRTGAGVGGDAEGNNPALLNDSSPGTYETPRGFSREDLEQAG-DAGKLFREMSFSIEKTS 209 (468) Q Consensus 131 fnEa~t~fSG~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~a~~g~~t~~~gm~Ta~aE~lG-~~g~~f~EMaFsIeK~t 209 (468) .. +.+... .+...|.++.|.+.|.. T Consensus 214 ~~------------------------------------------------------e~~~~~e~~~~~~~~v~~~~~k~~ 239 (437) T protein:vir:10 214 HT------------------------------------------------------EYGQTTKNATPVITPILWDLKTYT 239 (437) T ss_pred cc------------------------------------------------------ccccccccccccceeeeeehhhee Confidence 00 000001 01234666666666654 Q ss_pred EEeecccccceecHHHHHhHHHhhCCChhHHHHHHHHHHHHHHhhHHHHHHHhhhcchhhccccccceeeeeecCCcchh Q lcl|NC_015288. 210 VTAKSRALKAEYTLELAQDLKAIHGLDAEQELANILSSEVLAEINREVVRRVYSVAKPGAANNVANAGIFDLDVDSNGRW 289 (468) Q Consensus 210 VtAKSRaLKAEYTvELAQDLkAiHGLDAE~ELanILStEImlEINREII~~l~~vA~~~k~~~~~~~Gv~Dl~~~~~~rw 289 (468) + -..+|-||.+|- ..|.+++|.+.|+..|..-+|..||.-+-+ +....++....- T Consensus 240 ~-------~~~is~ell~ds----~~~~~~~i~~~l~~~~~~~~~~~i~~g~g~----~~~~~~~~~~~~---------- 294 (437) T protein:vir:10 240 G-------GYVFSQELISDS----SYDWQAELQSRLIELRDNTDDSLIITALTD----GIKKTTSTYLLG---------- 294 (437) T ss_pred e-------ehhhhHHHHhhh----HHHHHHHHHHHHHHHHHHHHHHHHhhhhcc----cccccccccchh---------- Confidence 3 467899999984 357888999999999999999998875432 111111111110 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEchhHHHHHhhccccccccccccccccccccccCCCceeEEEecCCeE Q lcl|NC_015288. 290 SVEKFKGLLFQIERDCNAIAQDTRRGKGNFLICSADVASALAMAGVLDYSSGLTGAGGPAIGTVDDTGNLAVGTINGRIK 369 (468) Q Consensus 290 ~~e~~~~l~~~i~~ean~i~q~T~rg~~n~~v~S~~va~~L~~~G~~~~~~~~~~~~~~~~~~~D~t~~~~~G~l~~~~~ 369 (468) + +..+ + .-..... -+..+ .+|+++.....|..- ..+.+ ..+...+-+.. .-++|.| ++ T Consensus 295 --~-~~~~-~--~~~l~~~----~~~~~-~~~~~~~~~~~l~~l---kd~~g------~~~~~~~~~~~-~~~~l~G-~p 352 (437) T protein:vir:10 295 --D-LKKV-L--NVTLKPQ----DSAAA-SIVMSQSAYNLFDMA---TDAMG------RPLLQPNVTAA-TGYTLLG-KT 352 (437) T ss_pred --h-HHHH-H--Hhhhhhh----hhcCC-EEEEcHHHHHHHHHh---hccCC------CeeeccCccCC-CCccccc-ce Confidence 1 1111 1 1000010 11223 468899888877432 11111 00111111111 1246655 55 Q ss_pred EEEcccc--ccCCCcceEEEEEecCCcccceeEEccccc---------cccccccCCccccceeeeeeeccee-ecC--c Q lcl|NC_015288. 370 VYVDPYA--ANLSDKHYYVVGYKGTSPYDAGLFYCPYVP---------LQMVRSIDPNNFQPKIGFKTRYGMV-SNP--F 435 (468) Q Consensus 370 vy~D~Ya--~~~s~~dY~~vG~Kg~~~~d~glfyaPYv~---------~~~~~~~Dp~s~qP~~g~~tRY~l~-~nP--f 435 (468) |++...+ .+...-++ .+||+.+-. ..+...-+-+.++..+.+..||+.. ++| | T Consensus 353 v~~~~~~~~~~~~~~~~-------------~~~~gd~~~~~~~~~r~~~~~~~~~~~~~~~~~~~~~~r~d~~~~~~~a~ 419 (437) T protein:vir:10 353 VVIVDDKLFPSASAGDV-------------NIVVAPLKKAVINFKLTEITGQFQDTYDIWYKQLGIFLRQNVVQASKDLI 419 (437) T ss_pred eEEecccccCCcCCCce-------------EEEEeeccccEEEEeeeceEEEEecccccccceeeEEEEEccEEecccce Confidence 5542111 01111111 122222211 0011111234455566666788653 445 4 Q ss_pred cccc--CcccccCChhhh Q lcl|NC_015288. 436 VTTN--GLYSGTPDGETL 451 (468) Q Consensus 436 ~~~~--~~~~~~~~~~~~ 451 (468) +... ....-+...... T Consensus 420 ~~l~~~~~~~~~~~~~~~ 437 (437) T protein:vir:10 420 VNLTGKLKAVTVVQSTAV 437 (437) T ss_pred EEEEeeccccccCCCCCC Confidence 4221 111101111111 No 103 >protein:vir:95107 Length: 270 # NCBI annotation: ORF013 # Family: family:all:522 # MgeID: mge:1549 # MgeName: X2 # Cross-refs: genbank:acc:YP_240822;genbank:gi:66394683;genbank:GeneID:5133901 Probab=20.24 E-value=2.8 Score=18.13 Aligned_cols=262 Identities=13% Similarity=-0.003 Sum_probs=108.9 Q ss_pred eee-eecCCCCCccccccccccccccccc-cccccccccccccccCcccccccccccccccccccccccchhhhhccCCC Q lcl|NC_015288. 117 MRS-RYENQAGEEALFNEPDAGFTAGLDA-TTGAYTPRTGAGVGGDAEGNNPALLNDSSPGTYETPRGFSREDLEQAGDA 194 (468) Q Consensus 117 MRs-rY~~qsG~EA~fnEa~t~fSG~~~~-~~~~~~~~~~~~~~~~~~g~~~~~~~~a~~g~~t~~~gm~Ta~aE~lG~~ 194 (468) |=- +..+ -...|..+.+=..... ...... .+.....+......+.+.+.--...++|.+.++ T Consensus 1 Ma~T~~~d-----~I~Pev~~~~V~e~~~~~~~~~~-----------~~~~d~~L~g~~G~ti~~P~~~~igdae~~~eg 64 (270) T protein:vir:95 1 MTQTKKAN-----LINPEVLANVVSAQMQNAIRFTP-----------YAVTDDTLVGQPGDTITRPKYAYIGAAEDLQEG 64 (270) T ss_pred CCceehhh-----hcchHHHHHHHHHHHHhHHhhcc-----------ccccccccCCCCCCEEEeeeecCCCccccccCC Confidence 210 0000 0011111000000000 000000 000000000000111111111123344545432 Q ss_pred C-CccccceeEEEEEEEEeecccccceecHHHHHhHHHhh-CCChhHHHHHHHHHHHHHHhhHHHHHHHhhhcchhhccc Q lcl|NC_015288. 195 G-KLFREMSFSIEKTSVTAKSRALKAEYTLELAQDLKAIH-GLDAEQELANILSSEVLAEINREVVRRVYSVAKPGAANN 272 (468) Q Consensus 195 g-~~f~EMaFsIeK~tVtAKSRaLKAEYTvELAQDLkAiH-GLDAE~ELanILStEImlEINREII~~l~~vA~~~k~~~ 272 (468) . -+..++ +..+.+++.|-|.-.-++| ||.+.- |-|.-.|..+-++.-|+.+++.++|..|-.+...- T Consensus 65 ~~i~~~~l--t~~~~~a~i~~~gk~~~it-----D~a~~~~~~dp~~~~~~q~a~~~a~~~d~~li~~l~~a~~~~---- 133 (270) T protein:vir:95 65 VAMDTTQM--SMTTTKVTVKETGKAVEVT-----QTAIITNVNGTLQEASRQLAMSLADKVEIDYIAELNKSKQTA---- 133 (270) T ss_pred Cccchhhc--ccchheeeeehhhCcceec-----HHHHhhhccchHHHHHHHHHHHHHHHHHHHHHHHhccccccc---- Confidence 2 234444 4556667778887555555 444422 46999999999999999999999998776543211 Q ss_pred cccceeeeeecCCcchhHHHHHHHHHHHHHHHHHHHHHHhccCCccEEEEchhHHHHHhhcccccccccccccccccccc Q lcl|NC_015288. 273 VANAGIFDLDVDSNGRWSVEKFKGLLFQIERDCNAIAQDTRRGKGNFLICSADVASALAMAGVLDYSSGLTGAGGPAIGT 352 (468) Q Consensus 273 ~~~~Gv~Dl~~~~~~rw~~e~~~~l~~~i~~ean~i~q~T~rg~~n~~v~S~~va~~L~~~G~~~~~~~~~~~~~~~~~~ 352 (468) ...++ .+.+-.-+..+.- .-..-++++|.|++++.|....|.++...-+ + T Consensus 134 ---~~~~t----------~~~~~dA~~~lgd---------~~~~~~~i~vhs~~~~~Lrk~~~~~~~~~~~--------~ 183 (270) T protein:vir:95 134 ---TVSAD----------ATGILDAIEVFNS---------ENDEDYVLYVNPKDYNKLVKSLFKVGGNVQD--------R 183 (270) T ss_pred ---ccccC----------HHHHHHHHHHhcc---------ccCCCcEEEEcHHHHHHHHhhhccccccccc--------c Confidence 01111 1222111122221 2345678999999999997766665431100 1 Q ss_pred ccCCCceeEEEecCCeEEEEccccccCCCcceEEEEEe-cCCcccceeEEccccccccccccCCccccceeeeeeeccee Q lcl|NC_015288. 353 VDDTGNLAVGTINGRIKVYVDPYAANLSDKHYYVVGYK-GTSPYDAGLFYCPYVPLQMVRSIDPNNFQPKIGFKTRYGMV 431 (468) Q Consensus 353 ~D~t~~~~~G~l~~~~~vy~D~Ya~~~s~~dY~~vG~K-g~~~~d~glfyaPYv~~~~~~~~Dp~s~qP~~g~~tRY~l~ 431 (468) .--+| ..|++. |++|++| ++.+.+|-..-+| |+-.+ +.-=.|=+| .-| |+..++-.+--..+|++. T Consensus 184 ~~~~G--~ig~~~-G~~Viv~----s~~~~~~~~~l~~~gAi~~--~~~~~~~vE--tdR--d~~~~~d~i~~~~~y~v~ 250 (270) T protein:vir:95 184 AISKG--DLVEIV-GVSDIVK----SKRVSENTAFLQRYGAMEI--VNKKKPEAY--TDF--DILKRTHLLSTNYHYSVN 250 (270) T ss_pred hhccc--ccceec-ceeEEEe----CCCCCceeEEEEeccceee--eecCCceee--ecc--chhhcccEEEeeeEEEEE Confidence 11111 357764 5899888 4445555333333 11110 000000011 111 555666555555566553 Q ss_pred e-cC--cccccCcccccCChhhhhhccCc Q lcl|NC_015288. 432 S-NP--FVTTNGLYSGTPDGETLTPSTNM 457 (468) Q Consensus 432 ~-nP--f~~~~~~~~~~~~~~~~~~~~N~ 457 (468) . || ....+-.+++.-+ | T Consensus 251 ~~~~skvv~~t~~~a~~~~---------~ 270 (270) T protein:vir:95 251 LKDETGVVKVTFKPSGSLE---------M 270 (270) T ss_pred EEccceEEEEEecCCCCcC---------C Confidence 3 22 0111111121111 1 Done!